CsaV3_6G048410 (gene) Cucumber (Chinese Long) v3

Name: CsaV3_6G048410
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v3)
Description: Pentatricopeptide repeat-containing protein
Location: chr6 : 28395757 .. 28397469 (+)
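
The location field gives a chromosome, a start and end coordinate, and a strand. A minimal sketch of reading it and deriving the span length follows; it assumes the coordinates are 1-based and inclusive (the usual genome-browser convention, not stated on this page), and `parse_location` is a hypothetical helper name rather than part of any database API.

```python
import re

def parse_location(text):
    """Parse a 'chr : start .. end (strand)' string (hypothetical helper)."""
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("chr6 : 28395757 .. 28397469 (+)")
print(loc["chrom"], loc["strand"], loc["length"])  # chr6 + 1713
```

Under that assumption the feature spans 1713 bp on the forward strand of chr6.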
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTCAAATGTGCCTCTCCACTCACTCGCTCTTCTTTCTCCGCCGTGATTCATCTCATTTCCACGTCCATCGTACCCACTCAACTCCCCACTGTTCATCTCCCCCGCCGCCACAAATCTGAATTCGCCTCTCCCAGGTTCCTCTTTTTCTTCTTCATTTCTTAATTCAATGTTTAATCTTATATGCTCCCCGGCACAGTTCGTCTCTTTTACATTTCATAATTTCCTGCTTCGTACCGATATCCACATTTAATTTATCTGGCAATGCTTCCTTTTCAATTTAAGGTTTTGCATTTTTCTTGGTAATACTCTGCTTTGTTGTATAAAGAGTCCTCTTTTATTTCTCTTTAATTTGACGAACTTAAGTACTATCTCTTAGAAATTCCCAGCTTTCAGGTCAAGCCACACCACAAAGATACCTCATCCTGGGATAAAACGCTTAGGGGGCTATGTTTAACTGGGAAATTGGCGGAAGCTGTTGCACTTTTGTGCTGTATGGCCTTGCAATTTCACTCCAAAACTTACTGTCTTCTGCTACAAGAATGCATTTTCAGGAAAGAGTATATGAAAGGAAAAAGAATCCATGCCCAAATGGTTGTTGTTGGATATGTACCCAATGAATATCTCAACACCAAACTATTGATTTTATATGCCAAATCAGGTGACTTAGAGACTGCATACGTCCTTCATGAACATTTGTTGGAGAAAAGCCTGGTTTCATGGAATTCATTGATTGCCGGGTATGTACAGAAGGGGCTTGCTGAAGTTGGATTGGAGTTTTATTTAAAGATGAGACAAAGCGGTTTAATGCCTGATCAGTATACTTTTGCATCAGTTTTAAGAGCTTGTGCTAGTTTAGCTTCTCTGGAACACGGAAAGAGAGCACATGGAGTTTTGATTAAATGTCAAATTGGTGATAATGTTGTTGTGTCTAGTGCACTTGTAGATATGTACTTCAAATGCAGTAGCTTATCTGACGGGCATAAAGCATTTAACAAATCTTCAAATAGGAATGTAATCACTTGGACTGCTTTGATATCTGGGTATGGACAGCATGGAAGAATTTCTGAAGTTTTGGAATCATTTCATAGTATGATAAACAAAGGCTACCGGCCGAATTACGTCACTTTCCTTGCAGTTCTTGCTGCTTGTAGTCGTGGAGGCTTTGTGTCAGAGGCGTGGAATTACTTTTCTTTGATGACAAAGACCTATGGAATTGAACCAAGAGGGCAACATTATGCTGCCATGGCTGATCTTCTTGCCAGAGCTGGGAGATTGCAAGAAGCCTATGACTTTGTTCTTGATGCACCTTGCAAGGAGCACTCTGTTATGTGGGGTGCTTTGGTTGGAGCTTGTAAAGTTCATGAAGATGTAGATTTGATGAAACATGTTGCAGCAAGTTACTTCGAATTGGATCCTAAAAACTCAGGAAAGTTGGTTGTTTTCTCAAATGCTTTTGCCACATCCGGGTTGTGGGACAATGTTGAAGAGATAAGAGCTATGATGAAGAAATCAGGAATGAGTAAAGATCCTGGTTGCAGCAGAATAGAAATTCAAAGGGAATTCCATATTTTTGTCAAGGGCGATAAATCTCACAGAGAAACTGAGGAGATTTATAGAACCATTGACAGAATAACTCCAATTTTGAAGGATGCAGGATATATTCCTGAACTATGTGAAAAAACAGTGATAGAGGGATTAAGTTGGTGCTGA

mRNA sequence

ATGCTCAAATGTGCCTCTCCACTCACTCGCTCTTCTTTCTCCGCCGTGATTCATCTCATTTCCACGTCCATCGTACCCACTCAACTCCCCACTGTTCATCTCCCCCGCCGCCACAAATCTGAATTCGCCTCTCCCAGCTTTCAGGTCAAGCCACACCACAAAGATACCTCATCCTGGGATAAAACGCTTAGGGGGCTATGTTTAACTGGGAAATTGGCGGAAGCTGTTGCACTTTTGTGCTGTATGGCCTTGCAATTTCACTCCAAAACTTACTGTCTTCTGCTACAAGAATGCATTTTCAGGAAAGAGTATATGAAAGGAAAAAGAATCCATGCCCAAATGGTTGTTGTTGGATATGTACCCAATGAATATCTCAACACCAAACTATTGATTTTATATGCCAAATCAGGTGACTTAGAGACTGCATACGTCCTTCATGAACATTTGTTGGAGAAAAGCCTGGTTTCATGGAATTCATTGATTGCCGGGTATGTACAGAAGGGGCTTGCTGAAGTTGGATTGGAGTTTTATTTAAAGATGAGACAAAGCGGTTTAATGCCTGATCAGTATACTTTTGCATCAGTTTTAAGAGCTTGTGCTAGTTTAGCTTCTCTGGAACACGGAAAGAGAGCACATGGAGTTTTGATTAAATGTCAAATTGGTGATAATGTTGTTGTGTCTAGTGCACTTGTAGATATGTACTTCAAATGCAGTAGCTTATCTGACGGGCATAAAGCATTTAACAAATCTTCAAATAGGAATGTAATCACTTGGACTGCTTTGATATCTGGGTATGGACAGCATGGAAGAATTTCTGAAGTTTTGGAATCATTTCATAGTATGATAAACAAAGGCTACCGGCCGAATTACGTCACTTTCCTTGCAGTTCTTGCTGCTTGTAGTCGTGGAGGCTTTGTGTCAGAGGCGTGGAATTACTTTTCTTTGATGACAAAGACCTATGGAATTGAACCAAGAGGGCAACATTATGCTGCCATGGCTGATCTTCTTGCCAGAGCTGGGAGATTGCAAGAAGCCTATGACTTTGTTCTTGATGCACCTTGCAAGGAGCACTCTGTTATGTGGGGTGCTTTGGTTGGAGCTTGTAAAGTTCATGAAGATGTAGATTTGATGAAACATGTTGCAGCAAGTTACTTCGAATTGGATCCTAAAAACTCAGGAAAGTTGGTTGTTTTCTCAAATGCTTTTGCCACATCCGGGTTGTGGGACAATGTTGAAGAGATAAGAGCTATGATGAAGAAATCAGGAATGAGTAAAGATCCTGGTTGCAGCAGAATAGAAATTCAAAGGGAATTCCATATTTTTGTCAAGGGCGATAAATCTCACAGAGAAACTGAGGAGATTTATAGAACCATTGACAGAATAACTCCAATTTTGAAGGATGCAGGATATATTCCTGAACTATGTGAAAAAACAGTGATAGAGGGATTAAGTTGGTGCTGA

Coding sequence (CDS)

ATGCTCAAATGTGCCTCTCCACTCACTCGCTCTTCTTTCTCCGCCGTGATTCATCTCATTTCCACGTCCATCGTACCCACTCAACTCCCCACTGTTCATCTCCCCCGCCGCCACAAATCTGAATTCGCCTCTCCCAGCTTTCAGGTCAAGCCACACCACAAAGATACCTCATCCTGGGATAAAACGCTTAGGGGGCTATGTTTAACTGGGAAATTGGCGGAAGCTGTTGCACTTTTGTGCTGTATGGCCTTGCAATTTCACTCCAAAACTTACTGTCTTCTGCTACAAGAATGCATTTTCAGGAAAGAGTATATGAAAGGAAAAAGAATCCATGCCCAAATGGTTGTTGTTGGATATGTACCCAATGAATATCTCAACACCAAACTATTGATTTTATATGCCAAATCAGGTGACTTAGAGACTGCATACGTCCTTCATGAACATTTGTTGGAGAAAAGCCTGGTTTCATGGAATTCATTGATTGCCGGGTATGTACAGAAGGGGCTTGCTGAAGTTGGATTGGAGTTTTATTTAAAGATGAGACAAAGCGGTTTAATGCCTGATCAGTATACTTTTGCATCAGTTTTAAGAGCTTGTGCTAGTTTAGCTTCTCTGGAACACGGAAAGAGAGCACATGGAGTTTTGATTAAATGTCAAATTGGTGATAATGTTGTTGTGTCTAGTGCACTTGTAGATATGTACTTCAAATGCAGTAGCTTATCTGACGGGCATAAAGCATTTAACAAATCTTCAAATAGGAATGTAATCACTTGGACTGCTTTGATATCTGGGTATGGACAGCATGGAAGAATTTCTGAAGTTTTGGAATCATTTCATAGTATGATAAACAAAGGCTACCGGCCGAATTACGTCACTTTCCTTGCAGTTCTTGCTGCTTGTAGTCGTGGAGGCTTTGTGTCAGAGGCGTGGAATTACTTTTCTTTGATGACAAAGACCTATGGAATTGAACCAAGAGGGCAACATTATGCTGCCATGGCTGATCTTCTTGCCAGAGCTGGGAGATTGCAAGAAGCCTATGACTTTGTTCTTGATGCACCTTGCAAGGAGCACTCTGTTATGTGGGGTGCTTTGGTTGGAGCTTGTAAAGTTCATGAAGATGTAGATTTGATGAAACATGTTGCAGCAAGTTACTTCGAATTGGATCCTAAAAACTCAGGAAAGTTGGTTGTTTTCTCAAATGCTTTTGCCACATCCGGGTTGTGGGACAATGTTGAAGAGATAAGAGCTATGATGAAGAAATCAGGAATGAGTAAAGATCCTGGTTGCAGCAGAATAGAAATTCAAAGGGAATTCCATATTTTTGTCAAGGGCGATAAATCTCACAGAGAAACTGAGGAGATTTATAGAACCATTGACAGAATAACTCCAATTTTGAAGGATGCAGGATATATTCCTGAACTATGTGAAAAAACAGTGATAGAGGGATTAAGTTGGTGCTGA

Protein sequence

MLKCASPLTRSSFSAVIHLISTSIVPTQLPTVHLPRRHKSEFASPSFQVKPHHKDTSSWDKTLRGLCLTGKLAEAVALLCCMALQFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGYVPNEYLNTKLLILYAKSGDLETAYVLHEHLLEKSLVSWNSLIAGYVQKGLAEVGLEFYLKMRQSGLMPDQYTFASVLRACASLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSLSDGHKAFNKSSNRNVITWTALISGYGQHGRISEVLESFHSMINKGYRPNYVTFLAVLAACSRGGFVSEAWNYFSLMTKTYGIEPRGQHYAAMADLLARAGRLQEAYDFVLDAPCKEHSVMWGALVGACKVHEDVDLMKHVAASYFELDPKNSGKLVVFSNAFATSGLWDNVEEIRAMMKKSGMSKDPGCSRIEIQREFHIFVKGDKSHRETEEIYRTIDRITPILKDAGYIPELCEKTVIEGLSWC
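
The protein sequence is the conceptual translation of the coding sequence listed above it. A minimal sketch of that translation, assuming the standard genetic code and a reading frame starting at the first base; `translate` is an illustrative helper, not the tool that produced this record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First 24 bases and last 12 bases of the CDS shown above.
print(translate("ATGCTCAAATGTGCCTCTCCACTC"))  # MLKCASPL
print(translate("AGTTGGTGCTGA"))              # SWC
```

The two fragments reproduce the first eight and last three residues of the protein sequence; passing the full CDS string to `translate` would give the whole polypeptide.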
BLAST of CsaV3_6G048410 vs. NCBI nr
Match: XP_004134800.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g16470 [Cucumis sativus] >KGN49049.1 hypothetical protein Csa_6G511640 [Cucumis sativus])

HSP 1 Score: 895.6 bits (2313), Expect = 7.6e-257
Identity = 486/486 (100.00%), Positives = 486/486 (100.00%), Query Frame = 0

Query: 1   MLKCASPLTRSSFSAVIHLISTSIVPTQLPTVHLPRRHKSEFASPSFQVKPHHKDTSSWD 60
           MLKCASPLTRSSFSAVIHLISTSIVPTQLPTVHLPRRHKSEFASPSFQVKPHHKDTSSWD
Sbjct: 1   MLKCASPLTRSSFSAVIHLISTSIVPTQLPTVHLPRRHKSEFASPSFQVKPHHKDTSSWD 60

Query: 61  KTLRGLCLTGKLAEAVALLCCMALQFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGYV 120
           KTLRGLCLTGKLAEAVALLCCMALQFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGYV
Sbjct: 61  KTLRGLCLTGKLAEAVALLCCMALQFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGYV 120

Query: 121 PNEYLNTKLLILYAKSGDLETAYVLHEHLLEKSLVSWNSLIAGYVQKGLAEVGLEFYLKM 180
           PNEYLNTKLLILYAKSGDLETAYVLHEHLLEKSLVSWNSLIAGYVQKGLAEVGLEFYLKM
Sbjct: 121 PNEYLNTKLLILYAKSGDLETAYVLHEHLLEKSLVSWNSLIAGYVQKGLAEVGLEFYLKM 180

Query: 181 RQSGLMPDQYTFASVLRACASLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSL 240
           RQSGLMPDQYTFASVLRACASLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSL
Sbjct: 181 RQSGLMPDQYTFASVLRACASLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSL 240

Query: 241 SDGHKAFNKSSNRNVITWTALISGYGQHGRISEVLEXXXXXXXXXXXXXXXXXXXXXXXX 300
           SDGHKAFNKSSNRNVITWTALISGYGQHGRISEVLEXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 241 SDGHKAFNKSSNRNVITWTALISGYGQHGRISEVLEXXXXXXXXXXXXXXXXXXXXXXXX 300

Query: 301 XXXXXXXXXXXXXSLMTKTYGIEPRGQHYAAMADLLARAGRLQEAYDFVLDAPCKEHSVM 360
           XXXXXXXXXXXXXSLMTKTYGIEPRGQHYAAMADLLARAGRLQEAYDFVLDAPCKEHSVM
Sbjct: 301 XXXXXXXXXXXXXSLMTKTYGIEPRGQHYAAMADLLARAGRLQEAYDFVLDAPCKEHSVM 360

Query: 361 WGALVGACKVHEDVDLMKHVAASYFELDPKNSGKLVVFSNAFATSGLWDNVEEIRAMMKK 420
           WGALVGACKVHEDVDLMKHVAASYFELDPKNSGKLVVFSNAFATSGLWDNVEEIRAMMKK
Sbjct: 361 WGALVGACKVHEDVDLMKHVAASYFELDPKNSGKLVVFSNAFATSGLWDNVEEIRAMMKK 420

Query: 421 SGMSKDPGCSRIEIQREFHIFVKGDKSHRETEEIYRTIDRITPILKDAGYIPELCEKTVI 480
           SGMSKDPGCSRIEIQREFHIFVKGDKSHRETEEIYRTIDRITPILKDAGYIPELCEKTVI
Sbjct: 421 SGMSKDPGCSRIEIQREFHIFVKGDKSHRETEEIYRTIDRITPILKDAGYIPELCEKTVI 480

Query: 481 EGLSWC 487
           EGLSWC
Sbjct: 481 EGLSWC 486
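
Each HSP line reports a bit score followed by the raw alignment score in parentheses. In Karlin-Altschul statistics the two are related by S' = (lambda * S - ln K) / ln 2. The sketch below assumes lambda = 0.267 and K = 0.041, the values commonly quoted for gapped BLOSUM62 searches with default gap penalties; the page does not state which parameters were used, so treat them as an assumption.

```python
import math

# Assumed Karlin-Altschul parameters (gapped BLOSUM62, default gap costs).
LAMBDA = 0.267
K = 0.041

def bit_score(raw_score):
    """Convert a raw alignment score to a bit score."""
    return (LAMBDA * raw_score - math.log(K)) / math.log(2)

for raw in (2313, 1158, 641):
    print(raw, round(bit_score(raw), 1))
```

With these assumed parameters the raw scores 2313, 1158 and 641 come out at about 895.6, 450.7 and 251.5 bits, in line with the values reported in the hit summaries.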

BLAST of CsaV3_6G048410 vs. NCBI nr
Match: XP_008440079.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g16470 [Cucumis melo])

HSP 1 Score: 824.7 bits (2129), Expect = 1.6e-235
Identity = 414/477 (86.79%), Positives = 425/477 (89.10%), Query Frame = 0

Query: 1   MLKCASPLTRSSFSAVIHLISTSIVPTQLPTVHLPRRHKSEFASPSFQVKPHHKDTSSWD 60
           MLKCASPLTRSSFSAVIHL + SIV TQ P VH PRRH+SE A PSFQVK HHKD SSWD
Sbjct: 1   MLKCASPLTRSSFSAVIHLFTKSIVATQFPIVHFPRRHESESACPSFQVKRHHKDNSSWD 60

Query: 61  KTLRGLCLTGKLAEAVALLCCMALQFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGYV 120
           +TLRGLCLTGKLAEAVALLCCMALQF SKTYCLLLQECIFRKEYMKGKRIHAQMVVVGYV
Sbjct: 61  ETLRGLCLTGKLAEAVALLCCMALQFQSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGYV 120

Query: 121 PNEYLNTKLLILYAKSGDLETAYVLHEHLLEKSLVSWNSLIAGYVQKGLAEVGLEFYLKM 180
           PNEYLNTKLLILYAKSGDLETAY+LHEHLLEKSLVSWNSLIAGYVQKGLAEVGLEFYLKM
Sbjct: 121 PNEYLNTKLLILYAKSGDLETAYILHEHLLEKSLVSWNSLIAGYVQKGLAEVGLEFYLKM 180

Query: 181 RQSGLMPDQYTFASVLRACASLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSL 240
           RQSGLMPDQYTFASVLRACASLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSL
Sbjct: 181 RQSGLMPDQYTFASVLRACASLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSL 240

Query: 241 SDGHKAFNKSSNRNVITWTALISGYGQHGRISEVLEXXXXXXXXXXXXXXXXXXXXXXXX 300
           SDGHKAFNKS NRNVITWTALISGYGQHGR+SEVLE                        
Sbjct: 241 SDGHKAFNKSINRNVITWTALISGYGQHGRVSEVLESFHSMIKEGYRPNYVTFLVVLAAC 300

Query: 301 XXXXXXXXXXXXXSLMTKTYGIEPRGQHYAAMADLLARAGRLQEAYDFVLDAPCKEHSVM 360
                        SLMTKTY IEPRGQHYAAMADLLARAGRL+EAY+FVLDAPCKE+SVM
Sbjct: 301 SRGGFVSEAWRYFSLMTKTYEIEPRGQHYAAMADLLARAGRLREAYNFVLDAPCKENSVM 360

Query: 361 WGALVGACKVHEDVDLMKHVAASYFELDPKNSGKLVVFSNAFATSGLWDNVEEIRAMMKK 420
           WGALVGACKVHED+DLMKHVAASYFELDP+NSGKLVVFSNAFATSGLWDNVEEIRAMMKK
Sbjct: 361 WGALVGACKVHEDIDLMKHVAASYFELDPENSGKLVVFSNAFATSGLWDNVEEIRAMMKK 420

Query: 421 SGMSKDPGCSRIEIQREFHIFVKGDKSHRETEEIYRTIDRITPILKDAGYIPELCEK 478
           SGMSKDPGCS+IEIQREFHIFVK DKSHRETEEIYRTIDRITPILKDAGY PELCEK
Sbjct: 421 SGMSKDPGCSKIEIQREFHIFVKNDKSHRETEEIYRTIDRITPILKDAGYTPELCEK 477
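
The percentages in each HSP summary are the match counts divided by the alignment length. A short sketch that parses a summary line and recomputes both percentages; the regular expression and the helper name `parse_hsp_summary` are illustrative, and the pattern deliberately accepts any spelling of the second field that starts with "Pos".

```python
import re

SUMMARY = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Pos\w+ = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_summary(line):
    """Return counts plus recomputed percentages for one HSP summary line."""
    m = SUMMARY.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, length, _, pos, pos_length, _ = m.groups()
    ident, length, pos, pos_length = map(int, (ident, length, pos, pos_length))
    return {
        "identities": ident,
        "positives": pos,
        "length": length,
        "identity_pct": f"{100 * ident / length:.2f}",
        "positive_pct": f"{100 * pos / pos_length:.2f}",
    }

hsp = parse_hsp_summary(
    "Identity = 414/477 (86.79%), Positives = 425/477 (89.10%), Query Frame = 0"
)
print(hsp["identity_pct"], hsp["positive_pct"])  # 86.79 89.10
```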

BLAST of CsaV3_6G048410 vs. NCBI nr
Match: XP_023003064.1 (pentatricopeptide repeat-containing protein At4g16470 [Cucurbita maxima])

HSP 1 Score: 681.4 bits (1757), Expect = 2.2e-192
Identity = 366/467 (78.37%), Positives = 396/467 (84.80%), Query Frame = 0

Query: 10  RSSFSAVIHLISTSIVPTQLPTVHLPRRHKSEFASPSFQVKPHHKDTSSWDKTLRGLCLT 69
           RSS S VIHL + SIV     T+   RRHKSE+A+   QVKPH KD+SSWD+T R LC+T
Sbjct: 6   RSSSSGVIHLFTKSIVAGATATIR--RRHKSEYANDRSQVKPHQKDSSSWDRTFRSLCIT 65

Query: 70  GKLAEAVALLCCMALQFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGYVPNEYLNTKL 129
           G+L+EAVALLC M  QFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVG++PNEYL TKL
Sbjct: 66  GRLSEAVALLCSMPFQFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNEYLKTKL 125

Query: 130 LILYAKSGDLETAYVLHEHLLEKSLVSWNSLIAGYVQKGLAEVGLEFYLKMRQSGLMPDQ 189
           LILYAK GDLETAY+LHE LL+ SLVSWN+LIAG VQKGL EVGLE Y KMR++GL+PDQ
Sbjct: 126 LILYAKLGDLETAYILHEKLLDNSLVSWNALIAGCVQKGLGEVGLELYFKMRRTGLIPDQ 185

Query: 190 YTFASVLRACASLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSLSDGHKAFNK 249
           YTFASV+RACASLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSS+SDGHK F+K
Sbjct: 186 YTFASVIRACASLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSISDGHKVFDK 245

Query: 250 SSNRNVITWTALISGYGQHGRISEVLEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 309
           SS RNVITWTALISGYG HGR+SEVLE          XXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 246 SSTRNVITWTALISGYGHHGRVSEVLESFNSMINEGYXXXXXXXXXXXXXXXXXXXXXXX 305

Query: 310 XXXXSLMTKTYGIEPRGQHYAAMADLLARAGRLQEAYDFVLDAPCKEHSVMWGALVGACK 369
               SLM  TY IEPRGQHYAAMADLLARAGRLQEAYDFV+DAPCKEHSV+WGALVG CK
Sbjct: 306 WRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYDFVVDAPCKEHSVIWGALVGGCK 365

Query: 370 VHEDVDLMKHVAASYFELDPKNSGKLVVFSNAFATSGLWDNVEEIRAMMKKSGMSKDPGC 429
           VHED+DLMKH AA Y  LD  N+GK VV +N FA SGLWDNV EIR MMKKSGM+K+PG 
Sbjct: 366 VHEDIDLMKHAAAHYLALDASNAGKYVVLANGFAASGLWDNVAEIRCMMKKSGMNKEPGY 425

Query: 430 SRIEIQREFHIFVKGDKSHRETEEIYRTIDRITPILKDAGYIPELCE 477
           SRIEIQREFH FVK DKSH++  EIYRTI  ITPILKDAG IPEL E
Sbjct: 426 SRIEIQREFHFFVKSDKSHKQAVEIYRTIHSITPILKDAGSIPELSE 470

BLAST of CsaV3_6G048410 vs. NCBI nr
Match: XP_022926503.1 (pentatricopeptide repeat-containing protein At4g16470 [Cucurbita moschata])

HSP 1 Score: 677.9 bits (1748), Expect = 2.5e-191
Identity = 371/470 (78.94%), Positives = 403/470 (85.74%), Query Frame = 0

Query: 10  RSSFSAVIHLISTSIVPTQLPTVHLPRRHKSEFASPSFQVKPHHKDTSSWDKTLRGLCLT 69
           R S S VIHL + SIV     T+   RRHKSE+A+  FQVKPH KD+SSWD+T R LC+T
Sbjct: 6   RPSSSGVIHLFTKSIVAGATATIR--RRHKSEYANDRFQVKPHQKDSSSWDRTFRSLCIT 65

Query: 70  GKLAEAVALLCCMALQFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGYVPNEYLNTKL 129
           G+L EAVALLCCM  QFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVG++PNEYL TKL
Sbjct: 66  GRLTEAVALLCCMPFQFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNEYLKTKL 125

Query: 130 LILYAKSGDLETAYVLHEHLLEKSLVSWNSLIAGYVQKGLAEVGLEFYLKMRQSGLMPDQ 189
           LILYAK GDLETA +LHE LLE SLVSWN+LIAGYVQKG  EVGLE Y KMR++GL+PDQ
Sbjct: 126 LILYAKLGDLETANILHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLIPDQ 185

Query: 190 YTFASVLRACASLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSLSDGHKAFNK 249
           YTFASV RACASLASLEHGKRAHGVLIKC+IGDNVVVSSALVDMYFKCSS+ DGHK FNK
Sbjct: 186 YTFASVFRACASLASLEHGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSILDGHKVFNK 245

Query: 250 SSNRNVITWTALISGYGQHGRISEVLEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 309
           S+ RNVITWTALISGYG HGR+SEVLE   XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 246 STTRNVITWTALISGYGHHGRVSEVLESFNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 305

Query: 310 XXXXSLMTKTYGIEPRGQHYAAMADLLARAGRLQEAYDFVLDAPCKEHSVMWGALVGACK 369
           X   SLM  TY IEPRGQHYAAMADLLARAGRLQEAYDFV+DAPCKEH+V+WGALVG CK
Sbjct: 306 XRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYDFVVDAPCKEHAVIWGALVGGCK 365

Query: 370 VHEDVDLMKHVAASYFELDPKNSGKLVVFSNAFATSGLWDNVEEIRAMMKKSGMSKDPGC 429
           VHED+DLMKH AA+Y  LD  N+GK VV +N FA SGLWDNV EIR MMKKSGM+K+PG 
Sbjct: 366 VHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASGLWDNVAEIRGMMKKSGMNKEPGY 425

Query: 430 SRIEIQREFHIFVKGDKSHRETEEIYRTIDRITPILKDAGYIPELCEKTV 480
           SRIEIQREFH FVK DKSH++ EEIYRTI  IT ILKDAG I EL E ++
Sbjct: 426 SRIEIQREFHFFVKSDKSHKQAEEIYRTIHSITAILKDAGSIRELSENSL 473

BLAST of CsaV3_6G048410 vs. NCBI nr
Match: XP_023518629.1 (pentatricopeptide repeat-containing protein At4g16470 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 676.0 bits (1743), Expect = 9.4e-191
Identity = 367/467 (78.59%), Positives = 398/467 (85.22%), Query Frame = 0

Query: 10  RSSFSAVIHLISTSIVPTQLPTVHLPRRHKSEFASPSFQVKPHHKDTSSWDKTLRGLCLT 69
           R S S V+HL + SIV     T+   RRHKSE+ +   QVKPH KD+SSWD+T R LC+T
Sbjct: 6   RPSSSGVVHLFTKSIVAGATATIR--RRHKSEYVNDRSQVKPHQKDSSSWDRTFRSLCIT 65

Query: 70  GKLAEAVALLCCMALQFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGYVPNEYLNTKL 129
           G+L+EAVALLCCM  +FHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVG++PNEYL TKL
Sbjct: 66  GRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNEYLKTKL 125

Query: 130 LILYAKSGDLETAYVLHEHLLEKSLVSWNSLIAGYVQKGLAEVGLEFYLKMRQSGLMPDQ 189
           LILYAK GDLETA +LHE LLE SLVSWN+LIAGYVQKG  EVGLE Y KMR++GLMPDQ
Sbjct: 126 LILYAKLGDLETANILHEKLLENSLVSWNALIAGYVQKGFGEVGLEIYFKMRRTGLMPDQ 185

Query: 190 YTFASVLRACASLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSLSDGHKAFNK 249
           YTFASV RACASLASLEHGKRAHGVLIKC+IGDNVVVSSALVDMYFKCSS+SDG K FNK
Sbjct: 186 YTFASVFRACASLASLEHGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDGRKVFNK 245

Query: 250 SSNRNVITWTALISGYGQHGRISEVLEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 309
           SS RNVITWTALISGYG HGR+SEVLE    XXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 246 SSTRNVITWTALISGYGHHGRVSEVLESFNNXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 305

Query: 310 XXXXSLMTKTYGIEPRGQHYAAMADLLARAGRLQEAYDFVLDAPCKEHSVMWGALVGACK 369
               SLM  TY IEPRGQHYAAMADLLARAGRLQEAYDFV+DAPCKEH+V+WGALVG CK
Sbjct: 306 WRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYDFVVDAPCKEHAVIWGALVGGCK 365

Query: 370 VHEDVDLMKHVAASYFELDPKNSGKLVVFSNAFATSGLWDNVEEIRAMMKKSGMSKDPGC 429
           VHED+DLMKH AA+Y  LD  N+GK VV +N FA SGLWDNV EIR MMKKSGM+K+PG 
Sbjct: 366 VHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASGLWDNVAEIRGMMKKSGMNKEPGY 425

Query: 430 SRIEIQREFHIFVKGDKSHRETEEIYRTIDRITPILKDAGYIPELCE 477
           SRIEIQREFH FVK DKSH + EEIYRTI  ITPI+KDAG  PEL E
Sbjct: 426 SRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPIIKDAGSFPELSE 470

BLAST of CsaV3_6G048410 vs. TAIR10
Match: AT4G16470.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 450.7 bits (1158), Expect = 1.2e-126
Identity = 228/438 (52.05%), Positives = 290/438 (66.21%), Query Frame = 0

Query: 36  RRHKSEFASPSFQVKPHHKDTSSWDKTLRGLCLTGKLAEAVALLCCMALQFHSKTYCLLL 95
           RR  +E     FQV+   K T   DKTL+GLC+TG+L EAV LL    LQ   +TY +LL
Sbjct: 57  RRMLAEKRIGRFQVENQRK-TEKLDKTLKGLCVTGRLKEAVGLLWSSGLQVEPETYAVLL 116

Query: 96  QECIFRKEYMKGKRIHAQMVVVGYVPNEYLNTKLLILYAKSGDLETAYVLHEHLLEKSLV 155
           QEC  RKEY KGKRIHAQM VVG+  NEYL  KLLILYA SGDL+TA +L   L  + L+
Sbjct: 117 QECKQRKEYTKGKRIHAQMFVVGFALNEYLKVKLLILYALSGDLQTAGILFRSLKIRDLI 176

Query: 156 SWNSLIAGYVQKGLAEVGLEFYLKMRQSGLMPDQYTFASVLRACASLASLEHGKRAHGVL 215
            WN++I+GYVQKGL + GL  Y  MRQ+ ++PDQYTFASV RAC++L  LEHGKRAH V+
Sbjct: 177 PWNAMISGYVQKGLEQEGLFIYYDMRQNRIVPDQYTFASVFRACSALDRLEHGKRAHAVM 236

Query: 216 IKCQIGDNVVVSSALVDMYFKCSSLSDGHKAFNKSSNRNVITWTALISGYGQHGRISEVL 275
           IK  I  N++V SALVDMYFKCSS SDGH+ F++ S RNVITWT+LISGYG HG++SEVL
Sbjct: 237 IKRCIKSNIIVDSALVDMYFKCSSFSDGHRVFDQLSTRNVITWTSLISGYGYHGKVSEVL 296

Query: 276 EXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSLMTKTYGIEPRGQHYAAMADL 335
           +                                       M + YGIEP GQHYAAM D 
Sbjct: 297 KCFEKMKEEGCRPNPVTFLVVLTACNHGGLVDKGWEHFYSMKRDYGIEPEGQHYAAMVDT 356

Query: 336 LARAGRLQEAYDFVLDAPCKEHSVMWGALVGACKVHEDVDLMKHVAASYFELDPKNSGKL 395
           L RAGRLQEAY+FV+ +PCKEH  +WG+L+GAC++H +V L++  A  + ELDP N G  
Sbjct: 357 LGRAGRLQEAYEFVMKSPCKEHPPVWGSLLGACRIHGNVKLLELAATKFLELDPTNGGNY 416

Query: 396 VVFSNAFATSGLWDNVEEIRAMMKKSGMSKDPGCSRIEIQREFHIFVKGDKSHRETEEIY 455
           VVF+N +A+ GL +   ++R  M+ +G+ KDPG S+IE+Q E H F+K D SHR +E+IY
Sbjct: 417 VVFANGYASCGLREAASKVRRKMENAGVKKDPGYSQIELQGEVHRFMKDDTSHRLSEKIY 476

Query: 456 RTIDRITPILKDAGYIPE 474
           + +  +T    D  Y P+
Sbjct: 477 KKVHEMTSFFMDIDYYPD 493

BLAST of CsaV3_6G048410 vs. TAIR10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 251.5 bits (641), Expect = 1.0e-66
Identity = 126/379 (33.25%), Positives = 203/379 (53.56%), Query Frame = 0

Query: 95  LQECIFRKEYMKGKRIHAQMVVVGYVPNEYLNTKLLILYAKSGDLETAYVLHEHLLEKSL 154
           L  C    +  +G+ IH   V +G   N  +   L+ +Y K  +++TA  +   L  ++L
Sbjct: 344 LHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTL 403

Query: 155 VSWNSLIAGYVQKGLAEVGLEFYLKMRQSGLMPDQYTFASVLRACASLASLEHGKRAHGV 214
           VSWN++I G+ Q G     L ++ +MR   + PD +T+ SV+ A A L+   H K  HGV
Sbjct: 404 VSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGV 463

Query: 215 LIKCQIGDNVVVSSALVDMYFKCSSLSDGHKAFNKSSNRNVITWTALISGYGQHGRISEV 274
           +++  +  NV V++ALVDMY KC ++      F+  S R+V TW A+I GYG HG     
Sbjct: 464 VMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAA 523

Query: 275 LEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSLMTKTYGIEPRGQHYAAMAD 334
           LE                                      +M + Y IE    HY AM D
Sbjct: 524 LELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVD 583

Query: 335 LLARAGRLQEAYDFVLDAPCKEHSVMWGALVGACKVHEDVDLMKHVAASYFELDPKNSGK 394
           LL RAGRL EA+DF++  P K    ++GA++GAC++H++V+  +  A   FEL+P + G 
Sbjct: 584 LLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGY 643

Query: 395 LVVFSNAFATSGLWDNVEEIRAMMKKSGMSKDPGCSRIEIQREFHIFVKGDKSHRETEEI 454
            V+ +N +  + +W+ V ++R  M + G+ K PGCS +EI+ E H F  G  +H ++++I
Sbjct: 644 HVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSKKI 703

Query: 455 YRTIDRITPILKDAGYIPE 474
           Y  ++++   +K+AGY+P+
Sbjct: 704 YAFLEKLICHIKEAGYVPD 722

BLAST of CsaV3_6G048410 vs. TAIR10
Match: AT2G03880.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 249.2 bits (635), Expect = 5.2e-66
Identity = 126/384 (32.81%), Positives = 209/384 (54.43%), Query Frame = 0

Query: 90  TYCLLLQECIFRKEYMKGKRIHAQMVVVGYVPNEYLNTKLLILYAKSGDLETAYVLHEHL 149
           TY  +L+ C    +    + +H  ++  G   + ++ + L+ ++AK G+ E A  + + +
Sbjct: 164 TYSSVLRSCNGMSDV---RMLHCGIIKEGLESDVFVRSALIDVFAKLGEPEDALSVFDEM 223

Query: 150 LEKSLVSWNSLIAGYVQKGLAEVGLEFYLKMRQSGLMPDQYTFASVLRACASLASLEHGK 209
           +    + WNS+I G+ Q   ++V LE + +M+++G + +Q T  SVLRAC  LA LE G 
Sbjct: 224 VTGDAIVWNSIIGGFAQNSRSDVALELFKRMKRAGFIAEQATLTSVLRACTGLALLELGM 283

Query: 210 RAHGVLIKCQIGDNVVVSSALVDMYFKCSSLSDGHKAFNKSSNRNVITWTALISGYGQHG 269
           +AH  ++K     ++++++ALVDMY KC SL D  + FN+   R+VITW+ +ISG  Q+G
Sbjct: 284 QAHVHIVK--YDQDLILNNALVDMYCKCGSLEDALRVFNQMKERDVITWSTMISGLAQNG 343

Query: 270 RISEVLEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSLMTKTYGIEPRGQHY 329
              E L+                                       M K YGI+P  +HY
Sbjct: 344 YSQEALKLFERMKSSGTKPNYITIVGVLFACSHAGLLEDGWYYFRSMKKLYGIDPVREHY 403

Query: 330 AAMADLLARAGRLQEAYDFVLDAPCKEHSVMWGALVGACKVHEDVDLMKHVAASYFELDP 389
             M DLL +AG+L +A   + +  C+  +V W  L+GAC+V  ++ L ++ A     LDP
Sbjct: 404 GCMIDLLGKAGKLDDAVKLLNEMECEPDAVTWRTLLGACRVQRNMVLAEYAAKKVIALDP 463

Query: 390 KNSGKLVVFSNAFATSGLWDNVEEIRAMMKKSGMSKDPGCSRIEIQREFHIFVKGDKSHR 449
           +++G   + SN +A S  WD+VEEIR  M+  G+ K+PGCS IE+ ++ H F+ GD SH 
Sbjct: 464 EDAGTYTLLSNIYANSQKWDSVEEIRTRMRDRGIKKEPGCSWIEVNKQIHAFIIGDNSHP 523

Query: 450 ETEEIYRTIDRITPILKDAGYIPE 474
           +  E+ + ++++   L   GY+PE
Sbjct: 524 QIVEVSKKLNQLIHRLTGIGYVPE 542

BLAST of CsaV3_6G048410 vs. TAIR10
Match: AT3G13770.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 248.8 bits (634), Expect = 6.8e-66
Identity = 146/465 (31.40%), Positives = 244/465 (52.47%), Query Frame = 0

Query: 18  HLISTSIVP-TQLPTVHLPRRHKSEFASPSFQV--KPHHKDTSSWDKTLRGLCLTGKLAE 77
           H+I T  +P T L T  L    K +    + +V  +   K+  SW   +     TG  +E
Sbjct: 77  HMIKTRYLPATYLRTRLLIFYGKCDCLEDARKVLDEMPEKNVVSWTAMISRYSQTGHSSE 136

Query: 78  AVALLCCMAL---QFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGYVPNEYLNTKLLI 137
           A+ +   M     + +  T+  +L  CI       GK+IH  +V   Y  + ++ + LL 
Sbjct: 137 ALTVFAEMMRSDGKPNEFTFATVLTSCIRASGLGLGKQIHGLIVKWNYDSHIFVGSSLLD 196

Query: 138 LYAKSGDLETAYVLHEHLLEKSLVSWNSLIAGYVQKGLAEVGLEFYLKMRQSGLMPDQYT 197
           +YAK+G ++ A  + E L E+ +VS  ++IAGY Q GL E  LE + ++   G+ P+  T
Sbjct: 197 MYAKAGQIKEAREIFECLPERDVVSCTAIIAGYAQLGLDEEALEMFHRLHSEGMSPNYVT 256

Query: 198 FASVLRACASLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSLSDGHKAFNKSS 257
           +AS+L A + LA L+HGK+AH  +++ ++    V+ ++L+DMY KC +LS   + F+   
Sbjct: 257 YASLLTALSGLALLDHGKQAHCHVLRRELPFYAVLQNSLIDMYSKCGNLSYARRLFDNMP 316

Query: 258 NRNVITWTALISGYGQHGRISEVLE--XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 317
            R  I+W A++ GY +HG   EVLE                                   
Sbjct: 317 ERTAISWNAMLVGYSKHGLGREVLELFRLMRDEKRVKPDAVTLLAVLSGCSHGRMEDTGL 376

Query: 318 XXXXSLMTKTYGIEPRGQHYAAMADLLARAGRLQEAYDFVLDAPCKEHSVMWGALVGACK 377
                ++   YG +P  +HY  + D+L RAGR+ EA++F+   P K  + + G+L+GAC+
Sbjct: 377 NIFDGMVAGEYGTKPGTEHYGCIVDMLGRAGRIDEAFEFIKRMPSKPTAGVLGSLLGACR 436

Query: 378 VHEDVDLMKHVAASYFELDPKNSGKLVVFSNAFATSGLWDNVEEIRAMMKKSGMSKDPGC 437
           VH  VD+ + V     E++P+N+G  V+ SN +A++G W +V  +RAMM +  ++K+PG 
Sbjct: 437 VHLSVDIGESVGRRLIEIEPENAGNYVILSNLYASAGRWADVNNVRAMMMQKAVTKEPGR 496

Query: 438 SRIEIQREFHIFVKGDKSHRETEEIYRTIDRITPILKDAGYIPEL 475
           S I+ ++  H F   D++H   EE+   +  I+  +K AGY+P+L
Sbjct: 497 SWIQHEQTLHYFHANDRTHPRREEVLAKMKEISIKMKQAGYVPDL 541

BLAST of CsaV3_6G048410 vs. TAIR10
Match: AT3G23330.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 244.6 bits (623), Expect = 1.3e-64
Identity = 134/430 (31.16%), Positives = 225/430 (52.33%), Query Frame = 0

Query: 47  FQVKPHHKDTSSWDKTLRGLCLTGKLAEAVALLCCMA---LQFHSKTYCLLLQECIFRKE 106
           F+V P  KD  S++  + G   +G   +A+ ++  M    L+  S T   +L       +
Sbjct: 199 FEVMP-RKDVVSYNTIIAGYAQSGMYEDALRMVREMGTTDLKPDSFTLSSVLPIFSEYVD 258

Query: 107 YMKGKRIHAQMVVVGYVPNEYLNTKLLILYAKSGDLETAYVLHEHLLEKSLVSWNSLIAG 166
            +KGK IH  ++  G   + Y+ + L+ +YAKS  +E +  +   L  +  +SWNSL+AG
Sbjct: 259 VIKGKEIHGYVIRKGIDSDVYIGSSLVDMYAKSARIEDSERVFSRLYCRDGISWNSLVAG 318

Query: 167 YVQKGLAEVGLEFYLKMRQSGLMPDQYTFASVLRACASLASLEHGKRAHGVLIKCQIGDN 226
           YVQ G     L  + +M  + + P    F+SV+ ACA LA+L  GK+ HG +++   G N
Sbjct: 319 YVQNGRYNEALRLFRQMVTAKVKPGAVAFSSVIPACAHLATLHLGKQLHGYVLRGGFGSN 378

Query: 227 VVVSSALVDMYFKCSSLSDGHKAFNKSSNRNVITWTALISGYGQHGRISEVLEXXXXXXX 286
           + ++SALVDMY KC ++    K F++ +  + ++WTA+I G+  HG   E +        
Sbjct: 379 IFIASALVDMYSKCGNIKAARKIFDRMNVLDEVSWTAIIMGHALHGHGHEAVSLFEEMKR 438

Query: 287 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSLMTKTYGIEPRGQHYAAMADLLARAGRLQ 346
                                         + MTK YG+    +HYAA+ADLL RAG+L+
Sbjct: 439 QGVKPNQVAFVAVLTACSHVGLVDEAWGYFNSMTKVYGLNQELEHYAAVADLLGRAGKLE 498

Query: 347 EAYDFVLDAPCKEHSVMWGALVGACKVHEDVDLMKHVAASYFELDPKNSGKLVVFSNAFA 406
           EAY+F+     +    +W  L+ +C VH++++L + VA   F +D +N G  V+  N +A
Sbjct: 499 EAYNFISKMCVEPTGSVWSTLLSSCSVHKNLELAEKVAEKIFTVDSENMGAYVLMCNMYA 558

Query: 407 TSGLWDNVEEIRAMMKKSGMSKDPGCSRIEIQREFHIFVKGDKSHRETEEIYRTIDRITP 466
           ++G W  + ++R  M+K G+ K P CS IE++ + H FV GD+SH   ++I   +  +  
Sbjct: 559 SNGRWKEMAKLRLRMRKKGLRKKPACSWIEMKNKTHGFVSGDRSHPSMDKINEFLKAVME 618

Query: 467 ILKDAGYIPE 474
            ++  GY+ +
Sbjct: 619 QMEKEGYVAD 627

BLAST of CsaV3_6G048410 vs. Swiss-Prot
Match: sp|O23491|PP315_ARATH (Pentatricopeptide repeat-containing protein At4g16470 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E12 PE=2 SV=2)

HSP 1 Score: 450.7 bits (1158), Expect = 2.1e-125
Identity = 228/438 (52.05%), Positives = 290/438 (66.21%), Query Frame = 0

Query: 36  RRHKSEFASPSFQVKPHHKDTSSWDKTLRGLCLTGKLAEAVALLCCMALQFHSKTYCLLL 95
           RR  +E     FQV+   K T   DKTL+GLC+TG+L EAV LL    LQ   +TY +LL
Sbjct: 57  RRMLAEKRIGRFQVENQRK-TEKLDKTLKGLCVTGRLKEAVGLLWSSGLQVEPETYAVLL 116

Query: 96  QECIFRKEYMKGKRIHAQMVVVGYVPNEYLNTKLLILYAKSGDLETAYVLHEHLLEKSLV 155
           QEC  RKEY KGKRIHAQM VVG+  NEYL  KLLILYA SGDL+TA +L   L  + L+
Sbjct: 117 QECKQRKEYTKGKRIHAQMFVVGFALNEYLKVKLLILYALSGDLQTAGILFRSLKIRDLI 176

Query: 156 SWNSLIAGYVQKGLAEVGLEFYLKMRQSGLMPDQYTFASVLRACASLASLEHGKRAHGVL 215
            WN++I+GYVQKGL + GL  Y  MRQ+ ++PDQYTFASV RAC++L  LEHGKRAH V+
Sbjct: 177 PWNAMISGYVQKGLEQEGLFIYYDMRQNRIVPDQYTFASVFRACSALDRLEHGKRAHAVM 236

Query: 216 IKCQIGDNVVVSSALVDMYFKCSSLSDGHKAFNKSSNRNVITWTALISGYGQHGRISEVL 275
           IK  I  N++V SALVDMYFKCSS SDGH+ F++ S RNVITWT+LISGYG HG++SEVL
Sbjct: 237 IKRCIKSNIIVDSALVDMYFKCSSFSDGHRVFDQLSTRNVITWTSLISGYGYHGKVSEVL 296

Query: 276 EXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSLMTKTYGIEPRGQHYAAMADL 335
           +                                       M + YGIEP GQHYAAM D 
Sbjct: 297 KCFEKMKEEGCRPNPVTFLVVLTACNHGGLVDKGWEHFYSMKRDYGIEPEGQHYAAMVDT 356

Query: 336 LARAGRLQEAYDFVLDAPCKEHSVMWGALVGACKVHEDVDLMKHVAASYFELDPKNSGKL 395
           L RAGRLQEAY+FV+ +PCKEH  +WG+L+GAC++H +V L++  A  + ELDP N G  
Sbjct: 357 LGRAGRLQEAYEFVMKSPCKEHPPVWGSLLGACRIHGNVKLLELAATKFLELDPTNGGNY 416

Query: 396 VVFSNAFATSGLWDNVEEIRAMMKKSGMSKDPGCSRIEIQREFHIFVKGDKSHRETEEIY 455
           VVF+N +A+ GL +   ++R  M+ +G+ KDPG S+IE+Q E H F+K D SHR +E+IY
Sbjct: 417 VVFANGYASCGLREAASKVRRKMENAGVKKDPGYSQIELQGEVHRFMKDDTSHRLSEKIY 476

Query: 456 RTIDRITPILKDAGYIPE 474
           + +  +T    D  Y P+
Sbjct: 477 KKVHEMTSFFMDIDYYPD 493
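
The Swiss-Prot match lines in this section follow the UniProt FASTA header layout: database, accession and entry name separated by `|`, then a description followed by `OS=`, `OX=`, `GN=`, `PE=` and `SV=` fields. A minimal sketch for splitting such a line into named fields; `parse_swissprot_match` and both regular expressions are illustrative and assume the parenthesised form used on this page.

```python
import re

HEADER = re.compile(r"^(sp|tr)\|([^|]+)\|(\S+)\s+\((.*)\)\s*$")
FIELD = re.compile(r"\s(OS|OX|GN|PE|SV)=")

def parse_swissprot_match(text):
    """Split a 'sp|ACC|ENTRY (description OS=... SV=...)' string into a dict."""
    m = HEADER.match(text.strip())
    if m is None:
        raise ValueError(f"unrecognised match line: {text!r}")
    db, accession, entry, rest = m.groups()
    # re.split with a capturing group yields [description, key, value, key, value, ...]
    parts = FIELD.split(rest)
    info = {"db": db, "accession": accession, "entry": entry,
            "description": parts[0].strip()}
    for key, value in zip(parts[1::2], parts[2::2]):
        info[key] = value.strip()
    return info

info = parse_swissprot_match(
    "sp|O23491|PP315_ARATH (Pentatricopeptide repeat-containing protein "
    "At4g16470 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E12 PE=2 SV=2)"
)
print(info["accession"], info["OS"], info["GN"])  # O23491 Arabidopsis thaliana PCMP-E12
```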

BLAST of CsaV3_6G048410 vs. Swiss-Prot
Match: sp|Q3E6Q1|PPR32_ARATH (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 251.5 bits (641), Expect = 1.9e-65
Identity = 126/379 (33.25%), Positives = 203/379 (53.56%), Query Frame = 0

Query: 95  LQECIFRKEYMKGKRIHAQMVVVGYVPNEYLNTKLLILYAKSGDLETAYVLHEHLLEKSL 154
           L  C    +  +G+ IH   V +G   N  +   L+ +Y K  +++TA  +   L  ++L
Sbjct: 344 LHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTL 403

Query: 155 VSWNSLIAGYVQKGLAEVGLEFYLKMRQSGLMPDQYTFASVLRACASLASLEHGKRAHGV 214
           VSWN++I G+ Q G     L ++ +MR   + PD +T+ SV+ A A L+   H K  HGV
Sbjct: 404 VSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGV 463

Query: 215 LIKCQIGDNVVVSSALVDMYFKCSSLSDGHKAFNKSSNRNVITWTALISGYGQHGRISEV 274
           +++  +  NV V++ALVDMY KC ++      F+  S R+V TW A+I GYG HG     
Sbjct: 464 VMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAA 523

Query: 275 LEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSLMTKTYGIEPRGQHYAAMAD 334
           LE                                      +M + Y IE    HY AM D
Sbjct: 524 LELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVD 583

Query: 335 LLARAGRLQEAYDFVLDAPCKEHSVMWGALVGACKVHEDVDLMKHVAASYFELDPKNSGK 394
           LL RAGRL EA+DF++  P K    ++GA++GAC++H++V+  +  A   FEL+P + G 
Sbjct: 584 LLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGY 643

Query: 395 LVVFSNAFATSGLWDNVEEIRAMMKKSGMSKDPGCSRIEIQREFHIFVKGDKSHRETEEI 454
            V+ +N +  + +W+ V ++R  M + G+ K PGCS +EI+ E H F  G  +H ++++I
Sbjct: 644 HVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSKKI 703

Query: 455 YRTIDRITPILKDAGYIPE 474
           Y  ++++   +K+AGY+P+
Sbjct: 704 YAFLEKLICHIKEAGYVPD 722

BLAST of CsaV3_6G048410 vs. Swiss-Prot
Match: sp|Q9SI53|PP147_ARATH (Pentatricopeptide repeat-containing protein At2g03880, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H44 PE=2 SV=1)

HSP 1 Score: 249.2 bits (635), Expect = 9.3e-65
Identity = 126/384 (32.81%), Positives = 209/384 (54.43%), Query Frame = 0

Query: 90  TYCLLLQECIFRKEYMKGKRIHAQMVVVGYVPNEYLNTKLLILYAKSGDLETAYVLHEHL 149
           TY  +L+ C    +    + +H  ++  G   + ++ + L+ ++AK G+ E A  + + +
Sbjct: 164 TYSSVLRSCNGMSDV---RMLHCGIIKEGLESDVFVRSALIDVFAKLGEPEDALSVFDEM 223

Query: 150 LEKSLVSWNSLIAGYVQKGLAEVGLEFYLKMRQSGLMPDQYTFASVLRACASLASLEHGK 209
           +    + WNS+I G+ Q   ++V LE + +M+++G + +Q T  SVLRAC  LA LE G 
Sbjct: 224 VTGDAIVWNSIIGGFAQNSRSDVALELFKRMKRAGFIAEQATLTSVLRACTGLALLELGM 283

Query: 210 RAHGVLIKCQIGDNVVVSSALVDMYFKCSSLSDGHKAFNKSSNRNVITWTALISGYGQHG 269
           +AH  ++K     ++++++ALVDMY KC SL D  + FN+   R+VITW+ +ISG  Q+G
Sbjct: 284 QAHVHIVK--YDQDLILNNALVDMYCKCGSLEDALRVFNQMKERDVITWSTMISGLAQNG 343

Query: 270 RISEVLEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSLMTKTYGIEPRGQHY 329
              E L+                                       M K YGI+P  +HY
Sbjct: 344 YSQEALKLFERMKSSGTKPNYITIVGVLFACSHAGLLEDGWYYFRSMKKLYGIDPVREHY 403

Query: 330 AAMADLLARAGRLQEAYDFVLDAPCKEHSVMWGALVGACKVHEDVDLMKHVAASYFELDP 389
             M DLL +AG+L +A   + +  C+  +V W  L+GAC+V  ++ L ++ A     LDP
Sbjct: 404 GCMIDLLGKAGKLDDAVKLLNEMECEPDAVTWRTLLGACRVQRNMVLAEYAAKKVIALDP 463

Query: 390 KNSGKLVVFSNAFATSGLWDNVEEIRAMMKKSGMSKDPGCSRIEIQREFHIFVKGDKSHR 449
           +++G   + SN +A S  WD+VEEIR  M+  G+ K+PGCS IE+ ++ H F+ GD SH 
Sbjct: 464 EDAGTYTLLSNIYANSQKWDSVEEIRTRMRDRGIKKEPGCSWIEVNKQIHAFIIGDNSHP 523

Query: 450 ETEEIYRTIDRITPILKDAGYIPE 474
           +  E+ + ++++   L   GY+PE
Sbjct: 524 QIVEVSKKLNQLIHRLTGIGYVPE 542

BLAST of CsaV3_6G048410 vs. Swiss-Prot
Match: sp|Q9LIC3|PP227_ARATH (Putative pentatricopeptide repeat-containing protein At3g13770, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H85 PE=3 SV=1)

HSP 1 Score: 248.8 bits (634), Expect = 1.2e-64
Identity = 146/465 (31.40%), Positives = 244/465 (52.47%), Query Frame = 0

Query: 18  HLISTSIVP-TQLPTVHLPRRHKSEFASPSFQV--KPHHKDTSSWDKTLRGLCLTGKLAE 77
           H+I T  +P T L T  L    K +    + +V  +   K+  SW   +     TG  +E
Sbjct: 77  HMIKTRYLPATYLRTRLLIFYGKCDCLEDARKVLDEMPEKNVVSWTAMISRYSQTGHSSE 136

Query: 78  AVALLCCMAL---QFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGYVPNEYLNTKLLI 137
           A+ +   M     + +  T+  +L  CI       GK+IH  +V   Y  + ++ + LL 
Sbjct: 137 ALTVFAEMMRSDGKPNEFTFATVLTSCIRASGLGLGKQIHGLIVKWNYDSHIFVGSSLLD 196

Query: 138 LYAKSGDLETAYVLHEHLLEKSLVSWNSLIAGYVQKGLAEVGLEFYLKMRQSGLMPDQYT 197
           +YAK+G ++ A  + E L E+ +VS  ++IAGY Q GL E  LE + ++   G+ P+  T
Sbjct: 197 MYAKAGQIKEAREIFECLPERDVVSCTAIIAGYAQLGLDEEALEMFHRLHSEGMSPNYVT 256

Query: 198 FASVLRACASLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSLSDGHKAFNKSS 257
           +AS+L A + LA L+HGK+AH  +++ ++    V+ ++L+DMY KC +LS   + F+   
Sbjct: 257 YASLLTALSGLALLDHGKQAHCHVLRRELPFYAVLQNSLIDMYSKCGNLSYARRLFDNMP 316

Query: 258 NRNVITWTALISGYGQHGRISEVLE--XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 317
            R  I+W A++ GY +HG   EVLE                                   
Sbjct: 317 ERTAISWNAMLVGYSKHGLGREVLELFRLMRDEKRVKPDAVTLLAVLSGCSHGRMEDTGL 376

Query: 318 XXXXSLMTKTYGIEPRGQHYAAMADLLARAGRLQEAYDFVLDAPCKEHSVMWGALVGACK 377
                ++   YG +P  +HY  + D+L RAGR+ EA++F+   P K  + + G+L+GAC+
Sbjct: 377 NIFDGMVAGEYGTKPGTEHYGCIVDMLGRAGRIDEAFEFIKRMPSKPTAGVLGSLLGACR 436

Query: 378 VHEDVDLMKHVAASYFELDPKNSGKLVVFSNAFATSGLWDNVEEIRAMMKKSGMSKDPGC 437
           VH  VD+ + V     E++P+N+G  V+ SN +A++G W +V  +RAMM +  ++K+PG 
Sbjct: 437 VHLSVDIGESVGRRLIEIEPENAGNYVILSNLYASAGRWADVNNVRAMMMQKAVTKEPGR 496

Query: 438 SRIEIQREFHIFVKGDKSHRETEEIYRTIDRITPILKDAGYIPEL 475
           S I+ ++  H F   D++H   EE+   +  I+  +K AGY+P+L
Sbjct: 497 SWIQHEQTLHYFHANDRTHPRREEVLAKMKEISIKMKQAGYVPDL 541
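
The percentages in an HSP summary line follow directly from the residue counts given beside them. As a minimal sketch, assuming the summary-line layout used in the HSP blocks on this page, they can be recomputed as shown here. The helper name `parse_hsp_stats` and the regular expression are illustrative assumptions, not part of this database or of any BLAST tool.

```python
import re

# Hypothetical pattern for the summary line of one HSP block.
# "Posi?tives" accepts both "Positives" and the variant spelling "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return recomputed and reported percentages for one HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        # Recomputed from the counts: 100 * matches / alignment length.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed on the page, for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


# Summary line of the At3g13770 (sp|Q9LIC3) hit.
stats = parse_hsp_stats(
    "Identity = 146/465 (31.40%), Positives = 244/465 (52.47%), Query Frame = 0"
)
print(stats)
```

For this hit the recomputed values agree with the reported ones: 146/465 rounds to 31.40% and 244/465 to 52.47%.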

BLAST of CsaV3_6G048410 vs. Swiss-Prot
Match: sp|Q9LW63|PP251_ARATH (Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H32 PE=3 SV=1)

HSP 1 Score: 244.6 bits (623), Expect = 2.3e-63
Identity = 134/430 (31.16%), Positives = 225/430 (52.33%), Query Frame = 0

Query: 47  FQVKPHHKDTSSWDKTLRGLCLTGKLAEAVALLCCMA---LQFHSKTYCLLLQECIFRKE 106
           F+V P  KD  S++  + G   +G   +A+ ++  M    L+  S T   +L       +
Sbjct: 199 FEVMP-RKDVVSYNTIIAGYAQSGMYEDALRMVREMGTTDLKPDSFTLSSVLPIFSEYVD 258

Query: 107 YMKGKRIHAQMVVVGYVPNEYLNTKLLILYAKSGDLETAYVLHEHLLEKSLVSWNSLIAG 166
            +KGK IH  ++  G   + Y+ + L+ +YAKS  +E +  +   L  +  +SWNSL+AG
Sbjct: 259 VIKGKEIHGYVIRKGIDSDVYIGSSLVDMYAKSARIEDSERVFSRLYCRDGISWNSLVAG 318

Query: 167 YVQKGLAEVGLEFYLKMRQSGLMPDQYTFASVLRACASLASLEHGKRAHGVLIKCQIGDN 226
           YVQ G     L  + +M  + + P    F+SV+ ACA LA+L  GK+ HG +++   G N
Sbjct: 319 YVQNGRYNEALRLFRQMVTAKVKPGAVAFSSVIPACAHLATLHLGKQLHGYVLRGGFGSN 378

Query: 227 VVVSSALVDMYFKCSSLSDGHKAFNKSSNRNVITWTALISGYGQHGRISEVLEXXXXXXX 286
           + ++SALVDMY KC ++    K F++ +  + ++WTA+I G+  HG   E +        
Sbjct: 379 IFIASALVDMYSKCGNIKAARKIFDRMNVLDEVSWTAIIMGHALHGHGHEAVSLFEEMKR 438

Query: 287 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSLMTKTYGIEPRGQHYAAMADLLARAGRLQ 346
                                         + MTK YG+    +HYAA+ADLL RAG+L+
Sbjct: 439 QGVKPNQVAFVAVLTACSHVGLVDEAWGYFNSMTKVYGLNQELEHYAAVADLLGRAGKLE 498

Query: 347 EAYDFVLDAPCKEHSVMWGALVGACKVHEDVDLMKHVAASYFELDPKNSGKLVVFSNAFA 406
           EAY+F+     +    +W  L+ +C VH++++L + VA   F +D +N G  V+  N +A
Sbjct: 499 EAYNFISKMCVEPTGSVWSTLLSSCSVHKNLELAEKVAEKIFTVDSENMGAYVLMCNMYA 558

Query: 407 TSGLWDNVEEIRAMMKKSGMSKDPGCSRIEIQREFHIFVKGDKSHRETEEIYRTIDRITP 466
           ++G W  + ++R  M+K G+ K P CS IE++ + H FV GD+SH   ++I   +  +  
Sbjct: 559 SNGRWKEMAKLRLRMRKKGLRKKPACSWIEMKNKTHGFVSGDRSHPSMDKINEFLKAVME 618

Query: 467 ILKDAGYIPE 474
            ++  GY+ +
Sbjct: 619 QMEKEGYVAD 627

BLAST of CsaV3_6G048410 vs. TrEMBL
Match: tr|A0A0A0KJS7|A0A0A0KJS7_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G511640 PE=4 SV=1)

HSP 1 Score: 895.6 bits (2313), Expect = 5.0e-257
Identity = 486/486 (100.00%), Positives = 486/486 (100.00%), Query Frame = 0

Query: 1   MLKCASPLTRSSFSAVIHLISTSIVPTQLPTVHLPRRHKSEFASPSFQVKPHHKDTSSWD 60
           MLKCASPLTRSSFSAVIHLISTSIVPTQLPTVHLPRRHKSEFASPSFQVKPHHKDTSSWD
Sbjct: 1   MLKCASPLTRSSFSAVIHLISTSIVPTQLPTVHLPRRHKSEFASPSFQVKPHHKDTSSWD 60

Query: 61  KTLRGLCLTGKLAEAVALLCCMALQFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGYV 120
           KTLRGLCLTGKLAEAVALLCCMALQFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGYV
Sbjct: 61  KTLRGLCLTGKLAEAVALLCCMALQFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGYV 120

Query: 121 PNEYLNTKLLILYAKSGDLETAYVLHEHLLEKSLVSWNSLIAGYVQKGLAEVGLEFYLKM 180
           PNEYLNTKLLILYAKSGDLETAYVLHEHLLEKSLVSWNSLIAGYVQKGLAEVGLEFYLKM
Sbjct: 121 PNEYLNTKLLILYAKSGDLETAYVLHEHLLEKSLVSWNSLIAGYVQKGLAEVGLEFYLKM 180

Query: 181 RQSGLMPDQYTFASVLRACASLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSL 240
           RQSGLMPDQYTFASVLRACASLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSL
Sbjct: 181 RQSGLMPDQYTFASVLRACASLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSL 240

Query: 241 SDGHKAFNKSSNRNVITWTALISGYGQHGRISEVLEXXXXXXXXXXXXXXXXXXXXXXXX 300
           SDGHKAFNKSSNRNVITWTALISGYGQHGRISEVLEXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 241 SDGHKAFNKSSNRNVITWTALISGYGQHGRISEVLEXXXXXXXXXXXXXXXXXXXXXXXX 300

Query: 301 XXXXXXXXXXXXXSLMTKTYGIEPRGQHYAAMADLLARAGRLQEAYDFVLDAPCKEHSVM 360
           XXXXXXXXXXXXXSLMTKTYGIEPRGQHYAAMADLLARAGRLQEAYDFVLDAPCKEHSVM
Sbjct: 301 XXXXXXXXXXXXXSLMTKTYGIEPRGQHYAAMADLLARAGRLQEAYDFVLDAPCKEHSVM 360

Query: 361 WGALVGACKVHEDVDLMKHVAASYFELDPKNSGKLVVFSNAFATSGLWDNVEEIRAMMKK 420
           WGALVGACKVHEDVDLMKHVAASYFELDPKNSGKLVVFSNAFATSGLWDNVEEIRAMMKK
Sbjct: 361 WGALVGACKVHEDVDLMKHVAASYFELDPKNSGKLVVFSNAFATSGLWDNVEEIRAMMKK 420

Query: 421 SGMSKDPGCSRIEIQREFHIFVKGDKSHRETEEIYRTIDRITPILKDAGYIPELCEKTVI 480
           SGMSKDPGCSRIEIQREFHIFVKGDKSHRETEEIYRTIDRITPILKDAGYIPELCEKTVI
Sbjct: 421 SGMSKDPGCSRIEIQREFHIFVKGDKSHRETEEIYRTIDRITPILKDAGYIPELCEKTVI 480

Query: 481 EGLSWC 487
           EGLSWC
Sbjct: 481 EGLSWC 486

BLAST of CsaV3_6G048410 vs. TrEMBL
Match: tr|A0A1S3AZV2|A0A1S3AZV2_CUCME (pentatricopeptide repeat-containing protein At4g16470 OS=Cucumis melo OX=3656 GN=LOC103484664 PE=4 SV=1)

HSP 1 Score: 824.7 bits (2129), Expect = 1.1e-235
Identity = 414/477 (86.79%), Positives = 425/477 (89.10%), Query Frame = 0

Query: 1   MLKCASPLTRSSFSAVIHLISTSIVPTQLPTVHLPRRHKSEFASPSFQVKPHHKDTSSWD 60
           MLKCASPLTRSSFSAVIHL + SIV TQ P VH PRRH+SE A PSFQVK HHKD SSWD
Sbjct: 1   MLKCASPLTRSSFSAVIHLFTKSIVATQFPIVHFPRRHESESACPSFQVKRHHKDNSSWD 60

Query: 61  KTLRGLCLTGKLAEAVALLCCMALQFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGYV 120
           +TLRGLCLTGKLAEAVALLCCMALQF SKTYCLLLQECIFRKEYMKGKRIHAQMVVVGYV
Sbjct: 61  ETLRGLCLTGKLAEAVALLCCMALQFQSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGYV 120

Query: 121 PNEYLNTKLLILYAKSGDLETAYVLHEHLLEKSLVSWNSLIAGYVQKGLAEVGLEFYLKM 180
           PNEYLNTKLLILYAKSGDLETAY+LHEHLLEKSLVSWNSLIAGYVQKGLAEVGLEFYLKM
Sbjct: 121 PNEYLNTKLLILYAKSGDLETAYILHEHLLEKSLVSWNSLIAGYVQKGLAEVGLEFYLKM 180

Query: 181 RQSGLMPDQYTFASVLRACASLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSL 240
           RQSGLMPDQYTFASVLRACASLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSL
Sbjct: 181 RQSGLMPDQYTFASVLRACASLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSL 240

Query: 241 SDGHKAFNKSSNRNVITWTALISGYGQHGRISEVLEXXXXXXXXXXXXXXXXXXXXXXXX 300
           SDGHKAFNKS NRNVITWTALISGYGQHGR+SEVLE                        
Sbjct: 241 SDGHKAFNKSINRNVITWTALISGYGQHGRVSEVLESFHSMIKEGYRPNYVTFLVVLAAC 300

Query: 301 XXXXXXXXXXXXXSLMTKTYGIEPRGQHYAAMADLLARAGRLQEAYDFVLDAPCKEHSVM 360
                        SLMTKTY IEPRGQHYAAMADLLARAGRL+EAY+FVLDAPCKE+SVM
Sbjct: 301 SRGGFVSEAWRYFSLMTKTYEIEPRGQHYAAMADLLARAGRLREAYNFVLDAPCKENSVM 360

Query: 361 WGALVGACKVHEDVDLMKHVAASYFELDPKNSGKLVVFSNAFATSGLWDNVEEIRAMMKK 420
           WGALVGACKVHED+DLMKHVAASYFELDP+NSGKLVVFSNAFATSGLWDNVEEIRAMMKK
Sbjct: 361 WGALVGACKVHEDIDLMKHVAASYFELDPENSGKLVVFSNAFATSGLWDNVEEIRAMMKK 420

Query: 421 SGMSKDPGCSRIEIQREFHIFVKGDKSHRETEEIYRTIDRITPILKDAGYIPELCEK 478
           SGMSKDPGCS+IEIQREFHIFVK DKSHRETEEIYRTIDRITPILKDAGY PELCEK
Sbjct: 421 SGMSKDPGCSKIEIQREFHIFVKNDKSHRETEEIYRTIDRITPILKDAGYTPELCEK 477

BLAST of CsaV3_6G048410 vs. TrEMBL
Match: tr|A0A2P4LKA5|A0A2P4LKA5_QUESU (Pentatricopeptide repeat-containing protein OS=Quercus suber OX=58331 GN=CFP56_39785 PE=4 SV=1)

HSP 1 Score: 541.2 bits (1393), Expect = 2.4e-150
Identity = 269/444 (60.59%), Positives = 335/444 (75.45%), Query Frame = 0

Query: 31  TVHLPRRHKSEFASPSFQVKPHHKDTSSWDKTLRGLCLTGKLAEAVALLCCMALQFHSKT 90
           T  L RR +SE ++ SFQV+P +K     +KTL+GLC++G+L EA+ LLC   ++   +T
Sbjct: 89  TTTLLRRIQSESSNGSFQVEP-YKSYLQLNKTLKGLCISGRLREALGLLCTTGVRVEPRT 148

Query: 91  YCLLLQECIFRKEYMKGKRIHAQMVVVGYVPNEYLNTKLLILYAKSGDLETAYVLHEHLL 150
           Y LLLQECIFRKEY KG+RIHA MVVVGY PNEYL TKLLILY KSGDL TA+VL ++L 
Sbjct: 149 YALLLQECIFRKEYKKGRRIHALMVVVGYDPNEYLKTKLLILYVKSGDLGTAHVLFDNLR 208

Query: 151 EKSLVSWNSLIAGYVQKGLAEVGLEFYLKMRQSGLMPDQYTFASVLRACASLASLEHGKR 210
           EKSLVSWN++IAGYVQKGL EVGL  Y KMRQ+G +PDQYTFASV RACA+LA+LEHGK+
Sbjct: 209 EKSLVSWNAMIAGYVQKGLEEVGLNLYYKMRQTGFIPDQYTFASVFRACATLATLEHGKQ 268

Query: 211 AHGVLIKCQIGDNVVVSSALVDMYFKCSSLSDGHKAFNKSSNRNVITWTALISGYGQHGR 270
           AHGV+IKCQI +NVVV+SAL+DMYFKCSSLSDGH+ F+KS NRNV+ WTALISGYGQHG+
Sbjct: 269 AHGVMIKCQIRENVVVNSALMDMYFKCSSLSDGHRVFDKSLNRNVVMWTALISGYGQHGK 328

Query: 271 ISEVLEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSLMTKTYGIEPRGQHYA 330
           + EVLE                                     S M K YGI+PRGQHYA
Sbjct: 329 VVEVLESFHRMKTEGFKPNYVTFLSVLSACSHGGLLDEGWAYFSSMNKDYGIQPRGQHYA 388

Query: 331 AMADLLARAGRLQEAYDFVLDAPCKEHSVMWGALVGACKVHEDVDLMKHVAASYFELDPK 390
           A+ DLL RAGRLQEAY+FVL++PCKEHSV+WGAL+GAC++H D++L+K  A  +FEL+P+
Sbjct: 389 AIVDLLGRAGRLQEAYEFVLNSPCKEHSVIWGALLGACRLHGDMNLVKLAAKKFFELEPE 448

Query: 391 NSGKLVVFSNAFATSGLWDNVEEIRAMMKKSGMSKDPGCSRIEIQREFHIFVKGDKSHRE 450
           N GK VV SNA+ATSG W+N  E+RA+M++SG+ K+PG SRIE+Q     F KGDK   +
Sbjct: 449 NPGKYVVLSNAYATSGFWENAAEVRAVMRESGIIKEPGYSRIEVQNGSQFFFKGDK---Q 508

Query: 451 TEEIYRTIDRITPILKDAGYIPEL 475
           ++E+Y  I  +T IL+DAGY+P+L
Sbjct: 509 SKELYELIREMTCILRDAGYVPDL 528

BLAST of CsaV3_6G048410 vs. TrEMBL
Match: tr|A0A2I4EWW9|A0A2I4EWW9_9ROSI (pentatricopeptide repeat-containing protein At4g16470-like isoform X2 OS=Juglans regia OX=51240 GN=LOC108993425 PE=4 SV=1)

HSP 1 Score: 540.4 bits (1391), Expect = 4.1e-150
Identity = 268/443 (60.50%), Positives = 331/443 (74.72%), Query Frame = 0

Query: 31  TVHLPRRHKSEFASPSFQVKPHHKDTSSWDKTLRGLCLTGKLAEAVALLCCMALQFHSKT 90
           T  L RR +SE ++ SFQV+ H+K     DKTL+GLC++G+L EAV LL    +Q    T
Sbjct: 97  TARLLRRIQSESSNGSFQVE-HYKGYLQLDKTLKGLCVSGRLREAVGLLWITGVQVEPGT 156

Query: 91  YCLLLQECIFRKEYMKGKRIHAQMVVVGYVPNEYLNTKLLILYAKSGDLETAYVLHEHLL 150
           Y LLLQECIFRKEY  G+RIH QMVVVGY P+EYL TKLLILY KSG + TA++L + LL
Sbjct: 157 YSLLLQECIFRKEYSNGRRIHGQMVVVGYDPSEYLKTKLLILYVKSGHIGTAHILFDKLL 216

Query: 151 EKSLVSWNSLIAGYVQKGLAEVGLEFYLKMRQSGLMPDQYTFASVLRACASLASLEHGKR 210
             SLVSWN++IAGYVQ+GL EVGL+ Y KMR++G +PDQYTFASV RACA+LA+LEHGKR
Sbjct: 217 GTSLVSWNAMIAGYVQRGLEEVGLDLYYKMRRTGFVPDQYTFASVFRACATLATLEHGKR 276

Query: 211 AHGVLIKCQIGDNVVVSSALVDMYFKCSSLSDGHKAFNKSSNRNVITWTALISGYGQHGR 270
           AHGV+IKCQ+ +NVV +SAL+DMYFKCSSLSDGH+ F KSSNRNVITWTALISGYGQHG+
Sbjct: 277 AHGVMIKCQVRENVVANSALMDMYFKCSSLSDGHQVFEKSSNRNVITWTALISGYGQHGK 336

Query: 271 ISEVLEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSLMTKTYGIEPRGQHYA 330
           + EVLE                                     S M + YGIEPRG+HYA
Sbjct: 337 VVEVLEFFHRMKNEGFKPNYVTFLAVISACSHGGLVDQGWAYFSSMIRDYGIEPRGKHYA 396

Query: 331 AMADLLARAGRLQEAYDFVLDAPCKEHSVMWGALVGACKVHEDVDLMKHVAASYFELDPK 390
           +M DLL RAGRL EAY+FVL++PCKEHSV+WGAL+GAC++H D++L+K  A  +FEL+P+
Sbjct: 397 SMIDLLGRAGRLHEAYEFVLNSPCKEHSVIWGALLGACRIHGDMNLVKLAAKKFFELEPE 456

Query: 391 NSGKLVVFSNAFATSGLWDNVEEIRAMMKKSGMSKDPGCSRIEIQREFHIFVKGDKSHRE 450
           N GK VV SNA+ATSGLWDNV E+RA+M++SG+ K+PG S IEI+   H F KGDKSH++
Sbjct: 457 NPGKYVVLSNAYATSGLWDNVAEVRAVMRESGVIKEPGHSWIEIRNGSHFFFKGDKSHKQ 516

Query: 451 TEEIYRTIDRITPILKDAGYIPE 474
           + EIY  I  +T ILKD GY+P+
Sbjct: 517 SSEIYEMIRDMTCILKDTGYVPD 538

BLAST of CsaV3_6G048410 vs. TrEMBL
Match: tr|A0A2K1X8Y9|A0A2K1X8Y9_POPTR (Uncharacterized protein OS=Populus trichocarpa OX=3694 GN=POPTR_016G010100v3 PE=4 SV=1)

HSP 1 Score: 534.6 bits (1376), Expect = 2.2e-148
Identity = 284/419 (67.78%), Positives = 348/419 (83.05%), Query Frame = 0

Query: 58  SWDKTLRGLCLTGKLAEAVALLCCMALQFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVV 117
           +W K L+GLC+TG++ EAV LL    L+    TY LLLQECIF+K Y KGKRIHAQMVVV
Sbjct: 119 NWKKILKGLCITGRMNEAVGLLWRSGLEVDHGTYALLLQECIFKKLYNKGKRIHAQMVVV 178

Query: 118 GYVPNEYLNTKLLILYAKSGDLETAYVLHEHLLEKSLVSWNSLIAGYVQKGLAEVGLEFY 177
           GYVPNEYL TKL+ILYAKSGDL+T ++L + L+EKSL+SWN+LIAGYVQKGL E+GL FY
Sbjct: 179 GYVPNEYLKTKLMILYAKSGDLKTMHLLFDMLMEKSLISWNALIAGYVQKGLEEMGLSFY 238

Query: 178 LKMRQSGLMPDQYTFASVLRACASLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKC 237
            +MRQ+GL PDQYTFASV RACA+LA+LEHGKRAH V++KC + +NVVVSSAL+DMYFKC
Sbjct: 239 YEMRQNGLTPDQYTFASVFRACATLATLEHGKRAHCVMMKCFLKENVVVSSALMDMYFKC 298

Query: 238 SSLSDGHKAFNKSSNRNVITWTALISGYGQHGRISEVLEXXXXXXXXXXXXXXXXXXXXX 297
           SSLSDGH  F+KSSNRNV+TWT+LISGYG HGR+SEV+E      XXXXXXXXXXXXXXX
Sbjct: 299 SSLSDGHLVFDKSSNRNVVTWTSLISGYGHHGRVSEVIESFHRMKXXXXXXXXXXXXXXX 358

Query: 298 XXXXXXXXXXXXXXXXSLMTKTYGIEPRGQHYAAMADLLARAGRLQEAYDFVLDAPCKEH 357
           XXXXXXXXXXX     S M + YGI+PRG+HYAAM DLL RAGRL+EAY+FV++APCKEH
Sbjct: 359 XXXXXXXXXXXGWAYFSSMRRDYGIQPRGKHYAAMVDLLGRAGRLKEAYEFVVNAPCKEH 418

Query: 358 SVMWGALVGACKVHEDVDLMKHVAASYFELDPKNSGKLVVFSNAFATSGLWDNVEEIRAM 417
           SV+WGAL+GACK+H D+DL++  A  YFELDP+N+GK VV SNA+A  GLWD+V E+R +
Sbjct: 419 SVLWGALLGACKIHGDMDLIELAARKYFELDPENAGKYVVLSNAYAAFGLWDSVAEVRGV 478

Query: 418 MKKSGMSKDPGCSRIEIQREFHIFVKGDKSHRETEEIYRTIDRITPILKDAGYIPELCE 477
           M+ + ++K+P  S IE+Q + H F++GDKSHRE+EEIY+TI  +  IL DAGY+P+L +
Sbjct: 479 MRDTEINKEPAYSSIEVQGKAHFFLQGDKSHRESEEIYKTIIEMIWILNDAGYVPDLSD 537

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_004134800.1  7.6e-257  100.00    PREDICTED: pentatricopeptide repeat-containing protein At4g16470 [Cucumis sativu...
XP_008440079.1  1.6e-235  86.79     PREDICTED: pentatricopeptide repeat-containing protein At4g16470 [Cucumis melo]
XP_023003064.1  2.2e-192  78.37     pentatricopeptide repeat-containing protein At4g16470 [Cucurbita maxima]
XP_022926503.1  2.5e-191  78.94     pentatricopeptide repeat-containing protein At4g16470 [Cucurbita moschata]
XP_023518629.1  9.4e-191  78.59     pentatricopeptide repeat-containing protein At4g16470 [Cucurbita pepo subsp. pep...
Match Name   E-value   Identity  Description
AT4G16470.1  1.2e-126  52.05     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G11290.1  1.0e-66   33.25     Pentatricopeptide repeat (PPR) superfamily protein
AT2G03880.1  5.2e-66   32.81     Pentatricopeptide repeat (PPR) superfamily protein
AT3G13770.1  6.8e-66   31.40     Pentatricopeptide repeat (PPR) superfamily protein
AT3G23330.1  1.3e-64   31.16     Tetratricopeptide repeat (TPR)-like superfamily protein
Match Name             E-value   Identity  Description
sp|O23491|PP315_ARATH  2.1e-125  52.05     Pentatricopeptide repeat-containing protein At4g16470 OS=Arabidopsis thaliana OX...
sp|Q3E6Q1|PPR32_ARATH  1.9e-65   33.25     Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...
sp|Q9SI53|PP147_ARATH  9.3e-65   32.81     Pentatricopeptide repeat-containing protein At2g03880, mitochondrial OS=Arabidop...
sp|Q9LIC3|PP227_ARATH  1.2e-64   31.40     Putative pentatricopeptide repeat-containing protein At3g13770, mitochondrial OS...
sp|Q9LW63|PP251_ARATH  2.3e-63   31.16     Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis th...
Match Name                      E-value   Identity  Description
tr|A0A0A0KJS7|A0A0A0KJS7_CUCSA  5.0e-257  100.00    Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G511640 PE=4 SV=1
tr|A0A1S3AZV2|A0A1S3AZV2_CUCME  1.1e-235  86.79     pentatricopeptide repeat-containing protein At4g16470 OS=Cucumis melo OX=3656 GN...
tr|A0A2P4LKA5|A0A2P4LKA5_QUESU  2.4e-150  60.59     Pentatricopeptide repeat-containing protein OS=Quercus suber OX=58331 GN=CFP56_3...
tr|A0A2I4EWW9|A0A2I4EWW9_9ROSI  4.1e-150  60.50     pentatricopeptide repeat-containing protein At4g16470-like isoform X2 OS=Juglans...
tr|A0A2K1X8Y9|A0A2K1X8Y9_POPTR  2.2e-148  67.78     Uncharacterized protein OS=Populus trichocarpa OX=3694 GN=POPTR_016G010100v3 PE=...
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
Vocabulary: INTERPRO
Term        Definition
IPR002885   Pentatricopeptide_repeat
IPR011990   TPR-like_helical_dom_sf
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0006206 pyrimidine nucleobase metabolic process
biological_process GO:0006230 TMP biosynthetic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0005524 ATP binding
molecular_function GO:0004797 thymidine kinase activity
molecular_function GO:0003674 molecular_function
molecular_function GO:0008270 zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CsaV3_6G048410.1  CsaV3_6G048410.1  mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR Term   IPR Description                                    Source   Source Term       Source Description   Alignment
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   G3DSA:1.25.40.10                       coord: 83..206; e-value: 1.5E-19; score: 72.0
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   G3DSA:1.25.40.10                       coord: 207..463; e-value: 5.9E-31; score: 110.0
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 256..289; e-value: 3.0E-7; score: 28.2
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 155..188; e-value: 1.3E-5; score: 23.1
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 291..324; e-value: 0.0015; score: 16.6
IPR002885  Pentatricopeptide repeat                           PFAM     PF13041           PPR_2                coord: 253..301; e-value: 7.2E-10; score: 38.8
IPR002885  Pentatricopeptide repeat                           PFAM     PF13041           PPR_2                coord: 153..200; e-value: 2.0E-10; score: 40.6
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 289..324; score: 8.133
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 55..85; score: 6.741
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 122..152; score: 6.369
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 188..222; score: 5.492
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 391..425; score: 6.38
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 153..187; score: 11.312
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 223..253; score: 5.24
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 87..121; score: 6.818
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 254..288; score: 12.638
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 325..359; score: 5.634
None       No IPR available                                   PANTHER  PTHR24015         FAMILY NOT NAMED     coord: 50..456
None       No IPR available                                   PANTHER  PTHR24015:SF721   SUBFAMILY NOT NAMED  coord: 50..456

The following gene(s) are paralogous to this gene:
Gene            Paralogue       Organism                    Block
CsaV3_6G048410  CsaV3_1G015380  Cucumber (Chinese Long) v3  cuccucB053