Lag0006021 (gene) Sponge gourd (AG‐4) v1

Overview
NameLag0006021
Typegene
OrganismLuffa acutangula (Sponge gourd (AG‐4) v1)
DescriptionPentatricopeptide repeat-containing protein
Locationchr6: 36009440 .. 36011713 (-)
RNA-Seq ExpressionLag0006021
SyntenyLag0006021
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAGATGGATGAATCTGGGCAGTGGATACTTTGCTTCTACTGCTTTTCTGAAACTTTCCCATTATGTTTCTCAAGTTACAATGGCCCAAAGAATCATGTCATTCAACTTGTTTGAGCATCAGCTGTTCAACTCATGTCGCTACCACTCTTCAAATGATGCTTTGGCCAATACCCTTCATGCCAAGATGGTAAAATGTGGTTCTATTTTGGGTTCAGGGAAGTTTGTTTTGAGTTCCTATGTGAAATCTGAGAAATTAAACGATGCAAAGAAAGTGTTTGATGAAATGCCCAGCAGAGATGTACTCACATGGACAGTACTTATATCGGGTTTTGCTAGAGTAAATTGTTCTGAAATGGCATTGCAACTGTTTAGAGAAATGCTGGTTGAAGATATTTGTCCAAATCATTTTACTTTGTCTTGTGTATTTAAGCTTTGCTCTAGAGTAGGTAATGTGCAAATGGGTAAGGGAATTCATGGATGGATACTAAGAAGTGGGGTTAACTTAGATGTTGTCTTGGAGAATTCTATGCTTGACTTGTATGCAAAGTTTGATGCATTTGATTATGCTAAAAAGTTGTTTGATTCAATGAGAGAAAAGAGTACTGCTACTTACAACATAATGCTTGGTGTGTATGTCCGTAGTTGTGATGTTAACAAATCTCTTGATTTATTCAGAAACTTGCCTTGCAGAGATACGGCGAGTTGGAACACGATTATATGTGGGCTAATGCAAGGTGGGTATCTGGATACAGCATTGGAGCTACTCTATGAGATGGTGGAGAATGAACCTGAGTTTAACAAAGTTACTTCTTCCATAGCTTTGAGTGTGGTTTCTTCTTTACTGATTATTGAGCTGGGTAGACAAGTACATGGCCGAATTGTCAGGTTTGGTTTTCATAATGATGGATTTGTAAAGAGTTCACTGATAAATATGTACATTAAATGTGGAAATTTGGAAAAAGCGTCGGTGATATATAGTCAAATGCCTTCTGATTTTGCGAGGAAACAAGATTCCAACATTGTATGTAGCGACACAATGACAGAAATTGTTTCACGGAGCTCCATGGTGTCTGGATATGTTCGAAATGGCAAGTATGAAGATGCCTTCAAAACTTTTGTTTCTATGGTTCGTGAAGGGGTTCTGATGGACAAATTTACCATTGCAAGTGTTGTATCCGCTTGTTCTAATGCTGGCTTTTTAGAGCTTGGACGTCAAATCCATGCATATATTGTGAAAACTGGGGAACAGCTTGATGCTCACTTGGCTTCCTCCTTGATTGACATGTATGCTAAAGGTGGGAGTTTGGATTGTGCCCGTCGAATTTTTGAGCAAACGACTTACTTAAATGTCGTGATATGGACTACCATGATCGCAGGATTTGCTTTGCATGGGCAAGGTAAGGAAGCCATTAGACTGTTTGAACAGATGAGATATGAGGGAATCATACCAAATGATGTTACTTTTATAGGAGTTTTAACAGCTTGCAGTCATGCAGGGCTGCTTGAAGAAGGCCGTCTATATTTTAACATGATGAAAGATGTTTATGCTATTGAACCTAAAGTCGAGCATTTCACTTGTATGGTAGATCTTTATGGTAGAGCTGGACGCTTGAATGAAGTCAAAGAGTTCATCTATGAGAATGATTTGTCACACCTTAGTGCAGTTTGGAAGGCATTCCTATCAGCCTGTCGGCTTTACAAGGACATCGAAATGGGAAATTGGGTTTCCGAAAAATTGTTTAGCCTTGAACCACAAGACGAAGGGCCTTATGTTTTACTCTCAAACATGTGCTCCAGCAATCAGAAGTGGGAAGAAGCTTCCAGAACAAGAAAATATATGCAACATAGAGGGATTAGCAAAACACCTGGTCAATCTTGGATTCATGTGAAAAATCAAGTACACTCTTTTGTTGCGGGAGACAGATCACACCCTCAACACGCTCAGATATATGCATATCTGGACAAGCTAATTGGAAGATTGAAGGAAATTGGATACCTGTCTGATGTAAAATTGGTGATGCAGGATGTAGAAGAAGAACAGGGTGAAGTGCTTCTTGGTTGGCATAGTGAAAAACTTGCAGTTGCTTATGGAATTATCAGCATGTCTTCTGGCATTCCGATCCGAATCATGAAGAACCTTCGGGTATGTACCGATTGTCATAACTTTATGAAGCTAACATCTCAACTTTTAGGCAGGGAGATCATTGTTCGAGATATTCATCGTTTCCATCATTTTAACTCCGGTTGTTGCTCTTGTGGTGATTATTGGTGA

mRNA sequence

ATGAGATGGATGAATCTGGGCAGTGGATACTTTGCTTCTACTGCTTTTCTGAAACTTTCCCATTATGTTTCTCAAGTTACAATGGCCCAAAGAATCATGTCATTCAACTTGTTTGAGCATCAGCTGTTCAACTCATGTCGCTACCACTCTTCAAATGATGCTTTGGCCAATACCCTTCATGCCAAGATGGTAAAATGTGGTTCTATTTTGGGTTCAGGGAAGTTTGTTTTGAGTTCCTATGTGAAATCTGAGAAATTAAACGATGCAAAGAAAGTGTTTGATGAAATGCCCAGCAGAGATGTACTCACATGGACAGTACTTATATCGGGTTTTGCTAGAGTAAATTGTTCTGAAATGGCATTGCAACTGTTTAGAGAAATGCTGGTTGAAGATATTTGTCCAAATCATTTTACTTTGTCTTGTGTATTTAAGCTTTGCTCTAGAGTAGGTAATGTGCAAATGGGTAAGGGAATTCATGGATGGATACTAAGAAGTGGGGTTAACTTAGATGTTGTCTTGGAGAATTCTATGCTTGACTTGTATGCAAAGTTTGATGCATTTGATTATGCTAAAAAGTTGTTTGATTCAATGAGAGAAAAGAGTACTGCTACTTACAACATAATGCTTGGTGTGTATGTCCGTAGTTGTGATGTTAACAAATCTCTTGATTTATTCAGAAACTTGCCTTGCAGAGATACGGCGAGTTGGAACACGATTATATGTGGGCTAATGCAAGGTGGGTATCTGGATACAGCATTGGAGCTACTCTATGAGATGGTGGAGAATGAACCTGAGTTTAACAAAGTTACTTCTTCCATAGCTTTGAGTGTGGTTTCTTCTTTACTGATTATTGAGCTGGGTAGACAAGTACATGGCCGAATTGTCAGGTTTGGTTTTCATAATGATGGATTTGTAAAGAGTTCACTGATAAATATGTACATTAAATGTGGAAATTTGGAAAAAGCGTCGGTGATATATAGTCAAATGCCTTCTGATTTTGCGAGGAAACAAGATTCCAACATTGTATGTAGCGACACAATGACAGAAATTGTTTCACGGAGCTCCATGGTGTCTGGATATGTTCGAAATGGCAAGTATGAAGATGCCTTCAAAACTTTTGTTTCTATGGTTCGTGAAGGGGTTCTGATGGACAAATTTACCATTGCAAGTGTTGTATCCGCTTGTTCTAATGCTGGCTTTTTAGAGCTTGGACGTCAAATCCATGCATATATTGTGAAAACTGGGGAACAGCTTGATGCTCACTTGGCTTCCTCCTTGATTGACATGTATGCTAAAGGTGGGAGTTTGGATTGTGCCCGTCGAATTTTTGAGCAAACGACTTACTTAAATGTCGTGATATGGACTACCATGATCGCAGGATTTGCTTTGCATGGGCAAGGTAAGGAAGCCATTAGACTGTTTGAACAGATGAGATATGAGGGAATCATACCAAATGATGTTACTTTTATAGGAGTTTTAACAGCTTGCAGTCATGCAGGGCTGCTTGAAGAAGGCCGTCTATATTTTAACATGATGAAAGATGTTTATGCTATTGAACCTAAAGTCGAGCATTTCACTTGTATGGTAGATCTTTATGGTAGAGCTGGACGCTTGAATGAAGTCAAAGAGTTCATCTATGAGAATGATTTGTCACACCTTAGTGCAGTTTGGAAGGCATTCCTATCAGCCTGTCGGCTTTACAAGGACATCGAAATGGGAAATTGGGTTTCCGAAAAATTGTTTAGCCTTGAACCACAAGACGAAGGGCCTTATGTTTTACTCTCAAACATGTGCTCCAGCAATCAGAAGTGGGAAGAAGCTTCCAGAACAAGAAAATATATGCAACATAGAGGGATTAGCAAAACACCTGGTCAATCTTGGATTCATGTGAAAAATCAAGTACACTCTTTTGTTGCGGGAGACAGATCACACCCTCAACACGCTCAGATATATGCATATCTGGACAAGCTAATTGGAAGATTGAAGGAAATTGGATACCTGTCTGATGTAAAATTGGTGATGCAGGATGTAGAAGAAGAACAGGGTGAAGTGCTTCTTGGTTGGCATAGTGAAAAACTTGCAGTTGCTTATGGAATTATCAGCATGTCTTCTGGCATTCCGATCCGAATCATGAAGAACCTTCGGGTATGTACCGATTGTCATAACTTTATGAAGCTAACATCTCAACTTTTAGGCAGGGAGATCATTGTTCGAGATATTCATCGTTTCCATCATTTTAACTCCGGTTGTTGCTCTTGTGGTGATTATTGGTGA

Coding sequence (CDS)

ATGAGATGGATGAATCTGGGCAGTGGATACTTTGCTTCTACTGCTTTTCTGAAACTTTCCCATTATGTTTCTCAAGTTACAATGGCCCAAAGAATCATGTCATTCAACTTGTTTGAGCATCAGCTGTTCAACTCATGTCGCTACCACTCTTCAAATGATGCTTTGGCCAATACCCTTCATGCCAAGATGGTAAAATGTGGTTCTATTTTGGGTTCAGGGAAGTTTGTTTTGAGTTCCTATGTGAAATCTGAGAAATTAAACGATGCAAAGAAAGTGTTTGATGAAATGCCCAGCAGAGATGTACTCACATGGACAGTACTTATATCGGGTTTTGCTAGAGTAAATTGTTCTGAAATGGCATTGCAACTGTTTAGAGAAATGCTGGTTGAAGATATTTGTCCAAATCATTTTACTTTGTCTTGTGTATTTAAGCTTTGCTCTAGAGTAGGTAATGTGCAAATGGGTAAGGGAATTCATGGATGGATACTAAGAAGTGGGGTTAACTTAGATGTTGTCTTGGAGAATTCTATGCTTGACTTGTATGCAAAGTTTGATGCATTTGATTATGCTAAAAAGTTGTTTGATTCAATGAGAGAAAAGAGTACTGCTACTTACAACATAATGCTTGGTGTGTATGTCCGTAGTTGTGATGTTAACAAATCTCTTGATTTATTCAGAAACTTGCCTTGCAGAGATACGGCGAGTTGGAACACGATTATATGTGGGCTAATGCAAGGTGGGTATCTGGATACAGCATTGGAGCTACTCTATGAGATGGTGGAGAATGAACCTGAGTTTAACAAAGTTACTTCTTCCATAGCTTTGAGTGTGGTTTCTTCTTTACTGATTATTGAGCTGGGTAGACAAGTACATGGCCGAATTGTCAGGTTTGGTTTTCATAATGATGGATTTGTAAAGAGTTCACTGATAAATATGTACATTAAATGTGGAAATTTGGAAAAAGCGTCGGTGATATATAGTCAAATGCCTTCTGATTTTGCGAGGAAACAAGATTCCAACATTGTATGTAGCGACACAATGACAGAAATTGTTTCACGGAGCTCCATGGTGTCTGGATATGTTCGAAATGGCAAGTATGAAGATGCCTTCAAAACTTTTGTTTCTATGGTTCGTGAAGGGGTTCTGATGGACAAATTTACCATTGCAAGTGTTGTATCCGCTTGTTCTAATGCTGGCTTTTTAGAGCTTGGACGTCAAATCCATGCATATATTGTGAAAACTGGGGAACAGCTTGATGCTCACTTGGCTTCCTCCTTGATTGACATGTATGCTAAAGGTGGGAGTTTGGATTGTGCCCGTCGAATTTTTGAGCAAACGACTTACTTAAATGTCGTGATATGGACTACCATGATCGCAGGATTTGCTTTGCATGGGCAAGGTAAGGAAGCCATTAGACTGTTTGAACAGATGAGATATGAGGGAATCATACCAAATGATGTTACTTTTATAGGAGTTTTAACAGCTTGCAGTCATGCAGGGCTGCTTGAAGAAGGCCGTCTATATTTTAACATGATGAAAGATGTTTATGCTATTGAACCTAAAGTCGAGCATTTCACTTGTATGGTAGATCTTTATGGTAGAGCTGGACGCTTGAATGAAGTCAAAGAGTTCATCTATGAGAATGATTTGTCACACCTTAGTGCAGTTTGGAAGGCATTCCTATCAGCCTGTCGGCTTTACAAGGACATCGAAATGGGAAATTGGGTTTCCGAAAAATTGTTTAGCCTTGAACCACAAGACGAAGGGCCTTATGTTTTACTCTCAAACATGTGCTCCAGCAATCAGAAGTGGGAAGAAGCTTCCAGAACAAGAAAATATATGCAACATAGAGGGATTAGCAAAACACCTGGTCAATCTTGGATTCATGTGAAAAATCAAGTACACTCTTTTGTTGCGGGAGACAGATCACACCCTCAACACGCTCAGATATATGCATATCTGGACAAGCTAATTGGAAGATTGAAGGAAATTGGATACCTGTCTGATGTAAAATTGGTGATGCAGGATGTAGAAGAAGAACAGGGTGAAGTGCTTCTTGGTTGGCATAGTGAAAAACTTGCAGTTGCTTATGGAATTATCAGCATGTCTTCTGGCATTCCGATCCGAATCATGAAGAACCTTCGGGTATGTACCGATTGTCATAACTTTATGAAGCTAACATCTCAACTTTTAGGCAGGGAGATCATTGTTCGAGATATTCATCGTTTCCATCATTTTAACTCCGGTTGTTGCTCTTGTGGTGATTATTGGTGA

Protein sequence

MRWMNLGSGYFASTAFLKLSHYVSQVTMAQRIMSFNLFEHQLFNSCRYHSSNDALANTLHAKMVKCGSILGSGKFVLSSYVKSEKLNDAKKVFDEMPSRDVLTWTVLISGFARVNCSEMALQLFREMLVEDICPNHFTLSCVFKLCSRVGNVQMGKGIHGWILRSGVNLDVVLENSMLDLYAKFDAFDYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLDLFRNLPCRDTASWNTIICGLMQGGYLDTALELLYEMVENEPEFNKVTSSIALSVVSSLLIIELGRQVHGRIVRFGFHNDGFVKSSLINMYIKCGNLEKASVIYSQMPSDFARKQDSNIVCSDTMTEIVSRSSMVSGYVRNGKYEDAFKTFVSMVREGVLMDKFTIASVVSACSNAGFLELGRQIHAYIVKTGEQLDAHLASSLIDMYAKGGSLDCARRIFEQTTYLNVVIWTTMIAGFALHGQGKEAIRLFEQMRYEGIIPNDVTFIGVLTACSHAGLLEEGRLYFNMMKDVYAIEPKVEHFTCMVDLYGRAGRLNEVKEFIYENDLSHLSAVWKAFLSACRLYKDIEMGNWVSEKLFSLEPQDEGPYVLLSNMCSSNQKWEEASRTRKYMQHRGISKTPGQSWIHVKNQVHSFVAGDRSHPQHAQIYAYLDKLIGRLKEIGYLSDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISMSSGIPIRIMKNLRVCTDCHNFMKLTSQLLGREIIVRDIHRFHHFNSGCCSCGDYW
Homology
BLAST of Lag0006021 vs. NCBI nr
Match: KAG7020981.1 (putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1380.2 bits (3571), Expect = 0.0e+00
Identity = 671/757 (88.64%), Postives = 717/757 (94.72%), Query Frame = 0

Query: 1   MRWMNLGSGYFASTAFLKLSHYVSQVTMAQRIMSFNLFEHQLFNSCRYHSSNDALANTLH 60
           MRWMN  SG FASTAFLKL+H VSQV+MAQ+I+ FNL EHQLF SCRYHSSND  +NTLH
Sbjct: 1   MRWMNPCSGGFASTAFLKLTHSVSQVSMAQKIIPFNLSEHQLFKSCRYHSSNDDSSNTLH 60

Query: 61  AKMVKCGSILGSGKFVLSSYVKSEKLNDAKKVFDEMPSRDVLTWTVLISGFARVNCSEMA 120
           AKMVK GSIL  GK V+SSYVKSEKL+DA+KVFDEMP RDVL+WTVLISGFARVNCSE A
Sbjct: 61  AKMVKNGSILYLGKLVMSSYVKSEKLDDAQKVFDEMPHRDVLSWTVLISGFARVNCSERA 120

Query: 121 LQLFREMLVEDICPNHFTLSCVFKLCSRVGNVQMGKGIHGWILRSGVNLDVVLENSMLDL 180
           LQLFREMLVE +CPNHFTLSCV KLCSRVG++QMGKGIHGWILRSGVNLDVVLENSMLDL
Sbjct: 121 LQLFREMLVEGVCPNHFTLSCVLKLCSRVGDLQMGKGIHGWILRSGVNLDVVLENSMLDL 180

Query: 181 YAKFDAFDYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLDLFRNLPCRDTASWNTII 240
           Y KFDAFDYA KLFDSMREKSTA+YNIMLGVYVRSCDVNKSLDLFRNLPCRDTASWNTII
Sbjct: 181 YTKFDAFDYATKLFDSMREKSTASYNIMLGVYVRSCDVNKSLDLFRNLPCRDTASWNTII 240

Query: 241 CGLMQGGYLDTALELLYEMVENEPEFNKVTSSIALSVVSSLLIIELGRQVHGRIVRFGFH 300
           CGLMQGGYL+ A+ELLYEMV+NEPEFN+VTSSIALSVVSSLLIIELGRQVHGRI RFG H
Sbjct: 241 CGLMQGGYLNIAMELLYEMVKNEPEFNEVTSSIALSVVSSLLIIELGRQVHGRIFRFGLH 300

Query: 301 NDGFVKSSLINMYIKCGNLEKASVIYSQMPSDFARKQDSNIVCSDTMTEIVSRSSMVSGY 360
           NDGFV SSLINMYIKCGNLEKASVIYSQMPS+F +++DSNIVCS+TMTEIVSRSS+VSGY
Sbjct: 301 NDGFVNSSLINMYIKCGNLEKASVIYSQMPSNFGKRRDSNIVCSNTMTEIVSRSSIVSGY 360

Query: 361 VRNGKYEDAFKTFVSMVREGVLMDKFTIASVVSACSNAGFLELGRQIHAYIVKTGEQLDA 420
           V+NGKYED+F+TFVSMVRE  +MD+FTIAS++SACSNAG LELGRQIHAYI KTGEQLDA
Sbjct: 361 VQNGKYEDSFQTFVSMVRERAVMDRFTIASIISACSNAGVLELGRQIHAYIQKTGEQLDA 420

Query: 421 HLASSLIDMYAKGGSLDCARRIFEQTTYLNVVIWTTMIAGFALHGQGKEAIRLFEQMRYE 480
           HLASS+IDMYAKGGSLDCA ++FEQTTYLNVV WT+MI G ALHGQGKEAIRLFEQMRYE
Sbjct: 421 HLASSMIDMYAKGGSLDCAHQVFEQTTYLNVVTWTSMITGCALHGQGKEAIRLFEQMRYE 480

Query: 481 GIIPNDVTFIGVLTACSHAGLLEEGRLYFNMMKDVYAIEPKVEHFTCMVDLYGRAGRLNE 540
           GIIPN+VTFIGVLTACSHAGLL+EGRLYFNMMKDVYAIEPKVEHFTCMVD+YGRAGRLNE
Sbjct: 481 GIIPNEVTFIGVLTACSHAGLLDEGRLYFNMMKDVYAIEPKVEHFTCMVDVYGRAGRLNE 540

Query: 541 VKEFIYENDLSHLSAVWKAFLSACRLYKDIEMGNWVSEKLFSLEPQDEGPYVLLSNMCSS 600
           VKEFIY+NDLSH SAVWKAFLS+CRLYKDIEMGNWVSEKLF LEP+DEGPYVLLSNMCSS
Sbjct: 541 VKEFIYQNDLSHHSAVWKAFLSSCRLYKDIEMGNWVSEKLFKLEPRDEGPYVLLSNMCSS 600

Query: 601 NQKWEEASRTRKYMQHRGISKTPGQSWIHVKNQVHSFVAGDRSHPQHAQIYAYLDKLIGR 660
           NQKWEEAS+TR+ MQHRGISKTPGQSWIHVKNQVHSF+AGDRSH QHAQIYAYLDKLIGR
Sbjct: 601 NQKWEEASKTRRSMQHRGISKTPGQSWIHVKNQVHSFIAGDRSHLQHAQIYAYLDKLIGR 660

Query: 661 LKEIGYLSDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISMSSGIPIRIMKNLRVCTDC 720
           LKEIGY  DVKLVMQDVEEEQGEVLLGWHSEKLAVAYGII+++SGIPIRIMKNLRVCTDC
Sbjct: 661 LKEIGYSCDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIINLASGIPIRIMKNLRVCTDC 720

Query: 721 HNFMKLTSQLLGREIIVRDIHRFHHFNSGCCSCGDYW 758
           HNFMKLTSQLL REIIVRDIHRFHHFNSG CSCGDYW
Sbjct: 721 HNFMKLTSQLLDREIIVRDIHRFHHFNSGHCSCGDYW 757

BLAST of Lag0006021 vs. NCBI nr
Match: KAG6586149.1 (putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1375.9 bits (3560), Expect = 0.0e+00
Identity = 670/757 (88.51%), Postives = 715/757 (94.45%), Query Frame = 0

Query: 1   MRWMNLGSGYFASTAFLKLSHYVSQVTMAQRIMSFNLFEHQLFNSCRYHSSNDALANTLH 60
           MRWMN  SG FASTAFLKL+H VSQV MAQ+I+ FNL EHQLF SCRYHSSND  +NTLH
Sbjct: 1   MRWMNPCSGGFASTAFLKLTHSVSQVFMAQKIIPFNLSEHQLFKSCRYHSSNDDSSNTLH 60

Query: 61  AKMVKCGSILGSGKFVLSSYVKSEKLNDAKKVFDEMPSRDVLTWTVLISGFARVNCSEMA 120
           AKMVK GSIL  GK V+SSYVKSEKL+DA+KVFDEMP RDVL+WTVLISGFARVNCSE A
Sbjct: 61  AKMVKNGSILYLGKLVMSSYVKSEKLDDAQKVFDEMPHRDVLSWTVLISGFARVNCSERA 120

Query: 121 LQLFREMLVEDICPNHFTLSCVFKLCSRVGNVQMGKGIHGWILRSGVNLDVVLENSMLDL 180
           LQLFREMLVE +CPNHFTLSCV KLCSRVG++QMGKGIHGWILRSGVNLDVVLENSMLDL
Sbjct: 121 LQLFREMLVEGVCPNHFTLSCVLKLCSRVGDLQMGKGIHGWILRSGVNLDVVLENSMLDL 180

Query: 181 YAKFDAFDYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLDLFRNLPCRDTASWNTII 240
           Y KFDAFDYA KLFDSMREKSTA+YNIMLGVYVRSCDVNKSLDLFRNLPCRDTASWNTII
Sbjct: 181 YTKFDAFDYATKLFDSMREKSTASYNIMLGVYVRSCDVNKSLDLFRNLPCRDTASWNTII 240

Query: 241 CGLMQGGYLDTALELLYEMVENEPEFNKVTSSIALSVVSSLLIIELGRQVHGRIVRFGFH 300
           CGLMQGGYL+ A+ELLYEMV+NEPEFN+VTSSIALSVVSSLLIIELGRQVHGRI RFG H
Sbjct: 241 CGLMQGGYLNIAMELLYEMVKNEPEFNEVTSSIALSVVSSLLIIELGRQVHGRIFRFGLH 300

Query: 301 NDGFVKSSLINMYIKCGNLEKASVIYSQMPSDFARKQDSNIVCSDTMTEIVSRSSMVSGY 360
           NDGFV SSLINMYIKCGNLEKASVIYSQMPS+F +++DSNIVCS+TMTEIVSRSS+VSGY
Sbjct: 301 NDGFVNSSLINMYIKCGNLEKASVIYSQMPSNFGKRRDSNIVCSNTMTEIVSRSSIVSGY 360

Query: 361 VRNGKYEDAFKTFVSMVREGVLMDKFTIASVVSACSNAGFLELGRQIHAYIVKTGEQLDA 420
           V+NGKYED+F+TFVSMVRE  +MD+FTIAS++SACSNAG LELGRQIHAYI KTGEQLDA
Sbjct: 361 VQNGKYEDSFQTFVSMVRERAVMDRFTIASIISACSNAGVLELGRQIHAYIQKTGEQLDA 420

Query: 421 HLASSLIDMYAKGGSLDCARRIFEQTTYLNVVIWTTMIAGFALHGQGKEAIRLFEQMRYE 480
           HLASS+IDMYAKGGSLDCA ++FEQTTYLNVV WT+MI G ALHGQGKEAIRLFEQMRYE
Sbjct: 421 HLASSMIDMYAKGGSLDCAHQVFEQTTYLNVVTWTSMITGCALHGQGKEAIRLFEQMRYE 480

Query: 481 GIIPNDVTFIGVLTACSHAGLLEEGRLYFNMMKDVYAIEPKVEHFTCMVDLYGRAGRLNE 540
           GIIPN+VTFIGVLTACSHAGLL+EGRLYFNMMKDVYAIEPKVEHFTCMVD+YGRAG LNE
Sbjct: 481 GIIPNEVTFIGVLTACSHAGLLDEGRLYFNMMKDVYAIEPKVEHFTCMVDVYGRAGCLNE 540

Query: 541 VKEFIYENDLSHLSAVWKAFLSACRLYKDIEMGNWVSEKLFSLEPQDEGPYVLLSNMCSS 600
           VKEFIY+NDLSH SAVWKAFLS+CRLYKDIEMGNWVSEKLF LEP+DEGPYVLLSNMCSS
Sbjct: 541 VKEFIYQNDLSHHSAVWKAFLSSCRLYKDIEMGNWVSEKLFKLEPRDEGPYVLLSNMCSS 600

Query: 601 NQKWEEASRTRKYMQHRGISKTPGQSWIHVKNQVHSFVAGDRSHPQHAQIYAYLDKLIGR 660
           NQKWEEAS+TR+ MQHRGISKTPGQSWIHVKNQVHSF+AGDRSH QHAQIYAYLDKLIGR
Sbjct: 601 NQKWEEASKTRRSMQHRGISKTPGQSWIHVKNQVHSFIAGDRSHLQHAQIYAYLDKLIGR 660

Query: 661 LKEIGYLSDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISMSSGIPIRIMKNLRVCTDC 720
           LKEIGY  DVKLVMQDVEEEQGEVLLGWHSEKLAVAYGII+++SGIPIRIMKNLRVCTDC
Sbjct: 661 LKEIGYSCDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIINLASGIPIRIMKNLRVCTDC 720

Query: 721 HNFMKLTSQLLGREIIVRDIHRFHHFNSGCCSCGDYW 758
           HNFMKLTSQLL REIIVRDIHRFHHFNSG CSCGDYW
Sbjct: 721 HNFMKLTSQLLDREIIVRDIHRFHHFNSGHCSCGDYW 757

BLAST of Lag0006021 vs. NCBI nr
Match: XP_038889548.1 (putative pentatricopeptide repeat-containing protein At3g23330 [Benincasa hispida] >XP_038889549.1 putative pentatricopeptide repeat-containing protein At3g23330 [Benincasa hispida])

HSP 1 Score: 1358.2 bits (3514), Expect = 0.0e+00
Identity = 672/758 (88.65%), Postives = 710/758 (93.67%), Query Frame = 0

Query: 1   MRWMNLGSGYFASTAFLKLSHYVSQVTMAQRIMSFNLFEHQLFNSCRYHSSNDALANTLH 60
           MR MNL S  FA TAFLKL H + QVTMAQ+I+SFNL EHQLF SC YH+SND+L NTLH
Sbjct: 1   MRLMNLSSCCFA-TAFLKLPHPICQVTMAQKIISFNLSEHQLFKSCCYHTSNDSLVNTLH 60

Query: 61  AKMVKCGSILGSGKFVLSSYVKSEKLNDAKKVFDEMPSRDVLTWTVLISGFARVNCSEMA 120
           AKMVK GSIL SGKFVLSSYVKSEKLNDA+K+FDEMPSRDVLTWTVLISGF+R+NCSEMA
Sbjct: 61  AKMVKNGSILESGKFVLSSYVKSEKLNDAQKLFDEMPSRDVLTWTVLISGFSRINCSEMA 120

Query: 121 LQLFREMLVEDICPNHFTLSCVFKLCSRVGNVQMGKGIHGWILRSGVNLDVVLENSMLDL 180
           LQLFR+MLVE +CPNHFTLS V KLCSRVG++QMGKGIHGWILR+GVNLDVVLENSMLDL
Sbjct: 121 LQLFRKMLVEGVCPNHFTLSTVLKLCSRVGDMQMGKGIHGWILRNGVNLDVVLENSMLDL 180

Query: 181 YAKFDAFDYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLDLFRNLPCRDTASWNTII 240
           YAKFD F  AKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLDLFRN+PCR+TASWNTII
Sbjct: 181 YAKFDDFYCAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLDLFRNMPCRNTASWNTII 240

Query: 241 CGLMQGGYLDTALELLYEMVENEPEFNKVTSSIALSVVSSLLIIELGRQVHGRIVRFGFH 300
           CGLMQGG+L+ ALELLYEMVENEPEFNKVTSSIALSVV+SLLIIELGRQVHGRI+R G H
Sbjct: 241 CGLMQGGHLNAALELLYEMVENEPEFNKVTSSIALSVVASLLIIELGRQVHGRIIRCGLH 300

Query: 301 NDGFVKSSLINMYIKCGNLEKASVIYSQMPSDFARKQDSNIVCSDTMTEIVSRSSMVSGY 360
           NDGFVKSSLINMYIKCGNLEKASVIYSQMPS F  KQDSNIVCSD MTEIVSRSSMVSGY
Sbjct: 301 NDGFVKSSLINMYIKCGNLEKASVIYSQMPSGFVTKQDSNIVCSDMMTEIVSRSSMVSGY 360

Query: 361 VRNGKYEDAFKTFVSMVREGVLMDKFTIASVVSACSNAGFLELGRQIHAYIVKTGEQLDA 420
           + NGKYE+AFKT VSMVRE VLMDKFTIASVVSACSNAG LELGRQIH YI KTGEQLDA
Sbjct: 361 IWNGKYENAFKTVVSMVRERVLMDKFTIASVVSACSNAGVLELGRQIHGYIQKTGEQLDA 420

Query: 421 HLASSLIDMYAKGGSLDCARRIFEQTT-YLNVVIWTTMIAGFALHGQGKEAIRLFEQMRY 480
           HLASSLIDMYAKGGSLDCA RIFEQTT YLNVV+WT+MIAG+ALHGQGKEAIRLFE+MRY
Sbjct: 421 HLASSLIDMYAKGGSLDCAHRIFEQTTNYLNVVLWTSMIAGYALHGQGKEAIRLFERMRY 480

Query: 481 EGIIPNDVTFIGVLTACSHAGLLEEGRLYFNMMKDVYAIEPKVEHFTCMVDLYGRAGRLN 540
           EGIIPN+VTF+GVLTACSHAGLLE GRLYFNMMKDVYAI+PKVEHFTCMVDLYGRAG LN
Sbjct: 481 EGIIPNEVTFVGVLTACSHAGLLEHGRLYFNMMKDVYAIKPKVEHFTCMVDLYGRAGCLN 540

Query: 541 EVKEFIYENDLSHLSAVWKAFLSACRLYKDIEMGNWVSEKLFSLEPQDEGPYVLLSNMCS 600
           EVKEFIYENDLSHLSAVWKAFLS+CRLYK++EMGNWVSEKLFSLE QDEG YVLLSNMCS
Sbjct: 541 EVKEFIYENDLSHLSAVWKAFLSSCRLYKNLEMGNWVSEKLFSLEQQDEGSYVLLSNMCS 600

Query: 601 SNQKWEEASRTRKYMQHRGISKTPGQSWIHVKNQVHSFVAGDRSHPQHAQIYAYLDKLIG 660
            +QKWEEASRTR+ MQHRGI+KTPGQSWIHVKNQVHSFVAGD+SHPQH QIY YLDKLIG
Sbjct: 601 GSQKWEEASRTRRSMQHRGINKTPGQSWIHVKNQVHSFVAGDQSHPQHVQIYEYLDKLIG 660

Query: 661 RLKEIGYLSDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISMSSGIPIRIMKNLRVCTD 720
           RLKEIGYL DVKLVMQDVEEEQGEVLLGWHSEKLA+AYGIIS+ S IPIRIMKNLRVCTD
Sbjct: 661 RLKEIGYLYDVKLVMQDVEEEQGEVLLGWHSEKLALAYGIISLGSAIPIRIMKNLRVCTD 720

Query: 721 CHNFMKLTSQLLGREIIVRDIHRFHHFNSGCCSCGDYW 758
           CHNFMKLTSQLLGREIIVRDIHRFH FNSG CSCGDYW
Sbjct: 721 CHNFMKLTSQLLGREIIVRDIHRFHRFNSGHCSCGDYW 757

BLAST of Lag0006021 vs. NCBI nr
Match: KAG7029890.1 (putative pentatricopeptide repeat-containing protein [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1343.9 bits (3477), Expect = 0.0e+00
Identity = 659/755 (87.28%), Postives = 701/755 (92.85%), Query Frame = 0

Query: 3    WMNLGSGYFASTAFLKLSHYVSQVTMAQRIMSFNLFEHQLFNSCRYHSSNDALANTLHAK 62
            ++    GY ASTAFLKL   VSQVTMAQ+I+ FN   H LF SC +HSSND+L NTLHAK
Sbjct: 285  FLGFSFGYSASTAFLKLFRSVSQVTMAQKIIPFNFSAHHLFESCSFHSSNDSLPNTLHAK 344

Query: 63   MVKCGSILGSGKFVLSSYVKSEKLNDAKKVFDEMPSRDVLTWTVLISGFARVNCSEMALQ 122
            MVK GSI  S KF+LSSYVKSEKLNDA+KVFDEMPSRDVLTWTVLISGFARVNCSEMALQ
Sbjct: 345  MVKNGSIFESRKFILSSYVKSEKLNDARKVFDEMPSRDVLTWTVLISGFARVNCSEMALQ 404

Query: 123  LFREMLVEDICPNHFTLSCVFKLCSRVGNVQMGKGIHGWILRSGVNLDVVLENSMLDLYA 182
            LFREMLVE +CPN FTLS V KLCSRVG+V+MGKGIHGWILRSG++LDVVLENSMLDLYA
Sbjct: 405  LFREMLVEGVCPNPFTLSTVLKLCSRVGDVKMGKGIHGWILRSGISLDVVLENSMLDLYA 464

Query: 183  KFDAFDYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLDLFRNLPCRDTASWNTIICG 242
            KFD FDY  KLFDSMREKSTATYNI+LGV+VRS DVNKSLDLFRNLPCRDTA+WNT+ICG
Sbjct: 465  KFDEFDYVTKLFDSMREKSTATYNILLGVHVRS-DVNKSLDLFRNLPCRDTATWNTVICG 524

Query: 243  LMQGGYLDTALELLYEMVENEPEFNKVTSSIALSVVSSLLIIELGRQVHGRIVRFGFHND 302
            LMQGGYL+ ALELLYEMVENEPEFNKVTSSIALSVVSSLL+ ELGRQVHGRIVR GFHND
Sbjct: 525  LMQGGYLNEALELLYEMVENEPEFNKVTSSIALSVVSSLLVSELGRQVHGRIVRCGFHND 584

Query: 303  GFVKSSLINMYIKCGNLEKASVIYSQMPSDFARKQDSNIVCSDTMTEIVSRSSMVSGYVR 362
            GFVKSSLINMYIKCGNLEKAS IYSQMPS FA++QD +IVCSD MTEIVSRSSMVSGYVR
Sbjct: 585  GFVKSSLINMYIKCGNLEKASAIYSQMPSGFAKRQDFDIVCSDAMTEIVSRSSMVSGYVR 644

Query: 363  NGKYEDAFKTFVSMVREGVLMDKFTIASVVSACSNAGFLELGRQIHAYIVKTGEQLDAHL 422
            NG YEDAFKTFVSMVRE VLMDKFTIASVVSACSNAG  ELGRQIHAYI KTGEQLDAHL
Sbjct: 645  NGNYEDAFKTFVSMVRERVLMDKFTIASVVSACSNAGVFELGRQIHAYIQKTGEQLDAHL 704

Query: 423  ASSLIDMYAKGGSLDCARRIFEQTTYLNVVIWTTMIAGFALHGQGKEAIRLFEQMRYEGI 482
             SSLIDMYAKGGSLDCAR+IFEQTTYLNVVIWT+MI G ALHGQGKEAIRLFE+MRYEG+
Sbjct: 705  TSSLIDMYAKGGSLDCARQIFEQTTYLNVVIWTSMITGCALHGQGKEAIRLFEKMRYEGM 764

Query: 483  IPNDVTFIGVLTACSHAGLLEEGRLYFNMMKDVYAIEPKVEHFTCMVDLYGRAGRLNEVK 542
            IPN+VTFIGVL ACSHAGLLE+GRLYFNMMKDVYAI+PKVEHFTCMVDLYGRAGRLNEVK
Sbjct: 765  IPNEVTFIGVLAACSHAGLLEDGRLYFNMMKDVYAIKPKVEHFTCMVDLYGRAGRLNEVK 824

Query: 543  EFIYENDLSHLSAVWKAFLSACRLYKDIEMGNWVSEKLFSLEPQDEGPYVLLSNMCSSNQ 602
            +FIYEND+SHL+AVWKAFLS+C+LYKDIEMGNWVSE+LF LEP DEGPYVLLSNMCSSN+
Sbjct: 825  KFIYENDISHLNAVWKAFLSSCQLYKDIEMGNWVSERLFRLEPLDEGPYVLLSNMCSSNK 884

Query: 603  KWEEASRTRKYMQHRGISKTPGQSWIHVKNQVHSFVAGDRSHPQHAQIYAYLDKLIGRLK 662
            KWEEA RTR+ MQHRGISKTPGQSWIHVKN+VHSFVAGDRSHPQHAQIY YLDKLIGRLK
Sbjct: 885  KWEEAFRTRRSMQHRGISKTPGQSWIHVKNRVHSFVAGDRSHPQHAQIYEYLDKLIGRLK 944

Query: 663  EIGYLSDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISMSSGIPIRIMKNLRVCTDCHN 722
            EIGYL DVKLVMQDVEEEQGEVLLGWHSEKLA+AYG+IS+ S IPIRIMKNLR+CTDCHN
Sbjct: 945  EIGYLFDVKLVMQDVEEEQGEVLLGWHSEKLAIAYGLISLGSSIPIRIMKNLRICTDCHN 1004

Query: 723  FMKLTSQLLGREIIVRDIHRFHHFNSGCCSCGDYW 758
            FMKLTSQLL REIIVRDIHRFHHFNSG CSCGDYW
Sbjct: 1005 FMKLTSQLLCREIIVRDIHRFHHFNSGHCSCGDYW 1038

BLAST of Lag0006021 vs. NCBI nr
Match: XP_022965499.1 (LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g23330 [Cucurbita maxima])

HSP 1 Score: 1314.3 bits (3400), Expect = 0.0e+00
Identity = 638/716 (89.11%), Postives = 681/716 (95.11%), Query Frame = 0

Query: 42   LFNSCRYHSSNDALANTLHAKMVKCGSILGSGKFVLSSYVKSEKLNDAKKVFDEMPSRDV 101
            LF SC YH+SN A A+TLHAKMVK GSIL  GKF++SS+VKSE+L+DA+KVFDEMP RDV
Sbjct: 299  LFKSCCYHASNGASADTLHAKMVKNGSILYLGKFIMSSHVKSERLDDAQKVFDEMPHRDV 358

Query: 102  LTWTVLISGFARVNCSEMALQLFREMLVEDICPNHFTLSCVFKLCSRVGNVQMGKGIHGW 161
            L+WTVLISGFARVNCSEMALQLFREMLVE +CPNHFTLSCV KLCSRVG++QMGKGIHGW
Sbjct: 359  LSWTVLISGFARVNCSEMALQLFREMLVEGVCPNHFTLSCVLKLCSRVGDLQMGKGIHGW 418

Query: 162  ILRSGVNLDVVLENSMLDLYAKFDAFDYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKS 221
            ILRSGVNLDVVL NSMLDLYAKFDAFDYAK+LFDSM+EKSTATYNIMLGVYVRSCDVNKS
Sbjct: 419  ILRSGVNLDVVLGNSMLDLYAKFDAFDYAKQLFDSMKEKSTATYNIMLGVYVRSCDVNKS 478

Query: 222  LDLFRNLPCRDTASWNTIICGLMQGGYLDTALELLYEMVENEPEFNKVTSSIALSVVSSL 281
            LDLFRNLPCRD ASWNTIICGLMQGGYL+TA+ELLYEMV+NEPEFNKVTSSIALSVVSSL
Sbjct: 479  LDLFRNLPCRDAASWNTIICGLMQGGYLNTAMELLYEMVKNEPEFNKVTSSIALSVVSSL 538

Query: 282  LIIELGRQVHGRIVRFGFHNDGFVKSSLINMYIKCGNLEKASVIYSQMPSDFARKQDSNI 341
            LII+LGRQVHGRI RFGFHNDGFV SSLINMYIKCGNLEKASVIYSQMPS+F +K+DSNI
Sbjct: 539  LIIDLGRQVHGRIFRFGFHNDGFVNSSLINMYIKCGNLEKASVIYSQMPSNFGKKRDSNI 598

Query: 342  VCSDTMTEIVSRSSMVSGYVRNGKYEDAFKTFVSMVREGVLMDKFTIASVVSACSNAGFL 401
            VCS+TMTEIVSRSS+VSGYV+NGKYED+FKTFVSM+RE  +MD+FTIAS++SACSNAG L
Sbjct: 599  VCSNTMTEIVSRSSIVSGYVQNGKYEDSFKTFVSMIRERAVMDRFTIASIISACSNAGVL 658

Query: 402  ELGRQIHAYIVKTGEQLDAHLASSLIDMYAKGGSLDCARRIFEQTTYLNVVIWTTMIAGF 461
            ELGRQIHAYI KTGEQLDAHLASSLIDMYAKGGSLDCA +IF QTTYLNVV WT+MI G 
Sbjct: 659  ELGRQIHAYIQKTGEQLDAHLASSLIDMYAKGGSLDCAYQIFVQTTYLNVVTWTSMITGC 718

Query: 462  ALHGQGKEAIRLFEQMRYEGIIPNDVTFIGVLTACSHAGLLEEGRLYFNMMKDVYAIEPK 521
            ALHGQGKEAIRLFEQMRYEGIIPN+VTFIGVL ACSHAGLL+EGRLYFNMMKDVYAIEPK
Sbjct: 719  ALHGQGKEAIRLFEQMRYEGIIPNEVTFIGVLIACSHAGLLDEGRLYFNMMKDVYAIEPK 778

Query: 522  VEHFTCMVDLYGRAGRLNEVKEFIYENDLSHLSAVWKAFLSACRLYKDIEMGNWVSEKLF 581
            VEHFTCMVDLYGRAGRLNEVKEFIY+N+LSH SAVWKAFLS+CRLYKDI+MGNWVSEKLF
Sbjct: 779  VEHFTCMVDLYGRAGRLNEVKEFIYQNNLSHHSAVWKAFLSSCRLYKDIKMGNWVSEKLF 838

Query: 582  SLEPQDEGPYVLLSNMCSSNQKWEEASRTRKYMQHRGISKTPGQSWIHVKNQVHSFVAGD 641
             LEP+DEGPYVLLSNMCSSNQKWEEAS+TR+ MQHRGISKTPGQSWIHVKNQVHSFVAGD
Sbjct: 839  KLEPRDEGPYVLLSNMCSSNQKWEEASKTRRSMQHRGISKTPGQSWIHVKNQVHSFVAGD 898

Query: 642  RSHPQHAQIYAYLDKLIGRLKEIGYLSDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIIS 701
            RSH QHAQIYAYLDKLIGRLKEIGY  DVKLVMQDVEEEQGEVLLGWHSEKLAV YGIIS
Sbjct: 899  RSHLQHAQIYAYLDKLIGRLKEIGYSCDVKLVMQDVEEEQGEVLLGWHSEKLAVTYGIIS 958

Query: 702  MSSGIPIRIMKNLRVCTDCHNFMKLTSQLLGREIIVRDIHRFHHFNSGCCSCGDYW 758
            ++SGIPIRIMKNLRVCTDCHNFMKLTSQLL REIIVRDIHRFHHF SG CSCGDYW
Sbjct: 959  LASGIPIRIMKNLRVCTDCHNFMKLTSQLLDREIIVRDIHRFHHFISGRCSCGDYW 1014

BLAST of Lag0006021 vs. ExPASy Swiss-Prot
Match: Q9LW63 (Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H32 PE=3 SV=1)

HSP 1 Score: 520.8 bits (1340), Expect = 2.6e-146
Identity = 262/708 (37.01%), Postives = 421/708 (59.46%), Query Frame = 0

Query: 56  ANTLHAKMVKCGSIL-GSGKFVLSSYVKSEKLNDAKKVFDEMPSRDVLTWTVLISGFARV 115
           A  LHA+ ++  S+   S   V+S Y   + L++A  +F  + S  VL W  +I  F   
Sbjct: 24  AKQLHAQFIRTQSLSHTSASIVISIYTNLKLLHEALLLFKTLKSPPVLAWKSVIRCFTDQ 83

Query: 116 NCSEMALQLFREMLVEDICPNHFTLSCVFKLCSRVGNVQMGKGIHGWILRSGVNLDVVLE 175
           +    AL  F EM     CP+H     V K C+ + +++ G+ +HG+I+R G++ D+   
Sbjct: 84  SLFSKALASFVEMRASGRCPDHNVFPSVLKSCTMMMDLRFGESVHGFIVRLGMDCDLYTG 143

Query: 176 NSMLDLYAKFDAFD---YAKKLFDSM--REKSTATYNIMLGVYVRSCDVNKSLDLFRNLP 235
           N+++++YAK            +FD M  R  ++   ++     +    ++    +F  +P
Sbjct: 144 NALMNMYAKLLGMGSKISVGNVFDEMPQRTSNSGDEDVKAETCIMPFGIDSVRRVFEVMP 203

Query: 236 CRDTASWNTIICGLMQGGYLDTALELLYEMVENEPEFNKVTSSIALSVVSSLLIIELGRQ 295
            +D  S+NTII G  Q G  + AL ++ EM   + + +  T S  L + S  + +  G++
Sbjct: 204 RKDVVSYNTIIAGYAQSGMYEDALRMVREMGTTDLKPDSFTLSSVLPIFSEYVDVIKGKE 263

Query: 296 VHGRIVRFGFHNDGFVKSSLINMYIKCGNLEKASVIYSQMPSDFARKQDSNIVCSDTMTE 355
           +HG ++R G  +D ++ SSL++MY K   +E +  ++S+            + C D    
Sbjct: 264 IHGYVIRKGIDSDVYIGSSLVDMYAKSARIEDSERVFSR------------LYCRDG--- 323

Query: 356 IVSRSSMVSGYVRNGKYEDAFKTFVSMVREGVLMDKFTIASVVSACSNAGFLELGRQIHA 415
            +S +S+V+GYV+NG+Y +A + F  MV   V       +SV+ AC++   L LG+Q+H 
Sbjct: 324 -ISWNSLVAGYVQNGRYNEALRLFRQMVTAKVKPGAVAFSSVIPACAHLATLHLGKQLHG 383

Query: 416 YIVKTGEQLDAHLASSLIDMYAKGGSLDCARRIFEQTTYLNVVIWTTMIAGFALHGQGKE 475
           Y+++ G   +  +AS+L+DMY+K G++  AR+IF++   L+ V WT +I G ALHG G E
Sbjct: 384 YVLRGGFGSNIFIASALVDMYSKCGNIKAARKIFDRMNVLDEVSWTAIIMGHALHGHGHE 443

Query: 476 AIRLFEQMRYEGIIPNDVTFIGVLTACSHAGLLEEGRLYFNMMKDVYAIEPKVEHFTCMV 535
           A+ LFE+M+ +G+ PN V F+ VLTACSH GL++E   YFN M  VY +  ++EH+  + 
Sbjct: 444 AVSLFEEMKRQGVKPNQVAFVAVLTACSHVGLVDEAWGYFNSMTKVYGLNQELEHYAAVA 503

Query: 536 DLYGRAGRLNEVKEFIYENDLSHLSAVWKAFLSACRLYKDIEMGNWVSEKLFSLEPQDEG 595
           DL GRAG+L E   FI +  +    +VW   LS+C ++K++E+   V+EK+F+++ ++ G
Sbjct: 504 DLLGRAGKLEEAYNFISKMCVEPTGSVWSTLLSSCSVHKNLELAEKVAEKIFTVDSENMG 563

Query: 596 PYVLLSNMCSSNQKWEEASRTRKYMQHRGISKTPGQSWIHVKNQVHSFVAGDRSHPQHAQ 655
            YVL+ NM +SN +W+E ++ R  M+ +G+ K P  SWI +KN+ H FV+GDRSHP   +
Sbjct: 564 AYVLMCNMYASNGRWKEMAKLRLRMRKKGLRKKPACSWIEMKNKTHGFVSGDRSHPSMDK 623

Query: 656 IYAYLDKLIGRLKEIGYLSDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISMSSGIPIR 715
           I  +L  ++ ++++ GY++D   V+ DV+EE    LL  HSE+LAVA+GII+   G  IR
Sbjct: 624 INEFLKAVMEQMEKEGYVADTSGVLHDVDEEHKRELLFGHSERLAVAFGIINTEPGTTIR 683

Query: 716 IMKNLRVCTDCHNFMKLTSQLLGREIIVRDIHRFHHFNSGCCSCGDYW 758
           + KN+R+CTDCH  +K  S++  REIIVRD  RFHHFN G CSCGDYW
Sbjct: 684 VTKNIRICTDCHVAIKFISKITEREIIVRDNSRFHHFNRGNCSCGDYW 715

BLAST of Lag0006021 vs. ExPASy Swiss-Prot
Match: Q9SHZ8 (Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H41 PE=3 SV=1)

HSP 1 Score: 514.2 bits (1323), Expect = 2.5e-144
Identity = 267/701 (38.09%), Postives = 420/701 (59.91%), Query Frame = 0

Query: 76  VLSSYVKSEKLNDAKKVFDEMPSRDVLTWTVLISGFARVNCSEMALQLFREMLVEDICPN 135
           VLS+Y K   ++   + FD++P RD ++WT +I G+  +     A+++  +M+ E I P 
Sbjct: 86  VLSAYSKRGDMDSTCEFFDQLPQRDSVSWTTMIVGYKNIGQYHKAIRVMGDMVKEGIEPT 145

Query: 136 HFTLSCVFKLCSRVGNVQMGKGIHGWILRSGVNLDVVLENSMLDLYAKFDAFDYAKKLFD 195
            FTL+ V    +    ++ GK +H +I++ G+  +V + NS+L++YAK      AK +FD
Sbjct: 146 QFTLTNVLASVAATRCMETGKKVHSFIVKLGLRGNVSVSNSLLNMYAKCGDPMMAKFVFD 205

Query: 196 SMREKSTATYNIMLGVYVRSCDVNKSLDLFRNLPCRDTASWNTIICGLMQGGYLDTALEL 255
            M  +  +++N M+ ++++   ++ ++  F  +  RD  +WN++I G  Q GY   AL++
Sbjct: 206 RMVVRDISSWNAMIALHMQVGQMDLAMAQFEQMAERDIVTWNSMISGFNQRGYDLRALDI 265

Query: 256 LYEMVENE-PEFNKVTSSIALSVVSSLLIIELGRQVHGRIVRFGFHNDGFVKSSLINMYI 315
             +M+ +     ++ T +  LS  ++L  + +G+Q+H  IV  GF   G V ++LI+MY 
Sbjct: 266 FSKMLRDSLLSPDRFTLASVLSACANLEKLCIGKQIHSHIVTTGFDISGIVLNALISMYS 325

Query: 316 KCGNLEKASVIYSQMPSDFAR-----------------KQDSNIVCSDTMTEIVSRSSMV 375
           +CG +E A  +  Q  +   +                  Q  NI  S    ++V+ ++M+
Sbjct: 326 RCGGVETARRLIEQRGTKDLKIEGFTALLDGYIKLGDMNQAKNIFVSLKDRDVVAWTAMI 385

Query: 376 SGYVRNGKYEDAFKTFVSMVREGVLMDKFTIASVVSACSNAGFLELGRQIHAYIVKTGEQ 435
            GY ++G Y +A   F SMV  G   + +T+A+++S  S+   L  G+QIH   VK+GE 
Sbjct: 386 VGYEQHGSYGEAINLFRSMVGGGQRPNSYTLAAMLSVASSLASLSHGKQIHGSAVKSGEI 445

Query: 436 LDAHLASSLIDMYAKGGSLDCARRIFEQ-TTYLNVVIWTTMIAGFALHGQGKEAIRLFEQ 495
               ++++LI MYAK G++  A R F+      + V WT+MI   A HG  +EA+ LFE 
Sbjct: 446 YSVSVSNALITMYAKAGNITSASRAFDLIRCERDTVSWTSMIIALAQHGHAEEALELFET 505

Query: 496 MRYEGIIPNDVTFIGVLTACSHAGLLEEGRLYFNMMKDVYAIEPKVEHFTCMVDLYGRAG 555
           M  EG+ P+ +T++GV +AC+HAGL+ +GR YF+MMKDV  I P + H+ CMVDL+GRAG
Sbjct: 506 MLMEGLRPDHITYVGVFSACTHAGLVNQGRQYFDMMKDVDKIIPTLSHYACMVDLFGRAG 565

Query: 556 RLNEVKEFIYENDLSHLSAVWKAFLSACRLYKDIEMGNWVSEKLFSLEPQDEGPYVLLSN 615
            L E +EFI +  +      W + LSACR++K+I++G   +E+L  LEP++ G Y  L+N
Sbjct: 566 LLQEAQEFIEKMPIEPDVVTWGSLLSACRVHKNIDLGKVAAERLLLLEPENSGAYSALAN 625

Query: 616 MCSSNQKWEEASRTRKYMQHRGISKTPGQSWIHVKNQVHSFVAGDRSHPQHAQIYAYLDK 675
           + S+  KWEEA++ RK M+   + K  G SWI VK++VH F   D +HP+  +IY  + K
Sbjct: 626 LYSACGKWEEAAKIRKSMKDGRVKKEQGFSWIEVKHKVHVFGVEDGTHPEKNEIYMTMKK 685

Query: 676 LIGRLKEIGYLSDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISMSSGIPIRIMKNLRV 735
           +   +K++GY+ D   V+ D+EEE  E +L  HSEKLA+A+G+IS      +RIMKNLRV
Sbjct: 686 IWDEIKKMGYVPDTASVLHDLEEEVKEQILRHHSEKLAIAFGLISTPDKTTLRIMKNLRV 745

Query: 736 CTDCHNFMKLTSQLLGREIIVRDIHRFHHFNSGCCSCGDYW 758
           C DCH  +K  S+L+GREIIVRD  RFHHF  G CSC DYW
Sbjct: 746 CNDCHTAIKFISKLVGREIIVRDTTRFHHFKDGFCSCRDYW 786

BLAST of Lag0006021 vs. ExPASy Swiss-Prot
Match: Q9SVP7 (Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H42 PE=2 SV=2)

HSP 1 Score: 488.0 bits (1255), Expect = 1.9e-136
Identity = 243/702 (34.62%), Postives = 397/702 (56.55%), Query Frame = 0

Query: 59   LHAKMVKCGSILGSGKF---VLSSYVKSEKLNDAKKVFDEMPSRDVLTWTVLISGFARVN 118
            LHA   K G    + K    +L+ Y K   +  A   F E    +V+ W V++  +  ++
Sbjct: 411  LHAYTTKLG-FASNNKIEGALLNLYAKCADIETALDYFLETEVENVVLWNVMLVAYGLLD 470

Query: 119  CSEMALQLFREMLVEDICPNHFTLSCVFKLCSRVGNVQMGKGIHGWILRSGVNLDVVLEN 178
                + ++FR+M +E+I PN +T   + K C R+G++++G+ IH  I+++   L+  + +
Sbjct: 471  DLRNSFRIFRQMQIEEIVPNQYTYPSILKTCIRLGDLELGEQIHSQIIKTNFQLNAYVCS 530

Query: 179  SMLDLYAKFDAFDYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLDLFRNLPCRDTAS 238
             ++D+YAK    D A                                D+      +D  S
Sbjct: 531  VLIDMYAKLGKLDTA-------------------------------WDILIRFAGKDVVS 590

Query: 239  WNTIICGLMQGGYLDTALELLYEMVENEPEFNKVTSSIALSVVSSLLIIELGRQVHGRIV 298
            W T+I G  Q  + D AL    +M++     ++V  + A+S  + L  ++ G+Q+H +  
Sbjct: 591  WTTMIAGYTQYNFDDKALTTFRQMLDRGIRSDEVGLTNAVSACAGLQALKEGQQIHAQAC 650

Query: 299  RFGFHNDGFVKSSLINMYIKCGNLEKASVIYSQMPSDFARKQDSNIVCSDTMTEIVSRSS 358
              GF +D   +++L+ +Y +CG +E++ + + Q  +                 + ++ ++
Sbjct: 651  VSGFSSDLPFQNALVTLYSRCGKIEESYLAFEQTEAG----------------DNIAWNA 710

Query: 359  MVSGYVRNGKYEDAFKTFVSMVREGVLMDKFTIASVVSACSNAGFLELGRQIHAYIVKTG 418
            +VSG+ ++G  E+A + FV M REG+  + FT  S V A S    ++ G+Q+HA I KTG
Sbjct: 711  LVSGFQQSGNNEEALRVFVRMNREGIDNNNFTFGSAVKAASETANMKQGKQVHAVITKTG 770

Query: 419  EQLDAHLASSLIDMYAKGGSLDCARRIFEQTTYLNVVIWTTMIAGFALHGQGKEAIRLFE 478
               +  + ++LI MYAK GS+  A + F + +  N V W  +I  ++ HG G EA+  F+
Sbjct: 771  YDSETEVCNALISMYAKCGSISDAEKQFLEVSTKNEVSWNAIINAYSKHGFGSEALDSFD 830

Query: 479  QMRYEGIIPNDVTFIGVLTACSHAGLLEEGRLYFNMMKDVYAIEPKVEHFTCMVDLYGRA 538
            QM +  + PN VT +GVL+ACSH GL+++G  YF  M   Y + PK EH+ C+VD+  RA
Sbjct: 831  QMIHSNVRPNHVTLVGVLSACSHIGLVDKGIAYFESMNSEYGLSPKPEHYVCVVDMLTRA 890

Query: 539  GRLNEVKEFIYENDLSHLSAVWKAFLSACRLYKDIEMGNWVSEKLFSLEPQDEGPYVLLS 598
            G L+  KEFI E  +   + VW+  LSAC ++K++E+G + +  L  LEP+D   YVLLS
Sbjct: 891  GLLSRAKEFIQEMPIKPDALVWRTLLSACVVHKNMEIGEFAAHHLLELEPEDSATYVLLS 950

Query: 599  NMCSSNQKWEEASRTRKYMQHRGISKTPGQSWIHVKNQVHSFVAGDRSHPQHAQIYAYLD 658
            N+ + ++KW+    TR+ M+ +G+ K PGQSWI VKN +HSF  GD++HP   +I+ Y  
Sbjct: 951  NLYAVSKKWDARDLTRQKMKEKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLADEIHEYFQ 1010

Query: 659  KLIGRLKEIGYLSDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISMSSGIPIRIMKNLR 718
             L  R  EIGY+ D   ++ +++ EQ + ++  HSEKLA+++G++S+ + +PI +MKNLR
Sbjct: 1011 DLTKRASEIGYVQDCFSLLNELQHEQKDPIIFIHSEKLAISFGLLSLPATVPINVMKNLR 1064

Query: 719  VCTDCHNFMKLTSQLLGREIIVRDIHRFHHFNSGCCSCGDYW 758
            VC DCH ++K  S++  REIIVRD +RFHHF  G CSC DYW
Sbjct: 1071 VCNDCHAWIKFVSKVSNREIIVRDAYRFHHFEGGACSCKDYW 1064

BLAST of Lag0006021 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 483.8 bits (1244), Expect = 3.6e-135
Identity = 254/732 (34.70%), Postives = 409/732 (55.87%), Query Frame = 0

Query: 43  FNSCRYHSSNDALAN--------TLHAKMVKCGSILGSGKFVLSSYVK-------SEKLN 102
           ++S R H S   L N         +HA+M+K G  L +  + LS  ++        E L 
Sbjct: 28  YDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIG--LHNTNYALSKLIEFCILSPHFEGLP 87

Query: 103 DAKKVFDEMPSRDVLTWTVLISGFARVNCSEMALQLFREMLVEDICPNHFTLSCVFKLCS 162
            A  VF  +   ++L W  +  G A  +    AL+L+  M+   + PN +T   V K C+
Sbjct: 88  YAISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCA 147

Query: 163 RVGNVQMGKGIHGWILRSGVNLDVVLENSMLDLYAKFDAFDYAKKLFDSMREKSTATYNI 222
           +    + G+ IHG +L+ G +LD+ +  S++ +Y +    + A K+FD    +   +Y  
Sbjct: 148 KSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTA 207

Query: 223 MLGVYVRSCDVNKSLDLFRNLPCRDTASWNTIICGLMQGGYLDTALELLYEMVENEPEFN 282
           ++  Y     +  +  LF  +P +D  SWN +I G  + G    ALEL  +M++     +
Sbjct: 208 LIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPD 267

Query: 283 KVTSSIALSVVSSLLIIELGRQVHGRIVRFGFHNDGFVKSSLINMYIKCGNLEKASVIYS 342
           + T    +S  +    IELGRQVH  I   GF ++  + ++LI++Y KCG LE A  ++ 
Sbjct: 268 ESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFE 327

Query: 343 QMPSDFARKQDSNIVCSDTMTEIVSRSSMVSGYVRNGKYEDAFKTFVSMVREGVLMDKFT 402
           ++P                  +++S ++++ GY     Y++A   F  M+R G   +  T
Sbjct: 328 RLP----------------YKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVT 387

Query: 403 IASVVSACSNAGFLELGRQIHAYIVK--TGEQLDAHLASSLIDMYAKGGSLDCARRIFEQ 462
           + S++ AC++ G +++GR IH YI K   G    + L +SLIDMYAK G ++ A ++F  
Sbjct: 388 MLSILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNS 447

Query: 463 TTYLNVVIWTTMIAGFALHGQGKEAIRLFEQMRYEGIIPNDVTFIGVLTACSHAGLLEEG 522
             + ++  W  MI GFA+HG+   +  LF +MR  GI P+D+TF+G+L+ACSH+G+L+ G
Sbjct: 448 ILHKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLG 507

Query: 523 RLYFNMMKDVYAIEPKVEHFTCMVDLYGRAGRLNEVKEFIYENDLSHLSAVWKAFLSACR 582
           R  F  M   Y + PK+EH+ CM+DL G +G   E +E I   ++     +W + L AC+
Sbjct: 508 RHIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACK 567

Query: 583 LYKDIEMGNWVSEKLFSLEPQDEGPYVLLSNMCSSNQKWEEASRTRKYMQHRGISKTPGQ 642
           ++ ++E+G   +E L  +EP++ G YVLLSN+ +S  +W E ++TR  +  +G+ K PG 
Sbjct: 568 MHGNVELGESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGC 627

Query: 643 SWIHVKNQVHSFVAGDRSHPQHAQIYAYLDKLIGRLKEIGYLSDVKLVMQDVEEEQGEVL 702
           S I + + VH F+ GD+ HP++ +IY  L+++   L++ G++ D   V+Q++EEE  E  
Sbjct: 628 SSIEIDSVVHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGA 687

Query: 703 LGWHSEKLAVAYGIISMSSGIPIRIMKNLRVCTDCHNFMKLTSQLLGREIIVRDIHRFHH 758
           L  HSEKLA+A+G+IS   G  + I+KNLRVC +CH   KL S++  REII RD  RFHH
Sbjct: 688 LRHHSEKLAIAFGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHH 741

BLAST of Lag0006021 vs. ExPASy Swiss-Prot
Match: Q9ZUW3 (Pentatricopeptide repeat-containing protein At2g27610 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H60 PE=2 SV=1)

HSP 1 Score: 483.0 bits (1242), Expect = 6.1e-135
Identity = 263/782 (33.63%), Postives = 425/782 (54.35%), Query Frame = 0

Query: 52  NDALANTLHAKMVKCGSI--LGSGKFVLSSYVKSEKLNDAKKVFDEMPSRDVLTWTVLIS 111
           ++     LH + +K G +  +  G  ++ +Y+K     D +KVFDEM  R+V+TWT LIS
Sbjct: 108 DELFGRQLHCQCIKFGFLDDVSVGTSLVDTYMKGSNFKDGRKVFDEMKERNVVTWTTLIS 167

Query: 112 GFARVNCSEMALQLFREMLVEDICPNHFTLSCVFKLCSRVGNVQMGKGIHGWILRSGVNL 171
           G+AR + ++  L LF  M  E   PN FT +    + +  G    G  +H  ++++G++ 
Sbjct: 168 GYARNSMNDEVLTLFMRMQNEGTQPNSFTFAAALGVLAEEGVGGRGLQVHTVVVKNGLDK 227

Query: 172 DVVLENSMLDLYAKFDAFDYAKKLFDSMREKSTATYNIMLGVYVRS-------------- 231
            + + NS+++LY K      A+ LFD    KS  T+N M+  Y  +              
Sbjct: 228 TIPVSNSLINLYLKCGNVRKARILFDKTEVKSVVTWNSMISGYAANGLDLEALGMFYSMR 287

Query: 232 -------------------------------CDVNK------------------------ 291
                                          C V K                        
Sbjct: 288 LNYVRLSESSFASVIKLCANLKELRFTEQLHCSVVKYGFLFDQNIRTALMVAYSKCTAML 347

Query: 292 -SLDLFRNLPC-RDTASWNTIICGLMQGGYLDTALELLYEMVENEPEFNKVTSSIALSVV 351
            +L LF+ + C  +  SW  +I G +Q    + A++L  EM       N+ T S+ L+  
Sbjct: 348 DALRLFKEIGCVGNVVSWTAMISGFLQNDGKEEAVDLFSEMKRKGVRPNEFTYSVILTA- 407

Query: 352 SSLLIIELGRQVHGRIVRFGFHNDGFVKSSLINMYIKCGNLEKASVIYSQMPSDFARKQD 411
              L +    +VH ++V+  +     V ++L++ Y+K G +E+A+ ++S +         
Sbjct: 408 ---LPVISPSEVHAQVVKTNYERSSTVGTALLDAYVKLGKVEEAAKVFSGIDD------- 467

Query: 412 SNIVCSDTMTEIVSRSSMVSGYVRNGKYEDAFKTFVSMVREGVLMDKFTIASVVSAC--S 471
                     +IV+ S+M++GY + G+ E A K F  + + G+  ++FT +S+++ C  +
Sbjct: 468 ---------KDIVAWSAMLAGYAQTGETEAAIKMFGELTKGGIKPNEFTFSSILNVCAAT 527

Query: 472 NAGFLELGRQIHAYIVKTGEQLDAHLASSLIDMYAKGGSLDCARRIFEQTTYLNVVIWTT 531
           NA  +  G+Q H + +K+       ++S+L+ MYAK G+++ A  +F++    ++V W +
Sbjct: 528 NAS-MGQGKQFHGFAIKSRLDSSLCVSSALLTMYAKKGNIESAEEVFKRQREKDLVSWNS 587

Query: 532 MIAGFALHGQGKEAIRLFEQMRYEGIIPNDVTFIGVLTACSHAGLLEEGRLYFNMMKDVY 591
           MI+G+A HGQ  +A+ +F++M+   +  + VTFIGV  AC+HAGL+EEG  YF++M    
Sbjct: 588 MISGYAQHGQAMKALDVFKEMKKRKVKMDGVTFIGVFAACTHAGLVEEGEKYFDIMVRDC 647

Query: 592 AIEPKVEHFTCMVDLYGRAGRLNEVKEFIYENDLSHLSAVWKAFLSACRLYKDIEMGNWV 651
            I P  EH +CMVDLY RAG+L +  + I        S +W+  L+ACR++K  E+G   
Sbjct: 648 KIAPTKEHNSCMVDLYSRAGQLEKAMKVIENMPNPAGSTIWRTILAACRVHKKTELGRLA 707

Query: 652 SEKLFSLEPQDEGPYVLLSNMCSSNQKWEEASRTRKYMQHRGISKTPGQSWIHVKNQVHS 711
           +EK+ +++P+D   YVLLSNM + +  W+E ++ RK M  R + K PG SWI VKN+ +S
Sbjct: 708 AEKIIAMKPEDSAAYVLLSNMYAESGDWQERAKVRKLMNERNVKKEPGYSWIEVKNKTYS 767

Query: 712 FVAGDRSHPQHAQIYAYLDKLIGRLKEIGYLSDVKLVMQDVEEEQGEVLLGWHSEKLAVA 758
           F+AGDRSHP   QIY  L+ L  RLK++GY  D   V+QD+++E  E +L  HSE+LA+A
Sbjct: 768 FLAGDRSHPLKDQIYMKLEDLSTRLKDLGYEPDTSYVLQDIDDEHKEAVLAQHSERLAIA 827

BLAST of Lag0006021 vs. ExPASy TrEMBL
Match: A0A6J1HR62 (LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g23330 OS=Cucurbita maxima OX=3661 GN=LOC111465385 PE=3 SV=1)

HSP 1 Score: 1314.3 bits (3400), Expect = 0.0e+00
Identity = 638/716 (89.11%), Postives = 681/716 (95.11%), Query Frame = 0

Query: 42   LFNSCRYHSSNDALANTLHAKMVKCGSILGSGKFVLSSYVKSEKLNDAKKVFDEMPSRDV 101
            LF SC YH+SN A A+TLHAKMVK GSIL  GKF++SS+VKSE+L+DA+KVFDEMP RDV
Sbjct: 299  LFKSCCYHASNGASADTLHAKMVKNGSILYLGKFIMSSHVKSERLDDAQKVFDEMPHRDV 358

Query: 102  LTWTVLISGFARVNCSEMALQLFREMLVEDICPNHFTLSCVFKLCSRVGNVQMGKGIHGW 161
            L+WTVLISGFARVNCSEMALQLFREMLVE +CPNHFTLSCV KLCSRVG++QMGKGIHGW
Sbjct: 359  LSWTVLISGFARVNCSEMALQLFREMLVEGVCPNHFTLSCVLKLCSRVGDLQMGKGIHGW 418

Query: 162  ILRSGVNLDVVLENSMLDLYAKFDAFDYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKS 221
            ILRSGVNLDVVL NSMLDLYAKFDAFDYAK+LFDSM+EKSTATYNIMLGVYVRSCDVNKS
Sbjct: 419  ILRSGVNLDVVLGNSMLDLYAKFDAFDYAKQLFDSMKEKSTATYNIMLGVYVRSCDVNKS 478

Query: 222  LDLFRNLPCRDTASWNTIICGLMQGGYLDTALELLYEMVENEPEFNKVTSSIALSVVSSL 281
            LDLFRNLPCRD ASWNTIICGLMQGGYL+TA+ELLYEMV+NEPEFNKVTSSIALSVVSSL
Sbjct: 479  LDLFRNLPCRDAASWNTIICGLMQGGYLNTAMELLYEMVKNEPEFNKVTSSIALSVVSSL 538

Query: 282  LIIELGRQVHGRIVRFGFHNDGFVKSSLINMYIKCGNLEKASVIYSQMPSDFARKQDSNI 341
            LII+LGRQVHGRI RFGFHNDGFV SSLINMYIKCGNLEKASVIYSQMPS+F +K+DSNI
Sbjct: 539  LIIDLGRQVHGRIFRFGFHNDGFVNSSLINMYIKCGNLEKASVIYSQMPSNFGKKRDSNI 598

Query: 342  VCSDTMTEIVSRSSMVSGYVRNGKYEDAFKTFVSMVREGVLMDKFTIASVVSACSNAGFL 401
            VCS+TMTEIVSRSS+VSGYV+NGKYED+FKTFVSM+RE  +MD+FTIAS++SACSNAG L
Sbjct: 599  VCSNTMTEIVSRSSIVSGYVQNGKYEDSFKTFVSMIRERAVMDRFTIASIISACSNAGVL 658

Query: 402  ELGRQIHAYIVKTGEQLDAHLASSLIDMYAKGGSLDCARRIFEQTTYLNVVIWTTMIAGF 461
            ELGRQIHAYI KTGEQLDAHLASSLIDMYAKGGSLDCA +IF QTTYLNVV WT+MI G 
Sbjct: 659  ELGRQIHAYIQKTGEQLDAHLASSLIDMYAKGGSLDCAYQIFVQTTYLNVVTWTSMITGC 718

Query: 462  ALHGQGKEAIRLFEQMRYEGIIPNDVTFIGVLTACSHAGLLEEGRLYFNMMKDVYAIEPK 521
            ALHGQGKEAIRLFEQMRYEGIIPN+VTFIGVL ACSHAGLL+EGRLYFNMMKDVYAIEPK
Sbjct: 719  ALHGQGKEAIRLFEQMRYEGIIPNEVTFIGVLIACSHAGLLDEGRLYFNMMKDVYAIEPK 778

Query: 522  VEHFTCMVDLYGRAGRLNEVKEFIYENDLSHLSAVWKAFLSACRLYKDIEMGNWVSEKLF 581
            VEHFTCMVDLYGRAGRLNEVKEFIY+N+LSH SAVWKAFLS+CRLYKDI+MGNWVSEKLF
Sbjct: 779  VEHFTCMVDLYGRAGRLNEVKEFIYQNNLSHHSAVWKAFLSSCRLYKDIKMGNWVSEKLF 838

Query: 582  SLEPQDEGPYVLLSNMCSSNQKWEEASRTRKYMQHRGISKTPGQSWIHVKNQVHSFVAGD 641
             LEP+DEGPYVLLSNMCSSNQKWEEAS+TR+ MQHRGISKTPGQSWIHVKNQVHSFVAGD
Sbjct: 839  KLEPRDEGPYVLLSNMCSSNQKWEEASKTRRSMQHRGISKTPGQSWIHVKNQVHSFVAGD 898

Query: 642  RSHPQHAQIYAYLDKLIGRLKEIGYLSDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIIS 701
            RSH QHAQIYAYLDKLIGRLKEIGY  DVKLVMQDVEEEQGEVLLGWHSEKLAV YGIIS
Sbjct: 899  RSHLQHAQIYAYLDKLIGRLKEIGYSCDVKLVMQDVEEEQGEVLLGWHSEKLAVTYGIIS 958

Query: 702  MSSGIPIRIMKNLRVCTDCHNFMKLTSQLLGREIIVRDIHRFHHFNSGCCSCGDYW 758
            ++SGIPIRIMKNLRVCTDCHNFMKLTSQLL REIIVRDIHRFHHF SG CSCGDYW
Sbjct: 959  LASGIPIRIMKNLRVCTDCHNFMKLTSQLLDREIIVRDIHRFHHFISGRCSCGDYW 1014

BLAST of Lag0006021 vs. ExPASy TrEMBL
Match: A0A6J1EPP7 (putative pentatricopeptide repeat-containing protein At3g23330 OS=Cucurbita moschata OX=3662 GN=LOC111436248 PE=3 SV=1)

HSP 1 Score: 1307.7 bits (3383), Expect = 0.0e+00
Identity = 642/710 (90.42%), Postives = 672/710 (94.65%), Query Frame = 0

Query: 48   YHSSNDALANTLHAKMVKCGSILGSGKFVLSSYVKSEKLNDAKKVFDEMPSRDVLTWTVL 107
            +HSSND+L NTLHAKMVK GSI  S KF+LSSYVKSEKLNDA+KVFDEMPSRDVLTWTVL
Sbjct: 306  FHSSNDSLPNTLHAKMVKNGSIFESRKFILSSYVKSEKLNDARKVFDEMPSRDVLTWTVL 365

Query: 108  ISGFARVNCSEMALQLFREMLVEDICPNHFTLSCVFKLCSRVGNVQMGKGIHGWILRSGV 167
            ISGFARVNCSEMALQLFREMLVE +CPN FTLS V KLCSRVG+V+MGKGIHGWILRSGV
Sbjct: 366  ISGFARVNCSEMALQLFREMLVEGVCPNPFTLSTVLKLCSRVGDVKMGKGIHGWILRSGV 425

Query: 168  NLDVVLENSMLDLYAKFDAFDYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLDLFRN 227
            +LDVVLENSMLDLYAKFD FDY  KLFDSMREKSTATYNI+LGV+VRS DVNKSLDLFRN
Sbjct: 426  SLDVVLENSMLDLYAKFDEFDYVTKLFDSMREKSTATYNILLGVHVRS-DVNKSLDLFRN 485

Query: 228  LPCRDTASWNTIICGLMQGGYLDTALELLYEMVENEPEFNKVTSSIALSVVSSLLIIELG 287
            LPCRDTASWNT+ICGLMQGGYL+ ALELLYEMVENEPEFNKVTSSIALSVVSSLLIIELG
Sbjct: 486  LPCRDTASWNTVICGLMQGGYLNEALELLYEMVENEPEFNKVTSSIALSVVSSLLIIELG 545

Query: 288  RQVHGRIVRFGFHNDGFVKSSLINMYIKCGNLEKASVIYSQMPSDFARKQDSNIVCSDTM 347
            RQVHGRIVR G HNDGFVKSSLINMYIKCGNLEKASVIYSQMPS FA KQD NIVCSDTM
Sbjct: 546  RQVHGRIVRCGLHNDGFVKSSLINMYIKCGNLEKASVIYSQMPSGFATKQDFNIVCSDTM 605

Query: 348  TEIVSRSSMVSGYVRNGKYEDAFKTFVSMVREGVLMDKFTIASVVSACSNAGFLELGRQI 407
            TEIVSRSSMVSGYVRNGKYEDAFKTFVSMVRE VLMDKFTIASVVSACSNAG  ELGRQI
Sbjct: 606  TEIVSRSSMVSGYVRNGKYEDAFKTFVSMVRERVLMDKFTIASVVSACSNAGVFELGRQI 665

Query: 408  HAYIVKTGEQLDAHLASSLIDMYAKGGSLDCARRIFEQTTYLNVVIWTTMIAGFALHGQG 467
            HAYI KTGEQLDAHL SSLIDMYAKGGSLDCAR+IFEQTTYLNVVIWT+MI G ALHGQG
Sbjct: 666  HAYIQKTGEQLDAHLTSSLIDMYAKGGSLDCARQIFEQTTYLNVVIWTSMITGCALHGQG 725

Query: 468  KEAIRLFEQMRYEGIIPNDVTFIGVLTACSHAGLLEEGRLYFNMMKDVYAIEPKVEHFTC 527
            KEAIRLFE+MRYEG+IPN+VTFIGVL ACSHAGLLE+GRLYFNMMKDVYAI+PKVEHFTC
Sbjct: 726  KEAIRLFEKMRYEGMIPNEVTFIGVLAACSHAGLLEDGRLYFNMMKDVYAIKPKVEHFTC 785

Query: 528  MVDLYGRAGRLNEVKEFIYENDLSHLSAVWKAFLSACRLYKDIEMGNWVSEKLFSLEPQD 587
            MVDLYGRAG LNEVK+FIYENDLSHL+AVWKAFLS+C+LYKDIEMGNWVSE+LF LEP D
Sbjct: 786  MVDLYGRAGHLNEVKKFIYENDLSHLNAVWKAFLSSCQLYKDIEMGNWVSERLFRLEPLD 845

Query: 588  EGPYVLLSNMCSSNQKWEEASRTRKYMQHRGISKTPGQSWIHVKNQVHSFVAGDRSHPQH 647
            EGPYVLLSNMCSSNQKWEEA RTR+ MQHRGISKTPGQSWIHVKN+VHSFVAGDRSHPQH
Sbjct: 846  EGPYVLLSNMCSSNQKWEEAFRTRRSMQHRGISKTPGQSWIHVKNRVHSFVAGDRSHPQH 905

Query: 648  AQIYAYLDKLIGRLKEIGYLSDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISMSSGIP 707
            AQIY YLDKLIGRLKEIGYL DVKLVMQDVEEEQGEVLLGWHSEKLA+AYG+IS+ S IP
Sbjct: 906  AQIYEYLDKLIGRLKEIGYLFDVKLVMQDVEEEQGEVLLGWHSEKLAIAYGLISLGSSIP 965

Query: 708  IRIMKNLRVCTDCHNFMKLTSQLLGREIIVRDIHRFHHFNSGCCSCGDYW 758
            IRIMKNLR+CTDCHNFMKLTSQLL REIIVRDIHRFHHFNSG CSCGDYW
Sbjct: 966  IRIMKNLRICTDCHNFMKLTSQLLCREIIVRDIHRFHHFNSGHCSCGDYW 1014

BLAST of Lag0006021 vs. ExPASy TrEMBL
Match: A0A0A0LKI4 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G074230 PE=3 SV=1)

HSP 1 Score: 1306.6 bits (3380), Expect = 0.0e+00
Identity = 644/758 (84.96%), Postives = 689/758 (90.90%), Query Frame = 0

Query: 1   MRWMNLGSGYFASTAFLKLSHYVSQVTMAQRIMSFNLFEHQLFNSCRYHSSNDALANTLH 60
           MRWMNL S  F S AFLKLSH +SQ TM  +I+SFNL EH LF S  YH+SN   +NTLH
Sbjct: 1   MRWMNLSSSCFPSPAFLKLSHSISQGTMTHKIISFNLSEHHLFKSFSYHTSNHFSSNTLH 60

Query: 61  AKMVKCGSILGSGKFVLSSYVKSEKLNDAKKVFDEMPSRDVLTWTVLISGFARVNCSEMA 120
           AKMVK GSI  SGKFVL+SYVKSEKLNDA+K+FDEMP+RDVLTWT LISGF+RVN S MA
Sbjct: 61  AKMVKIGSIFVSGKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMA 120

Query: 121 LQLFREMLVEDICPNHFTLSCVFKLCSRVGNVQMGKGIHGWILRSGVNLDVVLENSMLDL 180
           LQLFREMLVE + PNHFTLS V KLCS+VG+V+MGKGIHGWILR+GV LDVVLENSMLDL
Sbjct: 121 LQLFREMLVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDL 180

Query: 181 YAKFDAFDYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLDLFRNLPCRDTASWNTII 240
           YAKFD F YA+KL+DSMREKST T NI+LGVYVRSCDVNKSL LFRNLPCR+ ASWNTII
Sbjct: 181 YAKFDEFVYARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTII 240

Query: 241 CGLMQGGYLDTALELLYEMVENEPEFNKVTSSIALSVVSSLLIIELGRQVHGRIVRFGFH 300
           CGLMQGGYL+ ALELLYEMVENE EFN  TSSIALSVVSSLLI+ELGRQVHGRIVR G H
Sbjct: 241 CGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLH 300

Query: 301 NDGFVKSSLINMYIKCGNLEKASVIYSQMPSDFARKQDSNIVCSDTMTEIVSRSSMVSGY 360
           NDGFVKS+LINMYIKCGNLEKASVIYS++PS FA KQ SNIVCSDTMTEIVSRSSMV GY
Sbjct: 301 NDGFVKSALINMYIKCGNLEKASVIYSRLPSGFATKQSSNIVCSDTMTEIVSRSSMVYGY 360

Query: 361 VRNGKYEDAFKTFVSMVREGVLMDKFTIASVVSACSNAGFLELGRQIHAYIVKTGEQLDA 420
           VRNGKYEDAFKTFVSMVRE VLMDKFTIA+VVSACSNAG LELGRQ+H +I KT EQLDA
Sbjct: 361 VRNGKYEDAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDA 420

Query: 421 HLASSLIDMYAKGGSLDCARRIFEQ-TTYLNVVIWTTMIAGFALHGQGKEAIRLFEQMRY 480
           HLASSLIDMYAKGGSLDCA RIF+Q T YLNVVIWT+MI G ALHG GKEAIRLFEQMRY
Sbjct: 421 HLASSLIDMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRY 480

Query: 481 EGIIPNDVTFIGVLTACSHAGLLEEGRLYFNMMKDVYAIEPKVEHFTCMVDLYGRAGRLN 540
           EGIIPN+VTFIGVLTACSHAGLLE+G LYFNMMKDVYAI+PKVEH+TCMVDLYGRAG LN
Sbjct: 481 EGIIPNEVTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLN 540

Query: 541 EVKEFIYENDLSHLSAVWKAFLSACRLYKDIEMGNWVSEKLFSLEPQDEGPYVLLSNMCS 600
           EVKEFIYENDLSHLSAVWKAFLS+CRLY+D+EMG WVSEKLF L+PQDEG YVLLSNMCS
Sbjct: 541 EVKEFIYENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCS 600

Query: 601 SNQKWEEASRTRKYMQHRGISKTPGQSWIHVKNQVHSFVAGDRSHPQHAQIYAYLDKLIG 660
            +QKWEEASR R+ MQH GI+KTPGQSWIH+KNQVHSFVAGD+SHPQHAQIY YLDKLIG
Sbjct: 601 GSQKWEEASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIG 660

Query: 661 RLKEIGYLSDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISMSSGIPIRIMKNLRVCTD 720
           RLKEIGYL DVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIIS+ S IPIRIMKNLR+CTD
Sbjct: 661 RLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTD 720

Query: 721 CHNFMKLTSQLLGREIIVRDIHRFHHFNSGCCSCGDYW 758
           CHNFMKLTSQLLGREIIVRDI+RFHHFNSG CSCGDYW
Sbjct: 721 CHNFMKLTSQLLGREIIVRDIYRFHHFNSGHCSCGDYW 758

BLAST of Lag0006021 vs. ExPASy TrEMBL
Match: A0A6J1KA70 (putative pentatricopeptide repeat-containing protein At3g23330 OS=Cucurbita maxima OX=3661 GN=LOC111492492 PE=3 SV=1)

HSP 1 Score: 1300.0 bits (3363), Expect = 0.0e+00
Identity = 637/710 (89.72%), Postives = 672/710 (94.65%), Query Frame = 0

Query: 48   YHSSNDALANTLHAKMVKCGSILGSGKFVLSSYVKSEKLNDAKKVFDEMPSRDVLTWTVL 107
            YHSSND+L NTLHAKMVK GSI  S KF+LSSYVKSEKLNDA+KVFDEMPSRDVLTWTVL
Sbjct: 306  YHSSNDSLPNTLHAKMVKNGSIFESRKFILSSYVKSEKLNDARKVFDEMPSRDVLTWTVL 365

Query: 108  ISGFARVNCSEMALQLFREMLVEDICPNHFTLSCVFKLCSRVGNVQMGKGIHGWILRSGV 167
            ISGFARVNCSEMALQLFREMLVE + PN FTLS V KLCSRVG+V+MGKGIHGWILRSGV
Sbjct: 366  ISGFARVNCSEMALQLFREMLVEGVYPNPFTLSTVLKLCSRVGDVKMGKGIHGWILRSGV 425

Query: 168  NLDVVLENSMLDLYAKFDAFDYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLDLFRN 227
            +LDVVLENSMLDLYAKFD FDY KKLFDSMREKSTATYNI+LGV+VRS DVNKSLDLFRN
Sbjct: 426  SLDVVLENSMLDLYAKFDEFDYVKKLFDSMREKSTATYNILLGVHVRS-DVNKSLDLFRN 485

Query: 228  LPCRDTASWNTIICGLMQGGYLDTALELLYEMVENEPEFNKVTSSIALSVVSSLLIIELG 287
            LPCRDTASWNT+ICGLMQGGYL+ ALELLYEMVEN+PEFNKVTSSIALSVVSSLLIIELG
Sbjct: 486  LPCRDTASWNTVICGLMQGGYLNEALELLYEMVENQPEFNKVTSSIALSVVSSLLIIELG 545

Query: 288  RQVHGRIVRFGFHNDGFVKSSLINMYIKCGNLEKASVIYSQMPSDFARKQDSNIVCSDTM 347
            RQVHGRI+R GFHNDGFVKSSLINMYIKCGNLEKASVIYSQMPS F +KQD +IV SDTM
Sbjct: 546  RQVHGRILRCGFHNDGFVKSSLINMYIKCGNLEKASVIYSQMPSGFGKKQDFDIVYSDTM 605

Query: 348  TEIVSRSSMVSGYVRNGKYEDAFKTFVSMVREGVLMDKFTIASVVSACSNAGFLELGRQI 407
            TEIVSRSSMVSGYVRNGKYEDAFKTFVSMVRE VLMDKFTIASVVSACSNAG  ELGRQI
Sbjct: 606  TEIVSRSSMVSGYVRNGKYEDAFKTFVSMVRERVLMDKFTIASVVSACSNAGVFELGRQI 665

Query: 408  HAYIVKTGEQLDAHLASSLIDMYAKGGSLDCARRIFEQTTYLNVVIWTTMIAGFALHGQG 467
            HAYI KTGEQLDAHL SSLIDMYAKGGSLDCAR+IFEQ TYLNVVIWT+MI G ALHGQG
Sbjct: 666  HAYIQKTGEQLDAHLTSSLIDMYAKGGSLDCARQIFEQMTYLNVVIWTSMITGCALHGQG 725

Query: 468  KEAIRLFEQMRYEGIIPNDVTFIGVLTACSHAGLLEEGRLYFNMMKDVYAIEPKVEHFTC 527
            KEAIRLFE+MRYEG+IPN+VTFIGVL ACSHAGL+E+GRLYFNMMKDVYAI+PKVEHFTC
Sbjct: 726  KEAIRLFEKMRYEGMIPNEVTFIGVLAACSHAGLIEDGRLYFNMMKDVYAIKPKVEHFTC 785

Query: 528  MVDLYGRAGRLNEVKEFIYENDLSHLSAVWKAFLSACRLYKDIEMGNWVSEKLFSLEPQD 587
            MVDLYGRAGRLNEVK+FIYENDLSHL+AVWKAFLS+C+LYKDIEMGNWVSE+LF LEP D
Sbjct: 786  MVDLYGRAGRLNEVKKFIYENDLSHLNAVWKAFLSSCQLYKDIEMGNWVSERLFRLEPLD 845

Query: 588  EGPYVLLSNMCSSNQKWEEASRTRKYMQHRGISKTPGQSWIHVKNQVHSFVAGDRSHPQH 647
            EGPY+LLSNMCSSNQKWEEA RTR++MQHRGISKTPGQSWIHVKNQVHSFVAGDRSHPQH
Sbjct: 846  EGPYILLSNMCSSNQKWEEAFRTRRFMQHRGISKTPGQSWIHVKNQVHSFVAGDRSHPQH 905

Query: 648  AQIYAYLDKLIGRLKEIGYLSDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISMSSGIP 707
            AQIY YLD LIGRLKEIGYL DVKLVMQDVEEEQGEVLLGWHSEKLA+AYG+IS+ S IP
Sbjct: 906  AQIYEYLDNLIGRLKEIGYLFDVKLVMQDVEEEQGEVLLGWHSEKLAIAYGLISLDSAIP 965

Query: 708  IRIMKNLRVCTDCHNFMKLTSQLLGREIIVRDIHRFHHFNSGCCSCGDYW 758
            IRIMKNLR+CTDCHNFMKLTSQLL REIIVRDIHRFHHFNSG CSCGDYW
Sbjct: 966  IRIMKNLRMCTDCHNFMKLTSQLLCREIIVRDIHRFHHFNSGHCSCGDYW 1014

BLAST of Lag0006021 vs. ExPASy TrEMBL
Match: A0A1S3B4E3 (LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g23330 OS=Cucumis melo OX=3656 GN=LOC103485889 PE=3 SV=1)

HSP 1 Score: 1246.5 bits (3224), Expect = 0.0e+00
Identity = 609/711 (85.65%), Postives = 653/711 (91.84%), Query Frame = 0

Query: 48   YHSSNDALANTLHAKMVKCGSILGSGKFVLSSYVKSEKLNDAKKVFDEMPSRDVLTWTVL 107
            YH+SN   +NTLHAKMVK GSI+ SGKFVL+SYVKS+KLNDA+K+FDEMP+RDVLTWT +
Sbjct: 302  YHTSNSFSSNTLHAKMVKIGSIIESGKFVLTSYVKSKKLNDAQKLFDEMPNRDVLTWTAI 361

Query: 108  ISGFARVNCSEMALQLFREMLVEDICPNHFTLSCVFKLCSRVGNVQMGKGIHGWILRSGV 167
            ISGF+RVNCS MALQLFREMLVE +CPNHFTLS V KLCS+VG+V+MGKGIHGWILR+GV
Sbjct: 362  ISGFSRVNCSGMALQLFREMLVEGVCPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGV 421

Query: 168  NLDVVLENSMLDLYAKFDAFDYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLDLFRN 227
             LDVVLENS+LDLYAKFD F YA+KL+DSM EKST T NI+LGVYVRSCDVNKSL LFRN
Sbjct: 422  KLDVVLENSLLDLYAKFDEFVYARKLYDSMGEKSTDTDNIILGVYVRSCDVNKSLHLFRN 481

Query: 228  LPCRDTASWNTIICGLMQGGYLDTALELLYEMVENEPEFNKVTSSIALSVVSSLLIIELG 287
            LPCR+ ASWNTIICGLMQGGYL+ ALELLYEMVENE EFN  TSSIALSV SSLLI+ELG
Sbjct: 482  LPCRNAASWNTIICGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSVASSLLILELG 541

Query: 288  RQVHGRIVRFGFHNDGFVKSSLINMYIKCGNLEKASVIYSQMPSDFARKQDSNIVCSDTM 347
            RQVHGRIVR G HNDGFVKS+LINMYIKCGNLEKASVIYSQ+PS FA KQ SNIVCSDTM
Sbjct: 542  RQVHGRIVRCGLHNDGFVKSALINMYIKCGNLEKASVIYSQLPSGFATKQGSNIVCSDTM 601

Query: 348  TEIVSRSSMVSGYVRNGKYEDAFKTFVSMVREGVLMDKFTIASVVSACSNAGFLELGRQI 407
            TEIVSRSSMV GYVRNGKYEDAFKTFVSMVRE VLMDKFTIASVVSAC+NAG LELGRQ+
Sbjct: 602  TEIVSRSSMVYGYVRNGKYEDAFKTFVSMVRERVLMDKFTIASVVSACANAGVLELGRQV 661

Query: 408  HAYIVKTGEQLDAHLASSLIDMYAKGGSLDCARRIFEQTT-YLNVVIWTTMIAGFALHGQ 467
            H +I K+ EQLDAHLASSLIDMYAKGGSLDCA RIF+Q T YLNVVIWT+MI G +LHG 
Sbjct: 662  HGFIQKSVEQLDAHLASSLIDMYAKGGSLDCAHRIFDQMTYYLNVVIWTSMIVGCSLHGH 721

Query: 468  GKEAIRLFEQMRYEGIIPNDVTFIGVLTACSHAGLLEEGRLYFNMMKDVYAIEPKVEHFT 527
            GKEAIRLFEQMRYEGIIPN+VTFIGVLTACSHAGLLE+G LYFNMMKDVYAI+PKVEH+T
Sbjct: 722  GKEAIRLFEQMRYEGIIPNEVTFIGVLTACSHAGLLEDGLLYFNMMKDVYAIKPKVEHYT 781

Query: 528  CMVDLYGRAGRLNEVKEFIYENDLSHLSAVWKAFLSACRLYKDIEMGNWVSEKLFSLEPQ 587
            CMVDLYGRAG LNEVKEFIYENDLSHLS VWKAFLS+C LY+D+EMG WVSEKLF LEPQ
Sbjct: 782  CMVDLYGRAGLLNEVKEFIYENDLSHLSVVWKAFLSSCLLYRDLEMGKWVSEKLFRLEPQ 841

Query: 588  DEGPYVLLSNMCSSNQKWEEASRTRKYMQHRGISKTPGQSWIHVKNQVHSFVAGDRSHPQ 647
            DEG YVLLSNMCS +QKW+EASR R  MQH GI+KTPGQSWIH+KNQVHSFVAGDRSHPQ
Sbjct: 842  DEGSYVLLSNMCSGSQKWQEASRARSSMQHSGINKTPGQSWIHLKNQVHSFVAGDRSHPQ 901

Query: 648  HAQIYAYLDKLIGRLKEIGYLSDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISMSSGI 707
            HAQIY YLDKLIGRLKEIGYL DVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIIS+ S I
Sbjct: 902  HAQIYEYLDKLIGRLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAI 961

Query: 708  PIRIMKNLRVCTDCHNFMKLTSQLLGREIIVRDIHRFHHFNSGCCSCGDYW 758
            PIRIMKNLR+CTDCHNFMKLTSQLLGREIIVRDI RFHHFNSG CSCGDYW
Sbjct: 962  PIRIMKNLRICTDCHNFMKLTSQLLGREIIVRDICRFHHFNSGHCSCGDYW 1012

BLAST of Lag0006021 vs. TAIR 10
Match: AT3G23330.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 520.8 bits (1340), Expect = 1.9e-147
Identity = 262/708 (37.01%), Postives = 421/708 (59.46%), Query Frame = 0

Query: 56  ANTLHAKMVKCGSIL-GSGKFVLSSYVKSEKLNDAKKVFDEMPSRDVLTWTVLISGFARV 115
           A  LHA+ ++  S+   S   V+S Y   + L++A  +F  + S  VL W  +I  F   
Sbjct: 24  AKQLHAQFIRTQSLSHTSASIVISIYTNLKLLHEALLLFKTLKSPPVLAWKSVIRCFTDQ 83

Query: 116 NCSEMALQLFREMLVEDICPNHFTLSCVFKLCSRVGNVQMGKGIHGWILRSGVNLDVVLE 175
           +    AL  F EM     CP+H     V K C+ + +++ G+ +HG+I+R G++ D+   
Sbjct: 84  SLFSKALASFVEMRASGRCPDHNVFPSVLKSCTMMMDLRFGESVHGFIVRLGMDCDLYTG 143

Query: 176 NSMLDLYAKFDAFD---YAKKLFDSM--REKSTATYNIMLGVYVRSCDVNKSLDLFRNLP 235
           N+++++YAK            +FD M  R  ++   ++     +    ++    +F  +P
Sbjct: 144 NALMNMYAKLLGMGSKISVGNVFDEMPQRTSNSGDEDVKAETCIMPFGIDSVRRVFEVMP 203

Query: 236 CRDTASWNTIICGLMQGGYLDTALELLYEMVENEPEFNKVTSSIALSVVSSLLIIELGRQ 295
            +D  S+NTII G  Q G  + AL ++ EM   + + +  T S  L + S  + +  G++
Sbjct: 204 RKDVVSYNTIIAGYAQSGMYEDALRMVREMGTTDLKPDSFTLSSVLPIFSEYVDVIKGKE 263

Query: 296 VHGRIVRFGFHNDGFVKSSLINMYIKCGNLEKASVIYSQMPSDFARKQDSNIVCSDTMTE 355
           +HG ++R G  +D ++ SSL++MY K   +E +  ++S+            + C D    
Sbjct: 264 IHGYVIRKGIDSDVYIGSSLVDMYAKSARIEDSERVFSR------------LYCRDG--- 323

Query: 356 IVSRSSMVSGYVRNGKYEDAFKTFVSMVREGVLMDKFTIASVVSACSNAGFLELGRQIHA 415
            +S +S+V+GYV+NG+Y +A + F  MV   V       +SV+ AC++   L LG+Q+H 
Sbjct: 324 -ISWNSLVAGYVQNGRYNEALRLFRQMVTAKVKPGAVAFSSVIPACAHLATLHLGKQLHG 383

Query: 416 YIVKTGEQLDAHLASSLIDMYAKGGSLDCARRIFEQTTYLNVVIWTTMIAGFALHGQGKE 475
           Y+++ G   +  +AS+L+DMY+K G++  AR+IF++   L+ V WT +I G ALHG G E
Sbjct: 384 YVLRGGFGSNIFIASALVDMYSKCGNIKAARKIFDRMNVLDEVSWTAIIMGHALHGHGHE 443

Query: 476 AIRLFEQMRYEGIIPNDVTFIGVLTACSHAGLLEEGRLYFNMMKDVYAIEPKVEHFTCMV 535
           A+ LFE+M+ +G+ PN V F+ VLTACSH GL++E   YFN M  VY +  ++EH+  + 
Sbjct: 444 AVSLFEEMKRQGVKPNQVAFVAVLTACSHVGLVDEAWGYFNSMTKVYGLNQELEHYAAVA 503

Query: 536 DLYGRAGRLNEVKEFIYENDLSHLSAVWKAFLSACRLYKDIEMGNWVSEKLFSLEPQDEG 595
           DL GRAG+L E   FI +  +    +VW   LS+C ++K++E+   V+EK+F+++ ++ G
Sbjct: 504 DLLGRAGKLEEAYNFISKMCVEPTGSVWSTLLSSCSVHKNLELAEKVAEKIFTVDSENMG 563

Query: 596 PYVLLSNMCSSNQKWEEASRTRKYMQHRGISKTPGQSWIHVKNQVHSFVAGDRSHPQHAQ 655
            YVL+ NM +SN +W+E ++ R  M+ +G+ K P  SWI +KN+ H FV+GDRSHP   +
Sbjct: 564 AYVLMCNMYASNGRWKEMAKLRLRMRKKGLRKKPACSWIEMKNKTHGFVSGDRSHPSMDK 623

Query: 656 IYAYLDKLIGRLKEIGYLSDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISMSSGIPIR 715
           I  +L  ++ ++++ GY++D   V+ DV+EE    LL  HSE+LAVA+GII+   G  IR
Sbjct: 624 INEFLKAVMEQMEKEGYVADTSGVLHDVDEEHKRELLFGHSERLAVAFGIINTEPGTTIR 683

Query: 716 IMKNLRVCTDCHNFMKLTSQLLGREIIVRDIHRFHHFNSGCCSCGDYW 758
           + KN+R+CTDCH  +K  S++  REIIVRD  RFHHFN G CSCGDYW
Sbjct: 684 VTKNIRICTDCHVAIKFISKITEREIIVRDNSRFHHFNRGNCSCGDYW 715

BLAST of Lag0006021 vs. TAIR 10
Match: AT2G22070.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 514.2 bits (1323), Expect = 1.7e-145
Identity = 267/701 (38.09%), Postives = 420/701 (59.91%), Query Frame = 0

Query: 76  VLSSYVKSEKLNDAKKVFDEMPSRDVLTWTVLISGFARVNCSEMALQLFREMLVEDICPN 135
           VLS+Y K   ++   + FD++P RD ++WT +I G+  +     A+++  +M+ E I P 
Sbjct: 86  VLSAYSKRGDMDSTCEFFDQLPQRDSVSWTTMIVGYKNIGQYHKAIRVMGDMVKEGIEPT 145

Query: 136 HFTLSCVFKLCSRVGNVQMGKGIHGWILRSGVNLDVVLENSMLDLYAKFDAFDYAKKLFD 195
            FTL+ V    +    ++ GK +H +I++ G+  +V + NS+L++YAK      AK +FD
Sbjct: 146 QFTLTNVLASVAATRCMETGKKVHSFIVKLGLRGNVSVSNSLLNMYAKCGDPMMAKFVFD 205

Query: 196 SMREKSTATYNIMLGVYVRSCDVNKSLDLFRNLPCRDTASWNTIICGLMQGGYLDTALEL 255
            M  +  +++N M+ ++++   ++ ++  F  +  RD  +WN++I G  Q GY   AL++
Sbjct: 206 RMVVRDISSWNAMIALHMQVGQMDLAMAQFEQMAERDIVTWNSMISGFNQRGYDLRALDI 265

Query: 256 LYEMVENE-PEFNKVTSSIALSVVSSLLIIELGRQVHGRIVRFGFHNDGFVKSSLINMYI 315
             +M+ +     ++ T +  LS  ++L  + +G+Q+H  IV  GF   G V ++LI+MY 
Sbjct: 266 FSKMLRDSLLSPDRFTLASVLSACANLEKLCIGKQIHSHIVTTGFDISGIVLNALISMYS 325

Query: 316 KCGNLEKASVIYSQMPSDFAR-----------------KQDSNIVCSDTMTEIVSRSSMV 375
           +CG +E A  +  Q  +   +                  Q  NI  S    ++V+ ++M+
Sbjct: 326 RCGGVETARRLIEQRGTKDLKIEGFTALLDGYIKLGDMNQAKNIFVSLKDRDVVAWTAMI 385

Query: 376 SGYVRNGKYEDAFKTFVSMVREGVLMDKFTIASVVSACSNAGFLELGRQIHAYIVKTGEQ 435
            GY ++G Y +A   F SMV  G   + +T+A+++S  S+   L  G+QIH   VK+GE 
Sbjct: 386 VGYEQHGSYGEAINLFRSMVGGGQRPNSYTLAAMLSVASSLASLSHGKQIHGSAVKSGEI 445

Query: 436 LDAHLASSLIDMYAKGGSLDCARRIFEQ-TTYLNVVIWTTMIAGFALHGQGKEAIRLFEQ 495
               ++++LI MYAK G++  A R F+      + V WT+MI   A HG  +EA+ LFE 
Sbjct: 446 YSVSVSNALITMYAKAGNITSASRAFDLIRCERDTVSWTSMIIALAQHGHAEEALELFET 505

Query: 496 MRYEGIIPNDVTFIGVLTACSHAGLLEEGRLYFNMMKDVYAIEPKVEHFTCMVDLYGRAG 555
           M  EG+ P+ +T++GV +AC+HAGL+ +GR YF+MMKDV  I P + H+ CMVDL+GRAG
Sbjct: 506 MLMEGLRPDHITYVGVFSACTHAGLVNQGRQYFDMMKDVDKIIPTLSHYACMVDLFGRAG 565

Query: 556 RLNEVKEFIYENDLSHLSAVWKAFLSACRLYKDIEMGNWVSEKLFSLEPQDEGPYVLLSN 615
            L E +EFI +  +      W + LSACR++K+I++G   +E+L  LEP++ G Y  L+N
Sbjct: 566 LLQEAQEFIEKMPIEPDVVTWGSLLSACRVHKNIDLGKVAAERLLLLEPENSGAYSALAN 625

Query: 616 MCSSNQKWEEASRTRKYMQHRGISKTPGQSWIHVKNQVHSFVAGDRSHPQHAQIYAYLDK 675
           + S+  KWEEA++ RK M+   + K  G SWI VK++VH F   D +HP+  +IY  + K
Sbjct: 626 LYSACGKWEEAAKIRKSMKDGRVKKEQGFSWIEVKHKVHVFGVEDGTHPEKNEIYMTMKK 685

Query: 676 LIGRLKEIGYLSDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISMSSGIPIRIMKNLRV 735
           +   +K++GY+ D   V+ D+EEE  E +L  HSEKLA+A+G+IS      +RIMKNLRV
Sbjct: 686 IWDEIKKMGYVPDTASVLHDLEEEVKEQILRHHSEKLAIAFGLISTPDKTTLRIMKNLRV 745

Query: 736 CTDCHNFMKLTSQLLGREIIVRDIHRFHHFNSGCCSCGDYW 758
           C DCH  +K  S+L+GREIIVRD  RFHHF  G CSC DYW
Sbjct: 746 CNDCHTAIKFISKLVGREIIVRDTTRFHHFKDGFCSCRDYW 786

BLAST of Lag0006021 vs. TAIR 10
Match: AT4G13650.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 488.0 bits (1255), Expect = 1.3e-137
Identity = 243/702 (34.62%), Postives = 397/702 (56.55%), Query Frame = 0

Query: 59   LHAKMVKCGSILGSGKF---VLSSYVKSEKLNDAKKVFDEMPSRDVLTWTVLISGFARVN 118
            LHA   K G    + K    +L+ Y K   +  A   F E    +V+ W V++  +  ++
Sbjct: 411  LHAYTTKLG-FASNNKIEGALLNLYAKCADIETALDYFLETEVENVVLWNVMLVAYGLLD 470

Query: 119  CSEMALQLFREMLVEDICPNHFTLSCVFKLCSRVGNVQMGKGIHGWILRSGVNLDVVLEN 178
                + ++FR+M +E+I PN +T   + K C R+G++++G+ IH  I+++   L+  + +
Sbjct: 471  DLRNSFRIFRQMQIEEIVPNQYTYPSILKTCIRLGDLELGEQIHSQIIKTNFQLNAYVCS 530

Query: 179  SMLDLYAKFDAFDYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLDLFRNLPCRDTAS 238
             ++D+YAK    D A                                D+      +D  S
Sbjct: 531  VLIDMYAKLGKLDTA-------------------------------WDILIRFAGKDVVS 590

Query: 239  WNTIICGLMQGGYLDTALELLYEMVENEPEFNKVTSSIALSVVSSLLIIELGRQVHGRIV 298
            W T+I G  Q  + D AL    +M++     ++V  + A+S  + L  ++ G+Q+H +  
Sbjct: 591  WTTMIAGYTQYNFDDKALTTFRQMLDRGIRSDEVGLTNAVSACAGLQALKEGQQIHAQAC 650

Query: 299  RFGFHNDGFVKSSLINMYIKCGNLEKASVIYSQMPSDFARKQDSNIVCSDTMTEIVSRSS 358
              GF +D   +++L+ +Y +CG +E++ + + Q  +                 + ++ ++
Sbjct: 651  VSGFSSDLPFQNALVTLYSRCGKIEESYLAFEQTEAG----------------DNIAWNA 710

Query: 359  MVSGYVRNGKYEDAFKTFVSMVREGVLMDKFTIASVVSACSNAGFLELGRQIHAYIVKTG 418
            +VSG+ ++G  E+A + FV M REG+  + FT  S V A S    ++ G+Q+HA I KTG
Sbjct: 711  LVSGFQQSGNNEEALRVFVRMNREGIDNNNFTFGSAVKAASETANMKQGKQVHAVITKTG 770

Query: 419  EQLDAHLASSLIDMYAKGGSLDCARRIFEQTTYLNVVIWTTMIAGFALHGQGKEAIRLFE 478
               +  + ++LI MYAK GS+  A + F + +  N V W  +I  ++ HG G EA+  F+
Sbjct: 771  YDSETEVCNALISMYAKCGSISDAEKQFLEVSTKNEVSWNAIINAYSKHGFGSEALDSFD 830

Query: 479  QMRYEGIIPNDVTFIGVLTACSHAGLLEEGRLYFNMMKDVYAIEPKVEHFTCMVDLYGRA 538
            QM +  + PN VT +GVL+ACSH GL+++G  YF  M   Y + PK EH+ C+VD+  RA
Sbjct: 831  QMIHSNVRPNHVTLVGVLSACSHIGLVDKGIAYFESMNSEYGLSPKPEHYVCVVDMLTRA 890

Query: 539  GRLNEVKEFIYENDLSHLSAVWKAFLSACRLYKDIEMGNWVSEKLFSLEPQDEGPYVLLS 598
            G L+  KEFI E  +   + VW+  LSAC ++K++E+G + +  L  LEP+D   YVLLS
Sbjct: 891  GLLSRAKEFIQEMPIKPDALVWRTLLSACVVHKNMEIGEFAAHHLLELEPEDSATYVLLS 950

Query: 599  NMCSSNQKWEEASRTRKYMQHRGISKTPGQSWIHVKNQVHSFVAGDRSHPQHAQIYAYLD 658
            N+ + ++KW+    TR+ M+ +G+ K PGQSWI VKN +HSF  GD++HP   +I+ Y  
Sbjct: 951  NLYAVSKKWDARDLTRQKMKEKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLADEIHEYFQ 1010

Query: 659  KLIGRLKEIGYLSDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISMSSGIPIRIMKNLR 718
             L  R  EIGY+ D   ++ +++ EQ + ++  HSEKLA+++G++S+ + +PI +MKNLR
Sbjct: 1011 DLTKRASEIGYVQDCFSLLNELQHEQKDPIIFIHSEKLAISFGLLSLPATVPINVMKNLR 1064

Query: 719  VCTDCHNFMKLTSQLLGREIIVRDIHRFHHFNSGCCSCGDYW 758
            VC DCH ++K  S++  REIIVRD +RFHHF  G CSC DYW
Sbjct: 1071 VCNDCHAWIKFVSKVSNREIIVRDAYRFHHFEGGACSCKDYW 1064

BLAST of Lag0006021 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 483.8 bits (1244), Expect = 2.5e-136
Identity = 254/732 (34.70%), Postives = 409/732 (55.87%), Query Frame = 0

Query: 43  FNSCRYHSSNDALAN--------TLHAKMVKCGSILGSGKFVLSSYVK-------SEKLN 102
           ++S R H S   L N         +HA+M+K G  L +  + LS  ++        E L 
Sbjct: 28  YDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIG--LHNTNYALSKLIEFCILSPHFEGLP 87

Query: 103 DAKKVFDEMPSRDVLTWTVLISGFARVNCSEMALQLFREMLVEDICPNHFTLSCVFKLCS 162
            A  VF  +   ++L W  +  G A  +    AL+L+  M+   + PN +T   V K C+
Sbjct: 88  YAISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCA 147

Query: 163 RVGNVQMGKGIHGWILRSGVNLDVVLENSMLDLYAKFDAFDYAKKLFDSMREKSTATYNI 222
           +    + G+ IHG +L+ G +LD+ +  S++ +Y +    + A K+FD    +   +Y  
Sbjct: 148 KSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTA 207

Query: 223 MLGVYVRSCDVNKSLDLFRNLPCRDTASWNTIICGLMQGGYLDTALELLYEMVENEPEFN 282
           ++  Y     +  +  LF  +P +D  SWN +I G  + G    ALEL  +M++     +
Sbjct: 208 LIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPD 267

Query: 283 KVTSSIALSVVSSLLIIELGRQVHGRIVRFGFHNDGFVKSSLINMYIKCGNLEKASVIYS 342
           + T    +S  +    IELGRQVH  I   GF ++  + ++LI++Y KCG LE A  ++ 
Sbjct: 268 ESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFE 327

Query: 343 QMPSDFARKQDSNIVCSDTMTEIVSRSSMVSGYVRNGKYEDAFKTFVSMVREGVLMDKFT 402
           ++P                  +++S ++++ GY     Y++A   F  M+R G   +  T
Sbjct: 328 RLP----------------YKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVT 387

Query: 403 IASVVSACSNAGFLELGRQIHAYIVK--TGEQLDAHLASSLIDMYAKGGSLDCARRIFEQ 462
           + S++ AC++ G +++GR IH YI K   G    + L +SLIDMYAK G ++ A ++F  
Sbjct: 388 MLSILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNS 447

Query: 463 TTYLNVVIWTTMIAGFALHGQGKEAIRLFEQMRYEGIIPNDVTFIGVLTACSHAGLLEEG 522
             + ++  W  MI GFA+HG+   +  LF +MR  GI P+D+TF+G+L+ACSH+G+L+ G
Sbjct: 448 ILHKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLG 507

Query: 523 RLYFNMMKDVYAIEPKVEHFTCMVDLYGRAGRLNEVKEFIYENDLSHLSAVWKAFLSACR 582
           R  F  M   Y + PK+EH+ CM+DL G +G   E +E I   ++     +W + L AC+
Sbjct: 508 RHIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACK 567

Query: 583 LYKDIEMGNWVSEKLFSLEPQDEGPYVLLSNMCSSNQKWEEASRTRKYMQHRGISKTPGQ 642
           ++ ++E+G   +E L  +EP++ G YVLLSN+ +S  +W E ++TR  +  +G+ K PG 
Sbjct: 568 MHGNVELGESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGC 627

Query: 643 SWIHVKNQVHSFVAGDRSHPQHAQIYAYLDKLIGRLKEIGYLSDVKLVMQDVEEEQGEVL 702
           S I + + VH F+ GD+ HP++ +IY  L+++   L++ G++ D   V+Q++EEE  E  
Sbjct: 628 SSIEIDSVVHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGA 687

Query: 703 LGWHSEKLAVAYGIISMSSGIPIRIMKNLRVCTDCHNFMKLTSQLLGREIIVRDIHRFHH 758
           L  HSEKLA+A+G+IS   G  + I+KNLRVC +CH   KL S++  REII RD  RFHH
Sbjct: 688 LRHHSEKLAIAFGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHH 741

BLAST of Lag0006021 vs. TAIR 10
Match: AT2G27610.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 483.0 bits (1242), Expect = 4.3e-136
Identity = 263/782 (33.63%), Postives = 425/782 (54.35%), Query Frame = 0

Query: 52  NDALANTLHAKMVKCGSI--LGSGKFVLSSYVKSEKLNDAKKVFDEMPSRDVLTWTVLIS 111
           ++     LH + +K G +  +  G  ++ +Y+K     D +KVFDEM  R+V+TWT LIS
Sbjct: 108 DELFGRQLHCQCIKFGFLDDVSVGTSLVDTYMKGSNFKDGRKVFDEMKERNVVTWTTLIS 167

Query: 112 GFARVNCSEMALQLFREMLVEDICPNHFTLSCVFKLCSRVGNVQMGKGIHGWILRSGVNL 171
           G+AR + ++  L LF  M  E   PN FT +    + +  G    G  +H  ++++G++ 
Sbjct: 168 GYARNSMNDEVLTLFMRMQNEGTQPNSFTFAAALGVLAEEGVGGRGLQVHTVVVKNGLDK 227

Query: 172 DVVLENSMLDLYAKFDAFDYAKKLFDSMREKSTATYNIMLGVYVRS-------------- 231
            + + NS+++LY K      A+ LFD    KS  T+N M+  Y  +              
Sbjct: 228 TIPVSNSLINLYLKCGNVRKARILFDKTEVKSVVTWNSMISGYAANGLDLEALGMFYSMR 287

Query: 232 -------------------------------CDVNK------------------------ 291
                                          C V K                        
Sbjct: 288 LNYVRLSESSFASVIKLCANLKELRFTEQLHCSVVKYGFLFDQNIRTALMVAYSKCTAML 347

Query: 292 -SLDLFRNLPC-RDTASWNTIICGLMQGGYLDTALELLYEMVENEPEFNKVTSSIALSVV 351
            +L LF+ + C  +  SW  +I G +Q    + A++L  EM       N+ T S+ L+  
Sbjct: 348 DALRLFKEIGCVGNVVSWTAMISGFLQNDGKEEAVDLFSEMKRKGVRPNEFTYSVILTA- 407

Query: 352 SSLLIIELGRQVHGRIVRFGFHNDGFVKSSLINMYIKCGNLEKASVIYSQMPSDFARKQD 411
              L +    +VH ++V+  +     V ++L++ Y+K G +E+A+ ++S +         
Sbjct: 408 ---LPVISPSEVHAQVVKTNYERSSTVGTALLDAYVKLGKVEEAAKVFSGIDD------- 467

Query: 412 SNIVCSDTMTEIVSRSSMVSGYVRNGKYEDAFKTFVSMVREGVLMDKFTIASVVSAC--S 471
                     +IV+ S+M++GY + G+ E A K F  + + G+  ++FT +S+++ C  +
Sbjct: 468 ---------KDIVAWSAMLAGYAQTGETEAAIKMFGELTKGGIKPNEFTFSSILNVCAAT 527

Query: 472 NAGFLELGRQIHAYIVKTGEQLDAHLASSLIDMYAKGGSLDCARRIFEQTTYLNVVIWTT 531
           NA  +  G+Q H + +K+       ++S+L+ MYAK G+++ A  +F++    ++V W +
Sbjct: 528 NAS-MGQGKQFHGFAIKSRLDSSLCVSSALLTMYAKKGNIESAEEVFKRQREKDLVSWNS 587

Query: 532 MIAGFALHGQGKEAIRLFEQMRYEGIIPNDVTFIGVLTACSHAGLLEEGRLYFNMMKDVY 591
           MI+G+A HGQ  +A+ +F++M+   +  + VTFIGV  AC+HAGL+EEG  YF++M    
Sbjct: 588 MISGYAQHGQAMKALDVFKEMKKRKVKMDGVTFIGVFAACTHAGLVEEGEKYFDIMVRDC 647

Query: 592 AIEPKVEHFTCMVDLYGRAGRLNEVKEFIYENDLSHLSAVWKAFLSACRLYKDIEMGNWV 651
            I P  EH +CMVDLY RAG+L +  + I        S +W+  L+ACR++K  E+G   
Sbjct: 648 KIAPTKEHNSCMVDLYSRAGQLEKAMKVIENMPNPAGSTIWRTILAACRVHKKTELGRLA 707

Query: 652 SEKLFSLEPQDEGPYVLLSNMCSSNQKWEEASRTRKYMQHRGISKTPGQSWIHVKNQVHS 711
           +EK+ +++P+D   YVLLSNM + +  W+E ++ RK M  R + K PG SWI VKN+ +S
Sbjct: 708 AEKIIAMKPEDSAAYVLLSNMYAESGDWQERAKVRKLMNERNVKKEPGYSWIEVKNKTYS 767

Query: 712 FVAGDRSHPQHAQIYAYLDKLIGRLKEIGYLSDVKLVMQDVEEEQGEVLLGWHSEKLAVA 758
           F+AGDRSHP   QIY  L+ L  RLK++GY  D   V+QD+++E  E +L  HSE+LA+A
Sbjct: 768 FLAGDRSHPLKDQIYMKLEDLSTRLKDLGYEPDTSYVLQDIDDEHKEAVLAQHSERLAIA 827

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAG7020981.10.0e+0088.64putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyros... [more]
KAG6586149.10.0e+0088.51putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyros... [more]
XP_038889548.10.0e+0088.65putative pentatricopeptide repeat-containing protein At3g23330 [Benincasa hispid... [more]
KAG7029890.10.0e+0087.28putative pentatricopeptide repeat-containing protein [Cucurbita argyrosperma sub... [more]
XP_022965499.10.0e+0089.11LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g23... [more]
Match NameE-valueIdentityDescription
Q9LW632.6e-14637.01Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis th... [more]
Q9SHZ82.5e-14438.09Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX... [more]
Q9SVP71.9e-13634.62Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX... [more]
Q9LN013.6e-13534.70Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
Q9ZUW36.1e-13533.63Pentatricopeptide repeat-containing protein At2g27610 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A6J1HR620.0e+0089.11LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g23... [more]
A0A6J1EPP70.0e+0090.42putative pentatricopeptide repeat-containing protein At3g23330 OS=Cucurbita mosc... [more]
A0A0A0LKI40.0e+0084.96DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G0742... [more]
A0A6J1KA700.0e+0089.72putative pentatricopeptide repeat-containing protein At3g23330 OS=Cucurbita maxi... [more]
A0A1S3B4E30.0e+0085.65LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g23... [more]
Match NameE-valueIdentityDescription
AT3G23330.11.9e-14737.01Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G22070.11.7e-14538.09pentatricopeptide (PPR) repeat-containing protein [more]
AT4G13650.11.3e-13734.62Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G08070.12.5e-13634.70Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G27610.14.3e-13633.63Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 37..157
e-value: 5.0E-18
score: 67.5
coord: 424..646
e-value: 4.1E-35
score: 123.6
coord: 158..277
e-value: 4.1E-20
score: 74.4
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 278..418
e-value: 1.9E-18
score: 68.9
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 235..262
e-value: 4.8E-5
score: 21.3
coord: 452..485
e-value: 4.2E-6
score: 24.6
coord: 103..135
e-value: 2.1E-7
score: 28.7
coord: 354..384
e-value: 3.1E-6
score: 25.0
coord: 204..227
e-value: 0.0014
score: 16.6
coord: 76..101
e-value: 0.0031
score: 15.6
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 175..200
e-value: 0.013
score: 15.7
coord: 204..227
e-value: 0.041
score: 14.1
coord: 235..262
e-value: 5.1E-5
score: 23.2
coord: 307..331
e-value: 0.015
score: 15.5
coord: 524..546
e-value: 0.35
score: 11.2
coord: 424..445
e-value: 1.4
score: 9.3
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 352..396
e-value: 2.4E-7
score: 30.9
coord: 450..497
e-value: 1.7E-11
score: 44.1
coord: 100..146
e-value: 2.3E-9
score: 37.3
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 100..134
score: 11.081932
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 450..484
score: 11.904029
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 349..383
score: 10.062531
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 232..266
score: 9.492543
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 623..746
e-value: 8.6E-40
score: 135.5
NoneNo IPR availablePANTHERPTHR24015:SF1922OS07G0239600 PROTEINcoord: 35..739
NoneNo IPR availablePANTHERPTHR24015OS07G0578800 PROTEIN-RELATEDcoord: 35..739

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lag0006021.1Lag0006021.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding