Cla97C05G089660 (gene) Watermelon (97103) v2

NameCla97C05G089660
Typegene
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionPentatricopeptide repeat-containing protein
LocationCla97Chr05 : 7919758 .. 7921527 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTTTAATCTAAAATGGGTTCTATTAGATTCAGTTAAAGATTGTAAGAGCTTAAGGATCTTTAGACAAATTCATGCTCAGTTGGTGACATCAGGGTTAGTTTACGATGAATTTGTCAAAACCAAAGTGATGGAATTCTTTGCCAATTTTGTTGAGTATGGTGACTATGCCTGTGATTATTTAAAACAAGACAGCACGCATTTAAGTTCATTTCCATTCAACTCGCTGATTAATGGGTATGCTGGTGGTGACTTGCCACAAATGGCGGTTTCAGTTTATAGAAGTATGGTGAGAGATGGGTTTGTGCCTGATATGTTTACTTTTCCAGTCCTTCTGAAAGCATGCTCTAACTTTTCGGGGAGCAGAGAAGGCAGACAGGTTCATGGCGTGGTGGTTAAGTTGGGGATTTTGGCTGATCATTATGTGCAAAACTCACTGATTCGTTGCTATGGAGCTTGTGGGGATTTTTCTTGTGCTGGTAAGGTGTTTGATGAAATGCTTGTTCGAGATGTTGTTTCGTGGAACAGTTTGATATCTGGGTTCATGAAGGCGGGGCATTTTGATGAGGCTATTTCTTTGTTTTTCAGGATGGATGTGGAGCCAAGCATTGCAACTTTAGTCAGTGTGCTTACTGCTTGTGCAAGAAAGGGTGACTTGTGTACGGGGAAGGGAATTCATGGTGTAATCGAGCGAAGGTTTAAGTTGGATTTAGTACTAGGCAATGCAATGCTAGATATGTATGTAAAGAATGGATGTTTGTATGAAGCTAAGAAAATATTTGACGAGCTCCCAACGAGAGATATTGTATCTTGGACTATCATGATCACTGGATTGGTGCAGAGTGACCATCCAAAAGAGTCCTTGGAACTCTTTTCAATGATGCGAACCCTGGGTATTAGCCCTGATGCAATTATTTTAACTAGTGTTCTCTCTGCTTGTGCTAGCCTAGGAACTCTTGACTTCGGCACATGGGTCCATGAGTACATAAATCAAAGAGGAATCAAATGGGATATCCATATTGGAACTGCTATTGTTGACATGTATGCCAAATGTGGATGTATCGAAATGGCACTGCAAATCTTTTACAATATGCCTCAGAGAAATACCTTCACTTGGAATGCCTTGTTATGCGGTCTGGCAATGCATGGACTTGCGCATGAAGCATTAAATCTTTTTGAAGTAATGATAATATCTGGTGTCAAGCCTAACGAGGTAACGTTTCTAGCAATTATGACAGCCTGCTGCCATTCTGGTCTGGTCAACGAAGGGCGCAAGTGTTTTAATAACATGAGTAGTCAACTTCACAATTTGTTGCCAAAGTTGGAGCATTATGGATGCATGATTGATTTGTTCTGTCGAGCTGGACTCCTGGAGGAAGCTGTGGAGTTGACAAGGACCATGCCAATGAAGCCTGATGTGCTTATCTGGGGAGTGATTCTAAATGCTTGCAGAACTGTTGGAAATGTTGAGCTCTCTCATCACATACAAGATTACATCTTGGAACTTGATCCAGAGGATAGTGGAGTATTCGTGCTGTTGTCCAATATATCTGCAACTAATGAAAGATGGTCTGATGTGACTCGATTAAGGAGGTTGATGAAGGATAGAGGTGTGAAAAAAGCACCTGGATCAAGTGTCATTGAGGTGGATGGTAACGCTCACGAGTTTGTGGTTGGAGATATTAGCCACCTCCAAACTGAAGAAATCTACAAAGTGTTAAACCTCATTAACTCTGTCTACCATGAATGCCATTTGATGCATCCATTGTAG

mRNA sequence

ATGTTTAATCTAAAATGGGTTCTATTAGATTCAGTTAAAGATTGTAAGAGCTTAAGGATCTTTAGACAAATTCATGCTCAGTTGGTGACATCAGGGTTAGTTTACGATGAATTTGTCAAAACCAAAGTGATGGAATTCTTTGCCAATTTTGTTGAGTATGGTGACTATGCCTGTGATTATTTAAAACAAGACAGCACGCATTTAAGTTCATTTCCATTCAACTCGCTGATTAATGGGTATGCTGGTGGTGACTTGCCACAAATGGCGGTTTCAGTTTATAGAAGTATGGTGAGAGATGGGTTTGTGCCTGATATGTTTACTTTTCCAGTCCTTCTGAAAGCATGCTCTAACTTTTCGGGGAGCAGAGAAGGCAGACAGGTTCATGGCGTGGTGGTTAAGTTGGGGATTTTGGCTGATCATTATGTGCAAAACTCACTGATTCGTTGCTATGGAGCTTGTGGGGATTTTTCTTGTGCTGGTAAGGTGTTTGATGAAATGCTTGTTCGAGATGTTGTTTCGTGGAACAGTTTGATATCTGGGTTCATGAAGGCGGGGCATTTTGATGAGGCTATTTCTTTGTTTTTCAGGATGGATGTGGAGCCAAGCATTGCAACTTTAGTCAGTGTGCTTACTGCTTGTGCAAGAAAGGGTGACTTGTGTACGGGGAAGGGAATTCATGGTGTAATCGAGCGAAGGTTTAAGTTGGATTTAGTACTAGGCAATGCAATGCTAGATATGTATGTAAAGAATGGATGTTTGTATGAAGCTAAGAAAATATTTGACGAGCTCCCAACGAGAGATATTGTATCTTGGACTATCATGATCACTGGATTGGTGCAGAGTGACCATCCAAAAGAGTCCTTGGAACTCTTTTCAATGATGCGAACCCTGGGTATTAGCCCTGATGCAATTATTTTAACTAGTGTTCTCTCTGCTTGTGCTAGCCTAGGAACTCTTGACTTCGGCACATGGGTCCATGAGTACATAAATCAAAGAGGAATCAAATGGGATATCCATATTGGAACTGCTATTGTTGACATGTATGCCAAATGTGGATGTATCGAAATGGCACTGCAAATCTTTTACAATATGCCTCAGAGAAATACCTTCACTTGGAATGCCTTGTTATGCGGTCTGGCAATGCATGGACTTGCGCATGAAGCATTAAATCTTTTTGAAGTAATGATAATATCTGGTGTCAAGCCTAACGAGGTAACGTTTCTAGCAATTATGACAGCCTGCTGCCATTCTGGTCTGGTCAACGAAGGGCGCAAGTGTTTTAATAACATGAGTAGTCAACTTCACAATTTGTTGCCAAAGTTGGAGCATTATGGATGCATGATTGATTTGTTCTGTCGAGCTGGACTCCTGGAGGAAGCTGTGGAGTTGACAAGGACCATGCCAATGAAGCCTGATGTGCTTATCTGGGGAGTGATTCTAAATGCTTGCAGAACTGTTGGAAATGTTGAGCTCTCTCATCACATACAAGATTACATCTTGGAACTTGATCCAGAGGATAGTGGAGTATTCGTGCTGTTGTCCAATATATCTGCAACTAATGAAAGATGGTCTGATGTGACTCGATTAAGGAGGTTGATGAAGGATAGAGGTGTGAAAAAAGCACCTGGATCAAGTGTCATTGAGGTGGATGGTAACGCTCACGAGTTTGTGGTTGGAGATATTAGCCACCTCCAAACTGAAGAAATCTACAAAGTGTTAAACCTCATTAACTCTGTCTACCATGAATGCCATTTGATGCATCCATTGTAG

Coding sequence (CDS)

ATGTTTAATCTAAAATGGGTTCTATTAGATTCAGTTAAAGATTGTAAGAGCTTAAGGATCTTTAGACAAATTCATGCTCAGTTGGTGACATCAGGGTTAGTTTACGATGAATTTGTCAAAACCAAAGTGATGGAATTCTTTGCCAATTTTGTTGAGTATGGTGACTATGCCTGTGATTATTTAAAACAAGACAGCACGCATTTAAGTTCATTTCCATTCAACTCGCTGATTAATGGGTATGCTGGTGGTGACTTGCCACAAATGGCGGTTTCAGTTTATAGAAGTATGGTGAGAGATGGGTTTGTGCCTGATATGTTTACTTTTCCAGTCCTTCTGAAAGCATGCTCTAACTTTTCGGGGAGCAGAGAAGGCAGACAGGTTCATGGCGTGGTGGTTAAGTTGGGGATTTTGGCTGATCATTATGTGCAAAACTCACTGATTCGTTGCTATGGAGCTTGTGGGGATTTTTCTTGTGCTGGTAAGGTGTTTGATGAAATGCTTGTTCGAGATGTTGTTTCGTGGAACAGTTTGATATCTGGGTTCATGAAGGCGGGGCATTTTGATGAGGCTATTTCTTTGTTTTTCAGGATGGATGTGGAGCCAAGCATTGCAACTTTAGTCAGTGTGCTTACTGCTTGTGCAAGAAAGGGTGACTTGTGTACGGGGAAGGGAATTCATGGTGTAATCGAGCGAAGGTTTAAGTTGGATTTAGTACTAGGCAATGCAATGCTAGATATGTATGTAAAGAATGGATGTTTGTATGAAGCTAAGAAAATATTTGACGAGCTCCCAACGAGAGATATTGTATCTTGGACTATCATGATCACTGGATTGGTGCAGAGTGACCATCCAAAAGAGTCCTTGGAACTCTTTTCAATGATGCGAACCCTGGGTATTAGCCCTGATGCAATTATTTTAACTAGTGTTCTCTCTGCTTGTGCTAGCCTAGGAACTCTTGACTTCGGCACATGGGTCCATGAGTACATAAATCAAAGAGGAATCAAATGGGATATCCATATTGGAACTGCTATTGTTGACATGTATGCCAAATGTGGATGTATCGAAATGGCACTGCAAATCTTTTACAATATGCCTCAGAGAAATACCTTCACTTGGAATGCCTTGTTATGCGGTCTGGCAATGCATGGACTTGCGCATGAAGCATTAAATCTTTTTGAAGTAATGATAATATCTGGTGTCAAGCCTAACGAGGTAACGTTTCTAGCAATTATGACAGCCTGCTGCCATTCTGGTCTGGTCAACGAAGGGCGCAAGTGTTTTAATAACATGAGTAGTCAACTTCACAATTTGTTGCCAAAGTTGGAGCATTATGGATGCATGATTGATTTGTTCTGTCGAGCTGGACTCCTGGAGGAAGCTGTGGAGTTGACAAGGACCATGCCAATGAAGCCTGATGTGCTTATCTGGGGAGTGATTCTAAATGCTTGCAGAACTGTTGGAAATGTTGAGCTCTCTCATCACATACAAGATTACATCTTGGAACTTGATCCAGAGGATAGTGGAGTATTCGTGCTGTTGTCCAATATATCTGCAACTAATGAAAGATGGTCTGATGTGACTCGATTAAGGAGGTTGATGAAGGATAGAGGTGTGAAAAAAGCACCTGGATCAAGTGTCATTGAGGTGGATGGTAACGCTCACGAGTTTGTGGTTGGAGATATTAGCCACCTCCAAACTGAAGAAATCTACAAAGTGTTAAACCTCATTAACTCTGTCTACCATGAATGCCATTTGATGCATCCATTGTAG

Protein sequence

MFNLKWVLLDSVKDCKSLRIFRQIHAQLVTSGLVYDEFVKTKVMEFFANFVEYGDYACDYLKQDSTHLSSFPFNSLINGYAGGDLPQMAVSVYRSMVRDGFVPDMFTFPVLLKACSNFSGSREGRQVHGVVVKLGILADHYVQNSLIRCYGACGDFSCAGKVFDEMLVRDVVSWNSLISGFMKAGHFDEAISLFFRMDVEPSIATLVSVLTACARKGDLCTGKGIHGVIERRFKLDLVLGNAMLDMYVKNGCLYEAKKIFDELPTRDIVSWTIMITGLVQSDHPKESLELFSMMRTLGISPDAIILTSVLSACASLGTLDFGTWVHEYINQRGIKWDIHIGTAIVDMYAKCGCIEMALQIFYNMPQRNTFTWNALLCGLAMHGLAHEALNLFEVMIISGVKPNEVTFLAIMTACCHSGLVNEGRKCFNNMSSQLHNLLPKLEHYGCMIDLFCRAGLLEEAVELTRTMPMKPDVLIWGVILNACRTVGNVELSHHIQDYILELDPEDSGVFVLLSNISATNERWSDVTRLRRLMKDRGVKKAPGSSVIEVDGNAHEFVVGDISHLQTEEIYKVLNLINSVYHECHLMHPL
BLAST of Cla97C05G089660 vs. NCBI nr
Match: KGN60620.1 (hypothetical protein Csa_2G004700 [Cucumis sativus])

HSP 1 Score: 1104.4 bits (2855), Expect = 0.0e+00
Identity = 532/584 (91.10%), Postives = 560/584 (95.89%), Query Frame = 0

Query: 1   MFNLKWVLLDSVKDCKSLRIFRQIHAQLVTSGLVYDEFVKTKVMEFFANFVEYGDYACDY 60
           MFNLKWVLLDS+KDCK+LRIFRQIHAQLVTSGLVYD+FV +KVMEFFANFVEYGDYACDY
Sbjct: 1   MFNLKWVLLDSIKDCKNLRIFRQIHAQLVTSGLVYDDFVTSKVMEFFANFVEYGDYACDY 60

Query: 61  LKQDSTHLSSFPFNSLINGYAGGDLPQMAVSVYRSMVRDGFVPDMFTFPVLLKACSNFSG 120
           L+Q +T L SFPFNSLINGY GG+ PQMAVSVYR MVRDGFVPDMFTFPVLLKACSNFSG
Sbjct: 61  LEQGNTRLGSFPFNSLINGYVGGEFPQMAVSVYRRMVRDGFVPDMFTFPVLLKACSNFSG 120

Query: 121 SREGRQVHGVVVKLGILADHYVQNSLIRCYGACGDFSCAGKVFDEMLVRDVVSWNSLISG 180
           SREGRQVHGVVVKLG+LADHYVQNSLIRCYGACGDFSCAGKVFDEMLVRDVVSWNSLISG
Sbjct: 121 SREGRQVHGVVVKLGLLADHYVQNSLIRCYGACGDFSCAGKVFDEMLVRDVVSWNSLISG 180

Query: 181 FMKAGHFDEAISLFFRMDVEPSIATLVSVLTACARKGDLCTGKGIHGVIERRFKLDLVLG 240
           FMKAGHFDEAIS+FFRMDVEPS+ TLVSVL ACAR GDLCTGKGIHGVIERRFK++LVLG
Sbjct: 181 FMKAGHFDEAISVFFRMDVEPSMTTLVSVLAACARNGDLCTGKGIHGVIERRFKVNLVLG 240

Query: 241 NAMLDMYVKNGCLYEAKKIFDELPTRDIVSWTIMITGLVQSDHPKESLELFSMMRTLGIS 300
           NAMLDMYVKNGC YEAK IFDELPTRDIVSWTIMITGLVQSDHPK+SLELFSMMRTLGIS
Sbjct: 241 NAMLDMYVKNGCFYEAKNIFDELPTRDIVSWTIMITGLVQSDHPKQSLELFSMMRTLGIS 300

Query: 301 PDAIILTSVLSACASLGTLDFGTWVHEYINQRGIKWDIHIGTAIVDMYAKCGCIEMALQI 360
           PDAIILTSVLSACASLGTLDFGTWVHEYINQRGIKWDIHIGTAIVDMYAKCGCIEMAL+I
Sbjct: 301 PDAIILTSVLSACASLGTLDFGTWVHEYINQRGIKWDIHIGTAIVDMYAKCGCIEMALKI 360

Query: 361 FYNMPQRNTFTWNALLCGLAMHGLAHEALNLFEVMIISGVKPNEVTFLAIMTACCHSGLV 420
           FY+M QRNTFTWNALLCGLAMHGL HEALNLFEVMIISGVKPNE+TFLAI+TACCH GLV
Sbjct: 361 FYSMSQRNTFTWNALLCGLAMHGLVHEALNLFEVMIISGVKPNEITFLAILTACCHCGLV 420

Query: 421 NEGRKCFNNMSSQLHNLLPKLEHYGCMIDLFCRAGLLEEAVELTRTMPMKPDVLIWGVIL 480
           +EGRK F+NM S+L+NLLPKLEHYGCMIDLFCRAGLLEEAVEL RTMPMKPDVLIWG++L
Sbjct: 421 DEGRKYFDNM-SKLYNLLPKLEHYGCMIDLFCRAGLLEEAVELARTMPMKPDVLIWGLLL 480

Query: 481 NACRTVGNVELSHHIQDYILELDPEDSGVFVLLSNISATNERWSDVTRLRRLMKDRGVKK 540
           NAC TVGN+ELSH IQDYILELD +DSGVFVLLSNISA N+RWS+VTRLRRLMKDRGV+K
Sbjct: 481 NACTTVGNIELSHRIQDYILELDHDDSGVFVLLSNISAINQRWSNVTRLRRLMKDRGVRK 540

Query: 541 APGSSVIEVDGNAHEFVVGDISHLQTEEIYKVLNLINSVYHECH 585
           APGSSVIEVDG AHEFVVGDISHLQTEEIYKVLNLINSVYHE H
Sbjct: 541 APGSSVIEVDGKAHEFVVGDISHLQTEEIYKVLNLINSVYHESH 583

BLAST of Cla97C05G089660 vs. NCBI nr
Match: XP_008458315.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g38010 isoform X1 [Cucumis melo])

HSP 1 Score: 1097.4 bits (2837), Expect = 0.0e+00
Identity = 531/584 (90.92%), Postives = 558/584 (95.55%), Query Frame = 0

Query: 1   MFNLKWVLLDSVKDCKSLRIFRQIHAQLVTSGLVYDEFVKTKVMEFFANFVEYGDYACDY 60
           MFNLKWVLLDS+KDCK+LRIFRQIHAQLVTSGLVYD+FV +KVMEFFANFVEYGDYACDY
Sbjct: 1   MFNLKWVLLDSIKDCKNLRIFRQIHAQLVTSGLVYDDFVTSKVMEFFANFVEYGDYACDY 60

Query: 61  LKQDSTHLSSFPFNSLINGYAGGDLPQMAVSVYRSMVRDGFVPDMFTFPVLLKACSNFSG 120
           L+Q +T L SFPFNSLINGY GG+ PQ AVSVYR MVRDGFVPDMFTFPVLLKACSNFSG
Sbjct: 61  LEQGNTRLGSFPFNSLINGYVGGEFPQTAVSVYRRMVRDGFVPDMFTFPVLLKACSNFSG 120

Query: 121 SREGRQVHGVVVKLGILADHYVQNSLIRCYGACGDFSCAGKVFDEMLVRDVVSWNSLISG 180
           SREGRQVHGVVVKLG+LAD YVQNSLIRCYGACGD SCAGKVFDEM+VRDVVSWNSLISG
Sbjct: 121 SREGRQVHGVVVKLGLLADLYVQNSLIRCYGACGDLSCAGKVFDEMVVRDVVSWNSLISG 180

Query: 181 FMKAGHFDEAISLFFRMDVEPSIATLVSVLTACARKGDLCTGKGIHGVIERRFKLDLVLG 240
           FMKAGHFDEAIS+FFRMDVEPSIATLVSVL ACAR G+LCTGKGIHGVIERRFK++LVLG
Sbjct: 181 FMKAGHFDEAISVFFRMDVEPSIATLVSVLAACARNGNLCTGKGIHGVIERRFKVNLVLG 240

Query: 241 NAMLDMYVKNGCLYEAKKIFDELPTRDIVSWTIMITGLVQSDHPKESLELFSMMRTLGIS 300
           NAMLDMYVKNGC YEAKK+FDELPTRDIVSWTIMITGLVQSDHPKESLELFSMMRTLGIS
Sbjct: 241 NAMLDMYVKNGCFYEAKKMFDELPTRDIVSWTIMITGLVQSDHPKESLELFSMMRTLGIS 300

Query: 301 PDAIILTSVLSACASLGTLDFGTWVHEYINQRGIKWDIHIGTAIVDMYAKCGCIEMALQI 360
           PDAIILTSVLSACASLGTLDFGTWVHEYINQRGIKWDIH GTAIVDMYAKCGCIEMALQI
Sbjct: 301 PDAIILTSVLSACASLGTLDFGTWVHEYINQRGIKWDIHTGTAIVDMYAKCGCIEMALQI 360

Query: 361 FYNMPQRNTFTWNALLCGLAMHGLAHEALNLFEVMIISGVKPNEVTFLAIMTACCHSGLV 420
           FY+MPQRNTFTWNALLCGLAMHGL HEAL+LFEVM ISGV+PNE+TFLAI+TACCHSGLV
Sbjct: 361 FYSMPQRNTFTWNALLCGLAMHGLVHEALDLFEVMTISGVEPNEITFLAILTACCHSGLV 420

Query: 421 NEGRKCFNNMSSQLHNLLPKLEHYGCMIDLFCRAGLLEEAVELTRTMPMKPDVLIWGVIL 480
           +EGRK F NM S+L+NLLPKLEHYGCMIDLFCRAGLLEEAVEL RTMPMKPD+LIWGV+L
Sbjct: 421 DEGRKYFENM-SKLYNLLPKLEHYGCMIDLFCRAGLLEEAVELARTMPMKPDMLIWGVLL 480

Query: 481 NACRTVGNVELSHHIQDYILELDPEDSGVFVLLSNISATNERWSDVTRLRRLMKDRGVKK 540
           NAC TVGNVELSH IQDYILELD +DSGVFVLLSNISA N+RWS+VTRLRRLMKDRGVKK
Sbjct: 481 NACTTVGNVELSHRIQDYILELDHDDSGVFVLLSNISAINQRWSNVTRLRRLMKDRGVKK 540

Query: 541 APGSSVIEVDGNAHEFVVGDISHLQTEEIYKVLNLINSVYHECH 585
           APGSSVIEVDG AHEFVVGDISHLQTEEIYKVLNLINSVYHE H
Sbjct: 541 APGSSVIEVDGKAHEFVVGDISHLQTEEIYKVLNLINSVYHESH 583

BLAST of Cla97C05G089660 vs. NCBI nr
Match: XP_022968019.1 (pentatricopeptide repeat-containing protein At4g38010 isoform X1 [Cucurbita maxima])

HSP 1 Score: 1080.5 bits (2793), Expect = 0.0e+00
Identity = 521/588 (88.61%), Postives = 554/588 (94.22%), Query Frame = 0

Query: 1   MFNLKWVLLDSVKDCKSLRIFRQIHAQLVTSGLVYDEFVKTKVMEFFANFVEYGDYACDY 60
           MFNLKWVLLDS+KDCK+LRIF++IHAQLVTSGLVYD+FV  KV+EFFANFVE+GDYACDY
Sbjct: 1   MFNLKWVLLDSIKDCKNLRIFKKIHAQLVTSGLVYDDFVTNKVVEFFANFVEFGDYACDY 60

Query: 61  LKQDSTHLSSFPFNSLINGYAGGDLPQMAVSVYRSMVRDGFVPDMFTFPVLLKACSNFSG 120
           LKQ +T L SFPFNSLINGYAGG+ PQMAVSVYR M RDGFVPD+FTFPVL KACSNFSG
Sbjct: 61  LKQVNTRLGSFPFNSLINGYAGGEFPQMAVSVYRRMARDGFVPDLFTFPVLFKACSNFSG 120

Query: 121 SREGRQVHGVVVKLGILADHYVQNSLIRCYGACGDFSCAGKVFDEMLVRDVVSWNSLISG 180
           SREGRQVHGVV+KLGIL+D +VQNSL+RCYGAC DFSCAGKVFDEMLVRDVVSWNSLISG
Sbjct: 121 SREGRQVHGVVIKLGILSDLFVQNSLVRCYGACEDFSCAGKVFDEMLVRDVVSWNSLISG 180

Query: 181 FMKAGHFDEAISLFFRMDVEPSIATLVSVLTACARKGDLCTGKGIHGVIERRFKLDLVLG 240
           FMKAG FD+AISLFFRMDVEPS+ATLVSVL ACARKGDL  GKGIHG+I+RRFKLDLVLG
Sbjct: 181 FMKAGRFDDAISLFFRMDVEPSVATLVSVLAACARKGDLYMGKGIHGMIQRRFKLDLVLG 240

Query: 241 NAMLDMYVKNGCLYEAKKIFDELPTRDIVSWTIMITGLVQSDHPKESLELFSMMRTLGIS 300
           NAMLDMY KNGCLYEAK IFDELPTRDIVSWTIMITGLVQS+HPKESLELF MMR LGIS
Sbjct: 241 NAMLDMYAKNGCLYEAKNIFDELPTRDIVSWTIMITGLVQSNHPKESLELFWMMRNLGIS 300

Query: 301 PDAIILTSVLSACASLGTLDFGTWVHEYINQRGIKWDIHIGTAIVDMYAKCGCIEMALQI 360
           PD IILTSVLSACASLGTL +GTWVHEYINQRGIKWDIHIGTAIVDMYAKCGC+EMA QI
Sbjct: 301 PDGIILTSVLSACASLGTLKYGTWVHEYINQRGIKWDIHIGTAIVDMYAKCGCVEMARQI 360

Query: 361 FYNMPQRNTFTWNALLCGLAMHGLAHEALNLFEVMIISGVKPNEVTFLAIMTACCHSGLV 420
           F +MPQRNTFTWNALLCGLAMHGLAHEAL LFEVMIISGVK NEVTFLAI+TACCHSGLV
Sbjct: 361 FNSMPQRNTFTWNALLCGLAMHGLAHEALYLFEVMIISGVKTNEVTFLAILTACCHSGLV 420

Query: 421 NEGRKCFNNMSSQLHNLLPKLEHYGCMIDLFCRAGLLEEAVELTRTMPMKPDVLIWGVIL 480
           +EGRK F+NMSSQ +NL PKLEHYGCMIDLFCRAGLLEEAVEL RTMPMKPDVLIWGV+L
Sbjct: 421 DEGRKYFDNMSSQRYNLSPKLEHYGCMIDLFCRAGLLEEAVELVRTMPMKPDVLIWGVLL 480

Query: 481 NACRTVGNVELSHHIQDYILELDPEDSGVFVLLSNISATNERWSDVTRLRRLMKDRGVKK 540
           NAC+TVGNVELS HIQ+YILELDPEDSGVFVLLSNISATNERWS+VTRLRRLMKDRGVKK
Sbjct: 481 NACKTVGNVELSQHIQEYILELDPEDSGVFVLLSNISATNERWSNVTRLRRLMKDRGVKK 540

Query: 541 APGSSVIEVDGNAHEFVVGDISHLQTEEIYKVLNLINSVYHECHLMHP 589
           +PGSSVIEVDG AHEFV GDIS+LQTEEIYKVL LINSV+HE HLMHP
Sbjct: 541 SPGSSVIEVDGKAHEFVAGDISNLQTEEIYKVLTLINSVFHESHLMHP 588

BLAST of Cla97C05G089660 vs. NCBI nr
Match: XP_022945046.1 (pentatricopeptide repeat-containing protein At4g38010 isoform X1 [Cucurbita moschata])

HSP 1 Score: 1066.6 bits (2757), Expect = 3.0e-308
Identity = 515/589 (87.44%), Postives = 550/589 (93.38%), Query Frame = 0

Query: 1   MFNLKWVLLDSVKDCKSLRIFRQIHAQLVTSGLVYDEFVKTKVMEFFANFVEYGDYACDY 60
           MFNLKWVLLDS+KDCK+LRIF++IHAQLV SGLVYD+FV  KV+EFFANFVE+GDYACDY
Sbjct: 1   MFNLKWVLLDSIKDCKNLRIFKKIHAQLVASGLVYDDFVTNKVVEFFANFVEFGDYACDY 60

Query: 61  LKQDSTHLSSFPFNSLINGYAGGDLPQMAVSVYRSMVRDGFVPDMFTFPVLLKACSNFSG 120
           LKQ +  L SFPFNSLINGYAGG+ PQMAVSVYR M RDGFVPD+FTFPVL KACSNFSG
Sbjct: 61  LKQVNIRLGSFPFNSLINGYAGGEFPQMAVSVYRRMARDGFVPDLFTFPVLFKACSNFSG 120

Query: 121 SREGRQVHGVVVKLGILADHYVQNSLIRCYGACGDFSCAGKVFDEMLVRDVVSWNSLISG 180
           SREGRQVHGVVVKLGIL+D +VQNSL+ CYGAC DFSCAGKVFDEMLVRDVVSWNSLISG
Sbjct: 121 SREGRQVHGVVVKLGILSDLFVQNSLVCCYGACEDFSCAGKVFDEMLVRDVVSWNSLISG 180

Query: 181 FMKAGHFDEAISLFFRMDVEPSIATLVSVLTACARKGDLCTGKGIHGVIERRFKLDLVLG 240
           FMKAG FD+AISLFFRMDVEPS+ATLVSVL ACARKG+L  GKGIHG+I+RRFKLDLVLG
Sbjct: 181 FMKAGRFDDAISLFFRMDVEPSVATLVSVLAACARKGELYMGKGIHGMIQRRFKLDLVLG 240

Query: 241 NAMLDMYVKNGCLYEAKKIFDELPTRDIVSWTIMITGLVQSDHPKESLELFSMMRTLGIS 300
           NAMLDMY KNGCLYEAK IFDELPTRDIVSWTIMITGLVQS+HPKESLELF MMR LGIS
Sbjct: 241 NAMLDMYAKNGCLYEAKNIFDELPTRDIVSWTIMITGLVQSNHPKESLELFWMMRNLGIS 300

Query: 301 PDAIILTSVLSACASLGTLDFGTWVHEYINQRGIKWDIHIGTAIVDMYAKCGCIEMALQI 360
           PD IILTSVLSACASLGTL++GTWVHEYI+QRGIKWDIHIGTAIVDMYAKCGC+EMA QI
Sbjct: 301 PDGIILTSVLSACASLGTLEYGTWVHEYIDQRGIKWDIHIGTAIVDMYAKCGCVEMARQI 360

Query: 361 FYNMPQRNTFTWNALLCGLAMHGLAHEALNLFEVMIISGVKPNEVTFLAIMTACCHSGLV 420
           F NMPQRNTFTWNALLCGLAMHGLAHEAL LFEVMIISGVKPNEVTFLAI+TACCHSGLV
Sbjct: 361 FNNMPQRNTFTWNALLCGLAMHGLAHEALFLFEVMIISGVKPNEVTFLAILTACCHSGLV 420

Query: 421 NEGRKCFNNMSSQLHNLLPKLEHYGCMIDLFCRAGLLEEAVELTRTMPMKPDVLIWGVIL 480
           +EGRK F+NMSSQ +NL PKLEHYGCMIDL CRAGLLEEAVEL RTMPMKPDVLIWGV+L
Sbjct: 421 DEGRKYFDNMSSQTYNLSPKLEHYGCMIDLLCRAGLLEEAVELVRTMPMKPDVLIWGVLL 480

Query: 481 NACRTVGNVELSHHIQDYILELDPEDSGVFVLLSNISATNERWSDVTRLRRLMKDRGVKK 540
           NAC+TVGN+ELS HIQ+YILELD EDSGVFVLLSNISATNERWS+VTRLRRLMKDRGVKK
Sbjct: 481 NACKTVGNIELSQHIQEYILELDTEDSGVFVLLSNISATNERWSNVTRLRRLMKDRGVKK 540

Query: 541 APGSSVIEVDGNAHEFVVGDISHLQTEEIYKVLNLINSVYHECHLMHPL 590
           +PGSSVIEVDG AHEFV GDIS+L+ EEIYKVL LINSV HE HLMHPL
Sbjct: 541 SPGSSVIEVDGKAHEFVAGDISNLEIEEIYKVLTLINSVLHESHLMHPL 589

BLAST of Cla97C05G089660 vs. NCBI nr
Match: XP_023541467.1 (pentatricopeptide repeat-containing protein At4g38010 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1060.4 bits (2741), Expect = 2.2e-306
Identity = 513/582 (88.14%), Postives = 546/582 (93.81%), Query Frame = 0

Query: 8   LLDSVKDCKSLRIFRQIHAQLVTSGLVYDEFVKTKVMEFFANFVEYGDYACDYLKQDSTH 67
           LLDS+KDCK+LRIF++IHAQLVTSGLVYD+FV  KV+EFFANFVE+GDYACDYLKQ +T 
Sbjct: 3   LLDSIKDCKNLRIFKKIHAQLVTSGLVYDDFVTNKVVEFFANFVEFGDYACDYLKQLNTR 62

Query: 68  LSSFPFNSLINGYAGGDLPQMAVSVYRSMVRDGFVPDMFTFPVLLKACSNFSGSREGRQV 127
           L SFPFNSLINGYAGG+ PQMAVSVYR M RDGFVPD+FTFPVL KACSNFSGSREGRQV
Sbjct: 63  LGSFPFNSLINGYAGGEFPQMAVSVYRRMARDGFVPDLFTFPVLFKACSNFSGSREGRQV 122

Query: 128 HGVVVKLGILADHYVQNSLIRCYGACGDFSCAGKVFDEMLVRDVVSWNSLISGFMKAGHF 187
           HGVVVKLGIL+D +VQNSL+ CYGAC DFSCAGKVFDEMLVRDVVSWNSLISGFMKAG F
Sbjct: 123 HGVVVKLGILSDIFVQNSLVCCYGACEDFSCAGKVFDEMLVRDVVSWNSLISGFMKAGRF 182

Query: 188 DEAISLFFRMDVEPSIATLVSVLTACARKGDLCTGKGIHGVIERRFKLDLVLGNAMLDMY 247
           D+AISLFFRMDVEPS+ATLVSVL ACARKGDL  GKGIHG+I+RRFKLDLVLGNAMLDMY
Sbjct: 183 DDAISLFFRMDVEPSVATLVSVLAACARKGDLYMGKGIHGMIQRRFKLDLVLGNAMLDMY 242

Query: 248 VKNGCLYEAKKIFDELPTRDIVSWTIMITGLVQSDHPKESLELFSMMRTLGISPDAIILT 307
            KNGCLYEAK IFDELPTRDIVSWTIMITGLVQS+HPKESLELF MMR LGISPD IILT
Sbjct: 243 AKNGCLYEAKNIFDELPTRDIVSWTIMITGLVQSNHPKESLELFWMMRNLGISPDGIILT 302

Query: 308 SVLSACASLGTLDFGTWVHEYINQRGIKWDIHIGTAIVDMYAKCGCIEMALQIFYNMPQR 367
           SVLSACASLGTL++GTWVHEYI+QRGIKWDIHIGTAIVDMYAKCGC+EMA QIF NMPQR
Sbjct: 303 SVLSACASLGTLEYGTWVHEYIDQRGIKWDIHIGTAIVDMYAKCGCVEMARQIFNNMPQR 362

Query: 368 NTFTWNALLCGLAMHGLAHEALNLFEVMIISGVKPNEVTFLAIMTACCHSGLVNEGRKCF 427
           NTFTWNA LCGLAMHGLAHEAL LFEVMIISGVKPNEVTFLAI+TACCHSGLV+EGRK F
Sbjct: 363 NTFTWNAWLCGLAMHGLAHEALYLFEVMIISGVKPNEVTFLAILTACCHSGLVDEGRKYF 422

Query: 428 NNMSSQLHNLLPKLEHYGCMIDLFCRAGLLEEAVELTRTMPMKPDVLIWGVILNACRTVG 487
           +NMSSQ +NL PKLEHYGCMIDLFCRAGLLEEAVEL RTMPMKPDVLIWGV+LNAC+TVG
Sbjct: 423 DNMSSQRYNLSPKLEHYGCMIDLFCRAGLLEEAVELVRTMPMKPDVLIWGVLLNACKTVG 482

Query: 488 NVELSHHIQDYILELDPEDSGVFVLLSNISATNERWSDVTRLRRLMKDRGVKKAPGSSVI 547
           NVELS HIQ+YILELDPEDSGVFVLLSNISATNERWS+VTRLRRLMKDRGVKK+PGSSVI
Sbjct: 483 NVELSQHIQEYILELDPEDSGVFVLLSNISATNERWSNVTRLRRLMKDRGVKKSPGSSVI 542

Query: 548 EVDGNAHEFVVGDISHLQTEEIYKVLNLINSVYHECHLMHPL 590
           EVDG AHEFV GDIS L+ EEIYKVL LINSV+HE HLMHPL
Sbjct: 543 EVDGKAHEFVAGDISTLEIEEIYKVLTLINSVFHESHLMHPL 584

BLAST of Cla97C05G089660 vs. TrEMBL
Match: tr|A0A0A0LF19|A0A0A0LF19_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G004700 PE=4 SV=1)

HSP 1 Score: 1104.4 bits (2855), Expect = 0.0e+00
Identity = 532/584 (91.10%), Postives = 560/584 (95.89%), Query Frame = 0

Query: 1   MFNLKWVLLDSVKDCKSLRIFRQIHAQLVTSGLVYDEFVKTKVMEFFANFVEYGDYACDY 60
           MFNLKWVLLDS+KDCK+LRIFRQIHAQLVTSGLVYD+FV +KVMEFFANFVEYGDYACDY
Sbjct: 1   MFNLKWVLLDSIKDCKNLRIFRQIHAQLVTSGLVYDDFVTSKVMEFFANFVEYGDYACDY 60

Query: 61  LKQDSTHLSSFPFNSLINGYAGGDLPQMAVSVYRSMVRDGFVPDMFTFPVLLKACSNFSG 120
           L+Q +T L SFPFNSLINGY GG+ PQMAVSVYR MVRDGFVPDMFTFPVLLKACSNFSG
Sbjct: 61  LEQGNTRLGSFPFNSLINGYVGGEFPQMAVSVYRRMVRDGFVPDMFTFPVLLKACSNFSG 120

Query: 121 SREGRQVHGVVVKLGILADHYVQNSLIRCYGACGDFSCAGKVFDEMLVRDVVSWNSLISG 180
           SREGRQVHGVVVKLG+LADHYVQNSLIRCYGACGDFSCAGKVFDEMLVRDVVSWNSLISG
Sbjct: 121 SREGRQVHGVVVKLGLLADHYVQNSLIRCYGACGDFSCAGKVFDEMLVRDVVSWNSLISG 180

Query: 181 FMKAGHFDEAISLFFRMDVEPSIATLVSVLTACARKGDLCTGKGIHGVIERRFKLDLVLG 240
           FMKAGHFDEAIS+FFRMDVEPS+ TLVSVL ACAR GDLCTGKGIHGVIERRFK++LVLG
Sbjct: 181 FMKAGHFDEAISVFFRMDVEPSMTTLVSVLAACARNGDLCTGKGIHGVIERRFKVNLVLG 240

Query: 241 NAMLDMYVKNGCLYEAKKIFDELPTRDIVSWTIMITGLVQSDHPKESLELFSMMRTLGIS 300
           NAMLDMYVKNGC YEAK IFDELPTRDIVSWTIMITGLVQSDHPK+SLELFSMMRTLGIS
Sbjct: 241 NAMLDMYVKNGCFYEAKNIFDELPTRDIVSWTIMITGLVQSDHPKQSLELFSMMRTLGIS 300

Query: 301 PDAIILTSVLSACASLGTLDFGTWVHEYINQRGIKWDIHIGTAIVDMYAKCGCIEMALQI 360
           PDAIILTSVLSACASLGTLDFGTWVHEYINQRGIKWDIHIGTAIVDMYAKCGCIEMAL+I
Sbjct: 301 PDAIILTSVLSACASLGTLDFGTWVHEYINQRGIKWDIHIGTAIVDMYAKCGCIEMALKI 360

Query: 361 FYNMPQRNTFTWNALLCGLAMHGLAHEALNLFEVMIISGVKPNEVTFLAIMTACCHSGLV 420
           FY+M QRNTFTWNALLCGLAMHGL HEALNLFEVMIISGVKPNE+TFLAI+TACCH GLV
Sbjct: 361 FYSMSQRNTFTWNALLCGLAMHGLVHEALNLFEVMIISGVKPNEITFLAILTACCHCGLV 420

Query: 421 NEGRKCFNNMSSQLHNLLPKLEHYGCMIDLFCRAGLLEEAVELTRTMPMKPDVLIWGVIL 480
           +EGRK F+NM S+L+NLLPKLEHYGCMIDLFCRAGLLEEAVEL RTMPMKPDVLIWG++L
Sbjct: 421 DEGRKYFDNM-SKLYNLLPKLEHYGCMIDLFCRAGLLEEAVELARTMPMKPDVLIWGLLL 480

Query: 481 NACRTVGNVELSHHIQDYILELDPEDSGVFVLLSNISATNERWSDVTRLRRLMKDRGVKK 540
           NAC TVGN+ELSH IQDYILELD +DSGVFVLLSNISA N+RWS+VTRLRRLMKDRGV+K
Sbjct: 481 NACTTVGNIELSHRIQDYILELDHDDSGVFVLLSNISAINQRWSNVTRLRRLMKDRGVRK 540

Query: 541 APGSSVIEVDGNAHEFVVGDISHLQTEEIYKVLNLINSVYHECH 585
           APGSSVIEVDG AHEFVVGDISHLQTEEIYKVLNLINSVYHE H
Sbjct: 541 APGSSVIEVDGKAHEFVVGDISHLQTEEIYKVLNLINSVYHESH 583

BLAST of Cla97C05G089660 vs. TrEMBL
Match: tr|A0A1S3C7P6|A0A1S3C7P6_CUCME (pentatricopeptide repeat-containing protein At4g38010 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103497763 PE=4 SV=1)

HSP 1 Score: 1097.4 bits (2837), Expect = 0.0e+00
Identity = 531/584 (90.92%), Postives = 558/584 (95.55%), Query Frame = 0

Query: 1   MFNLKWVLLDSVKDCKSLRIFRQIHAQLVTSGLVYDEFVKTKVMEFFANFVEYGDYACDY 60
           MFNLKWVLLDS+KDCK+LRIFRQIHAQLVTSGLVYD+FV +KVMEFFANFVEYGDYACDY
Sbjct: 1   MFNLKWVLLDSIKDCKNLRIFRQIHAQLVTSGLVYDDFVTSKVMEFFANFVEYGDYACDY 60

Query: 61  LKQDSTHLSSFPFNSLINGYAGGDLPQMAVSVYRSMVRDGFVPDMFTFPVLLKACSNFSG 120
           L+Q +T L SFPFNSLINGY GG+ PQ AVSVYR MVRDGFVPDMFTFPVLLKACSNFSG
Sbjct: 61  LEQGNTRLGSFPFNSLINGYVGGEFPQTAVSVYRRMVRDGFVPDMFTFPVLLKACSNFSG 120

Query: 121 SREGRQVHGVVVKLGILADHYVQNSLIRCYGACGDFSCAGKVFDEMLVRDVVSWNSLISG 180
           SREGRQVHGVVVKLG+LAD YVQNSLIRCYGACGD SCAGKVFDEM+VRDVVSWNSLISG
Sbjct: 121 SREGRQVHGVVVKLGLLADLYVQNSLIRCYGACGDLSCAGKVFDEMVVRDVVSWNSLISG 180

Query: 181 FMKAGHFDEAISLFFRMDVEPSIATLVSVLTACARKGDLCTGKGIHGVIERRFKLDLVLG 240
           FMKAGHFDEAIS+FFRMDVEPSIATLVSVL ACAR G+LCTGKGIHGVIERRFK++LVLG
Sbjct: 181 FMKAGHFDEAISVFFRMDVEPSIATLVSVLAACARNGNLCTGKGIHGVIERRFKVNLVLG 240

Query: 241 NAMLDMYVKNGCLYEAKKIFDELPTRDIVSWTIMITGLVQSDHPKESLELFSMMRTLGIS 300
           NAMLDMYVKNGC YEAKK+FDELPTRDIVSWTIMITGLVQSDHPKESLELFSMMRTLGIS
Sbjct: 241 NAMLDMYVKNGCFYEAKKMFDELPTRDIVSWTIMITGLVQSDHPKESLELFSMMRTLGIS 300

Query: 301 PDAIILTSVLSACASLGTLDFGTWVHEYINQRGIKWDIHIGTAIVDMYAKCGCIEMALQI 360
           PDAIILTSVLSACASLGTLDFGTWVHEYINQRGIKWDIH GTAIVDMYAKCGCIEMALQI
Sbjct: 301 PDAIILTSVLSACASLGTLDFGTWVHEYINQRGIKWDIHTGTAIVDMYAKCGCIEMALQI 360

Query: 361 FYNMPQRNTFTWNALLCGLAMHGLAHEALNLFEVMIISGVKPNEVTFLAIMTACCHSGLV 420
           FY+MPQRNTFTWNALLCGLAMHGL HEAL+LFEVM ISGV+PNE+TFLAI+TACCHSGLV
Sbjct: 361 FYSMPQRNTFTWNALLCGLAMHGLVHEALDLFEVMTISGVEPNEITFLAILTACCHSGLV 420

Query: 421 NEGRKCFNNMSSQLHNLLPKLEHYGCMIDLFCRAGLLEEAVELTRTMPMKPDVLIWGVIL 480
           +EGRK F NM S+L+NLLPKLEHYGCMIDLFCRAGLLEEAVEL RTMPMKPD+LIWGV+L
Sbjct: 421 DEGRKYFENM-SKLYNLLPKLEHYGCMIDLFCRAGLLEEAVELARTMPMKPDMLIWGVLL 480

Query: 481 NACRTVGNVELSHHIQDYILELDPEDSGVFVLLSNISATNERWSDVTRLRRLMKDRGVKK 540
           NAC TVGNVELSH IQDYILELD +DSGVFVLLSNISA N+RWS+VTRLRRLMKDRGVKK
Sbjct: 481 NACTTVGNVELSHRIQDYILELDHDDSGVFVLLSNISAINQRWSNVTRLRRLMKDRGVKK 540

Query: 541 APGSSVIEVDGNAHEFVVGDISHLQTEEIYKVLNLINSVYHECH 585
           APGSSVIEVDG AHEFVVGDISHLQTEEIYKVLNLINSVYHE H
Sbjct: 541 APGSSVIEVDGKAHEFVVGDISHLQTEEIYKVLNLINSVYHESH 583

BLAST of Cla97C05G089660 vs. TrEMBL
Match: tr|A0A1S3C7K9|A0A1S3C7K9_CUCME (pentatricopeptide repeat-containing protein At4g38010 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103497763 PE=4 SV=1)

HSP 1 Score: 879.4 bits (2271), Expect = 4.5e-252
Identity = 427/465 (91.83%), Postives = 447/465 (96.13%), Query Frame = 0

Query: 120 GSREGRQVHGVVVKLGILADHYVQNSLIRCYGACGDFSCAGKVFDEMLVRDVVSWNSLIS 179
           GSREGRQVHGVVVKLG+LAD YVQNSLIRCYGACGD SCAGKVFDEM+VRDVVSWNSLIS
Sbjct: 2   GSREGRQVHGVVVKLGLLADLYVQNSLIRCYGACGDLSCAGKVFDEMVVRDVVSWNSLIS 61

Query: 180 GFMKAGHFDEAISLFFRMDVEPSIATLVSVLTACARKGDLCTGKGIHGVIERRFKLDLVL 239
           GFMKAGHFDEAIS+FFRMDVEPSIATLVSVL ACAR G+LCTGKGIHGVIERRFK++LVL
Sbjct: 62  GFMKAGHFDEAISVFFRMDVEPSIATLVSVLAACARNGNLCTGKGIHGVIERRFKVNLVL 121

Query: 240 GNAMLDMYVKNGCLYEAKKIFDELPTRDIVSWTIMITGLVQSDHPKESLELFSMMRTLGI 299
           GNAMLDMYVKNGC YEAKK+FDELPTRDIVSWTIMITGLVQSDHPKESLELFSMMRTLGI
Sbjct: 122 GNAMLDMYVKNGCFYEAKKMFDELPTRDIVSWTIMITGLVQSDHPKESLELFSMMRTLGI 181

Query: 300 SPDAIILTSVLSACASLGTLDFGTWVHEYINQRGIKWDIHIGTAIVDMYAKCGCIEMALQ 359
           SPDAIILTSVLSACASLGTLDFGTWVHEYINQRGIKWDIH GTAIVDMYAKCGCIEMALQ
Sbjct: 182 SPDAIILTSVLSACASLGTLDFGTWVHEYINQRGIKWDIHTGTAIVDMYAKCGCIEMALQ 241

Query: 360 IFYNMPQRNTFTWNALLCGLAMHGLAHEALNLFEVMIISGVKPNEVTFLAIMTACCHSGL 419
           IFY+MPQRNTFTWNALLCGLAMHGL HEAL+LFEVM ISGV+PNE+TFLAI+TACCHSGL
Sbjct: 242 IFYSMPQRNTFTWNALLCGLAMHGLVHEALDLFEVMTISGVEPNEITFLAILTACCHSGL 301

Query: 420 VNEGRKCFNNMSSQLHNLLPKLEHYGCMIDLFCRAGLLEEAVELTRTMPMKPDVLIWGVI 479
           V+EGRK F NM S+L+NLLPKLEHYGCMIDLFCRAGLLEEAVEL RTMPMKPD+LIWGV+
Sbjct: 302 VDEGRKYFENM-SKLYNLLPKLEHYGCMIDLFCRAGLLEEAVELARTMPMKPDMLIWGVL 361

Query: 480 LNACRTVGNVELSHHIQDYILELDPEDSGVFVLLSNISATNERWSDVTRLRRLMKDRGVK 539
           LNAC TVGNVELSH IQDYILELD +DSGVFVLLSNISA N+RWS+VTRLRRLMKDRGVK
Sbjct: 362 LNACTTVGNVELSHRIQDYILELDHDDSGVFVLLSNISAINQRWSNVTRLRRLMKDRGVK 421

Query: 540 KAPGSSVIEVDGNAHEFVVGDISHLQTEEIYKVLNLINSVYHECH 585
           KAPGSSVIEVDG AHEFVVGDISHLQTEEIYKVLNLINSVYHE H
Sbjct: 422 KAPGSSVIEVDGKAHEFVVGDISHLQTEEIYKVLNLINSVYHESH 465

BLAST of Cla97C05G089660 vs. TrEMBL
Match: tr|A0A2N9FVF6|A0A2N9FVF6_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS18831 PE=4 SV=1)

HSP 1 Score: 778.1 bits (2008), Expect = 1.4e-221
Identity = 370/589 (62.82%), Postives = 457/589 (77.59%), Query Frame = 0

Query: 1   MFNLKWVLLDSVKDCKSLRIFRQIHAQLVTSGLVYDEFVKTKVMEFFANFVEYGDYACDY 60
           M + KWVLLD +  C + R F+QIHAQL+TSG+V DE V  KV EF   FVE+ +Y CD 
Sbjct: 1   MLSRKWVLLDFIHRCNNFRFFKQIHAQLLTSGIVRDELVVNKVAEFLGKFVEFVEYGCDI 60

Query: 61  LKQDSTHLSSFPFNSLINGYAGGDLPQMAVSVYRSMVRDGFVPDMFTFPVLLKACSNFSG 120
           LKQ+   +SSFP N LI+ YA  D P++A  VYR +VRDGF+PD +TFPV+LK+C+ F G
Sbjct: 61  LKQNDWCISSFPSNLLISSYASSDTPRVAFLVYRRIVRDGFMPDRYTFPVVLKSCTKFLG 120

Query: 121 SREGRQVHGVVVKLGILADHYVQNSLIRCYGACGDFSCAGKVFDEMLVRDVVSWNSLISG 180
           S EGRQVHGVVVK+G   D +V NSL+  Y  CGDF  A +VFD+MLVRDVVSW  LISG
Sbjct: 121 SGEGRQVHGVVVKMGFKGDVFVGNSLVHFYSVCGDFCAATRVFDDMLVRDVVSWTGLISG 180

Query: 181 FMKAGHFDEAISLFFRMDVEPSIATLVSVLTACARKGDLCTGKGIHGVIERRFKLDLVLG 240
           +++AG FDEA++LF RMDV P++A+ VSVL AC R G L  GKGIHG+I +R  +DLV+G
Sbjct: 181 YVRAGLFDEAVALFLRMDVRPNVASFVSVLVACGRMGYLSLGKGIHGLIFKRVGMDLVVG 240

Query: 241 NAMLDMYVKNGCLYEAKKIFDELPTRDIVSWTIMITGLVQSDHPKESLELFSMMRTLGIS 300
           N ++DMYVK  CL EAK+IFDELP RDIVSWT MI+GLVQ +HPKESLELF  M   GI 
Sbjct: 241 NVIMDMYVKCECLSEAKQIFDELPERDIVSWTTMISGLVQCEHPKESLELFHKMHGSGIE 300

Query: 301 PDAIILTSVLSACASLGTLDFGTWVHEYINQRGIKWDIHIGTAIVDMYAKCGCIEMALQI 360
           PD  +LT+VLSACASLG LD+G W++EYI  RGIK D HIGTA++DMYAKCGC+EMALQ 
Sbjct: 301 PDRFLLTTVLSACASLGALDYGRWINEYIYLRGIKCDTHIGTAMIDMYAKCGCVEMALQT 360

Query: 361 FYNMPQRNTFTWNALLCGLAMHGLAHEALNLFEVMIISGVKPNEVTFLAIMTACCHSGLV 420
           F  MP +N +TWNALL GLAMHG  HE L  FE MI SG++PNEVTFL+I+TACCHSGLV
Sbjct: 361 FNGMPYKNVYTWNALLGGLAMHGHGHEVLKHFEEMIKSGMRPNEVTFLSILTACCHSGLV 420

Query: 421 NEGRKCFNNMSSQLHNLLPKLEHYGCMIDLFCRAGLLEEAVELTRTMPMKPDVLIWGVIL 480
           +EGR+ F  M S+ HNL P+LEHYGCM+D+ CRA +L+EA EL + MPM PDV IWG +L
Sbjct: 421 DEGRRYFYQMVSRQHNLSPRLEHYGCMVDMLCRAEILDEAQELIKIMPMSPDVRIWGALL 480

Query: 481 NACRTVGNVELSHHIQDYILELDPEDSGVFVLLSNISATNERWSDVTRLRRLMKDRGVKK 540
           +AC+  GN+ELS  I D +LE + +DSGV+VLLSNI ATN+RW++VTR+RRLMKD+G+KK
Sbjct: 481 SACKASGNIELSQEILDRLLEHETQDSGVYVLLSNIYATNQRWAEVTRVRRLMKDKGIKK 540

Query: 541 APGSSVIEVDGNAHEFVVGDISHLQTEEIYKVLNLI-NSVYHECHLMHP 589
           APGSSVIEVDG A+EF+ GD SH + E+++ +LN++ N VY E H   P
Sbjct: 541 APGSSVIEVDGKAYEFLAGDTSHPRNEDVHILLNILANQVYLEGHFSVP 589

BLAST of Cla97C05G089660 vs. TrEMBL
Match: tr|A0A2I4FGU0|A0A2I4FGU0_9ROSI (pentatricopeptide repeat-containing protein At4g38010 OS=Juglans regia OX=51240 GN=LOC108998667 PE=4 SV=1)

HSP 1 Score: 763.1 bits (1969), Expect = 4.7e-217
Identity = 362/585 (61.88%), Postives = 453/585 (77.44%), Query Frame = 0

Query: 5   KWVLLDSVKDCKSLRIFRQIHAQLVTSGLVYDEFVKTKVMEFFANFVEYGDYACDYLKQD 64
           KWVLLD +  C ++R F QIHAQL+TSG+V D+ V  +V EFF  F  + +Y CD LKQ 
Sbjct: 9   KWVLLDFIHKCNNVRSFMQIHAQLLTSGVVRDQLVVNRVAEFFGKFANFSEYGCDLLKQI 68

Query: 65  STHLSSFPFNSLINGYAGGDLPQMAVSVYRSMVRDGFVPDMFTFPVLLKACSNFSGSREG 124
              +SSFP+N +I+GYAG D P+ AV VYR M+R+GF+PDM+TFPV+LK+C+ F G  EG
Sbjct: 69  DWSISSFPYNLMISGYAGSDTPRAAVLVYRRMMRNGFMPDMYTFPVVLKSCAKFLGIGEG 128

Query: 125 RQVHGVVVKLGILADHYVQNSLIRCYGACGDFSCAGKVFDEMLVRDVVSWNSLISGFMKA 184
           RQVHGVVVK+G L+D  VQNSL+  YG  GDF  A  VFD+M VRDVVSW+ LISG+++A
Sbjct: 129 RQVHGVVVKMGFLSDLVVQNSLVHFYGGFGDFRVASMVFDDMHVRDVVSWSCLISGYVRA 188

Query: 185 GHFDEAISLFFRMDVEPSIATLVSVLTACARKGDLCTGKGIHGVIERRFKLDLVLGNAML 244
           G FDEA++LF RMDV+P+IAT VS+L AC R   L  GK IHG++ R  ++DLV+GNA++
Sbjct: 189 GLFDEAVALFLRMDVKPNIATFVSMLVACGRMRHLSLGKEIHGLMVRCVRVDLVVGNAIM 248

Query: 245 DMYVKNGCLYEAKKIFDELPTRDIVSWTIMITGLVQSDHPKESLELFSMMRTLGISPDAI 304
           DMYVK  CL EAK+IFDELP  DIVSWT MI+GLVQ   PKESL++F  M   GI PD +
Sbjct: 249 DMYVKCECLCEAKQIFDELPEIDIVSWTTMISGLVQCKRPKESLQMFHKMLASGIEPDRL 308

Query: 305 ILTSVLSACASLGTLDFGTWVHEYINQRGIKWDIHIGTAIVDMYAKCGCIEMALQIFYNM 364
           +LTSVLSACASLG LD+G W+  YI+ R IKWD HIGT ++DMYAKCGC+EMAL  F  M
Sbjct: 309 LLTSVLSACASLGALDYGRWIDVYIDFRDIKWDTHIGTTMIDMYAKCGCVEMALWTFNEM 368

Query: 365 PQRNTFTWNALLCGLAMHGLAHEALNLFEVMIISGVKPNEVTFLAIMTACCHSGLVNEGR 424
           P +N +TWNALL GLAMHG   EAL  FE MI SG++PNEVTFLAI+TACCHSGL++EGR
Sbjct: 369 PFKNVYTWNALLGGLAMHGHGREALKHFEEMIKSGMRPNEVTFLAILTACCHSGLIDEGR 428

Query: 425 KCFNNMSSQLHNLLPKLEHYGCMIDLFCRAGLLEEAVELTRTMPMKPDVLIWGVILNACR 484
           +CF  M SQ HNL P+LEHYGCM+D+ C+A LL+EA EL +TMPM PD+LIWG +L+ C+
Sbjct: 429 RCFYQMISQEHNLSPRLEHYGCMVDMLCKAQLLDEAQELIKTMPMPPDLLIWGALLSGCK 488

Query: 485 TVGNVELSHHIQDYILELDPEDSGVFVLLSNISATNERWSDVTRLRRLMKDRGVKKAPGS 544
             G VELS  I D +LEL+ +DSGV+VLLSNI AT++RW++VTR+RRLMK++G+KKAPGS
Sbjct: 489 ASGAVELSQEILDSLLELESQDSGVYVLLSNIHATDQRWAEVTRIRRLMKEKGIKKAPGS 548

Query: 545 SVIEVDGNAHEFVVGDISHLQTEEIYKVLNLINS-VYHECHLMHP 589
           SVIEVDG  HEF  GD SH Q E+++ +LN+++S +Y E H   P
Sbjct: 549 SVIEVDGKVHEFFAGDASHPQNEDLHYLLNILSSNLYLEGHFSDP 593

BLAST of Cla97C05G089660 vs. Swiss-Prot
Match: sp|Q9SZK1|PP355_ARATH (Pentatricopeptide repeat-containing protein At4g38010 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E45 PE=3 SV=1)

HSP 1 Score: 607.4 bits (1565), Expect = 1.6e-172
Identity = 303/547 (55.39%), Postives = 385/547 (70.38%), Query Frame = 0

Query: 5   KWVLLDSVKDCKSLRIFRQIHAQLVTSGLVYDEFVKTKVMEFFANFVEYGDYACDYLKQD 64
           K VLL+ +  C SLR+F+QI  QL+T  L+ D+ +  KV+ F     ++  Y+   L   
Sbjct: 6   KSVLLELISRCSSLRVFKQIQTQLITRDLLRDDLIINKVVTFLGKSADFASYSSVILHSI 65

Query: 65  STHLSSFPFNSLINGYAGGDLPQMAVSVYRSMVRDGFVPDMFTFPVLLKACSNFSGSREG 124
            + LSSF +N+L++ YA  D P++ +  Y++ V +GF PDMFTFP + KAC  FSG REG
Sbjct: 66  RSVLSSFSYNTLLSSYAVCDKPRVTIFAYKTFVSNGFSPDMFTFPPVFKACGKFSGIREG 125

Query: 125 RQVHGVVVKLGILADHYVQNSLIRCYGACGDFSCAGKVFDEMLVRDVVSWNSLISGFMKA 184
           +Q+HG+V K+G   D YVQNSL+  YG CG+   A KVF EM VRDVVSW  +I+GF + 
Sbjct: 126 KQIHGIVTKMGFYDDIYVQNSLVHFYGVCGESRNACKVFGEMPVRDVVSWTGIITGFTRT 185

Query: 185 GHFDEAISLFFRMDVEPSIATLVSVLTACARKGDLCTGKGIHGVIERRFKL-DLVLGNAM 244
           G + EA+  F +MDVEP++AT V VL +  R G L  GKGIHG+I +R  L  L  GNA+
Sbjct: 186 GLYKEALDTFSKMDVEPNLATYVCVLVSSGRVGCLSLGKGIHGLILKRASLISLETGNAL 245

Query: 245 LDMYVKNGCLYEAKKIFDELPTRDIVSWTIMITGLVQSDHPKESLELFSMMRT-LGISPD 304
           +DMYVK   L +A ++F EL  +D VSW  MI+GLV  +  KE+++LFS+M+T  GI PD
Sbjct: 246 IDMYVKCEQLSDAMRVFGELEKKDKVSWNSMISGLVHCERSKEAIDLFSLMQTSSGIKPD 305

Query: 305 AIILTSVLSACASLGTLDFGTWVHEYINQRGIKWDIHIGTAIVDMYAKCGCIEMALQIFY 364
             ILTSVLSACASLG +D G WVHEYI   GIKWD HIGTAIVDMYAKCG IE AL+IF 
Sbjct: 306 GHILTSVLSACASLGAVDHGRWVHEYILTAGIKWDTHIGTAIVDMYAKCGYIETALEIFN 365

Query: 365 NMPQRNTFTWNALLCGLAMHGLAHEALNLFEVMIISGVKPNEVTFLAIMTACCHSGLVNE 424
            +  +N FTWNALL GLA+HG   E+L  FE M+  G KPN VTFLA + ACCH+GLV+E
Sbjct: 366 GIRSKNVFTWNALLGGLAIHGHGLESLRYFEEMVKLGFKPNLVTFLAALNACCHTGLVDE 425

Query: 425 GRKCFNNMSSQLHNLLPKLEHYGCMIDLFCRAGLLEEAVELTRTMPMKPDVLIWGVILNA 484
           GR+ F+ M S+ +NL PKLEHYGCMIDL CRAGLL+EA+EL + MP+KPDV I G IL+A
Sbjct: 426 GRRYFHKMKSREYNLFPKLEHYGCMIDLLCRAGLLDEALELVKAMPVKPDVRICGAILSA 485

Query: 485 CRTVGN-VELSHHIQDYILELDPEDSGVFVLLSNISATNERWSDVTRLRRLMKDRGVKKA 544
           C+  G  +EL   I D  L+++ EDSGV+VLLSNI A N RW DV R+RRLMK +G+ K 
Sbjct: 486 CKNRGTLMELPKEILDSFLDIEFEDSGVYVLLSNIFAANRRWDDVARIRRLMKVKGISKV 545

Query: 545 PGSSVIE 549
           PGSS IE
Sbjct: 546 PGSSYIE 552

BLAST of Cla97C05G089660 vs. Swiss-Prot
Match: sp|Q9SJZ3|PP169_ARATH (Pentatricopeptide repeat-containing protein At2g22410, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E28 PE=2 SV=1)

HSP 1 Score: 408.3 bits (1048), Expect = 1.5e-112
Identity = 223/608 (36.68%), Postives = 338/608 (55.59%), Query Frame = 0

Query: 8   LLDSVKDCKSLRIFRQIHAQLVTSGLVYDEFVKTKVMEFFA-NFVEYGDYACDYLKQDST 67
           LL  ++ CK L   +QI AQ++ +GL+ D F  ++++ F A +   Y DY+   LK    
Sbjct: 56  LLSLLEKCKLLLHLKQIQAQMIINGLILDPFASSRLIAFCALSESRYLDYSVKILK-GIE 115

Query: 68  HLSSFPFNSLINGYAGGDLPQMAVSVYRSMVRDGFV---PDMFTFPVLLKACSNFSGSRE 127
           + + F +N  I G++  + P+ +  +Y+ M+R G     PD FT+PVL K C++   S  
Sbjct: 116 NPNIFSWNVTIRGFSESENPKESFLLYKQMLRHGCCESRPDHFTYPVLFKVCADLRLSSL 175

Query: 128 GRQVHGVVVKLGILADHYVQNSLIRCYGACGDFSCAGKVFDEMLVRDVVSWNSLISGFMK 187
           G  + G V+KL +    +V N+ I  + +CGD   A KVFDE  VRD+VSWN LI+G+ K
Sbjct: 176 GHMILGHVLKLRLELVSHVHNASIHMFASCGDMENARKVFDESPVRDLVSWNCLINGYKK 235

Query: 188 AGHFDEAISLFFRMD---VEPSIATLVSVLTACARKGDLCTGKGIHGVI-ERRFKLDLVL 247
            G  ++AI ++  M+   V+P   T++ ++++C+  GDL  GK  +  + E   ++ + L
Sbjct: 236 IGEAEKAIYVYKLMESEGVKPDDVTMIGLVSSCSMLGDLNRGKEFYEYVKENGLRMTIPL 295

Query: 248 GNAMLDMYVKNGCLYEAKKIFDEL-------------------------------PTRDI 307
            NA++DM+ K G ++EA++IFD L                                    
Sbjct: 296 VNALMDMFSKCGDIHEARRIFDNLEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 355

Query: 308 VSWTIMITGLVQSDHPKESLELFSMMRTLGISPDAIILTSVLSACASLGTLDFGTWVHEY 367
                                      T    PD I +   LSAC+ LG LD G W+H Y
Sbjct: 356 XXXXXXXXXXXXXXXXXXXXXXXXXXXTSNTKPDEITMIHCLSACSQLGALDVGIWIHRY 415

Query: 368 INQRGIKWDIHIGTAIVDMYAKCGCIEMALQIFYNMPQRNTFTWNALLCGLAMHGLAHEA 427
           I +  +  ++ +GT++VDMYAKCG I  AL +F+ +  RN+ T+ A++ GLA+HG A  A
Sbjct: 416 IEKYSLSLNVALGTSLVDMYAKCGNISEALSVFHGIQTRNSLTYTAIIGGLALHGDASTA 475

Query: 428 LNLFEVMIISGVKPNEVTFLAIMTACCHSGLVNEGRKCFNNMSSQLHNLLPKLEHYGCMI 487
           ++ F  MI +G+ P+E+TF+ +++ACCH G++  GR  F+ M S+  NL P+L+HY  M+
Sbjct: 476 ISYFNEMIDAGIAPDEITFIGLLSACCHGGMIQTGRDYFSQMKSRF-NLNPQLKHYSIMV 535

Query: 488 DLFCRAGLLEEAVELTRTMPMKPDVLIWGVILNACRTVGNVELSHHIQDYILELDPEDSG 547
           DL  RAGLLEEA  L  +MPM+ D  +WG +L  CR  GNVEL       +LELDP DSG
Sbjct: 536 DLLGRAGLLEEADRLMESMPMEADAAVWGALLFGCRMHGNVELGEKAAKKLLELDPSDSG 595

Query: 548 VFVLLSNISATNERWSDVTRLRRLMKDRGVKKAPGSSVIEVDGNAHEFVVGDISHLQTEE 577
           ++VLL  +      W D  R RR+M +RGV+K PG S IEV+G   EF+V D S  ++E+
Sbjct: 596 IYVLLDGMYGEANMWEDAKRARRMMNERGVEKIPGCSSIEVNGIVCEFIVRDKSRPESEK 655

BLAST of Cla97C05G089660 vs. Swiss-Prot
Match: sp|Q9C866|PPR65_ARATH (Pentatricopeptide repeat-containing protein At1g31430 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E55 PE=2 SV=1)

HSP 1 Score: 388.7 bits (997), Expect = 1.2e-106
Identity = 206/542 (38.01%), Postives = 311/542 (57.38%), Query Frame = 0

Query: 73  FNSLINGYAGGDLPQMAVSVYRSMVRDGFVPDMFTFPVLLKACSNFSGSREGRQVHGVVV 132
           +N ++   A G      ++++  +   G  PD FT PV+LK+        EG +VHG  V
Sbjct: 14  YNKMLKSLADGKSFTKVLALFGELRGQGLYPDNFTLPVVLKSIGRLRKVIEGEKVHGYAV 73

Query: 133 KLGILADHYVQNSLIRCYGACGDFSCAGKVFDEMLVRDVVSWNSLISGFMKAGHFDEAIS 192
           K G+  D YV NSL+  Y + G      KVFDEM  RDVVSWN LIS ++  G F++AI 
Sbjct: 74  KAGLEFDSYVSNSLMGMYASLGKIEITHKVFDEMPQRDVVSWNGLISSYVGNGRFEDAIG 133

Query: 193 LFFRMDVEPSI----ATLVSVLTACARKGDLCTGKGIHGVIERRFKLDLVLGNAMLDMYV 252
           +F RM  E ++     T+VS L+AC+   +L  G+ I+  +   F++ + +GNA++DM+ 
Sbjct: 134 VFKRMSQESNLKFDEGTIVSTLSACSALKNLEIGERIYRFVVTEFEMSVRIGNALVDMFC 193

Query: 253 KNGCLYEAKKIFDELPTRD-------------------------------IVSWTIMITG 312
           K GCL +A+ +FD +  ++                                         
Sbjct: 194 KCGCLDKARAVFDSMRDKNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 253

Query: 313 LVQSDHPKESLELFSMMRTLGISPDAIILTSVLSACASLGTLDFGTWVHEYINQRGIKWD 372
                           M+T GI PD  +L S+L+ CA  G L+ G W+H YIN+  +  D
Sbjct: 254 XXXXXXXXXXXXXXXCMQTAGIRPDNFVLVSLLTGCAQTGALEQGKWIHGYINENRVTVD 313

Query: 373 IHIGTAIVDMYAKCGCIEMALQIFYNMPQRNTFTWNALLCGLAMHGLAHEALNLFEVMII 432
             +GTA+VDMYAKCGCIE AL++FY + +R+T +W +L+ GLAM+G++  AL+L+  M  
Sbjct: 314 KVVGTALVDMYAKCGCIETALEVFYEIKERDTASWTSLIYGLAMNGMSGRALDLYYEMEN 373

Query: 433 SGVKPNEVTFLAIMTACCHSGLVNEGRKCFNNMSSQLHNLLPKLEHYGCMIDLFCRAGLL 492
            GV+ + +TF+A++TAC H G V EGRK F++M+ + HN+ PK EH  C+IDL CRAGLL
Sbjct: 374 VGVRLDAITFVAVLTACNHGGFVAEGRKIFHSMTER-HNVQPKSEHCSCLIDLLCRAGLL 433

Query: 493 EEAVELTRTMPMKPD---VLIWGVILNACRTVGNVELSHHIQDYILELDPEDSGVFVLLS 552
           +EA EL   M  + D   V ++  +L+A R  GNV+++  + + + +++  DS    LL+
Sbjct: 434 DEAEELIDKMRGESDETLVPVYCSLLSAARNYGNVKIAERVAEKLEKVEVSDSSAHTLLA 493

Query: 553 NISATNERWSDVTRLRRLMKDRGVKKAPGSSVIEVDGNAHEFVVGD--ISHLQTEEIYKV 575
           ++ A+  RW DVT +RR MKD G++K PG S IE+DG  HEF+VGD  +SH + +EI  +
Sbjct: 494 SVYASANRWEDVTNVRRKMKDLGIRKFPGCSSIEIDGVGHEFIVGDDLLSHPKMDEINSM 553

BLAST of Cla97C05G089660 vs. Swiss-Prot
Match: sp|Q9LN01|PPR21_ARATH (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 387.5 bits (994), Expect = 2.7e-106
Identity = 215/606 (35.48%), Postives = 337/606 (55.61%), Query Frame = 0

Query: 9   LDSVKDCKSLRIFRQIHAQLVTSGLVYDEFVKTKVMEF--FANFVEYGDYACDYLK--QD 68
           L  + +CK+L+  R IHAQ++  GL    +  +K++EF   +   E   YA    K  Q+
Sbjct: 37  LSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHFEGLPYAISVFKTIQE 96

Query: 69  STHLSSFPFNSLINGYAGGDLPQMAVSVYRSMVRDGFVPDMFTFPVLLKACSNFSGSREG 128
              L    +N++  G+A    P  A+ +Y  M+  G +P+ +TFP +LK+C+     +EG
Sbjct: 97  PNLLI---WNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEG 156

Query: 129 RQVHGVVVKLGILADHYVQNSLIRCYGACGDFSCAGKVFDEMLVR--------------- 188
           +Q+HG V+KLG   D YV  SLI  Y   G    A KVFD+   R               
Sbjct: 157 QQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRXXXXXXXXXXXXXXX 216

Query: 189 -------------------DVVSWNSLISGFMKAGHFDEAISLFFRMDVEPSIATLVSVL 248
                                                        + +V P  +T+V+V+
Sbjct: 217 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKTNVRPDESTMVTVV 276

Query: 249 TACARKGDLCTGKGIH-GVIERRFKLDLVLGNAMLDMYVKNGCLYEAKKIFDELPTRDIV 308
           +ACA+ G +  G+ +H  + +  F  +L + NA++D+Y K G L  A  +F+ LP +D++
Sbjct: 277 SACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVI 336

Query: 309 SWTIMITGLVQSDHPKESLELFSMMRTLGISPDAIILTSVLSACASLGTLDFGTWVHEYI 368
           SW  +I G    +  KE+L LF  M   G +P+ + + S+L ACA LG +D G W+H YI
Sbjct: 337 SWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYI 396

Query: 369 NQR--GIKWDIHIGTAIVDMYAKCGCIEMALQIFYNMPQRNTFTWNALLCGLAMHGLAHE 428
           ++R  G+     + T+++DMYAKCG IE A Q+F ++  ++  +WNA++ G AMHG A  
Sbjct: 397 DKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADA 456

Query: 429 ALNLFEVMIISGVKPNEVTFLAIMTACCHSGLVNEGRKCFNNMSSQLHNLLPKLEHYGCM 488
           + +LF  M   G++P+++TF+ +++AC HSG+++ GR  F  M +Q + + PKLEHYGCM
Sbjct: 457 SFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTM-TQDYKMTPKLEHYGCM 516

Query: 489 IDLFCRAGLLEEAVELTRTMPMKPDVLIWGVILNACRTVGNVELSHHIQDYILELDPEDS 548
           IDL   +GL +EA E+   M M+PD +IW  +L AC+  GNVEL     + +++++PE+ 
Sbjct: 517 IDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENP 576

Query: 549 GVFVLLSNISATNERWSDVTRLRRLMKDRGVKKAPGSSVIEVDGNAHEFVVGDISHLQTE 574
           G +VLLSNI A+  RW++V + R L+ D+G+KK PG S IE+D   HEF++GD  H +  
Sbjct: 577 GSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNR 636

BLAST of Cla97C05G089660 vs. Swiss-Prot
Match: sp|Q9LSB8|PP235_ARATH (Putative pentatricopeptide repeat-containing protein At3g15930 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E51 PE=3 SV=2)

HSP 1 Score: 387.1 bits (993), Expect = 3.5e-106
Identity = 214/596 (35.91%), Postives = 331/596 (55.54%), Query Frame = 0

Query: 15  CKSLRIFRQIHAQLVTSGLVYDEFVKTKVMEFFANFVEYGDYACDY-LKQDSTHLSSFPF 74
           CK+   F+Q+H+Q +T G+  +   + K+  F+ + +  G  +  Y L           +
Sbjct: 44  CKTTDQFKQLHSQSITRGVAPNPTFQKKLFVFWCSRLG-GHVSYAYKLFVKIPEPDVVVW 103

Query: 75  NSLINGYAGGDLPQMAVSVYRSMVRDGFVPDMFTFPVLLKACSNFSGSRE-GRQVHGVVV 134
           N++I G++  D     V +Y +M+++G  PD  TFP LL       G+   G+++H  VV
Sbjct: 104 NNMIKGWSKVDCDGEGVRLYLNMLKEGVTPDSHTFPFLLNGLKRDGGALACGKKLHCHVV 163

Query: 135 KLGILADHYVQNSLIRCYGACGDFSCAGKVFDEMLVRDVVSWNSLISGFMKAGHFDEAIS 194
           K G+ ++ YVQN+L++ Y  CG    A  VFD     DV SWN +ISG+ +   ++E+I 
Sbjct: 164 KFGLGSNLYVQNALVKMYSLCGLMDMARGVFDRRCKEDVFSWNLMISGYNRMKEYEESIE 223

Query: 195 LFFRMD---VEPSIATLVSVLTACARKGDLCTGKGIHG-VIERRFKLDLVLGNAMLDMYV 254
           L   M+   V P+  TL+ VL+AC++  D    K +H  V E + +  L L NA+++ Y 
Sbjct: 224 LLVEMERNLVSPTSVTLLLVLSACSKVKDKDLCKRVHEYVSECKTEPSLRLENALVNAYA 283

Query: 255 KNGCLYEAKKIFDELPTRDIVSWT-------------------------------IMITG 314
             G +  A +IF  +  RD++SWT                                    
Sbjct: 284 ACGEMDIAVRIFRSMKARDVISWTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 343

Query: 315 LVQSDHPKESLELFSMMRTLGISPDAIILTSVLSACASLGTLDFGTWVHEYINQRGIKWD 374
                            ++ G+ PD   + SVL+ACA LG+L+ G W+  YI++  IK D
Sbjct: 344 XXXXXXXXXXXXXXXXXQSAGMIPDEFTMVSVLTACAHLGSLEIGEWIKTYIDKNKIKND 403

Query: 375 IHIGTAIVDMYAKCGCIEMALQIFYNMPQRNTFTWNALLCGLAMHGLAHEALNLFEVMII 434
           + +G A++DMY KCGC E A ++F++M QR+ FTW A++ GLA +G   EA+ +F  M  
Sbjct: 404 VVVGNALIDMYFKCGCSEKAQKVFHDMDQRDKFTWTAMVVGLANNGQGQEAIKVFFQMQD 463

Query: 435 SGVKPNEVTFLAIMTACCHSGLVNEGRKCFNNMSSQLHNLLPKLEHYGCMIDLFCRAGLL 494
             ++P+++T+L +++AC HSG+V++ RK F  M S  H + P L HYGCM+D+  RAGL+
Sbjct: 464 MSIQPDDITYLGVLSACNHSGMVDQARKFFAKMRSD-HRIEPSLVHYGCMVDMLGRAGLV 523

Query: 495 EEAVELTRTMPMKPDVLIWGVILNACRTVGNVELSHHIQDYILELDPEDSGVFVLLSNIS 554
           +EA E+ R MPM P+ ++WG +L A R   +  ++      ILEL+P++  V+ LL NI 
Sbjct: 524 KEAYEILRKMPMNPNSIVWGALLGASRLHNDEPMAELAAKKILELEPDNGAVYALLCNIY 583

Query: 555 ATNERWSDVTRLRRLMKDRGVKKAPGSSVIEVDGNAHEFVVGDISHLQTEEIYKVL 574
           A  +RW D+  +RR + D  +KK PG S+IEV+G AHEFV GD SHLQ+EEIY  L
Sbjct: 584 AGCKRWKDLREVRRKIVDVAIKKTPGFSLIEVNGFAHEFVAGDKSHLQSEEIYMKL 637

BLAST of Cla97C05G089660 vs. TAIR10
Match: AT4G38010.1 (Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 607.4 bits (1565), Expect = 9.0e-174
Identity = 303/547 (55.39%), Postives = 385/547 (70.38%), Query Frame = 0

Query: 5   KWVLLDSVKDCKSLRIFRQIHAQLVTSGLVYDEFVKTKVMEFFANFVEYGDYACDYLKQD 64
           K VLL+ +  C SLR+F+QI  QL+T  L+ D+ +  KV+ F     ++  Y+   L   
Sbjct: 6   KSVLLELISRCSSLRVFKQIQTQLITRDLLRDDLIINKVVTFLGKSADFASYSSVILHSI 65

Query: 65  STHLSSFPFNSLINGYAGGDLPQMAVSVYRSMVRDGFVPDMFTFPVLLKACSNFSGSREG 124
            + LSSF +N+L++ YA  D P++ +  Y++ V +GF PDMFTFP + KAC  FSG REG
Sbjct: 66  RSVLSSFSYNTLLSSYAVCDKPRVTIFAYKTFVSNGFSPDMFTFPPVFKACGKFSGIREG 125

Query: 125 RQVHGVVVKLGILADHYVQNSLIRCYGACGDFSCAGKVFDEMLVRDVVSWNSLISGFMKA 184
           +Q+HG+V K+G   D YVQNSL+  YG CG+   A KVF EM VRDVVSW  +I+GF + 
Sbjct: 126 KQIHGIVTKMGFYDDIYVQNSLVHFYGVCGESRNACKVFGEMPVRDVVSWTGIITGFTRT 185

Query: 185 GHFDEAISLFFRMDVEPSIATLVSVLTACARKGDLCTGKGIHGVIERRFKL-DLVLGNAM 244
           G + EA+  F +MDVEP++AT V VL +  R G L  GKGIHG+I +R  L  L  GNA+
Sbjct: 186 GLYKEALDTFSKMDVEPNLATYVCVLVSSGRVGCLSLGKGIHGLILKRASLISLETGNAL 245

Query: 245 LDMYVKNGCLYEAKKIFDELPTRDIVSWTIMITGLVQSDHPKESLELFSMMRT-LGISPD 304
           +DMYVK   L +A ++F EL  +D VSW  MI+GLV  +  KE+++LFS+M+T  GI PD
Sbjct: 246 IDMYVKCEQLSDAMRVFGELEKKDKVSWNSMISGLVHCERSKEAIDLFSLMQTSSGIKPD 305

Query: 305 AIILTSVLSACASLGTLDFGTWVHEYINQRGIKWDIHIGTAIVDMYAKCGCIEMALQIFY 364
             ILTSVLSACASLG +D G WVHEYI   GIKWD HIGTAIVDMYAKCG IE AL+IF 
Sbjct: 306 GHILTSVLSACASLGAVDHGRWVHEYILTAGIKWDTHIGTAIVDMYAKCGYIETALEIFN 365

Query: 365 NMPQRNTFTWNALLCGLAMHGLAHEALNLFEVMIISGVKPNEVTFLAIMTACCHSGLVNE 424
            +  +N FTWNALL GLA+HG   E+L  FE M+  G KPN VTFLA + ACCH+GLV+E
Sbjct: 366 GIRSKNVFTWNALLGGLAIHGHGLESLRYFEEMVKLGFKPNLVTFLAALNACCHTGLVDE 425

Query: 425 GRKCFNNMSSQLHNLLPKLEHYGCMIDLFCRAGLLEEAVELTRTMPMKPDVLIWGVILNA 484
           GR+ F+ M S+ +NL PKLEHYGCMIDL CRAGLL+EA+EL + MP+KPDV I G IL+A
Sbjct: 426 GRRYFHKMKSREYNLFPKLEHYGCMIDLLCRAGLLDEALELVKAMPVKPDVRICGAILSA 485

Query: 485 CRTVGN-VELSHHIQDYILELDPEDSGVFVLLSNISATNERWSDVTRLRRLMKDRGVKKA 544
           C+  G  +EL   I D  L+++ EDSGV+VLLSNI A N RW DV R+RRLMK +G+ K 
Sbjct: 486 CKNRGTLMELPKEILDSFLDIEFEDSGVYVLLSNIFAANRRWDDVARIRRLMKVKGISKV 545

Query: 545 PGSSVIE 549
           PGSS IE
Sbjct: 546 PGSSYIE 552

BLAST of Cla97C05G089660 vs. TAIR10
Match: AT2G22410.1 (SLOW GROWTH 1)

HSP 1 Score: 408.3 bits (1048), Expect = 8.1e-114
Identity = 223/608 (36.68%), Postives = 338/608 (55.59%), Query Frame = 0

Query: 8   LLDSVKDCKSLRIFRQIHAQLVTSGLVYDEFVKTKVMEFFA-NFVEYGDYACDYLKQDST 67
           LL  ++ CK L   +QI AQ++ +GL+ D F  ++++ F A +   Y DY+   LK    
Sbjct: 56  LLSLLEKCKLLLHLKQIQAQMIINGLILDPFASSRLIAFCALSESRYLDYSVKILK-GIE 115

Query: 68  HLSSFPFNSLINGYAGGDLPQMAVSVYRSMVRDGFV---PDMFTFPVLLKACSNFSGSRE 127
           + + F +N  I G++  + P+ +  +Y+ M+R G     PD FT+PVL K C++   S  
Sbjct: 116 NPNIFSWNVTIRGFSESENPKESFLLYKQMLRHGCCESRPDHFTYPVLFKVCADLRLSSL 175

Query: 128 GRQVHGVVVKLGILADHYVQNSLIRCYGACGDFSCAGKVFDEMLVRDVVSWNSLISGFMK 187
           G  + G V+KL +    +V N+ I  + +CGD   A KVFDE  VRD+VSWN LI+G+ K
Sbjct: 176 GHMILGHVLKLRLELVSHVHNASIHMFASCGDMENARKVFDESPVRDLVSWNCLINGYKK 235

Query: 188 AGHFDEAISLFFRMD---VEPSIATLVSVLTACARKGDLCTGKGIHGVI-ERRFKLDLVL 247
            G  ++AI ++  M+   V+P   T++ ++++C+  GDL  GK  +  + E   ++ + L
Sbjct: 236 IGEAEKAIYVYKLMESEGVKPDDVTMIGLVSSCSMLGDLNRGKEFYEYVKENGLRMTIPL 295

Query: 248 GNAMLDMYVKNGCLYEAKKIFDEL-------------------------------PTRDI 307
            NA++DM+ K G ++EA++IFD L                                    
Sbjct: 296 VNALMDMFSKCGDIHEARRIFDNLEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 355

Query: 308 VSWTIMITGLVQSDHPKESLELFSMMRTLGISPDAIILTSVLSACASLGTLDFGTWVHEY 367
                                      T    PD I +   LSAC+ LG LD G W+H Y
Sbjct: 356 XXXXXXXXXXXXXXXXXXXXXXXXXXXTSNTKPDEITMIHCLSACSQLGALDVGIWIHRY 415

Query: 368 INQRGIKWDIHIGTAIVDMYAKCGCIEMALQIFYNMPQRNTFTWNALLCGLAMHGLAHEA 427
           I +  +  ++ +GT++VDMYAKCG I  AL +F+ +  RN+ T+ A++ GLA+HG A  A
Sbjct: 416 IEKYSLSLNVALGTSLVDMYAKCGNISEALSVFHGIQTRNSLTYTAIIGGLALHGDASTA 475

Query: 428 LNLFEVMIISGVKPNEVTFLAIMTACCHSGLVNEGRKCFNNMSSQLHNLLPKLEHYGCMI 487
           ++ F  MI +G+ P+E+TF+ +++ACCH G++  GR  F+ M S+  NL P+L+HY  M+
Sbjct: 476 ISYFNEMIDAGIAPDEITFIGLLSACCHGGMIQTGRDYFSQMKSRF-NLNPQLKHYSIMV 535

Query: 488 DLFCRAGLLEEAVELTRTMPMKPDVLIWGVILNACRTVGNVELSHHIQDYILELDPEDSG 547
           DL  RAGLLEEA  L  +MPM+ D  +WG +L  CR  GNVEL       +LELDP DSG
Sbjct: 536 DLLGRAGLLEEADRLMESMPMEADAAVWGALLFGCRMHGNVELGEKAAKKLLELDPSDSG 595

Query: 548 VFVLLSNISATNERWSDVTRLRRLMKDRGVKKAPGSSVIEVDGNAHEFVVGDISHLQTEE 577
           ++VLL  +      W D  R RR+M +RGV+K PG S IEV+G   EF+V D S  ++E+
Sbjct: 596 IYVLLDGMYGEANMWEDAKRARRMMNERGVEKIPGCSSIEVNGIVCEFIVRDKSRPESEK 655

BLAST of Cla97C05G089660 vs. TAIR10
Match: AT1G31430.1 (Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 388.7 bits (997), Expect = 6.6e-108
Identity = 206/542 (38.01%), Postives = 311/542 (57.38%), Query Frame = 0

Query: 73  FNSLINGYAGGDLPQMAVSVYRSMVRDGFVPDMFTFPVLLKACSNFSGSREGRQVHGVVV 132
           +N ++   A G      ++++  +   G  PD FT PV+LK+        EG +VHG  V
Sbjct: 14  YNKMLKSLADGKSFTKVLALFGELRGQGLYPDNFTLPVVLKSIGRLRKVIEGEKVHGYAV 73

Query: 133 KLGILADHYVQNSLIRCYGACGDFSCAGKVFDEMLVRDVVSWNSLISGFMKAGHFDEAIS 192
           K G+  D YV NSL+  Y + G      KVFDEM  RDVVSWN LIS ++  G F++AI 
Sbjct: 74  KAGLEFDSYVSNSLMGMYASLGKIEITHKVFDEMPQRDVVSWNGLISSYVGNGRFEDAIG 133

Query: 193 LFFRMDVEPSI----ATLVSVLTACARKGDLCTGKGIHGVIERRFKLDLVLGNAMLDMYV 252
           +F RM  E ++     T+VS L+AC+   +L  G+ I+  +   F++ + +GNA++DM+ 
Sbjct: 134 VFKRMSQESNLKFDEGTIVSTLSACSALKNLEIGERIYRFVVTEFEMSVRIGNALVDMFC 193

Query: 253 KNGCLYEAKKIFDELPTRD-------------------------------IVSWTIMITG 312
           K GCL +A+ +FD +  ++                                         
Sbjct: 194 KCGCLDKARAVFDSMRDKNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 253

Query: 313 LVQSDHPKESLELFSMMRTLGISPDAIILTSVLSACASLGTLDFGTWVHEYINQRGIKWD 372
                           M+T GI PD  +L S+L+ CA  G L+ G W+H YIN+  +  D
Sbjct: 254 XXXXXXXXXXXXXXXCMQTAGIRPDNFVLVSLLTGCAQTGALEQGKWIHGYINENRVTVD 313

Query: 373 IHIGTAIVDMYAKCGCIEMALQIFYNMPQRNTFTWNALLCGLAMHGLAHEALNLFEVMII 432
             +GTA+VDMYAKCGCIE AL++FY + +R+T +W +L+ GLAM+G++  AL+L+  M  
Sbjct: 314 KVVGTALVDMYAKCGCIETALEVFYEIKERDTASWTSLIYGLAMNGMSGRALDLYYEMEN 373

Query: 433 SGVKPNEVTFLAIMTACCHSGLVNEGRKCFNNMSSQLHNLLPKLEHYGCMIDLFCRAGLL 492
            GV+ + +TF+A++TAC H G V EGRK F++M+ + HN+ PK EH  C+IDL CRAGLL
Sbjct: 374 VGVRLDAITFVAVLTACNHGGFVAEGRKIFHSMTER-HNVQPKSEHCSCLIDLLCRAGLL 433

Query: 493 EEAVELTRTMPMKPD---VLIWGVILNACRTVGNVELSHHIQDYILELDPEDSGVFVLLS 552
           +EA EL   M  + D   V ++  +L+A R  GNV+++  + + + +++  DS    LL+
Sbjct: 434 DEAEELIDKMRGESDETLVPVYCSLLSAARNYGNVKIAERVAEKLEKVEVSDSSAHTLLA 493

Query: 553 NISATNERWSDVTRLRRLMKDRGVKKAPGSSVIEVDGNAHEFVVGD--ISHLQTEEIYKV 575
           ++ A+  RW DVT +RR MKD G++K PG S IE+DG  HEF+VGD  +SH + +EI  +
Sbjct: 494 SVYASANRWEDVTNVRRKMKDLGIRKFPGCSSIEIDGVGHEFIVGDDLLSHPKMDEINSM 553

BLAST of Cla97C05G089660 vs. TAIR10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 387.5 bits (994), Expect = 1.5e-107
Identity = 215/606 (35.48%), Postives = 337/606 (55.61%), Query Frame = 0

Query: 9   LDSVKDCKSLRIFRQIHAQLVTSGLVYDEFVKTKVMEF--FANFVEYGDYACDYLK--QD 68
           L  + +CK+L+  R IHAQ++  GL    +  +K++EF   +   E   YA    K  Q+
Sbjct: 37  LSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHFEGLPYAISVFKTIQE 96

Query: 69  STHLSSFPFNSLINGYAGGDLPQMAVSVYRSMVRDGFVPDMFTFPVLLKACSNFSGSREG 128
              L    +N++  G+A    P  A+ +Y  M+  G +P+ +TFP +LK+C+     +EG
Sbjct: 97  PNLLI---WNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEG 156

Query: 129 RQVHGVVVKLGILADHYVQNSLIRCYGACGDFSCAGKVFDEMLVR--------------- 188
           +Q+HG V+KLG   D YV  SLI  Y   G    A KVFD+   R               
Sbjct: 157 QQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRXXXXXXXXXXXXXXX 216

Query: 189 -------------------DVVSWNSLISGFMKAGHFDEAISLFFRMDVEPSIATLVSVL 248
                                                        + +V P  +T+V+V+
Sbjct: 217 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKTNVRPDESTMVTVV 276

Query: 249 TACARKGDLCTGKGIH-GVIERRFKLDLVLGNAMLDMYVKNGCLYEAKKIFDELPTRDIV 308
           +ACA+ G +  G+ +H  + +  F  +L + NA++D+Y K G L  A  +F+ LP +D++
Sbjct: 277 SACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVI 336

Query: 309 SWTIMITGLVQSDHPKESLELFSMMRTLGISPDAIILTSVLSACASLGTLDFGTWVHEYI 368
           SW  +I G    +  KE+L LF  M   G +P+ + + S+L ACA LG +D G W+H YI
Sbjct: 337 SWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYI 396

Query: 369 NQR--GIKWDIHIGTAIVDMYAKCGCIEMALQIFYNMPQRNTFTWNALLCGLAMHGLAHE 428
           ++R  G+     + T+++DMYAKCG IE A Q+F ++  ++  +WNA++ G AMHG A  
Sbjct: 397 DKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADA 456

Query: 429 ALNLFEVMIISGVKPNEVTFLAIMTACCHSGLVNEGRKCFNNMSSQLHNLLPKLEHYGCM 488
           + +LF  M   G++P+++TF+ +++AC HSG+++ GR  F  M +Q + + PKLEHYGCM
Sbjct: 457 SFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTM-TQDYKMTPKLEHYGCM 516

Query: 489 IDLFCRAGLLEEAVELTRTMPMKPDVLIWGVILNACRTVGNVELSHHIQDYILELDPEDS 548
           IDL   +GL +EA E+   M M+PD +IW  +L AC+  GNVEL     + +++++PE+ 
Sbjct: 517 IDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENP 576

Query: 549 GVFVLLSNISATNERWSDVTRLRRLMKDRGVKKAPGSSVIEVDGNAHEFVVGDISHLQTE 574
           G +VLLSNI A+  RW++V + R L+ D+G+KK PG S IE+D   HEF++GD  H +  
Sbjct: 577 GSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNR 636

BLAST of Cla97C05G089660 vs. TAIR10
Match: AT3G15930.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 387.1 bits (993), Expect = 1.9e-107
Identity = 214/596 (35.91%), Postives = 331/596 (55.54%), Query Frame = 0

Query: 15  CKSLRIFRQIHAQLVTSGLVYDEFVKTKVMEFFANFVEYGDYACDY-LKQDSTHLSSFPF 74
           CK+   F+Q+H+Q +T G+  +   + K+  F+ + +  G  +  Y L           +
Sbjct: 44  CKTTDQFKQLHSQSITRGVAPNPTFQKKLFVFWCSRLG-GHVSYAYKLFVKIPEPDVVVW 103

Query: 75  NSLINGYAGGDLPQMAVSVYRSMVRDGFVPDMFTFPVLLKACSNFSGSRE-GRQVHGVVV 134
           N++I G++  D     V +Y +M+++G  PD  TFP LL       G+   G+++H  VV
Sbjct: 104 NNMIKGWSKVDCDGEGVRLYLNMLKEGVTPDSHTFPFLLNGLKRDGGALACGKKLHCHVV 163

Query: 135 KLGILADHYVQNSLIRCYGACGDFSCAGKVFDEMLVRDVVSWNSLISGFMKAGHFDEAIS 194
           K G+ ++ YVQN+L++ Y  CG    A  VFD     DV SWN +ISG+ +   ++E+I 
Sbjct: 164 KFGLGSNLYVQNALVKMYSLCGLMDMARGVFDRRCKEDVFSWNLMISGYNRMKEYEESIE 223

Query: 195 LFFRMD---VEPSIATLVSVLTACARKGDLCTGKGIHG-VIERRFKLDLVLGNAMLDMYV 254
           L   M+   V P+  TL+ VL+AC++  D    K +H  V E + +  L L NA+++ Y 
Sbjct: 224 LLVEMERNLVSPTSVTLLLVLSACSKVKDKDLCKRVHEYVSECKTEPSLRLENALVNAYA 283

Query: 255 KNGCLYEAKKIFDELPTRDIVSWT-------------------------------IMITG 314
             G +  A +IF  +  RD++SWT                                    
Sbjct: 284 ACGEMDIAVRIFRSMKARDVISWTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 343

Query: 315 LVQSDHPKESLELFSMMRTLGISPDAIILTSVLSACASLGTLDFGTWVHEYINQRGIKWD 374
                            ++ G+ PD   + SVL+ACA LG+L+ G W+  YI++  IK D
Sbjct: 344 XXXXXXXXXXXXXXXXXQSAGMIPDEFTMVSVLTACAHLGSLEIGEWIKTYIDKNKIKND 403

Query: 375 IHIGTAIVDMYAKCGCIEMALQIFYNMPQRNTFTWNALLCGLAMHGLAHEALNLFEVMII 434
           + +G A++DMY KCGC E A ++F++M QR+ FTW A++ GLA +G   EA+ +F  M  
Sbjct: 404 VVVGNALIDMYFKCGCSEKAQKVFHDMDQRDKFTWTAMVVGLANNGQGQEAIKVFFQMQD 463

Query: 435 SGVKPNEVTFLAIMTACCHSGLVNEGRKCFNNMSSQLHNLLPKLEHYGCMIDLFCRAGLL 494
             ++P+++T+L +++AC HSG+V++ RK F  M S  H + P L HYGCM+D+  RAGL+
Sbjct: 464 MSIQPDDITYLGVLSACNHSGMVDQARKFFAKMRSD-HRIEPSLVHYGCMVDMLGRAGLV 523

Query: 495 EEAVELTRTMPMKPDVLIWGVILNACRTVGNVELSHHIQDYILELDPEDSGVFVLLSNIS 554
           +EA E+ R MPM P+ ++WG +L A R   +  ++      ILEL+P++  V+ LL NI 
Sbjct: 524 KEAYEILRKMPMNPNSIVWGALLGASRLHNDEPMAELAAKKILELEPDNGAVYALLCNIY 583

Query: 555 ATNERWSDVTRLRRLMKDRGVKKAPGSSVIEVDGNAHEFVVGDISHLQTEEIYKVL 574
           A  +RW D+  +RR + D  +KK PG S+IEV+G AHEFV GD SHLQ+EEIY  L
Sbjct: 584 AGCKRWKDLREVRRKIVDVAIKKTPGFSLIEVNGFAHEFVAGDKSHLQSEEIYMKL 637

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KGN60620.10.0e+0091.10hypothetical protein Csa_2G004700 [Cucumis sativus][more]
XP_008458315.10.0e+0090.92PREDICTED: pentatricopeptide repeat-containing protein At4g38010 isoform X1 [Cuc... [more]
XP_022968019.10.0e+0088.61pentatricopeptide repeat-containing protein At4g38010 isoform X1 [Cucurbita maxi... [more]
XP_022945046.13.0e-30887.44pentatricopeptide repeat-containing protein At4g38010 isoform X1 [Cucurbita mosc... [more]
XP_023541467.12.2e-30688.14pentatricopeptide repeat-containing protein At4g38010 isoform X1 [Cucurbita pepo... [more]
Match NameE-valueIdentityDescription
tr|A0A0A0LF19|A0A0A0LF19_CUCSA0.0e+0091.10Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G004700 PE=4 SV=1[more]
tr|A0A1S3C7P6|A0A1S3C7P6_CUCME0.0e+0090.92pentatricopeptide repeat-containing protein At4g38010 isoform X1 OS=Cucumis melo... [more]
tr|A0A1S3C7K9|A0A1S3C7K9_CUCME4.5e-25291.83pentatricopeptide repeat-containing protein At4g38010 isoform X2 OS=Cucumis melo... [more]
tr|A0A2N9FVF6|A0A2N9FVF6_FAGSY1.4e-22162.82Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS18831 PE=4 SV=1[more]
tr|A0A2I4FGU0|A0A2I4FGU0_9ROSI4.7e-21761.88pentatricopeptide repeat-containing protein At4g38010 OS=Juglans regia OX=51240 ... [more]
Match NameE-valueIdentityDescription
sp|Q9SZK1|PP355_ARATH1.6e-17255.39Pentatricopeptide repeat-containing protein At4g38010 OS=Arabidopsis thaliana OX... [more]
sp|Q9SJZ3|PP169_ARATH1.5e-11236.68Pentatricopeptide repeat-containing protein At2g22410, mitochondrial OS=Arabidop... [more]
sp|Q9C866|PPR65_ARATH1.2e-10638.01Pentatricopeptide repeat-containing protein At1g31430 OS=Arabidopsis thaliana OX... [more]
sp|Q9LN01|PPR21_ARATH2.7e-10635.48Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
sp|Q9LSB8|PP235_ARATH3.5e-10635.91Putative pentatricopeptide repeat-containing protein At3g15930 OS=Arabidopsis th... [more]
Match NameE-valueIdentityDescription
AT4G38010.19.0e-17455.39Pentatricopeptide repeat (PPR-like) superfamily protein[more]
AT2G22410.18.1e-11436.68SLOW GROWTH 1[more]
AT1G31430.16.6e-10838.01Pentatricopeptide repeat (PPR-like) superfamily protein[more]
AT1G08070.11.5e-10735.48Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT3G15930.11.9e-10735.91Pentatricopeptide repeat (PPR) superfamily protein[more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR011990TPR-like_helical_dom_sf
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0016787 hydrolase activity
molecular_function GO:0003674 molecular_function
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C05G089660.1Cla97C05G089660.1mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 241..266
e-value: 4.3E-4
score: 20.3
coord: 269..299
e-value: 7.4E-6
score: 25.8
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 368..416
e-value: 1.0E-11
score: 44.7
coord: 72..116
e-value: 4.9E-8
score: 32.9
coord: 170..215
e-value: 9.5E-9
score: 35.2
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 443..467
e-value: 1.4E-5
score: 24.6
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 269..303
e-value: 1.4E-6
score: 26.1
coord: 142..172
e-value: 6.8E-5
score: 20.8
coord: 370..404
e-value: 1.8E-6
score: 25.7
coord: 405..432
e-value: 0.0027
score: 15.7
coord: 241..269
e-value: 0.0019
score: 16.2
coord: 172..197
e-value: 9.6E-7
score: 26.6
coord: 73..104
e-value: 2.9E-6
score: 25.1
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 302..336
score: 6.906
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 472..502
score: 5.163
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 170..200
score: 10.589
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 236..266
score: 7.892
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 104..138
score: 6.38
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 139..169
score: 8.111
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 267..301
score: 11.4
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 403..433
score: 8.298
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 368..402
score: 11.772
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 506..540
score: 6.347
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 440..470
score: 8.133
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 337..367
score: 7.443
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 69..103
score: 10.413
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 227..319
e-value: 3.2E-21
score: 77.5
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 320..436
e-value: 1.1E-28
score: 102.7
coord: 437..555
e-value: 1.2E-10
score: 43.5
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 72..226
e-value: 9.5E-33
score: 115.8
NoneNo IPR availablePANTHERPTHR24015:SF266SUBFAMILY NOT NAMEDcoord: 19..568
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 19..568

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
Cla97C05G089660Cla006218Watermelon (97103) v1wmwmbB206
Cla97C05G089660CmaCh16G001570Cucurbita maxima (Rimu)cmawmbB346
Cla97C05G089660CmoCh18G012370Cucurbita moschata (Rifu)cmowmbB404
Cla97C05G089660Lsi05G012730Bottle gourd (USVL1VR-Ls)lsiwmbB334
Cla97C05G089660Carg26811Silver-seed gourdcarwmbB0006
Cla97C05G089660Carg16708Silver-seed gourdcarwmbB0910
The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
Cla97C05G089660Watermelon (97103) v2wmbwmbB023
Cla97C05G089660Watermelon (97103) v2wmbwmbB130
Cla97C05G089660Watermelon (97103) v2wmbwmbB140
Cla97C05G089660Silver-seed gourdcarwmbB0743
Cla97C05G089660Silver-seed gourdcarwmbB0969
Cla97C05G089660Silver-seed gourdcarwmbB0984
Cla97C05G089660Cucumber (Gy14) v2cgybwmbB196
Cla97C05G089660Cucumber (Gy14) v2cgybwmbB367
Cla97C05G089660Cucumber (Gy14) v2cgybwmbB443
Cla97C05G089660Cucumber (Gy14) v1cgywmbB452
Cla97C05G089660Cucumber (Gy14) v1cgywmbB522
Cla97C05G089660Cucumber (Gy14) v1cgywmbB621
Cla97C05G089660Cucurbita maxima (Rimu)cmawmbB304
Cla97C05G089660Cucurbita maxima (Rimu)cmawmbB424
Cla97C05G089660Cucurbita maxima (Rimu)cmawmbB649
Cla97C05G089660Cucurbita maxima (Rimu)cmawmbB651
Cla97C05G089660Cucurbita moschata (Rifu)cmowmbB286
Cla97C05G089660Cucurbita moschata (Rifu)cmowmbB331
Cla97C05G089660Cucurbita moschata (Rifu)cmowmbB621
Cla97C05G089660Cucurbita moschata (Rifu)cmowmbB624
Cla97C05G089660Wild cucumber (PI 183967)cpiwmbB209
Cla97C05G089660Wild cucumber (PI 183967)cpiwmbB404
Cla97C05G089660Wild cucumber (PI 183967)cpiwmbB491
Cla97C05G089660Cucumber (Chinese Long) v3cucwmbB206
Cla97C05G089660Cucumber (Chinese Long) v3cucwmbB400
Cla97C05G089660Cucumber (Chinese Long) v3cucwmbB484
Cla97C05G089660Cucumber (Chinese Long) v2cuwmbB204
Cla97C05G089660Cucumber (Chinese Long) v2cuwmbB466
Cla97C05G089660Bottle gourd (USVL1VR-Ls)lsiwmbB018
Cla97C05G089660Bottle gourd (USVL1VR-Ls)lsiwmbB022
Cla97C05G089660Melon (DHL92) v3.6.1medwmbB021
Cla97C05G089660Melon (DHL92) v3.6.1medwmbB115
Cla97C05G089660Melon (DHL92) v3.6.1medwmbB423
Cla97C05G089660Melon (DHL92) v3.5.1mewmbB024
Cla97C05G089660Melon (DHL92) v3.5.1mewmbB124
Cla97C05G089660Melon (DHL92) v3.5.1mewmbB434
Cla97C05G089660Watermelon (Charleston Gray)wcgwmbB109
Cla97C05G089660Watermelon (Charleston Gray)wcgwmbB229
Cla97C05G089660Watermelon (Charleston Gray)wcgwmbB272
Cla97C05G089660Watermelon (97103) v1wmwmbB269
Cla97C05G089660Watermelon (97103) v1wmwmbB405
Cla97C05G089660Wax gourdwgowmbB192