Cla97C08G156650 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C08G156650
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Pentatricopeptide repeat-containing protein
Location: Cla97Chr08: 24392356 .. 24394173 (+)
RNA-Seq Expression: Cla97C08G156650
Synteny: Cla97C08G156650
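
The Location coordinates can be sanity-checked against the sequences listed for this feature. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the locus is a single uninterrupted open reading frame (the gene, mRNA and CDS sequences on this page appear to be the same). The function name `feature_length` is illustrative and not part of any database API.

```python
# Coordinates copied from the Location field of this record.
START, END = 24392356, 24394173


def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


locus_bp = feature_length(START, END)

# Assumption: every base of the locus is coding (no introns or UTRs),
# so the locus length divides evenly into codons.
codons = locus_bp // 3
residues = codons - 1  # the final codon is the stop codon

print(locus_bp, codons, residues)
```

Under these assumptions the locus spans 1818 bp, or 606 codons including the stop, which gives a 605-residue protein. That agrees with the 605-residue alignment length reported in the first BLAST hit.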
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCAATCCCAATTCACAAAAACCAAGCTCTTACGTGCAATCAACAATGTTTCAGCTTCTACAACTAATCCTCGTGCAGCGGAGCAGAACTGCTTAGCCCTGCTTCAAGCCTGTAACGCGCTGCCGAAGCTCGCCCAAATCCATACCCACATTCTCAAGTTGGGCCTCCACAACAACCCGCTCGTTCTCACCAAATTCGCCTCCATTTCTTCTCTTATTCATGCTCCTGATTATGCTGCCTCTTTCTTGTTCTCTGCCGAAGCTGATACTCGGCTTTACGACGCGTTTCTTTTCAATACCCTCATCAGAGCCTATGCCCAAACTGCTCACTCGAAGGATAAAGCCTTGTCTTTGTATGGAATAATGCTTCACGGGCGGATTTTGCCTAATAAATTCACGTACCCATTTGTCTTGAAGGCTTGTGCTGGCCTTGAGGTTTTGAATTTGGGGCGATCGGTTCATGGGTCGGTGGTGAAGTTTGGGTTTGATCGTGATGTTCATGTTCAGAACACTTTGGTTCACATGTACTCCTGTTGCGCCGGTGGGATCAGTTCTGCCCGGAAGGTGTTTGATGAAATGCCCAAGGCAGATTCTGTGACTTGGAGTGCTATGATTGGTGGGTATGCTCGTGTAGGGCGCTCCACTGAAGCAGTAGCTTTATTTAGGGAGATGCAAATGGCGGAGGTTTGCCCTGATGAGATCACTATGGTTTCCATTCTCTCTGCTTGCACTGATTTAGGTGCCCTTGAACTGGGAAAGTGGATTGAAGCTTATATAGAGAGACAAGGAATTCAGAAACCAGCAGAGGTTAGCAATGCACTTATTGACATGTTTGCAAAATGTGGTGATATTAGTAAAGCATTGAAGTTATTTAGAACTATGAATGAGAAAACAATAGTTTCTTGGACTTCTGTTATTGTTGGCATGGCAATGCATGGCCGTGGTCAAGAGGCCATTTGTCTATTTGAGGAGATGATAGGTTCTGGTGTTGCTCCGGATGATGTCGCCTTTATCGGCTTGCTTTCTGCTTGTAGCCATTCGGGACTAGTAGAAAAAGGTAGAGAATATTTCAGTTCTATGATGAAGAAATACAAACTTGTTCCTAAGATAGAGCACTACGGATGCATGGTGGACATGTATTGCAGGACTGGACTAGTGAAAGAGGCTCTTGACTTCGTACGTAACATGCCAATCGAGCCAAATCCAGTAATCTTACGAACACTAGTTAGTGCCTGCCGTGGTCATGGTGAATTCAAGCTTGGAGAAAAGATCACCAAACTGCTAATGAGACACGAGCCCTTGCATGAATCAAACTATGTCTTGCTCTCTAACATTTATGCAAAAATGCTTAGCTGGGAGAAGAAGACCAAAATTAGAGAGGTGATGGAAGTGAAGGGCATGAAAAAGGTTCCAGGGAGCACCATGATTGAGATTGATAACGAAATCTATGAATTTGTTGCTGGAGATAAGTCTCATAAACAGTTCAAAGAAATCTATGAAATGGTGGACGAGATGGGGAGAGAAATGAAGAAATCTGGATATCGTCCTACGACATCTGAAGTTTTGCTCGATATCAACGAAGAGGACAAAGAAGATACCTTGAATAGGCATAGTGAAAAACTAGCCATTGCATTTGGTCTTCTTAGTACTCCACCAGGAACTCCGATTCGAATCGTGAAGAATTTGCGAGTTTGCAGCGATTGCCACTCCGCTTCCAAGTTCATCTCTAAGATTTATGATCGTGAAATCATAATGAGAGACCGCAACCGGTTTCACCACTTCAAGTCTGGGCAGTGCTCATGTGGAGATTTCTGGTGA

mRNA sequence

ATGCAATCCCAATTCACAAAAACCAAGCTCTTACGTGCAATCAACAATGTTTCAGCTTCTACAACTAATCCTCGTGCAGCGGAGCAGAACTGCTTAGCCCTGCTTCAAGCCTGTAACGCGCTGCCGAAGCTCGCCCAAATCCATACCCACATTCTCAAGTTGGGCCTCCACAACAACCCGCTCGTTCTCACCAAATTCGCCTCCATTTCTTCTCTTATTCATGCTCCTGATTATGCTGCCTCTTTCTTGTTCTCTGCCGAAGCTGATACTCGGCTTTACGACGCGTTTCTTTTCAATACCCTCATCAGAGCCTATGCCCAAACTGCTCACTCGAAGGATAAAGCCTTGTCTTTGTATGGAATAATGCTTCACGGGCGGATTTTGCCTAATAAATTCACGTACCCATTTGTCTTGAAGGCTTGTGCTGGCCTTGAGGTTTTGAATTTGGGGCGATCGGTTCATGGGTCGGTGGTGAAGTTTGGGTTTGATCGTGATGTTCATGTTCAGAACACTTTGGTTCACATGTACTCCTGTTGCGCCGGTGGGATCAGTTCTGCCCGGAAGGTGTTTGATGAAATGCCCAAGGCAGATTCTGTGACTTGGAGTGCTATGATTGGTGGGTATGCTCGTGTAGGGCGCTCCACTGAAGCAGTAGCTTTATTTAGGGAGATGCAAATGGCGGAGGTTTGCCCTGATGAGATCACTATGGTTTCCATTCTCTCTGCTTGCACTGATTTAGGTGCCCTTGAACTGGGAAAGTGGATTGAAGCTTATATAGAGAGACAAGGAATTCAGAAACCAGCAGAGGTTAGCAATGCACTTATTGACATGTTTGCAAAATGTGGTGATATTAGTAAAGCATTGAAGTTATTTAGAACTATGAATGAGAAAACAATAGTTTCTTGGACTTCTGTTATTGTTGGCATGGCAATGCATGGCCGTGGTCAAGAGGCCATTTGTCTATTTGAGGAGATGATAGGTTCTGGTGTTGCTCCGGATGATGTCGCCTTTATCGGCTTGCTTTCTGCTTGTAGCCATTCGGGACTAGTAGAAAAAGGTAGAGAATATTTCAGTTCTATGATGAAGAAATACAAACTTGTTCCTAAGATAGAGCACTACGGATGCATGGTGGACATGTATTGCAGGACTGGACTAGTGAAAGAGGCTCTTGACTTCGTACGTAACATGCCAATCGAGCCAAATCCAGTAATCTTACGAACACTAGTTAGTGCCTGCCGTGGTCATGGTGAATTCAAGCTTGGAGAAAAGATCACCAAACTGCTAATGAGACACGAGCCCTTGCATGAATCAAACTATGTCTTGCTCTCTAACATTTATGCAAAAATGCTTAGCTGGGAGAAGAAGACCAAAATTAGAGAGGTGATGGAAGTGAAGGGCATGAAAAAGGTTCCAGGGAGCACCATGATTGAGATTGATAACGAAATCTATGAATTTGTTGCTGGAGATAAGTCTCATAAACAGTTCAAAGAAATCTATGAAATGGTGGACGAGATGGGGAGAGAAATGAAGAAATCTGGATATCGTCCTACGACATCTGAAGTTTTGCTCGATATCAACGAAGAGGACAAAGAAGATACCTTGAATAGGCATAGTGAAAAACTAGCCATTGCATTTGGTCTTCTTAGTACTCCACCAGGAACTCCGATTCGAATCGTGAAGAATTTGCGAGTTTGCAGCGATTGCCACTCCGCTTCCAAGTTCATCTCTAAGATTTATGATCGTGAAATCATAATGAGAGACCGCAACCGGTTTCACCACTTCAAGTCTGGGCAGTGCTCATGTGGAGATTTCTGGTGA

Coding sequence (CDS)

ATGCAATCCCAATTCACAAAAACCAAGCTCTTACGTGCAATCAACAATGTTTCAGCTTCTACAACTAATCCTCGTGCAGCGGAGCAGAACTGCTTAGCCCTGCTTCAAGCCTGTAACGCGCTGCCGAAGCTCGCCCAAATCCATACCCACATTCTCAAGTTGGGCCTCCACAACAACCCGCTCGTTCTCACCAAATTCGCCTCCATTTCTTCTCTTATTCATGCTCCTGATTATGCTGCCTCTTTCTTGTTCTCTGCCGAAGCTGATACTCGGCTTTACGACGCGTTTCTTTTCAATACCCTCATCAGAGCCTATGCCCAAACTGCTCACTCGAAGGATAAAGCCTTGTCTTTGTATGGAATAATGCTTCACGGGCGGATTTTGCCTAATAAATTCACGTACCCATTTGTCTTGAAGGCTTGTGCTGGCCTTGAGGTTTTGAATTTGGGGCGATCGGTTCATGGGTCGGTGGTGAAGTTTGGGTTTGATCGTGATGTTCATGTTCAGAACACTTTGGTTCACATGTACTCCTGTTGCGCCGGTGGGATCAGTTCTGCCCGGAAGGTGTTTGATGAAATGCCCAAGGCAGATTCTGTGACTTGGAGTGCTATGATTGGTGGGTATGCTCGTGTAGGGCGCTCCACTGAAGCAGTAGCTTTATTTAGGGAGATGCAAATGGCGGAGGTTTGCCCTGATGAGATCACTATGGTTTCCATTCTCTCTGCTTGCACTGATTTAGGTGCCCTTGAACTGGGAAAGTGGATTGAAGCTTATATAGAGAGACAAGGAATTCAGAAACCAGCAGAGGTTAGCAATGCACTTATTGACATGTTTGCAAAATGTGGTGATATTAGTAAAGCATTGAAGTTATTTAGAACTATGAATGAGAAAACAATAGTTTCTTGGACTTCTGTTATTGTTGGCATGGCAATGCATGGCCGTGGTCAAGAGGCCATTTGTCTATTTGAGGAGATGATAGGTTCTGGTGTTGCTCCGGATGATGTCGCCTTTATCGGCTTGCTTTCTGCTTGTAGCCATTCGGGACTAGTAGAAAAAGGTAGAGAATATTTCAGTTCTATGATGAAGAAATACAAACTTGTTCCTAAGATAGAGCACTACGGATGCATGGTGGACATGTATTGCAGGACTGGACTAGTGAAAGAGGCTCTTGACTTCGTACGTAACATGCCAATCGAGCCAAATCCAGTAATCTTACGAACACTAGTTAGTGCCTGCCGTGGTCATGGTGAATTCAAGCTTGGAGAAAAGATCACCAAACTGCTAATGAGACACGAGCCCTTGCATGAATCAAACTATGTCTTGCTCTCTAACATTTATGCAAAAATGCTTAGCTGGGAGAAGAAGACCAAAATTAGAGAGGTGATGGAAGTGAAGGGCATGAAAAAGGTTCCAGGGAGCACCATGATTGAGATTGATAACGAAATCTATGAATTTGTTGCTGGAGATAAGTCTCATAAACAGTTCAAAGAAATCTATGAAATGGTGGACGAGATGGGGAGAGAAATGAAGAAATCTGGATATCGTCCTACGACATCTGAAGTTTTGCTCGATATCAACGAAGAGGACAAAGAAGATACCTTGAATAGGCATAGTGAAAAACTAGCCATTGCATTTGGTCTTCTTAGTACTCCACCAGGAACTCCGATTCGAATCGTGAAGAATTTGCGAGTTTGCAGCGATTGCCACTCCGCTTCCAAGTTCATCTCTAAGATTTATGATCGTGAAATCATAATGAGAGACCGCAACCGGTTTCACCACTTCAAGTCTGGGCAGTGCTCATGTGGAGATTTCTGGTGA

Protein sequence

MQSQFTKTKLLRAINNVSASTTNPRAAEQNCLALLQACNALPKLAQIHTHILKLGLHNNPLVLTKFASISSLIHAPDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTAHSKDKALSLYGIMLHGRILPNKFTYPFVLKACAGLEVLNLGRSVHGSVVKFGFDRDVHVQNTLVHMYSCCAGGISSARKVFDEMPKADSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSILSACTDLGALELGKWIEAYIERQGIQKPAEVSNALIDMFAKCGDISKALKLFRTMNEKTIVSWTSVIVGMAMHGRGQEAICLFEEMIGSGVAPDDVAFIGLLSACSHSGLVEKGREYFSSMMKKYKLVPKIEHYGCMVDMYCRTGLVKEALDFVRNMPIEPNPVILRTLVSACRGHGEFKLGEKITKLLMRHEPLHESNYVLLSNIYAKMLSWEKKTKIREVMEVKGMKKVPGSTMIEIDNEIYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRPTTSEVLLDINEEDKEDTLNRHSEKLAIAFGLLSTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQCSCGDFW
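
The relationship between the coding sequence and the protein sequence can be spot-checked by translation. The sketch below assumes the standard nuclear genetic code and builds the codon table from the conventional TCAG ordering. It translates only the first 24 and last 15 nucleotides of the CDS shown above, not the full sequence. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code, listed in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Fragments copied from the start and end of the CDS on this page.
cds_start = "ATGCAATCCCAATTCACAAAAACC"
cds_end = "GGAGATTTCTGGTGA"

print(translate(cds_start))  # MQSQFTKT
print(translate(cds_end))    # GDFW
```

Both fragments match the corresponding ends of the protein sequence (`MQSQFTKT...` and `...GDFW`). Passing the full CDS to `translate` would be the natural next step for a complete check.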
Homology
BLAST of Cla97C08G156650 vs. NCBI nr
Match: XP_038884201.1 (pentatricopeptide repeat-containing protein At4g21065-like [Benincasa hispida])

HSP 1 Score: 1162.5 bits (3006), Expect = 0.0e+00
Identity = 573/605 (94.71%), Positives = 589/605 (97.36%), Query Frame = 0

Query: 1   MQSQFTKTKLLRAINNVSASTTNPRAAEQNCLALLQACNALPKLAQIHTHILKLGLHNNP 60
           MQSQFTKTKLLRAINNV ASTTNPRAAEQNCLALLQACNALPKL QIHTHILKLGLHNNP
Sbjct: 1   MQSQFTKTKLLRAINNVVASTTNPRAAEQNCLALLQACNALPKLTQIHTHILKLGLHNNP 60

Query: 61  LVLTKFASISSLIHAPDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTAHSKDKALSLYG 120
           LVLTKFASISSLIHA DYAASFLFSAEADTRLYDAFLFNTLIRAYAQT HSKDKALSLY 
Sbjct: 61  LVLTKFASISSLIHATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALSLYS 120

Query: 121 IMLHGRILPNKFTYPFVLKACAGLEVLNLGRSVHGSVVKFGFDRDVHVQNTLVHMYSCCA 180
           IMLH  ILPNKFTYPFVLKACAGLEVLNLG+SVHGSVVKFGFDRD+HVQNT++HMYSCCA
Sbjct: 121 IMLHDGILPNKFTYPFVLKACAGLEVLNLGQSVHGSVVKFGFDRDIHVQNTMIHMYSCCA 180

Query: 181 GGISSARKVFDEMPKADSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSIL 240
           GGI+SARKVFDEMPK+DSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSIL
Sbjct: 181 GGINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSIL 240

Query: 241 SACTDLGALELGKWIEAYIERQGIQKPAEVSNALIDMFAKCGDISKALKLFRTMNEKTIV 300
           SACTDLGALELGKWIEAYIERQGI KP EVSNALIDMFAKCGDI+KALKLFR +NEKTIV
Sbjct: 241 SACTDLGALELGKWIEAYIERQGIHKPVEVSNALIDMFAKCGDINKALKLFRALNEKTIV 300

Query: 301 SWTSVIVGMAMHGRGQEAICLFEEMIGSGVAPDDVAFIGLLSACSHSGLVEKGREYFSSM 360
           SWTSVIVGMAMHGRGQEAICLFEEMI SGVAPDDV+FIGLLSACSHSGLVE+GREYFSSM
Sbjct: 301 SWTSVIVGMAMHGRGQEAICLFEEMIVSGVAPDDVSFIGLLSACSHSGLVERGREYFSSM 360

Query: 361 MKKYKLVPKIEHYGCMVDMYCRTGLVKEALDFVRNMPIEPNPVILRTLVSACRGHGEFKL 420
           MKKYKL PKIEHYGCMVDMYCRTGLVKEAL FV NMP+EPNPVILRTLVSACRGHGEFKL
Sbjct: 361 MKKYKLAPKIEHYGCMVDMYCRTGLVKEALQFVHNMPVEPNPVILRTLVSACRGHGEFKL 420

Query: 421 GEKITKLLMRHEPLHESNYVLLSNIYAKMLSWEKKTKIREVMEVKGMKKVPGSTMIEIDN 480
           GEKITKLLMRHEPLHESNYVLLSNIYAKMLSWEKKTKIREVMEVKGMKK+PGSTMIEIDN
Sbjct: 421 GEKITKLLMRHEPLHESNYVLLSNIYAKMLSWEKKTKIREVMEVKGMKKIPGSTMIEIDN 480

Query: 481 EIYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRPTTSEVLLDINEEDKEDTLNRHSEK 540
           EIYEFVAGDKSHKQ+KEIYEMVDEMGREMKKSGYRP+TSEVLLDINEEDKEDTLNRHSEK
Sbjct: 481 EIYEFVAGDKSHKQYKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDTLNRHSEK 540

Query: 541 LAIAFGLLSTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQCS 600
           LAIAFGLLSTPPGTPIRIVKNLRVCSDCHSASK+IS IY+REIIMRDRNRFHHFKSG CS
Sbjct: 541 LAIAFGLLSTPPGTPIRIVKNLRVCSDCHSASKYISNIYNREIIMRDRNRFHHFKSGLCS 600

Query: 601 CGDFW 606
           CGDFW
Sbjct: 601 CGDFW 605
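
The percentages in the HSP summary line follow directly from the match counts, and the identity count can be recovered from the aligned rows. This is a minimal sketch with hypothetical helper names (`percent`, `count_identities`). It uses the counts from the hit above and only the first 60-column block of its alignment, so the block-level count is not the HSP total.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Percentage of alignment columns that match, rounded to 2 decimals."""
    return round(100.0 * matches / alignment_length, 2)


def count_identities(query: str, sbjct: str) -> int:
    """Count columns where both aligned rows carry the same residue."""
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have equal length")
    # Gap characters ('-') never count as identities.
    return sum(q == s and q != "-" for q, s in zip(query, sbjct))


# Counts copied from the HSP summary of the Benincasa hispida hit.
print(percent(573, 605))  # identity
print(percent(589, 605))  # similar (conservative) substitutions included

# First alignment block (columns 1-60) of the same hit.
query_block = "MQSQFTKTKLLRAINNVSASTTNPRAAEQNCLALLQACNALPKLAQIHTHILKLGLHNNP"
sbjct_block = "MQSQFTKTKLLRAINNVVASTTNPRAAEQNCLALLQACNALPKLTQIHTHILKLGLHNNP"
print(count_identities(query_block, sbjct_block), "of", len(query_block))
```

The two differing columns in this block (positions 18 and 45) are the ones shown as blanks in the alignment's middle row.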

BLAST of Cla97C08G156650 vs. NCBI nr
Match: KAA0064932.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1151.3 bits (2977), Expect = 0.0e+00
Identity = 571/606 (94.22%), Positives = 587/606 (96.86%), Query Frame = 0

Query: 1   MQSQFTKTKLLRAINNV-SASTTNPRAAEQNCLALLQACNALPKLAQIHTHILKLGLHNN 60
           MQSQFTK KLLR INNV ++STTNPRAAEQNCLALLQACNALPKL QIHTHILKLGLHNN
Sbjct: 1   MQSQFTKPKLLRTINNVLASSTTNPRAAEQNCLALLQACNALPKLTQIHTHILKLGLHNN 60

Query: 61  PLVLTKFASISSLIHAPDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTAHSKDKALSLY 120
           PLVLTKFASISSLIHA DYAASFLFSAEADTRLYDAFLFNTLIRAYAQT HSKDKAL+LY
Sbjct: 61  PLVLTKFASISSLIHATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALALY 120

Query: 121 GIMLHGRILPNKFTYPFVLKACAGLEVLNLGRSVHGSVVKFGFDRDVHVQNTLVHMYSCC 180
           GIMLH  ILPNKFTYPFVLKACAGLEVLNLG+SVHGSVVKFGFD D+HVQNT+VHMYSCC
Sbjct: 121 GIMLHDGILPNKFTYPFVLKACAGLEVLNLGQSVHGSVVKFGFDCDIHVQNTMVHMYSCC 180

Query: 181 AGGISSARKVFDEMPKADSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSI 240
           AGGI+SARKVFDEMPK+DSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVS+
Sbjct: 181 AGGINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSM 240

Query: 241 LSACTDLGALELGKWIEAYIERQGIQKPAEVSNALIDMFAKCGDISKALKLFRTMNEKTI 300
           LSACTDLGALELGKWIEAYIER GI KP EVSNALIDMFAKCGDISKALKLFR MNEKTI
Sbjct: 241 LSACTDLGALELGKWIEAYIERHGIHKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTI 300

Query: 301 VSWTSVIVGMAMHGRGQEAICLFEEMIGSGVAPDDVAFIGLLSACSHSGLVEKGREYFSS 360
           VSWTSVIVGMAMHGRG+EA CLFEEMI SGVAPDDVAFIGLLSACSHSGLVE+GREYF S
Sbjct: 301 VSWTSVIVGMAMHGRGREATCLFEEMITSGVAPDDVAFIGLLSACSHSGLVERGREYFGS 360

Query: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALDFVRNMPIEPNPVILRTLVSACRGHGEFK 420
           MMKKYKLVPKIEHYGCMVDMYCRTGLVKEAL+FVRNMPIEPNPVILRTLV+ACRGHGEFK
Sbjct: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLVTACRGHGEFK 420

Query: 421 LGEKITKLLMRHEPLHESNYVLLSNIYAKMLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480
           LGEKITKLLM+HEPLHESNYVLLSNIYAK LSWEKKTKIREVMEVKGMKKVPGSTMIEID
Sbjct: 421 LGEKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480

Query: 481 NEIYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRPTTSEVLLDINEEDKEDTLNRHSE 540
           NEIYEFVAGDKSHKQ KEIYEMVDEMGREMKKSGYRP+TSEVLLDINEEDKED+LNRHSE
Sbjct: 481 NEIYEFVAGDKSHKQHKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDSLNRHSE 540

Query: 541 KLAIAFGLLSTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC 600
           KLAIAFGLL TPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC
Sbjct: 541 KLAIAFGLLRTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC 600

Query: 601 SCGDFW 606
           SCGDFW
Sbjct: 601 SCGDFW 606

BLAST of Cla97C08G156650 vs. NCBI nr
Match: XP_008445200.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Cucumis melo])

HSP 1 Score: 1150.2 bits (2974), Expect = 0.0e+00
Identity = 570/606 (94.06%), Positives = 587/606 (96.86%), Query Frame = 0

Query: 1   MQSQFTKTKLLRAINNV-SASTTNPRAAEQNCLALLQACNALPKLAQIHTHILKLGLHNN 60
           MQSQFTK KLLR INNV ++STTNPRAAEQNCLALLQACNALPKL QIHTHI+KLGLHNN
Sbjct: 1   MQSQFTKPKLLRTINNVLASSTTNPRAAEQNCLALLQACNALPKLTQIHTHIVKLGLHNN 60

Query: 61  PLVLTKFASISSLIHAPDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTAHSKDKALSLY 120
           PLVLTKFASISSLIHA DYAASFLFSAEADTRLYDAFLFNTLIRAYAQT HSKDKAL+LY
Sbjct: 61  PLVLTKFASISSLIHATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALALY 120

Query: 121 GIMLHGRILPNKFTYPFVLKACAGLEVLNLGRSVHGSVVKFGFDRDVHVQNTLVHMYSCC 180
           GIMLH  ILPNKFTYPFVLKACAGLEVLNLG+SVHGSVVKFGFD D+HVQNT+VHMYSCC
Sbjct: 121 GIMLHDGILPNKFTYPFVLKACAGLEVLNLGQSVHGSVVKFGFDCDIHVQNTMVHMYSCC 180

Query: 181 AGGISSARKVFDEMPKADSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSI 240
           AGGI+SARKVFDEMPK+DSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVS+
Sbjct: 181 AGGINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSM 240

Query: 241 LSACTDLGALELGKWIEAYIERQGIQKPAEVSNALIDMFAKCGDISKALKLFRTMNEKTI 300
           LSACTDLGALELGKWIEAYIER GI KP EVSNALIDMFAKCGDISKALKLFR MNEKTI
Sbjct: 241 LSACTDLGALELGKWIEAYIERHGIHKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTI 300

Query: 301 VSWTSVIVGMAMHGRGQEAICLFEEMIGSGVAPDDVAFIGLLSACSHSGLVEKGREYFSS 360
           VSWTSVIVGMAMHGRG+EA CLFEEMI SGVAPDDVAFIGLLSACSHSGLVE+GREYF S
Sbjct: 301 VSWTSVIVGMAMHGRGREATCLFEEMITSGVAPDDVAFIGLLSACSHSGLVERGREYFGS 360

Query: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALDFVRNMPIEPNPVILRTLVSACRGHGEFK 420
           MMKKYKLVPKIEHYGCMVDMYCRTGLVKEAL+FVRNMPIEPNPVILRTLV+ACRGHGEFK
Sbjct: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLVTACRGHGEFK 420

Query: 421 LGEKITKLLMRHEPLHESNYVLLSNIYAKMLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480
           LGEKITKLLM+HEPLHESNYVLLSNIYAK LSWEKKTKIREVMEVKGMKKVPGSTMIEID
Sbjct: 421 LGEKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480

Query: 481 NEIYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRPTTSEVLLDINEEDKEDTLNRHSE 540
           NEIYEFVAGDKSHKQ KEIYEMVDEMGREMKKSGYRP+TSEVLLDINEEDKED+LNRHSE
Sbjct: 481 NEIYEFVAGDKSHKQHKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDSLNRHSE 540

Query: 541 KLAIAFGLLSTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC 600
           KLAIAFGLL TPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC
Sbjct: 541 KLAIAFGLLRTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC 600

Query: 601 SCGDFW 606
           SCGDFW
Sbjct: 601 SCGDFW 606

BLAST of Cla97C08G156650 vs. NCBI nr
Match: XP_004138859.1 (pentatricopeptide repeat-containing protein At4g21065 [Cucumis sativus] >KGN62942.1 hypothetical protein Csa_021798 [Cucumis sativus])

HSP 1 Score: 1145.6 bits (2962), Expect = 0.0e+00
Identity = 569/606 (93.89%), Positives = 583/606 (96.20%), Query Frame = 0

Query: 1   MQSQFTKTKLLRAINNVSASTT-NPRAAEQNCLALLQACNALPKLAQIHTHILKLGLHNN 60
           MQSQFTK KLLR INNV AS+T NPRA EQNCLALLQACNALPKL QIHTHILKLGLHNN
Sbjct: 1   MQSQFTKPKLLRTINNVLASSTPNPRAPEQNCLALLQACNALPKLTQIHTHILKLGLHNN 60

Query: 61  PLVLTKFASISSLIHAPDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTAHSKDKALSLY 120
           PLVLTKFASISSLIHA DYAASFLFSAEADTRLYDAFLFNTLIRAYAQT HSKDKAL+LY
Sbjct: 61  PLVLTKFASISSLIHATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALALY 120

Query: 121 GIMLHGRILPNKFTYPFVLKACAGLEVLNLGRSVHGSVVKFGFDRDVHVQNTLVHMYSCC 180
           GIMLH  ILPNKFTYPFVLKACAGLEVLNLG++VHGSVVKFGFD D+HVQNT+VHMYSCC
Sbjct: 121 GIMLHDAILPNKFTYPFVLKACAGLEVLNLGQTVHGSVVKFGFDCDIHVQNTMVHMYSCC 180

Query: 181 AGGISSARKVFDEMPKADSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSI 240
           AGGI+SARKVFDEMPK+DSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVS+
Sbjct: 181 AGGINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSM 240

Query: 241 LSACTDLGALELGKWIEAYIERQGIQKPAEVSNALIDMFAKCGDISKALKLFRTMNEKTI 300
           LSACTDLGALELGKWIEAYIER  I KP EVSNALIDMFAKCGDISKALKLFR MNEKTI
Sbjct: 241 LSACTDLGALELGKWIEAYIERHEIHKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTI 300

Query: 301 VSWTSVIVGMAMHGRGQEAICLFEEMIGSGVAPDDVAFIGLLSACSHSGLVEKGREYFSS 360
           VSWTSVIVGMAMHGRGQEA CLFEEM  SGVAPDDVAFIGLLSACSHSGLVE+GREYF S
Sbjct: 301 VSWTSVIVGMAMHGRGQEATCLFEEMTSSGVAPDDVAFIGLLSACSHSGLVERGREYFGS 360

Query: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALDFVRNMPIEPNPVILRTLVSACRGHGEFK 420
           MMKKYKLVPKIEHYGCMVDMYCRTGLVKEAL+FVRNMPIEPNPVILRTLVSACRGHGEFK
Sbjct: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLVSACRGHGEFK 420

Query: 421 LGEKITKLLMRHEPLHESNYVLLSNIYAKMLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480
           LGEKITKLLM+HEPLHESNYVLLSNIYAK LSWEKKTKIREVMEVKGMKKVPGSTMIEID
Sbjct: 421 LGEKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480

Query: 481 NEIYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRPTTSEVLLDINEEDKEDTLNRHSE 540
           NEIYEFVAGDKSHKQ KEIYEMVDEMGREMKKSGYRP+TSEVLLDINEEDKED+LNRHSE
Sbjct: 481 NEIYEFVAGDKSHKQHKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDSLNRHSE 540

Query: 541 KLAIAFGLLSTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC 600
           KLAIAFGLL TPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC
Sbjct: 541 KLAIAFGLLRTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC 600

Query: 601 SCGDFW 606
           SCGDFW
Sbjct: 601 SCGDFW 606

BLAST of Cla97C08G156650 vs. NCBI nr
Match: XP_022131416.1 (pentatricopeptide repeat-containing protein At4g21065-like [Momordica charantia] >XP_022131419.1 pentatricopeptide repeat-containing protein At4g21065-like [Momordica charantia] >XP_022131420.1 pentatricopeptide repeat-containing protein At4g21065-like [Momordica charantia])

HSP 1 Score: 1120.5 bits (2897), Expect = 0.0e+00
Identity = 555/606 (91.58%), Positives = 581/606 (95.87%), Query Frame = 0

Query: 1   MQSQFTKTKLLRAINNVSA-STTNPRAAEQNCLALLQACNALPKLAQIHTHILKLGLHNN 60
           MQSQF+KTKLL AINN    S  NPRAAEQ+CLALLQACNALPKLAQIH HILKLGLHNN
Sbjct: 1   MQSQFSKTKLLLAINNAPVFSRANPRAAEQDCLALLQACNALPKLAQIHAHILKLGLHNN 60

Query: 61  PLVLTKFASISSLIHAPDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTAHSKDKALSLY 120
           PLVLTKFASISS+I A DYAASFLFSA ADTRLYDAFLFNTLIRAYAQT HSK KAL+LY
Sbjct: 61  PLVLTKFASISSVISATDYAASFLFSAGADTRLYDAFLFNTLIRAYAQTGHSKPKALALY 120

Query: 121 GIMLHGRILPNKFTYPFVLKACAGLEVLNLGRSVHGSVVKFGFDRDVHVQNTLVHMYSCC 180
           G+ML   ILPNKFTYPFVLKACAGLEVLNLG+SVHGSVVKFGFDRDVHV+NT+VHMYSCC
Sbjct: 121 GLMLRDGILPNKFTYPFVLKACAGLEVLNLGQSVHGSVVKFGFDRDVHVRNTMVHMYSCC 180

Query: 181 AGGISSARKVFDEMPKADSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSI 240
           AGGI+ ARKVFDEMPK+DSVTWSAMIGGYARVGR TEAV+LFREMQ+AEVCPDEITMVSI
Sbjct: 181 AGGINFARKVFDEMPKSDSVTWSAMIGGYARVGRPTEAVSLFREMQLAEVCPDEITMVSI 240

Query: 241 LSACTDLGALELGKWIEAYIERQGIQKPAEVSNALIDMFAKCGDISKALKLFRTMNEKTI 300
           LSACTDLGALELGKW+EAYIERQGIQKP EVSNALIDMFAKCGDISKALKLF+TM+EKTI
Sbjct: 241 LSACTDLGALELGKWLEAYIERQGIQKPEEVSNALIDMFAKCGDISKALKLFKTMSEKTI 300

Query: 301 VSWTSVIVGMAMHGRGQEAICLFEEMIGSGVAPDDVAFIGLLSACSHSGLVEKGREYFSS 360
           VSWTSVIVGMAMHGRGQ+AICLFEEMIGSGVAPDDVAFIGLLSACSHSG+VE+GREYFSS
Sbjct: 301 VSWTSVIVGMAMHGRGQDAICLFEEMIGSGVAPDDVAFIGLLSACSHSGMVERGREYFSS 360

Query: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALDFVRNMPIEPNPVILRTLVSACRGHGEFK 420
           M KKYKLVPKIEHYGCMVDM+CRTGLVKEAL+FV +MPIEPN VILRTLVSACRGHGEF+
Sbjct: 361 MTKKYKLVPKIEHYGCMVDMFCRTGLVKEALEFVHSMPIEPNAVILRTLVSACRGHGEFQ 420

Query: 421 LGEKITKLLMRHEPLHESNYVLLSNIYAKMLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480
           LGEKITK LMRHEP+HESNYVLLSNIYAKMLSWEKKTKIREVMEVKGMKKVPGSTMIEID
Sbjct: 421 LGEKITKQLMRHEPMHESNYVLLSNIYAKMLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480

Query: 481 NEIYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRPTTSEVLLDINEEDKEDTLNRHSE 540
           NEIYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRP+TSEVLLDINEEDKEDTLNRH E
Sbjct: 481 NEIYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDTLNRHGE 540

Query: 541 KLAIAFGLLSTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC 600
           KLAIAFGLL+TPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFK+G C
Sbjct: 541 KLAIAFGLLNTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKAGIC 600

Query: 601 SCGDFW 606
           SCGDFW
Sbjct: 601 SCGDFW 606

BLAST of Cla97C08G156650 vs. ExPASy Swiss-Prot
Match: A8MQA3 (Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H28 PE=2 SV=2)

HSP 1 Score: 508.4 bits (1308), Expect = 1.1e-142
Identity = 261/584 (44.69%), Positives = 388/584 (66.44%), Query Frame = 0

Query: 29  QNCLALLQ--ACNALPKLAQIHTHILKLGLHNNPLVLTK---FASISSLIHAPDYAASFL 88
           + C+ LLQ    +++ KL QIH   ++ G+  +   L K   F  +S     P   A  +
Sbjct: 16  EKCINLLQTYGVSSITKLRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHKV 75

Query: 89  FSAEADTRLYDAFLFNTLIRAYAQTAHSKDKALSLYGIM-LHGRILPNKFTYPFVLKACA 148
           FS     +  + F++NTLIR YA+  +S   A SLY  M + G + P+  TYPF++KA  
Sbjct: 76  FS--KIEKPINVFIWNTLIRGYAEIGNS-ISAFSLYREMRVSGLVEPDTHTYPFLIKAVT 135

Query: 149 GLEVLNLGRSVHGSVVKFGFDRDVHVQNTLVHMYSCCAGGISSARKVFDEMPKADSVTWS 208
            +  + LG ++H  V++ GF   ++VQN+L+H+Y+ C G ++SA KVFD+MP+ D V W+
Sbjct: 136 TMADVRLGETIHSVVIRSGFGSLIYVQNSLLHLYANC-GDVASAYKVFDKMPEKDLVAWN 195

Query: 209 AMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSILSACTDLGALELGKWIEAYIERQ 268
           ++I G+A  G+  EA+AL+ EM    + PD  T+VS+LSAC  +GAL LGK +  Y+ + 
Sbjct: 196 SVINGFAENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKV 255

Query: 269 GIQKPAEVSNALIDMFAKCGDISKALKLFRTMNEKTIVSWTSVIVGMAMHGRGQEAICLF 328
           G+ +    SN L+D++A+CG + +A  LF  M +K  VSWTS+IVG+A++G G+EAI LF
Sbjct: 256 GLTRNLHSSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELF 315

Query: 329 EEMIGS-GVAPDDVAFIGLLSACSHSGLVEKGREYFSSMMKKYKLVPKIEHYGCMVDMYC 388
           + M  + G+ P ++ F+G+L ACSH G+V++G EYF  M ++YK+ P+IEH+GCMVD+  
Sbjct: 316 KYMESTEGLLPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLA 375

Query: 389 RTGLVKEALDFVRNMPIEPNPVILRTLVSACRGHGEFKLGEKITKLLMRHEPLHESNYVL 448
           R G VK+A +++++MP++PN VI RTL+ AC  HG+  L E     +++ EP H  +YVL
Sbjct: 376 RAGQVKKAYEYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVL 435

Query: 449 LSNIYAKMLSWEKKTKIREVMEVKGMKKVPGSTMIEIDNEIYEFVAGDKSHKQFKEIYEM 508
           LSN+YA    W    KIR+ M   G+KKVPG +++E+ N ++EF+ GDKSH Q   IY  
Sbjct: 436 LSNMYASEQRWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAK 495

Query: 509 VDEMGREMKKSGYRPTTSEVLLDINEEDKEDTLNRHSEKLAIAFGLLSTPPGTPIRIVKN 568
           + EM   ++  GY P  S V +D+ EE+KE+ +  HSEK+AIAF L+STP  +PI +VKN
Sbjct: 496 LKEMTGRLRSEGYVPQISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKN 555

Query: 569 LRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQCSCGDFW 606
           LRVC+DCH A K +SK+Y+REI++RDR+RFHHFK+G CSC D+W
Sbjct: 556 LRVCADCHLAIKLVSKVYNREIVVRDRSRFHHFKNGSCSCQDYW 595

BLAST of Cla97C08G156650 vs. ExPASy Swiss-Prot
Match: Q8LK93 (Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H26 PE=2 SV=2)

HSP 1 Score: 496.9 bits (1278), Expect = 3.2e-139
Identity = 252/580 (43.45%), Positives = 381/580 (65.69%), Query Frame = 0

Query: 29  QNCLALLQACNALPKLAQIHTHILKLGLHNNPLV--LTKFASISSLIHAPDYAASFLFSA 88
           QN + L+  CN+L +L QI  + +K  + +   V  L  F + S    +  Y A  LF A
Sbjct: 30  QNPILLISKCNSLRELMQIQAYAIKSHIEDVSFVAKLINFCTESPTESSMSY-ARHLFEA 89

Query: 89  EADTRLYDAFLFNTLIRAYAQTAHSKDKALSLYGIMLHGRILPNKFTYPFVLKACAGLEV 148
            ++    D  +FN++ R Y++  +  +   SL+  +L   ILP+ +T+P +LKACA  + 
Sbjct: 90  MSEP---DIVIFNSMARGYSRFTNPLE-VFSLFVEILEDGILPDNYTFPSLLKACAVAKA 149

Query: 149 LNLGRSVHGSVVKFGFDRDVHVQNTLVHMYSCCAGGISSARKVFDEMPKADSVTWSAMIG 208
           L  GR +H   +K G D +V+V  TL++MY+ C   + SAR VFD + +   V ++AMI 
Sbjct: 150 LEEGRQLHCLSMKLGLDDNVYVCPTLINMYTECE-DVDSARCVFDRIVEPCVVCYNAMIT 209

Query: 209 GYARVGRSTEAVALFREMQMAEVCPDEITMVSILSACTDLGALELGKWIEAYIERQGIQK 268
           GYAR  R  EA++LFREMQ   + P+EIT++S+LS+C  LG+L+LGKWI  Y ++    K
Sbjct: 210 GYARRNRPNEALSLFREMQGKYLKPNEITLLSVLSSCALLGSLDLGKWIHKYAKKHSFCK 269

Query: 269 PAEVSNALIDMFAKCGDISKALKLFRTMNEKTIVSWTSVIVGMAMHGRGQEAICLFEEMI 328
             +V+ ALIDMFAKCG +  A+ +F  M  K   +W+++IV  A HG+ ++++ +FE M 
Sbjct: 270 YVKVNTALIDMFAKCGSLDDAVSIFEKMRYKDTQAWSAMIVAYANHGKAEKSMLMFERMR 329

Query: 329 GSGVAPDDVAFIGLLSACSHSGLVEKGREYFSSMMKKYKLVPKIEHYGCMVDMYCRTGLV 388
              V PD++ F+GLL+ACSH+G VE+GR+YFS M+ K+ +VP I+HYG MVD+  R G +
Sbjct: 330 SENVQPDEITFLGLLNACSHTGRVEEGRKYFSQMVSKFGIVPSIKHYGSMVDLLSRAGNL 389

Query: 389 KEALDFVRNMPIEPNPVILRTLVSACRGHGEFKLGEKITKLLMRHEPLHESNYVLLSNIY 448
           ++A +F+  +PI P P++ R L++AC  H    L EK+++ +   +  H  +YV+LSN+Y
Sbjct: 390 EDAYEFIDKLPISPTPMLWRILLAACSSHNNLDLAEKVSERIFELDDSHGGDYVILSNLY 449

Query: 449 AKMLSWEKKTKIREVMEVKGMKKVPGSTMIEIDNEIYEFVAGDKSHKQFKEIYEMVDEMG 508
           A+   WE    +R+VM+ +   KVPG + IE++N ++EF +GD       +++  +DEM 
Sbjct: 450 ARNKKWEYVDSLRKVMKDRKAVKVPGCSSIEVNNVVHEFFSGDGVKSATTKLHRALDEMV 509

Query: 509 REMKKSGYRPTTSEVL-LDINEEDKEDTLNRHSEKLAIAFGLLSTPPGTPIRIVKNLRVC 568
           +E+K SGY P TS V+  ++N+++KE TL  HSEKLAI FGLL+TPPGT IR+VKNLRVC
Sbjct: 510 KELKLSGYVPDTSMVVHANMNDQEKEITLRYHSEKLAITFGLLNTPPGTTIRVVKNLRVC 569

Query: 569 SDCHSASKFISKIYDREIIMRDRNRFHHFKSGQCSCGDFW 606
            DCH+A+K IS I+ R++++RD  RFHHF+ G+CSCGDFW
Sbjct: 570 RDCHNAAKLISLIFGRKVVLRDVQRFHHFEDGKCSCGDFW 603

BLAST of Cla97C08G156650 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 488.8 bits (1257), Expect = 8.8e-137
Identity = 256/604 (42.38%), Positives = 376/604 (62.25%), Query Frame = 0

Query: 34  LLQAC---NALPKLAQIHTHILKLGLHNNPLVLTKFAS---------------------- 93
           +L++C    A  +  QIH H+LKLG   +  V T   S                      
Sbjct: 140 VLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRD 199

Query: 94  -ISSLIHAPDYAA-SFLFSAEA---DTRLYDAFLFNTLIRAYAQTAHSKDKALSLYGIML 153
            +S       YA+  ++ +A+    +  + D   +N +I  YA+T + K+ AL L+  M+
Sbjct: 200 VVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKE-ALELFKDMM 259

Query: 154 HGRILPNKFTYPFVLKACAGLEVLNLGRSVHGSVVKFGFDRDVHVQNTLVHMYSCCAGGI 213
              + P++ T   V+ ACA    + LGR VH  +   GF  ++ + N L+ +YS C G +
Sbjct: 260 KTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKC-GEL 319

Query: 214 SSARKVFDEMPKADSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSILSAC 273
            +A  +F+ +P  D ++W+ +IGGY  +    EA+ LF+EM  +   P+++TM+SIL AC
Sbjct: 320 ETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPAC 379

Query: 274 TDLGALELGKWIEAYIER--QGIQKPAEVSNALIDMFAKCGDISKALKLFRTMNEKTIVS 333
             LGA+++G+WI  YI++  +G+   + +  +LIDM+AKCGDI  A ++F ++  K++ S
Sbjct: 380 AHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSS 439

Query: 334 WTSVIVGMAMHGRGQEAICLFEEMIGSGVAPDDVAFIGLLSACSHSGLVEKGREYFSSMM 393
           W ++I G AMHGR   +  LF  M   G+ PDD+ F+GLLSACSHSG+++ GR  F +M 
Sbjct: 440 WNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMT 499

Query: 394 KKYKLVPKIEHYGCMVDMYCRTGLVKEALDFVRNMPIEPNPVILRTLVSACRGHGEFKLG 453
           + YK+ PK+EHYGCM+D+   +GL KEA + +  M +EP+ VI  +L+ AC+ HG  +LG
Sbjct: 500 QDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELG 559

Query: 454 EKITKLLMRHEPLHESNYVLLSNIYAKMLSWEKKTKIREVMEVKGMKKVPGSTMIEIDNE 513
           E   + L++ EP +  +YVLLSNIYA    W +  K R ++  KGMKKVPG + IEID+ 
Sbjct: 560 ESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSV 619

Query: 514 IYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRPTTSEVLLDINEEDKEDTLNRHSEKL 573
           ++EF+ GDK H + +EIY M++EM   ++K+G+ P TSEVL ++ EE KE  L  HSEKL
Sbjct: 620 VHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKL 679

Query: 574 AIAFGLLSTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQCSC 606
           AIAFGL+ST PGT + IVKNLRVC +CH A+K ISKIY REII RDR RFHHF+ G CSC
Sbjct: 680 AIAFGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSC 739

BLAST of Cla97C08G156650 vs. ExPASy Swiss-Prot
Match: Q9FI80 (Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H38 PE=2 SV=1)

HSP 1 Score: 481.5 bits (1238), Expect = 1.4e-134
Identity = 254/639 (39.75%), Positives = 384/639 (60.09%), Query Frame = 0

Query: 18  SASTTNPRAAEQNCLALLQACNALPKLAQIHTHILKLGLHNNPLV---LTKFASISSLIH 77
           ++  ++P +   +    +  C  +  L+QIH   +K G   + L    + +F + S L H
Sbjct: 13  NSPASSPASHPSSLFPQINNCRTIRDLSQIHAVFIKSGQMRDTLAAAEILRFCATSDLHH 72

Query: 78  APDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTAHSKDKAL----SLYGIMLHGRILPN 137
                A  +F+        + F +NT+IR ++++   +DKAL      Y +M    + PN
Sbjct: 73  RDLDYAHKIFNQMPQR---NCFSWNTIIRGFSES--DEDKALIAITLFYEMMSDEFVEPN 132

Query: 138 KFTYPFVLKACAGLEVLNLGRSVHGSVVKFGFDRDVHVQNTLVHMYSCCA---------- 197
           +FT+P VLKACA    +  G+ +HG  +K+GF  D  V + LV MY  C           
Sbjct: 133 RFTFPSVLKACAKTGKIQEGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVLFY 192

Query: 198 ----------------------------------GGISSARKVFDEMPKADSVTWSAMIG 257
                                             G   +AR +FD+M +   V+W+ MI 
Sbjct: 193 KNIIEKDMVVMTDRRKRDGEIVLWNVMIDGYMRLGDCKAARMLFDKMRQRSVVSWNTMIS 252

Query: 258 GYARVGRSTEAVALFREMQMAEVCPDEITMVSILSACTDLGALELGKWIEAYIERQGIQK 317
           GY+  G   +AV +FREM+  ++ P+ +T+VS+L A + LG+LELG+W+  Y E  GI+ 
Sbjct: 253 GYSLNGFFKDAVEVFREMKKGDIRPNYVTLVSVLPAISRLGSLELGEWLHLYAEDSGIRI 312

Query: 318 PAEVSNALIDMFAKCGDISKALKLFRTMNEKTIVSWTSVIVGMAMHGRGQEAICLFEEMI 377
              + +ALIDM++KCG I KA+ +F  +  + +++W+++I G A+HG+  +AI  F +M 
Sbjct: 313 DDVLGSALIDMYSKCGIIEKAIHVFERLPRENVITWSAMINGFAIHGQAGDAIDCFCKMR 372

Query: 378 GSGVAPDDVAFIGLLSACSHSGLVEKGREYFSSMMKKYKLVPKIEHYGCMVDMYCRTGLV 437
            +GV P DVA+I LL+ACSH GLVE+GR YFS M+    L P+IEHYGCMVD+  R+GL+
Sbjct: 373 QAGVRPSDVAYINLLTACSHGGLVEEGRRYFSQMVSVDGLEPRIEHYGCMVDLLGRSGLL 432

Query: 438 KEALDFVRNMPIEPNPVILRTLVSACRGHGEFKLGEKITKLLMRHEPLHESNYVLLSNIY 497
            EA +F+ NMPI+P+ VI + L+ ACR  G  ++G+++  +LM   P     YV LSN+Y
Sbjct: 433 DEAEEFILNMPIKPDDVIWKALLGACRMQGNVEMGKRVANILMDMVPHDSGAYVALSNMY 492

Query: 498 AKMLSWEKKTKIREVMEVKGMKKVPGSTMIEIDNEIYEFVAGDKSHKQFKEIYEMVDEMG 557
           A   +W + +++R  M+ K ++K PG ++I+ID  ++EFV  D SH + KEI  M+ E+ 
Sbjct: 493 ASQGNWSEVSEMRLRMKEKDIRKDPGCSLIDIDGVLHEFVVEDDSHPKAKEINSMLVEIS 552

Query: 558 REMKKSGYRPTTSEVLLDINEEDKEDTLNRHSEKLAIAFGLLSTPPGTPIRIVKNLRVCS 606
            +++ +GYRP T++VLL++ EEDKE+ L+ HSEK+A AFGL+ST PG PIRIVKNLR+C 
Sbjct: 553 DKLRLAGYRPITTQVLLNLEEEDKENVLHYHSEKIATAFGLISTSPGKPIRIVKNLRICE 612

BLAST of Cla97C08G156650 vs. ExPASy Swiss-Prot
Match: Q9FJY7 (Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H61 PE=2 SV=1)

HSP 1 Score: 476.5 bits (1225), Expect = 4.5e-133
Identity = 239/606 (39.44%), Positives = 373/606 (61.55%), Query Frame = 0

Query: 32  LALLQACNALPKLAQIHTHILKLGLHNNPLVLTKFASISSLIHAPDYAASFLFSAEADTR 91
           ++ LQ C+   +L QIH  +LK GL  +   +TKF S      + D+        +   R
Sbjct: 18  MSCLQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVFDGFDR 77

Query: 92  LYDAFLFNTLIRAYAQTAHSKDKALSLYGIMLHGRILPNKFTYPFVLKACAGLEVLNLGR 151
             D FL+N +IR ++  +   +++L LY  ML      N +T+P +LKAC+ L       
Sbjct: 78  -PDTFLWNLMIRGFS-CSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFEETT 137

Query: 152 SVHGSVVKFGFDRDVHVQNTLVHMYSCCAGGISSARKVFDEMPKADSVTWSAMIGGYARV 211
            +H  + K G++ DV+  N+L++ Y+   G    A  +FD +P+ D V+W+++I GY + 
Sbjct: 138 QIHAQITKLGYENDVYAVNSLINSYA-VTGNFKLAHLLFDRIPEPDDVSWNSVIKGYVKA 197

Query: 212 GR-------------------------------STEAVALFREMQMAEVCPDEITMVSIL 271
           G+                               + EA+ LF EMQ ++V PD +++ + L
Sbjct: 198 GKMDIALTLFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVSLANAL 257

Query: 272 SACTDLGALELGKWIEAYIERQGIQKPAEVSNALIDMFAKCGDISKALKLFRTMNEKTIV 331
           SAC  LGALE GKWI +Y+ +  I+  + +   LIDM+AKCG++ +AL++F+ + +K++ 
Sbjct: 258 SACAQLGALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKKKSVQ 317

Query: 332 SWTSVIVGMAMHGRGQEAICLFEEMIGSGVAPDDVAFIGLLSACSHSGLVEKGREYFSSM 391
           +WT++I G A HG G+EAI  F EM   G+ P+ + F  +L+ACS++GLVE+G+  F SM
Sbjct: 318 AWTALISGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIFYSM 377

Query: 392 MKKYKLVPKIEHYGCMVDMYCRTGLVKEALDFVRNMPIEPNPVILRTLVSACRGHGEFKL 451
            + Y L P IEHYGC+VD+  R GL+ EA  F++ MP++PN VI   L+ ACR H   +L
Sbjct: 378 ERDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKNIEL 437

Query: 452 GEKITKLLMRHEPLHESNYVLLSNIYAKMLSWEKKTKIREVMEVKGMKKVPGSTMIEIDN 511
           GE+I ++L+  +P H   YV  +NI+A    W+K  + R +M+ +G+ KVPG + I ++ 
Sbjct: 438 GEEIGEILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTISLEG 497

Query: 512 EIYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRPTTSEVLLD-INEEDKEDTLNRHSE 571
             +EF+AGD+SH + ++I      M R+++++GY P   E+LLD ++++++E  +++HSE
Sbjct: 498 TTHEFLAGDRSHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDDEREAIVHQHSE 557

Query: 572 KLAIAFGLLSTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC 606
           KLAI +GL+ T PGT IRI+KNLRVC DCH  +K ISKIY R+I+MRDR RFHHF+ G+C
Sbjct: 558 KLAITYGLIKTKPGTIIRIMKNLRVCKDCHKVTKLISKIYKRDIVMRDRTRFHHFRDGKC 617

BLAST of Cla97C08G156650 vs. ExPASy TrEMBL
Match: A0A5A7V9A4 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold82G002500 PE=3 SV=1)

HSP 1 Score: 1151.3 bits (2977), Expect = 0.0e+00
Identity = 571/606 (94.22%), Positives = 587/606 (96.86%), Query Frame = 0

Query: 1   MQSQFTKTKLLRAINNV-SASTTNPRAAEQNCLALLQACNALPKLAQIHTHILKLGLHNN 60
           MQSQFTK KLLR INNV ++STTNPRAAEQNCLALLQACNALPKL QIHTHILKLGLHNN
Sbjct: 1   MQSQFTKPKLLRTINNVLASSTTNPRAAEQNCLALLQACNALPKLTQIHTHILKLGLHNN 60

Query: 61  PLVLTKFASISSLIHAPDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTAHSKDKALSLY 120
           PLVLTKFASISSLIHA DYAASFLFSAEADTRLYDAFLFNTLIRAYAQT HSKDKAL+LY
Sbjct: 61  PLVLTKFASISSLIHATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALALY 120

Query: 121 GIMLHGRILPNKFTYPFVLKACAGLEVLNLGRSVHGSVVKFGFDRDVHVQNTLVHMYSCC 180
           GIMLH  ILPNKFTYPFVLKACAGLEVLNLG+SVHGSVVKFGFD D+HVQNT+VHMYSCC
Sbjct: 121 GIMLHDGILPNKFTYPFVLKACAGLEVLNLGQSVHGSVVKFGFDCDIHVQNTMVHMYSCC 180

Query: 181 AGGISSARKVFDEMPKADSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSI 240
           AGGI+SARKVFDEMPK+DSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVS+
Sbjct: 181 AGGINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSM 240

Query: 241 LSACTDLGALELGKWIEAYIERQGIQKPAEVSNALIDMFAKCGDISKALKLFRTMNEKTI 300
           LSACTDLGALELGKWIEAYIER GI KP EVSNALIDMFAKCGDISKALKLFR MNEKTI
Sbjct: 241 LSACTDLGALELGKWIEAYIERHGIHKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTI 300

Query: 301 VSWTSVIVGMAMHGRGQEAICLFEEMIGSGVAPDDVAFIGLLSACSHSGLVEKGREYFSS 360
           VSWTSVIVGMAMHGRG+EA CLFEEMI SGVAPDDVAFIGLLSACSHSGLVE+GREYF S
Sbjct: 301 VSWTSVIVGMAMHGRGREATCLFEEMITSGVAPDDVAFIGLLSACSHSGLVERGREYFGS 360

Query: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALDFVRNMPIEPNPVILRTLVSACRGHGEFK 420
           MMKKYKLVPKIEHYGCMVDMYCRTGLVKEAL+FVRNMPIEPNPVILRTLV+ACRGHGEFK
Sbjct: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLVTACRGHGEFK 420

Query: 421 LGEKITKLLMRHEPLHESNYVLLSNIYAKMLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480
           LGEKITKLLM+HEPLHESNYVLLSNIYAK LSWEKKTKIREVMEVKGMKKVPGSTMIEID
Sbjct: 421 LGEKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480

Query: 481 NEIYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRPTTSEVLLDINEEDKEDTLNRHSE 540
           NEIYEFVAGDKSHKQ KEIYEMVDEMGREMKKSGYRP+TSEVLLDINEEDKED+LNRHSE
Sbjct: 481 NEIYEFVAGDKSHKQHKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDSLNRHSE 540

Query: 541 KLAIAFGLLSTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC 600
           KLAIAFGLL TPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC
Sbjct: 541 KLAIAFGLLRTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC 600

Query: 601 SCGDFW 606
           SCGDFW
Sbjct: 601 SCGDFW 606

BLAST of Cla97C08G156650 vs. ExPASy TrEMBL
Match: A0A1S3BC37 (pentatricopeptide repeat-containing protein At4g21065-like OS=Cucumis melo OX=3656 GN=LOC103488302 PE=3 SV=1)

HSP 1 Score: 1150.2 bits (2974), Expect = 0.0e+00
Identity = 570/606 (94.06%), Positives = 587/606 (96.86%), Query Frame = 0

Query: 1   MQSQFTKTKLLRAINNV-SASTTNPRAAEQNCLALLQACNALPKLAQIHTHILKLGLHNN 60
           MQSQFTK KLLR INNV ++STTNPRAAEQNCLALLQACNALPKL QIHTHI+KLGLHNN
Sbjct: 1   MQSQFTKPKLLRTINNVLASSTTNPRAAEQNCLALLQACNALPKLTQIHTHIVKLGLHNN 60

Query: 61  PLVLTKFASISSLIHAPDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTAHSKDKALSLY 120
           PLVLTKFASISSLIHA DYAASFLFSAEADTRLYDAFLFNTLIRAYAQT HSKDKAL+LY
Sbjct: 61  PLVLTKFASISSLIHATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALALY 120

Query: 121 GIMLHGRILPNKFTYPFVLKACAGLEVLNLGRSVHGSVVKFGFDRDVHVQNTLVHMYSCC 180
           GIMLH  ILPNKFTYPFVLKACAGLEVLNLG+SVHGSVVKFGFD D+HVQNT+VHMYSCC
Sbjct: 121 GIMLHDGILPNKFTYPFVLKACAGLEVLNLGQSVHGSVVKFGFDCDIHVQNTMVHMYSCC 180

Query: 181 AGGISSARKVFDEMPKADSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSI 240
           AGGI+SARKVFDEMPK+DSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVS+
Sbjct: 181 AGGINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSM 240

Query: 241 LSACTDLGALELGKWIEAYIERQGIQKPAEVSNALIDMFAKCGDISKALKLFRTMNEKTI 300
           LSACTDLGALELGKWIEAYIER GI KP EVSNALIDMFAKCGDISKALKLFR MNEKTI
Sbjct: 241 LSACTDLGALELGKWIEAYIERHGIHKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTI 300

Query: 301 VSWTSVIVGMAMHGRGQEAICLFEEMIGSGVAPDDVAFIGLLSACSHSGLVEKGREYFSS 360
           VSWTSVIVGMAMHGRG+EA CLFEEMI SGVAPDDVAFIGLLSACSHSGLVE+GREYF S
Sbjct: 301 VSWTSVIVGMAMHGRGREATCLFEEMITSGVAPDDVAFIGLLSACSHSGLVERGREYFGS 360

Query: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALDFVRNMPIEPNPVILRTLVSACRGHGEFK 420
           MMKKYKLVPKIEHYGCMVDMYCRTGLVKEAL+FVRNMPIEPNPVILRTLV+ACRGHGEFK
Sbjct: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLVTACRGHGEFK 420

Query: 421 LGEKITKLLMRHEPLHESNYVLLSNIYAKMLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480
           LGEKITKLLM+HEPLHESNYVLLSNIYAK LSWEKKTKIREVMEVKGMKKVPGSTMIEID
Sbjct: 421 LGEKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480

Query: 481 NEIYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRPTTSEVLLDINEEDKEDTLNRHSE 540
           NEIYEFVAGDKSHKQ KEIYEMVDEMGREMKKSGYRP+TSEVLLDINEEDKED+LNRHSE
Sbjct: 481 NEIYEFVAGDKSHKQHKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDSLNRHSE 540

Query: 541 KLAIAFGLLSTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC 600
           KLAIAFGLL TPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC
Sbjct: 541 KLAIAFGLLRTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC 600

Query: 601 SCGDFW 606
           SCGDFW
Sbjct: 601 SCGDFW 606

BLAST of Cla97C08G156650 vs. ExPASy TrEMBL
Match: A0A0A0LQ71 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G381680 PE=3 SV=1)

HSP 1 Score: 1145.6 bits (2962), Expect = 0.0e+00
Identity = 569/606 (93.89%), Positives = 583/606 (96.20%), Query Frame = 0

Query: 1   MQSQFTKTKLLRAINNVSASTT-NPRAAEQNCLALLQACNALPKLAQIHTHILKLGLHNN 60
           MQSQFTK KLLR INNV AS+T NPRA EQNCLALLQACNALPKL QIHTHILKLGLHNN
Sbjct: 1   MQSQFTKPKLLRTINNVLASSTPNPRAPEQNCLALLQACNALPKLTQIHTHILKLGLHNN 60

Query: 61  PLVLTKFASISSLIHAPDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTAHSKDKALSLY 120
           PLVLTKFASISSLIHA DYAASFLFSAEADTRLYDAFLFNTLIRAYAQT HSKDKAL+LY
Sbjct: 61  PLVLTKFASISSLIHATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALALY 120

Query: 121 GIMLHGRILPNKFTYPFVLKACAGLEVLNLGRSVHGSVVKFGFDRDVHVQNTLVHMYSCC 180
           GIMLH  ILPNKFTYPFVLKACAGLEVLNLG++VHGSVVKFGFD D+HVQNT+VHMYSCC
Sbjct: 121 GIMLHDAILPNKFTYPFVLKACAGLEVLNLGQTVHGSVVKFGFDCDIHVQNTMVHMYSCC 180

Query: 181 AGGISSARKVFDEMPKADSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSI 240
           AGGI+SARKVFDEMPK+DSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVS+
Sbjct: 181 AGGINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSM 240

Query: 241 LSACTDLGALELGKWIEAYIERQGIQKPAEVSNALIDMFAKCGDISKALKLFRTMNEKTI 300
           LSACTDLGALELGKWIEAYIER  I KP EVSNALIDMFAKCGDISKALKLFR MNEKTI
Sbjct: 241 LSACTDLGALELGKWIEAYIERHEIHKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTI 300

Query: 301 VSWTSVIVGMAMHGRGQEAICLFEEMIGSGVAPDDVAFIGLLSACSHSGLVEKGREYFSS 360
           VSWTSVIVGMAMHGRGQEA CLFEEM  SGVAPDDVAFIGLLSACSHSGLVE+GREYF S
Sbjct: 301 VSWTSVIVGMAMHGRGQEATCLFEEMTSSGVAPDDVAFIGLLSACSHSGLVERGREYFGS 360

Query: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALDFVRNMPIEPNPVILRTLVSACRGHGEFK 420
           MMKKYKLVPKIEHYGCMVDMYCRTGLVKEAL+FVRNMPIEPNPVILRTLVSACRGHGEFK
Sbjct: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLVSACRGHGEFK 420

Query: 421 LGEKITKLLMRHEPLHESNYVLLSNIYAKMLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480
           LGEKITKLLM+HEPLHESNYVLLSNIYAK LSWEKKTKIREVMEVKGMKKVPGSTMIEID
Sbjct: 421 LGEKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480

Query: 481 NEIYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRPTTSEVLLDINEEDKEDTLNRHSE 540
           NEIYEFVAGDKSHKQ KEIYEMVDEMGREMKKSGYRP+TSEVLLDINEEDKED+LNRHSE
Sbjct: 481 NEIYEFVAGDKSHKQHKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDSLNRHSE 540

Query: 541 KLAIAFGLLSTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC 600
           KLAIAFGLL TPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC
Sbjct: 541 KLAIAFGLLRTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC 600

Query: 601 SCGDFW 606
           SCGDFW
Sbjct: 601 SCGDFW 606

BLAST of Cla97C08G156650 vs. ExPASy TrEMBL
Match: A0A6J1BQ70 (pentatricopeptide repeat-containing protein At4g21065-like OS=Momordica charantia OX=3673 GN=LOC111004636 PE=3 SV=1)

HSP 1 Score: 1120.5 bits (2897), Expect = 0.0e+00
Identity = 555/606 (91.58%), Positives = 581/606 (95.87%), Query Frame = 0

Query: 1   MQSQFTKTKLLRAINNVSA-STTNPRAAEQNCLALLQACNALPKLAQIHTHILKLGLHNN 60
           MQSQF+KTKLL AINN    S  NPRAAEQ+CLALLQACNALPKLAQIH HILKLGLHNN
Sbjct: 1   MQSQFSKTKLLLAINNAPVFSRANPRAAEQDCLALLQACNALPKLAQIHAHILKLGLHNN 60

Query: 61  PLVLTKFASISSLIHAPDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTAHSKDKALSLY 120
           PLVLTKFASISS+I A DYAASFLFSA ADTRLYDAFLFNTLIRAYAQT HSK KAL+LY
Sbjct: 61  PLVLTKFASISSVISATDYAASFLFSAGADTRLYDAFLFNTLIRAYAQTGHSKPKALALY 120

Query: 121 GIMLHGRILPNKFTYPFVLKACAGLEVLNLGRSVHGSVVKFGFDRDVHVQNTLVHMYSCC 180
           G+ML   ILPNKFTYPFVLKACAGLEVLNLG+SVHGSVVKFGFDRDVHV+NT+VHMYSCC
Sbjct: 121 GLMLRDGILPNKFTYPFVLKACAGLEVLNLGQSVHGSVVKFGFDRDVHVRNTMVHMYSCC 180

Query: 181 AGGISSARKVFDEMPKADSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSI 240
           AGGI+ ARKVFDEMPK+DSVTWSAMIGGYARVGR TEAV+LFREMQ+AEVCPDEITMVSI
Sbjct: 181 AGGINFARKVFDEMPKSDSVTWSAMIGGYARVGRPTEAVSLFREMQLAEVCPDEITMVSI 240

Query: 241 LSACTDLGALELGKWIEAYIERQGIQKPAEVSNALIDMFAKCGDISKALKLFRTMNEKTI 300
           LSACTDLGALELGKW+EAYIERQGIQKP EVSNALIDMFAKCGDISKALKLF+TM+EKTI
Sbjct: 241 LSACTDLGALELGKWLEAYIERQGIQKPEEVSNALIDMFAKCGDISKALKLFKTMSEKTI 300

Query: 301 VSWTSVIVGMAMHGRGQEAICLFEEMIGSGVAPDDVAFIGLLSACSHSGLVEKGREYFSS 360
           VSWTSVIVGMAMHGRGQ+AICLFEEMIGSGVAPDDVAFIGLLSACSHSG+VE+GREYFSS
Sbjct: 301 VSWTSVIVGMAMHGRGQDAICLFEEMIGSGVAPDDVAFIGLLSACSHSGMVERGREYFSS 360

Query: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALDFVRNMPIEPNPVILRTLVSACRGHGEFK 420
           M KKYKLVPKIEHYGCMVDM+CRTGLVKEAL+FV +MPIEPN VILRTLVSACRGHGEF+
Sbjct: 361 MTKKYKLVPKIEHYGCMVDMFCRTGLVKEALEFVHSMPIEPNAVILRTLVSACRGHGEFQ 420

Query: 421 LGEKITKLLMRHEPLHESNYVLLSNIYAKMLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480
           LGEKITK LMRHEP+HESNYVLLSNIYAKMLSWEKKTKIREVMEVKGMKKVPGSTMIEID
Sbjct: 421 LGEKITKQLMRHEPMHESNYVLLSNIYAKMLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480

Query: 481 NEIYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRPTTSEVLLDINEEDKEDTLNRHSE 540
           NEIYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRP+TSEVLLDINEEDKEDTLNRH E
Sbjct: 481 NEIYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDTLNRHGE 540

Query: 541 KLAIAFGLLSTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC 600
           KLAIAFGLL+TPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFK+G C
Sbjct: 541 KLAIAFGLLNTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKAGIC 600

Query: 601 SCGDFW 606
           SCGDFW
Sbjct: 601 SCGDFW 606

BLAST of Cla97C08G156650 vs. ExPASy TrEMBL
Match: A0A6J1KQ01 (pentatricopeptide repeat-containing protein At4g21065-like OS=Cucurbita maxima OX=3661 GN=LOC111496170 PE=3 SV=1)

HSP 1 Score: 1107.8 bits (2864), Expect = 0.0e+00
Identity = 552/606 (91.09%), Positives = 573/606 (94.55%), Query Frame = 0

Query: 1   MQSQFTKTKLLRAINNVSASTTNPRAAEQNCLALLQACNALPKLAQIHTHILKLGLHNNP 60
           MQSQF    +LR INN +AS +NPRAAEQNCLALLQACN LPKL QIH HI KLGLHNNP
Sbjct: 1   MQSQF----VLRVINNATASRSNPRAAEQNCLALLQACNLLPKLTQIHAHIFKLGLHNNP 60

Query: 61  LVLTKFASISSLIHAPDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTAHSKDKALSLYG 120
           LVLTKF SISS+I+A DYAASFLFSAEADTRLYDAFLFNTLIRA+AQT HSK +ALSLYG
Sbjct: 61  LVLTKFVSISSVINATDYAASFLFSAEADTRLYDAFLFNTLIRAFAQTGHSKARALSLYG 120

Query: 121 IMLHGRILPNKFTYPFVLKACAGLEVLNLGRSVHGSVVKFGFDRDVHVQNTLVHMYSCCA 180
           IMLH  ILPNKFTYPFVLKACAGLEVL+LG+SVHGSVVKFGFD DVHVQNT+VHMYSCCA
Sbjct: 121 IMLHDGILPNKFTYPFVLKACAGLEVLSLGQSVHGSVVKFGFDHDVHVQNTMVHMYSCCA 180

Query: 181 GGISSARKVFDEMPKADSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSIL 240
            GI  ARKVFDEMPK+DSVTWSAMIGGYARVGRSTEAVALFREMQMAEV PDEITMVS+L
Sbjct: 181 DGIIFARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVFPDEITMVSVL 240

Query: 241 SACTDLGALELGKWIEAYIERQGIQKPAEVSNALIDMFAKCGDISKALKLFRTMNEKTIV 300
           SACTDLGALELGKWIEAYIERQGIQKP EVSNALIDMFAKCGDI KALKLFR M+EKTIV
Sbjct: 241 SACTDLGALELGKWIEAYIERQGIQKPVEVSNALIDMFAKCGDIGKALKLFRVMSEKTIV 300

Query: 301 SWTSVIVGMAMHGRGQEAICLFEEMIG-SGVAPDDVAFIGLLSACSHSGLVEKGREYFSS 360
           SWTSVIVGMAMHGRGQEAICLFEEMIG SGVAPDDVAFIGLLSACSHSGLVE+GREYFSS
Sbjct: 301 SWTSVIVGMAMHGRGQEAICLFEEMIGSSGVAPDDVAFIGLLSACSHSGLVERGREYFSS 360

Query: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALDFVRNMPIEPNPVILRTLVSACRGHGEFK 420
           MMKKYKLVPKIEHYGCMVDMYCRTGLVKEAL+FV NMPIEPN VILRTLVSACRGHGEFK
Sbjct: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVHNMPIEPNTVILRTLVSACRGHGEFK 420

Query: 421 LGEKITKLLMRHEPLHESNYVLLSNIYAKMLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480
           LGEKITKLLMRHEP+HESNYVLLSNIYAKM +WEKK KIREVMEVKGMKKVPGSTMIEID
Sbjct: 421 LGEKITKLLMRHEPMHESNYVLLSNIYAKMFNWEKKAKIREVMEVKGMKKVPGSTMIEID 480

Query: 481 NEIYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRPTTSEVLLDINEEDKEDTLNRHSE 540
           NEIYEFVAGDKSHKQFKEIY MVDEMGREM KSGYRP+TSEVLLDINEEDKEDTLNRHSE
Sbjct: 481 NEIYEFVAGDKSHKQFKEIYAMVDEMGREMTKSGYRPSTSEVLLDINEEDKEDTLNRHSE 540

Query: 541 KLAIAFGLLSTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC 600
           KLAIAFGLL+TPPGTPIRIVKNLRVC+DCHSASKFISKIYDREIIMRDRNRFHHFK+G C
Sbjct: 541 KLAIAFGLLNTPPGTPIRIVKNLRVCTDCHSASKFISKIYDREIIMRDRNRFHHFKAGLC 600

Query: 601 SCGDFW 606
           SCGDFW
Sbjct: 601 SCGDFW 602

BLAST of Cla97C08G156650 vs. TAIR 10
Match: AT4G21065.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 508.4 bits (1308), Expect = 7.7e-144
Identity = 261/584 (44.69%), Positives = 388/584 (66.44%), Query Frame = 0

Query: 29  QNCLALLQ--ACNALPKLAQIHTHILKLGLHNNPLVLTK---FASISSLIHAPDYAASFL 88
           + C+ LLQ    +++ KL QIH   ++ G+  +   L K   F  +S     P   A  +
Sbjct: 16  EKCINLLQTYGVSSITKLRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHKV 75

Query: 89  FSAEADTRLYDAFLFNTLIRAYAQTAHSKDKALSLYGIM-LHGRILPNKFTYPFVLKACA 148
           FS     +  + F++NTLIR YA+  +S   A SLY  M + G + P+  TYPF++KA  
Sbjct: 76  FS--KIEKPINVFIWNTLIRGYAEIGNS-ISAFSLYREMRVSGLVEPDTHTYPFLIKAVT 135

Query: 149 GLEVLNLGRSVHGSVVKFGFDRDVHVQNTLVHMYSCCAGGISSARKVFDEMPKADSVTWS 208
            +  + LG ++H  V++ GF   ++VQN+L+H+Y+ C G ++SA KVFD+MP+ D V W+
Sbjct: 136 TMADVRLGETIHSVVIRSGFGSLIYVQNSLLHLYANC-GDVASAYKVFDKMPEKDLVAWN 195

Query: 209 AMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSILSACTDLGALELGKWIEAYIERQ 268
           ++I G+A  G+  EA+AL+ EM    + PD  T+VS+LSAC  +GAL LGK +  Y+ + 
Sbjct: 196 SVINGFAENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKV 255

Query: 269 GIQKPAEVSNALIDMFAKCGDISKALKLFRTMNEKTIVSWTSVIVGMAMHGRGQEAICLF 328
           G+ +    SN L+D++A+CG + +A  LF  M +K  VSWTS+IVG+A++G G+EAI LF
Sbjct: 256 GLTRNLHSSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELF 315

Query: 329 EEMIGS-GVAPDDVAFIGLLSACSHSGLVEKGREYFSSMMKKYKLVPKIEHYGCMVDMYC 388
           + M  + G+ P ++ F+G+L ACSH G+V++G EYF  M ++YK+ P+IEH+GCMVD+  
Sbjct: 316 KYMESTEGLLPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLA 375

Query: 389 RTGLVKEALDFVRNMPIEPNPVILRTLVSACRGHGEFKLGEKITKLLMRHEPLHESNYVL 448
           R G VK+A +++++MP++PN VI RTL+ AC  HG+  L E     +++ EP H  +YVL
Sbjct: 376 RAGQVKKAYEYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVL 435

Query: 449 LSNIYAKMLSWEKKTKIREVMEVKGMKKVPGSTMIEIDNEIYEFVAGDKSHKQFKEIYEM 508
           LSN+YA    W    KIR+ M   G+KKVPG +++E+ N ++EF+ GDKSH Q   IY  
Sbjct: 436 LSNMYASEQRWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAK 495

Query: 509 VDEMGREMKKSGYRPTTSEVLLDINEEDKEDTLNRHSEKLAIAFGLLSTPPGTPIRIVKN 568
           + EM   ++  GY P  S V +D+ EE+KE+ +  HSEK+AIAF L+STP  +PI +VKN
Sbjct: 496 LKEMTGRLRSEGYVPQISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKN 555

Query: 569 LRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQCSCGDFW 606
           LRVC+DCH A K +SK+Y+REI++RDR+RFHHFK+G CSC D+W
Sbjct: 556 LRVCADCHLAIKLVSKVYNREIVVRDRSRFHHFKNGSCSCQDYW 595

BLAST of Cla97C08G156650 vs. TAIR 10
Match: AT2G02980.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 496.9 bits (1278), Expect = 2.3e-140
Identity = 252/580 (43.45%), Positives = 381/580 (65.69%), Query Frame = 0

Query: 29  QNCLALLQACNALPKLAQIHTHILKLGLHNNPLV--LTKFASISSLIHAPDYAASFLFSA 88
           QN + L+  CN+L +L QI  + +K  + +   V  L  F + S    +  Y A  LF A
Sbjct: 30  QNPILLISKCNSLRELMQIQAYAIKSHIEDVSFVAKLINFCTESPTESSMSY-ARHLFEA 89

Query: 89  EADTRLYDAFLFNTLIRAYAQTAHSKDKALSLYGIMLHGRILPNKFTYPFVLKACAGLEV 148
            ++    D  +FN++ R Y++  +  +   SL+  +L   ILP+ +T+P +LKACA  + 
Sbjct: 90  MSEP---DIVIFNSMARGYSRFTNPLE-VFSLFVEILEDGILPDNYTFPSLLKACAVAKA 149

Query: 149 LNLGRSVHGSVVKFGFDRDVHVQNTLVHMYSCCAGGISSARKVFDEMPKADSVTWSAMIG 208
           L  GR +H   +K G D +V+V  TL++MY+ C   + SAR VFD + +   V ++AMI 
Sbjct: 150 LEEGRQLHCLSMKLGLDDNVYVCPTLINMYTECE-DVDSARCVFDRIVEPCVVCYNAMIT 209

Query: 209 GYARVGRSTEAVALFREMQMAEVCPDEITMVSILSACTDLGALELGKWIEAYIERQGIQK 268
           GYAR  R  EA++LFREMQ   + P+EIT++S+LS+C  LG+L+LGKWI  Y ++    K
Sbjct: 210 GYARRNRPNEALSLFREMQGKYLKPNEITLLSVLSSCALLGSLDLGKWIHKYAKKHSFCK 269

Query: 269 PAEVSNALIDMFAKCGDISKALKLFRTMNEKTIVSWTSVIVGMAMHGRGQEAICLFEEMI 328
             +V+ ALIDMFAKCG +  A+ +F  M  K   +W+++IV  A HG+ ++++ +FE M 
Sbjct: 270 YVKVNTALIDMFAKCGSLDDAVSIFEKMRYKDTQAWSAMIVAYANHGKAEKSMLMFERMR 329

Query: 329 GSGVAPDDVAFIGLLSACSHSGLVEKGREYFSSMMKKYKLVPKIEHYGCMVDMYCRTGLV 388
              V PD++ F+GLL+ACSH+G VE+GR+YFS M+ K+ +VP I+HYG MVD+  R G +
Sbjct: 330 SENVQPDEITFLGLLNACSHTGRVEEGRKYFSQMVSKFGIVPSIKHYGSMVDLLSRAGNL 389

Query: 389 KEALDFVRNMPIEPNPVILRTLVSACRGHGEFKLGEKITKLLMRHEPLHESNYVLLSNIY 448
           ++A +F+  +PI P P++ R L++AC  H    L EK+++ +   +  H  +YV+LSN+Y
Sbjct: 390 EDAYEFIDKLPISPTPMLWRILLAACSSHNNLDLAEKVSERIFELDDSHGGDYVILSNLY 449

Query: 449 AKMLSWEKKTKIREVMEVKGMKKVPGSTMIEIDNEIYEFVAGDKSHKQFKEIYEMVDEMG 508
           A+   WE    +R+VM+ +   KVPG + IE++N ++EF +GD       +++  +DEM 
Sbjct: 450 ARNKKWEYVDSLRKVMKDRKAVKVPGCSSIEVNNVVHEFFSGDGVKSATTKLHRALDEMV 509

Query: 509 REMKKSGYRPTTSEVL-LDINEEDKEDTLNRHSEKLAIAFGLLSTPPGTPIRIVKNLRVC 568
           +E+K SGY P TS V+  ++N+++KE TL  HSEKLAI FGLL+TPPGT IR+VKNLRVC
Sbjct: 510 KELKLSGYVPDTSMVVHANMNDQEKEITLRYHSEKLAITFGLLNTPPGTTIRVVKNLRVC 569

Query: 569 SDCHSASKFISKIYDREIIMRDRNRFHHFKSGQCSCGDFW 606
            DCH+A+K IS I+ R++++RD  RFHHF+ G+CSCGDFW
Sbjct: 570 RDCHNAAKLISLIFGRKVVLRDVQRFHHFEDGKCSCGDFW 603

BLAST of Cla97C08G156650 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 488.8 bits (1257), Expect = 6.3e-138
Identity = 256/604 (42.38%), Positives = 376/604 (62.25%), Query Frame = 0

Query: 34  LLQAC---NALPKLAQIHTHILKLGLHNNPLVLTKFAS---------------------- 93
           +L++C    A  +  QIH H+LKLG   +  V T   S                      
Sbjct: 140 VLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRD 199

Query: 94  -ISSLIHAPDYAA-SFLFSAEA---DTRLYDAFLFNTLIRAYAQTAHSKDKALSLYGIML 153
            +S       YA+  ++ +A+    +  + D   +N +I  YA+T + K+ AL L+  M+
Sbjct: 200 VVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKE-ALELFKDMM 259

Query: 154 HGRILPNKFTYPFVLKACAGLEVLNLGRSVHGSVVKFGFDRDVHVQNTLVHMYSCCAGGI 213
              + P++ T   V+ ACA    + LGR VH  +   GF  ++ + N L+ +YS C G +
Sbjct: 260 KTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKC-GEL 319

Query: 214 SSARKVFDEMPKADSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSILSAC 273
            +A  +F+ +P  D ++W+ +IGGY  +    EA+ LF+EM  +   P+++TM+SIL AC
Sbjct: 320 ETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPAC 379

Query: 274 TDLGALELGKWIEAYIER--QGIQKPAEVSNALIDMFAKCGDISKALKLFRTMNEKTIVS 333
             LGA+++G+WI  YI++  +G+   + +  +LIDM+AKCGDI  A ++F ++  K++ S
Sbjct: 380 AHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSS 439

Query: 334 WTSVIVGMAMHGRGQEAICLFEEMIGSGVAPDDVAFIGLLSACSHSGLVEKGREYFSSMM 393
           W ++I G AMHGR   +  LF  M   G+ PDD+ F+GLLSACSHSG+++ GR  F +M 
Sbjct: 440 WNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMT 499

Query: 394 KKYKLVPKIEHYGCMVDMYCRTGLVKEALDFVRNMPIEPNPVILRTLVSACRGHGEFKLG 453
           + YK+ PK+EHYGCM+D+   +GL KEA + +  M +EP+ VI  +L+ AC+ HG  +LG
Sbjct: 500 QDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELG 559

Query: 454 EKITKLLMRHEPLHESNYVLLSNIYAKMLSWEKKTKIREVMEVKGMKKVPGSTMIEIDNE 513
           E   + L++ EP +  +YVLLSNIYA    W +  K R ++  KGMKKVPG + IEID+ 
Sbjct: 560 ESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSV 619

Query: 514 IYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRPTTSEVLLDINEEDKEDTLNRHSEKL 573
           ++EF+ GDK H + +EIY M++EM   ++K+G+ P TSEVL ++ EE KE  L  HSEKL
Sbjct: 620 VHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKL 679

Query: 574 AIAFGLLSTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQCSC 606
           AIAFGL+ST PGT + IVKNLRVC +CH A+K ISKIY REII RDR RFHHF+ G CSC
Sbjct: 680 AIAFGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSC 739

BLAST of Cla97C08G156650 vs. TAIR 10
Match: AT5G48910.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 481.5 bits (1238), Expect = 1.0e-135
Identity = 254/639 (39.75%), Positives = 384/639 (60.09%), Query Frame = 0

Query: 18  SASTTNPRAAEQNCLALLQACNALPKLAQIHTHILKLGLHNNPLV---LTKFASISSLIH 77
           ++  ++P +   +    +  C  +  L+QIH   +K G   + L    + +F + S L H
Sbjct: 13  NSPASSPASHPSSLFPQINNCRTIRDLSQIHAVFIKSGQMRDTLAAAEILRFCATSDLHH 72

Query: 78  APDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTAHSKDKAL----SLYGIMLHGRILPN 137
                A  +F+        + F +NT+IR ++++   +DKAL      Y +M    + PN
Sbjct: 73  RDLDYAHKIFNQMPQR---NCFSWNTIIRGFSES--DEDKALIAITLFYEMMSDEFVEPN 132

Query: 138 KFTYPFVLKACAGLEVLNLGRSVHGSVVKFGFDRDVHVQNTLVHMYSCCA---------- 197
           +FT+P VLKACA    +  G+ +HG  +K+GF  D  V + LV MY  C           
Sbjct: 133 RFTFPSVLKACAKTGKIQEGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVLFY 192

Query: 198 ----------------------------------GGISSARKVFDEMPKADSVTWSAMIG 257
                                             G   +AR +FD+M +   V+W+ MI 
Sbjct: 193 KNIIEKDMVVMTDRRKRDGEIVLWNVMIDGYMRLGDCKAARMLFDKMRQRSVVSWNTMIS 252

Query: 258 GYARVGRSTEAVALFREMQMAEVCPDEITMVSILSACTDLGALELGKWIEAYIERQGIQK 317
           GY+  G   +AV +FREM+  ++ P+ +T+VS+L A + LG+LELG+W+  Y E  GI+ 
Sbjct: 253 GYSLNGFFKDAVEVFREMKKGDIRPNYVTLVSVLPAISRLGSLELGEWLHLYAEDSGIRI 312

Query: 318 PAEVSNALIDMFAKCGDISKALKLFRTMNEKTIVSWTSVIVGMAMHGRGQEAICLFEEMI 377
              + +ALIDM++KCG I KA+ +F  +  + +++W+++I G A+HG+  +AI  F +M 
Sbjct: 313 DDVLGSALIDMYSKCGIIEKAIHVFERLPRENVITWSAMINGFAIHGQAGDAIDCFCKMR 372

Query: 378 GSGVAPDDVAFIGLLSACSHSGLVEKGREYFSSMMKKYKLVPKIEHYGCMVDMYCRTGLV 437
            +GV P DVA+I LL+ACSH GLVE+GR YFS M+    L P+IEHYGCMVD+  R+GL+
Sbjct: 373 QAGVRPSDVAYINLLTACSHGGLVEEGRRYFSQMVSVDGLEPRIEHYGCMVDLLGRSGLL 432

Query: 438 KEALDFVRNMPIEPNPVILRTLVSACRGHGEFKLGEKITKLLMRHEPLHESNYVLLSNIY 497
            EA +F+ NMPI+P+ VI + L+ ACR  G  ++G+++  +LM   P     YV LSN+Y
Sbjct: 433 DEAEEFILNMPIKPDDVIWKALLGACRMQGNVEMGKRVANILMDMVPHDSGAYVALSNMY 492

Query: 498 AKMLSWEKKTKIREVMEVKGMKKVPGSTMIEIDNEIYEFVAGDKSHKQFKEIYEMVDEMG 557
           A   +W + +++R  M+ K ++K PG ++I+ID  ++EFV  D SH + KEI  M+ E+ 
Sbjct: 493 ASQGNWSEVSEMRLRMKEKDIRKDPGCSLIDIDGVLHEFVVEDDSHPKAKEINSMLVEIS 552

Query: 558 REMKKSGYRPTTSEVLLDINEEDKEDTLNRHSEKLAIAFGLLSTPPGTPIRIVKNLRVCS 606
            +++ +GYRP T++VLL++ EEDKE+ L+ HSEK+A AFGL+ST PG PIRIVKNLR+C 
Sbjct: 553 DKLRLAGYRPITTQVLLNLEEEDKENVLHYHSEKIATAFGLISTSPGKPIRIVKNLRICE 612

BLAST of Cla97C08G156650 vs. TAIR 10
Match: AT5G66520.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 476.5 bits (1225), Expect = 3.2e-134
Identity = 239/606 (39.44%), Positives = 373/606 (61.55%), Query Frame = 0

Query: 32  LALLQACNALPKLAQIHTHILKLGLHNNPLVLTKFASISSLIHAPDYAASFLFSAEADTR 91
           ++ LQ C+   +L QIH  +LK GL  +   +TKF S      + D+        +   R
Sbjct: 18  MSCLQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVFDGFDR 77

Query: 92  LYDAFLFNTLIRAYAQTAHSKDKALSLYGIMLHGRILPNKFTYPFVLKACAGLEVLNLGR 151
             D FL+N +IR ++  +   +++L LY  ML      N +T+P +LKAC+ L       
Sbjct: 78  -PDTFLWNLMIRGFS-CSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFEETT 137

Query: 152 SVHGSVVKFGFDRDVHVQNTLVHMYSCCAGGISSARKVFDEMPKADSVTWSAMIGGYARV 211
            +H  + K G++ DV+  N+L++ Y+   G    A  +FD +P+ D V+W+++I GY + 
Sbjct: 138 QIHAQITKLGYENDVYAVNSLINSYA-VTGNFKLAHLLFDRIPEPDDVSWNSVIKGYVKA 197

Query: 212 GR-------------------------------STEAVALFREMQMAEVCPDEITMVSIL 271
           G+                               + EA+ LF EMQ ++V PD +++ + L
Sbjct: 198 GKMDIALTLFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVSLANAL 257

Query: 272 SACTDLGALELGKWIEAYIERQGIQKPAEVSNALIDMFAKCGDISKALKLFRTMNEKTIV 331
           SAC  LGALE GKWI +Y+ +  I+  + +   LIDM+AKCG++ +AL++F+ + +K++ 
Sbjct: 258 SACAQLGALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKKKSVQ 317

Query: 332 SWTSVIVGMAMHGRGQEAICLFEEMIGSGVAPDDVAFIGLLSACSHSGLVEKGREYFSSM 391
           +WT++I G A HG G+EAI  F EM   G+ P+ + F  +L+ACS++GLVE+G+  F SM
Sbjct: 318 AWTALISGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIFYSM 377

Query: 392 MKKYKLVPKIEHYGCMVDMYCRTGLVKEALDFVRNMPIEPNPVILRTLVSACRGHGEFKL 451
            + Y L P IEHYGC+VD+  R GL+ EA  F++ MP++PN VI   L+ ACR H   +L
Sbjct: 378 ERDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKNIEL 437

Query: 452 GEKITKLLMRHEPLHESNYVLLSNIYAKMLSWEKKTKIREVMEVKGMKKVPGSTMIEIDN 511
           GE+I ++L+  +P H   YV  +NI+A    W+K  + R +M+ +G+ KVPG + I ++ 
Sbjct: 438 GEEIGEILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTISLEG 497

Query: 512 EIYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRPTTSEVLLD-INEEDKEDTLNRHSE 571
             +EF+AGD+SH + ++I      M R+++++GY P   E+LLD ++++++E  +++HSE
Sbjct: 498 TTHEFLAGDRSHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDDEREAIVHQHSE 557

Query: 572 KLAIAFGLLSTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC 606
           KLAI +GL+ T PGT IRI+KNLRVC DCH  +K ISKIY R+I+MRDR RFHHF+ G+C
Sbjct: 558 KLAITYGLIKTKPGTIIRIMKNLRVCKDCHKVTKLISKIYKRDIVMRDRTRFHHFRDGKC 617
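
The percentages in each HSP header follow directly from the reported counts: the number of identical (or positively scoring) aligned positions divided by the alignment length, gaps included. The following is a minimal Python sketch of that arithmetic, assuming the header layout shown in the alignments above. The helper name `parse_hsp_stats` is hypothetical and is not part of BLAST or of this database.

```python
import re

# Hypothetical helper, not part of any BLAST distribution.
# The pattern accepts any label starting with "Pos", so it also tolerates
# the "Postives" spelling variant found in some exported reports.
HSP_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Pos\w*\s*=\s*(\d+)/(\d+)"
)


def parse_hsp_stats(line: str) -> dict:
    """Recompute identity and positives percentages from an HSP header line."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return {
        "alignment_length": ident_len,
        "identity_pct": round(100 * ident / ident_len, 2),
        "positives_pct": round(100 * pos / pos_len, 2),
    }


# Header of the AT5G66520.1 hit, copied from the alignment above.
line = "Identity = 239/606 (39.44%), Positives = 373/606 (61.55%), Query Frame = 0"
stats = parse_hsp_stats(line)
print(stats)
```

Recomputing the values this way is a cheap consistency check when alignment headers are copied between reports.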

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038884201.1 | 0.0e+00 | 94.71 | pentatricopeptide repeat-containing protein At4g21065-like [Benincasa hispida]
KAA0064932.1 | 0.0e+00 | 94.22 | pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]
XP_008445200.1 | 0.0e+00 | 94.06 | PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Cucumis m...
XP_004138859.1 | 0.0e+00 | 93.89 | pentatricopeptide repeat-containing protein At4g21065 [Cucumis sativus] >KGN6294...
XP_022131416.1 | 0.0e+00 | 91.58 | pentatricopeptide repeat-containing protein At4g21065-like [Momordica charantia]...

Match Name | E-value | Identity (%) | Description
A8MQA3 | 1.1e-142 | 44.69 | Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX...
Q8LK93 | 3.2e-139 | 43.45 | Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidop...
Q9LN01 | 8.8e-137 | 42.38 | Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
Q9FI80 | 1.4e-134 | 39.75 | Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX...
Q9FJY7 | 4.5e-133 | 39.44 | Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX...

Match Name | E-value | Identity (%) | Description
A0A5A7V9A4 | 0.0e+00 | 94.22 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S3BC37 | 0.0e+00 | 94.06 | pentatricopeptide repeat-containing protein At4g21065-like OS=Cucumis melo OX=36...
A0A0A0LQ71 | 0.0e+00 | 93.89 | DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G3816...
A0A6J1BQ70 | 0.0e+00 | 91.58 | pentatricopeptide repeat-containing protein At4g21065-like OS=Momordica charanti...
A0A6J1KQ01 | 0.0e+00 | 91.09 | pentatricopeptide repeat-containing protein At4g21065-like OS=Cucurbita maxima O...

Match Name | E-value | Identity (%) | Description
AT4G21065.1 | 7.7e-144 | 44.69 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G02980.1 | 2.3e-140 | 43.45 | Pentatricopeptide repeat (PPR) superfamily protein
AT1G08070.1 | 6.3e-138 | 42.38 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G48910.1 | 1.0e-135 | 39.75 | Pentatricopeptide repeat (PPR) superfamily protein
AT5G66520.1 | 3.2e-134 | 39.44 | Tetratricopeptide repeat (TPR)-like superfamily protein

InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR032867 | DYW domain | PFAM | PF14432 | DYW_deaminase | coord: 471..595, e-value: 4.4E-41, score: 139.7
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 199..233, e-value: 2.8E-8, score: 31.4; coord: 338..368, e-value: 0.0023, score: 16.0; coord: 373..396, e-value: 0.0026, score: 15.8; coord: 300..333, e-value: 2.8E-6, score: 25.1; coord: 272..295, e-value: 9.9E-4, score: 17.1
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 197..243, e-value: 7.2E-10, score: 38.9; coord: 298..345, e-value: 1.1E-7, score: 32.0
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 272..297, e-value: 1.0E-4, score: 22.3; coord: 372..396, e-value: 0.002, score: 18.2
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 197..231, score: 12.254791
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 94..129, score: 9.218511
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 333..368, score: 8.681407
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 298..332, score: 10.731171
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 267..297, score: 8.560833
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 5..148, e-value: 1.7E-6, score: 29.7
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 251..482, e-value: 1.0E-42, score: 148.6
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 149..250, e-value: 1.4E-22, score: 81.9
None | No IPR available | PANTHER | PTHR47926 | PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN | coord: 29..591
None | No IPR available | PANTHER | PTHR47926:SF239 | SUBFAMILY NOT NAMED | coord: 29..591
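
As a quick sanity check on the coordinates above, the PROSITE PS51375 hits can be ordered along the protein and their lengths compared with the roughly 35-residue span that gives the pentatricopeptide repeat its name. The sketch below is in Python; the tuples are copied from the InterPro table, the coordinates are assumed to be 1-based and inclusive, and the helper name `span_length` is hypothetical.

```python
# PROSITE PS51375 (PPR) hits as (start, end, score), copied from the InterPro table.
ppr_hits = [
    (197, 231, 12.254791),
    (94, 129, 9.218511),
    (333, 368, 8.681407),
    (298, 332, 10.731171),
    (267, 297, 8.560833),
]


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


# Tuples sort by their first element, so this orders hits by start position.
ordered = sorted(ppr_hits)
for start, end, score in ordered:
    print(f"{start:>3}..{end:<3}  length={span_length(start, end)}  score={score}")

lengths = [span_length(start, end) for start, end, _ in ordered]
```

Under these assumptions the five hits span 31 to 36 residues each, and four of them (197..231 through 333..368) sit nearly back to back, as expected for tandem repeats.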

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cla97C08G156650.1 | Cla97C08G156650.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0005515 | protein binding
molecular_function | GO:0008270 | zinc ion binding