Cmc09g0238571 (gene) Melon (Charmono) v1.1

Overview
NameCmc09g0238571
Typegene
OrganismCucumis melo L. var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionPentatricopeptide repeat-containing protein
LocationCMiso1.1chr09: 1045019 .. 1048300 (-)
RNA-Seq ExpressionCmc09g0238571
SyntenyCmc09g0238571
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTGTGGATTTTTTCTAATTTATATCTGAATTTATTTCAACTCTTAATCTAATAAATACCAAATTTTTGAGAAACATATTATATCATTGTTCAAGGGGACGAAGAAAGAAAAACACAAAGTACCAAAAAACTATTTAAAGCAGCTTCTCTCTTCTTAGACGTCACCTGACCTTATTCTTTTCCTCCGAACGTACCAACCGTCCCACTAGACCGAACCGTTTCGGTTTGCTTATAATATTGGTTAAATTCAAGACGAACCGATCGATATGGGTTTGGTTCGTCTATTTGACCGATTCATAAATTCAAAACGTCTCCGGGTTTCGAAGGTAATCGGAAGAAACATGTCGTCGGTCTCAATACAGCCCCATTTGTATCAATTGGTTGTAGCAGCGCTTGAAAAATGCAGTAATCTCAATCACCTCAAGCAACTTCAAGGATTTCTTATTTCACACGGTCATTCACAAACGCAGTTCTTTGCCTTCAAGCTTGTTCGCTTTTGCAACCTTACTCTTACTGACTTATGTTATGCTCGCTACATTTTTGATAATCTAACTTCCCCAAATGTTTATCTTTATACTGCAATGATCACAGCTTATGCCGCATACCCCGATCCTAAAGCTGCGTTTCTTTTGTACCGTAACATGGTCCGCCATGGAGCCATTCGACCGAACCATTTTATCTACCCCCATGTCCTTAAATCTTGCCCTGATGTTTTGGGGTCCAATGCCACGAAAATGGTTCATACCCAGGTTCTGAAATCTGGATTTGGTCGATACCCAGTTGTTCAAACAGCCATTGTTGATTCCTATTCGAGGTTCTCTTCGGTTATTGGGAATGCAAGACAGATGTTCGACGAAATGGTCGAGAGAAGTGTAGTGTCTTGGACGGCCATGATTTCAGGGTATGCGAGGCTTGGGAACTTTGATAGTGCAATTGAGTTGTTTGAAAGTATGCCGGAGAGGGATGTCCCTGCTTGGAATGCTCTTATTGCTGGTTGTGCTCAAAACGGATTCTTCTGTGAAGCGATTTGGTTGTTCAAAAAAATGGTTTCTCTGGCGCTAGAGGGTAATAATAATGATCGTGAAAATAAGCCGAATAAGACCACACTTGCATCTGCACTATCAGCTTGTGGAAATACCGGGATGCTTCATCTTGGTAAGTGGATTCATGGTTATGTTTTCAAAACTTATCCTGGTCAGGATTCGTTTATCTCAAATGCTTTGCTAGATATGTATGGAAAATGTGGCAATTTAAAAGTTGCGAGGAGAGTTTTTGACATGATTACTTTAAAAAGCTTGACATCATGGAATTCCTTGATAAATTGTCTTGCACTCCATGGCCATAGTGGAAGTGCAATTGATTTGTTTGCAGAGTTGGTTCAATGTGGGGATGGTGTAAAGCCAGATGAGGTTACCTTTGTGGGTGTGTTGAATGCTTGTACTCACGGAGGATTAGTTGAAAAAGGTTACTCATACTTTGAAATGATGAGGCGGGATTACGATATCGAGCCTCAGATTGAACACTTTGGGTGCTTGATAGACCTTCTTGGCCGTGCAGGGCGGTTTGAGGAAGCAATGGAAGTTGTGCGAGGCATGAATATTGAACCAGATGAGGTTGTATGGGGTTCTTTACTAAATGCATGCAAAATCCATGGTCGTTCAGATTTAGCTGAATACTCGGTGAAAAAGTTGATCGAGATGGATCCAAAAAATGGCGGTTATAGAATTATGCTAGCTAATATATATGCCGAGCTTGGAAAGTGGGATGAGGTTCGGAAGGTTCGGAAGCTTTTGAAGGAGAAAAATGCTTATAAAACACCAGGTTGCAGTTGGATTGAGGTTGATAATCAGGTCTATCAGTTCTATTCTTTTGACATGACACATCCCAGTGTAGAACAGATATACAAAACCTTAGAAAGTATGATCAGCTTTATGTAAAAGGAAAATGAAGATTTGCATCAAATTGGAGCATTTTCGACATTCCATGCTATTGGTGATCGGCTGATCGCCATTTATCAGATGAGAAAGTCGAACCACAAGAGGATCATCAAGCACTACCGAGCCTCTCCTTCCAGTTGCACAGTCTTCTTTGAGGTCTATAAAAAAGACTTCATGAAAGAAGACTTATCTATTCTTAGCTTGAATTGGGGTATTGGTTTCTAAGTACAGACAATTGACAACATTAGCAAAACAGAAGGACAGCAATTGGAGGCAATTAAGTGATAGTTTAGAGAAATTGCAAGCAACTTTGATCGAATTTTGTTCCCTTTATATGCTTGTGTTGACCCTTAAACAACATGAAGGTAGTAGTATGGTTCAATTGCTGATGATGATAAAATGGCCAACCGGGGAAGCTTTTTGGAGTTCTATGAACAAATTAATGACCGTAAGTCTTAAGGTATGTGACTGTTTTAGTTATGTCTACCACCAAGTTGTTCATGGACAATGAACATCTACAAGTTCATAAGTTTGAAGAATTGACATCTCAACGGTGAAATTTTTCCATTTGGAAATTGCTTTGTATAGATGTTTTTTTGATAAGTTACAACGAATCTGCATCATTTGTCAGAGGCTTCCCCCCAATTTTTCTTTATGGGAAAAGGGAGAGATTTTGTATTCATTATCGATTTTTTTTTTTTTTCCTTTTGTATTGTCCTATCAAATATATCATTGTAATAACTTCATTATTTAAAGAAACTGTGAGAAGACTTTGAAAAGAAGGTTAGAAGTTAAAAGAAAAATTTTAAAAGCCAAAGATTTGTTCAAGTTGAAATTTTATAAAAGTAAATAAGAGATAAATAAAAAAATGAAGTTGGATTGGCATGTTTATTAATTTTTAGTTGTATTTCAGATGGAGGATTAATTTGTAATAATAGTTCCTTTTATTTTTATTTTATTTTATTTTATTCTCAATACATATGTTATATGAAAAAGAATAAAGTTGAAGGGAGGGAGGAACTCCATTTGTACTAAATTGGTCGGAATATTTAGTCAACCAAACAAATTTGTATATAAAAAATGACATTTTAGAAAACACTCATTTTTTTGAATCAATCTAAAAAGTCACTAGACTTATAATAAAATTCACAGTCTTGAGTGTTTTTTTTTTTATAGTAACCAACATTCAATCAATAATTTAAGATAGAATTCTTCTTAGCACATCTTAAATTATGAACAAATCCATGTTTATATGGATTTAATATCAATTTATACGAGAGGTGAATTGATATTTACTATTTGAGGATAATAATATGTTCTTTTAGCCTCAGAAACGTCGAGAAGGA

mRNA sequence

TTGTGGATTTTTTCTAATTTATATCTGAATTTATTTCAACTCTTAATCTAATAAATACCAAATTTTTGAGAAACATATTATATCATTGTTCAAGGGGACGAAGAAAGAAAAACACAAAGTACCAAAAAACTATTTAAAGCAGCTTCTCTCTTCTTAGACGTCACCTGACCTTATTCTTTTCCTCCGAACGTACCAACCGTCCCACTAGACCGAACCGTTTCGGTTTGCTTATAATATTGGTTAAATTCAAGACGAACCGATCGATATGGGTTTGGTTCGTCTATTTGACCGATTCATAAATTCAAAACGTCTCCGGGTTTCGAAGGTAATCGGAAGAAACATGTCGTCGGTCTCAATACAGCCCCATTTGTATCAATTGGTTGTAGCAGCGCTTGAAAAATGCAGTAATCTCAATCACCTCAAGCAACTTCAAGGATTTCTTATTTCACACGGTCATTCACAAACGCAGTTCTTTGCCTTCAAGCTTGTTCGCTTTTGCAACCTTACTCTTACTGACTTATGTTATGCTCGCTACATTTTTGATAATCTAACTTCCCCAAATGTTTATCTTTATACTGCAATGATCACAGCTTATGCCGCATACCCCGATCCTAAAGCTGCGTTTCTTTTGTACCGTAACATGGTCCGCCATGGAGCCATTCGACCGAACCATTTTATCTACCCCCATGTCCTTAAATCTTGCCCTGATGTTTTGGGGTCCAATGCCACGAAAATGGTTCATACCCAGGTTCTGAAATCTGGATTTGGTCGATACCCAGTTGTTCAAACAGCCATTGTTGATTCCTATTCGAGGTTCTCTTCGGTTATTGGGAATGCAAGACAGATGTTCGACGAAATGGTCGAGAGAAGTGTAGTGTCTTGGACGGCCATGATTTCAGGGTATGCGAGGCTTGGGAACTTTGATAGTGCAATTGAGTTGTTTGAAAGTATGCCGGAGAGGGATGTCCCTGCTTGGAATGCTCTTATTGCTGGTTGTGCTCAAAACGGATTCTTCTGTGAAGCGATTTGGTTGTTCAAAAAAATGGTTTCTCTGGCGCTAGAGGGTAATAATAATGATCGTGAAAATAAGCCGAATAAGACCACACTTGCATCTGCACTATCAGCTTGTGGAAATACCGGGATGCTTCATCTTGAGTTGGTTCAATGTGGGGATGGTGTAAAGCCAGATGAGGTTACCTTTGTGGGTGTGTTGAATGCTTGTACTCACGGAGGATTAGTTGAAAAAGGTTACTCATACTTTGAAATGATGAGGCGGGATTACGATATCGAGCCTCAGATTGAACACTTTGGGTGCTTGATAGACCTTCTTGGCCGTGCAGGGCGGTTTGAGGAAGCAATGGAAGTTGTGCGAGGCATGAATATTGAACCAGATGAGGTTGTATGGGGTTCTTTACTAAATGCATGCAAAATCCATGGTCGTTCAGATTTAGCTGAATACTCGGTGAAAAAGTTGATCGAGATGGATCCAAAAAATGGCGGTTATAGAATTATGCTAGCTAATATATATGCCGAGCTTGGAAAGTGGGATGAGGTTCGGAAGGTTCGGAAGCTTTTGAAGGAGAAAAATGCTTATAAAACACCAGGTTGCAGTTGGATTGAGGTTGATAATCAGGTCTATCAGTTCTATTCTTTTGACATGACACATCCCAGTGTAGAACAGATATACAAAACCTTAGAAAGTATGATCAGCTTTATGTAAAAGGAAAATGAAGATTTGCATCAAATTGGAGCATTTTCGACATTCCATGCTATTGGTGATCGGCTGATCGCCATTTATCAGATGAGAAAGTCGAACCACAAGAGGATCATCAAGCACTACCGAGCCTCTCCTTCCAGTTGCACAGTCTTCTTTGAGGTCTATAAAAAAGACTTCATGAAAGAAGACTTATCTATTCTTAGCTTGAATTGGGGTATTGGTTTCTAAGTACAGACAATTGACAACATTAGCAAAACAGAAGGACAGCAATTGGAGGCAATTAAGTGATAGTTTAGAGAAATTGCAAGCAACTTTGATCGAATTTTGTTCCCTTTATATGCTTGTGTTGACCCTTAAACAACATGAAGGTAGTAGTATGGTTCAATTGCTGATGATGATAAAATGGCCAACCGGGGAAGCTTTTTGGAGTTCTATGAACAAATTAATGACCGTAAGTCTTAAGGTATGTGACTGTTTTAGTTATGTCTACCACCAAGTTGTTCATGGACAATGAACATCTACAAGTTCATAAGTTTGAAGAATTGACATCTCAACGGTGAAATTTTTCCATTTGGAAATTGCTTTGTATAGATGTTTTTTTGATAAGTTACAACGAATCTGCATCATTTGTCAGAGGCTTCCCCCCAATTTTTCTTTATGGGAAAAGGGAGAGATTTTGTATTCATTATCGATTTTTTTTTTTTTTCCTTTTGTATTGTCCTATCAAATATATCATTGTAATAACTTCATTATTTAAAGAAACTGTGAGAAGACTTTGAAAAGAAGGTTAGAAGTTAAAAGAAAAATTTTAAAAGCCAAAGATTTGTTCAAGTTGAAATTTTATAAAAGTAAATAAGAGATAAATAAAAAAATGAAGTTGGATTGGCATGTTTATTAATTTTTAGTTGTATTTCAGATGGAGGATTAATTTGTAATAATAGTTCCTTTTATTTTTATTTTATTTTATTTTATTCTCAATACATATGTTATATGAAAAAGAATAAAGTTGAAGGGAGGGAGGAACTCCATTTGTACTAAATTGGTCGGAATATTTAGTCAACCAAACAAATTTGTATATAAAAAATGACATTTTAGAAAACACTCATTTTTTTGAATCAATCTAAAAAGTCACTAGACTTATAATAAAATTCACAGTCTTGAGTGTTTTTTTTTTTATAGTAACCAACATTCAATCAATAATTTAAGATAGAATTCTTCTTAGCACATCTTAAATTATGAACAAATCCATGTTTATATGGATTTAATATCAATTTATACGAGAGGTGAATTGATATTTACTATTTGAGGATAATAATATGTTCTTTTAGCCTCAGAAACGTCGAGAAGGA

Coding sequence (CDS)

ATGGGTTTGGTTCGTCTATTTGACCGATTCATAAATTCAAAACGTCTCCGGGTTTCGAAGGTAATCGGAAGAAACATGTCGTCGGTCTCAATACAGCCCCATTTGTATCAATTGGTTGTAGCAGCGCTTGAAAAATGCAGTAATCTCAATCACCTCAAGCAACTTCAAGGATTTCTTATTTCACACGGTCATTCACAAACGCAGTTCTTTGCCTTCAAGCTTGTTCGCTTTTGCAACCTTACTCTTACTGACTTATGTTATGCTCGCTACATTTTTGATAATCTAACTTCCCCAAATGTTTATCTTTATACTGCAATGATCACAGCTTATGCCGCATACCCCGATCCTAAAGCTGCGTTTCTTTTGTACCGTAACATGGTCCGCCATGGAGCCATTCGACCGAACCATTTTATCTACCCCCATGTCCTTAAATCTTGCCCTGATGTTTTGGGGTCCAATGCCACGAAAATGGTTCATACCCAGGTTCTGAAATCTGGATTTGGTCGATACCCAGTTGTTCAAACAGCCATTGTTGATTCCTATTCGAGGTTCTCTTCGGTTATTGGGAATGCAAGACAGATGTTCGACGAAATGGTCGAGAGAAGTGTAGTGTCTTGGACGGCCATGATTTCAGGGTATGCGAGGCTTGGGAACTTTGATAGTGCAATTGAGTTGTTTGAAAGTATGCCGGAGAGGGATGTCCCTGCTTGGAATGCTCTTATTGCTGGTTGTGCTCAAAACGGATTCTTCTGTGAAGCGATTTGGTTGTTCAAAAAAATGGTTTCTCTGGCGCTAGAGGGTAATAATAATGATCGTGAAAATAAGCCGAATAAGACCACACTTGCATCTGCACTATCAGCTTGTGGAAATACCGGGATGCTTCATCTTGAGTTGGTTCAATGTGGGGATGGTGTAAAGCCAGATGAGGTTACCTTTGTGGGTGTGTTGAATGCTTGTACTCACGGAGGATTAGTTGAAAAAGGTTACTCATACTTTGAAATGATGAGGCGGGATTACGATATCGAGCCTCAGATTGAACACTTTGGGTGCTTGATAGACCTTCTTGGCCGTGCAGGGCGGTTTGAGGAAGCAATGGAAGTTGTGCGAGGCATGAATATTGAACCAGATGAGGTTGTATGGGGTTCTTTACTAAATGCATGCAAAATCCATGGTCGTTCAGATTTAGCTGAATACTCGGTGAAAAAGTTGATCGAGATGGATCCAAAAAATGGCGGTTATAGAATTATGCTAGCTAATATATATGCCGAGCTTGGAAAGTGGGATGAGGTTCGGAAGGTTCGGAAGCTTTTGAAGGAGAAAAATGCTTATAAAACACCAGGTTGCAGTTGGATTGAGGTTGATAATCAGGTCTATCAGTTCTATTCTTTTGACATGACACATCCCAGTGTAGAACAGATATACAAAACCTTAGAAAGTATGATCAGCTTTATGTAA

Protein sequence

MGLVRLFDRFINSKRLRVSKVIGRNMSSVSIQPHLYQLVVAALEKCSNLNHLKQLQGFLISHGHSQTQFFAFKLVRFCNLTLTDLCYARYIFDNLTSPNVYLYTAMITAYAAYPDPKAAFLLYRNMVRHGAIRPNHFIYPHVLKSCPDVLGSNATKMVHTQVLKSGFGRYPVVQTAIVDSYSRFSSVIGNARQMFDEMVERSVVSWTAMISGYARLGNFDSAIELFESMPERDVPAWNALIAGCAQNGFFCEAIWLFKKMVSLALEGNNNDRENKPNKTTLASALSACGNTGMLHLELVQCGDGVKPDEVTFVGVLNACTHGGLVEKGYSYFEMMRRDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNACKIHGRSDLAEYSVKKLIEMDPKNGGYRIMLANIYAELGKWDEVRKVRKLLKEKNAYKTPGCSWIEVDNQVYQFYSFDMTHPSVEQIYKTLESMISFM
Homology
BLAST of Cmc09g0238571 vs. NCBI nr
Match: XP_008459497.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g33350 [Cucumis melo])

HSP 1 Score: 960.7 bits (2482), Expect = 4.8e-276
Identity = 484/556 (87.05%), Postives = 484/556 (87.05%), Query Frame = 0

Query: 1   MGLVRLFDRFINSKRLRVSKVIGRNMSSVSIQPHLYQLVVAALEKCSNLNHLKQLQGFLI 60
           MGLVRLFDRFINSKRLRVSKVIGRNMSSVSIQPHLYQLVVAALEKCSNLNHLKQLQGFLI
Sbjct: 1   MGLVRLFDRFINSKRLRVSKVIGRNMSSVSIQPHLYQLVVAALEKCSNLNHLKQLQGFLI 60

Query: 61  SHGHSQTQFFAFKLVRFCNLTLTDLCYARYIFDNLTSPNVYLYTAMITAYAAYPDPKAAF 120
           SHGHSQTQFFAFKLVRFCNLTLTDLCYARYIFDNLTSPNVYLYTAMITAYAAYPDPKAAF
Sbjct: 61  SHGHSQTQFFAFKLVRFCNLTLTDLCYARYIFDNLTSPNVYLYTAMITAYAAYPDPKAAF 120

Query: 121 LLYRNMVRHGAIRPNHFIYPHVLKSCPDVLGSNATKMVHTQVLKSGFGRYPVVQTAIVDS 180
           LLYRNMVRHGAIRPNHFIYPHVLKSCPDVLGSNATKMVHTQVLKSGFGRYPVVQTAIVDS
Sbjct: 121 LLYRNMVRHGAIRPNHFIYPHVLKSCPDVLGSNATKMVHTQVLKSGFGRYPVVQTAIVDS 180

Query: 181 YSRFSSVIGNARQMFDEMVERSVVSWTAMISGYARLGNFDSAIELFESMPERDVPAWNAL 240
           YSRFSSVIGNARQMFDEMVERSVVSWTAMISGYARLGNFDSAIELFESMPERDVPAWNAL
Sbjct: 181 YSRFSSVIGNARQMFDEMVERSVVSWTAMISGYARLGNFDSAIELFESMPERDVPAWNAL 240

Query: 241 IAGCAQNGFFCEAIWLFKKMVSLALEGNNNDRENKPNKTTLASALSACGNTGMLHL---- 300
           IAGCAQNGFFCEAIWLFKKMVSLALEGNNNDRENKPNKTTLASALSACGNTGMLHL    
Sbjct: 241 IAGCAQNGFFCEAIWLFKKMVSLALEGNNNDRENKPNKTTLASALSACGNTGMLHLGKWI 300

Query: 301 ------------------------------------------------------------ 360
                                                                       
Sbjct: 301 HGYVFKTYPGQDSFISNALLDMYGKCGNLKVARRVFDMITLKSLTSWNSLINCLALHGHS 360

Query: 361 --------ELVQCGDGVKPDEVTFVGVLNACTHGGLVEKGYSYFEMMRRDYDIEPQIEHF 420
                   ELVQCGDGVKPDEVTFVGVLNACTHGGLVEKGYSYFEMMRRDYDIEPQIEHF
Sbjct: 361 GSAIDLFAELVQCGDGVKPDEVTFVGVLNACTHGGLVEKGYSYFEMMRRDYDIEPQIEHF 420

Query: 421 GCLIDLLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNACKIHGRSDLAEYSVKKLIEMDP 480
           GCLIDLLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNACKIHGRSDLAEYSVKKLIEMDP
Sbjct: 421 GCLIDLLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNACKIHGRSDLAEYSVKKLIEMDP 480

Query: 481 KNGGYRIMLANIYAELGKWDEVRKVRKLLKEKNAYKTPGCSWIEVDNQVYQFYSFDMTHP 485
           KNGGYRIMLANIYAELGKWDEVRKVRKLLKEKNAYKTPGCSWIEVDNQVYQFYSFDMTHP
Sbjct: 481 KNGGYRIMLANIYAELGKWDEVRKVRKLLKEKNAYKTPGCSWIEVDNQVYQFYSFDMTHP 540

BLAST of Cmc09g0238571 vs. NCBI nr
Match: XP_004141609.3 (pentatricopeptide repeat-containing protein At1g33350 [Cucumis sativus])

HSP 1 Score: 897.9 bits (2319), Expect = 3.9e-257
Identity = 454/563 (80.64%), Postives = 469/563 (83.30%), Query Frame = 0

Query: 1   MGLVRLFDRFINSKRLRV-------SKVIGRNMSSVSIQPHLYQLVVAALEKCSNLNHLK 60
           MG V LFDRFI SKRLRV       SK IGR+MSSVSI PHL QL VAALEKCSNLNHLK
Sbjct: 1   MGSVLLFDRFITSKRLRVWLSLRIISKAIGRSMSSVSIHPHLNQLFVAALEKCSNLNHLK 60

Query: 61  QLQGFLISHGHSQTQFFAFKLVRFCNLTLTDLCYARYIFDNLTSPNVYLYTAMITAYAAY 120
           QLQGFLISHGHSQTQFFAFKLVRFCNLTL DLCYARYIFDNLTSPNV+LYTAMITAYA+Y
Sbjct: 61  QLQGFLISHGHSQTQFFAFKLVRFCNLTLADLCYARYIFDNLTSPNVFLYTAMITAYASY 120

Query: 121 PDPKAAFLLYRNMVRHGAIRPNHFIYPHVLKSCPDVLGSNATKMVHTQVLKSGFGRYPVV 180
           PDPKAAFLLYRNMVR GAIRPN+FIYPHVL+SCPDVLGSNATKMVHTQVLKSGFG YPVV
Sbjct: 121 PDPKAAFLLYRNMVRRGAIRPNNFIYPHVLRSCPDVLGSNATKMVHTQVLKSGFGGYPVV 180

Query: 181 QTAIVDSYSRFSSVIGNARQMFDEMVERSVVSWTAMISGYARLGNFDSAIELFESMPERD 240
           QTAIVDSYSRFSS IG+ARQMFDEM+ER+VVSWTAMISGYARLGNFDSAIELFESMPERD
Sbjct: 181 QTAIVDSYSRFSSDIGSARQMFDEMLERTVVSWTAMISGYARLGNFDSAIELFESMPERD 240

Query: 241 VPAWNALIAGCAQNGFFCEAIWLFKKMVSLALEGNNNDRENKPNKTTLASALSACGNTGM 300
           VPAWNALIAGCAQNGFFCEAIWLFK+MV LALEGNNNDRENKPNKTTL SALSACG+TGM
Sbjct: 241 VPAWNALIAGCAQNGFFCEAIWLFKRMVLLALEGNNNDRENKPNKTTLGSALSACGHTGM 300

Query: 301 LHL--------------------------------------------------------- 360
           LHL                                                         
Sbjct: 301 LHLGKWIHGYVFKTYPGQDSFISNALLDMYGKCGNLKVARRVFDMITLKNLTSWNSLINC 360

Query: 361 ---------------ELVQCGDGVKPDEVTFVGVLNACTHGGLVEKGYSYFEMMRRDYDI 420
                          EL+ CGDGVKP+EVTFVGVLNACTHGGLVEKGYSYFEMMRRDYDI
Sbjct: 361 LALHGHSGSAIDLFAELIHCGDGVKPNEVTFVGVLNACTHGGLVEKGYSYFEMMRRDYDI 420

Query: 421 EPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNACKIHGRSDLAEYSVK 480
           EPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNACKIHGRSDLAEYSVK
Sbjct: 421 EPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNACKIHGRSDLAEYSVK 480

Query: 481 KLIEMDPKNGGYRIMLANIYAELGKWDEVRKVRKLLKEKNAYKTPGCSWIEVDNQVYQFY 485
           KLIEMDPKNGGYRIMLANIYAE GKWDEVRKVR+LLKEKNAYKTPGCSWIEVDNQVYQFY
Sbjct: 481 KLIEMDPKNGGYRIMLANIYAEFGKWDEVRKVRRLLKEKNAYKTPGCSWIEVDNQVYQFY 540

BLAST of Cmc09g0238571 vs. NCBI nr
Match: KAA0039360.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK00543.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 859.4 bits (2219), Expect = 1.5e-245
Identity = 434/518 (83.78%), Postives = 437/518 (84.36%), Query Frame = 0

Query: 26  MSSVSIQPHLYQLVVAALEKCSNLNHLKQLQGFLISHGHSQTQFFAFKLVRFCNLTLTDL 85
           MSSVSIQPHLYQLVVAALEKCSNLNHLKQLQGFLISHGHSQTQFFAFKLVRFCNLTLTDL
Sbjct: 1   MSSVSIQPHLYQLVVAALEKCSNLNHLKQLQGFLISHGHSQTQFFAFKLVRFCNLTLTDL 60

Query: 86  CYARYIFDNLTSPNVYLYTAMITAYAAYPDPKAAFLLYRNMVRHGAIRPNHFIYPHVLKS 145
           CYARYIFDNLTSPNVYLYTAMITAYAAYPDPKAAFLLYRNMVRHGAIRPNHFIYPHVLKS
Sbjct: 61  CYARYIFDNLTSPNVYLYTAMITAYAAYPDPKAAFLLYRNMVRHGAIRPNHFIYPHVLKS 120

Query: 146 CPDVLGSNATKMVHTQVLKSGFGRYPVVQTAIVDSYSRFSSVIGNARQMFDEMVERSVVS 205
           CPDVLGSNATKMVHTQVLKSGFGRYPVVQTAIVDSYSRFSSVIGNARQMFDEMVERSVVS
Sbjct: 121 CPDVLGSNATKMVHTQVLKSGFGRYPVVQTAIVDSYSRFSSVIGNARQMFDEMVERSVVS 180

Query: 206 WTAMISGYARLGNFDSAIELFESMPERDVPAWNALIAGCAQNGFFCEAIWLFKKMVSLAL 265
           WTAMISGYARLGNFDSAIELFESMPERDVPAWNALIAGCAQNGFFCEAIWLFKKMVSLAL
Sbjct: 181 WTAMISGYARLGNFDSAIELFESMPERDVPAWNALIAGCAQNGFFCEAIWLFKKMVSLAL 240

Query: 266 EGNNNDRENKPNKTTLASALSACGNTGMLHL----------------------------- 325
           EGNNNDRENKPNKTTLASALSACGNTGMLHL                             
Sbjct: 241 EGNNNDRENKPNKTTLASALSACGNTGMLHLGKWIHGYVFKTYPGQDSFISNALLDMYGK 300

Query: 326 -------------------------------------------ELVQCGDGVKPDEVTFV 385
                                                      ELVQCGDGVKPDEVTFV
Sbjct: 301 CGNLKVARRVFDMITLKSLTSWNSLINCLALHGHSGSAIDLFAELVQCGDGVKPDEVTFV 360

Query: 386 GVLNACTHGGLVEKGYSYFEMMRRDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNI 445
           GVLNACTHGGLVEKGYSYFEMMRRDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNI
Sbjct: 361 GVLNACTHGGLVEKGYSYFEMMRRDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNI 420

Query: 446 EPDEVVWGSLLNACKIHGRSDLAEYSVKKLIEMDPKNGGYRIMLANIYAELGKWDEVRKV 468
           EPDEVVWGSLLNACKIHGRSDLAEYSVKKLIEMDPKNGGYRIMLANIYAELGKWDEVRKV
Sbjct: 421 EPDEVVWGSLLNACKIHGRSDLAEYSVKKLIEMDPKNGGYRIMLANIYAELGKWDEVRKV 480

BLAST of Cmc09g0238571 vs. NCBI nr
Match: XP_038889416.1 (pentatricopeptide repeat-containing protein At1g33350-like [Benincasa hispida])

HSP 1 Score: 847.8 bits (2189), Expect = 4.6e-242
Identity = 425/531 (80.04%), Postives = 441/531 (83.05%), Query Frame = 0

Query: 26  MSSVSIQPHLYQLVVAALEKCSNLNHLKQLQGFLISHGHSQTQFFAFKLVRFCNLTLTDL 85
           MSSV IQPHL QLV+AALEKCS LNHLKQLQGFLIS GHS+TQFFAFKLVRFCNLTL DL
Sbjct: 1   MSSVPIQPHLNQLVLAALEKCSQLNHLKQLQGFLISLGHSRTQFFAFKLVRFCNLTLADL 60

Query: 86  CYARYIFDNLTSPNVYLYTAMITAYAAYPDPKAAFLLYRNMVRHGAIRPNHFIYPHVLKS 145
           CYAR+IFD+LTSPNVYLYTAMITAYA+ PDPKAAFLLYRNMVRHGA RPNHFIYPHVLKS
Sbjct: 61  CYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRNMVRHGAPRPNHFIYPHVLKS 120

Query: 146 CPDVLGSNATKMVHTQVLKSGFGRYPVVQTAIVDSYSRFSSVIGNARQMFDEMVERSVVS 205
           CP+VL SN TKMVHTQVLKSGFG+YPVVQTAIVDSYSRF S +GNARQMFDEM+ERSVVS
Sbjct: 121 CPEVLESNGTKMVHTQVLKSGFGQYPVVQTAIVDSYSRFCSDLGNARQMFDEMLERSVVS 180

Query: 206 WTAMISGYARLGNFDSAIELFESMPERDVPAWNALIAGCAQNGFFCEAIWLFKKMVSLAL 265
           WTAMISGYARLGN DSAIELFESMPERDVPAWNALIAGCAQNGFFCEAIWLFK+MVSLAL
Sbjct: 181 WTAMISGYARLGNVDSAIELFESMPERDVPAWNALIAGCAQNGFFCEAIWLFKRMVSLAL 240

Query: 266 EGNNNDRENKPNKTTLASALSACGNTGMLHL----------------------------- 325
           EGNNN+RENKPNK T+ASALSACG+TGMLHL                             
Sbjct: 241 EGNNNERENKPNKITVASALSACGHTGMLHLGKWIHGYVFKNYFGQDSFISNALLDMYGK 300

Query: 326 -------------------------------------------ELVQCGDGVKPDEVTFV 385
                                                      ELVQCGDGVKPD VTFV
Sbjct: 301 CGNLKVARRVFDMISLKSLTSWNSLINCLALHGHSGSAIDLFVELVQCGDGVKPDGVTFV 360

Query: 386 GVLNACTHGGLVEKGYSYFEMMRRDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNI 445
           GVLNACTHGGLVEKGYSYFEMMRRDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNI
Sbjct: 361 GVLNACTHGGLVEKGYSYFEMMRRDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNI 420

Query: 446 EPDEVVWGSLLNACKIHGRSDLAEYSVKKLIEMDPKNGGYRIMLANIYAELGKWDEVRKV 485
           EPDEVVWGSLLN CKIHGR DLAEYSVKKLIEMDP+NGGYRIMLANIYAEL KWDEVRKV
Sbjct: 421 EPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKV 480

BLAST of Cmc09g0238571 vs. NCBI nr
Match: KAE8648973.1 (hypothetical protein Csa_008304 [Cucumis sativus])

HSP 1 Score: 845.9 bits (2184), Expect = 1.7e-241
Identity = 428/535 (80.00%), Postives = 441/535 (82.43%), Query Frame = 0

Query: 1   MGLVRLFDRFINSKRLRV-------SKVIGRNMSSVSIQPHLYQLVVAALEKCSNLNHLK 60
           MG V LFDRFI SKRLRV       SK IGR+MSSVSI PHL QL VAALEKCSNLNHLK
Sbjct: 1   MGSVLLFDRFITSKRLRVWLSLRIISKAIGRSMSSVSIHPHLNQLFVAALEKCSNLNHLK 60

Query: 61  QLQGFLISHGHSQTQFFAFKLVRFCNLTLTDLCYARYIFDNLTSPNVYLYTAMITAYAAY 120
           QLQGFLISHGHSQTQFFAFKLVRFCNLTL DLCYARYIFDNLTSPNV+LYTAMITAYA+Y
Sbjct: 61  QLQGFLISHGHSQTQFFAFKLVRFCNLTLADLCYARYIFDNLTSPNVFLYTAMITAYASY 120

Query: 121 PDPKAAFLLYRNMVRHGAIRPNHFIYPHVLKSCPDVLGSNATKMVHTQVLKSGFGRYPVV 180
           PDPKAAFLLYRNMVR GAIRPN+FIYPHVL+SCPDVLGSNATKMVHTQVLKSGFG YPVV
Sbjct: 121 PDPKAAFLLYRNMVRRGAIRPNNFIYPHVLRSCPDVLGSNATKMVHTQVLKSGFGGYPVV 180

Query: 181 QTAIVDSYSRFSSVIGNARQMFDEMVERSVVSWTAMISGYARLGNFDSAIELFESMPERD 240
           QTAIVDSYSRFSS IG+ARQMFDEM+ER+VVSWTAMISGYARLGNFDSAIELFESMPERD
Sbjct: 181 QTAIVDSYSRFSSDIGSARQMFDEMLERTVVSWTAMISGYARLGNFDSAIELFESMPERD 240

Query: 241 VPAWNALIAGCAQNGFFCEAIWLFKKMVSLALEGNNNDRENKPNKTTLASALSACGNTGM 300
           VPAWNALIAGCAQNGFFCEAIWLFK+MV LALEGNNNDRENKPNKTTL SALSACG+TGM
Sbjct: 241 VPAWNALIAGCAQNGFFCEAIWLFKRMVLLALEGNNNDRENKPNKTTLGSALSACGHTGM 300

Query: 301 LHL--------------------------------------------------------- 360
           LHL                                                         
Sbjct: 301 LHLGKWIHGYVFKTYPGQDSFISNALLDMYGKCGNLKVARRVFDMITLKNLTSWNSLINC 360

Query: 361 ---------------ELVQCGDGVKPDEVTFVGVLNACTHGGLVEKGYSYFEMMRRDYDI 420
                          EL+ CGDGVKP+EVTFVGVLNACTHGGLVEKGYSYFEMMRRDYDI
Sbjct: 361 LALHGHSGSAIDLFAELIHCGDGVKPNEVTFVGVLNACTHGGLVEKGYSYFEMMRRDYDI 420

Query: 421 EPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNACKIHGRSDLAEYSVK 457
           EPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNACKIHGRSDLAEYSVK
Sbjct: 421 EPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNACKIHGRSDLAEYSVK 480

BLAST of Cmc09g0238571 vs. ExPASy Swiss-Prot
Match: Q9C501 (Pentatricopeptide repeat-containing protein At1g33350 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E57 PE=2 SV=1)

HSP 1 Score: 485.0 bits (1247), Expect = 1.0e-135
Identity = 250/532 (46.99%), Postives = 329/532 (61.84%), Query Frame = 0

Query: 27  SSVSIQPHLYQLVVAALEKCSNLNHLKQLQGFLISHGHSQTQFFAFKLVRFCNLTLTDLC 86
           SS   +  L Q + A + K  +LNHLKQ+Q F+I  G S + F  FKL+RFC L L +L 
Sbjct: 15  SSHMAEQLLNQFISAVISKSRHLNHLKQVQSFMIVSGLSHSHFLCFKLLRFCTLRLCNLS 74

Query: 87  YARYIFDNLTSPNVYLYTAMITAYAAY--PDPKAAFLLYRNMVRHGAIRPNHFIYPHVLK 146
           YAR+IFD  + PN +LY A++TAY++       +AF  +R MV     RPNHFIYP VLK
Sbjct: 75  YARFIFDRFSFPNTHLYAAVLTAYSSSLPLHASSAFSFFRLMVNRSVPRPNHFIYPLVLK 134

Query: 147 SCPDVLGSNATKMVHTQVLKSGFGRYPVVQTAIVDSYSRFSSVIGNARQMFDEMVERSVV 206
           S P +  + +T +VHT + KSGF  Y VVQTA++ SY+   S I  ARQ+FDEM ER+VV
Sbjct: 135 STPYLSSAFSTPLVHTHLFKSGFHLYVVVQTALLHSYASSVSHITLARQLFDEMSERNVV 194

Query: 207 SWTAMISGYARLGNFDSAIELFESMPERDVPAWNALIAGCAQNGFFCEAIWLFKKMVSLA 266
           SWTAM+SGYAR G+  +A+ LFE MPERDVP+WNA++A C QNG F EA+ LF++M+   
Sbjct: 195 SWTAMLSGYARSGDISNAVALFEDMPERDVPSWNAILAACTQNGLFLEAVSLFRRMI--- 254

Query: 267 LEGNNNDRENKPNKTTLASALSACGNTGMLHL---------------------ELV---- 326
                N+   +PN+ T+   LSAC  TG L L                      LV    
Sbjct: 255 -----NEPSIRPNEVTVVCVLSACAQTGTLQLAKGIHAFAYRRDLSSDVFVSNSLVDLYG 314

Query: 327 QCG------------------------------------------------DGVKPDEVT 386
           +CG                                                + +KPD +T
Sbjct: 315 KCGNLEEASSVFKMASKKSLTAWNSMINCFALHGRSEEAIAVFEEMMKLNINDIKPDHIT 374

Query: 387 FVGVLNACTHGGLVEKGYSYFEMMRRDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGM 446
           F+G+LNACTHGGLV KG  YF++M   + IEP+IEH+GCLIDLLGRAGRF+EA+EV+  M
Sbjct: 375 FIGLLNACTHGGLVSKGRGYFDLMTNRFGIEPRIEHYGCLIDLLGRAGRFDEALEVMSTM 434

Query: 447 NIEPDEVVWGSLLNACKIHGRSDLAEYSVKKLIEMDPKNGGYRIMLANIYAELGKWDEVR 484
            ++ DE +WGSLLNACKIHG  DLAE +VK L+ ++P NGGY  M+AN+Y E+G W+E R
Sbjct: 435 KMKADEAIWGSLLNACKIHGHLDLAEVAVKNLVALNPNNGGYVAMMANLYGEMGNWEEAR 494

BLAST of Cmc09g0238571 vs. ExPASy Swiss-Prot
Match: Q9SIL5 (Pentatricopeptide repeat-containing protein At2g20540 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E78 PE=2 SV=1)

HSP 1 Score: 307.0 bits (785), Expect = 3.8e-82
Identity = 170/510 (33.33%), Postives = 278/510 (54.51%), Query Frame = 0

Query: 43  LEKCSNLNHLKQLQGFLISHGHSQTQFFAFKLVRFCNLTLTDLCYARYIFDNLTSPNVYL 102
           L++  + N  K++   +I HG SQ+ F   K+V FC+  + D+ YA  +F+ +++PNV+L
Sbjct: 17  LQRVKSRNEWKKINASIIIHGLSQSSFMVTKMVDFCD-KIEDMDYATRLFNQVSNPNVFL 76

Query: 103 YTAMITAYAAYPDPKAAFLLYRNMVRHGAIRPNHFIYPHVLKSCPDVLGSNATKMVHTQV 162
           Y ++I AY           +Y+ ++R     P+ F +P + KSC  +      K VH  +
Sbjct: 77  YNSIIRAYTHNSLYCDVIRIYKQLLRKSFELPDRFTFPFMFKSCASLGSCYLGKQVHGHL 136

Query: 163 LKSGFGRYPVVQTAIVDSYSRFSSVIGNARQMFDEMVERSVVSWTAMISGYARLGNFDSA 222
            K G   + V + A++D Y +F  ++ +A ++FDEM ER V+SW +++SGYARLG    A
Sbjct: 137 CKFGPRFHVVTENALIDMYMKFDDLV-DAHKVFDEMYERDVISWNSLLSGYARLGQMKKA 196

Query: 223 IELFESMPERDVPAWNALIAGCAQNGFFCEAIWLFKKMVSLALEGNNNDRENKPNKTTLA 282
             LF  M ++ + +W A+I+G    G + EA+  F++M    +E         P++ +L 
Sbjct: 197 KGLFHLMLDKTIVSWTAMISGYTGIGCYVEAMDFFREMQLAGIE---------PDEISLI 256

Query: 283 SALSACGNTGMLHL---------------------ELVQ----CG------------DG- 342
           S L +C   G L L                      L++    CG            +G 
Sbjct: 257 SVLPSCAQLGSLELGKWIHLYAERRGFLKQTGVCNALIEMYSKCGVISQAIQLFGQMEGK 316

Query: 343 --------------------------------VKPDEVTFVGVLNACTHGGLVEKGYSYF 402
                                           VKP+ +TF+G+L+AC+H G+ ++G  YF
Sbjct: 317 DVISWSTMISGYAYHGNAHGAIETFNEMQRAKVKPNGITFLGLLSACSHVGMWQEGLRYF 376

Query: 403 EMMRRDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNACKIHGR 462
           +MMR+DY IEP+IEH+GCLID+L RAG+ E A+E+ + M ++PD  +WGSLL++C+  G 
Sbjct: 377 DMMRQDYQIEPKIEHYGCLIDVLARAGKLERAVEITKTMPMKPDSKIWGSLLSSCRTPGN 436

Query: 463 SDLAEYSVKKLIEMDPKNGGYRIMLANIYAELGKWDEVRKVRKLLKEKNAYKTPGCSWIE 483
            D+A  ++  L+E++P++ G  ++LANIYA+LGKW++V ++RK+++ +N  KTPG S IE
Sbjct: 437 LDVALVAMDHLVELEPEDMGNYVLLANIYADLGKWEDVSRLRKMIRNENMKKTPGGSLIE 496

BLAST of Cmc09g0238571 vs. ExPASy Swiss-Prot
Match: Q9FIF7 (Putative pentatricopeptide repeat-containing protein At5g59200, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-E41 PE=3 SV=1)

HSP 1 Score: 303.1 bits (775), Expect = 5.5e-81
Identity = 174/528 (32.95%), Postives = 272/528 (51.52%), Query Frame = 0

Query: 24  RNMSSVSIQPHLYQLVVAALEKCSNLNHLKQLQGFLISHGHSQTQFFAFKLVRFCNLTLT 83
           R+  S +++    + +++ L  C N+ H+  +   +I   H Q  F  F+L+R C+ TL 
Sbjct: 17  RDPDSNTLRLSRRKTLISVLRSCKNIAHVPSIHAKIIRTFHDQDAFVVFELIRVCS-TLD 76

Query: 84  DLCYARYIFDNLTSPNVYLYTAMITAYAAYPDPKAAFLLYRNMVRHGAIRPNHFIYPHVL 143
            + YA  +F  +++PNVYLYTAMI  + +         LY  M+ H ++ P++++   VL
Sbjct: 77  SVDYAYDVFSYVSNPNVYLYTAMIDGFVSSGRSADGVSLYHRMI-HNSVLPDNYVITSVL 136

Query: 144 KSCPDVLGSNATKMVHTQVLKSGFGRYPVVQTAIVDSYSRFSSVIGNARQMFDEMVERSV 203
           K+C         + +H QVLK GFG    V   +++ Y +   ++ NA++MFDEM +R  
Sbjct: 137 KAC----DLKVCREIHAQVLKLGFGSSRSVGLKMMEIYGKSGELV-NAKKMFDEMPDRDH 196

Query: 204 VSWTAMISGYARLGNFDSAIELFESMPERDVPAWNALIAGCAQNGFFCEAIWLFKKMVSL 263
           V+ T MI+ Y+  G    A+ELF+ +  +D   W A+I G  +N    +A+ LF++M   
Sbjct: 197 VAATVMINCYSECGFIKEALELFQDVKIKDTVCWTAMIDGLVRNKEMNKALELFREM--- 256

Query: 264 ALEGNNNDREN-KPNKTTLASALSACGNTGMLHL-------------EL----------- 323
                    EN   N+ T    LSAC + G L L             EL           
Sbjct: 257 -------QMENVSANEFTAVCVLSACSDLGALELGRWVHSFVENQRMELSNFVGNALINM 316

Query: 324 -VQCGD---------------------------------------------GVKPDEVTF 383
             +CGD                                             G +P++VT 
Sbjct: 317 YSRCGDINEARRVFRVMRDKDVISYNTMISGLAMHGASVEAINEFRDMVNRGFRPNQVTL 376

Query: 384 VGVLNACTHGGLVEKGYSYFEMMRRDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGMN 443
           V +LNAC+HGGL++ G   F  M+R +++EPQIEH+GC++DLLGR GR EEA   +  + 
Sbjct: 377 VALLNACSHGGLLDIGLEVFNSMKRVFNVEPQIEHYGCIVDLLGRVGRLEEAYRFIENIP 436

Query: 444 IEPDEVVWGSLLNACKIHGRSDLAEYSVKKLIEMDPKNGGYRIMLANIYAELGKWDEVRK 481
           IEPD ++ G+LL+ACKIHG  +L E   K+L E +  + G  ++L+N+YA  GKW E  +
Sbjct: 437 IEPDHIMLGTLLSACKIHGNMELGEKIAKRLFESENPDSGTYVLLSNLYASSGKWKESTE 496

BLAST of Cmc09g0238571 vs. ExPASy Swiss-Prot
Match: Q9SZT8 (Pentatricopeptide repeat-containing protein ELI1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=ELI1 PE=3 SV=1)

HSP 1 Score: 285.8 bits (730), Expect = 9.1e-76
Identity = 168/512 (32.81%), Postives = 269/512 (52.54%), Query Frame = 0

Query: 43  LEKCSNLNHLKQLQGFLISHG---HSQTQFFAFKLVRFCNLTLTDLCYARYIFDNLTSPN 102
           ++K  +++ + Q+   ++ H    H +      KL R    +   + ++  +F     P+
Sbjct: 36  IDKSQSVDEVLQIHAAILRHNLLLHPRYPVLNLKLHR-AYASHGKIRHSLALFHQTIDPD 95

Query: 103 VYLYTAMITAYAAYPDPKAAFLLYRNMVRHGAIRPNHFIYPHVLKSCPDVLGSNATKMVH 162
           ++L+TA I   +       AFLLY  ++    I PN F +  +LKSC     + + K++H
Sbjct: 96  LFLFTAAINTASINGLKDQAFLLYVQLL-SSEINPNEFTFSSLLKSC----STKSGKLIH 155

Query: 163 TQVLKSGFGRYPVVQTAIVDSYSRFSSVIGNARQMFDEMVERSVVSWTAMISGYARLGNF 222
           T VLK G G  P V T +VD Y++   V+ +A+++FD M ERS+VS TAMI+ YA+ GN 
Sbjct: 156 THVLKFGLGIDPYVATGLVDVYAKGGDVV-SAQKVFDRMPERSLVSSTAMITCYAKQGNV 215

Query: 223 DSAIELFESMPERDVPAWNALIAGCAQNGFFCEAIWLFKKMVSLALEGNNNDRENKPNKT 282
           ++A  LF+SM ERD+ +WN +I G AQ+GF  +A+ LF+K+++        + + KP++ 
Sbjct: 216 EAARALFDSMCERDIVSWNVMIDGYAQHGFPNDALMLFQKLLA--------EGKPKPDEI 275

Query: 283 TLASALSACGNTGMLHL-------------------------ELVQCGD----------- 342
           T+ +ALSAC   G L                              +CG            
Sbjct: 276 TVVAALSACSQIGALETGRWIHVFVKSSRIRLNVKVCTGLIDMYSKCGSLEEAVLVFNDT 335

Query: 343 -----------------------------------GVKPDEVTFVGVLNACTHGGLVEKG 402
                                              G++P ++TF+G L AC H GLV +G
Sbjct: 336 PRKDIVAWNAMIAGYAMHGYSQDALRLFNEMQGITGLQPTDITFIGTLQACAHAGLVNEG 395

Query: 403 YSYFEMMRRDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNACK 462
              FE M ++Y I+P+IEH+GCL+ LLGRAG+ + A E ++ MN++ D V+W S+L +CK
Sbjct: 396 IRIFESMGQEYGIKPKIEHYGCLVSLLGRAGQLKRAYETIKNMNMDADSVLWSSVLGSCK 455

Query: 463 IHGRSDLAEYSVKKLIEMDPKNGGYRIMLANIYAELGKWDEVRKVRKLLKEKNAYKTPGC 481
           +HG   L +   + LI ++ KN G  ++L+NIYA +G ++ V KVR L+KEK   K PG 
Sbjct: 456 LHGDFVLGKEIAEYLIGLNIKNSGIYVLLSNIYASVGDYEGVAKVRNLMKEKGIVKEPGI 515

BLAST of Cmc09g0238571 vs. ExPASy Swiss-Prot
Match: Q9FFG8 (Pentatricopeptide repeat-containing protein At5g44230 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H17 PE=2 SV=1)

HSP 1 Score: 281.6 bits (719), Expect = 1.7e-74
Identity = 167/517 (32.30%), Postives = 263/517 (50.87%), Query Frame = 0

Query: 35  LYQLVVAALEKCSNLNHLKQLQGFLISHGHSQTQFFAFKLVRFCNLTLTDL-----CYAR 94
           L   +++ L+ C NLN +KQ+ G ++  G  Q+ +   KL+R    TLT L      YAR
Sbjct: 48  LVSSLISKLDDCINLNQIKQIHGHVLRKGLDQSCYILTKLIR----TLTKLGVPMDPYAR 107

Query: 95  YIFDNLTSPNVYLYTAMITAYAAYPDPKAAFLLYRNMVRHGAIRPNHFIYPHVLKSCPDV 154
            + + +   N +L+TA+I  YA       A  +Y   +R   I P  F +  +LK+C  +
Sbjct: 108 RVIEPVQFRNPFLWTAVIRGYAIEGKFDEAIAMY-GCMRKEEITPVSFTFSALLKACGTM 167

Query: 155 LGSNATKMVHTQVLKSGFGRYPVVQTAIVDSYSRFSSVIGNARQMFDEMVERSVVSWTAM 214
              N  +  H Q  +     +  V   ++D Y +  S I  AR++FDEM ER V+SWT +
Sbjct: 168 KDLNLGRQFHAQTFRLRGFCFVYVGNTMIDMYVKCES-IDCARKVFDEMPERDVISWTEL 227

Query: 215 ISGYARLGNFDSAIELFESMPERDVPAWNALIAGCAQNGFFCEAIWLFKKM--------- 274
           I+ YAR+GN + A ELFES+P +D+ AW A++ G AQN    EA+  F +M         
Sbjct: 228 IAAYARVGNMECAAELFESLPTKDMVAWTAMVTGFAQNAKPQEALEYFDRMEKSGIRADE 287

Query: 275 ----------------------VSLALEGNNNDRENKPNKTTLASALSACGNT------- 334
                                 V +A +   +  ++    + L    S CGN        
Sbjct: 288 VTVAGYISACAQLGASKYADRAVQIAQKSGYSPSDHVVIGSALIDMYSKCGNVEEAVNVF 347

Query: 335 -------------------------GMLHL-ELVQCGDGVKPDEVTFVGVLNACTHGGLV 394
                                      LHL   +     +KP+ VTFVG L AC+H GLV
Sbjct: 348 MSMNNKNVFTYSSMILGLATHGRAQEALHLFHYMVTQTEIKPNTVTFVGALMACSHSGLV 407

Query: 395 EKGYSYFEMMRRDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLN 454
           ++G   F+ M + + ++P  +H+ C++DLLGR GR +EA+E+++ M++EP   VWG+LL 
Sbjct: 408 DQGRQVFDSMYQTFGVQPTRDHYTCMVDLLGRTGRLQEALELIKTMSVEPHGGVWGALLG 467

Query: 455 ACKIHGRSDLAEYSVKKLIEMDPKNGGYRIMLANIYAELGKWDEVRKVRKLLKEKNAYKT 482
           AC+IH   ++AE + + L E++P   G  I+L+N+YA  G W  V +VRKL+KEK   KT
Sbjct: 468 ACRIHNNPEIAEIAAEHLFELEPDIIGNYILLSNVYASAGDWGGVLRVRKLIKEKGLKKT 527

BLAST of Cmc09g0238571 vs. ExPASy TrEMBL
Match: A0A1S3CAA7 (pentatricopeptide repeat-containing protein At1g33350 OS=Cucumis melo OX=3656 GN=LOC103498615 PE=4 SV=1)

HSP 1 Score: 960.7 bits (2482), Expect = 2.3e-276
Identity = 484/556 (87.05%), Postives = 484/556 (87.05%), Query Frame = 0

Query: 1   MGLVRLFDRFINSKRLRVSKVIGRNMSSVSIQPHLYQLVVAALEKCSNLNHLKQLQGFLI 60
           MGLVRLFDRFINSKRLRVSKVIGRNMSSVSIQPHLYQLVVAALEKCSNLNHLKQLQGFLI
Sbjct: 1   MGLVRLFDRFINSKRLRVSKVIGRNMSSVSIQPHLYQLVVAALEKCSNLNHLKQLQGFLI 60

Query: 61  SHGHSQTQFFAFKLVRFCNLTLTDLCYARYIFDNLTSPNVYLYTAMITAYAAYPDPKAAF 120
           SHGHSQTQFFAFKLVRFCNLTLTDLCYARYIFDNLTSPNVYLYTAMITAYAAYPDPKAAF
Sbjct: 61  SHGHSQTQFFAFKLVRFCNLTLTDLCYARYIFDNLTSPNVYLYTAMITAYAAYPDPKAAF 120

Query: 121 LLYRNMVRHGAIRPNHFIYPHVLKSCPDVLGSNATKMVHTQVLKSGFGRYPVVQTAIVDS 180
           LLYRNMVRHGAIRPNHFIYPHVLKSCPDVLGSNATKMVHTQVLKSGFGRYPVVQTAIVDS
Sbjct: 121 LLYRNMVRHGAIRPNHFIYPHVLKSCPDVLGSNATKMVHTQVLKSGFGRYPVVQTAIVDS 180

Query: 181 YSRFSSVIGNARQMFDEMVERSVVSWTAMISGYARLGNFDSAIELFESMPERDVPAWNAL 240
           YSRFSSVIGNARQMFDEMVERSVVSWTAMISGYARLGNFDSAIELFESMPERDVPAWNAL
Sbjct: 181 YSRFSSVIGNARQMFDEMVERSVVSWTAMISGYARLGNFDSAIELFESMPERDVPAWNAL 240

Query: 241 IAGCAQNGFFCEAIWLFKKMVSLALEGNNNDRENKPNKTTLASALSACGNTGMLHL---- 300
           IAGCAQNGFFCEAIWLFKKMVSLALEGNNNDRENKPNKTTLASALSACGNTGMLHL    
Sbjct: 241 IAGCAQNGFFCEAIWLFKKMVSLALEGNNNDRENKPNKTTLASALSACGNTGMLHLGKWI 300

Query: 301 ------------------------------------------------------------ 360
                                                                       
Sbjct: 301 HGYVFKTYPGQDSFISNALLDMYGKCGNLKVARRVFDMITLKSLTSWNSLINCLALHGHS 360

Query: 361 --------ELVQCGDGVKPDEVTFVGVLNACTHGGLVEKGYSYFEMMRRDYDIEPQIEHF 420
                   ELVQCGDGVKPDEVTFVGVLNACTHGGLVEKGYSYFEMMRRDYDIEPQIEHF
Sbjct: 361 GSAIDLFAELVQCGDGVKPDEVTFVGVLNACTHGGLVEKGYSYFEMMRRDYDIEPQIEHF 420

Query: 421 GCLIDLLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNACKIHGRSDLAEYSVKKLIEMDP 480
           GCLIDLLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNACKIHGRSDLAEYSVKKLIEMDP
Sbjct: 421 GCLIDLLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNACKIHGRSDLAEYSVKKLIEMDP 480

Query: 481 KNGGYRIMLANIYAELGKWDEVRKVRKLLKEKNAYKTPGCSWIEVDNQVYQFYSFDMTHP 485
           KNGGYRIMLANIYAELGKWDEVRKVRKLLKEKNAYKTPGCSWIEVDNQVYQFYSFDMTHP
Sbjct: 481 KNGGYRIMLANIYAELGKWDEVRKVRKLLKEKNAYKTPGCSWIEVDNQVYQFYSFDMTHP 540

BLAST of Cmc09g0238571 vs. ExPASy TrEMBL
Match: A0A5A7T7Y7 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold169G001540 PE=4 SV=1)

HSP 1 Score: 859.4 bits (2219), Expect = 7.4e-246
Identity = 434/518 (83.78%), Postives = 437/518 (84.36%), Query Frame = 0

Query: 26  MSSVSIQPHLYQLVVAALEKCSNLNHLKQLQGFLISHGHSQTQFFAFKLVRFCNLTLTDL 85
           MSSVSIQPHLYQLVVAALEKCSNLNHLKQLQGFLISHGHSQTQFFAFKLVRFCNLTLTDL
Sbjct: 1   MSSVSIQPHLYQLVVAALEKCSNLNHLKQLQGFLISHGHSQTQFFAFKLVRFCNLTLTDL 60

Query: 86  CYARYIFDNLTSPNVYLYTAMITAYAAYPDPKAAFLLYRNMVRHGAIRPNHFIYPHVLKS 145
           CYARYIFDNLTSPNVYLYTAMITAYAAYPDPKAAFLLYRNMVRHGAIRPNHFIYPHVLKS
Sbjct: 61  CYARYIFDNLTSPNVYLYTAMITAYAAYPDPKAAFLLYRNMVRHGAIRPNHFIYPHVLKS 120

Query: 146 CPDVLGSNATKMVHTQVLKSGFGRYPVVQTAIVDSYSRFSSVIGNARQMFDEMVERSVVS 205
           CPDVLGSNATKMVHTQVLKSGFGRYPVVQTAIVDSYSRFSSVIGNARQMFDEMVERSVVS
Sbjct: 121 CPDVLGSNATKMVHTQVLKSGFGRYPVVQTAIVDSYSRFSSVIGNARQMFDEMVERSVVS 180

Query: 206 WTAMISGYARLGNFDSAIELFESMPERDVPAWNALIAGCAQNGFFCEAIWLFKKMVSLAL 265
           WTAMISGYARLGNFDSAIELFESMPERDVPAWNALIAGCAQNGFFCEAIWLFKKMVSLAL
Sbjct: 181 WTAMISGYARLGNFDSAIELFESMPERDVPAWNALIAGCAQNGFFCEAIWLFKKMVSLAL 240

Query: 266 EGNNNDRENKPNKTTLASALSACGNTGMLHL----------------------------- 325
           EGNNNDRENKPNKTTLASALSACGNTGMLHL                             
Sbjct: 241 EGNNNDRENKPNKTTLASALSACGNTGMLHLGKWIHGYVFKTYPGQDSFISNALLDMYGK 300

Query: 326 -------------------------------------------ELVQCGDGVKPDEVTFV 385
                                                      ELVQCGDGVKPDEVTFV
Sbjct: 301 CGNLKVARRVFDMITLKSLTSWNSLINCLALHGHSGSAIDLFAELVQCGDGVKPDEVTFV 360

Query: 386 GVLNACTHGGLVEKGYSYFEMMRRDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNI 445
           GVLNACTHGGLVEKGYSYFEMMRRDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNI
Sbjct: 361 GVLNACTHGGLVEKGYSYFEMMRRDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNI 420

Query: 446 EPDEVVWGSLLNACKIHGRSDLAEYSVKKLIEMDPKNGGYRIMLANIYAELGKWDEVRKV 468
           EPDEVVWGSLLNACKIHGRSDLAEYSVKKLIEMDPKNGGYRIMLANIYAELGKWDEVRKV
Sbjct: 421 EPDEVVWGSLLNACKIHGRSDLAEYSVKKLIEMDPKNGGYRIMLANIYAELGKWDEVRKV 480

BLAST of Cmc09g0238571 vs. ExPASy TrEMBL
Match: A0A0A0KUN2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G646730 PE=4 SV=1)

HSP 1 Score: 837.8 bits (2163), Expect = 2.3e-239
Identity = 424/532 (79.70%), Postives = 438/532 (82.33%), Query Frame = 0

Query: 1   MGLVRLFDRFINSKRLRV-------SKVIGRNMSSVSIQPHLYQLVVAALEKCSNLNHLK 60
           MG V LFDRFI SKRLRV       SK IGR+MSSVSI PHL QL VAALEKCSNLNHLK
Sbjct: 1   MGSVLLFDRFITSKRLRVWLSLRIISKAIGRSMSSVSIHPHLNQLFVAALEKCSNLNHLK 60

Query: 61  QLQGFLISHGHSQTQFFAFKLVRFCNLTLTDLCYARYIFDNLTSPNVYLYTAMITAYAAY 120
           QLQGFLISHGHSQTQFFAFKLVRFCNLTL DLCYARYIFDNLTSPNV+LYTAMITAYA+Y
Sbjct: 61  QLQGFLISHGHSQTQFFAFKLVRFCNLTLADLCYARYIFDNLTSPNVFLYTAMITAYASY 120

Query: 121 PDPKAAFLLYRNMVRHGAIRPNHFIYPHVLKSCPDVLGSNATKMVHTQVLKSGFGRYPVV 180
           PDPKAAFLLYRNMVR GAIRPN+FIYPHVL+SCPDVLGSNATKMVHTQVLKSGFG YPVV
Sbjct: 121 PDPKAAFLLYRNMVRRGAIRPNNFIYPHVLRSCPDVLGSNATKMVHTQVLKSGFGGYPVV 180

Query: 181 QTAIVDSYSRFSSVIGNARQMFDEMVERSVVSWTAMISGYARLGNFDSAIELFESMPERD 240
           QTAIVDSYSRFSS IG+ARQMFDEM+ER+VVSWTAMISGYARLGNFDSAIELFESMPERD
Sbjct: 181 QTAIVDSYSRFSSDIGSARQMFDEMLERTVVSWTAMISGYARLGNFDSAIELFESMPERD 240

Query: 241 VPAWNALIAGCAQNGFFCEAIWLFKKMVSLALEGNNNDRENKPNKTTLASALSACGNTGM 300
           VPAWNALIAGCAQNGFFCEAIWLFK+MV LALEGNNNDRENKPNKTTL SALSACG+TGM
Sbjct: 241 VPAWNALIAGCAQNGFFCEAIWLFKRMVLLALEGNNNDRENKPNKTTLGSALSACGHTGM 300

Query: 301 LHL--------------------------------------------------------- 360
           LHL                                                         
Sbjct: 301 LHLGKWIHGYVFKTYPGQDSFISNALLDMYGKCGNLKVARRVFDMITLKNLTSWNSLINC 360

Query: 361 ---------------ELVQCGDGVKPDEVTFVGVLNACTHGGLVEKGYSYFEMMRRDYDI 420
                          EL+ CGDGVKP+EVTFVGVLNACTHGGLVEKGYSYFEMMRRDYDI
Sbjct: 361 LALHGHSGSAIDLFAELIHCGDGVKPNEVTFVGVLNACTHGGLVEKGYSYFEMMRRDYDI 420

Query: 421 EPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNACKIHGRSDLAEYSVK 454
           EPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNACKIHGRSDLAEYSVK
Sbjct: 421 EPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNACKIHGRSDLAEYSVK 480

BLAST of Cmc09g0238571 vs. ExPASy TrEMBL
Match: A0A6J1DQX1 (pentatricopeptide repeat-containing protein At1g33350 OS=Momordica charantia OX=3673 GN=LOC111022983 PE=4 SV=1)

HSP 1 Score: 790.0 bits (2039), Expect = 5.5e-225
Identity = 397/530 (74.91%), Postives = 427/530 (80.57%), Query Frame = 0

Query: 26  MSSVSIQPHLYQLVVAALEKCSNLNHLKQLQGFLISHGHSQTQFFAFKLVRFCNLTLTDL 85
           M+SV IQPHL QLV++ LEKCS+LNHLKQ+QGFLIS GHSQTQFFAFKLVRFCNLTL +L
Sbjct: 1   MASVPIQPHLNQLVLSVLEKCSHLNHLKQIQGFLISLGHSQTQFFAFKLVRFCNLTLANL 60

Query: 86  CYARYIFDNLTSPNVYLYTAMITAYAAYPDPKAAFLLYRNMVRHGAIRPNHFIYPHVLKS 145
            YAR+IFD L SPNVYLYTAMITAYA+ PD KAAF+LYR+MVR G  RPNHFIYPHVLKS
Sbjct: 61  SYARFIFDQLNSPNVYLYTAMITAYASQPDSKAAFVLYRDMVRRGTPRPNHFIYPHVLKS 120

Query: 146 CPDVLGSNATKMVHTQVLKSGFGRYPVVQTAIVDSYSRFSSVIGNARQMFDEMVERSVVS 205
           CP+V  SNAT+MVH Q+LKSGFGRYPVVQTAIVDSYS+F S IG ARQMFDEM+ERSVVS
Sbjct: 121 CPEVXESNATQMVHAQILKSGFGRYPVVQTAIVDSYSKFCSDIGIARQMFDEMIERSVVS 180

Query: 206 WTAMISGYARLGNFDSAIELFESMPERDVPAWNALIAGCAQNGFFCEAIWLFKKMVSLAL 265
           WTAMISGYARLGN D A+ LFESMPERDVPAWNA+IAG AQNGFFCEAIWLF++MVSLA+
Sbjct: 181 WTAMISGYARLGNIDDAVALFESMPERDVPAWNAIIAGFAQNGFFCEAIWLFRRMVSLAM 240

Query: 266 EGNNNDRENKPNKTTLASALSACGNTGMLHL----------------------------- 325
           E ++ +RENKPNK T+ASALSACG+TGMLHL                             
Sbjct: 241 E-DDEERENKPNKITVASALSACGHTGMLHLGKWIHGYVFKSLSQDSFISNALLDMYGKC 300

Query: 326 ------------------------------------------ELVQCGDGVKPDEVTFVG 385
                                                     +LV+CGDGVKPD VTFVG
Sbjct: 301 GNLKIAKRVFDMITLKSLTSWNSLINCLALHGHSESAIDLFVKLVECGDGVKPDGVTFVG 360

Query: 386 VLNACTHGGLVEKGYSYFEMMRRDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNIE 445
           VLNACTHGGLVEKGYSYFEMMR+DY IEPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNIE
Sbjct: 361 VLNACTHGGLVEKGYSYFEMMRQDYGIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNIE 420

Query: 446 PDEVVWGSLLNACKIHGRSDLAEYSVKKLIEMDPKNGGYRIMLANIYAELGKWDEVRKVR 485
           PDEVVWGSLLN CKIHGRSDLAEYSVKKLIEMDP+NGGYRIMLANIYAEL KWDEVRKVR
Sbjct: 421 PDEVVWGSLLNGCKIHGRSDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVR 480

BLAST of Cmc09g0238571 vs. ExPASy TrEMBL
Match: A0A6J1IF22 (pentatricopeptide repeat-containing protein At1g33350 OS=Cucurbita maxima OX=3661 GN=LOC111472677 PE=4 SV=1)

HSP 1 Score: 780.0 bits (2013), Expect = 5.7e-222
Identity = 393/531 (74.01%), Postives = 426/531 (80.23%), Query Frame = 0

Query: 26  MSSVSIQPHLYQLVVAALEKCSNLNHLKQLQGFLISHGHSQTQFFAFKLVRFCNLTLTDL 85
           M+SV +QPHL QLV++ LEKCS+LNHLKQLQGFLIS GHSQTQF+AFKLVRFCNLTLTDL
Sbjct: 1   MASVPMQPHLNQLVLSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDL 60

Query: 86  CYARYIFDNLTSPNVYLYTAMITAYAAYPDPKAAFLLYRNMVRHGAIRPNHFIYPHVLKS 145
           CY+R+IFD+L+SPNVYLYTAMITAYA+ PD KAAFLLYRNMVR GA  PNHFIYPHVLKS
Sbjct: 61  CYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRNMVRRGAPLPNHFIYPHVLKS 120

Query: 146 CPDVLGSNATKMVHTQVLKSGFGRYPVVQTAIVDSYSRFSSVIGNARQMFDEMVERSVVS 205
           CP++L SN TKMVH QVLKSGFG YPVVQTAIVD+YSRF + IG ARQ+FDEM+ERSVVS
Sbjct: 121 CPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVS 180

Query: 206 WTAMISGYARLGNFDSAIELFESMPERDVPAWNALIAGCAQNGFFCEAIWLFKKMVSLAL 265
           WTAMISGYARLG+ D+A+ LFESMPERD+PAWNALIAGCAQNGFFCEAI LFK+MVSLAL
Sbjct: 181 WTAMISGYARLGDIDNAMALFESMPERDIPAWNALIAGCAQNGFFCEAIGLFKRMVSLAL 240

Query: 266 EGNNNDRENKPNKTTLASALSACGNTGMLH------------------------------ 325
           EG N +RE KPNK T+ASALS+CG+TGMLH                              
Sbjct: 241 EG-NKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGK 300

Query: 326 ------------------------------------------LELVQCGDGVKPDEVTFV 385
                                                     LELVQC DGV+PD VTFV
Sbjct: 301 CGNLKVARRVFDMITLKSLTSWNSLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFV 360

Query: 386 GVLNACTHGGLVEKGYSYFEMMRRDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNI 445
           GVLNACTHGGLVEKGYSYF+MMR+DYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNI
Sbjct: 361 GVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNI 420

Query: 446 EPDEVVWGSLLNACKIHGRSDLAEYSVKKLIEMDPKNGGYRIMLANIYAELGKWDEVRKV 485
           EPDEVVWGSLLN CKIHGR DLAEYSVKKLIEMDP+NGGYRIMLANIYAEL  WDEVRKV
Sbjct: 421 EPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKV 480

BLAST of Cmc09g0238571 vs. TAIR 10
Match: AT1G33350.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 485.0 bits (1247), Expect = 7.3e-137
Identity = 250/532 (46.99%), Postives = 329/532 (61.84%), Query Frame = 0

Query: 27  SSVSIQPHLYQLVVAALEKCSNLNHLKQLQGFLISHGHSQTQFFAFKLVRFCNLTLTDLC 86
           SS   +  L Q + A + K  +LNHLKQ+Q F+I  G S + F  FKL+RFC L L +L 
Sbjct: 15  SSHMAEQLLNQFISAVISKSRHLNHLKQVQSFMIVSGLSHSHFLCFKLLRFCTLRLCNLS 74

Query: 87  YARYIFDNLTSPNVYLYTAMITAYAAY--PDPKAAFLLYRNMVRHGAIRPNHFIYPHVLK 146
           YAR+IFD  + PN +LY A++TAY++       +AF  +R MV     RPNHFIYP VLK
Sbjct: 75  YARFIFDRFSFPNTHLYAAVLTAYSSSLPLHASSAFSFFRLMVNRSVPRPNHFIYPLVLK 134

Query: 147 SCPDVLGSNATKMVHTQVLKSGFGRYPVVQTAIVDSYSRFSSVIGNARQMFDEMVERSVV 206
           S P +  + +T +VHT + KSGF  Y VVQTA++ SY+   S I  ARQ+FDEM ER+VV
Sbjct: 135 STPYLSSAFSTPLVHTHLFKSGFHLYVVVQTALLHSYASSVSHITLARQLFDEMSERNVV 194

Query: 207 SWTAMISGYARLGNFDSAIELFESMPERDVPAWNALIAGCAQNGFFCEAIWLFKKMVSLA 266
           SWTAM+SGYAR G+  +A+ LFE MPERDVP+WNA++A C QNG F EA+ LF++M+   
Sbjct: 195 SWTAMLSGYARSGDISNAVALFEDMPERDVPSWNAILAACTQNGLFLEAVSLFRRMI--- 254

Query: 267 LEGNNNDRENKPNKTTLASALSACGNTGMLHL---------------------ELV---- 326
                N+   +PN+ T+   LSAC  TG L L                      LV    
Sbjct: 255 -----NEPSIRPNEVTVVCVLSACAQTGTLQLAKGIHAFAYRRDLSSDVFVSNSLVDLYG 314

Query: 327 QCG------------------------------------------------DGVKPDEVT 386
           +CG                                                + +KPD +T
Sbjct: 315 KCGNLEEASSVFKMASKKSLTAWNSMINCFALHGRSEEAIAVFEEMMKLNINDIKPDHIT 374

Query: 387 FVGVLNACTHGGLVEKGYSYFEMMRRDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGM 446
           F+G+LNACTHGGLV KG  YF++M   + IEP+IEH+GCLIDLLGRAGRF+EA+EV+  M
Sbjct: 375 FIGLLNACTHGGLVSKGRGYFDLMTNRFGIEPRIEHYGCLIDLLGRAGRFDEALEVMSTM 434

Query: 447 NIEPDEVVWGSLLNACKIHGRSDLAEYSVKKLIEMDPKNGGYRIMLANIYAELGKWDEVR 484
            ++ DE +WGSLLNACKIHG  DLAE +VK L+ ++P NGGY  M+AN+Y E+G W+E R
Sbjct: 435 KMKADEAIWGSLLNACKIHGHLDLAEVAVKNLVALNPNNGGYVAMMANLYGEMGNWEEAR 494

BLAST of Cmc09g0238571 vs. TAIR 10
Match: AT2G20540.1 (mitochondrial editing factor 21 )

HSP 1 Score: 307.0 bits (785), Expect = 2.7e-83
Identity = 170/510 (33.33%), Postives = 278/510 (54.51%), Query Frame = 0

Query: 43  LEKCSNLNHLKQLQGFLISHGHSQTQFFAFKLVRFCNLTLTDLCYARYIFDNLTSPNVYL 102
           L++  + N  K++   +I HG SQ+ F   K+V FC+  + D+ YA  +F+ +++PNV+L
Sbjct: 17  LQRVKSRNEWKKINASIIIHGLSQSSFMVTKMVDFCD-KIEDMDYATRLFNQVSNPNVFL 76

Query: 103 YTAMITAYAAYPDPKAAFLLYRNMVRHGAIRPNHFIYPHVLKSCPDVLGSNATKMVHTQV 162
           Y ++I AY           +Y+ ++R     P+ F +P + KSC  +      K VH  +
Sbjct: 77  YNSIIRAYTHNSLYCDVIRIYKQLLRKSFELPDRFTFPFMFKSCASLGSCYLGKQVHGHL 136

Query: 163 LKSGFGRYPVVQTAIVDSYSRFSSVIGNARQMFDEMVERSVVSWTAMISGYARLGNFDSA 222
            K G   + V + A++D Y +F  ++ +A ++FDEM ER V+SW +++SGYARLG    A
Sbjct: 137 CKFGPRFHVVTENALIDMYMKFDDLV-DAHKVFDEMYERDVISWNSLLSGYARLGQMKKA 196

Query: 223 IELFESMPERDVPAWNALIAGCAQNGFFCEAIWLFKKMVSLALEGNNNDRENKPNKTTLA 282
             LF  M ++ + +W A+I+G    G + EA+  F++M    +E         P++ +L 
Sbjct: 197 KGLFHLMLDKTIVSWTAMISGYTGIGCYVEAMDFFREMQLAGIE---------PDEISLI 256

Query: 283 SALSACGNTGMLHL---------------------ELVQ----CG------------DG- 342
           S L +C   G L L                      L++    CG            +G 
Sbjct: 257 SVLPSCAQLGSLELGKWIHLYAERRGFLKQTGVCNALIEMYSKCGVISQAIQLFGQMEGK 316

Query: 343 --------------------------------VKPDEVTFVGVLNACTHGGLVEKGYSYF 402
                                           VKP+ +TF+G+L+AC+H G+ ++G  YF
Sbjct: 317 DVISWSTMISGYAYHGNAHGAIETFNEMQRAKVKPNGITFLGLLSACSHVGMWQEGLRYF 376

Query: 403 EMMRRDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNACKIHGR 462
           +MMR+DY IEP+IEH+GCLID+L RAG+ E A+E+ + M ++PD  +WGSLL++C+  G 
Sbjct: 377 DMMRQDYQIEPKIEHYGCLIDVLARAGKLERAVEITKTMPMKPDSKIWGSLLSSCRTPGN 436

Query: 463 SDLAEYSVKKLIEMDPKNGGYRIMLANIYAELGKWDEVRKVRKLLKEKNAYKTPGCSWIE 483
            D+A  ++  L+E++P++ G  ++LANIYA+LGKW++V ++RK+++ +N  KTPG S IE
Sbjct: 437 LDVALVAMDHLVELEPEDMGNYVLLANIYADLGKWEDVSRLRKMIRNENMKKTPGGSLIE 496

BLAST of Cmc09g0238571 vs. TAIR 10
Match: AT5G59200.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 303.1 bits (775), Expect = 3.9e-82
Identity = 174/528 (32.95%), Postives = 272/528 (51.52%), Query Frame = 0

Query: 24  RNMSSVSIQPHLYQLVVAALEKCSNLNHLKQLQGFLISHGHSQTQFFAFKLVRFCNLTLT 83
           R+  S +++    + +++ L  C N+ H+  +   +I   H Q  F  F+L+R C+ TL 
Sbjct: 17  RDPDSNTLRLSRRKTLISVLRSCKNIAHVPSIHAKIIRTFHDQDAFVVFELIRVCS-TLD 76

Query: 84  DLCYARYIFDNLTSPNVYLYTAMITAYAAYPDPKAAFLLYRNMVRHGAIRPNHFIYPHVL 143
            + YA  +F  +++PNVYLYTAMI  + +         LY  M+ H ++ P++++   VL
Sbjct: 77  SVDYAYDVFSYVSNPNVYLYTAMIDGFVSSGRSADGVSLYHRMI-HNSVLPDNYVITSVL 136

Query: 144 KSCPDVLGSNATKMVHTQVLKSGFGRYPVVQTAIVDSYSRFSSVIGNARQMFDEMVERSV 203
           K+C         + +H QVLK GFG    V   +++ Y +   ++ NA++MFDEM +R  
Sbjct: 137 KAC----DLKVCREIHAQVLKLGFGSSRSVGLKMMEIYGKSGELV-NAKKMFDEMPDRDH 196

Query: 204 VSWTAMISGYARLGNFDSAIELFESMPERDVPAWNALIAGCAQNGFFCEAIWLFKKMVSL 263
           V+ T MI+ Y+  G    A+ELF+ +  +D   W A+I G  +N    +A+ LF++M   
Sbjct: 197 VAATVMINCYSECGFIKEALELFQDVKIKDTVCWTAMIDGLVRNKEMNKALELFREM--- 256

Query: 264 ALEGNNNDREN-KPNKTTLASALSACGNTGMLHL-------------EL----------- 323
                    EN   N+ T    LSAC + G L L             EL           
Sbjct: 257 -------QMENVSANEFTAVCVLSACSDLGALELGRWVHSFVENQRMELSNFVGNALINM 316

Query: 324 -VQCGD---------------------------------------------GVKPDEVTF 383
             +CGD                                             G +P++VT 
Sbjct: 317 YSRCGDINEARRVFRVMRDKDVISYNTMISGLAMHGASVEAINEFRDMVNRGFRPNQVTL 376

Query: 384 VGVLNACTHGGLVEKGYSYFEMMRRDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGMN 443
           V +LNAC+HGGL++ G   F  M+R +++EPQIEH+GC++DLLGR GR EEA   +  + 
Sbjct: 377 VALLNACSHGGLLDIGLEVFNSMKRVFNVEPQIEHYGCIVDLLGRVGRLEEAYRFIENIP 436

Query: 444 IEPDEVVWGSLLNACKIHGRSDLAEYSVKKLIEMDPKNGGYRIMLANIYAELGKWDEVRK 481
           IEPD ++ G+LL+ACKIHG  +L E   K+L E +  + G  ++L+N+YA  GKW E  +
Sbjct: 437 IEPDHIMLGTLLSACKIHGNMELGEKIAKRLFESENPDSGTYVLLSNLYASSGKWKESTE 496

BLAST of Cmc09g0238571 vs. TAIR 10
Match: AT4G37380.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 285.8 bits (730), Expect = 6.5e-77
Identity = 168/512 (32.81%), Postives = 269/512 (52.54%), Query Frame = 0

Query: 43  LEKCSNLNHLKQLQGFLISHG---HSQTQFFAFKLVRFCNLTLTDLCYARYIFDNLTSPN 102
           ++K  +++ + Q+   ++ H    H +      KL R    +   + ++  +F     P+
Sbjct: 36  IDKSQSVDEVLQIHAAILRHNLLLHPRYPVLNLKLHR-AYASHGKIRHSLALFHQTIDPD 95

Query: 103 VYLYTAMITAYAAYPDPKAAFLLYRNMVRHGAIRPNHFIYPHVLKSCPDVLGSNATKMVH 162
           ++L+TA I   +       AFLLY  ++    I PN F +  +LKSC     + + K++H
Sbjct: 96  LFLFTAAINTASINGLKDQAFLLYVQLL-SSEINPNEFTFSSLLKSC----STKSGKLIH 155

Query: 163 TQVLKSGFGRYPVVQTAIVDSYSRFSSVIGNARQMFDEMVERSVVSWTAMISGYARLGNF 222
           T VLK G G  P V T +VD Y++   V+ +A+++FD M ERS+VS TAMI+ YA+ GN 
Sbjct: 156 THVLKFGLGIDPYVATGLVDVYAKGGDVV-SAQKVFDRMPERSLVSSTAMITCYAKQGNV 215

Query: 223 DSAIELFESMPERDVPAWNALIAGCAQNGFFCEAIWLFKKMVSLALEGNNNDRENKPNKT 282
           ++A  LF+SM ERD+ +WN +I G AQ+GF  +A+ LF+K+++        + + KP++ 
Sbjct: 216 EAARALFDSMCERDIVSWNVMIDGYAQHGFPNDALMLFQKLLA--------EGKPKPDEI 275

Query: 283 TLASALSACGNTGMLHL-------------------------ELVQCGD----------- 342
           T+ +ALSAC   G L                              +CG            
Sbjct: 276 TVVAALSACSQIGALETGRWIHVFVKSSRIRLNVKVCTGLIDMYSKCGSLEEAVLVFNDT 335

Query: 343 -----------------------------------GVKPDEVTFVGVLNACTHGGLVEKG 402
                                              G++P ++TF+G L AC H GLV +G
Sbjct: 336 PRKDIVAWNAMIAGYAMHGYSQDALRLFNEMQGITGLQPTDITFIGTLQACAHAGLVNEG 395

Query: 403 YSYFEMMRRDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNACK 462
              FE M ++Y I+P+IEH+GCL+ LLGRAG+ + A E ++ MN++ D V+W S+L +CK
Sbjct: 396 IRIFESMGQEYGIKPKIEHYGCLVSLLGRAGQLKRAYETIKNMNMDADSVLWSSVLGSCK 455

Query: 463 IHGRSDLAEYSVKKLIEMDPKNGGYRIMLANIYAELGKWDEVRKVRKLLKEKNAYKTPGC 481
           +HG   L +   + LI ++ KN G  ++L+NIYA +G ++ V KVR L+KEK   K PG 
Sbjct: 456 LHGDFVLGKEIAEYLIGLNIKNSGIYVLLSNIYASVGDYEGVAKVRNLMKEKGIVKEPGI 515

BLAST of Cmc09g0238571 vs. TAIR 10
Match: AT5G66520.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 281.6 bits (719), Expect = 1.2e-75
Identity = 166/521 (31.86%), Postives = 266/521 (51.06%), Query Frame = 0

Query: 28  SVSIQPHLYQLVVAALEKCSNLNHLKQLQGFLISHGHSQTQFFAFKLVRFC-NLTLTD-L 87
           S S++ +LY+  ++ L++CS    LKQ+   ++  G  Q  +   K + FC + T +D L
Sbjct: 7   SFSLEHNLYE-TMSCLQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFL 66

Query: 88  CYARYIFDNLTSPNVYLYTAMITAYAAYPDPKAAFLLYRNMVRHGAIRPNHFIYPHVLKS 147
            YA+ +FD    P+ +L+  MI  ++   +P+ + LLY+ M+   A   N + +P +LK+
Sbjct: 67  PYAQIVFDGFDRPDTFLWNLMIRGFSCSDEPERSLLLYQRMLCSSAPH-NAYTFPSLLKA 126

Query: 148 CPDVLGSNATKMVHTQVLKSGFGRYPVVQTAIVDSYSRFSSVIGN---ARQMFDEMVERS 207
           C ++     T  +H Q+ K G+        ++++SY    +V GN   A  +FD + E  
Sbjct: 127 CSNLSAFEETTQIHAQITKLGYENDVYAVNSLINSY----AVTGNFKLAHLLFDRIPEPD 186

Query: 208 VVSWTAMISGYARLGNFDSAIELFESMPERDVPAWNALIAGCAQNGFFCEAIWLFKKMVS 267
            VSW ++I GY + G  D A+ LF  M E++  +W  +I+G  Q     EA+ LF +M  
Sbjct: 187 DVSWNSVIKGYVKAGKMDIALTLFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEM-- 246

Query: 268 LALEGNNNDRENKPNKTTLASALSACGNTGML------HLEL------------------ 327
                 N+D E  P+  +LA+ALSAC   G L      H  L                  
Sbjct: 247 -----QNSDVE--PDNVSLANALSACAQLGALEQGKWIHSYLNKTRIRMDSVLGCVLIDM 306

Query: 328 -VQCGD---------------------------------------------GVKPDEVTF 387
             +CG+                                             G+KP+ +TF
Sbjct: 307 YAKCGEMEEALEVFKNIKKKSVQAWTALISGYAYHGHGREAISKFMEMQKMGIKPNVITF 366

Query: 388 VGVLNACTHGGLVEKGYSYFEMMRRDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGMN 447
             VL AC++ GLVE+G   F  M RDY+++P IEH+GC++DLLGRAG  +EA   ++ M 
Sbjct: 367 TAVLTACSYTGLVEEGKLIFYSMERDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMP 426

Query: 448 IEPDEVVWGSLLNACKIHGRSDLAEYSVKKLIEMDPKNGGYRIMLANIYAELGKWDEVRK 474
           ++P+ V+WG+LL AC+IH   +L E   + LI +DP +GG  +  ANI+A   KWD+  +
Sbjct: 427 LKPNAVIWGALLKACRIHKNIELGEEIGEILIAIDPYHGGRYVHKANIHAMDKKWDKAAE 486

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_008459497.14.8e-27687.05PREDICTED: pentatricopeptide repeat-containing protein At1g33350 [Cucumis melo][more]
XP_004141609.33.9e-25780.64pentatricopeptide repeat-containing protein At1g33350 [Cucumis sativus][more]
KAA0039360.11.5e-24583.78pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK00543... [more]
XP_038889416.14.6e-24280.04pentatricopeptide repeat-containing protein At1g33350-like [Benincasa hispida][more]
KAE8648973.11.7e-24180.00hypothetical protein Csa_008304 [Cucumis sativus][more]
Match NameE-valueIdentityDescription
Q9C5011.0e-13546.99Pentatricopeptide repeat-containing protein At1g33350 OS=Arabidopsis thaliana OX... [more]
Q9SIL53.8e-8233.33Pentatricopeptide repeat-containing protein At2g20540 OS=Arabidopsis thaliana OX... [more]
Q9FIF75.5e-8132.95Putative pentatricopeptide repeat-containing protein At5g59200, chloroplastic OS... [more]
Q9SZT89.1e-7632.81Pentatricopeptide repeat-containing protein ELI1, chloroplastic OS=Arabidopsis t... [more]
Q9FFG81.7e-7432.30Pentatricopeptide repeat-containing protein At5g44230 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A1S3CAA72.3e-27687.05pentatricopeptide repeat-containing protein At1g33350 OS=Cucumis melo OX=3656 GN... [more]
A0A5A7T7Y77.4e-24683.78Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A0A0KUN22.3e-23979.70Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G646730 PE=4 SV=1[more]
A0A6J1DQX15.5e-22574.91pentatricopeptide repeat-containing protein At1g33350 OS=Momordica charantia OX=... [more]
A0A6J1IF225.7e-22274.01pentatricopeptide repeat-containing protein At1g33350 OS=Cucurbita maxima OX=366... [more]
Match NameE-valueIdentityDescription
AT1G33350.17.3e-13746.99Pentatricopeptide repeat (PPR) superfamily protein [more]
AT2G20540.12.7e-8333.33mitochondrial editing factor 21 [more]
AT5G59200.13.9e-8232.95Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G37380.16.5e-7732.81Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT5G66520.11.2e-7531.86Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Melon (Charmono) v1.1
Date Performed: 2022-10-13
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 348..371
e-value: 5.6E-4
score: 17.9
coord: 310..343
e-value: 0.0013
score: 16.8
coord: 204..234
e-value: 2.6E-8
score: 31.5
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 204..233
e-value: 3.7E-9
score: 36.2
coord: 310..338
e-value: 0.037
score: 14.3
coord: 348..371
e-value: 0.0065
score: 16.6
coord: 102..130
e-value: 0.78
score: 10.1
coord: 236..262
e-value: 3.6E-5
score: 23.7
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 202..236
score: 12.287675
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 299..461
e-value: 6.7E-24
score: 86.9
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 6..155
e-value: 3.7E-9
score: 38.1
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 156..298
e-value: 4.1E-26
score: 93.4
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 213..432
NoneNo IPR availablePANTHERPTHR47924:SF11SUBFAMILY NOT NAMEDcoord: 297..484
NoneNo IPR availablePANTHERPTHR47924:SF11SUBFAMILY NOT NAMEDcoord: 19..296
NoneNo IPR availablePANTHERPTHR47924PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 19..296
NoneNo IPR availablePANTHERPTHR47924PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 297..484

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cmc09g0238571.1Cmc09g0238571.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding