Cla97C06G111710 (gene) Watermelon (97103) v2

NameCla97C06G111710
Typegene
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionPentatricopeptide repeat-containing protein, putative
LocationCla97Chr06 : 2506398 .. 2508809 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCGCCATGGAAGAGGTGGTTTCCACGCCATGGAATCAAGGGCAACTTCAACTCTCTCTCAATTGGCCGACCTCTTCCTCGTGGCTTCCATTACCAAAACCCTTTCGGAATCAGGTACTCGAACCCTCCAACACCATTCTCTTCCATTATCACAGCCTCTCCTTCTCCAAATCCTCCATTCCAGATCTGTTCATCCTTCCCACAAGCTCGATTTCTTCAAATGGTGTTCTCTCACCCCCAATTTCCACCATTCATCCTCCACGTATTCCCAAATCTTCCATGTCCTCTGCCGCTCTGGATACCTCCACGAGGTTCCCCCTTTACTCTCCTCGATGAAGCGAGATGGGGTTGCTGTTGATTCCCACACTTTCAAGGTCCTTCTCGATGCCTTTATCAGGTCTGGTAAATACGATGCTGCCCTTGAAATTTTAGACCATATGGAAGATTTGGGAACTAGCTTGGAACTCAACACCTACAACTCTGTTCTTGTTGCTCTGCTTAGAAAAAACCAGGTGGGTTTGGCATTGTCAATTTTCTTTAAGCTGTTTGACGCTTCTAATAATGAAGGGCAAGAAAGTACTGCTGCAACTAGTTTTCCTTTCTTGCCTAATTCACTAGCTTGTAATGAATTGTTGGTCGCTCTTAGGAAATCAGACATGAGGGTTGAATTCAAAAAGGTTTTTGACAAGCTTAGAGCAATTAGAAGCTTTGAGTTTAATATATGTGGTTATAATATATGTATTCATGCTTTTGGATGTTGGGGTTATCTGGATACTTCTCTTGCTCTGTTCAAAGAAATGAAGCAAAAGAGCTTAGTTTCGGAGTCTTTCGGTCCGGATTTGTGTACATATAATAGCCTTATTCATGTGCTATGTTTGGTAGGGAAGGTTAAGGATGCACTTATTGTGTGGGAGGAACTTAAAGGGTCAGGTCATGAGCCCGATGCCTTCACTTACCGTGTCATAATTCAGGGTTGCTGTAAATCTTACCGAATGGACGATGCAACCATGATTTTTAATGAAATGGAGTACAGTGGATTTATCCCAGATACCATTGTGTATAATTCTCTCCTAGACGGGCTATTTAAGGCTCGGAAAGTTACTGAAGCATGTCAACTTTTTGATAAAATGGTACAAGAAGGTGTAAGAGCTTCTCCTTGGACATACAATATTCTAATTGATGGATTGTTTAGGAATGGAAGAGCTGAAGCTAGCTACTCTTTATTCTGTGATTTGAAGAAAAAGGGTCAGTTTGTGGATGGTGTTACTTACAGCATCATCGTTTTGCAACTGTGTAAAGAGGGACTGCTTGAGGAAGCTCTACAATTGGTCGAAGAAATGGAAGCGAGAGGCTTTGTTGTTGATCTTATTACTGTAACATCTCTTTTAATTGCAATGCACAAGCAAGGCCAGTGGGAAGAGCTAGAGAGGCTCATGAAGCACATTAGAGAAGGTGATTTGGTCCCTAATGTTCTCAAATGGAAGACTAACATGGAAAATTCAATCAAATATCAGCAAAATAAAAGGAAAGACTACACATCCCTGTTCTCCCCAAAGGAGGATTTGAGTGAGATTATTAGTGCAAGAGCTTCTTCTGTTCCGAAAGTTAATATTGATGATACTTCTGAAAACAAAGAAGAAAGAGATGCGGAGAGTTGGTCATCATCCCCACATGCAGATCTTTTGGCTAATCTTGCAAAGTCCACCGGTGATTTTTTGCAACCATTCTCTCTAAGTCAGGGGCGACGAATCCAAGCTAAAGGGGACAACTCATTTGATATCAATATGGTTAATACATTTTTGTCTATTTTTCTAGCAAAGGGAAAATTGAGCTTGGCATGTAAGTTGTTTGAGGTCTTCAGTGATATGGGTGTCAACCCAGTGAGATACACATACAATTCAATGCTGAGTTCATTTGTGAAGAAGGGATACTTTCACCAGGCATGGGGCATATTTAACGAAATGGGCGAGAAGGTATGCCCAGCCGATATAGCCACATACAATGTGATCATTCAAGGACTCGGGAAGATGGGTAGAGCGGATCTTGCAAGTTCTGTTCTGGAAAAGCTAATGGAGCAGGGTGGCTATCTCGATATTGTAATGTACAACACCTTGATTAATGCGTTGGGGAAGGCAGGTCGAATGGATGATGTAAATAAGCTTTTCGAGCAGATGAAGAACAGCGGGATAAACCCAGATGTTGTCACTTTTAATACACTTATTGAAGTTCACAGCAAAGCAGGTCGGTTTAAGGATGCTTACAAGTTCTTAAAAATGATGCTGGATTCGGGCTGTTCCCCCAATCATGTCACAGATACAACTTTGGATTTTCTAGGGAGGGAGATCGAGAAAGCAAGGTATGAAAAAGCTTCAATCATACGTGACAAGAACAGTTCTTCACTTTCCTGTTAA

mRNA sequence

ATGCGCCATGGAAGAGGTGGTTTCCACGCCATGGAATCAAGGGCAACTTCAACTCTCTCTCAATTGGCCGACCTCTTCCTCGTGGCTTCCATTACCAAAACCCTTTCGGAATCAGGTACTCGAACCCTCCAACACCATTCTCTTCCATTATCACAGCCTCTCCTTCTCCAAATCCTCCATTCCAGATCTGTTCATCCTTCCCACAAGCTCGATTTCTTCAAATGGTGTTCTCTCACCCCCAATTTCCACCATTCATCCTCCACGTATTCCCAAATCTTCCATGTCCTCTGCCGCTCTGGATACCTCCACGAGGTTCCCCCTTTACTCTCCTCGATGAAGCGAGATGGGGTTGCTGTTGATTCCCACACTTTCAAGGTCCTTCTCGATGCCTTTATCAGGTCTGGTAAATACGATGCTGCCCTTGAAATTTTAGACCATATGGAAGATTTGGGAACTAGCTTGGAACTCAACACCTACAACTCTGTTCTTGTTGCTCTGCTTAGAAAAAACCAGGTGGGTTTGGCATTGTCAATTTTCTTTAAGCTGTTTGACGCTTCTAATAATGAAGGGCAAGAAAGTACTGCTGCAACTAGTTTTCCTTTCTTGCCTAATTCACTAGCTTGTAATGAATTGTTGGTCGCTCTTAGGAAATCAGACATGAGGGTTGAATTCAAAAAGGTTTTTGACAAGCTTAGAGCAATTAGAAGCTTTGAGTTTAATATATGTGGTTATAATATATGTATTCATGCTTTTGGATGTTGGGGTTATCTGGATACTTCTCTTGCTCTGTTCAAAGAAATGAAGCAAAAGAGCTTAGTTTCGGAGTCTTTCGGTCCGGATTTGTGTACATATAATAGCCTTATTCATGTGCTATGTTTGGTAGGGAAGGTTAAGGATGCACTTATTGTGTGGGAGGAACTTAAAGGGTCAGGTCATGAGCCCGATGCCTTCACTTACCGTGTCATAATTCAGGGTTGCTGTAAATCTTACCGAATGGACGATGCAACCATGATTTTTAATGAAATGGAGTACAGTGGATTTATCCCAGATACCATTGTGTATAATTCTCTCCTAGACGGGCTATTTAAGGCTCGGAAAGTTACTGAAGCATGTCAACTTTTTGATAAAATGGTACAAGAAGGTGTAAGAGCTTCTCCTTGGACATACAATATTCTAATTGATGGATTGTTTAGGAATGGAAGAGCTGAAGCTAGCTACTCTTTATTCTGTGATTTGAAGAAAAAGGGTCAGTTTGTGGATGGTGTTACTTACAGCATCATCGTTTTGCAACTGTGTAAAGAGGGACTGCTTGAGGAAGCTCTACAATTGGTCGAAGAAATGGAAGCGAGAGGCTTTGTTGTTGATCTTATTACTGTAACATCTCTTTTAATTGCAATGCACAAGCAAGGCCAGTGGGAAGAGCTAGAGAGGCTCATGAAGCACATTAGAGAAGGTGATTTGGTCCCTAATGTTCTCAAATGGAAGACTAACATGGAAAATTCAATCAAATATCAGCAAAATAAAAGGAAAGACTACACATCCCTGTTCTCCCCAAAGGAGGATTTGAGTGAGATTATTAGTGCAAGAGCTTCTTCTGTTCCGAAAGTTAATATTGATGATACTTCTGAAAACAAAGAAGAAAGAGATGCGGAGAGTTGGTCATCATCCCCACATGCAGATCTTTTGGCTAATCTTGCAAAGTCCACCGGTGATTTTTTGCAACCATTCTCTCTAAGTCAGGGGCGACGAATCCAAGCTAAAGGGGACAACTCATTTGATATCAATATGGTTAATACATTTTTGTCTATTTTTCTAGCAAAGGGAAAATTGAGCTTGGCATGTAAGTTGTTTGAGGTCTTCAGTGATATGGGTGTCAACCCAGTGAGATACACATACAATTCAATGCTGAGTTCATTTGTGAAGAAGGGATACTTTCACCAGGCATGGGGCATATTTAACGAAATGGGCGAGAAGGTATGCCCAGCCGATATAGCCACATACAATGTGATCATTCAAGGACTCGGGAAGATGGGTAGAGCGGATCTTGCAAGTTCTGTTCTGGAAAAGCTAATGGAGCAGGGTGGCTATCTCGATATTGTAATGTACAACACCTTGATTAATGCGTTGGGGAAGGCAGGTCGAATGGATGATGTAAATAAGCTTTTCGAGCAGATGAAGAACAGCGGGATAAACCCAGATGTTGTCACTTTTAATACACTTATTGAAGTTCACAGCAAAGCAGGTCGGTTTAAGGATGCTTACAAGTTCTTAAAAATGATGCTGGATTCGGGCTGTTCCCCCAATCATGTCACAGATACAACTTTGGATTTTCTAGGGAGGGAGATCGAGAAAGCAAGGTATGAAAAAGCTTCAATCATACGTGACAAGAACAGTTCTTCACTTTCCTGTTAA

Coding sequence (CDS)

ATGCGCCATGGAAGAGGTGGTTTCCACGCCATGGAATCAAGGGCAACTTCAACTCTCTCTCAATTGGCCGACCTCTTCCTCGTGGCTTCCATTACCAAAACCCTTTCGGAATCAGGTACTCGAACCCTCCAACACCATTCTCTTCCATTATCACAGCCTCTCCTTCTCCAAATCCTCCATTCCAGATCTGTTCATCCTTCCCACAAGCTCGATTTCTTCAAATGGTGTTCTCTCACCCCCAATTTCCACCATTCATCCTCCACGTATTCCCAAATCTTCCATGTCCTCTGCCGCTCTGGATACCTCCACGAGGTTCCCCCTTTACTCTCCTCGATGAAGCGAGATGGGGTTGCTGTTGATTCCCACACTTTCAAGGTCCTTCTCGATGCCTTTATCAGGTCTGGTAAATACGATGCTGCCCTTGAAATTTTAGACCATATGGAAGATTTGGGAACTAGCTTGGAACTCAACACCTACAACTCTGTTCTTGTTGCTCTGCTTAGAAAAAACCAGGTGGGTTTGGCATTGTCAATTTTCTTTAAGCTGTTTGACGCTTCTAATAATGAAGGGCAAGAAAGTACTGCTGCAACTAGTTTTCCTTTCTTGCCTAATTCACTAGCTTGTAATGAATTGTTGGTCGCTCTTAGGAAATCAGACATGAGGGTTGAATTCAAAAAGGTTTTTGACAAGCTTAGAGCAATTAGAAGCTTTGAGTTTAATATATGTGGTTATAATATATGTATTCATGCTTTTGGATGTTGGGGTTATCTGGATACTTCTCTTGCTCTGTTCAAAGAAATGAAGCAAAAGAGCTTAGTTTCGGAGTCTTTCGGTCCGGATTTGTGTACATATAATAGCCTTATTCATGTGCTATGTTTGGTAGGGAAGGTTAAGGATGCACTTATTGTGTGGGAGGAACTTAAAGGGTCAGGTCATGAGCCCGATGCCTTCACTTACCGTGTCATAATTCAGGGTTGCTGTAAATCTTACCGAATGGACGATGCAACCATGATTTTTAATGAAATGGAGTACAGTGGATTTATCCCAGATACCATTGTGTATAATTCTCTCCTAGACGGGCTATTTAAGGCTCGGAAAGTTACTGAAGCATGTCAACTTTTTGATAAAATGGTACAAGAAGGTGTAAGAGCTTCTCCTTGGACATACAATATTCTAATTGATGGATTGTTTAGGAATGGAAGAGCTGAAGCTAGCTACTCTTTATTCTGTGATTTGAAGAAAAAGGGTCAGTTTGTGGATGGTGTTACTTACAGCATCATCGTTTTGCAACTGTGTAAAGAGGGACTGCTTGAGGAAGCTCTACAATTGGTCGAAGAAATGGAAGCGAGAGGCTTTGTTGTTGATCTTATTACTGTAACATCTCTTTTAATTGCAATGCACAAGCAAGGCCAGTGGGAAGAGCTAGAGAGGCTCATGAAGCACATTAGAGAAGGTGATTTGGTCCCTAATGTTCTCAAATGGAAGACTAACATGGAAAATTCAATCAAATATCAGCAAAATAAAAGGAAAGACTACACATCCCTGTTCTCCCCAAAGGAGGATTTGAGTGAGATTATTAGTGCAAGAGCTTCTTCTGTTCCGAAAGTTAATATTGATGATACTTCTGAAAACAAAGAAGAAAGAGATGCGGAGAGTTGGTCATCATCCCCACATGCAGATCTTTTGGCTAATCTTGCAAAGTCCACCGGTGATTTTTTGCAACCATTCTCTCTAAGTCAGGGGCGACGAATCCAAGCTAAAGGGGACAACTCATTTGATATCAATATGGTTAATACATTTTTGTCTATTTTTCTAGCAAAGGGAAAATTGAGCTTGGCATGTAAGTTGTTTGAGGTCTTCAGTGATATGGGTGTCAACCCAGTGAGATACACATACAATTCAATGCTGAGTTCATTTGTGAAGAAGGGATACTTTCACCAGGCATGGGGCATATTTAACGAAATGGGCGAGAAGGTATGCCCAGCCGATATAGCCACATACAATGTGATCATTCAAGGACTCGGGAAGATGGGTAGAGCGGATCTTGCAAGTTCTGTTCTGGAAAAGCTAATGGAGCAGGGTGGCTATCTCGATATTGTAATGTACAACACCTTGATTAATGCGTTGGGGAAGGCAGGTCGAATGGATGATGTAAATAAGCTTTTCGAGCAGATGAAGAACAGCGGGATAAACCCAGATGTTGTCACTTTTAATACACTTATTGAAGTTCACAGCAAAGCAGGTCGGTTTAAGGATGCTTACAAGTTCTTAAAAATGATGCTGGATTCGGGCTGTTCCCCCAATCATGTCACAGATACAACTTTGGATTTTCTAGGGAGGGAGATCGAGAAAGCAAGGTATGAAAAAGCTTCAATCATACGTGACAAGAACAGTTCTTCACTTTCCTGTTAA

Protein sequence

MRHGRGGFHAMESRATSTLSQLADLFLVASITKTLSESGTRTLQHHSLPLSQPLLLQILHSRSVHPSHKLDFFKWCSLTPNFHHSSSTYSQIFHVLCRSGYLHEVPPLLSSMKRDGVAVDSHTFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFKLFDASNNEGQESTAATSFPFLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFNICGYNICIHAFGCWGYLDTSLALFKEMKQKSLVSESFGPDLCTYNSLIHVLCLVGKVKDALIVWEELKGSGHEPDAFTYRVIIQGCCKSYRMDDATMIFNEMEYSGFIPDTIVYNSLLDGLFKARKVTEACQLFDKMVQEGVRASPWTYNILIDGLFRNGRAEASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLLEEALQLVEEMEARGFVVDLITVTSLLIAMHKQGQWEELERLMKHIREGDLVPNVLKWKTNMENSIKYQQNKRKDYTSLFSPKEDLSEIISARASSVPKVNIDDTSENKEERDAESWSSSPHADLLANLAKSTGDFLQPFSLSQGRRIQAKGDNSFDINMVNTFLSIFLAKGKLSLACKLFEVFSDMGVNPVRYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGYLDIVMYNTLINALGKAGRMDDVNKLFEQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYKFLKMMLDSGCSPNHVTDTTLDFLGREIEKARYEKASIIRDKNSSSLSC
BLAST of Cla97C06G111710 vs. NCBI nr
Match: XP_023549441.1 (pentatricopeptide repeat-containing protein At4g01570 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 485.7 bits (1249), Expect = 3.0e-133
Identity = 555/632 (87.82%), Postives = 581/632 (91.93%), Query Frame = 0

Query: 1   MRHGRGGFHAMESRATSTLSQLADLFLVASITKTLSESGTRTLQHHSLPLSQPLLLQILH 60
           MRH  GGF AMESRAT TLS+LADL LVASITKTLSESGTRTLQH SL +S+PLLLQIL 
Sbjct: 1   MRHRIGGFVAMESRATPTLSRLADLLLVASITKTLSESGTRTLQHQSLSISEPLLLQILR 60

Query: 61  SRSVHPSHKLDFFKWCSLTPNFHHSSSTYSQIFHVLCRSGYLHEVPPLLSSMKRDGVAVD 120
           SRSVHPS+KLDFFKWCSL+PNF HS+STYSQIF  LCRSGYLHEVP +LSSMKRDGV VD
Sbjct: 61  SRSVHPSNKLDFFKWCSLSPNFSHSASTYSQIFRTLCRSGYLHEVPLVLSSMKRDGVDVD 120

Query: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180
             XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 
Sbjct: 121 SHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXF 180

Query: 181 XXFDASNNEGQESTAATSFPFLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFN 240
             FDA +  GQE +A  SF FLPN+LACNELLVALRKSDMRVEFKKVFDKLR IRSFEFN
Sbjct: 181 KLFDAFSTGGQEGSAVPSFSFLPNALACNELLVALRKSDMRVEFKKVFDKLRTIRSFEFN 240

Query: 241 ICGYNICIHAFGCWGYLDTSLALXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 300
           +CGYNICIHAFGCWGYLDTSLAL             XXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 241 VCGYNICIHAFGCWGYLDTSLALFKEMKQRSLVSVSXXXXXXXXXXXXXXXXXXXXXXXX 300

Query: 301 XXXXXXXXXSGHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
           XXXXXXXX SGHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 XXXXXXXXGSGHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420

Query: 421 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 480
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 421 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 480

Query: 481 XXXXGDLVPNVLKWKTNMENSIKYQQNKRKDYTSLFSPKEDLSEIISARASSVPKVNIDD 540
           XX  GDLVPNVLKWK NME+S+KYQ+NKRK+Y+SLFSPKEDLSEIIS+RASSV KVN+ D
Sbjct: 481 XXREGDLVPNVLKWKANMEDSVKYQKNKRKNYSSLFSPKEDLSEIISSRASSVAKVNVGD 540

Query: 541 TSENKEERDAESWSSSPHADLLANLAKSTGDFLQPFSLSQGRRIQAKGDNSFDINMVNTF 600
            SEN EE+D ++WSSSPH DLLANLAKSTGD LQPFSLS G+R++AKGDNSFDI+MVNTF
Sbjct: 541 ISENTEEKDDDNWSSSPHVDLLANLAKSTGDSLQPFSLSPGQRVEAKGDNSFDIDMVNTF 600

Query: 601 LSIFLAKGKLSLACKLFEVFSDMGVNPVRYTY 633
           LSIFLAKGKLSLACKLFE+FSDMGVNPVRYTY
Sbjct: 601 LSIFLAKGKLSLACKLFEIFSDMGVNPVRYTY 632

BLAST of Cla97C06G111710 vs. NCBI nr
Match: XP_022929794.1 (pentatricopeptide repeat-containing protein At4g01570 [Cucurbita moschata])

HSP 1 Score: 482.3 bits (1240), Expect = 3.3e-132
Identity = 551/632 (87.18%), Postives = 573/632 (90.66%), Query Frame = 0

Query: 1   MRHGRGGFHAMESRATSTLSQLADLFLVASITKTLSESGTRTLQHHSLPLSQPLLLQILH 60
           MRHG GGF AMESRAT TLS+LADL LVASITKTLSESGTRTLQH SL +S+PLLLQIL 
Sbjct: 1   MRHGIGGFVAMESRATPTLSRLADLLLVASITKTLSESGTRTLQHQSLSISEPLLLQILR 60

Query: 61  SRSVHPSHKLDFFKWCSLTPNFHHSSSTYSQIFHVLCRSGYLHEVPPLLSSMKRDGVAVD 120
           SRSVHPS+KLDFFKWCSL+PNF HS STYSQIF +LCRSGYLHEVP LLSSMKRDGV VD
Sbjct: 61  SRSVHPSNKLDFFKWCSLSPNFSHSPSTYSQIFRILCRSGYLHEVPLLLSSMKRDGVDVD 120

Query: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180
                  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX   
Sbjct: 121 SHTFKVLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIFF 180

Query: 181 XXFDASNNEGQESTAATSFPFLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFN 240
             FDA +  GQE +A  SF FLPN+LACNELLVALRKSDMRVEFKKVFDKLR IRSFEFN
Sbjct: 181 KLFDAFSTGGQEGSAVPSFSFLPNALACNELLVALRKSDMRVEFKKVFDKLRTIRSFEFN 240

Query: 241 ICGYNICIHAFGCWGYLDTSLALXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 300
           +CGYNICIHAFGCWGYLDTSLAL            XXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 241 VCGYNICIHAFGCWGYLDTSLALFKEMKQRSLVSVXXXXXXXXXXXXXXXXXXXXXXXXX 300

Query: 301 XXXXXXXXXSGHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
           XXXXXXXX SG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 XXXXXXXXGSGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420

Query: 421 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 480
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 421 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 480

Query: 481 XXXXGDLVPNVLKWKTNMENSIKYQQNKRKDYTSLFSPKEDLSEIISARASSVPKVNIDD 540
           XX  GDLVPNVLKWK NME+S+KYQ+NKRKDY+ LFSPKEDLSEIIS+RASSV KV  DD
Sbjct: 481 XXREGDLVPNVLKWKANMEDSVKYQKNKRKDYSPLFSPKEDLSEIISSRASSVAKV--DD 540

Query: 541 TSENKEERDAESWSSSPHADLLANLAKSTGDFLQPFSLSQGRRIQAKGDNSFDINMVNTF 600
            SEN EE+D ++WSSSPH DLLANLAKSTGD LQPFSLS G+R+QAKGDNSFDI+MVNTF
Sbjct: 541 ISENTEEKDDDNWSSSPHVDLLANLAKSTGDSLQPFSLSPGQRVQAKGDNSFDIDMVNTF 600

Query: 601 LSIFLAKGKLSLACKLFEVFSDMGVNPVRYTY 633
           LSIFLAKGKLSLACKLFE+FSDMGVNPVRYTY
Sbjct: 601 LSIFLAKGKLSLACKLFEIFSDMGVNPVRYTY 630

BLAST of Cla97C06G111710 vs. NCBI nr
Match: XP_022992119.1 (pentatricopeptide repeat-containing protein At4g01570 [Cucurbita maxima])

HSP 1 Score: 479.6 bits (1233), Expect = 2.1e-131
Identity = 541/632 (85.60%), Postives = 569/632 (90.03%), Query Frame = 0

Query: 1   MRHGRGGFHAMESRATSTLSQLADLFLVASITKTLSESGTRTLQHHSLPLSQPLLLQILH 60
           MRHG GGF AMESRAT TLS+LADL LVASITKTLSESGTRTLQH SL +S+PLLLQIL 
Sbjct: 1   MRHGIGGFVAMESRATPTLSRLADLLLVASITKTLSESGTRTLQHQSLSISEPLLLQILR 60

Query: 61  SRSVHPSHKLDFFKWCSLTPNFHHSSSTYSQIFHVLCRSGYLHEVPPLLSSMKRDGVAVD 120
           SRSVHPS+KLDFFKWCSL+PNF HS+STYSQIF +LCRSGY HEVP LLSSMKRDGV VD
Sbjct: 61  SRSVHPSNKLDFFKWCSLSPNFSHSASTYSQIFRILCRSGYFHEVPLLLSSMKRDGVDVD 120

Query: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180
             XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 
Sbjct: 121 SHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXF 180

Query: 181 XXFDASNNEGQESTAATSFPFLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFN 240
             FDA +  GQE +A  SF FLPN+LACNELLVALRKSDMRVEFK VFDKLR IRSFEFN
Sbjct: 181 KLFDAFSTGGQEGSAVPSFSFLPNALACNELLVALRKSDMRVEFKMVFDKLRTIRSFEFN 240

Query: 241 ICGYNICIHAFGCWGYLDTSLALXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 300
           +CGYNICIHAFGCWGYLDTSLAL                  XXXXXXXXXXXXXXX    
Sbjct: 241 VCGYNICIHAFGCWGYLDTSLALFKEMKQRSLVSVSFGPDLXXXXXXXXXXXXXXXVNDA 300

Query: 301 XXXXXXXXXSGHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
            XXXXXXX SGHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 LXXXXXXXGSGHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420

Query: 421 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 480
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 421 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 480

Query: 481 XXXXGDLVPNVLKWKTNMENSIKYQQNKRKDYTSLFSPKEDLSEIISARASSVPKVNIDD 540
           XX  GDLVPNVLKWK NME+S+KYQ+NKRKDY+ LFSPKEDLSEIIS+RA+SV KVN+DD
Sbjct: 481 XXREGDLVPNVLKWKANMEDSVKYQKNKRKDYSPLFSPKEDLSEIISSRAASVAKVNVDD 540

Query: 541 TSENKEERDAESWSSSPHADLLANLAKSTGDFLQPFSLSQGRRIQAKGDNSFDINMVNTF 600
            SEN EE+D ++WSSSPH DLLAN AKSTGD LQ FSLS G+R+Q+KG+NSFDI+MVNTF
Sbjct: 541 ISENTEEKDDDNWSSSPHVDLLANRAKSTGDSLQLFSLSPGQRVQSKGNNSFDIDMVNTF 600

Query: 601 LSIFLAKGKLSLACKLFEVFSDMGVNPVRYTY 633
           LSIFLAKGKLSLACKLF++FSDMGVNPVRYTY
Sbjct: 601 LSIFLAKGKLSLACKLFDIFSDMGVNPVRYTY 632

BLAST of Cla97C06G111710 vs. NCBI nr
Match: XP_008459805.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g01570 [Cucumis melo])

HSP 1 Score: 464.5 bits (1194), Expect = 7.1e-127
Identity = 513/636 (80.66%), Postives = 544/636 (85.53%), Query Frame = 0

Query: 1   MRHG--RGGFHAME--SRATSTLSQLADLFLVASITKTLSESGTRTLQHHSLPLSQPLLL 60
           MRHG  R  F  +E  SR  STLSQL+DL LVASITKTLSESGTRTLQHHSLP+S PLLL
Sbjct: 1   MRHGRTRACFLCIESHSRTVSTLSQLSDLLLVASITKTLSESGTRTLQHHSLPISHPLLL 60

Query: 61  QILHSRSVHPSHKLDFFKWCSLTPNFHHSSSTYSQIFHVLCRSGYLHEVPPLLSSMKRDG 120
           QILHSRS++PSHKLDFFKWCSL PNF+HS STYSQIFH+LCRSGYLHEVPPLL SMKRDG
Sbjct: 61  QILHSRSLNPSHKLDFFKWCSLAPNFNHSPSTYSQIFHILCRSGYLHEVPPLLDSMKRDG 120

Query: 121 VAVDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180
           V+VD  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 121 VSVDSHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180

Query: 181 XXXXXXFDASNNEGQESTAATSFPFLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRS 240
           XXX   FD  NN GQ+ +AATSF FLPNSLACNELLVALRK DMRVEF+KVFDKLRAI +
Sbjct: 181 XXXFKLFDGLNNGGQDDSAATSFHFLPNSLACNELLVALRKLDMRVEFRKVFDKLRAIEA 240

Query: 241 FEFNICGYNICIHAFGCWGYLDTSLALXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 300
           FEFN+CGYNICI+AFGCWGYLDT+L+L                                 
Sbjct: 241 FEFNVCGYNICIYAFGCWGYLDTALSLFKEMKEKSLVLGSFGPDLCTYNSIIRVLCLVGK 300

Query: 301 XXXXXXXXXXXXXSGHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
                        SG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 VKDALIVWEELKGSGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420

Query: 421 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 480
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 421 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 480

Query: 481 XXXXXXXXGDLVPNVLKWKTNMENSIKYQQNKRKDYTSLFSPKEDLSEIISARASSVPKV 540
           XXXXXX  GDLVPNVLKWK NME+SIKYQ+NKR+D++SLFSPKEDL E+IS+RASS  +V
Sbjct: 481 XXXXXXREGDLVPNVLKWKINMEDSIKYQKNKREDFSSLFSPKEDLIEVISSRASSAAEV 540

Query: 541 NIDDTSENKEERDAESWSSSPHADLLANLAKSTGDFLQPFSLSQGRRIQAKGDNSFDINM 600
           NID++ EN EE D + WSSSPH D LANLA ST D LQPFSL QGRRIQ KG+NSFDINM
Sbjct: 541 NIDNSVENTEEMDTDGWSSSPHVDGLANLANSTTDILQPFSLRQGRRIQEKGNNSFDINM 600

Query: 601 VNTFLSIFLAKGKLSLACKLFEVFSDMGVNPVRYTY 633
           VNTFLSIFLAKGKL+LACKLFE+FSDMGVNPV+YTY
Sbjct: 601 VNTFLSIFLAKGKLNLACKLFEIFSDMGVNPVKYTY 636

BLAST of Cla97C06G111710 vs. NCBI nr
Match: XP_004140525.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g01570 [Cucumis sativus] >KGN46481.1 hypothetical protein Csa_6G101450 [Cucumis sativus])

HSP 1 Score: 448.4 bits (1152), Expect = 5.3e-122
Identity = 500/636 (78.62%), Postives = 531/636 (83.49%), Query Frame = 0

Query: 1   MRHGRGG--FHAME--SRATSTLSQLADLFLVASITKTLSESGTRTLQHHSLPLSQPLLL 60
           MRHGR    F ++E  SR  STLS L+ L L+ASITKTLSESGTRTLQHHSLP+S PLLL
Sbjct: 1   MRHGRTRTCFLSIESHSRTASTLSHLSHLLLLASITKTLSESGTRTLQHHSLPISHPLLL 60

Query: 61  QILHSRSVHPSHKLDFFKWCSLTPNFHHSSSTYSQIFHVLCRSGYLHEVPPLLSSMKRDG 120
           QILHSRS++PSHKLDFFKWCSL PNF+HS STYSQIFH+LCRSGYLHEVPPLL SMKRDG
Sbjct: 61  QILHSRSLNPSHKLDFFKWCSLAPNFNHSPSTYSQIFHILCRSGYLHEVPPLLDSMKRDG 120

Query: 121 VAVDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180
           V+VD  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 121 VSVDSHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180

Query: 181 XXXXXXFDASNNEGQESTAATSFPFLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRS 240
           XXX    D  NN GQ  +AAT+F FLPNSLACNELLVALRK DMRVEFKKVFDKLRAI S
Sbjct: 181 XXXFKLLDGFNNGGQVDSAATTFHFLPNSLACNELLVALRKLDMRVEFKKVFDKLRAIES 240

Query: 241 FEFNICGYNICIHAFGCWGYLDTSLALXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 300
           FEF++ GYNICI+AFGCWGYLDT+L+L                                 
Sbjct: 241 FEFSVYGYNICIYAFGCWGYLDTALSLFKEMKEKSLVSESFSPDLCTYNSIIHVLCLVGK 300

Query: 301 XXXXXXXXXXXXXSGHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
                     XXX   XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 VKDALIVWEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420

Query: 421 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 480
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX     
Sbjct: 421 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXWDGLE 480

Query: 481 XXXXXXXXGDLVPNVLKWKTNMENSIKYQQNKRKDYTSLFSPKEDLSEIISARASSVPKV 540
                   GDLVPNVLKWK NME SIKYQ+NKRKD++SLFSPKEDLSE+IS+RASS  KV
Sbjct: 481 RLMKHIREGDLVPNVLKWKINMEYSIKYQKNKRKDFSSLFSPKEDLSEVISSRASSAAKV 540

Query: 541 NIDDTSENKEERDAESWSSSPHADLLANLAKSTGDFLQPFSLSQGRRIQAKGDNSFDINM 600
           NID++ EN EERD +SWSSSP+ + LANLA ST D LQPFS+ QGRRIQ K DNSFDINM
Sbjct: 541 NIDNSFENTEERDMDSWSSSPYVNRLANLANSTSDILQPFSIRQGRRIQEKQDNSFDINM 600

Query: 601 VNTFLSIFLAKGKLSLACKLFEVFSDMGVNPVRYTY 633
           VNTFLSIFLAKGKL+LACKLFE+FSDMGVNPV+YTY
Sbjct: 601 VNTFLSIFLAKGKLNLACKLFEIFSDMGVNPVKYTY 636

BLAST of Cla97C06G111710 vs. TrEMBL
Match: tr|A0A1S3CBH7|A0A1S3CBH7_CUCME (pentatricopeptide repeat-containing protein At4g01570 OS=Cucumis melo OX=3656 GN=LOC103498827 PE=4 SV=1)

HSP 1 Score: 464.5 bits (1194), Expect = 4.7e-127
Identity = 513/636 (80.66%), Postives = 544/636 (85.53%), Query Frame = 0

Query: 1   MRHG--RGGFHAME--SRATSTLSQLADLFLVASITKTLSESGTRTLQHHSLPLSQPLLL 60
           MRHG  R  F  +E  SR  STLSQL+DL LVASITKTLSESGTRTLQHHSLP+S PLLL
Sbjct: 1   MRHGRTRACFLCIESHSRTVSTLSQLSDLLLVASITKTLSESGTRTLQHHSLPISHPLLL 60

Query: 61  QILHSRSVHPSHKLDFFKWCSLTPNFHHSSSTYSQIFHVLCRSGYLHEVPPLLSSMKRDG 120
           QILHSRS++PSHKLDFFKWCSL PNF+HS STYSQIFH+LCRSGYLHEVPPLL SMKRDG
Sbjct: 61  QILHSRSLNPSHKLDFFKWCSLAPNFNHSPSTYSQIFHILCRSGYLHEVPPLLDSMKRDG 120

Query: 121 VAVDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180
           V+VD  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 121 VSVDSHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180

Query: 181 XXXXXXFDASNNEGQESTAATSFPFLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRS 240
           XXX   FD  NN GQ+ +AATSF FLPNSLACNELLVALRK DMRVEF+KVFDKLRAI +
Sbjct: 181 XXXFKLFDGLNNGGQDDSAATSFHFLPNSLACNELLVALRKLDMRVEFRKVFDKLRAIEA 240

Query: 241 FEFNICGYNICIHAFGCWGYLDTSLALXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 300
           FEFN+CGYNICI+AFGCWGYLDT+L+L                                 
Sbjct: 241 FEFNVCGYNICIYAFGCWGYLDTALSLFKEMKEKSLVLGSFGPDLCTYNSIIRVLCLVGK 300

Query: 301 XXXXXXXXXXXXXSGHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
                        SG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 VKDALIVWEELKGSGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420

Query: 421 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 480
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 421 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 480

Query: 481 XXXXXXXXGDLVPNVLKWKTNMENSIKYQQNKRKDYTSLFSPKEDLSEIISARASSVPKV 540
           XXXXXX  GDLVPNVLKWK NME+SIKYQ+NKR+D++SLFSPKEDL E+IS+RASS  +V
Sbjct: 481 XXXXXXREGDLVPNVLKWKINMEDSIKYQKNKREDFSSLFSPKEDLIEVISSRASSAAEV 540

Query: 541 NIDDTSENKEERDAESWSSSPHADLLANLAKSTGDFLQPFSLSQGRRIQAKGDNSFDINM 600
           NID++ EN EE D + WSSSPH D LANLA ST D LQPFSL QGRRIQ KG+NSFDINM
Sbjct: 541 NIDNSVENTEEMDTDGWSSSPHVDGLANLANSTTDILQPFSLRQGRRIQEKGNNSFDINM 600

Query: 601 VNTFLSIFLAKGKLSLACKLFEVFSDMGVNPVRYTY 633
           VNTFLSIFLAKGKL+LACKLFE+FSDMGVNPV+YTY
Sbjct: 601 VNTFLSIFLAKGKLNLACKLFEIFSDMGVNPVKYTY 636

BLAST of Cla97C06G111710 vs. TrEMBL
Match: tr|A0A0A0KFG9|A0A0A0KFG9_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G101450 PE=4 SV=1)

HSP 1 Score: 448.4 bits (1152), Expect = 3.5e-122
Identity = 500/636 (78.62%), Postives = 531/636 (83.49%), Query Frame = 0

Query: 1   MRHGRGG--FHAME--SRATSTLSQLADLFLVASITKTLSESGTRTLQHHSLPLSQPLLL 60
           MRHGR    F ++E  SR  STLS L+ L L+ASITKTLSESGTRTLQHHSLP+S PLLL
Sbjct: 1   MRHGRTRTCFLSIESHSRTASTLSHLSHLLLLASITKTLSESGTRTLQHHSLPISHPLLL 60

Query: 61  QILHSRSVHPSHKLDFFKWCSLTPNFHHSSSTYSQIFHVLCRSGYLHEVPPLLSSMKRDG 120
           QILHSRS++PSHKLDFFKWCSL PNF+HS STYSQIFH+LCRSGYLHEVPPLL SMKRDG
Sbjct: 61  QILHSRSLNPSHKLDFFKWCSLAPNFNHSPSTYSQIFHILCRSGYLHEVPPLLDSMKRDG 120

Query: 121 VAVDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180
           V+VD  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 121 VSVDSHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180

Query: 181 XXXXXXFDASNNEGQESTAATSFPFLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRS 240
           XXX    D  NN GQ  +AAT+F FLPNSLACNELLVALRK DMRVEFKKVFDKLRAI S
Sbjct: 181 XXXFKLLDGFNNGGQVDSAATTFHFLPNSLACNELLVALRKLDMRVEFKKVFDKLRAIES 240

Query: 241 FEFNICGYNICIHAFGCWGYLDTSLALXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 300
           FEF++ GYNICI+AFGCWGYLDT+L+L                                 
Sbjct: 241 FEFSVYGYNICIYAFGCWGYLDTALSLFKEMKEKSLVSESFSPDLCTYNSIIHVLCLVGK 300

Query: 301 XXXXXXXXXXXXXSGHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
                     XXX   XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 VKDALIVWEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420

Query: 421 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 480
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX     
Sbjct: 421 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXWDGLE 480

Query: 481 XXXXXXXXGDLVPNVLKWKTNMENSIKYQQNKRKDYTSLFSPKEDLSEIISARASSVPKV 540
                   GDLVPNVLKWK NME SIKYQ+NKRKD++SLFSPKEDLSE+IS+RASS  KV
Sbjct: 481 RLMKHIREGDLVPNVLKWKINMEYSIKYQKNKRKDFSSLFSPKEDLSEVISSRASSAAKV 540

Query: 541 NIDDTSENKEERDAESWSSSPHADLLANLAKSTGDFLQPFSLSQGRRIQAKGDNSFDINM 600
           NID++ EN EERD +SWSSSP+ + LANLA ST D LQPFS+ QGRRIQ K DNSFDINM
Sbjct: 541 NIDNSFENTEERDMDSWSSSPYVNRLANLANSTSDILQPFSIRQGRRIQEKQDNSFDINM 600

Query: 601 VNTFLSIFLAKGKLSLACKLFEVFSDMGVNPVRYTY 633
           VNTFLSIFLAKGKL+LACKLFE+FSDMGVNPV+YTY
Sbjct: 601 VNTFLSIFLAKGKLNLACKLFEIFSDMGVNPVKYTY 636

BLAST of Cla97C06G111710 vs. TrEMBL
Match: tr|A0A2P5BKZ8|A0A2P5BKZ8_PARAD (Tetratricopeptide-like helical domain containing protein OS=Parasponia andersonii OX=3476 GN=PanWU01x14_230180 PE=4 SV=1)

HSP 1 Score: 283.5 bits (724), Expect = 1.5e-72
Identity = 430/640 (67.19%), Postives = 473/640 (73.91%), Query Frame = 0

Query: 1   MRHGRGGFHAMESRATSTLSQLADLFLVASITKTLSESGTRTL-QHHSLPLSQPLLLQIL 60
           MRHGR        +   T SQL DL LVAS+TKTLS+SGTR L + HS+PLS+PLLLQIL
Sbjct: 1   MRHGR----TFLLKPRRTHSQLGDLLLVASLTKTLSDSGTRYLPEPHSIPLSEPLLLQIL 60

Query: 61  HSRSVHPSHKLDFFKWCSLTPNFHHSSSTYSQIFHVLCRSGYLHEVPPLLSSMKRDGVAV 120
            + ++HPS KLDFF+WCSL PNF HS+ +YS IF  LCR+G+LHEVP LL SMK+DGV V
Sbjct: 61  RTNALHPSKKLDFFRWCSLAPNFTHSARSYSLIFRTLCRAGHLHEVPDLLHSMKQDGVVV 120

Query: 121 DXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180
           D              XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 121 D---SETFKALLDAFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180

Query: 181 XXXFDASNNEGQESTAATSFPFLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEF 240
           XX F    +E Q          +P+S+ACNELLVALRK DM  EFK+VFDK+R  + FE 
Sbjct: 181 XXFFKLLEDETQ----------VPSSIACNELLVALRKMDMMGEFKRVFDKVREKKGFEM 240

Query: 241 NICGYNICIHAFGCWGYLDTSLALXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 300
           +I GYNICIH FGCWG L TSL L                                    
Sbjct: 241 DIWGYNICIHGFGCWGDLGTSLKL------FKEMKGLMGPDLCTYNSLIRVLCFVGKVKD 300

Query: 301 XXXXXXXXXXSGHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
                     SGH XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 ALVVWEELKVSGHEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420

Query: 421 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 480
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 
Sbjct: 421 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXM 480

Query: 481 XXXXXGDLVPNVLKWKTNMENSIKYQQNKRKDYTSLFSPKEDLSEII----SARASSVPK 540
                G+L+PNVL+WK +ME S+K  Q+KRKD+  LFS + + SEI+    SA A+   +
Sbjct: 481 KHIRDGNLLPNVLRWKMDMEASMKSPQSKRKDFKPLFSSEGEFSEIMNLIRSANATMEAE 540

Query: 541 VNIDDTSENKEE---RDAESWSSSPHADLLANLAKSTGDFLQPFSLSQGRRIQAKGDNSF 600
           V  D+T    EE    D + WSSS + D   N   STG F Q FSLS+GRR+QAKG  SF
Sbjct: 541 VVPDNTEVKDEESMRTDVDQWSSSSYMDQFTNQVLSTGHFSQLFSLSRGRRVQAKGAASF 600

Query: 601 DINMVNTFLSIFLAKGKLSLACKLFEVFSDMGVNPVRYTY 633
           DI+MVNTFLSIFLAKGKLSLACKLFE+F+DMGVNPV YTY
Sbjct: 601 DIDMVNTFLSIFLAKGKLSLACKLFEIFTDMGVNPVSYTY 617

BLAST of Cla97C06G111710 vs. TrEMBL
Match: tr|A0A2C9VJW7|A0A2C9VJW7_MANES (Uncharacterized protein OS=Manihot esculenta OX=3983 GN=MANES_07G094000 PE=4 SV=1)

HSP 1 Score: 270.0 bits (689), Expect = 1.7e-68
Identity = 371/622 (59.65%), Postives = 418/622 (67.20%), Query Frame = 0

Query: 15  ATSTLSQLADLFLVASITKTLSESGTRTLQHHSLPLSQPLLLQILHSRSVHPSHKLDFFK 74
           ++S+ S+L D+ LVA +TK LSESGTR L+  S+PLS+PL+LQIL   S+ PS K+DFFK
Sbjct: 39  SSSSPSKLEDILLVAFLTKNLSESGTRNLEPDSIPLSEPLVLQILRQSSLEPSRKIDFFK 98

Query: 75  WCSLTPNFHHSSSTYSQIFHVLCRSGYLHEVPPLLSSMKRDGVAVDXXXXXXXXXXXXXX 134
           WCSL  N+ HS+ TYS +F  LCR+G L E+P LL+ MK DGV V               
Sbjct: 99  WCSLRHNYKHSACTYSNMFRTLCRAGNLEEIPNLLNLMKDDGVVVSSDTFKFLLDAFIRS 158

Query: 135 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXFDASNNEGQEST 194
                                                            +ASN   +E +
Sbjct: 159 GKFDFALEIFDHMEELGTNLNPHMYDSVIVALARKNQIGLALSIFFKLLEASNGMNKEDS 218

Query: 195 AATSFPFLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFNICGYNICIHAFGCW 254
                  +P S+ACN LLVALRK+DMR EF+KVFDKLRA   FE +  G NICIHAFGCW
Sbjct: 219 VRNVGLSMPGSIACNALLVALRKADMRAEFRKVFDKLRATDEFELDTWGCNICIHAFGCW 278

Query: 255 GYLDTSLALXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSGHXX 314
           G L T+L L                                              SGHXX
Sbjct: 279 GDLATALMLFKEMKEKSLGSGPFGPDLCTYNSLIHVLCLFGKVNDALIVYEELKVSGHXX 338

Query: 315 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 374
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 339 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 398

Query: 375 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 434
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 399 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 458

Query: 435 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGDLVPNVLKW 494
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXG+LVPNVL W
Sbjct: 459 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGNLVPNVLNW 518

Query: 495 KTNMENSIKYQQNKRKDYTSLFSPKEDLSEIIS-ARASSVPKVNIDDTS---ENKEERDA 554
           + +ME S+K  Q +RKDYT +F     LSEI S  R   + K + D ++   EN    D 
Sbjct: 519 QADMEASLKNPQRRRKDYTPMFPSNGKLSEITSLLRYPDLEKCSSDGSAIEDENSSLNDT 578

Query: 555 ESWSSSPHADLLANLAKSTGDFLQPFSLSQGRRIQAKGDNSFDINMVNTFLSIFLAKGKL 614
           + WSSSP+ D LAN  +S     Q FSLS+G+R+Q KG +SFDI+MVNTFLSIFLAKGKL
Sbjct: 579 DQWSSSPYMDHLANQVQSNNLSSQLFSLSRGQRVQEKGIDSFDIDMVNTFLSIFLAKGKL 638

Query: 615 SLACKLFEVFSDMGVNPVRYTY 633
           SLACKLFE+FSD+GVNPV YTY
Sbjct: 639 SLACKLFEIFSDIGVNPVSYTY 660

BLAST of Cla97C06G111710 vs. TrEMBL
Match: tr|A0A061DRT6|A0A061DRT6_THECC (Pentatricopeptide repeat-containing protein, putative OS=Theobroma cacao OX=3641 GN=TCM_005001 PE=4 SV=1)

HSP 1 Score: 268.5 bits (685), Expect = 5.0e-68
Identity = 374/641 (58.35%), Postives = 422/641 (65.83%), Query Frame = 0

Query: 1   MRHGRGGFHAMESRATSTLS-QLADLFLVASITKTLSESGTRTLQHHSLPLSQPLLLQIL 60
           MR+GR G  ++ S    + S  L ++ L+AS+TKTLSESGTR L  +S+P+S+PL++QIL
Sbjct: 1   MRNGRTGLSSVSSPLLKSPSIHLGNILLIASLTKTLSESGTRNLDPNSIPISEPLVIQIL 60

Query: 61  HSRSVHPSHKLDFFKWC-SLTPNFHHSSSTYSQIFHVLCRSGYLHEVPPLLSSMKRDGVA 120
              S+ PS KLDFF WC S+ PNF HS+ TYS IF  LCRSG++ EVP LL +MK DGV 
Sbjct: 61  RKHSLEPSKKLDFFNWCRSVKPNFKHSAVTYSHIFRTLCRSGFVEEVPNLLFAMKEDGVL 120

Query: 121 VDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180
           VD                                                          
Sbjct: 121 VDSDTFKFLLDAFIRSGKFDSALEILDFMEELGAGLNLRVYDSVLVALIRKDQVGLALSL 180

Query: 181 XXXXFDASNNEGQESTAATSFPFLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFE 240
                +A N     ++  +S   LP S+A NELLVALRK+ MR EFK+VFD LR  R FE
Sbjct: 181 FFKLLEACNGNDDGNSVDSS---LPGSIAINELLVALRKAHMRREFKQVFDILREKREFE 240

Query: 241 FNICGYNICIHAFGCWGYLDTSLALXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 300
           F+ CGYNICIH+FGCWG L  SL L                                   
Sbjct: 241 FDTCGYNICIHSFGCWGDLGASLKLFKEMKEKEKSFGSFGPDLCTYNSLIDVLCLVGKVK 300

Query: 301 XXXXXXXXXXXSGHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
                         XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 DALVVWEELKVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420

Query: 421 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 480
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 421 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 480

Query: 481 XXXXXXGDLVPNVLKWKTNMENSIKYQQNKRKDYTSLFSPKEDLSEIIS-------ARAS 540
           XXXXX G+LVPNVLKWK NME S+K     RKDYT LF  K D  EI++       A  +
Sbjct: 481 XXXXXDGNLVPNVLKWKANMEASMKNPPKNRKDYTPLFPSKGDFREIMNLLGSVGQAMGT 540

Query: 541 SVPKVNIDDTSENKEERDAESWSSSPHADLLANLAKSTGDFLQPFSLSQGRRIQAKGDNS 600
           ++   + D+  + K   D + WSSSP+ D LAN  KST    Q FSL +G+R+Q KG  S
Sbjct: 541 NLDSEDCDEKDQEKPSIDTDQWSSSPYMDQLANQGKSTERSSQLFSLIRGQRVQEKGIGS 600

Query: 601 FDINMVNTFLSIFLAKGKLSLACKLFEVFSDMGVNPVRYTY 633
           FD++MVNTFLSIFLAKGKLSLACKLFEVF+DMGV+PV YTY
Sbjct: 601 FDVDMVNTFLSIFLAKGKLSLACKLFEVFTDMGVDPVSYTY 638

BLAST of Cla97C06G111710 vs. Swiss-Prot
Match: sp|Q8VZE4|PP299_ARATH (Pentatricopeptide repeat-containing protein At4g01570 OS=Arabidopsis thaliana OX=3702 GN=At4g01570 PE=2 SV=1)

HSP 1 Score: 211.5 bits (537), Expect = 3.6e-53
Identity = 344/641 (53.67%), Postives = 411/641 (64.12%), Query Frame = 0

Query: 1   MRHGRG-----GFHAMESRATSTLSQLADLFLVASITKTLSESGTRTLQHHSLPLSQPLL 60
           MRHGRG         +     S   QL ++ LVAS++KTLS+SGTR+L  +S+P+S+P++
Sbjct: 1   MRHGRGSAVSAAISGLSPAKNSPFPQLCNVLLVASLSKTLSQSGTRSLDANSIPISEPVV 60

Query: 61  LQILHSRSVHPSHKLDFFKWC-SLTPNFHHSSSTYSQIFHVLCRSGYLHEVPPLLSSMKR 120
           LQIL   S+ PS KLDFF+WC SL P + HS++ YSQIF  +CR+G L EVP LL SMK 
Sbjct: 61  LQILRRNSIDPSKKLDFFRWCYSLRPGYKHSATAYSQIFRTVCRTGLLGEVPDLLGSMKE 120

Query: 121 DGVAVDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180
           DGV +D                                                      
Sbjct: 121 DGVNLDQTMAKILLDSLIRSGKFESALGVLDYMEELGDCLNPSVYDSVLIALVKKHELRL 180

Query: 181 XXXXXXXXFDASNNEGQESTAATSF-PFLPNSLACNELLVALRKSDMRVEFKKVFDKLRA 240
                    +AS+N   + T       +LP ++A NELLV LR++DMR EFK+VF+KL+ 
Sbjct: 181 ALSILFKLLEASDNHSDDDTGRVIIVSYLPGTVAVNELLVGLRRADMRSEFKRVFEKLKG 240

Query: 241 IRSFEFNICGYNICIHAFGCWGYLDTSLAL-XXXXXXXXXXXXXXXXXXXXXXXXXXXXX 300
           ++ F+F+   YNICIH FGCWG LD +L+L                              
Sbjct: 241 MKRFKFDTWSYNICIHGFGCWGDLDAALSLFKEMKERSSVYGSSFGPDICTYNSLIHVLC 300

Query: 301 XXXXXXXXXXXXXXXXXSGHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
                               XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 LFGKAKDALIVWDELKVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420

Query: 421 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 480
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 421 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 480

Query: 481 XXXXXXXXXXXXGDLVPNVLKWKTNMENSIKYQQNKRKDYTSLFSPKEDLSEIISARASS 540
           XXXXXXXXXX  G+LVPNVL+W   +E S+K  Q+K KDYT +F  K    +I+S   S 
Sbjct: 481 XXXXXXXXXXREGNLVPNVLRWNAGVEASLKRPQSKDKDYTPMFPSKGSFLDIMSMVGSE 540

Query: 541 VPKVNIDDTSENKEERDAESWSSSPHADLLANLAKSTGDFLQPFSLSQGRRIQAKGDNSF 600
               + ++ S  ++    + WSSSP+ D LA+           F L++G+R++AK D SF
Sbjct: 541 DDGASAEEVSPMED----DPWSSSPYMDQLAHQRNQPKPL---FGLARGQRVEAKPD-SF 600

Query: 601 DINMVNTFLSIFLAKGKLSLACKLFEVFSDMGVNPV-RYTY 633
           D++M+NTFLSI+L+KG LSLACKLFE+F+ MGV  +  YTY
Sbjct: 601 DVDMMNTFLSIYLSKGDLSLACKLFEIFNGMGVTDLTSYTY 633

BLAST of Cla97C06G111710 vs. TAIR10
Match: AT4G01570.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 211.5 bits (537), Expect = 2.0e-54
Identity = 344/641 (53.67%), Postives = 411/641 (64.12%), Query Frame = 0

Query: 1   MRHGRG-----GFHAMESRATSTLSQLADLFLVASITKTLSESGTRTLQHHSLPLSQPLL 60
           MRHGRG         +     S   QL ++ LVAS++KTLS+SGTR+L  +S+P+S+P++
Sbjct: 1   MRHGRGSAVSAAISGLSPAKNSPFPQLCNVLLVASLSKTLSQSGTRSLDANSIPISEPVV 60

Query: 61  LQILHSRSVHPSHKLDFFKWC-SLTPNFHHSSSTYSQIFHVLCRSGYLHEVPPLLSSMKR 120
           LQIL   S+ PS KLDFF+WC SL P + HS++ YSQIF  +CR+G L EVP LL SMK 
Sbjct: 61  LQILRRNSIDPSKKLDFFRWCYSLRPGYKHSATAYSQIFRTVCRTGLLGEVPDLLGSMKE 120

Query: 121 DGVAVDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180
           DGV +D                                                      
Sbjct: 121 DGVNLDQTMAKILLDSLIRSGKFESALGVLDYMEELGDCLNPSVYDSVLIALVKKHELRL 180

Query: 181 XXXXXXXXFDASNNEGQESTAATSF-PFLPNSLACNELLVALRKSDMRVEFKKVFDKLRA 240
                    +AS+N   + T       +LP ++A NELLV LR++DMR EFK+VF+KL+ 
Sbjct: 181 ALSILFKLLEASDNHSDDDTGRVIIVSYLPGTVAVNELLVGLRRADMRSEFKRVFEKLKG 240

Query: 241 IRSFEFNICGYNICIHAFGCWGYLDTSLAL-XXXXXXXXXXXXXXXXXXXXXXXXXXXXX 300
           ++ F+F+   YNICIH FGCWG LD +L+L                              
Sbjct: 241 MKRFKFDTWSYNICIHGFGCWGDLDAALSLFKEMKERSSVYGSSFGPDICTYNSLIHVLC 300

Query: 301 XXXXXXXXXXXXXXXXXSGHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
                               XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 LFGKAKDALIVWDELKVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420

Query: 421 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 480
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 421 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 480

Query: 481 XXXXXXXXXXXXGDLVPNVLKWKTNMENSIKYQQNKRKDYTSLFSPKEDLSEIISARASS 540
           XXXXXXXXXX  G+LVPNVL+W   +E S+K  Q+K KDYT +F  K    +I+S   S 
Sbjct: 481 XXXXXXXXXXREGNLVPNVLRWNAGVEASLKRPQSKDKDYTPMFPSKGSFLDIMSMVGSE 540

Query: 541 VPKVNIDDTSENKEERDAESWSSSPHADLLANLAKSTGDFLQPFSLSQGRRIQAKGDNSF 600
               + ++ S  ++    + WSSSP+ D LA+           F L++G+R++AK D SF
Sbjct: 541 DDGASAEEVSPMED----DPWSSSPYMDQLAHQRNQPKPL---FGLARGQRVEAKPD-SF 600

Query: 601 DINMVNTFLSIFLAKGKLSLACKLFEVFSDMGVNPV-RYTY 633
           D++M+NTFLSI+L+KG LSLACKLFE+F+ MGV  +  YTY
Sbjct: 601 DVDMMNTFLSIYLSKGDLSLACKLFEIFNGMGVTDLTSYTY 633

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_023549441.13.0e-13387.82pentatricopeptide repeat-containing protein At4g01570 [Cucurbita pepo subsp. pep... [more]
XP_022929794.13.3e-13287.18pentatricopeptide repeat-containing protein At4g01570 [Cucurbita moschata][more]
XP_022992119.12.1e-13185.60pentatricopeptide repeat-containing protein At4g01570 [Cucurbita maxima][more]
XP_008459805.17.1e-12780.66PREDICTED: pentatricopeptide repeat-containing protein At4g01570 [Cucumis melo][more]
XP_004140525.15.3e-12278.62PREDICTED: pentatricopeptide repeat-containing protein At4g01570 [Cucumis sativu... [more]
Match NameE-valueIdentityDescription
tr|A0A1S3CBH7|A0A1S3CBH7_CUCME4.7e-12780.66pentatricopeptide repeat-containing protein At4g01570 OS=Cucumis melo OX=3656 GN... [more]
tr|A0A0A0KFG9|A0A0A0KFG9_CUCSA3.5e-12278.62Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G101450 PE=4 SV=1[more]
tr|A0A2P5BKZ8|A0A2P5BKZ8_PARAD1.5e-7267.19Tetratricopeptide-like helical domain containing protein OS=Parasponia andersoni... [more]
tr|A0A2C9VJW7|A0A2C9VJW7_MANES1.7e-6859.65Uncharacterized protein OS=Manihot esculenta OX=3983 GN=MANES_07G094000 PE=4 SV=... [more]
tr|A0A061DRT6|A0A061DRT6_THECC5.0e-6858.35Pentatricopeptide repeat-containing protein, putative OS=Theobroma cacao OX=3641... [more]
Match NameE-valueIdentityDescription
sp|Q8VZE4|PP299_ARATH3.6e-5353.67Pentatricopeptide repeat-containing protein At4g01570 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
AT4G01570.12.0e-5453.67Tetratricopeptide repeat (TPR)-like superfamily protein[more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR011990TPR-like_helical_dom_sf
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0009451 RNA modification
cellular_component GO:0005575 cellular_component
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003723 RNA binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C06G111710.1Cla97C06G111710.1mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 349..397
e-value: 2.0E-15
score: 56.6
coord: 698..741
e-value: 2.6E-14
score: 53.0
coord: 279..328
e-value: 2.4E-14
score: 53.1
coord: 630..676
e-value: 3.6E-13
score: 49.4
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 244..270
e-value: 6.4E-4
score: 19.7
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 416..448
e-value: 1.1E-7
score: 31.4
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 666..698
e-value: 4.7E-5
score: 21.3
coord: 630..663
e-value: 4.6E-7
score: 27.6
coord: 317..351
e-value: 5.7E-6
score: 24.2
coord: 244..272
e-value: 8.1E-4
score: 17.4
coord: 352..383
e-value: 4.2E-8
score: 30.9
coord: 735..768
e-value: 2.2E-9
score: 34.9
coord: 422..455
e-value: 5.1E-7
score: 27.5
coord: 122..151
e-value: 8.1E-4
score: 17.4
coord: 282..316
e-value: 3.0E-6
score: 25.0
coord: 388..416
e-value: 1.5E-5
score: 22.9
coord: 700..734
e-value: 9.7E-11
score: 39.2
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 108..164
e-value: 1.3E-7
score: 31.5
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 155..185
score: 6.237
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 593..627
score: 8.068
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 120..154
score: 10.786
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 350..384
score: 12.112
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 240..274
score: 8.111
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 385..419
score: 10.019
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 628..662
score: 9.821
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 280..314
score: 11.707
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 315..349
score: 12.09
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 733..767
score: 12.967
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 455..489
score: 8.714
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 85..119
score: 8.89
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 420..454
score: 11.63
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 663..697
score: 10.084
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 698..732
score: 13.899
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 184..344
e-value: 8.5E-27
score: 95.9
coord: 345..417
e-value: 3.0E-17
score: 64.7
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 696..778
e-value: 3.9E-27
score: 96.7
coord: 579..695
e-value: 3.2E-20
score: 74.2
coord: 420..521
e-value: 7.5E-14
score: 53.4
coord: 30..177
e-value: 5.7E-20
score: 73.4
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 536..556
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 539..554
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 11..782
NoneNo IPR availablePANTHERPTHR24015:SF479SUBFAMILY NOT NAMEDcoord: 11..782