Cla97C08G151610 (gene) Watermelon (97103) v2

Name: Cla97C08G151610
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: Pentatricopeptide repeat
Location: Cla97Chr08 : 19936454 .. 19938962 (-)
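
The location field packs the chromosome, coordinates, and strand into one string. A minimal sketch of splitting it apart and deriving the span length follows; the helper name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, as is conventional for GFF-style annotations.

```python
import re

def parse_location(text):
    """Parse 'SeqID : start .. end (strand)' into its parts (illustrative helper)."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("Cla97Chr08 : 19936454 .. 19938962 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the feature spans 2509 bases on the minus strand of Cla97Chr08.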
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCCTACTTCAACCTTTTGTTACAGAAATGTTCTTCCTTCTCGCAAATCAAGCAACTCCAAGCGAATCTCATCATCAATGGCCGTTTCCAATTCTCTTCCTCTCGCACTATGCTTCTCGAGCTCTGCGCCATCTCCTCCTTCGGCGACCTTTCTTATGCCCTACATATTTTCCGCCATATCCGGTACCCTTCGACCAACGATTGGAACGCCGTCATTCGCGGCACTGCCCTGAGCTCCGATCCCGAAAATGCCGTTTTCTGGTACAGGGCAATGGCTGCGTCAAATGGGCCTCACAGAATTGACGCTCTCACATGCTCTTTTGCCCTCAAAGCTTGTGCCCGTGCCTTGGCTCGTTCTGAAGCGATGCAATTACATTCGCAGCTTTTACGATTTGGGTTCAATGCTGATGTTCTTCTGCAGACTACATTGCTTGATGCGTACGCAAAAGTTGGGGATCTCGATCATGCCCAGAACCTGTTCGATGAAATGCCGCAACCAGATATCGCCTCGTGGAATGCGTTGATTGCTGGGTTTGCTCAGGGGAGTCGACCAGGTGATGCTATAATGATGTTTAAGAAAATGAAGGAGGATGGAAATTTGAGACCCAATGGAGTAACCGTTCAAGGTGCTCTGTTGGCGTGTTCACAATTGGGTGCTTTGAAAGAAGGGGAAAATGTTCATAAACATATTCTAGAGGAGAAGTTAGATATGAATGTGCAGGTTTGTAATGTCGTCATTGATATGTATGCTAAATGTGGATCTGTGGATAAAGCTTATTGGGTGTTTGAGAACATGAGGTGTGATAAAAGTTTGATCACATGGAATACAATGATAATGGCGTTTGCTATGCATGGTGATGGATACAAAGCGTTGGATCTTTTTGAAAAGTTGGGTCGGTGTGGAATGCCTCCTGATGCAGTATCATATCTAGCTGTGCTTTGCGCCTGCAACCATGCAGGACTTGTCGAGGATGGGCTTAAGCTGTTCAATTCAATGGCGCAGAGGGGGTTGGCGCCAAATATAAAGCATTATGGAGCCCTGGTTGATTTGTTAGGTCGAGCAGGGCGTCTCAAAGAAGCTTATGACGTTGTAAATTCAATGCCTTTCCCTAATATGGTACTCTGGCAAACACTGCTTGGTGCTTGCAGGACATATGGGAATGTAGAAATGGCAGAACTGGCATCAAGGAAGCTAGTAGAGATGGGATTTATTAGCTGTGGTGATTTTGTTTTGTTATCGAATGTATATGCCGCTCGTCAGAGATGGGATGATGTTGGGAGAGTTAGGGACGCCATGAGAAGAAGGGATGTGAAGAAGACACCAGGATTTAGTTACATAGAAGTAAAAGGTATGATGCACAAATTTGTTTATGGTGATCAAAGCCACTCAAGTTGCCGTGAGATTTATGCTAAGCTTGATGAGATCAAGTTCAGGATCAAAGCCTATGGATATGCAGCTGAAACTGGCAATGTATTGCATGATATTGGAGATGAGGACAAGGAGAATGCACTATGTTATCACAGTGAGAAGCTCGCTGTTGCTTTTGGATTGACTTGTACTGAAGAAGGGACCCTAATTCAAGTGATTAAGAATTTAAGGATTTGTGGGGATTGTCATGTTGTTATCAAACTAATATCTAAGATTTATAATCGAGAAATCGTTGTAAGAGACAGAACTCGATTTCATCGATTTAAAGAAGGTTTGTGCTCTTGCAAAGATTATTGGTGATATTTATTGGCTTAATACTTCTAAAATCGAGATGAAGCCCTCTTGGAAAATTTCCAATGTCTCTGGTGTAGGATTGTATTCTTTCATTGAGGCAAGATCTAAGGGGCAGCTAAAGTTTCAACTTATCAAGTGTTCAGCTTAGAGGTTTGAACTCAGAACCTTGGATGTGTCGCCAAACATCCCAAACTCTTACGCCAAGAACCTTGGATGTGAGATCTTTTCTGAAATTCCTTCTATGATGCATCTCGCCTCTCGCCTCTTGGTTATAT
GTTCATTAGACTAGCTAAATGTAACACGACCCCTGCAAGATCATTTAGGCATGAATTGAATGCACTTCAAAATGGTAGTAGGCACGTCTTATTCTATCTTTAATGTTTTCCGAAGCTTCAAAATTCTAGTTTGAAAAGTATATAGGTTAACCTTGCCCTTGCAGGACTGTTATTGTTGTTGCTTTACCACTTTGATGATGATAGCCAAAGTGAAAATTTTGAAGAGGTGGGCAGCATTTCCAGAAATGTAACTCCAATGCCAAAGGTAATCATGGTAACATTACTTAAGATTCATCAATCCTTCCTAAATGAAAAAAACACAAATCTATATTTCTGTTGGATTCTCTGTTTACTTTCTTGTCTGAATTGGTTTGAATGGATCACTGTAAAAATGCTTTGTCATTTGCTGGAGCTTATAGGAGGGAGCTGTGACGTCGTGAAACCTACTTAATGTAAATATTTTGACTTTGCAGGCTGGGCGAGTCAAGACCGAGGAAGAAGGCAGCTGA

mRNA sequence

ATGGCCTACTTCAACCTTTTGTTACAGAAATGTTCTTCCTTCTCGCAAATCAAGCAACTCCAAGCGAATCTCATCATCAATGGCCGTTTCCAATTCTCTTCCTCTCGCACTATGCTTCTCGAGCTCTGCGCCATCTCCTCCTTCGGCGACCTTTCTTATGCCCTACATATTTTCCGCCATATCCGGTACCCTTCGACCAACGATTGGAACGCCGTCATTCGCGGCACTGCCCTGAGCTCCGATCCCGAAAATGCCGTTTTCTGGTACAGGGCAATGGCTGCGTCAAATGGGCCTCACAGAATTGACGCTCTCACATGCTCTTTTGCCCTCAAAGCTTGTGCCCGTGCCTTGGCTCGTTCTGAAGCGATGCAATTACATTCGCAGCTTTTACGATTTGGGTTCAATGCTGATGTTCTTCTGCAGACTACATTGCTTGATGCGTACGCAAAAGTTGGGGATCTCGATCATGCCCAGAACCTGTTCGATGAAATGCCGCAACCAGATATCGCCTCGTGGAATGCGTTGATTGCTGGGTTTGCTCAGGGGAGTCGACCAGGTGATGCTATAATGATGTTTAAGAAAATGAAGGAGGATGGAAATTTGAGACCCAATGGAGTAACCGTTCAAGGTGCTCTGTTGGCGTGTTCACAATTGGGTGCTTTGAAAGAAGGGGAAAATGTTCATAAACATATTCTAGAGGAGAAGTTAGATATGAATGTGCAGGTTTGTAATGTCGTCATTGATATGTATGCTAAATGTGGATCTGTGGATAAAGCTTATTGGGTGTTTGAGAACATGAGGTGTGATAAAAGTTTGATCACATGGAATACAATGATAATGGCGTTTGCTATGCATGGTGATGGATACAAAGCGTTGGATCTTTTTGAAAAGTTGGGTCGGTGTGGAATGCCTCCTGATGCAGTATCATATCTAGCTGTGCTTTGCGCCTGCAACCATGCAGGACTTGTCGAGGATGGGCTTAAGCTGTTCAATTCAATGGCGCAGAGGGGGTTGGCGCCAAATATAAAGCATTATGGAGCCCTGGTTGATTTGTTAGGTCGAGCAGGGCGTCTCAAAGAAGCTTATGACGTTGTAAATTCAATGCCTTTCCCTAATATGGTACTCTGGCAAACACTGCTTGGTGCTTGCAGGACATATGGGAATGTAGAAATGGCAGAACTGGCATCAAGGAAGCTAGTAGAGATGGGATTTATTAGCTGTGGTGATTTTGTTTTGTTATCGAATGTATATGCCGCTCGTCAGAGATGGGATGATGTTGGGAGAGTTAGGGACGCCATGAGAAGAAGGGATGTGAAGAAGACACCAGGATTTAGTTACATAGAAGTAAAAGGTATGATGCACAAATTTGTTTATGGTGATCAAAGCCACTCAAGTTGCCGTGAGATTTATGCTAAGCTTGATGAGATCAAGTTCAGGATCAAAGCCTATGGATATGCAGCTGAAACTGGCAATGTATTGCATGATATTGGAGATGAGGACAAGGAGAATGCACTATGTTATCACAGTGAGAAGCTCGCTGTTGCTTTTGGATTGACTTGTACTGAAGAAGGGACCCTAATTCAAGTGATTAAGAATTTAAGGATTTGTGGGGATTGTCATGTTGTTATCAAACTAATATCTAAGATTTATAATCGAGAAATCGTTGTAAGAGACAGAACTCGATTTCATCGATTTAAAGAAGGACTGTTATTGTTGTTGCTTTACCACTTTGATGATGATAGCCAAAGTGAAAATTTTGAAGAGGTGGGCAGCATTTCCAGAAATGTAACTCCAATGCCAAAGGCTGGGCGAGTCAAGACCGAGGAAGAAGGCAGCTGA

Coding sequence (CDS)

ATGGCCTACTTCAACCTTTTGTTACAGAAATGTTCTTCCTTCTCGCAAATCAAGCAACTCCAAGCGAATCTCATCATCAATGGCCGTTTCCAATTCTCTTCCTCTCGCACTATGCTTCTCGAGCTCTGCGCCATCTCCTCCTTCGGCGACCTTTCTTATGCCCTACATATTTTCCGCCATATCCGGTACCCTTCGACCAACGATTGGAACGCCGTCATTCGCGGCACTGCCCTGAGCTCCGATCCCGAAAATGCCGTTTTCTGGTACAGGGCAATGGCTGCGTCAAATGGGCCTCACAGAATTGACGCTCTCACATGCTCTTTTGCCCTCAAAGCTTGTGCCCGTGCCTTGGCTCGTTCTGAAGCGATGCAATTACATTCGCAGCTTTTACGATTTGGGTTCAATGCTGATGTTCTTCTGCAGACTACATTGCTTGATGCGTACGCAAAAGTTGGGGATCTCGATCATGCCCAGAACCTGTTCGATGAAATGCCGCAACCAGATATCGCCTCGTGGAATGCGTTGATTGCTGGGTTTGCTCAGGGGAGTCGACCAGGTGATGCTATAATGATGTTTAAGAAAATGAAGGAGGATGGAAATTTGAGACCCAATGGAGTAACCGTTCAAGGTGCTCTGTTGGCGTGTTCACAATTGGGTGCTTTGAAAGAAGGGGAAAATGTTCATAAACATATTCTAGAGGAGAAGTTAGATATGAATGTGCAGGTTTGTAATGTCGTCATTGATATGTATGCTAAATGTGGATCTGTGGATAAAGCTTATTGGGTGTTTGAGAACATGAGGTGTGATAAAAGTTTGATCACATGGAATACAATGATAATGGCGTTTGCTATGCATGGTGATGGATACAAAGCGTTGGATCTTTTTGAAAAGTTGGGTCGGTGTGGAATGCCTCCTGATGCAGTATCATATCTAGCTGTGCTTTGCGCCTGCAACCATGCAGGACTTGTCGAGGATGGGCTTAAGCTGTTCAATTCAATGGCGCAGAGGGGGTTGGCGCCAAATATAAAGCATTATGGAGCCCTGGTTGATTTGTTAGGTCGAGCAGGGCGTCTCAAAGAAGCTTATGACGTTGTAAATTCAATGCCTTTCCCTAATATGGTACTCTGGCAAACACTGCTTGGTGCTTGCAGGACATATGGGAATGTAGAAATGGCAGAACTGGCATCAAGGAAGCTAGTAGAGATGGGATTTATTAGCTGTGGTGATTTTGTTTTGTTATCGAATGTATATGCCGCTCGTCAGAGATGGGATGATGTTGGGAGAGTTAGGGACGCCATGAGAAGAAGGGATGTGAAGAAGACACCAGGATTTAGTTACATAGAAGTAAAAGGTATGATGCACAAATTTGTTTATGGTGATCAAAGCCACTCAAGTTGCCGTGAGATTTATGCTAAGCTTGATGAGATCAAGTTCAGGATCAAAGCCTATGGATATGCAGCTGAAACTGGCAATGTATTGCATGATATTGGAGATGAGGACAAGGAGAATGCACTATGTTATCACAGTGAGAAGCTCGCTGTTGCTTTTGGATTGACTTGTACTGAAGAAGGGACCCTAATTCAAGTGATTAAGAATTTAAGGATTTGTGGGGATTGTCATGTTGTTATCAAACTAATATCTAAGATTTATAATCGAGAAATCGTTGTAAGAGACAGAACTCGATTTCATCGATTTAAAGAAGGACTGTTATTGTTGTTGCTTTACCACTTTGATGATGATAGCCAAAGTGAAAATTTTGAAGAGGTGGGCAGCATTTCCAGAAATGTAACTCCAATGCCAAAGGCTGGGCGAGTCAAGACCGAGGAAGAAGGCAGCTGA

Protein sequence

MAYFNLLLQKCSSFSQIKQLQANLIINGRFQFSSSRTMLLELCAISSFGDLSYALHIFRHIRYPSTNDWNAVIRGTALSSDPENAVFWYRAMAASNGPHRIDALTCSFALKACARALARSEAMQLHSQLLRFGFNADVLLQTTLLDAYAKVGDLDHAQNLFDEMPQPDIASWNALIAGFAQGSRPGDAIMMFKKMKEDGNLRPNGVTVQGALLACSQLGALKEGENVHKHILEEKLDMNVQVCNVVIDMYAKCGSVDKAYWVFENMRCDKSLITWNTMIMAFAMHGDGYKALDLFEKLGRCGMPPDAVSYLAVLCACNHAGLVEDGLKLFNSMAQRGLAPNIKHYGALVDLLGRAGRLKEAYDVVNSMPFPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFISCGDFVLLSNVYAARQRWDDVGRVRDAMRRRDVKKTPGFSYIEVKGMMHKFVYGDQSHSSCREIYAKLDEIKFRIKAYGYAAETGNVLHDIGDEDKENALCYHSEKLAVAFGLTCTEEGTLIQVIKNLRICGDCHVVIKLISKIYNREIVVRDRTRFHRFKEGLLLLLLYHFDDDSQSENFEEVGSISRNVTPMPKAGRVKTEEEGS
BLAST of Cla97C08G151610 vs. NCBI nr
Match: XP_004140941.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g34160 [Cucumis sativus] >KGN46094.1 hypothetical protein Csa_6G052690 [Cucumis sativus])

HSP 1 Score: 976.5 bits (2523), Expect = 4.2e-281
Identity = 488/569 (85.76%), Positives = 504/569 (88.58%), Query Frame = 0

Query: 1   MAYFNLLLQKCSSFSQIKQLQANLIINGRFQFSSSRTMLLELCAISSFGDLSYALHIFRH 60
           MAYFNLLLQKCSSFSQIKQLQANLIING F FSSSRT LLELCAISSFGDLSYALHIFR+
Sbjct: 1   MAYFNLLLQKCSSFSQIKQLQANLIINGDFHFSSSRTKLLELCAISSFGDLSYALHIFRY 60

Query: 61  IRYPSTNDWNAVIRGTALSSDPENAVFWYRAMAASNGPHRIDALTCSFALKACARALARS 120
           I YPSTNDWNAVIRGTALSSDP NAVFWYRAMAASNG HRIDALTCSFALKACARALARS
Sbjct: 61  IPYPSTNDWNAVIRGTALSSDPANAVFWYRAMAASNGLHRIDALTCSFALKACARALARS 120

Query: 121 EAMQLHSQLLRFGFNADVLLQTTLLDAYAKVGDLDHAQNLFDEMPQPDIASWNALIAGFA 180
           EA+QLHSQLLRFGFNADVLLQTTLLDAYAK+GDLD AQ LFDEMPQPDIASWNALIAGFA
Sbjct: 121 EAIQLHSQLLRFGFNADVLLQTTLLDAYAKIGDLDLAQKLFDEMPQPDIASWNALIAGFA 180

Query: 181 QGSRPGDAIMMFKKMKEDGNLRPNGVTVQGALLACSQLGALKEGENVHKHILEEKLDMNV 240
           QGSRP DAIM FK+MK DGNLRPN VTVQGALLACSQLGALKEGE+VHK+I+EEKL+ NV
Sbjct: 181 QGSRPADAIMTFKRMKVDGNLRPNAVTVQGALLACSQLGALKEGESVHKYIVEEKLNSNV 240

Query: 241 QVCNVVIDMYAKCGSVDKAYWVFENMRCDKSLITWNTMIMAFAMHGDGYKALDLFEKLGR 300
           QVCNVVIDMYAKCGS+DKAYWVFENMRCDKSLITWNTMIMAFAMHGDG+KALDLFEKLGR
Sbjct: 241 QVCNVVIDMYAKCGSMDKAYWVFENMRCDKSLITWNTMIMAFAMHGDGHKALDLFEKLGR 300

Query: 301 CGMPPDAVSYLAVLCACNHAGLVEDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
            GM PDAVSYLAVLCACNHAGLVED                                   
Sbjct: 301 SGMSPDAVSYLAVLCACNHAGLVEDGLKLFNSMTQRGLEPNIKHYGSMVDLLGRAGRLKE 360

Query: 361 XXXXXXXXPFPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFISCGDFVLLSNVYAAR 420
                   PFPNMVLWQTLLGACRTYG+VEMAELASRKLVEMGFISCGDFVLLSNVYAAR
Sbjct: 361 AYDIVSSLPFPNMVLWQTLLGACRTYGDVEMAELASRKLVEMGFISCGDFVLLSNVYAAR 420

Query: 421 QRWDDVGRVRDAMRRRDVKKTPGFSYIEVKGMMHKFVYGDQSHSSCREIYAKLDEIKFRI 480
           QRWDDVGRVRDAMRRRDVKKTPGFSYIE+KG M+KFV GDQSHSSCREIYAKLDEI  RI
Sbjct: 421 QRWDDVGRVRDAMRRRDVKKTPGFSYIEIKGKMYKFVNGDQSHSSCREIYAKLDEINLRI 480

Query: 481 KAYGYAAETGNVLHDIGDEDKENALCYHSEKLAVAFGLTCTEEGTLIQVIKNLRICGDCH 540
           KAYGY+A+T NVLHDIGDEDKENALCYHSEKLAVAFGLTCTEEGT IQVIKNLRICGDCH
Sbjct: 481 KAYGYSADTSNVLHDIGDEDKENALCYHSEKLAVAFGLTCTEEGTPIQVIKNLRICGDCH 540

Query: 541 VVIKLISKIYNREIVVRDRTRFHRFKEGL 570
           VVIKLISKIY REI+VRDRTRFHRFKEGL
Sbjct: 541 VVIKLISKIYIREIIVRDRTRFHRFKEGL 569
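
Each HSP header reports identity and positives as a count over the alignment length plus a percentage. A minimal sketch of pulling those counts out of a header line and recomputing the percentages is shown here; `parse_hsp_stats` is a hypothetical helper, and the regular expression only assumes the `n/m (p%)` layout used in this record.

```python
import re

def parse_hsp_stats(line):
    """Return (matches, aligned_length, recomputed_percent) for each 'n/m (p%)' field."""
    stats = []
    for n, m in re.findall(r"(\d+)/(\d+) \(\d+(?:\.\d+)?%\)", line):
        n, m = int(n), int(m)
        stats.append((n, m, round(100 * n / m, 2)))
    return stats

line = "Identity = 488/569 (85.76%), Positives = 504/569 (88.58%), Query Frame = 0"
identity, positives = parse_hsp_stats(line)
print(identity)   # (488, 569, 85.76)
print(positives)  # (504, 569, 88.58)
```

For this hit the recomputed values match the reported ones. Positives count identical residues plus conservative substitutions, so that figure is never lower than the identity figure.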

BLAST of Cla97C08G151610 vs. NCBI nr
Match: XP_008456696.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g34160 [Cucumis melo])

HSP 1 Score: 970.7 bits (2508), Expect = 2.3e-279
Identity = 482/569 (84.71%), Positives = 501/569 (88.05%), Query Frame = 0

Query: 1   MAYFNLLLQKCSSFSQIKQLQANLIINGRFQFSSSRTMLLELCAISSFGDLSYALHIFRH 60
           MAYFNLLLQKCSSFS IKQLQANLIING F FSSSRT LLELCA+SS GDLSYALHIFR+
Sbjct: 1   MAYFNLLLQKCSSFSHIKQLQANLIINGDFHFSSSRTKLLELCAVSSCGDLSYALHIFRY 60

Query: 61  IRYPSTNDWNAVIRGTALSSDPENAVFWYRAMAASNGPHRIDALTCSFALKACARALARS 120
           IRYPSTNDWNA+IRGTALSSDP NAV WYRAMAASNGPHRIDALTCSFALKACARALA S
Sbjct: 61  IRYPSTNDWNAIIRGTALSSDPANAVVWYRAMAASNGPHRIDALTCSFALKACARALACS 120

Query: 121 EAMQLHSQLLRFGFNADVLLQTTLLDAYAKVGDLDHAQNLFDEMPQPDIASWNALIAGFA 180
           EA+QLHSQLLRFGFNADVLLQTTLLD YAKVGDLD AQ LFDEMP+PDIASWNALI+GFA
Sbjct: 121 EAIQLHSQLLRFGFNADVLLQTTLLDVYAKVGDLDLAQKLFDEMPRPDIASWNALISGFA 180

Query: 181 QGSRPGDAIMMFKKMKEDGNLRPNGVTVQGALLACSQLGALKEGENVHKHILEEKLDMNV 240
           QGSRP DAIMMFK+MKE GNLRPN VTVQGALLACSQLG LKEGENVHK+I+EEKLDMNV
Sbjct: 181 QGSRPADAIMMFKRMKEGGNLRPNAVTVQGALLACSQLGTLKEGENVHKYIVEEKLDMNV 240

Query: 241 QVCNVVIDMYAKCGSVDKAYWVFENMRCDKSLITWNTMIMAFAMHGDGYKALDLFEKLGR 300
           QVCNVVIDMYAKCGS+DKAYWVFENMRCDKSLITWNTMIMAFAMHGDGY+ALDLF+KLGR
Sbjct: 241 QVCNVVIDMYAKCGSMDKAYWVFENMRCDKSLITWNTMIMAFAMHGDGYEALDLFKKLGR 300

Query: 301 CGMPPDAVSYLAVLCACNHAGLVEDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
            GM PDAVSYLAVLCACNHAGLVED                                   
Sbjct: 301 SGMSPDAVSYLAVLCACNHAGLVEDGLKLFNLMAQRGLEPNIKHYGSMVDLLGRAGRLKE 360

Query: 361 XXXXXXXXPFPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFISCGDFVLLSNVYAAR 420
                   PFPNMVLWQTLLGACRTYG+VEMAELAS KLVEMGFISCGDFVLLSNVYAAR
Sbjct: 361 AYDIVNSLPFPNMVLWQTLLGACRTYGDVEMAELASGKLVEMGFISCGDFVLLSNVYAAR 420

Query: 421 QRWDDVGRVRDAMRRRDVKKTPGFSYIEVKGMMHKFVYGDQSHSSCREIYAKLDEIKFRI 480
           QRWDDVGRVRDAMR RDVKKTPGFSYIE+KG M++FVYGDQSHSSCREIYAKLDEI  RI
Sbjct: 421 QRWDDVGRVRDAMRIRDVKKTPGFSYIEIKGKMYQFVYGDQSHSSCREIYAKLDEINLRI 480

Query: 481 KAYGYAAETGNVLHDIGDEDKENALCYHSEKLAVAFGLTCTEEGTLIQVIKNLRICGDCH 540
           KAYGY+A+T NVLHDIGDEDKENALCYHSEKLAVAFGLTCTEEG  IQVIKNLRICGDCH
Sbjct: 481 KAYGYSADTSNVLHDIGDEDKENALCYHSEKLAVAFGLTCTEEGIPIQVIKNLRICGDCH 540

Query: 541 VVIKLISKIYNREIVVRDRTRFHRFKEGL 570
           VVIKLISKIYNREI+VRDRTRFHRFKEGL
Sbjct: 541 VVIKLISKIYNREIIVRDRTRFHRFKEGL 569

BLAST of Cla97C08G151610 vs. NCBI nr
Match: XP_022133715.1 (pentatricopeptide repeat-containing protein At1g34160 [Momordica charantia])

HSP 1 Score: 952.2 bits (2460), Expect = 8.6e-274
Identity = 515/569 (90.51%), Positives = 535/569 (94.02%), Query Frame = 0

Query: 1   MAYFNLLLQKCSSFSQIKQLQANLIINGRFQFSSSRTMLLELCAISSFGDLSYALHIFRH 60
           MAYF+LLLQKCSSFSQIKQLQANLI NGRFQ SSSRT LLELCAIS FGDL +A+ IFRH
Sbjct: 18  MAYFDLLLQKCSSFSQIKQLQANLITNGRFQLSSSRTKLLELCAISPFGDLPHAIRIFRH 77

Query: 61  IRYPSTNDWNAVIRGTALSSDPENAVFWYRAMAASNGPHRIDALTCSFALKACARALARS 120
           IR P TNDWNAVIRGTALSSDP NAV WYRAMAAS GPHR+DALTCSF LKACARALARS
Sbjct: 78  IRAPPTNDWNAVIRGTALSSDPANAVLWYRAMAASIGPHRVDALTCSFTLKACARALARS 137

Query: 121 EAMQLHSQLLRFGFNADVLLQTTLLDAYAKVGDLDHAQNLFDEMPQPDIASWNALIAGFA 180
           EAMQLHSQLLRFGF+AD+LLQTTLLDAYAKVGDLD AQ LFDE+PQPDIASWNALIAGFA
Sbjct: 138 EAMQLHSQLLRFGFDADILLQTTLLDAYAKVGDLDRAQKLFDEIPQPDIASWNALIAGFA 197

Query: 181 QGSRPGDAIMMFKKMKEDGNLRPNGVTVQGALLACSQLGALKEGENVHKHILEEKLDMNV 240
           QGSRPGDAI +FK+MKEDG LRPN VTVQGALLACSQLGALKEGE VHK+I+EE LDMNV
Sbjct: 198 QGSRPGDAIALFKRMKEDGYLRPNEVTVQGALLACSQLGALKEGEEVHKYIIEENLDMNV 257

Query: 241 QVCNVVIDMYAKCGSVDKAYWVFENMRCDKSLITWNTMIMAFAMHGDGYKALDLFEKLGR 300
           QVCNVVIDMYAKCGSVDKAYWVF+NMRC+KSLI+WNTMIMAFA+HG GYKALDLFEKLG 
Sbjct: 258 QVCNVVIDMYAKCGSVDKAYWVFQNMRCEKSLISWNTMIMAFAIHGHGYKALDLFEKLGL 317

Query: 301 CGMPPDAVSYLAVLCACNHAGLVEDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
            G+ PDAVSYL VLCACNH GLVED XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 318 SGISPDAVSYLVVLCACNHGGLVEDGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 377

Query: 361 XXXXXXXXPFPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFISCGDFVLLSNVYAAR 420
           XXXXXXXXPFPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFISCGDFVLLSNVYAA 
Sbjct: 378 XXXXXXXXPFPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFISCGDFVLLSNVYAAC 437

Query: 421 QRWDDVGRVRDAMRRRDVKKTPGFSYIEVKGMMHKFVYGDQSHSSCREIYAKLDEIKFRI 480
           QRWDDVGR+RDAMRRRDVKKTPGFSY EVKG MHKF YGDQ+HSSC EIYAKLDEIKFRI
Sbjct: 438 QRWDDVGRIRDAMRRRDVKKTPGFSYTEVKGKMHKFAYGDQNHSSCHEIYAKLDEIKFRI 497

Query: 481 KAYGYAAETGNVLHDIGDEDKENALCYHSEKLAVAFGLTCTEEGTLIQVIKNLRICGDCH 540
           KA GYAAETGNVLHDIG+EDKENALCYHSEKLAVAFGL CTEEGTLIQVIKNLRIC DCH
Sbjct: 498 KACGYAAETGNVLHDIGEEDKENALCYHSEKLAVAFGLICTEEGTLIQVIKNLRICVDCH 557

Query: 541 VVIKLISKIYNREIVVRDRTRFHRFKEGL 570
           VVIKLISKIYNREI+VRDRTRFHRFKEGL
Sbjct: 558 VVIKLISKIYNREIIVRDRTRFHRFKEGL 586

BLAST of Cla97C08G151610 vs. NCBI nr
Match: XP_023526446.1 (pentatricopeptide repeat-containing protein At1g34160 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 936.0 bits (2418), Expect = 6.4e-269
Identity = 503/569 (88.40%), Positives = 532/569 (93.50%), Query Frame = 0

Query: 1   MAYFNLLLQKCSSFSQIKQLQANLIINGRFQFSSSRTMLLELCAISSFGDLSYALHIFRH 60
           MAYF+LLLQKCSSFSQIKQLQANLI NG F FSSSRT LLELCAIS FGDLS+ALH+FRH
Sbjct: 1   MAYFDLLLQKCSSFSQIKQLQANLITNGHFHFSSSRTKLLELCAISPFGDLSHALHVFRH 60

Query: 61  IRYPSTNDWNAVIRGTALSSDPENAVFWYRAMAASNGPHRIDALTCSFALKACARALARS 120
           I  PST DWNAVIRGTALSS+P NA+FWYR M ASNGPHR+DALTCSFALKACARALARS
Sbjct: 61  IHSPSTKDWNAVIRGTALSSNPSNAIFWYRTMTASNGPHRVDALTCSFALKACARALARS 120

Query: 121 EAMQLHSQLLRFGFNADVLLQTTLLDAYAKVGDLDHAQNLFDEMPQPDIASWNALIAGFA 180
           E MQLHSQ+LRFGF+ADVLLQTTLLDAYAKV DLD AQ +FDEMP+PDIASWN+LIAGFA
Sbjct: 121 EVMQLHSQVLRFGFDADVLLQTTLLDAYAKVEDLDQAQKVFDEMPEPDIASWNSLIAGFA 180

Query: 181 QGSRPGDAIMMFKKMKEDGNLRPNGVTVQGALLACSQLGALKEGENVHKHILEEKLDMNV 240
           QG RP DAI +FK+MKEDGNLRPN VTVQGAL ACSQLG LKEGENVHK+I EE LD  V
Sbjct: 181 QGGRPSDAIDLFKRMKEDGNLRPNEVTVQGALSACSQLGTLKEGENVHKYIAEENLDTVV 240

Query: 241 QVCNVVIDMYAKCGSVDKAYWVFENMRCDKSLITWNTMIMAFAMHGDGYKALDLFEKLGR 300
           QVCNVVIDMYAKCGSVDKAYWVFENMRC+KSLITWNTMIMAFAMHGDG+KALDLFEKLGR
Sbjct: 241 QVCNVVIDMYAKCGSVDKAYWVFENMRCEKSLITWNTMIMAFAMHGDGHKALDLFEKLGR 300

Query: 301 CGMPPDAVSYLAVLCACNHAGLVEDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
            G+ PDA+SYLAVLCACNHAGL+E+XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 SGIYPDAISYLAVLCACNHAGLIEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXPFPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFISCGDFVLLSNVYAAR 420
           XXXXXXX PFPNMVLWQTLLGACRTYG+V+MAE+ASRKLVEMGFISCGDFVLLSNVYAAR
Sbjct: 361 XXXXXXXMPFPNMVLWQTLLGACRTYGDVKMAEMASRKLVEMGFISCGDFVLLSNVYAAR 420

Query: 421 QRWDDVGRVRDAMRRRDVKKTPGFSYIEVKGMMHKFVYGDQSHSSCREIYAKLDEIKFRI 480
           +RWDDVGRVRDAMRRRDVKKTPGFSYIEVKG MHKF+YGD+SHSSCREIYAKLDEI FRI
Sbjct: 421 RRWDDVGRVRDAMRRRDVKKTPGFSYIEVKGNMHKFLYGDRSHSSCREIYAKLDEIMFRI 480

Query: 481 KAYGYAAETGNVLHDIGDEDKENALCYHSEKLAVAFGLTCTEEGTLIQVIKNLRICGDCH 540
           KA GY AETGNVLHDI +EDKEN LCYHSEKLAVAFGL+CTEEGT IQVIKNLRICGDCH
Sbjct: 481 KAIGYTAETGNVLHDIEEEDKENVLCYHSEKLAVAFGLSCTEEGTPIQVIKNLRICGDCH 540

Query: 541 VVIKLISKIYNREIVVRDRTRFHRFKEGL 570
           VVIKLISK YNREI VRDRTRFHRFK+GL
Sbjct: 541 VVIKLISKSYNREIFVRDRTRFHRFKDGL 569

BLAST of Cla97C08G151610 vs. NCBI nr
Match: XP_022990287.1 (pentatricopeptide repeat-containing protein At1g34160 isoform X1 [Cucurbita maxima])

HSP 1 Score: 935.3 bits (2416), Expect = 1.1e-268
Identity = 462/569 (81.20%), Positives = 490/569 (86.12%), Query Frame = 0

Query: 1   MAYFNLLLQKCSSFSQIKQLQANLIINGRFQFSSSRTMLLELCAISSFGDLSYALHIFRH 60
           MAYF+LLLQKCSSFSQIKQLQANLI NG F FSSSRT LLELCAIS FGDLS+ALHIFRH
Sbjct: 23  MAYFDLLLQKCSSFSQIKQLQANLITNGHFHFSSSRTKLLELCAISPFGDLSHALHIFRH 82

Query: 61  IRYPSTNDWNAVIRGTALSSDPENAVFWYRAMAASNGPHRIDALTCSFALKACARALARS 120
           +  PST DWNAVIRGTALSS+P NA+FWYR M ASNGPHR+DALTCSFALKACARALARS
Sbjct: 83  VHSPSTKDWNAVIRGTALSSNPSNALFWYRTMNASNGPHRVDALTCSFALKACARALARS 142

Query: 121 EAMQLHSQLLRFGFNADVLLQTTLLDAYAKVGDLDHAQNLFDEMPQPDIASWNALIAGFA 180
           E MQLHSQ+LRFGF+ADVLLQTTLLDAYAKV DLD AQ +FDEMP+PDIASWN+LIAGFA
Sbjct: 143 EVMQLHSQVLRFGFDADVLLQTTLLDAYAKVEDLDQAQKVFDEMPEPDIASWNSLIAGFA 202

Query: 181 QGSRPGDAIMMFKKMKEDGNLRPNGVTVQGALLACSQLGALKEGENVHKHILEEKLDMNV 240
           QG RP DAI +FK+MKEDGNLRPN VTVQGAL ACSQLG LKEGENVHK+I+EE LD  V
Sbjct: 203 QGGRPSDAIDLFKRMKEDGNLRPNEVTVQGALSACSQLGTLKEGENVHKYIVEENLDTIV 262

Query: 241 QVCNVVIDMYAKCGSVDKAYWVFENMRCDKSLITWNTMIMAFAMHGDGYKALDLFEKLGR 300
           QVCNVVIDMYAKCGSVDKAYWVFENMRC+KSLITWNTMIMAFAMHGDGYKALDLFEKLGR
Sbjct: 263 QVCNVVIDMYAKCGSVDKAYWVFENMRCEKSLITWNTMIMAFAMHGDGYKALDLFEKLGR 322

Query: 301 CGMPPDAVSYLAVLCACNHAGLVEDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
            G+ PDA+SYLAVLCACNHAGL+ED                                   
Sbjct: 323 SGICPDAISYLAVLCACNHAGLIEDGLKLCNSMMQRGVAPNIKHYGVVVDLLGRAGRLKE 382

Query: 361 XXXXXXXXPFPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFISCGDFVLLSNVYAAR 420
                   PFPNMVLWQTLLGACRTYG+V+MAE+ASRKLVEMGFISCGDFVLLSNVYAAR
Sbjct: 383 AYEIVSSMPFPNMVLWQTLLGACRTYGDVKMAEMASRKLVEMGFISCGDFVLLSNVYAAR 442

Query: 421 QRWDDVGRVRDAMRRRDVKKTPGFSYIEVKGMMHKFVYGDQSHSSCREIYAKLDEIKFRI 480
           +RWDDVGRVRDAMRRRDVKKTPGFSYIEVKG MHKF+YGD+SHSSCREIYAKLDEI FRI
Sbjct: 443 RRWDDVGRVRDAMRRRDVKKTPGFSYIEVKGNMHKFLYGDRSHSSCREIYAKLDEIMFRI 502

Query: 481 KAYGYAAETGNVLHDIGDEDKENALCYHSEKLAVAFGLTCTEEGTLIQVIKNLRICGDCH 540
           KA GY AETGNVLHDI +EDKEN LC HSEKLAVAFGL+CTEEGT IQVIKNLRICGDCH
Sbjct: 503 KAIGYTAETGNVLHDIEEEDKENVLCCHSEKLAVAFGLSCTEEGTPIQVIKNLRICGDCH 562

Query: 541 VVIKLISKIYNREIVVRDRTRFHRFKEGL 570
           VVIKLISK YNREI VRDRTRFHRFK+GL
Sbjct: 563 VVIKLISKSYNREIFVRDRTRFHRFKDGL 591

BLAST of Cla97C08G151610 vs. TrEMBL
Match: tr|A0A0A0KB77|A0A0A0KB77_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G052690 PE=4 SV=1)

HSP 1 Score: 976.5 bits (2523), Expect = 2.8e-281
Identity = 488/569 (85.76%), Positives = 504/569 (88.58%), Query Frame = 0

Query: 1   MAYFNLLLQKCSSFSQIKQLQANLIINGRFQFSSSRTMLLELCAISSFGDLSYALHIFRH 60
           MAYFNLLLQKCSSFSQIKQLQANLIING F FSSSRT LLELCAISSFGDLSYALHIFR+
Sbjct: 1   MAYFNLLLQKCSSFSQIKQLQANLIINGDFHFSSSRTKLLELCAISSFGDLSYALHIFRY 60

Query: 61  IRYPSTNDWNAVIRGTALSSDPENAVFWYRAMAASNGPHRIDALTCSFALKACARALARS 120
           I YPSTNDWNAVIRGTALSSDP NAVFWYRAMAASNG HRIDALTCSFALKACARALARS
Sbjct: 61  IPYPSTNDWNAVIRGTALSSDPANAVFWYRAMAASNGLHRIDALTCSFALKACARALARS 120

Query: 121 EAMQLHSQLLRFGFNADVLLQTTLLDAYAKVGDLDHAQNLFDEMPQPDIASWNALIAGFA 180
           EA+QLHSQLLRFGFNADVLLQTTLLDAYAK+GDLD AQ LFDEMPQPDIASWNALIAGFA
Sbjct: 121 EAIQLHSQLLRFGFNADVLLQTTLLDAYAKIGDLDLAQKLFDEMPQPDIASWNALIAGFA 180

Query: 181 QGSRPGDAIMMFKKMKEDGNLRPNGVTVQGALLACSQLGALKEGENVHKHILEEKLDMNV 240
           QGSRP DAIM FK+MK DGNLRPN VTVQGALLACSQLGALKEGE+VHK+I+EEKL+ NV
Sbjct: 181 QGSRPADAIMTFKRMKVDGNLRPNAVTVQGALLACSQLGALKEGESVHKYIVEEKLNSNV 240

Query: 241 QVCNVVIDMYAKCGSVDKAYWVFENMRCDKSLITWNTMIMAFAMHGDGYKALDLFEKLGR 300
           QVCNVVIDMYAKCGS+DKAYWVFENMRCDKSLITWNTMIMAFAMHGDG+KALDLFEKLGR
Sbjct: 241 QVCNVVIDMYAKCGSMDKAYWVFENMRCDKSLITWNTMIMAFAMHGDGHKALDLFEKLGR 300

Query: 301 CGMPPDAVSYLAVLCACNHAGLVEDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
            GM PDAVSYLAVLCACNHAGLVED                                   
Sbjct: 301 SGMSPDAVSYLAVLCACNHAGLVEDGLKLFNSMTQRGLEPNIKHYGSMVDLLGRAGRLKE 360

Query: 361 XXXXXXXXPFPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFISCGDFVLLSNVYAAR 420
                   PFPNMVLWQTLLGACRTYG+VEMAELASRKLVEMGFISCGDFVLLSNVYAAR
Sbjct: 361 AYDIVSSLPFPNMVLWQTLLGACRTYGDVEMAELASRKLVEMGFISCGDFVLLSNVYAAR 420

Query: 421 QRWDDVGRVRDAMRRRDVKKTPGFSYIEVKGMMHKFVYGDQSHSSCREIYAKLDEIKFRI 480
           QRWDDVGRVRDAMRRRDVKKTPGFSYIE+KG M+KFV GDQSHSSCREIYAKLDEI  RI
Sbjct: 421 QRWDDVGRVRDAMRRRDVKKTPGFSYIEIKGKMYKFVNGDQSHSSCREIYAKLDEINLRI 480

Query: 481 KAYGYAAETGNVLHDIGDEDKENALCYHSEKLAVAFGLTCTEEGTLIQVIKNLRICGDCH 540
           KAYGY+A+T NVLHDIGDEDKENALCYHSEKLAVAFGLTCTEEGT IQVIKNLRICGDCH
Sbjct: 481 KAYGYSADTSNVLHDIGDEDKENALCYHSEKLAVAFGLTCTEEGTPIQVIKNLRICGDCH 540

Query: 541 VVIKLISKIYNREIVVRDRTRFHRFKEGL 570
           VVIKLISKIY REI+VRDRTRFHRFKEGL
Sbjct: 541 VVIKLISKIYIREIIVRDRTRFHRFKEGL 569

BLAST of Cla97C08G151610 vs. TrEMBL
Match: tr|A0A1S3C3U7|A0A1S3C3U7_CUCME (pentatricopeptide repeat-containing protein At1g34160 OS=Cucumis melo OX=3656 GN=LOC103496566 PE=4 SV=1)

HSP 1 Score: 970.7 bits (2508), Expect = 1.5e-279
Identity = 482/569 (84.71%), Positives = 501/569 (88.05%), Query Frame = 0

Query: 1   MAYFNLLLQKCSSFSQIKQLQANLIINGRFQFSSSRTMLLELCAISSFGDLSYALHIFRH 60
           MAYFNLLLQKCSSFS IKQLQANLIING F FSSSRT LLELCA+SS GDLSYALHIFR+
Sbjct: 1   MAYFNLLLQKCSSFSHIKQLQANLIINGDFHFSSSRTKLLELCAVSSCGDLSYALHIFRY 60

Query: 61  IRYPSTNDWNAVIRGTALSSDPENAVFWYRAMAASNGPHRIDALTCSFALKACARALARS 120
           IRYPSTNDWNA+IRGTALSSDP NAV WYRAMAASNGPHRIDALTCSFALKACARALA S
Sbjct: 61  IRYPSTNDWNAIIRGTALSSDPANAVVWYRAMAASNGPHRIDALTCSFALKACARALACS 120

Query: 121 EAMQLHSQLLRFGFNADVLLQTTLLDAYAKVGDLDHAQNLFDEMPQPDIASWNALIAGFA 180
           EA+QLHSQLLRFGFNADVLLQTTLLD YAKVGDLD AQ LFDEMP+PDIASWNALI+GFA
Sbjct: 121 EAIQLHSQLLRFGFNADVLLQTTLLDVYAKVGDLDLAQKLFDEMPRPDIASWNALISGFA 180

Query: 181 QGSRPGDAIMMFKKMKEDGNLRPNGVTVQGALLACSQLGALKEGENVHKHILEEKLDMNV 240
           QGSRP DAIMMFK+MKE GNLRPN VTVQGALLACSQLG LKEGENVHK+I+EEKLDMNV
Sbjct: 181 QGSRPADAIMMFKRMKEGGNLRPNAVTVQGALLACSQLGTLKEGENVHKYIVEEKLDMNV 240

Query: 241 QVCNVVIDMYAKCGSVDKAYWVFENMRCDKSLITWNTMIMAFAMHGDGYKALDLFEKLGR 300
           QVCNVVIDMYAKCGS+DKAYWVFENMRCDKSLITWNTMIMAFAMHGDGY+ALDLF+KLGR
Sbjct: 241 QVCNVVIDMYAKCGSMDKAYWVFENMRCDKSLITWNTMIMAFAMHGDGYEALDLFKKLGR 300

Query: 301 CGMPPDAVSYLAVLCACNHAGLVEDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
            GM PDAVSYLAVLCACNHAGLVED                                   
Sbjct: 301 SGMSPDAVSYLAVLCACNHAGLVEDGLKLFNLMAQRGLEPNIKHYGSMVDLLGRAGRLKE 360

Query: 361 XXXXXXXXPFPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFISCGDFVLLSNVYAAR 420
                   PFPNMVLWQTLLGACRTYG+VEMAELAS KLVEMGFISCGDFVLLSNVYAAR
Sbjct: 361 AYDIVNSLPFPNMVLWQTLLGACRTYGDVEMAELASGKLVEMGFISCGDFVLLSNVYAAR 420

Query: 421 QRWDDVGRVRDAMRRRDVKKTPGFSYIEVKGMMHKFVYGDQSHSSCREIYAKLDEIKFRI 480
           QRWDDVGRVRDAMR RDVKKTPGFSYIE+KG M++FVYGDQSHSSCREIYAKLDEI  RI
Sbjct: 421 QRWDDVGRVRDAMRIRDVKKTPGFSYIEIKGKMYQFVYGDQSHSSCREIYAKLDEINLRI 480

Query: 481 KAYGYAAETGNVLHDIGDEDKENALCYHSEKLAVAFGLTCTEEGTLIQVIKNLRICGDCH 540
           KAYGY+A+T NVLHDIGDEDKENALCYHSEKLAVAFGLTCTEEG  IQVIKNLRICGDCH
Sbjct: 481 KAYGYSADTSNVLHDIGDEDKENALCYHSEKLAVAFGLTCTEEGIPIQVIKNLRICGDCH 540

Query: 541 VVIKLISKIYNREIVVRDRTRFHRFKEGL 570
           VVIKLISKIYNREI+VRDRTRFHRFKEGL
Sbjct: 541 VVIKLISKIYNREIIVRDRTRFHRFKEGL 569

BLAST of Cla97C08G151610 vs. TrEMBL
Match: tr|A0A2I4FME1|A0A2I4FME1_9ROSI (pentatricopeptide repeat-containing protein At1g34160 OS=Juglans regia OX=51240 GN=LOC109000395 PE=4 SV=1)

HSP 1 Score: 758.4 bits (1957), Expect = 1.2e-215
Identity = 389/570 (68.25%), Positives = 435/570 (76.32%), Query Frame = 0

Query: 1   MAYFNLLLQKCSSFSQIKQLQANLIINGRFQFSSSRTMLLELCAISSFGDLSYALHIFRH 60
           MA    L+QKC+S S +KQLQA+ I   +  F SSRT LLELCA+S FGDLSYA H+FRH
Sbjct: 1   MASLESLIQKCNSLSHVKQLQAHFITTSQLLFCSSRTKLLELCAVSPFGDLSYATHLFRH 60

Query: 61  IRYPSTNDWNAVIRGTALSSDPENAVFWYRAMAASNGPHRIDALTCSFALKACARALARS 120
           I+ PSTNDWNAV+RG A   +P  A+ WYR M  S    R+DALTCSFALKACARALARS
Sbjct: 61  IQNPSTNDWNAVVRGLAAGPEPTRAISWYRTM--SRQSCRVDALTCSFALKACARALARS 120

Query: 121 EAMQLHSQLLRFGFNADVLLQTTLLDAYAKVGDLDHAQNLFDEMPQPDIASWNALIAGFA 180
           EAMQ+HSQ++R GF ADVLL TTLLDAYAK GDLD A+ +FDEM   DIASWNALIAG A
Sbjct: 121 EAMQIHSQVVRLGFYADVLLMTTLLDAYAKSGDLDGAKKVFDEMVVRDIASWNALIAGLA 180

Query: 181 QGSRPGDAIMMFKKMKEDGNLRPNGVTVQGALLACSQLGALKEGENVHKHILEEKLDMNV 240
           QGSRP +AI +F  M+ +G  +P+ +TV GAL ACSQLGALKEGE VH +I+ EKLDMNV
Sbjct: 181 QGSRPSEAIELFNGMRVEG-WKPDEITVLGALSACSQLGALKEGEKVHSYIMGEKLDMNV 240

Query: 241 QVCNVVIDMYAKCGSVDKAYWVFENMRCDKSLITWNTMIMAFAMHGDGYKALDLFEKLGR 300
           QVCN VIDMYAKCG  +KAY VF+NM C +SL+TWNTMIMAFAMHGDGYKAL+LFE++G+
Sbjct: 241 QVCNAVIDMYAKCGFANKAYTVFKNM-CCRSLVTWNTMIMAFAMHGDGYKALELFEQMGQ 300

Query: 301 CGMPPDAVSYLAVLCACNHAGLVEDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
            G+ PDAVSYL  LCACNHAGLV D                                   
Sbjct: 301 AGVHPDAVSYLGALCACNHAGLVNDGVRLFDSMAGSGLTPNVKHYGSVVDLLGRAGRLKE 360

Query: 361 XXXXXXXXPF-PNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFISCGDFVLLSNVYAA 420
                   P   + VLWQTLLGAC+TYGNVEMAE+ASRKLVEMG  SCGDFVLLSNVYAA
Sbjct: 361 AYDIVSSMPMKSDEVLWQTLLGACKTYGNVEMAEIASRKLVEMGSKSCGDFVLLSNVYAA 420

Query: 421 RQRWDDVGRVRDAMRRRDVKKTPGFSYIEVKGMMHKFVYGDQSHSSCREIYAKLDEIKFR 480
            +RW DV RVR AM+ RDVKK PGFSYIEV G++HKFV GDQ+H + REIYAKLDEIKFR
Sbjct: 421 HERWKDVDRVRTAMKDRDVKKIPGFSYIEVDGVIHKFVNGDQNHFNWREIYAKLDEIKFR 480

Query: 481 IKAYGYAAETGNVLHDIGDEDKENALCYHSEKLAVAFGLTCTEEGTLIQVIKNLRICGDC 540
            KAYGY AET  VLHDIG E+KENALCYHSEKLAVAFGL  T EGT IQVIKNLRICGDC
Sbjct: 481 TKAYGYVAETNYVLHDIGQEEKENALCYHSEKLAVAFGLISTAEGTPIQVIKNLRICGDC 540

Query: 541 HVVIKLISKIYNREIVVRDRTRFHRFKEGL 570
           HVVIK+ISKIYNREIVVRDR RFHRF EGL
Sbjct: 541 HVVIKIISKIYNREIVVRDRARFHRFTEGL 566

BLAST of Cla97C08G151610 vs. TrEMBL
Match: tr|A0A2N9EDZ7|A0A2N9EDZ7_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS746 PE=4 SV=1)

HSP 1 Score: 752.7 bits (1942), Expect = 6.6e-214
Identity = 427/570 (74.91%), Positives = 482/570 (84.56%), Query Frame = 0

Query: 1   MAYFNLLLQKCSSFSQIKQLQANLIINGRFQFSSSRTMLLELCAISSFGDLSYALHIFRH 60
           MAY + L+Q+C+S + IKQLQA+ II+G+FQF  SRT LLELCAIS  GDLS+A  IFR 
Sbjct: 3   MAYLDSLIQRCNSLTHIKQLQAHFIISGQFQFCPSRTKLLELCAISPAGDLSHATRIFRQ 62

Query: 61  IRYPSTNDWNAVIRGTALSSDPENAVFWYRAMAASNGPHRIDALTCSFALKACARALARS 120
           I+ PSTNDWNAV+RG A   +P  A+  YR M + +   RIDALTCSFALKACARALARS
Sbjct: 63  IQNPSTNDWNAVVRGLASGPEPTMAISCYRTM-SRHSLTRIDALTCSFALKACARALARS 122

Query: 121 EAMQLHSQLLRFGFNADVLLQTTLLDAYAKVGDLDHAQNLFDEMPQPDIASWNALIAGFA 180
           EA+QLHSQLLRFGF ADVLL TTLLD YAK GDLD A+ +FDEM   DIASWNALI+G A
Sbjct: 123 EAIQLHSQLLRFGFKADVLLLTTLLDVYAKTGDLDSAEKVFDEMIVRDIASWNALISGLA 182

Query: 181 QGSRPGDAIMMFKKMKEDGNLRPNGVTVQGALLACSQLGALKEGENVHKHILEEKLDMNV 240
           QGSRP +AI +F +M+ +G  +PN VTV G L ACSQLGALKEG+ +H +I+E+KLDMNV
Sbjct: 183 QGSRPTEAIALFNRMRVEG-WKPNEVTVLGVLSACSQLGALKEGKKIHGYIMEQKLDMNV 242

Query: 241 QVCNVVIDMYAKCGSVDKAYWVFENMRCDKSLITWNTMIMAFAMHGDGYKALDLFEKLGR 300
            VCN VIDMYAKCG  DKAYWVFE M   KSL+TWNTMIMAFAMHGDG KAL+LFE++G+
Sbjct: 243 IVCNAVIDMYAKCGFADKAYWVFEKMCGRKSLVTWNTMIMAFAMHGDGSKALELFEQMGQ 302

Query: 301 CGMPPDAVSYLAVLCACNHAGLVEDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
            G+ PDAVSYLAVLCACNHAGLVE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 303 TGVHPDAVSYLAVLCACNHAGLVEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 362

Query: 361 XXXXXXXXP-FPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFISCGDFVLLSNVYAA 420
           XXXXXXXXP FP++VLWQ+LLGAC+TY NVEMAE AS+KL+EMG  SCGDFVLLSNVYAA
Sbjct: 363 XXXXXXXXPMFPDVVLWQSLLGACKTYDNVEMAEFASQKLLEMGSNSCGDFVLLSNVYAA 422

Query: 421 RQRWDDVGRVRDAMRRRDVKKTPGFSYIEVKGMMHKFVYGDQSHSSCREIYAKLDEIKFR 480
            +RW+DVGRVR AM+ RDVKK PGFSYIEV G++HKFV  DQSH++  EIYAKLDEIKFR
Sbjct: 423 HERWNDVGRVRKAMKNRDVKKIPGFSYIEVDGVIHKFVNADQSHTNWCEIYAKLDEIKFR 482

Query: 481 IKAYGYAAETGNVLHDIGDEDKENALCYHSEKLAVAFGLTCTEEGTLIQVIKNLRICGDC 540
           IKAYGY AET  VLHDIG E+KEN LCYHSEKLAVAFGL    EGT IQVIKNLRICGDC
Sbjct: 483 IKAYGYVAETNFVLHDIGQEEKENVLCYHSEKLAVAFGLISISEGTAIQVIKNLRICGDC 542

Query: 541 HVVIKLISKIYNREIVVRDRTRFHRFKEGL 570
           H VIK+ISK+Y+REI+VRDR RFHRFK+GL
Sbjct: 543 HAVIKIISKVYDREIIVRDRARFHRFKDGL 570

BLAST of Cla97C08G151610 vs. TrEMBL
Match: tr|M5VIA7|M5VIA7_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_8G045500 PE=4 SV=1)

HSP 1 Score: 749.6 bits (1934), Expect = 5.6e-213
Identity = 419/570 (73.51%), Positives = 475/570 (83.33%), Query Frame = 0

Query: 1   MAYFNLLLQKCSSFSQIKQLQANLIINGRFQF-SSSRTMLLELCAISSFGDLSYALHIFR 60
           MA    LLQKC+S ++IKQLQ++L+ +G+FQF  S  T L+ELCA+S   DLS+A+ +F 
Sbjct: 1   MANLESLLQKCTSLARIKQLQSHLLTSGKFQFYPSLTTKLIELCALSPIADLSHAITLFH 60

Query: 61  HIRYPSTNDWNAVIRGTALSSDPENAVFWYRAMAASNGPHRIDALTCSFALKACARALAR 120
            +R PSTN WNAV+RG A S  P  A+ WY+ M  S    ++DALTCSFALKACARALA 
Sbjct: 61  QLRKPSTNQWNAVVRGLAQSLQPTQAISWYKTM--SKASQKVDALTCSFALKACARALAF 120

Query: 121 SEAMQLHSQLLRFGFNADVLLQTTLLDAYAKVGDLDHAQNLFDEMPQPDIASWNALIAGF 180
           SEAMQ+HSQ++RFGF  DVLLQTTLLD YAKVGDL  AQ +FDEM + DIASWNALIAG 
Sbjct: 121 SEAMQIHSQIVRFGFGVDVLLQTTLLDVYAKVGDLGFAQKVFDEMSERDIASWNALIAGL 180

Query: 181 AQGSRPGDAIMMFKKMKEDGNLRPNGVTVQGALLACSQLGALKEGENVHKHILEEKLDMN 240
           AQGSRP +AI +FK+M E+  L+PN VTV GAL ACSQLG +K GE +H +I+EEKLDM+
Sbjct: 181 AQGSRPTEAIALFKRMSEEEGLKPNEVTVLGALSACSQLGGVKGGEKIHVYIMEEKLDMH 240

Query: 241 VQVCNVVIDMYAKCGSVDKAYWVFENMRCDKSLITWNTMIMAFAMHGDGYKALDLFEKLG 300
           V VCN VIDMYAKCG VDKAYWVF+NM+C K+LITWNTMIMAFAMHGDG KAL+LF ++ 
Sbjct: 241 VIVCNAVIDMYAKCGFVDKAYWVFKNMKCGKNLITWNTMIMAFAMHGDGGKALELFGEMA 300

Query: 301 RCGMPPDAVSYLAVLCACNHAGLVEDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
           + G+ PDAVSYLA LCACNHAGLVED  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 KSGVCPDAVSYLAALCACNHAGLVEDGVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXP-FPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFISCGDFVLLSNVYA 420
           XXXXXXXXXP F ++VLWQTLLGA +TYGNVEMAE+ASRKLVEMG   CGDFVLLSNVYA
Sbjct: 361 XXXXXXXXXPMFADVVLWQTLLGASKTYGNVEMAEMASRKLVEMGSKGCGDFVLLSNVYA 420

Query: 421 ARQRWDDVGRVRDAMRRRDVKKTPGFSYIEVKGMMHKFVYGDQSHSSCREIYAKLDEIKF 480
           A +RWDDVGRVR+AM+RRDVKK PGF YIEV G++HKFV GDQSH   REIYAKLDEI  
Sbjct: 421 AHERWDDVGRVREAMKRRDVKKIPGFGYIEVDGVIHKFVNGDQSHVKWREIYAKLDEIML 480

Query: 481 RIKAYGYAAETGNVLHDIGDEDKENALCYHSEKLAVAFGLTCTEEGTLIQVIKNLRICGD 540
            +KAYGY A+T NVLHDIG+E+KENAL YH EKLAVAFGL  T EGT IQVIKNLRIC D
Sbjct: 481 SVKAYGYVAKTNNVLHDIGEEEKENALSYHCEKLAVAFGLISTSEGTPIQVIKNLRICDD 540

Query: 541 CHVVIKLISKIYNREIVVRDRTRFHRFKEG 569
           CH VIKLISKIYNREI+VRDR RFHRFKEG
Sbjct: 541 CHAVIKLISKIYNREIIVRDRARFHRFKEG 568

BLAST of Cla97C08G151610 vs. Swiss-Prot
Match: sp|Q9FX24|PPR71_ARATH (Pentatricopeptide repeat-containing protein At1g34160 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H68 PE=2 SV=2)

HSP 1 Score: 591.7 bits (1524), Expect = 9.6e-168
Identity = 317/574 (55.23%), Positives = 386/574 (67.25%), Query Frame = 0

Query: 3   YFNLLLQKCSSFSQIKQLQANLIINGRFQFSSSRTMLLELCAISSFGDLSYALHIFRHIR 62
           Y   ++QKC SFSQIKQLQ++ +  G FQ S  R+ LLE CAIS FGDLS+A+ IFR+I 
Sbjct: 5   YMETMIQKCVSFSQIKQLQSHFLTAGHFQSSFLRSRLLERCAISPFGDLSFAVQIFRYIP 64

Query: 63  YPSTNDWNAVIRGTALSSDPENAVFWYRAM----AASNGPHRIDALTCSFALKACARALA 122
            P TNDWNA+IRG A SS P  A  WYR+M    ++S+   R+DALTCSF LKACARAL 
Sbjct: 65  KPLTNDWNAIIRGFAGSSHPSLAFSWYRSMLQQSSSSSAICRVDALTCSFTLKACARALC 124

Query: 123 RSEAMQLHSQLLRFGFNADVLLQTTLLDAYAKVGDLDHAQNLFDEMPQPDIASWNALIAG 182
            S   QLH Q+ R G +AD LL TTLLDAY+K GDL  A  LFDEMP  D+ASWNALIAG
Sbjct: 125 SSAMDQLHCQINRRGLSADSLLCTTLLDAYSKNGDLISAYKLFDEMPVRDVASWNALIAG 184

Query: 183 FAQGSRPGDAIMMFKKMKEDGNLRPNGVTVQGALLACSQLGALKEGENVHKHILEEKLDM 242
              G+R  +A+ ++K+M+ +G +R + VTV  AL ACS LG +KEGEN+      +    
Sbjct: 185 LVSGNRASEAMELYKRMETEG-IRRSEVTVVAALGACSHLGDVKEGENIFHGYSND---- 244

Query: 243 NVQVCNVVIDMYAKCGSVDKAYWVFENMRCDKSLITWNTMIMAFAMHGDGYKALDLFEKL 302
           NV V N  IDMY+KCG VDKAY VFE     KS++TWNTMI  FA+HG+ ++AL++F+KL
Sbjct: 245 NVIVSNAAIDMYSKCGFVDKAYQVFEQFTGKKSVVTWNTMITGFAVHGEAHRALEIFDKL 304

Query: 303 GRCGMPPDAVSYLAVLCACNHAGLVE-DXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 362
              G+ PD VSYLA L AC HAGLVE                                  
Sbjct: 305 EDNGIKPDDVSYLAALTACRHAGLVEYGLSVFNNMACKGVERNMKHYGCVVDLLSRAGRL 364

Query: 363 XXXXXXXXXXXPFPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFISCGDFVLLSNVY 422
                        P+ VLWQ+LLGA   Y +VEMAE+ASR++ EMG  + GDFVLLSNVY
Sbjct: 365 REAHDIICSMSMIPDPVLWQSLLGASEIYSDVEMAEIASREIKEMGVNNDGDFVLLSNVY 424

Query: 423 AARQRWDDVGRVRDAMRRRDVKKTPGFSYIEVKGMMHKFVYGDQSHSSCREIYAKLDEIK 482
           AA+ RW DVGRVRD M  + VKK PG SYIE KG +H+F   D+SH   REIY K+DEI+
Sbjct: 425 AAQGRWKDVGRVRDDMESKQVKKIPGLSYIEAKGTIHEFYNSDKSHEQWREIYEKIDEIR 484

Query: 483 FRIKAYGYAAETGNVLHDIGDEDKENALCYHSEKLAVAFGLTC---TEEGTLIQVIKNLR 542
           F+I+  GY A+TG VLHDIG+E+KENALCYHSEKLAVA+GL      +E + ++VI NLR
Sbjct: 485 FKIREDGYVAQTGLVLHDIGEEEKENALCYHSEKLAVAYGLMMMDGADEESPVRVINNLR 544

Query: 543 ICGDCHVVIKLISKIYNREIVVRDRTRFHRFKEG 569
           ICGDCHVV K ISKIY REI+VRDR RFHRFK+G
Sbjct: 545 ICGDCHVVFKHISKIYKREIIVRDRVRFHRFKDG 573
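
The percentages in each HSP header follow from the counts beside them: identical (or positive-scoring) positions divided by the alignment length. The following is a minimal Python sketch, not part of any BLAST toolkit, that recomputes them from a header line. The helper name `parse_hsp_stats` and the returned dictionary keys are assumptions made for illustration.

```python
import re

# Hypothetical helper, not part of any BLAST toolkit. It parses the
# "Identity = a/n (p%), Positives = b/n (q%)" line of an HSP header and
# recomputes both percentages from the raw counts.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"  # accept either spelling of the label
)

def parse_hsp_stats(line):
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, length, positives, pos_length = map(int, match.groups())
    return {
        "identical": identical,
        "positives": positives,
        "length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / pos_length, 2),
    }

# Header line of the At1g34160 hit, as reported in this record.
stats = parse_hsp_stats(
    "Identity = 317/574 (55.23%), Positives = 386/574 (67.25%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

Applied to the At1g34160 header, the recomputed values agree with the reported 55.23% and 67.25%.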

BLAST of Cla97C08G151610 vs. Swiss-Prot
Match: sp|B8YEK4|OGR1_ORYSJ (Pentatricopeptide repeat-containing protein OGR1, mitochondrial OS=Oryza sativa subsp. japonica OX=39947 GN=OGR1 PE=2 SV=1)

HSP 1 Score: 482.6 bits (1241), Expect = 6.3e-135
Identity = 268/571 (46.94%), Positives = 350/571 (61.30%), Query Frame = 0

Query: 7   LLQKCSSFSQIKQLQANLIINGRF-QFSSSRTMLLELCAISSF-GDLSYALHIFRHIRYP 66
           LL + +S     Q  A L+ +G        R   L+  A+S     L +AL + R +  P
Sbjct: 13  LLPRLASLRHYLQFHARLLTSGHLGAHPGLRARFLDRLALSPHPAALPHALLLLRSLPTP 72

Query: 67  STNDWNAVIRGTALSSDPENAVFWYRAMAASNGPHRIDALTCSFALKACARALARSEAMQ 126
           +TND NA +RG A S  P  ++             R DAL+ SFALKA AR       +Q
Sbjct: 73  ATNDLNAALRGLAASPHPARSLLLLAGRLLPALLPRPDALSLSFALKASARCSDAHTTVQ 132

Query: 127 LHSQLLRFGFNADVLLQTTLLDAYAKVGDLDHAQNLFDEMPQPDIASWNALIAGFAQGSR 186
           LH+ +LR G  ADV L TTLLD+YAK GDL  A+ +FDEM   D+A+WN+L+AG AQG+ 
Sbjct: 133 LHALVLRLGVAADVRLLTTLLDSYAKCGDLASARKVFDEMTVRDVATWNSLLAGLAQGTE 192

Query: 187 PGDAIMMFKKMKED-----GNLRPNGVTVQGALLACSQLGALKEGENVHKHILEEKLDMN 246
           P  A+ +F ++            PN VT+  AL AC+Q+G LK+G  VH+      LD N
Sbjct: 193 PNLALALFHRLANSFQELPSREEPNEVTIVAALSACAQIGLLKDGMYVHEFAKRFGLDRN 252

Query: 247 VQVCNVVIDMYAKCGSVDKAYWVFENMRC-DKSLITWNTMIMAFAMHGDGYKALDLFEKL 306
           V+VCN +IDMY+KCGS+ +A  VF +++  D++L+++N  I A +MHG G  AL LF+++
Sbjct: 253 VRVCNSLIDMYSKCGSLSRALDVFHSIKPEDQTLVSYNAAIQAHSMHGHGGDALRLFDEM 312

Query: 307 GRCGMPPDAVSYLAVLCACNHAGLVEDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 366
               + PD V+YLAVLC CNH+GLV+D                                 
Sbjct: 313 -PTRIEPDGVTYLAVLCGCNHSGLVDD---GLRVFNSMRVAPNMKHYGTIVDLLGRAGRL 372

Query: 367 XXXXXXXXXXPFP-NMVLWQTLLGACRTYGNVEMAELASRKLVEMGFISCGDFVLLSNVY 426
                     PFP ++VLWQTLLGA + +G VE+AELA+ KL E+G    GD+VLLSNVY
Sbjct: 373 TEAYDTVISMPFPADIVLWQTLLGAAKMHGVVELAELAANKLAELGSNVDGDYVLLSNVY 432

Query: 427 AARQRWDDVGRVRDAMRRRDVKKTPGFSYIEVKGMMHKFVYGDQSHSSCREIYAKLDEIK 486
           A++ RW DVGRVRD MR  DV+K PGFSY E+ G+MHKF+ GD+ H   +EIY  L++I 
Sbjct: 433 ASKARWMDVGRVRDTMRSNDVRKVPGFSYTEIDGVMHKFINGDKEHPRWQEIYRALEDIV 492

Query: 487 FRIKAYGYAAETGNVLHDIGDEDKENALCYHSEKLAVAFGLTCTEEGTLIQVIKNLRICG 546
            RI   GY  ET NVLHDIG+E+K+ ALCYHSEKLA+AFGL  T  G  ++VIKNLRICG
Sbjct: 493 SRISELGYEPETSNVLHDIGEEEKQYALCYHSEKLAIAFGLIATPPGETLRVIKNLRICG 552

Query: 547 DCHVVIKLISKIYNREIVVRDRTRFHRFKEG 569
           DCHVV KLISK Y R IV+RDR RFHRF++G
Sbjct: 553 DCHVVAKLISKAYGRVIVIRDRARFHRFEDG 579

BLAST of Cla97C08G151610 vs. Swiss-Prot
Match: sp|A8MQA3|PP330_ARATH (Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H28 PE=2 SV=2)

HSP 1 Score: 369.4 bits (947), Expect = 7.8e-101
Identity = 246/563 (43.69%), Positives = 367/563 (65.19%), Query Frame = 0

Query: 12  SSFSQIKQLQANLIING--RFQFSSSRTMLLELCAISSFGDLSYALHIFRHIRYP-STND 71
           SS ++++Q+ A  I +G         + ++  L ++ S   +SYA  +F  I  P +   
Sbjct: 28  SSITKLRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHKVFSKIEKPINVFI 87

Query: 72  WNAVIRGTALSSDPENAVFWYRAMAASNGPHRIDALTCSFALKACARALARSEAMQLHSQ 131
           WN +IRG A   +  +A   YR M  S G    D  T  F +KA            +HS 
Sbjct: 88  WNTLIRGYAEIGNSISAFSLYREMRVS-GLVEPDTHTYPFLIKAVTTMADVRLGETIHSV 147

Query: 132 LLRFGFNADVLLQTTLLDAYAKVGDLDHAQNLFDEMPQPDIASWNALIAGFAQGSRPGDA 191
           ++R GF + + +Q +LL  YA  GD+  A  +FD+MP+ D+ +WN++I GFA+  +P +A
Sbjct: 148 VIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSVINGFAENGKPEEA 207

Query: 192 IMMFKKMKEDGNLRPNGVTVQGALLACSQLGALKEGENVHKHILEEKLDMNVQVCNVVID 251
           + ++ +M   G ++P+G T+   L AC+++GAL  G+ VH ++++  L  N+   NV++D
Sbjct: 208 LALYTEMNSKG-IKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRNLHSSNVLLD 267

Query: 252 MYAKCGSVDKAYWVFENMRCDKSLITWNTMIMAFAMHGDGYKALDLFEKLGRC-GMPPDA 311
           +YA+CG V++A  +F+ M  DK+ ++W ++I+  A++G G +A++LF+ +    G+ P  
Sbjct: 268 LYARCGRVEEAKTLFDEM-VDKNSVSWTSLIVGLAVNGFGKEAIELFKYMESTEGLLPCE 327

Query: 312 VSYLAVLCACNHAGLVEDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 371
           ++++ +L AC+       XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 328 ITFVGILYACSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 387

Query: 372 XP--FPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFISCGDFVLLSNVYAARQRWDD 431
           X    PN+V+W+TLLGAC  +G+ ++AE A  +++++     GD+VLLSN+YA+ QRW D
Sbjct: 388 XXPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLSNMYASEQRWSD 447

Query: 432 VGRVRDAMRRRDVKKTPGFSYIEVKGMMHKFVYGDQSHSSCREIYAKLDEIKFRIKAYGY 491
           V ++R  M R  VKK PG S +EV   +H+F+ GD+SH     IYAKL E+  R+++ GY
Sbjct: 448 VQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLKEMTGRLRSEGY 507

Query: 492 AAETGNVLHDIGDEDKENALCYHSEKLAVAFGLTCTEEGTLIQVIKNLRICGDCHVVIKL 551
             +  NV  D+ +E+KENA+ YHSEK+A+AF L  T E + I V+KNLR+C DCH+ IKL
Sbjct: 508 VPQISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLRVCADCHLAIKL 567

Query: 552 ISKIYNREIVVRDRTRFHRFKEG 569
           +SK+YNREIVVRDR+RFH FK G
Sbjct: 568 VSKVYNREIVVRDRSRFHHFKNG 587

BLAST of Cla97C08G151610 vs. Swiss-Prot
Match: sp|Q9LXY5|PP284_ARATH (Pentatricopeptide repeat-containing protein At3g56550 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H80 PE=2 SV=1)

HSP 1 Score: 367.9 bits (943), Expect = 2.3e-100
Identity = 198/567 (34.92%), Positives = 318/567 (56.08%), Query Frame = 0

Query: 7   LLQKCSSFSQIKQLQANLIINGRFQFSSSRTMLLELCAISSFGDLSYALHIFRHI-RYPS 66
           +LQ C+S  +++++ +++IING     S    LL  CA+S  G LS+A  +F H    PS
Sbjct: 11  MLQGCNSMKKLRKIHSHVIINGLQHHPSIFNHLLRFCAVSVTGSLSHAQLLFDHFDSDPS 70

Query: 67  TNDWNAVIRGTALSSDPENAVFWYRAMAASNGPHRIDALTCSFALKACARALARSEAMQL 126
           T+DWN +IRG + SS P N++ +Y  M  S+   R D  T +FALK+C R  +  + +++
Sbjct: 71  TSDWNYLIRGFSNSSSPLNSILFYNRMLLSS-VSRPDLFTFNFALKSCERIKSIPKCLEI 130

Query: 127 HSQLLRFGFNADVLLQTTLLDAYAKVGDLDHAQNLFDEMPQPDIASWNALIAGFAQGSRP 186
           H  ++R GF  D ++ T+L+  Y+  G ++ A  +FDEMP  D+ SWN +I  F+     
Sbjct: 131 HGSVIRSGFLDDAIVATSLVRCYSANGSVEIASKVFDEMPVRDLVSWNVMICCFSHVGLH 190

Query: 187 GDAIMMFKKMKEDGNLRPNGVTVQGALLACSQLGALKEGENVHKHILEEKLDMNVQVCNV 246
             A+ M+K+M  +G +  +  T+   L +C+ + AL  G  +H+   + + +  V V N 
Sbjct: 191 NQALSMYKRMGNEG-VCGDSYTLVALLSSCAHVSALNMGVMLHRIACDIRCESCVFVSNA 250

Query: 247 VIDMYAKCGSVDKAYWVFENMRCDKSLITWNTMIMAFAMHGDGYKALDLFEKLGRCGMPP 306
           +IDMYAKCGS++ A  VF  MR  + ++TWN+MI+ + +HG G +A+  F K+   G+ P
Sbjct: 251 LIDMYAKCGSLENAIGVFNGMR-KRDVLTWNSMIIGYGVHGHGVEAISFFRKMVASGVRP 310

Query: 307 DAVSYLAVLCACNHAGLVEDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 366
           +A+++L +L  C+H GLV++                                        
Sbjct: 311 NAITFLGLLLGCSHQGLVKEGVEHFEIMSSQFHLTPNVKHYGCMVDLYGRAGQLENSLEM 370

Query: 367 XXXP--FPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFISCGDFVLLSNVYAARQRW 426
                   + VLW+TLLG+C+ + N+E+ E+A +KLV++   + GD+VL++++Y+A    
Sbjct: 371 IYASSCHEDPVLWRTLLGSCKIHRNLELGEVAMKKLVQLEAFNAGDYVLMTSIYSAANDA 430

Query: 427 DDVGRVRDAMRRRDVKKTPGFSYIEVKGMMHKFVYGDQSHSSCREIYAKLDEIKFRIKAY 486
                +R  +R  D++  PG+S+IE+   +HKFV  D+ H     IY++L E+  R    
Sbjct: 431 QAFASMRKLIRSHDLQTVPGWSWIEIGDQVHKFVVDDKMHPESAVIYSELGEVINRAILA 490

Query: 487 GYAAETGN-VLHDIGDEDKENALCYHSEKLAVAFGLTCTEEGTLIQVIKNLRICGDCHVV 546
           GY  E  N     + D    +A   HSEKLA+A+GL  T  GT +++ KNLR+C DCH  
Sbjct: 491 GYKPEDSNRTAPTLSDRCLGSADTSHSEKLAIAYGLMRTTAGTTLRITKNLRVCRDCHSF 550

Query: 547 IKLISKIYNREIVVRDRTRFHRFKEGL 570
            K +SK +NREI+VRDR RFH F +G+
Sbjct: 551 TKYVSKAFNREIIVRDRVRFHHFADGI 574

BLAST of Cla97C08G151610 vs. Swiss-Prot
Match: sp|Q9C6T2|PPR68_ARATH (Pentatricopeptide repeat-containing protein At1g31920 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H11 PE=2 SV=1)

HSP 1 Score: 366.3 bits (939), Expect = 6.6e-100
Identity = 200/567 (35.27%), Positives = 321/567 (56.61%), Query Frame = 0

Query: 7   LLQKCSSFSQIKQLQANLIINGRFQFSS-SRTMLLELCAISSF-GDLSYALHIFRHIRYP 66
           LL++C +  + KQ+ A  I    F  SS S + +L  CA S +   ++YA  IFR I  P
Sbjct: 36  LLKRCHNIDEFKQVHARFIKLSLFYSSSFSASSVLAKCAHSGWENSMNYAASIFRGIDDP 95

Query: 67  STNDWNAVIRG-TALSSDPENAVFWYRAMAASNGPHRIDALTCSFALKACARALARSEAM 126
            T D+N +IRG   + S  E   F+   M   N P   D  T    LKAC R  +  E  
Sbjct: 96  CTFDFNTMIRGYVNVMSFEEALCFYNEMMQRGNEP---DNFTYPCLLKACTRLKSIREGK 155

Query: 127 QLHSQLLRFGFNADVLLQTTLLDAYAKVGDLDHAQNLFDEMPQPDIASWNALIAGFAQGS 186
           Q+H Q+ + G  ADV +Q +L++ Y + G+++ +  +F+++     ASW+++++  A   
Sbjct: 156 QIHGQVFKLGLEADVFVQNSLINMYGRCGEMELSSAVFEKLESKTAASWSSMVSARAGMG 215

Query: 187 RPGDAIMMFKKMKEDGNLRPNGVTVQGALLACSQLGALKEGENVHKHILEEKLDMNVQVC 246
              + +++F+ M  + NL+     +  ALLAC+  GAL  G ++H  +L    ++N+ V 
Sbjct: 216 MWSECLLLFRGMCSETNLKAEESGMVSALLACANTGALNLGMSIHGFLLRNISELNIIVQ 275

Query: 247 NVVIDMYAKCGSVDKAYWVFENMRCDKSLITWNTMIMAFAMHGDGYKALDLFEKLGRCGM 306
             ++DMY KCG +DKA  +F+ M   ++ +T++ MI   A+HG+G  AL +F K+ + G+
Sbjct: 276 TSLVDMYVKCGCLDKALHIFQKME-KRNNLTYSAMISGLALHGEGESALRMFSKMIKEGL 335

Query: 307 PPDAVSYLAVLCACNHAGLV-EDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 366
            PD V Y++VL AC+H+GLV E                                      
Sbjct: 336 EPDHVVYVSVLNACSHSGLVKEGRRVFAEMLKEGKVEPTAEHYGCLVDLLGRAGLLEEAL 395

Query: 367 XXXXXXPF-PNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFISCGDFVLLSNVYAARQ 426
                 P   N V+W+T L  CR   N+E+ ++A+++L+++   + GD++L+SN+Y+  Q
Sbjct: 396 ETIQSIPIEKNDVIWRTFLSQCRVRQNIELGQIAAQELLKLSSHNPGDYLLISNLYSQGQ 455

Query: 427 RWDDVGRVRDAMRRRDVKKTPGFSYIEVKGMMHKFVYGDQSHSSCREIYAKLDEIKFRIK 486
            WDDV R R  +  + +K+TPGFS +E+KG  H+FV  D+SH  C+EIY  L ++++++K
Sbjct: 456 MWDDVARTRTEIAIKGLKQTPGFSIVELKGKTHRFVSQDRSHPKCKEIYKMLHQMEWQLK 515

Query: 487 AYGYAAETGNVLHDIGDEDKENALCYHSEKLAVAFGLTCTEEGTLIQVIKNLRICGDCHV 546
             GY+ +   +L ++ +E+K+  L  HS+K+A+AFGL  T  G++I++ +NLR+C DCH 
Sbjct: 516 FEGYSPDLTQILLNVDEEEKKERLKGHSQKVAIAFGLLYTPPGSIIKIARNLRMCSDCHT 575

Query: 547 VIKLISKIYNREIVVRDRTRFHRFKEG 569
             K IS IY REIVVRDR RFH FK G
Sbjct: 576 YTKKISMIYEREIVVRDRNRFHLFKGG 598

BLAST of Cla97C08G151610 vs. TAIR10
Match: AT1G34160.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 591.7 bits (1524), Expect = 5.3e-169
Identity = 317/574 (55.23%), Positives = 386/574 (67.25%), Query Frame = 0

Query: 3   YFNLLLQKCSSFSQIKQLQANLIINGRFQFSSSRTMLLELCAISSFGDLSYALHIFRHIR 62
           Y   ++QKC SFSQIKQLQ++ +  G FQ S  R+ LLE CAIS FGDLS+A+ IFR+I 
Sbjct: 5   YMETMIQKCVSFSQIKQLQSHFLTAGHFQSSFLRSRLLERCAISPFGDLSFAVQIFRYIP 64

Query: 63  YPSTNDWNAVIRGTALSSDPENAVFWYRAM----AASNGPHRIDALTCSFALKACARALA 122
            P TNDWNA+IRG A SS P  A  WYR+M    ++S+   R+DALTCSF LKACARAL 
Sbjct: 65  KPLTNDWNAIIRGFAGSSHPSLAFSWYRSMLQQSSSSSAICRVDALTCSFTLKACARALC 124

Query: 123 RSEAMQLHSQLLRFGFNADVLLQTTLLDAYAKVGDLDHAQNLFDEMPQPDIASWNALIAG 182
            S   QLH Q+ R G +AD LL TTLLDAY+K GDL  A  LFDEMP  D+ASWNALIAG
Sbjct: 125 SSAMDQLHCQINRRGLSADSLLCTTLLDAYSKNGDLISAYKLFDEMPVRDVASWNALIAG 184

Query: 183 FAQGSRPGDAIMMFKKMKEDGNLRPNGVTVQGALLACSQLGALKEGENVHKHILEEKLDM 242
              G+R  +A+ ++K+M+ +G +R + VTV  AL ACS LG +KEGEN+      +    
Sbjct: 185 LVSGNRASEAMELYKRMETEG-IRRSEVTVVAALGACSHLGDVKEGENIFHGYSND---- 244

Query: 243 NVQVCNVVIDMYAKCGSVDKAYWVFENMRCDKSLITWNTMIMAFAMHGDGYKALDLFEKL 302
           NV V N  IDMY+KCG VDKAY VFE     KS++TWNTMI  FA+HG+ ++AL++F+KL
Sbjct: 245 NVIVSNAAIDMYSKCGFVDKAYQVFEQFTGKKSVVTWNTMITGFAVHGEAHRALEIFDKL 304

Query: 303 GRCGMPPDAVSYLAVLCACNHAGLVE-DXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 362
              G+ PD VSYLA L AC HAGLVE                                  
Sbjct: 305 EDNGIKPDDVSYLAALTACRHAGLVEYGLSVFNNMACKGVERNMKHYGCVVDLLSRAGRL 364

Query: 363 XXXXXXXXXXXPFPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFISCGDFVLLSNVY 422
                        P+ VLWQ+LLGA   Y +VEMAE+ASR++ EMG  + GDFVLLSNVY
Sbjct: 365 REAHDIICSMSMIPDPVLWQSLLGASEIYSDVEMAEIASREIKEMGVNNDGDFVLLSNVY 424

Query: 423 AARQRWDDVGRVRDAMRRRDVKKTPGFSYIEVKGMMHKFVYGDQSHSSCREIYAKLDEIK 482
           AA+ RW DVGRVRD M  + VKK PG SYIE KG +H+F   D+SH   REIY K+DEI+
Sbjct: 425 AAQGRWKDVGRVRDDMESKQVKKIPGLSYIEAKGTIHEFYNSDKSHEQWREIYEKIDEIR 484

Query: 483 FRIKAYGYAAETGNVLHDIGDEDKENALCYHSEKLAVAFGLTC---TEEGTLIQVIKNLR 542
           F+I+  GY A+TG VLHDIG+E+KENALCYHSEKLAVA+GL      +E + ++VI NLR
Sbjct: 485 FKIREDGYVAQTGLVLHDIGEEEKENALCYHSEKLAVAYGLMMMDGADEESPVRVINNLR 544

Query: 543 ICGDCHVVIKLISKIYNREIVVRDRTRFHRFKEG 569
           ICGDCHVV K ISKIY REI+VRDR RFHRFK+G
Sbjct: 545 ICGDCHVVFKHISKIYKREIIVRDRVRFHRFKDG 573

BLAST of Cla97C08G151610 vs. TAIR10
Match: AT4G21065.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 369.4 bits (947), Expect = 4.3e-102
Identity = 246/563 (43.69%), Positives = 367/563 (65.19%), Query Frame = 0

Query: 12  SSFSQIKQLQANLIING--RFQFSSSRTMLLELCAISSFGDLSYALHIFRHIRYP-STND 71
           SS ++++Q+ A  I +G         + ++  L ++ S   +SYA  +F  I  P +   
Sbjct: 28  SSITKLRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHKVFSKIEKPINVFI 87

Query: 72  WNAVIRGTALSSDPENAVFWYRAMAASNGPHRIDALTCSFALKACARALARSEAMQLHSQ 131
           WN +IRG A   +  +A   YR M  S G    D  T  F +KA            +HS 
Sbjct: 88  WNTLIRGYAEIGNSISAFSLYREMRVS-GLVEPDTHTYPFLIKAVTTMADVRLGETIHSV 147

Query: 132 LLRFGFNADVLLQTTLLDAYAKVGDLDHAQNLFDEMPQPDIASWNALIAGFAQGSRPGDA 191
           ++R GF + + +Q +LL  YA  GD+  A  +FD+MP+ D+ +WN++I GFA+  +P +A
Sbjct: 148 VIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSVINGFAENGKPEEA 207

Query: 192 IMMFKKMKEDGNLRPNGVTVQGALLACSQLGALKEGENVHKHILEEKLDMNVQVCNVVID 251
           + ++ +M   G ++P+G T+   L AC+++GAL  G+ VH ++++  L  N+   NV++D
Sbjct: 208 LALYTEMNSKG-IKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRNLHSSNVLLD 267

Query: 252 MYAKCGSVDKAYWVFENMRCDKSLITWNTMIMAFAMHGDGYKALDLFEKLGRC-GMPPDA 311
           +YA+CG V++A  +F+ M  DK+ ++W ++I+  A++G G +A++LF+ +    G+ P  
Sbjct: 268 LYARCGRVEEAKTLFDEM-VDKNSVSWTSLIVGLAVNGFGKEAIELFKYMESTEGLLPCE 327

Query: 312 VSYLAVLCACNHAGLVEDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 371
           ++++ +L AC+       XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 328 ITFVGILYACSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 387

Query: 372 XP--FPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFISCGDFVLLSNVYAARQRWDD 431
           X    PN+V+W+TLLGAC  +G+ ++AE A  +++++     GD+VLLSN+YA+ QRW D
Sbjct: 388 XXPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLSNMYASEQRWSD 447

Query: 432 VGRVRDAMRRRDVKKTPGFSYIEVKGMMHKFVYGDQSHSSCREIYAKLDEIKFRIKAYGY 491
           V ++R  M R  VKK PG S +EV   +H+F+ GD+SH     IYAKL E+  R+++ GY
Sbjct: 448 VQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLKEMTGRLRSEGY 507

Query: 492 AAETGNVLHDIGDEDKENALCYHSEKLAVAFGLTCTEEGTLIQVIKNLRICGDCHVVIKL 551
             +  NV  D+ +E+KENA+ YHSEK+A+AF L  T E + I V+KNLR+C DCH+ IKL
Sbjct: 508 VPQISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLRVCADCHLAIKL 567

Query: 552 ISKIYNREIVVRDRTRFHRFKEG 569
           +SK+YNREIVVRDR+RFH FK G
Sbjct: 568 VSKVYNREIVVRDRSRFHHFKNG 587

BLAST of Cla97C08G151610 vs. TAIR10
Match: AT3G56550.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 367.9 bits (943), Expect = 1.3e-101
Identity = 198/567 (34.92%), Positives = 318/567 (56.08%), Query Frame = 0

Query: 7   LLQKCSSFSQIKQLQANLIINGRFQFSSSRTMLLELCAISSFGDLSYALHIFRHI-RYPS 66
           +LQ C+S  +++++ +++IING     S    LL  CA+S  G LS+A  +F H    PS
Sbjct: 11  MLQGCNSMKKLRKIHSHVIINGLQHHPSIFNHLLRFCAVSVTGSLSHAQLLFDHFDSDPS 70

Query: 67  TNDWNAVIRGTALSSDPENAVFWYRAMAASNGPHRIDALTCSFALKACARALARSEAMQL 126
           T+DWN +IRG + SS P N++ +Y  M  S+   R D  T +FALK+C R  +  + +++
Sbjct: 71  TSDWNYLIRGFSNSSSPLNSILFYNRMLLSS-VSRPDLFTFNFALKSCERIKSIPKCLEI 130

Query: 127 HSQLLRFGFNADVLLQTTLLDAYAKVGDLDHAQNLFDEMPQPDIASWNALIAGFAQGSRP 186
           H  ++R GF  D ++ T+L+  Y+  G ++ A  +FDEMP  D+ SWN +I  F+     
Sbjct: 131 HGSVIRSGFLDDAIVATSLVRCYSANGSVEIASKVFDEMPVRDLVSWNVMICCFSHVGLH 190

Query: 187 GDAIMMFKKMKEDGNLRPNGVTVQGALLACSQLGALKEGENVHKHILEEKLDMNVQVCNV 246
             A+ M+K+M  +G +  +  T+   L +C+ + AL  G  +H+   + + +  V V N 
Sbjct: 191 NQALSMYKRMGNEG-VCGDSYTLVALLSSCAHVSALNMGVMLHRIACDIRCESCVFVSNA 250

Query: 247 VIDMYAKCGSVDKAYWVFENMRCDKSLITWNTMIMAFAMHGDGYKALDLFEKLGRCGMPP 306
           +IDMYAKCGS++ A  VF  MR  + ++TWN+MI+ + +HG G +A+  F K+   G+ P
Sbjct: 251 LIDMYAKCGSLENAIGVFNGMR-KRDVLTWNSMIIGYGVHGHGVEAISFFRKMVASGVRP 310

Query: 307 DAVSYLAVLCACNHAGLVEDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 366
           +A+++L +L  C+H GLV++                                        
Sbjct: 311 NAITFLGLLLGCSHQGLVKEGVEHFEIMSSQFHLTPNVKHYGCMVDLYGRAGQLENSLEM 370

Query: 367 XXXP--FPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFISCGDFVLLSNVYAARQRW 426
                   + VLW+TLLG+C+ + N+E+ E+A +KLV++   + GD+VL++++Y+A    
Sbjct: 371 IYASSCHEDPVLWRTLLGSCKIHRNLELGEVAMKKLVQLEAFNAGDYVLMTSIYSAANDA 430

Query: 427 DDVGRVRDAMRRRDVKKTPGFSYIEVKGMMHKFVYGDQSHSSCREIYAKLDEIKFRIKAY 486
                +R  +R  D++  PG+S+IE+   +HKFV  D+ H     IY++L E+  R    
Sbjct: 431 QAFASMRKLIRSHDLQTVPGWSWIEIGDQVHKFVVDDKMHPESAVIYSELGEVINRAILA 490

Query: 487 GYAAETGN-VLHDIGDEDKENALCYHSEKLAVAFGLTCTEEGTLIQVIKNLRICGDCHVV 546
           GY  E  N     + D    +A   HSEKLA+A+GL  T  GT +++ KNLR+C DCH  
Sbjct: 491 GYKPEDSNRTAPTLSDRCLGSADTSHSEKLAIAYGLMRTTAGTTLRITKNLRVCRDCHSF 550

Query: 547 IKLISKIYNREIVVRDRTRFHRFKEGL 570
            K +SK +NREI+VRDR RFH F +G+
Sbjct: 551 TKYVSKAFNREIIVRDRVRFHHFADGI 574

BLAST of Cla97C08G151610 vs. TAIR10
Match: AT1G31920.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 366.3 bits (939), Expect = 3.6e-101
Identity = 200/567 (35.27%), Positives = 321/567 (56.61%), Query Frame = 0

Query: 7   LLQKCSSFSQIKQLQANLIINGRFQFSS-SRTMLLELCAISSF-GDLSYALHIFRHIRYP 66
           LL++C +  + KQ+ A  I    F  SS S + +L  CA S +   ++YA  IFR I  P
Sbjct: 36  LLKRCHNIDEFKQVHARFIKLSLFYSSSFSASSVLAKCAHSGWENSMNYAASIFRGIDDP 95

Query: 67  STNDWNAVIRG-TALSSDPENAVFWYRAMAASNGPHRIDALTCSFALKACARALARSEAM 126
            T D+N +IRG   + S  E   F+   M   N P   D  T    LKAC R  +  E  
Sbjct: 96  CTFDFNTMIRGYVNVMSFEEALCFYNEMMQRGNEP---DNFTYPCLLKACTRLKSIREGK 155

Query: 127 QLHSQLLRFGFNADVLLQTTLLDAYAKVGDLDHAQNLFDEMPQPDIASWNALIAGFAQGS 186
           Q+H Q+ + G  ADV +Q +L++ Y + G+++ +  +F+++     ASW+++++  A   
Sbjct: 156 QIHGQVFKLGLEADVFVQNSLINMYGRCGEMELSSAVFEKLESKTAASWSSMVSARAGMG 215

Query: 187 RPGDAIMMFKKMKEDGNLRPNGVTVQGALLACSQLGALKEGENVHKHILEEKLDMNVQVC 246
              + +++F+ M  + NL+     +  ALLAC+  GAL  G ++H  +L    ++N+ V 
Sbjct: 216 MWSECLLLFRGMCSETNLKAEESGMVSALLACANTGALNLGMSIHGFLLRNISELNIIVQ 275

Query: 247 NVVIDMYAKCGSVDKAYWVFENMRCDKSLITWNTMIMAFAMHGDGYKALDLFEKLGRCGM 306
             ++DMY KCG +DKA  +F+ M   ++ +T++ MI   A+HG+G  AL +F K+ + G+
Sbjct: 276 TSLVDMYVKCGCLDKALHIFQKME-KRNNLTYSAMISGLALHGEGESALRMFSKMIKEGL 335

Query: 307 PPDAVSYLAVLCACNHAGLV-EDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 366
            PD V Y++VL AC+H+GLV E                                      
Sbjct: 336 EPDHVVYVSVLNACSHSGLVKEGRRVFAEMLKEGKVEPTAEHYGCLVDLLGRAGLLEEAL 395

Query: 367 XXXXXXPF-PNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFISCGDFVLLSNVYAARQ 426
                 P   N V+W+T L  CR   N+E+ ++A+++L+++   + GD++L+SN+Y+  Q
Sbjct: 396 ETIQSIPIEKNDVIWRTFLSQCRVRQNIELGQIAAQELLKLSSHNPGDYLLISNLYSQGQ 455

Query: 427 RWDDVGRVRDAMRRRDVKKTPGFSYIEVKGMMHKFVYGDQSHSSCREIYAKLDEIKFRIK 486
            WDDV R R  +  + +K+TPGFS +E+KG  H+FV  D+SH  C+EIY  L ++++++K
Sbjct: 456 MWDDVARTRTEIAIKGLKQTPGFSIVELKGKTHRFVSQDRSHPKCKEIYKMLHQMEWQLK 515

Query: 487 AYGYAAETGNVLHDIGDEDKENALCYHSEKLAVAFGLTCTEEGTLIQVIKNLRICGDCHV 546
             GY+ +   +L ++ +E+K+  L  HS+K+A+AFGL  T  G++I++ +NLR+C DCH 
Sbjct: 516 FEGYSPDLTQILLNVDEEEKKERLKGHSQKVAIAFGLLYTPPGSIIKIARNLRMCSDCHT 575

Query: 547 VIKLISKIYNREIVVRDRTRFHRFKEG 569
             K IS IY REIVVRDR RFH FK G
Sbjct: 576 YTKKISMIYEREIVVRDRNRFHLFKGG 598

BLAST of Cla97C08G151610 vs. TAIR10
Match: AT3G47530.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 348.2 bits (892), Expect = 1.0e-95
Identity = 196/559 (35.06%), Positives = 304/559 (54.38%), Query Frame = 0

Query: 17  IKQLQANLIINGRFQFSSSRTMLLELCAISSF-GDLSYALHIFRHIRYPSTNDWNAVIRG 76
           ++Q+ A L+     + S      L   A+S    D++Y+  +F     P+ +  N +IR 
Sbjct: 27  LRQIHALLLRTSLIRNSDVFHHFLSRLALSLIPRDINYSCRVFSQRLNPTLSHCNTMIRA 86

Query: 77  TALSSDPENAVFWYRAMAASNGPHRIDALTCSFALKACARALARSEAMQLHSQLLRFGFN 136
            +LS  P      +R++   N     + L+ SFALK C ++      +Q+H ++   GF 
Sbjct: 87  FSLSQTPCEGFRLFRSL-RRNSSLPANPLSSSFALKCCIKSGDLLGGLQIHGKIFSDGFL 146

Query: 137 ADVLLQTTLLDAYAKVGDLDHAQNLFDEMPQPDIASWNALIAGFAQGSRPGDAIMMFKKM 196
           +D LL TTL+D Y+   +   A  +FDE+P+ D  SWN L + + +  R  D +++F KM
Sbjct: 147 SDSLLMTTLMDLYSTCENSTDACKVFDEIPKRDTVSWNVLFSCYLRNKRTRDVLVLFDKM 206

Query: 197 KE--DGNLRPNGVTVQGALLACSQLGALKEGENVHKHILEEKLDMNVQVCNVVIDMYAKC 256
           K   DG ++P+GVT   AL AC+ LGAL  G+ VH  I E  L   + + N ++ MY++C
Sbjct: 207 KNDVDGCVKPDGVTCLLALQACANLGALDFGKQVHDFIDENGLSGALNLSNTLVSMYSRC 266

Query: 257 GSVDKAYWVFENMRCDKSLITWNTMIMAFAMHGDGYKALDLFEKLGRCGMPPDAVSYLAV 316
           GS+DKAY VF  MR ++++++W  +I   AM+G G +A++ F ++ + G+ P+  +   +
Sbjct: 267 GSMDKAYQVFYGMR-ERNVVSWTALISGLAMNGFGKEAIEAFNEMLKFGISPEEQTLTGL 326

Query: 317 LCACNHAGLVEDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPF--- 376
           L AC+H+GLV +                                                
Sbjct: 327 LSACSHSGLVAEGMMFFDRMRSGEFKIKPNLHHYGCVVDLLGRARLLDKAYSLIKSMEMK 386

Query: 377 PNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFISCGDFVLLSNVYAARQRWDDVGRVR 436
           P+  +W+TLLGACR +G+VE+ E     L+E+     GD+VLL N Y+   +W+ V  +R
Sbjct: 387 PDSTIWRTLLGACRVHGDVELGERVISHLIELKAEEAGDYVLLLNTYSTVGKWEKVTELR 446

Query: 437 DAMRRRDVKKTPGFSYIEVKGMMHKFVYGDQSHSSCREIYAKLDEIKFRIKAYGYAAETG 496
             M+ + +   PG S IE++G +H+F+  D SH    EIY  L EI  ++K  GY AE  
Sbjct: 447 SLMKEKRIHTKPGCSAIELQGTVHEFIVDDVSHPRKEEIYKMLAEINQQLKIAGYVAEIT 506

Query: 497 NVLHDI-GDEDKENALCYHSEKLAVAFGLTCTEEGTLIQVIKNLRICGDCHVVIKLISKI 556
           + LH++  +E+K  AL YHSEKLA+AFG+  T  GT I+V KNLR C DCH   K +S +
Sbjct: 507 SELHNLESEEEKGYALRYHSEKLAIAFGILVTPPGTTIRVTKNLRTCVDCHNFAKFVSDV 566

Query: 557 YNREIVVRDRTRFHRFKEG 569
           Y+R ++VRDR+RFH FK G
Sbjct: 567 YDRIVIVRDRSRFHHFKGG 583

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
XP_004140941.1	4.2e-281	85.76	PREDICTED: pentatricopeptide repeat-containing protein At1g34160 [Cucumis sativu...
XP_008456696.1	2.3e-279	84.71	PREDICTED: pentatricopeptide repeat-containing protein At1g34160 [Cucumis melo]
XP_022133715.1	8.6e-274	90.51	pentatricopeptide repeat-containing protein At1g34160 [Momordica charantia]
XP_023526446.1	6.4e-269	88.40	pentatricopeptide repeat-containing protein At1g34160 isoform X1 [Cucurbita pepo...
XP_022990287.1	1.1e-268	81.20	pentatricopeptide repeat-containing protein At1g34160 isoform X1 [Cucurbita maxi...

Match Name	E-value	Identity (%)	Description
tr|A0A0A0KB77|A0A0A0KB77_CUCSA	2.8e-281	85.76	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G052690 PE=4 SV=1
tr|A0A1S3C3U7|A0A1S3C3U7_CUCME	1.5e-279	84.71	pentatricopeptide repeat-containing protein At1g34160 OS=Cucumis melo OX=3656 GN...
tr|A0A2I4FME1|A0A2I4FME1_9ROSI	1.2e-215	68.25	pentatricopeptide repeat-containing protein At1g34160 OS=Juglans regia OX=51240 ...
tr|A0A2N9EDZ7|A0A2N9EDZ7_FAGSY	6.6e-214	74.91	Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS746 PE=4 SV=1
tr|M5VIA7|M5VIA7_PRUPE	5.6e-213	73.51	Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_8G045500 PE=4 SV=1

Match Name	E-value	Identity (%)	Description
sp|Q9FX24|PPR71_ARATH	9.6e-168	55.23	Pentatricopeptide repeat-containing protein At1g34160 OS=Arabidopsis thaliana OX...
sp|B8YEK4|OGR1_ORYSJ	6.3e-135	46.94	Pentatricopeptide repeat-containing protein OGR1, mitochondrial OS=Oryza sativa ...
sp|A8MQA3|PP330_ARATH	7.8e-101	43.69	Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX...
sp|Q9LXY5|PP284_ARATH	2.3e-100	34.92	Pentatricopeptide repeat-containing protein At3g56550 OS=Arabidopsis thaliana OX...
sp|Q9C6T2|PPR68_ARATH	6.6e-100	35.27	Pentatricopeptide repeat-containing protein At1g31920 OS=Arabidopsis thaliana OX...

Match Name	E-value	Identity (%)	Description
AT1G34160.1	5.3e-169	55.23	Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G21065.1	4.3e-102	43.69	Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G56550.1	1.3e-101	34.92	Pentatricopeptide repeat (PPR) superfamily protein
AT1G31920.1	3.6e-101	35.27	Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G47530.1	1.0e-95	35.06	Pentatricopeptide repeat (PPR) superfamily protein

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term	Definition
GO:0008270	zinc ion binding
GO:0005515	protein binding
Vocabulary: INTERPRO
Term	Definition
IPR032867	DYW_dom
IPR002885	Pentatricopeptide_repeat
IPR011990	TPR-like_helical_dom_sf
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0080156 mitochondrial mRNA modification
biological_process GO:0031425 chloroplast RNA processing
biological_process GO:0009451 RNA modification
cellular_component GO:0005575 cellular_component
cellular_component GO:0005739 mitochondrion
cellular_component GO:0009507 chloroplast
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0005515 protein binding
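
Each row above carries three fields: the GO category, the term accession, and the term name. The following Python sketch groups the rows by category and drops the two rows whose name equals the category (GO:0008150 and GO:0005575 are the ontology root terms and carry no specific information). The function name `group_go_terms` is a hypothetical helper, and the single-space field separator is an assumption about this plain-text rendering.

```python
from collections import defaultdict

# GO assignment rows copied from this record: category, accession, term name.
GO_ROWS = """\
biological_process GO:0008150 biological_process
biological_process GO:0080156 mitochondrial mRNA modification
biological_process GO:0031425 chloroplast RNA processing
biological_process GO:0009451 RNA modification
cellular_component GO:0005575 cellular_component
cellular_component GO:0005739 mitochondrion
cellular_component GO:0009507 chloroplast
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0005515 protein binding
"""

def group_go_terms(text):
    """Group (accession, name) pairs by category, skipping root terms."""
    grouped = defaultdict(list)
    for line in text.strip().splitlines():
        # Term names may contain spaces, so split on the first two only.
        category, accession, name = line.split(" ", 2)
        if name == category:  # root term of the ontology
            continue
        grouped[category].append((accession, name))
    return dict(grouped)

for category, terms in group_go_terms(GO_ROWS).items():
    print(category, terms)
```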

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cla97C08G151610.1	Cla97C08G151610.1	mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10		coord: 7..150; e-value: 9.7E-13; score: 49.8
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10		coord: 271..468; e-value: 8.8E-33; score: 115.9
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10		coord: 156..270; e-value: 4.2E-26; score: 94.0
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 243..268; e-value: 1.6E-4; score: 21.6
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 171..199; e-value: 5.4E-6; score: 26.2
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 142..165; e-value: 2.5E-4; score: 21.0
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 271..317; e-value: 6.0E-8; score: 32.7
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 242..269; e-value: 0.0027; score: 15.8
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 273..307; e-value: 1.4E-5; score: 22.9
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 308..342; e-value: 1.9E-5; score: 22.5
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 171..199; e-value: 6.0E-6; score: 24.1
IPR002885	Pentatricopeptide repeat	PFAM	PF12854	PPR_1	coord: 336..368; e-value: 2.3E-5; score: 23.9
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 204..238; score: 5.722
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 271..305; score: 10.633
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 65..99; score: 5.821
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 168..202; score: 10.106
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 102..136; score: 7.399
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 406..440; score: 5.744
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 341..375; score: 8.364
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 306..340; score: 10.775
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 239..269; score: 9.153
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 137..167; score: 8.616
IPR032867	DYW domain	PFAM	PF14432	DYW_deaminase	coord: 442..566; e-value: 3.6E-38; score: 130.2
None	No IPR available	PANTHER	PTHR24015	FAMILY NOT NAMED	coord: 4..495
None	No IPR available	PANTHER	PTHR24015:SF717	SUBFAMILY NOT NAMED	coord: 4..495
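
The PROSITE PS51375 hits are listed out of positional order, which hides how the PPR repeats are arranged along the protein. The following Python sketch sorts the coordinates and reports the spacing between neighbouring repeats. The helper `repeat_gaps` is hypothetical, and the coordinates are assumed to be inclusive residue positions.

```python
# PROSITE PS51375 (PPR) repeat coordinates copied from the InterPro table,
# as inclusive (start, end) residue positions.
PPR_REPEATS = [
    (204, 238), (271, 305), (65, 99), (168, 202), (102, 136),
    (406, 440), (341, 375), (306, 340), (239, 269), (137, 167),
]

def repeat_gaps(repeats):
    """Return (prev_end, next_start, gap) for each pair of neighbouring repeats.

    gap is the number of residues lying strictly between the two repeats.
    """
    ordered = sorted(repeats)
    return [
        (prev_end, next_start, next_start - prev_end - 1)
        for (_, prev_end), (next_start, _) in zip(ordered, ordered[1:])
    ]

gaps = repeat_gaps(PPR_REPEATS)
largest = max(gaps, key=lambda g: g[2])
print(len(PPR_REPEATS), "repeats; largest gap:", largest)
```

Under these assumptions the ten repeats do not overlap. Nine of them run nearly back to back from residue 65 to 375, and the tenth (406..440) sits immediately before the DYW domain hit at 442..566.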

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene	Organism	Block
Cla97C08G151610	Watermelon (97103) v2	wmbwmbB143
Cla97C08G151610	Silver-seed gourd	carwmbB0864
Cla97C08G151610	Cucumber (Gy14) v2	cgybwmbB464
Cla97C08G151610	Cucurbita maxima (Rimu)	cmawmbB317
Cla97C08G151610	Cucurbita maxima (Rimu)	cmawmbB788
Cla97C08G151610	Cucurbita moschata (Rifu)	cmowmbB300