Cla97C06G111560 (gene) Watermelon (97103) v2

Name: Cla97C06G111560
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: Pentatricopeptide repeat-containing family protein
Location: Cla97Chr06 : 2360658 .. 2364593 (+)
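
As a rough illustration, the location string of this record can be parsed to recover the chromosome, strand, and span of the feature. This is only a sketch: the function name `parse_location` is hypothetical, and the assumption that the coordinates are 1-based and inclusive is mine, not something the record states.

```python
import re

def parse_location(text):
    """Parse a location such as 'Cla97Chr06 : 2360658 .. 2364593 (+)'.

    Assumption (not stated by the record): coordinates are 1-based and
    inclusive, so the span is end - start + 1.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$"
    m = re.match(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = m.groups()
    start, end = int(start), int(end)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

loc = parse_location("Cla97Chr06 : 2360658 .. 2364593 (+)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the feature spans 3936 bp on the forward strand.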
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATAATATGGAGGGCTCCCTTGTGGGCGATATCTATTCCATTTGCCATGGAATATTCAACACCCCTTACCAAGCTAATTGCTCGCACTGGTGGTTTTGCGGCAACAGAAACTCACGGTGTGCGAGATTGGTCAAGCACAGTGGCTACAGAACACCCGATCGGACGCAAACCAGACGAGATCAATCGGACAGAAACTCTGGGCTGAGCGCCACAAAGTCTGGATGGTCGAACCAGCGAAAGGACGACGTTGCGGCGTGCACGGCGCAAGCGACAGCAGCCGCGACGTGGGTTGAAGACCGGCGGCTGTTGCAGGGAGAGAGAAAAATGGAAGGGACCAGCGGCCAGCCTTCACGAGAAGACAACCTTGAGGATTGGTTGAAGATTGTTTGGGCAAGAAAGGTTTATATATAGTTTGCTATAACATGGTTATAGCTTTATGTTACTTACTGATCAGGCCACATCAAAGAAAGTCATTACAAGGGAAGAAGCAGGAAAGAAGAAAGTCATTTCAAGCAACAAAGTTTAGCTTATATAGGAGGAATTAAATTATTGGGACGTTTTTAAATATAGTAAAATGAGTCAAACTACTTATAAATATAGAAAAATTTTATTGTATAGATCGCGATTGATGTTTATCATGATAGTGTATATCAATCAATCGCTAATAAACAGATAAATTTTTCTATATGTGTAAATAGTTTGATATTTTTTCTATTTATAATAATTTTTCTAAATAATTTGTGCAATACGATATTGACATATTTTTACTTTTTTTAATTAAGCATATTTATACAATTTTTAAAAGCGTAATATTTTTTAAATGAGAGCAATTTTTGTCTATATAAAAATAATAATAAATAACTTAAGAGTATTATTAAAACTTTTGAAAGTTTACAATATTTTTTTTATATAAAATGCTAATGATTTTCATTTAAAAAGTGGCGATAATTAAAATTTACTTTAAAGAACTATTGAAACTTTGAATGGTTAAAAAATTTCTTTTTTGAGTAAATTGCAAAAACTACATCCACCGTAGTATGGTGGTACTTATTACTTTCAAACTTTCAATGTAAAATTAAGCTCATAATTAAAATTGAACTCTTAAGCTTACAATAATGGTAAAAATTGAACACAAATGATAAAAATTGAACCTACATGCTTATACAAATATTAAAATTTATATAAGTTTATAAATTTGAAGGTTCAATTTGACCAATTTTAATTTCAAAATTGAATGTTTAAGGATGTAATTACAACTCCTATTATACTTTAGGGGTATTTTTGTAATTTAGACAAGAACTAAAGCAAACAACAAATCCAACCAATTAAAGCCAAATTGCTTTTGAAATCGATAAAGTGATTTTAAAAAAATCTTAACCGTGTTTCTTATTTACTAAATCAAATTATTAATTAATCTATCATAATCACTTTTTTAATTTTAACTAACAGGTAAAAGTTAATAGGCTTTTAGATTATGATTTTCTAAAAATCTACGACATTATTAACTATACATTGTAACGATATTCTTCCATATTAATATAATCTTCACACATATATTTTAAAATTTTAATTGATTCTATTTTTTATAACAATTATTTTTTAAATAAAATTGGCTAAAATTGAAATCATATAAAGTTCTAAAAAAAATTATAATAAATTATAAATTAATTATAAAATAATAAGTCTATTTTAGTTCTTATAATTAAATTGACCCAATTTAATATTAATTTATCAAATATCTAACTAATTTTTTTAATTTAAAACTAAAATTCACCGAATACTATTTTATTTTCTTGAACAGTAACTAGTATTTTTTATAATTTAACCAAATAAACACAATAAAAATAATAAATATAAAAAAAACTTCAAATATCAAGTTTTATTTGGAATCTTTTTAAATATAACAAAAGTAATTAAAATATTTACAAATTAAAATTTGACTGTGTTCATCGTAGATTATTATCTGCATCACGAAAGACACGATAGTCTAGTATAGT
TTATTTGTGGATCAAATATTTTGTTATTAGTTCTAAATATATTTTGATTTTTAAATATGAAGATAATTTCTCTATTTGTTTTTTCTTTTCTTTTTCCTTTCTTTTTGAATGCACCGCACTCTCTCCCCCAGGAATTCCACTGCACAGGATCGGCTGAACAGTTTCCGTCTATGCAAAATAGAGGAGGCGATATGCTTAGCCAAATTCAATCCGAGATAACTTCTTCAAGCGCCGAAGTTGATTTTGAGGTGGGACAGAAAATCCCACTAAGATGGTAAATTCCGAAGCTTTCAACCTTCCATCTGCACATTAGATGTTTGTTAAAATGTCGTGCCTGAAATTCGTCAAGATTCTCTTGTTTTCGCCGCAATTCGATTAGATTTTGGTCACTTGGTTCAAACATAAGCATGTTCTTGATTTATACACCAATTAAATAGGTATGACTGTTAGGTGGCCAAGGCTTTTAACGCCTACATATTTATCTCAGATTATTAGGAAGCAGAATAATCCCTTAACAGCTTACCAACTGTTTAAGGAAGCCAAATGTAGGTACCCAGATTATCGGCACAATGGTCCGGTGTACGCCGCGATGATTGATATACTCGGAAATTCGGGGAGGATTTCTGAGATGAGAGAAGTGTTGGATCAGATGAGAGATGACTCTTGTGAATGCAAAGATTCTGTATTTTCATTTGCAATTAAAACGTATGCTAGTCAGGGATTATTGGAAGATGGTATATCTCTTTTTAAAAGTCTTGGGAGATTTAACTGTACCGATAGAACACAAACTTTTAATACCCTTTTGGAAATCCTGTTGAATCAATCTCAGCTTGATGCTGCCTGTCAGCTTTTTCAGCAGTGTTCTTATGGTTGGGAAGTGAAATCCAGGACTCAGTCCTTGAATTTGCTGATGCAATCTCTCTGCCAGAGAGGCCAGTCTGAACTTGCTTTACATGTCTTTCAAGAAATGGATTACCAAAGTTGCTATCCAAATAGACTGAGTTATTTGATTTTAATGAAAGGATTGTGTCAAGATGGTAGGCTTAATGAGGCCATCCATTTGTTGTATTCCATGTTTTGGCGGATTTCTCAAAGGGGTGGTGGAGGGGACATAGTAATTTACAGAACCCTGCTGTTTGCTTTGTGTGATAATGGAGAGATAGAGCAAGCTGTGGAAATACTTGGCAAGATCTTGAGGAAAGGACTGAAAGCCCCTAAGCGAGCTCATTACCGGATTGACTTAGATCAATGCAGGAATAGCAAGCTCACTATTGGGGAAATCAAGAGTTTAATCAATGAAGCTTTAATCAAAGGCGGAATTCCCAGTTCAGATAGCTATTGTGCCATGGCTGTTGATCTATATAACGAAAATGAGATTGATCAGGGAGATAAAGTGGTTAGCCACATGGTAGCTAAAGGCTTCAGGCCACCGCCGTCGATCTATGAAGCAAAAGCGGCTGCATTATGCAAAGAAGGCAAAGTCGATGATGCAGTGAAAGTAATTGAAGAGGAAATAGTGAAGGGAAGTGGTGTTCCAACTGTTGCATTGTACAACATAGTTCTGAAGGGTCTGTGTGATGAGGGCAAATCAACAGTGGCTATGGAGTATTTGAAGAAAATGGCAAAGCAGGTCGGTCTTGTTGCAAACAAAGGAACTTACAGCACTTTAGTACATGGACTTTGTCGTGAAAATCGATACGTTGAAGCATGCAAGGTTTTAGAGGAGATGGTTATCAAATCGTTTTCGCCTTGTTCTAACACATTCAATACACTAATTAGAGGCCTTTGCTCAGTTGGAAAACACTATGAAGCTGTGATGTGGTTGGAAGAAATGATTAGCCAAGGTCAATTGCCTCATGTTTCTGTCTGGAATTCTTTGGTTTCATCTCTGTGTTGCAACCTGACTGGCACCGCTATGTGGTCCAAGGTTTTATGA

mRNA sequence

ATGGATAATATGGAGGGCTCCCTTGTGGGCGATATCTATTCCATTTGCCATGGAATATTCAACACCCCTTACCAAGCTAATTGCTCGCACTGGTGGTTTTGCGGCAACAGAAACTCACGGTGTGCGAGATTGGTCAAGCACAGTGGCTACAGAACACCCGATCGGACGCAAACCAGACGAGATCAATCGGACAGAAACTCTGGGCTGAGCGCCACAAAGTCTGGATGGTCGAACCAGCGAAAGGACGACGTTGCGGCGTGCACGGCGCAAGCGACAGCAGCCGCGACGTGGGTTGAAGACCGGCGGCTGTTGCAGGGAGAGAGAAAAATGGAAGGGACCAGCGGCCAGCCTTCACGAGAAGACAACCTTGAGGATTGGTTGAAGATTGTTTGGGCAAGAAAGGAATTCCACTGCACAGGATCGGCTGAACAGTTTCCGTCTATGCAAAATAGAGGAGGCGATATGCTTAGCCAAATTCAATCCGAGATAACTTCTTCAAGCGCCGAAGTTGATTTTGAGATTATTAGGAAGCAGAATAATCCCTTAACAGCTTACCAACTGTTTAAGGAAGCCAAATGTAGGTACCCAGATTATCGGCACAATGGTCCGGTGTACGCCGCGATGATTGATATACTCGGAAATTCGGGGAGGATTTCTGAGATGAGAGAAGTGTTGGATCAGATGAGAGATGACTCTTGTGAATGCAAAGATTCTGTATTTTCATTTGCAATTAAAACGTATGCTAGTCAGGGATTATTGGAAGATGGTATATCTCTTTTTAAAAGTCTTGGGAGATTTAACTGTACCGATAGAACACAAACTTTTAATACCCTTTTGGAAATCCTGTTGAATCAATCTCAGCTTGATGCTGCCTGTCAGCTTTTTCAGCAGTGTTCTTATGGTTGGGAAGTGAAATCCAGGACTCAGTCCTTGAATTTGCTGATGCAATCTCTCTGCCAGAGAGGCCAGTCTGAACTTGCTTTACATGTCTTTCAAGAAATGGATTACCAAAGTTGCTATCCAAATAGACTGAGTTATTTGATTTTAATGAAAGGATTGTGTCAAGATGGTAGGCTTAATGAGGCCATCCATTTGTTGTATTCCATGTTTTGGCGGATTTCTCAAAGGGGTGGTGGAGGGGACATAGTAATTTACAGAACCCTGCTGTTTGCTTTGTGTGATAATGGAGAGATAGAGCAAGCTGTGGAAATACTTGGCAAGATCTTGAGGAAAGGACTGAAAGCCCCTAAGCGAGCTCATTACCGGATTGACTTAGATCAATGCAGGAATAGCAAGCTCACTATTGGGGAAATCAAGAGTTTAATCAATGAAGCTTTAATCAAAGGCGGAATTCCCAGTTCAGATAGCTATTGTGCCATGGCTGTTGATCTATATAACGAAAATGAGATTGATCAGGGAGATAAAGTGGTTAGCCACATGGTAGCTAAAGGCTTCAGGCCACCGCCGTCGATCTATGAAGCAAAAGCGGCTGCATTATGCAAAGAAGGCAAAGTCGATGATGCAGTGAAAGTAATTGAAGAGGAAATAGTGAAGGGAAGTGGTGTTCCAACTGTTGCATTGTACAACATAGTTCTGAAGGGTCTGTGTGATGAGGGCAAATCAACAGTGGCTATGGAGTATTTGAAGAAAATGGCAAAGCAGGTCGGTCTTGTTGCAAACAAAGGAACTTACAGCACTTTAGTACATGGACTTTGTCGTGAAAATCGATACGTTGAAGCATGCAAGGTTTTAGAGGAGATGGTTATCAAATCGTTTTCGCCTTGTTCTAACACATTCAATACACTAATTAGAGGCCTTTGCTCAGTTGGAAAACACTATGAAGCTGTGATGTGGTTGGAAGAAATGATTAGCCAAGGTCAATTGCCTCATGTTTCTGTCTGGAATTCTTTGGTTTCATCTCTGTGTTGCAACCTGACTGGCACCGCTATGTGGTCCAAGGTTTTATGA

Coding sequence (CDS)

ATGGATAATATGGAGGGCTCCCTTGTGGGCGATATCTATTCCATTTGCCATGGAATATTCAACACCCCTTACCAAGCTAATTGCTCGCACTGGTGGTTTTGCGGCAACAGAAACTCACGGTGTGCGAGATTGGTCAAGCACAGTGGCTACAGAACACCCGATCGGACGCAAACCAGACGAGATCAATCGGACAGAAACTCTGGGCTGAGCGCCACAAAGTCTGGATGGTCGAACCAGCGAAAGGACGACGTTGCGGCGTGCACGGCGCAAGCGACAGCAGCCGCGACGTGGGTTGAAGACCGGCGGCTGTTGCAGGGAGAGAGAAAAATGGAAGGGACCAGCGGCCAGCCTTCACGAGAAGACAACCTTGAGGATTGGTTGAAGATTGTTTGGGCAAGAAAGGAATTCCACTGCACAGGATCGGCTGAACAGTTTCCGTCTATGCAAAATAGAGGAGGCGATATGCTTAGCCAAATTCAATCCGAGATAACTTCTTCAAGCGCCGAAGTTGATTTTGAGATTATTAGGAAGCAGAATAATCCCTTAACAGCTTACCAACTGTTTAAGGAAGCCAAATGTAGGTACCCAGATTATCGGCACAATGGTCCGGTGTACGCCGCGATGATTGATATACTCGGAAATTCGGGGAGGATTTCTGAGATGAGAGAAGTGTTGGATCAGATGAGAGATGACTCTTGTGAATGCAAAGATTCTGTATTTTCATTTGCAATTAAAACGTATGCTAGTCAGGGATTATTGGAAGATGGTATATCTCTTTTTAAAAGTCTTGGGAGATTTAACTGTACCGATAGAACACAAACTTTTAATACCCTTTTGGAAATCCTGTTGAATCAATCTCAGCTTGATGCTGCCTGTCAGCTTTTTCAGCAGTGTTCTTATGGTTGGGAAGTGAAATCCAGGACTCAGTCCTTGAATTTGCTGATGCAATCTCTCTGCCAGAGAGGCCAGTCTGAACTTGCTTTACATGTCTTTCAAGAAATGGATTACCAAAGTTGCTATCCAAATAGACTGAGTTATTTGATTTTAATGAAAGGATTGTGTCAAGATGGTAGGCTTAATGAGGCCATCCATTTGTTGTATTCCATGTTTTGGCGGATTTCTCAAAGGGGTGGTGGAGGGGACATAGTAATTTACAGAACCCTGCTGTTTGCTTTGTGTGATAATGGAGAGATAGAGCAAGCTGTGGAAATACTTGGCAAGATCTTGAGGAAAGGACTGAAAGCCCCTAAGCGAGCTCATTACCGGATTGACTTAGATCAATGCAGGAATAGCAAGCTCACTATTGGGGAAATCAAGAGTTTAATCAATGAAGCTTTAATCAAAGGCGGAATTCCCAGTTCAGATAGCTATTGTGCCATGGCTGTTGATCTATATAACGAAAATGAGATTGATCAGGGAGATAAAGTGGTTAGCCACATGGTAGCTAAAGGCTTCAGGCCACCGCCGTCGATCTATGAAGCAAAAGCGGCTGCATTATGCAAAGAAGGCAAAGTCGATGATGCAGTGAAAGTAATTGAAGAGGAAATAGTGAAGGGAAGTGGTGTTCCAACTGTTGCATTGTACAACATAGTTCTGAAGGGTCTGTGTGATGAGGGCAAATCAACAGTGGCTATGGAGTATTTGAAGAAAATGGCAAAGCAGGTCGGTCTTGTTGCAAACAAAGGAACTTACAGCACTTTAGTACATGGACTTTGTCGTGAAAATCGATACGTTGAAGCATGCAAGGTTTTAGAGGAGATGGTTATCAAATCGTTTTCGCCTTGTTCTAACACATTCAATACACTAATTAGAGGCCTTTGCTCAGTTGGAAAACACTATGAAGCTGTGATGTGGTTGGAAGAAATGATTAGCCAAGGTCAATTGCCTCATGTTTCTGTCTGGAATTCTTTGGTTTCATCTCTGTGTTGCAACCTGACTGGCACCGCTATGTGGTCCAAGGTTTTATGA

Protein sequence

MDNMEGSLVGDIYSICHGIFNTPYQANCSHWWFCGNRNSRCARLVKHSGYRTPDRTQTRRDQSDRNSGLSATKSGWSNQRKDDVAACTAQATAAATWVEDRRLLQGERKMEGTSGQPSREDNLEDWLKIVWARKEFHCTGSAEQFPSMQNRGGDMLSQIQSEITSSSAEVDFEIIRKQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFNTLLEILLNQSQLDAACQLFQQCSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLILMKGLCQDGRLNEAIHLLYSMFWRISQRGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQGDKVVSHMVAKGFRPPPSIYEAKAAALCKEGKVDDAVKVIEEEIVKGSGVPTVALYNIVLKGLCDEGKSTVAMEYLKKMAKQVGLVANKGTYSTLVHGLCRENRYVEACKVLEEMVIKSFSPCSNTFNTLIRGLCSVGKHYEAVMWLEEMISQGQLPHVSVWNSLVSSLCCNLTGTAMWSKVL
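
The protein sequence is what one would expect from translating the coding sequence codon by codon. The sketch below illustrates this on the first and last few codons of the CDS only. It assumes the standard nuclear genetic code (NCBI translation table 1); the record does not say which table was used, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[pos:pos + 3].upper()]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons and last five codons of the CDS listed in this record.
cds_start = "ATGGATAATATGGAGGGCTCCCTTGTGGGC"
cds_end = "TCCAAGGTTTTATGA"
print(translate(cds_start))  # MDNMEGSLVG
print(translate(cds_end))    # SKVL
```

The two fragments reproduce the beginning (MDNMEGSLVG) and the end (SKVL) of the protein sequence above; passing the full CDS string to `translate` would be the natural next check.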
BLAST of Cla97C06G111560 vs. NCBI nr
Match: XP_016902498.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g05600 isoform X1 [Cucumis melo] >XP_016902499.1 PREDICTED: pentatricopeptide repeat-containing protein At1g05600 isoform X1 [Cucumis melo])

HSP 1 Score: 461.1 bits (1185), Expect = 6.4e-126
Identity = 285/302 (94.37%), Positives = 296/302 (98.01%), Query Frame = 0

Query: 173 EIIRKQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDS 232
           +IIRKQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMI+ILGNSGR+SEMREV+DQMRDDS
Sbjct: 16  QIIRKQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMINILGNSGRVSEMREVMDQMRDDS 75

Query: 233 CECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFNTLLEILLNQSQLDAAC 292
           CECKDSVFSFAIKTYAS GLLEDGISLFKSLGRFNCT+RTQTFNTLLEILLN+SQL AAC
Sbjct: 76  CECKDSVFSFAIKTYASHGLLEDGISLFKSLGRFNCTNRTQTFNTLLEILLNESQLHAAC 135

Query: 293 QLFQQCSYGWEVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQSCYPNRLSYXXXXXX 352
           QLFQ+CSYGWEVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQSCYPNRLSY XXXXX
Sbjct: 136 QLFQECSYGWEVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQSCYPNRLSYLXXXXX 195

Query: 353 XXXXXXXXXXXXXXXXXFWRISQRGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKG 412
           XXXXXXXXXXXXXXXXXFWRIS++G GGDIVIYRTLLFALCDNGEIEQAVEILGKILRKG
Sbjct: 196 XXXXXXXXXXXXXXXXXFWRISRKGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKG 255

Query: 413 LKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQ 472
           LKAPKRAHYRIDLDQCRN+KLTI EIKSLINEALIKGGIPSSDSYCAMAVDLYNEN+ DQ
Sbjct: 256 LKAPKRAHYRIDLDQCRNNKLTIEEIKSLINEALIKGGIPSSDSYCAMAVDLYNENKTDQ 315

Query: 473 GD 475
           GD
Sbjct: 316 GD 317
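
The percentages on each HSP statistics line are simply the matched count divided by the alignment length. A minimal sketch that recomputes them from such a line follows; the function name `hsp_percentages` is hypothetical, and the regular expression is an assumption about the line layout shown in this record rather than a general BLAST parser.

```python
import re

def hsp_percentages(line):
    """Recompute the percentages from an HSP statistics line.

    Returns a dict mapping each label (e.g. 'Identity') to
    100 * matched / aligned_length, rounded to two decimals.
    """
    stats = {}
    for label, matched, total in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        stats[label] = round(100 * int(matched) / int(total), 2)
    return stats

line = "Identity = 285/302 (94.37%), Positives = 296/302 (98.01%), Query Frame = 0"
stats = hsp_percentages(line)
print(stats)
```

For the Cucumis melo hit this gives 94.37 for identity and 98.01 for positives, in agreement with the values printed in the report.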

BLAST of Cla97C06G111560 vs. NCBI nr
Match: XP_004140638.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g05600 isoform X1 [Cucumis sativus] >KGN46470.1 hypothetical protein Csa_6G095860 [Cucumis sativus])

HSP 1 Score: 452.2 bits (1162), Expect = 3.0e-123
Identity = 279/303 (92.08%), Positives = 289/303 (95.38%), Query Frame = 0

Query: 173 EIIRKQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDS 232
           +IIRKQNNP TAYQLFKEAKCRYPDYRHNGPVYA MI+ILGNSGR+SEMREV+DQMRDDS
Sbjct: 16  QIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDS 75

Query: 233 CECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFNTLLEILLNQSQLDAAC 292
           CECKDSVFSFAIKTYAS GLLEDGISLFKS GRFNCT+RTQTFNTLLEILL +SQL AAC
Sbjct: 76  CECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAAC 135

Query: 293 QLFQQCSYGWEVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQSCYPNRLSYXXXXXX 352
           QLFQ+CSYGW VXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX SCYPNRLSY XXXXX
Sbjct: 136 QLFQECSYGWGVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSCYPNRLSYLXXXXX 195

Query: 353 XXXXXXXXXXXXXXXXXFWRISQRGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKG 412
           XXXXXXXXXXXXXXX  FWRIS++GGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKG
Sbjct: 196 XXXXXXXXXXXXXXXSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKG 255

Query: 413 LKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQ 472
           LKAPKRAHYRIDLDQCRNS LTI EIKSLINEALIKGGIPSSDSYCAMAVDLYNEN+ DQ
Sbjct: 256 LKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCAMAVDLYNENKTDQ 315

Query: 473 GDK 476
           GDK
Sbjct: 316 GDK 318

BLAST of Cla97C06G111560 vs. NCBI nr
Match: XP_022959953.1 (pentatricopeptide repeat-containing protein At1g05600 [Cucurbita moschata] >XP_022959954.1 pentatricopeptide repeat-containing protein At1g05600 [Cucurbita moschata] >XP_022959955.1 pentatricopeptide repeat-containing protein At1g05600 [Cucurbita moschata])

HSP 1 Score: 424.9 bits (1091), Expect = 5.1e-115
Identity = 268/303 (88.45%), Positives = 284/303 (93.73%), Query Frame = 0

Query: 173 EIIRKQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDS 232
           +IIRKQNNP TAYQLF EAKCRYP+Y+HNGPVYAAMI+ILGNSGRISEMREV+DQM+ DS
Sbjct: 16  QIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDS 75

Query: 233 CECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFNTLLEILLNQSQLDAAC 292
           C+CKDS+FSFAIKTYAS GLLE+GISLFKSLG FNCTDRTQTFNTLLEILLN+SQLDAAC
Sbjct: 76  CQCKDSIFSFAIKTYASHGLLEEGISLFKSLGGFNCTDRTQTFNTLLEILLNESQLDAAC 135

Query: 293 QLFQQCSYGWEVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQSCYPNRLSYXXXXXX 352
           QLFQQ S+    XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX  QSCYPNRLSYXXXXXX
Sbjct: 136 QLFQQSSFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDYQSCYPNRLSYXXXXXX 195

Query: 353 XXXXXXXXXXXXXXXXXFWRISQRGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKG 412
           XXXXXXXXXXXXXXXXXFWRIS+RG GGDIVIYRTLLFALCDNGEIEQAVEILGKIL+KG
Sbjct: 196 XXXXXXXXXXXXXXXXXFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKG 255

Query: 413 LKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQ 472
           LKAPKRAHY IDL+ CR SKLT+ EIK LINEALIKGGIPSSDSYCAMA+DLYNENE DQ
Sbjct: 256 LKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSYCAMAIDLYNENETDQ 315

Query: 473 GDK 476
           GDK
Sbjct: 316 GDK 318

BLAST of Cla97C06G111560 vs. NCBI nr
Match: XP_023514107.1 (pentatricopeptide repeat-containing protein At1g05600 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 423.7 bits (1088), Expect = 1.1e-114
Identity = 267/303 (88.12%), Positives = 284/303 (93.73%), Query Frame = 0

Query: 173 EIIRKQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDS 232
           +IIRKQNNP TAYQLF EAKCRYP+Y+HNGPVYAAMI+ILGNSGRISEMREV+DQM+ DS
Sbjct: 16  QIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDS 75

Query: 233 CECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFNTLLEILLNQSQLDAAC 292
           C+CKDS+FSFAIKTYAS GLLE+GISLFKSLG FNCTDRTQTFNTLLEILLN+SQLDAAC
Sbjct: 76  CQCKDSIFSFAIKTYASHGLLEEGISLFKSLGGFNCTDRTQTFNTLLEILLNESQLDAAC 135

Query: 293 QLFQQCSYGWEVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQSCYPNRLSYXXXXXX 352
           QLFQQ S+    XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX  QSCYPNRLSYXXXXXX
Sbjct: 136 QLFQQSSFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDYQSCYPNRLSYXXXXXX 195

Query: 353 XXXXXXXXXXXXXXXXXFWRISQRGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKG 412
           XXXXXXXXXXXXXXXXXFWRIS+RG GGDIVIYRTLLFALCDNGEIEQAVEILGKIL+KG
Sbjct: 196 XXXXXXXXXXXXXXXXXFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKG 255

Query: 413 LKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQ 472
           LK+PKRAHY IDL+ CR SKLT+ EIK LINEALIKGGIPSSDSYCAMA+DLYNENE DQ
Sbjct: 256 LKSPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSYCAMAIDLYNENETDQ 315

Query: 473 GDK 476
           GDK
Sbjct: 316 GDK 318

BLAST of Cla97C06G111560 vs. NCBI nr
Match: XP_023004266.1 (pentatricopeptide repeat-containing protein At1g05600 [Cucurbita maxima] >XP_023004267.1 pentatricopeptide repeat-containing protein At1g05600 [Cucurbita maxima] >XP_023004268.1 pentatricopeptide repeat-containing protein At1g05600 [Cucurbita maxima] >XP_023004269.1 pentatricopeptide repeat-containing protein At1g05600 [Cucurbita maxima] >XP_023004270.1 pentatricopeptide repeat-containing protein At1g05600 [Cucurbita maxima])

HSP 1 Score: 422.5 bits (1085), Expect = 2.5e-114
Identity = 266/303 (87.79%), Positives = 284/303 (93.73%), Query Frame = 0

Query: 173 EIIRKQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDS 232
           +IIRKQNNP TAYQLF EAKCRYP+Y+HNGPVYAAMI+ILGNSGRISEMREV+DQM+ DS
Sbjct: 16  QIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVIDQMKVDS 75

Query: 233 CECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFNTLLEILLNQSQLDAAC 292
           C+CKDS+FSFAIKTYAS GLLE+GISLFKSLG FNCTDRTQTFNTLLEILLN+SQLDAAC
Sbjct: 76  CQCKDSIFSFAIKTYASHGLLEEGISLFKSLGGFNCTDRTQTFNTLLEILLNESQLDAAC 135

Query: 293 QLFQQCSYGWEVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQSCYPNRLSYXXXXXX 352
           QLFQQ S+    XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX  QSCYPNRLSYXXXXXX
Sbjct: 136 QLFQQSSFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDYQSCYPNRLSYXXXXXX 195

Query: 353 XXXXXXXXXXXXXXXXXFWRISQRGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKG 412
           XXXXXXXXXXXXXXXXXFWRIS++G GGDIVIYRTLLFALCDNGEIEQAVEILGKIL+KG
Sbjct: 196 XXXXXXXXXXXXXXXXXFWRISRKGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILKKG 255

Query: 413 LKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQ 472
           LK+PKRAHY IDL+ CR SKLT+ EIK LINEALIKGGIPSSDSYCAMA+DLYNENE DQ
Sbjct: 256 LKSPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSYCAMAIDLYNENETDQ 315

Query: 473 GDK 476
           GDK
Sbjct: 316 GDK 318

BLAST of Cla97C06G111560 vs. TrEMBL
Match: tr|A0A1S4E2N8|A0A1S4E2N8_CUCME (pentatricopeptide repeat-containing protein At1g05600 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103498838 PE=4 SV=1)

HSP 1 Score: 461.1 bits (1185), Expect = 4.2e-126
Identity = 285/302 (94.37%), Positives = 296/302 (98.01%), Query Frame = 0

Query: 173 EIIRKQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDS 232
           +IIRKQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMI+ILGNSGR+SEMREV+DQMRDDS
Sbjct: 16  QIIRKQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMINILGNSGRVSEMREVMDQMRDDS 75

Query: 233 CECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFNTLLEILLNQSQLDAAC 292
           CECKDSVFSFAIKTYAS GLLEDGISLFKSLGRFNCT+RTQTFNTLLEILLN+SQL AAC
Sbjct: 76  CECKDSVFSFAIKTYASHGLLEDGISLFKSLGRFNCTNRTQTFNTLLEILLNESQLHAAC 135

Query: 293 QLFQQCSYGWEVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQSCYPNRLSYXXXXXX 352
           QLFQ+CSYGWEVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQSCYPNRLSY XXXXX
Sbjct: 136 QLFQECSYGWEVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQSCYPNRLSYLXXXXX 195

Query: 353 XXXXXXXXXXXXXXXXXFWRISQRGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKG 412
           XXXXXXXXXXXXXXXXXFWRIS++G GGDIVIYRTLLFALCDNGEIEQAVEILGKILRKG
Sbjct: 196 XXXXXXXXXXXXXXXXXFWRISRKGSGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKG 255

Query: 413 LKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQ 472
           LKAPKRAHYRIDLDQCRN+KLTI EIKSLINEALIKGGIPSSDSYCAMAVDLYNEN+ DQ
Sbjct: 256 LKAPKRAHYRIDLDQCRNNKLTIEEIKSLINEALIKGGIPSSDSYCAMAVDLYNENKTDQ 315

Query: 473 GD 475
           GD
Sbjct: 316 GD 317

BLAST of Cla97C06G111560 vs. TrEMBL
Match: tr|A0A0A0K9Q4|A0A0A0K9Q4_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G095860 PE=4 SV=1)

HSP 1 Score: 452.2 bits (1162), Expect = 2.0e-123
Identity = 279/303 (92.08%), Positives = 289/303 (95.38%), Query Frame = 0

Query: 173 EIIRKQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDS 232
           +IIRKQNNP TAYQLFKEAKCRYPDYRHNGPVYA MI+ILGNSGR+SEMREV+DQMRDDS
Sbjct: 16  QIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDS 75

Query: 233 CECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFNTLLEILLNQSQLDAAC 292
           CECKDSVFSFAIKTYAS GLLEDGISLFKS GRFNCT+RTQTFNTLLEILL +SQL AAC
Sbjct: 76  CECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAAC 135

Query: 293 QLFQQCSYGWEVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQSCYPNRLSYXXXXXX 352
           QLFQ+CSYGW VXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX SCYPNRLSY XXXXX
Sbjct: 136 QLFQECSYGWGVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSCYPNRLSYLXXXXX 195

Query: 353 XXXXXXXXXXXXXXXXXFWRISQRGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKG 412
           XXXXXXXXXXXXXXX  FWRIS++GGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKG
Sbjct: 196 XXXXXXXXXXXXXXXSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKG 255

Query: 413 LKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQ 472
           LKAPKRAHYRIDLDQCRNS LTI EIKSLINEALIKGGIPSSDSYCAMAVDLYNEN+ DQ
Sbjct: 256 LKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCAMAVDLYNENKTDQ 315

Query: 473 GDK 476
           GDK
Sbjct: 316 GDK 318

BLAST of Cla97C06G111560 vs. TrEMBL
Match: tr|A0A1S3CB38|A0A1S3CB38_CUCME (pentatricopeptide repeat-containing protein At1g05600 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103498838 PE=4 SV=1)

HSP 1 Score: 387.1 bits (993), Expect = 7.8e-104
Identity = 251/267 (94.01%), Positives = 261/267 (97.75%), Query Frame = 0

Query: 208 MIDILGNSGRISEMREVLDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFN 267
           MI+ILGNSGR+SEMREV+DQMRDDSCECKDSVFSFAIKTYAS GLLEDGISLFKSLGRFN
Sbjct: 1   MINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSLGRFN 60

Query: 268 CTDRTQTFNTLLEILLNQSQLDAACQLFQQCSYGWEVXXXXXXXXXXXXXXXXXXXXXXX 327
           CT+RTQTFNTLLEILLN+SQL AACQLFQ+CSYGWEVXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 61  CTNRTQTFNTLLEILLNESQLHAACQLFQECSYGWEVXXXXXXXXXXXXXXXXXXXXXXX 120

Query: 328 XXXXXXXXXQSCYPNRLSYXXXXXXXXXXXXXXXXXXXXXXXFWRISQRGGGGDIVIYRT 387
           XXXXXXXXXQSCYPNRLSY XXXXXXXXXXXXXXXXXXXXXXFWRIS++G GGDIVIYRT
Sbjct: 121 XXXXXXXXXQSCYPNRLSYLXXXXXXXXXXXXXXXXXXXXXXFWRISRKGSGGDIVIYRT 180

Query: 388 LLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALI 447
           LLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRN+KLTI EIKSLINEALI
Sbjct: 181 LLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNNKLTIEEIKSLINEALI 240

Query: 448 KGGIPSSDSYCAMAVDLYNENEIDQGD 475
           KGGIPSSDSYCAMAVDLYNEN+ DQGD
Sbjct: 241 KGGIPSSDSYCAMAVDLYNENKTDQGD 267

BLAST of Cla97C06G111560 vs. TrEMBL
Match: tr|B9RNK1|B9RNK1_RICCO (Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis OX=3988 GN=RCOM_1339630 PE=4 SV=1)

HSP 1 Score: 307.4 bits (786), Expect = 7.8e-80
Identity = 160/293 (54.61%), Positives = 193/293 (65.87%), Query Frame = 0

Query: 173 EIIRKQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDS 232
           +IIR Q NPL A ++FKEAK +YP+YRHNGPVYA MI ILG+SGRI+EM+EVLDQMR+DS
Sbjct: 16  QIIRNQKNPLIALRIFKEAKDKYPNYRHNGPVYATMIGILGSSGRITEMKEVLDQMREDS 75

Query: 233 CECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFNTLLEILLNQSQLDAAC 292
           CECKDS+F+ AIKTYA  GLL + ISLFK++ +FNC + T++FNTLL+I++ +S+L+AA 
Sbjct: 76  CECKDSIFANAIKTYARVGLLNEAISLFKNIPQFNCVNWTESFNTLLQIMVKESKLEAAH 135

Query: 293 QLFQQCSYGWEVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQSCYPNRLSYXXXXXX 352
           +LF + SYGWEV                                Q CYP+R SY      
Sbjct: 136 RLFLESSYGWEVKSRVRSLNLLMDVLCQHNRSDVALQVFQEMNYQGCYPDRDSYRIVMMG 195

Query: 353 XXXXXXXXXXXXXXXXXFWRISQRGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKG 412
                            FWRISQ+G G DIVIYR  L ALCD G +EQA+E+LGKILRKG
Sbjct: 196 LCKDGRLNEATHLLYSMFWRISQKGSGEDIVIYRIFLDALCDIGMVEQALEVLGKILRKG 255

Query: 413 LKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLY 466
           LKAPKR H R+DL  C NS   I   K LINEALI+G IPS  SY AMAVD Y
Sbjct: 256 LKAPKRCHPRLDLSNC-NSDGNIETTKHLINEALIRGAIPSLSSYTAMAVDFY 307

BLAST of Cla97C06G111560 vs. TrEMBL
Match: tr|A0A2H5P127|A0A2H5P127_CITUN (Uncharacterized protein OS=Citrus unshiu OX=55188 GN=CUMW_092180 PE=4 SV=1)

HSP 1 Score: 304.7 bits (779), Expect = 5.1e-79
Identity = 160/302 (52.98%), Positives = 198/302 (65.56%), Query Frame = 0

Query: 173 EIIRKQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDS 232
           +II+KQ +PLTA ++FKEAK +YP+YRHNGPVYA+MI IL  S RI+EM+EV+DQM+ DS
Sbjct: 16  QIIKKQKSPLTALKIFKEAKEKYPNYRHNGPVYASMIGILSESNRITEMKEVIDQMKGDS 75

Query: 233 CECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFNTLLEILLNQSQLDAAC 292
           CECKDSVF+ AI+TYA  G L + +SLFK+L +FNC + TQ+FNTLL+ ++ +S+L+AA 
Sbjct: 76  CECKDSVFATAIRTYARAGQLNEAVSLFKNLSQFNCVNWTQSFNTLLKEMVKESKLEAAH 135

Query: 293 QLFQQCSYGWEVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQSCYPNRLSYXXXXXX 352
            LF +  YGWEV                                Q CYP+R SY      
Sbjct: 136 ILFLRSCYGWEVKSRIQSLNLLMDVLCQRRRSDLALHVFQEMDFQGCYPDRESYHILMKG 195

Query: 353 XXXXXXXXXXXXXXXXXFWRISQRGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKG 412
                            FWRISQ+G G DIVIYRTLLFALCD G+I+ A++IL KILRKG
Sbjct: 196 LCNDRRLNEATHLLYSMFWRISQKGSGEDIVIYRTLLFALCDQGKIQDAMQILEKILRKG 255

Query: 413 LKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQ 472
           LKAPK   +RIDL  C + +  I   KSLINEALI+GGIPS  SY AMAVDLYNE  I +
Sbjct: 256 LKAPKSRRHRIDLCPCNDGE-DIEGAKSLINEALIRGGIPSLASYSAMAVDLYNEGRIVE 315

Query: 473 GD 475
           GD
Sbjct: 316 GD 316

BLAST of Cla97C06G111560 vs. Swiss-Prot
Match: sp|Q9SYK1|PPR11_ARATH (Pentatricopeptide repeat-containing protein At1g05600 OS=Arabidopsis thaliana OX=3702 GN=At1g05600 PE=2 SV=1)

HSP 1 Score: 215.3 bits (547), Expect = 2.0e-54
Identity = 116/303 (38.28%), Positives = 161/303 (53.14%), Query Frame = 0

Query: 173 EIIRKQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDS 232
           +I++KQ NP+TA +LF+EAK R+P Y HNG VYA MIDILG S R+ EM+ V+++M++DS
Sbjct: 17  QILKKQKNPVTALKLFEEAKERFPSYGHNGSVYATMIDILGKSNRVLEMKYVIERMKEDS 76

Query: 233 CECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFNTLLEILLNQSQLDAAC 292
           CECKDSVF+  I+T++  G LED ISLFKSL  FNC + + +F+TLL+ ++ +S+L+AAC
Sbjct: 77  CECKDSVFASVIRTFSRAGRLEDAISLFKSLHEFNCVNWSLSFDTLLQEMVKESELEAAC 136

Query: 293 QLFQQCSYGWEVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQSCYPNRLSYXXXXXX 352
            +F++  YGWEV                                Q CYP+R SY      
Sbjct: 137 HIFRKYCYGWEVNSRITALNLLMKVLCQVNRSDLASQVFQEMNYQGCYPDRDSYRILMKG 196

Query: 353 XXXXXXXXXXXXXXXXXFWRISQRGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKG 412
                           X                                    GKILRKG
Sbjct: 197 FCLEGKLEEATHLLYSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGKILRKG 256

Query: 413 LKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQ 472
           LKAPKR ++ I+     +S   I  +K L+ E LI+G IP  DSY AMA DL+ E ++ +
Sbjct: 257 LKAPKRCYHHIEAGHWESSSEGIERVKRLLTETLIRGAIPCLDSYSAMATDLFEEGKLVE 316

Query: 473 GDK 476
           G++
Sbjct: 317 GEE 319

BLAST of Cla97C06G111560 vs. Swiss-Prot
Match: sp|Q9LQQ1|PPR20_ARATH (Pentatricopeptide repeat-containing protein At1g07740, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g07740 PE=2 SV=1)

HSP 1 Score: 59.3 bits (142), Expect = 1.8e-07
Identity = 31/121 (25.62%), Positives = 63/121 (52.07%), Query Frame = 0

Query: 175 IRKQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCE 234
           +++  +P  A  LF +   +   +RH+ P Y+++I  L  S     + ++L  +R  +  
Sbjct: 56  LKEIEDPEEALSLFHQ--YQEMGFRHDYPSYSSLIYKLAKSRNFDAVDQILRLVRYRNVR 115

Query: 235 CKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFNTLLEILLNQSQLDAACQL 294
           C++S+F   I+ Y   G ++  I +F  +  F+C    Q+ NTL+ +L++  +L+ A   
Sbjct: 116 CRESLFMGLIQHYGKAGSVDKAIDVFHKITSFDCVRTIQSLNTLINVLVDNGELEKAKSF 174

Query: 295 F 296
           F
Sbjct: 176 F 174

BLAST of Cla97C06G111560 vs. Swiss-Prot
Match: sp|O81908|PPR2_ARATH (Pentatricopeptide repeat-containing protein At1g02060, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At1g02060 PE=2 SV=2)

HSP 1 Score: 50.4 bits (119), Expect = 8.5e-05
Identity = 27/103 (26.21%), Positives = 54/103 (52.43%), Query Frame = 0

Query: 198 YRHNGPVYAAMIDILGNSGRISEMREVL--DQMRDDSC-ECKDSVFSFAIKTYASQGLLE 257
           + H    +  M++ LG +  ++  R  L   + R + C + +D  F+  I++Y + GL +
Sbjct: 96  FSHKEQSFFLMLEFLGRARNLNVARNFLFSIERRSNGCVKLQDRYFNSLIRSYGNAGLFQ 155

Query: 258 DGISLFKSLGRFNCTDRTQTFNTLLEILLNQSQLDAACQLFQQ 298
           + + LF+++ +   +    TFN+LL ILL + +   A  LF +
Sbjct: 156 ESVKLFQTMKQMGISPSVLTFNSLLSILLKRGRTGMAHDLFDE 198

BLAST of Cla97C06G111560 vs. Swiss-Prot
Match: sp|Q9LFF1|PP281_ARATH (Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=MEE40 PE=2 SV=1)

HSP 1 Score: 50.1 bits (118), Expect = 1.1e-04
Identity = 33/125 (26.40%), Positives = 63/125 (50.40%), Query Frame = 0

Query: 166 SSAEVD-FEIIRKQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREV 225
           SS +V   + +R Q +   A +LF  A  + P++     +Y  ++  LG SG   +M+++
Sbjct: 47  SSTDVKLLDSLRSQPDDSAALRLFNLAS-KKPNFSPEPALYEEILLRLGRSGSFDDMKKI 106

Query: 226 LDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLFK-SLGRFNCTDRTQTFNTLLEILL 285
           L+ M+   CE   S F   I++YA   L ++ +S+    +  F     T  +N +L +L+
Sbjct: 107 LEDMKSSRCEMGTSTFLILIESYAQFELQDEILSVVDWMIDEFGLKPDTHFYNRMLNLLV 166

Query: 286 NQSQL 289
           + + L
Sbjct: 167 DGNSL 170

BLAST of Cla97C06G111560 vs. TAIR10
Match: AT1G05600.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 215.3 bits (547), Expect = 1.1e-55
Identity = 116/303 (38.28%), Positives = 161/303 (53.14%), Query Frame = 0

Query: 173 EIIRKQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDS 232
           +I++KQ NP+TA +LF+EAK R+P Y HNG VYA MIDILG S R+ EM+ V+++M++DS
Sbjct: 17  QILKKQKNPVTALKLFEEAKERFPSYGHNGSVYATMIDILGKSNRVLEMKYVIERMKEDS 76

Query: 233 CECKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFNTLLEILLNQSQLDAAC 292
           CECKDSVF+  I+T++  G LED ISLFKSL  FNC + + +F+TLL+ ++ +S+L+AAC
Sbjct: 77  CECKDSVFASVIRTFSRAGRLEDAISLFKSLHEFNCVNWSLSFDTLLQEMVKESELEAAC 136

Query: 293 QLFQQCSYGWEVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQSCYPNRLSYXXXXXX 352
            +F++  YGWEV                                Q CYP+R SY      
Sbjct: 137 HIFRKYCYGWEVNSRITALNLLMKVLCQVNRSDLASQVFQEMNYQGCYPDRDSYRILMKG 196

Query: 353 XXXXXXXXXXXXXXXXXFWRISQRGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKG 412
                           X                                    GKILRKG
Sbjct: 197 FCLEGKLEEATHLLYSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGKILRKG 256

Query: 413 LKAPKRAHYRIDLDQCRNSKLTIGEIKSLINEALIKGGIPSSDSYCAMAVDLYNENEIDQ 472
           LKAPKR ++ I+     +S   I  +K L+ E LI+G IP  DSY AMA DL+ E ++ +
Sbjct: 257 LKAPKRCYHHIEAGHWESSSEGIERVKRLLTETLIRGAIPCLDSYSAMATDLFEEGKLVE 316

Query: 473 GDK 476
           G++
Sbjct: 317 GEE 319

BLAST of Cla97C06G111560 vs. TAIR10
Match: AT1G07740.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 59.3 bits (142), Expect = 1.0e-08
Identity = 31/121 (25.62%), Positives = 63/121 (52.07%), Query Frame = 0

Query: 175 IRKQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREVLDQMRDDSCE 234
           +++  +P  A  LF +   +   +RH+ P Y+++I  L  S     + ++L  +R  +  
Sbjct: 56  LKEIEDPEEALSLFHQ--YQEMGFRHDYPSYSSLIYKLAKSRNFDAVDQILRLVRYRNVR 115

Query: 235 CKDSVFSFAIKTYASQGLLEDGISLFKSLGRFNCTDRTQTFNTLLEILLNQSQLDAACQL 294
           C++S+F   I+ Y   G ++  I +F  +  F+C    Q+ NTL+ +L++  +L+ A   
Sbjct: 116 CRESLFMGLIQHYGKAGSVDKAIDVFHKITSFDCVRTIQSLNTLINVLVDNGELEKAKSF 174

Query: 295 F 296
           F
Sbjct: 176 F 174

BLAST of Cla97C06G111560 vs. TAIR10
Match: AT1G02060.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 50.4 bits (119), Expect = 4.7e-06
Identity = 27/103 (26.21%), Positives = 54/103 (52.43%), Query Frame = 0

Query: 198 YRHNGPVYAAMIDILGNSGRISEMREVL--DQMRDDSC-ECKDSVFSFAIKTYASQGLLE 257
           + H    +  M++ LG +  ++  R  L   + R + C + +D  F+  I++Y + GL +
Sbjct: 96  FSHKEQSFFLMLEFLGRARNLNVARNFLFSIERRSNGCVKLQDRYFNSLIRSYGNAGLFQ 155

Query: 258 DGISLFKSLGRFNCTDRTQTFNTLLEILLNQSQLDAACQLFQQ 298
           + + LF+++ +   +    TFN+LL ILL + +   A  LF +
Sbjct: 156 ESVKLFQTMKQMGISPSVLTFNSLLSILLKRGRTGMAHDLFDE 198

BLAST of Cla97C06G111560 vs. TAIR10
Match: AT3G53700.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 50.1 bits (118), Expect = 6.2e-06
Identity = 33/125 (26.40%), Positives = 63/125 (50.40%), Query Frame = 0

Query: 166 SSAEVD-FEIIRKQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMIDILGNSGRISEMREV 225
           SS +V   + +R Q +   A +LF  A  + P++     +Y  ++  LG SG   +M+++
Sbjct: 47  SSTDVKLLDSLRSQPDDSAALRLFNLAS-KKPNFSPEPALYEEILLRLGRSGSFDDMKKI 106

Query: 226 LDQMRDDSCECKDSVFSFAIKTYASQGLLEDGISLFK-SLGRFNCTDRTQTFNTLLEILL 285
           L+ M+   CE   S F   I++YA   L ++ +S+    +  F     T  +N +L +L+
Sbjct: 107 LEDMKSSRCEMGTSTFLILIESYAQFELQDEILSVVDWMIDEFGLKPDTHFYNRMLNLLV 166

Query: 286 NQSQL 289
           + + L
Sbjct: 167 DGNSL 170

BLAST of Cla97C06G111560 vs. TAIR10
Match: AT1G72175.1 (RING/U-box protein with domain of unknown function (DUF 1232))

HSP 1 Score: 44.3 bits (103), Expect = 3.4e-04
Identity = 22/39 (56.41%), Positives = 23/39 (58.97%), Query Frame = 0

Query: 11 DIYSICHGIFNTPYQANCSHWWFCGNRNSRCARLVKHSG 50
          D+ SICH  F  P QANCSH WFCGN    C  LV   G
Sbjct: 8  DLCSICHSHFTAPCQANCSH-WFCGN----CIMLVWRHG 41

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
XP_016902498.1	6.4e-126	94.37	PREDICTED: pentatricopeptide repeat-containing protein At1g05600 isoform X1 [Cuc...
XP_004140638.1	3.0e-123	92.08	PREDICTED: pentatricopeptide repeat-containing protein At1g05600 isoform X1 [Cuc...
XP_022959953.1	5.1e-115	88.45	pentatricopeptide repeat-containing protein At1g05600 [Cucurbita moschata] >XP_0...
XP_023514107.1	1.1e-114	88.12	pentatricopeptide repeat-containing protein At1g05600 isoform X1 [Cucurbita pepo...
XP_023004266.1	2.5e-114	87.79	pentatricopeptide repeat-containing protein At1g05600 [Cucurbita maxima] >XP_023...

Match Name	E-value	Identity (%)	Description
tr|A0A1S4E2N8|A0A1S4E2N8_CUCME	4.2e-126	94.37	pentatricopeptide repeat-containing protein At1g05600 isoform X1 OS=Cucumis melo...
tr|A0A0A0K9Q4|A0A0A0K9Q4_CUCSA	2.0e-123	92.08	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G095860 PE=4 SV=1
tr|A0A1S3CB38|A0A1S3CB38_CUCME	7.8e-104	94.01	pentatricopeptide repeat-containing protein At1g05600 isoform X2 OS=Cucumis melo...
tr|B9RNK1|B9RNK1_RICCO	7.8e-80	54.61	Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis OX=398...
tr|A0A2H5P127|A0A2H5P127_CITUN	5.1e-79	52.98	Uncharacterized protein OS=Citrus unshiu OX=55188 GN=CUMW_092180 PE=4 SV=1

Match Name	E-value	Identity (%)	Description
sp|Q9SYK1|PPR11_ARATH	2.0e-54	38.28	Pentatricopeptide repeat-containing protein At1g05600 OS=Arabidopsis thaliana OX...
sp|Q9LQQ1|PPR20_ARATH	1.8e-07	25.62	Pentatricopeptide repeat-containing protein At1g07740, mitochondrial OS=Arabidop...
sp|O81908|PPR2_ARATH	8.5e-05	26.21	Pentatricopeptide repeat-containing protein At1g02060, chloroplastic OS=Arabidop...
sp|Q9LFF1|PP281_ARATH	1.1e-04	26.40	Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidop...

Match Name	E-value	Identity (%)	Description
AT1G05600.1	1.1e-55	38.28	Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G07740.1	1.0e-08	25.62	Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G02060.1	4.7e-06	26.21	Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G53700.1	6.2e-06	26.40	Pentatricopeptide repeat (PPR) superfamily protein
AT1G72175.1	3.4e-04	56.41	RING/U-box protein with domain of unknown function (DUF 1232)
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term	Definition
GO:0005515	protein binding
Vocabulary: INTERPRO
Term	Definition
IPR011990	TPR-like_helical_dom_sf
IPR002885	Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0090305	nucleic acid phosphodiester bond hydrolysis
biological_process	GO:0009451	RNA modification
biological_process	GO:0008150	biological_process
cellular_component	GO:0043231	intracellular membrane-bounded organelle
cellular_component	GO:0005575	cellular_component
molecular_function	GO:0004519	endonuclease activity
molecular_function	GO:0003723	RNA binding
molecular_function	GO:0005515	protein binding
molecular_function	GO:0003674	molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cla97C06G111560.1	Cla97C06G111560.1	mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term	IPR Description	Source	Source Term	Source Description	Coord	E-value	Score
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	562..608	6.8E-12	45.3
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	311..355	2.1E-10	40.5
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	598..629	2.5E-6	25.3
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	527..560	5.0E-4	18.1
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	204..235	8.8E-6	23.6
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	345..369	0.002	16.2
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	563..594	1.9E-7	28.8
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	311..342	7.8E-5	20.6
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	383..413	0.004	17.2
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	205..233	0.0085	16.2
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	528..552	0.018	15.2
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	498..514	1.1	9.6
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	381..415	-	10.446
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	595..629	-	10.611
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	342..372	-	8.144
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	201..235	-	9.24
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	236..270	-	6.467
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	453..487	-	7.695
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	307..341	-	9.525
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	524..554	-	8.276
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	560..594	-	11.597
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	488..523	-	7.574
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	271..301	-	5.656
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10	-	435..540	1.0E-10	43.2
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10	-	541..646	5.6E-24	86.5
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10	-	154..289	2.9E-17	64.6
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10	-	300..433	2.2E-18	68.7
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	63..77	-	-
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	52..77	-	-
None	No IPR available	PANTHER	PTHR24015:SF542	SUBFAMILY NOT NAMED	171..638	-	-
None	No IPR available	PANTHER	PTHR24015	FAMILY NOT NAMED	171..638	-	-
None	No IPR available	SUPERFAMILY	SSF81901	HCP-like	521..621	-	-
None	No IPR available	SUPERFAMILY	SSF81901	HCP-like	318..413	-	-
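
The PROSITE PS51375 hits listed above tile most of the region from residue 201 to 629, which is easier to see once the coordinates are sorted. The following is a minimal sketch that sorts the hits and reports the stretches not covered by any PROSITE repeat. The coordinates are copied from the table above and treated as 1-based inclusive positions (an assumption); the names `PPR_HITS` and `gaps_between` are hypothetical.

```python
# PROSITE PS51375 (PPR) hit coordinates copied from the InterPro annotation
# above, assumed to be 1-based inclusive (start, end) residue positions.
PPR_HITS = [
    (381, 415), (595, 629), (342, 372), (201, 235), (236, 270), (453, 487),
    (307, 341), (524, 554), (560, 594), (488, 523), (271, 301),
]

def gaps_between(hits):
    """Return (start, end) stretches left uncovered between sorted hits."""
    ordered = sorted(hits)
    if not ordered:
        return []
    gaps = []
    covered_to = ordered[0][1]  # last residue covered so far
    for start, end in ordered[1:]:
        if start > covered_to + 1:
            gaps.append((covered_to + 1, start - 1))
        covered_to = max(covered_to, end)
    return gaps

if __name__ == "__main__":
    print(len(PPR_HITS), "PROSITE PPR hits")
    for start, end in gaps_between(PPR_HITS):
        print("uncovered: %d..%d (%d residues)" % (start, end, end - start + 1))
```

For these coordinates the uncovered stretches are 302..306, 373..380, 416..452 and 555..559; every other pair of neighbouring hits is directly adjacent.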

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene	Organism	Block
Cla97C06G111560	Silver-seed gourd	carwmbB0078
Cla97C06G111560	Silver-seed gourd	carwmbB1018
Cla97C06G111560	Cucurbita maxima (Rimu)	cmawmbB371
Cla97C06G111560	Cucurbita maxima (Rimu)	cmawmbB924
Cla97C06G111560	Cucurbita moschata (Rifu)	cmowmbB357
Cla97C06G111560	Cucurbita moschata (Rifu)	cmowmbB899
Cla97C06G111560	Watermelon (Charleston Gray)	wcgwmbB241