Cla97C01G019950 (gene) Watermelon (97103) v2.5

Overview
NameCla97C01G019950
Typegene
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionPentatricopeptide repeat-containing protein
LocationCla97Chr01: 32818813 .. 32820603 (-)
RNA-Seq ExpressionCla97C01G019950
SyntenyCla97C01G019950
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCAAGGATTTCCACTCGACAACTCTTTCGCTTCACGCACGCCTCCCTCCCTCTGCCCTTCAAATCCGTTGATCGATCTTCCTCTCCATTCTCCTCTTTTCCAGAACCAGATCTTTCACTCGAGACCACAAATCCTCCTAGACATAACCGAAGCTACTCGCTTCTTCAATCATGCCAGAGCGTAAGAGAATTACTTCAAATCCATGGCCATTTGATTACCTCTGGTCGTTTTAAACACCATTTTTGGGCCAACAGAGTTCTATTTCAGGCCTCGGAGTTTGGCGACGTCATTTATACTGTTTTGGTCTTCAGGTATATCAACATTCCCAATACCTTCTGTATCAATAGAGTAATTAAGGCTTATTCTCTTAGCATAGTTCCTCTAGAGGCTGTATCTTTGTATTTTGAATGGCTTGGTAATGGGTTTCGGCCAGATTCGTACACTTTTCTTTCACTTTTTTCCGCTTGTGCGAATTTTGGCTGTGGGGCTTCTGGGCGTAAGTGTCATGGACAAGCTTTCAAGAATGGGATTGACTCTGTCATGGTTTTGAGAAATAGTTTGATTCATATGTATGGCTGTTGTGGGCATATTGAGCTCGGTCGGAAGGTGTTCGATGAAATGTCGAGCTGGGATTTGGTATCTTGGAATTCAATTGTTACTGCTTATGCAAGAACTGGAGATTTGCACACTGCCCATGACCTGTTCGATGCAATGCCGGAGAGAAATATTGTGTCTTGGAATTTGATGATTAGTGAGTATTTGAGAGGTGGGAATCCAGGCTGTGCAATGAAGTTGTTTAGGAATATGGTGAATATAGGAATAAGAGGGAACAATACAACAATGGTCAACGTTCTTGGTGCTTGCGGTCGATCAGCAAGGCTGAATGAAGGAAGATCGGTTCATGGTTTTATGTACCGTACTTCAATGAAGTTTTGCGTATTTATCAACACAGCATTGGTTGACATGTATAGCAAATGCCAGAGAGTGTCTATTGCACGTAGAGTGTTTGACAGGATGCTGAGTCGGAATTTGGTTACCTGGAATGCAATGGTTTTGGGGCATTGCCTACATGGCAATCCTGATGATGGACTTAAGCTATTTCAGGAAATGGCTGCCAAATTAAGGGAAATAAATGGGGAAATTGGCAATGGCAAGAAATTCAAGCAAGATGAAGGTAAGCGAAATGTTTACCCAGACCAAATTACATTTATTGGCGTTCTATGTGCCTGTGCCCGAGCGGGACTGCTGAAAGATGCAAATAATTACTTCAACGAGATGATCAATGTGTTTCTTGTGAGGCCAAATTTTGCCCACTACTGGTGTTTAGCCAATGTTTACGTTGCAGCAGGGCTGATACAGCAGGCTGTGGAAATACTGAGGAACGTGCCTGAGGATGACGAGGACTTTTCATCAGATTCAGTTGTATGGATTAACTTGCTCACCATGTGTCGTTTTGTGGGAGATGTTTCTTTGGGAGAACAGATAGCAAAATATTTGATTGACTTGGAACCTAAGAATGACTCATACTATAGATTGCTTCTGAATATTTATGCTGTAGCAGGGAGATGGGAGGATGTTTCTAGAATCAAATTATTAATGAAAGAAAAAAGACTTGGAACAATGCCGGGTTGTAGACTAGTAGACCTGAAAGAGATTGTTCACAGATTAAAATTGGGAAATCTTCTGCAAGATGGGATGAAGGAGACAAACACAGTGATGCATAAACTTGCTAGTGAAGTGAGTCTATTGTCAAGCATTGCTGCAGGCCAATCAGATTTTGGAGTTTAG

mRNA sequence

ATGGCAAGGATTTCCACTCGACAACTCTTTCGCTTCACGCACGCCTCCCTCCCTCTGCCCTTCAAATCCGTTGATCGATCTTCCTCTCCATTCTCCTCTTTTCCAGAACCAGATCTTTCACTCGAGACCACAAATCCTCCTAGACATAACCGAAGCTACTCGCTTCTTCAATCATGCCAGAGCGTAAGAGAATTACTTCAAATCCATGGCCATTTGATTACCTCTGGTCGTTTTAAACACCATTTTTGGGCCAACAGAGTTCTATTTCAGGCCTCGGAGTTTGGCGACGTCATTTATACTGTTTTGGTCTTCAGGTATATCAACATTCCCAATACCTTCTGTATCAATAGAGTAATTAAGGCTTATTCTCTTAGCATAGTTCCTCTAGAGGCTGTATCTTTGTATTTTGAATGGCTTGGTAATGGGTTTCGGCCAGATTCGTACACTTTTCTTTCACTTTTTTCCGCTTGTGCGAATTTTGGCTGTGGGGCTTCTGGGCGTAAGTGTCATGGACAAGCTTTCAAGAATGGGATTGACTCTGTCATGGTTTTGAGAAATAGTTTGATTCATATGTATGGCTGTTGTGGGCATATTGAGCTCGGTCGGAAGGTGTTCGATGAAATGTCGAGCTGGGATTTGGTATCTTGGAATTCAATTGTTACTGCTTATGCAAGAACTGGAGATTTGCACACTGCCCATGACCTGTTCGATGCAATGCCGGAGAGAAATATTGTGTCTTGGAATTTGATGATTAGTGAGTATTTGAGAGGTGGGAATCCAGGCTGTGCAATGAAGTTGTTTAGGAATATGGTGAATATAGGAATAAGAGGGAACAATACAACAATGGTCAACGTTCTTGGTGCTTGCGGTCGATCAGCAAGGCTGAATGAAGGAAGATCGGTTCATGGTTTTATGTACCGTACTTCAATGAAGTTTTGCGTATTTATCAACACAGCATTGGTTGACATGTATAGCAAATGCCAGAGAGTGTCTATTGCACGTAGAGTGTTTGACAGGATGCTGAGTCGGAATTTGGTTACCTGGAATGCAATGGTTTTGGGGCATTGCCTACATGGCAATCCTGATGATGGACTTAAGCTATTTCAGGAAATGGCTGCCAAATTAAGGGAAATAAATGGGGAAATTGGCAATGGCAAGAAATTCAAGCAAGATGAAGGTAAGCGAAATGTTTACCCAGACCAAATTACATTTATTGGCGTTCTATGTGCCTGTGCCCGAGCGGGACTGCTGAAAGATGCAAATAATTACTTCAACGAGATGATCAATGTGTTTCTTGTGAGGCCAAATTTTGCCCACTACTGGTGTTTAGCCAATGTTTACGTTGCAGCAGGGCTGATACAGCAGGCTGTGGAAATACTGAGGAACGTGCCTGAGGATGACGAGGACTTTTCATCAGATTCAGTTGTATGGATTAACTTGCTCACCATGTGTCGTTTTGTGGGAGATGTTTCTTTGGGAGAACAGATAGCAAAATATTTGATTGACTTGGAACCTAAGAATGACTCATACTATAGATTGCTTCTGAATATTTATGCTGTAGCAGGGAGATGGGAGGATGTTTCTAGAATCAAATTATTAATGAAAGAAAAAAGACTTGGAACAATGCCGGGTTGTAGACTAGTAGACCTGAAAGAGATTGTTCACAGATTAAAATTGGGAAATCTTCTGCAAGATGGGATGAAGGAGACAAACACAGTGATGCATAAACTTGCTAGTGAAGTGAGTCTATTGTCAAGCATTGCTGCAGGCCAATCAGATTTTGGAGTTTAG

Coding sequence (CDS)

ATGGCAAGGATTTCCACTCGACAACTCTTTCGCTTCACGCACGCCTCCCTCCCTCTGCCCTTCAAATCCGTTGATCGATCTTCCTCTCCATTCTCCTCTTTTCCAGAACCAGATCTTTCACTCGAGACCACAAATCCTCCTAGACATAACCGAAGCTACTCGCTTCTTCAATCATGCCAGAGCGTAAGAGAATTACTTCAAATCCATGGCCATTTGATTACCTCTGGTCGTTTTAAACACCATTTTTGGGCCAACAGAGTTCTATTTCAGGCCTCGGAGTTTGGCGACGTCATTTATACTGTTTTGGTCTTCAGGTATATCAACATTCCCAATACCTTCTGTATCAATAGAGTAATTAAGGCTTATTCTCTTAGCATAGTTCCTCTAGAGGCTGTATCTTTGTATTTTGAATGGCTTGGTAATGGGTTTCGGCCAGATTCGTACACTTTTCTTTCACTTTTTTCCGCTTGTGCGAATTTTGGCTGTGGGGCTTCTGGGCGTAAGTGTCATGGACAAGCTTTCAAGAATGGGATTGACTCTGTCATGGTTTTGAGAAATAGTTTGATTCATATGTATGGCTGTTGTGGGCATATTGAGCTCGGTCGGAAGGTGTTCGATGAAATGTCGAGCTGGGATTTGGTATCTTGGAATTCAATTGTTACTGCTTATGCAAGAACTGGAGATTTGCACACTGCCCATGACCTGTTCGATGCAATGCCGGAGAGAAATATTGTGTCTTGGAATTTGATGATTAGTGAGTATTTGAGAGGTGGGAATCCAGGCTGTGCAATGAAGTTGTTTAGGAATATGGTGAATATAGGAATAAGAGGGAACAATACAACAATGGTCAACGTTCTTGGTGCTTGCGGTCGATCAGCAAGGCTGAATGAAGGAAGATCGGTTCATGGTTTTATGTACCGTACTTCAATGAAGTTTTGCGTATTTATCAACACAGCATTGGTTGACATGTATAGCAAATGCCAGAGAGTGTCTATTGCACGTAGAGTGTTTGACAGGATGCTGAGTCGGAATTTGGTTACCTGGAATGCAATGGTTTTGGGGCATTGCCTACATGGCAATCCTGATGATGGACTTAAGCTATTTCAGGAAATGGCTGCCAAATTAAGGGAAATAAATGGGGAAATTGGCAATGGCAAGAAATTCAAGCAAGATGAAGGTAAGCGAAATGTTTACCCAGACCAAATTACATTTATTGGCGTTCTATGTGCCTGTGCCCGAGCGGGACTGCTGAAAGATGCAAATAATTACTTCAACGAGATGATCAATGTGTTTCTTGTGAGGCCAAATTTTGCCCACTACTGGTGTTTAGCCAATGTTTACGTTGCAGCAGGGCTGATACAGCAGGCTGTGGAAATACTGAGGAACGTGCCTGAGGATGACGAGGACTTTTCATCAGATTCAGTTGTATGGATTAACTTGCTCACCATGTGTCGTTTTGTGGGAGATGTTTCTTTGGGAGAACAGATAGCAAAATATTTGATTGACTTGGAACCTAAGAATGACTCATACTATAGATTGCTTCTGAATATTTATGCTGTAGCAGGGAGATGGGAGGATGTTTCTAGAATCAAATTATTAATGAAAGAAAAAAGACTTGGAACAATGCCGGGTTGTAGACTAGTAGACCTGAAAGAGATTGTTCACAGATTAAAATTGGGAAATCTTCTGCAAGATGGGATGAAGGAGACAAACACAGTGATGCATAAACTTGCTAGTGAAGTGAGTCTATTGTCAAGCATTGCTGCAGGCCAATCAGATTTTGGAGTTTAG

Protein sequence

MARISTRQLFRFTHASLPLPFKSVDRSSSPFSSFPEPDLSLETTNPPRHNRSYSLLQSCQSVRELLQIHGHLITSGRFKHHFWANRVLFQASEFGDVIYTVLVFRYINIPNTFCINRVIKAYSLSIVPLEAVSLYFEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGIDSVMVLRNSLIHMYGCCGHIELGRKVFDEMSSWDLVSWNSIVTAYARTGDLHTAHDLFDAMPERNIVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRSVHGFMYRTSMKFCVFINTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGNPDDGLKLFQEMAAKLREINGEIGNGKKFKQDEGKRNVYPDQITFIGVLCACARAGLLKDANNYFNEMINVFLVRPNFAHYWCLANVYVAAGLIQQAVEILRNVPEDDEDFSSDSVVWINLLTMCRFVGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLGTMPGCRLVDLKEIVHRLKLGNLLQDGMKETNTVMHKLASEVSLLSSIAAGQSDFGV
Homology
BLAST of Cla97C01G019950 vs. NCBI nr
Match: XP_038882774.1 (pentatricopeptide repeat-containing protein At3g51320 [Benincasa hispida])

HSP 1 Score: 1137.1 bits (2940), Expect = 0.0e+00
Identity = 545/596 (91.44%), Postives = 577/596 (96.81%), Query Frame = 0

Query: 1   MARISTRQLFRFTHASLPLPFKSVDRSSSPFSSFPEPDLSLETTNPPRHNRSYSLLQSCQ 60
           MARISTR+LFRFTHA LPLPFKSVDRSSSPFSSFPEPDLSL+TTNPPRHNRS+SLLQSCQ
Sbjct: 1   MARISTRKLFRFTHAPLPLPFKSVDRSSSPFSSFPEPDLSLDTTNPPRHNRSHSLLQSCQ 60

Query: 61  SVRELLQIHGHLITSGRFKHHFWANRVLFQASEFGDVIYTVLVFRYINIPNTFCINRVIK 120
           SVRELLQIHGHLITSG F HHFWANRVL QASEFGD++YTVL+FRYI++PNTFCINRVIK
Sbjct: 61  SVRELLQIHGHLITSGLFNHHFWANRVLLQASEFGDIVYTVLIFRYISVPNTFCINRVIK 120

Query: 121 AYSLSIVPLEAVSLYFEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGIDS 180
           AYSLS VPLEAV LYFEWLGNGFRPDSYTFL+LFSACA+FGC ASGRKCHGQAFKNG+DS
Sbjct: 121 AYSLSTVPLEAVFLYFEWLGNGFRPDSYTFLALFSACASFGCEASGRKCHGQAFKNGVDS 180

Query: 181 VMVLRNSLIHMYGCCGHIELGRKVFDEMSSWDLVSWNSIVTAYARTGDLHTAHDLFDAMP 240
           VMVLRNSLIHMYGCCGHIELGRKVFDEMS+WDLVSWNSIVTAYAR GDLH+AHD+FD MP
Sbjct: 181 VMVLRNSLIHMYGCCGHIELGRKVFDEMSTWDLVSWNSIVTAYARVGDLHSAHDMFDKMP 240

Query: 241 ERNIVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRS 300
           ERN+VSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNV GACGRSARLNEGRS
Sbjct: 241 ERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVFGACGRSARLNEGRS 300

Query: 301 VHGFMYRTSMKFCVFINTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGN 360
           VHGFMYR  M FCVFI+TALVDMYSKCQ+VSIARRVFDRMLSRNLVTWNAMVLGHCLHGN
Sbjct: 301 VHGFMYRNLMNFCVFIDTALVDMYSKCQKVSIARRVFDRMLSRNLVTWNAMVLGHCLHGN 360

Query: 361 PDDGLKLFQEMAAKLREINGEIGNGKKFKQDEGKRNVYPDQITFIGVLCACARAGLLKDA 420
           P+DGLKLF+EMAAKLREINGE G+GK+FKQ EGK+ V+PDQITFIGVLCACARAGLL+DA
Sbjct: 361 PEDGLKLFEEMAAKLREINGETGSGKEFKQYEGKQKVFPDQITFIGVLCACARAGLLEDA 420

Query: 421 NNYFNEMINVFLVRPNFAHYWCLANVYVAAGLIQQAVEILRNVPEDDEDFSSDSVVWINL 480
            NYF+EMINVFLVRPNFAHYWCLANVYVAAGLIQ+AVEILRN+PED+EDFSS+SVVWINL
Sbjct: 421 KNYFDEMINVFLVRPNFAHYWCLANVYVAAGLIQEAVEILRNMPEDNEDFSSESVVWINL 480

Query: 481 LTMCRFVGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLG 540
           LT CRFVGDVSLGEQIAKYLID+EPKNDSY RLLLNIYAVAGRWEDVSRIKLLMKEKRLG
Sbjct: 481 LTTCRFVGDVSLGEQIAKYLIDMEPKNDSYNRLLLNIYAVAGRWEDVSRIKLLMKEKRLG 540

Query: 541 TMPGCRLVDLKEIVHRLKLGNLLQDGMKETNTVMHKLASEVSLLSSIAAGQSDFGV 597
           TMPGCRL+DLKEIVHRLKLGNLLQ+GMKETNTVMHKLASEVSLLSSIAAGQSDFGV
Sbjct: 541 TMPGCRLIDLKEIVHRLKLGNLLQEGMKETNTVMHKLASEVSLLSSIAAGQSDFGV 596

BLAST of Cla97C01G019950 vs. NCBI nr
Match: XP_023544620.1 (pentatricopeptide repeat-containing protein At3g51320 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1074.7 bits (2778), Expect = 4.6e-310
Identity = 512/596 (85.91%), Postives = 559/596 (93.79%), Query Frame = 0

Query: 1   MARISTRQLFRFTHASLPLPFKSVDRSSSPFSSFPEPDLSLETTNPPRHNRSYSLLQSCQ 60
           MARI +RQLFRFT ASLP P KS+DR SSPF SF EPDLSL+T NPPRHNR +SLLQSCQ
Sbjct: 1   MARIYSRQLFRFTRASLPPPSKSIDRCSSPFCSFAEPDLSLDTRNPPRHNRCHSLLQSCQ 60

Query: 61  SVRELLQIHGHLITSGRFKHHFWANRVLFQASEFGDVIYTVLVFRYINIPNTFCINRVIK 120
           S+REL+QIHG+LITSG F HHFWANRVL QASEFGD++YTVL+F+ IN+PN FCINRVIK
Sbjct: 61  SMRELVQIHGYLITSGLFNHHFWANRVLLQASEFGDIVYTVLIFKLINVPNAFCINRVIK 120

Query: 121 AYSLSIVPLEAVSLYFEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGIDS 180
           AYSLS VPLEAV +YF+WLG+GFRPD+YTFLSLF ACA+ GCG+SGRKCHGQAFKNG+D 
Sbjct: 121 AYSLSSVPLEAVFVYFQWLGDGFRPDTYTFLSLFCACASIGCGSSGRKCHGQAFKNGVDC 180

Query: 181 VMVLRNSLIHMYGCCGHIELGRKVFDEMSSWDLVSWNSIVTAYARTGDLHTAHDLFDAMP 240
           VMVLRNSLIHMY CCGHIELGRKVFDEMS+ DLVSWNSIVTAYAR GDLHTAHD+FDAMP
Sbjct: 181 VMVLRNSLIHMYACCGHIELGRKVFDEMSTLDLVSWNSIVTAYARVGDLHTAHDMFDAMP 240

Query: 241 ERNIVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRS 300
           ERN+VSWNLMISEYLRGG+PGCAMKLFRNM+ IGIRGN+TTMVN+LGACGRSARLNEGRS
Sbjct: 241 ERNVVSWNLMISEYLRGGSPGCAMKLFRNMMKIGIRGNSTTMVNILGACGRSARLNEGRS 300

Query: 301 VHGFMYRTSMKFCVFINTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGN 360
           VHGFMYRTSMKFCVFI+TALVDMY KCQRVS+ARR+FDRM++RNLVTWNAMVLGHCLHGN
Sbjct: 301 VHGFMYRTSMKFCVFIDTALVDMYGKCQRVSVARRLFDRMVNRNLVTWNAMVLGHCLHGN 360

Query: 361 PDDGLKLFQEMAAKLREINGEIGNGKKFKQDEGKRNVYPDQITFIGVLCACARAGLLKDA 420
           P+DGLKLF+EMAAKLRE NGE G+GKKFKQDEG+R V+PDQITFIGVLCACARAGLL+DA
Sbjct: 361 PEDGLKLFEEMAAKLRERNGEAGSGKKFKQDEGERKVFPDQITFIGVLCACARAGLLEDA 420

Query: 421 NNYFNEMINVFLVRPNFAHYWCLANVYVAAGLIQQAVEILRNVPEDDEDFSSDSVVWINL 480
           NNYF+EMIN+FLVRPNFAHYWCLANVYVAAGLIQQAVEILRN+PED EDFSS+ VVW NL
Sbjct: 421 NNYFDEMINIFLVRPNFAHYWCLANVYVAAGLIQQAVEILRNMPEDIEDFSSELVVWTNL 480

Query: 481 LTMCRFVGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLG 540
           L  CRF GDVSLGEQIA YLID+EPKN+SYYRLLLNIYAVAGRWEDVSRIK+LMKEKRLG
Sbjct: 481 LATCRFGGDVSLGEQIANYLIDMEPKNESYYRLLLNIYAVAGRWEDVSRIKVLMKEKRLG 540

Query: 541 TMPGCRLVDLKEIVHRLKLGNLLQDGMKETNTVMHKLASEVSLLSSIAAGQSDFGV 597
           T PGCRLVDLKEIVHRLKLGNLLQ+GMKETN+VMHKLASEVSLLS+IAAGQSD  V
Sbjct: 541 TFPGCRLVDLKEIVHRLKLGNLLQEGMKETNSVMHKLASEVSLLSTIAAGQSDLRV 596

BLAST of Cla97C01G019950 vs. NCBI nr
Match: XP_022950690.1 (pentatricopeptide repeat-containing protein At3g51320 isoform X1 [Cucurbita moschata])

HSP 1 Score: 1066.2 bits (2756), Expect = 1.0e-307
Identity = 510/596 (85.57%), Postives = 554/596 (92.95%), Query Frame = 0

Query: 1   MARISTRQLFRFTHASLPLPFKSVDRSSSPFSSFPEPDLSLETTNPPRHNRSYSLLQSCQ 60
           MARI +RQLFRFT ASLP P K +DR  SPF SF EPDLSL+  NPPRHNR +SLLQSCQ
Sbjct: 1   MARIYSRQLFRFTRASLPPPSKYIDRCPSPFCSFAEPDLSLDARNPPRHNRCHSLLQSCQ 60

Query: 61  SVRELLQIHGHLITSGRFKHHFWANRVLFQASEFGDVIYTVLVFRYINIPNTFCINRVIK 120
           S+REL+QIHGHLITSG F HHFWANRVL QASEFGD++YTVL+F+ IN+PN FCINRVIK
Sbjct: 61  SMRELVQIHGHLITSGLFNHHFWANRVLLQASEFGDIVYTVLIFKLINVPNAFCINRVIK 120

Query: 121 AYSLSIVPLEAVSLYFEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGIDS 180
           AYSLS  PLEAV +YF+WLG+GFRPD+YTFLSLF ACA+ GCG+SGRKCHGQAFKNG+D 
Sbjct: 121 AYSLSSDPLEAVFVYFQWLGDGFRPDTYTFLSLFCACASIGCGSSGRKCHGQAFKNGVDC 180

Query: 181 VMVLRNSLIHMYGCCGHIELGRKVFDEMSSWDLVSWNSIVTAYARTGDLHTAHDLFDAMP 240
           VMVLRNSLIHMYGCCGHIELGRKVFDEM + DLVSWNSIVTAYAR GDLHTAHD+FDAMP
Sbjct: 181 VMVLRNSLIHMYGCCGHIELGRKVFDEMLTLDLVSWNSIVTAYARVGDLHTAHDMFDAMP 240

Query: 241 ERNIVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRS 300
           ERN+VSWNLMISEYLRGG+PGCAMKLFRNM+ IGIRGN+TTMVN+LGACGRSARLNEGRS
Sbjct: 241 ERNVVSWNLMISEYLRGGSPGCAMKLFRNMMKIGIRGNSTTMVNILGACGRSARLNEGRS 300

Query: 301 VHGFMYRTSMKFCVFINTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGN 360
           VHGFMYRTSMKFCVFI+TALVDMY KCQRVS+ARR+FDRM++RNLVTWNAMVLGHCLHGN
Sbjct: 301 VHGFMYRTSMKFCVFIDTALVDMYGKCQRVSVARRLFDRMVNRNLVTWNAMVLGHCLHGN 360

Query: 361 PDDGLKLFQEMAAKLREINGEIGNGKKFKQDEGKRNVYPDQITFIGVLCACARAGLLKDA 420
           P+DGLKLF+EMAAKLRE NGE G+GKKFKQDEG+R V+ DQITFIGVLCACARAGLL+DA
Sbjct: 361 PEDGLKLFEEMAAKLRERNGEAGSGKKFKQDEGERKVFLDQITFIGVLCACARAGLLEDA 420

Query: 421 NNYFNEMINVFLVRPNFAHYWCLANVYVAAGLIQQAVEILRNVPEDDEDFSSDSVVWINL 480
           NNYF+EMINVFLVRPNFAHYWCLANVYVAAGLIQQAVEILRN+PED EDFSS+ VVW NL
Sbjct: 421 NNYFDEMINVFLVRPNFAHYWCLANVYVAAGLIQQAVEILRNMPEDIEDFSSELVVWTNL 480

Query: 481 LTMCRFVGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLG 540
           L  CRF GDVSLGEQIA YLID+EPKN+SYYRLLLNIYAVAGRWEDVSRIK+LMKEKRLG
Sbjct: 481 LATCRFGGDVSLGEQIANYLIDMEPKNESYYRLLLNIYAVAGRWEDVSRIKVLMKEKRLG 540

Query: 541 TMPGCRLVDLKEIVHRLKLGNLLQDGMKETNTVMHKLASEVSLLSSIAAGQSDFGV 597
           T PGCRLVDLKEIVHRLKLGNLLQ+GMKETNTVMHKLASEVSLLS+IAAGQSD  V
Sbjct: 541 TFPGCRLVDLKEIVHRLKLGNLLQEGMKETNTVMHKLASEVSLLSTIAAGQSDLRV 596

BLAST of Cla97C01G019950 vs. NCBI nr
Match: KAG6603895.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1064.3 bits (2751), Expect = 3.8e-307
Identity = 510/596 (85.57%), Postives = 553/596 (92.79%), Query Frame = 0

Query: 1   MARISTRQLFRFTHASLPLPFKSVDRSSSPFSSFPEPDLSLETTNPPRHNRSYSLLQSCQ 60
           MARI +RQLFRFT ASLP P K +DR  SPF SF EPDLSL+  NPPRHNR +SLLQSCQ
Sbjct: 1   MARIYSRQLFRFTRASLPPPSKYIDRCPSPFCSFAEPDLSLDARNPPRHNRCHSLLQSCQ 60

Query: 61  SVRELLQIHGHLITSGRFKHHFWANRVLFQASEFGDVIYTVLVFRYINIPNTFCINRVIK 120
           S+REL+QIHGHLITSG F HHFWANRVL QASEFGD++YTVL+F+ I +PN FCINRVIK
Sbjct: 61  SMRELVQIHGHLITSGLFNHHFWANRVLLQASEFGDIVYTVLIFKLIKVPNAFCINRVIK 120

Query: 121 AYSLSIVPLEAVSLYFEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGIDS 180
           AYSLS  PLEAV +YF+WLG GFRPD+YTFLSLF ACA+FGCG+SGRKCHGQAFKNG+D 
Sbjct: 121 AYSLSSDPLEAVFVYFQWLGAGFRPDTYTFLSLFCACASFGCGSSGRKCHGQAFKNGVDC 180

Query: 181 VMVLRNSLIHMYGCCGHIELGRKVFDEMSSWDLVSWNSIVTAYARTGDLHTAHDLFDAMP 240
           VMVLRNSLIHMYGCCGHIELGRKVFDEMS+ DLVSWNSIVTAYAR GDL TAHD+FDAMP
Sbjct: 181 VMVLRNSLIHMYGCCGHIELGRKVFDEMSTLDLVSWNSIVTAYARVGDLQTAHDMFDAMP 240

Query: 241 ERNIVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRS 300
           ERN+VSWNLMISEYLRGG+PGCAMKLFRNM+ IGIRGN+TTMVN+LGACGRSARLNEGRS
Sbjct: 241 ERNVVSWNLMISEYLRGGSPGCAMKLFRNMMKIGIRGNSTTMVNILGACGRSARLNEGRS 300

Query: 301 VHGFMYRTSMKFCVFINTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGN 360
           VHGFMYRTSMKFCVFI+TALVDMY KCQRVS+ARR+FDRM++RNLVTWNAMVLGHCLHGN
Sbjct: 301 VHGFMYRTSMKFCVFIDTALVDMYGKCQRVSVARRLFDRMVNRNLVTWNAMVLGHCLHGN 360

Query: 361 PDDGLKLFQEMAAKLREINGEIGNGKKFKQDEGKRNVYPDQITFIGVLCACARAGLLKDA 420
           P+DGLKLF+EMAAKLRE NGE G+GKKFKQDEG+R V+ DQITFIGVLCACARAGLL+DA
Sbjct: 361 PEDGLKLFEEMAAKLRERNGEAGSGKKFKQDEGERKVFLDQITFIGVLCACARAGLLEDA 420

Query: 421 NNYFNEMINVFLVRPNFAHYWCLANVYVAAGLIQQAVEILRNVPEDDEDFSSDSVVWINL 480
           NNYF+EMINVFLVRPNFAHYWCLANVYVAAGLIQQAVEILRN+PED EDFSS+ VVW NL
Sbjct: 421 NNYFDEMINVFLVRPNFAHYWCLANVYVAAGLIQQAVEILRNMPEDIEDFSSELVVWTNL 480

Query: 481 LTMCRFVGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLG 540
           L  CRF GDVSLGEQIA YLID+EPKN+SYYRLLLNIYAVAGRWEDVSRIK+LMKEKRLG
Sbjct: 481 LATCRFGGDVSLGEQIANYLIDMEPKNESYYRLLLNIYAVAGRWEDVSRIKVLMKEKRLG 540

Query: 541 TMPGCRLVDLKEIVHRLKLGNLLQDGMKETNTVMHKLASEVSLLSSIAAGQSDFGV 597
           T PGCRLVDLKEIVHRLKLGNLLQ+GMKETNTVMHKLASEVSLLS+IAAGQSD  V
Sbjct: 541 TFPGCRLVDLKEIVHRLKLGNLLQEGMKETNTVMHKLASEVSLLSTIAAGQSDLRV 596

BLAST of Cla97C01G019950 vs. NCBI nr
Match: XP_022978359.1 (pentatricopeptide repeat-containing protein At3g51320 isoform X1 [Cucurbita maxima])

HSP 1 Score: 1050.8 bits (2716), Expect = 4.4e-303
Identity = 506/596 (84.90%), Postives = 549/596 (92.11%), Query Frame = 0

Query: 1   MARISTRQLFRFTHASLPLPFKSVDRSSSPFSSFPEPDLSLETTNPPRHNRSYSLLQSCQ 60
           MARI +RQLFRFT ASLP P KS+DR SSPF SF EPDLSL+T NPPRHNR +SLLQSCQ
Sbjct: 1   MARIYSRQLFRFTRASLPPPSKSIDRCSSPFCSFAEPDLSLDTINPPRHNRCHSLLQSCQ 60

Query: 61  SVRELLQIHGHLITSGRFKHHFWANRVLFQASEFGDVIYTVLVFRYINIPNTFCINRVIK 120
           S+REL+QIHGHLITSG F HHFWANRVL QASEFGD++YTVL+F+ IN+PN FCINRVIK
Sbjct: 61  SIRELVQIHGHLITSGLFNHHFWANRVLLQASEFGDIVYTVLIFKLINVPNAFCINRVIK 120

Query: 121 AYSLSIVPLEAVSLYFEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGIDS 180
           AYSLS  PLEAV +YF+WLG+GFRPD+YTFLSLF ACA+ GCG+SGRKCHGQAFKNG+D 
Sbjct: 121 AYSLSSDPLEAVFVYFQWLGDGFRPDTYTFLSLFCACASIGCGSSGRKCHGQAFKNGVDC 180

Query: 181 VMVLRNSLIHMYGCCGHIELGRKVFDEMSSWDLVSWNSIVTAYARTGDLHTAHDLFDAMP 240
           VMVLRNSLIHMYGCCG+IELGRKVFDEMS+ DLVSWNSIVTAYAR GDLHTAHD+FDAMP
Sbjct: 181 VMVLRNSLIHMYGCCGYIELGRKVFDEMSTLDLVSWNSIVTAYARVGDLHTAHDMFDAMP 240

Query: 241 ERNIVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRS 300
           ERN+VSWNLMISEYLRGG+PGCAMKLFRNM+ IGIRGN+TTMVN+LGACGRSA LNEGRS
Sbjct: 241 ERNVVSWNLMISEYLRGGSPGCAMKLFRNMMKIGIRGNSTTMVNILGACGRSASLNEGRS 300

Query: 301 VHGFMYRTSMKFCVFINTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGN 360
           VHGFMYRTS KFCVFI TALVDMY KCQRV IARR+FDRM +RNLVTWNAMVLGHCLHGN
Sbjct: 301 VHGFMYRTSTKFCVFIGTALVDMYGKCQRVCIARRLFDRMPNRNLVTWNAMVLGHCLHGN 360

Query: 361 PDDGLKLFQEMAAKLREINGEIGNGKKFKQDEGKRNVYPDQITFIGVLCACARAGLLKDA 420
           P+DGLKL++EMAAKLRE NGE G+GKKFKQDEG+R V+PDQITFIGVLCACARAGLL+DA
Sbjct: 361 PEDGLKLYEEMAAKLRERNGEAGSGKKFKQDEGERKVFPDQITFIGVLCACARAGLLEDA 420

Query: 421 NNYFNEMINVFLVRPNFAHYWCLANVYVAAGLIQQAVEILRNVPEDDEDFSSDSVVWINL 480
           NNYF+EMINVFLV+PNFAHYWCLANVYVAAGLIQQAVEILRN+ ED EDFSS+ VVW NL
Sbjct: 421 NNYFDEMINVFLVKPNFAHYWCLANVYVAAGLIQQAVEILRNMTEDIEDFSSELVVWTNL 480

Query: 481 LTMCRFVGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLG 540
           L  CRF GDVSLGEQIA YLID+EPKN+SYYRLLLNIYAVAGRWEDVSRIK+LMKEKRLG
Sbjct: 481 LATCRFRGDVSLGEQIANYLIDMEPKNESYYRLLLNIYAVAGRWEDVSRIKVLMKEKRLG 540

Query: 541 TMPGCRLVDLKEIVHRLKLGNLLQDGMKETNTVMHKLASEVSLLSSIAAGQSDFGV 597
           T PGCRLVDLKEIVHRLKLGNLLQ    ETNTVMHKLASEVSLLS+IAAGQSD  V
Sbjct: 541 TFPGCRLVDLKEIVHRLKLGNLLQ----ETNTVMHKLASEVSLLSTIAAGQSDLRV 592

BLAST of Cla97C01G019950 vs. ExPASy Swiss-Prot
Match: Q0WVU0 (Pentatricopeptide repeat-containing protein At3g51320 OS=Arabidopsis thaliana OX=3702 GN=At3g51320 PE=2 SV=1)

HSP 1 Score: 535.0 bits (1377), Expect = 1.1e-150
Identity = 254/510 (49.80%), Postives = 345/510 (67.65%), Query Frame = 0

Query: 51  RSYSLLQSCQSVRELLQIHGHLITSGRFKHHFWANRVLFQASEFGDVIYTVLVFRYINIP 110
           + + L++   S+  L Q+H  LITSG F    WA R+L  +S FGD  YTV ++R  +I 
Sbjct: 24  KGFKLVEDSNSITHLFQVHARLITSGNFWDSSWAIRLLKSSSRFGDSSYTVSIYR--SIG 83

Query: 111 NTFCINRVIKAYSLSIVPLEAVSLYFEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCH 170
             +C N V KAY +S  P +A+  YF+ L  GF PDSYTF+SL S      C  SG+ CH
Sbjct: 84  KLYCANPVFKAYLVSSSPKQALGFYFDILRFGFVPDSYTFVSLISCIEKTCCVDSGKMCH 143

Query: 171 GQAFKNGIDSVMVLRNSLIHMYGCCGHIELGRKVFDEMSSWDLVSWNSIVTAYARTGDLH 230
           GQA K+G D V+ ++NSL+HMY CCG ++L +K+F E+   D+VSWNSI+    R GD+ 
Sbjct: 144 GQAIKHGCDQVLPVQNSLMHMYTCCGALDLAKKLFVEIPKRDIVSWNSIIAGMVRNGDVL 203

Query: 231 TAHDLFDAMPERNIVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACG 290
            AH LFD MP++NI+SWN+MIS YL   NPG ++ LFR MV  G +GN +T+V +L ACG
Sbjct: 204 AAHKLFDEMPDKNIISWNIMISAYLGANNPGVSISLFREMVRAGFQGNESTLVLLLNACG 263

Query: 291 RSARLNEGRSVHGFMYRTSMKFCVFINTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNA 350
           RSARL EGRSVH  + RT +   V I+TAL+DMY KC+ V +ARR+FD +  RN VTWN 
Sbjct: 264 RSARLKEGRSVHASLIRTFLNSSVVIDTALIDMYGKCKEVGLARRIFDSLSIRNKVTWNV 323

Query: 351 MVLGHCLHGNPDDGLKLFQEMAAKLREINGEIGNGKKFKQDEGKRNVYPDQITFIGVLCA 410
           M+L HCLHG P+ GL+LF+ M      ING +                PD++TF+GVLC 
Sbjct: 324 MILAHCLHGRPEGGLELFEAM------INGML---------------RPDEVTFVGVLCG 383

Query: 411 CARAGLLKDANNYFNEMINVFLVRPNFAHYWCLANVYVAAGLIQQAVEILRNVPEDDEDF 470
           CARAGL+    +Y++ M++ F ++PNF H WC+AN+Y +AG  ++A E L+N+P  DED 
Sbjct: 384 CARAGLVSQGQSYYSLMVDEFQIKPNFGHQWCMANLYSSAGFPEEAEEALKNLP--DEDV 443

Query: 471 SSDSVVWINLLTMCRFVGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRI 530
           + +S  W NLL+  RF G+ +LGE IAK LI+ +P N  YY LL+NIY+V GRWEDV+R+
Sbjct: 444 TPESTKWANLLSSSRFTGNPTLGESIAKSLIETDPLNYKYYHLLMNIYSVTGRWEDVNRV 503

Query: 531 KLLMKEKRLGTMPGCRLVDLKEIVHRLKLG 561
           + ++KE+++G +PGC LVDLKEIVH L+LG
Sbjct: 504 REMVKERKIGRIPGCGLVDLKEIVHGLRLG 508

BLAST of Cla97C01G019950 vs. ExPASy Swiss-Prot
Match: Q9SJG6 (Pentatricopeptide repeat-containing protein At2g42920, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-E75 PE=2 SV=1)

HSP 1 Score: 307.0 bits (785), Expect = 4.7e-82
Identity = 167/500 (33.40%), Postives = 269/500 (53.80%), Query Frame = 0

Query: 59  CQSVRELLQIHGHLITSGRFKHHFWANRVL-FQASEFGDVIYTVLVFRYINIPNTFCINR 118
           C ++REL QIH  LI +G       A+RVL F  +   D+ Y  LVF  IN  N F  N 
Sbjct: 35  CSTMRELKQIHASLIKTGLISDTVTASRVLAFCCASPSDMNYAYLVFTRINHKNPFVWNT 94

Query: 119 VIKAYSLSIVPLEAVSLYFEWL--GNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFK 178
           +I+ +S S  P  A+S++ + L      +P   T+ S+F A    G    GR+ HG   K
Sbjct: 95  IIRGFSRSSFPEMAISIFIDMLCSSPSVKPQRLTYPSVFKAYGRLGQARDGRQLHGMVIK 154

Query: 179 NGIDSVMVLRNSLIHMYGCCGHIELGRKVFDEMSSWDLVSWNSIVTAYARTGDLHTAHDL 238
            G++    +RN+++HMY  CG +    ++F  M  +D+V+WNS++  +A+ G +  A +L
Sbjct: 155 EGLEDDSFIRNTMLHMYVTCGCLIEAWRIFLGMIGFDVVAWNSMIMGFAKCGLIDQAQNL 214

Query: 239 FDAMPERNIVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARL 298
           FD MP+RN VSWN MIS ++R G    A+ +FR M    ++ +  TMV++L AC      
Sbjct: 215 FDEMPQRNGVSWNSMISGFVRNGRFKDALDMFREMQEKDVKPDGFTMVSLLNACAYLGAS 274

Query: 299 NEGRSVHGFMYRTSMKFCVFINTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGH 358
            +GR +H ++ R   +    + TAL+DMY KC  +     VF+    + L  WN+M+LG 
Sbjct: 275 EQGRWIHEYIVRNRFELNSIVVTALIDMYCKCGCIEEGLNVFECAPKKQLSCWNSMILGL 334

Query: 359 CLHGNPDDGLKLFQEMAAKLREINGEIGNGKKFKQDEGKRNVYPDQITFIGVLCACARAG 418
             +G  +  + LF E+                      +  + PD ++FIGVL ACA +G
Sbjct: 335 ANNGFEERAMDLFSELE---------------------RSGLEPDSVSFIGVLTACAHSG 394

Query: 419 LLKDANNYFNEMINVFLVRPNFAHYWCLANVYVAAGLIQQAVEILRNVPEDDEDFSSDSV 478
            +  A+ +F  M   +++ P+  HY  + NV   AGL+++A  +++N+P ++     D+V
Sbjct: 395 EVHRADEFFRLMKEKYMIEPSIKHYTLMVNVLGGAGLLEEAEALIKNMPVEE-----DTV 454

Query: 479 VWINLLTMCRFVGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMK 538
           +W +LL+ CR +G+V + ++ AK L  L+P     Y LL N YA  G +E+    +LLMK
Sbjct: 455 IWSSLLSACRKIGNVEMAKRAAKCLKKLDPDETCGYVLLSNAYASYGLFEEAVEQRLLMK 508

Query: 539 EKRLGTMPGCRLVDLKEIVH 556
           E+++    GC  +++   VH
Sbjct: 515 ERQMEKEVGCSSIEVDFEVH 508

BLAST of Cla97C01G019950 vs. ExPASy Swiss-Prot
Match: Q9CA54 (Pentatricopeptide repeat-containing protein At1g74630 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H71 PE=2 SV=1)

HSP 1 Score: 303.5 bits (776), Expect = 5.2e-81
Identity = 180/572 (31.47%), Postives = 290/572 (50.70%), Query Frame = 0

Query: 54  SLLQSCQSVRELLQIHGHLITSG-RFKHHFWANRVLFQASEFGDVI-YTVLVFRYINIPN 113
           SLL SC+++R L QIHG  I  G     +F    +L  A    D + Y   +      P+
Sbjct: 10  SLLNSCKNLRALTQIHGLFIKYGVDTDSYFTGKLILHCAISISDALPYARRLLLCFPEPD 69

Query: 114 TFCINRVIKAYSLSIVPLEAVSLYFEWLGNGF-RPDSYTFLSLFSACANFGCGASGRKCH 173
            F  N +++ YS S  P  +V+++ E +  GF  PDS++F  +  A  NF    +G + H
Sbjct: 70  AFMFNTLVRGYSESDEPHNSVAVFVEMMRKGFVFPDSFSFAFVIKAVENFRSLRTGFQMH 129

Query: 174 GQAFKNGIDSVMVLRNSLIHMYGCCGHIELGRKVFDEMSSWDLVSWNSIVTAYARTGDLH 233
            QA K+G++S + +  +LI MYG CG +E  RKVFDEM   +LV+WN+++TA  R  D+ 
Sbjct: 130 CQALKHGLESHLFVGTTLIGMYGGCGCVEFARKVFDEMHQPNLVAWNAVITACFRGNDVA 189

Query: 234 TAHDLFDAMPERNIVSWNLMISEYLRGGNPGCAMKL------------------------ 293
            A ++FD M  RN  SWN+M++ Y++ G    A ++                        
Sbjct: 190 GAREIFDKMLVRNHTSWNVMLAGYIKAGELESAKRIFSEMPHRDDVSWSTMIVGIAHNGS 249

Query: 294 -------FRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRSVHGFMYRTSMKFCVFINTA 353
                  FR +   G+  N  ++  VL AC +S     G+ +HGF+ +    + V +N A
Sbjct: 250 FNESFLYFRELQRAGMSPNEVSLTGVLSACSQSGSFEFGKILHGFVEKAGYSWIVSVNNA 309

Query: 354 LVDMYSKCQRVSIARRVFDRML-SRNLVTWNAMVLGHCLHGNPDDGLKLFQEMAAKLREI 413
           L+DMYS+C  V +AR VF+ M   R +V+W +M+ G  +HG  ++ ++LF EM A     
Sbjct: 310 LIDMYSRCGNVPMARLVFEGMQEKRCIVSWTSMIAGLAMHGQGEEAVRLFNEMTA----- 369

Query: 414 NGEIGNGKKFKQDEGKRNVYPDQITFIGVLCACARAGLLKDANNYFNEMINVFLVRPNFA 473
                             V PD I+FI +L AC+ AGL+++  +YF+EM  V+ + P   
Sbjct: 370 ----------------YGVTPDGISFISLLHACSHAGLIEEGEDYFSEMKRVYHIEPEIE 429

Query: 474 HYWCLANVYVAAGLIQQAVEILRNVPEDDEDFSSDSVVWINLLTMCRFVGDVSLGEQIAK 533
           HY C+ ++Y  +G +Q+A + +  +P         ++VW  LL  C   G++ L EQ+ +
Sbjct: 430 HYGCMVDLYGRSGKLQKAYDFICQMP-----IPPTAIVWRTLLGACSSHGNIELAEQVKQ 489

Query: 534 YLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLGTMPGCRLVDLKEIVHRLK 591
            L +L+P N     LL N YA AG+W+DV+ I+  M  +R+       LV++ + +++  
Sbjct: 490 RLNELDPNNSGDLVLLSNAYATAGKWKDVASIRKSMIVQRIKKTTAWSLVEVGKTMYKFT 549

BLAST of Cla97C01G019950 vs. ExPASy Swiss-Prot
Match: Q9SZT8 (Pentatricopeptide repeat-containing protein ELI1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=ELI1 PE=3 SV=1)

HSP 1 Score: 300.8 bits (769), Expect = 3.4e-80
Identity = 184/564 (32.62%), Postives = 292/564 (51.77%), Query Frame = 0

Query: 27  SSSPF--SSFPEPDLSLETT---NPPRHNRSYSLLQSCQSVRELLQIHGHLITSGRFKHH 86
           +SSP   +S P+  LS   T     P   +   L+   QSV E+LQIH  ++      H 
Sbjct: 2   ASSPLLATSLPQNQLSTTATARFRLPPPEKLAVLIDKSQSVDEVLQIHAAILRHNLLLHP 61

Query: 87  FW--ANRVLFQA-SEFGDVIYTVLVFRYINIPNTFCINRVIKAYSLSIVPLEAVSLYFEW 146
            +   N  L +A +  G + +++ +F     P+ F     I   S++ +  +A  LY + 
Sbjct: 62  RYPVLNLKLHRAYASHGKIRHSLALFHQTIDPDLFLFTAAINTASINGLKDQAFLLYVQL 121

Query: 147 LGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGIDSVMVLRNSLIHMYGCCGHI 206
           L +   P+ +TF SL  +C+      SG+  H    K G+     +   L+ +Y   G +
Sbjct: 122 LSSEINPNEFTFSSLLKSCST----KSGKLIHTHVLKFGLGIDPYVATGLVDVYAKGGDV 181

Query: 207 ELGRKVFDEMSSWDLVSWNSIVTAYARTGDLHTAHDLFDAMPERNIVSWNLMISEYLRGG 266
              +KVFD M    LVS  +++T YA+ G++  A  LFD+M ER+IVSWN+MI  Y + G
Sbjct: 182 VSAQKVFDRMPERSLVSSTAMITCYAKQGNVEAARALFDSMCERDIVSWNVMIDGYAQHG 241

Query: 267 NPGCAMKLFRNMVNIG-IRGNNTTMVNVLGACGRSARLNEGRSVHGFMYRTSMKFCVFIN 326
            P  A+ LF+ ++  G  + +  T+V  L AC +   L  GR +H F+  + ++  V + 
Sbjct: 242 FPNDALMLFQKLLAEGKPKPDEITVVAALSACSQIGALETGRWIHVFVKSSRIRLNVKVC 301

Query: 327 TALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGNPDDGLKLFQEMAAKLRE 386
           T L+DMYSKC  +  A  VF+    +++V WNAM+ G+ +HG   D L+LF EM      
Sbjct: 302 TGLIDMYSKCGSLEEAVLVFNDTPRKDIVAWNAMIAGYAMHGYSQDALRLFNEM------ 361

Query: 387 INGEIGNGKKFKQDEGKRNVYPDQITFIGVLCACARAGLLKDANNYFNEMINVFLVRPNF 446
                         +G   + P  ITFIG L ACA AGL+ +    F  M   + ++P  
Sbjct: 362 --------------QGITGLQPTDITFIGTLQACAHAGLVNEGIRIFESMGQEYGIKPKI 421

Query: 447 AHYWCLANVYVAAGLIQQAVEILRNVPEDDEDFSSDSVVWINLLTMCRFVGDVSLGEQIA 506
            HY CL ++   AG +++A E ++N+  D     +DSV+W ++L  C+  GD  LG++IA
Sbjct: 422 EHYGCLVSLLGRAGQLKRAYETIKNMNMD-----ADSVLWSSVLGSCKLHGDFVLGKEIA 481

Query: 507 KYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLGTMPGCRLVDLKEIVHRL 566
           +YLI L  KN   Y LL NIYA  G +E V++++ LMKEK +   PG   ++++  VH  
Sbjct: 482 EYLIGLNIKNSGIYVLLSNIYASVGDYEGVAKVRNLMKEKGIVKEPGISTIEIENKVHEF 536

Query: 567 KLGNLLQDGMKETNTVMHKLASEV 582
           + G+      KE  T++ K++  +
Sbjct: 542 RAGDREHSKSKEIYTMLRKISERI 536

BLAST of Cla97C01G019950 vs. ExPASy Swiss-Prot
Match: Q9FI80 (Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H38 PE=2 SV=1)

HSP 1 Score: 292.4 bits (747), Expect = 1.2e-77
Identity = 174/577 (30.16%), Postives = 293/577 (50.78%), Query Frame = 0

Query: 29  SPFSSFPEPDLSLETTNPPRHNRS-YSLLQSCQSVRELLQIHGHLITSGRFKHHFWANRV 88
           +P  +   P  +   ++P  H  S +  + +C+++R+L QIH   I SG+ +    A  +
Sbjct: 2   NPTQTLFSPGGNSPASSPASHPSSLFPQINNCRTIRDLSQIHAVFIKSGQMRDTLAAAEI 61

Query: 89  L-FQAS---EFGDVIYTVLVFRYINIPNTFCINRVIKAYSLS--IVPLEAVSLYFEWLGN 148
           L F A+      D+ Y   +F  +   N F  N +I+ +S S     L A++L++E + +
Sbjct: 62  LRFCATSDLHHRDLDYAHKIFNQMPQRNCFSWNTIIRGFSESDEDKALIAITLFYEMMSD 121

Query: 149 GF-RPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGIDSVMVLRNSLIHMYGCCGHIEL 208
            F  P+ +TF S+  ACA  G    G++ HG A K G      + ++L+ MY  CG ++ 
Sbjct: 122 EFVEPNRFTFPSVLKACAKTGKIQEGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFMKD 181

Query: 209 GRKVF--------------DEMSSWDLVSWNSIVTAYARTGDLHTAHDLFDAMPERNIVS 268
            R +F                    ++V WN ++  Y R GD   A  LFD M +R++VS
Sbjct: 182 ARVLFYKNIIEKDMVVMTDRRKRDGEIVLWNVMIDGYMRLGDCKAARMLFDKMRQRSVVS 241

Query: 269 WNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRSVHGFMY 328
           WN MIS Y   G    A+++FR M    IR N  T+V+VL A  R   L  G  +H +  
Sbjct: 242 WNTMISGYSLNGFFKDAVEVFREMKKGDIRPNYVTLVSVLPAISRLGSLELGEWLHLYAE 301

Query: 329 RTSMKFCVFINTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGNPDDGLK 388
            + ++    + +AL+DMYSKC  +  A  VF+R+   N++TW+AM+ G  +HG   D + 
Sbjct: 302 DSGIRIDDVLGSALIDMYSKCGIIEKAIHVFERLPRENVITWSAMINGFAIHGQAGDAID 361

Query: 389 LFQEMAAKLREINGEIGNGKKFKQDEGKRNVYPDQITFIGVLCACARAGLLKDANNYFNE 448
            F +M                      +  V P  + +I +L AC+  GL+++   YF++
Sbjct: 362 CFCKMR---------------------QAGVRPSDVAYINLLTACSHGGLVEEGRRYFSQ 421

Query: 449 MINVFLVRPNFAHYWCLANVYVAAGLIQQAVEILRNVPEDDEDFSSDSVVWINLLTMCRF 508
           M++V  + P   HY C+ ++   +GL+ +A E + N+P        D V+W  LL  CR 
Sbjct: 422 MVSVDGLEPRIEHYGCMVDLLGRSGLLDEAEEFILNMP-----IKPDDVIWKALLGACRM 481

Query: 509 VGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLGTMPGCR 568
            G+V +G+++A  L+D+ P +   Y  L N+YA  G W +VS ++L MKEK +   PGC 
Sbjct: 482 QGNVEMGKRVANILMDMVPHDSGAYVALSNMYASQGNWSEVSEMRLRMKEKDIRKDPGCS 541

Query: 569 LVDLKEIVHRLKLGNLLQDGMKETNTVMHKLASEVSL 584
           L+D+  ++H   + +      KE N+++ +++ ++ L
Sbjct: 542 LIDIDGVLHEFVVEDDSHPKAKEINSMLVEISDKLRL 552

BLAST of Cla97C01G019950 vs. ExPASy TrEMBL
Match: A0A6J1GGG4 (pentatricopeptide repeat-containing protein At3g51320 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111453712 PE=4 SV=1)

HSP 1 Score: 1066.2 bits (2756), Expect = 4.9e-308
Identity = 510/596 (85.57%), Postives = 554/596 (92.95%), Query Frame = 0

Query: 1   MARISTRQLFRFTHASLPLPFKSVDRSSSPFSSFPEPDLSLETTNPPRHNRSYSLLQSCQ 60
           MARI +RQLFRFT ASLP P K +DR  SPF SF EPDLSL+  NPPRHNR +SLLQSCQ
Sbjct: 1   MARIYSRQLFRFTRASLPPPSKYIDRCPSPFCSFAEPDLSLDARNPPRHNRCHSLLQSCQ 60

Query: 61  SVRELLQIHGHLITSGRFKHHFWANRVLFQASEFGDVIYTVLVFRYINIPNTFCINRVIK 120
           S+REL+QIHGHLITSG F HHFWANRVL QASEFGD++YTVL+F+ IN+PN FCINRVIK
Sbjct: 61  SMRELVQIHGHLITSGLFNHHFWANRVLLQASEFGDIVYTVLIFKLINVPNAFCINRVIK 120

Query: 121 AYSLSIVPLEAVSLYFEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGIDS 180
           AYSLS  PLEAV +YF+WLG+GFRPD+YTFLSLF ACA+ GCG+SGRKCHGQAFKNG+D 
Sbjct: 121 AYSLSSDPLEAVFVYFQWLGDGFRPDTYTFLSLFCACASIGCGSSGRKCHGQAFKNGVDC 180

Query: 181 VMVLRNSLIHMYGCCGHIELGRKVFDEMSSWDLVSWNSIVTAYARTGDLHTAHDLFDAMP 240
           VMVLRNSLIHMYGCCGHIELGRKVFDEM + DLVSWNSIVTAYAR GDLHTAHD+FDAMP
Sbjct: 181 VMVLRNSLIHMYGCCGHIELGRKVFDEMLTLDLVSWNSIVTAYARVGDLHTAHDMFDAMP 240

Query: 241 ERNIVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRS 300
           ERN+VSWNLMISEYLRGG+PGCAMKLFRNM+ IGIRGN+TTMVN+LGACGRSARLNEGRS
Sbjct: 241 ERNVVSWNLMISEYLRGGSPGCAMKLFRNMMKIGIRGNSTTMVNILGACGRSARLNEGRS 300

Query: 301 VHGFMYRTSMKFCVFINTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGN 360
           VHGFMYRTSMKFCVFI+TALVDMY KCQRVS+ARR+FDRM++RNLVTWNAMVLGHCLHGN
Sbjct: 301 VHGFMYRTSMKFCVFIDTALVDMYGKCQRVSVARRLFDRMVNRNLVTWNAMVLGHCLHGN 360

Query: 361 PDDGLKLFQEMAAKLREINGEIGNGKKFKQDEGKRNVYPDQITFIGVLCACARAGLLKDA 420
           P+DGLKLF+EMAAKLRE NGE G+GKKFKQDEG+R V+ DQITFIGVLCACARAGLL+DA
Sbjct: 361 PEDGLKLFEEMAAKLRERNGEAGSGKKFKQDEGERKVFLDQITFIGVLCACARAGLLEDA 420

Query: 421 NNYFNEMINVFLVRPNFAHYWCLANVYVAAGLIQQAVEILRNVPEDDEDFSSDSVVWINL 480
           NNYF+EMINVFLVRPNFAHYWCLANVYVAAGLIQQAVEILRN+PED EDFSS+ VVW NL
Sbjct: 421 NNYFDEMINVFLVRPNFAHYWCLANVYVAAGLIQQAVEILRNMPEDIEDFSSELVVWTNL 480

Query: 481 LTMCRFVGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLG 540
           L  CRF GDVSLGEQIA YLID+EPKN+SYYRLLLNIYAVAGRWEDVSRIK+LMKEKRLG
Sbjct: 481 LATCRFGGDVSLGEQIANYLIDMEPKNESYYRLLLNIYAVAGRWEDVSRIKVLMKEKRLG 540

Query: 541 TMPGCRLVDLKEIVHRLKLGNLLQDGMKETNTVMHKLASEVSLLSSIAAGQSDFGV 597
           T PGCRLVDLKEIVHRLKLGNLLQ+GMKETNTVMHKLASEVSLLS+IAAGQSD  V
Sbjct: 541 TFPGCRLVDLKEIVHRLKLGNLLQEGMKETNTVMHKLASEVSLLSTIAAGQSDLRV 596

BLAST of Cla97C01G019950 vs. ExPASy TrEMBL
Match: A0A6J1ITV7 (pentatricopeptide repeat-containing protein At3g51320 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111478374 PE=4 SV=1)

HSP 1 Score: 1050.8 bits (2716), Expect = 2.1e-303
Identity = 506/596 (84.90%), Postives = 549/596 (92.11%), Query Frame = 0

Query: 1   MARISTRQLFRFTHASLPLPFKSVDRSSSPFSSFPEPDLSLETTNPPRHNRSYSLLQSCQ 60
           MARI +RQLFRFT ASLP P KS+DR SSPF SF EPDLSL+T NPPRHNR +SLLQSCQ
Sbjct: 1   MARIYSRQLFRFTRASLPPPSKSIDRCSSPFCSFAEPDLSLDTINPPRHNRCHSLLQSCQ 60

Query: 61  SVRELLQIHGHLITSGRFKHHFWANRVLFQASEFGDVIYTVLVFRYINIPNTFCINRVIK 120
           S+REL+QIHGHLITSG F HHFWANRVL QASEFGD++YTVL+F+ IN+PN FCINRVIK
Sbjct: 61  SIRELVQIHGHLITSGLFNHHFWANRVLLQASEFGDIVYTVLIFKLINVPNAFCINRVIK 120

Query: 121 AYSLSIVPLEAVSLYFEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGIDS 180
           AYSLS  PLEAV +YF+WLG+GFRPD+YTFLSLF ACA+ GCG+SGRKCHGQAFKNG+D 
Sbjct: 121 AYSLSSDPLEAVFVYFQWLGDGFRPDTYTFLSLFCACASIGCGSSGRKCHGQAFKNGVDC 180

Query: 181 VMVLRNSLIHMYGCCGHIELGRKVFDEMSSWDLVSWNSIVTAYARTGDLHTAHDLFDAMP 240
           VMVLRNSLIHMYGCCG+IELGRKVFDEMS+ DLVSWNSIVTAYAR GDLHTAHD+FDAMP
Sbjct: 181 VMVLRNSLIHMYGCCGYIELGRKVFDEMSTLDLVSWNSIVTAYARVGDLHTAHDMFDAMP 240

Query: 241 ERNIVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRS 300
           ERN+VSWNLMISEYLRGG+PGCAMKLFRNM+ IGIRGN+TTMVN+LGACGRSA LNEGRS
Sbjct: 241 ERNVVSWNLMISEYLRGGSPGCAMKLFRNMMKIGIRGNSTTMVNILGACGRSASLNEGRS 300

Query: 301 VHGFMYRTSMKFCVFINTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGN 360
           VHGFMYRTS KFCVFI TALVDMY KCQRV IARR+FDRM +RNLVTWNAMVLGHCLHGN
Sbjct: 301 VHGFMYRTSTKFCVFIGTALVDMYGKCQRVCIARRLFDRMPNRNLVTWNAMVLGHCLHGN 360

Query: 361 PDDGLKLFQEMAAKLREINGEIGNGKKFKQDEGKRNVYPDQITFIGVLCACARAGLLKDA 420
           P+DGLKL++EMAAKLRE NGE G+GKKFKQDEG+R V+PDQITFIGVLCACARAGLL+DA
Sbjct: 361 PEDGLKLYEEMAAKLRERNGEAGSGKKFKQDEGERKVFPDQITFIGVLCACARAGLLEDA 420

Query: 421 NNYFNEMINVFLVRPNFAHYWCLANVYVAAGLIQQAVEILRNVPEDDEDFSSDSVVWINL 480
           NNYF+EMINVFLV+PNFAHYWCLANVYVAAGLIQQAVEILRN+ ED EDFSS+ VVW NL
Sbjct: 421 NNYFDEMINVFLVKPNFAHYWCLANVYVAAGLIQQAVEILRNMTEDIEDFSSELVVWTNL 480

Query: 481 LTMCRFVGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLG 540
           L  CRF GDVSLGEQIA YLID+EPKN+SYYRLLLNIYAVAGRWEDVSRIK+LMKEKRLG
Sbjct: 481 LATCRFRGDVSLGEQIANYLIDMEPKNESYYRLLLNIYAVAGRWEDVSRIKVLMKEKRLG 540

Query: 541 TMPGCRLVDLKEIVHRLKLGNLLQDGMKETNTVMHKLASEVSLLSSIAAGQSDFGV 597
           T PGCRLVDLKEIVHRLKLGNLLQ    ETNTVMHKLASEVSLLS+IAAGQSD  V
Sbjct: 541 TFPGCRLVDLKEIVHRLKLGNLLQ----ETNTVMHKLASEVSLLSTIAAGQSDLRV 592

BLAST of Cla97C01G019950 vs. ExPASy TrEMBL
Match: A0A1S4DTP8 (pentatricopeptide repeat-containing protein At3g51320 OS=Cucumis melo OX=3656 GN=LOC103485152 PE=4 SV=1)

HSP 1 Score: 1048.5 bits (2710), Expect = 1.1e-302
Identity = 503/575 (87.48%), Postives = 540/575 (93.91%), Query Frame = 0

Query: 1   MARISTRQLFRFTHASLPLPFKSVDRSSSPFSSFPEPDLSLETTNPPRHNRSYSLLQSCQ 60
           MARISTRQLFRFTH  LPLPFKSV RSSSPFS+FPEPD S ETTNPPRH++S+SLLQSC+
Sbjct: 1   MARISTRQLFRFTHFPLPLPFKSVGRSSSPFSAFPEPDHSPETTNPPRHDQSHSLLQSCE 60

Query: 61  SVRELLQIHGHLITSGRFKHHFWANRVLFQASEFGDVIYTVLVFRYINIPNTFCINRVIK 120
           SVREL QIHGHLITSG F +HFWANRVL QASEFGD++YT+L+FR+I +PNTFC+NRVIK
Sbjct: 61  SVRELFQIHGHLITSGLFNYHFWANRVLLQASEFGDIVYTILIFRHIKVPNTFCVNRVIK 120

Query: 121 AYSLSIVPLEAVSLYFEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGIDS 180
           AYSLS VPLEAV +YFEWLGNG RPDSYTFLSLFSACA+FGCGASGRKCHGQAFKNG+DS
Sbjct: 121 AYSLSTVPLEAVFVYFEWLGNGLRPDSYTFLSLFSACASFGCGASGRKCHGQAFKNGVDS 180

Query: 181 VMVLRNSLIHMYGCCGHIELGRKVFDEMSSWDLVSWNSIVTAYARTGDLHTAHDLFDAMP 240
           VMVL NSLIHMYGCCGHIELGRKVFDEMS+ DLVSWNSIVTAYAR GD++TAHD+FD MP
Sbjct: 181 VMVLGNSLIHMYGCCGHIELGRKVFDEMSTRDLVSWNSIVTAYARVGDMYTAHDMFDVMP 240

Query: 241 ERNIVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRS 300
           ERN+VSWNLMISEYLRGGNPGCAMKLFRNMVN+GIRGNNTTMVNVLGAC RSARLNEGRS
Sbjct: 241 ERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNVGIRGNNTTMVNVLGACSRSARLNEGRS 300

Query: 301 VHGFMYRTSMKFCVFINTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGN 360
           VHGFMYRTSMKFCVFI+TALVDMYSKCQRV IARRVFDRM+SRNLVTWNAMVLGH LHGN
Sbjct: 301 VHGFMYRTSMKFCVFIDTALVDMYSKCQRVLIARRVFDRMMSRNLVTWNAMVLGHSLHGN 360

Query: 361 PDDGLKLFQEMAAKLREINGEIGNGKKFKQDEGKRNVYPDQITFIGVLCACARAGLLKDA 420
           P DGLKLF+EMAA+LRE+  E GNGKKFKQDEGKR V+PDQITFIGVLCACARAGLLKDA
Sbjct: 361 PQDGLKLFEEMAAELREMIEETGNGKKFKQDEGKRKVFPDQITFIGVLCACARAGLLKDA 420

Query: 421 NNYFNEMINVFLVRPNFAHYWCLANVYVAAGLIQQAVEILRNVPEDDEDFSSDSVVWINL 480
            NYF+EMI VFLVRPNFAHYWCLANVYVA GLI+QAVEILRN+P   EDFSS+SVVWI+L
Sbjct: 421 KNYFDEMIKVFLVRPNFAHYWCLANVYVAVGLIEQAVEILRNMP---EDFSSESVVWIDL 480

Query: 481 LTMCRFVGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLG 540
           LT CRFVGDVSLGEQIAKYLID+EPKNDSYYRLLLN+YAVAGRWEDVSRIKLLMKEKRLG
Sbjct: 481 LTTCRFVGDVSLGEQIAKYLIDIEPKNDSYYRLLLNMYAVAGRWEDVSRIKLLMKEKRLG 540

Query: 541 TMPGCRLVDLKEIVHRLKLGNLLQDGMKETNTVMH 576
           TMPGCRLVDLKEIVH LKLGN LQ+ MKETNTV+H
Sbjct: 541 TMPGCRLVDLKEIVHDLKLGNHLQERMKETNTVIH 572

BLAST of Cla97C01G019950 vs. ExPASy TrEMBL
Match: A0A5D3CKW6 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G001040 PE=4 SV=1)

HSP 1 Score: 1048.1 bits (2709), Expect = 1.4e-302
Identity = 503/575 (87.48%), Postives = 540/575 (93.91%), Query Frame = 0

Query: 1   MARISTRQLFRFTHASLPLPFKSVDRSSSPFSSFPEPDLSLETTNPPRHNRSYSLLQSCQ 60
           MARISTRQLFRFTH  LPLPFKSV RSSSPFS+FPEPD S ETTNPPRH++S+SLLQSC+
Sbjct: 1   MARISTRQLFRFTHFPLPLPFKSVGRSSSPFSAFPEPDHSPETTNPPRHDQSHSLLQSCE 60

Query: 61  SVRELLQIHGHLITSGRFKHHFWANRVLFQASEFGDVIYTVLVFRYINIPNTFCINRVIK 120
           SVREL QIHGHLITSG F +HFWANRVL QASEFGD++YT+L+FR+I +PNTFC+NRVIK
Sbjct: 61  SVRELFQIHGHLITSGLFNYHFWANRVLLQASEFGDIVYTILIFRHIKVPNTFCVNRVIK 120

Query: 121 AYSLSIVPLEAVSLYFEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGIDS 180
           AYSLS VPLEAV +YFEWLGNG RPDSYTFLSLFSACA+FGCGASGRKCHGQAFKNG+DS
Sbjct: 121 AYSLSTVPLEAVFVYFEWLGNGLRPDSYTFLSLFSACASFGCGASGRKCHGQAFKNGVDS 180

Query: 181 VMVLRNSLIHMYGCCGHIELGRKVFDEMSSWDLVSWNSIVTAYARTGDLHTAHDLFDAMP 240
           VMVL NSLIHMYGCCGHIELGRKVFDEMS+ DLVSWNSIVTAYAR GD++TAHD+FD MP
Sbjct: 181 VMVLGNSLIHMYGCCGHIELGRKVFDEMSTRDLVSWNSIVTAYARVGDMYTAHDMFDVMP 240

Query: 241 ERNIVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRS 300
           ERN+VSWNLMISEYLRGGNPGCAMKLFRNMVN+GIRGNNTTMVNVLGAC RSARLNEGRS
Sbjct: 241 ERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNVGIRGNNTTMVNVLGACSRSARLNEGRS 300

Query: 301 VHGFMYRTSMKFCVFINTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGN 360
           VHGFMYRTSMKFCVFI+TALVDMYSKCQRV IARRVFDRM+SRNLVTWNAMVLGH LHGN
Sbjct: 301 VHGFMYRTSMKFCVFIDTALVDMYSKCQRVLIARRVFDRMMSRNLVTWNAMVLGHSLHGN 360

Query: 361 PDDGLKLFQEMAAKLREINGEIGNGKKFKQDEGKRNVYPDQITFIGVLCACARAGLLKDA 420
           P DGLKLF+EMAA+LRE+  E GNGKKFKQDEGKR V+PDQITFIGVLCACARAGLLKDA
Sbjct: 361 PKDGLKLFEEMAAELREMIEETGNGKKFKQDEGKRKVFPDQITFIGVLCACARAGLLKDA 420

Query: 421 NNYFNEMINVFLVRPNFAHYWCLANVYVAAGLIQQAVEILRNVPEDDEDFSSDSVVWINL 480
            NYF+EMI VFLVRPNFAHYWCLANVYVA GLI+QAVEILRN+P   EDFSS+SVVWI+L
Sbjct: 421 KNYFDEMIKVFLVRPNFAHYWCLANVYVAVGLIEQAVEILRNMP---EDFSSESVVWIDL 480

Query: 481 LTMCRFVGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLG 540
           LT CRFVGDVSLGEQIAKYLID+EPKNDSYYRLLLN+YAVAGRWEDVSRIKLLMKEKRLG
Sbjct: 481 LTTCRFVGDVSLGEQIAKYLIDIEPKNDSYYRLLLNMYAVAGRWEDVSRIKLLMKEKRLG 540

Query: 541 TMPGCRLVDLKEIVHRLKLGNLLQDGMKETNTVMH 576
           TMPGCRLVDLKEIVH LKLGN LQ+ MKETNTV+H
Sbjct: 541 TMPGCRLVDLKEIVHDLKLGNHLQERMKETNTVIH 572

BLAST of Cla97C01G019950 vs. ExPASy TrEMBL
Match: A0A0A0KMJ6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G507160 PE=4 SV=1)

HSP 1 Score: 1046.6 bits (2705), Expect = 4.0e-302
Identity = 500/575 (86.96%), Postives = 534/575 (92.87%), Query Frame = 0

Query: 1   MARISTRQLFRFTHASLPLPFKSVDRSSSPFSSFPEPDLSLETTNPPRHNRSYSLLQSCQ 60
           MARISTR LFRFTH  LPLPFKSVDRSSSPFSSFPEP  S +TTNPPRHN+S+SLLQSCQ
Sbjct: 1   MARISTRLLFRFTHFPLPLPFKSVDRSSSPFSSFPEPVHSPDTTNPPRHNQSHSLLQSCQ 60

Query: 61  SVRELLQIHGHLITSGRFKHHFWANRVLFQASEFGDVIYTVLVFRYINIPNTFCINRVIK 120
           SVREL Q HGHLITSG F  HFWANRVL QASEFGD++YTVL+FR+I +PNTFC+NRVIK
Sbjct: 61  SVRELFQFHGHLITSGLFNDHFWANRVLLQASEFGDIVYTVLIFRHIKVPNTFCVNRVIK 120

Query: 121 AYSLSIVPLEAVSLYFEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGIDS 180
           AYSLS VPLEAV +YFEWLGNG RPDSYTFLSLFSACA+FGCGASGRKCHGQAFKNG+DS
Sbjct: 121 AYSLSTVPLEAVFVYFEWLGNGLRPDSYTFLSLFSACASFGCGASGRKCHGQAFKNGVDS 180

Query: 181 VMVLRNSLIHMYGCCGHIELGRKVFDEMSSWDLVSWNSIVTAYARTGDLHTAHDLFDAMP 240
           VMVL NSLIHMYGCC HIELGRKVFDEMS+ DLVSWNSIVTAYAR GDL+TAHD+FD MP
Sbjct: 181 VMVLGNSLIHMYGCCKHIELGRKVFDEMSTQDLVSWNSIVTAYARVGDLYTAHDMFDVMP 240

Query: 241 ERNIVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRS 300
           ERN+VSWNLMISEYLRGGNPGCAMKLFRNMVN+GIRGNNTTMVNVL AC RSARLNEGRS
Sbjct: 241 ERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNVGIRGNNTTMVNVLSACSRSARLNEGRS 300

Query: 301 VHGFMYRTSMKFCVFINTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGN 360
           VHGFMYR SMKFCVFINTALVDMYSKC RVS+ARRVFDR++ RNLVTWNAM+LGH LHGN
Sbjct: 301 VHGFMYRASMKFCVFINTALVDMYSKCHRVSVARRVFDRLMIRNLVTWNAMILGHSLHGN 360

Query: 361 PDDGLKLFQEMAAKLREINGEIGNGKKFKQDEGKRNVYPDQITFIGVLCACARAGLLKDA 420
           P DGL+LF+EM  +LREIN E GNGKKFKQDEGKR V+PDQITFIGVLCACARAGLLKDA
Sbjct: 361 PKDGLELFEEMVGELREINEETGNGKKFKQDEGKRKVFPDQITFIGVLCACARAGLLKDA 420

Query: 421 NNYFNEMINVFLVRPNFAHYWCLANVYVAAGLIQQAVEILRNVPEDDEDFSSDSVVWINL 480
            NYF+EMINVFLVRPNF HYWCLANVYVA GLI+QAVEILRN+PED+EDFSS+SVVWI+L
Sbjct: 421 ENYFDEMINVFLVRPNFGHYWCLANVYVAVGLIEQAVEILRNMPEDNEDFSSESVVWIDL 480

Query: 481 LTMCRFVGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLG 540
           LT CRFVGDVSLGEQIAKYLID+EPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLG
Sbjct: 481 LTTCRFVGDVSLGEQIAKYLIDMEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLG 540

Query: 541 TMPGCRLVDLKEIVHRLKLGNLLQDGMKETNTVMH 576
           TM GCRLVDLKEIVH LKLGN LQ+ MKETNTV+H
Sbjct: 541 TMSGCRLVDLKEIVHSLKLGNHLQERMKETNTVIH 575

BLAST of Cla97C01G019950 vs. TAIR 10
Match: AT3G51320.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 535.0 bits (1377), Expect = 7.5e-152
Identity = 254/510 (49.80%), Postives = 345/510 (67.65%), Query Frame = 0

Query: 51  RSYSLLQSCQSVRELLQIHGHLITSGRFKHHFWANRVLFQASEFGDVIYTVLVFRYINIP 110
           + + L++   S+  L Q+H  LITSG F    WA R+L  +S FGD  YTV ++R  +I 
Sbjct: 24  KGFKLVEDSNSITHLFQVHARLITSGNFWDSSWAIRLLKSSSRFGDSSYTVSIYR--SIG 83

Query: 111 NTFCINRVIKAYSLSIVPLEAVSLYFEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCH 170
             +C N V KAY +S  P +A+  YF+ L  GF PDSYTF+SL S      C  SG+ CH
Sbjct: 84  KLYCANPVFKAYLVSSSPKQALGFYFDILRFGFVPDSYTFVSLISCIEKTCCVDSGKMCH 143

Query: 171 GQAFKNGIDSVMVLRNSLIHMYGCCGHIELGRKVFDEMSSWDLVSWNSIVTAYARTGDLH 230
           GQA K+G D V+ ++NSL+HMY CCG ++L +K+F E+   D+VSWNSI+    R GD+ 
Sbjct: 144 GQAIKHGCDQVLPVQNSLMHMYTCCGALDLAKKLFVEIPKRDIVSWNSIIAGMVRNGDVL 203

Query: 231 TAHDLFDAMPERNIVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACG 290
            AH LFD MP++NI+SWN+MIS YL   NPG ++ LFR MV  G +GN +T+V +L ACG
Sbjct: 204 AAHKLFDEMPDKNIISWNIMISAYLGANNPGVSISLFREMVRAGFQGNESTLVLLLNACG 263

Query: 291 RSARLNEGRSVHGFMYRTSMKFCVFINTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNA 350
           RSARL EGRSVH  + RT +   V I+TAL+DMY KC+ V +ARR+FD +  RN VTWN 
Sbjct: 264 RSARLKEGRSVHASLIRTFLNSSVVIDTALIDMYGKCKEVGLARRIFDSLSIRNKVTWNV 323

Query: 351 MVLGHCLHGNPDDGLKLFQEMAAKLREINGEIGNGKKFKQDEGKRNVYPDQITFIGVLCA 410
           M+L HCLHG P+ GL+LF+ M      ING +                PD++TF+GVLC 
Sbjct: 324 MILAHCLHGRPEGGLELFEAM------INGML---------------RPDEVTFVGVLCG 383

Query: 411 CARAGLLKDANNYFNEMINVFLVRPNFAHYWCLANVYVAAGLIQQAVEILRNVPEDDEDF 470
           CARAGL+    +Y++ M++ F ++PNF H WC+AN+Y +AG  ++A E L+N+P  DED 
Sbjct: 384 CARAGLVSQGQSYYSLMVDEFQIKPNFGHQWCMANLYSSAGFPEEAEEALKNLP--DEDV 443

Query: 471 SSDSVVWINLLTMCRFVGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRI 530
           + +S  W NLL+  RF G+ +LGE IAK LI+ +P N  YY LL+NIY+V GRWEDV+R+
Sbjct: 444 TPESTKWANLLSSSRFTGNPTLGESIAKSLIETDPLNYKYYHLLMNIYSVTGRWEDVNRV 503

Query: 531 KLLMKEKRLGTMPGCRLVDLKEIVHRLKLG 561
           + ++KE+++G +PGC LVDLKEIVH L+LG
Sbjct: 504 REMVKERKIGRIPGCGLVDLKEIVHGLRLG 508

BLAST of Cla97C01G019950 vs. TAIR 10
Match: AT2G42920.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 307.0 bits (785), Expect = 3.3e-83
Identity = 167/500 (33.40%), Postives = 269/500 (53.80%), Query Frame = 0

Query: 59  CQSVRELLQIHGHLITSGRFKHHFWANRVL-FQASEFGDVIYTVLVFRYINIPNTFCINR 118
           C ++REL QIH  LI +G       A+RVL F  +   D+ Y  LVF  IN  N F  N 
Sbjct: 35  CSTMRELKQIHASLIKTGLISDTVTASRVLAFCCASPSDMNYAYLVFTRINHKNPFVWNT 94

Query: 119 VIKAYSLSIVPLEAVSLYFEWL--GNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFK 178
           +I+ +S S  P  A+S++ + L      +P   T+ S+F A    G    GR+ HG   K
Sbjct: 95  IIRGFSRSSFPEMAISIFIDMLCSSPSVKPQRLTYPSVFKAYGRLGQARDGRQLHGMVIK 154

Query: 179 NGIDSVMVLRNSLIHMYGCCGHIELGRKVFDEMSSWDLVSWNSIVTAYARTGDLHTAHDL 238
            G++    +RN+++HMY  CG +    ++F  M  +D+V+WNS++  +A+ G +  A +L
Sbjct: 155 EGLEDDSFIRNTMLHMYVTCGCLIEAWRIFLGMIGFDVVAWNSMIMGFAKCGLIDQAQNL 214

Query: 239 FDAMPERNIVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARL 298
           FD MP+RN VSWN MIS ++R G    A+ +FR M    ++ +  TMV++L AC      
Sbjct: 215 FDEMPQRNGVSWNSMISGFVRNGRFKDALDMFREMQEKDVKPDGFTMVSLLNACAYLGAS 274

Query: 299 NEGRSVHGFMYRTSMKFCVFINTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGH 358
            +GR +H ++ R   +    + TAL+DMY KC  +     VF+    + L  WN+M+LG 
Sbjct: 275 EQGRWIHEYIVRNRFELNSIVVTALIDMYCKCGCIEEGLNVFECAPKKQLSCWNSMILGL 334

Query: 359 CLHGNPDDGLKLFQEMAAKLREINGEIGNGKKFKQDEGKRNVYPDQITFIGVLCACARAG 418
             +G  +  + LF E+                      +  + PD ++FIGVL ACA +G
Sbjct: 335 ANNGFEERAMDLFSELE---------------------RSGLEPDSVSFIGVLTACAHSG 394

Query: 419 LLKDANNYFNEMINVFLVRPNFAHYWCLANVYVAAGLIQQAVEILRNVPEDDEDFSSDSV 478
            +  A+ +F  M   +++ P+  HY  + NV   AGL+++A  +++N+P ++     D+V
Sbjct: 395 EVHRADEFFRLMKEKYMIEPSIKHYTLMVNVLGGAGLLEEAEALIKNMPVEE-----DTV 454

Query: 479 VWINLLTMCRFVGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMK 538
           +W +LL+ CR +G+V + ++ AK L  L+P     Y LL N YA  G +E+    +LLMK
Sbjct: 455 IWSSLLSACRKIGNVEMAKRAAKCLKKLDPDETCGYVLLSNAYASYGLFEEAVEQRLLMK 508

Query: 539 EKRLGTMPGCRLVDLKEIVH 556
           E+++    GC  +++   VH
Sbjct: 515 ERQMEKEVGCSSIEVDFEVH 508

BLAST of Cla97C01G019950 vs. TAIR 10
Match: AT1G74630.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 303.5 bits (776), Expect = 3.7e-82
Identity = 180/572 (31.47%), Postives = 290/572 (50.70%), Query Frame = 0

Query: 54  SLLQSCQSVRELLQIHGHLITSG-RFKHHFWANRVLFQASEFGDVI-YTVLVFRYINIPN 113
           SLL SC+++R L QIHG  I  G     +F    +L  A    D + Y   +      P+
Sbjct: 10  SLLNSCKNLRALTQIHGLFIKYGVDTDSYFTGKLILHCAISISDALPYARRLLLCFPEPD 69

Query: 114 TFCINRVIKAYSLSIVPLEAVSLYFEWLGNGF-RPDSYTFLSLFSACANFGCGASGRKCH 173
            F  N +++ YS S  P  +V+++ E +  GF  PDS++F  +  A  NF    +G + H
Sbjct: 70  AFMFNTLVRGYSESDEPHNSVAVFVEMMRKGFVFPDSFSFAFVIKAVENFRSLRTGFQMH 129

Query: 174 GQAFKNGIDSVMVLRNSLIHMYGCCGHIELGRKVFDEMSSWDLVSWNSIVTAYARTGDLH 233
            QA K+G++S + +  +LI MYG CG +E  RKVFDEM   +LV+WN+++TA  R  D+ 
Sbjct: 130 CQALKHGLESHLFVGTTLIGMYGGCGCVEFARKVFDEMHQPNLVAWNAVITACFRGNDVA 189

Query: 234 TAHDLFDAMPERNIVSWNLMISEYLRGGNPGCAMKL------------------------ 293
            A ++FD M  RN  SWN+M++ Y++ G    A ++                        
Sbjct: 190 GAREIFDKMLVRNHTSWNVMLAGYIKAGELESAKRIFSEMPHRDDVSWSTMIVGIAHNGS 249

Query: 294 -------FRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRSVHGFMYRTSMKFCVFINTA 353
                  FR +   G+  N  ++  VL AC +S     G+ +HGF+ +    + V +N A
Sbjct: 250 FNESFLYFRELQRAGMSPNEVSLTGVLSACSQSGSFEFGKILHGFVEKAGYSWIVSVNNA 309

Query: 354 LVDMYSKCQRVSIARRVFDRML-SRNLVTWNAMVLGHCLHGNPDDGLKLFQEMAAKLREI 413
           L+DMYS+C  V +AR VF+ M   R +V+W +M+ G  +HG  ++ ++LF EM A     
Sbjct: 310 LIDMYSRCGNVPMARLVFEGMQEKRCIVSWTSMIAGLAMHGQGEEAVRLFNEMTA----- 369

Query: 414 NGEIGNGKKFKQDEGKRNVYPDQITFIGVLCACARAGLLKDANNYFNEMINVFLVRPNFA 473
                             V PD I+FI +L AC+ AGL+++  +YF+EM  V+ + P   
Sbjct: 370 ----------------YGVTPDGISFISLLHACSHAGLIEEGEDYFSEMKRVYHIEPEIE 429

Query: 474 HYWCLANVYVAAGLIQQAVEILRNVPEDDEDFSSDSVVWINLLTMCRFVGDVSLGEQIAK 533
           HY C+ ++Y  +G +Q+A + +  +P         ++VW  LL  C   G++ L EQ+ +
Sbjct: 430 HYGCMVDLYGRSGKLQKAYDFICQMP-----IPPTAIVWRTLLGACSSHGNIELAEQVKQ 489

Query: 534 YLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLGTMPGCRLVDLKEIVHRLK 591
            L +L+P N     LL N YA AG+W+DV+ I+  M  +R+       LV++ + +++  
Sbjct: 490 RLNELDPNNSGDLVLLSNAYATAGKWKDVASIRKSMIVQRIKKTTAWSLVEVGKTMYKFT 549

BLAST of Cla97C01G019950 vs. TAIR 10
Match: AT4G37380.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 300.8 bits (769), Expect = 2.4e-81
Identity = 184/564 (32.62%), Postives = 292/564 (51.77%), Query Frame = 0

Query: 27  SSSPF--SSFPEPDLSLETT---NPPRHNRSYSLLQSCQSVRELLQIHGHLITSGRFKHH 86
           +SSP   +S P+  LS   T     P   +   L+   QSV E+LQIH  ++      H 
Sbjct: 2   ASSPLLATSLPQNQLSTTATARFRLPPPEKLAVLIDKSQSVDEVLQIHAAILRHNLLLHP 61

Query: 87  FW--ANRVLFQA-SEFGDVIYTVLVFRYINIPNTFCINRVIKAYSLSIVPLEAVSLYFEW 146
            +   N  L +A +  G + +++ +F     P+ F     I   S++ +  +A  LY + 
Sbjct: 62  RYPVLNLKLHRAYASHGKIRHSLALFHQTIDPDLFLFTAAINTASINGLKDQAFLLYVQL 121

Query: 147 LGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGIDSVMVLRNSLIHMYGCCGHI 206
           L +   P+ +TF SL  +C+      SG+  H    K G+     +   L+ +Y   G +
Sbjct: 122 LSSEINPNEFTFSSLLKSCST----KSGKLIHTHVLKFGLGIDPYVATGLVDVYAKGGDV 181

Query: 207 ELGRKVFDEMSSWDLVSWNSIVTAYARTGDLHTAHDLFDAMPERNIVSWNLMISEYLRGG 266
              +KVFD M    LVS  +++T YA+ G++  A  LFD+M ER+IVSWN+MI  Y + G
Sbjct: 182 VSAQKVFDRMPERSLVSSTAMITCYAKQGNVEAARALFDSMCERDIVSWNVMIDGYAQHG 241

Query: 267 NPGCAMKLFRNMVNIG-IRGNNTTMVNVLGACGRSARLNEGRSVHGFMYRTSMKFCVFIN 326
            P  A+ LF+ ++  G  + +  T+V  L AC +   L  GR +H F+  + ++  V + 
Sbjct: 242 FPNDALMLFQKLLAEGKPKPDEITVVAALSACSQIGALETGRWIHVFVKSSRIRLNVKVC 301

Query: 327 TALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGNPDDGLKLFQEMAAKLRE 386
           T L+DMYSKC  +  A  VF+    +++V WNAM+ G+ +HG   D L+LF EM      
Sbjct: 302 TGLIDMYSKCGSLEEAVLVFNDTPRKDIVAWNAMIAGYAMHGYSQDALRLFNEM------ 361

Query: 387 INGEIGNGKKFKQDEGKRNVYPDQITFIGVLCACARAGLLKDANNYFNEMINVFLVRPNF 446
                         +G   + P  ITFIG L ACA AGL+ +    F  M   + ++P  
Sbjct: 362 --------------QGITGLQPTDITFIGTLQACAHAGLVNEGIRIFESMGQEYGIKPKI 421

Query: 447 AHYWCLANVYVAAGLIQQAVEILRNVPEDDEDFSSDSVVWINLLTMCRFVGDVSLGEQIA 506
            HY CL ++   AG +++A E ++N+  D     +DSV+W ++L  C+  GD  LG++IA
Sbjct: 422 EHYGCLVSLLGRAGQLKRAYETIKNMNMD-----ADSVLWSSVLGSCKLHGDFVLGKEIA 481

Query: 507 KYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLGTMPGCRLVDLKEIVHRL 566
           +YLI L  KN   Y LL NIYA  G +E V++++ LMKEK +   PG   ++++  VH  
Sbjct: 482 EYLIGLNIKNSGIYVLLSNIYASVGDYEGVAKVRNLMKEKGIVKEPGISTIEIENKVHEF 536

Query: 567 KLGNLLQDGMKETNTVMHKLASEV 582
           + G+      KE  T++ K++  +
Sbjct: 542 RAGDREHSKSKEIYTMLRKISERI 536

BLAST of Cla97C01G019950 vs. TAIR 10
Match: AT5G48910.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 292.4 bits (747), Expect = 8.5e-79
Identity = 174/577 (30.16%), Postives = 293/577 (50.78%), Query Frame = 0

Query: 29  SPFSSFPEPDLSLETTNPPRHNRS-YSLLQSCQSVRELLQIHGHLITSGRFKHHFWANRV 88
           +P  +   P  +   ++P  H  S +  + +C+++R+L QIH   I SG+ +    A  +
Sbjct: 2   NPTQTLFSPGGNSPASSPASHPSSLFPQINNCRTIRDLSQIHAVFIKSGQMRDTLAAAEI 61

Query: 89  L-FQAS---EFGDVIYTVLVFRYINIPNTFCINRVIKAYSLS--IVPLEAVSLYFEWLGN 148
           L F A+      D+ Y   +F  +   N F  N +I+ +S S     L A++L++E + +
Sbjct: 62  LRFCATSDLHHRDLDYAHKIFNQMPQRNCFSWNTIIRGFSESDEDKALIAITLFYEMMSD 121

Query: 149 GF-RPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGIDSVMVLRNSLIHMYGCCGHIEL 208
            F  P+ +TF S+  ACA  G    G++ HG A K G      + ++L+ MY  CG ++ 
Sbjct: 122 EFVEPNRFTFPSVLKACAKTGKIQEGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFMKD 181

Query: 209 GRKVF--------------DEMSSWDLVSWNSIVTAYARTGDLHTAHDLFDAMPERNIVS 268
            R +F                    ++V WN ++  Y R GD   A  LFD M +R++VS
Sbjct: 182 ARVLFYKNIIEKDMVVMTDRRKRDGEIVLWNVMIDGYMRLGDCKAARMLFDKMRQRSVVS 241

Query: 269 WNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRSVHGFMY 328
           WN MIS Y   G    A+++FR M    IR N  T+V+VL A  R   L  G  +H +  
Sbjct: 242 WNTMISGYSLNGFFKDAVEVFREMKKGDIRPNYVTLVSVLPAISRLGSLELGEWLHLYAE 301

Query: 329 RTSMKFCVFINTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGNPDDGLK 388
            + ++    + +AL+DMYSKC  +  A  VF+R+   N++TW+AM+ G  +HG   D + 
Sbjct: 302 DSGIRIDDVLGSALIDMYSKCGIIEKAIHVFERLPRENVITWSAMINGFAIHGQAGDAID 361

Query: 389 LFQEMAAKLREINGEIGNGKKFKQDEGKRNVYPDQITFIGVLCACARAGLLKDANNYFNE 448
            F +M                      +  V P  + +I +L AC+  GL+++   YF++
Sbjct: 362 CFCKMR---------------------QAGVRPSDVAYINLLTACSHGGLVEEGRRYFSQ 421

Query: 449 MINVFLVRPNFAHYWCLANVYVAAGLIQQAVEILRNVPEDDEDFSSDSVVWINLLTMCRF 508
           M++V  + P   HY C+ ++   +GL+ +A E + N+P        D V+W  LL  CR 
Sbjct: 422 MVSVDGLEPRIEHYGCMVDLLGRSGLLDEAEEFILNMP-----IKPDDVIWKALLGACRM 481

Query: 509 VGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLGTMPGCR 568
            G+V +G+++A  L+D+ P +   Y  L N+YA  G W +VS ++L MKEK +   PGC 
Sbjct: 482 QGNVEMGKRVANILMDMVPHDSGAYVALSNMYASQGNWSEVSEMRLRMKEKDIRKDPGCS 541

Query: 569 LVDLKEIVHRLKLGNLLQDGMKETNTVMHKLASEVSL 584
           L+D+  ++H   + +      KE N+++ +++ ++ L
Sbjct: 542 LIDIDGVLHEFVVEDDSHPKAKEINSMLVEISDKLRL 552

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038882774.10.0e+0091.44pentatricopeptide repeat-containing protein At3g51320 [Benincasa hispida][more]
XP_023544620.14.6e-31085.91pentatricopeptide repeat-containing protein At3g51320 isoform X1 [Cucurbita pepo... [more]
XP_022950690.11.0e-30785.57pentatricopeptide repeat-containing protein At3g51320 isoform X1 [Cucurbita mosc... [more]
KAG6603895.13.8e-30785.57Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub... [more]
XP_022978359.14.4e-30384.90pentatricopeptide repeat-containing protein At3g51320 isoform X1 [Cucurbita maxi... [more]
Match NameE-valueIdentityDescription
Q0WVU01.1e-15049.80Pentatricopeptide repeat-containing protein At3g51320 OS=Arabidopsis thaliana OX... [more]
Q9SJG64.7e-8233.40Pentatricopeptide repeat-containing protein At2g42920, chloroplastic OS=Arabidop... [more]
Q9CA545.2e-8131.47Pentatricopeptide repeat-containing protein At1g74630 OS=Arabidopsis thaliana OX... [more]
Q9SZT83.4e-8032.62Pentatricopeptide repeat-containing protein ELI1, chloroplastic OS=Arabidopsis t... [more]
Q9FI801.2e-7730.16Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A6J1GGG44.9e-30885.57pentatricopeptide repeat-containing protein At3g51320 isoform X1 OS=Cucurbita mo... [more]
A0A6J1ITV72.1e-30384.90pentatricopeptide repeat-containing protein At3g51320 isoform X1 OS=Cucurbita ma... [more]
A0A1S4DTP81.1e-30287.48pentatricopeptide repeat-containing protein At3g51320 OS=Cucumis melo OX=3656 GN... [more]
A0A5D3CKW61.4e-30287.48Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A0A0KMJ64.0e-30286.96Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G507160 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT3G51320.17.5e-15249.80Pentatricopeptide repeat (PPR) superfamily protein [more]
AT2G42920.13.3e-8333.40Pentatricopeptide repeat (PPR-like) superfamily protein [more]
AT1G74630.13.7e-8231.47Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G37380.12.4e-8132.62Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT5G48910.18.5e-7930.16Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 346..371
e-value: 4.4E-6
score: 26.6
coord: 214..244
e-value: 7.6E-6
score: 25.8
coord: 245..275
e-value: 1.2E-4
score: 22.1
coord: 186..210
e-value: 0.0039
score: 17.3
coord: 318..344
e-value: 0.11
score: 12.8
coord: 403..428
e-value: 0.065
score: 13.5
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 214..245
e-value: 8.8E-6
score: 23.6
coord: 245..271
e-value: 0.0028
score: 15.7
coord: 346..371
e-value: 3.3E-5
score: 21.8
coord: 186..213
e-value: 7.3E-4
score: 17.5
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 212..246
score: 11.334042
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 344..374
score: 10.183105
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 381..552
e-value: 3.4E-17
score: 64.8
coord: 110..263
e-value: 8.1E-27
score: 96.4
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 276..380
e-value: 9.1E-19
score: 69.5
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 412..526
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 25..48
NoneNo IPR availablePANTHERPTHR47928REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATEDcoord: 54..566
NoneNo IPR availablePANTHERPTHR47928:SF120OS05G0107000 PROTEINcoord: 54..566

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C01G019950.1Cla97C01G019950.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding