Cla97C01G026040 (gene) Watermelon (97103) v2

NameCla97C01G026040
Typegene
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionPentatricopeptide repeat
LocationCla97Chr01 : 36786492 .. 36787802 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCTCGTTAATCTCTCCCATCAATTGCTTAAACCTGCTTGCCGTCGATTTGCTTTCTCCGCCAAAGGTTTCAACTCTTGGGCTTTACGTATTCGAAATGCTCCCTCCCTCCATAGAGCACTTGCTATCTACTCCCAGATGCACCGCCAATCCGTTCCCCACGACAGCTTCTCAATTCTATTTATGCTCAAAGCCTGCGCTCCTTCCAACAATCTCTCCACTCTTCACCATCTCCATGCCCATATTACTAAACTTGGTTTCACTACTCATGTCTTTGTCGCTACATCCCTACTCTATGCATACGTCCTTAACTCCTTTCAACTTGCTTGCTTGGTGTTCGATGAAATGCCCCACAGAAACACTGTTACCTGGAATACTATGATTTTGGGGTATTCCAAGACGGGGGATGTAGATAGAGCTCGCCAGCTGTTTGATCAAATGCCCTCAAGAGACTTGGTATCTTGGTCCACCATGATTGCTGCCTACATTAACAATCGCAATTACAGGGGTGGTTTGCTTCTTTTCCAAGATATGATAGCTGACGGTATAAATCCTGACCAGATGGCGGCAGGATCAATCTTAAATGGGTGTGCTCATATGGGCTCTTTAGGATTGTTGGCTGGAAAATCGGTTCATGGTTTTGTGGTCAAGAATAGGTGGGAACTCAACCTGGAGCTTGGTACAGTTTTGGTTGATATGTACGCCAAGTGTGGATTTTTGAAGTATGCTTGCCAGATATTTCTTTTGATGTCTGAAAAGAATGTCAAGACTTGGACTGCTCTGATATGTGGTTTGGCCCAGCATGGCTACTGCAAGGACGCATTAGTTTTATTTGAGACGATGACACATGAAGGTGTGGAACCGAATGAATTGACTTTTACTGGGGTTTTAAGTGCATGTGCCCATGCAGGGTTTGTTCAAGAAGGTCGCAAATACTTTAACATGATTGAAGAATATGGATTAGAAATAAGGATTCAACATTATGGTTGCATGGTTGATTTGTTGGGCAAGTCGGGATTGTTGGAGGAAGCCTATGGGGTTATTAAGACTATGAGACTTGAACCTAATGTCACTGTGTGGAGCTCTCTTTTGTCGGCCTGTAAGCAACAGAAAAGCTTTGACATGGCCGAGAGAGTCATTGAGCAGATATTGGAAAAGATAGAACCCAAGAATCATGGTGGAATTTACTCTCTTATATCTGATTTGTATGTTCTAGAGGAGAAGTGGGATGATGCAGAAAAGATAAGGAATTTACTGAGTCAAAATGTGCCGAAGGTTAGGGCCTATAGCCTTATCAGAAGTGGATTATAA

mRNA sequence

ATGCTCGTTAATCTCTCCCATCAATTGCTTAAACCTGCTTGCCGTCGATTTGCTTTCTCCGCCAAAGGTTTCAACTCTTGGGCTTTACGTATTCGAAATGCTCCCTCCCTCCATAGAGCACTTGCTATCTACTCCCAGATGCACCGCCAATCCGTTCCCCACGACAGCTTCTCAATTCTATTTATGCTCAAAGCCTGCGCTCCTTCCAACAATCTCTCCACTCTTCACCATCTCCATGCCCATATTACTAAACTTGGTTTCACTACTCATGTCTTTGTCGCTACATCCCTACTCTATGCATACGTCCTTAACTCCTTTCAACTTGCTTGCTTGGTGTTCGATGAAATGCCCCACAGAAACACTGTTACCTGGAATACTATGATTTTGGGGTATTCCAAGACGGGGGATGTAGATAGAGCTCGCCAGCTGTTTGATCAAATGCCCTCAAGAGACTTGGTATCTTGGTCCACCATGATTGCTGCCTACATTAACAATCGCAATTACAGGGGTGGTTTGCTTCTTTTCCAAGATATGATAGCTGACGGTATAAATCCTGACCAGATGGCGGCAGGATCAATCTTAAATGGGTGTGCTCATATGGGCTCTTTAGGATTGTTGGCTGGAAAATCGGTTCATGGTTTTGTGGTCAAGAATAGGTGGGAACTCAACCTGGAGCTTGGTACAGTTTTGGTTGATATGTACGCCAAGTGTGGATTTTTGAAGTATGCTTGCCAGATATTTCTTTTGATGTCTGAAAAGAATGTCAAGACTTGGACTGCTCTGATATGTGGTTTGGCCCAGCATGGCTACTGCAAGGACGCATTAGTTTTATTTGAGACGATGACACATGAAGGTGTGGAACCGAATGAATTGACTTTTACTGGGGTTTTAAGTGCATGTGCCCATGCAGGGTTTGTTCAAGAAGGTCGCAAATACTTTAACATGATTGAAGAATATGGATTAGAAATAAGGATTCAACATTATGGTTGCATGGTTGATTTGTTGGGCAAGTCGGGATTGTTGGAGGAAGCCTATGGGGTTATTAAGACTATGAGACTTGAACCTAATGTCACTGTGTGGAGCTCTCTTTTGTCGGCCTGTAAGCAACAGAAAAGCTTTGACATGGCCGAGAGAGTCATTGAGCAGATATTGGAAAAGATAGAACCCAAGAATCATGGTGGAATTTACTCTCTTATATCTGATTTGTATGTTCTAGAGGAGAAGTGGGATGATGCAGAAAAGATAAGGAATTTACTGAGTCAAAATGTGCCGAAGGTTAGGGCCTATAGCCTTATCAGAAGTGGATTATAA

Coding sequence (CDS)

ATGCTCGTTAATCTCTCCCATCAATTGCTTAAACCTGCTTGCCGTCGATTTGCTTTCTCCGCCAAAGGTTTCAACTCTTGGGCTTTACGTATTCGAAATGCTCCCTCCCTCCATAGAGCACTTGCTATCTACTCCCAGATGCACCGCCAATCCGTTCCCCACGACAGCTTCTCAATTCTATTTATGCTCAAAGCCTGCGCTCCTTCCAACAATCTCTCCACTCTTCACCATCTCCATGCCCATATTACTAAACTTGGTTTCACTACTCATGTCTTTGTCGCTACATCCCTACTCTATGCATACGTCCTTAACTCCTTTCAACTTGCTTGCTTGGTGTTCGATGAAATGCCCCACAGAAACACTGTTACCTGGAATACTATGATTTTGGGGTATTCCAAGACGGGGGATGTAGATAGAGCTCGCCAGCTGTTTGATCAAATGCCCTCAAGAGACTTGGTATCTTGGTCCACCATGATTGCTGCCTACATTAACAATCGCAATTACAGGGGTGGTTTGCTTCTTTTCCAAGATATGATAGCTGACGGTATAAATCCTGACCAGATGGCGGCAGGATCAATCTTAAATGGGTGTGCTCATATGGGCTCTTTAGGATTGTTGGCTGGAAAATCGGTTCATGGTTTTGTGGTCAAGAATAGGTGGGAACTCAACCTGGAGCTTGGTACAGTTTTGGTTGATATGTACGCCAAGTGTGGATTTTTGAAGTATGCTTGCCAGATATTTCTTTTGATGTCTGAAAAGAATGTCAAGACTTGGACTGCTCTGATATGTGGTTTGGCCCAGCATGGCTACTGCAAGGACGCATTAGTTTTATTTGAGACGATGACACATGAAGGTGTGGAACCGAATGAATTGACTTTTACTGGGGTTTTAAGTGCATGTGCCCATGCAGGGTTTGTTCAAGAAGGTCGCAAATACTTTAACATGATTGAAGAATATGGATTAGAAATAAGGATTCAACATTATGGTTGCATGGTTGATTTGTTGGGCAAGTCGGGATTGTTGGAGGAAGCCTATGGGGTTATTAAGACTATGAGACTTGAACCTAATGTCACTGTGTGGAGCTCTCTTTTGTCGGCCTGTAAGCAACAGAAAAGCTTTGACATGGCCGAGAGAGTCATTGAGCAGATATTGGAAAAGATAGAACCCAAGAATCATGGTGGAATTTACTCTCTTATATCTGATTTGTATGTTCTAGAGGAGAAGTGGGATGATGCAGAAAAGATAAGGAATTTACTGAGTCAAAATGTGCCGAAGGTTAGGGCCTATAGCCTTATCAGAAGTGGATTATAA

Protein sequence

MLVNLSHQLLKPACRRFAFSAKGFNSWALRIRNAPSLHRALAIYSQMHRQSVPHDSFSILFMLKACAPSNNLSTLHHLHAHITKLGFTTHVFVATSLLYAYVLNSFQLACLVFDEMPHRNTVTWNTMILGYSKTGDVDRARQLFDQMPSRDLVSWSTMIAAYINNRNYRGGLLLFQDMIADGINPDQMAAGSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFLKYACQIFLLMSEKNVKTWTALICGLAQHGYCKDALVLFETMTHEGVEPNELTFTGVLSACAHAGFVQEGRKYFNMIEEYGLEIRIQHYGCMVDLLGKSGLLEEAYGVIKTMRLEPNVTVWSSLLSACKQQKSFDMAERVIEQILEKIEPKNHGGIYSLISDLYVLEEKWDDAEKIRNLLSQNVPKVRAYSLIRSGL
BLAST of Cla97C01G026040 vs. NCBI nr
Match: XP_023544798.1 (pentatricopeptide repeat-containing protein At5g66520-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 701.4 bits (1809), Expect = 1.9e-198
Identity = 378/436 (86.70%), Postives = 398/436 (91.28%), Query Frame = 0

Query: 1   MLVNLSHQLLKPACRRFAFSAKGFNSWALRIRNAPSLHRALAIYSQMHRQSVPHDSFSIL 60
           ML++ S  LL PA R+FA  AKGFNSWALRIRNAPSLH+ALAIYSQMHRQSVPHDSFSIL
Sbjct: 1   MLLSRSSHLLNPARRQFALFAKGFNSWALRIRNAPSLHKALAIYSQMHRQSVPHDSFSIL 60

Query: 61  FMLKACAPSNNLSTLHHLHAHITKLGFTTHVFVATSLLYAYVLNSFQLACLVFDEMPHRN 120
           F+LKACA SNNLS LHHLHAHITKLGFTTHVFVATSLLYAYVLNSF+LACL+FDEMPH+N
Sbjct: 61  FVLKACARSNNLSILHHLHAHITKLGFTTHVFVATSLLYAYVLNSFELACLLFDEMPHKN 120

Query: 121 TVTWNTMILGYSKTGDVDRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIA 180
           TVTWNTMI GYSKTGDVDRXXXXXXXXXXXXXXXXXXXXXXXXXXXX           I 
Sbjct: 121 TVTWNTMIFGYSKTGDVDRXXXXXXXXXXXXXXXXXXXXXXXXXXXXYRGGLLLFQDMIV 180

Query: 181 DGINPDQMAAGSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFL 240
            GI PDQMA GSIL GCA+MGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGF 
Sbjct: 181 IGITPDQMAVGSILKGCAYMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFF 240

Query: 241 KYACQIFLLMSEKNVKTWTALICGLAQHGYCKDALVLFETMTHEGVEPNELTFTGVLSAC 300
           KYACQ+F LMSEKNV+TWTALICGLAQHGYCK+AL LFE M +E VEPNELTFTG+LSAC
Sbjct: 241 KYACQVFHLMSEKNVRTWTALICGLAQHGYCKEALDLFEMMRNECVEPNELTFTGILSAC 300

Query: 301 AHAGFVQEGRKYFNMIEEYGLEIRIQHYGCMVDLLGKSGLLEEAYGVIKTMRLEPNVTVW 360
            HAGFVQEGRKYFNMIEEYGLE RIQHYGCMVDLLG+SGLLEEAYGVIK MRLEPN+ VW
Sbjct: 301 VHAGFVQEGRKYFNMIEEYGLETRIQHYGCMVDLLGRSGLLEEAYGVIKNMRLEPNIIVW 360

Query: 361 SSLLSACKQQKSFDMAERVIEQILEKIEPKNHGGIYSLISDLYVLEEKWDDAEKIRNLLS 420
           SSLLSACKQ KSFDMAERVIEQIL+K+EP+NHGGIYSLISDLYVLEEKWDDAEKIRNLL+
Sbjct: 361 SSLLSACKQHKSFDMAERVIEQILDKLEPENHGGIYSLISDLYVLEEKWDDAEKIRNLLN 420

Query: 421 QNVPKVRAYSLIRSGL 437
           QNV KVRAYSLIRSGL
Sbjct: 421 QNVRKVRAYSLIRSGL 436

BLAST of Cla97C01G026040 vs. NCBI nr
Match: XP_022925591.1 (pentatricopeptide repeat-containing protein At5g66520-like [Cucurbita moschata])

HSP 1 Score: 696.8 bits (1797), Expect = 4.6e-197
Identity = 376/436 (86.24%), Postives = 397/436 (91.06%), Query Frame = 0

Query: 1   MLVNLSHQLLKPACRRFAFSAKGFNSWALRIRNAPSLHRALAIYSQMHRQSVPHDSFSIL 60
           ML++ S  LL PA R+FA  AKGFNSWALRIRNAPSL +ALAIYSQMHRQSVPHDSFSIL
Sbjct: 1   MLLSRSSHLLNPARRQFALFAKGFNSWALRIRNAPSLQKALAIYSQMHRQSVPHDSFSIL 60

Query: 61  FMLKACAPSNNLSTLHHLHAHITKLGFTTHVFVATSLLYAYVLNSFQLACLVFDEMPHRN 120
           F+LKACA SNNLS LHHLHAHITKLGFTTHVFVATSLLYAYVLNSF+LACL+FDEMPH+N
Sbjct: 61  FVLKACARSNNLSILHHLHAHITKLGFTTHVFVATSLLYAYVLNSFELACLLFDEMPHKN 120

Query: 121 TVTWNTMILGYSKTGDVDRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIA 180
           TVTWNTMI GYSKTGDVDRXXXXXXXXXXXXXXXXXXXXXXXXXXXX           I 
Sbjct: 121 TVTWNTMIFGYSKTGDVDRXXXXXXXXXXXXXXXXXXXXXXXXXXXXYRGGLLLFQDMIV 180

Query: 181 DGINPDQMAAGSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFL 240
            GI PDQMA GSIL GCA+MGSLGLLAGKSVHGFVVKNRW+LNLELGTVLVDMYAKCGF 
Sbjct: 181 IGITPDQMAVGSILKGCAYMGSLGLLAGKSVHGFVVKNRWKLNLELGTVLVDMYAKCGFF 240

Query: 241 KYACQIFLLMSEKNVKTWTALICGLAQHGYCKDALVLFETMTHEGVEPNELTFTGVLSAC 300
           KYACQ+F LMSEKNV+TWTALICGLAQHGYCK+AL LFE M +E VEPNELTFTG+LSAC
Sbjct: 241 KYACQVFHLMSEKNVRTWTALICGLAQHGYCKEALDLFEMMRNECVEPNELTFTGILSAC 300

Query: 301 AHAGFVQEGRKYFNMIEEYGLEIRIQHYGCMVDLLGKSGLLEEAYGVIKTMRLEPNVTVW 360
            HAGFVQEGRKYFNMIEEYGLE RIQHYGCMVDLLG+SGLLEEAYGVIK MRLEPN+ VW
Sbjct: 301 VHAGFVQEGRKYFNMIEEYGLETRIQHYGCMVDLLGRSGLLEEAYGVIKNMRLEPNIIVW 360

Query: 361 SSLLSACKQQKSFDMAERVIEQILEKIEPKNHGGIYSLISDLYVLEEKWDDAEKIRNLLS 420
           SSLLSACKQ KSFDMAERVIEQIL+K+EP+NHGGIYSLISDLYVLEEKWDDAEKIRNLL+
Sbjct: 361 SSLLSACKQHKSFDMAERVIEQILDKLEPENHGGIYSLISDLYVLEEKWDDAEKIRNLLN 420

Query: 421 QNVPKVRAYSLIRSGL 437
           QNV KVRAYSLIRSGL
Sbjct: 421 QNVRKVRAYSLIRSGL 436

BLAST of Cla97C01G026040 vs. NCBI nr
Match: XP_008439426.1 (PREDICTED: pentatricopeptide repeat-containing protein At5g66520-like [Cucumis melo])

HSP 1 Score: 688.7 bits (1776), Expect = 1.3e-194
Identity = 385/436 (88.30%), Postives = 405/436 (92.89%), Query Frame = 0

Query: 1   MLVNLSHQLLKPACRRFAFSAKGFNSWALRIRNAPSLHRALAIYSQMHRQSVPHDSFSIL 60
           MLVNL   LL P+CR+FAFSAKGFNSWALRIRNAPSLH+ALAI+SQMHRQSVPHDSFSIL
Sbjct: 1   MLVNL---LLNPSCRQFAFSAKGFNSWALRIRNAPSLHKALAIFSQMHRQSVPHDSFSIL 60

Query: 61  FMLKACAPSNNLSTLHHLHAHITKLGFTTHVFVATSLLYAYVLNSFQLACLVFDEMPHRN 120
           FMLKACA SNNLS LHHLHAHITKLGFTTHVFVATSLL++YVL+SFQLA LVFDEMPH+N
Sbjct: 61  FMLKACASSNNLSILHHLHAHITKLGFTTHVFVATSLLHSYVLHSFQLARLVFDEMPHKN 120

Query: 121 TVTWNTMILGYSKTGDVDRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIA 180
           +VTWNTMI GYSKTGDV  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXI 
Sbjct: 121 SVTWNTMISGYSKTGDVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXII 180

Query: 181 DGINPDQMAAGSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFL 240
           +GINPDQMAAGSILNGCAHMGSLG LAGKSVHGFVVKNRWELNLELGTVLV MYAKCG L
Sbjct: 181 NGINPDQMAAGSILNGCAHMGSLGSLAGKSVHGFVVKNRWELNLELGTVLVHMYAKCGLL 240

Query: 241 KYACQIFLLMSEKNVKTWTALICGLAQHGYCKDALVLFETMTHEGVEPNELTFTGVLSAC 300
           KYACQIF LMSE+NV+TWTALICGLA HG CK+AL LFETM HEGVEPNELTFTGVLSAC
Sbjct: 241 KYACQIFHLMSERNVRTWTALICGLAHHGCCKEALALFETMRHEGVEPNELTFTGVLSAC 300

Query: 301 AHAGFVQEGRKYFNMIEEYGLEIRIQHYGCMVDLLGKSGLLEEAYGVIKTMRLEPNVTVW 360
            HAG VQEGRKYFNMIEEYGLEIRIQHYGC VDLLG+SGLLEEAYGVIK+MR EPNV VW
Sbjct: 301 VHAGLVQEGRKYFNMIEEYGLEIRIQHYGCFVDLLGRSGLLEEAYGVIKSMRFEPNVIVW 360

Query: 361 SSLLSACKQQKSFDMAERVIEQILEKIEPKNHGGIYSLISDLYVLEEKWDDAEKIRNLLS 420
           SSLLSACKQ KSFD+AERVIE ILEK EP NHGG+YSL+SDLYVL+EKWDDAE IRNLL+
Sbjct: 361 SSLLSACKQHKSFDLAERVIEHILEKTEPNNHGGVYSLVSDLYVLQEKWDDAENIRNLLN 420

Query: 421 QNVPKVRAYSLIRSGL 437
           Q V KVRAYSLIRSGL
Sbjct: 421 QKVRKVRAYSLIRSGL 433

BLAST of Cla97C01G026040 vs. NCBI nr
Match: XP_011658370.1 (PREDICTED: pentatricopeptide repeat-containing protein At5g66520-like [Cucumis sativus] >KGN49540.1 hypothetical protein Csa_6G538690 [Cucumis sativus])

HSP 1 Score: 676.8 bits (1745), Expect = 5.0e-191
Identity = 381/436 (87.39%), Postives = 400/436 (91.74%), Query Frame = 0

Query: 1   MLVNLSHQLLKPACRRFAFSAKGFNSWALRIRNAPSLHRALAIYSQMHRQSVPHDSFSIL 60
           MLVNL   LL P+CR FAFSAKG NSWALRIRNAPSLH+ALA YSQMHRQSVPHDSFSIL
Sbjct: 4   MLVNL---LLNPSCRHFAFSAKGVNSWALRIRNAPSLHKALAFYSQMHRQSVPHDSFSIL 63

Query: 61  FMLKACAPSNNLSTLHHLHAHITKLGFTTHVFVATSLLYAYVLNSFQLACLVFDEMPHRN 120
           FMLKACA SNNLS LHHLHAHITKLGFTTHVFVATSLL++YVL+SFQLA LVFDEMPH+N
Sbjct: 64  FMLKACASSNNLSILHHLHAHITKLGFTTHVFVATSLLHSYVLHSFQLARLVFDEMPHKN 123

Query: 121 TVTWNTMILGYSKTGDVDRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIA 180
           +VTWNTMI GYSK GDV  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX  
Sbjct: 124 SVTWNTMISGYSKAGDVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 183

Query: 181 DGINPDQMAAGSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFL 240
               PDQMAAGSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFL
Sbjct: 184 XXXXPDQMAAGSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFL 243

Query: 241 KYACQIFLLMSEKNVKTWTALICGLAQHGYCKDALVLFETMTHEGVEPNELTFTGVLSAC 300
           KYACQIF LMSE+NV+TWTALICGLA HG CK+ALVLFETM HEGVEPNE TFTGVLSAC
Sbjct: 244 KYACQIFNLMSERNVRTWTALICGLAHHGCCKEALVLFETMRHEGVEPNEFTFTGVLSAC 303

Query: 301 AHAGFVQEGRKYFNMIEEYGLEIRIQHYGCMVDLLGKSGLLEEAYGVIKTMRLEPNVTVW 360
            HAG VQEGRKYFNMIEE GLEIRIQHYGC VDLLG+SGLLEEAYGVIK+MRLEPNV VW
Sbjct: 304 VHAGLVQEGRKYFNMIEECGLEIRIQHYGCFVDLLGRSGLLEEAYGVIKSMRLEPNVIVW 363

Query: 361 SSLLSACKQQKSFDMAERVIEQILEKIEPKNHGGIYSLISDLYVLEEKWDDAEKIRNLLS 420
           SSLLSACKQ KSFD+AERVIEQILEKIEP NH G+YSL+SDLYVL++KWDDAE IRNLL+
Sbjct: 364 SSLLSACKQHKSFDLAERVIEQILEKIEPDNHAGVYSLVSDLYVLQDKWDDAENIRNLLN 423

Query: 421 QNVPKVRAYSLIRSGL 437
           Q+V K RAYSLIRSGL
Sbjct: 424 QHVRKGRAYSLIRSGL 436

BLAST of Cla97C01G026040 vs. NCBI nr
Match: XP_022146506.1 (pentatricopeptide repeat-containing protein At5g66520-like [Momordica charantia])

HSP 1 Score: 654.4 bits (1687), Expect = 2.6e-184
Identity = 364/436 (83.49%), Postives = 393/436 (90.14%), Query Frame = 0

Query: 1   MLVNLSHQLLKPACRRFAFSAKGFNSWALRIRNAPSLHRALAIYSQMHRQSVPHDSFSIL 60
           M+V+L   LLKPA R+F    KG+NSWALRIRNAPSL +AL IYSQMHRQSVP+DSFSIL
Sbjct: 1   MVVHLCCHLLKPANRQFILFVKGYNSWALRIRNAPSLRKALTIYSQMHRQSVPYDSFSIL 60

Query: 61  FMLKACAPSNNLSTLHHLHAHITKLGFTTHVFVATSLLYAYVLNSFQLACLVFDEMPHRN 120
           FMLKACAPS NLS L HLHAHI KLGFT+H++VATSLLYAYVLNS QLACL+FDEMPHRN
Sbjct: 61  FMLKACAPSRNLSILEHLHAHIAKLGFTSHLYVATSLLYAYVLNSLQLACLLFDEMPHRN 120

Query: 121 TVTWNTMILGYSKTGDVDRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIA 180
           TV+WNTMI GYSK+G+VD XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX  IA
Sbjct: 121 TVSWNTMIFGYSKSGNVDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDMIA 180

Query: 181 DGINPDQMAAGSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFL 240
            G+NPDQMAAGSILNGCAHMGSLG LAGKSVHGFVVKNRWELNLELGTVLV MYAKCG L
Sbjct: 181 SGLNPDQMAAGSILNGCAHMGSLGFLAGKSVHGFVVKNRWELNLELGTVLVYMYAKCGSL 240

Query: 241 KYACQIFLLMSEKNVKTWTALICGLAQHGYCKDALVLFETMTHEGVEPNELTFTGVLSAC 300
           K ACQ+F LM EKNV++WTALICG  QHGY K+ALVLFE M +EGVEPNELTFTG+LSAC
Sbjct: 241 KNACQVFHLMPEKNVRSWTALICGSVQHGYSKEALVLFEMMRNEGVEPNELTFTGILSAC 300

Query: 301 AHAGFVQEGRKYFNMIEEYGLEIRIQHYGCMVDLLGKSGLLEEAYGVIKTMRLEPNVTVW 360
            HAG VQEGRKYFNMIEE+ LEIRIQHYGCMVDLLGKSGLLEEAYG+IK MRLEPNV VW
Sbjct: 301 VHAGLVQEGRKYFNMIEEFNLEIRIQHYGCMVDLLGKSGLLEEAYGIIKNMRLEPNVIVW 360

Query: 361 SSLLSACKQQKSFDMAERVIEQILEKIEPKNHGGIYSLISDLYVLEEKWDDAEKIRNLLS 420
           SS+LSACKQ   FD+AERVIEQILE+ EP+N+GGIYSLISDLYVLEEKWDDAEKIR L++
Sbjct: 361 SSILSACKQHNRFDIAERVIEQILERTEPENYGGIYSLISDLYVLEEKWDDAEKIRKLMN 420

Query: 421 QNVPKVRAYSLIRSGL 437
           QNV KVRAYSLIRS L
Sbjct: 421 QNVRKVRAYSLIRSEL 436

BLAST of Cla97C01G026040 vs. TrEMBL
Match: tr|A0A1S3AYC4|A0A1S3AYC4_CUCME (pentatricopeptide repeat-containing protein At5g66520-like OS=Cucumis melo OX=3656 GN=LOC103484239 PE=4 SV=1)

HSP 1 Score: 688.7 bits (1776), Expect = 8.3e-195
Identity = 385/436 (88.30%), Postives = 405/436 (92.89%), Query Frame = 0

Query: 1   MLVNLSHQLLKPACRRFAFSAKGFNSWALRIRNAPSLHRALAIYSQMHRQSVPHDSFSIL 60
           MLVNL   LL P+CR+FAFSAKGFNSWALRIRNAPSLH+ALAI+SQMHRQSVPHDSFSIL
Sbjct: 1   MLVNL---LLNPSCRQFAFSAKGFNSWALRIRNAPSLHKALAIFSQMHRQSVPHDSFSIL 60

Query: 61  FMLKACAPSNNLSTLHHLHAHITKLGFTTHVFVATSLLYAYVLNSFQLACLVFDEMPHRN 120
           FMLKACA SNNLS LHHLHAHITKLGFTTHVFVATSLL++YVL+SFQLA LVFDEMPH+N
Sbjct: 61  FMLKACASSNNLSILHHLHAHITKLGFTTHVFVATSLLHSYVLHSFQLARLVFDEMPHKN 120

Query: 121 TVTWNTMILGYSKTGDVDRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIA 180
           +VTWNTMI GYSKTGDV  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXI 
Sbjct: 121 SVTWNTMISGYSKTGDVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXII 180

Query: 181 DGINPDQMAAGSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFL 240
           +GINPDQMAAGSILNGCAHMGSLG LAGKSVHGFVVKNRWELNLELGTVLV MYAKCG L
Sbjct: 181 NGINPDQMAAGSILNGCAHMGSLGSLAGKSVHGFVVKNRWELNLELGTVLVHMYAKCGLL 240

Query: 241 KYACQIFLLMSEKNVKTWTALICGLAQHGYCKDALVLFETMTHEGVEPNELTFTGVLSAC 300
           KYACQIF LMSE+NV+TWTALICGLA HG CK+AL LFETM HEGVEPNELTFTGVLSAC
Sbjct: 241 KYACQIFHLMSERNVRTWTALICGLAHHGCCKEALALFETMRHEGVEPNELTFTGVLSAC 300

Query: 301 AHAGFVQEGRKYFNMIEEYGLEIRIQHYGCMVDLLGKSGLLEEAYGVIKTMRLEPNVTVW 360
            HAG VQEGRKYFNMIEEYGLEIRIQHYGC VDLLG+SGLLEEAYGVIK+MR EPNV VW
Sbjct: 301 VHAGLVQEGRKYFNMIEEYGLEIRIQHYGCFVDLLGRSGLLEEAYGVIKSMRFEPNVIVW 360

Query: 361 SSLLSACKQQKSFDMAERVIEQILEKIEPKNHGGIYSLISDLYVLEEKWDDAEKIRNLLS 420
           SSLLSACKQ KSFD+AERVIE ILEK EP NHGG+YSL+SDLYVL+EKWDDAE IRNLL+
Sbjct: 361 SSLLSACKQHKSFDLAERVIEHILEKTEPNNHGGVYSLVSDLYVLQEKWDDAENIRNLLN 420

Query: 421 QNVPKVRAYSLIRSGL 437
           Q V KVRAYSLIRSGL
Sbjct: 421 QKVRKVRAYSLIRSGL 433

BLAST of Cla97C01G026040 vs. TrEMBL
Match: tr|A0A0A0KIK9|A0A0A0KIK9_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G538690 PE=4 SV=1)

HSP 1 Score: 676.8 bits (1745), Expect = 3.3e-191
Identity = 381/436 (87.39%), Postives = 400/436 (91.74%), Query Frame = 0

Query: 1   MLVNLSHQLLKPACRRFAFSAKGFNSWALRIRNAPSLHRALAIYSQMHRQSVPHDSFSIL 60
           MLVNL   LL P+CR FAFSAKG NSWALRIRNAPSLH+ALA YSQMHRQSVPHDSFSIL
Sbjct: 4   MLVNL---LLNPSCRHFAFSAKGVNSWALRIRNAPSLHKALAFYSQMHRQSVPHDSFSIL 63

Query: 61  FMLKACAPSNNLSTLHHLHAHITKLGFTTHVFVATSLLYAYVLNSFQLACLVFDEMPHRN 120
           FMLKACA SNNLS LHHLHAHITKLGFTTHVFVATSLL++YVL+SFQLA LVFDEMPH+N
Sbjct: 64  FMLKACASSNNLSILHHLHAHITKLGFTTHVFVATSLLHSYVLHSFQLARLVFDEMPHKN 123

Query: 121 TVTWNTMILGYSKTGDVDRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIA 180
           +VTWNTMI GYSK GDV  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX  
Sbjct: 124 SVTWNTMISGYSKAGDVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 183

Query: 181 DGINPDQMAAGSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFL 240
               PDQMAAGSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFL
Sbjct: 184 XXXXPDQMAAGSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFL 243

Query: 241 KYACQIFLLMSEKNVKTWTALICGLAQHGYCKDALVLFETMTHEGVEPNELTFTGVLSAC 300
           KYACQIF LMSE+NV+TWTALICGLA HG CK+ALVLFETM HEGVEPNE TFTGVLSAC
Sbjct: 244 KYACQIFNLMSERNVRTWTALICGLAHHGCCKEALVLFETMRHEGVEPNEFTFTGVLSAC 303

Query: 301 AHAGFVQEGRKYFNMIEEYGLEIRIQHYGCMVDLLGKSGLLEEAYGVIKTMRLEPNVTVW 360
            HAG VQEGRKYFNMIEE GLEIRIQHYGC VDLLG+SGLLEEAYGVIK+MRLEPNV VW
Sbjct: 304 VHAGLVQEGRKYFNMIEECGLEIRIQHYGCFVDLLGRSGLLEEAYGVIKSMRLEPNVIVW 363

Query: 361 SSLLSACKQQKSFDMAERVIEQILEKIEPKNHGGIYSLISDLYVLEEKWDDAEKIRNLLS 420
           SSLLSACKQ KSFD+AERVIEQILEKIEP NH G+YSL+SDLYVL++KWDDAE IRNLL+
Sbjct: 364 SSLLSACKQHKSFDLAERVIEQILEKIEPDNHAGVYSLVSDLYVLQDKWDDAENIRNLLN 423

Query: 421 QNVPKVRAYSLIRSGL 437
           Q+V K RAYSLIRSGL
Sbjct: 424 QHVRKGRAYSLIRSGL 436

BLAST of Cla97C01G026040 vs. TrEMBL
Match: tr|A0A1U8AEQ8|A0A1U8AEQ8_NELNU (pentatricopeptide repeat-containing protein At5g66520-like OS=Nelumbo nucifera OX=4432 GN=LOC104599778 PE=4 SV=1)

HSP 1 Score: 486.1 bits (1250), Expect = 8.2e-134
Identity = 287/431 (66.59%), Postives = 341/431 (79.12%), Query Frame = 0

Query: 9   LLKPAC-RRFAFSAKGFNSWALRIRNAPSLHRALAIYSQMHRQSVPHDSFSILFMLKACA 68
           L+KPA   +F + +K +NSWA  IRNA S ++AL +Y+QM RQ VP DSF+ILF LK+C 
Sbjct: 5   LVKPAIEHQFRWFSKSYNSWASAIRNAASPYKALHLYTQMQRQGVPFDSFTILFTLKSCT 64

Query: 69  PSNNLSTLHHLHAHITKLGFTTHVFVATSLLYAYVLNSFQLACLVFDEMPHRNTVTWNTM 128
              NL+ + HLHAH+ KLGF +HV+VATSLLYAYV+ +F  A L+FDEMP +NTVTWNTM
Sbjct: 65  HLENLTLVRHLHAHLLKLGFNSHVYVATSLLYAYVIGTFHDARLLFDEMPEKNTVTWNTM 124

Query: 129 ILGYSKT-GDVDRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIADGINPD 188
           I GYSK       XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX        
Sbjct: 125 ITGYSKAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 184

Query: 189 QMAAGSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFLKYACQI 248
                     CAHMGSLGLL GKS+HGF VKN WELN+ELGTVL+DMYAKCGFLK AC+I
Sbjct: 185 XXXXXXXXXXCAHMGSLGLLVGKSIHGFTVKNGWELNVELGTVLIDMYAKCGFLKNACRI 244

Query: 249 FLLMSEKNVKTWTALICGLAQHGYCKDALVLFETMTHEGVEPNELTFTGVLSACAHAGFV 308
           F+ M EKNV +W+A+ICGLAQHGY ++AL LFE M   G++PNE+TFTG+ SAC  AG V
Sbjct: 245 FVKMPEKNVLSWSAMICGLAQHGYGEEALSLFEEMKAAGIKPNEITFTGIFSACTRAGLV 304

Query: 309 QEGRKYF-NMIEEYGLEIRIQHYGCMVDLLGKSGLLEEAYGVIKTMRLEPNVTVWSSLLS 368
            EG+KYF +MIEEYGLE RIQHYGCMVDLLGK+G LEEAY VIKTMRL+PNV VWSSLL+
Sbjct: 305 DEGKKYFKDMIEEYGLEPRIQHYGCMVDLLGKAGRLEEAYEVIKTMRLQPNVIVWSSLLA 364

Query: 369 ACKQQKSFDMAERVIEQILEKIEPKNHGGIYSLISDLYVLEEKWDDAEKIRNL-LSQNVP 428
           ACK  K F++AE+VIEQ+++ ++P N GG+Y+LISDLYVL +KWDDAE++R L L+QNV 
Sbjct: 365 ACKMHKKFELAEKVIEQVMQVVKPDNDGGVYTLISDLYVLNDKWDDAERVRKLMLNQNVK 424

Query: 429 KVRAYSLIRSG 436
           KVR  S IR+G
Sbjct: 425 KVRGSSFIRNG 435

BLAST of Cla97C01G026040 vs. TrEMBL
Match: tr|V4UCE3|V4UCE3_9ROSI (Uncharacterized protein OS=Citrus clementina OX=85681 GN=CICLE_v10018364mg PE=4 SV=1)

HSP 1 Score: 469.2 bits (1206), Expect = 1.0e-128
Identity = 274/435 (62.99%), Postives = 336/435 (77.24%), Query Frame = 0

Query: 1   MLVNLSHQLLKPACRRFAFSAKGFNSWALRIRNAPSLHRALAIYSQMHRQSVPHDSFSIL 60
           MLV ++HQL      +     K  N+WAL I++A S + A+ ++SQMHRQSVP DSFSIL
Sbjct: 1   MLV-INHQLKPTTKYQNWHFIKHLNTWALAIKDASSSNNAMQLFSQMHRQSVPSDSFSIL 60

Query: 61  FMLKACAPSNNLSTLHHLHAHITKLGFTTHVFVATSLLYAYVLNSFQLACLVFDEMPHRN 120
            +LK+C   NNL+ +HHLH+HI KLGF +HV+VAT          F  A  +FDE+P RN
Sbjct: 61  HILKSCTHFNNLTVIHHLHSHILKLGFISHVYVATXXXXXXXXXXFGFARKLFDELPERN 120

Query: 121 TVTWNTMILGYSKTGDVDRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIA 180
            VTWNT+I GYSK+G+V  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX  I+
Sbjct: 121 AVTWNTLIKGYSKSGNVCEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEMIS 180

Query: 181 D-GINPDQMAAGSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGF 240
           + G++PD+M  G++L+GC+H+GS+GLL GKS HGF+VKN WELN ++ T+LVDMYAKCGF
Sbjct: 181 NVGLSPDRMTIGAVLSGCSHLGSVGLLMGKSAHGFIVKNEWELNEQIATILVDMYAKCGF 240

Query: 241 LKYACQIFLLMSEKNVKTWTALICGLAQHGYCKDALVLFETMTHEGVEPNELTFTGVLSA 300
           LKYA  +F LM E+NV +WTALICG A  GY +DAL LFE M   GV+PNE+TFTGVL+A
Sbjct: 241 LKYALMVFELMEERNVISWTALICGSAHRGYSEDALSLFEMMQATGVKPNEMTFTGVLTA 300

Query: 301 CAHAGFVQEGRKYFNMI-EEYGLEIRIQHYGCMVDLLGKSGLLEEAYGVIKTMRLEPNVT 360
           C H G V EGRKYF MI EEY LE RIQHYGCMVDL GK+G LEEAY VI+TMRLEPNV 
Sbjct: 301 CVHTGLVDEGRKYFKMIDEEYDLEPRIQHYGCMVDLFGKAGFLEEAYEVIRTMRLEPNVI 360

Query: 361 VWSSLLSACKQQKSFDMAERVIEQILEKIEPKNHGGIYSLISDLYVLEEKWDDAEKIRNL 420
           +W S L+ACK+ K FDMAERVI+Q L  ++P+N GG+++LI DLY + EKW+DAE++R L
Sbjct: 361 IWGSFLAACKEHKQFDMAERVIKQALRMVKPENDGGVFTLICDLYTMNEKWEDAERVRKL 420

Query: 421 -LSQNVPKVRAYSLI 433
            L+QNV K R  S+I
Sbjct: 421 MLNQNVRKARGSSVI 434

BLAST of Cla97C01G026040 vs. TrEMBL
Match: tr|A0A2H5P9G2|A0A2H5P9G2_CITUN (Uncharacterized protein OS=Citrus unshiu OX=55188 GN=CUMW_115980 PE=4 SV=1)

HSP 1 Score: 469.2 bits (1206), Expect = 1.0e-128
Identity = 274/435 (62.99%), Postives = 336/435 (77.24%), Query Frame = 0

Query: 1   MLVNLSHQLLKPACRRFAFSAKGFNSWALRIRNAPSLHRALAIYSQMHRQSVPHDSFSIL 60
           MLV ++HQL      +     K  N+WAL I++A S + A+ ++SQMHRQSVP DSFSIL
Sbjct: 1   MLV-INHQLKPTTKYQNWHFIKHLNTWALAIKDASSSNNAMQLFSQMHRQSVPSDSFSIL 60

Query: 61  FMLKACAPSNNLSTLHHLHAHITKLGFTTHVFVATSLLYAYVLNSFQLACLVFDEMPHRN 120
            +LK+C   NNL+ +HHLH+HI KLGF +HV+VAT          F  A  +FDE+P RN
Sbjct: 61  HILKSCTHFNNLTVIHHLHSHILKLGFISHVYVATXXXXXXXXXXFGFARKLFDELPERN 120

Query: 121 TVTWNTMILGYSKTGDVDRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIA 180
            VTWNT+I GYSK+G+V  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX  I+
Sbjct: 121 AVTWNTLIKGYSKSGNVCEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEMIS 180

Query: 181 D-GINPDQMAAGSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGF 240
           + G++PD+M  G++L+GC+H+GS+GLL GKS HGF+VKN WELN ++ T+LVDMYAKCGF
Sbjct: 181 NVGLSPDRMTIGAVLSGCSHLGSVGLLMGKSAHGFIVKNEWELNEQIATILVDMYAKCGF 240

Query: 241 LKYACQIFLLMSEKNVKTWTALICGLAQHGYCKDALVLFETMTHEGVEPNELTFTGVLSA 300
           LKYA  +F LM E+NV +WTALICG A  GY +DAL LFE M   GV+PNE+TFTGVL+A
Sbjct: 241 LKYALMVFELMEERNVISWTALICGSAHRGYSEDALSLFEMMQATGVKPNEMTFTGVLTA 300

Query: 301 CAHAGFVQEGRKYFNMI-EEYGLEIRIQHYGCMVDLLGKSGLLEEAYGVIKTMRLEPNVT 360
           C H G V EGRKYF MI EEY LE RIQHYGCMVDL GK+G LEEAY VI+TMRLEPNV 
Sbjct: 301 CVHTGLVDEGRKYFKMIDEEYDLEPRIQHYGCMVDLFGKAGFLEEAYEVIRTMRLEPNVI 360

Query: 361 VWSSLLSACKQQKSFDMAERVIEQILEKIEPKNHGGIYSLISDLYVLEEKWDDAEKIRNL 420
           +W S L+ACK+ K FDMAERVI+Q L  ++P+N GG+++LI DLY + EKW+DAE++R L
Sbjct: 361 IWGSFLAACKEHKQFDMAERVIKQALRMVKPENDGGVFTLICDLYTMNEKWEDAERVRKL 420

Query: 421 -LSQNVPKVRAYSLI 433
            L+QNV K R  S+I
Sbjct: 421 MLNQNVRKARGSSVI 434

BLAST of Cla97C01G026040 vs. Swiss-Prot
Match: sp|Q9FJY7|PP449_ARATH (Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H61 PE=2 SV=1)

HSP 1 Score: 236.1 bits (601), Expect = 7.3e-61
Identity = 177/412 (42.96%), Postives = 259/412 (62.86%), Query Frame = 0

Query: 27  WALRIRN---APSLHRALAIYSQMHRQSVPHDSFSILFMLKACAPSNNLSTLHHLHAHIT 86
           W L IR    +    R+L +Y +M   S PH++++   +LKAC+  +       +HA IT
Sbjct: 83  WNLMIRGFSCSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFEETTQIHAQIT 142

Query: 87  KLGFTTHVFVATSLLYAY-VLNSFQLACLVFDEMPHRNTVTWNTMILGYSKTGDVDRXXX 146
           KLG+   V+   SL+ +Y V  +F+LA L+FD +P                      XXX
Sbjct: 143 KLGYENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPXXXXXXXXXXXXXXXXXXXXXXX 202

Query: 147 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIADGINPDQMAAGSILNGCAHMGS 206
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX    + PD ++  + L+ CA +G+
Sbjct: 203 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVEPDNVSLANALSACAQLGA 262

Query: 207 LGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFLKYACQIFLLMSEKNVKTWTALI 266
           L    GK +H ++ K R  ++  LG VL+DMYAKCG ++ A ++F  + +K+V+ WTALI
Sbjct: 263 LE--QGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKKKSVQAWTALI 322

Query: 267 CGLAQHGYCKDALVLFETMTHEGVEPNELTFTGVLSACAHAGFVQEGRK-YFNMIEEYGL 326
            G A HG+ ++A+  F  M   G++PN +TFT VL+AC++ G V+EG+  +++M  +Y L
Sbjct: 323 SGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIFYSMERDYNL 382

Query: 327 EIRIQHYGCMVDLLGKSGLLEEAYGVIKTMRLEPNVTVWSSLLSACKQQKSFDMAERVIE 386
           +  I+HYGC+VDLLG++GLL+EA   I+ M L+PN  +W +LL AC+  K+ ++ E  I 
Sbjct: 383 KPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKNIELGEE-IG 442

Query: 387 QILEKIEPKNHGGIYSLISDLYVLEEKWDDAEKIRNLL-SQNVPKVRAYSLI 433
           +IL  I+P  HGG Y   ++++ +++KWD A + R L+  Q V KV   S I
Sbjct: 443 EILIAIDP-YHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTI 490

BLAST of Cla97C01G026040 vs. Swiss-Prot
Match: sp|Q9FMA1|PP433_ARATH (Pentatricopeptide repeat-containing protein At5g56310 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E13 PE=2 SV=1)

HSP 1 Score: 224.2 bits (570), Expect = 2.9e-57
Identity = 147/397 (37.03%), Postives = 228/397 (57.43%), Query Frame = 0

Query: 28  ALRIRNAPSLHR-ALAIYSQMHRQSVPHDSFSILFMLKACAPSNNLSTLHHLHAHITKLG 87
           AL + + P+ H  A+ +Y ++       D+F+  F+LK     +++     +H  +   G
Sbjct: 87  ALSLLDEPNAHSIAITVYRKLWALCAKPDTFTFPFVLKIAVRVSDVWFGRQIHGQVVVFG 146

Query: 88  FTTHVFVATSLLYAYV-LNSFQLACLVFDEMPHRNTVTWNTMILGYSKTGDVD--RXXXX 147
           F + V V T L+  Y        A  +FDEM  ++   WN ++ GY K G++D  R    
Sbjct: 147 FDSSVHVVTGLIQMYFSCGGLGDARKMFDEMLVKDVNVWNALLAGYGKVGEMDEARSLLE 206

Query: 148 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIADGINPDQMAAGSILNGCAHMGSL 207
                     XXXXXXXXXXXXXXXXXXXXX    + + + PD++   ++L+ CA +GSL
Sbjct: 207 MMPCWVRNEVXXXXXXXXXXXXXXXXXXXXXFQRMLMENVEPDEVTLLAVLSACADLGSL 266

Query: 208 GLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFLKYACQIFLLMSEKNVKTWTALIC 267
            L  G+ +  +V        + L   ++DMYAK G +  A  +F  ++E+NV TWT +I 
Sbjct: 267 EL--GERICSYVDHRGMNRAVSLNNAVIDMYAKSGNITKALDVFECVNERNVVTWTTIIA 326

Query: 268 GLAQHGYCKDALVLFETMTHEGVEPNELTFTGVLSACAHAGFVQEGRKYFN-MIEEYGLE 327
           GLA HG+  +AL +F  M   GV PN++TF  +LSAC+H G+V  G++ FN M  +YG+ 
Sbjct: 327 GLATHGHGAEALAMFNRMVKAGVRPNDVTFIAILSACSHVGWVDLGKRLFNSMRSKYGIH 386

Query: 328 IRIQHYGCMVDLLGKSGLLEEAYGVIKTMRLEPNVTVWSSLLSACKQQKSFDMAERVIEQ 387
             I+HYGCM+DLLG++G L EA  VIK+M  + N  +W SLL+A       ++ ER + +
Sbjct: 387 PNIEHYGCMIDLLGRAGKLREADEVIKSMPFKANAAIWGSLLAASNVHHDLELGERALSE 446

Query: 388 ILEKIEPKNHGGIYSLISDLYVLEEKWDDAEKIRNLL 420
           ++ K+EP N G  Y L+++LY    +WD++  +RN++
Sbjct: 447 LI-KLEPNNSGN-YMLLANLYSNLGRWDESRMMRNMM 479

BLAST of Cla97C01G026040 vs. Swiss-Prot
Match: sp|Q9STF3|PP265_ARATH (Pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CRR2 PE=2 SV=1)

HSP 1 Score: 223.0 bits (567), Expect = 6.4e-57
Identity = 125/385 (32.47%), Postives = 204/385 (52.99%), Query Frame = 0

Query: 41  LAIYSQMHRQSVPHDSFSILFMLKACAPS----NNLSTLHHLHAHITKLGFTTHVFVATS 100
           L +Y +M+R  V  D F+  ++LKAC  S    N+L     +HAH+T+ G+++HV++ T+
Sbjct: 163 LGLYWKMNRIGVESDRFTYTYVLKACVASECTVNHLMKGKEIHAHLTRRGYSSHVYIMTT 222

Query: 101 LLYAYV-LNSFQLACLVFDEMPHRNTVTWNTMILGYSKTGDVDRXXXXXXXXXXXXXXXX 160
           L+  Y        A  VF  MP RN V+W+ MI  Y+K G                    
Sbjct: 223 LVDMYARFGCVDYASYVFGGMPVRNVVSWSAMIACYAKNGKAFEALRTFREMMRE----- 282

Query: 161 XXXXXXXXXXXXXXXXXXXXXXXIADGINPDQMAAGSILNGCAHMGSLGLLAGKSVHGFV 220
                                       +P+ +   S+L  CA + +L    GK +HG++
Sbjct: 283 ------------------------TKDSSPNSVTMVSVLQACASLAALE--QGKLIHGYI 342

Query: 221 VKNRWELNLELGTVLVDMYAKCGFLKYACQIFLLMSEKNVKTWTALICGLAQHGYCKDAL 280
           ++   +  L + + LV MY +CG L+   ++F  M +++V +W +LI     HGY K A+
Sbjct: 343 LRRGLDSILPVISALVTMYGRCGKLEVGQRVFDRMHDRDVVSWNSLISSYGVHGYGKKAI 402

Query: 281 VLFETMTHEGVEPNELTFTGVLSACAHAGFVQEGRKYF-NMIEEYGLEIRIQHYGCMVDL 340
            +FE M   G  P  +TF  VL AC+H G V+EG++ F  M  ++G++ +I+HY CMVDL
Sbjct: 403 QIFEEMLANGASPTPVTFVSVLGACSHEGLVEEGKRLFETMWRDHGIKPQIEHYACMVDL 462

Query: 341 LGKSGLLEEAYGVIKTMRLEPNVTVWSSLLSACKQQKSFDMAERVIEQILEKIEPKNHGG 400
           LG++  L+EA  +++ MR EP   VW SLL +C+   + ++AER   ++   +EPKN G 
Sbjct: 463 LGRANRLDEAAKMVQDMRTEPGPKVWGSLLGSCRIHGNVELAERASRRLF-ALEPKNAGN 514

Query: 401 IYSLISDLYVLEEKWDDAEKIRNLL 420
            Y L++D+Y   + WD+ ++++ LL
Sbjct: 523 -YVLLADIYAEAQMWDEVKRVKKLL 514

BLAST of Cla97C01G026040 vs. Swiss-Prot
Match: sp|Q9SX45|PPR75_ARATH (Pentatricopeptide repeat-containing protein At1g50270 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E42 PE=2 SV=1)

HSP 1 Score: 221.5 bits (563), Expect = 1.9e-56
Identity = 156/406 (38.42%), Postives = 227/406 (55.91%), Query Frame = 0

Query: 31  IRNAPSLHRALAIYSQMHRQSVPHDSFSILFMLKACAPSNNLSTLHHLHAHITKLG-FTT 90
           +RN  S   A+  + +M +  V  +  +++ +LKA     ++     +H    + G    
Sbjct: 180 VRNG-SASEAMVYFVEMKKTGVAANEMTVVSVLKAAGKVEDVRFGRSVHGLYLETGRVKC 239

Query: 91  HVFVATSLLYAY-VLNSFQLACLVFDEMPHRNTVTWNTMILGYSKTGDVDRXXXXXXXXX 150
            VF+ +SL+  Y   + +  A  VFDEMP RN VTW                        
Sbjct: 240 DVFIGSSLVDMYGKCSCYDDAQKVFDEMPSRNVVTWT----------------------- 299

Query: 151 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIADGINPDQMAAGSILNGCAHMGSLGLLAG 210
                   XXXXXXXXXXXXXXXXXXXXXX    + P++    S+L+ CAH+G+L    G
Sbjct: 300 --------XXXXXXXXXXXXXXXXXXXXXXXXXXVAPNEKTLSSVLSACAHVGALH--RG 359

Query: 211 KSVHGFVVKNRWELNLELGTVLVDMYAKCGFLKYACQIFLLMSEKNVKTWTALICGLAQH 270
           + VH +++KN  E+N   GT L+D+Y KCG L+ A  +F  + EKNV TWTA+I G A H
Sbjct: 360 RRVHCYMIKNSIEINTTAGTTLIDLYVKCGCLEEAILVFERLHEKNVYTWTAMINGFAAH 419

Query: 271 GYCKDALVLFETMTHEGVEPNELTFTGVLSACAHAGFVQEGRKYF-NMIEEYGLEIRIQH 330
           GY +DA  LF TM    V PNE+TF  VLSACAH G V+EGR+ F +M   + +E +  H
Sbjct: 420 GYARDAFDLFYTMLSSHVSPNEVTFMAVLSACAHGGLVEEGRRLFLSMKGRFNMEPKADH 479

Query: 331 YGCMVDLLGKSGLLEEAYGVIKTMRLEPNVTVWSSLLSACKQQKSFDMAERVIEQILEKI 390
           Y CMVDL G+ GLLEEA  +I+ M +EP   VW +L  +C   K +++ +    +++ K+
Sbjct: 480 YACMVDLFGRKGLLEEAKALIERMPMEPTNVVWGALFGSCLLHKDYELGKYAASRVI-KL 539

Query: 391 EPKNHGGIYSLISDLYVLEEKWDDAEKIR-NLLSQNVPKVRAYSLI 433
           +P +H G Y+L+++LY   + WD+  ++R  +  Q V K   +S I
Sbjct: 540 QP-SHSGRYTLLANLYSESQNWDEVARVRKQMKDQQVVKSPGFSWI 549

BLAST of Cla97C01G026040 vs. Swiss-Prot
Match: sp|Q9ZVF4|PP140_ARATH (Pentatricopeptide repeat-containing protein At2g01510, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H37 PE=3 SV=1)

HSP 1 Score: 214.9 bits (546), Expect = 1.7e-54
Identity = 129/398 (32.41%), Postives = 204/398 (51.26%), Query Frame = 0

Query: 40  ALAIYSQMHRQSVPHDSFSILFMLKACAPSNNLSTLHHLHAHITKLGFTTHVFVATSLLY 99
           +L +Y +M    V  D F+  F++KA +   + S    LHAH+ K GF     VAT L+ 
Sbjct: 93  SLLLYKKMRDLGVRPDEFTYPFVVKAISQLGDFSCGFALHAHVVKYGFGCLGIVATELVM 152

Query: 100 AYV-LNSFQLACLVFDEMPHRNTVTWNTMILGYSKTGDVDRXXXXXXXXXXXXXXXXXXX 159
            Y+       A  +F+ M  ++ V WN  +    +TG+                      
Sbjct: 153 MYMKFGELSSAEFLFESMQVKDLVAWNAFLAVCVQTGN---------------------- 212

Query: 160 XXXXXXXXXXXXXXXXXXXXIADGINPDQMAAGSILNGCAHMGSLGLLAGKSVHGFVVKN 219
                                AD +  D     S+L+ C  +GSL +  G+ ++    K 
Sbjct: 213 ---------SAIALEYFNKMCADAVQFDSFTVVSMLSACGQLGSLEI--GEEIYDRARKE 272

Query: 220 RWELNLELGTVLVDMYAKCGFLKYACQIFLLMSEKNVKTWTALICGLAQHGYCKDALVLF 279
             + N+ +    +DM+ KCG  + A  +F  M ++NV +W+ +I G A +G  ++AL LF
Sbjct: 273 EIDCNIIVENARLDMHLKCGNTEAARVLFEEMKQRNVVSWSTMIVGYAMNGDSREALTLF 332

Query: 280 ETMTHEGVEPNELTFTGVLSACAHAGFVQEGRKYFNMI---EEYGLEIRIQHYGCMVDLL 339
            TM +EG+ PN +TF GVLSAC+HAG V EG++YF+++    +  LE R +HY CMVDLL
Sbjct: 333 TTMQNEGLRPNYVTFLGVLSACSHAGLVNEGKRYFSLMVQSNDKNLEPRKEHYACMVDLL 392

Query: 340 GKSGLLEEAYGVIKTMRLEPNVTVWSSLLSACKQQKSFDMAERVIEQILEKIEPKNHGGI 399
           G+SGLLEEAY  IK M +EP+  +W +LL AC   +   + ++V + ++E     + G  
Sbjct: 393 GRSGLLEEAYEFIKKMPVEPDTGIWGALLGACAVHRDMILGQKVADVLVE--TAPDIGSY 452

Query: 400 YSLISDLYVLEEKWDDAEKIRNLLSQ-NVPKVRAYSLI 433
           + L+S++Y    KWD  +K+R+ + +    KV AYS +
Sbjct: 453 HVLLSNIYAAAGKWDCVDKVRSKMRKLGTKKVAAYSSV 455

BLAST of Cla97C01G026040 vs. TAIR10
Match: AT5G66520.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 236.1 bits (601), Expect = 4.1e-62
Identity = 177/412 (42.96%), Postives = 259/412 (62.86%), Query Frame = 0

Query: 27  WALRIRN---APSLHRALAIYSQMHRQSVPHDSFSILFMLKACAPSNNLSTLHHLHAHIT 86
           W L IR    +    R+L +Y +M   S PH++++   +LKAC+  +       +HA IT
Sbjct: 83  WNLMIRGFSCSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFEETTQIHAQIT 142

Query: 87  KLGFTTHVFVATSLLYAY-VLNSFQLACLVFDEMPHRNTVTWNTMILGYSKTGDVDRXXX 146
           KLG+   V+   SL+ +Y V  +F+LA L+FD +P                      XXX
Sbjct: 143 KLGYENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPXXXXXXXXXXXXXXXXXXXXXXX 202

Query: 147 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIADGINPDQMAAGSILNGCAHMGS 206
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX    + PD ++  + L+ CA +G+
Sbjct: 203 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVEPDNVSLANALSACAQLGA 262

Query: 207 LGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFLKYACQIFLLMSEKNVKTWTALI 266
           L    GK +H ++ K R  ++  LG VL+DMYAKCG ++ A ++F  + +K+V+ WTALI
Sbjct: 263 LE--QGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKKKSVQAWTALI 322

Query: 267 CGLAQHGYCKDALVLFETMTHEGVEPNELTFTGVLSACAHAGFVQEGRK-YFNMIEEYGL 326
            G A HG+ ++A+  F  M   G++PN +TFT VL+AC++ G V+EG+  +++M  +Y L
Sbjct: 323 SGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIFYSMERDYNL 382

Query: 327 EIRIQHYGCMVDLLGKSGLLEEAYGVIKTMRLEPNVTVWSSLLSACKQQKSFDMAERVIE 386
           +  I+HYGC+VDLLG++GLL+EA   I+ M L+PN  +W +LL AC+  K+ ++ E  I 
Sbjct: 383 KPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKNIELGEE-IG 442

Query: 387 QILEKIEPKNHGGIYSLISDLYVLEEKWDDAEKIRNLL-SQNVPKVRAYSLI 433
           +IL  I+P  HGG Y   ++++ +++KWD A + R L+  Q V KV   S I
Sbjct: 443 EILIAIDP-YHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTI 490

BLAST of Cla97C01G026040 vs. TAIR10
Match: AT5G56310.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 224.2 bits (570), Expect = 1.6e-58
Identity = 147/397 (37.03%), Postives = 228/397 (57.43%), Query Frame = 0

Query: 28  ALRIRNAPSLHR-ALAIYSQMHRQSVPHDSFSILFMLKACAPSNNLSTLHHLHAHITKLG 87
           AL + + P+ H  A+ +Y ++       D+F+  F+LK     +++     +H  +   G
Sbjct: 87  ALSLLDEPNAHSIAITVYRKLWALCAKPDTFTFPFVLKIAVRVSDVWFGRQIHGQVVVFG 146

Query: 88  FTTHVFVATSLLYAYV-LNSFQLACLVFDEMPHRNTVTWNTMILGYSKTGDVD--RXXXX 147
           F + V V T L+  Y        A  +FDEM  ++   WN ++ GY K G++D  R    
Sbjct: 147 FDSSVHVVTGLIQMYFSCGGLGDARKMFDEMLVKDVNVWNALLAGYGKVGEMDEARSLLE 206

Query: 148 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIADGINPDQMAAGSILNGCAHMGSL 207
                     XXXXXXXXXXXXXXXXXXXXX    + + + PD++   ++L+ CA +GSL
Sbjct: 207 MMPCWVRNEVXXXXXXXXXXXXXXXXXXXXXFQRMLMENVEPDEVTLLAVLSACADLGSL 266

Query: 208 GLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFLKYACQIFLLMSEKNVKTWTALIC 267
            L  G+ +  +V        + L   ++DMYAK G +  A  +F  ++E+NV TWT +I 
Sbjct: 267 EL--GERICSYVDHRGMNRAVSLNNAVIDMYAKSGNITKALDVFECVNERNVVTWTTIIA 326

Query: 268 GLAQHGYCKDALVLFETMTHEGVEPNELTFTGVLSACAHAGFVQEGRKYFN-MIEEYGLE 327
           GLA HG+  +AL +F  M   GV PN++TF  +LSAC+H G+V  G++ FN M  +YG+ 
Sbjct: 327 GLATHGHGAEALAMFNRMVKAGVRPNDVTFIAILSACSHVGWVDLGKRLFNSMRSKYGIH 386

Query: 328 IRIQHYGCMVDLLGKSGLLEEAYGVIKTMRLEPNVTVWSSLLSACKQQKSFDMAERVIEQ 387
             I+HYGCM+DLLG++G L EA  VIK+M  + N  +W SLL+A       ++ ER + +
Sbjct: 387 PNIEHYGCMIDLLGRAGKLREADEVIKSMPFKANAAIWGSLLAASNVHHDLELGERALSE 446

Query: 388 ILEKIEPKNHGGIYSLISDLYVLEEKWDDAEKIRNLL 420
           ++ K+EP N G  Y L+++LY    +WD++  +RN++
Sbjct: 447 LI-KLEPNNSGN-YMLLANLYSNLGRWDESRMMRNMM 479

BLAST of Cla97C01G026040 vs. TAIR10
Match: AT3G46790.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 223.0 bits (567), Expect = 3.6e-58
Identity = 125/385 (32.47%), Postives = 204/385 (52.99%), Query Frame = 0

Query: 41  LAIYSQMHRQSVPHDSFSILFMLKACAPS----NNLSTLHHLHAHITKLGFTTHVFVATS 100
           L +Y +M+R  V  D F+  ++LKAC  S    N+L     +HAH+T+ G+++HV++ T+
Sbjct: 163 LGLYWKMNRIGVESDRFTYTYVLKACVASECTVNHLMKGKEIHAHLTRRGYSSHVYIMTT 222

Query: 101 LLYAYV-LNSFQLACLVFDEMPHRNTVTWNTMILGYSKTGDVDRXXXXXXXXXXXXXXXX 160
           L+  Y        A  VF  MP RN V+W+ MI  Y+K G                    
Sbjct: 223 LVDMYARFGCVDYASYVFGGMPVRNVVSWSAMIACYAKNGKAFEALRTFREMMRE----- 282

Query: 161 XXXXXXXXXXXXXXXXXXXXXXXIADGINPDQMAAGSILNGCAHMGSLGLLAGKSVHGFV 220
                                       +P+ +   S+L  CA + +L    GK +HG++
Sbjct: 283 ------------------------TKDSSPNSVTMVSVLQACASLAALE--QGKLIHGYI 342

Query: 221 VKNRWELNLELGTVLVDMYAKCGFLKYACQIFLLMSEKNVKTWTALICGLAQHGYCKDAL 280
           ++   +  L + + LV MY +CG L+   ++F  M +++V +W +LI     HGY K A+
Sbjct: 343 LRRGLDSILPVISALVTMYGRCGKLEVGQRVFDRMHDRDVVSWNSLISSYGVHGYGKKAI 402

Query: 281 VLFETMTHEGVEPNELTFTGVLSACAHAGFVQEGRKYF-NMIEEYGLEIRIQHYGCMVDL 340
            +FE M   G  P  +TF  VL AC+H G V+EG++ F  M  ++G++ +I+HY CMVDL
Sbjct: 403 QIFEEMLANGASPTPVTFVSVLGACSHEGLVEEGKRLFETMWRDHGIKPQIEHYACMVDL 462

Query: 341 LGKSGLLEEAYGVIKTMRLEPNVTVWSSLLSACKQQKSFDMAERVIEQILEKIEPKNHGG 400
           LG++  L+EA  +++ MR EP   VW SLL +C+   + ++AER   ++   +EPKN G 
Sbjct: 463 LGRANRLDEAAKMVQDMRTEPGPKVWGSLLGSCRIHGNVELAERASRRLF-ALEPKNAGN 514

Query: 401 IYSLISDLYVLEEKWDDAEKIRNLL 420
            Y L++D+Y   + WD+ ++++ LL
Sbjct: 523 -YVLLADIYAEAQMWDEVKRVKKLL 514

BLAST of Cla97C01G026040 vs. TAIR10
Match: AT1G50270.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 221.5 bits (563), Expect = 1.0e-57
Identity = 156/406 (38.42%), Postives = 227/406 (55.91%), Query Frame = 0

Query: 31  IRNAPSLHRALAIYSQMHRQSVPHDSFSILFMLKACAPSNNLSTLHHLHAHITKLG-FTT 90
           +RN  S   A+  + +M +  V  +  +++ +LKA     ++     +H    + G    
Sbjct: 180 VRNG-SASEAMVYFVEMKKTGVAANEMTVVSVLKAAGKVEDVRFGRSVHGLYLETGRVKC 239

Query: 91  HVFVATSLLYAY-VLNSFQLACLVFDEMPHRNTVTWNTMILGYSKTGDVDRXXXXXXXXX 150
            VF+ +SL+  Y   + +  A  VFDEMP RN VTW                        
Sbjct: 240 DVFIGSSLVDMYGKCSCYDDAQKVFDEMPSRNVVTWT----------------------- 299

Query: 151 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIADGINPDQMAAGSILNGCAHMGSLGLLAG 210
                   XXXXXXXXXXXXXXXXXXXXXX    + P++    S+L+ CAH+G+L    G
Sbjct: 300 --------XXXXXXXXXXXXXXXXXXXXXXXXXXVAPNEKTLSSVLSACAHVGALH--RG 359

Query: 211 KSVHGFVVKNRWELNLELGTVLVDMYAKCGFLKYACQIFLLMSEKNVKTWTALICGLAQH 270
           + VH +++KN  E+N   GT L+D+Y KCG L+ A  +F  + EKNV TWTA+I G A H
Sbjct: 360 RRVHCYMIKNSIEINTTAGTTLIDLYVKCGCLEEAILVFERLHEKNVYTWTAMINGFAAH 419

Query: 271 GYCKDALVLFETMTHEGVEPNELTFTGVLSACAHAGFVQEGRKYF-NMIEEYGLEIRIQH 330
           GY +DA  LF TM    V PNE+TF  VLSACAH G V+EGR+ F +M   + +E +  H
Sbjct: 420 GYARDAFDLFYTMLSSHVSPNEVTFMAVLSACAHGGLVEEGRRLFLSMKGRFNMEPKADH 479

Query: 331 YGCMVDLLGKSGLLEEAYGVIKTMRLEPNVTVWSSLLSACKQQKSFDMAERVIEQILEKI 390
           Y CMVDL G+ GLLEEA  +I+ M +EP   VW +L  +C   K +++ +    +++ K+
Sbjct: 480 YACMVDLFGRKGLLEEAKALIERMPMEPTNVVWGALFGSCLLHKDYELGKYAASRVI-KL 539

Query: 391 EPKNHGGIYSLISDLYVLEEKWDDAEKIR-NLLSQNVPKVRAYSLI 433
           +P +H G Y+L+++LY   + WD+  ++R  +  Q V K   +S I
Sbjct: 540 QP-SHSGRYTLLANLYSESQNWDEVARVRKQMKDQQVVKSPGFSWI 549

BLAST of Cla97C01G026040 vs. TAIR10
Match: AT2G01510.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 214.9 bits (546), Expect = 9.7e-56
Identity = 129/398 (32.41%), Postives = 204/398 (51.26%), Query Frame = 0

Query: 40  ALAIYSQMHRQSVPHDSFSILFMLKACAPSNNLSTLHHLHAHITKLGFTTHVFVATSLLY 99
           +L +Y +M    V  D F+  F++KA +   + S    LHAH+ K GF     VAT L+ 
Sbjct: 93  SLLLYKKMRDLGVRPDEFTYPFVVKAISQLGDFSCGFALHAHVVKYGFGCLGIVATELVM 152

Query: 100 AYV-LNSFQLACLVFDEMPHRNTVTWNTMILGYSKTGDVDRXXXXXXXXXXXXXXXXXXX 159
            Y+       A  +F+ M  ++ V WN  +    +TG+                      
Sbjct: 153 MYMKFGELSSAEFLFESMQVKDLVAWNAFLAVCVQTGN---------------------- 212

Query: 160 XXXXXXXXXXXXXXXXXXXXIADGINPDQMAAGSILNGCAHMGSLGLLAGKSVHGFVVKN 219
                                AD +  D     S+L+ C  +GSL +  G+ ++    K 
Sbjct: 213 ---------SAIALEYFNKMCADAVQFDSFTVVSMLSACGQLGSLEI--GEEIYDRARKE 272

Query: 220 RWELNLELGTVLVDMYAKCGFLKYACQIFLLMSEKNVKTWTALICGLAQHGYCKDALVLF 279
             + N+ +    +DM+ KCG  + A  +F  M ++NV +W+ +I G A +G  ++AL LF
Sbjct: 273 EIDCNIIVENARLDMHLKCGNTEAARVLFEEMKQRNVVSWSTMIVGYAMNGDSREALTLF 332

Query: 280 ETMTHEGVEPNELTFTGVLSACAHAGFVQEGRKYFNMI---EEYGLEIRIQHYGCMVDLL 339
            TM +EG+ PN +TF GVLSAC+HAG V EG++YF+++    +  LE R +HY CMVDLL
Sbjct: 333 TTMQNEGLRPNYVTFLGVLSACSHAGLVNEGKRYFSLMVQSNDKNLEPRKEHYACMVDLL 392

Query: 340 GKSGLLEEAYGVIKTMRLEPNVTVWSSLLSACKQQKSFDMAERVIEQILEKIEPKNHGGI 399
           G+SGLLEEAY  IK M +EP+  +W +LL AC   +   + ++V + ++E     + G  
Sbjct: 393 GRSGLLEEAYEFIKKMPVEPDTGIWGALLGACAVHRDMILGQKVADVLVE--TAPDIGSY 452

Query: 400 YSLISDLYVLEEKWDDAEKIRNLLSQ-NVPKVRAYSLI 433
           + L+S++Y    KWD  +K+R+ + +    KV AYS +
Sbjct: 453 HVLLSNIYAAAGKWDCVDKVRSKMRKLGTKKVAAYSSV 455

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_023544798.11.9e-19886.70pentatricopeptide repeat-containing protein At5g66520-like [Cucurbita pepo subsp... [more]
XP_022925591.14.6e-19786.24pentatricopeptide repeat-containing protein At5g66520-like [Cucurbita moschata][more]
XP_008439426.11.3e-19488.30PREDICTED: pentatricopeptide repeat-containing protein At5g66520-like [Cucumis m... [more]
XP_011658370.15.0e-19187.39PREDICTED: pentatricopeptide repeat-containing protein At5g66520-like [Cucumis s... [more]
XP_022146506.12.6e-18483.49pentatricopeptide repeat-containing protein At5g66520-like [Momordica charantia][more]
Match NameE-valueIdentityDescription
tr|A0A1S3AYC4|A0A1S3AYC4_CUCME8.3e-19588.30pentatricopeptide repeat-containing protein At5g66520-like OS=Cucumis melo OX=36... [more]
tr|A0A0A0KIK9|A0A0A0KIK9_CUCSA3.3e-19187.39Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G538690 PE=4 SV=1[more]
tr|A0A1U8AEQ8|A0A1U8AEQ8_NELNU8.2e-13466.59pentatricopeptide repeat-containing protein At5g66520-like OS=Nelumbo nucifera O... [more]
tr|V4UCE3|V4UCE3_9ROSI1.0e-12862.99Uncharacterized protein OS=Citrus clementina OX=85681 GN=CICLE_v10018364mg PE=4 ... [more]
tr|A0A2H5P9G2|A0A2H5P9G2_CITUN1.0e-12862.99Uncharacterized protein OS=Citrus unshiu OX=55188 GN=CUMW_115980 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
sp|Q9FJY7|PP449_ARATH7.3e-6142.96Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX... [more]
sp|Q9FMA1|PP433_ARATH2.9e-5737.03Pentatricopeptide repeat-containing protein At5g56310 OS=Arabidopsis thaliana OX... [more]
sp|Q9STF3|PP265_ARATH6.4e-5732.47Pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Arabidop... [more]
sp|Q9SX45|PPR75_ARATH1.9e-5638.42Pentatricopeptide repeat-containing protein At1g50270 OS=Arabidopsis thaliana OX... [more]
sp|Q9ZVF4|PP140_ARATH1.7e-5432.41Pentatricopeptide repeat-containing protein At2g01510, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
AT5G66520.14.1e-6242.96Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G56310.11.6e-5837.03Pentatricopeptide repeat (PPR) superfamily protein[more]
AT3G46790.13.6e-5832.47Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G50270.11.0e-5738.42Pentatricopeptide repeat (PPR) superfamily protein[more]
AT2G01510.19.7e-5632.41Tetratricopeptide repeat (TPR)-like superfamily protein[more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0009451 RNA modification
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C01G026040.1Cla97C01G026040.1mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 91..210
e-value: 6.5E-27
score: 96.7
coord: 228..433
e-value: 1.4E-41
score: 144.9
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 153..186
e-value: 7.8E-7
score: 26.9
coord: 292..322
e-value: 0.0028
score: 15.7
coord: 122..153
e-value: 2.1E-8
score: 31.8
coord: 257..290
e-value: 2.0E-6
score: 25.6
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 253..301
e-value: 5.2E-12
score: 45.6
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 328..352
e-value: 0.01
score: 15.9
coord: 359..385
e-value: 0.2
score: 11.9
coord: 153..183
e-value: 9.2E-5
score: 22.3
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 118..147
e-value: 7.7E-9
score: 35.0
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 254..288
score: 11.345
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 324..354
score: 6.851
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 120..154
score: 12.584
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 223..253
score: 6.445
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 289..323
score: 8.67
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 155..185
score: 7.267
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 356..386
score: 8.133
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 55..89
score: 5.272
NoneNo IPR availablePANTHERPTHR24015:SF735SUBFAMILY NOT NAMEDcoord: 28..431
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 28..431

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
Cla97C01G026040Silver-seed gourdcarwmbB0764
Cla97C01G026040Cucumber (Gy14) v2cgybwmbB394
Cla97C01G026040Cucurbita maxima (Rimu)cmawmbB875
Cla97C01G026040Cucurbita moschata (Rifu)cmowmbB849
Cla97C01G026040Melon (DHL92) v3.5.1mewmbB515
Cla97C01G026040Watermelon (Charleston Gray)wcgwmbB089