Cla97C09G171075 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C09G171075
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Pentatricopeptide repeat-containing protein
Location: Cla97Chr09: 7427059 .. 7428317 (+)
RNA-Seq Expression: Cla97C09G171075
Synteny: Cla97C09G171075
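
The Location field above packs chromosome, start, end, and strand into one string. A minimal Python sketch of how it could be split apart is shown here; the function name `parse_location` is hypothetical, and the coordinates are assumed to be 1-based and inclusive, which this record does not state explicitly.

```python
import re

def parse_location(text):
    """Split a location such as 'Cla97Chr09: 7427059 .. 7428317 (+)' into parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, strand = m.group(1), m.group(4)
    start, end = int(m.group(2)), int(m.group(3))
    # Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
    return {"chrom": chrom, "start": start, "end": end,
            "strand": strand, "length": end - start + 1}

loc = parse_location("Cla97Chr09: 7427059 .. 7428317 (+)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the span works out to 1259 bp on the plus strand.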
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCGCCTAGCTAAACTTTTGCCTGATTACTTCATGCTCAATGTGAATCAAGTCAGCGAAGGTCTTGCGACCATGGCAGCAATTTTAAGGAGAGGTTATATTCCTAATATAGTGACATATACGACCTTGATTAAGGGCTTGTGTATGGAACATAGGTAGTGAAGCCACAAGGTTGTTTATGAGAATGCAAAAGTTAGGTTGTATGCCTAATGTGGTTAATTATACCTCTTTGATTCATGGATTTTCCTAGGGTGGGTGGAAACTGGAAGGAGGCTAAACATTCGTTAAATGAGATGGTGGATCAAGGTGTTCGACCAGATATGGTTACATTTAGTTTGTTAATTGATATCCTTTGCTAGGAAGGAAAGGTTATCAAGGCTCAGGAGTTCCTAAAGGCGGTGATTCAGACAGGCAATGTTCTAGATTCATTTACTTATACTTCAGAGGGGTTTTGTTTGGTTGGTGACTTGAATAGTGCGAGGGAACTCTTTGTTAGTATGCTAAGTAAGGGATGTGAACCTGATGTGATTAGTTACAATGTGCTAATCAATGGGTATTGTAAAACTAGGAAGGTGGAAGAAGCAATGAAGATTTACAATGAAATGCTTCTAGAGGGAGAGAGGCCAGGTGTGAAAACATATGGTGCCTTGTTATCGGGGCTTTTTCAGGCAGGCAAGGTTGGTGATGCAAAGAAGCTACTTGGTGTTATGGAAGCTTATGGTATTCCCGCAGATTCATGTATATATCTTATTTCCTTAGATGGGTTGTGTAAGAATGGTTGTTTATTTGAAGCGATGGAACTTTTTAAAGAGCTAAGATCATGCAACTTGAAATTGGATATTAAAATCTTTAATGGTCTAATTGATGGCCCGTGTAAACCAGGAAAACTTGATACTGCCTGGGAGCTTTTCGGAAAACTGTCCCAAGAAGGGCATCAACCAAATGTTGTGACTTATACCATTATGATCCATGGGTTTTGTAAAAAGGGACAAGTAGATAAGGCAAATATTTTGTTTTAAAAGATGGAAGAAAATGTCTGTACTCCCGACATAATTACATATAATACCCTTATGCGTTGTTTTTGCGAGAGTAATAAATCAGAGGAGGTGGTTCAACTTCTTCATAAGATGGTTCAGAATGATGTGTCGTCAAATGACAACACTTGCGCCATAGTCGTAGACATGCTTTGCAAAGATGAAAAATATCAAGAATGCCTAGACTTGATTCCAAGCTTTCCTGTCCAAGAGCGTCAACATTGA

mRNA sequence

ATGCGCCTAGCTAAACTTTTGCCTGATTACTTCATGCTCAATGTGAATCAAGTCAGCGAAGGTCTTGCGACCATGGCAGCAATTTTAAGGAGAGGTTATATTCCTAATATAGTGACATATACGACCTTGATTAAGGGCTTGTGTATGGAACATAGGGTGGGTGGAAACTGGAAGGAGGCTAAACATTCGTTAAATGAGATGGTGGATCAAGGTGTTCGACCAGATATGGAAGGAAAGGTTATCAAGGCTCAGGAGTTCCTAAAGGCGGTGATTCAGACAGGCAATGTTCTAGATTCATTTACTTATACTTCAGAGGGGTTTTGTTTGGTTGGTGACTTGAATAGTGCGAGGGAACTCTTTGTTAGTATGCTAAGTAAGGGATGTGAACCTGATGTGATTAGTTACAATGTGCTAATCAATGGGTATTGTAAAACTAGGAAGGTGGAAGAAGCAATGAAGATTTACAATGAAATGCTTCTAGAGGGAGAGAGGCCAGGTGTGAAAACATATGGTGCCTTGTTATCGGGGCTTTTTCAGGCAGGCAAGGTTGGTGATGCAAAGAAGCTACTTGGTGTTATGGAAGCTTATGGTATTCCCGCAGATTCATGTATATATCTTATTTCCTTAGATGGGTTGTGTAAGAATGGTTGTTTATTTGAAGCGATGGAACTTTTTAAAGAGCTAAGATCATGCAACTTGAAATTGGATATTAAAATCTTTAATGGTCTAATTGATGGCCCGTGTAAACCAGGAAAACTTGATACTGCCTGGGAGCTTTTCGGAAAACTGTCCCAAGAAGGGCATCAACCAAATGTTGTGACTTATACCATTATGATCCATGGGTTTTGTAAAAAGGGACAAAGTAATAAATCAGAGGAGGTGGTTCAACTTCTTCATAAGATGGTTCAGAATGATGTGTCGTCAAATGACAACACTTGCGCCATAGTCGTAGACATGCTTTGCAAAGATGAAAAATATCAAGAATGCCTAGACTTGATTCCAAGCTTTCCTGTCCAAGAGCGTCAACATTGA

Coding sequence (CDS)

ATGCGCCTAGCTAAACTTTTGCCTGATTACTTCATGCTCAATGTGAATCAAGTCAGCGAAGGTCTTGCGACCATGGCAGCAATTTTAAGGAGAGGTTATATTCCTAATATAGTGACATATACGACCTTGATTAAGGGCTTGTGTATGGAACATAGGGTGGGTGGAAACTGGAAGGAGGCTAAACATTCGTTAAATGAGATGGTGGATCAAGGTGTTCGACCAGATATGGAAGGAAAGGTTATCAAGGCTCAGGAGTTCCTAAAGGCGGTGATTCAGACAGGCAATGTTCTAGATTCATTTACTTATACTTCAGAGGGGTTTTGTTTGGTTGGTGACTTGAATAGTGCGAGGGAACTCTTTGTTAGTATGCTAAGTAAGGGATGTGAACCTGATGTGATTAGTTACAATGTGCTAATCAATGGGTATTGTAAAACTAGGAAGGTGGAAGAAGCAATGAAGATTTACAATGAAATGCTTCTAGAGGGAGAGAGGCCAGGTGTGAAAACATATGGTGCCTTGTTATCGGGGCTTTTTCAGGCAGGCAAGGTTGGTGATGCAAAGAAGCTACTTGGTGTTATGGAAGCTTATGGTATTCCCGCAGATTCATGTATATATCTTATTTCCTTAGATGGGTTGTGTAAGAATGGTTGTTTATTTGAAGCGATGGAACTTTTTAAAGAGCTAAGATCATGCAACTTGAAATTGGATATTAAAATCTTTAATGGTCTAATTGATGGCCCGTGTAAACCAGGAAAACTTGATACTGCCTGGGAGCTTTTCGGAAAACTGTCCCAAGAAGGGCATCAACCAAATGTTGTGACTTATACCATTATGATCCATGGGTTTTGTAAAAAGGGACAAAGTAATAAATCAGAGGAGGTGGTTCAACTTCTTCATAAGATGGTTCAGAATGATGTGTCGTCAAATGACAACACTTGCGCCATAGTCGTAGACATGCTTTGCAAAGATGAAAAATATCAAGAATGCCTAGACTTGATTCCAAGCTTTCCTGTCCAAGAGCGTCAACATTGA

Protein sequence

MRLAKLLPDYFMLNVNQVSEGLATMAAILRRGYIPNIVTYTTLIKGLCMEHRVGGNWKEAKHSLNEMVDQGVRPDMEGKVIKAQEFLKAVIQTGNVLDSFTYTSEGFCLVGDLNSARELFVSMLSKGCEPDVISYNVLINGYCKTRKVEEAMKIYNEMLLEGERPGVKTYGALLSGLFQAGKVGDAKKLLGVMEAYGIPADSCIYLISLDGLCKNGCLFEAMELFKELRSCNLKLDIKIFNGLIDGPCKPGKLDTAWELFGKLSQEGHQPNVVTYTIMIHGFCKKGQSNKSEEVVQLLHKMVQNDVSSNDNTCAIVVDMLCKDEKYQECLDLIPSFPVQERQH
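
The protein sequence above corresponds to the CDS read codon by codon. The following is a minimal sketch of that translation, assuming the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are illustrative and are not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons of the CDS listed above.
print(translate("ATGCGCCTAGCTAAACTTTTGCCTGATTAC"))
```

The first ten codons translate to MRLAKLLPDY, matching the start of the protein sequence; the final codons GTC CAA GAG CGT CAA CAT followed by the stop codon TGA give VQERQH, matching its end.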
Homology
BLAST of Cla97C09G171075 vs. NCBI nr
Match: XP_038896203.1 (pentatricopeptide repeat-containing protein At1g63330-like [Benincasa hispida])

HSP 1 Score: 450.7 bits (1158), Expect = 1.2e-122
Identity = 259/502 (51.59%), Positives = 290/502 (57.77%), Query Frame = 0

Query: 1   MRLAKLLPDYFML--------NVNQVSEGLATMAAILRRGYIPNIVTYTTLIKGLCMEHR 60
           M LA L P++F L        NVN+VSEG A MA ILRRGYIP+ +TY+TLIKGLCME+R
Sbjct: 133 MCLAGLFPNFFTLNILINCLCNVNRVSEGFAAMAGILRRGYIPDKITYSTLIKGLCMEYR 192

Query: 61  V----------------------------------------------------------- 120
           +                                                           
Sbjct: 193 ISEATRLFMRMQKLGCRPDVVTYGTLIKGLCRTGNINIALKLHQEMLNETGQYGINCKPN 252

Query: 121 ------------------------------------------------GGNWKEAKHSLN 180
                                                           GG W+EAK   N
Sbjct: 253 VICYSSIIDGLCKDRREDEASELFEEMKTRGMIPDVISYTSLIHGFCWGGKWEEAKRLFN 312

Query: 181 EMVDQGVRPDM------------EGKVIKAQEFLKAVIQTGNVLDSFTYTS--EGFCLVG 240
           EMVDQGV+PDM            EGKVI+A++ L  +IQ G V +  TY S  EGFCLVG
Sbjct: 313 EMVDQGVQPDMVTFNVLIDMLCKEGKVIEAKKLLDVMIQRGIVPNLVTYNSLIEGFCLVG 372

Query: 241 DLNSARELFVSMLSKGCEPDVISYNVLINGYCKTRKVEEAMKIYNEMLLEGERPGVKTYG 300
           DLNS RELFVSM SKGCEPDVISY  LINGYCKT KV EAMK+YNEML  G+RP VKTYG
Sbjct: 373 DLNSGRELFVSMPSKGCEPDVISYTTLINGYCKTLKVNEAMKLYNEMLQVGKRPNVKTYG 432

Query: 301 ALLSGLFQAGKVGDAKKLLGVMEAYGIPADSCIYLISLDGLCKNGCLFEAMELFKELRSC 342
           ALL+GLFQAGKVGDAKKL GVM+AYG+P DSCIY I LDGLCKNGCLFEAME F EL+S 
Sbjct: 433 ALLTGLFQAGKVGDAKKLFGVMKAYGVPIDSCIYGIFLDGLCKNGCLFEAMEFFNELKSY 492
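
The percentages in an HSP summary line are the identity and positive counts divided by the alignment length, which includes gap columns; this is why the denominator (502 in the hit above) can exceed the query length. A small sketch that recomputes them from the summary line follows; the helper name `hsp_percentages` and the regular expression are assumptions made for illustration, not part of BLAST itself.

```python
import re

# Matches the summary line of an HSP; the label pattern also accepts a
# misspelled variant of "Positives" that some report renderers emit.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("no HSP statistics found")
    identical, length, positives = int(m.group(1)), int(m.group(2)), int(m.group(4))
    # Both ratios use the alignment length (gaps included) as the denominator.
    return round(100 * identical / length, 2), round(100 * positives / length, 2)

line = "Identity = 259/502 (51.59%), Positives = 290/502 (57.77%), Query Frame = 0"
print(hsp_percentages(line))
```

For the first hit, the recomputed values agree with the reported 51.59% identity and 57.77% positives.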

BLAST of Cla97C09G171075 vs. NCBI nr
Match: XP_038897332.1 (pentatricopeptide repeat-containing protein At3g22470, mitochondrial-like [Benincasa hispida])

HSP 1 Score: 442.2 bits (1136), Expect = 4.1e-120
Identity = 231/357 (64.71%), Positives = 261/357 (73.11%), Query Frame = 0

Query: 31  RGYIPNIVTYTTLIKGLCMEHRVGGNWKEAKHSLNEMVDQGVRPDM------------EG 90
           +G IP++++YT+LI G C     G  W+EAK   NEMVDQGVRP++            EG
Sbjct: 224 QGMIPDVISYTSLIHGFCW----GEKWEEAKRLFNEMVDQGVRPNVVTFNVLIDMLCKEG 283

Query: 91  KVIKAQEFLKAVIQTGNVLDSFTYTS--EGFCLVGDLNSARELFVSMLSKGCEPDVISYN 150
           KVIKA+E L  +IQ G V D  TYTS  EGFC VGDLNSARELF++M SKGCEPDVISY 
Sbjct: 284 KVIKAKELLDMMIQRGIVPDLVTYTSLIEGFCKVGDLNSARELFINMPSKGCEPDVISYT 343

Query: 151 VLINGYCKTRKVEEAMKIYNEMLLEGERPGVKTYGALLSGLFQAGKVGDAKKLLGVMEAY 210
           +LINGYCKT KV EAMK+YNEML  G+RP VKTYGALL+GLFQAGKVGDAKKL GVM+AY
Sbjct: 344 MLINGYCKTLKVNEAMKLYNEMLQVGKRPNVKTYGALLTGLFQAGKVGDAKKLFGVMKAY 403

Query: 211 GIPADSCIYLISLDGLCKNGCLFEAMELFKELRSCNLKLDIKIFNGLIDGPCKPGKLDTA 270
           G+  DSCIY I LDGLCKNGCLFEAMELF EL+S NLKLDI IFN LIDG CK G L+TA
Sbjct: 404 GVAIDSCIYGIFLDGLCKNGCLFEAMELFNELKSYNLKLDIGIFNCLIDGLCKVGTLETA 463

Query: 271 WELFGKLSQEGHQPNVVTYTIMIHGFCKKGQ----------------------------- 330
           WELF KLSQEG QPNVVTY IMIHGFC+KGQ                             
Sbjct: 464 WELFKKLSQEGLQPNVVTYNIMIHGFCRKGQVDKANILFQHMEENGCTPNVITCNTLLRG 523

Query: 331 ---SNKSEEVVQLLHKMVQNDVSSNDNTCAIVVDMLCKDEKYQECLDLIPSFPVQER 342
              SNKS+EVV+LLH+MVQ DVS +  TC IVVDMLCKDEKY+EC+DL+P FPVQ+R
Sbjct: 524 FCESNKSKEVVELLHRMVQRDVSPDVRTCTIVVDMLCKDEKYRECIDLLPRFPVQKR 576

BLAST of Cla97C09G171075 vs. NCBI nr
Match: KAA0059713.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK26151.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 420.2 bits (1079), Expect = 1.7e-113
Identity = 221/376 (58.78%), Positives = 264/376 (70.21%), Query Frame = 0

Query: 12  MLNVNQVSEGLATMAAILRRGYIPNIVTYTTLIKGLCMEHRVGGNWKEAKHSLNEMVDQG 71
           +  V +  E   T   +  +G +PN+++Y++LI G C      G W E+K   +EMVDQG
Sbjct: 196 LCKVGREDEAKETFEEMKAQGMVPNVISYSSLIHGFC----CAGKWDESKQLFDEMVDQG 255

Query: 72  VRPD------------MEGKVIKAQEFLKAVIQTGNVLDSFTYTS--EGFCLVGDLNSAR 131
           V+P+             EGKVI+A++ L+ +I++G V + FTY S  +GFC+VGDLNSAR
Sbjct: 256 VQPNKVTFSVLIDTLCKEGKVIEAKKLLELMIESGIVPNVFTYNSLLKGFCMVGDLNSAR 315

Query: 132 ELFVSMLSKGCEPDVISYNVLINGYCKTRKVEEAMKIYNEMLLEGERPGVKTYGALLSGL 191
           ELFVSM SKGCEPDVISY VLINGYCKT KVEEAMK+YN+MLL G+RP V TYGALL+GL
Sbjct: 316 ELFVSMPSKGCEPDVISYTVLINGYCKTLKVEEAMKLYNKMLLVGKRPNVITYGALLTGL 375

Query: 192 FQAGKVGDAKKLLGVMEAYGIPADSCIYLISLDGLCKNGCLFEAMELFKELRSCNLKLDI 251
           F AGKVGDAKKL   M+A+GI  +SCIY I LDGLCKNGCLFEAMELF +L+S N KL+I
Sbjct: 376 FLAGKVGDAKKLFSAMKAHGIAENSCIYSIFLDGLCKNGCLFEAMELFTKLKSYNFKLEI 435

Query: 252 KIFNGLIDGPCKPGKLDTAWELFGKLSQEGHQPNVVTYTIMIHGFCKKG----------- 311
           + ++ LIDG CK GKL+TAWELF KLSQEG QPNVVTY IMI GFCK G           
Sbjct: 436 ETYSCLIDGLCKAGKLETAWELFEKLSQEGLQPNVVTYNIMISGFCKAGLVDKANILFEK 495

Query: 312 ---------------------QSNKSEEVVQLLHKMVQNDVSSNDNTCAIVVDMLCKDEK 342
                                QSNKSEEVV+LLHKMVQ DVS + + C IVVDMLCKDEK
Sbjct: 496 MEENGCTPSIITYDILLRRFCQSNKSEEVVRLLHKMVQRDVSPDVSICTIVVDMLCKDEK 555

BLAST of Cla97C09G171075 vs. NCBI nr
Match: XP_008451225.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g22470, mitochondrial-like [Cucumis melo] >XP_016901074.1 PREDICTED: pentatricopeptide repeat-containing protein At3g22470, mitochondrial-like [Cucumis melo])

HSP 1 Score: 418.3 bits (1074), Expect = 6.3e-113
Identity = 250/503 (49.70%), Positives = 285/503 (56.66%), Query Frame = 0

Query: 1   MRLAKLLPDYFML--------NVNQVSEGLATMAAILRRGYIPNIVTYTTLIKGLCMEHR 60
           MRLA L P    L        NVN+VSE LA MA +LRRGYIPN+VTYTTLIKGLCMEHR
Sbjct: 66  MRLAGLSPSAITLNILVNCLCNVNRVSEALAGMAGLLRRGYIPNVVTYTTLIKGLCMEHR 125

Query: 61  VG-------------------------------GN------------------------- 120
           +                                GN                         
Sbjct: 126 ISEATRLFLRMQKLGCTPNVVTYGTLVKGLCQTGNVNIALKLHQEMLNDTSQYGINCKPN 185

Query: 121 ---------------------------------------------------WKEAKHSLN 180
                                                              W+E+K   +
Sbjct: 186 VFNYNIIIDGLCKVGREDEANELFEEMKAQGMIPNVISYSSLIHGFCCARKWEESKRLFD 245

Query: 181 EMVDQGVRPD------------MEGKVIKAQEFLKAVIQTGNVLDSFTYTS--EGFCLVG 240
           EMVDQGV+PD             EGKVI+A++  + +IQ G V D F Y+S  EGFC+VG
Sbjct: 246 EMVDQGVQPDKVTFSVLIDTLCKEGKVIEAKKLFEVMIQRGIVPDLFIYSSLMEGFCMVG 305

Query: 241 DLNSARELFVSMLSKGCEPDVISYNVLINGYCKTRKVEEAMKIYNEMLLEGERPGVKTYG 300
           DLNSARELFVSM SKGCEPDVISY VLINGYCKT KVEEAMK+YNEMLL G+RP V TYG
Sbjct: 306 DLNSARELFVSMPSKGCEPDVISYTVLINGYCKTLKVEEAMKLYNEMLLVGKRPNVITYG 365

Query: 301 ALLSGLFQAGKVGDAKKLLGVMEAYGIPADSCIYLISLDGLCKNGCLFEAMELFKELRSC 343
           ALL+GLF AGKVGDAKKL   M+A GI A+S IY I LDGLCKNGCLFEAM+LF EL+S 
Sbjct: 366 ALLTGLFLAGKVGDAKKLFSAMKARGISANSHIYGIILDGLCKNGCLFEAMKLFTELKSY 425

BLAST of Cla97C09G171075 vs. NCBI nr
Match: KAA0059628.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK08161.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 418.3 bits (1074), Expect = 6.3e-113
Identity = 250/503 (49.70%), Positives = 285/503 (56.66%), Query Frame = 0

Query: 1   MRLAKLLPDYFML--------NVNQVSEGLATMAAILRRGYIPNIVTYTTLIKGLCMEHR 60
           MRLA L P    L        NVN+VSE LA MA +LRRGYIPN+VTYTTLIKGLCMEHR
Sbjct: 66  MRLAGLSPSAITLNILVNCLCNVNRVSEALAGMAGLLRRGYIPNVVTYTTLIKGLCMEHR 125

Query: 61  VG-------------------------------GN------------------------- 120
           +                                GN                         
Sbjct: 126 ISEATRLFLRMQKLGCTPNVVTYGTLVKGLCQTGNVNIALKLHQEMLNDTSQYGINCKPN 185

Query: 121 ---------------------------------------------------WKEAKHSLN 180
                                                              W+E+K   +
Sbjct: 186 VFNYNIIIDGLCKVGREDEANELFEEMKAQGMIPNVISYSSLIHGFCCARKWEESKRLFD 245

Query: 181 EMVDQGVRPD------------MEGKVIKAQEFLKAVIQTGNVLDSFTYTS--EGFCLVG 240
           EMVDQGV+PD             EGKVI+A++  + +IQ G V D F Y+S  EGFC+VG
Sbjct: 246 EMVDQGVQPDKVTFSVLIDTLCKEGKVIEAKKLFEVMIQRGIVPDLFIYSSLMEGFCMVG 305

Query: 241 DLNSARELFVSMLSKGCEPDVISYNVLINGYCKTRKVEEAMKIYNEMLLEGERPGVKTYG 300
           DLNSARELFVSM SKGCEPDVISY VLINGYCKT KVEEAMK+YNEMLL G+RP V TYG
Sbjct: 306 DLNSARELFVSMPSKGCEPDVISYTVLINGYCKTLKVEEAMKLYNEMLLVGKRPNVITYG 365

Query: 301 ALLSGLFQAGKVGDAKKLLGVMEAYGIPADSCIYLISLDGLCKNGCLFEAMELFKELRSC 343
           ALL+GLF AGKVGDAKKL   M+A GI A+S IY I LDGLCKNGCLFEAM+LF EL+S 
Sbjct: 366 ALLTGLFLAGKVGDAKKLFSAMKARGISANSHIYGIILDGLCKNGCLFEAMKLFTELKSY 425

BLAST of Cla97C09G171075 vs. ExPASy Swiss-Prot
Match: Q6NQ83 (Pentatricopeptide repeat-containing protein At3g22470, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At3g22470 PE=1 SV=1)

HSP 1 Score: 220.3 bits (560), Expect = 3.3e-56
Identity = 113/301 (37.54%), Positives = 180/301 (59.80%), Query Frame = 0

Query: 31  RGYIPNIVTYTTLIKGLCMEHRVGGNWKEAKHSLNEMVDQGVRPDM------------EG 90
           +G   ++VTY++LI GLC +    G W +    L EM+ + + PD+            EG
Sbjct: 274 KGIKADVVTYSSLIGGLCND----GKWDDGAKMLREMIGRNIIPDVVTFSALIDVFVKEG 333

Query: 91  KVIKAQEFLKAVIQTGNVLDSFTYTS--EGFCLVGDLNSARELFVSMLSKGCEPDVISYN 150
           K+++A+E    +I  G   D+ TY S  +GFC    L+ A ++F  M+SKGCEPD+++Y+
Sbjct: 334 KLLEAKELYNEMITRGIAPDTITYNSLIDGFCKENCLHEANQMFDLMVSKGCEPDIVTYS 393

Query: 151 VLINGYCKTRKVEEAMKIYNEMLLEGERPGVKTYGALLSGLFQAGKVGDAKKLLGVMEAY 210
           +LIN YCK ++V++ M+++ E+  +G  P   TY  L+ G  Q+GK+  AK+L   M + 
Sbjct: 394 ILINSYCKAKRVDDGMRLFREISSKGLIPNTITYNTLVLGFCQSGKLNAAKELFQEMVSR 453

Query: 211 GIPADSCIYLISLDGLCKNGCLFEAMELFKELRSCNLKLDIKIFNGLIDGPCKPGKLDTA 270
           G+P     Y I LDGLC NG L +A+E+F++++   + L I I+N +I G C   K+D A
Sbjct: 454 GVPPSVVTYGILLDGLCDNGELNKALEIFEKMQKSRMTLGIGIYNIIIHGMCNASKVDDA 513

Query: 271 WELFGKLSQEGHQPNVVTYTIMIHGFCKKGQSNKSEEVVQLLHKMVQNDVSSNDNTCAIV 318
           W LF  LS +G +P+VVTY +MI G CKKG  ++++    L  KM ++  + +D T  I+
Sbjct: 514 WSLFCSLSDKGVKPDVVTYNVMIGGLCKKGSLSEAD---MLFRKMKEDGCTPDDFTYNIL 567

BLAST of Cla97C09G171075 vs. ExPASy Swiss-Prot
Match: Q9LQ14 (Pentatricopeptide repeat-containing protein At1g62930, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At1g62930 PE=2 SV=2)

HSP 1 Score: 218.4 bits (555), Expect = 1.3e-55
Identity = 119/336 (35.42%), Positives = 187/336 (55.65%), Query Frame = 0

Query: 12  MLNVNQVSEGLATMAAILRRGYIPNIVTYTTLIKGLCMEHRVGGNWKEAKHSLNEMVDQG 71
           + N   V++ L     +  +G  PN+VTY +LI+ LC      G W +A   L++M+++ 
Sbjct: 265 LCNYKNVNDALNLFTEMDNKGIRPNVVTYNSLIRCLCNY----GRWSDASRLLSDMIERK 324

Query: 72  VRPDM------------EGKVIKAQEFLKAVIQTGNVLDSFTYTS--EGFCLVGDLNSAR 131
           + P++            EGK+++A++    +I+     D FTY+S   GFC+   L+ A+
Sbjct: 325 INPNVVTFSALIDAFVKEGKLVEAEKLYDEMIKRSIDPDIFTYSSLINGFCMHDRLDEAK 384

Query: 132 ELFVSMLSKGCEPDVISYNVLINGYCKTRKVEEAMKIYNEMLLEGERPGVKTYGALLSGL 191
            +F  M+SK C P+V++YN LI G+CK ++VEE M+++ EM   G      TY  L+ GL
Sbjct: 385 HMFELMISKDCFPNVVTYNTLIKGFCKAKRVEEGMELFREMSQRGLVGNTVTYNTLIQGL 444

Query: 192 FQAGKVGDAKKLLGVMEAYGIPADSCIYLISLDGLCKNGCLFEAMELFKELRSCNLKLDI 251
           FQAG    A+K+   M + G+P D   Y I LDGLCK G L +A+ +F+ L+   ++ DI
Sbjct: 445 FQAGDCDMAQKIFKKMVSDGVPPDIITYSILLDGLCKYGKLEKALVVFEYLQKSKMEPDI 504

Query: 252 KIFNGLIDGPCKPGKLDTAWELFGKLSQEGHQPNVVTYTIMIHGFCKKGQSNKSEEVVQL 311
             +N +I+G CK GK++  W+LF  LS +G +PNV+ YT MI GFC+KG     EE   L
Sbjct: 505 YTYNIMIEGMCKAGKVEDGWDLFCSLSLKGVKPNVIIYTTMISGFCRKG---LKEEADAL 564

Query: 312 LHKMVQNDVSSNDNTCAIVVDMLCKDEKYQECLDLI 334
             +M ++    N  T   ++    +D       +LI
Sbjct: 565 FREMKEDGTLPNSGTYNTLIRARLRDGDKAASAELI 593

BLAST of Cla97C09G171075 vs. ExPASy Swiss-Prot
Match: Q3ECK2 (Pentatricopeptide repeat-containing protein At1g62680, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g62680 PE=2 SV=2)

HSP 1 Score: 215.3 bits (547), Expect = 1.1e-54
Identity = 117/315 (37.14%), Positives = 176/315 (55.87%), Query Frame = 0

Query: 12  MLNVNQVSEGLATMAAILRRGYIPNIVTYTTLIKGLCMEHRVGGNWKEAKHSLNEMVDQG 71
           +    +V++       I R+G  PN+VTYT L+ GLC   R    W +A   L++M+ + 
Sbjct: 200 LCKTKRVNDAFDFFKEIERKGIRPNVVTYTALVNGLCNSSR----WSDAARLLSDMIKKK 259

Query: 72  VRPDM------------EGKVIKAQEFLKAVIQTGNVLDSFTYTS--EGFCLVGDLNSAR 131
           + P++             GKV++A+E  + +++     D  TY+S   G CL   ++ A 
Sbjct: 260 ITPNVITYSALLDAFVKNGKVLEAKELFEEMVRMSIDPDIVTYSSLINGLCLHDRIDEAN 319

Query: 132 ELFVSMLSKGCEPDVISYNVLINGYCKTRKVEEAMKIYNEMLLEGERPGVKTYGALLSGL 191
           ++F  M+SKGC  DV+SYN LING+CK ++VE+ MK++ EM   G      TY  L+ G 
Sbjct: 320 QMFDLMVSKGCLADVVSYNTLINGFCKAKRVEDGMKLFREMSQRGLVSNTVTYNTLIQGF 379

Query: 192 FQAGKVGDAKKLLGVMEAYGIPADSCIYLISLDGLCKNGCLFEAMELFKELRSCNLKLDI 251
           FQAG V  A++    M+ +GI  D   Y I L GLC NG L +A+ +F++++   + LDI
Sbjct: 380 FQAGDVDKAQEFFSQMDFFGISPDIWTYNILLGGLCDNGELEKALVIFEDMQKREMDLDI 439

Query: 252 KIFNGLIDGPCKPGKLDTAWELFGKLSQEGHQPNVVTYTIMIHGFCKKGQSNKSEEVVQL 311
             +  +I G CK GK++ AW LF  LS +G +P++VTYT M+ G C KG  +   EV  L
Sbjct: 440 VTYTTVIRGMCKTGKVEEAWSLFCSLSLKGLKPDIVTYTTMMSGLCTKGLLH---EVEAL 499

Query: 312 LHKMVQNDVSSNDNT 313
             KM Q  +  ND T
Sbjct: 500 YTKMKQEGLMKNDCT 507

BLAST of Cla97C09G171075 vs. ExPASy Swiss-Prot
Match: Q9SXD8 (Pentatricopeptide repeat-containing protein At1g62590 OS=Arabidopsis thaliana OX=3702 GN=At1g62590 PE=2 SV=1)

HSP 1 Score: 213.8 bits (543), Expect = 3.1e-54
Identity = 123/355 (34.65%), Positives = 188/355 (52.96%), Query Frame = 0

Query: 1   MRLAKLLPDYFMLNV--------NQVSEGLATMAAILRRGYIPNIVTYTTLIKGLCMEHR 60
           M  AK+  D  + N           V + L     +  +G  PN+VTY++LI  LC    
Sbjct: 251 MEAAKIEADVVIFNTIIDSLCKYRHVDDALNLFKEMETKGIRPNVVTYSSLISCLCSY-- 310

Query: 61  VGGNWKEAKHSLNEMVDQGVRPDM------------EGKVIKAQEFLKAVIQTGNVLDSF 120
             G W +A   L++M+++ + P++            EGK ++A++    +I+     D F
Sbjct: 311 --GRWSDASQLLSDMIEKKINPNLVTFNALIDAFVKEGKFVEAEKLYDDMIKRSIDPDIF 370

Query: 121 TYTS--EGFCLVGDLNSARELFVSMLSKGCEPDVISYNVLINGYCKTRKVEEAMKIYNEM 180
           TY S   GFC+   L+ A+++F  M+SK C PDV++YN LI G+CK+++VE+  +++ EM
Sbjct: 371 TYNSLVNGFCMHDRLDKAKQMFEFMVSKDCFPDVVTYNTLIKGFCKSKRVEDGTELFREM 430

Query: 181 LLEGERPGVKTYGALLSGLFQAGKVGDAKKLLGVMEAYGIPADSCIYLISLDGLCKNGCL 240
              G      TY  L+ GLF  G   +A+K+   M + G+P D   Y I LDGLC NG L
Sbjct: 431 SHRGLVGDTVTYTTLIQGLFHDGDCDNAQKVFKQMVSDGVPPDIMTYSILLDGLCNNGKL 490

Query: 241 FEAMELFKELRSCNLKLDIKIFNGLIDGPCKPGKLDTAWELFGKLSQEGHQPNVVTYTIM 300
            +A+E+F  ++   +KLDI I+  +I+G CK GK+D  W+LF  LS +G +PNVVTY  M
Sbjct: 491 EKALEVFDYMQKSEIKLDIYIYTTMIEGMCKAGKVDDGWDLFCSLSLKGVKPNVVTYNTM 550

Query: 301 IHGFCKKGQSNKSEEVVQLLHKMVQNDVSSNDNTCAIVVDMLCKDEKYQECLDLI 334
           I G C K      +E   LL KM ++    N  T   ++    +D       +LI
Sbjct: 551 ISGLCSK---RLLQEAYALLKKMKEDGPLPNSGTYNTLIRAHLRDGDKAASAELI 598

BLAST of Cla97C09G171075 vs. ExPASy Swiss-Prot
Match: Q9C8T7 (Pentatricopeptide repeat-containing protein At1g63330 OS=Arabidopsis thaliana OX=3702 GN=At1g63330 PE=2 SV=2)

HSP 1 Score: 207.6 bits (527), Expect = 2.2e-52
Identity = 121/355 (34.08%), Positives = 187/355 (52.68%), Query Frame = 0

Query: 1   MRLAKLLPDYFMLNV--------NQVSEGLATMAAILRRGYIPNIVTYTTLIKGLCMEHR 60
           M  AK+  D  + N           V + L     +  +G  PN+VTY++LI  LC    
Sbjct: 176 MEAAKIEADVVIFNTIIDSLCKYRHVDDALNLFKEMETKGIRPNVVTYSSLISCLCSY-- 235

Query: 61  VGGNWKEAKHSLNEMVDQGVRPDM------------EGKVIKAQEFLKAVIQTGNVLDSF 120
             G W +A   L++M+++ + P++            EGK ++A++    +I+     D F
Sbjct: 236 --GRWSDASQLLSDMIEKKINPNLVTFNALIDAFVKEGKFVEAEKLHDDMIKRSIDPDIF 295

Query: 121 TYTS--EGFCLVGDLNSARELFVSMLSKGCEPDVISYNVLINGYCKTRKVEEAMKIYNEM 180
           TY S   GFC+   L+ A+++F  M+SK C PD+ +YN LI G+CK+++VE+  +++ EM
Sbjct: 296 TYNSLINGFCMHDRLDKAKQMFEFMVSKDCFPDLDTYNTLIKGFCKSKRVEDGTELFREM 355

Query: 181 LLEGERPGVKTYGALLSGLFQAGKVGDAKKLLGVMEAYGIPADSCIYLISLDGLCKNGCL 240
              G      TY  L+ GLF  G   +A+K+   M + G+P D   Y I LDGLC NG L
Sbjct: 356 SHRGLVGDTVTYTTLIQGLFHDGDCDNAQKVFKQMVSDGVPPDIMTYSILLDGLCNNGKL 415

Query: 241 FEAMELFKELRSCNLKLDIKIFNGLIDGPCKPGKLDTAWELFGKLSQEGHQPNVVTYTIM 300
            +A+E+F  ++   +KLDI I+  +I+G CK GK+D  W+LF  LS +G +PNVVTY  M
Sbjct: 416 EKALEVFDYMQKSEIKLDIYIYTTMIEGMCKAGKVDDGWDLFCSLSLKGVKPNVVTYNTM 475

Query: 301 IHGFCKKGQSNKSEEVVQLLHKMVQNDVSSNDNTCAIVVDMLCKDEKYQECLDLI 334
           I G C K      +E   LL KM ++    +  T   ++    +D       +LI
Sbjct: 476 ISGLCSK---RLLQEAYALLKKMKEDGPLPDSGTYNTLIRAHLRDGDKAASAELI 523

BLAST of Cla97C09G171075 vs. ExPASy TrEMBL
Match: A0A5A7UUW8 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold111G00730 PE=4 SV=1)

HSP 1 Score: 420.2 bits (1079), Expect = 8.1e-114
Identity = 221/376 (58.78%), Positives = 264/376 (70.21%), Query Frame = 0

Query: 12  MLNVNQVSEGLATMAAILRRGYIPNIVTYTTLIKGLCMEHRVGGNWKEAKHSLNEMVDQG 71
           +  V +  E   T   +  +G +PN+++Y++LI G C      G W E+K   +EMVDQG
Sbjct: 196 LCKVGREDEAKETFEEMKAQGMVPNVISYSSLIHGFC----CAGKWDESKQLFDEMVDQG 255

Query: 72  VRPD------------MEGKVIKAQEFLKAVIQTGNVLDSFTYTS--EGFCLVGDLNSAR 131
           V+P+             EGKVI+A++ L+ +I++G V + FTY S  +GFC+VGDLNSAR
Sbjct: 256 VQPNKVTFSVLIDTLCKEGKVIEAKKLLELMIESGIVPNVFTYNSLLKGFCMVGDLNSAR 315

Query: 132 ELFVSMLSKGCEPDVISYNVLINGYCKTRKVEEAMKIYNEMLLEGERPGVKTYGALLSGL 191
           ELFVSM SKGCEPDVISY VLINGYCKT KVEEAMK+YN+MLL G+RP V TYGALL+GL
Sbjct: 316 ELFVSMPSKGCEPDVISYTVLINGYCKTLKVEEAMKLYNKMLLVGKRPNVITYGALLTGL 375

Query: 192 FQAGKVGDAKKLLGVMEAYGIPADSCIYLISLDGLCKNGCLFEAMELFKELRSCNLKLDI 251
           F AGKVGDAKKL   M+A+GI  +SCIY I LDGLCKNGCLFEAMELF +L+S N KL+I
Sbjct: 376 FLAGKVGDAKKLFSAMKAHGIAENSCIYSIFLDGLCKNGCLFEAMELFTKLKSYNFKLEI 435

Query: 252 KIFNGLIDGPCKPGKLDTAWELFGKLSQEGHQPNVVTYTIMIHGFCKKG----------- 311
           + ++ LIDG CK GKL+TAWELF KLSQEG QPNVVTY IMI GFCK G           
Sbjct: 436 ETYSCLIDGLCKAGKLETAWELFEKLSQEGLQPNVVTYNIMISGFCKAGLVDKANILFEK 495

Query: 312 ---------------------QSNKSEEVVQLLHKMVQNDVSSNDNTCAIVVDMLCKDEK 342
                                QSNKSEEVV+LLHKMVQ DVS + + C IVVDMLCKDEK
Sbjct: 496 MEENGCTPSIITYDILLRRFCQSNKSEEVVRLLHKMVQRDVSPDVSICTIVVDMLCKDEK 555

BLAST of Cla97C09G171075 vs. ExPASy TrEMBL
Match: A0A1S4DYL6 (pentatricopeptide repeat-containing protein At3g22470, mitochondrial-like OS=Cucumis melo OX=3656 GN=LOC103492586 PE=4 SV=1)

HSP 1 Score: 418.3 bits (1074), Expect = 3.1e-113
Identity = 250/503 (49.70%), Positives = 285/503 (56.66%), Query Frame = 0

Query: 1   MRLAKLLPDYFML--------NVNQVSEGLATMAAILRRGYIPNIVTYTTLIKGLCMEHR 60
           MRLA L P    L        NVN+VSE LA MA +LRRGYIPN+VTYTTLIKGLCMEHR
Sbjct: 66  MRLAGLSPSAITLNILVNCLCNVNRVSEALAGMAGLLRRGYIPNVVTYTTLIKGLCMEHR 125

Query: 61  VG-------------------------------GN------------------------- 120
           +                                GN                         
Sbjct: 126 ISEATRLFLRMQKLGCTPNVVTYGTLVKGLCQTGNVNIALKLHQEMLNDTSQYGINCKPN 185

Query: 121 ---------------------------------------------------WKEAKHSLN 180
                                                              W+E+K   +
Sbjct: 186 VFNYNIIIDGLCKVGREDEANELFEEMKAQGMIPNVISYSSLIHGFCCARKWEESKRLFD 245

Query: 181 EMVDQGVRPD------------MEGKVIKAQEFLKAVIQTGNVLDSFTYTS--EGFCLVG 240
           EMVDQGV+PD             EGKVI+A++  + +IQ G V D F Y+S  EGFC+VG
Sbjct: 246 EMVDQGVQPDKVTFSVLIDTLCKEGKVIEAKKLFEVMIQRGIVPDLFIYSSLMEGFCMVG 305

Query: 241 DLNSARELFVSMLSKGCEPDVISYNVLINGYCKTRKVEEAMKIYNEMLLEGERPGVKTYG 300
           DLNSARELFVSM SKGCEPDVISY VLINGYCKT KVEEAMK+YNEMLL G+RP V TYG
Sbjct: 306 DLNSARELFVSMPSKGCEPDVISYTVLINGYCKTLKVEEAMKLYNEMLLVGKRPNVITYG 365

Query: 301 ALLSGLFQAGKVGDAKKLLGVMEAYGIPADSCIYLISLDGLCKNGCLFEAMELFKELRSC 343
           ALL+GLF AGKVGDAKKL   M+A GI A+S IY I LDGLCKNGCLFEAM+LF EL+S 
Sbjct: 366 ALLTGLFLAGKVGDAKKLFSAMKARGISANSHIYGIILDGLCKNGCLFEAMKLFTELKSY 425

BLAST of Cla97C09G171075 vs. ExPASy TrEMBL
Match: A0A5D3C8J0 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold886G00440 PE=4 SV=1)

HSP 1 Score: 418.3 bits (1074), Expect = 3.1e-113
Identity = 250/503 (49.70%), Positives = 285/503 (56.66%), Query Frame = 0

Query: 1   MRLAKLLPDYFML--------NVNQVSEGLATMAAILRRGYIPNIVTYTTLIKGLCMEHR 60
           MRLA L P    L        NVN+VSE LA MA +LRRGYIPN+VTYTTLIKGLCMEHR
Sbjct: 66  MRLAGLSPSAITLNILVNCLCNVNRVSEALAGMAGLLRRGYIPNVVTYTTLIKGLCMEHR 125

Query: 61  VG-------------------------------GN------------------------- 120
           +                                GN                         
Sbjct: 126 ISEATRLFLRMQKLGCTPNVVTYGTLVKGLCQTGNVNIALKLHQEMLNDTSQYGINCKPN 185

Query: 121 ---------------------------------------------------WKEAKHSLN 180
                                                              W+E+K   +
Sbjct: 186 VFNYNIIIDGLCKVGREDEANELFEEMKAQGMIPNVISYSSLIHGFCCARKWEESKRLFD 245

Query: 181 EMVDQGVRPD------------MEGKVIKAQEFLKAVIQTGNVLDSFTYTS--EGFCLVG 240
           EMVDQGV+PD             EGKVI+A++  + +IQ G V D F Y+S  EGFC+VG
Sbjct: 246 EMVDQGVQPDKVTFSVLIDTLCKEGKVIEAKKLFEVMIQRGIVPDLFIYSSLMEGFCMVG 305

Query: 241 DLNSARELFVSMLSKGCEPDVISYNVLINGYCKTRKVEEAMKIYNEMLLEGERPGVKTYG 300
           DLNSARELFVSM SKGCEPDVISY VLINGYCKT KVEEAMK+YNEMLL G+RP V TYG
Sbjct: 306 DLNSARELFVSMPSKGCEPDVISYTVLINGYCKTLKVEEAMKLYNEMLLVGKRPNVITYG 365

Query: 301 ALLSGLFQAGKVGDAKKLLGVMEAYGIPADSCIYLISLDGLCKNGCLFEAMELFKELRSC 343
           ALL+GLF AGKVGDAKKL   M+A GI A+S IY I LDGLCKNGCLFEAM+LF EL+S 
Sbjct: 366 ALLTGLFLAGKVGDAKKLFSAMKARGISANSHIYGIILDGLCKNGCLFEAMKLFTELKSY 425

BLAST of Cla97C09G171075 vs. ExPASy TrEMBL
Match: A0A6J1GP31 (putative pentatricopeptide repeat-containing protein At1g12700, mitochondrial OS=Cucurbita moschata OX=3662 GN=LOC111456184 PE=4 SV=1)

HSP 1 Score: 390.2 bits (1001), Expect = 9.0e-105
Identity = 232/501 (46.31%), Positives = 278/501 (55.49%), Query Frame = 0

Query: 1   MRLAKLLPDYFML--------NVNQVSEGLATMAAILRRGYIPNIVTYTTLIKGLCMEHR 60
           M LA LLP+Y  L        NVN++SEGLA MA I+RRG+IPNIVTYT+LIKGLCMEHR
Sbjct: 138 MSLAGLLPNYITLNILLNCLCNVNRISEGLAAMAGIIRRGFIPNIVTYTSLIKGLCMEHR 197

Query: 61  V----------------------------------------------------------- 120
           +                                                           
Sbjct: 198 ISEATRLFMRMQKLGCRPNVITYGTLIKGLCQTGNTNIALKLHEEMLNGTGRYGISCKPN 257

Query: 121 ------------------------------------------------GGNWKEAKHSLN 180
                                                           GG W+EAK   N
Sbjct: 258 VISYSTIIDGLCKDGREDKARELFEEMKARRMLPDVISYSSLIHGFCNGGKWEEAKCLFN 317

Query: 181 EMVDQGVRPD------------MEGKVIKAQEFLKAVIQTGNVLDSFTYTS--EGFCLVG 240
           EMVD G++P+              GKVI+A E L+ +IQ GNV D FTY +  +GFCLV 
Sbjct: 318 EMVDLGIQPNAVTFNVLMDILCKAGKVIEANELLEVMIQRGNVPDLFTYNTLMDGFCLVS 377

Query: 241 DLNSARELFVSMLSKGCEPDVISYNVLINGYCKTRKVEEAMKIYNEMLLEGERPGVKTYG 300
           DL+SARELF+SM SKGCEP+VISYNVLINGYCK  KVEEAMKIYNEML  G +P + TY 
Sbjct: 378 DLDSARELFLSMPSKGCEPNVISYNVLINGYCKNWKVEEAMKIYNEMLQVGIKPSMITYN 437

Query: 301 ALLSGLFQAGKVGDAKKLLGVMEAYGIPADSCIYLISLDGLCKNGCLFEAMELFKELRSC 341
           ALL+GLFQAGKV DAKK+ GV++A+G+   S    I +DGLCKN CL EAME+F EL S 
Sbjct: 438 ALLTGLFQAGKVNDAKKIFGVIQAHGLVPSSSTLSIFVDGLCKNDCLLEAMEIFNEL-SY 497

BLAST of Cla97C09G171075 vs. ExPASy TrEMBL
Match: A0A6J1DSW3 (pentatricopeptide repeat-containing protein At1g63330-like OS=Momordica charantia OX=3673 GN=LOC111022854 PE=4 SV=1)

HSP 1 Score: 386.7 bits (992), Expect = 9.9e-104
Identity = 230/502 (45.82%), Positives = 275/502 (54.78%), Query Frame = 0

Query: 1   MRLAKLLPDYFML--------NVNQVSEGLATMAAILRRGYIPNIVTYTTLIKGLCMEHR 60
           M LA +LP+Y  L        NVN+VSEGLA MA I+RRGYIPNIVTYT+LIKGLCMEHR
Sbjct: 140 MSLAGILPNYITLNILLNCLCNVNRVSEGLAAMAGIIRRGYIPNIVTYTSLIKGLCMEHR 199

Query: 61  V----------------------------------------------------------- 120
           +                                                           
Sbjct: 200 ISEATRLFMRMQKLGCTPNVITYGTLIKGLCQTGNTNIALKLHEEMLNGTGRYGITCKPN 259

Query: 121 ------------------------------------------------GGNWKEAKHSLN 180
                                                           GG W+EAK   N
Sbjct: 260 VICYSTIIDGLCKDGLEDKARELFEEMKAQGMLPDVISYSSLIHGFCYGGKWEEAKSLFN 319

Query: 181 EMVDQGVRPDM------------EGKVIKAQEFLKAVIQTG-NVLDSFTYTS--EGFCLV 240
           EMVD GV+P++             GKVI+A+E L+ ++Q G N  D FTY +  +GFCLV
Sbjct: 320 EMVDHGVQPNVVTFNVLMDMLCKAGKVIEAKELLELMVQGGNNAPDLFTYNTLMDGFCLV 379

Query: 241 GDLNSARELFVSMLSKGCEPDVISYNVLINGYCKTRKVEEAMKIYNEMLLEGERPGVKTY 300
           GDLNSARELF++M +KGCEP+VISYNVLINGYCK  K+EEAMK+YNEML  G RP V TY
Sbjct: 380 GDLNSARELFINMPNKGCEPNVISYNVLINGYCKNWKMEEAMKLYNEMLQVGIRPSVITY 439

Query: 301 GALLSGLFQAGKVGDAKKLLGVMEAYGIPADSCIYLISLDGLCKNGCLFEAMELFKELRS 341
            +LL+GLFQAG V DAKKL GV++A G+   S  Y   LDGLCKN CL EA+ELF  L+ 
Sbjct: 440 NSLLTGLFQAGMVVDAKKLFGVIQANGLAPSSSTYSTFLDGLCKNDCLLEAIELFNGLKP 499

BLAST of Cla97C09G171075 vs. TAIR 10
Match: AT3G22470.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 220.3 bits (560), Expect = 2.4e-57
Identity = 113/301 (37.54%), Positives = 180/301 (59.80%), Query Frame = 0

Query: 31  RGYIPNIVTYTTLIKGLCMEHRVGGNWKEAKHSLNEMVDQGVRPDM------------EG 90
           +G   ++VTY++LI GLC +    G W +    L EM+ + + PD+            EG
Sbjct: 274 KGIKADVVTYSSLIGGLCND----GKWDDGAKMLREMIGRNIIPDVVTFSALIDVFVKEG 333

Query: 91  KVIKAQEFLKAVIQTGNVLDSFTYTS--EGFCLVGDLNSARELFVSMLSKGCEPDVISYN 150
           K+++A+E    +I  G   D+ TY S  +GFC    L+ A ++F  M+SKGCEPD+++Y+
Sbjct: 334 KLLEAKELYNEMITRGIAPDTITYNSLIDGFCKENCLHEANQMFDLMVSKGCEPDIVTYS 393

Query: 151 VLINGYCKTRKVEEAMKIYNEMLLEGERPGVKTYGALLSGLFQAGKVGDAKKLLGVMEAY 210
           +LIN YCK ++V++ M+++ E+  +G  P   TY  L+ G  Q+GK+  AK+L   M + 
Sbjct: 394 ILINSYCKAKRVDDGMRLFREISSKGLIPNTITYNTLVLGFCQSGKLNAAKELFQEMVSR 453

Query: 211 GIPADSCIYLISLDGLCKNGCLFEAMELFKELRSCNLKLDIKIFNGLIDGPCKPGKLDTA 270
           G+P     Y I LDGLC NG L +A+E+F++++   + L I I+N +I G C   K+D A
Sbjct: 454 GVPPSVVTYGILLDGLCDNGELNKALEIFEKMQKSRMTLGIGIYNIIIHGMCNASKVDDA 513

Query: 271 WELFGKLSQEGHQPNVVTYTIMIHGFCKKGQSNKSEEVVQLLHKMVQNDVSSNDNTCAIV 318
           W LF  LS +G +P+VVTY +MI G CKKG  ++++    L  KM ++  + +D T  I+
Sbjct: 514 WSLFCSLSDKGVKPDVVTYNVMIGGLCKKGSLSEAD---MLFRKMKEDGCTPDDFTYNIL 567

BLAST of Cla97C09G171075 vs. TAIR 10
Match: AT1G62930.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 218.4 bits (555), Expect = 9.0e-57
Identity = 119/336 (35.42%), Positives = 187/336 (55.65%), Query Frame = 0

Query: 12  MLNVNQVSEGLATMAAILRRGYIPNIVTYTTLIKGLCMEHRVGGNWKEAKHSLNEMVDQG 71
           + N   V++ L     +  +G  PN+VTY +LI+ LC      G W +A   L++M+++ 
Sbjct: 265 LCNYKNVNDALNLFTEMDNKGIRPNVVTYNSLIRCLCNY----GRWSDASRLLSDMIERK 324

Query: 72  VRPDM------------EGKVIKAQEFLKAVIQTGNVLDSFTYTS--EGFCLVGDLNSAR 131
           + P++            EGK+++A++    +I+     D FTY+S   GFC+   L+ A+
Sbjct: 325 INPNVVTFSALIDAFVKEGKLVEAEKLYDEMIKRSIDPDIFTYSSLINGFCMHDRLDEAK 384

Query: 132 ELFVSMLSKGCEPDVISYNVLINGYCKTRKVEEAMKIYNEMLLEGERPGVKTYGALLSGL 191
            +F  M+SK C P+V++YN LI G+CK ++VEE M+++ EM   G      TY  L+ GL
Sbjct: 385 HMFELMISKDCFPNVVTYNTLIKGFCKAKRVEEGMELFREMSQRGLVGNTVTYNTLIQGL 444

Query: 192 FQAGKVGDAKKLLGVMEAYGIPADSCIYLISLDGLCKNGCLFEAMELFKELRSCNLKLDI 251
           FQAG    A+K+   M + G+P D   Y I LDGLCK G L +A+ +F+ L+   ++ DI
Sbjct: 445 FQAGDCDMAQKIFKKMVSDGVPPDIITYSILLDGLCKYGKLEKALVVFEYLQKSKMEPDI 504

Query: 252 KIFNGLIDGPCKPGKLDTAWELFGKLSQEGHQPNVVTYTIMIHGFCKKGQSNKSEEVVQL 311
             +N +I+G CK GK++  W+LF  LS +G +PNV+ YT MI GFC+KG     EE   L
Sbjct: 505 YTYNIMIEGMCKAGKVEDGWDLFCSLSLKGVKPNVIIYTTMISGFCRKG---LKEEADAL 564

Query: 312 LHKMVQNDVSSNDNTCAIVVDMLCKDEKYQECLDLI 334
             +M ++    N  T   ++    +D       +LI
Sbjct: 565 FREMKEDGTLPNSGTYNTLIRARLRDGDKAASAELI 593

BLAST of Cla97C09G171075 vs. TAIR 10
Match: AT1G62680.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 215.3 bits (547), Expect = 7.6e-56
Identity = 117/315 (37.14%), Positives = 176/315 (55.87%), Query Frame = 0

Query: 12  MLNVNQVSEGLATMAAILRRGYIPNIVTYTTLIKGLCMEHRVGGNWKEAKHSLNEMVDQG 71
           +    +V++       I R+G  PN+VTYT L+ GLC   R    W +A   L++M+ + 
Sbjct: 200 LCKTKRVNDAFDFFKEIERKGIRPNVVTYTALVNGLCNSSR----WSDAARLLSDMIKKK 259

Query: 72  VRPDM------------EGKVIKAQEFLKAVIQTGNVLDSFTYTS--EGFCLVGDLNSAR 131
           + P++             GKV++A+E  + +++     D  TY+S   G CL   ++ A 
Sbjct: 260 ITPNVITYSALLDAFVKNGKVLEAKELFEEMVRMSIDPDIVTYSSLINGLCLHDRIDEAN 319

Query: 132 ELFVSMLSKGCEPDVISYNVLINGYCKTRKVEEAMKIYNEMLLEGERPGVKTYGALLSGL 191
           ++F  M+SKGC  DV+SYN LING+CK ++VE+ MK++ EM   G      TY  L+ G 
Sbjct: 320 QMFDLMVSKGCLADVVSYNTLINGFCKAKRVEDGMKLFREMSQRGLVSNTVTYNTLIQGF 379

Query: 192 FQAGKVGDAKKLLGVMEAYGIPADSCIYLISLDGLCKNGCLFEAMELFKELRSCNLKLDI 251
           FQAG V  A++    M+ +GI  D   Y I L GLC NG L +A+ +F++++   + LDI
Sbjct: 380 FQAGDVDKAQEFFSQMDFFGISPDIWTYNILLGGLCDNGELEKALVIFEDMQKREMDLDI 439

Query: 252 KIFNGLIDGPCKPGKLDTAWELFGKLSQEGHQPNVVTYTIMIHGFCKKGQSNKSEEVVQL 311
             +  +I G CK GK++ AW LF  LS +G +P++VTYT M+ G C KG  +   EV  L
Sbjct: 440 VTYTTVIRGMCKTGKVEEAWSLFCSLSLKGLKPDIVTYTTMMSGLCTKGLLH---EVEAL 499

Query: 312 LHKMVQNDVSSNDNT 313
             KM Q  +  ND T
Sbjct: 500 YTKMKQEGLMKNDCT 507

BLAST of Cla97C09G171075 vs. TAIR 10
Match: AT1G62590.1 (pentatricopeptide (PPR) repeat-containing protein)

HSP 1 Score: 213.8 bits (543), Expect = 2.2e-55
Identity = 123/355 (34.65%), Positives = 188/355 (52.96%), Query Frame = 0

Query: 1   MRLAKLLPDYFMLNV--------NQVSEGLATMAAILRRGYIPNIVTYTTLIKGLCMEHR 60
           M  AK+  D  + N           V + L     +  +G  PN+VTY++LI  LC    
Sbjct: 251 MEAAKIEADVVIFNTIIDSLCKYRHVDDALNLFKEMETKGIRPNVVTYSSLISCLCSY-- 310

Query: 61  VGGNWKEAKHSLNEMVDQGVRPDM------------EGKVIKAQEFLKAVIQTGNVLDSF 120
             G W +A   L++M+++ + P++            EGK ++A++    +I+     D F
Sbjct: 311 --GRWSDASQLLSDMIEKKINPNLVTFNALIDAFVKEGKFVEAEKLYDDMIKRSIDPDIF 370

Query: 121 TYTS--EGFCLVGDLNSARELFVSMLSKGCEPDVISYNVLINGYCKTRKVEEAMKIYNEM 180
           TY S   GFC+   L+ A+++F  M+SK C PDV++YN LI G+CK+++VE+  +++ EM
Sbjct: 371 TYNSLVNGFCMHDRLDKAKQMFEFMVSKDCFPDVVTYNTLIKGFCKSKRVEDGTELFREM 430

Query: 181 LLEGERPGVKTYGALLSGLFQAGKVGDAKKLLGVMEAYGIPADSCIYLISLDGLCKNGCL 240
              G      TY  L+ GLF  G   +A+K+   M + G+P D   Y I LDGLC NG L
Sbjct: 431 SHRGLVGDTVTYTTLIQGLFHDGDCDNAQKVFKQMVSDGVPPDIMTYSILLDGLCNNGKL 490

Query: 241 FEAMELFKELRSCNLKLDIKIFNGLIDGPCKPGKLDTAWELFGKLSQEGHQPNVVTYTIM 300
            +A+E+F  ++   +KLDI I+  +I+G CK GK+D  W+LF  LS +G +PNVVTY  M
Sbjct: 491 EKALEVFDYMQKSEIKLDIYIYTTMIEGMCKAGKVDDGWDLFCSLSLKGVKPNVVTYNTM 550

Query: 301 IHGFCKKGQSNKSEEVVQLLHKMVQNDVSSNDNTCAIVVDMLCKDEKYQECLDLI 334
           I G C K      +E   LL KM ++    N  T   ++    +D       +LI
Sbjct: 551 ISGLCSK---RLLQEAYALLKKMKEDGPLPNSGTYNTLIRAHLRDGDKAASAELI 598

BLAST of Cla97C09G171075 vs. TAIR 10
Match: AT1G63330.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 207.6 bits (527), Expect = 1.6e-53
Identity = 121/355 (34.08%), Positives = 187/355 (52.68%), Query Frame = 0

Query: 1   MRLAKLLPDYFMLNV--------NQVSEGLATMAAILRRGYIPNIVTYTTLIKGLCMEHR 60
           M  AK+  D  + N           V + L     +  +G  PN+VTY++LI  LC    
Sbjct: 176 MEAAKIEADVVIFNTIIDSLCKYRHVDDALNLFKEMETKGIRPNVVTYSSLISCLCSY-- 235

Query: 61  VGGNWKEAKHSLNEMVDQGVRPDM------------EGKVIKAQEFLKAVIQTGNVLDSF 120
             G W +A   L++M+++ + P++            EGK ++A++    +I+     D F
Sbjct: 236 --GRWSDASQLLSDMIEKKINPNLVTFNALIDAFVKEGKFVEAEKLHDDMIKRSIDPDIF 295

Query: 121 TYTS--EGFCLVGDLNSARELFVSMLSKGCEPDVISYNVLINGYCKTRKVEEAMKIYNEM 180
           TY S   GFC+   L+ A+++F  M+SK C PD+ +YN LI G+CK+++VE+  +++ EM
Sbjct: 296 TYNSLINGFCMHDRLDKAKQMFEFMVSKDCFPDLDTYNTLIKGFCKSKRVEDGTELFREM 355

Query: 181 LLEGERPGVKTYGALLSGLFQAGKVGDAKKLLGVMEAYGIPADSCIYLISLDGLCKNGCL 240
              G      TY  L+ GLF  G   +A+K+   M + G+P D   Y I LDGLC NG L
Sbjct: 356 SHRGLVGDTVTYTTLIQGLFHDGDCDNAQKVFKQMVSDGVPPDIMTYSILLDGLCNNGKL 415

Query: 241 FEAMELFKELRSCNLKLDIKIFNGLIDGPCKPGKLDTAWELFGKLSQEGHQPNVVTYTIM 300
            +A+E+F  ++   +KLDI I+  +I+G CK GK+D  W+LF  LS +G +PNVVTY  M
Sbjct: 416 EKALEVFDYMQKSEIKLDIYIYTTMIEGMCKAGKVDDGWDLFCSLSLKGVKPNVVTYNTM 475

Query: 301 IHGFCKKGQSNKSEEVVQLLHKMVQNDVSSNDNTCAIVVDMLCKDEKYQECLDLI 334
           I G C K      +E   LL KM ++    +  T   ++    +D       +LI
Sbjct: 476 ISGLCSK---RLLQEAYALLKKMKEDGPLPDSGTYNTLIRAHLRDGDKAASAELI 523
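
The percentages in each HSP summary line follow from the counts printed next to them: identities (or positives) divided by the alignment length. As a minimal sketch, assuming the summary-line layout used in the TAIR 10 hits above, the Python below recomputes them and compares the result with the reported values. The function name `check_hsp_stats` and the regular expression are illustrative and not part of any BLAST tooling.

```python
import re

# Pattern for the HSP summary line; "Posi?tives" also accepts the
# "Postives" spelling that some report renderers emit.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_hsp_stats(line):
    """Recompute identity/positive percentages and compare with the report."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    identity = round(100 * int(ident) / int(ident_len), 2)
    positives = round(100 * int(pos) / int(pos_len), 2)
    return {
        "identity_pct": identity,
        "positives_pct": positives,
        # True when both recomputed values agree with the printed ones.
        "consistent": (
            abs(identity - float(ident_pct)) < 0.005
            and abs(positives - float(pos_pct)) < 0.005
        ),
    }


# Summary line of the AT1G62930.1 hit, copied from the report.
line = "Identity = 119/336 (35.42%), Positives = 187/336 (55.65%), Query Frame = 0"
print(check_hsp_stats(line))
```

The same call can be applied to the other three TAIR 10 summary lines to confirm that the reported percentages match their counts.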

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038896203.1 | 1.2e-122 | 51.59 | pentatricopeptide repeat-containing protein At1g63330-like [Benincasa hispida]
XP_038897332.1 | 4.1e-120 | 64.71 | pentatricopeptide repeat-containing protein At3g22470, mitochondrial-like [Benin...
KAA0059713.1 | 1.7e-113 | 58.78 | pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK26151...
XP_008451225.1 | 6.3e-113 | 49.70 | PREDICTED: pentatricopeptide repeat-containing protein At3g22470, mitochondrial-...
KAA0059628.1 | 6.3e-113 | 49.70 | pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK08161...
Match Name | E-value | Identity (%) | Description
Q6NQ83 | 3.3e-56 | 37.54 | Pentatricopeptide repeat-containing protein At3g22470, mitochondrial OS=Arabidop...
Q9LQ14 | 1.3e-55 | 35.42 | Pentatricopeptide repeat-containing protein At1g62930, chloroplastic OS=Arabidop...
Q3ECK2 | 1.1e-54 | 37.14 | Pentatricopeptide repeat-containing protein At1g62680, mitochondrial OS=Arabidop...
Q9SXD8 | 3.1e-54 | 34.65 | Pentatricopeptide repeat-containing protein At1g62590 OS=Arabidopsis thaliana OX...
Q9C8T7 | 2.2e-52 | 34.08 | Pentatricopeptide repeat-containing protein At1g63330 OS=Arabidopsis thaliana OX...
Match Name | E-value | Identity (%) | Description
A0A5A7UUW8 | 8.1e-114 | 58.78 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S4DYL6 | 3.1e-113 | 49.70 | pentatricopeptide repeat-containing protein At3g22470, mitochondrial-like OS=Cuc...
A0A5D3C8J0 | 3.1e-113 | 49.70 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A6J1GP31 | 9.0e-105 | 46.31 | putative pentatricopeptide repeat-containing protein At1g12700, mitochondrial OS...
A0A6J1DSW3 | 9.9e-104 | 45.82 | pentatricopeptide repeat-containing protein At1g63330-like OS=Momordica charanti...
Match Name | E-value | Identity (%) | Description
AT3G22470.1 | 2.4e-57 | 37.54 | Pentatricopeptide repeat (PPR) superfamily protein
AT1G62930.1 | 9.0e-57 | 35.42 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G62680.1 | 7.6e-56 | 37.14 | Pentatricopeptide repeat (PPR) superfamily protein
AT1G62590.1 | 2.2e-55 | 34.65 | pentatricopeptide (PPR) repeat-containing protein
AT1G63330.1 | 1.6e-53 | 34.08 | Pentatricopeptide repeat (PPR) superfamily protein
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 78..179, e-value: 6.6E-24, score: 86.2; coord: 11..77, e-value: 1.2E-9, score: 39.7
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 185..342, e-value: 2.1E-31, score: 111.4
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 236..284, e-value: 4.8E-14, score: 52.3; coord: 130..177, e-value: 2.0E-15, score: 56.7
IPR002885 | Pentatricopeptide repeat | PFAM | PF12854 | PPR_1 | coord: 31..53, e-value: 2.0E-6, score: 27.4
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 106..132, e-value: 4.3E-5, score: 21.4; coord: 209..236, e-value: 6.6E-5, score: 20.8; coord: 273..300, e-value: 1.6E-5, score: 22.8; coord: 169..201, e-value: 3.6E-5, score: 21.7; coord: 38..75, e-value: 0.0013, score: 16.8; coord: 239..272, e-value: 0.0017, score: 16.4; coord: 133..165, e-value: 1.4E-10, score: 38.6
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 209..232, e-value: 8.5E-5, score: 22.6; coord: 106..128, e-value: 0.089, score: 13.1
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 236..270, score: 10.753093
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 201..235, score: 9.317163
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 131..165, score: 13.866102
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 36..74, score: 10.764054
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 271..308, score: 10.05157
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 166..200, score: 9.448698
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 96..130, score: 8.560833
None | No IPR available | PANTHER | PTHR47942 | TETRATRICOPEPTIDE REPEAT (TPR)-LIKE SUPERFAMILY PROTEIN-RELATED | coord: 15..78; coord: 175..332; coord: 77..290
None | No IPR available | PANTHER | PTHR47942:SF21 | OS05G0275100 PROTEIN | coord: 15..78; coord: 175..332; coord: 77..290
None | No IPR available | SUPERFAMILY | 81901 | HCP-like | coord: 48..172
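
The PROSITE PS51375 rows above list the PPR repeats in no particular order. A short sketch, assuming the coordinates are 1-based and inclusive (an assumption; the table does not state the convention), sorts them and reports each repeat's length and the spacing between neighbours. The names `PPR_REPEATS` and `summarize_repeats` are hypothetical helpers for illustration.

```python
# PROSITE PS51375 (PPR) coordinates copied from the InterPro table,
# as (start, end) residue positions on the protein.
PPR_REPEATS = [
    (236, 270), (201, 235), (131, 165), (36, 74),
    (271, 308), (166, 200), (96, 130),
]


def summarize_repeats(repeats):
    """Sort repeats by start; return their lengths and the gaps between them.

    Assumes 1-based inclusive coordinates, so length = end - start + 1.
    """
    ordered = sorted(repeats)
    lengths = [end - start + 1 for start, end in ordered]
    # Gap = number of residues strictly between one repeat and the next.
    gaps = [
        next_start - prev_end - 1
        for (_, prev_end), (next_start, _) in zip(ordered, ordered[1:])
    ]
    return ordered, lengths, gaps


ordered, lengths, gaps = summarize_repeats(PPR_REPEATS)
for (start, end), length in zip(ordered, lengths):
    print(f"{start}..{end}\t{length} aa")
print("gaps between consecutive repeats:", gaps)
```

Under that assumption, the six repeats spanning 96..308 come out back to back, and a 21-residue gap separates them from the first repeat at 36..74.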

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cla97C09G171075.1 | Cla97C09G171075.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0005515 | protein binding