Cla021304 (gene) Watermelon (97103) v1

Name: Cla021304
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Pentatricopeptide repeat-containing protein (AHRD V1 ***- D7KP55_ARALL); contains Interpro domain(s) IPR002885 Pentatricopeptide repeat
Location: Chr5 : 2011643 .. 2014098 (+)
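
As a rough illustration of how the Location field can be read programmatically, the Python sketch below splits it into chromosome, start, end and strand, then derives the span length. The names `parse_location` and `span_length` are hypothetical, and treating the coordinates as 1-based and inclusive (as in GFF-style annotation) is an assumption, not something this page states.

```python
import re

# Matches strings shaped like "Chr5 : 2011643 .. 2014098 (+)".
LOCATION_RE = re.compile(r"^\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text):
    """Return (chromosome, start, end, strand) from a location string."""
    match = LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end, strand = parse_location("Chr5 : 2011643 .. 2014098 (+)")
print(chrom, strand, span_length(start, end))  # Chr5 + 2456
```

Under the inclusive-coordinate assumption the gene spans 2456 bp on the forward strand of Chr5.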
The following sequences are available for this feature:

Gene sequence (with intron)

ATGATGCGAACCTCCACCAGTAACATTCTCCATCAACTTCATCCCAAACAGCCGCTAGTTAATGGAAATCCTGGGAGTTCGTATTCTTGTTACTGGAGAGGCTCAATTGCTCAAACATTCGGAGTCTTAAGATCTCGCCGAAGATGCTCTCAATTGGCTACTGTTGCTGCCATTGTTGAGGAATTTCACAAATTAGAGAGTGAAAGAGAGAAGCCAAGGTTTCGATGGGTCGAGGTGGGCTCTGATATTACTGAAATGCAGAAGAAAGCTATATCTCAGCTTCCTGCTAAGATGACTAAAAGATGTAAGGCTCTGATGAAACAACTTATATGTTTTTCGCCTCAGAAGGGTAATTTATCAGATATGTTGGCGGCTTGGGTGAGGATTATGAAGCCTGAAAGAGCAGATTGGCTTTCAGTTCTTAAGCATATGAGGATCTCGAATCATCCTCTTTACATCGAGGTATGTAAAATCTCATATAATTTCATAAATTTACTGAATAACAAGCTTAGTCTATCAACATTAAGTTTGTCTGTTTATTCTTCAAAGTTTGTGTGTCTAATAGATTTCATTTTTTCTATTTTGTAATTAATAAATCCATCTACTAAAATAAATTCGAGTTTCAACACTAAACTTTCAATTTTATACGGATTTTAGAAACTTTAAAGCTCTAGATTAAATTTGTAATTTCGATTATTTAGAGACTAAGTAATCACATTGCTGACTTTTTTCTTTTTTTTTTTGTTGATCTTATGTTAGATACGACCGAGTCTGCCAGCCTGGTTAGCATATTTGTTTAATTTCTCATTGTTAAGATTCTCCCGTTCCTTGAGTTCGACAGAGCCATTTATGTCAAGGATGCTTAAGAACGATTTTAAAATATAGGACATGGGTAGAAATGGATTCAATCTAGATTCAATCTAGGATGGTCACTTAGCTAAGATTTAATATCCTACGTGTTTTCTTACCACAAAAATATTATAAGATCAGGTGGGATATCTAAATGGACGCCAATTGGCCCAAATACTCACAGATTAAAAAATTAATAATTATTATTTTGTACTTTATCTAAAAAAAACTTGGGTTGAAATCAAAGGAAACTTCTTATGCATATCATATGAATTTATGACAATCAAATATTATAAGGTCAAACAATTAAAATAGTTGCATGGACTCGGATCCTGTATTTGAAAATTATATGATTTTCAATTACTGTAGAAGTTTTCAAAATTTCACTTTAAATTGTATGATGTAGTTGCAACTATCATTTAGATTCTTGGTTATGTTTAAGAGAAGTATCATTTTGCATATTCAAAGTTGATACGTAAAAGTTCCTATTTATTAAGGCTTCGTTTGATAATTTTTTTTTTTTTTTTAAAACTAAGCTTATAAACATTATTTTCACCCAAGTTTTAAATCTACATTATTTTATCTACTTTATAATCATGTTTTCACAAGTCATACGAAAATTTGAAAAAAAGAAAAAAAAAAACATAGTTATTTTTGTTTTTGAGTTATGCTAAGAATCCAAATACTTCCTTTAAGAAAAAAGATGAAAACCATTGTTAAGAAATTGTGAGAAAATAGACATAATTTTCAAAAACCAAATGGTTATCAAACGGAACCTAATCAATTGGTGATATCTCGTAGCCTAAGTTGATGAGTGTTGTTGCTCTTAGGTGGCAGAAGCTGCTCTTGTAGAGATAACATTTGAGGCCAATACTCGAGACTACACAAAGATTATTCATTACCATGGGAAGCGAAACCAACTCGAGGATGCTGAAAAAATTCTCTTAAGGATGAGAGAAAGGGGTTTTGCTTGTGATCAGATAACATTGACCACAATGATCCACATATATAGCAAGGCTGACAAACTTAATCTTGCCAAACGAACTTTTGAAGAGCTCAAACTGCTCGAGCAACCATTGGATAAAAGATCGTACGGTGCAATGATTATGGCATATGTCAGGGCCGGGATGCCCGAGGAAGGAGAAAATATT
CTGAAAGAAATGGATGCGAAAGATATTAATGCAGGAAGTGAAGTTTACAAGGCTTTGTTAAGAGCATATTCCATGGCTGGCAATGCTGAAGGAGCCCAAAGGGTATTCGATGCAATTCAATTGGCTGCTATTCCTCCTGATGAAAAGTTATGTGGTCTGCTGATCAATGCGTATCTGATGGCCGGTCAAAGCCAAAAGGCGCAAATTGCTTTCGACAATATGAGGAGGGCAGGTATTGAACCTAGTGACAAATGCATAGCTTTGGTATTAAGTGCATATGAAAAGGAAAACAGGCTAAACGCAGCATTGGAACTTCTAATAGATTTAGAGAAGGATAACCTCATGGTTGGGAAGGAAGCTTCAGAAATATTAGCAGCTTGGCTTAAAAGACTTGGGGTGGTAGAAGAGGTTGAACTTGTCTTGAGGGAATACACTGTGAAAGAAGCAAGCGGATAA

mRNA sequence

ATGATGCGAACCTCCACCAGTAACATTCTCCATCAACTTCATCCCAAACAGCCGCTAGTTAATGGAAATCCTGGGAGTTCGTATTCTTGTTACTGGAGAGGCTCAATTGCTCAAACATTCGGAGTCTTAAGATCTCGCCGAAGATGCTCTCAATTGGCTACTGTTGCTGCCATTGTTGAGGAATTTCACAAATTAGAGAGTGAAAGAGAGAAGCCAAGGTTTCGATGGGTCGAGGTGGGCTCTGATATTACTGAAATGCAGAAGAAAGCTATATCTCAGCTTCCTGCTAAGATGACTAAAAGATGTAAGGCTCTGATGAAACAACTTATATGTTTTTCGCCTCAGAAGGGTAATTTATCAGATATGTTGGCGGCTTGGGTGAGGATTATGAAGCCTGAAAGAGCAGATTGGCTTTCAGTTCTTAAGCATATGAGGATCTCGAATCATCCTCTTTACATCGAGGTGGCAGAAGCTGCTCTTGTAGAGATAACATTTGAGGCCAATACTCGAGACTACACAAAGATTATTCATTACCATGGGAAGCGAAACCAACTCGAGGATGCTGAAAAAATTCTCTTAAGGATGAGAGAAAGGGGTTTTGCTTGTGATCAGATAACATTGACCACAATGATCCACATATATAGCAAGGCTGACAAACTTAATCTTGCCAAACGAACTTTTGAAGAGCTCAAACTGCTCGAGCAACCATTGGATAAAAGATCGTACGGTGCAATGATTATGGCATATGTCAGGGCCGGGATGCCCGAGGAAGGAGAAAATATTCTGAAAGAAATGGATGCGAAAGATATTAATGCAGGAAGTGAAGTTTACAAGGCTTTGTTAAGAGCATATTCCATGGCTGGCAATGCTGAAGGAGCCCAAAGGGTATTCGATGCAATTCAATTGGCTGCTATTCCTCCTGATGAAAAGTTATGTGGTCTGCTGATCAATGCGTATCTGATGGCCGGTCAAAGCCAAAAGGCGCAAATTGCTTTCGACAATATGAGGAGGGCAGGTATTGAACCTAGTGACAAATGCATAGCTTTGGTATTAAGTGCATATGAAAAGGAAAACAGGCTAAACGCAGCATTGGAACTTCTAATAGATTTAGAGAAGGATAACCTCATGGTTGGGAAGGAAGCTTCAGAAATATTAGCAGCTTGGCTTAAAAGACTTGGGGTGGTAGAAGAGGTTGAACTTGTCTTGAGGGAATACACTGTGAAAGAAGCAAGCGGATAA

Coding sequence (CDS)

ATGATGCGAACCTCCACCAGTAACATTCTCCATCAACTTCATCCCAAACAGCCGCTAGTTAATGGAAATCCTGGGAGTTCGTATTCTTGTTACTGGAGAGGCTCAATTGCTCAAACATTCGGAGTCTTAAGATCTCGCCGAAGATGCTCTCAATTGGCTACTGTTGCTGCCATTGTTGAGGAATTTCACAAATTAGAGAGTGAAAGAGAGAAGCCAAGGTTTCGATGGGTCGAGGTGGGCTCTGATATTACTGAAATGCAGAAGAAAGCTATATCTCAGCTTCCTGCTAAGATGACTAAAAGATGTAAGGCTCTGATGAAACAACTTATATGTTTTTCGCCTCAGAAGGGTAATTTATCAGATATGTTGGCGGCTTGGGTGAGGATTATGAAGCCTGAAAGAGCAGATTGGCTTTCAGTTCTTAAGCATATGAGGATCTCGAATCATCCTCTTTACATCGAGGTGGCAGAAGCTGCTCTTGTAGAGATAACATTTGAGGCCAATACTCGAGACTACACAAAGATTATTCATTACCATGGGAAGCGAAACCAACTCGAGGATGCTGAAAAAATTCTCTTAAGGATGAGAGAAAGGGGTTTTGCTTGTGATCAGATAACATTGACCACAATGATCCACATATATAGCAAGGCTGACAAACTTAATCTTGCCAAACGAACTTTTGAAGAGCTCAAACTGCTCGAGCAACCATTGGATAAAAGATCGTACGGTGCAATGATTATGGCATATGTCAGGGCCGGGATGCCCGAGGAAGGAGAAAATATTCTGAAAGAAATGGATGCGAAAGATATTAATGCAGGAAGTGAAGTTTACAAGGCTTTGTTAAGAGCATATTCCATGGCTGGCAATGCTGAAGGAGCCCAAAGGGTATTCGATGCAATTCAATTGGCTGCTATTCCTCCTGATGAAAAGTTATGTGGTCTGCTGATCAATGCGTATCTGATGGCCGGTCAAAGCCAAAAGGCGCAAATTGCTTTCGACAATATGAGGAGGGCAGGTATTGAACCTAGTGACAAATGCATAGCTTTGGTATTAAGTGCATATGAAAAGGAAAACAGGCTAAACGCAGCATTGGAACTTCTAATAGATTTAGAGAAGGATAACCTCATGGTTGGGAAGGAAGCTTCAGAAATATTAGCAGCTTGGCTTAAAAGACTTGGGGTGGTAGAAGAGGTTGAACTTGTCTTGAGGGAATACACTGTGAAAGAAGCAAGCGGATAA

Protein sequence

MMRTSTSNILHQLHPKQPLVNGNPGSSYSCYWRGSIAQTFGVLRSRRRCSQLATVAAIVEEFHKLESEREKPRFRWVEVGSDITEMQKKAISQLPAKMTKRCKALMKQLICFSPQKGNLSDMLAAWVRIMKPERADWLSVLKHMRISNHPLYIEVAEAALVEITFEANTRDYTKIIHYHGKRNQLEDAEKILLRMRERGFACDQITLTTMIHIYSKADKLNLAKRTFEELKLLEQPLDKRSYGAMIMAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQRVFDAIQLAAIPPDEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIALVLSAYEKENRLNAALELLIDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLREYTVKEASG
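
The protein sequence above is the CDS read codon by codon. The following is a minimal sketch of that translation, assuming the standard nuclear genetic code (NCBI translation table 1). The helper name `translate` is hypothetical, and only the first ten and last five codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, to_stop=True):
    """Translate a DNA coding sequence; unknown codons become 'X'."""
    cds = cds.upper()
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGATGCGAACCTCCACCAGTAACATTCTC"  # first 10 codons of the CDS
cds_end = "GAAGCAAGCGGATAA"                   # last 5 codons, including TAA
print(translate(cds_start))  # MMRTSTSNIL
print(translate(cds_end))    # EASG
```

Both fragments agree with the corresponding ends of the listed protein sequence (MMRTSTSNIL... and ...EASG).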
BLAST of Cla021304 vs. Swiss-Prot
Match: PPR1_ARATH (Pentatricopeptide repeat-containing protein At1g01970 OS=Arabidopsis thaliana GN=At1g01970 PE=2 SV=1)

HSP 1 Score: 452.2 bits (1162), Expect = 6.0e-126
Identity = 216/364 (59.34%), Positives = 289/364 (79.40%), Query Frame = 1

Query: 47  RRCSQLATVAAIVEEFHKLESEREKPRFRWVEVGSDITEMQKKAISQLPAKMTKRCKALM 106
           R CS     +  + E  + E   +   F W +VG ++TE Q +AI+++P KM+KRC+ALM
Sbjct: 43  RLCSCKCNASLAIGEVVEKEDAEQSRSFNWADVGLNLTEEQDEAITRIPIKMSKRCQALM 102

Query: 107 KQLICFSPQKGNLSDMLAAWVRIMKPERADWLSVLKHMRISNHPLYIEVAEAALVEITFE 166
           +Q+ICFSP+KG+  D+L AW+R M P RADWLS+LK ++  + P YI+VAE +L++ +FE
Sbjct: 103 RQIICFSPEKGSFCDLLGAWLRRMNPIRADWLSILKELKNLDSPFYIKVAEFSLLQDSFE 162

Query: 167 ANTRDYTKIIHYHGKRNQLEDAEKILLRMRERGFACDQITLTTMIHIYSKADKLNLAKRT 226
           AN RDYTKIIHY+GK NQ+EDAE+ LL M+ RGF  DQ+TLT M+ +YSKA    LA+ T
Sbjct: 163 ANARDYTKIIHYYGKLNQVEDAERTLLSMKNRGFLIDQVTLTAMVQLYSKAGCHKLAEET 222

Query: 227 FEELKLLEQPLDKRSYGAMIMAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSM 286
           F E+KLL +PLD RSYG+MIMAY+RAG+PE+GE++L+EMD+++I AG EVYKALLR YSM
Sbjct: 223 FNEIKLLGEPLDYRSYGSMIMAYIRAGVPEKGESLLREMDSQEICAGREVYKALLRDYSM 282

Query: 287 AGNAEGAQRVFDAIQLAAIPPDEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKC 346
            G+AEGA+RVFDA+Q+A I PD KLCGLLINAY ++GQSQ A++AF+NMR+AGI+ +DKC
Sbjct: 283 GGDAEGAKRVFDAVQIAGITPDVKLCGLLINAYSVSGQSQNARLAFENMRKAGIKATDKC 342

Query: 347 IALVLSAYEKENRLNAALELLIDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLREYT 406
           +ALVL+AYEKE +LN AL  L++LEKD++M+GKEAS +LA W K+LGVVEEVEL+LRE++
Sbjct: 343 VALVLAAYEKEEKLNEALGFLVELEKDSIMLGKEASAVLAQWFKKLGVVEEVELLLREFS 402

Query: 407 VKEA 411
             ++
Sbjct: 403 SSQS 406
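
The Identity and Positives figures in each HSP header are match counts over the alignment length, with the percentage in parentheses. As a hedged sketch, the snippet below parses such a header line and recomputes the percentages from the counts. The function name `parse_hsp_stats` and the returned dictionary keys are hypothetical, and the pattern assumes the header layout shown on this page.

```python
import re

# Accepts both "Positives" and the misspelling "Postives".
HSP_STATS_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract counts and recompute percentages from an HSP header line."""
    match = HSP_STATS_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, length, ident_pct, positive, pos_length, pos_pct = match.groups()
    identical, length = int(identical), int(length)
    positive, pos_length = int(positive), int(pos_length)
    return {
        "identical": identical,
        "positive": positive,
        "length": length,
        "identity_pct_reported": float(ident_pct),
        "positive_pct_reported": float(pos_pct),
        "identity_pct_computed": 100.0 * identical / length,
        "positive_pct_computed": 100.0 * positive / pos_length,
    }


stats = parse_hsp_stats(
    "Identity = 216/364 (59.34%), Positives = 289/364 (79.40%), Query Frame = 1"
)
print(f"{stats['identity_pct_computed']:.2f} {stats['positive_pct_computed']:.2f}")
```

Comparing the computed values with the reported ones is a cheap sanity check when these headers are scraped from text.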

BLAST of Cla021304 vs. Swiss-Prot
Match: PPR51_ARATH (Pentatricopeptide repeat-containing protein At1g19525 OS=Arabidopsis thaliana GN=At1g19525 PE=2 SV=2)

HSP 1 Score: 166.8 bits (421), Expect = 5.1e-40
Identity = 89/209 (42.58%), Positives = 124/209 (59.33%), Query Frame = 1

Query: 195 MRERGFACDQITLTTMIHIYSKADKLNLAKRTFEELKLLEQPLDKRSYGAMIMAYVRAGM 254
           M + G   D +T T ++H+YSK+     A   FE LK      D++ Y AMI+ YV AG 
Sbjct: 1   MSQNGIFPDILTATALVHMYSKSGNFERATEAFENLKSYGLRPDEKIYEAMILGYVNAGK 60

Query: 255 PEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQRVFDAIQLAAIPP-DEKLCG 314
           P+ GE ++KEM AK++ A  EVY ALLRAY+  G+A GA  +  ++Q A+  P   +   
Sbjct: 61  PKLGERLMKEMQAKELKASEEVYMALLRAYAQMGDANGAAGISSSMQYASDGPLSFEAYS 120

Query: 315 LLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIALVLSAYEKENRLNAALELLIDLEKD 374
           L + AY  AGQ  KA+  FD MR+ G +P DKCIA ++ AY+ EN L+ AL LL+ LEKD
Sbjct: 121 LFVEAYGKAGQVDKAKSNFDEMRKLGHKPDDKCIANLVRAYKGENSLDKALRLLLQLEKD 180

Query: 375 NLMVGKEASEILAAWLKRLGVVEEVELVL 403
            + +G     +L  W+  LG++EE E +L
Sbjct: 181 GIEIGVITYTVLVDWMANLGLIEEAEQLL 209

BLAST of Cla021304 vs. Swiss-Prot
Match: PP186_ARATH (Pentatricopeptide repeat-containing protein At2g35130 OS=Arabidopsis thaliana GN=At2g35130 PE=2 SV=1)

HSP 1 Score: 95.1 bits (235), Expect = 1.9e-18
Identity = 66/259 (25.48%), Positives = 118/259 (45.56%), Query Frame = 1

Query: 153 IEVAEAALVEITFEANTRDYTKIIHYHG-------KRNQLEDAEKILLRMRERGFACDQI 212
           IE AE  LVE+     +     +  Y+        ++   E+A  +  RM+         
Sbjct: 206 IERAEVVLVEMQNHHVSPKTIGVTVYNAYIEGLMKRKGNTEEAIDVFQRMKRDRCKPTTE 265

Query: 213 TLTTMIHIYSKADKLNLAKRTFEELKLLEQPLDKRSYGAMIMAYVRAGMPEEGENILKEM 272
           T   MI++Y KA K  ++ + + E++  +   +  +Y A++ A+ R G+ E+ E I +++
Sbjct: 266 TYNLMINLYGKASKSYMSWKLYCEMRSHQCKPNICTYTALVNAFAREGLCEKAEEIFEQL 325

Query: 273 DAKDINAGSEVYKALLRAYSMAGNAEGAQRVFDAIQLAAIPPDEKLCGLLINAYLMAGQS 332
               +     VY AL+ +YS AG   GA  +F  +Q     PD     ++++AY  AG  
Sbjct: 326 QEDGLEPDVYVYNALMESYSRAGYPYGAAEIFSLMQHMGCEPDRASYNIMVDAYGRAGLH 385

Query: 333 QKAQIAFDNMRRAGIEPSDKCIALVLSAYEKENRLNAALELLIDLEKDNLMVGKEASEIL 392
             A+  F+ M+R GI P+ K   L+LSAY K   +     ++ ++ ++ +         +
Sbjct: 386 SDAEAVFEEMKRLGIAPTMKSHMLLLSAYSKARDVTKCEAIVKEMSENGVEPDTFVLNSM 445

Query: 393 AAWLKRLGVVEEVELVLRE 405
                RLG   ++E +L E
Sbjct: 446 LNLYGRLGQFTKMEKILAE 464


HSP 2 Score: 32.3 bits (72), Expect = 1.5e+01
Identity = 30/129 (23.26%), Positives = 57/129 (44.19%), Query Frame = 1

Query: 277 YKALLRAYSMAGNAEGAQRVFDAIQLAAIPPDEKLCGLLINAYLMAGQSQKAQIAFDNMR 336
           +  L+ AY      + A+ ++  +  +   P E    LLI AY MAG  ++A++    M+
Sbjct: 158 FNLLIDAYGQKFQYKEAESLYVQLLESRYVPTEDTYALLIKAYCMAGLIERAEVVLVEMQ 217

Query: 337 RAGIEPSDKCIAL-VLSAY-----EKENRLNAALELLIDLEKD---------NLMVG--K 389
              + P  K I + V +AY     +++     A+++   +++D         NLM+    
Sbjct: 218 NHHVSP--KTIGVTVYNAYIEGLMKRKGNTEEAIDVFQRMKRDRCKPTTETYNLMINLYG 277

BLAST of Cla021304 vs. Swiss-Prot
Match: PP163_ARATH (Pentatricopeptide repeat-containing protein At2g18940, chloroplastic OS=Arabidopsis thaliana GN=At2g18940 PE=2 SV=1)

HSP 1 Score: 87.0 bits (214), Expect = 5.1e-16
Identity = 59/214 (27.57%), Positives = 100/214 (46.73%), Query Frame = 1

Query: 191 ILLRMRERGFACDQITLTTMIHIYSKADKLNLAKRTFEELKLLEQPLDKRSYGAMIMAYV 250
           +L  MR +G   D+ T +T++   ++   L  AK  F ELK         +Y A++  + 
Sbjct: 268 VLDEMRSKGLKFDEFTCSTVLSACAREGLLREAKEFFAELKSCGYEPGTVTYNALLQVFG 327

Query: 251 RAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQRVFDAIQLAAIPPDEK 310
           +AG+  E  ++LKEM+     A S  Y  L+ AY  AG ++ A  V + +    + P+  
Sbjct: 328 KAGVYTEALSVLKEMEENSCPADSVTYNELVAAYVRAGFSKEAAGVIEMMTKKGVMPNAI 387

Query: 311 LCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIALVLSAYEKENRLNAALELLIDL 370
               +I+AY  AG+  +A   F +M+ AG  P+      VLS   K++R N  +++L D+
Sbjct: 388 TYTTVIDAYGKAGKEDEALKLFYSMKEAGCVPNTCTYNAVLSLLGKKSRSNEMIKMLCDM 447

Query: 371 EKDNLMVGKEASEILAAWLKRLGVVEEVELVLRE 405
           + +     +     + A     G+ + V  V RE
Sbjct: 448 KSNGCSPNRATWNTMLALCGNKGMDKFVNRVFRE 481


HSP 2 Score: 65.1 bits (157), Expect = 2.1e-09
Identity = 46/208 (22.12%), Positives = 85/208 (40.87%), Query Frame = 1

Query: 168 NTRDYTKIIHYHGKRNQLEDAEKILLRMRERGFACDQITLTTMIHIYSKADKLNLAKRTF 227
           N   +  ++   G +   +   ++   M+  GF  D+ T  T+I  Y +      A + +
Sbjct: 455 NRATWNTMLALCGNKGMDKFVNRVFREMKSCGFEPDRDTFNTLISAYGRCGSEVDASKMY 514

Query: 228 EELKLLEQPLDKRSYGAMIMAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMA 287
            E+          +Y A++ A  R G    GEN++ +M +K        Y  +L+ Y+  
Sbjct: 515 GEMTRAGFNACVTTYNALLNALARKGDWRSGENVISDMKSKGFKPTETSYSLMLQCYAKG 574

Query: 288 GNAEGAQRVFDAIQLAAIPPDEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCI 347
           GN  G +R+ + I+   I P   L   L+ A         ++ AF   ++ G +P     
Sbjct: 575 GNYLGIERIENRIKEGQIFPSWMLLRTLLLANFKCRALAGSERAFTLFKKHGYKPDMVIF 634

Query: 348 ALVLSAYEKENRLNAALELLIDLEKDNL 376
             +LS + + N  + A  +L  + +D L
Sbjct: 635 NSMLSIFTRNNMYDQAEGILESIREDGL 662


HSP 3 Score: 58.9 bits (141), Expect = 1.5e-07
Identity = 46/221 (20.81%), Positives = 88/221 (39.82%), Query Frame = 1

Query: 189 EKILLRMRERGFACDQITLTTMIHIYSKADKLNLAKRTFEELKLLEQPLDKRSYGAMIMA 248
           E ++L         D   +   + I  +  + ++A +  +++ L E  LD R+Y  ++ A
Sbjct: 160 EWLVLSSNSGALKLDHQVIEIFVRILGRESQYSVAAKLLDKIPLQEYLLDVRAYTTILHA 219

Query: 249 YVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNA-EGAQRVFDAIQLAAIPP 308
           Y R G  E+  ++ + M     +     Y  +L  +   G +      V D ++   +  
Sbjct: 220 YSRTGKYEKAIDLFERMKEMGPSPTLVTYNVILDVFGKMGRSWRKILGVLDEMRSKGLKF 279

Query: 309 DEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIALVLSAYEKENRLNAALELL 368
           DE  C  +++A    G  ++A+  F  ++  G EP       +L  + K      AL +L
Sbjct: 280 DEFTCSTVLSACAREGLLREAKEFFAELKSCGYEPGTVTYNALLQVFGKAGVYTEALSVL 339

Query: 369 IDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLREYTVK 409
            ++E+++          L A   R G  +E   V+   T K
Sbjct: 340 KEMEENSCPADSVTYNELVAAYVRAGFSKEAAGVIEMMTKK 380

BLAST of Cla021304 vs. Swiss-Prot
Match: PP408_ARATH (Pentatricopeptide repeat-containing protein At5g39980, chloroplastic OS=Arabidopsis thaliana GN=At5g39980 PE=2 SV=1)

HSP 1 Score: 86.7 bits (213), Expect = 6.7e-16
Identity = 63/215 (29.30%), Positives = 100/215 (46.51%), Query Frame = 1

Query: 166 EANTRDYTKIIHYHGKRNQLEDAEKILLRMRERGFACDQITLTTMIHIYSKADKLNLAKR 225
           E N   Y  +I  +GK  + E A  ++  M+ RG   + IT +T+I I+ KA KL+ A  
Sbjct: 397 EQNVVTYNTMIKIYGKTMEHEKATNLVQEMQSRGIEPNAITYSTIISIWGKAGKLDRAAT 456

Query: 226 TFEELKLLEQPLDKRSYGAMIMAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYS 285
            F++L+     +D+  Y  MI+AY R G+    + +L E+   D N   E    +L   +
Sbjct: 457 LFQKLRSSGVEIDQVLYQTMIVAYERVGLMGHAKRLLHELKLPD-NIPRETAITIL---A 516

Query: 286 MAGNAEGAQRVFDAIQLAAIPPDEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDK 345
            AG  E A  VF     +    D  + G +IN Y    +       F+ MR AG  P   
Sbjct: 517 KAGRTEEATWVFRQAFESGEVKDISVFGCMINLYSRNQRYVNVIEVFEKMRTAGYFPDSN 576

Query: 346 CIALVLSAYEKENRLNAALELLIDLEKDNLMVGKE 381
            IA+VL+AY K+     A  +  +++++  +   E
Sbjct: 577 VIAMVLNAYGKQREFEKADTVYREMQEEGCVFPDE 607

BLAST of Cla021304 vs. TrEMBL
Match: A0A0A0L7L8_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G126080 PE=4 SV=1)

HSP 1 Score: 685.6 bits (1768), Expect = 3.6e-194
Identity = 352/410 (85.85%), Positives = 375/410 (91.46%), Query Frame = 1

Query: 2   MRTSTSNILHQLHPKQPLVNGNPGSSYSCYWRGSIAQTFGVLRSRRRCSQLATVAAIVEE 61
           M+ STSNIL+QLH   PLVNG   +SYS YWR SI     VL SRRRCSQ+AT  AIV+E
Sbjct: 1   MQISTSNILYQLH--LPLVNGTSNTSYSRYWRDSI-----VLSSRRRCSQMATATAIVDE 60

Query: 62  FHKLESEREKPRFRWVEVGSDITEMQKKAISQLPAKMTKRCKALMKQLICFSPQKGNLSD 121
            HKLESEREKPRFRWVEVG DITE QK+AISQLP KMTKRCKA+MKQ+ICFSPQKG LSD
Sbjct: 61  IHKLESEREKPRFRWVEVGYDITETQKQAISQLPPKMTKRCKAVMKQIICFSPQKGELSD 120

Query: 122 MLAAWVRIMKPERADWLSVLKHMRISNHPLYIEVAEAALVEITFEANTRDYTKIIHYHGK 181
           MLAAWVRIMKPERADWL VLKH+RI NHPLYI+VAEAAL EITFEANTRDYTKIIH++GK
Sbjct: 121 MLAAWVRIMKPERADWLLVLKHLRILNHPLYIQVAEAALEEITFEANTRDYTKIIHHYGK 180

Query: 182 RNQLEDAEKILLRMRERGFACDQITLTTMIHIYSKADKLNLAKRTFEELKLLEQPLDKRS 241
           +NQLEDAEK+LL MRERGF CDQITLTTMIHIYSKADKLNLAK+TFEELKLLEQPLDKRS
Sbjct: 181 QNQLEDAEKVLLSMRERGFVCDQITLTTMIHIYSKADKLNLAKQTFEELKLLEQPLDKRS 240

Query: 242 YGAMIMAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQRVFDAIQ 301
           +GAMIMAYVRAG PEEGE ILKEMDAKDI AGSEVYKALLRAYSM GNAEGAQRVFDAIQ
Sbjct: 241 FGAMIMAYVRAGFPEEGEKILKEMDAKDIYAGSEVYKALLRAYSMVGNAEGAQRVFDAIQ 300

Query: 302 LAAIPPDEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIALVLSAYEKENRLN 361
           LAAI PDEKLCGLLINAYLMAGQS++AQIAFDNMRRAGIEPSDKCIAL LSAYEKENRLN
Sbjct: 301 LAAITPDEKLCGLLINAYLMAGQSREAQIAFDNMRRAGIEPSDKCIALALSAYEKENRLN 360

Query: 362 AALELLIDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLREYTVKEAS 412
           +ALELLIDLEKDN+MVGKEAS+ILAAWLKRLGVVEEVE+VLREYT KE +
Sbjct: 361 SALELLIDLEKDNVMVGKEASKILAAWLKRLGVVEEVEIVLREYTEKEVN 403

BLAST of Cla021304 vs. TrEMBL
Match: W9QSE5_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_022440 PE=4 SV=1)

HSP 1 Score: 504.2 bits (1297), Expect = 1.5e-139
Identity = 249/357 (69.75%), Positives = 301/357 (84.31%), Query Frame = 1

Query: 55  VAAIVEEFHKLESEREKPRFRWVEVGSDITEMQKKAISQLPAKMTKRCKALMKQLICFSP 114
           VA  VEE  K E+   KP+F+WVEVG  ITE QK+AISQL  KMTKRC+ALMKQLICFS 
Sbjct: 44  VATSVEETEKAENGGGKPKFKWVEVGPGITESQKEAISQLSPKMTKRCRALMKQLICFSA 103

Query: 115 QKGNLSDMLAAWVRIMKPERADWLSVLKHMRISNHPLYIEVAEAALVEITFEANTRDYTK 174
            K +L+++LAAWVRIMKP+RADWL+++K ++I +HPLY +VAE AL+E +FEAN RDYTK
Sbjct: 104 HKASLNELLAAWVRIMKPQRADWLAIIKQLKIMDHPLYFQVAEVALLEESFEANIRDYTK 163

Query: 175 IIHYHGKRNQLEDAEKILLRMRERGFACDQITLTTMIHIYSKADKLNLAKRTFEELKLLE 234
           IIH +GK+N+LEDAEK LL M+ RGF  DQ+TLTT IH+YSKA  L LA+ TFEELKLL 
Sbjct: 164 IIHCYGKQNRLEDAEKTLLAMKSRGFIRDQVTLTTFIHMYSKAGNLKLAEETFEELKLLG 223

Query: 235 QPLDKRSYGAMIMAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQ 294
           QPLDKRSYG+MIMAY+RAGMP++GENIL+EMD ++I AGSEVYKALLRAYSM G+AEGAQ
Sbjct: 224 QPLDKRSYGSMIMAYIRAGMPDQGENILREMDVEEIYAGSEVYKALLRAYSMTGDAEGAQ 283

Query: 295 RVFDAIQLAAIPPDEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIALVLSAY 354
           RVFDAIQLA I PD +LCGLLINAY+ +GQS+KA +AF NMRRAG+EPSDKC+ALVL AY
Sbjct: 284 RVFDAIQLAGILPDPRLCGLLINAYVESGQSEKACVAFGNMRRAGLEPSDKCVALVLCAY 343

Query: 355 EKENRLNAALELLIDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLREYTVKEAS 412
           EKEN+L  AL+ L++LE+  +MVG+EASE L  W ++LGVV+EV+LVLREY  K AS
Sbjct: 344 EKENKLQRALDFLMELERHGIMVGEEASETLVGWFRKLGVVKEVDLVLREYASKGAS 400

BLAST of Cla021304 vs. TrEMBL
Match: A0A061DV02_THECC (Tetratricopeptide repeat (TPR)-like superfamily protein, putative OS=Theobroma cacao GN=TCM_005495 PE=4 SV=1)

HSP 1 Score: 501.9 bits (1291), Expect = 7.4e-139
Identity = 252/414 (60.87%), Positives = 321/414 (77.54%), Query Frame = 1

Query: 2   MRTSTSNILHQLHPKQPLVNGNPGSSYSCYWRGS---IAQTFGVLRSRRRCSQLATVAAI 61
           M TS  NI +  +   P +N      +   W      + Q  G   S  + +    +A+ 
Sbjct: 1   MVTSACNIPYCSYSTYPFINKTKKQIHPQSWGNRNPLLFQKKGAKFSSCKVNNQPEIASS 60

Query: 62  -VEEFHKLESEREKPRFRWVEVGSDITEMQKKAISQLPAKMTKRCKALMKQLICFSPQKG 121
            VEE  K E+  EK R++WVE+G DI E QK+AI++LP KMTKRCKALMKQ+ICF P+KG
Sbjct: 61  NVEEKGKPETNEEKRRYKWVEIGPDIAEEQKQAITELPFKMTKRCKALMKQIICFCPEKG 120

Query: 122 NLSDMLAAWVRIMKPERADWLSVLKHMRISNHPLYIEVAEAALVEITFEANTRDYTKIIH 181
           +L+D+LAAWV+IMKP RADWL VLK ++I  HPLY EVAE AL+E +FEAN RD+TKIIH
Sbjct: 121 SLADLLAAWVKIMKPRRADWLVVLKELKIMEHPLYFEVAELALLEESFEANIRDFTKIIH 180

Query: 182 YHGKRNQLEDAEKILLRMRERGFACDQITLTTMIHIYSKADKLNLAKRTFEELKLLEQPL 241
            +GK+ +L++AE IL+ M+ RGF CDQ+TLTTM+H+YSKA  L LA+ TFEE+KLL Q L
Sbjct: 181 GYGKQKRLQEAENILVAMKRRGFICDQVTLTTMVHMYSKAGNLKLAEETFEEIKLLGQQL 240

Query: 242 DKRSYGAMIMAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQRVF 301
           DKRSYG+MIMAY+R+G PE+GE +L+EMD+++I AGSEVYKALLRAYSM G+A GAQRVF
Sbjct: 241 DKRSYGSMIMAYIRSGTPEQGEALLREMDSQEIYAGSEVYKALLRAYSMLGDANGAQRVF 300

Query: 302 DAIQLAAIPPDEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIALVLSAYEKE 361
           D IQLA I PD ++CGLLINAY +AGQS KA IAF+NMRRAG+EPSDKC+ALV++AYEK+
Sbjct: 301 DTIQLAGISPDARMCGLLINAYQLAGQSDKAHIAFENMRRAGLEPSDKCVALVVAAYEKQ 360

Query: 362 NRLNAALELLIDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLREYTVKEAS 412
           N+LN AL+ L++LE+D ++VGKEAS ILA W K+LGVVE+VELVLRE+  KE +
Sbjct: 361 NKLNKALDFLMELERDGIVVGKEASGILAQWFKKLGVVEQVELVLREFAAKETN 414

BLAST of Cla021304 vs. TrEMBL
Match: A0A0D2S0I3_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_004G161000 PE=4 SV=1)

HSP 1 Score: 494.2 bits (1271), Expect = 1.5e-136
Identity = 260/419 (62.05%), Positives = 314/419 (74.94%), Query Frame = 1

Query: 1   MMRTSTSNILH-----------QLHPKQPLVNGNPGSSYSCYWRGSIAQTFGVLRSRRRC 60
           MM TS SNI H           Q+HP Q   NGNP  S     + S   TF         
Sbjct: 1   MMVTSASNIPHCSYSPFPIINKQIHP-QSWGNGNPSLSLKQAMKPSSC-TFS------NE 60

Query: 61  SQLATVAAIVEEFHKLESEREKPRFRWVEVGSDITEMQKKAISQLPAKMTKRCKALMKQL 120
            Q++ + A            EK RF+WVE+G  ITE Q++AI +LP KMTKRCKALMKQ+
Sbjct: 61  PQISFIDA-----------EEKRRFKWVEIGPGITEEQRQAIDKLPFKMTKRCKALMKQI 120

Query: 121 ICFSPQKGNLSDMLAAWVRIMKPERADWLSVLKHMRISNHPLYIEVAEAALVEITFEANT 180
           ICF+P+KG+L D+L AWV +MKP RADWL VLK ++I  HPLY +VAE AL+E TFEAN 
Sbjct: 121 ICFNPEKGSLEDLLGAWVNVMKPRRADWLVVLKELKIMEHPLYFQVAEIALLEETFEANI 180

Query: 181 RDYTKIIHYHGKRNQLEDAEKILLRMRERGFACDQITLTTMIHIYSKADKLNLAKRTFEE 240
           RDYTKIIH +GK+N+L +AE IL  M+ RGF CDQ+TLTTM+H+YSKA  L LA+ TFEE
Sbjct: 181 RDYTKIIHGYGKQNRLREAENILDAMKRRGFICDQVTLTTMVHMYSKAGNLKLAEDTFEE 240

Query: 241 LKLLEQPLDKRSYGAMIMAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGN 300
           +KLL Q LDKRSYGAMIMAY+RAGMPE+GE +LKEMD  +I AGSEVYKALLRAYS  G+
Sbjct: 241 IKLLGQQLDKRSYGAMIMAYIRAGMPEQGEGLLKEMDNLEIYAGSEVYKALLRAYSTNGD 300

Query: 301 AEGAQRVFDAIQLAAIPPDEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIAL 360
            +GAQRVF AIQLA I PD KLCGLLINAY +AGQS++A++AF+NMRRAG+EPSDKC+AL
Sbjct: 301 TDGAQRVFGAIQLAGISPDAKLCGLLINAYQVAGQSEEARVAFENMRRAGLEPSDKCVAL 360

Query: 361 VLSAYEKENRLNAALELLIDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLREYTVK 409
           VL+AYEK+N+LN ALE L+DLE+D ++VGKEAS ILA W K+LGVVE+VE VLRE+  K
Sbjct: 361 VLAAYEKQNKLNKALEFLMDLERDGIVVGKEASSILAQWFKKLGVVEQVEQVLREFAAK 400

BLAST of Cla021304 vs. TrEMBL
Match: B9IC06_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0014s06610g PE=4 SV=1)

HSP 1 Score: 493.0 bits (1268), Expect = 3.4e-136
Identity = 253/406 (62.32%), Positives = 313/406 (77.09%), Query Frame = 1

Query: 2   MRTSTSNILHQLHPKQPLVNGNPGSSYSCYWRGSIAQTFGVLRSRRRCSQLATVAAIVEE 61
           M T   NIL    P  PL +    +S   +   S+ Q    L S +  SQ+  V A +  
Sbjct: 1   MATYVINILPFSSPTCPLHSEPKKTSNLHFLGNSLCQQPVTLTSCK--SQIQPVLAAINV 60

Query: 62  FHKLESE--REKPRFRWVEVGSDITEMQKKAISQLPAKMTKRCKALMKQLICFSPQKGNL 121
             K+E E  +EKP+FRWVE+G +I E QK+AISQLP KMTKRCKALM+Q+ICF+ +KG+L
Sbjct: 61  EEKVEGEIGKEKPKFRWVEIGPNIPEEQKQAISQLPFKMTKRCKALMRQIICFNDKKGSL 120

Query: 122 SDMLAAWVRIMKPERADWLSVLKHMRISNHPLYIEVAEAALVEITFEANTRDYTKIIHYH 181
             +L+AWV+IMKP R DWLS+LK +    HPLY+EV E AL+E +FEAN RDYTKIIH++
Sbjct: 121 RGLLSAWVKIMKPRRKDWLSILKELNKMEHPLYLEVVEIALLEESFEANVRDYTKIIHFY 180

Query: 182 GKRNQLEDAEKILLRMRERGFACDQITLTTMIHIYSKADKLNLAKRTFEELKLLEQPLDK 241
           G  NQLE+AE+  L M ERGF  DQ+TLT MIH+YSK   L LA+ TFEELKLL QPLD+
Sbjct: 181 GMNNQLEEAERTRLAMEERGFVSDQVTLTAMIHMYSKGGNLTLAEETFEELKLLGQPLDR 240

Query: 242 RSYGAMIMAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQRVFDA 301
           RSYG+MIMAY+RAGMPE+GE IL+EMDA++I AGSEVYKALLRAYS+ G+A+GAQRVFDA
Sbjct: 241 RSYGSMIMAYIRAGMPEKGEMILREMDAQEIRAGSEVYKALLRAYSIIGDADGAQRVFDA 300

Query: 302 IQLAAIPPDEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIALVLSAYEKENR 361
           IQLA IPPD++ C +L+NAY MAGQSQ A   F+NM RAGIEP+D+C+ALVL+AYEKEN+
Sbjct: 301 IQLAGIPPDDRTCAVLLNAYGMAGQSQNAYATFENMWRAGIEPTDRCVALVLAAYEKENK 360

Query: 362 LNAALELLIDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLREY 406
           LN AL+ LI LE++ L++GKEASE+LA W  RLGVV+EVELVLREY
Sbjct: 361 LNQALDFLIGLEREKLIIGKEASEVLAEWFGRLGVVKEVELVLREY 404

BLAST of Cla021304 vs. NCBI nr
Match: gi|659075451|ref|XP_008438151.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g01970 [Cucumis melo])

HSP 1 Score: 687.6 bits (1773), Expect = 1.4e-194
Identity = 352/410 (85.85%), Positives = 378/410 (92.20%), Query Frame = 1

Query: 2   MRTSTSNILHQLHPKQPLVNGNPGSSYSCYWRGSIAQTFGVLRSRRRCSQLATVAAIVEE 61
           M  STSNIL+QLH   PLVNG   +S S YW+ SI     VL SRRRCSQ+ATV AIV+E
Sbjct: 1   MHISTSNILYQLH--LPLVNGTSNTSSSRYWKDSI-----VLNSRRRCSQMATVTAIVDE 60

Query: 62  FHKLESEREKPRFRWVEVGSDITEMQKKAISQLPAKMTKRCKALMKQLICFSPQKGNLSD 121
            HKLESEREKPRFRWVEVG +ITE QK+AISQLP KMTK+CKA+MKQ+ICFSPQKG LSD
Sbjct: 61  LHKLESEREKPRFRWVEVGYNITETQKQAISQLPPKMTKKCKAVMKQIICFSPQKGELSD 120

Query: 122 MLAAWVRIMKPERADWLSVLKHMRISNHPLYIEVAEAALVEITFEANTRDYTKIIHYHGK 181
           MLAAWVRIMKPERADWLSVLKH+RI NHPLYI+VAEAALVEITFEANTRDYTKIIH++GK
Sbjct: 121 MLAAWVRIMKPERADWLSVLKHLRILNHPLYIQVAEAALVEITFEANTRDYTKIIHHYGK 180

Query: 182 RNQLEDAEKILLRMRERGFACDQITLTTMIHIYSKADKLNLAKRTFEELKLLEQPLDKRS 241
           +NQLEDAEK+LL MRERGFACDQITLTTMIHIYSKADKL LAK+TFEELKLLEQ LDKRS
Sbjct: 181 QNQLEDAEKVLLTMRERGFACDQITLTTMIHIYSKADKLKLAKQTFEELKLLEQSLDKRS 240

Query: 242 YGAMIMAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQRVFDAIQ 301
           YGAMIMAYVRAG+PEEGE ILKEMDAKDI AGSEVYKALLRAYSMAG+AEGAQRVFDAIQ
Sbjct: 241 YGAMIMAYVRAGLPEEGEKILKEMDAKDIYAGSEVYKALLRAYSMAGDAEGAQRVFDAIQ 300

Query: 302 LAAIPPDEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIALVLSAYEKENRLN 361
           LAAIPPDEKLCGLL+NAYLMAGQS+KAQIAFDNMRRAGIEPSDKCIAL LSAYEKENRLN
Sbjct: 301 LAAIPPDEKLCGLLMNAYLMAGQSRKAQIAFDNMRRAGIEPSDKCIALALSAYEKENRLN 360

Query: 362 AALELLIDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLREYTVKEAS 412
           AALELLIDLEKDN+MVGKEAS+ILAAWLKRLGVVEE+E+VLREYT KE +
Sbjct: 361 AALELLIDLEKDNVMVGKEASQILAAWLKRLGVVEEIEIVLREYTAKEVN 403

BLAST of Cla021304 vs. NCBI nr
Match: gi|449433119|ref|XP_004134345.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g01970 [Cucumis sativus])

HSP 1 Score: 685.6 bits (1768), Expect = 5.2e-194
Identity = 352/410 (85.85%), Positives = 375/410 (91.46%), Query Frame = 1

Query: 2   MRTSTSNILHQLHPKQPLVNGNPGSSYSCYWRGSIAQTFGVLRSRRRCSQLATVAAIVEE 61
           M+ STSNIL+QLH   PLVNG   +SYS YWR SI     VL SRRRCSQ+AT  AIV+E
Sbjct: 1   MQISTSNILYQLH--LPLVNGTSNTSYSRYWRDSI-----VLSSRRRCSQMATATAIVDE 60

Query: 62  FHKLESEREKPRFRWVEVGSDITEMQKKAISQLPAKMTKRCKALMKQLICFSPQKGNLSD 121
            HKLESEREKPRFRWVEVG DITE QK+AISQLP KMTKRCKA+MKQ+ICFSPQKG LSD
Sbjct: 61  IHKLESEREKPRFRWVEVGYDITETQKQAISQLPPKMTKRCKAVMKQIICFSPQKGELSD 120

Query: 122 MLAAWVRIMKPERADWLSVLKHMRISNHPLYIEVAEAALVEITFEANTRDYTKIIHYHGK 181
           MLAAWVRIMKPERADWL VLKH+RI NHPLYI+VAEAAL EITFEANTRDYTKIIH++GK
Sbjct: 121 MLAAWVRIMKPERADWLLVLKHLRILNHPLYIQVAEAALEEITFEANTRDYTKIIHHYGK 180

Query: 182 RNQLEDAEKILLRMRERGFACDQITLTTMIHIYSKADKLNLAKRTFEELKLLEQPLDKRS 241
           +NQLEDAEK+LL MRERGF CDQITLTTMIHIYSKADKLNLAK+TFEELKLLEQPLDKRS
Sbjct: 181 QNQLEDAEKVLLSMRERGFVCDQITLTTMIHIYSKADKLNLAKQTFEELKLLEQPLDKRS 240

Query: 242 YGAMIMAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQRVFDAIQ 301
           +GAMIMAYVRAG PEEGE ILKEMDAKDI AGSEVYKALLRAYSM GNAEGAQRVFDAIQ
Sbjct: 241 FGAMIMAYVRAGFPEEGEKILKEMDAKDIYAGSEVYKALLRAYSMVGNAEGAQRVFDAIQ 300

Query: 302 LAAIPPDEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIALVLSAYEKENRLN 361
           LAAI PDEKLCGLLINAYLMAGQS++AQIAFDNMRRAGIEPSDKCIAL LSAYEKENRLN
Sbjct: 301 LAAITPDEKLCGLLINAYLMAGQSREAQIAFDNMRRAGIEPSDKCIALALSAYEKENRLN 360

Query: 362 AALELLIDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLREYTVKEAS 412
           +ALELLIDLEKDN+MVGKEAS+ILAAWLKRLGVVEEVE+VLREYT KE +
Sbjct: 361 SALELLIDLEKDNVMVGKEASKILAAWLKRLGVVEEVEIVLREYTEKEVN 403

BLAST of Cla021304 vs. NCBI nr
Match: gi|703085829|ref|XP_010092845.1| (hypothetical protein L484_022440 [Morus notabilis])

HSP 1 Score: 504.2 bits (1297), Expect = 2.1e-139
Identity = 249/357 (69.75%), Positives = 301/357 (84.31%), Query Frame = 1

Query: 55  VAAIVEEFHKLESEREKPRFRWVEVGSDITEMQKKAISQLPAKMTKRCKALMKQLICFSP 114
           VA  VEE  K E+   KP+F+WVEVG  ITE QK+AISQL  KMTKRC+ALMKQLICFS 
Sbjct: 44  VATSVEETEKAENGGGKPKFKWVEVGPGITESQKEAISQLSPKMTKRCRALMKQLICFSA 103

Query: 115 QKGNLSDMLAAWVRIMKPERADWLSVLKHMRISNHPLYIEVAEAALVEITFEANTRDYTK 174
            K +L+++LAAWVRIMKP+RADWL+++K ++I +HPLY +VAE AL+E +FEAN RDYTK
Sbjct: 104 HKASLNELLAAWVRIMKPQRADWLAIIKQLKIMDHPLYFQVAEVALLEESFEANIRDYTK 163

Query: 175 IIHYHGKRNQLEDAEKILLRMRERGFACDQITLTTMIHIYSKADKLNLAKRTFEELKLLE 234
           IIH +GK+N+LEDAEK LL M+ RGF  DQ+TLTT IH+YSKA  L LA+ TFEELKLL 
Sbjct: 164 IIHCYGKQNRLEDAEKTLLAMKSRGFIRDQVTLTTFIHMYSKAGNLKLAEETFEELKLLG 223

Query: 235 QPLDKRSYGAMIMAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQ 294
           QPLDKRSYG+MIMAY+RAGMP++GENIL+EMD ++I AGSEVYKALLRAYSM G+AEGAQ
Sbjct: 224 QPLDKRSYGSMIMAYIRAGMPDQGENILREMDVEEIYAGSEVYKALLRAYSMTGDAEGAQ 283

Query: 295 RVFDAIQLAAIPPDEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIALVLSAY 354
           RVFDAIQLA I PD +LCGLLINAY+ +GQS+KA +AF NMRRAG+EPSDKC+ALVL AY
Sbjct: 284 RVFDAIQLAGILPDPRLCGLLINAYVESGQSEKACVAFGNMRRAGLEPSDKCVALVLCAY 343

Query: 355 EKENRLNAALELLIDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLREYTVKEAS 412
           EKEN+L  AL+ L++LE+  +MVG+EASE L  W ++LGVV+EV+LVLREY  K AS
Sbjct: 344 EKENKLQRALDFLMELERHGIMVGEEASETLVGWFRKLGVVKEVDLVLREYASKGAS 400

BLAST of Cla021304 vs. NCBI nr
Match: gi|590722924|ref|XP_007052035.1| (Tetratricopeptide repeat (TPR)-like superfamily protein, putative [Theobroma cacao])

HSP 1 Score: 501.9 bits (1291), Expect = 1.1e-138
Identity = 252/414 (60.87%), Positives = 321/414 (77.54%), Query Frame = 1

Query: 2   MRTSTSNILHQLHPKQPLVNGNPGSSYSCYWRGS---IAQTFGVLRSRRRCSQLATVAAI 61
           M TS  NI +  +   P +N      +   W      + Q  G   S  + +    +A+ 
Sbjct: 1   MVTSACNIPYCSYSTYPFINKTKKQIHPQSWGNRNPLLFQKKGAKFSSCKVNNQPEIASS 60

Query: 62  -VEEFHKLESEREKPRFRWVEVGSDITEMQKKAISQLPAKMTKRCKALMKQLICFSPQKG 121
            VEE  K E+  EK R++WVE+G DI E QK+AI++LP KMTKRCKALMKQ+ICF P+KG
Sbjct: 61  NVEEKGKPETNEEKRRYKWVEIGPDIAEEQKQAITELPFKMTKRCKALMKQIICFCPEKG 120

Query: 122 NLSDMLAAWVRIMKPERADWLSVLKHMRISNHPLYIEVAEAALVEITFEANTRDYTKIIH 181
           +L+D+LAAWV+IMKP RADWL VLK ++I  HPLY EVAE AL+E +FEAN RD+TKIIH
Sbjct: 121 SLADLLAAWVKIMKPRRADWLVVLKELKIMEHPLYFEVAELALLEESFEANIRDFTKIIH 180

Query: 182 YHGKRNQLEDAEKILLRMRERGFACDQITLTTMIHIYSKADKLNLAKRTFEELKLLEQPL 241
            +GK+ +L++AE IL+ M+ RGF CDQ+TLTTM+H+YSKA  L LA+ TFEE+KLL Q L
Sbjct: 181 GYGKQKRLQEAENILVAMKRRGFICDQVTLTTMVHMYSKAGNLKLAEETFEEIKLLGQQL 240

Query: 242 DKRSYGAMIMAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQRVF 301
           DKRSYG+MIMAY+R+G PE+GE +L+EMD+++I AGSEVYKALLRAYSM G+A GAQRVF
Sbjct: 241 DKRSYGSMIMAYIRSGTPEQGEALLREMDSQEIYAGSEVYKALLRAYSMLGDANGAQRVF 300

Query: 302 DAIQLAAIPPDEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIALVLSAYEKE 361
           D IQLA I PD ++CGLLINAY +AGQS KA IAF+NMRRAG+EPSDKC+ALV++AYEK+
Sbjct: 301 DTIQLAGISPDARMCGLLINAYQLAGQSDKAHIAFENMRRAGLEPSDKCVALVVAAYEKQ 360

Query: 362 NRLNAALELLIDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLREYTVKEAS 412
           N+LN AL+ L++LE+D ++VGKEAS ILA W K+LGVVE+VELVLRE+  KE +
Sbjct: 361 NKLNKALDFLMELERDGIVVGKEASGILAQWFKKLGVVEQVELVLREFAAKETN 414

BLAST of Cla021304 vs. NCBI nr
Match: gi|1009168695|ref|XP_015902798.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g01970-like [Ziziphus jujuba])

HSP 1 Score: 496.9 bits (1278), Expect = 3.4e-137
Identity = 246/367 (67.03%), Positives = 301/367 (82.02%), Query Frame = 1

Query: 45  SRRRCSQLATVAAIVEEFHKLESEREKPRFRWVEVGSDITEMQKKAISQLPAKMTKRCKA 104
           SR+   Q A+    VEE  K E+E  KP F+WVE+G  ITE Q++AIS+L  K+TKRCKA
Sbjct: 54  SRKLHFQQASFTKKVEETAKSENEEGKPMFKWVEIGPHITEAQRQAISKLSPKLTKRCKA 113

Query: 105 LMKQLICFSPQKGNLSDMLAAWVRIMKPERADWLSVLKHMRISNHPLYIEVAEAALVEIT 164
           LM+QLICFSP K +LSD+LAAWVR MKP RADWL+VLK ++  +HP Y++VAE AL+E T
Sbjct: 114 LMRQLICFSPHKASLSDLLAAWVRTMKPRRADWLAVLKELKTMDHPFYLQVAELALLEET 173

Query: 165 FEANTRDYTKIIHYHGKRNQLEDAEKILLRMRERGFACDQITLTTMIHIYSKADKLNLAK 224
           FEAN RDYTKIIH +GK+N+L+DAEK+L  M+ RGF  DQ+TLT  I IYSKA KLNLA+
Sbjct: 174 FEANIRDYTKIIHGYGKQNRLKDAEKMLSAMKSRGFVLDQVTLTAFIDIYSKAGKLNLAE 233

Query: 225 RTFEELKLLEQPLDKRSYGAMIMAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAY 284
            TFEELKLL QPLDKRSYG+MIMAY+RAGMP +GENILKEMDA++I AGSEVYKA+LR Y
Sbjct: 234 ETFEELKLLGQPLDKRSYGSMIMAYIRAGMPIKGENILKEMDAQEIYAGSEVYKAMLRLY 293

Query: 285 SMAGNAEGAQRVFDAIQLAAIPPDEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSD 344
           SMAG+ EGAQRVFDAIQ A I PD ++C LLINAY ++GQS KA++AF+NMRRAG+EPSD
Sbjct: 294 SMAGDCEGAQRVFDAIQFAGISPDVRMCALLINAYGISGQSDKARLAFENMRRAGLEPSD 353

Query: 345 KCIALVLSAYEKENRLNAALELLIDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLRE 404
           KC+A++L AYEKEN L  ALE L+DLE+D ++VGKEASE L  W ++LGVV+EV+ +LRE
Sbjct: 354 KCVAVMLLAYEKENELQKALEFLMDLERDGILVGKEASETLVGWFRKLGVVKEVDTILRE 413

Query: 405 YTVKEAS 412
           Y  KEA+
Sbjct: 414 YPGKEAN 420
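
The Identity and Positives figures in the HSP header can be recomputed from the alignment itself. The following is a minimal sketch, not part of the database output: it assumes the usual BLAST midline convention (a letter marks an identical residue, `+` marks a positive-scoring substitution, a space marks a mismatch or gap). The names `hsp_stats` and `percent` are hypothetical helpers introduced here for illustration.

```python
def percent(numerator: int, denominator: int) -> float:
    """Return a percentage rounded to two decimals, as printed in the HSP header."""
    return round(100 * numerator / denominator, 2)


def hsp_stats(midline: str, aligned_length: int) -> dict:
    """Count identities and positives in one BLAST midline string.

    Assumed convention: letters are identical residues, '+' is a
    conservative (positive-scoring) substitution, spaces are mismatches.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return {
        "identities": identities,
        "positives": positives,
        "identity_pct": percent(identities, aligned_length),
        "positive_pct": percent(positives, aligned_length),
    }


# Final alignment block of the HSP above: query YTVKEAS vs. subject YPGKEAN.
last_block = hsp_stats("Y  KEA+", aligned_length=7)
print(last_block)

# Whole-HSP totals taken from the header line (246 and 301 out of 367).
print(percent(246, 367), percent(301, 367))
```

Summing the per-block counts over every block of the HSP should give the header totals, provided the midline whitespace has survived copying intact.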

The following BLAST results are available for this feature:
Match Name    E-value     Identity    Description
PPR1_ARATH    6.0e-126    59.34       Pentatricopeptide repeat-containing protein At1g01970 OS=Arabidopsis thaliana GN...
PPR51_ARATH   5.1e-40     42.58       Pentatricopeptide repeat-containing protein At1g19525 OS=Arabidopsis thaliana GN...
PP186_ARATH   1.9e-18     25.48       Pentatricopeptide repeat-containing protein At2g35130 OS=Arabidopsis thaliana GN...
PP163_ARATH   5.1e-16     27.57       Pentatricopeptide repeat-containing protein At2g18940, chloroplastic OS=Arabidop...
PP408_ARATH   6.7e-16     29.30       Pentatricopeptide repeat-containing protein At5g39980, chloroplastic OS=Arabidop...
Match Name          E-value     Identity    Description
A0A0A0L7L8_CUCSA    3.6e-194    85.85       Uncharacterized protein OS=Cucumis sativus GN=Csa_3G126080 PE=4 SV=1
W9QSE5_9ROSA        1.5e-139    69.75       Uncharacterized protein OS=Morus notabilis GN=L484_022440 PE=4 SV=1
A0A061DV02_THECC    7.4e-139    60.87       Tetratricopeptide repeat (TPR)-like superfamily protein, putative OS=Theobroma c...
A0A0D2S0I3_GOSRA    1.5e-136    62.05       Uncharacterized protein OS=Gossypium raimondii GN=B456_004G161000 PE=4 SV=1
B9IC06_POPTR        3.4e-136    62.32       Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0014s06610g PE=4 SV=1
Match Name                           E-value     Identity    Description
gi|659075451|ref|XP_008438151.1|     1.4e-194    85.85       PREDICTED: pentatricopeptide repeat-containing protein At1g01970 [Cucumis melo]
gi|449433119|ref|XP_004134345.1|     5.2e-194    85.85       PREDICTED: pentatricopeptide repeat-containing protein At1g01970 [Cucumis sativu...
gi|703085829|ref|XP_010092845.1|     2.1e-139    69.75       hypothetical protein L484_022440 [Morus notabilis]
gi|590722924|ref|XP_007052035.1|     1.1e-138    60.87       Tetratricopeptide repeat (TPR)-like superfamily protein, putative [Theobroma cac...
gi|1009168695|ref|XP_015902798.1|    3.4e-137    67.03       PREDICTED: pentatricopeptide repeat-containing protein At1g01970-like [Ziziphus ...
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term         Definition
IPR002885    Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0009451 RNA modification
cellular_component GO:0005575 cellular_component
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0003674 molecular_function
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003723 RNA binding
molecular_function GO:0005515 protein binding
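
The GO assignments above mix the three ontology root terms (GO:0008150, GO:0005575, GO:0003674) with more specific terms. The sketch below shows one way to group the rows by category and set the roots aside, since a root term only says that some annotation exists in that branch. The function name `parse_go` and the `ROOTS` constant are illustrative choices, not part of the database.

```python
from collections import defaultdict

# GO rows copied from the table above: category, accession, term name.
GO_LINES = """\
biological_process GO:0008150 biological_process
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0009451 RNA modification
cellular_component GO:0005575 cellular_component
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0003674 molecular_function
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003723 RNA binding
molecular_function GO:0005515 protein binding
""".splitlines()

# The three GO root terms; they carry no specific functional information.
ROOTS = {"GO:0008150", "GO:0005575", "GO:0003674"}


def parse_go(lines):
    """Group (accession, name) pairs by GO category."""
    grouped = defaultdict(list)
    for line in lines:
        # The term name may contain spaces, so split only twice.
        category, accession, name = line.split(maxsplit=2)
        grouped[category].append((accession, name))
    return dict(grouped)


go_terms = parse_go(GO_LINES)
informative = {
    category: [term for term in terms if term[0] not in ROOTS]
    for category, terms in go_terms.items()
}

for category, terms in informative.items():
    print(category, terms)
```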

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name    Type
Cla021304       Cla021304.1    mRNA


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term    IPR Description    Source    Source Term    Source Description    Alignment
IPR002885    Pentatricopeptide repeat    PFAM    PF01535    PPR
    coord: 314..340, score: 0.08
    coord: 276..299, score: 0.15
    coord: 241..269, score: 2.
IPR002885    Pentatricopeptide repeat    PFAM    PF13041    PPR_2
    coord: 172..214, score: 2.
IPR002885    Pentatricopeptide repeat    TIGRFAMs    TIGR00756    TIGR00756
    coord: 172..203, score: 1.5E-5
    coord: 241..270, score: 2.
IPR002885    Pentatricopeptide repeat    PROFILE    PS51375    PPR
    coord: 343..377, score: 6.127
    coord: 203..237, score: 7.75
    coord: 238..272, score: 10.348
    coord: 168..202, score: 9.197
    coord: 273..307, score: 8.616
    coord: 308..342, score: 9
None    No IPR available    unknown    Coil    Coil
    coord: 178..198, scor
None    No IPR available    PANTHER    PTHR24015    FAMILY NOT NAMED
    coord: 70..404, score: 4.5E
None    No IPR available    PANTHER    PTHR24015:SF457    SUBFAMILY NOT NAMED
    coord: 70..404, score: 4.5E
None    No IPR available    PROFILE    PS51257    PROKAR_LIPOPROTEIN
    coord: 1..30, score:
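
The six PS51375 (PPR profile) hits listed above appear out of order, but their coordinates suggest a single block of tandem repeats. The sketch below sorts the coordinate pairs and checks two things: that each hit spans 35 residues, the canonical PPR motif length, and that each hit starts immediately after the previous one ends. Only the coordinates from the table are used; the scores are left out because some are truncated in this record. The variable names are illustrative.

```python
# (start, end) coordinates of the PS51375 hits, in the order listed in the table.
ps51375_hits = [
    (343, 377),
    (203, 237),
    (238, 272),
    (168, 202),
    (273, 307),
    (308, 342),
]

spans = sorted(ps51375_hits)

# Inclusive coordinates, so length is end - start + 1.
lengths = [end - start + 1 for start, end in spans]

# Tandem repeats: every span begins one residue after the previous span ends.
contiguous = all(
    following[0] == current[1] + 1
    for current, following in zip(spans, spans[1:])
)

region = (spans[0][0], spans[-1][1])

print("repeat lengths:", lengths)
print("contiguous:", contiguous)
print("PPR region:", region)
```

On these coordinates the hits tile residues 168 to 377 without gaps or overlaps, which fits six back-to-back PPR motifs inside the 70..404 PANTHER family match.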

The following gene(s) are paralogous to this gene:

None