Cla019500 (gene) Watermelon (97103) v1

Name: Cla019500
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Pentatricopeptide repeat-containing protein (AHRD V1 ***- D7L4J3_ARALL); contains Interpro domain(s) IPR002885 Pentatricopeptide repeat
Location: Chr3 : 5939383 .. 5941353 (+)
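
The location string can be split into its parts programmatically. The following is a minimal sketch; the function name parse_location is hypothetical, and it assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text):
    """Split a string like 'Chr3 : 5939383 .. 5941353 (+)' into its fields."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Chr3 : 5939383 .. 5941353 (+)")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, strand, span)
```

If the coordinates were instead 0-based and half-open, the span would be end - start, so the convention should be confirmed before relying on the computed length.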
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAGCTCTTACAATCTGGAAAACCAATGCATCTGATTGCATTCCGGAGAGAATTGGTGTGTTTCTACTCTACGATTGCGAAGGATTCACTTTACAAAAGGATTTCACCGTTGGGTGATCCTAATATCTCTGTAATTCCAGTTCTTGATCAGTGGGTGCTAGAAGGCAGGCCTGTTTGTAGAGAAGAACTTCGGAACATAATCAAGGAACTTAGAGTTTACAACCGGTTCAAACATGCTCTCGAGGTTTGATTTCTCAGTATTAACTTATTCATTACTTCGTGAAAGCATGATGGATTTTTCATTATGTGAAATGATTTCGTTGATTGAGGCAAATGGGTCCAAACGATTTGCCATTGTTGAATTCTTAAGATGAATAATGAGTTCTCTTTTCCAGCATGGTTTCTCCTATGAACTTGTTAAGCTACCAGGGTTTCATTTGCTCGAACCAATGAAGCTATGAAATTAGGTTGCCGTGAAACTACCCTGGACTCTTGAGGGAGAAGATCTTATTTTCTATTACCTCGTAGATCACAAGTGATACTTGATTTGAGCTCAAATTCATGTCTGATAGCTTTCTTTTGTCATTGATAGTTGCTTTGCCGTTCTGCCAATGATTTGTTGCACCTAGCTTGTTTGTCCTCTTAATTCTTTCTCTATCTAATTTGTTAGCTCTGCTGAAAAGTTTTTACTTGCAAGAATAACCAACACATTGGAGACTGATTTGCTTTCCATGTTCTTTCTGACATATCAAGGAAGCCACATTGTGGGGAATTCTGGGCTTATTGATTTCATGTTGAATTGTATTATTCCCTACGTTTTACCCGGTGTTCTTGTTTTGACCCATCTTAAAACTCCCTTCATCTTCAGATATCAAAATGGATGAGTGATAAAAGGTACCTTCCTTTATCGACTGTTGATGTCGCAACACGGATGAAATTGATCTTAAGAGTTCATGGGTTGGAACAAGTAGAAGATTATTTCAATATCATTCCTAGTCAATTGAAAGGGTTTCAGGTTTATATAGCTCTTCTGAACTGCTATGCCCAAGAAAAGTGTGTAGATAAAGCCAATGCCATCCTGCAAAAGATGAAGGAAATGGGTTTTGATAGAACTCCCCTCTCATACAATATCATGATGAATCTTTATTATCAAATTGGAGAATTTGAGAAACTAGATCCTCTGTTGCAAGAAATGAAAGAAAAGGGTGTTTCTTTCGATCCATTCACATATTGCATCCGACTAAGTGCATATGCTGCTGCATCTGATATTACAGGAGTTGATAAGATTGTGGAACAAATGGAATCAGATACCAATATTGTCCTAGATTGGAATTGTTATGCCATTGCTGCAAATAGTTGCCTTAAAGCTGGTTTAATAGACAAATTCGTTTCCATGCTGAGGAAATCAGAAGGTCTCCTAGCAACTGCCAAAAGGAAAGGTTATGCATTTGACTTCCTCCTTAAACTCTATGCTAAAAGTGGAAAGAAAGACGAGGTGCACCGTGTATGGAATCTCTACAAGAAGGAGAAAGTCAACAACAAAGGTTTCATCAGCATGATAAGATCACTTTTGATATTAGACAACATTGAAGGTGCGGAATGTATTTTCAAGGAGTGGGAGACCCGGAAACTGGTCTACGACTTGCGAATTCCAAACATGTTGGTTGAAGCGTATTGTAGAGAAGGTCTAATGGAGAAGGCTGAAGACTTTATAAACGAGACATTGATTGTAAGAGGCAAGTTTTCTGTCGAGTCATGGTGCTATTTAGCGAATGGATATCTTCAGAAAGATCAGCTACCACAGGCAGTTGATGCACTGAAGAAAGCAGCCAGTTTGTGTCCACCAGAACTGAACCACTTAAAGGAAATTTTGGCAACATTTCTGGATGGTAAGCAAGATGTGAAAGAAGCTGAGAAAGTCGTTAATTTGTTGAGGTCTGGAGCTAACTCACGCTCTTTTTGCTCCTGA

mRNA sequence

ATGAAGCTCTTACAATCTGGAAAACCAATGCATCTGATTGCATTCCGGAGAGAATTGGTGTGTTTCTACTCTACGATTGCGAAGGATTCACTTTACAAAAGGATTTCACCGTTGGGTGATCCTAATATCTCTGTAATTCCAGTTCTTGATCAGTGGGTGCTAGAAGGCAGGCCTGTTTGTAGAGAAGAACTTCGGAACATAATCAAGGAACTTAGAGTTTACAACCGGTTCAAACATGCTCTCGAGATATCAAAATGGATGAGTGATAAAAGGTACCTTCCTTTATCGACTGTTGATGTCGCAACACGGATGAAATTGATCTTAAGAGTTCATGGGTTGGAACAAGTAGAAGATTATTTCAATATCATTCCTAGTCAATTGAAAGGGTTTCAGGTTTATATAGCTCTTCTGAACTGCTATGCCCAAGAAAAGTGTGTAGATAAAGCCAATGCCATCCTGCAAAAGATGAAGGAAATGGGTTTTGATAGAACTCCCCTCTCATACAATATCATGATGAATCTTTATTATCAAATTGGAGAATTTGAGAAACTAGATCCTCTGTTGCAAGAAATGAAAGAAAAGGGTGTTTCTTTCGATCCATTCACATATTGCATCCGACTAAGTGCATATGCTGCTGCATCTGATATTACAGGAGTTGATAAGATTGTGGAACAAATGGAATCAGATACCAATATTGTCCTAGATTGGAATTGTTATGCCATTGCTGCAAATAGTTGCCTTAAAGCTGGTTTAATAGACAAATTCGTTTCCATGCTGAGGAAATCAGAAGGTCTCCTAGCAACTGCCAAAAGGAAAGGTTATGCATTTGACTTCCTCCTTAAACTCTATGCTAAAAGTGGAAAGAAAGACGAGGTGCACCGTGTATGGAATCTCTACAAGAAGGAGAAAGTCAACAACAAAGGTTTCATCAGCATGATAAGATCACTTTTGATATTAGACAACATTGAAGGTGCGGAATGTATTTTCAAGGAGTGGGAGACCCGGAAACTGGTCTACGACTTGCGAATTCCAAACATGTTGGTTGAAGCGTATTGTAGAGAAGGTCTAATGGAGAAGGCTGAAGACTTTATAAACGAGACATTGATTGTAAGAGGCAAGTTTTCTGTCGAGTCATGGTGCTATTTAGCGAATGGATATCTTCAGAAAGATCAGCTACCACAGGCAGTTGATGCACTGAAGAAAGCAGCCAGTTTGTGTCCACCAGAACTGAACCACTTAAAGGAAATTTTGGCAACATTTCTGGATGGTAAGCAAGATGTGAAAGAAGCTGAGAAAGTCGTTAATTTGTTGAGGTCTGGAGCTAACTCACGCTCTTTTTGCTCCTGA

Coding sequence (CDS)

ATGAAGCTCTTACAATCTGGAAAACCAATGCATCTGATTGCATTCCGGAGAGAATTGGTGTGTTTCTACTCTACGATTGCGAAGGATTCACTTTACAAAAGGATTTCACCGTTGGGTGATCCTAATATCTCTGTAATTCCAGTTCTTGATCAGTGGGTGCTAGAAGGCAGGCCTGTTTGTAGAGAAGAACTTCGGAACATAATCAAGGAACTTAGAGTTTACAACCGGTTCAAACATGCTCTCGAGATATCAAAATGGATGAGTGATAAAAGGTACCTTCCTTTATCGACTGTTGATGTCGCAACACGGATGAAATTGATCTTAAGAGTTCATGGGTTGGAACAAGTAGAAGATTATTTCAATATCATTCCTAGTCAATTGAAAGGGTTTCAGGTTTATATAGCTCTTCTGAACTGCTATGCCCAAGAAAAGTGTGTAGATAAAGCCAATGCCATCCTGCAAAAGATGAAGGAAATGGGTTTTGATAGAACTCCCCTCTCATACAATATCATGATGAATCTTTATTATCAAATTGGAGAATTTGAGAAACTAGATCCTCTGTTGCAAGAAATGAAAGAAAAGGGTGTTTCTTTCGATCCATTCACATATTGCATCCGACTAAGTGCATATGCTGCTGCATCTGATATTACAGGAGTTGATAAGATTGTGGAACAAATGGAATCAGATACCAATATTGTCCTAGATTGGAATTGTTATGCCATTGCTGCAAATAGTTGCCTTAAAGCTGGTTTAATAGACAAATTCGTTTCCATGCTGAGGAAATCAGAAGGTCTCCTAGCAACTGCCAAAAGGAAAGGTTATGCATTTGACTTCCTCCTTAAACTCTATGCTAAAAGTGGAAAGAAAGACGAGGTGCACCGTGTATGGAATCTCTACAAGAAGGAGAAAGTCAACAACAAAGGTTTCATCAGCATGATAAGATCACTTTTGATATTAGACAACATTGAAGGTGCGGAATGTATTTTCAAGGAGTGGGAGACCCGGAAACTGGTCTACGACTTGCGAATTCCAAACATGTTGGTTGAAGCGTATTGTAGAGAAGGTCTAATGGAGAAGGCTGAAGACTTTATAAACGAGACATTGATTGTAAGAGGCAAGTTTTCTGTCGAGTCATGGTGCTATTTAGCGAATGGATATCTTCAGAAAGATCAGCTACCACAGGCAGTTGATGCACTGAAGAAAGCAGCCAGTTTGTGTCCACCAGAACTGAACCACTTAAAGGAAATTTTGGCAACATTTCTGGATGGTAAGCAAGATGTGAAAGAAGCTGAGAAAGTCGTTAATTTGTTGAGGTCTGGAGCTAACTCACGCTCTTTTTGCTCCTGA

Protein sequence

MKLLQSGKPMHLIAFRRELVCFYSTIAKDSLYKRISPLGDPNISVIPVLDQWVLEGRPVCREELRNIIKELRVYNRFKHALEISKWMSDKRYLPLSTVDVATRMKLILRVHGLEQVEDYFNIIPSQLKGFQVYIALLNCYAQEKCVDKANAILQKMKEMGFDRTPLSYNIMMNLYYQIGEFEKLDPLLQEMKEKGVSFDPFTYCIRLSAYAAASDITGVDKIVEQMESDTNIVLDWNCYAIAANSCLKAGLIDKFVSMLRKSEGLLATAKRKGYAFDFLLKLYAKSGKKDEVHRVWNLYKKEKVNNKGFISMIRSLLILDNIEGAECIFKEWETRKLVYDLRIPNMLVEAYCREGLMEKAEDFINETLIVRGKFSVESWCYLANGYLQKDQLPQAVDALKKAASLCPPELNHLKEILATFLDGKQDVKEAEKVVNLLRSGANSRSFCS
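
The protein sequence follows from the CDS by codon-wise translation. The sketch below illustrates this on the first 33 bases of the CDS only; it assumes the standard nuclear genetic code, and the helper name translate is hypothetical rather than part of any tool used to build this record.

```python
# Standard genetic code, laid out in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # codons starting with T
    "LLLLPPPPHHQQRRRR"   # codons starting with C
    "IIIMTTTTNNKKSSRR"   # codons starting with A
    "VVVVAAAADDEEGGGG"   # codons starting with G
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First 33 bases of the CDS listed above.
cds_prefix = "ATGAAGCTCTTACAATCTGGAAAACCAATGCAT"
print(translate(cds_prefix))
```

The output can be compared against the first eleven residues of the protein sequence above; running the same function over the full CDS is the natural consistency check.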
BLAST of Cla019500 vs. Swiss-Prot
Match: PP166_ARATH (Pentatricopeptide repeat-containing protein At2g20710, mitochondrial OS=Arabidopsis thaliana GN=At2g20710 PE=2 SV=1)

HSP 1 Score: 327.4 bits (838), Expect = 2.4e-88
Identity = 173/416 (41.59%), Positives = 271/416 (65.14%), Query Frame = 1

Query: 29  DSLYKRISPLGDPNISVIPVLDQWVLEGRPVCREELRNIIKELRVYNRFKHALEISKWMS 88
           D+L +R++  GDP+ S+I VLD W+ +G  V   EL +IIK LR ++RF HAL+IS WMS
Sbjct: 38  DTLQRRVARSGDPSASIIKVLDGWLDQGNLVKTSELHSIIKMLRKFSRFSHALQISDWMS 97

Query: 89  DKRYLPLSTVDVATRMKLILRVHGLEQVEDYFNIIPSQLKGFQVYIALLNCYAQEKCVDK 148
           + R   +S  DVA R+ LI +V GL + E +F  IP + + + +Y ALLNCYA +K + K
Sbjct: 98  EHRVHEISEGDVAIRLDLIAKVGGLGEAEKFFETIPMERRNYHLYGALLNCYASKKVLHK 157

Query: 149 ANAILQKMKEMGFDRTPLSYNIMMNLYYQIGEFEKLDPLLQEMKEKGVSFDPFTYCIRLS 208
           A  + Q+MKE+GF +  L YN+M+NLY + G++  ++ LL+EM+++ V  D FT   RL 
Sbjct: 158 AEQVFQEMKELGFLKGCLPYNVMLNLYVRTGKYTMVEKLLREMEDETVKPDIFTVNTRLH 217

Query: 209 AYAAASDITGVDKIVEQMESDTNIVLDWNCYAIAANSCLKAGLIDKFVSMLRKSEGLLAT 268
           AY+  SD+ G++K + + E+D  + LDW  YA  AN  +KAGL +K + MLRKSE ++  
Sbjct: 218 AYSVVSDVEGMEKFLMRCEADQGLHLDWRTYADTANGYIKAGLTEKALEMLRKSEQMVNA 277

Query: 269 AKRKGYAFDFLLKLYAKSGKKDEVHRVWNLYKK-EKVNNKGFISMIRSLLILDNIEGAEC 328
            KRK +A++ L+  Y  +GKK+EV+R+W+LYK+ +   N G+IS+I +LL +D+IE  E 
Sbjct: 278 QKRK-HAYEVLMSFYGAAGKKEEVYRLWSLYKELDGFYNTGYISVISALLKMDDIEEVEK 337

Query: 329 IFKEWETRKLVYDLRIPNMLVEAYCREGLMEKAEDFINETLIVRGKFSVE---SWCYLAN 388
           I +EWE    ++D+RIP++L+  YC++G+MEKAE+ +N   I+  K+ VE   +W  LA 
Sbjct: 338 IMEEWEAGHSLFDIRIPHLLITGYCKKGMMEKAEEVVN---ILVQKWRVEDTSTWERLAL 397

Query: 389 GYLQKDQLPQAVDALKKAASLCPPELNHLKEILAT---FLDGKQDVKEAEKVVNLL 438
           GY    ++ +AV+  K+A  +  P     + +L +   +L+G++D++   K++ LL
Sbjct: 398 GYKMAGKMEKAVEKWKRAIEVSKPGWRPHQVVLMSCVDYLEGQRDMEGLRKILRLL 449
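
The percentages in an HSP summary line are the matching-column counts divided by the alignment length. As a rough sketch, the line can be parsed and the percentages recomputed as below; the function name parse_hsp_stats is hypothetical, and the regular expression only assumes the "label = count/length (percent%)" layout seen in this report.

```python
import re

def parse_hsp_stats(line):
    """Return {label: (count, length, recomputed_percent)} for each 'x/y (z%)' field."""
    stats = {}
    for label, count, length, _reported in re.findall(
        r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)", line
    ):
        count, length = int(count), int(length)
        # Recompute the percentage from the raw counts instead of trusting the text.
        stats[label] = (count, length, round(100 * count / length, 2))
    return stats

line = "Identity = 173/416 (41.59%), Positives = 271/416 (65.14%), Query Frame = 1"
for label, (count, length, percent) in parse_hsp_stats(line).items():
    print(f"{label}: {count}/{length} = {percent}%")
```

Fields without a count/length pair, such as the query frame, are ignored by this pattern.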

BLAST of Cla019500 vs. Swiss-Prot
Match: PP334_ARATH (Pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Arabidopsis thaliana GN=At4g21705 PE=2 SV=1)

HSP 1 Score: 266.9 bits (681), Expect = 3.9e-70
Identity = 165/450 (36.67%), Positives = 254/450 (56.44%), Query Frame = 1

Query: 9   PMHLIAFRRELVCFYSTIAKDSLYKRISPLGDPNISVIPVLDQWVLEGRPVCREELRNII 68
           P +LIA R     + + + K +LY +ISPLGDP  SV P L  WV  G+ V   EL  I+
Sbjct: 8   PANLIASR---YYYTNRVKKTTLYSKISPLGDPKSSVYPELQNWVQCGKKVSVAELIRIV 67

Query: 69  KELRVYNRFKHALEISKWMSDKRYLPLSTVDVATRMKLILRVHGLEQVEDYFNIIPSQLK 128
            +LR   RF HALE+SKWM++      S  + A  + LI RV+G    E+YF  +  Q K
Sbjct: 68  HDLRRRKRFLHALEVSKWMNETGVCVFSPTEHAVHLDLIGRVYGFVTAEEYFENLKEQYK 127

Query: 129 GFQVYIALLNCYAQEKCVDKANAILQKMKEMGFDRTPLSYNIMMNLYYQIGEFEKLDPLL 188
             + Y ALLNCY +++ V+K+    +KMKEMGF  + L+YN +M LY  IG+ EK+  +L
Sbjct: 128 NDKTYGALLNCYVRQQNVEKSLLHFEKMKEMGFVTSSLTYNNIMCLYTNIGQHEKVPKVL 187

Query: 189 QEMKEKGVSFDPFTYCIRLSAYAAASDITGVDKIVEQMESDTNIVLDWNCYAIAANSCLK 248
           +EMKE+ V+ D ++Y I ++A+ A  D+  +   +  ME   +I +DWN YA+AA   + 
Sbjct: 188 EEMKEENVAPDNYSYRICINAFGAMYDLERIGGTLRDMERRQDITMDWNTYAVAAKFYID 247

Query: 249 AGLIDKFVSMLRKSEGLLATAKRKGYAFDFLLKLYAKSGKKDEVHRVWNLYK---KEKVN 308
            G  D+ V +L+ SE  L   K+ G  ++ L+ LYA+ GKK EV R+W+L K   K ++ 
Sbjct: 248 GGDCDRAVELLKMSENRL--EKKDGEGYNHLITLYARLGKKIEVLRLWDLEKDVCKRRI- 307

Query: 309 NKGFISMIRSLLILDNIEGAECIFKEWETRKLVYDLRIPNMLVEAYCREGLMEKAEDFIN 368
           N+ ++++++SL+ +D +  AE +  EW++    YD R+PN ++  Y  + + EKAE  + 
Sbjct: 308 NQDYLTVLQSLVKIDALVEAEEVLTEWKSSGNCYDFRVPNTVIRGYIGKSMEEKAEAML- 367

Query: 369 ETLIVRGKFSV-ESWCYLANGYLQKDQLPQAVDALKKAASL------CPPELNHLKEILA 428
           E L  RGK +  ESW  +A  Y +K  L  A   +K A  +        P L  +  +L 
Sbjct: 368 EDLARRGKATTPESWELVATAYAEKGTLENAFKCMKTALGVEVGSRKWRPGLTLVTSVL- 427

Query: 429 TFLDGKQDVKEAEKVVNLLRS--GANSRSF 447
           +++  +  +KE E  V  LR+  G N + +
Sbjct: 428 SWVGDEGSLKEVESFVASLRNCIGVNKQMY 449

BLAST of Cla019500 vs. Swiss-Prot
Match: PPR3_ARATH (Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana GN=At1g02150 PE=2 SV=2)

HSP 1 Score: 245.4 bits (625), Expect = 1.2e-63
Identity = 141/422 (33.41%), Positives = 233/422 (55.21%), Query Frame = 1

Query: 29  DSLYKRISPLGDPNISVIPVLDQWVLEGRPVCREELRNIIKELRVYNRFKHALEISKWMS 88
           +++YK+IS +  P +    VL+QW   GR + + EL  ++KELR Y R   ALE+  WM+
Sbjct: 67  NAIYKKISLMEKPELGAASVLNQWEKAGRKLTKWELCRVVKELRKYKRANQALEVYDWMN 126

Query: 89  DK-RYLPLSTVDVATRMKLILRVHGLEQVEDYFNIIPSQLKGFQVYIALLNCYAQEKCVD 148
           ++     LS  D A ++ LI +V G+   E++F  +P   K  +VY +LLN Y + K  +
Sbjct: 127 NRGERFRLSASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGSLLNAYVRAKSRE 186

Query: 149 KANAILQKMKEMGFDRTPLSYNIMMNLYYQIGEFEKLDPLLQEMKEKGVSFDPFTYCIRL 208
           KA A+L  M++ G+   PL +N+MM LY  + E++K+D ++ EMK+K +  D ++Y I L
Sbjct: 187 KAEALLNTMRDKGYALHPLPFNVMMTLYMNLREYDKVDAMVFEMKQKDIRLDIYSYNIWL 246

Query: 209 SAYAAASDITGVDKIVEQMESDTNIVLDWNCYAIAANSCLKAGLIDKFVSMLRKSEGLLA 268
           S+  +   +  ++ + +QM+SD +I  +W  ++  A   +K G  +K    LRK E  + 
Sbjct: 247 SSCGSLGSVEKMELVYQQMKSDVSIYPNWTTFSTMATMYIKMGETEKAEDALRKVEARIT 306

Query: 269 TAKRKGYAFDFLLKLYAKSGKKDEVHRVWNLYKK--EKVNNKGFISMIRSLLILDNIEGA 328
              R  Y   +LL LY   G K E++RVW++YK     + N G+ +++ SL+ + +IEGA
Sbjct: 307 GRNRIPY--HYLLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHALVSSLVRMGDIEGA 366

Query: 329 ECIFKEWETRKLVYDLRIPNMLVEAYCREGLMEKAEDFINETLIVRGKFSVESWCYLANG 388
           E +++EW   K  YD RIPN+L+ AY +   +E AE   +  + + GK S  +W  LA G
Sbjct: 367 EKVYEEWLPVKSSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGGKPSSSTWEILAVG 426

Query: 389 YLQKDQLPQAVDALKKAASLCPPELNHLKEILA-----TFLDGKQDVKEAEKVVNLLRSG 443
           + +K  + +A+  L+ A S      N   ++L         + + DV   E V+ LLR  
Sbjct: 427 HTRKRCISEALTCLRNAFS-AEGSSNWRPKVLMLSGFFKLCEEESDVTSKEAVLELLRQS 485

BLAST of Cla019500 vs. Swiss-Prot
Match: PPR86_ARATH (Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana GN=At1g60770 PE=2 SV=1)

HSP 1 Score: 229.2 bits (583), Expect = 9.0e-59
Identity = 134/427 (31.38%), Positives = 237/427 (55.50%), Query Frame = 1

Query: 28  KDSLYKRISPLGDPNISVIPVLDQWVLEGRPVCREELRNIIKELRVYNRFKHALEISKWM 87
           ++ LY R+   G   + V   L+Q++   + V + E+ + IK+LR    +  AL++S+ M
Sbjct: 22  EEPLYNRLFKDGGTEVKVRQQLNQFLKGTKHVFKWEVGDTIKKLRNRGLYYPALKLSEVM 81

Query: 88  SDKRYLPLSTVDVATRMKLILRVHGLEQVEDYFNIIPSQLKGFQVYIALLNCYAQEKCVD 147
            ++R +  +  D A  + L+ +   +   E+YF  +P   K    Y +LLNCY +E   +
Sbjct: 82  -EERGMNKTVSDQAIHLDLVAKAREITAGENYFVDLPETSKTELTYGSLLNCYCKELLTE 141

Query: 148 KANAILQKMKEMGFDRTPLSYNIMMNLYYQIGEFEKLDPLLQEMKEKGVSFDPFTYCIRL 207
           KA  +L KMKE+    + +SYN +M LY + GE EK+  ++QE+K + V  D +TY + +
Sbjct: 142 KAEGLLNKMKELNITPSSMSYNSLMTLYTKTGETEKVPAMIQELKAENVMPDSYTYNVWM 201

Query: 208 SAYAAASDITGVDKIVEQMESDTNIVLDWNCYAIAANSCLKAGLIDKFVSMLRKSEGLLA 267
            A AA +DI+GV++++E+M  D  +  DW  Y+  A+  + AGL  K    L++ E  + 
Sbjct: 202 RALAATNDISGVERVIEEMNRDGRVAPDWTTYSNMASIYVDAGLSQKAEKALQELE--MK 261

Query: 268 TAKRKGYAFDFLLKLYAKSGKKDEVHRVWNLYKK--EKVNNKGFISMIRSLLILDNIEGA 327
             +R   A+ FL+ LY + GK  EV+R+W   +    K +N  +++MI+ L+ L+++ GA
Sbjct: 262 NTQRDFTAYQFLITLYGRLGKLTEVYRIWRSLRLAIPKTSNVAYLNMIQVLVKLNDLPGA 321

Query: 328 ECIFKEWETRKLVYDLRIPNMLVEAYCREGLMEKAEDFINETLIVRGKFSVESWCYLANG 387
           E +FKEW+     YD+RI N+L+ AY +EGL++KA +   +     GK + ++W    + 
Sbjct: 322 ETLFKEWQANCSTYDIRIVNVLIGAYAQEGLIQKANELKEKAPRRGGKLNAKTWEIFMDY 381

Query: 388 YLQKDQLPQAVDALKKAAS---------LCPPELNHLKEILATFLDGKQDVKEAEKVVNL 444
           Y++   + +A++ + KA S         L  PE       L ++ + K+DV  AE ++ +
Sbjct: 382 YVKSGDMARALECMSKAVSIGKGDGGKWLPSPE---TVRALMSYFEQKKDVNGAENLLEI 441

BLAST of Cla019500 vs. Swiss-Prot
Match: PPR61_ARATH (Putative pentatricopeptide repeat-containing protein At1g28020 OS=Arabidopsis thaliana GN=At1g28020 PE=3 SV=2)

HSP 1 Score: 228.0 bits (580), Expect = 2.0e-58
Identity = 136/339 (40.12%), Positives = 196/339 (57.82%), Query Frame = 1

Query: 34  RISPLGDPNISVIPVLDQWVLEGRPVCREELRNIIKELRVYNRFKHALEISKWMSDKRYL 93
           RI+     N  +IPVL+QW  +G  V    +R IIK+LR  ++   AL++S+WMS ++  
Sbjct: 41  RITDALHRNAQIIPVLEQWRQQGNQVNPSHVRVIIKKLRDSDQSLQALQVSEWMSKEKIC 100

Query: 94  PLSTVDVATRMKLILRVHGLEQVEDYFNIIPSQLKGFQVYIALLNCYAQ-EKCVDKANAI 153
            L   D A R+ LI  V GLE+ E +F  IP   +G  VY +LLN YA+ +K + KA A 
Sbjct: 101 NLIPEDFAARLHLIENVVGLEEAEKFFESIPKNARGDSVYTSLLNSYARSDKTLCKAEAT 160

Query: 154 LQKMKEMGFDRTPLSYNIMMNLYYQIGEFEKLDPLLQEMKEKGVSFDPFTYCIRLSAYAA 213
            QKM+++G    P+ YN MM+LY  +   EK++ LL EMK+  V  D  T    L  Y+A
Sbjct: 161 FQKMRDLGLLLRPVPYNAMMSLYSALKNREKVEELLLEMKDNDVEADNVTVNNVLKLYSA 220

Query: 214 ASDITGVDKIVEQMESDTNIVLDWNCYAIAANSCLKAGLIDKFVSMLRKSEGLLATAKRK 273
             D+T ++K + + E    I L+W+     A + L+A    K + MLR +E L+     K
Sbjct: 221 VCDVTEMEKFLNKWEGIHGIKLEWHTTLDMAKAYLRARSSGKAMKMLRLTEQLVDQKSLK 280

Query: 274 GYAFDFLLKLYAKSGKKDEVHRVWNLYKKE--KVNNKGFISMIRSLLILDNIEGAECIFK 333
             A+D L+KLY ++G ++EV RVW LYK +  + +N G+ ++IRSLL +D+I GAE I+K
Sbjct: 281 S-AYDHLMKLYGEAGNREEVLRVWKLYKSKIGERDNNGYRTVIRSLLKVDDIVGAEEIYK 340

Query: 334 EWETRKLVYDLRIPNMLVEAYCREGLMEKAEDFINETLI 370
            WE+  L +D RIP ML   Y   G+ EKAE  +N   I
Sbjct: 341 VWESLPLEFDHRIPTMLASGYRDRGMTEKAEKLMNSKTI 378
HSP 2 Score: 88.6 bits (218), Expect = 1.9e-16
Identity = 54/175 (30.86%), Positives = 91/175 (52.00%), Query Frame = 1

Query: 42  NISVIPVLDQWVLEGRPVCREELRNIIKELRVYNRFKHALEISKWMSDKRYLPLSTVDVA 101
           N  V P+L+QW  + +P    +L+ +IK LR   +F  AL++S+WM +K+   L   D A
Sbjct: 384 NKPVTPLLEQWGDQMKP---SDLKCLIKNLRDSKQFSKALQVSEWMGEKQVCNLYLEDYA 443

Query: 102 TRMKLILRVHGLEQVEDYFNIIPSQLKGFQVYIALLNCYAQ--EKCVDKANAILQKMKEM 161
            R+ L   V GLE+ E YF  IP  +K + VY+ALL+ YA+  +   +  + IL++M+E 
Sbjct: 444 ARLYLTENVLGLEEAEKYFENIPENMKDYSVYVALLSSYAKSDKNLGNMVDEILREMEEN 503

Query: 162 GFDRTPLSYNIMMNLYYQIGEFEKLDPLLQEM-KEKGVSFDPFTYCIRLSAYAAA 214
             D   ++ N ++ +Y    + + ++  ++    E G+  +  T      AY  A
Sbjct: 504 NVDPDLITVNHVLKVYAAESKIQAMEMFMRRWGTEDGIKLERGTMIAMAKAYVKA 555

BLAST of Cla019500 vs. TrEMBL
Match: A0A0A0LV44_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G073790 PE=4 SV=1)

HSP 1 Score: 679.1 bits (1751), Expect = 3.7e-192
Identity = 333/443 (75.17%), Positives = 388/443 (87.58%), Query Frame = 1

Query: 1   MKLLQSGKPMHLIAFRRELVCFYSTIAKDSLYKRISPLGDPNISVIPVLDQWVLEGRPVC 60
           MKLLQS KP++LIAFR E V FYST+ KD+LY+RISP+GDPNISV P+LDQWVLEGR V 
Sbjct: 1   MKLLQSLKPINLIAFRSEFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ 60

Query: 61  REELRNIIKELRVYNRFKHALEISKWMSDKRYLPLSTVDVATRMKLILRVHGLEQVEDYF 120
           ++ELR+IIKELRVY RFKHALEISKWMSDKRY PLST D+A RM LILRVHGLEQVEDYF
Sbjct: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF 120

Query: 121 NIIPSQLKGFQVYIALLNCYAQEKCVDKANAILQKMKEMGFDRTPLSYNIMMNLYYQIGE 180
           + +PSQLK +QV+IALLNCYA EKCVDKANA +QK+KEMGF  +PL YNIMMNLY+QIGE
Sbjct: 121 DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMNLYHQIGE 180

Query: 181 FEKLDPLLQEMKEKGVSFDPFTYCIRLSAYAAASDITGVDKIVEQMESDTNIVLDWNCYA 240
           FE+LD LL+EMKE+GV +D FTY IR+SAYAAASD  G++KI+EQMES+ +IVLDWNCY 
Sbjct: 181 FERLDSLLKEMKERGVYYDRFTYSIRISAYAAASDFRGIEKIMEQMESNPSIVLDWNCYV 240

Query: 241 IAANSCLKAGLIDKFVSMLRKSEGLLATAKRKGYAFDFLLKLYAKSGKKDEVHRVWNLYK 300
           IAAN+  K GLIDK +SML+KSEGLLA  K+KG+AF+  LKLYA++GKKDE+HR+WNLYK
Sbjct: 241 IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK 300

Query: 301 KEKVNNKGFISMIRSLLILDNIEGAECIFKEWETRKLVYDLRIPNMLVEAYCREGLMEKA 360
           KEK+ NKGFISMI SL +LD+I+GAE I+KEWETRKL YDLRIPN+LV+AYCR GLMEKA
Sbjct: 301 KEKIFNKGFISMITSLFVLDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKA 360

Query: 361 EDFINETLIVRGKFSVESWCYLANGYLQKDQLPQAVDALKKAASLCPPELNHLKEILATF 420
           E  +NE +IVR KFSVESWCYLA+GYLQKDQLPQAV+ LK AAS+CP  LN++KEILA F
Sbjct: 361 EVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVKEILAAF 420

Query: 421 LDGKQDVKEAEKVVNLLRSGANS 444
           LDGKQDV+E EKVVNLLR   +S
Sbjct: 421 LDGKQDVEETEKVVNLLREKDDS 443

BLAST of Cla019500 vs. TrEMBL
Match: W9RYV8_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_014527 PE=4 SV=1)

HSP 1 Score: 454.9 bits (1169), Expect = 1.1e-124
Identity = 225/428 (52.57%), Positives = 317/428 (74.07%), Query Frame = 1

Query: 22  FYSTIA-------KDSLYKRISPLGDPNISVIPVLDQWVLEGRPVCREELRNIIKELRVY 81
           FYS+I+       ++ LY+RISP+G+PN+SV+P+L+QWV EG+PV + EL+ IIKELR++
Sbjct: 43  FYSSISNPTNTNIEERLYRRISPVGNPNVSVVPILEQWVQEGKPVSQIELQRIIKELRIF 102

Query: 82  NRFKHALEISKWMSDKRYLPLSTVDVATRMKLILRVHGLEQVEDYFNIIPSQLKGFQVYI 141
            RF HALEIS+WMSDKRY+ LST DVA R+ LI +VHGL++  +YFN IP+ LK F+VY 
Sbjct: 103 KRFHHALEISQWMSDKRYMHLSTKDVAARLDLISKVHGLDEAVNYFNDIPAALKIFEVYS 162

Query: 142 ALLNCYAQEKCVDKANAILQKMKEMGFDRTPLSYNIMMNLYYQIGEFEKLDPLLQEMKEK 201
            LLNCYA EK V+KA  I+Q+M++M   +TP+ YNIMMNLYY   +++KLD L+ EM+EK
Sbjct: 163 TLLNCYANEKSVEKAEEIMQQMRDMWDHKTPICYNIMMNLYYHTEDYDKLDSLMSEMEEK 222

Query: 202 GVSFDPFTYCIRLSAYAAASDITGVDKIVEQMESDTNIVLDWNCYAIAANSCLKAGLIDK 261
           G+ FD +T+ IR+SAY A SD+ GV+KI+E++ES   +VLDWN Y++AA+S L  GL+DK
Sbjct: 223 GIPFDTYTFSIRMSAYVAISDVEGVNKIMEKVESYPGLVLDWNFYSVAASSHLNVGLVDK 282

Query: 262 FVSMLRKSEGLLATAKRKGYAFDFLLKLYAKSGKKDEVHRVWNLYKKEKVNNKGFISMIR 321
            V ML+K E  L T  RK +AFD LLK YA  G +DE++R+W+LYKKEK+ NKG+ SMI 
Sbjct: 283 AVEMLKKLEDRLPTVNRKSFAFDALLKSYALIGNRDELYRIWDLYKKEKLFNKGYKSMIC 342

Query: 322 SLLILDNIEGAECIFKEWETRKLVYDLRIPNMLVEAYCREGLMEKAEDFINETLIVRGKF 381
           SLL +D++EGA  I++EWE+R L +D  IP+++++ Y R+GL+EKAE  +++ +    + 
Sbjct: 343 SLLRIDDVEGAAKIYEEWESRGLPFDFLIPDLMIDTYFRKGLLEKAEALVDKAIAKGDES 402

Query: 382 SVESWCYLANGYLQKDQLPQAVDALKKAASLCPPELNHLKEILAT---FLDGKQDVKEAE 440
           SVE W YL    L+ +Q+ +AV+ALKKA S+  P     KE L+    +LDGK D++ A+
Sbjct: 403 SVELWYYLVIRSLEHNQISKAVEALKKAISVWSPGSKPSKETLSVCLEYLDGKVDIEGAQ 462

BLAST of Cla019500 vs. TrEMBL
Match: A0A068TUA9_COFCA (Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00029177001 PE=4 SV=1)

HSP 1 Score: 443.7 bits (1140), Expect = 2.6e-121
Identity = 217/423 (51.30%), Positives = 307/423 (72.58%), Query Frame = 1

Query: 29  DSLYKRISPLGDPNISVIPVLDQWVLEGRPVCREELRNIIKELRVYNRFKHALEISKWMS 88
           ++LYKRISPLGDP ISV+PVLDQW  EGRPV ++ L +I+KEL+ Y R+KHALE+S+WM+
Sbjct: 40  NNLYKRISPLGDPKISVVPVLDQWAAEGRPVHKQYLESIVKELKAYKRYKHALEVSRWMT 99

Query: 89  DKRYLPLSTVDVATRMKLILRVHGLEQVEDYFNIIPSQLKGFQVYIALLNCYAQEKCVDK 148
           +KRY+PL  +DV+ ++ LI RVHGL++ E+ FN + S+LKGF  +IALLNCY  EK V+K
Sbjct: 100 EKRYMPLRELDVSIQINLIHRVHGLKEAENCFNNVSSKLKGFNAHIALLNCYVHEKSVEK 159

Query: 149 ANAILQKMKEMGFDRTPLSYNIMMNLYYQIGEFEKLDPLLQEMKEKGVSFDPFTYCIRLS 208
           A A++QKM+EMG+  +PL YN+MMNL+Y +G ++KLD L+ EM+ +G+ FDPFT  IRLS
Sbjct: 160 AEALMQKMREMGYANSPLPYNLMMNLHYGLGNYKKLDDLMNEMEGRGIKFDPFTLTIRLS 219

Query: 209 AYAAASDITGVDKIVEQMESDTNIVLDWNCYAIAANSCLKAGLIDKFVSMLRKSEGLLAT 268
           AYAAASD  GVDKI + ME D  IV D++ YA+ A   LK G +DK + +L+K E L  T
Sbjct: 220 AYAAASDAEGVDKIAKMMEIDPLIVPDFSVYAVVAQGYLKVGQLDKALPILKKMEELAVT 279

Query: 269 AKRKGYAFDFLLKLYAKSGKKDEVHRVWNLYK-KEKVNNKGFISMIRSLLILDNIEGAEC 328
            ++  + +DFLLKLYA   ++D+V R+W +YK K+K+NNKG+++M+ SLL   ++ G E 
Sbjct: 280 TRKGKFPYDFLLKLYAGMQRRDDVLRIWEMYKQKQKINNKGYMTMMSSLLSFGDVGGIED 339

Query: 329 IFKEWETRKLVYDLRIPNMLVEAYCREGLMEKAEDFINETLIVRGKFSVESWCYLANGYL 388
           IFKEWE+R L YD R+PN+L+ AYCR G +EKAE  I++ L   G+    +W Y+A GY+
Sbjct: 340 IFKEWESRGLSYDFRVPNVLIHAYCRNGELEKAEALIDKGLSEGGEPFATTWFYMALGYI 399

Query: 389 QKDQLPQAVDALKKAASLCPPELNHLKEILATFLDGKQ--DVKEAEKVVNLLRSGANSRS 448
           + +Q+ +AV+ALKKA   CPP+     E L T L+  +  DV+++E+ + L++      S
Sbjct: 400 KDNQISKAVEALKKAILKCPPDHKPNTETLNTCLEHMERGDVEKSEEFIKLIK----KES 458

BLAST of Cla019500 vs. TrEMBL
Match: M1CWK5_SOLTU (Uncharacterized protein OS=Solanum tuberosum GN=PGSC0003DMG400029687 PE=4 SV=1)

HSP 1 Score: 419.5 bits (1077), Expect = 5.2e-114
Identity = 209/427 (48.95%), Positives = 302/427 (70.73%), Query Frame = 1

Query: 4   LQSGKPMHLIAFRRELVC-FYSTIA--------KDSLYKRISPLGDPNISVIPVLDQWVL 63
           L+   P+H   F+R ++  FY T +        +D ++ RI+PLG P++S++PVL+QWV 
Sbjct: 9   LRKQNPIH---FQRTIINRFYGTTSTKQDKNQKRDWVFARIAPLGSPDVSMVPVLEQWVE 68

Query: 64  EGRPVCREELRNIIKELRVYNRFKHALEISKWMSDKRYLPLSTVDVATRMKLILRVHGLE 123
           EG+ V + EL+ IIK L  Y R+KHALE+S WM+D+RY PL   DVA R+ L+ +V GLE
Sbjct: 69  EGKTVVKSELQWIIKRLNSYKRYKHALEVSHWMTDRRYYPLQPADVAARINLMNKVKGLE 128

Query: 124 QVEDYFNIIPSQLKGFQVYIALLNCYAQEKCVDKANAILQKMKEMGFDRTPLSYNIMMNL 183
           +VE YFN IP  L+  +VY ALLNCY  EK V+KA AI+Q++++MGF +  L YN MMNL
Sbjct: 129 EVEKYFNSIPQMLRRPEVYTALLNCYTNEKSVEKAEAIMQQLRDMGFAKGTLCYNHMMNL 188

Query: 184 YYQIGEFEKLDPLLQEMKEKGVSFDPFTYCIRLSAYAAASDITGVDKIVEQMESDTNIVL 243
           YY+ G +EK+D ++ EM++KGV+FD FT  IRL+AYAAA D  G+DKI+  MESD  I+L
Sbjct: 189 YYKTGTWEKMDNMMNEMEQKGVNFDEFTLTIRLTAYAAAGDSEGMDKILAIMESDKQIIL 248

Query: 244 DWNCYAIAANSCLKAGLIDKFVSMLRKSEGLLATAKRKGYAFDFLLKLYAKSGKKDEVHR 303
            W+ Y+IAA   LK G ++K + +L K E ++   ++   A++ LLKLYA++GKK+EVHR
Sbjct: 249 HWDTYSIAAELYLKVGQVEKALELLSKLESMILNHEKSNGAYNCLLKLYAEAGKKEEVHR 308

Query: 304 VWNLYKKE-KVNNKGFISMIRSLLIL-DNIEGAECIFKEWETRKLVYDLRIPNMLVEAYC 363
           VW+LYK+  ++ NKG+I+++ +L+   D  EG E IF+EWE+  L YD R+P++L+ +YC
Sbjct: 309 VWDLYKQNMRILNKGYITVMSALMKFGDTTEGVEKIFEEWESEALSYDFRVPDVLIRSYC 368

Query: 364 REGLMEKAEDFINETLIVRGKFSVESWCYLANGYLQKDQLPQAVDALKKAASLCPPELNH 420
           R GL+EKA+  +++ +   G   V +WC+LANGY+ +D +P+AV+ALKKA S+CPP    
Sbjct: 369 RNGLLEKAKALMDKGISKGGVPWVTTWCHLANGYIHEDLVPEAVEALKKAISICPPNYKP 428

BLAST of Cla019500 vs. TrEMBL
Match: K4B957_SOLLC (Uncharacterized protein OS=Solanum lycopersicum PE=4 SV=1)

HSP 1 Score: 415.2 bits (1066), Expect = 9.9e-113
Identity = 204/424 (48.11%), Positives = 297/424 (70.05%), Query Frame = 1

Query: 22  FYSTIA-----KDSLYKRISPLGDPNISVIPVLDQWVLEGRPVCREELRNIIKELRVYNR 81
           FY T +     +D ++ RI+PLG P++S++PVL+QWV EG+ V + EL+ IIK L  Y R
Sbjct: 5   FYGTTSTKYKKRDWVFARIAPLGSPDMSMVPVLEQWVEEGKTVAKGELQWIIKRLNSYKR 64

Query: 82  FKHALEISKWMSDKRYLPLSTVDVATRMKLILRVHGLEQVEDYFNIIPSQLKGFQVYIAL 141
           +KHALE+S WM+D+RYLPL   DVA R+ L+ +V GLE+VE YFN I   L+  +VY AL
Sbjct: 65  YKHALEVSHWMTDRRYLPLQVADVAERINLVYKVKGLEEVEKYFNSISQILRRPEVYTAL 124

Query: 142 LNCYAQEKCVDKANAILQKMKEMGFDRTPLSYNIMMNLYYQIGEFEKLDPLLQEMKEKGV 201
           LNCY  EK V KA AI+Q++++MGF +  L YN MMNLY + G +EK+D L+ EM++KGV
Sbjct: 125 LNCYTNEKSVGKAEAIMQQLRDMGFAKGTLCYNHMMNLYCKTGTWEKMDKLMNEMEQKGV 184

Query: 202 SFDPFTYCIRLSAYAAASDITGVDKIVEQMESDTNIVLDWNCYAIAANSCLKAGLIDKFV 261
           +FD FT  IRL+AYA A D  G+DKI+  MESD  I+L W+ Y+IAA   LK GL++K +
Sbjct: 185 NFDEFTLTIRLTAYATAGDSEGMDKILAMMESDKQIILHWDTYSIAAELYLKVGLVEKAL 244

Query: 262 SMLRKSEGLLATAKRKGYAFDFLLKLYAKSGKKDEVHRVWNLYKKE-KVNNKGFISMIRS 321
            +L + + ++ T K+   A++ LLKLYA++GKK+EVHRVW+LYK+  ++ NKG+IS++ +
Sbjct: 245 ELLSRLDSMILTRKKSNGAYNDLLKLYAEAGKKEEVHRVWDLYKQNMRILNKGYISVMSA 304

Query: 322 LLILDNIEGAECIFKEWETRKLVYDLRIPNMLVEAYCREGLMEKAEDFINETLIVRGKFS 381
           L+   + E  E IF+EWE+  L YD R+P++L+ +YCR GL+ KA+  +++ +   G   
Sbjct: 305 LMKFGDTERVEKIFEEWESEALSYDFRVPDVLIRSYCRNGLLGKAKALMDKGISKGGVPW 364

Query: 382 VESWCYLANGYLQKDQLPQAVDALKKAASLCPPELNHLKEILATFLDGKQDVKEAEKVVN 440
           V +WC+LANGY+ +D +P+AV+ALKKA S+CPP     KE LAT ++  +     +   +
Sbjct: 365 VTTWCHLANGYIHEDLVPEAVEALKKAISICPPNYKPSKETLATCVNYWEKQGNVDNAAD 424

BLAST of Cla019500 vs. NCBI nr
Match: gi|659068040|ref|XP_008442434.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like [Cucumis melo])

HSP 1 Score: 691.8 bits (1784), Expect = 7.9e-196
Identity = 345/442 (78.05%), Positives = 392/442 (88.69%), Query Frame = 1

Query: 1   MKLLQSGKPMHLIAFRRELVCFYSTIAKDSLYKRISPLGDPNISVIPVLDQWVLEGRPVC 60
           MKLLQS KP +LIA RR LV FYST  KD+LY+RISP+GDPNISVIPVLDQWVLEGR V 
Sbjct: 1   MKLLQSVKPTNLIALRRGLVNFYSTFVKDNLYRRISPVGDPNISVIPVLDQWVLEGRVVQ 60

Query: 61  REELRNIIKELRVYNRFKHALEISKWMSDKRYLPLSTVDVATRMKLILRVHGLEQVEDYF 120
           +EEL+ IIKELRVY RFKHALEISKWMSDKRYLPLST DVATRM LILRVHGLEQVEDYF
Sbjct: 61  KEELQKIIKELRVYKRFKHALEISKWMSDKRYLPLSTDDVATRMNLILRVHGLEQVEDYF 120

Query: 121 NIIPSQLKGFQVYIALLNCYAQEKCVDKANAILQKMKEMGFDRTPLSYNIMMNLYYQIGE 180
           N +PSQLK + V+IALLNCYA EKCVDKANA LQK+KEMG+ ++ L YNIMMNLY+QIGE
Sbjct: 121 NNMPSQLKRYHVHIALLNCYAHEKCVDKANAFLQKIKEMGYAKSTLPYNIMMNLYHQIGE 180

Query: 181 FEKLDPLLQEMKEKGVSFDPFTYCIRLSAYAAASDITGVDKIVEQMESDTNIVLDWNCYA 240
           FE+LD LL+EMKEKGV +D FTY IRLSAYAAASD TG++K++EQMES+T+IVLDWNCY 
Sbjct: 181 FERLDSLLKEMKEKGVYYDRFTYSIRLSAYAAASDCTGIEKMMEQMESNTSIVLDWNCYV 240

Query: 241 IAANSCLKAGLIDKFVSMLRKSEGLLATAKRKGYAFDFLLKLYAKSGKKDEVHRVWNLYK 300
           IAAN+  K GLIDK VSML+KSEG LAT K+KG+AF+  LKLYA++GKKDEVHR+WNLYK
Sbjct: 241 IAANAYNKVGLIDKSVSMLKKSEGRLATDKKKGHAFNVYLKLYARNGKKDEVHRIWNLYK 300

Query: 301 KEKVNNKGFISMIRSLLILDNIEGAECIFKEWETRKLVYDLRIPNMLVEAYCREGLMEKA 360
           KEK+ NKGFISMIRSLLILD+I GAE I+KEWET+KL YD+RIPN+LV+AYCR GL+EKA
Sbjct: 301 KEKIFNKGFISMIRSLLILDDIRGAEDIYKEWETQKLSYDVRIPNLLVDAYCRAGLIEKA 360

Query: 361 EDFINETLIVRGKFSVESWCYLANGYLQKDQLPQAVDALKKAASLCPPELNHLKEILATF 420
           E+ +NE + VRGKFSVESWCYLA+GYLQKDQLPQAV+ LKKAASLCP ELN++KEILA F
Sbjct: 361 EELVNEIVNVRGKFSVESWCYLASGYLQKDQLPQAVETLKKAASLCPSELNYVKEILAAF 420

Query: 421 LDGKQDVKEAEKVVNLLRSGAN 443
            DGKQDV+EAEKVVNLLR   N
Sbjct: 421 SDGKQDVEEAEKVVNLLREKDN 442

BLAST of Cla019500 vs. NCBI nr
Match: gi|778658728|ref|XP_011653157.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like [Cucumis sativus])

HSP 1 Score: 679.1 bits (1751), Expect = 5.3e-192
Identity = 333/443 (75.17%), Positives = 388/443 (87.58%), Query Frame = 1

Query: 1   MKLLQSGKPMHLIAFRRELVCFYSTIAKDSLYKRISPLGDPNISVIPVLDQWVLEGRPVC 60
           MKLLQS KP++LIAFR E V FYST+ KD+LY+RISP+GDPNISV P+LDQWVLEGR V 
Sbjct: 1   MKLLQSLKPINLIAFRSEFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ 60

Query: 61  REELRNIIKELRVYNRFKHALEISKWMSDKRYLPLSTVDVATRMKLILRVHGLEQVEDYF 120
           ++ELR+IIKELRVY RFKHALEISKWMSDKRY PLST D+A RM LILRVHGLEQVEDYF
Sbjct: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF 120

Query: 121 NIIPSQLKGFQVYIALLNCYAQEKCVDKANAILQKMKEMGFDRTPLSYNIMMNLYYQIGE 180
           + +PSQLK +QV+IALLNCYA EKCVDKANA +QK+KEMGF  +PL YNIMMNLY+QIGE
Sbjct: 121 DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMNLYHQIGE 180

Query: 181 FEKLDPLLQEMKEKGVSFDPFTYCIRLSAYAAASDITGVDKIVEQMESDTNIVLDWNCYA 240
           FE+LD LL+EMKE+GV +D FTY IR+SAYAAASD  G++KI+EQMES+ +IVLDWNCY 
Sbjct: 181 FERLDSLLKEMKERGVYYDRFTYSIRISAYAAASDFRGIEKIMEQMESNPSIVLDWNCYV 240

Query: 241 IAANSCLKAGLIDKFVSMLRKSEGLLATAKRKGYAFDFLLKLYAKSGKKDEVHRVWNLYK 300
           IAAN+  K GLIDK +SML+KSEGLLA  K+KG+AF+  LKLYA++GKKDE+HR+WNLYK
Sbjct: 241 IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK 300

Query: 301 KEKVNNKGFISMIRSLLILDNIEGAECIFKEWETRKLVYDLRIPNMLVEAYCREGLMEKA 360
           KEK+ NKGFISMI SL +LD+I+GAE I+KEWETRKL YDLRIPN+LV+AYCR GLMEKA
Sbjct: 301 KEKIFNKGFISMITSLFVLDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKA 360

Query: 361 EDFINETLIVRGKFSVESWCYLANGYLQKDQLPQAVDALKKAASLCPPELNHLKEILATF 420
           E  +NE +IVR KFSVESWCYLA+GYLQKDQLPQAV+ LK AAS+CP  LN++KEILA F
Sbjct: 361 EVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVKEILAAF 420

Query: 421 LDGKQDVKEAEKVVNLLRSGANS 444
           LDGKQDV+E EKVVNLLR   +S
Sbjct: 421 LDGKQDVEETEKVVNLLREKDDS 443

BLAST of Cla019500 vs. NCBI nr
Match: gi|659068042|ref|XP_008442448.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like [Cucumis melo])

HSP 1 Score: 633.3 bits (1632), Expect = 3.3e-178
Identity = 319/449 (71.05%), Positives = 378/449 (84.19%), Query Frame = 1

Query: 1   MKLLQSGKPMHLIAFRRELVCFYSTIAKDSLYKRISPLGDPNISVIPVLDQWVLEGRPVC 60
           MKLLQS K ++LIAFRRELV FYST   D LY+R+SP+GDPNIS++P+LDQWV EGRPV 
Sbjct: 1   MKLLQSVKSINLIAFRRELVNFYSTFVNDDLYRRLSPVGDPNISIVPILDQWVSEGRPVQ 60

Query: 61  REELRNIIKELRVYNRFKHALEISKWMSDKRYLPLSTVDVATRMKLILRVHGLEQVEDYF 120
             ELR IIKELRVY R+KHALE+SKWMSDK  LPLST D+ATRM LILRVHGLEQVEDYF
Sbjct: 61  IVELRLIIKELRVYKRYKHALEMSKWMSDKVCLPLSTADIATRMNLILRVHGLEQVEDYF 120

Query: 121 NIIPSQLKGFQVYIALLNCYAQEKCVDKANAILQKMKEMGFDRTPLSYNIMMNLYYQIGE 180
           N +PS+LK +QV+IALLNCYA EKCVDKANA+LQK+KE+GF  TP  YNIMMNLY+QIGE
Sbjct: 121 NNMPSKLKRYQVHIALLNCYAHEKCVDKANALLQKIKELGFATTPHPYNIMMNLYHQIGE 180

Query: 181 FEKLDPLLQEMKEKGVSFDPFTYCIRLSAYAAASDITGVDKIVEQMESDTNIVLDWNCYA 240
           FE+LD L++EMKE+G+ +D FTY IRLSAYA ASD  G++KI EQMES+T+IVLDWNCY 
Sbjct: 181 FERLDSLMKEMKERGLYYDRFTYSIRLSAYATASDCAGIEKITEQMESNTSIVLDWNCYV 240

Query: 241 IAANSCLKAGLIDKFVSMLRKSEGLLA-TAKRKGYAFDFLLKLYAKSGKKDEVHRVWNLY 300
           +AA++  K GLIDK +SML+KSE LLA TA+ K +AF+  L LYAK+GKKDE +R+WNLY
Sbjct: 241 VAADAYYKVGLIDKSISMLKKSEELLAKTAENKCHAFNIYLTLYAKNGKKDETYRIWNLY 300

Query: 301 KKEKVNNKGFISMIRSLLILDNIEGAECIFKE-----WETRKLVYDLRIPNMLVEAYCRE 360
           KKEKV NKGFISMI SLLILD+I+GA  I +E     WET+KL YDLRIPN+LV+AYCR 
Sbjct: 301 KKEKVFNKGFISMITSLLILDDIKGARRICEEWETQVWETQKLSYDLRIPNLLVDAYCRA 360

Query: 361 GLMEKAEDFINETLIVRGKFSVESWCYLANGYLQKDQLPQAVDALKKAASLCPPELNHLK 420
           GLME+AE  + E + VR KFSV+SWCY+A+GYLQKDQLP+AV+ LK AASLCP +L+++K
Sbjct: 361 GLMEEAEVLVYEMMTVRRKFSVKSWCYIASGYLQKDQLPEAVETLKIAASLCPSKLDYVK 420

Query: 421 EILATFLDGKQDVKEAEKVVNLLRSGANS 444
           EILA FLDGKQDV+E EKVVNLLR   NS
Sbjct: 421 EILAAFLDGKQDVEEVEKVVNLLREKDNS 449

BLAST of Cla019500 vs. NCBI nr
Match: gi|778658725|ref|XP_011653151.1| (PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like [Cucumis sativus])

HSP 1 Score: 527.3 bits (1357), Expect = 2.6e-146
Identity = 262/358 (73.18%), Positives = 303/358 (84.64%), Query Frame = 1

Query: 1   MKLLQSGKPMHLIAFRRELVCFYSTIAKDSLYKRISPLGDPNISVIPVLDQWVLEGRPVC 60
           MKLLQS KP++LIAFRRE V FYST+ KDSLY+RISP+GDPNISV P+LDQWVLE   V 
Sbjct: 1   MKLLQSLKPINLIAFRREFVNFYSTVVKDSLYRRISPVGDPNISVTPLLDQWVLESGLVQ 60

Query: 61  REELRNIIKELRVYNRFKHALEISKWMSDKRYLPLSTVDVATRMKLILRVHGLEQVEDYF 120
           ++ELR+IIKELRVY RFKHALEISKWMSDKRY PLST D+ATRM LILRVHGLEQVEDYF
Sbjct: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIATRMNLILRVHGLEQVEDYF 120

Query: 121 NIIPSQLKGFQVYIALLNCYAQEKCVDKANAILQKMKEMGFDRTPLSYNIMMNLYYQIGE 180
           N +PSQLK  QV+IALLNCYA EK  DKANA+LQK+KEMGF +T L YNI MNLY+QIGE
Sbjct: 121 NNMPSQLKRCQVHIALLNCYAHEKYADKANAVLQKIKEMGFAKTSLPYNITMNLYHQIGE 180

Query: 181 FEKLDPLLQEMKEKGVSFDPFTYCIRLSAYAAASDITGVDKIVEQMESDTNIVLDWNCYA 240
           FE+LD     +KE  V  D FTY  RLSAYA A D TG++KI+EQME +T+IVLDWNCY 
Sbjct: 181 FERLD---SPLKETDVDHDQFTYTTRLSAYATAFDFTGIEKIMEQMEXNTSIVLDWNCYV 240

Query: 241 IAANSCLKAGLIDKFVSMLRKSEGLLATAKRKGYAFDFLLKLYAKSGKKDEVHRVWNLYK 300
           IAAN+  K GLIDK +SML+KSEGLLA  K+KG+AF+  LKLYA++GKKDE+H +WNLYK
Sbjct: 241 IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHLIWNLYK 300

Query: 301 KEKVNNKGFISMIRSLLILDNIEGAECIFKEWETRKLVYDLRIPNMLVEAYCREGLME 359
           KEK+ NKGFISMI SL +LD+I+GAE I+KEWET+KL YDLRIPN+LV+AYCR GLME
Sbjct: 301 KEKIFNKGFISMITSLFVLDDIKGAERIYKEWETQKLSYDLRIPNLLVDAYCRAGLME 355

BLAST of Cla019500 vs. NCBI nr
Match: gi|731437627|ref|XP_010647278.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like [Vitis vinifera])

HSP 1 Score: 474.2 bits (1219), Expect = 2.6e-130
Identity = 238/413 (57.63%), Positives = 312/413 (75.54%), Query Frame = 1

Query: 31  LYKRISPLGDPNISVIPVLDQWVLEGRPVCREELRNIIKELRVYNRFKHALEISKWMSDK 90
           LY RISPLGD  IS++PVLD+WV +GR V  EELR+II++L  Y RFKHALEIS+WMSDK
Sbjct: 44  LYWRISPLGDSKISMVPVLDEWVQKGRTVNEEELRDIIQKLNHYRRFKHALEISQWMSDK 103

Query: 91  RYLPLSTVDVATRMKLILRVHGLEQVEDYFNIIPSQLKGFQVYIALLNCYAQEKCVDKAN 150
           RY+PL   D+A RM LIL+VHGLEQVE+YFN I   LK +QVYIALLNCYA EK VDKA 
Sbjct: 104 RYIPLMPRDIALRMNLILKVHGLEQVENYFNNIHKNLKTYQVYIALLNCYALEKSVDKAE 163

Query: 151 AILQKMKEMGFDRTPLSYNIMMNLYYQIGEFEKLDPLLQEMKEKGVSFDPFTYCIRLSAY 210
           AI+Q+++++GF RT L YN +MN+YY++G +EKLD L+ EM+EKG+  D FT  IRLSAY
Sbjct: 164 AIMQRLRDLGFVRTALGYNTLMNVYYRMGNWEKLDILMHEMEEKGIFCDKFTLAIRLSAY 223

Query: 211 AAASDITGVDKIVEQMESDTNIVLDWNCYAIAANSCLKAGLIDKFVSMLRKSEGLLATAK 270
           AAAS+I G+D IV +MESD  I+LDWN YA+ A+  LK GL+DK + M++K E L+  AK
Sbjct: 224 AAASNIVGIDNIVTRMESDPRIILDWNSYAVVAHGYLKVGLVDKTLVMMKKLEELI-DAK 283

Query: 271 RKGYAFDFLLKLYAKSGKKDEVHRVWNLY-KKEKVNNKGFISMIRSLLILDNIEGAECIF 330
               AFD LLKLYA++ ++DE+ RVW LY KKEK+ NKG+++MI SLL  D+I+ AE + 
Sbjct: 284 GSNVAFDNLLKLYAETRQRDELDRVWMLYKKKEKIYNKGYMAMISSLLKFDDIDAAEKVL 343

Query: 331 KEWETRKLVYDLRIPNMLVEAYCREGLMEKAEDFINETLIVRGKFSVESWCYLANGYLQK 390
           +EWE+R+L YD R+PN L++AYCR+GL EKAE  +N+ L   G   V++W YLANGYL+ 
Sbjct: 344 EEWESRRLSYDFRVPNFLIDAYCRKGLTEKAEALVNKILTKGGNPLVDTWFYLANGYLED 403

Query: 391 DQLPQAVDALKKAASLCPPELNHLKEILAT---FLDGKQDVKEAEKVVNLLRS 440
            Q+P+AV+ALKKA  +CPP     K  LAT   +L+G +DV+ A + +  L++
Sbjct: 404 SQIPKAVEALKKAVVVCPPNWKPSKNTLATCLEYLEGNRDVEGAGEFIRFLQN 455
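
The percentages on each HSP summary line follow directly from the match counts divided by the alignment length (for example 262/358 for identity in the Cucumis sativus hit). As a minimal sketch, the Python below re-derives them from a summary line and checks them against the reported values. The function names `percent` and `check_hsp_line` are illustrative assumptions, not part of any BLAST tool, and the regular expression accepts either spelling of the Positives label.

```python
import re

# Pattern for the HSP summary line shown in the alignments above.
# "Posi?tives" accepts both "Positives" and the "Postives" spelling.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(matches: int, aligned: int) -> float:
    """Return matches/aligned as a percentage rounded to two decimals."""
    return round(100.0 * matches / aligned, 2)


def check_hsp_line(line: str) -> dict:
    """Recompute identity and positives percentages from an HSP summary line.

    Hypothetical helper for illustration: it only re-derives the numbers
    printed on the line and reports whether they agree.
    """
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    id_n, id_len, id_pct, pos_n, pos_len, pos_pct = m.groups()
    identity = percent(int(id_n), int(id_len))
    positives = percent(int(pos_n), int(pos_len))
    return {
        "identity_pct": identity,
        "positives_pct": positives,
        # Allow for rounding in the reported two-decimal values.
        "consistent": abs(identity - float(id_pct)) <= 0.01
        and abs(positives - float(pos_pct)) <= 0.01,
    }


if __name__ == "__main__":
    line = "Identity = 238/413 (57.63%), Postives = 312/413 (75.54%), Query Frame = 1"
    print(check_hsp_line(line))
```

Both percentages use the alignment length (including gap columns) as the denominator, so they describe the aligned region only, not the full length of either protein.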

The following BLAST results are available for this feature:
Match Name   E-value  Identity  Description
PP166_ARATH  2.4e-88  41.59     Pentatricopeptide repeat-containing protein At2g20710, mitochondrial OS=Arabidop...
PP334_ARATH  3.9e-70  36.67     Pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Arabidop...
PPR3_ARATH   1.2e-63  33.41     Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana GN...
PPR86_ARATH  9.0e-59  31.38     Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana GN...
PPR61_ARATH  2.0e-58  40.12     Putative pentatricopeptide repeat-containing protein At1g28020 OS=Arabidopsis th...

Match Name        E-value   Identity  Description
A0A0A0LV44_CUCSA  3.7e-192  75.17     Uncharacterized protein OS=Cucumis sativus GN=Csa_1G073790 PE=4 SV=1
W9RYV8_9ROSA      1.1e-124  52.57     Uncharacterized protein OS=Morus notabilis GN=L484_014527 PE=4 SV=1
A0A068TUA9_COFCA  2.6e-121  51.30     Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00029177001 PE=4 SV=1
M1CWK5_SOLTU      5.2e-114  48.95     Uncharacterized protein OS=Solanum tuberosum GN=PGSC0003DMG400029687 PE=4 SV=1
K4B957_SOLLC      9.9e-113  48.11     Uncharacterized protein OS=Solanum lycopersicum PE=4 SV=1

Match Name                        E-value   Identity  Description
gi|659068040|ref|XP_008442434.1|  7.9e-196  78.05     PREDICTED: pentatricopeptide repeat-containing protein At2g20710, mitochondrial-...
gi|778658728|ref|XP_011653157.1|  5.3e-192  75.17     PREDICTED: pentatricopeptide repeat-containing protein At2g20710, mitochondrial-...
gi|659068042|ref|XP_008442448.1|  3.3e-178  71.05     PREDICTED: pentatricopeptide repeat-containing protein At2g20710, mitochondrial-...
gi|778658725|ref|XP_011653151.1|  2.6e-146  73.18     PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At2g...
gi|731437627|ref|XP_010647278.1|  2.6e-130  57.63     PREDICTED: pentatricopeptide repeat-containing protein At2g20710, mitochondrial-...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
IPR011990  TPR-like_helical_dom_sf
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0009058 biosynthetic process
biological_process GO:0055114 oxidation-reduction process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0005506 iron ion binding
molecular_function GO:0016491 oxidoreductase activity
This gene is associated with the following unigenes:
Unigene Name  Analysis Name                          Sequence type in Unigene
WMU51356      watermelon EST collection version 2.0  transcribed_cluster
WMU52419      watermelon EST collection version 2.0  transcribed_cluster
WMU52435      watermelon EST collection version 2.0  transcribed_cluster
WMU53462      watermelon EST collection version 2.0  transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Cla019500     Cla019500.1  mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name  Unique Name  Type
WMU53462      WMU53462     transcribed_cluster
WMU52435      WMU52435     transcribed_cluster
WMU51356      WMU51356     transcribed_cluster
WMU52419      WMU52419     transcribed_cluster


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term   IPR Description                        Source    Source Term       Source Description                           Alignment
IPR002885  Pentatricopeptide repeat               PFAM      PF01535           PPR                                          coord: 345..366, score: 0.0054; coord: 133..161, score: 2.
IPR002885  Pentatricopeptide repeat               PFAM      PF13041           PPR_2                                        coord: 167..211, score: 7.
IPR002885  Pentatricopeptide repeat               TIGRFAMs  TIGR00756         TIGR00756                                    coord: 132..161, score: 2.1E-5; coord: 167..199, score: 1.
IPR002885  Pentatricopeptide repeat               PROFILE   PS51375           PPR                                          coord: 164..198, score: 9.81; coord: 340..375, score: 7.629; coord: 272..306, score: 6.73; coord: 199..229, score: 6.796; coord: 129..163, score:
IPR011990  Tetratricopeptide-like helical domain  GENE3D    G3DSA:1.25.40.10  -                                            coord: 339..434, score: 5.1E-9; coord: 130..186, score: 5.
IPR011990  Tetratricopeptide-like helical domain  unknown   SSF48452          TPR-like                                     coord: 339..432, score: 2.7
None       No IPR available                       PANTHER   PTHR24015         FAMILY NOT NAMED                             coord: 28..438, score: 8.3E
None       No IPR available                       PANTHER   PTHR24015:SF644   PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN  coord: 28..438, score: 8.3E

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene       Organism                      Block
Cla019500  Cucumber (Gy14) v2            cgybwmB448
Cla019500  Melon (DHL92) v3.6.1          medwmB550
Cla019500  Silver-seed gourd             carwmB0142
Cla019500  Silver-seed gourd             carwmB0948
Cla019500  Cucumber (Chinese Long) v3    cucwmB503
Cla019500  Watermelon (97103) v2         wmwmbB111
Cla019500  Watermelon (97103) v2         wmwmbB117
Cla019500  Watermelon (97103) v2         wmwmbB124
Cla019500  Wax gourd                     wgowmB420
Cla019500  Wax gourd                     wgowmB478
Cla019500  Watermelon (97103) v1         wmwmB082
Cla019500  Watermelon (97103) v1         wmwmB095
Cla019500  Cucumber (Gy14) v1            cgywmB020
Cla019500  Cucurbita maxima (Rimu)       cmawmB025
Cla019500  Cucurbita maxima (Rimu)       cmawmB188
Cla019500  Cucurbita maxima (Rimu)       cmawmB367
Cla019500  Cucurbita maxima (Rimu)       cmawmB820
Cla019500  Cucurbita moschata (Rifu)     cmowmB017
Cla019500  Cucurbita moschata (Rifu)     cmowmB176
Cla019500  Cucurbita moschata (Rifu)     cmowmB360
Cla019500  Cucurbita moschata (Rifu)     cmowmB420
Cla019500  Melon (DHL92) v3.5.1          mewmB149
Cla019500  Melon (DHL92) v3.5.1          mewmB563
Cla019500  Watermelon (Charleston Gray)  wcgwmB092
Cla019500  Watermelon (Charleston Gray)  wcgwmB124
Cla019500  Watermelon (Charleston Gray)  wcgwmB223
Cla019500  Cucumber (Chinese Long) v2    cuwmB008
Cla019500  Cucumber (Chinese Long) v2    cuwmB477
Cla019500  Cucurbita pepo (Zucchini)     cpewmB025
Cla019500  Cucurbita pepo (Zucchini)     cpewmB069
Cla019500  Cucurbita pepo (Zucchini)     cpewmB503
Cla019500  Cucurbita pepo (Zucchini)     cpewmB764
Cla019500  Bottle gourd (USVL1VR-Ls)     lsiwmB137
Cla019500  Bottle gourd (USVL1VR-Ls)     lsiwmB184
Cla019500  Bottle gourd (USVL1VR-Ls)     lsiwmB299