Cla019096 (gene) Watermelon (97103) v1

Name: Cla019096
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Pentatricopeptide repeat-containing protein (AHRD V1 ***- D7MUF9_ARALL); contains InterPro domain(s) IPR002885 Pentatricopeptide repeat
Location: Chr6 : 25045412 .. 25046890 (-)
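
As a quick sanity check on the location line, the length of the feature follows from its coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which is a common convention for records like this one but is not stated on the page; the function name `span_length` is illustrative only.

```python
def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Location line of this record.
gene_len = span_length(25045412, 25046890)
print(gene_len)       # 1479
print(gene_len % 3)   # 0, so the span divides evenly into codons
```

Under that assumption the feature spans 1479 bp on the minus strand of Chr6, a whole number of codons.
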
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCACGCACGCCATATGTTCGATGAAATGTCGCTGAGAAGATGCGGCTCATTGTTGAAGTTAAAATGGGATAGCTTCATTGTGCAATCCTTCTATATGCAGCATCGCTTTTGTTCCCTTCATTCCACCATAGACAAGAGAGCTTCTGTAATAAAACTGTGCGAAGTGATTTCGTGCACGATCGGTGGGTTAGATGAACTGGAATCCAGTCTGAATAAATGTACAATATCATTGACGTCTTCACTCGTTACCCAGGTTATCGATTCTAGCAAAAACGAAGCTCCCATTAGAAGATTGCTTAGGTTTTTCTTATGGTCTCTCAAGAAGTTAAACCACGATTTAGAAGATGAAGATTTCAATTATGCCATCCGCTTCTTTGCTCAGAAGAAGGACTACACTGCCATTAATATTTTACTTTCCAATCTTAAGAAAGCTGACCGCACAATGGACGGCCAGACCTTCGGTTTCGTGGCTGAGGCTTTAGTCAGAATGGATAGAGAAGATGAAGCATTGGGTCTGTTCAAGAACTTGGATAAGTACAAATGCCCACATGACCAATTTACTGTCACTGCAGTTATTACTGCTCTTTGTGCAAAAGGGCATGCTAAAAGAGCAGAAGGGGTTGTCTTGCACCACAAGGACAAGATTTCTAGCACAATGAGTTGCATCTATAGAAGCCTTCTATATGGATGGTCTATTAAGAAGAACACAAAAGAAGCAAGAAGAATACTCAAAGAAATGAAGTCAGATGGAACCATGCCAGATTTGTTCTGCTACAACACTTTCCTCAATTGTCTTTGTGAGCAGAATCTTGAGAAAAATCCTTCAGGTCTTGTGCCTGAAGCCTTGAATGTGATGATGGAAATGAGATCCTACAAAATTGCTCCTAACTCAATCAGTTATAATATATTGTTGTCATGTCTGGGTAGAACGAGGAGAGTGAAGGAATCATGTACAATCCTTGAGACAATGAAACGATCTGGTTGTCGACCCGATTGCATAAGCTATTATCTTGTGGCAAGAGTGTTGTTCTTGACTGGAAGATTTGGGAAAGGGCGCAGAATTGTGGACGAGATGATTGAAGAAGGGTTGATCCCGGATCGAAAATTCTACTATGATTTGATTGGTGTTCTGTGTGGTGTTGAGAGAACAAATTATGCACTTGAGCTTTTTGAGAAAATGAAGAGAAGCTCATTGGGGGGTTATGGGCCAGTTTATGATGTGCTTATACCAAAGCTTTGTCGAGGAGGAGATTTTGAAATGGGCAGGAAACTTTGGGAGGAGGCCATGGCTATGGGCGTTATGCTTAGCTGCTCAAGTGAGGTTTTGGATCCTTCAATCACAAAGGTTTTCAAGCCAACAAGAAAGATTGAGAATAAAATTCTTGAAGAATGCAATAGAGGTGAGAAGCAGAACAAAGCTGCTACTGAGAAACCAAAAGAAAAGAGAAAGAAAAGTAAGAATCTGTCTTGGAAATAA

mRNA sequence

ATGCACGCACGCCATATGTTCGATGAAATGTCGCTGAGAAGATGCGGCTCATTGTTGAAGTTAAAATGGGATAGCTTCATTGTGCAATCCTTCTATATGCAGCATCGCTTTTGTTCCCTTCATTCCACCATAGACAAGAGAGCTTCTGTAATAAAACTGTGCGAAGTGATTTCGTGCACGATCGGTGGGTTAGATGAACTGGAATCCAGTCTGAATAAATGTACAATATCATTGACGTCTTCACTCGTTACCCAGGTTATCGATTCTAGCAAAAACGAAGCTCCCATTAGAAGATTGCTTAGGTTTTTCTTATGGTCTCTCAAGAAGTTAAACCACGATTTAGAAGATGAAGATTTCAATTATGCCATCCGCTTCTTTGCTCAGAAGAAGGACTACACTGCCATTAATATTTTACTTTCCAATCTTAAGAAAGCTGACCGCACAATGGACGGCCAGACCTTCGGTTTCGTGGCTGAGGCTTTAGTCAGAATGGATAGAGAAGATGAAGCATTGGGTCTGTTCAAGAACTTGGATAAGTACAAATGCCCACATGACCAATTTACTGTCACTGCAGTTATTACTGCTCTTTGTGCAAAAGGGCATGCTAAAAGAGCAGAAGGGGTTGTCTTGCACCACAAGGACAAGATTTCTAGCACAATGAGTTGCATCTATAGAAGCCTTCTATATGGATGGTCTATTAAGAAGAACACAAAAGAAGCAAGAAGAATACTCAAAGAAATGAAGTCAGATGGAACCATGCCAGATTTGTTCTGCTACAACACTTTCCTCAATTGTCTTTGTGAGCAGAATCTTGAGAAAAATCCTTCAGGTCTTGTGCCTGAAGCCTTGAATGTGATGATGGAAATGAGATCCTACAAAATTGCTCCTAACTCAATCAGTTATAATATATTGTTGTCATGTCTGGGTAGAACGAGGAGAGTGAAGGAATCATGTACAATCCTTGAGACAATGAAACGATCTGGTTGTCGACCCGATTGCATAAGCTATTATCTTGTGGCAAGAGTGTTGTTCTTGACTGGAAGATTTGGGAAAGGGCGCAGAATTGTGGACGAGATGATTGAAGAAGGGTTGATCCCGGATCGAAAATTCTACTATGATTTGATTGGTGTTCTGTGTGGTGTTGAGAGAACAAATTATGCACTTGAGCTTTTTGAGAAAATGAAGAGAAGCTCATTGGGGGGTTATGGGCCAGTTTATGATGTGCTTATACCAAAGCTTTGTCGAGGAGGAGATTTTGAAATGGGCAGGAAACTTTGGGAGGAGGCCATGGCTATGGGCGTTATGCTTAGCTGCTCAAGTGAGGTTTTGGATCCTTCAATCACAAAGGTTTTCAAGCCAACAAGAAAGATTGAGAATAAAATTCTTGAAGAATGCAATAGAGGTGAGAAGCAGAACAAAGCTGCTACTGAGAAACCAAAAGAAAAGAGAAAGAAAAGTAAGAATCTGTCTTGGAAATAA

Coding sequence (CDS)

ATGCACGCACGCCATATGTTCGATGAAATGTCGCTGAGAAGATGCGGCTCATTGTTGAAGTTAAAATGGGATAGCTTCATTGTGCAATCCTTCTATATGCAGCATCGCTTTTGTTCCCTTCATTCCACCATAGACAAGAGAGCTTCTGTAATAAAACTGTGCGAAGTGATTTCGTGCACGATCGGTGGGTTAGATGAACTGGAATCCAGTCTGAATAAATGTACAATATCATTGACGTCTTCACTCGTTACCCAGGTTATCGATTCTAGCAAAAACGAAGCTCCCATTAGAAGATTGCTTAGGTTTTTCTTATGGTCTCTCAAGAAGTTAAACCACGATTTAGAAGATGAAGATTTCAATTATGCCATCCGCTTCTTTGCTCAGAAGAAGGACTACACTGCCATTAATATTTTACTTTCCAATCTTAAGAAAGCTGACCGCACAATGGACGGCCAGACCTTCGGTTTCGTGGCTGAGGCTTTAGTCAGAATGGATAGAGAAGATGAAGCATTGGGTCTGTTCAAGAACTTGGATAAGTACAAATGCCCACATGACCAATTTACTGTCACTGCAGTTATTACTGCTCTTTGTGCAAAAGGGCATGCTAAAAGAGCAGAAGGGGTTGTCTTGCACCACAAGGACAAGATTTCTAGCACAATGAGTTGCATCTATAGAAGCCTTCTATATGGATGGTCTATTAAGAAGAACACAAAAGAAGCAAGAAGAATACTCAAAGAAATGAAGTCAGATGGAACCATGCCAGATTTGTTCTGCTACAACACTTTCCTCAATTGTCTTTGTGAGCAGAATCTTGAGAAAAATCCTTCAGGTCTTGTGCCTGAAGCCTTGAATGTGATGATGGAAATGAGATCCTACAAAATTGCTCCTAACTCAATCAGTTATAATATATTGTTGTCATGTCTGGGTAGAACGAGGAGAGTGAAGGAATCATGTACAATCCTTGAGACAATGAAACGATCTGGTTGTCGACCCGATTGCATAAGCTATTATCTTGTGGCAAGAGTGTTGTTCTTGACTGGAAGATTTGGGAAAGGGCGCAGAATTGTGGACGAGATGATTGAAGAAGGGTTGATCCCGGATCGAAAATTCTACTATGATTTGATTGGTGTTCTGTGTGGTGTTGAGAGAACAAATTATGCACTTGAGCTTTTTGAGAAAATGAAGAGAAGCTCATTGGGGGGTTATGGGCCAGTTTATGATGTGCTTATACCAAAGCTTTGTCGAGGAGGAGATTTTGAAATGGGCAGGAAACTTTGGGAGGAGGCCATGGCTATGGGCGTTATGCTTAGCTGCTCAAGTGAGGTTTTGGATCCTTCAATCACAAAGGTTTTCAAGCCAACAAGAAAGATTGAGAATAAAATTCTTGAAGAATGCAATAGAGGTGAGAAGCAGAACAAAGCTGCTACTGAGAAACCAAAAGAAAAGAGAAAGAAAAGTAAGAATCTGTCTTGGAAATAA

Protein sequence

MHARHMFDEMSLRRCGSLLKLKWDSFIVQSFYMQHRFCSLHSTIDKRASVIKLCEVISCTIGGLDELESSLNKCTISLTSSLVTQVIDSSKNEAPIRRLLRFFLWSLKKLNHDLEDEDFNYAIRFFAQKKDYTAINILLSNLKKADRTMDGQTFGFVAEALVRMDREDEALGLFKNLDKYKCPHDQFTVTAVITALCAKGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEARRILKEMKSDGTMPDLFCYNTFLNCLCEQNLEKNPSGLVPEALNVMMEMRSYKIAPNSISYNILLSCLGRTRRVKESCTILETMKRSGCRPDCISYYLVARVLFLTGRFGKGRRIVDEMIEEGLIPDRKFYYDLIGVLCGVERTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRGGDFEMGRKLWEEAMAMGVMLSCSSEVLDPSITKVFKPTRKIENKILEECNRGEKQNKAATEKPKEKRKKSKNLSWK
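
The protein sequence above can be related to the CDS by translating codon by codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `translate` is a hypothetical helper, not part of any database tooling, and only the first 30 bases and the last 12 bases of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Raises KeyError on codons with ambiguous bases such as N.
    """
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGCACGCACGCCATATGTTCGATGAAATG"  # first 30 bases of the CDS
cds_end = "TCTTGGAAATAA"                      # last 12 bases of the CDS
print(translate(cds_start))  # MHARHMFDEM
print(translate(cds_end))    # SWK
```

Both fragments agree with the start (MHARHMFDEM) and the end (SWK) of the listed protein sequence; passing the full CDS string to the same function would give the complete translation.
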
BLAST of Cla019096 vs. Swiss-Prot
Match: PP439_ARATH (Pentatricopeptide repeat-containing protein At5g61370, mitochondrial OS=Arabidopsis thaliana GN=At5g61370 PE=2 SV=1)

HSP 1 Score: 575.1 bits (1481), Expect = 7.4e-163
Identity = 280/480 (58.33%), Positives = 360/480 (75.00%), Query Frame = 1

Query: 19  LKLKWDSFIVQSFYMQHRFCSLHSTIDKRASVIKLCEVISCTIGGLDELESSLNKCTISL 78
           ++L   +++  +  +   FCS H       ++ ++  ++S  +GGLD+LE +LN+ ++S 
Sbjct: 6   VRLNRFTYLTSTAKLTRYFCSHHLVDRSETALHEVIRIVSSPVGGLDDLEENLNQVSVSP 65

Query: 79  TSSLVTQVIDSSKNEAPIRRLLRFFLWSLKKLNHDLEDEDFNYAIRFFAQKKDYTAINIL 138
           +S+LVTQVI+S KNE   RRLLRFF WS K L   L D++FNY +R  A+KKD+TA+ IL
Sbjct: 66  SSNLVTQVIESCKNETSPRRLLRFFSWSCKSLGSSLHDKEFNYVLRVLAEKKDHTAMQIL 125

Query: 139 LSNLKKADRTMDGQTFGFVAEALVRMDREDEALGLFKNLDKYKCPHDQFTVTAVITALCA 198
           LS+L+K +R MD QTF  VAE LV++ +E++A+G+FK LDK+ CP D FTVTA+I+ALC+
Sbjct: 126 LSDLRKENRAMDKQTFSIVAETLVKVGKEEDAIGIFKILDKFSCPQDGFTVTAIISALCS 185

Query: 199 KGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEARRILKEMKSDGTMPDLFC 258
           +GH KRA GV+ HHKD IS     +YRSLL+GWS+++N KEARR++++MKS G  PDLFC
Sbjct: 186 RGHVKRALGVMHHHKDVISGNELSVYRSLLFGWSVQRNVKEARRVIQDMKSAGITPDLFC 245

Query: 259 YNTFLNCLCEQNLEKNPSGLVPEALNVMMEMRSYKIAPNSISYNILLSCLGRTRRVKESC 318
           +N+ L CLCE+N+ +NPSGLVPEALN+M+EMRSYKI P S+SYNILLSCLGRTRRV+ESC
Sbjct: 246 FNSLLTCLCERNVNRNPSGLVPEALNIMLEMRSYKIQPTSMSYNILLSCLGRTRRVRESC 305

Query: 319 TILETMKRSGCRPDCISYYLVARVLFLTGRFGKGRRIVDEMIEEGLIPDRKFYYDLIGVL 378
            ILE MKRSGC PD  SYY V RVL+LTGRFGKG +IVDEMIE G  P+RKFYYDLIGVL
Sbjct: 306 QILEQMKRSGCDPDTGSYYFVVRVLYLTGRFGKGNQIVDEMIERGFRPERKFYYDLIGVL 365

Query: 379 CGVERTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRGGDFEMGRKLWEEAMAMGVMLSC 438
           CGVER N+AL+LFEKMKRSS+GGYG VYD+LIPKLC+GG+FE GR+LWEEA+++ V LSC
Sbjct: 366 CGVERVNFALQLFEKMKRSSVGGYGQVYDLLIPKLCKGGNFEKGRELWEEALSIDVTLSC 425

Query: 439 SSEVLDPSITKVFKPTRKIENKILEECN--------RGEKQNKAATEKPKEKRK-KSKNL 490
           S  +LDPS+T+VFKP +  E   + +          R  K       KPK + K K KNL
Sbjct: 426 SISLLDPSVTEVFKPMKMKEEAAMVDRRALNLKIHARMNKTKPKLKLKPKRRSKTKKKNL 485
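
The percentages in an HSP header are simply the identical or positive-scoring positions divided by the alignment length. A small sketch of how such a line could be parsed and rechecked follows; the regular expression and the function name `parse_hsp_stats` are assumptions made for illustration, and the pattern also tolerates a misspelt "Postives" label.

```python
import re

# Matches e.g. "Identity = 280/480 (58.33%), Positives = 360/480 (75.00%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Recompute identity and positives percentages from an HSP header line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity": round(100 * int(ident) / int(ident_len), 2),
        "positives": round(100 * int(pos) / int(pos_len), 2),
        "reported": (float(ident_pct), float(pos_pct)),
    }


stats = parse_hsp_stats(
    "Identity = 280/480 (58.33%), Positives = 360/480 (75.00%), Query Frame = 1"
)
print(stats["identity"], stats["positives"])  # 58.33 75.0
```

For the At5g61370 hit the recomputed values match the reported ones: 280 of 480 aligned positions are identical and 360 of 480 are scored as positive.
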

BLAST of Cla019096 vs. Swiss-Prot
Match: PP438_ARATH (Pentatricopeptide repeat-containing protein PNM1, mitochondrial OS=Arabidopsis thaliana GN=PNM1 PE=1 SV=1)

HSP 1 Score: 179.9 bits (455), Expect = 6.9e-44
Identity = 116/418 (27.75%), Positives = 204/418 (48.80%), Query Frame = 1

Query: 76  ISLTSSLVTQVIDSSKNEAPIRRLLRFFLWSLKKLNHDLEDEDFNYAIRFFAQKKDYTAI 135
           I+    L+ Q ++ S      R  L F  W     N    DE  ++ + +F ++KD+  +
Sbjct: 105 ITPNPDLILQTLNLSPEAG--RAALGFNEWLDSNSNFSHTDETVSFFVDYFGRRKDFKGM 164

Query: 136 NILLSNLKKADRTMDGQTFGFVAEALVRMDREDEALGLFKNLDK-YKCPHDQFTVTAVIT 195
             ++S  K       G+T     + LVR  R  +    F+ ++  Y    D+ ++T V+ 
Sbjct: 165 LEIISKYKGI---AGGKTLESAIDRLVRAGRPKQVTDFFEKMENDYGLKRDKESLTLVVK 224

Query: 196 ALCAKGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEARRILKEMKSDGTMP 255
            LC KGHA  AE +V +  ++I    + I   L+ GW I +   EA R+  EM   G   
Sbjct: 225 KLCEKGHASIAEKMVKNTANEIFPDEN-ICDLLISGWCIAEKLDEATRLAGEMSRGGFEI 284

Query: 256 DLFCYNTFLNCLCEQNLEKNPSGLVPEALNVMMEMRSYKIAPNSISYNILLSCLGRTRRV 315
               YN  L+C+C+   +K+P  L PE   V++EM    +  N+ ++N+L++ L + RR 
Sbjct: 285 GTKAYNMMLDCVCKLCRKKDPFKLQPEVEKVLLEMEFRGVPRNTETFNVLINNLCKIRRT 344

Query: 316 KESCTILETMKRSGCRPDCISYYLVARVLFLTGRFGKGRRIVDEMIEEGL--IPDRKFYY 375
           +E+ T+   M   GC+PD  +Y ++ R L+   R G+G  ++D+M   G   + ++K YY
Sbjct: 345 EEAMTLFGRMGEWGCQPDAETYLVLIRSLYQAARIGEGDEMIDKMKSAGYGELLNKKEYY 404

Query: 376 DLIGVLCGVERTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRGGDFEMGRKLWEEAMAM 435
             + +LCG+ER  +A+ +F+ MK +        YD+L+ K+C          L++EA   
Sbjct: 405 GFLKILCGIERLEHAMSVFKSMKANGCKPGIKTYDLLMGKMCANNQLTRANGLYKEAAKK 464

Query: 436 GVMLSCSSEVLDPSITKVFKPTRKIENKILEECNRGEKQNKAATEKPKEKRKKSKNLS 491
           G+ +S     +DP   K  K T+++++ +        K+ +   EK   K+K+ K ++
Sbjct: 465 GIAVSPKEYRVDPRFMK--KKTKEVDSNV--------KKRETLPEKTARKKKRLKQIN 506

BLAST of Cla019096 vs. Swiss-Prot
Match: PP447_ARATH (Putative pentatricopeptide repeat-containing protein At5g65820 OS=Arabidopsis thaliana GN=At5g65820 PE=3 SV=1)

HSP 1 Score: 122.1 bits (305), Expect = 1.7e-26
Identity = 83/370 (22.43%), Positives = 170/370 (45.95%), Query Frame = 1

Query: 66  ELESSLNKCTISLTSSLVTQVIDSSKNEAPIRRLLRFFLWSLKKLNHDLEDEDFNYAIRF 125
           +LE +LN+  + L   L+ +V++   +   +    RFF+W+ K+  +    E +   ++ 
Sbjct: 99  KLELALNESGVELRPGLIERVLNRCGDAGNLG--YRFFVWAAKQPRYCHSIEVYKSMVKI 158

Query: 126 FAQKKDYTAINILLSNLKKAD-RTMDGQTFGFVAEALVRMDREDEALGLFKNLDKYKCPH 185
            ++ + + A+  L+  ++K + + ++ + F  + +     D   +A+ +   + K+    
Sbjct: 159 LSKMRQFGAVWGLIEEMRKENPQLIEPELFVVLVQRFASADMVKKAIEVLDEMPKFGFEP 218

Query: 186 DQFTVTAVITALCAKGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEARRIL 245
           D++    ++ ALC  G  K A  +    + +    +   + SLLYGW       EA+ +L
Sbjct: 219 DEYVFGCLLDALCKHGSVKDAAKLFEDMRMRFPVNLR-YFTSLLYGWCRVGKMMEAKYVL 278

Query: 246 KEMKSDGTMPDLFCYNTFLNCLCEQNLEKNPSGLVPEALNVMMEMRSYKIAPNSISYNIL 305
            +M   G  PD+  Y   L+           +G + +A +++ +MR     PN+  Y +L
Sbjct: 279 VQMNEAGFEPDIVDYTNLLSGYAN-------AGKMADAYDLLRDMRRRGFEPNANCYTVL 338

Query: 306 LSCLGRTRRVKESCTILETMKRSGCRPDCISYYLVARVLFLTGRFGKGRRIVDEMIEEGL 365
           +  L +  R++E+  +   M+R  C  D ++Y  +       G+  K   ++D+MI++GL
Sbjct: 339 IQALCKVDRMEEAMKVFVEMERYECEADVVTYTALVSGFCKWGKIDKCYIVLDDMIKKGL 398

Query: 366 IPDRKFYYDLIGVLCGVERTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRGGDFEMGRK 425
           +P    Y  ++      E     LEL EKM++        +Y+V+I   C+ G+ +   +
Sbjct: 399 MPSELTYMHIMVAHEKKESFEECLELMEKMRQIEYHPDIGIYNVVIRLACKLGEVKEAVR 458

Query: 426 LWEEAMAMGV 435
           LW E    G+
Sbjct: 459 LWNEMEENGL 458


HSP 2 Score: 60.5 bits (145), Expect = 6.1e-08
Identity = 44/227 (19.38%), Positives = 100/227 (44.05%), Query Frame = 1

Query: 138 LLSNLKKADRTMDGQTFGFVAEALVRMDREDEALGLFKNLDKYKCPHDQFTVTAVITALC 197
           LL ++++     +   +  + +AL ++DR +EA+ +F  +++Y+C  D  T TA+++  C
Sbjct: 309 LLRDMRRRGFEPNANCYTVLIQALCKVDRMEEAMKVFVEMERYECEADVVTYTALVSGFC 368

Query: 198 AKGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEARRILKEMKSDGTMPDLF 257
             G   +   V+     K        Y  ++     K++ +E   ++++M+     PD+ 
Sbjct: 369 KWGKIDKCYIVLDDMIKKGLMPSELTYMHIMVAHEKKESFEECLELMEKMRQIEYHPDIG 428

Query: 258 CYNTFLNCLCEQNLEKNPSGLVPEALNVMMEMRSYKIAPNSISYNILLSCLGRTRRVKES 317
            YN  +   C+        G V EA+ +  EM    ++P   ++ I+++ L         
Sbjct: 429 IYNVVIRLACK-------LGEVKEAVRLWNEMEENGLSPGVDTFVIMINGLA-------- 488

Query: 318 CTILETMKRSGCRPDCISYY--LVARVLFLTGRFGKGRRIVDEMIEE 363
                     GC  +   ++  +V R LF   ++G  + +++ ++++
Sbjct: 489 --------SQGCLLEASDHFKEMVTRGLFSVSQYGTLKLLLNTVLKD 512

BLAST of Cla019096 vs. Swiss-Prot
Match: PP248_ARATH (Pentatricopeptide repeat-containing protein At3g22670, mitochondrial OS=Arabidopsis thaliana GN=At3g22670 PE=2 SV=1)

HSP 1 Score: 111.3 bits (277), Expect = 3.0e-23
Identity = 86/370 (23.24%), Positives = 162/370 (43.78%), Query Frame = 1

Query: 52  KLCEVISCTIGGLDELESSLNKCTISLTSSLVTQVIDSSKNEAPIRRLLRFFLWSLKKLN 111
           K+C+ ++      +++   L+KC + +T SLV QV+    N     +   FF+W+  +  
Sbjct: 104 KVCDFLNKKDTSHEDVVKELSKCDVVVTESLVLQVLRRFSNGW--NQAYGFFIWANSQTG 163

Query: 112 HDLEDEDFNYAIRFFAQKKDYTAINILLSNLKKADR----TMDGQTFGFVAEALVRMDRE 171
           +      +N  +    + +++  +  L++ + K +     T+D  T   V   L +  + 
Sbjct: 164 YVHSGHTYNAMVDVLGKCRNFDLMWELVNEMNKNEESKLVTLD--TMSKVMRRLAKSGKY 223

Query: 172 DEALGLFKNLDK-YKCPHDQFTVTAVITALCAKGHAKRAEGVVLHHKDKISSTMSCIYRS 231
           ++A+  F  ++K Y    D   + +++ AL  +   + A  V L   D I       +  
Sbjct: 224 NKAVDAFLEMEKSYGVKTDTIAMNSLMDALVKENSIEHAHEVFLKLFDTIKPDART-FNI 283

Query: 232 LLYGWSIKKNTKEARRILKEMKSDGTMPDLFCYNTFLNCLCEQNLEKNPSGLVPEALNVM 291
           L++G+   +   +AR ++  MK     PD+  Y +F+   C++       G       ++
Sbjct: 284 LIHGFCKARKFDDARAMMDLMKVTEFTPDVVTYTSFVEAYCKE-------GDFRRVNEML 343

Query: 292 MEMRSYKIAPNSISYNILLSCLGRTRRVKESCTILETMKRSGCRPDCISYYLVARVLFLT 351
            EMR     PN ++Y I++  LG++++V E+  + E MK  GC PD   Y  +  +L  T
Sbjct: 344 EEMRENGCNPNVVTYTIVMHSLGKSKQVAEALGVYEKMKEDGCVPDAKFYSSLIHILSKT 403

Query: 352 GRFGKGRRIVDEMIEEGLIPDRKFYYDLIGVLCGVERTNYALELFEKMKRSSLGGYGPVY 411
           GRF     I ++M  +G+  D   Y  +I       R   AL L ++M+        P  
Sbjct: 404 GRFKDAAEIFEDMTNQGVRRDVLVYNTMISAALHHSRDEMALRLLKRMEDEEGESCSPNV 461

Query: 412 DVLIP--KLC 415
           +   P  K+C
Sbjct: 464 ETYAPLLKMC 461

BLAST of Cla019096 vs. Swiss-Prot
Match: PP418_ARATH (Pentatricopeptide repeat-containing protein At5g46100 OS=Arabidopsis thaliana GN=At5g46100 PE=2 SV=1)

HSP 1 Score: 107.5 bits (267), Expect = 4.4e-22
Identity = 81/337 (24.04%), Positives = 147/337 (43.62%), Query Frame = 1

Query: 114 LEDEDFNYAIRFFAQKKDY------TAINILLSNLKKADRTMDGQTFGFVAEALVRMDRE 173
           +E+   N A +F+   ++        ++N+L+  L + D T+D                 
Sbjct: 132 VEENQLNLAFKFYKNMREIGLPPTVASLNVLIKALCRNDGTVDA---------------- 191

Query: 174 DEALGLFKNLDKYKCPHDQFTVTAVITALCAKGHAKRAEGVVLHHKDKISSTMSCIYRSL 233
              L +F  + K  C  D +T   +I+ LC  G    A+ +     +K  +     Y SL
Sbjct: 192 --GLKIFLEMPKRGCDPDSYTYGTLISGLCRFGRIDEAKKLFTEMVEKDCAPTVVTYTSL 251

Query: 234 LYGWSIKKNTKEARRILKEMKSDGTMPDLFCYNTFLNCLCEQNLEKNPSGLVPEALNVMM 293
           + G    KN  EA R L+EMKS G  P++F Y++ ++ LC+        G   +A+ +  
Sbjct: 252 INGLCGSKNVDEAMRYLEEMKSKGIEPNVFTYSSLMDGLCK-------DGRSLQAMELFE 311

Query: 294 EMRSYKIAPNSISYNILLSCLGRTRRVKESCTILETMKRSGCRPDCISYYLVARVLFLTG 353
            M +    PN ++Y  L++ L + ++++E+  +L+ M   G +PD   Y  V        
Sbjct: 312 MMMARGCRPNMVTYTTLITGLCKEQKIQEAVELLDRMNLQGLKPDAGLYGKVISGFCAIS 371

Query: 354 RFGKGRRIVDEMIEEGLIPDR-------KFYYDLIGVLCGVERTNYALELFEKMKRSSLG 413
           +F +    +DEMI  G+ P+R       K   +++  LC     + A  L+  M+   + 
Sbjct: 372 KFREAANFLDEMILGGITPNRLTWNIHVKTSNEVVRGLC-ANYPSRAFTLYLSMRSRGIS 431

Query: 414 GYGPVYDVLIPKLCRGGDFEMGRKLWEEAMAMGVMLS 438
                 + L+  LC+ G+F+   +L +E +  G + S
Sbjct: 432 VEVETLESLVKCLCKKGEFQKAVQLVDEIVTDGCIPS 442

BLAST of Cla019096 vs. TrEMBL
Match: A0A0A0LPF4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G009920 PE=4 SV=1)

HSP 1 Score: 895.6 bits (2313), Expect = 2.7e-257
Identity = 437/487 (89.73%), Positives = 461/487 (94.66%), Query Frame = 1

Query: 1   MHARHMFDEMSLRRCGSLLKLKWDSFIVQSFYMQHRFCSLHSTIDKRASVIKLCEVISCT 60
           MHARH+FDEMSLRRC SLLKLKWDSFI QS   QHRFCSLHST++  A+V KLCEVISCT
Sbjct: 1   MHARHLFDEMSLRRCVSLLKLKWDSFIAQSVCTQHRFCSLHSTVNNGAAVSKLCEVISCT 60

Query: 61  IGGLDELESSLNKCTISLTSSLVTQVIDSSKNEAPIRRLLRFFLWSLKKLNHDLEDEDFN 120
           IGGLDELESSLNKCTISLTSSLVTQVIDSSKNEAP RRLLRFFLWSLKKLNH LEDEDFN
Sbjct: 61  IGGLDELESSLNKCTISLTSSLVTQVIDSSKNEAPTRRLLRFFLWSLKKLNHTLEDEDFN 120

Query: 121 YAIRFFAQKKDYTAINILLSNLKKADRTMDGQTFGFVAEALVRMDREDEALGLFKNLDKY 180
            AIRFFAQKKDYTA+NILLSNLKKADR MDGQTFGFVAEA V+MDREDEALGLFKNL+KY
Sbjct: 121 NAIRFFAQKKDYTAVNILLSNLKKADRAMDGQTFGFVAEAFVKMDREDEALGLFKNLEKY 180

Query: 181 KCPHDQFTVTAVITALCAKGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEA 240
           KCPHDQFTV A+ITALC+KGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEA
Sbjct: 181 KCPHDQFTVVAIITALCSKGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEA 240

Query: 241 RRILKEMKSDGTMPDLFCYNTFLNCLCEQNLEKNPSGLVPEALNVMMEMRSYKIAPNSIS 300
           RRILKEMKSDGTMPDLFCYNTFL CLCE+N+EKNPSGLVPE+LNVMMEMRSYKI+PNSIS
Sbjct: 241 RRILKEMKSDGTMPDLFCYNTFLKCLCEKNVEKNPSGLVPESLNVMMEMRSYKISPNSIS 300

Query: 301 YNILLSCLGRTRRVKESCTILETMKRSGCRPDCISYYLVARVLFLTGRFGKGRRIVDEMI 360
           YNILLSCL +TRRVKESC ILE MKR+GC+PDC+SYYL+ARVLFLTGRFGKGR IVDEMI
Sbjct: 301 YNILLSCLCKTRRVKESCKILEMMKRTGCQPDCVSYYLMARVLFLTGRFGKGREIVDEMI 360

Query: 361 EEGLIPDRKFYYDLIGVLCGVERTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRGGDFE 420
           EEGL PDRKFYYDLIG+LCGVERTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRGG+FE
Sbjct: 361 EEGLTPDRKFYYDLIGILCGVERTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRGGEFE 420

Query: 421 MGRKLWEEAMAMGVMLSCSSEVLDPSITKVFKPTRKIENKILEECNRGEKQNKAATEKPK 480
           MGR+LWEEAMAMGV L+CSSE+LDPSITKVFKPTRKIENKI+EE N  EKQNKAA EKPK
Sbjct: 421 MGRQLWEEAMAMGVSLNCSSEILDPSITKVFKPTRKIENKIVEEFNSAEKQNKAAAEKPK 480

Query: 481 EKRKKSK 488
           EKRKK K
Sbjct: 481 EKRKKGK 487

BLAST of Cla019096 vs. TrEMBL
Match: V4T153_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10023621mg PE=4 SV=1)

HSP 1 Score: 640.2 bits (1650), Expect = 2.1e-180
Identity = 320/476 (67.23%), Positives = 383/476 (80.46%), Query Frame = 1

Query: 18  LLKLKWDSFIVQSFYMQ---HRFCSLHSTIDKRA---SVIKLCEVISCTIGGLDELESSL 77
           LLK KW S ++Q    Q   H   SL+S +        + +LC+V+S TIGGLD+LE SL
Sbjct: 4   LLKPKWRSLLLQRVDTQKSEHLLLSLYSMVPSNQVSHELKELCKVVSSTIGGLDDLELSL 63

Query: 78  NKCTISLTSSLVTQVIDSSKNEAPIRRLLRFFLWSLKKLNHDLEDEDFNYAIRFFAQKKD 137
           N+ T SL+SSLVTQVIDS K+EAP RRLLRFFLWS K L+  LED+D+N+AIR FA+KKD
Sbjct: 64  NQFTGSLSSSLVTQVIDSCKHEAPTRRLLRFFLWSCKNLSASLEDKDYNHAIRVFAEKKD 123

Query: 138 YTAINILLSNLKKADRTMDGQTFGFVAEALVRMDREDEALGLFKNLDKYKCPHDQFTVTA 197
           + A+NIL+S+L+K  R M+ Q+FG + E LV++ REDEALG+FKNL+K+KC  D  TV+A
Sbjct: 124 HMAMNILVSDLRKEGRVMETQSFGVLVETLVKLGREDEALGIFKNLEKFKCVQDSVTVSA 183

Query: 198 VITALCAKGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEARRILKEMKSDG 257
           +++ALCAKGHA+RAEGVV HHKDKIS    CIYRSL+YGWS+++N K AR+I+KEMKS G
Sbjct: 184 IVSALCAKGHARRAEGVVYHHKDKISGVELCIYRSLIYGWSMQENVKAARKIIKEMKSAG 243

Query: 258 TMPDLFCYNTFLNCLCEQNLEKNPSGLVPEALNVMMEMRSYKIAPNSISYNILLSCLGRT 317
            MPDLFCYNTFL  LCE+NL++NPSGLVPEALNVMMEMRSY+IAP SISYNILLSCLGRT
Sbjct: 244 FMPDLFCYNTFLRGLCERNLKRNPSGLVPEALNVMMEMRSYRIAPTSISYNILLSCLGRT 303

Query: 318 RRVKESCTILETMKRSGCRPDCISYYLVARVLFLTGRFGKGRRIVDEMIEEGLIPDRKFY 377
           RRVKESC +LE MK+SGC PD +SYYLVARVL+L+GRFGKG +IVDEMIEEGLIPDRKFY
Sbjct: 304 RRVKESCRVLEQMKKSGCAPDWVSYYLVARVLYLSGRFGKGNKIVDEMIEEGLIPDRKFY 363

Query: 378 YDLIGVLCGVERTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRGGDFEMGRKLWEEAMA 437
           YDLIG+LCGVER N+ALELFE+MKRSSLGGYGPVYDVLIPK+C+GGDF  GR+LW+EAM 
Sbjct: 364 YDLIGILCGVERVNFALELFERMKRSSLGGYGPVYDVLIPKVCQGGDFVKGRELWDEAMV 423

Query: 438 MGVMLSCSSEVLDPSITKVFKPTRKIENKILEECNRGEKQNKAATEKPKEKRKKSK 488
           MG+ LSCSS VLDPSIT+VF P RK     L       +     T    +K+KKSK
Sbjct: 424 MGLTLSCSSNVLDPSITEVFHPRRKPTEGCLGSTTPNIEAQVKKTVIEVDKKKKSK 479

BLAST of Cla019096 vs. TrEMBL
Match: A0A061FXY0_THECC (Pentatricopeptide repeat superfamily protein OS=Theobroma cacao GN=TCM_014106 PE=4 SV=1)

HSP 1 Score: 637.9 bits (1644), Expect = 1.0e-179
Identity = 313/461 (67.90%), Positives = 376/461 (81.56%), Query Frame = 1

Query: 37  FCSLHSTIDKRASVIKLCEVISCTIGGLDELESSLNKCTISLTSSLVTQVIDSSKNEAPI 96
           F S HSTI       +LC+V+S ++GGLD+LESSLN+  +SL+  LVTQVI+S +NEAP 
Sbjct: 27  FHSPHSTITTPPEFEELCKVVSSSMGGLDDLESSLNRFKLSLSPLLVTQVINSCENEAPT 86

Query: 97  RRLLRFFLWSLKKLNHDLEDEDFNYAIRFFAQKKDYTAINILLSNLKKADRTMDGQTFGF 156
           RRLLRFFLWS+K L+  LED+D N  +R FA+KKD+TA+ IL+S+++   RTM+ QTF  
Sbjct: 87  RRLLRFFLWSVKNLSSSLEDKDLNNVVRVFAKKKDHTAMGILVSDIRNRGRTMESQTFSV 146

Query: 157 VAEALVRMDREDEALGLFKNLDKYKCPHDQFTVTAVITALCAKGHAKRAEGVVLHHKDKI 216
           VAE LV++ REDEALG+FKNL+K+KCP D F++TA++ ALCAKGHA++AEGVV HHKD I
Sbjct: 147 VAEMLVKLGREDEALGIFKNLEKFKCPRDSFSLTAIVNALCAKGHARKAEGVVYHHKDTI 206

Query: 217 SSTMSCIYRSLLYGWSIKKNTKEARRILKEMKSDGTMPDLFCYNTFLNCLCEQNLEKNPS 276
           +    CIYR LLYGWS+++N KEARR++KEMKS G   DL+CYNTFL CLC +N ++NPS
Sbjct: 207 AGVEPCIYRCLLYGWSVQENVKEARRVIKEMKSAGFELDLYCYNTFLRCLCGKNAKRNPS 266

Query: 277 GLVPEALNVMMEMRSYKIAPNSISYNILLSCLGRTRRVKESCTILETMKRSGCRPDCISY 336
           GLVPEALNVMMEMRS +IAP S+SYNILLSCLGRTRRVKESC ILE MK++GC PD ISY
Sbjct: 267 GLVPEALNVMMEMRSQRIAPTSVSYNILLSCLGRTRRVKESCQILELMKKAGCAPDWISY 326

Query: 337 YLVARVLFLTGRFGKGRRIVDEMIEEGLIPDRKFYYDLIGVLCGVERTNYALELFEKMKR 396
           YLVARVL+LTGRFGKG +IVDEMIE+GL PDRKFYYDLIGVLCGVER N+ALELFE+MKR
Sbjct: 327 YLVARVLYLTGRFGKGNKIVDEMIEQGLTPDRKFYYDLIGVLCGVERVNFALELFERMKR 386

Query: 397 SSLGGYGPVYDVLIPKLCRGGDFEMGRKLWEEAMAMGVMLSCSSEVLDPSITKVFKPTRK 456
           SSLGGYGPVYDVLIPKLCRGGDFE GR+LW+EA+A GV LSCSS+VLDPSIT+VFKPTRK
Sbjct: 387 SSLGGYGPVYDVLIPKLCRGGDFEKGRELWDEAVATGVSLSCSSDVLDPSITEVFKPTRK 446

Query: 457 IENKILEECNRGE-----KQNKAATEKPKEKRKKSKNLSWK 493
            E   L+ C   +     KQN    +K K+ +KK K  S K
Sbjct: 447 AEKVHLKGCTMAKSPVKNKQNTMKGKKYKKIKKKKKKSSSK 487

BLAST of Cla019096 vs. TrEMBL
Match: W9QUA2_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_008580 PE=4 SV=1)

HSP 1 Score: 635.2 bits (1637), Expect = 6.7e-179
Identity = 320/458 (69.87%), Positives = 369/458 (80.57%), Query Frame = 1

Query: 20  KLKWDSFIVQSFYMQ---HRFCSLHSTIDKRASVIKLCEVISCTIGGLDELESSLNKCTI 79
           K +W  F+++SF  Q      C  +S +   + + +LC ++S TIGGLD+LESSL+    
Sbjct: 6   KSRWHYFLLRSFTAQKFRQLSCLPNSNLSSASRLQELCTIVSRTIGGLDDLESSLSDFRG 65

Query: 80  SLTSSLVTQVIDSSKNEAPIRRLLRFFLWSLKKLNHDLEDEDFNYAIRFFAQKKDYTAIN 139
           SLTSSLVTQVIDS K EAP RRLLRFFLWS K L  DLED+D+N+AIR FA KKD+TA+ 
Sbjct: 66  SLTSSLVTQVIDSCKTEAPTRRLLRFFLWSHKNLKCDLEDKDYNHAIRVFAGKKDHTALE 125

Query: 140 ILLSNLKKADRTMDGQTFGFVAEALVRMDREDEALGLFKNLDKYKCPHDQFTVTAVITAL 199
           IL+S+LKK  R ++ QT+  VAE LV++ REDEALG+FKN DKYKCP + FTVTAV+ AL
Sbjct: 126 ILVSDLKKGGRALESQTYAIVAETLVKLGREDEALGIFKNSDKYKCPQNSFTVTAVVNAL 185

Query: 200 CAKGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEARRILKEMKSDGTMPDL 259
           CA+GHAKRAEGVV HHKD+IS    CIYRSLLYGWS ++N KEARRI+KEMKS G  PDL
Sbjct: 186 CAQGHAKRAEGVVGHHKDRISGMERCIYRSLLYGWSEQENVKEARRIIKEMKSAGINPDL 245

Query: 260 FCYNTFLNCLCEQNLEKNPSGLVPEALNVMMEMRSYKIAPNSISYNILLSCLGRTRRVKE 319
           FCYNTFL CLCE+NL++NPSGLVPEALNVMMEMRSY I PNSISYNILLSCLGR RRVKE
Sbjct: 246 FCYNTFLRCLCERNLKRNPSGLVPEALNVMMEMRSYMITPNSISYNILLSCLGRARRVKE 305

Query: 320 SCTILETMKRSGCRPDCISYYLVARVLFLTGRFGKGRRIVDEMIEEGLIPDRKFYYDLIG 379
           +C ILE MK++GC PD +SYYLV RVL+LT RFGKG ++VDEMI EGL+P+ KFYYDLIG
Sbjct: 306 ACQILERMKQAGCSPDWMSYYLVIRVLYLTMRFGKGNKLVDEMIGEGLVPNCKFYYDLIG 365

Query: 380 VLCGVERTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRGGDFEMGRKLWEEAMAMGVML 439
           VLCGVER  YALELFE MK+ SLGGYGPVYDVLIPKLCRGGDFE GR+LW EAM MGV  
Sbjct: 366 VLCGVERPYYALELFEHMKKRSLGGYGPVYDVLIPKLCRGGDFEKGRELWIEAMNMGVDF 425

Query: 440 SCSSEVLDPSITKVFKPTRKIENKI-LEECNRGEKQNK 474
            CSS+VLDPSITKVFKPTRK E KI  EE    E +NK
Sbjct: 426 CCSSDVLDPSITKVFKPTRKEEEKISQEESTSSENKNK 463

BLAST of Cla019096 vs. TrEMBL
Match: D7TDF7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0127g00660 PE=4 SV=1)

HSP 1 Score: 629.0 bits (1621), Expect = 4.8e-177
Identity = 312/442 (70.59%), Positives = 357/442 (80.77%), Query Frame = 1

Query: 52  KLCEVISCTIGGLDELESSLNKCTISLTSSLVTQVIDSSKNEAPIRRLLRFFLWSLKKLN 111
           +LC V+S  +G LD+LE+SL++   S TSSL++Q++D+ KNEAP RRLLRFFLWS KK N
Sbjct: 11  ELCNVVSNGVGSLDDLEASLDRLDASFTSSLISQILDTCKNEAPTRRLLRFFLWSSKKFN 70

Query: 112 HDLEDEDFNYAIRFFAQKKDYTAINILLSNLKKADRTMDGQTFGFVAEALVRMDREDEAL 171
             LED+DFNYAI+ FA+KKD  AI+IL+S+L    R M  QTFG VAE LV + RED+AL
Sbjct: 71  CKLEDDDFNYAIQVFAEKKDLKAIDILVSDLSNEGREMKAQTFGIVAETLVSLGREDDAL 130

Query: 172 GLFKNLDKYKCPHDQFTVTAVITALCAKGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGW 231
           GLFKNLDK+KC +D  TVTA++ ALC+KGHA+RAEGVV HHKDKI     CIYRSL YGW
Sbjct: 131 GLFKNLDKFKCSYDSVTVTAIVNALCSKGHARRAEGVVRHHKDKILGVKPCIYRSLFYGW 190

Query: 232 SIKKNTKEARRILKEMKSDGTMPDLFCYNTFLNCLCEQNLEKNPSGLVPEALNVMMEMRS 291
           S +KN KEARRILKEMKS G MPDLFCYNTFL CLCE+NL+ NPSGLVPEALNVMMEMRS
Sbjct: 191 SEQKNVKEARRILKEMKSVGIMPDLFCYNTFLRCLCERNLKSNPSGLVPEALNVMMEMRS 250

Query: 292 YKIAPNSISYNILLSCLGRTRRVKESCTILETMKRSGCRPDCISYYLVARVLFLTGRFGK 351
            +I P SISYNILLSCLGRTRRVKESC IL+ MKR GC PD +SYYLVARVL+LTGRFGK
Sbjct: 251 NRITPTSISYNILLSCLGRTRRVKESCRILDLMKRLGCSPDWVSYYLVARVLYLTGRFGK 310

Query: 352 GRRIVDEMIEEGLIPDRKFYYDLIGVLCGVERTNYALELFEKMKRSSLGGYGPVYDVLIP 411
           G +IVDEMIEEGL+PDRKFYYDLIGVLCGVER NYALE+FE+MKRSSLGGYGPVYDVLIP
Sbjct: 311 GNQIVDEMIEEGLVPDRKFYYDLIGVLCGVERVNYALEMFERMKRSSLGGYGPVYDVLIP 370

Query: 412 KLCRGGDFEMGRKLWEEAMAMGVMLSCSSEVLDPSITKVFKPTRKIENKILEECNRGE-- 471
           KLCR GDF  GR+LW+EA  +GV+L CSSEVLDPSITKVFKP RK E    E C      
Sbjct: 371 KLCRSGDFGKGRELWDEATRVGVLLHCSSEVLDPSITKVFKPARKDE----EVCTMSRTL 430

Query: 472 -KQNKAATEKPKEKRKKSKNLS 491
            ++ KA     K K+ K+K  S
Sbjct: 431 VQEAKAVRNGNKNKKNKNKKKS 448

BLAST of Cla019096 vs. NCBI nr
Match: gi|449441065|ref|XP_004138304.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g61370, mitochondrial [Cucumis sativus])

HSP 1 Score: 895.6 bits (2313), Expect = 3.9e-257
Identity = 437/487 (89.73%), Positives = 461/487 (94.66%), Query Frame = 1

Query: 1   MHARHMFDEMSLRRCGSLLKLKWDSFIVQSFYMQHRFCSLHSTIDKRASVIKLCEVISCT 60
           MHARH+FDEMSLRRC SLLKLKWDSFI QS   QHRFCSLHST++  A+V KLCEVISCT
Sbjct: 1   MHARHLFDEMSLRRCVSLLKLKWDSFIAQSVCTQHRFCSLHSTVNNGAAVSKLCEVISCT 60

Query: 61  IGGLDELESSLNKCTISLTSSLVTQVIDSSKNEAPIRRLLRFFLWSLKKLNHDLEDEDFN 120
           IGGLDELESSLNKCTISLTSSLVTQVIDSSKNEAP RRLLRFFLWSLKKLNH LEDEDFN
Sbjct: 61  IGGLDELESSLNKCTISLTSSLVTQVIDSSKNEAPTRRLLRFFLWSLKKLNHTLEDEDFN 120

Query: 121 YAIRFFAQKKDYTAINILLSNLKKADRTMDGQTFGFVAEALVRMDREDEALGLFKNLDKY 180
            AIRFFAQKKDYTA+NILLSNLKKADR MDGQTFGFVAEA V+MDREDEALGLFKNL+KY
Sbjct: 121 NAIRFFAQKKDYTAVNILLSNLKKADRAMDGQTFGFVAEAFVKMDREDEALGLFKNLEKY 180

Query: 181 KCPHDQFTVTAVITALCAKGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEA 240
           KCPHDQFTV A+ITALC+KGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEA
Sbjct: 181 KCPHDQFTVVAIITALCSKGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEA 240

Query: 241 RRILKEMKSDGTMPDLFCYNTFLNCLCEQNLEKNPSGLVPEALNVMMEMRSYKIAPNSIS 300
           RRILKEMKSDGTMPDLFCYNTFL CLCE+N+EKNPSGLVPE+LNVMMEMRSYKI+PNSIS
Sbjct: 241 RRILKEMKSDGTMPDLFCYNTFLKCLCEKNVEKNPSGLVPESLNVMMEMRSYKISPNSIS 300

Query: 301 YNILLSCLGRTRRVKESCTILETMKRSGCRPDCISYYLVARVLFLTGRFGKGRRIVDEMI 360
           YNILLSCL +TRRVKESC ILE MKR+GC+PDC+SYYL+ARVLFLTGRFGKGR IVDEMI
Sbjct: 301 YNILLSCLCKTRRVKESCKILEMMKRTGCQPDCVSYYLMARVLFLTGRFGKGREIVDEMI 360

Query: 361 EEGLIPDRKFYYDLIGVLCGVERTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRGGDFE 420
           EEGL PDRKFYYDLIG+LCGVERTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRGG+FE
Sbjct: 361 EEGLTPDRKFYYDLIGILCGVERTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRGGEFE 420

Query: 421 MGRKLWEEAMAMGVMLSCSSEVLDPSITKVFKPTRKIENKILEECNRGEKQNKAATEKPK 480
           MGR+LWEEAMAMGV L+CSSE+LDPSITKVFKPTRKIENKI+EE N  EKQNKAA EKPK
Sbjct: 421 MGRQLWEEAMAMGVSLNCSSEILDPSITKVFKPTRKIENKIVEEFNSAEKQNKAAAEKPK 480

Query: 481 EKRKKSK 488
           EKRKK K
Sbjct: 481 EKRKKGK 487

BLAST of Cla019096 vs. NCBI nr
Match: gi|659105975|ref|XP_008453221.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g61370, mitochondrial [Cucumis melo])

HSP 1 Score: 887.1 bits (2291), Expect = 1.4e-254
Identity = 434/485 (89.48%), Positives = 457/485 (94.23%), Query Frame = 1

Query: 1   MHARHMFDEMSLRRCGSLLKLKWDSFIVQSFYMQHRFCSLHSTIDKRASVIKLCEVISCT 60
           MHARHMFDEM LRRC SLLKLKWDSFI QS   QHRFCSLHST++  A+V KLCEVISCT
Sbjct: 1   MHARHMFDEMPLRRCVSLLKLKWDSFIAQSVCTQHRFCSLHSTVNNGAAVSKLCEVISCT 60

Query: 61  IGGLDELESSLNKCTISLTSSLVTQVIDSSKNEAPIRRLLRFFLWSLKKLNHDLEDEDFN 120
           IGGLDELESSLN+CTISLTSSLVTQVIDSSKNEAP RRLLRFFLWSLKKLNH LEDEDFN
Sbjct: 61  IGGLDELESSLNQCTISLTSSLVTQVIDSSKNEAPTRRLLRFFLWSLKKLNHTLEDEDFN 120

Query: 121 YAIRFFAQKKDYTAINILLSNLKKADRTMDGQTFGFVAEALVRMDREDEALGLFKNLDKY 180
            AIRFFAQKKDYTAINILLSNLKKADR MDGQTF FVAEA V+M+R+DEALGLFKNL+KY
Sbjct: 121 NAIRFFAQKKDYTAINILLSNLKKADRAMDGQTFSFVAEAFVKMNRDDEALGLFKNLEKY 180

Query: 181 KCPHDQFTVTAVITALCAKGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEA 240
           KCPHDQFTV A+ITALC+KGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEA
Sbjct: 181 KCPHDQFTVVAIITALCSKGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEA 240

Query: 241 RRILKEMKSDGTMPDLFCYNTFLNCLCEQNLEKNPSGLVPEALNVMMEMRSYKIAPNSIS 300
           RRILKEMKSDGTMPDLF YNTFL CLCE+N+EKNPSGLVPEALNVMMEMRSYKIAPNSIS
Sbjct: 241 RRILKEMKSDGTMPDLFSYNTFLKCLCEKNVEKNPSGLVPEALNVMMEMRSYKIAPNSIS 300

Query: 301 YNILLSCLGRTRRVKESCTILETMKRSGCRPDCISYYLVARVLFLTGRFGKGRRIVDEMI 360
           YNILLSCL +TRRVKESC ILE MKRSGCRPDC+SYYLVARVLFLTGRFGKGR IVDEMI
Sbjct: 301 YNILLSCLCKTRRVKESCRILEMMKRSGCRPDCVSYYLVARVLFLTGRFGKGREIVDEMI 360

Query: 361 EEGLIPDRKFYYDLIGVLCGVERTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRGGDFE 420
           EEGL PDRKFYY+LIG+LCGVERTNYA+ELFEKMKRSSLGGYGPVYDVLIPK+CRGGDFE
Sbjct: 361 EEGLTPDRKFYYELIGILCGVERTNYAVELFEKMKRSSLGGYGPVYDVLIPKVCRGGDFE 420

Query: 421 MGRKLWEEAMAMGVMLSCSSEVLDPSITKVFKPTRKIENKILEECNRGEKQNKAATEKPK 480
           MGR+LWEEAMAMGV L+CSSE+LDPSITKVFKPTRKIENKI+EECN  EKQNKAA EKP 
Sbjct: 421 MGRQLWEEAMAMGVSLNCSSEILDPSITKVFKPTRKIENKIVEECNTAEKQNKAAAEKPN 480

Query: 481 EKRKK 486
           +KRKK
Sbjct: 481 KKRKK 485

BLAST of Cla019096 vs. NCBI nr
Match: gi|568884695|ref|XP_006494986.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g61370, mitochondrial [Citrus sinensis])

HSP 1 Score: 645.2 bits (1663), Expect = 9.3e-182
Identity = 319/476 (67.02%), Positives = 385/476 (80.88%), Query Frame = 1

Query: 18  LLKLKWDSFIVQ---SFYMQHRFCSLHSTIDKRA---SVIKLCEVISCTIGGLDELESSL 77
           LLK KW + ++Q   +   +H   SL+ST+        + +LC+V+S TIGGLD+LE SL
Sbjct: 9   LLKPKWRTLLLQRVDTHNFEHLLLSLYSTVPSNQVSHELKELCKVVSSTIGGLDDLELSL 68

Query: 78  NKCTISLTSSLVTQVIDSSKNEAPIRRLLRFFLWSLKKLNHDLEDEDFNYAIRFFAQKKD 137
           N+ T SLTSSLVTQVIDS K EAP RRLLRFFLWS K ++  LED+D+N+AIR FA+K+D
Sbjct: 69  NQFTGSLTSSLVTQVIDSCKQEAPTRRLLRFFLWSCKNMSASLEDKDYNHAIRVFAEKRD 128

Query: 138 YTAINILLSNLKKADRTMDGQTFGFVAEALVRMDREDEALGLFKNLDKYKCPHDQFTVTA 197
           +TA+NIL+S+L+K  R M+ Q+FG + E LV++ REDEALG+FKNL+K+KC  D  TV+A
Sbjct: 129 HTAMNILVSDLRKEGRVMESQSFGVLVETLVKLGREDEALGIFKNLEKFKCVQDSVTVSA 188

Query: 198 VITALCAKGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEARRILKEMKSDG 257
           +++ALCAKGHA+RAEGVV HHKDKIS    CIYRSL+YGWS+++N K AR+I+KEMKS G
Sbjct: 189 IVSALCAKGHARRAEGVVYHHKDKISGVELCIYRSLIYGWSMQENVKAARKIIKEMKSAG 248

Query: 258 TMPDLFCYNTFLNCLCEQNLEKNPSGLVPEALNVMMEMRSYKIAPNSISYNILLSCLGRT 317
            MPDLFCYNTFL  LCE+NL++NPSGLVPEALNVMMEMRSY+IAP SISYNILLSCLGRT
Sbjct: 249 IMPDLFCYNTFLRGLCERNLKRNPSGLVPEALNVMMEMRSYRIAPTSISYNILLSCLGRT 308

Query: 318 RRVKESCTILETMKRSGCRPDCISYYLVARVLFLTGRFGKGRRIVDEMIEEGLIPDRKFY 377
           RRVKESC +LE MK+SGC PD +SYYLVARVL+L+GRFGKG +IVDEMIEEGLIPDRKFY
Sbjct: 309 RRVKESCQVLEQMKKSGCAPDWVSYYLVARVLYLSGRFGKGNKIVDEMIEEGLIPDRKFY 368

Query: 378 YDLIGVLCGVERTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRGGDFEMGRKLWEEAMA 437
           YDLIG+LCGVER N+ALELFE+MKRSSLGGYGPVYDVLIPK+CRGGDF  GR+LW+EAM 
Sbjct: 369 YDLIGILCGVERVNFALELFERMKRSSLGGYGPVYDVLIPKVCRGGDFVKGRELWDEAMV 428

Query: 438 MGVMLSCSSEVLDPSITKVFKPTRKIENKILEECNRGEKQNKAATEKPKEKRKKSK 488
           MG+ LSCSS VLDPSI +VF+P RK     L       +     T    +K+KKSK
Sbjct: 429 MGLTLSCSSNVLDPSIIEVFQPRRKPTESCLGSTTPNIETQVKKTVIEVDKKKKSK 484

BLAST of Cla019096 vs. NCBI nr
Match: gi|657965798|ref|XP_008374567.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g61370, mitochondrial [Malus domestica])

HSP 1 Score: 643.3 bits (1658), Expect = 3.5e-181
Identity = 318/482 (65.98%), Positives = 389/482 (80.71%), Query Frame = 1

Query: 17  SLLKLKWDSFIVQSFYMQHRF----CSLHSTIDKRAS---VIKLCEVISCTIGGLDELES 76
           S+L  KW  F++Q+     +     C  +ST+   +S   + +LC ++S  IGGLD+LES
Sbjct: 3   SVLLSKWQRFLLQNAVSTQKSQFFSCFYYSTMLPASSPPELQELCTIVSSAIGGLDDLES 62

Query: 77  SLNKCTISLTSSLVTQVIDSSKNEAPIRRLLRFFLWSLKKLNHDLEDEDFNYAIRFFAQK 136
           SLN+ + SLTSS+VTQVIDS K+EAP RRLLRFF W  K L + LED+D+NY IR FA+K
Sbjct: 63  SLNEFSASLTSSIVTQVIDSCKSEAPTRRLLRFFSWCHKNLGYGLEDKDYNYGIRVFAEK 122

Query: 137 KDYTAINILLSNLKKADRTMDGQTFGFVAEALVRMDREDEALGLFKNLDKYKCPHDQFTV 196
           KD+TA+NI+LS+L KA R M+ QTFG V EALV++ +EDEALG+FKNLDKYKCP D  TV
Sbjct: 123 KDHTAMNIVLSDLVKAGRVMEAQTFGLVTEALVKLGKEDEALGMFKNLDKYKCPQDGVTV 182

Query: 197 TAVITALCAKGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEARRILKEMKS 256
           TA++ ALCA+GHAKRAEGVVLHH+DKI+    CIYRSLLYGWS+++N KEARRI+KEMKS
Sbjct: 183 TAIVNALCAQGHAKRAEGVVLHHRDKIAGIEPCIYRSLLYGWSVQENVKEARRIIKEMKS 242

Query: 257 DGTMPDLFCYNTFLNCLCEQNLEKNPSGLVPEALNVMMEMRSYKIAPNSISYNILLSCLG 316
            G MPDLFCYNTFL  LCE+NL+ NPSGLVPEALNVM+EMRSY+I+PNSISYNILLSCLG
Sbjct: 243 IGIMPDLFCYNTFLRALCERNLKLNPSGLVPEALNVMIEMRSYRISPNSISYNILLSCLG 302

Query: 317 RTRRVKESCTILETMKRSGCRPDCISYYLVARVLFLTGRFGKGRRIVDEMIEEGLIPDRK 376
           RTRRVKESC ILETMK++GC PD +SYYLV RVL+L+GRFGKG ++VDEM+E+GL P+RK
Sbjct: 303 RTRRVKESCNILETMKKTGCSPDSVSYYLVVRVLYLSGRFGKGNKLVDEMLEQGLQPNRK 362

Query: 377 FYYDLIGVLCGVERTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRGGDFEMGRKLWEEA 436
           FYYDLIGVL G +R +YALEL ++MK S+LGGYGPVYDVLIPK CRGGDFE GR+LW+EA
Sbjct: 363 FYYDLIGVLVGKDRPHYALELLQRMKASALGGYGPVYDVLIPKFCRGGDFEKGRELWDEA 422

Query: 437 MAMGVMLSCSSEVLDPSITKVFKPTRKIENKILEEC--NRGEKQNKAATEKPKEKRKKSK 490
           +AMG+ L CSS +LDPSIT+VFKPTR  E   L EC   + E++ +  T K K+ +KK K
Sbjct: 423 VAMGITLRCSSNLLDPSITEVFKPTRNEEKLRLAECANAKAEEKVRRRTGKTKQTKKKKK 482

BLAST of Cla019096 vs. NCBI nr
Match: gi|567896330|ref|XP_006440653.1| (hypothetical protein CICLE_v10023621mg [Citrus clementina])

HSP 1 Score: 640.2 bits (1650), Expect = 3.0e-180
Identity = 320/476 (67.23%), Positives = 383/476 (80.46%), Query Frame = 1

Query: 18  LLKLKWDSFIVQSFYMQ---HRFCSLHSTIDKRA---SVIKLCEVISCTIGGLDELESSL 77
           LLK KW S ++Q    Q   H   SL+S +        + +LC+V+S TIGGLD+LE SL
Sbjct: 4   LLKPKWRSLLLQRVDTQKSEHLLLSLYSMVPSNQVSHELKELCKVVSSTIGGLDDLELSL 63

Query: 78  NKCTISLTSSLVTQVIDSSKNEAPIRRLLRFFLWSLKKLNHDLEDEDFNYAIRFFAQKKD 137
           N+ T SL+SSLVTQVIDS K+EAP RRLLRFFLWS K L+  LED+D+N+AIR FA+KKD
Sbjct: 64  NQFTGSLSSSLVTQVIDSCKHEAPTRRLLRFFLWSCKNLSASLEDKDYNHAIRVFAEKKD 123

Query: 138 YTAINILLSNLKKADRTMDGQTFGFVAEALVRMDREDEALGLFKNLDKYKCPHDQFTVTA 197
           + A+NIL+S+L+K  R M+ Q+FG + E LV++ REDEALG+FKNL+K+KC  D  TV+A
Sbjct: 124 HMAMNILVSDLRKEGRVMETQSFGVLVETLVKLGREDEALGIFKNLEKFKCVQDSVTVSA 183

Query: 198 VITALCAKGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEARRILKEMKSDG 257
           +++ALCAKGHA+RAEGVV HHKDKIS    CIYRSL+YGWS+++N K AR+I+KEMKS G
Sbjct: 184 IVSALCAKGHARRAEGVVYHHKDKISGVELCIYRSLIYGWSMQENVKAARKIIKEMKSAG 243

Query: 258 TMPDLFCYNTFLNCLCEQNLEKNPSGLVPEALNVMMEMRSYKIAPNSISYNILLSCLGRT 317
            MPDLFCYNTFL  LCE+NL++NPSGLVPEALNVMMEMRSY+IAP SISYNILLSCLGRT
Sbjct: 244 FMPDLFCYNTFLRGLCERNLKRNPSGLVPEALNVMMEMRSYRIAPTSISYNILLSCLGRT 303

Query: 318 RRVKESCTILETMKRSGCRPDCISYYLVARVLFLTGRFGKGRRIVDEMIEEGLIPDRKFY 377
           RRVKESC +LE MK+SGC PD +SYYLVARVL+L+GRFGKG +IVDEMIEEGLIPDRKFY
Sbjct: 304 RRVKESCRVLEQMKKSGCAPDWVSYYLVARVLYLSGRFGKGNKIVDEMIEEGLIPDRKFY 363

Query: 378 YDLIGVLCGVERTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRGGDFEMGRKLWEEAMA 437
           YDLIG+LCGVER N+ALELFE+MKRSSLGGYGPVYDVLIPK+C+GGDF  GR+LW+EAM 
Sbjct: 364 YDLIGILCGVERVNFALELFERMKRSSLGGYGPVYDVLIPKVCQGGDFVKGRELWDEAMV 423

Query: 438 MGVMLSCSSEVLDPSITKVFKPTRKIENKILEECNRGEKQNKAATEKPKEKRKKSK 488
           MG+ LSCSS VLDPSIT+VF P RK     L       +     T    +K+KKSK
Sbjct: 424 MGLTLSCSSNVLDPSITEVFHPRRKPTEGCLGSTTPNIEAQVKKTVIEVDKKKKSK 479

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
PP439_ARATH  7.4e-163  58.33     Pentatricopeptide repeat-containing protein At5g61370, mitochondrial OS=Arabidop...
PP438_ARATH  6.9e-44   27.75     Pentatricopeptide repeat-containing protein PNM1, mitochondrial OS=Arabidopsis t...
PP447_ARATH  1.7e-26   22.43     Putative pentatricopeptide repeat-containing protein At5g65820 OS=Arabidopsis th...
PP248_ARATH  3.0e-23   23.24     Pentatricopeptide repeat-containing protein At3g22670, mitochondrial OS=Arabidop...
PP418_ARATH  4.4e-22   24.04     Pentatricopeptide repeat-containing protein At5g46100 OS=Arabidopsis thaliana GN...

Match Name        E-value   Identity  Description
A0A0A0LPF4_CUCSA  2.7e-257  89.73     Uncharacterized protein OS=Cucumis sativus GN=Csa_1G009920 PE=4 SV=1
V4T153_9ROSI      2.1e-180  67.23     Uncharacterized protein OS=Citrus clementina GN=CICLE_v10023621mg PE=4 SV=1
A0A061FXY0_THECC  1.0e-179  67.90     Pentatricopeptide repeat superfamily protein OS=Theobroma cacao GN=TCM_014106 PE...
W9QUA2_9ROSA      6.7e-179  69.87     Uncharacterized protein OS=Morus notabilis GN=L484_008580 PE=4 SV=1
D7TDF7_VITVI      4.8e-177  70.59     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0127g00660 PE=4 SV=...

Match Name                        E-value   Identity  Description
gi|449441065|ref|XP_004138304.1|  3.9e-257  89.73     PREDICTED: pentatricopeptide repeat-containing protein At5g61370, mitochondrial ...
gi|659105975|ref|XP_008453221.1|  1.4e-254  89.48     PREDICTED: pentatricopeptide repeat-containing protein At5g61370, mitochondrial ...
gi|568884695|ref|XP_006494986.1|  9.3e-182  67.02     PREDICTED: pentatricopeptide repeat-containing protein At5g61370, mitochondrial ...
gi|657965798|ref|XP_008374567.1|  3.5e-181  65.98     PREDICTED: pentatricopeptide repeat-containing protein At5g61370, mitochondrial ...
gi|567896330|ref|XP_006440653.1|  3.0e-180  67.23     hypothetical protein CICLE_v10023621mg [Citrus clementina]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005739      mitochondrion
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0016021      integral component of membrane
cellular_component  GO:0016020      membrane
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0005515      protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Cla019096     Cla019096.1  mRNA


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term   IPR Description           Source    Source Term      Source Description   Alignment
IPR002885  Pentatricopeptide repeat  PFAM      PF01535          PPR                  coord: 374..397, score:
IPR002885  Pentatricopeptide repeat  PFAM      PF13041          PPR_2                coord: 296..336, score: 3.
IPR002885  Pentatricopeptide repeat  PFAM      PF13812          PPR_3                coord: 224..265, score: 1.
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756        TIGR00756            coord: 223..255, score: 0.0024
                                                                                     coord: 299..333, score: 7.9E-9
                                                                                     coord: 335..367, score: 0.
IPR002885  Pentatricopeptide repeat  PROFILE   PS51375          PPR                  coord: 255..296, score: 6.796
                                                                                     coord: 367..401, score: 7.344
                                                                                     coord: 297..331, score: 11.948
                                                                                     coord: 150..184, score: 6.917
                                                                                     coord: 185..215, score: 5.996
                                                                                     coord: 402..436, score: 7.826
                                                                                     coord: 220..254, score: 8.484
                                                                                     coord: 332..366, score: 8
None       No IPR available          PANTHER   PTHR24015        FAMILY NOT NAMED     coord: 78..435, score: 3.7E
None       No IPR available          PANTHER   PTHR24015:SF620  SUBFAMILY NOT NAMED  coord: 78..435, score: 3.7E
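
The PROSITE PS51375 hits above are listed out of positional order. A minimal Python sketch, assuming the coord ranges are 1-based and inclusive (an assumption; the page does not state the convention), sorts them and computes each span's length and the gap to the next hit. The function and variable names are illustrative only.

```python
# PS51375 hits as (start, end), copied from the InterPro table on this page
hits = [
    (255, 296), (367, 401), (297, 331), (150, 184),
    (185, 215), (402, 436), (220, 254), (332, 366),
]


def span_length(start: int, end: int) -> int:
    """Length of a range, assuming both endpoints are included."""
    return end - start + 1


ordered = sorted(hits)  # sort by start coordinate
lengths = [span_length(start, end) for start, end in ordered]
# Residues lying between the end of one hit and the start of the next
gaps = [nxt[0] - cur[1] - 1 for cur, nxt in zip(ordered, ordered[1:])]

for (start, end), length in zip(ordered, lengths):
    print(f"{start}..{end}  length {length}")
print("gaps between consecutive hits:", gaps)
```

Under that assumption the eight hits tile residues 150 to 436 with a single 4-residue gap, and six of them are 35 residues long, the canonical length of a pentatricopeptide repeat.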

The following gene(s) are paralogous to this gene:

None