ClCG02G011140 (gene) Watermelon (Charleston Gray)

NameClCG02G011140
Typegene
OrganismCitrullus lanatus (Watermelon (Charleston Gray))
DescriptionPentatricopeptide (PPR) repeat protein-like
LocationCG_Chr02 : 23085066 .. 23086792 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TCAACGAGTTGTGGGGCGGGTATTTGGTTTTGCCGATGCGATTCACTTTCGACGCTTGATGGAGCTCCGCTTATGTCCACCGCCATACGTGATCGGAGACGGCGTTCGACTCTTCTTAAAGCCGCTTAAACGCCACGACGGCTTCCGCAGTTACCCTTTCGTGCCAAATCTGCAGGTCAAATGTACACTCACCAAACAAACCCACCGATTCCTCTCCACATTGGCCACAACCGCCGCCACCGGTGACCATTCGGCGACCAATCGCTTGATTCGAAAGTTTGTAGCGAGTTCTCCGAAATCGATAACTCTCAATGTTCTCTCCAATATCGTCTCCCCCCACACGGCTCAACCTGGGCTCTGCTCTGTTGCTCTCACGGTAAGTAGGGCTTTTTTTTCCTCTTTATCCCATCATTCTTGCTTCGTTTCCGAGAAGATCGAGATGAGAATTGTATTTTCCACGTTTTAAAGTCGAAAATTCACACGAAATTTCATTAATGATCTCTTCTGATTCCGTCGTTGCATTTCTCAATCAACTTCCCTATGTTCATGCAGTTGTATTCTAGAATTACGGAAGCGTCTTGGTTCACATGGAATTCCAAGCTAGTTGCTGACCTTATTGCCTTCCTCGACCAAAATGGACTGTACAGTGAGTCGGAAGCCCTAATGTCCGAGACAATTTCAAAGTTAGGGTTTCAAGAAAGAAAACTTGTGAATTTCTACTCCCAGCTGGTTGAATCTCAATCCAAACACGGTTCAGAAAGAGGATGTGGTAACTCCTATGCTCGCCTTCTTGAGCTTCTTTATAATTCGCCGTCAGTTTATGTTAAACGTCGAGCTTATGAATCAATGGTTACTGGTTTGTGCTCCATGAAAAGACCTCAGGAAGCTGAGACTTTGGTAGAAAAAATGAGGAGCAAAGGAATTACGCCTTCTGCATATGAATATAGGTCCATTATTTACGCCTATGGAACATTGGGATTGTTTGAAGATATGCAGAGGAGTTTAAAACAGATGGAGAATGAAGATTATGCATTAGACACAATTTGTTCCAATATGGTGCTTTCTTCATATGGAGCTCATAATAAGCTTGCAGATATGGTTCTATGGCTTCAAAGAATGAAAACTTCCGCGCATCTTAATTTCTCTGTTCGGACTTACAATTCTGTTTTGAAGTCATGTCCGAAGATTACATCAATGCTACAAGATCACAAGAGCAGCAAGTTTCCCGTTTTGATCGAAGGCTTAATCGCGGTTCTGGATGGTGATGAGGAGGCTTTGTTGGTTGAAGAGTTGGTGGTTGGTTCATCTGTTTTGAAAGAAGTAATGGTGTGGGATGCAATGGAATTGAAGTTGGATTTGCATGGAGCACATGTTGGTGCAGCTTATGTGATCATGTTACAGTGGATGAAGGAGATGAGACTCAACTTTGAGGATGAGAGTTGTGTGATTCCAGCACAAGTTACATTGATTTCTGGATCTGGAAACCATAGTATTGTTAGAGGAGAGTCTCCTGTAAAAGCTTTGATTAAAGAGATTATAGTTCGAACTGAAAGTCCGCTGAGAGTGGATCGAAAGAACACAGGTTGCTTCATCTCTAAAGGAAAAGCTGTAAAGAATTGGTTATGTTCACTACCAGAAAAAAGGGAGATTGTAGCTAATAGAAAATGCCATAATAGGCCTTTTACACCGTCTTAAAGTCTATAACAAATGCTATGACAAACCATAAA

mRNA sequence

TCAACGAGTTGTGGGGCGGGTATTTGGTTTTGCCGATGCGATTCACTTTCGACGCTTGATGGAGCTCCGCTTATGTCCACCGCCATACGTGATCGGAGACGGCGTTCGACTCTTCTTAAAGCCGCTTAAACGCCACGACGGCTTCCGCAGTTACCCTTTCGTGCCAAATCTGCAGGTCAAATGTACACTCACCAAACAAACCCACCGATTCCTCTCCACATTGGCCACAACCGCCGCCACCGGTGACCATTCGGCGACCAATCGCTTGATTCGAAAGTTTGTAGCGAGTTCTCCGAAATCGATAACTCTCAATGTTCTCTCCAATATCGTCTCCCCCCACACGGCTCAACCTGGGCTCTGCTCTGTTGCTCTCACGTTGTATTCTAGAATTACGGAAGCGTCTTGGTTCACATGGAATTCCAAGCTAGTTGCTGACCTTATTGCCTTCCTCGACCAAAATGGACTGTACAGTGAGTCGGAAGCCCTAATGTCCGAGACAATTTCAAAGTTAGGGTTTCAAGAAAGAAAACTTGTGAATTTCTACTCCCAGCTGGTTGAATCTCAATCCAAACACGGTTCAGAAAGAGGATGTGGTAACTCCTATGCTCGCCTTCTTGAGCTTCTTTATAATTCGCCGTCAGTTTATGTTAAACGTCGAGCTTATGAATCAATGGTTACTGGTTTGTGCTCCATGAAAAGACCTCAGGAAGCTGAGACTTTGGTAGAAAAAATGAGGAGCAAAGGAATTACGCCTTCTGCATATGAATATAGGTCCATTATTTACGCCTATGGAACATTGGGATTGTTTGAAGATATGCAGAGGAGTTTAAAACAGATGGAGAATGAAGATTATGCATTAGACACAATTTGTTCCAATATGGTGCTTTCTTCATATGGAGCTCATAATAAGCTTGCAGATATGGTTCTATGGCTTCAAAGAATGAAAACTTCCGCGCATCTTAATTTCTCTGTTCGGACTTACAATTCTGTTTTGAAGTCATGTCCGAAGATTACATCAATGCTACAAGATCACAAGAGCAGCAAGTTTCCCGTTTTGATCGAAGGCTTAATCGCGGTTCTGGATGGTGATGAGGAGGCTTTGTTGGTTGAAGAGTTGGTGGTTGGTTCATCTGTTTTGAAAGAAGTAATGGTGTGGGATGCAATGGAATTGAAGTTGGATTTGCATGGAGCACATGTTGGTGCAGCTTATGTGATCATGTTACAGTGGATGAAGGAGATGAGACTCAACTTTGAGGATGAGAGTTGTGTGATTCCAGCACAAGTTACATTGATTTCTGGATCTGGAAACCATAGTATTGTTAGAGGAGAGTCTCCTGTAAAAGCTTTGATTAAAGAGATTATAGTTCGAACTGAAAGTCCGCTGAGAGTGGATCGAAAGAACACAGGTTGCTTCATCTCTAAAGGAAAAGCTGTAAAGAATTGGTTATGTTCACTACCAGAAAAAAGGGAGATTGTAGCTAATAGAAAATGCCATAATAGGCCTTTTACACCGTCTTAAAGTCTATAACAAATGCTATGACAAACCATAAA

Coding sequence (CDS)

ATGGAGCTCCGCTTATGTCCACCGCCATACGTGATCGGAGACGGCGTTCGACTCTTCTTAAAGCCGCTTAAACGCCACGACGGCTTCCGCAGTTACCCTTTCGTGCCAAATCTGCAGGTCAAATGTACACTCACCAAACAAACCCACCGATTCCTCTCCACATTGGCCACAACCGCCGCCACCGGTGACCATTCGGCGACCAATCGCTTGATTCGAAAGTTTGTAGCGAGTTCTCCGAAATCGATAACTCTCAATGTTCTCTCCAATATCGTCTCCCCCCACACGGCTCAACCTGGGCTCTGCTCTGTTGCTCTCACGTTGTATTCTAGAATTACGGAAGCGTCTTGGTTCACATGGAATTCCAAGCTAGTTGCTGACCTTATTGCCTTCCTCGACCAAAATGGACTGTACAGTGAGTCGGAAGCCCTAATGTCCGAGACAATTTCAAAGTTAGGGTTTCAAGAAAGAAAACTTGTGAATTTCTACTCCCAGCTGGTTGAATCTCAATCCAAACACGGTTCAGAAAGAGGATGTGGTAACTCCTATGCTCGCCTTCTTGAGCTTCTTTATAATTCGCCGTCAGTTTATGTTAAACGTCGAGCTTATGAATCAATGGTTACTGGTTTGTGCTCCATGAAAAGACCTCAGGAAGCTGAGACTTTGGTAGAAAAAATGAGGAGCAAAGGAATTACGCCTTCTGCATATGAATATAGGTCCATTATTTACGCCTATGGAACATTGGGATTGTTTGAAGATATGCAGAGGAGTTTAAAACAGATGGAGAATGAAGATTATGCATTAGACACAATTTGTTCCAATATGGTGCTTTCTTCATATGGAGCTCATAATAAGCTTGCAGATATGGTTCTATGGCTTCAAAGAATGAAAACTTCCGCGCATCTTAATTTCTCTGTTCGGACTTACAATTCTGTTTTGAAGTCATGTCCGAAGATTACATCAATGCTACAAGATCACAAGAGCAGCAAGTTTCCCGTTTTGATCGAAGGCTTAATCGCGGTTCTGGATGGTGATGAGGAGGCTTTGTTGGTTGAAGAGTTGGTGGTTGGTTCATCTGTTTTGAAAGAAGTAATGGTGTGGGATGCAATGGAATTGAAGTTGGATTTGCATGGAGCACATGTTGGTGCAGCTTATGTGATCATGTTACAGTGGATGAAGGAGATGAGACTCAACTTTGAGGATGAGAGTTGTGTGATTCCAGCACAAGTTACATTGATTTCTGGATCTGGAAACCATAGTATTGTTAGAGGAGAGTCTCCTGTAAAAGCTTTGATTAAAGAGATTATAGTTCGAACTGAAAGTCCGCTGAGAGTGGATCGAAAGAACACAGGTTGCTTCATCTCTAAAGGAAAAGCTGTAAAGAATTGGTTATGTTCACTACCAGAAAAAAGGGAGATTGTAGCTAATAGAAAATGCCATAATAGGCCTTTTACACCGTCTTAA

Protein sequence

MELRLCPPPYVIGDGVRLFLKPLKRHDGFRSYPFVPNLQVKCTLTKQTHRFLSTLATTAATGDHSATNRLIRKFVASSPKSITLNVLSNIVSPHTAQPGLCSVALTLYSRITEASWFTWNSKLVADLIAFLDQNGLYSESEALMSETISKLGFQERKLVNFYSQLVESQSKHGSERGCGNSYARLLELLYNSPSVYVKRRAYESMVTGLCSMKRPQEAETLVEKMRSKGITPSAYEYRSIIYAYGTLGLFEDMQRSLKQMENEDYALDTICSNMVLSSYGAHNKLADMVLWLQRMKTSAHLNFSVRTYNSVLKSCPKITSMLQDHKSSKFPVLIEGLIAVLDGDEEALLVEELVVGSSVLKEVMVWDAMELKLDLHGAHVGAAYVIMLQWMKEMRLNFEDESCVIPAQVTLISGSGNHSIVRGESPVKALIKEIIVRTESPLRVDRKNTGCFISKGKAVKNWLCSLPEKREIVANRKCHNRPFTPS
BLAST of ClCG02G011140 vs. Swiss-Prot
Match: PP157_ARATH (Pentatricopeptide repeat-containing protein At2g17033 OS=Arabidopsis thaliana GN=At2g17033 PE=2 SV=1)

HSP 1 Score: 448.7 bits (1153), Expect = 7.9e-125
Identity = 232/421 (55.11%), Postives = 303/421 (71.97%), Query Frame = 1

Query: 44  LTKQTHRFLSTLATTAATGDHSATNRLIRKFVASSPKSITLNVLSNIVSPHTAQPGLCSV 103
           L K   RFLS+L++ A  GD SA NR I+KFVA+SPKS+ LNVLS+++S  T+ P L   
Sbjct: 89  LMKHGDRFLSSLSSPALAGDPSAINRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFF 148

Query: 104 ALTLYSRITEASWFTWNSKLVADLIAFLDQNGLYSESEALMSETISKLGFQERKLVNFYS 163
           AL+LYS ITEASWF WN KL+A+LIA L++   + ESE L+S  +S+L   ER    F  
Sbjct: 149 ALSLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLC 208

Query: 164 QLVESQSKHGSERGCGNSYARLLELLYNSPSVYVKRRAYESMVTGLCSMKRPQEAETLVE 223
            LVES SK GS +G   +  RL E++  S SVYVK +AY+SMV+GLC+M +P +AE ++E
Sbjct: 209 NLVESNSKQGSIQGFSEASFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIE 268

Query: 224 KMRSKGITPSAYEYRSIIYAYGTLGLFEDMQRSLKQMENEDYALDTICSNMVLSSYGAHN 283
           +MR + I P  +EY+S++Y YG LGLF+DM R + +M  E + +DT+CSNMVLSSYGAH+
Sbjct: 269 EMRMEKIKPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAHD 328

Query: 284 KLADMVLWLQRMKTSAHLNFSVRTYNSVLKSCPKITSMLQDHKSSKFPVLIEGLIAVLDG 343
            L  M  WLQ++K   ++ FS+RTYNSVL SCP I SML+D  S   PV +  L   L+ 
Sbjct: 329 ALPQMGSWLQKLK-GFNVPFSIRTYNSVLNSCPTIISMLKDLDSC--PVSLSELRTFLN- 388

Query: 344 DEEALLVEELVVGSSVLKEVMVWDAMELKLDLHGAHVGAAYVIMLQWMKEMRLNFEDESC 403
           ++EALLV EL   SSVL E + W+A+E KLDLHG H+ ++Y+I+LQWM E RL F +E C
Sbjct: 389 EDEALLVHEL-TQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSEEKC 448

Query: 404 VIPAQVTLISGSGNHSIVRGESPVKALIKEIIVRTESPLRVDRKNTGCFISKGKAVKNWL 463
           VIPA++ ++SGSG HS VRGESPVKAL+K+I+VRT SP+R+DRKN G FI+KGK VK WL
Sbjct: 449 VIPAEIVVVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKTVKEWL 504

Query: 464 C 465
           C
Sbjct: 509 C 504

BLAST of ClCG02G011140 vs. Swiss-Prot
Match: PP123_ARATH (Pentatricopeptide repeat-containing protein At1g74750 OS=Arabidopsis thaliana GN=At1g74750 PE=2 SV=1)

HSP 1 Score: 61.2 bits (147), Expect = 3.5e-08
Identity = 94/431 (21.81%), Postives = 172/431 (39.91%), Query Frame = 1

Query: 44  LTKQTHRFLSTLATTAATGDHSATNRLIRKFVAS--SPKSITLNVLSNIVSPHTAQPGLC 103
           L+  T  +   +      G   A +RL  + V    +P  +T N++   ++ H A+    
Sbjct: 460 LSPDTFTYSVIINCLGKAGHLPAAHRLFCEMVGQGCTPNLVTFNIM---IALH-AKARNY 519

Query: 104 SVALTLYSRITEASWFTWNSKLVADLIAFLDQNGLYSESEALMSETISKLGFQERKLVNF 163
             AL LY  +  A  F  +    + ++  L   G   E+E + +E   K    +  +   
Sbjct: 520 ETALKLYRDMQNAG-FQPDKVTYSIVMEVLGHCGFLEEAEGVFAEMQRKNWVPDEPV--- 579

Query: 164 YSQLVESQSKHGSERGCGNSYARLLE--LLYNSPSVYVKRRAYESMVTGLCSMKRPQEAE 223
           Y  LV+   K G+       Y  +L+  L  N P+         S+++    + R  EA 
Sbjct: 580 YGLLVDLWGKAGNVDKAWQWYQAMLQAGLRPNVPTC-------NSLLSTFLRVHRMSEAY 639

Query: 224 TLVEKMRSKGITPSAYEYRSIIYAYGTLGLFEDMQRSLKQMENEDYALDTICSNMVLSSY 283
            L++ M + G+ PS   Y  ++              S       ++ +      M +S +
Sbjct: 640 NLLQSMLALGLHPSLQTYTLLL--------------SCCTDARSNFDMGFCGQLMAVSGH 699

Query: 284 GAHNKLADMVLWLQRMKTSAHLNFSVRTYNSVLKSCPKITSMLQDHKSSKFPVLIEGLIA 343
            AH       ++L +M  +      VR + S       +  M  + + SK   L++ ++ 
Sbjct: 700 PAH-------MFLLKMPPAGPDGQKVRDHVSNF-----LDFMHSEDRESKRG-LMDAVVD 759

Query: 344 VLDGD---EEALLVEELVVGSSVLKEVMVWDAMELKL-DLHGAHVGAAYVIM---LQWMK 403
            L      EEA  V E+  G +V  + +   +    L +LH    G A + +   L W +
Sbjct: 760 FLHKSGLKEEAGSVWEVAAGKNVYPDALREKSYSYWLINLHVMSEGTAVIALSRTLAWFR 819

Query: 404 EMRLNFEDESCVIPAQVTLISGSGNHSIVRGESPVKALIKEIIVRTESPLRVDRKNTGCF 463
           +  L   D     P+++ +++G G  S V G S V+  ++E++     P   +  N+GCF
Sbjct: 820 KQMLVSGD----CPSRIDIVTGWGRRSRVTGTSMVRQAVEELLNIFNFPFFTENGNSGCF 844

BLAST of ClCG02G011140 vs. Swiss-Prot
Match: PP402_ARATH (Putative pentatricopeptide repeat-containing protein At5g36300 OS=Arabidopsis thaliana GN=At5g36300 PE=3 SV=3)

HSP 1 Score: 59.3 bits (142), Expect = 1.3e-07
Identity = 60/275 (21.82%), Postives = 121/275 (44.00%), Query Frame = 1

Query: 51  FLSTLATTAATGDHSATNRLIRKFVASSPKSITLNVLSNIVSPHTAQPGLCSVALTLYSR 110
           F+ TLA+   T +  A    + +F+     S+ L   + +VS +  +     V   +  R
Sbjct: 52  FIETLASLRRTLEADALFHEVVRFMIYGSYSVRL--YNALVSRYLRKEVSWRVVNEMKKR 111

Query: 111 ITEASWFTWNSKLVADLIAFLDQNGLYSESEALMSETISKLGFQERKLVNFYSQLVESQS 170
                 F  NS +   +I     NG++ ++  ++ E I ++G      V  Y+ ++++  
Sbjct: 112 K-----FRLNSFVYGKIIRIYRDNGMWKKALGIVEE-IREIGLPMD--VEIYNSVIDTFG 171

Query: 171 KHGSERGCGNSYARLLELLYNSPSVYVKRRAYESMVTGLCSMKRP-----------QEAE 230
           K+G      +   ++LE L  S       R + S++   C                ++  
Sbjct: 172 KYGEL----DEELQVLEKLQRSSDSRPNIRTWNSLIRWHCHHGAVDMALELFTMIFEDIG 231

Query: 231 TLVEKMRSKGITPSAYEYRSIIYAYGTLGLFEDMQRSLKQMENEDYALDTICSNMVLSSY 290
            LV K++S+G+ PSA  + ++  AY   GL +   + LK MENE    + I  N++++++
Sbjct: 232 ELVGKLKSQGVAPSANLFCTLANAYAQQGLCKQTVKVLKMMENEGIEPNLIMLNVLINAF 291

Query: 291 GAHNKLADMVLWLQRMKTSAHLNFSVRTYNSVLKS 315
           G   K  + +     +K +  ++  V TY++++K+
Sbjct: 292 GTAGKHMEALSIYHHIKETVWIHPDVVTYSTLMKA 312

BLAST of ClCG02G011140 vs. Swiss-Prot
Match: PP279_ARATH (Pentatricopeptide repeat-containing protein At3g53170 OS=Arabidopsis thaliana GN=At3g53170 PE=3 SV=1)

HSP 1 Score: 55.1 bits (131), Expect = 2.5e-06
Identity = 59/273 (21.61%), Postives = 111/273 (40.66%), Query Frame = 1

Query: 46  KQTHRFLSTLATTAATGDHSATNRLIRKFVASSPKSITLNVLSNIVSPHTAQPGLCSVAL 105
           K+  R L T A  A  G     N    K++   PK++ L  L   +  +  Q      AL
Sbjct: 29  KELSRILRTDA--AVKGIERKANS--EKYLTLWPKAV-LEALDEAIKENRWQS-----AL 88

Query: 106 TLYSRITEASWFTWNSKLVADLIAFLDQNGLYSESEALMSETISKLGFQERKLVNFYSQL 165
            +++ + +  W+    K    L   L  N    +  +L+ E +   G   +  ++ Y+ L
Sbjct: 89  KIFNLLRKQHWYEPRCKTYTKLFKVLG-NCKQPDQASLLFEVMLSEGL--KPTIDVYTSL 148

Query: 166 VESQSKHGSERGCGNSYARLLELLYNSPSVYVKRRAYESMVTGLCSMKRPQEAETLVEKM 225
           +    K        ++   +  +    P V+     +  +++  C + R    +++V +M
Sbjct: 149 ISVYGKSELLDKAFSTLEYMKSVSDCKPDVFT----FTVLISCCCKLGRFDLVKSIVLEM 208

Query: 226 RSKGITPSAYEYRSIIYAYGTLGLFEDMQRSLKQMENEDYALDTICS-NMVLSSYGAHNK 285
              G+  S   Y +II  YG  G+FE+M+  L  M  +  +L  +C+ N ++ SYG    
Sbjct: 209 SYLGVGCSTVTYNTIIDGYGKAGMFEEMESVLADMIEDGDSLPDVCTLNSIIGSYGNGRN 268

Query: 286 LADMVLWLQRMKTSAHLNFSVRTYNSVLKSCPK 318
           +  M  W  R +    +   + T+N ++ S  K
Sbjct: 269 MRKMESWYSRFQLMG-VQPDITTFNILILSFGK 283

BLAST of ClCG02G011140 vs. Swiss-Prot
Match: PP327_ARATH (Pentatricopeptide repeat-containing protein At4g20090 OS=Arabidopsis thaliana GN=EMB1025 PE=3 SV=1)

HSP 1 Score: 54.3 bits (129), Expect = 4.3e-06
Identity = 53/238 (22.27%), Postives = 107/238 (44.96%), Query Frame = 1

Query: 89  NIVSPHTAQPGLC-----SVALTLYSRITEASWFTWNSKLVADLIAFLDQNGLYSESEAL 148
           N V+ +T   GLC       A++L  R+  +     N      LI  L +    +++  L
Sbjct: 291 NEVTYNTLIHGLCLKGKLDKAVSLLERMVSSKCIP-NDVTYGTLINGLVKQRRATDAVRL 350

Query: 149 MSETISKLGFQERKLVNFYSQLVESQSKHGSERGCGNSYARLLELLYNSPSVYVKRRAYE 208
           +S ++ + G+   +  + YS L+    K G      + + ++ E     P++ V    Y 
Sbjct: 351 LS-SMEERGYHLNQ--HIYSVLISGLFKEGKAEEAMSLWRKMAEKGCK-PNIVV----YS 410

Query: 209 SMVTGLCSMKRPQEAETLVEKMRSKGITPSAYEYRSIIYAYGTLGLFEDMQRSLKQMENE 268
            +V GLC   +P EA+ ++ +M + G  P+AY Y S++  +   GL E+  +  K+M+  
Sbjct: 411 VLVDGLCREGKPNEAKEILNRMIASGCLPNAYTYSSLMKGFFKTGLCEEAVQVWKEMDKT 470

Query: 269 DYALDTICSNMVLSSYGAHNKLADMVLWLQRMKTSAHLNFSVRTYNSVLKSCPKITSM 322
             + +  C ++++       ++ + ++   +M T   +      Y+S++K    I SM
Sbjct: 471 GCSRNKFCYSVLIDGLCGVGRVKEAMMVWSKMLTIG-IKPDTVAYSSIIKGLCGIGSM 518


HSP 2 Score: 37.7 bits (86), Expect = 4.2e-01
Identity = 32/119 (26.89%), Postives = 56/119 (47.06%), Query Frame = 1

Query: 202 YESMVTGLCSMKRPQEAETLVEKMRSKGITPSAYEYRSIIYAYGTLGLFEDMQRSLKQME 261
           Y +++ GLC  +R  EA  L+++M+S+G +PS   Y  +I      G   D+ R  K ++
Sbjct: 225 YCTLMDGLCKEERIDEAVLLLDEMQSEGCSPSPVIYNVLIDGLCKKG---DLTRVTKLVD 284

Query: 262 N---EDYALDTICSNMVLSSYGAHNKLADMVLWLQRMKTSAHLNFSVRTYNSVLKSCPK 318
           N   +    + +  N ++       KL   V  L+RM +S  +   V TY +++    K
Sbjct: 285 NMFLKGCVPNEVTYNTLIHGLCLKGKLDKAVSLLERMVSSKCIPNDV-TYGTLINGLVK 339

BLAST of ClCG02G011140 vs. TrEMBL
Match: Q5DMV7_CUCME (Pentatricopeptide (PPR) repeat protein-like OS=Cucumis melo GN=PPR PE=4 SV=1)

HSP 1 Score: 843.6 bits (2178), Expect = 1.2e-241
Identity = 423/486 (87.04%), Postives = 452/486 (93.00%), Query Frame = 1

Query: 1   MELRLCPPPYVIGDGVRLFLKPLKRHDGFRSYPFVPNLQVKCT-LTKQTHRFLSTLATTA 60
           MELRLCPPPYVIGDGVRLFL+PLKR DGFRSYPF+PNLQVKCT LTKQTHRFLSTL+TT 
Sbjct: 1   MELRLCPPPYVIGDGVRLFLQPLKRLDGFRSYPFLPNLQVKCTTLTKQTHRFLSTLSTTG 60

Query: 61  ATGDHSATNRLIRKFVASSPKSITLNVLSNIVSPHTAQPGLCSVALTLYSRITEASWFTW 120
           ATGD SATNRLIRKFVASSPKSITL+VLSNIVS HT QP LCS ALTLYSRITEASWFTW
Sbjct: 61  ATGDQSATNRLIRKFVASSPKSITLSVLSNIVSTHTPQPELCSAALTLYSRITEASWFTW 120

Query: 121 NSKLVADLIAFLDQNGLYSESEALMSETISKLGFQERKLVNFYSQLVESQSKHGSERGCG 180
           NSKLVADL+AFL QNGLYSESEAL+SE ISKLG QERKLVNFYSQLVESQSKHG ERG G
Sbjct: 121 NSKLVADLVAFLGQNGLYSESEALISEAISKLGSQERKLVNFYSQLVESQSKHGFERGFG 180

Query: 181 NSYARLLELLYNSPSVYVKRRAYESMVTGLCSMKRPQEAETLVEKMRSKGITPSAYEYRS 240
           +SY+RL ELLYNSPSVYVKRRAYESMVTGLCSMKRP EAE+LV++MRSKGITP+AYEYRS
Sbjct: 181 DSYSRLFELLYNSPSVYVKRRAYESMVTGLCSMKRPHEAESLVKEMRSKGITPTAYEYRS 240

Query: 241 IIYAYGTLGLFEDMQRSLKQMENEDYALDTICSNMVLSSYGAHNKLADMVLWLQRMKTSA 300
           IIYAYGTLGLFE+M+RSLKQMEN++  LDT+CSNMVLSSYGAHNKL DM+LWLQRMKTS+
Sbjct: 241 IIYAYGTLGLFEEMKRSLKQMENDNIELDTVCSNMVLSSYGAHNKLGDMLLWLQRMKTSS 300

Query: 301 HLNFSVRTYNSVLKSCPKITSMLQDHKSSKFPVLIEGLIAVLDGDEEALLVEELVVGSSV 360
           H   SVRTYNSVL SCPKITSMLQDHKS   PVLIE LIA+LDGDEEALLV+EL+VGSSV
Sbjct: 301 HCKSSVRTYNSVLNSCPKITSMLQDHKSGDLPVLIEDLIAILDGDEEALLVKELLVGSSV 360

Query: 361 LKEVMVWDAMELKLDLHGAHVGAAYVIMLQWMKEMRLNFEDESCVIPAQVTLISGSGNHS 420
           L E+MVWDAMELKLDLHGAHVGAAYVIMLQW+KEMRLNFEDES VIPAQVTLI GSG HS
Sbjct: 361 LNEIMVWDAMELKLDLHGAHVGAAYVIMLQWIKEMRLNFEDESNVIPAQVTLICGSGKHS 420

Query: 421 IVRGESPVKALIKEIIVRTESPLRVDRKNTGCFISKGKAVKNWLCSLPEKREIVANRKCH 480
           IVRGESPVKALIKEI+VRTESPLR+DRKNTGCFISKGKAVKNWLCSLP K++ VANRKC+
Sbjct: 421 IVRGESPVKALIKEIMVRTESPLRIDRKNTGCFISKGKAVKNWLCSLPGKKDTVANRKCY 480

Query: 481 NRPFTP 486
            RPF P
Sbjct: 481 KRPFIP 486

BLAST of ClCG02G011140 vs. TrEMBL
Match: M4R4K5_CUCME (PPR OS=Cucumis melo GN=PPR PE=4 SV=1)

HSP 1 Score: 842.4 bits (2175), Expect = 2.7e-241
Identity = 423/486 (87.04%), Postives = 451/486 (92.80%), Query Frame = 1

Query: 1   MELRLCPPPYVIGDGVRLFLKPLKRHDGFRSYPFVPNLQVKCT-LTKQTHRFLSTLATTA 60
           MELRLCPPPYVIGDGVRL L+PLKR DGFRSYPF+PNLQVKCT LTKQTHRFLSTL+TTA
Sbjct: 1   MELRLCPPPYVIGDGVRLLLQPLKRLDGFRSYPFLPNLQVKCTTLTKQTHRFLSTLSTTA 60

Query: 61  ATGDHSATNRLIRKFVASSPKSITLNVLSNIVSPHTAQPGLCSVALTLYSRITEASWFTW 120
           ATGD SATNRLIRKFVASSPKSITL+VLSNIVS HT QP LCS ALTLYSRITEASWFTW
Sbjct: 61  ATGDQSATNRLIRKFVASSPKSITLSVLSNIVSTHTPQPELCSAALTLYSRITEASWFTW 120

Query: 121 NSKLVADLIAFLDQNGLYSESEALMSETISKLGFQERKLVNFYSQLVESQSKHGSERGCG 180
           NSKLVADL+AFL QNGLYSESEAL+SE ISKLG QERKLVNFYSQLVESQSKHG ERG G
Sbjct: 121 NSKLVADLVAFLGQNGLYSESEALISEAISKLGSQERKLVNFYSQLVESQSKHGFERGFG 180

Query: 181 NSYARLLELLYNSPSVYVKRRAYESMVTGLCSMKRPQEAETLVEKMRSKGITPSAYEYRS 240
           +SY+RL ELLYNSPSVYVKRRAYESMVTGLCSMKRP EAE+LV++MRSKGITP+AYEYRS
Sbjct: 181 DSYSRLFELLYNSPSVYVKRRAYESMVTGLCSMKRPHEAESLVKEMRSKGITPTAYEYRS 240

Query: 241 IIYAYGTLGLFEDMQRSLKQMENEDYALDTICSNMVLSSYGAHNKLADMVLWLQRMKTSA 300
           IIYAYGTLGLFE+M+RSLKQMEN++  LDT+CSNMVLSSYGAHNKL DM+LWLQRMKTS 
Sbjct: 241 IIYAYGTLGLFEEMKRSLKQMENDNIELDTVCSNMVLSSYGAHNKLGDMLLWLQRMKTSP 300

Query: 301 HLNFSVRTYNSVLKSCPKITSMLQDHKSSKFPVLIEGLIAVLDGDEEALLVEELVVGSSV 360
           H   SVRTYNSVL SCPKITSMLQDHKS   PVLIE LIA+LDGDEEALLV+EL+VGSSV
Sbjct: 301 HCKSSVRTYNSVLNSCPKITSMLQDHKSGDLPVLIEDLIAILDGDEEALLVKELLVGSSV 360

Query: 361 LKEVMVWDAMELKLDLHGAHVGAAYVIMLQWMKEMRLNFEDESCVIPAQVTLISGSGNHS 420
           L E+MVWDAMELKLDLHGAHVGAAYVIMLQW+KEMRLNFEDES VIPAQVTLI GSG HS
Sbjct: 361 LNEIMVWDAMELKLDLHGAHVGAAYVIMLQWIKEMRLNFEDESYVIPAQVTLICGSGKHS 420

Query: 421 IVRGESPVKALIKEIIVRTESPLRVDRKNTGCFISKGKAVKNWLCSLPEKREIVANRKCH 480
           IVRGESPVKALIKEI+VRTESPLR+DRKNTGCFISKGKAVKNWLCSLP K++ VANRKC+
Sbjct: 421 IVRGESPVKALIKEIMVRTESPLRIDRKNTGCFISKGKAVKNWLCSLPGKKDTVANRKCY 480

Query: 481 NRPFTP 486
            RPF P
Sbjct: 481 KRPFIP 486

BLAST of ClCG02G011140 vs. TrEMBL
Match: M5VLA1_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa021547mg PE=4 SV=1)

HSP 1 Score: 491.1 bits (1263), Expect = 1.5e-135
Identity = 252/425 (59.29%), Postives = 326/425 (76.71%), Query Frame = 1

Query: 40  VKCTLTKQTHRFLSTLATTAATGDHSATNRLIRKFVASSPKSITLNVLSNIVSPHTAQPG 99
           ++C +TKQ  RFL+ LA  A   D   TN+LI KF+ SS KSI LN LS ++SP T  P 
Sbjct: 30  IQCAVTKQGQRFLTKLAANAR--DAKVTNKLIAKFLTSSTKSIALNTLSYLLSPDTTLPH 89

Query: 100 LCSVALTLYSRITEASWFTWNSKLVADLIAFLDQNGLYSESEALMSETISKLGFQERKLV 159
           L S+AL  YS+ITEASWF WN KLVA L+A LD+ G ++E+E L+SETISKLG +ER+L 
Sbjct: 90  LSSLALPFYSKITEASWFEWNPKLVAALVALLDKQGQHNEAEVLISETISKLGSRERELA 149

Query: 160 NFYSQLVESQSKHGSERGCGNSYARLLELLYNSPSVYVKRRAYESMVTGLCSMKRPQEAE 219
            F+ QLVES SK  S+ G  +SY+ L +LL+NS SVYVK RA+ESMV+GLC M RP+EA+
Sbjct: 150 LFHCQLVESHSKLSSKHGFDSSYSYLYQLLHNSSSVYVKNRAFESMVSGLCEMDRPREAD 209

Query: 220 TLVEKMRSKGITPSAYEYRSIIYAYGTLGLFEDMQRSLKQMENEDYALDTICSNMVLSSY 279
            L+E+MR +G+ PS +E+RS++Y YG LGLFEDM + ++QMEN+  A+DTICSNMVLSSY
Sbjct: 210 NLIEEMRVRGLKPSVFEFRSVVYGYGRLGLFEDMLKVVEQMENQGIAIDTICSNMVLSSY 269

Query: 280 GAHNKLADMVLWLQRMKTSAHLNFSVRTYNSVLKSCPKITSMLQDHKSSKFPVLIEGLIA 339
           GAH++LA M++WL++MK S  L FS+RTYNSVL SC  I +MLQ+ K   FP  IE L  
Sbjct: 270 GAHSELAAMLVWLRKMK-SLSLPFSIRTYNSVLNSCLTIMAMLQEPKD--FPCSIEELNG 329

Query: 340 VLDGDEEALLVEELVVGSSVLKEVMVWDAMELKLDLHGAHVGAAYVIMLQWMKEMRLNFE 399
           VL+GD EALLV+EL V S+VL EVMVW+ +E KLDLHG H+G+AY+I+L+W + MR  F 
Sbjct: 330 VLNGD-EALLVKEL-VESTVLDEVMVWEPLEAKLDLHGMHLGSAYLILLEWFEAMRCRFN 389

Query: 400 DESCVIPAQVTLISGSGNHSIVRGESPVKALIKEIIVRTESPLRVDRKNTGCFISKGKAV 459
               VIPA+V +I GSG HS VRGESPVK L+K++++R ESP+R+DRKN GCF++KG+AV
Sbjct: 390 SGKDVIPAEVVVICGSGKHSSVRGESPVKGLVKQMMLRMESPMRIDRKNVGCFVAKGRAV 447

Query: 460 KNWLC 465
           K+WLC
Sbjct: 450 KDWLC 447

BLAST of ClCG02G011140 vs. TrEMBL
Match: W9QLA9_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_006076 PE=4 SV=1)

HSP 1 Score: 485.7 bits (1249), Expect = 6.5e-134
Identity = 249/425 (58.59%), Postives = 327/425 (76.94%), Query Frame = 1

Query: 40  VKCTLTKQTHRFLSTLATTAATGDHSATNRLIRKFVASSPKSITLNVLSNIVSPHTAQPG 99
           ++C LTKQ HRFLSTL+  A  G+ SA N+LI KFVASSPKSI+LN LS+++SP T    
Sbjct: 100 IQCALTKQGHRFLSTLSINA--GNASAANKLIGKFVASSPKSISLNALSHLLSPDTTHTH 159

Query: 100 LCSVALTLYSRITEASWFTWNSKLVADLIAFLDQNGLYSESEALMSETISKLGFQERKLV 159
           L S +L LYS+I EASWF ++ KLVA L A LD+ G YSE+EAL++E +SKLG ++R+L 
Sbjct: 160 LTSHSLHLYSKIREASWFVYSPKLVAALAALLDKQGRYSEAEALIAEAVSKLGHRQRELA 219

Query: 160 NFYSQLVESQSKHGSERGCGNSYARLLELLYNSPSVYVKRRAYESMVTGLCSMKRPQEAE 219
            FY  LVES SK  S+ G  +SYA L +LL +S S YVK RA+E+MV  LC+M RP EAE
Sbjct: 220 VFYCSLVESHSKQSSKHGFDSSYAYLYQLLRDSSSAYVKCRAFETMVGALCTMDRPCEAE 279

Query: 220 TLVEKMRSKGITPSAYEYRSIIYAYGTLGLFEDMQRSLKQMENEDYALDTICSNMVLSSY 279
           +L+E+MR KG+ PS +E+RS++Y YG LGL+EDM R++ QME E   +DTICSNMVLSSY
Sbjct: 280 SLMEEMRHKGLKPSVFEFRSLVYGYGRLGLWEDMLRTVNQMEIEGLVIDTICSNMVLSSY 339

Query: 280 GAHNKLADMVLWLQRMKTSAHLNFSVRTYNSVLKSCPKITSMLQDHKSSKFPVLIEGLIA 339
           GAHN+L  MVLWLQ+M+TS+ + FS+RTYNSVL  CP IT+MLQD K    P+ +  L A
Sbjct: 340 GAHNELQQMVLWLQKMRTSS-IPFSIRTYNSVLNWCPTITAMLQDLKD--IPLSMYELNA 399

Query: 340 VLDGDEEALLVEELVVGSSVLKEVMVWDAMELKLDLHGAHVGAAYVIMLQWMKEMRLNFE 399
            L GDE  L++E  +VGSSVL+EV+VWD++E+KLDLHG H+G+AY+IML+WM+EM   F 
Sbjct: 400 TLRGDEGLLVME--LVGSSVLEEVLVWDSLEVKLDLHGMHLGSAYLIMLEWMEEMTRRFN 459

Query: 400 DESCVIPAQVTLISGSGNHSIVRGESPVKALIKEIIVRTESPLRVDRKNTGCFISKGKAV 459
           D +  IPA+V ++ GSG HS VRG SPVK L+KE++V+ +SP+++DRKN GCF++KGK V
Sbjct: 460 DGNHGIPAEVVVVCGSGKHSNVRGVSPVKILVKEMMVQMKSPMKIDRKNAGCFLAKGKTV 517

Query: 460 KNWLC 465
           ++WLC
Sbjct: 520 RDWLC 517

BLAST of ClCG02G011140 vs. TrEMBL
Match: A0A061DXY1_THECC (Pentatricopeptide (PPR) repeat-containing protein, putative OS=Theobroma cacao GN=TCM_006548 PE=4 SV=1)

HSP 1 Score: 484.2 bits (1245), Expect = 1.9e-133
Identity = 241/420 (57.38%), Postives = 315/420 (75.00%), Query Frame = 1

Query: 44  LTKQTHRFLSTLATTAATGDHSATNRLIRKFVASSPKSITLNVLSNIVSPHTAQPGLCSV 103
           LTKQ HRF S+LA TA   D +  NRLI+KFVASSPKSI LN LS+++SP  + P L ++
Sbjct: 34  LTKQGHRFFSSLAATADVNDPATANRLIKKFVASSPKSIALNALSHLLSPRNSHPHLSAL 93

Query: 104 ALTLYSRITEASWFTWNSKLVADLIAFLDQNGLYSESEALMSETISKLGFQERKLVNFYS 163
           A  LY++I+E SW+ WN KLVA+LIA L + G Y ESEAL+S+ +SKL F+ER LV FY 
Sbjct: 94  AFPLYTKISETSWYNWNPKLVAELIALLVKQGRYDESEALISQAVSKLKFRERDLVQFYC 153

Query: 164 QLVESQSKHGSERGCGNSYARLLELLYNSPSVYVKRRAYESMVTGLCSMKRPQEAETLVE 223
             +ES SKH S+ G  ++Y  L EL+ NS SVYVKR+ Y+SMV+ LC M RP EAE LVE
Sbjct: 154 NWIESCSKHNSKEGFNDAYCYLSELICNSSSVYVKRQGYKSMVSSLCEMDRPNEAENLVE 213

Query: 224 KMRSKGITPSAYEYRSIIYAYGTLGLFEDMQRSLKQMENEDYALDTICSNMVLSSYGAHN 283
           +MR  G+TP+ +E+R I Y YG LGLFEDM+R + +ME E + +DTICSNMVLSSYGA+N
Sbjct: 214 EMRKNGLTPTLFEFRFISYGYGQLGLFEDMERMVCEMEIEGFEVDTICSNMVLSSYGAYN 273

Query: 284 KLADMVLWLQRMKTSAHLNFSVRTYNSVLKSCPKITSMLQDHKSSKFPVLIEGLIAVLDG 343
             + MV WLQ+MKT   + FS+RTYNSVL SCP+I S++Q   S    +   G +A +  
Sbjct: 274 AFSKMVPWLQKMKT-LQIPFSIRTYNSVLNSCPEIMSLVQGLDSVPLSL---GELAKILN 333

Query: 344 DEEALLVEELVVGSSVLKEVMVWDAMELKLDLHGAHVGAAYVIMLQWMKEMRLNFEDESC 403
           ++EALLV+ELV  SSVL E M W+  E KLDLHG H+G+AY+IMLQW++EM+  F+ E C
Sbjct: 334 EDEALLVQELVKSSSVLDEAMEWNGSEGKLDLHGMHLGSAYLIMLQWIEEMKCRFKVEEC 393

Query: 404 VIPAQVTLISGSGNHSIVRGESPVKALIKEIIVRTESPLRVDRKNTGCFISKGKAVKNWL 463
           VIPAQ+T++ GSG HS VRGESPVK L+++++V+ +SP+++DRKN GCFI+KG+ VKNWL
Sbjct: 394 VIPAQITIVCGSGKHSSVRGESPVKTLMRKMMVKMKSPMKIDRKNIGCFIAKGQVVKNWL 449

BLAST of ClCG02G011140 vs. TAIR10
Match: AT2G17033.2 (AT2G17033.2 pentatricopeptide (PPR) repeat-containing protein)

HSP 1 Score: 448.7 bits (1153), Expect = 4.4e-126
Identity = 232/421 (55.11%), Postives = 303/421 (71.97%), Query Frame = 1

Query: 44  LTKQTHRFLSTLATTAATGDHSATNRLIRKFVASSPKSITLNVLSNIVSPHTAQPGLCSV 103
           L K   RFLS+L++ A  GD SA NR I+KFVA+SPKS+ LNVLS+++S  T+ P L   
Sbjct: 89  LMKHGDRFLSSLSSPALAGDPSAINRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFF 148

Query: 104 ALTLYSRITEASWFTWNSKLVADLIAFLDQNGLYSESEALMSETISKLGFQERKLVNFYS 163
           AL+LYS ITEASWF WN KL+A+LIA L++   + ESE L+S  +S+L   ER    F  
Sbjct: 149 ALSLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLC 208

Query: 164 QLVESQSKHGSERGCGNSYARLLELLYNSPSVYVKRRAYESMVTGLCSMKRPQEAETLVE 223
            LVES SK GS +G   +  RL E++  S SVYVK +AY+SMV+GLC+M +P +AE ++E
Sbjct: 209 NLVESNSKQGSIQGFSEASFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIE 268

Query: 224 KMRSKGITPSAYEYRSIIYAYGTLGLFEDMQRSLKQMENEDYALDTICSNMVLSSYGAHN 283
           +MR + I P  +EY+S++Y YG LGLF+DM R + +M  E + +DT+CSNMVLSSYGAH+
Sbjct: 269 EMRMEKIKPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAHD 328

Query: 284 KLADMVLWLQRMKTSAHLNFSVRTYNSVLKSCPKITSMLQDHKSSKFPVLIEGLIAVLDG 343
            L  M  WLQ++K   ++ FS+RTYNSVL SCP I SML+D  S   PV +  L   L+ 
Sbjct: 329 ALPQMGSWLQKLK-GFNVPFSIRTYNSVLNSCPTIISMLKDLDSC--PVSLSELRTFLN- 388

Query: 344 DEEALLVEELVVGSSVLKEVMVWDAMELKLDLHGAHVGAAYVIMLQWMKEMRLNFEDESC 403
           ++EALLV EL   SSVL E + W+A+E KLDLHG H+ ++Y+I+LQWM E RL F +E C
Sbjct: 389 EDEALLVHEL-TQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSEEKC 448

Query: 404 VIPAQVTLISGSGNHSIVRGESPVKALIKEIIVRTESPLRVDRKNTGCFISKGKAVKNWL 463
           VIPA++ ++SGSG HS VRGESPVKAL+K+I+VRT SP+R+DRKN G FI+KGK VK WL
Sbjct: 449 VIPAEIVVVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKTVKEWL 504

Query: 464 C 465
           C
Sbjct: 509 C 504

BLAST of ClCG02G011140 vs. TAIR10
Match: AT1G74750.1 (AT1G74750.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 61.2 bits (147), Expect = 2.0e-09
Identity = 94/431 (21.81%), Postives = 172/431 (39.91%), Query Frame = 1

Query: 44  LTKQTHRFLSTLATTAATGDHSATNRLIRKFVAS--SPKSITLNVLSNIVSPHTAQPGLC 103
           L+  T  +   +      G   A +RL  + V    +P  +T N++   ++ H A+    
Sbjct: 460 LSPDTFTYSVIINCLGKAGHLPAAHRLFCEMVGQGCTPNLVTFNIM---IALH-AKARNY 519

Query: 104 SVALTLYSRITEASWFTWNSKLVADLIAFLDQNGLYSESEALMSETISKLGFQERKLVNF 163
             AL LY  +  A  F  +    + ++  L   G   E+E + +E   K    +  +   
Sbjct: 520 ETALKLYRDMQNAG-FQPDKVTYSIVMEVLGHCGFLEEAEGVFAEMQRKNWVPDEPV--- 579

Query: 164 YSQLVESQSKHGSERGCGNSYARLLE--LLYNSPSVYVKRRAYESMVTGLCSMKRPQEAE 223
           Y  LV+   K G+       Y  +L+  L  N P+         S+++    + R  EA 
Sbjct: 580 YGLLVDLWGKAGNVDKAWQWYQAMLQAGLRPNVPTC-------NSLLSTFLRVHRMSEAY 639

Query: 224 TLVEKMRSKGITPSAYEYRSIIYAYGTLGLFEDMQRSLKQMENEDYALDTICSNMVLSSY 283
            L++ M + G+ PS   Y  ++              S       ++ +      M +S +
Sbjct: 640 NLLQSMLALGLHPSLQTYTLLL--------------SCCTDARSNFDMGFCGQLMAVSGH 699

Query: 284 GAHNKLADMVLWLQRMKTSAHLNFSVRTYNSVLKSCPKITSMLQDHKSSKFPVLIEGLIA 343
            AH       ++L +M  +      VR + S       +  M  + + SK   L++ ++ 
Sbjct: 700 PAH-------MFLLKMPPAGPDGQKVRDHVSNF-----LDFMHSEDRESKRG-LMDAVVD 759

Query: 344 VLDGD---EEALLVEELVVGSSVLKEVMVWDAMELKL-DLHGAHVGAAYVIM---LQWMK 403
            L      EEA  V E+  G +V  + +   +    L +LH    G A + +   L W +
Sbjct: 760 FLHKSGLKEEAGSVWEVAAGKNVYPDALREKSYSYWLINLHVMSEGTAVIALSRTLAWFR 819

Query: 404 EMRLNFEDESCVIPAQVTLISGSGNHSIVRGESPVKALIKEIIVRTESPLRVDRKNTGCF 463
           +  L   D     P+++ +++G G  S V G S V+  ++E++     P   +  N+GCF
Sbjct: 820 KQMLVSGD----CPSRIDIVTGWGRRSRVTGTSMVRQAVEELLNIFNFPFFTENGNSGCF 844

BLAST of ClCG02G011140 vs. TAIR10
Match: AT3G53170.1 (AT3G53170.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 55.1 bits (131), Expect = 1.4e-07
Identity = 59/273 (21.61%), Postives = 111/273 (40.66%), Query Frame = 1

Query: 46  KQTHRFLSTLATTAATGDHSATNRLIRKFVASSPKSITLNVLSNIVSPHTAQPGLCSVAL 105
           K+  R L T A  A  G     N    K++   PK++ L  L   +  +  Q      AL
Sbjct: 79  KELSRILRTDA--AVKGIERKANS--EKYLTLWPKAV-LEALDEAIKENRWQS-----AL 138

Query: 106 TLYSRITEASWFTWNSKLVADLIAFLDQNGLYSESEALMSETISKLGFQERKLVNFYSQL 165
            +++ + +  W+    K    L   L  N    +  +L+ E +   G   +  ++ Y+ L
Sbjct: 139 KIFNLLRKQHWYEPRCKTYTKLFKVLG-NCKQPDQASLLFEVMLSEGL--KPTIDVYTSL 198

Query: 166 VESQSKHGSERGCGNSYARLLELLYNSPSVYVKRRAYESMVTGLCSMKRPQEAETLVEKM 225
           +    K        ++   +  +    P V+     +  +++  C + R    +++V +M
Sbjct: 199 ISVYGKSELLDKAFSTLEYMKSVSDCKPDVFT----FTVLISCCCKLGRFDLVKSIVLEM 258

Query: 226 RSKGITPSAYEYRSIIYAYGTLGLFEDMQRSLKQMENEDYALDTICS-NMVLSSYGAHNK 285
              G+  S   Y +II  YG  G+FE+M+  L  M  +  +L  +C+ N ++ SYG    
Sbjct: 259 SYLGVGCSTVTYNTIIDGYGKAGMFEEMESVLADMIEDGDSLPDVCTLNSIIGSYGNGRN 318

Query: 286 LADMVLWLQRMKTSAHLNFSVRTYNSVLKSCPK 318
           +  M  W  R +    +   + T+N ++ S  K
Sbjct: 319 MRKMESWYSRFQLMG-VQPDITTFNILILSFGK 333

BLAST of ClCG02G011140 vs. TAIR10
Match: AT4G20090.1 (AT4G20090.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 54.3 bits (129), Expect = 2.4e-07
Identity = 53/238 (22.27%), Postives = 107/238 (44.96%), Query Frame = 1

Query: 89  NIVSPHTAQPGLC-----SVALTLYSRITEASWFTWNSKLVADLIAFLDQNGLYSESEAL 148
           N V+ +T   GLC       A++L  R+  +     N      LI  L +    +++  L
Sbjct: 291 NEVTYNTLIHGLCLKGKLDKAVSLLERMVSSKCIP-NDVTYGTLINGLVKQRRATDAVRL 350

Query: 149 MSETISKLGFQERKLVNFYSQLVESQSKHGSERGCGNSYARLLELLYNSPSVYVKRRAYE 208
           +S ++ + G+   +  + YS L+    K G      + + ++ E     P++ V    Y 
Sbjct: 351 LS-SMEERGYHLNQ--HIYSVLISGLFKEGKAEEAMSLWRKMAEKGCK-PNIVV----YS 410

Query: 209 SMVTGLCSMKRPQEAETLVEKMRSKGITPSAYEYRSIIYAYGTLGLFEDMQRSLKQMENE 268
            +V GLC   +P EA+ ++ +M + G  P+AY Y S++  +   GL E+  +  K+M+  
Sbjct: 411 VLVDGLCREGKPNEAKEILNRMIASGCLPNAYTYSSLMKGFFKTGLCEEAVQVWKEMDKT 470

Query: 269 DYALDTICSNMVLSSYGAHNKLADMVLWLQRMKTSAHLNFSVRTYNSVLKSCPKITSM 322
             + +  C ++++       ++ + ++   +M T   +      Y+S++K    I SM
Sbjct: 471 GCSRNKFCYSVLIDGLCGVGRVKEAMMVWSKMLTIG-IKPDTVAYSSIIKGLCGIGSM 518


HSP 2 Score: 37.7 bits (86), Expect = 2.4e-02
Identity = 32/119 (26.89%), Postives = 56/119 (47.06%), Query Frame = 1

Query: 202 YESMVTGLCSMKRPQEAETLVEKMRSKGITPSAYEYRSIIYAYGTLGLFEDMQRSLKQME 261
           Y +++ GLC  +R  EA  L+++M+S+G +PS   Y  +I      G   D+ R  K ++
Sbjct: 225 YCTLMDGLCKEERIDEAVLLLDEMQSEGCSPSPVIYNVLIDGLCKKG---DLTRVTKLVD 284

Query: 262 N---EDYALDTICSNMVLSSYGAHNKLADMVLWLQRMKTSAHLNFSVRTYNSVLKSCPK 318
           N   +    + +  N ++       KL   V  L+RM +S  +   V TY +++    K
Sbjct: 285 NMFLKGCVPNEVTYNTLIHGLCLKGKLDKAVSLLERMVSSKCIPNDV-TYGTLINGLVK 339

BLAST of ClCG02G011140 vs. TAIR10
Match: AT1G18900.3 (AT1G18900.3 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 52.4 bits (124), Expect = 9.3e-07
Identity = 93/432 (21.53%), Postives = 174/432 (40.28%), Query Frame = 1

Query: 44  LTKQTHRFLSTLATTAATGDHSATNRLIRKFV--ASSPKSITLNVLSNIVSPHTAQPGLC 103
           L+  T  +   +      G   A ++L  + V    +P  +T N++ ++     A+    
Sbjct: 465 LSPDTFTYSVIINCLGKAGHLPAAHKLFCEMVDQGCTPNLVTYNIMMDL----HAKARNY 524

Query: 104 SVALTLYSRITEASWFTWNSKLVADLIAFLDQNGLYSESEALMSETISKLGFQERKLVNF 163
             AL LY  +  A  F  +    + ++  L   G   E+EA+ +E   K    +  +   
Sbjct: 525 QNALKLYRDMQNAG-FEPDKVTYSIVMEVLGHCGYLEEAEAVFTEMQQKNWIPDEPV--- 584

Query: 164 YSQLVESQSKHGSERGCGNSYARLLE--LLYNSPSVYVKRRAYESMVTGLCSMKRPQEAE 223
           Y  LV+   K G+       Y  +L   L  N P+         S+++    + +  EA 
Sbjct: 585 YGLLVDLWGKAGNVEKAWQWYQAMLHAGLRPNVPTC-------NSLLSTFLRVNKIAEAY 644

Query: 224 TLVEKMRSKGITPSAYEYRSIIYAYGTLGLFEDMQRSLKQMENEDYALDTICSNMVLSS- 283
            L++ M + G+ PS   Y +++ +  T G      RS   M          C  ++ S+ 
Sbjct: 645 ELLQNMLALGLRPSLQTY-TLLLSCCTDG------RSKLDMG--------FCGQLMASTG 704

Query: 284 YGAHNKLADMVLWLQRMKTSAHLNFSVRTYNSVLKSCPKITSMLQDHKSSKFPVLIEGLI 343
           + AH       ++L +M  +     +VR + +       +  M  + + SK   L++ ++
Sbjct: 705 HPAH-------MFLLKMPAAGPDGENVRNHANNF-----LDLMHSEDRESKRG-LVDAVV 764

Query: 344 AVLDGD---EEALLVEELVVGSSVLKEVMVWDAMELKL-DLHGAHVGAAYVIM---LQWM 403
             L      EEA  V E+    +V  + +   +    L +LH    G A   +   L W 
Sbjct: 765 DFLHKSGQKEEAGSVWEVAAQKNVFPDALREKSCSYWLINLHVMSEGTAVTALSRTLAWF 824

Query: 404 KEMRLNFEDESCVIPAQVTLISGSGNHSIVRGESPVKALIKEIIVRTESPLRVDRKNTGC 463
           ++  L     S   P+++ +++G G  S V G S V+  ++E++    SP   +  N+GC
Sbjct: 825 RKQML----ASGTCPSRIDIVTGWGRRSRVTGTSMVRQAVEELLNIFGSPFFTESGNSGC 849

BLAST of ClCG02G011140 vs. NCBI nr
Match: gi|659119236|ref|XP_008459547.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g17033 [Cucumis melo])

HSP 1 Score: 843.6 bits (2178), Expect = 1.8e-241
Identity = 423/486 (87.04%), Postives = 452/486 (93.00%), Query Frame = 1

Query: 1   MELRLCPPPYVIGDGVRLFLKPLKRHDGFRSYPFVPNLQVKCT-LTKQTHRFLSTLATTA 60
           MELRLCPPPYVIGDGVRLFL+PLKR DGFRSYPF+PNLQVKCT LTKQTHRFLSTL+TT 
Sbjct: 1   MELRLCPPPYVIGDGVRLFLQPLKRLDGFRSYPFLPNLQVKCTTLTKQTHRFLSTLSTTG 60

Query: 61  ATGDHSATNRLIRKFVASSPKSITLNVLSNIVSPHTAQPGLCSVALTLYSRITEASWFTW 120
           ATGD SATNRLIRKFVASSPKSITL+VLSNIVS HT QP LCS ALTLYSRITEASWFTW
Sbjct: 61  ATGDQSATNRLIRKFVASSPKSITLSVLSNIVSTHTPQPELCSAALTLYSRITEASWFTW 120

Query: 121 NSKLVADLIAFLDQNGLYSESEALMSETISKLGFQERKLVNFYSQLVESQSKHGSERGCG 180
           NSKLVADL+AFL QNGLYSESEAL+SE ISKLG QERKLVNFYSQLVESQSKHG ERG G
Sbjct: 121 NSKLVADLVAFLGQNGLYSESEALISEAISKLGSQERKLVNFYSQLVESQSKHGFERGFG 180

Query: 181 NSYARLLELLYNSPSVYVKRRAYESMVTGLCSMKRPQEAETLVEKMRSKGITPSAYEYRS 240
           +SY+RL ELLYNSPSVYVKRRAYESMVTGLCSMKRP EAE+LV++MRSKGITP+AYEYRS
Sbjct: 181 DSYSRLFELLYNSPSVYVKRRAYESMVTGLCSMKRPHEAESLVKEMRSKGITPTAYEYRS 240

Query: 241 IIYAYGTLGLFEDMQRSLKQMENEDYALDTICSNMVLSSYGAHNKLADMVLWLQRMKTSA 300
           IIYAYGTLGLFE+M+RSLKQMEN++  LDT+CSNMVLSSYGAHNKL DM+LWLQRMKTS+
Sbjct: 241 IIYAYGTLGLFEEMKRSLKQMENDNIELDTVCSNMVLSSYGAHNKLGDMLLWLQRMKTSS 300

Query: 301 HLNFSVRTYNSVLKSCPKITSMLQDHKSSKFPVLIEGLIAVLDGDEEALLVEELVVGSSV 360
           H   SVRTYNSVL SCPKITSMLQDHKS   PVLIE LIA+LDGDEEALLV+EL+VGSSV
Sbjct: 301 HCKSSVRTYNSVLNSCPKITSMLQDHKSGDLPVLIEDLIAILDGDEEALLVKELLVGSSV 360

Query: 361 LKEVMVWDAMELKLDLHGAHVGAAYVIMLQWMKEMRLNFEDESCVIPAQVTLISGSGNHS 420
           L E+MVWDAMELKLDLHGAHVGAAYVIMLQW+KEMRLNFEDES VIPAQVTLI GSG HS
Sbjct: 361 LNEIMVWDAMELKLDLHGAHVGAAYVIMLQWIKEMRLNFEDESNVIPAQVTLICGSGKHS 420

Query: 421 IVRGESPVKALIKEIIVRTESPLRVDRKNTGCFISKGKAVKNWLCSLPEKREIVANRKCH 480
           IVRGESPVKALIKEI+VRTESPLR+DRKNTGCFISKGKAVKNWLCSLP K++ VANRKC+
Sbjct: 421 IVRGESPVKALIKEIMVRTESPLRIDRKNTGCFISKGKAVKNWLCSLPGKKDTVANRKCY 480

Query: 481 NRPFTP 486
            RPF P
Sbjct: 481 KRPFIP 486

BLAST of ClCG02G011140 vs. NCBI nr
Match: gi|469474106|gb|AGH33847.1| (PPR [Cucumis melo])

HSP 1 Score: 842.4 bits (2175), Expect = 3.9e-241
Identity = 423/486 (87.04%), Postives = 451/486 (92.80%), Query Frame = 1

Query: 1   MELRLCPPPYVIGDGVRLFLKPLKRHDGFRSYPFVPNLQVKCT-LTKQTHRFLSTLATTA 60
           MELRLCPPPYVIGDGVRL L+PLKR DGFRSYPF+PNLQVKCT LTKQTHRFLSTL+TTA
Sbjct: 1   MELRLCPPPYVIGDGVRLLLQPLKRLDGFRSYPFLPNLQVKCTTLTKQTHRFLSTLSTTA 60

Query: 61  ATGDHSATNRLIRKFVASSPKSITLNVLSNIVSPHTAQPGLCSVALTLYSRITEASWFTW 120
           ATGD SATNRLIRKFVASSPKSITL+VLSNIVS HT QP LCS ALTLYSRITEASWFTW
Sbjct: 61  ATGDQSATNRLIRKFVASSPKSITLSVLSNIVSTHTPQPELCSAALTLYSRITEASWFTW 120

Query: 121 NSKLVADLIAFLDQNGLYSESEALMSETISKLGFQERKLVNFYSQLVESQSKHGSERGCG 180
           NSKLVADL+AFL QNGLYSESEAL+SE ISKLG QERKLVNFYSQLVESQSKHG ERG G
Sbjct: 121 NSKLVADLVAFLGQNGLYSESEALISEAISKLGSQERKLVNFYSQLVESQSKHGFERGFG 180

Query: 181 NSYARLLELLYNSPSVYVKRRAYESMVTGLCSMKRPQEAETLVEKMRSKGITPSAYEYRS 240
           +SY+RL ELLYNSPSVYVKRRAYESMVTGLCSMKRP EAE+LV++MRSKGITP+AYEYRS
Sbjct: 181 DSYSRLFELLYNSPSVYVKRRAYESMVTGLCSMKRPHEAESLVKEMRSKGITPTAYEYRS 240

Query: 241 IIYAYGTLGLFEDMQRSLKQMENEDYALDTICSNMVLSSYGAHNKLADMVLWLQRMKTSA 300
           IIYAYGTLGLFE+M+RSLKQMEN++  LDT+CSNMVLSSYGAHNKL DM+LWLQRMKTS 
Sbjct: 241 IIYAYGTLGLFEEMKRSLKQMENDNIELDTVCSNMVLSSYGAHNKLGDMLLWLQRMKTSP 300

Query: 301 HLNFSVRTYNSVLKSCPKITSMLQDHKSSKFPVLIEGLIAVLDGDEEALLVEELVVGSSV 360
           H   SVRTYNSVL SCPKITSMLQDHKS   PVLIE LIA+LDGDEEALLV+EL+VGSSV
Sbjct: 301 HCKSSVRTYNSVLNSCPKITSMLQDHKSGDLPVLIEDLIAILDGDEEALLVKELLVGSSV 360

Query: 361 LKEVMVWDAMELKLDLHGAHVGAAYVIMLQWMKEMRLNFEDESCVIPAQVTLISGSGNHS 420
           L E+MVWDAMELKLDLHGAHVGAAYVIMLQW+KEMRLNFEDES VIPAQVTLI GSG HS
Sbjct: 361 LNEIMVWDAMELKLDLHGAHVGAAYVIMLQWIKEMRLNFEDESYVIPAQVTLICGSGKHS 420

Query: 421 IVRGESPVKALIKEIIVRTESPLRVDRKNTGCFISKGKAVKNWLCSLPEKREIVANRKCH 480
           IVRGESPVKALIKEI+VRTESPLR+DRKNTGCFISKGKAVKNWLCSLP K++ VANRKC+
Sbjct: 421 IVRGESPVKALIKEIMVRTESPLRIDRKNTGCFISKGKAVKNWLCSLPGKKDTVANRKCY 480

Query: 481 NRPFTP 486
            RPF P
Sbjct: 481 KRPFIP 486

BLAST of ClCG02G011140 vs. NCBI nr
Match: gi|778707816|ref|XP_011656064.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g17033 [Cucumis sativus])

HSP 1 Score: 825.5 bits (2131), Expect = 4.9e-236
Identity = 418/478 (87.45%), Postives = 441/478 (92.26%), Query Frame = 1

Query: 1   MELRLCPPPYVIGDGVRLFLKPLKRHDGFRSYPFVPNLQVKCT-LTKQTHRFLSTLATTA 60
           MELRLCPPPYVIGDGVRLFL P KR   FRSYPFVPNLQVKCT LTKQTHRFLSTL+TTA
Sbjct: 1   MELRLCPPPYVIGDGVRLFLHPFKRLHAFRSYPFVPNLQVKCTSLTKQTHRFLSTLSTTA 60

Query: 61  ATGDHSATNRLIRKFVASSPKSITLNVLSNIVSPHTAQPGLCSVALTLYSRITEASWFTW 120
           ATGD SATNRLIRKFVASSPKSITL+VLSNIVS HT QP LCS ALTLYSRITEASWFTW
Sbjct: 61  ATGDQSATNRLIRKFVASSPKSITLSVLSNIVSTHTPQPELCSAALTLYSRITEASWFTW 120

Query: 121 NSKLVADLIAFLDQNGLYSESEALMSETISKLGFQERKLVNFYSQLVESQSKHGSERGCG 180
           NSKLVADL+AFLDQNGLYSESE L+SE ISKLG QERKLVNFYSQLVESQSKHG ERG  
Sbjct: 121 NSKLVADLVAFLDQNGLYSESEVLISEAISKLGSQERKLVNFYSQLVESQSKHGFERGFV 180

Query: 181 NSYARLLELLYNSPSVYVKRRAYESMVTGLCSMKRPQEAETLVEKMRSKGITPSAYEYRS 240
           +SY+RLLELLYNSPSVYVKRRAYESMVTGLCSMKRP EAE LV++MRSKGITP+AYEYRS
Sbjct: 181 DSYSRLLELLYNSPSVYVKRRAYESMVTGLCSMKRPHEAENLVKEMRSKGITPTAYEYRS 240

Query: 241 IIYAYGTLGLFEDMQRSLKQMENEDYALDTICSNMVLSSYGAHNKLADMVLWLQRMKTSA 300
           IIYAYGTLGLFE+M+RSLKQMEN++  LDT+CSNMVLSSYGAHNKL DMVLWLQRMKTS 
Sbjct: 241 IIYAYGTLGLFEEMKRSLKQMENDNIELDTVCSNMVLSSYGAHNKLGDMVLWLQRMKTSP 300

Query: 301 HLNFSVRTYNSVLKSCPKITSMLQDHKSSKFPVLIEGLIAVLDGDEEALLVEELVVGSSV 360
           H N SVRTYNSVL SCPKIT+MLQDHKS+  PVLIE LIAVLDGDEEALLVEEL+ GSSV
Sbjct: 301 HCNSSVRTYNSVLNSCPKITAMLQDHKSTNLPVLIEDLIAVLDGDEEALLVEELLAGSSV 360

Query: 361 LKEVMVWDAMELKLDLHGAHVGAAYVIMLQWMKEMRLNFEDESCVIPAQVTLISGSGNHS 420
           L E+MVWDAMELKLDLHGAHVGAAYVIMLQW+KEMRLNFEDES VIPAQVTLI GSG HS
Sbjct: 361 LNEIMVWDAMELKLDLHGAHVGAAYVIMLQWIKEMRLNFEDESYVIPAQVTLICGSGKHS 420

Query: 421 IVRGESPVKALIKEIIVRTESPLRVDRKNTGCFISKGKAVKNWLCSLPEKREIVANRK 478
           IVRGESPVKALIKEI+VRTESPLR+DRKNTGCFISKGKAVKNWLCSLP K++ V NRK
Sbjct: 421 IVRGESPVKALIKEIMVRTESPLRIDRKNTGCFISKGKAVKNWLCSLPGKKDTVPNRK 478

BLAST of ClCG02G011140 vs. NCBI nr
Match: gi|658045366|ref|XP_008358358.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like [Malus domestica])

HSP 1 Score: 493.0 bits (1268), Expect = 5.8e-136
Identity = 262/456 (57.46%), Postives = 336/456 (73.68%), Query Frame = 1

Query: 12  IGDGVRLF--LKPLKRHDGFRSYPFVPNLQVKCTLTKQTHRFLSTLATTAATGDHSATNR 71
           IG G   F    P K H    + P   ++Q  C LTKQ  RFL+ LA  A   D   TN+
Sbjct: 13  IGSGQLSFSVASPWKHHQPRPTPPLASSVQ--CVLTKQGQRFLTKLAANAR--DPKFTNK 72

Query: 72  LIRKFVASSPKSITLNVLSNIVSPHTAQPGLCSVALTLYSRITEASWFTWNSKLVADLIA 131
           LI KF++SSPKSI L+ LS ++SP +  P L S+A  LYS+ITE SWF WN KLVA L+A
Sbjct: 73  LISKFLSSSPKSIALSTLSYLLSPDSTPPHLSSLAFPLYSKITEESWFEWNPKLVASLVA 132

Query: 132 FLDQNGLYSESEALMSETISKLGFQERKLVNFYSQLVESQSKHGSERGCGNSYARLLELL 191
            LD  GLYS+SEAL+SETISKLG +ER+L  F+ QL+ES SK  S+ G  ++Y+ L +LL
Sbjct: 133 LLDNQGLYSQSEALISETISKLGSRERELALFHCQLLESHSKLSSKHGFDSTYSYLHQLL 192

Query: 192 YNSPSVYVKRRAYESMVTGLCSMKRPQEAETLVEKMRSKGITPSAYEYRSIIYAYGTLGL 251
           +NS SVYVKRRA+ESMV GLC+M RPQEA+ L+E+M  KG+ PS +E+RS++Y YG LGL
Sbjct: 193 HNSSSVYVKRRAFESMVGGLCAMDRPQEADILIEEMMVKGLKPSVFEFRSVVYGYGRLGL 252

Query: 252 FEDMQRSLKQMENEDYALDTICSNMVLSSYGAHNKLADMVLWLQRMKTSAHLNFSVRTYN 311
           FE+M + +++ME +  A+DTICSNMVLSSYGA+++LA MVLWL++MK    L FS+RTYN
Sbjct: 253 FEEMLKVVEKMEGQGLAVDTICSNMVLSSYGAYSELAAMVLWLRKMKI-LRLPFSIRTYN 312

Query: 312 SVLKSCPKITSMLQDHKSSKFPVLIEGLIAVLDGDEEALLVEELVVGSSVLKEVMVWDAM 371
           SVL SCP I +MLQD K    P  IE L  VL+GD E L+V+EL VGS+VL+EVMVW+++
Sbjct: 313 SVLNSCPTIMAMLQDPKD--VPCSIEQLNGVLNGD-EGLVVKEL-VGSTVLEEVMVWESL 372

Query: 372 ELKLDLHGAHVGAAYVIMLQWMKEMRLNFEDESCVIPAQVTLISGSGNHSIVRGESPVKA 431
           E KLDLHG H+G+AY+IML+W + MR  F    CVIPA+V ++ G G HS VRGESPVK 
Sbjct: 373 EAKLDLHGLHLGSAYLIMLEWFEAMRHRFNCGECVIPAEVVIVCGLGKHSSVRGESPVKG 432

Query: 432 LIKEIIVRTESPLRVDRKNTGCFISKGKAVKNWLCS 466
           L+K ++ R  SP+R+DRKN GCFI+KG+AVK+WLCS
Sbjct: 433 LVKVMMHRMGSPMRIDRKNVGCFIAKGRAVKDWLCS 459

BLAST of ClCG02G011140 vs. NCBI nr
Match: gi|658045374|ref|XP_008358363.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like [Malus domestica])

HSP 1 Score: 493.0 bits (1268), Expect = 5.8e-136
Identity = 262/456 (57.46%), Postives = 336/456 (73.68%), Query Frame = 1

Query: 12  IGDGVRLF--LKPLKRHDGFRSYPFVPNLQVKCTLTKQTHRFLSTLATTAATGDHSATNR 71
           IG G   F    P K H    + P   ++Q  C LTKQ  RFL+ LA  A   D   TN+
Sbjct: 13  IGSGQLSFSVASPWKHHQPRPTPPLASSVQ--CVLTKQGQRFLTKLAANAR--DPKFTNK 72

Query: 72  LIRKFVASSPKSITLNVLSNIVSPHTAQPGLCSVALTLYSRITEASWFTWNSKLVADLIA 131
           LI KF++SSPKSI L+ LS ++SP +  P L S+A  LYS+ITE SWF WN KLVA L+A
Sbjct: 73  LISKFLSSSPKSIALSTLSYLLSPDSTPPHLSSLAFPLYSKITEESWFEWNPKLVASLVA 132

Query: 132 FLDQNGLYSESEALMSETISKLGFQERKLVNFYSQLVESQSKHGSERGCGNSYARLLELL 191
            LD  GLYS+SEAL+SETISKLG +ER+L  F+ QL+ES SK  S+ G  ++Y+ L +LL
Sbjct: 133 LLDNQGLYSQSEALISETISKLGSRERELALFHCQLLESHSKLSSKHGFDSTYSYLHQLL 192

Query: 192 YNSPSVYVKRRAYESMVTGLCSMKRPQEAETLVEKMRSKGITPSAYEYRSIIYAYGTLGL 251
           +NS SVYVKRRA+ESMV GLC+M RPQEA+ L+E+M  KG+ PS +E+RS++Y YG LGL
Sbjct: 193 HNSSSVYVKRRAFESMVGGLCAMDRPQEADILIEEMMVKGLKPSVFEFRSVVYGYGRLGL 252

Query: 252 FEDMQRSLKQMENEDYALDTICSNMVLSSYGAHNKLADMVLWLQRMKTSAHLNFSVRTYN 311
           FE+M + +++ME +  A+DTICSNMVLSSYGA+++LA MVLWL++MK    L FS+RTYN
Sbjct: 253 FEEMLKVVEKMEGQGLAVDTICSNMVLSSYGAYSELAAMVLWLRKMKI-LRLPFSIRTYN 312

Query: 312 SVLKSCPKITSMLQDHKSSKFPVLIEGLIAVLDGDEEALLVEELVVGSSVLKEVMVWDAM 371
           SVL SCP I +MLQD K    P  IE L  VL+GD E L+V+EL VGS+VL+EVMVW+++
Sbjct: 313 SVLNSCPTIMAMLQDPKD--VPCSIEQLNGVLNGD-EGLVVKEL-VGSTVLEEVMVWESL 372

Query: 372 ELKLDLHGAHVGAAYVIMLQWMKEMRLNFEDESCVIPAQVTLISGSGNHSIVRGESPVKA 431
           E KLDLHG H+G+AY+IML+W + MR  F    CVIPA+V ++ G G HS VRGESPVK 
Sbjct: 373 EAKLDLHGLHLGSAYLIMLEWFEAMRHRFNCGECVIPAEVVIVCGLGKHSSVRGESPVKG 432

Query: 432 LIKEIIVRTESPLRVDRKNTGCFISKGKAVKNWLCS 466
           L+K ++ R  SP+R+DRKN GCFI+KG+AVK+WLCS
Sbjct: 433 LVKVMMHRMGSPMRIDRKNVGCFIAKGRAVKDWLCS 459

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP157_ARATH7.9e-12555.11Pentatricopeptide repeat-containing protein At2g17033 OS=Arabidopsis thaliana GN... [more]
PP123_ARATH3.5e-0821.81Pentatricopeptide repeat-containing protein At1g74750 OS=Arabidopsis thaliana GN... [more]
PP402_ARATH1.3e-0721.82Putative pentatricopeptide repeat-containing protein At5g36300 OS=Arabidopsis th... [more]
PP279_ARATH2.5e-0621.61Pentatricopeptide repeat-containing protein At3g53170 OS=Arabidopsis thaliana GN... [more]
PP327_ARATH4.3e-0622.27Pentatricopeptide repeat-containing protein At4g20090 OS=Arabidopsis thaliana GN... [more]
Match NameE-valueIdentityDescription
Q5DMV7_CUCME1.2e-24187.04Pentatricopeptide (PPR) repeat protein-like OS=Cucumis melo GN=PPR PE=4 SV=1[more]
M4R4K5_CUCME2.7e-24187.04PPR OS=Cucumis melo GN=PPR PE=4 SV=1[more]
M5VLA1_PRUPE1.5e-13559.29Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa021547mg PE=4 SV=1[more]
W9QLA9_9ROSA6.5e-13458.59Uncharacterized protein OS=Morus notabilis GN=L484_006076 PE=4 SV=1[more]
A0A061DXY1_THECC1.9e-13357.38Pentatricopeptide (PPR) repeat-containing protein, putative OS=Theobroma cacao G... [more]
Match NameE-valueIdentityDescription
AT2G17033.24.4e-12655.11 pentatricopeptide (PPR) repeat-containing protein[more]
AT1G74750.12.0e-0921.81 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT3G53170.11.4e-0721.61 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT4G20090.12.4e-0722.27 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT1G18900.39.3e-0721.53 Pentatricopeptide repeat (PPR) superfamily protein[more]
Match NameE-valueIdentityDescription
gi|659119236|ref|XP_008459547.1|1.8e-24187.04PREDICTED: pentatricopeptide repeat-containing protein At2g17033 [Cucumis melo][more]
gi|469474106|gb|AGH33847.1|3.9e-24187.04PPR [Cucumis melo][more]
gi|778707816|ref|XP_011656064.1|4.9e-23687.45PREDICTED: pentatricopeptide repeat-containing protein At2g17033 [Cucumis sativu... [more]
gi|658045366|ref|XP_008358358.1|5.8e-13657.46PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like [Malus dom... [more]
gi|658045374|ref|XP_008358363.1|5.8e-13657.46PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like [Malus dom... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002625Smr_dom
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009058 biosynthetic process
biological_process GO:0009987 cellular process
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
ClCG02G011140.1ClCG02G011140.1mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002625Smr domainSMARTSM00463SMR_2coord: 370..457
score: 1.6
IPR002625Smr domainPROFILEPS50828SMRcoord: 373..457
score: 15
IPR002625Smr domainunknownSSF160443SMR domain-likecoord: 371..453
score: 4.4
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 201..230
score: 1.
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 201..233
score: 1.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 268..298
score: 5.963coord: 233..267
score: 7.574coord: 198..232
score: 11
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 46..446
score: 5.5
NoneNo IPR availablePANTHERPTHR24015:SF537SUBFAMILY NOT NAMEDcoord: 46..446
score: 5.5

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
ClCG02G011140Cla001730Watermelon (97103) v1wcgwmB200
ClCG02G011140Cla97C02G036900Watermelon (97103) v2wcgwmbB138
ClCG02G011140Bhi10G000927Wax gourdwcgwgoB290
ClCG02G011140Lsi10G008190Bottle gourd (USVL1VR-Ls)lsiwcgB049
ClCG02G011140Carg16155Silver-seed gourdcarwcgB0337
The following gene(s) are paralogous to this gene:

None