ClCG01G009870 (gene) Watermelon (Charleston Gray)

Name: ClCG01G009870
Type: gene
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: Armadillo/beta-catenin repeat family protein
Location: CG_Chr01 : 14213347 .. 14215002 (+)
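The location field can be turned into a feature length with a few lines of Python. This is a minimal sketch, not the database's own tooling: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive at both ends.

```python
import re


def parse_location(location: str):
    """Split a string like 'CG_Chr01 : 14213347 .. 14215002 (+)' into its parts."""
    m = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", location
    )
    if m is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1


chrom, start, end, strand = parse_location("CG_Chr01 : 14213347 .. 14215002 (+)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the feature spans 1656 bases on the plus strand of CG_Chr01.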
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAAATACCTCCGGAAACTGATCATTTCCTTCTCTCTAACAACCTCATTTCTTCTCTTCTTGACGATATTCCACTCATCAACAATTTCAAGGGCAAATGGTCTTCCATCAGAGCAAAACTCTCCGATCTTCGCACTCAATTGATCGATGTTTCCCACTTTCCCAACTCCTCTTCCAATCCCCTCTCTCTCGATTTCCTTCGTTCCGTTCTCGAAGCTCTTACGCAGGCGGCTTCTCTCTCCCACAAGTGCCGGAATCCGGCACTTTCCGATGGTAAACTCAAGACCCAGAGCGATATTGACGCCGTTCTCGCCAAGTTCGACTCCCTTCTCAAAGACGGTGAGGTCTTGATTAGAAGTGAGATTCTTCACGACGGTGTCGTTTCGAGTTCCTCGTCTAGAAGGGAGGCCGTGAGGGCGGAGTCCAGGAATTTGATCACTAGGTTGCAGATTGGGAGCATTGAATCCAGGGTATTGGCCATTGATTCGCTGTTGCAGTTGCTGAATGAGGATGATAAGAATGTTACTATTGCTGCAGCTCAAGGTGCTGTTCCTGTTCTTGTTCGGCTACTGGATTCCAGCTCCTTAGAATTGAAGGAGAGGGCTGTTGCTGCTATTTCTATTGTTTCTATGGTGGATGGTGTTAAGCACGTCATGATTGCTGAAGGTCTAGTGCTTTTGAATCACTTGCTGAGGATTCTCGATTCTGGTAGCGGTTTTGCTAAAGAGAAGGCCTGTTTAGCCCTCCAACCCCTGAGTATTTCCAAGGAAAATGCTAGGTCAATCGGTTCTAGAGGAGGAATTTCATCTCTGTTGGAGATTTGTGAGGCCGGTACTCCCGGTTCTCAAGCTTCTGCAGCTGCAGTTTTGAGAAATCTTGCGTCATTTAGTGAAATTAAAGAGAATTTCATAGAAGAAAATGGGGTTGTAGTTCTTTTGGGGCTTTTGGCCTCGGGAACTCCGTTGGCTCAAGAAAACGCGATTGGGTGTTTGTGTAATTTAGTTCTGGACGATGATAATCTGAAGCTCTTGATTGTTAGAGAAGGTGGGATCGAGTTCTTGAGAAATTTCTGGGATTCTGTTCCATCAGTTCGTAGTCTTGAAGTTGCTGTGGAGCTTTTGAGCCTCTTGGCTTCTTATTCCCCAATTGCAGAAGCTCTTATTTCAGATGGATTTGTTGATCGACTTCTTCCAGTTTTGAGTTGTGGAGTATTAGGTGCAAGAACTGCAGCAGCTCGAGCAGTTTACGAGCTCGGATTCTGCACAAAAACAAGAAAAGAAATGGGGGAATCTGGATTCATTACACCCTTAATTAATATGTTGGATGGTAAGTCTGTTGATGAGAAAAAAGCAGCTGCTAAGGCATTGTCTTCTCTATTACAATACTCTGGTAACAGAAGAATCTTCCAGAAAGAGGAGAGGGGAATTGTAAGTGCAGTTCAACTCTTAGACCCTTCAATCTCAAACCTCGACAAGAAATACCCTGTTTCATTATTATCCTCAGTTGTGATTTCAAGCAAGTGTAGAAAGCAGATGGTTGCTGCTGGTGCTGGTTTGTATCTACAGAAGCTTGTTGAAATGAATGTTGAAGGGTCAAAGAAGCTGTTGGAAAGTCTTGGCCGTGGTAAAATCTGGGGTGTCTTTGTCAGATCTTAG

mRNA sequence

ATGAAAATACCTCCGGAAACTGATCATTTCCTTCTCTCTAACAACCTCATTTCTTCTCTTCTTGACGATATTCCACTCATCAACAATTTCAAGGGCAAATGGTCTTCCATCAGAGCAAAACTCTCCGATCTTCGCACTCAATTGATCGATGTTTCCCACTTTCCCAACTCCTCTTCCAATCCCCTCTCTCTCGATTTCCTTCGTTCCGTTCTCGAAGCTCTTACGCAGGCGGCTTCTCTCTCCCACAAGTGCCGGAATCCGGCACTTTCCGATGGTAAACTCAAGACCCAGAGCGATATTGACGCCGTTCTCGCCAAGTTCGACTCCCTTCTCAAAGACGGTGAGGTCTTGATTAGAAGTGAGATTCTTCACGACGGTGTCGTTTCGAGTTCCTCGTCTAGAAGGGAGGCCGTGAGGGCGGAGTCCAGGAATTTGATCACTAGGTTGCAGATTGGGAGCATTGAATCCAGGGTATTGGCCATTGATTCGCTGTTGCAGTTGCTGAATGAGGATGATAAGAATGTTACTATTGCTGCAGCTCAAGGTGCTGTTCCTGTTCTTGTTCGGCTACTGGATTCCAGCTCCTTAGAATTGAAGGAGAGGGCTGTTGCTGCTATTTCTATTGTTTCTATGGTGGATGGTGTTAAGCACGTCATGATTGCTGAAGGTCTAGTGCTTTTGAATCACTTGCTGAGGATTCTCGATTCTGGTAGCGGTTTTGCTAAAGAGAAGGCCTGTTTAGCCCTCCAACCCCTGAGTATTTCCAAGGAAAATGCTAGGTCAATCGGTTCTAGAGGAGGAATTTCATCTCTGTTGGAGATTTGTGAGGCCGGTACTCCCGGTTCTCAAGCTTCTGCAGCTGCAGTTTTGAGAAATCTTGCGTCATTTAGTGAAATTAAAGAGAATTTCATAGAAGAAAATGGGGTTGTAGTTCTTTTGGGGCTTTTGGCCTCGGGAACTCCGTTGGCTCAAGAAAACGCGATTGGGTGTTTGTGTAATTTAGTTCTGGACGATGATAATCTGAAGCTCTTGATTGTTAGAGAAGGTGGGATCGAGTTCTTGAGAAATTTCTGGGATTCTGTTCCATCAGTTCGTAGTCTTGAAGTTGCTGTGGAGCTTTTGAGCCTCTTGGCTTCTTATTCCCCAATTGCAGAAGCTCTTATTTCAGATGGATTTGTTGATCGACTTCTTCCAGTTTTGAGTTGTGGAGTATTAGGTGCAAGAACTGCAGCAGCTCGAGCAGTTTACGAGCTCGGATTCTGCACAAAAACAAGAAAAGAAATGGGGGAATCTGGATTCATTACACCCTTAATTAATATGTTGGATGGTAAGTCTGTTGATGAGAAAAAAGCAGCTGCTAAGGCATTGTCTTCTCTATTACAATACTCTGGTAACAGAAGAATCTTCCAGAAAGAGGAGAGGGGAATTGTAAGTGCAGTTCAACTCTTAGACCCTTCAATCTCAAACCTCGACAAGAAATACCCTGTTTCATTATTATCCTCAGTTGTGATTTCAAGCAAGTGTAGAAAGCAGATGGTTGCTGCTGGTGCTGGTTTGTATCTACAGAAGCTTGTTGAAATGAATGTTGAAGGGTCAAAGAAGCTGTTGGAAAGTCTTGGCCGTGGTAAAATCTGGGGTGTCTTTGTCAGATCTTAG

Coding sequence (CDS)

ATGAAAATACCTCCGGAAACTGATCATTTCCTTCTCTCTAACAACCTCATTTCTTCTCTTCTTGACGATATTCCACTCATCAACAATTTCAAGGGCAAATGGTCTTCCATCAGAGCAAAACTCTCCGATCTTCGCACTCAATTGATCGATGTTTCCCACTTTCCCAACTCCTCTTCCAATCCCCTCTCTCTCGATTTCCTTCGTTCCGTTCTCGAAGCTCTTACGCAGGCGGCTTCTCTCTCCCACAAGTGCCGGAATCCGGCACTTTCCGATGGTAAACTCAAGACCCAGAGCGATATTGACGCCGTTCTCGCCAAGTTCGACTCCCTTCTCAAAGACGGTGAGGTCTTGATTAGAAGTGAGATTCTTCACGACGGTGTCGTTTCGAGTTCCTCGTCTAGAAGGGAGGCCGTGAGGGCGGAGTCCAGGAATTTGATCACTAGGTTGCAGATTGGGAGCATTGAATCCAGGGTATTGGCCATTGATTCGCTGTTGCAGTTGCTGAATGAGGATGATAAGAATGTTACTATTGCTGCAGCTCAAGGTGCTGTTCCTGTTCTTGTTCGGCTACTGGATTCCAGCTCCTTAGAATTGAAGGAGAGGGCTGTTGCTGCTATTTCTATTGTTTCTATGGTGGATGGTGTTAAGCACGTCATGATTGCTGAAGGTCTAGTGCTTTTGAATCACTTGCTGAGGATTCTCGATTCTGGTAGCGGTTTTGCTAAAGAGAAGGCCTGTTTAGCCCTCCAACCCCTGAGTATTTCCAAGGAAAATGCTAGGTCAATCGGTTCTAGAGGAGGAATTTCATCTCTGTTGGAGATTTGTGAGGCCGGTACTCCCGGTTCTCAAGCTTCTGCAGCTGCAGTTTTGAGAAATCTTGCGTCATTTAGTGAAATTAAAGAGAATTTCATAGAAGAAAATGGGGTTGTAGTTCTTTTGGGGCTTTTGGCCTCGGGAACTCCGTTGGCTCAAGAAAACGCGATTGGGTGTTTGTGTAATTTAGTTCTGGACGATGATAATCTGAAGCTCTTGATTGTTAGAGAAGGTGGGATCGAGTTCTTGAGAAATTTCTGGGATTCTGTTCCATCAGTTCGTAGTCTTGAAGTTGCTGTGGAGCTTTTGAGCCTCTTGGCTTCTTATTCCCCAATTGCAGAAGCTCTTATTTCAGATGGATTTGTTGATCGACTTCTTCCAGTTTTGAGTTGTGGAGTATTAGGTGCAAGAACTGCAGCAGCTCGAGCAGTTTACGAGCTCGGATTCTGCACAAAAACAAGAAAAGAAATGGGGGAATCTGGATTCATTACACCCTTAATTAATATGTTGGATGGTAAGTCTGTTGATGAGAAAAAAGCAGCTGCTAAGGCATTGTCTTCTCTATTACAATACTCTGGTAACAGAAGAATCTTCCAGAAAGAGGAGAGGGGAATTGTAAGTGCAGTTCAACTCTTAGACCCTTCAATCTCAAACCTCGACAAGAAATACCCTGTTTCATTATTATCCTCAGTTGTGATTTCAAGCAAGTGTAGAAAGCAGATGGTTGCTGCTGGTGCTGGTTTGTATCTACAGAAGCTTGTTGAAATGAATGTTGAAGGGTCAAAGAAGCTGTTGGAAAGTCTTGGCCGTGGTAAAATCTGGGGTGTCTTTGTCAGATCTTAG

Protein sequence

MKIPPETDHFLLSNNLISSLLDDIPLINNFKGKWSSIRAKLSDLRTQLIDVSHFPNSSSNPLSLDFLRSVLEALTQAASLSHKCRNPALSDGKLKTQSDIDAVLAKFDSLLKDGEVLIRSEILHDGVVSSSSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAAQGAVPVLVRLLDSSSLELKERAVAAISIVSMVDGVKHVMIAEGLVLLNHLLRILDSGSGFAKEKACLALQPLSISKENARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFSEIKENFIEENGVVVLLGLLASGTPLAQENAIGCLCNLVLDDDNLKLLIVREGGIEFLRNFWDSVPSVRSLEVAVELLSLLASYSPIAEALISDGFVDRLLPVLSCGVLGARTAAARAVYELGFCTKTRKEMGESGFITPLINMLDGKSVDEKKAAAKALSSLLQYSGNRRIFQKEERGIVSAVQLLDPSISNLDKKYPVSLLSSVVISSKCRKQMVAAGAGLYLQKLVEMNVEGSKKLLESLGRGKIWGVFVRS
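The relationship between the coding sequence and the protein sequence above can be spot-checked by translating a few codons. The sketch below assumes the standard genetic code and uses only the first 18 and last 21 bases of the CDS as input; the `translate` helper is a hypothetical name, not part of any database tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAAAATACCTCCGGAA"))     # first six codons of the CDS
print(translate("GGTGTCTTTGTCAGATCTTAG"))  # last seven codons, ending in TAG
```

The two fragments translate to MKIPPE and GVFVRS, which match the first and last six residues of the protein sequence listed above.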
BLAST of ClCG01G009870 vs. Swiss-Prot
Match: PUB4_ARATH (U-box domain-containing protein 4 OS=Arabidopsis thaliana GN=PUB4 PE=1 SV=3)

HSP 1 Score: 97.8 bits (242), Expect = 3.9e-19
Identity = 100/386 (25.91%), Positives = 163/386 (42.23%), Query Frame = 1

Query: 32  GKWSSIRAKLSDLRTQLIDVSHFPNSSSNPLSLDFLRSVLEALTQAASLSHKCRN-PALS 91
           G+ S      S   T  +    FP + +N  S         A   ++  S + R+ P  +
Sbjct: 431 GQTSENHHHRSPSATSTVSNEEFPRADANENS----EESAHATPYSSDASGEIRSGPLAA 490

Query: 92  DGKLKTQSDIDAVLAKFDSLLKDGEVLIR-SEILHDGVVS--SSSSRREAVRAES--RNL 151
                T+ D+     KF      G+   R SE L   +VS  S+ +RR+    E+  + L
Sbjct: 491 TTSAATRRDLSDFSPKFMDRRTRGQFWRRPSERLGSRIVSAPSNETRRDLSEVETQVKKL 550

Query: 152 ITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAAQGAVPVLVRLLDSSSLELKERAVAA 211
           +  L+  S++++  A   L  L   +  N  +    GA+ +LV LL S+    +E AV A
Sbjct: 551 VEELKSSSLDTQRQATAELRLLAKHNMDNRIVIGNSGAIVLLVELLYSTDSATQENAVTA 610

Query: 212 ISIVSMVDGVKHVMIAEGLVLLNHLLRILDSGSGFAKEKACLALQPLSISKENARSIGSR 271
           +  +S+ D  K  +   G +    L+ +L++GS  AKE +   L  LS+ +EN   IG  
Sbjct: 611 LLNLSINDNNKKAIADAGAI--EPLIHVLENGSSEAKENSAATLFSLSVIEENKIKIGQS 670

Query: 272 GGISSLLEICEAGTPGSQASAAAVLRNLASFSEIKENFIEENGVVVLLGLLASGTPLAQE 331
           G I  L+++   GTP  +  AA  L NL+   E K   ++   V  L+ L+     +  +
Sbjct: 671 GAIGPLVDLLGNGTPRGKKDAATALFNLSIHQENKAMIVQSGAVRYLIDLMDPAAGMV-D 730

Query: 332 NAIGCLCNLVLDDDNLKLLIVREGGIEFLRNFWDSVPSVRSLEVAVELLSLLASYSPIAE 391
            A+  L NL    +  +  I +EGGI  L    +   +      A  LL L  +      
Sbjct: 731 KAVAVLANLATIPEG-RNAIGQEGGIPLLVEVVELGSARGKENAAAALLQLSTNSGRFCN 790

Query: 392 ALISDGFVDRLLPVLSCGVLGARTAA 412
            ++ +G V  L+ +   G   AR  A
Sbjct: 791 MVLQEGAVPPLVALSQSGTPRAREKA 808
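The two summary lines that open each HSP can be read mechanically. The following is a rough sketch of a parser for that layout; `parse_hsp` and the dictionary keys are hypothetical names chosen for illustration, and the pattern tolerates either spelling of the "Positives" label.

```python
import re

SCORE_RE = re.compile(
    r"HSP\s+(\d+)\s+Score:\s+([\d.]+)\s+bits\s+\((\d+)\),\s+Expect\s+=\s+(\S+)"
)
# 'Posi?tives' accepts both 'Positives' and the 'Postives' variant.
IDENT_RE = re.compile(
    r"Identity\s+=\s+(\d+)/(\d+)\s+\(([\d.]+)%\),\s+"
    r"Posi?tives\s+=\s+(\d+)/(\d+)\s+\(([\d.]+)%\)"
)


def parse_hsp(score_line: str, identity_line: str) -> dict:
    """Extract scores and identity/positive counts from one HSP header."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(identity_line)
    if s is None or i is None:
        raise ValueError("lines do not look like an HSP header")
    return {
        "hsp": int(s.group(1)),
        "bits": float(s.group(2)),
        "raw_score": int(s.group(3)),
        "expect": float(s.group(4)),
        "identities": int(i.group(1)),
        "positives": int(i.group(4)),
        "aligned_length": int(i.group(2)),
    }


hsp = parse_hsp(
    "HSP 1 Score: 97.8 bits (242), Expect = 3.9e-19",
    "Identity = 100/386 (25.91%), Postives = 163/386 (42.23%), Query Frame = 1",
)
pct_identity = round(100 * hsp["identities"] / hsp["aligned_length"], 2)
pct_positive = round(100 * hsp["positives"] / hsp["aligned_length"], 2)
print(hsp["bits"], hsp["expect"], pct_identity, pct_positive)
```

Recomputing the percentages from the counts reproduces the reported 25.91% identity and 42.23% positives for this HSP.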


HSP 2 Score: 95.5 bits (236), Expect = 1.9e-18
Identity = 81/297 (27.27%), Positives = 144/297 (48.48%), Query Frame = 1

Query: 177 IAAAQGAVPVLVRLLDSSSLELKERAVAAISIVSMVDGVKHVMIAE-GLVLLNHLLRILD 236
           ++  +  V  LV  L SSSL+ + +A A + +++  +    ++I   G ++L  L+ +L 
Sbjct: 536 LSEVETQVKKLVEELKSSSLDTQRQATAELRLLAKHNMDNRIVIGNSGAIVL--LVELLY 595

Query: 237 SGSGFAKEKACLALQPLSISKENARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLAS 296
           S     +E A  AL  LSI+  N ++I   G I  L+ + E G+  ++ ++AA L +L+ 
Sbjct: 596 STDSATQENAVTALLNLSINDNNKKAIADAGAIEPLIHVLENGSSEAKENSAATLFSLSV 655

Query: 297 FSEIKENFIEENGVVVLLGLLASGTPLAQENAIGCLCNLVLDDDNLKLLIVREGGIEFLR 356
             E K    +   +  L+ LL +GTP  +++A   L NL +  +N K +IV+ G + +L 
Sbjct: 656 IEENKIKIGQSGAIGPLVDLLGNGTPRGKKDAATALFNLSIHQEN-KAMIVQSGAVRYLI 715

Query: 357 NFWDSVPSVRSLEVAVELLSLLASYSPIAEALISDGFVDRLLPVLSCGVLGARTAAARAV 416
           +  D  P+   ++ AV +L+ LA+      A+  +G +  L+ V+  G    +  AA A+
Sbjct: 716 DLMD--PAAGMVDKAVAVLANLATIPEGRNAIGQEGGIPLLVEVVELGSARGKENAAAAL 775

Query: 417 YELG-----FCTKTRKEMGESGFITPLINMLDGKSVDEKKAAAKALSSLLQYSGNRR 468
            +L      FC    +E    G + PL+ +    S      A +   +LL Y  N+R
Sbjct: 776 LQLSTNSGRFCNMVLQE----GAVPPLVAL----SQSGTPRAREKAQALLSYFRNQR 819
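Bit scores and E-values in a single BLAST search are linked by the Karlin-Altschul relation E = m·n·2^(-S'), where S' is the bit score and m·n is the effective search space. The sketch below assumes that relation applies to the values reported here and solves it for m·n using the two HSPs above; the function name `implied_search_space` is hypothetical, and the true search-space size is not stated in this record.

```python
def implied_search_space(bit_score: float, e_value: float) -> float:
    """Solve E = mn * 2**(-S') for the effective search space mn."""
    return e_value * 2.0 ** bit_score


# (bit score, E-value) pairs copied from HSP 1 and HSP 2 of the PUB4_ARATH hit.
hsps = [(97.8, 3.9e-19), (95.5, 1.9e-18)]
spaces = [implied_search_space(bits, e) for bits, e in hsps]

for (bits, e), mn in zip(hsps, spaces):
    print(f"{bits} bits, E = {e:g} -> implied search space ~ {mn:.2e}")
```

Both HSPs imply a search space on the order of 1e11, which is what one would expect if they come from the same query against the same database; small differences are attributable to rounding in the reported scores.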


HSP 3 Score: 58.5 bits (140), Expect = 2.6e-07
Identity = 73/274 (26.64%), Positives = 119/274 (43.43%), Query Frame = 1

Query: 268 ISSLLEICEAGTPGSQASAAAVLRNLASFSEIKENFIEENGVVVLL-GLLASGTPLAQEN 327
           +  L+E  ++ +  +Q  A A LR LA  +      I  +G +VLL  LL S     QEN
Sbjct: 543 VKKLVEELKSSSLDTQRQATAELRLLAKHNMDNRIVIGNSGAIVLLVELLYSTDSATQEN 602

Query: 328 AIGCLCNLVLDDDNLKLLIVREGGIEFLRNFWDSVPSVRSLEVAVELLSLLASYSPIAEA 387
           A+  L NL ++D+N K  I   G IE L +  ++  S      A  L SL    S I E 
Sbjct: 603 AVTALLNLSINDNN-KKAIADAGAIEPLIHVLENGSSEAKENSAATLFSL----SVIEEN 662

Query: 388 LI---SDGFVDRLLPVLSCGVLGARTAAARAVYELGFCTKTRKEMGESGFITPLINMLDG 447
            I     G +  L+ +L  G    +  AA A++ L    + +  + +SG +  LI+++D 
Sbjct: 663 KIKIGQSGAIGPLVDLLGNGTPRGKKDAATALFNLSIHQENKAMIVQSGAVRYLIDLMDP 722

Query: 448 KSVDEKKAAAKALSSLLQYSGNRRIFQKEERGIVSAVQLLDPSISNLDKKYPVSLLSSVV 507
            +    KA A   +      G   I Q  E GI   V++++   +   +    +LL    
Sbjct: 723 AAGMVDKAVAVLANLATIPEGRNAIGQ--EGGIPLLVEVVELGSARGKENAAAALLQLST 782

Query: 508 ISSKCRKQMVAAGAGLYLQKLVEMNVEGSKKLLE 538
            S +    ++  GA   +  LV ++  G+ +  E
Sbjct: 783 NSGRFCNMVLQEGA---VPPLVALSQSGTPRARE 806
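Each "Match:" line packs an entry name and a UniProt-style description, where OS is the organism, GN the gene name, PE the protein-existence level, and SV the sequence version. A minimal sketch for splitting such a line into fields follows; `parse_match` and the key names are hypothetical, and the pattern assumes all four tags are present in that order.

```python
import re

MATCH_RE = re.compile(
    r"Match:\s+(\S+)\s+\((.+?)\s+OS=(.+?)\s+GN=(\S+)\s+PE=(\d+)\s+SV=(\d+)\)\s*$"
)


def parse_match(line: str) -> dict:
    """Split a 'Match:' header into entry name, description and tags."""
    m = MATCH_RE.match(line)
    if m is None:
        raise ValueError(f"unrecognised match line: {line!r}")
    return {
        "entry": m.group(1),
        "description": m.group(2),
        "organism": m.group(3),
        "gene": m.group(4),
        "protein_existence": int(m.group(5)),
        "sequence_version": int(m.group(6)),
    }


hit = parse_match(
    "Match: PUB4_ARATH (U-box domain-containing protein 4 "
    "OS=Arabidopsis thaliana GN=PUB4 PE=1 SV=3)"
)
print(hit["entry"], "|", hit["organism"], "|", hit["gene"])
```

Because the organism field is matched lazily up to the GN tag, organism names that contain their own parentheses, such as strain designations, are kept intact.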

BLAST of ClCG01G009870 vs. Swiss-Prot
Match: PUB11_ARATH (U-box domain-containing protein 11 OS=Arabidopsis thaliana GN=PUB11 PE=2 SV=2)

HSP 1 Score: 91.3 bits (225), Expect = 3.6e-17
Identity = 71/273 (26.01%), Positives = 124/273 (45.42%), Query Frame = 1

Query: 143 RNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAAQGAVPVLVRLLDSSSLELKERA 202
           R L+ RL   S E R  A+  +  L      N  + A  GA+PVLV LL S  +  +E A
Sbjct: 334 RALVQRLSSRSTEDRRNAVSEIRSLSKRSTDNRILIAEAGAIPVLVNLLTSEDVATQENA 393

Query: 203 VAAISIVSMVDGVKHVMIAEGLVLLNHLLRILDSGSGFAKEKACLALQPLSISKENARSI 262
           +  +  +S+ +  K +++  G V    ++++L +G+  A+E A   L  LS++ EN   I
Sbjct: 394 ITCVLNLSIYENNKELIMFAGAV--TSIVQVLRAGTMEARENAAATLFSLSLADENKIII 453

Query: 263 GSRGGISSLLEICEAGTPGSQASAAAVLRNLASFSEIKENFIEENGVVVLLGLLASGTPL 322
           G  G I +L+++ E GTP  +  AA  L NL  +   K   +    V  L+ +L+  T  
Sbjct: 454 GGSGAIPALVDLLENGTPRGKKDAATALFNLCIYHGNKGRAVRAGIVTALVKMLSDSTRH 513

Query: 323 AQENAIGCLCNLVLDDDNLKLLIVREGGIEFLRNFWDSVPSVRSLEVAVELLSLLASYSP 382
              +    + +++ ++ + K  IV+   +  L     +  +      A  LLSL    + 
Sbjct: 514 RMVDEALTILSVLANNQDAKSAIVKANTLPALIGILQTDQTRNRENAAAILLSLCKRDT- 573

Query: 383 IAEALISDGFVDRLLPVLSCGVLGARTAAARAV 416
             E LI+ G +  ++P++     G      +A+
Sbjct: 574 --EKLITIGRLGAVVPLMDLSKNGTERGKRKAI 601


HSP 2 Score: 76.6 bits (187), Expect = 9.2e-13
Identity = 78/321 (24.30%), Positives = 140/321 (43.61%), Query Frame = 1

Query: 139 RAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAAQGAVPVLVRLLDSSSLEL 198
           + E+  L     + S+ SR  A  ++ Q     +     +     +  LV+ L S S E 
Sbjct: 288 KLENFTLTPNYVLRSLISRWCAEHNIEQPAGYINGRTKNSGDMSVIRALVQRLSSRSTED 347

Query: 199 KERAVAAISIVSMVDGVKHVMIAEGLVLLNHLLRILDSGSGFAKEKACLALQPLSISKEN 258
           +  AV+ I  +S       ++IAE   +   L+ +L S     +E A   +  LSI + N
Sbjct: 348 RRNAVSEIRSLSKRSTDNRILIAEAGAI-PVLVNLLTSEDVATQENAITCVLNLSIYENN 407

Query: 259 ARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFSEIKENFIEENGVVVLLGLLAS 318
              I   G ++S++++  AGT  ++ +AAA L +L+   E K        +  L+ LL +
Sbjct: 408 KELIMFAGAVTSIVQVLRAGTMEARENAAATLFSLSLADENKIIIGGSGAIPALVDLLEN 467

Query: 319 GTPLAQENAIGCLCNLVLDDDNLKLLIVREGGIEFLRNFWDSVPSVRSLEVAVELLSLLA 378
           GTP  +++A   L NL +   N K   VR G +  L          R ++ A+ +LS+LA
Sbjct: 468 GTPRGKKDAATALFNLCIYHGN-KGRAVRAGIVTALVKMLSDSTRHRMVDEALTILSVLA 527

Query: 379 SYSPIAEALISDGFVDRLLPVLSCGVLGARTAAARAVYELGFCTKTRKEMGESGFITPLI 438
           +      A++    +  L+ +L       R  AA  +  L  C +  +++   G +  ++
Sbjct: 528 NNQDAKSAIVKANTLPALIGILQTDQTRNRENAAAIL--LSLCKRDTEKLITIGRLGAVV 587

Query: 439 NMLDGKSVDEKKAAAKALSSL 460
            ++D      ++   KA+S L
Sbjct: 588 PLMDLSKNGTERGKRKAISLL 604


HSP 3 Score: 65.9 bits (159), Expect = 1.6e-09
Identity = 75/316 (23.73%), Positives = 137/316 (43.35%), Query Frame = 1

Query: 70  VLEALTQAASLSHKCRNPA-LSDGKLKTQSDIDAVLAKFDSLLKDGEVLIRSEILHDGVV 129
           VL +L       H    PA   +G+ K   D+  + A    L        R+ +     +
Sbjct: 299 VLRSLISRWCAEHNIEQPAGYINGRTKNSGDMSVIRALVQRLSSRSTEDRRNAVSEIRSL 358

Query: 130 SSSSSRREAVRAESR------NLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAAQG 189
           S  S+    + AE+       NL+T   + + E+ +  + +L   + E++K + + A  G
Sbjct: 359 SKRSTDNRILIAEAGAIPVLVNLLTSEDVATQENAITCVLNLS--IYENNKELIMFA--G 418

Query: 190 AVPVLVRLLDSSSLELKERAVAAISIVSMVDGVKHVMIAEGLVLLNHLLRILDSGSGFAK 249
           AV  +V++L + ++E +E A A +  +S+ D  K ++   G +    L+ +L++G+   K
Sbjct: 419 AVTSIVQVLRAGTMEARENAAATLFSLSLADENKIIIGGSGAI--PALVDLLENGTPRGK 478

Query: 250 EKACLALQPLSISKENARSIGSRGGISSLLEICEAGTPGSQA-SAAAVLRNLASFSEIKE 309
           + A  AL  L I   N       G +++L+++    T       A  +L  LA+  + K 
Sbjct: 479 KDAATALFNLCIYHGNKGRAVRAGIVTALVKMLSDSTRHRMVDEALTILSVLANNQDAKS 538

Query: 310 NFIEENGVVVLLGLLASGTPLAQENAIGCLCNLVLDDDNLKLLIVREGGIEFLRNFWDSV 369
             ++ N +  L+G+L +     +ENA   L +L   D    + I R G +  L +     
Sbjct: 539 AIVKANTLPALIGILQTDQTRNRENAAAILLSLCKRDTEKLITIGRLGAVVPLMDL-SKN 598

Query: 370 PSVRSLEVAVELLSLL 378
            + R    A+ LL LL
Sbjct: 599 GTERGKRKAISLLELL 607


HSP 4 Score: 60.8 bits (146), Expect = 5.2e-08
Identity = 56/205 (27.32%), Positives = 97/205 (47.32%), Query Frame = 1

Query: 312 LLGLLASGTPLAQENAIGCLCNLVLDDDNLKLLIVREGGIEFLRNFWDSVPSVRSLEVAV 371
           L+  L+S +   + NA+  + +L     + ++LI   G I  L N   S   V + E A+
Sbjct: 336 LVQRLSSRSTEDRRNAVSEIRSLSKRSTDNRILIAEAGAIPVLVNLLTS-EDVATQENAI 395

Query: 372 ELLSLLASYSPIAEALISDGFVDRLLPVLSCGVLGARTAAARAVYELGFCTKTRKEMGES 431
             +  L+ Y    E ++  G V  ++ VL  G + AR  AA  ++ L    + +  +G S
Sbjct: 396 TCVLNLSIYENNKELIMFAGAVTSIVQVLRAGTMEARENAAATLFSLSLADENKIIIGGS 455

Query: 432 GFITPLINMLDGKSVDEKKAAAKALSSLLQYSGNRRIFQKEERGIVSA-VQLLDPSISNL 491
           G I  L+++L+  +   KK AA AL +L  Y GN+   +    GIV+A V++L  S  + 
Sbjct: 456 GAIPALVDLLENGTPRGKKDAATALFNLCIYHGNKG--RAVRAGIVTALVKMLSDSTRHR 515

Query: 492 DKKYPVSLLSSVVISSKCRKQMVAA 516
                +++LS +  +   +  +V A
Sbjct: 516 MVDEALTILSVLANNQDAKSAIVKA 537

BLAST of ClCG01G009870 vs. Swiss-Prot
Match: PUB10_ARATH (U-box domain-containing protein 10 OS=Arabidopsis thaliana GN=PUB10 PE=2 SV=1)

HSP 1 Score: 86.3 bits (212), Expect = 1.2e-15
Identity = 82/306 (26.80%), Positives = 135/306 (44.12%), Query Frame = 1

Query: 125 DGVVSSSSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAAQGAV 184
           DG     S    A+RA    L+ +L   SIE R  A+  +  L      N  + A  GA+
Sbjct: 330 DGSFRDLSGDMSAIRA----LVCKLSSQSIEDRRTAVSEIRSLSKRSTDNRILIAEAGAI 389

Query: 185 PVLVRLLDSSS-LELKERAVAAISIVSMVDGVKHVMIAEGLVLLNHLLRILDSGSGFAKE 244
           PVLV+LL S    E +E AV  I  +S+ +  K +++  G V    ++ +L +GS  A+E
Sbjct: 390 PVLVKLLTSDGDTETQENAVTCILNLSIYEHNKELIMLAGAV--TSIVLVLRAGSMEARE 449

Query: 245 KACLALQPLSISKENARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFSEIKENF 304
            A   L  LS++ EN   IG+ G I +L+++ + G+   +  AA  L NL  +   K   
Sbjct: 450 NAAATLFSLSLADENKIIIGASGAIMALVDLLQYGSVRGKKDAATALFNLCIYQGNKGRA 509

Query: 305 IEENGVVVLLGLLASGTPLAQENAIGCLCNLVLDDDNLKLLIVREGGIEFLRNFWDSVPS 364
           +    V  L+ +L   +     +    + +++  +   K  I+R   I  L +       
Sbjct: 510 VRAGIVKPLVKMLTDSSSERMADEALTILSVLASNQVAKTAILRANAIPPLIDCLQK-DQ 569

Query: 365 VRSLEVAVELLSLLASYSPIAEALISDGFVDRLLPVLSCGVLGARTAAARAVYELGFCTK 424
            R+ E A  +L  L       E LIS G +  ++P++     G   A  +A   L    K
Sbjct: 570 PRNRENAAAILLCLCKRD--TEKLISIGRLGAVVPLMELSRDGTERAKRKANSLLELLRK 626

Query: 425 TRKEMG 430
           + +++G
Sbjct: 630 SSRKLG 626


HSP 2 Score: 57.8 bits (138), Expect = 4.4e-07
Identity = 51/177 (28.81%), Positives = 80/177 (45.20%), Query Frame = 1

Query: 342 KLLIVREGGIEFLRNFWDSVPSVRSLEVAVELLSLLASYSPIAEALISDGFVDRLLPVLS 401
           ++LI   G I  L     S     + E AV  +  L+ Y    E ++  G V  ++ VL 
Sbjct: 376 RILIAEAGAIPVLVKLLTSDGDTETQENAVTCILNLSIYEHNKELIMLAGAVTSIVLVLR 435

Query: 402 CGVLGARTAAARAVYELGFCTKTRKEMGESGFITPLINMLDGKSVDEKKAAAKALSSLLQ 461
            G + AR  AA  ++ L    + +  +G SG I  L+++L   SV  KK AA AL +L  
Sbjct: 436 AGSMEARENAAATLFSLSLADENKIIIGASGAIMALVDLLQYGSVRGKKDAATALFNLCI 495

Query: 462 YSGNRRIFQKEERGIVS-AVQLLDPSISNLDKKYPVSLLSSVVISSKCRKQMVAAGA 518
           Y GN+   +    GIV   V++L  S S       +++LS +  +   +  ++ A A
Sbjct: 496 YQGNKG--RAVRAGIVKPLVKMLTDSSSERMADEALTILSVLASNQVAKTAILRANA 550


HSP 3 Score: 57.4 bits (137), Expect = 5.8e-07
Identity = 68/269 (25.28%), Positives = 117/269 (43.49%), Query Frame = 1

Query: 90  SDGKLKTQS-DIDAVLAKFDSLLKDGEVLIRSEILHDGVVSSSSSRREAVRAESRNLITR 149
           SDG  +  S D+ A+ A    L        R+ +     +S  S+    + AE+  +   
Sbjct: 329 SDGSFRDLSGDMSAIRALVCKLSSQSIEDRRTAVSEIRSLSKRSTDNRILIAEAGAIPVL 388

Query: 150 LQI----GSIESRVLAIDSLLQL-LNEDDKNVTIAAAQGAVPVLVRLLDSSSLELKERAV 209
           +++    G  E++  A+  +L L + E +K + + A  GAV  +V +L + S+E +E A 
Sbjct: 389 VKLLTSDGDTETQENAVTCILNLSIYEHNKELIMLA--GAVTSIVLVLRAGSMEARENAA 448

Query: 210 AAISIVSMVDGVKHVMIAEGLVLLNHLLRILDSGSGFAKEKACLALQPLSISKENARSIG 269
           A +  +S+ D  K ++ A G ++   L+ +L  GS   K+ A  AL  L I + N     
Sbjct: 449 ATLFSLSLADENKIIIGASGAIMA--LVDLLQYGSVRGKKDAATALFNLCIYQGNKGRAV 508

Query: 270 SRGGISSLLE-ICEAGTPGSQASAAAVLRNLASFSEIKENFIEENGVVVLLGLLASGTPL 329
             G +  L++ + ++ +      A  +L  LAS    K   +  N +  L+  L    P 
Sbjct: 509 RAGIVKPLVKMLTDSSSERMADEALTILSVLASNQVAKTAILRANAIPPLIDCLQKDQPR 568

Query: 330 AQENAIGCLCNLVLDDDNLKLLIVREGGI 352
            +ENA   L  L   D    + I R G +
Sbjct: 569 NRENAAAILLCLCKRDTEKLISIGRLGAV 593

BLAST of ClCG01G009870 vs. Swiss-Prot
Match: PUB15_ARATH (U-box domain-containing protein 15 OS=Arabidopsis thaliana GN=PUB15 PE=2 SV=2)

HSP 1 Score: 82.0 bits (201), Expect = 2.2e-14
Identity = 71/238 (29.83%), Positives = 117/238 (49.16%), Query Frame = 1

Query: 184 VPVLVRLLDSSSLELKERAVAAISIVSMVDGVKHVMIAEG--LVLLNHLLRILDSGSGFA 243
           V +LV  L SS LE + R+V  + +++  +    V+IA    + LL  LL   DSG    
Sbjct: 381 VSLLVEALSSSQLEEQRRSVKQMRLLARENPENRVLIANAGAIPLLVQLLSYPDSG---I 440

Query: 244 KEKACLALQPLSISKENARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFSEIKE 303
           +E A   L  LSI + N + I + G I +++EI E G   ++ ++AA L +L+   E K 
Sbjct: 441 QENAVTTLLNLSIDEVNKKLISNEGAIPNIIEILENGNREARENSAAALFSLSMLDENKV 500

Query: 304 NFIEENGVVVLLGLLASGTPLAQENAIGCLCNLVLDDDNLKLLIVREGGIEFLRNFWDSV 363
                NG+  L+ LL  GT   +++A+  L NL L+  N K   +  G ++ L N     
Sbjct: 501 TIGLSNGIPPLVDLLQHGTLRGKKDALTALFNLSLNSAN-KGRAIDAGIVQPLLNLLKD- 560

Query: 364 PSVRSLEVAVELLSLLASYSPIAEALISDGFVDRLLPVLSCGVLGARTAAARAVYELG 420
            ++  ++ A+ +L LLAS+    +A+    F++ L+  +  G    +  A   + ELG
Sbjct: 561 KNLGMIDEALSILLLLASHPEGRQAIGQLSFIETLVEFIRQGTPKNKECATSVLLELG 613


HSP 2 Score: 73.6 bits (179), Expect = 7.8e-12
Identity = 71/270 (26.30%), Positives = 124/270 (45.93%), Query Frame = 1

Query: 145 LITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAAQGAVPVLVRLLDSSSLELKERAVA 204
           L+  L    +E +  ++  +  L  E+ +N  + A  GA+P+LV+LL      ++E AV 
Sbjct: 384 LVEALSSSQLEEQRRSVKQMRLLARENPENRVLIANAGAIPLLVQLLSYPDSGIQENAVT 443

Query: 205 AISIVSMVDGVKHVMIAEGLVLLNHLLRILDSGSGFAKEKACLALQPLSISKENARSIGS 264
            +  +S+ +  K ++  EG +   +++ IL++G+  A+E +  AL  LS+  EN  +IG 
Sbjct: 444 TLLNLSIDEVNKKLISNEGAI--PNIIEILENGNREARENSAAALFSLSMLDENKVTIGL 503

Query: 265 RGGISSLLEICEAGTPGSQASAAAVLRNLASFSEIKENFIEENGVVVLLGLLASGTPLAQ 324
             GI  L+++ + GT   +  A   L NL+  S  K   I+   V  LL LL        
Sbjct: 504 SNGIPPLVDLLQHGTLRGKKDALTALFNLSLNSANKGRAIDAGIVQPLLNLLKDKNLGMI 563

Query: 325 ENAIGCLCNLVLDDDN---LKLLIVREGGIEFLRNFWDSVPSVRSLEVAVELLSLLASYS 384
           + A+  L  L    +    +  L   E  +EF+R      P  +    +V LL L ++ S
Sbjct: 564 DEALSILLLLASHPEGRQAIGQLSFIETLVEFIR---QGTPKNKECATSV-LLELGSNNS 623

Query: 385 PIAEALISDGFVDRLLPVLSCGVLGARTAA 412
               A +  G  + L+ + + G   A+  A
Sbjct: 624 SFILAALQFGVYEYLVEITTSGTNRAQRKA 647


HSP 3 Score: 48.5 bits (114), Expect = 2.7e-04
Identity = 59/233 (25.32%), Positives = 107/233 (45.92%), Query Frame = 1

Query: 305 EENGVVVLLGLLASGTPLAQENAIGCLCNLVLDDDNLKLLIVREGGIEFLRNFWDSVPSV 364
           +++ V +L+  L+S     Q  ++  +  L  ++   ++LI   G I  L     S P  
Sbjct: 377 QKDEVSLLVEALSSSQLEEQRRSVKQMRLLARENPENRVLIANAGAIPLLVQLL-SYPDS 436

Query: 365 RSLEVAVELLSLLASYSPIAEALIS-DGFVDRLLPVLSCGVLGARTAAARAVYELGFCTK 424
              E AV  L L  S   + + LIS +G +  ++ +L  G   AR  +A A++ L    +
Sbjct: 437 GIQENAVTTL-LNLSIDEVNKKLISNEGAIPNIIEILENGNREARENSAAALFSLSMLDE 496

Query: 425 TRKEMGESGFITPLINMLDGKSVDEKKAAAKALSSLLQYSGNRRIFQKEERGIVSAV--Q 484
            +  +G S  I PL+++L   ++  KK A  AL +L   S N+   +  + GIV  +   
Sbjct: 497 NKVTIGLSNGIPPLVDLLQHGTLRGKKDALTALFNLSLNSANKG--RAIDAGIVQPLLNL 556

Query: 485 LLDPSISNLDKKYPVSLLSSVVISSKCRKQMVAAGAGLYLQKLVEMNVEGSKK 535
           L D ++  +D+      LS +++ +   +   A G   +++ LVE   +G+ K
Sbjct: 557 LKDKNLGMIDE-----ALSILLLLASHPEGRQAIGQLSFIETLVEFIRQGTPK 600

BLAST of ClCG01G009870 vs. Swiss-Prot
Match: VAC8_ASPOR (Vacuolar protein 8 OS=Aspergillus oryzae (strain ATCC 42149 / RIB 40) GN=vac8 PE=3 SV=1)

HSP 1 Score: 77.8 bits (190), Expect = 4.1e-13
Identity = 75/334 (22.46%), Positives = 147/334 (44.01%), Query Frame = 1

Query: 145 LITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAAQGAVPVLVRLLDSSSLELKERAVA 204
           LI ++   ++E +  A+  +  L   +D    IA + GA+  L+RL  S  + ++  A  
Sbjct: 152 LIRQMMSPNVEVQCNAVGCITNLATHEDNKAKIARS-GALGPLIRLAKSKDMRVQRNATG 211

Query: 205 AISIVSMVDGVKHVMIAEGLVLLNHLLRILDSGSGFAKEKACLALQPLSISKENARSIGS 264
           A+  ++  D  +  ++  G + +  L+++L S     +     AL  +++   N + +  
Sbjct: 212 ALLNMTHSDDNRQQLVNAGAIPV--LVQLLSSSDVDVQYYCTTALSNIAVDASNRKRLAQ 271

Query: 265 RGG--ISSLLEICEAGTPGSQASAAAVLRNLASFSEIKENFIEENGVVVLLGLLASGTPL 324
                + SL+ + ++ TP  Q  AA  LRNLAS  + +   +   G+  LL LL S    
Sbjct: 272 TESRLVQSLVHLMDSSTPKVQCQAALALRNLASDEKYQLEIVRAKGLPPLLRLLQSSYLP 331

Query: 325 AQENAIGCLCNLVLDDDNLKLLIVREGGIEFLRNFWDSVPSVRSLEVAVELLS----LLA 384
              +A+ C+ N+ +   N   +I  + G  FL+   D + S  + E+    +S    L A
Sbjct: 332 LILSAVACIRNISIHPLNESPII--DAG--FLKPLVDLLGSTDNEEIQCHAISTLRNLAA 391

Query: 385 SYSPIAEALISDGFVDRLLPVLSCGVLGARTAAARAVYELGFCTKTRKEMGESGFITPLI 444
           S     E ++  G V +   ++    L  ++    A+  L    + +  +   G    LI
Sbjct: 392 SSDRNKELVLQAGAVQKCKDLVLKVPLSVQSEMTAAIAVLALSDELKPHLLNLGVFDVLI 451

Query: 445 NMLDGKSVDEKKAAAKALSSLLQYSGNRRIFQKE 473
            + + +S++ +  +A AL +L    G+  IF ++
Sbjct: 452 PLTESESIEVQGNSAAALGNLSSKVGDYSIFVRD 478

BLAST of ClCG01G009870 vs. TrEMBL
Match: A0A061G6D5_THECC (ARM repeat superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_014724 PE=4 SV=1)

HSP 1 Score: 760.8 bits (1963), Expect = 1.2e-216
Identity = 400/550 (72.73%), Positives = 465/550 (84.55%), Query Frame = 1

Query: 1   MKIPPETDHFLLSNNLISSLLDDIPLINNFKGKWSSIRAKLSDLRTQLIDVSHFPNSSSN 60
           MK+P E D   LSN+L++SL + IP INNFKGKW+ I++KLS L+ QL D S FP SSSN
Sbjct: 1   MKVP-ENDPISLSNHLLASLSEQIPNINNFKGKWALIKSKLSGLQAQLADFSDFPASSSN 60

Query: 61  PLSLDFLRSVLEALTQAASLSHKCRNPALSDGKLKTQSDIDAVLAKFDSLLKDGEVLIRS 120
           PL++D L S+ + L  A SLS KC+   L++GKLKTQSDIDAVLAK D  +KD E+LIRS
Sbjct: 61  PLAVDLLYSITQTLNDAVSLSQKCQLADLTEGKLKTQSDIDAVLAKLDRHIKDSEILIRS 120

Query: 121 EILHDGVVSSSSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAA 180
            +L DG VS+SSS++EAVR ESRNLITRLQIG+ ES+  A+DSLL LL EDDKNV IA A
Sbjct: 121 GVLQDGAVSTSSSKKEAVRVESRNLITRLQIGTTESKNSAMDSLLGLLQEDDKNVMIAVA 180

Query: 181 QGAVPVLVRLLDSSSLELKERAVAAISIVSMVDGVKHVMIAEGLVLLNHLLRILDSGSGF 240
           QG VPVLVRLLDSSSLE+KE+ VAAIS VS V+  KHV+IAEGL+LLNHLLR+L+SGSGF
Sbjct: 181 QGVVPVLVRLLDSSSLEMKEKTVAAISRVSTVESSKHVLIAEGLLLLNHLLRVLESGSGF 240

Query: 241 AKEKACLALQPLSISKENARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFSEIK 300
           AKEKAC+ALQ LS SKENAR+IGSRGGISSLLEIC+AGTPGSQA AA VL+NLAS  EIK
Sbjct: 241 AKEKACIALQALSFSKENARAIGSRGGISSLLEICQAGTPGSQAFAAGVLKNLASVDEIK 300

Query: 301 ENFIEENGVVVLLGLLASGTPLAQENAIGCLCNLVLDDDNLKLLIVREGGIEFLRNFWDS 360
           ENFIEEN V VL+GL ASGT LAQEN+IGCLCNLV DD+NL+LLIV+EGGIE L+NFWDS
Sbjct: 301 ENFIEENAVFVLIGLAASGTALAQENSIGCLCNLVSDDENLRLLIVKEGGIECLKNFWDS 360

Query: 361 VPSVRSLEVAVELLSLLASYSPIAEALISDGFVDRLLPVLSCGVLGARTAAARAVYELGF 420
            P+ +SLEVAVEL+  LAS SPIAEAL++DGFV RL+ VL+CGVLG R AAARAVYELGF
Sbjct: 361 SPNPKSLEVAVELVRRLASCSPIAEALVADGFVARLVAVLNCGVLGVRIAAARAVYELGF 420

Query: 421 CTKTRKEMGESGFITPLINMLDGKSVDEKKAAAKALSSLLQYSGNRRIFQKEERGIVSAV 480
            +KTRKEMGE G    LI M+DGK+V+EK+AAA ALS+L+ Y+GNR++FQK+ERGIV+AV
Sbjct: 421 NSKTRKEMGECGCTVALIKMMDGKAVEEKEAAAMALSTLMLYAGNRKVFQKDERGIVNAV 480

Query: 481 QLLDPSISNLDKKYPVSLLSSVVISSKCRKQMVAAGAGLYLQKLVEMNVEGSKKLLESLG 540
           QLLDP I NLDKKYPV +LS +V S KCRKQMVAAGA +YLQKLVEMNVEG+KKLLESLG
Sbjct: 481 QLLDPLIQNLDKKYPVLILSELVHSKKCRKQMVAAGACVYLQKLVEMNVEGAKKLLESLG 540

Query: 541 RGKIWGVFVR 551
           RGKIWGVF R
Sbjct: 541 RGKIWGVFAR 549

BLAST of ClCG01G009870 vs. TrEMBL
Match: W9QV83_9ROSA (U-box domain-containing protein 11 OS=Morus notabilis GN=L484_008839 PE=4 SV=1)

HSP 1 Score: 742.3 bits (1915), Expect = 4.3e-211
Identity = 398/558 (71.33%), Positives = 469/558 (84.05%), Query Frame = 1

Query: 1   MKIPPE-TDHFLLSNNLISSLLDDIPLINNFKGKWSSIRAKLSDLRTQLIDVSHFPNSSS 60
           MK P E  D   +S  L+SSL+D+I L+  FKGKWS IRAKL DLR QL D +  P+++S
Sbjct: 1   MKAPEEEADTTAISTELLSSLMDEILLVQTFKGKWSLIRAKLDDLRPQLADFADSPDAAS 60

Query: 61  NPLSLDFLRSVLEALTQAASLSHKCRNPALSDGKLKTQSDIDAVLAKFDSLLKDGEVLIR 120
           NPLS+D LRSV  AL+ A S++ +C++P+L+DGKL+TQSD+DAVLA+ D +++DGE+L+R
Sbjct: 61  NPLSIDLLRSVAAALSDAISVARRCQSPSLADGKLRTQSDVDAVLARLDRVVRDGEILLR 120

Query: 121 SEILHDG---VVSSS-----SSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNED 180
           S +L D    VVS+S     SSRREAVRAESRNLITRLQIG+ ESR  A+DSLL LL ED
Sbjct: 121 SGVLSDNNRAVVSNSGNSGSSSRREAVRAESRNLITRLQIGTPESRNSAMDSLLGLLRED 180

Query: 181 DKNVTIAAAQGAVPVLVRLLDS-SSLELKERAVAAISIVSMVDGVKHVMIAEGLVLLNHL 240
           DKNV IA AQG VPV VRLLDS SS+E+KE+ VAAIS VSMVD  KHV+IAEGL+LLNHL
Sbjct: 181 DKNVMIAVAQGVVPVFVRLLDSSSSVEMKEKTVAAISRVSMVDSSKHVLIAEGLLLLNHL 240

Query: 241 LRILDSGSGFAKEKACLALQPLSISKENARSIGSRGGISSLLEICEAGTPGSQASAAAVL 300
           LR+LDSGSGF+KEKAC+ALQ LS SKENAR+IGSRGGISSLLEIC+AGTP SQASAA VL
Sbjct: 241 LRVLDSGSGFSKEKACVALQALSFSKENARAIGSRGGISSLLEICQAGTPCSQASAAGVL 300

Query: 301 RNLASFSEIKENFIEENGVVVLLGLLASGTPLAQENAIGCLCNLVLDDDNLKLLIVREGG 360
           RNLA+F+EIKENFIEENG+ VLLGL +SGT LAQENAIGCLCNL+  D+NLKLL+V+EGG
Sbjct: 301 RNLAAFAEIKENFIEENGIAVLLGLTSSGTALAQENAIGCLCNLISGDENLKLLVVKEGG 360

Query: 361 IEFLRNFWDSVPSVRSLEVAVELLSLLASYSPIAEALISDGFVDRLLPVLSCGVLGARTA 420
           IE L+NFWDS PSVRSLEVAV+LLS LAS  P+AEAL SDGFV RL+ VL+CGVLG R A
Sbjct: 361 IECLKNFWDSAPSVRSLEVAVDLLSHLASLLPVAEALCSDGFVARLVSVLNCGVLGVRIA 420

Query: 421 AARAVYELGFCTKTRKEMGESGFITPLINMLDGKSVDEKKAAAKALSSLLQYSGNRRIFQ 480
           AARAV ELG  ++TRKEMGE G I PLI MLDGK+V EK+AAAKALS L+  + NR+IF+
Sbjct: 421 AARAVSELGSSSRTRKEMGECGCIGPLIKMLDGKAVQEKEAAAKALSKLMLCTVNRKIFR 480

Query: 481 KEERGIVSAVQLLDPSISNLDKKYPVSLLSSVVISSKCRKQMVAAGAGLYLQKLVEMNVE 540
           ++E+GIVSAVQLLDPS+ NLDKKYPVS+L+S+  S KCRKQMVAAGA  YLQK+VEM+VE
Sbjct: 481 RDEKGIVSAVQLLDPSLRNLDKKYPVSVLASLSHSKKCRKQMVAAGACAYLQKVVEMDVE 540

Query: 541 GSKKLLESLGRGKIWGVF 549
           GSKKLLESLGRGK+WGVF
Sbjct: 541 GSKKLLESLGRGKMWGVF 558

BLAST of ClCG01G009870 vs. TrEMBL
Match: A0A067JIA0_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26389 PE=4 SV=1)

HSP 1 Score: 737.3 bits (1902), Expect = 1.4e-209
Identity = 387/551 (70.24%), Positives = 449/551 (81.49%), Query Frame = 1

Query: 1   MKIPPETDHFLLSNNLISSLLDDIPLINNFKGKWSSIRAKLSDLRTQLIDVSHFPNSSSN 60
           MK+P E D    ++ L+ SLLD+IP +  FKGKW+ IRAKL+DL+TQL D + FP S+SN
Sbjct: 1   MKVP-ENDPINANDQLLQSLLDEIPHVQTFKGKWALIRAKLADLQTQLTDFADFPASTSN 60

Query: 61  PLSLDFLRSVLEALTQAASLSHKCRNPALSDGKLKTQSDIDAVLAKFDSLLKDGEVLIRS 120
           PL LD L S+  +L  A  L+ KCR P  ++GKL+TQSD+D++LAK D  +KD E+LI+S
Sbjct: 61  PLCLDLLHSISNSLNDAVLLARKCRTPNFTEGKLRTQSDVDSILAKLDRHVKDSEILIKS 120

Query: 121 EILHDGVVS-SSSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAA 180
            +L DG  S  SSS+REAVR ESRNLITRLQIGS ES+  A+DSLL LL EDDKNV IA 
Sbjct: 121 GVLQDGATSVGSSSKREAVRVESRNLITRLQIGSSESKNSAMDSLLGLLQEDDKNVMIAV 180

Query: 181 AQGAVPVLVRLLDSSSLELKERAVAAISIVSMVDGVKHVMIAEGLVLLNHLLRILDSGSG 240
           AQG VPVLVRLLDSSSLE+KE+ VAAIS VSMVD  KHV+IAEGL+LLNHLLR+L+SGSG
Sbjct: 181 AQGVVPVLVRLLDSSSLEMKEKTVAAISRVSMVDSSKHVLIAEGLLLLNHLLRVLESGSG 240

Query: 241 FAKEKACLALQPLSISKENARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFSEI 300
           FAKEKAC+ALQ LS SKENAR+IGSRGGISSLLEIC+ GTPGSQA AA VLRNLA F EI
Sbjct: 241 FAKEKACVALQALSFSKENARAIGSRGGISSLLEICQGGTPGSQAFAAGVLRNLAVFEEI 300

Query: 301 KENFIEENGVVVLLGLLASGTPLAQENAIGCLCNLVLDDDNLKLLIVREGGIEFLRNFWD 360
           +ENFIEEN V VL+GL ASGT LAQENAIGCLCNL  DD+NLKLLIV+EGG+E LRNFWD
Sbjct: 301 RENFIEENAVFVLIGLAASGTALAQENAIGCLCNLAKDDENLKLLIVKEGGVECLRNFWD 360

Query: 361 SVPSVRSLEVAVELLSLLASYSPIAEALISDGFVDRLLPVLSCGVLGARTAAARAVYELG 420
           S P VRSLEVAV+LL  LAS   IAE L+SDGFV RL+  L+CGVLG R A A A+YELG
Sbjct: 361 SGPPVRSLEVAVDLLRNLASNQAIAEVLVSDGFVSRLMVFLNCGVLGVRIATAEAIYELG 420

Query: 421 FCTKTRKEMGESGFITPLINMLDGKSVDEKKAAAKALSSLLQYSGNRRIFQKEERGIVSA 480
           F TKTRKEMGE   I PLINMLDGK+V EK+AAAKALS LL Y+GNR+ F+K+ERGIV  
Sbjct: 421 FNTKTRKEMGECEVIVPLINMLDGKAVVEKEAAAKALSHLLLYAGNRKTFRKDERGIVYT 480

Query: 481 VQLLDPSISNLDKKYPVSLLSSVVISSKCRKQMVAAGAGLYLQKLVEMNVEGSKKLLESL 540
           VQLLDPSI NLDKKYPVS+L+S+V S KCRKQM+ AGA ++L+ L EM +EG+KKLL+ L
Sbjct: 481 VQLLDPSIQNLDKKYPVSILASLVQSKKCRKQMIGAGACVHLKTLAEMEIEGAKKLLDGL 540

Query: 541 GRGKIWGVFVR 551
           GRGKIWGVF R
Sbjct: 541 GRGKIWGVFAR 550

BLAST of ClCG01G009870 vs. TrEMBL
Match: M5WBC9_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003699mg PE=4 SV=1)

HSP 1 Score: 723.4 bits (1866), Expect = 2.1e-205
Identity = 384/552 (69.57%), Positives = 452/552 (81.88%), Query Frame = 1

Query: 5   PETDHFLLSNNLISSLLDDIPLINNFKGKWSSIRAKLSDLRTQLIDVSHFPNSSSNPLSL 64
           PETD   +S+ L+SSL++ IP I NFKGKW+ IRAKLS+L+ QL D + FP  +S+PLSL
Sbjct: 4   PETDPMAVSSQLLSSLVEQIPFIQNFKGKWALIRAKLSELQAQLTDFADFPTYTSHPLSL 63

Query: 65  DFLRSVLEALTQAASLSHKCRNPALSDGKLKTQSDIDAVLAKFDSLLKDGEVLIRSEILH 124
             L SV + L  A SLS KC+ P LS GKL+TQSD+D++LA+    + D E+LI+S +L 
Sbjct: 64  HLLLSVSQTLADAVSLSQKCQTPNLSAGKLRTQSDVDSILARLHRHVTDAEILIKSGVLL 123

Query: 125 DGVV---SSSSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAAQ 184
           D  V   SSS+S+RE VRAE RNL+TRLQIGS ESR  A++SLL +L EDDKNV IA AQ
Sbjct: 124 DPAVSSVSSSASKRETVRAECRNLVTRLQIGSGESRNSAMESLLGILQEDDKNVMIAVAQ 183

Query: 185 GAVPVLVRLLDSSSLELKERAVAAISIVSMVDGVKHVMIAEGLVLLNHLLRILDSGSGFA 244
           G VPVLVRLLDSSS E KE AV AISI+SMV+  KHV+IAEGL LLNHL+R+LDSGSGF 
Sbjct: 184 GIVPVLVRLLDSSSFETKENAVFAISIISMVESSKHVLIAEGLSLLNHLMRVLDSGSGFG 243

Query: 245 KEKACLALQPLSISKENARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFSEIKE 304
           KEKACLALQ LS SKENAR+IGS GG+SSLL+IC+AGTPGSQASAA VLRNLA FSE +E
Sbjct: 244 KEKACLALQALSFSKENARAIGSGGGVSSLLDICQAGTPGSQASAAGVLRNLAGFSENQE 303

Query: 305 NFIEENGVVVLLGLLASGTPLAQENAIGCLCNLVLDDDNLKLLIVREGGIEFLRNFWDSV 364
           NF+EENGV VLL L +SGT LAQENAIGCLC+L+   ++LKLL+V+EGGIE LRNFWDS 
Sbjct: 304 NFVEENGVGVLLALASSGTALAQENAIGCLCHLLSGSESLKLLVVKEGGIECLRNFWDSC 363

Query: 365 --PSVRSLEVAVELLSLLASYSPIAEALISDGFVDRLLPVLSCGVLGARTAAARAVYELG 424
              + R LEVAVELL  LAS SPIAE L+S+GFV RL+ VLSCG+LG R AAA+A YELG
Sbjct: 364 WNNNTRGLEVAVELLRHLASCSPIAEVLVSNGFVARLVGVLSCGILGVRIAAAKAAYELG 423

Query: 425 FCTKTRKEMGESGFITPLINMLDGKSVDEKKAAAKALSSLLQYSGNRRIFQKEERGIVSA 484
           FC+KTRKEMGE G I PLI MLDGK+V+EK+AAAKALS+L+ Y+GNR++F+K E GIVS+
Sbjct: 424 FCSKTRKEMGECGCIAPLIKMLDGKAVEEKEAAAKALSTLILYAGNRKLFKKHEGGIVSS 483

Query: 485 VQLLDPSISNLDKKYPVSLLSSVVISSKCRKQMVAAGAGLYLQKLVEMNVEGSKKLLESL 544
           VQLLDPSI NLDKKYPV+LL+S+  S KCRKQMVAAGA L+LQKLV+M VEGSKKLLESL
Sbjct: 484 VQLLDPSIQNLDKKYPVALLASLAHSKKCRKQMVAAGACLHLQKLVDMEVEGSKKLLESL 543

Query: 545 GRGKIWGVFVRS 552
           GRGKIWGVF RS
Sbjct: 544 GRGKIWGVFSRS 555

BLAST of ClCG01G009870 vs. TrEMBL
Match: A0A067DZT1_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g008560mg PE=4 SV=1)

HSP 1 Score: 723.0 bits (1865), Expect = 2.7e-205
Identity = 385/558 (69.00%), Positives = 459/558 (82.26%), Query Frame = 1

Query: 5   PETDHFLLSNNLISSLLDDIPLINNFKGKWSSIRAKLSDLRTQLIDVSHFPNSSSNPLSL 64
           PETD   LS   +SSLLD IPL+ +FKGKW  ++ KL+DL TQL D S FP ++SN L L
Sbjct: 4   PETDPINLSTQHLSSLLDQIPLVKHFKGKWVIVKTKLNDLETQLKDFSDFPAAASNTLCL 63

Query: 65  DFLRSVLEALTQAASLSHKCRNPALSDGKLKTQSDIDAVLAKFD------------SLLK 124
           D + SV   L +AAS++ KC+  +L++GKLKTQSDID+VLAK D             +L+
Sbjct: 64  DHVHSVSHTLIEAASVAQKCQGVSLTEGKLKTQSDIDSVLAKLDRHVRDGDVLIKSGVLQ 123

Query: 125 DGEVLIRSEILHDGVVSSSSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDD 184
           DG+VLI+S +L DGVVSS S +REAVRAESRNLITRLQIGS ES+  A+DSLL LL EDD
Sbjct: 124 DGDVLIKSGVLQDGVVSSGS-KREAVRAESRNLITRLQIGSAESKNSAMDSLLGLLQEDD 183

Query: 185 KNVTIAAAQGAVPVLVRLLDSSSLELKERAVAAISIVSMVDGVKHVMIAEGLVLLNHLLR 244
           KNV IA AQG VPVLV+L+DSSSLE+KE+ VA+I+ VSMVD  KHV+IAEGL+LLNHL+R
Sbjct: 184 KNVVIAVAQGVVPVLVKLMDSSSLEMKEKTVASIARVSMVDSSKHVLIAEGLLLLNHLIR 243

Query: 245 ILDSGSGFAKEKACLALQPLSISKENARSIGSRGGISSLLEICEAGTPGSQASAAAVLRN 304
           +L+SGSGFAKE+AC+ALQ LS SKENAR+IGSRGGISSLLEIC+AGTPGSQA AA VLRN
Sbjct: 244 VLESGSGFAKERACVALQALSFSKENARAIGSRGGISSLLEICQAGTPGSQAFAAGVLRN 303

Query: 305 LASFSEIKENFIEENGVVVLLGLLASGTPLAQENAIGCLCNLVLDDDNLKLLIVREGGIE 364
           LA FSEIKENFIEEN V+VLLGL+ASGT LAQEN  GCLCNLV DD++LKLLIVREGGI 
Sbjct: 304 LAGFSEIKENFIEENAVMVLLGLVASGTALAQENVFGCLCNLVSDDESLKLLIVREGGIG 363

Query: 365 FLRNFWDSVPSVRSLEVAVELLSLLASYSPIAEALISDGFVDRLLPVLSCGVLGARTAAA 424
            L+++WDSV +V+SLEVAVELLS LAS  PIAE L+SDGFV RL+ VL+CGVL  R AAA
Sbjct: 364 SLKSYWDSVSAVKSLEVAVELLSQLASCLPIAEVLVSDGFVVRLVNVLNCGVLSVRIAAA 423

Query: 425 RAVYELGFCTKTRKEMGESGFITPLINMLDGKSVDEKKAAAKALSSLLQYSGNRRIFQKE 484
           RAV  LG  +K RKEMGE G I PLI MLDGK+V+EK++AAKALS+L+ Y+GNR+I +K+
Sbjct: 424 RAVSMLGINSKARKEMGECGCIGPLIKMLDGKAVEEKESAAKALSTLMLYAGNRKILRKD 483

Query: 485 ERGIVSAVQLLDPSISNLDKKYPVSLLSSVVISSKCRKQMVAAGAGLYLQKLVEMNVEGS 544
           ERGIV+ VQLLDP I NLDKKYPV++L+++V   KCRKQMVAAGA L+L+KLVEM++EG+
Sbjct: 484 ERGIVTVVQLLDPLIQNLDKKYPVAILAALVHCRKCRKQMVAAGACLHLRKLVEMDIEGA 543

Query: 545 KKLLESLGRGKIWGVFVR 551
            KLLESLGRGKIWGVF R
Sbjct: 544 NKLLESLGRGKIWGVFAR 560

BLAST of ClCG01G009870 vs. TAIR10
Match: AT5G50900.1 (AT5G50900.1 ARM repeat superfamily protein)

HSP 1 Score: 656.0 bits (1691), Expect = 2.1e-188
Identity = 345/554 (62.27%), Positives = 431/554 (77.80%), Query Frame = 1

Query: 1   MKIPPETDHFLLSNNLISSLLDDIPLINNFKGKWSSIRAKLSDLRTQLIDVSHFPNSSSN 60
           M +P   D       +I+SL+D IP + +FK KWSSIRAKL+DL+TQL D S F  SSSN
Sbjct: 1   MTVPNSDDGDRSLTEVITSLIDSIPNLLSFKCKWSSIRAKLADLKTQLSDFSDFAGSSSN 60

Query: 61  PLSLDFLRSVLEALTQAASLSHKCRNPALSDGKLKTQSDIDAVLAKFDSLLKDGEVLIRS 120
            L++D L SV E L  A +++ +C  P L++GKLKTQS++D+V+A+ D  +KD EVLI+S
Sbjct: 61  KLAVDLLVSVRETLNDAVAVAARCEGPDLAEGKLKTQSEVDSVMARLDRHVKDAEVLIKS 120

Query: 121 EILHD-GVVSSS---SSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVT 180
            +L D G+V S    SS++EAVR E+RNL+ RLQIG +ES+  AIDSL++LL EDDKNV 
Sbjct: 121 GLLIDNGIVVSGFSISSKKEAVRLEARNLVIRLQIGGVESKNSAIDSLIELLQEDDKNVM 180

Query: 181 IAAAQGAVPVLVRLLDSSSLELKERAVAAISIVSMVDGVKHVMIAEGLVLLNHLLRILDS 240
           I  AQG VPVLVRLLDS SL +KE+ VA IS +SMV+  KHV+IAEGL LLNHLLR+L+S
Sbjct: 181 ICVAQGVVPVLVRLLDSCSLVMKEKTVAVISRISMVESSKHVLIAEGLSLLNHLLRVLES 240

Query: 241 GSGFAKEKACLALQPLSISKENARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASF 300
           GSGFAKEKAC+ALQ LS+SKENAR+IG RGGISSLLEIC+ G+PGSQA AA VLRNLA F
Sbjct: 241 GSGFAKEKACVALQALSLSKENARAIGCRGGISSLLEICQGGSPGSQAFAAGVLRNLALF 300

Query: 301 SEIKENFIEENGVVVLLGLLASGTPLAQENAIGCLCNLVLDDDNLKLLIVREGGIEFLRN 360
            E KENF+EEN + VL+ +++SGT LAQENA+GCL NL   D++L + +VREGGI+ L++
Sbjct: 301 GETKENFVEENAIFVLISMVSSGTSLAQENAVGCLANLTSGDEDLMISVVREGGIQCLKS 360

Query: 361 FWDSVPSVRSLEVAVELLSLLASYSPIAEALISDGFVDRLLPVLSCGVLGARTAAARAVY 420
           FWDSV SV+SLEV V LL  LA    + E +IS+GF+ RL+PVLSCGVLG R AAA AV 
Sbjct: 361 FWDSVSSVKSLEVGVVLLKNLALCPIVREVVISEGFIPRLVPVLSCGVLGVRIAAAEAVS 420

Query: 421 ELGFCTKTRKEMGESGFITPLINMLDGKSVDEKKAAAKALSSLLQYSGNRRIFQKEERGI 480
            LGF +K+RKEMGESG I PLI+MLDGK+++EK+AA+KALS+LL  + NR+IF+K ++G+
Sbjct: 421 SLGFSSKSRKEMGESGCIVPLIDMLDGKAIEEKEAASKALSTLLVCTSNRKIFKKSDKGV 480

Query: 481 VSAVQLLDPSISNLDKKYPVSLLSSVVISSKCRKQMVAAGAGLYLQKLVEMNVEGSKKLL 540
           VS VQLLDP I  LDK+Y VS L  +V S KCRKQ+VAAGA L+LQKLV+M+ EG+KKL 
Sbjct: 481 VSLVQLLDPKIKKLDKRYTVSALELLVTSKKCRKQVVAAGACLHLQKLVDMDTEGAKKLA 540

Query: 541 ESLGRGKIWGVFVR 551
           E+L R KIWGVF R
Sbjct: 541 ENLSRSKIWGVFTR 554

BLAST of ClCG01G009870 vs. TAIR10
Match: AT2G45720.1 (AT2G45720.1 ARM repeat superfamily protein)

HSP 1 Score: 305.1 bits (780), Expect = 9.0e-83
Identity = 196/544 (36.03%), Positives = 313/544 (57.54%), Query Frame = 1

Query: 8   DHFLLSNNLISSLLDDIPLINNFKGKWSSIRAKLSDLRTQLIDVSHFPNSSSNPLSLDFL 67
           D  L +  L+   L     +  F  +W  I ++L  + T L D+S  P  S + L  + L
Sbjct: 18  DLLLQAQELVPIALSKARTVKGFSSRWRVIISRLEKIPTCLSDLSSHPCFSKHTLCKEQL 77

Query: 68  RSVLEALTQAASLSHKCRNPALSDGKLKTQSDIDAVLAKFDSLLKDGEVLIRSEILHDGV 127
           ++VLE L +   L++ C +    +GKLK QSD+D++ AK D  LKD  +L+++ +L +  
Sbjct: 78  QAVLETLKETIELANVCVSEK-QEGKLKMQSDLDSLSAKIDLSLKDCGLLMKTGVLGEVT 137

Query: 128 VSSSSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAAQGAVPVL 187
              SSS ++      R L+ RLQIG +ES+  A++ L++++ ED+K V  A  +  V  L
Sbjct: 138 KPLSSSTQDLETFSVRELLARLQIGHLESKRKALEQLVEVMKEDEKAVITALGRTNVASL 197

Query: 188 VRLLDSSSLELKERAVAAISIVSMVDGVKHVMIAEGLVLLNHLLRILDSGSGFAKEKACL 247
           V+LL ++S  ++E AV  I  ++   G ++ +I+E    L  L+R+L+SGS  AKEKA +
Sbjct: 198 VQLLTATSPSVRENAVTVICSLAESGGCENWLISENA--LPSLIRLLESGSIVAKEKAVI 257

Query: 248 ALQPLSISKENARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFSEIKENFIEEN 307
           +LQ +SIS E +RSI   GG+  L+EIC+ G   SQ+++A  L+N+++  E+++N  EE 
Sbjct: 258 SLQRMSISSETSRSIVGHGGVGPLIEICKTGDSVSQSASACTLKNISAVPEVRQNLAEEG 317

Query: 308 GVVVLLGLLASGTPL-AQENAIGCLCNLVLDDDNLKLLIVREGGIEFLRNFWDSVPSVRS 367
            V V++ +L  G  L ++E A  CL NL   ++ L+  ++ E GI+ L  + D      S
Sbjct: 318 IVKVMINILNCGILLGSKEYAAECLQNLTSSNETLRRSVISENGIQTLLAYLDGPLPQES 377

Query: 368 LEVAVELLSLLASYSPIAEALISDGFVDRLLPVLSCGVLGARTAAARAVYELGFCTKTRK 427
              A+   +L+ S S      I    +  L+ VL  G +GA+ AAA  +  +    +T++
Sbjct: 378 GVAAIR--NLVGSVSVETYFKI----IPSLVHVLKSGSIGAQQAAASTICRIATSNETKR 437

Query: 428 EMGESGFITPLINMLDGKSVDEKKAAAKALSSLLQYSGNRRIFQKEERGIVSAVQLLDPS 487
            +GESG I  LI ML+ K+   ++ AA+A++SL+    N R  +++E+ + S V LL+PS
Sbjct: 438 MIGESGCIPLLIRMLEAKASGAREVAAQAIASLVTVPRNCREVKRDEKSVTSLVMLLEPS 497

Query: 488 ISNLDKKYPVSLLSSVVISSKCRKQMVAAGAGLYLQKLVEMNVEGSKKLLESLGRGKIWG 547
             N  KKY VS L+++  S KC+K MV+ GA  YL+KL E+ V GSKKLLE + +GK+  
Sbjct: 498 PGNSAKKYAVSGLAALCSSRKCKKLMVSHGAVGYLKKLSELEVPGSKKLLERIEKGKLKS 552

Query: 548 VFVR 551
            F R
Sbjct: 558 FFSR 552

BLAST of ClCG01G009870 vs. TAIR10
Match: AT1G01830.2 (AT1G01830.2 ARM repeat superfamily protein)

HSP 1 Score: 290.0 bits (741), Expect = 3.0e-78
Identity = 197/543 (36.28%), Positives = 313/543 (57.64%), Query Frame = 1

Query: 14  NNLISSLLDDIPLINNFKGKWSSIRAKLSDLRTQLIDVSHFPNSSSNPLSLDFLRSVLEA 73
           N+LI S+L     +  F G+W +I +K+  +   L D+S  P  S N L  + L+SV + 
Sbjct: 40  NSLIPSVLSKAKTVKKFTGRWKTIISKIEQIPACLSDLSSHPCFSKNKLCNEQLQSVAKT 99

Query: 74  LTQAASLSHKCRNPALSDGKLKTQSDIDAVLAKFDSLLKDGEVLIRSEILHDGVVS---S 133
           L++   L+ +C      +GKL+ QSD+D++  K D  L+D  VLI++ +L +  +    S
Sbjct: 100 LSEVIELAEQCSTDKY-EGKLRMQSDLDSLSGKLDLNLRDCGVLIKTGVLGEATLPLYIS 159

Query: 134 SSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAAQGA-VPVLVR 193
           SSS    + +  + L+ RLQIG +ES+  A++SLL  + ED+K V +     A V  LV+
Sbjct: 160 SSSETPKI-SSLKELLARLQIGHLESKHNALESLLGAMQEDEKMVLMPLIGRANVAALVQ 219

Query: 194 LLDSSSLELKERAVAAISIVSMVDGVKHVMIAEGLVLLNHLLRILDSGSGFAKEKACLAL 253
           LL ++S  ++E+AV  IS+++        +I+EG+  L  L+R+++SGS   KEKA +A+
Sbjct: 220 LLTATSTRIREKAVNLISVLAESGHCDEWLISEGV--LPPLVRLIESGSLETKEKAAIAI 279

Query: 254 QPLSISKENARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFSEIKENFIEENGV 313
           Q LS+++ENAR I   GGI+ L+++C+ G   SQA++AA L+N+++ SE+++   EE  +
Sbjct: 280 QRLSMTEENAREIAGHGGITPLIDLCKTGDSVSQAASAAALKNMSAVSELRQLLAEEGII 339

Query: 314 VVLLGLLASGTPL-AQENAIGCLCNLVLDDDNLKLLIVREGGIEFLRNFWDSVPSVRSLE 373
            V + LL  G  L ++E+   CL NL    D L+  IV EGG+  L  + D    +    
Sbjct: 340 RVSIDLLNHGILLGSREHMAECLQNLTAASDALREAIVSEGGVPSLLAYLDG--PLPQQP 399

Query: 374 VAVELLSLLASYSPIAEALISDGFVDRLLPVLSCGVLGARTAAARAVYELGFCTKTRKEM 433
               L +L+ S +P  E  ++   + RL  VL  G LGA+ AAA A+       +T++ +
Sbjct: 400 AVTALRNLIPSVNP--EIWVALNLLPRLRHVLKSGSLGAQQAAASAICRFACSPETKRLV 459

Query: 434 GESGFITPLINMLDGKSVDEKKAAAKALSSLLQYSGNRRIFQKEERGIV-SAVQLLDPSI 493
           GESG I  ++ +L+ KS   ++AAA+A++ L+     RR  +K+ + ++ + V LLD + 
Sbjct: 460 GESGCIPEIVKLLESKSNGCREAAAQAIAGLVAEGRIRRELKKDGKSVLTNLVMLLDSNP 519

Query: 494 SNLDKKYPVSLLSSVVISSKCRKQMVAAGAGLYLQKLVEMNVEGSKKLLESLGRGKIWGV 551
            N  KKY V+ L  +  S K +K MV+ GA  YL+KL EM V G+ KLLE L RGK+   
Sbjct: 520 GNTAKKYAVAGLLGMSGSEKSKKMMVSYGAIGYLKKLSEMEVMGADKLLEKLERGKLRSF 574

BLAST of ClCG01G009870 vs. TAIR10
Match: AT2G05810.1 (AT2G05810.1 ARM repeat superfamily protein)

HSP 1 Score: 234.6 bits (597), Expect = 1.5e-61
Identity = 168/545 (30.83%), Positives = 291/545 (53.39%), Query Frame = 1

Query: 12  LSNNLISSLLDDIPLINNFKGKWSSIRAKLSDLRTQLIDVSHFPNSSSNPLSLDFLRSVL 71
           L  N++S LL     + +F G+W  +R+KL  L + L  +S  P+ S NPL    L S+L
Sbjct: 26  LITNVLSLLLLSSLTVRSFIGRWQILRSKLFTLNSSLSSLSESPHWSQNPLLHTLLPSLL 85

Query: 72  EALTQAASLSHKCRNPALSDGKLKTQSDIDAVLAKFDSLLKDGEVLIRSEILH--DGVVS 131
             L + +SLS +C + + S GKL  QSD+D   +   + + D ++L+RS +LH  + +V 
Sbjct: 86  SNLQRLSSLSDQCSSASFSGGKLLMQSDLDIASSSLSTHISDLDLLLRSGVLHQQNAIVL 145

Query: 132 S---SSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAAQGAVPV 191
           S    +S ++ +    R+L TRLQIG  E +  +++SLLQLL +++K+  I A +G V  
Sbjct: 146 SLPPPTSDKDDIAFFIRDLFTRLQIGGAEFKKKSLESLLQLLTDNEKSARIIAKEGNVGY 205

Query: 192 LVRLLDSSSLEL-KERAVAAISIV--SMVDGVKHVMIAEGLVLLNHLLRILDSGSGFAKE 251
           LV LLD     L +E A+AA+S++  S  D  K V    G   L  LLR+L++GS   K 
Sbjct: 206 LVTLLDLHHHPLIREHALAAVSLLTSSSADSRKTVFEQGG---LGPLLRLLETGSSPFKT 265

Query: 252 KACLALQPLSISKENARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFSEIKENF 311
           +A +A++ ++     A +I + GG++ L+E C +G+   Q   A  + N+A+  EI+   
Sbjct: 266 RAAIAIEAITADPATAWAISAYGGVTVLIEACRSGSKQVQEHIAGAISNIAAVEEIRTTL 325

Query: 312 IEENGVVVLLGLLASGTPLAQENAIGCLCNLVLDDDNLKLLIVRE-GGIEFLRNFWDSVP 371
            EE  + VL+ LL SG+   QE     +  +    +  + LIVRE GG++ L +      
Sbjct: 326 AEEGAIPVLIQLLISGSSSVQEKTANFISLISSSGEYYRDLIVRERGGLQILIHLVQESS 385

Query: 372 SVRSLEVAVELLSLLASYSPIAEALISD-GFVDRLLPVLSCGVLGARTAAARAVYELGFC 431
           +  ++E  +  LS +++   ++  L S   F+ RL  ++  G +  +  +   +  L   
Sbjct: 386 NPDTIEHCLLALSQISAMETVSRVLSSSTRFIIRLGELIKHGNVILQQISTSLLSNLTIS 445

Query: 432 TKTRKEMGESGFITPLINMLDG-KSVDEKKAAAKALSSLLQYSGNRRIFQKEERGIVSAV 491
              ++ + +   ++ LI +++  K    ++AA +A  SLL    NR+   ++E+ ++  V
Sbjct: 446 DGNKRAVADC--LSSLIRLMESPKPAGLQEAATEAAKSLLTVRSNRKELMRDEKSVIRLV 505

Query: 492 QLLDPSISNL-DKKYPVSLLSSVVI--SSKCRKQMVAAGAGLYLQKLVEMNVEGSKKLLE 543
           Q+LDP    + +K+ PV ++++++   S   R +++  GA  YLQ L EM V G+KK ++
Sbjct: 506 QMLDPRNERMNNKELPVMVVTAILSGGSYAARTKLIGLGADRYLQSLEEMEVPGAKKAVQ 565

BLAST of ClCG01G009870 vs. TAIR10
Match: AT1G61350.1 (AT1G61350.1 ARM repeat superfamily protein)

HSP 1 Score: 219.2 bits (557), Expect = 6.5e-57
Identity = 164/547 (29.98%), Positives = 285/547 (52.10%), Query Frame = 1

Query: 17  ISSLLDDIPLINNFKGKWSSIRAKLSDLRTQLIDVSHFPNSSSNPLSLDFLRSVLEALTQ 76
           ISSL+     I +F  KW  IR KL +L + L  + +  NS  +P     + ++L +L  
Sbjct: 21  ISSLISLSHSIKSFNIKWQLIRTKLQELYSGLDSLRNL-NSGFDPSLSSLISAILISLKD 80

Query: 77  AASLSHKCRNPALSDGKLKTQSDIDAVLAKFDSLLKDGEVLIRSEILHDGVV-----SSS 136
              L+ +C N + S GKL  QSD+D +  KFD   ++   +  + IL  G        + 
Sbjct: 81  TYDLATRCVNVSFS-GKLLMQSDLDVMAGKFDGHTRNLSRIYSAGILSHGFAIVVLKPNG 140

Query: 137 SSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAA-QGAVPVLVRL 196
           ++ ++ +R   R+L+TR++IG +E +  A+  L + + EDD+ V I       V VLV  
Sbjct: 141 NACKDDMRFYIRDLLTRMKIGDLEMKKQALVKLNEAMEEDDRYVKILIEISDMVNVLVGF 200

Query: 197 LDSSSLELKERAVAAISIVSMVDGVKHVMIAEGLVLLNHLLRILDSGSGFAKEKACLALQ 256
           LDS  + ++E +  A+  +S     + V+I  G++    L+R+L++G+G  +E +   L 
Sbjct: 201 LDSE-IGIQEESAKAVFFISGFGSYRDVLIRSGVI--GPLVRVLENGNGVGREASARCLM 260

Query: 257 PLSISKENARSIGSRGGISSLLEICEAGTPGSQ--ASAAAVLRNLASFSEIKENFIEENG 316
            L+ + ENA S+ + GG+S+LL+IC     G +   ++  VLRNL    EIK   IEE+ 
Sbjct: 261 KLTENSENAWSVSAHGGVSALLKICSCSDFGGELIGTSCGVLRNLVGVEEIKRFMIEEDH 320

Query: 317 VVV-LLGLLASGTPLAQENAIGCLCNLVLDDDNLKLLIVREGGIEFLRNFW---DSVPSV 376
            V   + L+ S   + Q N+I  L ++   D+  + ++VREGGI+ L +     +S+ S 
Sbjct: 321 TVATFIKLIGSKEEIVQVNSIDLLLSMCCKDEQTRDILVREGGIQELVSVLSDPNSLSSS 380

Query: 377 RSLEVAVELLSLLASYSP-IAEALISDGFVDRLLPVLSCGVLGARTAAARAVYEL-GFCT 436
           +S E+A+  +  L   S     AL+   F+D LL +L  G +  + +A +    L     
Sbjct: 381 KSKEIALRAIDNLCFGSAGCLNALMGCKFLDHLLNLLRNGEISVQESALKVTSRLCSLQE 440

Query: 437 KTRKEMGESGFITPLINMLDGKSVDEKKAAAKALSSLLQYSGNRRIFQKEERGIVSAVQL 496
           + ++ MGE+GF+  L+  LD KS+D ++ A+ AL  L+    NR+ F +++  I   +QL
Sbjct: 441 EVKRIMGEAGFMPELVKFLDAKSIDVREMASVALYCLISVPRNRKKFAQDDFNISYILQL 500

Query: 497 LD------PSISNLDKKYPVSLLSSVVISSKCRKQMVAAGAGLYLQKLVEMNVEGSKKLL 544
           LD       S  + + K+ +S+L S+   +  R+++ ++G    ++KL E     +KKL+
Sbjct: 501 LDHEDGSNVSSDSGNTKFLISILMSLTSCNSARRKIASSGYLKSIEKLAETEGSDAKKLV 560

BLAST of ClCG01G009870 vs. NCBI nr
Match: gi|449455447|ref|XP_004145464.1| (PREDICTED: vacuolar protein 8 [Cucumis sativus])

HSP 1 Score: 1021.1 bits (2639), Expect = 7.0e-295
Identity = 538/551 (97.64%), Positives = 545/551 (98.91%), Query Frame = 1

Query: 1   MKIPPETDHFLLSNNLISSLLDDIPLINNFKGKWSSIRAKLSDLRTQLIDVSHFPNSSSN 60
           MKIPPETDHFLLSNNLISSLLDDIPLI  FKGKWSSIRAKLSDLRTQLIDVSHFPNSSSN
Sbjct: 1   MKIPPETDHFLLSNNLISSLLDDIPLITIFKGKWSSIRAKLSDLRTQLIDVSHFPNSSSN 60

Query: 61  PLSLDFLRSVLEALTQAASLSHKCRNPALSDGKLKTQSDIDAVLAKFDSLLKDGEVLIRS 120
           PLSLDFL SVLEALTQAASLSHKCRNPALSDGKLKTQSDIDA+LAKFDSLLKDGEVLIRS
Sbjct: 61  PLSLDFLHSVLEALTQAASLSHKCRNPALSDGKLKTQSDIDAILAKFDSLLKDGEVLIRS 120

Query: 121 EILHDGVVSSSSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAA 180
           EILHDGVVSSSSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAA
Sbjct: 121 EILHDGVVSSSSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAA 180

Query: 181 QGAVPVLVRLLDSSSLELKERAVAAISIVSMVDGVKHVMIAEGLVLLNHLLRILDSGSGF 240
           QGAVPVLVRLLDSSSLELKERAVAAISIVSMVDGVKH+MIAEGLVLLNHLLRILDSGSGF
Sbjct: 181 QGAVPVLVRLLDSSSLELKERAVAAISIVSMVDGVKHIMIAEGLVLLNHLLRILDSGSGF 240

Query: 241 AKEKACLALQPLSISKENARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFSEIK 300
           AKEKACLALQPLSISKENARSIGSRGGISSLLEICE GTPGSQASAAAVLRNLASFSEIK
Sbjct: 241 AKEKACLALQPLSISKENARSIGSRGGISSLLEICEGGTPGSQASAAAVLRNLASFSEIK 300

Query: 301 ENFIEENGVVVLLGLLASGTPLAQENAIGCLCNLVLDDDNLKLLIVREGGIEFLRNFWDS 360
           ENFIEENGV+VLLGLLASGTPLAQENAIGCLCNLVLDDDNLKLLIVREGGIEFLRNFWDS
Sbjct: 301 ENFIEENGVIVLLGLLASGTPLAQENAIGCLCNLVLDDDNLKLLIVREGGIEFLRNFWDS 360

Query: 361 VPSVRSLEVAVELLSLLASYSPIAEALISDGFVDRLLPVLSCGVLGARTAAARAVYELGF 420
           VPSVRSLEVAVELLSLLASYSPIAEALISDGFVDRLLPVLSCGVLGARTAAARAVYELGF
Sbjct: 361 VPSVRSLEVAVELLSLLASYSPIAEALISDGFVDRLLPVLSCGVLGARTAAARAVYELGF 420

Query: 421 CTKTRKEMGESGFITPLINMLDGKSVDEKKAAAKALSSLLQYSGNRRIFQKEERGIVSAV 480
           CTKTRKEMGESGFITPL+NMLDGKSVDE+KAAAKALSSLLQYSGNR+IFQKEERGIVSAV
Sbjct: 421 CTKTRKEMGESGFITPLVNMLDGKSVDERKAAAKALSSLLQYSGNRKIFQKEERGIVSAV 480

Query: 481 QLLDPSISNLDKKYPVSLLSSVVISSKCRKQMVAAGAGLYLQKLVEMNVEGSKKLLESLG 540
           QLLDPSISNLDKKYPVSLLSSV ISSKCRKQMVAAGAGLYLQKLVE+NVEGSKKLLESLG
Sbjct: 481 QLLDPSISNLDKKYPVSLLSSVAISSKCRKQMVAAGAGLYLQKLVEINVEGSKKLLESLG 540

Query: 541 RGKIWGVFVRS 552
           RGKIWGVF RS
Sbjct: 541 RGKIWGVFARS 551

BLAST of ClCG01G009870 vs. NCBI nr
Match: gi|659118181|ref|XP_008458985.1| (PREDICTED: U-box domain-containing protein 10 [Cucumis melo])

HSP 1 Score: 1016.9 bits (2628), Expect = 1.3e-293
Identity = 535/551 (97.10%), Positives = 544/551 (98.73%), Query Frame = 1

Query: 1   MKIPPETDHFLLSNNLISSLLDDIPLINNFKGKWSSIRAKLSDLRTQLIDVSHFPNSSSN 60
           MKIPP TDHFLLSNNL+SSLLDDIPLI+ FKGKWSSIRAKLSDLRTQLIDVSHFPNSSSN
Sbjct: 1   MKIPPHTDHFLLSNNLLSSLLDDIPLISIFKGKWSSIRAKLSDLRTQLIDVSHFPNSSSN 60

Query: 61  PLSLDFLRSVLEALTQAASLSHKCRNPALSDGKLKTQSDIDAVLAKFDSLLKDGEVLIRS 120
           PLSLDFL SVLEALTQAASLSHKCRNPALSDGKLKTQSDIDA+LAKFDSLLKDGEVLIRS
Sbjct: 61  PLSLDFLHSVLEALTQAASLSHKCRNPALSDGKLKTQSDIDALLAKFDSLLKDGEVLIRS 120

Query: 121 EILHDGVVSSSSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAA 180
           EILHDGVVSSSSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAA
Sbjct: 121 EILHDGVVSSSSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAA 180

Query: 181 QGAVPVLVRLLDSSSLELKERAVAAISIVSMVDGVKHVMIAEGLVLLNHLLRILDSGSGF 240
           QGAVPVLVRLLDSSSLELKERAVAAISIVSMVDGVKH+MIAEGLVLLNHLLRILDSGSGF
Sbjct: 181 QGAVPVLVRLLDSSSLELKERAVAAISIVSMVDGVKHIMIAEGLVLLNHLLRILDSGSGF 240

Query: 241 AKEKACLALQPLSISKENARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFSEIK 300
           AKEKACLALQPLSISKENARSIGSRGGISSLLEICE GTPGSQASAAAVLRNLASFSEIK
Sbjct: 241 AKEKACLALQPLSISKENARSIGSRGGISSLLEICEGGTPGSQASAAAVLRNLASFSEIK 300

Query: 301 ENFIEENGVVVLLGLLASGTPLAQENAIGCLCNLVLDDDNLKLLIVREGGIEFLRNFWDS 360
           ENFIEENGV+VLLGLLASGTPLAQENAIGCLCNLVLDDDNLKLLIVREGGIEFLRNFWDS
Sbjct: 301 ENFIEENGVMVLLGLLASGTPLAQENAIGCLCNLVLDDDNLKLLIVREGGIEFLRNFWDS 360

Query: 361 VPSVRSLEVAVELLSLLASYSPIAEALISDGFVDRLLPVLSCGVLGARTAAARAVYELGF 420
           VPS RSLEVAVELLSLLASYSPIAEALISDGFVDRLLPVLSCGVLGARTAAARAVYELGF
Sbjct: 361 VPSARSLEVAVELLSLLASYSPIAEALISDGFVDRLLPVLSCGVLGARTAAARAVYELGF 420

Query: 421 CTKTRKEMGESGFITPLINMLDGKSVDEKKAAAKALSSLLQYSGNRRIFQKEERGIVSAV 480
           CTKTRKEMGESGFITPL+NMLDGKSVDE+KAAAKALSSLLQYSGNR+IFQKEERGI+SAV
Sbjct: 421 CTKTRKEMGESGFITPLVNMLDGKSVDERKAAAKALSSLLQYSGNRKIFQKEERGIISAV 480

Query: 481 QLLDPSISNLDKKYPVSLLSSVVISSKCRKQMVAAGAGLYLQKLVEMNVEGSKKLLESLG 540
           QLLDPSISNLDKKYPVSLLSSV ISSKCRKQMVAAGAGLYLQKLVEMNVEGSKKLLESLG
Sbjct: 481 QLLDPSISNLDKKYPVSLLSSVAISSKCRKQMVAAGAGLYLQKLVEMNVEGSKKLLESLG 540

Query: 541 RGKIWGVFVRS 552
           RGKIWGVF RS
Sbjct: 541 RGKIWGVFARS 551

BLAST of ClCG01G009870 vs. NCBI nr
Match: gi|590670572|ref|XP_007038093.1| (ARM repeat superfamily protein isoform 1 [Theobroma cacao])

HSP 1 Score: 760.8 bits (1963), Expect = 1.7e-216
Identity = 400/550 (72.73%), Positives = 465/550 (84.55%), Query Frame = 1

Query: 1   MKIPPETDHFLLSNNLISSLLDDIPLINNFKGKWSSIRAKLSDLRTQLIDVSHFPNSSSN 60
           MK+P E D   LSN+L++SL + IP INNFKGKW+ I++KLS L+ QL D S FP SSSN
Sbjct: 1   MKVP-ENDPISLSNHLLASLSEQIPNINNFKGKWALIKSKLSGLQAQLADFSDFPASSSN 60

Query: 61  PLSLDFLRSVLEALTQAASLSHKCRNPALSDGKLKTQSDIDAVLAKFDSLLKDGEVLIRS 120
           PL++D L S+ + L  A SLS KC+   L++GKLKTQSDIDAVLAK D  +KD E+LIRS
Sbjct: 61  PLAVDLLYSITQTLNDAVSLSQKCQLADLTEGKLKTQSDIDAVLAKLDRHIKDSEILIRS 120

Query: 121 EILHDGVVSSSSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAA 180
            +L DG VS+SSS++EAVR ESRNLITRLQIG+ ES+  A+DSLL LL EDDKNV IA A
Sbjct: 121 GVLQDGAVSTSSSKKEAVRVESRNLITRLQIGTTESKNSAMDSLLGLLQEDDKNVMIAVA 180

Query: 181 QGAVPVLVRLLDSSSLELKERAVAAISIVSMVDGVKHVMIAEGLVLLNHLLRILDSGSGF 240
           QG VPVLVRLLDSSSLE+KE+ VAAIS VS V+  KHV+IAEGL+LLNHLLR+L+SGSGF
Sbjct: 181 QGVVPVLVRLLDSSSLEMKEKTVAAISRVSTVESSKHVLIAEGLLLLNHLLRVLESGSGF 240

Query: 241 AKEKACLALQPLSISKENARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFSEIK 300
           AKEKAC+ALQ LS SKENAR+IGSRGGISSLLEIC+AGTPGSQA AA VL+NLAS  EIK
Sbjct: 241 AKEKACIALQALSFSKENARAIGSRGGISSLLEICQAGTPGSQAFAAGVLKNLASVDEIK 300

Query: 301 ENFIEENGVVVLLGLLASGTPLAQENAIGCLCNLVLDDDNLKLLIVREGGIEFLRNFWDS 360
           ENFIEEN V VL+GL ASGT LAQEN+IGCLCNLV DD+NL+LLIV+EGGIE L+NFWDS
Sbjct: 301 ENFIEENAVFVLIGLAASGTALAQENSIGCLCNLVSDDENLRLLIVKEGGIECLKNFWDS 360

Query: 361 VPSVRSLEVAVELLSLLASYSPIAEALISDGFVDRLLPVLSCGVLGARTAAARAVYELGF 420
            P+ +SLEVAVEL+  LAS SPIAEAL++DGFV RL+ VL+CGVLG R AAARAVYELGF
Sbjct: 361 SPNPKSLEVAVELVRRLASCSPIAEALVADGFVARLVAVLNCGVLGVRIAAARAVYELGF 420

Query: 421 CTKTRKEMGESGFITPLINMLDGKSVDEKKAAAKALSSLLQYSGNRRIFQKEERGIVSAV 480
            +KTRKEMGE G    LI M+DGK+V+EK+AAA ALS+L+ Y+GNR++FQK+ERGIV+AV
Sbjct: 421 NSKTRKEMGECGCTVALIKMMDGKAVEEKEAAAMALSTLMLYAGNRKVFQKDERGIVNAV 480

Query: 481 QLLDPSISNLDKKYPVSLLSSVVISSKCRKQMVAAGAGLYLQKLVEMNVEGSKKLLESLG 540
           QLLDP I NLDKKYPV +LS +V S KCRKQMVAAGA +YLQKLVEMNVEG+KKLLESLG
Sbjct: 481 QLLDPLIQNLDKKYPVLILSELVHSKKCRKQMVAAGACVYLQKLVEMNVEGAKKLLESLG 540

Query: 541 RGKIWGVFVR 551
           RGKIWGVF R
Sbjct: 541 RGKIWGVFAR 549

BLAST of ClCG01G009870 vs. NCBI nr
Match: gi|1009130344|ref|XP_015882248.1| (PREDICTED: armadillo segment polarity protein-like [Ziziphus jujuba])

HSP 1 Score: 755.7 bits (1950), Expect = 5.5e-215
Identity = 403/553 (72.88%), Positives = 464/553 (83.91%), Query Frame = 1

Query: 1   MKIPPETDHFLLSNNLISSLLDDIPLINNFKGKWSSIRAKLSDLRTQLIDVSHFPNSSSN 60
           MK+P E D   L+ +L+SSL  +IP ++NFKGKW+ I  KL++L+TQL D S  P S SN
Sbjct: 1   MKVP-ENDPISLATHLVSSLSQEIPTVSNFKGKWALINDKLAELKTQLADFSDCPTSVSN 60

Query: 61  PLSLDFLRSVLEALTQAASLSHKCRNPALSDGKLKTQSDIDAVLAKFDSLLKDGEVLIRS 120
           PLSL+ LRSV   L  A SLS KC++P+LS+GKL+TQSD+D+VLAK D  +KD E+LIRS
Sbjct: 61  PLSLELLRSVSLTLHDAVSLSKKCQSPSLSEGKLRTQSDVDSVLAKLDKHVKDAEILIRS 120

Query: 121 EILHDGVVS--SSSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIA 180
            +L DG VS  SSSS+REAVRAESRNLITRLQIGS E+R  A+DSLL LL EDDKNV IA
Sbjct: 121 GVLQDGTVSGSSSSSKREAVRAESRNLITRLQIGSAEARNSAMDSLLGLLQEDDKNVMIA 180

Query: 181 AAQGAVPVLVRLLDSSSL-ELKERAVAAISIVSMVDGVKHVMIAEGLVLLNHLLRILDSG 240
            AQG VPVLVRL+DSSS  E+KE+ VAAIS VSMVD  KHV+IAEGL+LLNHLLR+LDSG
Sbjct: 181 VAQGIVPVLVRLMDSSSSPEMKEKTVAAISRVSMVDSSKHVLIAEGLLLLNHLLRVLDSG 240

Query: 241 SGFAKEKACLALQPLSISKENARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFS 300
           SGFAKEKAC+ALQ LS SKENAR+IGSRGGISSLLEIC +GTPGSQASAA VLRNLA+FS
Sbjct: 241 SGFAKEKACIALQALSFSKENARAIGSRGGISSLLEICNSGTPGSQASAAGVLRNLAAFS 300

Query: 301 EIKENFIEENGVVVLLGLLASGTPLAQENAIGCLCNLVLDDDNLKLLIVREGGIEFLRNF 360
           E KENFIEENGV VLLGL   GT LAQENAIGCLCNLV +DD+LKLL+ +EGGIE L+NF
Sbjct: 301 ENKENFIEENGVFVLLGLAGLGTVLAQENAIGCLCNLVCEDDHLKLLVAKEGGIECLKNF 360

Query: 361 WDSVPSVRSLEVAVELLSLLASYSPIAEALISDGFVDRLLPVLSCGVLGARTAAARAVYE 420
           WDS  SVRSLEVAV+LL  LAS  PIAE L+SDGFV R+  VL+CGVLG R AAARAVYE
Sbjct: 361 WDSASSVRSLEVAVDLLRHLASRQPIAEVLVSDGFVTRIAGVLNCGVLGVRIAAARAVYE 420

Query: 421 LGFCTKTRKEMGESGFITPLINMLDGKSVDEKKAAAKALSSLLQYSGNRRIFQKEERGIV 480
           LGFCTKTRKEMGE G I  LI MLDGK+V+EK++AAKALS+L+ ++GNRRIF+K+ +GI+
Sbjct: 421 LGFCTKTRKEMGECGCIASLIKMLDGKAVEEKESAAKALSNLMLFTGNRRIFRKDAKGIL 480

Query: 481 SAVQLLDPSISNLDKKYPVSLLSSVVISSKCRKQMVAAGAGLYLQKLVEMNVEGSKKLLE 540
            AVQLLDPSI NLDKKYPVS+L+S+V S KCRKQMVA+GA  YLQKLVE +VEGSKKLLE
Sbjct: 481 CAVQLLDPSIQNLDKKYPVSVLASLVHSKKCRKQMVASGACAYLQKLVEADVEGSKKLLE 540

Query: 541 SLGRGKIWGVFVR 551
           SLGRGKIWGVF R
Sbjct: 541 SLGRGKIWGVFAR 552

BLAST of ClCG01G009870 vs. NCBI nr
Match: gi|703089949|ref|XP_010093946.1| (U-box domain-containing protein 11 [Morus notabilis])

HSP 1 Score: 742.3 bits (1915), Expect = 6.2e-211
Identity = 398/558 (71.33%), Positives = 469/558 (84.05%), Query Frame = 1

Query: 1   MKIPPE-TDHFLLSNNLISSLLDDIPLINNFKGKWSSIRAKLSDLRTQLIDVSHFPNSSS 60
           MK P E  D   +S  L+SSL+D+I L+  FKGKWS IRAKL DLR QL D +  P+++S
Sbjct: 1   MKAPEEEADTTAISTELLSSLMDEILLVQTFKGKWSLIRAKLDDLRPQLADFADSPDAAS 60

Query: 61  NPLSLDFLRSVLEALTQAASLSHKCRNPALSDGKLKTQSDIDAVLAKFDSLLKDGEVLIR 120
           NPLS+D LRSV  AL+ A S++ +C++P+L+DGKL+TQSD+DAVLA+ D +++DGE+L+R
Sbjct: 61  NPLSIDLLRSVAAALSDAISVARRCQSPSLADGKLRTQSDVDAVLARLDRVVRDGEILLR 120

Query: 121 SEILHDG---VVSSS-----SSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNED 180
           S +L D    VVS+S     SSRREAVRAESRNLITRLQIG+ ESR  A+DSLL LL ED
Sbjct: 121 SGVLSDNNRAVVSNSGNSGSSSRREAVRAESRNLITRLQIGTPESRNSAMDSLLGLLRED 180

Query: 181 DKNVTIAAAQGAVPVLVRLLDS-SSLELKERAVAAISIVSMVDGVKHVMIAEGLVLLNHL 240
           DKNV IA AQG VPV VRLLDS SS+E+KE+ VAAIS VSMVD  KHV+IAEGL+LLNHL
Sbjct: 181 DKNVMIAVAQGVVPVFVRLLDSSSSVEMKEKTVAAISRVSMVDSSKHVLIAEGLLLLNHL 240

Query: 241 LRILDSGSGFAKEKACLALQPLSISKENARSIGSRGGISSLLEICEAGTPGSQASAAAVL 300
           LR+LDSGSGF+KEKAC+ALQ LS SKENAR+IGSRGGISSLLEIC+AGTP SQASAA VL
Sbjct: 241 LRVLDSGSGFSKEKACVALQALSFSKENARAIGSRGGISSLLEICQAGTPCSQASAAGVL 300

Query: 301 RNLASFSEIKENFIEENGVVVLLGLLASGTPLAQENAIGCLCNLVLDDDNLKLLIVREGG 360
           RNLA+F+EIKENFIEENG+ VLLGL +SGT LAQENAIGCLCNL+  D+NLKLL+V+EGG
Sbjct: 301 RNLAAFAEIKENFIEENGIAVLLGLTSSGTALAQENAIGCLCNLISGDENLKLLVVKEGG 360

Query: 361 IEFLRNFWDSVPSVRSLEVAVELLSLLASYSPIAEALISDGFVDRLLPVLSCGVLGARTA 420
           IE L+NFWDS PSVRSLEVAV+LLS LAS  P+AEAL SDGFV RL+ VL+CGVLG R A
Sbjct: 361 IECLKNFWDSAPSVRSLEVAVDLLSHLASLLPVAEALCSDGFVARLVSVLNCGVLGVRIA 420

Query: 421 AARAVYELGFCTKTRKEMGESGFITPLINMLDGKSVDEKKAAAKALSSLLQYSGNRRIFQ 480
           AARAV ELG  ++TRKEMGE G I PLI MLDGK+V EK+AAAKALS L+  + NR+IF+
Sbjct: 421 AARAVSELGSSSRTRKEMGECGCIGPLIKMLDGKAVQEKEAAAKALSKLMLCTVNRKIFR 480

Query: 481 KEERGIVSAVQLLDPSISNLDKKYPVSLLSSVVISSKCRKQMVAAGAGLYLQKLVEMNVE 540
           ++E+GIVSAVQLLDPS+ NLDKKYPVS+L+S+  S KCRKQMVAAGA  YLQK+VEM+VE
Sbjct: 481 RDEKGIVSAVQLLDPSLRNLDKKYPVSVLASLSHSKKCRKQMVAAGACAYLQKVVEMDVE 540

Query: 541 GSKKLLESLGRGKIWGVF 549
           GSKKLLESLGRGK+WGVF
Sbjct: 541 GSKKLLESLGRGKMWGVF 558
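
The percentages on each HSP summary line follow directly from the two counts printed beside them: for the hit above, 398/558 gives 71.33% identity and 469/558 gives 84.05% positives. The following is a minimal Python sketch for reading such a line and recomputing both values. The function name `parse_hsp_summary` and the returned dictionary keys are hypothetical, chosen only for illustration; they are not part of BLAST or of this database. The optional `i` in the pattern tolerates a "Postives" misspelling of the label, should one occur.

```python
import re

# Hypothetical helper for illustration; not part of BLAST or this database.
# Matches the "Identity = a/b (p%), Positives = c/d (q%)" summary line.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Return the counts on an HSP summary line plus recomputed percentages."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identities, length, _, positives, pos_length, _ = m.groups()
    identities, length = int(identities), int(length)
    positives, pos_length = int(positives), int(pos_length)
    return {
        "identities": identities,
        "positives": positives,
        "alignment_length": length,
        # Percentages are recomputed from the counts, rounded to 2 decimals.
        "identity_pct": round(100.0 * identities / length, 2),
        "positive_pct": round(100.0 * positives / pos_length, 2),
    }


if __name__ == "__main__":
    example = "Identity = 398/558 (71.33%), Positives = 469/558 (84.05%), Query Frame = 1"
    print(parse_hsp_summary(example))
```

Recomputing the percentages from the counts, rather than trusting the printed figures, gives a quick consistency check when report lines have been copied or reformatted.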

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
PUB4_ARATH	3.9e-19	25.91	U-box domain-containing protein 4 OS=Arabidopsis thaliana GN=PUB4 PE=1 SV=3
PUB11_ARATH	3.6e-17	26.01	U-box domain-containing protein 11 OS=Arabidopsis thaliana GN=PUB11 PE=2 SV=2
PUB10_ARATH	1.2e-15	26.80	U-box domain-containing protein 10 OS=Arabidopsis thaliana GN=PUB10 PE=2 SV=1
PUB15_ARATH	2.2e-14	29.83	U-box domain-containing protein 15 OS=Arabidopsis thaliana GN=PUB15 PE=2 SV=2
VAC8_ASPOR	4.1e-13	22.46	Vacuolar protein 8 OS=Aspergillus oryzae (strain ATCC 42149 / RIB 40) GN=vac8 PE...

Match Name	E-value	Identity	Description
A0A061G6D5_THECC	1.2e-216	72.73	ARM repeat superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_014724 PE=4 S...
W9QV83_9ROSA	4.3e-211	71.33	U-box domain-containing protein 11 OS=Morus notabilis GN=L484_008839 PE=4 SV=1
A0A067JIA0_JATCU	1.4e-209	70.24	Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26389 PE=4 SV=1
M5WBC9_PRUPE	2.1e-205	69.57	Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003699mg PE=4 SV=1
A0A067DZT1_CITSI	2.7e-205	69.00	Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g008560mg PE=4 SV=1

Match Name	E-value	Identity	Description
AT5G50900.1	2.1e-188	62.27	ARM repeat superfamily protein
AT2G45720.1	9.0e-83	36.03	ARM repeat superfamily protein
AT1G01830.2	3.0e-78	36.28	ARM repeat superfamily protein
AT2G05810.1	1.5e-61	30.83	ARM repeat superfamily protein
AT1G61350.1	6.5e-57	29.98	ARM repeat superfamily protein

Match Name	E-value	Identity	Description
gi|449455447|ref|XP_004145464.1|	7.0e-295	97.64	PREDICTED: vacuolar protein 8 [Cucumis sativus]
gi|659118181|ref|XP_008458985.1|	1.3e-293	97.10	PREDICTED: U-box domain-containing protein 10 [Cucumis melo]
gi|590670572|ref|XP_007038093.1|	1.7e-216	72.73	ARM repeat superfamily protein isoform 1 [Theobroma cacao]
gi|1009130344|ref|XP_015882248.1|	5.5e-215	72.88	PREDICTED: armadillo segment polarity protein-like [Ziziphus jujuba]
gi|703089949|ref|XP_010093946.1|	6.2e-211	71.33	U-box domain-containing protein 11 [Morus notabilis]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR000225	Armadillo
IPR011989	ARM-like
IPR016024	ARM-type_fold
Vocabulary: Molecular Function
Term	Definition
GO:0005515	protein binding
GO:0005488	binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0016020 membrane
molecular_function GO:0005515 protein binding
molecular_function GO:0005488 binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
ClCG01G009870.1	ClCG01G009870.1	mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR000225	Armadillo	PFAM	PF00514	Arm	coord: 178..207, score: 6.
IPR000225	Armadillo	SMART	SM00185	arm_5	coord: 421..461, score: 4.8; coord: 255..295, score: 24.0; coord: 171..211, score: 0.1; coord: 380..420, score: 66.0; coord: 296..336, score: 7.0; coord: 337..379, score:
IPR000225	Armadillo	PROFILE	PS50176	ARM_REPEAT	coord: 182..210, score: 8
IPR011989	Armadillo-like helical	GENE3D	G3DSA:1.25.10.10		coord: 128..534, score: 2.6
IPR016024	Armadillo-type fold	unknown	SSF48371	ARM repeat	coord: 144..489, score: 7.68
None	No IPR available	PANTHER	PTHR23315	BETA CATENIN-RELATED ARMADILLO REPEAT-CONTAINING	coord: 91..541, score: 2.8E-271; coord: 12..34, score: 2.8E
None	No IPR available	PANTHER	PTHR23315:SF86	ARMADILLO/BETA-CATENIN-LIKE REPEAT-CONTAINING PROTEIN	coord: 91..541, score: 2.8E-271; coord: 12..34, score: 2.8E
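
The Alignment column reports each domain hit as a `coord: start..end` range on the protein sequence. The following is a minimal Python sketch for extracting those ranges and their lengths from a row. It assumes the coordinates are 1-based and inclusive, which this page does not state, and the function name `domain_spans` is hypothetical.

```python
import re

# Matches "coord: 178..207" wherever it occurs, even when fused to other text.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def domain_spans(text):
    """Return (start, end, length) for every 'coord: a..b' range in text.

    Assumption: coordinates are 1-based and inclusive, so the length
    is end - start + 1.
    """
    spans = []
    for start, end in COORD_RE.findall(text):
        start, end = int(start), int(end)
        spans.append((start, end, end - start + 1))
    return spans


if __name__ == "__main__":
    row = "coord: 421..461, score: 4.8; coord: 255..295, score: 24.0"
    for start, end, length in domain_spans(row):
        print(start, end, length)
```

Scores are deliberately ignored here, because several of them are truncated in the listing above.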