Cp4.1LG09g09440 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG09g09440
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Armadillo/beta-catenin repeat family protein
Location: Cp4.1LG09 : 8468524 .. 8470179 (+)
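
As a quick sanity check on the location, the span of the feature can be computed from its coordinates. This is a minimal sketch that assumes the start and end are 1-based and inclusive, which is the usual convention for this kind of record but is not stated on the page; `feature_length` is a hypothetical helper name.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must be >= start")
    return end - start + 1


# Coordinates copied from the Location line of this record.
start, end = 8468524, 8470179
length = feature_length(start, end)

print(length)             # 1656
print(divmod(length, 3))  # (552, 0): a whole number of codons
```

Under that assumption the feature spans 1656 bp, which divides evenly into 552 codons.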
The following sequences are available for this feature:

Gene sequence (with intron)

ATGACAATACCGCCGGAGAACGACCAGTTTGCTCTTTCAAACGACCGGATTTCTTCTCTTCTTGATGATATTCCGCTCATCAACAATTTCAAGGGGAAATGGTCTTCAATCAGAGCGAAACTCTCCGATCTTCGCACACAGTTGATAGATGTTTCTGAGTTTCCGAATTCGTCTTCCAATCCTCTTTCTGTTGATTTTCTTCATTCTGTTTTGGAAGTTCTTAGTGAGGCGGCTTCTCTCTCGGAGAAGTGCCGGAATCCGGAACTTTCCGATGGTAAACTCAAGACTCAGAGCGATATTGACTCGGTTCTTGCGAAGTTGGACTCTCTACTTAAAGATGGTGAGGTTTTGATTCGGAGTGAGATTCTTCATGATGGTGCGGTTTCGAGTTCGTCGTCTAGGAGGGAGGCTGTGCGGGCGGAGTCGAGGAATTTAATCACGAGGTTGCAGATTGGGAGCATTGAATCGAGGGTATTGGCTATTGAATCGCTGTTGAAGTTGTTGAATGAGGATGATAAGAATGTAACGATTGCTGCGGCTCAAGGGATTGTTCCTGTTCTTGTTCGGCTACTGGATTCCAGTTCTTTAGAACTGAAGGAACGGGCTGTTGCTGCTATTTCCATCGTTTCATCGGTGGATGGTGTTAGGCATTTAATGATTGCTGAAGGTCTACAGCTTTTGAATCACTTGCTGAGGATTCTCGATTCTGGTAGCGGTTTTGCTAAGGAGAAGGCCTGTTTAACCCTCCAACCTTTAAGCATTTCGAAAGAAAATGCTAGATCAATCGGTTCTAGAGGAGGAATTTCATCTCTATTGGAGATTTGTGAGGCCGGCACTCCCGGTTCTCAAGCCTCAGCAGCTGCTGTTCTTAGAAATCTTGCATCATTTGGTGAAATTAAAGAGAATTTCATTGAAGAAAATGGGGTTATCGTTCTTTTGGGGCTTTTAGCCTCGGGAACTCCGTTAGCTCAAGAAAATGCAATTGGATGTCTATGTAATTTAGTTGCAGAAGATGATAATCTGAAGCTCTTGATCGTTAGAGAAGGTGGGATCGAGCTTTTGAGAAGTTTCTGGGATTCAGCTCCATCAGTTCATAGTCTCGAAGTGGCTGTGGAGCTTTTGGGGCTCTTGGCTTCATCTGCCCCGATCGCCGAAGCTCTTATCTCAAACGGGTTTGTTGATCGGCTTCTTCCAGCTCTAAGCTGCGGAGTATTAGGCGTTAGAACTGCCGCAGCTCGAGCAGTTTACGAGCTCGGATTCTGCACAAAAACAAGAAAAGATATGGGGGAAGCTGGATTCATCACACCCTTGGTTAATATGCTAGATGGCAAGTCTGTTGAGGAGAAAGAAGCAGCTGCCAAGGCATTGTCTTCTCTATTACAATACACAGGTAACAGAAAAATTTTCCAGAAAGAAGAGAAAGGGATAGTAAGTGCAGTCCAACTCTTAGACCCTTCAATATCAAATCTAGACAAGAAGTACCCTGTTTCATTATTAGCCTCGGTTATGATATCAAGCAAGTGTAGAAAGCAGATGGCTGCTGCTGGTGCTGGTTTATATCTTCAAAAGCTTGTTGAAATGAATGTTGATGGATCAAAGAAACTATTGGAATGTCTTGGCCGAGGTAAAATCTGGGGTGTCTTTGCCAGATCTTAG

mRNA sequence

ATGACAATACCGCCGGAGAACGACCAGTTTGCTCTTTCAAACGACCGGATTTCTTCTCTTCTTGATGATATTCCGCTCATCAACAATTTCAAGGGGAAATGGTCTTCAATCAGAGCGAAACTCTCCGATCTTCGCACACAGTTGATAGATGTTTCTGAGTTTCCGAATTCGTCTTCCAATCCTCTTTCTGTTGATTTTCTTCATTCTGTTTTGGAAGTTCTTAGTGAGGCGGCTTCTCTCTCGGAGAAGTGCCGGAATCCGGAACTTTCCGATGGTAAACTCAAGACTCAGAGCGATATTGACTCGGTTCTTGCGAAGTTGGACTCTCTACTTAAAGATGGTGAGGTTTTGATTCGGAGTGAGATTCTTCATGATGGTGCGGTTTCGAGTTCGTCGTCTAGGAGGGAGGCTGTGCGGGCGGAGTCGAGGAATTTAATCACGAGGTTGCAGATTGGGAGCATTGAATCGAGGGTATTGGCTATTGAATCGCTGTTGAAGTTGTTGAATGAGGATGATAAGAATGTAACGATTGCTGCGGCTCAAGGGATTGTTCCTGTTCTTGTTCGGCTACTGGATTCCAGTTCTTTAGAACTGAAGGAACGGGCTGTTGCTGCTATTTCCATCGTTTCATCGGTGGATGGTGTTAGGCATTTAATGATTGCTGAAGGTCTACAGCTTTTGAATCACTTGCTGAGGATTCTCGATTCTGGTAGCGGTTTTGCTAAGGAGAAGGCCTGTTTAACCCTCCAACCTTTAAGCATTTCGAAAGAAAATGCTAGATCAATCGGTTCTAGAGGAGGAATTTCATCTCTATTGGAGATTTGTGAGGCCGGCACTCCCGGTTCTCAAGCCTCAGCAGCTGCTGTTCTTAGAAATCTTGCATCATTTGGTGAAATTAAAGAGAATTTCATTGAAGAAAATGGGGTTATCGTTCTTTTGGGGCTTTTAGCCTCGGGAACTCCGTTAGCTCAAGAAAATGCAATTGGATGTCTATGTAATTTAGTTGCAGAAGATGATAATCTGAAGCTCTTGATCGTTAGAGAAGGTGGGATCGAGCTTTTGAGAAGTTTCTGGGATTCAGCTCCATCAGTTCATAGTCTCGAAGTGGCTGTGGAGCTTTTGGGGCTCTTGGCTTCATCTGCCCCGATCGCCGAAGCTCTTATCTCAAACGGGTTTGTTGATCGGCTTCTTCCAGCTCTAAGCTGCGGAGTATTAGGCGTTAGAACTGCCGCAGCTCGAGCAGTTTACGAGCTCGGATTCTGCACAAAAACAAGAAAAGATATGGGGGAAGCTGGATTCATCACACCCTTGGTTAATATGCTAGATGGCAAGTCTGTTGAGGAGAAAGAAGCAGCTGCCAAGGCATTGTCTTCTCTATTACAATACACAGGTAACAGAAAAATTTTCCAGAAAGAAGAGAAAGGGATAGTAAGTGCAGTCCAACTCTTAGACCCTTCAATATCAAATCTAGACAAGAAGTACCCTGTTTCATTATTAGCCTCGGTTATGATATCAAGCAAGTGTAGAAAGCAGATGGCTGCTGCTGGTGCTGGTTTATATCTTCAAAAGCTTGTTGAAATGAATGTTGATGGATCAAAGAAACTATTGGAATGTCTTGGCCGAGGTAAAATCTGGGGTGTCTTTGCCAGATCTTAG

Coding sequence (CDS)

ATGACAATACCGCCGGAGAACGACCAGTTTGCTCTTTCAAACGACCGGATTTCTTCTCTTCTTGATGATATTCCGCTCATCAACAATTTCAAGGGGAAATGGTCTTCAATCAGAGCGAAACTCTCCGATCTTCGCACACAGTTGATAGATGTTTCTGAGTTTCCGAATTCGTCTTCCAATCCTCTTTCTGTTGATTTTCTTCATTCTGTTTTGGAAGTTCTTAGTGAGGCGGCTTCTCTCTCGGAGAAGTGCCGGAATCCGGAACTTTCCGATGGTAAACTCAAGACTCAGAGCGATATTGACTCGGTTCTTGCGAAGTTGGACTCTCTACTTAAAGATGGTGAGGTTTTGATTCGGAGTGAGATTCTTCATGATGGTGCGGTTTCGAGTTCGTCGTCTAGGAGGGAGGCTGTGCGGGCGGAGTCGAGGAATTTAATCACGAGGTTGCAGATTGGGAGCATTGAATCGAGGGTATTGGCTATTGAATCGCTGTTGAAGTTGTTGAATGAGGATGATAAGAATGTAACGATTGCTGCGGCTCAAGGGATTGTTCCTGTTCTTGTTCGGCTACTGGATTCCAGTTCTTTAGAACTGAAGGAACGGGCTGTTGCTGCTATTTCCATCGTTTCATCGGTGGATGGTGTTAGGCATTTAATGATTGCTGAAGGTCTACAGCTTTTGAATCACTTGCTGAGGATTCTCGATTCTGGTAGCGGTTTTGCTAAGGAGAAGGCCTGTTTAACCCTCCAACCTTTAAGCATTTCGAAAGAAAATGCTAGATCAATCGGTTCTAGAGGAGGAATTTCATCTCTATTGGAGATTTGTGAGGCCGGCACTCCCGGTTCTCAAGCCTCAGCAGCTGCTGTTCTTAGAAATCTTGCATCATTTGGTGAAATTAAAGAGAATTTCATTGAAGAAAATGGGGTTATCGTTCTTTTGGGGCTTTTAGCCTCGGGAACTCCGTTAGCTCAAGAAAATGCAATTGGATGTCTATGTAATTTAGTTGCAGAAGATGATAATCTGAAGCTCTTGATCGTTAGAGAAGGTGGGATCGAGCTTTTGAGAAGTTTCTGGGATTCAGCTCCATCAGTTCATAGTCTCGAAGTGGCTGTGGAGCTTTTGGGGCTCTTGGCTTCATCTGCCCCGATCGCCGAAGCTCTTATCTCAAACGGGTTTGTTGATCGGCTTCTTCCAGCTCTAAGCTGCGGAGTATTAGGCGTTAGAACTGCCGCAGCTCGAGCAGTTTACGAGCTCGGATTCTGCACAAAAACAAGAAAAGATATGGGGGAAGCTGGATTCATCACACCCTTGGTTAATATGCTAGATGGCAAGTCTGTTGAGGAGAAAGAAGCAGCTGCCAAGGCATTGTCTTCTCTATTACAATACACAGGTAACAGAAAAATTTTCCAGAAAGAAGAGAAAGGGATAGTAAGTGCAGTCCAACTCTTAGACCCTTCAATATCAAATCTAGACAAGAAGTACCCTGTTTCATTATTAGCCTCGGTTATGATATCAAGCAAGTGTAGAAAGCAGATGGCTGCTGCTGGTGCTGGTTTATATCTTCAAAAGCTTGTTGAAATGAATGTTGATGGATCAAAGAAACTATTGGAATGTCTTGGCCGAGGTAAAATCTGGGGTGTCTTTGCCAGATCTTAG
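
The relationship between the coding sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch only: it assumes the standard nuclear genetic code, and `translate` and `CODON_TABLE` are names introduced here for illustration, not part of the database. Only the first ten and last four codons of the CDS above are used as input.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 and last 12 nucleotides of the CDS shown above.
print(translate("ATGACAATACCGCCGGAGAACGACCAGTTT"))  # MTIPPENDQF
print(translate("GCCAGATCTTAG"))                    # ARS*
```

Both fragments agree with the start (MTIPPENDQF) and end (ARS) of the protein sequence listed below.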

Protein sequence

MTIPPENDQFALSNDRISSLLDDIPLINNFKGKWSSIRAKLSDLRTQLIDVSEFPNSSSNPLSVDFLHSVLEVLSEAASLSEKCRNPELSDGKLKTQSDIDSVLAKLDSLLKDGEVLIRSEILHDGAVSSSSSRREAVRAESRNLITRLQIGSIESRVLAIESLLKLLNEDDKNVTIAAAQGIVPVLVRLLDSSSLELKERAVAAISIVSSVDGVRHLMIAEGLQLLNHLLRILDSGSGFAKEKACLTLQPLSISKENARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFGEIKENFIEENGVIVLLGLLASGTPLAQENAIGCLCNLVAEDDNLKLLIVREGGIELLRSFWDSAPSVHSLEVAVELLGLLASSAPIAEALISNGFVDRLLPALSCGVLGVRTAAARAVYELGFCTKTRKDMGEAGFITPLVNMLDGKSVEEKEAAAKALSSLLQYTGNRKIFQKEEKGIVSAVQLLDPSISNLDKKYPVSLLASVMISSKCRKQMAAAGAGLYLQKLVEMNVDGSKKLLECLGRGKIWGVFARS
BLAST of Cp4.1LG09g09440 vs. Swiss-Prot
Match: PUB4_ARATH (U-box domain-containing protein 4 OS=Arabidopsis thaliana GN=PUB4 PE=1 SV=3)

HSP 1 Score: 87.4 bits (215), Expect = 5.2e-16
Identity = 101/385 (26.23%), Positives = 163/385 (42.34%), Query Frame = 1

Query: 32  GKWSSIRAKLSDLRTQLIDVSEFPNSSSNPLSVDFLHSVLEVLSEAASLSEKCRNPELSD 91
           G+ S      S   T  +   EFP + +N  S +  H+     S+A+   E    P  + 
Sbjct: 431 GQTSENHHHRSPSATSTVSNEEFPRADANENSEESAHAT-PYSSDASG--EIRSGPLAAT 490

Query: 92  GKLKTQSDIDSVLAKLDSLLKDGEVLIR-SEILHDGAVS--SSSSRREAVRAES--RNLI 151
               T+ D+     K       G+   R SE L    VS  S+ +RR+    E+  + L+
Sbjct: 491 TSAATRRDLSDFSPKFMDRRTRGQFWRRPSERLGSRIVSAPSNETRRDLSEVETQVKKLV 550

Query: 152 TRLQIGSIESRVLAIESLLKLLNEDDKNVTIAAAQGIVPVLVRLLDSSSLELKERAVAAI 211
             L+  S++++  A   L  L   +  N  +    G + +LV LL S+    +E AV A+
Sbjct: 551 EELKSSSLDTQRQATAELRLLAKHNMDNRIVIGNSGAIVLLVELLYSTDSATQENAVTAL 610

Query: 212 SIVSSVDGVRHLMIAEGLQLLNHLLRILDSGSGFAKEKACLTLQPLSISKENARSIGSRG 271
             +S  D  +  +   G   +  L+ +L++GS  AKE +  TL  LS+ +EN   IG  G
Sbjct: 611 LNLSINDNNKKAIADAGA--IEPLIHVLENGSSEAKENSAATLFSLSVIEENKIKIGQSG 670

Query: 272 GISSLLEICEAGTPGSQASAAAVLRNLASFGEIKENFIEENGVIVLLGLLASGTPLAQEN 331
            I  L+++   GTP  +  AA  L NL+   E K   ++   V  L+ L+     +  + 
Sbjct: 671 AIGPLVDLLGNGTPRGKKDAATALFNLSIHQENKAMIVQSGAVRYLIDLMDPAAGMV-DK 730

Query: 332 AIGCLCNLVAEDDNLKLLIVREGGIELLRSFWDSAPSVHSLEVAVELLGLLASSAPIAEA 391
           A+  L NL    +  +  I +EGGI LL    +   +      A  LL L  +S      
Sbjct: 731 AVAVLANLATIPEG-RNAIGQEGGIPLLVEVVELGSARGKENAAAALLQLSTNSGRFCNM 790

Query: 392 LISNGFVDRLLPALSCGVLGVRTAA 412
           ++  G V  L+     G    R  A
Sbjct: 791 VLQEGAVPPLVALSQSGTPRAREKA 808
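
The percentages on each HSP summary line are simply the matched counts divided by the alignment length. The following sketch re-derives them from the summary line of the hit above; the regular expression and the helper name `check_hsp_stats` are assumptions made for illustration, and the pattern accepts either spelling of "Positives" that may appear in exported reports.

```python
import re

# Matches "Identity = a/b (x%), Positives = c/d (y%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_hsp_stats(line: str) -> dict:
    """Recompute identity/positive percentages from an HSP summary line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, length, ident_pct, pos, pos_length, pos_pct = m.groups()
    return {
        "alignment_length": int(length),
        "identity_pct": round(100 * int(ident) / int(length), 2),
        "positives_pct": round(100 * int(pos) / int(pos_length), 2),
        "reported": (float(ident_pct), float(pos_pct)),
    }


line = "Identity = 101/385 (26.23%), Positives = 163/385 (42.34%), Query Frame = 1"
stats = check_hsp_stats(line)
print(stats["identity_pct"], stats["positives_pct"])  # 26.23 42.34
```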

BLAST of Cp4.1LG09g09440 vs. Swiss-Prot
Match: PUB11_ARATH (U-box domain-containing protein 11 OS=Arabidopsis thaliana GN=PUB11 PE=2 SV=2)

HSP 1 Score: 83.6 bits (205), Expect = 7.5e-15
Identity = 70/273 (25.64%), Positives = 123/273 (45.05%), Query Frame = 1

Query: 143 RNLITRLQIGSIESRVLAIESLLKLLNEDDKNVTIAAAQGIVPVLVRLLDSSSLELKERA 202
           R L+ RL   S E R  A+  +  L      N  + A  G +PVLV LL S  +  +E A
Sbjct: 334 RALVQRLSSRSTEDRRNAVSEIRSLSKRSTDNRILIAEAGAIPVLVNLLTSEDVATQENA 393

Query: 203 VAAISIVSSVDGVRHLMIAEGLQLLNHLLRILDSGSGFAKEKACLTLQPLSISKENARSI 262
           +  +  +S  +  + L++  G   +  ++++L +G+  A+E A  TL  LS++ EN   I
Sbjct: 394 ITCVLNLSIYENNKELIMFAGA--VTSIVQVLRAGTMEARENAAATLFSLSLADENKIII 453

Query: 263 GSRGGISSLLEICEAGTPGSQASAAAVLRNLASFGEIKENFIEENGVIVLLGLLASGTPL 322
           G  G I +L+++ E GTP  +  AA  L NL  +   K   +    V  L+ +L+  T  
Sbjct: 454 GGSGAIPALVDLLENGTPRGKKDAATALFNLCIYHGNKGRAVRAGIVTALVKMLSDSTRH 513

Query: 323 AQENAIGCLCNLVAEDDNLKLLIVREGGIELLRSFWDSAPSVHSLEVAVELLGLLASSAP 382
              +    + +++A + + K  IV+   +  L     +  + +    A  LL L      
Sbjct: 514 RMVDEALTILSVLANNQDAKSAIVKANTLPALIGILQTDQTRNRENAAAILLSLCKRD-- 573

Query: 383 IAEALISNGFVDRLLPALSCGVLGVRTAAARAV 416
             E LI+ G +  ++P +     G      +A+
Sbjct: 574 -TEKLITIGRLGAVVPLMDLSKNGTERGKRKAI 601

BLAST of Cp4.1LG09g09440 vs. Swiss-Prot
Match: PUB10_ARATH (U-box domain-containing protein 10 OS=Arabidopsis thaliana GN=PUB10 PE=2 SV=1)

HSP 1 Score: 76.6 bits (187), Expect = 9.2e-13
Identity = 81/307 (26.38%), Positives = 134/307 (43.65%), Query Frame = 1

Query: 125 DGAVSSSSSRREAVRAESRNLITRLQIGSIESRVLAIESLLKLLNEDDKNVTIAAAQGIV 184
           DG+    S    A+RA    L+ +L   SIE R  A+  +  L      N  + A  G +
Sbjct: 330 DGSFRDLSGDMSAIRA----LVCKLSSQSIEDRRTAVSEIRSLSKRSTDNRILIAEAGAI 389

Query: 185 PVLVRLLDSSS-LELKERAVAAISIVSSVDGVRHLMIAEGLQLLNHLLRILDSGSGFAKE 244
           PVLV+LL S    E +E AV  I  +S  +  + L++  G   +  ++ +L +GS  A+E
Sbjct: 390 PVLVKLLTSDGDTETQENAVTCILNLSIYEHNKELIMLAGA--VTSIVLVLRAGSMEARE 449

Query: 245 KACLTLQPLSISKENARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFGEIKENF 304
            A  TL  LS++ EN   IG+ G I +L+++ + G+   +  AA  L NL  +   K   
Sbjct: 450 NAAATLFSLSLADENKIIIGASGAIMALVDLLQYGSVRGKKDAATALFNLCIYQGNKGRA 509

Query: 305 IEENGVIVLLGLLASGTPLAQENAIGCLCNLVAEDDNLKLLIVREGGI-ELLRSFWDSAP 364
           +    V  L+ +L   +     +    + +++A +   K  I+R   I  L+       P
Sbjct: 510 VRAGIVKPLVKMLTDSSSERMADEALTILSVLASNQVAKTAILRANAIPPLIDCLQKDQP 569

Query: 365 SVHSLEVAVELLGLLASSAPIAEALISNGFVDRLLPALSCGVLGVRTAAARAVYELGFCT 424
                  A+    LL       E LIS G +  ++P +     G   A  +A   L    
Sbjct: 570 RNRENAAAI----LLCLCKRDTEKLISIGRLGAVVPLMELSRDGTERAKRKANSLLELLR 626

Query: 425 KTRKDMG 430
           K+ + +G
Sbjct: 630 KSSRKLG 626

BLAST of Cp4.1LG09g09440 vs. Swiss-Prot
Match: VAC8_ASPOR (Vacuolar protein 8 OS=Aspergillus oryzae (strain ATCC 42149 / RIB 40) GN=vac8 PE=3 SV=1)

HSP 1 Score: 70.1 bits (170), Expect = 8.6e-11
Identity = 75/335 (22.39%), Positives = 145/335 (43.28%), Query Frame = 1

Query: 145 LITRLQIGSIESRVLAIESLLKLLNEDDKNVTIAAAQGIVPVLVRLLDSSSLELKERAVA 204
           LI ++   ++E +  A+  +  L   +D    IA +  + P L+RL  S  + ++  A  
Sbjct: 152 LIRQMMSPNVEVQCNAVGCITNLATHEDNKAKIARSGALGP-LIRLAKSKDMRVQRNATG 211

Query: 205 AISIVSSVDGVRHLMIAEGLQLLNHLLRILDSGSGFAKEKACLTLQPLSISKENARSIGS 264
           A+  ++  D  R  ++  G   +  L+++L S     +      L  +++   N + +  
Sbjct: 212 ALLNMTHSDDNRQQLVNAGA--IPVLVQLLSSSDVDVQYYCTTALSNIAVDASNRKRLAQ 271

Query: 265 RGG--ISSLLEICEAGTPGSQASAAAVLRNLASFGEIKENFIEENGVIVLLGLLASGTPL 324
                + SL+ + ++ TP  Q  AA  LRNLAS  + +   +   G+  LL LL S    
Sbjct: 272 TESRLVQSLVHLMDSSTPKVQCQAALALRNLASDEKYQLEIVRAKGLPPLLRLLQSSYLP 331

Query: 325 AQENAIGCLCNLVAEDDNLKLLIVREGG-----IELLRSFWDSAPSVHSLEVAVELLGLL 384
              +A+ C+ N+     N   +I  + G     ++LL S  +     H++     L  L 
Sbjct: 332 LILSAVACIRNISIHPLNESPII--DAGFLKPLVDLLGSTDNEEIQCHAIST---LRNLA 391

Query: 385 ASSAPIAEALISNGFVDRLLPALSCGVLGVRTAAARAVYELGFCTKTRKDMGEAGFITPL 444
           ASS    E ++  G V +    +    L V++    A+  L    + +  +   G    L
Sbjct: 392 ASSDRNKELVLQAGAVQKCKDLVLKVPLSVQSEMTAAIAVLALSDELKPHLLNLGVFDVL 451

Query: 445 VNMLDGKSVEEKEAAAKALSSLLQYTGNRKIFQKE 473
           + + + +S+E +  +A AL +L    G+  IF ++
Sbjct: 452 IPLTESESIEVQGNSAAALGNLSSKVGDYSIFVRD 478

BLAST of Cp4.1LG09g09440 vs. Swiss-Prot
Match: VAC8_ASPFU (Vacuolar protein 8 OS=Neosartorya fumigata (strain ATCC MYA-4609 / Af293 / CBS 101355 / FGSC A1100) GN=vac8 PE=3 SV=1)

HSP 1 Score: 67.4 bits (163), Expect = 5.6e-10
Identity = 83/373 (22.25%), Positives = 159/373 (42.63%), Query Frame = 1

Query: 117 LIRSEILHDGAVSSSSSRREAVRAESRNLITRL----------QIGSIESRVLAIESLLK 176
           L++S  +     +S++    AV AE++ LI  L             ++E +  A+  +  
Sbjct: 114 LLQSSDIEVQRAASAALGNLAVNAENKVLIVALGGLTPLIRQMMSPNVEVQCNAVGCITN 173

Query: 177 LLNEDDKNVTIAAAQGIVPVLVRLLDSSSLELKERAVAAISIVSSVDGVRHLMIAEGLQL 236
           L   +D    IA +  + P L+RL  S  + ++  A  A+  ++  D  R  ++  G   
Sbjct: 174 LATHEDNKAKIARSGALGP-LIRLAKSKDMRVQRNATGALLNMTHSDDNRQQLVNAGA-- 233

Query: 237 LNHLLRILDSGSGFAKEKACLTLQPLSISKENARSIGSRGG--ISSLLEICEAGTPGSQA 296
           +  L+++L S     +      L  +++   N + +       + SL+ + ++ TP  Q 
Sbjct: 234 IPVLVQLLSSPDVDVQYYCTTALSNIAVDASNRKRLAQTESRLVQSLVHLMDSSTPKVQC 293

Query: 297 SAAAVLRNLASFGEIKENFIEENGVIVLLGLLASGTPLAQENAIGCLCNLVAEDDNLKLL 356
            AA  LRNLAS  + +   +   G+  LL LL S       +A+ C+ N+     N   +
Sbjct: 294 QAALALRNLASDEKYQLEIVRAKGLPPLLRLLQSSYLPLILSAVACIRNISIHPLNESPI 353

Query: 357 IVREGG-----IELLRSFWDSAPSVHSLEVAVELLGLLASSAPIAEALISNGFVDRLLPA 416
           I  + G     ++LL S  +     H++     L  L ASS    E ++  G V +    
Sbjct: 354 I--DAGFLKPLVDLLGSTDNEEIQCHAIST---LRNLAASSDRNKELVLQAGAVQKCKDL 413

Query: 417 LSCGVLGVRTAAARAVYELGFCTKTRKDMGEAGFITPLVNMLDGKSVEEKEAAAKALSSL 473
           +    L V++    A+  L    + +  +   G    L+ + + +S+E +  +A AL +L
Sbjct: 414 VLRVPLSVQSEMTAAIAVLALSDELKPHLLNLGVFDVLIPLTNSESIEVQGNSAAALGNL 473

BLAST of Cp4.1LG09g09440 vs. TrEMBL
Match: A0A0A0M0M7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G699580 PE=4 SV=1)

HSP 1 Score: 937.2 bits (2421), Expect = 9.2e-270
Identity = 497/551 (90.20%), Positives = 521/551 (94.56%), Query Frame = 1

Query: 1   MTIPPENDQFALSNDRISSLLDDIPLINNFKGKWSSIRAKLSDLRTQLIDVSEFPNSSSN 60
           M IPPE D F LSN+ ISSLLDDIPLI  FKGKWSSIRAKLSDLRTQLIDVS FPNSSSN
Sbjct: 1   MKIPPETDHFLLSNNLISSLLDDIPLITIFKGKWSSIRAKLSDLRTQLIDVSHFPNSSSN 60

Query: 61  PLSVDFLHSVLEVLSEAASLSEKCRNPELSDGKLKTQSDIDSVLAKLDSLLKDGEVLIRS 120
           PLS+DFLHSVLE L++AASLS KCRNP LSDGKLKTQSDID++LAK DSLLKDGEVLIRS
Sbjct: 61  PLSLDFLHSVLEALTQAASLSHKCRNPALSDGKLKTQSDIDAILAKFDSLLKDGEVLIRS 120

Query: 121 EILHDGAVSSSSSRREAVRAESRNLITRLQIGSIESRVLAIESLLKLLNEDDKNVTIAAA 180
           EILHDG VSSSSSRREAVRAESRNLITRLQIGSIESRVLAI+SLL+LLNEDDKNVTIAAA
Sbjct: 121 EILHDGVVSSSSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAA 180

Query: 181 QGIVPVLVRLLDSSSLELKERAVAAISIVSSVDGVRHLMIAEGLQLLNHLLRILDSGSGF 240
           QG VPVLVRLLDSSSLELKERAVAAISIVS VDGV+H+MIAEGL LLNHLLRILDSGSGF
Sbjct: 181 QGAVPVLVRLLDSSSLELKERAVAAISIVSMVDGVKHIMIAEGLVLLNHLLRILDSGSGF 240

Query: 241 AKEKACLTLQPLSISKENARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFGEIK 300
           AKEKACL LQPLSISKENARSIGSRGGISSLLEICE GTPGSQASAAAVLRNLASF EIK
Sbjct: 241 AKEKACLALQPLSISKENARSIGSRGGISSLLEICEGGTPGSQASAAAVLRNLASFSEIK 300

Query: 301 ENFIEENGVIVLLGLLASGTPLAQENAIGCLCNLVAEDDNLKLLIVREGGIELLRSFWDS 360
           ENFIEENGVIVLLGLLASGTPLAQENAIGCLCNLV +DDNLKLLIVREGGIE LR+FWDS
Sbjct: 301 ENFIEENGVIVLLGLLASGTPLAQENAIGCLCNLVLDDDNLKLLIVREGGIEFLRNFWDS 360

Query: 361 APSVHSLEVAVELLGLLASSAPIAEALISNGFVDRLLPALSCGVLGVRTAAARAVYELGF 420
            PSV SLEVAVELL LLAS +PIAEALIS+GFVDRLLP LSCGVLG RTAAARAVYELGF
Sbjct: 361 VPSVRSLEVAVELLSLLASYSPIAEALISDGFVDRLLPVLSCGVLGARTAAARAVYELGF 420

Query: 421 CTKTRKDMGEAGFITPLVNMLDGKSVEEKEAAAKALSSLLQYTGNRKIFQKEEKGIVSAV 480
           CTKTRK+MGE+GFITPLVNMLDGKSV+E++AAAKALSSLLQY+GNRKIFQKEE+GIVSAV
Sbjct: 421 CTKTRKEMGESGFITPLVNMLDGKSVDERKAAAKALSSLLQYSGNRKIFQKEERGIVSAV 480

Query: 481 QLLDPSISNLDKKYPVSLLASVMISSKCRKQMAAAGAGLYLQKLVEMNVDGSKKLLECLG 540
           QLLDPSISNLDKKYPVSLL+SV ISSKCRKQM AAGAGLYLQKLVE+NV+GSKKLLE LG
Sbjct: 481 QLLDPSISNLDKKYPVSLLSSVAISSKCRKQMVAAGAGLYLQKLVEINVEGSKKLLESLG 540

Query: 541 RGKIWGVFARS 552
           RGKIWGVFARS
Sbjct: 541 RGKIWGVFARS 551

BLAST of Cp4.1LG09g09440 vs. TrEMBL
Match: A0A061G6D5_THECC (ARM repeat superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_014724 PE=4 SV=1)

HSP 1 Score: 748.0 bits (1930), Expect = 7.9e-213
Identity = 388/546 (71.06%), Positives = 469/546 (85.90%), Query Frame = 1

Query: 5   PENDQFALSNDRISSLLDDIPLINNFKGKWSSIRAKLSDLRTQLIDVSEFPNSSSNPLSV 64
           PEND  +LSN  ++SL + IP INNFKGKW+ I++KLS L+ QL D S+FP SSSNPL+V
Sbjct: 4   PENDPISLSNHLLASLSEQIPNINNFKGKWALIKSKLSGLQAQLADFSDFPASSSNPLAV 63

Query: 65  DFLHSVLEVLSEAASLSEKCRNPELSDGKLKTQSDIDSVLAKLDSLLKDGEVLIRSEILH 124
           D L+S+ + L++A SLS+KC+  +L++GKLKTQSDID+VLAKLD  +KD E+LIRS +L 
Sbjct: 64  DLLYSITQTLNDAVSLSQKCQLADLTEGKLKTQSDIDAVLAKLDRHIKDSEILIRSGVLQ 123

Query: 125 DGAVSSSSSRREAVRAESRNLITRLQIGSIESRVLAIESLLKLLNEDDKNVTIAAAQGIV 184
           DGAVS+SSS++EAVR ESRNLITRLQIG+ ES+  A++SLL LL EDDKNV IA AQG+V
Sbjct: 124 DGAVSTSSSKKEAVRVESRNLITRLQIGTTESKNSAMDSLLGLLQEDDKNVMIAVAQGVV 183

Query: 185 PVLVRLLDSSSLELKERAVAAISIVSSVDGVRHLMIAEGLQLLNHLLRILDSGSGFAKEK 244
           PVLVRLLDSSSLE+KE+ VAAIS VS+V+  +H++IAEGL LLNHLLR+L+SGSGFAKEK
Sbjct: 184 PVLVRLLDSSSLEMKEKTVAAISRVSTVESSKHVLIAEGLLLLNHLLRVLESGSGFAKEK 243

Query: 245 ACLTLQPLSISKENARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFGEIKENFI 304
           AC+ LQ LS SKENAR+IGSRGGISSLLEIC+AGTPGSQA AA VL+NLAS  EIKENFI
Sbjct: 244 ACIALQALSFSKENARAIGSRGGISSLLEICQAGTPGSQAFAAGVLKNLASVDEIKENFI 303

Query: 305 EENGVIVLLGLLASGTPLAQENAIGCLCNLVAEDDNLKLLIVREGGIELLRSFWDSAPSV 364
           EEN V VL+GL ASGT LAQEN+IGCLCNLV++D+NL+LLIV+EGGIE L++FWDS+P+ 
Sbjct: 304 EENAVFVLIGLAASGTALAQENSIGCLCNLVSDDENLRLLIVKEGGIECLKNFWDSSPNP 363

Query: 365 HSLEVAVELLGLLASSAPIAEALISNGFVDRLLPALSCGVLGVRTAAARAVYELGFCTKT 424
            SLEVAVEL+  LAS +PIAEAL+++GFV RL+  L+CGVLGVR AAARAVYELGF +KT
Sbjct: 364 KSLEVAVELVRRLASCSPIAEALVADGFVARLVAVLNCGVLGVRIAAARAVYELGFNSKT 423

Query: 425 RKDMGEAGFITPLVNMLDGKSVEEKEAAAKALSSLLQYTGNRKIFQKEEKGIVSAVQLLD 484
           RK+MGE G    L+ M+DGK+VEEKEAAA ALS+L+ Y GNRK+FQK+E+GIV+AVQLLD
Sbjct: 424 RKEMGECGCTVALIKMMDGKAVEEKEAAAMALSTLMLYAGNRKVFQKDERGIVNAVQLLD 483

Query: 485 PSISNLDKKYPVSLLASVMISSKCRKQMAAAGAGLYLQKLVEMNVDGSKKLLECLGRGKI 544
           P I NLDKKYPV +L+ ++ S KCRKQM AAGA +YLQKLVEMNV+G+KKLLE LGRGKI
Sbjct: 484 PLIQNLDKKYPVLILSELVHSKKCRKQMVAAGACVYLQKLVEMNVEGAKKLLESLGRGKI 543

Query: 545 WGVFAR 551
           WGVFAR
Sbjct: 544 WGVFAR 549

BLAST of Cp4.1LG09g09440 vs. TrEMBL
Match: A0A067JIA0_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26389 PE=4 SV=1)

HSP 1 Score: 723.4 bits (1866), Expect = 2.1e-205
Identity = 379/547 (69.29%), Positives = 449/547 (82.08%), Query Frame = 1

Query: 5   PENDQFALSNDRISSLLDDIPLINNFKGKWSSIRAKLSDLRTQLIDVSEFPNSSSNPLSV 64
           PEND    ++  + SLLD+IP +  FKGKW+ IRAKL+DL+TQL D ++FP S+SNPL +
Sbjct: 4   PENDPINANDQLLQSLLDEIPHVQTFKGKWALIRAKLADLQTQLTDFADFPASTSNPLCL 63

Query: 65  DFLHSVLEVLSEAASLSEKCRNPELSDGKLKTQSDIDSVLAKLDSLLKDGEVLIRSEILH 124
           D LHS+   L++A  L+ KCR P  ++GKL+TQSD+DS+LAKLD  +KD E+LI+S +L 
Sbjct: 64  DLLHSISNSLNDAVLLARKCRTPNFTEGKLRTQSDVDSILAKLDRHVKDSEILIKSGVLQ 123

Query: 125 DGAVS-SSSSRREAVRAESRNLITRLQIGSIESRVLAIESLLKLLNEDDKNVTIAAAQGI 184
           DGA S  SSS+REAVR ESRNLITRLQIGS ES+  A++SLL LL EDDKNV IA AQG+
Sbjct: 124 DGATSVGSSSKREAVRVESRNLITRLQIGSSESKNSAMDSLLGLLQEDDKNVMIAVAQGV 183

Query: 185 VPVLVRLLDSSSLELKERAVAAISIVSSVDGVRHLMIAEGLQLLNHLLRILDSGSGFAKE 244
           VPVLVRLLDSSSLE+KE+ VAAIS VS VD  +H++IAEGL LLNHLLR+L+SGSGFAKE
Sbjct: 184 VPVLVRLLDSSSLEMKEKTVAAISRVSMVDSSKHVLIAEGLLLLNHLLRVLESGSGFAKE 243

Query: 245 KACLTLQPLSISKENARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFGEIKENF 304
           KAC+ LQ LS SKENAR+IGSRGGISSLLEIC+ GTPGSQA AA VLRNLA F EI+ENF
Sbjct: 244 KACVALQALSFSKENARAIGSRGGISSLLEICQGGTPGSQAFAAGVLRNLAVFEEIRENF 303

Query: 305 IEENGVIVLLGLLASGTPLAQENAIGCLCNLVAEDDNLKLLIVREGGIELLRSFWDSAPS 364
           IEEN V VL+GL ASGT LAQENAIGCLCNL  +D+NLKLLIV+EGG+E LR+FWDS P 
Sbjct: 304 IEENAVFVLIGLAASGTALAQENAIGCLCNLAKDDENLKLLIVKEGGVECLRNFWDSGPP 363

Query: 365 VHSLEVAVELLGLLASSAPIAEALISNGFVDRLLPALSCGVLGVRTAAARAVYELGFCTK 424
           V SLEVAV+LL  LAS+  IAE L+S+GFV RL+  L+CGVLGVR A A A+YELGF TK
Sbjct: 364 VRSLEVAVDLLRNLASNQAIAEVLVSDGFVSRLMVFLNCGVLGVRIATAEAIYELGFNTK 423

Query: 425 TRKDMGEAGFITPLVNMLDGKSVEEKEAAAKALSSLLQYTGNRKIFQKEEKGIVSAVQLL 484
           TRK+MGE   I PL+NMLDGK+V EKEAAAKALS LL Y GNRK F+K+E+GIV  VQLL
Sbjct: 424 TRKEMGECEVIVPLINMLDGKAVVEKEAAAKALSHLLLYAGNRKTFRKDERGIVYTVQLL 483

Query: 485 DPSISNLDKKYPVSLLASVMISSKCRKQMAAAGAGLYLQKLVEMNVDGSKKLLECLGRGK 544
           DPSI NLDKKYPVS+LAS++ S KCRKQM  AGA ++L+ L EM ++G+KKLL+ LGRGK
Sbjct: 484 DPSIQNLDKKYPVSILASLVQSKKCRKQMIGAGACVHLKTLAEMEIEGAKKLLDGLGRGK 543

Query: 545 IWGVFAR 551
           IWGVFAR
Sbjct: 544 IWGVFAR 550

BLAST of Cp4.1LG09g09440 vs. TrEMBL
Match: W9QV83_9ROSA (U-box domain-containing protein 11 OS=Morus notabilis GN=L484_008839 PE=4 SV=1)

HSP 1 Score: 721.1 bits (1860), Expect = 1.0e-204
Identity = 386/552 (69.93%), Positives = 462/552 (83.70%), Query Frame = 1

Query: 6   ENDQFALSNDRISSLLDDIPLINNFKGKWSSIRAKLSDLRTQLIDVSEFPNSSSNPLSVD 65
           E D  A+S + +SSL+D+I L+  FKGKWS IRAKL DLR QL D ++ P+++SNPLS+D
Sbjct: 7   EADTTAISTELLSSLMDEILLVQTFKGKWSLIRAKLDDLRPQLADFADSPDAASNPLSID 66

Query: 66  FLHSVLEVLSEAASLSEKCRNPELSDGKLKTQSDIDSVLAKLDSLLKDGEVLIRSEILHD 125
            L SV   LS+A S++ +C++P L+DGKL+TQSD+D+VLA+LD +++DGE+L+RS +L D
Sbjct: 67  LLRSVAAALSDAISVARRCQSPSLADGKLRTQSDVDAVLARLDRVVRDGEILLRSGVLSD 126

Query: 126 G--AV------SSSSSRREAVRAESRNLITRLQIGSIESRVLAIESLLKLLNEDDKNVTI 185
              AV      S SSSRREAVRAESRNLITRLQIG+ ESR  A++SLL LL EDDKNV I
Sbjct: 127 NNRAVVSNSGNSGSSSRREAVRAESRNLITRLQIGTPESRNSAMDSLLGLLREDDKNVMI 186

Query: 186 AAAQGIVPVLVRLLDS-SSLELKERAVAAISIVSSVDGVRHLMIAEGLQLLNHLLRILDS 245
           A AQG+VPV VRLLDS SS+E+KE+ VAAIS VS VD  +H++IAEGL LLNHLLR+LDS
Sbjct: 187 AVAQGVVPVFVRLLDSSSSVEMKEKTVAAISRVSMVDSSKHVLIAEGLLLLNHLLRVLDS 246

Query: 246 GSGFAKEKACLTLQPLSISKENARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASF 305
           GSGF+KEKAC+ LQ LS SKENAR+IGSRGGISSLLEIC+AGTP SQASAA VLRNLA+F
Sbjct: 247 GSGFSKEKACVALQALSFSKENARAIGSRGGISSLLEICQAGTPCSQASAAGVLRNLAAF 306

Query: 306 GEIKENFIEENGVIVLLGLLASGTPLAQENAIGCLCNLVAEDDNLKLLIVREGGIELLRS 365
            EIKENFIEENG+ VLLGL +SGT LAQENAIGCLCNL++ D+NLKLL+V+EGGIE L++
Sbjct: 307 AEIKENFIEENGIAVLLGLTSSGTALAQENAIGCLCNLISGDENLKLLVVKEGGIECLKN 366

Query: 366 FWDSAPSVHSLEVAVELLGLLASSAPIAEALISNGFVDRLLPALSCGVLGVRTAAARAVY 425
           FWDSAPSV SLEVAV+LL  LAS  P+AEAL S+GFV RL+  L+CGVLGVR AAARAV 
Sbjct: 367 FWDSAPSVRSLEVAVDLLSHLASLLPVAEALCSDGFVARLVSVLNCGVLGVRIAAARAVS 426

Query: 426 ELGFCTKTRKDMGEAGFITPLVNMLDGKSVEEKEAAAKALSSLLQYTGNRKIFQKEEKGI 485
           ELG  ++TRK+MGE G I PL+ MLDGK+V+EKEAAAKALS L+  T NRKIF+++EKGI
Sbjct: 427 ELGSSSRTRKEMGECGCIGPLIKMLDGKAVQEKEAAAKALSKLMLCTVNRKIFRRDEKGI 486

Query: 486 VSAVQLLDPSISNLDKKYPVSLLASVMISSKCRKQMAAAGAGLYLQKLVEMNVDGSKKLL 545
           VSAVQLLDPS+ NLDKKYPVS+LAS+  S KCRKQM AAGA  YLQK+VEM+V+GSKKLL
Sbjct: 487 VSAVQLLDPSLRNLDKKYPVSVLASLSHSKKCRKQMVAAGACAYLQKVVEMDVEGSKKLL 546

Query: 546 ECLGRGKIWGVF 549
           E LGRGK+WGVF
Sbjct: 547 ESLGRGKMWGVF 558

BLAST of Cp4.1LG09g09440 vs. TrEMBL
Match: M5WBC9_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003699mg PE=4 SV=1)

HSP 1 Score: 713.4 bits (1840), Expect = 2.2e-202
Identity = 379/552 (68.66%), Positives = 453/552 (82.07%), Query Frame = 1

Query: 5   PENDQFALSNDRISSLLDDIPLINNFKGKWSSIRAKLSDLRTQLIDVSEFPNSSSNPLSV 64
           PE D  A+S+  +SSL++ IP I NFKGKW+ IRAKLS+L+ QL D ++FP  +S+PLS+
Sbjct: 4   PETDPMAVSSQLLSSLVEQIPFIQNFKGKWALIRAKLSELQAQLTDFADFPTYTSHPLSL 63

Query: 65  DFLHSVLEVLSEAASLSEKCRNPELSDGKLKTQSDIDSVLAKLDSLLKDGEVLIRSEILH 124
             L SV + L++A SLS+KC+ P LS GKL+TQSD+DS+LA+L   + D E+LI+S +L 
Sbjct: 64  HLLLSVSQTLADAVSLSQKCQTPNLSAGKLRTQSDVDSILARLHRHVTDAEILIKSGVLL 123

Query: 125 DGAV---SSSSSRREAVRAESRNLITRLQIGSIESRVLAIESLLKLLNEDDKNVTIAAAQ 184
           D AV   SSS+S+RE VRAE RNL+TRLQIGS ESR  A+ESLL +L EDDKNV IA AQ
Sbjct: 124 DPAVSSVSSSASKRETVRAECRNLVTRLQIGSGESRNSAMESLLGILQEDDKNVMIAVAQ 183

Query: 185 GIVPVLVRLLDSSSLELKERAVAAISIVSSVDGVRHLMIAEGLQLLNHLLRILDSGSGFA 244
           GIVPVLVRLLDSSS E KE AV AISI+S V+  +H++IAEGL LLNHL+R+LDSGSGF 
Sbjct: 184 GIVPVLVRLLDSSSFETKENAVFAISIISMVESSKHVLIAEGLSLLNHLMRVLDSGSGFG 243

Query: 245 KEKACLTLQPLSISKENARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFGEIKE 304
           KEKACL LQ LS SKENAR+IGS GG+SSLL+IC+AGTPGSQASAA VLRNLA F E +E
Sbjct: 244 KEKACLALQALSFSKENARAIGSGGGVSSLLDICQAGTPGSQASAAGVLRNLAGFSENQE 303

Query: 305 NFIEENGVIVLLGLLASGTPLAQENAIGCLCNLVAEDDNLKLLIVREGGIELLRSFWDSA 364
           NF+EENGV VLL L +SGT LAQENAIGCLC+L++  ++LKLL+V+EGGIE LR+FWDS 
Sbjct: 304 NFVEENGVGVLLALASSGTALAQENAIGCLCHLLSGSESLKLLVVKEGGIECLRNFWDSC 363

Query: 365 --PSVHSLEVAVELLGLLASSAPIAEALISNGFVDRLLPALSCGVLGVRTAAARAVYELG 424
              +   LEVAVELL  LAS +PIAE L+SNGFV RL+  LSCG+LGVR AAA+A YELG
Sbjct: 364 WNNNTRGLEVAVELLRHLASCSPIAEVLVSNGFVARLVGVLSCGILGVRIAAAKAAYELG 423

Query: 425 FCTKTRKDMGEAGFITPLVNMLDGKSVEEKEAAAKALSSLLQYTGNRKIFQKEEKGIVSA 484
           FC+KTRK+MGE G I PL+ MLDGK+VEEKEAAAKALS+L+ Y GNRK+F+K E GIVS+
Sbjct: 424 FCSKTRKEMGECGCIAPLIKMLDGKAVEEKEAAAKALSTLILYAGNRKLFKKHEGGIVSS 483

Query: 485 VQLLDPSISNLDKKYPVSLLASVMISSKCRKQMAAAGAGLYLQKLVEMNVDGSKKLLECL 544
           VQLLDPSI NLDKKYPV+LLAS+  S KCRKQM AAGA L+LQKLV+M V+GSKKLLE L
Sbjct: 484 VQLLDPSIQNLDKKYPVALLASLAHSKKCRKQMVAAGACLHLQKLVDMEVEGSKKLLESL 543

Query: 545 GRGKIWGVFARS 552
           GRGKIWGVF+RS
Sbjct: 544 GRGKIWGVFSRS 555
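
Each alignment block can also be checked by hand: identities are the columns where the Query and Sbjct residues are the same. A minimal sketch, using the final block of the hit above as input; `count_identities` is a hypothetical helper, and the rule that a gap character never counts as a match is an assumption about how the report was scored.

```python
def count_identities(query: str, subject: str) -> tuple[int, int]:
    """Return (identical columns, alignment length) for two aligned rows."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    # A column is an identity when both residues match and neither is a gap.
    matches = sum(q == s and q != "-" for q, s in zip(query, subject))
    return matches, len(query)


# Final alignment block of the hit above (Query row, then Sbjct row).
print(count_identities("GRGKIWGVFARS", "GRGKIWGVFSRS"))  # (11, 12)
```

Summing these per-block counts over every block of a hit should give the numerator and denominator of its Identity line.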

BLAST of Cp4.1LG09g09440 vs. TAIR10
Match: AT5G50900.1 (AT5G50900.1 ARM repeat superfamily protein)

HSP 1 Score: 651.0 bits (1678), Expect = 6.7e-187
Identity = 342/554 (61.73%), Positives = 434/554 (78.34%), Query Frame = 1

Query: 1   MTIPPENDQFALSNDRISSLLDDIPLINNFKGKWSSIRAKLSDLRTQLIDVSEFPNSSSN 60
           MT+P  +D      + I+SL+D IP + +FK KWSSIRAKL+DL+TQL D S+F  SSSN
Sbjct: 1   MTVPNSDDGDRSLTEVITSLIDSIPNLLSFKCKWSSIRAKLADLKTQLSDFSDFAGSSSN 60

Query: 61  PLSVDFLHSVLEVLSEAASLSEKCRNPELSDGKLKTQSDIDSVLAKLDSLLKDGEVLIRS 120
            L+VD L SV E L++A +++ +C  P+L++GKLKTQS++DSV+A+LD  +KD EVLI+S
Sbjct: 61  KLAVDLLVSVRETLNDAVAVAARCEGPDLAEGKLKTQSEVDSVMARLDRHVKDAEVLIKS 120

Query: 121 EILHDGAVSSS----SSRREAVRAESRNLITRLQIGSIESRVLAIESLLKLLNEDDKNVT 180
            +L D  +  S    SS++EAVR E+RNL+ RLQIG +ES+  AI+SL++LL EDDKNV 
Sbjct: 121 GLLIDNGIVVSGFSISSKKEAVRLEARNLVIRLQIGGVESKNSAIDSLIELLQEDDKNVM 180

Query: 181 IAAAQGIVPVLVRLLDSSSLELKERAVAAISIVSSVDGVRHLMIAEGLQLLNHLLRILDS 240
           I  AQG+VPVLVRLLDS SL +KE+ VA IS +S V+  +H++IAEGL LLNHLLR+L+S
Sbjct: 181 ICVAQGVVPVLVRLLDSCSLVMKEKTVAVISRISMVESSKHVLIAEGLSLLNHLLRVLES 240

Query: 241 GSGFAKEKACLTLQPLSISKENARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASF 300
           GSGFAKEKAC+ LQ LS+SKENAR+IG RGGISSLLEIC+ G+PGSQA AA VLRNLA F
Sbjct: 241 GSGFAKEKACVALQALSLSKENARAIGCRGGISSLLEICQGGSPGSQAFAAGVLRNLALF 300

Query: 301 GEIKENFIEENGVIVLLGLLASGTPLAQENAIGCLCNLVAEDDNLKLLIVREGGIELLRS 360
           GE KENF+EEN + VL+ +++SGT LAQENA+GCL NL + D++L + +VREGGI+ L+S
Sbjct: 301 GETKENFVEENAIFVLISMVSSGTSLAQENAVGCLANLTSGDEDLMISVVREGGIQCLKS 360

Query: 361 FWDSAPSVHSLEVAVELLGLLASSAPIAEALISNGFVDRLLPALSCGVLGVRTAAARAVY 420
           FWDS  SV SLEV V LL  LA    + E +IS GF+ RL+P LSCGVLGVR AAA AV 
Sbjct: 361 FWDSVSSVKSLEVGVVLLKNLALCPIVREVVISEGFIPRLVPVLSCGVLGVRIAAAEAVS 420

Query: 421 ELGFCTKTRKDMGEAGFITPLVNMLDGKSVEEKEAAAKALSSLLQYTGNRKIFQKEEKGI 480
            LGF +K+RK+MGE+G I PL++MLDGK++EEKEAA+KALS+LL  T NRKIF+K +KG+
Sbjct: 421 SLGFSSKSRKEMGESGCIVPLIDMLDGKAIEEKEAASKALSTLLVCTSNRKIFKKSDKGV 480

Query: 481 VSAVQLLDPSISNLDKKYPVSLLASVMISSKCRKQMAAAGAGLYLQKLVEMNVDGSKKLL 540
           VS VQLLDP I  LDK+Y VS L  ++ S KCRKQ+ AAGA L+LQKLV+M+ +G+KKL 
Sbjct: 481 VSLVQLLDPKIKKLDKRYTVSALELLVTSKKCRKQVVAAGACLHLQKLVDMDTEGAKKLA 540

Query: 541 ECLGRGKIWGVFAR 551
           E L R KIWGVF R
Sbjct: 541 ENLSRSKIWGVFTR 554

BLAST of Cp4.1LG09g09440 vs. TAIR10
Match: AT2G45720.1 (AT2G45720.1 ARM repeat superfamily protein)

HSP 1 Score: 297.7 bits (761), Expect = 1.4e-80
Identity = 191/527 (36.24%), Positives = 306/527 (58.06%), Query Frame = 1

Query: 27  INNFKGKWSSIRAKLSDLRTQLIDVSEFPNSSSNPLSVDFLHSVLEVLSEAASLSEKCRN 86
           +  F  +W  I ++L  + T L D+S  P  S + L  + L +VLE L E   L+  C +
Sbjct: 37  VKGFSSRWRVIISRLEKIPTCLSDLSSHPCFSKHTLCKEQLQAVLETLKETIELANVCVS 96

Query: 87  PELSDGKLKTQSDIDSVLAKLDSLLKDGEVLIRSEILHDGAVSSSSSRREAVRAESRNLI 146
            E  +GKLK QSD+DS+ AK+D  LKD  +L+++ +L +     SSS ++      R L+
Sbjct: 97  -EKQEGKLKMQSDLDSLSAKIDLSLKDCGLLMKTGVLGEVTKPLSSSTQDLETFSVRELL 156

Query: 147 TRLQIGSIESRVLAIESLLKLLNEDDKNVTIAAAQGIVPVLVRLLDSSSLELKERAVAAI 206
            RLQIG +ES+  A+E L++++ ED+K V  A  +  V  LV+LL ++S  ++E AV  I
Sbjct: 157 ARLQIGHLESKRKALEQLVEVMKEDEKAVITALGRTNVASLVQLLTATSPSVRENAVTVI 216

Query: 207 SIVSSVDGVRHLMIAEGLQLLNHLLRILDSGSGFAKEKACLTLQPLSISKENARSIGSRG 266
             ++   G  + +I+E    L  L+R+L+SGS  AKEKA ++LQ +SIS E +RSI   G
Sbjct: 217 CSLAESGGCENWLISENA--LPSLIRLLESGSIVAKEKAVISLQRMSISSETSRSIVGHG 276

Query: 267 GISSLLEICEAGTPGSQASAAAVLRNLASFGEIKENFIEENGVIVLLGLLASGTPL-AQE 326
           G+  L+EIC+ G   SQ+++A  L+N+++  E+++N  EE  V V++ +L  G  L ++E
Sbjct: 277 GVGPLIEICKTGDSVSQSASACTLKNISAVPEVRQNLAEEGIVKVMINILNCGILLGSKE 336

Query: 327 NAIGCLCNLVAEDDNLKLLIVREGGIELLRSFWDSAPSVHSLEVAVELLGLLASSAPIAE 386
            A  CL NL + ++ L+  ++ E GI+ L ++ D      S   A+  L        +  
Sbjct: 337 YAAECLQNLTSSNETLRRSVISENGIQTLLAYLDGPLPQESGVAAIRNL--------VGS 396

Query: 387 ALISNGF--VDRLLPALSCGVLGVRTAAARAVYELGFCTKTRKDMGEAGFITPLVNMLDG 446
             +   F  +  L+  L  G +G + AAA  +  +    +T++ +GE+G I  L+ ML+ 
Sbjct: 397 VSVETYFKIIPSLVHVLKSGSIGAQQAAASTICRIATSNETKRMIGESGCIPLLIRMLEA 456

Query: 447 KSVEEKEAAAKALSSLLQYTGNRKIFQKEEKGIVSAVQLLDPSISNLDKKYPVSLLASVM 506
           K+   +E AA+A++SL+    N +  +++EK + S V LL+PS  N  KKY VS LA++ 
Sbjct: 457 KASGAREVAAQAIASLVTVPRNCREVKRDEKSVTSLVMLLEPSPGNSAKKYAVSGLAALC 516

Query: 507 ISSKCRKQMAAAGAGLYLQKLVEMNVDGSKKLLECLGRGKIWGVFAR 551
            S KC+K M + GA  YL+KL E+ V GSKKLLE + +GK+   F+R
Sbjct: 517 SSRKCKKLMVSHGAVGYLKKLSELEVPGSKKLLERIEKGKLKSFFSR 552

BLAST of Cp4.1LG09g09440 vs. TAIR10
Match: AT1G01830.2 (AT1G01830.2 ARM repeat superfamily protein)

HSP 1 Score: 281.6 bits (719), Expect = 1.1e-75
Identity = 198/543 (36.46%), Positives = 310/543 (57.09%), Query Frame = 1

Query: 14  NDRISSLLDDIPLINNFKGKWSSIRAKLSDLRTQLIDVSEFPNSSSNPLSVDFLHSVLEV 73
           N  I S+L     +  F G+W +I +K+  +   L D+S  P  S N L  + L SV + 
Sbjct: 40  NSLIPSVLSKAKTVKKFTGRWKTIISKIEQIPACLSDLSSHPCFSKNKLCNEQLQSVAKT 99

Query: 74  LSEAASLSEKCRNPELSDGKLKTQSDIDSVLAKLDSLLKDGEVLIRSEILHDGAVS---S 133
           LSE   L+E+C   +  +GKL+ QSD+DS+  KLD  L+D  VLI++ +L +  +    S
Sbjct: 100 LSEVIELAEQCSTDKY-EGKLRMQSDLDSLSGKLDLNLRDCGVLIKTGVLGEATLPLYIS 159

Query: 134 SSSRREAVRAESRNLITRLQIGSIESRVLAIESLLKLLNEDDKNVTIAA-AQGIVPVLVR 193
           SSS    + +  + L+ RLQIG +ES+  A+ESLL  + ED+K V +    +  V  LV+
Sbjct: 160 SSSETPKI-SSLKELLARLQIGHLESKHNALESLLGAMQEDEKMVLMPLIGRANVAALVQ 219

Query: 194 LLDSSSLELKERAVAAISIVSSVDGVRHLMIAEGLQLLNHLLRILDSGSGFAKEKACLTL 253
           LL ++S  ++E+AV  IS+++        +I+EG+  L  L+R+++SGS   KEKA + +
Sbjct: 220 LLTATSTRIREKAVNLISVLAESGHCDEWLISEGV--LPPLVRLIESGSLETKEKAAIAI 279

Query: 254 QPLSISKENARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFGEIKENFIEENGV 313
           Q LS+++ENAR I   GGI+ L+++C+ G   SQA++AA L+N+++  E+++   EE  +
Sbjct: 280 QRLSMTEENAREIAGHGGITPLIDLCKTGDSVSQAASAAALKNMSAVSELRQLLAEEGII 339

Query: 314 IVLLGLLASGTPL-AQENAIGCLCNLVAEDDNLKLLIVREGGIELLRSFWDSAPSVHSLE 373
            V + LL  G  L ++E+   CL NL A  D L+  IV EGG+  L ++ D    +    
Sbjct: 340 RVSIDLLNHGILLGSREHMAECLQNLTAASDALREAIVSEGGVPSLLAYLDG--PLPQQP 399

Query: 374 VAVELLGLLASSAPIAEALISNGFVDRLLPALSCGVLGVRTAAARAVYELGFCTKTRKDM 433
               L  L+ S  P  E  ++   + RL   L  G LG + AAA A+       +T++ +
Sbjct: 400 AVTALRNLIPSVNP--EIWVALNLLPRLRHVLKSGSLGAQQAAASAICRFACSPETKRLV 459

Query: 434 GEAGFITPLVNMLDGKSVEEKEAAAKALSSLLQYTGNRKIFQKEEKGIV-SAVQLLDPSI 493
           GE+G I  +V +L+ KS   +EAAA+A++ L+     R+  +K+ K ++ + V LLD + 
Sbjct: 460 GESGCIPEIVKLLESKSNGCREAAAQAIAGLVAEGRIRRELKKDGKSVLTNLVMLLDSNP 519

Query: 494 SNLDKKYPVSLLASVMISSKCRKQMAAAGAGLYLQKLVEMNVDGSKKLLECLGRGKIWGV 551
            N  KKY V+ L  +  S K +K M + GA  YL+KL EM V G+ KLLE L RGK+   
Sbjct: 520 GNTAKKYAVAGLLGMSGSEKSKKMMVSYGAIGYLKKLSEMEVMGADKLLEKLERGKLRSF 574

BLAST of Cp4.1LG09g09440 vs. TAIR10
Match: AT2G05810.1 (AT2G05810.1 ARM repeat superfamily protein)

HSP 1 Score: 223.0 bits (567), Expect = 4.5e-58
Identity = 165/540 (30.56%), Positives = 289/540 (53.52%), Query Frame = 1

Query: 17  ISSLLDDIPLINNFKGKWSSIRAKLSDLRTQLIDVSEFPNSSSNPLSVDFLHSVLEVLSE 76
           +S LL     + +F G+W  +R+KL  L + L  +SE P+ S NPL    L S+L  L  
Sbjct: 31  LSLLLLSSLTVRSFIGRWQILRSKLFTLNSSLSSLSESPHWSQNPLLHTLLPSLLSNLQR 90

Query: 77  AASLSEKCRNPELSDGKLKTQSDIDSVLAKLDSLLKDGEVLIRSEILHDG-----AVSSS 136
            +SLS++C +   S GKL  QSD+D   + L + + D ++L+RS +LH       ++   
Sbjct: 91  LSSLSDQCSSASFSGGKLLMQSDLDIASSSLSTHISDLDLLLRSGVLHQQNAIVLSLPPP 150

Query: 137 SSRREAVRAESRNLITRLQIGSIESRVLAIESLLKLLNEDDKNVTIAAAQGIVPVLVRLL 196
           +S ++ +    R+L TRLQIG  E +  ++ESLL+LL +++K+  I A +G V  LV LL
Sbjct: 151 TSDKDDIAFFIRDLFTRLQIGGAEFKKKSLESLLQLLTDNEKSARIIAKEGNVGYLVTLL 210

Query: 197 DSSSLEL-KERAVAAISIV--SSVDGVRHLMIAEGLQLLNHLLRILDSGSGFAKEKACLT 256
           D     L +E A+AA+S++  SS D  + +    G   L  LLR+L++GS   K +A + 
Sbjct: 211 DLHHHPLIREHALAAVSLLTSSSADSRKTVFEQGG---LGPLLRLLETGSSPFKTRAAIA 270

Query: 257 LQPLSISKENARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFGEIKENFIEENG 316
           ++ ++     A +I + GG++ L+E C +G+   Q   A  + N+A+  EI+    EE  
Sbjct: 271 IEAITADPATAWAISAYGGVTVLIEACRSGSKQVQEHIAGAISNIAAVEEIRTTLAEEGA 330

Query: 317 VIVLLGLLASGTPLAQENAIGCLCNLVAEDDNLKLLIVRE-GGIELLRSFWDSAPSVHSL 376
           + VL+ LL SG+   QE     +  + +  +  + LIVRE GG+++L      + +  ++
Sbjct: 331 IPVLIQLLISGSSSVQEKTANFISLISSSGEYYRDLIVRERGGLQILIHLVQESSNPDTI 390

Query: 377 EVAVELLGLLASSAPIAEALISN-GFVDRLLPALSCGVLGVRTAAARAVYELGFCTKTRK 436
           E  +  L  +++   ++  L S+  F+ RL   +  G + ++  +   +  L      ++
Sbjct: 391 EHCLLALSQISAMETVSRVLSSSTRFIIRLGELIKHGNVILQQISTSLLSNLTISDGNKR 450

Query: 437 DMGEAGFITPLVNMLDG-KSVEEKEAAAKALSSLLQYTGNRKIFQKEEKGIVSAVQLLDP 496
            + +   ++ L+ +++  K    +EAA +A  SLL    NRK   ++EK ++  VQ+LDP
Sbjct: 451 AVADC--LSSLIRLMESPKPAGLQEAATEAAKSLLTVRSNRKELMRDEKSVIRLVQMLDP 510

Query: 497 SISNL-DKKYPVSLLASVMI--SSKCRKQMAAAGAGLYLQKLVEMNVDGSKKLLECLGRG 543
               + +K+ PV ++ +++   S   R ++   GA  YLQ L EM V G+KK ++ L  G
Sbjct: 511 RNERMNNKELPVMVVTAILSGGSYAARTKLIGLGADRYLQSLEEMEVPGAKKAVQRLAAG 565

BLAST of Cp4.1LG09g09440 vs. TAIR10
Match: AT1G61350.1 (AT1G61350.1 ARM repeat superfamily protein)

HSP 1 Score: 214.2 bits (544), Expect = 2.1e-55
Identity = 169/555 (30.45%), Positives = 289/555 (52.07%), Query Frame = 1

Query: 15  DRISSLLDDIPLINNFKGKWSSIRAKLSDLRTQLIDVSEFPNSSSNPLSVDFLHSVLEVL 74
           + ISSL+     I +F  KW  IR KL +L + L  +    NS  +P     + ++L  L
Sbjct: 19  EAISSLISLSHSIKSFNIKWQLIRTKLQELYSGLDSLRNL-NSGFDPSLSSLISAILISL 78

Query: 75  SEAASLSEKCRNPELSDGKLKTQSDIDSVLAKLDSLLKDGEVLIRSEILHDGAV-----S 134
            +   L+ +C N   S GKL  QSD+D +  K D   ++   +  + IL  G        
Sbjct: 79  KDTYDLATRCVNVSFS-GKLLMQSDLDVMAGKFDGHTRNLSRIYSAGILSHGFAIVVLKP 138

Query: 135 SSSSRREAVRAESRNLITRLQIGSIESRVLAIESLLKLLNEDDKNVTIAAA-QGIVPVLV 194
           + ++ ++ +R   R+L+TR++IG +E +  A+  L + + EDD+ V I      +V VLV
Sbjct: 139 NGNACKDDMRFYIRDLLTRMKIGDLEMKKQALVKLNEAMEEDDRYVKILIEISDMVNVLV 198

Query: 195 RLLDSSSLELKERAVAAISIVSSVDGVRHLMIAEGLQLLNHLLRILDSGSGFAKEKACLT 254
             LDS  + ++E +  A+  +S     R ++I  G+  +  L+R+L++G+G  +E +   
Sbjct: 199 GFLDSE-IGIQEESAKAVFFISGFGSYRDVLIRSGV--IGPLVRVLENGNGVGREASARC 258

Query: 255 LQPLSISKENARSIGSRGGISSLLEICEAGTPGSQ--ASAAAVLRNLASFGEIKENFIEE 314
           L  L+ + ENA S+ + GG+S+LL+IC     G +   ++  VLRNL    EIK   IEE
Sbjct: 259 LMKLTENSENAWSVSAHGGVSALLKICSCSDFGGELIGTSCGVLRNLVGVEEIKRFMIEE 318

Query: 315 NGVIV-LLGLLASGTPLAQENAIGCLCNLVAEDDNLKLLIVREGGIELLRSFW---DSAP 374
           +  +   + L+ S   + Q N+I  L ++  +D+  + ++VREGGI+ L S     +S  
Sbjct: 319 DHTVATFIKLIGSKEEIVQVNSIDLLLSMCCKDEQTRDILVREGGIQELVSVLSDPNSLS 378

Query: 375 SVHSLEVAVELL-GLLASSAPIAEALISNGFVDRLLPALSCGVLGVRTAAARAVYEL-GF 434
           S  S E+A+  +  L   SA    AL+   F+D LL  L  G + V+ +A +    L   
Sbjct: 379 SSKSKEIALRAIDNLCFGSAGCLNALMGCKFLDHLLNLLRNGEISVQESALKVTSRLCSL 438

Query: 435 CTKTRKDMGEAGFITPLVNMLDGKSVEEKEAAAKALSSLLQYTGNRKIFQKEEKGIVSAV 494
             + ++ MGEAGF+  LV  LD KS++ +E A+ AL  L+    NRK F +++  I   +
Sbjct: 439 QEEVKRIMGEAGFMPELVKFLDAKSIDVREMASVALYCLISVPRNRKKFAQDDFNISYIL 498

Query: 495 QLLD------PSISNLDKKYPVSLLASVMISSKCRKQMAAAGAGLYLQKLVEMNVDGSKK 550
           QLLD       S  + + K+ +S+L S+   +  R+++A++G    ++KL E     +KK
Sbjct: 499 QLLDHEDGSNVSSDSGNTKFLISILMSLTSCNSARRKIASSGYLKSIEKLAETEGSDAKK 558

BLAST of Cp4.1LG09g09440 vs. NCBI nr
Match: gi|449455447|ref|XP_004145464.1| (PREDICTED: vacuolar protein 8 [Cucumis sativus])

HSP 1 Score: 937.2 bits (2421), Expect = 1.3e-269
Identity = 497/551 (90.20%), Positives = 521/551 (94.56%), Query Frame = 1

Query: 1   MTIPPENDQFALSNDRISSLLDDIPLINNFKGKWSSIRAKLSDLRTQLIDVSEFPNSSSN 60
           M IPPE D F LSN+ ISSLLDDIPLI  FKGKWSSIRAKLSDLRTQLIDVS FPNSSSN
Sbjct: 1   MKIPPETDHFLLSNNLISSLLDDIPLITIFKGKWSSIRAKLSDLRTQLIDVSHFPNSSSN 60

Query: 61  PLSVDFLHSVLEVLSEAASLSEKCRNPELSDGKLKTQSDIDSVLAKLDSLLKDGEVLIRS 120
           PLS+DFLHSVLE L++AASLS KCRNP LSDGKLKTQSDID++LAK DSLLKDGEVLIRS
Sbjct: 61  PLSLDFLHSVLEALTQAASLSHKCRNPALSDGKLKTQSDIDAILAKFDSLLKDGEVLIRS 120

Query: 121 EILHDGAVSSSSSRREAVRAESRNLITRLQIGSIESRVLAIESLLKLLNEDDKNVTIAAA 180
           EILHDG VSSSSSRREAVRAESRNLITRLQIGSIESRVLAI+SLL+LLNEDDKNVTIAAA
Sbjct: 121 EILHDGVVSSSSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAA 180

Query: 181 QGIVPVLVRLLDSSSLELKERAVAAISIVSSVDGVRHLMIAEGLQLLNHLLRILDSGSGF 240
           QG VPVLVRLLDSSSLELKERAVAAISIVS VDGV+H+MIAEGL LLNHLLRILDSGSGF
Sbjct: 181 QGAVPVLVRLLDSSSLELKERAVAAISIVSMVDGVKHIMIAEGLVLLNHLLRILDSGSGF 240

Query: 241 AKEKACLTLQPLSISKENARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFGEIK 300
           AKEKACL LQPLSISKENARSIGSRGGISSLLEICE GTPGSQASAAAVLRNLASF EIK
Sbjct: 241 AKEKACLALQPLSISKENARSIGSRGGISSLLEICEGGTPGSQASAAAVLRNLASFSEIK 300

Query: 301 ENFIEENGVIVLLGLLASGTPLAQENAIGCLCNLVAEDDNLKLLIVREGGIELLRSFWDS 360
           ENFIEENGVIVLLGLLASGTPLAQENAIGCLCNLV +DDNLKLLIVREGGIE LR+FWDS
Sbjct: 301 ENFIEENGVIVLLGLLASGTPLAQENAIGCLCNLVLDDDNLKLLIVREGGIEFLRNFWDS 360

Query: 361 APSVHSLEVAVELLGLLASSAPIAEALISNGFVDRLLPALSCGVLGVRTAAARAVYELGF 420
            PSV SLEVAVELL LLAS +PIAEALIS+GFVDRLLP LSCGVLG RTAAARAVYELGF
Sbjct: 361 VPSVRSLEVAVELLSLLASYSPIAEALISDGFVDRLLPVLSCGVLGARTAAARAVYELGF 420

Query: 421 CTKTRKDMGEAGFITPLVNMLDGKSVEEKEAAAKALSSLLQYTGNRKIFQKEEKGIVSAV 480
           CTKTRK+MGE+GFITPLVNMLDGKSV+E++AAAKALSSLLQY+GNRKIFQKEE+GIVSAV
Sbjct: 421 CTKTRKEMGESGFITPLVNMLDGKSVDERKAAAKALSSLLQYSGNRKIFQKEERGIVSAV 480

Query: 481 QLLDPSISNLDKKYPVSLLASVMISSKCRKQMAAAGAGLYLQKLVEMNVDGSKKLLECLG 540
           QLLDPSISNLDKKYPVSLL+SV ISSKCRKQM AAGAGLYLQKLVE+NV+GSKKLLE LG
Sbjct: 481 QLLDPSISNLDKKYPVSLLSSVAISSKCRKQMVAAGAGLYLQKLVEINVEGSKKLLESLG 540

Query: 541 RGKIWGVFARS 552
           RGKIWGVFARS
Sbjct: 541 RGKIWGVFARS 551

BLAST of Cp4.1LG09g09440 vs. NCBI nr
Match: gi|659118181|ref|XP_008458985.1| (PREDICTED: U-box domain-containing protein 10 [Cucumis melo])

HSP 1 Score: 932.6 bits (2409), Expect = 3.3e-268
Identity = 493/551 (89.47%), Positives = 520/551 (94.37%), Query Frame = 1

Query: 1   MTIPPENDQFALSNDRISSLLDDIPLINNFKGKWSSIRAKLSDLRTQLIDVSEFPNSSSN 60
           M IPP  D F LSN+ +SSLLDDIPLI+ FKGKWSSIRAKLSDLRTQLIDVS FPNSSSN
Sbjct: 1   MKIPPHTDHFLLSNNLLSSLLDDIPLISIFKGKWSSIRAKLSDLRTQLIDVSHFPNSSSN 60

Query: 61  PLSVDFLHSVLEVLSEAASLSEKCRNPELSDGKLKTQSDIDSVLAKLDSLLKDGEVLIRS 120
           PLS+DFLHSVLE L++AASLS KCRNP LSDGKLKTQSDID++LAK DSLLKDGEVLIRS
Sbjct: 61  PLSLDFLHSVLEALTQAASLSHKCRNPALSDGKLKTQSDIDALLAKFDSLLKDGEVLIRS 120

Query: 121 EILHDGAVSSSSSRREAVRAESRNLITRLQIGSIESRVLAIESLLKLLNEDDKNVTIAAA 180
           EILHDG VSSSSSRREAVRAESRNLITRLQIGSIESRVLAI+SLL+LLNEDDKNVTIAAA
Sbjct: 121 EILHDGVVSSSSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAA 180

Query: 181 QGIVPVLVRLLDSSSLELKERAVAAISIVSSVDGVRHLMIAEGLQLLNHLLRILDSGSGF 240
           QG VPVLVRLLDSSSLELKERAVAAISIVS VDGV+H+MIAEGL LLNHLLRILDSGSGF
Sbjct: 181 QGAVPVLVRLLDSSSLELKERAVAAISIVSMVDGVKHIMIAEGLVLLNHLLRILDSGSGF 240

Query: 241 AKEKACLTLQPLSISKENARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFGEIK 300
           AKEKACL LQPLSISKENARSIGSRGGISSLLEICE GTPGSQASAAAVLRNLASF EIK
Sbjct: 241 AKEKACLALQPLSISKENARSIGSRGGISSLLEICEGGTPGSQASAAAVLRNLASFSEIK 300

Query: 301 ENFIEENGVIVLLGLLASGTPLAQENAIGCLCNLVAEDDNLKLLIVREGGIELLRSFWDS 360
           ENFIEENGV+VLLGLLASGTPLAQENAIGCLCNLV +DDNLKLLIVREGGIE LR+FWDS
Sbjct: 301 ENFIEENGVMVLLGLLASGTPLAQENAIGCLCNLVLDDDNLKLLIVREGGIEFLRNFWDS 360

Query: 361 APSVHSLEVAVELLGLLASSAPIAEALISNGFVDRLLPALSCGVLGVRTAAARAVYELGF 420
            PS  SLEVAVELL LLAS +PIAEALIS+GFVDRLLP LSCGVLG RTAAARAVYELGF
Sbjct: 361 VPSARSLEVAVELLSLLASYSPIAEALISDGFVDRLLPVLSCGVLGARTAAARAVYELGF 420

Query: 421 CTKTRKDMGEAGFITPLVNMLDGKSVEEKEAAAKALSSLLQYTGNRKIFQKEEKGIVSAV 480
           CTKTRK+MGE+GFITPLVNMLDGKSV+E++AAAKALSSLLQY+GNRKIFQKEE+GI+SAV
Sbjct: 421 CTKTRKEMGESGFITPLVNMLDGKSVDERKAAAKALSSLLQYSGNRKIFQKEERGIISAV 480

Query: 481 QLLDPSISNLDKKYPVSLLASVMISSKCRKQMAAAGAGLYLQKLVEMNVDGSKKLLECLG 540
           QLLDPSISNLDKKYPVSLL+SV ISSKCRKQM AAGAGLYLQKLVEMNV+GSKKLLE LG
Sbjct: 481 QLLDPSISNLDKKYPVSLLSSVAISSKCRKQMVAAGAGLYLQKLVEMNVEGSKKLLESLG 540

Query: 541 RGKIWGVFARS 552
           RGKIWGVFARS
Sbjct: 541 RGKIWGVFARS 551

BLAST of Cp4.1LG09g09440 vs. NCBI nr
Match: gi|590670572|ref|XP_007038093.1| (ARM repeat superfamily protein isoform 1 [Theobroma cacao])

HSP 1 Score: 748.0 bits (1930), Expect = 1.1e-212
Identity = 388/546 (71.06%), Positives = 469/546 (85.90%), Query Frame = 1

Query: 5   PENDQFALSNDRISSLLDDIPLINNFKGKWSSIRAKLSDLRTQLIDVSEFPNSSSNPLSV 64
           PEND  +LSN  ++SL + IP INNFKGKW+ I++KLS L+ QL D S+FP SSSNPL+V
Sbjct: 4   PENDPISLSNHLLASLSEQIPNINNFKGKWALIKSKLSGLQAQLADFSDFPASSSNPLAV 63

Query: 65  DFLHSVLEVLSEAASLSEKCRNPELSDGKLKTQSDIDSVLAKLDSLLKDGEVLIRSEILH 124
           D L+S+ + L++A SLS+KC+  +L++GKLKTQSDID+VLAKLD  +KD E+LIRS +L 
Sbjct: 64  DLLYSITQTLNDAVSLSQKCQLADLTEGKLKTQSDIDAVLAKLDRHIKDSEILIRSGVLQ 123

Query: 125 DGAVSSSSSRREAVRAESRNLITRLQIGSIESRVLAIESLLKLLNEDDKNVTIAAAQGIV 184
           DGAVS+SSS++EAVR ESRNLITRLQIG+ ES+  A++SLL LL EDDKNV IA AQG+V
Sbjct: 124 DGAVSTSSSKKEAVRVESRNLITRLQIGTTESKNSAMDSLLGLLQEDDKNVMIAVAQGVV 183

Query: 185 PVLVRLLDSSSLELKERAVAAISIVSSVDGVRHLMIAEGLQLLNHLLRILDSGSGFAKEK 244
           PVLVRLLDSSSLE+KE+ VAAIS VS+V+  +H++IAEGL LLNHLLR+L+SGSGFAKEK
Sbjct: 184 PVLVRLLDSSSLEMKEKTVAAISRVSTVESSKHVLIAEGLLLLNHLLRVLESGSGFAKEK 243

Query: 245 ACLTLQPLSISKENARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFGEIKENFI 304
           AC+ LQ LS SKENAR+IGSRGGISSLLEIC+AGTPGSQA AA VL+NLAS  EIKENFI
Sbjct: 244 ACIALQALSFSKENARAIGSRGGISSLLEICQAGTPGSQAFAAGVLKNLASVDEIKENFI 303

Query: 305 EENGVIVLLGLLASGTPLAQENAIGCLCNLVAEDDNLKLLIVREGGIELLRSFWDSAPSV 364
           EEN V VL+GL ASGT LAQEN+IGCLCNLV++D+NL+LLIV+EGGIE L++FWDS+P+ 
Sbjct: 304 EENAVFVLIGLAASGTALAQENSIGCLCNLVSDDENLRLLIVKEGGIECLKNFWDSSPNP 363

Query: 365 HSLEVAVELLGLLASSAPIAEALISNGFVDRLLPALSCGVLGVRTAAARAVYELGFCTKT 424
            SLEVAVEL+  LAS +PIAEAL+++GFV RL+  L+CGVLGVR AAARAVYELGF +KT
Sbjct: 364 KSLEVAVELVRRLASCSPIAEALVADGFVARLVAVLNCGVLGVRIAAARAVYELGFNSKT 423

Query: 425 RKDMGEAGFITPLVNMLDGKSVEEKEAAAKALSSLLQYTGNRKIFQKEEKGIVSAVQLLD 484
           RK+MGE G    L+ M+DGK+VEEKEAAA ALS+L+ Y GNRK+FQK+E+GIV+AVQLLD
Sbjct: 424 RKEMGECGCTVALIKMMDGKAVEEKEAAAMALSTLMLYAGNRKVFQKDERGIVNAVQLLD 483

Query: 485 PSISNLDKKYPVSLLASVMISSKCRKQMAAAGAGLYLQKLVEMNVDGSKKLLECLGRGKI 544
           P I NLDKKYPV +L+ ++ S KCRKQM AAGA +YLQKLVEMNV+G+KKLLE LGRGKI
Sbjct: 484 PLIQNLDKKYPVLILSELVHSKKCRKQMVAAGACVYLQKLVEMNVEGAKKLLESLGRGKI 543

Query: 545 WGVFAR 551
           WGVFAR
Sbjct: 544 WGVFAR 549

BLAST of Cp4.1LG09g09440 vs. NCBI nr
Match: gi|1009130344|ref|XP_015882248.1| (PREDICTED: armadillo segment polarity protein-like [Ziziphus jujuba])

HSP 1 Score: 739.2 bits (1907), Expect = 5.3e-210
Identity = 394/549 (71.77%), Positives = 459/549 (83.61%), Query Frame = 1

Query: 5   PENDQFALSNDRISSLLDDIPLINNFKGKWSSIRAKLSDLRTQLIDVSEFPNSSSNPLSV 64
           PEND  +L+   +SSL  +IP ++NFKGKW+ I  KL++L+TQL D S+ P S SNPLS+
Sbjct: 4   PENDPISLATHLVSSLSQEIPTVSNFKGKWALINDKLAELKTQLADFSDCPTSVSNPLSL 63

Query: 65  DFLHSVLEVLSEAASLSEKCRNPELSDGKLKTQSDIDSVLAKLDSLLKDGEVLIRSEILH 124
           + L SV   L +A SLS+KC++P LS+GKL+TQSD+DSVLAKLD  +KD E+LIRS +L 
Sbjct: 64  ELLRSVSLTLHDAVSLSKKCQSPSLSEGKLRTQSDVDSVLAKLDKHVKDAEILIRSGVLQ 123

Query: 125 DGAVS--SSSSRREAVRAESRNLITRLQIGSIESRVLAIESLLKLLNEDDKNVTIAAAQG 184
           DG VS  SSSS+REAVRAESRNLITRLQIGS E+R  A++SLL LL EDDKNV IA AQG
Sbjct: 124 DGTVSGSSSSSKREAVRAESRNLITRLQIGSAEARNSAMDSLLGLLQEDDKNVMIAVAQG 183

Query: 185 IVPVLVRLLDSSSL-ELKERAVAAISIVSSVDGVRHLMIAEGLQLLNHLLRILDSGSGFA 244
           IVPVLVRL+DSSS  E+KE+ VAAIS VS VD  +H++IAEGL LLNHLLR+LDSGSGFA
Sbjct: 184 IVPVLVRLMDSSSSPEMKEKTVAAISRVSMVDSSKHVLIAEGLLLLNHLLRVLDSGSGFA 243

Query: 245 KEKACLTLQPLSISKENARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFGEIKE 304
           KEKAC+ LQ LS SKENAR+IGSRGGISSLLEIC +GTPGSQASAA VLRNLA+F E KE
Sbjct: 244 KEKACIALQALSFSKENARAIGSRGGISSLLEICNSGTPGSQASAAGVLRNLAAFSENKE 303

Query: 305 NFIEENGVIVLLGLLASGTPLAQENAIGCLCNLVAEDDNLKLLIVREGGIELLRSFWDSA 364
           NFIEENGV VLLGL   GT LAQENAIGCLCNLV EDD+LKLL+ +EGGIE L++FWDSA
Sbjct: 304 NFIEENGVFVLLGLAGLGTVLAQENAIGCLCNLVCEDDHLKLLVAKEGGIECLKNFWDSA 363

Query: 365 PSVHSLEVAVELLGLLASSAPIAEALISNGFVDRLLPALSCGVLGVRTAAARAVYELGFC 424
            SV SLEVAV+LL  LAS  PIAE L+S+GFV R+   L+CGVLGVR AAARAVYELGFC
Sbjct: 364 SSVRSLEVAVDLLRHLASRQPIAEVLVSDGFVTRIAGVLNCGVLGVRIAAARAVYELGFC 423

Query: 425 TKTRKDMGEAGFITPLVNMLDGKSVEEKEAAAKALSSLLQYTGNRKIFQKEEKGIVSAVQ 484
           TKTRK+MGE G I  L+ MLDGK+VEEKE+AAKALS+L+ +TGNR+IF+K+ KGI+ AVQ
Sbjct: 424 TKTRKEMGECGCIASLIKMLDGKAVEEKESAAKALSNLMLFTGNRRIFRKDAKGILCAVQ 483

Query: 485 LLDPSISNLDKKYPVSLLASVMISSKCRKQMAAAGAGLYLQKLVEMNVDGSKKLLECLGR 544
           LLDPSI NLDKKYPVS+LAS++ S KCRKQM A+GA  YLQKLVE +V+GSKKLLE LGR
Sbjct: 484 LLDPSIQNLDKKYPVSVLASLVHSKKCRKQMVASGACAYLQKLVEADVEGSKKLLESLGR 543

Query: 545 GKIWGVFAR 551
           GKIWGVFAR
Sbjct: 544 GKIWGVFAR 552

BLAST of Cp4.1LG09g09440 vs. NCBI nr
Match: gi|802770420|ref|XP_012090618.1| (PREDICTED: uncharacterized protein LOC105648741 [Jatropha curcas])

HSP 1 Score: 723.4 bits (1866), Expect = 3.0e-205
Identity = 379/547 (69.29%), Positives = 449/547 (82.08%), Query Frame = 1

Query: 5   PENDQFALSNDRISSLLDDIPLINNFKGKWSSIRAKLSDLRTQLIDVSEFPNSSSNPLSV 64
           PEND    ++  + SLLD+IP +  FKGKW+ IRAKL+DL+TQL D ++FP S+SNPL +
Sbjct: 4   PENDPINANDQLLQSLLDEIPHVQTFKGKWALIRAKLADLQTQLTDFADFPASTSNPLCL 63

Query: 65  DFLHSVLEVLSEAASLSEKCRNPELSDGKLKTQSDIDSVLAKLDSLLKDGEVLIRSEILH 124
           D LHS+   L++A  L+ KCR P  ++GKL+TQSD+DS+LAKLD  +KD E+LI+S +L 
Sbjct: 64  DLLHSISNSLNDAVLLARKCRTPNFTEGKLRTQSDVDSILAKLDRHVKDSEILIKSGVLQ 123

Query: 125 DGAVS-SSSSRREAVRAESRNLITRLQIGSIESRVLAIESLLKLLNEDDKNVTIAAAQGI 184
           DGA S  SSS+REAVR ESRNLITRLQIGS ES+  A++SLL LL EDDKNV IA AQG+
Sbjct: 124 DGATSVGSSSKREAVRVESRNLITRLQIGSSESKNSAMDSLLGLLQEDDKNVMIAVAQGV 183

Query: 185 VPVLVRLLDSSSLELKERAVAAISIVSSVDGVRHLMIAEGLQLLNHLLRILDSGSGFAKE 244
           VPVLVRLLDSSSLE+KE+ VAAIS VS VD  +H++IAEGL LLNHLLR+L+SGSGFAKE
Sbjct: 184 VPVLVRLLDSSSLEMKEKTVAAISRVSMVDSSKHVLIAEGLLLLNHLLRVLESGSGFAKE 243

Query: 245 KACLTLQPLSISKENARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFGEIKENF 304
           KAC+ LQ LS SKENAR+IGSRGGISSLLEIC+ GTPGSQA AA VLRNLA F EI+ENF
Sbjct: 244 KACVALQALSFSKENARAIGSRGGISSLLEICQGGTPGSQAFAAGVLRNLAVFEEIRENF 303

Query: 305 IEENGVIVLLGLLASGTPLAQENAIGCLCNLVAEDDNLKLLIVREGGIELLRSFWDSAPS 364
           IEEN V VL+GL ASGT LAQENAIGCLCNL  +D+NLKLLIV+EGG+E LR+FWDS P 
Sbjct: 304 IEENAVFVLIGLAASGTALAQENAIGCLCNLAKDDENLKLLIVKEGGVECLRNFWDSGPP 363

Query: 365 VHSLEVAVELLGLLASSAPIAEALISNGFVDRLLPALSCGVLGVRTAAARAVYELGFCTK 424
           V SLEVAV+LL  LAS+  IAE L+S+GFV RL+  L+CGVLGVR A A A+YELGF TK
Sbjct: 364 VRSLEVAVDLLRNLASNQAIAEVLVSDGFVSRLMVFLNCGVLGVRIATAEAIYELGFNTK 423

Query: 425 TRKDMGEAGFITPLVNMLDGKSVEEKEAAAKALSSLLQYTGNRKIFQKEEKGIVSAVQLL 484
           TRK+MGE   I PL+NMLDGK+V EKEAAAKALS LL Y GNRK F+K+E+GIV  VQLL
Sbjct: 424 TRKEMGECEVIVPLINMLDGKAVVEKEAAAKALSHLLLYAGNRKTFRKDERGIVYTVQLL 483

Query: 485 DPSISNLDKKYPVSLLASVMISSKCRKQMAAAGAGLYLQKLVEMNVDGSKKLLECLGRGK 544
           DPSI NLDKKYPVS+LAS++ S KCRKQM  AGA ++L+ L EM ++G+KKLL+ LGRGK
Sbjct: 484 DPSIQNLDKKYPVSILASLVQSKKCRKQMIGAGACVHLKTLAEMEIEGAKKLLDGLGRGK 543

Query: 545 IWGVFAR 551
           IWGVFAR
Sbjct: 544 IWGVFAR 550
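
The identity and positives percentages in each HSP header above are the match counts divided by the alignment length. The following is a minimal Python sketch that recomputes them from a header line, assuming the line layout used in this record. The names `HSP_STATS`, `percent` and `parse_hsp_stats` are hypothetical helpers for illustration and are not part of any BLAST tool.

```python
import re

# Pattern for the HSP summary line as it appears in this record.
# "Posi?tives" accepts both "Positives" and the "Postives" spelling variant.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(matches, length):
    """Return matches/length as a percentage rounded to two decimals."""
    return round(100.0 * matches / length, 2)


def parse_hsp_stats(line):
    """Extract the counts from one HSP summary line and recompute the percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identities, length, _, positives, pos_length, _ = m.groups()
    identities, length = int(identities), int(length)
    positives, pos_length = int(positives), int(pos_length)
    return {
        "identities": identities,
        "positives": positives,
        "length": length,
        "identity_pct": percent(identities, length),
        "positives_pct": percent(positives, pos_length),
    }


if __name__ == "__main__":
    # Header of the AT1G01830.2 hit from this record.
    line = "Identity = 198/543 (36.46%), Positives = 310/543 (57.09%), Query Frame = 1"
    print(parse_hsp_stats(line))
```

Recomputing the percentages from the counts, rather than trusting the printed values, gives a quick consistency check when HSP headers are copied between reports.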

The following BLAST results are available for this feature:
Match Name    E-value    Identity (%)    Description
PUB4_ARATH    5.2e-16    26.23           U-box domain-containing protein 4 OS=Arabidopsis thaliana GN=PUB4 PE=1 SV=3
PUB11_ARATH   7.5e-15    25.64           U-box domain-containing protein 11 OS=Arabidopsis thaliana GN=PUB11 PE=2 SV=2
PUB10_ARATH   9.2e-13    26.38           U-box domain-containing protein 10 OS=Arabidopsis thaliana GN=PUB10 PE=2 SV=1
VAC8_ASPOR    8.6e-11    22.39           Vacuolar protein 8 OS=Aspergillus oryzae (strain ATCC 42149 / RIB 40) GN=vac8 PE...
VAC8_ASPFU    5.6e-10    22.25           Vacuolar protein 8 OS=Neosartorya fumigata (strain ATCC MYA-4609 / Af293 / CBS 1...

Match Name          E-value     Identity (%)    Description
A0A0A0M0M7_CUCSA    9.2e-270    90.20           Uncharacterized protein OS=Cucumis sativus GN=Csa_1G699580 PE=4 SV=1
A0A061G6D5_THECC    7.9e-213    71.06           ARM repeat superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_014724 PE=4 S...
A0A067JIA0_JATCU    2.1e-205    69.29           Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26389 PE=4 SV=1
W9QV83_9ROSA        1.0e-204    69.93           U-box domain-containing protein 11 OS=Morus notabilis GN=L484_008839 PE=4 SV=1
M5WBC9_PRUPE        2.2e-202    68.66           Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003699mg PE=4 SV=1

Match Name     E-value     Identity (%)    Description
AT5G50900.1    6.7e-187    61.73           ARM repeat superfamily protein
AT2G45720.1    1.4e-80     36.24           ARM repeat superfamily protein
AT1G01830.2    1.1e-75     36.46           ARM repeat superfamily protein
AT2G05810.1    4.5e-58     30.56           ARM repeat superfamily protein
AT1G61350.1    2.1e-55     30.45           ARM repeat superfamily protein

Match Name                          E-value     Identity (%)    Description
gi|449455447|ref|XP_004145464.1|    1.3e-269    90.20           PREDICTED: vacuolar protein 8 [Cucumis sativus]
gi|659118181|ref|XP_008458985.1|    3.3e-268    89.47           PREDICTED: U-box domain-containing protein 10 [Cucumis melo]
gi|590670572|ref|XP_007038093.1|    1.1e-212    71.06           ARM repeat superfamily protein isoform 1 [Theobroma cacao]
gi|1009130344|ref|XP_015882248.1|   5.3e-210    71.77           PREDICTED: armadillo segment polarity protein-like [Ziziphus jujuba]
gi|802770420|ref|XP_012090618.1|    3.0e-205    69.29           PREDICTED: uncharacterized protein LOC105648741 [Jatropha curcas]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term          Definition
GO:0005488    binding
GO:0005515    protein binding

Vocabulary: INTERPRO
Term         Definition
IPR016024    ARM-type_fold
IPR011989    ARM-like
IPR000225    Armadillo

GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0005488 binding

The following mRNA feature(s) are a part of this gene:

Feature Name         Unique Name          Type
Cp4.1LG09g09440.1    Cp4.1LG09g09440.1    mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term     IPR Description           Source     Source Term         Source Description
IPR000225    Armadillo                 PFAM       PF00514             Arm
    Alignment: coord: 178..207  score: 9.
IPR000225    Armadillo                 SMART      SM00185             arm_5
    Alignment: coord: 255..295  score: 24.0
               coord: 296..336  score: 5.6
               coord: 380..420  score: 19.0
               coord: 338..379  score: 16.0
               coord: 171..211  score: 0.0029
               coord: 421..461  score:
IPR011989    Armadillo-like helical    GENE3D     G3DSA:1.25.10.10
    Alignment: coord: 142..534  score: 1.2
IPR016024    Armadillo-type fold       unknown    SSF48371            ARM repeat
    Alignment: coord: 144..489  score: 2.49
None         No IPR available          PANTHER    PTHR23315           BETA CATENIN-RELATED ARMADILLO REPEAT-CONTAINING
    Alignment: coord: 24..34    score: 2.8E-268
               coord: 91..541   score: 2.8E
None         No IPR available          PANTHER    PTHR23315:SF86      ARMADILLO/BETA-CATENIN-LIKE REPEAT-CONTAINING PROTEIN
    Alignment: coord: 91..541   score: 2.8E-268
               coord: 24..34    score: 2.8E

The following gene(s) are orthologous to this gene:
Gene               Orthologue        Organism                    Block
Cp4.1LG09g09440    CmaCh18G002980    Cucurbita maxima (Rimu)     cmacpeB414
Cp4.1LG09g09440    CmoCh18G002360    Cucurbita moschata (Rifu)   cmocpeB377
Cp4.1LG09g09440    Carg06813         Silver-seed gourd           carcpeB0694

The following gene(s) are paralogous to this gene:

None