Cp4.1LG20g06260 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG20g06260
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Armadillo/beta-catenin repeat family protein
Location: Cp4.1LG20 : 3948834 .. 3950489 (+)
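
The location above can be turned into a feature length with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GFF-style convention; this record does not state it explicitly), and the helper name `feature_length` is purely illustrative.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Coordinates taken from the Location field of this record.
length = feature_length(3948834, 3950489)

# If the whole span is coding, it should split into whole codons.
codons, remainder = divmod(length, 3)

print(length, codons, remainder)
```

Under that assumption the span is 1656 bp, or 552 codons with no remainder, which would correspond to 551 residues plus a stop codon if the entire span is coding.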
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAAGTACCTCCAGAAAACAGCCATTTCCTTCTCTCTAACAATCTCATTTCTTCTGTTCTTGATGATATTCCACTCATCAACGATTTCAAGGGCAAATGGTCTTCCATGGCCACCAAACTCTCGGATCTTCGCGCTCAATTGATCGATGTTTCTGAGTTTCCCAACTCTTCTTCCAATCCGCTCTCTCTCGATTTCCTTCATTCTGTTATGGAAACTCTTACTCAGGCGGCTTCTCTCTCGCACAAGTGCCGCAATCCGGGAGTTTCCGATGGTAAACTCAAGACTCAAAGCGATATTCTCTCCGTTCTCACGAAGTTAGACTGCCTACTCAAAGATGGTGATGTGTTGATTAAGAGTGAGATTCTTCACGACGGTGTGATTTCGAGTTCCTCGTCTAGAAGGGAGGCCGTGCGGGCGGAGTCCAGGAATTTGATCACTAGGTTACAGATTGGAAGCATTGAATCCAGAGTATTGGCTATTGATTCGCTGTTGCAGTTGTTGAATGAGGATGATAAGAATGTCACCATTGCTGCGGCTCAAGGGGCTGTTCCTGTTCTGGTTCGGCTACTGGATTCCAGTTCTTTAGAATTGAAGGAGAAGGCTGTTGCTGCTATTTCCATTGTTTCTACGGTGGATGGTGTTAAGAATGTAATGATTGCTGAAGGAATCGTGCTTTTGAATCACTTGCTGAGGATTCTCGATTCTGGTAGCGGTTTTGCAAAAGAGAAGTCCTGTTTAGCTCTCCAATCTCTGAGTATTTCCAGGCAAAATGCTAGGTCAATCGGTTCCAGAGGAGGGATTTCATCTCTGTTGGAGATTTGTGAGGCCGGAACTCCCGGTTCTCAAGCCTCTGCAGCTGCGGTTTTGAGAAATCTTGCATCATTTAACGAAATCAAGGAGAATTTCATCGAAGAAAATGGGGTTATAGTTCTTTTGGGGCTTTTGACCTCTGGAACTCCATTGGCTCAAGAAAATGCAATTGGTTGTTTGTGTAATTTAGTTGTAGATGATGATAATTTGAAGCTCTTGATCGTTAAAGAAGGTGGGATCGAGTTCTTGAAAATTTTCTGGGATTCGGTTCCATCAGTTCGTAGTCTTGGAGTGGCTGTGGAGCTTTTGAGCCTCTTGGCTTCTTATTCCCCCATTGCAGAAACTCTTATTTCAGAGGGATTTCTTGATCGGCTTCTTCCAGTTTTGAGTTGCGGAGTATTAGGTGCGCGAATTGCAGCAGCTCGAGCAGTTTACGAGCTCAGCTTCTGCGCAAAAGCAAGAAAAGAAATGGGGGAATCTGGATTCATTACACCCTTAATTAATATGCTGGATGGTAAATCTGTTGATGAGAAAACAGCAGCTGCTAAGGCGTTGTCTTCTCTATTACAATATAATGGTAACAGAAGAATTTTCCAGAAAGAGGAGAGGGGAATTGTAAGTGCAGTTCATCTCTTAGATCCTTCAATCTTGAATCTGGATAAGAAGTACCCTGTTTCATTATTAGCCTCGGTTGTGATTTCAAGCAAGTGTAGAAAGCTGATGGCTGCTGCTGGTGCTGCTTTGTATCTACAAAAGCTTGTTGAAATGAATGTTGAGGGGTCAAAGAAGCTGTTGGTAAGTCTTAGCCGTGCTAAAATCTGGGGTGTCTTTGCCAGATCTTAG

mRNA sequence

ATGAAAGTACCTCCAGAAAACAGCCATTTCCTTCTCTCTAACAATCTCATTTCTTCTGTTCTTGATGATATTCCACTCATCAACGATTTCAAGGGCAAATGGTCTTCCATGGCCACCAAACTCTCGGATCTTCGCGCTCAATTGATCGATGTTTCTGAGTTTCCCAACTCTTCTTCCAATCCGCTCTCTCTCGATTTCCTTCATTCTGTTATGGAAACTCTTACTCAGGCGGCTTCTCTCTCGCACAAGTGCCGCAATCCGGGAGTTTCCGATGGTAAACTCAAGACTCAAAGCGATATTCTCTCCGTTCTCACGAAGTTAGACTGCCTACTCAAAGATGGTGATGTGTTGATTAAGAGTGAGATTCTTCACGACGGTGTGATTTCGAGTTCCTCGTCTAGAAGGGAGGCCGTGCGGGCGGAGTCCAGGAATTTGATCACTAGGTTACAGATTGGAAGCATTGAATCCAGAGTATTGGCTATTGATTCGCTGTTGCAGTTGTTGAATGAGGATGATAAGAATGTCACCATTGCTGCGGCTCAAGGGGCTGTTCCTGTTCTGGTTCGGCTACTGGATTCCAGTTCTTTAGAATTGAAGGAGAAGGCTGTTGCTGCTATTTCCATTGTTTCTACGGTGGATGGTGTTAAGAATGTAATGATTGCTGAAGGAATCGTGCTTTTGAATCACTTGCTGAGGATTCTCGATTCTGGTAGCGGTTTTGCAAAAGAGAAGTCCTGTTTAGCTCTCCAATCTCTGAGTATTTCCAGGCAAAATGCTAGGTCAATCGGTTCCAGAGGAGGGATTTCATCTCTGTTGGAGATTTGTGAGGCCGGAACTCCCGGTTCTCAAGCCTCTGCAGCTGCGGTTTTGAGAAATCTTGCATCATTTAACGAAATCAAGGAGAATTTCATCGAAGAAAATGGGGTTATAGTTCTTTTGGGGCTTTTGACCTCTGGAACTCCATTGGCTCAAGAAAATGCAATTGGTTGTTTGTGTAATTTAGTTGTAGATGATGATAATTTGAAGCTCTTGATCGTTAAAGAAGGTGGGATCGAGTTCTTGAAAATTTTCTGGGATTCGGTTCCATCAGTTCGTAGTCTTGGAGTGGCTGTGGAGCTTTTGAGCCTCTTGGCTTCTTATTCCCCCATTGCAGAAACTCTTATTTCAGAGGGATTTCTTGATCGGCTTCTTCCAGTTTTGAGTTGCGGAGTATTAGGTGCGCGAATTGCAGCAGCTCGAGCAGTTTACGAGCTCAGCTTCTGCGCAAAAGCAAGAAAAGAAATGGGGGAATCTGGATTCATTACACCCTTAATTAATATGCTGGATGGTAAATCTGTTGATGAGAAAACAGCAGCTGCTAAGGCGTTGTCTTCTCTATTACAATATAATGGTAACAGAAGAATTTTCCAGAAAGAGGAGAGGGGAATTGTAAGTGCAGTTCATCTCTTAGATCCTTCAATCTTGAATCTGGATAAGAAGTACCCTGTTTCATTATTAGCCTCGGTTGTGATTTCAAGCAAGTGTAGAAAGCTGATGGCTGCTGCTGGTGCTGCTTTGTATCTACAAAAGCTTGTTGAAATGAATGTTGAGGGGTCAAAGAAGCTGTTGGTAAGTCTTAGCCGTGCTAAAATCTGGGGTGTCTTTGCCAGATCTTAG

Coding sequence (CDS)

ATGAAAGTACCTCCAGAAAACAGCCATTTCCTTCTCTCTAACAATCTCATTTCTTCTGTTCTTGATGATATTCCACTCATCAACGATTTCAAGGGCAAATGGTCTTCCATGGCCACCAAACTCTCGGATCTTCGCGCTCAATTGATCGATGTTTCTGAGTTTCCCAACTCTTCTTCCAATCCGCTCTCTCTCGATTTCCTTCATTCTGTTATGGAAACTCTTACTCAGGCGGCTTCTCTCTCGCACAAGTGCCGCAATCCGGGAGTTTCCGATGGTAAACTCAAGACTCAAAGCGATATTCTCTCCGTTCTCACGAAGTTAGACTGCCTACTCAAAGATGGTGATGTGTTGATTAAGAGTGAGATTCTTCACGACGGTGTGATTTCGAGTTCCTCGTCTAGAAGGGAGGCCGTGCGGGCGGAGTCCAGGAATTTGATCACTAGGTTACAGATTGGAAGCATTGAATCCAGAGTATTGGCTATTGATTCGCTGTTGCAGTTGTTGAATGAGGATGATAAGAATGTCACCATTGCTGCGGCTCAAGGGGCTGTTCCTGTTCTGGTTCGGCTACTGGATTCCAGTTCTTTAGAATTGAAGGAGAAGGCTGTTGCTGCTATTTCCATTGTTTCTACGGTGGATGGTGTTAAGAATGTAATGATTGCTGAAGGAATCGTGCTTTTGAATCACTTGCTGAGGATTCTCGATTCTGGTAGCGGTTTTGCAAAAGAGAAGTCCTGTTTAGCTCTCCAATCTCTGAGTATTTCCAGGCAAAATGCTAGGTCAATCGGTTCCAGAGGAGGGATTTCATCTCTGTTGGAGATTTGTGAGGCCGGAACTCCCGGTTCTCAAGCCTCTGCAGCTGCGGTTTTGAGAAATCTTGCATCATTTAACGAAATCAAGGAGAATTTCATCGAAGAAAATGGGGTTATAGTTCTTTTGGGGCTTTTGACCTCTGGAACTCCATTGGCTCAAGAAAATGCAATTGGTTGTTTGTGTAATTTAGTTGTAGATGATGATAATTTGAAGCTCTTGATCGTTAAAGAAGGTGGGATCGAGTTCTTGAAAATTTTCTGGGATTCGGTTCCATCAGTTCGTAGTCTTGGAGTGGCTGTGGAGCTTTTGAGCCTCTTGGCTTCTTATTCCCCCATTGCAGAAACTCTTATTTCAGAGGGATTTCTTGATCGGCTTCTTCCAGTTTTGAGTTGCGGAGTATTAGGTGCGCGAATTGCAGCAGCTCGAGCAGTTTACGAGCTCAGCTTCTGCGCAAAAGCAAGAAAAGAAATGGGGGAATCTGGATTCATTACACCCTTAATTAATATGCTGGATGGTAAATCTGTTGATGAGAAAACAGCAGCTGCTAAGGCGTTGTCTTCTCTATTACAATATAATGGTAACAGAAGAATTTTCCAGAAAGAGGAGAGGGGAATTGTAAGTGCAGTTCATCTCTTAGATCCTTCAATCTTGAATCTGGATAAGAAGTACCCTGTTTCATTATTAGCCTCGGTTGTGATTTCAAGCAAGTGTAGAAAGCTGATGGCTGCTGCTGGTGCTGCTTTGTATCTACAAAAGCTTGTTGAAATGAATGTTGAGGGGTCAAAGAAGCTGTTGGTAAGTCTTAGCCGTGCTAAAATCTGGGGTGTCTTTGCCAGATCTTAG

Protein sequence

MKVPPENSHFLLSNNLISSVLDDIPLINDFKGKWSSMATKLSDLRAQLIDVSEFPNSSSNPLSLDFLHSVMETLTQAASLSHKCRNPGVSDGKLKTQSDILSVLTKLDCLLKDGDVLIKSEILHDGVISSSSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAAQGAVPVLVRLLDSSSLELKEKAVAAISIVSTVDGVKNVMIAEGIVLLNHLLRILDSGSGFAKEKSCLALQSLSISRQNARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFNEIKENFIEENGVIVLLGLLTSGTPLAQENAIGCLCNLVVDDDNLKLLIVKEGGIEFLKIFWDSVPSVRSLGVAVELLSLLASYSPIAETLISEGFLDRLLPVLSCGVLGARIAAARAVYELSFCAKARKEMGESGFITPLINMLDGKSVDEKTAAAKALSSLLQYNGNRRIFQKEERGIVSAVHLLDPSILNLDKKYPVSLLASVVISSKCRKLMAAAGAALYLQKLVEMNVEGSKKLLVSLSRAKIWGVFARS
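
As a quick consistency check between the coding sequence and the protein sequence, the first and last codons of the CDS can be translated by hand or with a few lines of code. The sketch below assumes the standard nuclear genetic code; `translate` is an illustrative helper, not part of any tool used to produce this record, and only two short fragments copied from the CDS above are translated.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


head = translate("ATGAAAGTACCTCCAGAAAAC")  # first 21 nt of the CDS
tail = translate("TTTGCCAGATCTTAG")        # last 15 nt of the CDS
print(head, tail)
```

The two fragments translate to MKVPPEN and FARS, matching the start and end of the protein sequence listed above.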
BLAST of Cp4.1LG20g06260 vs. Swiss-Prot
Match: PUB4_ARATH (U-box domain-containing protein 4 OS=Arabidopsis thaliana GN=PUB4 PE=1 SV=3)

HSP 1 Score: 84.7 bits (208), Expect = 3.4e-15
Identity = 81/293 (27.65%), Positives = 133/293 (45.39%), Query Frame = 1

Query: 120 SEILHDGVIS--SSSSRREAVRAES--RNLITRLQIGSIESRVLAIDSLLQLLNEDDKNV 179
           SE L   ++S  S+ +RR+    E+  + L+  L+  S++++  A   L  L   +  N 
Sbjct: 517 SERLGSRIVSAPSNETRRDLSEVETQVKKLVEELKSSSLDTQRQATAELRLLAKHNMDNR 576

Query: 180 TIAAAQGAVPVLVRLLDSSSLELKEKAVAAISIVSTVDGVKNVMIAEGIVLLNHLLRILD 239
            +    GA+ +LV LL S+    +E AV A+  +S  D  K  +   G +    L+ +L+
Sbjct: 577 IVIGNSGAIVLLVELLYSTDSATQENAVTALLNLSINDNNKKAIADAGAI--EPLIHVLE 636

Query: 240 SGSGFAKEKSCLALQSLSISRQNARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLAS 299
           +GS  AKE S   L SLS+  +N   IG  G I  L+++   GTP  +  AA  L NL+ 
Sbjct: 637 NGSSEAKENSAATLFSLSVIEENKIKIGQSGAIGPLVDLLGNGTPRGKKDAATALFNLSI 696

Query: 300 FNEIKENFIEENGVIVLLGLLTSGTPLAQENAIGCLCNLVVDDDNLKLLIVKEGGIEFLK 359
             E K   ++   V  L+ L+     +  + A+  L NL    +  +  I +EGGI  L 
Sbjct: 697 HQENKAMIVQSGAVRYLIDLMDPAAGMV-DKAVAVLANLATIPEG-RNAIGQEGGIPLLV 756

Query: 360 IFWDSVPSVRSLGVAVELLSLLASYSPIAETLISEGFLDRLLPVLSCGVLGAR 409
              +   +      A  LL L  +       ++ EG +  L+ +   G   AR
Sbjct: 757 EVVELGSARGKENAAAALLQLSTNSGRFCNMVLQEGAVPPLVALSQSGTPRAR 805
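
The percentages in an HSP header are simply the match counts divided by the alignment length. A minimal sketch, using the counts reported for the PUB4_ARATH hit above (the function name `percent` is illustrative):

```python
def percent(matches: int, aligned_length: int) -> float:
    """Share of aligned columns that match, as a percentage (2 decimals)."""
    return round(100.0 * matches / aligned_length, 2)


# Counts from the HSP header: 81 identical and 133 positive columns out of 293.
identity_pct = percent(81, 293)
positives_pct = percent(133, 293)

print(identity_pct, positives_pct)
```

The alignment length in the denominator includes gap columns, which is why it can exceed the number of query residues covered by the HSP.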

BLAST of Cp4.1LG20g06260 vs. Swiss-Prot
Match: PUB11_ARATH (U-box domain-containing protein 11 OS=Arabidopsis thaliana GN=PUB11 PE=2 SV=2)

HSP 1 Score: 84.0 bits (206), Expect = 5.8e-15
Identity = 75/282 (26.60%), Positives = 130/282 (46.10%), Query Frame = 1

Query: 143 RNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAAQGAVPVLVRLLDSSSLELKEKA 202
           R L+ RL   S E R  A+  +  L      N  + A  GA+PVLV LL S  +  +E A
Sbjct: 334 RALVQRLSSRSTEDRRNAVSEIRSLSKRSTDNRILIAEAGAIPVLVNLLTSEDVATQENA 393

Query: 203 VAAISIVSTVDGVKNVMIAEGIVLLNHLLRILDSGSGFAKEKSCLALQSLSISRQNARSI 262
           +  +  +S  +  K +++  G V    ++++L +G+  A+E +   L SLS++ +N   I
Sbjct: 394 ITCVLNLSIYENNKELIMFAGAV--TSIVQVLRAGTMEARENAAATLFSLSLADENKIII 453

Query: 263 GSRGGISSLLEICEAGTPGSQASAAAVLRNLASFNEIKENFIEENGVIVLLGLLTSGTPL 322
           G  G I +L+++ E GTP  +  AA  L NL  ++  K   +    V  L+ +L+  T  
Sbjct: 454 GGSGAIPALVDLLENGTPRGKKDAATALFNLCIYHGNKGRAVRAGIVTALVKMLSDSTRH 513

Query: 323 AQENAIGCLCNLVVDDDNLKLLIVKEGGIEFLKIFWDSVPSVRSLGVAVELLSLLASYSP 382
              +    + +++ ++ + K  IVK   +  L     +  +      A  LLSL    + 
Sbjct: 514 RMVDEALTILSVLANNQDAKSAIVKANTLPALIGILQTDQTRNRENAAAILLSLCKRDT- 573

Query: 383 IAETLISEGFLDRLLPVLSCGVLGARIAAARAVYELSFCAKA 425
             E LI+ G L  ++P++     G      +A+  L    KA
Sbjct: 574 --EKLITIGRLGAVVPLMDLSKNGTERGKRKAISLLELLRKA 610

BLAST of Cp4.1LG20g06260 vs. Swiss-Prot
Match: PUB10_ARATH (U-box domain-containing protein 10 OS=Arabidopsis thaliana GN=PUB10 PE=2 SV=1)

HSP 1 Score: 77.4 bits (189), Expect = 5.4e-13
Identity = 81/307 (26.38%), Positives = 136/307 (44.30%), Query Frame = 1

Query: 125 DGVISSSSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAAQGAV 184
           DG     S    A+RA    L+ +L   SIE R  A+  +  L      N  + A  GA+
Sbjct: 330 DGSFRDLSGDMSAIRA----LVCKLSSQSIEDRRTAVSEIRSLSKRSTDNRILIAEAGAI 389

Query: 185 PVLVRLLDSSS-LELKEKAVAAISIVSTVDGVKNVMIAEGIVLLNHLLRILDSGSGFAKE 244
           PVLV+LL S    E +E AV  I  +S  +  K +++  G V    ++ +L +GS  A+E
Sbjct: 390 PVLVKLLTSDGDTETQENAVTCILNLSIYEHNKELIMLAGAV--TSIVLVLRAGSMEARE 449

Query: 245 KSCLALQSLSISRQNARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFNEIKENF 304
            +   L SLS++ +N   IG+ G I +L+++ + G+   +  AA  L NL  +   K   
Sbjct: 450 NAAATLFSLSLADENKIIIGASGAIMALVDLLQYGSVRGKKDAATALFNLCIYQGNKGRA 509

Query: 305 IEENGVIVLLGLLTSGTPLAQENAIGCLCNLVVDDDNLKLLIVKEGGI-EFLKIFWDSVP 364
           +    V  L+ +LT  +     +    + +++  +   K  I++   I   +       P
Sbjct: 510 VRAGIVKPLVKMLTDSSSERMADEALTILSVLASNQVAKTAILRANAIPPLIDCLQKDQP 569

Query: 365 SVRSLGVAVELLSLLASYSPIAETLISEGFLDRLLPVLSCGVLGARIAAARAVYELSFCA 424
             R    A+    LL       E LIS G L  ++P++     G   A  +A   L    
Sbjct: 570 RNRENAAAI----LLCLCKRDTEKLISIGRLGAVVPLMELSRDGTERAKRKANSLLELLR 626

Query: 425 KARKEMG 430
           K+ +++G
Sbjct: 630 KSSRKLG 626
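
The middle line of each alignment block encodes how the counts in the HSP header arise: in standard BLAST pairwise output a letter marks an identical residue, `+` marks a substitution with a positive matrix score, and a space marks everything else. The sketch below tallies one midline; `score_midline` is a hypothetical helper, and the example string is the midline of the final block above (query KARKEMG against subject KSSRKLG), assuming its spacing has survived extraction intact.

```python
def score_midline(midline: str) -> tuple[int, int]:
    """Return (identities, positives) for one BLAST alignment midline."""
    identities = sum(ch.isalpha() for ch in midline)
    # Positives include the identities plus the '+' columns.
    positives = identities + midline.count("+")
    return identities, positives


identities, positives = score_midline("K+ +++G")
print(identities, positives)
```

Summing these per-block tallies over every block of an HSP should reproduce the Identity and Positives counts in its header.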

BLAST of Cp4.1LG20g06260 vs. Swiss-Prot
Match: PUB14_ARATH (U-box domain-containing protein 14 OS=Arabidopsis thaliana GN=PUB14 PE=1 SV=1)

HSP 1 Score: 67.8 bits (164), Expect = 4.3e-10
Identity = 62/240 (25.83%), Positives = 105/240 (43.75%), Query Frame = 1

Query: 132 SSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAAQGAVPVLVRLL 191
           SS  +  R    +L+ +L  G+ E +  A   L  L   +  N    A  GA+P+LV LL
Sbjct: 337 SSSSDCDRTFVLSLLEKLANGTTEQQRAAAGELRLLAKRNVDNRVCIAEAGAIPLLVELL 396

Query: 192 DSSSLELKEKAVAAISIVSTVDGVKNVMIAEGIVLLNHLLRILDSGSGFAKEKSCLALQS 251
            S     +E +V A+  +S  +G K  ++  G +    ++ +L +GS  A+E +   L S
Sbjct: 397 SSPDPRTQEHSVTALLNLSINEGNKGAIVDAGAI--TDIVEVLKNGSMEARENAAATLFS 456

Query: 252 LSISRQNARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFNEIKENFIEENGVIV 311
           LS+  +N  +IG+ G I +L+ + E GT   +  AA  + NL  +   K   ++   V  
Sbjct: 457 LSVIDENKVAIGAAGAIQALISLLEEGTRRGKKDAATAIFNLCIYQGNKSRAVKGGIVDP 516

Query: 312 LLGLLTSGTPLAQENAIGCLCNLVVDDDNLKLLIVKEGGIEFLKIFWDSVPSVRSLGVAV 371
           L  LL        + A+  L  L  + +    +   E     ++I     P  R    A+
Sbjct: 517 LTRLLKDAGGGMVDEALAILAILSTNQEGKTAIAEAESIPVLVEIIRTGSPRNRENAAAI 574

BLAST of Cp4.1LG20g06260 vs. Swiss-Prot
Match: SL11_ORYSJ (E3 ubiquitin-protein ligase SPL11 OS=Oryza sativa subsp. japonica GN=SPL11 PE=1 SV=2)

HSP 1 Score: 67.0 bits (162), Expect = 7.3e-10
Identity = 61/201 (30.35%), Positives = 98/201 (48.76%), Query Frame = 1

Query: 131 SSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAAQGAVPVLVRL 190
           SSS R  + A    L+++L     E +  A   L  L   +  N    A  GA+P+L+ L
Sbjct: 362 SSSERANIDA----LLSKLCSPDTEEQRSAAAELRLLAKRNANNRICIAEAGAIPLLLSL 421

Query: 191 LDSSSLELKEKAVAAISIVSTVDGVKNVMIAEGIVLLNHLLRILDSGSGFAKEKSCLALQ 250
           L SS L  +E AV A+  +S  +  K  +I+ G V    ++ +L +GS  A+E +   L 
Sbjct: 422 LSSSDLRTQEHAVTALLNLSIHEDNKASIISSGAV--PSIVHVLKNGSMEARENAAATLF 481

Query: 251 SLSISRQNARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFNEIKENFIEENGVI 310
           SLS+  +   +IG  G I +L+ +   G+   +  AAA L NL  +   K   I    V 
Sbjct: 482 SLSVIDEYKVTIGGMGAIPALVVLLGEGSQRGKKDAAAALFNLCIYQGNKGRAIRAGLVP 541

Query: 311 VLLGLLTSGTPLAQENAIGCL 332
           +++GL+T+ T    + A+  L
Sbjct: 542 LIMGLVTNPTGALMDEAMAIL 556

BLAST of Cp4.1LG20g06260 vs. TrEMBL
Match: A0A0A0M0M7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G699580 PE=4 SV=1)

HSP 1 Score: 918.7 bits (2373), Expect = 3.4e-264
Identity = 487/551 (88.38%), Positives = 516/551 (93.65%), Query Frame = 1

Query: 1   MKVPPENSHFLLSNNLISSVLDDIPLINDFKGKWSSMATKLSDLRAQLIDVSEFPNSSSN 60
           MK+PPE  HFLLSNNLISS+LDDIPLI  FKGKWSS+  KLSDLR QLIDVS FPNSSSN
Sbjct: 1   MKIPPETDHFLLSNNLISSLLDDIPLITIFKGKWSSIRAKLSDLRTQLIDVSHFPNSSSN 60

Query: 61  PLSLDFLHSVMETLTQAASLSHKCRNPGVSDGKLKTQSDILSVLTKLDCLLKDGDVLIKS 120
           PLSLDFLHSV+E LTQAASLSHKCRNP +SDGKLKTQSDI ++L K D LLKDG+VLI+S
Sbjct: 61  PLSLDFLHSVLEALTQAASLSHKCRNPALSDGKLKTQSDIDAILAKFDSLLKDGEVLIRS 120

Query: 121 EILHDGVISSSSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAA 180
           EILHDGV+SSSSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAA
Sbjct: 121 EILHDGVVSSSSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAA 180

Query: 181 QGAVPVLVRLLDSSSLELKEKAVAAISIVSTVDGVKNVMIAEGIVLLNHLLRILDSGSGF 240
           QGAVPVLVRLLDSSSLELKE+AVAAISIVS VDGVK++MIAEG+VLLNHLLRILDSGSGF
Sbjct: 181 QGAVPVLVRLLDSSSLELKERAVAAISIVSMVDGVKHIMIAEGLVLLNHLLRILDSGSGF 240

Query: 241 AKEKSCLALQSLSISRQNARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFNEIK 300
           AKEK+CLALQ LSIS++NARSIGSRGGISSLLEICE GTPGSQASAAAVLRNLASF+EIK
Sbjct: 241 AKEKACLALQPLSISKENARSIGSRGGISSLLEICEGGTPGSQASAAAVLRNLASFSEIK 300

Query: 301 ENFIEENGVIVLLGLLTSGTPLAQENAIGCLCNLVVDDDNLKLLIVKEGGIEFLKIFWDS 360
           ENFIEENGVIVLLGLL SGTPLAQENAIGCLCNLV+DDDNLKLLIV+EGGIEFL+ FWDS
Sbjct: 301 ENFIEENGVIVLLGLLASGTPLAQENAIGCLCNLVLDDDNLKLLIVREGGIEFLRNFWDS 360

Query: 361 VPSVRSLGVAVELLSLLASYSPIAETLISEGFLDRLLPVLSCGVLGARIAAARAVYELSF 420
           VPSVRSL VAVELLSLLASYSPIAE LIS+GF+DRLLPVLSCGVLGAR AAARAVYEL F
Sbjct: 361 VPSVRSLEVAVELLSLLASYSPIAEALISDGFVDRLLPVLSCGVLGARTAAARAVYELGF 420

Query: 421 CAKARKEMGESGFITPLINMLDGKSVDEKTAAAKALSSLLQYNGNRRIFQKEERGIVSAV 480
           C K RKEMGESGFITPL+NMLDGKSVDE+ AAAKALSSLLQY+GNR+IFQKEERGIVSAV
Sbjct: 421 CTKTRKEMGESGFITPLVNMLDGKSVDERKAAAKALSSLLQYSGNRKIFQKEERGIVSAV 480

Query: 481 HLLDPSILNLDKKYPVSLLASVVISSKCRKLMAAAGAALYLQKLVEMNVEGSKKLLVSLS 540
            LLDPSI NLDKKYPVSLL+SV ISSKCRK M AAGA LYLQKLVE+NVEGSKKLL SL 
Sbjct: 481 QLLDPSISNLDKKYPVSLLSSVAISSKCRKQMVAAGAGLYLQKLVEINVEGSKKLLESLG 540

Query: 541 RAKIWGVFARS 552
           R KIWGVFARS
Sbjct: 541 RGKIWGVFARS 551

BLAST of Cp4.1LG20g06260 vs. TrEMBL
Match: A0A061G6D5_THECC (ARM repeat superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_014724 PE=4 SV=1)

HSP 1 Score: 712.2 bits (1837), Expect = 4.8e-202
Identity = 380/550 (69.09%), Positives = 458/550 (83.27%), Query Frame = 1

Query: 1   MKVPPENSHFLLSNNLISSVLDDIPLINDFKGKWSSMATKLSDLRAQLIDVSEFPNSSSN 60
           MKVP EN    LSN+L++S+ + IP IN+FKGKW+ + +KLS L+AQL D S+FP SSSN
Sbjct: 1   MKVP-ENDPISLSNHLLASLSEQIPNINNFKGKWALIKSKLSGLQAQLADFSDFPASSSN 60

Query: 61  PLSLDFLHSVMETLTQAASLSHKCRNPGVSDGKLKTQSDILSVLTKLDCLLKDGDVLIKS 120
           PL++D L+S+ +TL  A SLS KC+   +++GKLKTQSDI +VL KLD  +KD ++LI+S
Sbjct: 61  PLAVDLLYSITQTLNDAVSLSQKCQLADLTEGKLKTQSDIDAVLAKLDRHIKDSEILIRS 120

Query: 121 EILHDGVISSSSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAA 180
            +L DG +S+SSS++EAVR ESRNLITRLQIG+ ES+  A+DSLL LL EDDKNV IA A
Sbjct: 121 GVLQDGAVSTSSSKKEAVRVESRNLITRLQIGTTESKNSAMDSLLGLLQEDDKNVMIAVA 180

Query: 181 QGAVPVLVRLLDSSSLELKEKAVAAISIVSTVDGVKNVMIAEGIVLLNHLLRILDSGSGF 240
           QG VPVLVRLLDSSSLE+KEK VAAIS VSTV+  K+V+IAEG++LLNHLLR+L+SGSGF
Sbjct: 181 QGVVPVLVRLLDSSSLEMKEKTVAAISRVSTVESSKHVLIAEGLLLLNHLLRVLESGSGF 240

Query: 241 AKEKSCLALQSLSISRQNARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFNEIK 300
           AKEK+C+ALQ+LS S++NAR+IGSRGGISSLLEIC+AGTPGSQA AA VL+NLAS +EIK
Sbjct: 241 AKEKACIALQALSFSKENARAIGSRGGISSLLEICQAGTPGSQAFAAGVLKNLASVDEIK 300

Query: 301 ENFIEENGVIVLLGLLTSGTPLAQENAIGCLCNLVVDDDNLKLLIVKEGGIEFLKIFWDS 360
           ENFIEEN V VL+GL  SGT LAQEN+IGCLCNLV DD+NL+LLIVKEGGIE LK FWDS
Sbjct: 301 ENFIEENAVFVLIGLAASGTALAQENSIGCLCNLVSDDENLRLLIVKEGGIECLKNFWDS 360

Query: 361 VPSVRSLGVAVELLSLLASYSPIAETLISEGFLDRLLPVLSCGVLGARIAAARAVYELSF 420
            P+ +SL VAVEL+  LAS SPIAE L+++GF+ RL+ VL+CGVLG RIAAARAVYEL F
Sbjct: 361 SPNPKSLEVAVELVRRLASCSPIAEALVADGFVARLVAVLNCGVLGVRIAAARAVYELGF 420

Query: 421 CAKARKEMGESGFITPLINMLDGKSVDEKTAAAKALSSLLQYNGNRRIFQKEERGIVSAV 480
            +K RKEMGE G    LI M+DGK+V+EK AAA ALS+L+ Y GNR++FQK+ERGIV+AV
Sbjct: 421 NSKTRKEMGECGCTVALIKMMDGKAVEEKEAAAMALSTLMLYAGNRKVFQKDERGIVNAV 480

Query: 481 HLLDPSILNLDKKYPVSLLASVVISSKCRKLMAAAGAALYLQKLVEMNVEGSKKLLVSLS 540
            LLDP I NLDKKYPV +L+ +V S KCRK M AAGA +YLQKLVEMNVEG+KKLL SL 
Sbjct: 481 QLLDPLIQNLDKKYPVLILSELVHSKKCRKQMVAAGACVYLQKLVEMNVEGAKKLLESLG 540

Query: 541 RAKIWGVFAR 551
           R KIWGVFAR
Sbjct: 541 RGKIWGVFAR 549

BLAST of Cp4.1LG20g06260 vs. TrEMBL
Match: A0A067JIA0_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26389 PE=4 SV=1)

HSP 1 Score: 685.3 bits (1767), Expect = 6.3e-194
Identity = 370/551 (67.15%), Positives = 435/551 (78.95%), Query Frame = 1

Query: 1   MKVPPENSHFLLSNNLISSVLDDIPLINDFKGKWSSMATKLSDLRAQLIDVSEFPNSSSN 60
           MKVP EN     ++ L+ S+LD+IP +  FKGKW+ +  KL+DL+ QL D ++FP S+SN
Sbjct: 1   MKVP-ENDPINANDQLLQSLLDEIPHVQTFKGKWALIRAKLADLQTQLTDFADFPASTSN 60

Query: 61  PLSLDFLHSVMETLTQAASLSHKCRNPGVSDGKLKTQSDILSVLTKLDCLLKDGDVLIKS 120
           PL LD LHS+  +L  A  L+ KCR P  ++GKL+TQSD+ S+L KLD  +KD ++LIKS
Sbjct: 61  PLCLDLLHSISNSLNDAVLLARKCRTPNFTEGKLRTQSDVDSILAKLDRHVKDSEILIKS 120

Query: 121 EILHDGVIS-SSSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAA 180
            +L DG  S  SSS+REAVR ESRNLITRLQIGS ES+  A+DSLL LL EDDKNV IA 
Sbjct: 121 GVLQDGATSVGSSSKREAVRVESRNLITRLQIGSSESKNSAMDSLLGLLQEDDKNVMIAV 180

Query: 181 AQGAVPVLVRLLDSSSLELKEKAVAAISIVSTVDGVKNVMIAEGIVLLNHLLRILDSGSG 240
           AQG VPVLVRLLDSSSLE+KEK VAAIS VS VD  K+V+IAEG++LLNHLLR+L+SGSG
Sbjct: 181 AQGVVPVLVRLLDSSSLEMKEKTVAAISRVSMVDSSKHVLIAEGLLLLNHLLRVLESGSG 240

Query: 241 FAKEKSCLALQSLSISRQNARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFNEI 300
           FAKEK+C+ALQ+LS S++NAR+IGSRGGISSLLEIC+ GTPGSQA AA VLRNLA F EI
Sbjct: 241 FAKEKACVALQALSFSKENARAIGSRGGISSLLEICQGGTPGSQAFAAGVLRNLAVFEEI 300

Query: 301 KENFIEENGVIVLLGLLTSGTPLAQENAIGCLCNLVVDDDNLKLLIVKEGGIEFLKIFWD 360
           +ENFIEEN V VL+GL  SGT LAQENAIGCLCNL  DD+NLKLLIVKEGG+E L+ FWD
Sbjct: 301 RENFIEENAVFVLIGLAASGTALAQENAIGCLCNLAKDDENLKLLIVKEGGVECLRNFWD 360

Query: 361 SVPSVRSLGVAVELLSLLASYSPIAETLISEGFLDRLLPVLSCGVLGARIAAARAVYELS 420
           S P VRSL VAV+LL  LAS   IAE L+S+GF+ RL+  L+CGVLG RIA A A+YEL 
Sbjct: 361 SGPPVRSLEVAVDLLRNLASNQAIAEVLVSDGFVSRLMVFLNCGVLGVRIATAEAIYELG 420

Query: 421 FCAKARKEMGESGFITPLINMLDGKSVDEKTAAAKALSSLLQYNGNRRIFQKEERGIVSA 480
           F  K RKEMGE   I PLINMLDGK+V EK AAAKALS LL Y GNR+ F+K+ERGIV  
Sbjct: 421 FNTKTRKEMGECEVIVPLINMLDGKAVVEKEAAAKALSHLLLYAGNRKTFRKDERGIVYT 480

Query: 481 VHLLDPSILNLDKKYPVSLLASVVISSKCRKLMAAAGAALYLQKLVEMNVEGSKKLLVSL 540
           V LLDPSI NLDKKYPVS+LAS+V S KCRK M  AGA ++L+ L EM +EG+KKLL  L
Sbjct: 481 VQLLDPSIQNLDKKYPVSILASLVQSKKCRKQMIGAGACVHLKTLAEMEIEGAKKLLDGL 540

Query: 541 SRAKIWGVFAR 551
            R KIWGVFAR
Sbjct: 541 GRGKIWGVFAR 550

BLAST of Cp4.1LG20g06260 vs. TrEMBL
Match: A0A067DZT1_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g008560mg PE=4 SV=1)

HSP 1 Score: 678.7 bits (1750), Expect = 5.9e-192
Identity = 371/558 (66.49%), Positives = 447/558 (80.11%), Query Frame = 1

Query: 5   PENSHFLLSNNLISSVLDDIPLINDFKGKWSSMATKLSDLRAQLIDVSEFPNSSSNPLSL 64
           PE     LS   +SS+LD IPL+  FKGKW  + TKL+DL  QL D S+FP ++SN L L
Sbjct: 4   PETDPINLSTQHLSSLLDQIPLVKHFKGKWVIVKTKLNDLETQLKDFSDFPAAASNTLCL 63

Query: 65  DFLHSVMETLTQAASLSHKCRNPGVSDGKLKTQSDILSVLTKLDC------------LLK 124
           D +HSV  TL +AAS++ KC+   +++GKLKTQSDI SVL KLD             +L+
Sbjct: 64  DHVHSVSHTLIEAASVAQKCQGVSLTEGKLKTQSDIDSVLAKLDRHVRDGDVLIKSGVLQ 123

Query: 125 DGDVLIKSEILHDGVISSSSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDD 184
           DGDVLIKS +L DGV+SS S +REAVRAESRNLITRLQIGS ES+  A+DSLL LL EDD
Sbjct: 124 DGDVLIKSGVLQDGVVSSGS-KREAVRAESRNLITRLQIGSAESKNSAMDSLLGLLQEDD 183

Query: 185 KNVTIAAAQGAVPVLVRLLDSSSLELKEKAVAAISIVSTVDGVKNVMIAEGIVLLNHLLR 244
           KNV IA AQG VPVLV+L+DSSSLE+KEK VA+I+ VS VD  K+V+IAEG++LLNHL+R
Sbjct: 184 KNVVIAVAQGVVPVLVKLMDSSSLEMKEKTVASIARVSMVDSSKHVLIAEGLLLLNHLIR 243

Query: 245 ILDSGSGFAKEKSCLALQSLSISRQNARSIGSRGGISSLLEICEAGTPGSQASAAAVLRN 304
           +L+SGSGFAKE++C+ALQ+LS S++NAR+IGSRGGISSLLEIC+AGTPGSQA AA VLRN
Sbjct: 244 VLESGSGFAKERACVALQALSFSKENARAIGSRGGISSLLEICQAGTPGSQAFAAGVLRN 303

Query: 305 LASFNEIKENFIEENGVIVLLGLLTSGTPLAQENAIGCLCNLVVDDDNLKLLIVKEGGIE 364
           LA F+EIKENFIEEN V+VLLGL+ SGT LAQEN  GCLCNLV DD++LKLLIV+EGGI 
Sbjct: 304 LAGFSEIKENFIEENAVMVLLGLVASGTALAQENVFGCLCNLVSDDESLKLLIVREGGIG 363

Query: 365 FLKIFWDSVPSVRSLGVAVELLSLLASYSPIAETLISEGFLDRLLPVLSCGVLGARIAAA 424
            LK +WDSV +V+SL VAVELLS LAS  PIAE L+S+GF+ RL+ VL+CGVL  RIAAA
Sbjct: 364 SLKSYWDSVSAVKSLEVAVELLSQLASCLPIAEVLVSDGFVVRLVNVLNCGVLSVRIAAA 423

Query: 425 RAVYELSFCAKARKEMGESGFITPLINMLDGKSVDEKTAAAKALSSLLQYNGNRRIFQKE 484
           RAV  L   +KARKEMGE G I PLI MLDGK+V+EK +AAKALS+L+ Y GNR+I +K+
Sbjct: 424 RAVSMLGINSKARKEMGECGCIGPLIKMLDGKAVEEKESAAKALSTLMLYAGNRKILRKD 483

Query: 485 ERGIVSAVHLLDPSILNLDKKYPVSLLASVVISSKCRKLMAAAGAALYLQKLVEMNVEGS 544
           ERGIV+ V LLDP I NLDKKYPV++LA++V   KCRK M AAGA L+L+KLVEM++EG+
Sbjct: 484 ERGIVTVVQLLDPLIQNLDKKYPVAILAALVHCRKCRKQMVAAGACLHLRKLVEMDIEGA 543

Query: 545 KKLLVSLSRAKIWGVFAR 551
            KLL SL R KIWGVFAR
Sbjct: 544 NKLLESLGRGKIWGVFAR 560

BLAST of Cp4.1LG20g06260 vs. TrEMBL
Match: W9QV83_9ROSA (U-box domain-containing protein 11 OS=Morus notabilis GN=L484_008839 PE=4 SV=1)

HSP 1 Score: 678.3 bits (1749), Expect = 7.7e-192
Identity = 371/558 (66.49%), Positives = 451/558 (80.82%), Query Frame = 1

Query: 1   MKVPPENSHFL-LSNNLISSVLDDIPLINDFKGKWSSMATKLSDLRAQLIDVSEFPNSSS 60
           MK P E +    +S  L+SS++D+I L+  FKGKWS +  KL DLR QL D ++ P+++S
Sbjct: 1   MKAPEEEADTTAISTELLSSLMDEILLVQTFKGKWSLIRAKLDDLRPQLADFADSPDAAS 60

Query: 61  NPLSLDFLHSVMETLTQAASLSHKCRNPGVSDGKLKTQSDILSVLTKLDCLLKDGDVLIK 120
           NPLS+D L SV   L+ A S++ +C++P ++DGKL+TQSD+ +VL +LD +++DG++L++
Sbjct: 61  NPLSIDLLRSVAAALSDAISVARRCQSPSLADGKLRTQSDVDAVLARLDRVVRDGEILLR 120

Query: 121 SEILHDG---VISSS-----SSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNED 180
           S +L D    V+S+S     SSRREAVRAESRNLITRLQIG+ ESR  A+DSLL LL ED
Sbjct: 121 SGVLSDNNRAVVSNSGNSGSSSRREAVRAESRNLITRLQIGTPESRNSAMDSLLGLLRED 180

Query: 181 DKNVTIAAAQGAVPVLVRLLDS-SSLELKEKAVAAISIVSTVDGVKNVMIAEGIVLLNHL 240
           DKNV IA AQG VPV VRLLDS SS+E+KEK VAAIS VS VD  K+V+IAEG++LLNHL
Sbjct: 181 DKNVMIAVAQGVVPVFVRLLDSSSSVEMKEKTVAAISRVSMVDSSKHVLIAEGLLLLNHL 240

Query: 241 LRILDSGSGFAKEKSCLALQSLSISRQNARSIGSRGGISSLLEICEAGTPGSQASAAAVL 300
           LR+LDSGSGF+KEK+C+ALQ+LS S++NAR+IGSRGGISSLLEIC+AGTP SQASAA VL
Sbjct: 241 LRVLDSGSGFSKEKACVALQALSFSKENARAIGSRGGISSLLEICQAGTPCSQASAAGVL 300

Query: 301 RNLASFNEIKENFIEENGVIVLLGLLTSGTPLAQENAIGCLCNLVVDDDNLKLLIVKEGG 360
           RNLA+F EIKENFIEENG+ VLLGL +SGT LAQENAIGCLCNL+  D+NLKLL+VKEGG
Sbjct: 301 RNLAAFAEIKENFIEENGIAVLLGLTSSGTALAQENAIGCLCNLISGDENLKLLVVKEGG 360

Query: 361 IEFLKIFWDSVPSVRSLGVAVELLSLLASYSPIAETLISEGFLDRLLPVLSCGVLGARIA 420
           IE LK FWDS PSVRSL VAV+LLS LAS  P+AE L S+GF+ RL+ VL+CGVLG RIA
Sbjct: 361 IECLKNFWDSAPSVRSLEVAVDLLSHLASLLPVAEALCSDGFVARLVSVLNCGVLGVRIA 420

Query: 421 AARAVYELSFCAKARKEMGESGFITPLINMLDGKSVDEKTAAAKALSSLLQYNGNRRIFQ 480
           AARAV EL   ++ RKEMGE G I PLI MLDGK+V EK AAAKALS L+    NR+IF+
Sbjct: 421 AARAVSELGSSSRTRKEMGECGCIGPLIKMLDGKAVQEKEAAAKALSKLMLCTVNRKIFR 480

Query: 481 KEERGIVSAVHLLDPSILNLDKKYPVSLLASVVISSKCRKLMAAAGAALYLQKLVEMNVE 540
           ++E+GIVSAV LLDPS+ NLDKKYPVS+LAS+  S KCRK M AAGA  YLQK+VEM+VE
Sbjct: 481 RDEKGIVSAVQLLDPSLRNLDKKYPVSVLASLSHSKKCRKQMVAAGACAYLQKVVEMDVE 540

Query: 541 GSKKLLVSLSRAKIWGVF 549
           GSKKLL SL R K+WGVF
Sbjct: 541 GSKKLLESLGRGKMWGVF 558

BLAST of Cp4.1LG20g06260 vs. TAIR10
Match: AT5G50900.1 (AT5G50900.1 ARM repeat superfamily protein)

HSP 1 Score: 614.8 bits (1584), Expect = 5.3e-176
Identity = 331/554 (59.75%), Positives = 424/554 (76.53%), Query Frame = 1

Query: 1   MKVPPENSHFLLSNNLISSVLDDIPLINDFKGKWSSMATKLSDLRAQLIDVSEFPNSSSN 60
           M VP  +        +I+S++D IP +  FK KWSS+  KL+DL+ QL D S+F  SSSN
Sbjct: 1   MTVPNSDDGDRSLTEVITSLIDSIPNLLSFKCKWSSIRAKLADLKTQLSDFSDFAGSSSN 60

Query: 61  PLSLDFLHSVMETLTQAASLSHKCRNPGVSDGKLKTQSDILSVLTKLDCLLKDGDVLIKS 120
            L++D L SV ETL  A +++ +C  P +++GKLKTQS++ SV+ +LD  +KD +VLIKS
Sbjct: 61  KLAVDLLVSVRETLNDAVAVAARCEGPDLAEGKLKTQSEVDSVMARLDRHVKDAEVLIKS 120

Query: 121 EILHD-GVISSS---SSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVT 180
            +L D G++ S    SS++EAVR E+RNL+ RLQIG +ES+  AIDSL++LL EDDKNV 
Sbjct: 121 GLLIDNGIVVSGFSISSKKEAVRLEARNLVIRLQIGGVESKNSAIDSLIELLQEDDKNVM 180

Query: 181 IAAAQGAVPVLVRLLDSSSLELKEKAVAAISIVSTVDGVKNVMIAEGIVLLNHLLRILDS 240
           I  AQG VPVLVRLLDS SL +KEK VA IS +S V+  K+V+IAEG+ LLNHLLR+L+S
Sbjct: 181 ICVAQGVVPVLVRLLDSCSLVMKEKTVAVISRISMVESSKHVLIAEGLSLLNHLLRVLES 240

Query: 241 GSGFAKEKSCLALQSLSISRQNARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASF 300
           GSGFAKEK+C+ALQ+LS+S++NAR+IG RGGISSLLEIC+ G+PGSQA AA VLRNLA F
Sbjct: 241 GSGFAKEKACVALQALSLSKENARAIGCRGGISSLLEICQGGSPGSQAFAAGVLRNLALF 300

Query: 301 NEIKENFIEENGVIVLLGLLTSGTPLAQENAIGCLCNLVVDDDNLKLLIVKEGGIEFLKI 360
            E KENF+EEN + VL+ +++SGT LAQENA+GCL NL   D++L + +V+EGGI+ LK 
Sbjct: 301 GETKENFVEENAIFVLISMVSSGTSLAQENAVGCLANLTSGDEDLMISVVREGGIQCLKS 360

Query: 361 FWDSVPSVRSLGVAVELLSLLASYSPIAETLISEGFLDRLLPVLSCGVLGARIAAARAVY 420
           FWDSV SV+SL V V LL  LA    + E +ISEGF+ RL+PVLSCGVLG RIAAA AV 
Sbjct: 361 FWDSVSSVKSLEVGVVLLKNLALCPIVREVVISEGFIPRLVPVLSCGVLGVRIAAAEAVS 420

Query: 421 ELSFCAKARKEMGESGFITPLINMLDGKSVDEKTAAAKALSSLLQYNGNRRIFQKEERGI 480
            L F +K+RKEMGESG I PLI+MLDGK+++EK AA+KALS+LL    NR+IF+K ++G+
Sbjct: 421 SLGFSSKSRKEMGESGCIVPLIDMLDGKAIEEKEAASKALSTLLVCTSNRKIFKKSDKGV 480

Query: 481 VSAVHLLDPSILNLDKKYPVSLLASVVISSKCRKLMAAAGAALYLQKLVEMNVEGSKKLL 540
           VS V LLDP I  LDK+Y VS L  +V S KCRK + AAGA L+LQKLV+M+ EG+KKL 
Sbjct: 481 VSLVQLLDPKIKKLDKRYTVSALELLVTSKKCRKQVVAAGACLHLQKLVDMDTEGAKKLA 540

Query: 541 VSLSRAKIWGVFAR 551
            +LSR+KIWGVF R
Sbjct: 541 ENLSRSKIWGVFTR 554

BLAST of Cp4.1LG20g06260 vs. TAIR10
Match: AT2G45720.1 (AT2G45720.1 ARM repeat superfamily protein)

HSP 1 Score: 287.0 bits (733), Expect = 2.5e-77
Identity = 193/541 (35.67%), Positives = 314/541 (58.04%), Query Frame = 1

Query: 11  LLSNNLISSVLDDIPLINDFKGKWSSMATKLSDLRAQLIDVSEFPNSSSNPLSLDFLHSV 70
           L +  L+   L     +  F  +W  + ++L  +   L D+S  P  S + L  + L +V
Sbjct: 21  LQAQELVPIALSKARTVKGFSSRWRVIISRLEKIPTCLSDLSSHPCFSKHTLCKEQLQAV 80

Query: 71  METLTQAASLSHKCRNPGVSDGKLKTQSDILSVLTKLDCLLKDGDVLIKSEILHDGVISS 130
           +ETL +   L++ C +    +GKLK QSD+ S+  K+D  LKD  +L+K+ +L +     
Sbjct: 81  LETLKETIELANVCVSEK-QEGKLKMQSDLDSLSAKIDLSLKDCGLLMKTGVLGEVTKPL 140

Query: 131 SSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAAQGAVPVLVRL 190
           SSS ++      R L+ RLQIG +ES+  A++ L++++ ED+K V  A  +  V  LV+L
Sbjct: 141 SSSTQDLETFSVRELLARLQIGHLESKRKALEQLVEVMKEDEKAVITALGRTNVASLVQL 200

Query: 191 LDSSSLELKEKAVAAISIVSTVDGVKNVMIAEGIVLLNHLLRILDSGSGFAKEKSCLALQ 250
           L ++S  ++E AV  I  ++   G +N +I+E    L  L+R+L+SGS  AKEK+ ++LQ
Sbjct: 201 LTATSPSVRENAVTVICSLAESGGCENWLISENA--LPSLIRLLESGSIVAKEKAVISLQ 260

Query: 251 SLSISRQNARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFNEIKENFIEENGVI 310
            +SIS + +RSI   GG+  L+EIC+ G   SQ+++A  L+N+++  E+++N  EE  V 
Sbjct: 261 RMSISSETSRSIVGHGGVGPLIEICKTGDSVSQSASACTLKNISAVPEVRQNLAEEGIVK 320

Query: 311 VLLGLLTSGTPL-AQENAIGCLCNLVVDDDNLKLLIVKEGGIEFLKIFWDSVPSVRSLGV 370
           V++ +L  G  L ++E A  CL NL   ++ L+  ++ E GI+ L  + D  P  +  GV
Sbjct: 321 VMINILNCGILLGSKEYAAECLQNLTSSNETLRRSVISENGIQTLLAYLDG-PLPQESGV 380

Query: 371 AVELLSLLASYSPIAETLISEGFLDRLLPVLSCGVLGARIAAARAVYELSFCAKARKEMG 430
           A  + +L+ S S   ET      +  L+ VL  G +GA+ AAA  +  ++   + ++ +G
Sbjct: 381 AA-IRNLVGSVS--VETYFK--IIPSLVHVLKSGSIGAQQAAASTICRIATSNETKRMIG 440

Query: 431 ESGFITPLINMLDGKSVDEKTAAAKALSSLLQYNGNRRIFQKEERGIVSAVHLLDPSILN 490
           ESG I  LI ML+ K+   +  AA+A++SL+    N R  +++E+ + S V LL+PS  N
Sbjct: 441 ESGCIPLLIRMLEAKASGAREVAAQAIASLVTVPRNCREVKRDEKSVTSLVMLLEPSPGN 500

Query: 491 LDKKYPVSLLASVVISSKCRKLMAAAGAALYLQKLVEMNVEGSKKLLVSLSRAKIWGVFA 550
             KKY VS LA++  S KC+KLM + GA  YL+KL E+ V GSKKLL  + + K+   F+
Sbjct: 501 SAKKYAVSGLAALCSSRKCKKLMVSHGAVGYLKKLSELEVPGSKKLLERIEKGKLKSFFS 552

BLAST of Cp4.1LG20g06260 vs. TAIR10
Match: AT1G01830.2 (AT1G01830.2 ARM repeat superfamily protein)

HSP 1 Score: 276.2 bits (705), Expect = 4.5e-74
Identity = 198/543 (36.46%), Positives = 314/543 (57.83%), Query Frame = 1

Query: 14  NNLISSVLDDIPLINDFKGKWSSMATKLSDLRAQLIDVSEFPNSSSNPLSLDFLHSVMET 73
           N+LI SVL     +  F G+W ++ +K+  + A L D+S  P  S N L  + L SV +T
Sbjct: 40  NSLIPSVLSKAKTVKKFTGRWKTIISKIEQIPACLSDLSSHPCFSKNKLCNEQLQSVAKT 99

Query: 74  LTQAASLSHKCRNPGVSDGKLKTQSDILSVLTKLDCLLKDGDVLIKSEILHDGVIS---S 133
           L++   L+ +C      +GKL+ QSD+ S+  KLD  L+D  VLIK+ +L +  +    S
Sbjct: 100 LSEVIELAEQCSTDKY-EGKLRMQSDLDSLSGKLDLNLRDCGVLIKTGVLGEATLPLYIS 159

Query: 134 SSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAAQGA-VPVLVR 193
           SSS    + +  + L+ RLQIG +ES+  A++SLL  + ED+K V +     A V  LV+
Sbjct: 160 SSSETPKI-SSLKELLARLQIGHLESKHNALESLLGAMQEDEKMVLMPLIGRANVAALVQ 219

Query: 194 LLDSSSLELKEKAVAAISIVSTVDGVKNVMIAEGIVLLNHLLRILDSGSGFAKEKSCLAL 253
           LL ++S  ++EKAV  IS+++        +I+EG+  L  L+R+++SGS   KEK+ +A+
Sbjct: 220 LLTATSTRIREKAVNLISVLAESGHCDEWLISEGV--LPPLVRLIESGSLETKEKAAIAI 279

Query: 254 QSLSISRQNARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFNEIKENFIEENGV 313
           Q LS++ +NAR I   GGI+ L+++C+ G   SQA++AA L+N+++ +E+++   EE  +
Sbjct: 280 QRLSMTEENAREIAGHGGITPLIDLCKTGDSVSQAASAAALKNMSAVSELRQLLAEEGII 339

Query: 314 IVLLGLLTSGTPL-AQENAIGCLCNLVVDDDNLKLLIVKEGGIEFLKIFWDSVPSVRSLG 373
            V + LL  G  L ++E+   CL NL    D L+  IV EGG+  L  + D  P  +   
Sbjct: 340 RVSIDLLNHGILLGSREHMAECLQNLTAASDALREAIVSEGGVPSLLAYLDG-PLPQQPA 399

Query: 374 VAVELLSLLASYSPIAETLISEGFLDRLLPVLSCGVLGARIAAARAVYELSFCAKARKEM 433
           V   L +L+ S +P  E  ++   L RL  VL  G LGA+ AAA A+   +   + ++ +
Sbjct: 400 VTA-LRNLIPSVNP--EIWVALNLLPRLRHVLKSGSLGAQQAAASAICRFACSPETKRLV 459

Query: 434 GESGFITPLINMLDGKSVDEKTAAAKALSSLLQYNGNRRIFQKEERGIV-SAVHLLDPSI 493
           GESG I  ++ +L+ KS   + AAA+A++ L+     RR  +K+ + ++ + V LLD + 
Sbjct: 460 GESGCIPEIVKLLESKSNGCREAAAQAIAGLVAEGRIRRELKKDGKSVLTNLVMLLDSNP 519

Query: 494 LNLDKKYPVSLLASVVISSKCRKLMAAAGAALYLQKLVEMNVEGSKKLLVSLSRAKIWGV 551
            N  KKY V+ L  +  S K +K+M + GA  YL+KL EM V G+ KLL  L R K+   
Sbjct: 520 GNTAKKYAVAGLLGMSGSEKSKKMMVSYGAIGYLKKLSEMEVMGADKLLEKLERGKLRSF 574

BLAST of Cp4.1LG20g06260 vs. TAIR10
Match: AT2G05810.1 (AT2G05810.1 ARM repeat superfamily protein)

HSP 1 Score: 199.9 bits (507), Expect = 4.1e-51
Identity = 160/543 (29.47%), Positives = 285/543 (52.49%), Query Frame = 1

Query: 12  LSNNLISSVLDDIPLINDFKGKWSSMATKLSDLRAQLIDVSEFPNSSSNPLSLDFLHSVM 71
           L  N++S +L     +  F G+W  + +KL  L + L  +SE P+ S NPL    L S++
Sbjct: 26  LITNVLSLLLLSSLTVRSFIGRWQILRSKLFTLNSSLSSLSESPHWSQNPLLHTLLPSLL 85

Query: 72  ETLTQAASLSHKCRNPGVSDGKLKTQSDILSVLTKLDCLLKDGDVLIKSEILH--DGVIS 131
             L + +SLS +C +   S GKL  QSD+    + L   + D D+L++S +LH  + ++ 
Sbjct: 86  SNLQRLSSLSDQCSSASFSGGKLLMQSDLDIASSSLSTHISDLDLLLRSGVLHQQNAIVL 145

Query: 132 S---SSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAAQGAVPV 191
           S    +S ++ +    R+L TRLQIG  E +  +++SLLQLL +++K+  I A +G V  
Sbjct: 146 SLPPPTSDKDDIAFFIRDLFTRLQIGGAEFKKKSLESLLQLLTDNEKSARIIAKEGNVGY 205

Query: 192 LVRLLDSSSLEL-KEKAVAAISIV--STVDGVKNVMIAEGIVLLNHLLRILDSGSGFAKE 251
           LV LLD     L +E A+AA+S++  S+ D  K V    G   L  LLR+L++GS   K 
Sbjct: 206 LVTLLDLHHHPLIREHALAAVSLLTSSSADSRKTVFEQGG---LGPLLRLLETGSSPFKT 265

Query: 252 KSCLALQSLSISRQNARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFNEIKENF 311
           ++ +A+++++     A +I + GG++ L+E C +G+   Q   A  + N+A+  EI+   
Sbjct: 266 RAAIAIEAITADPATAWAISAYGGVTVLIEACRSGSKQVQEHIAGAISNIAAVEEIRTTL 325

Query: 312 IEENGVIVLLGLLTSGTPLAQENAIGCLCNLVVDDDNLKLLIVKE-GGIEFLKIFWDSVP 371
            EE  + VL+ LL SG+   QE     +  +    +  + LIV+E GG++ L        
Sbjct: 326 AEEGAIPVLIQLLISGSSSVQEKTANFISLISSSGEYYRDLIVRERGGLQILIHLVQESS 385

Query: 372 SVRSLGVAVELLSLLASYSPIAETLISE-GFLDRLLPVLSCGVLGARIAAARAVYELSFC 431
           +  ++   +  LS +++   ++  L S   F+ RL  ++  G +  +  +   +  L+  
Sbjct: 386 NPDTIEHCLLALSQISAMETVSRVLSSSTRFIIRLGELIKHGNVILQQISTSLLSNLTIS 445

Query: 432 AKARKEMGESGFITPLINMLDG-KSVDEKTAAAKALSSLLQYNGNRRIFQKEERGIVSAV 491
              ++ + +   ++ LI +++  K    + AA +A  SLL    NR+   ++E+ ++  V
Sbjct: 446 DGNKRAVADC--LSSLIRLMESPKPAGLQEAATEAAKSLLTVRSNRKELMRDEKSVIRLV 505

Query: 492 HLLDPSILNL-DKKYPVSLLASVVI--SSKCRKLMAAAGAALYLQKLVEMNVEGSKKLLV 541
            +LDP    + +K+ PV ++ +++   S   R  +   GA  YLQ L EM V G+KK + 
Sbjct: 506 QMLDPRNERMNNKELPVMVVTAILSGGSYAARTKLIGLGADRYLQSLEEMEVPGAKKAVQ 563

BLAST of Cp4.1LG20g06260 vs. TAIR10
Match: AT1G61350.1 (AT1G61350.1 ARM repeat superfamily protein)

HSP 1 Score: 188.7 bits (478), Expect = 9.4e-48
Identity = 160/544 (29.41%), Positives = 282/544 (51.84%), Query Frame = 1

Query: 17  ISSVLDDIPLINDFKGKWSSMATKLSDLRAQLIDVSEFPNSSSNPLSLDFLHSVMETLTQ 76
           ISS++     I  F  KW  + TKL +L + L  +    NS  +P     + +++ +L  
Sbjct: 21  ISSLISLSHSIKSFNIKWQLIRTKLQELYSGLDSLRNL-NSGFDPSLSSLISAILISLKD 80

Query: 77  AASLSHKCRNPGVSDGKLKTQSDILSVLTKLDCLLKDGDVLIKSEILHDGVI-----SSS 136
              L+ +C N   S GKL  QSD+  +  K D   ++   +  + IL  G        + 
Sbjct: 81  TYDLATRCVNVSFS-GKLLMQSDLDVMAGKFDGHTRNLSRIYSAGILSHGFAIVVLKPNG 140

Query: 137 SSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAA-QGAVPVLVRL 196
           ++ ++ +R   R+L+TR++IG +E +  A+  L + + EDD+ V I       V VLV  
Sbjct: 141 NACKDDMRFYIRDLLTRMKIGDLEMKKQALVKLNEAMEEDDRYVKILIEISDMVNVLVGF 200

Query: 197 LDSSSLELKEKAVAAISIVSTVDGVKNVMIAEGIVLLNHLLRILDSGSGFAKEKSCLALQ 256
           LDS  + ++E++  A+  +S     ++V+I  G++    L+R+L++G+G  +E S   L 
Sbjct: 201 LDSE-IGIQEESAKAVFFISGFGSYRDVLIRSGVI--GPLVRVLENGNGVGREASARCLM 260

Query: 257 SLSISRQNARSIGSRGGISSLLEICEAGTPGSQ--ASAAAVLRNLASFNEIKENFIEE-N 316
            L+ + +NA S+ + GG+S+LL+IC     G +   ++  VLRNL    EIK   IEE +
Sbjct: 261 KLTENSENAWSVSAHGGVSALLKICSCSDFGGELIGTSCGVLRNLVGVEEIKRFMIEEDH 320

Query: 317 GVIVLLGLLTSGTPLAQENAIGCLCNLVVDDDNLKLLIVKEGGI-EFLKIFWD--SVPSV 376
            V   + L+ S   + Q N+I  L ++   D+  + ++V+EGGI E + +  D  S+ S 
Sbjct: 321 TVATFIKLIGSKEEIVQVNSIDLLLSMCCKDEQTRDILVREGGIQELVSVLSDPNSLSSS 380

Query: 377 RSLGVAVELL-SLLASYSPIAETLISEGFLDRLLPVLSCGVLGARIAAARAVYEL-SFCA 436
           +S  +A+  + +L    +     L+   FLD LL +L  G +  + +A +    L S   
Sbjct: 381 KSKEIALRAIDNLCFGSAGCLNALMGCKFLDHLLNLLRNGEISVQESALKVTSRLCSLQE 440

Query: 437 KARKEMGESGFITPLINMLDGKSVDEKTAAAKALSSLLQYNGNRRIFQKEERGIVSAVHL 496
           + ++ MGE+GF+  L+  LD KS+D +  A+ AL  L+    NR+ F +++  I   + L
Sbjct: 441 EVKRIMGEAGFMPELVKFLDAKSIDVREMASVALYCLISVPRNRKKFAQDDFNISYILQL 500

Query: 497 L---DPSILNLDK---KYPVSLLASVVISSKCRKLMAAAGAALYLQKLVEMNVEGSKKLL 541
           L   D S ++ D    K+ +S+L S+   +  R+ +A++G    ++KL E     +KKL+
Sbjct: 501 LDHEDGSNVSSDSGNTKFLISILMSLTSCNSARRKIASSGYLKSIEKLAETEGSDAKKLV 559

BLAST of Cp4.1LG20g06260 vs. NCBI nr
Match: gi|449455447|ref|XP_004145464.1| (PREDICTED: vacuolar protein 8 [Cucumis sativus])

HSP 1 Score: 918.7 bits (2373), Expect = 4.9e-264
Identity = 487/551 (88.38%), Positives = 516/551 (93.65%), Query Frame = 1

Query: 1   MKVPPENSHFLLSNNLISSVLDDIPLINDFKGKWSSMATKLSDLRAQLIDVSEFPNSSSN 60
           MK+PPE  HFLLSNNLISS+LDDIPLI  FKGKWSS+  KLSDLR QLIDVS FPNSSSN
Sbjct: 1   MKIPPETDHFLLSNNLISSLLDDIPLITIFKGKWSSIRAKLSDLRTQLIDVSHFPNSSSN 60

Query: 61  PLSLDFLHSVMETLTQAASLSHKCRNPGVSDGKLKTQSDILSVLTKLDCLLKDGDVLIKS 120
           PLSLDFLHSV+E LTQAASLSHKCRNP +SDGKLKTQSDI ++L K D LLKDG+VLI+S
Sbjct: 61  PLSLDFLHSVLEALTQAASLSHKCRNPALSDGKLKTQSDIDAILAKFDSLLKDGEVLIRS 120

Query: 121 EILHDGVISSSSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAA 180
           EILHDGV+SSSSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAA
Sbjct: 121 EILHDGVVSSSSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAA 180

Query: 181 QGAVPVLVRLLDSSSLELKEKAVAAISIVSTVDGVKNVMIAEGIVLLNHLLRILDSGSGF 240
           QGAVPVLVRLLDSSSLELKE+AVAAISIVS VDGVK++MIAEG+VLLNHLLRILDSGSGF
Sbjct: 181 QGAVPVLVRLLDSSSLELKERAVAAISIVSMVDGVKHIMIAEGLVLLNHLLRILDSGSGF 240

Query: 241 AKEKSCLALQSLSISRQNARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFNEIK 300
           AKEK+CLALQ LSIS++NARSIGSRGGISSLLEICE GTPGSQASAAAVLRNLASF+EIK
Sbjct: 241 AKEKACLALQPLSISKENARSIGSRGGISSLLEICEGGTPGSQASAAAVLRNLASFSEIK 300

Query: 301 ENFIEENGVIVLLGLLTSGTPLAQENAIGCLCNLVVDDDNLKLLIVKEGGIEFLKIFWDS 360
           ENFIEENGVIVLLGLL SGTPLAQENAIGCLCNLV+DDDNLKLLIV+EGGIEFL+ FWDS
Sbjct: 301 ENFIEENGVIVLLGLLASGTPLAQENAIGCLCNLVLDDDNLKLLIVREGGIEFLRNFWDS 360

Query: 361 VPSVRSLGVAVELLSLLASYSPIAETLISEGFLDRLLPVLSCGVLGARIAAARAVYELSF 420
           VPSVRSL VAVELLSLLASYSPIAE LIS+GF+DRLLPVLSCGVLGAR AAARAVYEL F
Sbjct: 361 VPSVRSLEVAVELLSLLASYSPIAEALISDGFVDRLLPVLSCGVLGARTAAARAVYELGF 420

Query: 421 CAKARKEMGESGFITPLINMLDGKSVDEKTAAAKALSSLLQYNGNRRIFQKEERGIVSAV 480
           C K RKEMGESGFITPL+NMLDGKSVDE+ AAAKALSSLLQY+GNR+IFQKEERGIVSAV
Sbjct: 421 CTKTRKEMGESGFITPLVNMLDGKSVDERKAAAKALSSLLQYSGNRKIFQKEERGIVSAV 480

Query: 481 HLLDPSILNLDKKYPVSLLASVVISSKCRKLMAAAGAALYLQKLVEMNVEGSKKLLVSLS 540
            LLDPSI NLDKKYPVSLL+SV ISSKCRK M AAGA LYLQKLVE+NVEGSKKLL SL 
Sbjct: 481 QLLDPSISNLDKKYPVSLLSSVAISSKCRKQMVAAGAGLYLQKLVEINVEGSKKLLESLG 540

Query: 541 RAKIWGVFARS 552
           R KIWGVFARS
Sbjct: 541 RGKIWGVFARS 551

BLAST of Cp4.1LG20g06260 vs. NCBI nr
Match: gi|659118181|ref|XP_008458985.1| (PREDICTED: U-box domain-containing protein 10 [Cucumis melo])

HSP 1 Score: 914.1 bits (2361), Expect = 1.2e-262
Identity = 483/551 (87.66%), Positives = 515/551 (93.47%), Query Frame = 1

Query: 1   MKVPPENSHFLLSNNLISSVLDDIPLINDFKGKWSSMATKLSDLRAQLIDVSEFPNSSSN 60
           MK+PP   HFLLSNNL+SS+LDDIPLI+ FKGKWSS+  KLSDLR QLIDVS FPNSSSN
Sbjct: 1   MKIPPHTDHFLLSNNLLSSLLDDIPLISIFKGKWSSIRAKLSDLRTQLIDVSHFPNSSSN 60

Query: 61  PLSLDFLHSVMETLTQAASLSHKCRNPGVSDGKLKTQSDILSVLTKLDCLLKDGDVLIKS 120
           PLSLDFLHSV+E LTQAASLSHKCRNP +SDGKLKTQSDI ++L K D LLKDG+VLI+S
Sbjct: 61  PLSLDFLHSVLEALTQAASLSHKCRNPALSDGKLKTQSDIDALLAKFDSLLKDGEVLIRS 120

Query: 121 EILHDGVISSSSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAA 180
           EILHDGV+SSSSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAA
Sbjct: 121 EILHDGVVSSSSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAA 180

Query: 181 QGAVPVLVRLLDSSSLELKEKAVAAISIVSTVDGVKNVMIAEGIVLLNHLLRILDSGSGF 240
           QGAVPVLVRLLDSSSLELKE+AVAAISIVS VDGVK++MIAEG+VLLNHLLRILDSGSGF
Sbjct: 181 QGAVPVLVRLLDSSSLELKERAVAAISIVSMVDGVKHIMIAEGLVLLNHLLRILDSGSGF 240

Query: 241 AKEKSCLALQSLSISRQNARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFNEIK 300
           AKEK+CLALQ LSIS++NARSIGSRGGISSLLEICE GTPGSQASAAAVLRNLASF+EIK
Sbjct: 241 AKEKACLALQPLSISKENARSIGSRGGISSLLEICEGGTPGSQASAAAVLRNLASFSEIK 300

Query: 301 ENFIEENGVIVLLGLLTSGTPLAQENAIGCLCNLVVDDDNLKLLIVKEGGIEFLKIFWDS 360
           ENFIEENGV+VLLGLL SGTPLAQENAIGCLCNLV+DDDNLKLLIV+EGGIEFL+ FWDS
Sbjct: 301 ENFIEENGVMVLLGLLASGTPLAQENAIGCLCNLVLDDDNLKLLIVREGGIEFLRNFWDS 360

Query: 361 VPSVRSLGVAVELLSLLASYSPIAETLISEGFLDRLLPVLSCGVLGARIAAARAVYELSF 420
           VPS RSL VAVELLSLLASYSPIAE LIS+GF+DRLLPVLSCGVLGAR AAARAVYEL F
Sbjct: 361 VPSARSLEVAVELLSLLASYSPIAEALISDGFVDRLLPVLSCGVLGARTAAARAVYELGF 420

Query: 421 CAKARKEMGESGFITPLINMLDGKSVDEKTAAAKALSSLLQYNGNRRIFQKEERGIVSAV 480
           C K RKEMGESGFITPL+NMLDGKSVDE+ AAAKALSSLLQY+GNR+IFQKEERGI+SAV
Sbjct: 421 CTKTRKEMGESGFITPLVNMLDGKSVDERKAAAKALSSLLQYSGNRKIFQKEERGIISAV 480

Query: 481 HLLDPSILNLDKKYPVSLLASVVISSKCRKLMAAAGAALYLQKLVEMNVEGSKKLLVSLS 540
            LLDPSI NLDKKYPVSLL+SV ISSKCRK M AAGA LYLQKLVEMNVEGSKKLL SL 
Sbjct: 481 QLLDPSISNLDKKYPVSLLSSVAISSKCRKQMVAAGAGLYLQKLVEMNVEGSKKLLESLG 540

Query: 541 RAKIWGVFARS 552
           R KIWGVFARS
Sbjct: 541 RGKIWGVFARS 551

BLAST of Cp4.1LG20g06260 vs. NCBI nr
Match: gi|590670572|ref|XP_007038093.1| (ARM repeat superfamily protein isoform 1 [Theobroma cacao])

HSP 1 Score: 712.2 bits (1837), Expect = 6.9e-202
Identity = 380/550 (69.09%), Positives = 458/550 (83.27%), Query Frame = 1

Query: 1   MKVPPENSHFLLSNNLISSVLDDIPLINDFKGKWSSMATKLSDLRAQLIDVSEFPNSSSN 60
           MKVP EN    LSN+L++S+ + IP IN+FKGKW+ + +KLS L+AQL D S+FP SSSN
Sbjct: 1   MKVP-ENDPISLSNHLLASLSEQIPNINNFKGKWALIKSKLSGLQAQLADFSDFPASSSN 60

Query: 61  PLSLDFLHSVMETLTQAASLSHKCRNPGVSDGKLKTQSDILSVLTKLDCLLKDGDVLIKS 120
           PL++D L+S+ +TL  A SLS KC+   +++GKLKTQSDI +VL KLD  +KD ++LI+S
Sbjct: 61  PLAVDLLYSITQTLNDAVSLSQKCQLADLTEGKLKTQSDIDAVLAKLDRHIKDSEILIRS 120

Query: 121 EILHDGVISSSSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAAA 180
            +L DG +S+SSS++EAVR ESRNLITRLQIG+ ES+  A+DSLL LL EDDKNV IA A
Sbjct: 121 GVLQDGAVSTSSSKKEAVRVESRNLITRLQIGTTESKNSAMDSLLGLLQEDDKNVMIAVA 180

Query: 181 QGAVPVLVRLLDSSSLELKEKAVAAISIVSTVDGVKNVMIAEGIVLLNHLLRILDSGSGF 240
           QG VPVLVRLLDSSSLE+KEK VAAIS VSTV+  K+V+IAEG++LLNHLLR+L+SGSGF
Sbjct: 181 QGVVPVLVRLLDSSSLEMKEKTVAAISRVSTVESSKHVLIAEGLLLLNHLLRVLESGSGF 240

Query: 241 AKEKSCLALQSLSISRQNARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFNEIK 300
           AKEK+C+ALQ+LS S++NAR+IGSRGGISSLLEIC+AGTPGSQA AA VL+NLAS +EIK
Sbjct: 241 AKEKACIALQALSFSKENARAIGSRGGISSLLEICQAGTPGSQAFAAGVLKNLASVDEIK 300

Query: 301 ENFIEENGVIVLLGLLTSGTPLAQENAIGCLCNLVVDDDNLKLLIVKEGGIEFLKIFWDS 360
           ENFIEEN V VL+GL  SGT LAQEN+IGCLCNLV DD+NL+LLIVKEGGIE LK FWDS
Sbjct: 301 ENFIEENAVFVLIGLAASGTALAQENSIGCLCNLVSDDENLRLLIVKEGGIECLKNFWDS 360

Query: 361 VPSVRSLGVAVELLSLLASYSPIAETLISEGFLDRLLPVLSCGVLGARIAAARAVYELSF 420
            P+ +SL VAVEL+  LAS SPIAE L+++GF+ RL+ VL+CGVLG RIAAARAVYEL F
Sbjct: 361 SPNPKSLEVAVELVRRLASCSPIAEALVADGFVARLVAVLNCGVLGVRIAAARAVYELGF 420

Query: 421 CAKARKEMGESGFITPLINMLDGKSVDEKTAAAKALSSLLQYNGNRRIFQKEERGIVSAV 480
            +K RKEMGE G    LI M+DGK+V+EK AAA ALS+L+ Y GNR++FQK+ERGIV+AV
Sbjct: 421 NSKTRKEMGECGCTVALIKMMDGKAVEEKEAAAMALSTLMLYAGNRKVFQKDERGIVNAV 480

Query: 481 HLLDPSILNLDKKYPVSLLASVVISSKCRKLMAAAGAALYLQKLVEMNVEGSKKLLVSLS 540
            LLDP I NLDKKYPV +L+ +V S KCRK M AAGA +YLQKLVEMNVEG+KKLL SL 
Sbjct: 481 QLLDPLIQNLDKKYPVLILSELVHSKKCRKQMVAAGACVYLQKLVEMNVEGAKKLLESLG 540

Query: 541 RAKIWGVFAR 551
           R KIWGVFAR
Sbjct: 541 RGKIWGVFAR 549

BLAST of Cp4.1LG20g06260 vs. NCBI nr
Match: gi|1009130344|ref|XP_015882248.1| (PREDICTED: armadillo segment polarity protein-like [Ziziphus jujuba])

HSP 1 Score: 699.9 bits (1805), Expect = 3.6e-198
Identity = 382/553 (69.08%), Positives = 451/553 (81.56%), Query Frame = 1

Query: 1   MKVPPENSHFLLSNNLISSVLDDIPLINDFKGKWSSMATKLSDLRAQLIDVSEFPNSSSN 60
           MKVP EN    L+ +L+SS+  +IP +++FKGKW+ +  KL++L+ QL D S+ P S SN
Sbjct: 1   MKVP-ENDPISLATHLVSSLSQEIPTVSNFKGKWALINDKLAELKTQLADFSDCPTSVSN 60

Query: 61  PLSLDFLHSVMETLTQAASLSHKCRNPGVSDGKLKTQSDILSVLTKLDCLLKDGDVLIKS 120
           PLSL+ L SV  TL  A SLS KC++P +S+GKL+TQSD+ SVL KLD  +KD ++LI+S
Sbjct: 61  PLSLELLRSVSLTLHDAVSLSKKCQSPSLSEGKLRTQSDVDSVLAKLDKHVKDAEILIRS 120

Query: 121 EILHDGVIS--SSSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIA 180
            +L DG +S  SSSS+REAVRAESRNLITRLQIGS E+R  A+DSLL LL EDDKNV IA
Sbjct: 121 GVLQDGTVSGSSSSSKREAVRAESRNLITRLQIGSAEARNSAMDSLLGLLQEDDKNVMIA 180

Query: 181 AAQGAVPVLVRLLDSSSL-ELKEKAVAAISIVSTVDGVKNVMIAEGIVLLNHLLRILDSG 240
            AQG VPVLVRL+DSSS  E+KEK VAAIS VS VD  K+V+IAEG++LLNHLLR+LDSG
Sbjct: 181 VAQGIVPVLVRLMDSSSSPEMKEKTVAAISRVSMVDSSKHVLIAEGLLLLNHLLRVLDSG 240

Query: 241 SGFAKEKSCLALQSLSISRQNARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFN 300
           SGFAKEK+C+ALQ+LS S++NAR+IGSRGGISSLLEIC +GTPGSQASAA VLRNLA+F+
Sbjct: 241 SGFAKEKACIALQALSFSKENARAIGSRGGISSLLEICNSGTPGSQASAAGVLRNLAAFS 300

Query: 301 EIKENFIEENGVIVLLGLLTSGTPLAQENAIGCLCNLVVDDDNLKLLIVKEGGIEFLKIF 360
           E KENFIEENGV VLLGL   GT LAQENAIGCLCNLV +DD+LKLL+ KEGGIE LK F
Sbjct: 301 ENKENFIEENGVFVLLGLAGLGTVLAQENAIGCLCNLVCEDDHLKLLVAKEGGIECLKNF 360

Query: 361 WDSVPSVRSLGVAVELLSLLASYSPIAETLISEGFLDRLLPVLSCGVLGARIAAARAVYE 420
           WDS  SVRSL VAV+LL  LAS  PIAE L+S+GF+ R+  VL+CGVLG RIAAARAVYE
Sbjct: 361 WDSASSVRSLEVAVDLLRHLASRQPIAEVLVSDGFVTRIAGVLNCGVLGVRIAAARAVYE 420

Query: 421 LSFCAKARKEMGESGFITPLINMLDGKSVDEKTAAAKALSSLLQYNGNRRIFQKEERGIV 480
           L FC K RKEMGE G I  LI MLDGK+V+EK +AAKALS+L+ + GNRRIF+K+ +GI+
Sbjct: 421 LGFCTKTRKEMGECGCIASLIKMLDGKAVEEKESAAKALSNLMLFTGNRRIFRKDAKGIL 480

Query: 481 SAVHLLDPSILNLDKKYPVSLLASVVISSKCRKLMAAAGAALYLQKLVEMNVEGSKKLLV 540
            AV LLDPSI NLDKKYPVS+LAS+V S KCRK M A+GA  YLQKLVE +VEGSKKLL 
Sbjct: 481 CAVQLLDPSIQNLDKKYPVSVLASLVHSKKCRKQMVASGACAYLQKLVEADVEGSKKLLE 540

Query: 541 SLSRAKIWGVFAR 551
           SL R KIWGVFAR
Sbjct: 541 SLGRGKIWGVFAR 552

BLAST of Cp4.1LG20g06260 vs. NCBI nr
Match: gi|802770420|ref|XP_012090618.1| (PREDICTED: uncharacterized protein LOC105648741 [Jatropha curcas])

HSP 1 Score: 685.3 bits (1767), Expect = 9.1e-194
Identity = 370/551 (67.15%), Positives = 435/551 (78.95%), Query Frame = 1

Query: 1   MKVPPENSHFLLSNNLISSVLDDIPLINDFKGKWSSMATKLSDLRAQLIDVSEFPNSSSN 60
           MKVP EN     ++ L+ S+LD+IP +  FKGKW+ +  KL+DL+ QL D ++FP S+SN
Sbjct: 1   MKVP-ENDPINANDQLLQSLLDEIPHVQTFKGKWALIRAKLADLQTQLTDFADFPASTSN 60

Query: 61  PLSLDFLHSVMETLTQAASLSHKCRNPGVSDGKLKTQSDILSVLTKLDCLLKDGDVLIKS 120
           PL LD LHS+  +L  A  L+ KCR P  ++GKL+TQSD+ S+L KLD  +KD ++LIKS
Sbjct: 61  PLCLDLLHSISNSLNDAVLLARKCRTPNFTEGKLRTQSDVDSILAKLDRHVKDSEILIKS 120

Query: 121 EILHDGVIS-SSSSRREAVRAESRNLITRLQIGSIESRVLAIDSLLQLLNEDDKNVTIAA 180
            +L DG  S  SSS+REAVR ESRNLITRLQIGS ES+  A+DSLL LL EDDKNV IA 
Sbjct: 121 GVLQDGATSVGSSSKREAVRVESRNLITRLQIGSSESKNSAMDSLLGLLQEDDKNVMIAV 180

Query: 181 AQGAVPVLVRLLDSSSLELKEKAVAAISIVSTVDGVKNVMIAEGIVLLNHLLRILDSGSG 240
           AQG VPVLVRLLDSSSLE+KEK VAAIS VS VD  K+V+IAEG++LLNHLLR+L+SGSG
Sbjct: 181 AQGVVPVLVRLLDSSSLEMKEKTVAAISRVSMVDSSKHVLIAEGLLLLNHLLRVLESGSG 240

Query: 241 FAKEKSCLALQSLSISRQNARSIGSRGGISSLLEICEAGTPGSQASAAAVLRNLASFNEI 300
           FAKEK+C+ALQ+LS S++NAR+IGSRGGISSLLEIC+ GTPGSQA AA VLRNLA F EI
Sbjct: 241 FAKEKACVALQALSFSKENARAIGSRGGISSLLEICQGGTPGSQAFAAGVLRNLAVFEEI 300

Query: 301 KENFIEENGVIVLLGLLTSGTPLAQENAIGCLCNLVVDDDNLKLLIVKEGGIEFLKIFWD 360
           +ENFIEEN V VL+GL  SGT LAQENAIGCLCNL  DD+NLKLLIVKEGG+E L+ FWD
Sbjct: 301 RENFIEENAVFVLIGLAASGTALAQENAIGCLCNLAKDDENLKLLIVKEGGVECLRNFWD 360

Query: 361 SVPSVRSLGVAVELLSLLASYSPIAETLISEGFLDRLLPVLSCGVLGARIAAARAVYELS 420
           S P VRSL VAV+LL  LAS   IAE L+S+GF+ RL+  L+CGVLG RIA A A+YEL 
Sbjct: 361 SGPPVRSLEVAVDLLRNLASNQAIAEVLVSDGFVSRLMVFLNCGVLGVRIATAEAIYELG 420

Query: 421 FCAKARKEMGESGFITPLINMLDGKSVDEKTAAAKALSSLLQYNGNRRIFQKEERGIVSA 480
           F  K RKEMGE   I PLINMLDGK+V EK AAAKALS LL Y GNR+ F+K+ERGIV  
Sbjct: 421 FNTKTRKEMGECEVIVPLINMLDGKAVVEKEAAAKALSHLLLYAGNRKTFRKDERGIVYT 480

Query: 481 VHLLDPSILNLDKKYPVSLLASVVISSKCRKLMAAAGAALYLQKLVEMNVEGSKKLLVSL 540
           V LLDPSI NLDKKYPVS+LAS+V S KCRK M  AGA ++L+ L EM +EG+KKLL  L
Sbjct: 481 VQLLDPSIQNLDKKYPVSILASLVQSKKCRKQMIGAGACVHLKTLAEMEIEGAKKLLDGL 540

Query: 541 SRAKIWGVFAR 551
            R KIWGVFAR
Sbjct: 541 GRGKIWGVFAR 550
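
The percentages on each HSP statistics line follow directly from the match counts: matches divided by alignment length, times 100. The sketch below is not part of the database output. It assumes the line layout shown in the reports above, and the names `HSP_STATS` and `hsp_percentages` are hypothetical. It recomputes both percentages and checks them against the printed values.

```python
import re

# Pattern for one HSP statistics line as printed in the reports above.
# "Posi?tives" accepts both "Positives" and the "Postives" spelling.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def hsp_percentages(line):
    """Recompute identity/positives percentages from the match counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, ident_rep, pos, pos_len, pos_rep = match.groups()
    identity_pct = round(100 * int(ident) / int(ident_len), 2)
    positives_pct = round(100 * int(pos) / int(pos_len), 2)
    return {
        "identity_pct": identity_pct,
        "positives_pct": positives_pct,
        # True when the recomputed values agree with the printed ones.
        "consistent": (
            abs(identity_pct - float(ident_rep)) < 0.01
            and abs(positives_pct - float(pos_rep)) < 0.01
        ),
    }


if __name__ == "__main__":
    # Statistics line of the Jatropha curcas hit above.
    print(hsp_percentages(
        "Identity = 370/551 (67.15%), Postives = 435/551 (78.95%), Query Frame = 1"
    ))
```

The denominator is the alignment length including gaps, which is why it can differ from the query length between hits (for example 550, 551 and 553 above).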

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
PUB4_ARATH   3.4e-15   27.65     U-box domain-containing protein 4 OS=Arabidopsis thaliana GN=PUB4 PE=1 SV=3
PUB11_ARATH  5.8e-15   26.60     U-box domain-containing protein 11 OS=Arabidopsis thaliana GN=PUB11 PE=2 SV=2
PUB10_ARATH  5.4e-13   26.38     U-box domain-containing protein 10 OS=Arabidopsis thaliana GN=PUB10 PE=2 SV=1
PUB14_ARATH  4.3e-10   25.83     U-box domain-containing protein 14 OS=Arabidopsis thaliana GN=PUB14 PE=1 SV=1
SL11_ORYSJ   7.3e-10   30.35     E3 ubiquitin-protein ligase SPL11 OS=Oryza sativa subsp. japonica GN=SPL11 PE=1 ...

Match Name        E-value   Identity  Description
A0A0A0M0M7_CUCSA  3.4e-264  88.38     Uncharacterized protein OS=Cucumis sativus GN=Csa_1G699580 PE=4 SV=1
A0A061G6D5_THECC  4.8e-202  69.09     ARM repeat superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_014724 PE=4 S...
A0A067JIA0_JATCU  6.3e-194  67.15     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26389 PE=4 SV=1
A0A067DZT1_CITSI  5.9e-192  66.49     Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g008560mg PE=4 SV=1
W9QV83_9ROSA      7.7e-192  66.49     U-box domain-containing protein 11 OS=Morus notabilis GN=L484_008839 PE=4 SV=1

Match Name   E-value   Identity  Description
AT5G50900.1  5.3e-176  59.75     ARM repeat superfamily protein
AT2G45720.1  2.5e-77   35.67     ARM repeat superfamily protein
AT1G01830.2  4.5e-74   36.46     ARM repeat superfamily protein
AT2G05810.1  4.1e-51   29.47     ARM repeat superfamily protein
AT1G61350.1  9.4e-48   29.41     ARM repeat superfamily protein

Match Name                         E-value   Identity  Description
gi|449455447|ref|XP_004145464.1|   4.9e-264  88.38     PREDICTED: vacuolar protein 8 [Cucumis sativus]
gi|659118181|ref|XP_008458985.1|   1.2e-262  87.66     PREDICTED: U-box domain-containing protein 10 [Cucumis melo]
gi|590670572|ref|XP_007038093.1|   6.9e-202  69.09     ARM repeat superfamily protein isoform 1 [Theobroma cacao]
gi|1009130344|ref|XP_015882248.1|  3.6e-198  69.08     PREDICTED: armadillo segment polarity protein-like [Ziziphus jujuba]
gi|802770420|ref|XP_012090618.1|   9.1e-194  67.15     PREDICTED: uncharacterized protein LOC105648741 [Jatropha curcas]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0005488  binding
GO:0005515  protein binding
Vocabulary: INTERPRO
Term        Definition
IPR016024   ARM-type_fold
IPR011989   ARM-like
IPR000225   Armadillo
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0005488 binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG20g06260.1  Cp4.1LG20g06260.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description         Source   Source Term       Source Description                                     Alignment
IPR000225  Armadillo               PFAM     PF00514           Arm                                                    coord: 178..207, score: 1.
IPR000225  Armadillo               SMART    SM00185           arm_5                                                  coord: 337..379, score: 320.0; coord: 421..461, score: 3.9; coord: 255..295, score: 34.0; coord: 296..336, score: 0.94; coord: 171..211, score: 0.0035; coord: 380..420, score:
IPR000225  Armadillo               PROFILE  PS50176           ARM_REPEAT                                             coord: 182..210, score: 9
IPR011989  Armadillo-like helical  GENE3D   G3DSA:1.25.10.10                                                         coord: 369..532, score: 1.6E-15; coord: 128..354, score: 1.2
IPR016024  Armadillo-type fold     unknown  SSF48371          ARM repeat                                             coord: 144..488, score: 3.17
None       No IPR available        PANTHER  PTHR23315         BETA CATENIN-RELATED ARMADILLO REPEAT-CONTAINING       coord: 91..544, score: 7.1E-257; coord: 12..34, score: 7.1E
None       No IPR available        PANTHER  PTHR23315:SF86    ARMADILLO/BETA-CATENIN-LIKE REPEAT-CONTAINING PROTEIN  coord: 91..544, score: 7.1E-257; coord: 12..34, score: 7.1E

The following gene(s) are paralogous to this gene:

None