Cp4.1LG14g04170 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG14g04170
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Glutelin type-A 2
Location: Cp4.1LG14 : 1838550 .. 1840074 (+)
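
As an illustrative sketch (not part of the database record), the location value above can be parsed to recover the span of the gene on its linkage group. The function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which is a common convention for records like this one but is not stated on the page.

```python
import re

# Matches values such as "Cp4.1LG14 : 1838550 .. 1840074 (+)".
LOCATION = re.compile(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text):
    """Split a location value into sequence id, start, end, strand and length.

    Assumption: coordinates are 1-based and inclusive, so the span is
    end - start + 1.
    """
    match = LOCATION.search(text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }


if __name__ == "__main__":
    print(parse_location("Cp4.1LG14 : 1838550 .. 1840074 (+)"))
```

Under that assumption the gene spans 1525 bases on the forward strand.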
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS, three_prime_UTR
ATGGAACAGCCGATGAATCCAAAGCCCTTCACTGAGGTAGAAGCTGGATCTTATCACAAATGGCTGCCTTCTGAATACCCTTTGCTTGCTCGGAACAAAGTCGCCGCCGGCCGCCTTCTCCTCCGCCCTCGCGGCTTCGTGGTCCCCCACTACGCCGATTGCTCTAAAGTGGGCTATGTTCTTCAAGGTAATTAACTTCTCACTGTGTTATATGATATTGTCTACTTTGAATATAAGCTATCACGTCTCTATTTGGATTCTTTTCAAATAATTCTCGACAAATTTTGATTGTAGGCGAGAATGGAGTTGTCGGGTTGGTGTTTCCGAGCAAGTCCGATGAGGTGGTGGTGAACTTGAAGAAAGGAGATTTGATTCCGGTGCCGAATGGAGTCTCGTCATGGTGGTTCAACGACGGTGACTCCGATTTGGAAATCATTTTTTTGGGTGAATCCAAAAACGCTCATGTTCCTGGTGACATCTCTTATTTTGTTCTCTCCGGCCCTCTCAGCCTCCTGCATGGCTTCTCGCCGGAGTACGTCGGAAAAACCTATTCCCTAAACGGAGAAGAAACAACCCAATTTCTCAAAAGTCAATCCAACGCTTTGATTTGCTCAATTCAGCAAACCCAATCCCTTCCCAAACCACCCAAATTCAGTAAATTTGTTTACAACATTGACGCCGCCGCGCCGGACGGTAGAGTCAAGGGCAGCGCCGGTGCTGTCACGACGGTGACGGAATCAAAATTTCCGTTCATTGGTCAATCTGGGTTGACGGCGATTCTCGAAAAGCTCGACGCCAACGCCGTCCGTTCGCCGGTGTACGTCGCGGAGCCGTACGATCAACTGATCTACGTGGCTAAAGGACGTGGGAAAATCCAGATCGTTGGATCTTCGAGTAAAATTGATGCAGAGGTGAAAATGGGTCAGCTGATTTTAGTCCCCAAATTCTTCGCCGTCGGAAAATTCGCCGGAGAAGATGGCTTGGAGTGCATCTCCATTATCACAGCAACACAGTAAAAAGCTTCAATTTTTACCCTTTTATTTTTAACTTTAATCTCTGAACCTTAAAATTTGACGTTTTAAAATTTAACAGCCCTGTGGTGGAAGAACTGGCCGGAAAGACGTCGGTGTTGGAGGCATTGTCGCCGGAGGTATTTCAAGTTTCGTTCAATGTGACGGCGGAGTTCGAGAAGCTTCTTAGATCGAAGATCACAAACGCTTCACCGGTGATCGGATCTTCCGATTGAAGACTGAATTATTAATATTAGAATATAATATTTGAATGTTTGGGTATTATTATTATTATTATAATAATAATAATTAAATAATAACAATTAATATTATGGGCTTTTACGCTATTAAATTTGATCCCCAATCCCTATGATAGTGATGTGGGGGCTTTCTGGTCATTTCACCACTTGAACTTTTATATGAATTATTGTGAAGATTATGATTGTTGATGAATGTATTGTCTTATATAATATCTTTTATATTTATTATGTTTTTTTTTTAATTATTTTTTTGGT

mRNA sequence

ATGGAACAGCCGATGAATCCAAAGCCCTTCACTGAGGTAGAAGCTGGATCTTATCACAAATGGCTGCCTTCTGAATACCCTTTGCTTGCTCGGAACAAAGTCGCCGCCGGCCGCCTTCTCCTCCGCCCTCGCGGCTTCGTGTCCGATGAGGTGGTGGTGAACTTGAAGAAAGGAGATTTGATTCCGGTGCCGAATGGAGTCTCGTCATGGTGGTTCAACGACGGTGACTCCGATTTGGAAATCATTTTTTTGGGTGAATCCAAAAACGCTCATGTTCCTGGTGACATCTCTTATTTTGTTCTCTCCGGCCCTCTCAGCCTCCTGCATGGCTTCTCGCCGGAGTACGTCGGAAAAACCTATTCCCTAAACGGAGAAGAAACAACCCAATTTCTCAAAAGTCAATCCAACGCTTTGATTTGCTCAATTCAGCAAACCCAATCCCTTCCCAAACCACCCAAATTCAGTAAATTTGTTTACAACATTGACGCCGCCGCGCCGGACGGTAGAGTCAAGGGCAGCGCCGGTGCTGTCACGACGGTGACGGAATCAAAATTTCCGTTCATTGGTCAATCTGGGTTGACGGCGATTCTCGAAAAGCTCGACGCCAACGCCGTCCGTTCGCCGGTGTACGTCGCGGAGCCGTACGATCAACTGATCTACGTGGCTAAAGGACGTGGGAAAATCCAGATCGTTGGATCTTCGAGTAAAATTGATGCAGAGGTGAAAATGGGTCAGCTGATTTTAGTCCCCAAATTCTTCGCCGTCGGAAAATTCGCCGGAGAAGATGGCTTGGAGTGCATCTCCATTATCACAGCAACACACCCTGTGGTGGAAGAACTGGCCGGAAAGACGTCGGTGTTGGAGGCATTGTCGCCGGAGGTATTTCAAGTTTCGTTCAATGTGACGGCGGAGTTCGAGAAGCTTCTTAGATCGAAGATCACAAACGCTTCACCGGTGATCGGATCTTCCGATTGAAGACTGAATTATTAATATTAGAATATAATATTTGAATGTTTGGGTATTATTATTATTATTATAATAATAATAATTAAATAATAACAATTAATATTATGGGCTTTTACGCTATTAAATTTGATCCCCAATCCCTATGATAGTGATGTGGGGGCTTTCTGGTCATTTCACCACTTGAACTTTTATATGAATTATTGTGAAGATTATGATTGTTGATGAATGTATTGTCTTATATAATATCTTTTATATTTATTATGTTTTTTTTTTAATTATTTTTTTGGT

Coding sequence (CDS)

ATGGAACAGCCGATGAATCCAAAGCCCTTCACTGAGGTAGAAGCTGGATCTTATCACAAATGGCTGCCTTCTGAATACCCTTTGCTTGCTCGGAACAAAGTCGCCGCCGGCCGCCTTCTCCTCCGCCCTCGCGGCTTCGTGTCCGATGAGGTGGTGGTGAACTTGAAGAAAGGAGATTTGATTCCGGTGCCGAATGGAGTCTCGTCATGGTGGTTCAACGACGGTGACTCCGATTTGGAAATCATTTTTTTGGGTGAATCCAAAAACGCTCATGTTCCTGGTGACATCTCTTATTTTGTTCTCTCCGGCCCTCTCAGCCTCCTGCATGGCTTCTCGCCGGAGTACGTCGGAAAAACCTATTCCCTAAACGGAGAAGAAACAACCCAATTTCTCAAAAGTCAATCCAACGCTTTGATTTGCTCAATTCAGCAAACCCAATCCCTTCCCAAACCACCCAAATTCAGTAAATTTGTTTACAACATTGACGCCGCCGCGCCGGACGGTAGAGTCAAGGGCAGCGCCGGTGCTGTCACGACGGTGACGGAATCAAAATTTCCGTTCATTGGTCAATCTGGGTTGACGGCGATTCTCGAAAAGCTCGACGCCAACGCCGTCCGTTCGCCGGTGTACGTCGCGGAGCCGTACGATCAACTGATCTACGTGGCTAAAGGACGTGGGAAAATCCAGATCGTTGGATCTTCGAGTAAAATTGATGCAGAGGTGAAAATGGGTCAGCTGATTTTAGTCCCCAAATTCTTCGCCGTCGGAAAATTCGCCGGAGAAGATGGCTTGGAGTGCATCTCCATTATCACAGCAACACACCCTGTGGTGGAAGAACTGGCCGGAAAGACGTCGGTGTTGGAGGCATTGTCGCCGGAGGTATTTCAAGTTTCGTTCAATGTGACGGCGGAGTTCGAGAAGCTTCTTAGATCGAAGATCACAAACGCTTCACCGGTGATCGGATCTTCCGATTGA

Protein sequence

MEQPMNPKPFTEVEAGSYHKWLPSEYPLLARNKVAAGRLLLRPRGFVSDEVVVNLKKGDLIPVPNGVSSWWFNDGDSDLEIIFLGESKNAHVPGDISYFVLSGPLSLLHGFSPEYVGKTYSLNGEETTQFLKSQSNALICSIQQTQSLPKPPKFSKFVYNIDAAAPDGRVKGSAGAVTTVTESKFPFIGQSGLTAILEKLDANAVRSPVYVAEPYDQLIYVAKGRGKIQIVGSSSKIDAEVKMGQLILVPKFFAVGKFAGEDGLECISIITATHPVVEELAGKTSVLEALSPEVFQVSFNVTAEFEKLLRSKITNASPVIGSSD
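
The relationship between the coding sequence and the protein sequence above can be sketched in a few lines. This is a minimal illustration, assuming the standard genetic code (NCBI translation table 1); the function name `translate_cds` is hypothetical and is not something the database provides.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order so they line up with the amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First eight codons of the CDS listed above.
    print(translate_cds("ATGGAACAGCCGATGAATCCAAAG"))
```

Applied to the first eight codons of the CDS, this yields `MEQPMNPK`, which matches the start of the protein sequence; the final codons `GGATCTTCCGATTGA` yield `GSSD` followed by the stop codon.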
BLAST of Cp4.1LG14g04170 vs. Swiss-Prot
Match: CRU4_ARATH (12S seed storage protein CRD OS=Arabidopsis thaliana GN=CRD PE=1 SV=1)

HSP 1 Score: 99.8 bits (247), Expect = 6.0e-20
Identity = 80/293 (27.30%), Positives = 129/293 (44.03%), Query Frame = 1

Query: 54  NLKKGDLIPVPNGVSSWWFNDGDSDLEIIFLGESKNAHVPGDI--SYFVLSG-------- 113
           N ++GD+     GVS WW+N GDSD  I+ + +  N     D     F L+G        
Sbjct: 144 NFRRGDVFASLAGVSQWWYNRGDSDAVIVIVLDVTNRENQLDQVPRMFQLAGSRTQEEEQ 203

Query: 114 PLS------LLHGFSPEYVGKTYSLNGEETTQFLKSQSNA--LICSIQQTQSLPKPPK-- 173
           PL+         GF P  + + + +N E   Q    + N   +I +      +  PP+  
Sbjct: 204 PLTWPSGNNAFSGFDPNIIAEAFKINIETAKQLQNQKDNRGNIIRANGPLHFVIPPPREW 263

Query: 174 --------------FSKFVYNIDAAAPDGRVKGSAGAVTTVTESKFPFIGQSGLTAILEK 233
                          +K   NID           AG ++T+     P +    L A+   
Sbjct: 264 QQDGIANGIEETYCTAKIHENIDDPERSDHFSTRAGRISTLNSLNLPVLRLVRLNALRGY 323

Query: 234 LDANAVRSPVYVAEPYDQLIYVAKGRGKIQIVGSS--SKIDAEVKMGQLILVPKFFAVGK 293
           L +  +  P + A  +  ++YV  G+ KIQ+V  +  S  + +V  GQ+I++P+ FAV K
Sbjct: 324 LYSGGMVLPQWTANAHT-VLYVTGGQAKIQVVDDNGQSVFNEQVGQGQIIVIPQGFAVSK 383

Query: 294 FAGEDGLECISIITATHPVVEELAGKTSVLEALSPEVFQVSFNVTAEFEKLLR 311
            AGE G E IS  T  +  +  L+G+TS L A+  +V + S+ V  E  K ++
Sbjct: 384 TAGETGFEWISFKTNDNAYINTLSGQTSYLRAVPVDVIKASYGVNEEEAKRIK 435
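
The percentages in each HSP summary line are simply the match counts divided by the alignment length. The following sketch parses such a line and recomputes them; the function name `parse_hsp_summary` is hypothetical, and the pattern deliberately accepts both "Positives" and the "Postives" spelling that some exports of this report use.

```python
import re

# Matches e.g. "Identity = 80/293 (27.30%), Positives = 129/293 (44.03%)".
HSP_SUMMARY = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Return match counts plus reported and recomputed percentages."""
    match = HSP_SUMMARY.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, length, identity_reported, positive, _, positives_reported = match.groups()
    identical, length, positive = int(identical), int(length), int(positive)
    return {
        "identical": identical,
        "positive": positive,
        "length": length,
        "identity_reported": float(identity_reported),
        "positives_reported": float(positives_reported),
        # Recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positive / length, 2),
    }


if __name__ == "__main__":
    line = "Identity = 80/293 (27.30%), Positives = 129/293 (44.03%), Query Frame = 1"
    print(parse_hsp_summary(line))
```

For the CRU4_ARATH hit, 80 identical positions over 293 aligned columns gives 27.30%, and 129 positive positions gives 44.03%, in agreement with the reported values.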

BLAST of Cp4.1LG14g04170 vs. Swiss-Prot
Match: 11S2_SESIN (11S globulin seed storage protein 2 OS=Sesamum indicum PE=2 SV=1)

HSP 1 Score: 81.6 bits (200), Expect = 1.7e-14
Identity = 68/311 (21.86%), Positives = 140/311 (45.02%), Query Frame = 1

Query: 44  RGFVSD--EVVVNLKKGDLIPVPNGVSSWWFNDGDSDLEIIFLGESKNAHVPGDISY--F 103
           RG V D  + V  L++GD++ +P+G + W +NDG  DL  + + +  +     D  +  F
Sbjct: 134 RGSVRDLHQKVHRLRQGDIVAIPSGAAHWCYNDGSEDLVAVSINDVNHLSNQLDQKFRAF 193

Query: 104 VLSGPL---------------SLLHGFSPEYVGKTYSLNGEETTQFLKS--QSNALICSI 163
            L+G +               ++   F  E + + +++  +ET + ++S  +   LI   
Sbjct: 194 YLAGGVPRSGEQEQQARQTFHNIFRAFDAELLSEAFNV-PQETIRRMQSEEEERGLIVMA 253

Query: 164 QQTQSLPKPPK-------------------FSKFVY--NIDAAAPDGRVKGSAGAVTTVT 223
           ++  +  +P +                   F    +  N+++          AG V  V 
Sbjct: 254 RERMTFVRPDEEEGEQEHRGRQLDNGLEETFCTMKFRTNVESRREADIFSRQAGRVHVVD 313

Query: 224 ESKFPFIGQSGLTAILEKLDANAVRSPVYVAEPYDQLIYVAKGRGKIQIVGSSSK--IDA 283
            +K P +    L+A    L +NA+ SP +    +  ++YV +G  ++Q+V  + +  ++ 
Sbjct: 314 RNKLPILKYMDLSAEKGNLYSNALVSPDWSMTGH-TIVYVTRGDAQVQVVDHNGQALMND 373

Query: 284 EVKMGQLILVPKFFAVGKFAGEDGLECISIITATHPVVEELAGKTSVLEALSPEVFQVSF 311
            V  G++ +VP+++     AG +G E ++  T   P+   LAG TSV+ A+  +V   S+
Sbjct: 374 RVNQGEMFVVPQYYTSTARAGNNGFEWVAFKTTGSPMRSPLAGYTSVIRAMPLQVITNSY 433

BLAST of Cp4.1LG14g04170 vs. Swiss-Prot
Match: 13SB_FAGES (13S globulin basic chain OS=Fagopyrum esculentum PE=1 SV=1)

HSP 1 Score: 81.3 bits (199), Expect = 2.2e-14
Identity = 46/133 (34.59%), Positives = 72/133 (54.14%), Query Frame = 1

Query: 174 AGAVTTVTESKFPFIGQSGLTAILEKLDANAVRSPVYVAEPYDQLIYVAKGRGKIQIVGS 233
           AG +TT    K P +    ++A    L +N + +P +    +  L YV +G  K+Q+VG 
Sbjct: 28  AGRITTANSQKLPALRSLQMSAERGFLYSNGIYAPHWNINAHSAL-YVTRGNAKVQVVGD 87

Query: 234 SSK--IDAEVKMGQLILVPKFFAVGKFAGEDGLECISIITATHPVVEELAGKTSVLEALS 293
                 D EVK GQLI+VP++FAV K AG  G E ++  T  + ++  L G+ S   A+ 
Sbjct: 88  EGNKVFDDEVKQGQLIIVPQYFAVIKKAGNQGFEYVAFKTNDNAMINPLVGRLSAFRAIP 147

Query: 294 PEVFQVSFNVTAE 305
            EV + SF +++E
Sbjct: 148 EEVLRSSFQISSE 159

BLAST of Cp4.1LG14g04170 vs. Swiss-Prot
Match: GLUB1_ORYSJ (Glutelin type-B 1 OS=Oryza sativa subsp. japonica GN=GluB1-A PE=2 SV=3)

HSP 1 Score: 79.0 bits (193), Expect = 1.1e-13
Identity = 45/140 (32.14%), Positives = 79/140 (56.43%), Query Frame = 1

Query: 174 AGAVTTVTESKFPFIGQSGLTAILEKLDANAVRSPVYVAEPYDQLIYVAKGRGKIQIVGS 233
           AG +T+V   KFP +    ++A    L  NA+ SP +    +  L+Y+ +GR ++Q+V +
Sbjct: 330 AGRITSVNSQKFPILNLIQMSATRVNLYQNAILSPFWNVNAHS-LVYMIQGRSRVQVVSN 389

Query: 234 SSK--IDAEVKMGQLILVPKFFAVGKFAGEDGLECISIITATHPVVEELAGKTSVLEALS 293
             K   D  ++ GQL+++P+ +AV K A  +G + I+I T  +  V  LAGK SV  AL 
Sbjct: 390 FGKTVFDGVLRPGQLLIIPQHYAVLKKAEREGCQYIAIKTNANAFVSHLAGKNSVFRALP 449

Query: 294 PEVFQVSFNVTAEFEKLLRS 312
            +V   ++ ++ E  + L++
Sbjct: 450 VDVVANAYRISREQARSLKN 468

BLAST of Cp4.1LG14g04170 vs. Swiss-Prot
Match: 13S1_FAGES (13S globulin seed storage protein 1 OS=Fagopyrum esculentum GN=FA02 PE=2 SV=1)

HSP 1 Score: 77.0 bits (188), Expect = 4.2e-13
Identity = 47/151 (31.13%), Positives = 79/151 (52.32%), Query Frame = 1

Query: 156 KFVYNIDAAAPDGRVKGSAGAVTTVTESKFPFIGQSGLTAILEKLDANAVRSPVYVAEPY 215
           KF  N++  +        AG + TV  +  P +    L+A    L  NA+  P +    +
Sbjct: 387 KFKQNVNRPSRADVFNPRAGRINTVNSNNLPILEFIQLSAQHVVLYKNAILGPRWNLNAH 446

Query: 216 DQLIYVAKGRGKIQIVGSSSK--IDAEVKMGQLILVPKFFAVGKFAGEDGLECISIITAT 275
             L YV +G G++Q+VG   +   D  V+ GQ+++VP+ FAV   AG +GLE + +    
Sbjct: 447 SAL-YVTRGEGRVQVVGDEGRSVFDDNVQRGQILVVPQGFAVVLKAGREGLEWVELKNDD 506

Query: 276 HPVVEELAGKTSVLEALSPEVFQVSFNVTAE 305
           + +   +AGKTSVL A+  EV   S++++ +
Sbjct: 507 NAITSPIAGKTSVLRAIPVEVLANSYDISTK 536

BLAST of Cp4.1LG14g04170 vs. TrEMBL
Match: A0A0A0L6K0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G218160 PE=4 SV=1)

HSP 1 Score: 470.7 bits (1210), Expect = 1.4e-129
Identity = 239/339 (70.50%), Positives = 266/339 (78.47%), Query Frame = 1

Query: 3   QPMNPKPFTEVEAGSYHKWLPSEYPLLARNKVAAGRLLLRPRGFV--------------- 62
           + MNPKPF E E GSYHKWLPS+YPLLA+  VA GRLLLRPRGF                
Sbjct: 2   EAMNPKPFFEGEGGSYHKWLPSDYPLLAQTNVAGGRLLLRPRGFAVPHYSDCSKFGYVLQ 61

Query: 63  -------------SDEVVVNLKKGDLIPVPNGVSSWWFNDGDSDLEIIFLGESKNAHVPG 122
                         +EVV+ LKKGDLIPVP GV+SWWFNDGDSDLEIIFLGE+K AHVPG
Sbjct: 62  GEDGVTGFVFPKKCNEVVIKLKKGDLIPVPAGVTSWWFNDGDSDLEIIFLGETKRAHVPG 121

Query: 123 DISYFVLSGPLSLLHGFSPEYVGKTYSLNGEETTQFLKSQSNALICSIQQTQSLPKPPKF 182
           DI+YF+LSGP  LL GF+PEYV K+ SLN EET  FLKSQ N LI ++Q +QSLPKP K+
Sbjct: 122 DITYFILSGPRGLLQGFTPEYVQKSCSLNQEETNTFLKSQPNVLIFTVQPSQSLPKPHKY 181

Query: 183 SKFVYNIDAAAPDGRVKGSAGAVTTVTESKFPFIGQSGLTAILEKLDANAVRSPVYVAEP 242
           SK VYNIDAAAPD R K    AVT VTES FPFIGQ+GLT +LEKLDANA+RSPVY+AEP
Sbjct: 182 SKLVYNIDAAAPDNRAKVGDAAVTMVTESTFPFIGQTGLTPVLEKLDANAIRSPVYIAEP 241

Query: 243 YDQLIYVAKGRGKIQIVGSSSKIDAEVKMGQLILVPKFFAVGKFAGEDGLECISIITATH 302
            DQLIYV KG GKIQ+VG SSK DA+VK GQLILVP++FAVGK AGE+GLECIS+I ATH
Sbjct: 242 SDQLIYVTKGSGKIQVVGFSSKFDADVKTGQLILVPRYFAVGKIAGEEGLECISMIVATH 301

Query: 303 PVVEELAGKTSVLEALSPEVFQVSFNVTAEFEKLLRSKI 314
           P+VEELAGKTSVLEALS EVFQVSFNVTAEFEKL RSK+
Sbjct: 302 PMVEELAGKTSVLEALSSEVFQVSFNVTAEFEKLFRSKV 340

BLAST of Cp4.1LG14g04170 vs. TrEMBL
Match: A0A0A0LC21_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G218170 PE=4 SV=1)

HSP 1 Score: 390.2 bits (1001), Expect = 2.5e-105
Identity = 195/351 (55.56%), Positives = 248/351 (70.66%), Query Frame = 1

Query: 3   QPMNPKPFTEVEAGSYHKWLPSEYPLLARNKVAAGRLLLRPRGFV--------------- 62
           +PM+P  F   E GS+HKW PS++P++++ KV AGRLLL PRGF                
Sbjct: 6   KPMDPSNFFTGEGGSFHKWFPSDFPIISQTKVGAGRLLLHPRGFAVPHNSDSSKVGYVLQ 65

Query: 63  ------------SDEVVVNLKKGDLIPVPNGVSSWWFNDGDSDLEIIFLGESKNAHVPGD 122
                       S+E  V LKKGD+IPVP GV+SWWFNDGDSD E++ +G+++NA +PGD
Sbjct: 66  GSGVAGIIFPCKSEEAAVRLKKGDVIPVPEGVTSWWFNDGDSDFEVLLVGDTRNALIPGD 125

Query: 123 ISYFVLSGPLSLLHGFSPEYVGKTYSLNGEETTQFLKSQSNALICSIQQTQSLPKPPKFS 182
           I+Y V +GPL +L GFS +Y+ K Y L  +E    LKSQ N LI  ++  Q+LP+P   S
Sbjct: 126 ITYVVFAGPLGVLQGFSSDYIEKVYDLTEKEREVLLKSQPNGLIFKLKDDQTLPEPDCHS 185

Query: 183 KFVYNIDAAAPDGRVKGSAGAVTTVTESKFPFIGQSGLTAILEKLDANAVRSPVYVAEPY 242
             V+NI   APD  VKG  G+VT +TE KFPFIG+SGLTA+LEKL+ANAVRSPVYVA+P 
Sbjct: 186 DLVFNIYHTAPDAVVKG-GGSVTVLTEEKFPFIGKSGLTAVLEKLEANAVRSPVYVADPS 245

Query: 243 DQLIYVAKGRGKIQIVGSSSK--IDAEVKMGQLILVPKFFAVGKFAGEDGLECISIITAT 302
            QLIYVA G G++QI  +  +  IDAEVK GQL+LVPK+FAVGK AGE+GLEC +IIT T
Sbjct: 246 VQLIYVASGSGRVQIAETFMRYQIDAEVKAGQLVLVPKYFAVGKMAGEEGLECFTIITTT 305

Query: 303 HPVVEELAGKTSVLEALSPEVFQVSFNVTAEFEKLLRSKITNASPVIGSSD 325
           HP++EEL GKTS+  A SP+VF+ SFN+TA FEKL RSKIT +SP++  SD
Sbjct: 306 HPLLEELGGKTSIFGAFSPQVFEASFNLTAHFEKLFRSKITKSSPLVPPSD 355

BLAST of Cp4.1LG14g04170 vs. TrEMBL
Match: A0A0A0K550_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G337100 PE=4 SV=1)

HSP 1 Score: 330.1 bits (845), Expect = 3.0e-87
Identity = 179/345 (51.88%), Positives = 225/345 (65.22%), Query Frame = 1

Query: 5   MNPKPFTEVEAGSYHKWLPSEYPLLARN----KVAAGRLLLRPRGFV------------- 64
           MNP+   E   GSY+KW PS+YPLLA++     +      L PRGF              
Sbjct: 9   MNPRKHFEGVGGSYNKWYPSDYPLLAQSKVGAGMLL----LHPRGFAILHYSDASKVGYV 68

Query: 65  ---------------SDEVVVNLKKGDLIPVPNGVSSWWFNDGDSDLEIIFLGESKNAHV 124
                          S+E V+ LKKGD+IPVP GV+SWW+NDGDSDLEI FLGE+K AHV
Sbjct: 69  LRGNNGVTGFIFPNTSNEEVIKLKKGDIIPVPTGVTSWWYNDGDSDLEIAFLGETKYAHV 128

Query: 125 PGDISYFVLSGPLSLLHGFSPEYVGKTYSLNGEETTQFLKSQSNALICSIQQTQSLPKPP 184
           PGDISY++LSGP  +L GFS +YV KT++LN  +T+  L SQ N +I  +Q+ Q+LP P 
Sbjct: 129 PGDISYYILSGPQGILQGFSQDYVAKTFNLNEMDTSTLLNSQQNGMIFKLQEGQTLPTPT 188

Query: 185 KFSKFVYNIDAAAPDGRVKGSAGAVTTVTESKFPFIGQSGLTAILEKLDANAVRSPVYVA 244
           K +KFVYN+D          +      V+ES+FPFIG++GL  ++E+L  N VRSPV + 
Sbjct: 189 KDTKFVYNLD----------NYDFFMKVSESEFPFIGETGLAVVVERLGPNVVRSPVLLV 248

Query: 245 EPYDQLIYVAKGRGKIQIVG--SSSKIDAEVKMGQLILVPKFFAVGKFAGEDGLECISII 304
            P DQLIYVA+G G +QIVG  SSSKI+  V+ GQLI VPK+FA GK A E G+E  SI+
Sbjct: 249 SPADQLIYVARGSGTVQIVGLSSSSKIELHVESGQLIFVPKYFAAGKIAAEQGMEFFSIL 308

Query: 305 TATHPVVEELAGKTSVLEALSPEVFQVSFNVTAEFEKLLRSKITN 316
           TA   +V EL GKTSV+EALS EV  VSFN+TAEFEK+LRS  TN
Sbjct: 309 TAKLGLVGELKGKTSVMEALSAEVIAVSFNITAEFEKVLRSNTTN 339

BLAST of Cp4.1LG14g04170 vs. TrEMBL
Match: A0A161JQR4_MEDSA (Hexenal isomerase OS=Medicago sativa GN=MSHI PE=2 SV=1)

HSP 1 Score: 305.1 bits (780), Expect = 1.0e-79
Identity = 168/355 (47.32%), Positives = 222/355 (62.54%), Query Frame = 1

Query: 1   MEQPMNPK---PFTEVEAGSYHKWLPSEYPLLARNKVAAGRLLLRPRGFV---------- 60
           ME  + PK   P  E + G Y+ WL S+ P+LA+  V AG+L+L+PRGF           
Sbjct: 1   MELDLTPKTAQPLLEGDGGGYYIWLSSQVPVLAKTNVGAGQLVLQPRGFALPHYADSNKV 60

Query: 61  ------------------SDEVVVNLKKGDLIPVPNGVSSWWFNDGDSDLEIIFLGESKN 120
                               EVV+ LKKGD+IPVP G  SWWFNDG+SDL IIFLGE+  
Sbjct: 61  GYVIEGTDGVVGMVLPSTGKEVVLKLKKGDVIPVPIGGVSWWFNDGESDLNIIFLGETSI 120

Query: 121 AHVPGDISYFVLSGPLSLLHGFSPEYVGKTYSLNGEETTQFLKSQSNALICSIQQTQSLP 180
           AHVPG+ +YF LSG   LL  FS E + K Y+ N +E T+  +SQ   +I  +++ Q +P
Sbjct: 121 AHVPGEFTYFFLSGVQGLLSSFSSELISKVYNFNKDEVTKLTQSQKGVVIIKLEKGQPMP 180

Query: 181 KP--PKFSKFVYNIDAAAPDGRVKGSAGAVTTVTESKFPFIGQSGLTAILEKLDANAVRS 240
           KP       FVY+IDA  PD + + + G VTT+T+  FPFI   GL+ I  KL+ NA+++
Sbjct: 181 KPQLDLTKDFVYDIDAKKPDIKAQ-NVGLVTTLTDKDFPFIKDVGLSVIRVKLEPNAIKA 240

Query: 241 PVYVAEPYDQLIYVAKGRGKIQIVGSSSK--IDAEVKMGQLILVPKFFAVGKFAGEDGLE 300
           P  +  P  QLIY+A+G GKI+IVG + K  +DA+VK G LI+VP+FF V K AGEDG+E
Sbjct: 241 PSNLITPAIQLIYIARGSGKIEIVGLNGKRVLDAQVKAGHLIVVPQFFVVAKVAGEDGME 300

Query: 301 CISIITATHPVVEELAGKTSVLEALSPEVFQVSFNVTAEFEKLLRSKITNASPVI 321
             SI+T T P+ EELAG+TSV  ALSP V QVSFNV +EF++L  SK T  + +I
Sbjct: 301 SYSIVTTTKPLFEELAGETSVWGALSPTVQQVSFNVDSEFQELFISKTTETTSLI 354

BLAST of Cp4.1LG14g04170 vs. TrEMBL
Match: F6H8X0_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_12s0034g01950 PE=4 SV=1)

HSP 1 Score: 302.4 bits (773), Expect = 6.8e-79
Identity = 160/341 (46.92%), Positives = 216/341 (63.34%), Query Frame = 1

Query: 12  EVEAGSYHKWLPSEYPLLARNKVAAGRLLLRPRGFV------------------------ 71
           E E G+Y++W  +EY LL   KV  GRL+L+PRGF                         
Sbjct: 15  EGEGGTYYRWSSAEYELLKEAKVGGGRLVLQPRGFALPHYADSNKIGYVLQGSCGVVGIV 74

Query: 72  ----SDEVVVNLKKGDLIPVPNGVSSWWFNDGDSDLEIIFLGESKNAHVPGDISYFVLSG 131
               S EVV+ LKKGD+IPVP+G  SWW+NDGDS+L I+FLGE+  A+VPG+ +YF+L+G
Sbjct: 75  SPKASQEVVLRLKKGDIIPVPSGAVSWWYNDGDSELIIVFLGETSKAYVPGEFTYFLLTG 134

Query: 132 PLSLLHGFSPEYVGKTYSLNGEETTQFLKSQSNALICSIQQTQSLPKPPKFS--KFVYNI 191
              +L GFS E+  + Y +N EE  +  +SQS  LI  + +   +P P K S  K V+NI
Sbjct: 135 TQGILGGFSTEFNSRAYDINNEEAKKLARSQSGVLIIKLPEGHKMPHPCKNSTDKLVFNI 194

Query: 192 DAAAPDGRVKGSAGAVTTVTESKFPFIGQSGLTAILEKLDANAVRSPVYVAEPYDQLIYV 251
           DAA PD  V+ +AG +T +T  KFPF+G+ GL+A L KLDANA+ SP+Y A+   Q+IYV
Sbjct: 195 DAALPDIHVQ-NAGLLTALTAKKFPFLGEVGLSATLVKLDANAMSSPMYAADSSVQVIYV 254

Query: 252 AKGRGKIQIVGSSSK--IDAEVKMGQLILVPKFFAVGKFAGEDGLECISIITATHPVVEE 311
           AKG G+IQ+VG + +  +D +VK G L++VP+FF     A  +GLE  S+ITAT PV  E
Sbjct: 255 AKGSGRIQVVGINGERALDTKVKAGHLLVVPRFFVASAIADGEGLEYFSLITATEPVFGE 314

Query: 312 LAGKTSVLEALSPEVFQVSFNVTAEFEKLLRSKITNASPVI 321
             GKTSV  ALSP V Q S NV  EFE+L R+KI  ++ ++
Sbjct: 315 FTGKTSVWGALSPHVLQASLNVAPEFEQLFRAKIKKSTILV 354

BLAST of Cp4.1LG14g04170 vs. TAIR10
Match: AT2G28680.1 (AT2G28680.1 RmlC-like cupins superfamily protein)

HSP 1 Score: 205.7 bits (522), Expect = 4.4e-53
Identity = 118/342 (34.50%), Positives = 174/342 (50.88%), Query Frame = 1

Query: 4   PMNPKPFTEVEAGSYHKWLPSEYPLLARNKVAAGRLLLRPRGFV---------------- 63
           P  PK     + GSY  W P E P+L    + A +L L   G                  
Sbjct: 7   PRLPKKVYGGDGGSYFAWCPEELPMLRDGNIGASKLALEKYGLALPRYSDSPKVAYVLQG 66

Query: 64  ----------SDEVVVNLKKGDLIPVPNGVSSWWFNDGDSDLEIIFLGESKNAHVPGDIS 123
                      +E V+ +KKGD I +P GV +WWFN+ D++L ++FLGE+   H  G  +
Sbjct: 67  AGTAGIVLPEKEEKVIAIKKGDSIALPFGVVTWWFNNEDTELVVLFLGETHKGHKAGQFT 126

Query: 124 YFVLSGPLSLLHGFSPEYVGKTYSLNGEETTQFLKSQSNALICSIQQTQSLPKPPKFSK- 183
            F L+G   +  GFS E+VG+ + L+     + + SQ+   I  +  +  +P+P K  + 
Sbjct: 127 DFYLTGSNGIFTGFSTEFVGRAWDLDETTVKKLVGSQTGNGIVKVDASLKMPEPKKGDRK 186

Query: 184 -FVYNIDAAAPDGRVKGSAGAVTTVTESKFPFIGQSGLTAILEKLDANAVRSPVYVAEPY 243
            FV N   A  D  +K   G V  +     P +G+ G  A L ++D +++ SP +  +  
Sbjct: 187 GFVLNCLEAPLDVDIK-DGGRVVVLNTKNLPLVGEVGFGADLVRIDGHSMCSPGFSCDSA 246

Query: 244 DQLIYVAKGRGKIQIVGSSSK--IDAEVKMGQLILVPKFFAVGKFAGEDGLECISIITAT 303
            Q+ Y+  G G++QIVG+  K  ++  VK G L +VP+FF V K A  DGL   SI+T  
Sbjct: 247 LQVTYIVGGSGRVQIVGADGKRVLETHVKAGVLFIVPRFFVVSKIADSDGLSWFSIVTTP 306

Query: 304 HPVVEELAGKTSVLEALSPEVFQVSFNVTAEFEKLLRSKITN 316
            P+   LAG+TSV +ALSPEV Q +F V  E EK  RSK T+
Sbjct: 307 DPIFTHLAGRTSVWKALSPEVLQAAFKVDPEVEKAFRSKRTS 347

BLAST of Cp4.1LG14g04170 vs. TAIR10
Match: AT1G07750.1 (AT1G07750.1 RmlC-like cupins superfamily protein)

HSP 1 Score: 202.2 bits (513), Expect = 4.8e-52
Identity = 114/344 (33.14%), Positives = 175/344 (50.87%), Query Frame = 1

Query: 4   PMNPKPFTEVEAGSYHKWLPSEYPLLARNKVAAGRLLLRPRGFV---------------- 63
           P  PK     + GSY  W P E P+L +  + A +L L   GF                 
Sbjct: 7   PKLPKKVYGGDGGSYSAWCPEELPMLKQGNIGAAKLALEKNGFAVPRYSDSSKVAYVLQG 66

Query: 64  ----------SDEVVVNLKKGDLIPVPNGVSSWWFNDGDSDLEIIFLGESKNAHVPGDIS 123
                      +E V+ +K+GD I +P GV +WWFN+ D +L I+FLGE+   H  G  +
Sbjct: 67  SGTAGIVLPEKEEKVIAIKQGDSIALPFGVVTWWFNNEDPELVILFLGETHKGHKAGQFT 126

Query: 124 YFVLSGPLSLLHGFSPEYVGKTYSLNGEETTQFLKSQSNALICSIQQTQSLPKPPKFSK- 183
            F L+G   +  GFS E+VG+ + L+     + + SQ+   I  +     +P+P + ++ 
Sbjct: 127 EFYLTGTNGIFTGFSTEFVGRAWDLDENTVKKLVGSQTGNGIVKLDAGFKMPQPKEENRA 186

Query: 184 -FVYNIDAAAPDGRVKGSAGAVTTVTESKFPFIGQSGLTAILEKLDANAVRSPVYVAEPY 243
            FV N   A  D  +K   G V  +     P +G+ G  A L ++DA+++ SP +  +  
Sbjct: 187 GFVLNCLEAPLDVDIK-DGGRVVVLNTKNLPLVGEVGFGADLVRIDAHSMCSPGFSCDSA 246

Query: 244 DQLIYVAKGRGKIQIVGSSSK--IDAEVKMGQLILVPKFFAVGKFAGEDGLECISIITAT 303
            Q+ Y+  G G++Q+VG   K  ++  +K G L +VP+FF V K A  DG+   SI+T  
Sbjct: 247 LQVTYIVGGSGRVQVVGGDGKRVLETHIKAGSLFIVPRFFVVSKIADADGMSWFSIVTTP 306

Query: 304 HPVVEELAGKTSVLEALSPEVFQVSFNVTAEFEKLLRSKITNAS 318
            P+   LAG TSV ++LSPEV Q +F V  E EK  RS  T+++
Sbjct: 307 DPIFTHLAGNTSVWKSLSPEVLQAAFKVAPEVEKSFRSTRTSSA 349

BLAST of Cp4.1LG14g04170 vs. TAIR10
Match: AT1G03890.1 (AT1G03890.1 RmlC-like cupins superfamily protein)

HSP 1 Score: 99.8 bits (247), Expect = 3.4e-21
Identity = 80/293 (27.30%), Positives = 129/293 (44.03%), Query Frame = 1

Query: 54  NLKKGDLIPVPNGVSSWWFNDGDSDLEIIFLGESKNAHVPGDI--SYFVLSG-------- 113
           N ++GD+     GVS WW+N GDSD  I+ + +  N     D     F L+G        
Sbjct: 144 NFRRGDVFASLAGVSQWWYNRGDSDAVIVIVLDVTNRENQLDQVPRMFQLAGSRTQEEEQ 203

Query: 114 PLS------LLHGFSPEYVGKTYSLNGEETTQFLKSQSNA--LICSIQQTQSLPKPPK-- 173
           PL+         GF P  + + + +N E   Q    + N   +I +      +  PP+  
Sbjct: 204 PLTWPSGNNAFSGFDPNIIAEAFKINIETAKQLQNQKDNRGNIIRANGPLHFVIPPPREW 263

Query: 174 --------------FSKFVYNIDAAAPDGRVKGSAGAVTTVTESKFPFIGQSGLTAILEK 233
                          +K   NID           AG ++T+     P +    L A+   
Sbjct: 264 QQDGIANGIEETYCTAKIHENIDDPERSDHFSTRAGRISTLNSLNLPVLRLVRLNALRGY 323

Query: 234 LDANAVRSPVYVAEPYDQLIYVAKGRGKIQIVGSS--SKIDAEVKMGQLILVPKFFAVGK 293
           L +  +  P + A  +  ++YV  G+ KIQ+V  +  S  + +V  GQ+I++P+ FAV K
Sbjct: 324 LYSGGMVLPQWTANAHT-VLYVTGGQAKIQVVDDNGQSVFNEQVGQGQIIVIPQGFAVSK 383

Query: 294 FAGEDGLECISIITATHPVVEELAGKTSVLEALSPEVFQVSFNVTAEFEKLLR 311
            AGE G E IS  T  +  +  L+G+TS L A+  +V + S+ V  E  K ++
Sbjct: 384 TAGETGFEWISFKTNDNAYINTLSGQTSYLRAVPVDVIKASYGVNEEEAKRIK 435

BLAST of Cp4.1LG14g04170 vs. TAIR10
Match: AT4G28520.1 (AT4G28520.1 cruciferin 3)

HSP 1 Score: 71.6 bits (174), Expect = 9.8e-13
Identity = 48/147 (32.65%), Positives = 77/147 (52.38%), Query Frame = 1

Query: 160 NIDAAAPDGRVKGSAGAVTTVTESKFPFIGQSGLTAILEKLDANAVRSPVYVAEPYDQLI 219
           NID  A     K S G VT+V     P +    L+A    L  NA+  P Y     ++++
Sbjct: 347 NIDDPARADVYKPSLGRVTSVNSYTLPILEYVRLSATRGVLQGNAMVLPKYNMNA-NEIL 406

Query: 220 YVAKGRGKIQIVGSSSK--IDAEVKMGQLILVPKFFAVGKFAGEDGLECISIITATHPVV 279
           Y   G+G+IQ+V  + +  +D +V+ GQL+++P+ FA    +  +  E IS  T  + ++
Sbjct: 407 YCTGGQGRIQVVNDNGQNVLDQQVQKGQLVVIPQGFAYVVQSHGNKFEWISFKTNENAMI 466

Query: 280 EELAGKTSVLEALSPEVFQVSFNVTAE 305
             LAG+TS+L AL  EV    F ++ E
Sbjct: 467 STLAGRTSLLRALPLEVISNGFQISPE 492

BLAST of Cp4.1LG14g04170 vs. TAIR10
Match: AT5G44120.3 (AT5G44120.3 RmlC-like cupins superfamily protein)

HSP 1 Score: 64.3 bits (155), Expect = 1.6e-10
Identity = 48/168 (28.57%), Positives = 77/168 (45.83%), Query Frame = 1

Query: 139 ICSIQQTQSLPKPPKFSKFVYNIDAAAPDGRVKGSAGAVTTVTESKFPFIGQSGLTAILE 198
           ICS + T +L  P +   +             K   G ++T+     P +    L+A+  
Sbjct: 288 ICSARCTDNLDDPSRADVY-------------KPQLGYISTLNSYDLPILRFIRLSALRG 347

Query: 199 KLDANAVRSPVYVAEPYDQLIYVAKGRGKIQIVGSSSK--IDAEVKMGQLILVPKFFAVG 258
            +  NA+  P + A   + ++YV  G  +IQIV  +     D +V  GQLI VP+ F+V 
Sbjct: 348 SIRQNAMVLPQWNANA-NAILYVTDGEAQIQIVNDNGNRVFDGQVSQGQLIAVPQGFSVV 407

Query: 259 KFAGEDGLECISIITATHPVVEELAGKTSVLEALSPEVFQVSFNVTAE 305
           K A  +  + +   T  +  +  LAG+TSVL  L  EV    F ++ E
Sbjct: 408 KRATSNRFQWVEFKTNANAQINTLAGRTSVLRGLPLEVITNGFQISPE 441

BLAST of Cp4.1LG14g04170 vs. NCBI nr
Match: gi|659112129|ref|XP_008456076.1| (PREDICTED: glutelin type-A 2-like [Cucumis melo])

HSP 1 Score: 474.9 bits (1221), Expect = 1.1e-130
Identity = 240/339 (70.80%), Positives = 271/339 (79.94%), Query Frame = 1

Query: 3   QPMNPKPFTEVEAGSYHKWLPSEYPLLARNKVAAGRLLLRPRGFV--------------- 62
           + MNPKPF E E GSY KWLPS+YPLLA+  VA GRLLLRPRGF                
Sbjct: 2   EAMNPKPFFEGEGGSYLKWLPSDYPLLAQTNVAGGRLLLRPRGFAVPHYADCSKFGYVLQ 61

Query: 63  -------------SDEVVVNLKKGDLIPVPNGVSSWWFNDGDSDLEIIFLGESKNAHVPG 122
                         +EVV+ LKKGDLIPVP+G++SWWFNDGDSDLEIIFLGE+KNAHVPG
Sbjct: 62  GEDGVTGFVFPNKCNEVVMKLKKGDLIPVPSGITSWWFNDGDSDLEIIFLGETKNAHVPG 121

Query: 123 DISYFVLSGPLSLLHGFSPEYVGKTYSLNGEETTQFLKSQSNALICSIQQTQSLPKPPKF 182
           DI+YF+LSGP  LL GF+PEYV K+YSL+ EET +FLKSQSN LI ++Q +QSLPKP K 
Sbjct: 122 DITYFILSGPRGLLQGFAPEYVQKSYSLSQEETNKFLKSQSNVLIFTVQPSQSLPKPHKH 181

Query: 183 SKFVYNIDAAAPDGRVKGSAGAVTTVTESKFPFIGQSGLTAILEKLDANAVRSPVYVAEP 242
           SK VYNIDAA PD R K  A AVT VTES FPFIGQ+GLTA+LEKLDANA+RSPVY+AEP
Sbjct: 182 SKLVYNIDAAVPDNRAKVGAAAVTMVTESTFPFIGQTGLTAVLEKLDANAIRSPVYIAEP 241

Query: 243 YDQLIYVAKGRGKIQIVGSSSKIDAEVKMGQLILVPKFFAVGKFAGEDGLECISIITATH 302
            DQLIYV KG GKIQ+VG SSK DA+VK+GQLILVP++FAVGK AGE+GLECIS+I ATH
Sbjct: 242 SDQLIYVTKGSGKIQVVGFSSKFDADVKIGQLILVPRYFAVGKMAGEEGLECISMIVATH 301

Query: 303 PVVEELAGKTSVLEALSPEVFQVSFNVTAEFEKLLRSKI 314
           P+VEELAGKTSVLEALS EVFQVSFNVTAEFEKL RSK+
Sbjct: 302 PMVEELAGKTSVLEALSSEVFQVSFNVTAEFEKLFRSKV 340

BLAST of Cp4.1LG14g04170 vs. NCBI nr
Match: gi|449467587|ref|XP_004151504.1| (PREDICTED: legumin J [Cucumis sativus])

HSP 1 Score: 470.7 bits (1210), Expect = 2.1e-129
Identity = 239/339 (70.50%), Positives = 266/339 (78.47%), Query Frame = 1

Query: 3   QPMNPKPFTEVEAGSYHKWLPSEYPLLARNKVAAGRLLLRPRGFV--------------- 62
           + MNPKPF E E GSYHKWLPS+YPLLA+  VA GRLLLRPRGF                
Sbjct: 2   EAMNPKPFFEGEGGSYHKWLPSDYPLLAQTNVAGGRLLLRPRGFAVPHYSDCSKFGYVLQ 61

Query: 63  -------------SDEVVVNLKKGDLIPVPNGVSSWWFNDGDSDLEIIFLGESKNAHVPG 122
                         +EVV+ LKKGDLIPVP GV+SWWFNDGDSDLEIIFLGE+K AHVPG
Sbjct: 62  GEDGVTGFVFPKKCNEVVIKLKKGDLIPVPAGVTSWWFNDGDSDLEIIFLGETKRAHVPG 121

Query: 123 DISYFVLSGPLSLLHGFSPEYVGKTYSLNGEETTQFLKSQSNALICSIQQTQSLPKPPKF 182
           DI+YF+LSGP  LL GF+PEYV K+ SLN EET  FLKSQ N LI ++Q +QSLPKP K+
Sbjct: 122 DITYFILSGPRGLLQGFTPEYVQKSCSLNQEETNTFLKSQPNVLIFTVQPSQSLPKPHKY 181

Query: 183 SKFVYNIDAAAPDGRVKGSAGAVTTVTESKFPFIGQSGLTAILEKLDANAVRSPVYVAEP 242
           SK VYNIDAAAPD R K    AVT VTES FPFIGQ+GLT +LEKLDANA+RSPVY+AEP
Sbjct: 182 SKLVYNIDAAAPDNRAKVGDAAVTMVTESTFPFIGQTGLTPVLEKLDANAIRSPVYIAEP 241

Query: 243 YDQLIYVAKGRGKIQIVGSSSKIDAEVKMGQLILVPKFFAVGKFAGEDGLECISIITATH 302
            DQLIYV KG GKIQ+VG SSK DA+VK GQLILVP++FAVGK AGE+GLECIS+I ATH
Sbjct: 242 SDQLIYVTKGSGKIQVVGFSSKFDADVKTGQLILVPRYFAVGKIAGEEGLECISMIVATH 301

Query: 303 PVVEELAGKTSVLEALSPEVFQVSFNVTAEFEKLLRSKI 314
           P+VEELAGKTSVLEALS EVFQVSFNVTAEFEKL RSK+
Sbjct: 302 PMVEELAGKTSVLEALSSEVFQVSFNVTAEFEKLFRSKV 340

BLAST of Cp4.1LG14g04170 vs. NCBI nr
Match: gi|659112131|ref|XP_008456077.1| (PREDICTED: glutelin type-B 2-like [Cucumis melo])

HSP 1 Score: 393.3 bits (1009), Expect = 4.2e-106
Identity = 201/351 (57.26%), Positives = 248/351 (70.66%), Query Frame = 1

Query: 3   QPMNPKPFTEVEAGSYHKWLPSEYPLLARNKVAAGRLLLRPRGFV--------------- 62
           +PM+P  F   E GS+HKW PS++P++ + KV AGRLLL PRGF                
Sbjct: 6   KPMDPTNFFTGEGGSFHKWFPSDHPIIPQTKVGAGRLLLHPRGFAVPHNSDSSKVGYVLQ 65

Query: 63  ------------SDEVVVNLKKGDLIPVPNGVSSWWFNDGDSDLEIIFLGESKNAHVPGD 122
                       S+E VV LKKGD+IPVP GV+SWWFNDGDSD E++ +G+++NA +PGD
Sbjct: 66  GSGVAGIVFPCKSEEAVVRLKKGDVIPVPEGVTSWWFNDGDSDFEVLLVGDTRNALIPGD 125

Query: 123 ISYFVLSGPLSLLHGFSPEYVGKTYSLNGEETTQFLKSQSNALICSIQQTQSLPKPPKFS 182
           I+Y V +GPL +L GFS +Y+ K Y L  EE    LKSQ N LI  ++  Q+LP+P   S
Sbjct: 126 ITYVVFAGPLGVLQGFSSDYIEKVYDLTEEEREVLLKSQPNGLIFKLKDDQTLPEPDCHS 185

Query: 183 KFVYNIDAAAPDGRVKGSAGAVTTVTESKFPFIGQSGLTAILEKLDANAVRSPVYVAEPY 242
             V+NI  AAPD  VKG  G VT +TE KFPFIG+SGLTA+LEKL+ANAVRSPVYVA+P 
Sbjct: 186 DLVFNIYDAAPDSVVKG-GGTVTVLTEEKFPFIGKSGLTAVLEKLEANAVRSPVYVADPS 245

Query: 243 DQLIYVAKGRGKIQIVGS--SSKIDAEVKMGQLILVPKFFAVGKFAGEDGLECISIITAT 302
            QLIYVA G G+IQI  +    +IDAEVK GQLILVPK+FAVGK AGE+GLEC +IIT T
Sbjct: 246 VQLIYVASGSGRIQIAETFMRKQIDAEVKAGQLILVPKYFAVGKMAGEEGLECFTIITTT 305

Query: 303 HPVVEELAGKTSVLEALSPEVFQVSFNVTAEFEKLLRSKITNASPVIGSSD 325
           HP++EEL GK+S+  A SP+VFQ SFNVTA FEKLL SKIT +SP++  SD
Sbjct: 306 HPLLEELGGKSSIFGAFSPQVFQASFNVTAHFEKLLISKITKSSPLVPPSD 355

BLAST of Cp4.1LG14g04170 vs. NCBI nr
Match: gi|700202448|gb|KGN57581.1| (hypothetical protein Csa_3G218170 [Cucumis sativus])

HSP 1 Score: 390.2 bits (1001), Expect = 3.5e-105
Identity = 195/351 (55.56%), Positives = 248/351 (70.66%), Query Frame = 1

Query: 3   QPMNPKPFTEVEAGSYHKWLPSEYPLLARNKVAAGRLLLRPRGFV--------------- 62
           +PM+P  F   E GS+HKW PS++P++++ KV AGRLLL PRGF                
Sbjct: 6   KPMDPSNFFTGEGGSFHKWFPSDFPIISQTKVGAGRLLLHPRGFAVPHNSDSSKVGYVLQ 65

Query: 63  ------------SDEVVVNLKKGDLIPVPNGVSSWWFNDGDSDLEIIFLGESKNAHVPGD 122
                       S+E  V LKKGD+IPVP GV+SWWFNDGDSD E++ +G+++NA +PGD
Sbjct: 66  GSGVAGIIFPCKSEEAAVRLKKGDVIPVPEGVTSWWFNDGDSDFEVLLVGDTRNALIPGD 125

Query: 123 ISYFVLSGPLSLLHGFSPEYVGKTYSLNGEETTQFLKSQSNALICSIQQTQSLPKPPKFS 182
           I+Y V +GPL +L GFS +Y+ K Y L  +E    LKSQ N LI  ++  Q+LP+P   S
Sbjct: 126 ITYVVFAGPLGVLQGFSSDYIEKVYDLTEKEREVLLKSQPNGLIFKLKDDQTLPEPDCHS 185

Query: 183 KFVYNIDAAAPDGRVKGSAGAVTTVTESKFPFIGQSGLTAILEKLDANAVRSPVYVAEPY 242
             V+NI   APD  VKG  G+VT +TE KFPFIG+SGLTA+LEKL+ANAVRSPVYVA+P 
Sbjct: 186 DLVFNIYHTAPDAVVKG-GGSVTVLTEEKFPFIGKSGLTAVLEKLEANAVRSPVYVADPS 245

Query: 243 DQLIYVAKGRGKIQIVGSSSK--IDAEVKMGQLILVPKFFAVGKFAGEDGLECISIITAT 302
            QLIYVA G G++QI  +  +  IDAEVK GQL+LVPK+FAVGK AGE+GLEC +IIT T
Sbjct: 246 VQLIYVASGSGRVQIAETFMRYQIDAEVKAGQLVLVPKYFAVGKMAGEEGLECFTIITTT 305

Query: 303 HPVVEELAGKTSVLEALSPEVFQVSFNVTAEFEKLLRSKITNASPVIGSSD 325
           HP++EEL GKTS+  A SP+VF+ SFN+TA FEKL RSKIT +SP++  SD
Sbjct: 306 HPLLEELGGKTSIFGAFSPQVFEASFNLTAHFEKLFRSKITKSSPLVPPSD 355

BLAST of Cp4.1LG14g04170 vs. NCBI nr
Match: gi|778726893|ref|XP_004139714.2| (PREDICTED: glutelin type-A 2-like [Cucumis sativus])

HSP 1 Score: 330.1 bits (845), Expect = 4.3e-87
Identity = 179/345 (51.88%), Positives = 225/345 (65.22%), Query Frame = 1

Query: 5   MNPKPFTEVEAGSYHKWLPSEYPLLARN----KVAAGRLLLRPRGFV------------- 64
           MNP+   E   GSY+KW PS+YPLLA++     +      L PRGF              
Sbjct: 9   MNPRKHFEGVGGSYNKWYPSDYPLLAQSKVGAGMLL----LHPRGFAILHYSDASKVGYV 68

Query: 65  ---------------SDEVVVNLKKGDLIPVPNGVSSWWFNDGDSDLEIIFLGESKNAHV 124
                          S+E V+ LKKGD+IPVP GV+SWW+NDGDSDLEI FLGE+K AHV
Sbjct: 69  LRGNNGVTGFIFPNTSNEEVIKLKKGDIIPVPTGVTSWWYNDGDSDLEIAFLGETKYAHV 128

Query: 125 PGDISYFVLSGPLSLLHGFSPEYVGKTYSLNGEETTQFLKSQSNALICSIQQTQSLPKPP 184
           PGDISY++LSGP  +L GFS +YV KT++LN  +T+  L SQ N +I  +Q+ Q+LP P 
Sbjct: 129 PGDISYYILSGPQGILQGFSQDYVAKTFNLNEMDTSTLLNSQQNGMIFKLQEGQTLPTPT 188

Query: 185 KFSKFVYNIDAAAPDGRVKGSAGAVTTVTESKFPFIGQSGLTAILEKLDANAVRSPVYVA 244
           K +KFVYN+D          +      V+ES+FPFIG++GL  ++E+L  N VRSPV + 
Sbjct: 189 KDTKFVYNLD----------NYDFFMKVSESEFPFIGETGLAVVVERLGPNVVRSPVLLV 248

Query: 245 EPYDQLIYVAKGRGKIQIVG--SSSKIDAEVKMGQLILVPKFFAVGKFAGEDGLECISII 304
            P DQLIYVA+G G +QIVG  SSSKI+  V+ GQLI VPK+FA GK A E G+E  SI+
Sbjct: 249 SPADQLIYVARGSGTVQIVGLSSSSKIELHVESGQLIFVPKYFAAGKIAAEQGMEFFSIL 308

Query: 305 TATHPVVEELAGKTSVLEALSPEVFQVSFNVTAEFEKLLRSKITN 316
           TA   +V EL GKTSV+EALS EV  VSFN+TAEFEK+LRS  TN
Sbjct: 309 TAKLGLVGELKGKTSVMEALSAEVIAVSFNITAEFEKVLRSNTTN 339

The following BLAST results are available for this feature:

BLAST vs. Swiss-Prot
Match Name   E-value   Identity (%)  Description
CRU4_ARATH   6.0e-20   27.30         12S seed storage protein CRD OS=Arabidopsis thaliana GN=CRD PE=1 SV=1
11S2_SESIN   1.7e-14   21.86         11S globulin seed storage protein 2 OS=Sesamum indicum PE=2 SV=1
13SB_FAGES   2.2e-14   34.59         13S globulin basic chain OS=Fagopyrum esculentum PE=1 SV=1
GLUB1_ORYSJ  1.1e-13   32.14         Glutelin type-B 1 OS=Oryza sativa subsp. japonica GN=GluB1-A PE=2 SV=3
13S1_FAGES   4.2e-13   31.13         13S globulin seed storage protein 1 OS=Fagopyrum esculentum GN=FA02 PE=2 SV=1

BLAST vs. TrEMBL
Match Name        E-value   Identity (%)  Description
A0A0A0L6K0_CUCSA  1.4e-129  70.50         Uncharacterized protein OS=Cucumis sativus GN=Csa_3G218160 PE=4 SV=1
A0A0A0LC21_CUCSA  2.5e-105  55.56         Uncharacterized protein OS=Cucumis sativus GN=Csa_3G218170 PE=4 SV=1
A0A0A0K550_CUCSA  3.0e-87   51.88         Uncharacterized protein OS=Cucumis sativus GN=Csa_7G337100 PE=4 SV=1
A0A161JQR4_MEDSA  1.0e-79   47.32         Hexenal isomerase OS=Medicago sativa GN=MSHI PE=2 SV=1
F6H8X0_VITVI      6.8e-79   46.92         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_12s0034g01950 PE=4 SV=...

BLAST vs. TAIR10
Match Name   E-value   Identity (%)  Description
AT2G28680.1  4.4e-53   34.50         RmlC-like cupins superfamily protein
AT1G07750.1  4.8e-52   33.14         RmlC-like cupins superfamily protein
AT1G03890.1  3.4e-21   27.30         RmlC-like cupins superfamily protein
AT4G28520.1  9.8e-13   32.65         cruciferin 3
AT5G44120.3  1.6e-10   28.57         RmlC-like cupins superfamily protein

BLAST vs. NCBI nr
Match Name                        E-value   Identity (%)  Description
gi|659112129|ref|XP_008456076.1|  1.1e-130  70.80         PREDICTED: glutelin type-A 2-like [Cucumis melo]
gi|449467587|ref|XP_004151504.1|  2.1e-129  70.50         PREDICTED: legumin J [Cucumis sativus]
gi|659112131|ref|XP_008456077.1|  4.2e-106  57.26         PREDICTED: glutelin type-B 2-like [Cucumis melo]
gi|700202448|gb|KGN57581.1|       3.5e-105  55.56         hypothetical protein Csa_3G218170 [Cucumis sativus]
gi|778726893|ref|XP_004139714.2|  4.3e-87   51.88         PREDICTED: glutelin type-A 2-like [Cucumis sativus]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0045735  nutrient reservoir activity
Vocabulary: INTERPRO
Term       Definition
IPR014710  RmlC-like_jellyroll
IPR011051  RmlC_Cupin_sf
IPR006045  Cupin_1
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0045735 nutrient reservoir activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG14g04170.1  Cp4.1LG14g04170.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description            Source   Source Term        Source Description   Alignment
IPR006045  Cupin 1                    PFAM     PF00190            Cupin_1              coord: 163..305, score: 3.9E-18; coord: 53..126, score: 1.
IPR006045  Cupin 1                    SMART    SM00835            Cupin_1_3            coord: 3..128, score: 2.5E-5; coord: 159..307, score: 4.2
IPR011051  RmlC-like cupin domain     unknown  SSF51182           RmlC-like cupins     coord: 3..310, score: 4.11
IPR014710  RmlC-like jelly roll fold  GENE3D   G3DSA:2.60.120.10  -                    coord: 165..312, score: 1.0E-24; coord: 51..152, score: 6.3
None       No IPR available           PANTHER  PTHR31189          FAMILY NOT NAMED     coord: 4..316, score: 8.7E
None       No IPR available           PANTHER  PTHR31189:SF0      SUBFAMILY NOT NAMED  coord: 4..316, score: 8.7E

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None