CSPI06G30840 (gene) Wild cucumber (PI 183967)

Name: CSPI06G30840
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Vicilin-like antimicrobial peptides 2-2
Location: Chr6: 26418997..26421505 (+)

The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS, three_prime_UTR
ATGAAGAAACACTCATCAGTTTCAGTTTCAGGATCGTTGTTTTCACCTTCCATTCTTATCCCCATCTTCTTCCTTTTCCTCTCTCTTACTGCATATGCCAATGACGGGTGGTGGGAAGGAGACACTCCTGTGGTGAAGAGAGCCAATGAAAGAATCCCAATTCTTAAAACAGAGTACGGTGAGATCTCCGCCGTTGATTTCGACGATGGCACTCGATTTGGACGTTACCATCTCCAGTTCATCACATTGGAACCCAATTCACTGTTTCTCCCTGTTCTTCTTCATTCTGATATGGTGTTCTATGTCCACACTGGTATGGTATATGCTCTGTTTGTCGAAAAATTAGAAAAACAGAACGAAATATGAAATGTTGTATGACGAGATTTTTTATTTATTTTTGGTTGTTTTAGGAAGTGGGAGATTGAATTGGTTTGATGACAATGATTTGAAGGAGGTGGATTTACGGCGGGGAGATCTTTATAGGCTTCATCCAGGTTCCATTTTTTACTTGCAGAGTAGCTTAGAGACCGAACGTGAAAAGCTTCGAATTTATGCTCTGTTTTCCAGCACAGATGAAGATTCATTCGTACGTTTAATCAAAATTTATATCATTAAATCACTGAATGTTTGTAAATTGGTTTGAATATCTGACGGTGAGAATGAATTTGATGCATGTCTTTGAAGAATCCTTCCATTGGAGCATACTCCCGCGTCACTGATCTGGTTCGTGGCTTCGGCAAAGAAGTTCTTCGTAAAGCTTTCATGGTATGAATTTGGCCAATCCAACTCACTCCCTAATTAGTTTATAACTGTCAAATTGTCAAAATTTCCATATTACAACTTGTCAATTCAATTATTTTGTTTCGTTTTTGGATCATTTTCTGAACGGTTGTGATAATCTGTAAAAAGGCCCCTGATGAAGTAATAGAGGAAATAATGAACGCTAAGAGGCCGCCGCTTATCGTCCACGCTGCTGCGCCAACTCCGAGCATAAAGGCAAAGTCGTCGTCACCATGGGAATTTGAGGCTCGGTTATTGAAATCTTTCCTAGGAGGAGACGCAAGTGCGACAGAATTCAACAAGAAGAAGAAGAAGAAAGGCATATACAACGTTTATGAAGTAGACCCTGATTTTGAGAATTGCAATGGATGGAGTTTGACTGTAACCAAGAAAAACTCCCATCAATTGAAAGGCTCCAACATCGGCTTCCTCGTAGTCAACCTTACAGCGGTCAGTAATTAAATGTCACTGATTAATGAAAAAACAGAGTGAACAAACAAACAACAGAATTCATTTGCATTTGCAGGGTTCAATGATGGGTCCGCATTGGAATCCGAGGGCGTGGGAGATTGGGATTGTGACATCGGACGAGCCGGGGGTGATTCGTGTAGGGTGTTCGAGCACCTCGGCAAACAGTTCAAAATGCAAGAATTGGAGTTTTGTGGTAGAGAAAGGGGATGTATTTGTAGTGCCAAGGTTCCATCCAATGGCGCAAATGTCATTCAACAACGGAACGTTTGTATTTGTGGGATTTAGCACAACGAATGGACATAACATGCCGCAGTTCTTTGCTGGGAGCAGCTCTGTTTTGAAAATTGTGGACAGGGAAGTATTGGCATGGTCGTTTGATGTGAATGTGACAACGATTGATCGGTTGTTGAAAGCTAGAGTTGAGTCGATTGTTTTGGAATGTACTTCATGTGCTGAAGAGGAAGTAAGGAAAATGGAAGAGGAAGCTGAGAGAGAGAGGGAGGAGGAAGAAGAAAGAAAGAGAGAAGAGGAGGAACGTAGGAAGAGAGAGGAAGAGGAAGAGAAGAAACGGGAAGAAGAGGAAGAGAGGAAGAGAGAGGAAGAGGAAGAGAAGAAACGGGAAGAAGAAGAAGAGAGAAAAAGGGAAGAAGAGGAACGGAGAAAGAGAGAGGAAGAGGAAGAGAGGAAACGGGAAGAAGAAGAAGAAGAGGAAGAGAGGGAGAGAGAAGAAGAGGAAGCACAGAAAGAGG
AAGAGAGGAGGAGAGAAGAGGAAGAGAGAAAAAGAGAGGAAGAGGAGAGAGAGAGAGAGAAAGAGAGAGAAGAGGAGGAACAGAGGAGGAGAGAAGAAGAAGAGGAAGAAGAAGAAGAGAGGAAGAGAGAGGAAGAAAGGGAGGCAGAGAGAGAGGAGGAGGAAGCTAGGAAAAGAGAGGAAGAACATCAAAGAGAGAGAGGGAAGAGGCGGAGAGAGGGGGAGGAAAGACAAAGAAGACGGTGGGAGGAAGAGGAAGAAGAAGAAGGTGGAGGAGAGGAGCCGCAGCTGCCACTCCCAGTACTAAGAATTTTGGAACAATGGACTTAACAACTAAACTAGAGAAACACTCTCAATGTCCCTATCCTAATGAATAACTAAGAGATGCAACAAAAAGATGTCACATTTTGTGTTAAAAACTGTTCAAGTTTTCAAGTTCTGTCGTGTAATGAAAGATCTTCATCAACCACGTGAGGCTCTTTGTTTACCTTTTAAGATTTTTTAATTTCT

mRNA sequence

ATGAAGAAACACTCATCAGTTTCAGTTTCAGGATCGTTGTTTTCACCTTCCATTCTTATCCCCATCTTCTTCCTTTTCCTCTCTCTTACTGCATATGCCAATGACGGGTGGTGGGAAGGAGACACTCCTGTGGTGAAGAGAGCCAATGAAAGAATCCCAATTCTTAAAACAGAGTACGGTGAGATCTCCGCCGTTGATTTCGACGATGGCACTCGATTTGGACGTTACCATCTCCAGTTCATCACATTGGAACCCAATTCACTGTTTCTCCCTGTTCTTCTTCATTCTGATATGGTGTTCTATGTCCACACTGGAAGTGGGAGATTGAATTGGTTTGATGACAATGATTTGAAGGAGGTGGATTTACGGCGGGGAGATCTTTATAGGCTTCATCCAGGTTCCATTTTTTACTTGCAGAGTAGCTTAGAGACCGAACGTGAAAAGCTTCGAATTTATGCTCTGTTTTCCAGCACAGATGAAGATTCATTCAATCCTTCCATTGGAGCATACTCCCGCGTCACTGATCTGGTTCGTGGCTTCGGCAAAGAAGTTCTTCGTAAAGCTTTCATGGCCCCTGATGAAGTAATAGAGGAAATAATGAACGCTAAGAGGCCGCCGCTTATCGTCCACGCTGCTGCGCCAACTCCGAGCATAAAGGCAAAGTCGTCGTCACCATGGGAATTTGAGGCTCGGTTATTGAAATCTTTCCTAGGAGGAGACGCAAGTGCGACAGAATTCAACAAGAAGAAGAAGAAGAAAGGCATATACAACGTTTATGAAGTAGACCCTGATTTTGAGAATTGCAATGGATGGAGTTTGACTGTAACCAAGAAAAACTCCCATCAATTGAAAGGCTCCAACATCGGCTTCCTCGTAGTCAACCTTACAGCGGGTTCAATGATGGGTCCGCATTGGAATCCGAGGGCGTGGGAGATTGGGATTGTGACATCGGACGAGCCGGGGGTGATTCGTGTAGGGTGTTCGAGCACCTCGGCAAACAGTTCAAAATGCAAGAATTGGAGTTTTGTGGTAGAGAAAGGGGATGTATTTGTAGTGCCAAGGTTCCATCCAATGGCGCAAATGTCATTCAACAACGGAACGTTTGTATTTGTGGGATTTAGCACAACGAATGGACATAACATGCCGCAGTTCTTTGCTGGGAGCAGCTCTGTTTTGAAAATTGTGGACAGGGAAGTATTGGCATGGTCGTTTGATGTGAATGTGACAACGATTGATCGGTTGTTGAAAGCTAGAGTTGAGTCGATTGTTTTGGAATGTACTTCATGTGCTGAAGAGGAAGTAAGGAAAATGGAAGAGGAAGCTGAGAGAGAGAGGGAGGAGGAAGAAGAAAGAAAGAGAGAAGAGGAGGAACGTAGGAAGAGAGAGGAAGAGGAAGAGAAGAAACGGGAAGAAGAGGAAGAGAGGAAGAGAGAGGAAGAGGAAGAGAAGAAACGGGAAGAAGAAGAAGAGAGAAAAAGGGAAGAAGAGGAACGGAGAAAGAGAGAGGAAGAGGAAGAGAGGAAACGGGAAGAAGAAGAAGAAGAGGAAGAGAGGGAGAGAGAAGAAGAGGAAGCACAGAAAGAGGAAGAGAGGAGGAGAGAAGAGGAAGAGAGAAAAAGAGAGGAAGAGGAGAGAGAGAGAGAGAAAGAGAGAGAAGAGGAGGAACAGAGGAGGAGAGAAGAAGAAGAGGAAGAAGAAGAAGAGAGGAAGAGAGAGGAAGAAAGGGAGGCAGAGAGAGAGGAGGAGGAAGCTAGGAAAAGAGAGGAAGAACATCAAAGAGAGAGAGGGAAGAGGCGGAGAGAGGGGGAGGAAAGACAAAGAAGACGGTGGGAGGAAGAGGAAGAAGAAGAAGGTGGAGGAGAGGAGCCGCAGCTGCCACTCCCAGTACTAAGAATTTTGGAACAATGGACTTAA

Coding sequence (CDS)

ATGAAGAAACACTCATCAGTTTCAGTTTCAGGATCGTTGTTTTCACCTTCCATTCTTATCCCCATCTTCTTCCTTTTCCTCTCTCTTACTGCATATGCCAATGACGGGTGGTGGGAAGGAGACACTCCTGTGGTGAAGAGAGCCAATGAAAGAATCCCAATTCTTAAAACAGAGTACGGTGAGATCTCCGCCGTTGATTTCGACGATGGCACTCGATTTGGACGTTACCATCTCCAGTTCATCACATTGGAACCCAATTCACTGTTTCTCCCTGTTCTTCTTCATTCTGATATGGTGTTCTATGTCCACACTGGAAGTGGGAGATTGAATTGGTTTGATGACAATGATTTGAAGGAGGTGGATTTACGGCGGGGAGATCTTTATAGGCTTCATCCAGGTTCCATTTTTTACTTGCAGAGTAGCTTAGAGACCGAACGTGAAAAGCTTCGAATTTATGCTCTGTTTTCCAGCACAGATGAAGATTCATTCAATCCTTCCATTGGAGCATACTCCCGCGTCACTGATCTGGTTCGTGGCTTCGGCAAAGAAGTTCTTCGTAAAGCTTTCATGGCCCCTGATGAAGTAATAGAGGAAATAATGAACGCTAAGAGGCCGCCGCTTATCGTCCACGCTGCTGCGCCAACTCCGAGCATAAAGGCAAAGTCGTCGTCACCATGGGAATTTGAGGCTCGGTTATTGAAATCTTTCCTAGGAGGAGACGCAAGTGCGACAGAATTCAACAAGAAGAAGAAGAAGAAAGGCATATACAACGTTTATGAAGTAGACCCTGATTTTGAGAATTGCAATGGATGGAGTTTGACTGTAACCAAGAAAAACTCCCATCAATTGAAAGGCTCCAACATCGGCTTCCTCGTAGTCAACCTTACAGCGGGTTCAATGATGGGTCCGCATTGGAATCCGAGGGCGTGGGAGATTGGGATTGTGACATCGGACGAGCCGGGGGTGATTCGTGTAGGGTGTTCGAGCACCTCGGCAAACAGTTCAAAATGCAAGAATTGGAGTTTTGTGGTAGAGAAAGGGGATGTATTTGTAGTGCCAAGGTTCCATCCAATGGCGCAAATGTCATTCAACAACGGAACGTTTGTATTTGTGGGATTTAGCACAACGAATGGACATAACATGCCGCAGTTCTTTGCTGGGAGCAGCTCTGTTTTGAAAATTGTGGACAGGGAAGTATTGGCATGGTCGTTTGATGTGAATGTGACAACGATTGATCGGTTGTTGAAAGCTAGAGTTGAGTCGATTGTTTTGGAATGTACTTCATGTGCTGAAGAGGAAGTAAGGAAAATGGAAGAGGAAGCTGAGAGAGAGAGGGAGGAGGAAGAAGAAAGAAAGAGAGAAGAGGAGGAACGTAGGAAGAGAGAGGAAGAGGAAGAGAAGAAACGGGAAGAAGAGGAAGAGAGGAAGAGAGAGGAAGAGGAAGAGAAGAAACGGGAAGAAGAAGAAGAGAGAAAAAGGGAAGAAGAGGAACGGAGAAAGAGAGAGGAAGAGGAAGAGAGGAAACGGGAAGAAGAAGAAGAAGAGGAAGAGAGGGAGAGAGAAGAAGAGGAAGCACAGAAAGAGGAAGAGAGGAGGAGAGAAGAGGAAGAGAGAAAAAGAGAGGAAGAGGAGAGAGAGAGAGAGAAAGAGAGAGAAGAGGAGGAACAGAGGAGGAGAGAAGAAGAAGAGGAAGAAGAAGAAGAGAGGAAGAGAGAGGAAGAAAGGGAGGCAGAGAGAGAGGAGGAGGAAGCTAGGAAAAGAGAGGAAGAACATCAAAGAGAGAGAGGGAAGAGGCGGAGAGAGGGGGAGGAAAGACAAAGAAGACGGTGGGAGGAAGAGGAAGAAGAAGAAGGTGGAGGAGAGGAGCCGCAGCTGCCACTCCCAGTACTAAGAATTTTGGAACAATGGACTTAA
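
The protein queried in the BLAST reports corresponds to a frame-1 translation of this CDS. The following is a minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1); the function name translate and the choice of "X" for unrecognised codons are illustrative and not part of the original record.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        # Hypothetical convention: unknown or ambiguous codons become "X".
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 33 bases of the CDS listed above.
print(translate("ATGAAGAAACACTCATCAGTTTCAGTTTCAGGA"))  # MKKHSSVSVSG
```

The printed residues agree with the start of the query sequence shown in the alignments (MKKHSSVSVSG...). Passing the full CDS string to translate should yield the complete query protein, under the same assumptions.
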
BLAST of CSPI06G30840 vs. Swiss-Prot
Match: VCL21_ARATH (Vicilin-like seed storage protein At2g18540 OS=Arabidopsis thaliana GN=At2g18540 PE=3 SV=1)

HSP 1 Score: 349.7 bits (896), Expect = 6.5e-95
Identity = 292/596 (48.99%), Positives = 398/596 (66.78%), Query Frame = 1

Query: 42  TPVVKRANERIPILKTEYGEISAVDFDDGTRFGRYHLQFITLEPNSLFLPVLLHSDMVFY 101
           +P++ + ++R  ++ TE+G ISAV   DG     YH+QFITLEPN+L LP+LLHSDMVF+
Sbjct: 41  SPLLVKKDQRTSVVATEFGNISAVQIGDG-----YHIQFITLEPNALLLPLLLHSDMVFF 100

Query: 102 VHTGSGRLNWFDDNDLKEVDLRRGDLYRLHPGSIFYLQSSLETEREKLRIYALFSSTDED 161
           VHTG+G LNW D+   ++++LRRGD++RL  G++FY+ S+     EKLR+YA+F+   + 
Sbjct: 101 VHTGTGILNWIDEESERKLELRRGDVFRLRSGTVFYVHSN-----EKLRVYAIFN-VGKC 160

Query: 162 SFNPSIGAYSRVTDLVRGFGKEVLRKAFMAPDEVIEEIMNAKRPPLIVHAAAPTPSIKAK 221
             +P +GAYS V DL+ GF    LR AF  P++++ +I +A +PPLIV+A    P  + +
Sbjct: 161 LNDPCLGAYSSVRDLLLGFDDRTLRSAFAVPEDILRKIRDATKPPLIVNAL---PRNRTQ 220

Query: 222 SSSPWEFEARLLKSFLGGDASATEFNKK------KKKKGIYNVYEVDPDFENCNGWSLTV 281
                ++++RL++ F+  +        K      KKK   +NV+E DPDFEN NG S+ V
Sbjct: 221 GLEEDKWQSRLVRLFVSVEDVTDHLAMKPIVDTNKKKSRTFNVFEEDPDFENNNGRSIVV 280

Query: 282 TKKNSHQLKGSNIGFLVVNLTAGSMMGPHWNPRAWEIGIVTSDEPGVIRVGCSSTSANSS 341
            +K+   LKGS  G  +VNLT GSM+GPHWNP A EI IV   E  V  V   S S+  +
Sbjct: 281 DEKDLDALKGSRFGVFMVNLTKGSMIGPHWNPSACEISIVLEGEGMVRVVNQQSLSSCKN 340

Query: 342 KCKNWSFVVEKGDVFVVPRFHPMAQMSFNNGTFVFVGFSTTNGHNMPQFFAGSSSVLKIV 401
             K+ SF+VE+GDVFVVP+FHPMAQMSF N +FVF+GFST+   N PQF  G SSVLK++
Sbjct: 341 DRKSESFMVEEGDVFVVPKFHPMAQMSFENSSFVFMGFSTSAKTNHPQFLVGQSSVLKVL 400

Query: 402 DREVLAWSFDVNVTTIDRLLKARVESIVLECTSCAEEEVRKMEEEAEREREEEEERKREE 461
           DR+V+A SF+++  TI  LLKA+ ES++ EC SCAE E+ K+       RE EE ++REE
Sbjct: 401 DRDVVAVSFNLSNETIKGLLKAQKESVIFECASCAEGELSKL------MREIEERKRREE 460

Query: 462 EERRKREEEEEKKREEEEERKREEEEEKKREEEEERKREEEERRKREEEEERKREEEEEE 521
           EE  +R +EEE+ R+ EE ++REEEE K+REE      EE ER+KREEEE RKR     E
Sbjct: 461 EEIERRRKEEEEARKREEAKRREEEEAKRREE------EETERKKREEEEARKR-----E 520

Query: 522 EEREREEEEAQKEEERRREEEERKREEEEREREKEREEEEQRRREEE----EEEEEERKR 581
           EER+REEEEA++ EE R++ EE   +  +RE E+E+EEE  ++REEE    E EE ERKR
Sbjct: 521 EERKREEEEAKRREEERKKREEEAEQARKREEEREKEEEMAKKREEERQRKEREEVERKR 580

Query: 582 EEEREAEREEEEARKREEEHQRERGKRRREGEERQRRRWEE-----EEEEEGGGEE 623
            EE+E +R EEEARKREEE +RE    +R  +ERQR+  EE      EE+E   EE
Sbjct: 581 REEQERKRREEEARKREEERKREEEMAKRREQERQRKEREEVERKIREEQERKREE 605
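
The percentages on each HSP summary line are the quoted counts divided by the alignment length (for the hit above, 292 of 596 aligned positions are identical). A small sketch of recomputing them from a summary line follows; the function name hsp_percentages and the regular expression are assumptions made for illustration, not part of any BLAST tool.

```python
import re


def hsp_percentages(summary_line: str) -> dict:
    """Recompute percentages from the 'label = count/length' pairs on an HSP line."""
    # Fields without a count/length fraction (such as the frame) are not matched.
    pairs = re.findall(r"(\w+) = (\d+)/(\d+)", summary_line)
    return {label: round(100 * int(n) / int(total), 2) for label, n, total in pairs}


line = "Identity = 292/596 (48.99%), Positives = 398/596 (66.78%), Query Frame = 1"
print(hsp_percentages(line))  # {'Identity': 48.99, 'Positives': 66.78}
```
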

BLAST of CSPI06G30840 vs. Swiss-Prot
Match: VCL43_ARATH (Vicilin-like seed storage protein At4g36700 OS=Arabidopsis thaliana GN=At4g36700 PE=3 SV=1)

HSP 1 Score: 339.7 bits (870), Expect = 6.8e-92
Identity = 241/491 (49.08%), Positives = 332/491 (67.62%), Query Frame = 1

Query: 42  TPVVKRANERIPILKTEYGEISAVDFDDGTR-FGRYHLQFITLEPNSLFLPVLLHSDMVF 101
           +P++ + ++  PI +T++G+IS V   +G    G Y +  ITLEPN++ LP+LLHSDMVF
Sbjct: 45  SPLLIKKDQWKPIFETKFGQISTVQIGNGCGGMGPYKIHSITLEPNTILLPLLLHSDMVF 104

Query: 102 YVHTGSGRLNWFDDNDLKEVDLRRGDLYRLHPGSIFYLQSS-----LETEREKLRIYALF 161
           +V +GSG LNW D+ + K  ++R GD+YRL PGS+FYLQS      L T   KL++YA+F
Sbjct: 105 FVDSGSGILNWVDE-EAKSTEIRLGDVYRLRPGSVFYLQSKPVDIFLGT---KLKLYAIF 164

Query: 162 SSTDEDSFNPSIGAYSRVTDLVRGFGKEVLRKAFMAPDEVIEEIMNAKRPPLIVHAAAPT 221
           S+ DE   +P  GAYS +TDL+ GF + +L+ AF  P+ +IE + N  +PPLIV     T
Sbjct: 165 SNNDECLHDPCFGAYSSITDLMFGFDETILQSAFGVPEGIIELMRNRTKPPLIVSETLCT 224

Query: 222 PSIKAKSSSPWEFEARLLKSFLGGDASATEFNKKKKKKG---------IYNVYEVDPDFE 281
           P +    ++ W+ + RLLK F  G A   +  KKK+KK           +NV+E +PDFE
Sbjct: 225 PGV----ANTWQLQPRLLKLF-AGSADLVDNKKKKEKKEKKEKVKKAKTFNVFESEPDFE 284

Query: 282 NCNGWSLTVTKKNSHQLKGSNIGFLVVNLTAGSMMGPHWNPRAWEIGIVTSDEPGVIRVG 341
           +  G ++T+ +K+   LKGS +G  +VNLT GSMMGPHWNP A EI IV     G++RV 
Sbjct: 285 SPYGRTITINRKDLKVLKGSMVGVSMVNLTQGSMMGPHWNPWACEISIVLKG-AGMVRVL 344

Query: 342 CSSTSAN-SSKCKNWSFVVEKGDVFVVPRFHPMAQMSFNNGTFVFVGFSTTNGHNMPQFF 401
            SS S+N SS+CKN  F VE+GD+F VPR HPMAQMSFNN + VFVGF+T+  +N PQF 
Sbjct: 345 RSSISSNTSSECKNVRFKVEEGDIFAVPRLHPMAQMSFNNDSLVFVGFTTSAKNNEPQFL 404

Query: 402 AGSSSVLKIVDREVLAWSFDVNVTTIDRLLKARVESIVLECTSCAEEEVRKMEEEAERER 461
           AG  S L+++DR+VLA S +V+  TID LL A+ E+++LEC SCAE E+ K++ E ER +
Sbjct: 405 AGEDSALRMLDRQVLAASLNVSSVTIDGLLGAQKEAVILECHSCAEGEIEKLKVEIER-K 464

Query: 462 EEEEERKREEEERRKREEEEEKKREEEEERKREEEEEKKREEEEERKREEEERRKREEEE 517
           + ++ERKR  +ER+K  EEEE KREEEE RKREEEEEKKR   ++   +EEE R+R+   
Sbjct: 465 KIDDERKRRHDERKK--EEEEAKREEEERRKREEEEEKKRWPPQQ-PPQEEELRERQLPM 521

BLAST of CSPI06G30840 vs. Swiss-Prot
Match: VCL22_ARATH (Vicilin-like seed storage protein At2g28490 OS=Arabidopsis thaliana GN=At2g28490 PE=2 SV=1)

HSP 1 Score: 119.0 bits (297), Expect = 1.9e-25
Identity = 74/217 (34.10%), Positives = 119/217 (54.84%), Query Frame = 1

Query: 223 SSPWEFEARLLKSFLGGDASATEFNKKKKKKGIYNVYEVD--PDFENCNGWSLTVTKKNS 282
           SS W +   +++S L       + +   + +  YN+Y+    P F+N  GWS+ +   + 
Sbjct: 297 SSGWSWR-NIVRSILDLTEEKNKGSGSSECEDSYNIYDKKDKPSFDNKYGWSIALDYDDY 356

Query: 283 HQLKGSNIGFLVVNLTAGSMMGPHWNPRAWEIGIVTSDEPGVIRVGCSSTSANSSKCKNW 342
             LK S IG  +VNLTAG+MM PH NP A E GIV +    +  V  + TSA +++    
Sbjct: 357 KPLKHSGIGVYLVNLTAGAMMAPHMNPTATEYGIVLAGSGEIQVVFPNGTSAMNTR---- 416

Query: 343 SFVVEKGDVFVVPRFHPMAQMSFNNGTFVFVGFSTTNGHNMPQFFAGSSSVLKIVDREVL 402
              V  GDVF +PR+    Q++   G F FVGF+T+   N PQF  GS+S+L+ ++   L
Sbjct: 417 ---VSVGDVFWIPRYFAFCQIASRTGPFEFVGFTTSAHKNRPQFLVGSNSLLRTLNLTSL 476

Query: 403 AWSFDVNVTTIDRLLKARVESIVLECTSCAEEEVRKM 438
           + +F V+  T+ R ++A+ E+++L   + A   V +M
Sbjct: 477 SIAFGVDEETMRRFIEAQREAVILPTPAAAPPHVGEM 505

BLAST of CSPI06G30840 vs. Swiss-Prot
Match: CONB1_LUPAN (Conglutin beta 1 OS=Lupinus angustifolius GN=BETA1 PE=1 SV=1)

HSP 1 Score: 76.6 bits (187), Expect = 1.1e-12
Identity = 79/385 (20.52%), Positives = 152/385 (39.48%), Query Frame = 1

Query: 84  EPNSLFLPVLLHSDMVFYVHTGSGRLNWFDDNDLKEVDLRRGDLYRLHPGSIFYLQSSLE 143
           +PN+L LP    +D +  V  G   +   + +  +  +L +GD  RL  G+  Y+ +  +
Sbjct: 231 KPNTLILPKHSDADFILVVLNGRATITIVNPDKRQVYNLEQGDALRLPAGTTSYILNPDD 290

Query: 144 TEREKL-----------RIYALFSSTDEDSFNPSIGAYSRVTDLVRGFGKEVLRKAFMAP 203
            +  ++           ++Y  + ST +D  +   G      +       E + +  +  
Sbjct: 291 NQNLRVAKLAIPINNPGKLYDFYPSTTKDQQSYFSGFSKNTLEATFNTRYEEIERVLLGD 350

Query: 204 DEVIEEIMNAKRPPLIVHAAAPTPSIKAKSSSPWEFEARLLKSFLGGDASATEFNKKKKK 263
           DE+ E     +              +  K               L   A ++    K  +
Sbjct: 351 DELQENEKQRRGQEQSHQDEGVIVRVSKKQIQE-----------LRKHAQSSSGEGKPSE 410

Query: 264 KGIYNVYEVDPDFENCNGWSLTVTKKNSHQLKGSNIGFLVVNLTAGSMMGPHWNPRAWEI 323
            G +N+    P + N  G    +T   + Q +  NI      +  G+++ PH+N +A  I
Sbjct: 411 SGPFNLRSNKPIYSNKFGNFYEITPDINPQFQDLNISLTFTEINEGALLLPHYNSKAIFI 470

Query: 324 GIVTSDEPGVIRVGCSSTSANSSK-----------CKNWSFVVEKGDVFVVPRFHPMAQM 383
            +V   E     VG         +            + +S  + KGDVF++P  HP++  
Sbjct: 471 VVVDEGEGNYELVGIRDQQRQQDEQEEEYEQGEEEVRRYSDKLSKGDVFIIPAGHPLSIN 530

Query: 384 SFNNGTFVFVGFSTTNGHNMPQFFAGSS-SVLKIVDREVLAWSFDVNVTTIDRLLKARVE 443
           + +N     +GF      N   F AGS  +V+K +DREV   +F  ++  ++RL+K + +
Sbjct: 531 ASSN--LRLLGFGINANENQRNFLAGSEDNVIKQLDREVKELTFPGSIEDVERLIKNQQQ 590

Query: 444 SIVLECTSCAEEEVRKMEEEAERER 446
           S      +   ++ ++ E+E  R R
Sbjct: 591 SYF---ANAQPQQQQQREKEGRRGR 599

BLAST of CSPI06G30840 vs. Swiss-Prot
Match: CONB2_LUPAN (Conglutin beta 2 OS=Lupinus angustifolius GN=BETA2 PE=1 SV=1)

HSP 1 Score: 75.5 bits (184), Expect = 2.4e-12
Identity = 79/403 (19.60%), Positives = 163/403 (40.45%), Query Frame = 1

Query: 54  ILKTEYGEISAVD-FDDGTR----FGRYHLQFITLEPNSLFLPVLLHSDMVFYVHTGSGR 113
           + K   G+I  ++ FD  T        Y +     +PN+L LP    +D +  V  G   
Sbjct: 187 LYKNRNGQIRVLERFDQRTNRLENLQNYRIVEFQSKPNTLILPKHSDADYILVVLNGRAT 246

Query: 114 LNWFDDNDLKEVDLRRGDLYRLHPGSIFYLQSSLETEREKLRIYALFSSTDEDSFNPSIG 173
           +   + +  +  +L  GD  RL  G+  Y+ +  + +  ++   A+  +   + ++    
Sbjct: 247 ITIVNPDKRQAYNLEHGDALRLPAGTTSYILNPDDNQNLRVVKLAIPINNPGNFYDFYPS 306

Query: 174 AYSRVTDLVRGFGKEVLRKAFMAPDEVIEEIMNAKRPPLIVHAAAPTPSIKAKSSSPWEF 233
           +         GF +  L   F    E I+ I+            +       +       
Sbjct: 307 STKDQQSYFNGFSRNTLEATFNTRYEEIQRIILGNEDGQEDEEQSRGQEQSHQDQGVIVR 366

Query: 234 EARLLKSFLGGDASATEFNKKKKKKGIYNVYEVDPDFENCNGWSLTVTKKNSHQLKGSNI 293
            ++     L   A ++    K  + G +N+   +P + N  G    +T   + Q +  +I
Sbjct: 367 VSKEQIQELRKHAQSSSGKGKPSESGPFNLRSDEPIYSNKFGNFYEITPDRNPQAQDLDI 426

Query: 294 GFLVVNLTAGSMMGPHWNPRAWEIGIVTSDEPGVIRVGC-----SSTSANSSKCKNWSFV 353
               + +  G ++ PH+N +A  + +V   E     VG              + + ++  
Sbjct: 427 SLTFIEINEGGLLLPHYNSKAIFVVVVDEGEGNYELVGIRDQERQQDEQEQEEVRRYNAK 486

Query: 354 VEKGDVFVVPRFHPMAQMSFNNGTFVFVGFSTTNGHNMPQFFAGSS-SVLKIVDREVLAW 413
           + +GD+FV+P  HP++  + +N     +GF      N   F AGS  +V++ +D+EV   
Sbjct: 487 LSEGDIFVIPAGHPISINASSN--LRLLGFGINADENQRNFLAGSEDNVIRQLDKEVKQL 546

Query: 414 SFDVNVTTIDRLLKARVESIVLECTSCAEEEVRKMEEEAERER 446
           +F  +V  ++RL+K + +S      +   ++ ++ E+E  R R
Sbjct: 547 TFPGSVEDVERLIKNQQQSYF---ANAQPQQQQQREKEGRRGR 584

BLAST of CSPI06G30840 vs. TrEMBL
Match: A0A0A0KGF2_CUCSA (PreproMP73 OS=Cucumis sativus GN=Csa_6G502040 PE=4 SV=1)

HSP 1 Score: 916.0 bits (2366), Expect = 2.5e-263
Identity = 549/622 (88.26%), Positives = 569/622 (91.48%), Query Frame = 1

Query: 1   MKKHSSVSVSGSLFSPSILIPIFFLFLSLTAYANDGWWEGDTPVVKRANERIPILKTEYG 60
           MKKHS+VSVSGS FSPSILIPIFFLFLSL AYA+DGWWEGDTPVVKRANERIPILKTEYG
Sbjct: 1   MKKHSAVSVSGSSFSPSILIPIFFLFLSLPAYADDGWWEGDTPVVKRANERIPILKTEYG 60

Query: 61  EISAVDFDDGTRFGRYHLQFITLEPNSLFLPVLLHSDMVFYVHTGSGRLNWFDDNDLKEV 120
           EISAVDFDDGTRFGRYHLQFITLEPNSLFLPVLLHSDMVFYVHTGSGRLNWFDDNDLKEV
Sbjct: 61  EISAVDFDDGTRFGRYHLQFITLEPNSLFLPVLLHSDMVFYVHTGSGRLNWFDDNDLKEV 120

Query: 121 DLRRGDLYRLHPGSIFYLQSSLETEREKLRIYALFSSTDEDSFNPSIGAYSRVTDLVRGF 180
           DLRRGDLYRLHPGSIFYLQSSLETEREKLRIYALFSSTDEDSFNPSIGAYSRVTDLVRGF
Sbjct: 121 DLRRGDLYRLHPGSIFYLQSSLETEREKLRIYALFSSTDEDSFNPSIGAYSRVTDLVRGF 180

Query: 181 GKEVLRKAFMAPDEVIEEIMNAKRPPLIVHAAAPTPSIKAKSSSPWEFEARLLKSFLGGD 240
           GKEVLRKAFMAPDEVIEEIMNAKRPPLIVHAAAPTPSIKAKSSSPWEFEARLLKSFLGGD
Sbjct: 181 GKEVLRKAFMAPDEVIEEIMNAKRPPLIVHAAAPTPSIKAKSSSPWEFEARLLKSFLGGD 240

Query: 241 ASATEFNKKKKKKGIYNVYEVDPDFENCNGWSLTVTKKNSHQLKGSNIGFLVVNLTAGSM 300
           ASA EFNKKKKKKGIYNVYEVDPDFENCNGWSLTVTKKNSHQLKGSNIGFLVVNLTAGSM
Sbjct: 241 ASAIEFNKKKKKKGIYNVYEVDPDFENCNGWSLTVTKKNSHQLKGSNIGFLVVNLTAGSM 300

Query: 301 MGPHWNPRAWEIGIVTSDEPGVIRVGCSSTSANSSKCKNWSFVVEKGDVFVVPRFHPMAQ 360
           MGPHWNPRAWEIGIVTSDEPGVIRVGCSSTSANSSKCKNWSFVVEKGDVFVVPRFHPMAQ
Sbjct: 301 MGPHWNPRAWEIGIVTSDEPGVIRVGCSSTSANSSKCKNWSFVVEKGDVFVVPRFHPMAQ 360

Query: 361 MSFNNGTFVFVGFSTTNGHNMPQFFAGSSSVLKIVDREVLAWSFDVNVTTIDRLLKARVE 420
           MSFNNGTFVFVGFSTTNGHNMPQFFAGSSSVLKIVDREVLAWSFDVNVTTIDRLLKARVE
Sbjct: 361 MSFNNGTFVFVGFSTTNGHNMPQFFAGSSSVLKIVDREVLAWSFDVNVTTIDRLLKARVE 420

Query: 421 SIVLECTSCAEEEVRKMEEEAEREREEEEERKREEEERRKREEEEEKKREEEEERKREEE 480
           SIVLECTSCAEEEVRKMEEEAEREREEEEERKREEEERRKREEEEE+KREEEE RKREEE
Sbjct: 421 SIVLECTSCAEEEVRKMEEEAEREREEEEERKREEEERRKREEEEERKREEEERRKREEE 480

Query: 481 EEKKREEEEERKREEEERRKREEEEERKREEEEEEEEREREEEEAQKEEERRREEEERKR 540
           EE+KREEEE              EEERKREEEEEEEER+REEEE            E+KR
Sbjct: 481 EERKREEEER-------------EEERKREEEEEEEERKREEEE------------EKKR 540

Query: 541 EEEEREREKEREEEEQRRREEEEEEEEERKREEEREAEREEEEARKREEEHQRERGKRRR 600
           EEEE E E++REEEE+++REEEEEEEE RKREEE E +REEEE     EE +R+R ++R 
Sbjct: 541 EEEEEEEERKREEEEEKKREEEEEEEE-RKREEEEEKKREEEE-----EEEERKRKRKRG 591

Query: 601 EGEERQRRRWEEEEEEEGGGEE 623
            GE+R+R +  +  E E   EE
Sbjct: 601 GGEKRKREKERKRRERERKREE 591

BLAST of CSPI06G30840 vs. TrEMBL
Match: Q8W3X8_CUCMA (PreproMP73 OS=Cucurbita maxima GN=CmMP73 PE=2 SV=1)

HSP 1 Score: 689.1 bits (1777), Expect = 5.1e-195
Identity = 462/637 (72.53%), Positives = 522/637 (81.95%), Query Frame = 1

Query: 1   MKKHSSVSVSGSLFSPSILIPIFFLFLSLTAYANDGWWEGDTPVVKRANERIPILKTEYG 60
           MKK    ++SGS FS S L  +FFLFLSL + A+D WWE   PV KRANER  +LKTEYG
Sbjct: 1   MKK--CTAISGSPFSLSFLFTVFFLFLSLPSNADDKWWEAACPV-KRANERKSLLKTEYG 60

Query: 61  EISAVDFDDGTRFGRYHLQFITLEPNSLFLPVLLHSDMVFYVHTGSGRLNWFDDNDLKEV 120
           EISAVD  D ++FG YHLQFIT+EPNSLFLPVLLH+DMV Y+HTGSGRLNWFDD+DL+EV
Sbjct: 61  EISAVDLHDASQFGPYHLQFITMEPNSLFLPVLLHADMVLYMHTGSGRLNWFDDDDLREV 120

Query: 121 DLRRGDLYRLHPGSIFYLQSSLETEREKLRIYALFSSTDEDSFNPSIGAYSRVTDLVRGF 180
           DLRRGD++RL PG+IFY+ SSLETEREKLR+YALFSSTDED F P+IGAYSRVTD VRGF
Sbjct: 121 DLRRGDIFRLQPGAIFYIHSSLETEREKLRMYALFSSTDEDPFEPAIGAYSRVTDHVRGF 180

Query: 181 GKEVLRKAFMAPDEVIEEIMNAKRPPLIVHAAAPTPSIKAKS----SSPWEFEARLLKSF 240
            KE+L KAFM P+EVIEEIM+AKRPPLIVHAA    ++  K     S   E EAR LKSF
Sbjct: 181 DKEILCKAFMVPEEVIEEIMDAKRPPLIVHAATTLSTLSKKQRSSLSMSLELEARFLKSF 240

Query: 241 LGGDASATEF-NKKKKKKGIYNVYEVDPDFENCNGWSLTVTKKNSHQLKGSNIGFLVVNL 300
           +GG     +F  KKKKKKG+YNV+E DPDFENCNGWSLTVTKK SHQLKGSNIGF VVNL
Sbjct: 241 IGGGGIGMDFKKKKKKKKGLYNVFEADPDFENCNGWSLTVTKKVSHQLKGSNIGFFVVNL 300

Query: 301 TAGSMMGPHWNPRAWEIGIVTSDEPGVIRVGCSSTSANSSKCKNWSFVVEKGDVFVVPRF 360
           TAGSMMGPHWNPRAWEIGIVTS+E GV+RVGCSS + NSSKCK WSFVV KGDVFVVPRF
Sbjct: 301 TAGSMMGPHWNPRAWEIGIVTSEEAGVVRVGCSSMT-NSSKCKKWSFVVGKGDVFVVPRF 360

Query: 361 HPMAQMSFNNGTFVFVGFSTTNGHNMPQFFAGSSSVLKIVDREVLAWSFDVNVTTIDRLL 420
           HPMAQMSFNNG+F FVGFSTTN +N+PQF AG SSVL+ V+R+VLAWSFDVNVTTIDRLL
Sbjct: 361 HPMAQMSFNNGSFAFVGFSTTNRNNLPQFLAGRSSVLQTVERQVLAWSFDVNVTTIDRLL 420

Query: 421 KARVESIVLECTSCAEEEVRKMEEEAEREREEEEERKREEEERRKREEEEEKKREEEEER 480
           +ARVES++LECTSCAEEEV KMEEEAERER+EEEER+REEEE R+REEEE +KR EEEER
Sbjct: 421 EARVESVILECTSCAEEEVMKMEEEAERERQEEEERRREEEE-REREEEEARKR-EEEER 480

Query: 481 KREEEEEKKREEEEERKREEEERRKREEEEERKREEEEEEEEREREEEEAQKEEERRREE 540
           +REEEE +KR EEEER+REEEE RKREEEE   RE EEEE  +  EE E ++EEERRREE
Sbjct: 481 EREEEEARKR-EEEEREREEEEARKREEEE---REREEEEARKREEEREREEEEERRREE 540

Query: 541 EERKREEEE-REREKE---REEEEQRRREEEE----EEEEERKREEEREAEREEEEARKR 600
           EER+REEEE R+RE+E   + EEE+R REEEE    EEEE RKREEE   +REEEEARKR
Sbjct: 541 EEREREEEEARKREEEEARKREEEEREREEEEARKREEEEARKREEEEARKREEEEARKR 600

Query: 601 EEEHQRERGKRRREGEERQRRRWEEEEEEEGGGEEPQ 625
           E+E  R+R +  RE EE   R   EEEEE    EE +
Sbjct: 601 EKEEARKREEEEREREEEAERERREEEEEARRREEAE 627

BLAST of CSPI06G30840 vs. TrEMBL
Match: U5GAT4_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0007s12490g PE=4 SV=1)

HSP 1 Score: 464.2 bits (1193), Expect = 2.6e-127
Identity = 337/602 (55.98%), Positives = 442/602 (73.42%), Query Frame = 1

Query: 25  LFLSLTAYAND-GWWEGDTPVVKRANERIPILKTEYGEISAVDFDDGTRFGRYHLQFITL 84
           L + + A++ D   WE   P + R   R  ++ TEYGEISA +   GT+ G YH+QFITL
Sbjct: 19  LSIHVEAFSEDVSAWE--RPYLVRRGHRRSLVVTEYGEISAAEISSGTK-GPYHIQFITL 78

Query: 85  EPNSLFLPVLLHSDMVFYVHTGSGRLNWFDDNDLKEVDLRRGDLYRLHPGSIFYLQSSLE 144
           EPNSL LPVLLH+DMVFYVHTG+G+L+W D  ++K ++LRRGD+YRL  GS+F+++S+L+
Sbjct: 79  EPNSLLLPVLLHADMVFYVHTGNGKLSWTDGREMKRMNLRRGDVYRLQAGSVFFVRSNLD 138

Query: 145 TEREKLRIYALFSSTDEDSFNPSIGAYSRVTDLVRGFGKEVLRKAFMAPDEVIEEIMNAK 204
           +ER+K+RI+A+FS+TDED + PSIGAYS V+DLV GF ++VL++AF  P+EV+EE+ +A 
Sbjct: 139 SERQKMRIHAIFSNTDEDIYEPSIGAYSSVSDLVLGFDRKVLQEAFKVPEEVLEELTSAT 198

Query: 205 RPPLIVHAAAPTPSIKAKSSSPWEFEARLLKSFLGGDASATEFNKKKKKKGIYNVYEVDP 264
           +PP +VHA       K + S  WE E R+L   +G        +KK K+   +N+ +  P
Sbjct: 199 KPPAVVHAVT-----KDQKSVNWELEDRMLDFLIGNK------HKKTKETKTFNILDAKP 258

Query: 265 DFENCNGWSLTVTKKNSHQLKGSNIGFLVVNLTAGSMMGPHWNPRAWEIGIVTSDEPGVI 324
           DFENCNGWSLTV K +   L  SNIG  +VNLT GSMMGPHWNP A EI IV     G++
Sbjct: 259 DFENCNGWSLTVDKHSLKSLSDSNIGIFMVNLTKGSMMGPHWNPMATEIAIVLHGR-GMV 318

Query: 325 RVGCSSTSANSSKCKNWSFVVEKGDVFVVPRFHPMAQMSFNNGTFVFVGFSTTNGHNMPQ 384
           RV C ST AN S+CKN  F V++GDVF VPRFHPMAQ+SFNN +FVF+GFST+   N PQ
Sbjct: 319 RVICHST-ANESECKNMRFKVKEGDVFAVPRFHPMAQISFNNDSFVFMGFSTSTKRNHPQ 378

Query: 385 FFAGSSSVLKIVDREVLAWSFDVNVTTIDRLLKARVESIVLECTSCAEEEVRKMEEEAER 444
           F  G SS+L+I+DR +LA SF+V  TT+D+LL A+ E+++L+CTSCAE E  KM+EE E+
Sbjct: 379 FLTGKSSILQILDRGILAVSFNVTNTTMDQLLNAQEEALILDCTSCAEIEENKMKEEFEK 438

Query: 445 EREEEEERKREEEERRKREEEEEKKREEEEERKREEEEEKKREEEEERKREEEERRKREE 504
           E++EEE RKREEEE RK+EEEE +KREEEEER+REEEE +KREE E  ++EEEE+++REE
Sbjct: 439 EKQEEEARKREEEEARKKEEEEARKREEEEEREREEEEARKREEAERERQEEEEKQRREE 498

Query: 505 EEERKREEEEEEEEREREEEEAQK-EEERRREEEERKREEEEREREKEREEEEQRRREEE 564
           EEE  R  + EEEEREREEEEA+K EEE RRE+EE +RE +E E ++ REEEE+  R+ E
Sbjct: 499 EEEEAR--KREEEEREREEEEARKREEEERREQEEAERERQEEEEKQRREEEEEEARKRE 558

Query: 565 EEEEEERKREEEREAEREEEEARKREEEHQRERGKRRREGEERQRRRWEEEEEEEGGGEE 624
           E E E ++ EE++  E EEEEARKREEE +    + RRE EE +R R EEEE++    EE
Sbjct: 559 EAERERQEEEEKQRREEEEEEARKREEEQREREEEERREQEEEERERQEEEEKQRREEEE 602

BLAST of CSPI06G30840 vs. TrEMBL
Match: W9RZX9_9ROSA (Vicilin-like antimicrobial peptides 2-2 OS=Morus notabilis GN=L484_009051 PE=4 SV=1)

HSP 1 Score: 454.5 bits (1168), Expect = 2.1e-124
Identity = 353/654 (53.98%), Positives = 454/654 (69.42%), Query Frame = 1

Query: 17  SILIPIFF----LFLSLTAYAN--------DGWWEGDTPVVKRANERIPILKTEYGEISA 76
           ++LI  FF    L L+L A A         D W   D   + R  +R  ++  EYG+ISA
Sbjct: 7   TLLIFFFFSLSLLHLALHAQAKAKHFGDDEDVWGYHDFRPLVRKTQRKSLVDNEYGQISA 66

Query: 77  VDFDDGTRFGRYHLQFITLEPNSLFLPVLLHSDMVFYVHTGSGRLNWFDDNDLKEVDLRR 136
           V+  DG R G YH+QF TLEPNSLFLPVLLH+DMV +VHTGSGRL++ D+ + + V LR 
Sbjct: 67  VNISDGIR-GPYHIQFFTLEPNSLFLPVLLHADMVLFVHTGSGRLSYADEEETRSVHLRS 126

Query: 137 GDLYRLHPGSIFYLQSSLETEREKLRIYALFSSTDEDSFNPSIGAYSRVTDLVRGFGKEV 196
            D++RL  GS+F++QS L+ ERE LRIYA+FS+TD++ F P+IGAYS   +LVRGF K  
Sbjct: 127 ADIFRLQTGSVFFVQSDLQPERESLRIYAMFSNTDDELFEPAIGAYSGFRNLVRGFDKVT 186

Query: 197 LRKAFMAPDEVIEEIMNAKRPPLIVHAAAPTPSIKA-----KSSSPWEFEARLLKSFLGG 256
           L++AF  PDEVIE I +A     IVHA   +   K      K  + WE E R LK+FL  
Sbjct: 187 LKQAFKVPDEVIESITSATDAEAIVHAIPSSKKEKKEKKEKKKKALWEMETRFLKAFLED 246

Query: 257 DASATEFNKKKKKKGIYNVYEVDPDFENCNGWSLTVTKKNS-HQLKGSNIGFLVVNLTAG 316
           + SA  +NKKKK    YN+++ +PDFENCNGWSLTV +K + H LK +N+G  +VNLT G
Sbjct: 247 EESAA-YNKKKKSGEGYNLFDAEPDFENCNGWSLTVNRKQAAHILKDTNVGLFMVNLTKG 306

Query: 317 SMMGPHWNPRAWEIGIVTSDEPGVIRVGCSS-TSANSSKCKNWSFVVEKGDVFVVPRFHP 376
           SMMGPHWNP++ EI IV   + G++RV CSS  + +  +CKN  F V++GDVF VPRFHP
Sbjct: 307 SMMGPHWNPKSTEIAIVLQGQ-GMVRVVCSSGPNKSKQECKNMRFRVKEGDVFAVPRFHP 366

Query: 377 MAQMSFNNGTFVFVGFSTTNGHNMPQFFAGSSSVLKIVDREVLAWSFDVNVTTIDRLLKA 436
           MAQ+SFNN + VF+GF+TT   N PQF AG  S+L+ +DR++LA SF+V+ TTID+LL  
Sbjct: 367 MAQISFNNDSLVFMGFTTTGVENHPQFLAGKQSILQTLDRDILALSFNVSNTTIDQLLAP 426

Query: 437 RVESIVLECTSCAEEEVRKMEEEAEREREEEEERKRE--EEERRKREEEE-----EKKRE 496
           + +SI+L+CTSCAE+E R MEEE E+E+EEEE RKRE  EEERRKREEEE     E+KR 
Sbjct: 427 QADSIILDCTSCAEKEERIMEEEIEKEKEEEEARKREKQEEERRKREEEERKREEEEKRR 486

Query: 497 EEEERKREEEEEKKREEEEERKREEEER-----RKREEEEERKREEEEEEEEREREEEEA 556
           EEEERKREEEEE+ REEEE RKREEEER     RKREEEE RKREEEE E++RE EEEE 
Sbjct: 487 EEEERKREEEEERSREEEE-RKREEEERKREEERKREEEEARKREEEEREKQREEEEEE- 546

Query: 557 QKEEERRREEEERKREEEEREREKEREEEEQR--RREEEEEEEEERKREEEREAEREEEE 616
               ERRREEE+++++EEERER++  EEEE+R  +REEE   EEER+R+EE E E+E+EE
Sbjct: 547 ----ERRREEEKQRQKEEERERQRREEEEEERESQREEERRREEERQRQEEEEREQEQEE 606

Query: 617 ARKREEEHQRERGKRRREGEERQRRRWEEEEEEEGGGEEPQLPLPVLRILEQWT 638
            R++E+E +RER               +++EEE GGG E +        L+ WT
Sbjct: 607 VRRQEDERERER---------------QQKEEEGGGGGEGE------NHLKMWT 630

BLAST of CSPI06G30840 vs. TrEMBL
Match: A0A067KHS0_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_12256 PE=4 SV=1)

HSP 1 Score: 447.2 bits (1149), Expect = 3.3e-122
Identity = 340/609 (55.83%), Positives = 432/609 (70.94%), Query Frame = 1

Query: 22  IFFLFLSL----TAYANDGWWEGDTPVVKRANERIPILKTEYGEISAVDFDDGTRFGRYH 81
           +  LFLSL    +  A D    G  P + +  +R  ++ TEYG+ISAVD   GT  G YH
Sbjct: 5   LLLLFLSLPFCFSLEAKDVSSAGMRPSLVKREDRKSLIVTEYGQISAVDISTGT-IGDYH 64

Query: 82  LQFITLEPNSLFLPVLLHSDMVFYVHTGSGRLNWFDDN-DLKEVDLRRGDLYRLHPGSIF 141
           L+FITLEPNSLFLPV+LHSDMVFYV+TGSGRL+W +   +LK +D+++GD+YRLHPGS+F
Sbjct: 65  LEFITLEPNSLFLPVILHSDMVFYVNTGSGRLSWAEGGKELKRMDIKKGDVYRLHPGSVF 124

Query: 142 YLQSSLETEREKLRIYALFSSTDEDSFNPSIGAYSRVTDLVRGFGKEVLRKAFMAPDEVI 201
           ++QS+LETER+KLRIYA+FS+ DE ++ P IGAYS + DLV GF  ++L+ AF  P+EVI
Sbjct: 125 FMQSNLETERKKLRIYAIFSNADEGTYEPHIGAYSSINDLVLGFDTKLLQSAFKVPEEVI 184

Query: 202 EEIMNAKRPPLIVHAAAPTPSIKAKSSSPWEFEARLLKSFLGG-DASATEFNKKKKKKGI 261
           EE+ +A RPP IVHAA    SI        E E RLL++F+G  D +    N   KK   
Sbjct: 185 EEMKSAMRPPDIVHAAPQKKSILL------EIEDRLLQAFVGNKDGTLYSSNGGHKKTKK 244

Query: 262 YNVYEVDPDFENCNGWSLTVTKKNSHQLKGSNIGFLVVNLTAGSMMGPHWNPRAWEIGIV 321
            N+ +  PDFENCNGWS+TV KK+  +LKGS I   +VNLT GSMMGPHWNP A EI +V
Sbjct: 245 VNLLDGKPDFENCNGWSVTVDKKDLKRLKGSGISVFMVNLTKGSMMGPHWNPMANEIAVV 304

Query: 322 TSDEPGVIRVGCSSTSANSSKCKNWSFVVEKGDVFVVPRFHPMAQMSFNNGTFVFVGFST 381
                G++RV CSS + N ++CKN  F V++GDVF +PRFHPMAQM+FNN + VF+GFST
Sbjct: 305 LQGL-GMVRVVCSS-NVNETECKNMRFRVQEGDVFAIPRFHPMAQMAFNNESLVFMGFST 364

Query: 382 TNGHNMPQFFAGSSSVLKIVDREVLAWSFDVNVTTIDRLLKARVESIVLECTSCAEEEVR 441
           +   N PQF AG  SV + +++E+LA SF+V  TT+D+LL  + E I+LEC SCAEEE R
Sbjct: 365 STSKNDPQFLAGKRSVFQTLNKEILALSFNVPNTTVDKLLNPQEEEIILECISCAEEEER 424

Query: 442 KMEEEAEREREEEEERKREEEERRKREEEEEKKREEEEERKREEEEEKKREEEEERKREE 501
           KMEEE ERERE      REEEE RKR EEEE+KR+EEE RKREEEE +KREEEE R+ EE
Sbjct: 425 KMEEEMERERE------REEEEARKR-EEEERKRKEEEARKREEEEARKREEEERRREEE 484

Query: 502 EERRKREEEEERKREEEEEEEEREREEEEAQKEEERRREEEERKREEEEREREKEREEEE 561
           E R + EEEEERKR     EEER REEEEA++ EE   EEEERKREEEERE+ + +E ++
Sbjct: 485 EAREREEEEEERKR-----EEERRREEEEAREREE---EEEERKREEEEREKREMKERKK 544

Query: 562 QRRREEEEEEEEERKREEEREAEREEEEARKREEEHQRERGKRRREGEERQRRRWEEEEE 621
           + R+E E  + EER+R E  EA RE+E+ARK EEE QR + +R  E +ER+R R  + EE
Sbjct: 545 RERKEAERRQREERRRRES-EARREQEQARKEEEERQRRQRQREEEAKEREREREVQPEE 588

Query: 622 EEGGGEEPQ 625
           E    EE +
Sbjct: 605 EIKRSEESE 588

BLAST of CSPI06G30840 vs. TAIR10
Match: AT2G18540.1 (AT2G18540.1 RmlC-like cupins superfamily protein)

HSP 1 Score: 352.1 bits (902), Expect = 7.4e-97
Identity = 296/596 (49.66%), Positives = 399/596 (66.95%), Query Frame = 1

Query: 42  TPVVKRANERIPILKTEYGEISAVDFDDGTRFGRYHLQFITLEPNSLFLPVLLHSDMVFY 101
           +P++ + ++R  ++ TE+G ISAV   DG     YH+QFITLEPN+L LP+LLHSDMVF+
Sbjct: 41  SPLLVKKDQRTSVVATEFGNISAVQIGDG-----YHIQFITLEPNALLLPLLLHSDMVFF 100

Query: 102 VHTGSGRLNWFDDNDLKEVDLRRGDLYRLHPGSIFYLQSSLETEREKLRIYALFSSTDED 161
           VHTG+G LNW D+   ++++LRRGD++RL  G++FY+ S+     EKLR+YA+F+   + 
Sbjct: 101 VHTGTGILNWIDEESERKLELRRGDVFRLRSGTVFYVHSN-----EKLRVYAIFN-VGKC 160

Query: 162 SFNPSIGAYSRVTDLVRGFGKEVLRKAFMAPDEVIEEIMNAKRPPLIVHAAAPTPSIKAK 221
             +P +GAYS V DL+ GF    LR AF  P++++ +I +A +PPLIV+A    P  + +
Sbjct: 161 LNDPCLGAYSSVRDLLLGFDDRTLRSAFAVPEDILRKIRDATKPPLIVNAL---PRNRTQ 220

Query: 222 SSSPWEFEARLLKSFLGGDASATEFNKK------KKKKGIYNVYEVDPDFENCNGWSLTV 281
                ++++RL++ F+  +        K      KKK   +NV+E DPDFEN NG S+ V
Sbjct: 221 GLEEDKWQSRLVRLFVSVEDVTDHLAMKPIVDTNKKKSRTFNVFEEDPDFENNNGRSIVV 280

Query: 282 TKKNSHQLKGSNIGFLVVNLTAGSMMGPHWNPRAWEIGIVTSDEPGVIRVGCSSTSANSS 341
            +K+   LKGS  G  +VNLT GSM+GPHWNP A EI IV   E  V  V   S S+  +
Sbjct: 281 DEKDLDALKGSRFGVFMVNLTKGSMIGPHWNPSACEISIVLEGEGMVRVVNQQSLSSCKN 340

Query: 342 KCKNWSFVVEKGDVFVVPRFHPMAQMSFNNGTFVFVGFSTTNGHNMPQFFAGSSSVLKIV 401
             K+ SF+VE+GDVFVVP+FHPMAQMSF N +FVF+GFST+   N PQF  G SSVLK++
Sbjct: 341 DRKSESFMVEEGDVFVVPKFHPMAQMSFENSSFVFMGFSTSAKTNHPQFLVGQSSVLKVL 400

Query: 402 DREVLAWSFDVNVTTIDRLLKARVESIVLECTSCAEEEV----RKMEEEAEREREEEEER 461
           DR+V+A SF+++  TI  LLKA+ ES++ EC SCAE E+    R++EE   RE EE E R
Sbjct: 401 DRDVVAVSFNLSNETIKGLLKAQKESVIFECASCAEGELSKLMREIEERKRREEEEIERR 460

Query: 462 KREEEERRKREEEEEKKREEEEERKREEEEEKKREEEEERKREEEERRKREEEEERKREE 521
           ++EEEE RKREE + ++ EE + R+ EE E KKREEEE RKREEE  RKREEEE ++REE
Sbjct: 461 RKEEEEARKREEAKRREEEEAKRREEEETERKKREEEEARKREEE--RKREEEEAKRREE 520

Query: 522 EEEEEEREREEEEAQKEEERRREEEERKREEEEREREKEREEEEQRRREEEEEEEEERKR 581
           E     R++ EEEA   E+ R+ EEER++EEE     K+REEE QR+    E EE ERKR
Sbjct: 521 E-----RKKREEEA---EQARKREEEREKEEE---MAKKREEERQRK----EREEVERKR 580

Query: 582 EEEREAEREEEEARKREEEHQRERGKRRREGEERQRRRWEE-----EEEEEGGGEE 623
            EE+E +R EEEARKREEE +RE    +R  +ERQR+  EE      EE+E   EE
Sbjct: 581 REEQERKRREEEARKREEERKREEEMAKRREQERQRKEREEVERKIREEQERKREE 605

BLAST of CSPI06G30840 vs. TAIR10
Match: AT4G36700.1 (AT4G36700.1 RmlC-like cupins superfamily protein)

HSP 1 Score: 339.7 bits (870), Expect = 3.8e-93
Identity = 241/491 (49.08%), Positives = 332/491 (67.62%), Query Frame = 1

Query: 42  TPVVKRANERIPILKTEYGEISAVDFDDGTR-FGRYHLQFITLEPNSLFLPVLLHSDMVF 101
           +P++ + ++  PI +T++G+IS V   +G    G Y +  ITLEPN++ LP+LLHSDMVF
Sbjct: 45  SPLLIKKDQWKPIFETKFGQISTVQIGNGCGGMGPYKIHSITLEPNTILLPLLLHSDMVF 104

Query: 102 YVHTGSGRLNWFDDNDLKEVDLRRGDLYRLHPGSIFYLQSS-----LETEREKLRIYALF 161
           +V +GSG LNW D+ + K  ++R GD+YRL PGS+FYLQS      L T   KL++YA+F
Sbjct: 105 FVDSGSGILNWVDE-EAKSTEIRLGDVYRLRPGSVFYLQSKPVDIFLGT---KLKLYAIF 164

Query: 162 SSTDEDSFNPSIGAYSRVTDLVRGFGKEVLRKAFMAPDEVIEEIMNAKRPPLIVHAAAPT 221
           S+ DE   +P  GAYS +TDL+ GF + +L+ AF  P+ +IE + N  +PPLIV     T
Sbjct: 165 SNNDECLHDPCFGAYSSITDLMFGFDETILQSAFGVPEGIIELMRNRTKPPLIVSETLCT 224

Query: 222 PSIKAKSSSPWEFEARLLKSFLGGDASATEFNKKKKKKG---------IYNVYEVDPDFE 281
           P +    ++ W+ + RLLK F  G A   +  KKK+KK           +NV+E +PDFE
Sbjct: 225 PGV----ANTWQLQPRLLKLF-AGSADLVDNKKKKEKKEKKEKVKKAKTFNVFESEPDFE 284

Query: 282 NCNGWSLTVTKKNSHQLKGSNIGFLVVNLTAGSMMGPHWNPRAWEIGIVTSDEPGVIRVG 341
           +  G ++T+ +K+   LKGS +G  +VNLT GSMMGPHWNP A EI IV     G++RV 
Sbjct: 285 SPYGRTITINRKDLKVLKGSMVGVSMVNLTQGSMMGPHWNPWACEISIVLKG-AGMVRVL 344

Query: 342 CSSTSAN-SSKCKNWSFVVEKGDVFVVPRFHPMAQMSFNNGTFVFVGFSTTNGHNMPQFF 401
            SS S+N SS+CKN  F VE+GD+F VPR HPMAQMSFNN + VFVGF+T+  +N PQF 
Sbjct: 345 RSSISSNTSSECKNVRFKVEEGDIFAVPRLHPMAQMSFNNDSLVFVGFTTSAKNNEPQFL 404

Query: 402 AGSSSVLKIVDREVLAWSFDVNVTTIDRLLKARVESIVLECTSCAEEEVRKMEEEAERER 461
           AG  S L+++DR+VLA S +V+  TID LL A+ E+++LEC SCAE E+ K++ E ER +
Sbjct: 405 AGEDSALRMLDRQVLAASLNVSSVTIDGLLGAQKEAVILECHSCAEGEIEKLKVEIER-K 464

Query: 462 EEEEERKREEEERRKREEEEEKKREEEEERKREEEEEKKREEEEERKREEEERRKREEEE 517
           + ++ERKR  +ER+K  EEEE KREEEE RKREEEEEKKR   ++   +EEE R+R+   
Sbjct: 465 KIDDERKRRHDERKK--EEEEAKREEEERRKREEEEEKKRWPPQQ-PPQEEELRERQLPM 521

BLAST of CSPI06G30840 vs. TAIR10
Match: AT2G28490.1 (AT2G28490.1 RmlC-like cupins superfamily protein)

HSP 1 Score: 119.0 bits (297), Expect = 1.1e-26
Identity = 74/217 (34.10%), Positives = 119/217 (54.84%), Query Frame = 1

Query: 223 SSPWEFEARLLKSFLGGDASATEFNKKKKKKGIYNVYEVD--PDFENCNGWSLTVTKKNS 282
           SS W +   +++S L       + +   + +  YN+Y+    P F+N  GWS+ +   + 
Sbjct: 297 SSGWSWR-NIVRSILDLTEEKNKGSGSSECEDSYNIYDKKDKPSFDNKYGWSIALDYDDY 356

Query: 283 HQLKGSNIGFLVVNLTAGSMMGPHWNPRAWEIGIVTSDEPGVIRVGCSSTSANSSKCKNW 342
             LK S IG  +VNLTAG+MM PH NP A E GIV +    +  V  + TSA +++    
Sbjct: 357 KPLKHSGIGVYLVNLTAGAMMAPHMNPTATEYGIVLAGSGEIQVVFPNGTSAMNTR---- 416

Query: 343 SFVVEKGDVFVVPRFHPMAQMSFNNGTFVFVGFSTTNGHNMPQFFAGSSSVLKIVDREVL 402
              V  GDVF +PR+    Q++   G F FVGF+T+   N PQF  GS+S+L+ ++   L
Sbjct: 417 ---VSVGDVFWIPRYFAFCQIASRTGPFEFVGFTTSAHKNRPQFLVGSNSLLRTLNLTSL 476

Query: 403 AWSFDVNVTTIDRLLKARVESIVLECTSCAEEEVRKM 438
           + +F V+  T+ R ++A+ E+++L   + A   V +M
Sbjct: 477 SIAFGVDEETMRRFIEAQREAVILPTPAAAPPHVGEM 505

BLAST of CSPI06G30840 vs. NCBI nr
Match: gi|778722460|ref|XP_011658490.1| (PREDICTED: LOW QUALITY PROTEIN: provicilin [Cucumis sativus])

HSP 1 Score: 988.4 bits (2554), Expect = 5.8e-285
Identity = 610/676 (90.24%), Positives = 617/676 (91.27%), Query Frame = 1

Query: 1   MKKHSSVSVSGSLFSPSILIPIFFLFLSLTAYANDGWWEGDTPVVKRANERIPILKTEYG 60
           MKKHS+VSVSGS FSPSILIPIFFLFLSL AYA+DGWWEGDTPVVKRANERIPILKTEYG
Sbjct: 1   MKKHSAVSVSGSSFSPSILIPIFFLFLSLPAYADDGWWEGDTPVVKRANERIPILKTEYG 60

Query: 61  EISAVDFDDGTRFGRYHLQFITLEPNSLFLPVLLHSDMVFYVHTGSGRLNWFDDNDLKEV 120
           EISAVDFDDGTRFGRYHLQFITLEPNSLFLPVLLHSDMVFYVHTGSGRLNWFDDNDLKEV
Sbjct: 61  EISAVDFDDGTRFGRYHLQFITLEPNSLFLPVLLHSDMVFYVHTGSGRLNWFDDNDLKEV 120

Query: 121 DLRRGDLYRLHPGSIFYLQSSLETEREKLRIYALFSSTDEDSFNPSIGAYSRVTDLVRGF 180
           DLRRGDLYRLHPGSIFYLQSSLETEREKLRIYALFSSTDEDSFNPSIGAYSRVTDLVRGF
Sbjct: 121 DLRRGDLYRLHPGSIFYLQSSLETEREKLRIYALFSSTDEDSFNPSIGAYSRVTDLVRGF 180

Query: 181 GKEVLRKAFMAPDEVIEEIMNAKRPPLIVHAAAPTPSIKAKSSSPWEFEARLLKSFLGGD 240
           GKEVLRKAFMAPDEVIEEIMNAKRPPLIVHAAAPTPSIKAKSSSPWEFEARLLKSFLGGD
Sbjct: 181 GKEVLRKAFMAPDEVIEEIMNAKRPPLIVHAAAPTPSIKAKSSSPWEFEARLLKSFLGGD 240

Query: 241 ASATEFNKKKKKKGIYNVYEVDPDFENCNGWSLTVTKKNSHQLKGSNIGFLVVNLTAGSM 300
           ASA EFNKKKKKKGIYNVYEVDPDFENCNGWSLTVTKKNSHQLKGSNIGFLVVNLTAGSM
Sbjct: 241 ASAIEFNKKKKKKGIYNVYEVDPDFENCNGWSLTVTKKNSHQLKGSNIGFLVVNLTAGSM 300

Query: 301 MGPHWNPRAWEIGIVTSDEPGVIRVGCSSTSANSSKCKNWSFVVEKGDVFVVPRFHPMAQ 360
           MGPHWNPRAWEIGIVTSDEPGVIRVGCSSTSANSSKCKNWSFVVEKGDVFVVPRFHPMAQ
Sbjct: 301 MGPHWNPRAWEIGIVTSDEPGVIRVGCSSTSANSSKCKNWSFVVEKGDVFVVPRFHPMAQ 360

Query: 361 MSFNNGTFVFVGFSTTNGHNMPQFFAGSSSVLKIVDREVLAWSFDVNVTTIDRLLKARVE 420
           MSFNNGTFVFVGFSTTNGHNMPQFFAGSSSVLKIVDREVLAWSFDVNVTTIDRLLKARVE
Sbjct: 361 MSFNNGTFVFVGFSTTNGHNMPQFFAGSSSVLKIVDREVLAWSFDVNVTTIDRLLKARVE 420

Query: 421 SIVLECTSCAEEEVRKMEEEAERER----------------EEEEERKREEEERRK---- 480
           SIVLECTSCAEEEVRKMEEEAERER                EEEEERKREEEERRK    
Sbjct: 421 SIVLECTSCAEEEVRKMEEEAEREREEEEERKREEEERRKREEEEERKREEEERRKREEE 480

Query: 481 -------REEEEEKKREEEEE---RKREEEEEKKREEEEE---RKREEEERRKREEEEE- 540
                   E EEE+KREEEEE   RKREEEEEKKREEEEE   RKREEEE +KREEEEE 
Sbjct: 481 EERKREEEEREEERKREEEEEEEERKREEEEEKKREEEEEEEERKREEEEEKKREEEEEE 540

Query: 541 --RKREEEEEEEEREREEEEAQK---EEERRREEEERKREEEEREREKEREEEEQRRREE 600
             RKREEEEE++  E EEEE +K   EE RRREEEERKREEEEREREKER EEEQRRREE
Sbjct: 541 EERKREEEEEKKREEEEEEEERKRXEEERRRREEEERKREEEEREREKERGEEEQRRREE 600

Query: 601 EEEEEEERKREEEREAEREEEEARKREEEHQRERGKRRREGEERQRRRWEEEEEEEGGGE 638
           EEEEEEER      EAEREEEEARKREEEHQRERGKRRREGEERQRRRW EEEEEEGGGE
Sbjct: 601 EEEEEEER------EAEREEEEARKREEEHQRERGKRRREGEERQRRRW-EEEEEEGGGE 660

BLAST of CSPI06G30840 vs. NCBI nr
Match: gi|700193611|gb|KGN48815.1| (PreproMP73 [Cucumis sativus])

HSP 1 Score: 916.0 bits (2366), Expect = 3.7e-263
Identity = 549/622 (88.26%), Positives = 569/622 (91.48%), Query Frame = 1

Query: 1   MKKHSSVSVSGSLFSPSILIPIFFLFLSLTAYANDGWWEGDTPVVKRANERIPILKTEYG 60
           MKKHS+VSVSGS FSPSILIPIFFLFLSL AYA+DGWWEGDTPVVKRANERIPILKTEYG
Sbjct: 1   MKKHSAVSVSGSSFSPSILIPIFFLFLSLPAYADDGWWEGDTPVVKRANERIPILKTEYG 60

Query: 61  EISAVDFDDGTRFGRYHLQFITLEPNSLFLPVLLHSDMVFYVHTGSGRLNWFDDNDLKEV 120
           EISAVDFDDGTRFGRYHLQFITLEPNSLFLPVLLHSDMVFYVHTGSGRLNWFDDNDLKEV
Sbjct: 61  EISAVDFDDGTRFGRYHLQFITLEPNSLFLPVLLHSDMVFYVHTGSGRLNWFDDNDLKEV 120

Query: 121 DLRRGDLYRLHPGSIFYLQSSLETEREKLRIYALFSSTDEDSFNPSIGAYSRVTDLVRGF 180
           DLRRGDLYRLHPGSIFYLQSSLETEREKLRIYALFSSTDEDSFNPSIGAYSRVTDLVRGF
Sbjct: 121 DLRRGDLYRLHPGSIFYLQSSLETEREKLRIYALFSSTDEDSFNPSIGAYSRVTDLVRGF 180

Query: 181 GKEVLRKAFMAPDEVIEEIMNAKRPPLIVHAAAPTPSIKAKSSSPWEFEARLLKSFLGGD 240
           GKEVLRKAFMAPDEVIEEIMNAKRPPLIVHAAAPTPSIKAKSSSPWEFEARLLKSFLGGD
Sbjct: 181 GKEVLRKAFMAPDEVIEEIMNAKRPPLIVHAAAPTPSIKAKSSSPWEFEARLLKSFLGGD 240

Query: 241 ASATEFNKKKKKKGIYNVYEVDPDFENCNGWSLTVTKKNSHQLKGSNIGFLVVNLTAGSM 300
           ASA EFNKKKKKKGIYNVYEVDPDFENCNGWSLTVTKKNSHQLKGSNIGFLVVNLTAGSM
Sbjct: 241 ASAIEFNKKKKKKGIYNVYEVDPDFENCNGWSLTVTKKNSHQLKGSNIGFLVVNLTAGSM 300

Query: 301 MGPHWNPRAWEIGIVTSDEPGVIRVGCSSTSANSSKCKNWSFVVEKGDVFVVPRFHPMAQ 360
           MGPHWNPRAWEIGIVTSDEPGVIRVGCSSTSANSSKCKNWSFVVEKGDVFVVPRFHPMAQ
Sbjct: 301 MGPHWNPRAWEIGIVTSDEPGVIRVGCSSTSANSSKCKNWSFVVEKGDVFVVPRFHPMAQ 360

Query: 361 MSFNNGTFVFVGFSTTNGHNMPQFFAGSSSVLKIVDREVLAWSFDVNVTTIDRLLKARVE 420
           MSFNNGTFVFVGFSTTNGHNMPQFFAGSSSVLKIVDREVLAWSFDVNVTTIDRLLKARVE
Sbjct: 361 MSFNNGTFVFVGFSTTNGHNMPQFFAGSSSVLKIVDREVLAWSFDVNVTTIDRLLKARVE 420

Query: 421 SIVLECTSCAEEEVRKMEEEAEREREEEEERKREEEERRKREEEEEKKREEEEERKREEE 480
           SIVLECTSCAEEEVRKMEEEAEREREEEEERKREEEERRKREEEEE+KREEEE RKREEE
Sbjct: 421 SIVLECTSCAEEEVRKMEEEAEREREEEEERKREEEERRKREEEEERKREEEERRKREEE 480

Query: 481 EEKKREEEEERKREEEERRKREEEEERKREEEEEEEEREREEEEAQKEEERRREEEERKR 540
           EE+KREEEE              EEERKREEEEEEEER+REEEE            E+KR
Sbjct: 481 EERKREEEER-------------EEERKREEEEEEEERKREEEE------------EKKR 540

Query: 541 EEEEREREKEREEEEQRRREEEEEEEEERKREEEREAEREEEEARKREEEHQRERGKRRR 600
           EEEE E E++REEEE+++REEEEEEEE RKREEE E +REEEE     EE +R+R ++R 
Sbjct: 541 EEEEEEEERKREEEEEKKREEEEEEEE-RKREEEEEKKREEEE-----EEEERKRKRKRG 591

Query: 601 EGEERQRRRWEEEEEEEGGGEE 623
            GE+R+R +  +  E E   EE
Sbjct: 601 GGEKRKREKERKRRERERKREE 591

BLAST of CSPI06G30840 vs. NCBI nr
Match: gi|659080244|ref|XP_008440688.1| (PREDICTED: globulin-1 S allele [Cucumis melo])

HSP 1 Score: 911.8 bits (2355), Expect = 6.9e-262
Identity = 557/659 (84.52%), Positives = 597/659 (90.59%), Query Frame = 1

Query: 1   MKKHSSVSVSGSLFSPSILIPIFFLFLSLTAYANDGWWEGDTPVVKRANERIPILKTEYG 60
           MKK +++SVSGS FS S LI IFFLF SL AYA+DGWWEGD+PVVKRANERI +LKTEYG
Sbjct: 1   MKKPTAISVSGSPFSLSFLISIFFLFFSLPAYADDGWWEGDSPVVKRANERIQLLKTEYG 60

Query: 61  EISAVDFDDGTRFGRYHLQFITLEPNSLFLPVLLHSDMVFYVHTGSGRLNWFDDNDLKEV 120
           +ISAVDFDDG+RFG YHLQFITLEPNSLFLPVLLHSDMVFY+HTGSGRLNWFD+NDLKEV
Sbjct: 61  DISAVDFDDGSRFGPYHLQFITLEPNSLFLPVLLHSDMVFYIHTGSGRLNWFDENDLKEV 120

Query: 121 DLRRGDLYRLHPGSIFYLQSSLETEREKLRIYALFSSTDEDSFNPSIGAYSRVTDLVRGF 180
           DLRRGDLYRLHPGSIFYLQSSLE EREKLRIYALFSSTDEDSFNPS+GAYSRVTDLVRGF
Sbjct: 121 DLRRGDLYRLHPGSIFYLQSSLEIEREKLRIYALFSSTDEDSFNPSLGAYSRVTDLVRGF 180

Query: 181 GKEVLRKAFMAPDEVIEEIMNAKRPPLIVHAAAPTPSIKAKSSSPWEFEARLLKSFLGGD 240
           GKEVLRKAFMAPDEVIEEIM AKRPPLIVHAAAPTPSI+AKSSSPWEFEARLLK+FLGGD
Sbjct: 181 GKEVLRKAFMAPDEVIEEIMTAKRPPLIVHAAAPTPSIRAKSSSPWEFEARLLKAFLGGD 240

Query: 241 ASATEFNKKKKKK-GIYNVYEVDPDFENCNGWSLTVTKKNSHQLKGSNIGFLVVNLTAGS 300
           AS  EFNKKKKKK GIYNVYEVDPDFENCNGWSLTVTKKNSHQLKGSNIGFLVVNLTAGS
Sbjct: 241 ASGIEFNKKKKKKKGIYNVYEVDPDFENCNGWSLTVTKKNSHQLKGSNIGFLVVNLTAGS 300

Query: 301 MMGPHWNPRAWEIGIVTSDEPGVIRVGCSSTSANSSKCKNWSFVVEKGDVFVVPRFHPMA 360
           MMGPHWNPRAWEIGIVTSDEPGV+ VGCSSTSANSSKCKNWSFVVEKGD+FVVPRFHPMA
Sbjct: 301 MMGPHWNPRAWEIGIVTSDEPGVVHVGCSSTSANSSKCKNWSFVVEKGDIFVVPRFHPMA 360

Query: 361 QMSFNNGTFVFVGFSTTNGHNMPQFFAGSSSVLKIVDREVLAWSFDVNVTTIDRLLKARV 420
           QMSFNNGTFVFVGFSTTNGHNMPQFF GSSSVL++VDREVLAWSFDVNVTT+DRLLKARV
Sbjct: 361 QMSFNNGTFVFVGFSTTNGHNMPQFFVGSSSVLQLVDREVLAWSFDVNVTTVDRLLKARV 420

Query: 421 ESIVLECTSCAEEEVRKMEEEAEREREEEEERKREEEERRKREEEEEKKR---------E 480
           ESI+LECTSCAEEEVRKMEEEAEREREEEEERKREEEERRKREEEE++KR         E
Sbjct: 421 ESIILECTSCAEEEVRKMEEEAEREREEEEERKREEEERRKREEEEQRKRXXXXXXXKRE 480

Query: 481 EEEERKREEEEEKKREEE-----EERKREEEERRKREEEEERKREEEEEE------EERE 540
           EEE RKREEEEE +REEE     EE +REEE RR+REEEE+R++E EEEE      EE++
Sbjct: 481 EEERRKREEEEEAEREEERKREEEEAQREEERRRRREEEEKREKEREEEEQRRREEEEQQ 540

Query: 541 REEEEAQKEEERRREEEERKREEEEREREKEREEEEQRRREE-EEEEEEERKREEEREAE 600
           +EEEEAQ+EEERRR  EE +R+ EE E EKEREEE+Q+  EE + EEEEERKREEEREA+
Sbjct: 541 QEEEEAQREEERRRRREEEERKREEEEGEKEREEEQQQEEEEAKREEEEERKREEEREAK 600

Query: 601 REEEEARKREEEHQRERGKRRREGEERQRRRWEEEEEEEGGGEEPQLPLPVLRILEQWT 638
           REEEEAR+REEEHQRERG+RRRE EE QRRRW   EEEEG GEE +   PVLRIL QWT
Sbjct: 601 REEEEAREREEEHQRERGRRRREAEEGQRRRW---EEEEGEGEEEEEEQPVLRILSQWT 656

BLAST of CSPI06G30840 vs. NCBI nr
Match: gi|17221648|dbj|BAB78478.1| (preproMP73 [Cucurbita maxima])

HSP 1 Score: 689.1 bits (1777), Expect = 7.3e-195
Identity = 462/637 (72.53%), Positives = 522/637 (81.95%), Query Frame = 1

Query: 1   MKKHSSVSVSGSLFSPSILIPIFFLFLSLTAYANDGWWEGDTPVVKRANERIPILKTEYG 60
           MKK    ++SGS FS S L  +FFLFLSL + A+D WWE   PV KRANER  +LKTEYG
Sbjct: 1   MKK--CTAISGSPFSLSFLFTVFFLFLSLPSNADDKWWEAACPV-KRANERKSLLKTEYG 60

Query: 61  EISAVDFDDGTRFGRYHLQFITLEPNSLFLPVLLHSDMVFYVHTGSGRLNWFDDNDLKEV 120
           EISAVD  D ++FG YHLQFIT+EPNSLFLPVLLH+DMV Y+HTGSGRLNWFDD+DL+EV
Sbjct: 61  EISAVDLHDASQFGPYHLQFITMEPNSLFLPVLLHADMVLYMHTGSGRLNWFDDDDLREV 120

Query: 121 DLRRGDLYRLHPGSIFYLQSSLETEREKLRIYALFSSTDEDSFNPSIGAYSRVTDLVRGF 180
           DLRRGD++RL PG+IFY+ SSLETEREKLR+YALFSSTDED F P+IGAYSRVTD VRGF
Sbjct: 121 DLRRGDIFRLQPGAIFYIHSSLETEREKLRMYALFSSTDEDPFEPAIGAYSRVTDHVRGF 180

Query: 181 GKEVLRKAFMAPDEVIEEIMNAKRPPLIVHAAAPTPSIKAKS----SSPWEFEARLLKSF 240
            KE+L KAFM P+EVIEEIM+AKRPPLIVHAA    ++  K     S   E EAR LKSF
Sbjct: 181 DKEILCKAFMVPEEVIEEIMDAKRPPLIVHAATTLSTLSKKQRSSLSMSLELEARFLKSF 240

Query: 241 LGGDASATEF-NKKKKKKGIYNVYEVDPDFENCNGWSLTVTKKNSHQLKGSNIGFLVVNL 300
           +GG     +F  KKKKKKG+YNV+E DPDFENCNGWSLTVTKK SHQLKGSNIGF VVNL
Sbjct: 241 IGGGGIGMDFKKKKKKKKGLYNVFEADPDFENCNGWSLTVTKKVSHQLKGSNIGFFVVNL 300

Query: 301 TAGSMMGPHWNPRAWEIGIVTSDEPGVIRVGCSSTSANSSKCKNWSFVVEKGDVFVVPRF 360
           TAGSMMGPHWNPRAWEIGIVTS+E GV+RVGCSS + NSSKCK WSFVV KGDVFVVPRF
Sbjct: 301 TAGSMMGPHWNPRAWEIGIVTSEEAGVVRVGCSSMT-NSSKCKKWSFVVGKGDVFVVPRF 360

Query: 361 HPMAQMSFNNGTFVFVGFSTTNGHNMPQFFAGSSSVLKIVDREVLAWSFDVNVTTIDRLL 420
           HPMAQMSFNNG+F FVGFSTTN +N+PQF AG SSVL+ V+R+VLAWSFDVNVTTIDRLL
Sbjct: 361 HPMAQMSFNNGSFAFVGFSTTNRNNLPQFLAGRSSVLQTVERQVLAWSFDVNVTTIDRLL 420

Query: 421 KARVESIVLECTSCAEEEVRKMEEEAEREREEEEERKREEEERRKREEEEEKKREEEEER 480
           +ARVES++LECTSCAEEEV KMEEEAERER+EEEER+REEEE R+REEEE +KR EEEER
Sbjct: 421 EARVESVILECTSCAEEEVMKMEEEAERERQEEEERRREEEE-REREEEEARKR-EEEER 480

Query: 481 KREEEEEKKREEEEERKREEEERRKREEEEERKREEEEEEEEREREEEEAQKEEERRREE 540
           +REEEE +KR EEEER+REEEE RKREEEE   RE EEEE  +  EE E ++EEERRREE
Sbjct: 481 EREEEEARKR-EEEEREREEEEARKREEEE---REREEEEARKREEEREREEEEERRREE 540

Query: 541 EERKREEEE-REREKE---REEEEQRRREEEE----EEEEERKREEEREAEREEEEARKR 600
           EER+REEEE R+RE+E   + EEE+R REEEE    EEEE RKREEE   +REEEEARKR
Sbjct: 541 EEREREEEEARKREEEEARKREEEEREREEEEARKREEEEARKREEEEARKREEEEARKR 600

Query: 601 EEEHQRERGKRRREGEERQRRRWEEEEEEEGGGEEPQ 625
           E+E  R+R +  RE EE   R   EEEEE    EE +
Sbjct: 601 EKEEARKREEEEREREEEAERERREEEEEARRREEAE 627

BLAST of CSPI06G30840 vs. NCBI nr
Match: gi|702453279|ref|XP_010026166.1| (PREDICTED: provicilin-like [Eucalyptus grandis])

HSP 1 Score: 473.0 bits (1216), Expect = 8.2e-130
Identity = 357/604 (59.11%), Positives = 435/604 (72.02%), Query Frame = 1

Query: 41  DTPVVKRANERIPILKTEYGEISAVDFDDGTRFGRYHLQFITLEPNSLFLPVLLHSDMVF 100
           D   V R  ER PI  TEYGEI+A    DG   G YHL+F+TLEPN+LFLPVLL SDMV 
Sbjct: 50  DVGTVVRKEERTPIAATEYGEITAARVADGGG-GVYHLRFVTLEPNALFLPVLLRSDMVL 109

Query: 101 YVHTGSGRLNWFDDNDLKEVDLRRGDLYRLHPGSIFYLQSSLETEREKLRIYALFSSTDE 160
           YVHTG GRLNW D+ND+K +DLRRGD+YRL PG+IFY+QSSLE EREKLRI A+F++T+E
Sbjct: 110 YVHTGRGRLNWADENDVKRIDLRRGDIYRLRPGTIFYVQSSLEPEREKLRINAIFTNTEE 169

Query: 161 DSFNPSIGAYSRVTDLVRGFGKEVLRKAFMAPDEVIEEIMNAKRPPLIVHAAAPTPSIKA 220
           D + PSIGAYS + DL+RGF  +VL+ AF  P+EV+EE+++A RPP IVHAAA       
Sbjct: 170 DIYEPSIGAYSSIGDLLRGFDSKVLQGAFKVPEEVVEEVISATRPPPIVHAAA-----SE 229

Query: 221 KSSSPWEFEARLLKSFLGGD---ASATEFNKKKKKKGIYNVYEVDPDFENCNGWSLTVTK 280
           K +  W++EAR LK++L      A  +  NKK K K  +NV++ D DFENCNGWSL VT 
Sbjct: 230 KRTKYWDWEARFLKTYLSSTGYLAEGSSSNKKTKTK-TFNVFDTDHDFENCNGWSLMVTG 289

Query: 281 KNSHQLKGSNIGFLVVNLTAGSMMGPHWNPRAWEIGIVTSDEPGVIRVGCSSTSANSSKC 340
           K+ H LK SNIG  +VNLT GSMMGPHWNPRA EI IV   + G+IRV CSST A  S+C
Sbjct: 290 KDMHALKHSNIGVFMVNLTKGSMMGPHWNPRATEIAIVLQGQ-GMIRVVCSST-AKESEC 349

Query: 341 KNWSFVVEKGDVFVVPRFHPMAQMSFNNGTFVFVGFSTTNGHNMPQFFAGSSSVLKIVDR 400
            N  F V +GDVFVVPRFHPMAQMSFNN + VF+GFST+   N PQF AG SS+L+ +DR
Sbjct: 350 NNTRFKVSEGDVFVVPRFHPMAQMSFNNESLVFMGFSTSTKRNYPQFLAGKSSILRALDR 409

Query: 401 EVLAWSFDVNVTTIDRLLKARVESIVLECTSCAEEEVRKMEEEAEREREEEEERKREEEE 460
           E+LA +F+V  TTI  +L  + +SI+L+CTSCAEEE R MEEE E+ER E      EEEE
Sbjct: 410 EILAVAFNVTNTTIHHILAPQTDSIILDCTSCAEEEERLMEEEIEKERRE------EEEE 469

Query: 461 RRKREEEEEKKREEEEERKREEEEEKKREEEEERKREEEERRKREEEEERKREEEEEEEE 520
            ++RE+EEE KR EEEER R+EEEE+      ER+REEEE R+REEEEER+R E +EEEE
Sbjct: 470 AKRREQEEEAKRREEEERARQEEEER------EREREEEEEREREEEEERER-ERKEEEE 529

Query: 521 REREEEEAQKEEERRREEEERKREEEEREREKEREEEEQRRREEEEEEEEERKREE---- 580
           RERE EE ++ E +R+EEE+R+REEEE+ER  EREEEE RRREEEE E EE +R E    
Sbjct: 530 REREREEEEERERKRQEEEQRRREEEEQER--EREEEEARRREEEEREREEEQRREGGGG 589

Query: 581 --EREAEREEEEARKREE-----EHQRERGKRRR----EGEERQRRRWEEE----EEEEG 623
                 ERE+EEAR+REE     E QR++ +RRR    E E+ + RR EEE     EEE 
Sbjct: 590 GGAGGGEREQEEARRREEASERWERQRQQERRRRQEAAEREQEEARRQEEEMQRRHEEEW 629

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
VCL21_ARATH6.5e-9548.99Vicilin-like seed storage protein At2g18540 OS=Arabidopsis thaliana GN=At2g18540... [more]
VCL43_ARATH6.8e-9249.08Vicilin-like seed storage protein At4g36700 OS=Arabidopsis thaliana GN=At4g36700... [more]
VCL22_ARATH1.9e-2534.10Vicilin-like seed storage protein At2g28490 OS=Arabidopsis thaliana GN=At2g28490... [more]
CONB1_LUPAN1.1e-1220.52Conglutin beta 1 OS=Lupinus angustifolius GN=BETA1 PE=1 SV=1[more]
CONB2_LUPAN2.4e-1219.60Conglutin beta 2 OS=Lupinus angustifolius GN=BETA2 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0KGF2_CUCSA2.5e-26388.26PreproMP73 OS=Cucumis sativus GN=Csa_6G502040 PE=4 SV=1[more]
Q8W3X8_CUCMA5.1e-19572.53PreproMP73 OS=Cucurbita maxima GN=CmMP73 PE=2 SV=1[more]
U5GAT4_POPTR2.6e-12755.98Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0007s12490g PE=4 SV=1[more]
W9RZX9_9ROSA2.1e-12453.98Vicilin-like antimicrobial peptides 2-2 OS=Morus notabilis GN=L484_009051 PE=4 S... [more]
A0A067KHS0_JATCU3.3e-12255.83Uncharacterized protein OS=Jatropha curcas GN=JCGZ_12256 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT2G18540.17.4e-9749.66 RmlC-like cupins superfamily protein[more]
AT4G36700.13.8e-9349.08 RmlC-like cupins superfamily protein[more]
AT2G28490.11.1e-2634.10 RmlC-like cupins superfamily protein[more]
Match NameE-valueIdentityDescription
gi|778722460|ref|XP_011658490.1|5.8e-28590.24PREDICTED: LOW QUALITY PROTEIN: provicilin [Cucumis sativus][more]
gi|700193611|gb|KGN48815.1|3.7e-26388.26PreproMP73 [Cucumis sativus][more]
gi|659080244|ref|XP_008440688.1|6.9e-26284.52PREDICTED: globulin-1 S allele [Cucumis melo][more]
gi|17221648|dbj|BAB78478.1|7.3e-19572.53preproMP73 [Cucurbita maxima][more]
gi|702453279|ref|XP_010026166.1|8.2e-13059.11PREDICTED: provicilin-like [Eucalyptus grandis][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR006045   Cupin_1
IPR011051   RmlC_Cupin_sf
IPR014710   RmlC-like_jellyroll

Vocabulary: Molecular Function
Term         Definition
GO:0045735   nutrient reservoir activity

GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0008150       biological_process
cellular_component   GO:0005575       cellular_component
molecular_function   GO:0045735       nutrient reservoir activity

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
CSPI06G30840.1   CSPI06G30840.1   mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term    IPR Description             Source    Source Term         Source Description     Alignment
IPR006045   Cupin 1                     PFAM      PF00190             Cupin_1                coord: 256..410, score: 3.5
IPR006045   Cupin 1                     SMART     SM00835             Cupin_1_3              coord: 256..412, score: 3.2E-48
                                                                                             coord: 46..197, score: 7.
IPR011051   RmlC-like cupin domain      unknown   SSF51182            RmlC-like cupins       coord: 23..418, score: 9.71
IPR014710   RmlC-like jelly roll fold   GENE3D    G3DSA:2.60.120.10                          coord: 257..421, score: 1.8E-28
                                                                                             coord: 50..211, score: 8.5
None        No IPR available            unknown   Coil                Coil                   coord: 427..613, scor
None        No IPR available            PANTHER   PTHR31189           FAMILY NOT NAMED       coord: 8..466, score: 8.3E
None        No IPR available            PANTHER   PTHR31189:SF7       CUPIN FAMILY PROTEIN   coord: 8..466, score: 8.3E
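
The `coord` values in the Alignment column give each domain match as a `start..end` range on the protein. As a rough illustration, the sketch below computes how many residues such a range covers. It assumes the coordinates are 1-based and inclusive at both ends, which this page does not state, and `span_length` is a hypothetical helper name.

```python
def span_length(coord):
    """Return the number of residues covered by a 'start..end' coordinate pair."""
    start_text, end_text = coord.split("..")
    start, end = int(start_text), int(end_text)
    if end < start:
        raise ValueError("end precedes start in " + coord)
    # Assumption: both endpoints are part of the match (inclusive range).
    return end - start + 1


if __name__ == "__main__":
    # Coordinate strings taken from the Alignment column on this page.
    for coord in ("256..410", "46..197", "427..613"):
        print(coord, span_length(coord))
```

Under that assumption, the PFAM Cupin_1 match at 256..410 spans 155 residues and the Coil region at 427..613 spans 187.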

The following gene(s) are paralogous to this gene:

None