Cp4.1LG16g05580 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG16g05580
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Vicilin-like antimicrobial peptides 2-2
Location: Cp4.1LG16 : 6329950 .. 6332648 (-)
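
The genomic span of the feature can be derived from the Location field. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (a common genome-browser convention that this page does not state explicitly). The helper name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Split 'scaffold : start .. end (strand)' into its parts.

    Assumption: start and end are 1-based, inclusive coordinates.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "scaffold": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,  # inclusive span in base pairs
    }

loc = parse_location("Cp4.1LG16 : 6329950 .. 6332648 (-)")
print(loc["scaffold"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 2699 bp on the minus strand of Cp4.1LG16.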
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGGAAGAAAGAAGCCCTTGTGATGCTGTTGATTATTGCTGTTCTGGGAAATGCGATTGGGATTAAGGAGGAAGCAGAAGCAGAAGAAGAAGAAGAATGGTGGAGAGAGAGAGAGGAAGAGAGGGAGTTTAGAAGTAAAGAACGGTTTCTGCTAGAGGATTCGAAGCGGGTGATAGAGACAGAAGCAGGAGAAATGAGGGTGATTAGAAGTCCGGCTTCAAGAATTCTTGACAGGCCCATGCATATCGGCTTCATCACCATGGAACCTAAGTCCCTGTTCGTCCCTCAGTATTTGGACTCCAGCTTGATTCTCTTTGTCCGCAGAGGTAATTGCAGTCTTCTTTCTCAACTCCTCCTTCTTGCTTACATGCATCCATAAGTCACCGTTAAAGAACTCGAAAAGCACTCTACCACAATGAACCAATAAAAGTGATTTTCAATGTTCAATAAGGCCATGCAAGTGCAGATATCGTACTCCTTGAGTTTTTAAACGTATCCTGCCTGGGAGAGGTTTCCAACCAAAGAATGCTTCGTTTCTACTCTCTAGGATGTGGGCTGGCACTTTGTTCCCATCTCCCATCGATGTGAGGTCCCCAATCCTGGGGATCAACATCTTTGGTGGCACACCGTTTTCGTGTGAGGTCCCCAATCCTGGGGATCAACATCTTTGGTGGCACACAGTCTTCGTGTCCACTCCCTTAAGGGCTCAGCCTCCTTGTTGGCATATTGCCCGAGGTTTGGTGGCACGCCGTCTTCGTGTCCACTCCCTTGAGGGCTCAGCCTTCTTGTTGGCAGATTGCCCGAGGTTTGGTGGCACACCGTCTTCGTGTCCACTCCCTTAAGGGCTCAGCCTCCTTGTTGGCATATTGGCCGAGGTCGGGCTCTTATATCATTTGTAACGGTCCAAGCCCACCGCTAGGTTATATTGTCCTTTTTGGATTCTTTTTTGTGCTTTCTCTTAAGGTTTTTACCACGCGATTGATAGGGAGATGGTCTCATTCCAAAATCTGGATGGGCTCAGATATAAATTTGAGATATCACTCCCTCAAATTACAGCTTTTGGGTCCCTCACTGAATAATCTTTTCAAGAAAAGGGATTCAAATACAGGACCCATGTAAATCTTTTTCACAATCTTGACATGCATTTAGTTTTTGGAAGCTAGAGTGTGAATGAAATGGGAAATTTTCAGACAATTGAAGGGAAGGGGAATTGTAATTGATTGCAGGGGAAGTGAAAGTGGGATTGATTTACAAAGACGAGTTAGCAGAGAGGAGGATGAAGGGCGGAGATGTGTACCGGATCCCAGCCGGTTCAGTATTTTACATGGTGAACGTAGGGGAAGGACAGAGACTTCAAATTATCTGCAGCATTGACAAATCTGAGAGCTTGAGTTATGGTACTTTCCAGGTAATTTCTCAGCTTGGTGATAATGGGGGTTGGAATGTGATCGGAATCCGGAATTAGTAATGAAATTTTGATTGTTTTTTTCTTACAATTGCAGTCCTTCTTCATCGGCGGAGGAACCTATCCGGTATCAGTTCTAGCAGGTTTCGACCAGGATACATTGGCTACAGCATTTAATGTAAGAAATTCGACTAAAATCCCTAATCTGCTTCTTTAATTTTGAAACATTATACTCACAGCGCTCTGCTTGTCGTTAACTATGGATTTTCACTCATCAATTTACTCCACAGGTTTCCTATACAGAATTGAGGAGGATTCTATCTAGGCAACGACAGGGTCCAATCGTTTACATCTCGGATACTGAATCGCCGGGGGTTTGGAGTAAGTTTCTACAGGTGAAAGATGGAGACAAAGGAAATAAAATAGCTACCATCAATGAAGATGGCGAAGAAGCAGAGAAGAATAAGACATGGTCATGGAGGAATCTCATGAGTTCGATCTTTGGAAACGAAAATCGGGACAAAACTAAAAGAACAAGAACAGGAAAATCCCCCGATTCGTACAACCTCTACGACAAAACCCCAGACTTCAGC
AACGCCTACGGATGGAGCGTTGCCCTAGACGAGCACGAGTATTCTCCTCTAGGTCACTCCGGAATCGGAGTTTATCTCGTCAATCTCACAGCGGTAAATCTAATCCACCGAATCCCTAATTGTCCTCTCAATTAATTTACGTACCTCCCTCGATGTTCGATTCAATACGAATGACCTCTATTACAGGGATCCATGATGGCGCCACACATAAATCCAACCGCGGCAGAGTACGGCATCGTCCTCAGAGGCACAGGCACAATACAAATCGTATATCCAAACGGAACCTCCGCCATGGACACAGAAGTAACCGAAGGGGACGTATTCTGGGTTCCAAGATACTTCCCATTCTGCCAAATCGCATCGAGAACAGGTCCATTCGAGTTCTTCGGATTCACCACGTCATCGCGCAGGAATCGACCGCAGTTCTTGGCCGGTGCGAATTCGGTTTTCCACACTCTAAGAAGCCCGGCAGTGGCATCAGCATTCGACATAACAGAGGACGACCTCGACCGGTTGCTGAGCTTGCAGCACGAGGTGGTGATTCTGCCGTCAGCGGAGATTGCGCCGCCGCACAAGGAGGAGGAGAAGAGGAGGAGGAGAGAGGAAGGAAGGAGGGAGAGGGAGAGGGAAAGCGAAAGGGAAAGGGAAAGGGAAAGGGAAACGGAAGAAGAGTGGACGAGGCGATTAGGAGCGGTTTGA

mRNA sequence

ATGGGGAAGAAAGAAGCCCTTGTGATGCTGTTGATTATTGCTGTTCTGGGAAATGCGATTGGGATTAAGGAGGAAGCAGAAGCAGAAGAAGAAGAAGAATGGTGGAGAGAGAGAGAGGAAGAGAGGGAGTTTAGAAGTAAAGAACGGTTTCTGCTAGAGGATTCGAAGCGGGTGATAGAGACAGAAGCAGGAGAAATGAGGGTGATTAGAAGTCCGGCTTCAAGAATTCTTGACAGGCCCATGCATATCGGCTTCATCACCATGGAACCTAAGTCCCTGTTCGTCCCTCAGTATTTGGACTCCAGCTTGATTCTCTTTGTCCGCAGAGGGGAAGTGAAAGTGGGATTGATTTACAAAGACGAGTTAGCAGAGAGGAGGATGAAGGGCGGAGATGTGTACCGGATCCCAGCCGGTTCAGTATTTTACATGGTGAACGTAGGGGAAGGACAGAGACTTCAAATTATCTGCAGCATTGACAAATCTGAGAGCTTGAGTTATGGTACTTTCCAGTCCTTCTTCATCGGCGGAGGAACCTATCCGGTATCAGTTCTAGCAGGTTTCGACCAGGATACATTGGCTACAGCATTTAATGTTTCCTATACAGAATTGAGGAGGATTCTATCTAGGCAACGACAGGGTCCAATCGTTTACATCTCGGATACTGAATCGCCGGGGGTTTGGAGTAAGTTTCTACAGGTGAAAGATGGAGACAAAGGAAATAAAATAGCTACCATCAATGAAGATGGCGAAGAAGCAGAGAAGAATAAGACATGGTCATGGAGGAATCTCATGAGTTCGATCTTTGGAAACGAAAATCGGGACAAAACTAAAAGAACAAGAACAGGAAAATCCCCCGATTCGTACAACCTCTACGACAAAACCCCAGACTTCAGCAACGCCTACGGATGGAGCGTTGCCCTAGACGAGCACGAGTATTCTCCTCTAGGTCACTCCGGAATCGGAGTTTATCTCGTCAATCTCACAGCGGGATCCATGATGGCGCCACACATAAATCCAACCGCGGCAGAGTACGGCATCGTCCTCAGAGGCACAGGCACAATACAAATCGTATATCCAAACGGAACCTCCGCCATGGACACAGAAGTAACCGAAGGGGACGTATTCTGGGTTCCAAGATACTTCCCATTCTGCCAAATCGCATCGAGAACAGGTCCATTCGAGTTCTTCGGATTCACCACGTCATCGCGCAGGAATCGACCGCAGTTCTTGGCCGGTGCGAATTCGGTTTTCCACACTCTAAGAAGCCCGGCAGTGGCATCAGCATTCGACATAACAGAGGACGACCTCGACCGGTTGCTGAGCTTGCAGCACGAGGTGGTGATTCTGCCGTCAGCGGAGATTGCGCCGCCGCACAAGGAGGAGGAGAAGAGGAGGAGGAGAGAGGAAGGAAGGAGGGAGAGGGAGAGGGAAAGCGAAAGGGAAAGGGAAAGGGAAAGGGAAACGGAAGAAGAGTGGACGAGGCGATTAGGAGCGGTTTGA

Coding sequence (CDS)

ATGGGGAAGAAAGAAGCCCTTGTGATGCTGTTGATTATTGCTGTTCTGGGAAATGCGATTGGGATTAAGGAGGAAGCAGAAGCAGAAGAAGAAGAAGAATGGTGGAGAGAGAGAGAGGAAGAGAGGGAGTTTAGAAGTAAAGAACGGTTTCTGCTAGAGGATTCGAAGCGGGTGATAGAGACAGAAGCAGGAGAAATGAGGGTGATTAGAAGTCCGGCTTCAAGAATTCTTGACAGGCCCATGCATATCGGCTTCATCACCATGGAACCTAAGTCCCTGTTCGTCCCTCAGTATTTGGACTCCAGCTTGATTCTCTTTGTCCGCAGAGGGGAAGTGAAAGTGGGATTGATTTACAAAGACGAGTTAGCAGAGAGGAGGATGAAGGGCGGAGATGTGTACCGGATCCCAGCCGGTTCAGTATTTTACATGGTGAACGTAGGGGAAGGACAGAGACTTCAAATTATCTGCAGCATTGACAAATCTGAGAGCTTGAGTTATGGTACTTTCCAGTCCTTCTTCATCGGCGGAGGAACCTATCCGGTATCAGTTCTAGCAGGTTTCGACCAGGATACATTGGCTACAGCATTTAATGTTTCCTATACAGAATTGAGGAGGATTCTATCTAGGCAACGACAGGGTCCAATCGTTTACATCTCGGATACTGAATCGCCGGGGGTTTGGAGTAAGTTTCTACAGGTGAAAGATGGAGACAAAGGAAATAAAATAGCTACCATCAATGAAGATGGCGAAGAAGCAGAGAAGAATAAGACATGGTCATGGAGGAATCTCATGAGTTCGATCTTTGGAAACGAAAATCGGGACAAAACTAAAAGAACAAGAACAGGAAAATCCCCCGATTCGTACAACCTCTACGACAAAACCCCAGACTTCAGCAACGCCTACGGATGGAGCGTTGCCCTAGACGAGCACGAGTATTCTCCTCTAGGTCACTCCGGAATCGGAGTTTATCTCGTCAATCTCACAGCGGGATCCATGATGGCGCCACACATAAATCCAACCGCGGCAGAGTACGGCATCGTCCTCAGAGGCACAGGCACAATACAAATCGTATATCCAAACGGAACCTCCGCCATGGACACAGAAGTAACCGAAGGGGACGTATTCTGGGTTCCAAGATACTTCCCATTCTGCCAAATCGCATCGAGAACAGGTCCATTCGAGTTCTTCGGATTCACCACGTCATCGCGCAGGAATCGACCGCAGTTCTTGGCCGGTGCGAATTCGGTTTTCCACACTCTAAGAAGCCCGGCAGTGGCATCAGCATTCGACATAACAGAGGACGACCTCGACCGGTTGCTGAGCTTGCAGCACGAGGTGGTGATTCTGCCGTCAGCGGAGATTGCGCCGCCGCACAAGGAGGAGGAGAAGAGGAGGAGGAGAGAGGAAGGAAGGAGGGAGAGGGAGAGGGAAAGCGAAAGGGAAAGGGAAAGGGAAAGGGAAACGGAAGAAGAGTGGACGAGGCGATTAGGAGCGGTTTGA

Protein sequence

MGKKEALVMLLIIAVLGNAIGIKEEAEAEEEEEWWREREEEREFRSKERFLLEDSKRVIETEAGEMRVIRSPASRILDRPMHIGFITMEPKSLFVPQYLDSSLILFVRRGEVKVGLIYKDELAERRMKGGDVYRIPAGSVFYMVNVGEGQRLQIICSIDKSESLSYGTFQSFFIGGGTYPVSVLAGFDQDTLATAFNVSYTELRRILSRQRQGPIVYISDTESPGVWSKFLQVKDGDKGNKIATINEDGEEAEKNKTWSWRNLMSSIFGNENRDKTKRTRTGKSPDSYNLYDKTPDFSNAYGWSVALDEHEYSPLGHSGIGVYLVNLTAGSMMAPHINPTAAEYGIVLRGTGTIQIVYPNGTSAMDTEVTEGDVFWVPRYFPFCQIASRTGPFEFFGFTTSSRRNRPQFLAGANSVFHTLRSPAVASAFDITEDDLDRLLSLQHEVVILPSAEIAPPHKEEEKRRRREEGRRERERESERERERERETEEEWTRRLGAV
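
The relationship between the coding sequence and the protein sequence above can be checked by translating codons. The sketch below assumes the standard nuclear genetic code; the function name `translate` is illustrative, and only the first and last few codons of the CDS are used as input.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

print(translate("ATGGGGAAGAAAGAAGCC"))  # first six codons of the CDS -> MGKKEA
print(translate("GGAGCGGTTTGA"))        # last four codons of the CDS -> GAV*
```

The two fragments reproduce the start (MGKKEA) and the end (GAV, followed by the stop codon) of the listed protein sequence.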
BLAST of Cp4.1LG16g05580 vs. Swiss-Prot
Match: VCL22_ARATH (Vicilin-like seed storage protein At2g28490 OS=Arabidopsis thaliana GN=At2g28490 PE=2 SV=1)

HSP 1 Score: 482.3 bits (1240), Expect = 6.6e-135
Identity = 235/422 (55.69%), Positives = 314/422 (74.41%), Query Frame = 1

Query: 50  FLLEDSKRVIETEAGEMRVIRSPASRILDRPMHIGFITMEPKSLFVPQYLDSSLILFVRR 109
           F++ +S++VI++E GEMRV+ SP  RI+++PMHIGF+TMEPK+LFVPQYLDSSL++F+R+
Sbjct: 86  FMMRESRQVIKSEGGEMRVVLSPRGRIIEKPMHIGFLTMEPKTLFVPQYLDSSLLIFIRQ 145

Query: 110 GEVKVGLIYKDELAERRMKGGDVYRIPAGSVFYMVNVGEGQRLQIICSIDKSESLSYGTF 169
           GE  +G+I KDE  ER++K GD+Y IPAGSVFY+ N G GQRL +ICSID ++SL + TF
Sbjct: 146 GEATLGVICKDEFGERKLKAGDIYWIPAGSVFYLHNTGLGQRLHVICSIDPTQSLGFETF 205

Query: 170 QSFFIGGGTYPVSVLAGFDQDTLATAFNVSYTELRRILSRQRQGPIVYISDTESPG---- 229
           Q F+IGGG  P SVLAGFD  TL +AFNVS  EL++++  Q +GPIVY+++   P     
Sbjct: 206 QPFYIGGG--PSSVLAGFDPHTLTSAFNVSLPELQQMMMSQFRGPIVYVTEGPQPQPQST 265

Query: 230 VWSKFLQVKDGDKGNKIATINE----DGEEAEKNKTWSWRNLMSSIFGNENRDKTKRTRT 289
           VW++FL ++  +K  ++  + E      ++ + +  WSWRN++ SI  +   +K K + +
Sbjct: 266 VWTQFLGLRGEEKHKQLKKLLETKQGSPQDQQYSSGWSWRNIVRSIL-DLTEEKNKGSGS 325

Query: 290 GKSPDSYNLYDKT--PDFSNAYGWSVALDEHEYSPLGHSGIGVYLVNLTAGSMMAPHINP 349
            +  DSYN+YDK   P F N YGWS+ALD  +Y PL HSGIGVYLVNLTAG+MMAPH+NP
Sbjct: 326 SECEDSYNIYDKKDKPSFDNKYGWSIALDYDDYKPLKHSGIGVYLVNLTAGAMMAPHMNP 385

Query: 350 TAAEYGIVLRGTGTIQIVYPNGTSAMDTEVTEGDVFWVPRYFPFCQIASRTGPFEFFGFT 409
           TA EYGIVL G+G IQ+V+PNGTSAM+T V+ GDVFW+PRYF FCQIASRTGPFEF GFT
Sbjct: 386 TATEYGIVLAGSGEIQVVFPNGTSAMNTRVSVGDVFWIPRYFAFCQIASRTGPFEFVGFT 445

Query: 410 TSSRRNRPQFLAGANSVFHTLRSPAVASAFDITEDDLDRLLSLQHEVVILPSAEIAPPHK 462
           TS+ +NRPQFL G+NS+  TL   +++ AF + E+ + R +  Q E VILP+   APPH 
Sbjct: 446 TSAHKNRPQFLVGSNSLLRTLNLTSLSIAFGVDEETMRRFIEAQREAVILPTPAAAPPHV 504
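
The percentages on each HSP summary line are the identity and positive counts divided by the alignment length. A small Python sketch that recomputes them from such a line follows; the helper name `hsp_percentages` is hypothetical, and the pattern tolerates both spellings of "Positives" seen in reports of this kind.

```python
import re

# Matches e.g. "Identity = 235/422 (55.69%), Positives = 314/422 (74.41%)"
HSP_STATS = re.compile(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)")

def hsp_percentages(line):
    """Return (percent identity, percent positives), rounded to 2 decimals."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, ident_len, positive, pos_len = map(int, m.groups())
    return (
        round(100 * identical / ident_len, 2),
        round(100 * positive / pos_len, 2),
    )

line = "Identity = 235/422 (55.69%), Positives = 314/422 (74.41%), Query Frame = 1"
print(hsp_percentages(line))
```

For the VCL22_ARATH hit this yields 55.69 and 74.41, in agreement with the values reported in the summary line.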

BLAST of Cp4.1LG16g05580 vs. Swiss-Prot
Match: VCL21_ARATH (Vicilin-like seed storage protein At2g18540 OS=Arabidopsis thaliana GN=At2g18540 PE=3 SV=1)

HSP 1 Score: 173.3 bits (438), Expect = 6.6e-42
Identity = 143/457 (31.29%), Positives = 219/457 (47.92%), Query Frame = 1

Query: 58  VIETEAGEMRVIRSPASRILDRPMHIGFITMEPKSLFVPQYLDSSLILFVRRGEVKVGLI 117
           V+ TE G +  ++      +    HI FIT+EP +L +P  L S ++ FV  G   +  I
Sbjct: 53  VVATEFGNISAVQ------IGDGYHIQFITLEPNALLLPLLLHSDMVFFVHTGTGILNWI 112

Query: 118 YKDELAERRMKGGDVYRIPAGSVFYMVNVGEGQRLQIICSIDKSESLSYGTFQSFFIGGG 177
            ++   +  ++ GDV+R+ +G+VFY V+  E  R+  I ++ K             +G  
Sbjct: 113 DEESERKLELRRGDVFRLRSGTVFY-VHSNEKLRVYAIFNVGKC-------LNDPCLGAY 172

Query: 178 TYPVSVLAGFDQDTLATAFNVSYTELRRILSRQRQGPIVYISDTESPGVWSKFLQVKDGD 237
           +    +L GFD  TL +AF V    LR+I    +   IV                     
Sbjct: 173 SSVRDLLLGFDDRTLRSAFAVPEDILRKIRDATKPPLIV--------------------- 232

Query: 238 KGNKIATINEDGEEAEKNKTWSWRNLMSSIFGNENRD----KTKRTRTGKSPDSYNLYDK 297
             N +      G E +K   W  R +   +   +  D    K       K   ++N++++
Sbjct: 233 --NALPRNRTQGLEEDK---WQSRLVRLFVSVEDVTDHLAMKPIVDTNKKKSRTFNVFEE 292

Query: 298 TPDFSNAYGWSVALDEHEYSPLGHSGIGVYLVNLTAGSMMAPHINPTAAEYGIVLRGTGT 357
            PDF N  G S+ +DE +   L  S  GV++VNLT GSM+ PH NP+A E  IVL G G 
Sbjct: 293 DPDFENNNGRSIVVDEKDLDALKGSRFGVFMVNLTKGSMIGPHWNPSACEISIVLEGEGM 352

Query: 358 IQIVYPNGTSAMDTE-------VTEGDVFWVPRYFPFCQIASRTGPFEFFGFTTSSRRNR 417
           +++V     S+   +       V EGDVF VP++ P  Q++     F F GF+TS++ N 
Sbjct: 353 VRVVNQQSLSSCKNDRKSESFMVEEGDVFVVPKFHPMAQMSFENSSFVFMGFSTSAKTNH 412

Query: 418 PQFLAGANSVFHTLRSPAVASAFDITEDDLDRLLSLQHEVVILPSA-----EIAPPHKEE 477
           PQFL G +SV   L    VA +F+++ + +  LL  Q E VI   A     E++   +E 
Sbjct: 413 PQFLVGQSSVLKVLDRDVVAVSFNLSNETIKGLLKAQKESVIFECASCAEGELSKLMREI 469

Query: 478 EKRRRREE---GRRERERESERERERERETEEEWTRR 496
           E+R+RREE    RR +E E  R+RE  +  EEE  +R
Sbjct: 473 EERKRREEEEIERRRKEEEEARKREEAKRREEEEAKR 469

BLAST of Cp4.1LG16g05580 vs. Swiss-Prot
Match: VCL43_ARATH (Vicilin-like seed storage protein At4g36700 OS=Arabidopsis thaliana GN=At4g36700 PE=3 SV=1)

HSP 1 Score: 159.8 bits (403), Expect = 7.5e-38
Identity = 149/517 (28.82%), Positives = 249/517 (48.16%), Query Frame = 1

Query: 8   VMLLIIAVLGNAIGIKEEAEAEEEEEWWREREEEREFRSKERFLLEDSKRVIETEAGEMR 67
           V+LL++  L      +  A++EE EE+         F S      +  K + ET+ G++ 
Sbjct: 11  VLLLVLLFLCT----ESLAKSEESEEYDVAVPSCCGFSSPLLIKKDQWKPIFETKFGQIS 70

Query: 68  VIRSPASRILDRPMHIGFITMEPKSLFVPQYLDSSLILFVRRGEVKVGLIYKDELAERRM 127
            ++         P  I  IT+EP ++ +P  L S ++ FV  G   +  +  +E     +
Sbjct: 71  TVQIGNGCGGMGPYKIHSITLEPNTILLPLLLHSDMVFFVDSGSGILNWV-DEEAKSTEI 130

Query: 128 KGGDVYRIPAGSVFYM----VNVGEGQRLQIICSIDKSESL----SYGTFQSFFIGGGTY 187
           + GDVYR+  GSVFY+    V++  G +L++      ++       +G + S        
Sbjct: 131 RLGDVYRLRPGSVFYLQSKPVDIFLGTKLKLYAIFSNNDECLHDPCFGAYSSI------- 190

Query: 188 PVSVLAGFDQDTLATAFNVSYTELRRILSRQRQGPIVYISDTE-SPGV---WS---KFLQ 247
              ++ GFD+  L +AF V    +   L R R  P + +S+T  +PGV   W    + L+
Sbjct: 191 -TDLMFGFDETILQSAFGVPEGIIE--LMRNRTKPPLIVSETLCTPGVANTWQLQPRLLK 250

Query: 248 VKDGDKGNKIATINEDGEEAEKNKTWSWRNLMSSIFGNENRDKTKRTRTGKSPDSYNLYD 307
           +  G      A + ++ ++ EK                E ++K K+ +T      +N+++
Sbjct: 251 LFAGS-----ADLVDNKKKKEKK---------------EKKEKVKKAKT------FNVFE 310

Query: 308 KTPDFSNAYGWSVALDEHEYSPLGHSGIGVYLVNLTAGSMMAPHINPTAAEYGIVLRGTG 367
             PDF + YG ++ ++  +   L  S +GV +VNLT GSMM PH NP A E  IVL+G G
Sbjct: 311 SEPDFESPYGRTITINRKDLKVLKGSMVGVSMVNLTQGSMMGPHWNPWACEISIVLKGAG 370

Query: 368 TIQIVYPNGTSAMDTE-------VTEGDVFWVPRYFPFCQIASRTGPFEFFGFTTSSRRN 427
            ++++  + +S   +E       V EGD+F VPR  P  Q++       F GFTTS++ N
Sbjct: 371 MVRVLRSSISSNTSSECKNVRFKVEEGDIFAVPRLHPMAQMSFNNDSLVFVGFTTSAKNN 430

Query: 428 RPQFLAGANSVFHTLRSPAVASAFDITEDDLDRLLSLQHEVVILPSAEIAPPH------- 487
            PQFLAG +S    L    +A++ +++   +D LL  Q E VIL     A          
Sbjct: 431 EPQFLAGEDSALRMLDRQVLAASLNVSSVTIDGLLGAQKEAVILECHSCAEGEIEKLKVE 486

Query: 488 ----KEEEKRRRREEGRRERERESERERERERETEEE 492
               K +++R+RR + R++ E E++RE E  R+ EEE
Sbjct: 491 IERKKIDDERKRRHDERKKEEEEAKREEEERRKREEE 486

BLAST of Cp4.1LG16g05580 vs. Swiss-Prot
Match: AMP22_MACIN (Vicilin-like antimicrobial peptides 2-2 OS=Macadamia integrifolia GN=AMP2-2 PE=2 SV=1)

HSP 1 Score: 115.9 bits (289), Expect = 1.2e-24
Identity = 110/472 (23.31%), Positives = 193/472 (40.89%), Query Frame = 1

Query: 21  GIKEEAEAEEEEE--WWREREEEREFRSKERFLLEDSKRVIETEAGEMRVIRSPASRILD 80
           G  EE E ++ +   ++ ER     FR++E  +      V+E   G  +++R+  +    
Sbjct: 238 GRYEEGEEKQSDNPYYFDERSLSTRFRTEEGHI-----SVLENFYGRSKLLRALKN---- 297

Query: 81  RPMHIGFITMEPKSLFVPQYLDSSLILFVRRGEVKVGLIYKDELAERRMKGGDVYRIPAG 140
               +  +   P +  +P +LD+  IL V  G   + +I++D      ++ GDV RIPAG
Sbjct: 298 --YRLVLLEANPNAFVLPTHLDADAILLVTGGRGALKMIHRDNRESYNLECGDVIRIPAG 357

Query: 141 SVFYMVNVGEGQRLQIICSIDKSESLSYGTFQSFFIGGGTYPVSVLAGFDQDTLATAFNV 200
           + FY++N    +RL I   +      + G ++ FF  GG  P   L+ F ++ L  A N 
Sbjct: 358 TTFYLINRDNNERLHIAKFLQTIS--TPGQYKEFFPAGGQNPEPYLSTFSKEILEAALNT 417

Query: 201 SYTELRRILSRQRQGPIVYISDTESPGVWSKFLQVKDGDKGNKIATINEDGEEAEKNKTW 260
               LR +L +QR+G I+  S  +                      I E   +  +++ W
Sbjct: 418 QAERLRGVLGQQREGVIISASQEQ----------------------IRELTRDDSESRRW 477

Query: 261 SWRNLMSSIFGNENRDKTKRTRTGKSPDSYNLYDKTPDFSNAYGWSVALDEHEYSPLGHS 320
             R    S                 S   YNL++K P +SN YG +  +   +Y  L   
Sbjct: 478 HIRRGGES-----------------SRGPYNLFNKRPLYSNKYGQAYEVKPEDYRQLQDM 537

Query: 321 GIGVYLVNLTAGSMMAPHINPTAAEYGIVLRGTGTIQIVYPN------GTSAMDTEVTEG 380
            + V++ N+T GSMM P  N  + +  +V  G   +++  P+      G         E 
Sbjct: 538 DVSVFIANITQGSMMGPFFNTRSTKVVVVASGEADVEMACPHLSGRHGGRRGGKRHEEEE 597

Query: 381 DVFW--------------VPRYFPFCQIASRTGPFEFFGFTTSSRRNRPQFLAG-ANSVF 440
           DV +              VP   P   ++S       F F  +++ N   FLAG   +V 
Sbjct: 598 DVHYEQVKARLSKREAIVVPVGHPVVFVSSGNENLLLFAFGINAQNNHENFLAGRERNVL 654

Query: 441 HTLRSPAVASAFDITEDDLDRLLSLQHEVVILPSAEIAPPHKEEEKRRRREE 470
             +   A+  AF     +++ L + Q E +  P       H+++  R  +++
Sbjct: 658 QQIEPQAMELAFAAPRKEVEELFNSQDESIFFPGPR---QHQQQSSRSTKQQ 654

BLAST of Cp4.1LG16g05580 vs. Swiss-Prot
Match: AMP23_MACIN (Vicilin-like antimicrobial peptides 2-3 (Fragment) OS=Macadamia integrifolia GN=AMP2-3 PE=1 SV=1)

HSP 1 Score: 111.3 bits (277), Expect = 3.1e-23
Identity = 107/472 (22.67%), Positives = 195/472 (41.31%), Query Frame = 1

Query: 21  GIKEEAEAEEEEE--WWREREEEREFRSKERFLLEDSKRVIETEAGEMRVIRSPASRILD 80
           G  EE E ++ +   ++ ER     FR++E  +      V+E   G  +++R+  +    
Sbjct: 197 GRYEEGEEKQSDNPYYFDERSLSTRFRTEEGHI-----SVLENFYGRSKLLRALKN---- 256

Query: 81  RPMHIGFITMEPKSLFVPQYLDSSLILFVRRGEVKVGLIYKDELAERRMKGGDVYRIPAG 140
               +  +   P +  +P +LD+  IL V  G   + +I++D      ++ GDV RIPAG
Sbjct: 257 --YRLVLLEANPNAFVLPTHLDADAILLVIGGRGALKMIHRDNRESYNLECGDVIRIPAG 316

Query: 141 SVFYMVNVGEGQRLQIICSIDKSESLSYGTFQSFFIGGGTYPVSVLAGFDQDTLATAFNV 200
           + FY++N    +RL I   +      + G ++ FF  GG  P   L+ F ++ L  A N 
Sbjct: 317 TTFYLINRDNNERLHIAKFLQTIS--TPGQYKEFFPAGGQNPEPYLSTFSKEILEAALNT 376

Query: 201 SYTELRRILSRQRQGPIVYISDTESPGVWSKFLQVKDGDKGNKIATINEDGEEAEKNKTW 260
               LR +L +QR+G I+  S  +                      I E   +  +++ W
Sbjct: 377 QTERLRGVLGQQREGVIIRASQEQ----------------------IRELTRDDSESRRW 436

Query: 261 SWRNLMSSIFGNENRDKTKRTRTGKSPDSYNLYDKTPDFSNAYGWSVALDEHEYSPLGHS 320
             R    S                 S   YNL++K P +SN YG +  +   +Y  L   
Sbjct: 437 HIRRGGES-----------------SRGPYNLFNKRPLYSNKYGQAYEVKPEDYRQLQDM 496

Query: 321 GIGVYLVNLTAGSMMAPHINPTAAEYGIVLRGTGTIQIVYPN---------GTSAMDTE- 380
            + V++ N+T GSMM P  N  + +  +V  G   +++  P+         G    + E 
Sbjct: 497 DVSVFIANITQGSMMGPFFNTRSTKVVVVASGEADVEMACPHLSGRHGGRGGGKRHEEEE 556

Query: 381 ----------VTEGDVFWVPRYFPFCQIASRTGPFEFFGFTTSSRRNRPQFLAG-ANSVF 440
                     +++ +   V    P   ++S       F F  +++ N   FLAG   +V 
Sbjct: 557 EVHYEQVRARLSKREAIVVLAGHPVVFVSSGNENLLLFAFGINAQNNHENFLAGRERNVL 613

Query: 441 HTLRSPAVASAFDITEDDLDRLLSLQHEVVILPSAEIAPPHKEEEKRRRREE 470
             +   A+  AF  +  +++ L + Q E +  P       H+++  R  +++
Sbjct: 617 QQIEPQAMELAFAASRKEVEELFNSQDESIFFPGPR---QHQQQSPRSTKQQ 613

BLAST of Cp4.1LG16g05580 vs. TrEMBL
Match: Q39651_9ROSI (PreproMP27-MP32 OS=Cucurbita cv. Kurokawa Amakuri PE=2 SV=1)

HSP 1 Score: 923.7 bits (2386), Expect = 9.5e-266
Identity = 478/498 (95.98%), Positives = 484/498 (97.19%), Query Frame = 1

Query: 1   MGKKEALVMLLIIAVLGNAIGIKEEAEAEEEEEWWREREEEREFRSKERFLLEDSKRVIE 60
           M KKEALVMLLIIAVLGNAIGIKEEAEA EEEEWWREREEEREFRSKE+FLLEDSKRVIE
Sbjct: 7   MWKKEALVMLLIIAVLGNAIGIKEEAEAAEEEEWWREREEEREFRSKEQFLLEDSKRVIE 66

Query: 61  TEAGEMRVIRSPASRILDRPMHIGFITMEPKSLFVPQYLDSSLILFVRRGEVKVGLIYKD 120
           TEAGEMRVIRSPASRILDRPMHIGFITMEPKSLFVPQYLDSSLILFVRRGEVKVGLIYKD
Sbjct: 67  TEAGEMRVIRSPASRILDRPMHIGFITMEPKSLFVPQYLDSSLILFVRRGEVKVGLIYKD 126

Query: 121 ELAERRMKGGDVYRIPAGSVFYMVNVGEGQRLQIICSIDKSESLSYGTFQSFFIGGGTYP 180
           ELAERRMKGGDVYRIPAGSVFYMVNVGEGQRLQIICSIDKSESLSYGTFQSFFIGGGTYP
Sbjct: 127 ELAERRMKGGDVYRIPAGSVFYMVNVGEGQRLQIICSIDKSESLSYGTFQSFFIGGGTYP 186

Query: 181 VSVLAGFDQDTLATAFNVSYTELRRILSRQRQGPIVYISDTESPGVWSKFLQVKDGDKGN 240
           VSVLAGFDQDTLATAFNVSYTELRRILSRQRQGPIVY+SDTESPGVWSKFLQVKDGDKGN
Sbjct: 187 VSVLAGFDQDTLATAFNVSYTELRRILSRQRQGPIVYVSDTESPGVWSKFLQVKDGDKGN 246

Query: 241 KIATINEDGEEAEKNKTWSWRNLMSSIFGNENRDKTKRTRTGKSPDSYNLYDKTPDFSNA 300
           KIA INEDGEEAEKNK WSWRNL+S IFGNENRDKTKRTRTGKSPDSYNLYDKTPDFSNA
Sbjct: 247 KIANINEDGEEAEKNKPWSWRNLVSLIFGNENRDKTKRTRTGKSPDSYNLYDKTPDFSNA 306

Query: 301 YGWSVALDEHEYSPLGHSGIGVYLVNLTAGSMMAPHINPTAAEYGIVLRGTGTIQIVYPN 360
           YGWSVALDEHEYSPLGHSGIGVYLVNLTAGSMMAPHINPTAAEYGIVLRGTGTIQIVYPN
Sbjct: 307 YGWSVALDEHEYSPLGHSGIGVYLVNLTAGSMMAPHINPTAAEYGIVLRGTGTIQIVYPN 366

Query: 361 GTSAMDTEVTEGDVFWVPRYFPFCQIASRTGPFEFFGFTTSSRRNRPQFLAGANSVFHTL 420
           GTSAMDTEVTEGDVFWVPRYFPFCQIASRTGPFEFFGFTTSSRRNRPQFLA ANS+FHTL
Sbjct: 367 GTSAMDTEVTEGDVFWVPRYFPFCQIASRTGPFEFFGFTTSSRRNRPQFLACANSIFHTL 426

Query: 421 RSPAVASAFDITEDDLDRLLSLQHEVVILPSAEIAPPHKEEEKRRRREEGRRERERESER 480
           RSPAVA+AFDITEDDLDRLLS Q+EVVILPSAEIAPPHKEEEKRRRREEGRRERERESER
Sbjct: 427 RSPAVATAFDITEDDLDRLLSAQYEVVILPSAEIAPPHKEEEKRRRREEGRRERERESER 486

Query: 481 ERERERETEEEWTRRLGA 499
           ER      EEEWTRRL A
Sbjct: 487 ER------EEEWTRRLEA 498

BLAST of Cp4.1LG16g05580 vs. TrEMBL
Match: A0A0A0KHQ1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G290870 PE=4 SV=1)

HSP 1 Score: 810.8 bits (2093), Expect = 9.0e-232
Identity = 421/500 (84.20%), Positives = 458/500 (91.60%), Query Frame = 1

Query: 1   MGKKEALVMLLIIAVLGNAIGIKEEAEAEEEEEWWREREEEREFRSKERFLLEDSKRVIE 60
           MGKKEAL++LLI+AVLGNAIGIKEE    EEEEWWREREEE+ F SKERFL+ DSK+VIE
Sbjct: 1   MGKKEALLILLIVAVLGNAIGIKEE----EEEEWWREREEEK-FGSKERFLMVDSKKVIE 60

Query: 61  TEAGEMRVIRSPASRILDRPMHIGFITMEPKSLFVPQYLDSSLILFVRRGEVKVGLIYKD 120
           TEAGEMRV+R P SRILD+ MHIGFITMEPKSLFVPQYLDS+LILFVRRG+VKVGLIYKD
Sbjct: 61  TEAGEMRVMRGPISRILDKAMHIGFITMEPKSLFVPQYLDSTLILFVRRGDVKVGLIYKD 120

Query: 121 ELAERRMKGGDVYRIPAGSVFYMVNVGEGQRLQIICSIDKSESLSYGTFQSFFIGGGTYP 180
           ELAERRMKGGDV+RIPAGSVFYMVNVGEGQRL+IICSIDKSESLSYGTFQSFF+ GG YP
Sbjct: 121 ELAERRMKGGDVFRIPAGSVFYMVNVGEGQRLEIICSIDKSESLSYGTFQSFFVAGGKYP 180

Query: 181 VSVLAGFDQDTLATAFNVSYTELRRILSRQRQGPIVYISDTESPGVWSKFLQVKDGDKGN 240
            SVLAGFDQDTLATAFNVSYTELRRILSRQRQGPIVYISDTESP VWSKFLQVKD  + +
Sbjct: 181 GSVLAGFDQDTLATAFNVSYTELRRILSRQRQGPIVYISDTESPRVWSKFLQVKDKARLS 240

Query: 241 KIATINEDGEEAEKNKTWSWRNLMSSIFGNENRDKTKR-TRTGKSPDSYNLYDKTPDFSN 300
           K+A  NEDGEE+EKNK WSWR LM+SIF NENRDK+K+ TRTGKSPDSYNLYDKTPDFSN
Sbjct: 241 KVADNNEDGEESEKNKRWSWRKLMNSIFRNENRDKSKKITRTGKSPDSYNLYDKTPDFSN 300

Query: 301 AYGWSVALDEHEYSPLGHSGIGVYLVNLTAGSMMAPHINPTAAEYGIVLRGTGTIQIVYP 360
           AYGWSVALDE EY PLGHSGIGVYLVNLTAGSMMAPH+NPTAAEYGIVLRGTGTIQIVYP
Sbjct: 301 AYGWSVALDETEYHPLGHSGIGVYLVNLTAGSMMAPHVNPTAAEYGIVLRGTGTIQIVYP 360

Query: 361 NGTSAMDTEVTEGDVFWVPRYFPFCQIASRTGPFEFFGFTTSSRRNRPQFLAGANSVFHT 420
           NGTSAM+ EVTEGDVFW+PRYFPFCQIASRTGPFEFFGFTTSSR+NRPQFLAGA+S+FHT
Sbjct: 361 NGTSAMNAEVTEGDVFWIPRYFPFCQIASRTGPFEFFGFTTSSRKNRPQFLAGASSIFHT 420

Query: 421 LRSPAVASAFDITEDDLDRLLSLQHEVVILPSAEIAPPHKEEEKRRRREEGRRERERESE 480
           LR+  +A+AFDITEDD++RLL  Q+E +ILPSAEIAPPHKEEEK+RR+EE RRE E E+E
Sbjct: 421 LRNMEMATAFDITEDDMERLLGAQYEAIILPSAEIAPPHKEEEKKRRKEEERREAETETE 480

Query: 481 RERERERETEEEWTRRLGAV 500
            E ERERE E E  RR+  V
Sbjct: 481 TEWERERERERERERRVDEV 495

BLAST of Cp4.1LG16g05580 vs. TrEMBL
Match: B9HNV4_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0009s05950g PE=4 SV=2)

HSP 1 Score: 585.9 bits (1509), Expect = 4.7e-164
Identity = 291/468 (62.18%), Positives = 368/468 (78.63%), Query Frame = 1

Query: 1   MGKKEALVMLLIIAVLG--NAIGIKEEAEAEEEEEWWREREEEREFRSKERFLLEDSKRV 60
           MG   A ++LL++   G   A+G+       E+E+W  +R+E +  R +E  LL+DSKRV
Sbjct: 1   MGNGAAHLLLLLVLCYGVQMAVGLYRG----EKEDWRGDRDETQIDREEEWLLLQDSKRV 60

Query: 61  IETEAGEMRVIRSPASRILDRPMHIGFITMEPKSLFVPQYLDSSLILFVRRGEVKVGLIY 120
           ++T+AGEM V+R+   RI+DRPMHIGFITMEP++LFVPQY+DSSLILF+R GE KVGLIY
Sbjct: 61  VKTDAGEMMVLRNYGGRIIDRPMHIGFITMEPRTLFVPQYIDSSLILFIRTGEAKVGLIY 120

Query: 121 KDELAERRMKGGDVYRIPAGSVFYMVNVGEGQRLQIICSIDKSESLSYGTFQSFFIGGGT 180
           KDELAERR+K GD+YRIPAGS FY++N  EGQRL IICSID SESL  G FQSF+IGGGT
Sbjct: 121 KDELAERRLKIGDIYRIPAGSAFYLMNAEEGQRLHIICSIDPSESLGLGFFQSFYIGGGT 180

Query: 181 YPVSVLAGFDQDTLATAFNVSYTELRRILSRQRQGPIVYISDTESP--GVWSKFLQVKDG 240
           YP S+LAGF+ +TL+ AFNV+  E+R I++RQ++GPIV+I D+ +P   +W+KFLQ+K+ 
Sbjct: 181 YPPSILAGFELETLSAAFNVTADEVREIMTRQQEGPIVFIGDSRAPRPSLWTKFLQLKEQ 240

Query: 241 DKGN---KIATINEDGEEAEKNKTWSWRNLMSSIFGNENRDKTKRTRTGKSPDSYNLYDK 300
           D+     ++    +   + E+ +TWSWR L++SIFG EN  K K  + GKSPDSYN+YD+
Sbjct: 241 DRLQHLKRMVKFQQQPSQGEEQRTWSWRKLLNSIFGQEN--KKKGEKVGKSPDSYNIYDR 300

Query: 301 TPDFSNAYGWSVALDEHEYSPLGHSGIGVYLVNLTAGSMMAPHINPTAAEYGIVLRGTGT 360
            PDF N YGWS+ALDE +Y PL +SGIGVYLVNLTAGSM+APH+NPTA EYGIVLRG+G 
Sbjct: 301 RPDFRNNYGWSIALDESDYQPLKYSGIGVYLVNLTAGSMLAPHVNPTATEYGIVLRGSGR 360

Query: 361 IQIVYPNGTSAMDTEVTEGDVFWVPRYFPFCQIASRTGPFEFFGFTTSSRRNRPQFLAGA 420
           IQIV+PNGT AMD  V EGDVFWVPRYFPFCQIA+R+GPFEFFGFTTS+R NRPQFL GA
Sbjct: 361 IQIVFPNGTQAMDATVKEGDVFWVPRYFPFCQIAARSGPFEFFGFTTSARENRPQFLVGA 420

Query: 421 NSVFHTLRSPAVASAFDITEDDLDRLLSLQHEVVILPSAEIAPPHKEE 462
           NS+  TLRSP +A+AF ++ED ++R++  Q E VILPSA  APP +EE
Sbjct: 421 NSILQTLRSPELAAAFGVSEDRINRVIKAQREAVILPSASAAPPDEEE 462

BLAST of Cp4.1LG16g05580 vs. TrEMBL
Match: A1E0W1_FICPW (7S globulin OS=Ficus pumila var. awkeotsang PE=2 SV=1)

HSP 1 Score: 565.1 bits (1455), Expect = 8.6e-158
Identity = 290/473 (61.31%), Positives = 356/473 (75.26%), Query Frame = 1

Query: 7   LVMLLIIAVLGNAIGIKEEAEAEEEEEWWREREEEREFRSKER--------FLLEDSKRV 66
           L ++L   +LG A+G  EE   EEEE+W RERE ER+   K R        FLL+DSK V
Sbjct: 17  LGLVLCHGLLGMAVGYDEE---EEEEDWRRERERERKREEKRREREEEFEPFLLKDSKHV 76

Query: 67  IETEAGEMRVIRSPASRILDRPMHIGFITMEPKSLFVPQYLDSSLILFVRRGEVKVGLIY 126
           + T+AGEM+V++    R +  PM IGFITMEPK+LFVPQYLDS  ILF+RRGE KVG IY
Sbjct: 77  VRTDAGEMKVVKRIGGRFIQGPMRIGFITMEPKTLFVPQYLDSDFILFIRRGEAKVGFIY 136

Query: 127 KDELAERRMKGGDVYRIPAGSVFYMVNVGEGQRLQIICSIDKSESLSYGTFQSFFIGGGT 186
           KD+LAERR+K GDVYRIPAGSVFY+VN GEGQRL +ICSID SESL +G+FQSFF+GGGT
Sbjct: 137 KDQLAERRLKIGDVYRIPAGSVFYLVNTGEGQRLHVICSIDTSESLRFGSFQSFFVGGGT 196

Query: 187 YPVSVLAGFDQDTLATAFNVSYTELRRILSRQRQGPIVYISDTESPGVWSKFLQVKDGDK 246
            PVS+++GFD + L  AFNV++ ELR  LS Q++GPIVYISD+ SP +WSKFL++K+ +K
Sbjct: 197 NPVSIISGFDSEILENAFNVTHAELREFLSSQQEGPIVYISDSRSPRLWSKFLELKESEK 256

Query: 247 GNKIATINEDGEEAEKNK-------TWSWRNLMSS-IFGN-ENRDKTKRTRTGKSPDSYN 306
            + +  I +  EE++  K        WSWR ++ S +F N E R +  +TR GKSP+SYN
Sbjct: 257 LDHLKKIVDSEEESDDEKLEEQGQEVWSWRKMLGSLLFANKEKRPEDVKTR-GKSPNSYN 316

Query: 307 LYDKTPDFSNAYGWSVALDEHEYSPLGHSGIGVYLVNLTAGSMMAPHINPTAAEYGIVLR 366
           LYD  PDF N YGWS+A+D   YSPL  +G GVYLVNLTAGSMMAPHINP A E+GIVLR
Sbjct: 317 LYDGKPDFKNKYGWSIAVDASSYSPLRKTGFGVYLVNLTAGSMMAPHINPRATEFGIVLR 376

Query: 367 GTGTIQIVYPNGTSAMDTEVTEGDVFWVPRYFPFCQIASRTGPFEFFGFTTSSRRNRPQF 426
           GTG +QIVYPNG+ AM+T+V EGDVFWVPRYFPFCQIASR+GP EFFGFTTS+R+NRPQF
Sbjct: 377 GTGNVQIVYPNGSLAMNTDVREGDVFWVPRYFPFCQIASRSGPMEFFGFTTSARKNRPQF 436

Query: 427 LAGANSVFHTLRSPAVASAFDITEDDLDRLLSLQHEVVILPSAEIAPPHKEEE 463
           L G+NSV  ++R P +A+AF +TE+ L  +   Q E VILPS   APP K EE
Sbjct: 437 LVGSNSVLRSMRGPELAAAFGLTEERLRNITDAQREAVILPSPMAAPPVKVEE 485

BLAST of Cp4.1LG16g05580 vs. TrEMBL
Match: M5W7Z8_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa020107mg PE=4 SV=1)

HSP 1 Score: 563.5 bits (1451), Expect = 2.5e-157
Identity = 287/502 (57.17%), Positives = 375/502 (74.70%), Query Frame = 1

Query: 9   MLLIIAVLGNAIGIKEEAEAEEEEEWWREREEEREFR-----------------SKERFL 68
           +LL++   G ++ +    + +E+   WR ++EERE R                  ++ +L
Sbjct: 10  VLLLVMWYGVSVAVGYTGDPDED---WRRKKEEREGRPGSGDMRRPEYEKPESEKEDWYL 69

Query: 69  LEDSKRVIETEAGEMRVIRSPASRILDRPMHIGFITMEPKSLFVPQYLDSSLILFVRRGE 128
           L  S++V++TEAGEM V+     R++D+PMHIGFITMEPKSLF+PQYLDS+L+LFVRRGE
Sbjct: 70  LPQSRQVVKTEAGEMSVVMRVGGRVVDKPMHIGFITMEPKSLFIPQYLDSNLVLFVRRGE 129

Query: 129 VKVGLIYKDELAERRMKGGDVYRIPAGSVFYMVNVGEGQRLQIICSIDKSESLSYGTFQS 188
            KVGLIY+DEL ERR+K GDVYRIPAGS FY+VN GEGQRL IICS+D SESL  G+ QS
Sbjct: 130 AKVGLIYRDELGERRLKSGDVYRIPAGSPFYLVNTGEGQRLHIICSLDTSESLGLGSVQS 189

Query: 189 FFIGGGTYPVSVLAGFDQDTLATAFNVSYTELRRILSRQRQGPIVYISDTESPGVWSKFL 248
           FFIGGG+ P SVLAGFD D L  AFNVS +EL  +L+ Q++GPIVY+SD+ SP +W+KFL
Sbjct: 190 FFIGGGSNPQSVLAGFDHDILTNAFNVSSSELMEVLTSQQKGPIVYLSDSHSPNLWAKFL 249

Query: 249 QVKDGDKGNKIATI----NEDGEEAEKNKTWSWRNLMSSIFGNENRDKTKRTRT-----G 308
           Q+K+ D+  ++  +     E     ++ +TWSWR L++S+FG  + D  KR        G
Sbjct: 250 QLKEQDRLQEMKKMVDFQQEPDHHQDQTQTWSWRKLLNSVFGAGSDDNKKRAEDYDKGKG 309

Query: 309 KSPDSYNLYDKTPDFSNAYGWSVALDEHEYSPLGHSGIGVYLVNLTAGSMMAPHINPTAA 368
           K+PDSYNLYD+ PDF N YGWS+ LDE +Y+PL  SG+GVYLVNLTAG+MMAPH+NPTA 
Sbjct: 310 KAPDSYNLYDRKPDFRNNYGWSMELDESDYAPLKDSGVGVYLVNLTAGAMMAPHVNPTAT 369

Query: 369 EYGIVLRGTGTIQIVYPNGTSAMDTEVTEGDVFWVPRYFPFCQIASRTGPFEFFGFTTSS 428
           EYGIVLRG+GTIQIV+PNGTSAM+T V +GDVFWVPRYFPFCQIASR+GP EFFGFTTS+
Sbjct: 370 EYGIVLRGSGTIQIVFPNGTSAMNTNVQDGDVFWVPRYFPFCQIASRSGPLEFFGFTTSA 429

Query: 429 RRNRPQFLAGANSVFHTLRSPAVASAFDITEDDLDRLLSLQHEVVILPSAEIAPPHKEEE 485
           R+NRPQFLAGA+SV  T+R P +A+AF ++ED L + +  Q E VILPSA+ APP+KE+ 
Sbjct: 430 RKNRPQFLAGASSVLQTIRGPELAAAFGVSEDRLRKFIDAQREAVILPSAQAAPPYKEDR 489

BLAST of Cp4.1LG16g05580 vs. TAIR10
Match: AT2G28490.1 (AT2G28490.1 RmlC-like cupins superfamily protein)

HSP 1 Score: 482.3 bits (1240), Expect = 3.7e-136
Identity = 235/422 (55.69%), Positives = 314/422 (74.41%), Query Frame = 1

Query: 50  FLLEDSKRVIETEAGEMRVIRSPASRILDRPMHIGFITMEPKSLFVPQYLDSSLILFVRR 109
           F++ +S++VI++E GEMRV+ SP  RI+++PMHIGF+TMEPK+LFVPQYLDSSL++F+R+
Sbjct: 86  FMMRESRQVIKSEGGEMRVVLSPRGRIIEKPMHIGFLTMEPKTLFVPQYLDSSLLIFIRQ 145

Query: 110 GEVKVGLIYKDELAERRMKGGDVYRIPAGSVFYMVNVGEGQRLQIICSIDKSESLSYGTF 169
           GE  +G+I KDE  ER++K GD+Y IPAGSVFY+ N G GQRL +ICSID ++SL + TF
Sbjct: 146 GEATLGVICKDEFGERKLKAGDIYWIPAGSVFYLHNTGLGQRLHVICSIDPTQSLGFETF 205

Query: 170 QSFFIGGGTYPVSVLAGFDQDTLATAFNVSYTELRRILSRQRQGPIVYISDTESPG---- 229
           Q F+IGGG  P SVLAGFD  TL +AFNVS  EL++++  Q +GPIVY+++   P     
Sbjct: 206 QPFYIGGG--PSSVLAGFDPHTLTSAFNVSLPELQQMMMSQFRGPIVYVTEGPQPQPQST 265

Query: 230 VWSKFLQVKDGDKGNKIATINE----DGEEAEKNKTWSWRNLMSSIFGNENRDKTKRTRT 289
           VW++FL ++  +K  ++  + E      ++ + +  WSWRN++ SI  +   +K K + +
Sbjct: 266 VWTQFLGLRGEEKHKQLKKLLETKQGSPQDQQYSSGWSWRNIVRSIL-DLTEEKNKGSGS 325

Query: 290 GKSPDSYNLYDKT--PDFSNAYGWSVALDEHEYSPLGHSGIGVYLVNLTAGSMMAPHINP 349
            +  DSYN+YDK   P F N YGWS+ALD  +Y PL HSGIGVYLVNLTAG+MMAPH+NP
Sbjct: 326 SECEDSYNIYDKKDKPSFDNKYGWSIALDYDDYKPLKHSGIGVYLVNLTAGAMMAPHMNP 385

Query: 350 TAAEYGIVLRGTGTIQIVYPNGTSAMDTEVTEGDVFWVPRYFPFCQIASRTGPFEFFGFT 409
           TA EYGIVL G+G IQ+V+PNGTSAM+T V+ GDVFW+PRYF FCQIASRTGPFEF GFT
Sbjct: 386 TATEYGIVLAGSGEIQVVFPNGTSAMNTRVSVGDVFWIPRYFAFCQIASRTGPFEFVGFT 445

Query: 410 TSSRRNRPQFLAGANSVFHTLRSPAVASAFDITEDDLDRLLSLQHEVVILPSAEIAPPHK 462
           TS+ +NRPQFL G+NS+  TL   +++ AF + E+ + R +  Q E VILP+   APPH 
Sbjct: 446 TSAHKNRPQFLVGSNSLLRTLNLTSLSIAFGVDEETMRRFIEAQREAVILPTPAAAPPHV 504

BLAST of Cp4.1LG16g05580 vs. TAIR10
Match: AT2G18540.1 (AT2G18540.1 RmlC-like cupins superfamily protein)

HSP 1 Score: 173.3 bits (438), Expect = 3.7e-43
Identity = 143/457 (31.29%), Positives = 219/457 (47.92%), Query Frame = 1

Query: 58  VIETEAGEMRVIRSPASRILDRPMHIGFITMEPKSLFVPQYLDSSLILFVRRGEVKVGLI 117
           V+ TE G +  ++      +    HI FIT+EP +L +P  L S ++ FV  G   +  I
Sbjct: 53  VVATEFGNISAVQ------IGDGYHIQFITLEPNALLLPLLLHSDMVFFVHTGTGILNWI 112

Query: 118 YKDELAERRMKGGDVYRIPAGSVFYMVNVGEGQRLQIICSIDKSESLSYGTFQSFFIGGG 177
            ++   +  ++ GDV+R+ +G+VFY V+  E  R+  I ++ K             +G  
Sbjct: 113 DEESERKLELRRGDVFRLRSGTVFY-VHSNEKLRVYAIFNVGKC-------LNDPCLGAY 172

Query: 178 TYPVSVLAGFDQDTLATAFNVSYTELRRILSRQRQGPIVYISDTESPGVWSKFLQVKDGD 237
           +    +L GFD  TL +AF V    LR+I    +   IV                     
Sbjct: 173 SSVRDLLLGFDDRTLRSAFAVPEDILRKIRDATKPPLIV--------------------- 232

Query: 238 KGNKIATINEDGEEAEKNKTWSWRNLMSSIFGNENRD----KTKRTRTGKSPDSYNLYDK 297
             N +      G E +K   W  R +   +   +  D    K       K   ++N++++
Sbjct: 233 --NALPRNRTQGLEEDK---WQSRLVRLFVSVEDVTDHLAMKPIVDTNKKKSRTFNVFEE 292

Query: 298 TPDFSNAYGWSVALDEHEYSPLGHSGIGVYLVNLTAGSMMAPHINPTAAEYGIVLRGTGT 357
            PDF N  G S+ +DE +   L  S  GV++VNLT GSM+ PH NP+A E  IVL G G 
Sbjct: 293 DPDFENNNGRSIVVDEKDLDALKGSRFGVFMVNLTKGSMIGPHWNPSACEISIVLEGEGM 352

Query: 358 IQIVYPNGTSAMDTE-------VTEGDVFWVPRYFPFCQIASRTGPFEFFGFTTSSRRNR 417
           +++V     S+   +       V EGDVF VP++ P  Q++     F F GF+TS++ N 
Sbjct: 353 VRVVNQQSLSSCKNDRKSESFMVEEGDVFVVPKFHPMAQMSFENSSFVFMGFSTSAKTNH 412

Query: 418 PQFLAGANSVFHTLRSPAVASAFDITEDDLDRLLSLQHEVVILPSA-----EIAPPHKEE 477
           PQFL G +SV   L    VA +F+++ + +  LL  Q E VI   A     E++   +E 
Sbjct: 413 PQFLVGQSSVLKVLDRDVVAVSFNLSNETIKGLLKAQKESVIFECASCAEGELSKLMREI 469

Query: 478 EKRRRREE---GRRERERESERERERERETEEEWTRR 496
           E+R+RREE    RR +E E  R+RE  +  EEE  +R
Sbjct: 473 EERKRREEEEIERRRKEEEEARKREEAKRREEEEAKR 469

BLAST of Cp4.1LG16g05580 vs. TAIR10
Match: AT4G36700.1 (AT4G36700.1 RmlC-like cupins superfamily protein)

HSP 1 Score: 159.8 bits (403), Expect = 4.2e-39
Identity = 149/517 (28.82%), Positives = 249/517 (48.16%), Query Frame = 1

Query: 8   VMLLIIAVLGNAIGIKEEAEAEEEEEWWREREEEREFRSKERFLLEDSKRVIETEAGEMR 67
           V+LL++  L      +  A++EE EE+         F S      +  K + ET+ G++ 
Sbjct: 11  VLLLVLLFLCT----ESLAKSEESEEYDVAVPSCCGFSSPLLIKKDQWKPIFETKFGQIS 70

Query: 68  VIRSPASRILDRPMHIGFITMEPKSLFVPQYLDSSLILFVRRGEVKVGLIYKDELAERRM 127
            ++         P  I  IT+EP ++ +P  L S ++ FV  G   +  +  +E     +
Sbjct: 71  TVQIGNGCGGMGPYKIHSITLEPNTILLPLLLHSDMVFFVDSGSGILNWV-DEEAKSTEI 130

Query: 128 KGGDVYRIPAGSVFYM----VNVGEGQRLQIICSIDKSESL----SYGTFQSFFIGGGTY 187
           + GDVYR+  GSVFY+    V++  G +L++      ++       +G + S        
Sbjct: 131 RLGDVYRLRPGSVFYLQSKPVDIFLGTKLKLYAIFSNNDECLHDPCFGAYSSI------- 190

Query: 188 PVSVLAGFDQDTLATAFNVSYTELRRILSRQRQGPIVYISDTE-SPGV---WS---KFLQ 247
              ++ GFD+  L +AF V    +   L R R  P + +S+T  +PGV   W    + L+
Sbjct: 191 -TDLMFGFDETILQSAFGVPEGIIE--LMRNRTKPPLIVSETLCTPGVANTWQLQPRLLK 250

Query: 248 VKDGDKGNKIATINEDGEEAEKNKTWSWRNLMSSIFGNENRDKTKRTRTGKSPDSYNLYD 307
           +  G      A + ++ ++ EK                E ++K K+ +T      +N+++
Sbjct: 251 LFAGS-----ADLVDNKKKKEKK---------------EKKEKVKKAKT------FNVFE 310

Query: 308 KTPDFSNAYGWSVALDEHEYSPLGHSGIGVYLVNLTAGSMMAPHINPTAAEYGIVLRGTG 367
             PDF + YG ++ ++  +   L  S +GV +VNLT GSMM PH NP A E  IVL+G G
Sbjct: 311 SEPDFESPYGRTITINRKDLKVLKGSMVGVSMVNLTQGSMMGPHWNPWACEISIVLKGAG 370

Query: 368 TIQIVYPNGTSAMDTE-------VTEGDVFWVPRYFPFCQIASRTGPFEFFGFTTSSRRN 427
            ++++  + +S   +E       V EGD+F VPR  P  Q++       F GFTTS++ N
Sbjct: 371 MVRVLRSSISSNTSSECKNVRFKVEEGDIFAVPRLHPMAQMSFNNDSLVFVGFTTSAKNN 430

Query: 428 RPQFLAGANSVFHTLRSPAVASAFDITEDDLDRLLSLQHEVVILPSAEIAPPH------- 487
            PQFLAG +S    L    +A++ +++   +D LL  Q E VIL     A          
Sbjct: 431 EPQFLAGEDSALRMLDRQVLAASLNVSSVTIDGLLGAQKEAVILECHSCAEGEIEKLKVE 486

Query: 488 ----KEEEKRRRREEGRRERERESERERERERETEEE 492
               K +++R+RR + R++ E E++RE E  R+ EEE
Sbjct: 491 IERKKIDDERKRRHDERKKEEEEAKREEEERRKREEE 486

BLAST of Cp4.1LG16g05580 vs. TAIR10
Match: AT3G22640.1 (AT3G22640.1 cupin family protein)

HSP 1 Score: 72.8 bits (177), Expect = 6.8e-13
Identity = 100/470 (21.28%), Positives = 170/470 (36.17%), Query Frame = 1

Query: 33  EWWREREEEREFRSKERFLLEDSKRVIETEAGEMRVI-----RSPASRILDRPMHIGFIT 92
           E W E      +  ++R   +      +++ G +RV+      +PA            + 
Sbjct: 53  EGWEEESTNHPYHFRKRSFSD----WFQSKEGFVRVLPKFTKHAPALFRGIENYRFSLVE 112

Query: 93  MEPKSLFVPQYLDSSLILFVRRGEVKVGLIYKDELAERRMKGGDVYRIPAGSVFYMVNVG 152
           MEP + FVP +LD+  +  V +G+  +  +         +  GDV RIP+G   ++ N  
Sbjct: 113 MEPTTFFVPHHLDADAVFIVLQGKGVIEFVTDKTKESFHITKGDVVRIPSGVTNFITNTN 172

Query: 153 EGQRL---QIICSIDKSESLSYGTFQSFFIGGGTYPVSVLAGFDQDTLATAFNVSYTELR 212
           +   L   QI   ++       G ++ +F     +  S   GF ++ L+T+FNV    L 
Sbjct: 173 QTVPLRLAQITVPVNNP-----GNYKDYFPAASQFQQSYFNGFTKEVLSTSFNVPEELLG 232

Query: 213 RILSRQR---QGPIVYISDTESPGVWSKFLQVKDGDKGNKIATINEDGEEAEKNKTWSWR 272
           R+++R +   QG I  IS  +      K L        NK     E  E+ +    W+  
Sbjct: 233 RLVTRSKEIGQGIIRRISPDQ-----IKELAEHATSPSNKHKAKKEKEEDKDLRTLWTPF 292

Query: 273 NLM------SSIFGNENRDKTKRTRTGKSPDSYNLYDKTPDFSNAYGWSVALDEHEYSPL 332
           NL       S+ FG+ +    K             Y++  D   A  W+           
Sbjct: 293 NLFAIDPIYSNDFGHFHEAHPKN------------YNQLQDLHIAAAWA----------- 352

Query: 333 GHSGIGVYLVNLTAGSMMAPHINPTAAEYGIVLRGTGTIQIVYP---------------- 392
                     N+T GS+  PH N        V  G    ++  P                
Sbjct: 353 ----------NMTQGSLFLPHFNSKTTFVTFVENGCARFEMATPYKFQRGQQQWPGQGQE 412

Query: 393 ------NGTSAMDTEVTEGDVFWVPRYFPFCQIASRTGPFEFFGFTTSSRRNRPQFLAGA 452
                      + + V +G+VF VP   PF  I S+   F   GF   +  ++  FLAG 
Sbjct: 413 EEEDMSENVHKVVSRVCKGEVFIVPAGHPF-TILSQDQDFIAVGFGIYATNSKRTFLAGE 472

Query: 453 NSVFHTLRSPAVASAFDITEDDLDRLLSLQHEVVILPSAEIAPPHKEEEK 464
            ++   L   A    F +     ++L + Q+     P++       E+ K
Sbjct: 473 ENLLSNLNPAATRVTFGVGSKVAEKLFTSQNYSYFAPTSRSQQQIPEKHK 474

BLAST of Cp4.1LG16g05580 vs. NCBI nr
Match: gi|7484767|pir||T10443 (probable major protein body membrane protein MP27 / major protein body protein MP32 precursor - cucurbit)

HSP 1 Score: 923.7 bits (2386), Expect = 1.4e-265
Identity = 478/498 (95.98%), Positives = 484/498 (97.19%), Query Frame = 1

Query: 1   MGKKEALVMLLIIAVLGNAIGIKEEAEAEEEEEWWREREEEREFRSKERFLLEDSKRVIE 60
           M KKEALVMLLIIAVLGNAIGIKEEAEA EEEEWWREREEEREFRSKE+FLLEDSKRVIE
Sbjct: 7   MWKKEALVMLLIIAVLGNAIGIKEEAEAAEEEEWWREREEEREFRSKEQFLLEDSKRVIE 66

Query: 61  TEAGEMRVIRSPASRILDRPMHIGFITMEPKSLFVPQYLDSSLILFVRRGEVKVGLIYKD 120
           TEAGEMRVIRSPASRILDRPMHIGFITMEPKSLFVPQYLDSSLILFVRRGEVKVGLIYKD
Sbjct: 67  TEAGEMRVIRSPASRILDRPMHIGFITMEPKSLFVPQYLDSSLILFVRRGEVKVGLIYKD 126

Query: 121 ELAERRMKGGDVYRIPAGSVFYMVNVGEGQRLQIICSIDKSESLSYGTFQSFFIGGGTYP 180
           ELAERRMKGGDVYRIPAGSVFYMVNVGEGQRLQIICSIDKSESLSYGTFQSFFIGGGTYP
Sbjct: 127 ELAERRMKGGDVYRIPAGSVFYMVNVGEGQRLQIICSIDKSESLSYGTFQSFFIGGGTYP 186

Query: 181 VSVLAGFDQDTLATAFNVSYTELRRILSRQRQGPIVYISDTESPGVWSKFLQVKDGDKGN 240
           VSVLAGFDQDTLATAFNVSYTELRRILSRQRQGPIVY+SDTESPGVWSKFLQVKDGDKGN
Sbjct: 187 VSVLAGFDQDTLATAFNVSYTELRRILSRQRQGPIVYVSDTESPGVWSKFLQVKDGDKGN 246

Query: 241 KIATINEDGEEAEKNKTWSWRNLMSSIFGNENRDKTKRTRTGKSPDSYNLYDKTPDFSNA 300
           KIA INEDGEEAEKNK WSWRNL+S IFGNENRDKTKRTRTGKSPDSYNLYDKTPDFSNA
Sbjct: 247 KIANINEDGEEAEKNKPWSWRNLVSLIFGNENRDKTKRTRTGKSPDSYNLYDKTPDFSNA 306

Query: 301 YGWSVALDEHEYSPLGHSGIGVYLVNLTAGSMMAPHINPTAAEYGIVLRGTGTIQIVYPN 360
           YGWSVALDEHEYSPLGHSGIGVYLVNLTAGSMMAPHINPTAAEYGIVLRGTGTIQIVYPN
Sbjct: 307 YGWSVALDEHEYSPLGHSGIGVYLVNLTAGSMMAPHINPTAAEYGIVLRGTGTIQIVYPN 366

Query: 361 GTSAMDTEVTEGDVFWVPRYFPFCQIASRTGPFEFFGFTTSSRRNRPQFLAGANSVFHTL 420
           GTSAMDTEVTEGDVFWVPRYFPFCQIASRTGPFEFFGFTTSSRRNRPQFLA ANS+FHTL
Sbjct: 367 GTSAMDTEVTEGDVFWVPRYFPFCQIASRTGPFEFFGFTTSSRRNRPQFLACANSIFHTL 426

Query: 421 RSPAVASAFDITEDDLDRLLSLQHEVVILPSAEIAPPHKEEEKRRRREEGRRERERESER 480
           RSPAVA+AFDITEDDLDRLLS Q+EVVILPSAEIAPPHKEEEKRRRREEGRRERERESER
Sbjct: 427 RSPAVATAFDITEDDLDRLLSAQYEVVILPSAEIAPPHKEEEKRRRREEGRRERERESER 486

Query: 481 ERERERETEEEWTRRLGA 499
           ER      EEEWTRRL A
Sbjct: 487 ER------EEEWTRRLEA 498

BLAST of Cp4.1LG16g05580 vs. NCBI nr
Match: gi|659127842|ref|XP_008463915.1| (PREDICTED: LOW QUALITY PROTEIN: globulin-1 S allele [Cucumis melo])

HSP 1 Score: 810.8 bits (2093), Expect = 1.3e-231
Identity = 418/498 (83.94%), Positives = 455/498 (91.37%), Query Frame = 1

Query: 1   MGKKEALVMLLIIAVLGNAIGIKEEAEAEEEEEWWREREEEREFRSKERFLLEDSKRVIE 60
           MGKKEAL++LLI+AVLGNAIGIKEE    EEEEWWREREEEREF  KERFL+ DSK+VIE
Sbjct: 43  MGKKEALLILLIVAVLGNAIGIKEE----EEEEWWREREEEREFGRKERFLMVDSKKVIE 102

Query: 61  TEAGEMRVIRSPASRILDRPMHIGFITMEPKSLFVPQYLDSSLILFVRRGEVKVGLIYKD 120
           TEAGEMRV+R PASRILDRPMHIGFITMEPKSLFVPQYLDS+LILFVRRG+VKVGLIYKD
Sbjct: 103 TEAGEMRVMRGPASRILDRPMHIGFITMEPKSLFVPQYLDSTLILFVRRGDVKVGLIYKD 162

Query: 121 ELAERRMKGGDVYRIPAGSVFYMVNVGEGQRLQIICSIDKSESLSYGTFQSFFIGGGTYP 180
           ELAERRMKGGDV+RIPAGSVFYMVNVGEGQRL+IICSIDKSESLSYGTFQSFF+ GG YP
Sbjct: 163 ELAERRMKGGDVFRIPAGSVFYMVNVGEGQRLEIICSIDKSESLSYGTFQSFFVAGGKYP 222

Query: 181 VSVLAGFDQDTLATAFNVSYTELRRILSRQRQGPIVYISDTESPGVWSKFLQVKDGDKGN 240
            SVLAGFDQDTLATAFNVSYTELRRILSRQRQGPIVY+SDTESP VWSKFLQV D  + +
Sbjct: 223 GSVLAGFDQDTLATAFNVSYTELRRILSRQRQGPIVYVSDTESPRVWSKFLQVNDEARLS 282

Query: 241 KIATINEDGEEAEKNKTWSWRNLMSSIFGNENRDKTKRTRTGKSPDSYNLYDKTPDFSNA 300
           K+A I+EDGE++EKNK WSWR L+ SIF N NRDK+K+TRTGKSPDSYNLYDK PDFSNA
Sbjct: 283 KVADIDEDGEKSEKNKPWSWRKLVESIFRNGNRDKSKKTRTGKSPDSYNLYDKDPDFSNA 342

Query: 301 YGWSVALDEHEYSPLGHSGIGVYLVNLTAGSMMAPHINPTAAEYGIVLRGTGTIQIVYPN 360
           YGWSVALDE EY PL HSGIGVYLVNLTAGSMMAPH+NPTA+EYGIVLRGTGTIQIVYPN
Sbjct: 343 YGWSVALDETEYHPLRHSGIGVYLVNLTAGSMMAPHVNPTASEYGIVLRGTGTIQIVYPN 402

Query: 361 GTSAMDTEVTEGDVFWVPRYFPFCQIASRTGPFEFFGFTTSSRRNRPQFLAGANSVFHTL 420
           GTSAM+ EVTEGDVFW+PRYFPFCQIASRTGPFEFFGFTTS+R+NRPQFLAGA+S+FHTL
Sbjct: 403 GTSAMNAEVTEGDVFWIPRYFPFCQIASRTGPFEFFGFTTSARKNRPQFLAGASSIFHTL 462

Query: 421 RSPAVASAFDITEDDLDRLLSLQHEVVILPSAEIAPPHKEEEKRRRREEGRRERERESER 480
           RS  +A+AFDITEDD++RLL  Q+E +ILPSAEIAPPHKEEEKRRR+EE RRE E E ER
Sbjct: 463 RSMEMATAFDITEDDMERLLGAQYEAIILPSAEIAPPHKEEEKRRRKEEERREAEWERER 522

Query: 481 ERERERETE-EEWTRRLG 498
           ERERERE   +E  R  G
Sbjct: 523 ERERERERRVDEMVRSFG 536

BLAST of Cp4.1LG16g05580 vs. NCBI nr
Match: gi|449463687|ref|XP_004149563.1| (PREDICTED: globulin-1 S allele [Cucumis sativus])

HSP 1 Score: 810.8 bits (2093), Expect = 1.3e-231
Identity = 421/500 (84.20%), Positives = 458/500 (91.60%), Query Frame = 1

Query: 1   MGKKEALVMLLIIAVLGNAIGIKEEAEAEEEEEWWREREEEREFRSKERFLLEDSKRVIE 60
           MGKKEAL++LLI+AVLGNAIGIKEE    EEEEWWREREEE+ F SKERFL+ DSK+VIE
Sbjct: 1   MGKKEALLILLIVAVLGNAIGIKEE----EEEEWWREREEEK-FGSKERFLMVDSKKVIE 60

Query: 61  TEAGEMRVIRSPASRILDRPMHIGFITMEPKSLFVPQYLDSSLILFVRRGEVKVGLIYKD 120
           TEAGEMRV+R P SRILD+ MHIGFITMEPKSLFVPQYLDS+LILFVRRG+VKVGLIYKD
Sbjct: 61  TEAGEMRVMRGPISRILDKAMHIGFITMEPKSLFVPQYLDSTLILFVRRGDVKVGLIYKD 120

Query: 121 ELAERRMKGGDVYRIPAGSVFYMVNVGEGQRLQIICSIDKSESLSYGTFQSFFIGGGTYP 180
           ELAERRMKGGDV+RIPAGSVFYMVNVGEGQRL+IICSIDKSESLSYGTFQSFF+ GG YP
Sbjct: 121 ELAERRMKGGDVFRIPAGSVFYMVNVGEGQRLEIICSIDKSESLSYGTFQSFFVAGGKYP 180

Query: 181 VSVLAGFDQDTLATAFNVSYTELRRILSRQRQGPIVYISDTESPGVWSKFLQVKDGDKGN 240
            SVLAGFDQDTLATAFNVSYTELRRILSRQRQGPIVYISDTESP VWSKFLQVKD  + +
Sbjct: 181 GSVLAGFDQDTLATAFNVSYTELRRILSRQRQGPIVYISDTESPRVWSKFLQVKDKARLS 240

Query: 241 KIATINEDGEEAEKNKTWSWRNLMSSIFGNENRDKTKR-TRTGKSPDSYNLYDKTPDFSN 300
           K+A  NEDGEE+EKNK WSWR LM+SIF NENRDK+K+ TRTGKSPDSYNLYDKTPDFSN
Sbjct: 241 KVADNNEDGEESEKNKRWSWRKLMNSIFRNENRDKSKKITRTGKSPDSYNLYDKTPDFSN 300

Query: 301 AYGWSVALDEHEYSPLGHSGIGVYLVNLTAGSMMAPHINPTAAEYGIVLRGTGTIQIVYP 360
           AYGWSVALDE EY PLGHSGIGVYLVNLTAGSMMAPH+NPTAAEYGIVLRGTGTIQIVYP
Sbjct: 301 AYGWSVALDETEYHPLGHSGIGVYLVNLTAGSMMAPHVNPTAAEYGIVLRGTGTIQIVYP 360

Query: 361 NGTSAMDTEVTEGDVFWVPRYFPFCQIASRTGPFEFFGFTTSSRRNRPQFLAGANSVFHT 420
           NGTSAM+ EVTEGDVFW+PRYFPFCQIASRTGPFEFFGFTTSSR+NRPQFLAGA+S+FHT
Sbjct: 361 NGTSAMNAEVTEGDVFWIPRYFPFCQIASRTGPFEFFGFTTSSRKNRPQFLAGASSIFHT 420

Query: 421 LRSPAVASAFDITEDDLDRLLSLQHEVVILPSAEIAPPHKEEEKRRRREEGRRERERESE 480
           LR+  +A+AFDITEDD++RLL  Q+E +ILPSAEIAPPHKEEEK+RR+EE RRE E E+E
Sbjct: 421 LRNMEMATAFDITEDDMERLLGAQYEAIILPSAEIAPPHKEEEKKRRKEEERREAETETE 480

Query: 481 RERERERETEEEWTRRLGAV 500
            E ERERE E E  RR+  V
Sbjct: 481 TEWERERERERERERRVDEV 495

BLAST of Cp4.1LG16g05580 vs. NCBI nr
Match: gi|566186528|ref|XP_002313331.2| (hypothetical protein POPTR_0009s05950g [Populus trichocarpa])

HSP 1 Score: 585.9 bits (1509), Expect = 6.8e-164
Identity = 291/468 (62.18%), Positives = 368/468 (78.63%), Query Frame = 1

Query: 1   MGKKEALVMLLIIAVLG--NAIGIKEEAEAEEEEEWWREREEEREFRSKERFLLEDSKRV 60
           MG   A ++LL++   G   A+G+       E+E+W  +R+E +  R +E  LL+DSKRV
Sbjct: 1   MGNGAAHLLLLLVLCYGVQMAVGLYRG----EKEDWRGDRDETQIDREEEWLLLQDSKRV 60

Query: 61  IETEAGEMRVIRSPASRILDRPMHIGFITMEPKSLFVPQYLDSSLILFVRRGEVKVGLIY 120
           ++T+AGEM V+R+   RI+DRPMHIGFITMEP++LFVPQY+DSSLILF+R GE KVGLIY
Sbjct: 61  VKTDAGEMMVLRNYGGRIIDRPMHIGFITMEPRTLFVPQYIDSSLILFIRTGEAKVGLIY 120

Query: 121 KDELAERRMKGGDVYRIPAGSVFYMVNVGEGQRLQIICSIDKSESLSYGTFQSFFIGGGT 180
           KDELAERR+K GD+YRIPAGS FY++N  EGQRL IICSID SESL  G FQSF+IGGGT
Sbjct: 121 KDELAERRLKIGDIYRIPAGSAFYLMNAEEGQRLHIICSIDPSESLGLGFFQSFYIGGGT 180

Query: 181 YPVSVLAGFDQDTLATAFNVSYTELRRILSRQRQGPIVYISDTESP--GVWSKFLQVKDG 240
           YP S+LAGF+ +TL+ AFNV+  E+R I++RQ++GPIV+I D+ +P   +W+KFLQ+K+ 
Sbjct: 181 YPPSILAGFELETLSAAFNVTADEVREIMTRQQEGPIVFIGDSRAPRPSLWTKFLQLKEQ 240

Query: 241 DKGN---KIATINEDGEEAEKNKTWSWRNLMSSIFGNENRDKTKRTRTGKSPDSYNLYDK 300
           D+     ++    +   + E+ +TWSWR L++SIFG EN  K K  + GKSPDSYN+YD+
Sbjct: 241 DRLQHLKRMVKFQQQPSQGEEQRTWSWRKLLNSIFGQEN--KKKGEKVGKSPDSYNIYDR 300

Query: 301 TPDFSNAYGWSVALDEHEYSPLGHSGIGVYLVNLTAGSMMAPHINPTAAEYGIVLRGTGT 360
            PDF N YGWS+ALDE +Y PL +SGIGVYLVNLTAGSM+APH+NPTA EYGIVLRG+G 
Sbjct: 301 RPDFRNNYGWSIALDESDYQPLKYSGIGVYLVNLTAGSMLAPHVNPTATEYGIVLRGSGR 360

Query: 361 IQIVYPNGTSAMDTEVTEGDVFWVPRYFPFCQIASRTGPFEFFGFTTSSRRNRPQFLAGA 420
           IQIV+PNGT AMD  V EGDVFWVPRYFPFCQIA+R+GPFEFFGFTTS+R NRPQFL GA
Sbjct: 361 IQIVFPNGTQAMDATVKEGDVFWVPRYFPFCQIAARSGPFEFFGFTTSARENRPQFLVGA 420

Query: 421 NSVFHTLRSPAVASAFDITEDDLDRLLSLQHEVVILPSAEIAPPHKEE 462
           NS+  TLRSP +A+AF ++ED ++R++  Q E VILPSA  APP +EE
Sbjct: 421 NSILQTLRSPELAAAFGVSEDRINRVIKAQREAVILPSASAAPPDEEE 462

BLAST of Cp4.1LG16g05580 vs. NCBI nr
Match: gi|743837587|ref|XP_011025519.1| (PREDICTED: vicilin-like antimicrobial peptides 2-3 [Populus euphratica])

HSP 1 Score: 576.6 bits (1485), Expect = 4.1e-161
Identity = 286/466 (61.37%), Positives = 362/466 (77.68%), Query Frame = 1

Query: 1   MGKKEALVMLLIIAVLGNAIGIKEEAEAEEEEEWWREREEEREFRSKERFLLEDSKRVIE 60
           MG   A ++LL++   G  + +      EE+E+W  +R E +  R +E  LL+DSKRV++
Sbjct: 1   MGNGAAHLLLLLVLCYGVTMAVG--FYREEKEDWRGDRGETQTDREEEWLLLQDSKRVVK 60

Query: 61  TEAGEMRVIRSPASRILDRPMHIGFITMEPKSLFVPQYLDSSLILFVRRGEVKVGLIYKD 120
           T+AG++RV+++   RI+DRPMHIGFITMEP+SLFVPQY+DSSLILF+R GE KVGLIYKD
Sbjct: 61  TDAGDVRVLKNYGGRIIDRPMHIGFITMEPRSLFVPQYIDSSLILFIRTGEAKVGLIYKD 120

Query: 121 ELAERRMKGGDVYRIPAGSVFYMVNVGEGQRLQIICSIDKSESLSYGTFQSFFIGGGTYP 180
           ELAERR+K GD+YRIPAGS FY++N  EGQRL IICSID SESL  G FQSFFIGGGTYP
Sbjct: 121 ELAERRLKIGDIYRIPAGSAFYLMNAEEGQRLHIICSIDPSESLGLGFFQSFFIGGGTYP 180

Query: 181 VSVLAGFDQDTLATAFNVSYTELRRILSRQRQGPIVYISDTESP--GVWSKFLQVKDGDK 240
            S+LAGF+ +TL+ AFNV+  E+R I++RQ++GPIV+I D+ +P   +W+KFLQ+K+ D+
Sbjct: 181 PSILAGFELETLSNAFNVTTDEVREIMTRQQEGPIVFIGDSRAPRPSLWTKFLQLKEQDR 240

Query: 241 GNKI---ATINEDGEEAEKNKTWSWRNLMSSIFGNENRDKTKRTRTGKSPDSYNLYDKTP 300
              +       +     E+ +TWSWR L++SI G EN  K K    GKSPDSYN+YD+ P
Sbjct: 241 LQHLKRTVKFQQQPSPGEEQRTWSWRKLLNSIVGQEN--KKKGEIAGKSPDSYNIYDRRP 300

Query: 301 DFSNAYGWSVALDEHEYSPLGHSGIGVYLVNLTAGSMMAPHINPTAAEYGIVLRGTGTIQ 360
           DF N YGWS+ALDE +Y PL +SGIGVYLVNLTAGSM+APH+NPTA EYGIVL G+G IQ
Sbjct: 301 DFRNNYGWSIALDESDYHPLKYSGIGVYLVNLTAGSMLAPHVNPTATEYGIVLSGSGRIQ 360

Query: 361 IVYPNGTSAMDTEVTEGDVFWVPRYFPFCQIASRTGPFEFFGFTTSSRRNRPQFLAGANS 420
           +V+PNGT AM+  V EGDVFWVPRYFPFCQIA+R+GPFEFFGFTTS+R NRPQFL GANS
Sbjct: 361 VVFPNGTQAMNARVKEGDVFWVPRYFPFCQIAARSGPFEFFGFTTSARENRPQFLVGANS 420

Query: 421 VFHTLRSPAVASAFDITEDDLDRLLSLQHEVVILPSAEIAPPHKEE 462
           +  TLRSP +A+AF ++ED ++R++  Q E VILPSA  APP +EE
Sbjct: 421 ILQTLRSPELAAAFGVSEDRINRVIKAQREAVILPSASAAPPDEEE 462

The following BLAST results are available for this feature:
Match Name    E-value    Identity  Description
VCL22_ARATH   6.6e-135   55.69     Vicilin-like seed storage protein At2g28490 OS=Arabidopsis thaliana GN=At2g28490...
VCL21_ARATH   6.6e-42    31.29     Vicilin-like seed storage protein At2g18540 OS=Arabidopsis thaliana GN=At2g18540...
VCL43_ARATH   7.5e-38    28.82     Vicilin-like seed storage protein At4g36700 OS=Arabidopsis thaliana GN=At4g36700...
AMP22_MACIN   1.2e-24    23.31     Vicilin-like antimicrobial peptides 2-2 OS=Macadamia integrifolia GN=AMP2-2 PE=2...
AMP23_MACIN   3.1e-23    22.67     Vicilin-like antimicrobial peptides 2-3 (Fragment) OS=Macadamia integrifolia GN=...

Match Name         E-value    Identity  Description
Q39651_9ROSI       9.5e-266   95.98     PreproMP27-MP32 OS=Cucurbita cv. Kurokawa Amakuri PE=2 SV=1
A0A0A0KHQ1_CUCSA   9.0e-232   84.20     Uncharacterized protein OS=Cucumis sativus GN=Csa_6G290870 PE=4 SV=1
B9HNV4_POPTR       4.7e-164   62.18     Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0009s05950g PE=4 SV=2
A1E0W1_FICPW       8.6e-158   61.31     7S globulin OS=Ficus pumila var. awkeotsang PE=2 SV=1
M5W7Z8_PRUPE       2.5e-157   57.17     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa020107mg PE=4 SV=1

Match Name    E-value    Identity  Description
AT2G28490.1   3.7e-136   55.69     RmlC-like cupins superfamily protein
AT2G18540.1   3.7e-43    31.29     RmlC-like cupins superfamily protein
AT4G36700.1   4.2e-39    28.82     RmlC-like cupins superfamily protein
AT3G22640.1   6.8e-13    21.28     cupin family protein

Match Name                         E-value    Identity  Description
gi|7484767|pir||T10443             1.4e-265   95.98     probable major protein body membrane protein MP27 / major protein body protein M...
gi|659127842|ref|XP_008463915.1|   1.3e-231   83.94     PREDICTED: LOW QUALITY PROTEIN: globulin-1 S allele [Cucumis melo]
gi|449463687|ref|XP_004149563.1|   1.3e-231   84.20     PREDICTED: globulin-1 S allele [Cucumis sativus]
gi|566186528|ref|XP_002313331.2|   6.8e-164   62.18     hypothetical protein POPTR_0009s05950g [Populus trichocarpa]
gi|743837587|ref|XP_011025519.1|   4.1e-161   61.37     PREDICTED: vicilin-like antimicrobial peptides 2-3 [Populus euphratica]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term         Definition
GO:0045735   nutrient reservoir activity
Vocabulary: INTERPRO
Term         Definition
IPR014710    RmlC-like_jellyroll
IPR011051    RmlC_Cupin_sf
IPR006045    Cupin_1
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0009506 plasmodesma
cellular_component GO:0005575 cellular_component
molecular_function GO:0045735 nutrient reservoir activity

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
Cp4.1LG16g05580.1   Cp4.1LG16g05580.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term    IPR Description             Source    Source Term         Source Description                Alignment
IPR006045   Cupin 1                     PFAM      PF00190             Cupin_1                           coord: 288..433, score: 1.7E-32; coord: 53..200, score: 5.
IPR006045   Cupin 1                     SMART     SM00835             Cupin_1_3                         coord: 288..437, score: 2.8E-53; coord: 50..204, score: 8.6
IPR011051   RmlC-like cupin domain      unknown   SSF51182            RmlC-like cupins                  coord: 274..446, score: 2.2E-60; coord: 51..230, score: 2.2
IPR014710   RmlC-like jelly roll fold   GENE3D    G3DSA:2.60.120.10                                     coord: 292..450, score: 5.3E-34; coord: 46..219, score: 4.3
None        No IPR available            unknown   Coil                Coil                              coord: 458..491, scor
None        No IPR available            PANTHER   PTHR31189           FAMILY NOT NAMED                  coord: 1..226, score: 3.0E-261; coord: 248..491, score: 3.0E
None        No IPR available            PANTHER   PTHR31189:SF2       CUPIN DOMAIN-CONTAINING PROTEIN   coord: 1..226, score: 3.0E-261; coord: 248..491, score: 3.0E

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene              Organism                       Block
Cp4.1LG16g05580   Cucurbita pepo (Zucchini)      cpecpeB198
Cp4.1LG16g05580   Cucurbita pepo (Zucchini)      cpecpeB296
Cp4.1LG16g05580   Cucurbita pepo (Zucchini)      cpecpeB303
Cp4.1LG16g05580   Cucurbita pepo (Zucchini)      cpecpeB308
Cp4.1LG16g05580   Cucurbita pepo (Zucchini)      cpecpeB325
Cp4.1LG16g05580   Cucumber (Gy14) v1             cgycpeB0053
Cp4.1LG16g05580   Cucumber (Gy14) v1             cgycpeB0799
Cp4.1LG16g05580   Cucurbita maxima (Rimu)        cmacpeB224
Cp4.1LG16g05580   Cucurbita maxima (Rimu)        cmacpeB225
Cp4.1LG16g05580   Cucurbita maxima (Rimu)        cmacpeB226
Cp4.1LG16g05580   Cucurbita maxima (Rimu)        cmacpeB265
Cp4.1LG16g05580   Cucurbita maxima (Rimu)        cmacpeB625
Cp4.1LG16g05580   Cucurbita maxima (Rimu)        cmacpeB659
Cp4.1LG16g05580   Cucurbita maxima (Rimu)        cmacpeB821
Cp4.1LG16g05580   Cucurbita maxima (Rimu)        cmacpeB874
Cp4.1LG16g05580   Cucurbita moschata (Rifu)      cmocpeB193
Cp4.1LG16g05580   Cucurbita moschata (Rifu)      cmocpeB229
Cp4.1LG16g05580   Cucurbita moschata (Rifu)      cmocpeB517
Cp4.1LG16g05580   Cucurbita moschata (Rifu)      cmocpeB574
Cp4.1LG16g05580   Cucurbita moschata (Rifu)      cmocpeB610
Cp4.1LG16g05580   Cucurbita moschata (Rifu)      cmocpeB775
Cp4.1LG16g05580   Cucurbita moschata (Rifu)      cmocpeB812
Cp4.1LG16g05580   Wild cucumber (PI 183967)      cpecpiB279
Cp4.1LG16g05580   Wild cucumber (PI 183967)      cpecpiB285
Cp4.1LG16g05580   Cucumber (Chinese Long) v2     cpecuB276
Cp4.1LG16g05580   Cucumber (Chinese Long) v2     cpecuB283
Cp4.1LG16g05580   Bottle gourd (USVL1VR-Ls)      cpelsiB240
Cp4.1LG16g05580   Bottle gourd (USVL1VR-Ls)      cpelsiB241
Cp4.1LG16g05580   Watermelon (Charleston Gray)   cpewcgB248
Cp4.1LG16g05580   Watermelon (Charleston Gray)   cpewcgB279
Cp4.1LG16g05580   Watermelon (97103) v1          cpewmB276
Cp4.1LG16g05580   Watermelon (97103) v1          cpewmB286
Cp4.1LG16g05580   Melon (DHL92) v3.5.1           cpemeB276
Cp4.1LG16g05580   Cucumber (Gy14) v2             cgybcpeB036
Cp4.1LG16g05580   Cucumber (Gy14) v2             cgybcpeB347
Cp4.1LG16g05580   Cucumber (Gy14) v2             cgybcpeB757
Cp4.1LG16g05580   Cucumber (Gy14) v2             cgybcpeB768
Cp4.1LG16g05580   Melon (DHL92) v3.6.1           cpemedB314
Cp4.1LG16g05580   Melon (DHL92) v3.6.1           cpemedB322
Cp4.1LG16g05580   Silver-seed gourd              carcpeB0224
Cp4.1LG16g05580   Silver-seed gourd              carcpeB0423
Cp4.1LG16g05580   Cucumber (Chinese Long) v3     cpecucB0332
Cp4.1LG16g05580   Cucumber (Chinese Long) v3     cpecucB0343
Cp4.1LG16g05580   Cucumber (Chinese Long) v3     cpecucB0350
Cp4.1LG16g05580   Wax gourd                      cpewgoB0345