Csa3G218170 (gene) Cucumber (Chinese Long) v2

NameCsa3G218170
Typegene
OrganismCucumis. sativus (Cucumber (Chinese Long) v2)
DescriptionLegumin-like protein; contains IPR014710 (RmlC-like jelly roll fold)
LocationChr3 : 14512631 .. 14514977 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GAAAAAGAAAAAGAAAAATGGAGTTGAATTTGAAGCCAATGGATCCTTCAAATTTCTTTACGGGAGAAGGTGGATCCTTCCATAAATGGTTCCCTTCCGATTTTCCGATCATTTCTCAGACTAAAGTCGGCGCCGGAAGACTCCTTCTCCATCCACGTGGTTTTGCGGTTCCTCATAACTCTGATTCCTCCAAAGTTGGCTACGTTCTTCAAGGTTAGACTGTATTAATTTAAATTCTATTGTATCAATTGAATAATTAATTTTCCTTCACTTTTGAATTACTAATTGTATTAATTTAAATTCTATTGTATATGAATTTTCTGAAATGGAACGGTTTTGTAAAACGAAAATGAGTGTTGTTGAAAAAAAACGTGTAGTGGATTTGTTAACACTGTTTTCTTTGGTCAATATTGCACATGAAATTAAACGTGCAGGTAGCGGAGTAGCCGGAATTATATTTCCATGCAAATCTGAGGAAGCAGCGGTGAGACTAAAGAAAGGAGACGTAATTCCAGTGCCGGAGGGAGTCACCTCTTGGTGGTTTAACGACGGAGACTCCGATTTCGAAGTCCTTCTCGTCGGCGACACCCGAAACGCTCTCATTCCCGGTGACATCACCTACGTTGTCTTTGCTGGACCCCTCGGAGTCCTACAAGGCTTCTCGTCGGACTACATTGAAAAAGTGTACGATCTAACCGAAAAGGAAAGAGAGGTACTTCTCAAAAGCCAACCCAACGGCCTAATATTCAAGCTCAAAGATGACCAAACCTTACCCGAGCCCGACTGCCACAGCGATCTTGTTTTCAACATATACCACACCGCTCCCGATGCCGTAGTCAAGGGTGGTGGGTCGGTGACTGTCCTAACGGAAGAGAAGTTTCCATTTATTGGGAAATCTGGGCTGACGGCAGTTCTTGAGAAGCTTGAGGCCAATGCCGTGCGATCGCCGGTGTATGTTGCCGACCCTTCGGTGCAGCTGATATATGTAGCGAGCGGGTCGGGTCGGGTTCAGATTGCTGAGACGTTTATGCGTTATCAAATTGATGCGGAAGTGAAAGCGGGACAGTTGGTTTTGGTTCCAAAGTACTTTGCCGTCGGAAAAATGGCCGGAGAAGAAGGATTGGAGTGCTTCACTATTATCACCACCACACAGTAAGTTTCAATTTTGCATTATTGAGCTTTTGTCTTTTTTTTTTTTTTTTAATTTGAATTCTTTTCTTGGTGTTTTGATTTATGTTTTTTCTCTTAATCTTAACTTAACAGTTTAAAAAAAAAAATGTTTTGAAGTTTTCAAACGGAGTCATAGTTCATAGAAAATTGATTCTACCTAATTTGCGTTACAATTTGAATGCATTTCAAAGAAAAGAAAGTGCCTTTAACTAAGAATAATTTTTAAAATATTTAATATTTGTTATTACTTTTCATGGTCTTTCTTTTTTAGGCCGTTTTTCAGTTATTTGGTGGAATGTACTATATATAAAAGATCATACAAAAATATTCAATATTCAAAATTTAAAGTCAAATTTAAACTATCAAATCTAATCTAAACCTGTATAAAAAAACAATCGTTTTTCAAATATAAACTATCGTACCAAAAAATCTTAAGATTTGGATAACCAAATTTAAACTATCGTGTAATAAAAAAAATTATGAAAAAAATTGTTTAAATTTGGGTATTTAAACGATGAAATATAAACTAAAAAATATAGAACAAAAATAGAATAGTTAAATTGAAACAATCATTTTTCAAATAGAATAGTTAAATTTAAATAATAATTTTTCAAATATATTAGGGTTGCTAGACGTCAATTGATCGTAGAACATTTTTGTTATTTTTTATTGTGGTCAGTAAATTTTTTCTATTTTTGAAATTGTTCTATTCCTTATAAATATTTTGTGGTTTTGTTTGAAGAAAAATTCAAAACAAACGAACATATAATACACGGATGTATATCATTTATAAGTTAACATTATTATTAACAAAGTTTGATTTTTTGATTGATTGTAACACAGCCCTCTGTTAGAAGAGTTGGGAGGAAAGACATCAATTTTTGGGGCATTTTCACCCCAAGTTTTTGAAGCTTCTTTCAATCTCACAGCTCATTTTGAGAAGCTTTTCAGATCAAAGATAACAAAATCTTCACCCTTGGTTCCTCCCTCAGATAGTTGAACTGTTAATTCTACTAAGGAGCGAGAGGGAGAAATTTCACATCATAATGTTTGGACGTAACCTTTTGTACTTTAAATTAGTAAATTTGGTTTCTACAATAAAATCATGTTATTGATCTGTGTGTACTATTAAATCCATAAGTTACCTATCCACTTTCTTCAATAAAGCATGTGCCCT

mRNA sequence

ATGGAGTTGAATTTGAAGCCAATGGATCCTTCAAATTTCTTTACGGGAGAAGGTGGATCCTTCCATAAATGGTTCCCTTCCGATTTTCCGATCATTTCTCAGACTAAAGTCGGCGCCGGAAGACTCCTTCTCCATCCACGTGGTTTTGCGGTTCCTCATAACTCTGATTCCTCCAAAGTTGGCTACGTTCTTCAAGGTAGCGGAGTAGCCGGAATTATATTTCCATGCAAATCTGAGGAAGCAGCGGTGAGACTAAAGAAAGGAGACGTAATTCCAGTGCCGGAGGGAGTCACCTCTTGGTGGTTTAACGACGGAGACTCCGATTTCGAAGTCCTTCTCGTCGGCGACACCCGAAACGCTCTCATTCCCGGTGACATCACCTACGTTGTCTTTGCTGGACCCCTCGGAGTCCTACAAGGCTTCTCGTCGGACTACATTGAAAAAGTGTACGATCTAACCGAAAAGGAAAGAGAGGTACTTCTCAAAAGCCAACCCAACGGCCTAATATTCAAGCTCAAAGATGACCAAACCTTACCCGAGCCCGACTGCCACAGCGATCTTGTTTTCAACATATACCACACCGCTCCCGATGCCGTAGTCAAGGGTGGTGGGTCGGTGACTGTCCTAACGGAAGAGAAGTTTCCATTTATTGGGAAATCTGGGCTGACGGCAGTTCTTGAGAAGCTTGAGGCCAATGCCGTGCGATCGCCGGTGTATGTTGCCGACCCTTCGGTGCAGCTGATATATGTAGCGAGCGGGTCGGGTCGGGTTCAGATTGCTGAGACGTTTATGCGTTATCAAATTGATGCGGAAGTGAAAGCGGGACAGTTGGTTTTGGTTCCAAAGTACTTTGCCGTCGGAAAAATGGCCGGAGAAGAAGGATTGGAGTGCTTCACTATTATCACCACCACACACCCTCTGTTAGAAGAGTTGGGAGGAAAGACATCAATTTTTGGGGCATTTTCACCCCAAGTTTTTGAAGCTTCTTTCAATCTCACAGCTCATTTTGAGAAGCTTTTCAGATCAAAGATAACAAAATCTTCACCCTTGGTTCCTCCCTCAGATAGTTGA

Coding sequence (CDS)

ATGGAGTTGAATTTGAAGCCAATGGATCCTTCAAATTTCTTTACGGGAGAAGGTGGATCCTTCCATAAATGGTTCCCTTCCGATTTTCCGATCATTTCTCAGACTAAAGTCGGCGCCGGAAGACTCCTTCTCCATCCACGTGGTTTTGCGGTTCCTCATAACTCTGATTCCTCCAAAGTTGGCTACGTTCTTCAAGGTAGCGGAGTAGCCGGAATTATATTTCCATGCAAATCTGAGGAAGCAGCGGTGAGACTAAAGAAAGGAGACGTAATTCCAGTGCCGGAGGGAGTCACCTCTTGGTGGTTTAACGACGGAGACTCCGATTTCGAAGTCCTTCTCGTCGGCGACACCCGAAACGCTCTCATTCCCGGTGACATCACCTACGTTGTCTTTGCTGGACCCCTCGGAGTCCTACAAGGCTTCTCGTCGGACTACATTGAAAAAGTGTACGATCTAACCGAAAAGGAAAGAGAGGTACTTCTCAAAAGCCAACCCAACGGCCTAATATTCAAGCTCAAAGATGACCAAACCTTACCCGAGCCCGACTGCCACAGCGATCTTGTTTTCAACATATACCACACCGCTCCCGATGCCGTAGTCAAGGGTGGTGGGTCGGTGACTGTCCTAACGGAAGAGAAGTTTCCATTTATTGGGAAATCTGGGCTGACGGCAGTTCTTGAGAAGCTTGAGGCCAATGCCGTGCGATCGCCGGTGTATGTTGCCGACCCTTCGGTGCAGCTGATATATGTAGCGAGCGGGTCGGGTCGGGTTCAGATTGCTGAGACGTTTATGCGTTATCAAATTGATGCGGAAGTGAAAGCGGGACAGTTGGTTTTGGTTCCAAAGTACTTTGCCGTCGGAAAAATGGCCGGAGAAGAAGGATTGGAGTGCTTCACTATTATCACCACCACACACCCTCTGTTAGAAGAGTTGGGAGGAAAGACATCAATTTTTGGGGCATTTTCACCCCAAGTTTTTGAAGCTTCTTTCAATCTCACAGCTCATTTTGAGAAGCTTTTCAGATCAAAGATAACAAAATCTTCACCCTTGGTTCCTCCCTCAGATAGTTGA

Protein sequence

MELNLKPMDPSNFFTGEGGSFHKWFPSDFPIISQTKVGAGRLLLHPRGFAVPHNSDSSKVGYVLQGSGVAGIIFPCKSEEAAVRLKKGDVIPVPEGVTSWWFNDGDSDFEVLLVGDTRNALIPGDITYVVFAGPLGVLQGFSSDYIEKVYDLTEKEREVLLKSQPNGLIFKLKDDQTLPEPDCHSDLVFNIYHTAPDAVVKGGGSVTVLTEEKFPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYVASGSGRVQIAETFMRYQIDAEVKAGQLVLVPKYFAVGKMAGEEGLECFTIITTTHPLLEELGGKTSIFGAFSPQVFEASFNLTAHFEKLFRSKITKSSPLVPPSDS*
BLAST of Csa3G218170 vs. Swiss-Prot
Match: 11S2_SESIN (11S globulin seed storage protein 2 OS=Sesamum indicum PE=2 SV=1)

HSP 1 Score: 79.7 bits (195), Expect = 7.1e-14
Identity = 67/310 (21.61%), Postives = 122/310 (39.35%), Query Frame = 1

Query: 84  RLKKGDVIPVPEGVTSWWFNDGDSDFEVLLVGDTRNALIPGDITYVVF--AGPL------ 143
           RL++GD++ +P G   W +NDG  D   + + D  +     D  +  F  AG +      
Sbjct: 146 RLRQGDIVAIPSGAAHWCYNDGSEDLVAVSINDVNHLSNQLDQKFRAFYLAGGVPRSGEQ 205

Query: 144 ---------GVLQGFSSDYIEKVYDLTEKE-REVLLKSQPNGLIFKLKD----------- 203
                     + + F ++ + + +++ ++  R +  + +  GLI   ++           
Sbjct: 206 EQQARQTFHNIFRAFDAELLSEAFNVPQETIRRMQSEEEERGLIVMARERMTFVRPDEEE 265

Query: 204 ----------DQTLPEPDCHSDLVFNIY-HTAPDAVVKGGGSVTVLTEEKFPFIGKSGLT 263
                     D  L E  C      N+      D   +  G V V+   K P +    L+
Sbjct: 266 GEQEHRGRQLDNGLEETFCTMKFRTNVESRREADIFSRQAGRVHVVDRNKLPILKYMDLS 325

Query: 264 AVLEKLEANAVRSPVYVADPSVQLIYVASGSGRVQIAETFMRYQIDAEVKAGQLVLVPKY 323
           A    L +NA+ SP +       ++YV  G  +VQ+ +   +  ++  V  G++ +VP+Y
Sbjct: 326 AEKGNLYSNALVSPDWSMTGHT-IVYVTRGDAQVQVVDHNGQALMNDRVNQGEMFVVPQY 385

Query: 324 FAVGKMAGEEGLECFTIITTTHPLLEELGGKTSIFGAFSPQVFEASFNLTAHFEKLFRSK 354
           +     AG  G E     TT  P+   L G TS+  A   QV   S+ ++ +  +  +  
Sbjct: 386 YTSTARAGNNGFEWVAFKTTGSPMRSPLAGYTSVIRAMPLQVITNSYQISPNQAQALKMN 445

BLAST of Csa3G218170 vs. Swiss-Prot
Match: CRU4_ARATH (12S seed storage protein CRD OS=Arabidopsis thaliana GN=CRD PE=1 SV=1)

HSP 1 Score: 75.1 bits (183), Expect = 1.7e-12
Identity = 95/413 (23.00%), Postives = 153/413 (37.05%), Query Frame = 1

Query: 4   NLKPMDPSNFFTGEGGSFHKWFPSDFPIISQTKVGAGRLLLHPRGFAVPHNSDSSKVGYV 63
           +L P   + F   E G    W     P +    V   R+ L P    +P       + YV
Sbjct: 43  SLAPAQATKF---EAGQMEVWDHMS-PELRCAGVTVARITLQPNSIFLPAFFSPPALAYV 102

Query: 64  LQG---------------------SGVAGIIFPCKSEE----AAVRLKKGDVIPVPEGVT 123
           +QG                     SG  G   P +  E         ++GDV     GV+
Sbjct: 103 VQGEGVMGTIASGCPETFAEVEGSSGRGGGGDPGRRFEDMHQKLENFRRGDVFASLAGVS 162

Query: 124 SWWFNDGDSDFEVLLVGDTRNALIPGDITYVVF--AG--------PL------GVLQGFS 183
            WW+N GDSD  +++V D  N     D    +F  AG        PL          GF 
Sbjct: 163 QWWYNRGDSDAVIVIVLDVTNRENQLDQVPRMFQLAGSRTQEEEQPLTWPSGNNAFSGFD 222

Query: 184 SDYIEKVYDLTEKEREVLLKSQPN-GLIFKLKDDQ--TLPEP-DCHSDLVFN----IYHT 243
            + I + + +  +  + L   + N G I +        +P P +   D + N     Y T
Sbjct: 223 PNIIAEAFKINIETAKQLQNQKDNRGNIIRANGPLHFVIPPPREWQQDGIANGIEETYCT 282

Query: 244 AP-----------DAVVKGGGSVTVLTEEKFPFIGKSGLTAVLEKLEANAVRSPVYVADP 303
           A            D      G ++ L     P +    L A+   L +  +  P + A+ 
Sbjct: 283 AKIHENIDDPERSDHFSTRAGRISTLNSLNLPVLRLVRLNALRGYLYSGGMVLPQWTANA 342

Query: 304 SVQLIYVASGSGRVQIAETFMRYQIDAEVKAGQLVLVPKYFAVGKMAGEEGLECFTIITT 357
              ++YV  G  ++Q+ +   +   + +V  GQ++++P+ FAV K AGE G E  +  T 
Sbjct: 343 HT-VLYVTGGQAKIQVVDDNGQSVFNEQVGQGQIIVIPQGFAVSKTAGETGFEWISFKTN 402

BLAST of Csa3G218170 vs. Swiss-Prot
Match: GLUB1_ORYSJ (Glutelin type-B 1 OS=Oryza sativa subsp. japonica GN=GluB1-A PE=2 SV=3)

HSP 1 Score: 72.8 bits (177), Expect = 8.6e-12
Identity = 38/130 (29.23%), Postives = 65/130 (50.00%), Query Frame = 1

Query: 204 GSVTVLTEEKFPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYVASGSGRVQIAETF 263
           G +T +  +KFP +    ++A    L  NA+ SP +  +    L+Y+  G  RVQ+   F
Sbjct: 331 GRITSVNSQKFPILNLIQMSATRVNLYQNAILSPFWNVNAH-SLVYMIQGRSRVQVVSNF 390

Query: 264 MRYQIDAEVKAGQLVLVPKYFAVGKMAGEEGLECFTIITTTHPLLEELGGKTSIFGAFSP 323
            +   D  ++ GQL+++P+++AV K A  EG +   I T  +  +  L GK S+F A   
Sbjct: 391 GKTVFDGVLRPGQLLIIPQHYAVLKKAEREGCQYIAIKTNANAFVSHLAGKNSVFRALPV 450

Query: 324 QVFEASFNLT 334
            V   ++ ++
Sbjct: 451 DVVANAYRIS 459


HSP 2 Score: 50.1 bits (118), Expect = 6.0e-05
Identity = 27/103 (26.21%), Postives = 45/103 (43.69%), Query Frame = 1

Query: 41  RLLLHPRGFAVPHNSDSSKVGYVLQGSGVAGIIFP-CKS--------------------- 100
           R ++ P+G  VP  ++   V Y++QG G  G+ FP C +                     
Sbjct: 85  RRVIQPQGLLVPRYTNIPGVVYIIQGRGSMGLTFPGCPATYQQQFQQFSSQGQSQSQKFR 144

Query: 101 --EEAAVRLKKGDVIPVPEGVTSWWFNDGDSDFEVLLVGDTRN 120
              +   + ++GD++ +P GV  W++NDGD+    + V D  N
Sbjct: 145 DEHQKIHQFRQGDIVALPAGVAHWFYNDGDAPIVAVYVYDVNN 187

BLAST of Csa3G218170 vs. Swiss-Prot
Match: GLYG3_SOYBN (Glycinin G3 OS=Glycine max GN=GY3 PE=3 SV=1)

HSP 1 Score: 72.0 bits (175), Expect = 1.5e-11
Identity = 60/211 (28.44%), Postives = 94/211 (44.55%), Query Frame = 1

Query: 147 EKVYDLTEKEREVLLKSQPNGLIFKLKDDQTLPEPDCHSDLVFNIYHTA-PDAVVKGGGS 206
           E+  D  EK++    +S+ NG+      D+T+    C   L  NI  T+ PD      GS
Sbjct: 278 EEKPDCDEKDKHCQSQSR-NGI------DETI----CTMRLRHNIGQTSSPDIFNPQAGS 337

Query: 207 VTVLTEEKFPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYVASGSGRVQIAETFMR 266
           +T  T   FP +    L+A    L  NA+  P Y  + +  +IY  +G   VQ+      
Sbjct: 338 ITTATSLDFPALSWLKLSAQFGSLRKNAMFVPHYNLNAN-SIIYALNGRALVQVVNCNGE 397

Query: 267 YQIDAEVKAGQLVLVPKYFAVGKMAGEEGLECFTIITTTHPLLEELGGKTSIFGAFSPQV 326
              D E++ GQ+++VP+ FAV   +  +  E  +  T   P +  L G  S+  A   +V
Sbjct: 398 RVFDGELQEGQVLIVPQNFAVAARSQSDNFEYVSFKTNDRPSIGNLAGANSLLNALPEEV 457

Query: 327 FEASFNLTAHFEKLFRSKITKSSPLVPPSDS 357
            + +FNL     +  ++     S LVPP +S
Sbjct: 458 IQQTFNLRRQQARQVKNN-NPFSFLVPPKES 475

BLAST of Csa3G218170 vs. Swiss-Prot
Match: GLUB2_ORYSJ (Glutelin type-B 2 OS=Oryza sativa subsp. japonica GN=GLUB2 PE=2 SV=2)

HSP 1 Score: 71.6 bits (174), Expect = 1.9e-11
Identity = 37/130 (28.46%), Postives = 65/130 (50.00%), Query Frame = 1

Query: 204 GSVTVLTEEKFPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYVASGSGRVQIAETF 263
           G ++ +  +KFP +    ++A    L  NA+ SP +  +    L+Y+  G  RVQ+   F
Sbjct: 327 GRISSVNSQKFPILNLIQMSATRVNLYQNAILSPFWNVNAH-SLVYMIQGQSRVQVVSNF 386

Query: 264 MRYQIDAEVKAGQLVLVPKYFAVGKMAGEEGLECFTIITTTHPLLEELGGKTSIFGAFSP 323
            +   D  ++ GQL+++P+++AV K A  EG +   I T  +  +  L GK S+F A   
Sbjct: 387 GKTVFDGVLRPGQLLIIPQHYAVLKKAEREGCQYIAIKTNANAFVSHLAGKNSVFRALPV 446

Query: 324 QVFEASFNLT 334
            V   ++ ++
Sbjct: 447 DVVANAYRIS 455


HSP 2 Score: 52.0 bits (123), Expect = 1.6e-05
Identity = 28/104 (26.92%), Postives = 47/104 (45.19%), Query Frame = 1

Query: 41  RLLLHPRGFAVPHNSDSSKVGYVLQGSGVAGIIFP-CKS--------------------- 100
           R ++ P+G  VP  S++  + Y++QG G  G+ FP C +                     
Sbjct: 85  RRVIQPQGLLVPRYSNTPGLVYIIQGRGSMGLTFPGCPATYQQQFQQFSSQGQSQSQKFR 144

Query: 101 --EEAAVRLKKGDVIPVPEGVTSWWFNDGDSDFEVLLVGDTRNA 121
              +   + ++GDV+ +P GV  W++NDGD+    + V D  N+
Sbjct: 145 DEHQKIHQFRQGDVVALPAGVAHWFYNDGDASVVAIYVYDINNS 188

BLAST of Csa3G218170 vs. TrEMBL
Match: A0A0A0LC21_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G218170 PE=4 SV=1)

HSP 1 Score: 715.7 bits (1846), Expect = 2.8e-203
Identity = 356/356 (100.00%), Postives = 356/356 (100.00%), Query Frame = 1

Query: 1   MELNLKPMDPSNFFTGEGGSFHKWFPSDFPIISQTKVGAGRLLLHPRGFAVPHNSDSSKV 60
           MELNLKPMDPSNFFTGEGGSFHKWFPSDFPIISQTKVGAGRLLLHPRGFAVPHNSDSSKV
Sbjct: 1   MELNLKPMDPSNFFTGEGGSFHKWFPSDFPIISQTKVGAGRLLLHPRGFAVPHNSDSSKV 60

Query: 61  GYVLQGSGVAGIIFPCKSEEAAVRLKKGDVIPVPEGVTSWWFNDGDSDFEVLLVGDTRNA 120
           GYVLQGSGVAGIIFPCKSEEAAVRLKKGDVIPVPEGVTSWWFNDGDSDFEVLLVGDTRNA
Sbjct: 61  GYVLQGSGVAGIIFPCKSEEAAVRLKKGDVIPVPEGVTSWWFNDGDSDFEVLLVGDTRNA 120

Query: 121 LIPGDITYVVFAGPLGVLQGFSSDYIEKVYDLTEKEREVLLKSQPNGLIFKLKDDQTLPE 180
           LIPGDITYVVFAGPLGVLQGFSSDYIEKVYDLTEKEREVLLKSQPNGLIFKLKDDQTLPE
Sbjct: 121 LIPGDITYVVFAGPLGVLQGFSSDYIEKVYDLTEKEREVLLKSQPNGLIFKLKDDQTLPE 180

Query: 181 PDCHSDLVFNIYHTAPDAVVKGGGSVTVLTEEKFPFIGKSGLTAVLEKLEANAVRSPVYV 240
           PDCHSDLVFNIYHTAPDAVVKGGGSVTVLTEEKFPFIGKSGLTAVLEKLEANAVRSPVYV
Sbjct: 181 PDCHSDLVFNIYHTAPDAVVKGGGSVTVLTEEKFPFIGKSGLTAVLEKLEANAVRSPVYV 240

Query: 241 ADPSVQLIYVASGSGRVQIAETFMRYQIDAEVKAGQLVLVPKYFAVGKMAGEEGLECFTI 300
           ADPSVQLIYVASGSGRVQIAETFMRYQIDAEVKAGQLVLVPKYFAVGKMAGEEGLECFTI
Sbjct: 241 ADPSVQLIYVASGSGRVQIAETFMRYQIDAEVKAGQLVLVPKYFAVGKMAGEEGLECFTI 300

Query: 301 ITTTHPLLEELGGKTSIFGAFSPQVFEASFNLTAHFEKLFRSKITKSSPLVPPSDS 357
           ITTTHPLLEELGGKTSIFGAFSPQVFEASFNLTAHFEKLFRSKITKSSPLVPPSDS
Sbjct: 301 ITTTHPLLEELGGKTSIFGAFSPQVFEASFNLTAHFEKLFRSKITKSSPLVPPSDS 356

BLAST of Csa3G218170 vs. TrEMBL
Match: A0A0A0L6K0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G218160 PE=4 SV=1)

HSP 1 Score: 429.9 bits (1104), Expect = 3.1e-117
Identity = 204/342 (59.65%), Postives = 264/342 (77.19%), Query Frame = 1

Query: 5   LKPMDPSNFFTGEGGSFHKWFPSDFPIISQTKVGAGRLLLHPRGFAVPHNSDSSKVGYVL 64
           ++ M+P  FF GEGGS+HKW PSD+P+++QT V  GRLLL PRGFAVPH SD SK GYVL
Sbjct: 1   MEAMNPKPFFEGEGGSYHKWLPSDYPLLAQTNVAGGRLLLRPRGFAVPHYSDCSKFGYVL 60

Query: 65  QGS-GVAGIIFPCKSEEAAVRLKKGDVIPVPEGVTSWWFNDGDSDFEVLLVGDTRNALIP 124
           QG  GV G +FP K  E  ++LKKGD+IPVP GVTSWWFNDGDSD E++ +G+T+ A +P
Sbjct: 61  QGEDGVTGFVFPKKCNEVVIKLKKGDLIPVPAGVTSWWFNDGDSDLEIIFLGETKRAHVP 120

Query: 125 GDITYVVFAGPLGVLQGFSSDYIEKVYDLTEKEREVLLKSQPNGLIFKLKDDQTLPEPDC 184
           GDITY + +GP G+LQGF+ +Y++K   L ++E    LKSQPN LIF ++  Q+LP+P  
Sbjct: 121 GDITYFILSGPRGLLQGFTPEYVQKSCSLNQEETNTFLKSQPNVLIFTVQPSQSLPKPHK 180

Query: 185 HSDLVFNIYHTAPDAVVK-GGGSVTVLTEEKFPFIGKSGLTAVLEKLEANAVRSPVYVAD 244
           +S LV+NI   APD   K G  +VT++TE  FPFIG++GLT VLEKL+ANA+RSPVY+A+
Sbjct: 181 YSKLVYNIDAAAPDNRAKVGDAAVTMVTESTFPFIGQTGLTPVLEKLDANAIRSPVYIAE 240

Query: 245 PSVQLIYVASGSGRVQIAETFMRYQIDAEVKAGQLVLVPKYFAVGKMAGEEGLECFTIIT 304
           PS QLIYV  GSG++Q+     ++  DA+VK GQL+LVP+YFAVGK+AGEEGLEC ++I 
Sbjct: 241 PSDQLIYVTKGSGKIQVVGFSSKF--DADVKTGQLILVPRYFAVGKIAGEEGLECISMIV 300

Query: 305 TTHPLLEELGGKTSIFGAFSPQVFEASFNLTAHFEKLFRSKI 345
            THP++EEL GKTS+  A S +VF+ SFN+TA FEKLFRSK+
Sbjct: 301 ATHPMVEELAGKTSVLEALSSEVFQVSFNVTAEFEKLFRSKV 340

BLAST of Csa3G218170 vs. TrEMBL
Match: A0A0A0K550_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G337100 PE=4 SV=1)

HSP 1 Score: 372.1 bits (954), Expect = 7.7e-100
Identity = 184/345 (53.33%), Postives = 246/345 (71.30%), Query Frame = 1

Query: 2   ELNLKPMDPSNFFTGEGGSFHKWFPSDFPIISQTKVGAGRLLLHPRGFAVPHNSDSSKVG 61
           E NLK M+P   F G GGS++KW+PSD+P+++Q+KVGAG LLLHPRGFA+ H SD+SKVG
Sbjct: 3   EQNLKAMNPRKHFEGVGGSYNKWYPSDYPLLAQSKVGAGMLLLHPRGFAILHYSDASKVG 62

Query: 62  YVLQGS-GVAGIIFPCKSEEAAVRLKKGDVIPVPEGVTSWWFNDGDSDFEVLLVGDTRNA 121
           YVL+G+ GV G IFP  S E  ++LKKGD+IPVP GVTSWW+NDGDSD E+  +G+T+ A
Sbjct: 63  YVLRGNNGVTGFIFPNTSNEEVIKLKKGDIIPVPTGVTSWWYNDGDSDLEIAFLGETKYA 122

Query: 122 LIPGDITYVVFAGPLGVLQGFSSDYIEKVYDLTEKEREVLLKSQPNGLIFKLKDDQTLPE 181
            +PGDI+Y + +GP G+LQGFS DY+ K ++L E +   LL SQ NG+IFKL++ QTLP 
Sbjct: 123 HVPGDISYYILSGPQGILQGFSQDYVAKTFNLNEMDTSTLLNSQQNGMIFKLQEGQTLPT 182

Query: 182 PDCHSDLVFNIYHTAPDAVVKGGGSVTVLTEEKFPFIGKSGLTAVLEKLEANAVRSPVYV 241
           P   +  V+N+ +   D  +K       ++E +FPFIG++GL  V+E+L  N VRSPV +
Sbjct: 183 PTKDTKFVYNLDNY--DFFMK-------VSESEFPFIGETGLAVVVERLGPNVVRSPVLL 242

Query: 242 ADPSVQLIYVASGSGRVQIAETFMRYQIDAEVKAGQLVLVPKYFAVGKMAGEEGLECFTI 301
             P+ QLIYVA GSG VQI       +I+  V++GQL+ VPKYFA GK+A E+G+E F+I
Sbjct: 243 VSPADQLIYVARGSGTVQIVGLSSSSKIELHVESGQLIFVPKYFAAGKIAAEQGMEFFSI 302

Query: 302 ITTTHPLLEELGGKTSIFGAFSPQVFEASFNLTAHFEKLFRSKIT 346
           +T    L+ EL GKTS+  A S +V   SFN+TA FEK+ RS  T
Sbjct: 303 LTAKLGLVGELKGKTSVMEALSAEVIAVSFNITAEFEKVLRSNTT 338

BLAST of Csa3G218170 vs. TrEMBL
Match: D7SZX9_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_12s0034g01870 PE=4 SV=1)

HSP 1 Score: 364.0 bits (933), Expect = 2.1e-97
Identity = 176/358 (49.16%), Postives = 248/358 (69.27%), Query Frame = 1

Query: 1   MELNLKPMDPSNFFTGEGGSFHKWFPSDFPIISQTKVGAGRLLLHPRGFAVPHNSDSSKV 60
           MELNL P      F GEGG+++ W  +++ ++ + KVG GRL+L PRGFA+PH +DS+K+
Sbjct: 1   MELNLAPKFAQKIFEGEGGTYYSWSSAEYELLKEAKVGGGRLVLGPRGFALPHYADSNKI 60

Query: 61  GYVLQGS-GVAGIIFPCKSEEAAVRLKKGDVIPVPEGVTSWWFNDGDSDFEVLLVGDTRN 120
           GYVLQGS GV G++FP  SEE  ++LK+GD+IPVP G  SWW+NDGDS+  ++ +G+T  
Sbjct: 61  GYVLQGSCGVVGMVFPEASEEVVLKLKEGDIIPVPSGAVSWWYNDGDSELVIVFLGETSK 120

Query: 121 ALIPGDITYVVFAGPLGVLQGFSSDYIEKVYDLTEKEREVLLKSQPNGLIFKLKDDQTLP 180
           A +PG+ TY +  G  G+L GFS+++  + Y+++ +E E L KSQ   L+ KL + Q +P
Sbjct: 121 AYVPGEFTYFLLTGTQGILGGFSTEFNSRAYNISNEEAEKLAKSQTGVLLIKLPEGQKMP 180

Query: 181 EPDCHSD--LVFNIYHTAPDAVVKGGGSVTVLTEEKFPFIGKSGLTAVLEKLEANAVRSP 240
            P  +S   LV+NI    PD  V+  G +T LT +KFPF+G+ GL+A L KL+ANA+ SP
Sbjct: 181 HPCKNSTDKLVYNIDAALPDIHVQNAGLLTALTAKKFPFLGEVGLSATLVKLDANAMSSP 240

Query: 241 VYVADPSVQLIYVASGSGRVQIAETFMRYQIDAEVKAGQLVLVPKYFAVGKMAGEEGLEC 300
           VY AD SVQ+IYVA GSGR+Q+        +D +VKAG L +VP++F    +A  EG+E 
Sbjct: 241 VYAADSSVQVIYVAKGSGRIQVVGINGERALDTKVKAGHLYVVPRFFVASTIADGEGMEY 300

Query: 301 FTIITTTHPLLEELGGKTSIFGAFSPQVFEASFNLTAHFEKLFRSKITKSSPLVPPSD 356
           F++IT T P+  E  GKTS++GA SPQV +AS N+   FE+LFR+KI KS+ LVPP +
Sbjct: 301 FSLITATQPVFGEFTGKTSVWGALSPQVLQASLNVAPEFEQLFRAKIKKSTILVPPQN 358

BLAST of Csa3G218170 vs. TrEMBL
Match: A0A151UBW6_CAJCA (Glutelin type-A 2 OS=Cajanus cajan GN=KK1_021060 PE=4 SV=1)

HSP 1 Score: 362.8 bits (930), Expect = 4.6e-97
Identity = 179/357 (50.14%), Postives = 248/357 (69.47%), Query Frame = 1

Query: 1   MELNLKPMDPSNFFTGEGGSFHKWFPSDFPIISQTKVGAGRLLLHPRGFAVPHNSDSSKV 60
           MEL+L P      F G+GG ++ W  S  P++++  V AGRLLLHPRGFA+PH +DSSK+
Sbjct: 1   MELDLTPRRAETMFEGDGGGYYTWSSSQVPLLAKNNVCAGRLLLHPRGFALPHYADSSKI 60

Query: 61  GYVLQGS-GVAGIIFPCKSEEAAVRLKKGDVIPVPEGVTSWWFNDGDSDFEVLLVGDTRN 120
           GYV+QG+ GV G++ P   EE  ++LKKGDVIPVP G  SWWFNDGDSDF ++ +G+T  
Sbjct: 61  GYVIQGTDGVVGLVLPNTGEEVVLKLKKGDVIPVPIGSVSWWFNDGDSDFIIVFLGETSK 120

Query: 121 ALIPGDITYVVFAGPLGVLQGFSSDYIEKVYDLTEKEREVLLKSQPNGLIFKL-KDDQTL 180
           ALIPG+I+Y    G LG++ GFS++   KVY L + E E L KSQ   LI KL K  Q +
Sbjct: 121 ALIPGEISYFFLTGALGIIGGFSTELTGKVYGLDKDEVEKLTKSQIGVLIIKLDKSHQHI 180

Query: 181 PEP--DCHSDLVFNIYHTAPDAVVKGGGSVTVLTEEKFPFIGKSGLTAVLEKLEANAVRS 240
           P+P  D    LV+NI    P+ VV+  G V  L EE FPFIG  GL+ +  KLE  A+++
Sbjct: 181 PKPQIDITKKLVYNIDDALPENVVENAGLVKTLKEEDFPFIGDVGLSVIRVKLEPGAIKA 240

Query: 241 PVYVADPSVQLIYVASGSGRVQIAETFMRYQIDAEVKAGQLVLVPKYFAVGKMAGEEGLE 300
           P Y   P+VQLIY+A GSG+++I ++  +  +D +V+ G L++VP+YF + ++AGEEG+E
Sbjct: 241 PSYSISPTVQLIYIARGSGKIEIVDSNGKRALDTKVEVGHLLVVPQYFVLAEIAGEEGIE 300

Query: 301 CFTIITTTHPLLEELGGKTSIFGAFSPQVFEASFNLTAHFEKLFRSKITKSSPLVPP 354
           C++I+TTT PL EELGG+ SI+GA SP + + + N+ + F+KLF SKI KS+ L+PP
Sbjct: 301 CYSIVTTTKPLFEELGGRRSIWGALSPTLEQVALNVDSDFQKLFMSKIKKSTNLIPP 357

BLAST of Csa3G218170 vs. TAIR10
Match: AT1G07750.1 (AT1G07750.1 RmlC-like cupins superfamily protein)

HSP 1 Score: 269.6 bits (688), Expect = 2.7e-72
Identity = 137/357 (38.38%), Postives = 206/357 (57.70%), Query Frame = 1

Query: 1   MELNLKPMDPSNFFTGEGGSFHKWFPSDFPIISQTKVGAGRLLLHPRGFAVPHNSDSSKV 60
           MEL+L P  P   + G+GGS+  W P + P++ Q  +GA +L L   GFAVP  SDSSKV
Sbjct: 1   MELDLTPKLPKKVYGGDGGSYSAWCPEELPMLKQGNIGAAKLALEKNGFAVPRYSDSSKV 60

Query: 61  GYVLQGSGVAGIIFPCKSEEAAVRLKKGDVIPVPEGVTSWWFNDGDSDFEVLLVGDTRNA 120
            YVLQGSG AGI+ P K EE  + +K+GD I +P GV +WWFN+ D +  +L +G+T   
Sbjct: 61  AYVLQGSGTAGIVLPEK-EEKVIAIKQGDSIALPFGVVTWWFNNEDPELVILFLGETHKG 120

Query: 121 LIPGDITYVVFAGPLGVLQGFSSDYIEKVYDLTEKEREVLLKSQPNGLIFKLKDDQTLPE 180
              G  T     G  G+  GFS++++ + +DL E   + L+ SQ    I KL     +P+
Sbjct: 121 HKAGQFTEFYLTGTNGIFTGFSTEFVGRAWDLDENTVKKLVGSQTGNGIVKLDAGFKMPQ 180

Query: 181 P--DCHSDLVFNIYHTAPDAVVKGGGSVTVLTEEKFPFIGKSGLTAVLEKLEANAVRSPV 240
           P  +  +  V N      D  +K GG V VL  +  P +G+ G  A L +++A+++ SP 
Sbjct: 181 PKEENRAGFVLNCLEAPLDVDIKDGGRVVVLNTKNLPLVGEVGFGADLVRIDAHSMCSPG 240

Query: 241 YVADPSVQLIYVASGSGRVQIAETFMRYQIDAEVKAGQLVLVPKYFAVGKMAGEEGLECF 300
           +  D ++Q+ Y+  GSGRVQ+     +  ++  +KAG L +VP++F V K+A  +G+  F
Sbjct: 241 FSCDSALQVTYIVGGSGRVQVVGGDGKRVLETHIKAGSLFIVPRFFVVSKIADADGMSWF 300

Query: 301 TIITTTHPLLEELGGKTSIFGAFSPQVFEASFNLTAHFEKLFRSKITKSSPLVPPSD 356
           +I+TT  P+   L G TS++ + SP+V +A+F +    EK FRS  T S+   PPS+
Sbjct: 301 SIVTTPDPIFTHLAGNTSVWKSLSPEVLQAAFKVAPEVEKSFRSTRTSSAIFFPPSN 356

BLAST of Csa3G218170 vs. TAIR10
Match: AT2G28680.1 (AT2G28680.1 RmlC-like cupins superfamily protein)

HSP 1 Score: 261.5 bits (667), Expect = 7.4e-70
Identity = 136/357 (38.10%), Postives = 201/357 (56.30%), Query Frame = 1

Query: 1   MELNLKPMDPSNFFTGEGGSFHKWFPSDFPIISQTKVGAGRLLLHPRGFAVPHNSDSSKV 60
           MEL+L P  P   + G+GGS+  W P + P++    +GA +L L   G A+P  SDS KV
Sbjct: 1   MELDLSPRLPKKVYGGDGGSYFAWCPEELPMLRDGNIGASKLALEKYGLALPRYSDSPKV 60

Query: 61  GYVLQGSGVAGIIFPCKSEEAAVRLKKGDVIPVPEGVTSWWFNDGDSDFEVLLVGDTRNA 120
            YVLQG+G AGI+ P K EE  + +KKGD I +P GV +WWFN+ D++  VL +G+T   
Sbjct: 61  AYVLQGAGTAGIVLPEK-EEKVIAIKKGDSIALPFGVVTWWFNNEDTELVVLFLGETHKG 120

Query: 121 LIPGDITYVVFAGPLGVLQGFSSDYIEKVYDLTEKEREVLLKSQPNGLIFKLKDDQTLPE 180
              G  T     G  G+  GFS++++ + +DL E   + L+ SQ    I K+     +PE
Sbjct: 121 HKAGQFTDFYLTGSNGIFTGFSTEFVGRAWDLDETTVKKLVGSQTGNGIVKVDASLKMPE 180

Query: 181 PDC--HSDLVFNIYHTAPDAVVKGGGSVTVLTEEKFPFIGKSGLTAVLEKLEANAVRSPV 240
           P        V N      D  +K GG V VL  +  P +G+ G  A L +++ +++ SP 
Sbjct: 181 PKKGDRKGFVLNCLEAPLDVDIKDGGRVVVLNTKNLPLVGEVGFGADLVRIDGHSMCSPG 240

Query: 241 YVADPSVQLIYVASGSGRVQIAETFMRYQIDAEVKAGQLVLVPKYFAVGKMAGEEGLECF 300
           +  D ++Q+ Y+  GSGRVQI     +  ++  VKAG L +VP++F V K+A  +GL  F
Sbjct: 241 FSCDSALQVTYIVGGSGRVQIVGADGKRVLETHVKAGVLFIVPRFFVVSKIADSDGLSWF 300

Query: 301 TIITTTHPLLEELGGKTSIFGAFSPQVFEASFNLTAHFEKLFRSKITKSSPLVPPSD 356
           +I+TT  P+   L G+TS++ A SP+V +A+F +    EK FRSK T  +    PS+
Sbjct: 301 SIVTTPDPIFTHLAGRTSVWKALSPEVLQAAFKVDPEVEKAFRSKRTSDAIFFSPSN 356

BLAST of Csa3G218170 vs. TAIR10
Match: AT1G03890.1 (AT1G03890.1 RmlC-like cupins superfamily protein)

HSP 1 Score: 75.1 bits (183), Expect = 9.8e-14
Identity = 95/413 (23.00%), Postives = 153/413 (37.05%), Query Frame = 1

Query: 4   NLKPMDPSNFFTGEGGSFHKWFPSDFPIISQTKVGAGRLLLHPRGFAVPHNSDSSKVGYV 63
           +L P   + F   E G    W     P +    V   R+ L P    +P       + YV
Sbjct: 43  SLAPAQATKF---EAGQMEVWDHMS-PELRCAGVTVARITLQPNSIFLPAFFSPPALAYV 102

Query: 64  LQG---------------------SGVAGIIFPCKSEE----AAVRLKKGDVIPVPEGVT 123
           +QG                     SG  G   P +  E         ++GDV     GV+
Sbjct: 103 VQGEGVMGTIASGCPETFAEVEGSSGRGGGGDPGRRFEDMHQKLENFRRGDVFASLAGVS 162

Query: 124 SWWFNDGDSDFEVLLVGDTRNALIPGDITYVVF--AG--------PL------GVLQGFS 183
            WW+N GDSD  +++V D  N     D    +F  AG        PL          GF 
Sbjct: 163 QWWYNRGDSDAVIVIVLDVTNRENQLDQVPRMFQLAGSRTQEEEQPLTWPSGNNAFSGFD 222

Query: 184 SDYIEKVYDLTEKEREVLLKSQPN-GLIFKLKDDQ--TLPEP-DCHSDLVFN----IYHT 243
            + I + + +  +  + L   + N G I +        +P P +   D + N     Y T
Sbjct: 223 PNIIAEAFKINIETAKQLQNQKDNRGNIIRANGPLHFVIPPPREWQQDGIANGIEETYCT 282

Query: 244 AP-----------DAVVKGGGSVTVLTEEKFPFIGKSGLTAVLEKLEANAVRSPVYVADP 303
           A            D      G ++ L     P +    L A+   L +  +  P + A+ 
Sbjct: 283 AKIHENIDDPERSDHFSTRAGRISTLNSLNLPVLRLVRLNALRGYLYSGGMVLPQWTANA 342

Query: 304 SVQLIYVASGSGRVQIAETFMRYQIDAEVKAGQLVLVPKYFAVGKMAGEEGLECFTIITT 357
              ++YV  G  ++Q+ +   +   + +V  GQ++++P+ FAV K AGE G E  +  T 
Sbjct: 343 HT-VLYVTGGQAKIQVVDDNGQSVFNEQVGQGQIIVIPQGFAVSKTAGETGFEWISFKTN 402

BLAST of Csa3G218170 vs. TAIR10
Match: AT4G28520.1 (AT4G28520.1 cruciferin 3)

HSP 1 Score: 56.6 bits (135), Expect = 3.6e-08
Identity = 34/130 (26.15%), Postives = 61/130 (46.92%), Query Frame = 1

Query: 204 GSVTVLTEEKFPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYVASGSGRVQIAETF 263
           G VT +     P +    L+A    L+ NA+  P Y  + + +++Y   G GR+Q+    
Sbjct: 362 GRVTSVNSYTLPILEYVRLSATRGVLQGNAMVLPKYNMNAN-EILYCTGGQGRIQVVNDN 421

Query: 264 MRYQIDAEVKAGQLVLVPKYFAVGKMAGEEGLECFTIITTTHPLLEELGGKTSIFGAFSP 323
            +  +D +V+ GQLV++P+ FA    +     E  +  T  + ++  L G+TS+  A   
Sbjct: 422 GQNVLDQQVQKGQLVVIPQGFAYVVQSHGNKFEWISFKTNENAMISTLAGRTSLLRALPL 481

Query: 324 QVFEASFNLT 334
           +V    F ++
Sbjct: 482 EVISNGFQIS 490


HSP 2 Score: 35.8 bits (81), Expect = 6.6e-02
Identity = 19/72 (26.39%), Postives = 30/72 (41.67%), Query Frame = 1

Query: 4   NLKPMDPSNFFTGEGGSFHKWFPSDFPIISQTKVGAGRLLLHPRGFAVPHNSDSSKVGYV 63
           NL  +  +     E G    W   + P +    V   R ++   G  +P    S K+ YV
Sbjct: 41  NLDVLQATETIKSEAGQIEYW-DHNHPQLRCVGVSVARYVIEQGGLYLPTFFTSPKISYV 100

Query: 64  LQGSGVAGIIFP 76
           +QG+G++G + P
Sbjct: 101 VQGTGISGRVVP 111

BLAST of Csa3G218170 vs. NCBI nr
Match: gi|700202448|gb|KGN57581.1| (hypothetical protein Csa_3G218170 [Cucumis sativus])

HSP 1 Score: 715.7 bits (1846), Expect = 4.1e-203
Identity = 356/356 (100.00%), Postives = 356/356 (100.00%), Query Frame = 1

Query: 1   MELNLKPMDPSNFFTGEGGSFHKWFPSDFPIISQTKVGAGRLLLHPRGFAVPHNSDSSKV 60
           MELNLKPMDPSNFFTGEGGSFHKWFPSDFPIISQTKVGAGRLLLHPRGFAVPHNSDSSKV
Sbjct: 1   MELNLKPMDPSNFFTGEGGSFHKWFPSDFPIISQTKVGAGRLLLHPRGFAVPHNSDSSKV 60

Query: 61  GYVLQGSGVAGIIFPCKSEEAAVRLKKGDVIPVPEGVTSWWFNDGDSDFEVLLVGDTRNA 120
           GYVLQGSGVAGIIFPCKSEEAAVRLKKGDVIPVPEGVTSWWFNDGDSDFEVLLVGDTRNA
Sbjct: 61  GYVLQGSGVAGIIFPCKSEEAAVRLKKGDVIPVPEGVTSWWFNDGDSDFEVLLVGDTRNA 120

Query: 121 LIPGDITYVVFAGPLGVLQGFSSDYIEKVYDLTEKEREVLLKSQPNGLIFKLKDDQTLPE 180
           LIPGDITYVVFAGPLGVLQGFSSDYIEKVYDLTEKEREVLLKSQPNGLIFKLKDDQTLPE
Sbjct: 121 LIPGDITYVVFAGPLGVLQGFSSDYIEKVYDLTEKEREVLLKSQPNGLIFKLKDDQTLPE 180

Query: 181 PDCHSDLVFNIYHTAPDAVVKGGGSVTVLTEEKFPFIGKSGLTAVLEKLEANAVRSPVYV 240
           PDCHSDLVFNIYHTAPDAVVKGGGSVTVLTEEKFPFIGKSGLTAVLEKLEANAVRSPVYV
Sbjct: 181 PDCHSDLVFNIYHTAPDAVVKGGGSVTVLTEEKFPFIGKSGLTAVLEKLEANAVRSPVYV 240

Query: 241 ADPSVQLIYVASGSGRVQIAETFMRYQIDAEVKAGQLVLVPKYFAVGKMAGEEGLECFTI 300
           ADPSVQLIYVASGSGRVQIAETFMRYQIDAEVKAGQLVLVPKYFAVGKMAGEEGLECFTI
Sbjct: 241 ADPSVQLIYVASGSGRVQIAETFMRYQIDAEVKAGQLVLVPKYFAVGKMAGEEGLECFTI 300

Query: 301 ITTTHPLLEELGGKTSIFGAFSPQVFEASFNLTAHFEKLFRSKITKSSPLVPPSDS 357
           ITTTHPLLEELGGKTSIFGAFSPQVFEASFNLTAHFEKLFRSKITKSSPLVPPSDS
Sbjct: 301 ITTTHPLLEELGGKTSIFGAFSPQVFEASFNLTAHFEKLFRSKITKSSPLVPPSDS 356

BLAST of Csa3G218170 vs. NCBI nr
Match: gi|659112131|ref|XP_008456077.1| (PREDICTED: glutelin type-B 2-like [Cucumis melo])

HSP 1 Score: 682.2 bits (1759), Expect = 5.0e-193
Identity = 336/356 (94.38%), Postives = 348/356 (97.75%), Query Frame = 1

Query: 1   MELNLKPMDPSNFFTGEGGSFHKWFPSDFPIISQTKVGAGRLLLHPRGFAVPHNSDSSKV 60
           MEL+LKPMDP+NFFTGEGGSFHKWFPSD PII QTKVGAGRLLLHPRGFAVPHNSDSSKV
Sbjct: 1   MELDLKPMDPTNFFTGEGGSFHKWFPSDHPIIPQTKVGAGRLLLHPRGFAVPHNSDSSKV 60

Query: 61  GYVLQGSGVAGIIFPCKSEEAAVRLKKGDVIPVPEGVTSWWFNDGDSDFEVLLVGDTRNA 120
           GYVLQGSGVAGI+FPCKSEEA VRLKKGDVIPVPEGVTSWWFNDGDSDFEVLLVGDTRNA
Sbjct: 61  GYVLQGSGVAGIVFPCKSEEAVVRLKKGDVIPVPEGVTSWWFNDGDSDFEVLLVGDTRNA 120

Query: 121 LIPGDITYVVFAGPLGVLQGFSSDYIEKVYDLTEKEREVLLKSQPNGLIFKLKDDQTLPE 180
           LIPGDITYVVFAGPLGVLQGFSSDYIEKVYDLTE+EREVLLKSQPNGLIFKLKDDQTLPE
Sbjct: 121 LIPGDITYVVFAGPLGVLQGFSSDYIEKVYDLTEEEREVLLKSQPNGLIFKLKDDQTLPE 180

Query: 181 PDCHSDLVFNIYHTAPDAVVKGGGSVTVLTEEKFPFIGKSGLTAVLEKLEANAVRSPVYV 240
           PDCHSDLVFNIY  APD+VVKGGG+VTVLTEEKFPFIGKSGLTAVLEKLEANAVRSPVYV
Sbjct: 181 PDCHSDLVFNIYDAAPDSVVKGGGTVTVLTEEKFPFIGKSGLTAVLEKLEANAVRSPVYV 240

Query: 241 ADPSVQLIYVASGSGRVQIAETFMRYQIDAEVKAGQLVLVPKYFAVGKMAGEEGLECFTI 300
           ADPSVQLIYVASGSGR+QIAETFMR QIDAEVKAGQL+LVPKYFAVGKMAGEEGLECFTI
Sbjct: 241 ADPSVQLIYVASGSGRIQIAETFMRKQIDAEVKAGQLILVPKYFAVGKMAGEEGLECFTI 300

Query: 301 ITTTHPLLEELGGKTSIFGAFSPQVFEASFNLTAHFEKLFRSKITKSSPLVPPSDS 357
           ITTTHPLLEELGGK+SIFGAFSPQVF+ASFN+TAHFEKL  SKITKSSPLVPPSD+
Sbjct: 301 ITTTHPLLEELGGKSSIFGAFSPQVFQASFNVTAHFEKLLISKITKSSPLVPPSDN 356

BLAST of Csa3G218170 vs. NCBI nr
Match: gi|778680244|ref|XP_011651276.1| (PREDICTED: legumin J-like [Cucumis sativus])

HSP 1 Score: 613.2 bits (1580), Expect = 2.8e-172
Identity = 304/304 (100.00%), Postives = 304/304 (100.00%), Query Frame = 1

Query: 1   MELNLKPMDPSNFFTGEGGSFHKWFPSDFPIISQTKVGAGRLLLHPRGFAVPHNSDSSKV 60
           MELNLKPMDPSNFFTGEGGSFHKWFPSDFPIISQTKVGAGRLLLHPRGFAVPHNSDSSKV
Sbjct: 1   MELNLKPMDPSNFFTGEGGSFHKWFPSDFPIISQTKVGAGRLLLHPRGFAVPHNSDSSKV 60

Query: 61  GYVLQGSGVAGIIFPCKSEEAAVRLKKGDVIPVPEGVTSWWFNDGDSDFEVLLVGDTRNA 120
           GYVLQGSGVAGIIFPCKSEEAAVRLKKGDVIPVPEGVTSWWFNDGDSDFEVLLVGDTRNA
Sbjct: 61  GYVLQGSGVAGIIFPCKSEEAAVRLKKGDVIPVPEGVTSWWFNDGDSDFEVLLVGDTRNA 120

Query: 121 LIPGDITYVVFAGPLGVLQGFSSDYIEKVYDLTEKEREVLLKSQPNGLIFKLKDDQTLPE 180
           LIPGDITYVVFAGPLGVLQGFSSDYIEKVYDLTEKEREVLLKSQPNGLIFKLKDDQTLPE
Sbjct: 121 LIPGDITYVVFAGPLGVLQGFSSDYIEKVYDLTEKEREVLLKSQPNGLIFKLKDDQTLPE 180

Query: 181 PDCHSDLVFNIYHTAPDAVVKGGGSVTVLTEEKFPFIGKSGLTAVLEKLEANAVRSPVYV 240
           PDCHSDLVFNIYHTAPDAVVKGGGSVTVLTEEKFPFIGKSGLTAVLEKLEANAVRSPVYV
Sbjct: 181 PDCHSDLVFNIYHTAPDAVVKGGGSVTVLTEEKFPFIGKSGLTAVLEKLEANAVRSPVYV 240

Query: 241 ADPSVQLIYVASGSGRVQIAETFMRYQIDAEVKAGQLVLVPKYFAVGKMAGEEGLECFTI 300
           ADPSVQLIYVASGSGRVQIAETFMRYQIDAEVKAGQLVLVPKYFAVGKMAGEEGLECFTI
Sbjct: 241 ADPSVQLIYVASGSGRVQIAETFMRYQIDAEVKAGQLVLVPKYFAVGKMAGEEGLECFTI 300

Query: 301 ITTT 305
           ITTT
Sbjct: 301 ITTT 304

BLAST of Csa3G218170 vs. NCBI nr
Match: gi|659112129|ref|XP_008456076.1| (PREDICTED: glutelin type-A 2-like [Cucumis melo])

HSP 1 Score: 430.3 bits (1105), Expect = 3.4e-117
Identity = 204/342 (59.65%), Postives = 265/342 (77.49%), Query Frame = 1

Query: 5   LKPMDPSNFFTGEGGSFHKWFPSDFPIISQTKVGAGRLLLHPRGFAVPHNSDSSKVGYVL 64
           ++ M+P  FF GEGGS+ KW PSD+P+++QT V  GRLLL PRGFAVPH +D SK GYVL
Sbjct: 1   MEAMNPKPFFEGEGGSYLKWLPSDYPLLAQTNVAGGRLLLRPRGFAVPHYADCSKFGYVL 60

Query: 65  QGS-GVAGIIFPCKSEEAAVRLKKGDVIPVPEGVTSWWFNDGDSDFEVLLVGDTRNALIP 124
           QG  GV G +FP K  E  ++LKKGD+IPVP G+TSWWFNDGDSD E++ +G+T+NA +P
Sbjct: 61  QGEDGVTGFVFPNKCNEVVMKLKKGDLIPVPSGITSWWFNDGDSDLEIIFLGETKNAHVP 120

Query: 125 GDITYVVFAGPLGVLQGFSSDYIEKVYDLTEKEREVLLKSQPNGLIFKLKDDQTLPEPDC 184
           GDITY + +GP G+LQGF+ +Y++K Y L+++E    LKSQ N LIF ++  Q+LP+P  
Sbjct: 121 GDITYFILSGPRGLLQGFAPEYVQKSYSLSQEETNKFLKSQSNVLIFTVQPSQSLPKPHK 180

Query: 185 HSDLVFNIYHTAPDAVVK-GGGSVTVLTEEKFPFIGKSGLTAVLEKLEANAVRSPVYVAD 244
           HS LV+NI    PD   K G  +VT++TE  FPFIG++GLTAVLEKL+ANA+RSPVY+A+
Sbjct: 181 HSKLVYNIDAAVPDNRAKVGAAAVTMVTESTFPFIGQTGLTAVLEKLDANAIRSPVYIAE 240

Query: 245 PSVQLIYVASGSGRVQIAETFMRYQIDAEVKAGQLVLVPKYFAVGKMAGEEGLECFTIIT 304
           PS QLIYV  GSG++Q+     ++  DA+VK GQL+LVP+YFAVGKMAGEEGLEC ++I 
Sbjct: 241 PSDQLIYVTKGSGKIQVVGFSSKF--DADVKIGQLILVPRYFAVGKMAGEEGLECISMIV 300

Query: 305 TTHPLLEELGGKTSIFGAFSPQVFEASFNLTAHFEKLFRSKI 345
            THP++EEL GKTS+  A S +VF+ SFN+TA FEKLFRSK+
Sbjct: 301 ATHPMVEELAGKTSVLEALSSEVFQVSFNVTAEFEKLFRSKV 340

BLAST of Csa3G218170 vs. NCBI nr
Match: gi|449467587|ref|XP_004151504.1| (PREDICTED: legumin J [Cucumis sativus])

HSP 1 Score: 429.9 bits (1104), Expect = 4.4e-117
Identity = 204/342 (59.65%), Postives = 264/342 (77.19%), Query Frame = 1

Query: 5   LKPMDPSNFFTGEGGSFHKWFPSDFPIISQTKVGAGRLLLHPRGFAVPHNSDSSKVGYVL 64
           ++ M+P  FF GEGGS+HKW PSD+P+++QT V  GRLLL PRGFAVPH SD SK GYVL
Sbjct: 1   MEAMNPKPFFEGEGGSYHKWLPSDYPLLAQTNVAGGRLLLRPRGFAVPHYSDCSKFGYVL 60

Query: 65  QGS-GVAGIIFPCKSEEAAVRLKKGDVIPVPEGVTSWWFNDGDSDFEVLLVGDTRNALIP 124
           QG  GV G +FP K  E  ++LKKGD+IPVP GVTSWWFNDGDSD E++ +G+T+ A +P
Sbjct: 61  QGEDGVTGFVFPKKCNEVVIKLKKGDLIPVPAGVTSWWFNDGDSDLEIIFLGETKRAHVP 120

Query: 125 GDITYVVFAGPLGVLQGFSSDYIEKVYDLTEKEREVLLKSQPNGLIFKLKDDQTLPEPDC 184
           GDITY + +GP G+LQGF+ +Y++K   L ++E    LKSQPN LIF ++  Q+LP+P  
Sbjct: 121 GDITYFILSGPRGLLQGFTPEYVQKSCSLNQEETNTFLKSQPNVLIFTVQPSQSLPKPHK 180

Query: 185 HSDLVFNIYHTAPDAVVK-GGGSVTVLTEEKFPFIGKSGLTAVLEKLEANAVRSPVYVAD 244
           +S LV+NI   APD   K G  +VT++TE  FPFIG++GLT VLEKL+ANA+RSPVY+A+
Sbjct: 181 YSKLVYNIDAAAPDNRAKVGDAAVTMVTESTFPFIGQTGLTPVLEKLDANAIRSPVYIAE 240

Query: 245 PSVQLIYVASGSGRVQIAETFMRYQIDAEVKAGQLVLVPKYFAVGKMAGEEGLECFTIIT 304
           PS QLIYV  GSG++Q+     ++  DA+VK GQL+LVP+YFAVGK+AGEEGLEC ++I 
Sbjct: 241 PSDQLIYVTKGSGKIQVVGFSSKF--DADVKTGQLILVPRYFAVGKIAGEEGLECISMIV 300

Query: 305 TTHPLLEELGGKTSIFGAFSPQVFEASFNLTAHFEKLFRSKI 345
            THP++EEL GKTS+  A S +VF+ SFN+TA FEKLFRSK+
Sbjct: 301 ATHPMVEELAGKTSVLEALSSEVFQVSFNVTAEFEKLFRSKV 340

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
11S2_SESIN7.1e-1421.6111S globulin seed storage protein 2 OS=Sesamum indicum PE=2 SV=1[more]
CRU4_ARATH1.7e-1223.0012S seed storage protein CRD OS=Arabidopsis thaliana GN=CRD PE=1 SV=1[more]
GLUB1_ORYSJ8.6e-1229.23Glutelin type-B 1 OS=Oryza sativa subsp. japonica GN=GluB1-A PE=2 SV=3[more]
GLYG3_SOYBN1.5e-1128.44Glycinin G3 OS=Glycine max GN=GY3 PE=3 SV=1[more]
GLUB2_ORYSJ1.9e-1128.46Glutelin type-B 2 OS=Oryza sativa subsp. japonica GN=GLUB2 PE=2 SV=2[more]
Match NameE-valueIdentityDescription
A0A0A0LC21_CUCSA2.8e-203100.00Uncharacterized protein OS=Cucumis sativus GN=Csa_3G218170 PE=4 SV=1[more]
A0A0A0L6K0_CUCSA3.1e-11759.65Uncharacterized protein OS=Cucumis sativus GN=Csa_3G218160 PE=4 SV=1[more]
A0A0A0K550_CUCSA7.7e-10053.33Uncharacterized protein OS=Cucumis sativus GN=Csa_7G337100 PE=4 SV=1[more]
D7SZX9_VITVI2.1e-9749.16Putative uncharacterized protein OS=Vitis vinifera GN=VIT_12s0034g01870 PE=4 SV=... [more]
A0A151UBW6_CAJCA4.6e-9750.14Glutelin type-A 2 OS=Cajanus cajan GN=KK1_021060 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G07750.12.7e-7238.38 RmlC-like cupins superfamily protein[more]
AT2G28680.17.4e-7038.10 RmlC-like cupins superfamily protein[more]
AT1G03890.19.8e-1423.00 RmlC-like cupins superfamily protein[more]
AT4G28520.13.6e-0826.15 cruciferin 3[more]
Match NameE-valueIdentityDescription
gi|700202448|gb|KGN57581.1|4.1e-203100.00hypothetical protein Csa_3G218170 [Cucumis sativus][more]
gi|659112131|ref|XP_008456077.1|5.0e-19394.38PREDICTED: glutelin type-B 2-like [Cucumis melo][more]
gi|778680244|ref|XP_011651276.1|2.8e-172100.00PREDICTED: legumin J-like [Cucumis sativus][more]
gi|659112129|ref|XP_008456076.1|3.4e-11759.65PREDICTED: glutelin type-A 2-like [Cucumis melo][more]
gi|449467587|ref|XP_004151504.1|4.4e-11759.65PREDICTED: legumin J [Cucumis sativus][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR006045Cupin_1
IPR011051RmlC_Cupin_sf
IPR014710RmlC-like_jellyroll
Vocabulary: Molecular Function
TermDefinition
GO:0045735nutrient reservoir activity
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0045735 nutrient reservoir activity
This gene is associated with the following unigenes:
Unigene NameAnalysis NameSequence type in Unigene
CU108087cucumber EST collection version 3.0transcribed_cluster
CU124749cucumber EST collection version 3.0transcribed_cluster
CU127739cucumber EST collection version 3.0transcribed_cluster
CU128047cucumber EST collection version 3.0transcribed_cluster
CU132512cucumber EST collection version 3.0transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Csa3G218170.1Csa3G218170.1mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature NameUnique NameType
CU108087CU108087transcribed_cluster
CU127739CU127739transcribed_cluster
CU132512CU132512transcribed_cluster
CU128047CU128047transcribed_cluster
CU124749CU124749transcribed_cluster


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR006045Cupin 1PFAMPF00190Cupin_1coord: 198..334
score: 2.5E-13coord: 9..156
score: 7.1
IPR006045Cupin 1SMARTSM00835Cupin_1_3coord: 189..338
score: 6.6E-15coord: 3..158
score: 2.8
IPR011051RmlC-like cupin domainunknownSSF51182RmlC-like cupinscoord: 5..337
score: 8.29
IPR014710RmlC-like jelly roll foldGENE3DG3DSA:2.60.120.10coord: 4..182
score: 4.1E-36coord: 200..343
score: 2.1
NoneNo IPR availablePANTHERPTHR31189FAMILY NOT NAMEDcoord: 1..355
score: 1.8E
NoneNo IPR availablePANTHERPTHR31189:SF0SUBFAMILY NOT NAMEDcoord: 1..355
score: 1.8E

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None