Cucsa.049540 (gene) Cucumber (Gy14) v1

NameCucsa.049540
Typegene
OrganismCucumis sativus (Cucumber (Gy14) v1)
DescriptionHydroxyproline-rich glycoprotein family protein, putative
Locationscaffold00550 : 30581 .. 34014 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TCTTTCGACTTTGCTTCTCTTCAACGTTGGGTTTCTCCTAAAAATACCCATCCATTTCATTTCACTTTGGCAATTTTTGGCTTTTTCTAAGAAACCCACAAGGGGTTTTCCCTTTGATTGTGTATTTTCTAGTTCGATTTTGTTTTCAGTGGTGGATTGTTACAGATGCTCCACAAATGGGGAAAGGGGAAGAGCAAAATCTCCCGTTGCAGCAGCGTCGTGAGGTGGCTTTAACTGGGGATTCTTCTGGATTTCTTTGTGGGCAATGCTCGATTGCTTTTCATAGAGTTTGTAAGGAGTTGAATTTCAAGTGTTTCTTTGTTTTGGTGTTGGGGTTTGTGGTGTTTGTTCCTGGATTCTTTTGGCTTCTTCCACTTCATGAAAGAAATTCTGGGTTTGAGGCCAAAGACAACATTAAACTCAGTGGTATGTCTTTCTTGAGATTCCTGTTTTTCTTTTCTTCTTTCTTCTTCTTCTTCAATTTTGTTCTTTGGAGTTTCTAATTTGATACTTGATTGCTCTGATTTTTCTTGCAAATTTTTTATGTGGTTTTATGCTTTTGAATGATCTTCTGTTTTCTCCCGTCTATTGAAATTGGAAATTTGGAAGTGTTGTGATGTTTCCAGAGCATGATCGTGCTTCAGATCAACTCATATGATCTACCATTTTTATTCCCATAGCTTTGCTTTGATGGTCATGTTAATATGTAGACGAAGGGCCATTACTGAAGGTTTAATTCTGGTTTTAGATGTTTCTTACTTTTATCTTGTTTAGGTTTTCCTTGACGGGTTTTATTTTGTTTGATTTTCTTGATAGAAATGTTTGAAATTATGGTGCCTTAGTCCTAATCCTACAGAGTTTGTTATGTGAATTTAGTGGTGTTGCAGCCAATATTCACTTGACGATGATTTAGTTCTTCCATGTTGGAATATAATTTTCACATGGGAAGATGTGAACCAACATTTATACTATTTGGAAAATAGTAAAGGCATCATATTCCTTCCTTACCATTTTGCCTAGGGAATCCTATTCTTAGTAGAATCGGATTCTGAACGATCGGGTGGATCGTACAAGCATTCCTCCTAGCATCAACTATGAAGCACTTGCTCTAGCAGCAAAACCAGACTTTGCTTTTGCTTCAATTTTAGATTGCCTGTGAACTGTTTGGTTAGAATCCGGTTCGTAAGGGATTAAGAATGCTTACTCCTTAGTTTGATGGACAGCTAGAAATTAGACTTTGCTTTTGCTTCAACTTAGATCTTCTGTGAGTATCTGTTACTTAGAAGGTTTTTGTTCTCTTTTCCAGCTACAGTTCAGGTGTATTTTGTTCTTGAAAAGCCCGTGACCGAGCTTCTCCCTCACATCAAGAGATTAGAGTTTGATATCAATGGTGAATTAGACATTCCAAACGTGAAGGTTTTGCGCTTCTTTACTGTTTTTGGATTTAGTTTACTAGTAAGGTCTTCTATGCCTAAGGGTTGTCGTTTATGTTACCAGGTTTCCATTCTATCCATGCATGATATAGGTGAGTCAAACAGGACTTACGTGGTTTTCGGTCTTCTTTCTGAATACATAACTGCTCCAATAAATCCAGTGTCCTTAAGTCTGCTGAGATCATCTTTATATGACTTTTTCCTTTCGGAATCCAACCTTACTTTGACGACATCAATTTTTGGACAGCCATCGACACTTCAAATTCTCAAGTTTCCAGGGGGGATTTCTATAATCCCATTTCAACATGCTTCGATTTGGGAGTTTCCCCAGATCGTATTTAACTTCACTCTTACTAACTCCATTTCCGAAATACTTGACAACTTTGCCAAGTTCAAGAGCCAGCTAAAGTTTGGATTGCGTCTGAGGTCTTATGAGGTACAAAAACTTATTTAAGTTAAACTTATACTTTCTGCTATGGATGGAATTTATCAAATATTATCTCTAGAACTTGAGATCATAAGTTTAGACTTGAGATGAGTTTTATATTATTGTTGTTAGTTTTATGTATCTGGGACTGGAGTTGATTAATGGATGTTGGATTCTACTCTTTCTTTTCTTGTTATACAGAATGTGTATTTGCAAATAACAAACAAGATTGGCTCGACAGTGCAACCACTCGTAATTGTTCAGGCTTCTATTACGTCGGAATTGGGACGCATTACGTCACAGAGATTACAGCAGTTGGCTGCAATCATCAACACCTCTCCTGAAAGAAATCTTGGCCTTGATTATTCTGTTTTTGGAGAAGTCAAGAGTGTCAGTTTGTCTTCTTATCCGAAGAGAACCTCCAAGGCAATGCCACCAAGTTTTTCTCCAGCCCCTGCCCCAGCGCCTGGCAATCATGTAGAAGTACCGAGTGGCCCACATCCGTTGAGATCCATGCGACCACCAGCAAACCATTCCCCACCTCATGCAAATTGCAAAAGCTCGTCTCCGAACCCTTCTATGGTTCCTGCAAATTCCCCTCATGAACATTCAATACCACCAATCTCGTATCCTAAGTCTACCAGACTGATCGTTCCTCCAGCTAATCAACCTCGTGTTTATTCTCCACGTGCATCTCCAGTAGAGTCTCCACCACTGTTGCCCCCCGATCTGTTACCTAAACCCAAGCCTTCTTTTCGCTCCAAATCAGGGCAGACAAATGAAGATCCGTCACATCCAGTTCATGTAAGGTTTGACATATTTTTTACTTGATTATAGGTTGAAAGCACGTAGTAGTAGTATTTCTTAGTACAAAATCAATTCATTTATATCTACAAAAGCACAACTCATTCCCTTCTCATCTTTGTAGGATTAAGAATAACGGGTCTTGACCGCTCTCTCCTACAACTCATTTACATCTATCACCTACATAATCAACATACAATAGTCTGAAGAATACAATCAGATGGGAAACTCATTTTTGGTTCACTCCTCAAGAACAAAGAAAGTCAGTATAGAGAGTCCGAAGAAGAATAAGAAGAAGAATAAGAAGAAGACAAAAGGTGATATTTCCCTTTATATTAACCCTACGCAGGGAAATAGGGATGTTGTTTTCCAAAGGGAACCAACTCTCCTCCTTGGTTGGGTTTAAATTATATATGTTTTAGAAAAAAATATAATTAATTTATGAAAGGAGAGATTTGAAGTTGCAGACCTCAAAAGGAACTATTATTATATAGTCTTATTTTCTTTAGTAAAAGATGTACAGTTCTTAAACAAAGGTTCCTTTTGACTGTCTGCAATTTTGGTAGTTACTTACAAGAAAGAAATGGAATAAGAAAGCAAAATGCTTGTTAATTACATTACTAAATCCAACACAATTTCATGTACCTAATCAATTGGTTAAGAAATGAAAGGAAATGAAGAGTAAGGAATGTAGTAGATATTACCGGGGGTGGGTTAATTGGCTAGCCAAGAGTACATCCACGAACCTGCTGCAAACTCCAACCAGTACA

mRNA sequence

TCTTTCGACTTTGCTTCTCTTCAACGTTGGGTTTCTCCTAAAAATACCCATCCATTTCATTTCACTTTGGCAATTTTTGGCTTTTTCTAAGAAACCCACAAGGGGTTTTCCCTTTGATTGTGTATTTTCTAGTTCGATTTTGTTTTCAGTGGTGGATTGTTACAGATGCTCCACAAATGGGGAAAGGGGAAGAGCAAAATCTCCCGTTGCAGCAGCGTCGTGAGGTGGCTTTAACTGGGGATTCTTCTGGATTTCTTTGTGGGCAATGCTCGATTGCTTTTCATAGAGTTTGTAAGGAGTTGAATTTCAAGTGTTTCTTTGTTTTGGTGTTGGGGTTTGTGGTGTTTGTTCCTGGATTCTTTTGGCTTCTTCCACTTCATGAAAGAAATTCTGGGTTTGAGGCCAAAGACAACATTAAACTCAGTGCTACAGTTCAGGTGTATTTTGTTCTTGAAAAGCCCGTGACCGAGCTTCTCCCTCACATCAAGAGATTAGAGTTTGATATCAATGGTGAATTAGACATTCCAAACGTGAAGCCATCGACACTTCAAATTCTCAAGTTTCCAGGGGGGATTTCTATAATCCCATTTCAACATGCTTCGATTTGGGAGTTTCCCCAGATCGTATTTAACTTCACTCTTACTAACTCCATTTCCGAAATACTTGACAACTTTGCCAAGTTCAAGAGCCAGCTAAAGTTTGGATTGCGTCTGAGGTCTTATGAGAATGTGTATTTGCAAATAACAAACAAGATTGGCTCGACAGTGCAACCACTCGTAATTGTTCAGGCTTCTATTACGTCGGAATTGGGACGCATTACGTCACAGAGATTACAGCAGTTGGCTGCAATCATCAACACCTCTCCTGAAAGAAATCTTGGCCTTGATTATTCTGTTTTTGGAGAAGTCAAGAGTGTCAGTTTGTCTTCTTATCCGAAGAGAACCTCCAAGGCAATGCCACCAAGTTTTTCTCCAGCCCCTGCCCCAGCGCCTGGCAATCATGTAGAAGTACCGAGTGGCCCACATCCGTTGAGATCCATGCGACCACCAGCAAACCATTCCCCACCTCATGCAAATTGCAAAAGCTCGTCTCCGAACCCTTCTATGGTTCCTGCAAATTCCCCTCATGAACATTCAATACCACCAATCTCGTATCCTAAGTCTACCAGACTGATCGTTCCTCCAGCTAATCAACCTCGTGTTTATTCTCCACGTGCATCTCCAGTAGAGTCTCCACCACTGTTGCCCCCCGATCTGTTACCTAAACCCAAGCCTTCTTTTCGCTCCAAATCAGGGCAGACAAATGAAGATCCGTCACATCCAGTTCATGATTAAGAATAACGGGTCTTGACCGCTCTCTCCTACAACTCATTTACATCTATCACCTACATAATCAACATACAATAGTCTGAAGAATACAATCAGATGGGAAACTCATTTTTGGTTCACTCCTCAAGAACAAAGAAAGTCAGTATAGAGAGTCCgaagaagaataagaagaagaataagaagaagaCAAAAGGTGATATTTCCCTTTATATTAACCCTACGCAGGGAAATAGGGATGTTGTTTTCCAAAGGGAACCAACTCTCCTCCTTGGTTGGGtttaaattatatatgttttagaaaaaaatataattaatttatGAAAGGAGAGATTTGAAGTTGCAGACCTCAAAAGGAACTATTATTATATAGTCTTATTTTCTTTAGTAAAAGATGTACAGTTCTTAAACAAAGGTTCCTTTTGACTGTCTGCAATTTTGGTAGTTACTTACAAGAAAGAAATGGAATAAGAAAGCAAAATGCTTGTTAATTACATTACTAAATCCAACACAATTTCATGTACCTAATCAATTGGTTAAGAAATGAAAGGAAATGAAGAGTAAGGAATGTAGTAGATATTACCGGGGGTGGGTTAATTGGCTAGCCAAGAGTACATCCACGAACCTGCTGCAAACTCCAACCAGTACA

Coding sequence (CDS)

ATGGGGAAAGGGGAAGAGCAAAATCTCCCGTTGCAGCAGCGTCGTGAGGTGGCTTTAACTGGGGATTCTTCTGGATTTCTTTGTGGGCAATGCTCGATTGCTTTTCATAGAGTTTGTAAGGAGTTGAATTTCAAGTGTTTCTTTGTTTTGGTGTTGGGGTTTGTGGTGTTTGTTCCTGGATTCTTTTGGCTTCTTCCACTTCATGAAAGAAATTCTGGGTTTGAGGCCAAAGACAACATTAAACTCAGTGCTACAGTTCAGGTGTATTTTGTTCTTGAAAAGCCCGTGACCGAGCTTCTCCCTCACATCAAGAGATTAGAGTTTGATATCAATGGTGAATTAGACATTCCAAACGTGAAGCCATCGACACTTCAAATTCTCAAGTTTCCAGGGGGGATTTCTATAATCCCATTTCAACATGCTTCGATTTGGGAGTTTCCCCAGATCGTATTTAACTTCACTCTTACTAACTCCATTTCCGAAATACTTGACAACTTTGCCAAGTTCAAGAGCCAGCTAAAGTTTGGATTGCGTCTGAGGTCTTATGAGAATGTGTATTTGCAAATAACAAACAAGATTGGCTCGACAGTGCAACCACTCGTAATTGTTCAGGCTTCTATTACGTCGGAATTGGGACGCATTACGTCACAGAGATTACAGCAGTTGGCTGCAATCATCAACACCTCTCCTGAAAGAAATCTTGGCCTTGATTATTCTGTTTTTGGAGAAGTCAAGAGTGTCAGTTTGTCTTCTTATCCGAAGAGAACCTCCAAGGCAATGCCACCAAGTTTTTCTCCAGCCCCTGCCCCAGCGCCTGGCAATCATGTAGAAGTACCGAGTGGCCCACATCCGTTGAGATCCATGCGACCACCAGCAAACCATTCCCCACCTCATGCAAATTGCAAAAGCTCGTCTCCGAACCCTTCTATGGTTCCTGCAAATTCCCCTCATGAACATTCAATACCACCAATCTCGTATCCTAAGTCTACCAGACTGATCGTTCCTCCAGCTAATCAACCTCGTGTTTATTCTCCACGTGCATCTCCAGTAGAGTCTCCACCACTGTTGCCCCCCGATCTGTTACCTAAACCCAAGCCTTCTTTTCGCTCCAAATCAGGGCAGACAAATGAAGATCCGTCACATCCAGTTCATGATTAA

Protein sequence

MGKGEEQNLPLQQRREVALTGDSSGFLCGQCSIAFHRVCKELNFKCFFVLVLGFVVFVPGFFWLLPLHERNSGFEAKDNIKLSATVQVYFVLEKPVTELLPHIKRLEFDINGELDIPNVKPSTLQILKFPGGISIIPFQHASIWEFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLRLRSYENVYLQITNKIGSTVQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYSVFGEVKSVSLSSYPKRTSKAMPPSFSPAPAPAPGNHVEVPSGPHPLRSMRPPANHSPPHANCKSSSPNPSMVPANSPHEHSIPPISYPKSTRLIVPPANQPRVYSPRASPVESPPLLPPDLLPKPKPSFRSKSGQTNEDPSHPVHD*
BLAST of Cucsa.049540 vs. Swiss-Prot
Match: PEXLP_TOBAC (Pistil-specific extensin-like protein OS=Nicotiana tabacum PE=2 SV=1)

HSP 1 Score: 53.9 bits (128), Expect = 5.3e-06
Identity = 35/117 (29.91%), Postives = 49/117 (41.88%), Query Frame = 1

Query: 314 PKRTSKAMPPSFSPAPAPAPGNHVEVPSGPHPLRSMRPPANHSPPHANCKSSSPNPSMVP 373
           P     + PP   P  AP+P    + P  P P+++  P     PP     +  P P    
Sbjct: 172 PPAKQPSPPPPPPPVKAPSPSPAKQPPPPPPPVKAPSPSPATQPP-----TKQPPPPPRA 231

Query: 374 ANSPHEHSIPPISYPKSTRLIVPPANQPRVYSPRASPVESPPLLPPDLLP---KPKP 428
             SP     PP++YP        PA +P + +P  SP  +PPL+P    P   KP P
Sbjct: 232 KKSPLLPPPPPVAYPPVMTPSPSPAAEPPIIAPFPSPPANPPLIPRRPAPPVVKPLP 283

BLAST of Cucsa.049540 vs. TrEMBL
Match: A0A0A0L6J0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G214050 PE=4 SV=1)

HSP 1 Score: 897.1 bits (2317), Expect = 8.7e-258
Identity = 445/445 (100.00%), Postives = 445/445 (100.00%), Query Frame = 1

Query: 1   MGKGEEQNLPLQQRREVALTGDSSGFLCGQCSIAFHRVCKELNFKCFFVLVLGFVVFVPG 60
           MGKGEEQNLPLQQRREVALTGDSSGFLCGQCSIAFHRVCKELNFKCFFVLVLGFVVFVPG
Sbjct: 1   MGKGEEQNLPLQQRREVALTGDSSGFLCGQCSIAFHRVCKELNFKCFFVLVLGFVVFVPG 60

Query: 61  FFWLLPLHERNSGFEAKDNIKLSATVQVYFVLEKPVTELLPHIKRLEFDINGELDIPNVK 120
           FFWLLPLHERNSGFEAKDNIKLSATVQVYFVLEKPVTELLPHIKRLEFDINGELDIPNVK
Sbjct: 61  FFWLLPLHERNSGFEAKDNIKLSATVQVYFVLEKPVTELLPHIKRLEFDINGELDIPNVK 120

Query: 121 VSILSMHDIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFG 180
           VSILSMHDIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFG
Sbjct: 121 VSILSMHDIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFG 180

Query: 181 QPSTLQILKFPGGISIIPFQHASIWEFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLRL 240
           QPSTLQILKFPGGISIIPFQHASIWEFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLRL
Sbjct: 181 QPSTLQILKFPGGISIIPFQHASIWEFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLRL 240

Query: 241 RSYENVYLQITNKIGSTVQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS 300
           RSYENVYLQITNKIGSTVQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS
Sbjct: 241 RSYENVYLQITNKIGSTVQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS 300

Query: 301 VFGEVKSVSLSSYPKRTSKAMPPSFSPAPAPAPGNHVEVPSGPHPLRSMRPPANHSPPHA 360
           VFGEVKSVSLSSYPKRTSKAMPPSFSPAPAPAPGNHVEVPSGPHPLRSMRPPANHSPPHA
Sbjct: 301 VFGEVKSVSLSSYPKRTSKAMPPSFSPAPAPAPGNHVEVPSGPHPLRSMRPPANHSPPHA 360

Query: 361 NCKSSSPNPSMVPANSPHEHSIPPISYPKSTRLIVPPANQPRVYSPRASPVESPPLLPPD 420
           NCKSSSPNPSMVPANSPHEHSIPPISYPKSTRLIVPPANQPRVYSPRASPVESPPLLPPD
Sbjct: 361 NCKSSSPNPSMVPANSPHEHSIPPISYPKSTRLIVPPANQPRVYSPRASPVESPPLLPPD 420

Query: 421 LLPKPKPSFRSKSGQTNEDPSHPVH 446
           LLPKPKPSFRSKSGQTNEDPSHPVH
Sbjct: 421 LLPKPKPSFRSKSGQTNEDPSHPVH 445

BLAST of Cucsa.049540 vs. TrEMBL
Match: A0A067KRP0_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_05038 PE=4 SV=1)

HSP 1 Score: 406.4 bits (1043), Expect = 4.7e-110
Identity = 224/441 (50.79%), Postives = 286/441 (64.85%), Query Frame = 1

Query: 1   MGKGEEQNLPLQQRREVA----LTGDSSGFLCGQCSIAFHRVCKELNFKCFFVLVLGFVV 60
           MGK  +QNL   Q  E        G SSG  C +CS+   R+ K+ +F+CFFVL+L   +
Sbjct: 1   MGKEAQQNLQQWQSYENGNGGGREGGSSGIFCERCSMGLSRIYKDFSFRCFFVLILSLSL 60

Query: 61  FVPGFFWLLPLHERN-SGFEAKDNIKLSATVQVYFVLEKPVTELLPHIKRLEFDINGELD 120
            V G FW+LP H     GF+AKD+IK SA VQVYF L+KPV++++ HI RLE+DIN E+ 
Sbjct: 61  LVSGIFWILPSHTAKLDGFDAKDSIKFSAAVQVYFRLQKPVSQVVQHIDRLEYDINDEIG 120

Query: 121 IPNVKVSILSMHDIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLT 180
           +P  KV++LSMH  G SN T VVFG+LS  I  PIN VSLS+LRSSL + FL ESNLTLT
Sbjct: 121 VPGAKVAVLSMHQSGASNWTEVVFGVLSNSIQVPINQVSLSVLRSSLIEVFLRESNLTLT 180

Query: 181 TSIFGQPSTLQILKFPGGISIIPFQHASIWEFPQIVFNFTLTNSISEILDNFAKFKSQLK 240
           TSIFGQPS  QILKF GGI++IP  +ASIW+ PQI+F FTL NSI+EIL N A+ ++QLK
Sbjct: 181 TSIFGQPSMFQILKFSGGITVIPVAYASIWQRPQILFKFTLNNSIAEILYNLAELRNQLK 240

Query: 241 FGLRLRSYENVYLQITNKIGSTVQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNL 300
           FGL LR YENV +QITN  GST+   V VQAS+ S+LG +   RL+QLA  I  SP +NL
Sbjct: 241 FGLHLRPYENVIVQITNTAGSTIDSPVTVQASVVSDLGSLLPLRLRQLAQTITDSPSKNL 300

Query: 301 GLDYSVFGEVKSVSLSSYPKRTSKAMPPSFSPAPAPAPGNHVEVPSGPHPLRSMRPPANH 360
           GLD SVFG+VKSV LSSY K T  A PP+ SPAP+P   ++ E P+ P P  ++ P  + 
Sbjct: 301 GLDNSVFGKVKSVILSSYLKETLHANPPTPSPAPSPELNDYSEPPTSPCP--TISPSISP 360

Query: 361 SPPHANCKSSSPNPSMVPANSPHEHSI-----PPISYPKSTRLIVPPANQPRVYSPRASP 420
           +        S P+ S+ P NSP   ++      P  Y  S     P  ++  + SP   P
Sbjct: 361 AASPTTSPKSGPDNSLSPVNSPVHSTVTAEPPQPCGYHGSPVSPSPSPSRSNL-SPYLHP 420

Query: 421 VESPPLLPPDLLPKPKPSFRS 432
            + P  LPPD+ P P+ SF +
Sbjct: 421 ADPPSQLPPDMSPSPQASFNN 438

BLAST of Cucsa.049540 vs. TrEMBL
Match: V4T6J0_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10000890mg PE=4 SV=1)

HSP 1 Score: 404.1 bits (1037), Expect = 2.3e-109
Identity = 228/450 (50.67%), Postives = 296/450 (65.78%), Query Frame = 1

Query: 1   MGKGEEQNLPLQQRREVALTGD--SSGFLCGQCSIAFHRVCKELNFKCFFVLVLGFVVFV 60
           MGK E QNL   Q+++ + T D  SS F C +CS+    + KEL+ KC  +L     VF+
Sbjct: 22  MGKNE-QNL---QQQQTSHTPDQRSSRFFCARCSVVLSLISKELSLKCVVLLFFSLAVFL 81

Query: 61  PGFFWLLPLHERNSGFEAKDNIKLSATVQVYFVLEKPVTELLPHIKRLEFDINGELDIPN 120
            GFFWLLP  +  SGF+AK  IKLSA+VQ  F L+KPV+EL+P I RLE+DI GE+ +P+
Sbjct: 82  SGFFWLLPRSKFQSGFDAKAEIKLSASVQASFRLQKPVSELVPRIGRLEYDIYGEIGVPD 141

Query: 121 VKVSILSMHDIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSI 180
            KV++LS+H  G SN T +VFG+LS+ I A INPVSLS+L+SSL + FL +SNLTLTT++
Sbjct: 142 TKVAVLSVHQSGASNWTDIVFGVLSDPINARINPVSLSVLKSSLIELFLQQSNLTLTTTV 201

Query: 181 FGQPSTLQILKFPGGISIIPFQHASIWEFPQIVFNFTLTNSISEILDNFAKFKSQLKFGL 240
           FGQPS  +IL+FPGGI++IP   A I + PQI+FNFTL NSISEI +NF +   QLKFGL
Sbjct: 202 FGQPSMFEILRFPGGITVIPLPIAYILQLPQILFNFTLNNSISEIEENFIELSDQLKFGL 261

Query: 241 RLRSYENVYLQITNKIGSTVQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLD 300
           RLR YENVY+Q+TNK GST+ P V V+AS+ SE+G +  QRL+QLA  I+ SP +NLGLD
Sbjct: 262 RLRPYENVYVQVTNKDGSTISPPVTVEASVMSEMGSLLPQRLKQLAQAISDSPAKNLGLD 321

Query: 301 YSVFGEVKSVSLSSYPKRTSKAMPPSFSPAPAPAPGNHVEVPSGPHPLRSMRPPANHSPP 360
            SVFG+VK V LSSY K T  A PP+ SPAP+P P N       P    +  P  +H PP
Sbjct: 322 NSVFGKVKGVVLSSYLKGTLHATPPTPSPAPSPEPSNAPYPALSPSNSPAPSPNIHHLPP 381

Query: 361 HANCKSSSPNP--SMVPANSPHEHSIPPIS---YPKST--------RLIVPPANQPRV-Y 420
            +NC+ SSP+    + P +S  + +  P S   YP S            VPP++ P    
Sbjct: 382 CSNCEVSSPSDHNQLQPPSSQSDPTPAPSSATAYPPSPCRRPYHAHHRTVPPSSSPASDP 441

Query: 421 SPRASPVESPPLLPPDLLPKPKPSFRSKSG 435
           +P  SP   PP L P+L P P+ S+ S  G
Sbjct: 442 NPTNSPPIGPPKLAPNLSPLPEVSYSSGRG 467

BLAST of Cucsa.049540 vs. TrEMBL
Match: A0A067DJZ1_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g008625mg PE=4 SV=1)

HSP 1 Score: 401.4 bits (1030), Expect = 1.5e-108
Identity = 226/452 (50.00%), Postives = 290/452 (64.16%), Query Frame = 1

Query: 1   MGKGEEQNLPLQQRREVALTGD--SSGFLCGQCSIAFHRVCKELNFKCFFVLVLGFVVFV 60
           MGK E QNL   Q+++ + T D  SS F C +CS+    + KEL+ KC  +L     VF+
Sbjct: 1   MGKNE-QNL---QQQQTSHTPDQRSSRFFCARCSVVLSLISKELSLKCVVLLFFSLAVFL 60

Query: 61  PGFFWLLPLHERNSGFEAKDNIKLSATVQVYFVLEKPVTELLPHIKRLEFDINGELDIPN 120
            GFFWLLP  +  SGF+AK  IKLSA+VQ  F L+KPV+EL+P I RLE+DI GE+ +P+
Sbjct: 61  SGFFWLLPRSKFQSGFDAKAEIKLSASVQASFRLQKPVSELVPRIGRLEYDIYGEIGVPD 120

Query: 121 VKVSILSMHDIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSI 180
            KV++LS+H  G SN T +VFG+LS+ I A INPVSLS+L+SSL + FL +SNLTLTT++
Sbjct: 121 TKVAVLSVHQSGASNWTDIVFGVLSDPINARINPVSLSVLKSSLIELFLQQSNLTLTTTV 180

Query: 181 FGQPSTLQILKFPGGISIIPFQHASIWEFPQIVFNFTLTNSISEILDNFAKFKSQLKFGL 240
           FGQPS  +IL+FPGGI++IP   A I + PQI+FNFTL NSISEI +NF +   QLKFGL
Sbjct: 181 FGQPSMFEILRFPGGITVIPLPIAYILQLPQILFNFTLNNSISEIEENFIELSDQLKFGL 240

Query: 241 RLRSYENVYLQITNKIGSTVQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLD 300
           RLR YENVY+Q+TNK GST+ P V V+AS+ SE+G +  QRL+QLA  I+ SP +NLGLD
Sbjct: 241 RLRPYENVYVQVTNKDGSTISPPVTVEASVMSEMGSLLPQRLKQLAQAISDSPAKNLGLD 300

Query: 301 YSVFGEVKSVSLSSYPKRTSKAMPPSFSPAPAPAPGNHVEVPSGPHPLRSMRPPANHSPP 360
            SVFG+VK V LSSY K T  A PP+ SPAP+P P N       P    +  P  +H PP
Sbjct: 301 NSVFGKVKGVVLSSYLKGTLHATPPTPSPAPSPEPSNAPYPALSPSNSPAPSPNIHHLPP 360

Query: 361 HANCKSSSPNPSMVPANSPHEHSIPPISYPKST---------------RLIVPPANQPRV 420
            +NC+ SSP+    P   P      P   P S                   VPP++ P  
Sbjct: 361 CSNCEVSSPSDHNQP--QPPSSQSDPTPAPSSATAYSPSPCRRPYHVHHRTVPPSSSPAS 420

Query: 421 -YSPRASPVESPPLLPPDLLPKPKPSFRSKSG 435
             +P  SP   PP L P+L P P+ S+ S  G
Sbjct: 421 DPNPTNSPPIGPPKLAPNLSPLPEVSYSSGRG 446

BLAST of Cucsa.049540 vs. TrEMBL
Match: A0A061DX06_THECC (Hydroxyproline-rich glycoprotein family protein, putative OS=Theobroma cacao GN=TCM_006395 PE=4 SV=1)

HSP 1 Score: 396.0 bits (1016), Expect = 6.3e-107
Identity = 234/481 (48.65%), Postives = 298/481 (61.95%), Query Frame = 1

Query: 1   MGKGEEQNLPLQQRREVALTGDSSGF---LCGQCSIAFHRVCKELNFKCFFVLVLGFVVF 60
           MGK E+ NL  QQR +   +G + G    L G C +   R+    +F+C FVL L   V 
Sbjct: 1   MGKNEDPNL--QQREQSLESGSNQGHQGCLRGGCWVVLSRLSNAFSFRCVFVLFLSLSVL 60

Query: 61  VPGFFWLLPLHERNSGFEAKDNIKLSATVQVYFVLEKPVTELLPHIKRLEFDINGELDIP 120
           +PG FW+LP      GF+AK  IKLSA V  YF L+KPV++L+ HI +LE+DI  E+ +P
Sbjct: 61  LPGIFWILPFRSVKYGFDAKQAIKLSAPVHAYFKLQKPVSQLVQHIGKLEYDIFEEIGVP 120

Query: 121 NVKVSILSMHDIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTS 180
           + KV+ILSMH  G SN T VVFG+LS+ I  PINPVSLS+LRSSL + FL +SNLTLTTS
Sbjct: 121 DTKVAILSMHQSGASNSTNVVFGVLSDPINDPINPVSLSVLRSSLIELFLQQSNLTLTTS 180

Query: 181 IFGQPSTLQILKFPGGISIIPFQHASIWEFPQIVFNFTLTNSISEILDNFAKFKSQLKFG 240
           IFGQPS  +ILKFPGGI+IIP Q ASIW+  QI+FNFTL NSISEI D F + K QLK+G
Sbjct: 181 IFGQPSEFEILKFPGGITIIPVQSASIWQITQILFNFTLNNSISEIQDKFIELKDQLKYG 240

Query: 241 LRLRSYENVYLQITNKIGSTVQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGL 300
           LRLRSYENV++Q+TN  GST+   VIVQAS+ S+ G +  QRL+QLA  I  SP +NLGL
Sbjct: 241 LRLRSYENVFVQLTNINGSTISSPVIVQASVMSDFGSLLPQRLKQLAQTITDSPAKNLGL 300

Query: 301 DYSVFGEVKSVSLSSYPKRTSKAMPPSFSPAPAPAP--GNHVEVPSGPHPLRSMRPPANH 360
           + +VFG+VKS+SLSSY K +  A PP+ SPAP+P P    H   P    P    +    H
Sbjct: 301 NNTVFGKVKSISLSSYLKGSLHAGPPTPSPAPSPGPSISPHPTFPPTHSPASLPKSHHRH 360

Query: 361 SPPHANCKSSSPNPSMVPANSPHEH-----SIPPISY---PKS-----------TRLIVP 420
            P    CK++SP+ +  P +SP        S+PP S    P S           +R  V 
Sbjct: 361 LPHCRKCKATSPS-AHSPLHSPSPGSGSYLSLPPTSISPAPSSAVTHPPPPCPYSRHAVS 420

Query: 421 PANQPRVYS---PRASPVESP-PLLPPDLLPKPKPSFRSKSGQTNEDPSHPVHVRIKNNG 454
           P++ PR +S   P   PV SP   L P+L P P  S+ S+ G   E    PV   +  + 
Sbjct: 421 PSSSPRSHSNLIPHHPPVMSPRSQLSPELPPLPSVSYGSRPGHGMESMEGPVSAPLAQSP 478

BLAST of Cucsa.049540 vs. TAIR10
Match: AT3G56590.2 (AT3G56590.2 hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 201.8 bits (512), Expect = 8.8e-52
Identity = 147/452 (32.52%), Postives = 220/452 (48.67%), Query Frame = 1

Query: 1   MGKG--EEQNLPLQQRREVALTGDSSGFLCGQCSIAFHRVCKELNFKCFFVLVLGFVVFV 60
           MGK   EEQNLP+      A      G     C      +    + +C  +L     VF+
Sbjct: 1   MGKNTVEEQNLPVSDGAASARNNGGGGISTCCCC---DWISSYFSLRCVLILAFSAAVFL 60

Query: 61  PGFFWLLPLHERNSGFEAKDNIKLSATVQVY-----FVLEKPVTELLPHIKRLEFDINGE 120
              FWL P      GF    ++ L    + +     F + KP++ +  ++ +LE DI  E
Sbjct: 61  SALFWLPPF----LGFADPGDLDLDPRFKDHRIVASFDVGKPISFMEDNLMQLENDITDE 120

Query: 121 LDIPNVKVSILSMHDIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLT 180
           +  P  KV +L++  +G+ NRT V+F +  E   + I     SL++++       + +  
Sbjct: 121 ISFPMTKVVVLALERLGDLNRTMVIFAIDPEKENSKIPAEIESLIKAAFETLVQKQLSFR 180

Query: 181 LTTSIFGQPSTLQILKFPGGISIIPFQHASIWEFPQIVFNFTLTNSISEILDNFAKFKSQ 240
           LT S+FG+P   ++LKFPGGI++IP Q     +  Q++FNFTL  SI +I  NF +  SQ
Sbjct: 181 LTESLFGEPFFFEVLKFPGGITVIPPQPIFPLQKAQLLFNFTLNFSIYQIQSNFEELASQ 240

Query: 241 LKFGLRLRSYENVYLQITNKIGSTVQPLVIVQASITSELGRITSQRLQQLAAIINTSPER 300
           LK G+ L SYEN+Y+ ++N  GSTV P  IV +S+    G  +S RL+QLA  I +S  +
Sbjct: 241 LKKGINLASYENLYITLSNSRGSTVAPPTIVHSSVLLTFG--SSSRLKQLAQTITSSHSK 300

Query: 301 NLGLDYSVFGEVKSVSLSSYPKRTSKAMPPSFSPAPAPAPGNHVEVPSGPHPLRSMRPPA 360
           NLGL+++VFG+VK V LSS    +      S +P+P+P P  H             + P 
Sbjct: 301 NLGLNHTVFGKVKQVRLSSILPHSPAT---SSTPSPSPQPETH-------------QYPH 360

Query: 361 NHSPPHANCKSSSPNPSMVPANSPHEHSIPPISYPKSTRLIVPPANQPRVYSPRASPVES 420
           +H   H +    +P PS+    SP      P S P +    +PP N P  Y  R  P  +
Sbjct: 361 HHPHHHHHHHELAPEPSL----SPPTKGFAPASAP-TKHSPLPPRNPPCPYEQR-RPKGN 420

Query: 421 PPLLPPDLLPKPKPSFRSKSGQTNEDPSHPVH 446
             L      P P P  RS+      +P+ P H
Sbjct: 421 SALNHHTAPPTPAP-HRSQPHPPAPNPAPPRH 420

BLAST of Cucsa.049540 vs. TAIR10
Match: AT1G10790.1 (AT1G10790.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT3G56590.2))

HSP 1 Score: 199.9 bits (507), Expect = 3.4e-51
Identity = 127/331 (38.37%), Postives = 180/331 (54.38%), Query Frame = 1

Query: 5   EEQNLPLQQRREVALTGDSSGFLCGQ-CSIAFHRVCKELNFKCFFVLVLGFVVFVPGFFW 64
           +E  L LQQ        +SS    G+ CS AF R+   +  +C  VLVL   + +   FW
Sbjct: 6   KENALALQQETLDLENPESSPRSSGRSCSSAFSRL---VGLRCLIVLVLSCAILLSAIFW 65

Query: 65  LLPLHERNSGFEAKDNIKLSATVQVYFVLEKPVTELLPHIKRLEFDINGELDIPN-VKVS 124
           L P     S F+A   +KL+A+VQ  F L+KPV+E++ H  ++E DI   + + N  KV+
Sbjct: 66  LFP-RRSVSEFKADGTVKLNASVQASFRLQKPVSEVVRHKGKIEHDILRSIGLSNNSKVT 125

Query: 125 ILSMHDIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQP 184
           +LS++  G SN T V F +L       I+  SLSLLRSS    F   S L LTTS FG+P
Sbjct: 126 VLSLNQSGASNYTDVEFAVLPVPPDHEISKHSLSLLRSSFVKLFAKRSKLKLTTSGFGKP 185

Query: 185 STLQILKFPGGISIIPFQHASIWEFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLRLRS 244
           ++ Q+LKFPGGI++ P + A +     ++F+ T+  SIS + D         +  L L  
Sbjct: 186 TSFQVLKFPGGITVDPLEPAPVSGVALVLFSVTIKTSISTVQDRLDLLNGLFEHMLSLEP 245

Query: 245 YENVYLQITNKIGSTVQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYSVF 304
           YE+V+ Q+TNK GST+ P +  Q  +   + +   QRL     II TS  +NLGLD +VF
Sbjct: 246 YESVHFQLTNKQGSTISPPLTFQVYVAFTMRKYLHQRLNHFTQIIQTSRAKNLGLDEAVF 305

Query: 305 GEVKSVSLSSYPKRTSKAMPPSFSPAPAPAP 334
           GEVK ++ S+Y     K        APAP P
Sbjct: 306 GEVKDITFSTY--LDGKVPDSDLELAPAPTP 330

BLAST of Cucsa.049540 vs. TAIR10
Match: AT3G10810.1 (AT3G10810.1 zinc finger (C3HC4-type RING finger) family protein)

HSP 1 Score: 187.2 bits (474), Expect = 2.2e-47
Identity = 138/425 (32.47%), Postives = 208/425 (48.94%), Query Frame = 1

Query: 20  TGDSS--GFLCGQCSIAFHRVCKELNFKCFFVLVLGFVVFVPGFFWLLPLHERNSGFEAK 79
           TGDS+     CG C      +   + FKC FVL+L   +F+   F LLP           
Sbjct: 18  TGDSTVRNARCGCCKW----ISSFVGFKCLFVLLLSVALFLSALFLLLPFPMDREDSNLD 77

Query: 80  DNIKLSATVQVYFVLEKPVTELLPHIKRLEFDINGELDIPNVKVSILSMHDIGESNRTYV 139
              +  A V   F + +  + L  +  +L+ DI  E+   ++KV+IL++    E N T V
Sbjct: 78  PRFRGHAIV-ASFSINRSASFLNENTLQLQNDIFQEMSYISIKVTILAVEPSDELNITKV 137

Query: 140 VFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFGQPSTLQILKFPGGISII 199
           VFG+  +     I P+SLS ++       +++S L LT S+FG+    ++LKFPGGI++I
Sbjct: 138 VFGIDPDTGYREILPLSLSSIKEMFESVLINQSTLQLTKSLFGETFLFEVLKFPGGITVI 197

Query: 200 PFQHASIWEFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLRLRSYENVYLQITNKIGST 259
           P Q A   +  +IVFNFTL  SI +I  NF    SQLK GL L  YEN+Y+ ++N  GST
Sbjct: 198 PPQSAFPLQKFKIVFNFTLNYSIHQIQINFNTLASQLKNGLNLAPYENLYVSLSNSEGST 257

Query: 260 VQPLVIVQASITSELGRI-TSQRLQQLAAIINTSPERNLGLDYSVFGEVKSVSLSSYPKR 319
           V P   V +S+   +G   +S RL+QL   I  S  +NLGL+ ++FG+VK V LSS+   
Sbjct: 258 VSPPTTVHSSVLLRVGTSNSSPRLKQLTDTITGSRSKNLGLNNTIFGKVKQVRLSSFLPN 317

Query: 320 TSKAMPPSFSPAPAPAPGNHVEVPSGPHPLRSMRPPANHSPPHANCKSSSPNPSMVPANS 379
           +S +   S SP+P+P   +H       H         +H   H N             + 
Sbjct: 318 SSDSSTKSPSPSPSPHSKHHHHHHHHHH---------HHHHHHHN------------HHH 377

Query: 380 PHEHSIPPISYPKSTRLIVPPANQPRVYSPRASPVESPPLLPPDLLPKPKPSFRSKSGQT 439
            H H++ P   P+ + +  P  ++ R  +P A         PP   P  +  F+ K  Q 
Sbjct: 378 HHHHNLSPKMAPEVSPVASPAPHRSRKRAPSA---------PPPCNPGNRVHFKEKRVQF 407

Query: 440 NEDPS 442
           +  P+
Sbjct: 438 SSTPA 407

BLAST of Cucsa.049540 vs. TAIR10
Match: AT4G22505.1 (AT4G22505.1 Bifunctional inhibitor/lipid-transfer protein/seed storage 2S albumin superfamily protein)

HSP 1 Score: 49.7 bits (117), Expect = 5.6e-06
Identity = 39/114 (34.21%), Postives = 50/114 (43.86%), Query Frame = 1

Query: 314 PKRTSKAMPPSFSPAPAPAPGNHVEVPSGPHPLRSMRPPANHSPPHANCKSSSPNPSMVP 373
           P RT    PP   P P P   +    P  P PL   R P   SPP A      P P+  P
Sbjct: 155 PPRTPPTSPPRAPPIPPPRTPS-TSPPRAP-PLSPPRTPPT-SPPRA---PPVPPPNTPP 214

Query: 374 ANSPHEHSIPPISYPKSTRLIVPPANQPRVYSPRASPVESPPLLPPDLLPKPKP 428
            + P     PP+S P++     PP + PR  +P  SP  +PP+ PP + P   P
Sbjct: 215 TSPPRA---PPLSPPRT-----PPNSPPR--TPPTSPPRAPPVPPPRISPTAPP 252

BLAST of Cucsa.049540 vs. NCBI nr
Match: gi|778680189|ref|XP_011651267.1| (PREDICTED: uncharacterized protein LOC101222031 isoform X1 [Cucumis sativus])

HSP 1 Score: 912.5 bits (2357), Expect = 2.9e-262
Identity = 453/453 (100.00%), Postives = 453/453 (100.00%), Query Frame = 1

Query: 1   MGKGEEQNLPLQQRREVALTGDSSGFLCGQCSIAFHRVCKELNFKCFFVLVLGFVVFVPG 60
           MGKGEEQNLPLQQRREVALTGDSSGFLCGQCSIAFHRVCKELNFKCFFVLVLGFVVFVPG
Sbjct: 1   MGKGEEQNLPLQQRREVALTGDSSGFLCGQCSIAFHRVCKELNFKCFFVLVLGFVVFVPG 60

Query: 61  FFWLLPLHERNSGFEAKDNIKLSATVQVYFVLEKPVTELLPHIKRLEFDINGELDIPNVK 120
           FFWLLPLHERNSGFEAKDNIKLSATVQVYFVLEKPVTELLPHIKRLEFDINGELDIPNVK
Sbjct: 61  FFWLLPLHERNSGFEAKDNIKLSATVQVYFVLEKPVTELLPHIKRLEFDINGELDIPNVK 120

Query: 121 VSILSMHDIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFG 180
           VSILSMHDIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFG
Sbjct: 121 VSILSMHDIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFG 180

Query: 181 QPSTLQILKFPGGISIIPFQHASIWEFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLRL 240
           QPSTLQILKFPGGISIIPFQHASIWEFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLRL
Sbjct: 181 QPSTLQILKFPGGISIIPFQHASIWEFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLRL 240

Query: 241 RSYENVYLQITNKIGSTVQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS 300
           RSYENVYLQITNKIGSTVQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS
Sbjct: 241 RSYENVYLQITNKIGSTVQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS 300

Query: 301 VFGEVKSVSLSSYPKRTSKAMPPSFSPAPAPAPGNHVEVPSGPHPLRSMRPPANHSPPHA 360
           VFGEVKSVSLSSYPKRTSKAMPPSFSPAPAPAPGNHVEVPSGPHPLRSMRPPANHSPPHA
Sbjct: 301 VFGEVKSVSLSSYPKRTSKAMPPSFSPAPAPAPGNHVEVPSGPHPLRSMRPPANHSPPHA 360

Query: 361 NCKSSSPNPSMVPANSPHEHSIPPISYPKSTRLIVPPANQPRVYSPRASPVESPPLLPPD 420
           NCKSSSPNPSMVPANSPHEHSIPPISYPKSTRLIVPPANQPRVYSPRASPVESPPLLPPD
Sbjct: 361 NCKSSSPNPSMVPANSPHEHSIPPISYPKSTRLIVPPANQPRVYSPRASPVESPPLLPPD 420

Query: 421 LLPKPKPSFRSKSGQTNEDPSHPVHVRIKNNGS 454
           LLPKPKPSFRSKSGQTNEDPSHPVHVRIKNNGS
Sbjct: 421 LLPKPKPSFRSKSGQTNEDPSHPVHVRIKNNGS 453

BLAST of Cucsa.049540 vs. NCBI nr
Match: gi|778680192|ref|XP_004149972.2| (PREDICTED: uncharacterized protein LOC101222031 isoform X2 [Cucumis sativus])

HSP 1 Score: 897.1 bits (2317), Expect = 1.2e-257
Identity = 445/445 (100.00%), Postives = 445/445 (100.00%), Query Frame = 1

Query: 1   MGKGEEQNLPLQQRREVALTGDSSGFLCGQCSIAFHRVCKELNFKCFFVLVLGFVVFVPG 60
           MGKGEEQNLPLQQRREVALTGDSSGFLCGQCSIAFHRVCKELNFKCFFVLVLGFVVFVPG
Sbjct: 1   MGKGEEQNLPLQQRREVALTGDSSGFLCGQCSIAFHRVCKELNFKCFFVLVLGFVVFVPG 60

Query: 61  FFWLLPLHERNSGFEAKDNIKLSATVQVYFVLEKPVTELLPHIKRLEFDINGELDIPNVK 120
           FFWLLPLHERNSGFEAKDNIKLSATVQVYFVLEKPVTELLPHIKRLEFDINGELDIPNVK
Sbjct: 61  FFWLLPLHERNSGFEAKDNIKLSATVQVYFVLEKPVTELLPHIKRLEFDINGELDIPNVK 120

Query: 121 VSILSMHDIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFG 180
           VSILSMHDIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFG
Sbjct: 121 VSILSMHDIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFG 180

Query: 181 QPSTLQILKFPGGISIIPFQHASIWEFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLRL 240
           QPSTLQILKFPGGISIIPFQHASIWEFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLRL
Sbjct: 181 QPSTLQILKFPGGISIIPFQHASIWEFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLRL 240

Query: 241 RSYENVYLQITNKIGSTVQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS 300
           RSYENVYLQITNKIGSTVQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS
Sbjct: 241 RSYENVYLQITNKIGSTVQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS 300

Query: 301 VFGEVKSVSLSSYPKRTSKAMPPSFSPAPAPAPGNHVEVPSGPHPLRSMRPPANHSPPHA 360
           VFGEVKSVSLSSYPKRTSKAMPPSFSPAPAPAPGNHVEVPSGPHPLRSMRPPANHSPPHA
Sbjct: 301 VFGEVKSVSLSSYPKRTSKAMPPSFSPAPAPAPGNHVEVPSGPHPLRSMRPPANHSPPHA 360

Query: 361 NCKSSSPNPSMVPANSPHEHSIPPISYPKSTRLIVPPANQPRVYSPRASPVESPPLLPPD 420
           NCKSSSPNPSMVPANSPHEHSIPPISYPKSTRLIVPPANQPRVYSPRASPVESPPLLPPD
Sbjct: 361 NCKSSSPNPSMVPANSPHEHSIPPISYPKSTRLIVPPANQPRVYSPRASPVESPPLLPPD 420

Query: 421 LLPKPKPSFRSKSGQTNEDPSHPVH 446
           LLPKPKPSFRSKSGQTNEDPSHPVH
Sbjct: 421 LLPKPKPSFRSKSGQTNEDPSHPVH 445

BLAST of Cucsa.049540 vs. NCBI nr
Match: gi|659112144|ref|XP_008456084.1| (PREDICTED: uncharacterized protein LOC103496125 [Cucumis melo])

HSP 1 Score: 861.3 bits (2224), Expect = 7.6e-247
Identity = 427/445 (95.96%), Postives = 436/445 (97.98%), Query Frame = 1

Query: 1   MGKGEEQNLPLQQRREVALTGDSSGFLCGQCSIAFHRVCKELNFKCFFVLVLGFVVFVPG 60
           MGKGEEQNLPLQQRREVAL+GDSSGFLCGQCSIAFHRVCKELNFKCFFVLVLGFVVFVPG
Sbjct: 1   MGKGEEQNLPLQQRREVALSGDSSGFLCGQCSIAFHRVCKELNFKCFFVLVLGFVVFVPG 60

Query: 61  FFWLLPLHERNSGFEAKDNIKLSATVQVYFVLEKPVTELLPHIKRLEFDINGELDIPNVK 120
            FWLLPLHERNSGFEAK+N+KLSATVQVYFVLEKPVTELLPHIKRLEFDINGELDIP+VK
Sbjct: 61  LFWLLPLHERNSGFEAKENVKLSATVQVYFVLEKPVTELLPHIKRLEFDINGELDIPDVK 120

Query: 121 VSILSMHDIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFG 180
           VSILSMHDIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFG
Sbjct: 121 VSILSMHDIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFG 180

Query: 181 QPSTLQILKFPGGISIIPFQHASIWEFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLRL 240
           QPSTLQILKFPGGISIIPFQHASIWEFPQIVFNFTLTNSISEILDNFAKFKS+LKFGLRL
Sbjct: 181 QPSTLQILKFPGGISIIPFQHASIWEFPQIVFNFTLTNSISEILDNFAKFKSELKFGLRL 240

Query: 241 RSYENVYLQITNKIGSTVQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS 300
           RSYENVYLQITNKIGSTVQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS
Sbjct: 241 RSYENVYLQITNKIGSTVQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS 300

Query: 301 VFGEVKSVSLSSYPKRTSKAMPPSFSPAPAPAPGNHVEVPSGPHPLRSMRPPANHSPPHA 360
           VFGEVKSVSLSSYPKRTSKAMPPSFSPAPAPAPG+HVEVPS PH LRS RPPANHSPPHA
Sbjct: 301 VFGEVKSVSLSSYPKRTSKAMPPSFSPAPAPAPGDHVEVPSDPHRLRSTRPPANHSPPHA 360

Query: 361 NCKSSSPNPSMVPANSPHEHSIPPISYPKSTRLIVPPANQPRVYSPRASPVESPPLLPPD 420
           NCKS SPNPSMVPA+SPHEHSIPPISYPKSTRL+VPPANQPRV SPRASP+E PPLLPPD
Sbjct: 361 NCKSLSPNPSMVPAHSPHEHSIPPISYPKSTRLVVPPANQPRVSSPRASPIEFPPLLPPD 420

Query: 421 LLPKPKPSFRSKSGQTNEDPSHPVH 446
           LLPKPKPSF SKSGQTNED SHPVH
Sbjct: 421 LLPKPKPSFHSKSGQTNEDLSHPVH 445

BLAST of Cucsa.049540 vs. NCBI nr
Match: gi|778680195|ref|XP_011651268.1| (PREDICTED: uncharacterized protein LOC101222031 isoform X3 [Cucumis sativus])

HSP 1 Score: 560.5 bits (1443), Expect = 2.8e-156
Identity = 307/453 (67.77%), Postives = 326/453 (71.96%), Query Frame = 1

Query: 1   MGKGEEQNLPLQQRREVALTGDSSGFLCGQCSIAFHRVCKELNFKCFFVLVLGFVVFVPG 60
           MGKGEEQNLPLQQRREVALTGDSSGFLCGQCSIAFHRVCKELNFKCFFVLVLGFVVFVPG
Sbjct: 1   MGKGEEQNLPLQQRREVALTGDSSGFLCGQCSIAFHRVCKELNFKCFFVLVLGFVVFVPG 60

Query: 61  FFWLLPLHERNSGFEAKDNIKLSATVQVYFVLEKPVTELLPHIKRLEFDINGELDIPNVK 120
           FFWLLPLHERNSGFEAKDNIKLSATVQVYFVLEKPVTELLPHIKRLEFDINGELDIPNVK
Sbjct: 61  FFWLLPLHERNSGFEAKDNIKLSATVQVYFVLEKPVTELLPHIKRLEFDINGELDIPNVK 120

Query: 121 VSILSMHDIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFG 180
            S L                   + +  P     +    +S+++F     N TLT SI  
Sbjct: 121 PSTL-------------------QILKFPGGISIIPFQHASIWEFPQIVFNFTLTNSI-- 180

Query: 181 QPSTLQILKFPGGISIIPFQHASIWEFPQIVFNFTLTNSISEI---LDNFAKFKSQL--K 240
                                       +I+ NF    S  +    L ++     Q+  K
Sbjct: 181 ---------------------------SEILDNFAKFKSQLKFGLRLRSYENVYLQITNK 240

Query: 241 FGLRLRSYENVYLQITNKIGS-TVQPLVIVQASITSELGRITSQRLQQLAAIINTSPERN 300
            G  ++    V   IT+++G  T Q L  + A I                    TSPERN
Sbjct: 241 IGSTVQPLVIVQASITSELGRITSQRLQQLAAIIN-------------------TSPERN 300

Query: 301 LGLDYSVFGEVKSVSLSSYPKRTSKAMPPSFSPAPAPAPGNHVEVPSGPHPLRSMRPPAN 360
           LGLDYSVFGEVKSVSLSSYPKRTSKAMPPSFSPAPAPAPGNHVEVPSGPHPLRSMRPPAN
Sbjct: 301 LGLDYSVFGEVKSVSLSSYPKRTSKAMPPSFSPAPAPAPGNHVEVPSGPHPLRSMRPPAN 360

Query: 361 HSPPHANCKSSSPNPSMVPANSPHEHSIPPISYPKSTRLIVPPANQPRVYSPRASPVESP 420
           HSPPHANCKSSSPNPSMVPANSPHEHSIPPISYPKSTRLIVPPANQPRVYSPRASPVESP
Sbjct: 361 HSPPHANCKSSSPNPSMVPANSPHEHSIPPISYPKSTRLIVPPANQPRVYSPRASPVESP 386

Query: 421 PLLPPDLLPKPKPSFRSKSGQTNEDPSHPVHVR 448
           PLLPPDLLPKPKPSFRSKSGQTNEDPSHPVHVR
Sbjct: 421 PLLPPDLLPKPKPSFRSKSGQTNEDPSHPVHVR 386

BLAST of Cucsa.049540 vs. NCBI nr
Match: gi|657974018|ref|XP_008378828.1| (PREDICTED: uncharacterized protein LOC103441889 [Malus domestica])

HSP 1 Score: 414.5 bits (1064), Expect = 2.5e-112
Identity = 239/436 (54.82%), Postives = 286/436 (65.60%), Query Frame = 1

Query: 1   MGKGEEQNLPLQQRREVALTGDSSGFLCGQCSIAFHRVCKELNFKCFFVLVLGFVVFVPG 60
           MGKGE  NL  QQ      +  +SG +C  CS+ F+R+ K L+F+C FVLVL   VF+ G
Sbjct: 1   MGKGEA-NLHQQQPNHEGQS--ASGLICPGCSMVFNRIAKGLSFRCVFVLVLSLSVFLSG 60

Query: 61  FFWLLPLHERNSGFEAKDNIKLSATVQVYFVLEKPVTELLPHIKRLEFDINGELDIPNVK 120
            FW+LP    NSGF+A   IKLSATVQ YF LEKPVT+L+PHI RLE+DINGE+ +P  K
Sbjct: 61  IFWILPHRATNSGFDATQAIKLSATVQAYFRLEKPVTDLVPHIGRLEYDINGEIGVPGTK 120

Query: 121 VSILSMHDIGESNRTYVVFGLLSEYITAPINPVSLSLLRSSLYDFFLSESNLTLTTSIFG 180
           V+ILSMH     N T VVFG LS+ I  PI PVSLS+LRSSL + FL +SNLTLTTSIFG
Sbjct: 121 VAILSMHQFPAHNWTDVVFGFLSDPINVPIVPVSLSVLRSSLVELFLKQSNLTLTTSIFG 180

Query: 181 QPSTLQILKFPGGISIIPFQHASIWEFPQIVFNFTLTNSISEILDNFAKFKSQLKFGLRL 240
           QPSTL+ILK+PGGI++IP Q ASIW+ P+I+FNFTL N I +I++NF + K QLKFGL L
Sbjct: 181 QPSTLEILKYPGGITVIPGQPASIWQLPEILFNFTLNNXIDDIVENFGELKEQLKFGLYL 240

Query: 241 RSYENVYLQITNKIGSTVQPLVIVQASITSELGRITSQRLQQLAAIINTSPERNLGLDYS 300
           R YENVYLQITN +GST    V+VQAS+ SE G    QRL+QLA II  SP +NLGLD S
Sbjct: 241 RPYENVYLQITNTMGSTTAAPVVVQASLMSEFGGFGPQRLRQLAQIITGSPTKNLGLDNS 300

Query: 301 VFGEVKSVSLSSYPKRTSKAMPPSFSPAPAPAPGNHVEVPSGPHPLRSMRPPANHSPPHA 360
           VFG+ KS+SLSSY K T  A PP+ SPAP P P         P+    +  PA    P  
Sbjct: 301 VFGKXKSISLSSYLKGTLSATPPTVSPAPTPEPS------ISPYLASPVYAPA----PSP 360

Query: 361 NCKSSSPNPSMVPAN---SPHEHS-IPPISYPKSTRL-IVPPANQPRVYSPRASPVESPP 420
           +    S  PS VPA+    PH+ S IPP S P S     VPP      Y PR  P  SPP
Sbjct: 361 DIHHLSSAPSKVPAHPHPXPHQGSRIPPSSPPTSRSYPTVPP-----TYPPRIPPSSSPP 418

Query: 421 --LLPPDLLPKPKPSF 430
              L P + P P  S+
Sbjct: 421 SSQLSPHVSPAPVVSY 418

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PEXLP_TOBAC5.3e-0629.91Pistil-specific extensin-like protein OS=Nicotiana tabacum PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0L6J0_CUCSA8.7e-258100.00Uncharacterized protein OS=Cucumis sativus GN=Csa_3G214050 PE=4 SV=1[more]
A0A067KRP0_JATCU4.7e-11050.79Uncharacterized protein OS=Jatropha curcas GN=JCGZ_05038 PE=4 SV=1[more]
V4T6J0_9ROSI2.3e-10950.67Uncharacterized protein OS=Citrus clementina GN=CICLE_v10000890mg PE=4 SV=1[more]
A0A067DJZ1_CITSI1.5e-10850.00Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g008625mg PE=4 SV=1[more]
A0A061DX06_THECC6.3e-10748.65Hydroxyproline-rich glycoprotein family protein, putative OS=Theobroma cacao GN=... [more]
Match NameE-valueIdentityDescription
AT3G56590.28.8e-5232.52 hydroxyproline-rich glycoprotein family protein[more]
AT1G10790.13.4e-5138.37 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glyc... [more]
AT3G10810.12.2e-4732.47 zinc finger (C3HC4-type RING finger) family protein[more]
AT4G22505.15.6e-0634.21 Bifunctional inhibitor/lipid-transfer protein/seed storage 2S albumi... [more]
Match NameE-valueIdentityDescription
gi|778680189|ref|XP_011651267.1|2.9e-262100.00PREDICTED: uncharacterized protein LOC101222031 isoform X1 [Cucumis sativus][more]
gi|778680192|ref|XP_004149972.2|1.2e-257100.00PREDICTED: uncharacterized protein LOC101222031 isoform X2 [Cucumis sativus][more]
gi|659112144|ref|XP_008456084.1|7.6e-24795.96PREDICTED: uncharacterized protein LOC103496125 [Cucumis melo][more]
gi|778680195|ref|XP_011651268.1|2.8e-15667.77PREDICTED: uncharacterized protein LOC101222031 isoform X3 [Cucumis sativus][more]
gi|657974018|ref|XP_008378828.1|2.5e-11254.82PREDICTED: uncharacterized protein LOC103441889 [Malus domestica][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cucsa.049540.4Cucsa.049540.4mRNA
Cucsa.049540.2Cucsa.049540.2mRNA
Cucsa.049540.1Cucsa.049540.1mRNA


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR33826FAMILY NOT NAMEDcoord: 1..382
score: 3.1E
NoneNo IPR availablePANTHERPTHR33826:SF4F20B24.21coord: 1..382
score: 3.1E