Cucsa.106790 (gene) Cucumber (Gy14) v1

NameCucsa.106790
Typegene
OrganismCucumis sativus (Cucumber (Gy14) v1)
DescriptionHydroxyproline-rich glycoprotein family protein
Locationscaffold00930 : 295916 .. 298536 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GTTTCCGTCTTGTTGCTTCTATCTCGCTAACGAACCTTCTCTTCACTTTCATACTGCAAAATCCCCTTGTTTCGAATTTGCCTAAGAATTTCATGATCTCTTCTTTGTATGTTGAATTACGATTCTTCTTTCTATGAAACCCATCGGCGATTCTCTCTTTCCAAGTAACGATAGCGATGAGACGACGTACGGATACTGATGATTTCAGGCCTGTTAACAATAATACTTTTCAAACAATTACTGCCGCCGCTGATGCGATCGCAACCGTCGATCATCGTTTCCCTCGGGCTACTGCCGTCCAGGTATTATTATTGTATTCTATTGTTCGGATCAATTCACCTTCTATTCAAATCATTCATTTGGTATTTTTAGAATTTAGTGTTGGTTACTTATGGATTGATAATATTGACCTTGTTTTTGGAAATTACTATTCCTTCTTAATTCCGTTAGGTATTAGGATGTACTGTAGTtGTTTTTCGATGGATGAGATGATATTATAGCGGAATGGAATCTGTGTTGTTTTTGTTTTGGATTGTGAAATTCTATTTAAGGCTTTTAATTTCCCATTTTATTGCCTGTTCCTTCTGTTTGTTTGCTTAGAAACTGTGAATCGTCACATGGAAACATGAAAAAAAGGGGAATCACGAGCTGAGTTGGGTTCTGCTTTTTTCTGCTTTTTCTGCTTTTGGCTTTCTTCTTCCCTTATGATTATTAGATTTTTAAGGATTGGGAAGTACCAAGATTAGTGTTATTGGGTTTGGATGACTGTGTCAGGAAGCAATGTGCTGCTGTAGCCCCAATGTTATTTGtCTTTTTGTAATAACACTAGGCACTTTTtAACCAGAAAAAAGCATCTCTTTCTCTCCATTCTCTTTATCTTTTCAAAAAaGAGAGAGAGAGAGGTGTTGTTTTTtAAATGCATTCTATATAGAGGAAGAGTATTGCTTTTTTtATAAAGATTCCAACAAATGAGGCTCTCATTTGCCCAAGCACTTCCCcACTGTTTTGTACTGATGTCATTATCAGTTTCCTCTTTTTCTCTTTGTTTGATTACTAGCAGAAAaGAaGATGGGGCAGTTGTTTGAGTATTTATTGGTGCTTTGGATCTATCAAACAGAGGAAAAGAATCGGGCATGCTGTATTGGTACCAGAACCAAGTCCTTCATCTGAGCCTCATGAAAATACATTACAATCACCAGATATTGTGCTTCCTTTTGCTGCACCTCCTTCTTCCCCTGTTTCCTTACTTCAATCTGAACCACCTTCTGCTATGCAGTCGCCTACTGCTTTAATCTCTTTCACTTCTCTTACTGCTAACATGTATTCTCCTGATGGGCCTTCCTCCATTTTTGCCATTGGCCCATTTGCTCATGAACCACAATTAGTGTCTCCACCTCTGAATTTCTCTACTCTTACTACTGAACCATCAACTCCCTTCACTCCTCCCGAATCTATCCACTTGACTACACCTTCTTCCCCTGAAGTTCCTTTTGCTCAGTTTGTTCAACCTACTCTTCCGAAAGTTGAGTCTGATAATCAATATACATTTCCTAATGATGATTTCCAATCTTACCAATTCTATCCAGGTAGTCCGGTTAGTCACCTCATATCACCACGGTCGGTTATTTCTCGTTCTGGGGCTTCATCGCCTTTGCCTGACTATGATTTTGCTTCCTTTGGTTCTCAATTTTTGAATTTCCCACTAGAAGTTCCACCTACTTTGTTGAACCTTGACAAACATTCCATTCATAACTGGCGACAACGTCAAAGTACTGATTCTTGTACTCAAGATTCTATAGAATTCAAATCAAGTAATGACTTTGTTTTGAATCCCCAAACTTCAGAATCTATGTCTGATCACCACGCAACAAATGAATCTCAAAATATTCAAATTCTCATCGATGATGGAAGTAAAAAGGAGGAGGAGCCAGGTGCTACTAATCATAGATTCTCATTTGAGCTATCTGATGGGGATGTTTTATTACAAAGCGTAGGAAGTAAGCCATTGGAATCAAATGAACTTGCGGTTGAATCATCGCCAATACATGAACCATTTGAAACGACTAAAGAAAATTCTCCTCATGGTGACCATACTTCAAATGTTATAGAAGAAAAGACAAAAGCTGACGGTGATGAAGCACATCAACGTCAAGAACATCATTCCGTTACACTTGGGTCTGTGAAGGAATTCAATTTTGATAATGGTAATGGAAGTGACACACATAACCCAAATATAAATTCAGAATGGTGGATTAATGCAAAGGATGGTAGCACAGAAAGCACAGCCACCGGGACCTGGTCATTCTTTCCAATGACGCAACAAAGATGAGCAAACTGGGGCAGTTGCAAATCGATAGGTAAGACCAACAGCAAGAGGAATTGTTAGTTTTGAAGGTTTTAAACATGTCAAATTATGAAAGAGCCTGACCAGAAGCCTTTTTTTtCAACAATATGACCTAAAACAAACAACGCCAGATATTATTAGATAGAACGATAGAGAAATTGTAGATTCAATAGGACCTTATTAACAAACACTTGTGGCTTGTGACTCGTCACTTGGATTGTAATAGATATCAAAGTCTTGATAGAAATTGAAAGCATGTAAATATGGTAATAAGAAG

mRNA sequence

GTTTCCGTCTTGTTGCTTCTATCTCGCTAACGAACCTTCTCTTCACTTTCATACTGCAAAATCCCCTTGTTTCGAATTTGCCTAAGAATTTCATGATCTCTTCTTTGTATGTTGAATTACGATTCTTCTTTCTATGAAACCCATCGGCGATTCTCTCTTTCCAAGTAACGATAGCGATGAGACGACGTACGGATACTGATGATTTCAGGCCTGTTAACAATAATACTTTTCAAACAATTACTGCCGCCGCTGATGCGATCGCAACCGTCGATCATCGTTTCCCTCGGGCTACTGCCGTCCAGAAAAGAAGATGGGGCAGTTGTTTGAGTATTTATTGGTGCTTTGGATCTATCAAACAGAGGAAAAGAATCGGGCATGCTGTATTGGTACCAGAACCAAGTCCTTCATCTGAGCCTCATGAAAATACATTACAATCACCAGATATTGTGCTTCCTTTTGCTGCACCTCCTTCTTCCCCTGTTTCCTTACTTCAATCTGAACCACCTTCTGCTATGCAGTCGCCTACTGCTTTAATCTCTTTCACTTCTCTTACTGCTAACATGTATTCTCCTGATGGGCCTTCCTCCATTTTTGCCATTGGCCCATTTGCTCATGAACCACAATTAGTGTCTCCACCTCTGAATTTCTCTACTCTTACTACTGAACCATCAACTCCCTTCACTCCTCCCGAATCTATCCACTTGACTACACCTTCTTCCCCTGAAGTTCCTTTTGCTCAGTTTGTTCAACCTACTCTTCCGAAAGTTGAGTCTGATAATCAATATACATTTCCTAATGATGATTTCCAATCTTACCAATTCTATCCAGGTAGTCCGGTTAGTCACCTCATATCACCACGGTCGGTTATTTCTCGTTCTGGGGCTTCATCGCCTTTGCCTGACTATGATTTTGCTTCCTTTGGTTCTCAATTTTTGAATTTCCCACTAGAAGTTCCACCTACTTTGTTGAACCTTGACAAACATTCCATTCATAACTGGCGACAACGTCAAAGTACTGATTCTTGTACTCAAGATTCTATAGAATTCAAATCAAGTAATGACTTTGTTTTGAATCCCCAAACTTCAGAATCTATGTCTGATCACCACGCAACAAATGAATCTCAAAATATTCAAATTCTCATCGATGATGGAAGTAAAAAGGAGGAGGAGCCAGGTGCTACTAATCATAGATTCTCATTTGAGCTATCTGATGGGGATGTTTTATTACAAAGCGTAGGAAGTAAGCCATTGGAATCAAATGAACTTGCGGTTGAATCATCGCCAATACATGAACCATTTGAAACGACTAAAGAAAATTCTCCTCATGGTGACCATACTTCAAATGTTATAGAAGAAAAGACAAAAGCTGACGGTGATGAAGCACATCAACGTCAAGAACATCATTCCGTTACACTTGGGTCTGTGAAGGAATTCAATTTTGATAATGGTAATGGAAGTGACACACATAACCCAAATATAAATTCAGAATGGTGGATTAATGCAAAGGATGGTAGCACAGAAAGCACAGCCACCGGGACCTGGTCATTCTTTCCAATGACGCAACAAAGATGAGCAAACTGGGGCAGTTGCAAATCGATAGGTAAGACCAACAGCAAGAGGAATTGTTAGTTTTGAAGGTTTTAAACATGTCAAATTATGAAAGAGCCTGACCAGAAGCCTTTTTTTTCAACAATATGACCTAAAACAAACAACGCCAGATATTATTAGATAGAACGATAGAGAAATTGTAGATTCAATAGGACCTTATTAACAAACACTTGTGGCTTGTGACTCGTCACTTGGATTGTAATAGATATCAAAGTCTTGATAGAAATTGAAAGCATGTAAATATGGTAATAAGAAG

Coding sequence (CDS)

ATGAGACGACGTACGGATACTGATGATTTCAGGCCTGTTAACAATAATACTTTTCAAACAATTACTGCCGCCGCTGATGCGATCGCAACCGTCGATCATCGTTTCCCTCGGGCTACTGCCGTCCAGAAAaGAaGATGGGGCAGTTGTTTGAGTATTTATTGGTGCTTTGGATCTATCAAACAGAGGAAAAGAATCGGGCATGCTGTATTGGTACCAGAACCAAGTCCTTCATCTGAGCCTCATGAAAATACATTACAATCACCAGATATTGTGCTTCCTTTTGCTGCACCTCCTTCTTCCCCTGTTTCCTTACTTCAATCTGAACCACCTTCTGCTATGCAGTCGCCTACTGCTTTAATCTCTTTCACTTCTCTTACTGCTAACATGTATTCTCCTGATGGGCCTTCCTCCATTTTTGCCATTGGCCCATTTGCTCATGAACCACAATTAGTGTCTCCACCTCTGAATTTCTCTACTCTTACTACTGAACCATCAACTCCCTTCACTCCTCCCGAATCTATCCACTTGACTACACCTTCTTCCCCTGAAGTTCCTTTTGCTCAGTTTGTTCAACCTACTCTTCCGAAAGTTGAGTCTGATAATCAATATACATTTCCTAATGATGATTTCCAATCTTACCAATTCTATCCAGGTAGTCCGGTTAGTCACCTCATATCACCACGGTCGGTTATTTCTCGTTCTGGGGCTTCATCGCCTTTGCCTGACTATGATTTTGCTTCCTTTGGTTCTCAATTTTTGAATTTCCCACTAGAAGTTCCACCTACTTTGTTGAACCTTGACAAACATTCCATTCATAACTGGCGACAACGTCAAAGTACTGATTCTTGTACTCAAGATTCTATAGAATTCAAATCAAGTAATGACTTTGTTTTGAATCCCCAAACTTCAGAATCTATGTCTGATCACCACGCAACAAATGAATCTCAAAATATTCAAATTCTCATCGATGATGGAAGTAAAAAGGAGGAGGAGCCAGGTGCTACTAATCATAGATTCTCATTTGAGCTATCTGATGGGGATGTTTTATTACAAAGCGTAGGAAGTAAGCCATTGGAATCAAATGAACTTGCGGTTGAATCATCGCCAATACATGAACCATTTGAAACGACTAAAGAAAATTCTCCTCATGGTGACCATACTTCAAATGTTATAGAAGAAAAGACAAAAGCTGACGGTGATGAAGCACATCAACGTCAAGAACATCATTCCGTTACACTTGGGTCTGTGAAGGAATTCAATTTTGATAATGGTAATGGAAGTGACACACATAACCCAAATATAAATTCAGAATGGTGGATTAATGCAAAGGATGGTAGCACAGAAAGCACAGCCACCGGGACCTGGTCATTCTTTCCAATGACGCAACAAAGATGA

Protein sequence

MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPFTPPESIHLTTPSSPEVPFAQFVQPTLPKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR*
BLAST of Cucsa.106790 vs. Swiss-Prot
Match: Y1666_ARATH (Uncharacterized protein At1g76660 OS=Arabidopsis thaliana GN=At1g76660 PE=2 SV=1)

HSP 1 Score: 147.5 bits (371), Expect = 3.6e-34
Identity = 99/210 (47.14%), Postives = 121/210 (57.62%), Query Frame = 1

Query: 42  QKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPS--PSSEPHE-------NTLQSPDIVL 101
           Q++RWG CL ++ CF S K  KRI  A  +PE     +S+P+        N   +  I L
Sbjct: 7   QRKRWGGCLGVFSCFKSQKGGKRIVPASRIPEGGNVSASQPNGAHQAGVLNNQAAGGINL 66

Query: 102 PFAAPPSSPVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSS-IFAIGPFAHEPQLV 161
              APPSSP S   S  PS  QSP     + SL AN  SP GPSS ++A GP+AHE QLV
Sbjct: 67  SLLAPPSSPASFTNSALPSTTQSPNC---YLSLAAN--SPGGPSSSMYATGPYAHETQLV 126

Query: 162 SPPLNFSTLTTEPST-PFT-PPESIHLTTPSSPEVPFAQFVQPTLPKVESDNQYTFPNDD 221
           SPP+ FST TTEPST PFT PPE   LT PSSP+VP+A+F+  ++    S   +   ND 
Sbjct: 127 SPPV-FSTFTTEPSTAPFTPPPELARLTAPSSPDVPYARFLTSSMDLKNSGKGHY--NDL 186

Query: 222 FQSYQFYPGSPVSHLISPRSVISRSGASSP 240
             +Y  YPGSP S L SP S  S  G  SP
Sbjct: 187 QATYSLYPGSPASALRSPISRASGDGLLSP 208

BLAST of Cucsa.106790 vs. TrEMBL
Match: A0A0A0KY57_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G047980 PE=4 SV=1)

HSP 1 Score: 726.9 bits (1875), Expect = 1.6e-206
Identity = 352/352 (100.00%), Postives = 352/352 (100.00%), Query Frame = 1

Query: 113 MQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPFTPPE 172
           MQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPFTPPE
Sbjct: 1   MQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPFTPPE 60

Query: 173 SIHLTTPSSPEVPFAQFVQPTLPKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVIS 232
           SIHLTTPSSPEVPFAQFVQPTLPKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVIS
Sbjct: 61  SIHLTTPSSPEVPFAQFVQPTLPKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVIS 120

Query: 233 RSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKS 292
           RSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKS
Sbjct: 121 RSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKS 180

Query: 293 SNDFVLNPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQS 352
           SNDFVLNPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQS
Sbjct: 181 SNDFVLNPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQS 240

Query: 353 VGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQRQEHHSVT 412
           VGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQRQEHHSVT
Sbjct: 241 VGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQRQEHHSVT 300

Query: 413 LGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR 465
           LGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR
Sbjct: 301 LGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR 352

BLAST of Cucsa.106790 vs. TrEMBL
Match: A0A067JZI1_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_20571 PE=4 SV=1)

HSP 1 Score: 425.6 bits (1093), Expect = 7.6e-116
Identity = 242/471 (51.38%), Postives = 297/471 (63.06%), Query Frame = 1

Query: 4   RTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSIKQRK 63
           R    D RP +NN   TI AAA AIA+ ++R P+AT VQKRRWGSC S+YWCFG  + RK
Sbjct: 2   RAVNGDSRP-SNNALDTINAAASAIASAENRVPQAT-VQKRRWGSCFSVYWCFGYNRHRK 61

Query: 64  RIGHAVLVPE---PSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAMQSPTALI 123
           RIGHAVLVPE   P   S   EN+ Q+P I LPF APPSSP S LQSEPPSA QSPT ++
Sbjct: 62  RIGHAVLVPETPGPRNDSSAAENSTQTPTITLPFVAPPSSPASFLQSEPPSASQSPTGVL 121

Query: 124 SFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFT-PPESIHLTT 183
           S TS++ANMYSP GPSSIFAIGP+AHE QLVSPP+ FST TTEPST PFT PPES+HLTT
Sbjct: 122 SLTSISANMYSPSGPSSIFAIGPYAHETQLVSPPV-FSTFTTEPSTAPFTPPPESVHLTT 181

Query: 184 PSSPEVPFAQFVQPTLPKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASS 243
           PSSPEVPFAQ + P++  VE+  ++   N +FQSYQ YPGSPV  LISP S IS SG SS
Sbjct: 182 PSSPEVPFAQLLDPSIRNVEAGLRFPLSNYEFQSYQLYPGSPVGQLISPSSGISGSGTSS 241

Query: 244 PLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVL 303
           P PD +FA   + FL F +  PP LLNLDK S H W  R  + + T D++   +S  F  
Sbjct: 242 PFPDGEFA---AGFLEFRMGEPPKLLNLDKLSTHEWGSRCGSGTLTPDAVR-PTSCSFTP 301

Query: 304 NPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVL--LQSVGSK 363
           +   S+ +S  H+ N +QN ++               +HR SFEL+   VL   +   + 
Sbjct: 302 DRPFSDFVSHKHSDNGNQNDEV--------------GDHRLSFELAAEGVLGCEEQNPAS 361

Query: 364 PL----ESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQRQE-HHSV 423
           P+    +S E    ++   +  E   +       TSN   EK   DG++A  R E H S+
Sbjct: 362 PVKIIGDSLENGTVAARTEDSTEVVDDFESRVGETSNGTPEKASTDGEKAPPRHEKHRSI 421

Query: 424 TLGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQ 463
           TLGS+KEFNFDN +G D+H PN   +WW N  D   E  AT  WSFFPM Q
Sbjct: 422 TLGSLKEFNFDNVDGGDSHKPNAGPDWWANGSDIGKEDGATKNWSFFPMMQ 451

BLAST of Cucsa.106790 vs. TrEMBL
Match: M5WM36_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005552mg PE=4 SV=1)

HSP 1 Score: 412.1 bits (1058), Expect = 8.7e-112
Identity = 240/461 (52.06%), Postives = 290/461 (62.91%), Query Frame = 1

Query: 15  NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEP 74
           NN  +TI AAA AIA  ++R P+AT VQKRRWGS  S+YWCFG  + +KRIGHAVLVPE 
Sbjct: 12  NNALETINAAASAIAAAENRVPQAT-VQKRRWGSWWSMYWCFGFQRHKKRIGHAVLVPET 71

Query: 75  SP---SSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAMQSPTALISFTSLTANMYS 134
           +     +   EN +Q+P IVLPF APPSSP S LQSEPPSA QSP     F SLTA+MYS
Sbjct: 72  TDRGGDAPRAENPIQTPSIVLPFVAPPSSPASFLQSEPPSATQSPAG---FFSLTASMYS 131

Query: 135 PDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPP-ESIHLTTPSSPEVPFAQF 194
           P GP+SIFAIGP+AHE QLVSPP+ FST TTEPST PFTPP ES+HLTTPSSPEVPFAQ 
Sbjct: 132 PSGPTSIFAIGPYAHETQLVSPPV-FSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQL 191

Query: 195 VQPTLPKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFG 254
           + P     E   ++   + +FQSYQ YPGSPV  LISP S IS SG SSP PD +FA+ G
Sbjct: 192 LDPHFRNGEGGQRFPLSHYEFQSYQLYPGSPVGQLISPSSGISGSGTSSPFPDLEFAARG 251

Query: 255 SQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNPQTSESMSDH 314
             FL F    PP LLNLD  S  +W  R  + S T D  +  SS+ F+L PQT E + + 
Sbjct: 252 HHFLEFRTGDPPKLLNLDILSTRDWGSRLGSGSVTPDGAKSTSSDGFLLKPQTPEVVLNP 311

Query: 315 HATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSP 374
            + N  +N  I I             NHR SFELS  +V ++ V  KP+   E    S  
Sbjct: 312 RSNNRGRNNDISI-------------NHRVSFELSSEEV-IRCVEKKPVALAEAVSTSLE 371

Query: 375 IHEPFETTKENS--------PHGDHTSNVIEEKTKADGDEAHQRQEHHSVTLGSVKEFNF 434
             E  ++ ++ S        P G+ TSN   EK  ADG+EA    +  S+TLGSVKEFNF
Sbjct: 372 DTEKAQSKEDPSKVVSSSICPVGE-TSNDAAEKAVADGEEAQLHPKQRSITLGSVKEFNF 431

Query: 435 DNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQ 463
           DN +G D+ N +I S+WW N K  + E+  T  WSFFPM Q
Sbjct: 432 DNPDGGDSGN-SIGSDWWANEKVDAKENGPTKNWSFFPMMQ 451

BLAST of Cucsa.106790 vs. TrEMBL
Match: B9RCD8_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1687100 PE=4 SV=1)

HSP 1 Score: 406.0 bits (1042), Expect = 6.2e-110
Identity = 237/480 (49.38%), Postives = 298/480 (62.08%), Query Frame = 1

Query: 1   MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSIK 60
           MR      D RP +NN   TI AAA  IA+ ++R P+AT +QKRRWGSC S+YWCFG  +
Sbjct: 2   MRNVNGGADSRP-SNNALDTINAAASVIASAENRVPQAT-IQKRRWGSCWSVYWCFGYHR 61

Query: 61  QRKRIGHAVLVPE---PSPSSEPHEN-TLQSPDIVLPFAAPPSSPVSLLQSEPPSAMQSP 120
            RKRIGHAVLVPE   P   S   EN T Q+P I LPF APPSSP S LQSEPPSA QSP
Sbjct: 62  HRKRIGHAVLVPENSAPGNDSSAAENPTTQAPTITLPFVAPPSSPASFLQSEPPSASQSP 121

Query: 121 TALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTP-PESI 180
             ++S TS++A+MYSP GP+SIFAIGP+AHE QLVSPP  FST TTEPST PFTP PES+
Sbjct: 122 AGILSLTSVSASMYSPSGPASIFAIGPYAHETQLVSPPA-FSTFTTEPSTAPFTPPPESV 181

Query: 181 HLTTPSSPEVPFAQFVQPTLPKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRS 240
            LTTPSSPEVPFAQ ++P+    E+  ++ F N +FQSYQFYPGSPV  LISP S IS S
Sbjct: 182 QLTTPSSPEVPFAQLLEPSNRNGEAGLRFPFSNYEFQSYQFYPGSPVGQLISPSSGISGS 241

Query: 241 GASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSN 300
           G SSP PD +FA+ G +FL F + VPP LLNLDK S+H    RQ + + T D++   +S 
Sbjct: 242 GTSSPFPDGEFAAAGPRFLEFQMAVPPKLLNLDKLSVHECGSRQGSGTLTPDAVR-ATSC 301

Query: 301 DFVLNPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVL--LQS 360
            F L+ Q S+  S+ H+ NE+++ Q+               + R SF+LS  D L   + 
Sbjct: 302 SFPLDRQCSDIASNRHSDNENKDDQV--------------ADLRVSFDLSAEDALRYAEP 361

Query: 361 VGSKPLE------SNELAVE----SSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEA 420
             + P++       NE+A E    SS I   FE           TSN I E+    G++ 
Sbjct: 362 KPASPVKIMPESMKNEIAAEKVQKSSEIRHNFEC------RVGETSNGILEQASTGGEKT 421

Query: 421 HQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQ 463
            + Q+H ++TLG+ KEFNFDN +G     P+   +WW N  D   E      WSFFP+ Q
Sbjct: 422 PRHQKHRTLTLGTFKEFNFDNADG--VPKPSAGPDWWDNGSDVGKEDFTAKNWSFFPVMQ 455

BLAST of Cucsa.106790 vs. TrEMBL
Match: A0A067G9B9_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g012593mg PE=4 SV=1)

HSP 1 Score: 398.3 bits (1022), Expect = 1.3e-107
Identity = 235/460 (51.09%), Postives = 286/460 (62.17%), Query Frame = 1

Query: 15  NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEP 74
           NN+ +TI+AAA AIA+ ++R  +AT+ QKRRWG C SI WCFG  K RKRIGHAVLVPEP
Sbjct: 13  NNSLETISAAATAIASAENRVHQATS-QKRRWGGCWSISWCFGFQKHRKRIGHAVLVPEP 72

Query: 75  SPS-SEPHE--NTLQSPDIVLPFAAPPSSPVSLLQSEPPSAMQSPTALISFTSLTANMYS 134
           + S S   E  N+ Q+  I LPF APPSSP S LQSEPPSA QSP  L+S  S++ NMYS
Sbjct: 73  TASRSNASEAVNSTQAAAISLPFVAPPSSPASFLQSEPPSATQSPAGLVSLNSISGNMYS 132

Query: 135 PDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPP-ESIHLTTPSSPEVPFAQF 194
           P GPSSIFAIGP+AHE QLVSPP+ FST TTEPST PFTPP ES+HLTTPSSPEVPFAQ 
Sbjct: 133 PGGPSSIFAIGPYAHETQLVSPPV-FSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQL 192

Query: 195 VQPTLPKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFG 254
           + P+L   E   ++ F   +FQSY  +PGSPV +LISP S IS SG SSP PD +FA+ G
Sbjct: 193 LDPSLRFGEQGQKFPFSYYEFQSYHLHPGSPVGNLISPSSGISGSGTSSPFPDGEFATAG 252

Query: 255 SQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNPQTSESMSDH 314
            QF +F    PP LLNLDK SI  W  RQ + + T D++     N F  N Q SE     
Sbjct: 253 PQFPDFHRGDPPKLLNLDKLSIREWGSRQGSGTLTPDAVRSTPRNGFFQNRQISEVALRP 312

Query: 315 HATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVES-- 374
           H+ N  +  QI+              +HR SFEL+  DV ++ V  KP    E   ES  
Sbjct: 313 HSENGLRKDQIV--------------DHRVSFELTTEDV-VRCVEKKPTTLAEAVSESLQ 372

Query: 375 ---SPIHEPFETTKENSPH--GDHTSNVIEEKTKADGDEAHQRQEHHSVTLGSVKEFNFD 434
              +   E      EN  H      +N    KT  D +EA + Q+  S+TLGS KEFNFD
Sbjct: 373 NGTTVEKEESSGEAENVHHSCAGEAANDEPLKTPVDVEEAPRHQKQQSITLGSTKEFNFD 432

Query: 435 NGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQ 463
           + +G D+H P I S+WW N K    +S A   W+FFP+ Q
Sbjct: 433 SADG-DSHEPTIASDWWANEKVVGKDSGAIKNWAFFPVIQ 454

BLAST of Cucsa.106790 vs. TAIR10
Match: AT5G52430.1 (AT5G52430.1 hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 248.1 bits (632), Expect = 1.1e-65
Identity = 178/468 (38.03%), Postives = 242/468 (51.71%), Query Frame = 1

Query: 11  RPVNNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSIKQRKRIGHAVL 70
           R V NN+ +T+ AAA AI T + R   +++ QK RWG C S+Y CFG+ K  KRIG+AVL
Sbjct: 2   RNVVNNSVETVNAAATAIVTAESRVQPSSS-QKGRWGKCWSLYSCFGTQKNNKRIGNAVL 61

Query: 71  VPEPSPSSEP---HENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAMQSPTALISFTSLTA 130
           VPEP  S  P    +N+  S  +VLPF APPSSP S LQS+P S   SP   +   SLT+
Sbjct: 62  VPEPVTSGVPVVTVQNSATSTTVVLPFIAPPSSPASFLQSDPSSVSHSPVGPL---SLTS 121

Query: 131 NMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPP--ESIHLTTPSSPEV 190
           N +SP  P S+F +GP+A+E Q V+PP+ FS   TEPST P+TPP   S+H+TTPSSPEV
Sbjct: 122 NTFSPKEPQSVFTVGPYANETQPVTPPV-FSAFITEPSTAPYTPPPESSVHITTPSSPEV 181

Query: 191 PFAQFVQPTLPKVESDN------QYTFPNDDFQSYQFYPGSP-VSHLISPRSVISRSGAS 250
           PFAQ +  +L     D+      +++  + +F+S Q  PGSP   +LISP SVIS SG S
Sbjct: 182 PFAQLLTSSLELTRRDSTSGMNQKFSSSHYEFRSNQVCPGSPGGGNLISPGSVISNSGTS 241

Query: 251 SPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFV 310
           SP P        S  + F +  PP  L  +  +   W  R  + S T             
Sbjct: 242 SPYPG------KSPMVEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPVG-HGSGLASGA 301

Query: 311 LNPQTSESMSDHHATN------ESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQ 370
           L P   E +S +   N      ++Q  ++     S    E    +HR SFEL+  DV  +
Sbjct: 302 LTPNGPEIVSGNLTPNNTTWPLQNQISEVASLANSDHGSEVMVADHRVSFELTGEDV-AR 361

Query: 371 SVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQRQEHHSV 430
            + SK   S++    +  I        E S   D   N IE+++    +E H+ Q+  S 
Sbjct: 362 CLASKLNRSHDRMNNNDRIE------TEESSSTDIRRN-IEKRSGDRENEQHRIQKLSSS 421

Query: 431 TLGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFP 460
           ++GS KEF FD                  N KD + E  A  +WSFFP
Sbjct: 422 SIGSSKEFKFD------------------NTKDENIEKVAGNSWSFFP 431

BLAST of Cucsa.106790 vs. TAIR10
Match: AT4G25620.1 (AT4G25620.1 hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 216.9 bits (551), Expect = 2.7e-56
Identity = 176/491 (35.85%), Postives = 239/491 (48.68%), Query Frame = 1

Query: 11  RPVNNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSIKQRKRIGHAVL 70
           R VNN++  T+ AAA AI + + R  + ++VQK+R GS  S+YWCFGS K  KRIGHAVL
Sbjct: 2   RSVNNSSVDTVNAAASAIVSAESR-TQPSSVQKKR-GSWWSLYWCFGSKKNNKRIGHAVL 61

Query: 71  VPEPSPSS---EPHEN-TLQSPDIVLPFAAPPSSPVSLLQSEPPSAMQSPTALISFTSLT 130
           VPEP+ S     P +N +  S  I +PF APPSSP S L S PPSA  +P   +   SLT
Sbjct: 62  VPEPAASGAAVAPVQNSSSNSTSIFMPFIAPPSSPASFLPSGPPSASHTPDPGL-LCSLT 121

Query: 131 ANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVP 190
            N      P S F IGP+AHE Q V+PP+ FS  TTEPST PFTPP      +PSSPEVP
Sbjct: 122 VN-----EPPSAFTIGPYAHETQPVTPPV-FSAFTTEPSTAPFTPPPE----SPSSPEVP 181

Query: 191 FAQFVQPTLPKVESDN------QYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSP 250
           FAQ +  +L +   ++      +++  + +F+S Q YPGSP  +LISP      SG SSP
Sbjct: 182 FAQLLTSSLERARRNSGGGMNQKFSAAHYEFKSCQVYPGSPGGNLISP-----GSGTSSP 241

Query: 251 LPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDS--------------CTQ 310
            P           + F +  PP  L  +  +   W  R  + S               T 
Sbjct: 242 YPG------KCSIIEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPAGQGSRLGSGALTP 301

Query: 311 DSIEFKSSNDFVLNPQTSESMSDHHATNESQNIQILIDD--------------GSKKEEE 370
           D  +  S    V+ P  +E++      N +     L+D                S+  +E
Sbjct: 302 DGSKLTSG---VVTPNGAETVIRMSYGNLTPLEGSLLDSQISEVASLANSDHGSSRHNDE 361

Query: 371 PGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVI 430
                HR SFEL+  DV  + + SK        +  S  HE           G+H   + 
Sbjct: 362 ALVVPHRVSFELTGEDV-ARCLASK--------LNRSGSHE--------KASGEH---LR 421

Query: 431 EEKTKADGD-EAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNINSEWWINAK-DGSTES 461
               K  G+ E+ Q Q+  S + GS KEF FD+ N  +     I SEWW N K  G  + 
Sbjct: 422 PNCCKTSGETESEQSQKLRSFSTGSNKEFKFDSTN--EEMIEKIRSEWWANEKVAGKGDH 443

BLAST of Cucsa.106790 vs. TAIR10
Match: AT1G63720.1 (AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1))

HSP 1 Score: 199.9 bits (507), Expect = 3.4e-51
Identity = 131/265 (49.43%), Postives = 160/265 (60.38%), Query Frame = 1

Query: 15  NNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPE 74
           NN F TI AAA AIA+ D R  +++ + +KR+W +  S+  CFGS +QRKRIG++VLVPE
Sbjct: 8   NNVFDTINAAASAIASSDDRLHQSSPIHKKRKWWNRWSLLKCFGSSRQRKRIGNSVLVPE 67

Query: 75  PSPSSEPHENT----LQSPDIVLPFAAPPSSPVSLLQSEPPSAMQSPTALISFTSLTANM 134
           P   S  +  T     +S    LPF APPSSP S  QSEPPSA QSP  ++SF+ L  N 
Sbjct: 68  PVSMSSSNSTTSNSGYRSVITTLPFIAPPSSPASFFQSEPPSATQSPVGILSFSPLPCN- 127

Query: 135 YSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPP---ESIHL--TTPSSPE 194
                  SIFAIGP+AHE QLVSPP+ FST TTEPS+ P TPP    SI+L  TTPSSPE
Sbjct: 128 ----NRPSIFAIGPYAHETQLVSPPV-FSTYTTEPSSAPITPPLDDSSIYLTTTTPSSPE 187

Query: 195 VPFAQFVQPTLPKVESDNQYTFP---NDDFQSYQFYPGSPVSHLISPRSVISRSGASSPL 254
           VPFAQ              Y FP   + +FQ YQ  PGSP+  LISP      SG +SP 
Sbjct: 188 VPFAQLFNSN--HQTGSYGYKFPMSSSYEFQFYQLPPGSPLGQLISPS---PGSGPTSPF 247

Query: 255 PDYDFASFGSQFLNFPLEVPPTLLN 266
           PD +     S F +F +  PP LL+
Sbjct: 248 PDGE----TSLFPHFQVSDPPKLLS 257

BLAST of Cucsa.106790 vs. TAIR10
Match: AT1G76660.1 (AT1G76660.1 FUNCTIONS IN: molecular_function unknown)

HSP 1 Score: 147.5 bits (371), Expect = 2.0e-35
Identity = 99/210 (47.14%), Postives = 121/210 (57.62%), Query Frame = 1

Query: 42  QKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPS--PSSEPHE-------NTLQSPDIVL 101
           Q++RWG CL ++ CF S K  KRI  A  +PE     +S+P+        N   +  I L
Sbjct: 7   QRKRWGGCLGVFSCFKSQKGGKRIVPASRIPEGGNVSASQPNGAHQAGVLNNQAAGGINL 66

Query: 102 PFAAPPSSPVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSS-IFAIGPFAHEPQLV 161
              APPSSP S   S  PS  QSP     + SL AN  SP GPSS ++A GP+AHE QLV
Sbjct: 67  SLLAPPSSPASFTNSALPSTTQSPNC---YLSLAAN--SPGGPSSSMYATGPYAHETQLV 126

Query: 162 SPPLNFSTLTTEPST-PFT-PPESIHLTTPSSPEVPFAQFVQPTLPKVESDNQYTFPNDD 221
           SPP+ FST TTEPST PFT PPE   LT PSSP+VP+A+F+  ++    S   +   ND 
Sbjct: 127 SPPV-FSTFTTEPSTAPFTPPPELARLTAPSSPDVPYARFLTSSMDLKNSGKGHY--NDL 186

Query: 222 FQSYQFYPGSPVSHLISPRSVISRSGASSP 240
             +Y  YPGSP S L SP S  S  G  SP
Sbjct: 187 QATYSLYPGSPASALRSPISRASGDGLLSP 208

BLAST of Cucsa.106790 vs. NCBI nr
Match: gi|449457656|ref|XP_004146564.1| (PREDICTED: uncharacterized protein LOC101220378 [Cucumis sativus])

HSP 1 Score: 955.3 bits (2468), Expect = 4.0e-275
Identity = 464/464 (100.00%), Postives = 464/464 (100.00%), Query Frame = 1

Query: 1   MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSIK 60
           MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSIK
Sbjct: 1   MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSIK 60

Query: 61  QRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAMQSPTALI 120
           QRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAMQSPTALI
Sbjct: 61  QRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAMQSPTALI 120

Query: 121 SFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPFTPPESIHLTTPS 180
           SFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPFTPPESIHLTTPS
Sbjct: 121 SFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPFTPPESIHLTTPS 180

Query: 181 SPEVPFAQFVQPTLPKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPL 240
           SPEVPFAQFVQPTLPKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPL
Sbjct: 181 SPEVPFAQFVQPTLPKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPL 240

Query: 241 PDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP 300
           PDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP
Sbjct: 241 PDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP 300

Query: 301 QTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLES 360
           QTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLES
Sbjct: 301 QTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLES 360

Query: 361 NELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQRQEHHSVTLGSVKEFN 420
           NELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQRQEHHSVTLGSVKEFN
Sbjct: 361 NELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQRQEHHSVTLGSVKEFN 420

Query: 421 FDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR 465
           FDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR
Sbjct: 421 FDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR 464

BLAST of Cucsa.106790 vs. NCBI nr
Match: gi|659102256|ref|XP_008452033.1| (PREDICTED: uncharacterized protein LOC103493162 isoform X2 [Cucumis melo])

HSP 1 Score: 903.3 bits (2333), Expect = 1.8e-259
Identity = 442/465 (95.05%), Postives = 447/465 (96.13%), Query Frame = 1

Query: 1   MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSIK 60
           MRRRTDTDDFRPVNN TFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGS+K
Sbjct: 1   MRRRTDTDDFRPVNN-TFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLK 60

Query: 61  QRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAMQSPTALI 120
           QRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSA+QSPTALI
Sbjct: 61  QRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAIQSPTALI 120

Query: 121 SFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTP-FTPPESIHLTTP 180
           SFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTP FTPPESIHLTTP
Sbjct: 121 SFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTP 180

Query: 181 SSPEVPFAQFVQPTLPKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSP 240
           SSPEVPFAQFV P+L KVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSP
Sbjct: 181 SSPEVPFAQFVPPSLQKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSP 240

Query: 241 LPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN 300
           LPDYDFASFGSQFLNFPLEVPPTL NLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN
Sbjct: 241 LPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN 300

Query: 301 PQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLE 360
           P TSESM DHHATNESQNIQILIDDGSK+EEEPGATNHRFSFELSDGDVL QSVGSKPLE
Sbjct: 301 PHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLE 360

Query: 361 SNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQRQEHHSVTLGSVKEF 420
           SNEL VESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQ QEHHSV LGSVKEF
Sbjct: 361 SNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQHQEHHSVALGSVKEF 420

Query: 421 NFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR 465
           NFDN NGSDTHNP INS+WW NAKDGSTE T TG WSFFP TQQR
Sbjct: 421 NFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR 464

BLAST of Cucsa.106790 vs. NCBI nr
Match: gi|659102254|ref|XP_008452032.1| (PREDICTED: uncharacterized protein LOC103493162 isoform X1 [Cucumis melo])

HSP 1 Score: 898.7 bits (2321), Expect = 4.4e-258
Identity = 442/466 (94.85%), Postives = 447/466 (95.92%), Query Frame = 1

Query: 1   MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQ-KRRWGSCLSIYWCFGSI 60
           MRRRTDTDDFRPVNN TFQTITAAADAIATVDHRFPRATAVQ KRRWGSCLSIYWCFGS+
Sbjct: 1   MRRRTDTDDFRPVNN-TFQTITAAADAIATVDHRFPRATAVQQKRRWGSCLSIYWCFGSL 60

Query: 61  KQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAMQSPTAL 120
           KQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSA+QSPTAL
Sbjct: 61  KQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAIQSPTAL 120

Query: 121 ISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTP-FTPPESIHLTT 180
           ISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTP FTPPESIHLTT
Sbjct: 121 ISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTT 180

Query: 181 PSSPEVPFAQFVQPTLPKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASS 240
           PSSPEVPFAQFV P+L KVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASS
Sbjct: 181 PSSPEVPFAQFVPPSLQKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASS 240

Query: 241 PLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVL 300
           PLPDYDFASFGSQFLNFPLEVPPTL NLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVL
Sbjct: 241 PLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVL 300

Query: 301 NPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPL 360
           NP TSESM DHHATNESQNIQILIDDGSK+EEEPGATNHRFSFELSDGDVL QSVGSKPL
Sbjct: 301 NPHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPL 360

Query: 361 ESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQRQEHHSVTLGSVKE 420
           ESNEL VESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQ QEHHSV LGSVKE
Sbjct: 361 ESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQHQEHHSVALGSVKE 420

Query: 421 FNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR 465
           FNFDN NGSDTHNP INS+WW NAKDGSTE T TG WSFFP TQQR
Sbjct: 421 FNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR 465

BLAST of Cucsa.106790 vs. NCBI nr
Match: gi|700198179|gb|KGN53337.1| (hypothetical protein Csa_4G047980 [Cucumis sativus])

HSP 1 Score: 726.9 bits (1875), Expect = 2.3e-206
Identity = 352/352 (100.00%), Postives = 352/352 (100.00%), Query Frame = 1

Query: 113 MQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPFTPPE 172
           MQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPFTPPE
Sbjct: 1   MQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPFTPPE 60

Query: 173 SIHLTTPSSPEVPFAQFVQPTLPKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVIS 232
           SIHLTTPSSPEVPFAQFVQPTLPKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVIS
Sbjct: 61  SIHLTTPSSPEVPFAQFVQPTLPKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVIS 120

Query: 233 RSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKS 292
           RSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKS
Sbjct: 121 RSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKS 180

Query: 293 SNDFVLNPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQS 352
           SNDFVLNPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQS
Sbjct: 181 SNDFVLNPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQS 240

Query: 353 VGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQRQEHHSVT 412
           VGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQRQEHHSVT
Sbjct: 241 VGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQRQEHHSVT 300

Query: 413 LGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR 465
           LGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR
Sbjct: 301 LGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR 352

BLAST of Cucsa.106790 vs. NCBI nr
Match: gi|802738414|ref|XP_012086872.1| (PREDICTED: uncharacterized protein LOC105645786 [Jatropha curcas])

HSP 1 Score: 425.6 bits (1093), Expect = 1.1e-115
Identity = 242/471 (51.38%), Postives = 297/471 (63.06%), Query Frame = 1

Query: 4   RTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSIKQRK 63
           R    D RP +NN   TI AAA AIA+ ++R P+AT VQKRRWGSC S+YWCFG  + RK
Sbjct: 2   RAVNGDSRP-SNNALDTINAAASAIASAENRVPQAT-VQKRRWGSCFSVYWCFGYNRHRK 61

Query: 64  RIGHAVLVPE---PSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAMQSPTALI 123
           RIGHAVLVPE   P   S   EN+ Q+P I LPF APPSSP S LQSEPPSA QSPT ++
Sbjct: 62  RIGHAVLVPETPGPRNDSSAAENSTQTPTITLPFVAPPSSPASFLQSEPPSASQSPTGVL 121

Query: 124 SFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFT-PPESIHLTT 183
           S TS++ANMYSP GPSSIFAIGP+AHE QLVSPP+ FST TTEPST PFT PPES+HLTT
Sbjct: 122 SLTSISANMYSPSGPSSIFAIGPYAHETQLVSPPV-FSTFTTEPSTAPFTPPPESVHLTT 181

Query: 184 PSSPEVPFAQFVQPTLPKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASS 243
           PSSPEVPFAQ + P++  VE+  ++   N +FQSYQ YPGSPV  LISP S IS SG SS
Sbjct: 182 PSSPEVPFAQLLDPSIRNVEAGLRFPLSNYEFQSYQLYPGSPVGQLISPSSGISGSGTSS 241

Query: 244 PLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVL 303
           P PD +FA   + FL F +  PP LLNLDK S H W  R  + + T D++   +S  F  
Sbjct: 242 PFPDGEFA---AGFLEFRMGEPPKLLNLDKLSTHEWGSRCGSGTLTPDAVR-PTSCSFTP 301

Query: 304 NPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVL--LQSVGSK 363
           +   S+ +S  H+ N +QN ++               +HR SFEL+   VL   +   + 
Sbjct: 302 DRPFSDFVSHKHSDNGNQNDEV--------------GDHRLSFELAAEGVLGCEEQNPAS 361

Query: 364 PL----ESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQRQE-HHSV 423
           P+    +S E    ++   +  E   +       TSN   EK   DG++A  R E H S+
Sbjct: 362 PVKIIGDSLENGTVAARTEDSTEVVDDFESRVGETSNGTPEKASTDGEKAPPRHEKHRSI 421

Query: 424 TLGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQ 463
           TLGS+KEFNFDN +G D+H PN   +WW N  D   E  AT  WSFFPM Q
Sbjct: 422 TLGSLKEFNFDNVDGGDSHKPNAGPDWWANGSDIGKEDGATKNWSFFPMMQ 451

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Y1666_ARATH3.6e-3447.14Uncharacterized protein At1g76660 OS=Arabidopsis thaliana GN=At1g76660 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0KY57_CUCSA1.6e-206100.00Uncharacterized protein OS=Cucumis sativus GN=Csa_4G047980 PE=4 SV=1[more]
A0A067JZI1_JATCU7.6e-11651.38Uncharacterized protein OS=Jatropha curcas GN=JCGZ_20571 PE=4 SV=1[more]
M5WM36_PRUPE8.7e-11252.06Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005552mg PE=4 SV=1[more]
B9RCD8_RICCO6.2e-11049.38Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1687100 PE=4 SV=1[more]
A0A067G9B9_CITSI1.3e-10751.09Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g012593mg PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G52430.11.1e-6538.03 hydroxyproline-rich glycoprotein family protein[more]
AT4G25620.12.7e-5635.85 hydroxyproline-rich glycoprotein family protein[more]
AT1G63720.13.4e-5149.43 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glyc... [more]
AT1G76660.12.0e-3547.14 FUNCTIONS IN: molecular_function unknown[more]
Match NameE-valueIdentityDescription
gi|449457656|ref|XP_004146564.1|4.0e-275100.00PREDICTED: uncharacterized protein LOC101220378 [Cucumis sativus][more]
gi|659102256|ref|XP_008452033.1|1.8e-25995.05PREDICTED: uncharacterized protein LOC103493162 isoform X2 [Cucumis melo][more]
gi|659102254|ref|XP_008452032.1|4.4e-25894.85PREDICTED: uncharacterized protein LOC103493162 isoform X1 [Cucumis melo][more]
gi|700198179|gb|KGN53337.1|2.3e-206100.00hypothetical protein Csa_4G047980 [Cucumis sativus][more]
gi|802738414|ref|XP_012086872.1|1.1e-11551.38PREDICTED: uncharacterized protein LOC105645786 [Jatropha curcas][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009725 response to hormone
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cucsa.106790.1Cucsa.106790.1mRNA
Cucsa.106790.2Cucsa.106790.2mRNA


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR31798FAMILY NOT NAMEDcoord: 1..443
score: 2.2E
NoneNo IPR availablePANTHERPTHR31798:SF2SUBFAMILY NOT NAMEDcoord: 1..443
score: 2.2E

The following gene(s) are paralogous to this gene:

None