Cla000521 (gene) Watermelon (97103) v1

Name: Cla000521
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Hydroxyproline-rich glycoprotein family protein (AHRD V1 *--- D7MRR7_ARALL)
Location: Chr0: 12661270..12663073 (-)
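As a quick check on the location above, the length of the genomic span can be computed from its start and end coordinates. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (as in GFF-style annotations); the function name `span_length` is illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field of Cla000521.
gene_length = span_length(12661270, 12663073)
print(gene_length)
```

Under that assumption the feature spans 1804 bases on the minus strand of Chr0.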
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGAAAGAGATGGGGTGGATGTTGGGGTGCATTATCTTGTTTTCACTCTCAGAAGGGAGAAAAGCGCATCGTACCTGCATCTCGTTTACCTGAGGGCAATGCCGTGACAACCCAGCCTAATGGACCTCCAGGAATGACCAACCAGGCTACAGTGATAACTCCATCCCTTCTAGCCCCACCTTCTTCACCAGCATCTTTTACAAATTCTGCACTCCCTTCAACAGTCCAATCACCTAGCTGTTTCATGTCACTGTCTGCAAACTCACCTGGAGGTCCTTCATCCACAATGTTTGCTACAGGGCCATATGCGCATGAAACACAGCTGGTTTCTCCTCCTGTTTTCTCAGCCTTCACCACTGAACCGTCAACTGCTCCCCTCACCCCCCCACCCGAACTAGCTCATCTAACCACGCCTTCTTCCCCTGATGTGCCCTTTGCTCAGTTCCTATCCTCATCGGTGGATCTCAAAGGAACTGGAAAGGCCAATTACATTGCTTCAAATGATCTTCAAGCAGCATATTCTCTCTACCCTGGAAGTCCTGCCAGTAGCCTCGTGTCACCAATTTCAAGGACCTCTGGCGATTGCTTATCATCCTCATTTCCTGAAAGGGACTTCCCATCACAGTGGAATCCTTCAGCTTCTCTCCAAGATGGAAAATATCCAAGAAGTGGTTCTGGTCGGCTATTTGGAAATGAGAAAGCTGGTGGTACATCGTTGGCATCTCAGGATTCTAATTTCTTCTGCCCTGCTACATTTGCACAATTCTATCTGGACAATCCACCATTCCCTCATACTGGTGGGAGGTTAAGTGTATCGAAGGATTCAGATGTCTACTCTTCTAGTGGGAATGGATACCAGAACCGGCACAGTAAATCTCCAAAACAAGATGTGGAGGAAATAGAAGCTTACAGAGCATCGTTTGGTTTCAGTGCGGATGAAATTATAACTACTACACAGTATGTGGAGATATCTGATGTAATGGAGGATTCCTTTACTATGAGACCTTTTACCTCAACTAGTCTATCAGCAGAAGAAAGTATTGAACCTCCATTGTTGGGTGAAAAACTAAAATCCACGCATACAACTTTACAGAGTCAGAGAAGTATTAAATCAGCACCTGAGGTTGTCGAAAAGGAAACCTGCACTGAAGTGCTGGCATTATGCAATGGTTATAAAGGTGAGCCGCTGTTCCCTGTTACAACAAGCACTGAAGCACACACTTGTTTCATAAAATGAGAAATCAAAAGAAAATAAAAATCAGATATGGTTTAGAAAACGTGTTTAAAATGTCATTATTAGAATTTTATACTTTTCTTTTTTATTTTACTATTATTGTTATTCTTTTTTTTTTTTTTTTTTTGGGGGGGGGGGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGGGGGGATGGAATTTTGGAGGTATTATTTTATCCGTGACCCACAAGTCTGATACTATAAATAGTTAAGATTCTCCACTGTACACCTCCGGTTACAGTTGTTACCTGTTTGATGCAGACAATAAATTGCAAAGACAACCTGGTAATATGTCAGGATCAAGTACTTCAAACCAAGTTGAAAAAGACGTATTCTCAAGGACAGGGTCATCCAAAAATAGTCGCAAGTATAATCTTGGCTTATCCTGCTCTGATGCAGAGGTTGACTACGGAAGGGGAAGGAGCCCAAGGGAGGCCAGGGAAGATTTTTCATGGCATGACTAA

mRNA sequence

ATGGGAAAGAGATGGGGTGGATGTTGGGGTGCATTATCTTGTTTTCACTCTCAGAAGGGAGAAAAGCGCATCGTACCTGCATCTCGTTTACCTGAGGGCAATGCCGTGACAACCCAGCCTAATGGACCTCCAGGAATGACCAACCAGGCTACAGTGATAACTCCATCCCTTCTAGCCCCACCTTCTTCACCAGCATCTTTTACAAATTCTGCACTCCCTTCAACAGTCCAATCACCTAGCTGTTTCATGTCACTGTCTGCAAACTCACCTGGAGGTCCTTCATCCACAATGTTTGCTACAGGGCCATATGCGCATGAAACACAGCTGGTTTCTCCTCCTGTTTTCTCAGCCTTCACCACTGAACCGTCAACTGCTCCCCTCACCCCCCCACCCGAACTAGCTCATCTAACCACGCCTTCTTCCCCTGATGTGCCCTTTGCTCAGTTCCTATCCTCATCGGTGGATCTCAAAGGAACTGGAAAGGCCAATTACATTGCTTCAAATGATCTTCAAGCAGCATATTCTCTCTACCCTGGAAGTCCTGCCAGTAGCCTCGTGTCACCAATTTCAAGGACCTCTGGCGATTGCTTATCATCCTCATTTCCTGAAAGGGACTTCCCATCACAGTGGAATCCTTCAGCTTCTCTCCAAGATGGAAAATATCCAAGAAGTGGTTCTGGTCGGCTATTTGGAAATGAGAAAGCTGGTGGTACATCGTTGGCATCTCAGGATTCTAATTTCTTCTGCCCTGCTACATTTGCACAATTCTATCTGGACAATCCACCATTCCCTCATACTGGTGGGAGGTTAAGTGTATCGAAGGATTCAGATGTCTACTCTTCTAGTGGGAATGGATACCAGAACCGGCACAGTAAATCTCCAAAACAAGATGTGGAGGAAATAGAAGCTTACAGAGCATCGTTTGGTTTCAGTGCGGATGAAATTATAACTACTACACAGTATGTGGAGATATCTGATGTAATGGAGGATTCCTTTACTATGAGACCTTTTACCTCAACTAGTCTATCAGCAGAAGAAAGTATTGAACCTCCATTGTTGGGTGAAAAACTAAAATCCACGCATACAACTTTACAGAGTCAGAGAAGTATTAAATCAGCACCTGAGGTTGTCGAAAAGGAAACCTGCACTGAAGTGCTGGCATTATGCAATGGTTATAAAGACAATAAATTGCAAAGACAACCTGGTAATATGTCAGGATCAAGTACTTCAAACCAAGTTGAAAAAGACGTATTCTCAAGGACAGGGTCATCCAAAAATAGTCGCAAGTATAATCTTGGCTTATCCTGCTCTGATGCAGAGGTTGACTACGGAAGGGGAAGGAGCCCAAGGGAGGCCAGGGAAGATTTTTCATGGCATGACTAA

Coding sequence (CDS)

ATGGGAAAGAGATGGGGTGGATGTTGGGGTGCATTATCTTGTTTTCACTCTCAGAAGGGAGAAAAGCGCATCGTACCTGCATCTCGTTTACCTGAGGGCAATGCCGTGACAACCCAGCCTAATGGACCTCCAGGAATGACCAACCAGGCTACAGTGATAACTCCATCCCTTCTAGCCCCACCTTCTTCACCAGCATCTTTTACAAATTCTGCACTCCCTTCAACAGTCCAATCACCTAGCTGTTTCATGTCACTGTCTGCAAACTCACCTGGAGGTCCTTCATCCACAATGTTTGCTACAGGGCCATATGCGCATGAAACACAGCTGGTTTCTCCTCCTGTTTTCTCAGCCTTCACCACTGAACCGTCAACTGCTCCCCTCACCCCCCCACCCGAACTAGCTCATCTAACCACGCCTTCTTCCCCTGATGTGCCCTTTGCTCAGTTCCTATCCTCATCGGTGGATCTCAAAGGAACTGGAAAGGCCAATTACATTGCTTCAAATGATCTTCAAGCAGCATATTCTCTCTACCCTGGAAGTCCTGCCAGTAGCCTCGTGTCACCAATTTCAAGGACCTCTGGCGATTGCTTATCATCCTCATTTCCTGAAAGGGACTTCCCATCACAGTGGAATCCTTCAGCTTCTCTCCAAGATGGAAAATATCCAAGAAGTGGTTCTGGTCGGCTATTTGGAAATGAGAAAGCTGGTGGTACATCGTTGGCATCTCAGGATTCTAATTTCTTCTGCCCTGCTACATTTGCACAATTCTATCTGGACAATCCACCATTCCCTCATACTGGTGGGAGGTTAAGTGTATCGAAGGATTCAGATGTCTACTCTTCTAGTGGGAATGGATACCAGAACCGGCACAGTAAATCTCCAAAACAAGATGTGGAGGAAATAGAAGCTTACAGAGCATCGTTTGGTTTCAGTGCGGATGAAATTATAACTACTACACAGTATGTGGAGATATCTGATGTAATGGAGGATTCCTTTACTATGAGACCTTTTACCTCAACTAGTCTATCAGCAGAAGAAAGTATTGAACCTCCATTGTTGGGTGAAAAACTAAAATCCACGCATACAACTTTACAGAGTCAGAGAAGTATTAAATCAGCACCTGAGGTTGTCGAAAAGGAAACCTGCACTGAAGTGCTGGCATTATGCAATGGTTATAAAGACAATAAATTGCAAAGACAACCTGGTAATATGTCAGGATCAAGTACTTCAAACCAAGTTGAAAAAGACGTATTCTCAAGGACAGGGTCATCCAAAAATAGTCGCAAGTATAATCTTGGCTTATCCTGCTCTGATGCAGAGGTTGACTACGGAAGGGGAAGGAGCCCAAGGGAGGCCAGGGAAGATTTTTCATGGCATGACTAA

Protein sequence

MGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPPGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTHTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQVEKDVFSRTGSSKNSRKYNLGLSCSDAEVDYGRGRSPREAREDFSWHD
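The protein sequence above corresponds to the coding sequence read codon by codon. The following sketch shows that relationship using the standard genetic code; it assumes the CDS starts in frame 1 and stops at the first stop codon, and the helper name `translate` is illustrative only (it is not the tool used by the database).

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 bases of the Cla000521 CDS shown above.
print(translate("ATGGGAAAGAGATGGGGTGGATGTTGGGGT"))
print(translate("TCATGGCATGACTAA"))
```

The two fragments translate to the start (`MGKRWGGCWG`) and the end (`SWHD`) of the listed protein, with the final `TAA` acting as the stop codon.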
BLAST of Cla000521 vs. Swiss-Prot
Match: Y1666_ARATH (Uncharacterized protein At1g76660 OS=Arabidopsis thaliana GN=At1g76660 PE=2 SV=1)

HSP 1 Score: 468.8 bits (1205), Expect = 7.0e-131
Identity = 269/461 (58.35%), Positives = 315/461 (68.33%), Query Frame = 1

Query: 3   KRWGGCWGALSCFHSQKGEKRIVPASRLPEG-NAVTTQPNGPPG---MTNQATV-ITPSL 62
           KRWGGC G  SCF SQKG KRIVPASR+PEG N   +QPNG      + NQA   I  SL
Sbjct: 9   KRWGGCLGVFSCFKSQKGGKRIVPASRIPEGGNVSASQPNGAHQAGVLNNQAAGGINLSL 68

Query: 63  LAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSA 122
           LAPPSSPASFTNSALPST QSP+C++SL+ANSPGGPSS+M+ATGPYAHETQLVSPPVFS 
Sbjct: 69  LAPPSSPASFTNSALPSTTQSPNCYLSLAANSPGGPSSSMYATGPYAHETQLVSPPVFST 128

Query: 123 FTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLY 182
           FTTEPSTAP TPPPELA LT PSSPDVP+A+FL+SS+DLK +GK +Y   NDLQA YSLY
Sbjct: 129 FTTEPSTAPFTPPPELARLTAPSSPDVPYARFLTSSMDLKNSGKGHY---NDLQATYSLY 188

Query: 183 PGSPASSLVSPISRTSGDCLSSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGG 242
           PGSPAS+L SPISR SGD L S                 Q+GK  RS SG  FG +   G
Sbjct: 189 PGSPASALRSPISRASGDGLLSP----------------QNGKCSRSDSGNTFGYD-TNG 248

Query: 243 TSLASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYSSS--GNGYQNRHSKSP 302
            S   Q+SNFFCP TFA+FYLD +P  P  GGRLSVSKDSDVY ++  GNG QNR ++SP
Sbjct: 249 VSTPLQESNFFCPETFAKFYLDHDPSVPQNGGRLSVSKDSDVYPTNGYGNGNQNRQNRSP 308

Query: 303 KQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLG 362
           KQD+EE+EAYRASFGFSADEIITT+QYVEI+DVM+ SF    ++            P  G
Sbjct: 309 KQDMEELEAYRASFGFSADEIITTSQYVEITDVMDGSFNTSAYS------------PSDG 368

Query: 363 EKLKSTHTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQVE 422
           +KL      L SQ S KS  ++  +    +     N YKD+K + +          +  E
Sbjct: 369 QKLLRREANLLSQTSPKSEADLDSQVVDFQSPKSSNSYKDHKQRNR---------IHADE 426

Query: 423 KDVFSRTGSSKNSRKYNLGLSCSDAEVDYGRGRSPREARED 456
           + + SR GS K SR Y+  +S SDAEV+Y RGRS RE+RE+
Sbjct: 429 EALLSRVGSVKGSRSYH--ISSSDAEVEYRRGRSLRESREN 426

BLAST of Cla000521 vs. TrEMBL
Match: A0A0A0L1G3_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G665140 PE=4 SV=1)

HSP 1 Score: 846.7 bits (2186), Expect = 1.4e-242
Identity = 426/461 (92.41%), Positives = 437/461 (94.79%), Query Frame = 1

Query: 2   GKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPP--GMTNQATVITPSLLA 61
           GKRWGGCWGALSCFHSQKG+KRIVPASRLPEGN VTTQPNGP   GMTNQATVITPSLLA
Sbjct: 14  GKRWGGCWGALSCFHSQKGDKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLA 73

Query: 62  PPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFT 121
           PPSSPASFTNSALPSTVQSPSCF+SLSANSPGGPSSTM+ATGPYAH+TQLVSPPVFSAF 
Sbjct: 74  PPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMYATGPYAHDTQLVSPPVFSAFN 133

Query: 122 TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPG 181
           TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS DLKGTGKANYIASNDLQAAYSLYPG
Sbjct: 134 TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSEDLKGTGKANYIASNDLQAAYSLYPG 193

Query: 182 SPASSLVSPISRTSGDCLSSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTS 241
           SPASSLVSPISRTSGDCLSSSFPERDF  QWN SASLQDGKYPRSGSGRLFGNEKA GTS
Sbjct: 194 SPASSLVSPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKA-GTS 253

Query: 242 LASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVE 301
           LASQDSNFFCPATFAQFYLDN  FPHTGGRLSVSKDSDVYSS GNGYQNRHSKSPKQDVE
Sbjct: 254 LASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQNRHSKSPKQDVE 313

Query: 302 EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKS 361
           EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES EPPLLGEKLKS
Sbjct: 314 EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKS 373

Query: 362 THTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQVEKDVFS 421
           +HTTLQSQRSIKSAPE    ETCTE+ ALCNGYKDNKLQRQPG++SGSSTSNQVEKDVFS
Sbjct: 374 SHTTLQSQRSIKSAPE----ETCTEMPALCNGYKDNKLQRQPGDISGSSTSNQVEKDVFS 433

Query: 422 RTGSSKNSRKYNLGLSCSDAEVDYGRGRSPREAREDFSWHD 461
           R GSSKNSRKY+LGLSCSDAEVDY RGRS REA+ + SWHD
Sbjct: 434 RIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD 469

BLAST of Cla000521 vs. TrEMBL
Match: B9GPK6_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0002s00330g PE=4 SV=2)

HSP 1 Score: 662.1 bits (1707), Expect = 4.8e-187
Identity = 338/466 (72.53%), Positives = 382/466 (81.97%), Query Frame = 1

Query: 3   KRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPP--GMTNQATVITPSLLAP 62
           KRWGGCWGALSCF  QKG KRIVPASR+PEGNA   QPNGP   G+TNQAT + PSLLAP
Sbjct: 18  KRWGGCWGALSCFSVQKGGKRIVPASRIPEGNASAAQPNGPQPVGLTNQATALAPSLLAP 77

Query: 63  PSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTT 122
           PSSPASFTNSALPST QSPSCF+SLSANSPGGPSSTM+ATGPYAHETQLVSPPVFS FTT
Sbjct: 78  PSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMYATGPYAHETQLVSPPVFSTFTT 137

Query: 123 EPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGS 182
           EPSTAPLTPPPELAHLTTPSSPDVPFAQFL+SS DLKG  K NYI ++DLQ+ YSLYPGS
Sbjct: 138 EPSTAPLTPPPELAHLTTPSSPDVPFAQFLTSSRDLKGAEKNNYIVASDLQSTYSLYPGS 197

Query: 183 PASSLVSPISRTSGDCLSSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTSL 242
           PASSL+SPISRTSGDCLS+SFPER FP +W PS S Q+GKY RSGSGRLFG+E   G S+
Sbjct: 198 PASSLLSPISRTSGDCLSASFPERGFPREWGPSVSPQNGKYSRSGSGRLFGHETT-GASM 257

Query: 243 ASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVEE 302
            S DSNFFCPATFA+FYLD+   P+TGGRLSVSKDSDVY +SGNG+QNRH+KSPKQD EE
Sbjct: 258 VSHDSNFFCPATFARFYLDHD--PNTGGRLSVSKDSDVYPASGNGHQNRHNKSPKQDAEE 317

Query: 303 IEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKST 362
           +EAYRASFGFSADEIITT QYVEISDVMED+F+M PFTS   + EES+E  LL E  K+ 
Sbjct: 318 LEAYRASFGFSADEIITTPQYVEISDVMEDTFSMTPFTSAKPTMEESMEASLLNEGQKA- 377

Query: 363 HTTLQSQRSIKSAPEVVEKETCTEVLALCNGYK---DNKLQRQPGNMSGSST-SNQV--E 422
           +  L  Q S+K   ++ ++  C EV    + Y+   D K + QPGN+SGSST SN V  +
Sbjct: 378 NANLPKQNSLKLKSDLADRVVCCEVPVTSDRYEVNSDPKSRWQPGNVSGSSTPSNHVVTD 437

Query: 423 KDVFSRTGSSKNSRKYNLGLSCSDAEVDYGRGRSPREAREDFSWHD 461
            D+FS+  SSK SRKY+LGLS SDAE+DY RGRS RE + DF+WHD
Sbjct: 438 DDIFSKMASSKTSRKYHLGLSSSDAEIDYRRGRSLREGKGDFAWHD 479

BLAST of Cla000521 vs. TrEMBL
Match: F6I6Y3_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_18s0122g00140 PE=4 SV=1)

HSP 1 Score: 661.8 bits (1706), Expect = 6.2e-187
Identity = 337/469 (71.86%), Positives = 386/469 (82.30%), Query Frame = 1

Query: 3   KRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPP--GMTNQATVITPSLLAP 62
           KRWGGCWG LSCF +QKG KRIVPASR+PEGNA  TQPNGP   G+TNQ T + PSLLAP
Sbjct: 21  KRWGGCWGGLSCFGTQKGGKRIVPASRIPEGNASATQPNGPQAVGLTNQTTALAPSLLAP 80

Query: 63  PSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTT 122
           PSSPASFTNSALPST QSPSCF+S+SANSP GPSSTMFATGPYAHETQLVSPPVFS FTT
Sbjct: 81  PSSPASFTNSALPSTAQSPSCFLSMSANSPEGPSSTMFATGPYAHETQLVSPPVFSTFTT 140

Query: 123 EPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGS 182
           EPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS+DLK  GK NYIA+NDLQA YSLYPGS
Sbjct: 141 EPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKSAGKTNYIAANDLQATYSLYPGS 200

Query: 183 PASSLVSPISRTSGDCLSSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTSL 242
           PASSL+SPISRTSGDCLSSSFPER+FP +W+PS S Q+ KYPR+GSGRLFG + A  +S 
Sbjct: 201 PASSLISPISRTSGDCLSSSFPEREFPPRWDPSISPQNAKYPRNGSGRLFGLDTA--SSS 260

Query: 243 ASQDSNFFCPATFAQFYLDN-----PPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPK 302
            SQDSNFFCPATFAQFYLD+     PPFP +GGRLS+S++SDVYSS GNG+QNRH+K+ K
Sbjct: 261 ISQDSNFFCPATFAQFYLDHTQQSYPPFP-SGGRLSLSRESDVYSSGGNGHQNRHNKNCK 320

Query: 303 QDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGE 362
           QDVEEIEAYRASFGFSADEIITTTQYVEISDV+EDSFTM PFTS     EE++ P ++ E
Sbjct: 321 QDVEEIEAYRASFGFSADEIITTTQYVEISDVLEDSFTMTPFTSNKPDMEENVVPAVVHE 380

Query: 363 KLKSTHTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQ--- 422
             K   T L ++ S+KS   +V++  C E L  C  ++D+K +RQ GN SGSST  +   
Sbjct: 381 GPKD-QTNLLNEESLKSESGLVDEGGCCEGLPSCKTFEDHKSERQSGNESGSSTPGKHIL 440

Query: 423 -VEKDVFSRTGSSKNSRKYNLGLSCSDAEVDYGRGRSPREAREDFSWHD 461
             E+++F + G+SK  RKY+LGLS SDAE+DY RGRS RE + DF+WHD
Sbjct: 441 TDEEEIFPK-GASKIGRKYHLGLSSSDAEIDYRRGRSLREGKGDFAWHD 484

BLAST of Cla000521 vs. TrEMBL
Match: V4TGC4_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10000992mg PE=4 SV=1)

HSP 1 Score: 661.4 bits (1705), Expect = 8.1e-187
Identity = 332/464 (71.55%), Positives = 384/464 (82.76%), Query Frame = 1

Query: 3   KRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPP--GMTNQATVITPSLLAP 62
           KRWGGC GA SCF SQKG KRIVPASR+PEGNA   QPNGP   G+ NQ T + PSLLAP
Sbjct: 17  KRWGGCLGAFSCFRSQKGGKRIVPASRMPEGNAPAAQPNGPQAAGLPNQTTTLAPSLLAP 76

Query: 63  PSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTT 122
           PSSPASFTNSALPST QSPSCF+SLSANSPGGPSSTMFATGPYAHETQLVSPPVFS FTT
Sbjct: 77  PSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSTFTT 136

Query: 123 EPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGS 182
           EPSTAPLTPPPELAHLTTPSSPDVPFA+FL+SS+DL GT KANYIA+NDLQA YSLYPGS
Sbjct: 137 EPSTAPLTPPPELAHLTTPSSPDVPFARFLTSSMDLNGTDKANYIAANDLQATYSLYPGS 196

Query: 183 PASSLVSPISRTSGDCLSSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTSL 242
           P SSL+SPISRTSG+CLSSSFPER+FP QW+P+ S Q+GKY RSGSGRL+ ++  GG S 
Sbjct: 197 PPSSLISPISRTSGECLSSSFPEREFPPQWDPTVSPQNGKYSRSGSGRLYTHDTTGG-SR 256

Query: 243 ASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVE 302
            SQD+NFFCPATFAQFYLD + PFPHTGGRLSVSKDSDVY +  NG QNRH+KSPKQDVE
Sbjct: 257 VSQDTNFFCPATFAQFYLDHDSPFPHTGGRLSVSKDSDVYPNGANGNQNRHTKSPKQDVE 316

Query: 303 EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKS 362
           E+EAYRASFGFSADEIITT QYVEI+DVM+DSFTM PFTS   + EES+   + G+K + 
Sbjct: 317 ELEAYRASFGFSADEIITTPQYVEITDVMDDSFTMMPFTSDKPAFEESLPASMDGQKPQG 376

Query: 363 THTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSST-SNQV---EK 422
             + L + +++KS  +++      E+    +G +DNK +RQ G++SG+ST  NQV   E+
Sbjct: 377 RESNLLNPKNLKSDSDLMNGGIHHELTESSDGCEDNKPKRQSGDVSGASTPGNQVLTDEE 436

Query: 423 DVFSRTGSSKNSRKYNLGLSCSDAEVDYGRGRSPREAREDFSWH 460
           D+FS+  +S+NSRKY+ GLSCSDAE+DY RGRS RE + DFSWH
Sbjct: 437 DIFSKMRTSRNSRKYHQGLSCSDAEIDYRRGRSLREGKGDFSWH 479

BLAST of Cla000521 vs. TrEMBL
Match: A0A067KKH5_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_08002 PE=4 SV=1)

HSP 1 Score: 659.1 bits (1699), Expect = 4.0e-186
Identity = 336/465 (72.26%), Positives = 378/465 (81.29%), Query Frame = 1

Query: 3   KRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPPG--MTNQATVITPSLLAP 62
           KRWGGC GA SCF SQKG KRIVPASR+P+GNA  +QPNGP    +TNQAT + PSLLAP
Sbjct: 15  KRWGGCLGAFSCFGSQKGGKRIVPASRIPDGNATASQPNGPQAGVLTNQATQLAPSLLAP 74

Query: 63  PSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTT 122
           PSSPASFTNSALPST QSPSCF+SLSANSPGGPSSTMFATGPYAHETQLVSPPVFS FTT
Sbjct: 75  PSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSTFTT 134

Query: 123 EPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGS 182
           EPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS+DLK T K NYIA+ DLQ  YSLYPGS
Sbjct: 135 EPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKSTEKTNYIAAGDLQTTYSLYPGS 194

Query: 183 PASSLVSPISRTSGDCLSSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTSL 242
           PASSL+SPISRTSGDCLSSSFPERDFP QW+PS S Q+GKY R+GSGRLFG++   G S+
Sbjct: 195 PASSLISPISRTSGDCLSSSFPERDFPPQWDPSVSPQNGKYSRNGSGRLFGHDTT-GASM 254

Query: 243 ASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVE 302
            SQD+NFFCPATFA+FYLD NPPFPHTGGRLSVSKDSDVY + GNG+Q+RH+++PKQDVE
Sbjct: 255 VSQDTNFFCPATFARFYLDHNPPFPHTGGRLSVSKDSDVYPAGGNGHQSRHNRNPKQDVE 314

Query: 303 EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKS 362
           EIEAYRASFGFSADEIITT QYVEISDVM+DSFTM PFTS   + E S E   L +  K+
Sbjct: 315 EIEAYRASFGFSADEIITTQQYVEISDVMDDSFTMTPFTSNKPTIEGSTEAASLSDSQKA 374

Query: 363 THTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSN----QVEK 422
             T L + + +KS         C E    C+ Y+D+K +RQ G++SGSST        + 
Sbjct: 375 -QTNLPTLK-LKS------DRVCGEAPVSCDRYEDSKSRRQTGDVSGSSTPGIHALTDDD 434

Query: 423 DVFSRTGSSKNSRKYNLGLSCSDAEVDYGRGRSPREAREDFSWHD 461
           D+FS+  SSK SRKYNLG SCSDAE+DY RGRS  E + DF+WHD
Sbjct: 435 DIFSKMTSSKISRKYNLGSSCSDAEIDYRRGRSLGEGKADFAWHD 470

BLAST of Cla000521 vs. TAIR10
Match: AT1G76660.1 (AT1G76660.1 FUNCTIONS IN: molecular_function unknown)

HSP 1 Score: 468.8 bits (1205), Expect = 3.9e-132
Identity = 269/461 (58.35%), Positives = 315/461 (68.33%), Query Frame = 1

Query: 3   KRWGGCWGALSCFHSQKGEKRIVPASRLPEG-NAVTTQPNGPPG---MTNQATV-ITPSL 62
           KRWGGC G  SCF SQKG KRIVPASR+PEG N   +QPNG      + NQA   I  SL
Sbjct: 9   KRWGGCLGVFSCFKSQKGGKRIVPASRIPEGGNVSASQPNGAHQAGVLNNQAAGGINLSL 68

Query: 63  LAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSA 122
           LAPPSSPASFTNSALPST QSP+C++SL+ANSPGGPSS+M+ATGPYAHETQLVSPPVFS 
Sbjct: 69  LAPPSSPASFTNSALPSTTQSPNCYLSLAANSPGGPSSSMYATGPYAHETQLVSPPVFST 128

Query: 123 FTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLY 182
           FTTEPSTAP TPPPELA LT PSSPDVP+A+FL+SS+DLK +GK +Y   NDLQA YSLY
Sbjct: 129 FTTEPSTAPFTPPPELARLTAPSSPDVPYARFLTSSMDLKNSGKGHY---NDLQATYSLY 188

Query: 183 PGSPASSLVSPISRTSGDCLSSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGG 242
           PGSPAS+L SPISR SGD L S                 Q+GK  RS SG  FG +   G
Sbjct: 189 PGSPASALRSPISRASGDGLLSP----------------QNGKCSRSDSGNTFGYD-TNG 248

Query: 243 TSLASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYSSS--GNGYQNRHSKSP 302
            S   Q+SNFFCP TFA+FYLD +P  P  GGRLSVSKDSDVY ++  GNG QNR ++SP
Sbjct: 249 VSTPLQESNFFCPETFAKFYLDHDPSVPQNGGRLSVSKDSDVYPTNGYGNGNQNRQNRSP 308

Query: 303 KQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLG 362
           KQD+EE+EAYRASFGFSADEIITT+QYVEI+DVM+ SF    ++            P  G
Sbjct: 309 KQDMEELEAYRASFGFSADEIITTSQYVEITDVMDGSFNTSAYS------------PSDG 368

Query: 363 EKLKSTHTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQVE 422
           +KL      L SQ S KS  ++  +    +     N YKD+K + +          +  E
Sbjct: 369 QKLLRREANLLSQTSPKSEADLDSQVVDFQSPKSSNSYKDHKQRNR---------IHADE 426

Query: 423 KDVFSRTGSSKNSRKYNLGLSCSDAEVDYGRGRSPREARED 456
           + + SR GS K SR Y+  +S SDAEV+Y RGRS RE+RE+
Sbjct: 429 EALLSRVGSVKGSRSYH--ISSSDAEVEYRRGRSLRESREN 426

BLAST of Cla000521 vs. TAIR10
Match: AT5G52430.1 (AT5G52430.1 hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 154.8 bits (390), Expect = 1.3e-37
Identity = 104/233 (44.64%), Positives = 132/233 (56.65%), Query Frame = 1

Query: 4   RWGGCWGALSCFHSQKGEKRIVPASRLPE----GNAVTTQPNGPPGMTNQATVITPSLLA 63
           RWG CW   SCF +QK  KRI  A  +PE    G  V T  N         TV+ P  +A
Sbjct: 35  RWGKCWSLYSCFGTQKNNKRIGNAVLVPEPVTSGVPVVTVQNS----ATSTTVVLP-FIA 94

Query: 64  PPSSPASFTNSALPSTVQSPSCFMSLSAN--SPGGPSSTMFATGPYAHETQLVSPPVFSA 123
           PPSSPASF  S   S   SP   +SL++N  SP  P S +F  GPYA+ETQ V+PPVFSA
Sbjct: 95  PPSSPASFLQSDPSSVSHSPVGPLSLTSNTFSPKEPQS-VFTVGPYANETQPVTPPVFSA 154

Query: 124 FTTEPSTAPLTPPPELA-HLTTPSSPDVPFAQFLSSSVDL----KGTGKANYIASNDLQ- 183
           F TEPSTAP TPPPE + H+TTPSSP+VPFAQ L+SS++L      +G     +S+  + 
Sbjct: 155 FITEPSTAPYTPPPESSVHITTPSSPEVPFAQLLTSSLELTRRDSTSGMNQKFSSSHYEF 214

Query: 184 AAYSLYPGSP-ASSLVSPISRTSGDCLSSSFPERDFPSQWNPSASLQDGKYPR 224
            +  + PGSP   +L+SP S  S    SS +P +      +P    + G+ P+
Sbjct: 215 RSNQVCPGSPGGGNLISPGSVISNSGTSSPYPGK------SPMVEFRIGEPPK 255

BLAST of Cla000521 vs. TAIR10
Match: AT1G63720.1 (AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1))

HSP 1 Score: 144.8 bits (364), Expect = 1.3e-34
Identity = 85/205 (41.46%), Positives = 116/205 (56.59%), Query Frame = 1

Query: 3   KRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPPGMTNQATVITPSLLAPPS 62
           ++W   W  L CF S +  KRI  +  +PE  ++++  +       ++ + T   +APPS
Sbjct: 38  RKWWNRWSLLKCFGSSRQRKRIGNSVLVPEPVSMSSSNSTTSNSGYRSVITTLPFIAPPS 97

Query: 63  SPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEP 122
           SPASF  S  PS  QSP   +S S   P     ++FA GPYAHETQLVSPPVFS +TTEP
Sbjct: 98  SPASFFQSEPPSATQSPVGILSFSP-LPCNNRPSIFAIGPYAHETQLVSPPVFSTYTTEP 157

Query: 123 STAPLTPPPELAHL----TTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYP 182
           S+AP+TPP + + +    TTPSSP+VPFAQ  +S+      G    ++S+     Y L P
Sbjct: 158 SSAPITPPLDDSSIYLTTTTPSSPEVPFAQLFNSNHQTGSYGYKFPMSSSYEFQFYQLPP 217

Query: 183 GSPASSLVSPISRTSGDCLSSSFPE 204
           GSP   L+SP   + G   +S FP+
Sbjct: 218 GSPLGQLISP---SPGSGPTSPFPD 238

BLAST of Cla000521 vs. TAIR10
Match: AT4G25620.1 (AT4G25620.1 hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 142.5 bits (358), Expect = 6.4e-34
Identity = 101/252 (40.08%), Positives = 127/252 (50.40%), Query Frame = 1

Query: 3   KRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPPGMTNQATVITPSLLAPPS 62
           K+ G  W    CF S+K  KRI  A  +PE  A           ++ +T I    +APPS
Sbjct: 33  KKRGSWWSLYWCFGSKKNNKRIGHAVLVPEPAASGAAVAPVQNSSSNSTSIFMPFIAPPS 92

Query: 63  SPASFTNSALPSTVQSPS--CFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTT 122
           SPASF  S  PS   +P      SL+ N P  PS+  F  GPYAHETQ V+PPVFSAFTT
Sbjct: 93  SPASFLPSGPPSASHTPDPGLLCSLTVNEP--PSA--FTIGPYAHETQPVTPPVFSAFTT 152

Query: 123 EPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLK-----GTGKANYIASNDLQAAYS 182
           EPSTAP TPPPE     +PSSP+VPFAQ L+SS++       G     + A++    +  
Sbjct: 153 EPSTAPFTPPPE-----SPSSPEVPFAQLLTSSLERARRNSGGGMNQKFSAAHYEFKSCQ 212

Query: 183 LYPGSPASSLVSPISRTS----GDCL--------------SSSFPERDFPSQWNPSASLQ 230
           +YPGSP  +L+SP S TS    G C                  F  R + S++   +   
Sbjct: 213 VYPGSPGGNLISPGSGTSSPYPGKCSIIEFRIGEPPKFLGFEHFTARKWGSRFGSGSITP 272

BLAST of Cla000521 vs. NCBI nr
Match: gi|659105232|ref|XP_008453041.1| (PREDICTED: uncharacterized protein At1g76660 [Cucumis melo])

HSP 1 Score: 854.7 bits (2207), Expect = 7.2e-245
Identity = 429/461 (93.06%), Positives = 440/461 (95.44%), Query Frame = 1

Query: 2   GKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPP--GMTNQATVITPSLLA 61
           GKRWGGCWGALSCFHSQKGEKRIVPASRLPEGN VTTQPNGP   GMTNQATVITPSLLA
Sbjct: 14  GKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLA 73

Query: 62  PPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFT 121
           PPSSPASFTNSALPSTVQSPSCF+SLSANSPGGPSST++ATGPYAHETQ VSPPVFSAFT
Sbjct: 74  PPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTIYATGPYAHETQPVSPPVFSAFT 133

Query: 122 TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPG 181
           TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS DLKGTGKANYIASNDLQAAYSLYPG
Sbjct: 134 TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDLKGTGKANYIASNDLQAAYSLYPG 193

Query: 182 SPASSLVSPISRTSGDCLSSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTS 241
           SPASSLVSPISRTSGDCLSSSFPERDF  QWN SASLQDGKYPRSGSGRLFGNEKAG TS
Sbjct: 194 SPASSLVSPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAG-TS 253

Query: 242 LASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVE 301
           LASQDSNFFCPATFAQFYLDN  FPHTGGRLSVSKDSDVYSS GNGYQNRHSKSPKQDVE
Sbjct: 254 LASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQNRHSKSPKQDVE 313

Query: 302 EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKS 361
           EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES EPPLLGEKLKS
Sbjct: 314 EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKS 373

Query: 362 THTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQVEKDVFS 421
           +HTTLQ+QRSIKSAPEVVEKETCTEV ALCNGYKDNKLQRQPG++ GSSTS+QVEKDVFS
Sbjct: 374 SHTTLQNQRSIKSAPEVVEKETCTEVPALCNGYKDNKLQRQPGDILGSSTSDQVEKDVFS 433

Query: 422 RTGSSKNSRKYNLGLSCSDAEVDYGRGRSPREAREDFSWHD 461
           R GSSKNSRKY+LGLSCSDAEVDY RGRS REA+ + SWHD
Sbjct: 434 RIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD 473

BLAST of Cla000521 vs. NCBI nr
Match: gi|449466510|ref|XP_004150969.1| (PREDICTED: uncharacterized protein At1g76660 [Cucumis sativus])

HSP 1 Score: 846.7 bits (2186), Expect = 2.0e-242
Identity = 426/461 (92.41%), Positives = 437/461 (94.79%), Query Frame = 1

Query: 2   GKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPP--GMTNQATVITPSLLA 61
           GKRWGGCWGALSCFHSQKG+KRIVPASRLPEGN VTTQPNGP   GMTNQATVITPSLLA
Sbjct: 14  GKRWGGCWGALSCFHSQKGDKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLA 73

Query: 62  PPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFT 121
           PPSSPASFTNSALPSTVQSPSCF+SLSANSPGGPSSTM+ATGPYAH+TQLVSPPVFSAF 
Sbjct: 74  PPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMYATGPYAHDTQLVSPPVFSAFN 133

Query: 122 TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPG 181
           TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS DLKGTGKANYIASNDLQAAYSLYPG
Sbjct: 134 TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSEDLKGTGKANYIASNDLQAAYSLYPG 193

Query: 182 SPASSLVSPISRTSGDCLSSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTS 241
           SPASSLVSPISRTSGDCLSSSFPERDF  QWN SASLQDGKYPRSGSGRLFGNEKA GTS
Sbjct: 194 SPASSLVSPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKA-GTS 253

Query: 242 LASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVE 301
           LASQDSNFFCPATFAQFYLDN  FPHTGGRLSVSKDSDVYSS GNGYQNRHSKSPKQDVE
Sbjct: 254 LASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQNRHSKSPKQDVE 313

Query: 302 EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKS 361
           EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES EPPLLGEKLKS
Sbjct: 314 EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKS 373

Query: 362 THTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQVEKDVFS 421
           +HTTLQSQRSIKSAPE    ETCTE+ ALCNGYKDNKLQRQPG++SGSSTSNQVEKDVFS
Sbjct: 374 SHTTLQSQRSIKSAPE----ETCTEMPALCNGYKDNKLQRQPGDISGSSTSNQVEKDVFS 433

Query: 422 RTGSSKNSRKYNLGLSCSDAEVDYGRGRSPREAREDFSWHD 461
           R GSSKNSRKY+LGLSCSDAEVDY RGRS REA+ + SWHD
Sbjct: 434 RIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD 469

BLAST of Cla000521 vs. NCBI nr
Match: gi|1009126185|ref|XP_015880014.1| (PREDICTED: uncharacterized protein At1g76660 [Ziziphus jujuba])

HSP 1 Score: 690.3 bits (1780), Expect = 2.3e-195
Identity = 341/465 (73.33%), Positives = 390/465 (83.87%), Query Frame = 1

Query: 3   KRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPP--GMTNQATVITPSLLAP 62
           KRWGGCW ALSCF +QKG KRIVPASR+PEGNA   QPNGP   G+TNQ T + PSLLAP
Sbjct: 17  KRWGGCWSALSCFGTQKGGKRIVPASRIPEGNASAAQPNGPQAVGLTNQGTALAPSLLAP 76

Query: 63  PSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTT 122
           PSSPASFTNSALPSTVQSPSCF+SLSANSPGGPSSTMFATGPYAHETQLVSPPVFS FTT
Sbjct: 77  PSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSTFTT 136

Query: 123 EPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGS 182
           EPSTAPLTPPPELAHLTTPSSPDVPFA FLSSSVDLK T K+NYIA NDL + YSLYPGS
Sbjct: 137 EPSTAPLTPPPELAHLTTPSSPDVPFAHFLSSSVDLKSTDKSNYIAVNDLHSTYSLYPGS 196

Query: 183 PASSLVSPISRTSGDCLSSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTSL 242
           P SS++SPISRTS +C SSSFPER+FP+QW+ S S ++GKYPR+ SGRLF ++  GG  +
Sbjct: 197 PPSSIISPISRTSNECSSSSFPEREFPTQWDSSVSPKNGKYPRNDSGRLFEHDATGG-PM 256

Query: 243 ASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVEE 302
            SQDSNFFCPATFAQFY+DNPPFPH GGRLSVSKDSD YS+ GNG+QNRHSKSPKQDVEE
Sbjct: 257 TSQDSNFFCPATFAQFYVDNPPFPHAGGRLSVSKDSDAYSTGGNGHQNRHSKSPKQDVEE 316

Query: 303 IEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEP-PLLGEKLKS 362
           IEAYRASFGFSADEIITT+QYVEISDVMEDSFTM PFTS  L  +ESIEP  + G K   
Sbjct: 317 IEAYRASFGFSADEIITTSQYVEISDVMEDSFTMTPFTSNKLPMDESIEPASISGLKAIK 376

Query: 363 THTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSST-SNQV---EK 422
           T T+ QSQ+S++S  ++++   C EV AL NG++D+K  + PG++SGSST  N++   E 
Sbjct: 377 TQTSAQSQKSLESELDLIDGGRCCEVPALSNGFEDHKSWKPPGDISGSSTPGNRILTDED 436

Query: 423 DVFSRTGSSKNSRKYNLGLSCSDAEVDYGRGRSPREAREDFSWHD 461
           D+FS+ GSS+ S+KY LGLSCSDAE+DY  GRS +E + DF WH+
Sbjct: 437 DIFSKVGSSRMSKKYQLGLSCSDAEIDYRSGRSVKEGKGDFKWHE 480

BLAST of Cla000521 vs. NCBI nr
Match: gi|645255247|ref|XP_008233411.1| (PREDICTED: uncharacterized protein At1g76660 isoform X1 [Prunus mume])

HSP 1 Score: 676.4 bits (1744), Expect = 3.5e-191
Identity = 348/468 (74.36%), Positives = 385/468 (82.26%), Query Frame = 1

Query: 3   KRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPP--GMTNQATVITPSLLAP 62
           KRWGGCWGA SCF S KG KRIVPASR+PEGNA  TQPNGP   G+TNQAT + PSLLAP
Sbjct: 15  KRWGGCWGAFSCFDSHKGGKRIVPASRIPEGNASATQPNGPQAVGLTNQATSLAPSLLAP 74

Query: 63  PSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTT 122
           PSSPASFTNSALPST QSPSC + LSANSPGGPSSTM+ATGPYA+ETQLVSPPVFS FTT
Sbjct: 75  PSSPASFTNSALPSTAQSPSCSLLLSANSPGGPSSTMYATGPYANETQLVSPPVFSTFTT 134

Query: 123 EPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGS 182
           EPSTAPLTPPPELAHLTTPSSPDVPFA+FLSSSVD+K T K NYIA+NDLQA YSLYPGS
Sbjct: 135 EPSTAPLTPPPELAHLTTPSSPDVPFARFLSSSVDIKTTDKTNYIAANDLQATYSLYPGS 194

Query: 183 PASSLVSPISRTSGDCLSSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTSL 242
           PASSL SPISR S DC SSSFPERDFP QW+PS S Q+G YPRSGS RLFG +   G S 
Sbjct: 195 PASSLRSPISRASNDC-SSSFPERDFPRQWDPSVSPQNGTYPRSGSARLFGYDTT-GASA 254

Query: 243 ASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVEE 302
           ASQDSNFFCPATFAQFYLDNPPFPH GGRLSVSKDSDVYS+ GNG QNRH++SPKQDVEE
Sbjct: 255 ASQDSNFFCPATFAQFYLDNPPFPHAGGRLSVSKDSDVYSTGGNGSQNRHNRSPKQDVEE 314

Query: 303 IEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKS- 362
           +EAYRASFGFSADEIITTTQYVEISDVM+DSFTM PFTS  L  EE IEP  + E LK+ 
Sbjct: 315 LEAYRASFGFSADEIITTTQYVEISDVMDDSFTMTPFTSHKLPTEEHIEPISVTEGLKAQ 374

Query: 363 -THTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSN------Q 422
            T T LQSQ + KS  ++ E  + +++   CNGY+D+K  RQPG++S SST         
Sbjct: 375 KTKTILQSQDTTKSESDLDEGGS-SDLPISCNGYEDHKSWRQPGDVSRSSTPGPGIRVLA 434

Query: 423 VEKDVFSRTGSSKNSRKYNLGLSCSDAEVDYGRGRSPREAREDFSWHD 461
            E+D+FS+ GSSK SRKY LGLS SDAE+DY RGRS RE + +F+WHD
Sbjct: 435 DEEDIFSKIGSSKLSRKYQLGLSSSDAEIDYRRGRSLRERKGEFAWHD 479

BLAST of Cla000521 vs. NCBI nr
Match: gi|743874734|ref|XP_011034746.1| (PREDICTED: uncharacterized protein At1g76660-like isoform X3 [Populus euphratica])

HSP 1 Score: 665.6 bits (1716), Expect = 6.2e-188
Identity = 334/462 (72.29%), Positives = 379/462 (82.03%), Query Frame = 1

Query: 3   KRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPP--GMTNQATVITPSLLAP 62
           KRWGGCWGALSCF  QKG KRIVPASR+PEGNA   QPNGP   G+TNQAT + PSLLAP
Sbjct: 19  KRWGGCWGALSCFSVQKGGKRIVPASRIPEGNASAAQPNGPQPVGLTNQATALAPSLLAP 78

Query: 63  PSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTT 122
           PSSPASFTNSALPST QSPSCF+SLSANSPGGPSSTMFATGPYAHETQLVSPPVFS FTT
Sbjct: 79  PSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSTFTT 138

Query: 123 EPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGS 182
           EPSTAPLTPPPE+AHLTTPSSPDVPFAQFL+SS DLKG  K NYI ++DLQ+ YSLYPGS
Sbjct: 139 EPSTAPLTPPPEMAHLTTPSSPDVPFAQFLTSSRDLKGAEKNNYIVASDLQSTYSLYPGS 198

Query: 183 PASSLVSPISRTSGDCLSSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTSL 242
           PASSL+SPISRTSGDCLS+SFPER FP +W PS S Q+GKY RSGSGRLFG+E   G S+
Sbjct: 199 PASSLLSPISRTSGDCLSASFPERGFPREWGPSVSPQNGKYSRSGSGRLFGHETT-GASM 258

Query: 243 ASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVEE 302
            S DSNFFCPATFA+FYL+    P+TGGRLSVSKDSDVY +SGNG+QNRH+KSPKQD EE
Sbjct: 259 VSHDSNFFCPATFARFYLEQN--PNTGGRLSVSKDSDVYPASGNGHQNRHNKSPKQDAEE 318

Query: 303 IEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKST 362
           +EAYRASFGFSADEIITT QYVEISDVMED+F+M PFTS   + EES+E   L E  K+ 
Sbjct: 319 LEAYRASFGFSADEIITTPQYVEISDVMEDTFSMTPFTSAKPTMEESMEASFLNESQKA- 378

Query: 363 HTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQV--EKDVF 422
           +  L  Q S+K   ++ ++  C EV    + Y+D K + QPGN+SGSST + V  + D+F
Sbjct: 379 NANLPKQNSLKLKSDLADRVVCCEVPVTSDRYEDPKSRWQPGNVSGSSTPSNVVTDDDIF 438

Query: 423 SRTGSSKNSRKYNLGLSCSDAEVDYGRGRSPREAREDFSWHD 461
           S+  SSK SRKY+LGLS SDAE+DY RGRS RE + DF+WHD
Sbjct: 439 SKMASSKTSRKYHLGLSSSDAEIDYRRGRSLREGKGDFAWHD 476
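
The percentages reported in the HSP header can be recomputed from the counts it gives. The following is a minimal sketch, not part of the database's own tooling: the function names `percent` and `count_midline` are hypothetical, and it assumes the usual BLAST midline convention in which a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches / aligned_length as a percentage rounded to 2 decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


def count_midline(midline: str) -> tuple[int, int]:
    """Count (identities, positives) in one BLAST midline string.

    Assumption: letters are identities and '+' is a conservative
    substitution; positives include identities.
    """
    identities = sum(ch.isalpha() for ch in midline)
    conservative = midline.count("+")
    return identities, identities + conservative


if __name__ == "__main__":
    # Counts taken from the HSP header above: 334 and 379 out of 462 columns.
    print(percent(334, 462), percent(379, 462))
```

Summing the `count_midline` results over every midline row of an HSP and passing the totals to `percent` should give the header values, provided the midline text was copied without losing whitespace.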

The following BLAST results are available for this feature:
Match Name    E-value     Identity (%)   Description
Y1666_ARATH   7.0e-131    58.35          Uncharacterized protein At1g76660 OS=Arabidopsis thaliana GN=At1g76660 PE=2 SV=1
Match Name          E-value     Identity (%)   Description
A0A0A0L1G3_CUCSA    1.4e-242    92.41          Uncharacterized protein OS=Cucumis sativus GN=Csa_4G665140 PE=4 SV=1
B9GPK6_POPTR        4.8e-187    72.53          Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0002s00330g PE=4 SV=2
F6I6Y3_VITVI        6.2e-187    71.86          Putative uncharacterized protein OS=Vitis vinifera GN=VIT_18s0122g00140 PE=4 SV=...
V4TGC4_9ROSI        8.1e-187    71.55          Uncharacterized protein OS=Citrus clementina GN=CICLE_v10000992mg PE=4 SV=1
A0A067KKH5_JATCU    4.0e-186    72.26          Uncharacterized protein OS=Jatropha curcas GN=JCGZ_08002 PE=4 SV=1
Match Name     E-value     Identity (%)   Description
AT1G76660.1    3.9e-132    58.35          FUNCTIONS IN: molecular_function unknown
AT5G52430.1    1.3e-37     44.64          hydroxyproline-rich glycoprotein family protein
AT1G63720.1    1.3e-34     41.46          BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glyc...
AT4G25620.1    6.4e-34     40.08          hydroxyproline-rich glycoprotein family protein
Match Name                           E-value     Identity (%)   Description
gi|659105232|ref|XP_008453041.1|     7.2e-245    93.06          PREDICTED: uncharacterized protein At1g76660 [Cucumis melo]
gi|449466510|ref|XP_004150969.1|     2.0e-242    92.41          PREDICTED: uncharacterized protein At1g76660 [Cucumis sativus]
gi|1009126185|ref|XP_015880014.1|    2.3e-195    73.33          PREDICTED: uncharacterized protein At1g76660 [Ziziphus jujuba]
gi|645255247|ref|XP_008233411.1|     3.5e-191    74.36          PREDICTED: uncharacterized protein At1g76660 isoform X1 [Prunus mume]
gi|743874734|ref|XP_011034746.1|     6.2e-188    72.29          PREDICTED: uncharacterized protein At1g76660-like isoform X3 [Populus euphratica...
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term    Definition
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0008150        biological_process
biological_process    GO:0055085        transmembrane transport
cellular_component    GO:0005886        plasma membrane
cellular_component    GO:0005575        cellular_component
cellular_component    GO:0016021        integral component of membrane
molecular_function    GO:0003674        molecular_function
This gene is associated with the following unigenes:
Unigene Name    Analysis Name                            Sequence type in Unigene
WMU32304        watermelon unigene v2 vs TrEMBL          transcribed_cluster
WMU55991        watermelon EST collection version 2.0    transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name    Type
Cla000521       Cla000521.1    mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name    Unique Name    Type
WMU32304        WMU32304       transcribed_cluster
WMU55991        WMU55991       transcribed_cluster


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term    IPR Description     Source     Source Term      Source Description     Alignment
None        No IPR available    PANTHER    PTHR31798        FAMILY NOT NAMED       coord: 3..460, score: 2.4E
None        No IPR available    PANTHER    PTHR31798:SF3    SUBFAMILY NOT NAMED    coord: 3..460, score: 2.4E