CSPI03G17110 (gene) Wild cucumber (PI 183967)

Name: CSPI03G17110
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Hydroxyproline-rich glycoprotein
Location: Chr3 : 12838461 .. 12841963 (-)
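
As a small illustration of how the location field can be read programmatically, the sketch below parses a string of the form `Chr3 : 12838461 .. 12841963 (-)` and reports the span it covers. The function names `parse_location` and `span_length` are hypothetical helpers, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state explicitly.

```python
import re

def parse_location(text):
    """Split a location such as 'Chr3 : 12838461 .. 12841963 (-)' into its parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    """Number of bases covered, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end, strand = parse_location("Chr3 : 12838461 .. 12841963 (-)")
print(chrom, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the span is 3503 bases on the minus strand of Chr3.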
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
TCTTCATTTTTATTACATTTCCTAATTTATAATTTCATTTTGTTTATTTATTTGGAAATATATAAAATGAGAAAATTCAATCTCTCTCTTTCTTCGTCTGAGTTTTTCATTTCTTAGATCAAATCCAAAAGTCTTCTCTCTTCCTTCTTTTGCCTGTCAAATCTCACTCCATACTAATTGATTTTCCCGGCCACTTACCGCCCTCGATTGATCCCAAATCACGGCGACCTTGTTAAGAACTCCGCCGGCCGAGCTCTCTGGATTCGAAGAAACTGGGGATGGCAAGTATCAACAACAGCGTCGATACGGTTAATGCTGCCGCTACTGCCATCGTTTCTGCTGAGGCTCGAGTTCAGCCTACCACACCTCCGGTACGTTTATTTTTTTTTCTTTATTTGTTTTCTGGTTTGATTTTTATTTGGAACTGAGAAAAAAGTGCGGTAATTTGATTGAGAAATGGAGTTTTTGATCTCGGTGTTGGTTTGCTGTTTTTAGTTTAGATGAGGAAAATCTCATCAAGGTTGATGATCTAAACGATTTTTTTTTTTTTTTTTAATTTGAGAAAACCGTGTTGGTTACGCGTTTTTGTTTCTTGATTCTAAGGCTTTGATTTGGTACTTTCTGCTTCGGTTGGTGGTGTTTTTTTTTTTTTTTTTTTTGGTGTTGCGCTTTTGTGAAGACGTTGCATCTTAATTTCACAAAAAGTTGTTGCTTAATTAGTCTAGCTGGTAAACGGTTGAAGATAAAGCAACTTGAAAAGTTTCGATTAATCTGCTTTTAGAACAGAAAGGAAGGGGAACATAAGGTTGTTCGGTAATTAATTACAAGTGCGATTTGCTCGATTCTGTGCAATAACTTCCTTCGGAACTTGGATTCATTCGGTGTAAGATTTCTCATACAGTGGGACTTGAATCAATTTAGTTAGCGATCTTATGGACTTGTCATCAGATTTGTTGAATTAGTTGTTTTTGATAATGGATCTTGTAGCGCTGTACAGTGTGTCAATTTATTCGGCTCTTTTGGGCTTTAGTTTTCTTCTCTTAAGGTTGAGTTTGGTCTGGGTTGAGGATTCCTTTGATGGGGACAGACAAAAATTTGCGTCTCTTTAGAGGAAGATTCTATGCTCCTGAAGGTGTTCCGCTTGAGCTATCATACGCTCTTATCACAGAGCAATTGACTGCCAAAATCTTTAGTCTTAATCTTACCACCTCTTTAAATAGATCGATCATCGACCTCAACGGGTGGCTCAAAACTGAGATGCCGTACGGTAGTTAGCTTTCACTTGTAAAACTTTCAACTCTGTTTACCGCTTTATGCAAAAATTTCTGAAACCGTTTGGTTTTTTTTAGCATATGAACACAACATGAATTGTTTACTAACCAATTTAGCCAAGAAACATTAAAAGTTTGGATTTTGAATTGATGAAGAAGGGTTTTCTGTTCCCACTCTCTTCCGTTCTGTCTATTCTCTATTGCCTGACATGAGAGGTGAGGAAGTAACGTAATGTTTTGTTGTTAACACGACATAGTCGATGATTATGAGCTTCTTACCGGAGGTTTTACCTGCTGAAGTTCTTAGACTTCTGAATAGTTGTTATTAGTACTGCTCTGATGGCTTGAATAATGATTCTGTTTTAGCTAGATTGAAGTCTCTGTTCTGTTTCCTGTTGATCTTTTCATTTACATTTTCTTGTTGTCTTAACTACAGAAACGAAGATGGGGTAGCTGCTGGAGTCTGTACTGGTGCTTTGGCATTGGTTCACAGAAAAGCAATAAACGTATAGGTCATGCTGTACTAGTTCCTGAACCTGCAGTACCAGGAGCCGTTGCCCCTGCTGTTGAGCATCGAACACCTTCAACCACCATGGTATTACCTTTCATTGCCCCTCCATCTTCTCCTGCATCTTTCCTCCAGTCCGAACCTACATCAAATACTCAATCTCCTGCTGGATTACTATCCTTAACTGCTCTTTCAGTCAATAACTACTCCCCAAATGGACC
TGCTTCCATTTTTGCAATAGGCCCTTATACATATGACACTCAGTTGGTCTCACCTCCAGTTTTTTCTGCCTTCACCACTGAACCATCAACCGCTCCTATTACTCCTCCTCCTGAGTCTGTTCAACTGACTACACCCTCATCTCCTGAAGTTCCATTTGCTAAATTGCTGACATCTTCTCTAAGCCATACTAATAAAAGTTTTGGGACTAACCAAAAGTTCACACTATCACACTGTGATTTCCAGCCTTATCAACCCTACCCAGGAAGCCCTGGTGCTCATCTTATATCACCTGGATCAGTAATTTCAAACTCTGGTACATCTTCTCCTTTTCCTGATAAACACCCCATTCTTGAGTTCCGCATGGCAGATGCACCGAAGCTCTTGGGTCTTGAACATTTTACAACTCGCAAATGGATCTCAAGAATGGGTTCTGGATCTTTGACTCCAGATGGTACCGGTTTATGTTCTAGGTTAGGTTCAGGAACTTTGACTCCTGATGGTATGGGAATGGGTTCTAGATTGGGATCTGGATCTGTTACCCCAAATGGTATGAGGCAAGATTCAAGATTGGGTTCTGGAACCTTGACGCCTGATGGTCTGGGCCATGGCTTGCAAGATAGTCCATTGTTGGACAACCAAATATCTGAGGTGGCTTCTCTTGCCAACTCAGAAACTGGATGCCAAAATGATGTGACAAATCATAGGGTGTCATTTGAGTTAACTGGTGAAGATGTTGCACGCTGTCTTGCAAATAAGTCATTGACATCCATTAGAACTGAATCTGAGTCTCCGAAGCAAACAAGCACAAGCAATCAAAACGAAAACAAAGAATCATCCAGAGAAGCTGAAACTTGCGAGTTCTTTGACATCAAGACTTCCGCAGCACCAGAAAAAACTCCAGGAGAGGATGATCAATGCTACCAAAATCAGCGAGCTGTAACTCTTGGTTCATTCAAAGAGTTCAACTTTGACCAAACTAAAGGAGAAATACACAACACAGCCTCCATCGGTGCAGAATGGTGGGCCAATGAGAAAGTGGGTGTGAAGGAAGCTAGTCCAGGTAACAACTGGACCTTCTTCCCATTGTTGCAACCTGGCGTCAGCTGACTTTGACAAAGGATATCAACACTAAAAAGAACAAAACAAAGATGAAGAAGAAACAACAGCCAGCCCTTTTGAATGTACATTTGAATGTAATCCTCCTTTGGAGGTGATGCAATGATTTGGAGGAAAAAGAATGTTTTTTCAAAGTTTGTTTTGTGAAAAACAGTATTCCCAAATAGACATCAGAAAAGAAAGTAGTTATTAGGGATAACTTGTGTGGGTACCGAAGGAGCATTCTTTGTCATAGCTCAAAAGTAGATCATTCATAATCATAGGATCTTTTGGAAGTGCTTTATTCTTTCTTGTCTTTGTATAATAATAAGAAATTTCATTCTTCCCCAACAATCAAAACATCATTTCTTGAAAACTTAGTCTTGAATTTAGTTTTTTCATGTCAA

mRNA sequence

ATGGCAAGTATCAACAACAGCGTCGATACGGTTAATGCTGCCGCTACTGCCATCGTTTCTGCTGAGGCTCGAGTTCAGCCTACCACACCTCCGAAACGAAGATGGGGTAGCTGCTGGAGTCTGTACTGGTGCTTTGGCATTGGTTCACAGAAAAGCAATAAACGTATAGGTCATGCTGTACTAGTTCCTGAACCTGCAGTACCAGGAGCCGTTGCCCCTGCTGTTGAGCATCGAACACCTTCAACCACCATGGTATTACCTTTCATTGCCCCTCCATCTTCTCCTGCATCTTTCCTCCAGTCCGAACCTACATCAAATACTCAATCTCCTGCTGGATTACTATCCTTAACTGCTCTTTCAGTCAATAACTACTCCCCAAATGGACCTGCTTCCATTTTTGCAATAGGCCCTTATACATATGACACTCAGTTGGTCTCACCTCCAGTTTTTTCTGCCTTCACCACTGAACCATCAACCGCTCCTATTACTCCTCCTCCTGAGTCTGTTCAACTGACTACACCCTCATCTCCTGAAGTTCCATTTGCTAAATTGCTGACATCTTCTCTAAGCCATACTAATAAAAGTTTTGGGACTAACCAAAAGTTCACACTATCACACTGTGATTTCCAGCCTTATCAACCCTACCCAGGAAGCCCTGGTGCTCATCTTATATCACCTGGATCAGTAATTTCAAACTCTGGTACATCTTCTCCTTTTCCTGATAAACACCCCATTCTTGAGTTCCGCATGGCAGATGCACCGAAGCTCTTGGGTCTTGAACATTTTACAACTCGCAAATGGATCTCAAGAATGGGTTCTGGATCTTTGACTCCAGATGGTACCGGTTTATGTTCTAGGTTAGGTTCAGGAACTTTGACTCCTGATGGTATGGGAATGGGTTCTAGATTGGGATCTGGATCTGTTACCCCAAATGGTATGAGGCAAGATTCAAGATTGGGTTCTGGAACCTTGACGCCTGATGGTCTGGGCCATGGCTTGCAAGATAGTCCATTGTTGGACAACCAAATATCTGAGGTGGCTTCTCTTGCCAACTCAGAAACTGGATGCCAAAATGATGTGACAAATCATAGGGTGTCATTTGAGTTAACTGGTGAAGATGTTGCACGCTGTCTTGCAAATAAGTCATTGACATCCATTAGAACTGAATCTGAGTCTCCGAAGCAAACAAGCACAAGCAATCAAAACGAAAACAAAGAATCATCCAGAGAAGCTGAAACTTGCGAGTTCTTTGACATCAAGACTTCCGCAGCACCAGAAAAAACTCCAGGAGAGGATGATCAATGCTACCAAAATCAGCGAGCTGTAACTCTTGGTTCATTCAAAGAGTTCAACTTTGACCAAACTAAAGGAGAAATACACAACACAGCCTCCATCGGTGCAGAATGGTGGGCCAATGAGAAAGTGGGTGTGAAGGAAGCTAGTCCAGGTAACAACTGGACCTTCTTCCCATTGTTGCAACCTGGCGTCAGCTGA

Coding sequence (CDS)

ATGGCAAGTATCAACAACAGCGTCGATACGGTTAATGCTGCCGCTACTGCCATCGTTTCTGCTGAGGCTCGAGTTCAGCCTACCACACCTCCGAAACGAAGATGGGGTAGCTGCTGGAGTCTGTACTGGTGCTTTGGCATTGGTTCACAGAAAAGCAATAAACGTATAGGTCATGCTGTACTAGTTCCTGAACCTGCAGTACCAGGAGCCGTTGCCCCTGCTGTTGAGCATCGAACACCTTCAACCACCATGGTATTACCTTTCATTGCCCCTCCATCTTCTCCTGCATCTTTCCTCCAGTCCGAACCTACATCAAATACTCAATCTCCTGCTGGATTACTATCCTTAACTGCTCTTTCAGTCAATAACTACTCCCCAAATGGACCTGCTTCCATTTTTGCAATAGGCCCTTATACATATGACACTCAGTTGGTCTCACCTCCAGTTTTTTCTGCCTTCACCACTGAACCATCAACCGCTCCTATTACTCCTCCTCCTGAGTCTGTTCAACTGACTACACCCTCATCTCCTGAAGTTCCATTTGCTAAATTGCTGACATCTTCTCTAAGCCATACTAATAAAAGTTTTGGGACTAACCAAAAGTTCACACTATCACACTGTGATTTCCAGCCTTATCAACCCTACCCAGGAAGCCCTGGTGCTCATCTTATATCACCTGGATCAGTAATTTCAAACTCTGGTACATCTTCTCCTTTTCCTGATAAACACCCCATTCTTGAGTTCCGCATGGCAGATGCACCGAAGCTCTTGGGTCTTGAACATTTTACAACTCGCAAATGGATCTCAAGAATGGGTTCTGGATCTTTGACTCCAGATGGTACCGGTTTATGTTCTAGGTTAGGTTCAGGAACTTTGACTCCTGATGGTATGGGAATGGGTTCTAGATTGGGATCTGGATCTGTTACCCCAAATGGTATGAGGCAAGATTCAAGATTGGGTTCTGGAACCTTGACGCCTGATGGTCTGGGCCATGGCTTGCAAGATAGTCCATTGTTGGACAACCAAATATCTGAGGTGGCTTCTCTTGCCAACTCAGAAACTGGATGCCAAAATGATGTGACAAATCATAGGGTGTCATTTGAGTTAACTGGTGAAGATGTTGCACGCTGTCTTGCAAATAAGTCATTGACATCCATTAGAACTGAATCTGAGTCTCCGAAGCAAACAAGCACAAGCAATCAAAACGAAAACAAAGAATCATCCAGAGAAGCTGAAACTTGCGAGTTCTTTGACATCAAGACTTCCGCAGCACCAGAAAAAACTCCAGGAGAGGATGATCAATGCTACCAAAATCAGCGAGCTGTAACTCTTGGTTCATTCAAAGAGTTCAACTTTGACCAAACTAAAGGAGAAATACACAACACAGCCTCCATCGGTGCAGAATGGTGGGCCAATGAGAAAGTGGGTGTGAAGGAAGCTAGTCCAGGTAACAACTGGACCTTCTTCCCATTGTTGCAACCTGGCGTCAGCTGA
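
The coding sequence above can be related to the protein query used in the BLAST reports by translating it codon by codon. The following is a minimal sketch, assuming the standard genetic code; `translate` is a hypothetical helper written for this illustration, not part of any database tooling. Only the first 36 bases of the CDS are used in the example call.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 36 bases of the CDS listed above.
print(translate("ATGGCAAGTATCAACAACAGCGTCGATACGGTTAAT"))  # MASINNSVDTVN
```

The result matches the start of the query sequence shown in the alignments (`MASINNSVDTVN...`).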
BLAST of CSPI03G17110 vs. Swiss-Prot
Match: Y1666_ARATH (Uncharacterized protein At1g76660 OS=Arabidopsis thaliana GN=At1g76660 PE=2 SV=1)

HSP 1 Score: 147.5 bits (371), Expect = 3.8e-34
Identity = 102/215 (47.44%), Positives = 123/215 (57.21%), Query Frame = 1

Query: 32  KRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRT------PSTTMV 91
           ++RWG C  ++ CF   SQK  KRI  A  +PE     A  P   H+        +  + 
Sbjct: 8   RKRWGGCLGVFSCFK--SQKGGKRIVPASRIPEGGNVSASQPNGAHQAGVLNNQAAGGIN 67

Query: 92  LPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALSVNNYSPNGPAS-IFAIGPYTYDTQL 151
           L  +APPSSPASF  S   S TQSP   LSL A      SP GP+S ++A GPY ++TQL
Sbjct: 68  LSLLAPPSSPASFTNSALPSTTQSPNCYLSLAA-----NSPGGPSSSMYATGPYAHETQL 127

Query: 152 VSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFTL 211
           VSPPVFS FTTEPSTAP TPPPE  +LT PSSP+VP+A+ LTSS+   N   G       
Sbjct: 128 VSPPVFSTFTTEPSTAPFTPPPELARLTAPSSPDVPYARFLTSSMDLKNSGKG------- 187

Query: 212 SHCDFQ-PYQPYPGSPGAHLISPGSVISNSGTSSP 239
            + D Q  Y  YPGSP + L SP S  S  G  SP
Sbjct: 188 HYNDLQATYSLYPGSPASALRSPISRASGDGLLSP 208
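
The percentages in each HSP summary line are the ratio of matching columns to alignment length. The sketch below recomputes the identity figure from the text of such a line; `parse_identity` is a hypothetical helper, and the regular expression assumes the `Identity = matches/length` layout used in these reports.

```python
import re

def parse_identity(line):
    """Return (matches, alignment_length, percent) from a BLAST HSP summary line."""
    match = re.search(r"Identity\s*=\s*(\d+)/(\d+)", line)
    if match is None:
        raise ValueError("no 'Identity = a/b' field found")
    matches, length = int(match.group(1)), int(match.group(2))
    percent = round(100.0 * matches / length, 2)
    return matches, length, percent

print(parse_identity("Identity = 102/215 (47.44%)"))
```

For the Swiss-Prot hit above, 102 identical positions over 215 aligned columns gives 47.44%, in agreement with the reported value.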

BLAST of CSPI03G17110 vs. TrEMBL
Match: W9S7Z6_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_004326 PE=4 SV=1)

HSP 1 Score: 659.8 bits (1701), Expect = 2.6e-186
Identity = 355/523 (67.88%), Positives = 407/523 (77.82%), Query Frame = 1

Query: 1   MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAV 60
           M ++NNSV+T+NAAATAIVSAEAR QP   PKRRWGSCWSLYWCFG  S K++KRIGHAV
Sbjct: 1   MRTVNNSVETINAAATAIVSAEARAQPAAVPKRRWGSCWSLYWCFG--SHKNSKRIGHAV 60

Query: 61  LVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALS 120
           LVPEP +PGA APA E++ PST +VLPFIAPPSSPASFLQS+P S TQSPAGLLSLT+LS
Sbjct: 61  LVPEPVLPGAAAPAPENQAPSTAIVLPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLS 120

Query: 121 VNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVP 180
           +N YSP GP SIFAIGPY Y+TQLVSPPVFS FTTEPSTAP TPPPESVQLTTPSSPEVP
Sbjct: 121 INAYSPGGPTSIFAIGPYAYETQLVSPPVFSTFTTEPSTAPFTPPPESVQLTTPSSPEVP 180

Query: 181 FAKLLTSSLSHTNK-SFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPF 240
           FA+LLTSSL  T + S G NQKF+LSHC+FQPYQ YPGSPG +LISPGSV+SNSGTSSPF
Sbjct: 181 FAQLLTSSLDRTRRNSSGANQKFSLSHCEFQPYQLYPGSPGGNLISPGSVVSNSGTSSPF 240

Query: 241 PDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGM 300
           PDKHPIL FRM +AP+LLG EHFTT KW SR+GSGSLTPDG GL SRLGSG++TPDG+G+
Sbjct: 241 PDKHPILGFRMGEAPRLLGFEHFTTWKWGSRLGSGSLTPDGVGLGSRLGSGSVTPDGVGL 300

Query: 301 GSRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQ----------------DSPLLDNQI 360
           GSRLGSGS+TP+G    SRLGSG +TP+G G G +                DS LL+NQI
Sbjct: 301 GSRLGSGSLTPDGYGLGSRLGSGCMTPNGPGLGSRLGSGTLTPDGFLVVSGDSFLLENQI 360

Query: 361 SEVASLANSETGCQND--VTNHRVSFELTGEDVARCLANKSLTSI-RTESE----SPKQT 420
           SEVASLANS+ GCQND  V +HRVSFELTGEDVARCLA+KS +S  RT SE    SP + 
Sbjct: 361 SEVASLANSDNGCQNDGSVVDHRVSFELTGEDVARCLASKSASSNGRTTSESLEDSPAEC 420

Query: 421 STSNQ--NENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQ 480
            T     + N   S   ++C       +   +   GEDD  YQ  R++TLGS KEFNFD 
Sbjct: 421 PTKKDGISANNVDSPNDQSCVEETSNKTPQSDCREGEDDHFYQKHRSITLGSIKEFNFDN 480

Query: 481 TKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS 498
           TK ++    +IG+EWWANEKV  KEA  GN+W+FFP+LQPGVS
Sbjct: 481 TKADVSVKPTIGSEWWANEKVAGKEAKAGNSWSFFPILQPGVS 521

BLAST of CSPI03G17110 vs. TrEMBL
Match: M5XMF7_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004616mg PE=4 SV=1)

HSP 1 Score: 657.5 bits (1695), Expect = 1.3e-185
Identity = 347/501 (69.26%), Positives = 397/501 (79.24%), Query Frame = 1

Query: 1   MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAV 60
           M S+N+SVDT+NAAATAIVSAEAR QPTT PKRRWGSCWSLYWCFG      NKRIGHAV
Sbjct: 1   MRSVNSSVDTINAAATAIVSAEARPQPTTVPKRRWGSCWSLYWCFG---PHKNKRIGHAV 60

Query: 61  LVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALS 120
           LVPEP VPGA   A++++T ST +V+PFIAPPSSPASFL S+P S TQSPAG LSL +LS
Sbjct: 61  LVPEPVVPGAAVSAIDNQTTSTAIVVPFIAPPSSPASFLPSDPPSATQSPAGFLSLKSLS 120

Query: 121 VNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVP 180
            N YSP GPASIF+IGPY Y+TQLVSPPVFS F TEPSTAP TPPPESVQLTTPSSPEVP
Sbjct: 121 ANAYSPGGPASIFSIGPYAYETQLVSPPVFSTFNTEPSTAPFTPPPESVQLTTPSSPEVP 180

Query: 181 FAKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFP 240
           FA+LLTSSL    ++ GTNQKF LSH +FQPYQ YPGSPG +LISPGS +SNSGTSSPFP
Sbjct: 181 FAQLLTSSLDRNRRNSGTNQKFALSHYEFQPYQQYPGSPGGNLISPGSAVSNSGTSSPFP 240

Query: 241 DKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG 300
           D+HP+LEFRM +APKL G +HFTTRKW SR+GSGSLTPDG GL SRLGSG+LTPDG  +G
Sbjct: 241 DRHPVLEFRMGEAPKLFGFDHFTTRKWGSRIGSGSLTPDGVGLGSRLGSGSLTPDGNELG 300

Query: 301 SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQ--N 360
           SRLGSG VTPNG    SRLGSG LTPDG G   +DS LL+NQISEVASLANSE+GCQ   
Sbjct: 301 SRLGSGCVTPNGAGIGSRLGSGCLTPDGPGPASRDSFLLENQISEVASLANSESGCQTVE 360

Query: 361 DVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSNQNENKE-SSREAETCEF- 420
            V +HRVSFELTGEDVA CLANK++ S RT S S K  ++   +E    SS  +  CEF 
Sbjct: 361 TVFDHRVSFELTGEDVACCLANKAVASNRTASGSSKVIASEYPSERDALSSDSSNHCEFS 420

Query: 421 FDIKTSAAPEKTPGE-DDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKV 480
            +  +S  PE   GE +DQ Y+  R++TLGS K+FNFD TK E+ N  +IG+EWWAN+ V
Sbjct: 421 VEESSSRIPENVSGEGEDQGYRKHRSITLGSTKDFNFDNTKAEVPNKPNIGSEWWANKNV 480

Query: 481 GVKEASPGNNWTFFPLLQPGV 497
             KE+ P N+WTFFP+LQPGV
Sbjct: 481 AAKESKPCNDWTFFPILQPGV 498

BLAST of CSPI03G17110 vs. TrEMBL
Match: A0A067JHK1_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26079 PE=4 SV=1)

HSP 1 Score: 644.4 bits (1661), Expect = 1.1e-181
Identity = 339/501 (67.66%), Positives = 397/501 (79.24%), Query Frame = 1

Query: 1   MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAV 60
           M S+NNSV+T+NAAATAI+SAE+RVQPT   KRRWG CWSLYWCFG  S K++KRIGHAV
Sbjct: 1   MRSVNNSVETINAAATAIISAESRVQPTVVQKRRWGGCWSLYWCFG--SHKNSKRIGHAV 60

Query: 61  LVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALS 120
           LVPEP VP AV  + E++T ST   +PFIAPPSSPASFLQS+P S TQSPAGLLSLTALS
Sbjct: 61  LVPEPEVPQAVVTSAENQTHSTAAAVPFIAPPSSPASFLQSDPPSVTQSPAGLLSLTALS 120

Query: 121 VNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVP 180
           V+ YSP GPASIFAIGPY ++TQLV+PPVFSAFTTEPSTAP TPPPESVQLTTPSSPEVP
Sbjct: 121 VSAYSPGGPASIFAIGPYAHETQLVTPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVP 180

Query: 181 FAKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFP 240
           FA+LLTSSL    ++ G NQKF LSH +FQ Y  YPGSPG  LISPGS+ISNSGTSSPFP
Sbjct: 181 FAQLLTSSLERARRNSGANQKFALSHYEFQSYPLYPGSPGGQLISPGSIISNSGTSSPFP 240

Query: 241 DKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG 300
           D+HP+LEFRM +APKLLG EHFTTRKW SR+GSG+LTPDG GL SRL SGT TPDG+G+G
Sbjct: 241 DRHPLLEFRMGEAPKLLGFEHFTTRKWGSRLGSGTLTPDGVGLGSRLCSGTATPDGVGLG 300

Query: 301 SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQND- 360
           SRLGSGSVTP+G+   SRLGSG+LTPD +    QD  LL+NQISEVASLANSE   +ND 
Sbjct: 301 SRLGSGSVTPDGVGLRSRLGSGSLTPDCVVPASQDGLLLENQISEVASLANSENASKNDE 360

Query: 361 -VTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSNQNENKESSREAETCEFFD 420
            + +HRVSFEL+GE+VARCL +KS+TS RT SE P+ +    Q  ++E    +  C    
Sbjct: 361 NIVDHRVSFELSGEEVARCLESKSMTSSRTFSECPQDSMAEEQINSEEILINSNDCLHIG 420

Query: 421 IKTSAAPEKTPG--EDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVG 480
             ++  PEK  G  E++ CY+  R++TLGS KEFNFD +K E+ +  +I +EWWANE + 
Sbjct: 421 ETSNETPEKPSGETEEEPCYRKHRSITLGSIKEFNFDNSK-EVPDKPTISSEWWANETIA 480

Query: 481 VKEASPGNNWTFFPLLQPGVS 498
            KEA P NNWTFFPLLQP VS
Sbjct: 481 GKEARPANNWTFFPLLQPEVS 498

BLAST of CSPI03G17110 vs. TrEMBL
Match: B9RIV6_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1583050 PE=4 SV=1)

HSP 1 Score: 627.1 bits (1616), Expect = 1.8e-176
Identity = 334/497 (67.20%), Positives = 391/497 (78.67%), Query Frame = 1

Query: 5   NNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPE 64
           N+SVDT+NAAATAIVSAE+RVQPTT  KRRWG CWSLYWCFG  S K+ KRIGHAVL PE
Sbjct: 19  NSSVDTINAAATAIVSAESRVQPTTVQKRRWGGCWSLYWCFG--SHKT-KRIGHAVLAPE 78

Query: 65  PAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALSVNNY 124
           P V GAV  + E+++ ST + +PFIAPPSSPASFLQS+P S TQSPAGLLSLT+LSVN Y
Sbjct: 79  PEVQGAVVTSAENQSQSTAITVPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAY 138

Query: 125 SPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKL 184
           SP GPASIFAIGPY ++TQLV+PP FSAFTTEPSTAP TPPPESVQLTTPSSPEVPFA+L
Sbjct: 139 SPGGPASIFAIGPYAHETQLVTPPAFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQL 198

Query: 185 LTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHP 244
           LTSSL    ++ GTNQKF LSH +FQ Y  YPGSPG  LISPGSVISNSGTSSPFPD++P
Sbjct: 199 LTSSLERARRNSGTNQKFALSHYEFQSYPLYPGSPGGQLISPGSVISNSGTSSPFPDRYP 258

Query: 245 ILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMGSRLG 304
           ILEFRM +APKLLG EHFTTRKW SR+GSG++TPDG GL SRLGSGT+TPDG+G GSRLG
Sbjct: 259 ILEFRMGEAPKLLGFEHFTTRKWGSRLGSGTVTPDGVGLGSRLGSGTVTPDGVGQGSRLG 318

Query: 305 SGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQND--VTN 364
           SG+VTP+G+   S LGSG+LTPD +G   +D   L+NQISEVASLANSE G + D  + +
Sbjct: 319 SGTVTPDGVGLRSMLGSGSLTPDAVGPASRDGFFLENQISEVASLANSENGSKTDENIVD 378

Query: 365 HRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSNQNENKESSREAETCEFFDIKTS 424
           HRVSFEL+GE+VARCL +KSL S R  SE P  +   +Q ++ +     E     +  + 
Sbjct: 379 HRVSFELSGEEVARCLESKSLASCRAFSECPPDSMAEDQIKSGKMLMTDENLPTGET-SG 438

Query: 425 AAPEKTPGE--DDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEA 484
             PEK  GE  ++ CY+  R++TLGS KEFNFD +K E+ +  SI +EWWANE +  KEA
Sbjct: 439 ETPEKPSGEMEEEHCYRKHRSITLGSIKEFNFDNSK-EVPDKPSINSEWWANETIAGKEA 498

Query: 485 SPGNNWTFFPLLQPGVS 498
            P NNWTFFPLLQP VS
Sbjct: 499 RPANNWTFFPLLQPEVS 510

BLAST of CSPI03G17110 vs. TrEMBL
Match: V4S4F7_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10004813mg PE=4 SV=1)

HSP 1 Score: 624.0 bits (1608), Expect = 1.6e-175
Identity = 332/505 (65.74%), Positives = 396/505 (78.42%), Query Frame = 1

Query: 1   MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAV 60
           M+S+++SV+TVNAAATAIVSAE+R++P    KRRWGSCWSLYWCFG  S K++KRI HAV
Sbjct: 1   MSSVHDSVETVNAAATAIVSAESRLRPAAIQKRRWGSCWSLYWCFG--SHKTSKRISHAV 60

Query: 61  LVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALS 120
           LVPEP V GA APA E +  ST +VLPFIAPPSSPASFLQS+P S TQSPAGLLSL +LS
Sbjct: 61  LVPEPMVTGAAAPAAETQAHSTAIVLPFIAPPSSPASFLQSDPPSATQSPAGLLSLNSLS 120

Query: 121 VNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVP 180
           VN YSP GPAS+FAIGPY ++TQLV+PPVFSAFTTEPSTA  TPPPESVQLTTPSSPEVP
Sbjct: 121 VNAYSPGGPASMFAIGPYAHETQLVTPPVFSAFTTEPSTALCTPPPESVQLTTPSSPEVP 180

Query: 181 FAKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFP 240
           FA+LLTSSL    ++ GTNQK +LSH  +QPYQ YPGSPG  LISPGSV+S SGTSSPFP
Sbjct: 181 FAQLLTSSLERARRNSGTNQKLSLSHYGYQPYQLYPGSPGGQLISPGSVVSYSGTSSPFP 240

Query: 241 DKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG 300
           D+HPIL+F  A APKLLG EHFTTRKW SR+GSGS+TPDG G+ SR+GSG+LTPDG+G+G
Sbjct: 241 DRHPILDFSAAAAPKLLGFEHFTTRKWGSRLGSGSVTPDGVGIGSRMGSGSLTPDGVGLG 300

Query: 301 SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQND- 360
           SRLGSG+VTP+G    SRLGSG+LTPDG+G   +D  + +NQISEVASLANS+ G ++D 
Sbjct: 301 SRLGSGTVTPDGAGLGSRLGSGSLTPDGMGPTSRDGFVRENQISEVASLANSDNGTKSDE 360

Query: 361 -VTNHRVSFELTGEDVARCLANKSLTSIRTESESPK----QTSTSNQNENKESSREAETC 420
            + +HRVSFEL+GE+VARCLANKS  S R   E P+    +       +  +S    E C
Sbjct: 361 HIIDHRVSFELSGEEVARCLANKSAASPRIVPEFPQDIVPEGEIRRDGKLTDSENHFELC 420

Query: 421 EFFDIKTSAAPEKT--PGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWAN 480
              +  ++  PEKT   GE++ CY+  R++TLGS KEFNFD T+GE+ N  SI +EWWAN
Sbjct: 421 P--EESSNRMPEKTMRDGEEEYCYRKHRSITLGSIKEFNFDNTEGEVSNKPSINSEWWAN 480

Query: 481 EKVGVKEASPGNNWTFFPLLQPGVS 498
           E VG KE+ P NNWTFFP+LQ   S
Sbjct: 481 ENVG-KESKPSNNWTFFPMLQSEAS 500

BLAST of CSPI03G17110 vs. TAIR10
Match: AT4G25620.1 (AT4G25620.1 hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 422.5 bits (1085), Expect = 3.5e-118
Identity = 278/507 (54.83%), Positives = 321/507 (63.31%), Query Frame = 1

Query: 1   MASINNS-VDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHA 60
           M S+NNS VDTVNAAA+AIVSAE+R QP++  K+R GS WSLYWCFG  S+K+NKRIGHA
Sbjct: 1   MRSVNNSSVDTVNAAASAIVSAESRTQPSSVQKKR-GSWWSLYWCFG--SKKNNKRIGHA 60

Query: 61  VLVPEPAVPGA-VAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEP--TSNTQSPAGLLSL 120
           VLVPEPA  GA VAP     + ST++ +PFIAPPSSPASFL S P   S+T  P  L SL
Sbjct: 61  VLVPEPAASGAAVAPVQNSSSNSTSIFMPFIAPPSSPASFLPSGPPSASHTPDPGLLCSL 120

Query: 121 TALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSS 180
           T         N P S F IGPY ++TQ V+PPVFSAFTTEPSTAP TPPPES     PSS
Sbjct: 121 TV--------NEPPSAFTIGPYAHETQPVTPPVFSAFTTEPSTAPFTPPPES-----PSS 180

Query: 181 PEVPFAKLLTSSLSHT--NKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSG 240
           PEVPFA+LLTSSL     N   G NQKF+ +H +F+  Q YPGSPG +LISPGS     G
Sbjct: 181 PEVPFAQLLTSSLERARRNSGGGMNQKFSAAHYEFKSCQVYPGSPGGNLISPGS-----G 240

Query: 241 TSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTP 300
           TSSP+P K  I+EFR+ + PK LG EHFT RKW SR GSGS+TP G G  SRLGSG LTP
Sbjct: 241 TSSPYPGKCSIIEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPAGQG--SRLGSGALTP 300

Query: 301 DGMGMGSRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSET 360
           DG    S+L SG VTPNG     R+  G LTP        +  LLD+QISEVASLANS+ 
Sbjct: 301 DG----SKLTSGVVTPNGAETVIRMSYGNLTP-------LEGSLLDSQISEVASLANSDH 360

Query: 361 GCQ--ND---VTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSNQNENKESSR 420
           G    ND   V  HRVSFELTGEDVARCLA+K               + S  +E      
Sbjct: 361 GSSRHNDEALVVPHRVSFELTGEDVARCLASK--------------LNRSGSHEKASGEH 420

Query: 421 EAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEW 480
               C     KTS   E          Q  R+ + GS KEF FD T  E+     I +EW
Sbjct: 421 LRPNC----CKTSGETESEQS------QKLRSFSTGSNKEFKFDSTNEEM--IEKIRSEW 447

Query: 481 WANEKV-GVKEASPGNNWTFFPLLQPG 496
           WANEKV G  + SP N+WTFFP+L+ G
Sbjct: 481 WANEKVAGKGDHSPRNSWTFFPVLRSG 447

BLAST of CSPI03G17110 vs. TAIR10
Match: AT5G52430.1 (AT5G52430.1 hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 421.4 bits (1082), Expect = 7.8e-118
Identity = 264/499 (52.91%), Positives = 320/499 (64.13%), Query Frame = 1

Query: 4   INNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVP 63
           +NNSV+TVNAAATAIV+AE+RVQP++  K RWG CWSLY CFG  +QK+NKRIG+AVLVP
Sbjct: 5   VNNSVETVNAAATAIVTAESRVQPSSSQKGRWGKCWSLYSCFG--TQKNNKRIGNAVLVP 64

Query: 64  EPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALSVNN 123
           EP   G     V++   STT+VLPFIAPPSSPASFLQS+P+S + SP G LSLT+   N 
Sbjct: 65  EPVTSGVPVVTVQNSATSTTVVLPFIAPPSSPASFLQSDPSSVSHSPVGPLSLTS---NT 124

Query: 124 YSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPES-VQLTTPSSPEVPFA 183
           +SP  P S+F +GPY  +TQ V+PPVFSAF TEPSTAP TPPPES V +TTPSSPEVPFA
Sbjct: 125 FSPKEPQSVFTVGPYANETQPVTPPVFSAFITEPSTAPYTPPPESSVHITTPSSPEVPFA 184

Query: 184 KLLTSSLSHTNK--SFGTNQKFTLSHCDFQPYQPYPGSPGA-HLISPGSVISNSGTSSPF 243
           +LLTSSL  T +  + G NQKF+ SH +F+  Q  PGSPG  +LISPGSVISNSGTSSP+
Sbjct: 185 QLLTSSLELTRRDSTSGMNQKFSSSHYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSPY 244

Query: 244 PDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGM 303
           P K P++EFR+ + PK LG EHFT RKW SR GSGS+TP                  +G 
Sbjct: 245 PGKSPMVEFRIGEPPKFLGFEHFTARKWGSRFGSGSITP------------------VGH 304

Query: 304 GSRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQND 363
           GS L SG++TPNG      + SG LTP+     LQ      NQISEVASLANS+ G +  
Sbjct: 305 GSGLASGALTPNG----PEIVSGNLTPNNTTWPLQ------NQISEVASLANSDHGSEVM 364

Query: 364 VTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSNQNENKESSREAETCEFFDI 423
           V +HRVSFELTGEDVARCLA+K               + S+   N     E E     DI
Sbjct: 365 VADHRVSFELTGEDVARCLASK--------------LNRSHDRMNNNDRIETEESSSTDI 424

Query: 424 KTSAAPEKTPGEDDQ-CYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVK 483
           + +        E++Q   Q   + ++GS KEF FD TK E              EKV   
Sbjct: 425 RRNIEKRSGDRENEQHRIQKLSSSSIGSSKEFKFDNTKDE------------NIEKVA-- 438

Query: 484 EASPGNNWTFFPLLQPGVS 498
               GN+W+FFP L+ GVS
Sbjct: 485 ----GNSWSFFPGLRSGVS 438

BLAST of CSPI03G17110 vs. TAIR10
Match: AT1G63720.1 (AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1))

HSP 1 Score: 204.1 bits (518), Expect = 2.0e-52
Identity = 137/266 (51.50%), Positives = 171/266 (64.29%), Query Frame = 1

Query: 2   ASINNSVDTVNAAATAIVSAEARVQPTTP--PKRRWGSCWSLYWCFGIGSQKSNKRIGHA 61
           A+ NN  DT+NAAA+AI S++ R+  ++P   KR+W + WSL  CFG  S +  KRIG++
Sbjct: 5   ANGNNVFDTINAAASAIASSDDRLHQSSPIHKKRKWWNRWSLLKCFG--SSRQRKRIGNS 64

Query: 62  VLVPEP-AVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTA 121
           VLVPEP ++  + +        S    LPFIAPPSSPASF QSEP S TQSP G+LS + 
Sbjct: 65  VLVPEPVSMSSSNSTTSNSGYRSVITTLPFIAPPSSPASFFQSEPPSATQSPVGILSFSP 124

Query: 122 LSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQL----TTP 181
           L  NN       SIFAIGPY ++TQLVSPPVFS +TTEPS+APITPP +   +    TTP
Sbjct: 125 LPCNNRP-----SIFAIGPYAHETQLVSPPVFSTYTTEPSSAPITPPLDDSSIYLTTTTP 184

Query: 182 SSPEVPFAKLLTSSLSHTNKSFGTNQKFTLSHC-DFQPYQPYPGSPGAHLISPGSVISNS 241
           SSPEVPFA+L  S  +H   S+G   KF +S   +FQ YQ  PGSP   LISP      S
Sbjct: 185 SSPEVPFAQLFNS--NHQTGSYG--YKFPMSSSYEFQFYQLPPGSPLGQLISPS---PGS 244

Query: 242 GTSSPFPDKHPIL--EFRMADAPKLL 258
           G +SPFPD    L   F+++D PKLL
Sbjct: 245 GPTSPFPDGETSLFPHFQVSDPPKLL 256

BLAST of CSPI03G17110 vs. TAIR10
Match: AT1G76660.1 (AT1G76660.1 FUNCTIONS IN: molecular_function unknown)

HSP 1 Score: 147.5 bits (371), Expect = 2.2e-35
Identity = 102/215 (47.44%), Positives = 123/215 (57.21%), Query Frame = 1

Query: 32  KRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRT------PSTTMV 91
           ++RWG C  ++ CF   SQK  KRI  A  +PE     A  P   H+        +  + 
Sbjct: 8   RKRWGGCLGVFSCFK--SQKGGKRIVPASRIPEGGNVSASQPNGAHQAGVLNNQAAGGIN 67

Query: 92  LPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALSVNNYSPNGPAS-IFAIGPYTYDTQL 151
           L  +APPSSPASF  S   S TQSP   LSL A      SP GP+S ++A GPY ++TQL
Sbjct: 68  LSLLAPPSSPASFTNSALPSTTQSPNCYLSLAA-----NSPGGPSSSMYATGPYAHETQL 127

Query: 152 VSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFTL 211
           VSPPVFS FTTEPSTAP TPPPE  +LT PSSP+VP+A+ LTSS+   N   G       
Sbjct: 128 VSPPVFSTFTTEPSTAPFTPPPELARLTAPSSPDVPYARFLTSSMDLKNSGKG------- 187

Query: 212 SHCDFQ-PYQPYPGSPGAHLISPGSVISNSGTSSP 239
            + D Q  Y  YPGSP + L SP S  S  G  SP
Sbjct: 188 HYNDLQATYSLYPGSPASALRSPISRASGDGLLSP 208

BLAST of CSPI03G17110 vs. NCBI nr
Match: gi|778679650|ref|XP_004140832.2| (PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101210841 [Cucumis sativus])

HSP 1 Score: 983.4 bits (2541), Expect = 1.5e-283
Identity = 496/497 (99.80%), Positives = 496/497 (99.80%), Query Frame = 1

Query: 1   MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAV 60
           MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAV
Sbjct: 1   MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAV 60

Query: 61  LVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALS 120
           LVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALS
Sbjct: 61  LVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALS 120

Query: 121 VNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVP 180
           VNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVP
Sbjct: 121 VNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVP 180

Query: 181 FAKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFP 240
           FAKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFP
Sbjct: 181 FAKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFP 240

Query: 241 DKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG 300
           DKHPILEFRMADAPKLLGLEHFTTRKWI RMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG
Sbjct: 241 DKHPILEFRMADAPKLLGLEHFTTRKWIXRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG 300

Query: 301 SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDV 360
           SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDV
Sbjct: 301 SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDV 360

Query: 361 TNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSNQNENKESSREAETCEFFDIK 420
           TNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSNQNENKESSREAETCEFFDIK
Sbjct: 361 TNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSNQNENKESSREAETCEFFDIK 420

Query: 421 TSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEA 480
           TSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEA
Sbjct: 421 TSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEA 480

Query: 481 SPGNNWTFFPLLQPGVS 498
           SPGNNWTFFPLLQPGVS
Sbjct: 481 SPGNNWTFFPLLQPGVS 497

BLAST of CSPI03G17110 vs. NCBI nr
Match: gi|659077554|ref|XP_008439268.1| (PREDICTED: uncharacterized protein LOC103484098 [Cucumis melo])

HSP 1 Score: 967.2 bits (2499), Expect = 1.1e-278
Identity = 488/497 (98.19%), Positives = 491/497 (98.79%), Query Frame = 1

Query: 1   MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAV 60
           M SINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQK+NKRIGHAV
Sbjct: 1   MGSINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAV 60

Query: 61  LVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALS 120
           LVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQS PTSNTQSPAGLLSLTALS
Sbjct: 61  LVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSGPTSNTQSPAGLLSLTALS 120

Query: 121 VNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVP 180
           VNNYSPNGPASIFAIGPY YDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVP
Sbjct: 121 VNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVP 180

Query: 181 FAKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFP 240
           FAKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFP
Sbjct: 181 FAKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFP 240

Query: 241 DKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG 300
           DKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGL SRLGSGTLTPDGMGMG
Sbjct: 241 DKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG 300

Query: 301 SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDV 360
           SRLGSGSVTPNG+RQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDV
Sbjct: 301 SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDV 360

Query: 361 TNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSNQNENKESSREAETCEFFDIK 420
           TNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSNQNENKE SREAETCEFFDIK
Sbjct: 361 TNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSNQNENKELSREAETCEFFDIK 420

Query: 421 TSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEA 480
           TS APEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGE+HNTASIGAEWWANEKVGVKEA
Sbjct: 421 TSMAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEVHNTASIGAEWWANEKVGVKEA 480

Query: 481 SPGNNWTFFPLLQPGVS 498
           SPGNNWTFFPLLQPGVS
Sbjct: 481 SPGNNWTFFPLLQPGVS 497

BLAST of CSPI03G17110 vs. NCBI nr
Match: gi|1009109183|ref|XP_015888763.1| (PREDICTED: uncharacterized protein LOC107423668 [Ziziphus jujuba])

HSP 1 Score: 686.4 bits (1770), Expect = 3.7e-194
Identity = 362/504 (71.83%), Positives = 406/504 (80.56%), Query Frame = 1

Query: 1   MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAV 60
           M S+NNSV+T+NAAA+AIVSAE R QPT  PKRRWGSCWSLYWCFG  S K+ KRI HAV
Sbjct: 1   MRSVNNSVETINAAASAIVSAETRAQPTAVPKRRWGSCWSLYWCFG--SHKNTKRISHAV 60

Query: 61  LVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALS 120
           LVPE  VPGA  PA E++ PST +VLPFIAPPSSPASFLQS+P S TQSPAGLLSLT+LS
Sbjct: 61  LVPEQVVPGAAVPAAENQIPSTAVVLPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLS 120

Query: 121 VNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVP 180
           VN YSP GPASIFAIGPY Y+TQLVSPPVFS FTTEPSTAP TPPPESVQLTTPSSPEVP
Sbjct: 121 VNAYSPGGPASIFAIGPYAYETQLVSPPVFSTFTTEPSTAPFTPPPESVQLTTPSSPEVP 180

Query: 181 FAKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFP 240
           FA+LLTSSL  T ++ GTNQKF LSHC+FQPYQPYPGSPG  LISPGSVISNSGTSSPFP
Sbjct: 181 FAQLLTSSLDRTRRNNGTNQKFALSHCEFQPYQPYPGSPGGQLISPGSVISNSGTSSPFP 240

Query: 241 DKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG 300
           D+HPILEFRM +AP+LLG EHFTTRKW SR+GSGS+TPDG GL SRLGSG LTPDG G+G
Sbjct: 241 DRHPILEFRMGEAPRLLGFEHFTTRKWGSRLGSGSITPDGLGLGSRLGSGCLTPDGNGLG 300

Query: 301 SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQND- 360
           SR+GSGS+TPNG    SRLGSG LTPDG+G    DS  ++NQISEVASLANSE+GCQ D 
Sbjct: 301 SRIGSGSLTPNGAGLASRLGSGCLTPDGVGPASGDSFPMENQISEVASLANSESGCQLDG 360

Query: 361 -VTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSNQNENKESSREAETCEFFD 420
            V NHRVSFELTGEDVARCLANKS+ S+RT S+ P + + S     K+      T  F +
Sbjct: 361 NVINHRVSFELTGEDVARCLANKSMASVRTASD-PLKDTPSECGVKKDRMISTGTDHFSE 420

Query: 421 I---KTSA-APEKTPGE-DDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANE 480
               +TS   PE   GE +DQCY+  R++TLGS KEFNFD TK E  +  + G+EWWANE
Sbjct: 421 SCVEETSVELPENDHGEWEDQCYRKHRSITLGSIKEFNFDSTKSEFSDKPTNGSEWWANE 480

Query: 481 KVGVKEASPGNNWTFFPLLQPGVS 498
           KV  KE+ PGN WTFFP+LQPGVS
Sbjct: 481 KVAGKESKPGNGWTFFPILQPGVS 501
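
The percentages in the HSP header follow directly from the match counts and the alignment length. The minimal sketch below reproduces them for the Ziziphus jujuba hit; the helper name `percent` is hypothetical and is not part of any BLAST tool, and rounding to two decimal places is an assumption based on how the values are displayed here.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of the alignment length, rounded to 2 places."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * matches / alignment_length, 2)


# Counts copied from the HSP header of the Ziziphus jujuba match above.
identity = percent(362, 504)   # identical residues / aligned columns
positives = percent(406, 504)  # identical or conservatively substituted residues

print(f"Identity = {identity:.2f}%, Positives = {positives:.2f}%")
```

The same calculation applies to the other hits on this page, using their own counts and alignment lengths.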

BLAST of CSPI03G17110 vs. NCBI nr
Match: gi|703122806|ref|XP_010102658.1| (hypothetical protein L484_004326 [Morus notabilis])

HSP 1 Score: 659.8 bits (1701), Expect = 3.7e-186
Identity = 355/523 (67.88%), Positives = 407/523 (77.82%), Query Frame = 1

Query: 1   MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAV 60
           M ++NNSV+T+NAAATAIVSAEAR QP   PKRRWGSCWSLYWCFG  S K++KRIGHAV
Sbjct: 1   MRTVNNSVETINAAATAIVSAEARAQPAAVPKRRWGSCWSLYWCFG--SHKNSKRIGHAV 60

Query: 61  LVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALS 120
           LVPEP +PGA APA E++ PST +VLPFIAPPSSPASFLQS+P S TQSPAGLLSLT+LS
Sbjct: 61  LVPEPVLPGAAAPAPENQAPSTAIVLPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLS 120

Query: 121 VNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVP 180
           +N YSP GP SIFAIGPY Y+TQLVSPPVFS FTTEPSTAP TPPPESVQLTTPSSPEVP
Sbjct: 121 INAYSPGGPTSIFAIGPYAYETQLVSPPVFSTFTTEPSTAPFTPPPESVQLTTPSSPEVP 180

Query: 181 FAKLLTSSLSHTNK-SFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPF 240
           FA+LLTSSL  T + S G NQKF+LSHC+FQPYQ YPGSPG +LISPGSV+SNSGTSSPF
Sbjct: 181 FAQLLTSSLDRTRRNSSGANQKFSLSHCEFQPYQLYPGSPGGNLISPGSVVSNSGTSSPF 240

Query: 241 PDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGM 300
           PDKHPIL FRM +AP+LLG EHFTT KW SR+GSGSLTPDG GL SRLGSG++TPDG+G+
Sbjct: 241 PDKHPILGFRMGEAPRLLGFEHFTTWKWGSRLGSGSLTPDGVGLGSRLGSGSVTPDGVGL 300

Query: 301 GSRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQ----------------DSPLLDNQI 360
           GSRLGSGS+TP+G    SRLGSG +TP+G G G +                DS LL+NQI
Sbjct: 301 GSRLGSGSLTPDGYGLGSRLGSGCMTPNGPGLGSRLGSGTLTPDGFLVVSGDSFLLENQI 360

Query: 361 SEVASLANSETGCQND--VTNHRVSFELTGEDVARCLANKSLTSI-RTESE----SPKQT 420
           SEVASLANS+ GCQND  V +HRVSFELTGEDVARCLA+KS +S  RT SE    SP + 
Sbjct: 361 SEVASLANSDNGCQNDGSVVDHRVSFELTGEDVARCLASKSASSNGRTTSESLEDSPAEC 420

Query: 421 STSNQ--NENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQ 480
            T     + N   S   ++C       +   +   GEDD  YQ  R++TLGS KEFNFD 
Sbjct: 421 PTKKDGISANNVDSPNDQSCVEETSNKTPQSDCREGEDDHFYQKHRSITLGSIKEFNFDN 480

Query: 481 TKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS 498
           TK ++    +IG+EWWANEKV  KEA  GN+W+FFP+LQPGVS
Sbjct: 481 TKADVSVKPTIGSEWWANEKVAGKEAKAGNSWSFFPILQPGVS 521

BLAST of CSPI03G17110 vs. NCBI nr
Match: gi|596021788|ref|XP_007219041.1| (hypothetical protein PRUPE_ppa004616mg [Prunus persica])

HSP 1 Score: 657.5 bits (1695), Expect = 1.8e-185
Identity = 347/501 (69.26%), Positives = 397/501 (79.24%), Query Frame = 1

Query: 1   MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAV 60
           M S+N+SVDT+NAAATAIVSAEAR QPTT PKRRWGSCWSLYWCFG      NKRIGHAV
Sbjct: 1   MRSVNSSVDTINAAATAIVSAEARPQPTTVPKRRWGSCWSLYWCFG---PHKNKRIGHAV 60

Query: 61  LVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALS 120
           LVPEP VPGA   A++++T ST +V+PFIAPPSSPASFL S+P S TQSPAG LSL +LS
Sbjct: 61  LVPEPVVPGAAVSAIDNQTTSTAIVVPFIAPPSSPASFLPSDPPSATQSPAGFLSLKSLS 120

Query: 121 VNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVP 180
            N YSP GPASIF+IGPY Y+TQLVSPPVFS F TEPSTAP TPPPESVQLTTPSSPEVP
Sbjct: 121 ANAYSPGGPASIFSIGPYAYETQLVSPPVFSTFNTEPSTAPFTPPPESVQLTTPSSPEVP 180

Query: 181 FAKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFP 240
           FA+LLTSSL    ++ GTNQKF LSH +FQPYQ YPGSPG +LISPGS +SNSGTSSPFP
Sbjct: 181 FAQLLTSSLDRNRRNSGTNQKFALSHYEFQPYQQYPGSPGGNLISPGSAVSNSGTSSPFP 240

Query: 241 DKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG 300
           D+HP+LEFRM +APKL G +HFTTRKW SR+GSGSLTPDG GL SRLGSG+LTPDG  +G
Sbjct: 241 DRHPVLEFRMGEAPKLFGFDHFTTRKWGSRIGSGSLTPDGVGLGSRLGSGSLTPDGNELG 300

Query: 301 SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQ--N 360
           SRLGSG VTPNG    SRLGSG LTPDG G   +DS LL+NQISEVASLANSE+GCQ   
Sbjct: 301 SRLGSGCVTPNGAGIGSRLGSGCLTPDGPGPASRDSFLLENQISEVASLANSESGCQTVE 360

Query: 361 DVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSNQNENKE-SSREAETCEF- 420
            V +HRVSFELTGEDVA CLANK++ S RT S S K  ++   +E    SS  +  CEF 
Sbjct: 361 TVFDHRVSFELTGEDVACCLANKAVASNRTASGSSKVIASEYPSERDALSSDSSNHCEFS 420

Query: 421 FDIKTSAAPEKTPGE-DDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKV 480
            +  +S  PE   GE +DQ Y+  R++TLGS K+FNFD TK E+ N  +IG+EWWAN+ V
Sbjct: 421 VEESSSRIPENVSGEGEDQGYRKHRSITLGSTKDFNFDNTKAEVPNKPNIGSEWWANKNV 480

Query: 481 GVKEASPGNNWTFFPLLQPGV 497
             KE+ P N+WTFFP+LQPGV
Sbjct: 481 AAKESKPCNDWTFFPILQPGV 498

The following BLAST results are available for this feature:
Match Name    E-value    Identity    Description
Y1666_ARATH   3.8e-34    47.44       Uncharacterized protein At1g76660 OS=Arabidopsis thaliana GN=At1g76660 PE=2 SV=1
Match Name         E-value     Identity    Description
W9S7Z6_9ROSA       2.6e-186    67.88       Uncharacterized protein OS=Morus notabilis GN=L484_004326 PE=4 SV=1
M5XMF7_PRUPE       1.3e-185    69.26       Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004616mg PE=4 SV=1
A0A067JHK1_JATCU   1.1e-181    67.66       Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26079 PE=4 SV=1
B9RIV6_RICCO       1.8e-176    67.20       Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1583050 PE=4 SV=1
V4S4F7_9ROSI       1.6e-175    65.74       Uncharacterized protein OS=Citrus clementina GN=CICLE_v10004813mg PE=4 SV=1
Match Name     E-value     Identity    Description
AT4G25620.1    3.5e-118    54.83       hydroxyproline-rich glycoprotein family protein
AT5G52430.1    7.8e-118    52.91       hydroxyproline-rich glycoprotein family protein
AT1G63720.1    2.0e-52     51.50       BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glyc...
AT1G76660.1    2.2e-35     47.44       FUNCTIONS IN: molecular_function unknown
Match Name                           E-value     Identity    Description
gi|778679650|ref|XP_004140832.2|     1.5e-283    99.80       PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101210841 [Cucumis sa...
gi|659077554|ref|XP_008439268.1|     1.1e-278    98.19       PREDICTED: uncharacterized protein LOC103484098 [Cucumis melo]
gi|1009109183|ref|XP_015888763.1|    3.7e-194    71.83       PREDICTED: uncharacterized protein LOC107423668 [Ziziphus jujuba]
gi|703122806|ref|XP_010102658.1|     3.7e-186    67.88       hypothetical protein L484_004326 [Morus notabilis]
gi|596021788|ref|XP_007219041.1|     1.8e-185    69.26       hypothetical protein PRUPE_ppa004616mg [Prunus persica]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term    Definition
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0008150        biological_process
cellular_component    GO:0005575        cellular_component
molecular_function    GO:0003674        molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CSPI03G17110.1    CSPI03G17110.1    mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term    IPR Description     Source     Source Term      Source Description    Alignment
None        No IPR available    PANTHER    PTHR31798        FAMILY NOT NAMED      coord: 1..497  score: 3.2E
None        No IPR available    PANTHER    PTHR31798:SF4    SUBFAMILY NOT NAMED   coord: 1..497  score: 3.2E

The following gene(s) are paralogous to this gene:

None