CmoCh18G012870 (gene) Cucurbita moschata (Rifu)

Name: CmoCh18G012870
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: Hydroxyproline-rich glycoprotein
Location: Cmo_Chr18 : 12434906 .. 12438173 (-)
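
The location field can be read programmatically. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive (as in GFF3-style annotation), which this record does not state, and the helper names `parse_location` and `span_length` are hypothetical.

```python
import re

def parse_location(text):
    """Split a location such as 'Cmo_Chr18 : 12434906 .. 12438173 (-)'
    into (sequence id, start, end, strand)."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cmo_Chr18 : 12434906 .. 12438173 (-)")
print(seqid, strand, span_length(start, end))  # Cmo_Chr18 - 3268
```

The "(-)" strand marker suggests the gene lies on the reverse strand of Cmo_Chr18; whether the sequences listed on this page are already reverse-complemented is not stated here.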
The following sequences are available for this feature:

Gene sequence (with intron)

CCGAGTTTTTCATTTCTTAGATCAAATCCGAAAGTCGTTTATGTTTCTTTTTTTTGGCTGTCAACTCTGATCTCACTGGCGAGTAATTGATTTTGCCGGGGAGTTAGCGGCGGTAGGTTGGTCAGAAAACACGGCGAACTTGTTGAAGAGTCCGGCGGTAGAGCTATCTGGATTGGCAGAAGGTGGAGATGGGAAGTATGAACAACAGCGTGGATACGGTTAATGCTGCTGCTACTGCGATCATCTCCGCTGAGGCTCGTGTTCAGCCTCCGACGCCTCCGGTAGGTGGTTTTTTACTTTATTATGTTGTCTGATTTTACTTTGAAACTGAGAATAGTGCGATAATTTGATTGAGGATTAAGTCTTTGATCTCGGTGTTGGTTTGCTATTTTGTGTTCGGATGGGGAAAATCTCATAAAAGTTGATGACCTGATTGATTTTCTCTGATTTGAGAAAACCGTGTTGGTTACGCTTTTCTTTATTTCTTGATTCGAAGATTTTGATTTGGTGTTCTCTGCTTCGGTTGCTGGTTTCTATTACTGTGTTGCTTTTTTTTTTTTTTCTTTGGCTGAAGATGTTGGAGGTTAAATTCCCGAAAATTTGTTGCTTAATTAGTCTGACTGCTGAAAGTTGAGGATAAAGCAACTTGAAAAGTTACTTTTAATTTGTTTCGAAACAGAAAGGAATGAGAACATGAGGTTGTTCGGTAGTTAATTACAAGCCCGAAATGCTAGTTTCTGTGCAATAAATTCCTTCGGAACATTGATTCATTCGGTGTAAGATTGTCAATATCGAGGAACTTCTATCAATTTAATTAGCGATCTTATGTACTTCACTAGATTTCTTGAGTTAGTTGTTTTTTTATAGTGGATCTTGCAGCTGTATACAGTGTATCAATTAATTCGGCTCTTCTAGGCTTCAGTTTTTTCTAGTAAGGTTGAGTTTGGCCTAGGCTGAGATTTCTTTTAATGTGGACAGACAAAAATTTGCATCTGTTTAAAGGAAGATTCTAAGCTCCTGAAGGTGCGGTTATTGAGCTATCATACCGCTTCTTATCTTGGAGCAATTAACCGTCAAAATCTTTAGTTTTAAGCTTAAGAATGAAGAACACTTTTCATAGACCTCATATCATCTACCCGAATGAATGGCTGATAACTGAGACCGTCTCTTTATGCAGAGCTTTTCTGGAACCAGTTTGGTTTGTTTGGTTACCAACACAACGTGAATTATTTATCAATTTGCCCCAGAAACATAAATGTTCGGTTAATTTGATTTTTGTGGCCATTGGATGGAGTGGTTTTTTTCGCTCTCTTCCTTTCAGTCTTATGCTATGTTGCCTGACACGAGGAGGTGAGAAAGTAGTATACTGTTTTGTTAACACTCAACCGTTGATGAATATGAGCTTATACTAGTAAGATCCATAACAGCCGAAGATCTTAGATTTCTCAACAATTGTTCTTAGTACTGCTCTGTTGGTCTTTTGATTTATGTTCTTTTGTTATCTGAGTACAGAAACGAAGATGGGGTAGCTGCTGGAGTCTGTACTGGTGTTTTGGAATCGGTTCACAGAAAAACAGTAAGCGTATTAGTAATGCTGTACTTGTTCCGGAACCTGTGGTACCGGGAGCTGTTGCTGCTGCTGTTGAACACCGACCACCTTCAACCACAATGGTATTACCTTTCATTGCCCCTCCATCTTCTCCTGCATCTTTCCTCCAGTCCGAACCTCCATCACATAGTCAATCTCCGGCTGGATTACTCTCTTTAACTGCCCTTTCAGTCAATAACTACTCCCCAAATGGACCTGCATCCATTTTTGCAATAGGCCCTTATACATATGAGACCCAGTTGGTCTCACCTCCAGTTTTTTCTGCCTTCACCACTGAACCATCAACCGCTCCTTTTACTCCTCCTCCTGAGTCTGTGCAATTGACCACACCCTCCTCTCCTGAAGTGCCATTTGCTAAATTGCTGACATCTTCTCTAAGCCATACTAATAG
AAGTTTTGGGACTAACCACAAGTTTGCACTATCTCATTGTGATTTCCAGCCTTATCAACCCTATCCAGGTAGCCCCGGTGCCTATCTTATATCACCTGGATCAGTAATTTCAAACTCTGGTACATCTTCTCCTTTCCCTGATAAGCACCCCATTCTTGAGTTTCGCATGGCTGATGCTCCCAAGCTCTTGGGTCTTGAACATTTTACAACTCGCAAATGGATCTCGAGAATGGGTTCTGGATCTTTAACGCCGGATGGTACTGGTTTAGGTTCTAGGTTAGGTTCGGGAACTTTGACCCCTGATGGTATGGGTATGGGTTCGAGATTGGGATCTGGATCTGTGACCCCAAATGGCATGAGACAAGATTCAAGATTGGGCTCTGGAACCTTGACGCCTGACGGTTTGGGGCATGCCTTGCAAGATGGTCTATTGTTGGACAACCAAATATCTGAGGTGGCTTCCCTTGCCAACTCAGGAAGTGGATGTCCAAATGATGTGACAAATCATAGGGTGTCATTTGAGTTAACAGGCGAAGATGTTGCACGTTGTCTTGCAATTAAGTCAATGACATCCATTAGAACCGAATCAGAGTCGTCTCCAAAGTGTCATCAAAATGAAAACAAAGAATCATCATCAAGAGAAGCTGAAACTTATGAGTTCTTTGACATCAAGACGACTTCTTCCACAGCACCCGAACAAGCTGCAGGAGAGGACAATCGATGCTACCAAAATCAACGAGCTGTAACTCTCGGTTCGTTCAAAGAGTTCAACTTCGACCGAACAAAAGGAGAAATGCAGAATACAGCCTCGGTCGGTGCAGAGTGGTGGGCGAACGAAAAGGTGGGTGTGAAGGAAGCCAGTAGTCCAGGCAACAACTGGACTTTCTTCCCAATCTTGCAATCTGGGGTCAGCTGACTTAAGATCTTTTGAATGTACATTTGAATGTAATCTTCCTTTGGAAATTGAAGTTTTAGGAAAAGAATGTTTTCCAATAGACAGCAGAAAGAAGAGGTAGTTATTAGCGATAACTGATATGGGTACTGAAGGAGCATTCTTTATCATAGTGCAACAAGTAGATGATTCATAGAAGCCTTTTATTCTTTGTTGTCTTGTAAAATAATATGAAATTTCATTCTTCCCCAACAATGAAACTTCCCTTCTTCAAAACTCTCTGGTGATTCTTAGCTTATTCCTCTGCAACTCATTGGATATTTTGGTTCATTATGTATCTTAATTTTAGAGTTCCAACACAATTTAATGCAAGTGA

mRNA sequence

CCGAGTTTTTCATTTCTTAGATCAAATCCGAAAGTCGTTTATGTTTCTTTTTTTTGGCTGTCAACTCTGATCTCACTGGCGAGTAATTGATTTTGCCGGGGAGTTAGCGGCGGTAGGTTGGTCAGAAAACACGGCGAACTTGTTGAAGAGTCCGGCGGTAGAGCTATCTGGATTGGCAGAAGGTGGAGATGGGAAGTATGAACAACAGCGTGGATACGGTTAATGCTGCTGCTACTGCGATCATCTCCGCTGAGGCTCGTGTTCAGCCTCCGACGCCTCCGAAACGAAGATGGGGTAGCTGCTGGAGTCTGTACTGGTGTTTTGGAATCGGTTCACAGAAAAACAGTAAGCGTATTAGTAATGCTGTACTTGTTCCGGAACCTGTGGTACCGGGAGCTGTTGCTGCTGCTGTTGAACACCGACCACCTTCAACCACAATGGTATTACCTTTCATTGCCCCTCCATCTTCTCCTGCATCTTTCCTCCAGTCCGAACCTCCATCACATAGTCAATCTCCGGCTGGATTACTCTCTTTAACTGCCCTTTCAGTCAATAACTACTCCCCAAATGGACCTGCATCCATTTTTGCAATAGGCCCTTATACATATGAGACCCAGTTGGTCTCACCTCCAGTTTTTTCTGCCTTCACCACTGAACCATCAACCGCTCCTTTTACTCCTCCTCCTGAGTCTGTGCAATTGACCACACCCTCCTCTCCTGAAGTGCCATTTGCTAAATTGCTGACATCTTCTCTAAGCCATACTAATAGAAGTTTTGGGACTAACCACAAGTTTGCACTATCTCATTGTGATTTCCAGCCTTATCAACCCTATCCAGGTAGCCCCGGTGCCTATCTTATATCACCTGGATCAGTAATTTCAAACTCTGGTACATCTTCTCCTTTCCCTGATAAGCACCCCATTCTTGAGTTTCGCATGGCTGATGCTCCCAAGCTCTTGGGTCTTGAACATTTTACAACTCGCAAATGGATCTCGAGAATGGGTTCTGGATCTTTAACGCCGGATGGTACTGGTTTAGGTTCTAGGTTAGGTTCGGGAACTTTGACCCCTGATGGTATGGGTATGGGTTCGAGATTGGGATCTGGATCTGTGACCCCAAATGGCATGAGACAAGATTCAAGATTGGGCTCTGGAACCTTGACGCCTGACGGTTTGGGGCATGCCTTGCAAGATGGTCTATTGTTGGACAACCAAATATCTGAGGTGGCTTCCCTTGCCAACTCAGGAAGTGGATGTCCAAATGATGTGACAAATCATAGGGTGTCATTTGAGTTAACAGGCGAAGATGTTGCACGTTGTCTTGCAATTAAGTCAATGACATCCATTAGAACCGAATCAGAGTCGTCTCCAAAGTGTCATCAAAATGAAAACAAAGAATCATCATCAAGAGAAGCTGAAACTTATGAGTTCTTTGACATCAAGACGACTTCTTCCACAGCACCCGAACAAGCTGCAGGAGAGGACAATCGATGCTACCAAAATCAACGAGCTGTAACTCTCGGTTCGTTCAAAGAGTTCAACTTCGACCGAACAAAAGGAGAAATGCAGAATACAGCCTCGGTCGGTGCAGAGTGGTGGGCGAACGAAAAGGTGGGTGTGAAGGAAGCCAGTAGTCCAGGCAACAACTGGACTTTCTTCCCAATCTTGCAATCTGGGGTCAGCTGACTTAAGATCTTTTGAATGTACATTTGAATGTAATCTTCCTTTGGAAATTGAAGTTTTAGGAAAAGAATGTTTTCCAATAGACAGCAGAAAGAAGAGGTAGTTATTAGCGATAACTGATATGGGTACTGAAGGAGCATTCTTTATCATAGTGCAACAAGTAGATGATTCATAGAAGCCTTTTATTCTTTGTTGTCTTGTAAAATAATATGAAATTTCATTCTTCCCCAACAATGAAACTTCCCTTCTTCAAAACTCTCTGGTGATTCTTAGCTTATTCCTCTGCAACTCATTGGATATTTTGGTTCATTATGTATC
TTAATTTTAGAGTTCCAACACAATTTAATGCAAGTGA

Coding sequence (CDS)

ATGGGAAGTATGAACAACAGCGTGGATACGGTTAATGCTGCTGCTACTGCGATCATCTCCGCTGAGGCTCGTGTTCAGCCTCCGACGCCTCCGAAACGAAGATGGGGTAGCTGCTGGAGTCTGTACTGGTGTTTTGGAATCGGTTCACAGAAAAACAGTAAGCGTATTAGTAATGCTGTACTTGTTCCGGAACCTGTGGTACCGGGAGCTGTTGCTGCTGCTGTTGAACACCGACCACCTTCAACCACAATGGTATTACCTTTCATTGCCCCTCCATCTTCTCCTGCATCTTTCCTCCAGTCCGAACCTCCATCACATAGTCAATCTCCGGCTGGATTACTCTCTTTAACTGCCCTTTCAGTCAATAACTACTCCCCAAATGGACCTGCATCCATTTTTGCAATAGGCCCTTATACATATGAGACCCAGTTGGTCTCACCTCCAGTTTTTTCTGCCTTCACCACTGAACCATCAACCGCTCCTTTTACTCCTCCTCCTGAGTCTGTGCAATTGACCACACCCTCCTCTCCTGAAGTGCCATTTGCTAAATTGCTGACATCTTCTCTAAGCCATACTAATAGAAGTTTTGGGACTAACCACAAGTTTGCACTATCTCATTGTGATTTCCAGCCTTATCAACCCTATCCAGGTAGCCCCGGTGCCTATCTTATATCACCTGGATCAGTAATTTCAAACTCTGGTACATCTTCTCCTTTCCCTGATAAGCACCCCATTCTTGAGTTTCGCATGGCTGATGCTCCCAAGCTCTTGGGTCTTGAACATTTTACAACTCGCAAATGGATCTCGAGAATGGGTTCTGGATCTTTAACGCCGGATGGTACTGGTTTAGGTTCTAGGTTAGGTTCGGGAACTTTGACCCCTGATGGTATGGGTATGGGTTCGAGATTGGGATCTGGATCTGTGACCCCAAATGGCATGAGACAAGATTCAAGATTGGGCTCTGGAACCTTGACGCCTGACGGTTTGGGGCATGCCTTGCAAGATGGTCTATTGTTGGACAACCAAATATCTGAGGTGGCTTCCCTTGCCAACTCAGGAAGTGGATGTCCAAATGATGTGACAAATCATAGGGTGTCATTTGAGTTAACAGGCGAAGATGTTGCACGTTGTCTTGCAATTAAGTCAATGACATCCATTAGAACCGAATCAGAGTCGTCTCCAAAGTGTCATCAAAATGAAAACAAAGAATCATCATCAAGAGAAGCTGAAACTTATGAGTTCTTTGACATCAAGACGACTTCTTCCACAGCACCCGAACAAGCTGCAGGAGAGGACAATCGATGCTACCAAAATCAACGAGCTGTAACTCTCGGTTCGTTCAAAGAGTTCAACTTCGACCGAACAAAAGGAGAAATGCAGAATACAGCCTCGGTCGGTGCAGAGTGGTGGGCGAACGAAAAGGTGGGTGTGAAGGAAGCCAGTAGTCCAGGCAACAACTGGACTTTCTTCCCAATCTTGCAATCTGGGGTCAGCTGA
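
The CDS above corresponds to the protein used as the query in the BLAST reports. As a rough check, the sketch below translates the first ten codons of the CDS with the standard genetic code. It assumes the standard nuclear code applies, and the `translate` helper is hypothetical, not part of any tool named on this page.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.
    Unknown codons (for example those containing N) become 'X'."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nucleotides of the CDS listed above.
print(translate("ATGGGAAGTATGAACAACAGCGTGGATACG"))  # MGSMNNSVDT
```

The result agrees with the first ten residues of the query line in the alignments that start at position 1 (MGSMNNSVDT).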
BLAST of CmoCh18G012870 vs. Swiss-Prot
Match: Y1666_ARATH (Uncharacterized protein At1g76660 OS=Arabidopsis thaliana GN=At1g76660 PE=2 SV=1)

HSP 1 Score: 151.8 bits (382), Expect = 2.0e-35
Identity = 105/215 (48.84%), Positives = 126/215 (58.60%), Query Frame = 1

Query: 32  KRRWGSCWSLYWCFGIGSQKNSKRISNAVLVPE------PVVPGAVAAAVEHRPPSTTMV 91
           ++RWG C  ++ CF   SQK  KRI  A  +PE          GA  A V +   +  + 
Sbjct: 8   RKRWGGCLGVFSCFK--SQKGGKRIVPASRIPEGGNVSASQPNGAHQAGVLNNQAAGGIN 67

Query: 92  LPFIAPPSSPASFLQSEPPSHSQSPAGLLSLTALSVNNYSPNGPAS-IFAIGPYTYETQL 151
           L  +APPSSPASF  S  PS +QSP   LSL A      SP GP+S ++A GPY +ETQL
Sbjct: 68  LSLLAPPSSPASFTNSALPSTTQSPNCYLSLAA-----NSPGGPSSSMYATGPYAHETQL 127

Query: 152 VSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNRSFGTNHKFAL 211
           VSPPVFS FTTEPSTAPFTPPPE  +LT PSSP+VP+A+ LTSS+   N   G       
Sbjct: 128 VSPPVFSTFTTEPSTAPFTPPPELARLTAPSSPDVPYARFLTSSMDLKNSGKG------- 187

Query: 212 SHCDFQ-PYQPYPGSPGAYLISPGSVISNSGTSSP 239
            + D Q  Y  YPGSP + L SP S  S  G  SP
Sbjct: 188 HYNDLQATYSLYPGSPASALRSPISRASGDGLLSP 208
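
The percentages in each HSP summary follow from the fractions printed next to them: matching columns divided by the alignment length. A minimal sketch, using the figures from the hit above (the function name `fraction_to_percent` is hypothetical):

```python
def fraction_to_percent(fraction):
    """Convert a 'matches/columns' string such as '105/215' to a percentage."""
    matches, columns = (int(part) for part in fraction.split("/"))
    if columns <= 0:
        raise ValueError("alignment length must be positive")
    return 100.0 * matches / columns

# Identity and positives fractions reported for the Y1666_ARATH hit.
print(f"{fraction_to_percent('105/215'):.2f}%")  # 48.84%
print(f"{fraction_to_percent('126/215'):.2f}%")  # 58.60%
```

The denominator is the number of alignment columns, gaps included, so these values describe the aligned region only and not the full length of either protein.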

BLAST of CmoCh18G012870 vs. TrEMBL
Match: W9S7Z6_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_004326 PE=4 SV=1)

HSP 1 Score: 650.6 bits (1677), Expect = 1.6e-183
Identity = 357/524 (68.13%), Positives = 408/524 (77.86%), Query Frame = 1

Query: 1   MGSMNNSVDTVNAAATAIISAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISNAV 60
           M ++NNSV+T+NAAATAI+SAEAR QP   PKRRWGSCWSLYWCFG  S KNSKRI +AV
Sbjct: 1   MRTVNNSVETINAAATAIVSAEARAQPAAVPKRRWGSCWSLYWCFG--SHKNSKRIGHAV 60

Query: 61  LVPEPVVPGAVAAAVEHRPPSTTMVLPFIAPPSSPASFLQSEPPSHSQSPAGLLSLTALS 120
           LVPEPV+PGA A A E++ PST +VLPFIAPPSSPASFLQS+PPS +QSPAGLLSLT+LS
Sbjct: 61  LVPEPVLPGAAAPAPENQAPSTAIVLPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLS 120

Query: 121 VNNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVP 180
           +N YSP GP SIFAIGPY YETQLVSPPVFS FTTEPSTAPFTPPPESVQLTTPSSPEVP
Sbjct: 121 INAYSPGGPTSIFAIGPYAYETQLVSPPVFSTFTTEPSTAPFTPPPESVQLTTPSSPEVP 180

Query: 181 FAKLLTSSLSHTNR-SFGTNHKFALSHCDFQPYQPYPGSPGAYLISPGSVISNSGTSSPF 240
           FA+LLTSSL  T R S G N KF+LSHC+FQPYQ YPGSPG  LISPGSV+SNSGTSSPF
Sbjct: 181 FAQLLTSSLDRTRRNSSGANQKFSLSHCEFQPYQLYPGSPGGNLISPGSVVSNSGTSSPF 240

Query: 241 PDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGM 300
           PDKHPIL FRM +AP+LLG EHFTT KW SR+GSGSLTPDG GLGSRLGSG++TPDG+G+
Sbjct: 241 PDKHPILGFRMGEAPRLLGFEHFTTWKWGSRLGSGSLTPDGVGLGSRLGSGSVTPDGVGL 300

Query: 301 GSRLGSGS----------------VTPNGMRQDSRLGSGTLTPDGLGHALQDGLLLDNQI 360
           GSRLGSGS                +TPNG    SRLGSGTLTPDG      D  LL+NQI
Sbjct: 301 GSRLGSGSLTPDGYGLGSRLGSGCMTPNGPGLGSRLGSGTLTPDGFLVVSGDSFLLENQI 360

Query: 361 SEVASLANSGSGCPND--VTNHRVSFELTGEDVARCLAIKSMTSI-RTESES---SP-KC 420
           SEVASLANS +GC ND  V +HRVSFELTGEDVARCLA KS +S  RT SES   SP +C
Sbjct: 361 SEVASLANSDNGCQNDGSVVDHRVSFELTGEDVARCLASKSASSNGRTTSESLEDSPAEC 420

Query: 421 HQNENKESSSREAETYEFFDIKTTSSTAPEQ--AAGEDNRCYQNQRAVTLGSFKEFNFDR 480
              ++  S++      +   ++ TS+  P+     GED+  YQ  R++TLGS KEFNFD 
Sbjct: 421 PTKKDGISANNVDSPNDQSCVEETSNKTPQSDCREGEDDHFYQKHRSITLGSIKEFNFDN 480

Query: 481 TKGEMQNTASVGAEWWANEKVGVKEASSPGNNWTFFPILQSGVS 499
           TK ++    ++G+EWWANEKV  KEA + GN+W+FFPILQ GVS
Sbjct: 481 TKADVSVKPTIGSEWWANEKVAGKEAKA-GNSWSFFPILQPGVS 521

BLAST of CmoCh18G012870 vs. TrEMBL
Match: M5XMF7_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004616mg PE=4 SV=1)

HSP 1 Score: 639.8 bits (1649), Expect = 2.7e-180
Identity = 341/502 (67.93%), Positives = 396/502 (78.88%), Query Frame = 1

Query: 1   MGSMNNSVDTVNAAATAIISAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISNAV 60
           M S+N+SVDT+NAAATAI+SAEAR QP T PKRRWGSCWSLYWCFG    KN KRI +AV
Sbjct: 1   MRSVNSSVDTINAAATAIVSAEARPQPTTVPKRRWGSCWSLYWCFG--PHKN-KRIGHAV 60

Query: 61  LVPEPVVPGAVAAAVEHRPPSTTMVLPFIAPPSSPASFLQSEPPSHSQSPAGLLSLTALS 120
           LVPEPVVPGA  +A++++  ST +V+PFIAPPSSPASFL S+PPS +QSPAG LSL +LS
Sbjct: 61  LVPEPVVPGAAVSAIDNQTTSTAIVVPFIAPPSSPASFLPSDPPSATQSPAGFLSLKSLS 120

Query: 121 VNNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVP 180
            N YSP GPASIF+IGPY YETQLVSPPVFS F TEPSTAPFTPPPESVQLTTPSSPEVP
Sbjct: 121 ANAYSPGGPASIFSIGPYAYETQLVSPPVFSTFNTEPSTAPFTPPPESVQLTTPSSPEVP 180

Query: 181 FAKLLTSSLSHTNRSFGTNHKFALSHCDFQPYQPYPGSPGAYLISPGSVISNSGTSSPFP 240
           FA+LLTSSL    R+ GTN KFALSH +FQPYQ YPGSPG  LISPGS +SNSGTSSPFP
Sbjct: 181 FAQLLTSSLDRNRRNSGTNQKFALSHYEFQPYQQYPGSPGGNLISPGSAVSNSGTSSPFP 240

Query: 241 DKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG 300
           D+HP+LEFRM +APKL G +HFTTRKW SR+GSGSLTPDG GLGSRLGSG+LTPDG  +G
Sbjct: 241 DRHPVLEFRMGEAPKLFGFDHFTTRKWGSRIGSGSLTPDGVGLGSRLGSGSLTPDGNELG 300

Query: 301 SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSGSGCP--N 360
           SRLGSG VTPNG    SRLGSG LTPDG G A +D  LL+NQISEVASLANS SGC    
Sbjct: 301 SRLGSGCVTPNGAGIGSRLGSGCLTPDGPGPASRDSFLLENQISEVASLANSESGCQTVE 360

Query: 361 DVTNHRVSFELTGEDVARCLAIKSMTSIRTESESSP--KCHQNENKESSSREAETYEFFD 420
            V +HRVSFELTGEDVA CLA K++ S RT S SS          +++ S ++  +  F 
Sbjct: 361 TVFDHRVSFELTGEDVACCLANKAVASNRTASGSSKVIASEYPSERDALSSDSSNHCEFS 420

Query: 421 IKTTSSTAPEQAAGE-DNRCYQNQRAVTLGSFKEFNFDRTKGEMQNTASVGAEWWANEKV 480
           ++ +SS  PE  +GE +++ Y+  R++TLGS K+FNFD TK E+ N  ++G+EWWAN+ V
Sbjct: 421 VEESSSRIPENVSGEGEDQGYRKHRSITLGSTKDFNFDNTKAEVPNKPNIGSEWWANKNV 480

Query: 481 GVKEASSPGNNWTFFPILQSGV 498
             KE S P N+WTFFPILQ GV
Sbjct: 481 AAKE-SKPCNDWTFFPILQPGV 498

BLAST of CmoCh18G012870 vs. TrEMBL
Match: A0A067JHK1_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26079 PE=4 SV=1)

HSP 1 Score: 634.8 bits (1636), Expect = 8.8e-179
Identity = 342/502 (68.13%), Positives = 394/502 (78.49%), Query Frame = 1

Query: 1   MGSMNNSVDTVNAAATAIISAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISNAV 60
           M S+NNSV+T+NAAATAIISAE+RVQP    KRRWG CWSLYWCFG  S KNSKRI +AV
Sbjct: 1   MRSVNNSVETINAAATAIISAESRVQPTVVQKRRWGGCWSLYWCFG--SHKNSKRIGHAV 60

Query: 61  LVPEPVVPGAVAAAVEHRPPSTTMVLPFIAPPSSPASFLQSEPPSHSQSPAGLLSLTALS 120
           LVPEP VP AV  + E++  ST   +PFIAPPSSPASFLQS+PPS +QSPAGLLSLTALS
Sbjct: 61  LVPEPEVPQAVVTSAENQTHSTAAAVPFIAPPSSPASFLQSDPPSVTQSPAGLLSLTALS 120

Query: 121 VNNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVP 180
           V+ YSP GPASIFAIGPY +ETQLV+PPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVP
Sbjct: 121 VSAYSPGGPASIFAIGPYAHETQLVTPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVP 180

Query: 181 FAKLLTSSLSHTNRSFGTNHKFALSHCDFQPYQPYPGSPGAYLISPGSVISNSGTSSPFP 240
           FA+LLTSSL    R+ G N KFALSH +FQ Y  YPGSPG  LISPGS+ISNSGTSSPFP
Sbjct: 181 FAQLLTSSLERARRNSGANQKFALSHYEFQSYPLYPGSPGGQLISPGSIISNSGTSSPFP 240

Query: 241 DKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG 300
           D+HP+LEFRM +APKLLG EHFTTRKW SR+GSG+LTPDG GLGSRL SGT TPDG+G+G
Sbjct: 241 DRHPLLEFRMGEAPKLLGFEHFTTRKWGSRLGSGTLTPDGVGLGSRLCSGTATPDGVGLG 300

Query: 301 SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSGSGCPND- 360
           SRLGSGSVTP+G+   SRLGSG+LTPD +  A QDGLLL+NQISEVASLANS +   ND 
Sbjct: 301 SRLGSGSVTPDGVGLRSRLGSGSLTPDCVVPASQDGLLLENQISEVASLANSENASKNDE 360

Query: 361 -VTNHRVSFELTGEDVARCLAIKSMTSIRTESESSPKCHQNENKESSSREAETYEFFDIK 420
            + +HRVSFEL+GE+VARCL  KSMTS RT SE        E   S      + +   I 
Sbjct: 361 NIVDHRVSFELSGEEVARCLESKSMTSSRTFSECPQDSMAEEQINSEEILINSNDCLHIG 420

Query: 421 TTSSTAPEQAAG--EDNRCYQNQRAVTLGSFKEFNFDRTKGEMQNTASVGAEWWANEKVG 480
            TS+  PE+ +G  E+  CY+  R++TLGS KEFNFD +K E+ +  ++ +EWWANE + 
Sbjct: 421 ETSNETPEKPSGETEEEPCYRKHRSITLGSIKEFNFDNSK-EVPDKPTISSEWWANETIA 480

Query: 481 VKEASSPGNNWTFFPILQSGVS 499
            KEA  P NNWTFFP+LQ  VS
Sbjct: 481 GKEA-RPANNWTFFPLLQPEVS 498

BLAST of CmoCh18G012870 vs. TrEMBL
Match: B9RIV6_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1583050 PE=4 SV=1)

HSP 1 Score: 622.1 bits (1603), Expect = 5.9e-175
Identity = 332/498 (66.67%), Positives = 388/498 (77.91%), Query Frame = 1

Query: 5   NNSVDTVNAAATAIISAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISNAVLVPE 64
           N+SVDT+NAAATAI+SAE+RVQP T  KRRWG CWSLYWCFG      +KRI +AVL PE
Sbjct: 19  NSSVDTINAAATAIVSAESRVQPTTVQKRRWGGCWSLYWCFG---SHKTKRIGHAVLAPE 78

Query: 65  PVVPGAVAAAVEHRPPSTTMVLPFIAPPSSPASFLQSEPPSHSQSPAGLLSLTALSVNNY 124
           P V GAV  + E++  ST + +PFIAPPSSPASFLQS+PPS +QSPAGLLSLT+LSVN Y
Sbjct: 79  PEVQGAVVTSAENQSQSTAITVPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAY 138

Query: 125 SPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKL 184
           SP GPASIFAIGPY +ETQLV+PP FSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFA+L
Sbjct: 139 SPGGPASIFAIGPYAHETQLVTPPAFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQL 198

Query: 185 LTSSLSHTNRSFGTNHKFALSHCDFQPYQPYPGSPGAYLISPGSVISNSGTSSPFPDKHP 244
           LTSSL    R+ GTN KFALSH +FQ Y  YPGSPG  LISPGSVISNSGTSSPFPD++P
Sbjct: 199 LTSSLERARRNSGTNQKFALSHYEFQSYPLYPGSPGGQLISPGSVISNSGTSSPFPDRYP 258

Query: 245 ILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMGSRLG 304
           ILEFRM +APKLLG EHFTTRKW SR+GSG++TPDG GLGSRLGSGT+TPDG+G GSRLG
Sbjct: 259 ILEFRMGEAPKLLGFEHFTTRKWGSRLGSGTVTPDGVGLGSRLGSGTVTPDGVGQGSRLG 318

Query: 305 SGSVTPNGMRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSGSGCPND--VTN 364
           SG+VTP+G+   S LGSG+LTPD +G A +DG  L+NQISEVASLANS +G   D  + +
Sbjct: 319 SGTVTPDGVGLRSMLGSGSLTPDAVGPASRDGFFLENQISEVASLANSENGSKTDENIVD 378

Query: 365 HRVSFELTGEDVARCLAIKSMTSIRTESESSPKCHQNENKESSSREAETYEFFDIKTTSS 424
           HRVSFEL+GE+VARCL  KS+ S R  SE  P     E++  S +   T E      TS 
Sbjct: 379 HRVSFELSGEEVARCLESKSLASCRAFSECPPD-SMAEDQIKSGKMLMTDENLPTGETSG 438

Query: 425 TAPEQAAG--EDNRCYQNQRAVTLGSFKEFNFDRTKGEMQNTASVGAEWWANEKVGVKEA 484
             PE+ +G  E+  CY+  R++TLGS KEFNFD +K E+ +  S+ +EWWANE +  KEA
Sbjct: 439 ETPEKPSGEMEEEHCYRKHRSITLGSIKEFNFDNSK-EVPDKPSINSEWWANETIAGKEA 498

Query: 485 SSPGNNWTFFPILQSGVS 499
             P NNWTFFP+LQ  VS
Sbjct: 499 -RPANNWTFFPLLQPEVS 510

BLAST of CmoCh18G012870 vs. TrEMBL
Match: V4S4F7_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10004813mg PE=4 SV=1)

HSP 1 Score: 615.5 bits (1586), Expect = 5.5e-173
Identity = 328/504 (65.08%), Positives = 392/504 (77.78%), Query Frame = 1

Query: 1   MGSMNNSVDTVNAAATAIISAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISNAV 60
           M S+++SV+TVNAAATAI+SAE+R++P    KRRWGSCWSLYWCFG  S K SKRIS+AV
Sbjct: 1   MSSVHDSVETVNAAATAIVSAESRLRPAAIQKRRWGSCWSLYWCFG--SHKTSKRISHAV 60

Query: 61  LVPEPVVPGAVAAAVEHRPPSTTMVLPFIAPPSSPASFLQSEPPSHSQSPAGLLSLTALS 120
           LVPEP+V GA A A E +  ST +VLPFIAPPSSPASFLQS+PPS +QSPAGLLSL +LS
Sbjct: 61  LVPEPMVTGAAAPAAETQAHSTAIVLPFIAPPSSPASFLQSDPPSATQSPAGLLSLNSLS 120

Query: 121 VNNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVP 180
           VN YSP GPAS+FAIGPY +ETQLV+PPVFSAFTTEPSTA  TPPPESVQLTTPSSPEVP
Sbjct: 121 VNAYSPGGPASMFAIGPYAHETQLVTPPVFSAFTTEPSTALCTPPPESVQLTTPSSPEVP 180

Query: 181 FAKLLTSSLSHTNRSFGTNHKFALSHCDFQPYQPYPGSPGAYLISPGSVISNSGTSSPFP 240
           FA+LLTSSL    R+ GTN K +LSH  +QPYQ YPGSPG  LISPGSV+S SGTSSPFP
Sbjct: 181 FAQLLTSSLERARRNSGTNQKLSLSHYGYQPYQLYPGSPGGQLISPGSVVSYSGTSSPFP 240

Query: 241 DKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG 300
           D+HPIL+F  A APKLLG EHFTTRKW SR+GSGS+TPDG G+GSR+GSG+LTPDG+G+G
Sbjct: 241 DRHPILDFSAAAAPKLLGFEHFTTRKWGSRLGSGSVTPDGVGIGSRMGSGSLTPDGVGLG 300

Query: 301 SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSGSGCPND- 360
           SRLGSG+VTP+G    SRLGSG+LTPDG+G   +DG + +NQISEVASLANS +G  +D 
Sbjct: 301 SRLGSGTVTPDGAGLGSRLGSGSLTPDGMGPTSRDGFVRENQISEVASLANSDNGTKSDE 360

Query: 361 -VTNHRVSFELTGEDVARCLAIKSMTSIRTESESSPKCHQNENKESSSREAETYEFFDI- 420
            + +HRVSFEL+GE+VARCLA KS  S R   E               +  ++   F++ 
Sbjct: 361 HIIDHRVSFELSGEEVARCLANKSAASPRIVPEFPQDIVPEGEIRRDGKLTDSENHFELC 420

Query: 421 -KTTSSTAPEQAA--GEDNRCYQNQRAVTLGSFKEFNFDRTKGEMQNTASVGAEWWANEK 480
            + +S+  PE+    GE+  CY+  R++TLGS KEFNFD T+GE+ N  S+ +EWWANE 
Sbjct: 421 PEESSNRMPEKTMRDGEEEYCYRKHRSITLGSIKEFNFDNTEGEVSNKPSINSEWWANEN 480

Query: 481 VGVKEASSPGNNWTFFPILQSGVS 499
           VG  + S P NNWTFFP+LQS  S
Sbjct: 481 VG--KESKPSNNWTFFPMLQSEAS 500

BLAST of CmoCh18G012870 vs. TAIR10
Match: AT4G25620.1 (AT4G25620.1 hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 428.3 bits (1100), Expect = 6.4e-120
Identity = 280/507 (55.23%), Positives = 321/507 (63.31%), Query Frame = 1

Query: 1   MGSMNNS-VDTVNAAATAIISAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISNA 60
           M S+NNS VDTVNAAA+AI+SAE+R QP +  K+R GS WSLYWCFG  S+KN+KRI +A
Sbjct: 1   MRSVNNSSVDTVNAAASAIVSAESRTQPSSVQKKR-GSWWSLYWCFG--SKKNNKRIGHA 60

Query: 61  VLVPEPVVPGAVAAAVEHRPP-STTMVLPFIAPPSSPASFLQSEPPS--HSQSPAGLLSL 120
           VLVPEP   GA  A V++    ST++ +PFIAPPSSPASFL S PPS  H+  P  L SL
Sbjct: 61  VLVPEPAASGAAVAPVQNSSSNSTSIFMPFIAPPSSPASFLPSGPPSASHTPDPGLLCSL 120

Query: 121 TALSVNNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSS 180
           T         N P S F IGPY +ETQ V+PPVFSAFTTEPSTAPFTPPPES     PSS
Sbjct: 121 TV--------NEPPSAFTIGPYAHETQPVTPPVFSAFTTEPSTAPFTPPPES-----PSS 180

Query: 181 PEVPFAKLLTSSLSHTNRSF--GTNHKFALSHCDFQPYQPYPGSPGAYLISPGSVISNSG 240
           PEVPFA+LLTSSL    R+   G N KF+ +H +F+  Q YPGSPG  LISPG     SG
Sbjct: 181 PEVPFAQLLTSSLERARRNSGGGMNQKFSAAHYEFKSCQVYPGSPGGNLISPG-----SG 240

Query: 241 TSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTP 300
           TSSP+P K  I+EFR+ + PK LG EHFT RKW SR GSGS+TP   G GSRLGSG LTP
Sbjct: 241 TSSPYPGKCSIIEFRIGEPPKFLGFEHFTARKWGSRFGSGSITP--AGQGSRLGSGALTP 300

Query: 301 DGMGMGSRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANS-- 360
           D    GS+L SG VTPNG     R+  G LTP        +G LLD+QISEVASLANS  
Sbjct: 301 D----GSKLTSGVVTPNGAETVIRMSYGNLTP-------LEGSLLDSQISEVASLANSDH 360

Query: 361 GSGCPND---VTNHRVSFELTGEDVARCLAIKSMTSIRTESESSPKCHQNENKESSSREA 420
           GS   ND   V  HRVSFELTGEDVARCLA K   S   E  S      N          
Sbjct: 361 GSSRHNDEALVVPHRVSFELTGEDVARCLASKLNRSGSHEKASGEHLRPN---------- 420

Query: 421 ETYEFFDIKTTSSTAPEQAAGEDNRCYQNQRAVTLGSFKEFNFDRTKGEMQNTASVGAEW 480
                   KT+  T  EQ+        Q  R+ + GS KEF FD T  EM     + +EW
Sbjct: 421 ------CCKTSGETESEQS--------QKLRSFSTGSNKEFKFDSTNEEM--IEKIRSEW 447

Query: 481 WANEKVGVKEASSPGNNWTFFPILQSG 497
           WANEKV  K   SP N+WTFFP+L+SG
Sbjct: 481 WANEKVAGKGDHSPRNSWTFFPVLRSG 447

BLAST of CmoCh18G012870 vs. TAIR10
Match: AT5G52430.1 (AT5G52430.1 hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 425.2 bits (1092), Expect = 5.4e-119
Identity = 271/508 (53.35%), Positives = 316/508 (62.20%), Query Frame = 1

Query: 4   MNNSVDTVNAAATAIISAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISNAVLVP 63
           +NNSV+TVNAAATAI++AE+RVQP +  K RWG CWSLY CFG  +QKN+KRI NAVLVP
Sbjct: 5   VNNSVETVNAAATAIVTAESRVQPSSSQKGRWGKCWSLYSCFG--TQKNNKRIGNAVLVP 64

Query: 64  EPVVPGAVAAAVEHRPPSTTMVLPFIAPPSSPASFLQSEPPSHSQSPAGLLSLTALSVNN 123
           EPV  G     V++   STT+VLPFIAPPSSPASFLQS+P S S SP G LSLT+   N 
Sbjct: 65  EPVTSGVPVVTVQNSATSTTVVLPFIAPPSSPASFLQSDPSSVSHSPVGPLSLTS---NT 124

Query: 124 YSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPFTPPPES-VQLTTPSSPEVPFA 183
           +SP  P S+F +GPY  ETQ V+PPVFSAF TEPSTAP+TPPPES V +TTPSSPEVPFA
Sbjct: 125 FSPKEPQSVFTVGPYANETQPVTPPVFSAFITEPSTAPYTPPPESSVHITTPSSPEVPFA 184

Query: 184 KLLTSSLSHTNR--SFGTNHKFALSHCDFQPYQPYPGSPGA-YLISPGSVISNSGTSSPF 243
           +LLTSSL  T R  + G N KF+ SH +F+  Q  PGSPG   LISPGSVISNSGTSSP+
Sbjct: 185 QLLTSSLELTRRDSTSGMNQKFSSSHYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSPY 244

Query: 244 PDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGM 303
           P K P++EFR+ + PK LG EHFT RKW SR GSGS+TP                  +G 
Sbjct: 245 PGKSPMVEFRIGEPPKFLGFEHFTARKWGSRFGSGSITP------------------VGH 304

Query: 304 GSRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSGSGCPND 363
           GS L SG++TPNG      + SG LTP+     LQ      NQISEVASLANS  G    
Sbjct: 305 GSGLASGALTPNG----PEIVSGNLTPNNTTWPLQ------NQISEVASLANSDHGSEVM 364

Query: 364 VTNHRVSFELTGEDVARCLAIKSMTS---------IRTESESSPKCHQNENKESSSREAE 423
           V +HRVSFELTGEDVARCLA K   S         I TE  SS    +N  K S  RE E
Sbjct: 365 VADHRVSFELTGEDVARCLASKLNRSHDRMNNNDRIETEESSSTDIRRNIEKRSGDRENE 424

Query: 424 TYEFFDIKTTSSTAPEQAAGEDNRCYQNQRAVTLGSFKEFNFDRTKGEMQNTASVGAEWW 483
            +    + ++S                      +GS KEF FD TK E            
Sbjct: 425 QHRIQKLSSSS----------------------IGSSKEFKFDNTKDE------------ 438

Query: 484 ANEKVGVKEASSPGNNWTFFPILQSGVS 499
             EKV        GN+W+FFP L+SGVS
Sbjct: 485 NIEKVA-------GNSWSFFPGLRSGVS 438

BLAST of CmoCh18G012870 vs. TAIR10
Match: AT1G63720.1 (AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1))

HSP 1 Score: 207.2 bits (526), Expect = 2.3e-53
Identity = 137/263 (52.09%), Positives = 168/263 (63.88%), Query Frame = 1

Query: 5   NNSVDTVNAAATAIISAEARVQPPTP--PKRRWGSCWSLYWCFGIGSQKNSKRISNAVLV 64
           NN  DT+NAAA+AI S++ R+   +P   KR+W + WSL  CFG  S +  KRI N+VLV
Sbjct: 8   NNVFDTINAAASAIASSDDRLHQSSPIHKKRKWWNRWSLLKCFG--SSRQRKRIGNSVLV 67

Query: 65  PEPVVPGAVAAAVEHRP-PSTTMVLPFIAPPSSPASFLQSEPPSHSQSPAGLLSLTALSV 124
           PEPV   +  +   +    S    LPFIAPPSSPASF QSEPPS +QSP G+LS + L  
Sbjct: 68  PEPVSMSSSNSTTSNSGYRSVITTLPFIAPPSSPASFFQSEPPSATQSPVGILSFSPLPC 127

Query: 125 NNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPFTPPPESVQL----TTPSSP 184
           NN       SIFAIGPY +ETQLVSPPVFS +TTEPS+AP TPP +   +    TTPSSP
Sbjct: 128 NNRP-----SIFAIGPYAHETQLVSPPVFSTYTTEPSSAPITPPLDDSSIYLTTTTPSSP 187

Query: 185 EVPFAKLLTSSLSHTNRSFGTNHKFALSHC-DFQPYQPYPGSPGAYLISPGSVISNSGTS 244
           EVPFA+L  S  +H   S+G  +KF +S   +FQ YQ  PGSP   LISP      SG +
Sbjct: 188 EVPFAQLFNS--NHQTGSYG--YKFPMSSSYEFQFYQLPPGSPLGQLISPS---PGSGPT 247

Query: 245 SPFPDKHPIL--EFRMADAPKLL 258
           SPFPD    L   F+++D PKLL
Sbjct: 248 SPFPDGETSLFPHFQVSDPPKLL 256

BLAST of CmoCh18G012870 vs. TAIR10
Match: AT1G76660.1 (AT1G76660.1 FUNCTIONS IN: molecular_function unknown)

HSP 1 Score: 151.8 bits (382), Expect = 1.1e-36
Identity = 105/215 (48.84%), Positives = 126/215 (58.60%), Query Frame = 1

Query: 32  KRRWGSCWSLYWCFGIGSQKNSKRISNAVLVPE------PVVPGAVAAAVEHRPPSTTMV 91
           ++RWG C  ++ CF   SQK  KRI  A  +PE          GA  A V +   +  + 
Sbjct: 8   RKRWGGCLGVFSCFK--SQKGGKRIVPASRIPEGGNVSASQPNGAHQAGVLNNQAAGGIN 67

Query: 92  LPFIAPPSSPASFLQSEPPSHSQSPAGLLSLTALSVNNYSPNGPAS-IFAIGPYTYETQL 151
           L  +APPSSPASF  S  PS +QSP   LSL A      SP GP+S ++A GPY +ETQL
Sbjct: 68  LSLLAPPSSPASFTNSALPSTTQSPNCYLSLAA-----NSPGGPSSSMYATGPYAHETQL 127

Query: 152 VSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNRSFGTNHKFAL 211
           VSPPVFS FTTEPSTAPFTPPPE  +LT PSSP+VP+A+ LTSS+   N   G       
Sbjct: 128 VSPPVFSTFTTEPSTAPFTPPPELARLTAPSSPDVPYARFLTSSMDLKNSGKG------- 187

Query: 212 SHCDFQ-PYQPYPGSPGAYLISPGSVISNSGTSSP 239
            + D Q  Y  YPGSP + L SP S  S  G  SP
Sbjct: 188 HYNDLQATYSLYPGSPASALRSPISRASGDGLLSP 208

BLAST of CmoCh18G012870 vs. NCBI nr
Match: gi|659077554|ref|XP_008439268.1| (PREDICTED: uncharacterized protein LOC103484098 [Cucumis melo])

HSP 1 Score: 864.0 bits (2231), Expect = 1.3e-247
Identity = 446/501 (89.02%), Positives = 467/501 (93.21%), Query Frame = 1

Query: 1   MGSMNNSVDTVNAAATAIISAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISNAV 60
           MGS+NNSVDTVNAAATAI+SAEARVQP TPPKRRWGSCWSLYWCFGIGSQKN+KRI +AV
Sbjct: 1   MGSINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAV 60

Query: 61  LVPEPVVPGAVAAAVEHRPPSTTMVLPFIAPPSSPASFLQSEPPSHSQSPAGLLSLTALS 120
           LVPEP VPGAVA AVEHR PSTTMVLPFIAPPSSPASFLQS P S++QSPAGLLSLTALS
Sbjct: 61  LVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSGPTSNTQSPAGLLSLTALS 120

Query: 121 VNNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVP 180
           VNNYSPNGPASIFAIGPY Y+TQLVSPPVFSAFTTEPSTAP TPPPESVQLTTPSSPEVP
Sbjct: 121 VNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVP 180

Query: 181 FAKLLTSSLSHTNRSFGTNHKFALSHCDFQPYQPYPGSPGAYLISPGSVISNSGTSSPFP 240
           FAKLLTSSLSHTN+SFGTN KF LSHCDFQPYQPYPGSPGA+LISPGSVISNSGTSSPFP
Sbjct: 181 FAKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFP 240

Query: 241 DKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG 300
           DKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
Sbjct: 241 DKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG 300

Query: 301 SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSGSGCPNDV 360
           SRLGSGSVTPNG+RQDSRLGSGTLTPDGLGH LQD  LLDNQISEVASLANS +GC NDV
Sbjct: 301 SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDV 360

Query: 361 TNHRVSFELTGEDVARCLAIKSMTSIRTESES---SPKCHQNENKESSSREAETYEFFDI 420
           TNHRVSFELTGEDVARCLA KS+TSIRTESES   +   +QNENKE  SREAET EFFDI
Sbjct: 361 TNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSNQNENKE-LSREAETCEFFDI 420

Query: 421 KTTSSTAPEQAAGEDNRCYQNQRAVTLGSFKEFNFDRTKGEMQNTASVGAEWWANEKVGV 480
           KT  S APE+  GED++CYQNQRAVTLGSFKEFNFD+TKGE+ NTAS+GAEWWANEKVGV
Sbjct: 421 KT--SMAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEVHNTASIGAEWWANEKVGV 480

Query: 481 KEASSPGNNWTFFPILQSGVS 499
           KEA SPGNNWTFFP+LQ GVS
Sbjct: 481 KEA-SPGNNWTFFPLLQPGVS 497

BLAST of CmoCh18G012870 vs. NCBI nr
Match: gi|778679650|ref|XP_004140832.2| (PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101210841 [Cucumis sativus])

HSP 1 Score: 863.2 bits (2229), Expect = 2.2e-247
Identity = 446/501 (89.02%), Positives = 467/501 (93.21%), Query Frame = 1

Query: 1   MGSMNNSVDTVNAAATAIISAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISNAV 60
           M S+NNSVDTVNAAATAI+SAEARVQP TPPKRRWGSCWSLYWCFGIGSQK++KRI +AV
Sbjct: 1   MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAV 60

Query: 61  LVPEPVVPGAVAAAVEHRPPSTTMVLPFIAPPSSPASFLQSEPPSHSQSPAGLLSLTALS 120
           LVPEP VPGAVA AVEHR PSTTMVLPFIAPPSSPASFLQSEP S++QSPAGLLSLTALS
Sbjct: 61  LVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALS 120

Query: 121 VNNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVP 180
           VNNYSPNGPASIFAIGPYTY+TQLVSPPVFSAFTTEPSTAP TPPPESVQLTTPSSPEVP
Sbjct: 121 VNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVP 180

Query: 181 FAKLLTSSLSHTNRSFGTNHKFALSHCDFQPYQPYPGSPGAYLISPGSVISNSGTSSPFP 240
           FAKLLTSSLSHTN+SFGTN KF LSHCDFQPYQPYPGSPGA+LISPGSVISNSGTSSPFP
Sbjct: 181 FAKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFP 240

Query: 241 DKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG 300
           DKHPILEFRMADAPKLLGLEHFTTRKWI RMGSGSLTPDGTGL SRLGSGTLTPDGMGMG
Sbjct: 241 DKHPILEFRMADAPKLLGLEHFTTRKWIXRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG 300

Query: 301 SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSGSGCPNDV 360
           SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGH LQD  LLDNQISEVASLANS +GC NDV
Sbjct: 301 SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDV 360

Query: 361 TNHRVSFELTGEDVARCLAIKSMTSIRTESES---SPKCHQNENKESSSREAETYEFFDI 420
           TNHRVSFELTGEDVARCLA KS+TSIRTESES   +   +QNENKE SSREAET EFFDI
Sbjct: 361 TNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSNQNENKE-SSREAETCEFFDI 420

Query: 421 KTTSSTAPEQAAGEDNRCYQNQRAVTLGSFKEFNFDRTKGEMQNTASVGAEWWANEKVGV 480
           KT  S APE+  GED++CYQNQRAVTLGSFKEFNFD+TKGE+ NTAS+GAEWWANEKVGV
Sbjct: 421 KT--SAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGV 480

Query: 481 KEASSPGNNWTFFPILQSGVS 499
           KEA SPGNNWTFFP+LQ GVS
Sbjct: 481 KEA-SPGNNWTFFPLLQPGVS 497

BLAST of CmoCh18G012870 vs. NCBI nr
Match: gi|1009109183|ref|XP_015888763.1| (PREDICTED: uncharacterized protein LOC107423668 [Ziziphus jujuba])

HSP 1 Score: 677.6 bits (1747), Expect = 1.7e-191
Identity = 360/505 (71.29%), Positives = 408/505 (80.79%), Query Frame = 1

Query: 1   MGSMNNSVDTVNAAATAIISAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISNAV 60
           M S+NNSV+T+NAAA+AI+SAE R QP   PKRRWGSCWSLYWCFG  S KN+KRIS+AV
Sbjct: 1   MRSVNNSVETINAAASAIVSAETRAQPTAVPKRRWGSCWSLYWCFG--SHKNTKRISHAV 60

Query: 61  LVPEPVVPGAVAAAVEHRPPSTTMVLPFIAPPSSPASFLQSEPPSHSQSPAGLLSLTALS 120
           LVPE VVPGA   A E++ PST +VLPFIAPPSSPASFLQS+PPS +QSPAGLLSLT+LS
Sbjct: 61  LVPEQVVPGAAVPAAENQIPSTAVVLPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLS 120

Query: 121 VNNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVP 180
           VN YSP GPASIFAIGPY YETQLVSPPVFS FTTEPSTAPFTPPPESVQLTTPSSPEVP
Sbjct: 121 VNAYSPGGPASIFAIGPYAYETQLVSPPVFSTFTTEPSTAPFTPPPESVQLTTPSSPEVP 180

Query: 181 FAKLLTSSLSHTNRSFGTNHKFALSHCDFQPYQPYPGSPGAYLISPGSVISNSGTSSPFP 240
           FA+LLTSSL  T R+ GTN KFALSHC+FQPYQPYPGSPG  LISPGSVISNSGTSSPFP
Sbjct: 181 FAQLLTSSLDRTRRNNGTNQKFALSHCEFQPYQPYPGSPGGQLISPGSVISNSGTSSPFP 240

Query: 241 DKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG 300
           D+HPILEFRM +AP+LLG EHFTTRKW SR+GSGS+TPDG GLGSRLGSG LTPDG G+G
Sbjct: 241 DRHPILEFRMGEAPRLLGFEHFTTRKWGSRLGSGSITPDGLGLGSRLGSGCLTPDGNGLG 300

Query: 301 SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSGSGCPND- 360
           SR+GSGS+TPNG    SRLGSG LTPDG+G A  D   ++NQISEVASLANS SGC  D 
Sbjct: 301 SRIGSGSLTPNGAGLASRLGSGCLTPDGVGPASGDSFPMENQISEVASLANSESGCQLDG 360

Query: 361 -VTNHRVSFELTGEDVARCLAIKSMTSIRTESE----SSPKCHQNENKESSSREAETYEF 420
            V NHRVSFELTGEDVARCLA KSM S+RT S+    +  +C   +++  S+   + +  
Sbjct: 361 NVINHRVSFELTGEDVARCLANKSMASVRTASDPLKDTPSECGVKKDRMISTG-TDHFSE 420

Query: 421 FDIKTTSSTAPEQAAGE-DNRCYQNQRAVTLGSFKEFNFDRTKGEMQNTASVGAEWWANE 480
             ++ TS   PE   GE +++CY+  R++TLGS KEFNFD TK E  +  + G+EWWANE
Sbjct: 421 SCVEETSVELPENDHGEWEDQCYRKHRSITLGSIKEFNFDSTKSEFSDKPTNGSEWWANE 480

Query: 481 KVGVKEASSPGNNWTFFPILQSGVS 499
           KV  KE S PGN WTFFPILQ GVS
Sbjct: 481 KVAGKE-SKPGNGWTFFPILQPGVS 501

BLAST of CmoCh18G012870 vs. NCBI nr
Match: gi|703122806|ref|XP_010102658.1| (hypothetical protein L484_004326 [Morus notabilis])

HSP 1 Score: 650.6 bits (1677), Expect = 2.2e-183
Identity = 357/524 (68.13%), Positives = 408/524 (77.86%), Query Frame = 1

Query: 1   MGSMNNSVDTVNAAATAIISAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISNAV 60
           M ++NNSV+T+NAAATAI+SAEAR QP   PKRRWGSCWSLYWCFG  S KNSKRI +AV
Sbjct: 1   MRTVNNSVETINAAATAIVSAEARAQPAAVPKRRWGSCWSLYWCFG--SHKNSKRIGHAV 60

Query: 61  LVPEPVVPGAVAAAVEHRPPSTTMVLPFIAPPSSPASFLQSEPPSHSQSPAGLLSLTALS 120
           LVPEPV+PGA A A E++ PST +VLPFIAPPSSPASFLQS+PPS +QSPAGLLSLT+LS
Sbjct: 61  LVPEPVLPGAAAPAPENQAPSTAIVLPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLS 120

Query: 121 VNNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVP 180
           +N YSP GP SIFAIGPY YETQLVSPPVFS FTTEPSTAPFTPPPESVQLTTPSSPEVP
Sbjct: 121 INAYSPGGPTSIFAIGPYAYETQLVSPPVFSTFTTEPSTAPFTPPPESVQLTTPSSPEVP 180

Query: 181 FAKLLTSSLSHTNR-SFGTNHKFALSHCDFQPYQPYPGSPGAYLISPGSVISNSGTSSPF 240
           FA+LLTSSL  T R S G N KF+LSHC+FQPYQ YPGSPG  LISPGSV+SNSGTSSPF
Sbjct: 181 FAQLLTSSLDRTRRNSSGANQKFSLSHCEFQPYQLYPGSPGGNLISPGSVVSNSGTSSPF 240

Query: 241 PDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGM 300
           PDKHPIL FRM +AP+LLG EHFTT KW SR+GSGSLTPDG GLGSRLGSG++TPDG+G+
Sbjct: 241 PDKHPILGFRMGEAPRLLGFEHFTTWKWGSRLGSGSLTPDGVGLGSRLGSGSVTPDGVGL 300

Query: 301 GSRLGSGS----------------VTPNGMRQDSRLGSGTLTPDGLGHALQDGLLLDNQI 360
           GSRLGSGS                +TPNG    SRLGSGTLTPDG      D  LL+NQI
Sbjct: 301 GSRLGSGSLTPDGYGLGSRLGSGCMTPNGPGLGSRLGSGTLTPDGFLVVSGDSFLLENQI 360

Query: 361 SEVASLANSGSGCPND--VTNHRVSFELTGEDVARCLAIKSMTSI-RTESES---SP-KC 420
           SEVASLANS +GC ND  V +HRVSFELTGEDVARCLA KS +S  RT SES   SP +C
Sbjct: 361 SEVASLANSDNGCQNDGSVVDHRVSFELTGEDVARCLASKSASSNGRTTSESLEDSPAEC 420

Query: 421 HQNENKESSSREAETYEFFDIKTTSSTAPEQ--AAGEDNRCYQNQRAVTLGSFKEFNFDR 480
              ++  S++      +   ++ TS+  P+     GED+  YQ  R++TLGS KEFNFD 
Sbjct: 421 PTKKDGISANNVDSPNDQSCVEETSNKTPQSDCREGEDDHFYQKHRSITLGSIKEFNFDN 480

Query: 481 TKGEMQNTASVGAEWWANEKVGVKEASSPGNNWTFFPILQSGVS 499
           TK ++    ++G+EWWANEKV  KEA + GN+W+FFPILQ GVS
Sbjct: 481 TKADVSVKPTIGSEWWANEKVAGKEAKA-GNSWSFFPILQPGVS 521

BLAST of CmoCh18G012870 vs. NCBI nr
Match: gi|645256977|ref|XP_008234199.1| (PREDICTED: uncharacterized protein LOC103333182 [Prunus mume])

HSP 1 Score: 640.6 bits (1651), Expect = 2.3e-180
Identity = 341/502 (67.93%), Positives = 397/502 (79.08%), Query Frame = 1

Query: 1   MGSMNNSVDTVNAAATAIISAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISNAV 60
           M S+N+SVDT+NAAATAI+SAEAR QP T PKRRWGSCWSLYWCFG  S KN KRI +AV
Sbjct: 1   MRSVNSSVDTINAAATAIVSAEARAQPTTVPKRRWGSCWSLYWCFG--SHKN-KRIGHAV 60

Query: 61  LVPEPVVPGAVAAAVEHRPPSTTMVLPFIAPPSSPASFLQSEPPSHSQSPAGLLSLTALS 120
           LVPEPVVPGA  +A++++  ST +V+PFIAPPSSPASFL S+PPS +QSPAG LSL +LS
Sbjct: 61  LVPEPVVPGAAVSAIDNQTTSTAIVVPFIAPPSSPASFLPSDPPSATQSPAGFLSLKSLS 120

Query: 121 VNNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVP 180
            N YSP GPASIF+IGPY YETQLVSPPVFS F TEPSTAPFTPPPESVQLTTPSSPEVP
Sbjct: 121 ANAYSPGGPASIFSIGPYAYETQLVSPPVFSTFNTEPSTAPFTPPPESVQLTTPSSPEVP 180

Query: 181 FAKLLTSSLSHTNRSFGTNHKFALSHCDFQPYQPYPGSPGAYLISPGSVISNSGTSSPFP 240
           FA+LLTSSL    R+ GTN KFALSH +FQPYQ YPGSPG  LISPGS +SNSGTSSPFP
Sbjct: 181 FAQLLTSSLDRNRRNSGTNQKFALSHYEFQPYQQYPGSPGGNLISPGSAVSNSGTSSPFP 240

Query: 241 DKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG 300
           D+HP+LEF M +APKL G +HFTTRKW SR+GSGSLTPDG GLGSRLGSG+LTPDG  +G
Sbjct: 241 DRHPVLEFHMGEAPKLFGFDHFTTRKWGSRIGSGSLTPDGVGLGSRLGSGSLTPDGNELG 300

Query: 301 SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSGSGCP--N 360
           SRLGSG VTPNG    SRLGSG LTPDG G A +D  LL+NQISEVASLANS SGC    
Sbjct: 301 SRLGSGCVTPNGAGIGSRLGSGCLTPDGPGPASRDSFLLENQISEVASLANSESGCQTVE 360

Query: 361 DVTNHRVSFELTGEDVARCLAIKSMTSIRTESESSPKCHQN--ENKESSSREAETYEFFD 420
            V +HRVSFELTGEDVA CLA K+M S RT S SS     +    +++ S ++  +  F 
Sbjct: 361 TVFDHRVSFELTGEDVACCLANKAMASNRTASGSSKVIASDYPSERDALSSDSSNHCEFS 420

Query: 421 IKTTSSTAPEQAAGE-DNRCYQNQRAVTLGSFKEFNFDRTKGEMQNTASVGAEWWANEKV 480
           ++ +SS  PE  +GE +++ Y+  R++TLGS K+FNFD TK E+ +  ++G+EWWAN+ V
Sbjct: 421 VEESSSRIPENVSGEGEDQGYRKHRSITLGSTKDFNFDNTKAEVPSKPNIGSEWWANKNV 480

Query: 481 GVKEASSPGNNWTFFPILQSGV 498
             KE S P N+WTFFPILQ GV
Sbjct: 481 AAKE-SKPCNDWTFFPILQPGV 498
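The percentages in each HSP summary line are the identical (or positive) residue count divided by the alignment length, which includes gap columns. As a minimal sketch, the check below recomputes them from the counts. The function `parse_hsp_stats` and its regular expression are hypothetical helpers written for this illustration, not part of the database or of any BLAST tool; the pattern assumes the summary-line layout shown in the records above and tolerates the "Postives" spelling that some exports use.

```python
import re

# Hypothetical helper for illustration; assumes the HSP summary layout
# "Identity = a/n (x%), Positives = b/n (y%)" seen in the records above.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts plus recomputed and reported percentages, or None."""
    match = HSP_STATS.search(line)
    if match is None:
        return None
    ident, length, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "length": int(length),
        # Recomputed from the counts: 100 * matches / alignment length.
        "identity_pct": round(100 * int(ident) / int(length), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the summary line, for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 360/505 (71.29%), Positives = 408/505 (80.79%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positives_pct"])
```

For the Ziziphus jujuba hit, 360/505 and 408/505 reproduce the reported 71.29% and 80.79%.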

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
Y1666_ARATH	2.0e-35	48.84	Uncharacterized protein At1g76660 OS=Arabidopsis thaliana GN=At1g76660 PE=2 SV=1

Match Name	E-value	Identity	Description
W9S7Z6_9ROSA	1.6e-183	68.13	Uncharacterized protein OS=Morus notabilis GN=L484_004326 PE=4 SV=1
M5XMF7_PRUPE	2.7e-180	67.93	Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004616mg PE=4 SV=1
A0A067JHK1_JATCU	8.8e-179	68.13	Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26079 PE=4 SV=1
B9RIV6_RICCO	5.9e-175	66.67	Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1583050 PE=4 SV=1
V4S4F7_9ROSI	5.5e-173	65.08	Uncharacterized protein OS=Citrus clementina GN=CICLE_v10004813mg PE=4 SV=1

Match Name	E-value	Identity	Description
AT4G25620.1	6.4e-120	55.23	hydroxyproline-rich glycoprotein family protein
AT5G52430.1	5.4e-119	53.35	hydroxyproline-rich glycoprotein family protein
AT1G63720.1	2.3e-53	52.09	BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glyc...
AT1G76660.1	1.1e-36	48.84	FUNCTIONS IN: molecular_function unknown

Match Name	E-value	Identity	Description
gi|659077554|ref|XP_008439268.1|	1.3e-247	89.02	PREDICTED: uncharacterized protein LOC103484098 [Cucumis melo]
gi|778679650|ref|XP_004140832.2|	2.2e-247	89.02	PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101210841 [Cucumis sa...
gi|1009109183|ref|XP_015888763.1|	1.7e-191	71.29	PREDICTED: uncharacterized protein LOC107423668 [Ziziphus jujuba]
gi|703122806|ref|XP_010102658.1|	2.2e-183	68.13	hypothetical protein L484_004326 [Morus notabilis]
gi|645256977|ref|XP_008234199.1|	2.3e-180	67.93	PREDICTED: uncharacterized protein LOC103333182 [Prunus mume]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CmoCh18G012870.1	CmoCh18G012870.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
None	No IPR available	PANTHER	PTHR31798	FAMILY NOT NAMED	coord: 1..498, score: 2.2E
None	No IPR available	PANTHER	PTHR31798:SF4	SUBFAMILY NOT NAMED	coord: 1..498, score: 2.2E

The following gene(s) are paralogous to this gene:
Gene	Paralogue	Organism	Block
CmoCh18G012870	CmoCh16G001230	Cucurbita moschata (Rifu)	cmocmoB275