HG10004848 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10004848
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: cell wall protein RBR3-like
Location: Chr08: 20903816 .. 20905897 (+)
RNA-Seq Expression: HG10004848
Synteny: HG10004848
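The Location field above can be turned into structured coordinates. The following is a minimal sketch, not the database's own tooling: the function name `parse_location` and the returned keys are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

# Pattern for a location such as "Chr08: 20903816 .. 20905897 (+)".
LOCATION = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")

def parse_location(text):
    """Split a location string into sequence ID, start, end, strand and span length."""
    m = LOCATION.match(text.strip())
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("Chr08: 20903816 .. 20905897 (+)")
print(loc["seqid"], loc["strand"], loc["length"])  # Chr08 + 2082
```

Under the inclusive-coordinate assumption the feature spans 2082 bp on the forward strand of Chr08.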
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCACACTCACAATTGCGCATTCTACTTCCTTGGCAATCGTTAAAAGCTTCTCCTCATCCTGCAAATGAGTCGTCAGGATGGAGTTTTGAGCCTACAGATGAAACGGAAACTTCTGCTTCTGCAGCTGATACCGTGCCAAATATTCGACATCAACCGGTACAGTCTCTTGAGATAAATCCAGAACAACCTCCTTTAGCACCAGCTCAGGCACTTGAAAAAAGTGAAACTATGCCACCTTCAAAATCTCATAAGGCAAGCAAAGTTCAATCTCAGCCACCATCAAATTCTCGAGCCAAAAATCGGTCCCGAACAGCTTCCAAGCCTCCATCACAATCTAAAACAATCCCTCAATCTTCAGTTGCTTCCAAGTCTCCTTCAACATCAAGCAAAGGCTCCCCATCTCAGGATACTTCAAAGCCATCATCACCAGCAGGCAAAGCCTCTCCATCTCAGGATGCTTCTTCAAAGCCTTCATCACCTGCAGCAGTTGCATCTACTGCTCCTCGAACCCGGATTGCTTTGAAGCCATCATCTCCATCGTCTCAAACATCCAGTAAAAGCCATCCAAATAAAAAACCAACATCACAATCAAGAATTAAAGCTGATTCTCAGCCTTCATCATCTTCAAGGTCAGCATTTCCATCTCAAGATTTTTCTATGCCACTGCGGTCGCCATCTCAAGAAAATTCTCGACAACAACCATTGGAAAAAACCTCTCGGGTTCAGTCTCCATCTCATTTGTCCAGTAAACCTACTGCACAATTAACATCACAACAACCTATTAAGTCTCCAGCAGCCATTGGAACTCAAATCCATCCAAATTCAAAACCATCATCCCAATCAAGATTTAAAGCTGATTCTCAGCCTTCATCACCTTCAAGGTCAGCATTTCCATCTCAAGATTCTTCTATGCCACCACGGTCGCCATCTCAAGAAATTTCTCGACAACAAGCATCGGAAAAAACCTCTCGAGTTCAGTCTCCATCTCATTTGTCCGGTAAACCTACTGCACAATCAACAACACAACAACCTATTGAATCTCCTACAGCCATTGGAGACCAAACAACAGATGGAATCATTTCTCATCCCGCAAATCAATCCCCAAAAGCAAGACCTACAAACAGGGAAACTCAGTTGCAAACCAAATCAACGCAGCCTCTGAAACCAGACAGGAAACCAGTGGAATCGAAAGCATCAAAAAATCAGCCTGAAACCAAGGAAGAGCTCACATCTAAGAACACTTCCAATCCCCATCCGAACCAGGACTATTCTGAAATCCCAACACAATCCGATCAAAACATGGAAAATGGCTTACATTCCTTTCTAGAATCACAGGCAGAGTCAAAAGAAACTAAGGAAGATCTGGCAAAGACAACCAATGCACTTCAAACCAAAGCATCTAGAAGCACATTAATCACATCTTCCAAAAGTCGTTCATCATTTGAACCAGAAAAGTGGTACTCACAACAGGAAGAATCCATGGAAGACTTATCCAAAGTTTTTCAGAAACTAAACATCAAATATTCAGACAAGGAAAATCAAAAGAGCTTCACAACATTGATCGGCGATAACAAAGGGTCGTCAATGCACTTACTCTCCGGTGAAGCCAAAAGCGAAAGCCCAATCCACATCCACCGTCAATATAAGAGCGATCCAGATCAAAGCCCTAAAAGTTCCACAGAAATCGAAGGAAATTTCAATAACGAAACACCGCAAGATTCAAGAACAGAGGAGAATCCATCACCCCTGGAATTATATATCAACGTCAATGTACAAGGTATCAACAACTCAATCATGTGCAATACCTCATTTATAGAGAATGATCCTGGAGTCAAGTTGAAATTCCCTCGAGAACCAACAAAATCTGAAGATGAATTAGAGGCTCATCACGCTAGAAAAGCAGAATACAGTCCGAGACCGGCGGAGAAGCTTACGTATGAACCCAGAGTAAGACGAAGATGCCTCAGAGGGATGTTAATGGAGTCGAGCGATTCTGA
GGTCGAGAATCCAGAAAAGTCTCGACGCCATGGCTGCCGGTATAGTCGTAATAGCAAAGGAAAAGAGGTCGAAACTCTGTAA

mRNA sequence

ATGTCACACTCACAATTGCGCATTCTACTTCCTTGGCAATCGTTAAAAGCTTCTCCTCATCCTGCAAATGAGTCGTCAGGATGGAGTTTTGAGCCTACAGATGAAACGGAAACTTCTGCTTCTGCAGCTGATACCGTGCCAAATATTCGACATCAACCGGTACAGTCTCTTGAGATAAATCCAGAACAACCTCCTTTAGCACCAGCTCAGGCACTTGAAAAAAGTGAAACTATGCCACCTTCAAAATCTCATAAGGCAAGCAAAGTTCAATCTCAGCCACCATCAAATTCTCGAGCCAAAAATCGGTCCCGAACAGCTTCCAAGCCTCCATCACAATCTAAAACAATCCCTCAATCTTCAGTTGCTTCCAAGTCTCCTTCAACATCAAGCAAAGGCTCCCCATCTCAGGATACTTCAAAGCCATCATCACCAGCAGGCAAAGCCTCTCCATCTCAGGATGCTTCTTCAAAGCCTTCATCACCTGCAGCAGTTGCATCTACTGCTCCTCGAACCCGGATTGCTTTGAAGCCATCATCTCCATCGTCTCAAACATCCAGTAAAAGCCATCCAAATAAAAAACCAACATCACAATCAAGAATTAAAGCTGATTCTCAGCCTTCATCATCTTCAAGGTCAGCATTTCCATCTCAAGATTTTTCTATGCCACTGCGGTCGCCATCTCAAGAAAATTCTCGACAACAACCATTGGAAAAAACCTCTCGGGTTCAGTCTCCATCTCATTTGTCCAGTAAACCTACTGCACAATTAACATCACAACAACCTATTAAGTCTCCAGCAGCCATTGGAACTCAAATCCATCCAAATTCAAAACCATCATCCCAATCAAGATTTAAAGCTGATTCTCAGCCTTCATCACCTTCAAGGTCAGCATTTCCATCTCAAGATTCTTCTATGCCACCACGGTCGCCATCTCAAGAAATTTCTCGACAACAAGCATCGGAAAAAACCTCTCGAGTTCAGTCTCCATCTCATTTGTCCGGTAAACCTACTGCACAATCAACAACACAACAACCTATTGAATCTCCTACAGCCATTGGAGACCAAACAACAGATGGAATCATTTCTCATCCCGCAAATCAATCCCCAAAAGCAAGACCTACAAACAGGGAAACTCAGTTGCAAACCAAATCAACGCAGCCTCTGAAACCAGACAGGAAACCAGTGGAATCGAAAGCATCAAAAAATCAGCCTGAAACCAAGGAAGAGCTCACATCTAAGAACACTTCCAATCCCCATCCGAACCAGGACTATTCTGAAATCCCAACACAATCCGATCAAAACATGGAAAATGGCTTACATTCCTTTCTAGAATCACAGGCAGAGTCAAAAGAAACTAAGGAAGATCTGGCAAAGACAACCAATGCACTTCAAACCAAAGCATCTAGAAGCACATTAATCACATCTTCCAAAAGTCGTTCATCATTTGAACCAGAAAAGTGGTACTCACAACAGGAAGAATCCATGGAAGACTTATCCAAAGTTTTTCAGAAACTAAACATCAAATATTCAGACAAGGAAAATCAAAAGAGCTTCACAACATTGATCGGCGATAACAAAGGGTCGTCAATGCACTTACTCTCCGGTGAAGCCAAAAGCGAAAGCCCAATCCACATCCACCGTCAATATAAGAGCGATCCAGATCAAAGCCCTAAAAGTTCCACAGAAATCGAAGGAAATTTCAATAACGAAACACCGCAAGATTCAAGAACAGAGGAGAATCCATCACCCCTGGAATTATATATCAACGTCAATGTACAAGGTATCAACAACTCAATCATGTGCAATACCTCATTTATAGAGAATGATCCTGGAGTCAAGTTGAAATTCCCTCGAGAACCAACAAAATCTGAAGATGAATTAGAGGCTCATCACGCTAGAAAAGCAGAATACAGTCCGAGACCGGCGGAGAAGCTTACGTATGAACCCAGAGTAAGACGAAGATGCCTCAGAGGGATGTTAATGGAGTCGAGCGATTCTGA
GGTCGAGAATCCAGAAAAGTCTCGACGCCATGGCTGCCGGTATAGTCGTAATAGCAAAGGAAAAGAGGTCGAAACTCTGTAA

Coding sequence (CDS)

ATGTCACACTCACAATTGCGCATTCTACTTCCTTGGCAATCGTTAAAAGCTTCTCCTCATCCTGCAAATGAGTCGTCAGGATGGAGTTTTGAGCCTACAGATGAAACGGAAACTTCTGCTTCTGCAGCTGATACCGTGCCAAATATTCGACATCAACCGGTACAGTCTCTTGAGATAAATCCAGAACAACCTCCTTTAGCACCAGCTCAGGCACTTGAAAAAAGTGAAACTATGCCACCTTCAAAATCTCATAAGGCAAGCAAAGTTCAATCTCAGCCACCATCAAATTCTCGAGCCAAAAATCGGTCCCGAACAGCTTCCAAGCCTCCATCACAATCTAAAACAATCCCTCAATCTTCAGTTGCTTCCAAGTCTCCTTCAACATCAAGCAAAGGCTCCCCATCTCAGGATACTTCAAAGCCATCATCACCAGCAGGCAAAGCCTCTCCATCTCAGGATGCTTCTTCAAAGCCTTCATCACCTGCAGCAGTTGCATCTACTGCTCCTCGAACCCGGATTGCTTTGAAGCCATCATCTCCATCGTCTCAAACATCCAGTAAAAGCCATCCAAATAAAAAACCAACATCACAATCAAGAATTAAAGCTGATTCTCAGCCTTCATCATCTTCAAGGTCAGCATTTCCATCTCAAGATTTTTCTATGCCACTGCGGTCGCCATCTCAAGAAAATTCTCGACAACAACCATTGGAAAAAACCTCTCGGGTTCAGTCTCCATCTCATTTGTCCAGTAAACCTACTGCACAATTAACATCACAACAACCTATTAAGTCTCCAGCAGCCATTGGAACTCAAATCCATCCAAATTCAAAACCATCATCCCAATCAAGATTTAAAGCTGATTCTCAGCCTTCATCACCTTCAAGGTCAGCATTTCCATCTCAAGATTCTTCTATGCCACCACGGTCGCCATCTCAAGAAATTTCTCGACAACAAGCATCGGAAAAAACCTCTCGAGTTCAGTCTCCATCTCATTTGTCCGGTAAACCTACTGCACAATCAACAACACAACAACCTATTGAATCTCCTACAGCCATTGGAGACCAAACAACAGATGGAATCATTTCTCATCCCGCAAATCAATCCCCAAAAGCAAGACCTACAAACAGGGAAACTCAGTTGCAAACCAAATCAACGCAGCCTCTGAAACCAGACAGGAAACCAGTGGAATCGAAAGCATCAAAAAATCAGCCTGAAACCAAGGAAGAGCTCACATCTAAGAACACTTCCAATCCCCATCCGAACCAGGACTATTCTGAAATCCCAACACAATCCGATCAAAACATGGAAAATGGCTTACATTCCTTTCTAGAATCACAGGCAGAGTCAAAAGAAACTAAGGAAGATCTGGCAAAGACAACCAATGCACTTCAAACCAAAGCATCTAGAAGCACATTAATCACATCTTCCAAAAGTCGTTCATCATTTGAACCAGAAAAGTGGTACTCACAACAGGAAGAATCCATGGAAGACTTATCCAAAGTTTTTCAGAAACTAAACATCAAATATTCAGACAAGGAAAATCAAAAGAGCTTCACAACATTGATCGGCGATAACAAAGGGTCGTCAATGCACTTACTCTCCGGTGAAGCCAAAAGCGAAAGCCCAATCCACATCCACCGTCAATATAAGAGCGATCCAGATCAAAGCCCTAAAAGTTCCACAGAAATCGAAGGAAATTTCAATAACGAAACACCGCAAGATTCAAGAACAGAGGAGAATCCATCACCCCTGGAATTATATATCAACGTCAATGTACAAGGTATCAACAACTCAATCATGTGCAATACCTCATTTATAGAGAATGATCCTGGAGTCAAGTTGAAATTCCCTCGAGAACCAACAAAATCTGAAGATGAATTAGAGGCTCATCACGCTAGAAAAGCAGAATACAGTCCGAGACCGGCGGAGAAGCTTACGTATGAACCCAGAGTAAGACGAAGATGCCTCAGAGGGATGTTAATGGAGTCGAGCGATTCTGA
GGTCGAGAATCCAGAAAAGTCTCGACGCCATGGCTGCCGGTATAGTCGTAATAGCAAAGGAAAAGAGGTCGAAACTCTGTAA

Protein sequence

MSHSQLRILLPWQSLKASPHPANESSGWSFEPTDETETSASAADTVPNIRHQPVQSLEINPEQPPLAPAQALEKSETMPPSKSHKASKVQSQPPSNSRAKNRSRTASKPPSQSKTIPQSSVASKSPSTSSKGSPSQDTSKPSSPAGKASPSQDASSKPSSPAAVASTAPRTRIALKPSSPSSQTSSKSHPNKKPTSQSRIKADSQPSSSSRSAFPSQDFSMPLRSPSQENSRQQPLEKTSRVQSPSHLSSKPTAQLTSQQPIKSPAAIGTQIHPNSKPSSQSRFKADSQPSSPSRSAFPSQDSSMPPRSPSQEISRQQASEKTSRVQSPSHLSGKPTAQSTTQQPIESPTAIGDQTTDGIISHPANQSPKARPTNRETQLQTKSTQPLKPDRKPVESKASKNQPETKEELTSKNTSNPHPNQDYSEIPTQSDQNMENGLHSFLESQAESKETKEDLAKTTNALQTKASRSTLITSSKSRSSFEPEKWYSQQEESMEDLSKVFQKLNIKYSDKENQKSFTTLIGDNKGSSMHLLSGEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPSPLELYINVNVQGINNSIMCNTSFIENDPGVKLKFPREPTKSEDELEAHHARKAEYSPRPAEKLTYEPRVRRRCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKGKEVETL
Homology
BLAST of HG10004848 vs. NCBI nr
Match: XP_038886773.1 (flocculation protein FLO11 [Benincasa hispida])

HSP 1 Score: 924.5 bits (2388), Expect = 5.5e-265
Identity = 559/700 (79.86%), Positives = 596/700 (85.14%), Query Frame = 0

Query: 1   MSHSQLRILLPWQSLKASPHPANESSGWSFEPTDETETSASAADTVPNIRHQPVQSLEIN 60
           MS SQLRILLPWQSLKASP P NES G SFEPTDE ETSASAADT  NIRHQP QS EI 
Sbjct: 1   MSLSQLRILLPWQSLKASPLPENESPGRSFEPTDEVETSASAADTTQNIRHQPAQSPEIK 60

Query: 61  PEQPPLAPAQALEKSETMPPSKSHKASKVQSQPPSNSRAKNRSRTASKPPSQSKTIPQSS 120
           PEQPPLA A A E+SETMPPSKSHKA KV SQPP NSRAKNRSRTASKP   SK IPQSS
Sbjct: 61  PEQPPLATALAPERSETMPPSKSHKAGKVHSQPPPNSRAKNRSRTASKPSPPSKAIPQSS 120

Query: 121 VAS-KSPSTSSKGSPSQDTSKPSSPAGKASPSQDASSKPSSPAAVASTAPRTRIALKP-- 180
           VAS KSPSTS K S SQDTSKPSSPAGK+S SQDASSKPSSPAAVA+TAPR+RI  KP  
Sbjct: 121 VASNKSPSTSGKDSLSQDTSKPSSPAGKSSRSQDASSKPSSPAAVAATAPRSRITSKPSS 180

Query: 181 ----SSPSSQTSSKSHPNKKPTSQSRIKADSQPSSSSRSAFPSQDFSMPLRSPSQENSRQ 240
               SSPSSQTSSK+HP  KP+SQSR KADSQP SSSRSAFPSQD S+P R PS ENSR 
Sbjct: 181 PSSSSSPSSQTSSKNHP--KPSSQSRFKADSQP-SSSRSAFPSQDSSLPPRLPSLENSR- 240

Query: 241 QPLEKTSRVQSPSHLSSKPTAQLTSQQPIKSPAAIGTQIHPNSKPSSQSRFKADSQPSSP 300
           QP E+TSRVQSPSH SSKPTAQ TSQQP +SPA IG Q HPNSKPSSQSRFKADSQPSS 
Sbjct: 241 QPSERTSRVQSPSHFSSKPTAQSTSQQPNESPAVIGIQSHPNSKPSSQSRFKADSQPSSS 300

Query: 301 SRSAFPSQDSSMPPRSPSQEISRQQASEKTSRVQSPSHLSGKPTAQSTTQQPIESPTAIG 360
           SRSAF SQDSSM P SPS+E SRQQ  EKTSRVQSPSHLS KP AQST+QQPIESP AIG
Sbjct: 301 SRSAFSSQDSSMLPWSPSRENSRQQPLEKTSRVQSPSHLSSKP-AQSTSQQPIESPAAIG 360

Query: 361 DQTTDGIISHPANQSPKARPTNRETQLQTKSTQPLKPDRKPVESKASKNQPETKEELTSK 420
           +QTT+  ISHP NQSPKARPT+RE+Q+QTKS Q LKP+ K VE KASKN+ ETKEEL+SK
Sbjct: 361 NQTTNETISHPTNQSPKARPTSRESQMQTKSKQSLKPNTKQVELKASKNKSETKEELSSK 420

Query: 421 NTSNPHPNQDYSEIPTQSDQNMENGLHSFLESQAESKETKEDLAKTTNALQTKASRSTLI 480
           NTSNPH NQD  E PT+SDQ +EN L   LESQAES+ET+E+LAKTTNALQTKASRSTLI
Sbjct: 421 NTSNPHSNQDSFENPTKSDQTIENSLDFSLESQAESRETEEELAKTTNALQTKASRSTLI 480

Query: 481 TSSKSRSSFEPEKWYSQQEESMEDLSKVFQKLNIKYSDKENQKSFTTLIGDNKGSSMHLL 540
           TSSK   SFEPE    QQEESM+D SK FQKLNIKYSD+EN KSFTTLIG NKGSSMHL+
Sbjct: 481 TSSKIHPSFEPE----QQEESMDDSSKAFQKLNIKYSDEENPKSFTTLIGQNKGSSMHLV 540

Query: 541 SGEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPSPLELYINVNVQ 600
           SGEAKSES IHIHRQYKS+PDQSPK STEIEGNF NET +DSRTEENP  +E+YIN+NVQ
Sbjct: 541 SGEAKSESSIHIHRQYKSNPDQSPKCSTEIEGNFINETQEDSRTEENPPSVEIYINLNVQ 600

Query: 601 GINNSIMCNTSFIENDPGVKLKFPREPTKSEDELEAHHARKAEYSPRPAEKLTYEPRVRR 660
           GINNSIMCNTSF ENDPG+KLK  RE  KSEDELE+HHARKAEYS +PAEK+TYEPRVRR
Sbjct: 601 GINNSIMCNTSFTENDPGIKLKLSRETIKSEDELESHHARKAEYSAKPAEKVTYEPRVRR 660

Query: 661 RCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKGKEVETL 694
           RCLRGMLMESSDSEVENP KSRRHGCRY  +SKGKEVETL
Sbjct: 661 RCLRGMLMESSDSEVENPGKSRRHGCRYGLSSKGKEVETL 691
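The two HSP summary lines that open each hit (score and E-value, then identity and positives) can be parsed and the percentages recomputed from the raw counts. This is a minimal sketch under assumptions: `parse_hsp` and its dictionary keys are hypothetical names, and the patterns only target the line layout seen in this report, not BLAST output in general.

```python
import re

# "HSP 1 Score: 924.5 bits (2388), Expect = 5.5e-265"
HSP_SCORE = re.compile(
    r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\deE.+-]+)"
)
# "Identity = 559/700 (79.86%), Positives = 596/700 (85.14%), ..."
# The optional "i" tolerates a "Postives" misspelling in case the exported text carries one.
HSP_IDENT = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*Posi?tives\s*=\s*(\d+)/(\d+)"
)

def parse_hsp(score_line, identity_line):
    """Extract HSP statistics and recompute the percentages from the counts."""
    s = HSP_SCORE.search(score_line)
    i = HSP_IDENT.search(identity_line)
    if s is None or i is None:
        raise ValueError("lines do not look like an HSP summary")
    identical, aln_len = int(i.group(1)), int(i.group(2))
    positives = int(i.group(4))
    return {
        "bit_score": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "evalue": float(s.group(3)),
        "alignment_length": aln_len,
        # Percent identity = identical positions / alignment length (gaps included).
        "identity_pct": round(100 * identical / aln_len, 2),
        "positives_pct": round(100 * positives / aln_len, 2),
    }

hsp = parse_hsp(
    "HSP 1 Score: 924.5 bits (2388), Expect = 5.5e-265",
    "Identity = 559/700 (79.86%), Positives = 596/700 (85.14%), Query Frame = 0",
)
print(hsp["identity_pct"], hsp["positives_pct"])  # 79.86 85.14
```

For the XP_038886773.1 hit, 559 identical positions over an alignment length of 700 gives 79.86%, matching the reported value. The denominator is the alignment length, which counts gap columns, so it can exceed the length of either sequence.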

BLAST of HG10004848 vs. NCBI nr
Match: KAA0065223.1 (flocculation protein FLO11 [Cucumis melo var. makuwa])

HSP 1 Score: 828.2 bits (2138), Expect = 5.4e-236
Identity = 528/793 (66.58%), Positives = 579/793 (73.01%), Query Frame = 0

Query: 1   MSHSQLRILLPWQSLKASPHPANESSGWSFEPTDETETSASAADTVPNIRHQPVQSLEIN 60
           MS SQLRILLPWQSLKASP PANES   SF PTDE+E+SAS ADT PNIRHQP QS EI 
Sbjct: 1   MSLSQLRILLPWQSLKASPRPANESPEGSFGPTDESESSASQADTAPNIRHQPDQSPEIK 60

Query: 61  PEQPPLAPAQALEKSETMPPSKSHKASKVQSQPPSNSRAKNRSRTASKPPSQSKTIPQSS 120
           PE+PPLA AQA E+SETMPPSKSHK  K+ SQ  +NSRAKNRSRTASKP S    IPQS 
Sbjct: 61  PEEPPLATAQAAERSETMPPSKSHKEGKIHSQLSTNSRAKNRSRTASKPSSPLNAIPQSP 120

Query: 121 VAS-KSPSTSSKGSPSQDTSKPSSPAGKA-SPSQDASSKPSSPAAVASTAPRTRIALKPS 180
           +AS K PSTS KGS SQD+SKPSSPAGK  SPSQDASSKPSSPA VA+TAP  RIA K S
Sbjct: 121 LASNKYPSTSGKGSKSQDSSKPSSPAGKVFSPSQDASSKPSSPATVAATAP--RIASKAS 180

Query: 181 SPSSQTSSKSHPNKKPT------------------------------------------- 240
           S SSQ S+K HP+ KPT                                           
Sbjct: 181 SSSSQASNKKHPSSKPTSKLRFKADSQPSSPSRSAFPSQDPSMPPQSLSQEKSRQQASEK 240

Query: 241 -----------------------------------------SQSRIKADSQPSSSSRSAF 300
                                                    SQSR K DSQPS SSRS F
Sbjct: 241 SSRVQSPSHFSRKPTTQSTSKQPVESPATIGIQSHPNSKAPSQSRFKTDSQPSPSSRSTF 300

Query: 301 PSQDFSMPLRSPSQENSRQQPLEKTSRVQSPSHLSSKPTAQLTSQQPIKSPAAIGTQIHP 360
           PSQDFSMP RSPS ENSRQQP +KTS VQSPSH S KPTAQ TS+QPI+SPA IG Q HP
Sbjct: 301 PSQDFSMPPRSPSHENSRQQPSDKTSGVQSPSHSSRKPTAQSTSKQPIESPATIGIQHHP 360

Query: 361 NSKPSSQSRFKADSQPSSPSRSAFPSQDSSMPPRSPSQEISRQQASEKTSRVQSPSHLSG 420
           N KPSSQSRFKA+S+PSS S+S FPSQDSSMPPRSPSQE S Q  SEKTSRVQSPS+LS 
Sbjct: 361 NLKPSSQSRFKAESRPSSSSKSKFPSQDSSMPPRSPSQENSLQPPSEKTSRVQSPSNLSR 420

Query: 421 KPTAQSTTQQPIESPTAIGDQTTDGIISHPANQSPKARPTNRETQLQTKSTQPLKPDRKP 480
           KPTA ST+QQPIES  +IGDQTTDGI+S PA  SPKA PT+ E Q+Q KS +  +P+ KP
Sbjct: 421 KPTAPSTSQQPIESTASIGDQTTDGILSDPATPSPKAIPTSGEIQIQAKSKKSPEPNVKP 480

Query: 481 VESKASKNQPETKEELT----------SKNTSNPHPNQDYSEIPTQSDQNMENGLHSFLE 540
           VE KASKNQ +TKEELT          SKNTSNPH ++D SE PTQSD+ +E GL S LE
Sbjct: 481 VELKASKNQNDTKEELTSKNETKEELASKNTSNPHSDEDSSENPTQSDETVERGLDSSLE 540

Query: 541 SQAESKETKEDLAKTTNALQTKASRSTLITSSKSRSSFEPEKWYSQQEESMEDLSKVFQK 600
           SQ ESKETKED  KTTNALQ KASRSTLITSSKSRSSFEPEK  +QQ+ESMEDLSK F K
Sbjct: 541 SQTESKETKEDGGKTTNALQAKASRSTLITSSKSRSSFEPEK-NTQQDESMEDLSKAFNK 600

Query: 601 LNIKYSDKENQKSFTTLIGDNKGSSMHLLSGEAKSESPIHIHRQYKSDPDQSPKSSTEIE 660
           LNIKYSD+EN KSFTT+IGDNKGSS+HLLSGEAKSES IH++ +YKS+PDQSPKSST I+
Sbjct: 601 LNIKYSDEENPKSFTTMIGDNKGSSVHLLSGEAKSESSIHVNHRYKSNPDQSPKSSTNIK 660

Query: 661 GNFNNETPQDSRTEENPS--PLELYINVNVQGINNSIMCNTSFIENDPGVKLKFP--REP 694
            N NNETPQDS TEENP   PLELYIN NVQGINNSIM NTSF EN+PG+KLKFP   EP
Sbjct: 661 ENSNNETPQDSTTEENPDPPPLELYINHNVQGINNSIMFNTSFTENNPGIKLKFPGDGEP 720

BLAST of HG10004848 vs. NCBI nr
Match: XP_011649631.1 (flocculation protein FLO11 [Cucumis sativus] >KAE8652109.1 hypothetical protein Csa_022237 [Cucumis sativus])

HSP 1 Score: 801.2 bits (2068), Expect = 7.0e-228
Identity = 520/792 (65.66%), Positives = 570/792 (71.97%), Query Frame = 0

Query: 1   MSHSQLRILLPWQSLKASPHPANESSGWSFEPTDETETSASAADTVPNIRHQPVQSLEIN 60
           MS SQLRILLPWQSLKAS   ANES   SF PTDE+E SASAADTVPNIRHQP QS E  
Sbjct: 1   MSLSQLRILLPWQSLKASSRSANESPERSFGPTDESEGSASAADTVPNIRHQPDQSPETK 60

Query: 61  PEQPPLAPAQALEKSETMPPSKSHKASKVQSQPPSNSRAKNRSRTASKPPSQSKTIPQSS 120
           PE+PPLA AQA E+SETMPPSKSHKA KV SQ PS +RAKNRSR ASKP S SK IPQ S
Sbjct: 61  PEEPPLATAQAAERSETMPPSKSHKAGKVHSQ-PSTTRAKNRSRAASKPSSSSKAIPQFS 120

Query: 121 VASKSPSTSSKGSPSQDTSKPSSPAGKA-SPSQDASSKPSSPAAVASTAPRTRIALKPSS 180
           VAS  PSTS KGS SQD+SKPSSPAGK  SPS+DASSKPSSPA VA+T P  RIA K SS
Sbjct: 121 VASDKPSTSGKGSQSQDSSKPSSPAGKVFSPSKDASSKPSSPATVAATPP--RIASKASS 180

Query: 181 PSSQTSSKSHPNKKPTSQSRIKADSQPSSSSRSA-------------------------- 240
            SSQTS+K HPN KPTSQ ++KADSQPSSSSRSA                          
Sbjct: 181 SSSQTSNKKHPNSKPTSQLKVKADSQPSSSSRSALPSRDPSLPAQSLSQEDSRQQSSEKT 240

Query: 241 ----------------------------------------------------------FP 300
                                                                     FP
Sbjct: 241 SRVQSPSNLSRKPTTQSTSKQPVESPATIRIQSHPNSKQPSQSRFKADSHPSPSSRSTFP 300

Query: 301 SQDFSMPLRSPSQENSRQQPLEKTSRVQSPSHLSSKPTAQLTSQQPIKSPAAIGTQIHPN 360
           SQDFS P RSPS E SRQQP  KTSRVQSPSH S K TAQ T+QQP +SPA IG Q HPN
Sbjct: 301 SQDFSTPPRSPSHEISRQQPSVKTSRVQSPSHSSRKSTAQSTTQQPTESPATIGIQHHPN 360

Query: 361 SKPS-SQSRFKADSQPSSPSRSAFPSQDSSMPPRSPSQEISRQQASEKTSRVQSPSHLSG 420
            KPS SQSRFKADSQPSS S+  FPSQDSSMPPRSPSQE S Q  SEKT RVQSPSHLS 
Sbjct: 361 LKPSLSQSRFKADSQPSSSSKLKFPSQDSSMPPRSPSQENSLQPPSEKTFRVQSPSHLSR 420

Query: 421 KPTAQSTTQQPIESPTAIGDQTTDGIISHPANQSPKARPTNRETQLQTKSTQPLKPDRKP 480
           KPTAQST+QQPIE   +IGDQTTD I+S PAN SPKA PT+ E+Q+Q +S +  KP+ KP
Sbjct: 421 KPTAQSTSQQPIEPTASIGDQTTDRILSDPANPSPKAIPTSGESQIQAESKKSPKPNVKP 480

Query: 481 VESKASKNQPETKEELT----------SKNTSNPHPNQDYSEIPTQSDQNMENGLHSFLE 540
           VE + SK Q ETKEELT          SKNTSNPH  +D SE PTQSDQ +E GL S LE
Sbjct: 481 VELEESKTQHETKEELTSKNENKEELASKNTSNPHSYKDSSENPTQSDQAIEKGLDSSLE 540

Query: 541 SQAESKETKEDLAKTTNALQTKASRSTLITSSKSRSSFEPEKWYSQQEESMEDLSKVFQK 600
           SQ ESKETKED AKTTNA QTKASRSTLITSSKSRSSFEPE   +QQ+ESMEDLSK F K
Sbjct: 541 SQTESKETKEDGAKTTNAFQTKASRSTLITSSKSRSSFEPEN-NTQQDESMEDLSKAFNK 600

Query: 601 LNIKYSDKENQKSFTTLIGDNKGSSMHLLSGEAKSESPIHIHRQYKSDPDQSPKSSTEIE 660
           LNIKYSD+EN KS TT+IGDNKG+SMHLLS EAKSES IH++  YKS+PDQSP+SST+I+
Sbjct: 601 LNIKYSDEENPKSLTTMIGDNKGTSMHLLSDEAKSESSIHVNHHYKSNPDQSPESSTDIK 660

Query: 661 GNFNNETPQDSRTEENPS--PLELYINVNVQGINNSIMCNTSFIENDPGVKLKFPREPTK 694
            N NNET +DS TEENP   PLELYIN+NVQGINNSI  NTSF EN+PG+KLKFP EPT 
Sbjct: 661 ENSNNETAKDSTTEENPDPPPLELYINLNVQGINNSITFNTSFTENNPGIKLKFPGEPTN 720

BLAST of HG10004848 vs. NCBI nr
Match: XP_022951875.1 (cell wall protein RBR3-like [Cucurbita moschata])

HSP 1 Score: 610.5 bits (1573), Expect = 1.8e-170
Identity = 407/694 (58.65%), Positives = 480/694 (69.16%), Query Frame = 0

Query: 1   MSHSQLRILLPWQSLKASPHPANESSGWSFEPTDETETSASAADTVPNIRHQPVQSLEIN 60
           M++ Q R  LPWQS+KAS    NESS  S EPTDE ETS SAADTVP ++H         
Sbjct: 1   MTYRQFRFRLPWQSIKASSRLENESSTRSSEPTDEAETSNSAADTVPYMQHL-------- 60

Query: 61  PEQPPLAPAQALEKSETMPPSKSHKASKVQSQPPSNSRAKNRSRTASKPPSQSKTIPQSS 120
           PE  PL  AQA E+SETM PSKSHK +KV SQP S+SRAK ++RTA+KPPS SK  PQSS
Sbjct: 61  PELSPLESAQAPERSETMLPSKSHKKAKVHSQPSSHSRAKKQTRTATKPPSASKVTPQSS 120

Query: 121 VAS-KSPSTSSKGSPSQDTSKPSSPAGKASPSQDASSKPSSPAAVASTAPRTRIALKPSS 180
           V+S KSP+TS+K SPS D SKPSS AGK SPS D +SK SSPA     +P          
Sbjct: 121 VSSNKSPTTSAKASPSHDASKPSSSAGKVSPSHD-TSKLSSPAGKGKVSP---------- 180

Query: 181 PSSQTSSKSHPNKKPTSQSRIKADSQPSSSSRSAFPSQDFSMPLRSPSQENSRQQPLEKT 240
                   SH              S+PSS +  AFPS+D S P                 
Sbjct: 181 --------SHDT------------SKPSSPAGKAFPSRDASQP----------------- 240

Query: 241 SRVQSPSHLSSKPTAQLTSQQPIKSPAAIGTQIHPNSKPSSQSRFKADSQPSSPSRSAFP 300
                 S  ++ P +Q+ S+ P  SP+   ++ HP SKP+SQSR KADSQPSSPSR AF 
Sbjct: 241 -----SSSAAAAPRSQIRSKPP--SPSQTSSKNHPQSKPTSQSRLKADSQPSSPSRPAFS 300

Query: 301 SQDSSMPPRSPSQEISRQQASEKTSRVQSPSHLSGKPTAQSTTQQPIESPTAIGDQTTDG 360
            Q SS+ PRSPS E SRQQ S+K SRVQSPSHLS KPTAQST+QQ  ESP  IGDQTT  
Sbjct: 301 PQASSI-PRSPSHENSRQQPSKKASRVQSPSHLSSKPTAQSTSQQLTESPATIGDQTTKR 360

Query: 361 IISHPANQSPKARPTNRETQLQTKSTQPLKPDRKPVESKASKNQPETKEELTSKNTSNPH 420
           ++SHPA+QSP+AR   RE Q+QTKS Q  KPD KPVE KASK+QPET EE  SKNTS PH
Sbjct: 361 VVSHPADQSPRARCKRRENQVQTKSKQSPKPDLKPVEFKASKHQPETMEEFISKNTSYPH 420

Query: 421 PNQDYSEIPTQSDQNMENGLHSFLESQAESKETK------EDLAKTTNALQTKASRSTLI 480
            +QD+SEIP   D+ +ENG  + LESQ ES+E+K      EDL KTTNALQ  AS+S LI
Sbjct: 421 SDQDFSEIPIIIDETIENGPETSLESQTESQESKEIKSYEEDLEKTTNALQINASKSKLI 480

Query: 481 TSSKSRSSFEPEKWYSQQEESMEDLSKVFQKLNIKYSDKENQKSFTTLIGDNKGSSMHLL 540
           TS++  S FEPE   SQQE +MEDLSK FQ LNIKY + EN KSFTTL GDNKG+SMHLL
Sbjct: 481 TSAEITSPFEPENSDSQQEGTMEDLSKAFQTLNIKYPE-ENPKSFTTLTGDNKGASMHLL 540

Query: 541 SGEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPSPLELYINVNVQ 600
           SGEA  ES IHIHRQYKSDPD+ P+SST+IEGN N ETPQDS+TEE+P PLELYIN+NVQ
Sbjct: 541 SGEATKESSIHIHRQYKSDPDKGPESSTDIEGNSNEETPQDSKTEEDP-PLELYININVQ 600

Query: 601 GINNSIMCNTSFIENDPGVKLKFPREPTKSEDELEAHHARKAEYSPRPAEKLTYEPRVRR 660
           GINNS++ N+SF EN+PG+KLKF  + TKSED+  +  A+KA+Y+ +  E  TYEP VRR
Sbjct: 601 GINNSLLSNSSFTENNPGIKLKFVPQQTKSEDKSHSLQAQKAKYTAKHTENHTYEPTVRR 628

Query: 661 RCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKG 688
           RCL G+LMESSDS+ +N EK RRHGCRY  + +G
Sbjct: 661 RCLGGLLMESSDSDGDNSEKPRRHGCRYRGSFEG 628

BLAST of HG10004848 vs. NCBI nr
Match: XP_023002262.1 (cell wall protein RBR3-like [Cucurbita maxima])

HSP 1 Score: 605.9 bits (1561), Expect = 4.3e-169
Identity = 405/695 (58.27%), Positives = 478/695 (68.78%), Query Frame = 0

Query: 1   MSHSQLRILLPWQSLKASPHPANESSGWSFEPTDETETSASAADTVPNIRHQPVQSLEIN 60
           M++ Q R  LPWQS+KAS  P NESS  S EPTDE ETS SAADTVP ++H P+QS E  
Sbjct: 1   MTYRQFRFRLPWQSIKASSRPENESSTRSSEPTDEAETSNSAADTVPYMQHLPLQSHETK 60

Query: 61  PEQPPLAPAQALEKSETMPPSKSHKASKVQSQPPSNSRAKNRSRTASKPPSQSKTIPQSS 120
           PE  PL  AQA E+SETM PSKSHK +KV SQP S+SRAK ++RTA+KPPS SK  PQSS
Sbjct: 61  PELSPLESAQAPERSETMLPSKSHKKAKVHSQPSSHSRAKKQTRTATKPPSASKVTPQSS 120

Query: 121 VAS-KSPSTSSKGSPSQDTSKPSSPAGKASPSQDASSKPSSPAAVASTAPRTRIALKPSS 180
           V+S KSP+TS+K SPS D SKPSS AGK SPS D +SK SSPA                S
Sbjct: 121 VSSNKSPTTSAKASPSHDASKPSSSAGKVSPSHD-TSKLSSPAGKGKV-----------S 180

Query: 181 PSSQTSSKSHPNKKPTSQSRIKADSQPSSSSRSAFPSQDFSMPLRSPSQENSRQQPLEKT 240
           PS  T                   S PS  +  AFPS+D S P                 
Sbjct: 181 PSRDT-------------------SMPSPPAGKAFPSRDASQP--------------SSA 240

Query: 241 SRVQSPSHLSSKPTAQLTSQQPIKSPAAIGTQIHPNSKPSSQSRFKADSQPSSPSRSAFP 300
           +     SH+ SKP           SP+   ++ H +SK +SQSR KADSQPSSPSR AF 
Sbjct: 241 AAAAPRSHIRSKP----------PSPSQTSSKNHLHSKQTSQSRLKADSQPSSPSRPAFS 300

Query: 301 SQDSSMPPRSPSQEISRQQASEKTSRVQSPSHLSGKPTAQSTTQQPIESPTAIGDQTTDG 360
            Q SS+ PRSPS E SRQQ S+K SRVQSPSHLS K TAQST+QQ  ESP  IGDQTT  
Sbjct: 301 PQASSI-PRSPSHENSRQQPSKKASRVQSPSHLSSKATAQSTSQQLTESPATIGDQTTKR 360

Query: 361 IISHPANQSPKARPTNRETQLQTKSTQPLKPDRKPVESKASKNQPETKEELTSKNTSNPH 420
           ++SHPA+QSP+AR  ++E Q+QTKS Q  KPD KPVE KASK+QPET EE  SKNTS P 
Sbjct: 361 VVSHPADQSPRARCKSKENQVQTKSKQSPKPDLKPVEFKASKHQPETMEEFISKNTSYPL 420

Query: 421 PNQDYSEIPTQSDQNMENGLHSFLESQAESKETK------EDLAKTTNALQTKASRSTLI 480
            N+D+SEIP   D+ +ENG    LESQ ES+E+K      EDL KTTNALQ  AS+S LI
Sbjct: 421 SNEDFSEIPIIIDETIENGPEPSLESQTESQESKEIKSYEEDLEKTTNALQINASKSKLI 480

Query: 481 TSSKSRSSFEPEKWYSQQEESMEDLSKVFQKLNIKYSDKENQKSFTTLIGDNKGSSMHLL 540
           TS++  S FEPE   SQQE +MEDL K FQ LNIKY + EN KSFTTL GDNKG+SMHL+
Sbjct: 481 TSAEITSPFEPENSDSQQEGTMEDLPKAFQTLNIKYPE-ENPKSFTTLTGDNKGASMHLI 540

Query: 541 SGEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPSPLELYINVNVQ 600
           SGEA  ES IHIHRQYKSDPD+ P+SST+IEGN N ETPQDS+TEE+P PLELYIN+NVQ
Sbjct: 541 SGEATKESSIHIHRQYKSDPDKVPESSTDIEGNSNEETPQDSKTEEDP-PLELYININVQ 600

Query: 601 GINNSIMCNTSFIENDPGVKLKFPREPTKSEDELEAHHARKAEYSPRPAEKLTYEPRVRR 660
           GINNS++ N+SF EN+PG+KLKF  + TKSE++  +  A+KA+Y+ +  E  TYEP VRR
Sbjct: 601 GINNSLLSNSSFTENNPGIKLKFVPQQTKSENKPHSLQAQKAKYTAKHTENHTYEPTVRR 637

Query: 661 RCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKGK 689
           RCL G+LMESSDS+ +N EK RRHGCRY  + +GK
Sbjct: 661 RCLGGLLMESSDSDGDNSEKPRRHGCRYRGSFEGK 637

BLAST of HG10004848 vs. ExPASy TrEMBL
Match: A0A5A7VAN0 (Flocculation protein FLO11 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold82G005610 PE=4 SV=1)

HSP 1 Score: 828.2 bits (2138), Expect = 2.6e-236
Identity = 528/793 (66.58%), Positives = 579/793 (73.01%), Query Frame = 0

Query: 1   MSHSQLRILLPWQSLKASPHPANESSGWSFEPTDETETSASAADTVPNIRHQPVQSLEIN 60
           MS SQLRILLPWQSLKASP PANES   SF PTDE+E+SAS ADT PNIRHQP QS EI 
Sbjct: 1   MSLSQLRILLPWQSLKASPRPANESPEGSFGPTDESESSASQADTAPNIRHQPDQSPEIK 60

Query: 61  PEQPPLAPAQALEKSETMPPSKSHKASKVQSQPPSNSRAKNRSRTASKPPSQSKTIPQSS 120
           PE+PPLA AQA E+SETMPPSKSHK  K+ SQ  +NSRAKNRSRTASKP S    IPQS 
Sbjct: 61  PEEPPLATAQAAERSETMPPSKSHKEGKIHSQLSTNSRAKNRSRTASKPSSPLNAIPQSP 120

Query: 121 VAS-KSPSTSSKGSPSQDTSKPSSPAGKA-SPSQDASSKPSSPAAVASTAPRTRIALKPS 180
           +AS K PSTS KGS SQD+SKPSSPAGK  SPSQDASSKPSSPA VA+TAP  RIA K S
Sbjct: 121 LASNKYPSTSGKGSKSQDSSKPSSPAGKVFSPSQDASSKPSSPATVAATAP--RIASKAS 180

Query: 181 SPSSQTSSKSHPNKKPT------------------------------------------- 240
           S SSQ S+K HP+ KPT                                           
Sbjct: 181 SSSSQASNKKHPSSKPTSKLRFKADSQPSSPSRSAFPSQDPSMPPQSLSQEKSRQQASEK 240

Query: 241 -----------------------------------------SQSRIKADSQPSSSSRSAF 300
                                                    SQSR K DSQPS SSRS F
Sbjct: 241 SSRVQSPSHFSRKPTTQSTSKQPVESPATIGIQSHPNSKAPSQSRFKTDSQPSPSSRSTF 300

Query: 301 PSQDFSMPLRSPSQENSRQQPLEKTSRVQSPSHLSSKPTAQLTSQQPIKSPAAIGTQIHP 360
           PSQDFSMP RSPS ENSRQQP +KTS VQSPSH S KPTAQ TS+QPI+SPA IG Q HP
Sbjct: 301 PSQDFSMPPRSPSHENSRQQPSDKTSGVQSPSHSSRKPTAQSTSKQPIESPATIGIQHHP 360

Query: 361 NSKPSSQSRFKADSQPSSPSRSAFPSQDSSMPPRSPSQEISRQQASEKTSRVQSPSHLSG 420
           N KPSSQSRFKA+S+PSS S+S FPSQDSSMPPRSPSQE S Q  SEKTSRVQSPS+LS 
Sbjct: 361 NLKPSSQSRFKAESRPSSSSKSKFPSQDSSMPPRSPSQENSLQPPSEKTSRVQSPSNLSR 420

Query: 421 KPTAQSTTQQPIESPTAIGDQTTDGIISHPANQSPKARPTNRETQLQTKSTQPLKPDRKP 480
           KPTA ST+QQPIES  +IGDQTTDGI+S PA  SPKA PT+ E Q+Q KS +  +P+ KP
Sbjct: 421 KPTAPSTSQQPIESTASIGDQTTDGILSDPATPSPKAIPTSGEIQIQAKSKKSPEPNVKP 480

Query: 481 VESKASKNQPETKEELT----------SKNTSNPHPNQDYSEIPTQSDQNMENGLHSFLE 540
           VE KASKNQ +TKEELT          SKNTSNPH ++D SE PTQSD+ +E GL S LE
Sbjct: 481 VELKASKNQNDTKEELTSKNETKEELASKNTSNPHSDEDSSENPTQSDETVERGLDSSLE 540

Query: 541 SQAESKETKEDLAKTTNALQTKASRSTLITSSKSRSSFEPEKWYSQQEESMEDLSKVFQK 600
           SQ ESKETKED  KTTNALQ KASRSTLITSSKSRSSFEPEK  +QQ+ESMEDLSK F K
Sbjct: 541 SQTESKETKEDGGKTTNALQAKASRSTLITSSKSRSSFEPEK-NTQQDESMEDLSKAFNK 600

Query: 601 LNIKYSDKENQKSFTTLIGDNKGSSMHLLSGEAKSESPIHIHRQYKSDPDQSPKSSTEIE 660
           LNIKYSD+EN KSFTT+IGDNKGSS+HLLSGEAKSES IH++ +YKS+PDQSPKSST I+
Sbjct: 601 LNIKYSDEENPKSFTTMIGDNKGSSVHLLSGEAKSESSIHVNHRYKSNPDQSPKSSTNIK 660

Query: 661 GNFNNETPQDSRTEENPS--PLELYINVNVQGINNSIMCNTSFIENDPGVKLKFP--REP 694
            N NNETPQDS TEENP   PLELYIN NVQGINNSIM NTSF EN+PG+KLKFP   EP
Sbjct: 661 ENSNNETPQDSTTEENPDPPPLELYINHNVQGINNSIMFNTSFTENNPGIKLKFPGDGEP 720

BLAST of HG10004848 vs. ExPASy TrEMBL
Match: A0A6J1GK50 (cell wall protein RBR3-like OS=Cucurbita moschata OX=3662 GN=LOC111454611 PE=4 SV=1)

HSP 1 Score: 610.5 bits (1573), Expect = 8.5e-171
Identity = 407/694 (58.65%), Positives = 480/694 (69.16%), Query Frame = 0

Query: 1   MSHSQLRILLPWQSLKASPHPANESSGWSFEPTDETETSASAADTVPNIRHQPVQSLEIN 60
           M++ Q R  LPWQS+KAS    NESS  S EPTDE ETS SAADTVP ++H         
Sbjct: 1   MTYRQFRFRLPWQSIKASSRLENESSTRSSEPTDEAETSNSAADTVPYMQHL-------- 60

Query: 61  PEQPPLAPAQALEKSETMPPSKSHKASKVQSQPPSNSRAKNRSRTASKPPSQSKTIPQSS 120
           PE  PL  AQA E+SETM PSKSHK +KV SQP S+SRAK ++RTA+KPPS SK  PQSS
Sbjct: 61  PELSPLESAQAPERSETMLPSKSHKKAKVHSQPSSHSRAKKQTRTATKPPSASKVTPQSS 120

Query: 121 VAS-KSPSTSSKGSPSQDTSKPSSPAGKASPSQDASSKPSSPAAVASTAPRTRIALKPSS 180
           V+S KSP+TS+K SPS D SKPSS AGK SPS D +SK SSPA     +P          
Sbjct: 121 VSSNKSPTTSAKASPSHDASKPSSSAGKVSPSHD-TSKLSSPAGKGKVSP---------- 180

Query: 181 PSSQTSSKSHPNKKPTSQSRIKADSQPSSSSRSAFPSQDFSMPLRSPSQENSRQQPLEKT 240
                   SH              S+PSS +  AFPS+D S P                 
Sbjct: 181 --------SHDT------------SKPSSPAGKAFPSRDASQP----------------- 240

Query: 241 SRVQSPSHLSSKPTAQLTSQQPIKSPAAIGTQIHPNSKPSSQSRFKADSQPSSPSRSAFP 300
                 S  ++ P +Q+ S+ P  SP+   ++ HP SKP+SQSR KADSQPSSPSR AF 
Sbjct: 241 -----SSSAAAAPRSQIRSKPP--SPSQTSSKNHPQSKPTSQSRLKADSQPSSPSRPAFS 300

Query: 301 SQDSSMPPRSPSQEISRQQASEKTSRVQSPSHLSGKPTAQSTTQQPIESPTAIGDQTTDG 360
            Q SS+ PRSPS E SRQQ S+K SRVQSPSHLS KPTAQST+QQ  ESP  IGDQTT  
Sbjct: 301 PQASSI-PRSPSHENSRQQPSKKASRVQSPSHLSSKPTAQSTSQQLTESPATIGDQTTKR 360

Query: 361 IISHPANQSPKARPTNRETQLQTKSTQPLKPDRKPVESKASKNQPETKEELTSKNTSNPH 420
           ++SHPA+QSP+AR   RE Q+QTKS Q  KPD KPVE KASK+QPET EE  SKNTS PH
Sbjct: 361 VVSHPADQSPRARCKRRENQVQTKSKQSPKPDLKPVEFKASKHQPETMEEFISKNTSYPH 420

Query: 421 PNQDYSEIPTQSDQNMENGLHSFLESQAESKETK------EDLAKTTNALQTKASRSTLI 480
            +QD+SEIP   D+ +ENG  + LESQ ES+E+K      EDL KTTNALQ  AS+S LI
Sbjct: 421 SDQDFSEIPIIIDETIENGPETSLESQTESQESKEIKSYEEDLEKTTNALQINASKSKLI 480

Query: 481 TSSKSRSSFEPEKWYSQQEESMEDLSKVFQKLNIKYSDKENQKSFTTLIGDNKGSSMHLL 540
           TS++  S FEPE   SQQE +MEDLSK FQ LNIKY + EN KSFTTL GDNKG+SMHLL
Sbjct: 481 TSAEITSPFEPENSDSQQEGTMEDLSKAFQTLNIKYPE-ENPKSFTTLTGDNKGASMHLL 540

Query: 541 SGEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPSPLELYINVNVQ 600
           SGEA  ES IHIHRQYKSDPD+ P+SST+IEGN N ETPQDS+TEE+P PLELYIN+NVQ
Sbjct: 541 SGEATKESSIHIHRQYKSDPDKGPESSTDIEGNSNEETPQDSKTEEDP-PLELYININVQ 600

Query: 601 GINNSIMCNTSFIENDPGVKLKFPREPTKSEDELEAHHARKAEYSPRPAEKLTYEPRVRR 660
           GINNS++ N+SF EN+PG+KLKF  + TKSED+  +  A+KA+Y+ +  E  TYEP VRR
Sbjct: 601 GINNSLLSNSSFTENNPGIKLKFVPQQTKSEDKSHSLQAQKAKYTAKHTENHTYEPTVRR 628

Query: 661 RCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKG 688
           RCL G+LMESSDS+ +N EK RRHGCRY  + +G
Sbjct: 661 RCLGGLLMESSDSDGDNSEKPRRHGCRYRGSFEG 628

BLAST of HG10004848 vs. ExPASy TrEMBL
Match: A0A6J1KJ10 (cell wall protein RBR3-like OS=Cucurbita maxima OX=3661 GN=LOC111496164 PE=4 SV=1)

HSP 1 Score: 605.9 bits (1561), Expect = 2.1e-169
Identity = 405/695 (58.27%), Positives = 478/695 (68.78%), Query Frame = 0

Query: 1   MSHSQLRILLPWQSLKASPHPANESSGWSFEPTDETETSASAADTVPNIRHQPVQSLEIN 60
           M++ Q R  LPWQS+KAS  P NESS  S EPTDE ETS SAADTVP ++H P+QS E  
Sbjct: 1   MTYRQFRFRLPWQSIKASSRPENESSTRSSEPTDEAETSNSAADTVPYMQHLPLQSHETK 60

Query: 61  PEQPPLAPAQALEKSETMPPSKSHKASKVQSQPPSNSRAKNRSRTASKPPSQSKTIPQSS 120
           PE  PL  AQA E+SETM PSKSHK +KV SQP S+SRAK ++RTA+KPPS SK  PQSS
Sbjct: 61  PELSPLESAQAPERSETMLPSKSHKKAKVHSQPSSHSRAKKQTRTATKPPSASKVTPQSS 120

Query: 121 VAS-KSPSTSSKGSPSQDTSKPSSPAGKASPSQDASSKPSSPAAVASTAPRTRIALKPSS 180
           V+S KSP+TS+K SPS D SKPSS AGK SPS D +SK SSPA                S
Sbjct: 121 VSSNKSPTTSAKASPSHDASKPSSSAGKVSPSHD-TSKLSSPAGKGKV-----------S 180

Query: 181 PSSQTSSKSHPNKKPTSQSRIKADSQPSSSSRSAFPSQDFSMPLRSPSQENSRQQPLEKT 240
           PS  T                   S PS  +  AFPS+D S P                 
Sbjct: 181 PSRDT-------------------SMPSPPAGKAFPSRDASQP--------------SSA 240

Query: 241 SRVQSPSHLSSKPTAQLTSQQPIKSPAAIGTQIHPNSKPSSQSRFKADSQPSSPSRSAFP 300
           +     SH+ SKP           SP+   ++ H +SK +SQSR KADSQPSSPSR AF 
Sbjct: 241 AAAAPRSHIRSKP----------PSPSQTSSKNHLHSKQTSQSRLKADSQPSSPSRPAFS 300

Query: 301 SQDSSMPPRSPSQEISRQQASEKTSRVQSPSHLSGKPTAQSTTQQPIESPTAIGDQTTDG 360
            Q SS+ PRSPS E SRQQ S+K SRVQSPSHLS K TAQST+QQ  ESP  IGDQTT  
Sbjct: 301 PQASSI-PRSPSHENSRQQPSKKASRVQSPSHLSSKATAQSTSQQLTESPATIGDQTTKR 360

Query: 361 IISHPANQSPKARPTNRETQLQTKSTQPLKPDRKPVESKASKNQPETKEELTSKNTSNPH 420
           ++SHPA+QSP+AR  ++E Q+QTKS Q  KPD KPVE KASK+QPET EE  SKNTS P 
Sbjct: 361 VVSHPADQSPRARCKSKENQVQTKSKQSPKPDLKPVEFKASKHQPETMEEFISKNTSYPL 420

Query: 421 PNQDYSEIPTQSDQNMENGLHSFLESQAESKETK------EDLAKTTNALQTKASRSTLI 480
            N+D+SEIP   D+ +ENG    LESQ ES+E+K      EDL KTTNALQ  AS+S LI
Sbjct: 421 SNEDFSEIPIIIDETIENGPEPSLESQTESQESKEIKSYEEDLEKTTNALQINASKSKLI 480

Query: 481 TSSKSRSSFEPEKWYSQQEESMEDLSKVFQKLNIKYSDKENQKSFTTLIGDNKGSSMHLL 540
           TS++  S FEPE   SQQE +MEDL K FQ LNIKY + EN KSFTTL GDNKG+SMHL+
Sbjct: 481 TSAEITSPFEPENSDSQQEGTMEDLPKAFQTLNIKYPE-ENPKSFTTLTGDNKGASMHLI 540

Query: 541 SGEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPSPLELYINVNVQ 600
           SGEA  ES IHIHRQYKSDPD+ P+SST+IEGN N ETPQDS+TEE+P PLELYIN+NVQ
Sbjct: 541 SGEATKESSIHIHRQYKSDPDKVPESSTDIEGNSNEETPQDSKTEEDP-PLELYININVQ 600

Query: 601 GINNSIMCNTSFIENDPGVKLKFPREPTKSEDELEAHHARKAEYSPRPAEKLTYEPRVRR 660
           GINNS++ N+SF EN+PG+KLKF  + TKSE++  +  A+KA+Y+ +  E  TYEP VRR
Sbjct: 601 GINNSLLSNSSFTENNPGIKLKFVPQQTKSENKPHSLQAQKAKYTAKHTENHTYEPTVRR 637

Query: 661 RCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKGK 689
           RCL G+LMESSDS+ +N EK RRHGCRY  + +GK
Sbjct: 661 RCLGGLLMESSDSDGDNSEKPRRHGCRYRGSFEGK 637

BLAST of HG10004848 vs. ExPASy TrEMBL
Match: A0A1S4DVD0 (micronuclear linker histone polyprotein OS=Cucumis melo OX=3656 GN=LOC103487982 PE=4 SV=1)

HSP 1 Score: 507.3 bits (1305), Expect = 1.0e-139
Identity = 295/403 (73.20%), Positives = 329/403 (81.64%), Query Frame = 0

Query: 305 MPPRSPSQEISRQQASEKTSRVQSPSHLSGKPTAQSTTQQPIESPTAIGDQTTDGIISHP 364
           MPPRSPSQE S Q  SEKTSRVQSPS+LS KPTA ST+QQPIES  +IGDQTTDGI+S P
Sbjct: 1   MPPRSPSQENSLQPPSEKTSRVQSPSNLSRKPTAPSTSQQPIESTASIGDQTTDGILSDP 60

Query: 365 ANQSPKARPTNRETQLQTKSTQPLKPDRKPVESKASKNQPETKEELT----------SKN 424
           A  SPKA PT+ E Q+Q KS +  +P+ KPVE KASKNQ +TKEELT          SKN
Sbjct: 61  ATPSPKAIPTSGEIQIQAKSKKSPEPNVKPVELKASKNQNDTKEELTSKNETKEELASKN 120

Query: 425 TSNPHPNQDYSEIPTQSDQNMENGLHSFLESQAESKETKEDLAKTTNALQTKASRSTLIT 484
           TSNPH ++D SE PTQSD+ +E GL S LESQ ESKETKED  KTTNALQ KASRSTLIT
Sbjct: 121 TSNPHSDEDSSENPTQSDETVERGLDSSLESQTESKETKEDGGKTTNALQAKASRSTLIT 180

Query: 485 SSKSRSSFEPEKWYSQQEESMEDLSKVFQKLNIKYSDKENQKSFTTLIGDNKGSSMHLLS 544
           SSKSRSSFEPEK  +QQ+ESMEDLSK F KLNIKYSD+EN KSFTT+IGDNKGSS+HLLS
Sbjct: 181 SSKSRSSFEPEK-NTQQDESMEDLSKAFNKLNIKYSDEENPKSFTTMIGDNKGSSVHLLS 240

Query: 545 GEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPS--PLELYINVNV 604
           GEAKSES IH++ +YKS+PDQSPKSST I+ N NNETPQDS TEENP   PLELYIN NV
Sbjct: 241 GEAKSESSIHVNHRYKSNPDQSPKSSTNIKENSNNETPQDSTTEENPDPPPLELYINHNV 300

Query: 605 QGINNSIMCNTSFIENDPGVKLKFP--REPTKSEDELEAHHARKAEYSPRPAEKLTYEPR 664
           QGINNSIM NTSF EN+PG+KLKFP   EPT S+DELE+HH RK+ Y P PAEK+TYEPR
Sbjct: 301 QGINNSIMFNTSFTENNPGIKLKFPGDGEPTNSQDELESHHTRKSTYIPTPAEKVTYEPR 360

Query: 665 VRRRCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKGKEVETL 694
           +RRR L G+LMES DSE ENP K R HGCRYSR+SKGK+VETL
Sbjct: 361 IRRRYLGGLLMESGDSEDENPRKLRCHGCRYSRSSKGKKVETL 402

BLAST of HG10004848 vs. ExPASy TrEMBL
Match: A0A0A0LLH1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G361770 PE=4 SV=1)

HSP 1 Score: 489.6 bits (1259), Expect = 2.2e-134
Identity = 286/402 (71.14%), Positives = 323/402 (80.35%), Query Frame = 0

Query: 305 MPPRSPSQEISRQQASEKTSRVQSPSHLSGKPTAQSTTQQPIESPTAIGDQTTDGIISHP 364
           MPPRSPSQE S Q  SEKT RVQSPSHLS KPTAQST+QQPIE   +IGDQTTD I+S P
Sbjct: 1   MPPRSPSQENSLQPPSEKTFRVQSPSHLSRKPTAQSTSQQPIEPTASIGDQTTDRILSDP 60

Query: 365 ANQSPKARPTNRETQLQTKSTQPLKPDRKPVESKASKNQPETKEELT----------SKN 424
           AN SPKA PT+ E+Q+Q +S +  KP+ KPVE + SK Q ETKEELT          SKN
Sbjct: 61  ANPSPKAIPTSGESQIQAESKKSPKPNVKPVELEESKTQHETKEELTSKNENKEELASKN 120

Query: 425 TSNPHPNQDYSEIPTQSDQNMENGLHSFLESQAESKETKEDLAKTTNALQTKASRSTLIT 484
           TSNPH  +D SE PTQSDQ +E GL S LESQ ESKETKED AKTTNA QTKASRSTLIT
Sbjct: 121 TSNPHSYKDSSENPTQSDQAIEKGLDSSLESQTESKETKEDGAKTTNAFQTKASRSTLIT 180

Query: 485 SSKSRSSFEPEKWYSQQEESMEDLSKVFQKLNIKYSDKENQKSFTTLIGDNKGSSMHLLS 544
           SSKSRSSFEPE   +QQ+ESMEDLSK F KLNIKYSD+EN KS TT+IGDNKG+SMHLLS
Sbjct: 181 SSKSRSSFEPEN-NTQQDESMEDLSKAFNKLNIKYSDEENPKSLTTMIGDNKGTSMHLLS 240

Query: 545 GEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPS--PLELYINVNV 604
            EAKSES IH++  YKS+PDQSP+SST+I+ N NNET +DS TEENP   PLELYIN+NV
Sbjct: 241 DEAKSESSIHVNHHYKSNPDQSPESSTDIKENSNNETAKDSTTEENPDPPPLELYINLNV 300

Query: 605 QGINNSIMCNTSFIENDPGVKLKFPREPTKSEDELEA-HHARKAEYSPRPAEKLTYEPRV 664
           QGINNSI  NTSF EN+PG+KLKFP EPT  +DELE+ HH RK++Y   PAEK+TY+PR+
Sbjct: 301 QGINNSITFNTSFTENNPGIKLKFPGEPTNCQDELESDHHTRKSKYIATPAEKVTYDPRI 360

Query: 665 RRRCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKGKEVETL 694
           RRRCL G+LMESSDSE ENP K + HGCRYS +SKGKEVETL
Sbjct: 361 RRRCLEGLLMESSDSEDENPGKLQPHGCRYSGSSKGKEVETL 401

BLAST of HG10004848 vs. TAIR 10
Match: AT1G75260.1 (oxidoreductases, acting on NADH or NADPH )

HSP 1 Score: 79.7 bits (195), Expect = 1.0e-14
Identity = 157/567 (27.69%), Positives = 242/567 (42.68%), Query Frame = 0

Query: 130 SKGSPSQDTSKPSSPAGKASPSQDASSKPSSPAAVASTAPRTRIALKPSSPSSQTSSKSH 189
           S  SPS+ +S  SSP+   +P    S  P  PA +A          +PS   S+T  K+ 
Sbjct: 12  SSNSPSRISSGTSSPSPPPTP---PSRPPFRPAGIA----------QPS--KSETKPKAS 71

Query: 190 PNKKPTSQSRIKADSQPSSSSRSAFPSQDFSMPLRSPSQENSRQQPLEKTSRVQSPSHLS 249
           P+    S+SR    +  +SSS S  PS   + P R   Q N +                S
Sbjct: 72  PS---LSRSRSNVAALAASSSASQLPSLGAATPTRLAKQTNQQ----------------S 131

Query: 250 SKPTAQLTSQQPIKSPAAIGTQIHPNSKPSSQSRFKADSQPSSPSRSAFPSQDSSMPPRS 309
             P+ +L S +       + T+  P  +    +          P   A P +        
Sbjct: 132 GSPSKKLDSLR--MEEQKVATKEKPPGETEKIAEENISPVKEKPPIGARPEEHLEQKETE 191

Query: 310 PSQEISRQQASEKTSRVQSPSHL---SGKPTAQSTTQQPIESPTAIGDQTTDGIISHPAN 369
             QE  R   + +    ++   L   SGK +A +  QQ IE    I  Q    ++     
Sbjct: 192 AVQEQGRNTEAARLVVQENKKVLPEGSGKKSAANQGQQKIEEIEKIALQERKKVLHDDGV 251

Query: 370 QSPKARPTNRETQLQTKSTQPLKPDRKPVESKASKNQPETKEELTSKNTSNPHPNQDYSE 429
           Q  +A     + Q ++K T+ L         +A   +  T+ + T    +     +   +
Sbjct: 252 QKLEA----DQGQQKSKETEKLALQETKRSLQAVGREDATRSKTTRHMAAASETTRGPRD 311

Query: 430 IPTQSDQNMENGLHSFLESQAESKETKEDLAKTTNALQTKASRSTLITSSKSRSSFEPEK 489
           +P +             E+Q  ++   +D  + T    T    +  +T+ +  S      
Sbjct: 312 LPEKK-----------TETQNRTEIPTDDNHQKTKGALTSNLGNPRVTNREGSS------ 371

Query: 490 WYSQQEESMEDLSKVFQKLNIKYSDKENQK-SFTTLIGDNKGSSMHLLSGEAKSESPIHI 549
             S   +  ED+     KL    S+ +++  S  TL G+NKG++M + S + K +  +HI
Sbjct: 372 --SMSRKIKEDIRDGISKLTWGKSNGDDKSVSVYTLTGENKGATMGIGSEKDKKDGEVHI 431

Query: 550 HRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPSPLELYINVNVQGINNSIMCNTSF 609
            R Y+S+PD+S  ++         E P+D   EE  S    YIN N QGINNSI+  +S 
Sbjct: 432 RRGYRSNPDESSNTTAT-----ETENPKDDEAEEEAS-FTAYINGNTQGINNSIVVESSV 491

Query: 610 IENDPGVKLKFPREPTKSEDELEAHHA-RKAEYSPRPAEKLTYEPRVRRRCLRGMLMESS 669
            ENDPGV + F  E  K E      +   K   +    +KL  EPRVRRRCLRG+L ESS
Sbjct: 492 SENDPGVHMSFKFEILKKEVIYPPENVEEKKPETVTVTKKLKNEPRVRRRCLRGLLAESS 511

Query: 670 DSEVENPEKSRRHGCRYSRNSKGKEVE 692
           +SE +NP K RRHGCR++   K K++E
Sbjct: 552 ESEPDNPLKPRRHGCRFT--CKDKDIE 511
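
The Identity and Positives percentages in the HSP headers above are the matched-position counts divided by the alignment length. As a minimal sketch, the Python below recomputes them from a header line so the reported values can be checked. The function name `parse_hsp_stats` is hypothetical, and the regular expression assumes the exact "Identity = a/b (p%)" layout used on this page.

```python
import re

# Matches the HSP summary line; "Posi?tives" also accepts the
# misspelling "Postives" that some exports of this page contain.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return recomputed and reported percentages for one HSP header line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity": round(100 * int(ident) / int(ident_len), 2),
        "positives": round(100 * int(pos) / int(pos_len), 2),
        "reported_identity": float(ident_pct),
        "reported_positives": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 295/403 (73.20%), Postives = 329/403 (81.64%), Query Frame = 0"
)
print(stats)
```

For the Cucumis melo TrEMBL hit (295 identical and 329 positive positions out of 403 aligned), the recomputed values agree with the reported 73.20% and 81.64%.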

The following BLAST results are available for this feature:
Match Name       E-value    Identity (%)  Description
XP_038886773.1   5.5e-265   79.86         flocculation protein FLO11 [Benincasa hispida]
KAA0065223.1     5.4e-236   66.58         flocculation protein FLO11 [Cucumis melo var. makuwa]
XP_011649631.1   7.0e-228   65.66         flocculation protein FLO11 [Cucumis sativus] >KAE8652109.1 hypothetical protein ...
XP_022951875.1   1.8e-170   58.65         cell wall protein RBR3-like [Cucurbita moschata]
XP_023002262.1   4.3e-169   58.27         cell wall protein RBR3-like [Cucurbita maxima]

Match Name       E-value    Identity (%)  Description
A0A5A7VAN0       2.6e-236   66.58         Flocculation protein FLO11 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaff...
A0A6J1GK50       8.5e-171   58.65         cell wall protein RBR3-like OS=Cucurbita moschata OX=3662 GN=LOC111454611 PE=4 S...
A0A6J1KJ10       2.1e-169   58.27         cell wall protein RBR3-like OS=Cucurbita maxima OX=3661 GN=LOC111496164 PE=4 SV=...
A0A1S4DVD0       1.0e-139   73.20         micronuclear linker histone polyprotein OS=Cucumis melo OX=3656 GN=LOC103487982 ...
A0A0A0LLH1       2.2e-134   71.14         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G361770 PE=4 SV=1

Match Name       E-value    Identity (%)  Description
AT1G75260.1      1.0e-14    27.69         oxidoreductases, acting on NADH or NADPH
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term  IPR Description   Source       Source Term     Source Description    Alignment
None      No IPR available  COILS        Coil            Coil                  coord: 488..515
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 554..583
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 82..388
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 18..59
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 458..488
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 513..535
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 409..445
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 389..408
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 660..693
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 513..583
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 620..645
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 1..495
None      No IPR available  PANTHER      PTHR33472       OS01G0106600 PROTEIN  coord: 442..691
None      No IPR available  PANTHER      PTHR33472:SF15  MUCIN-2-LIKE          coord: 442..691
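
Several of the mobidb-lite disorder predictions listed above overlap or nest inside one another. As an illustrative sketch (not part of the InterPro analysis itself), the Python below collapses the listed coordinates into non-redundant regions. The helper names `parse_coord` and `merge_intervals` are hypothetical, and the choice to also join directly adjacent ranges (for example 389..408 and 409..445) is an assumption of this sketch.

```python
import re

COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Turn a 'coord: a..b' string into an (a, b) tuple of ints."""
    m = COORD.search(text)
    if m is None:
        raise ValueError("no coordinates in %r" % text)
    return int(m.group(1)), int(m.group(2))


def merge_intervals(intervals):
    """Merge overlapping or directly adjacent 1-based inclusive ranges."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Extend the current region if this range reaches further.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# mobidb-lite coordinates copied from the InterPro table on this page.
mobidb_coords = [
    "coord: 554..583", "coord: 82..388", "coord: 18..59",
    "coord: 458..488", "coord: 513..535", "coord: 409..445",
    "coord: 389..408", "coord: 660..693", "coord: 513..583",
    "coord: 620..645", "coord: 1..495",
]

merged = merge_intervals(parse_coord(c) for c in mobidb_coords)
print(merged)
```

Under these assumptions the eleven predictions reduce to four regions: 1..495, 513..583, 620..645, and 660..693.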

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
HG10004848.1   HG10004848.1   mRNA