Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGTCACACTCACAATTGCGCATTCTACTTCCTTGGCAATCGTTAAAAGCTTCTCCTCATCCTGCAAATGAGTCGTCAGGATGGAGTTTTGAGCCTACAGATGAAACGGAAACTTCTGCTTCTGCAGCTGATACCGTGCCAAATATTCGACATCAACCGGTACAGTCTCTTGAGATAAATCCAGAACAACCTCCTTTAGCACCAGCTCAGGCACTTGAAAAAAGTGAAACTATGCCACCTTCAAAATCTCATAAGGCAAGCAAAGTTCAATCTCAGCCACCATCAAATTCTCGAGCCAAAAATCGGTCCCGAACAGCTTCCAAGCCTCCATCACAATCTAAAACAATCCCTCAATCTTCAGTTGCTTCCAAGTCTCCTTCAACATCAAGCAAAGGCTCCCCATCTCAGGATACTTCAAAGCCATCATCACCAGCAGGCAAAGCCTCTCCATCTCAGGATGCTTCTTCAAAGCCTTCATCACCTGCAGCAGTTGCATCTACTGCTCCTCGAACCCGGATTGCTTTGAAGCCATCATCTCCATCGTCTCAAACATCCAGTAAAAGCCATCCAAATAAAAAACCAACATCACAATCAAGAATTAAAGCTGATTCTCAGCCTTCATCATCTTCAAGGTCAGCATTTCCATCTCAAGATTTTTCTATGCCACTGCGGTCGCCATCTCAAGAAAATTCTCGACAACAACCATTGGAAAAAACCTCTCGGGTTCAGTCTCCATCTCATTTGTCCAGTAAACCTACTGCACAATTAACATCACAACAACCTATTAAGTCTCCAGCAGCCATTGGAACTCAAATCCATCCAAATTCAAAACCATCATCCCAATCAAGATTTAAAGCTGATTCTCAGCCTTCATCACCTTCAAGGTCAGCATTTCCATCTCAAGATTCTTCTATGCCACCACGGTCGCCATCTCAAGAAATTTCTCGACAACAAGCATCGGAAAAAACCTCTCGAGTTCAGTCTCCATCTCATTTGTCCGGTAAACCTACTGCACAATCAACAACACAACAACCTATTGAATCTCCTACAGCCATTGGAGACCAAACAACAGATGGAATCATTTCTCATCCCGCAAATCAATCCCCAAAAGCAAGACCTACAAACAGGGAAACTCAGTTGCAAACCAAATCAACGCAGCCTCTGAAACCAGACAGGAAACCAGTGGAATCGAAAGCATCAAAAAATCAGCCTGAAACCAAGGAAGAGCTCACATCTAAGAACACTTCCAATCCCCATCCGAACCAGGACTATTCTGAAATCCCAACACAATCCGATCAAAACATGGAAAATGGCTTACATTCCTTTCTAGAATCACAGGCAGAGTCAAAAGAAACTAAGGAAGATCTGGCAAAGACAACCAATGCACTTCAAACCAAAGCATCTAGAAGCACATTAATCACATCTTCCAAAAGTCGTTCATCATTTGAACCAGAAAAGTGGTACTCACAACAGGAAGAATCCATGGAAGACTTATCCAAAGTTTTTCAGAAACTAAACATCAAATATTCAGACAAGGAAAATCAAAAGAGCTTCACAACATTGATCGGCGATAACAAAGGGTCGTCAATGCACTTACTCTCCGGTGAAGCCAAAAGCGAAAGCCCAATCCACATCCACCGTCAATATAAGAGCGATCCAGATCAAAGCCCTAAAAGTTCCACAGAAATCGAAGGAAATTTCAATAACGAAACACCGCAAGATTCAAGAACAGAGGAGAATCCATCACCCCTGGAATTATATATCAACGTCAATGTACAAGGTATCAACAACTCAATCATGTGCAATACCTCATTTATAGAGAATGATCCTGGAGTCAAGTTGAAATTCCCTCGAGAACCAACAAAATCTGAAGATGAATTAGAGGCTCATCACGCTAGAAAAGCAGAATACAGTCCGAGACCGGCGGAGAAGCTTACGTATGAACCCAGAGTAAGACGAAGATGCCTCAGAGGGATGTTAATGGAGTCGAGCGATTCTGA
GGTCGAGAATCCAGAAAAGTCTCGACGCCATGGCTGCCGGTATAGTCGTAATAGCAAAGGAAAAGAGGTCGAAACTCTGTAA
mRNA sequence
ATGTCACACTCACAATTGCGCATTCTACTTCCTTGGCAATCGTTAAAAGCTTCTCCTCATCCTGCAAATGAGTCGTCAGGATGGAGTTTTGAGCCTACAGATGAAACGGAAACTTCTGCTTCTGCAGCTGATACCGTGCCAAATATTCGACATCAACCGGTACAGTCTCTTGAGATAAATCCAGAACAACCTCCTTTAGCACCAGCTCAGGCACTTGAAAAAAGTGAAACTATGCCACCTTCAAAATCTCATAAGGCAAGCAAAGTTCAATCTCAGCCACCATCAAATTCTCGAGCCAAAAATCGGTCCCGAACAGCTTCCAAGCCTCCATCACAATCTAAAACAATCCCTCAATCTTCAGTTGCTTCCAAGTCTCCTTCAACATCAAGCAAAGGCTCCCCATCTCAGGATACTTCAAAGCCATCATCACCAGCAGGCAAAGCCTCTCCATCTCAGGATGCTTCTTCAAAGCCTTCATCACCTGCAGCAGTTGCATCTACTGCTCCTCGAACCCGGATTGCTTTGAAGCCATCATCTCCATCGTCTCAAACATCCAGTAAAAGCCATCCAAATAAAAAACCAACATCACAATCAAGAATTAAAGCTGATTCTCAGCCTTCATCATCTTCAAGGTCAGCATTTCCATCTCAAGATTTTTCTATGCCACTGCGGTCGCCATCTCAAGAAAATTCTCGACAACAACCATTGGAAAAAACCTCTCGGGTTCAGTCTCCATCTCATTTGTCCAGTAAACCTACTGCACAATTAACATCACAACAACCTATTAAGTCTCCAGCAGCCATTGGAACTCAAATCCATCCAAATTCAAAACCATCATCCCAATCAAGATTTAAAGCTGATTCTCAGCCTTCATCACCTTCAAGGTCAGCATTTCCATCTCAAGATTCTTCTATGCCACCACGGTCGCCATCTCAAGAAATTTCTCGACAACAAGCATCGGAAAAAACCTCTCGAGTTCAGTCTCCATCTCATTTGTCCGGTAAACCTACTGCACAATCAACAACACAACAACCTATTGAATCTCCTACAGCCATTGGAGACCAAACAACAGATGGAATCATTTCTCATCCCGCAAATCAATCCCCAAAAGCAAGACCTACAAACAGGGAAACTCAGTTGCAAACCAAATCAACGCAGCCTCTGAAACCAGACAGGAAACCAGTGGAATCGAAAGCATCAAAAAATCAGCCTGAAACCAAGGAAGAGCTCACATCTAAGAACACTTCCAATCCCCATCCGAACCAGGACTATTCTGAAATCCCAACACAATCCGATCAAAACATGGAAAATGGCTTACATTCCTTTCTAGAATCACAGGCAGAGTCAAAAGAAACTAAGGAAGATCTGGCAAAGACAACCAATGCACTTCAAACCAAAGCATCTAGAAGCACATTAATCACATCTTCCAAAAGTCGTTCATCATTTGAACCAGAAAAGTGGTACTCACAACAGGAAGAATCCATGGAAGACTTATCCAAAGTTTTTCAGAAACTAAACATCAAATATTCAGACAAGGAAAATCAAAAGAGCTTCACAACATTGATCGGCGATAACAAAGGGTCGTCAATGCACTTACTCTCCGGTGAAGCCAAAAGCGAAAGCCCAATCCACATCCACCGTCAATATAAGAGCGATCCAGATCAAAGCCCTAAAAGTTCCACAGAAATCGAAGGAAATTTCAATAACGAAACACCGCAAGATTCAAGAACAGAGGAGAATCCATCACCCCTGGAATTATATATCAACGTCAATGTACAAGGTATCAACAACTCAATCATGTGCAATACCTCATTTATAGAGAATGATCCTGGAGTCAAGTTGAAATTCCCTCGAGAACCAACAAAATCTGAAGATGAATTAGAGGCTCATCACGCTAGAAAAGCAGAATACAGTCCGAGACCGGCGGAGAAGCTTACGTATGAACCCAGAGTAAGACGAAGATGCCTCAGAGGGATGTTAATGGAGTCGAGCGATTCTGA
GGTCGAGAATCCAGAAAAGTCTCGACGCCATGGCTGCCGGTATAGTCGTAATAGCAAAGGAAAAGAGGTCGAAACTCTGTAA
Coding sequence (CDS)
ATGTCACACTCACAATTGCGCATTCTACTTCCTTGGCAATCGTTAAAAGCTTCTCCTCATCCTGCAAATGAGTCGTCAGGATGGAGTTTTGAGCCTACAGATGAAACGGAAACTTCTGCTTCTGCAGCTGATACCGTGCCAAATATTCGACATCAACCGGTACAGTCTCTTGAGATAAATCCAGAACAACCTCCTTTAGCACCAGCTCAGGCACTTGAAAAAAGTGAAACTATGCCACCTTCAAAATCTCATAAGGCAAGCAAAGTTCAATCTCAGCCACCATCAAATTCTCGAGCCAAAAATCGGTCCCGAACAGCTTCCAAGCCTCCATCACAATCTAAAACAATCCCTCAATCTTCAGTTGCTTCCAAGTCTCCTTCAACATCAAGCAAAGGCTCCCCATCTCAGGATACTTCAAAGCCATCATCACCAGCAGGCAAAGCCTCTCCATCTCAGGATGCTTCTTCAAAGCCTTCATCACCTGCAGCAGTTGCATCTACTGCTCCTCGAACCCGGATTGCTTTGAAGCCATCATCTCCATCGTCTCAAACATCCAGTAAAAGCCATCCAAATAAAAAACCAACATCACAATCAAGAATTAAAGCTGATTCTCAGCCTTCATCATCTTCAAGGTCAGCATTTCCATCTCAAGATTTTTCTATGCCACTGCGGTCGCCATCTCAAGAAAATTCTCGACAACAACCATTGGAAAAAACCTCTCGGGTTCAGTCTCCATCTCATTTGTCCAGTAAACCTACTGCACAATTAACATCACAACAACCTATTAAGTCTCCAGCAGCCATTGGAACTCAAATCCATCCAAATTCAAAACCATCATCCCAATCAAGATTTAAAGCTGATTCTCAGCCTTCATCACCTTCAAGGTCAGCATTTCCATCTCAAGATTCTTCTATGCCACCACGGTCGCCATCTCAAGAAATTTCTCGACAACAAGCATCGGAAAAAACCTCTCGAGTTCAGTCTCCATCTCATTTGTCCGGTAAACCTACTGCACAATCAACAACACAACAACCTATTGAATCTCCTACAGCCATTGGAGACCAAACAACAGATGGAATCATTTCTCATCCCGCAAATCAATCCCCAAAAGCAAGACCTACAAACAGGGAAACTCAGTTGCAAACCAAATCAACGCAGCCTCTGAAACCAGACAGGAAACCAGTGGAATCGAAAGCATCAAAAAATCAGCCTGAAACCAAGGAAGAGCTCACATCTAAGAACACTTCCAATCCCCATCCGAACCAGGACTATTCTGAAATCCCAACACAATCCGATCAAAACATGGAAAATGGCTTACATTCCTTTCTAGAATCACAGGCAGAGTCAAAAGAAACTAAGGAAGATCTGGCAAAGACAACCAATGCACTTCAAACCAAAGCATCTAGAAGCACATTAATCACATCTTCCAAAAGTCGTTCATCATTTGAACCAGAAAAGTGGTACTCACAACAGGAAGAATCCATGGAAGACTTATCCAAAGTTTTTCAGAAACTAAACATCAAATATTCAGACAAGGAAAATCAAAAGAGCTTCACAACATTGATCGGCGATAACAAAGGGTCGTCAATGCACTTACTCTCCGGTGAAGCCAAAAGCGAAAGCCCAATCCACATCCACCGTCAATATAAGAGCGATCCAGATCAAAGCCCTAAAAGTTCCACAGAAATCGAAGGAAATTTCAATAACGAAACACCGCAAGATTCAAGAACAGAGGAGAATCCATCACCCCTGGAATTATATATCAACGTCAATGTACAAGGTATCAACAACTCAATCATGTGCAATACCTCATTTATAGAGAATGATCCTGGAGTCAAGTTGAAATTCCCTCGAGAACCAACAAAATCTGAAGATGAATTAGAGGCTCATCACGCTAGAAAAGCAGAATACAGTCCGAGACCGGCGGAGAAGCTTACGTATGAACCCAGAGTAAGACGAAGATGCCTCAGAGGGATGTTAATGGAGTCGAGCGATTCTGA
GGTCGAGAATCCAGAAAAGTCTCGACGCCATGGCTGCCGGTATAGTCGTAATAGCAAAGGAAAAGAGGTCGAAACTCTGTAA
Protein sequence
MSHSQLRILLPWQSLKASPHPANESSGWSFEPTDETETSASAADTVPNIRHQPVQSLEINPEQPPLAPAQALEKSETMPPSKSHKASKVQSQPPSNSRAKNRSRTASKPPSQSKTIPQSSVASKSPSTSSKGSPSQDTSKPSSPAGKASPSQDASSKPSSPAAVASTAPRTRIALKPSSPSSQTSSKSHPNKKPTSQSRIKADSQPSSSSRSAFPSQDFSMPLRSPSQENSRQQPLEKTSRVQSPSHLSSKPTAQLTSQQPIKSPAAIGTQIHPNSKPSSQSRFKADSQPSSPSRSAFPSQDSSMPPRSPSQEISRQQASEKTSRVQSPSHLSGKPTAQSTTQQPIESPTAIGDQTTDGIISHPANQSPKARPTNRETQLQTKSTQPLKPDRKPVESKASKNQPETKEELTSKNTSNPHPNQDYSEIPTQSDQNMENGLHSFLESQAESKETKEDLAKTTNALQTKASRSTLITSSKSRSSFEPEKWYSQQEESMEDLSKVFQKLNIKYSDKENQKSFTTLIGDNKGSSMHLLSGEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPSPLELYINVNVQGINNSIMCNTSFIENDPGVKLKFPREPTKSEDELEAHHARKAEYSPRPAEKLTYEPRVRRRCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKGKEVETL
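The protein sequence above corresponds to a codon-by-codon translation of the coding sequence. The following is a minimal sketch of that check, assuming the standard genetic code; the helper name `translate` and the table layout are illustrative assumptions and are not part of the database page.

```python
# Minimal sketch: translate a CDS using the standard genetic code.
# `translate` is an illustrative helper, not something the source page provides.
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGTCACACTCACAATTGCGC"))  # MSHSQLR
```

The sketch raises `KeyError` on ambiguous bases such as `N`; a real pipeline would more likely rely on an established library than on a hand-built table.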
Homology
BLAST of HG10004848 vs. NCBI nr
Match:
XP_038886773.1 (flocculation protein FLO11 [Benincasa hispida])
HSP 1 Score: 924.5 bits (2388), Expect = 5.5e-265
Identity = 559/700 (79.86%), Positives = 596/700 (85.14%), Query Frame = 0
Query: 1 MSHSQLRILLPWQSLKASPHPANESSGWSFEPTDETETSASAADTVPNIRHQPVQSLEIN 60
MS SQLRILLPWQSLKASP P NES G SFEPTDE ETSASAADT NIRHQP QS EI
Sbjct: 1 MSLSQLRILLPWQSLKASPLPENESPGRSFEPTDEVETSASAADTTQNIRHQPAQSPEIK 60
Query: 61 PEQPPLAPAQALEKSETMPPSKSHKASKVQSQPPSNSRAKNRSRTASKPPSQSKTIPQSS 120
PEQPPLA A A E+SETMPPSKSHKA KV SQPP NSRAKNRSRTASKP SK IPQSS
Sbjct: 61 PEQPPLATALAPERSETMPPSKSHKAGKVHSQPPPNSRAKNRSRTASKPSPPSKAIPQSS 120
Query: 121 VAS-KSPSTSSKGSPSQDTSKPSSPAGKASPSQDASSKPSSPAAVASTAPRTRIALKP-- 180
VAS KSPSTS K S SQDTSKPSSPAGK+S SQDASSKPSSPAAVA+TAPR+RI KP
Sbjct: 121 VASNKSPSTSGKDSLSQDTSKPSSPAGKSSRSQDASSKPSSPAAVAATAPRSRITSKPSS 180
Query: 181 ----SSPSSQTSSKSHPNKKPTSQSRIKADSQPSSSSRSAFPSQDFSMPLRSPSQENSRQ 240
SSPSSQTSSK+HP KP+SQSR KADSQP SSSRSAFPSQD S+P R PS ENSR
Sbjct: 181 PSSSSSPSSQTSSKNHP--KPSSQSRFKADSQP-SSSRSAFPSQDSSLPPRLPSLENSR- 240
Query: 241 QPLEKTSRVQSPSHLSSKPTAQLTSQQPIKSPAAIGTQIHPNSKPSSQSRFKADSQPSSP 300
QP E+TSRVQSPSH SSKPTAQ TSQQP +SPA IG Q HPNSKPSSQSRFKADSQPSS
Sbjct: 241 QPSERTSRVQSPSHFSSKPTAQSTSQQPNESPAVIGIQSHPNSKPSSQSRFKADSQPSSS 300
Query: 301 SRSAFPSQDSSMPPRSPSQEISRQQASEKTSRVQSPSHLSGKPTAQSTTQQPIESPTAIG 360
SRSAF SQDSSM P SPS+E SRQQ EKTSRVQSPSHLS KP AQST+QQPIESP AIG
Sbjct: 301 SRSAFSSQDSSMLPWSPSRENSRQQPLEKTSRVQSPSHLSSKP-AQSTSQQPIESPAAIG 360
Query: 361 DQTTDGIISHPANQSPKARPTNRETQLQTKSTQPLKPDRKPVESKASKNQPETKEELTSK 420
+QTT+ ISHP NQSPKARPT+RE+Q+QTKS Q LKP+ K VE KASKN+ ETKEEL+SK
Sbjct: 361 NQTTNETISHPTNQSPKARPTSRESQMQTKSKQSLKPNTKQVELKASKNKSETKEELSSK 420
Query: 421 NTSNPHPNQDYSEIPTQSDQNMENGLHSFLESQAESKETKEDLAKTTNALQTKASRSTLI 480
NTSNPH NQD E PT+SDQ +EN L LESQAES+ET+E+LAKTTNALQTKASRSTLI
Sbjct: 421 NTSNPHSNQDSFENPTKSDQTIENSLDFSLESQAESRETEEELAKTTNALQTKASRSTLI 480
Query: 481 TSSKSRSSFEPEKWYSQQEESMEDLSKVFQKLNIKYSDKENQKSFTTLIGDNKGSSMHLL 540
TSSK SFEPE QQEESM+D SK FQKLNIKYSD+EN KSFTTLIG NKGSSMHL+
Sbjct: 481 TSSKIHPSFEPE----QQEESMDDSSKAFQKLNIKYSDEENPKSFTTLIGQNKGSSMHLV 540
Query: 541 SGEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPSPLELYINVNVQ 600
SGEAKSES IHIHRQYKS+PDQSPK STEIEGNF NET +DSRTEENP +E+YIN+NVQ
Sbjct: 541 SGEAKSESSIHIHRQYKSNPDQSPKCSTEIEGNFINETQEDSRTEENPPSVEIYINLNVQ 600
Query: 601 GINNSIMCNTSFIENDPGVKLKFPREPTKSEDELEAHHARKAEYSPRPAEKLTYEPRVRR 660
GINNSIMCNTSF ENDPG+KLK RE KSEDELE+HHARKAEYS +PAEK+TYEPRVRR
Sbjct: 601 GINNSIMCNTSFTENDPGIKLKLSRETIKSEDELESHHARKAEYSAKPAEKVTYEPRVRR 660
Query: 661 RCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKGKEVETL 694
RCLRGMLMESSDSEVENP KSRRHGCRY +SKGKEVETL
Sbjct: 661 RCLRGMLMESSDSEVENPGKSRRHGCRYGLSSKGKEVETL 691
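The percentages on each HSP statistics line are the match counts divided by the alignment length. The sketch below parses such a line and recomputes them; the function name `parse_hsp_stats` and the returned dictionary keys are illustrative assumptions, and the pattern accepts spelling variants of the positives label.

```python
# Minimal sketch: parse a BLAST HSP statistics line and recompute percentages.
# `parse_hsp_stats` and its dictionary keys are illustrative, not a BLAST API.
import re

HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Return match counts and recomputed percentages for one HSP line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, aligned, positive, pos_aligned = map(int, match.groups())
    return {
        "identical": identical,
        "positive": positive,
        "aligned": aligned,
        "identity_pct": round(100 * identical / aligned, 2),
        "positives_pct": round(100 * positive / pos_aligned, 2),
    }


stats = parse_hsp_stats(
    "Identity = 559/700 (79.86%), Positives = 596/700 (85.14%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])  # 79.86 85.14
```

Recomputing the percentages from the counts, rather than reading them from the parentheses, gives a quick consistency check on the report.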
BLAST of HG10004848 vs. NCBI nr
Match:
KAA0065223.1 (flocculation protein FLO11 [Cucumis melo var. makuwa])
HSP 1 Score: 828.2 bits (2138), Expect = 5.4e-236
Identity = 528/793 (66.58%), Positives = 579/793 (73.01%), Query Frame = 0
Query: 1 MSHSQLRILLPWQSLKASPHPANESSGWSFEPTDETETSASAADTVPNIRHQPVQSLEIN 60
MS SQLRILLPWQSLKASP PANES SF PTDE+E+SAS ADT PNIRHQP QS EI
Sbjct: 1 MSLSQLRILLPWQSLKASPRPANESPEGSFGPTDESESSASQADTAPNIRHQPDQSPEIK 60
Query: 61 PEQPPLAPAQALEKSETMPPSKSHKASKVQSQPPSNSRAKNRSRTASKPPSQSKTIPQSS 120
PE+PPLA AQA E+SETMPPSKSHK K+ SQ +NSRAKNRSRTASKP S IPQS
Sbjct: 61 PEEPPLATAQAAERSETMPPSKSHKEGKIHSQLSTNSRAKNRSRTASKPSSPLNAIPQSP 120
Query: 121 VAS-KSPSTSSKGSPSQDTSKPSSPAGKA-SPSQDASSKPSSPAAVASTAPRTRIALKPS 180
+AS K PSTS KGS SQD+SKPSSPAGK SPSQDASSKPSSPA VA+TAP RIA K S
Sbjct: 121 LASNKYPSTSGKGSKSQDSSKPSSPAGKVFSPSQDASSKPSSPATVAATAP--RIASKAS 180
Query: 181 SPSSQTSSKSHPNKKPT------------------------------------------- 240
S SSQ S+K HP+ KPT
Sbjct: 181 SSSSQASNKKHPSSKPTSKLRFKADSQPSSPSRSAFPSQDPSMPPQSLSQEKSRQQASEK 240
Query: 241 -----------------------------------------SQSRIKADSQPSSSSRSAF 300
SQSR K DSQPS SSRS F
Sbjct: 241 SSRVQSPSHFSRKPTTQSTSKQPVESPATIGIQSHPNSKAPSQSRFKTDSQPSPSSRSTF 300
Query: 301 PSQDFSMPLRSPSQENSRQQPLEKTSRVQSPSHLSSKPTAQLTSQQPIKSPAAIGTQIHP 360
PSQDFSMP RSPS ENSRQQP +KTS VQSPSH S KPTAQ TS+QPI+SPA IG Q HP
Sbjct: 301 PSQDFSMPPRSPSHENSRQQPSDKTSGVQSPSHSSRKPTAQSTSKQPIESPATIGIQHHP 360
Query: 361 NSKPSSQSRFKADSQPSSPSRSAFPSQDSSMPPRSPSQEISRQQASEKTSRVQSPSHLSG 420
N KPSSQSRFKA+S+PSS S+S FPSQDSSMPPRSPSQE S Q SEKTSRVQSPS+LS
Sbjct: 361 NLKPSSQSRFKAESRPSSSSKSKFPSQDSSMPPRSPSQENSLQPPSEKTSRVQSPSNLSR 420
Query: 421 KPTAQSTTQQPIESPTAIGDQTTDGIISHPANQSPKARPTNRETQLQTKSTQPLKPDRKP 480
KPTA ST+QQPIES +IGDQTTDGI+S PA SPKA PT+ E Q+Q KS + +P+ KP
Sbjct: 421 KPTAPSTSQQPIESTASIGDQTTDGILSDPATPSPKAIPTSGEIQIQAKSKKSPEPNVKP 480
Query: 481 VESKASKNQPETKEELT----------SKNTSNPHPNQDYSEIPTQSDQNMENGLHSFLE 540
VE KASKNQ +TKEELT SKNTSNPH ++D SE PTQSD+ +E GL S LE
Sbjct: 481 VELKASKNQNDTKEELTSKNETKEELASKNTSNPHSDEDSSENPTQSDETVERGLDSSLE 540
Query: 541 SQAESKETKEDLAKTTNALQTKASRSTLITSSKSRSSFEPEKWYSQQEESMEDLSKVFQK 600
SQ ESKETKED KTTNALQ KASRSTLITSSKSRSSFEPEK +QQ+ESMEDLSK F K
Sbjct: 541 SQTESKETKEDGGKTTNALQAKASRSTLITSSKSRSSFEPEK-NTQQDESMEDLSKAFNK 600
Query: 601 LNIKYSDKENQKSFTTLIGDNKGSSMHLLSGEAKSESPIHIHRQYKSDPDQSPKSSTEIE 660
LNIKYSD+EN KSFTT+IGDNKGSS+HLLSGEAKSES IH++ +YKS+PDQSPKSST I+
Sbjct: 601 LNIKYSDEENPKSFTTMIGDNKGSSVHLLSGEAKSESSIHVNHRYKSNPDQSPKSSTNIK 660
Query: 661 GNFNNETPQDSRTEENPS--PLELYINVNVQGINNSIMCNTSFIENDPGVKLKFP--REP 694
N NNETPQDS TEENP PLELYIN NVQGINNSIM NTSF EN+PG+KLKFP EP
Sbjct: 661 ENSNNETPQDSTTEENPDPPPLELYINHNVQGINNSIMFNTSFTENNPGIKLKFPGDGEP 720
BLAST of HG10004848 vs. NCBI nr
Match:
XP_011649631.1 (flocculation protein FLO11 [Cucumis sativus] >KAE8652109.1 hypothetical protein Csa_022237 [Cucumis sativus])
HSP 1 Score: 801.2 bits (2068), Expect = 7.0e-228
Identity = 520/792 (65.66%), Positives = 570/792 (71.97%), Query Frame = 0
Query: 1 MSHSQLRILLPWQSLKASPHPANESSGWSFEPTDETETSASAADTVPNIRHQPVQSLEIN 60
MS SQLRILLPWQSLKAS ANES SF PTDE+E SASAADTVPNIRHQP QS E
Sbjct: 1 MSLSQLRILLPWQSLKASSRSANESPERSFGPTDESEGSASAADTVPNIRHQPDQSPETK 60
Query: 61 PEQPPLAPAQALEKSETMPPSKSHKASKVQSQPPSNSRAKNRSRTASKPPSQSKTIPQSS 120
PE+PPLA AQA E+SETMPPSKSHKA KV SQ PS +RAKNRSR ASKP S SK IPQ S
Sbjct: 61 PEEPPLATAQAAERSETMPPSKSHKAGKVHSQ-PSTTRAKNRSRAASKPSSSSKAIPQFS 120
Query: 121 VASKSPSTSSKGSPSQDTSKPSSPAGKA-SPSQDASSKPSSPAAVASTAPRTRIALKPSS 180
VAS PSTS KGS SQD+SKPSSPAGK SPS+DASSKPSSPA VA+T P RIA K SS
Sbjct: 121 VASDKPSTSGKGSQSQDSSKPSSPAGKVFSPSKDASSKPSSPATVAATPP--RIASKASS 180
Query: 181 PSSQTSSKSHPNKKPTSQSRIKADSQPSSSSRSA-------------------------- 240
SSQTS+K HPN KPTSQ ++KADSQPSSSSRSA
Sbjct: 181 SSSQTSNKKHPNSKPTSQLKVKADSQPSSSSRSALPSRDPSLPAQSLSQEDSRQQSSEKT 240
Query: 241 ----------------------------------------------------------FP 300
FP
Sbjct: 241 SRVQSPSNLSRKPTTQSTSKQPVESPATIRIQSHPNSKQPSQSRFKADSHPSPSSRSTFP 300
Query: 301 SQDFSMPLRSPSQENSRQQPLEKTSRVQSPSHLSSKPTAQLTSQQPIKSPAAIGTQIHPN 360
SQDFS P RSPS E SRQQP KTSRVQSPSH S K TAQ T+QQP +SPA IG Q HPN
Sbjct: 301 SQDFSTPPRSPSHEISRQQPSVKTSRVQSPSHSSRKSTAQSTTQQPTESPATIGIQHHPN 360
Query: 361 SKPS-SQSRFKADSQPSSPSRSAFPSQDSSMPPRSPSQEISRQQASEKTSRVQSPSHLSG 420
KPS SQSRFKADSQPSS S+ FPSQDSSMPPRSPSQE S Q SEKT RVQSPSHLS
Sbjct: 361 LKPSLSQSRFKADSQPSSSSKLKFPSQDSSMPPRSPSQENSLQPPSEKTFRVQSPSHLSR 420
Query: 421 KPTAQSTTQQPIESPTAIGDQTTDGIISHPANQSPKARPTNRETQLQTKSTQPLKPDRKP 480
KPTAQST+QQPIE +IGDQTTD I+S PAN SPKA PT+ E+Q+Q +S + KP+ KP
Sbjct: 421 KPTAQSTSQQPIEPTASIGDQTTDRILSDPANPSPKAIPTSGESQIQAESKKSPKPNVKP 480
Query: 481 VESKASKNQPETKEELT----------SKNTSNPHPNQDYSEIPTQSDQNMENGLHSFLE 540
VE + SK Q ETKEELT SKNTSNPH +D SE PTQSDQ +E GL S LE
Sbjct: 481 VELEESKTQHETKEELTSKNENKEELASKNTSNPHSYKDSSENPTQSDQAIEKGLDSSLE 540
Query: 541 SQAESKETKEDLAKTTNALQTKASRSTLITSSKSRSSFEPEKWYSQQEESMEDLSKVFQK 600
SQ ESKETKED AKTTNA QTKASRSTLITSSKSRSSFEPE +QQ+ESMEDLSK F K
Sbjct: 541 SQTESKETKEDGAKTTNAFQTKASRSTLITSSKSRSSFEPEN-NTQQDESMEDLSKAFNK 600
Query: 601 LNIKYSDKENQKSFTTLIGDNKGSSMHLLSGEAKSESPIHIHRQYKSDPDQSPKSSTEIE 660
LNIKYSD+EN KS TT+IGDNKG+SMHLLS EAKSES IH++ YKS+PDQSP+SST+I+
Sbjct: 601 LNIKYSDEENPKSLTTMIGDNKGTSMHLLSDEAKSESSIHVNHHYKSNPDQSPESSTDIK 660
Query: 661 GNFNNETPQDSRTEENPS--PLELYINVNVQGINNSIMCNTSFIENDPGVKLKFPREPTK 694
N NNET +DS TEENP PLELYIN+NVQGINNSI NTSF EN+PG+KLKFP EPT
Sbjct: 661 ENSNNETAKDSTTEENPDPPPLELYINLNVQGINNSITFNTSFTENNPGIKLKFPGEPTN 720
BLAST of HG10004848 vs. NCBI nr
Match:
XP_022951875.1 (cell wall protein RBR3-like [Cucurbita moschata])
HSP 1 Score: 610.5 bits (1573), Expect = 1.8e-170
Identity = 407/694 (58.65%), Positives = 480/694 (69.16%), Query Frame = 0
Query: 1 MSHSQLRILLPWQSLKASPHPANESSGWSFEPTDETETSASAADTVPNIRHQPVQSLEIN 60
M++ Q R LPWQS+KAS NESS S EPTDE ETS SAADTVP ++H
Sbjct: 1 MTYRQFRFRLPWQSIKASSRLENESSTRSSEPTDEAETSNSAADTVPYMQHL-------- 60
Query: 61 PEQPPLAPAQALEKSETMPPSKSHKASKVQSQPPSNSRAKNRSRTASKPPSQSKTIPQSS 120
PE PL AQA E+SETM PSKSHK +KV SQP S+SRAK ++RTA+KPPS SK PQSS
Sbjct: 61 PELSPLESAQAPERSETMLPSKSHKKAKVHSQPSSHSRAKKQTRTATKPPSASKVTPQSS 120
Query: 121 VAS-KSPSTSSKGSPSQDTSKPSSPAGKASPSQDASSKPSSPAAVASTAPRTRIALKPSS 180
V+S KSP+TS+K SPS D SKPSS AGK SPS D +SK SSPA +P
Sbjct: 121 VSSNKSPTTSAKASPSHDASKPSSSAGKVSPSHD-TSKLSSPAGKGKVSP---------- 180
Query: 181 PSSQTSSKSHPNKKPTSQSRIKADSQPSSSSRSAFPSQDFSMPLRSPSQENSRQQPLEKT 240
SH S+PSS + AFPS+D S P
Sbjct: 181 --------SHDT------------SKPSSPAGKAFPSRDASQP----------------- 240
Query: 241 SRVQSPSHLSSKPTAQLTSQQPIKSPAAIGTQIHPNSKPSSQSRFKADSQPSSPSRSAFP 300
S ++ P +Q+ S+ P SP+ ++ HP SKP+SQSR KADSQPSSPSR AF
Sbjct: 241 -----SSSAAAAPRSQIRSKPP--SPSQTSSKNHPQSKPTSQSRLKADSQPSSPSRPAFS 300
Query: 301 SQDSSMPPRSPSQEISRQQASEKTSRVQSPSHLSGKPTAQSTTQQPIESPTAIGDQTTDG 360
Q SS+ PRSPS E SRQQ S+K SRVQSPSHLS KPTAQST+QQ ESP IGDQTT
Sbjct: 301 PQASSI-PRSPSHENSRQQPSKKASRVQSPSHLSSKPTAQSTSQQLTESPATIGDQTTKR 360
Query: 361 IISHPANQSPKARPTNRETQLQTKSTQPLKPDRKPVESKASKNQPETKEELTSKNTSNPH 420
++SHPA+QSP+AR RE Q+QTKS Q KPD KPVE KASK+QPET EE SKNTS PH
Sbjct: 361 VVSHPADQSPRARCKRRENQVQTKSKQSPKPDLKPVEFKASKHQPETMEEFISKNTSYPH 420
Query: 421 PNQDYSEIPTQSDQNMENGLHSFLESQAESKETK------EDLAKTTNALQTKASRSTLI 480
+QD+SEIP D+ +ENG + LESQ ES+E+K EDL KTTNALQ AS+S LI
Sbjct: 421 SDQDFSEIPIIIDETIENGPETSLESQTESQESKEIKSYEEDLEKTTNALQINASKSKLI 480
Query: 481 TSSKSRSSFEPEKWYSQQEESMEDLSKVFQKLNIKYSDKENQKSFTTLIGDNKGSSMHLL 540
TS++ S FEPE SQQE +MEDLSK FQ LNIKY + EN KSFTTL GDNKG+SMHLL
Sbjct: 481 TSAEITSPFEPENSDSQQEGTMEDLSKAFQTLNIKYPE-ENPKSFTTLTGDNKGASMHLL 540
Query: 541 SGEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPSPLELYINVNVQ 600
SGEA ES IHIHRQYKSDPD+ P+SST+IEGN N ETPQDS+TEE+P PLELYIN+NVQ
Sbjct: 541 SGEATKESSIHIHRQYKSDPDKGPESSTDIEGNSNEETPQDSKTEEDP-PLELYININVQ 600
Query: 601 GINNSIMCNTSFIENDPGVKLKFPREPTKSEDELEAHHARKAEYSPRPAEKLTYEPRVRR 660
GINNS++ N+SF EN+PG+KLKF + TKSED+ + A+KA+Y+ + E TYEP VRR
Sbjct: 601 GINNSLLSNSSFTENNPGIKLKFVPQQTKSEDKSHSLQAQKAKYTAKHTENHTYEPTVRR 628
Query: 661 RCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKG 688
RCL G+LMESSDS+ +N EK RRHGCRY + +G
Sbjct: 661 RCLGGLLMESSDSDGDNSEKPRRHGCRYRGSFEG 628
BLAST of HG10004848 vs. NCBI nr
Match:
XP_023002262.1 (cell wall protein RBR3-like [Cucurbita maxima])
HSP 1 Score: 605.9 bits (1561), Expect = 4.3e-169
Identity = 405/695 (58.27%), Positives = 478/695 (68.78%), Query Frame = 0
Query: 1 MSHSQLRILLPWQSLKASPHPANESSGWSFEPTDETETSASAADTVPNIRHQPVQSLEIN 60
M++ Q R LPWQS+KAS P NESS S EPTDE ETS SAADTVP ++H P+QS E
Sbjct: 1 MTYRQFRFRLPWQSIKASSRPENESSTRSSEPTDEAETSNSAADTVPYMQHLPLQSHETK 60
Query: 61 PEQPPLAPAQALEKSETMPPSKSHKASKVQSQPPSNSRAKNRSRTASKPPSQSKTIPQSS 120
PE PL AQA E+SETM PSKSHK +KV SQP S+SRAK ++RTA+KPPS SK PQSS
Sbjct: 61 PELSPLESAQAPERSETMLPSKSHKKAKVHSQPSSHSRAKKQTRTATKPPSASKVTPQSS 120
Query: 121 VAS-KSPSTSSKGSPSQDTSKPSSPAGKASPSQDASSKPSSPAAVASTAPRTRIALKPSS 180
V+S KSP+TS+K SPS D SKPSS AGK SPS D +SK SSPA S
Sbjct: 121 VSSNKSPTTSAKASPSHDASKPSSSAGKVSPSHD-TSKLSSPAGKGKV-----------S 180
Query: 181 PSSQTSSKSHPNKKPTSQSRIKADSQPSSSSRSAFPSQDFSMPLRSPSQENSRQQPLEKT 240
PS T S PS + AFPS+D S P
Sbjct: 181 PSRDT-------------------SMPSPPAGKAFPSRDASQP--------------SSA 240
Query: 241 SRVQSPSHLSSKPTAQLTSQQPIKSPAAIGTQIHPNSKPSSQSRFKADSQPSSPSRSAFP 300
+ SH+ SKP SP+ ++ H +SK +SQSR KADSQPSSPSR AF
Sbjct: 241 AAAAPRSHIRSKP----------PSPSQTSSKNHLHSKQTSQSRLKADSQPSSPSRPAFS 300
Query: 301 SQDSSMPPRSPSQEISRQQASEKTSRVQSPSHLSGKPTAQSTTQQPIESPTAIGDQTTDG 360
Q SS+ PRSPS E SRQQ S+K SRVQSPSHLS K TAQST+QQ ESP IGDQTT
Sbjct: 301 PQASSI-PRSPSHENSRQQPSKKASRVQSPSHLSSKATAQSTSQQLTESPATIGDQTTKR 360
Query: 361 IISHPANQSPKARPTNRETQLQTKSTQPLKPDRKPVESKASKNQPETKEELTSKNTSNPH 420
++SHPA+QSP+AR ++E Q+QTKS Q KPD KPVE KASK+QPET EE SKNTS P
Sbjct: 361 VVSHPADQSPRARCKSKENQVQTKSKQSPKPDLKPVEFKASKHQPETMEEFISKNTSYPL 420
Query: 421 PNQDYSEIPTQSDQNMENGLHSFLESQAESKETK------EDLAKTTNALQTKASRSTLI 480
N+D+SEIP D+ +ENG LESQ ES+E+K EDL KTTNALQ AS+S LI
Sbjct: 421 SNEDFSEIPIIIDETIENGPEPSLESQTESQESKEIKSYEEDLEKTTNALQINASKSKLI 480
Query: 481 TSSKSRSSFEPEKWYSQQEESMEDLSKVFQKLNIKYSDKENQKSFTTLIGDNKGSSMHLL 540
TS++ S FEPE SQQE +MEDL K FQ LNIKY + EN KSFTTL GDNKG+SMHL+
Sbjct: 481 TSAEITSPFEPENSDSQQEGTMEDLPKAFQTLNIKYPE-ENPKSFTTLTGDNKGASMHLI 540
Query: 541 SGEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPSPLELYINVNVQ 600
SGEA ES IHIHRQYKSDPD+ P+SST+IEGN N ETPQDS+TEE+P PLELYIN+NVQ
Sbjct: 541 SGEATKESSIHIHRQYKSDPDKVPESSTDIEGNSNEETPQDSKTEEDP-PLELYININVQ 600
Query: 601 GINNSIMCNTSFIENDPGVKLKFPREPTKSEDELEAHHARKAEYSPRPAEKLTYEPRVRR 660
GINNS++ N+SF EN+PG+KLKF + TKSE++ + A+KA+Y+ + E TYEP VRR
Sbjct: 601 GINNSLLSNSSFTENNPGIKLKFVPQQTKSENKPHSLQAQKAKYTAKHTENHTYEPTVRR 637
Query: 661 RCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKGK 689
RCL G+LMESSDS+ +N EK RRHGCRY + +GK
Sbjct: 661 RCLGGLLMESSDSDGDNSEKPRRHGCRYRGSFEGK 637
BLAST of HG10004848 vs. ExPASy TrEMBL
Match:
A0A5A7VAN0 (Flocculation protein FLO11 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold82G005610 PE=4 SV=1)
HSP 1 Score: 828.2 bits (2138), Expect = 2.6e-236
Identity = 528/793 (66.58%), Positives = 579/793 (73.01%), Query Frame = 0
Query: 1 MSHSQLRILLPWQSLKASPHPANESSGWSFEPTDETETSASAADTVPNIRHQPVQSLEIN 60
MS SQLRILLPWQSLKASP PANES SF PTDE+E+SAS ADT PNIRHQP QS EI
Sbjct: 1 MSLSQLRILLPWQSLKASPRPANESPEGSFGPTDESESSASQADTAPNIRHQPDQSPEIK 60
Query: 61 PEQPPLAPAQALEKSETMPPSKSHKASKVQSQPPSNSRAKNRSRTASKPPSQSKTIPQSS 120
PE+PPLA AQA E+SETMPPSKSHK K+ SQ +NSRAKNRSRTASKP S IPQS
Sbjct: 61 PEEPPLATAQAAERSETMPPSKSHKEGKIHSQLSTNSRAKNRSRTASKPSSPLNAIPQSP 120
Query: 121 VAS-KSPSTSSKGSPSQDTSKPSSPAGKA-SPSQDASSKPSSPAAVASTAPRTRIALKPS 180
+AS K PSTS KGS SQD+SKPSSPAGK SPSQDASSKPSSPA VA+TAP RIA K S
Sbjct: 121 LASNKYPSTSGKGSKSQDSSKPSSPAGKVFSPSQDASSKPSSPATVAATAP--RIASKAS 180
Query: 181 SPSSQTSSKSHPNKKPT------------------------------------------- 240
S SSQ S+K HP+ KPT
Sbjct: 181 SSSSQASNKKHPSSKPTSKLRFKADSQPSSPSRSAFPSQDPSMPPQSLSQEKSRQQASEK 240
Query: 241 -----------------------------------------SQSRIKADSQPSSSSRSAF 300
SQSR K DSQPS SSRS F
Sbjct: 241 SSRVQSPSHFSRKPTTQSTSKQPVESPATIGIQSHPNSKAPSQSRFKTDSQPSPSSRSTF 300
Query: 301 PSQDFSMPLRSPSQENSRQQPLEKTSRVQSPSHLSSKPTAQLTSQQPIKSPAAIGTQIHP 360
PSQDFSMP RSPS ENSRQQP +KTS VQSPSH S KPTAQ TS+QPI+SPA IG Q HP
Sbjct: 301 PSQDFSMPPRSPSHENSRQQPSDKTSGVQSPSHSSRKPTAQSTSKQPIESPATIGIQHHP 360
Query: 361 NSKPSSQSRFKADSQPSSPSRSAFPSQDSSMPPRSPSQEISRQQASEKTSRVQSPSHLSG 420
N KPSSQSRFKA+S+PSS S+S FPSQDSSMPPRSPSQE S Q SEKTSRVQSPS+LS
Sbjct: 361 NLKPSSQSRFKAESRPSSSSKSKFPSQDSSMPPRSPSQENSLQPPSEKTSRVQSPSNLSR 420
Query: 421 KPTAQSTTQQPIESPTAIGDQTTDGIISHPANQSPKARPTNRETQLQTKSTQPLKPDRKP 480
KPTA ST+QQPIES +IGDQTTDGI+S PA SPKA PT+ E Q+Q KS + +P+ KP
Sbjct: 421 KPTAPSTSQQPIESTASIGDQTTDGILSDPATPSPKAIPTSGEIQIQAKSKKSPEPNVKP 480
Query: 481 VESKASKNQPETKEELT----------SKNTSNPHPNQDYSEIPTQSDQNMENGLHSFLE 540
VE KASKNQ +TKEELT SKNTSNPH ++D SE PTQSD+ +E GL S LE
Sbjct: 481 VELKASKNQNDTKEELTSKNETKEELASKNTSNPHSDEDSSENPTQSDETVERGLDSSLE 540
Query: 541 SQAESKETKEDLAKTTNALQTKASRSTLITSSKSRSSFEPEKWYSQQEESMEDLSKVFQK 600
SQ ESKETKED KTTNALQ KASRSTLITSSKSRSSFEPEK +QQ+ESMEDLSK F K
Sbjct: 541 SQTESKETKEDGGKTTNALQAKASRSTLITSSKSRSSFEPEK-NTQQDESMEDLSKAFNK 600
Query: 601 LNIKYSDKENQKSFTTLIGDNKGSSMHLLSGEAKSESPIHIHRQYKSDPDQSPKSSTEIE 660
LNIKYSD+EN KSFTT+IGDNKGSS+HLLSGEAKSES IH++ +YKS+PDQSPKSST I+
Sbjct: 601 LNIKYSDEENPKSFTTMIGDNKGSSVHLLSGEAKSESSIHVNHRYKSNPDQSPKSSTNIK 660
Query: 661 GNFNNETPQDSRTEENPS--PLELYINVNVQGINNSIMCNTSFIENDPGVKLKFP--REP 694
N NNETPQDS TEENP PLELYIN NVQGINNSIM NTSF EN+PG+KLKFP EP
Sbjct: 661 ENSNNETPQDSTTEENPDPPPLELYINHNVQGINNSIMFNTSFTENNPGIKLKFPGDGEP 720
BLAST of HG10004848 vs. ExPASy TrEMBL
Match:
A0A6J1GK50 (cell wall protein RBR3-like OS=Cucurbita moschata OX=3662 GN=LOC111454611 PE=4 SV=1)
HSP 1 Score: 610.5 bits (1573), Expect = 8.5e-171
Identity = 407/694 (58.65%), Positives = 480/694 (69.16%), Query Frame = 0
Query: 1 MSHSQLRILLPWQSLKASPHPANESSGWSFEPTDETETSASAADTVPNIRHQPVQSLEIN 60
M++ Q R LPWQS+KAS NESS S EPTDE ETS SAADTVP ++H
Sbjct: 1 MTYRQFRFRLPWQSIKASSRLENESSTRSSEPTDEAETSNSAADTVPYMQHL-------- 60
Query: 61 PEQPPLAPAQALEKSETMPPSKSHKASKVQSQPPSNSRAKNRSRTASKPPSQSKTIPQSS 120
PE PL AQA E+SETM PSKSHK +KV SQP S+SRAK ++RTA+KPPS SK PQSS
Sbjct: 61 PELSPLESAQAPERSETMLPSKSHKKAKVHSQPSSHSRAKKQTRTATKPPSASKVTPQSS 120
Query: 121 VAS-KSPSTSSKGSPSQDTSKPSSPAGKASPSQDASSKPSSPAAVASTAPRTRIALKPSS 180
V+S KSP+TS+K SPS D SKPSS AGK SPS D +SK SSPA +P
Sbjct: 121 VSSNKSPTTSAKASPSHDASKPSSSAGKVSPSHD-TSKLSSPAGKGKVSP---------- 180
Query: 181 PSSQTSSKSHPNKKPTSQSRIKADSQPSSSSRSAFPSQDFSMPLRSPSQENSRQQPLEKT 240
SH S+PSS + AFPS+D S P
Sbjct: 181 --------SHDT------------SKPSSPAGKAFPSRDASQP----------------- 240
Query: 241 SRVQSPSHLSSKPTAQLTSQQPIKSPAAIGTQIHPNSKPSSQSRFKADSQPSSPSRSAFP 300
S ++ P +Q+ S+ P SP+ ++ HP SKP+SQSR KADSQPSSPSR AF
Sbjct: 241 -----SSSAAAAPRSQIRSKPP--SPSQTSSKNHPQSKPTSQSRLKADSQPSSPSRPAFS 300
Query: 301 SQDSSMPPRSPSQEISRQQASEKTSRVQSPSHLSGKPTAQSTTQQPIESPTAIGDQTTDG 360
Q SS+ PRSPS E SRQQ S+K SRVQSPSHLS KPTAQST+QQ ESP IGDQTT
Sbjct: 301 PQASSI-PRSPSHENSRQQPSKKASRVQSPSHLSSKPTAQSTSQQLTESPATIGDQTTKR 360
Query: 361 IISHPANQSPKARPTNRETQLQTKSTQPLKPDRKPVESKASKNQPETKEELTSKNTSNPH 420
++SHPA+QSP+AR RE Q+QTKS Q KPD KPVE KASK+QPET EE SKNTS PH
Sbjct: 361 VVSHPADQSPRARCKRRENQVQTKSKQSPKPDLKPVEFKASKHQPETMEEFISKNTSYPH 420
Query: 421 PNQDYSEIPTQSDQNMENGLHSFLESQAESKETK------EDLAKTTNALQTKASRSTLI 480
+QD+SEIP D+ +ENG + LESQ ES+E+K EDL KTTNALQ AS+S LI
Sbjct: 421 SDQDFSEIPIIIDETIENGPETSLESQTESQESKEIKSYEEDLEKTTNALQINASKSKLI 480
Query: 481 TSSKSRSSFEPEKWYSQQEESMEDLSKVFQKLNIKYSDKENQKSFTTLIGDNKGSSMHLL 540
TS++ S FEPE SQQE +MEDLSK FQ LNIKY + EN KSFTTL GDNKG+SMHLL
Sbjct: 481 TSAEITSPFEPENSDSQQEGTMEDLSKAFQTLNIKYPE-ENPKSFTTLTGDNKGASMHLL 540
Query: 541 SGEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPSPLELYINVNVQ 600
SGEA ES IHIHRQYKSDPD+ P+SST+IEGN N ETPQDS+TEE+P PLELYIN+NVQ
Sbjct: 541 SGEATKESSIHIHRQYKSDPDKGPESSTDIEGNSNEETPQDSKTEEDP-PLELYININVQ 600
Query: 601 GINNSIMCNTSFIENDPGVKLKFPREPTKSEDELEAHHARKAEYSPRPAEKLTYEPRVRR 660
GINNS++ N+SF EN+PG+KLKF + TKSED+ + A+KA+Y+ + E TYEP VRR
Sbjct: 601 GINNSLLSNSSFTENNPGIKLKFVPQQTKSEDKSHSLQAQKAKYTAKHTENHTYEPTVRR 628
Query: 661 RCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKG 688
RCL G+LMESSDS+ +N EK RRHGCRY + +G
Sbjct: 661 RCLGGLLMESSDSDGDNSEKPRRHGCRYRGSFEG 628
BLAST of HG10004848 vs. ExPASy TrEMBL
Match:
A0A6J1KJ10 (cell wall protein RBR3-like OS=Cucurbita maxima OX=3661 GN=LOC111496164 PE=4 SV=1)
HSP 1 Score: 605.9 bits (1561), Expect = 2.1e-169
Identity = 405/695 (58.27%), Positives = 478/695 (68.78%), Query Frame = 0
Query: 1 MSHSQLRILLPWQSLKASPHPANESSGWSFEPTDETETSASAADTVPNIRHQPVQSLEIN 60
M++ Q R LPWQS+KAS P NESS S EPTDE ETS SAADTVP ++H P+QS E
Sbjct: 1 MTYRQFRFRLPWQSIKASSRPENESSTRSSEPTDEAETSNSAADTVPYMQHLPLQSHETK 60
Query: 61 PEQPPLAPAQALEKSETMPPSKSHKASKVQSQPPSNSRAKNRSRTASKPPSQSKTIPQSS 120
PE PL AQA E+SETM PSKSHK +KV SQP S+SRAK ++RTA+KPPS SK PQSS
Sbjct: 61 PELSPLESAQAPERSETMLPSKSHKKAKVHSQPSSHSRAKKQTRTATKPPSASKVTPQSS 120
Query: 121 VAS-KSPSTSSKGSPSQDTSKPSSPAGKASPSQDASSKPSSPAAVASTAPRTRIALKPSS 180
V+S KSP+TS+K SPS D SKPSS AGK SPS D +SK SSPA S
Sbjct: 121 VSSNKSPTTSAKASPSHDASKPSSSAGKVSPSHD-TSKLSSPAGKGKV-----------S 180
Query: 181 PSSQTSSKSHPNKKPTSQSRIKADSQPSSSSRSAFPSQDFSMPLRSPSQENSRQQPLEKT 240
PS T S PS + AFPS+D S P
Sbjct: 181 PSRDT-------------------SMPSPPAGKAFPSRDASQP--------------SSA 240
Query: 241 SRVQSPSHLSSKPTAQLTSQQPIKSPAAIGTQIHPNSKPSSQSRFKADSQPSSPSRSAFP 300
+ SH+ SKP SP+ ++ H +SK +SQSR KADSQPSSPSR AF
Sbjct: 241 AAAAPRSHIRSKP----------PSPSQTSSKNHLHSKQTSQSRLKADSQPSSPSRPAFS 300
Query: 301 SQDSSMPPRSPSQEISRQQASEKTSRVQSPSHLSGKPTAQSTTQQPIESPTAIGDQTTDG 360
Q SS+ PRSPS E SRQQ S+K SRVQSPSHLS K TAQST+QQ ESP IGDQTT
Sbjct: 301 PQASSI-PRSPSHENSRQQPSKKASRVQSPSHLSSKATAQSTSQQLTESPATIGDQTTKR 360
Query: 361 IISHPANQSPKARPTNRETQLQTKSTQPLKPDRKPVESKASKNQPETKEELTSKNTSNPH 420
++SHPA+QSP+AR ++E Q+QTKS Q KPD KPVE KASK+QPET EE SKNTS P
Sbjct: 361 VVSHPADQSPRARCKSKENQVQTKSKQSPKPDLKPVEFKASKHQPETMEEFISKNTSYPL 420
Query: 421 PNQDYSEIPTQSDQNMENGLHSFLESQAESKETK------EDLAKTTNALQTKASRSTLI 480
N+D+SEIP D+ +ENG LESQ ES+E+K EDL KTTNALQ AS+S LI
Sbjct: 421 SNEDFSEIPIIIDETIENGPEPSLESQTESQESKEIKSYEEDLEKTTNALQINASKSKLI 480
Query: 481 TSSKSRSSFEPEKWYSQQEESMEDLSKVFQKLNIKYSDKENQKSFTTLIGDNKGSSMHLL 540
TS++ S FEPE SQQE +MEDL K FQ LNIKY + EN KSFTTL GDNKG+SMHL+
Sbjct: 481 TSAEITSPFEPENSDSQQEGTMEDLPKAFQTLNIKYPE-ENPKSFTTLTGDNKGASMHLI 540
Query: 541 SGEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPSPLELYINVNVQ 600
SGEA ES IHIHRQYKSDPD+ P+SST+IEGN N ETPQDS+TEE+P PLELYIN+NVQ
Sbjct: 541 SGEATKESSIHIHRQYKSDPDKVPESSTDIEGNSNEETPQDSKTEEDP-PLELYININVQ 600
Query: 601 GINNSIMCNTSFIENDPGVKLKFPREPTKSEDELEAHHARKAEYSPRPAEKLTYEPRVRR 660
GINNS++ N+SF EN+PG+KLKF + TKSE++ + A+KA+Y+ + E TYEP VRR
Sbjct: 601 GINNSLLSNSSFTENNPGIKLKFVPQQTKSENKPHSLQAQKAKYTAKHTENHTYEPTVRR 637
Query: 661 RCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKGK 689
RCL G+LMESSDS+ +N EK RRHGCRY + +GK
Sbjct: 661 RCLGGLLMESSDSDGDNSEKPRRHGCRYRGSFEGK 637
BLAST of HG10004848 vs. ExPASy TrEMBL
Match:
A0A1S4DVD0 (micronuclear linker histone polyprotein OS=Cucumis melo OX=3656 GN=LOC103487982 PE=4 SV=1)
HSP 1 Score: 507.3 bits (1305), Expect = 1.0e-139
Identity = 295/403 (73.20%), Positives = 329/403 (81.64%), Query Frame = 0
Query: 305 MPPRSPSQEISRQQASEKTSRVQSPSHLSGKPTAQSTTQQPIESPTAIGDQTTDGIISHP 364
MPPRSPSQE S Q SEKTSRVQSPS+LS KPTA ST+QQPIES +IGDQTTDGI+S P
Sbjct: 1 MPPRSPSQENSLQPPSEKTSRVQSPSNLSRKPTAPSTSQQPIESTASIGDQTTDGILSDP 60
Query: 365 ANQSPKARPTNRETQLQTKSTQPLKPDRKPVESKASKNQPETKEELT----------SKN 424
A SPKA PT+ E Q+Q KS + +P+ KPVE KASKNQ +TKEELT SKN
Sbjct: 61 ATPSPKAIPTSGEIQIQAKSKKSPEPNVKPVELKASKNQNDTKEELTSKNETKEELASKN 120
Query: 425 TSNPHPNQDYSEIPTQSDQNMENGLHSFLESQAESKETKEDLAKTTNALQTKASRSTLIT 484
TSNPH ++D SE PTQSD+ +E GL S LESQ ESKETKED KTTNALQ KASRSTLIT
Sbjct: 121 TSNPHSDEDSSENPTQSDETVERGLDSSLESQTESKETKEDGGKTTNALQAKASRSTLIT 180
Query: 485 SSKSRSSFEPEKWYSQQEESMEDLSKVFQKLNIKYSDKENQKSFTTLIGDNKGSSMHLLS 544
SSKSRSSFEPEK +QQ+ESMEDLSK F KLNIKYSD+EN KSFTT+IGDNKGSS+HLLS
Sbjct: 181 SSKSRSSFEPEK-NTQQDESMEDLSKAFNKLNIKYSDEENPKSFTTMIGDNKGSSVHLLS 240
Query: 545 GEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPS--PLELYINVNV 604
GEAKSES IH++ +YKS+PDQSPKSST I+ N NNETPQDS TEENP PLELYIN NV
Sbjct: 241 GEAKSESSIHVNHRYKSNPDQSPKSSTNIKENSNNETPQDSTTEENPDPPPLELYINHNV 300
Query: 605 QGINNSIMCNTSFIENDPGVKLKFP--REPTKSEDELEAHHARKAEYSPRPAEKLTYEPR 664
QGINNSIM NTSF EN+PG+KLKFP EPT S+DELE+HH RK+ Y P PAEK+TYEPR
Sbjct: 301 QGINNSIMFNTSFTENNPGIKLKFPGDGEPTNSQDELESHHTRKSTYIPTPAEKVTYEPR 360
Query: 665 VRRRCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKGKEVETL 694
+RRR L G+LMES DSE ENP K R HGCRYSR+SKGK+VETL
Sbjct: 361 IRRRYLGGLLMESGDSEDENPRKLRCHGCRYSRSSKGKKVETL 402
BLAST of HG10004848 vs. ExPASy TrEMBL
Match:
A0A0A0LLH1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G361770 PE=4 SV=1)
HSP 1 Score: 489.6 bits (1259), Expect = 2.2e-134
Identity = 286/402 (71.14%), Positives = 323/402 (80.35%), Query Frame = 0
Query: 305 MPPRSPSQEISRQQASEKTSRVQSPSHLSGKPTAQSTTQQPIESPTAIGDQTTDGIISHP 364
MPPRSPSQE S Q SEKT RVQSPSHLS KPTAQST+QQPIE +IGDQTTD I+S P
Sbjct: 1 MPPRSPSQENSLQPPSEKTFRVQSPSHLSRKPTAQSTSQQPIEPTASIGDQTTDRILSDP 60
Query: 365 ANQSPKARPTNRETQLQTKSTQPLKPDRKPVESKASKNQPETKEELT----------SKN 424
AN SPKA PT+ E+Q+Q +S + KP+ KPVE + SK Q ETKEELT SKN
Sbjct: 61 ANPSPKAIPTSGESQIQAESKKSPKPNVKPVELEESKTQHETKEELTSKNENKEELASKN 120
Query: 425 TSNPHPNQDYSEIPTQSDQNMENGLHSFLESQAESKETKEDLAKTTNALQTKASRSTLIT 484
TSNPH +D SE PTQSDQ +E GL S LESQ ESKETKED AKTTNA QTKASRSTLIT
Sbjct: 121 TSNPHSYKDSSENPTQSDQAIEKGLDSSLESQTESKETKEDGAKTTNAFQTKASRSTLIT 180
Query: 485 SSKSRSSFEPEKWYSQQEESMEDLSKVFQKLNIKYSDKENQKSFTTLIGDNKGSSMHLLS 544
SSKSRSSFEPE +QQ+ESMEDLSK F KLNIKYSD+EN KS TT+IGDNKG+SMHLLS
Sbjct: 181 SSKSRSSFEPEN-NTQQDESMEDLSKAFNKLNIKYSDEENPKSLTTMIGDNKGTSMHLLS 240
Query: 545 GEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPS--PLELYINVNV 604
EAKSES IH++ YKS+PDQSP+SST+I+ N NNET +DS TEENP PLELYIN+NV
Sbjct: 241 DEAKSESSIHVNHHYKSNPDQSPESSTDIKENSNNETAKDSTTEENPDPPPLELYINLNV 300
Query: 605 QGINNSIMCNTSFIENDPGVKLKFPREPTKSEDELEA-HHARKAEYSPRPAEKLTYEPRV 664
QGINNSI NTSF EN+PG+KLKFP EPT +DELE+ HH RK++Y PAEK+TY+PR+
Sbjct: 301 QGINNSITFNTSFTENNPGIKLKFPGEPTNCQDELESDHHTRKSKYIATPAEKVTYDPRI 360
Query: 665 RRRCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKGKEVETL 694
RRRCL G+LMESSDSE ENP K + HGCRYS +SKGKEVETL
Sbjct: 361 RRRCLEGLLMESSDSEDENPGKLQPHGCRYSGSSKGKEVETL 401
BLAST of HG10004848 vs. TAIR 10
Match:
AT1G75260.1 (oxidoreductases, acting on NADH or NADPH)
HSP 1 Score: 79.7 bits (195), Expect = 1.0e-14
Identity = 157/567 (27.69%), Positives = 242/567 (42.68%), Query Frame = 0
Query: 130 SKGSPSQDTSKPSSPAGKASPSQDASSKPSSPAAVASTAPRTRIALKPSSPSSQTSSKSH 189
S SPS+ +S SSP+ +P S P PA +A +PS S+T K+
Sbjct: 12 SSNSPSRISSGTSSPSPPPTP---PSRPPFRPAGIA----------QPS--KSETKPKAS 71
Query: 190 PNKKPTSQSRIKADSQPSSSSRSAFPSQDFSMPLRSPSQENSRQQPLEKTSRVQSPSHLS 249
P+ S+SR + +SSS S PS + P R Q N + S
Sbjct: 72 PS---LSRSRSNVAALAASSSASQLPSLGAATPTRLAKQTNQQ----------------S 131
Query: 250 SKPTAQLTSQQPIKSPAAIGTQIHPNSKPSSQSRFKADSQPSSPSRSAFPSQDSSMPPRS 309
P+ +L S + + T+ P + + P A P +
Sbjct: 132 GSPSKKLDSLR--MEEQKVATKEKPPGETEKIAEENISPVKEKPPIGARPEEHLEQKETE 191
Query: 310 PSQEISRQQASEKTSRVQSPSHL---SGKPTAQSTTQQPIESPTAIGDQTTDGIISHPAN 369
QE R + + ++ L SGK +A + QQ IE I Q ++
Sbjct: 192 AVQEQGRNTEAARLVVQENKKVLPEGSGKKSAANQGQQKIEEIEKIALQERKKVLHDDGV 251
Query: 370 QSPKARPTNRETQLQTKSTQPLKPDRKPVESKASKNQPETKEELTSKNTSNPHPNQDYSE 429
Q +A + Q ++K T+ L +A + T+ + T + + +
Sbjct: 252 QKLEA----DQGQQKSKETEKLALQETKRSLQAVGREDATRSKTTRHMAAASETTRGPRD 311
Query: 430 IPTQSDQNMENGLHSFLESQAESKETKEDLAKTTNALQTKASRSTLITSSKSRSSFEPEK 489
+P + E+Q ++ +D + T T + +T+ + S
Sbjct: 312 LPEKK-----------TETQNRTEIPTDDNHQKTKGALTSNLGNPRVTNREGSS------ 371
Query: 490 WYSQQEESMEDLSKVFQKLNIKYSDKENQK-SFTTLIGDNKGSSMHLLSGEAKSESPIHI 549
S + ED+ KL S+ +++ S TL G+NKG++M + S + K + +HI
Sbjct: 372 --SMSRKIKEDIRDGISKLTWGKSNGDDKSVSVYTLTGENKGATMGIGSEKDKKDGEVHI 431
Query: 550 HRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPSPLELYINVNVQGINNSIMCNTSF 609
R Y+S+PD+S ++ E P+D EE S YIN N QGINNSI+ +S
Sbjct: 432 RRGYRSNPDESSNTTAT-----ETENPKDDEAEEEAS-FTAYINGNTQGINNSIVVESSV 491
Query: 610 IENDPGVKLKFPREPTKSEDELEAHHA-RKAEYSPRPAEKLTYEPRVRRRCLRGMLMESS 669
ENDPGV + F E K E + K + +KL EPRVRRRCLRG+L ESS
Sbjct: 492 SENDPGVHMSFKFEILKKEVIYPPENVEEKKPETVTVTKKLKNEPRVRRRCLRGLLAESS 511
Query: 670 DSEVENPEKSRRHGCRYSRNSKGKEVE 692
+SE +NP K RRHGCR++ K K++E
Sbjct: 552 ESEPDNPLKPRRHGCRFT--CKDKDIE 511
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
A0A5A7VAN0 | 2.6e-236 | 66.58 | Flocculation protein FLO11 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaff...
A0A6J1GK50 | 8.5e-171 | 58.65 | cell wall protein RBR3-like OS=Cucurbita moschata OX=3662 GN=LOC111454611 PE=4 S...
A0A6J1KJ10 | 2.1e-169 | 58.27 | cell wall protein RBR3-like OS=Cucurbita maxima OX=3661 GN=LOC111496164 PE=4 SV=...
A0A1S4DVD0 | 1.0e-139 | 73.20 | micronuclear linker histone polyprotein OS=Cucumis melo OX=3656 GN=LOC103487982 ...
A0A0A0LLH1 | 2.2e-134 | 71.14 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G361770 PE=4 SV=1
AT1G75260.1 | 1.0e-14 | 27.69 | oxidoreductases, acting on NADH or NADPH
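Pipe-delimited summary rows like those above are straightforward to load for filtering. The following is a minimal sketch; the function name `parse_rows` and the `1e-50` cutoff are illustrative assumptions rather than anything the page prescribes, and any columns beyond the first four are ignored.

```python
# Minimal sketch: parse pipe-delimited BLAST summary rows and filter by E-value.
# `parse_rows` and the 1e-50 cutoff are illustrative assumptions.
ROWS = """\
Match Name | E-value | Identity | Description
A0A0A0LLH1 | 2.2e-134 | 71.14 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G361770 PE=4 SV=1
AT1G75260.1 | 1.0e-14 | 27.69 | oxidoreductases, acting on NADH or NADPH
"""


def parse_rows(text: str) -> list:
    """Turn summary rows into dictionaries, skipping headers and blank lines."""
    hits = []
    for line in text.splitlines():
        fields = [field.strip() for field in line.split("|")]
        if len(fields) < 4 or fields[0] == "Match Name":
            continue  # header row, blank line, or malformed row
        name, evalue, identity, description = fields[:4]
        hits.append({
            "name": name,
            "evalue": float(evalue),
            "identity_pct": float(identity),
            "description": description,
        })
    return hits


hits = parse_rows(ROWS)
# Keep only hits below the (hypothetical) E-value cutoff.
strong = [hit["name"] for hit in hits if hit["evalue"] < 1e-50]
print(strong)  # ['A0A0A0LLH1']
```

The description column is kept as a plain string because several entries in the table are truncated with an ellipsis and cannot be split reliably into `OS=`, `OX=`, and `GN=` fields.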