HG10023461 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10023461
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Serine/arginine repetitive matrix protein 1-like
Location: Chr05: 34430003 .. 34431688 (-)
RNA-Seq Expression: HG10023461
Synteny: HG10023461
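The Location field can be turned into a span length with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common annotation convention, but not stated on this page); the helper name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Split a string like 'Chr05: 34430003 .. 34431688 (-)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

chrom, start, end, strand = parse_location("Chr05: 34430003 .. 34431688 (-)")

# Assumption: both endpoints are included in the feature.
length = end - start + 1
codons = length // 3

print(chrom, strand, length, codons)
```

Under that assumption the span is 1686 bp, which is 3 x 562, so it divides evenly into codons.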
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGGAAGATGGCAACGCGCCGCCGCCGTTCTGGCTTCAATCCTCCAACTCTCTACACGAACTTGACTACAATCGCCGCCGTCGACTCAACCGTGCATCGTCGTTCCTCCTCAACTCCAGCGCCTTTCTCATTGTTTTGTTAGTAATTGTTCTCTGTTTCATCTTGATTGTGATTCCCAAATTTGTACAGTTCACTTCTCAATTGATTCGGCCTCAATCGATCAAGAAGAGCTGGGATTCGCTCAATTTTCTTCTGGTTCTCTTCGCCATTGTTTGTGGATTTCTTAGTAGAAACACTGGGGATGATAGTAGAGACTCTTTTGAAGATCGGAGCGTTTCTTCGAGGCGAACTATGAAGTCAAACCCTACGACTCCGCGCCGATGGGATGGATATACCGATCATCGGCCGAATCATTACACCCTCAATCGGATGAAGAGTAGCAGTTCGTATCCGGATCTACGTCTGCAGGAGTCTTCATTGGATGCCGGTGATCACCGGTGGCGATTTTACGACGATACTCATGTGACTAATCATCGATATTCGTCCTCCGATCAGCTTCATCGCCGTCGTGAAACCCGGCCGGAGCTTGAACGCCTAGATTCTGATGTCCGAAGTATTGTTTTCGACAGATCTGAGATTCGTGAAGATATATATTCACAACCGGCGATACCTTCTCCACCGCCACCGCCGCCGCCGCCGCCACGGGTGTCTCCTCCGCGACCTCCATCACCGCCTCCAACCCCTCCGCCTCCAGCTAATACGACTCCTAAAGTGGTCAAACGAAGGCTAAAGAGAACGCTTAAGGTCCATAGCCATACACCCGATGGAGAGATTAATCAACAGCACGAAAATGGCGATTCGGACGTCGCAAATTTTCAACGGATTCAGCTTCCACCACTCTCGCCCCCGTCGTTCTATCGAGAATCGGAGCAGAAGAGCAACAAAAACGAGAAGAAGAGAGGTGGTGCTTCAAAAGAAATTTGGTCCGCACTGAGGAGGAGGAAGAAGAAGCAAAGACAAAAAAGCATCGAAAGTTTTGATGCCATCATCGCCTCCCAACGTGATCCAACATCGTCATTACCGCCGCCATCACCACCGCCGCTTCCCTCGCCATCAGTTCTGCAAAATCTATTTTCATCCAAGAAAGGAAAAGGCAAAAAAGTGCAGTCCACACCACTACCAGAGCCGCCTCCACCATCAACAGCCTCCTCAGAACCTAAACCAAAGATCGAAGATCAAAACCAGATCCACAAGTCTCACGAGCCTCCAATGGAGCTCGAGAGACTGAGCAGTTTAAACGACGAAGAGTATAATACGCGCATTGGCAGTGAGTCGCCATTCCATCCGATTCCTCCTCCTCCGCCGCCGCCGCCGCCGTTCAGAATGCACGGAGACTTTGACAGTGTAGGAAGCAATAGCAGTACACCGAGAGCCATTTCGCCGGAAATGGACGAGAGTGAAACCGATGGACCACCGCCGACCGGCGAAAGAAAGCTCGTGAAAGACTCAACAGCTCCGATGTTCTGTTCAAGCCCCGATGTCAACAGTAAAGCCGATAAGTTCATTGCAAGATTCAGAGCCGATTTGAAGTTGCAGAAGATGAATTCCATCAAAGAGAAGACGGGGAGAAAGAGATCTAACCTAGGCCGAACATCAGGCCCAGGCCCAAGTAAATCAAGATAA

mRNA sequence

ATGGAGGAAGATGGCAACGCGCCGCCGCCGTTCTGGCTTCAATCCTCCAACTCTCTACACGAACTTGACTACAATCGCCGCCGTCGACTCAACCGTGCATCGTCGTTCCTCCTCAACTCCAGCGCCTTTCTCATTGTTTTGTTAGTAATTGTTCTCTGTTTCATCTTGATTGTGATTCCCAAATTTGTACAGTTCACTTCTCAATTGATTCGGCCTCAATCGATCAAGAAGAGCTGGGATTCGCTCAATTTTCTTCTGGTTCTCTTCGCCATTGTTTGTGGATTTCTTAGTAGAAACACTGGGGATGATAGTAGAGACTCTTTTGAAGATCGGAGCGTTTCTTCGAGGCGAACTATGAAGTCAAACCCTACGACTCCGCGCCGATGGGATGGATATACCGATCATCGGCCGAATCATTACACCCTCAATCGGATGAAGAGTAGCAGTTCGTATCCGGATCTACGTCTGCAGGAGTCTTCATTGGATGCCGGTGATCACCGGTGGCGATTTTACGACGATACTCATGTGACTAATCATCGATATTCGTCCTCCGATCAGCTTCATCGCCGTCGTGAAACCCGGCCGGAGCTTGAACGCCTAGATTCTGATGTCCGAAGTATTGTTTTCGACAGATCTGAGATTCGTGAAGATATATATTCACAACCGGCGATACCTTCTCCACCGCCACCGCCGCCGCCGCCGCCACGGGTGTCTCCTCCGCGACCTCCATCACCGCCTCCAACCCCTCCGCCTCCAGCTAATACGACTCCTAAAGTGGTCAAACGAAGGCTAAAGAGAACGCTTAAGGTCCATAGCCATACACCCGATGGAGAGATTAATCAACAGCACGAAAATGGCGATTCGGACGTCGCAAATTTTCAACGGATTCAGCTTCCACCACTCTCGCCCCCGTCGTTCTATCGAGAATCGGAGCAGAAGAGCAACAAAAACGAGAAGAAGAGAGGTGGTGCTTCAAAAGAAATTTGGTCCGCACTGAGGAGGAGGAAGAAGAAGCAAAGACAAAAAAGCATCGAAAGTTTTGATGCCATCATCGCCTCCCAACGTGATCCAACATCGTCATTACCGCCGCCATCACCACCGCCGCTTCCCTCGCCATCAGTTCTGCAAAATCTATTTTCATCCAAGAAAGGAAAAGGCAAAAAAGTGCAGTCCACACCACTACCAGAGCCGCCTCCACCATCAACAGCCTCCTCAGAACCTAAACCAAAGATCGAAGATCAAAACCAGATCCACAAGTCTCACGAGCCTCCAATGGAGCTCGAGAGACTGAGCAGTTTAAACGACGAAGAGTATAATACGCGCATTGGCAGTGAGTCGCCATTCCATCCGATTCCTCCTCCTCCGCCGCCGCCGCCGCCGTTCAGAATGCACGGAGACTTTGACAGTGTAGGAAGCAATAGCAGTACACCGAGAGCCATTTCGCCGGAAATGGACGAGAGTGAAACCGATGGACCACCGCCGACCGGCGAAAGAAAGCTCGTGAAAGACTCAACAGCTCCGATGTTCTGTTCAAGCCCCGATGTCAACAGTAAAGCCGATAAGTTCATTGCAAGATTCAGAGCCGATTTGAAGTTGCAGAAGATGAATTCCATCAAAGAGAAGACGGGGAGAAAGAGATCTAACCTAGGCCGAACATCAGGCCCAGGCCCAAGTAAATCAAGATAA

Coding sequence (CDS)

ATGGAGGAAGATGGCAACGCGCCGCCGCCGTTCTGGCTTCAATCCTCCAACTCTCTACACGAACTTGACTACAATCGCCGCCGTCGACTCAACCGTGCATCGTCGTTCCTCCTCAACTCCAGCGCCTTTCTCATTGTTTTGTTAGTAATTGTTCTCTGTTTCATCTTGATTGTGATTCCCAAATTTGTACAGTTCACTTCTCAATTGATTCGGCCTCAATCGATCAAGAAGAGCTGGGATTCGCTCAATTTTCTTCTGGTTCTCTTCGCCATTGTTTGTGGATTTCTTAGTAGAAACACTGGGGATGATAGTAGAGACTCTTTTGAAGATCGGAGCGTTTCTTCGAGGCGAACTATGAAGTCAAACCCTACGACTCCGCGCCGATGGGATGGATATACCGATCATCGGCCGAATCATTACACCCTCAATCGGATGAAGAGTAGCAGTTCGTATCCGGATCTACGTCTGCAGGAGTCTTCATTGGATGCCGGTGATCACCGGTGGCGATTTTACGACGATACTCATGTGACTAATCATCGATATTCGTCCTCCGATCAGCTTCATCGCCGTCGTGAAACCCGGCCGGAGCTTGAACGCCTAGATTCTGATGTCCGAAGTATTGTTTTCGACAGATCTGAGATTCGTGAAGATATATATTCACAACCGGCGATACCTTCTCCACCGCCACCGCCGCCGCCGCCGCCACGGGTGTCTCCTCCGCGACCTCCATCACCGCCTCCAACCCCTCCGCCTCCAGCTAATACGACTCCTAAAGTGGTCAAACGAAGGCTAAAGAGAACGCTTAAGGTCCATAGCCATACACCCGATGGAGAGATTAATCAACAGCACGAAAATGGCGATTCGGACGTCGCAAATTTTCAACGGATTCAGCTTCCACCACTCTCGCCCCCGTCGTTCTATCGAGAATCGGAGCAGAAGAGCAACAAAAACGAGAAGAAGAGAGGTGGTGCTTCAAAAGAAATTTGGTCCGCACTGAGGAGGAGGAAGAAGAAGCAAAGACAAAAAAGCATCGAAAGTTTTGATGCCATCATCGCCTCCCAACGTGATCCAACATCGTCATTACCGCCGCCATCACCACCGCCGCTTCCCTCGCCATCAGTTCTGCAAAATCTATTTTCATCCAAGAAAGGAAAAGGCAAAAAAGTGCAGTCCACACCACTACCAGAGCCGCCTCCACCATCAACAGCCTCCTCAGAACCTAAACCAAAGATCGAAGATCAAAACCAGATCCACAAGTCTCACGAGCCTCCAATGGAGCTCGAGAGACTGAGCAGTTTAAACGACGAAGAGTATAATACGCGCATTGGCAGTGAGTCGCCATTCCATCCGATTCCTCCTCCTCCGCCGCCGCCGCCGCCGTTCAGAATGCACGGAGACTTTGACAGTGTAGGAAGCAATAGCAGTACACCGAGAGCCATTTCGCCGGAAATGGACGAGAGTGAAACCGATGGACCACCGCCGACCGGCGAAAGAAAGCTCGTGAAAGACTCAACAGCTCCGATGTTCTGTTCAAGCCCCGATGTCAACAGTAAAGCCGATAAGTTCATTGCAAGATTCAGAGCCGATTTGAAGTTGCAGAAGATGAATTCCATCAAAGAGAAGACGGGGAGAAAGAGATCTAACCTAGGCCGAACATCAGGCCCAGGCCCAAGTAAATCAAGATAA

Protein sequence

MEEDGNAPPPFWLQSSNSLHELDYNRRRRLNRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVQFTSQLIRPQSIKKSWDSLNFLLVLFAIVCGFLSRNTGDDSRDSFEDRSVSSRRTMKSNPTTPRRWDGYTDHRPNHYTLNRMKSSSSYPDLRLQESSLDAGDHRWRFYDDTHVTNHRYSSSDQLHRRRETRPELERLDSDVRSIVFDRSEIREDIYSQPAIPSPPPPPPPPPRVSPPRPPSPPPTPPPPANTTPKVVKRRLKRTLKVHSHTPDGEINQQHENGDSDVANFQRIQLPPLSPPSFYRESEQKSNKNEKKRGGASKEIWSALRRRKKKQRQKSIESFDAIIASQRDPTSSLPPPSPPPLPSPSVLQNLFSSKKGKGKKVQSTPLPEPPPPSTASSEPKPKIEDQNQIHKSHEPPMELERLSSLNDEEYNTRIGSESPFHPIPPPPPPPPPFRMHGDFDSVGSNSSTPRAISPEMDESETDGPPPTGERKLVKDSTAPMFCSSPDVNSKADKFIARFRADLKLQKMNSIKEKTGRKRSNLGRTSGPGPSKSR
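The protein sequence above can be spot-checked against the CDS by translating a few codons. This is a minimal sketch using the standard genetic code, written without external libraries; `translate` is a hypothetical helper, and only the first 30 and last 18 nucleotides of the CDS are used here rather than the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGAGGAAGATGGCAACGCGCCGCCGCCG"  # first 30 nt of the CDS
cds_end = "CCAAGTAAATCAAGATAA"                # last 18 nt of the CDS

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MEEDGNAPPP and PSKSR, matching the start and end of the listed protein sequence; the final TAA is the stop codon.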
Homology
BLAST of HG10023461 vs. NCBI nr
Match: XP_038896222.1 (serine/arginine repetitive matrix protein 1-like [Benincasa hispida])

HSP 1 Score: 954.1 bits (2465), Expect = 5.3e-274
Identity = 511/559 (91.41%), Positives = 526/559 (94.10%), Query Frame = 0

Query: 1   MEEDGNAPPPFWLQSSNSLHELDYNRRRRLNRASSFLLNSSAFLIVLLVIVLCFILIVIP 60
           MEEDGNAPPPFWLQSSNSLHELDYNRRRRL+RASSFLLNSSAFLIVLLVIVLCFILIVIP
Sbjct: 1   MEEDGNAPPPFWLQSSNSLHELDYNRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIP 60

Query: 61  KFVQFTSQLIRPQSIKKSWDSLNFLLVLFAIVCGFLSRNTGDDSRDSFEDRSVSSRRTMK 120
           KFVQFTSQLIRPQS+KKSWDSLN LLVLFAIVCGFLSRNTGDDSR SFED SVSSRRTMK
Sbjct: 61  KFVQFTSQLIRPQSVKKSWDSLNLLLVLFAIVCGFLSRNTGDDSRASFEDPSVSSRRTMK 120

Query: 121 SNPTTPRRWDGYTDHRPNHYTLNRMKSSSSYPDLRLQESSLDAGDHRWRFYDDTHVTNHR 180
           SNPTTPRRWDGYTDHRPNHYTLNRM+SSSSYPDLRLQES+ DAGDHRWRFYDDTHVTNHR
Sbjct: 121 SNPTTPRRWDGYTDHRPNHYTLNRMRSSSSYPDLRLQESTFDAGDHRWRFYDDTHVTNHR 180

Query: 181 YSSSDQLHRRRETRPELERLDSDVRSIVFDRSEIREDIYSQPAIPSPPPPPPPPPRVSPP 240
           Y SSDQLHRRRETRPELERLDSD +SI FDRSEIRED+YSQPAIPSPP P  PPPRVSPP
Sbjct: 181 YLSSDQLHRRRETRPELERLDSDAKSIGFDRSEIREDVYSQPAIPSPPRPRSPPPRVSPP 240

Query: 241 RPPSPPPTPPPPANTT--PKVVKRRLKRTLKVHSHTPDGEINQQHENGDSDVANFQRIQL 300
           RPPSPPPTPPPPANTT  PKVVKRR KRT KVHSHTPD EI+QQ+ENGDSDVANFQRIQL
Sbjct: 241 RPPSPPPTPPPPANTTPPPKVVKRRPKRTHKVHSHTPDTEIDQQNENGDSDVANFQRIQL 300

Query: 301 PPLSPPSFYRESEQKSNKNEKKRGGASKEIWSALRRRKKKQRQKSIESFDAIIASQRDPT 360
           PPLSPPSFYRESEQKSN+NEKKRGGASKEIWSALRRRKKKQRQKSIESF+AIIASQR  T
Sbjct: 301 PPLSPPSFYRESEQKSNRNEKKRGGASKEIWSALRRRKKKQRQKSIESFEAIIASQRAST 360

Query: 361 SSLPPPSPPPLPSPSVLQNLFSSKKGKGKKVQSTPLPEPPPPSTASSEPKPKIEDQNQIH 420
            S PPP PPPLPSPSVLQNLFSSKKGKGKKVQSTP PEPP    ASSEPKPK ED+NQ+ 
Sbjct: 361 PSSPPP-PPPLPSPSVLQNLFSSKKGKGKKVQSTPPPEPP----ASSEPKPKTEDRNQML 420

Query: 421 KSHEPPMELERLSSLNDEEYNTRIGSESPFHPIPPPPPPPPPFRMHGDFDSVGSNSSTPR 480
           K HEPPMEL+RLSSLNDEEYNTRIG ESP+HPIPPPPPPPPPFRMHGDFDSVGSNSSTPR
Sbjct: 421 KPHEPPMELDRLSSLNDEEYNTRIGGESPYHPIPPPPPPPPPFRMHGDFDSVGSNSSTPR 480

Query: 481 AISPEMDESETDGPPPTGERKLVKDSTAPMFCSSPDVNSKADKFIARFRADLKLQKMNSI 540
           AISPEMDESE DGPP TGERKLVKDST P+FCSSPDVNSKADKFIARFRADLKLQKMNSI
Sbjct: 481 AISPEMDESEADGPPATGERKLVKDSTIPIFCSSPDVNSKADKFIARFRADLKLQKMNSI 540

Query: 541 KEKTGRKRSNLGRTSGPGP 558
           KEKT RKRSNLGRTSGPGP
Sbjct: 541 KEKTARKRSNLGRTSGPGP 554
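The percentages in an HSP summary line follow directly from the match counts. The sketch below parses the summary line of this hit and recomputes them; `parse_hsp_stats` is a hypothetical helper, and the regular expression assumes the `label = n/m (p%)` layout used on this page.

```python
import re

def parse_hsp_stats(line):
    """Return (label, matches, aligned_length, reported_percent) tuples."""
    pattern = r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)"
    return [(label, int(n), int(m), float(p))
            for label, n, m, p in re.findall(pattern, line)]

line = "Identity = 511/559 (91.41%), Positives = 526/559 (94.10%), Query Frame = 0"
stats = parse_hsp_stats(line)

for label, matches, aligned, reported in stats:
    recomputed = 100.0 * matches / aligned
    print(f"{label}: {matches}/{aligned} = {recomputed:.2f}% (reported {reported:.2f}%)")
```

The parser keys on the numeric layout rather than the label text, so it should work regardless of how the labels are spelled in the raw output.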

BLAST of HG10023461 vs. NCBI nr
Match: XP_008462455.2 (PREDICTED: LOW QUALITY PROTEIN: serine/arginine repetitive matrix protein 1-like [Cucumis melo])

HSP 1 Score: 917.5 bits (2370), Expect = 5.4e-263
Identity = 499/569 (87.70%), Positives = 523/569 (91.92%), Query Frame = 0

Query: 1   MEEDGNA-PPPFWLQSSN-SLHELDYNRRRRLNRASSFLLNSSAFLIVLLVIVLCFILIV 60
           MEEDGNA  PPFWLQSSN SLHEL Y+RRRRL+RASSFLLNSSAFLIVLLVIVLCFILIV
Sbjct: 1   MEEDGNAHSPPFWLQSSNSSLHELHYSRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIV 60

Query: 61  IPKFVQFTSQLIRPQSIKKSWDSLNFLLVLFAIVCGFLSRNT-GDDSRDSFEDRSVSSRR 120
           IPKFVQFTSQLIRPQS+KKSWDSLN LLVLFAIVCGFL RN  GDDSR SFEDRSVSSRR
Sbjct: 61  IPKFVQFTSQLIRPQSVKKSWDSLNLLLVLFAIVCGFLGRNAGGDDSRGSFEDRSVSSRR 120

Query: 121 TMKSNPTTPRRWDGYTDHRPNHYTLNRMKSSSSYPDLRLQESSLDAGDHRWRFYDDTHVT 180
           +MKSNPTTPRRWDGYTDHRPNH+TLNRM+SSSSYPDLRLQESS DAGDHRWRFYDDTHVT
Sbjct: 121 SMKSNPTTPRRWDGYTDHRPNHFTLNRMRSSSSYPDLRLQESSFDAGDHRWRFYDDTHVT 180

Query: 181 NHRYSSSDQLHRRRETRPELERLDSDVRSIVFDRSEIREDIYSQPAIPSPPPPPPPPPRV 240
           NHRYSSSDQLHRRRET+PELER DS+ +SIVFDRSEIR D+YS+P IPS  PP  PPP+V
Sbjct: 181 NHRYSSSDQLHRRRETQPELERQDSEAKSIVFDRSEIR-DVYSEPVIPS--PPRSPPPQV 240

Query: 241 SPPRPPSPPPTPPPPANTTPKVVKRRLKRTLKVHSHTPDGEINQQHENGDSDVANFQRIQ 300
           SPPRPPSPPPTPPPPANT PK+VKRR KRT KVHSHTP+ EINQQHENGDSDVANFQRIQ
Sbjct: 241 SPPRPPSPPPTPPPPANTIPKMVKRRPKRTHKVHSHTPEEEINQQHENGDSDVANFQRIQ 300

Query: 301 LPPLSPPSFYRESEQKSNKNEKKRGGASKEIWSALRRRKKKQRQKSIESFDAIIASQRDP 360
           LPPLSPP FYRESEQKS+KNEKKR GASKEIWSALRRRKKKQRQKS+ESF+AIIASQR  
Sbjct: 301 LPPLSPPLFYRESEQKSSKNEKKRTGASKEIWSALRRRKKKQRQKSVESFEAIIASQRAS 360

Query: 361 TSSLPPPS-----PPPLPSPSVLQNLFSSKKGKGKKVQSTPLPEPPPPSTASSEPKPKIE 420
           TSSLPPPS     PPPLPSPSVLQNLFSS+KGK KKVQST LP+PPPPS ASSEPKPK E
Sbjct: 361 TSSLPPPSPPPPPPPPLPSPSVLQNLFSSRKGKHKKVQSTSLPDPPPPSIASSEPKPKTE 420

Query: 421 DQNQIHKSHEPPMELERLSSLNDEEYNTRIGSESPFHPIPPPPPPPPPFRMHGDFDSVGS 480
           DQNQI K  +PPMEL+RLSSLNDEEY+TRIG ESP+HPIPPPPPPPPPFRMHGDFDSVGS
Sbjct: 421 DQNQILKPQDPPMELDRLSSLNDEEYHTRIGGESPYHPIPPPPPPPPPFRMHGDFDSVGS 480

Query: 481 NSSTPRAISPEMDESETDGPPPTGERKLVKDSTAPMFCSSPDVNSKADKFIARFRADLKL 540
           NSSTPRAISPEMDESE D PP T ERKLVKD T PMFCSSPDVNSKADKFIARFRADLKL
Sbjct: 481 NSSTPRAISPEMDESEADAPPATSERKLVKDPTIPMFCSSPDVNSKADKFIARFRADLKL 540

Query: 541 QKMNSIKEKTGRKRSNLGRTSGPGPSKSR 562
           QKMNSIKEKT RKRSNLGRTSGPGPSK+R
Sbjct: 541 QKMNSIKEKTTRKRSNLGRTSGPGPSKTR 566

BLAST of HG10023461 vs. NCBI nr
Match: KAA0059415.1 (serine/arginine repetitive matrix protein 1-like [Cucumis melo var. makuwa] >TYK03911.1 serine/arginine repetitive matrix protein 1-like [Cucumis melo var. makuwa])

HSP 1 Score: 915.6 bits (2365), Expect = 2.1e-262
Identity = 498/569 (87.52%), Positives = 523/569 (91.92%), Query Frame = 0

Query: 1   MEEDGNA-PPPFWLQSSN-SLHELDYNRRRRLNRASSFLLNSSAFLIVLLVIVLCFILIV 60
           MEEDGNA  PPFWLQSSN SLHEL Y+RRRRL+RASSFLLNSSAFLIVLLVIVLCFILIV
Sbjct: 1   MEEDGNAHSPPFWLQSSNSSLHELRYSRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIV 60

Query: 61  IPKFVQFTSQLIRPQSIKKSWDSLNFLLVLFAIVCGFLSRNT-GDDSRDSFEDRSVSSRR 120
           IPKFVQFTSQLIRPQS+KKSWDSLN LLVLFAIVCGFL RN  GDDSR SFEDRSVSSRR
Sbjct: 61  IPKFVQFTSQLIRPQSVKKSWDSLNLLLVLFAIVCGFLGRNAGGDDSRGSFEDRSVSSRR 120

Query: 121 TMKSNPTTPRRWDGYTDHRPNHYTLNRMKSSSSYPDLRLQESSLDAGDHRWRFYDDTHVT 180
           +MKSNPTTPRRWDGYTDHRPNH+TLNRM+SSSSYPDLRLQESS DAGDH+WRFYDDTHVT
Sbjct: 121 SMKSNPTTPRRWDGYTDHRPNHFTLNRMRSSSSYPDLRLQESSFDAGDHQWRFYDDTHVT 180

Query: 181 NHRYSSSDQLHRRRETRPELERLDSDVRSIVFDRSEIREDIYSQPAIPSPPPPPPPPPRV 240
           NHRYSSSDQLHRRRET+PELER DS+ +SIVFDRSEIR D+YS+P IPS  PP  PPP+V
Sbjct: 181 NHRYSSSDQLHRRRETQPELERQDSEAKSIVFDRSEIR-DVYSEPVIPS--PPRSPPPQV 240

Query: 241 SPPRPPSPPPTPPPPANTTPKVVKRRLKRTLKVHSHTPDGEINQQHENGDSDVANFQRIQ 300
           SPPRPPSPPPTPPPPANT PK+VKRR KRT KVHSHTP+ EINQQHENGDSDVANFQRIQ
Sbjct: 241 SPPRPPSPPPTPPPPANTIPKMVKRRPKRTHKVHSHTPEEEINQQHENGDSDVANFQRIQ 300

Query: 301 LPPLSPPSFYRESEQKSNKNEKKRGGASKEIWSALRRRKKKQRQKSIESFDAIIASQRDP 360
           LPPLSPP FYRESEQKS+KNEKKR GASKEIWSALRRRKKKQRQKS+ESF+AIIASQR  
Sbjct: 301 LPPLSPPLFYRESEQKSSKNEKKRTGASKEIWSALRRRKKKQRQKSVESFEAIIASQRAS 360

Query: 361 TSSLPPPS-----PPPLPSPSVLQNLFSSKKGKGKKVQSTPLPEPPPPSTASSEPKPKIE 420
           TSSLPPPS     PPPLPSPSVLQNLFSS+KGK KKVQST LP+PPPPS ASSEPKPK E
Sbjct: 361 TSSLPPPSPPPPPPPPLPSPSVLQNLFSSRKGKHKKVQSTSLPDPPPPSIASSEPKPKAE 420

Query: 421 DQNQIHKSHEPPMELERLSSLNDEEYNTRIGSESPFHPIPPPPPPPPPFRMHGDFDSVGS 480
           DQNQI K  +PPMEL+RLSSLNDEEY+TRIG ESP+HPIPPPPPPPPPFRMHGDFDSVGS
Sbjct: 421 DQNQILKPQDPPMELDRLSSLNDEEYHTRIGGESPYHPIPPPPPPPPPFRMHGDFDSVGS 480

Query: 481 NSSTPRAISPEMDESETDGPPPTGERKLVKDSTAPMFCSSPDVNSKADKFIARFRADLKL 540
           NSSTPRAISPEMDESE D PP T ERKLVKD T PMFCSSPDVNSKADKFIARFRADLKL
Sbjct: 481 NSSTPRAISPEMDESEADAPPATSERKLVKDPTIPMFCSSPDVNSKADKFIARFRADLKL 540

Query: 541 QKMNSIKEKTGRKRSNLGRTSGPGPSKSR 562
           QKMNSIKEKT RKRSNLGRTSGPGPSK+R
Sbjct: 541 QKMNSIKEKTTRKRSNLGRTSGPGPSKTR 566

BLAST of HG10023461 vs. NCBI nr
Match: XP_011659637.1 (formin-like protein 20 [Cucumis sativus] >KGN45509.1 hypothetical protein Csa_016787 [Cucumis sativus])

HSP 1 Score: 897.5 bits (2318), Expect = 5.8e-257
Identity = 492/570 (86.32%), Positives = 518/570 (90.88%), Query Frame = 0

Query: 1   MEEDGN-APPPFWLQSSN-SLHELDYNRRRRLNRASSFLLNSSAFLIVLLVIVLCFILIV 60
           MEEDGN  PPPFWLQSSN SL++L ++RRRRL+RASSFLLNSSAFLIVLLVIVLCFILIV
Sbjct: 1   MEEDGNPRPPPFWLQSSNSSLNQLHHSRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIV 60

Query: 61  IPKFVQFTSQLIRPQSIKKSWDSLNFLLVLFAIVCGFLSRNT-GDDSRDSFEDRSVSSRR 120
           IPKFVQFTSQLIRPQS+KKSWDSLN LLVLFAIVCGFL RN  GDDSR SFEDRSVSSRR
Sbjct: 61  IPKFVQFTSQLIRPQSVKKSWDSLNLLLVLFAIVCGFLGRNAGGDDSRGSFEDRSVSSRR 120

Query: 121 TMKSNPTTPRRW-DGYTDHRPNHYTLNRMKSSSSYPDLRLQESSLDAGDHRWRFYDDTHV 180
           +MKSNPT PRRW DGYTDHRPNH+TLNRM+SSSSYPDLRLQESS DAGDHRWRFYDDTHV
Sbjct: 121 SMKSNPTAPRRWDDGYTDHRPNHFTLNRMRSSSSYPDLRLQESSFDAGDHRWRFYDDTHV 180

Query: 181 TNHRYSSSDQLHRRRETRPELERLDSDVRSIVFDRSEIREDIYSQPAIPSPPPPPPPPPR 240
           TNHRY SSDQLHRRRET+PELE+ DS+ +SIVFDRSEIRED+YSQP IPS  PP  PP +
Sbjct: 181 TNHRYLSSDQLHRRRETQPELEQRDSEAKSIVFDRSEIREDVYSQPLIPS--PPRSPPQQ 240

Query: 241 VSPPRPPSPPPTPPPPANTTPKVVKRRLKRTLKVHSHTPDGEINQQHENGDSDVANFQRI 300
           VSPPRPPSPPPTPPPPANT PK+VKRR KRT KVHSHTPD E NQQHENGDSDVANFQRI
Sbjct: 241 VSPPRPPSPPPTPPPPANTIPKMVKRRPKRTHKVHSHTPDEENNQQHENGDSDVANFQRI 300

Query: 301 QLPPLSPPSFYRESEQKSNKNEKKRGGASKEIWSALRRRKKKQRQKSIESFDAIIASQRD 360
           QLPPLSPPSFYRESEQKS+KNEKKR GASKEIWSALRRRKKKQRQKS+ESF+AIIASQR 
Sbjct: 301 QLPPLSPPSFYRESEQKSSKNEKKRTGASKEIWSALRRRKKKQRQKSVESFEAIIASQRA 360

Query: 361 PTSSLPPPS----PPPLPSPSVLQNLFSSKKGKGKKVQSTPLPEPPPPSTASSEPKPKIE 420
            TSSLPPPS    PPPLPSPSVLQNLFSS+KGK KKVQST LPEPPPPS  SSEPKP+I 
Sbjct: 361 STSSLPPPSPPPPPPPLPSPSVLQNLFSSRKGKHKKVQSTSLPEPPPPSIPSSEPKPEIA 420

Query: 421 DQNQIHKSHEPPMELERLSSLNDEEYNTRIGSESPFHPI-PPPPPPPPPFRMHGDFDSVG 480
            QNQI K H+PPMEL+RLSSLNDEEYNT IG ESP+HPI PPPPPPPPPFRMHGDFDS G
Sbjct: 421 AQNQILKPHDPPMELDRLSSLNDEEYNTSIGGESPYHPIPPPPPPPPPPFRMHGDFDSAG 480

Query: 481 SNSSTPRAISPEMDESETDGPPPTGERKLVKDSTAPMFCSSPDVNSKADKFIARFRADLK 540
           SNSSTPRAISPEMDESE +GPP T ERKLVKD T PMFCSSPDVNSKAD FIARFRADLK
Sbjct: 481 SNSSTPRAISPEMDESEANGPPATSERKLVKDPTIPMFCSSPDVNSKADTFIARFRADLK 540

Query: 541 LQKMNSIKEKTGRKRSNLGRTSGPGPSKSR 562
           LQKMNSIKEKT RKRSNLGRTSGPGPSK+R
Sbjct: 541 LQKMNSIKEKTARKRSNLGRTSGPGPSKTR 568

BLAST of HG10023461 vs. NCBI nr
Match: XP_023548433.1 (protein enabled homolog [Cucurbita pepo subsp. pepo])

HSP 1 Score: 863.6 bits (2230), Expect = 9.3e-247
Identity = 470/561 (83.78%), Positives = 501/561 (89.30%), Query Frame = 0

Query: 1   MEEDGNAPPPFWLQSSNSLHELDYNRRR-RLNRASSFLLNSSAFLIVLLVIVLCFILIVI 60
           MEEDGNAPPPFWLQ SNSLHELD +RRR RL+RASSFLLNSSAFL+VLLVIVLCFI IVI
Sbjct: 1   MEEDGNAPPPFWLQPSNSLHELDNHRRRHRLSRASSFLLNSSAFLVVLLVIVLCFIWIVI 60

Query: 61  PKFVQFTSQLIRPQSIKKSWDSLNFLLVLFAIVCGFLSRNTGDDSRDSFEDRSVSSRRTM 120
           PKFVQF SQLIRPQS+KKSWDSLN +LVLFAIVCGFLSRN G+DSRDSFEDRSVSSRRT+
Sbjct: 61  PKFVQFGSQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNAGEDSRDSFEDRSVSSRRTI 120

Query: 121 KSNPTTPRRWDGYTDHRPNHYTLNRMKSSSSYPDLRLQESSLDAGDHRWRFYDDTHVTNH 180
           KSNP  PR+WDGY DHRP HYT+NRM+SSSSYPDLRLQESSLDAGD +WR YDDTHV N+
Sbjct: 121 KSNPRNPRQWDGYADHRPIHYTVNRMRSSSSYPDLRLQESSLDAGDQQWRSYDDTHVPNN 180

Query: 181 RYSSSDQLHRRRETRPELERLDSDVRSIVFDRSEIREDIYSQPAIPSPPPPPPPPPRVSP 240
           R+ SSDQLHRRRE RPELER DSDV+SI FDRSE+RED+YSQ  +P P PP  PPP+VSP
Sbjct: 181 RFPSSDQLHRRREARPELEREDSDVKSIGFDRSEMREDVYSQ--MPIPSPPRSPPPQVSP 240

Query: 241 PRPPSPPPTPPPPANTTPKVVKRRLKRTLKVHSHTPDGEINQQHENGDSDVANFQRIQLP 300
           PR PSPPPTPPPPANTTPKVVKRR KRT KVHSHTP GEI+Q ++NGDSDVA FQRI LP
Sbjct: 241 PRSPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTPAGEIDQHNKNGDSDVAEFQRIPLP 300

Query: 301 PLSPPSFYRESEQKSNKNEKKRGGASKEIWSALRRRKKKQRQKSIESFDAIIASQRDPTS 360
           PLSPP FYRESEQKS KN+KKRGGA KEIWSALRRR+KKQRQKSIESF+ I+ASQR  TS
Sbjct: 301 PLSPPLFYRESEQKSVKNDKKRGGAPKEIWSALRRRRKKQRQKSIESFEDIVASQRPSTS 360

Query: 361 SLPPPS---PPPLPSPSVLQNLFSSKKGKGKKVQSTPLPEPPPPSTASSEPKPKIEDQNQ 420
           SLPPPS   PPPLPSPSVLQ LF+SKKGKGKKVQSTP PE  PPS AS EPKP IEDQN 
Sbjct: 361 SLPPPSPPPPPPLPSPSVLQVLFTSKKGKGKKVQSTPSPE-SPPSIASPEPKPIIEDQNH 420

Query: 421 IHKSHEPPMELERLSSLNDEEYNTRIGSESPFHPIPPPPPPPPPFRMHGDFDSVGSNSST 480
           + K HEPP+EL RLSSLNDEEY+TRIG ESPFHPIPPPPPPPPPFRMHGDFDSVGSNSST
Sbjct: 421 LLKPHEPPVELARLSSLNDEEYSTRIGGESPFHPIPPPPPPPPPFRMHGDFDSVGSNSST 480

Query: 481 PRAISPEMDESETDGPPPTGERKLVKDSTAPMFCSSPDVNSKADKFIARFRADLKLQKMN 540
           PRA+SP+M ESE DG P  GERKLVKDST PMFCSSPDVNSKADKFIARFRADLKLQKMN
Sbjct: 481 PRAVSPDMGESEADGQPAAGERKLVKDSTIPMFCSSPDVNSKADKFIARFRADLKLQKMN 540

Query: 541 SIKEKTGRKRSNLGRTSGPGP 558
           SIKEKT RKRSNLGRT GPGP
Sbjct: 541 SIKEKTARKRSNLGRTPGPGP 558

BLAST of HG10023461 vs. ExPASy TrEMBL
Match: A0A1S3CII2 (LOW QUALITY PROTEIN: serine/arginine repetitive matrix protein 1-like OS=Cucumis melo OX=3656 GN=LOC103500804 PE=4 SV=1)

HSP 1 Score: 917.5 bits (2370), Expect = 2.6e-263
Identity = 499/569 (87.70%), Positives = 523/569 (91.92%), Query Frame = 0

Query: 1   MEEDGNA-PPPFWLQSSN-SLHELDYNRRRRLNRASSFLLNSSAFLIVLLVIVLCFILIV 60
           MEEDGNA  PPFWLQSSN SLHEL Y+RRRRL+RASSFLLNSSAFLIVLLVIVLCFILIV
Sbjct: 1   MEEDGNAHSPPFWLQSSNSSLHELHYSRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIV 60

Query: 61  IPKFVQFTSQLIRPQSIKKSWDSLNFLLVLFAIVCGFLSRNT-GDDSRDSFEDRSVSSRR 120
           IPKFVQFTSQLIRPQS+KKSWDSLN LLVLFAIVCGFL RN  GDDSR SFEDRSVSSRR
Sbjct: 61  IPKFVQFTSQLIRPQSVKKSWDSLNLLLVLFAIVCGFLGRNAGGDDSRGSFEDRSVSSRR 120

Query: 121 TMKSNPTTPRRWDGYTDHRPNHYTLNRMKSSSSYPDLRLQESSLDAGDHRWRFYDDTHVT 180
           +MKSNPTTPRRWDGYTDHRPNH+TLNRM+SSSSYPDLRLQESS DAGDHRWRFYDDTHVT
Sbjct: 121 SMKSNPTTPRRWDGYTDHRPNHFTLNRMRSSSSYPDLRLQESSFDAGDHRWRFYDDTHVT 180

Query: 181 NHRYSSSDQLHRRRETRPELERLDSDVRSIVFDRSEIREDIYSQPAIPSPPPPPPPPPRV 240
           NHRYSSSDQLHRRRET+PELER DS+ +SIVFDRSEIR D+YS+P IPS  PP  PPP+V
Sbjct: 181 NHRYSSSDQLHRRRETQPELERQDSEAKSIVFDRSEIR-DVYSEPVIPS--PPRSPPPQV 240

Query: 241 SPPRPPSPPPTPPPPANTTPKVVKRRLKRTLKVHSHTPDGEINQQHENGDSDVANFQRIQ 300
           SPPRPPSPPPTPPPPANT PK+VKRR KRT KVHSHTP+ EINQQHENGDSDVANFQRIQ
Sbjct: 241 SPPRPPSPPPTPPPPANTIPKMVKRRPKRTHKVHSHTPEEEINQQHENGDSDVANFQRIQ 300

Query: 301 LPPLSPPSFYRESEQKSNKNEKKRGGASKEIWSALRRRKKKQRQKSIESFDAIIASQRDP 360
           LPPLSPP FYRESEQKS+KNEKKR GASKEIWSALRRRKKKQRQKS+ESF+AIIASQR  
Sbjct: 301 LPPLSPPLFYRESEQKSSKNEKKRTGASKEIWSALRRRKKKQRQKSVESFEAIIASQRAS 360

Query: 361 TSSLPPPS-----PPPLPSPSVLQNLFSSKKGKGKKVQSTPLPEPPPPSTASSEPKPKIE 420
           TSSLPPPS     PPPLPSPSVLQNLFSS+KGK KKVQST LP+PPPPS ASSEPKPK E
Sbjct: 361 TSSLPPPSPPPPPPPPLPSPSVLQNLFSSRKGKHKKVQSTSLPDPPPPSIASSEPKPKTE 420

Query: 421 DQNQIHKSHEPPMELERLSSLNDEEYNTRIGSESPFHPIPPPPPPPPPFRMHGDFDSVGS 480
           DQNQI K  +PPMEL+RLSSLNDEEY+TRIG ESP+HPIPPPPPPPPPFRMHGDFDSVGS
Sbjct: 421 DQNQILKPQDPPMELDRLSSLNDEEYHTRIGGESPYHPIPPPPPPPPPFRMHGDFDSVGS 480

Query: 481 NSSTPRAISPEMDESETDGPPPTGERKLVKDSTAPMFCSSPDVNSKADKFIARFRADLKL 540
           NSSTPRAISPEMDESE D PP T ERKLVKD T PMFCSSPDVNSKADKFIARFRADLKL
Sbjct: 481 NSSTPRAISPEMDESEADAPPATSERKLVKDPTIPMFCSSPDVNSKADKFIARFRADLKL 540

Query: 541 QKMNSIKEKTGRKRSNLGRTSGPGPSKSR 562
           QKMNSIKEKT RKRSNLGRTSGPGPSK+R
Sbjct: 541 QKMNSIKEKTTRKRSNLGRTSGPGPSKTR 566

BLAST of HG10023461 vs. ExPASy TrEMBL
Match: A0A5A7V0Q3 (Serine/arginine repetitive matrix protein 1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold347G001070 PE=4 SV=1)

HSP 1 Score: 915.6 bits (2365), Expect = 1.0e-262
Identity = 498/569 (87.52%), Positives = 523/569 (91.92%), Query Frame = 0

Query: 1   MEEDGNA-PPPFWLQSSN-SLHELDYNRRRRLNRASSFLLNSSAFLIVLLVIVLCFILIV 60
           MEEDGNA  PPFWLQSSN SLHEL Y+RRRRL+RASSFLLNSSAFLIVLLVIVLCFILIV
Sbjct: 1   MEEDGNAHSPPFWLQSSNSSLHELRYSRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIV 60

Query: 61  IPKFVQFTSQLIRPQSIKKSWDSLNFLLVLFAIVCGFLSRNT-GDDSRDSFEDRSVSSRR 120
           IPKFVQFTSQLIRPQS+KKSWDSLN LLVLFAIVCGFL RN  GDDSR SFEDRSVSSRR
Sbjct: 61  IPKFVQFTSQLIRPQSVKKSWDSLNLLLVLFAIVCGFLGRNAGGDDSRGSFEDRSVSSRR 120

Query: 121 TMKSNPTTPRRWDGYTDHRPNHYTLNRMKSSSSYPDLRLQESSLDAGDHRWRFYDDTHVT 180
           +MKSNPTTPRRWDGYTDHRPNH+TLNRM+SSSSYPDLRLQESS DAGDH+WRFYDDTHVT
Sbjct: 121 SMKSNPTTPRRWDGYTDHRPNHFTLNRMRSSSSYPDLRLQESSFDAGDHQWRFYDDTHVT 180

Query: 181 NHRYSSSDQLHRRRETRPELERLDSDVRSIVFDRSEIREDIYSQPAIPSPPPPPPPPPRV 240
           NHRYSSSDQLHRRRET+PELER DS+ +SIVFDRSEIR D+YS+P IPS  PP  PPP+V
Sbjct: 181 NHRYSSSDQLHRRRETQPELERQDSEAKSIVFDRSEIR-DVYSEPVIPS--PPRSPPPQV 240

Query: 241 SPPRPPSPPPTPPPPANTTPKVVKRRLKRTLKVHSHTPDGEINQQHENGDSDVANFQRIQ 300
           SPPRPPSPPPTPPPPANT PK+VKRR KRT KVHSHTP+ EINQQHENGDSDVANFQRIQ
Sbjct: 241 SPPRPPSPPPTPPPPANTIPKMVKRRPKRTHKVHSHTPEEEINQQHENGDSDVANFQRIQ 300

Query: 301 LPPLSPPSFYRESEQKSNKNEKKRGGASKEIWSALRRRKKKQRQKSIESFDAIIASQRDP 360
           LPPLSPP FYRESEQKS+KNEKKR GASKEIWSALRRRKKKQRQKS+ESF+AIIASQR  
Sbjct: 301 LPPLSPPLFYRESEQKSSKNEKKRTGASKEIWSALRRRKKKQRQKSVESFEAIIASQRAS 360

Query: 361 TSSLPPPS-----PPPLPSPSVLQNLFSSKKGKGKKVQSTPLPEPPPPSTASSEPKPKIE 420
           TSSLPPPS     PPPLPSPSVLQNLFSS+KGK KKVQST LP+PPPPS ASSEPKPK E
Sbjct: 361 TSSLPPPSPPPPPPPPLPSPSVLQNLFSSRKGKHKKVQSTSLPDPPPPSIASSEPKPKAE 420

Query: 421 DQNQIHKSHEPPMELERLSSLNDEEYNTRIGSESPFHPIPPPPPPPPPFRMHGDFDSVGS 480
           DQNQI K  +PPMEL+RLSSLNDEEY+TRIG ESP+HPIPPPPPPPPPFRMHGDFDSVGS
Sbjct: 421 DQNQILKPQDPPMELDRLSSLNDEEYHTRIGGESPYHPIPPPPPPPPPFRMHGDFDSVGS 480

Query: 481 NSSTPRAISPEMDESETDGPPPTGERKLVKDSTAPMFCSSPDVNSKADKFIARFRADLKL 540
           NSSTPRAISPEMDESE D PP T ERKLVKD T PMFCSSPDVNSKADKFIARFRADLKL
Sbjct: 481 NSSTPRAISPEMDESEADAPPATSERKLVKDPTIPMFCSSPDVNSKADKFIARFRADLKL 540

Query: 541 QKMNSIKEKTGRKRSNLGRTSGPGPSKSR 562
           QKMNSIKEKT RKRSNLGRTSGPGPSK+R
Sbjct: 541 QKMNSIKEKTTRKRSNLGRTSGPGPSKTR 566

BLAST of HG10023461 vs. ExPASy TrEMBL
Match: A0A0A0K9L4 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G450660 PE=4 SV=1)

HSP 1 Score: 897.5 bits (2318), Expect = 2.8e-257
Identity = 492/570 (86.32%), Positives = 518/570 (90.88%), Query Frame = 0

Query: 1   MEEDGN-APPPFWLQSSN-SLHELDYNRRRRLNRASSFLLNSSAFLIVLLVIVLCFILIV 60
           MEEDGN  PPPFWLQSSN SL++L ++RRRRL+RASSFLLNSSAFLIVLLVIVLCFILIV
Sbjct: 1   MEEDGNPRPPPFWLQSSNSSLNQLHHSRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIV 60

Query: 61  IPKFVQFTSQLIRPQSIKKSWDSLNFLLVLFAIVCGFLSRNT-GDDSRDSFEDRSVSSRR 120
           IPKFVQFTSQLIRPQS+KKSWDSLN LLVLFAIVCGFL RN  GDDSR SFEDRSVSSRR
Sbjct: 61  IPKFVQFTSQLIRPQSVKKSWDSLNLLLVLFAIVCGFLGRNAGGDDSRGSFEDRSVSSRR 120

Query: 121 TMKSNPTTPRRW-DGYTDHRPNHYTLNRMKSSSSYPDLRLQESSLDAGDHRWRFYDDTHV 180
           +MKSNPT PRRW DGYTDHRPNH+TLNRM+SSSSYPDLRLQESS DAGDHRWRFYDDTHV
Sbjct: 121 SMKSNPTAPRRWDDGYTDHRPNHFTLNRMRSSSSYPDLRLQESSFDAGDHRWRFYDDTHV 180

Query: 181 TNHRYSSSDQLHRRRETRPELERLDSDVRSIVFDRSEIREDIYSQPAIPSPPPPPPPPPR 240
           TNHRY SSDQLHRRRET+PELE+ DS+ +SIVFDRSEIRED+YSQP IPS  PP  PP +
Sbjct: 181 TNHRYLSSDQLHRRRETQPELEQRDSEAKSIVFDRSEIREDVYSQPLIPS--PPRSPPQQ 240

Query: 241 VSPPRPPSPPPTPPPPANTTPKVVKRRLKRTLKVHSHTPDGEINQQHENGDSDVANFQRI 300
           VSPPRPPSPPPTPPPPANT PK+VKRR KRT KVHSHTPD E NQQHENGDSDVANFQRI
Sbjct: 241 VSPPRPPSPPPTPPPPANTIPKMVKRRPKRTHKVHSHTPDEENNQQHENGDSDVANFQRI 300

Query: 301 QLPPLSPPSFYRESEQKSNKNEKKRGGASKEIWSALRRRKKKQRQKSIESFDAIIASQRD 360
           QLPPLSPPSFYRESEQKS+KNEKKR GASKEIWSALRRRKKKQRQKS+ESF+AIIASQR 
Sbjct: 301 QLPPLSPPSFYRESEQKSSKNEKKRTGASKEIWSALRRRKKKQRQKSVESFEAIIASQRA 360

Query: 361 PTSSLPPPS----PPPLPSPSVLQNLFSSKKGKGKKVQSTPLPEPPPPSTASSEPKPKIE 420
            TSSLPPPS    PPPLPSPSVLQNLFSS+KGK KKVQST LPEPPPPS  SSEPKP+I 
Sbjct: 361 STSSLPPPSPPPPPPPLPSPSVLQNLFSSRKGKHKKVQSTSLPEPPPPSIPSSEPKPEIA 420

Query: 421 DQNQIHKSHEPPMELERLSSLNDEEYNTRIGSESPFHPI-PPPPPPPPPFRMHGDFDSVG 480
            QNQI K H+PPMEL+RLSSLNDEEYNT IG ESP+HPI PPPPPPPPPFRMHGDFDS G
Sbjct: 421 AQNQILKPHDPPMELDRLSSLNDEEYNTSIGGESPYHPIPPPPPPPPPPFRMHGDFDSAG 480

Query: 481 SNSSTPRAISPEMDESETDGPPPTGERKLVKDSTAPMFCSSPDVNSKADKFIARFRADLK 540
           SNSSTPRAISPEMDESE +GPP T ERKLVKD T PMFCSSPDVNSKAD FIARFRADLK
Sbjct: 481 SNSSTPRAISPEMDESEANGPPATSERKLVKDPTIPMFCSSPDVNSKADTFIARFRADLK 540

Query: 541 LQKMNSIKEKTGRKRSNLGRTSGPGPSKSR 562
           LQKMNSIKEKT RKRSNLGRTSGPGPSK+R
Sbjct: 541 LQKMNSIKEKTARKRSNLGRTSGPGPSKTR 568

BLAST of HG10023461 vs. ExPASy TrEMBL
Match: A0A6J1GQY1 (protein enabled homolog OS=Cucurbita moschata OX=3662 GN=LOC111456249 PE=4 SV=1)

HSP 1 Score: 858.6 bits (2217), Expect = 1.5e-245
Identity = 470/561 (83.78%), Positives = 501/561 (89.30%), Query Frame = 0

Query: 1   MEEDGNAPPPFWLQSSNSLHELDYNRRR-RLNRASSFLLNSSAFLIVLLVIVLCFILIVI 60
           MEEDGNAPPPFWLQ SNSLHELD +RRR RL+RASSFLLNSSAFL+VLLVIVLCFI IVI
Sbjct: 1   MEEDGNAPPPFWLQPSNSLHELDNHRRRHRLSRASSFLLNSSAFLVVLLVIVLCFIWIVI 60

Query: 61  PKFVQFTSQLIRPQSIKKSWDSLNFLLVLFAIVCGFLSRNTGDDSRDSFEDRSVSSRRTM 120
           PKFVQF SQLIRPQS+KKSWDSLN +LVLFAIVCGFLSRN GDDSRDSFEDRSVSSRRT+
Sbjct: 61  PKFVQFGSQLIRPQSMKKSWDSLNLVLVLFAIVCGFLSRNAGDDSRDSFEDRSVSSRRTI 120

Query: 121 KSNPTTPRRWDGYTDHRPNHYTLNRMKSSSSYPDLRLQESSLDAGDHRWRFYDDTHVTNH 180
           K+NP  PR+WDGY DHRP HYT+NRM+SSSSYPDLRLQESSL AGD R R YDDTHV N+
Sbjct: 121 KTNPRNPRQWDGYADHRPIHYTVNRMRSSSSYPDLRLQESSLVAGDQRRRSYDDTHVPNN 180

Query: 181 RYSSSDQLHRRRETRPELERLDSDVRSIVFDRSEIREDIYSQPAIPSPPPPPPPPPRVSP 240
           R+  SDQL+RRRE RPELER DSDV+SI FDRSEIRED+YSQ  +P P PP  PPP+VSP
Sbjct: 181 RFPYSDQLYRRREARPELEREDSDVKSIGFDRSEIREDVYSQ--LPIPSPPRSPPPQVSP 240

Query: 241 PRPPSPPPTPPPPANTTPKVVKRRLKRTLKVHSHTPDGEINQQHENGDSDVANFQRIQLP 300
           PR PSPPPTPPPPANTTPKVVKRR KRT KVHSHTP GEI+Q ++NGDSDVA FQRI LP
Sbjct: 241 PRSPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTPAGEIDQHNKNGDSDVAEFQRIPLP 300

Query: 301 PLSPPSFYRESEQKSNKNEKKRGGASKEIWSALRRRKKKQRQKSIESFDAIIASQRDPTS 360
           PLSPP FYRESEQKS KNEKKRGGA KEIWSALRRR+KKQRQKSIESF+AI+ASQR  TS
Sbjct: 301 PLSPPLFYRESEQKSVKNEKKRGGAPKEIWSALRRRRKKQRQKSIESFEAIVASQRPSTS 360

Query: 361 SLPPPS---PPPLPSPSVLQNLFSSKKGKGKKVQSTPLPEPPPPSTASSEPKPKIEDQNQ 420
           SLPPPS   PPPLPSPSVLQ LF+SKKG+GKKVQSTP PE  PPS ASSEPKP IEDQN 
Sbjct: 361 SLPPPSPPPPPPLPSPSVLQVLFTSKKGRGKKVQSTPSPE-SPPSIASSEPKPIIEDQNH 420

Query: 421 IHKSHEPPMELERLSSLNDEEYNTRIGSESPFHPIPPPPPPPPPFRMHGDFDSVGSNSST 480
           + K HEPP+EL RL+SLNDEEY+TRIG ESPFHPIPPPPPPPPPFRMHGDFDSVGSNSST
Sbjct: 421 LLKPHEPPVELARLNSLNDEEYSTRIGGESPFHPIPPPPPPPPPFRMHGDFDSVGSNSST 480

Query: 481 PRAISPEMDESETDGPPPTGERKLVKDSTAPMFCSSPDVNSKADKFIARFRADLKLQKMN 540
           PRA+SP+MDESE DG P  GERKLVKDST PMFCSSPDVNSKADKFIARFRADLKLQKMN
Sbjct: 481 PRAVSPDMDESEADGKPAAGERKLVKDSTIPMFCSSPDVNSKADKFIARFRADLKLQKMN 540

Query: 541 SIKEKTGRKRSNLGRTSGPGP 558
           SIKEKT RKRSNLGRT GPGP
Sbjct: 541 SIKEKTARKRSNLGRTPGPGP 558

BLAST of HG10023461 vs. ExPASy TrEMBL
Match: A0A6J1JVT7 (serine/arginine repetitive matrix protein 1-like OS=Cucurbita maxima OX=3661 GN=LOC111488332 PE=4 SV=1)

HSP 1 Score: 836.3 bits (2159), Expect = 7.7e-239
Identity = 465/562 (82.74%), Positives = 494/562 (87.90%), Query Frame = 0

Query: 1   MEEDGNAPPPFWLQSSNSLHELDYNRRR-RLNRASSFLLNSSAFLIVLLVIVLCFILIVI 60
           MEEDGNAPPPFWLQ SNSL ELD +RRR RL+RASSFLLNSSAFL+VLLVIVLCFI IVI
Sbjct: 1   MEEDGNAPPPFWLQPSNSLPELDNHRRRHRLSRASSFLLNSSAFLVVLLVIVLCFIWIVI 60

Query: 61  PKFVQFTSQLIRPQSIKKSWDSLNFLLVLFAIVCGFLSRNTGDDSRDSFEDRSVSSRRTM 120
           PKFVQF SQLIRPQS+KKSWDSLN +LVLFAIVCGFLSRN GDD+RDS EDRSVSSRRT+
Sbjct: 61  PKFVQFGSQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNAGDDTRDSLEDRSVSSRRTI 120

Query: 121 KSNPTTPRRWDGYTDHRPNHYTLNRMKSSSSYPDLRLQESSLDAGDHRWRFYDDTHVTNH 180
           KSNP TPR+WDGY DHRP  YT+NRM+SSSSYPDL LQESSLDAGD RWR YDDTHV N+
Sbjct: 121 KSNPRTPRQWDGYADHRPIRYTVNRMRSSSSYPDLCLQESSLDAGDQRWRSYDDTHVPNN 180

Query: 181 RYSSSDQLHRRRETRPELERLDSDVRSIVFDRSEIREDIYSQPAIPSPPPPPPPPPRVSP 240
           R+ SSDQLHRRRE RPELER DSDV+SI FDRSEIRED+YSQ  +P P PP  PPP VSP
Sbjct: 181 RFPSSDQLHRRREARPELEREDSDVKSIGFDRSEIREDVYSQ--LPIPSPPRSPPPLVSP 240

Query: 241 PRPPSPPPTPPPPANTTPKVVKRRLKRTLKVHSHTPDGEINQQHENGDSDVANFQRIQLP 300
           PR PSPPPTPPPPA+TTPKVVKRR KRT KVHSHTP  EI+Q ++NGDSDVA FQRI LP
Sbjct: 241 PRSPSPPPTPPPPAHTTPKVVKRRPKRTHKVHSHTPAVEIDQHNKNGDSDVAEFQRIPLP 300

Query: 301 PLSPPSFYRESEQKSNKNEKKRGGASKEIWSALRRRKKKQRQKSIESFDAIIASQRDPTS 360
           PLSPP FYRESEQKS KN+KKRGGA KEIWSALRRR+KKQRQKSIESF+AI+ASQR  TS
Sbjct: 301 PLSPPLFYRESEQKSVKNDKKRGGAPKEIWSALRRRRKKQRQKSIESFEAIVASQRPSTS 360

Query: 361 SLPPPSPPP---LPSPSVLQNLFSSKKGKGKKVQSTPLPEPPPPSTASSEPKPKIEDQNQ 420
           SLPPPSPPP   LPSPSVLQ LF+SKKGKGKKVQSTP PE  PPS ASSEPKP IEDQN 
Sbjct: 361 SLPPPSPPPPPLLPSPSVLQVLFTSKKGKGKKVQSTPSPE-SPPSIASSEPKPSIEDQNH 420

Query: 421 IHKSHE-PPMELERLSSLNDEEYNTRIGSESPFHPIPPPPPPPPPFRMHGDFDSVGSNSS 480
           + K HE PP+EL RLSSLN EEY+TRIG ESPFHPIPPPPPPPP FRMHGDFDSVGSNSS
Sbjct: 421 LRKPHEPPPVELVRLSSLNGEEYSTRIGGESPFHPIPPPPPPPPLFRMHGDFDSVGSNSS 480

Query: 481 TPRAISPEMDESETDGPPPTGERKLVKDSTAPMFCSSPDVNSKADKFIARFRADLKLQKM 540
           TPRA+ P+M ESE DG P  GERKLVKDST PMFCSSPDVNSKADKFIARFRADLKLQKM
Sbjct: 481 TPRAV-PDMGESEADGQPAAGERKLVKDSTIPMFCSSPDVNSKADKFIARFRADLKLQKM 540

Query: 541 NSIKEKTGRKRSNLGRTSGPGP 558
           NSIKEKT RKRSNLGRT G GP
Sbjct: 541 NSIKEKTARKRSNLGRTPGSGP 558

BLAST of HG10023461 vs. TAIR 10
Match: AT1G72790.1 (hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 253.4 bits (646), Expect = 4.1e-67
Identity = 227/600 (37.83%), Positives = 302/600 (50.33%), Query Frame = 0

Query: 2   EEDGNAPPPFWLQSSNSLHELDYNRRRRLNRASSFLLNSSAFLIVLLVIVLCFILIVIPK 61
           E+DG+A  PFWLQS    +   + R   L   ++ +     F     ++++ FI   IP 
Sbjct: 3   EDDGDASTPFWLQSRR--NNTYFRRTASLGGRTTTIATQIFFAGTAAILIVVFI---IPP 62

Query: 62  FVQFTSQLIRPQSIKKSWDSLNFLLVLFAIVCGFLSRNTGDDSRDSFEDR---------- 121
           F    SQ+ RP  ++KSWD LNF+LVLFA++CGFLSRNT +D  +  ++           
Sbjct: 63  FFSSVSQIFRPHLVRKSWDYLNFVLVLFAVLCGFLSRNTNNDESNHHKEEDIRNKFSTSP 122

Query: 122 SVSSRRTMKSNP-TTPRRWD----GYTDHRPNHYTLNRMKSSSSYPDLRLQESSLDAGDH 181
           S+  RR+  SN  TTPR W+    G    +  +   +R++S SSYPDLRL+E      D 
Sbjct: 123 SIIDRRSRVSNSGTTPRYWNDDRGGGGGDQTVYKRFSRLRSVSSYPDLRLREYE---ADE 182

Query: 182 RWRFYDDTHVTNHRYSSSDQLHRRR-------ETRPELERLDSDVRSIVFDRSEIRE--- 241
           RWRFYDDT V+  RY   D ++  +       E +P  E +D        + S++R    
Sbjct: 183 RWRFYDDTRVSQCRYEDVDPIYPNQSYRNWHEEGKPPPEDVDQTEDGDNGEGSKVRNGGS 242

Query: 242 -------------DIYSQPAIPSPPPPPPPPPRVSPPRPPSPPPTPPPPANTTPKVVKRR 301
                        ++  +  +PS PP  P P    PP PP PPP          K  KR+
Sbjct: 243 ETEKVEVVATAEAEVVEELKVPSAPPYIPSP----PPSPPRPPPA---------KQAKRK 302

Query: 302 LKRTLKVHSHTPDGEINQQHENGDSDVANFQRIQLPPLSPPSFYRESEQKSNKNEKKRGG 361
             R  +        +++ Q E  + D        +PP  P + Y    QKSNK EKK+GG
Sbjct: 303 TNRVYQ--------DVSPQEEKKERDDFVATTTPIPP--PATVY----QKSNKQEKKKGG 362

Query: 362 ASKEIWSALRRRKKKQRQKSIESFDAIIASQRDPTSSLPPPSPPPLPSPSVLQNLFSSKK 421
           A+K+   ALRR+KKKQRQ+SI+  D +  S  DP     PP PPP P P   Q LFSSKK
Sbjct: 363 ATKDFLIALRRKKKKQRQQSIDGLDLLFGS--DPPLVYSPPPPPP-PPPPFFQGLFSSKK 422

Query: 422 GKGKKVQSTPLPEPPPPSTASSEPKPKIEDQNQIHKSHEPPMELERLSSLNDEEYNTR-- 481
           GK KK  S P P PPPP      P+ + E +    K  + P+E  R S  N     T+  
Sbjct: 423 GKSKKNNSNPPPPPPPP-----PPERRYESRASTSKLRKAPVE-SRTSKPNPPAKVTQYV 482

Query: 482 -IGSESPFHPIPPPPPPPP------PFRMHGDFDSVGSNSSTPRAISPEMDESETDGPPP 541
             GSESP  PIPPPPPPPP       F   GD+  + S+      IS   DE +    P 
Sbjct: 483 GTGSESPLMPIPPPPPPPPFKMPAWKFVKRGDYVRMASD------ISISSDEPD---DPD 542

Query: 542 TGERKLVKDSTAPMFCSSPDVNSKADKFIARFRADLKLQKMNSIKEKTGRKRSNLGRTSG 555
             +    K++   MFC SPDV++KAD FIARFRA LKL+KMNS+K    R RSNLG   G
Sbjct: 543 VAQSAGSKEAAGSMFCPSPDVDTKADDFIARFRAGLKLEKMNSVK----RGRSNLGPEPG 545

BLAST of HG10023461 vs. TAIR 10
Match: AT5G57070.1 (hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 145.6 bits (366), Expect = 1.2e-34
Identity = 185/617 (29.98%), Positives = 268/617 (43.44%), Query Frame = 0

Query: 8   PPPFWLQSSNSLHELDYNRRRRLNRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVQFTS 67
           PP  W Q  ++     Y RRR    A   +L  +   +    I L F+  V+P F+  TS
Sbjct: 5   PPLIWPQFDST----GYARRRSSIPA---ILVPAMIGVTSAAIFLVFVTFVVPTFLSVTS 64

Query: 68  QLIRPQSIKKSWDSLNFLLVLFAIVCGFLSRNTGD----DSRDSFEDRSVSS-------- 127
           Q+++P S+K+ WDS+N +LV+FAI+CG L+R   D    +S    E+  V          
Sbjct: 65  QILQPASVKRGWDSINVVLVVFAILCGVLARRNDDGLSSESLHGGEEEEVGGGAVTNGEM 124

Query: 128 -----RRTMKSNPTTPRRW--DGYTDHRPNHY----------------TLNRMKSSSSYP 187
                 +   S+ T   +W  D Y   R   Y                 +   +SSSSYP
Sbjct: 125 TVGEISKISSSSSTVSEQWFDDVYDSDRLKIYESVSSRSFSHGLPVTGNVPLRRSSSSYP 184

Query: 188 DLRLQESSLDAGDHRWRFYDDTHVTNHRYSSSDQLHR-RRETRPELERLDSDVRSIVFDR 247
           DLR Q    + GD R+RFYDD  +  +R   S    + +  ++ E+E  +S+ + I  D 
Sbjct: 185 DLR-QGVFRETGDRRFRFYDDFEIDKYRSQDSSSYQQFQNLSKTEIEEEESEPKEIQIDT 244

Query: 248 SEIREDIYSQPAIPSPPPPPPPPPRVSPPRPPSPPPTPPPPANTTPKVVKRRLKRTLKVH 307
             ++           P  PP  PP   PP PP PP   P     T + V+ R        
Sbjct: 245 FVVK-----------PSSPPQQPPATPPPPPPPPPVEVPQKPRRTHRSVRNR-------- 304

Query: 308 SHTPDGEINQQHENGDSDVANFQRIQLPPLSPPS----------FYRESEQKSNKNEKKR 367
                       EN       F+R   PP SPP                 +K    ++++
Sbjct: 305 ---------DLQENAKRSETKFKRTFQPPPSPPPPPPPPPPQPLIAATPPRKQGTLQRRK 364

Query: 368 GGASKEI-------WSALRRRKKKQRQKSIESFDA--IIASQRDP---TSSLPPPSPPPL 427
             A+KEI       ++  +++KK Q+ K  E  ++  ++    +P    S +PPPSPPP 
Sbjct: 365 SNAAKEIKMVFASLYNQGKKKKKLQKSKRKERIESSPMVEDVTEPPQYQSLIPPPSPPPP 424

Query: 428 PSP---------SVLQNLFSSKKGKGKKVQSTPLPEPPPPSTASSEPKPKIEDQNQIHKS 487
           P P         SV   LF       KK+ S P P PPPP    ++  P+   +    KS
Sbjct: 425 PPPPPPPLRSSQSVFYGLFKKGVKSNKKIHSVPAP-PPPPPPRYTQFDPQTPPRRV--KS 484

Query: 488 HEPPMELERLSSLNDEEYNTRIGSESPFHPIPPPPPPPPPFR-------MHGDFDSVGSN 542
             PP   +  +   +EE N   G  SP   I PPPPPPPPFR       + GDF  + SN
Sbjct: 485 GRPPRPTKPKNF--NEENN---GQGSPLIQITPPPPPPPPFRVPPLKYVVSGDFAKIRSN 544
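
The percentages in each HSP header are plain ratios over the alignment length (gaps included). The sketch below, which assumes the header text has been copied from this page as a single line, parses the counts and recomputes the percentages as a consistency check. The names `parse_hsp_stats` and `percent` are illustrative and not part of any BLAST tool.

```python
import re

# Matches "Identity = 227/600 (37.83%), Positives = 302/600 (50.33%)".
# "Posi?tives" also accepts the variant spelling "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(count, total):
    """Return count/total as a percentage rounded to two decimals."""
    return round(100.0 * count / total, 2)


def parse_hsp_stats(line):
    """Extract identity/positive counts and reported percentages from an HSP header."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "align_len": int(ident_len),
        "identity_pct": float(ident_pct),
        "positive_pct": float(pos_pct),
    }


if __name__ == "__main__":
    headers = [
        "Identity = 227/600 (37.83%), Positives = 302/600 (50.33%), Query Frame = 0",
        "Identity = 185/617 (29.98%), Positives = 268/617 (43.44%), Query Frame = 0",
    ]
    for header in headers:
        s = parse_hsp_stats(header)
        # Recompute each percentage from the raw counts and compare.
        assert percent(s["identities"], s["align_len"]) == s["identity_pct"]
        assert percent(s["positives"], s["align_len"]) == s["positive_pct"]
        print(s)
```

For the two TAIR 10 hits, the recomputed values agree with the reported ones (227/600 gives 37.83%, 185/617 gives 29.98%).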

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_038896222.1  5.3e-274  91.41     serine/arginine repetitive matrix protein 1-like [Benincasa hispida]
XP_008462455.2  5.4e-263  87.70     PREDICTED: LOW QUALITY PROTEIN: serine/arginine repetitive matrix protein 1-like...
KAA0059415.1    2.1e-262  87.52     serine/arginine repetitive matrix protein 1-like [Cucumis melo var. makuwa] >TYK...
XP_011659637.1  5.8e-257  86.32     formin-like protein 20 [Cucumis sativus] >KGN45509.1 hypothetical protein Csa_01...
XP_023548433.1  9.3e-247  83.78     protein enabled homolog [Cucurbita pepo subsp. pepo]

Match Name      E-value   Identity  Description
A0A1S3CII2      2.6e-263  87.70     LOW QUALITY PROTEIN: serine/arginine repetitive matrix protein 1-like OS=Cucumis...
A0A5A7V0Q3      1.0e-262  87.52     Serine/arginine repetitive matrix protein 1-like OS=Cucumis melo var. makuwa OX=...
A0A0A0K9L4      2.8e-257  86.32     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G450660 PE=4 SV=1
A0A6J1GQY1      1.5e-245  83.78     protein enabled homolog OS=Cucurbita moschata OX=3662 GN=LOC111456249 PE=4 SV=1
A0A6J1JVT7      7.7e-239  82.74     serine/arginine repetitive matrix protein 1-like OS=Cucurbita maxima OX=3661 GN=...

Match Name      E-value   Identity  Description
AT1G72790.1     4.1e-67   37.83     hydroxyproline-rich glycoprotein family protein
AT5G57070.1     1.2e-34   29.98     hydroxyproline-rich glycoprotein family protein

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                           Source       Source Term     Source Description     Alignment
IPR008480  Protein of unknown function DUF761, plant  PFAM         PF05553         DUF761                 coord: 513..540; e-value: 1.0E-10; score: 40.9
None       No IPR available                          MOBIDB_LITE  mobidb-lite     disorder_prediction    coord: 531..561
None       No IPR available                          MOBIDB_LITE  mobidb-lite     disorder_prediction    coord: 468..482
None       No IPR available                          MOBIDB_LITE  mobidb-lite     disorder_prediction    coord: 212..515
None       No IPR available                          MOBIDB_LITE  mobidb-lite     disorder_prediction    coord: 102..116
None       No IPR available                          MOBIDB_LITE  mobidb-lite     disorder_prediction    coord: 101..134
None       No IPR available                          MOBIDB_LITE  mobidb-lite     disorder_prediction    coord: 222..256
None       No IPR available                          MOBIDB_LITE  mobidb-lite     disorder_prediction    coord: 411..435
None       No IPR available                          MOBIDB_LITE  mobidb-lite     disorder_prediction    coord: 531..547
None       No IPR available                          MOBIDB_LITE  mobidb-lite     disorder_prediction    coord: 447..463
None       No IPR available                          MOBIDB_LITE  mobidb-lite     disorder_prediction    coord: 311..330
None       No IPR available                          MOBIDB_LITE  mobidb-lite     disorder_prediction    coord: 359..373
None       No IPR available                          PANTHER      PTHR33098:SF36  OS01G0584100 PROTEIN   coord: 1..551
None       No IPR available                          PANTHER      PTHR33098       COTTON FIBER (DUF761)  coord: 1..551
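
Several of the mobidb-lite intervals listed above overlap or nest inside one another, so adding up their lengths would count some residues more than once. A minimal sketch of merging them into non-overlapping spans follows. It assumes the coordinates are 1-based and inclusive, and it treats directly adjacent spans as one. The function names are illustrative only.

```python
import re

COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")

# mobidb-lite alignment entries as listed in the InterPro table.
MOBIDB_COORDS = [
    "coord: 531..561", "coord: 468..482", "coord: 212..515",
    "coord: 102..116", "coord: 101..134", "coord: 222..256",
    "coord: 411..435", "coord: 531..547", "coord: 447..463",
    "coord: 311..330", "coord: 359..373",
]


def parse_coords(entries):
    """Turn 'coord: A..B' strings into (start, end) integer tuples."""
    spans = []
    for entry in entries:
        m = COORD.search(entry)
        if m:
            spans.append((int(m.group(1)), int(m.group(2))))
    return spans


def merge_intervals(spans):
    """Merge overlapping or adjacent inclusive intervals."""
    merged = []
    for start, end in sorted(spans):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous span: extend it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def covered_residues(spans):
    """Count positions covered by at least one interval."""
    return sum(end - start + 1 for start, end in merge_intervals(spans))


if __name__ == "__main__":
    spans = parse_coords(MOBIDB_COORDS)
    print(merge_intervals(spans))   # [(101, 134), (212, 515), (531, 561)]
    print(covered_residues(spans))  # 369
```

Under these assumptions, the eleven entries collapse into three predicted disordered regions covering 369 positions in total.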

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10023461.1  HG10023461.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane