ClCG07G008110 (gene) Watermelon (Charleston Gray) v2.5

Overview
Name: ClCG07G008110
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Hydroxyproline-rich glycoprotein family protein
Location: CG_Chr07: 21604934 .. 21607548 (-)
RNA-Seq Expression: ClCG07G008110
Synteny: ClCG07G008110
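
The Location value in the overview can be turned into a span length with simple arithmetic. The sketch below is illustrative only: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text):
    """Split a location such as 'CG_Chr07: 21604934 .. 21607548 (-)' into parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end, strand = parse_location("CG_Chr07: 21604934 .. 21607548 (-)")
print(chrom, strand, span_length(start, end))  # CG_Chr07 - 2615
```

Under that assumption the gene spans 2615 bp on the minus strand of CG_Chr07.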
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TCTTCTTCTTCTTCCTTCTCTCTCGCTAACGAACCTTCTCTTCACTTTCTTACTGCAAAATCTCTTGTTTTGATTTTGCCTAAGAATTTCATCTCATGATCTCTTGTGTATGATCGGAACCACGATTCTTTCTATGAAACCGATCGGCAATTCTCTGGCTAGCGATGAGACGACGTACGGATGCTGATGATTTGAGGCCTGTTAACAATACTTTCCAAACCATTACTGCCGCCGCTGATGCGATCGCCACTGTTGATCATCGTTTTCCTCGGGCTACTGCCGTCCAGGTATTGTATTCTTCAACTTCTTTTCAATCATTTGGAATTTTTTTATTTAAGTGTTGGTTACTTATGTATTGAGAATATTGACCTTGTTTTTGGAAATTACTGTTCATTCTTAATTCCGCTAGGTATTAAGATGTAGTTGGTTTCCGAGGGATGAGATGATATAGCGGAATGGAATCTGTGTTGTGTTGTTTTTTTGTGGAATTCTATATAAGGCTTTTAATTTCCCTTTTTTTCCCTGCTCCTTCTGTTTGTTTGCTTAGAAATTGAGAATCGTCACATGGGTACATTAAAAAAGGGAATCACGAGCTGCGCTTCGTTTCTGCTTTTTCTGCTTTTGGCTTCCTTTCTCGCATGATTATTAGATTTTTAAGGATTGGAAAGTACCAAGATTAATGTTATTGGGTTGGATGACTGTGTCAGGAAGCAATATGCTGTTGTAGCCCAATGTTATTTGACTTTTTGTAATAACACTGCCCTTTTTAACCAGAAAGGCATCTGTTCCATTCTTTATCTTTCAAAAGAGAGAGAGAGAGAGAGAGAGAGAGAAAGGTTAATTTTAAAATGCATTCTATATGGAGGAAGAGAATTGGTTTTTATATAAAGATTCCAACAAATAGGGCTGTCATTTGTCCAAACTCTTCCTACTGTTTTGTACTGATGTCATTATCAGTTTCCTGTTCTTCTGTTTGTTTGGTTACTCACAGAAAAGAAGATGGGGCAGTTGTTGGAGTATTTATTGGTGCTTTGGATCCCTCAAACAGAGGAAAAGAATTGGGCACGCTGTTTTGGTACCAGAATCAAGTCCATCTGAGCCTCATGAAAATACATTGCAATCACCAGATATTGTGCTTCCTTTTGCTGCACCTCCCTCTTCCCCTGCATCCTTCCTTCAATCTGACCCACCTTCTGCTACACAGTCACCTACACCTTTAATTTCTTTCTCTTCTCTCACTTCTAACATGTATTCTCCTGATGGGCCTTCCTCCATTTTTGCCATTGGCCCATTTGCTCATGAAACACAACTAGTTTCTCCACCTCTCAATTTCTCTACTCTCACGACTGAACCATCAACTGCTCCCTTCACTCCTCCTGAGTCTATTCACTTGACTACCCCTTCTTCCCCTGAAGTTCCTTTTGCTCAGTTTCTTCAACCTACCCTCCACAAACCTGAGTCTGATCATCAATATCCATTTCCTAATGATGACTTCCAATCTTACCAATTCTATCCAGGCAGTCCGGTTAGTCACCTCATATCACCACGGTCCGTTATTTCTCGTTCTGGGGCTTCGTCGCCTTTGCCTGACTATGATTTTGCTTCCTTTGGTTCTCAATTTTTGAATTTCCCATTAGAAGTTCCACCTACTTTGTTGAATCTTGACAAGCATTCCATTCATAACTGGCCACAACGCCAGAGTACTGATTCTTGCACTCAAGATTCTATAGAATTCAAATCAAGTAATGATTTTGTTTTGAATCCCCAAACTTCAGAATCTATGTCAGATCACCACGCAACAAATAAATCTCAAAATATTCAAATTCTCATTGATGGAAGCCAAACGGAGGAGGAGCCAGCTGCTACTAATCATAGATTCTCATTTGAATTATCTGATGGAGATGTTCTATTACAAAGCGTAGGAAGTAAGCCACTGGAATCAAATGAACTTACTGTTGCATCGTCTCCAATACATGAACCATCTGAAACGGCTAAAGAAA
ATTCTCCTATTGGTGACCATACTTCAAATGTTACAGGAGAAAAGACAAAAGCAGACGGTGAAGAAGCACATCAGCATCAAGAACATCATTCCATTACTCTTGGATCTGTGAAGGAATTCAATTTTGATAATGGCAATGGAAGTGATACACATAAGCCAAATATAAATTCAGAATGGTGGACTAATGCAAAGGATGTTGACACAGAAGGCACGACCACGAGGGCCTGGTCATTCTTTCCAATGGCGCAGCAAAGATGAGCTGACTCGTGCTAACTTATCCTCTGGAATCTCCTCATGTCCATCATCCTTTGCAGTTTCAAATTGATAGGTAAGACAAACTGCAAGAGGAATGGTGGGTTTTGTAGGTATTAAAGAGGCCGTCAAATCATGAGAGAGCCAGACCAGAATGATAGAGAAATTTGTTGATTCGGTTGGGCCTTATTAACAAACAATTGTGGCTCGTCACCTGAATTGTCATAGATATTAGTAGTCTGATAGATATTGGAAGTGTGTAAATATGGTAATAAAAAGTGTTATTTTTTCTTTTTATCTTCACAATATCTCGTTTATTGACTTTGAATTAGCAGAATACACAAAAAGTTGAGAACATATTGAG

mRNA sequence

TCTTCTTCTTCTTCCTTCTCTCTCGCTAACGAACCTTCTCTTCACTTTCTTACTGCAAAATCTCTTGTTTTGATTTTGCCTAAGAATTTCATCTCATGATCTCTTGTGTATGATCGGAACCACGATTCTTTCTATGAAACCGATCGGCAATTCTCTGGCTAGCGATGAGACGACGTACGGATGCTGATGATTTGAGGCCTGTTAACAATACTTTCCAAACCATTACTGCCGCCGCTGATGCGATCGCCACTGTTGATCATCGTTTTCCTCGGGCTACTGCCGTCCAGAAAAGAAGATGGGGCAGTTGTTGGAGTATTTATTGGTGCTTTGGATCCCTCAAACAGAGGAAAAGAATTGGGCACGCTGTTTTGGTACCAGAATCAAGTCCATCTGAGCCTCATGAAAATACATTGCAATCACCAGATATTGTGCTTCCTTTTGCTGCACCTCCCTCTTCCCCTGCATCCTTCCTTCAATCTGACCCACCTTCTGCTACACAGTCACCTACACCTTTAATTTCTTTCTCTTCTCTCACTTCTAACATGTATTCTCCTGATGGGCCTTCCTCCATTTTTGCCATTGGCCCATTTGCTCATGAAACACAACTAGTTTCTCCACCTCTCAATTTCTCTACTCTCACGACTGAACCATCAACTGCTCCCTTCACTCCTCCTGAGTCTATTCACTTGACTACCCCTTCTTCCCCTGAAGTTCCTTTTGCTCAGTTTCTTCAACCTACCCTCCACAAACCTGAGTCTGATCATCAATATCCATTTCCTAATGATGACTTCCAATCTTACCAATTCTATCCAGGCAGTCCGGTTAGTCACCTCATATCACCACGGTCCGTTATTTCTCGTTCTGGGGCTTCGTCGCCTTTGCCTGACTATGATTTTGCTTCCTTTGGTTCTCAATTTTTGAATTTCCCATTAGAAGTTCCACCTACTTTGTTGAATCTTGACAAGCATTCCATTCATAACTGGCCACAACGCCAGAGTACTGATTCTTGCACTCAAGATTCTATAGAATTCAAATCAAGTAATGATTTTGTTTTGAATCCCCAAACTTCAGAATCTATGTCAGATCACCACGCAACAAATAAATCTCAAAATATTCAAATTCTCATTGATGGAAGCCAAACGGAGGAGGAGCCAGCTGCTACTAATCATAGATTCTCATTTGAATTATCTGATGGAGATGTTCTATTACAAAGCGTAGGAAGTAAGCCACTGGAATCAAATGAACTTACTGTTGCATCGTCTCCAATACATGAACCATCTGAAACGGCTAAAGAAAATTCTCCTATTGGTGACCATACTTCAAATGTTACAGGAGAAAAGACAAAAGCAGACGGTGAAGAAGCACATCAGCATCAAGAACATCATTCCATTACTCTTGGATCTGTGAAGGAATTCAATTTTGATAATGGCAATGGAAGTGATACACATAAGCCAAATATAAATTCAGAATGGTGGACTAATGCAAAGGATGTTGACACAGAAGGCACGACCACGAGGGCCTGGTCATTCTTTCCAATGGCGCAGCAAAGATGAGCTGACTCGTGCTAACTTATCCTCTGGAATCTCCTCATGTCCATCATCCTTTGCAGTTTCAAATTGATAGGTAAGACAAACTGCAAGAGGAATGGTGGGTTTTGTAGGTATTAAAGAGGCCGTCAAATCATGAGAGAGCCAGACCAGAATGATAGAGAAATTTGTTGATTCGGTTGGGCCTTATTAACAAACAATTGTGGCTCGTCACCTGAATTGTCATAGATATTAGTAGTCTGATAGATATTGGAAGTGTGTAAATATGGTAATAAAAAGTGTTATTTTTTCTTTTTATCTTCACAATATCTCGTTTATTGACTTTGAATTAGCAGAATACACAAAAAGTTGAGAACATATTGAG

Coding sequence (CDS)

ATGAGACGACGTACGGATGCTGATGATTTGAGGCCTGTTAACAATACTTTCCAAACCATTACTGCCGCCGCTGATGCGATCGCCACTGTTGATCATCGTTTTCCTCGGGCTACTGCCGTCCAGAAAAGAAGATGGGGCAGTTGTTGGAGTATTTATTGGTGCTTTGGATCCCTCAAACAGAGGAAAAGAATTGGGCACGCTGTTTTGGTACCAGAATCAAGTCCATCTGAGCCTCATGAAAATACATTGCAATCACCAGATATTGTGCTTCCTTTTGCTGCACCTCCCTCTTCCCCTGCATCCTTCCTTCAATCTGACCCACCTTCTGCTACACAGTCACCTACACCTTTAATTTCTTTCTCTTCTCTCACTTCTAACATGTATTCTCCTGATGGGCCTTCCTCCATTTTTGCCATTGGCCCATTTGCTCATGAAACACAACTAGTTTCTCCACCTCTCAATTTCTCTACTCTCACGACTGAACCATCAACTGCTCCCTTCACTCCTCCTGAGTCTATTCACTTGACTACCCCTTCTTCCCCTGAAGTTCCTTTTGCTCAGTTTCTTCAACCTACCCTCCACAAACCTGAGTCTGATCATCAATATCCATTTCCTAATGATGACTTCCAATCTTACCAATTCTATCCAGGCAGTCCGGTTAGTCACCTCATATCACCACGGTCCGTTATTTCTCGTTCTGGGGCTTCGTCGCCTTTGCCTGACTATGATTTTGCTTCCTTTGGTTCTCAATTTTTGAATTTCCCATTAGAAGTTCCACCTACTTTGTTGAATCTTGACAAGCATTCCATTCATAACTGGCCACAACGCCAGAGTACTGATTCTTGCACTCAAGATTCTATAGAATTCAAATCAAGTAATGATTTTGTTTTGAATCCCCAAACTTCAGAATCTATGTCAGATCACCACGCAACAAATAAATCTCAAAATATTCAAATTCTCATTGATGGAAGCCAAACGGAGGAGGAGCCAGCTGCTACTAATCATAGATTCTCATTTGAATTATCTGATGGAGATGTTCTATTACAAAGCGTAGGAAGTAAGCCACTGGAATCAAATGAACTTACTGTTGCATCGTCTCCAATACATGAACCATCTGAAACGGCTAAAGAAAATTCTCCTATTGGTGACCATACTTCAAATGTTACAGGAGAAAAGACAAAAGCAGACGGTGAAGAAGCACATCAGCATCAAGAACATCATTCCATTACTCTTGGATCTGTGAAGGAATTCAATTTTGATAATGGCAATGGAAGTGATACACATAAGCCAAATATAAATTCAGAATGGTGGACTAATGCAAAGGATGTTGACACAGAAGGCACGACCACGAGGGCCTGGTCATTCTTTCCAATGGCGCAGCAAAGATGA

Protein sequence

MRRRTDADDLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPESSPSEPHENTLQSPDIVLPFAAPPSSPASFLQSDPPSATQSPTPLISFSSLTSNMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTAPFTPPESIHLTTPSSPEVPFAQFLQPTLHKPESDHQYPFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWPQRQSTDSCTQDSIEFKSSNDFVLNPQTSESMSDHHATNKSQNIQILIDGSQTEEEPAATNHRFSFELSDGDVLLQSVGSKPLESNELTVASSPIHEPSETAKENSPIGDHTSNVTGEKTKADGEEAHQHQEHHSITLGSVKEFNFDNGNGSDTHKPNINSEWWTNAKDVDTEGTTTRAWSFFPMAQQR
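
The protein sequence above corresponds to a codon-by-codon translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code (the page does not say which table was used), the first and last codons of the CDS can be checked as follows. The function name `translate` is illustrative and not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGAGACGACGTACGGATGCTGAT"))  # first 8 codons -> MRRRTDAD
print(translate("GCGCAGCAAAGATGA"))           # last 5 codons  -> AQQR
```

Both fragments agree with the start (MRRRTDAD) and end (AQQR) of the protein sequence listed here.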
Homology
BLAST of ClCG07G008110 vs. NCBI nr
Match: XP_038884079.1 (uncharacterized protein LOC120075005 isoform X2 [Benincasa hispida])

HSP 1 Score: 840.1 bits (2169), Expect = 9.1e-240
Identity = 428/465 (92.04%), Positives = 439/465 (94.41%), Query Frame = 0

Query: 1   MRRRTDADDLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQ 60
           MRRRTD DD RPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQ
Sbjct: 1   MRRRTDTDDSRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQ 60

Query: 61  RKRIGHAVLVPESSP-SEPHENTLQSPDIVLPFAAPPSSPASFLQSDPPSATQSPTPLIS 120
           RKRIGHAVLVPESSP SE HEN+LQSPDIVLPFAAPPSSP SFLQS+PPSATQSPT LIS
Sbjct: 61  RKRIGHAVLVPESSPSSESHENSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTALIS 120

Query: 121 FSSLTSNMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTAPFTPPESIHLTTPS 180
           F+SLT+NMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTAPFTPPESIHLTTPS
Sbjct: 121 FTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTAPFTPPESIHLTTPS 180

Query: 181 SPEVPFAQFLQPTLHKPESDHQYPFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPL 240
           SPEVPFAQFLQPTL K ESDHQYPFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPL
Sbjct: 181 SPEVPFAQFLQPTLQKSESDHQYPFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPL 240

Query: 241 PDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWPQRQSTDSCTQDSIEFKSSNDFVLNP 300
           PDYDFASFGSQFLNFPLEVPPTLLNLDK SIHNW QRQSTDSCTQDSIE KSSNDFVLNP
Sbjct: 241 PDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWRQRQSTDSCTQDSIELKSSNDFVLNP 300

Query: 301 QTSESMSDHHATNKSQNIQILIDGSQTEEE-PAATNHRFSFELSDGDVLLQSVGSKPLES 360
           QTSESMSDHHATN+SQNIQILIDG+Q EEE P ATNHRFSFELSDGD LLQSVGSKPL+S
Sbjct: 301 QTSESMSDHHATNESQNIQILIDGNQKEEEVPGATNHRFSFELSDGDALLQSVGSKPLDS 360

Query: 361 NELTVASSPIHEPSETAKENSPI-GDHTSNVTGEKTKADGEEAHQHQEHHSITLGSVKEF 420
           NE+ VASSPIHEP ETAKENSP+  DHTSNVT  KTKA+ EEAHQHQEHHSITLGSVKEF
Sbjct: 361 NEVAVASSPIHEPFETAKENSPVDDDHTSNVTEGKTKAEVEEAHQHQEHHSITLGSVKEF 420

Query: 421 NFDNGNGSDTHKPNINSEWWTNAKDVDTEGTTTRAWSFFPMAQQR 463
           NFDNGNGSDTHK N+NSEWWTNAKDVDTEGTT  AWSFFPM QQR
Sbjct: 421 NFDNGNGSDTHKANLNSEWWTNAKDVDTEGTTNGAWSFFPMTQQR 465
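
The percentages in each HSP header are the reported counts divided by the alignment length. The alignment length can exceed the 463-residue query because gap columns are counted. A small sketch that recomputes them from a header line is given below; the helper name `hsp_percentages` is hypothetical, and the regular expression simply picks up every `label = n/m` pair.

```python
import re

def hsp_percentages(header):
    """Recompute 'label = n/m' ratios in an HSP header as percentages."""
    result = {}
    for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", header):
        result[label] = round(100 * int(num) / int(den), 2)
    return result

header = "Identity = 428/465 (92.04%), Positives = 439/465 (94.41%), Query Frame = 0"
print(hsp_percentages(header))  # {'Identity': 92.04, 'Positives': 94.41}
```

The same function applies unchanged to the other HSP headers in this section.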

BLAST of ClCG07G008110 vs. NCBI nr
Match: XP_004146564.1 (uncharacterized protein LOC101220378 isoform X1 [Cucumis sativus])

HSP 1 Score: 813.9 bits (2101), Expect = 7.0e-232
Identity = 419/465 (90.11%), Positives = 429/465 (92.26%), Query Frame = 0

Query: 1   MRRRTDADDLRPV-NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLK 60
           MRRRTD DD RPV NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSC SIYWCFGS+K
Sbjct: 1   MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSIK 60

Query: 61  QRKRIGHAVLVPESSP-SEPHENTLQSPDIVLPFAAPPSSPASFLQSDPPSATQSPTPLI 120
           QRKRIGHAVLVPE SP SEPHENTLQSPDIVLPFAAPPSSP S LQS+PPSA QSPT LI
Sbjct: 61  QRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAMQSPTALI 120

Query: 121 SFSSLTSNMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTAPFTPPESIHLTTP 180
           SF+SLT+NMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPST PFTPPESIHLTTP
Sbjct: 121 SFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPPESIHLTTP 180

Query: 181 SSPEVPFAQFLQPTLHKPESDHQYPFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSP 240
           SSPEVPFAQF+QPTL K ESD+QY FPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSP
Sbjct: 181 SSPEVPFAQFVQPTLPKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSP 240

Query: 241 LPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWPQRQSTDSCTQDSIEFKSSNDFVLN 300
           LPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNW QRQSTDSCTQDSIEFKSSNDFVLN
Sbjct: 241 LPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN 300

Query: 301 PQTSESMSDHHATNKSQNIQILI-DGSQTEEEPAATNHRFSFELSDGDVLLQSVGSKPLE 360
           PQTSESMSDHHATN+SQNIQILI DGS+ EEEP ATNHRFSFELSDGDVLLQSVGSKPLE
Sbjct: 301 PQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLE 360

Query: 361 SNELTVASSPIHEPSETAKENSPIGDHTSNVTGEKTKADGEEAHQHQEHHSITLGSVKEF 420
           SNEL V SSPIHEP ET KENSP GDHTSNV  EKTKADG+EAHQ QEHHS+TLGSVKEF
Sbjct: 361 SNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQRQEHHSVTLGSVKEF 420

Query: 421 NFDNGNGSDTHKPNINSEWWTNAKDVDTEGTTTRAWSFFPMAQQR 463
           NFDNGNGSDTH PNINSEWW NAKD  TE T T  WSFFPM QQR
Sbjct: 421 NFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR 464

BLAST of ClCG07G008110 vs. NCBI nr
Match: XP_008452033.1 (PREDICTED: uncharacterized protein LOC103493162 isoform X2 [Cucumis melo] >TYK16635.1 mucin-2 [Cucumis melo var. makuwa])

HSP 1 Score: 812.0 bits (2096), Expect = 2.7e-231
Identity = 414/464 (89.22%), Positives = 425/464 (91.59%), Query Frame = 0

Query: 1   MRRRTDADDLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQ 60
           MRRRTD DD RPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSC SIYWCFGSLKQ
Sbjct: 1   MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQ 60

Query: 61  RKRIGHAVLVPESSP-SEPHENTLQSPDIVLPFAAPPSSPASFLQSDPPSATQSPTPLIS 120
           RKRIGHAVLVPE SP SEPHENTLQSPDIVLPFAAPPSSP S LQS+PPSA QSPT LIS
Sbjct: 61  RKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAIQSPTALIS 120

Query: 121 FSSLTSNMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTAPFTPPESIHLTTPS 180
           F+SLT+NMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPST PFTPPESIHLTTPS
Sbjct: 121 FTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPS 180

Query: 181 SPEVPFAQFLQPTLHKPESDHQYPFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPL 240
           SPEVPFAQF+ P+L K ESD+QY FPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPL
Sbjct: 181 SPEVPFAQFVPPSLQKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPL 240

Query: 241 PDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWPQRQSTDSCTQDSIEFKSSNDFVLNP 300
           PDYDFASFGSQFLNFPLEVPPTL NLDKHSIHNW QRQSTDSCTQDSIEFKSSNDFVLNP
Sbjct: 241 PDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP 300

Query: 301 QTSESMSDHHATNKSQNIQILI-DGSQTEEEPAATNHRFSFELSDGDVLLQSVGSKPLES 360
            TSESM DHHATN+SQNIQILI DGS+ EEEP ATNHRFSFELSDGDVL QSVGSKPLES
Sbjct: 301 HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLES 360

Query: 361 NELTVASSPIHEPSETAKENSPIGDHTSNVTGEKTKADGEEAHQHQEHHSITLGSVKEFN 420
           NEL V SSPIHEP ET KENSP GDHTSNV  EKTKADG+EAHQHQEHHS+ LGSVKEFN
Sbjct: 361 NELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQHQEHHSVALGSVKEFN 420

Query: 421 FDNGNGSDTHKPNINSEWWTNAKDVDTEGTTTRAWSFFPMAQQR 463
           FDN NGSDTH P INS+WWTNAKD  TEGTTT AWSFFP  QQR
Sbjct: 421 FDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR 464

BLAST of ClCG07G008110 vs. NCBI nr
Match: XP_008452032.1 (PREDICTED: uncharacterized protein LOC103493162 isoform X1 [Cucumis melo])

HSP 1 Score: 807.4 bits (2084), Expect = 6.5e-230
Identity = 414/465 (89.03%), Positives = 425/465 (91.40%), Query Frame = 0

Query: 1   MRRRTDADDLRPVNNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCWSIYWCFGSLK 60
           MRRRTD DD RPVNNTFQTITAAADAIATVDHRFPRATAV QKRRWGSC SIYWCFGSLK
Sbjct: 1   MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQQKRRWGSCLSIYWCFGSLK 60

Query: 61  QRKRIGHAVLVPESSP-SEPHENTLQSPDIVLPFAAPPSSPASFLQSDPPSATQSPTPLI 120
           QRKRIGHAVLVPE SP SEPHENTLQSPDIVLPFAAPPSSP S LQS+PPSA QSPT LI
Sbjct: 61  QRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAIQSPTALI 120

Query: 121 SFSSLTSNMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTAPFTPPESIHLTTP 180
           SF+SLT+NMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPST PFTPPESIHLTTP
Sbjct: 121 SFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTP 180

Query: 181 SSPEVPFAQFLQPTLHKPESDHQYPFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSP 240
           SSPEVPFAQF+ P+L K ESD+QY FPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSP
Sbjct: 181 SSPEVPFAQFVPPSLQKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSP 240

Query: 241 LPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWPQRQSTDSCTQDSIEFKSSNDFVLN 300
           LPDYDFASFGSQFLNFPLEVPPTL NLDKHSIHNW QRQSTDSCTQDSIEFKSSNDFVLN
Sbjct: 241 LPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN 300

Query: 301 PQTSESMSDHHATNKSQNIQILI-DGSQTEEEPAATNHRFSFELSDGDVLLQSVGSKPLE 360
           P TSESM DHHATN+SQNIQILI DGS+ EEEP ATNHRFSFELSDGDVL QSVGSKPLE
Sbjct: 301 PHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLE 360

Query: 361 SNELTVASSPIHEPSETAKENSPIGDHTSNVTGEKTKADGEEAHQHQEHHSITLGSVKEF 420
           SNEL V SSPIHEP ET KENSP GDHTSNV  EKTKADG+EAHQHQEHHS+ LGSVKEF
Sbjct: 361 SNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQHQEHHSVALGSVKEF 420

Query: 421 NFDNGNGSDTHKPNINSEWWTNAKDVDTEGTTTRAWSFFPMAQQR 463
           NFDN NGSDTH P INS+WWTNAKD  TEGTTT AWSFFP  QQR
Sbjct: 421 NFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR 465

BLAST of ClCG07G008110 vs. NCBI nr
Match: KAA0044829.1 (mucin-2 [Cucumis melo var. makuwa])

HSP 1 Score: 807.0 bits (2083), Expect = 8.5e-230
Identity = 411/464 (88.58%), Positives = 424/464 (91.38%), Query Frame = 0

Query: 1   MRRRTDADDLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQ 60
           MRRRTD DD RPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSC SIYWCFGSLKQ
Sbjct: 1   MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQ 60

Query: 61  RKRIGHAVLVPESSP-SEPHENTLQSPDIVLPFAAPPSSPASFLQSDPPSATQSPTPLIS 120
           RKRIGHAVLVPE SP SEPHENTLQSPDIVLPFAAPPSSP S LQS+PPSA QSPT LIS
Sbjct: 61  RKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAIQSPTALIS 120

Query: 121 FSSLTSNMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTAPFTPPESIHLTTPS 180
           F+SLT+NMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPST PFTPPESIHLTTPS
Sbjct: 121 FTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPS 180

Query: 181 SPEVPFAQFLQPTLHKPESDHQYPFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPL 240
           SPEVPFAQF+ P+  K ESD+QY FPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPL
Sbjct: 181 SPEVPFAQFVPPSHQKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPL 240

Query: 241 PDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWPQRQSTDSCTQDSIEFKSSNDFVLNP 300
           PDYDFASFGSQFLNFPL+VPPTL N+DKHSIHNW QRQSTDSCTQDSIEFKSSNDFVLNP
Sbjct: 241 PDYDFASFGSQFLNFPLKVPPTLSNIDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP 300

Query: 301 QTSESMSDHHATNKSQNIQILI-DGSQTEEEPAATNHRFSFELSDGDVLLQSVGSKPLES 360
            TSESM DHHATN+SQNIQILI DGS+ EEEP ATNHRFSFELSDGDVL QSVGSKPLES
Sbjct: 301 HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLES 360

Query: 361 NELTVASSPIHEPSETAKENSPIGDHTSNVTGEKTKADGEEAHQHQEHHSITLGSVKEFN 420
           NEL V SSPIHEP ET KENSP GDHTSNV  EKTKADG+EAHQHQEHHS+ LGSVKEFN
Sbjct: 361 NELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQHQEHHSVALGSVKEFN 420

Query: 421 FDNGNGSDTHKPNINSEWWTNAKDVDTEGTTTRAWSFFPMAQQR 463
           FDN NGSDTH P INS+WWTNAKD  TEGTTT AWSFFP  QQR
Sbjct: 421 FDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR 464

BLAST of ClCG07G008110 vs. ExPASy Swiss-Prot
Match: Q9SRE5 (Uncharacterized protein At1g76660 OS=Arabidopsis thaliana OX=3702 GN=At1g76660 PE=2 SV=1)

HSP 1 Score: 146.4 bits (368), Expect = 8.2e-34
Identity = 104/210 (49.52%), Positives = 125/210 (59.52%), Query Frame = 0

Query: 41  QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPE------SSPSEPHE----NTLQSPDIVL 100
           Q++RWG C  ++ CF S K  KRI  A  +PE      S P+  H+    N   +  I L
Sbjct: 7   QRKRWGGCLGVFSCFKSQKGGKRIVPASRIPEGGNVSASQPNGAHQAGVLNNQAAGGINL 66

Query: 101 PFAAPPSSPASFLQSDPPSATQSPTPLISFSSLTSNMYSPDGP-SSIFAIGPFAHETQLV 160
              APPSSPASF  S  PS TQSP     + SL +N  SP GP SS++A GP+AHETQLV
Sbjct: 67  SLLAPPSSPASFTNSALPSTTQSPN---CYLSLAAN--SPGGPSSSMYATGPYAHETQLV 126

Query: 161 SPPLNFSTLTTEPSTAPFT-PPESIHLTTPSSPEVPFAQFLQPTLHKPESDHQYPFPNDD 220
           SPP+ FST TTEPSTAPFT PPE   LT PSSP+VP+A+FL  ++    S   +   ND 
Sbjct: 127 SPPV-FSTFTTEPSTAPFTPPPELARLTAPSSPDVPYARFLTSSMDLKNSGKGH--YNDL 186

Query: 221 FQSYQFYPGSPVSHLISPRSVISRSGASSP 239
             +Y  YPGSP S L SP S  S  G  SP
Sbjct: 187 QATYSLYPGSPASALRSPISRASGDGLLSP 208

BLAST of ClCG07G008110 vs. ExPASy TrEMBL
Match: A0A5D3CYQ2 (Mucin-2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold21G004630 PE=4 SV=1)

HSP 1 Score: 812.0 bits (2096), Expect = 1.3e-231
Identity = 414/464 (89.22%), Positives = 425/464 (91.59%), Query Frame = 0

Query: 1   MRRRTDADDLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQ 60
           MRRRTD DD RPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSC SIYWCFGSLKQ
Sbjct: 1   MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQ 60

Query: 61  RKRIGHAVLVPESSP-SEPHENTLQSPDIVLPFAAPPSSPASFLQSDPPSATQSPTPLIS 120
           RKRIGHAVLVPE SP SEPHENTLQSPDIVLPFAAPPSSP S LQS+PPSA QSPT LIS
Sbjct: 61  RKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAIQSPTALIS 120

Query: 121 FSSLTSNMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTAPFTPPESIHLTTPS 180
           F+SLT+NMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPST PFTPPESIHLTTPS
Sbjct: 121 FTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPS 180

Query: 181 SPEVPFAQFLQPTLHKPESDHQYPFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPL 240
           SPEVPFAQF+ P+L K ESD+QY FPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPL
Sbjct: 181 SPEVPFAQFVPPSLQKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPL 240

Query: 241 PDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWPQRQSTDSCTQDSIEFKSSNDFVLNP 300
           PDYDFASFGSQFLNFPLEVPPTL NLDKHSIHNW QRQSTDSCTQDSIEFKSSNDFVLNP
Sbjct: 241 PDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP 300

Query: 301 QTSESMSDHHATNKSQNIQILI-DGSQTEEEPAATNHRFSFELSDGDVLLQSVGSKPLES 360
            TSESM DHHATN+SQNIQILI DGS+ EEEP ATNHRFSFELSDGDVL QSVGSKPLES
Sbjct: 301 HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLES 360

Query: 361 NELTVASSPIHEPSETAKENSPIGDHTSNVTGEKTKADGEEAHQHQEHHSITLGSVKEFN 420
           NEL V SSPIHEP ET KENSP GDHTSNV  EKTKADG+EAHQHQEHHS+ LGSVKEFN
Sbjct: 361 NELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQHQEHHSVALGSVKEFN 420

Query: 421 FDNGNGSDTHKPNINSEWWTNAKDVDTEGTTTRAWSFFPMAQQR 463
           FDN NGSDTH P INS+WWTNAKD  TEGTTT AWSFFP  QQR
Sbjct: 421 FDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR 464

BLAST of ClCG07G008110 vs. ExPASy TrEMBL
Match: A0A1S3BSY8 (uncharacterized protein LOC103493162 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103493162 PE=4 SV=1)

HSP 1 Score: 812.0 bits (2096), Expect = 1.3e-231
Identity = 414/464 (89.22%), Positives = 425/464 (91.59%), Query Frame = 0

Query: 1   MRRRTDADDLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQ 60
           MRRRTD DD RPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSC SIYWCFGSLKQ
Sbjct: 1   MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQ 60

Query: 61  RKRIGHAVLVPESSP-SEPHENTLQSPDIVLPFAAPPSSPASFLQSDPPSATQSPTPLIS 120
           RKRIGHAVLVPE SP SEPHENTLQSPDIVLPFAAPPSSP S LQS+PPSA QSPT LIS
Sbjct: 61  RKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAIQSPTALIS 120

Query: 121 FSSLTSNMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTAPFTPPESIHLTTPS 180
           F+SLT+NMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPST PFTPPESIHLTTPS
Sbjct: 121 FTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPS 180

Query: 181 SPEVPFAQFLQPTLHKPESDHQYPFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPL 240
           SPEVPFAQF+ P+L K ESD+QY FPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPL
Sbjct: 181 SPEVPFAQFVPPSLQKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPL 240

Query: 241 PDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWPQRQSTDSCTQDSIEFKSSNDFVLNP 300
           PDYDFASFGSQFLNFPLEVPPTL NLDKHSIHNW QRQSTDSCTQDSIEFKSSNDFVLNP
Sbjct: 241 PDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP 300

Query: 301 QTSESMSDHHATNKSQNIQILI-DGSQTEEEPAATNHRFSFELSDGDVLLQSVGSKPLES 360
            TSESM DHHATN+SQNIQILI DGS+ EEEP ATNHRFSFELSDGDVL QSVGSKPLES
Sbjct: 301 HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLES 360

Query: 361 NELTVASSPIHEPSETAKENSPIGDHTSNVTGEKTKADGEEAHQHQEHHSITLGSVKEFN 420
           NEL V SSPIHEP ET KENSP GDHTSNV  EKTKADG+EAHQHQEHHS+ LGSVKEFN
Sbjct: 361 NELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQHQEHHSVALGSVKEFN 420

Query: 421 FDNGNGSDTHKPNINSEWWTNAKDVDTEGTTTRAWSFFPMAQQR 463
           FDN NGSDTH P INS+WWTNAKD  TEGTTT AWSFFP  QQR
Sbjct: 421 FDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR 464

BLAST of ClCG07G008110 vs. ExPASy TrEMBL
Match: A0A1S3BSB0 (uncharacterized protein LOC103493162 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103493162 PE=4 SV=1)

HSP 1 Score: 807.4 bits (2084), Expect = 3.2e-230
Identity = 414/465 (89.03%), Positives = 425/465 (91.40%), Query Frame = 0

Query: 1   MRRRTDADDLRPVNNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCWSIYWCFGSLK 60
           MRRRTD DD RPVNNTFQTITAAADAIATVDHRFPRATAV QKRRWGSC SIYWCFGSLK
Sbjct: 1   MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQQKRRWGSCLSIYWCFGSLK 60

Query: 61  QRKRIGHAVLVPESSP-SEPHENTLQSPDIVLPFAAPPSSPASFLQSDPPSATQSPTPLI 120
           QRKRIGHAVLVPE SP SEPHENTLQSPDIVLPFAAPPSSP S LQS+PPSA QSPT LI
Sbjct: 61  QRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAIQSPTALI 120

Query: 121 SFSSLTSNMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTAPFTPPESIHLTTP 180
           SF+SLT+NMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPST PFTPPESIHLTTP
Sbjct: 121 SFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTP 180

Query: 181 SSPEVPFAQFLQPTLHKPESDHQYPFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSP 240
           SSPEVPFAQF+ P+L K ESD+QY FPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSP
Sbjct: 181 SSPEVPFAQFVPPSLQKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSP 240

Query: 241 LPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWPQRQSTDSCTQDSIEFKSSNDFVLN 300
           LPDYDFASFGSQFLNFPLEVPPTL NLDKHSIHNW QRQSTDSCTQDSIEFKSSNDFVLN
Sbjct: 241 LPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN 300

Query: 301 PQTSESMSDHHATNKSQNIQILI-DGSQTEEEPAATNHRFSFELSDGDVLLQSVGSKPLE 360
           P TSESM DHHATN+SQNIQILI DGS+ EEEP ATNHRFSFELSDGDVL QSVGSKPLE
Sbjct: 301 PHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLE 360

Query: 361 SNELTVASSPIHEPSETAKENSPIGDHTSNVTGEKTKADGEEAHQHQEHHSITLGSVKEF 420
           SNEL V SSPIHEP ET KENSP GDHTSNV  EKTKADG+EAHQHQEHHS+ LGSVKEF
Sbjct: 361 SNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQHQEHHSVALGSVKEF 420

Query: 421 NFDNGNGSDTHKPNINSEWWTNAKDVDTEGTTTRAWSFFPMAQQR 463
           NFDN NGSDTH P INS+WWTNAKD  TEGTTT AWSFFP  QQR
Sbjct: 421 NFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR 465

BLAST of ClCG07G008110 vs. ExPASy TrEMBL
Match: A0A5A7TUB1 (Mucin-2 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold74G001210 PE=4 SV=1)

HSP 1 Score: 807.0 bits (2083), Expect = 4.1e-230
Identity = 411/464 (88.58%), Positives = 424/464 (91.38%), Query Frame = 0

Query: 1   MRRRTDADDLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQ 60
           MRRRTD DD RPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSC SIYWCFGSLKQ
Sbjct: 1   MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQ 60

Query: 61  RKRIGHAVLVPESSP-SEPHENTLQSPDIVLPFAAPPSSPASFLQSDPPSATQSPTPLIS 120
           RKRIGHAVLVPE SP SEPHENTLQSPDIVLPFAAPPSSP S LQS+PPSA QSPT LIS
Sbjct: 61  RKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAIQSPTALIS 120

Query: 121 FSSLTSNMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTAPFTPPESIHLTTPS 180
           F+SLT+NMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPST PFTPPESIHLTTPS
Sbjct: 121 FTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPS 180

Query: 181 SPEVPFAQFLQPTLHKPESDHQYPFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPL 240
           SPEVPFAQF+ P+  K ESD+QY FPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPL
Sbjct: 181 SPEVPFAQFVPPSHQKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPL 240

Query: 241 PDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWPQRQSTDSCTQDSIEFKSSNDFVLNP 300
           PDYDFASFGSQFLNFPL+VPPTL N+DKHSIHNW QRQSTDSCTQDSIEFKSSNDFVLNP
Sbjct: 241 PDYDFASFGSQFLNFPLKVPPTLSNIDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP 300

Query: 301 QTSESMSDHHATNKSQNIQILI-DGSQTEEEPAATNHRFSFELSDGDVLLQSVGSKPLES 360
            TSESM DHHATN+SQNIQILI DGS+ EEEP ATNHRFSFELSDGDVL QSVGSKPLES
Sbjct: 301 HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLES 360

Query: 361 NELTVASSPIHEPSETAKENSPIGDHTSNVTGEKTKADGEEAHQHQEHHSITLGSVKEFN 420
           NEL V SSPIHEP ET KENSP GDHTSNV  EKTKADG+EAHQHQEHHS+ LGSVKEFN
Sbjct: 361 NELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQHQEHHSVALGSVKEFN 420

Query: 421 FDNGNGSDTHKPNINSEWWTNAKDVDTEGTTTRAWSFFPMAQQR 463
           FDN NGSDTH P INS+WWTNAKD  TEGTTT AWSFFP  QQR
Sbjct: 421 FDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR 464

BLAST of ClCG07G008110 vs. ExPASy TrEMBL
Match: A0A6J1C828 (uncharacterized protein At1g76660-like OS=Momordica charantia OX=3673 GN=LOC111008285 PE=4 SV=1)

HSP 1 Score: 740.3 bits (1910), Expect = 4.8e-210
Identity = 382/468 (81.62%), Positives = 413/468 (88.25%), Query Frame = 0

Query: 1   MRRRTDAD---DLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGS 60
           MRRR DAD   DL PVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGS
Sbjct: 1   MRRRPDADADADLSPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGS 60

Query: 61  LKQRKRIGHAVLVPESSPS-EPHENTLQSPDIVLPFAAPPSSPASFLQSDPPSATQSPTP 120
           LKQRKRIGHAVLVPE SPS EP ENTLQSPDIVLPFAAPPSSP SFLQS+PPSATQSPT 
Sbjct: 61  LKQRKRIGHAVLVPEPSPSTEPPENTLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTA 120

Query: 121 LISFSSLTSNMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTAPFTPPESIHLT 180
           ++SF+SLT+NMYSPDGPSSIFA+GPFAHETQLVSPPLNFST+TT+PSTAPFTPPESIHLT
Sbjct: 121 ILSFTSLTANMYSPDGPSSIFAVGPFAHETQLVSPPLNFSTVTTQPSTAPFTPPESIHLT 180

Query: 181 TPSSPEVPFAQFLQPTLHKPESDHQY-PFPNDDFQSYQFYPGSPVSHLISPRSVISRSGA 240
           TPSSPEVPFAQ+LQP+  K ESDHQY  FPNDDFQSYQFYPGSPVSHLISPRSVISRSGA
Sbjct: 181 TPSSPEVPFAQYLQPSHQKVESDHQYDQFPNDDFQSYQFYPGSPVSHLISPRSVISRSGA 240

Query: 241 SSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWPQRQSTDSCTQDSIEFKSSNDF 300
           SSPLPD DF   GS F NFP+EVPPTLLNLD+HSI +W  +QS+DSCTQ+S+ +KSSNDF
Sbjct: 241 SSPLPDCDFTPSGSSFSNFPIEVPPTLLNLDQHSIQDWRLQQSSDSCTQNSVGYKSSNDF 300

Query: 301 VLNPQTSESMSDHHATNKSQNIQILIDGSQTEEEPAATNHRFSFELSDGDVLLQSVGSKP 360
           VLNPQTSES+SD+HA+N+  NIQIL DGSQ  +E AA NHRFSFELSD D LL+SV +KP
Sbjct: 301 VLNPQTSESVSDYHASNEYHNIQILTDGSQ-RDEAAAANHRFSFELSDEDALLKSVENKP 360

Query: 361 LESNELTVASSPIHEPSETAKENSPIGDHTSNVTGEKTKADGEEAHQHQ--EHHSITLGS 420
           LESNEL VASSPIHEP ETAKE S +G HTSN T E+ KADGEE H HQ  EHHS+TLG+
Sbjct: 361 LESNELAVASSPIHEPLETAKETSHVGGHTSNDTEEQEKADGEEVHGHQEVEHHSVTLGT 420

Query: 421 VKEFNFDNGNGSDTHKPNINSEWWTNAKDVDTEGTTTRAWSFFPMAQQ 462
           VKEFNFDNGNG DT KPNINS WW N KD +TEGTTT AWSFFP+ QQ
Sbjct: 421 VKEFNFDNGNGCDTLKPNINSAWWANGKDAETEGTTTGAWSFFPITQQ 467

BLAST of ClCG07G008110 vs. TAIR 10
Match: AT5G52430.1 (hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 233.8 bits (595), Expect = 2.8e-61
Identity = 188/468 (40.17%), Positives = 242/468 (51.71%), Query Frame = 0

Query: 13  VNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPE 72
           VNN+ +T+ AAA AI T + R  + ++ QK RWG CWS+Y CFG+ K  KRIG+AVLVPE
Sbjct: 5   VNNSVETVNAAATAIVTAESRV-QPSSSQKGRWGKCWSLYSCFGTQKNNKRIGNAVLVPE 64

Query: 73  ----SSPSEPHENTLQSPDIVLPFAAPPSSPASFLQSDPPSATQSPT-PLISFSSLTSNM 132
                 P    +N+  S  +VLPF APPSSPASFLQSDP S + SP  PL    SLTSN 
Sbjct: 65  PVTSGVPVVTVQNSATSTTVVLPFIAPPSSPASFLQSDPSSVSHSPVGPL----SLTSNT 124

Query: 133 YSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTAPFTPP--ESIHLTTPSSPEVPF 192
           +SP  P S+F +GP+A+ETQ V+PP+ FS   TEPSTAP+TPP   S+H+TTPSSPEVPF
Sbjct: 125 FSPKEPQSVFTVGPYANETQPVTPPV-FSAFITEPSTAPYTPPPESSVHITTPSSPEVPF 184

Query: 193 AQFLQPTLHKPESD------HQYPFPNDDFQSYQFYPGSP-VSHLISPRSVISRSGASSP 252
           AQ L  +L     D       ++   + +F+S Q  PGSP   +LISP SVIS SG SSP
Sbjct: 185 AQLLTSSLELTRRDSTSGMNQKFSSSHYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSP 244

Query: 253 LPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWPQRQSTDSCTQDSIEFKSSNDFVLN 312
            P        S  + F +  PP  L  +  +   W  R  + S T             L 
Sbjct: 245 YPG------KSPMVEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPVG-HGSGLASGALT 304

Query: 313 PQTSESMSDHHATNKS----QNIQI----LIDGSQTEEEPAATNHRFSFELSDGDVLLQS 372
           P   E +S +   N +    QN QI     +  S    E    +HR SFEL+  DV  + 
Sbjct: 305 PNGPEIVSGNLTPNNTTWPLQN-QISEVASLANSDHGSEVMVADHRVSFELTGEDV-ARC 364

Query: 373 VGSKPLESNELTVASSPIHEPSETAKENSPIGDHTSNVTGEKTKADGE-EAHQHQEHHSI 432
           + SK      L  +   ++       E S   D   N+  EK   D E E H+ Q+  S 
Sbjct: 365 LASK------LNRSHDRMNNNDRIETEESSSTDIRRNI--EKRSGDRENEQHRIQKLSSS 424

Query: 433 TLGSVKEFNFDNGNGSDTHKPNINSEWWTNAKDVDTEGTTTRAWSFFP 458
           ++GS KEF FD                  N KD + E     +WSFFP
Sbjct: 425 SIGSSKEFKFD------------------NTKDENIEKVAGNSWSFFP 431

BLAST of ClCG07G008110 vs. TAIR 10
Match: AT4G25620.1 (hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 206.5 bits (524), Expect = 4.8e-53
Identity = 184/492 (37.40%), Positives = 243/492 (49.39%), Query Frame = 0

Query: 10  LRPVNN-TFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAV 69
           +R VNN +  T+ AAA AI + + R  + ++VQK+R GS WS+YWCFGS K  KRIGHAV
Sbjct: 1   MRSVNNSSVDTVNAAASAIVSAESR-TQPSSVQKKR-GSWWSLYWCFGSKKNNKRIGHAV 60

Query: 70  LVPESSPS----EPHEN-TLQSPDIVLPFAAPPSSPASFLQSDPPSATQSPTPLISFSSL 129
           LVPE + S     P +N +  S  I +PF APPSSPASFL S PPSA+ +P P +   SL
Sbjct: 61  LVPEPAASGAAVAPVQNSSSNSTSIFMPFIAPPSSPASFLPSGPPSASHTPDPGL-LCSL 120

Query: 130 TSNMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTAPFTPPESIHLTTPSSPEV 189
           T N      P S F IGP+AHETQ V+PP+ FS  TTEPSTAPFTPP      +PSSPEV
Sbjct: 121 TVN-----EPPSAFTIGPYAHETQPVTPPV-FSAFTTEPSTAPFTPPPE----SPSSPEV 180

Query: 190 PFAQFLQPTLHKPE------SDHQYPFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASS 249
           PFAQ L  +L +         + ++   + +F+S Q YPGSP  +LISP      SG SS
Sbjct: 181 PFAQLLTSSLERARRNSGGGMNQKFSAAHYEFKSCQVYPGSPGGNLISP-----GSGTSS 240

Query: 250 PLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWPQRQSTDS--------------CT 309
           P P           + F +  PP  L  +  +   W  R  + S               T
Sbjct: 241 PYPG------KCSIIEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPAGQGSRLGSGALT 300

Query: 310 QDSIEFKSSNDFVLNPQTSESMSDHHATNKSQNIQILID---------------GSQTEE 369
            D  +  S    V+ P  +E++      N +     L+D                S+  +
Sbjct: 301 PDGSKLTSG---VVTPNGAETVIRMSYGNLTPLEGSLLDSQISEVASLANSDHGSSRHND 360

Query: 370 EPAATNHRFSFELSDGDVLLQSVGSKPLESNELTVASSPIHEPSETAKENSPIGDHTSNV 429
           E     HR SFEL+  DV  + + SK        +  S  HE +         G+H   +
Sbjct: 361 EALVVPHRVSFELTGEDV-ARCLASK--------LNRSGSHEKAS--------GEH---L 420

Query: 430 TGEKTKADGE-EAHQHQEHHSITLGSVKEFNFDNGNGSDTHKPNINSEWWTNAKDVDTEG 459
                K  GE E+ Q Q+  S + GS KEF FD+ N     K  I SEWW N K      
Sbjct: 421 RPNCCKTSGETESEQSQKLRSFSTGSNKEFKFDSTNEEMIEK--IRSEWWANEKVAGKGD 443

BLAST of ClCG07G008110 vs. TAIR 10
Match: AT1G63720.1 (BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1); Has 490 Blast hits to 394 proteins in 96 species: Archae - 0; Bacteria - 2; Metazoa - 132; Fungi - 88; Plants - 175; Viruses - 14; Other Eukaryotes - 79 (source: NCBI BLink). )

HSP 1 Score: 203.8 bits (517), Expect = 3.1e-52
Identity = 135/263 (51.33%), Positives = 167/263 (63.50%), Query Frame = 0

Query: 14  NNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPE 73
           NN F TI AAA AIA+ D R  +++ + +KR+W + WS+  CFGS +QRKRIG++VLVPE
Sbjct: 8   NNVFDTINAAASAIASSDDRLHQSSPIHKKRKWWNRWSLLKCFGSSRQRKRIGNSVLVPE 67

Query: 74  -----SSPSEPHENTLQSPDIVLPFAAPPSSPASFLQSDPPSATQSPTPLISFSSLTSNM 133
                SS S    +  +S    LPF APPSSPASF QS+PPSATQSP  ++SFS L  N 
Sbjct: 68  PVSMSSSNSTTSNSGYRSVITTLPFIAPPSSPASFFQSEPPSATQSPVGILSFSPLPCN- 127

Query: 134 YSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTAPFTPP---ESIHL--TTPSSPE 193
                  SIFAIGP+AHETQLVSPP+ FST TTEPS+AP TPP    SI+L  TTPSSPE
Sbjct: 128 ----NRPSIFAIGPYAHETQLVSPPV-FSTYTTEPSSAPITPPLDDSSIYLTTTTPSSPE 187

Query: 194 VPFAQFLQPTLHKPESDHQYPFPND-DFQSYQFYPGSPVSHLISPRSVISRSGASSPLPD 253
           VPFAQ            +++P  +  +FQ YQ  PGSP+  LISP      SG +SP PD
Sbjct: 188 VPFAQLFNSNHQTGSYGYKFPMSSSYEFQFYQLPPGSPLGQLISPS---PGSGPTSPFPD 247

Query: 254 YDFASFGSQFLNFPLEVPPTLLN 265
            +     S F +F +  PP LL+
Sbjct: 248 GE----TSLFPHFQVSDPPKLLS 257

BLAST of ClCG07G008110 vs. TAIR 10
Match: AT1G76660.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1); Has 353 Blast hits to 231 proteins in 60 species: Archae - 0; Bacteria - 6; Metazoa - 57; Fungi - 22; Plants - 125; Viruses - 4; Other Eukaryotes - 139 (source: NCBI BLink). )

HSP 1 Score: 146.4 bits (368), Expect = 5.8e-35
Identity = 104/210 (49.52%), Positives = 125/210 (59.52%), Query Frame = 0

Query: 41  QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPE------SSPSEPHE----NTLQSPDIVL 100
           Q++RWG C  ++ CF S K  KRI  A  +PE      S P+  H+    N   +  I L
Sbjct: 7   QRKRWGGCLGVFSCFKSQKGGKRIVPASRIPEGGNVSASQPNGAHQAGVLNNQAAGGINL 66

Query: 101 PFAAPPSSPASFLQSDPPSATQSPTPLISFSSLTSNMYSPDGP-SSIFAIGPFAHETQLV 160
              APPSSPASF  S  PS TQSP     + SL +N  SP GP SS++A GP+AHETQLV
Sbjct: 67  SLLAPPSSPASFTNSALPSTTQSPN---CYLSLAAN--SPGGPSSSMYATGPYAHETQLV 126

Query: 161 SPPLNFSTLTTEPSTAPFT-PPESIHLTTPSSPEVPFAQFLQPTLHKPESDHQYPFPNDD 220
           SPP+ FST TTEPSTAPFT PPE   LT PSSP+VP+A+FL  ++    S   +   ND 
Sbjct: 127 SPPV-FSTFTTEPSTAPFTPPPELARLTAPSSPDVPYARFLTSSMDLKNSGKGH--YNDL 186

Query: 221 FQSYQFYPGSPVSHLISPRSVISRSGASSP 239
             +Y  YPGSP S L SP S  S  G  SP
Sbjct: 187 QATYSLYPGSPASALRSPISRASGDGLLSP 208
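
The percentages in each HSP statistics line follow directly from the match counts and the alignment length (for example, 184/492 gives 37.40%). A minimal Python sketch for re-deriving them from an exported line is shown below. The function name `check_hsp_stats` and the returned dictionary keys are illustrative assumptions, not part of any BLAST tool, and the pattern accepts both the "Positives" and "Postives" spellings.

```python
import re

# Matches lines such as:
#   Identity = 184/492 (37.40%), Positives = 243/492 (49.39%), Query Frame = 0
# The optional "i" tolerates the "Postives" spelling found in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_hsp_stats(line, tolerance=0.01):
    """Recompute identity/positive percentages and compare with the reported ones.

    Hypothetical helper for illustration only.
    """
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    ident_calc = 100.0 * int(ident) / int(ident_len)
    pos_calc = 100.0 * int(pos) / int(pos_len)
    return {
        "identity_pct": round(ident_calc, 2),
        "positives_pct": round(pos_calc, 2),
        "identity_ok": abs(ident_calc - float(ident_pct)) <= tolerance,
        "positives_ok": abs(pos_calc - float(pos_pct)) <= tolerance,
    }


stats = check_hsp_stats(
    "Identity = 135/263 (51.33%), Postives = 167/263 (63.50%), Query Frame = 0"
)
```

For the AT1G63720.1 line used here, both recomputed values agree with the reported percentages to two decimal places.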

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_038884079.1  9.1e-240  92.04     uncharacterized protein LOC120075005 isoform X2 [Benincasa hispida]
XP_004146564.1  7.0e-232  90.11     uncharacterized protein LOC101220378 isoform X1 [Cucumis sativus]
XP_008452033.1  2.7e-231  89.22     PREDICTED: uncharacterized protein LOC103493162 isoform X2 [Cucumis melo] >TYK16...
XP_008452032.1  6.5e-230  89.03     PREDICTED: uncharacterized protein LOC103493162 isoform X1 [Cucumis melo]
KAA0044829.1    8.5e-230  88.58     mucin-2 [Cucumis melo var. makuwa]

Match Name      E-value   Identity  Description
Q9SRE5          8.2e-34   49.52     Uncharacterized protein At1g76660 OS=Arabidopsis thaliana OX=3702 GN=At1g76660 P...

Match Name      E-value   Identity  Description
A0A5D3CYQ2      1.3e-231  89.22     Mucin-2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold21G004630 PE=4 S...
A0A1S3BSY8      1.3e-231  89.22     uncharacterized protein LOC103493162 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...
A0A1S3BSB0      3.2e-230  89.03     uncharacterized protein LOC103493162 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A5A7TUB1      4.1e-230  88.58     Mucin-2 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold74G001210 PE=4 S...
A0A6J1C828      4.8e-210  81.62     uncharacterized protein At1g76660-like OS=Momordica charantia OX=3673 GN=LOC1110...

Match Name      E-value   Identity  Description
AT5G52430.1     2.8e-61   40.17     hydroxyproline-rich glycoprotein family protein
AT4G25620.1     4.8e-53   37.40     hydroxyproline-rich glycoprotein family protein
AT1G63720.1     3.1e-52   51.33     BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein fam...
AT1G76660.1     5.8e-35   49.52     FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...
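
In the Arabidopsis summary rows, the hit with the lowest E-value (AT5G52430.1) is not the hit with the highest percent identity (AT1G63720.1), because each identity value is computed over a different alignment length. The Python sketch below ranks the four rows both ways. The values are copied from the rows above; the function name `rank_hits` and the `1e-50` cutoff are hypothetical choices made only for illustration.

```python
# (accession, E-value as printed, percent identity) from the TAIR 10 summary rows.
tair_hits = [
    ("AT5G52430.1", "2.8e-61", 40.17),
    ("AT4G25620.1", "4.8e-53", 37.40),
    ("AT1G63720.1", "3.1e-52", 51.33),
    ("AT1G76660.1", "5.8e-35", 49.52),
]


def rank_hits(hits, max_evalue=1e-50):
    """Keep hits at or below an E-value cutoff, most significant first.

    The cutoff is an arbitrary illustrative value, not a recommendation.
    """
    kept = [
        (acc, float(evalue), identity)
        for acc, evalue, identity in hits
        if float(evalue) <= max_evalue
    ]
    return sorted(kept, key=lambda hit: hit[1])


ranked = rank_hits(tair_hits)
most_significant = ranked[0][0]
highest_identity = max(tair_hits, key=lambda hit: hit[2])[0]
```

With this cutoff, three of the four hits are retained, and `most_significant` and `highest_identity` name different accessions.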
InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR Term   IPR Description                         Source       Source Term    Source Description                               Alignment
None       No IPR available                        MOBIDB_LITE  mobidb-lite    disorder_prediction                              coord: 97..116
None       No IPR available                        MOBIDB_LITE  mobidb-lite    disorder_prediction                              coord: 362..409
None       No IPR available                        MOBIDB_LITE  mobidb-lite    disorder_prediction                              coord: 389..409
None       No IPR available                        MOBIDB_LITE  mobidb-lite    disorder_prediction                              coord: 98..116
None       No IPR available                        MOBIDB_LITE  mobidb-lite    disorder_prediction                              coord: 362..388
None       No IPR available                        PANTHER      PTHR31798:SF2  HYDROXYPROLINE-RICH GLYCOPROTEIN FAMILY PROTEIN  coord: 8..460
IPR040420  Uncharacterized protein At1g76660-like  PANTHER      PTHR31798      HYDROXYPROLINE-RICH GLYCOPROTEIN-LIKE            coord: 8..460
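
The five mobidb-lite rows overlap one another (for example, 362..388 and 389..409 both fall inside 362..409). One way to summarize them is to merge overlapping coordinate ranges into distinct regions. The Python sketch below does this for the coordinates listed above; `merge_ranges` is a hypothetical helper, and treating the coordinates as 1-based inclusive residue positions is an assumption.

```python
# Disorder-prediction coordinates copied from the mobidb-lite rows.
disorder_ranges = [(97, 116), (362, 409), (389, 409), (98, 116), (362, 388)]


def merge_ranges(ranges):
    """Merge overlapping inclusive (start, end) ranges into distinct regions."""
    merged = []
    for start, end in sorted(ranges):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend its end if needed.
            prev_start, prev_end = merged[-1]
            merged[-1] = (prev_start, max(prev_end, end))
        else:
            merged.append((start, end))
    return merged


regions = merge_ranges(disorder_ranges)
# Inclusive coordinates, so each region spans end - start + 1 residues.
total_residues = sum(end - start + 1 for start, end in regions)
```

Under these assumptions the five rows collapse to two regions, 97..116 and 362..409, covering 68 residues in total.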

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
ClCG07G008110.1  ClCG07G008110.1  mRNA