Cla97C02G032100 (gene) Watermelon (97103) v2.5

Overview
NameCla97C02G032100
Typegene
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionBEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein .
LocationCla97Chr02: 5163807 .. 5164685 (+)
RNA-Seq ExpressionCla97C02G032100
SyntenyCla97C02G032100
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAAACATCACGAACCAAAGCAATGAACCCCACGGCGACCCTGACCTCCAACTCTCACTTCGGCCGCCGGCCGGGGATCCCTCACCCCAACCATTCTCACTCTGGTCATCGGTCGGAGATCCCTCACCGCAACCATTCTCACTCCGGTCGTCGGTCAGGGATCTTTTATCCCAACCATTCTCACTCCGGCCGCCGGTCAGAGATCTCTCACCCCATCCATTCTCACTCTCGCCGCCGGTCAGGGATCTCTCACCCCGACCGTCAGCTGTCCCCGTCACCGCCTGTCAGGCAAATGCAAATGCACTAACAAGCATGAGAATCACTCGCAATTTAGGAACTCGTCGATCATCTCTCCGTCGCTGCAATTCCCGATCACCAAGGACAACAGAGAGGATCGAGCCACCATATCCATGGTCAACAAACCGACGAGCCGTGGTTCAAACCCTAAACGACCTGAAATCAAATCAAATCCTCACAATCACTGGAGACGTCCGATGCCGACAATGCCAAAGACAATACAATATCGAATACGACACCGTCTCAAAATTCGAGGAGATTGCAAGCTTTGTGGAGGAGAACAAGAACTTGTTTCGCGATCGGGCACCGAGGTCGTGGATGAACCCTAATTACCCGACGTGTCGATTTTGCGGACATGAGAATGGAGCGAGGCCGGTGATCCCGGGGGAATGGAGAAAGATCAATTGGTTGTTCTTGCTTTTGGGAGAAATGCTTGGAGCTTTGAATCTGAATCATCTGAAATACTTCTGCAGTTACACTAACAATCATCGAACTGGTGCAAAGAATCGTCTTCTTTATCTAACTTATATCACTTTGTGCCACCAAGTTGATCCTTCTGGCCGTTTCCGTCGAGTTTGA

mRNA sequence

ATGGAAAACATCACGAACCAAAGCAATGAACCCCACGGCGACCCTGACCTCCAACTCTCACTTCGGCCGCCGGCCGGGGATCCCTCACCCCAACCATTCTCACTCTGGTCATCGGTCGGAGATCCCTCACCGCAACCATTCTCACTCCGGTCGTCGGTCAGGGATCTTTTATCCCAACCATTCTCACTCCGGCCGCCGGTCAGAGATCTCTCACCCCATCCATTCTCACTCTCGCCGCCGGTCAGGGATCTCTCACCCCGACCGTCAGCTGTCCCCGTCACCGCCTGTCAGGCAAATGCAAATGCACTAACAAGCATGAGAATCACTCGCAATTTAGGAACTCGTCGATCATCTCTCCGTCGCTGCAATTCCCGATCACCAAGGACAACAGAGAGGATCGAGCCACCATATCCATGGTCAACAAACCGACGAGCCGTGGTTCAAACCCTAAACGACCTGAAATCAAATCAAATCCTCACAATCACTGGAGACGTCCGATGCCGACAATGCCAAAGACAATACAATATCGAATACGACACCGTCTCAAAATTCGAGGAGATTGCAAGCTTTGTGGAGGAGAACAAGAACTTGTTTCGCGATCGGGCACCGAGGTCGTGGATGAACCCTAATTACCCGACGTGTCGATTTTGCGGACATGAGAATGGAGCGAGGCCGGTGATCCCGGGGGAATGGAGAAAGATCAATTGGTTGTTCTTGCTTTTGGGAGAAATGCTTGGAGCTTTGAATCTGAATCATCTGAAATACTTCTGCAGTTACACTAACAATCATCGAACTGGTGCAAAGAATCGTCTTCTTTATCTAACTTATATCACTTTGTGCCACCAAGTTGATCCTTCTGGCCGTTTCCGTCGAGTTTGA

Coding sequence (CDS)

ATGGAAAACATCACGAACCAAAGCAATGAACCCCACGGCGACCCTGACCTCCAACTCTCACTTCGGCCGCCGGCCGGGGATCCCTCACCCCAACCATTCTCACTCTGGTCATCGGTCGGAGATCCCTCACCGCAACCATTCTCACTCCGGTCGTCGGTCAGGGATCTTTTATCCCAACCATTCTCACTCCGGCCGCCGGTCAGAGATCTCTCACCCCATCCATTCTCACTCTCGCCGCCGGTCAGGGATCTCTCACCCCGACCGTCAGCTGTCCCCGTCACCGCCTGTCAGGCAAATGCAAATGCACTAACAAGCATGAGAATCACTCGCAATTTAGGAACTCGTCGATCATCTCTCCGTCGCTGCAATTCCCGATCACCAAGGACAACAGAGAGGATCGAGCCACCATATCCATGGTCAACAAACCGACGAGCCGTGGTTCAAACCCTAAACGACCTGAAATCAAATCAAATCCTCACAATCACTGGAGACGTCCGATGCCGACAATGCCAAAGACAATACAATATCGAATACGACACCGTCTCAAAATTCGAGGAGATTGCAAGCTTTGTGGAGGAGAACAAGAACTTGTTTCGCGATCGGGCACCGAGGTCGTGGATGAACCCTAATTACCCGACGTGTCGATTTTGCGGACATGAGAATGGAGCGAGGCCGGTGATCCCGGGGGAATGGAGAAAGATCAATTGGTTGTTCTTGCTTTTGGGAGAAATGCTTGGAGCTTTGAATCTGAATCATCTGAAATACTTCTGCAGTTACACTAACAATCATCGAACTGGTGCAAAGAATCGTCTTCTTTATCTAACTTATATCACTTTGTGCCACCAAGTTGATCCTTCTGGCCGTTTCCGTCGAGTTTGA

Protein sequence

MENITNQSNEPHGDPDLQLSLRPPAGDPSPQPFSLWSSVGDPSPQPFSLRSSVRDLLSQPFSLRPPVRDLSPHPFSLSPPVRDLSPRPSAVPVTACQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTERIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVRCRQCQRQYNIEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLTYITLCHQVDPSGRFRRV
Homology
BLAST of Cla97C02G032100 vs. NCBI nr
Match: XP_008447299.1 (PREDICTED: uncharacterized protein LOC103489770 [Cucumis melo])

HSP 1 Score: 387.5 bits (994), Expect = 1.0e-103
Identity = 189/217 (87.10%), Postives = 195/217 (89.86%), Query Frame = 0

Query: 76  SLSPPVRDLSPRPSAVPVTACQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTERIEP 135
           SL PP  DL  RPS  P     A ANALT+ RITRNLGTRRSSLRRCNSRSPRTTE IEP
Sbjct: 15  SLRPPSGDLRSRPS--PPAIGHARANALTNRRITRNLGTRRSSLRRCNSRSPRTTETIEP 74

Query: 136 PYPWSTNRRAVVQTLNDLKSNQILTITGDVRCRQCQRQYNIEYDTVSKFEEIASFVEENK 195
           PYPWSTNRRA+V+TLNDL+S+QIL ITGDVRCRQCQ +Y IEYD VSKFEEIASFVEENK
Sbjct: 75  PYPWSTNRRAMVRTLNDLRSSQILQITGDVRCRQCQIEYTIEYDMVSKFEEIASFVEENK 134

Query: 196 NLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKY 255
           NLFRDRAPRSWMNPNYPTCRFCGHENGARPVIP EWRKINWLFLLLGEMLG LNLNHLKY
Sbjct: 135 NLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPDEWRKINWLFLLLGEMLGVLNLNHLKY 194

Query: 256 FCSYTNNHRTGAKNRLLYLTYITLCHQVDPSGRFRRV 293
           FCSYTNNHRTGAKNRLLYLTYITLCHQVDPSGRF RV
Sbjct: 195 FCSYTNNHRTGAKNRLLYLTYITLCHQVDPSGRFNRV 229

BLAST of Cla97C02G032100 vs. NCBI nr
Match: XP_011659748.1 (uncharacterized protein LOC105436256 [Cucumis sativus] >KGN44335.1 hypothetical protein Csa_015666 [Cucumis sativus])

HSP 1 Score: 369.0 bits (946), Expect = 3.8e-98
Identity = 179/217 (82.49%), Postives = 192/217 (88.48%), Query Frame = 0

Query: 76  SLSPPVRDLSPRPSAVPVTACQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTERIEP 135
           SL PP   LS +PSA P+    A  NA+T+MR+TR+LGTRRSS +RCNSRSPRTTE IEP
Sbjct: 21  SLRPPSGHLSSQPSAAPIG--HARPNAVTNMRVTRSLGTRRSSHQRCNSRSPRTTETIEP 80

Query: 136 PYPWSTNRRAVVQTLNDLKSNQILTITGDVRCRQCQRQYNIEYDTVSKFEEIASFVEENK 195
           PYPWSTNRRA+V+TLNDLKSNQIL ITGDV+CRQCQ +Y IEYD  SKFEEIASFVEENK
Sbjct: 81  PYPWSTNRRAMVRTLNDLKSNQILQITGDVQCRQCQVEYTIEYDMDSKFEEIASFVEENK 140

Query: 196 NLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKY 255
           N FRDRAP+SWMNPNYPTCRFCGHENGARPVIP +WRKINWLFLLLGEMLG LNLNHLKY
Sbjct: 141 NSFRDRAPQSWMNPNYPTCRFCGHENGARPVIPKQWRKINWLFLLLGEMLGVLNLNHLKY 200

Query: 256 FCSYTNNHRTGAKNRLLYLTYITLCHQVDPSGRFRRV 293
           FCS T NHRTGAKNRLLYLTYITLCHQVDPSGRF RV
Sbjct: 201 FCSNTYNHRTGAKNRLLYLTYITLCHQVDPSGRFNRV 235

BLAST of Cla97C02G032100 vs. NCBI nr
Match: KAA0036575.1 (uncharacterized protein E6C27_scaffold191G00850 [Cucumis melo var. makuwa] >TYK22646.1 uncharacterized protein E5676_scaffold195G00840 [Cucumis melo var. makuwa])

HSP 1 Score: 352.8 bits (904), Expect = 2.8e-93
Identity = 174/202 (86.14%), Postives = 180/202 (89.11%), Query Frame = 0

Query: 76  SLSPPVRDLSPRPSAVPVTACQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTERIEP 135
           SL PP  DL  RPS  P     A ANALT+ RITRNLGTRRSSLRRCNSRSPRTTE IEP
Sbjct: 15  SLRPPSGDLRSRPS--PPAIGHARANALTNRRITRNLGTRRSSLRRCNSRSPRTTETIEP 74

Query: 136 PYPWSTNRRAVVQTLNDLKSNQILTITGDVRCRQCQRQYNIEYDTVSKFEEIASFVEENK 195
           PYPWSTNRRA+V+TLNDL+S+QIL ITGDVRCRQCQ +Y IEYD VSKFEEIASFVEENK
Sbjct: 75  PYPWSTNRRAMVRTLNDLRSSQILQITGDVRCRQCQIEYTIEYDMVSKFEEIASFVEENK 134

Query: 196 NLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKY 255
           NLFRDRAPRSWMNPNYPTCRFCGHENGARPVIP EWRKINWLFLLLGEMLG LNLNHLKY
Sbjct: 135 NLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPDEWRKINWLFLLLGEMLGVLNLNHLKY 194

Query: 256 FCSYTNNHRTGAKNRLLYLTYI 278
           FCSYTNNHRTGAKNRLLYLT I
Sbjct: 195 FCSYTNNHRTGAKNRLLYLTKI 214

BLAST of Cla97C02G032100 vs. NCBI nr
Match: XP_022952797.1 (uncharacterized protein LOC111455388 [Cucurbita moschata])

HSP 1 Score: 285.8 bits (730), Expect = 4.2e-73
Identity = 137/201 (68.16%), Postives = 157/201 (78.11%), Query Frame = 0

Query: 89  SAVPVTACQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTERIEPPYPWSTNRRAVVQ 148
           + V   A     + L+S+R   NLG R++SLR   S SP TT  IEPPYPWST+R AVV 
Sbjct: 23  ATVAAAAAAYELHLLSSLRTPNNLGVRQTSLRLRKSNSP-TTGPIEPPYPWSTDRIAVVH 82

Query: 149 TLNDLKSNQILTITGDVRCRQCQRQYNIEYDTVSKFEEIASFVEENKNLFRDRAPRSWMN 208
           TL+ L SNQILTITG+V+C+QC+R Y IEYD VSKF EI SFVE N   FRDRAP+ WM 
Sbjct: 83  TLHYLTSNQILTITGEVKCQQCRRIYEIEYDVVSKFNEIGSFVEHNMESFRDRAPKEWMQ 142

Query: 209 PNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAK 268
           PNYPTCRFCG E G +PVIP EW KINW+FLLLGEM+GAL LNHLKYFCSYT NHRTG+K
Sbjct: 143 PNYPTCRFCGAEKGVKPVIPKEWEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSK 202

Query: 269 NRLLYLTYITLCHQVDPSGRF 290
           +RL+YLTYITLC Q+DPSGRF
Sbjct: 203 DRLVYLTYITLCRQIDPSGRF 222

BLAST of Cla97C02G032100 vs. NCBI nr
Match: KAG6572022.1 (hypothetical protein SDJN03_28750, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 283.9 bits (725), Expect = 1.6e-72
Identity = 136/201 (67.66%), Postives = 155/201 (77.11%), Query Frame = 0

Query: 89  SAVPVTACQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTERIEPPYPWSTNRRAVVQ 148
           S     A     + L+S+R   NLG R++SLRR  S SP TT  IEPPYPWST+R AVVQ
Sbjct: 18  STATSAAAAYELHLLSSLRTPNNLGVRQTSLRRRKSNSP-TTGPIEPPYPWSTDRIAVVQ 77

Query: 149 TLNDLKSNQILTITGDVRCRQCQRQYNIEYDTVSKFEEIASFVEENKNLFRDRAPRSWMN 208
           TL  L SNQILTITG+V+C+QC+R Y +EYD VSKF EI  FVE     FRDRAP+ WM 
Sbjct: 78  TLQYLTSNQILTITGEVKCQQCRRIYEMEYDVVSKFNEIGRFVEHKMESFRDRAPKEWMQ 137

Query: 209 PNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAK 268
           PNYPTCRFCG E G +PVIP EW KINW+FLLLGEM+GAL LNHLKYFCSYT NHRTG+K
Sbjct: 138 PNYPTCRFCGAEKGVKPVIPKEWEKINWVFLLLGEMIGALKLNHLKYFCSYTKNHRTGSK 197

Query: 269 NRLLYLTYITLCHQVDPSGRF 290
           +RL+YLTYITLC Q+DPSGRF
Sbjct: 198 DRLVYLTYITLCRQIDPSGRF 217

BLAST of Cla97C02G032100 vs. ExPASy TrEMBL
Match: A0A1S3BHR1 (uncharacterized protein LOC103489770 OS=Cucumis melo OX=3656 GN=LOC103489770 PE=4 SV=1)

HSP 1 Score: 387.5 bits (994), Expect = 4.9e-104
Identity = 189/217 (87.10%), Postives = 195/217 (89.86%), Query Frame = 0

Query: 76  SLSPPVRDLSPRPSAVPVTACQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTERIEP 135
           SL PP  DL  RPS  P     A ANALT+ RITRNLGTRRSSLRRCNSRSPRTTE IEP
Sbjct: 15  SLRPPSGDLRSRPS--PPAIGHARANALTNRRITRNLGTRRSSLRRCNSRSPRTTETIEP 74

Query: 136 PYPWSTNRRAVVQTLNDLKSNQILTITGDVRCRQCQRQYNIEYDTVSKFEEIASFVEENK 195
           PYPWSTNRRA+V+TLNDL+S+QIL ITGDVRCRQCQ +Y IEYD VSKFEEIASFVEENK
Sbjct: 75  PYPWSTNRRAMVRTLNDLRSSQILQITGDVRCRQCQIEYTIEYDMVSKFEEIASFVEENK 134

Query: 196 NLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKY 255
           NLFRDRAPRSWMNPNYPTCRFCGHENGARPVIP EWRKINWLFLLLGEMLG LNLNHLKY
Sbjct: 135 NLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPDEWRKINWLFLLLGEMLGVLNLNHLKY 194

Query: 256 FCSYTNNHRTGAKNRLLYLTYITLCHQVDPSGRFRRV 293
           FCSYTNNHRTGAKNRLLYLTYITLCHQVDPSGRF RV
Sbjct: 195 FCSYTNNHRTGAKNRLLYLTYITLCHQVDPSGRFNRV 229

BLAST of Cla97C02G032100 vs. ExPASy TrEMBL
Match: A0A0A0K3Q8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G259350 PE=4 SV=1)

HSP 1 Score: 369.0 bits (946), Expect = 1.8e-98
Identity = 179/217 (82.49%), Postives = 192/217 (88.48%), Query Frame = 0

Query: 76  SLSPPVRDLSPRPSAVPVTACQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTERIEP 135
           SL PP   LS +PSA P+    A  NA+T+MR+TR+LGTRRSS +RCNSRSPRTTE IEP
Sbjct: 21  SLRPPSGHLSSQPSAAPIG--HARPNAVTNMRVTRSLGTRRSSHQRCNSRSPRTTETIEP 80

Query: 136 PYPWSTNRRAVVQTLNDLKSNQILTITGDVRCRQCQRQYNIEYDTVSKFEEIASFVEENK 195
           PYPWSTNRRA+V+TLNDLKSNQIL ITGDV+CRQCQ +Y IEYD  SKFEEIASFVEENK
Sbjct: 81  PYPWSTNRRAMVRTLNDLKSNQILQITGDVQCRQCQVEYTIEYDMDSKFEEIASFVEENK 140

Query: 196 NLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKY 255
           N FRDRAP+SWMNPNYPTCRFCGHENGARPVIP +WRKINWLFLLLGEMLG LNLNHLKY
Sbjct: 141 NSFRDRAPQSWMNPNYPTCRFCGHENGARPVIPKQWRKINWLFLLLGEMLGVLNLNHLKY 200

Query: 256 FCSYTNNHRTGAKNRLLYLTYITLCHQVDPSGRFRRV 293
           FCS T NHRTGAKNRLLYLTYITLCHQVDPSGRF RV
Sbjct: 201 FCSNTYNHRTGAKNRLLYLTYITLCHQVDPSGRFNRV 235

BLAST of Cla97C02G032100 vs. ExPASy TrEMBL
Match: A0A5A7T547 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold195G00840 PE=4 SV=1)

HSP 1 Score: 352.8 bits (904), Expect = 1.3e-93
Identity = 174/202 (86.14%), Postives = 180/202 (89.11%), Query Frame = 0

Query: 76  SLSPPVRDLSPRPSAVPVTACQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTERIEP 135
           SL PP  DL  RPS  P     A ANALT+ RITRNLGTRRSSLRRCNSRSPRTTE IEP
Sbjct: 15  SLRPPSGDLRSRPS--PPAIGHARANALTNRRITRNLGTRRSSLRRCNSRSPRTTETIEP 74

Query: 136 PYPWSTNRRAVVQTLNDLKSNQILTITGDVRCRQCQRQYNIEYDTVSKFEEIASFVEENK 195
           PYPWSTNRRA+V+TLNDL+S+QIL ITGDVRCRQCQ +Y IEYD VSKFEEIASFVEENK
Sbjct: 75  PYPWSTNRRAMVRTLNDLRSSQILQITGDVRCRQCQIEYTIEYDMVSKFEEIASFVEENK 134

Query: 196 NLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKY 255
           NLFRDRAPRSWMNPNYPTCRFCGHENGARPVIP EWRKINWLFLLLGEMLG LNLNHLKY
Sbjct: 135 NLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPDEWRKINWLFLLLGEMLGVLNLNHLKY 194

Query: 256 FCSYTNNHRTGAKNRLLYLTYI 278
           FCSYTNNHRTGAKNRLLYLT I
Sbjct: 195 FCSYTNNHRTGAKNRLLYLTKI 214

BLAST of Cla97C02G032100 vs. ExPASy TrEMBL
Match: A0A6J1GLD4 (uncharacterized protein LOC111455388 OS=Cucurbita moschata OX=3662 GN=LOC111455388 PE=4 SV=1)

HSP 1 Score: 285.8 bits (730), Expect = 2.0e-73
Identity = 137/201 (68.16%), Postives = 157/201 (78.11%), Query Frame = 0

Query: 89  SAVPVTACQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTERIEPPYPWSTNRRAVVQ 148
           + V   A     + L+S+R   NLG R++SLR   S SP TT  IEPPYPWST+R AVV 
Sbjct: 23  ATVAAAAAAYELHLLSSLRTPNNLGVRQTSLRLRKSNSP-TTGPIEPPYPWSTDRIAVVH 82

Query: 149 TLNDLKSNQILTITGDVRCRQCQRQYNIEYDTVSKFEEIASFVEENKNLFRDRAPRSWMN 208
           TL+ L SNQILTITG+V+C+QC+R Y IEYD VSKF EI SFVE N   FRDRAP+ WM 
Sbjct: 83  TLHYLTSNQILTITGEVKCQQCRRIYEIEYDVVSKFNEIGSFVEHNMESFRDRAPKEWMQ 142

Query: 209 PNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAK 268
           PNYPTCRFCG E G +PVIP EW KINW+FLLLGEM+GAL LNHLKYFCSYT NHRTG+K
Sbjct: 143 PNYPTCRFCGAEKGVKPVIPKEWEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSK 202

Query: 269 NRLLYLTYITLCHQVDPSGRF 290
           +RL+YLTYITLC Q+DPSGRF
Sbjct: 203 DRLVYLTYITLCRQIDPSGRF 222

BLAST of Cla97C02G032100 vs. ExPASy TrEMBL
Match: A0A6J1I5V9 (uncharacterized protein LOC111470968 OS=Cucurbita maxima OX=3661 GN=LOC111470968 PE=4 SV=1)

HSP 1 Score: 283.1 bits (723), Expect = 1.3e-72
Identity = 136/204 (66.67%), Postives = 157/204 (76.96%), Query Frame = 0

Query: 89  SAVPVTACQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTERIEPPYPWSTNRRAVVQ 148
           + V   A     + L+S+R    LG R++SLRR    SP TT  IEPPYPWST+R AVV 
Sbjct: 14  ATVAAAAAAYELHLLSSLRTPNILGVRQTSLRRRKCNSP-TTGPIEPPYPWSTDRIAVVH 73

Query: 149 TLNDLKSNQILTITGDVRCRQCQRQYNIEYDTVSKFEEIASFVEENKNLFRDRAPRSWMN 208
           TL+ L  NQILTITGDV+C+QC+R Y IEY+ VSKF EI SFVE N   FRDRAP+ WM 
Sbjct: 74  TLHYLTLNQILTITGDVKCQQCRRIYEIEYNVVSKFNEIGSFVEHNMESFRDRAPKKWMQ 133

Query: 209 PNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAK 268
           PNYPTCRFCG E G +PVIP EW KINW+FLLLGEM+GAL LNHLKYFCSYT NHRTG+K
Sbjct: 134 PNYPTCRFCGAEKGVKPVIPKEWEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSK 193

Query: 269 NRLLYLTYITLCHQVDPSGRFRRV 293
           +RL+YLTYITLC Q+DPSGRF R+
Sbjct: 194 DRLVYLTYITLCRQIDPSGRFSRI 216

BLAST of Cla97C02G032100 vs. TAIR 10
Match: AT1G49330.1 (hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 174.1 bits (440), Expect = 1.7e-43
Identity = 115/314 (36.62%), Postives = 156/314 (49.68%), Query Frame = 0

Query: 2   ENITNQSNEPHGDPD-LQLSL-----------RP-----PAGDPS--PQPFSLWSSVGD- 61
           + +TNQ+++   D + L LSL           RP     P   P   P P + W +  D 
Sbjct: 28  KTMTNQTHDDDDDDEQLPLSLTLGSTSYSSQIRPVKSPVPIAPPPEFPGPVTTWPTPADF 87

Query: 62  -------PSPQPFSLRSSVRDLLSQPFSLRPPVRDLSPH---PFSLSPPVRDLSPRPSAV 121
                  P P P S    +   +S  F   P    L  H   P  L+PP  +L+P P   
Sbjct: 88  LATRSMVPDPPPPS--HQIPLWMSNYFQQTPNPPQLVTHFFPPSGLAPPSSNLTPPPVKR 147

Query: 122 PVTACQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTERIEPPYPWSTNRRAVVQTLN 181
           PVT          S+RI R+            S   + ++ I PP+PW+TNRR  +Q+L 
Sbjct: 148 PVTG---------SVRIYRS-----------RSTVSKKSDTISPPFPWATNRRGEIQSLE 207

Query: 182 DLKSNQILTITGDVRCRQCQRQYNIEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNY 241
            L+SNQI TITG+V+CR C++ Y + Y+   +F E+  F    K   RDRA + W  P  
Sbjct: 208 YLESNQITTITGEVQCRHCEKVYQVSYNLRERFAEVVKFYLTEKRKMRDRAHKDWAYPEQ 267

Query: 242 PTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRL 286
             C  CG E   +PVI     +INWLFLLLG+ LG   L  LK FC ++ NHRTGAK+R+
Sbjct: 268 RRCELCGREKAVKPVIAERKSQINWLFLLLGQTLGFCTLEQLKNFCKHSKNHRTGAKDRV 319

BLAST of Cla97C02G032100 vs. TAIR 10
Match: AT2G16190.1 (BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G49330.1); Has 77 Blast hits to 77 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 6; Fungi - 13; Plants - 56; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 152.1 bits (383), Expect = 6.7e-37
Identity = 99/289 (34.26%), Postives = 139/289 (48.10%), Query Frame = 0

Query: 7   QSNEPHGDPDLQLSLRPPAGDPSPQPFSLWSSVGDPSPQPFSLRSSVRDLLSQPFSLRPP 66
           Q  E  G+  +QL    P  +  P P           PQP  + S            +  
Sbjct: 30  QRQEEQGEV-MQLLTSDPPQNTQPSP-----------PQPNDMTSFANGTNHVIVPTQAL 89

Query: 67  VRDLSPHPFSLSPPVRDLSPRPSAVPVTACQANANALTSMRITRNLGTRRSSLRRCNSRS 126
            + + P   S+  P   L  +PS   +   Q N  A  ++   R         RR + R 
Sbjct: 90  EQAVPPPNVSVRTP---LPYQPSEEVLPPPQLNQVATVALATPRRGRPPGGQARRNSKRP 149

Query: 127 PRTTER------IEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVRCRQCQRQYNIEYDT 186
               ER      I PPYPW+T +   +Q+  DL SN I  I+G V C+ C R   +EY+ 
Sbjct: 150 VAGVERNVGDREIVPPYPWATKKPGKIQSFRDLSSNNINVISGQVHCKTCDRTDTVEYNL 209

Query: 187 VSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLL 246
             KF E+  +++ NK   R RAP SW  P    CR C  E   +PV+     +INWLFLL
Sbjct: 210 EEKFSELYGYIKVNKEEMRHRAPGSWSTPKLIPCRTCKSE--MKPVMSERKEEINWLFLL 269

Query: 247 LGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLTYITLCHQVDPSGRF 290
           LG+MLG   L+ L+YFC   + HRTG+K+R++Y+TY++LC Q+DP G F
Sbjct: 270 LGQMLGCCTLDQLRYFCQLNSKHRTGSKDRVVYITYLSLCKQLDPEGPF 301

BLAST of Cla97C02G032100 vs. TAIR 10
Match: AT2G16190.2 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G49330.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 102.4 bits (254), Expect = 6.1e-22
Identity = 80/253 (31.62%), Postives = 110/253 (43.48%), Query Frame = 0

Query: 7   QSNEPHGDPDLQLSLRPPAGDPSPQPFSLWSSVGDPSPQPFSLRSSVRDLLSQPFSLRPP 66
           Q  E  G+  +QL    P  +  P P           PQP  + S            +  
Sbjct: 30  QRQEEQGEV-MQLLTSDPPQNTQPSP-----------PQPNDMTSFANGTNHVIVPTQAL 89

Query: 67  VRDLSPHPFSLSPPVRDLSPRPSAVPVTACQANANALTSMRITRNLGTRRSSLRRCNSRS 126
            + + P   S+  P   L  +PS   +   Q N  A  ++   R         RR + R 
Sbjct: 90  EQAVPPPNVSVRTP---LPYQPSEEVLPPPQLNQVATVALATPRRGRPPGGQARRNSKRP 149

Query: 127 PRTTER------IEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVRCRQCQRQYNIEYDT 186
               ER      I PPYPW+T +   +Q+  DL SN I  I+G V C+ C R   +EY+ 
Sbjct: 150 VAGVERNVGDREIVPPYPWATKKPGKIQSFRDLSSNNINVISGQVHCKTCDRTDTVEYNL 209

Query: 187 VSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLL 246
             KF E+  +++ NK   R RAP SW  P    CR C  E   +PV+     +INWLFLL
Sbjct: 210 EEKFSELYGYIKVNKEEMRHRAPGSWSTPKLIPCRTCKSE--MKPVMSERKEEINWLFLL 265

Query: 247 LGEMLGALNLNHL 254
           LG+MLG   L+ L
Sbjct: 270 LGQMLGCCTLDQL 265

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_008447299.11.0e-10387.10PREDICTED: uncharacterized protein LOC103489770 [Cucumis melo][more]
XP_011659748.13.8e-9882.49uncharacterized protein LOC105436256 [Cucumis sativus] >KGN44335.1 hypothetical ... [more]
KAA0036575.12.8e-9386.14uncharacterized protein E6C27_scaffold191G00850 [Cucumis melo var. makuwa] >TYK2... [more]
XP_022952797.14.2e-7368.16uncharacterized protein LOC111455388 [Cucurbita moschata][more]
KAG6572022.11.6e-7267.66hypothetical protein SDJN03_28750, partial [Cucurbita argyrosperma subsp. sorori... [more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A1S3BHR14.9e-10487.10uncharacterized protein LOC103489770 OS=Cucumis melo OX=3656 GN=LOC103489770 PE=... [more]
A0A0A0K3Q81.8e-9882.49Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G259350 PE=4 SV=1[more]
A0A5A7T5471.3e-9386.14Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
A0A6J1GLD42.0e-7368.16uncharacterized protein LOC111455388 OS=Cucurbita moschata OX=3662 GN=LOC1114553... [more]
A0A6J1I5V91.3e-7266.67uncharacterized protein LOC111470968 OS=Cucurbita maxima OX=3661 GN=LOC111470968... [more]
Match NameE-valueIdentityDescription
AT1G49330.11.7e-4336.62hydroxyproline-rich glycoprotein family protein [more]
AT2G16190.16.7e-3734.26BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein fam... [more]
AT2G16190.26.1e-2231.62FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow... [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 115..137
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 63..89
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 31..47
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..47
NoneNo IPR availablePANTHERPTHR34272EXPRESSED PROTEINcoord: 42..290

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C02G032100.1Cla97C02G032100.1mRNA