Cla97C07G134180 (gene) Watermelon (97103) v2

Name: Cla97C07G134180
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: proline-rich family protein
Location: Cla97Chr07 : 9798621 .. 9799004 (-)
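
As a quick consistency check, the location above can be converted into a feature length. The sketch below assumes the coordinates are 1-based and inclusive, which is a common convention but is not stated on this page; `parse_location` is a hypothetical helper written for this illustration, not part of any database API.

```python
import re


def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into (seqid, start, end, strand)."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


seqid, start, end, strand = parse_location("Cla97Chr07 : 9798621 .. 9799004 (-)")

# Assumption: 1-based, inclusive coordinates, so both end points are counted.
length = end - start + 1
print(seqid, strand, length, length // 3)
```

Under that assumption the feature spans 384 bases, that is, 128 codons including the stop codon.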
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTGCATCCAACCCCAGCCTTGCTATGCTCTCCTCACCCTCAGCTTGTTCTTGTTCTTCATTCTTTCAAGCAATGTACAACCCATCCATTGCTTGACCTCATCCAAGAAGCTTGACGAATCCGCCGGTGGGAGTGATCCGAGCGTCAAGTGTACGCCGTGCACCAATTATCCGCCACCACCACCGCCACCGCCGAAGAAACCCCCACCAGCTTATTGCCCTCCGCCACCTCCTCCTCCGTCATCTTTCATATACATGCTCGGCCCGCCAGGAAACTTGTATCCCATTGACCAAGATTTCGCCGGTGCTAATCGGAGGACGATGGCCGTGGAGTGGACGGTGGTTGCTCTCTTTGGACTAATTGGGTTTATTGGTTTGTGGTGA

mRNA sequence

ATGTGCATCCAACCCCAGCCTTGCTATGCTCTCCTCACCCTCAGCTTGTTCTTGTTCTTCATTCTTTCAAGCAATGTACAACCCATCCATTGCTTGACCTCATCCAAGAAGCTTGACGAATCCGCCGGTGGGAGTGATCCGAGCGTCAAGTGTACGCCGTGCACCAATTATCCGCCACCACCACCGCCACCGCCGAAGAAACCCCCACCAGCTTATTGCCCTCCGCCACCTCCTCCTCCGTCATCTTTCATATACATGCTCGGCCCGCCAGGAAACTTGTATCCCATTGACCAAGATTTCGCCGGTGCTAATCGGAGGACGATGGCCGTGGAGTGGACGGTGGTTGCTCTCTTTGGACTAATTGGGTTTATTGGTTTGTGGTGA

Coding sequence (CDS)

ATGTGCATCCAACCCCAGCCTTGCTATGCTCTCCTCACCCTCAGCTTGTTCTTGTTCTTCATTCTTTCAAGCAATGTACAACCCATCCATTGCTTGACCTCATCCAAGAAGCTTGACGAATCCGCCGGTGGGAGTGATCCGAGCGTCAAGTGTACGCCGTGCACCAATTATCCGCCACCACCACCGCCACCGCCGAAGAAACCCCCACCAGCTTATTGCCCTCCGCCACCTCCTCCTCCGTCATCTTTCATATACATGCTCGGCCCGCCAGGAAACTTGTATCCCATTGACCAAGATTTCGCCGGTGCTAATCGGAGGACGATGGCCGTGGAGTGGACGGTGGTTGCTCTCTTTGGACTAATTGGGTTTATTGGTTTGTGGTGA

Protein sequence

MCIQPQPCYALLTLSLFLFFILSSNVQPIHCLTSSKKLDESAGGSDPSVKCTPCTNYPPPPPPPPKKPPPAYCPPPPPPPSSFIYMLGPPGNLYPIDQDFAGANRRTMAVEWTVVALFGLIGFIGLW
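
The protein sequence follows from the CDS by reading it codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1); `translate` is a hypothetical helper, and only the first 30 bases of the CDS listed above are used as input to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in TCAG order: TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the CDS shown above.
print(translate("ATGTGCATCCAACCCCAGCCTTGCTATGCT"))  # MCIQPQPCYA
```

Passing the full CDS string to `translate` should give the protein sequence listed above, without the terminal stop.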
BLAST of Cla97C07G134180 vs. NCBI nr
Match: XP_023546806.1 (formin-like protein 20 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 128.3 bits (321), Expect = 1.9e-26
Identity = 96/126 (76.19%), Positives = 106/126 (84.13%), Query Frame = 0

Query: 1   MCIQPQPCYALLTLSLFLFFILSSNVQPIHCLTSSKKLDESAGGSDPSVKCTPCTNYXXX 60
           MCIQ +P YAL   S F+FF LS NV   H   SSKKLDES GG DPSVKCTPCT+ XXX
Sbjct: 1   MCIQTRPRYALFIFSFFIFF-LSINVVLTHGFLSSKKLDESTGGDDPSVKCTPCTHXXXX 60

Query: 61  XXXXXXXXXXXXXXXXXXXXXXXXYMLGPPGNLYPIDQDFAGANRRTMAVEWTVVALFGL 120
           XXXXXXXXXXXXXXXXXXXXXXXXY+LGPPGNLYPID+DFAGA+RR +AVE + VALFGL
Sbjct: 61  XXXXXXXXXXXXXXXXXXXXXXXXYILGPPGNLYPIDRDFAGADRRRVAVELSAVALFGL 120

Query: 121 IGFIGL 127
           IGF+G+
Sbjct: 121 IGFLGV 125

BLAST of Cla97C07G134180 vs. NCBI nr
Match: KGN50831.1 (hypothetical protein Csa_5G283490 [Cucumis sativus])

HSP 1 Score: 116.7 bits (291), Expect = 5.8e-23
Identity = 91/126 (72.22%), Positives = 98/126 (77.78%), Query Frame = 0

Query: 1   MCIQPQPCYALLTLSLFLFFILSSNVQPIHCLTSSKKLDESAGGSDPSVKCTPCTNYXXX 60
           MCIQP P Y+LL L  FL  ILS N+QPIH   SSKKLDE     D SVKCTPCT Y XX
Sbjct: 1   MCIQPHPFYSLLILHFFL--ILSINLQPIHGFISSKKLDEPIPRHDSSVKCTPCTRYSXX 60

Query: 61  XXXXXXXXXXXXXXXXXXXXXXXXYMLGPPGNLYPIDQDFAGANRRTMAVEWTVVALFGL 120
           XXXXXXXXXXXXXXXXXXXXXXX YMLGPP NLYPI+ DFA A+RR++A+E  VVA FGL
Sbjct: 61  XXXXXXXXXXXXXXXXXXXXXXXIYMLGPPVNLYPIEHDFASADRRSVAMELPVVAFFGL 120

Query: 121 IGFIGL 127
           IG I L
Sbjct: 121 IGLIAL 124

BLAST of Cla97C07G134180 vs. NCBI nr
Match: XP_022155381.1 (acrosin-like [Momordica charantia])

HSP 1 Score: 94.4 bits (233), Expect = 3.1e-16
Identity = 87/117 (74.36%), Positives = 91/117 (77.78%), Query Frame = 0

Query: 13  TLSLFLFFILSSNVQPIHCLTSSKKLDESAGGSDPSVKCTPCTNY--XXXXXXXXXXXXX 72
           T  L L  ILS NV  IH L SS KLD S GG D +VKCTPCT Y  XXXXXXXXXXXXX
Sbjct: 5   TQILNLLLILSINVVTIHGLISSNKLDGSTGG-DSTVKCTPCTRYNXXXXXXXXXXXXXX 64

Query: 73  XXXXXXXXXXXXXXYMLGPPGNLYPIDQDFAGANRRTMAVEWTVVALFGLIGFIGLW 128
           XXXXXXXXXXXXXXYMLGPPGNLYPI QDFAG  RR +AVE  VVAL GL+GFI +W
Sbjct: 65  XXXXXXXXXXXXXXYMLGPPGNLYPIGQDFAGGRRR-VAVELPVVALLGLMGFIAVW 119

BLAST of Cla97C07G134180 vs. NCBI nr
Match: XP_022942055.1 (probable glycosyltransferase 4 [Cucurbita moschata])

HSP 1 Score: 72.4 bits (176), Expect = 1.2e-09
Identity = 32/42 (76.19%), Positives = 39/42 (92.86%), Query Frame = 0

Query: 85  YMLGPPGNLYPIDQDFAGANRRTMAVEWTVVALFGLIGFIGL 127
           Y+LGPPGNLYPID+DFAGA+RR +AVE + VALFGLIGF+G+
Sbjct: 84  YILGPPGNLYPIDRDFAGADRRRVAVELSAVALFGLIGFLGV 125

BLAST of Cla97C07G134180 vs. NCBI nr
Match: XP_021593887.1 (leucine-rich repeat extensin-like protein 6 [Manihot esculenta] >OAY29220.1 hypothetical protein MANES_15G127400 [Manihot esculenta])

HSP 1 Score: 56.2 bits (134), Expect = 9.2e-05
Identity = 66/124 (53.23%), Positives = 79/124 (63.71%), Query Frame = 0

Query: 11  LLTLSLFLFFILSSNVQPIHCLTSSKKLDESAGGSDPSVKCTPCT----------NYXXX 70
           L +L L  FF+L +   PI  L  S+KLDE+        KCTPCT            XXX
Sbjct: 4   LESLMLLNFFLLVAAAPPIQAL-DSRKLDENTAPGSTDQKCTPCTXXXXXXXXXXXXXXX 63

Query: 71  XXXXXXXXXXXXXXXXXXXXXXXXYMLGPPGNLYPIDQDFAGANRRTMAVEWTVVALFGL 125
           XXXXXXXXXXXXXXXXXXXXXXX Y+ GPPGNLYP+D DF+GA R T+A    ++   GL
Sbjct: 64  XXXXXXXXXXXXXXXXXXXXXXXIYISGPPGNLYPVDNDFSGAGRTTVAGLPALIGC-GL 123

BLAST of Cla97C07G134180 vs. TrEMBL
Match: tr|A0A0A0KSV7|A0A0A0KSV7_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G283490 PE=4 SV=1)

HSP 1 Score: 116.7 bits (291), Expect = 3.8e-23
Identity = 91/126 (72.22%), Positives = 98/126 (77.78%), Query Frame = 0

Query: 1   MCIQPQPCYALLTLSLFLFFILSSNVQPIHCLTSSKKLDESAGGSDPSVKCTPCTNYXXX 60
           MCIQP P Y+LL L  FL  ILS N+QPIH   SSKKLDE     D SVKCTPCT Y XX
Sbjct: 1   MCIQPHPFYSLLILHFFL--ILSINLQPIHGFISSKKLDEPIPRHDSSVKCTPCTRYSXX 60

Query: 61  XXXXXXXXXXXXXXXXXXXXXXXXYMLGPPGNLYPIDQDFAGANRRTMAVEWTVVALFGL 120
           XXXXXXXXXXXXXXXXXXXXXXX YMLGPP NLYPI+ DFA A+RR++A+E  VVA FGL
Sbjct: 61  XXXXXXXXXXXXXXXXXXXXXXXIYMLGPPVNLYPIEHDFASADRRSVAMELPVVAFFGL 120

Query: 121 IGFIGL 127
           IG I L
Sbjct: 121 IGLIAL 124

BLAST of Cla97C07G134180 vs. TrEMBL
Match: tr|A0A2C9UFM0|A0A2C9UFM0_MANES (Uncharacterized protein OS=Manihot esculenta OX=3983 GN=MANES_15G127400 PE=4 SV=1)

HSP 1 Score: 56.2 bits (134), Expect = 6.1e-05
Identity = 66/124 (53.23%), Positives = 79/124 (63.71%), Query Frame = 0

Query: 11  LLTLSLFLFFILSSNVQPIHCLTSSKKLDESAGGSDPSVKCTPCT----------NYXXX 70
           L +L L  FF+L +   PI  L  S+KLDE+        KCTPCT            XXX
Sbjct: 4   LESLMLLNFFLLVAAAPPIQAL-DSRKLDENTAPGSTDQKCTPCTXXXXXXXXXXXXXXX 63

Query: 71  XXXXXXXXXXXXXXXXXXXXXXXXYMLGPPGNLYPIDQDFAGANRRTMAVEWTVVALFGL 125
           XXXXXXXXXXXXXXXXXXXXXXX Y+ GPPGNLYP+D DF+GA R T+A    ++   GL
Sbjct: 64  XXXXXXXXXXXXXXXXXXXXXXXIYISGPPGNLYPVDNDFSGAGRTTVAGLPALIGC-GL 123

BLAST of Cla97C07G134180 vs. TAIR10
Match: AT1G23040.1 (hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 42.7 bits (99), Expect = 1.9e-04
Identity = 18/41 (43.90%), Positives = 28/41 (68.29%), Query Frame = 0

Query: 85  YMLGPPGNLYPIDQDF-AGANRRTMAVEWTVVALFGLIGFI 125
           Y+ GPPGNLYP+D+ F A A +  M V+ + +  FG++ F+
Sbjct: 102 YITGPPGNLYPVDEQFGAAAGKSFMVVKLSGLIAFGIMAFL 142

BLAST of Cla97C07G134180 vs. TAIR10
Match: AT1G70990.1 (proline-rich family protein)

HSP 1 Score: 42.0 bits (97), Expect = 3.3e-04
Identity = 21/39 (53.85%), Positives = 24/39 (61.54%), Query Frame = 0

Query: 85  YMLGPPGNLYPIDQDFAGANRRTMAVEWTVVALFGLIGF 124
           YM GPPG LYPIDQ F  A    +   +TVV + GLI F
Sbjct: 132 YMTGPPGELYPIDQQFGAA----VTKSFTVVKISGLIAF 166
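
The percentages on the statistics lines above are the ratio of matching positions to alignment length. The sketch below recomputes them from the `a/b` fractions on one such line; `fractions` is a hypothetical helper, and the input is the identity fraction of the first hit (XP_023546806.1).

```python
import re


def fractions(line):
    """Return (matches, length, percent) for each 'a/b' fraction on a line."""
    results = []
    for a, b in re.findall(r"(\d+)/(\d+)", line):
        matches, length = int(a), int(b)
        results.append((matches, length, round(100 * matches / length, 2)))
    return results


# Identity fraction reported for the first hit above.
for matches, length, percent in fractions("Identity = 96/126 (76.19%)"):
    print(f"{matches}/{length} = {percent:.2f}%")
```

The same function applies to the similarity fraction on each line, for example 106/126 for the first hit.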

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value  Identity (%)  Description
XP_023546806.1  1.9e-26  76.19         formin-like protein 20 [Cucurbita pepo subsp. pepo]
KGN50831.1      5.8e-23  72.22         hypothetical protein Csa_5G283490 [Cucumis sativus]
XP_022155381.1  3.1e-16  74.36         acrosin-like [Momordica charantia]
XP_022942055.1  1.2e-09  76.19         probable glycosyltransferase 4 [Cucurbita moschata]
XP_021593887.1  9.2e-05  53.23         leucine-rich repeat extensin-like protein 6 [Manihot esculenta] >OAY29220.1 hypo...

TrEMBL
Match Name                      E-value  Identity (%)  Description
tr|A0A0A0KSV7|A0A0A0KSV7_CUCSA  3.8e-23  72.22         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G283490 PE=4 SV=1
tr|A0A2C9UFM0|A0A2C9UFM0_MANES  6.1e-05  53.23         Uncharacterized protein OS=Manihot esculenta OX=3983 GN=MANES_15G127400 PE=4 SV=...

TAIR10
Match Name   E-value  Identity (%)  Description
AT1G23040.1  1.9e-04  43.90         hydroxyproline-rich glycoprotein family protein
AT1G70990.1  3.3e-04  53.85         proline-rich family protein

GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C07G134180.1  Cla97C07G134180.1  mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term  IPR Description   Source       Source Term    Source Description   Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 37..74
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 53..74
None      No IPR available  PANTHER      PTHR35094:SF1  SUBFAMILY NOT NAMED  coord: 7..125
None      No IPR available  PANTHER      PTHR35094      FAMILY NOT NAMED     coord: 7..125

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene             Organism                      Block
Cla97C07G134180  Silver-seed gourd             carwmbB0183
Cla97C07G134180  Cucumber (Gy14) v2            cgybwmbB376
Cla97C07G134180  Cucurbita maxima (Rimu)       cmawmbB779
Cla97C07G134180  Cucurbita maxima (Rimu)       cmawmbB780
Cla97C07G134180  Cucurbita maxima (Rimu)       cmawmbB781
Cla97C07G134180  Cucurbita moschata (Rifu)     cmowmbB753
Cla97C07G134180  Cucurbita moschata (Rifu)     cmowmbB755
Cla97C07G134180  Melon (DHL92) v3.5.1          mewmbB032
Cla97C07G134180  Melon (DHL92) v3.5.1          mewmbB067
Cla97C07G134180  Watermelon (Charleston Gray)  wcgwmbB120
Cla97C07G134180  Watermelon (97103) v1         wmwmbB278
Cla97C07G134180  Watermelon (97103) v1         wmwmbB180
Cla97C07G134180  Watermelon (97103) v1         wmwmbB181
Cla97C07G134180  Wax gourd                     wgowmbB105