Cla97C04G069005 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C04G069005
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Gag-pol polyprotein
Location: Cla97Chr04: 2505402 .. 2508231 (+)
RNA-Seq Expression: Cla97C04G069005
Synteny: Cla97C04G069005
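
The Location field packs chromosome, start, end, and strand into one string. A minimal sketch of splitting it apart follows; the regular expression and the `parse_location` helper are illustrative assumptions, not a documented format or API, and the span calculation assumes 1-based inclusive coordinates.

```python
import re

# Location string copied from the Overview. The pattern below is an
# assumption about its layout, not a documented specification.
LOCATION = "Cla97Chr04: 2505402 .. 2508231 (+)"


def parse_location(text):
    """Split 'chrom: start .. end (strand)' into its four parts."""
    pattern = r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)"
    match = re.fullmatch(pattern, text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


chrom, start, end, strand = parse_location(LOCATION)
# Assuming 1-based inclusive coordinates, the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, strand, span)
```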
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGATTTCGTTGGAGAACTCTGTTATAGCCATAGTTGATTCAAACGAACAAGAAGAAGCTCAGTCGTTCAATGAAAGCTATGCATCAAGGTGAGCTAGTAGTGTCATCTATTCTTCAAGAGAAGAAGTTGGACACCAAATGGAAACTAGAATCATTTTTGTAGAAGAGAATGCATACTGATTTTCATCTTTCTTTTCTCACTCGAGATAAGTGATCAATAATTCCCTAGCTTTCTTTTTCTTGTTTGTTGAATGAGGATTGGTGGAATTGTTAATTTTAAATAGATGGTGAATAAATTGAGAGAAAATATCTATTAGTGATTAGACTCTCAACTTATTTTATTGTGAATTATTTGTGGTAGTTCCGTTAGATTTTTTTTTTTTTTTTTTTTGTTGAAATTAGAGGGATGAGAGACCCAAAGCCAATTTCTATTGTTGCATAGATACAGTGTATGAAAGTGGCAGAGAGAAACAAGGAAAATTATTCAAAATGACCTAAAAAGCAAAAAAAATTCTATGATTGACCTCTTTCTAAAAATTTGTTTGCTATTGACCCTTTTGGCATGTGAAATGCCACTTTTACCCTCAATATTAAATTCTCCACCTTCTTCTCTATTTCCTTCGTTTCTTTTTCTTTCCCCTTTCAACAACCAAAAAATAAAATTTCAATGTCATTTAAAAGTAAGCAAGTTTTTAGAAAGAGGAGAGGTGGAGTATTTAATATTAAGGGTAAAAGTGATATTTCACATGCAAAAATAGTCAATAGGAAACAAGTTTTTAGAAAGAGGCAATCATAAATTTTTTTTTGGTTTTTGGGTCATTTGAATAATTTTCCAAGAAACAGATTGGTGGTAAGTGAGAAACAAATTATGTTCCATCTGGTCTCGAATTTTGCAATTACAGTGTATGAAGGTGAGTAGGTTATGGTTATATATAATCTCAGTTATAATGAATGTAGGTGGGTGGGTCATTGTTTTGCTTACTTTAAGCTACTTTGTTACTGCATTGTTGGCTACAGTTATTTTCTTTTATCTAGTTGTTGCTGATGTTGTTACCTTATTGCGTGGCTTATTTTTTGGAATATATTTTTCCTTGTTTGGTGATTTTGCAGTTTTGAAGGTGATTTCCTTCATTTTTTTTGTTGGTGGTTTCCTTGTGTGAAACATATTTTTATATCATTTGTTCATGTCATCTAGGTTAAGTTGTTTGTTGTAGTTGCTGCAGCTACAGAAGTGGCTACTGAATAGAATGTTGAAAGTTGCGACTGTCTTGCTGATGATATAGATTTGGATTGGTCTCGTTTTGCAGAAAGTGTTTTTGGTGTTACAGATATCTGTATCTTATGGTAGAATATCTTTCATTGGACGTGGCTCAATAGTGGAATGAAAAGATAGAAGAGTGGCAGTGTAATAACATCAGATTTAGGGGGAGCCTAATCTGGGAACTCTTCTCTAAGGAGTTCAAAATTTAGGGGGAGCCTAAATCTCAAAAACTATCTAGTGAAACTCTTTTCATTCAGGCTTACTGCCTCCCAGACGTAGATGTGGTTACACCGAACTGGGTTACCAATTCTGTGTGTTCTTTTATTTCTTCTCTCTATCTCACTTGTTGTTATCATTGTTTGTAACATCTTTCCATACTTATTGGACTAGTTGTGATCTGCCTAACTTTCATTTGGTATCAGAGCTTGGTTGGCCTTTTATACATTGAGCTATTTAATGGAGCCTATCCGAGAGGGTGGATCGACAACTTGTCCTCCTGTACTTGATGGGTCGAATTATTCTTATTGGAAAGCAAGAATGATAGCTTTCTTGAAATCTATTGATAATAAGACGTGGAAGGCAGTCGCAAATGGATGGGGTCCTCCCGTAGTTACTGACACTAAAGGGAAGGTGAGGATAAAGATGAGGCTTCCTTAGTGAACTATTGTGCTCTAAATGCAATCTTTAATGGCTTTGACAATAACGTTTTTCGCCTCATTAATACCTATGTCTCTGCCAA
AAAAGCATGAGACATCCTTTCGATTGCACATGAAGGAACTTCCAAATTAAAAATGTCCAGATTGCAGCTTTTTTATCACAAAATTTGAATTGCTAAAAATGTCAGAAGATGAGGCCACAACTGAATTCAATGTTTGCCTATTAGATATTGCAAATGAATCATTTGCACTTGGTGAGAAGATATCGGAAGAAAAACTAGTACGAAAAGCGCTTCGCTCCCTTTCAAAGAGATTTGACATGAAAGTCACAACTATTGAGGAGGCTCATGACATTGCAACAATGAAAGTTGATGAGCTTTTTGGTTTATTGAGAACTTTCAAAATGATGTTTGATGACAATCCTAATAAAAAATCCAAAAATATTGCCTTACAATTGGCAGTTGAGAATGATGTTGCGGCTGTTAAAAACAAGGAATCCGATGAGAAGTTGGCTCAATCAATTTCCTTGCTTGCCAAGCAGTTCGGAAAGGCTCTCAGAAGATGGGATAAACGCGAGGTGTCTCAAAGTGGCAATGTGTTTTCTAATGTTGAAGACAATTTTAGTCCGGCTAGGCGCTCAACGCTGAAATCCTCTCAATCATCTAATCGAAAGACTGATCATGGAAAAGGTTATGGATTTAGTCAAGATTTAAGAGACAAAAAGTTTAGATGTAGAAAGTGTGAGGGATACAGACATTATCAAGCTGAATGTCCTAATTTTCTAAAGAGTAAGAACAAAAGTTATTTTGCTACTCTATCAGACAATGATGATGAGGATGATGTCTCAAATAGTGATTTCGATGAAGAAATTCACGCCTTAATGGGCTGTTTATCTCAAAGCAGTGTCAAGTGA

mRNA sequence

ATGATTTCGTTGGAGAACTCTGTTATAGCCATAGTTGATTCAAACGAACAAGAAGAAGCTCAGTCGTTCAATGAAAGCTATGCATCAAGCTACTTTGTTACTGCATTGTTGGCTACAGTTATTTTCTTTTATCTAGTTGTTGCTGATGTTGTTACCTTATTGCGTGGCTTATTTTTTGGAATATATTTTTCCTTGTTTGGTGATTTTGCAGTTTTGAAGTTCGGAAAGGCTCTCAGAAGATGGGATAAACGCGAGGTGTCTCAAAGTGGCAATGTGTTTTCTAATGTTGAAGACAATTTTAGTCCGGCTAGGCGCTCAACGCTGAAATCCTCTCAATCATCTAATCGAAAGACTGATCATGGAAAAGGTTATGGATTTAGTCAAGATTTAAGAGACAAAAAGTTTAGATGTAGAAAGTGTGAGGGATACAGACATTATCAAGCTGAATGTCCTAATTTTCTAAAGAGTAAGAACAAAAGTTATTTTGCTACTCTATCAGACAATGATGATGAGGATGATGTCTCAAATAGTGATTTCGATGAAGAAATTCACGCCTTAATGGGCTGTTTATCTCAAAGCAGTGTCAAGTGA

Coding sequence (CDS)

ATGATTTCGTTGGAGAACTCTGTTATAGCCATAGTTGATTCAAACGAACAAGAAGAAGCTCAGTCGTTCAATGAAAGCTATGCATCAAGCTACTTTGTTACTGCATTGTTGGCTACAGTTATTTTCTTTTATCTAGTTGTTGCTGATGTTGTTACCTTATTGCGTGGCTTATTTTTTGGAATATATTTTTCCTTGTTTGGTGATTTTGCAGTTTTGAAGTTCGGAAAGGCTCTCAGAAGATGGGATAAACGCGAGGTGTCTCAAAGTGGCAATGTGTTTTCTAATGTTGAAGACAATTTTAGTCCGGCTAGGCGCTCAACGCTGAAATCCTCTCAATCATCTAATCGAAAGACTGATCATGGAAAAGGTTATGGATTTAGTCAAGATTTAAGAGACAAAAAGTTTAGATGTAGAAAGTGTGAGGGATACAGACATTATCAAGCTGAATGTCCTAATTTTCTAAAGAGTAAGAACAAAAGTTATTTTGCTACTCTATCAGACAATGATGATGAGGATGATGTCTCAAATAGTGATTTCGATGAAGAAATTCACGCCTTAATGGGCTGTTTATCTCAAAGCAGTGTCAAGTGA

Protein sequence

MISLENSVIAIVDSNEQEEAQSFNESYASSYFVTALLATVIFFYLVVADVVTLLRGLFFGIYFSLFGDFAVLKFGKALRRWDKREVSQSGNVFSNVEDNFSPARRSTLKSSQSSNRKTDHGKGYGFSQDLRDKKFRCRKCEGYRHYQAECPNFLKSKNKSYFATLSDNDDEDDVSNSDFDEEIHALMGCLSQSSVK
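
The protein sequence above should correspond to a translation of the CDS. A minimal sketch of that check, assuming the standard genetic code (the page does not state which translation table was used), is shown here on the first 30 nucleotides of the CDS; `translate` is a hypothetical helper name.

```python
from itertools import product

# Standard genetic code (assumed), laid out in the conventional
# T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
print(translate("ATGATTTCGTTGGAGAACTCTGTTATAGCC"))  # MISLENSVIA
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence would extend the same check to the whole record.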
Homology
BLAST of Cla97C04G069005 vs. NCBI nr
Match: KAA0037814.1 (gag-pol polyprotein [Cucumis melo var. makuwa] >TYK26047.1 gag-pol polyprotein [Cucumis melo var. makuwa])

HSP 1 Score: 75.5 bits (184), Expect = 5.8e-10
Identity = 37/84 (44.05%), Positives = 55/84 (65.48%), Query Frame = 0

Query: 113 SSNRKTDHGKGYGFSQDLRDKKFRCRKCEGYRHYQAECPNFLKSKNKSYFATLSDNDDED 172
           S  R +DHGK     ++   K FRCR CEG+ HYQAECP +LK + K+Y+ATLSD D +D
Sbjct: 233 SYRRNSDHGK----KKEDVGKSFRCRDCEGFSHYQAECPTYLKRQKKNYYATLSDEDSDD 292

Query: 173 DVSNSDFDEEIHALMGCLSQSSVK 197
           D    + D  ++A + C+++ ++K
Sbjct: 293 D----EVDHGVNAFIACITEINLK 308

BLAST of Cla97C04G069005 vs. NCBI nr
Match: KAA0050476.1 (gag-pol polyprotein [Cucumis melo var. makuwa])

HSP 1 Score: 69.3 bits (168), Expect = 4.1e-08
Identity = 36/80 (45.00%), Positives = 51/80 (63.75%), Query Frame = 0

Query: 113 SSNRKTDHGKGYGFSQDLRDKKFRCRKCEGYRHYQAECPNFLKSKNKSYFATLSDNDDED 172
           S  R +DHGK     +DL  + FRCR+CEG+ HYQ ECP +LK + K+Y+ATLSD D +D
Sbjct: 107 SYRRNSDHGKK---KEDL-GRSFRCRECEGFGHYQVECPTYLKRQKKNYYATLSDEDSDD 166

Query: 173 DVSNSDFDEEIHALMGCLSQ 193
           D      D  ++A   C+++
Sbjct: 167 DKD----DHGMNAFTTCITE 178

BLAST of Cla97C04G069005 vs. NCBI nr
Match: KAA0033858.1 (Receptor-like protein 12 [Cucumis melo var. makuwa])

HSP 1 Score: 68.6 bits (166), Expect = 7.0e-08
Identity = 40/94 (42.55%), Positives = 54/94 (57.45%), Query Frame = 0

Query: 84  REVSQSGNVFSNVEDNFSPARRSTLKSSQ-SSNRKTDHGKGYGFSQDLRDKKFRCRKCEG 143
           R+          VE +      ST K +  S  R +DHGK     QD+  + FRCR+C+G
Sbjct: 125 RKFKSMNTTGETVEKDRYDGENSTRKVNDFSQRRNSDHGKK---KQDV-GRSFRCRECKG 184

Query: 144 YRHYQAECPNFLKSKNKSYFATLSDNDDEDDVSN 177
           + HYQAECP FL+ + K+Y+ATLSD D +DD  N
Sbjct: 185 FGHYQAECPTFLRRQKKNYYATLSDEDSDDDEVN 214

BLAST of Cla97C04G069005 vs. NCBI nr
Match: TYK01241.1 (gag-pol polyprotein [Cucumis melo var. makuwa])

HSP 1 Score: 68.6 bits (166), Expect = 7.0e-08
Identity = 40/94 (42.55%), Positives = 54/94 (57.45%), Query Frame = 0

Query: 84  REVSQSGNVFSNVEDNFSPARRSTLKSSQ-SSNRKTDHGKGYGFSQDLRDKKFRCRKCEG 143
           R+          VE +      ST K +  S  R +DHGK     QD+  + FRCR+C+G
Sbjct: 142 RKFKSMNTTGKTVETDRYDGENSTRKVNDFSQRRNSDHGKK---KQDV-GRSFRCRECKG 201

Query: 144 YRHYQAECPNFLKSKNKSYFATLSDNDDEDDVSN 177
           + HYQAECP FL+ + K+Y+ATLSD D +DD  N
Sbjct: 202 FGHYQAECPTFLRRQKKNYYATLSDEDSDDDEVN 231

BLAST of Cla97C04G069005 vs. NCBI nr
Match: KAA0037333.1 (Ulp1-like peptidase [Cucumis melo var. makuwa])

HSP 1 Score: 67.8 bits (164), Expect = 1.2e-07
Identity = 36/87 (41.38%), Positives = 58/87 (66.67%), Query Frame = 0

Query: 107 TLKSSQSSNRKT-DHGKGYGFSQDLRDKKFRCRKCEGYRHYQAECPNFLKSKNKSYFATL 166
           T K+ + SNR+  DHGK     ++  ++ FRCR+CEG+ HY+AECP FL+ + K+Y+ATL
Sbjct: 116 TRKTDELSNRRNGDHGK----KKEEVERSFRCRECEGFNHYKAECPTFLRRQKKNYYATL 175

Query: 167 SDNDDEDDVSNSDFDEEIHALMGCLSQ 193
           S N+D DD   ++ D  ++    C+++
Sbjct: 176 S-NEDSDD---NEVDHGLNVFTTCITK 194

BLAST of Cla97C04G069005 vs. ExPASy TrEMBL
Match: A0A5A7T2X1 (Gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1567G00680 PE=4 SV=1)

HSP 1 Score: 75.5 bits (184), Expect = 2.8e-10
Identity = 37/84 (44.05%), Positives = 55/84 (65.48%), Query Frame = 0

Query: 113 SSNRKTDHGKGYGFSQDLRDKKFRCRKCEGYRHYQAECPNFLKSKNKSYFATLSDNDDED 172
           S  R +DHGK     ++   K FRCR CEG+ HYQAECP +LK + K+Y+ATLSD D +D
Sbjct: 233 SYRRNSDHGK----KKEDVGKSFRCRDCEGFSHYQAECPTYLKRQKKNYYATLSDEDSDD 292

Query: 173 DVSNSDFDEEIHALMGCLSQSSVK 197
           D    + D  ++A + C+++ ++K
Sbjct: 293 D----EVDHGVNAFIACITEINLK 308

BLAST of Cla97C04G069005 vs. ExPASy TrEMBL
Match: A0A5A7U8G9 (Gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold175G00900 PE=4 SV=1)

HSP 1 Score: 69.3 bits (168), Expect = 2.0e-08
Identity = 36/80 (45.00%), Positives = 51/80 (63.75%), Query Frame = 0

Query: 113 SSNRKTDHGKGYGFSQDLRDKKFRCRKCEGYRHYQAECPNFLKSKNKSYFATLSDNDDED 172
           S  R +DHGK     +DL  + FRCR+CEG+ HYQ ECP +LK + K+Y+ATLSD D +D
Sbjct: 107 SYRRNSDHGKK---KEDL-GRSFRCRECEGFGHYQVECPTYLKRQKKNYYATLSDEDSDD 166

Query: 173 DVSNSDFDEEIHALMGCLSQ 193
           D      D  ++A   C+++
Sbjct: 167 DKD----DHGMNAFTTCITE 178

BLAST of Cla97C04G069005 vs. ExPASy TrEMBL
Match: A0A5D3BN60 (Gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold49G00250 PE=4 SV=1)

HSP 1 Score: 68.6 bits (166), Expect = 3.4e-08
Identity = 40/94 (42.55%), Positives = 54/94 (57.45%), Query Frame = 0

Query: 84  REVSQSGNVFSNVEDNFSPARRSTLKSSQ-SSNRKTDHGKGYGFSQDLRDKKFRCRKCEG 143
           R+          VE +      ST K +  S  R +DHGK     QD+  + FRCR+C+G
Sbjct: 142 RKFKSMNTTGKTVETDRYDGENSTRKVNDFSQRRNSDHGKK---KQDV-GRSFRCRECKG 201

Query: 144 YRHYQAECPNFLKSKNKSYFATLSDNDDEDDVSN 177
           + HYQAECP FL+ + K+Y+ATLSD D +DD  N
Sbjct: 202 FGHYQAECPTFLRRQKKNYYATLSDEDSDDDEVN 231

BLAST of Cla97C04G069005 vs. ExPASy TrEMBL
Match: A0A5A7STB1 (Receptor-like protein 12 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold43059G00390 PE=4 SV=1)

HSP 1 Score: 68.6 bits (166), Expect = 3.4e-08
Identity = 40/94 (42.55%), Positives = 54/94 (57.45%), Query Frame = 0

Query: 84  REVSQSGNVFSNVEDNFSPARRSTLKSSQ-SSNRKTDHGKGYGFSQDLRDKKFRCRKCEG 143
           R+          VE +      ST K +  S  R +DHGK     QD+  + FRCR+C+G
Sbjct: 125 RKFKSMNTTGETVEKDRYDGENSTRKVNDFSQRRNSDHGKK---KQDV-GRSFRCRECKG 184

Query: 144 YRHYQAECPNFLKSKNKSYFATLSDNDDEDDVSN 177
           + HYQAECP FL+ + K+Y+ATLSD D +DD  N
Sbjct: 185 FGHYQAECPTFLRRQKKNYYATLSDEDSDDDEVN 214

BLAST of Cla97C04G069005 vs. ExPASy TrEMBL
Match: A0A5A7T2Y7 (Ulp1-like peptidase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold278G00240 PE=4 SV=1)

HSP 1 Score: 67.8 bits (164), Expect = 5.8e-08
Identity = 36/87 (41.38%), Positives = 58/87 (66.67%), Query Frame = 0

Query: 107 TLKSSQSSNRKT-DHGKGYGFSQDLRDKKFRCRKCEGYRHYQAECPNFLKSKNKSYFATL 166
           T K+ + SNR+  DHGK     ++  ++ FRCR+CEG+ HY+AECP FL+ + K+Y+ATL
Sbjct: 116 TRKTDELSNRRNGDHGK----KKEEVERSFRCRECEGFNHYKAECPTFLRRQKKNYYATL 175

Query: 167 SDNDDEDDVSNSDFDEEIHALMGCLSQ 193
           S N+D DD   ++ D  ++    C+++
Sbjct: 176 S-NEDSDD---NEVDHGLNVFTTCITK 194
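
The percentages in each HSP header follow from the counts printed beside them: matching positions divided by alignment length. A minimal sketch of that arithmetic, using the counts from the first HSP (37 identical and 55 positive positions out of 84); `percent` is a hypothetical helper, not part of any BLAST tool.

```python
def percent(matches, alignment_length):
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100 * matches / alignment_length, 2)


# Counts taken from the first HSP against KAA0037814.1.
print(percent(37, 84))  # identity: 44.05
print(percent(55, 84))  # positives: 65.48
```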

The following BLAST results are available for this feature:

NCBI nr:
Match Name    E-value  Identity (%)  Description
KAA0037814.1  5.8e-10  44.05         gag-pol polyprotein [Cucumis melo var. makuwa] >TYK26047.1 gag-pol polyprotein [...
KAA0050476.1  4.1e-08  45.00         gag-pol polyprotein [Cucumis melo var. makuwa]
KAA0033858.1  7.0e-08  42.55         Receptor-like protein 12 [Cucumis melo var. makuwa]
TYK01241.1    7.0e-08  42.55         gag-pol polyprotein [Cucumis melo var. makuwa]
KAA0037333.1  1.2e-07  41.38         Ulp1-like peptidase [Cucumis melo var. makuwa]

ExPASy TrEMBL:
Match Name    E-value  Identity (%)  Description
A0A5A7T2X1    2.8e-10  44.05         Gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1567...
A0A5A7U8G9    2.0e-08  45.00         Gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold175G...
A0A5D3BN60    3.4e-08  42.55         Gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold49G0...
A0A5A7STB1    3.4e-08  42.55         Receptor-like protein 12 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffol...
A0A5A7T2Y7    5.8e-08  41.38         Ulp1-like peptidase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold278G...

InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR Term   IPR Description                     Source       Source Term  Source Description                   Alignment
None       No IPR available                    MOBIDB_LITE  mobidb-lite  disorder_prediction                  coord: 103..127
None       No IPR available                    MOBIDB_LITE  mobidb-lite  disorder_prediction                  coord: 103..121
IPR036875  Zinc finger, CCHC-type superfamily  SUPERFAMILY  57756        Retrovirus zinc finger-like domains  coord: 118..160

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C04G069005.1  Cla97C04G069005.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0008270      zinc ion binding