Cla97C03G064940.1 (mRNA) Watermelon (97103) v2

NameCla97C03G064940.1
TypemRNA
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionHaloacid dehalogenase-like hydrolase (HAD) superfamily protein
LocationCla97Chr03 : 28413249 .. 28416285 (+)
Sequence length1053
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTTAAATCGTAGAGGCTCCTTCAATCAAATACTGTACTATGCTCGATTTCCTTTAATGGCGGCTACTCATTCTTTGACGCGAAATCTTCGATCGTCTCATTTTCATATGCAATCGCATCGTCCTTCTCAACCATTGCTATTTGAACGGTGTTGTGGAGTACCCATTACTGGAGAGCTTGTTCTTCGTAGTTTTCCAGGAAATAGATGCACTCGATTTGGTCTCAAAGCTTGTGTTCAATCCCCCAAAGATAATTACAGTAACGGCTCTTTGGGGATCAATGCTAAGGGCCCAACACTTGTTCGTCGTTCACACCTGCTTAACAATGTCGATATTGATTTAGTTGATGGAATTGATCGTAGATATTCTACTACTGTTCATGAGGAGCCTCAATGGGGAAATCCTTTAACGTTCCATGACTTCTCTTCCAGGGACAAACTTGTTGTTGCTGTTGACATCGATGAGGGTTTGTCTTTTCTTTCTCCTTCCCTGGATTTTGATTTCTGTTTCTCTTGACTTTCTCCAACCTAAAGCGGCTTGTCTTCTTCTTTTTACTCTTCCTCTGAATTTTCTGTTTAAGTGGGTTTGATTTTCTGTGACAGAGACAGGAAAAATCCAATAATGTACGTTGATGTAAACTAAGTGACTAAATCCATGGAATGGTTGACTTAGATTAATTAATTAATTTATTATTCCTTGACGACTTAGATAATTAGATTTGAGTAACAATACATTAGCTAATGACTTTGCATGACCATTGCATGTGTTAAGATGGCTACTTATTCTAGCCTGTACTTGATTTGAGCTGAGAATTTGGTTTATGTCAATCAATTGTTGCACACATGATTATTCTCTTAGTACACTGATCGAGTGCTTGTTTCTTTTTGTTGTGTAGTTTTGGGAAACTTTGTCTCCGCTCTTAACAAGTTTGTTGCTGATCGTTATTCTTCAAATCATTCTGTTTCAGAGTATCATGTCTATGAATTCTTCAGGGTATGATTTTGTTTCTCTTTTTTATTTGTTGGGGTTGCCTTTTGATTATTAATGTCTGGGAAACTTCTAATATGCACCAGTCTGATTTAGACTTTCGTTCGTGTCTCAGATATGGAAGTGCTCTCGTGATGAAGGTACAACTTTTGTGGATTGACTTTTAATCAATTGTCATCTGATCAAGATGGCTTTATAATTATATAATGATTGTTTTTACTGAGTGGAGTCATGCAGACTGATTGATGGTTGCATGTGGTTCTGAGATCTTTATTTTTTATTTTCTTTTTATAAATAGTTTTTTACTATTTCCAATCTTTCCATGTTGAATGAGTCTGAATAATAAGGAATCATAAGCGTCACGTGGTTGGTTTCATGACCTTTACTCAGTTGAATAATTGGACCTTCCAGCATTTTTTTTAAATGTCTTTGAGTATACTGGCTAGTTTATGTGCGCTTTTACTATTCTTACATTTGTATTAGTGCATGTGTTACAAAATAGTGTTTTGTAATTTGTTCTTGGTGTTTATCTTGAAAAAATATACATAGGAAAATTACTGATTTTTTAGTGAGAATTTTGAAAGCAAAATCGTTTGAGATCATGTTACCAATATGTTCAAAGTATGAGTTTTCTTTTGGGTTATAAATTGGCTTTAATAAATAGGTTTAAATTTTAGTATAGAAAAGGTATTTTTATACCATTGTCTTTTAAACATGTTTGCTTATAGAAGGTGTTCTGTGGTAACCTTTATTTGCTTTAACGCTTTTGTCTAATCTTGGCCATTGAATTTGGCTTTTGCACCTTATGCTTTTAGAAACCACTTCAATGTAATGTTGAGAAATTTCTGTGCAACAATCTTAAACTTCTTCTCAATTGTGCGGTCTCTTCGAGTGAAGGTATAAAATGAATAATTTGCGATTGCGGAAACTGTCGAAGAGGAACCTGAAACTGTTTAAATGGTACACTTTTGAAGGTTCATGGAAATGGATTTTGTTTGTTATGGGGAGCTCAAAGACCTTTTTCTTTTAGTGATACGAAATCTGTAATTGTGTGCTGATGAGAAAGGATTTTGTTTGTTAGTTGATCGATCTTGCAATCTACAAAATTTCAGGCTCAAAATTGTATGACATTTCTTATGTTAACCTATTTCCCGGCTGATGCATCCTTTGAAATTTGCACTTTTTTGAGAGAACTTTGCAGTCTATAATGAAGCACAATGTGATTTTTATGCATGAAAAACTCTTGATCGATTTTATCTTTATGTCCTTGATGCTGTTCATGAGATTTTTCTGTCCATTGAAATGTCTCAAACCTGGCTGGTTAATTGGTTCTGATACTTATTGCCCTGCTTTCCAGCTAATATTCGAGTTCATGAATTCTTCAAGACACCATATTTCAAGACTGGAATCTGGCCTATACCCGGAGCGCAGAGCACGCTCCTCAAGTTATCGAGATTCTGTCACCTATCAGTCGTAACGTATGTATAGCTACATTGGATCATTTTGATACGGTAGCAATTCCTTCCAATATACAATGAACTGATCTCGTCTGAACTAATTTTAGGTCTCGACAAAACGCAATCAAAGAACACACGCTTGAGTGGATCGAGAAACACTACCCGGGGCTGTTTCAGGAGATTCACTTTGGAAACCACTTTGCTCTAGATGGAGAGTCCAGATCAAAGGCAGAAATATGCAAGTATGTCAACGAATCCTTGCACTTTTATCATTTATCTTCTCACCCAACCATGCTCTAATCAAATCCTTTATTCAAGCAACACAACGCTCACTGATGAGTTGCTTTTAAATTTTTGTTGGATCGTGGATGATTCCACTGGGCAGGTCCTTCGGAGCAAGCGTGCTAATAGATGATAACCCAAGATACGCAATCGAATGTGCTGAAGCGGGGATTCGAGTTCTACTTTTCGACTATGAGAATTCATATCCATGGTGCAAGACTGAATGTGAGGATCTGCCTCCCTTAGTCACTAAAGTTCACAATTGGGAAGAAGTGGAGAAGCAATTAGGTTCTTGTGTTCTTTCTTCTTAA

mRNA sequence

ATGTTAAATCGTAGAGGCTCCTTCAATCAAATACTGTACTATGCTCGATTTCCTTTAATGGCGGCTACTCATTCTTTGACGCGAAATCTTCGATCGTCTCATTTTCATATGCAATCGCATCGTCCTTCTCAACCATTGCTATTTGAACGGTGTTGTGGAGTACCCATTACTGGAGAGCTTGTTCTTCGTAGTTTTCCAGGAAATAGATGCACTCGATTTGGTCTCAAAGCTTGTGTTCAATCCCCCAAAGATAATTACAGTAACGGCTCTTTGGGGATCAATGCTAAGGGCCCAACACTTGTTCGTCGTTCACACCTGCTTAACAATGTCGATATTGATTTAGTTGATGGAATTGATCGTAGATATTCTACTACTGTTCATGAGGAGCCTCAATGGGGAAATCCTTTAACGTTCCATGACTTCTCTTCCAGGGACAAACTTGTTGTTGCTGTTGACATCGATGAGGTTTTGGGAAACTTTGTCTCCGCTCTTAACAAGTTTGTTGCTGATCGTTATTCTTCAAATCATTCTGTTTCAGAGTATCATGTCTATGAATTCTTCAGGATATGGAAGTGCTCTCGTGATGAAGCTAATATTCGAGTTCATGAATTCTTCAAGACACCATATTTCAAGACTGGAATCTGGCCTATACCCGGAGCGCAGAGCACGCTCCTCAAGTTATCGAGATTCTGTCACCTATCAGTCGTAACGTCTCGACAAAACGCAATCAAAGAACACACGCTTGAGTGGATCGAGAAACACTACCCGGGGCTGTTTCAGGAGATTCACTTTGGAAACCACTTTGCTCTAGATGGAGAGTCCAGATCAAAGGCAGAAATATGCAAGTCCTTCGGAGCAAGCGTGCTAATAGATGATAACCCAAGATACGCAATCGAATGTGCTGAAGCGGGGATTCGAGTTCTACTTTTCGACTATGAGAATTCATATCCATGGTGCAAGACTGAATGTGAGGATCTGCCTCCCTTAGTCACTAAAGTTCACAATTGGGAAGAAGTGGAGAAGCAATTAGGTTCTTGTGTTCTTTCTTCTTAA

Coding sequence (CDS)

ATGTTAAATCGTAGAGGCTCCTTCAATCAAATACTGTACTATGCTCGATTTCCTTTAATGGCGGCTACTCATTCTTTGACGCGAAATCTTCGATCGTCTCATTTTCATATGCAATCGCATCGTCCTTCTCAACCATTGCTATTTGAACGGTGTTGTGGAGTACCCATTACTGGAGAGCTTGTTCTTCGTAGTTTTCCAGGAAATAGATGCACTCGATTTGGTCTCAAAGCTTGTGTTCAATCCCCCAAAGATAATTACAGTAACGGCTCTTTGGGGATCAATGCTAAGGGCCCAACACTTGTTCGTCGTTCACACCTGCTTAACAATGTCGATATTGATTTAGTTGATGGAATTGATCGTAGATATTCTACTACTGTTCATGAGGAGCCTCAATGGGGAAATCCTTTAACGTTCCATGACTTCTCTTCCAGGGACAAACTTGTTGTTGCTGTTGACATCGATGAGGTTTTGGGAAACTTTGTCTCCGCTCTTAACAAGTTTGTTGCTGATCGTTATTCTTCAAATCATTCTGTTTCAGAGTATCATGTCTATGAATTCTTCAGGATATGGAAGTGCTCTCGTGATGAAGCTAATATTCGAGTTCATGAATTCTTCAAGACACCATATTTCAAGACTGGAATCTGGCCTATACCCGGAGCGCAGAGCACGCTCCTCAAGTTATCGAGATTCTGTCACCTATCAGTCGTAACGTCTCGACAAAACGCAATCAAAGAACACACGCTTGAGTGGATCGAGAAACACTACCCGGGGCTGTTTCAGGAGATTCACTTTGGAAACCACTTTGCTCTAGATGGAGAGTCCAGATCAAAGGCAGAAATATGCAAGTCCTTCGGAGCAAGCGTGCTAATAGATGATAACCCAAGATACGCAATCGAATGTGCTGAAGCGGGGATTCGAGTTCTACTTTTCGACTATGAGAATTCATATCCATGGTGCAAGACTGAATGTGAGGATCTGCCTCCCTTAGTCACTAAAGTTCACAATTGGGAAGAAGTGGAGAAGCAATTAGGTTCTTGTGTTCTTTCTTCTTAA

Protein sequence

MLNRRGSFNQILYYARFPLMAATHSLTRNLRSSHFHMQSHRPSQPLLFERCCGVPITGELVLRSFPGNRCTRFGLKACVQSPKDNYSNGSLGINAKGPTLVRRSHLLNNVDIDLVDGIDRRYSTTVHEEPQWGNPLTFHDFSSRDKLVVAVDIDEVLGNFVSALNKFVADRYSSNHSVSEYHVYEFFRIWKCSRDEANIRVHEFFKTPYFKTGIWPIPGAQSTLLKLSRFCHLSVVTSRQNAIKEHTLEWIEKHYPGLFQEIHFGNHFALDGESRSKAEICKSFGASVLIDDNPRYAIECAEAGIRVLLFDYENSYPWCKTECEDLPPLVTKVHNWEEVEKQLGSCVLSS
BLAST of Cla97C03G064940.1 vs. NCBI nr
Match: XP_011660045.1 (PREDICTED: uncharacterized protein LOC101215456 [Cucumis sativus] >KGN66338.1 hypothetical protein Csa_1G598340 [Cucumis sativus])

HSP 1 Score: 652.9 bits (1683), Expect = 6.1e-184
Identity = 313/350 (89.43%), Postives = 325/350 (92.86%), Query Frame = 0

Query: 1   MLNRRGSFNQILYYARFPLMAATHSLTRNLRSSHFHMQSHRPSQPLLFERCCGVPITGEL 60
           MLN R SFNQILYYARFPLMAA +S+T+NLR SHF MQ HR  Q LLFERCCG+PITGEL
Sbjct: 1   MLNCRRSFNQILYYARFPLMAA-YSMTQNLRLSHFDMQPHRAPQQLLFERCCGLPITGEL 60

Query: 61  VLRSFPGNRCTRFGLKACVQSPKDNYSNGSLGINAKGPTLVRRSHLLNNVDIDLVDGIDR 120
           VLR F GNRCT F LKAC QSP+DNYSNGS G NAKGPTL RR  LLN+VD +LVDGIDR
Sbjct: 61  VLRGFSGNRCTGFSLKACAQSPQDNYSNGSFGFNAKGPTLARRPQLLNSVD-NLVDGIDR 120

Query: 121 RYSTTVHEEPQWGNPLTFHDFSSRDKLVVAVDIDEVLGNFVSALNKFVADRYSSNHSVSE 180
           R+S++VHEEP+WGNPLTFH+ SSRDKLVVAVDIDEVLGNFVSALNKFVADRYSSNHSVSE
Sbjct: 121 RFSSSVHEEPKWGNPLTFHEISSRDKLVVAVDIDEVLGNFVSALNKFVADRYSSNHSVSE 180

Query: 181 YHVYEFFRIWKCSRDEANIRVHEFFKTPYFKTGIWPIPGAQSTLLKLSRFCHLSVVTSRQ 240
           YHVYEFFRIWKCSRDEANIRVHEFFKTPYFKTGIWPIPGAQSTLLKLSRFCHLSVVTSRQ
Sbjct: 181 YHVYEFFRIWKCSRDEANIRVHEFFKTPYFKTGIWPIPGAQSTLLKLSRFCHLSVVTSRQ 240

Query: 241 NAIKEHTLEWIEKHYPGLFQEIHFGNHFALDGESRSKAEICKSFGASVLIDDNPRYAIEC 300
           NAIKEHTLEWIEKHY GLFQEIHFGNHFALDGESRSKAEICKSFGASVLIDDNPRYAIEC
Sbjct: 241 NAIKEHTLEWIEKHYQGLFQEIHFGNHFALDGESRSKAEICKSFGASVLIDDNPRYAIEC 300

Query: 301 AEAGIRVLLFDYENSYPWCKTECEDLPPLVTKVHNWEEVEKQLGSCVLSS 351
           AEAGIRVLLFDYENSYPWCKTEC DLPPLVTKVHNWEEVEKQL SCVL S
Sbjct: 301 AEAGIRVLLFDYENSYPWCKTECGDLPPLVTKVHNWEEVEKQLASCVLPS 348

BLAST of Cla97C03G064940.1 vs. NCBI nr
Match: XP_008450982.1 (PREDICTED: uncharacterized protein LOC103492406 isoform X1 [Cucumis melo] >XP_008450983.1 PREDICTED: uncharacterized protein LOC103492406 isoform X1 [Cucumis melo])

HSP 1 Score: 641.3 bits (1653), Expect = 1.9e-180
Identity = 299/327 (91.44%), Postives = 312/327 (95.41%), Query Frame = 0

Query: 22  ATHSLTRNLRSSHFHMQSHRPSQPLLFERCCGVPITGELVLRSFPGNRCTRFGLKACVQS 81
           A +S+T+N RSSHFHMQSHR  QPLLFERCCG+PITGELVLR F GNRCT FGLKACVQS
Sbjct: 2   AAYSMTQNFRSSHFHMQSHRAPQPLLFERCCGLPITGELVLRGFSGNRCTGFGLKACVQS 61

Query: 82  PKDNYSNGSLGINAKGPTLVRRSHLLNNVDIDLVDGIDRRYSTTVHEEPQWGNPLTFHDF 141
           PKDNYSNGS G+NAKGPTLVRR  LLN+VDIDLVDGIDRR+S++VHEEP+WGNPLTFH+F
Sbjct: 62  PKDNYSNGSFGLNAKGPTLVRRPQLLNSVDIDLVDGIDRRFSSSVHEEPKWGNPLTFHEF 121

Query: 142 SSRDKLVVAVDIDEVLGNFVSALNKFVADRYSSNHSVSEYHVYEFFRIWKCSRDEANIRV 201
           SSRDKLVVAVDIDEVLGNFVSALNKFVAD YSS+HSVSEYHVYEFFRIWKCSRDEANIRV
Sbjct: 122 SSRDKLVVAVDIDEVLGNFVSALNKFVADHYSSHHSVSEYHVYEFFRIWKCSRDEANIRV 181

Query: 202 HEFFKTPYFKTGIWPIPGAQSTLLKLSRFCHLSVVTSRQNAIKEHTLEWIEKHYPGLFQE 261
           HEFFKTPYFKTGIWPIPGAQSTLLKLSRFCHLSVVTSRQNAIKEHTLEWIEKHYPGLFQE
Sbjct: 182 HEFFKTPYFKTGIWPIPGAQSTLLKLSRFCHLSVVTSRQNAIKEHTLEWIEKHYPGLFQE 241

Query: 262 IHFGNHFALDGESRSKAEICKSFGASVLIDDNPRYAIECAEAGIRVLLFDYENSYPWCKT 321
           IHFGNHFALDGESRSKAEICKS GASVLIDDNPRYAIECAEAGIRVLLFDYENSYPWCKT
Sbjct: 242 IHFGNHFALDGESRSKAEICKSLGASVLIDDNPRYAIECAEAGIRVLLFDYENSYPWCKT 301

Query: 322 ECEDLPPLVTKVHNWEEVEKQLGSCVL 349
           EC +LPPLVTKVHNWEEVEK L SCVL
Sbjct: 302 ECGNLPPLVTKVHNWEEVEKHLASCVL 328

BLAST of Cla97C03G064940.1 vs. NCBI nr
Match: XP_008450985.1 (PREDICTED: uncharacterized protein LOC103492406 isoform X2 [Cucumis melo])

HSP 1 Score: 637.9 bits (1644), Expect = 2.0e-179
Identity = 297/323 (91.95%), Postives = 309/323 (95.67%), Query Frame = 0

Query: 26  LTRNLRSSHFHMQSHRPSQPLLFERCCGVPITGELVLRSFPGNRCTRFGLKACVQSPKDN 85
           +T+N RSSHFHMQSHR  QPLLFERCCG+PITGELVLR F GNRCT FGLKACVQSPKDN
Sbjct: 1   MTQNFRSSHFHMQSHRAPQPLLFERCCGLPITGELVLRGFSGNRCTGFGLKACVQSPKDN 60

Query: 86  YSNGSLGINAKGPTLVRRSHLLNNVDIDLVDGIDRRYSTTVHEEPQWGNPLTFHDFSSRD 145
           YSNGS G+NAKGPTLVRR  LLN+VDIDLVDGIDRR+S++VHEEP+WGNPLTFH+FSSRD
Sbjct: 61  YSNGSFGLNAKGPTLVRRPQLLNSVDIDLVDGIDRRFSSSVHEEPKWGNPLTFHEFSSRD 120

Query: 146 KLVVAVDIDEVLGNFVSALNKFVADRYSSNHSVSEYHVYEFFRIWKCSRDEANIRVHEFF 205
           KLVVAVDIDEVLGNFVSALNKFVAD YSS+HSVSEYHVYEFFRIWKCSRDEANIRVHEFF
Sbjct: 121 KLVVAVDIDEVLGNFVSALNKFVADHYSSHHSVSEYHVYEFFRIWKCSRDEANIRVHEFF 180

Query: 206 KTPYFKTGIWPIPGAQSTLLKLSRFCHLSVVTSRQNAIKEHTLEWIEKHYPGLFQEIHFG 265
           KTPYFKTGIWPIPGAQSTLLKLSRFCHLSVVTSRQNAIKEHTLEWIEKHYPGLFQEIHFG
Sbjct: 181 KTPYFKTGIWPIPGAQSTLLKLSRFCHLSVVTSRQNAIKEHTLEWIEKHYPGLFQEIHFG 240

Query: 266 NHFALDGESRSKAEICKSFGASVLIDDNPRYAIECAEAGIRVLLFDYENSYPWCKTECED 325
           NHFALDGESRSKAEICKS GASVLIDDNPRYAIECAEAGIRVLLFDYENSYPWCKTEC +
Sbjct: 241 NHFALDGESRSKAEICKSLGASVLIDDNPRYAIECAEAGIRVLLFDYENSYPWCKTECGN 300

Query: 326 LPPLVTKVHNWEEVEKQLGSCVL 349
           LPPLVTKVHNWEEVEK L SCVL
Sbjct: 301 LPPLVTKVHNWEEVEKHLASCVL 323

BLAST of Cla97C03G064940.1 vs. NCBI nr
Match: XP_022147633.1 (uncharacterized protein LOC111016509 [Momordica charantia] >XP_022147634.1 uncharacterized protein LOC111016509 [Momordica charantia])

HSP 1 Score: 625.9 bits (1613), Expect = 8.0e-176
Identity = 302/350 (86.29%), Postives = 318/350 (90.86%), Query Frame = 0

Query: 1   MLNRRGSFNQILYYARFPLMAATHSLTRNLRSSHFHMQSHRPSQPLLFERCCGVPITGEL 60
           MLNR GSF+QILYYAR P+MAA  S+TR+LR +HF++QSHRP Q  LFE+CCG PI+GEL
Sbjct: 1   MLNRGGSFHQILYYARSPVMAAP-SMTRDLRLAHFNLQSHRPPQSSLFEQCCGFPISGEL 60

Query: 61  VLRSFPGNRCTRFGLKACVQSPKDNYSNGSLGINAKGPTLVRRSHLLNNVDIDLVDGIDR 120
           VLRSF GNRC RFGLKACVQSPKDNYSN        GPTLVRR  LLNNVDIDLV+GIDR
Sbjct: 61  VLRSFRGNRCPRFGLKACVQSPKDNYSN--------GPTLVRRPQLLNNVDIDLVEGIDR 120

Query: 121 RYSTTVHEEPQWGNPLTFHDFSSRDKLVVAVDIDEVLGNFVSALNKFVADRYSSNHSVSE 180
           RYS  V  E QWGNPLTFH+ SSRDKL+VAVDIDEVLGNFVSALNKFVADRYSSNHSVSE
Sbjct: 121 RYSIAVPGESQWGNPLTFHELSSRDKLIVAVDIDEVLGNFVSALNKFVADRYSSNHSVSE 180

Query: 181 YHVYEFFRIWKCSRDEANIRVHEFFKTPYFKTGIWPIPGAQSTLLKLSRFCHLSVVTSRQ 240
           YHVYEFFRIWKCSRDEAN+RVHEFFKTPYFKTGIWPIPGAQSTLLKLSRFCHLSVVTSRQ
Sbjct: 181 YHVYEFFRIWKCSRDEANVRVHEFFKTPYFKTGIWPIPGAQSTLLKLSRFCHLSVVTSRQ 240

Query: 241 NAIKEHTLEWIEKHYPGLFQEIHFGNHFALDGESRSKAEICKSFGASVLIDDNPRYAIEC 300
           NAIK+HTLEWIEKHYPGLFQEIHFGNHFALDGESRSKAEICKS GASVLIDDNPRYAIEC
Sbjct: 241 NAIKDHTLEWIEKHYPGLFQEIHFGNHFALDGESRSKAEICKSLGASVLIDDNPRYAIEC 300

Query: 301 AEAGIRVLLFDYENSYPWCKTECEDLPPLVTKVHNWEEVEKQLGSCVLSS 351
           AEAGIRVLLFDYENSYPWCKTE EDLPPLVTKV NWE+VEKQL S V+SS
Sbjct: 301 AEAGIRVLLFDYENSYPWCKTESEDLPPLVTKVQNWEDVEKQLTSWVISS 341

BLAST of Cla97C03G064940.1 vs. NCBI nr
Match: XP_022967795.1 (uncharacterized protein LOC111467200 [Cucurbita maxima])

HSP 1 Score: 614.8 bits (1584), Expect = 1.9e-172
Identity = 293/329 (89.06%), Postives = 302/329 (91.79%), Query Frame = 0

Query: 22  ATHSLTRNLRSSHFHMQSHRPSQPLLFERCCGVPITGELVLRSFPGNRCTRFGLKACVQS 81
           A HSLTRNL SSHFHMQSHRPSQPLLFE CC  PI+ ELVLRSF  NRC RFGL+ACVQS
Sbjct: 2   AAHSLTRNLCSSHFHMQSHRPSQPLLFENCCAFPISAELVLRSFRRNRCNRFGLRACVQS 61

Query: 82  PKDNYSNGSLGINAKGPTLVRRSHLLNNVDIDLVDGIDRRYSTTVHEEPQWGNPLTFHDF 141
            KDN SNGSLG NAKGPT VRR  +LNNV+IDLVDGIDRRYSTTV  EPQWGNPLTFHD 
Sbjct: 62  HKDNCSNGSLGTNAKGPTPVRRPQVLNNVEIDLVDGIDRRYSTTVQGEPQWGNPLTFHDL 121

Query: 142 SSRDKLVVAVDIDEVLGNFVSALNKFVADRYSSNHSVSEYHVYEFFRIWKCSRDEANIRV 201
           SSR KLVVAVD+DEVLGNFVSALNKFVADRYSSNHSVSEYHVYEFFRIWKCSRDEAN RV
Sbjct: 122 SSRGKLVVAVDVDEVLGNFVSALNKFVADRYSSNHSVSEYHVYEFFRIWKCSRDEANHRV 181

Query: 202 HEFFKTPYFKTGIWPIPGAQSTLLKLSRFCHLSVVTSRQNAIKEHTLEWIEKHYPGLFQE 261
           HEFFKTPYFK GIWPIPGAQSTLLKLSRFCHLSVVTSRQNAIKEHTLEWIEKH+PGLFQE
Sbjct: 182 HEFFKTPYFKVGIWPIPGAQSTLLKLSRFCHLSVVTSRQNAIKEHTLEWIEKHFPGLFQE 241

Query: 262 IHFGNHFALDGESRSKAEICKSFGASVLIDDNPRYAIECAEAGIRVLLFDYENSYPWCKT 321
           IHFGNHFALDG SRSK EICKS GA+VLIDDNPRYAIECAEAGIRVLLFDYENSYPWCKT
Sbjct: 242 IHFGNHFALDGVSRSKEEICKSLGATVLIDDNPRYAIECAEAGIRVLLFDYENSYPWCKT 301

Query: 322 ECEDLPPLVTKVHNWEEVEKQLGSCVLSS 351
           E EDLPPLVTKV+NWE+VEKQL S VLSS
Sbjct: 302 ETEDLPPLVTKVYNWEDVEKQLTSLVLSS 330

BLAST of Cla97C03G064940.1 vs. TrEMBL
Match: tr|A0A0A0LX27|A0A0A0LX27_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G598340 PE=4 SV=1)

HSP 1 Score: 652.9 bits (1683), Expect = 4.1e-184
Identity = 313/350 (89.43%), Postives = 325/350 (92.86%), Query Frame = 0

Query: 1   MLNRRGSFNQILYYARFPLMAATHSLTRNLRSSHFHMQSHRPSQPLLFERCCGVPITGEL 60
           MLN R SFNQILYYARFPLMAA +S+T+NLR SHF MQ HR  Q LLFERCCG+PITGEL
Sbjct: 1   MLNCRRSFNQILYYARFPLMAA-YSMTQNLRLSHFDMQPHRAPQQLLFERCCGLPITGEL 60

Query: 61  VLRSFPGNRCTRFGLKACVQSPKDNYSNGSLGINAKGPTLVRRSHLLNNVDIDLVDGIDR 120
           VLR F GNRCT F LKAC QSP+DNYSNGS G NAKGPTL RR  LLN+VD +LVDGIDR
Sbjct: 61  VLRGFSGNRCTGFSLKACAQSPQDNYSNGSFGFNAKGPTLARRPQLLNSVD-NLVDGIDR 120

Query: 121 RYSTTVHEEPQWGNPLTFHDFSSRDKLVVAVDIDEVLGNFVSALNKFVADRYSSNHSVSE 180
           R+S++VHEEP+WGNPLTFH+ SSRDKLVVAVDIDEVLGNFVSALNKFVADRYSSNHSVSE
Sbjct: 121 RFSSSVHEEPKWGNPLTFHEISSRDKLVVAVDIDEVLGNFVSALNKFVADRYSSNHSVSE 180

Query: 181 YHVYEFFRIWKCSRDEANIRVHEFFKTPYFKTGIWPIPGAQSTLLKLSRFCHLSVVTSRQ 240
           YHVYEFFRIWKCSRDEANIRVHEFFKTPYFKTGIWPIPGAQSTLLKLSRFCHLSVVTSRQ
Sbjct: 181 YHVYEFFRIWKCSRDEANIRVHEFFKTPYFKTGIWPIPGAQSTLLKLSRFCHLSVVTSRQ 240

Query: 241 NAIKEHTLEWIEKHYPGLFQEIHFGNHFALDGESRSKAEICKSFGASVLIDDNPRYAIEC 300
           NAIKEHTLEWIEKHY GLFQEIHFGNHFALDGESRSKAEICKSFGASVLIDDNPRYAIEC
Sbjct: 241 NAIKEHTLEWIEKHYQGLFQEIHFGNHFALDGESRSKAEICKSFGASVLIDDNPRYAIEC 300

Query: 301 AEAGIRVLLFDYENSYPWCKTECEDLPPLVTKVHNWEEVEKQLGSCVLSS 351
           AEAGIRVLLFDYENSYPWCKTEC DLPPLVTKVHNWEEVEKQL SCVL S
Sbjct: 301 AEAGIRVLLFDYENSYPWCKTECGDLPPLVTKVHNWEEVEKQLASCVLPS 348

BLAST of Cla97C03G064940.1 vs. TrEMBL
Match: tr|A0A1S3BQG9|A0A1S3BQG9_CUCME (uncharacterized protein LOC103492406 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103492406 PE=4 SV=1)

HSP 1 Score: 641.3 bits (1653), Expect = 1.2e-180
Identity = 299/327 (91.44%), Postives = 312/327 (95.41%), Query Frame = 0

Query: 22  ATHSLTRNLRSSHFHMQSHRPSQPLLFERCCGVPITGELVLRSFPGNRCTRFGLKACVQS 81
           A +S+T+N RSSHFHMQSHR  QPLLFERCCG+PITGELVLR F GNRCT FGLKACVQS
Sbjct: 2   AAYSMTQNFRSSHFHMQSHRAPQPLLFERCCGLPITGELVLRGFSGNRCTGFGLKACVQS 61

Query: 82  PKDNYSNGSLGINAKGPTLVRRSHLLNNVDIDLVDGIDRRYSTTVHEEPQWGNPLTFHDF 141
           PKDNYSNGS G+NAKGPTLVRR  LLN+VDIDLVDGIDRR+S++VHEEP+WGNPLTFH+F
Sbjct: 62  PKDNYSNGSFGLNAKGPTLVRRPQLLNSVDIDLVDGIDRRFSSSVHEEPKWGNPLTFHEF 121

Query: 142 SSRDKLVVAVDIDEVLGNFVSALNKFVADRYSSNHSVSEYHVYEFFRIWKCSRDEANIRV 201
           SSRDKLVVAVDIDEVLGNFVSALNKFVAD YSS+HSVSEYHVYEFFRIWKCSRDEANIRV
Sbjct: 122 SSRDKLVVAVDIDEVLGNFVSALNKFVADHYSSHHSVSEYHVYEFFRIWKCSRDEANIRV 181

Query: 202 HEFFKTPYFKTGIWPIPGAQSTLLKLSRFCHLSVVTSRQNAIKEHTLEWIEKHYPGLFQE 261
           HEFFKTPYFKTGIWPIPGAQSTLLKLSRFCHLSVVTSRQNAIKEHTLEWIEKHYPGLFQE
Sbjct: 182 HEFFKTPYFKTGIWPIPGAQSTLLKLSRFCHLSVVTSRQNAIKEHTLEWIEKHYPGLFQE 241

Query: 262 IHFGNHFALDGESRSKAEICKSFGASVLIDDNPRYAIECAEAGIRVLLFDYENSYPWCKT 321
           IHFGNHFALDGESRSKAEICKS GASVLIDDNPRYAIECAEAGIRVLLFDYENSYPWCKT
Sbjct: 242 IHFGNHFALDGESRSKAEICKSLGASVLIDDNPRYAIECAEAGIRVLLFDYENSYPWCKT 301

Query: 322 ECEDLPPLVTKVHNWEEVEKQLGSCVL 349
           EC +LPPLVTKVHNWEEVEK L SCVL
Sbjct: 302 ECGNLPPLVTKVHNWEEVEKHLASCVL 328

BLAST of Cla97C03G064940.1 vs. TrEMBL
Match: tr|A0A1S3BRK7|A0A1S3BRK7_CUCME (uncharacterized protein LOC103492406 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103492406 PE=4 SV=1)

HSP 1 Score: 637.9 bits (1644), Expect = 1.4e-179
Identity = 297/323 (91.95%), Postives = 309/323 (95.67%), Query Frame = 0

Query: 26  LTRNLRSSHFHMQSHRPSQPLLFERCCGVPITGELVLRSFPGNRCTRFGLKACVQSPKDN 85
           +T+N RSSHFHMQSHR  QPLLFERCCG+PITGELVLR F GNRCT FGLKACVQSPKDN
Sbjct: 1   MTQNFRSSHFHMQSHRAPQPLLFERCCGLPITGELVLRGFSGNRCTGFGLKACVQSPKDN 60

Query: 86  YSNGSLGINAKGPTLVRRSHLLNNVDIDLVDGIDRRYSTTVHEEPQWGNPLTFHDFSSRD 145
           YSNGS G+NAKGPTLVRR  LLN+VDIDLVDGIDRR+S++VHEEP+WGNPLTFH+FSSRD
Sbjct: 61  YSNGSFGLNAKGPTLVRRPQLLNSVDIDLVDGIDRRFSSSVHEEPKWGNPLTFHEFSSRD 120

Query: 146 KLVVAVDIDEVLGNFVSALNKFVADRYSSNHSVSEYHVYEFFRIWKCSRDEANIRVHEFF 205
           KLVVAVDIDEVLGNFVSALNKFVAD YSS+HSVSEYHVYEFFRIWKCSRDEANIRVHEFF
Sbjct: 121 KLVVAVDIDEVLGNFVSALNKFVADHYSSHHSVSEYHVYEFFRIWKCSRDEANIRVHEFF 180

Query: 206 KTPYFKTGIWPIPGAQSTLLKLSRFCHLSVVTSRQNAIKEHTLEWIEKHYPGLFQEIHFG 265
           KTPYFKTGIWPIPGAQSTLLKLSRFCHLSVVTSRQNAIKEHTLEWIEKHYPGLFQEIHFG
Sbjct: 181 KTPYFKTGIWPIPGAQSTLLKLSRFCHLSVVTSRQNAIKEHTLEWIEKHYPGLFQEIHFG 240

Query: 266 NHFALDGESRSKAEICKSFGASVLIDDNPRYAIECAEAGIRVLLFDYENSYPWCKTECED 325
           NHFALDGESRSKAEICKS GASVLIDDNPRYAIECAEAGIRVLLFDYENSYPWCKTEC +
Sbjct: 241 NHFALDGESRSKAEICKSLGASVLIDDNPRYAIECAEAGIRVLLFDYENSYPWCKTECGN 300

Query: 326 LPPLVTKVHNWEEVEKQLGSCVL 349
           LPPLVTKVHNWEEVEK L SCVL
Sbjct: 301 LPPLVTKVHNWEEVEKHLASCVL 323

BLAST of Cla97C03G064940.1 vs. TrEMBL
Match: tr|A0A2I4EXJ0|A0A2I4EXJ0_9ROSI (uncharacterized protein LOC108993585 OS=Juglans regia OX=51240 GN=LOC108993585 PE=4 SV=1)

HSP 1 Score: 379.8 bits (974), Expect = 6.6e-102
Identity = 189/291 (64.95%), Postives = 220/291 (75.60%), Query Frame = 0

Query: 75  LKACVQSPKDN-----YSNGSLGINAKGPTLVRRSHL----LNNVDIDLVDGIDRRYSTT 134
           LK C++  +DN     +  G       GP      +L    + +++I L DG    ++ T
Sbjct: 90  LKGCLRD-EDNIDAIAFHKGKGKAKGMGPRAFAPQNLQLLPVRDLEIGLQDGTASPHNAT 149

Query: 135 V-------HEEPQWGNPLTFHDFSSRDKLVVAVDIDEVLGNFVSALNKFVADRYSSNHSV 194
                   H   + G+PL F D    DK+VVAVD+DEVLGNFVSALN+F+ADRY SNHS+
Sbjct: 150 ATANQNQQHGASRHGSPLGFPDRHLPDKIVVAVDVDEVLGNFVSALNRFIADRYFSNHSI 209

Query: 195 SEYHVYEFFRIWKCSRDEANIRVHEFFKTPYFKTGIWPIPGAQSTLLKLSRFCHLSVVTS 254
           SEYHVYEFF+IW CSRDEA+IRVHEFFKTPYFKTGI PIPGA+  L  LSRFC+LSVVTS
Sbjct: 210 SEYHVYEFFKIWNCSRDEADIRVHEFFKTPYFKTGIHPIPGARRALHTLSRFCNLSVVTS 269

Query: 255 RQNAIKEHTLEWIEKHYPGLFQEIHFGNHFALDGESRSKAEICKSFGASVLIDDNPRYAI 314
           RQNAIK+HT++WIEKHYPGLFQEIHFGNHFALDG SR K+EIC+S GA+VLIDDNPRYAI
Sbjct: 270 RQNAIKDHTIDWIEKHYPGLFQEIHFGNHFALDGVSRPKSEICRSLGANVLIDDNPRYAI 329

Query: 315 ECAEAGIRVLLFDYENSYPWCKTECEDLPPLVTKVHNWEEVEKQLGSCVLS 350
           ECAE GIRVLLFDYENSYPWCKTE  +  PLVTKVHNW EVEKQL S ++S
Sbjct: 330 ECAEIGIRVLLFDYENSYPWCKTESVNQHPLVTKVHNWAEVEKQLMSWIVS 379

BLAST of Cla97C03G064940.1 vs. TrEMBL
Match: tr|A0A218WNG7|A0A218WNG7_PUNGR (Uncharacterized protein OS=Punica granatum OX=22663 GN=CDL15_Pgr013229 PE=4 SV=1)

HSP 1 Score: 374.8 bits (961), Expect = 2.1e-100
Identity = 197/339 (58.11%), Postives = 232/339 (68.44%), Query Frame = 0

Query: 20  MAATHSLTRNLRSSHFHMQSHRPSQPLLFERCCGVPITGELVLRSFPGNRCTRF------ 79
           MA++  L     S+ FH   H  SQ L  E     P     V+     + C  F      
Sbjct: 1   MASSLILNDPFVSNRFHSLRHHASQSLPVENRSS-PFLARAVVPKI--DTCDGFRIADGR 60

Query: 80  ---GLKACVQSPKDNYSNGSLGINAKGPTLVRRSHLLNNVDIDLVDGIDRRYSTTVHEEP 139
               +K C  S  D+ S+      +  P L      L NV++ + D  D R    +    
Sbjct: 61  GGLSIKGCFNSVSDS-SSLFQKPQSGAPRLT-----LRNVEVGIEDEKDAR----LVNGR 120

Query: 140 QWGNPLTFHDFSSRDKLVVAVDIDEVLGNFVSALNKFVADRYSSNHSVSEYHVYEFFRIW 199
           QWG+PL F   +S +K+VVAVD+DEVLGNFVSALN+F+ADRYSSNHSVSEYHVYEF++IW
Sbjct: 121 QWGSPLCFPSSTSPEKIVVAVDVDEVLGNFVSALNRFIADRYSSNHSVSEYHVYEFYKIW 180

Query: 200 KCSRDEANIRVHEFFKTPYFKTGIWPIPGAQSTLLKLSRFCHLSVVTSRQNAIKEHTLEW 259
            CSRDEA++RVHEFFKTPYF+ GI PIPGA++ L KLS FC LSVVTSRQNAIK+HT+ W
Sbjct: 181 NCSRDEADLRVHEFFKTPYFRMGIHPIPGARTALHKLSSFCKLSVVTSRQNAIKDHTIRW 240

Query: 260 IEKHYPGLFQEIHFGNHFALDGESRSKAEICKSFGASVLIDDNPRYAIECAEAGIRVLLF 319
           IEKHYPGLF EIHFGNHFALDG SR K+EIC+S GA VLIDDNPRYAIECA  GIRVLLF
Sbjct: 241 IEKHYPGLFHEIHFGNHFALDGVSRPKSEICRSLGAKVLIDDNPRYAIECASVGIRVLLF 300

Query: 320 DYENSYPWCKTECEDLPPLVTKVHNWEEVEKQLGSCVLS 350
           DY+NSYPWCKTE     PLVTKVHNW EVE+QL S ++S
Sbjct: 301 DYKNSYPWCKTESIHGHPLVTKVHNWGEVEQQLVSWIVS 326

BLAST of Cla97C03G064940.1 vs. TAIR10
Match: AT4G33140.1 (Haloacid dehalogenase-like hydrolase (HAD) superfamily protein)

HSP 1 Score: 347.4 bits (890), Expect = 1.0e-95
Identity = 159/209 (76.08%), Postives = 182/209 (87.08%), Query Frame = 0

Query: 141 FSSRDKLVVAVDIDEVLGNFVSALNKFVADRYSSNHSVSEYHVYEFFRIWKCSRDEANIR 200
           F   DK+VVAVDIDEVLGNFVSALN+F+ADRY SNHSVSEYHVYEFF+IW CSR+EA++R
Sbjct: 143 FQGNDKIVVAVDIDEVLGNFVSALNRFIADRYLSNHSVSEYHVYEFFKIWNCSRNEADLR 202

Query: 201 VHEFFKTPYFKTGIWPIPGAQSTLLKLSRFCHLSVVTSRQNAIKEHTLEWIEKHYPGLFQ 260
           VHEFFKT YFK GI P+PGA  TL KLS++C +SVVTSRQNAIKEHTLEWIE H+PGLF+
Sbjct: 203 VHEFFKTSYFKKGIHPLPGAHKTLHKLSKYCDMSVVTSRQNAIKEHTLEWIEIHFPGLFK 262

Query: 261 EIHFGNHFALDGESRSKAEICKSFGASVLIDDNPRYAIECAEAGIRVLLFDYENSYPWCK 320
           +IHFGNHFAL GESR K+EIC+SFGA +LIDDNPRYA ECA  G++VLLFDYENSYPW K
Sbjct: 263 QIHFGNHFALHGESRPKSEICRSFGAEILIDDNPRYAEECANIGMKVLLFDYENSYPWSK 322

Query: 321 TECEDLPPLVTKVHNWEEVEKQLGSCVLS 350
           TE  D  PLVT+VHNWEEVE+Q+ S  +S
Sbjct: 323 TESVDRHPLVTRVHNWEEVEQQILSLAVS 351

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_011660045.16.1e-18489.43PREDICTED: uncharacterized protein LOC101215456 [Cucumis sativus] >KGN66338.1 hy... [more]
XP_008450982.11.9e-18091.44PREDICTED: uncharacterized protein LOC103492406 isoform X1 [Cucumis melo] >XP_00... [more]
XP_008450985.12.0e-17991.95PREDICTED: uncharacterized protein LOC103492406 isoform X2 [Cucumis melo][more]
XP_022147633.18.0e-17686.29uncharacterized protein LOC111016509 [Momordica charantia] >XP_022147634.1 uncha... [more]
XP_022967795.11.9e-17289.06uncharacterized protein LOC111467200 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
tr|A0A0A0LX27|A0A0A0LX27_CUCSA4.1e-18489.43Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G598340 PE=4 SV=1[more]
tr|A0A1S3BQG9|A0A1S3BQG9_CUCME1.2e-18091.44uncharacterized protein LOC103492406 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... [more]
tr|A0A1S3BRK7|A0A1S3BRK7_CUCME1.4e-17991.95uncharacterized protein LOC103492406 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10... [more]
tr|A0A2I4EXJ0|A0A2I4EXJ0_9ROSI6.6e-10264.95uncharacterized protein LOC108993585 OS=Juglans regia OX=51240 GN=LOC108993585 P... [more]
tr|A0A218WNG7|A0A218WNG7_PUNGR2.1e-10058.11Uncharacterized protein OS=Punica granatum OX=22663 GN=CDL15_Pgr013229 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
AT4G33140.11.0e-9576.08Haloacid dehalogenase-like hydrolase (HAD) superfamily protein[more]
The following terms have been associated with this mRNA:
Vocabulary: Molecular Function
TermDefinition
GO:00082535'-nucleotidase activity
Vocabulary: Biological Process
TermDefinition
GO:0009264deoxyribonucleotide catabolic process
Vocabulary: INTERPRO
TermDefinition
IPR036412HAD-like_sf
IPR023214HAD_sf
IPR0107085'(3')-deoxyribonucleotidase
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009264 deoxyribonucleotide catabolic process
molecular_function GO:0008253 5'-nucleotidase activity

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Cla97C03G064940Cla97C03G064940gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Cla97C03G064940.1Cla97C03G064940.1-proteinpolypeptide


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cla97C03G064940.1.exon.1Cla97C03G064940.1.exon.1exon
Cla97C03G064940.1.exon.2Cla97C03G064940.1.exon.2exon
Cla97C03G064940.1.exon.3Cla97C03G064940.1.exon.3exon
Cla97C03G064940.1.exon.4Cla97C03G064940.1.exon.4exon
Cla97C03G064940.1.exon.5Cla97C03G064940.1.exon.5exon
Cla97C03G064940.1.exon.6Cla97C03G064940.1.exon.6exon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cla97C03G064940.1.CDS.1Cla97C03G064940.1.CDS.1CDS
Cla97C03G064940.1.CDS.2Cla97C03G064940.1.CDS.2CDS
Cla97C03G064940.1.CDS.3Cla97C03G064940.1.CDS.3CDS
Cla97C03G064940.1.CDS.4Cla97C03G064940.1.CDS.4CDS
Cla97C03G064940.1.CDS.5Cla97C03G064940.1.CDS.5CDS
Cla97C03G064940.1.CDS.6Cla97C03G064940.1.CDS.6CDS


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableGENE3DG3DSA:1.10.40.40coord: 160..212
e-value: 1.9E-8
score: 36.6
NoneNo IPR availablePANTHERPTHR35134FAMILY NOT NAMEDcoord: 127..345
IPR0107085'(3')-deoxyribonucleotidasePFAMPF06941NT5Ccoord: 147..313
e-value: 2.8E-13
score: 50.0
IPR023214HAD superfamilyGENE3DG3DSA:3.40.50.1000coord: 213..311
e-value: 1.9E-8
score: 36.6
coord: 149..159
e-value: 1.9E-8
score: 36.6
IPR036412HAD-like superfamilySUPERFAMILYSSF56784HAD-likecoord: 135..165
coord: 195..343