Cla97C08G161650 (gene) Watermelon (97103) v2.5

Overview
NameCla97C08G161650
Typegene
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionPlant protein 1589 of unknown function
LocationCla97Chr08: 28056369 .. 28057645 (-)
RNA-Seq ExpressionCla97C08G161650
SyntenyCla97C08G161650
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGTTGTAAATGAGTGCACAAATCAGTCCCATTGTTCAAAGATCCGCTCCATTTCATATGTATGTTAAATTTGGAGACAACGTGGCTTTATGAGTATACACTAAAAACTTATCCATCGCAAGGCTCACAATTCAATCACCCAACAGCTCCGCATATGATTATGTTCACTGCTTTCCAATCTGTATGTTACCCTTTCAATATTATTGTGGCCATTTATTTATTTGATGGCAGCTCTTAAATAATAAATAATAAATAATAAATATTGGGATGATCTACAACCTCACTTCCTTTTTTGTCCCCCTTTCAATTTCACCTATATCTATCAACCATCAGTCTATAGTTGTAATTTGAATGGCACTGGGAAAACAGTAGATTATGAATAATGAGTTATTGGCAGTGACAGGAATTTAAAGAGGTTGTTCATTGTTGGGGCCAAAGGGGAAGAGAGATTAGATTTGTCAAAATCAGTCACCATTATATTGCAATAACCATCCATTCCATACTGCAACTCCATCCTTCTTATCTGCTTGGAGATGGGTTTGGATTTCTTGTGCTCTTTTGGCCCTTCACTCAGATATCTGTGGATAATATTTCTTCCTTCTTTGCTATAAAAAGCTCACAAATATCTTCTCCTGTTTGGCGCTTATGAGGGCTTTTAATTTGTTCTCTTCTCCTATTCTCTGTTTATTGGTTTGGTTTTTTTTTTTCGCTGAATGTACTGGTTTTTGTCCAATTCCTAAGCTAATTTATAAACAGATATACATATATAAATATATACTATGTACAGCCACCAGTTTCTTCCTTGTTTTTATTGCCATCCCCATGCCTATATCCGGATGGTACTGCTAACGTAACTGCTTCTCTCTCTTTCACTCATCTTGCCGTAGTTGCGGTTAAGACTCTGCGCATTATTATTTTGCAGGTCCAACATCTGATAGAGAGATGCTTGCTGTTTCATATGAGTCGAGATGAGTGCGTAAAGGCATTGGCTCACCATGCAAACATTCGCCCTCTCATAACACTTACAGGTCCATACAAACACTTCCTTAACTTTAATTCTTCTCTCTCTTTCTCTAAATCAAATTAATTATTCAATAATCTAATTAAGCTTTTATGGTGTTTTTTTGACCTCCCAGTGTGGAAAGAGCTCCAGAAAGAGAACTCTGAATTCTTCCGGGCATATTTCCATACTATTTCTCCCAACCCATTCCTCGGTATCATCCCTTCTAATTTCACTAAAATAATTATTCAATCAATTCTTGGATGTTTATTCTAA

mRNA sequence

ATGGTTCCACCAGTTTCTTCCTTGTTTTTATTGCCATCCCCATGCCTATATCCGGATGGTACTGCTAACGTCCAACATCTGATAGAGAGATGCTTGCTGTTTCATATGAGTCGAGATGAGTGCGTAAAGGCATTGGCTCACCATGCAAACATTCGCCCTCTCATAACACTTACAGTGTGGAAAGAGCTCCAGAAAGAGAACTCTGAATTCTTCCGGGCATATTTCCATACTATTTCTCCCAACCCATTCCTCGGTATCATCCCTTCTAATTTCACTAAAATAATTATTCAATCAATTCTTGGATGTTTATTCTAA

Coding sequence (CDS)

ATGGTTCCACCAGTTTCTTCCTTGTTTTTATTGCCATCCCCATGCCTATATCCGGATGGTACTGCTAACGTCCAACATCTGATAGAGAGATGCTTGCTGTTTCATATGAGTCGAGATGAGTGCGTAAAGGCATTGGCTCACCATGCAAACATTCGCCCTCTCATAACACTTACAGTGTGGAAAGAGCTCCAGAAAGAGAACTCTGAATTCTTCCGGGCATATTTCCATACTATTTCTCCCAACCCATTCCTCGGTATCATCCCTTCTAATTTCACTAAAATAATTATTCAATCAATTCTTGGATGTTTATTCTAA

Protein sequence

MVPPVSSLFLLPSPCLYPDGTANVQHLIERCLLFHMSRDECVKALAHHANIRPLITLTVWKELQKENSEFFRAYFHTISPNPFLGIIPSNFTKIIIQSILGCLF
Homology
BLAST of Cla97C08G161650 vs. NCBI nr
Match: XP_038884418.1 (uncharacterized protein LOC120075271 isoform X1 [Benincasa hispida])

HSP 1 Score: 131.0 bits (328), Expect = 6.1e-27
Identity = 60/61 (98.36%), Postives = 60/61 (98.36%), Query Frame = 0

Query: 24 VQHLIERCLLFHMSRDECVKALAHHANIRPLITLTVWKELQKENSEFFRAYFHTISPNPF 83
          VQHLIERCLL HMSRDECVKALAHHANIRPLITLTVWKELQKENSEFFRAYFHTISPNPF
Sbjct: 26 VQHLIERCLLLHMSRDECVKALAHHANIRPLITLTVWKELQKENSEFFRAYFHTISPNPF 85

Query: 84 L 85
          L
Sbjct: 86 L 86

BLAST of Cla97C08G161650 vs. NCBI nr
Match: XP_038884419.1 (uncharacterized protein LOC120075271 isoform X2 [Benincasa hispida])

HSP 1 Score: 131.0 bits (328), Expect = 6.1e-27
Identity = 60/61 (98.36%), Postives = 60/61 (98.36%), Query Frame = 0

Query: 24 VQHLIERCLLFHMSRDECVKALAHHANIRPLITLTVWKELQKENSEFFRAYFHTISPNPF 83
          VQHLIERCLL HMSRDECVKALAHHANIRPLITLTVWKELQKENSEFFRAYFHTISPNPF
Sbjct: 26 VQHLIERCLLLHMSRDECVKALAHHANIRPLITLTVWKELQKENSEFFRAYFHTISPNPF 85

Query: 84 L 85
          L
Sbjct: 86 L 86

BLAST of Cla97C08G161650 vs. NCBI nr
Match: XP_008444348.1 (PREDICTED: uncharacterized protein LOC103487701 [Cucumis melo])

HSP 1 Score: 129.8 bits (325), Expect = 1.4e-26
Identity = 61/77 (79.22%), Postives = 65/77 (84.42%), Query Frame = 0

Query: 24  VQHLIERCLLFHMSRDECVKALAHHANIRPLITLTVWKELQKENSEFFRAYFHTISPNPF 83
           VQHLIERCLL HMSRDECVKALAHHANIRPLITLTVWKELQKENS+FFRAYFHTISPNPF
Sbjct: 24  VQHLIERCLLLHMSRDECVKALAHHANIRPLITLTVWKELQKENSDFFRAYFHTISPNPF 83

Query: 84  LGIIPSNFTKIIIQSIL 101
           L     +   I+ +  L
Sbjct: 84  LAKFTGSERSIVRRQYL 100

BLAST of Cla97C08G161650 vs. NCBI nr
Match: XP_004143067.2 (uncharacterized protein LOC101211262 [Cucumis sativus] >KGN62299.1 hypothetical protein Csa_018552 [Cucumis sativus])

HSP 1 Score: 126.3 bits (316), Expect = 1.5e-25
Identity = 66/87 (75.86%), Postives = 68/87 (78.16%), Query Frame = 0

Query: 1  MVPPVSSLFLLPSPCLYPDGTA---NVQHLIERCLLFHMSRDECVKALAHHANIRPLITL 60
          M  P S  FL   PC +    A    VQHLIERCLL HMSRDECVKALA HANIRPLITL
Sbjct: 1  MYRPNSHHFL---PCFHCQPHAYIRMVQHLIERCLLLHMSRDECVKALADHANIRPLITL 60

Query: 61 TVWKELQKENSEFFRAYFHTISPNPFL 85
          TVWKELQKENS+FFRAYFHTISPNPFL
Sbjct: 61 TVWKELQKENSDFFRAYFHTISPNPFL 84

BLAST of Cla97C08G161650 vs. NCBI nr
Match: XP_022961647.1 (uncharacterized protein LOC111462356 isoform X1 [Cucurbita moschata])

HSP 1 Score: 125.9 bits (315), Expect = 2.0e-25
Identity = 65/89 (73.03%), Postives = 68/89 (76.40%), Query Frame = 0

Query: 4  PVSSLFLLPSPCLYPDGTA---NVQHLIERCLLFHMSRDECVKALAHHANIRPLITLTVW 63
          P S  FL   PC +    A    VQHLIERCLL HMSRDECVKALAHHANIRPLIT TVW
Sbjct: 3  PYSHQFL---PCFHCHPQAYIRKVQHLIERCLLLHMSRDECVKALAHHANIRPLITHTVW 62

Query: 64 KELQKENSEFFRAYFHTISPNPFLGIIPS 90
          KELQKEN EFFRAYF TIS +PFL +IPS
Sbjct: 63 KELQKENPEFFRAYFRTISRHPFLSMIPS 88

BLAST of Cla97C08G161650 vs. ExPASy TrEMBL
Match: A0A1S3BA39 (uncharacterized protein LOC103487701 OS=Cucumis melo OX=3656 GN=LOC103487701 PE=4 SV=1)

HSP 1 Score: 129.8 bits (325), Expect = 6.6e-27
Identity = 61/77 (79.22%), Postives = 65/77 (84.42%), Query Frame = 0

Query: 24  VQHLIERCLLFHMSRDECVKALAHHANIRPLITLTVWKELQKENSEFFRAYFHTISPNPF 83
           VQHLIERCLL HMSRDECVKALAHHANIRPLITLTVWKELQKENS+FFRAYFHTISPNPF
Sbjct: 24  VQHLIERCLLLHMSRDECVKALAHHANIRPLITLTVWKELQKENSDFFRAYFHTISPNPF 83

Query: 84  LGIIPSNFTKIIIQSIL 101
           L     +   I+ +  L
Sbjct: 84  LAKFTGSERSIVRRQYL 100

BLAST of Cla97C08G161650 vs. ExPASy TrEMBL
Match: A0A0A0LMP2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G348860 PE=4 SV=1)

HSP 1 Score: 126.3 bits (316), Expect = 7.3e-26
Identity = 66/87 (75.86%), Postives = 68/87 (78.16%), Query Frame = 0

Query: 1  MVPPVSSLFLLPSPCLYPDGTA---NVQHLIERCLLFHMSRDECVKALAHHANIRPLITL 60
          M  P S  FL   PC +    A    VQHLIERCLL HMSRDECVKALA HANIRPLITL
Sbjct: 1  MYRPNSHHFL---PCFHCQPHAYIRMVQHLIERCLLLHMSRDECVKALADHANIRPLITL 60

Query: 61 TVWKELQKENSEFFRAYFHTISPNPFL 85
          TVWKELQKENS+FFRAYFHTISPNPFL
Sbjct: 61 TVWKELQKENSDFFRAYFHTISPNPFL 84

BLAST of Cla97C08G161650 vs. ExPASy TrEMBL
Match: A0A6J1HAR0 (uncharacterized protein LOC111462356 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111462356 PE=4 SV=1)

HSP 1 Score: 125.9 bits (315), Expect = 9.6e-26
Identity = 65/89 (73.03%), Postives = 68/89 (76.40%), Query Frame = 0

Query: 4  PVSSLFLLPSPCLYPDGTA---NVQHLIERCLLFHMSRDECVKALAHHANIRPLITLTVW 63
          P S  FL   PC +    A    VQHLIERCLL HMSRDECVKALAHHANIRPLIT TVW
Sbjct: 3  PYSHQFL---PCFHCHPQAYIRKVQHLIERCLLLHMSRDECVKALAHHANIRPLITHTVW 62

Query: 64 KELQKENSEFFRAYFHTISPNPFLGIIPS 90
          KELQKEN EFFRAYF TIS +PFL +IPS
Sbjct: 63 KELQKENPEFFRAYFRTISRHPFLSMIPS 88

BLAST of Cla97C08G161650 vs. ExPASy TrEMBL
Match: A0A6J1KA89 (uncharacterized protein LOC111492501 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111492501 PE=4 SV=1)

HSP 1 Score: 122.1 bits (305), Expect = 1.4e-24
Identity = 65/98 (66.33%), Postives = 70/98 (71.43%), Query Frame = 0

Query: 4  PVSSLFLLPSPCLYPDGTA---NVQHLIERCLLFHMSRDECVKALAHHANIRPLITLTVW 63
          P S  FL   PC +    A    VQHLIERCLL HMSRDECVKALAHHANIRPLIT TVW
Sbjct: 3  PYSHQFL---PCFHCHPQAYIRMVQHLIERCLLLHMSRDECVKALAHHANIRPLITHTVW 62

Query: 64 KELQKENSEFFRAYFHTISPNPFLGII-----PSNFTK 94
          KELQKEN EFFRAYF TIS +PFL +      P N++K
Sbjct: 63 KELQKENPEFFRAYFRTISRHPFLSMFLPFLPPKNYSK 97

BLAST of Cla97C08G161650 vs. ExPASy TrEMBL
Match: A0A6J1K802 (uncharacterized protein LOC111492501 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111492501 PE=4 SV=1)

HSP 1 Score: 119.4 bits (298), Expect = 8.9e-24
Identity = 65/100 (65.00%), Postives = 71/100 (71.00%), Query Frame = 0

Query: 4   PVSSLFLLPSPCLYPDGTA---NVQHLIERCLLFHMSRDECVKALAHHANIRPLITLTVW 63
           P S  FL   PC +    A    VQHLIERCLL HMSRDECVKALAHHANIRPLIT TVW
Sbjct: 3   PYSHQFL---PCFHCHPQAYIRMVQHLIERCLLLHMSRDECVKALAHHANIRPLITHTVW 62

Query: 64  KELQKENSEFFRAYFHTISPNPFLGIIPSNFTKIIIQSIL 101
           KELQKEN EFFRAYF TIS +PFL    + FT+  +  I+
Sbjct: 63  KELQKENPEFFRAYFRTISRHPFL----TKFTRRSVSRIM 95

BLAST of Cla97C08G161650 vs. TAIR 10
Match: AT1G10657.2 (Plant protein 1589 of unknown function )

HSP 1 Score: 97.8 bits (242), Expect = 5.4e-21
Identity = 41/61 (67.21%), Postives = 50/61 (81.97%), Query Frame = 0

Query: 24 VQHLIERCLLFHMSRDECVKALAHHANIRPLITLTVWKELQKENSEFFRAYFHTISPNPF 83
          VQH+IERC+L  M+RDECVKAL HHA+I PL+TLTVW+ LQ+EN +FF  Y H +SP PF
Sbjct: 27 VQHMIERCILLRMTRDECVKALDHHASILPLVTLTVWRGLQRENKDFFETYGHFVSPRPF 86

Query: 84 L 85
          L
Sbjct: 87 L 87

BLAST of Cla97C08G161650 vs. TAIR 10
Match: AT1G10657.3 (Plant protein 1589 of unknown function )

HSP 1 Score: 97.8 bits (242), Expect = 5.4e-21
Identity = 41/61 (67.21%), Postives = 50/61 (81.97%), Query Frame = 0

Query: 24 VQHLIERCLLFHMSRDECVKALAHHANIRPLITLTVWKELQKENSEFFRAYFHTISPNPF 83
          VQH+IERC+L  M+RDECVKAL HHA+I PL+TLTVW+ LQ+EN +FF  Y H +SP PF
Sbjct: 27 VQHMIERCILLRMTRDECVKALDHHASILPLVTLTVWRGLQRENKDFFETYGHFVSPRPF 86

Query: 84 L 85
          L
Sbjct: 87 L 87

BLAST of Cla97C08G161650 vs. TAIR 10
Match: AT3G55240.1 (Plant protein 1589 of unknown function )

HSP 1 Score: 90.1 bits (222), Expect = 1.1e-18
Identity = 37/51 (72.55%), Postives = 47/51 (92.16%), Query Frame = 0

Query: 24 VQHLIERCLLFHMSRDECVKALAHHANIRPLITLTVWKELQKENSEFFRAY 75
          VQH+IE+CL+FHMS++ECV+AL+ HANI P+IT TVWKEL+KEN EFF+AY
Sbjct: 13 VQHMIEKCLIFHMSKEECVEALSKHANITPVITSTVWKELEKENKEFFKAY 63

BLAST of Cla97C08G161650 vs. TAIR 10
Match: AT1G10657.1 (Plant protein 1589 of unknown function )

HSP 1 Score: 84.3 bits (207), Expect = 6.1e-17
Identity = 41/85 (48.24%), Postives = 50/85 (58.82%), Query Frame = 0

Query: 24  VQHLIERCLLFHMSRDECVKALAHHANIRPLITLT------------------------V 83
           VQH+IERC+L  M+RDECVKAL HHA+I PL+TLT                        V
Sbjct: 27  VQHMIERCILLRMTRDECVKALDHHASILPLVTLTVFSYILGFASVLQKKLHDIFPPPAV 86

Query: 84  WKELQKENSEFFRAYFHTISPNPFL 85
           W+ LQ+EN +FF  Y H +SP PFL
Sbjct: 87  WRGLQRENKDFFETYGHFVSPRPFL 111

BLAST of Cla97C08G161650 vs. TAIR 10
Match: AT1G10657.4 (Plant protein 1589 of unknown function )

HSP 1 Score: 81.6 bits (200), Expect = 4.0e-16
Identity = 34/48 (70.83%), Postives = 42/48 (87.50%), Query Frame = 0

Query: 24 VQHLIERCLLFHMSRDECVKALAHHANIRPLITLTVWKELQKENSEFF 72
          VQH+IERC+L  M+RDECVKAL HHA+I PL+TLTVW+ LQ+EN +FF
Sbjct: 27 VQHMIERCILLRMTRDECVKALDHHASILPLVTLTVWRGLQRENKDFF 74

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038884418.16.1e-2798.36uncharacterized protein LOC120075271 isoform X1 [Benincasa hispida][more]
XP_038884419.16.1e-2798.36uncharacterized protein LOC120075271 isoform X2 [Benincasa hispida][more]
XP_008444348.11.4e-2679.22PREDICTED: uncharacterized protein LOC103487701 [Cucumis melo][more]
XP_004143067.21.5e-2575.86uncharacterized protein LOC101211262 [Cucumis sativus] >KGN62299.1 hypothetical ... [more]
XP_022961647.12.0e-2573.03uncharacterized protein LOC111462356 isoform X1 [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A1S3BA396.6e-2779.22uncharacterized protein LOC103487701 OS=Cucumis melo OX=3656 GN=LOC103487701 PE=... [more]
A0A0A0LMP27.3e-2675.86Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G348860 PE=4 SV=1[more]
A0A6J1HAR09.6e-2673.03uncharacterized protein LOC111462356 isoform X1 OS=Cucurbita moschata OX=3662 GN... [more]
A0A6J1KA891.4e-2466.33uncharacterized protein LOC111492501 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A6J1K8028.9e-2465.00uncharacterized protein LOC111492501 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... [more]
Match NameE-valueIdentityDescription
AT1G10657.25.4e-2167.21Plant protein 1589 of unknown function [more]
AT1G10657.35.4e-2167.21Plant protein 1589 of unknown function [more]
AT3G55240.11.1e-1872.55Plant protein 1589 of unknown function [more]
AT1G10657.16.1e-1748.24Plant protein 1589 of unknown function [more]
AT1G10657.44.0e-1670.83Plant protein 1589 of unknown function [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR006476Conserved hypothetical protein CHP01589, plantPFAMPF09713A_thal_3526coord: 24..74
e-value: 3.8E-26
score: 91.1
IPR006476Conserved hypothetical protein CHP01589, plantTIGRFAMTIGR01589TIGR01589coord: 24..74
e-value: 2.9E-23
score: 79.6
IPR006476Conserved hypothetical protein CHP01589, plantPANTHERPTHR31871OS02G0137100 PROTEINcoord: 22..84
NoneNo IPR availablePANTHERPTHR31871:SF25SUBFAMILY NOT NAMEDcoord: 22..84

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C08G161650.2Cla97C08G161650.2mRNA