Cla97C01G010790 (gene) Watermelon (97103) v2

NameCla97C01G010790
Typegene
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionGag-asp_proteas domain-containing protein
LocationCla97Chr01 : 16802634 .. 16804139 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAGAGATCGAGTTAGTTGTTTGGAGGAACATGTCGGTGTACTTCCTGAGGGACATTCAACCATGGTGGTTGAGATGGCGATGACGCACGAAGTTAGACTGCTGACAATAGAGGGAACGTTGGGAGACTTCATGGCCCAAACGAACAAGCGTTTGGAAGAGATCATTGCAGAATTGGAAGACATACCTTATCCAATCTCCTTCAACATAAAGACTCTGAAAGAGGAACTCGACCTGACAAACACTGAAGTAGCCTTGGTGAAGAAAGCCTATGTGGACTCCTCACCCAAAGCTGACAATAATTCAAAGGTAAAGATTCCTGAACCATTAGTTTTTAGTGGTTTGAGAGATGCCAAGGAACTTGAAAACTTCCTTTGGGACATTGAAAAATACTACAAGATTGCCAAGATCATTGAAGAGGCACAAGTTGGCATTGCCAGCATGTGCCTGACCTCGAACGCCAATTTGTGGTGGGGTACTTAAGTACTCGATGATGATGGAAATCCATGAAGAAGGAACTTAGAGACTAGTTCCTTCTTAGCAATACAAGCTAGGTTGCTAGGGAGGCCCTGGAGAAGCTCAAGCATACAGGCTCCTTACGAGCCTATGTCAAGGATTACAACTCCCTGATCCTAGACATCTAGCACATGTCTGAGGAAAAACTATTCAATTTCATTTTGAGGGTGTAACCCTTGGCCCAGACCAAGCTAAGGAGACAAACAGTCAAGGATCTACCTTCCGACATTGTTGCTGCAGATGCCCTATTGGATTTTAAGGCCACGAACTCTTCCACTTCTCTTGGGAAGGAGAAGAAGGATTGGAAAAAGTCCAACAACCATAAGGACGGGTCCAAAGGTGAAATGGACTGTAGTGACAAGGACAAAGGTAAATCTGTTACCCCCTACTCCTTCAACGAGGACCTACAATTACTTTCTATTCAAGGAACCACATGGAGCACGAGATTGCCCAAGAAAAGAAAAATTAAATGTCATGCTTGAGGAGAGAGAGGCCAAAAAAGACACCTACCAGGTCAACAACCTCCAGTTGTTGAGTGCTTTCTCTTTTATCCCACAATCATGTCATTATAATTTGATGCATATGGTCGTCAAGATAAATGAGGTAGAGGTGTTCACTATGCTAGCCATTGAAGCCACCAACAACTTCATATCCACGAAGTTGGCGACCAAGTTGGGCTTGATAGTGACCAAGAGTGACAGTGATCATAAGACTGTCAATGCAGAGGCTCGAGTGACAGAAGGCGCATCCACGACCCGATTGGAGTTTCATTCATGGACGAACCAATGTAAGTTCTCAGTGATACCCCTAGACGACTTCGAGGTAGTACTAGGACTCGAGTCCCTGGTAAAGGAAAAGATCTCCCTAATACTTCATCTCAACGGGGTGATGGTTAACAATAAAACCACCCCCTGCTTTGTTCGAGGTATAACAACAAACCTAGGTTCGAGAAATAATCGTTTTGCCTTTGTCGGGCCAACCAAGTCGTAG

mRNA sequence

ATGAGAGATCGAGTTAGTTGTTTGGAGGAACATGTCGGTGTACTTCCTGAGGGACATTCAACCATGGTGGTTGAGATGGCGATGACGCACGAAGTTAGACTGCTGACAATAGAGGGAACGTTGGGAGACTTCATGGCCCAAACGAACAAGCGTTTGGAAGAGATCATTGCAGAATTGGAAGACATACCTTATCCAATCTCCTTCAACATAAAGACTCTGAAAGAGGAACTCGACCTGACAAACACTGAAATAAATGAGGTAGAGGTGTTCACTATGCTAGCCATTGAAGCCACCAACAACTTCATATCCACGAAGTTGGCGACCAAGTTGGGCTTGATAGTGACCAAGAGTGACAGTGATCATAAGACTGTCAATGCAGAGGCTCGAGTGACAGAAGGCGCATCCACGACCCGATTGGAGTTTCATTCATGGACGAACCAATGTAAGTTCTCAGTGATACCCCTAGACGACTTCGAGGTAGTACTAGGACTCGAGTCCCTGGTAAAGGAAAAGATCTCCCTAATACTTCATCTCAACGGGGTGATGGTTAACAATAAAACCACCCCCTGCTTTGTTCGAGGTATAACAACAAACCTAGGTTCGAGAAATAATCGTTTTGCCTTTGTCGGGCCAACCAAGTCGTAG

Coding sequence (CDS)

ATGAGAGATCGAGTTAGTTGTTTGGAGGAACATGTCGGTGTACTTCCTGAGGGACATTCAACCATGGTGGTTGAGATGGCGATGACGCACGAAGTTAGACTGCTGACAATAGAGGGAACGTTGGGAGACTTCATGGCCCAAACGAACAAGCGTTTGGAAGAGATCATTGCAGAATTGGAAGACATACCTTATCCAATCTCCTTCAACATAAAGACTCTGAAAGAGGAACTCGACCTGACAAACACTGAAATAAATGAGGTAGAGGTGTTCACTATGCTAGCCATTGAAGCCACCAACAACTTCATATCCACGAAGTTGGCGACCAAGTTGGGCTTGATAGTGACCAAGAGTGACAGTGATCATAAGACTGTCAATGCAGAGGCTCGAGTGACAGAAGGCGCATCCACGACCCGATTGGAGTTTCATTCATGGACGAACCAATGTAAGTTCTCAGTGATACCCCTAGACGACTTCGAGGTAGTACTAGGACTCGAGTCCCTGGTAAAGGAAAAGATCTCCCTAATACTTCATCTCAACGGGGTGATGGTTAACAATAAAACCACCCCCTGCTTTGTTCGAGGTATAACAACAAACCTAGGTTCGAGAAATAATCGTTTTGCCTTTGTCGGGCCAACCAAGTCGTAG

Protein sequence

MRDRVSCLEEHVGVLPEGHSTMVVEMAMTHEVRLLTIEGTLGDFMAQTNKRLEEIIAELEDIPYPISFNIKTLKEELDLTNTEINEVEVFTMLAIEATNNFISTKLATKLGLIVTKSDSDHKTVNAEARVTEGASTTRLEFHSWTNQCKFSVIPLDDFEVVLGLESLVKEKISLILHLNGVMVNNKTTPCFVRGITTNLGSRNNRFAFVGPTKS
BLAST of Cla97C01G010790 vs. NCBI nr
Match: XP_017227982.1 (PREDICTED: uncharacterized protein LOC108203519 [Daucus carota subsp. sativus])

HSP 1 Score: 90.5 bits (223), Expect = 7.4e-15
Identity = 44/112 (39.29%), Postives = 73/112 (65.18%), Query Frame = 0

Query: 84  INEVEVFTMLAIEATNNFISTKLATKLGLIVTKSDSDHKTVNAEARVTEGASTTRLEFHS 143
           IN  +V  M+   ATNNF++ +    LGL +  S S  K VN EA++ +G+S + +   S
Sbjct: 345 INGHDVMAMVDTGATNNFVADRNVEFLGLALKASTSRVKAVNYEAQLIKGSSQSDITVGS 404

Query: 144 WTNQCKFSVIPLDDFEVVLGLESLVKEKISLILHLNGVMVNNKTTPCFVRGI 196
           WT +    V+P+DDF+V+LG++ L+K K S++ H+ G+M+ + + PCFV+G+
Sbjct: 405 WTGKVNLFVVPVDDFDVILGIDFLLKAKASVMPHIGGLMIEDASNPCFVKGV 456

BLAST of Cla97C01G010790 vs. NCBI nr
Match: CAN60274.1 (hypothetical protein VITISV_024841 [Vitis vinifera])

HSP 1 Score: 87.4 bits (215), Expect = 6.3e-14
Identity = 46/109 (42.20%), Postives = 68/109 (62.39%), Query Frame = 0

Query: 89   VFTMLAIEATNNFISTKLATKLGLIVTKSDSDHKTVNAEARVTEG-ASTTRLEFHSWTNQ 148
            V  ++  EAT+N +ST +AT+LGL + K  S  K VN+EA  T+G A    ++   W   
Sbjct: 1386 VVALVDSEATHNLVSTGVATRLGLKLCKDASKLKVVNSEALETQGLAKDVAIQISEWKGT 1445

Query: 149  CKFSVIPLDDFEVVLGLESLVKEKISLILHLNGVMVNNKTTPCFVRGIT 197
                  PLDDF+++LG E  VK K+ ++ HLNG++  N+T PCFVRG++
Sbjct: 1446 VNMLSTPLDDFDLILGNEFFVKAKVMVLPHLNGLLFMNETQPCFVRGLS 1494

BLAST of Cla97C01G010790 vs. NCBI nr
Match: GAV83323.1 (gag-asp_proteas domain-containing protein [Cephalotus follicularis])

HSP 1 Score: 87.0 bits (214), Expect = 8.2e-14
Identity = 45/123 (36.59%), Postives = 73/123 (59.35%), Query Frame = 0

Query: 84  INEVEVFTMLAIEATNNFISTKLATKLGLIVTKSDSDHKTVNAEARVTEG-ASTTRLEFH 143
           +N VE++ M+   A++NF++ ++  K GL + K  S  K VNA+AR  +G A    L+  
Sbjct: 134 LNSVEIYAMIDSGASHNFVNERIVGKFGLKIEKHTSKIKAVNADARPVQGFARDVPLQVG 193

Query: 144 SWTNQCKFSVIPLDDFEVVLGLESLVKEKISLILHLNGVMVNNKTTPCFVRGITTNLGSR 203
           +W  Q    ++PLDDF+V+ G++ LV+ K + + HL G+M  ++  PCFV G T    S 
Sbjct: 194 AWKGQLNLMIVPLDDFDVIFGIDFLVRNKAAPMPHLKGLMFVDENQPCFVSGFTMEDHSF 253

Query: 204 NNR 206
            N+
Sbjct: 254 GNK 256

BLAST of Cla97C01G010790 vs. NCBI nr
Match: GAV79110.1 (gag-asp_proteas domain-containing protein [Cephalotus follicularis])

HSP 1 Score: 85.5 bits (210), Expect = 2.4e-13
Identity = 43/112 (38.39%), Postives = 68/112 (60.71%), Query Frame = 0

Query: 84  INEVEVFTMLAIEATNNFISTKLATKLGLIVTKSDSDHKTVNAEARVTEG-ASTTRLEFH 143
           +N VE+F M+   A++NF++ ++  KLGL V K  S  K V+A+AR  +G A    L+  
Sbjct: 134 LNSVEIFAMIDTGASHNFVNERIVGKLGLKVAKHTSKIKAVDADARPVQGVARDVPLQVG 193

Query: 144 SWTNQCKFSVIPLDDFEVVLGLESLVKEKISLILHLNGVMVNNKTTPCFVRG 195
           +W  Q    ++PLDDF+V+LG++ L + K   + HL G+M   +  PCF+ G
Sbjct: 194 AWKGQLNLMIVPLDDFDVILGIDFLTRNKAVPMPHLKGLMFMGENQPCFISG 245

BLAST of Cla97C01G010790 vs. NCBI nr
Match: GAV86378.1 (gag-asp_proteas domain-containing protein [Cephalotus follicularis])

HSP 1 Score: 84.7 bits (208), Expect = 4.1e-13
Identity = 44/112 (39.29%), Postives = 66/112 (58.93%), Query Frame = 0

Query: 84  INEVEVFTMLAIEATNNFISTKLATKLGLIVTKSDSDHKTVNAEARVTEG-ASTTRLEFH 143
           +N VE+F M+   A++NF+  ++  KLGL V K  S  K VNA+AR  +G A    L+  
Sbjct: 134 LNSVEIFAMIDTGASHNFVYERIVGKLGLKVEKHTSKIKAVNADARPVQGVARDVPLQVG 193

Query: 144 SWTNQCKFSVIPLDDFEVVLGLESLVKEKISLILHLNGVMVNNKTTPCFVRG 195
           +W  Q    ++PLDDF+V+ G++ L + K   + HL G+M   +  PCFV G
Sbjct: 194 AWKGQLNLMIVPLDDFDVIFGIDFLTRNKAIPMPHLKGLMFMGENQPCFVSG 245

BLAST of Cla97C01G010790 vs. TrEMBL
Match: tr|A5AIP6|A5AIP6_VITVI (Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VITISV_024841 PE=4 SV=1)

HSP 1 Score: 87.4 bits (215), Expect = 4.2e-14
Identity = 46/109 (42.20%), Postives = 68/109 (62.39%), Query Frame = 0

Query: 89   VFTMLAIEATNNFISTKLATKLGLIVTKSDSDHKTVNAEARVTEG-ASTTRLEFHSWTNQ 148
            V  ++  EAT+N +ST +AT+LGL + K  S  K VN+EA  T+G A    ++   W   
Sbjct: 1386 VVALVDSEATHNLVSTGVATRLGLKLCKDASKLKVVNSEALETQGLAKDVAIQISEWKGT 1445

Query: 149  CKFSVIPLDDFEVVLGLESLVKEKISLILHLNGVMVNNKTTPCFVRGIT 197
                  PLDDF+++LG E  VK K+ ++ HLNG++  N+T PCFVRG++
Sbjct: 1446 VNMLSTPLDDFDLILGNEFFVKAKVMVLPHLNGLLFMNETQPCFVRGLS 1494

BLAST of Cla97C01G010790 vs. TrEMBL
Match: tr|A0A1Q3CTC7|A0A1Q3CTC7_CEPFO (Gag-asp_proteas domain-containing protein OS=Cephalotus follicularis OX=3775 GN=CFOL_v3_26771 PE=4 SV=1)

HSP 1 Score: 87.0 bits (214), Expect = 5.4e-14
Identity = 45/123 (36.59%), Postives = 73/123 (59.35%), Query Frame = 0

Query: 84  INEVEVFTMLAIEATNNFISTKLATKLGLIVTKSDSDHKTVNAEARVTEG-ASTTRLEFH 143
           +N VE++ M+   A++NF++ ++  K GL + K  S  K VNA+AR  +G A    L+  
Sbjct: 134 LNSVEIYAMIDSGASHNFVNERIVGKFGLKIEKHTSKIKAVNADARPVQGFARDVPLQVG 193

Query: 144 SWTNQCKFSVIPLDDFEVVLGLESLVKEKISLILHLNGVMVNNKTTPCFVRGITTNLGSR 203
           +W  Q    ++PLDDF+V+ G++ LV+ K + + HL G+M  ++  PCFV G T    S 
Sbjct: 194 AWKGQLNLMIVPLDDFDVIFGIDFLVRNKAAPMPHLKGLMFVDENQPCFVSGFTMEDHSF 253

Query: 204 NNR 206
            N+
Sbjct: 254 GNK 256

BLAST of Cla97C01G010790 vs. TrEMBL
Match: tr|A0A1Q3CFT3|A0A1Q3CFT3_CEPFO (Gag-asp_proteas domain-containing protein OS=Cephalotus follicularis OX=3775 GN=CFOL_v3_22575 PE=4 SV=1)

HSP 1 Score: 85.5 bits (210), Expect = 1.6e-13
Identity = 43/112 (38.39%), Postives = 68/112 (60.71%), Query Frame = 0

Query: 84  INEVEVFTMLAIEATNNFISTKLATKLGLIVTKSDSDHKTVNAEARVTEG-ASTTRLEFH 143
           +N VE+F M+   A++NF++ ++  KLGL V K  S  K V+A+AR  +G A    L+  
Sbjct: 134 LNSVEIFAMIDTGASHNFVNERIVGKLGLKVAKHTSKIKAVDADARPVQGVARDVPLQVG 193

Query: 144 SWTNQCKFSVIPLDDFEVVLGLESLVKEKISLILHLNGVMVNNKTTPCFVRG 195
           +W  Q    ++PLDDF+V+LG++ L + K   + HL G+M   +  PCF+ G
Sbjct: 194 AWKGQLNLMIVPLDDFDVILGIDFLTRNKAVPMPHLKGLMFMGENQPCFISG 245

BLAST of Cla97C01G010790 vs. TrEMBL
Match: tr|A0A1Q3D256|A0A1Q3D256_CEPFO (Gag-asp_proteas domain-containing protein OS=Cephalotus follicularis OX=3775 GN=CFOL_v3_29809 PE=4 SV=1)

HSP 1 Score: 84.7 bits (208), Expect = 2.7e-13
Identity = 44/112 (39.29%), Postives = 66/112 (58.93%), Query Frame = 0

Query: 84  INEVEVFTMLAIEATNNFISTKLATKLGLIVTKSDSDHKTVNAEARVTEG-ASTTRLEFH 143
           +N VE+F M+   A++NF+  ++  KLGL V K  S  K VNA+AR  +G A    L+  
Sbjct: 134 LNSVEIFAMIDTGASHNFVYERIVGKLGLKVEKHTSKIKAVNADARPVQGVARDVPLQVG 193

Query: 144 SWTNQCKFSVIPLDDFEVVLGLESLVKEKISLILHLNGVMVNNKTTPCFVRG 195
           +W  Q    ++PLDDF+V+ G++ L + K   + HL G+M   +  PCFV G
Sbjct: 194 AWKGQLNLMIVPLDDFDVIFGIDFLTRNKAIPMPHLKGLMFMGENQPCFVSG 245

BLAST of Cla97C01G010790 vs. TrEMBL
Match: tr|A0A1Q3CEG3|A0A1Q3CEG3_CEPFO (Gag-asp_proteas domain-containing protein OS=Cephalotus follicularis OX=3775 GN=CFOL_v3_22084 PE=4 SV=1)

HSP 1 Score: 84.3 bits (207), Expect = 3.5e-13
Identity = 42/112 (37.50%), Postives = 69/112 (61.61%), Query Frame = 0

Query: 84  INEVEVFTMLAIEATNNFISTKLATKLGLIVTKSDSDHKTVNAEARVTEG-ASTTRLEFH 143
           +N VE++ M+   A++NF++ ++  KLGL + K  +  K VNA+AR  +G A    L+  
Sbjct: 157 LNSVEIYAMIDSGASHNFVNERIVGKLGLKIEKHTTKIKAVNADARPVQGVARDVPLQVG 216

Query: 144 SWTNQCKFSVIPLDDFEVVLGLESLVKEKISLILHLNGVMVNNKTTPCFVRG 195
           +W  Q    ++PLDDF+V+ G++ LV+ K   + HL G+M  ++  PCFV G
Sbjct: 217 AWKGQLNLMIVPLDDFDVIFGIDFLVRNKAVPMPHLKGLMFVDENQPCFVSG 268

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_017227982.17.4e-1539.29PREDICTED: uncharacterized protein LOC108203519 [Daucus carota subsp. sativus][more]
CAN60274.16.3e-1442.20hypothetical protein VITISV_024841 [Vitis vinifera][more]
GAV83323.18.2e-1436.59gag-asp_proteas domain-containing protein [Cephalotus follicularis][more]
GAV79110.12.4e-1338.39gag-asp_proteas domain-containing protein [Cephalotus follicularis][more]
GAV86378.14.1e-1339.29gag-asp_proteas domain-containing protein [Cephalotus follicularis][more]
Match NameE-valueIdentityDescription
tr|A5AIP6|A5AIP6_VITVI4.2e-1442.20Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VITISV_024841 PE=4 SV=1[more]
tr|A0A1Q3CTC7|A0A1Q3CTC7_CEPFO5.4e-1436.59Gag-asp_proteas domain-containing protein OS=Cephalotus follicularis OX=3775 GN=... [more]
tr|A0A1Q3CFT3|A0A1Q3CFT3_CEPFO1.6e-1338.39Gag-asp_proteas domain-containing protein OS=Cephalotus follicularis OX=3775 GN=... [more]
tr|A0A1Q3D256|A0A1Q3D256_CEPFO2.7e-1339.29Gag-asp_proteas domain-containing protein OS=Cephalotus follicularis OX=3775 GN=... [more]
tr|A0A1Q3CEG3|A0A1Q3CEG3_CEPFO3.5e-1337.50Gag-asp_proteas domain-containing protein OS=Cephalotus follicularis OX=3775 GN=... [more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR021109Peptidase_aspartic_dom_sf

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C01G010790.1Cla97C01G010790.1mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR021109Aspartic peptidase domain superfamilyGENE3DG3DSA:2.40.70.10coord: 79..190
e-value: 8.3E-7
score: 30.9
NoneNo IPR availableCDDcd00303retropepsin_likecoord: 80..168
e-value: 2.20134E-8
score: 48.1016

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
Cla97C01G010790Watermelon (97103) v2wmbwmbB001
Cla97C01G010790Silver-seed gourdcarwmbB0050
Cla97C01G010790Silver-seed gourdcarwmbB0425
Cla97C01G010790Silver-seed gourdcarwmbB0778
Cla97C01G010790Cucumber (Gy14) v2cgybwmbB008
Cla97C01G010790Cucumber (Gy14) v2cgybwmbB009
Cla97C01G010790Cucumber (Gy14) v1cgywmbB325
Cla97C01G010790Cucumber (Gy14) v1cgywmbB557
Cla97C01G010790Cucurbita maxima (Rimu)cmawmbB394
Cla97C01G010790Cucurbita maxima (Rimu)cmawmbB669
Cla97C01G010790Cucurbita moschata (Rifu)cmowmbB042
Cla97C01G010790Cucurbita moschata (Rifu)cmowmbB380
Cla97C01G010790Cucurbita moschata (Rifu)cmowmbB641
Cla97C01G010790Wild cucumber (PI 183967)cpiwmbB008
Cla97C01G010790Wild cucumber (PI 183967)cpiwmbB009
Cla97C01G010790Wild cucumber (PI 183967)cpiwmbB437
Cla97C01G010790Cucumber (Chinese Long) v3cucwmbB007
Cla97C01G010790Cucumber (Chinese Long) v3cucwmbB008
Cla97C01G010790Cucumber (Chinese Long) v3cucwmbB433
Cla97C01G010790Cucumber (Chinese Long) v2cuwmbB008
Cla97C01G010790Cucumber (Chinese Long) v2cuwmbB009
Cla97C01G010790Cucumber (Chinese Long) v2cuwmbB415
Cla97C01G010790Bottle gourd (USVL1VR-Ls)lsiwmbB113
Cla97C01G010790Bottle gourd (USVL1VR-Ls)lsiwmbB153
Cla97C01G010790Bottle gourd (USVL1VR-Ls)lsiwmbB159
Cla97C01G010790Melon (DHL92) v3.6.1medwmbB126
Cla97C01G010790Melon (DHL92) v3.6.1medwmbB134
Cla97C01G010790Melon (DHL92) v3.6.1medwmbB509
Cla97C01G010790Melon (DHL92) v3.5.1mewmbB138
Cla97C01G010790Melon (DHL92) v3.5.1mewmbB145
Cla97C01G010790Melon (DHL92) v3.5.1mewmbB517
Cla97C01G010790Watermelon (Charleston Gray)wcgwmbB089
Cla97C01G010790Watermelon (Charleston Gray)wcgwmbB092
Cla97C01G010790Watermelon (97103) v1wmwmbB246
Cla97C01G010790Watermelon (97103) v1wmwmbB252
Cla97C01G010790Wax gourdwgowmbB460