Cla97C01G010790.1 (mRNA) Watermelon (97103) v2

NameCla97C01G010790.1
TypemRNA
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionGag-asp_proteas domain-containing protein
LocationCla97Chr01 : 16802634 .. 16804139 (+)
Sequence length645
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAGAGATCGAGTTAGTTGTTTGGAGGAACATGTCGGTGTACTTCCTGAGGGACATTCAACCATGGTGGTTGAGATGGCGATGACGCACGAAGTTAGACTGCTGACAATAGAGGGAACGTTGGGAGACTTCATGGCCCAAACGAACAAGCGTTTGGAAGAGATCATTGCAGAATTGGAAGACATACCTTATCCAATCTCCTTCAACATAAAGACTCTGAAAGAGGAACTCGACCTGACAAACACTGAAGTAGCCTTGGTGAAGAAAGCCTATGTGGACTCCTCACCCAAAGCTGACAATAATTCAAAGGTAAAGATTCCTGAACCATTAGTTTTTAGTGGTTTGAGAGATGCCAAGGAACTTGAAAACTTCCTTTGGGACATTGAAAAATACTACAAGATTGCCAAGATCATTGAAGAGGCACAAGTTGGCATTGCCAGCATGTGCCTGACCTCGAACGCCAATTTGTGGTGGGGTACTTAAGTACTCGATGATGATGGAAATCCATGAAGAAGGAACTTAGAGACTAGTTCCTTCTTAGCAATACAAGCTAGGTTGCTAGGGAGGCCCTGGAGAAGCTCAAGCATACAGGCTCCTTACGAGCCTATGTCAAGGATTACAACTCCCTGATCCTAGACATCTAGCACATGTCTGAGGAAAAACTATTCAATTTCATTTTGAGGGTGTAACCCTTGGCCCAGACCAAGCTAAGGAGACAAACAGTCAAGGATCTACCTTCCGACATTGTTGCTGCAGATGCCCTATTGGATTTTAAGGCCACGAACTCTTCCACTTCTCTTGGGAAGGAGAAGAAGGATTGGAAAAAGTCCAACAACCATAAGGACGGGTCCAAAGGTGAAATGGACTGTAGTGACAAGGACAAAGGTAAATCTGTTACCCCCTACTCCTTCAACGAGGACCTACAATTACTTTCTATTCAAGGAACCACATGGAGCACGAGATTGCCCAAGAAAAGAAAAATTAAATGTCATGCTTGAGGAGAGAGAGGCCAAAAAAGACACCTACCAGGTCAACAACCTCCAGTTGTTGAGTGCTTTCTCTTTTATCCCACAATCATGTCATTATAATTTGATGCATATGGTCGTCAAGATAAATGAGGTAGAGGTGTTCACTATGCTAGCCATTGAAGCCACCAACAACTTCATATCCACGAAGTTGGCGACCAAGTTGGGCTTGATAGTGACCAAGAGTGACAGTGATCATAAGACTGTCAATGCAGAGGCTCGAGTGACAGAAGGCGCATCCACGACCCGATTGGAGTTTCATTCATGGACGAACCAATGTAAGTTCTCAGTGATACCCCTAGACGACTTCGAGGTAGTACTAGGACTCGAGTCCCTGGTAAAGGAAAAGATCTCCCTAATACTTCATCTCAACGGGGTGATGGTTAACAATAAAACCACCCCCTGCTTTGTTCGAGGTATAACAACAAACCTAGGTTCGAGAAATAATCGTTTTGCCTTTGTCGGGCCAACCAAGTCGTAG

mRNA sequence

ATGAGAGATCGAGTTAGTTGTTTGGAGGAACATGTCGGTGTACTTCCTGAGGGACATTCAACCATGGTGGTTGAGATGGCGATGACGCACGAAGTTAGACTGCTGACAATAGAGGGAACGTTGGGAGACTTCATGGCCCAAACGAACAAGCGTTTGGAAGAGATCATTGCAGAATTGGAAGACATACCTTATCCAATCTCCTTCAACATAAAGACTCTGAAAGAGGAACTCGACCTGACAAACACTGAAATAAATGAGGTAGAGGTGTTCACTATGCTAGCCATTGAAGCCACCAACAACTTCATATCCACGAAGTTGGCGACCAAGTTGGGCTTGATAGTGACCAAGAGTGACAGTGATCATAAGACTGTCAATGCAGAGGCTCGAGTGACAGAAGGCGCATCCACGACCCGATTGGAGTTTCATTCATGGACGAACCAATGTAAGTTCTCAGTGATACCCCTAGACGACTTCGAGGTAGTACTAGGACTCGAGTCCCTGGTAAAGGAAAAGATCTCCCTAATACTTCATCTCAACGGGGTGATGGTTAACAATAAAACCACCCCCTGCTTTGTTCGAGGTATAACAACAAACCTAGGTTCGAGAAATAATCGTTTTGCCTTTGTCGGGCCAACCAAGTCGTAG

Coding sequence (CDS)

ATGAGAGATCGAGTTAGTTGTTTGGAGGAACATGTCGGTGTACTTCCTGAGGGACATTCAACCATGGTGGTTGAGATGGCGATGACGCACGAAGTTAGACTGCTGACAATAGAGGGAACGTTGGGAGACTTCATGGCCCAAACGAACAAGCGTTTGGAAGAGATCATTGCAGAATTGGAAGACATACCTTATCCAATCTCCTTCAACATAAAGACTCTGAAAGAGGAACTCGACCTGACAAACACTGAAATAAATGAGGTAGAGGTGTTCACTATGCTAGCCATTGAAGCCACCAACAACTTCATATCCACGAAGTTGGCGACCAAGTTGGGCTTGATAGTGACCAAGAGTGACAGTGATCATAAGACTGTCAATGCAGAGGCTCGAGTGACAGAAGGCGCATCCACGACCCGATTGGAGTTTCATTCATGGACGAACCAATGTAAGTTCTCAGTGATACCCCTAGACGACTTCGAGGTAGTACTAGGACTCGAGTCCCTGGTAAAGGAAAAGATCTCCCTAATACTTCATCTCAACGGGGTGATGGTTAACAATAAAACCACCCCCTGCTTTGTTCGAGGTATAACAACAAACCTAGGTTCGAGAAATAATCGTTTTGCCTTTGTCGGGCCAACCAAGTCGTAG

Protein sequence

MRDRVSCLEEHVGVLPEGHSTMVVEMAMTHEVRLLTIEGTLGDFMAQTNKRLEEIIAELEDIPYPISFNIKTLKEELDLTNTEINEVEVFTMLAIEATNNFISTKLATKLGLIVTKSDSDHKTVNAEARVTEGASTTRLEFHSWTNQCKFSVIPLDDFEVVLGLESLVKEKISLILHLNGVMVNNKTTPCFVRGITTNLGSRNNRFAFVGPTKS
BLAST of Cla97C01G010790.1 vs. NCBI nr
Match: XP_017227982.1 (PREDICTED: uncharacterized protein LOC108203519 [Daucus carota subsp. sativus])

HSP 1 Score: 90.5 bits (223), Expect = 7.4e-15
Identity = 44/112 (39.29%), Postives = 73/112 (65.18%), Query Frame = 0

Query: 84  INEVEVFTMLAIEATNNFISTKLATKLGLIVTKSDSDHKTVNAEARVTEGASTTRLEFHS 143
           IN  +V  M+   ATNNF++ +    LGL +  S S  K VN EA++ +G+S + +   S
Sbjct: 345 INGHDVMAMVDTGATNNFVADRNVEFLGLALKASTSRVKAVNYEAQLIKGSSQSDITVGS 404

Query: 144 WTNQCKFSVIPLDDFEVVLGLESLVKEKISLILHLNGVMVNNKTTPCFVRGI 196
           WT +    V+P+DDF+V+LG++ L+K K S++ H+ G+M+ + + PCFV+G+
Sbjct: 405 WTGKVNLFVVPVDDFDVILGIDFLLKAKASVMPHIGGLMIEDASNPCFVKGV 456

BLAST of Cla97C01G010790.1 vs. NCBI nr
Match: CAN60274.1 (hypothetical protein VITISV_024841 [Vitis vinifera])

HSP 1 Score: 87.4 bits (215), Expect = 6.3e-14
Identity = 46/109 (42.20%), Postives = 68/109 (62.39%), Query Frame = 0

Query: 89   VFTMLAIEATNNFISTKLATKLGLIVTKSDSDHKTVNAEARVTEG-ASTTRLEFHSWTNQ 148
            V  ++  EAT+N +ST +AT+LGL + K  S  K VN+EA  T+G A    ++   W   
Sbjct: 1386 VVALVDSEATHNLVSTGVATRLGLKLCKDASKLKVVNSEALETQGLAKDVAIQISEWKGT 1445

Query: 149  CKFSVIPLDDFEVVLGLESLVKEKISLILHLNGVMVNNKTTPCFVRGIT 197
                  PLDDF+++LG E  VK K+ ++ HLNG++  N+T PCFVRG++
Sbjct: 1446 VNMLSTPLDDFDLILGNEFFVKAKVMVLPHLNGLLFMNETQPCFVRGLS 1494

BLAST of Cla97C01G010790.1 vs. NCBI nr
Match: GAV83323.1 (gag-asp_proteas domain-containing protein [Cephalotus follicularis])

HSP 1 Score: 87.0 bits (214), Expect = 8.2e-14
Identity = 45/123 (36.59%), Postives = 73/123 (59.35%), Query Frame = 0

Query: 84  INEVEVFTMLAIEATNNFISTKLATKLGLIVTKSDSDHKTVNAEARVTEG-ASTTRLEFH 143
           +N VE++ M+   A++NF++ ++  K GL + K  S  K VNA+AR  +G A    L+  
Sbjct: 134 LNSVEIYAMIDSGASHNFVNERIVGKFGLKIEKHTSKIKAVNADARPVQGFARDVPLQVG 193

Query: 144 SWTNQCKFSVIPLDDFEVVLGLESLVKEKISLILHLNGVMVNNKTTPCFVRGITTNLGSR 203
           +W  Q    ++PLDDF+V+ G++ LV+ K + + HL G+M  ++  PCFV G T    S 
Sbjct: 194 AWKGQLNLMIVPLDDFDVIFGIDFLVRNKAAPMPHLKGLMFVDENQPCFVSGFTMEDHSF 253

Query: 204 NNR 206
            N+
Sbjct: 254 GNK 256

BLAST of Cla97C01G010790.1 vs. NCBI nr
Match: GAV79110.1 (gag-asp_proteas domain-containing protein [Cephalotus follicularis])

HSP 1 Score: 85.5 bits (210), Expect = 2.4e-13
Identity = 43/112 (38.39%), Postives = 68/112 (60.71%), Query Frame = 0

Query: 84  INEVEVFTMLAIEATNNFISTKLATKLGLIVTKSDSDHKTVNAEARVTEG-ASTTRLEFH 143
           +N VE+F M+   A++NF++ ++  KLGL V K  S  K V+A+AR  +G A    L+  
Sbjct: 134 LNSVEIFAMIDTGASHNFVNERIVGKLGLKVAKHTSKIKAVDADARPVQGVARDVPLQVG 193

Query: 144 SWTNQCKFSVIPLDDFEVVLGLESLVKEKISLILHLNGVMVNNKTTPCFVRG 195
           +W  Q    ++PLDDF+V+LG++ L + K   + HL G+M   +  PCF+ G
Sbjct: 194 AWKGQLNLMIVPLDDFDVILGIDFLTRNKAVPMPHLKGLMFMGENQPCFISG 245

BLAST of Cla97C01G010790.1 vs. NCBI nr
Match: GAV86378.1 (gag-asp_proteas domain-containing protein [Cephalotus follicularis])

HSP 1 Score: 84.7 bits (208), Expect = 4.1e-13
Identity = 44/112 (39.29%), Postives = 66/112 (58.93%), Query Frame = 0

Query: 84  INEVEVFTMLAIEATNNFISTKLATKLGLIVTKSDSDHKTVNAEARVTEG-ASTTRLEFH 143
           +N VE+F M+   A++NF+  ++  KLGL V K  S  K VNA+AR  +G A    L+  
Sbjct: 134 LNSVEIFAMIDTGASHNFVYERIVGKLGLKVEKHTSKIKAVNADARPVQGVARDVPLQVG 193

Query: 144 SWTNQCKFSVIPLDDFEVVLGLESLVKEKISLILHLNGVMVNNKTTPCFVRG 195
           +W  Q    ++PLDDF+V+ G++ L + K   + HL G+M   +  PCFV G
Sbjct: 194 AWKGQLNLMIVPLDDFDVIFGIDFLTRNKAIPMPHLKGLMFMGENQPCFVSG 245

BLAST of Cla97C01G010790.1 vs. TrEMBL
Match: tr|A5AIP6|A5AIP6_VITVI (Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VITISV_024841 PE=4 SV=1)

HSP 1 Score: 87.4 bits (215), Expect = 4.2e-14
Identity = 46/109 (42.20%), Postives = 68/109 (62.39%), Query Frame = 0

Query: 89   VFTMLAIEATNNFISTKLATKLGLIVTKSDSDHKTVNAEARVTEG-ASTTRLEFHSWTNQ 148
            V  ++  EAT+N +ST +AT+LGL + K  S  K VN+EA  T+G A    ++   W   
Sbjct: 1386 VVALVDSEATHNLVSTGVATRLGLKLCKDASKLKVVNSEALETQGLAKDVAIQISEWKGT 1445

Query: 149  CKFSVIPLDDFEVVLGLESLVKEKISLILHLNGVMVNNKTTPCFVRGIT 197
                  PLDDF+++LG E  VK K+ ++ HLNG++  N+T PCFVRG++
Sbjct: 1446 VNMLSTPLDDFDLILGNEFFVKAKVMVLPHLNGLLFMNETQPCFVRGLS 1494

BLAST of Cla97C01G010790.1 vs. TrEMBL
Match: tr|A0A1Q3CTC7|A0A1Q3CTC7_CEPFO (Gag-asp_proteas domain-containing protein OS=Cephalotus follicularis OX=3775 GN=CFOL_v3_26771 PE=4 SV=1)

HSP 1 Score: 87.0 bits (214), Expect = 5.4e-14
Identity = 45/123 (36.59%), Postives = 73/123 (59.35%), Query Frame = 0

Query: 84  INEVEVFTMLAIEATNNFISTKLATKLGLIVTKSDSDHKTVNAEARVTEG-ASTTRLEFH 143
           +N VE++ M+   A++NF++ ++  K GL + K  S  K VNA+AR  +G A    L+  
Sbjct: 134 LNSVEIYAMIDSGASHNFVNERIVGKFGLKIEKHTSKIKAVNADARPVQGFARDVPLQVG 193

Query: 144 SWTNQCKFSVIPLDDFEVVLGLESLVKEKISLILHLNGVMVNNKTTPCFVRGITTNLGSR 203
           +W  Q    ++PLDDF+V+ G++ LV+ K + + HL G+M  ++  PCFV G T    S 
Sbjct: 194 AWKGQLNLMIVPLDDFDVIFGIDFLVRNKAAPMPHLKGLMFVDENQPCFVSGFTMEDHSF 253

Query: 204 NNR 206
            N+
Sbjct: 254 GNK 256

BLAST of Cla97C01G010790.1 vs. TrEMBL
Match: tr|A0A1Q3CFT3|A0A1Q3CFT3_CEPFO (Gag-asp_proteas domain-containing protein OS=Cephalotus follicularis OX=3775 GN=CFOL_v3_22575 PE=4 SV=1)

HSP 1 Score: 85.5 bits (210), Expect = 1.6e-13
Identity = 43/112 (38.39%), Postives = 68/112 (60.71%), Query Frame = 0

Query: 84  INEVEVFTMLAIEATNNFISTKLATKLGLIVTKSDSDHKTVNAEARVTEG-ASTTRLEFH 143
           +N VE+F M+   A++NF++ ++  KLGL V K  S  K V+A+AR  +G A    L+  
Sbjct: 134 LNSVEIFAMIDTGASHNFVNERIVGKLGLKVAKHTSKIKAVDADARPVQGVARDVPLQVG 193

Query: 144 SWTNQCKFSVIPLDDFEVVLGLESLVKEKISLILHLNGVMVNNKTTPCFVRG 195
           +W  Q    ++PLDDF+V+LG++ L + K   + HL G+M   +  PCF+ G
Sbjct: 194 AWKGQLNLMIVPLDDFDVILGIDFLTRNKAVPMPHLKGLMFMGENQPCFISG 245

BLAST of Cla97C01G010790.1 vs. TrEMBL
Match: tr|A0A1Q3D256|A0A1Q3D256_CEPFO (Gag-asp_proteas domain-containing protein OS=Cephalotus follicularis OX=3775 GN=CFOL_v3_29809 PE=4 SV=1)

HSP 1 Score: 84.7 bits (208), Expect = 2.7e-13
Identity = 44/112 (39.29%), Postives = 66/112 (58.93%), Query Frame = 0

Query: 84  INEVEVFTMLAIEATNNFISTKLATKLGLIVTKSDSDHKTVNAEARVTEG-ASTTRLEFH 143
           +N VE+F M+   A++NF+  ++  KLGL V K  S  K VNA+AR  +G A    L+  
Sbjct: 134 LNSVEIFAMIDTGASHNFVYERIVGKLGLKVEKHTSKIKAVNADARPVQGVARDVPLQVG 193

Query: 144 SWTNQCKFSVIPLDDFEVVLGLESLVKEKISLILHLNGVMVNNKTTPCFVRG 195
           +W  Q    ++PLDDF+V+ G++ L + K   + HL G+M   +  PCFV G
Sbjct: 194 AWKGQLNLMIVPLDDFDVIFGIDFLTRNKAIPMPHLKGLMFMGENQPCFVSG 245

BLAST of Cla97C01G010790.1 vs. TrEMBL
Match: tr|A0A1Q3CEG3|A0A1Q3CEG3_CEPFO (Gag-asp_proteas domain-containing protein OS=Cephalotus follicularis OX=3775 GN=CFOL_v3_22084 PE=4 SV=1)

HSP 1 Score: 84.3 bits (207), Expect = 3.5e-13
Identity = 42/112 (37.50%), Postives = 69/112 (61.61%), Query Frame = 0

Query: 84  INEVEVFTMLAIEATNNFISTKLATKLGLIVTKSDSDHKTVNAEARVTEG-ASTTRLEFH 143
           +N VE++ M+   A++NF++ ++  KLGL + K  +  K VNA+AR  +G A    L+  
Sbjct: 157 LNSVEIYAMIDSGASHNFVNERIVGKLGLKIEKHTTKIKAVNADARPVQGVARDVPLQVG 216

Query: 144 SWTNQCKFSVIPLDDFEVVLGLESLVKEKISLILHLNGVMVNNKTTPCFVRG 195
           +W  Q    ++PLDDF+V+ G++ LV+ K   + HL G+M  ++  PCFV G
Sbjct: 217 AWKGQLNLMIVPLDDFDVIFGIDFLVRNKAVPMPHLKGLMFVDENQPCFVSG 268

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_017227982.17.4e-1539.29PREDICTED: uncharacterized protein LOC108203519 [Daucus carota subsp. sativus][more]
CAN60274.16.3e-1442.20hypothetical protein VITISV_024841 [Vitis vinifera][more]
GAV83323.18.2e-1436.59gag-asp_proteas domain-containing protein [Cephalotus follicularis][more]
GAV79110.12.4e-1338.39gag-asp_proteas domain-containing protein [Cephalotus follicularis][more]
GAV86378.14.1e-1339.29gag-asp_proteas domain-containing protein [Cephalotus follicularis][more]
Match NameE-valueIdentityDescription
tr|A5AIP6|A5AIP6_VITVI4.2e-1442.20Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VITISV_024841 PE=4 SV=1[more]
tr|A0A1Q3CTC7|A0A1Q3CTC7_CEPFO5.4e-1436.59Gag-asp_proteas domain-containing protein OS=Cephalotus follicularis OX=3775 GN=... [more]
tr|A0A1Q3CFT3|A0A1Q3CFT3_CEPFO1.6e-1338.39Gag-asp_proteas domain-containing protein OS=Cephalotus follicularis OX=3775 GN=... [more]
tr|A0A1Q3D256|A0A1Q3D256_CEPFO2.7e-1339.29Gag-asp_proteas domain-containing protein OS=Cephalotus follicularis OX=3775 GN=... [more]
tr|A0A1Q3CEG3|A0A1Q3CEG3_CEPFO3.5e-1337.50Gag-asp_proteas domain-containing protein OS=Cephalotus follicularis OX=3775 GN=... [more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
IPR021109Peptidase_aspartic_dom_sf

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Cla97C01G010790Cla97C01G010790gene


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cla97C01G010790.1.exon.1Cla97C01G010790.1.exon.1exon
Cla97C01G010790.1.exon.2Cla97C01G010790.1.exon.2exon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cla97C01G010790.1.CDS.1Cla97C01G010790.1.CDS.1CDS
Cla97C01G010790.1.CDS.2Cla97C01G010790.1.CDS.2CDS


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Cla97C01G010790.1Cla97C01G010790.1-proteinpolypeptide


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR021109Aspartic peptidase domain superfamilyGENE3DG3DSA:2.40.70.10coord: 79..190
e-value: 8.3E-7
score: 30.9
NoneNo IPR availableCDDcd00303retropepsin_likecoord: 80..168
e-value: 2.20134E-8
score: 48.1016