Cla97C03G053395 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C03G053395
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: Cla97Chr03: 2610702 .. 2611169 (-)
RNA-Seq Expression: Cla97C03G053395
Synteny: Cla97C03G053395
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGTTTGATGATCATTAAGCGCTCAATTCCAAGGATATATCAGGACTTGATTGTTGAGAGCACGAACGTCAAAGTTTTCTTGAAAGAACAAGAGCAATCCTTTGTTAAAAATGAACATGAAGAGGCAAGAGATCTTTTGACTAAGCTCATCACTATGAGGTACAGTGGGAAGGGCAACGTAAGGGAATACATAATGGAGATGTTGAGTCTCGCTTCGAGACTTAGAGCTCTAAAGTTGGAGATTTGTGAAGACCTTCTTGTGCACTTTGTTTTGATGTCTCTTCCTACACAATATACTCAACTTAAAGTGTGTTACAACACTCAAGTGCACAAATGGACTCTCAATGAACTGATCTCATTTTGTGTGGCTGAAGAGGAAAGGATGCAGCAAGAAGAAAGTGAAAGTGATCATTTGGAATCTACCTCTAGAGGCAAGAAGAAAAAGAGGGAGCAAGATGATATATAG

mRNA sequence

ATGAGTTTGATGATCATTAAGCGCTCAATTCCAAGGATATATCAGGACTTGATTGTTGAGAGCACGAACGTCAAAGTTTTCTTGAAAGAACAAGAGCAATCCTTTGTTAAAAATGAACATGAAGAGGCAAGAGATCTTTTGACTAAGCTCATCACTATGAGGTACAGTGGGAAGGGCAACGTAAGGGAATACATAATGGAGATGTTGAGTCTCGCTTCGAGACTTAGAGCTCTAAAGTTGGAGATTTGTGAAGACCTTCTTGTGCACTTTGTTTTGATGTCTCTTCCTACACAATATACTCAACTTAAAGTGTGTTACAACACTCAAGTGCACAAATGGACTCTCAATGAACTGATCTCATTTTGTGTGGCTGAAGAGGAAAGGATGCAGCAAGAAGAAAGTGAAAGTGATCATTTGGAATCTACCTCTAGAGGCAAGAAGAAAAAGAGGGAGCAAGATGATATATAG

Coding sequence (CDS)

ATGAGTTTGATGATCATTAAGCGCTCAATTCCAAGGATATATCAGGACTTGATTGTTGAGAGCACGAACGTCAAAGTTTTCTTGAAAGAACAAGAGCAATCCTTTGTTAAAAATGAACATGAAGAGGCAAGAGATCTTTTGACTAAGCTCATCACTATGAGGTACAGTGGGAAGGGCAACGTAAGGGAATACATAATGGAGATGTTGAGTCTCGCTTCGAGACTTAGAGCTCTAAAGTTGGAGATTTGTGAAGACCTTCTTGTGCACTTTGTTTTGATGTCTCTTCCTACACAATATACTCAACTTAAAGTGTGTTACAACACTCAAGTGCACAAATGGACTCTCAATGAACTGATCTCATTTTGTGTGGCTGAAGAGGAAAGGATGCAGCAAGAAGAAAGTGAAAGTGATCATTTGGAATCTACCTCTAGAGGCAAGAAGAAAAAGAGGGAGCAAGATGATATATAG

Protein sequence

MSLMIIKRSIPRIYQDLIVESTNVKVFLKEQEQSFVKNEHEEARDLLTKLITMRYSGKGNVREYIMEMLSLASRLRALKLEICEDLLVHFVLMSLPTQYTQLKVCYNTQVHKWTLNELISFCVAEEERMQQEESESDHLESTSRGKKKKREQDDI
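
The location and the sequences above can be cross-checked against each other: the interval 2610702 .. 2611169 spans 468 bp, which is 156 codons, or 155 residues plus a stop codon. The following is a minimal sketch of that check, assuming the standard nuclear genetic code and assuming the sequences are reported on the coding strand (they begin with ATG even though the gene lies on the minus strand). The names `span_length` and `translate` are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive genomic interval."""
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a coding-strand CDS, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


length = span_length(2610702, 2611169)
print(length, length // 3)  # 468 156

# First 12 codons of the CDS listed above.
print(translate("ATGAGTTTGATGATCATTAAGCGCTCAATTCCAAGG"))  # MSLMIIKRSIPR
```

Passing the full CDS string to `translate` and comparing the result with the protein sequence is the natural extension of this check.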
Homology
BLAST of Cla97C03G053395 vs. NCBI nr
Match: XP_038895752.1 (uncharacterized protein LOC120083916 [Benincasa hispida])

HSP 1 Score: 260.0 bits (663), Expect = 1.3e-65
Identity = 135/155 (87.10%), Positives = 147/155 (94.84%), Query Frame = 0

Query: 1   MSLMIIKRSIPRIYQDLIVESTNVKVFLKEQEQSFVKNEHEEARDLLTKLITMRYSGKGN 60
           MSLMIIKRSIPR+YQ+LIVES N KVFLKE EQSF KNE+EEARDLLTKL+ MRY+GKGN
Sbjct: 66  MSLMIIKRSIPRVYQNLIVESMNAKVFLKELEQSFAKNENEEARDLLTKLVVMRYTGKGN 125

Query: 61  VREYIMEMLSLASRLRALKLEICEDLLVHFVLMSLPTQYTQLKVCYNTQVHKWTLNELIS 120
           +REYI+EM SLA++L+ALKLEICEDLLVHFVLMSLPTQYTQLKVCYNTQ HKWTLNELIS
Sbjct: 126 IREYIIEMSSLATKLKALKLEICEDLLVHFVLMSLPTQYTQLKVCYNTQKHKWTLNELIS 185

Query: 121 FCVAEEERMQQEESESDHLESTSRGKKKKREQDDI 156
           FCVAEEERMQQEESESD+LESTS+GKKKKRE DDI
Sbjct: 186 FCVAEEERMQQEESESDNLESTSKGKKKKREHDDI 220
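
The identity and positive counts in an HSP header can be recomputed from the alignment: in the middle line, a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The following is a minimal sketch with the illustrative names `midline_counts` and `percent`; the example string is the middle line of the last alignment segment above.

```python
def midline_counts(midline: str) -> tuple[int, int]:
    """Return (identities, positives) counted from a BLAST midline string."""
    identities = sum(ch.isalpha() for ch in midline)
    # Positives include identical residues plus conservative substitutions.
    positives = identities + midline.count("+")
    return identities, positives


def percent(matches: int, alignment_length: int) -> str:
    """Format a match count as a percentage with two decimals."""
    return f"{100 * matches / alignment_length:.2f}"


segment = "FCVAEEERMQQEESESD+LESTS+GKKKKRE DDI"
print(midline_counts(segment))  # (32, 34)

# Totals reported in the HSP header for this hit.
print(percent(135, 155))  # 87.10
print(percent(147, 155))  # 94.84
```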

BLAST of Cla97C03G053395 vs. NCBI nr
Match: XP_022157933.1 (uncharacterized protein LOC111024541 [Momordica charantia])

HSP 1 Score: 197.2 bits (500), Expect = 1.0e-46
Identity = 104/150 (69.33%), Positives = 127/150 (84.67%), Query Frame = 0

Query: 1   MSLMIIKRSIPRIYQDLIVESTNVKVFLKEQEQSFVKNEHEEARDLLTKLITMRYSGKG- 60
           MSLMIIKRSI +++ DLI+ESTNV+  LKE EQ F +NE+ EA DLLTKL+TMRY+ KG 
Sbjct: 25  MSLMIIKRSISKMFHDLILESTNVRALLKELEQCFARNENAEAGDLLTKLVTMRYTEKGN 84

Query: 61  NVREYIMEMLSLASRLRALKLEICEDLLVHFVLMSLPTQYTQLKVCYNTQVHKWTLNELI 120
           N+REYIMEM SLA+RLR LKLE+ EDLL+HFVLMSLPT+Y QLKVCYNTQ  KWTL ELI
Sbjct: 85  NIREYIMEMASLAARLRTLKLEVSEDLLIHFVLMSLPTRYNQLKVCYNTQKEKWTLVELI 144

Query: 121 SFCVAEEERMQQEESESDHLESTSRGKKKK 150
           S+CV EEER+++EE+E+D   ST +GK++K
Sbjct: 145 SYCVGEEERLRREETETDRSASTFQGKEEK 174

BLAST of Cla97C03G053395 vs. NCBI nr
Match: XP_024029782.1 (uncharacterized protein LOC112094049 [Morus notabilis])

HSP 1 Score: 177.6 bits (449), Expect = 8.5e-41
Identity = 95/153 (62.09%), Positives = 116/153 (75.82%), Query Frame = 0

Query: 1   MSLMIIKRSIPRIYQDLIVESTNVKVFLKEQEQSFVKNEHEEARDLLTKLITMRYSGKGN 60
           MSLMI+KRSIP  ++  I ESTN K FLKE EQ F KNE  E  +LL KLI MRY   GN
Sbjct: 33  MSLMIMKRSIPEAFRGSITESTNAKKFLKELEQYFAKNEKSETSNLLNKLIFMRYKANGN 92

Query: 61  VREYIMEMLSLASRLRALKLEICEDLLVHFVLMSLPTQYTQLKVCYNTQVHKWTLNELIS 120
           +REYIMEM ++A +L+AL LE+ EDLLVH VL+ LPTQY+Q KV YN Q  KWT+NELIS
Sbjct: 93  IREYIMEMSNIAEKLKALTLELSEDLLVHLVLIFLPTQYSQFKVGYNIQKEKWTVNELIS 152

Query: 121 FCVAEEERMQQEESESDHLESTSRGKKKKREQD 154
           +CV EEER+Q++++ES HL STS+ K K+R  D
Sbjct: 153 YCVQEEERLQRDKTESAHLASTSQNKNKRRAMD 185

BLAST of Cla97C03G053395 vs. NCBI nr
Match: KYP39716.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan])

HSP 1 Score: 173.7 bits (439), Expect = 1.2e-39
Identity = 93/153 (60.78%), Positives = 116/153 (75.82%), Query Frame = 0

Query: 1   MSLMIIKRSIPRIYQDLIVESTNVKVFLKEQEQSFVKNEHEEARDLLTKLITMRYSGKGN 60
           M LMI+KRS+P +++  I ES N K FL   EQ F  NE  +A  LL KLI+MRY GKGN
Sbjct: 53  MCLMIMKRSVPEVFRGSISESKNAKGFLDAVEQYFTSNEKADASSLLAKLISMRYKGKGN 112

Query: 61  VREYIMEMLSLASRLRALKLEICEDLLVHFVLMSLPTQYTQLKVCYNTQVHKWTLNELIS 120
           +REYIMEM +LAS+L+ALKLE+ +DLLVH VL+SLPT + Q KV YNTQ  KWTLNELIS
Sbjct: 113 IREYIMEMSNLASKLKALKLELSDDLLVHLVLISLPTHFGQFKVSYNTQKDKWTLNELIS 172

Query: 121 FCVAEEERMQQEESESDHLESTSRGKKKKREQD 154
            CV EEER Q+E++ES HL S+S+ +K+K  +D
Sbjct: 173 HCVQEEERQQREKTESAHLASSSQNRKRKNNKD 205

BLAST of Cla97C03G053395 vs. NCBI nr
Match: KYP35727.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan])

HSP 1 Score: 171.8 bits (434), Expect = 4.7e-39
Identity = 93/153 (60.78%), Positives = 115/153 (75.16%), Query Frame = 0

Query: 1   MSLMIIKRSIPRIYQDLIVESTNVKVFLKEQEQSFVKNEHEEARDLLTKLITMRYSGKGN 60
           M LMI+KRS+P ++   I ES N K FL   EQ F  NE  +A  LL KLI+MRY GKGN
Sbjct: 53  MCLMIMKRSVPEVFWGSISESQNAKGFLDVVEQYFTSNEKVDASSLLAKLISMRYKGKGN 112

Query: 61  VREYIMEMLSLASRLRALKLEICEDLLVHFVLMSLPTQYTQLKVCYNTQVHKWTLNELIS 120
           +REYIMEM +LAS+L+ALKLE+ +DLLVH VL+SLPT + Q KV YNTQ  KWTLNELIS
Sbjct: 113 IREYIMEMSNLASKLKALKLELSDDLLVHLVLISLPTHFGQFKVSYNTQKDKWTLNELIS 172

Query: 121 FCVAEEERMQQEESESDHLESTSRGKKKKREQD 154
            CV EEER Q+E++ES HL S+S+ +K+K  +D
Sbjct: 173 HCVQEEERQQREKTESAHLASSSQNRKRKNNKD 205

BLAST of Cla97C03G053395 vs. ExPASy TrEMBL
Match: A0A6J1DUQ1 (uncharacterized protein LOC111024541 OS=Momordica charantia OX=3673 GN=LOC111024541 PE=4 SV=1)

HSP 1 Score: 197.2 bits (500), Expect = 5.0e-47
Identity = 104/150 (69.33%), Positives = 127/150 (84.67%), Query Frame = 0

Query: 1   MSLMIIKRSIPRIYQDLIVESTNVKVFLKEQEQSFVKNEHEEARDLLTKLITMRYSGKG- 60
           MSLMIIKRSI +++ DLI+ESTNV+  LKE EQ F +NE+ EA DLLTKL+TMRY+ KG 
Sbjct: 25  MSLMIIKRSISKMFHDLILESTNVRALLKELEQCFARNENAEAGDLLTKLVTMRYTEKGN 84

Query: 61  NVREYIMEMLSLASRLRALKLEICEDLLVHFVLMSLPTQYTQLKVCYNTQVHKWTLNELI 120
           N+REYIMEM SLA+RLR LKLE+ EDLL+HFVLMSLPT+Y QLKVCYNTQ  KWTL ELI
Sbjct: 85  NIREYIMEMASLAARLRTLKLEVSEDLLIHFVLMSLPTRYNQLKVCYNTQKEKWTLVELI 144

Query: 121 SFCVAEEERMQQEESESDHLESTSRGKKKK 150
           S+CV EEER+++EE+E+D   ST +GK++K
Sbjct: 145 SYCVGEEERLRREETETDRSASTFQGKEEK 174

BLAST of Cla97C03G053395 vs. ExPASy TrEMBL
Match: A0A151RB35 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=3821 GN=KK1_038971 PE=4 SV=1)

HSP 1 Score: 173.7 bits (439), Expect = 6.0e-40
Identity = 93/153 (60.78%), Positives = 116/153 (75.82%), Query Frame = 0

Query: 1   MSLMIIKRSIPRIYQDLIVESTNVKVFLKEQEQSFVKNEHEEARDLLTKLITMRYSGKGN 60
           M LMI+KRS+P +++  I ES N K FL   EQ F  NE  +A  LL KLI+MRY GKGN
Sbjct: 53  MCLMIMKRSVPEVFRGSISESKNAKGFLDAVEQYFTSNEKADASSLLAKLISMRYKGKGN 112

Query: 61  VREYIMEMLSLASRLRALKLEICEDLLVHFVLMSLPTQYTQLKVCYNTQVHKWTLNELIS 120
           +REYIMEM +LAS+L+ALKLE+ +DLLVH VL+SLPT + Q KV YNTQ  KWTLNELIS
Sbjct: 113 IREYIMEMSNLASKLKALKLELSDDLLVHLVLISLPTHFGQFKVSYNTQKDKWTLNELIS 172

Query: 121 FCVAEEERMQQEESESDHLESTSRGKKKKREQD 154
            CV EEER Q+E++ES HL S+S+ +K+K  +D
Sbjct: 173 HCVQEEERQQREKTESAHLASSSQNRKRKNNKD 205

BLAST of Cla97C03G053395 vs. ExPASy TrEMBL
Match: A0A151QZJ9 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=3821 GN=KK1_043224 PE=4 SV=1)

HSP 1 Score: 171.8 bits (434), Expect = 2.3e-39
Identity = 93/153 (60.78%), Positives = 115/153 (75.16%), Query Frame = 0

Query: 1   MSLMIIKRSIPRIYQDLIVESTNVKVFLKEQEQSFVKNEHEEARDLLTKLITMRYSGKGN 60
           M LMI+KRS+P ++   I ES N K FL   EQ F  NE  +A  LL KLI+MRY GKGN
Sbjct: 53  MCLMIMKRSVPEVFWGSISESQNAKGFLDVVEQYFTSNEKVDASSLLAKLISMRYKGKGN 112

Query: 61  VREYIMEMLSLASRLRALKLEICEDLLVHFVLMSLPTQYTQLKVCYNTQVHKWTLNELIS 120
           +REYIMEM +LAS+L+ALKLE+ +DLLVH VL+SLPT + Q KV YNTQ  KWTLNELIS
Sbjct: 113 IREYIMEMSNLASKLKALKLELSDDLLVHLVLISLPTHFGQFKVSYNTQKDKWTLNELIS 172

Query: 121 FCVAEEERMQQEESESDHLESTSRGKKKKREQD 154
            CV EEER Q+E++ES HL S+S+ +K+K  +D
Sbjct: 173 HCVQEEERQQREKTESAHLASSSQNRKRKNNKD 205

BLAST of Cla97C03G053395 vs. ExPASy TrEMBL
Match: A0A151RDF9 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 (Fragment) OS=Cajanus cajan OX=3821 GN=KK1_038014 PE=4 SV=1)

HSP 1 Score: 170.2 bits (430), Expect = 6.6e-39
Identity = 92/153 (60.13%), Positives = 115/153 (75.16%), Query Frame = 0

Query: 1   MSLMIIKRSIPRIYQDLIVESTNVKVFLKEQEQSFVKNEHEEARDLLTKLITMRYSGKGN 60
           M LMI+KRS+P +++D I ES N K  L   EQ F  NE  +A  LL KLI+MR  GKGN
Sbjct: 86  MCLMIMKRSVPEVFRDSISESQNAKGLLDVVEQYFTSNEKADASSLLAKLISMRNKGKGN 145

Query: 61  VREYIMEMLSLASRLRALKLEICEDLLVHFVLMSLPTQYTQLKVCYNTQVHKWTLNELIS 120
           +REYIMEM +LAS+L+ALKLE+ +DLLVH VL+SLPT + Q KV YNTQ  KWTLNELIS
Sbjct: 146 IREYIMEMSNLASKLKALKLELSDDLLVHLVLISLPTHFGQFKVSYNTQKDKWTLNELIS 205

Query: 121 FCVAEEERMQQEESESDHLESTSRGKKKKREQD 154
            CV EEER Q+E++ES HL S+S+ +K+K  +D
Sbjct: 206 HCVQEEERQQREKTESAHLASSSQNRKRKNNKD 238

BLAST of Cla97C03G053395 vs. ExPASy TrEMBL
Match: A0A151TRZ9 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=3821 GN=KK1_009021 PE=4 SV=1)

HSP 1 Score: 169.5 bits (428), Expect = 1.1e-38
Identity = 92/155 (59.35%), Positives = 116/155 (74.84%), Query Frame = 0

Query: 1   MSLMIIKRSIPRIYQDLIVESTNVKVFLKEQEQSFVKNEHEEARDLLTKLITMRYSGKGN 60
           M LMI+KRS+  + +  I ES N K FL   EQ F  NE  +A +LL KLI+MRY GKGN
Sbjct: 53  MCLMIMKRSVLEVLRGFISESQNAKGFLDAIEQYFTSNEKVDASNLLAKLISMRYKGKGN 112

Query: 61  VREYIMEMLSLASRLRALKLEICEDLLVHFVLMSLPTQYTQLKVCYNTQVHKWTLNELIS 120
           +REYIMEM +LAS+L+ALKLE+ +DLLVH VL+SLPT + Q KV YNTQ  KWTLNELIS
Sbjct: 113 IREYIMEMSNLASKLKALKLELSDDLLVHLVLISLPTHFGQFKVIYNTQKDKWTLNELIS 172

Query: 121 FCVAEEERMQQEESESDHLESTSRGKKKKREQDDI 156
            CV EEER  +E++ES HL S+S+ +K+K  +DD+
Sbjct: 173 HCVQEEERQLREKTESAHLASSSQNRKRKNNKDDV 207

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value  Identity (%)  Description
XP_038895752.1  1.3e-65  87.10         uncharacterized protein LOC120083916 [Benincasa hispida]
XP_022157933.1  1.0e-46  69.33         uncharacterized protein LOC111024541 [Momordica charantia]
XP_024029782.1  8.5e-41  62.09         uncharacterized protein LOC112094049 [Morus notabilis]
KYP39716.1      1.2e-39  60.78         Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]
KYP35727.1      4.7e-39  60.78         Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]

ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A6J1DUQ1      5.0e-47  69.33         uncharacterized protein LOC111024541 OS=Momordica charantia OX=3673 GN=LOC111024...
A0A151RB35      6.0e-40  60.78         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=...
A0A151QZJ9      2.3e-39  60.78         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=...
A0A151RDF9      6.6e-39  60.13         Retrovirus-related Pol polyprotein from transposon TNT 1-94 (Fragment) OS=Cajanu...
A0A151TRZ9      1.1e-38  59.35         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=...

InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR Term  IPR Description   Source       Source Term    Source Description     Alignment
None      No IPR available  COILS        Coil           Coil                   coord: 122..142
None      No IPR available  PFAM         PF14223        Retrotran_gag_2        coord: 5..131; e-value: 1.2E-13; score: 51.0
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction    coord: 130..155
None      No IPR available  PANTHER      PTHR35317:SF3  TRANSMEMBRANE PROTEIN  coord: 1..151
None      No IPR available  PANTHER      PTHR35317      OS04G0629600 PROTEIN   coord: 1..151
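
The `coord` values above locate each annotation on the 155-residue protein. Assuming they are 1-based and inclusive, as is conventional for this kind of annotation output, a range `start..end` corresponds to the Python slice `[start - 1:end]`. The following is a minimal sketch with the illustrative helper names `region` and `coverage`, using the protein sequence listed on this page.

```python
# Protein sequence of Cla97C03G053395 as listed on this page (155 residues).
PROTEIN = (
    "MSLMIIKRSIPRIYQDLIVESTNVKVFLKEQEQSFVKNEHEEARDLLTKLITMRYSGKGN"
    "VREYIMEMLSLASRLRALKLEICEDLLVHFVLMSLPTQYTQLKVCYNTQVHKWTLNELIS"
    "FCVAEEERMQQEESESDHLESTSRGKKKKREQDDI"
)


def region(seq: str, start: int, end: int) -> str:
    """Extract a 1-based, inclusive coordinate range from a sequence."""
    return seq[start - 1:end]


def coverage(seq: str, start: int, end: int) -> float:
    """Fraction of the sequence covered by a 1-based, inclusive range."""
    return (end - start + 1) / len(seq)


print(len(PROTEIN))                        # 155
print(region(PROTEIN, 122, 142))           # COILS: CVAEEERMQQEESESDHLEST
print(f"{coverage(PROTEIN, 5, 131):.1%}")  # PF14223: 81.9%
```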

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C03G053395.1  Cla97C03G053395.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0005488      binding