Lsi03G007150 (gene) Bottle gourd (USVL1VR-Ls)

Name: Lsi03G007150
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description: Transposon Ty3-G Gag-Pol polyprotein
Location: chr03 : 9540187 .. 9544083 (+)
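
As a quick sanity check on the location given for this feature, its span can be computed from the coordinates. The sketch below assumes the coordinates are 1-based and inclusive, as in GFF3; the page itself does not state the convention. `parse_location` and `span_length` are hypothetical helpers written for illustration, not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'chr03 : 9540187 .. 9544083 (+)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates (GFF3-style).
    return end - start + 1

seqid, start, end, strand = parse_location("chr03 : 9540187 .. 9544083 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the gene spans 3897 bp on the plus strand of chr03.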
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTAGTTCAACTTTCCCACCTACACACATATGAGTACTCTCTCTAAAATGCAATGTGTGGCCTTGTAATTTGAGCATCTTTCTTGCAAAAATAAAATTCATAAAATTGAAAATTTAGATTTAAAAAAAAAAAAAAAAAATTCAAAGTTAGTTGCAACAAAAGTTAAGCTCCATCTTATATCTTTAGCATCTAATGTTCTTTAATTATTTTTTAATTTATATGGAGTTAAAAGTTAACAATTGCTTACTTAATAGTTTATATATTCTTAACTTTATACCAAAAACAAATCACAATAGTATATTTGATTTCTAATCCTGAAATATTAGTAACAGAAAATTAACTAAATAATCATTTAAGTACTTCCAAAAGAATCATAATAATAGTTTATTGAATTGGAGATGAATTAAAGCTTAAAGTGCATTTTAAACAATTTTTGTCAAAAGAGTTTAAATACAAATGAGTTTTTCGAAAATCATTTTTTTCTCTAGTTAATCTAAACGGGCATTAAATTGTCAACCATGCATATTCTCATTAGTTATGCATATAGACCTTCAGAAGAATGATATTTACCTAATTAAGGTAATTCGAGTTTAGTGTTTAAATCGATAAATTCGAGTGTTAATTAAAGTTAATATTAGGTAATTACTATTTTTTTTTTCTTCTGTGACATTTTTGATACAAGCTTTTCATCAATGTTTCCATTGGGTCAGATTTTGTTTGCTTACCAAACAAATAGGTTCATTCTATTAACTTATTCTCTAGTATATTTGGAGTTTGTTGGCCAAATTATTATTCCAAAAAAAAAAAAAATAGTCACTGCACATTGTAATTGTAACCAAATTCATATTTGGTTAGTTGAAAAAGGGTAGGTTGCATAGGTGGAACCATTATATGATTTATGACCATTTTCACAATGGTTTGTTGTAGGGGTGATCAAAAAATCTGATGAACCGAAAAATTCGATCAATCCAACTCAACTTATACAGTTTGGGTTGGATTATCAATTCATTTGAGTTAAGTTGGGTTCAAATAAATAACAATTATATGGGTTGGGTTGGTTCATGAGTTCACTTAAAATAACTCAAACCAACCCGAACTTATTATTAATTTTAAAAAGTATGTTGCTTTTACCTACAATTATTTTTACATATATTTATTTTTATTTTATTTTATTTTATGATTTTTTTTGGTAAGTTTATTCTCCAACAATTCTTACAAATAATTTTTTAGCACTTTGGAAATAAAATTTTTATAATATAAATTGAAATTGAGCTATTAATTTTAATTAAATATAAGTAAATAATTAAATAATTTTTTTTCATATTAACATTTTACTTATTTTTAGAACAAAAATTTAAATAAATGATTAGACTAATCCGAATCAACCTAATCCAAATGTTTCATGGTTGGGTTGGGTTCATTATTTAGTAAGGGTTATTTAGGTTAGAAATTTACAACCCAAACAATTGGAATGAGTCTTAAAAAGTGTTTCAACTCAATCCAATTCAATCCATGAACTAGTTTGTTGGTGCTATAGAACCATTCTAATGGTCTTTGACTATGTGTAAGTTAGAAAAATTTGTAATTTAACCATACTTACTTTGCTATTACTTTTTTTTATATATATAAATTCCTAGACGATTGAGGAAATTTATGGAAAAAAATGTCTTAAGTTTGAATGCCCAGAATTAACATGCGTTATTCCATAAGTCAAGAAATCACAACTTATAATTGTCTCTTACAACTATGCTTAAATTAATCTCTTGCAAACCATAAGTTTTATTTTATTTTATTTATTTATTTTCATGTTAAACTCATACGATCAACAATATGTTGTAGAGTTCTCGATTATATAAATAATAGTATTGTCTATATTAAGTGCAATAGTTACATGTAGAAAATTGTAAAATATATTTTTATTTGTTGAATTCACTTCTCAAATGAGTGTTAAGAATGTGGAAAAATATGTGTGTAGTTGAATGGTCGAACAAAAAGTTA
AGCAAAAATTTAAAAAAAAAAAAAAAACAAAAATGCATGCTTTTTGTTTGCACGAGGTTTTTGGATAAATGTGTTTCGTAGAGGAACTTGACTTTATGTACATTTGATGTAGATGTGATTCTCTCTAGGTTCTCTGAGTGTAAAATTTTTTAACCTTTCAAATAAGGAAGAACTCCTTTATTTATAGAGTTCTCAAATAGGAGTTTGTGAGTTTGAGTTTGGTTGTTTCATGGACCTAGTTCTTAGGCCCAATCAATTGGATTTGGGTCTTACTTACTCTTTTTAGGCTCAATTATACTTTGCCTAAATAATAGTATCGGATTGAACCAAATTATTTTATTTAATCCCATGATCACGATGAAAACATGTGTGACATCCTCGGAATTTGTCAACTTATGTCTTCAATTTCAATTTGAGACACATGTCAACTTTTAATTAGTCCCAAATTTGACCCTTCCTAATTTTATCATTAATTTAAGAAATGGCGTGACAATTTGCAGTTGGTCCAAAATTTCTCACTCATCACTTTTTAGCTTACAAAAAACTGCACTATATATCTATAACTCAACCAACTCAACTCTACACTTCAAACACAGACTTATTGACGAAGATCTAAGACAAGATCTAGGTTTTTTTAGATTTAGTTATATTAATTTTGTTGTGGGACTTTATTTAATTGGGTGGATATTTAACTTGGGCCTTTAGTTATATTTTATATTTGGTTTATGTGGGCCTTCTTTAGCCCATGCTCCTGGAACTAGTTAAAAGAAAAAACCTAGCATATATATATTTGCTCATCAAGTGTGAAGAAGACAATTTTTAATAATTTAAATGAAATATGAACACAGAATCGACAGCCTCTAGCTTAGCCTAGACCGATCTTCTGTTTGTTTCATCTCATCCCATTCTTGTCTTCTACATCAAAGTGGTATTAGAGCCCAAAGATCACTTGGGCAATGGCAACTAGAAGTGATAGAGAAAGAATGAATGAGGGAGAGAGAGATTTTCCAGTTGTGTGGCAAGCCATAGTTGAAATCGGAAATCGACTCGATCAACATTTCGAGCGAACTAATCGAGACATCCAGGAAATGCAGCAAAGCATGGCCCAAATGAACGCCAATCTTAGAGGTGCGCACCCACAAAATCATTGCGAACAACCGCGCGCTAGAGGGGTTGAACGCAGAGGTGGTCGGCGTGGGGGTAGACAACATGCTGGAGGTAGGCAAGGACCTATAATGGAAGAAGAAAACATCAGTCAATCAGAAAATGATGATAATAGTAGTACTGAATCAGATGACTTGGCAGAAGAAGCACTACTTAGAGGAAATCGATATGAGAGAGAGCTTCCACAGCAAAGAAGAGGTGAAAACAGTGAGTATAAAATTAAAATTGATCTCCCTTATTTTGATGGTACATTTGATATTGAAGAATTTCTAGATTGGCTACAACATGTGGAATTTTTTTTGAATACATGGACATTCCTGAAGAAAGTAAAGTTAAACTTGTAGCATACAAGCTTAAGGGGGGAGCATCTGCATGGTGGGAACAATTTAAAGTAAATAGGAGAAGACATGGAAAAGAGAAAATACGCACATGCCAAAGCTGCGACGCTATTTGAAGAAGTATTTCTTGCCATTAGACCACGACTAAGTATTGTATCAACAATATCAGCACTGCCAACAAAGAAATAGAAAATACAACTGAGTTCAATCGATTAAATGCATTGAACAATTTGAATGAAACTCCAAGACAACAAATGGCAAGATATGTTGGGGGGTTAAAGTCAAGCATTCAAGATCAATTATCTTTAAAGTCCATGAAGACATTGTCAAAAGCAATAAAATTGGCACAAAAGGCAGAGGCACACAATACTAGAACAAACACGCGTTCAATATAG

mRNA sequence

ATGTTAAGCCCAAAGATCACTTGGGCAATGGCAACTAGAAGTGATAGAGAAAGAATGAATGAGGGAGAGAGAGATTTTCCAGTTGTGTGGCAAGCCATAGTTGAAATCGGAAATCGACTCGATCAACATTTCGAGCGAACTAATCGAGACATCCAGGAAATGCAGCAAAGCATGGCCCAAATGAACGCCAATCTTAGAGGTGCGCACCCACAAAATCATTGCGAACAACCGCGCGCTAGAGGGGTTGAACGCAGAGGTGGTCGGCGTGGGGGTAGACAACATGCTGGAGGTAGGCAAGGACCTATAATGGAAGAAGAAAACATCAGTCAATCAGAAAATGATGATAATAAAAATACAACTGAGTTCAATCGATTAAATGCATTGAACAATTTGAATGAAACTCCAAGACAACAAATGGCAAGATATGTTGGGGGGTTAAAGTCAAGCATTCAAGATCAATTATCTTTAAAGTCCATGAAGACATTGTCAAAAGCAATAAAATTGGCACAAAAGGCAGAGGCACACAATACTAGAACAAACACGCGTTCAATATAG

Coding sequence (CDS)

ATGTTAAGCCCAAAGATCACTTGGGCAATGGCAACTAGAAGTGATAGAGAAAGAATGAATGAGGGAGAGAGAGATTTTCCAGTTGTGTGGCAAGCCATAGTTGAAATCGGAAATCGACTCGATCAACATTTCGAGCGAACTAATCGAGACATCCAGGAAATGCAGCAAAGCATGGCCCAAATGAACGCCAATCTTAGAGGTGCGCACCCACAAAATCATTGCGAACAACCGCGCGCTAGAGGGGTTGAACGCAGAGGTGGTCGGCGTGGGGGTAGACAACATGCTGGAGGTAGGCAAGGACCTATAATGGAAGAAGAAAACATCAGTCAATCAGAAAATGATGATAATAAAAATACAACTGAGTTCAATCGATTAAATGCATTGAACAATTTGAATGAAACTCCAAGACAACAAATGGCAAGATATGTTGGGGGGTTAAAGTCAAGCATTCAAGATCAATTATCTTTAAAGTCCATGAAGACATTGTCAAAAGCAATAAAATTGGCACAAAAGGCAGAGGCACACAATACTAGAACAAACACGCGTTCAATATAG

Protein sequence

MLSPKITWAMATRSDRERMNEGERDFPVVWQAIVEIGNRLDQHFERTNRDIQEMQQSMAQMNANLRGAHPQNHCEQPRARGVERRGGRRGGRQHAGGRQGPIMEEENISQSENDDNKNTTEFNRLNALNNLNETPRQQMARYVGGLKSSIQDQLSLKSMKTLSKAIKLAQKAEAHNTRTNTRSI
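
The protein sequence follows from the CDS by translating it in frame 1 with the standard genetic code. The sketch below is a minimal illustration of that step, assuming NCBI translation table 1 applies to this nuclear gene; `translate` is a hypothetical helper, and only the first 21 and last 12 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 21 and last 12 nucleotides of the CDS listed for this gene.
print(translate("ATGTTAAGCCCAAAGATCACT"))  # MLSPKIT
print(translate("CGTTCAATATAG"))           # RSI
```

The two fragments reproduce the first seven and last three residues of the listed protein sequence.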
BLAST of Lsi03G007150 vs. NCBI nr
Match: gi|659112439|ref|XP_008456221.1| (PREDICTED: uncharacterized protein LOC103496226 [Cucumis melo])

HSP 1 Score: 102.4 bits (254), Expect = 8.4e-19
Identity = 57/77 (74.03%), Positives = 64/77 (83.12%), Query Frame = 1

Query: 106 ENISQSENDDNKNTTEFNRLNALNNLNETPRQQMARYVGGLKSSIQDQLSLKSMKTLSKA 165
           ++  Q      + TTEF+RLNALNNLNETP QQ+ARYVGGLKS+IQDQLSLKSMKTLSKA
Sbjct: 23  QHYQQRNRSVREYTTEFSRLNALNNLNETPTQQVARYVGGLKSTIQDQLSLKSMKTLSKA 82

Query: 166 IKLAQKAEAHNTRTNTR 183
           I LAQKAEAHNT+ N R
Sbjct: 83  ITLAQKAEAHNTKANMR 99
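
The identity figure reported for this HSP can be recomputed from the aligned rows by counting columns where the query and subject residues agree. The sketch below assumes an ungapped alignment whose wrapped lines have been joined by hand; `percent_identity` is a hypothetical helper, not a BLAST function.

```python
def percent_identity(query, subject):
    """Return (identical columns, alignment length, percent identity)."""
    if len(query) != len(subject):
        raise ValueError("aligned sequences must have equal length")
    # A gap character aligned to a gap character is not counted as a match.
    matches = sum(q == s and q != "-" for q, s in zip(query, subject))
    return matches, len(query), round(100.0 * matches / len(query), 2)

# Aligned rows of HSP 1, with the two wrapped lines joined.
query = ("ENISQSENDDNKNTTEFNRLNALNNLNETPRQQMARYVGGLKSSIQDQLSLKSMKTLSKA"
         "IKLAQKAEAHNTRTNTR")
subject = ("QHYQQRNRSVREYTTEFSRLNALNNLNETPTQQVARYVGGLKSTIQDQLSLKSMKTLSKA"
           "ITLAQKAEAHNTKANMR")
print(percent_identity(query, subject))  # (57, 77, 74.03)
```

The count agrees with the 57/77 (74.03%) identity stated in the HSP header.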

BLAST of Lsi03G007150 vs. NCBI nr
Match: gi|672196124|ref|XP_008776725.1| (PREDICTED: uncharacterized protein LOC103696786 [Phoenix dactylifera])

HSP 1 Score: 59.3 bits (142), Expect = 8.1e-06
Identity = 32/57 (56.14%), Positives = 42/57 (73.68%), Query Frame = 1

Query: 121 EFNRLNALNNLNETPRQQMARYVGGLKSSIQDQLSLKSMKTLSKAIKLAQKAEAHNT 178
           EFNR NA NNL+ET   Q+ARY+GGLK +I+DQ+ L S+ +LS+A  LA K EA  +
Sbjct: 128 EFNRFNARNNLSETKNPQVARYIGGLKPAIRDQVDLHSIWSLSEATSLALKLEAQTS 184

The following BLAST results are available for this feature:
Match Name                          E-value    Identity (%)    Description
gi|659112439|ref|XP_008456221.1|    8.4e-19    74.03           PREDICTED: uncharacterized protein LOC103496226 [Cucumis melo]
gi|672196124|ref|XP_008776725.1|    8.1e-06    56.14           PREDICTED: uncharacterized protein LOC103696786 [Phoenix dactylifera]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term    Definition
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0008150        biological_process
cellular_component    GO:0005575        cellular_component
molecular_function    GO:0003674        molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
Lsi03G007150.1    Lsi03G007150.1    mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR Term    IPR Description     Source     Source Term    Source Description    Alignment
None        No IPR available    unknown    Coil           Coil                  coord: 44..64
scor