Lsi03G007460 (gene) Bottle gourd (USVL1VR-Ls)

NameLsi03G007460
Typegene
OrganismLagenaria siceraria (Bottle gourd (USVL1VR-Ls))
DescriptionRetrotransposon protein, putative, Ty1-copia subclass
Locationchr03 : 10179348 .. 10180270 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCGAACGAAGACACCCAATCCTTAGAAACTCAAGCTCAAGTAGAAACTTCCATTTTGGCTCCTCCTTCTGTTGCTGCAACAAACGGCTCCTCCTTTGGTCACCCTCTTGGGACTGTGCTAATAGTAAAGTTAGATGATAAAAATTATGCCTTGTGGAAGGAAATGATTCTTGCCATTCTTCGTGGACAGAAACTTGAAAGACATGTCCTTGGAACCAAGTCAGAGCCACCGGTCCTTCTTCCTTCGATGAAAGGAACGGGAACCTCAGAACCCGTAATTAATCCACCTTATGAAGAATGGTGTGTTATTGACCAAGCACTACTAGGGTGGCTCCTAGGATCTATGACTGCATCAGTGACTTAGCGAAGTAGTTCATCTTAGCACTTCAAGGAAAGTGTGGGCAGCATTAGAAGACCAATATGCTGCAGCAAGCAGGGCAAGAATCATTCAATTAAGAAATATCTTGCAAACTACAAGGAAAGGGGCTACGAGGATGGCTGACTACCTTGCATGCATGAAGCAAACTTCTGAAAATCTCAAACTAGCAGGTGAAGATGTTTCTTTCAATTATCTTATGTCATGTATTTTGGCCGGGTTAGAAGCTGAATTCATCCCAATAGTCTGTCAAATTGAAGGTAGAGAATCAATTACATGGTCTTCTTTACACTCTACATTACTAACCTTTGAACATAATCTTGTGAGATTGAATGTGATAAGTACCGGTGAAAATGTGGACACTTTGAGTGCCAATTATGTTGCCAATAGACAAGATGCAACTGGAAGTAGACAGTAAAACAGAGGTTCGTCCGGAAGAGCCAACCATGGAAATGAACGAGGAAGAGGGCGAGGCAGAAATGGAAACTACAGGTACAATGGCTCAAAGCCAACATGTCAACTATGTGGAAAATATGGCCACATAG

mRNA sequence

ATGTCGAACGAAGACACCCAATCCTTAGAAACTCAAGCTCAAGTAGAAACTTCCATTTTGGCTCCTCCTTCTGTTGCTGCAACAAACGGCTCCTCCTTTGGTCACCCTCTTGGGACTGTGCTAATAGTAAAGTTAGATGATAAAAATTATGCCTTGTGGAAGGAAATGATTCTTGCCATTCTTCGTGGACAGAAACTTGAAAGACATGTCCTTGGAACCAAGTCAGAGCCACCGGTCCTTCTTCCTTCGATGAAAGGAACGGGAACCTCAGAACCCGTAATTAATCCACCTTATGAAGAATGGAAAGTGTGGGCAGCATTAGAAGACCAATATGCTGCAGCAAGCAGGGCAAGAATCATTCAATTAAGAAATATCTTGCAAACTACAAGGAAAGGGGCTACGAGGATGGCTGACTACCTTGCATGCATGAAGCAAACTTCTGAAAATCTCAAACTAGCAGGTGAAGATGTTTCTTTCAATTATCTTATGTCATGTATTTTGGCCGGGTTAGAAGCTGAATTCATCCCAATAGTCTGTCAAATTGAAGACAAGATGCAACTGGAAGTAGACAGTAAAACAGAGGTTCGTCCGGAAGAGCCAACCATGGAAATGAACGAGGAAGAGGGCGAGGCAGAAATGGAAACTACAGGTACAATGGCTCAAAGCCAACATGTCAACTATGTGGAAAATATGGCCACATAG

Coding sequence (CDS)

ATGTCGAACGAAGACACCCAATCCTTAGAAACTCAAGCTCAAGTAGAAACTTCCATTTTGGCTCCTCCTTCTGTTGCTGCAACAAACGGCTCCTCCTTTGGTCACCCTCTTGGGACTGTGCTAATAGTAAAGTTAGATGATAAAAATTATGCCTTGTGGAAGGAAATGATTCTTGCCATTCTTCGTGGACAGAAACTTGAAAGACATGTCCTTGGAACCAAGTCAGAGCCACCGGTCCTTCTTCCTTCGATGAAAGGAACGGGAACCTCAGAACCCGTAATTAATCCACCTTATGAAGAATGGAAAGTGTGGGCAGCATTAGAAGACCAATATGCTGCAGCAAGCAGGGCAAGAATCATTCAATTAAGAAATATCTTGCAAACTACAAGGAAAGGGGCTACGAGGATGGCTGACTACCTTGCATGCATGAAGCAAACTTCTGAAAATCTCAAACTAGCAGGTGAAGATGTTTCTTTCAATTATCTTATGTCATGTATTTTGGCCGGGTTAGAAGCTGAATTCATCCCAATAGTCTGTCAAATTGAAGACAAGATGCAACTGGAAGTAGACAGTAAAACAGAGGTTCGTCCGGAAGAGCCAACCATGGAAATGAACGAGGAAGAGGGCGAGGCAGAAATGGAAACTACAGGTACAATGGCTCAAAGCCAACATGTCAACTATGTGGAAAATATGGCCACATAG

Protein sequence

MSNEDTQSLETQAQVETSILAPPSVAATNGSSFGHPLGTVLIVKLDDKNYALWKEMILAILRGQKLERHVLGTKSEPPVLLPSMKGTGTSEPVINPPYEEWKVWAALEDQYAAASRARIIQLRNILQTTRKGATRMADYLACMKQTSENLKLAGEDVSFNYLMSCILAGLEAEFIPIVCQIEDKMQLEVDSKTEVRPEEPTMEMNEEEGEAEMETTGTMAQSQHVNYVENMAT
BLAST of Lsi03G007460 vs. TrEMBL
Match: A0A151QNK8_CAJCA (Uncharacterized protein OS=Cajanus cajan GN=KK1_047566 PE=4 SV=1)

HSP 1 Score: 89.0 bits (219), Expect = 8.5e-15
Identity = 56/193 (29.02%), Postives = 97/193 (50.26%), Query Frame = 1

Query: 25  VAATNGSSFGHPLGTVLIVKLDDKNYALWKEMILAILRGQKLERHVLGTKSEPPVLLPSM 84
           +A+   S   + L +++ VKLD  NY LWK ++++I++G +L+ H+LGTK  P   + S 
Sbjct: 1   MASVADSKSKNDLPSIVSVKLDRDNYPLWKSLVISIVKGCRLDGHMLGTKECPEEFIAS- 60

Query: 85  KGTGTSEPVINPPYEEWK------------------------------VWAALEDQYAAA 144
               + +P  NP +E W+                              +W   +    A 
Sbjct: 61  -ADSSKKP--NPAFENWQAHDSQLLGWLMNSMTIEMATQLLHCETSKQLWDEAQSLAGAH 120

Query: 145 SRARIIQLRNILQTTRKGATRMADYLACMKQTSENLKLAGEDVSFNYLMSCILAGLEAEF 188
           +R+R+  L++     RKG  +M +YLA MK  ++ LKLAG  +S + L+   L GL++E+
Sbjct: 121 TRSRVTYLKSEFHNIRKGEMKMEEYLAKMKNLADKLKLAGSPISNSDLIIQTLNGLDSEY 180

BLAST of Lsi03G007460 vs. TrEMBL
Match: A0A151RKJ3_CAJCA (Uncharacterized protein OS=Cajanus cajan GN=KK1_035512 PE=4 SV=1)

HSP 1 Score: 86.7 bits (213), Expect = 4.2e-14
Identity = 57/191 (29.84%), Postives = 88/191 (46.07%), Query Frame = 1

Query: 27  ATNGSSFGHPLGTVLIVKLDDKNYALWKEMILAILRGQKLERHVLGTKSEPPVLLPSMKG 86
           +++ S+  + L     VKLD KNY LWK ++L  L+G  L+ ++ GT   PP  +    G
Sbjct: 6   SSSSSNLKNILSIPCSVKLDRKNYRLWKSLVLPSLKGHNLDGYLFGTTKCPPEFIIDETG 65

Query: 87  TGTSEPVINPPYEEW------------------------------KVWAALEDQYAAASR 146
                   NP + +W                              ++W   +    A +R
Sbjct: 66  KKN-----NPAFADWTSTDQLLLGWLINSMTQEVATQLLHCETSQQIWEDAQSLAGAHTR 125

Query: 147 ARIIQLRNILQTTRKGATRMADYLACMKQTSENLKLAGEDVSFNYLMSCILAGLEAEFIP 188
           +RI  L+     TRKG  +M +Y   MK+ +++L LAG  VS   L++  LAGL+ E+ P
Sbjct: 126 SRITFLKTEFHRTRKGGLKMEEYFTKMKEIADDLALAGSSVSTMDLVTQTLAGLDNEYNP 185

BLAST of Lsi03G007460 vs. TrEMBL
Match: A0A0A0LUB0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G064810 PE=4 SV=1)

HSP 1 Score: 85.1 bits (209), Expect = 1.2e-13
Identity = 42/83 (50.60%), Postives = 56/83 (67.47%), Query Frame = 1

Query: 102 KVWAALEDQYAAASRARIIQLRNILQTTRKGATRMADYLACMKQTSENLKLAGEDVSFNY 161
           ++W ALE+ Y A S++    +  ILQ TRKG  RM +YL+ MKQT EN++LAG  +S   
Sbjct: 172 ELWLALEELYGATSKSPYKSIGIILQNTRKGTMRMTEYLSMMKQTLENMQLAGSPISHED 231

Query: 162 LMSCILAGLEAEFIPIVCQIEDK 185
           L S +L GL+ E+IPIVC IE K
Sbjct: 232 LFSYVLVGLDVEYIPIVCDIEGK 254

BLAST of Lsi03G007460 vs. TrEMBL
Match: V7CEQ5_PHAVU (Uncharacterized protein (Fragment) OS=Phaseolus vulgaris GN=PHAVU_002G005200g PE=4 SV=1)

HSP 1 Score: 84.3 bits (207), Expect = 2.1e-13
Identity = 53/159 (33.33%), Postives = 77/159 (48.43%), Query Frame = 1

Query: 41  LIVKLDDKNYALWKEMILAILRGQKLERHVLGTKSEPPVLLPSMKGTGTSEPVINPPYEE 100
           L+ KLD  NY LW+  IL I+RG KL+ HV G K  PP  LP     G++  + NP +EE
Sbjct: 19  LVYKLDHTNYLLWETTILPIIRGHKLDGHVFGNKPCPPEFLPRTT-PGSTVKIPNPEFEE 78

Query: 101 W------------------------------KVWAALEDQYAAASRARIIQLRNILQTTR 160
           W                              ++W A ++   A +R+RI+  +  LQ  R
Sbjct: 79  WVSNHQLLLGWLYSTMRTDIASQLMRNSTLKELWDAAKELSGAHTRSRIVYYKAELQKMR 138

Query: 161 KGATRMADYLACMKQTSENLKLAGEDVSFNYLMSCILAG 170
           +G  +M  YL  MK  ++NL +AG  +S + L   IL+G
Sbjct: 139 EGGMKMEKYLTTMKSIADNLAVAGNLISQSELTIQILSG 176

BLAST of Lsi03G007460 vs. TrEMBL
Match: Q6ATL7_ORYSJ (Putative polyprotein OS=Oryza sativa subsp. japonica GN=OSJNBb0021K20.25 PE=4 SV=1)

HSP 1 Score: 82.0 bits (201), Expect = 1.0e-12
Identity = 55/189 (29.10%), Postives = 90/189 (47.62%), Query Frame = 1

Query: 25  VAATNGSSFGHPL-GTVLIVKLDDKNYALWKEMILAILRGQKLERHVLGTKSEPPVLLPS 84
           +A+++ ++ G+PL G  +  KL   N+A+WK  ILA +RG +LE H+ G    P  +L  
Sbjct: 1   MASSSKNNTGNPLVGQPVSEKLGKSNHAVWKAQILATIRGARLEGHLTGDDQPPAPIL-- 60

Query: 85  MKGTGTSEPVI-NPPYEEW------------------------------KVWAALEDQYA 144
            +  G  E V+ NP YEEW                                W+ ++  + 
Sbjct: 61  RRKEGEKEVVVSNPEYEEWVATDQQVLAYLLSSMTKDLLVQVATCRTAASAWSMIQGMFG 120

Query: 145 AASRARIIQLRNILQTTRKGATRMADYLACMKQTSENLKLAGEDVSFNYLMSCILAGLEA 182
           + +RAR I  R  L T +KG   +  Y+  M+  +++L   G+ V  + L+  I AGL+ 
Sbjct: 121 SMTRARTINTRLSLSTLQKGDMNITTYVGKMRALADDLMAVGKPVDDDELIGYIFAGLDD 180

BLAST of Lsi03G007460 vs. NCBI nr
Match: gi|828332633|ref|XP_004513130.2| (PREDICTED: uncharacterized protein LOC101488260, partial [Cicer arietinum])

HSP 1 Score: 92.4 bits (228), Expect = 1.1e-15
Identity = 69/232 (29.74%), Postives = 108/232 (46.55%), Query Frame = 1

Query: 24  SVAATNGSSFGHPLGTVLIVKLDDKNYALWKEMILAILRGQKLERHVLGTKSEPPVLLPS 83
           S A TN  +    L T + VKLD  NY LWK ++L ++RG KL+ +++GTK  P   + +
Sbjct: 7   SAANTNHKN---DLPTTVSVKLDRDNYLLWKSLVLPLIRGCKLDGYIIGTKECPEQFIST 66

Query: 84  MKGTGTSEPVINPPYEEW------------------------------KVWAALEDQYAA 143
              T  +    NP YEEW                              ++W   +    A
Sbjct: 67  NDTTKKN----NPDYEEWIAHDQALLGWLRNSVAIDIATQLLHCETSKELWNEAQSLTGA 126

Query: 144 ASRARIIQLRNILQTTRKGATRMADYLACMKQTSENLKLAGEDVSFNYLMSCILAGLEAE 203
            +++R I L++    TRKG  +M  YL  MK  S+ LKLAG  +S + L+   L GL+A+
Sbjct: 127 HTKSRTIYLKSEFHNTRKGQMKMDQYLLKMKNLSDKLKLAGSPISSSDLIIQTLNGLDAD 186

Query: 204 FIPIVCQIEDKMQLE-VDSKTEVRPEEPTMEMNEEEGEAEMETTGTMAQSQH 225
           + P+V ++ D++ L  VD + ++   E  ME         M  +  +A   H
Sbjct: 187 YNPVVVKLSDQINLNWVDLQAQLLAFENRMEQLNNFSNLSMNASANLASQTH 231

BLAST of Lsi03G007460 vs. NCBI nr
Match: gi|1012318520|gb|KYP31890.1| (hypothetical protein KK1_047566 [Cajanus cajan])

HSP 1 Score: 89.0 bits (219), Expect = 1.2e-14
Identity = 56/193 (29.02%), Postives = 97/193 (50.26%), Query Frame = 1

Query: 25  VAATNGSSFGHPLGTVLIVKLDDKNYALWKEMILAILRGQKLERHVLGTKSEPPVLLPSM 84
           +A+   S   + L +++ VKLD  NY LWK ++++I++G +L+ H+LGTK  P   + S 
Sbjct: 1   MASVADSKSKNDLPSIVSVKLDRDNYPLWKSLVISIVKGCRLDGHMLGTKECPEEFIAS- 60

Query: 85  KGTGTSEPVINPPYEEWK------------------------------VWAALEDQYAAA 144
               + +P  NP +E W+                              +W   +    A 
Sbjct: 61  -ADSSKKP--NPAFENWQAHDSQLLGWLMNSMTIEMATQLLHCETSKQLWDEAQSLAGAH 120

Query: 145 SRARIIQLRNILQTTRKGATRMADYLACMKQTSENLKLAGEDVSFNYLMSCILAGLEAEF 188
           +R+R+  L++     RKG  +M +YLA MK  ++ LKLAG  +S + L+   L GL++E+
Sbjct: 121 TRSRVTYLKSEFHNIRKGEMKMEEYLAKMKNLADKLKLAGSPISNSDLIIQTLNGLDSEY 180

BLAST of Lsi03G007460 vs. NCBI nr
Match: gi|1012331601|gb|KYP43069.1| (hypothetical protein KK1_035512 [Cajanus cajan])

HSP 1 Score: 86.7 bits (213), Expect = 6.0e-14
Identity = 57/191 (29.84%), Postives = 88/191 (46.07%), Query Frame = 1

Query: 27  ATNGSSFGHPLGTVLIVKLDDKNYALWKEMILAILRGQKLERHVLGTKSEPPVLLPSMKG 86
           +++ S+  + L     VKLD KNY LWK ++L  L+G  L+ ++ GT   PP  +    G
Sbjct: 6   SSSSSNLKNILSIPCSVKLDRKNYRLWKSLVLPSLKGHNLDGYLFGTTKCPPEFIIDETG 65

Query: 87  TGTSEPVINPPYEEW------------------------------KVWAALEDQYAAASR 146
                   NP + +W                              ++W   +    A +R
Sbjct: 66  KKN-----NPAFADWTSTDQLLLGWLINSMTQEVATQLLHCETSQQIWEDAQSLAGAHTR 125

Query: 147 ARIIQLRNILQTTRKGATRMADYLACMKQTSENLKLAGEDVSFNYLMSCILAGLEAEFIP 188
           +RI  L+     TRKG  +M +Y   MK+ +++L LAG  VS   L++  LAGL+ E+ P
Sbjct: 126 SRITFLKTEFHRTRKGGLKMEEYFTKMKEIADDLALAGSSVSTMDLVTQTLAGLDNEYNP 185

BLAST of Lsi03G007460 vs. NCBI nr
Match: gi|848893339|ref|XP_012846718.1| (PREDICTED: uncharacterized protein LOC105966675 [Erythranthe guttata])

HSP 1 Score: 85.9 bits (211), Expect = 1.0e-13
Identity = 57/180 (31.67%), Postives = 93/180 (51.67%), Query Frame = 1

Query: 24  SVAATNGSSFG--HPLGTVLIVKLDDKNYALWKEMILAILRGQKLERHVLGTKSEPPVLL 83
           + +ATN SS    +P  ++ + KL   NY +WK  IL  L+GQ++  +V GT   PP  L
Sbjct: 3   TASATNTSSLPIFNPSSSINVTKLTRTNYPIWKAQILPYLKGQEVFGYVDGTIKAPPATL 62

Query: 84  PSMKGTGTSEP--------------VINPPYEE------------WKVWAALEDQYAAAS 143
            ++ GT T  P               IN    E              +W AL+  +A+ S
Sbjct: 63  -TVNGTSTPNPEFTLWTKQDNLILSTINSSLTEEVLAQVYQSETSHAIWLALQTCFASQS 122

Query: 144 RARIIQLRNILQTTRKGATRMADYLACMKQTSENLKLAGEDVSFNYLMSCILAGLEAEFI 176
           RA+++Q+R+ L T+RKG     DY   +K+ ++ L +AG+ ++ + +++ ILAGL  EF+
Sbjct: 123 RAKVVQVRSQLATSRKGHLSATDYFVQIKKIADQLSMAGQALTSDDIITYILAGLGPEFV 181

BLAST of Lsi03G007460 vs. NCBI nr
Match: gi|1002290026|ref|XP_015649341.1| (PREDICTED: uncharacterized protein LOC107281969 [Oryza sativa Japonica Group])

HSP 1 Score: 85.5 bits (210), Expect = 1.3e-13
Identity = 54/156 (34.62%), Postives = 81/156 (51.92%), Query Frame = 1

Query: 38  GTVLIVKLDDKNYALWKEMILAILRGQKLERHVLGTKSEPPVLLPSMKGTG-TSEPVINP 97
           G  +  KL   N+ LWK  +LA LRG ++   + G+   P   +   K  G T+E V NP
Sbjct: 13  GQTVSEKLTKSNFVLWKAQVLAALRGAQMAGFLDGSNQAPAATIIITKDDGKTTEKVANP 72

Query: 98  PYEEWKV---WA--------ALEDQYAAASRARIIQLRNILQTTRKGATRMADYLACMKQ 157
              +WK    W         A+E  +A+ SRARI+  R  L TT KG   MA+Y+  MK 
Sbjct: 73  ALVQWKAQEQWQHSQRRQMEAIESLFASQSRARILNTRMALSTTVKGNRSMAEYVGKMKS 132

Query: 158 TSENLKLAGEDVSFNYLMSCILAGLEAEFIPIVCQI 182
            ++++  AG+ V    L+S ILAGL+ ++  +V  +
Sbjct: 133 LADDMASAGKPVDDEELISYILAGLDFDYNSVVSSV 168

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A151QNK8_CAJCA8.5e-1529.02Uncharacterized protein OS=Cajanus cajan GN=KK1_047566 PE=4 SV=1[more]
A0A151RKJ3_CAJCA4.2e-1429.84Uncharacterized protein OS=Cajanus cajan GN=KK1_035512 PE=4 SV=1[more]
A0A0A0LUB0_CUCSA1.2e-1350.60Uncharacterized protein OS=Cucumis sativus GN=Csa_1G064810 PE=4 SV=1[more]
V7CEQ5_PHAVU2.1e-1333.33Uncharacterized protein (Fragment) OS=Phaseolus vulgaris GN=PHAVU_002G005200g PE... [more]
Q6ATL7_ORYSJ1.0e-1229.10Putative polyprotein OS=Oryza sativa subsp. japonica GN=OSJNBb0021K20.25 PE=4 SV... [more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
gi|828332633|ref|XP_004513130.2|1.1e-1529.74PREDICTED: uncharacterized protein LOC101488260, partial [Cicer arietinum][more]
gi|1012318520|gb|KYP31890.1|1.2e-1429.02hypothetical protein KK1_047566 [Cajanus cajan][more]
gi|1012331601|gb|KYP43069.1|6.0e-1429.84hypothetical protein KK1_035512 [Cajanus cajan][more]
gi|848893339|ref|XP_012846718.1|1.0e-1331.67PREDICTED: uncharacterized protein LOC105966675 [Erythranthe guttata][more]
gi|1002290026|ref|XP_015649341.1|1.3e-1334.62PREDICTED: uncharacterized protein LOC107281969 [Oryza sativa Japonica Group][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lsi03G007460.1Lsi03G007460.1mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePFAMPF14223Retrotran_gag_2coord: 100..190
score: 1.