Lsi04G024100 (gene) Bottle gourd (USVL1VR-Ls)

NameLsi04G024100
Typegene
OrganismLagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Descriptionhydroxyproline-rich glycoprotein family protein
Locationchr04 : 31234304 .. 31234795 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGACGTCGATGAATTTTACCGGCAACCGGCTGCTGTTCCTTTCAAATGGGAGATTAAACCCGGCGTCCCCAGAAATCACCACCGCCTCCGGCAGTCTCCAACTCACTCTCCTCTACACCATCAAAAGCTGAAGCCTCCTCCTGCTGTATCCCACTTCCTCCATCCTTCAAACTCCTTTCACTCCTCCCCACGAACCCGGTCCGACCGGTGGCGGTTTTCCCGGTCCAACCTCGTCGAACCCGAGCAAGTCTCGTCCGGTTGCTTCCCCTCGCCTTTGCCCAACCGGAAATCGGCCAAGACTGCGAGCCGGAAATCCGAACCGGATTACACCTCTGAATCGGAGACTTTGTCGCGGTGGTCGGTTTCCAACAGGAAGTCGATTTCGCCGTTTCGAAATTCAGTTTCGTCGTCGCCGTCGTCGTTCTCGTCGTACCAGTCATCGCCCCGTCCGACCAGTGATACGGAATGGGCAGGGTTTGGGCTCTTTTGA

mRNA sequence

ATGGACGTCGATGAATTTTACCGGCAACCGGCTGCTGTTCCTTTCAAATGGGAGATTAAACCCGGCGTCCCCAGAAATCACCACCGCCTCCGGCAGTCTCCAACTCACTCTCCTCTACACCATCAAAAGCTGAAGCCTCCTCCTGCTGTATCCCACTTCCTCCATCCTTCAAACTCCTTTCACTCCTCCCCACGAACCCGGTCCGACCGGTGGCGGTTTTCCCGGTCCAACCTCGTCGAACCCGAGCAAGTCTCGTCCGGTTGCTTCCCCTCGCCTTTGCCCAACCGGAAATCGGCCAAGACTGCGAGCCGGAAATCCGAACCGGATTACACCTCTGAATCGGAGACTTTGTCGCGGTGGTCGGTTTCCAACAGGAAGTCGATTTCGCCGTTTCGAAATTCAGTTTCGTCGTCGCCGTCGTCGTTCTCGTCGTACCAGTCATCGCCCCGTCCGACCAGTGATACGGAATGGGCAGGGTTTGGGCTCTTTTGA

Coding sequence (CDS)

ATGGACGTCGATGAATTTTACCGGCAACCGGCTGCTGTTCCTTTCAAATGGGAGATTAAACCCGGCGTCCCCAGAAATCACCACCGCCTCCGGCAGTCTCCAACTCACTCTCCTCTACACCATCAAAAGCTGAAGCCTCCTCCTGCTGTATCCCACTTCCTCCATCCTTCAAACTCCTTTCACTCCTCCCCACGAACCCGGTCCGACCGGTGGCGGTTTTCCCGGTCCAACCTCGTCGAACCCGAGCAAGTCTCGTCCGGTTGCTTCCCCTCGCCTTTGCCCAACCGGAAATCGGCCAAGACTGCGAGCCGGAAATCCGAACCGGATTACACCTCTGAATCGGAGACTTTGTCGCGGTGGTCGGTTTCCAACAGGAAGTCGATTTCGCCGTTTCGAAATTCAGTTTCGTCGTCGCCGTCGTCGTTCTCGTCGTACCAGTCATCGCCCCGTCCGACCAGTGATACGGAATGGGCAGGGTTTGGGCTCTTTTGA

Protein sequence

MDVDEFYRQPAAVPFKWEIKPGVPRNHHRLRQSPTHSPLHHQKLKPPPAVSHFLHPSNSFHSSPRTRSDRWRFSRSNLVEPEQVSSGCFPSPLPNRKSAKTASRKSEPDYTSESETLSRWSVSNRKSISPFRNSVSSSPSSFSSYQSSPRPTSDTEWAGFGLF
BLAST of Lsi04G024100 vs. TrEMBL
Match: A0A0A0KY52_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G361790 PE=4 SV=1)

HSP 1 Score: 182.2 bits (461), Expect = 5.1e-43
Identity = 94/122 (77.05%), Postives = 100/122 (81.97%), Query Frame = 1

Query: 1   MDVDEFYRQPAAVPFKWEIKPGVPRNHHRLRQSPTHSP--LHHQKLKPPPAVSHFLHPSN 60
           MD DEFYRQPAAVPFKWEIKPGVPRNHHRLR SPTHSP   H QKLKPPPAVSHF HP N
Sbjct: 1   MDTDEFYRQPAAVPFKWEIKPGVPRNHHRLRHSPTHSPPQHHRQKLKPPPAVSHFPHPPN 60

Query: 61  SFHSSPRTRSDRWRFSRSNLVEPEQVSSGCFPSPLPNRKSAKTASRK-SEPDYTSESETL 120
           S HSSPRT+S+RWRF RS  V     SSGCFPSPLPNRKS K+ SRK  EPDY+S+ +TL
Sbjct: 61  SLHSSPRTQSERWRFVRSEQVS----SSGCFPSPLPNRKSPKSVSRKLPEPDYSSDLDTL 118

BLAST of Lsi04G024100 vs. TrEMBL
Match: B9H7I2_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0005s20360g PE=4 SV=1)

HSP 1 Score: 166.4 bits (420), Expect = 2.9e-38
Identity = 100/194 (51.55%), Postives = 125/194 (64.43%), Query Frame = 1

Query: 1   MDVDEFYRQPAAVPFKWEIKPGVPRNHHRLRQS---------PTHSP------------L 60
           +++D+ +++P AVPFKWEI+PGVP+   + +Q          P+ SP            +
Sbjct: 3   VEIDDSFKKPGAVPFKWEIRPGVPKIQRQQKQQKKELSPPTLPSPSPPFNHRRPSPTPQV 62

Query: 61  HHQKLKPPPAVSHFLHP----SNSFHSSPRTRSDRWRFSRSNLVEPEQVSSGCFPSPLPN 120
             QKLKPPPA S FL P    ++SF S+PR+RS RWRF +   V PE VS GCFPSPL  
Sbjct: 63  QKQKLKPPPARSVFLPPPEPRAHSFRSAPRSRSGRWRFEQPTHVRPECVSPGCFPSPLLR 122

Query: 121 RK------SAKTASRKSEPDYTSESETLSRWSVSNRKSISPFRNSVSSSPSSFSSYQSSP 164
           RK      SA  A   SEPDYTS+ +TLSRWS+S+RKS S FR+S +   SSFSSYQSSP
Sbjct: 123 RKDSKRRTSAGIAKPASEPDYTSDLDTLSRWSISSRKSFSSFRDSPA---SSFSSYQSSP 182

BLAST of Lsi04G024100 vs. TrEMBL
Match: A0A067K9P8_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_12119 PE=4 SV=1)

HSP 1 Score: 161.8 bits (408), Expect = 7.2e-37
Identity = 98/188 (52.13%), Postives = 120/188 (63.83%), Query Frame = 1

Query: 1   MDVDEFYRQPAAVPFKWEIKPGVPR-NHHRLRQSPTH-----------------SPLHHQ 60
           M VD+ +R+P +VPFKWEI+PGVP+  H + +Q P                   +P   Q
Sbjct: 1   MAVDDSFRKPGSVPFKWEIRPGVPKIQHQQQKQQPKKLSPPTLPSPSQPFTSRLTPQPQQ 60

Query: 61  KLKPPPAVSHFLHP----SNSFHSSPRTRSDRWRFSRSNLVEPEQVSSGCFPSPLPNRKS 120
           KLKPPP    F+ P    + SF SS RTRS+RWRF +   V PE VS GCFPSPL  RK 
Sbjct: 61  KLKPPPGGFVFIPPPEPRTRSFRSSQRTRSERWRFEQPTRVRPECVSPGCFPSPLLKRKD 120

Query: 121 AKTAS---RKSEPDYTSESETLSRWSVSNRKSISPFRNSVSSSPSSFSSYQSSPRPTSDT 164
           +K  +    +SE DYTS+ ETL+RWS+S+RKS SPFR+   SS SSFSSYQSSPR  S+ 
Sbjct: 121 SKRRTVQLPESETDYTSDLETLARWSLSSRKSFSPFRD---SSASSFSSYQSSPRVPSEA 180

BLAST of Lsi04G024100 vs. TrEMBL
Match: V4V643_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10002610mg PE=4 SV=1)

HSP 1 Score: 153.3 bits (386), Expect = 2.6e-34
Identity = 95/188 (50.53%), Postives = 121/188 (64.36%), Query Frame = 1

Query: 1   MDVDEFYRQPAAVPFKWEIKPGVPRNHH-----RLRQSPTHSPL----HH--------QK 60
           M +D+  R+P AVPFKWEI+PGVP+        +L   P  SP     +H        QK
Sbjct: 1   MTIDDSCRKPGAVPFKWEIRPGVPKIQQQQPLKKLTPEPPLSPAVEFDNHTRSSPAPLQK 60

Query: 61  LKPPPAVSHFLHP----SNSFHSSPRTRSDRWRFSRSNLVEPEQVSSGCFPSPLPNRKSA 120
           L+PPPA  HF  P    S SF S+PRTRS+RWRF +   + P+ VS GCFP+PL  +K++
Sbjct: 61  LRPPPAALHFFPPVEPRSQSFRSTPRTRSERWRFEKP--LRPDCVSPGCFPAPLLRQKAS 120

Query: 121 KTA----SRKSEPDYTSESETLSRWSVSNRKSISPFRNSVSSSPSSFSSYQSSPRPTSDT 164
           K        +SEP Y+S+ ETL+RWSVS+RK++SPF  S +SS  SFSSY+SSPRP  D 
Sbjct: 121 KKRVLLPRPESEPGYSSDLETLARWSVSSRKTLSPFTASPASS--SFSSYKSSPRPVVDA 180

BLAST of Lsi04G024100 vs. TrEMBL
Match: B9S2W2_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0562950 PE=4 SV=1)

HSP 1 Score: 152.9 bits (385), Expect = 3.3e-34
Identity = 99/199 (49.75%), Postives = 123/199 (61.81%), Query Frame = 1

Query: 1   MDVDEFYRQPAAVPFKWEIKPGVPR-NHHR------------------LRQ---SPTHSP 60
           M +D+ +++P AVPFKWEI+PGVP+  HH+                  LR+   SP  +P
Sbjct: 1   MTIDDSFKKPGAVPFKWEIRPGVPKIQHHQQPKQLSPPKLPSPSPPFNLRRPSVSPLPTP 60

Query: 61  LHHQKLKPPPAVSHFLHP----SNSFHSSPRTRSDRWRFSRSNLVEPEQVSSGCFP-SPL 120
               KLKPPPA   FL P    ++SF S+PRTRS+RWRF +   V PE VS GCFP SPL
Sbjct: 61  QPQLKLKPPPAGFIFLPPPEPRTHSFRSAPRTRSERWRFDQPTRVGPECVSPGCFPSSPL 120

Query: 121 PNRKSAKTAS---------RKSEPDYTSESETLSRWSVSNRKSISPFRNSVSSSPSSFSS 164
             RK +K  +          +SE DY S+ ETL+RWS+S+RKS SPF +   SS SS+SS
Sbjct: 121 LKRKGSKRRTSHVHIDIPGSESEGDYVSDLETLARWSLSSRKSFSPFND---SSVSSYSS 180

BLAST of Lsi04G024100 vs. TAIR10
Match: AT1G77400.1 (AT1G77400.1 Protein of unknown function DUF688 (InterPro:IPR007789))

HSP 1 Score: 95.5 bits (236), Expect = 3.2e-20
Identity = 78/232 (33.62%), Postives = 103/232 (44.40%), Query Frame = 1

Query: 1   MDVDEFYRQPAAVPFKWEIKPGVPR--------------------------NHHRLRQSP 60
           +DVD+ +++P  +PF WEI+PGVP+                          +H +    P
Sbjct: 4   IDVDDSFKRPGTIPFSWEIRPGVPKTRMSQPGNTTPLQPPKKLSPLRFKPLSHSQPLLPP 63

Query: 61  THSPLHHQ-----------------------KLKPP---PAVSHFLHPSNSFHSSPRTRS 120
             SP                           KLKPP    ++S F  P  SF SSPR  S
Sbjct: 64  ALSPPSSSFISNSKSRPLSPLTPHSFSTTPSKLKPPRTPSSLSGFYSPGPSFRSSPRAFS 123

Query: 121 DRWRFSRSNLVEPEQ----------VSSGCFPSP---LPNRKSA----KTASRKSEPDYT 164
           +RW+  R N + PE              GCFPSP   L   KS     K+ SR     Y 
Sbjct: 124 ERWQLHRPNRIRPESEPEPSSDFSVAGFGCFPSPKFRLRKVKSGGSRRKSGSRSENDYYC 183

BLAST of Lsi04G024100 vs. NCBI nr
Match: gi|449449763|ref|XP_004142634.1| (PREDICTED: uncharacterized protein LOC101220757 [Cucumis sativus])

HSP 1 Score: 259.6 bits (662), Expect = 3.6e-66
Identity = 136/166 (81.93%), Postives = 143/166 (86.14%), Query Frame = 1

Query: 1   MDVDEFYRQPAAVPFKWEIKPGVPRNHHRLRQSPTHSPLHH--QKLKPPPAVSHFLHPSN 60
           MD DEFYRQPAAVPFKWEIKPGVPRNHHRLR SPTHSP  H  QKLKPPPAVSHF HP N
Sbjct: 1   MDTDEFYRQPAAVPFKWEIKPGVPRNHHRLRHSPTHSPPQHHRQKLKPPPAVSHFPHPPN 60

Query: 61  SFHSSPRTRSDRWRFSRSNLVEPEQVSSGCFPSPLPNRKSAKTASRK-SEPDYTSESETL 120
           S HSSPRT+S+RWRF RS  V     SSGCFPSPLPNRKS K+ SRK  EPDY+S+ +TL
Sbjct: 61  SLHSSPRTQSERWRFVRSEQVS----SSGCFPSPLPNRKSPKSVSRKLPEPDYSSDLDTL 120

Query: 121 SRWSVSNRKSISPFRNSVSSSPSSFSSYQSSPRPTSDTEWAGFGLF 164
           SRWSVS+RKSISPFR SVSSSPSSFSSYQSSPRPTSDTEWAGFGLF
Sbjct: 121 SRWSVSSRKSISPFRYSVSSSPSSFSSYQSSPRPTSDTEWAGFGLF 162

BLAST of Lsi04G024100 vs. NCBI nr
Match: gi|659086946|ref|XP_008444194.1| (PREDICTED: putative protein TPRXL [Cucumis melo])

HSP 1 Score: 256.9 bits (655), Expect = 2.4e-65
Identity = 138/167 (82.63%), Postives = 143/167 (85.63%), Query Frame = 1

Query: 1   MDVDEFYRQPAAVPFKWEIKPGVPRNHHRLRQSPTHSP--LHHQKLKPPPAVSHFLHPSN 60
           MD DEFYR+PAAVPFKWEIKPGVPRNHHR RQSPTHSP   H QKLKPPPAVSHF HPSN
Sbjct: 1   MDTDEFYRKPAAVPFKWEIKPGVPRNHHRPRQSPTHSPPQHHRQKLKPPPAVSHFPHPSN 60

Query: 61  SFHSSPRTRSDRWRFSRSNLVEPEQVSSGCFPSPLPNRKSAKTASRK-SEPDYTSESETL 120
           S HSSPRTRSDRWRF RS  V     SSGCFPSPLPNRKS K  SRK  EPDY+S+ +TL
Sbjct: 61  SLHSSPRTRSDRWRFVRSEQVS----SSGCFPSPLPNRKSPKALSRKFPEPDYSSDLDTL 120

Query: 121 SRWSVSNRKSISPFRNSV-SSSPSSFSSYQSSPRPTSDTEWAGFGLF 164
           SRWSVS+RKSISPFR SV SSSPSSFSSYQSSPRPTSDTEWAGFGLF
Sbjct: 121 SRWSVSSRKSISPFRYSVSSSSPSSFSSYQSSPRPTSDTEWAGFGLF 163

BLAST of Lsi04G024100 vs. NCBI nr
Match: gi|700199387|gb|KGN54545.1| (hypothetical protein Csa_4G361790 [Cucumis sativus])

HSP 1 Score: 182.2 bits (461), Expect = 7.4e-43
Identity = 94/122 (77.05%), Postives = 100/122 (81.97%), Query Frame = 1

Query: 1   MDVDEFYRQPAAVPFKWEIKPGVPRNHHRLRQSPTHSP--LHHQKLKPPPAVSHFLHPSN 60
           MD DEFYRQPAAVPFKWEIKPGVPRNHHRLR SPTHSP   H QKLKPPPAVSHF HP N
Sbjct: 1   MDTDEFYRQPAAVPFKWEIKPGVPRNHHRLRHSPTHSPPQHHRQKLKPPPAVSHFPHPPN 60

Query: 61  SFHSSPRTRSDRWRFSRSNLVEPEQVSSGCFPSPLPNRKSAKTASRK-SEPDYTSESETL 120
           S HSSPRT+S+RWRF RS  V     SSGCFPSPLPNRKS K+ SRK  EPDY+S+ +TL
Sbjct: 61  SLHSSPRTQSERWRFVRSEQVS----SSGCFPSPLPNRKSPKSVSRKLPEPDYSSDLDTL 118

BLAST of Lsi04G024100 vs. NCBI nr
Match: gi|224084958|ref|XP_002307455.1| (hypothetical protein POPTR_0005s20360g [Populus trichocarpa])

HSP 1 Score: 166.4 bits (420), Expect = 4.2e-38
Identity = 100/194 (51.55%), Postives = 125/194 (64.43%), Query Frame = 1

Query: 1   MDVDEFYRQPAAVPFKWEIKPGVPRNHHRLRQS---------PTHSP------------L 60
           +++D+ +++P AVPFKWEI+PGVP+   + +Q          P+ SP            +
Sbjct: 3   VEIDDSFKKPGAVPFKWEIRPGVPKIQRQQKQQKKELSPPTLPSPSPPFNHRRPSPTPQV 62

Query: 61  HHQKLKPPPAVSHFLHP----SNSFHSSPRTRSDRWRFSRSNLVEPEQVSSGCFPSPLPN 120
             QKLKPPPA S FL P    ++SF S+PR+RS RWRF +   V PE VS GCFPSPL  
Sbjct: 63  QKQKLKPPPARSVFLPPPEPRAHSFRSAPRSRSGRWRFEQPTHVRPECVSPGCFPSPLLR 122

Query: 121 RK------SAKTASRKSEPDYTSESETLSRWSVSNRKSISPFRNSVSSSPSSFSSYQSSP 164
           RK      SA  A   SEPDYTS+ +TLSRWS+S+RKS S FR+S +   SSFSSYQSSP
Sbjct: 123 RKDSKRRTSAGIAKPASEPDYTSDLDTLSRWSISSRKSFSSFRDSPA---SSFSSYQSSP 182

BLAST of Lsi04G024100 vs. NCBI nr
Match: gi|802636559|ref|XP_012078275.1| (PREDICTED: uncharacterized protein LOC105638965 [Jatropha curcas])

HSP 1 Score: 161.8 bits (408), Expect = 1.0e-36
Identity = 98/188 (52.13%), Postives = 120/188 (63.83%), Query Frame = 1

Query: 1   MDVDEFYRQPAAVPFKWEIKPGVPR-NHHRLRQSPTH-----------------SPLHHQ 60
           M VD+ +R+P +VPFKWEI+PGVP+  H + +Q P                   +P   Q
Sbjct: 1   MAVDDSFRKPGSVPFKWEIRPGVPKIQHQQQKQQPKKLSPPTLPSPSQPFTSRLTPQPQQ 60

Query: 61  KLKPPPAVSHFLHP----SNSFHSSPRTRSDRWRFSRSNLVEPEQVSSGCFPSPLPNRKS 120
           KLKPPP    F+ P    + SF SS RTRS+RWRF +   V PE VS GCFPSPL  RK 
Sbjct: 61  KLKPPPGGFVFIPPPEPRTRSFRSSQRTRSERWRFEQPTRVRPECVSPGCFPSPLLKRKD 120

Query: 121 AKTAS---RKSEPDYTSESETLSRWSVSNRKSISPFRNSVSSSPSSFSSYQSSPRPTSDT 164
           +K  +    +SE DYTS+ ETL+RWS+S+RKS SPFR+   SS SSFSSYQSSPR  S+ 
Sbjct: 121 SKRRTVQLPESETDYTSDLETLARWSLSSRKSFSPFRD---SSASSFSSYQSSPRVPSEA 180

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0KY52_CUCSA5.1e-4377.05Uncharacterized protein OS=Cucumis sativus GN=Csa_4G361790 PE=4 SV=1[more]
B9H7I2_POPTR2.9e-3851.55Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0005s20360g PE=4 SV=1[more]
A0A067K9P8_JATCU7.2e-3752.13Uncharacterized protein OS=Jatropha curcas GN=JCGZ_12119 PE=4 SV=1[more]
V4V643_9ROSI2.6e-3450.53Uncharacterized protein OS=Citrus clementina GN=CICLE_v10002610mg PE=4 SV=1[more]
B9S2W2_RICCO3.3e-3449.75Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0562950 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G77400.13.2e-2033.62 Protein of unknown function DUF688 (InterPro:IPR007789)[more]
Match NameE-valueIdentityDescription
gi|449449763|ref|XP_004142634.1|3.6e-6681.93PREDICTED: uncharacterized protein LOC101220757 [Cucumis sativus][more]
gi|659086946|ref|XP_008444194.1|2.4e-6582.63PREDICTED: putative protein TPRXL [Cucumis melo][more]
gi|700199387|gb|KGN54545.1|7.4e-4377.05hypothetical protein Csa_4G361790 [Cucumis sativus][more]
gi|224084958|ref|XP_002307455.1|4.2e-3851.55hypothetical protein POPTR_0005s20360g [Populus trichocarpa][more]
gi|802636559|ref|XP_012078275.1|1.0e-3652.13PREDICTED: uncharacterized protein LOC105638965 [Jatropha curcas][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lsi04G024100.1Lsi04G024100.1mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR35466FAMILY NOT NAMEDcoord: 1..163
score: 3.4
NoneNo IPR availablePANTHERPTHR35466:SF2SUBFAMILY NOT NAMEDcoord: 1..163
score: 3.4

The following gene(s) are paralogous to this gene:

None