Lsi02G005280 (gene) Bottle gourd (USVL1VR-Ls)

Name: Lsi02G005280
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description: Centromere protein S
Location: chr02 : 4586481 .. 4588701 (-)
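The location value can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which the page does not state explicitly.

```python
import re

def parse_location(text):
    """Parse a value such as 'chr02 : 4586481 .. 4588701 (-)'.

    Returns (chromosome, start, end, strand).
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1

chrom, start, end, strand = parse_location("chr02 : 4586481 .. 4588701 (-)")
print(chrom, strand, span_length(start, end))
```

Under that coordinate assumption the feature spans 2221 bp on the minus strand of chr02.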
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS
CGTCGCCAGTGGCATTTCTGGAGGGAAATAAAGGCGGATGGAAGCAGTGACAGTGCCGCCAAATGCAGTAAATTGTGAATCCAGAAGCCTCTGCTCTGCAATGGAAACTGGGATGGAAGAAGACGACTCCGCCTCCGAACTCTTGAGGGACAGATTCCGACTCTCCACCATTTCTATCGCTGAAGCTGAAGGTTAATTATCTTCATCTTACTCTCTTCATCTTCGAGCCTGTCTTCTTTTATATGTAATATAAGCTGAATCGACCGGAGAAAGAGAAAGTATTATCCTCCGATTACTGAATGCGAAGGCGAACTTTCCATATTTCTGTTTGGCTTACTCTATTTTCGAACTGTTTGGTTTGTTTTTCAGCGAAGAGAAGCGGCATGGAAATTTCTGAACCCGTGATGACTTGTGTCGCTGATTTAGCCTTCAAATATACAAGTTAGTTTACTTTTCTTTTCTGCCTGTTTTGAGTCTGCAATGTTATTTTGAGCACTTTGTTTCTACGTGACTAATCCCACCATTCGGAGCACTTTCATTTGTTGATTCTTCATTATGTGGATAATCTAATTGCTAATTATTTTAATTTCCCTTCTCTTCTACCGGTGTTTTCATCATTTTTAGGTAGCTTATATGTGATTTCAATTATAAATTTCCTTATTATTTAGCATGCATTCGGTTTGTTGGGACGCCTCTCTGAGCTTTGTTGTTAAATTGTGTGACCTAGTTGGAAGAAACTATATATCTTTGGTCAATTCTGTTTCCAGAAGAACAAATGATGGAGAGACAATTGTGATATCTTAAAGAAAGACTTTGCATGATACTTTTGGATTCTAATCAGATGTTCTTATAGTTCATATGAATGTTTCTTCTTATTCTACCTTGAGACTTACACCTTACAGCGATCATGTTAGGTGTAAACCCTTAACTCTCGTTGTGCTCTTGTGGCGCTAATCTTGAGACCTAGATTTTGTGCTTCGCTTCAAGGAATACAATGGGAAGGATAGTCCTAGTTGTCTTGTGTTTCTTTGTTTATATTTCAATCACGTAAAGAAGACATATATTATTTTAATTTATATTTATATATTAGCTTCAAGCCATTTTCTGTTATTGATGAATGTTTAATTCATTGTCTAAAAGAAGTAAATATATACTTATGAATGATTTTGTAGGTTTATAAAGTTTTGACCAGACTTTAAGTTCATATCTTGGTACAGCGTCTATTTGTAGTCAGAGGGCTAACATATGAACCACCAATGGTATATATAATATATTAGATATTATTGGGTTCTGATCCATCTCAAAATCAATTGGCAAAGAGAGGAATAGTCCATCTATCTTTTAAAGAGTGTGAGATCCCTTGATTTGTCCAATGTAGAATTCTCAACATACCCTCTCAAGATGGTGCCTCTTTTGGGTTCATCATTCTTGATGGATCCCAATTTCTTTTTATTGGACCGAATACCCATTTGAGTTTTATGGGCTCTGACCCCATATTAGATAATATGGGATTCCATCTCAAAACCAATTGATAAGGAGAGTGTAATCCATCTATCTTATAAAGAGTGTGAGGTCCCTTGATTTGTCCAATGTGGGATCCTCAACATAGTACATGCAAGGAAAGCAAATTTTTGTTTACTGTGCTGCAGTAGTTCTAGTTGCATCTAAGTCAGCAAGGAAAGAAGATAATTCTGTCATATTTATATTTGCTTTGTTTTTATGATTCTCAAAGTGTTGCAATGAGATAACGGAATGATAACTTGTGTAGCAGAACAGTTGGCGAAGGACCTTGAGTTATTTGTTCAGCATGCTGGTCGGAAATCTGTGAATACAGAAGATGTAATACTATCAGGTTAGATTGTCCGATTCCCAAGTTGTTGTATTACTGTCTAGTTATATTTAGTCCATATGCAACAGACTTTAAGTATTACTTAATTGCCTAAACCTCTTCGCTTCTGCTTCATTTTGGTCTCAGCCCATAGAAATGAGCATTTGGCTGC
CATATTAACATCCATCTGCAATGATCTAAAGACTAAAGAACCTCAAGGTGAGAGAGCGAGAAAAGGCACCAAAAAAGGAAGATAGAGATAGAGGTGTAGTGCATATTACTGACGCCTAATCACACCTCCCATCGGGCTGATCTCTAGTTAACAAGGGTGCATACCAACAAGCGATGGCAATAGCTTATGATTTCTTTCCTTTTCTTGGTTATACTAACTAG

mRNA sequence

CGTCGCCAGTGGCATTTCTGGAGGGAAATAAAGGCGGATGGAAGCAGTGACAGTGCCGCCAAATGCAGTAAATTGTGAATCCAGAAGCCTCTGCTCTGCAATGGAAACTGGGATGGAAGAAGACGACTCCGCCTCCGAACTCTTGAGGGACAGATTCCGACTCTCCACCATTTCTATCGCTGAAGCTGAAGCGAAGAGAAGCGGCATGGAAATTTCTGAACCCGTGATGACTTGTGTCGCTGATTTAGCCTTCAAATATACAACAGAACAGTTGGCGAAGGACCTTGAGTTATTTGTTCAGCATGCTGGTCGGAAATCTGTGAATACAGAAGATGTAATACTATCAGCCCATAGAAATGAGCATTTGGCTGCCATATTAACATCCATCTGCAATGATCTAAAGACTAAAGAACCTCAAGTTAACAAGGGTGCATACCAACAAGCGATGGCAATAGCTTATGATTTCTTTCCTTTTCTTGGTTATACTAACTAG

Coding sequence (CDS)

ATGGAAGCAGTGACAGTGCCGCCAAATGCAGTAAATTGTGAATCCAGAAGCCTCTGCTCTGCAATGGAAACTGGGATGGAAGAAGACGACTCCGCCTCCGAACTCTTGAGGGACAGATTCCGACTCTCCACCATTTCTATCGCTGAAGCTGAAGCGAAGAGAAGCGGCATGGAAATTTCTGAACCCGTGATGACTTGTGTCGCTGATTTAGCCTTCAAATATACAACAGAACAGTTGGCGAAGGACCTTGAGTTATTTGTTCAGCATGCTGGTCGGAAATCTGTGAATACAGAAGATGTAATACTATCAGCCCATAGAAATGAGCATTTGGCTGCCATATTAACATCCATCTGCAATGATCTAAAGACTAAAGAACCTCAAGTTAACAAGGGTGCATACCAACAAGCGATGGCAATAGCTTATGATTTCTTTCCTTTTCTTGGTTATACTAACTAG

Protein sequence

MEAVTVPPNAVNCESRSLCSAMETGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKYTTEQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKTKEPQVNKGAYQQAMAIAYDFFPFLGYTN
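The protein sequence follows from the CDS by codon-wise translation. The sketch below assumes the standard genetic code (NCBI translation table 1); the record does not say which table the annotation pipeline used, and the helper name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

# First ten codons of the CDS listed above.
print(translate("ATGGAAGCAGTGACAGTGCCGCCAAATGCA"))  # MEAVTVPPNA
```

Applied to the first ten codons of the CDS, this reproduces the first ten residues of the protein sequence shown above.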
BLAST of Lsi02G005280 vs. TrEMBL
Match: A0A0A0LR97_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G051770 PE=4 SV=1)

HSP 1 Score: 190.3 bits (482), Expect = 1.7e-45
Identity = 99/106 (93.40%), Positives = 103/106 (97.17%), Query Frame = 1

Query: 22  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKYTTEQLAK 81
           METGMEEDDSASELLRDRFRLS+ISIAEAEA +SGMEISEPVMTCVADLAFKY T+QLAK
Sbjct: 1   METGMEEDDSASELLRDRFRLSSISIAEAEANKSGMEISEPVMTCVADLAFKY-TKQLAK 60

Query: 82  DLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKTKEPQ 128
           DLELF QHAGRKSVNTEDVIL+AHRNEHLAAILTSICNDLKTKEPQ
Sbjct: 61  DLELFAQHAGRKSVNTEDVILTAHRNEHLAAILTSICNDLKTKEPQ 105
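The identity and positive counts in an HSP header can be recovered from the middle line of the alignment, where a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below illustrates this with the alignment rows of the hit above; the function name `hsp_stats` is hypothetical and this is not the code used by BLAST itself.

```python
def hsp_stats(query_rows, midline_rows):
    """Count identities, positives, and alignment length from BLAST rows.

    query_rows   -- the query strings of each alignment block
    midline_rows -- the matching middle lines (letters, '+', spaces)
    """
    identities = positives = length = 0
    for query, midline in zip(query_rows, midline_rows):
        # Pad in case trailing spaces were stripped from the middle line.
        midline = midline.ljust(len(query))
        length += len(query)
        identities += sum(ch.isalpha() for ch in midline)
        positives += sum(ch.isalpha() or ch == "+" for ch in midline)
    return identities, positives, length

query_rows = [
    "METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKYTTEQLAK",
    "DLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKTKEPQ",
]
midline_rows = [
    "METGMEEDDSASELLRDRFRLS+ISIAEAEA +SGMEISEPVMTCVADLAFKY T+QLAK",
    "DLELF QHAGRKSVNTEDVIL+AHRNEHLAAILTSICNDLKTKEPQ",
]

ident, pos, length = hsp_stats(query_rows, midline_rows)
print(f"Identity = {ident}/{length} ({100 * ident / length:.2f}%)")
print(f"Positives = {pos}/{length} ({100 * pos / length:.2f}%)")
```

For this hit the counts come out as 99/106 and 103/106, matching the header line of the HSP.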

BLAST of Lsi02G005280 vs. TrEMBL
Match: M5WE31_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa017850mg PE=4 SV=1)

HSP 1 Score: 158.3 bits (399), Expect = 7.4e-36
Identity = 82/101 (81.19%), Positives = 90/101 (89.11%), Query Frame = 1

Query: 27  EEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKYTTEQLAKDLELF 86
           EE+DS SELLRDRFR+STISIAEAEAKR+ MEIS PVMTC+ADLAFK+ TEQLAKDLELF
Sbjct: 4   EEEDSVSELLRDRFRVSTISIAEAEAKRNDMEISGPVMTCIADLAFKF-TEQLAKDLELF 63

Query: 87  VQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKTKEPQ 128
            QHAGRK+ N EDVILSAHRNEHLAA+L S  NDLK +EPQ
Sbjct: 64  AQHAGRKTANMEDVILSAHRNEHLAALLRSFSNDLKAREPQ 103

BLAST of Lsi02G005280 vs. TrEMBL
Match: A0A022QDG1_ERYGU (Uncharacterized protein OS=Erythranthe guttata GN=MIMGU_mgv1a016064mg PE=4 SV=1)

HSP 1 Score: 156.8 bits (395), Expect = 2.1e-35
Identity = 77/104 (74.04%), Positives = 92/104 (88.46%), Query Frame = 1

Query: 27  EEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKYTTEQLAKDLELF 86
           EE++SASELLRDRFRL TI+IAEAEAK++GME+S+P+M C++DLAFKY  +QLAKDLELF
Sbjct: 11  EEEESASELLRDRFRLCTIAIAEAEAKQNGMEVSQPIMACISDLAFKY-AQQLAKDLELF 70

Query: 87  VQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKTKEPQVNK 131
            QH GRKSVN EDVILSAHRN+HL+A L S CNDLK KEPQ ++
Sbjct: 71  AQHGGRKSVNMEDVILSAHRNDHLSASLRSFCNDLKAKEPQSDR 113

BLAST of Lsi02G005280 vs. TrEMBL
Match: M1BKK1_SOLTU (Uncharacterized protein OS=Solanum tuberosum GN=PGSC0003DMG400018389 PE=4 SV=1)

HSP 1 Score: 153.7 bits (387), Expect = 1.8e-34
Identity = 75/110 (68.18%), Positives = 93/110 (84.55%), Query Frame = 1

Query: 21  AMETGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKYTTEQLA 80
           A +   EE+++ ++LLRDRFRL TISIAE EAK+ GME+S+P++TC++DLAFK+  EQL+
Sbjct: 9   ASDVEREEEEAVTDLLRDRFRLCTISIAEGEAKQCGMEVSQPIITCISDLAFKF-AEQLS 68

Query: 81  KDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKTKEPQVNK 131
           KDLELF QHAGRKSVN EDVILSAHRN+HLAA L S CNDLKTKEP + +
Sbjct: 69  KDLELFAQHAGRKSVNMEDVILSAHRNDHLAASLRSFCNDLKTKEPNLER 117

BLAST of Lsi02G005280 vs. TrEMBL
Match: B9IFK8_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0015s11720g PE=4 SV=2)

HSP 1 Score: 152.1 bits (383), Expect = 5.3e-34
Identity = 77/105 (73.33%), Positives = 88/105 (83.81%), Query Frame = 1

Query: 23  ETGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKYTTEQLAKD 82
           E   EEDDS S++L+DRFRLS ISIAE EAK++G+EISEP+ +C+ADLA  YT EQLAK+
Sbjct: 8   EVEREEDDSVSDILQDRFRLSAISIAENEAKKNGVEISEPITSCIADLALNYT-EQLAKE 67

Query: 83  LELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKTKEPQ 128
           LE F QHAGRKSVN EDVILSAHRNEHL A+L S CNDLK KEPQ
Sbjct: 68  LEAFAQHAGRKSVNMEDVILSAHRNEHLTALLRSFCNDLKEKEPQ 111

BLAST of Lsi02G005280 vs. TAIR10
Match: AT5G50930.1 (AT5G50930.1 Histone superfamily protein)

HSP 1 Score: 141.7 bits (356), Expect = 3.6e-34
Identity = 77/131 (58.78%), Positives = 92/131 (70.23%), Query Frame = 1

Query: 8   PNAVNCESRSLCSAMETGME-----------EDDSASELLRDRFRLSTISIAEAEAKRSG 67
           P     +  S C AM+ G E           E+ S  +L+RDRFRLS ISIAEAEAK++G
Sbjct: 85  PTITQAKKPSYCFAMDVGGEDISDLQVDQIVEEYSMDDLIRDRFRLSAISIAEAEAKKNG 144

Query: 68  MEISEPVMTCVADLAFKYTTEQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTS 127
           MEI  PV+ CVADLAFKY  E +AKDLELF  HAGRK VN +DV+LSAHRN++LAA L S
Sbjct: 145 MEIGGPVVACVADLAFKY-AENVAKDLELFAHHAGRKVVNMDDVVLSAHRNDNLAASLRS 204

BLAST of Lsi02G005280 vs. NCBI nr
Match: gi|778658214|ref|XP_011652290.1| (PREDICTED: centromere protein S isoform X1 [Cucumis sativus])

HSP 1 Score: 198.7 bits (504), Expect = 7.0e-48
Identity = 101/106 (95.28%), Positives = 104/106 (98.11%), Query Frame = 1

Query: 22  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKYTTEQLAK 81
           METGMEEDDSASELLRDRFRLS+ISIAEAEA +SGMEISEPVMTCVADLAFKYTTEQLAK
Sbjct: 1   METGMEEDDSASELLRDRFRLSSISIAEAEANKSGMEISEPVMTCVADLAFKYTTEQLAK 60

Query: 82  DLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKTKEPQ 128
           DLELF QHAGRKSVNTEDVIL+AHRNEHLAAILTSICNDLKTKEPQ
Sbjct: 61  DLELFAQHAGRKSVNTEDVILTAHRNEHLAAILTSICNDLKTKEPQ 106

BLAST of Lsi02G005280 vs. NCBI nr
Match: gi|659067571|ref|XP_008440142.1| (PREDICTED: centromere protein S isoform X1 [Cucumis melo])

HSP 1 Score: 193.7 bits (491), Expect = 2.3e-46
Identity = 99/106 (93.40%), Positives = 102/106 (96.23%), Query Frame = 1

Query: 22  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKYTTEQLAK 81
           MET MEEDDSASELLRDRFRLSTISIAEAEA +SGMEISEPVMTCVADLAFK+TTEQLAK
Sbjct: 1   METRMEEDDSASELLRDRFRLSTISIAEAEANKSGMEISEPVMTCVADLAFKFTTEQLAK 60

Query: 82  DLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKTKEPQ 128
           DLELF QHAGRKSVNTEDVIL+AHRNEHLAAILTSICNDLK KEPQ
Sbjct: 61  DLELFAQHAGRKSVNTEDVILTAHRNEHLAAILTSICNDLKAKEPQ 106

BLAST of Lsi02G005280 vs. NCBI nr
Match: gi|449473866|ref|XP_004154006.1| (PREDICTED: centromere protein S isoform X2 [Cucumis sativus])

HSP 1 Score: 190.3 bits (482), Expect = 2.5e-45
Identity = 99/106 (93.40%), Positives = 103/106 (97.17%), Query Frame = 1

Query: 22  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKYTTEQLAK 81
           METGMEEDDSASELLRDRFRLS+ISIAEAEA +SGMEISEPVMTCVADLAFKY T+QLAK
Sbjct: 1   METGMEEDDSASELLRDRFRLSSISIAEAEANKSGMEISEPVMTCVADLAFKY-TKQLAK 60

Query: 82  DLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKTKEPQ 128
           DLELF QHAGRKSVNTEDVIL+AHRNEHLAAILTSICNDLKTKEPQ
Sbjct: 61  DLELFAQHAGRKSVNTEDVILTAHRNEHLAAILTSICNDLKTKEPQ 105

BLAST of Lsi02G005280 vs. NCBI nr
Match: gi|659067575|ref|XP_008440159.1| (PREDICTED: centromere protein S isoform X2 [Cucumis melo])

HSP 1 Score: 185.3 bits (469), Expect = 8.1e-44
Identity = 97/106 (91.51%), Positives = 101/106 (95.28%), Query Frame = 1

Query: 22  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKYTTEQLAK 81
           MET MEEDDSASELLRDRFRLSTISIAEAEA +SGMEISEPVMTCVADLAFK+ T+QLAK
Sbjct: 1   METRMEEDDSASELLRDRFRLSTISIAEAEANKSGMEISEPVMTCVADLAFKF-TKQLAK 60

Query: 82  DLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKTKEPQ 128
           DLELF QHAGRKSVNTEDVIL+AHRNEHLAAILTSICNDLK KEPQ
Sbjct: 61  DLELFAQHAGRKSVNTEDVILTAHRNEHLAAILTSICNDLKAKEPQ 105

BLAST of Lsi02G005280 vs. NCBI nr
Match: gi|595858860|ref|XP_007210833.1| (hypothetical protein PRUPE_ppa017850mg [Prunus persica])

HSP 1 Score: 158.3 bits (399), Expect = 1.1e-35
Identity = 82/101 (81.19%), Positives = 90/101 (89.11%), Query Frame = 1

Query: 27  EEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKYTTEQLAKDLELF 86
           EE+DS SELLRDRFR+STISIAEAEAKR+ MEIS PVMTC+ADLAFK+ TEQLAKDLELF
Sbjct: 4   EEEDSVSELLRDRFRVSTISIAEAEAKRNDMEISGPVMTCIADLAFKF-TEQLAKDLELF 63

Query: 87  VQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKTKEPQ 128
            QHAGRK+ N EDVILSAHRNEHLAA+L S  NDLK +EPQ
Sbjct: 64  AQHAGRKTANMEDVILSAHRNEHLAALLRSFSNDLKAREPQ 103

The following BLAST results are available for this feature:
vs. TrEMBL
Match Name        E-value  Identity  Description
A0A0A0LR97_CUCSA  1.7e-45  93.40     Uncharacterized protein OS=Cucumis sativus GN=Csa_1G051770 PE=4 SV=1
M5WE31_PRUPE      7.4e-36  81.19     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa017850mg PE=4 SV=1
A0A022QDG1_ERYGU  2.1e-35  74.04     Uncharacterized protein OS=Erythranthe guttata GN=MIMGU_mgv1a016064mg PE=4 SV=1
M1BKK1_SOLTU      1.8e-34  68.18     Uncharacterized protein OS=Solanum tuberosum GN=PGSC0003DMG400018389 PE=4 SV=1
B9IFK8_POPTR      5.3e-34  73.33     Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0015s11720g PE=4 SV=2

vs. TAIR10
Match Name   E-value  Identity  Description
AT5G50930.1  3.6e-34  58.78     Histone superfamily protein

vs. NCBI nr
Match Name                        E-value  Identity  Description
gi|778658214|ref|XP_011652290.1|  7.0e-48  95.28     PREDICTED: centromere protein S isoform X1 [Cucumis sativus]
gi|659067571|ref|XP_008440142.1|  2.3e-46  93.40     PREDICTED: centromere protein S isoform X1 [Cucumis melo]
gi|449473866|ref|XP_004154006.1|  2.5e-45  93.40     PREDICTED: centromere protein S isoform X2 [Cucumis sativus]
gi|659067575|ref|XP_008440159.1|  8.1e-44  91.51     PREDICTED: centromere protein S isoform X2 [Cucumis melo]
gi|595858860|ref|XP_007210833.1|  1.1e-35  81.19     hypothetical protein PRUPE_ppa017850mg [Prunus persica]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0046982  protein heterodimerization activity

Vocabulary: Cellular Component
Term        Definition
GO:0071821  FANCM-MHF complex

Vocabulary: INTERPRO
Term        Definition
IPR009072   Histone-fold

GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0007129      synapsis
biological_process  GO:0006457      protein folding
biological_process  GO:0017009      protein-phycocyanobilin linkage
biological_process  GO:0008150      biological_process
cellular_component  GO:0071821      FANCM-MHF complex
cellular_component  GO:0005789      endoplasmic reticulum membrane
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0046982      protein heterodimerization activity
molecular_function  GO:0005509      calcium ion binding
molecular_function  GO:0030246      carbohydrate binding
molecular_function  GO:0016829      lyase activity
molecular_function  GO:0051082      unfolded protein binding
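
The GO assignment rows are whitespace-delimited with three fields (category, accession, term name), so they can be grouped by category with a few lines of code. This is a sketch for illustration only; `GO_ROWS` simply repeats the rows listed above, and the function name `group_go_terms` is hypothetical.

```python
from collections import defaultdict

GO_ROWS = """\
biological_process GO:0007129 synapsis
biological_process GO:0006457 protein folding
biological_process GO:0017009 protein-phycocyanobilin linkage
biological_process GO:0008150 biological_process
cellular_component GO:0071821 FANCM-MHF complex
cellular_component GO:0005789 endoplasmic reticulum membrane
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0046982 protein heterodimerization activity
molecular_function GO:0005509 calcium ion binding
molecular_function GO:0030246 carbohydrate binding
molecular_function GO:0016829 lyase activity
molecular_function GO:0051082 unfolded protein binding
"""

def group_go_terms(text):
    """Group 'category accession name' rows into {category: [(accession, name)]}."""
    grouped = defaultdict(list)
    for line in text.splitlines():
        if not line.strip():
            continue
        # Split on the first two runs of whitespace; the term name may contain spaces.
        category, accession, name = line.split(None, 2)
        grouped[category].append((accession, name))
    return dict(grouped)

for category, terms in group_go_terms(GO_ROWS).items():
    print(category, len(terms))
```

Because the split is on runs of whitespace, the same parsing works whether the columns are separated by single spaces or padded for alignment.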

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Lsi02G005280.1  Lsi02G005280.1  mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR Term   IPR Description   Source   Source Term       Source Description    Alignment
IPR009072  Histone-fold      GENE3D   G3DSA:1.10.20.10  -                     coord: 59..109, score: 6.
IPR009072  Histone-fold      unknown  SSF47113          Histone-fold          coord: 55..110, score: 3.5
None       No IPR available  PANTHER  PTHR22980         CORTISTATIN           coord: 26..127, score: 1.9
None       No IPR available  PANTHER  PTHR22980:SF0     CENTROMERE PROTEIN S  coord: 26..127, score: 1.9

The following gene(s) are paralogous to this gene:

None