Csa1G025880 (gene) Cucumber (Chinese Long) v2

NameCsa1G025880
Typegene
OrganismCucumis. sativus (Cucumber (Chinese Long) v2)
DescriptionSmall nuclear ribonucleoprotein-associated protein B; contains IPR010920 (Like-Sm (LSM) domain)
LocationChr1 : 2903542 .. 2905726 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TATGTTAGTGTATGTAAAAGTGTGTATATTTTTTTGATTACAAGAGGTAGAGATTGTTGAGATGCGTTTTTGATTTAGATCCTATTGTTACTCCTCCCCGCTCTCTCCATCTCTCTCTCTTTCATCCACGAGAAGCCAAGAATCACAATCGGCTCCCAAGTTCCCCTCTGCTCCGCCGTCCCTGTATGACTCTACTTACTCCCTTTTAATTACCTTACCAACATGTTTCCCTTTTTCTACTTACTCCATTTCCTTCGATCCTTCACGATTTCCTGCTCATTCTAAACTGATCCTTTCTTTCTCACCCCCCATCCCATTATTGCCTTTCTTTTAGGCCTCACCATGATTTTCTTTTATGTTTTATTACAAATCCAACCTGAATCAGACCACCCTCTTATTGGGCCAATGGTTCAAGGCCATTTTCATATATTAACTTGGACATTTAGGTGTTTTAAGTTTTAGTCGATGCTTTCCATGATACAATACTGTGCTACTCAACTCAATTTACACATGATTCTCAATTATGAAAACAATGGATTAATCTCTGGCACTTCGATGCTGGGTATATGGTTATTGGTTATTAGTTATTATGATTTGTTGTTTTTTGTGTTTCATACTTCATGACCTGCAGGCTTCTTGATGTATTCTTCTAAAACTAGATTGTTGAATCCCATATGCTTGCTATTGAATGTGGATTCTAATTCCTTACCCATAATTGCTTTGAAACCTGCTTCCATATTCACTTTAGTTTTACCTATTTCCTTTAGTGCAAAGGAAATTGTGCTTTCTTTGGCTTTACCATGTCTTTATCGAAACACTCATGCCTAAAAGAGGCACAACTTGCATTAATTAGCCATGTCAAGAGCTTTAGTTCAATTGAGTAGTTCTCTTCTCTTTATTCCTCCACCTTCCCCCTCTCTCTGTACCTTCCACTGGGCCTCCTTCAATACATCCCTGTTGATCATTGTGAGGGACATTCTTTTGCTGCTGTTTTCTTTTAATATAACAACTGTGAGATGGGAATCCAACCTCCGACCTCTAGGATAGAGAAGGTTATGTCAATTACAATGGAGCTGAGCACACTTTGGCATCTCTAGCAAAGTTGTCATCATTTCCATCCATTATCTATTGCTTGAATTTGCACTTATATATGGTAGTTCCTCATCTGGGGATTGTTGTTTTGGTGAACACAGTTCTATGCGTAATTGGGATGCAGCTCTTAAGATCGATGTTTTTTATTATGGAATGTGCAATTAGGACTTGATCTTTCCTTCCATTTGTTTTTCATTCGTTGCTGTTGCTGATGTTATAGAGGCTGCTCCCTTTTTCTTTGAGTGGATTCTCTGTCTCTTGTGTTGAGCATTCATTACTTTTACGGCTTTGATCTTTTTCCTTTCGACAAAATCTTCAATCCAGTTTTTCCCTGCAATTTCAACCCCCTTTTTTCACTCAGATTACGCTTTAGTTGCAATTCTGATAATATTATTGGTTTTGTTCATCAACTACAACTATCTTTCCACATATAGGGATCAGTTGAAGTAACCATAGATGGAACAAGAATCAGGAGGATCCATGGTTCAGGATGGGAGCAATGTTGAGTCTAACCCAGAGAGTTTAGATCACATAGGAAAGGTGAGAAAGCTGCTGTTTCGTCGAATGCTCATAGGTATTAAAGATGGAAGGTTTTTCTTGGGAAACTTTTACTGCATTGACAAGCAAGGAAATATCATCCTTCAAGATGCAGTAGAGTATCGTAGCACTCGGCGTAGCTCACCTTCTCCAATGGAACAACGGTGCCTTGGTCTTATTCTTATACCCAACTCTTGCCGTGTATCCTGTCATGTGGATAGTACCATTGATGAACAATTGGCATTGCTATCAGTTTAGCAAATATAAGATGAAGCCTGAGATTAAGAAGAGATGAAAAAAATGTGCACGCTTCTTTGATGGATTTGACTGTTTATATTGTTACAGAAAAATTCTAAAACATCTTGATGTTTTTAGTGGAAATATGGGATTAGGAAATACGATTTGTTTCTCTTATGGATAAGATATTGTCGCTTTCTTTATTAAGTTGAAGAGTGGATGCAAAGTAGGAGGAACCTTTGAAAAGATTGTGGAAGAAACCCGAAGGCTTGATAAAGTTATTGTTTTTTAGGGAAAATTATCCCAAGTTTGGAATAAC

mRNA sequence

ATGGAACAAGAATCAGGAGGATCCATGGTTCAGGATGGGAGCAATGTTGAGTCTAACCCAGAGAGTTTAGATCACATAGGAAAGGTGAGAAAGCTGCTGTTTCGTCGAATGCTCATAGGTATTAAAGATGGAAGGTTTTTCTTGGGAAACTTTTACTGCATTGACAAGCAAGGAAATATCATCCTTCAAGATGCAGTAGAGTATCGTAGCACTCGGCGTAGCTCACCTTCTCCAATGGAACAACGGTGCCTTGGTCTTATTCTTATACCCAACTCTTGCCGTGTATCCTGTCATGTGGATAGTACCATTGATGAACAATTGGCATTGCTATCAGTTTAG

Coding sequence (CDS)

ATGGAACAAGAATCAGGAGGATCCATGGTTCAGGATGGGAGCAATGTTGAGTCTAACCCAGAGAGTTTAGATCACATAGGAAAGGTGAGAAAGCTGCTGTTTCGTCGAATGCTCATAGGTATTAAAGATGGAAGGTTTTTCTTGGGAAACTTTTACTGCATTGACAAGCAAGGAAATATCATCCTTCAAGATGCAGTAGAGTATCGTAGCACTCGGCGTAGCTCACCTTCTCCAATGGAACAACGGTGCCTTGGTCTTATTCTTATACCCAACTCTTGCCGTGTATCCTGTCATGTGGATAGTACCATTGATGAACAATTGGCATTGCTATCAGTTTAG

Protein sequence

MEQESGGSMVQDGSNVESNPESLDHIGKVRKLLFRRMLIGIKDGRFFLGNFYCIDKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVDSTIDEQLALLSV*
BLAST of Csa1G025880 vs. TrEMBL
Match: A0A0A0LSZ5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G025880 PE=4 SV=1)

HSP 1 Score: 226.5 bits (576), Expect = 1.6e-56
Identity = 112/112 (100.00%), Postives = 112/112 (100.00%), Query Frame = 1

Query: 1   MEQESGGSMVQDGSNVESNPESLDHIGKVRKLLFRRMLIGIKDGRFFLGNFYCIDKQGNI 60
           MEQESGGSMVQDGSNVESNPESLDHIGKVRKLLFRRMLIGIKDGRFFLGNFYCIDKQGNI
Sbjct: 1   MEQESGGSMVQDGSNVESNPESLDHIGKVRKLLFRRMLIGIKDGRFFLGNFYCIDKQGNI 60

Query: 61  ILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVDSTIDEQLALLSV 113
           ILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVDSTIDEQLALLSV
Sbjct: 61  ILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVDSTIDEQLALLSV 112

BLAST of Csa1G025880 vs. TrEMBL
Match: A0A061GCV7_THECC (Small nuclear ribonucleoprotein family protein isoform 2 OS=Theobroma cacao GN=TCM_029454 PE=4 SV=1)

HSP 1 Score: 171.4 bits (433), Expect = 6.3e-40
Identity = 83/112 (74.11%), Postives = 96/112 (85.71%), Query Frame = 1

Query: 1   MEQESGGSMVQDGSNVESNPESLDHIGKVRKLLFRRMLIGIKDGRFFLGNFYCIDKQGNI 60
           MEQE G S++Q GS+ + N  S D + +VRKLLFRRML+GIKDGRFFLG F+CIDKQGNI
Sbjct: 29  MEQEVGESLIQLGSSDDPNSSSTDPVTRVRKLLFRRMLVGIKDGRFFLGTFHCIDKQGNI 88

Query: 61  ILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVDSTIDEQLALLSV 113
           ILQD++EYRSTR SSPSPMEQRCLGLILIP SCR SCHVD +I+EQL+LL V
Sbjct: 89  ILQDSIEYRSTRHSSPSPMEQRCLGLILIPFSCRTSCHVDCSINEQLSLLKV 140

BLAST of Csa1G025880 vs. TrEMBL
Match: A0A061GKU0_THECC (Small nuclear ribonucleoprotein family protein isoform 1 OS=Theobroma cacao GN=TCM_029454 PE=4 SV=1)

HSP 1 Score: 169.9 bits (429), Expect = 1.8e-39
Identity = 82/110 (74.55%), Postives = 95/110 (86.36%), Query Frame = 1

Query: 1   MEQESGGSMVQDGSNVESNPESLDHIGKVRKLLFRRMLIGIKDGRFFLGNFYCIDKQGNI 60
           MEQE G S++Q GS+ + N  S D + +VRKLLFRRML+GIKDGRFFLG F+CIDKQGNI
Sbjct: 1   MEQEVGESLIQLGSSDDPNSSSTDPVTRVRKLLFRRMLVGIKDGRFFLGTFHCIDKQGNI 60

Query: 61  ILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVDSTIDEQLALL 111
           ILQD++EYRSTR SSPSPMEQRCLGLILIP SCR SCHVD +I+EQL+LL
Sbjct: 61  ILQDSIEYRSTRHSSPSPMEQRCLGLILIPFSCRTSCHVDCSINEQLSLL 110

BLAST of Csa1G025880 vs. TrEMBL
Match: A0A0D2RE93_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_011G045800 PE=4 SV=1)

HSP 1 Score: 169.5 bits (428), Expect = 2.4e-39
Identity = 80/112 (71.43%), Postives = 97/112 (86.61%), Query Frame = 1

Query: 1   MEQESGGSMVQDGSNVESNPESLDHIGKVRKLLFRRMLIGIKDGRFFLGNFYCIDKQGNI 60
           MEQE G S++Q G++  SN  S D + +VRKLLFRRML+GIKDGRFFLG F+C+DKQGNI
Sbjct: 1   MEQEVGESLIQLGNSDGSNSSSSDPVTRVRKLLFRRMLVGIKDGRFFLGTFHCLDKQGNI 60

Query: 61  ILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVDSTIDEQLALLSV 113
           ILQD +EYRSTRRSSPSPMEQRCLGL+LIP+SC+ SCHVD +++EQL+LL V
Sbjct: 61  ILQDTIEYRSTRRSSPSPMEQRCLGLVLIPSSCQNSCHVDCSVEEQLSLLKV 112

BLAST of Csa1G025880 vs. TrEMBL
Match: A0A0D2ULJ4_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_011G045800 PE=4 SV=1)

HSP 1 Score: 167.9 bits (424), Expect = 6.9e-39
Identity = 79/110 (71.82%), Postives = 96/110 (87.27%), Query Frame = 1

Query: 1   MEQESGGSMVQDGSNVESNPESLDHIGKVRKLLFRRMLIGIKDGRFFLGNFYCIDKQGNI 60
           MEQE G S++Q G++  SN  S D + +VRKLLFRRML+GIKDGRFFLG F+C+DKQGNI
Sbjct: 1   MEQEVGESLIQLGNSDGSNSSSSDPVTRVRKLLFRRMLVGIKDGRFFLGTFHCLDKQGNI 60

Query: 61  ILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVDSTIDEQLALL 111
           ILQD +EYRSTRRSSPSPMEQRCLGL+LIP+SC+ SCHVD +++EQL+LL
Sbjct: 61  ILQDTIEYRSTRRSSPSPMEQRCLGLVLIPSSCQNSCHVDCSVEEQLSLL 110

BLAST of Csa1G025880 vs. TAIR10
Match: AT4G18372.1 (AT4G18372.1 Small nuclear ribonucleoprotein family protein)

HSP 1 Score: 152.5 bits (384), Expect = 1.5e-37
Identity = 76/112 (67.86%), Postives = 87/112 (77.68%), Query Frame = 1

Query: 1   MEQESGGSMVQDGSNVESNPESLDHIGKVRKLLFRRMLIGIKDGRFFLGNFYCIDKQGNI 60
           MEQ +  S     S  E +    D I ++RKLLFR+ML+GIKDGRFFLGNF+CIDKQGNI
Sbjct: 1   MEQAAERSSTIVASTSEGS--DFDPISRLRKLLFRQMLVGIKDGRFFLGNFHCIDKQGNI 60

Query: 61  ILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVDSTIDEQLALLSV 113
           ILQD VEYRS RRSSPSP EQRCLG+ILIP+SCR SCHVD +IDEQL+L+ +
Sbjct: 61  ILQDTVEYRSIRRSSPSPTEQRCLGMILIPSSCRTSCHVDCSIDEQLSLIQL 110

BLAST of Csa1G025880 vs. NCBI nr
Match: gi|778656582|ref|XP_011649337.1| (PREDICTED: uncharacterized protein LOC101206200 [Cucumis sativus])

HSP 1 Score: 226.5 bits (576), Expect = 2.4e-56
Identity = 112/112 (100.00%), Postives = 112/112 (100.00%), Query Frame = 1

Query: 1   MEQESGGSMVQDGSNVESNPESLDHIGKVRKLLFRRMLIGIKDGRFFLGNFYCIDKQGNI 60
           MEQESGGSMVQDGSNVESNPESLDHIGKVRKLLFRRMLIGIKDGRFFLGNFYCIDKQGNI
Sbjct: 1   MEQESGGSMVQDGSNVESNPESLDHIGKVRKLLFRRMLIGIKDGRFFLGNFYCIDKQGNI 60

Query: 61  ILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVDSTIDEQLALLSV 113
           ILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVDSTIDEQLALLSV
Sbjct: 61  ILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVDSTIDEQLALLSV 112

BLAST of Csa1G025880 vs. NCBI nr
Match: gi|659068447|ref|XP_008444468.1| (PREDICTED: uncharacterized protein LOC103487784 [Cucumis melo])

HSP 1 Score: 217.2 bits (552), Expect = 1.4e-53
Identity = 109/112 (97.32%), Postives = 109/112 (97.32%), Query Frame = 1

Query: 1   MEQESGGSMVQDGSNVESNPESLDHIGKVRKLLFRRMLIGIKDGRFFLGNFYCIDKQGNI 60
           MEQESGGSMVQDGSN ESN ESLD IGKVRKLLFRRMLIGIKDGRFFLGNFYCIDKQGNI
Sbjct: 1   MEQESGGSMVQDGSNFESNSESLDCIGKVRKLLFRRMLIGIKDGRFFLGNFYCIDKQGNI 60

Query: 61  ILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVDSTIDEQLALLSV 113
           ILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVDSTIDEQLALLSV
Sbjct: 61  ILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVDSTIDEQLALLSV 112

BLAST of Csa1G025880 vs. NCBI nr
Match: gi|590622413|ref|XP_007025043.1| (Small nuclear ribonucleoprotein family protein isoform 2 [Theobroma cacao])

HSP 1 Score: 171.4 bits (433), Expect = 9.0e-40
Identity = 83/112 (74.11%), Postives = 96/112 (85.71%), Query Frame = 1

Query: 1   MEQESGGSMVQDGSNVESNPESLDHIGKVRKLLFRRMLIGIKDGRFFLGNFYCIDKQGNI 60
           MEQE G S++Q GS+ + N  S D + +VRKLLFRRML+GIKDGRFFLG F+CIDKQGNI
Sbjct: 29  MEQEVGESLIQLGSSDDPNSSSTDPVTRVRKLLFRRMLVGIKDGRFFLGTFHCIDKQGNI 88

Query: 61  ILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVDSTIDEQLALLSV 113
           ILQD++EYRSTR SSPSPMEQRCLGLILIP SCR SCHVD +I+EQL+LL V
Sbjct: 89  ILQDSIEYRSTRHSSPSPMEQRCLGLILIPFSCRTSCHVDCSINEQLSLLKV 140

BLAST of Csa1G025880 vs. NCBI nr
Match: gi|590622409|ref|XP_007025042.1| (Small nuclear ribonucleoprotein family protein isoform 1 [Theobroma cacao])

HSP 1 Score: 169.9 bits (429), Expect = 2.6e-39
Identity = 82/110 (74.55%), Postives = 95/110 (86.36%), Query Frame = 1

Query: 1   MEQESGGSMVQDGSNVESNPESLDHIGKVRKLLFRRMLIGIKDGRFFLGNFYCIDKQGNI 60
           MEQE G S++Q GS+ + N  S D + +VRKLLFRRML+GIKDGRFFLG F+CIDKQGNI
Sbjct: 1   MEQEVGESLIQLGSSDDPNSSSTDPVTRVRKLLFRRMLVGIKDGRFFLGTFHCIDKQGNI 60

Query: 61  ILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVDSTIDEQLALL 111
           ILQD++EYRSTR SSPSPMEQRCLGLILIP SCR SCHVD +I+EQL+LL
Sbjct: 61  ILQDSIEYRSTRHSSPSPMEQRCLGLILIPFSCRTSCHVDCSINEQLSLL 110

BLAST of Csa1G025880 vs. NCBI nr
Match: gi|763801914|gb|KJB68852.1| (hypothetical protein B456_011G045800 [Gossypium raimondii])

HSP 1 Score: 169.5 bits (428), Expect = 3.4e-39
Identity = 80/112 (71.43%), Postives = 97/112 (86.61%), Query Frame = 1

Query: 1   MEQESGGSMVQDGSNVESNPESLDHIGKVRKLLFRRMLIGIKDGRFFLGNFYCIDKQGNI 60
           MEQE G S++Q G++  SN  S D + +VRKLLFRRML+GIKDGRFFLG F+C+DKQGNI
Sbjct: 1   MEQEVGESLIQLGNSDGSNSSSSDPVTRVRKLLFRRMLVGIKDGRFFLGTFHCLDKQGNI 60

Query: 61  ILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVDSTIDEQLALLSV 113
           ILQD +EYRSTRRSSPSPMEQRCLGL+LIP+SC+ SCHVD +++EQL+LL V
Sbjct: 61  ILQDTIEYRSTRRSSPSPMEQRCLGLVLIPSSCQNSCHVDCSVEEQLSLLKV 112

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
A0A0A0LSZ5_CUCSA1.6e-56100.00Uncharacterized protein OS=Cucumis sativus GN=Csa_1G025880 PE=4 SV=1[more]
A0A061GCV7_THECC6.3e-4074.11Small nuclear ribonucleoprotein family protein isoform 2 OS=Theobroma cacao GN=T... [more]
A0A061GKU0_THECC1.8e-3974.55Small nuclear ribonucleoprotein family protein isoform 1 OS=Theobroma cacao GN=T... [more]
A0A0D2RE93_GOSRA2.4e-3971.43Uncharacterized protein OS=Gossypium raimondii GN=B456_011G045800 PE=4 SV=1[more]
A0A0D2ULJ4_GOSRA6.9e-3971.82Uncharacterized protein OS=Gossypium raimondii GN=B456_011G045800 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G18372.11.5e-3767.86 Small nuclear ribonucleoprotein family protein[more]
Match NameE-valueIdentityDescription
gi|778656582|ref|XP_011649337.1|2.4e-56100.00PREDICTED: uncharacterized protein LOC101206200 [Cucumis sativus][more]
gi|659068447|ref|XP_008444468.1|1.4e-5397.32PREDICTED: uncharacterized protein LOC103487784 [Cucumis melo][more]
gi|590622413|ref|XP_007025043.1|9.0e-4074.11Small nuclear ribonucleoprotein family protein isoform 2 [Theobroma cacao][more]
gi|590622409|ref|XP_007025042.1|2.6e-3974.55Small nuclear ribonucleoprotein family protein isoform 1 [Theobroma cacao][more]
gi|763801914|gb|KJB68852.1|3.4e-3971.43hypothetical protein B456_011G045800 [Gossypium raimondii][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR001163LSM_dom_euk/arc
IPR010920LSM_dom_sf
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0030529 intracellular ribonucleoprotein complex
cellular_component GO:0019013 viral nucleocapsid
cellular_component GO:0031417 NatC complex
molecular_function GO:0003674 molecular_function
This gene is associated with the following unigenes:
Unigene NameAnalysis NameSequence type in Unigene
CU090525cucumber EST collection version 3.0transcribed_cluster
CU119657cucumber EST collection version 3.0transcribed_cluster
CU149651cucumber EST collection version 3.0transcribed_cluster
CU166788cucumber EST collection version 3.0transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Csa1G025880.1Csa1G025880.1mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature NameUnique NameType
CU090525CU090525transcribed_cluster
CU166788CU166788transcribed_cluster
CU119657CU119657transcribed_cluster
CU149651CU149651transcribed_cluster


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001163LSM domain, eukaryotic/archaea-typePFAMPF01423LSMcoord: 30..89
score: 1.
IPR010920LSM domainunknownSSF50182Sm-like ribonucleoproteinscoord: 25..101
score: 1.56
NoneNo IPR availableGENE3DG3DSA:2.30.30.100coord: 25..101
score: 5.8

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:

None