CU114166 (transcribed_cluster) Cucumber (Chinese Long) v2

Name: CU114166
Type: transcribed_cluster
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: Unknown protein
Location: Chr1 : 20302849 .. 20308195 (+)
Sequence length: 630
The following sequences are available for this feature:

transcribed_cluster sequence

CCTAGAAAATAAAGATACGTTTTAGATTTAGTAAATGAAATACTGTTCCTCGAAGTCTTACAAACTGAAGATCACCTAACGACTCCATCGCCTAGTTATTTCTTAAAAAAAACGATCCCATTGCCTAGTAAACATAGTTGAAACCATCAGGCCTTCAAGGAATCAAATCGAGGTCTCCATAGTCCAAGTCTGTACTTGTGGGTGTACTTCAATATCAGCTTCCTATGTTTGCATGGAATTCCAAGTTTCTTCAGCTTAAGTGTGCGAGTAACAAGCAGCTTCTGGAAGTCGCCAATCTCTGATTCAAGTTTAGCCACGTGTGACTCAACTCCATGCCCAATACCAGTTAAAAATTCTGGTATCCCGACCTTCACGATATATGGGGAAGATTTAGAAAAGAATCTAGCAAATAAAGGAGCTGAGTATGGTTGCAGTATAGACCTATGGTTGGAGACAATATTCCTCCATGCCATCATTCTTGCTCAATGGCGAGTGAACAAGAGTTGAAGGGACGGCGGCAAAAGACAGTGTTTGGAGCGACGGCGTTAGGATGCAACAGATGAGCAGTGGGGGCGTTCCGGCAAGTGACAAAGGTGGAAGAGGGAGACAGAAAAACAGGAACGCCAGATA
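
The BLAST hits reported for this sequence are all in query frame -2, which means the protein is read from the reverse-complement strand, starting at its second base. The following is a minimal Python sketch of that convention, not part of the original record. The function names `reverse_complement` and `translate_frame` are illustrative, and the standard genetic code is assumed. The 22-base fragment is copied from the cluster sequence above and was chosen so that frame -2 of the fragment on its own reproduces the first seven residues of the aligned query.

```python
# Standard genetic code, with codons enumerated in T, C, A, G base order
# (an assumption; organelle or alternative codes would need another table).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an unambiguous DNA string."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate_frame(seq: str, frame: int) -> str:
    """Translate seq in a BLAST-style frame (+1..+3 or -1..-3).

    Negative frames read the reverse complement; frame -2 skips its
    first base. Codons with unexpected characters become 'X'.
    """
    if frame not in (1, 2, 3, -1, -2, -3):
        raise ValueError("frame must be one of +1..+3 or -1..-3")
    strand = seq.upper() if frame > 0 else reverse_complement(seq)
    offset = abs(frame) - 1
    codons = (strand[i:i + 3] for i in range(offset, len(strand) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# Fragment taken from the transcribed_cluster sequence (forward strand).
fragment = "AATATTCCTCCATGCCATCATT"
print(translate_frame(fragment, -2))  # MMAWRNI
```

Note that a frame number is relative to the sequence being translated, so the frame of a sub-fragment generally differs from the frame BLAST reports for the full 630-base query.
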
BLAST of CU114166 vs. TrEMBL
Match: A0A0A0M1A4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G555610 PE=4 SV=1)

HSP 1 Score: 196.8 bits (499), Expect = 2.6e-47
Identity = 96/109 (88.07%), Positives = 96/109 (88.07%), Query Frame = -2

Query: 150 MMAWRNIVSNHRSILQPYSAPLFARFFSKSSPYIVKVGIPEFLTGIGHGVESHVAKLESE 209
           MMAWRNIVSNHRSILQPYSAPLFARFFSKSSPYIVKVGIPEFLTGIGHGVESHVAKLESE
Sbjct: 1   MMAWRNIVSNHRSILQPYSAPLFARFFSKSSPYIVKVGIPEFLTGIGHGVESHVAKLESE 60

Query: 210 IGDFQXXXXXXXXXXXXXGIPCKHRKLILKYTHKYRLGLWRPRFDSLKA 269
           IGDFQ             GIPCKHRKLILKYTHKYRLGLWRPRFDSLKA
Sbjct: 61  IGDFQKLLVTRTLKLKKLGIPCKHRKLILKYTHKYRLGLWRPRFDSLKA 109

BLAST of CU114166 vs. TrEMBL
Match: M5WR64_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa012681mg PE=4 SV=1)

HSP 1 Score: 149.8 bits (377), Expect = 3.6e-33
Identity = 72/110 (65.45%), Positives = 85/110 (77.27%), Query Frame = -2

Query: 150 RMMAWRNIVSNHRSILQPYSAPLFARFFSKSSPYIVKVGIPEFLTGIGHGVESHVAKLES 209
           R MAWR I+ + R+IL+PYS    A+F +KS+PY+VKVGIPEFL GIG+GVESHV+KLES
Sbjct: 50  RKMAWRQILFSTRAILEPYSTTGSAKFSTKSNPYLVKVGIPEFLNGIGNGVESHVSKLES 109

Query: 210 EIGDFQXXXXXXXXXXXXXGIPCKHRKLILKYTHKYRLGLWRPRFDSLKA 269
           EIGDFQ             GIPCKHRKLILKYTHKYRLGLWRPR  ++K+
Sbjct: 110 EIGDFQKLLVTRTLKLKKLGIPCKHRKLILKYTHKYRLGLWRPRAQAVKS 159

BLAST of CU114166 vs. TrEMBL
Match: M5X3Q6_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa022191mg PE=4 SV=1)

HSP 1 Score: 145.6 bits (366), Expect = 6.8e-32
Identity = 70/108 (64.81%), Positives = 82/108 (75.93%), Query Frame = -2

Query: 150 MAWRNIVSNHRSILQPYSAPLFARFFSKSSPYIVKVGIPEFLTGIGHGVESHVAKLESEI 209
           MAWR ++ N R+IL PY A   ARF +KS+PY+VKVGIPEFL GIG+GVESHVAKLE+EI
Sbjct: 1   MAWRQMLFNSRAILGPYLATGSARFSTKSNPYLVKVGIPEFLNGIGNGVESHVAKLEAEI 60

Query: 210 GDFQXXXXXXXXXXXXXGIPCKHRKLILKYTHKYRLGLWRPRFDSLKA 269
           GDFQ             G+PCKHRKLILKYTHKYRLGLWRP   ++K+
Sbjct: 61  GDFQKLLVTRTLKLKKLGVPCKHRKLILKYTHKYRLGLWRPLAQAIKS 108

BLAST of CU114166 vs. TrEMBL
Match: A0A151U4Q1_CAJCA (Uncharacterized protein OS=Cajanus cajan GN=KK1_006949 PE=4 SV=1)

HSP 1 Score: 141.4 bits (355), Expect = 1.3e-30
Identity = 69/93 (74.19%), Positives = 76/93 (81.72%), Query Frame = -2

Query: 150 PYSAPLFARFFSKSSPYIVKVGIPEFLTGIGHGVESHVAKLESEIGDFQXXXXXXXXXXX 209
           P+S P  +RFFSKSSPY+VKVGIPEFL+GIG+GVESHVAKLESEIGDFQ           
Sbjct: 19  PHSTP--SRFFSKSSPYVVKVGIPEFLSGIGNGVESHVAKLESEIGDFQKLLVTRTLKLK 78

Query: 210 XXGIPCKHRKLILKYTHKYRLGLWRPRFDSLKA 269
             GIPCKHRKLILKYTHKYRLGLWRPR +S+KA
Sbjct: 79  KLGIPCKHRKLILKYTHKYRLGLWRPRVESIKA 109

BLAST of CU114166 vs. TrEMBL
Match: A0A061F3U0_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_026782 PE=4 SV=1)

HSP 1 Score: 140.2 bits (352), Expect = 2.9e-30
Identity = 71/109 (65.14%), Positives = 80/109 (73.39%), Query Frame = -2

Query: 150 MAWRNIVSNHRSILQPYSAPL-FARFFSKSSPYIVKVGIPEFLTGIGHGVESHVAKLESE 209
           MAW  I+ N R I  P S P  F+RFFS+S+P++VKVGIPEFL G+G GVE+HV KLESE
Sbjct: 1   MAWVQILRNTREI--PVSNPYGFSRFFSRSTPFVVKVGIPEFLNGVGKGVETHVVKLESE 60

Query: 210 IGDFQXXXXXXXXXXXXXGIPCKHRKLILKYTHKYRLGLWRPRFDSLKA 269
           IGDFQ             GIPCKHRKLILKY HKYRLGLWRPR +SLKA
Sbjct: 61  IGDFQKLLVTRTLKLKKLGIPCKHRKLILKYAHKYRLGLWRPRAESLKA 107

BLAST of CU114166 vs. NCBI nr
Match: gi|449435566|ref|XP_004135566.1| (PREDICTED: uncharacterized protein LOC101215337 [Cucumis sativus])

HSP 1 Score: 199.5 bits (506), Expect = 5.7e-48
Identity = 96/109 (88.07%), Positives = 96/109 (88.07%), Query Frame = -2

Query: 150 MMAWRNIVSNHRSILQPYSAPLFARFFSKSSPYIVKVGIPEFLTGIGHGVESHVAKLESE 209
           MMAWRNIVSNHRSILQPYSAPLFARFFSKSSPYIVKVGIPEFLTGIGHGVESHVAKLESE
Sbjct: 1   MMAWRNIVSNHRSILQPYSAPLFARFFSKSSPYIVKVGIPEFLTGIGHGVESHVAKLESE 60

Query: 210 IGDFQXXXXXXXXXXXXXGIPCKHRKLILKYTHKYRLGLWRPRFDSLKA 269
           IGDFQ             GIPCKHRKLILKYTHKYRLGLWRPRFDSLKA
Sbjct: 61  IGDFQKLLVTRTLKLKKLGIPCKHRKLILKYTHKYRLGLWRPRFDSLKA 109

BLAST of CU114166 vs. NCBI nr
Match: gi|659099308|ref|XP_008450534.1| (PREDICTED: uncharacterized protein LOC103492108 [Cucumis melo])

HSP 1 Score: 189.9 bits (481), Expect = 4.5e-45
Identity = 90/108 (83.33%), Positives = 93/108 (86.11%), Query Frame = -2

Query: 150 MAWRNIVSNHRSILQPYSAPLFARFFSKSSPYIVKVGIPEFLTGIGHGVESHVAKLESEI 209
           MAW+NI+SNHRSILQPYSA LFARFFSKSSPYIVKVGIPEFL GIGHGVESHVAKLESEI
Sbjct: 1   MAWKNIISNHRSILQPYSASLFARFFSKSSPYIVKVGIPEFLNGIGHGVESHVAKLESEI 60

Query: 210 GDFQXXXXXXXXXXXXXGIPCKHRKLILKYTHKYRLGLWRPRFDSLKA 269
           GDFQ             GIPCKHRKLILKYTHKYRLGLWRPRFDSLK+
Sbjct: 61  GDFQKLLVTRTLKLKKLGIPCKHRKLILKYTHKYRLGLWRPRFDSLKS 108

BLAST of CU114166 vs. NCBI nr
Match: gi|595927750|ref|XP_007215093.1| (hypothetical protein PRUPE_ppa012681mg [Prunus persica])

HSP 1 Score: 152.5 bits (384), Expect = 8.0e-34
Identity = 72/110 (65.45%), Positives = 85/110 (77.27%), Query Frame = -2

Query: 150 RMMAWRNIVSNHRSILQPYSAPLFARFFSKSSPYIVKVGIPEFLTGIGHGVESHVAKLES 209
           R MAWR I+ + R+IL+PYS    A+F +KS+PY+VKVGIPEFL GIG+GVESHV+KLES
Sbjct: 50  RKMAWRQILFSTRAILEPYSTTGSAKFSTKSNPYLVKVGIPEFLNGIGNGVESHVSKLES 109

Query: 210 EIGDFQXXXXXXXXXXXXXGIPCKHRKLILKYTHKYRLGLWRPRFDSLKA 269
           EIGDFQ             GIPCKHRKLILKYTHKYRLGLWRPR  ++K+
Sbjct: 110 EIGDFQKLLVTRTLKLKKLGIPCKHRKLILKYTHKYRLGLWRPRAQAVKS 159

BLAST of CU114166 vs. NCBI nr
Match: gi|645244516|ref|XP_008228459.1| (PREDICTED: uncharacterized protein LOC103327871 isoform X2 [Prunus mume])

HSP 1 Score: 151.0 bits (380), Expect = 2.3e-33
Identity = 71/108 (65.74%), Positives = 84/108 (77.78%), Query Frame = -2

Query: 150 MAWRNIVSNHRSILQPYSAPLFARFFSKSSPYIVKVGIPEFLTGIGHGVESHVAKLESEI 209
           MAWR I+ + R+IL+PYS    A+F +KS+PY+VKVGIPEFL GIG+GVESHV+KLESEI
Sbjct: 1   MAWRQILFSTRAILEPYSTTGSAKFSTKSNPYLVKVGIPEFLNGIGNGVESHVSKLESEI 60

Query: 210 GDFQXXXXXXXXXXXXXGIPCKHRKLILKYTHKYRLGLWRPRFDSLKA 269
           GDFQ             GIPCKHRKLILKYTHKYRLGLWRPR  ++K+
Sbjct: 61  GDFQKLLVTRTLKLKKLGIPCKHRKLILKYTHKYRLGLWRPRAQAVKS 108

BLAST of CU114166 vs. NCBI nr
Match: gi|645244513|ref|XP_008228458.1| (PREDICTED: uncharacterized protein LOC103327871 isoform X1 [Prunus mume])

HSP 1 Score: 151.0 bits (380), Expect = 2.3e-33
Identity = 71/108 (65.74%), Positives = 84/108 (77.78%), Query Frame = -2

Query: 150 MAWRNIVSNHRSILQPYSAPLFARFFSKSSPYIVKVGIPEFLTGIGHGVESHVAKLESEI 209
           MAWR I+ + R+IL+PYS    A+F +KS+PY+VKVGIPEFL GIG+GVESHV+KLESEI
Sbjct: 22  MAWRQILFSTRAILEPYSTTGSAKFSTKSNPYLVKVGIPEFLNGIGNGVESHVSKLESEI 81

Query: 210 GDFQXXXXXXXXXXXXXGIPCKHRKLILKYTHKYRLGLWRPRFDSLKA 269
           GDFQ             GIPCKHRKLILKYTHKYRLGLWRPR  ++K+
Sbjct: 82  GDFQKLLVTRTLKLKKLGIPCKHRKLILKYTHKYRLGLWRPRAQAVKS 129

The following BLAST results are available for this feature:
Match Name                          E-value   Identity (%)   Description
A0A0A0M1A4_CUCSA                    2.6e-47   88.07          Uncharacterized protein OS=Cucumis sativus GN=Csa_1G555610 PE=4 SV=1
M5WR64_PRUPE                        3.6e-33   65.45          Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa012681mg PE=4 SV=1
M5X3Q6_PRUPE                        6.8e-32   64.81          Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa022191mg PE=4 SV=1
A0A151U4Q1_CAJCA                    1.3e-30   74.19          Uncharacterized protein OS=Cajanus cajan GN=KK1_006949 PE=4 SV=1
A0A061F3U0_THECC                    2.9e-30   65.14          Uncharacterized protein OS=Theobroma cacao GN=TCM_026782 PE=4 SV=1

Match Name                          E-value   Identity (%)   Description
gi|449435566|ref|XP_004135566.1|    5.7e-48   88.07          PREDICTED: uncharacterized protein LOC101215337 [Cucumis sativus]
gi|659099308|ref|XP_008450534.1|    4.5e-45   83.33          PREDICTED: uncharacterized protein LOC103492108 [Cucumis melo]
gi|595927750|ref|XP_007215093.1|    8.0e-34   65.45          hypothetical protein PRUPE_ppa012681mg [Prunus persica]
gi|645244516|ref|XP_008228459.1|    2.3e-33   65.74          PREDICTED: uncharacterized protein LOC103327871 isoform X2 [Prunus mume]
gi|645244513|ref|XP_008228458.1|    2.3e-33   65.74          PREDICTED: uncharacterized protein LOC103327871 isoform X1 [Prunus mume]
The following terms have been associated with this transcribed_cluster:
Vocabulary: INTERPRO
Term         Definition
IPR019083    IGR_protein_motif

This transcribed_cluster is associated with the following gene feature(s):

Feature Name    Unique Name    Type
Csa1G555610     Csa1G555610    gene


The following EST feature(s) are a part of this transcribed_cluster:

Feature Name      Unique Name       Type
FKNP3UI02NNYOM    FKNP3UI02NNYOM    EST
G0074507          G0074507          EST
G0102142          G0102142          EST
G0167106          G0167106          EST
H0035795          H0035795          EST
csa01-1ms4-b01    csa01-1ms4-b01    EST


Analysis Name: InterPro Annotations of cucumber unigene v3
Date Performed: 2016-11-16
IPR Term     IPR Description      Source     Source Term   Source Description   Alignment
IPR019083    IGR protein motif    PFAM       PF09597       IGR                  coord: 40..97, score: 8.9
IPR019083    IGR protein motif    SMART      SM01238       IGR_2                coord: 38..97, score: 6.0
None         No IPR available     PANTHER    PTHR34955     FAMILY NOT NAMED     coord: 1..108, score: 1.2