CSPI06G10000.1 (mRNA) Wild cucumber (PI 183967)

NameCSPI06G10000.1
TypemRNA
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionMicronuclear linker histone polyprotein-like protein
LocationChr6 : 8642075 .. 8643733 (-)
Sequence length1359
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
CATCCAAAATCAGCGGTTTGAGCGCAGTTCCCATGCTTTCCTCTACCTTGCTAACTCCGCTTTCAGAGTAGTGGTTGCATGTTCTTTTTTACATGACAAGCAAAGAAATCTCCATTTTCAATGTATAAATTCATTTCCCATTTCTCCTTCTTCAATAAAATTCCCAAGCCCTCTATTACTCTTTCTTCTTCTTCTTCTCTTCTTCTTCTTGTAGATGGGTTCTGTTGGGAGCCTTCAGAGTAAAGCAAGCAGCAGCAGAGGAAGGTCTTATGCTCTGATGCTGCTTATTGCATTTGGGGCAGCCTTGCTAGGAGTTATGCTTCTTAACAAATTCAGAGAAAGGCGTATTTGCAACCTTCTTCTCAAACGCACAGATCAAGACCTCCGTTCCTTTCAACTCCTCTTTCAGGTCAACTCATTCATTATCCTTCTTCCATGCATTCCCTTGTCACTCGGTCTCACTTAATTGTATTCATTTCTGTTTTGAATGAACAGAAAGAGAAGGATCGAAGCAAGGAATTGGCGAAGAGCCACGAAGACATGACTGCAGCGCTGTATGTGCTCCGAACCAAGAAGATGGAGCTTGATAGGAGGCTACTGGAGTTGCAATCCACCATTGACTCACTCAGAGACGAGCAGAAAATCATAAGTGTTGCACTCCAAGAAAAGCAGAGTGAGATCAAAACTTTGAGGGAGAAAGAAATTGAATCTGGAAACGAAAATCCTCAAGTGGTTGCTTTAACGAAAAGCCTCAAGCAAAAGGAAGATGAGTTAGAGGAATTAAGACGTCGACTCGAGTCTCCTGCAGCTGTTGCGGCTAATGATTCATCAGATTCAGACCCACGTGGGGATTCAAAAACGTCTGAAGCACTAAGGGAGCAATTGCTTGAATCGAGCGGAGGAAGAGGAGAGAGTGAATCAATGGATAACAACAGAGGGGGGAGTAAATCGACAGGTTTTCATGAGATTGAAACTCAGAAGTTAGAGGATCATGAAGGAGTGGAGGAAAGGCAGAAGCAAGAGAAGTTGGGAGACGAGGGTGAAAATGCCATTGGGGGGGAACTTGAGGGCGAGGAGACTAAAATCACCAACGAACATAAGGAGAAAGAAGTCGGGCATTCTGAAGAAGTTGAAAATGGAAGAGCTAATAAGATTGCAAGGAGTGAAATTAAGATGAAAACAGAAACAGGCAAATATGGGAATACACGGAAGATCAGAGGAAAGAGATGGAGATACATAGCCAAAAGAAGGGCAGTGGACAATGGATGGAGACTGATATCGAAGAAATCGAAGAAGGATAGCAGAAACAGAAATTTGAACGACAATAGAGCCGTTGGCACAACACATGGGAAGTTCATTGATGGAGCAGAGGAGAAAATGAAAGAGGAGCACACACAGGGCGCAAAAGAAAATTTGATGGAAATGGAAAACAAAATGGTACAAGAAGAAAATTTGAATTCAGAAAATGACACATCGAGAGAGAAAAATGGGAATGGAGACGTGGAAGATGATAGCATTAAGCAAAACCCAGGTGAAGAGATGGAGCAAAATGACTCAAAAAGTGAAGAAAACCAAGAACCAGAACGAGGAATGGACTCCAAGGAAGATGGTAAAGAAGAAGAAGAAAACAAGGAAGAGCCAGAGGAGTCGGAGTTTTAA

mRNA sequence

ATGGGTTCTGTTGGGAGCCTTCAGAGTAAAGCAAGCAGCAGCAGAGGAAGGTCTTATGCTCTGATGCTGCTTATTGCATTTGGGGCAGCCTTGCTAGGAGTTATGCTTCTTAACAAATTCAGAGAAAGGCGTATTTGCAACCTTCTTCTCAAACGCACAGATCAAGACCTCCGTTCCTTTCAACTCCTCTTTCAGAAAGAGAAGGATCGAAGCAAGGAATTGGCGAAGAGCCACGAAGACATGACTGCAGCGCTGTATGTGCTCCGAACCAAGAAGATGGAGCTTGATAGGAGGCTACTGGAGTTGCAATCCACCATTGACTCACTCAGAGACGAGCAGAAAATCATAAGTGTTGCACTCCAAGAAAAGCAGAGTGAGATCAAAACTTTGAGGGAGAAAGAAATTGAATCTGGAAACGAAAATCCTCAAGTGGTTGCTTTAACGAAAAGCCTCAAGCAAAAGGAAGATGAGTTAGAGGAATTAAGACGTCGACTCGAGTCTCCTGCAGCTGTTGCGGCTAATGATTCATCAGATTCAGACCCACGTGGGGATTCAAAAACGTCTGAAGCACTAAGGGAGCAATTGCTTGAATCGAGCGGAGGAAGAGGAGAGAGTGAATCAATGGATAACAACAGAGGGGGGAGTAAATCGACAGGTTTTCATGAGATTGAAACTCAGAAGTTAGAGGATCATGAAGGAGTGGAGGAAAGGCAGAAGCAAGAGAAGTTGGGAGACGAGGGTGAAAATGCCATTGGGGGGGAACTTGAGGGCGAGGAGACTAAAATCACCAACGAACATAAGGAGAAAGAAGTCGGGCATTCTGAAGAAGTTGAAAATGGAAGAGCTAATAAGATTGCAAGGAGTGAAATTAAGATGAAAACAGAAACAGGCAAATATGGGAATACACGGAAGATCAGAGGAAAGAGATGGAGATACATAGCCAAAAGAAGGGCAGTGGACAATGGATGGAGACTGATATCGAAGAAATCGAAGAAGGATAGCAGAAACAGAAATTTGAACGACAATAGAGCCGTTGGCACAACACATGGGAAGTTCATTGATGGAGCAGAGGAGAAAATGAAAGAGGAGCACACACAGGGCGCAAAAGAAAATTTGATGGAAATGGAAAACAAAATGGTACAAGAAGAAAATTTGAATTCAGAAAATGACACATCGAGAGAGAAAAATGGGAATGGAGACGTGGAAGATGATAGCATTAAGCAAAACCCAGGTGAAGAGATGGAGCAAAATGACTCAAAAAGTGAAGAAAACCAAGAACCAGAACGAGGAATGGACTCCAAGGAAGATGGTAAAGAAGAAGAAGAAAACAAGGAAGAGCCAGAGGAGTCGGAGTTTTAA

Coding sequence (CDS)

ATGGGTTCTGTTGGGAGCCTTCAGAGTAAAGCAAGCAGCAGCAGAGGAAGGTCTTATGCTCTGATGCTGCTTATTGCATTTGGGGCAGCCTTGCTAGGAGTTATGCTTCTTAACAAATTCAGAGAAAGGCGTATTTGCAACCTTCTTCTCAAACGCACAGATCAAGACCTCCGTTCCTTTCAACTCCTCTTTCAGAAAGAGAAGGATCGAAGCAAGGAATTGGCGAAGAGCCACGAAGACATGACTGCAGCGCTGTATGTGCTCCGAACCAAGAAGATGGAGCTTGATAGGAGGCTACTGGAGTTGCAATCCACCATTGACTCACTCAGAGACGAGCAGAAAATCATAAGTGTTGCACTCCAAGAAAAGCAGAGTGAGATCAAAACTTTGAGGGAGAAAGAAATTGAATCTGGAAACGAAAATCCTCAAGTGGTTGCTTTAACGAAAAGCCTCAAGCAAAAGGAAGATGAGTTAGAGGAATTAAGACGTCGACTCGAGTCTCCTGCAGCTGTTGCGGCTAATGATTCATCAGATTCAGACCCACGTGGGGATTCAAAAACGTCTGAAGCACTAAGGGAGCAATTGCTTGAATCGAGCGGAGGAAGAGGAGAGAGTGAATCAATGGATAACAACAGAGGGGGGAGTAAATCGACAGGTTTTCATGAGATTGAAACTCAGAAGTTAGAGGATCATGAAGGAGTGGAGGAAAGGCAGAAGCAAGAGAAGTTGGGAGACGAGGGTGAAAATGCCATTGGGGGGGAACTTGAGGGCGAGGAGACTAAAATCACCAACGAACATAAGGAGAAAGAAGTCGGGCATTCTGAAGAAGTTGAAAATGGAAGAGCTAATAAGATTGCAAGGAGTGAAATTAAGATGAAAACAGAAACAGGCAAATATGGGAATACACGGAAGATCAGAGGAAAGAGATGGAGATACATAGCCAAAAGAAGGGCAGTGGACAATGGATGGAGACTGATATCGAAGAAATCGAAGAAGGATAGCAGAAACAGAAATTTGAACGACAATAGAGCCGTTGGCACAACACATGGGAAGTTCATTGATGGAGCAGAGGAGAAAATGAAAGAGGAGCACACACAGGGCGCAAAAGAAAATTTGATGGAAATGGAAAACAAAATGGTACAAGAAGAAAATTTGAATTCAGAAAATGACACATCGAGAGAGAAAAATGGGAATGGAGACGTGGAAGATGATAGCATTAAGCAAAACCCAGGTGAAGAGATGGAGCAAAATGACTCAAAAAGTGAAGAAAACCAAGAACCAGAACGAGGAATGGACTCCAAGGAAGATGGTAAAGAAGAAGAAGAAAACAAGGAAGAGCCAGAGGAGTCGGAGTTTTAA
BLAST of CSPI06G10000.1 vs. TrEMBL
Match: A0A0A0KG17_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G124060 PE=4 SV=1)

HSP 1 Score: 785.8 bits (2028), Expect = 2.8e-224
Identity = 446/452 (98.67%), Postives = 446/452 (98.67%), Query Frame = 1

Query: 1   MGSVGSLQSKASSSRGRSYALMLLIAFGAALLGVMLLNKFRERRICNLLLKRTDQDLRSF 60
           MGSVGSLQSKASSSRGRSYALMLLIAFGAALLGVMLLNKFRERRICNLLLKRTDQDLRSF
Sbjct: 1   MGSVGSLQSKASSSRGRSYALMLLIAFGAALLGVMLLNKFRERRICNLLLKRTDQDLRSF 60

Query: 61  QLLFQKEKDRSKELAKSHEDMTAALYVLRTKKMELDRRLLELQSTIDSLRDEQKIISVAL 120
           QLLFQKEKDRSKELAKSHEDMTAALYVLRTKKMELDRRLLELQSTIDSLRDEQKI SVAL
Sbjct: 61  QLLFQKEKDRSKELAKSHEDMTAALYVLRTKKMELDRRLLELQSTIDSLRDEQKITSVAL 120

Query: 121 QEKQSEIKTLREKEIESGNENPQVVALTKSLKQKEDELEELRRRLESPAAVAANDSSDSD 180
           QEKQSEIKTLREKEIESGNENPQVVALTKSLKQKEDELEELRRRLESPAAVAANDSSDSD
Sbjct: 121 QEKQSEIKTLREKEIESGNENPQVVALTKSLKQKEDELEELRRRLESPAAVAANDSSDSD 180

Query: 181 PRGDSKTSEALREQLLESSGGRGESESMDNNRGGSKSTGFHEIETQKLEDHEGVEERQKQ 240
           PRGDSKTSE LREQLLESSGGRGESESMDNNRGGSKSTGFHEIETQKLE HEGVEERQKQ
Sbjct: 181 PRGDSKTSETLREQLLESSGGRGESESMDNNRGGSKSTGFHEIETQKLEGHEGVEERQKQ 240

Query: 241 EKLGDEGENAIGGELEGEETKITNEHKEKEVGHSEEVENGRANKIARSEIKMKTETGKYG 300
           EKLGDEGENAIG ELEGEETKITNEHKEKEVGHSEEVENGRANKIARSEIKMKTETGKYG
Sbjct: 241 EKLGDEGENAIGRELEGEETKITNEHKEKEVGHSEEVENGRANKIARSEIKMKTETGKYG 300

Query: 301 NTRKIRGKRWRYIAKRRAVDNGWRLISKKSKKDSRNRNLNDNRAVGTTHGKFIDGAEEKM 360
           NTRKIRGKRWRYIAKRRAVDNGWRLISKKSKKDSRNRNLNDNRA GTTHGKFIDGAEEKM
Sbjct: 301 NTRKIRGKRWRYIAKRRAVDNGWRLISKKSKKDSRNRNLNDNRADGTTHGKFIDGAEEKM 360

Query: 361 KEEHTQGAKENLMEMENKMVQEENLNSENDTSREKNGNGDVEDDSIKQNPGEEMEQNDSK 420
           KEEHTQGAKENLMEMENKMVQ ENLNSENDTSREKNGNGDVEDDSIKQNPGEEMEQNDSK
Sbjct: 361 KEEHTQGAKENLMEMENKMVQ-ENLNSENDTSREKNGNGDVEDDSIKQNPGEEMEQNDSK 420

Query: 421 SEENQEPERGMDSKEDGKEEEENKEEPEESEF 453
           SEENQEPERGMDSKEDGKEEEENKEEPEESEF
Sbjct: 421 SEENQEPERGMDSKEDGKEEEENKEEPEESEF 451

BLAST of CSPI06G10000.1 vs. TrEMBL
Match: A0A061E2Z9_THECC (Micronuclear linker histone polyprotein-like protein OS=Theobroma cacao GN=TCM_007962 PE=4 SV=1)

HSP 1 Score: 161.8 bits (408), Expect = 2.0e-36
Identity = 128/304 (42.11%), Postives = 183/304 (60.20%), Query Frame = 1

Query: 1   MGSVGSLQSKASSSRGRSYALMLLIAFGAALLGVMLLNKFRERRICNLLLKRTDQDLRSF 60
           MG V S +   +S+RGR Y LMLL+AFGAALLGVM+L+K RERRI NLL++  ++ L S 
Sbjct: 3   MGGV-SHKGNGNSNRGRPYGLMLLVAFGAALLGVMVLHKLRERRIFNLLVEDKNRQLISL 62

Query: 61  QLLFQKEKDRSKELAKSHEDMTAALYVLRTKKMELDRRLLELQSTIDSLRDEQKIISVAL 120
           QLL QKE++  KE+ ++ E+  A +Y LR +KMELDRRLLE+QS I+SL+DEQK +  AL
Sbjct: 63  QLLLQKEREYMKEMKRNAEETKAKIYFLRNQKMELDRRLLEMQSAIESLKDEQKTMESAL 122

Query: 121 QEKQSEIKTLREKEIESGNENPQVVALTKSLKQKEDELEELRRRLESPA---AVAANDSS 180
           +EKQ EI  L+EK ++SGNENPQV+ALT +LKQKE E+E L+ RL+SP    +V+A+D S
Sbjct: 123 EEKQYEIILLQEKHVDSGNENPQVLALTATLKQKEAEIEVLKHRLKSPVRVWSVSADDKS 182

Query: 181 D--SDPRGDSKTSEALREQLLESSGGR-GESESMDNNRGGSKSTGFHEIETQKLEDHEGV 240
           +   +        E  + +  +  GGR  ES +  +    +K     EI++   ++ +  
Sbjct: 183 NLPVNITVTGSMEEKEKTEFSQEEGGRVHESTAYKDGDNSTKDQDRSEIKSNFSQEEQNR 242

Query: 241 EERQKQEKLGDEGENAIGGELEGEETKITNEHKEKEVGHSEEVENGRANKIARSEIKMKT 299
           EE +   K   +GE  +  ++ G          +K V   E   N  A    R+E    T
Sbjct: 243 EEVEDGSK--KKGETTLRMDMAG------GGQLQKPVSLGENARNEGAAGEMRNEYSQYT 297

BLAST of CSPI06G10000.1 vs. TrEMBL
Match: W9R083_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_022179 PE=4 SV=1)

HSP 1 Score: 150.6 bits (379), Expect = 4.6e-33
Identity = 157/439 (35.76%), Postives = 243/439 (55.35%), Query Frame = 1

Query: 4   VGSLQSKASSSRGRSYALMLLIAFGAALLGVMLLNKFRERRICNLLLKRTDQDLRSFQLL 63
           +G+  S   SSRGRSY LMLL+AFGAALLGVM+L+K RERRI NLL+K  D  L S  LL
Sbjct: 11  LGNGSSNGGSSRGRSYGLMLLLAFGAALLGVMILHKLRERRIFNLLIKEKDGQLFSLHLL 70

Query: 64  FQKEKDRSKELAKSHEDMTAALYVLRTKKMELDRRLLELQSTIDSLRDEQKIISVALQEK 123
            QKE+D  KE+   +E+M A +Y LRTKKMELDRRLLE+QSTIDSL+DEQ+ + +A++EK
Sbjct: 71  LQKERDYMKEVKSKNEEMKAKIYSLRTKKMELDRRLLEMQSTIDSLKDEQRTMEIAIEEK 130

Query: 124 QSEIKTLR-EKEIESGNENPQVVALTKSLKQKEDELEELRRRLESPA-AVAANDSSDSDP 183
           Q+EIK LR + +I +  EN QV AL +SLKQKE E+E+L+R L++P  A + N++++++ 
Sbjct: 131 QNEIKMLRAQDQISNEKENNQVAALVESLKQKEAEIEDLKRCLQNPENAGSVNNTAETEQ 190

Query: 184 RGDSKTSEALREQLLESSGGRGESESMDNNRGGSKSTGFHEIETQKLEDHEGVEERQKQE 243
           +G S + +    +++E  G   + E  + N+ G +S      E   +E     +E  K E
Sbjct: 191 KGKSGSEDNGERRIIE-EGETAKLEDENGNKVGDQSVKDGVTENPTIEG----QEHDKNE 250

Query: 244 KLGDEGENAIGGELEGEETKITNEHKEKEVGHSEEVE---NGRANKIARSEIKMKTETGK 303
              DE   ++   +    T   + +   +V   EE++   +G+  K  +S+ + +   G 
Sbjct: 251 DSKDENV-SLSLNIVANSTGAISRNMVGKVMDGEELKVEGDGQLGKHEKSQDEGEENPGS 310

Query: 304 YGNTRKIRGK-RWRYIAKRRAVDNG-----WRLISK-----KSKKDSRNRNLNDNRAVGT 363
                K+  +  +    KR +V +      WR I+K     KS K SR       R    
Sbjct: 311 VEGGMKLEIRDGFEKNGKRGSVSDNVKGKRWRKIAKSRRLEKSGKLSRVTRRRSVRFYKD 370

Query: 364 THGKFIDGAEEKMKEEHTQGAKENLMEMENKMVQ--EENLNS-ENDTSREKNGNGDVEDD 423
            HG      + ++K   T+G    L+  +N+ ++  + +++S E     E + NGDV++ 
Sbjct: 371 EHGDL----KSRVKNAATKGGV--LLGEDNRQLEGRKSDVSSFEMPKKGEVSINGDVKES 430

BLAST of CSPI06G10000.1 vs. TrEMBL
Match: V7AGA6_PHAVU (Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_011G112100g PE=4 SV=1)

HSP 1 Score: 145.6 bits (366), Expect = 1.5e-31
Identity = 141/417 (33.81%), Postives = 211/417 (50.60%), Query Frame = 1

Query: 5   GSLQSKASSSRGRSYALMLLIAFGAALLGVMLLNKFRERRICNLLLKRTDQDLRSFQLLF 64
           G   S   S RGR YAL+ LI FGAALLGVM+L+K RERRI  LL+K  D  + + QLLF
Sbjct: 6   GVHSSSGGSHRGRPYALIWLITFGAALLGVMVLHKLRERRIYTLLVKEKDHQILALQLLF 65

Query: 65  QKEKDRSKELAKSHEDMTAALYVLRTKKMELDRRLLELQSTIDSLRDEQKIISVALQEKQ 124
           QKEKDRSKEL   +E+M   +Y LR++KMEL R ++E+QST+DSL+DEQK++  A +E+Q
Sbjct: 66  QKEKDRSKELRGKNEEMKGKIYGLRSQKMELSRTVVEMQSTLDSLKDEQKLMETAFEEQQ 125

Query: 125 SEIKTLREKEIESGNENPQVVALTKSLKQKEDELEELRRRLES---------PAAVAAN- 184
           +E++ ++EK  +    + Q+VAL +++K KE E+E+L+RRLES         P  VAAN 
Sbjct: 126 NELRLMQEKVGDVDQGSSQIVALRENVKHKEAEIEDLKRRLESSVNGHPITFPQIVAANG 185

Query: 185 ------------DSSDSDPRGDSKTSEALREQLLESSGGRGESESMDNNRGGSKSTGFHE 244
                       DSS+S         +A++ +L +S  G   +E  D            E
Sbjct: 186 TMQAQNESEKEEDSSESAKHEGDDNDDAIKSELTKSKDGGVATEIKD------------E 245

Query: 245 IETQKLEDHEGVEERQKQEKLGDEGENAIGGELEGEETKITNEHKEKEVGHSEEVENGRA 304
           I T      EG   +  ++   D G  A    ++  E     E K     H+ EVE    
Sbjct: 246 IWT------EGEIGKANEDPQNDGGGAA--KYIDDAEVGYGREKKAVREEHAGEVE---- 305

Query: 305 NKIARSEIKMKT---ETGKYGNTRKIRGKRWRYIAKRRAVDNGWRLISKKSKKDSRNRNL 364
            KIA    ++K       K+G+ R+ +GKRWR                      + N +L
Sbjct: 306 -KIADGGGQVKQLAWMKRKHGHERRAKGKRWR---------------------STVNSSL 365

Query: 365 NDNRAVGTTHGKFIDGAEEKMKEEHTQGAKENLMEMENKMVQEENLNSENDTSREKN 397
            +N  V   H         K+ ++  +G++   +  E    +E+   ++N T ++K+
Sbjct: 366 MENNVVSDNH-----MGNRKVYKDEAKGSRVGKVYNEENFAREDERRNKNSTRKDKS 371

BLAST of CSPI06G10000.1 vs. TrEMBL
Match: K7KY66_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_06G297400 PE=4 SV=1)

HSP 1 Score: 140.2 bits (352), Expect = 6.2e-30
Identity = 143/430 (33.26%), Postives = 226/430 (52.56%), Query Frame = 1

Query: 5   GSLQSKASSSRGRSYALMLLIAFGAALLGVMLLNKFRERRICNLLLKRTDQDLRSFQLLF 64
           G   S   S RGR Y L+LLI FG ALLGVM+L+K RERRIC LL+K  DQ + + QLL 
Sbjct: 6   GVHSSSGGSHRGRPYVLVLLITFGIALLGVMVLHKLRERRICTLLVKEKDQQILALQLLL 65

Query: 65  QKEKDRSKELAKSHEDMTAALYVLRTKKMELDRRLLELQSTIDSLRDEQKIISVALQEKQ 124
           QKE+DRSKEL   +E+M   LY LR++KMEL R + E+QST+DSL+DEQK++  A +E+Q
Sbjct: 66  QKERDRSKELRSKNEEMKGKLYTLRSQKMELARTVGEMQSTLDSLKDEQKLMESAFEEQQ 125

Query: 125 SEIKTLREKEIESGNENPQVVALTKSLKQKEDELEELRRRLESPAAVAANDSSDSDPRGD 184
           +E+++++EK          + AL  +LK KE+ELE+L+RRLE+P  V  + +  ++    
Sbjct: 126 NELRSMQEKGKSVAQGGYDIEALRDNLKHKEEELEDLKRRLETP--VNDHPTIFNEIVTA 185

Query: 185 SKTSEALREQLLESSGGRGESESMDNNRGGSKSTGFHEIETQKLEDHEGVEERQKQEKLG 244
           + T  A  E   + + G       DNN G SKS      E  K +D E   E   ++ + 
Sbjct: 186 NGTIAAQDEIEKDENSGESAKHEGDNNEGASKS------ELTKFKDGEVATE--MRDGIR 245

Query: 245 DEGENAIGGELEGEETKITNEHKEKEVGHSEEVEN-----GRANKIARSEIKMKTETGKY 304
            +G+ A   +++  E     E K     H+ +VEN     GR  ++A ++        K+
Sbjct: 246 TDGDGAT-KDMDDAEVVDGREKKAMREEHAGQVENNTDGGGRVKQLAGTK-------RKH 305

Query: 305 GNTRKIRGKRWRYIAKRRAVDNGWRLISKKSKKDSRNRNLNDNRAVGTTHGKFIDGAEEK 364
               +++GKRWR   K  +++N     +   +  S NR +  +   G   GK  +     
Sbjct: 306 SRASRMKGKRWRTTVKNSSMEN-----NGVFENHSDNRKVYKDELKGRRVGKVSNEENFA 365

Query: 365 MKEEHTQGAKENLMEMENKMVQEENLNSENDTSREKNGNGDVEDDSIKQN-PGEEMEQND 424
            ++E      +   +   K+++ EN  S+ D +  K  N   +  +   N   +++    
Sbjct: 366 REDEGRNNNSQRKEKPHAKLLKTENHESKEDANDMKVNNTKHQVTNSGSNIYSQKILDEI 412

Query: 425 SKSEENQEPE 429
            +SEEN++ +
Sbjct: 426 RQSEENEQSQ 412

BLAST of CSPI06G10000.1 vs. NCBI nr
Match: gi|778712727|ref|XP_011656925.1| (PREDICTED: paralemmin-3-like [Cucumis sativus])

HSP 1 Score: 785.8 bits (2028), Expect = 4.0e-224
Identity = 446/452 (98.67%), Postives = 446/452 (98.67%), Query Frame = 1

Query: 1   MGSVGSLQSKASSSRGRSYALMLLIAFGAALLGVMLLNKFRERRICNLLLKRTDQDLRSF 60
           MGSVGSLQSKASSSRGRSYALMLLIAFGAALLGVMLLNKFRERRICNLLLKRTDQDLRSF
Sbjct: 1   MGSVGSLQSKASSSRGRSYALMLLIAFGAALLGVMLLNKFRERRICNLLLKRTDQDLRSF 60

Query: 61  QLLFQKEKDRSKELAKSHEDMTAALYVLRTKKMELDRRLLELQSTIDSLRDEQKIISVAL 120
           QLLFQKEKDRSKELAKSHEDMTAALYVLRTKKMELDRRLLELQSTIDSLRDEQKI SVAL
Sbjct: 61  QLLFQKEKDRSKELAKSHEDMTAALYVLRTKKMELDRRLLELQSTIDSLRDEQKITSVAL 120

Query: 121 QEKQSEIKTLREKEIESGNENPQVVALTKSLKQKEDELEELRRRLESPAAVAANDSSDSD 180
           QEKQSEIKTLREKEIESGNENPQVVALTKSLKQKEDELEELRRRLESPAAVAANDSSDSD
Sbjct: 121 QEKQSEIKTLREKEIESGNENPQVVALTKSLKQKEDELEELRRRLESPAAVAANDSSDSD 180

Query: 181 PRGDSKTSEALREQLLESSGGRGESESMDNNRGGSKSTGFHEIETQKLEDHEGVEERQKQ 240
           PRGDSKTSE LREQLLESSGGRGESESMDNNRGGSKSTGFHEIETQKLE HEGVEERQKQ
Sbjct: 181 PRGDSKTSETLREQLLESSGGRGESESMDNNRGGSKSTGFHEIETQKLEGHEGVEERQKQ 240

Query: 241 EKLGDEGENAIGGELEGEETKITNEHKEKEVGHSEEVENGRANKIARSEIKMKTETGKYG 300
           EKLGDEGENAIG ELEGEETKITNEHKEKEVGHSEEVENGRANKIARSEIKMKTETGKYG
Sbjct: 241 EKLGDEGENAIGRELEGEETKITNEHKEKEVGHSEEVENGRANKIARSEIKMKTETGKYG 300

Query: 301 NTRKIRGKRWRYIAKRRAVDNGWRLISKKSKKDSRNRNLNDNRAVGTTHGKFIDGAEEKM 360
           NTRKIRGKRWRYIAKRRAVDNGWRLISKKSKKDSRNRNLNDNRA GTTHGKFIDGAEEKM
Sbjct: 301 NTRKIRGKRWRYIAKRRAVDNGWRLISKKSKKDSRNRNLNDNRADGTTHGKFIDGAEEKM 360

Query: 361 KEEHTQGAKENLMEMENKMVQEENLNSENDTSREKNGNGDVEDDSIKQNPGEEMEQNDSK 420
           KEEHTQGAKENLMEMENKMVQ ENLNSENDTSREKNGNGDVEDDSIKQNPGEEMEQNDSK
Sbjct: 361 KEEHTQGAKENLMEMENKMVQ-ENLNSENDTSREKNGNGDVEDDSIKQNPGEEMEQNDSK 420

Query: 421 SEENQEPERGMDSKEDGKEEEENKEEPEESEF 453
           SEENQEPERGMDSKEDGKEEEENKEEPEESEF
Sbjct: 421 SEENQEPERGMDSKEDGKEEEENKEEPEESEF 451

BLAST of CSPI06G10000.1 vs. NCBI nr
Match: gi|659094716|ref|XP_008448207.1| (PREDICTED: glutamic acid-rich protein-like [Cucumis melo])

HSP 1 Score: 666.4 bits (1718), Expect = 3.6e-188
Identity = 394/455 (86.59%), Postives = 414/455 (90.99%), Query Frame = 1

Query: 1   MGSVGSLQSKASSSRGRSYALMLLIAFGAALLGVMLLNKFRERRICNLLLKRTDQDLRSF 60
           MGSVGSLQSKASSSRGRSYALMLLIAFGAALLGVMLL+KFRERRICNLLLKRTDQDLRSF
Sbjct: 1   MGSVGSLQSKASSSRGRSYALMLLIAFGAALLGVMLLHKFRERRICNLLLKRTDQDLRSF 60

Query: 61  QLLFQKEKDRSKELAKSHEDMTAALYVLRTKKMELDRRLLELQSTIDSLRDEQKIISVAL 120
           QLLFQKE DRSKELAKS EDMTAALY+LR++KMELDRRLLELQSTIDSLRDEQKI SVAL
Sbjct: 61  QLLFQKETDRSKELAKSLEDMTAALYLLRSQKMELDRRLLELQSTIDSLRDEQKITSVAL 120

Query: 121 QEKQSEIKTLREKEIESGNENPQVVALTKSLKQKEDELEELRRRLESPAAVAANDSSDSD 180
           QEKQ EIK LREKEI+SGN+NPQVVALT+SLKQKEDELEELRRR +SP A+AANDS  SD
Sbjct: 121 QEKQREIKALREKEIDSGNDNPQVVALTQSLKQKEDELEELRRRHKSPGAIAANDS--SD 180

Query: 181 PRGDSKTSEALREQLLESSGGRGESESMDNNRGGSKSTGFHEIETQKLED-HEGVEERQK 240
           PRGDS  SE  R+QLLES  G+GESESMDNNRGGS STGFHEIETQKLE+ HEG EERQK
Sbjct: 181 PRGDSTMSEEKRDQLLESGNGKGESESMDNNRGGSTSTGFHEIETQKLEETHEGEEERQK 240

Query: 241 QEKLGDEGENAIGGELEGEETKITNEHKEKEVGHSEEVENGRANKIARSEIKMKTETGKY 300
           QEKLGDEGENA G E+EG ETKITNEHKEKE G  EEVENGRANKIARSEI+MKTETGKY
Sbjct: 241 QEKLGDEGENANGREVEG-ETKITNEHKEKEGGDFEEVENGRANKIARSEIRMKTETGKY 300

Query: 301 GNTRKIRGKRWRYIAKRRAVDNGWRLISKKSKKDS-RNRNLNDNRAVGTTHGKFIDGAEE 360
           GNTRKIRGKRWRYI KRRAVDNGWRLISKK KK++  NRNLND RA+GTTHGKF DGAEE
Sbjct: 301 GNTRKIRGKRWRYIVKRRAVDNGWRLISKKMKKENGNNRNLNDIRAIGTTHGKFTDGAEE 360

Query: 361 KMKEEHTQGAKENLMEMENKMVQEENLNSEN-DTSREKNGNGDVEDDSIKQNPGEEMEQN 420
           KMKEE TQ AKENLME ENKMVQEENLNSEN DTSREKNGNGDV DD IK+NPGEEME N
Sbjct: 361 KMKEERTQEAKENLMEKENKMVQEENLNSENDDTSREKNGNGDVGDDRIKKNPGEEMEPN 420

Query: 421 DSKSEENQEPERGMDSKEDGKEEEENKEEPEESEF 453
           D KS+E +EPERG D KEDGKEEEENKEEPEESEF
Sbjct: 421 DLKSKEKEEPERGTDFKEDGKEEEENKEEPEESEF 452

BLAST of CSPI06G10000.1 vs. NCBI nr
Match: gi|590690405|ref|XP_007043499.1| (Micronuclear linker histone polyprotein-like protein [Theobroma cacao])

HSP 1 Score: 161.8 bits (408), Expect = 2.9e-36
Identity = 128/304 (42.11%), Postives = 183/304 (60.20%), Query Frame = 1

Query: 1   MGSVGSLQSKASSSRGRSYALMLLIAFGAALLGVMLLNKFRERRICNLLLKRTDQDLRSF 60
           MG V S +   +S+RGR Y LMLL+AFGAALLGVM+L+K RERRI NLL++  ++ L S 
Sbjct: 3   MGGV-SHKGNGNSNRGRPYGLMLLVAFGAALLGVMVLHKLRERRIFNLLVEDKNRQLISL 62

Query: 61  QLLFQKEKDRSKELAKSHEDMTAALYVLRTKKMELDRRLLELQSTIDSLRDEQKIISVAL 120
           QLL QKE++  KE+ ++ E+  A +Y LR +KMELDRRLLE+QS I+SL+DEQK +  AL
Sbjct: 63  QLLLQKEREYMKEMKRNAEETKAKIYFLRNQKMELDRRLLEMQSAIESLKDEQKTMESAL 122

Query: 121 QEKQSEIKTLREKEIESGNENPQVVALTKSLKQKEDELEELRRRLESPA---AVAANDSS 180
           +EKQ EI  L+EK ++SGNENPQV+ALT +LKQKE E+E L+ RL+SP    +V+A+D S
Sbjct: 123 EEKQYEIILLQEKHVDSGNENPQVLALTATLKQKEAEIEVLKHRLKSPVRVWSVSADDKS 182

Query: 181 D--SDPRGDSKTSEALREQLLESSGGR-GESESMDNNRGGSKSTGFHEIETQKLEDHEGV 240
           +   +        E  + +  +  GGR  ES +  +    +K     EI++   ++ +  
Sbjct: 183 NLPVNITVTGSMEEKEKTEFSQEEGGRVHESTAYKDGDNSTKDQDRSEIKSNFSQEEQNR 242

Query: 241 EERQKQEKLGDEGENAIGGELEGEETKITNEHKEKEVGHSEEVENGRANKIARSEIKMKT 299
           EE +   K   +GE  +  ++ G          +K V   E   N  A    R+E    T
Sbjct: 243 EEVEDGSK--KKGETTLRMDMAG------GGQLQKPVSLGENARNEGAAGEMRNEYSQYT 297

BLAST of CSPI06G10000.1 vs. NCBI nr
Match: gi|743893215|ref|XP_011039889.1| (PREDICTED: uncharacterized protein LOC105136297 [Populus euphratica])

HSP 1 Score: 157.5 bits (397), Expect = 5.4e-35
Identity = 88/155 (56.77%), Postives = 119/155 (76.77%), Query Frame = 1

Query: 1   MGSVGSLQSKASSSRGRSYALMLLIAFGAALLGVMLLNKFRERRICNLLLKRTDQDLRSF 60
           M + G+ +   + +RGR Y LMLL+AFGAALLGVM+L+KFRERRI NLL+K  D++L S 
Sbjct: 1   MAAGGAHKGNGNGNRGRPYGLMLLLAFGAALLGVMVLHKFRERRIFNLLVKEKDRELMSL 60

Query: 61  QLLFQKEKDRSKELAKSHEDMTAALYVLRTKKMELDRRLLELQSTIDSLRDEQKIISVAL 120
           QL+ QKE++RSKE+ +  E+M   +Y LRT+KMELDRR LELQSTIDS++DEQKI+  AL
Sbjct: 61  QLILQKERERSKEMKRKAEEMKGKIYSLRTQKMELDRRQLELQSTIDSIKDEQKIMESAL 120

Query: 121 QEKQSEIKTLREKEIESGNENPQVVALTKSLKQKE 156
           +EK++EIK LRE  I++   N Q+  L +SLK+ +
Sbjct: 121 EEKRNEIKMLREMNIDADKGNLQLADLIESLKKSK 155

BLAST of CSPI06G10000.1 vs. NCBI nr
Match: gi|1009127565|ref|XP_015880761.1| (PREDICTED: uncharacterized protein LOC107416748 [Ziziphus jujuba])

HSP 1 Score: 153.3 bits (386), Expect = 1.0e-33
Identity = 114/278 (41.01%), Postives = 169/278 (60.79%), Query Frame = 1

Query: 14  SRGRSYALMLLIAFGAALLGVMLLNKFRERRICNLLLKRTDQDLRSFQLLFQKEKDRSKE 73
           SRGR Y +MLL+AFGAALLGVM+L+K RERRI NLLLK  D +L S  LL QKE+DR +E
Sbjct: 20  SRGRPYGVMLLLAFGAALLGVMILHKLRERRIFNLLLKDRDNELISIHLLLQKERDRVQE 79

Query: 74  LAKSHEDMTAALYVLRTKKMELDRRLLELQSTIDSLRDEQKIISVALQEKQSEIKTLREK 133
           + + +EDM A +Y LR +KME+DRRL+E+QSTIDSL+DEQ+ + VA++EKQ+EIK LR K
Sbjct: 80  VNRKNEDMKANIYTLRNQKMEVDRRLIEMQSTIDSLKDEQRTMEVAIEEKQNEIKMLRLK 139

Query: 134 E-IESGNENPQVVALTKSLKQKEDELEELRRRLESPAAVAANDSSDSDPRGDSKTSEALR 193
           E   +  EN QV  L ++LKQK+ E+E+L++RL++P  +    SSD DP   S     + 
Sbjct: 140 ESATTETENTQVTELIETLKQKDAEIEDLKKRLQNPVKM----SSDDDPSNPSVNLTTVT 199

Query: 194 EQLL-ESSGGRGESESMDNNRGGSKSTGFHEIETQKLEDHEGVEERQKQEKLGDEGENAI 253
              + +     G+ + +  ++    S      + Q++E    + E+Q Q K  ++ ++  
Sbjct: 200 GNTVGKEKTAEGDGKIIVTDQKSKNS------QDQRIEYGTSMGEKQGQGKKSEDFKDD- 259

Query: 254 GGELEGEETKITNEHKEKEVGHSEEVENGRANKIARSE 290
                 EE  I       + G  +  ENG   ++  ++
Sbjct: 260 ----GSEENNIAASETISDGGGKDIEENGELGRLENAQ 282

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0KG17_CUCSA2.8e-22498.67Uncharacterized protein OS=Cucumis sativus GN=Csa_6G124060 PE=4 SV=1[more]
A0A061E2Z9_THECC2.0e-3642.11Micronuclear linker histone polyprotein-like protein OS=Theobroma cacao GN=TCM_0... [more]
W9R083_9ROSA4.6e-3335.76Uncharacterized protein OS=Morus notabilis GN=L484_022179 PE=4 SV=1[more]
V7AGA6_PHAVU1.5e-3133.81Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_011G112100g PE=4 SV=1[more]
K7KY66_SOYBN6.2e-3033.26Uncharacterized protein OS=Glycine max GN=GLYMA_06G297400 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
gi|778712727|ref|XP_011656925.1|4.0e-22498.67PREDICTED: paralemmin-3-like [Cucumis sativus][more]
gi|659094716|ref|XP_008448207.1|3.6e-18886.59PREDICTED: glutamic acid-rich protein-like [Cucumis melo][more]
gi|590690405|ref|XP_007043499.1|2.9e-3642.11Micronuclear linker histone polyprotein-like protein [Theobroma cacao][more]
gi|743893215|ref|XP_011039889.1|5.4e-3556.77PREDICTED: uncharacterized protein LOC105136297 [Populus euphratica][more]
gi|1009127565|ref|XP_015880761.1|1.0e-3341.01PREDICTED: uncharacterized protein LOC107416748 [Ziziphus jujuba][more]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
CSPI06G10000CSPI06G10000gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
CSPI06G10000.1CSPI06G10000.1-proteinpolypeptide


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
CSPI06G10000.1.cds2CSPI06G10000.1.cds2CDS
CSPI06G10000.1.cds1CSPI06G10000.1.cds1CDS


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
CSPI06G10000.1.utr5p1CSPI06G10000.1.utr5p1five_prime_UTR


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableunknownCoilCoilcoord: 365..385
score: -coord: 144..164
score: -coord: 92..112
scor
NoneNo IPR availablePANTHERPTHR36143FAMILY NOT NAMEDcoord: 1..451
score: 1.7
NoneNo IPR availablePANTHERPTHR36143:SF2SUBFAMILY NOT NAMEDcoord: 1..451
score: 1.7