Csa4G000960.1 (mRNA) Cucumber (Chinese Long) v2

NameCsa4G000960.1
TypemRNA
OrganismCucumis. sativus (Cucumber (Chinese Long) v2)
DescriptionPutative uncharacterized protein P0576F08.26; contains IPR024738 (Transcriptional coactivator Hfi1/Transcriptional adapter 1)
LocationChr4 : 247797 .. 250274 (+)
Sequence length1389
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCTTGGAATCAGGGCGGGAAAGGGAAGCAGTGTGGATGTTATCCGATGCTTCCAAAAGCAAAGGGCGTTCGTAAGTAATCAATCGTTCAAAAATTATATCAATATTTAAACGTTTTAAACTTTCATATTTCAAAATAAAATAACCTTCGATCTTTTGTAGTTAAATCCATAAATATATTAATAATAAAAAGTTGTGTTATGCTTTGTTTCTTTTTCCTTTTTAATATTAGTTAAATAAATGAGAGAAAATATTGTAAGAAAAATAATATTTTTAAAAATAGTAAAATAAATTAAAATATTTATAAATTATATCATAATACATAAATAAATAAATATATAAGAGAAAGAAAATAAAATACATTTGTAGACTTTGGGAAAATAATAAAAAAAAAAAAAAAAAAAAAAAGAAAATAGAAAGAAAGGAAAAGGGCGCACGTGCGTAATTGAATAACGATACGATTGGCGGAAGACAGACACAATCGCAAGAACACCCCGTCGGAAGGTCGCTGTTTTTGCTTTTGGTCTCCAATCTCCGAATCACCCAAAAATTCCAATCCAAACCCATTAAACAATTTACACAAAATCATCTTAATTTTACGATTCTTCATTCACAAGACGGCGCAATTTCATTTCAATTCTTTTGTAAGTATCTGAATTCGAATTCTATGATTCCCCTTACTAATGATTAGGGTTTTCACTTCCATCTTCCAATTTTCAACTTTGATTTCAATGCTTTTTATCCTTCATTTCTTTTGGGGTTTTACTTTTTTTTTTTTTTTTCATTTTGTTTGTACAATGATTCAAAAACCCAACTCTTGTTGCCTTACGATTTGGCTGCAATCCATTTTTTAAAAAAAAATTGTTTTGTTTGTTTCCAGCTGTTCTTTACTTTGGGGTTTCTTCCTTCTGATTCCCAAACTTCTACAATTGGTATTCAAATGCTTCAGTCAGATATTGGGATGTCAATATTGTTTTGAATACTTAAGCTAAGCTGTGAAAATTAGGGTTTCTGGCTTTCTGATCAAAGATTTATTCTGCGGTTTTCTCAGTTTATCTGTTTTGTTGGTGGAGGATTCCTGAGGGCTTTGTCTCAAATTCAGCCATTGCTGGAATTGCGGCTCTGAAAATGCTTCCCAGGAAAGACACTTCTCGTATAGACACTTCCGAGCTGAAAGCGATGATATATCGAAAGCTTGGGCATCAGAGATCAGATAAATACTTTGATCAGCTCAAGAAATTGTTAAGTTTAAAGACCAACAAAAGGGAATTCGACAAGTTTTGTATTCAGATTATTGGGAGGGAAATTATACCTCTTCATAATCGGCTTATTAGAGCGATTCTTCAAAATGCTTGTGTGGCTAAAACTCCCCCTGTTCTTAGCAGTACAAGGAAAGTTGGAGGCAATCTCAGTGTAAAGGTTGTGAATGGATATCAAAGGAGTTGTCTTCAATCACTTCATGGGGATGCATTCCTTTCTTCTCCTCGAAAGGGTAGGTCGCCGGTTAGTAGAGACCGTAAGATTCGAGATCGTCCAAGTCCTTTGGGACCATGTGGGAAGCCACAGAATATGGCACTTGAAGAATTTGCTTCTAAGGCACAAGAACAGCAAAGTGCCACAGAGTTACATTCTCTTGGCAGCCGTCCTCCCGTCGAAATGGCGTCTGTAGAAGACGGAGAAGAGGTTGAGCAGGTGGCTGGAAGTCCAGGAGTTCAAAGCAGAAGCCCAGTTACTGCTCCGCTTGGTATCTCGATGAACTTCATTGGTTCCGGTAAGACTCTCTCCAATGTTCCCGTTGGAAGTAATTACCATGTAACAACATGCCAAGATGTTGGCGAGTTACCAGACACGAGGTTGCTGAGAACTCATTTAAGGAAGAAGTTGGAAACGGAGCAGATTGATATATCTGTAGATGGTGTAAACCTTCTTAACAATGCATTGGATGTTTATTTAAAGAGGTTAATCGAGCCATGTTTGAATTTCTCTCGGTCAAGGTGTGAGCGACTGAAATTTACAGGCAATCAACCAATAACTGGCTCAAGAATCACATTCCAGGAACAACATCGGCATCGAGCTCAACAATTAAATAACGGATCCTTGTTGGACTTCCGTGTTGCAATGCAACTGAATCCTCAAGTACTTGGGAGAGAGTGGACGATGCAACTCGAGAAAATCAGTTTACGAGCTTCTGAAGAGTGAGCTGACCAAATCTCACATCAGAAAAAGTATAGTAGGGTCTGAATATATAGTCAAAAGGTGATCGATCATATATTGTGAATAATTTGTCGGGTTCTGTGTCTTTCAAAGTCCCTTGCTAATTTTTGCGATTCTTGTGAGGATAGGTTTTCAGATCCATTATGAGCCTTAAATCAAGTTGATTAGGTTACATGTAGATAAAAGGCTCTAATGATCATTTGTCTTGAGGAATATGTTTATGGTCTAATAAAGGCATACAACTTCACTTGCGCCTGCGTGACG

mRNA sequence

ATGCTTGGAATCAGGGCGGGAAAGGGAAGCAGTGTGGATGTTATCCGATGCTTCCAAAAGCAAAGGGCGTTCACACAATCGCAAGAACACCCCGTCGGAAGGTCGCTGTTTTTGCTTTTGGTCTCCAATCTCCGAATCACCCAAAAATTCCAATCCAAACCCATTAAACAATTTACACAAAATCATCTTAATTTTACGATTCTTCATTCACAAGACGGCGCAATTTCATTTCAATTCTTTTTTTATCTGTTTTGTTGGTGGAGGATTCCTGAGGGCTTTGTCTCAAATTCAGCCATTGCTGGAATTGCGGCTCTGAAAATGCTTCCCAGGAAAGACACTTCTCGTATAGACACTTCCGAGCTGAAAGCGATGATATATCGAAAGCTTGGGCATCAGAGATCAGATAAATACTTTGATCAGCTCAAGAAATTGTTAAGTTTAAAGACCAACAAAAGGGAATTCGACAAGTTTTGTATTCAGATTATTGGGAGGGAAATTATACCTCTTCATAATCGGCTTATTAGAGCGATTCTTCAAAATGCTTGTGTGGCTAAAACTCCCCCTGTTCTTAGCAGTACAAGGAAAGTTGGAGGCAATCTCAGTGTAAAGGTTGTGAATGGATATCAAAGGAGTTGTCTTCAATCACTTCATGGGGATGCATTCCTTTCTTCTCCTCGAAAGGGTAGGTCGCCGGTTAGTAGAGACCGTAAGATTCGAGATCGTCCAAGTCCTTTGGGACCATGTGGGAAGCCACAGAATATGGCACTTGAAGAATTTGCTTCTAAGGCACAAGAACAGCAAAGTGCCACAGAGTTACATTCTCTTGGCAGCCGTCCTCCCGTCGAAATGGCGTCTGTAGAAGACGGAGAAGAGGTTGAGCAGGTGGCTGGAAGTCCAGGAGTTCAAAGCAGAAGCCCAGTTACTGCTCCGCTTGGTATCTCGATGAACTTCATTGGTTCCGGTAAGACTCTCTCCAATGTTCCCGTTGGAAGTAATTACCATGTAACAACATGCCAAGATGTTGGCGAGTTACCAGACACGAGGTTGCTGAGAACTCATTTAAGGAAGAAGTTGGAAACGGAGCAGATTGATATATCTGTAGATGGTGTAAACCTTCTTAACAATGCATTGGATGTTTATTTAAAGAGGTTAATCGAGCCATGTTTGAATTTCTCTCGGTCAAGGTGTGAGCGACTGAAATTTACAGGCAATCAACCAATAACTGGCTCAAGAATCACATTCCAGGAACAACATCGGCATCGAGCTCAACAATTAAATAACGGATCCTTGTTGGACTTCCGTGTTGCAATGCAACTGAATCCTCAAGTACTTGGGAGAGAGTGGACGATGCAACTCGAGAAAATCAGTTTACGAGCTTCTGAAGAGTGA

Coding sequence (CDS)

ATGCTTGGAATCAGGGCGGGAAAGGGAAGCAGTGTGGATGTTATCCGATGCTTCCAAAAGCAAAGGGCGTTCACACAATCGCAAGAACACCCCGTCGGAAGGTCGCTGTTTTTGCTTTTGGTCTCCAATCTCCGAATCACCCAAAAATTCCAATCCAAACCCATTAAACAATTTACACAAAATCATCTTAATTTTACGATTCTTCATTCACAAGACGGCGCAATTTCATTTCAATTCTTTTTTTATCTGTTTTGTTGGTGGAGGATTCCTGAGGGCTTTGTCTCAAATTCAGCCATTGCTGGAATTGCGGCTCTGAAAATGCTTCCCAGGAAAGACACTTCTCGTATAGACACTTCCGAGCTGAAAGCGATGATATATCGAAAGCTTGGGCATCAGAGATCAGATAAATACTTTGATCAGCTCAAGAAATTGTTAAGTTTAAAGACCAACAAAAGGGAATTCGACAAGTTTTGTATTCAGATTATTGGGAGGGAAATTATACCTCTTCATAATCGGCTTATTAGAGCGATTCTTCAAAATGCTTGTGTGGCTAAAACTCCCCCTGTTCTTAGCAGTACAAGGAAAGTTGGAGGCAATCTCAGTGTAAAGGTTGTGAATGGATATCAAAGGAGTTGTCTTCAATCACTTCATGGGGATGCATTCCTTTCTTCTCCTCGAAAGGGTAGGTCGCCGGTTAGTAGAGACCGTAAGATTCGAGATCGTCCAAGTCCTTTGGGACCATGTGGGAAGCCACAGAATATGGCACTTGAAGAATTTGCTTCTAAGGCACAAGAACAGCAAAGTGCCACAGAGTTACATTCTCTTGGCAGCCGTCCTCCCGTCGAAATGGCGTCTGTAGAAGACGGAGAAGAGGTTGAGCAGGTGGCTGGAAGTCCAGGAGTTCAAAGCAGAAGCCCAGTTACTGCTCCGCTTGGTATCTCGATGAACTTCATTGGTTCCGGTAAGACTCTCTCCAATGTTCCCGTTGGAAGTAATTACCATGTAACAACATGCCAAGATGTTGGCGAGTTACCAGACACGAGGTTGCTGAGAACTCATTTAAGGAAGAAGTTGGAAACGGAGCAGATTGATATATCTGTAGATGGTGTAAACCTTCTTAACAATGCATTGGATGTTTATTTAAAGAGGTTAATCGAGCCATGTTTGAATTTCTCTCGGTCAAGGTGTGAGCGACTGAAATTTACAGGCAATCAACCAATAACTGGCTCAAGAATCACATTCCAGGAACAACATCGGCATCGAGCTCAACAATTAAATAACGGATCCTTGTTGGACTTCCGTGTTGCAATGCAACTGAATCCTCAAGTACTTGGGAGAGAGTGGACGATGCAACTCGAGAAAATCAGTTTACGAGCTTCTGAAGAGTGA

Protein sequence

MLGIRAGKGSSVDVIRCFQKQRAFTQSQEHPVGRSLFLLLVSNLRITQKFQSKPIKQFTQNHLNFTILHSQDGAISFQFFFYLFCWWRIPEGFVSNSAIAGIAALKMLPRKDTSRIDTSELKAMIYRKLGHQRSDKYFDQLKKLLSLKTNKREFDKFCIQIIGREIIPLHNRLIRAILQNACVAKTPPVLSSTRKVGGNLSVKVVNGYQRSCLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGKPQNMALEEFASKAQEQQSATELHSLGSRPPVEMASVEDGEEVEQVAGSPGVQSRSPVTAPLGISMNFIGSGKTLSNVPVGSNYHVTTCQDVGELPDTRLLRTHLRKKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCERLKFTGNQPITGSRITFQEQHRHRAQQLNNGSLLDFRVAMQLNPQVLGREWTMQLEKISLRASEE*
BLAST of Csa4G000960.1 vs. TrEMBL
Match: A0A0A0KWF9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G000960 PE=4 SV=1)

HSP 1 Score: 919.5 bits (2375), Expect = 1.7e-264
Identity = 462/462 (100.00%), Postives = 462/462 (100.00%), Query Frame = 1

Query: 1   MLGIRAGKGSSVDVIRCFQKQRAFTQSQEHPVGRSLFLLLVSNLRITQKFQSKPIKQFTQ 60
           MLGIRAGKGSSVDVIRCFQKQRAFTQSQEHPVGRSLFLLLVSNLRITQKFQSKPIKQFTQ
Sbjct: 1   MLGIRAGKGSSVDVIRCFQKQRAFTQSQEHPVGRSLFLLLVSNLRITQKFQSKPIKQFTQ 60

Query: 61  NHLNFTILHSQDGAISFQFFFYLFCWWRIPEGFVSNSAIAGIAALKMLPRKDTSRIDTSE 120
           NHLNFTILHSQDGAISFQFFFYLFCWWRIPEGFVSNSAIAGIAALKMLPRKDTSRIDTSE
Sbjct: 61  NHLNFTILHSQDGAISFQFFFYLFCWWRIPEGFVSNSAIAGIAALKMLPRKDTSRIDTSE 120

Query: 121 LKAMIYRKLGHQRSDKYFDQLKKLLSLKTNKREFDKFCIQIIGREIIPLHNRLIRAILQN 180
           LKAMIYRKLGHQRSDKYFDQLKKLLSLKTNKREFDKFCIQIIGREIIPLHNRLIRAILQN
Sbjct: 121 LKAMIYRKLGHQRSDKYFDQLKKLLSLKTNKREFDKFCIQIIGREIIPLHNRLIRAILQN 180

Query: 181 ACVAKTPPVLSSTRKVGGNLSVKVVNGYQRSCLQSLHGDAFLSSPRKGRSPVSRDRKIRD 240
           ACVAKTPPVLSSTRKVGGNLSVKVVNGYQRSCLQSLHGDAFLSSPRKGRSPVSRDRKIRD
Sbjct: 181 ACVAKTPPVLSSTRKVGGNLSVKVVNGYQRSCLQSLHGDAFLSSPRKGRSPVSRDRKIRD 240

Query: 241 RPSPLGPCGKPQNMALEEFASKAQEQQSATELHSLGSRPPVEMASVEDGEEVEQVAGSPG 300
           RPSPLGPCGKPQNMALEEFASKAQEQQSATELHSLGSRPPVEMASVEDGEEVEQVAGSPG
Sbjct: 241 RPSPLGPCGKPQNMALEEFASKAQEQQSATELHSLGSRPPVEMASVEDGEEVEQVAGSPG 300

Query: 301 VQSRSPVTAPLGISMNFIGSGKTLSNVPVGSNYHVTTCQDVGELPDTRLLRTHLRKKLET 360
           VQSRSPVTAPLGISMNFIGSGKTLSNVPVGSNYHVTTCQDVGELPDTRLLRTHLRKKLET
Sbjct: 301 VQSRSPVTAPLGISMNFIGSGKTLSNVPVGSNYHVTTCQDVGELPDTRLLRTHLRKKLET 360

Query: 361 EQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCERLKFTGNQPITGSRITFQEQHRH 420
           EQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCERLKFTGNQPITGSRITFQEQHRH
Sbjct: 361 EQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCERLKFTGNQPITGSRITFQEQHRH 420

Query: 421 RAQQLNNGSLLDFRVAMQLNPQVLGREWTMQLEKISLRASEE 463
           RAQQLNNGSLLDFRVAMQLNPQVLGREWTMQLEKISLRASEE
Sbjct: 421 RAQQLNNGSLLDFRVAMQLNPQVLGREWTMQLEKISLRASEE 462

BLAST of Csa4G000960.1 vs. TrEMBL
Match: B9HMX5_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0009s09370g PE=4 SV=2)

HSP 1 Score: 427.9 bits (1099), Expect = 1.5e-116
Identity = 222/349 (63.61%), Postives = 266/349 (76.22%), Query Frame = 1

Query: 114 SRIDTSELKAMIYRKLGHQRSDKYFDQLKKLLSLKTNKREFDKFCIQIIGREIIPLHNRL 173
           SRIDT ELK++I +K+GHQR+DKYFD+L +L SLK  K EFDK CI+IIGRE IPLHNRL
Sbjct: 8   SRIDTLELKSLILKKIGHQRADKYFDELTQLFSLKITKCEFDKLCIRIIGRENIPLHNRL 67

Query: 174 IRAILQNACVAKTPPVLSSTRKVGGNLSVKVVNGYQRSCLQSLHGDAFLSSPRKGRSPVS 233
           IR+IL+NAC+ K PP     R+ G NL+VK  NG+QR+ LQSL+ DAF SSPRKGRSPV+
Sbjct: 68  IRSILKNACLGKVPPP-KGVRRAGSNLTVKTTNGHQRNYLQSLYRDAFPSSPRKGRSPVN 127

Query: 234 RDRKIRDRPSPLGPCGKPQNMALEEFASKAQEQQSATELHSLGSRPPVEMASVEDGEEVE 293
           RDRK RDRPSPLGP GKPQ+MA EE  S+AQEQQSATELHSLGSRPP+E+ASVE+GEEVE
Sbjct: 128 RDRKFRDRPSPLGPLGKPQSMACEELNSRAQEQQSATELHSLGSRPPIEVASVEEGEEVE 187

Query: 294 QVAGSPGVQSRSPVTAPLGISMNFIGSGKTLSNVPVGSNYHVTTCQDVGELPDTRLLRTH 353
           Q+A SPGVQSRSPVTAP GIS+N  GS K LSN+ +GSNY   TC + GELPDTR LR+ 
Sbjct: 188 QMAVSPGVQSRSPVTAPFGISLNPGGSRKALSNISIGSNYIPETCLNSGELPDTRSLRSR 247

Query: 354 LRKKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCERLKFTGNQPITGSRIT 413
           L +KLE E I +S+D VN+LN  LD YLKRLIEPC+  + +RC+  +  G          
Sbjct: 248 LERKLEMEGIGVSLDCVNVLNIGLDAYLKRLIEPCMALAGARCDSEQLKG---------- 307

Query: 414 FQEQHRHRAQQLNNGSLLDFRVAMQLNPQVLGREWTMQLEKISLRASEE 463
              Q+  R  +  N S+LDFRVAM+ NPQ+LG +W +QLEKISL   EE
Sbjct: 308 ANGQYVKRQTESVNASMLDFRVAMESNPQILGEDWPVQLEKISLSGFEE 345

BLAST of Csa4G000960.1 vs. TrEMBL
Match: A0A067LJK3_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_16307 PE=4 SV=1)

HSP 1 Score: 426.8 bits (1096), Expect = 3.4e-116
Identity = 217/358 (60.61%), Postives = 271/358 (75.70%), Query Frame = 1

Query: 107 MLPRKDTSRIDTSELKAMIYRKLGHQRSDKYFDQLKKLLSLKTNKREFDKFCIQIIGREI 166
           M P +  +RI+T ELKA+I +K+GH+R++KYFDQL +L S K  K EFDKFC++IIGRE 
Sbjct: 1   MSPNQSYTRINTLELKALIVKKIGHERAEKYFDQLTRLFSFKITKSEFDKFCVRIIGREN 60

Query: 167 IPLHNRLIRAILQNACVAKTPPVLSSTRKVGGNLSVKVVNGYQRSCLQSLHGDAFLSSPR 226
           IPLHN LIR+I++NAC++K PP  +  R+   +L+VK  NGY ++CLQSL+GDAF  SPR
Sbjct: 61  IPLHNHLIRSIVKNACLSKVPPQKAIKRQAS-SLNVKTANGYHKNCLQSLYGDAFPPSPR 120

Query: 227 KGRSPVSRDRKIRDRPSPLGPCGKPQNMALEEFASKAQEQQSATELHSLGSRPPVEMASV 286
           KGRSPV+R RK RDRPSPLGP GKPQ++  EE +S+AQEQQSATELHSLGSRPP E+ASV
Sbjct: 121 KGRSPVNRYRKFRDRPSPLGPLGKPQSLVCEELSSRAQEQQSATELHSLGSRPPAEVASV 180

Query: 287 EDGEEVEQVAGSPGVQSRSPVTAPLGISMNFIGSGKTLSNVPVGSNYHVTTCQDVGELPD 346
           E+GEEVEQVAGSPGVQSRSPVTAPLG+SMN  G+ K LS+  V  ++H  TC + GELPD
Sbjct: 181 EEGEEVEQVAGSPGVQSRSPVTAPLGVSMNLGGARKALSSFTVCGSHHQETCVNSGELPD 240

Query: 347 TRLLRTHLRKKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRC--ERLKFTGN 406
           TR LR+ L +KL  E I++S+D VNLLNN LD YLKRLIEPC+  + SRC    LK    
Sbjct: 241 TRSLRSRLEQKLGMEGINVSMDCVNLLNNGLDTYLKRLIEPCMGLASSRCGNGHLKMVNG 300

Query: 407 QPITGSRITFQEQHRHRAQQLNNGSLLDFRVAMQLNPQVLGREWTMQLEKISLRASEE 463
           Q + G       ++  R  +    S+LDF VAM++NPQ+LG +W + LEKISLRASEE
Sbjct: 301 QLLPGLDGRLPGRYMQRRTESVYASMLDFHVAMEVNPQILGEDWIILLEKISLRASEE 357

BLAST of Csa4G000960.1 vs. TrEMBL
Match: W9RH76_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_011520 PE=4 SV=1)

HSP 1 Score: 426.4 bits (1095), Expect = 4.4e-116
Identity = 219/351 (62.39%), Postives = 273/351 (77.78%), Query Frame = 1

Query: 114 SRIDTSELKAMIYRKLGHQRSDKYFDQLKKLLSLKTNKREFDKFCIQIIGREIIPLHNRL 173
           SRIDT ELKA++ +K+G QR++KYFD L++L SLK +K EF+KFCI+ IG+E +PLHN+L
Sbjct: 8   SRIDTLELKALMVQKIGLQRAEKYFDHLRRLFSLKISKCEFNKFCIRTIGKENVPLHNQL 67

Query: 174 IRAILQNACVAKTPPVLSSTRKVGGNLSVKVVNGYQRSCLQSLHGDAFLSSPRKGRSPVS 233
           IR+I++NAC++K PP     RKVG NL+VK+ NG +++ LQSL+GDAF SSPRKGRSPV+
Sbjct: 68  IRSIVKNACLSKVPPT-KGIRKVGSNLNVKIANGLEKNYLQSLYGDAFPSSPRKGRSPVN 127

Query: 234 RDRKIRDRPSPLGPCGKPQNMALEEFASKAQEQQSATELHSLGSRPPVEMASVEDGEEVE 293
           RDRK+RDR SPLGP GKPQ++  EE  SK QEQQSATEL SLGSRPPVE+ASVEDGEEVE
Sbjct: 128 RDRKLRDRLSPLGPIGKPQSVTCEELLSKPQEQQSATELLSLGSRPPVEVASVEDGEEVE 187

Query: 294 QVAGSPGVQSRSPVTAPLGISMNFIGSGKTLSNVPVGSNYHVTTCQDVGELPDTRLLRTH 353
           Q AGSPGVQSRSPVTAPLGISMN  G+ K L N+ +G+NY + TCQ+ GELPDTRLLR+ 
Sbjct: 188 QDAGSPGVQSRSPVTAPLGISMNLGGARKALCNISIGNNYRLETCQNSGELPDTRLLRSR 247

Query: 354 LRKKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRC--ERLKFTGNQPITGSR 413
           +++KLE + I+IS+D VNLLNN LD YLKRLIEPC+  + SRC  E+LK   ++ I G  
Sbjct: 248 VKRKLEMKGINISMDCVNLLNNGLDAYLKRLIEPCMGLAGSRCGNEQLKPFNSRFIHGLN 307

Query: 414 ITFQEQHRHRAQQLNNGSLLDFRVAMQLNPQVLGREWTMQLEKISLRASEE 463
            T   +      +    S+LDF  AMQLNP +LG +W +QLEKI L A EE
Sbjct: 308 KTVPGRFAQNPTKSTCVSMLDFHTAMQLNPHILGEDWAVQLEKIGLHAFEE 357

BLAST of Csa4G000960.1 vs. TrEMBL
Match: A0A061E832_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_007200 PE=4 SV=1)

HSP 1 Score: 424.1 bits (1089), Expect = 2.2e-115
Identity = 226/358 (63.13%), Postives = 274/358 (76.54%), Query Frame = 1

Query: 107 MLPRKDTSRIDTSELKAMIYRKLGHQRSDKYFDQLKKLLSLKTNKREFDKFCIQIIGREI 166
           M+  ++ +R+DT ELKA+I RK+GHQR++KYFDQL++L SLK  K +FDK CI+ IGRE 
Sbjct: 1   MMLNQNYARVDTLELKALIVRKVGHQRAEKYFDQLRRLFSLKIGKCDFDKSCIKTIGREN 60

Query: 167 IPLHNRLIRAILQNACVAKTPPVLSSTRKVGGNLSVKVVNGYQRSCLQSLHGDAFLSSPR 226
           IPLHNRLIR+I++NAC+AK PP L + +K G NL +   NGYQR+ LQSL+GDAF  SPR
Sbjct: 61  IPLHNRLIRSIIKNACIAKVPP-LKTIKKGGSNLQIG--NGYQRNRLQSLYGDAFPPSPR 120

Query: 227 KGRSPVSRDRKIRDRPSPLGPCGKPQNMALEEFASKAQEQQSATELHSLGSRPPVEMASV 286
           KGRSPV+RDRK RDRPSPLGP GKPQ++  EE  SKAQEQ SATEL SLGSRPP E+ASV
Sbjct: 121 KGRSPVNRDRKFRDRPSPLGPLGKPQSIVCEESVSKAQEQ-SATELLSLGSRPPAEVASV 180

Query: 287 EDGEEVEQVAGSPGVQSRSPVTAPLGISMNFIGSGKTLSNVPVGSNYHVTTCQDVGELPD 346
           EDGEEVEQVAGSPGVQSRSPVTAPLGIS+NF G+ K LSN  V +NYH+ TCQ+ GELPD
Sbjct: 181 EDGEEVEQVAGSPGVQSRSPVTAPLGISINFGGARKALSNAFVSNNYHLETCQNRGELPD 240

Query: 347 TRLLRTHLRKKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFS--RSRCERLKFTGN 406
           TR LR+ L++KLE E I +SVD VNLLNN LD +LKRLIEPC+  +  RS    LK +  
Sbjct: 241 TRSLRSRLQQKLEMEGISVSVDCVNLLNNGLDAFLKRLIEPCVALAGLRSGDGNLKQSNG 300

Query: 407 QPITGSRITFQEQHRHRAQQLNNGSLLDFRVAMQLNPQVLGREWTMQLEKISLRASEE 463
           Q I          +   + +  + S+LDFR AM+LNPQVLG +W MQLEKISL + E+
Sbjct: 301 QFIPRLNGMLHRNYLQHSAKSCHASMLDFRAAMELNPQVLGEDWAMQLEKISLSSFED 354

BLAST of Csa4G000960.1 vs. TAIR10
Match: AT4G33890.1 (AT4G33890.1 unknown protein)

HSP 1 Score: 316.2 bits (809), Expect = 3.3e-86
Identity = 183/359 (50.97%), Postives = 248/359 (69.08%), Query Frame = 1

Query: 113 TSRIDTSELKAMIYRKLGHQRSDKYFDQLKKLLSLKTNKREFDKFCIQIIGREIIPLHNR 172
           +SR+DT E+KA+IYR++G+QR++ YF+QL +  +LK  K EFDK CI+ IGR+ I LHNR
Sbjct: 7   SSRLDTLEIKALIYREIGNQRAESYFNQLGRFFALKITKSEFDKLCIKTIGRQNIHLHNR 66

Query: 173 LIRAILQNACVAKTPPVLSSTRKVGGNLSVKVVNG--YQRSCLQSLHGD-AFLSSPRKGR 232
           LIR+I++NAC+AK+PP +    K GG+  V+  NG   + S +Q LHGD AF  S RK R
Sbjct: 67  LIRSIIKNACIAKSPPFI----KKGGSF-VRFGNGDSKKNSQIQPLHGDSAFSPSTRKCR 126

Query: 233 SPVSRDRKIRDRPSPLGPCGKPQNMAL--EEFASKAQEQQSATELHSLGSRPPVEMASVE 292
           S     RK+RDRPSPLGP GKP ++    EE  SKA   QSATEL SLGSRPPVE+ SVE
Sbjct: 127 S-----RKLRDRPSPLGPLGKPHSLTTTNEESMSKA---QSATELLSLGSRPPVEVVSVE 186

Query: 293 DGEEVEQVA-GSPGVQSRSPVTAPLGISMNFIGSG--KTLSNVPVGS-NYHVTTCQDVGE 352
           +GEEVEQ+A GSP VQSR P+TAPLG+SM+       K++SNV + S +++  TCQ+ GE
Sbjct: 187 EGEEVEQIAGGSPSVQSRCPLTAPLGVSMSLRNGATRKSVSNVSMCSRSFNRETCQNNGE 246

Query: 353 LPDTRLLRTHLRKKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCERLKFTG 412
           LPDTR LR+ L ++LE E + I++D V+LLN+ LDV+++RLIEPCL+ + +RC      G
Sbjct: 247 LPDTRTLRSRLERRLEMEGLKITMDSVSLLNSGLDVFMRRLIEPCLSLANTRC------G 306

Query: 413 NQPITGSRITFQEQHRHRAQQLNNGSLLDFRVAMQLNPQVLGREWTMQLEKISLRASEE 463
              +      + +Q R    +L+  S+ DFR  M+LN ++LG +W M +EKI  RAS++
Sbjct: 307 TDRVREMNYQYTQQSR----RLSYVSMSDFRAGMELNTEILGEDWPMHMEKICSRASDK 342

BLAST of Csa4G000960.1 vs. TAIR10
Match: AT2G14850.1 (AT2G14850.1 unknown protein)

HSP 1 Score: 271.2 bits (692), Expect = 1.2e-72
Identity = 162/352 (46.02%), Postives = 208/352 (59.09%), Query Frame = 1

Query: 114 SRIDTSELKAMIYRKLGHQRSDKYFDQLKKLLSLKTNKREFDKFCIQIIGREIIPLHNRL 173
           SR+++ E+KA+IY+K+GHQR+D YFDQL K L+ + +K EFDK C + +GRE I LHNRL
Sbjct: 8   SRLNSLEIKALIYQKIGHQRADTYFDQLGKFLTSRISKSEFDKLCSKTVGRENISLHNRL 67

Query: 174 IRAILQNACVAKTPPVLSSTRKVGGNLSVKVVNGYQRSCLQSLHGD-AFLSSPRKGRSPV 233
           +R+IL+NA VAK+PP                     R   +SL+GD  F  SPRK RS  
Sbjct: 68  VRSILKNASVAKSPP--------------------PRYPKKSLYGDPVFPPSPRKCRS-- 127

Query: 234 SRDRKIRDRPSPLGPCGKPQNMAL--EEFASKAQEQQSATELHSLGSRPPVEMASVEDGE 293
              RK RDRPSPLGP GKPQ++    +E  SKAQ             R P+E+ SVEDGE
Sbjct: 128 ---RKFRDRPSPLGPLGKPQSLTTTNDESMSKAQ-------------RLPMEVVSVEDGE 187

Query: 294 EVEQVAGSPGVQSRSPVTAPLGISMNFIGSGKTLSNVPVGSNYHVTTCQDVGELPDTRLL 353
           EVEQ+ GSP VQSRSP+TAPLG+S +     K+ +     +  +  TCQ  GELPD   L
Sbjct: 188 EVEQMTGSPSVQSRSPLTAPLGVSFHL----KSKARFSTYNGINRETCQSSGELPDMITL 247

Query: 354 RTHLRKKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCERLKFTGNQPITGS 413
           R  L KKLE E I +S+D  NLLN  L+ Y++RLIEPCL+ +                  
Sbjct: 248 RARLEKKLEMEGIKLSMDSANLLNRGLNAYMRRLIEPCLSLA------------------ 291

Query: 414 RITFQEQHRHRAQQLNNGSLLDFRVAMQLNPQVLGREWTMQLEKISLRASEE 463
                     + + ++N S+LDF  AM++NP+VLG EW +QLEKI  RASEE
Sbjct: 308 --------SQQKRAVSNVSMLDFHAAMEVNPRVLGEEWPIQLEKICCRASEE 291

BLAST of Csa4G000960.1 vs. TAIR10
Match: AT4G31440.1 (AT4G31440.1 unknown protein)

HSP 1 Score: 170.6 bits (431), Expect = 2.2e-42
Identity = 126/381 (33.07%), Postives = 188/381 (49.34%), Query Frame = 1

Query: 108 LPRKDTSRIDTSELKAMIYRKLGHQRSDKYFDQLKKLLSLKTNKREFDKFCIQIIGREII 167
           + R    RID +ELK  I +K+G +RS +YF  L + LS K  K EFDK C +++GRE +
Sbjct: 1   MQRLQDPRIDLAELKVHIVKKVGVERSTRYFYYLGRFLSQKLTKSEFDKSCFRLLGRENL 60

Query: 168 PLHNRLIRAILQNACVAKTPPVLSSTRKVGGNLSVKVVNGYQRSCL---QSLHGDAFLSS 227
            LHN+LIR+IL+NA +AK+PP +  +   G +L +   +G + S       +  D  LS+
Sbjct: 61  SLHNKLIRSILRNASLAKSPPSVHQSGHPGKSLVLGKEDGPEESRSLNPDHIRNDLALSN 120

Query: 228 P--RKGRSPVSRDRKIRDRPSPLGPCGKPQN-MALEEFASKAQEQQSA----TELHSLGS 287
               K R     DR IRD+P PLG  GK     A         E+ SA     E  ++  
Sbjct: 121 GVLAKVRPGTCDDRTIRDKPCPLGSNGKVLGPFAYSRPGRYPDERDSAFLCPAEQKAVSG 180

Query: 288 RPPVEMASVEDGEEVEQVAGSPGVQSRSPVTAPLGISMNFIGSGKTLSNVPVGSNYHVTT 347
           +  V      D E   ++  +P      PV APLGI       G     VPV ++    +
Sbjct: 181 KDQVAAPISRDDEAQVRILSTP------PVMAPLGIPFCSASVGGDRRTVPVSTSAAAIS 240

Query: 348 CQDVGELPDTRLLRTHLRKKLETEQI-DISVDGVNLLNNALDVYLKRLIEPCLNFSRSRC 407
           C D G L DT +LR  +     T+ +  +S +   +LNN LD+YLK+L++ C++ + +R 
Sbjct: 241 CYDSGGLSDTEMLRKRMENIAVTQGLGGVSAECSIVLNNMLDLYLKKLMKSCVDLAGARS 300

Query: 408 E---------RLKFTGNQPITGSR------ITFQEQHRHRAQQLNNGSLLDFRVAMQLNP 463
                       + + ++ + G R      I    Q     ++ ++ SLLDFRVAM+LNP
Sbjct: 301 MNGTPGKHSLEKQQSRDELVNGVRTNNSFHIQTSNQPSDITREQHSVSLLDFRVAMELNP 360

BLAST of Csa4G000960.1 vs. TAIR10
Match: AT2G24530.1 (AT2G24530.1 unknown protein)

HSP 1 Score: 169.9 bits (429), Expect = 3.8e-42
Identity = 129/405 (31.85%), Postives = 188/405 (46.42%), Query Frame = 1

Query: 108 LPRKDTSRIDTSELKAMIYRKLGHQRSDKYFDQLKKLLSLKTNKREFDKFCIQIIGREII 167
           + R    RI   ELK  I +K G +RS +YF  L + LS K  K EFDK C++++GRE +
Sbjct: 1   MQRSQDQRISLCELKEHIVKKTGVERSRRYFYYLGRFLSQKLTKSEFDKTCLRLLGRENL 60

Query: 168 PLHNRLIRAILQNACVAKTPPV-LSSTRKVGGNLSVKVVNGYQRSCL----QSLHGDAFL 227
            LHN+LIR+IL+NA VAK+PP    +      N      +G ++S       S H   + 
Sbjct: 61  SLHNQLIRSILRNATVAKSPPPDHEAGHSTKANAFQSRGDGLEQSGTLIPNHSQHEPVWS 120

Query: 228 S-----SPRKGRSPVSRDRKIRDRPSPLGPCGKPQNMALEEFA----------------- 287
           +     SPRK RS + ++RK RDRPSPLG  GK ++M  +                    
Sbjct: 121 NGVLPISPRKVRSGM-QNRKSRDRPSPLGSNGKVEHMLHQPVCREDNRGSVGMENGDYQR 180

Query: 288 -------SKAQEQQSATELHSLGSRPPVEMASVEDGEEVEQVAGSPGVQSRSPVTAPLGI 347
                   K  E     E   + ++  +   S+ D +  E+ A      S SP+ APLGI
Sbjct: 181 SGRYVADEKDGEFLRPVEKPRIPNKEKIAAVSMRDDQNQEEQARVN--LSMSPLIAPLGI 240

Query: 348 SMNFIGSGKTLSNVPVGSNYHVTTCQDVGELPDTRLLRTHLRKKLETEQID-ISVDGVNL 407
                  G +   +PV +N  + +C D G LPD  +LR  +      + ++ +S++    
Sbjct: 241 PFCSASVGGSPRTIPVSTNAELISCYDSGGLPDIEMLRKRMENIAVAQGLEGVSMECAKT 300

Query: 408 LNNALDVYLKRLIEPCLN---------------FSRSRCERLKFTGNQPITGSRITFQEQ 463
           LNN LDVYLK+LI  C +                 + + +     G  P    +I     
Sbjct: 301 LNNMLDVYLKKLINSCFDLVGARSTNGDPGKQRIGKQQSQNKIVNGVWPTNSLKIQTPNG 360

BLAST of Csa4G000960.1 vs. TAIR10
Match: AT5G67410.1 (AT5G67410.1 unknown protein)

HSP 1 Score: 152.1 bits (383), Expect = 8.2e-37
Identity = 113/345 (32.75%), Postives = 167/345 (48.41%), Query Frame = 1

Query: 115 RIDTSELKAMIYRKLGHQRSDKYFDQLKKLLSLKTNKREFDKFCIQIIGREIIPLHNRLI 174
           R D SELK+ I +++G  +++ Y + L K LSLK +K +FDK  I  + RE I LHN L+
Sbjct: 10  RTDISELKSQIEKRIGRAKTESYLNLLSKFLSLKISKSDFDKLIIVTVKRENISLHNALL 69

Query: 175 RAILQNACVAKTPPVLSSTRKVGGNLSVKVVNGYQRSCLQSLHGDAFLSSPRKGRSPVSR 234
           R IL+N C++KT P          N   K +NG  +S  + L       SPRKGR+    
Sbjct: 70  RGILKNICLSKTLPPFVKNGVESDNKKKKQLNGAFQSLCKELP-----RSPRKGRT---- 129

Query: 235 DRKIRDRPSPLGPCGKPQNMALEEFASKAQEQQSATELHSLGSRPPVEMASVEDGEEVEQ 294
               + R +  G   K +++  E  +S  ++Q                  S+E+ EEV+Q
Sbjct: 130 ----QRRLNKDGNISKGKSLVTEVVSSSGRQQW-----------------SMENVEEVDQ 189

Query: 295 VAGSPGVQSRSPVTAPLGISMNFIGSGKTLSNVPVGSNYHV-TTCQDVGELPDTRLLRTH 354
           +   P  +S+ P+ AP G+++  +          +   + + T C   GELPD+  L+  
Sbjct: 190 LI--PCWRSQ-PIEAPFGVNLRDV----------IKKQHRIDTCCYSSGELPDSVSLKKK 249

Query: 355 LRKKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCERLKFTGNQPITGSRIT 414
           L   LE E +++SV   N LN  LDV+LKRLI+PCL  + SR                  
Sbjct: 250 LEDDLE-EGLEVSVGFANSLNAGLDVFLKRLIKPCLELAASRSSNAS------------- 285

Query: 415 FQEQHRHRAQQLNNGSLLDFRVAMQLNPQVLGREWTMQLEKISLR 459
                       +  SL+DF+VAM LNP +LG +W  +LEKI+ R
Sbjct: 310 ------------SASSLVDFQVAMALNPSILGEDWPTKLEKIACR 285

BLAST of Csa4G000960.1 vs. NCBI nr
Match: gi|700197619|gb|KGN52777.1| (hypothetical protein Csa_4G000960 [Cucumis sativus])

HSP 1 Score: 919.5 bits (2375), Expect = 2.4e-264
Identity = 462/462 (100.00%), Postives = 462/462 (100.00%), Query Frame = 1

Query: 1   MLGIRAGKGSSVDVIRCFQKQRAFTQSQEHPVGRSLFLLLVSNLRITQKFQSKPIKQFTQ 60
           MLGIRAGKGSSVDVIRCFQKQRAFTQSQEHPVGRSLFLLLVSNLRITQKFQSKPIKQFTQ
Sbjct: 1   MLGIRAGKGSSVDVIRCFQKQRAFTQSQEHPVGRSLFLLLVSNLRITQKFQSKPIKQFTQ 60

Query: 61  NHLNFTILHSQDGAISFQFFFYLFCWWRIPEGFVSNSAIAGIAALKMLPRKDTSRIDTSE 120
           NHLNFTILHSQDGAISFQFFFYLFCWWRIPEGFVSNSAIAGIAALKMLPRKDTSRIDTSE
Sbjct: 61  NHLNFTILHSQDGAISFQFFFYLFCWWRIPEGFVSNSAIAGIAALKMLPRKDTSRIDTSE 120

Query: 121 LKAMIYRKLGHQRSDKYFDQLKKLLSLKTNKREFDKFCIQIIGREIIPLHNRLIRAILQN 180
           LKAMIYRKLGHQRSDKYFDQLKKLLSLKTNKREFDKFCIQIIGREIIPLHNRLIRAILQN
Sbjct: 121 LKAMIYRKLGHQRSDKYFDQLKKLLSLKTNKREFDKFCIQIIGREIIPLHNRLIRAILQN 180

Query: 181 ACVAKTPPVLSSTRKVGGNLSVKVVNGYQRSCLQSLHGDAFLSSPRKGRSPVSRDRKIRD 240
           ACVAKTPPVLSSTRKVGGNLSVKVVNGYQRSCLQSLHGDAFLSSPRKGRSPVSRDRKIRD
Sbjct: 181 ACVAKTPPVLSSTRKVGGNLSVKVVNGYQRSCLQSLHGDAFLSSPRKGRSPVSRDRKIRD 240

Query: 241 RPSPLGPCGKPQNMALEEFASKAQEQQSATELHSLGSRPPVEMASVEDGEEVEQVAGSPG 300
           RPSPLGPCGKPQNMALEEFASKAQEQQSATELHSLGSRPPVEMASVEDGEEVEQVAGSPG
Sbjct: 241 RPSPLGPCGKPQNMALEEFASKAQEQQSATELHSLGSRPPVEMASVEDGEEVEQVAGSPG 300

Query: 301 VQSRSPVTAPLGISMNFIGSGKTLSNVPVGSNYHVTTCQDVGELPDTRLLRTHLRKKLET 360
           VQSRSPVTAPLGISMNFIGSGKTLSNVPVGSNYHVTTCQDVGELPDTRLLRTHLRKKLET
Sbjct: 301 VQSRSPVTAPLGISMNFIGSGKTLSNVPVGSNYHVTTCQDVGELPDTRLLRTHLRKKLET 360

Query: 361 EQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCERLKFTGNQPITGSRITFQEQHRH 420
           EQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCERLKFTGNQPITGSRITFQEQHRH
Sbjct: 361 EQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCERLKFTGNQPITGSRITFQEQHRH 420

Query: 421 RAQQLNNGSLLDFRVAMQLNPQVLGREWTMQLEKISLRASEE 463
           RAQQLNNGSLLDFRVAMQLNPQVLGREWTMQLEKISLRASEE
Sbjct: 421 RAQQLNNGSLLDFRVAMQLNPQVLGREWTMQLEKISLRASEE 462

BLAST of Csa4G000960.1 vs. NCBI nr
Match: gi|449469122|ref|XP_004152270.1| (PREDICTED: uncharacterized protein LOC101211126 [Cucumis sativus])

HSP 1 Score: 707.2 bits (1824), Expect = 1.9e-200
Identity = 356/356 (100.00%), Postives = 356/356 (100.00%), Query Frame = 1

Query: 107 MLPRKDTSRIDTSELKAMIYRKLGHQRSDKYFDQLKKLLSLKTNKREFDKFCIQIIGREI 166
           MLPRKDTSRIDTSELKAMIYRKLGHQRSDKYFDQLKKLLSLKTNKREFDKFCIQIIGREI
Sbjct: 1   MLPRKDTSRIDTSELKAMIYRKLGHQRSDKYFDQLKKLLSLKTNKREFDKFCIQIIGREI 60

Query: 167 IPLHNRLIRAILQNACVAKTPPVLSSTRKVGGNLSVKVVNGYQRSCLQSLHGDAFLSSPR 226
           IPLHNRLIRAILQNACVAKTPPVLSSTRKVGGNLSVKVVNGYQRSCLQSLHGDAFLSSPR
Sbjct: 61  IPLHNRLIRAILQNACVAKTPPVLSSTRKVGGNLSVKVVNGYQRSCLQSLHGDAFLSSPR 120

Query: 227 KGRSPVSRDRKIRDRPSPLGPCGKPQNMALEEFASKAQEQQSATELHSLGSRPPVEMASV 286
           KGRSPVSRDRKIRDRPSPLGPCGKPQNMALEEFASKAQEQQSATELHSLGSRPPVEMASV
Sbjct: 121 KGRSPVSRDRKIRDRPSPLGPCGKPQNMALEEFASKAQEQQSATELHSLGSRPPVEMASV 180

Query: 287 EDGEEVEQVAGSPGVQSRSPVTAPLGISMNFIGSGKTLSNVPVGSNYHVTTCQDVGELPD 346
           EDGEEVEQVAGSPGVQSRSPVTAPLGISMNFIGSGKTLSNVPVGSNYHVTTCQDVGELPD
Sbjct: 181 EDGEEVEQVAGSPGVQSRSPVTAPLGISMNFIGSGKTLSNVPVGSNYHVTTCQDVGELPD 240

Query: 347 TRLLRTHLRKKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCERLKFTGNQP 406
           TRLLRTHLRKKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCERLKFTGNQP
Sbjct: 241 TRLLRTHLRKKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCERLKFTGNQP 300

Query: 407 ITGSRITFQEQHRHRAQQLNNGSLLDFRVAMQLNPQVLGREWTMQLEKISLRASEE 463
           ITGSRITFQEQHRHRAQQLNNGSLLDFRVAMQLNPQVLGREWTMQLEKISLRASEE
Sbjct: 301 ITGSRITFQEQHRHRAQQLNNGSLLDFRVAMQLNPQVLGREWTMQLEKISLRASEE 356

BLAST of Csa4G000960.1 vs. NCBI nr
Match: gi|659108776|ref|XP_008454383.1| (PREDICTED: uncharacterized protein LOC103494799 [Cucumis melo])

HSP 1 Score: 691.0 bits (1782), Expect = 1.4e-195
Identity = 350/357 (98.04%), Postives = 352/357 (98.60%), Query Frame = 1

Query: 107 MLPRKDTSRIDTSELKAMIYRKLGHQRSDKYFDQLKKLLSLKTNKREFDKFCIQIIGREI 166
           M PRKDTSRIDTSELKAMIYRKLGHQRSDKYFDQLKKLLSLKTNKREFDKFCIQIIGREI
Sbjct: 1   MFPRKDTSRIDTSELKAMIYRKLGHQRSDKYFDQLKKLLSLKTNKREFDKFCIQIIGREI 60

Query: 167 IPLHNRLIRAILQNACVAKTPPVLSSTRKVGGNLSVKVVNGYQRSCLQSLHGDAFLSSPR 226
           IPLHNRLIRAILQNACVAKTPPVLSSTRKVGGNLSVKVVNGYQRSCLQSLHGDAFLSSPR
Sbjct: 61  IPLHNRLIRAILQNACVAKTPPVLSSTRKVGGNLSVKVVNGYQRSCLQSLHGDAFLSSPR 120

Query: 227 KGRSPVSRDRKIRDRPSPLGPCGKPQNMALEEFASKAQEQQSATELHSLGSRPPVEMASV 286
           KGRSPVSRDRKIRDRPSPLGPCGKPQNMALEEFASKAQEQQSATELHSLGSRPPVEMASV
Sbjct: 121 KGRSPVSRDRKIRDRPSPLGPCGKPQNMALEEFASKAQEQQSATELHSLGSRPPVEMASV 180

Query: 287 EDGEEVEQVAGSPGVQSRSPVTAPLGISMNFIGSGKTLSNVPVGS-NYHVTTCQDVGELP 346
           EDGEEVEQVAGSPGVQSRSPVTAPLGISMNFIGS KTLSNVPVG  NYHVTTCQD GELP
Sbjct: 181 EDGEEVEQVAGSPGVQSRSPVTAPLGISMNFIGSSKTLSNVPVGGRNYHVTTCQDGGELP 240

Query: 347 DTRLLRTHLRKKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCERLKFTGNQ 406
           DTRLLRTHLRKKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCERLKFTGNQ
Sbjct: 241 DTRLLRTHLRKKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCERLKFTGNQ 300

Query: 407 PITGSRITFQEQHRHRAQQLNNGSLLDFRVAMQLNPQVLGREWTMQLEKISLRASEE 463
           PITGSRITFQEQ+RHRAQQ+NNGSLLDFRVAMQLNPQVLGREWTMQLEKISLRASEE
Sbjct: 301 PITGSRITFQEQNRHRAQQINNGSLLDFRVAMQLNPQVLGREWTMQLEKISLRASEE 357

BLAST of Csa4G000960.1 vs. NCBI nr
Match: gi|1009171750|ref|XP_015866909.1| (PREDICTED: uncharacterized protein LOC107404471 [Ziziphus jujuba])

HSP 1 Score: 453.0 bits (1164), Expect = 6.4e-124
Identity = 232/358 (64.80%), Postives = 279/358 (77.93%), Query Frame = 1

Query: 107 MLPRKDTSRIDTSELKAMIYRKLGHQRSDKYFDQLKKLLSLKTNKREFDKFCIQIIGREI 166
           MLP +  SRIDT ELKA+I +K+GHQR++KYFDQL++L S K +K EF+KFC + +GRE 
Sbjct: 2   MLPNQSYSRIDTLELKALIIQKIGHQRAEKYFDQLQRLFSFKISKCEFNKFCCRTLGREN 61

Query: 167 IPLHNRLIRAILQNACVAKTPPVLSSTRKVGGNLSVKVVNGYQRSCLQSLHGDAFLSSPR 226
           IPLHN+LIR+I++NACVAK PPV +S +K+G   +VKV NGYQR+CLQSL+GD F  SPR
Sbjct: 62  IPLHNQLIRSIVKNACVAKVPPVKAS-KKLGNTPNVKVTNGYQRNCLQSLYGDVFPPSPR 121

Query: 227 KGRSPVSRDRKIRDRPSPLGPCGKPQNMALEEFASKAQEQQSATELHSLGSRPPVEMASV 286
           KGRSPV+RDRK RDRPSPLGP GKPQ++  EE  SKAQEQQSATEL SLGSRPPVE+ASV
Sbjct: 122 KGRSPVNRDRKFRDRPSPLGPLGKPQSVTCEELVSKAQEQQSATELLSLGSRPPVEVASV 181

Query: 287 EDGEEVEQVAGSPGVQSRSPVTAPLGISMNFIGSGKTLSNVPVGSNYHVTTCQDVGELPD 346
           EDGEEVEQ+AGSPGVQSRSPVTAPLG+SMN  G+ K LSNV +  NYH  TCQ+ GELPD
Sbjct: 182 EDGEEVEQIAGSPGVQSRSPVTAPLGVSMNLGGARKALSNVSISGNYHPETCQNCGELPD 241

Query: 347 TRLLRTHLRKKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRC--ERLKFTGN 406
           TR LR+ L++KLE E  +ISVD VNLLNN LDVYLKRL+EPC+  + SRC  E L     
Sbjct: 242 TRSLRSRLQRKLEIEGFNISVDCVNLLNNGLDVYLKRLLEPCMRLAASRCGNEHLIELNA 301

Query: 407 QPITGSRITFQEQHRHRAQQLNNGSLLDFRVAMQLNPQVLGREWTMQLEKISLRASEE 463
           Q   G       ++  RA++    SLLDF  AM+LNP +LG +W +QLEKI LR+SEE
Sbjct: 302 QCNPGLNGMLPGRYMERAKKSTYASLLDFHAAMELNPCILGEDWAIQLEKIMLRSSEE 358

BLAST of Csa4G000960.1 vs. NCBI nr
Match: gi|1009171746|ref|XP_015866907.1| (PREDICTED: uncharacterized protein LOC107404470 [Ziziphus jujuba])

HSP 1 Score: 440.3 bits (1131), Expect = 4.3e-120
Identity = 230/363 (63.36%), Postives = 276/363 (76.03%), Query Frame = 1

Query: 107 MLPRKDTSRIDTSELKAMIYRKLGHQRSDKYFDQLKKLLSLKTNKREFDKFCIQIIGREI 166
           MLP +  SRIDT ELKA+I +K+GHQR++KYFDQL++L SLK +K EF+KFC + +GRE 
Sbjct: 1   MLPNQSYSRIDTLELKALIIQKIGHQRAEKYFDQLQRLFSLKISKCEFNKFCFRTLGREN 60

Query: 167 IPLHNRLIRAILQNACVAKTPPVLSSTRKVGGNLSVKVVNGYQRSCLQSLHGDAFLSSPR 226
           IPLHN+LIR+I++NACVAK PPV +S +K G  L+VKV NGYQR+ LQSL+GD F  SPR
Sbjct: 61  IPLHNQLIRSIVKNACVAKVPPVKAS-KKFGNTLNVKVTNGYQRNRLQSLYGDVFPPSPR 120

Query: 227 KGRSPVSRDRKIRDRPSPLGPCGKPQNMALEEF-----ASKAQEQQSATELHSLGSRPPV 286
           KGRSPV+RDRK RDRPSPLGP GKPQ++   +F      S AQEQQSATEL SLGSRPPV
Sbjct: 121 KGRSPVNRDRKFRDRPSPLGPLGKPQSLTCGDFPSGELVSMAQEQQSATELLSLGSRPPV 180

Query: 287 EMASVEDGEEVEQVAGSPGVQSRSPVTAPLGISMNFIGSGKTLSNVPVGSNYHVTTCQDV 346
           E+ASVEDGEEVEQVAGSPGVQSRSPVTAPLG+SMN  G+ K LSNV +  NYH  TCQ+ 
Sbjct: 181 EVASVEDGEEVEQVAGSPGVQSRSPVTAPLGVSMNLGGARKALSNVSISGNYHPETCQNC 240

Query: 347 GELPDTRLLRTHLRKKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRC--ERL 406
           GELPDTR LR+ L++KLE E  +IS+D VNLLNN LD YLKRL+EPC+  + SRC  E L
Sbjct: 241 GELPDTRSLRSRLQQKLEMEGFNISIDCVNLLNNGLDAYLKRLLEPCMRLAVSRCGSEHL 300

Query: 407 KFTGNQPITGSRITFQEQHRHRAQQLNNGSLLDFRVAMQLNPQVLGREWTMQLEKISLRA 463
                Q   G       ++  RA+     SLLDF  AM+LNP +LG +W +QLEKI LR+
Sbjct: 301 NQLNAQFNPGLNGMLPGRYMERAKSSTYASLLDFHAAMELNPCILGEDWAIQLEKIMLRS 360

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
A0A0A0KWF9_CUCSA1.7e-264100.00Uncharacterized protein OS=Cucumis sativus GN=Csa_4G000960 PE=4 SV=1[more]
B9HMX5_POPTR1.5e-11663.61Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0009s09370g PE=4 SV=2[more]
A0A067LJK3_JATCU3.4e-11660.61Uncharacterized protein OS=Jatropha curcas GN=JCGZ_16307 PE=4 SV=1[more]
W9RH76_9ROSA4.4e-11662.39Uncharacterized protein OS=Morus notabilis GN=L484_011520 PE=4 SV=1[more]
A0A061E832_THECC2.2e-11563.13Uncharacterized protein OS=Theobroma cacao GN=TCM_007200 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G33890.13.3e-8650.97 unknown protein[more]
AT2G14850.11.2e-7246.02 unknown protein[more]
AT4G31440.12.2e-4233.07 unknown protein[more]
AT2G24530.13.8e-4231.85 unknown protein[more]
AT5G67410.18.2e-3732.75 unknown protein[more]
Match NameE-valueIdentityDescription
gi|700197619|gb|KGN52777.1|2.4e-264100.00hypothetical protein Csa_4G000960 [Cucumis sativus][more]
gi|449469122|ref|XP_004152270.1|1.9e-200100.00PREDICTED: uncharacterized protein LOC101211126 [Cucumis sativus][more]
gi|659108776|ref|XP_008454383.1|1.4e-19598.04PREDICTED: uncharacterized protein LOC103494799 [Cucumis melo][more]
gi|1009171750|ref|XP_015866909.1|6.4e-12464.80PREDICTED: uncharacterized protein LOC107404471 [Ziziphus jujuba][more]
gi|1009171746|ref|XP_015866907.1|4.3e-12063.36PREDICTED: uncharacterized protein LOC107404470 [Ziziphus jujuba][more]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
IPR024738Hfi1/Tada1
Vocabulary: Cellular Component
TermDefinition
GO:0070461SAGA-type complex
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0070461 SAGA-type complex
molecular_function GO:0003674 molecular_function

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Csa4G000960Csa4G000960gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Csa4G000960.1Csa4G000960.1-proteinpolypeptide


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Csa4G000960.1.cds1Csa4G000960.1.cds1CDS
Csa4G000960.1.cds2Csa4G000960.1.cds2CDS
Csa4G000960.1.cds3Csa4G000960.1.cds3CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Csa4G000960.1.utr3p1Csa4G000960.1.utr3p1three_prime_UTR


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR024738Transcriptional coactivator Hfi1/Transcriptional adapter 1PFAMPF12767SAGA-Tad1coord: 113..390
score: 6.3
NoneNo IPR availablePANTHERPTHR21277FAMILY NOT NAMEDcoord: 270..462
score: 2.8E-155coord: 94..236
score: 2.8E
NoneNo IPR availablePANTHERPTHR21277:SF12SUBFAMILY NOT NAMEDcoord: 94..236
score: 2.8E-155coord: 270..462
score: 2.8E