Csa6G042300 (gene) Cucumber (Chinese Long) v2

Name: Csa6G042300
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: Aleurone layer morphogenesis protein
Location: Chr6 : 3208999 .. 3215165 (+)
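The location field can be turned into a feature length with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which this page does not state explicitly; `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Parse a location such as 'Chr6 : 3208999 .. 3215165 (+)'.

    Assumes 1-based, inclusive coordinates (an assumption, not stated on the page).
    """
    m = re.search(r"(\w+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    start, end = int(start), int(end)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        # Inclusive coordinates: both endpoints count towards the length.
        "length": end - start + 1,
    }

loc = parse_location("Chr6 : 3208999 .. 3215165 (+)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 6167 bp on the plus strand of Chr6.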
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS, three_prime_UTR
ATGATCCCATGTATAGTAAATGAATCAAATGCCTCAGAAAGTGGTATCAAAGTCGAGGTAAGAAATTGAGGTCTTCTCACTAAGTTTAGGGACCTTAAGATACAGGTTGTTTTAATTTTGTATTTACTACTGGGACTCCTGATGCTTATGGGAACAGTGACTCATATTTTATTCTCCCTATAAATTTTGGCATTTAGTCATGTCTATACTTCAAAAATTATTATGGGTGCATTCCTTGAAGGATCTGGGCATGTTAAGAGCAGATTTTAAATAATATTTTTTTGGGTACTTTGTATGCATACTTCAATCACTCAAAACGAGATTCAATGTTAGGATCTTTGGGCCAAGAGACAGCTTGGGGATGGATTGACATGGTTACTCTACTTTAGAGTCCACAAAAGATACTAATTGGTTTTTGGTTTCATCTCTTTGATGGATTAAATTGAAGAACTTGCACATTTAAATCCCTGACTTTTTTGGTAATGTAAAGAACTATCTCGCGTGTTGCAAGGTAAATGGTTTTAAAAAATGAAGTTTTTGAAGAAATTGTGAAATAAGAAAATGATAAGTTACTTCTCAACAAACATAGTGACACCTGGTGTTCAAATTTTGGCATGGCCATATCTGTACTTCAAATCATTTATCTGTGTGATGAGCTTGTAAATCAAACGACAGTTTCATTCTGCATGTGAAAAAAAGATGGTTATTATTGTTGTAAAAAAATAAGCTTGTGCTAACAAGTCAGACTTTTCAGATTGTATTATCTGTTAGCTTTTATGAATTAGTTTTCTCTATATTTCTTTTATTTATAATTCAACTAATAGCATAATTTACTTCCCAGGATGTCGTAAATAAATAATTTTGTGCTAATAACTCAGAATTTTCAGATTTATAATCTATTAGCTTTTAATGAATTAGATTTTACTATATTTCTTTTATTTATAATTTAGCTGATACCATAGTTTACTTTCCAGGATGGGATATTAGCTACAAACCCGTGTATTGCTGAATGCAGTGGTGAAAAGCTTGCTTCTGGAAATCTCTCTGACAATATTTCGTTTGATCAAAATAGGAACGGTGATCATGCTCTCATCACCTGTCAATCGAACCCAGACTCAGAGCATCTTTCCAAGTTACAAGCAATTATTGTTTCAAAAGAAAGAGCACTGTCACAAGCTGCAATTAGAGCTCTAATCAGAAAGAGGGATAAGCTGGTACGGACATACATCTTATATTGTGGTTATATTTCAGGATTATCCTTGTCTCATGATTTTAATTTGAACCAGAAACTTTCTTTTAAAGTTTAGCATGTTGACGCACATATTATATGATAAAGAGCTTAGAGCTTACACCATAATAGTAAAATAGAGATGGGAATAGTGATTTAAATCTCAACTTTCGTAGACGAGATTGTAGGAAAATAATTAATCCTCTGTTATTGTTGTCTGTATCACTATACAATGACATTCAGCTGTCCATTCTTCTCTCTCTATCTGAACAGGGGTTGGTCATAGATTGAGCTCTTTTCAATCATATTACATAATCATTTTTGACAGTGTAATCCATTCGTTCTGTCTGCAGTCTCATCAACAGCGCCTCATTGAAGATGAGATTGCTCAGTGTGATAAAAACATGCAGACAATATTAAGGGGTAAGCTTTATTGTCTCTTTTCCTTTTCTGTTCCTTCTGTTTTTGTTTTGTTTTTACACTTTTTGTCTATTTGAAATATTTCATGTACAAAACCTTCTAGAAATTTAGATAATTTCCCTTTTCAAATACTTGATGAACCCCTCCCCCAGCTATCTATGATTTTTGGTCTTGTAAGGCCACTGCACCCAAGCTAAAGTATTTTTCAGTATTGGCTAGAGACTAAGATTTGAATTATCGTCATGATCACTATTTTAAGTGTTGGCATATTTTGAGCTTCAAGAGGGAGTAACAGTAGTATTTTTGGATTTACTTTTTGTTATTGCTTTATTGAATTGTTTGGTGTATATTT
TGGAGCTAAATTTTCAATATTGTATGTACGTACGTTGATGGATAGTGTTAAATTAGAACAAGTAATATTAGAAACTCAGATTAAAATTAAAACTTTTCTAAAACTTGTGGAGTGCTTCTGCTATTCTAAGTACTAACTCATCATTGGGTGTGTCTGCGTGGCTAACAGCAATGGAGATGCTTTTTAATCTATAGGTATCATAATTAGCCAAAGAAAATGTCTGAACTGCATATATATATTTTGGTTGCTGAAGGTGATGAAGATGATTTGGTTTTAAAGCTGGATTCTGTGATTGAATGTTGCAATGATATATGCCCAAGAAGCACTGCCGAAGACAAATCTTATCAATACTTTGAAGAAAACTGCTCATCTCAATATGTCACAAGGAAGCGATTGTCAGAAGCAATTCTCTGCATACAGAATCCATGTCTGGTTGGTTAATAATTGAAGAAAATTAATTTGACATTTTTTAACATTCCAGGGTATTTGCATGTTTGTATTTGTGTTATGGTTATGTCCTAATGCTGCCCACGGCATATTTATATATTTTTATAAGAGATGCAAGTAAAACATTAAAAGAGAGGCTACAAGATCAAAGGGGCAGGATAGGAACTCTTCAAAAAATTGTACTGCACAAAAACACATAAGACATCGGTATAACAGATTGATGTTAAATAGTTTTGTTATGGCACTTAGTATAGAAATTGCAAATGCTTGGACTTAGAAAAAAATTACACCGTCCATAAGGTGGCCCACCATCTTCTAGCCTCTTACTATTTTATCATTGAGATGGTATAGCCAATATCACCAATTTGCTACAATCAACTAGTAAACCCTTAGGCTTATTATTTGAAATCTTACTATAAAGTACTGTGACAAAAGGAAGATCAAAAAGTGAGAATTAACTACTAGGGTTGACCTAAGGGTTAAGTTGACAGAATTAACTGAGTTATGGTTTCAAACATCAATATGAAAGTTGGTCTGGATATCTAACGATATTGAGAAAAACAAAAGGAAAGAAGAGAAATAGGAAGCCGATGCCTCCTGTTGCAATTCAGAAACCCGCCAAAGTTTTAATTTGAAAGGATGGTGAGAGTTAGTGTTTTGAATGATGCTCTTAAGAGCATGTACAGAGAAGAGGGGGAAACAGCAGGTCATGATCAGGCCTTCTCCGAAAGTGATTATCACGTTTCTTTTGGTTATGCAGAAGCTATTGGTGAATTTGAGTTTGTAGATGATCAAAGATCTGGAAAAATTGTTGTTGAACTAAACTGTAGGCTGAACAAATGTGGGGTTATTAGTCTTTAGTTTGATGTGGGTGTTAAGGAAATTAAAGGTTGGATTGCTAGGTTGCTCCCTTCCTGACAGTTTGGATTCATTGTGCTGTGCTGACATCTGCTGGAATTATGGATCATGAAGAGGCTAGAAGGAAGAATGTTGGAGGCAAAGTCCTTGGTTTCTTCTATTGAACATGGTTTTGAGCATCCATTATGAAGTTTTGACCTAGGTTTCAAATAGTATTACATTTTCTCTCGGTAAAGAAAGATCATTATCCTGTATTGAAAGAAAAAAAGAAGAGGGGAAATAAAGGAAATTGTGACAAGATAATTAACAAAATAATATTTTTGAGTAATCATAGGGAGCCTCACTGAAATAATTTTTTTTTTTAATATTAACATTTATCAGCTTGATAAAGGTGTGAGAGTTGATCTAGCTGTTGGCTAACATTTTGGTAGTAGTGAAGATTGGATATTCATCAAGCTTGGTGCTTTGTGATGCCAAACAGTCAACCAAATATAAACATGGACAGTATGGTAGCATGTTTCAAAACTAGAGATGACATAAGAATGCATTATTAAAAGTTGGAAGTATGACTTGTTATAGACAATTGGAAAGTTCTCACTGCTTCTGGTGAGGGTGCAGGGGGGAGTGAATTTTCAAGGTCTTTTATTTTTTACTTTTTCCCCTTTACTTTATATATTTGACTTATAGTTTATGCA
AGGAAACTGCTGCCTCAATATTCTTTATGCTACTAGTTGGGGCCTAATTTTCGAGGTCCTAGTTTTGCCCACTTGTTCTTTTATGTTTTAAACTTCTTTATGGTTCCTCTTGTAACTTTTTTGAACCAGAAACAAGCCTCTTTATTTATAATAATAAATGAGACTAAAGCTCAAAGTACAAGAGAATTATACTAAGAATAAAAAGAACCAAGGATGAATACAATCTAAGACAAACAAAAACTATACTTAGGCAACCTAAAAACGAATTCGAACAGAAAATTAACGAGCTAATCAAAGCCACTCAAAGACAATACGTTTGTAGAAAAGCTAAATACAAAGCCAAACTTAAAATCTCTCAAAAGGAGCCCATTTGAATCTAAAATCTTGCATGCCCAAATCTTCAAGATGAAAGGAGAAAGAGAAACGCCAGGAAGATACAACCAGCTGCCTCAAAACTATCTTACCAAATGCAATCCAATACCCACAGGAACTCAAAGAACTCTTCCAAACCACAGAATTTGTTCCTAAAAGAAATCCCTAGTAGTTACTCCCATTTGGCAAGCCAGACAACGAAAACAAAATACAAAAAACCAATATTAAGCAAGCTACCACAATCAGTGAGGCTTTACTTGAGAAATCATGGAAAGGCAGAAGTTGGTCCAGCAACTGGGGGGACTTTATTCAACAGCTGAGATTCAAACTAGAGTCCACAATCGGCAACTATTGACTTTAGATGATCCGGTAATTGAAACGCTAACTTCCGAAATTGGTTTAACAGATTTCTATCCTCTAGATTCTTTAATCAAAACTGCAATATCCATGTGGATCCCGTGGATTTTCTTTTTGTTAAAGAGAGAATTTCCTTTTCAGGACAGTGTACAGTTTATAAATGTCATGTTGTAATGGCCTTGAAGTAACAGATCTTTATATTTTTACCGGGATGCAACAAATTGATCAATTTTTTTTTTAAACTTGTATTGCCACATAAGACATGCAGTTGAGGATGCTGCTAAGATGGTATTTGTTATGTATGGCAAACTTGGCTAATAAAAGATGTATTGTATTGATGTCTCTAATGTGTTAATGGGAACTGTATCTTGTATGCCTAATTGATTGAGTCCGGTTATCAATTTGCACTTGCATAGCACAAGAAACATTCAACATAAAAAATTCTGAAAAGCATGTAGCATAAACATTTCAATACGGTTTAAAATGCATGGAATGTACATTTGATGTTAAATATTCTATACAGGCATTTAAGAGGGAAAAAATGGTAGTCCGATTTCATTATTTCCACATTTGTAGTTGTAGATATAATGATGCTCTGTTGACTTGGCAGGAACTGGATGGTATATGTCATAAGAATAATTGGATATTGCCAGTTTACGGAGTTTCGTCATTAGATGGTAAAGTCTTCGTGAATTCAGTCAGTTATTACTTATTACTATTATTCTTTTAATTTTTGAGAGGTACGGTTGTTACTGATGGACAGACATTAAATTCTTCTTTTCTAGTGCACTGTGCTTCTCCAAAGTAGATCACGTAGAATGAATGATAAAATGTACTTCTCCTTTCACCTGGAGTACAATGTGCATGACTTTATGGCATTATTTTTTTTTCTGGATTGGACATTATTTTGTTTGTGTTTTTGGTTTGTTTGCTTGTTTGTCTTCTAAAGTATATTGAAAATCGGATTCTACCTTCTTGCATCTCGAAAGATTCTACAAATACAAATTATTATACTCAAGTAGTACAGTGCATTGAAAATTAGATGGTGCCTCATTCTCTCCCATCTTTTCTGATTCTGCCTTTAGAGGCTTTCAGGGTCTTTCTACATATGGATGGCCGATCATGCATAGCTCTTGTGTTTCAATTTGAAATCATTTGAGAGCATCTATTATGGAATAGTGGTGCTAAAGTGTATTATGTGTTTTGAAGGTGGGTTCCAAGCTAATGTGTTTGTAAAAGGGATGGATTTTGAGTATTCAAGCTGTAGCGAA
CTGTGTTCCGACCCTCGTGATGCGAGAGAATCAGCTGCGATGAAGATGTTGGGTCAACTTTGGAGGATGGCCAATCTGGCCAAGTAGTTTTAGAAGCCTTTGATGGAGGTTATTGTAGTAATGGGTCTGTTTTTGGGTTCTCCATGGAGAGGAGAGGGGGAAAGAGA

mRNA sequence

ATGATCCCATGTATAGTAAATGAATCAAATGCCTCAGAAAGTGGTATCAAAGTCGAGGATGGGATATTAGCTACAAACCCGTGTATTGCTGAATGCAGTGGTGAAAAGCTTGCTTCTGGAAATCTCTCTGACAATATTTCGTTTGATCAAAATAGGAACGGTGATCATGCTCTCATCACCTGTCAATCGAACCCAGACTCAGAGCATCTTTCCAAGTTACAAGCAATTATTGTTTCAAAAGAAAGAGCACTGTCACAAGCTGCAATTAGAGCTCTAATCAGAAAGAGGGATAAGCTGTCTCATCAACAGCGCCTCATTGAAGATGAGATTGCTCAGTGTGATAAAAACATGCAGACAATATTAAGGGGTGATGAAGATGATTTGGTTTTAAAGCTGGATTCTGTGATTGAATGTTGCAATGATATATGCCCAAGAAGCACTGCCGAAGACAAATCTTATCAATACTTTGAAGAAAACTGCTCATCTCAATATGTCACAAGGAAGCGATTGTCAGAAGCAATTCTCTGCATACAGAATCCATGTCTGGAACTGGATGGTATATGTCATAAGAATAATTGGATATTGCCAGTTTACGGAGTTTCGTCATTAGATGGTGGGTTCCAAGCTAATGTGTTTGTAAAAGGGATGGATTTTGAGTATTCAAGCTGTAGCGAACTGTGTTCCGACCCTCGTGATGCGAGAGAATCAGCTGCGATGAAGATGTTGGGTCAACTTTGGAGGATGGCCAATCTGGCCAAGTAG

Coding sequence (CDS)

ATGATCCCATGTATAGTAAATGAATCAAATGCCTCAGAAAGTGGTATCAAAGTCGAGGATGGGATATTAGCTACAAACCCGTGTATTGCTGAATGCAGTGGTGAAAAGCTTGCTTCTGGAAATCTCTCTGACAATATTTCGTTTGATCAAAATAGGAACGGTGATCATGCTCTCATCACCTGTCAATCGAACCCAGACTCAGAGCATCTTTCCAAGTTACAAGCAATTATTGTTTCAAAAGAAAGAGCACTGTCACAAGCTGCAATTAGAGCTCTAATCAGAAAGAGGGATAAGCTGTCTCATCAACAGCGCCTCATTGAAGATGAGATTGCTCAGTGTGATAAAAACATGCAGACAATATTAAGGGGTGATGAAGATGATTTGGTTTTAAAGCTGGATTCTGTGATTGAATGTTGCAATGATATATGCCCAAGAAGCACTGCCGAAGACAAATCTTATCAATACTTTGAAGAAAACTGCTCATCTCAATATGTCACAAGGAAGCGATTGTCAGAAGCAATTCTCTGCATACAGAATCCATGTCTGGAACTGGATGGTATATGTCATAAGAATAATTGGATATTGCCAGTTTACGGAGTTTCGTCATTAGATGGTGGGTTCCAAGCTAATGTGTTTGTAAAAGGGATGGATTTTGAGTATTCAAGCTGTAGCGAACTGTGTTCCGACCCTCGTGATGCGAGAGAATCAGCTGCGATGAAGATGTTGGGTCAACTTTGGAGGATGGCCAATCTGGCCAAGTAG

Protein sequence

MIPCIVNESNASESGIKVEDGILATNPCIAECSGEKLASGNLSDNISFDQNRNGDHALITCQSNPDSEHLSKLQAIIVSKERALSQAAIRALIRKRDKLSHQQRLIEDEIAQCDKNMQTILRGDEDDLVLKLDSVIECCNDICPRSTAEDKSYQYFEENCSSQYVTRKRLSEAILCIQNPCLELDGICHKNNWILPVYGVSSLDGGFQANVFVKGMDFEYSSCSELCSDPRDARESAAMKMLGQLWRMANLAK*
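The relationship between the coding sequence and the protein sequence can be checked by translating the CDS with the standard genetic code. The following is a minimal sketch using only the Python standard library; `translate` is a hypothetical helper, and only the first nine and last four codons of the CDS above are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

first_codons = "ATGATCCCATGTATAGTAAATGAATCA"  # start of the CDS above
last_codons = "CTGGCCAAGTAG"                   # end of the CDS above

print(translate(first_codons))  # MIPCIVNES
print(translate(last_codons))   # LAK*
```

Both fragments agree with the start (MIPCIVNES) and end (LAK*) of the protein sequence listed above.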
BLAST of Csa6G042300 vs. TrEMBL
Match: A0A0A0KE35_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G042300 PE=4 SV=1)

HSP 1 Score: 513.8 bits (1322), Expect = 1.2e-142
Identity = 253/253 (100.00%), Positives = 253/253 (100.00%), Query Frame = 1

Query: 1   MIPCIVNESNASESGIKVEDGILATNPCIAECSGEKLASGNLSDNISFDQNRNGDHALIT 60
           MIPCIVNESNASESGIKVEDGILATNPCIAECSGEKLASGNLSDNISFDQNRNGDHALIT
Sbjct: 1   MIPCIVNESNASESGIKVEDGILATNPCIAECSGEKLASGNLSDNISFDQNRNGDHALIT 60

Query: 61  CQSNPDSEHLSKLQAIIVSKERALSQAAIRALIRKRDKLSHQQRLIEDEIAQCDKNMQTI 120
           CQSNPDSEHLSKLQAIIVSKERALSQAAIRALIRKRDKLSHQQRLIEDEIAQCDKNMQTI
Sbjct: 61  CQSNPDSEHLSKLQAIIVSKERALSQAAIRALIRKRDKLSHQQRLIEDEIAQCDKNMQTI 120

Query: 121 LRGDEDDLVLKLDSVIECCNDICPRSTAEDKSYQYFEENCSSQYVTRKRLSEAILCIQNP 180
           LRGDEDDLVLKLDSVIECCNDICPRSTAEDKSYQYFEENCSSQYVTRKRLSEAILCIQNP
Sbjct: 121 LRGDEDDLVLKLDSVIECCNDICPRSTAEDKSYQYFEENCSSQYVTRKRLSEAILCIQNP 180

Query: 181 CLELDGICHKNNWILPVYGVSSLDGGFQANVFVKGMDFEYSSCSELCSDPRDARESAAMK 240
           CLELDGICHKNNWILPVYGVSSLDGGFQANVFVKGMDFEYSSCSELCSDPRDARESAAMK
Sbjct: 181 CLELDGICHKNNWILPVYGVSSLDGGFQANVFVKGMDFEYSSCSELCSDPRDARESAAMK 240

Query: 241 MLGQLWRMANLAK 254
           MLGQLWRMANLAK
Sbjct: 241 MLGQLWRMANLAK 253

BLAST of Csa6G042300 vs. TrEMBL
Match: F6HYR2_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_05s0102g00250 PE=4 SV=1)

HSP 1 Score: 250.0 bits (637), Expect = 3.1e-63
Identity = 135/248 (54.44%), Positives = 174/248 (70.16%), Query Frame = 1

Query: 10  NASESGIKVEDGILAT--NPCIAECSGEKLASGN-LSDNISFDQNRN--GDHALITCQSN 69
           NA+ +G   ++GI  +   PC   C GEK+A+G  + + I  DQ+     D AL+T +SN
Sbjct: 299 NAAATGAMAKEGIANSVMTPCTTSCRGEKVANGGEICNFIMPDQDGMLIEDRALVTYESN 358

Query: 70  PDSEHLSKLQAIIVSKERALSQAAIRALIRKRDKLSHQQRLIEDEIAQCDKNMQTILRGD 129
             SEHL KLQ  I SKE+ LSQ A++ L+RKRD+LSHQQR +EDEIAQCDKN+QTIL G 
Sbjct: 359 --SEHLDKLQITIASKEKLLSQTALKVLLRKRDRLSHQQRKLEDEIAQCDKNIQTILDGG 418

Query: 130 EDDLVLKLDSVIECCNDICPRSTAEDKSYQYFEENCSSQYVTRKRLSEAILCIQNPCLEL 189
           EDDL LK++S++E CND CP++   D++Y++ E+  S Q++ RKRLSEAIL IQ  C EL
Sbjct: 419 EDDLALKIESILEFCNDACPQT--RDRTYRHLEDQESPQHIKRKRLSEAILNIQKSCQEL 478

Query: 190 DGICHKNNWILPVYGVSSLDGGFQANVFVKGMDFEYSSCSELCSDPRDARESAAMKMLGQ 249
           DGIC++NNWILP Y VS LDG  +  V VKG+DFE S   E C  PR+ARESAA +ML +
Sbjct: 479 DGICYENNWILPTYRVSLLDGKSEGTVSVKGVDFEISVVGEPCDTPREARESAAAQMLAK 538

Query: 250 LWRMANLA 253
           L  MA  A
Sbjct: 539 LQSMATAA 542
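The percentages on each HSP statistics line follow directly from the matched and aligned column counts. A minimal sketch that parses such a line and recomputes them is shown here; `parse_hsp_stats` is a hypothetical helper, and the regular expression accepts both "Positives" and the "Postives" spelling that some report versions emit.

```python
import re

def parse_hsp_stats(line):
    """Extract identity/positive counts from a BLAST HSP statistics line."""
    stats = {}
    for label, num, den in re.findall(r"(Identity|Posi?tives) = (\d+)/(\d+)", line):
        key = "identity" if label == "Identity" else "positives"
        matched, aligned = int(num), int(den)
        # Percentage = matched columns / aligned columns, to two decimals.
        stats[key] = (matched, aligned, round(100 * matched / aligned, 2))
    return stats

line = "Identity = 135/248 (54.44%), Positives = 174/248 (70.16%), Query Frame = 1"
stats = parse_hsp_stats(line)
print(stats["identity"])   # (135, 248, 54.44)
print(stats["positives"])  # (174, 248, 70.16)
```

The recomputed values match the percentages reported for the Vitis vinifera hit above.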

BLAST of Csa6G042300 vs. TrEMBL
Match: A0A061EJ05_THECC (Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_020129 PE=4 SV=1)

HSP 1 Score: 208.0 bits (528), Expect = 1.4e-50
Identity = 107/192 (55.73%), Positives = 135/192 (70.31%), Query Frame = 1

Query: 54  GDHALITCQSNPDSEHLSKLQAIIVSKERALSQAAIRALIRKRDKLSHQQRLIEDEIAQC 113
           G+HA +  +SN  SE+ +KLQ II SKE+ LS+ A R L RKRDKL  Q R I DEIAQC
Sbjct: 476 GNHAPVIHESN--SEYSAKLQNIIASKEQILSETAWRVLHRKRDKLVRQLRNIGDEIAQC 535

Query: 114 DKNMQTILRGDEDDLVLKLDSVIECCNDICPRSTAEDKSYQYFEENCSSQYVTRKRLSEA 173
           DK +QTIL G EDDL LK+D +IE CND+C RS ++ ++   +E+ CS+ Y+ R RLSE 
Sbjct: 536 DKQIQTILNGGEDDLELKIDLIIEGCNDVCLRSASQGRTSHDYEDQCSTHYIKRNRLSEE 595

Query: 174 ILCIQNPCLELDGICHKNNWILPVYGVSSLDGGFQANVFVKGMDFEYSSCSELCSDPRDA 233
            L  QNPC ELDGIC+KNNW+LP Y V   DGG+QA V VKG++ E SS  + C  P +A
Sbjct: 596 ALSTQNPCQELDGICNKNNWMLPTYHVFPSDGGYQAKVTVKGVNIESSSVGDACPKPSEA 655

Query: 234 RESAAMKMLGQL 246
           R SAA +ML +L
Sbjct: 656 RGSAAAEMLAKL 665

BLAST of Csa6G042300 vs. TrEMBL
Match: A0A0D2S874_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_006G227200 PE=4 SV=1)

HSP 1 Score: 193.4 bits (490), Expect = 3.5e-46
Identity = 102/192 (53.12%), Positives = 124/192 (64.58%), Query Frame = 1

Query: 54  GDHALITCQSNPDSEHLSKLQAIIVSKERALSQAAIRALIRKRDKLSHQQRLIEDEIAQC 113
           G+HA + C+SN  S+  +KL   I SK+  LS+ A+R L+ KRDKL  Q R I DEIAQC
Sbjct: 473 GNHASVICESN--SKCSAKLHNAIASKDHVLSKTALRVLLNKRDKLVLQLRKIGDEIAQC 532

Query: 114 DKNMQTILRGDEDDLVLKLDSVIECCNDICPRSTAEDKSYQYFEENCSSQYVTRKRLSEA 173
           DK MQTIL G EDDL LKLD VIE CND CP ST E+++ + +E+ C +Q   R R SE 
Sbjct: 533 DKKMQTILNGGEDDLELKLDLVIEGCNDACPESTGEERTSKDYEDPCWAQCKKRSRSSEE 592

Query: 174 ILCIQNPCLELDGICHKNNWILPVYGVSSLDGGFQANVFVKGMDFEYSSCSELCSDPRDA 233
                NPC ELDGIC   NW+LP Y V   DGG+QA V VKG +FE  S  + C  P +A
Sbjct: 593 ASSKHNPCQELDGICKDKNWVLPTYQVFPSDGGYQAKVTVKGTNFESLSLGDACPKPHEA 652

Query: 234 RESAAMKMLGQL 246
           R SAA  ML +L
Sbjct: 653 RSSAATTMLAKL 662

BLAST of Csa6G042300 vs. TrEMBL
Match: A0A0J8FIJ6_BETVU (Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_3g056920 PE=4 SV=1)

HSP 1 Score: 191.4 bits (485), Expect = 1.3e-45
Identity = 102/218 (46.79%), Positives = 147/218 (67.43%), Query Frame = 1

Query: 35  EKLASGNLSDNISF-DQNRNGDHALITCQSNPDSEHLSKLQAIIVSKERALSQAAIRALI 94
           EKL S N +D+ +  DQ+   DHAL++    P  +HL K+Q II SKE  +S+ A++ L 
Sbjct: 578 EKLLSDNKTDDTAMSDQDEKQDHALVSFDPTP--KHLHKIQLIIASKENMISETALKVLY 637

Query: 95  RKRDKLSHQQRLIEDEIAQCDKNMQTILRGDEDDLVLKLDSVIECCNDIC-PRSTAEDKS 154
           RKR++L  Q R I DEIA CD+ ++TI+ G EDDL +KLDS++E CN++C P + A    
Sbjct: 638 RKREQLYQQLRNIGDEIALCDRKIETIICGGEDDLAVKLDSIVEGCNELCQPTAVAGGNE 697

Query: 155 YQY-FEENCSSQYVTRKRLSEAILCIQNPCLELDGICHKNNWILPVYGVSSLDGGFQANV 214
                +++     V R RLSEA LC ++PC ELD IC++NNW+LP Y +S+ +GGF+A++
Sbjct: 698 IALPSQDHDILPAVKRTRLSEANLCKRSPCQELDSICNENNWMLPTYSISTSEGGFEASI 757

Query: 215 FVKGMDFEYSSCSELCSDPRDARESAAMKMLGQLWRMA 250
            VKG+DFE SS   +C+ P +AR SAA ++L +L RMA
Sbjct: 758 TVKGVDFECSSEGTICTTPHEARNSAASEVLVKLRRMA 793

BLAST of Csa6G042300 vs. TAIR10
Match: AT1G05950.1 (AT1G05950.1 unknown protein)

HSP 1 Score: 133.3 bits (334), Expect = 2.2e-31
Identity = 92/250 (36.80%), Positives = 133/250 (53.20%), Query Frame = 1

Query: 1   MIPCIVNESNASESGIKVEDGILATNPCIAECSGEKLASGNLSDNISFDQNRNGDHALIT 60
           M PC  N SN  + G +V     A++P       ++L    L    +     N  H L  
Sbjct: 393 MSPCKDNYSNGEKGGFEV-----ASDP-------KELKERGLQRKKAVPDRLNSIHKL-- 452

Query: 61  CQSNPDSEH-----LSKLQAIIVSKERALSQAAIRALIRKRDKLSHQQRLIEDEIAQCDK 120
             S P S H     L +LQ  ++SK  +LS+ A++ L+ KRDKL+ QQR IEDEIA+CDK
Sbjct: 453 -NSTPASAHNSNPNLEELQTSLLSKATSLSETALKVLLCKRDKLTRQQRNIEDEIAKCDK 512

Query: 121 NMQTILRGDEDDLVLKLDSVIECCNDICPRSTAEDKSYQYFEENCSSQYVTRKRLSEAIL 180
            +Q I +GD +   L+L++V+ECCN+  PR   ++       +  + Q   R +LSE + 
Sbjct: 513 CIQNI-KGDWE---LQLETVLECCNETYPRRNLQESL-----DKSACQSNKRLKLSETLP 572

Query: 181 CIQNPCLELDGICHKNNWILPVYGVSSLDGGFQANVFVKGMDFEYSSCSELCSDPRDARE 240
             ++ C  LD IC  NNW+LP Y V+  DGG++A V + G     +   E  SD  +ARE
Sbjct: 573 STKSLCQRLDDICLMNNWVLPNYRVAPSDGGYEAEVRITGNHVACTIHGEEKSDAEEARE 618

Query: 241 SAAMKMLGQL 246
           SAA  +L +L
Sbjct: 633 SAAACLLTKL 618

BLAST of Csa6G042300 vs. NCBI nr
Match: gi|778710231|ref|XP_011656540.1| (PREDICTED: uncharacterized protein LOC101206764 isoform X1 [Cucumis sativus])

HSP 1 Score: 513.8 bits (1322), Expect = 1.7e-142
Identity = 253/253 (100.00%), Positives = 253/253 (100.00%), Query Frame = 1

Query: 1   MIPCIVNESNASESGIKVEDGILATNPCIAECSGEKLASGNLSDNISFDQNRNGDHALIT 60
           MIPCIVNESNASESGIKVEDGILATNPCIAECSGEKLASGNLSDNISFDQNRNGDHALIT
Sbjct: 430 MIPCIVNESNASESGIKVEDGILATNPCIAECSGEKLASGNLSDNISFDQNRNGDHALIT 489

Query: 61  CQSNPDSEHLSKLQAIIVSKERALSQAAIRALIRKRDKLSHQQRLIEDEIAQCDKNMQTI 120
           CQSNPDSEHLSKLQAIIVSKERALSQAAIRALIRKRDKLSHQQRLIEDEIAQCDKNMQTI
Sbjct: 490 CQSNPDSEHLSKLQAIIVSKERALSQAAIRALIRKRDKLSHQQRLIEDEIAQCDKNMQTI 549

Query: 121 LRGDEDDLVLKLDSVIECCNDICPRSTAEDKSYQYFEENCSSQYVTRKRLSEAILCIQNP 180
           LRGDEDDLVLKLDSVIECCNDICPRSTAEDKSYQYFEENCSSQYVTRKRLSEAILCIQNP
Sbjct: 550 LRGDEDDLVLKLDSVIECCNDICPRSTAEDKSYQYFEENCSSQYVTRKRLSEAILCIQNP 609

Query: 181 CLELDGICHKNNWILPVYGVSSLDGGFQANVFVKGMDFEYSSCSELCSDPRDARESAAMK 240
           CLELDGICHKNNWILPVYGVSSLDGGFQANVFVKGMDFEYSSCSELCSDPRDARESAAMK
Sbjct: 610 CLELDGICHKNNWILPVYGVSSLDGGFQANVFVKGMDFEYSSCSELCSDPRDARESAAMK 669

Query: 241 MLGQLWRMANLAK 254
           MLGQLWRMANLAK
Sbjct: 670 MLGQLWRMANLAK 682

BLAST of Csa6G042300 vs. NCBI nr
Match: gi|700190792|gb|KGN45996.1| (hypothetical protein Csa_6G042300 [Cucumis sativus])

HSP 1 Score: 513.8 bits (1322), Expect = 1.7e-142
Identity = 253/253 (100.00%), Positives = 253/253 (100.00%), Query Frame = 1

Query: 1   MIPCIVNESNASESGIKVEDGILATNPCIAECSGEKLASGNLSDNISFDQNRNGDHALIT 60
           MIPCIVNESNASESGIKVEDGILATNPCIAECSGEKLASGNLSDNISFDQNRNGDHALIT
Sbjct: 1   MIPCIVNESNASESGIKVEDGILATNPCIAECSGEKLASGNLSDNISFDQNRNGDHALIT 60

Query: 61  CQSNPDSEHLSKLQAIIVSKERALSQAAIRALIRKRDKLSHQQRLIEDEIAQCDKNMQTI 120
           CQSNPDSEHLSKLQAIIVSKERALSQAAIRALIRKRDKLSHQQRLIEDEIAQCDKNMQTI
Sbjct: 61  CQSNPDSEHLSKLQAIIVSKERALSQAAIRALIRKRDKLSHQQRLIEDEIAQCDKNMQTI 120

Query: 121 LRGDEDDLVLKLDSVIECCNDICPRSTAEDKSYQYFEENCSSQYVTRKRLSEAILCIQNP 180
           LRGDEDDLVLKLDSVIECCNDICPRSTAEDKSYQYFEENCSSQYVTRKRLSEAILCIQNP
Sbjct: 121 LRGDEDDLVLKLDSVIECCNDICPRSTAEDKSYQYFEENCSSQYVTRKRLSEAILCIQNP 180

Query: 181 CLELDGICHKNNWILPVYGVSSLDGGFQANVFVKGMDFEYSSCSELCSDPRDARESAAMK 240
           CLELDGICHKNNWILPVYGVSSLDGGFQANVFVKGMDFEYSSCSELCSDPRDARESAAMK
Sbjct: 181 CLELDGICHKNNWILPVYGVSSLDGGFQANVFVKGMDFEYSSCSELCSDPRDARESAAMK 240

Query: 241 MLGQLWRMANLAK 254
           MLGQLWRMANLAK
Sbjct: 241 MLGQLWRMANLAK 253

BLAST of Csa6G042300 vs. NCBI nr
Match: gi|659089858|ref|XP_008445718.1| (PREDICTED: uncharacterized protein LOC103488666 isoform X2 [Cucumis melo])

HSP 1 Score: 477.2 bits (1227), Expect = 1.7e-131
Identity = 237/253 (93.68%), Positives = 246/253 (97.23%), Query Frame = 1

Query: 1   MIPCIVNESNASESGIKVEDGILATNPCIAECSGEKLASGNLSDNISFDQNRNGDHALIT 60
           MIPC+VNES+ASESGIKV+DGILATNPCIAECSGEK+ASGNLSDNISFDQNRNGDHALIT
Sbjct: 434 MIPCMVNESDASESGIKVQDGILATNPCIAECSGEKIASGNLSDNISFDQNRNGDHALIT 493

Query: 61  CQSNPDSEHLSKLQAIIVSKERALSQAAIRALIRKRDKLSHQQRLIEDEIAQCDKNMQTI 120
           CQSN  +EHLSKLQAIIVSKE ALSQAAI+ALIRKRDKLSHQQRLIEDEIAQCDKNMQTI
Sbjct: 494 CQSN--AEHLSKLQAIIVSKETALSQAAIKALIRKRDKLSHQQRLIEDEIAQCDKNMQTI 553

Query: 121 LRGDEDDLVLKLDSVIECCNDICPRSTAEDKSYQYFEENCSSQYVTRKRLSEAILCIQNP 180
           LRGDEDDLVLKLDSVI+CCND+C +STAEDKSYQYFEENCSSQYVTRKRLSEAILCIQNP
Sbjct: 554 LRGDEDDLVLKLDSVIDCCNDLC-QSTAEDKSYQYFEENCSSQYVTRKRLSEAILCIQNP 613

Query: 181 CLELDGICHKNNWILPVYGVSSLDGGFQANVFVKGMDFEYSSCSELCSDPRDARESAAMK 240
           C ELDGICHKNNWILPVYGVSSLDGGFQANVFVKGMDFEYSSC ELCSDPRDARESAAMK
Sbjct: 614 CQELDGICHKNNWILPVYGVSSLDGGFQANVFVKGMDFEYSSCGELCSDPRDARESAAMK 673

Query: 241 MLGQLWRMANLAK 254
           MLGQLWRMAN AK
Sbjct: 674 MLGQLWRMANQAK 683

BLAST of Csa6G042300 vs. NCBI nr
Match: gi|659089854|ref|XP_008445716.1| (PREDICTED: uncharacterized protein LOC103488666 isoform X1 [Cucumis melo])

HSP 1 Score: 475.3 bits (1222), Expect = 6.6e-131
Identity = 236/253 (93.28%), Positives = 245/253 (96.84%), Query Frame = 1

Query: 1   MIPCIVNESNASESGIKVEDGILATNPCIAECSGEKLASGNLSDNISFDQNRNGDHALIT 60
           MIPC+VNES+ASESGIK +DGILATNPCIAECSGEK+ASGNLSDNISFDQNRNGDHALIT
Sbjct: 434 MIPCMVNESDASESGIKAKDGILATNPCIAECSGEKIASGNLSDNISFDQNRNGDHALIT 493

Query: 61  CQSNPDSEHLSKLQAIIVSKERALSQAAIRALIRKRDKLSHQQRLIEDEIAQCDKNMQTI 120
           CQSN  +EHLSKLQAIIVSKE ALSQAAI+ALIRKRDKLSHQQRLIEDEIAQCDKNMQTI
Sbjct: 494 CQSN--AEHLSKLQAIIVSKETALSQAAIKALIRKRDKLSHQQRLIEDEIAQCDKNMQTI 553

Query: 121 LRGDEDDLVLKLDSVIECCNDICPRSTAEDKSYQYFEENCSSQYVTRKRLSEAILCIQNP 180
           LRGDEDDLVLKLDSVI+CCND+C +STAEDKSYQYFEENCSSQYVTRKRLSEAILCIQNP
Sbjct: 554 LRGDEDDLVLKLDSVIDCCNDLC-QSTAEDKSYQYFEENCSSQYVTRKRLSEAILCIQNP 613

Query: 181 CLELDGICHKNNWILPVYGVSSLDGGFQANVFVKGMDFEYSSCSELCSDPRDARESAAMK 240
           C ELDGICHKNNWILPVYGVSSLDGGFQANVFVKGMDFEYSSC ELCSDPRDARESAAMK
Sbjct: 614 CQELDGICHKNNWILPVYGVSSLDGGFQANVFVKGMDFEYSSCGELCSDPRDARESAAMK 673

Query: 241 MLGQLWRMANLAK 254
           MLGQLWRMAN AK
Sbjct: 674 MLGQLWRMANQAK 683

BLAST of Csa6G042300 vs. NCBI nr
Match: gi|659089860|ref|XP_008445719.1| (PREDICTED: uncharacterized protein LOC103488666 isoform X3 [Cucumis melo])

HSP 1 Score: 332.8 bits (852), Expect = 5.3e-88
Identity = 167/181 (92.27%), Positives = 176/181 (97.24%), Query Frame = 1

Query: 1   MIPCIVNESNASESGIKVEDGILATNPCIAECSGEKLASGNLSDNISFDQNRNGDHALIT 60
           MIPC+VNES+ASESGIK +DGILATNPCIAECSGEK+ASGNLSDNISFDQNRNGDHALIT
Sbjct: 434 MIPCMVNESDASESGIKAKDGILATNPCIAECSGEKIASGNLSDNISFDQNRNGDHALIT 493

Query: 61  CQSNPDSEHLSKLQAIIVSKERALSQAAIRALIRKRDKLSHQQRLIEDEIAQCDKNMQTI 120
           CQSN  +EHLSKLQAIIVSKE ALSQAAI+ALIRKRDKLSHQQRLIEDEIAQCDKNMQTI
Sbjct: 494 CQSN--AEHLSKLQAIIVSKETALSQAAIKALIRKRDKLSHQQRLIEDEIAQCDKNMQTI 553

Query: 121 LRGDEDDLVLKLDSVIECCNDICPRSTAEDKSYQYFEENCSSQYVTRKRLSEAILCIQNP 180
           LRGDEDDLVLKLDSVI+CCND+C +STAEDKSYQYFEENCSSQYVTRKRLSEAILCIQNP
Sbjct: 554 LRGDEDDLVLKLDSVIDCCNDLC-QSTAEDKSYQYFEENCSSQYVTRKRLSEAILCIQNP 611

Query: 181 C 182
           C
Sbjct: 614 C 611

The following BLAST results are available for this feature:
Match Name                         E-value   Identity  Description
A0A0A0KE35_CUCSA                   1.2e-142  100.00    Uncharacterized protein OS=Cucumis sativus GN=Csa_6G042300 PE=4 SV=1
F6HYR2_VITVI                       3.1e-63   54.44     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_05s0102g00250 PE=4 SV=...
A0A061EJ05_THECC                   1.4e-50   55.73     Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_020129 PE=4 SV=1
A0A0D2S874_GOSRA                   3.5e-46   53.13     Uncharacterized protein OS=Gossypium raimondii GN=B456_006G227200 PE=4 SV=1
A0A0J8FIJ6_BETVU                   1.3e-45   46.79     Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_3g056920 PE=4 S...

Match Name                         E-value   Identity  Description
AT1G05950.1                        2.2e-31   36.80     unknown protein

Match Name                         E-value   Identity  Description
gi|778710231|ref|XP_011656540.1|   1.7e-142  100.00    PREDICTED: uncharacterized protein LOC101206764 isoform X1 [Cucumis sativus]
gi|700190792|gb|KGN45996.1|        1.7e-142  100.00    hypothetical protein Csa_6G042300 [Cucumis sativus]
gi|659089858|ref|XP_008445718.1|   1.7e-131  93.68     PREDICTED: uncharacterized protein LOC103488666 isoform X2 [Cucumis melo]
gi|659089854|ref|XP_008445716.1|   6.6e-131  93.28     PREDICTED: uncharacterized protein LOC103488666 isoform X1 [Cucumis melo]
gi|659089860|ref|XP_008445719.1|   5.3e-88   92.27     PREDICTED: uncharacterized protein LOC103488666 isoform X3 [Cucumis melo]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
This gene is associated with the following unigenes:
Unigene Name  Analysis Name                        Sequence type in Unigene
CU144510      cucumber EST collection version 3.0  transcribed_cluster
CU162245      cucumber EST collection version 3.0  transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Csa6G042300.1  Csa6G042300.1  mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name  Unique Name  Type
CU162245      CU162245     transcribed_cluster
CU144510      CU144510     transcribed_cluster


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term  IPR Description   Source   Source Term  Source Description         Alignment
None      No IPR available  PANTHER  PTHR33913    FAMILY NOT NAMED           coord: 62..253; score: 1.4
None      No IPR available  PFAM     PF14709      DND1_DSRM                  coord: 183..245; score: 6.
None      No IPR available  unknown  SSF54768     dsRNA-binding domain-like  coord: 175..245; score: 2.8
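The `coord` ranges in the annotation table can be mapped back onto the protein sequence to see which residues each hit covers. The sketch below assumes the coordinates are 1-based, inclusive residue positions (not stated on this page); `domain_residues` is a hypothetical helper, and the protein is the 253-residue sequence listed above without the trailing stop symbol.

```python
# Protein sequence from this record, split into the 60-residue blocks
# used in the BLAST alignments (stop symbol omitted).
PROTEIN = (
    "MIPCIVNESNASESGIKVEDGILATNPCIAECSGEKLASGNLSDNISFDQNRNGDHALIT"
    "CQSNPDSEHLSKLQAIIVSKERALSQAAIRALIRKRDKLSHQQRLIEDEIAQCDKNMQTI"
    "LRGDEDDLVLKLDSVIECCNDICPRSTAEDKSYQYFEENCSSQYVTRKRLSEAILCIQNP"
    "CLELDGICHKNNWILPVYGVSSLDGGFQANVFVKGMDFEYSSCSELCSDPRDARESAAMK"
    "MLGQLWRMANLAK"
)

def domain_residues(protein, coord):
    """Return the residues covered by a 'start..end' range.

    Assumes 1-based, inclusive coordinates (an assumption).
    """
    start, end = (int(x) for x in coord.split(".."))
    if not 1 <= start <= end <= len(protein):
        raise ValueError(f"range {coord} is outside the protein")
    return protein[start - 1:end]

dsrm = domain_residues(PROTEIN, "183..245")  # PFAM PF14709 (DND1_DSRM) hit
print(len(PROTEIN), len(dsrm))
print(dsrm)
```

Under that assumption the PF14709 hit covers 63 residues at the C-terminal end of the protein, inside the wider 175..245 range reported for SSF54768.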

The following gene(s) are orthologous to this gene:
Gene         Orthologue    Organism                   Block
Csa6G042300  MELO3C011646  Melon (DHL92) v3.5.1       cumeB476
Csa6G042300  CSPI06G03560  Wild cucumber (PI 183967)  cpicuB314
The following gene(s) are paralogous to this gene:

None