ClCG03G011940 (gene) Watermelon (Charleston Gray)

Name: ClCG03G011940
Type: gene
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: Unknown protein
Location: CG_Chr03 : 23857942 .. 23859595 (+)
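
The location field above can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF convention, not stated on this page); the function name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Parse a location such as 'CG_Chr03 : 23857942 .. 23859595 (+)'."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: inclusive coordinates, so the span is end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("CG_Chr03 : 23857942 .. 23859595 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the feature spans 1654 bp on the plus strand of CG_Chr03.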
The following sequences are available for this feature:

Gene sequence (with intron)

ATGATCTCAAACAAAGTCTTCACTACTTTGTTGATTTTGACCTCCAAACCGCTTTCAAAGTCGTCGAGATCGAGCTTGGATTCTTACATGATTCACTCTACACACAATTCCCCATCCTTCTTTGGTGCTCTTCTTCGCTTTATTACTTCCTTTTCCATTCTCTCAGCCATCATTGCATTCATCTTGATCGATAAGCAAAGTACCCTTTACAAGATTGTGTATTTCACTTTTATATTACTTTTTGGAGCTCTAGGAGTTGAAATTTACTCTGTTTTTCGGATTGGTGTGTAATTTGGTTGACCCAAATTGACAATTTTCTTGCTCGTTTGATGTTGAAAGCCCATCTATGCCTTTGGGTGGTCTCTTGAGATGAAAAGATGGTGTAATTCGATTCCGCAGTACAATCTATTTACTAGTTGTTTTGAAGATATAAACATATACAGACACGGCAGGTCGATGGTCTGGTCAAATCCTTTCGAAATCAAGACCAATATTTCACGACCAATTTCGAATCAACTTAAGAAACGAATCTTTGAAGAGTTAAACAAGATGGTGGAAATCAAGAGGGAGGAGGGTGATTGGGTTGGAGCATGGAATTGGATTCAGTTCAAAGCATACTCCTTTGGCACATTGCCACAGAGATTTGCTACTACTCACATCACAAAAACAAGGCTTTCAATCATTCAATATCTCTTCTTCAAGACACTAAATTGTTGTCTAATTACCTCGCCTACCTTTTAGCATATCGTCAATCCTTGTTCTCGAACGGGGTGGGACGAACAAGATTCGAAGCAACGGTTATCGACACTAAGGATTTTCTCCGACGACATGTCCCGATATTGTTGGGAAGAGTTGAGCATGCTTATGAGTATGTATTGGAAGATTTGGAGTTGAGAGTTGGGTGTAGAAAAGATGTTAAAGAATTGGGAGTTGGGTCAGTGCTAAACTATGGGTTCTTGTTGGCTAAGGAGCTTCAAAGGTTGGAAGAAAATGAGAGGTGGGAGATGATGAGCCATGAATGGGTGGAGTTGCTTGCTAAGGTTGCTTGTGAATGTAAATTTTATGATCATATTGAGCAACTTGGACATGGAGGAGACTTACTCACCCATGTTTGGCTTTTGATGCATCATATTGGTTATACAAAACAAGCTTATGATGTTACAAATGCCAGAGATGAACAAATCCAAGAATTTCGTAGGGTTTTAAAACGTCGTAGAGGTGGTGGATCAAGTCGATTTGATCTTATTCAACCATAACGAAGGATTAAAATATCGATGTCGATATTAAAATTTCAATTTTACGGATATATGGATAAAATATCGATATCCAAAAAAAATTTCTATAACTCAAAAATATTTAGAAAGTTTATTTTGATTGATAATTAAGTTGTTTTTATTCTCAAATTAAGTTATGGACATTGTTATTAGTATTTCTATTCATAATGGATTAAATAGATACATTTTTATGTTTTATGAGTATTAAGATATCTGTGGATATTTAATATCGATGTCAAACTCTTAGATTTATGGATATGTCAATGGATATTTCCATCCTTGACCATAACTTAACCTTTTTTTTTTATTTTTTTTATTTTTTTAAATTTTTTTTAATTTTTTTTTTAATTTTTTTTTTTAAGAGATGCCAATTTGGGTAA

mRNA sequence

ATGATCTCAAACAAAGTCTTCACTACTTTGTTGATTTTGACCTCCAAACCGCTTTCAAAGTCGTCGAGATCGAGCTTGGATTCTTACATGATTCACTCTACACACAATTCCCCATCCTTCTTTGGTGCTCTTCTTCGCTTTATTACTTCCTTTTCCATTCTCTCAGCCATCATTGCATTCATCTTGATCGATAAGCAAAGTACCCTTTACAAGATTGTGTATTTCACTTTTATATTACTTTTTGGAGCTCTAGGAGTTGAAATTTACTCTGTTTTTCGGATTGGTCCCATCTATGCCTTTGGGTGGTCTCTTGAGATGAAAAGATGGTGTAATTCGATTCCGCAGTACAATCTATTTACTAGTTGTTTTGAAGATATAAACATATACAGACACGGCAGGTCGATGGTCTGGTCAAATCCTTTCGAAATCAAGACCAATATTTCACGACCAATTTCGAATCAACTTAAGAAACGAATCTTTGAAGAGTTAAACAAGATGGTGGAAATCAAGAGGGAGGAGGTTCAAAGCATACTCCTTTGGCACATTGCCACAGAGATTTGCTACTACTCACATCACAAAAACAAGGCTTTCAATCATTCAATATCTCTTCTTCAAGACACTAAATTGTTGTCTAATTACCTCGCCTACCTTTTAGCATATCGTCAATCCTTGTTCTCGAACGGGGTGGGACGAACAAGATTCGAAGCAACGGTTATCGACACTAAGGATTTTCTCCGACGACATGTCCCGATATTGTTGGGAAGAGTTGAGCATGCTTATGAGTATGTATTGGAAGATTTGGAGTTGAGAGTTGGGTGTAGAAAAGATGTTAAAGAATTGGGAGTTGGGTCAGTGCTAAACTATGGGTTCTTGTTGGCTAAGGAGCTTCAAAGGTTGGAAGAAAATGAGAGGTGGGAGATGATGAGCCATGAATGGGTGGAGTTGCTTGCTAAGGTTGCTTGTGAATGTAAATTTTATGATCATATTGAGCAACTTGGACATGGAGGAGACTTACTCACCCATGTTTGGCTTTTGATGCATCATATTGGTTATACAAAACAAGCTTATGATGTTACAAATGCCAGAGATGAACAAATCCAAGAATTTCGTAGGAGATGCCAATTTGGGTAA

Coding sequence (CDS)

ATGATCTCAAACAAAGTCTTCACTACTTTGTTGATTTTGACCTCCAAACCGCTTTCAAAGTCGTCGAGATCGAGCTTGGATTCTTACATGATTCACTCTACACACAATTCCCCATCCTTCTTTGGTGCTCTTCTTCGCTTTATTACTTCCTTTTCCATTCTCTCAGCCATCATTGCATTCATCTTGATCGATAAGCAAAGTACCCTTTACAAGATTGTGTATTTCACTTTTATATTACTTTTTGGAGCTCTAGGAGTTGAAATTTACTCTGTTTTTCGGATTGGTCCCATCTATGCCTTTGGGTGGTCTCTTGAGATGAAAAGATGGTGTAATTCGATTCCGCAGTACAATCTATTTACTAGTTGTTTTGAAGATATAAACATATACAGACACGGCAGGTCGATGGTCTGGTCAAATCCTTTCGAAATCAAGACCAATATTTCACGACCAATTTCGAATCAACTTAAGAAACGAATCTTTGAAGAGTTAAACAAGATGGTGGAAATCAAGAGGGAGGAGGTTCAAAGCATACTCCTTTGGCACATTGCCACAGAGATTTGCTACTACTCACATCACAAAAACAAGGCTTTCAATCATTCAATATCTCTTCTTCAAGACACTAAATTGTTGTCTAATTACCTCGCCTACCTTTTAGCATATCGTCAATCCTTGTTCTCGAACGGGGTGGGACGAACAAGATTCGAAGCAACGGTTATCGACACTAAGGATTTTCTCCGACGACATGTCCCGATATTGTTGGGAAGAGTTGAGCATGCTTATGAGTATGTATTGGAAGATTTGGAGTTGAGAGTTGGGTGTAGAAAAGATGTTAAAGAATTGGGAGTTGGGTCAGTGCTAAACTATGGGTTCTTGTTGGCTAAGGAGCTTCAAAGGTTGGAAGAAAATGAGAGGTGGGAGATGATGAGCCATGAATGGGTGGAGTTGCTTGCTAAGGTTGCTTGTGAATGTAAATTTTATGATCATATTGAGCAACTTGGACATGGAGGAGACTTACTCACCCATGTTTGGCTTTTGATGCATCATATTGGTTATACAAAACAAGCTTATGATGTTACAAATGCCAGAGATGAACAAATCCAAGAATTTCGTAGGAGATGCCAATTTGGGTAA

Protein sequence

MISNKVFTTLLILTSKPLSKSSRSSLDSYMIHSTHNSPSFFGALLRFITSFSILSAIIAFILIDKQSTLYKIVYFTFILLFGALGVEIYSVFRIGPIYAFGWSLEMKRWCNSIPQYNLFTSCFEDINIYRHGRSMVWSNPFEIKTNISRPISNQLKKRIFEELNKMVEIKREEVQSILLWHIATEICYYSHHKNKAFNHSISLLQDTKLLSNYLAYLLAYRQSLFSNGVGRTRFEATVIDTKDFLRRHVPILLGRVEHAYEYVLEDLELRVGCRKDVKELGVGSVLNYGFLLAKELQRLEENERWEMMSHEWVELLAKVACECKFYDHIEQLGHGGDLLTHVWLLMHHIGYTKQAYDVTNARDEQIQEFRRRCQFG
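
The protein sequence corresponds to a translation of the coding sequence. The sketch below shows one way to check that relationship on short fragments, assuming the standard nuclear genetic code (the page does not state which table was used); `translate` is a hypothetical helper, and the two test fragments are the first 18 and last 30 bases of the CDS shown above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)

n_term = translate("ATGATCTCAAACAAAGTC")              # start of the CDS
c_term = translate("GAATTTCGTAGGAGATGCCAATTTGGGTAA")  # end of the CDS, including TAA
print(n_term, c_term)
```

The two fragments translate to MISNKV and EFRRRCQFG, which match the first and last residues of the protein sequence listed above.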
BLAST of ClCG03G011940 vs. TrEMBL
Match: A0A0A0LZZ2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G638510 PE=4 SV=1)

HSP 1 Score: 196.8 bits (499), Expect = 4.6e-47
Identity = 126/358 (35.20%), Positives = 190/358 (53.07%), Query Frame = 1

Query: 39  SFFGALLRFITSFSILSAIIAFILIDKQSTLYKIVYFTFILLFGALGVEIYSVF------ 98
           S  G L R  T  S++ A + + LIDKQ      V   F+L  GAL +EIYS+F      
Sbjct: 263 SLCGRLFRLTTFSSLVIAFLTYCLIDKQEYPSTYVNLIFLLFSGALSIEIYSLFLFLFSD 322

Query: 99  -------------------RIGPIYAFGWSLEMKRWCNSIPQYNLFTSCFEDINIYRHGR 158
                               +  I   GWSL+ +R  NSI QYNL + C E  N   + +
Sbjct: 323 WNVIWLLTTQSPSNPLPRLALKLISLCGWSLKKRRCSNSISQYNLISHCLEQKNDSYYFK 382

Query: 159 SMVWSNPFEIKTNISRPISNQLKKRIFEELNKMVEIKRE------EV---------QSIL 218
               S       ++ RPISN L+  IF++L + + + +E      E+         QSIL
Sbjct: 383 FP--STKTIAAFSVQRPISNNLEAHIFQQLKQKLVLNQEYDYGYNEIGWSLKLDLDQSIL 442

Query: 219 LWHIATEICYYSHHKNKAFNHSISLL--QDTKLLSNYLAYLLAYRQSLFSNGVGRTRFEA 278
           +WHIAT+ CY+S  K K    S S +  QD+  LSN+LAY + +  SLF +G+ + R +A
Sbjct: 443 IWHIATDFCYHSSPKFKESEESKSCIPPQDSVSLSNFLAYFIVHHPSLFPSGMSQIRHKA 502

Query: 279 TVIDTKDFLRRHVPILLGRVEHAYEYVLEDLELRVGCRKDVKELGVGSVLNYGFLLAKEL 338
           T     + L+        +++     +L++LEL +   + VKE    S +   F LA  L
Sbjct: 503 TSEHVLELLQDE------KLDRCRSNMLKNLELNI---EVVKEERKESRVLDAFRLAGFL 562

Query: 339 QRLEENERWEMMSHEWVELLAKVACECKFYDHIEQLGHGGDLLTHVWLLMHHIGYTKQ 355
           ++LE++++WE++ + WVELL +++CEC++YDH +QL  GG L+T VW+LMHH+GY KQ
Sbjct: 563 EKLEQSQKWEIIGNVWVELLGRISCECEWYDHAKQLTQGGSLVTRVWILMHHLGYLKQ 609
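
Each HSP summary line reports identical and similar (positive) positions as a fraction of the alignment length. The following is a rough sketch of how such a line could be parsed and its percentages re-derived; the helper names are hypothetical, and the regular expression is written only against the line format seen on this page, not against any official BLAST output specification.

```python
import re

# Matches fragments such as "Identity = 126/358 (35.20%)".
FRACTION = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+(?:\.\d+)?)%\)")

def parse_hsp_fractions(line):
    """Return {label: (matches, alignment_length, reported_percent)}."""
    return {
        label: (int(num), int(den), float(pct))
        for label, num, den, pct in FRACTION.findall(line)
    }

def percent(num, den):
    """Recompute a percentage to two decimals from the raw counts."""
    return round(100.0 * num / den, 2)

line = "Identity = 126/358 (35.20%), Positives = 190/358 (53.07%), Query Frame = 1"
stats = parse_hsp_fractions(line)
for label, (num, den, reported) in stats.items():
    print(label, num, den, reported, percent(num, den))
```

The dictionary keys are whatever labels appear in the line, so the same function works regardless of how the second field is spelled.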

BLAST of ClCG03G011940 vs. TrEMBL
Match: A0A166C0F7_DAUCA (Uncharacterized protein OS=Daucus carota subsp. sativus GN=DCAR_011877 PE=4 SV=1)

HSP 1 Score: 154.5 bits (389), Expect = 2.6e-34
Identity = 116/377 (30.77%), Positives = 178/377 (47.21%), Query Frame = 1

Query: 39  SFFGALLRFITSFSILSAIIAFILIDKQSTLYKIVYFTFILLFGALGVEIYSV------- 98
           S +G  LR  +  S    +I F LID +S     +  + +LL GA+G+E+Y+V       
Sbjct: 295 SIWGVALRATSFLSTAFGLIVFCLIDWKSYKAVDLCISLVLLVGAIGLEVYAVLLLLSSD 354

Query: 99  ---------------FRIGPIYAFGWSLEMKRWCNSIPQYNLFTSCFED-INIYRHGRSM 158
                          F    I  F W    K+W NS+ QYNL +S   +   I+   ++ 
Sbjct: 355 WTLLLLSKLKNPLIDFIYKFITCFNWITSKKKWSNSMAQYNLLSSSLNNKATIWNFIKNH 414

Query: 159 VWS--------NPFEIKTNISRPISNQLKKRI-----FEELNKMVEIKREEV-------- 218
           V+         N  +I T I+  I +QL ++      F    K+   + E V        
Sbjct: 415 VFQILDDYQDVNSVDIPTEINISIFSQLVEKSKGASDFRMCKKLCGARGEYVLAKYKCAE 474

Query: 219 -----------QSILLWHIATEICYYSHHKNKAFNHSISLLQDTKLLSNYLAYLLAYRQS 278
                       SILLWH+AT++CY++       +   S  + +KLLS+Y+ YLL  R  
Sbjct: 475 KFTWSVEFEFDHSILLWHLATDLCYFTDKTENPDSVLDSNCRISKLLSDYMLYLLVKRPF 534

Query: 279 LFSNGVGRTRFEATVIDTKDFLRRHVPILLGRVEHAYEYVLEDLELRVGCRKDVKELGVG 338
           +  NG+G+ RF+ T  +  +F +    + +   E      L  +E+ +    +VK     
Sbjct: 535 MLPNGIGQIRFQDTCAEATEFFQE---LKITPDEDEACTKLLQVEIIIP-PHEVKGDRSK 594

Query: 339 SVLNYGFLLAKELQRL------EENERWEMMSHEWVELLAKVACECKFYDHIEQLGHGGD 355
           SVL     LAK LQ L      E+ ++WEMMSH W+E+L+  AC C + DH +Q+  GG+
Sbjct: 595 SVLFDACKLAKLLQSLECGELWEKQQKWEMMSHMWIEMLSYAACHCTWKDHAQQIRRGGE 654

BLAST of ClCG03G011940 vs. TrEMBL
Match: F6HYV5_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_05s0102g00970 PE=4 SV=1)

HSP 1 Score: 151.4 bits (381), Expect = 2.2e-33
Identity = 113/354 (31.92%), Positives = 179/354 (50.56%), Query Frame = 1

Query: 41  FGALLRFITSFSILSAIIAFILIDKQSTLYKIVYFTFILLFGALGVEIYSVFRIGPIYAF 100
           +G LLR ++    +S  +AF+LI+K       +  TF+LL GA+ +E+Y++     + + 
Sbjct: 297 WGHLLRSVSLSFTVSTFVAFLLINKHGYSTIDLIITFLLLVGAIVLEMYAII---VLLSS 356

Query: 101 GWSL--------------EM----KRWCNSIPQYNLFTSCFEDINIYRH-----GRSMVW 160
            W++              EM    KRW NS+ QYNL + C +D  I  +     G S V+
Sbjct: 357 DWTILSLSKHRITLKDRDEMDRANKRWSNSMAQYNLMSFCLKDKPIRWYLELLQGFSYVY 416

Query: 161 SNPFEIKTNISRPISNQLKKRIFEELNKMVE------------IKREEVQSILLWHIATE 220
               +     S  +++ LK  IF+ L+   +            ++ +  QSILLWHIAT+
Sbjct: 417 EMLEKHHYKSSVTVADNLKALIFQHLSDKSKEQYNCHSDLGWSVEEDFDQSILLWHIATD 476

Query: 221 ICYYSHHKNKAFNHSISLLQD----TKLLSNYLAYLLAYRQSLFSNGVGRTRFEATVIDT 280
           + YY+ H+N+  N S     D    +K++S+Y+ YLL     +  +G+G+ RF+ +  + 
Sbjct: 477 LLYYTDHQNQ--NPSSVKNPDCRTISKMVSDYMLYLLVMCPFMLPDGIGQIRFQDSCAEA 536

Query: 281 KDFLRRHVPILLGRVEHAYEYVLEDLELRVGCRKDVKELGVGSVLNYGFLLAKELQRLE- 340
           K FL     +  G  E   + +  + E+     + VK     SVL     LAK LQ L+ 
Sbjct: 537 KQFLEDKKLVGEGGTEACQKLLAVNTEVPP---QQVKGDKSKSVLFDACRLAKSLQSLKI 596

Query: 341 -ENERWEMMSHEWVELLAKVACECKFYDHIEQLGHGGDLLTHVWLLMHHIGYTK 354
            E E+WEM+   WVE+L   A +C +  H +QL  GG+LLTHVWLLM H G ++
Sbjct: 597 AEKEKWEMICDVWVEMLCYAASQCGWNQHAQQLRRGGELLTHVWLLMAHFGISE 642

BLAST of ClCG03G011940 vs. TrEMBL
Match: A0A061GVR6_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_041206 PE=4 SV=1)

HSP 1 Score: 149.8 bits (377), Expect = 6.5e-33
Identity = 114/378 (30.16%), Positives = 176/378 (46.56%), Query Frame = 1

Query: 39  SFFGALLRFITSFSILSAIIAFILIDKQSTLYKIVYFTFILLFGALGVEIYSV------- 98
           S +G  LR ++  S + A+  F +ID+Q      V  T++LL GA+ +EIY++       
Sbjct: 293 SNWGVFLRSVSLSSTIIALATFSMIDRQGFKTVDVSITYLLLIGAIFLEIYAILVLLSSE 352

Query: 99  -----------FRIGPIY----AFGWSLEMKRWCNSIPQYNLFTSCFEDINIYRHG-RSM 158
                      F +   Y     F +   M RW N + QY+L  SCF       HG +  
Sbjct: 353 WTMLWLSKQEKFPVSQTYEAISTFKFITSMNRWSNYMEQYSLIGSCFRGEPDDLHGVQRR 412

Query: 159 VWSNPFEIKTNISRPISNQLKKRIFEEL-------------NKMVEIKREEV-------- 218
            W +        S  +S  LK+ +FEEL              ++   + E+V        
Sbjct: 413 GWIHKHVHGNITSESVSPCLKEFVFEELVEKSKVASNFSISRQLCACRGEQVLGEKNCLD 472

Query: 219 ---QSILL--------WHIATEICYYSHHKNKAFNHSISLLQDTKLLSNYLAYLLAYRQS 278
               S+ +        WHIAT +CY    K    +   S  + +KL+S YL ++L  R S
Sbjct: 473 KLGWSVEVEFDHSILLWHIATSLCYCYDQKRNLSSVLDSRCKVSKLISEYLLFILVKRPS 532

Query: 279 LFSNGVGRTRFEATVIDTKDFLRRHVPILLGRVEHAYEYVLE-DLELRVGCRKDVKELGV 338
           +  NG+G+ RF+ T+ +  +F++      +     A E +L+ D  +     K  +    
Sbjct: 533 MMPNGIGQIRFQDTICEAIEFIQERK--FISNASLACEKLLQVDTRIEPAIVKGDRS--- 592

Query: 339 GSVLNYGFLLAKELQRLEE------NERWEMMSHEWVELLAKVACECKFYDHIEQLGHGG 355
            SVL     LAKEL  LE+       E+WE++SH W+E+L+  A +C++  H +QL  GG
Sbjct: 593 KSVLFDACRLAKELHSLEQERKWHTKEKWELVSHVWLEMLSYAASQCRWNQHAQQLRRGG 652

BLAST of ClCG03G011940 vs. TrEMBL
Match: M5WZX8_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa019290mg PE=4 SV=1)

HSP 1 Score: 142.5 bits (358), Expect = 1.0e-30
Identity = 120/375 (32.00%), Positives = 176/375 (46.93%), Query Frame = 1

Query: 39  SFFGALLRFITSFSILSAIIAFIL-IDKQSTLYKIVYFTFILLFG--------------- 98
           S  GA+LRF +S SI+S  +AF++ I+KQ    + +  T++LL G               
Sbjct: 298 SLNGAILRFTSSVSIISVSVAFLVTIEKQDYSARSIIITYMLLAGAIILDLYAVILFLRS 357

Query: 99  --AL---GVEIYSVFRIGPIYAFGWSLEMKRWCNSIPQYNLFTSCFEDINI-YRH----- 158
             AL        +   + P+ +    +E KRW N I QYNL   C ED    YR      
Sbjct: 358 DWALLWFCKHKTAAHLLYPVISHMSFVENKRWSNEITQYNLILICCEDKPAKYRFLQKVP 417

Query: 159 GRSMVWSNPFEIKTNISRPISNQLKKRI--------FEEL-NKMVE-------------- 218
           G         E+   +   I  QL K+         ++EL ++ VE              
Sbjct: 418 GICRKLKKSVEVPRELKELIFLQLLKKSTCAPNSAGYKELRDRRVEWVLQNENCIEKLGW 477

Query: 219 -IKREEVQSILLWHIATEICYYSHHKNKAFNHSIS--LLQDTKLLSNYLAYLLAYRQSLF 278
            I  E   S+LLWHIAT++CYYS      + ++      + +KLLS Y+ YLL  R S+ 
Sbjct: 478 SIGEEFDLSVLLWHIATQLCYYSDRDQDKYPNAFPDPNCEASKLLSEYMLYLLVKRSSML 537

Query: 279 SNGVGRTRFEATVIDTKDFLRRHVPILLGRVEHAYEYVLEDLELRVGCRKDVKELGVGSV 338
            NG+G+ RF+ T  +  +F ++         + A + + E     +    +VK     SV
Sbjct: 538 PNGIGQIRFKDTCAEATEFFKQR-KCQRSEQDRACKKLREVNSDEI-LPAEVKGDESKSV 597

Query: 339 LNYGFLLAKELQRLEENERWE------MMSHEWVELLAKVACECKFYDHIEQLGHGGDLL 355
           L     LAK+L+ LE  E WE      ++SH WVE+L+  A  C++  H  QL  GG+LL
Sbjct: 598 LFDACKLAKDLESLETKENWENQKKWQLISHVWVEMLSYAASHCRWNHHAVQLRRGGELL 657

BLAST of ClCG03G011940 vs. TAIR10
Match: AT5G45470.1 (AT5G45470.1 Protein of unknown function (DUF594))

HSP 1 Score: 113.6 bits (283), Expect = 2.6e-25
Identity = 74/197 (37.56%), Positives = 106/197 (53.81%), Query Frame = 1

Query: 175 QSILLWHIATEICYYSHHKN---KAFNHS---ISLLQDTKLLSNYLAYLLAYRQSLFSN- 234
           QS+L+WHIATE+CY  H K    + ++      S  + +K++S+Y+ YLL  +  L S  
Sbjct: 662 QSLLMWHIATELCYQQHEKETIPEGYDEQRKHYSNREFSKIISDYMMYLLILQPGLMSEV 721

Query: 235 -GVGRTRFEATVIDT-KDFLRRHVPILLGRVEHAYEYVLEDLELRVGCRKDVKELGVG-- 294
            G+G+ RF  T+ +T K F RRH+      VE A   +L+          +++ +GV   
Sbjct: 722 AGIGKIRFRDTLAETHKFFQRRHIENDRS-VETATLNILD-------VESEIEPMGVKGD 781

Query: 295 ---SVLNYGFLLAKELQRLEEN---ERWEMMSHEWVELLAKVACECKFYDHIEQLGHGGD 354
              SVL     LAK+L  +E+    ++WE++S  WVELL   AC C    H+EQL  GG+
Sbjct: 782 RSKSVLFDASRLAKDLAEMEKTHNKDKWEILSKVWVELLCYAACHCDSTAHVEQLSRGGE 841

BLAST of ClCG03G011940 vs. TAIR10
Match: AT5G45480.1 (AT5G45480.1 Protein of unknown function (DUF594))

HSP 1 Score: 110.5 bits (275), Expect = 2.2e-24
Identity = 77/192 (40.10%), Positives = 106/192 (55.21%), Query Frame = 1

Query: 175 QSILLWHIATEICYYSHHKNKAFNHSISLLQDTKLLSNYLAYLLAYRQSLFSN--GVGRT 234
           QS+L+WHIATE+ Y +    KA NHS    + +K+LS+Y+ YLL  + +L S   G+G+ 
Sbjct: 673 QSLLVWHIATELLYQTKKGTKA-NHSAR--EFSKILSDYMMYLLMMQPTLMSAVVGIGKI 732

Query: 235 RFEATVIDTKDFL-RRHVP-ILLGRVEHAYEYVLEDLELRVGCRK---DVKELGVGSVLN 294
           RF  T  + + F  RRH+  I   +   A E  +  L + V  +    DVK     SVL 
Sbjct: 733 RFRDTCEEAQRFFDRRHIMGISAKKAPDAKEASVAILSVAVPAKAEPIDVKGDRSKSVLF 792

Query: 295 YGFLLAKELQRLEEN-----ERWEMMSHEWVELLAKVACECKFYDHIEQLGHGGDLLTHV 354
            G +LAKEL+ L +N     E W++MS  WVELL+  A +C   +H  QL  GG+L++ V
Sbjct: 793 DGAMLAKELKGLRKNKEDDSEMWKIMSQVWVELLSYAATKCGAIEHAAQLSKGGELISFV 852

BLAST of ClCG03G011940 vs. TAIR10
Match: AT5G45530.1 (AT5G45530.1 Protein of unknown function (DUF594))

HSP 1 Score: 106.7 bits (265), Expect = 3.2e-23
Identity = 74/192 (38.54%), Positives = 97/192 (50.52%), Query Frame = 1

Query: 175 QSILLWHIATEICYYSHHKNKAFNHSISLLQD---TKLLSNYLAYLLAYRQSLFSN--GV 234
           QS+LLWHIATE+C+      K    S     D   +K++S+Y+ YLL  R  L S   G+
Sbjct: 595 QSLLLWHIATELCFQKEEGGKMEKLSREGYDDREFSKIISDYMMYLLIMRPKLMSEVAGI 654

Query: 235 GRTRFEATVIDTKDFLRRHVPILLGRVEHAYEYVL---EDLELRVGCRKDVKELGVGSVL 294
           G  RF  T  + + F +      L  ++ A E VL    D+E  +     VK     SVL
Sbjct: 655 GTIRFRDTKAEAERFFKGRQIKDLRDMKRASETVLLVSNDIEPIL-----VKGDRSKSVL 714

Query: 295 NYGFLLAKELQRLEENE----RWEMMSHEWVELLAKVACECKFYDHIEQLGHGGDLLTHV 354
               +LAKELQ L+E+     +W ++S  WVELL   A  CK  +H+ QL  GG+LL  V
Sbjct: 715 FDASMLAKELQNLKESSNEDGKWRVLSKVWVELLCYAASHCKATEHVAQLSRGGELLNFV 774

BLAST of ClCG03G011940 vs. TAIR10
Match: AT5G45540.1 (AT5G45540.1 Protein of unknown function (DUF594))

HSP 1 Score: 98.2 bits (243), Expect = 1.1e-20
Identity = 73/204 (35.78%), Positives = 97/204 (47.55%), Query Frame = 1

Query: 175 QSILLWHIATEICYY----------SHHKNKAFNHSISLLQDTKLLSNYLAYLLAYRQSL 234
           QSILLWHIATE+ Y             H         S  + +K+LS+Y+ YLL  + +L
Sbjct: 594 QSILLWHIATELLYQKPIDKKVTEKEEHSTNREKEEHSNREFSKILSDYMMYLLIVQPTL 653

Query: 235 FS--NGVGRTRFEATVIDTKDFL-RRHVPILLGRVEHAYEYVLEDLELRVGCR------K 294
            S  +G+ + RF  T  + KDF  RRHV            YV ++L ++  CR       
Sbjct: 654 MSAVSGIAKIRFRDTCEEAKDFFQRRHV--------DKSRYVKKNL-MKEACRAILSVNT 713

Query: 295 DVKELGV-----GSVLNYGFLLAKELQRLEENERWEMMSHEWVELLAKVACECKFYDHIE 354
           ++  + V      SVL    +LAKEL    EN  WE++S  WVELL   +  C   +H  
Sbjct: 714 EIDPMAVKGDRSKSVLFDASVLAKELMNEGEN-MWEVVSKVWVELLCYASLHCDSQEHAS 773

BLAST of ClCG03G011940 vs. TAIR10
Match: AT4G19090.1 (AT4G19090.1 Protein of unknown function (DUF594))

HSP 1 Score: 92.0 bits (227), Expect = 8.2e-19
Identity = 64/187 (34.22%), Positives = 98/187 (52.41%), Query Frame = 1

Query: 176 SILLWHIATEICYYSHHK-----NKAFNHSISLLQDTKLLSNYLAYLLAYRQSLFSN--G 235
           S+L+WHIATE+CY          +K+  H+   +  +K++S+Y+ YLL  +  L S   G
Sbjct: 567 SLLIWHIATELCYQEEDSAKENCDKSEYHTNRKI--SKIISDYMMYLLIMQPKLMSEVAG 626

Query: 236 VGRTRFEATVIDTKDFLRRHVPILLGR-VEHAYEYVLE-DLELRVGCRKDVKELGVGSVL 295
           +G+ RF  T+ +   F ++   I   R V+ A + +L  D  +     ++VK     SVL
Sbjct: 627 IGKIRFRDTLAEADRFFKKMGIIRDSRNVKLASKEILSADTSIE---PREVKGNHSKSVL 686

Query: 296 NYGFLLAKELQRLEEN---ERWEMMSHEWVELLAKVACECKFYDHIEQLGHGGDLLTHVW 351
                LAKELQR+E+N   ++W+++S  W+E L   A  C     +E L  GG+ +  VW
Sbjct: 687 FEASSLAKELQRVEKNFGEDKWKILSKVWLEFLFHAASHCDATTRMELLSKGGEFINFVW 746

BLAST of ClCG03G011940 vs. NCBI nr
Match: gi|449442759|ref|XP_004139148.1| (PREDICTED: uncharacterized protein LOC101222078 [Cucumis sativus])

HSP 1 Score: 196.8 bits (499), Expect = 6.7e-47
Identity = 126/358 (35.20%), Positives = 190/358 (53.07%), Query Frame = 1

Query: 39  SFFGALLRFITSFSILSAIIAFILIDKQSTLYKIVYFTFILLFGALGVEIYSVF------ 98
           S  G L R  T  S++ A + + LIDKQ      V   F+L  GAL +EIYS+F      
Sbjct: 263 SLCGRLFRLTTFSSLVIAFLTYCLIDKQEYPSTYVNLIFLLFSGALSIEIYSLFLFLFSD 322

Query: 99  -------------------RIGPIYAFGWSLEMKRWCNSIPQYNLFTSCFEDINIYRHGR 158
                               +  I   GWSL+ +R  NSI QYNL + C E  N   + +
Sbjct: 323 WNVIWLLTTQSPSNPLPRLALKLISLCGWSLKKRRCSNSISQYNLISHCLEQKNDSYYFK 382

Query: 159 SMVWSNPFEIKTNISRPISNQLKKRIFEELNKMVEIKRE------EV---------QSIL 218
               S       ++ RPISN L+  IF++L + + + +E      E+         QSIL
Sbjct: 383 FP--STKTIAAFSVQRPISNNLEAHIFQQLKQKLVLNQEYDYGYNEIGWSLKLDLDQSIL 442

Query: 219 LWHIATEICYYSHHKNKAFNHSISLL--QDTKLLSNYLAYLLAYRQSLFSNGVGRTRFEA 278
           +WHIAT+ CY+S  K K    S S +  QD+  LSN+LAY + +  SLF +G+ + R +A
Sbjct: 443 IWHIATDFCYHSSPKFKESEESKSCIPPQDSVSLSNFLAYFIVHHPSLFPSGMSQIRHKA 502

Query: 279 TVIDTKDFLRRHVPILLGRVEHAYEYVLEDLELRVGCRKDVKELGVGSVLNYGFLLAKEL 338
           T     + L+        +++     +L++LEL +   + VKE    S +   F LA  L
Sbjct: 503 TSEHVLELLQDE------KLDRCRSNMLKNLELNI---EVVKEERKESRVLDAFRLAGFL 562

Query: 339 QRLEENERWEMMSHEWVELLAKVACECKFYDHIEQLGHGGDLLTHVWLLMHHIGYTKQ 355
           ++LE++++WE++ + WVELL +++CEC++YDH +QL  GG L+T VW+LMHH+GY KQ
Sbjct: 563 EKLEQSQKWEIIGNVWVELLGRISCECEWYDHAKQLTQGGSLVTRVWILMHHLGYLKQ 609

BLAST of ClCG03G011940 vs. NCBI nr
Match: gi|659117652|ref|XP_008458716.1| (PREDICTED: uncharacterized protein LOC103498043 [Cucumis melo])

HSP 1 Score: 186.8 bits (473), Expect = 6.9e-44
Identity = 129/362 (35.64%), Positives = 189/362 (52.21%), Query Frame = 1

Query: 39  SFFGALLRFITSFSILSAIIAFILIDKQSTLYKIVYFTFILLFGALGVEIYSVFRI---- 98
           S  G L R  T  S+  AI+ + LIDKQ      V   F+L FGAL +EIYS+F I    
Sbjct: 263 SLCGRLFRLTTFSSLFIAILTYCLIDKQEYPSTYVNLIFLLFFGALSIEIYSLFLILFSD 322

Query: 99  ----------GP-----------IYAFGWSLEMKRWCNSIPQYNLFTSCFEDINIYRHGR 158
                      P           I   GWS + +R  NSI QYNL + C +  N      
Sbjct: 323 WNVIWLLTTQSPSNPLPRLALKLISLCGWSFKKRRCSNSISQYNLISHCLKQKN-----D 382

Query: 159 SMVWSNPFEIKT----NISRPISNQLKKRIFEELNKMVEIKRE------EV--------- 218
              +      KT    ++ RPISN L+  IF++L K + + +E      E+         
Sbjct: 383 DSYYCKFHNTKTMAAFSVQRPISNNLEAHIFQQLKKKLVLNQEYDSGYNEIGWSLKLDLD 442

Query: 219 QSILLWHIATEICYYSHHKNKA---FNHSISLLQDTKLLSNYLAYLLAYRQSLFSNGVGR 278
           QSILLWHIAT+ CYYS  K K    ++ S    QD+  LSN+LAY + +  SLF + + +
Sbjct: 443 QSILLWHIATDFCYYSSPKFKESEEYSESCIPPQDSISLSNFLAYFIVHHPSLFPSRMSQ 502

Query: 279 TRFEATVIDTKDFLRRHVPILLGRVEHAYEYVLEDLELRVGCRKDVKELGVGSVLNYGFL 338
            R +AT  D  + L+      LGR       +L++LEL++   + VKE    S++     
Sbjct: 503 IRHKATSEDVLELLQDKK---LGRCN---SNMLKNLELKI---EVVKEERKESMVLDACR 562

Query: 339 LAKELQRLEENERWEMMSHEWVELLAKVACECKFYDHIEQLGHGGDLLTHVWLLMHHIGY 354
           LA  L++LE++++WE++ + WVELL +++CE ++YDH + L  GG+L+T VW+LMHH+G 
Sbjct: 563 LAGILEKLEQSQKWEIIGNVWVELLGRISCEFEWYDHAKHLTQGGNLVTRVWILMHHLGC 610

BLAST of ClCG03G011940 vs. NCBI nr
Match: gi|1021045340|gb|KZN03121.1| (hypothetical protein DCAR_011877 [Daucus carota subsp. sativus])

HSP 1 Score: 154.5 bits (389), Expect = 3.8e-34
Identity = 116/377 (30.77%), Positives = 178/377 (47.21%), Query Frame = 1

Query: 39  SFFGALLRFITSFSILSAIIAFILIDKQSTLYKIVYFTFILLFGALGVEIYSV------- 98
           S +G  LR  +  S    +I F LID +S     +  + +LL GA+G+E+Y+V       
Sbjct: 295 SIWGVALRATSFLSTAFGLIVFCLIDWKSYKAVDLCISLVLLVGAIGLEVYAVLLLLSSD 354

Query: 99  ---------------FRIGPIYAFGWSLEMKRWCNSIPQYNLFTSCFED-INIYRHGRSM 158
                          F    I  F W    K+W NS+ QYNL +S   +   I+   ++ 
Sbjct: 355 WTLLLLSKLKNPLIDFIYKFITCFNWITSKKKWSNSMAQYNLLSSSLNNKATIWNFIKNH 414

Query: 159 VWS--------NPFEIKTNISRPISNQLKKRI-----FEELNKMVEIKREEV-------- 218
           V+         N  +I T I+  I +QL ++      F    K+   + E V        
Sbjct: 415 VFQILDDYQDVNSVDIPTEINISIFSQLVEKSKGASDFRMCKKLCGARGEYVLAKYKCAE 474

Query: 219 -----------QSILLWHIATEICYYSHHKNKAFNHSISLLQDTKLLSNYLAYLLAYRQS 278
                       SILLWH+AT++CY++       +   S  + +KLLS+Y+ YLL  R  
Sbjct: 475 KFTWSVEFEFDHSILLWHLATDLCYFTDKTENPDSVLDSNCRISKLLSDYMLYLLVKRPF 534

Query: 279 LFSNGVGRTRFEATVIDTKDFLRRHVPILLGRVEHAYEYVLEDLELRVGCRKDVKELGVG 338
           +  NG+G+ RF+ T  +  +F +    + +   E      L  +E+ +    +VK     
Sbjct: 535 MLPNGIGQIRFQDTCAEATEFFQE---LKITPDEDEACTKLLQVEIIIP-PHEVKGDRSK 594

Query: 339 SVLNYGFLLAKELQRL------EENERWEMMSHEWVELLAKVACECKFYDHIEQLGHGGD 355
           SVL     LAK LQ L      E+ ++WEMMSH W+E+L+  AC C + DH +Q+  GG+
Sbjct: 595 SVLFDACKLAKLLQSLECGELWEKQQKWEMMSHMWIEMLSYAACHCTWKDHAQQIRRGGE 654

BLAST of ClCG03G011940 vs. NCBI nr
Match: gi|1009111920|ref|XP_015865713.1| (PREDICTED: uncharacterized protein LOC107403340 [Ziziphus jujuba])

HSP 1 Score: 151.4 bits (381), Expect = 3.2e-33
Identity = 115/382 (30.10%), Positives = 174/382 (45.55%), Query Frame = 1

Query: 39  SFFGALLRFITSFSILSAIIAFILIDKQSTLYKI-VYFTFILLFGALGVEIYSV------ 98
           S  G  LR I+ FS L A + F++  K+ +  K+ ++ +++LL GA+ +E Y+V      
Sbjct: 323 SLLGVTLRCISFFSTLLAFLVFLIFPKKKSYMKVDIFISYVLLVGAITLEFYAVIIKLSS 382

Query: 99  --------------FRIGPIYAFGWSLEMKRWCNSIPQYNLFTSCFEDINIYRHGRSMVW 158
                         F    I    W+   K W N+I QYNL   C +D    R  + ++ 
Sbjct: 383 DWTMIWLSKHKNMAFLYNFISTLAWTKRNKGWSNTIAQYNLVRFCLKD----RPPKCILI 442

Query: 159 SNPFEIKTNISRP-------ISNQLKKRIFEE---------------------------- 218
                +   + R        +S +LKK IFE+                            
Sbjct: 443 QKVLFVDQLLERYRYKDSECVSAELKKLIFEQLKEKSRRATNFRACKQLCAARGDSVLDK 502

Query: 219 ---LNKMVEIKREEVQ-SILLWHIATEICYYSHHKNKAFNHSISLLQDTKLLSNYLAYLL 278
              L+K+     EE   S+LLWHIAT++CYY        +   S  + ++LLSNY+ Y+L
Sbjct: 503 ANCLDKLGWTTEEEFDHSLLLWHIATDLCYYHDLSRNMESIRSSNCKASQLLSNYMLYIL 562

Query: 279 AYRQSLFSNGVGRTRFEATVIDTKDFLRRHVPILLGRVEHAYEYVLEDLELRVGCRKDVK 338
                +  NG+G+ R   T  +  +F +    I   +   +   +L  +   V   K VK
Sbjct: 563 VMCPFMLPNGIGQIRLRDTCAEAVEFFKERKSITDAKDACS---MLMKVNTEVHPSK-VK 622

Query: 339 ELGVGSVLNYGFLLAKELQRLE------ENERWEMMSHEWVELLAKVACECKFYDHIEQL 355
                SVL    +LAK LQ LE        ++WE+MSH WVE+L   A  C++ +H +QL
Sbjct: 623 GDRSKSVLFEACVLAKNLQSLEAETNWDNGKKWELMSHVWVEMLCYAANHCRWSNHAQQL 682

BLAST of ClCG03G011940 vs. NCBI nr
Match: gi|658053530|ref|XP_008362520.1| (PREDICTED: uncharacterized protein LOC103426202 [Malus domestica])

HSP 1 Score: 150.2 bits (378), Expect = 7.2e-33
Identity = 119/384 (30.99%), Positives = 185/384 (48.18%), Query Frame = 1

Query: 42  GALLRFITSFSILSAIIAFILIDKQSTLYKI-VYFTFILLFGALGVEIYSVFRIGPIYAF 101
           G +LR IT    +   +AF+LI ++    K+ V  T+ILL GA+ +E Y+V     I + 
Sbjct: 316 GGILRCITLSFTVCVFLAFVLITEKQEYMKVDVIITYILLVGAIVLEFYAVV---VILSS 375

Query: 102 GWS------------------------LEMKRWCNSIPQYNLFTSCF-----------ED 161
            W+                        ++ KRW N++ QYNL T C            ED
Sbjct: 376 DWTRLWLSKHKNTAVDLLHRAVSSIPLIKDKRWSNTLGQYNLITFCLKERPAKCIFINED 435

Query: 162 INIYRHGRSMVWSNPFEIKTNISRPISNQLKKRI----------FEEL-----NKMVEIK 221
           + I R      +++  ++   +   I  QL +++           +EL     N+++E  
Sbjct: 436 LFINRLLEKYRYTDLKDVSKELKNLIFGQLLEKLGTASNFEASRLKELCARRGNRVLEKA 495

Query: 222 R------------EEVQSILLWHIATEICYYSH-HKNKAFNHSISLLQDTKLLSNYLAYL 281
           +            E  QSI+LWHIAT++CYYS  ++N  F  S++  + +++LSNY+ YL
Sbjct: 496 KCLDELGWTINGAEFDQSIILWHIATDLCYYSDLNRNPEFYQSLNF-EASRVLSNYMLYL 555

Query: 282 LAYRQSLFSNGVGRTRFEATVIDTKDFLRRHVPILLGRVEHAYEYVLEDLELRVGCRKDV 341
           L     +  NG+G+ RF  T  + K+F+     I     E     +L+     +  +  V
Sbjct: 556 LVMCPFMLPNGIGQIRFRDTCAEAKEFIAERKSIT--DAEKGCTMLLKVSTDILPSK--V 615

Query: 342 KELGVGSVLNYGFLLAKELQRLEEN-------ERWEMMSHEWVELLAKVACECKFYDHIE 355
           K     SVL     LAK LQ LE         ++WE +SH WVE+LA  A +C++ DH +
Sbjct: 616 KGDRSKSVLFDACRLAKALQSLESKGHWENNEKKWEFISHVWVEMLAFAANQCRWSDHAQ 675

The following BLAST results are available for this feature:
Match Name          E-value   Identity (%)   Description
A0A0A0LZZ2_CUCSA    4.6e-47   35.20          Uncharacterized protein OS=Cucumis sativus GN=Csa_1G638510 PE=4 SV=1
A0A166C0F7_DAUCA    2.6e-34   30.77          Uncharacterized protein OS=Daucus carota subsp. sativus GN=DCAR_011877 PE=4 SV=1
F6HYV5_VITVI        2.2e-33   31.92          Putative uncharacterized protein OS=Vitis vinifera GN=VIT_05s0102g00970 PE=4 SV=...
A0A061GVR6_THECC    6.5e-33   30.16          Uncharacterized protein OS=Theobroma cacao GN=TCM_041206 PE=4 SV=1
M5WZX8_PRUPE        1.0e-30   32.00          Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa019290mg PE=4 SV=1

Match Name     E-value   Identity (%)   Description
AT5G45470.1    2.6e-25   37.56          Protein of unknown function (DUF594)
AT5G45480.1    2.2e-24   40.10          Protein of unknown function (DUF594)
AT5G45530.1    3.2e-23   38.54          Protein of unknown function (DUF594)
AT5G45540.1    1.1e-20   35.78          Protein of unknown function (DUF594)
AT4G19090.1    8.2e-19   34.22          Protein of unknown function (DUF594)

Match Name                           E-value   Identity (%)   Description
gi|449442759|ref|XP_004139148.1|     6.7e-47   35.20          PREDICTED: uncharacterized protein LOC101222078 [Cucumis sativus]
gi|659117652|ref|XP_008458716.1|     6.9e-44   35.64          PREDICTED: uncharacterized protein LOC103498043 [Cucumis melo]
gi|1021045340|gb|KZN03121.1|         3.8e-34   30.77          hypothetical protein DCAR_011877 [Daucus carota subsp. sativus]
gi|1009111920|ref|XP_015865713.1|    3.2e-33   30.10          PREDICTED: uncharacterized protein LOC107403340 [Ziziphus jujuba]
gi|658053530|ref|XP_008362520.1|     7.2e-33   30.99          PREDICTED: uncharacterized protein LOC103426202 [Malus domestica]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term         Definition
IPR007658    DUF594
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
ClCG03G011940.1    ClCG03G011940.1    mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term     IPR Description                       Source     Source Term    Source Description    Alignment
IPR007658    Protein of unknown function DUF594    PFAM       PF04578        DUF594                coord: 299..350; score: 5.3
None         No IPR available                      PANTHER    PTHR31325      FAMILY NOT NAMED      coord: 39..355; score: 4.5

The following gene(s) are orthologous to this gene:
Gene             Orthologue         Organism                 Block
ClCG03G011940    Cla001776          Watermelon (97103) v1    wcgwmB230
ClCG03G011940    Cla97C03G062140    Watermelon (97103) v2    wcgwmbB186
The following gene(s) are paralogous to this gene:

None