Cla007983 (gene) Watermelon (97103) v1

NameCla007983
Typegene
OrganismCitrullus. lanatus (Watermelon (97103) v1)
DescriptionSnoaL-like polyketide cyclase (AHRD V1 ***- F4XSK7_9CYAN)
LocationChr8 : 5664412 .. 5666837 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGACGGCCATGATAGCATCTCAAGCTTTACTCTGCAACCATTCTTTTCCAGTGATTAAACCATATGCTCTAACATCCTCCCTCTCTCCTTTCCGGTTGTGCAGACCACAACAACCAATTCTCTCTCCTTCGATCTCCACCACCACTCCCTTCTCACCCTCACCCACTCGCCGGAATTTTTCTCTCACCCGCCGCTCCGCTCGCCGGAAACCCACATTGGTCGATTCTGCGGGATTCTCCGACGAGGGTGATTCCGATGTCCGACGGGTGCTTCAAATTCTGCTCTGGGCTGCTGAGGCTGTTTACATTTTGTGGCTATTTCTCCTTCCCTATGCCCCGGTTTGTTTCAATCGCTATCTCATTTTTAATTAAGCTTAATTCTGTATTGGGAATTGAGGTTTTACTTTGTTAACCTCTTGTTAACAAACAAACACTCAGATTGATGTAGTAGATAACAGTTTCTGTGGATTAGAAAGTTAATTTGTCCTGGTTTTTAATAATTACAGGGAGATCCTGTATGGGCAATCAGTTCAGAAACTGTGAATTCTCTTGTGGGTCTTTCTCTGAATTTCTTCCTTATATTGCCTGCTATGAACTCTGGTACGCTTGATTTGTTGTTTCCTTCATTTTATAACTACTTCTTATTGTTATTACTTCGCTGTCAATCAATTCCCAAAAAAGGAAAAAGAAAAAAAAAAAAAAATCGGCTTCTGTTTTGTAAAACTAATTTAGTATTTGGAATTGTCTCTCCCTAACATTCTTTTTCTTTTCTGAAAACTAAAATGAAGAAAAAGAATCCCTCCTCTAGCATTGAATGGGGCTGTGTGAGGATCCCTCCTCTATGGGGTTGATAATAATTCGATAACTTAAATTCTTGTTGTCTCTAGGAGCTGATCTAAGGTTTTCAAACAGCCACAATGTTTCTTTTGCTTTGGACTTTGATCAAATTCTAGTTGATGCAACAATGTTTTAGTGATGCACCTTGGAGTTCGAATGTTTCTTAATGCATATATATGGAAAGCTAAGATTTATGAAGCGTTATCAGAATCTGCACGGCCAATTCAAGCAAGGATTTTTTTCATCTATCCTATAACTTATGATGAATAAGGACCTGATTCTAAAATGGATGGTTTTCTACATTTTTGAGACCCAGTTGTTTTAAGGAAGCATCACCAATTCTAATTATACTAATATCTCAGGCTGGCAAAACCTGTTTACACTTGTAGAATATTTGCCTCAATATAGTTCTTTTTTCTAGTAAGGAGATGAGAGATTAATCCTCCGACCTTAAAGGAAGGATTGCATGTCGATTAACTCTTAGCTAAACTCACTTTGGCCTCAATATAGTTCTTATTTGGGAATTCACTTTTCAATCTTGATTCCTTTGTGATTTTGGTGATGAACTTTTTGGCAGGACAGTGGGCATTCGCCTGATTGATGCTCCTGTTCTTCACCCAGTAAGAATTATTTTCAAATCGTTGACCTCAGTTTTGCATTTTCATTCACTCTATGACGCCATTGTCATGATGGACTTTTCTTTCTCATTAGATGTCTGAGGGATTGTTCAATTTTGTCATTGCATGGACGCTCATGTTTGCTCCCCTTCTGTTCACGGATCGAAAGAGGGACCGGTACAGTGGTTCGTTGGACTTATTGTGGGGCTTGCAGATGTTCCTTACGAACAGTAAGCTTTATCCCTACTTGTTTTTTTGATTTGTTGCAATCATGATCGAGCCAACTTTAGTGCACATATATTCCCCTTTATAGTCGTTCTTCATATCTTAAAGGAATATCTTCTATGTGAATTGAACAACTGTAGTACCTGATAATCATAGGATTTTTTATTTAAGATACGATGATCTTTGTTGCGGTTACATATGATCTATTTTGTTAATTTTGGTTTTAATTGTTTGCCTTCGATTTCTCCATATGGGTTTGGGGTGAGATGCTAAGTTACAGTTCAGATAACTTCTTTAACGAACGACTTTTCATGATGTCATGCACCAGCCTTTTTGATACCTTATATGGCCATCCGGCTTAATAAGGCTAGCAAAGACTCCGCCCCACAACCGCAGTCGAAGCTGGGCTCTTTGATGACCAATAGAGCTCCTGTCGTTGGCCTGATCGGGGGTGCAGCATGCATCATTTCAATAATCTGGTCTTTCGTTGGTCGAGCAGACGGTAACTTTGGAGGTGTAACAGAGAGATGGGAGTTCTTGATCCAATATCTATCTACAGAGAGGTTAGCTTATGCATTTATTTGGGATATTTGCCTTTACACTGTGTTCCAGCCTTGGTTGATTGGAGAAAACCTTCAAAATGTTAAGGAGAGCAAGGTTGGACTCGTAAGTTCTCTTAGATTTGTCCCTGTTGTTGGCTTAATTGTCTATCTTCTTTTTCTGAAACTTGATGAGGAATTATAA

mRNA sequence

ATGGCGACGGCCATGATAGCATCTCAAGCTTTACTCTGCAACCATTCTTTTCCAGTGATTAAACCATATGCTCTAACATCCTCCCTCTCTCCTTTCCGGTTGTGCAGACCACAACAACCAATTCTCTCTCCTTCGATCTCCACCACCACTCCCTTCTCACCCTCACCCACTCGCCGGAATTTTTCTCTCACCCGCCGCTCCGCTCGCCGGAAACCCACATTGGTCGATTCTGCGGGATTCTCCGACGAGGGTGATTCCGATGTCCGACGGGTGCTTCAAATTCTGCTCTGGGCTGCTGAGGCTGTTTACATTTTGTGGCTATTTCTCCTTCCCTATGCCCCGGGAGATCCTGTATGGGCAATCAGTTCAGAAACTGTGAATTCTCTTGTGGGTCTTTCTCTGAATTTCTTCCTTATATTGCCTGCTATGAACTCTGTGGGCATTCGCCTGATTGATGCTCCTGTTCTTCACCCAATGTCTGAGGGATTGTTCAATTTTGTCATTGCATGGACGCTCATGTTTGCTCCCCTTCTGTTCACGGATCGAAAGAGGGACCGGTACAGTGGTTCGTTGGACTTATTGTGGGGCTTGCAGATGTTCCTTACGAACACCTTTTTGATACCTTATATGGCCATCCGGCTTAATAAGGCTAGCAAAGACTCCGCCCCACAACCGCAGTCGAAGCTGGGCTCTTTGATGACCAATAGAGCTCCTGTCGTTGGCCTGATCGGGGGTGCAGCATGCATCATTTCAATAATCTGGTCTTTCGTTGGTCGAGCAGACGGTAACTTTGGAGGTGTAACAGAGAGATGGGAGTTCTTGATCCAATATCTATCTACAGAGAGGTTAGCTTATGCATTTATTTGGGATATTTGCCTTTACACTGTGTTCCAGCCTTGGTTGATTGGAGAAAACCTTCAAAATGTTAAGGAGAGCAAGGTTGGACTCGTAAGTTCTCTTAGATTTGTCCCTGTTGTTGGCTTAATTGTCTATCTTCTTTTTCTGAAACTTGATGAGGAATTATAA

Coding sequence (CDS)

ATGGCGACGGCCATGATAGCATCTCAAGCTTTACTCTGCAACCATTCTTTTCCAGTGATTAAACCATATGCTCTAACATCCTCCCTCTCTCCTTTCCGGTTGTGCAGACCACAACAACCAATTCTCTCTCCTTCGATCTCCACCACCACTCCCTTCTCACCCTCACCCACTCGCCGGAATTTTTCTCTCACCCGCCGCTCCGCTCGCCGGAAACCCACATTGGTCGATTCTGCGGGATTCTCCGACGAGGGTGATTCCGATGTCCGACGGGTGCTTCAAATTCTGCTCTGGGCTGCTGAGGCTGTTTACATTTTGTGGCTATTTCTCCTTCCCTATGCCCCGGGAGATCCTGTATGGGCAATCAGTTCAGAAACTGTGAATTCTCTTGTGGGTCTTTCTCTGAATTTCTTCCTTATATTGCCTGCTATGAACTCTGTGGGCATTCGCCTGATTGATGCTCCTGTTCTTCACCCAATGTCTGAGGGATTGTTCAATTTTGTCATTGCATGGACGCTCATGTTTGCTCCCCTTCTGTTCACGGATCGAAAGAGGGACCGGTACAGTGGTTCGTTGGACTTATTGTGGGGCTTGCAGATGTTCCTTACGAACACCTTTTTGATACCTTATATGGCCATCCGGCTTAATAAGGCTAGCAAAGACTCCGCCCCACAACCGCAGTCGAAGCTGGGCTCTTTGATGACCAATAGAGCTCCTGTCGTTGGCCTGATCGGGGGTGCAGCATGCATCATTTCAATAATCTGGTCTTTCGTTGGTCGAGCAGACGGTAACTTTGGAGGTGTAACAGAGAGATGGGAGTTCTTGATCCAATATCTATCTACAGAGAGGTTAGCTTATGCATTTATTTGGGATATTTGCCTTTACACTGTGTTCCAGCCTTGGTTGATTGGAGAAAACCTTCAAAATGTTAAGGAGAGCAAGGTTGGACTCGTAAGTTCTCTTAGATTTGTCCCTGTTGTTGGCTTAATTGTCTATCTTCTTTTTCTGAAACTTGATGAGGAATTATAA

Protein sequence

MATAMIASQALLCNHSFPVIKPYALTSSLSPFRLCRPQQPILSPSISTTTPFSPSPTRRNFSLTRRSARRKPTLVDSAGFSDEGDSDVRRVLQILLWAAEAVYILWLFLLPYAPGDPVWAISSETVNSLVGLSLNFFLILPAMNSVGIRLIDAPVLHPMSEGLFNFVIAWTLMFAPLLFTDRKRDRYSGSLDLLWGLQMFLTNTFLIPYMAIRLNKASKDSAPQPQSKLGSLMTNRAPVVGLIGGAACIISIIWSFVGRADGNFGGVTERWEFLIQYLSTERLAYAFIWDICLYTVFQPWLIGENLQNVKESKVGLVSSLRFVPVVGLIVYLLFLKLDEEL
BLAST of Cla007983 vs. TrEMBL
Match: A0A0A0KC53_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G031960 PE=4 SV=1)

HSP 1 Score: 609.8 bits (1571), Expect = 2.1e-171
Identity = 301/341 (88.27%), Postives = 319/341 (93.55%), Query Frame = 1

Query: 1   MATAMIASQALLCNHSFPVIKPYALTSSLSPFRLCRPQQPILSPSISTTTPFSPSPTRRN 60
           MA AMIASQ LL NHSFPVIKPY LTS LSPFR  RPQQP+LSP ISTTTPFS SP RRN
Sbjct: 1   MAKAMIASQPLLSNHSFPVIKPYTLTSPLSPFRFFRPQQPLLSPLISTTTPFSLSPNRRN 60

Query: 61  FSLTRRSARRKPTLVDSAGFSDEGDSDVRRVLQILLWAAEAVYILWLFLLPYAPGDPVWA 120
            SLTR +ARRKPT VDS  FS++GDS+VRR+LQ+LLW AEAVYILWLFLLPYAPGDPVWA
Sbjct: 61  CSLTRPAARRKPTFVDSTDFSNDGDSNVRRLLQVLLWGAEAVYILWLFLLPYAPGDPVWA 120

Query: 121 ISSETVNSLVGLSLNFFLILPAMNSVGIRLIDAPVLHPMSEGLFNFVIAWTLMFAPLLFT 180
           ISSETVNSL+GLSLNFF +LPAMNSVGIRLIDAPVLHPMSEGLFNFVIAWTLMFAPLLFT
Sbjct: 121 ISSETVNSLLGLSLNFFFVLPAMNSVGIRLIDAPVLHPMSEGLFNFVIAWTLMFAPLLFT 180

Query: 181 DRKRDRYSGSLDLLWGLQMFLTNTFLIPYMAIRLNKASKDSAPQPQSKLGSLMTNRAPVV 240
           DRKRD+YSGSLDLLWG QMFLTNTFLIPYMAIRLN+AS+DSAPQPQSKLG+LMTN APVV
Sbjct: 181 DRKRDQYSGSLDLLWGFQMFLTNTFLIPYMAIRLNEASEDSAPQPQSKLGTLMTNGAPVV 240

Query: 241 GLIGGAACIISIIWSFVGRADGNFGGVTERWEFLIQYLSTERLAYAFIWDICLYTVFQPW 300
           G+IGGA CIISIIWSFVGRADGNFGGV ERWEFLIQYLS+ERLAYAFIWDICLY+VFQPW
Sbjct: 241 GVIGGAMCIISIIWSFVGRADGNFGGVAERWEFLIQYLSSERLAYAFIWDICLYSVFQPW 300

Query: 301 LIGENLQNVKESKVGLVSSLRFVPVVGLIVYLLFLKLDEEL 342
           LIGENLQNVKESK+G+VSSLRFVPVVGLI YLLFLKLDEEL
Sbjct: 301 LIGENLQNVKESKIGVVSSLRFVPVVGLIAYLLFLKLDEEL 341

BLAST of Cla007983 vs. TrEMBL
Match: A0A0D2VCB1_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_013G081400 PE=4 SV=1)

HSP 1 Score: 443.4 bits (1139), Expect = 2.6e-121
Identity = 223/343 (65.01%), Postives = 260/343 (75.80%), Query Frame = 1

Query: 1   MATAMIASQALLCN-HSFPVIKPYALTSSLSPFRLCRPQQPILSPSI--STTTPFSPSPT 60
           MA A++A+QALLC  H   + +P       +PF    P QP   P      T P +    
Sbjct: 1   MAMAVLATQALLCQTHCRALPRP-------TPFSSNPPPQPSRYPCKHHGETWPTTSFSI 60

Query: 61  RRNFSLTRRSARRKPTLVDSAGFSDEGDSDVRRVLQILLWAAEAVYILWLFLLPYAPGDP 120
             +  +  R++RRK T + S+    + D  +RRVL + LWAAEAVYILWLFLLPYAPGDP
Sbjct: 61  LHSTPIVCRASRRKSTALSSSSEESDQDGPLRRVLHLSLWAAEAVYILWLFLLPYAPGDP 120

Query: 121 VWAISSETVNSLVGLSLNFFLILPAMNSVGIRLIDAPVLHPMSEGLFNFVIAWTLMFAPL 180
           VWAISS T+N L+GLSLNFF ILP  N+VGIRLIDAPVLHPMSEGLFNFVI WTLMFAPL
Sbjct: 121 VWAISSNTINELIGLSLNFFFILPLTNAVGIRLIDAPVLHPMSEGLFNFVIGWTLMFAPL 180

Query: 181 LFTDRKRDRYSGSLDLLWGLQMFLTNTFLIPYMAIRLNKASKDSAPQPQSKLGSLMTNRA 240
           LFTDRKRDRY  SLD+LWGLQMFLTNTFLIPYMAIRLN+A  DS P   S LGS+MTN A
Sbjct: 181 LFTDRKRDRYKSSLDVLWGLQMFLTNTFLIPYMAIRLNEADADSRPTKLSPLGSVMTNGA 240

Query: 241 PVVGLIGGAACIISIIWSFVGRADGNFGGVTERWEFLIQYLSTERLAYAFIWDICLYTVF 300
            VVGL GGA C+ S IW+  GR DG FG +T+RW+FL+ YL +ERLAYAFIWDICLYT+F
Sbjct: 241 AVVGLTGGAVCVFSAIWALYGRMDGEFGNITDRWQFLVSYLGSERLAYAFIWDICLYTIF 300

Query: 301 QPWLIGENLQNVKESKVGLVSSLRFVPVVGLIVYLLFLKLDEE 341
           QPWLIGENLQNV++SKVG+VS LRF+PVVGL+ YLLFL L+E+
Sbjct: 301 QPWLIGENLQNVEKSKVGVVSYLRFIPVVGLVAYLLFLNLEED 336

BLAST of Cla007983 vs. TrEMBL
Match: V4SSX2_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10031676mg PE=4 SV=1)

HSP 1 Score: 442.2 bits (1136), Expect = 5.8e-121
Identity = 226/344 (65.70%), Postives = 269/344 (78.20%), Query Frame = 1

Query: 2   ATAMIASQALLCNHSFPVIKPYALTSSLSPFRLCRPQQPILSPSISTTTPFSPSPTRRNF 61
           A A + + ++      P  K +++ S+ S F    P++   + +IS+ +   P   +R  
Sbjct: 70  ADAKMITMSVAAATPLPCFKNHSILSTKS-FNSKPPRRHFDNATISSQS-LPPVQKQRRV 129

Query: 62  SLTR----RSARRKPTLV-DSAGFSDEGDSDVRRVLQILLWAAEAVYILWLFLLPYAPGD 121
            LTR     +ARRKPT+  D++  S EG+ +VRRVLQI+LWAAEAVYILWLFLLPYAPGD
Sbjct: 130 HLTRTVVCHAARRKPTVAADASKASAEGNDNVRRVLQIVLWAAEAVYILWLFLLPYAPGD 189

Query: 122 PVWAISSETVNSLVGLSLNFFLILPAMNSVGIRLIDAPVLHPMSEGLFNFVIAWTLMFAP 181
           PVWAISSETVNSLVGLSLNFF +LP MNSVGIRLIDAPVLHPMSEGLFNFVI WT MFAP
Sbjct: 190 PVWAISSETVNSLVGLSLNFFFVLPLMNSVGIRLIDAPVLHPMSEGLFNFVIGWTFMFAP 249

Query: 182 LLFTDRKRDRYSGSLDLLWGLQMFLTNTFLIPYMAIRLNKASKDSAPQPQSKLGSLMTNR 241
           LLFTD KRDRY GSLD+LWG QMFLTNTFLIPYMAIRLN+A  +  P+ +S+L S+MTN 
Sbjct: 250 LLFTDCKRDRYKGSLDVLWGFQMFLTNTFLIPYMAIRLNEACSEDTPRDRSQLASVMTNG 309

Query: 242 APVVGLIGGAACIISIIWSFVGRADGNFGGVTERWEFLIQYLSTERLAYAFIWDICLYTV 301
           AP+VGLIGGA C++S +W+  GR DG+FGG+TERWEFL+ YL +ERLAYAFIWDI LY +
Sbjct: 310 APIVGLIGGAICLLSTLWALYGRMDGDFGGITERWEFLVSYLGSERLAYAFIWDIFLYII 369

Query: 302 FQPWLIGENLQNVKESKVGLVSSLRFVPVVGLIVYLLFLKLDEE 341
           FQ WLIG+NLQNV+ SKVG V+ LRFVPVVGL  YLLFL LDEE
Sbjct: 370 FQAWLIGDNLQNVQLSKVGTVNYLRFVPVVGLTAYLLFLNLDEE 411

BLAST of Cla007983 vs. TrEMBL
Match: A0A061EAT1_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_011834 PE=4 SV=1)

HSP 1 Score: 441.8 bits (1135), Expect = 7.5e-121
Identity = 227/345 (65.80%), Postives = 269/345 (77.97%), Query Frame = 1

Query: 4   AMIASQALLCNHS---FPVIKPYALTS---SLSPFRLCRPQQPILS-PSISTTTPFSPSP 63
           A+IA+QALLCN      P  +P +  S   SLS   L   + P  + P+IS + P +P+ 
Sbjct: 2   ALIATQALLCNTHGTFLPGPRPRSFDSDPSSLSRHHLYYCKHPGKTWPAISFSIPTNPTS 61

Query: 64  TRRNFSLTRRSARRKPTLVDSAGFSDEGDSD-VRRVLQILLWAAEAVYILWLFLLPYAPG 123
                 + R S RRK T V  A  S+EGD D +RRV Q+ LW AEAVYI WLFLLPYAPG
Sbjct: 62  QHSTAPVCRES-RRKSTAVSPA--SEEGDGDSLRRVFQVALWTAEAVYISWLFLLPYAPG 121

Query: 124 DPVWAISSETVNSLVGLSLNFFLILPAMNSVGIRLIDAPVLHPMSEGLFNFVIAWTLMFA 183
           DPVWAISSET+N+L+GLSLNF  ILP  N+VGIRLIDAPVLHPMSEGLFNFVI WTLMFA
Sbjct: 122 DPVWAISSETINALIGLSLNFLFILPLTNAVGIRLIDAPVLHPMSEGLFNFVIGWTLMFA 181

Query: 184 PLLFTDRKRDRYSGSLDLLWGLQMFLTNTFLIPYMAIRLNKASKDSAPQPQSKLGSLMTN 243
           PLL+TD KRDRY GSLD+LWGLQMFLTNTFLIPYMAIRLN+A  D  P  +S LGS+MTN
Sbjct: 182 PLLYTDCKRDRYKGSLDVLWGLQMFLTNTFLIPYMAIRLNEADADGPPSKRSPLGSVMTN 241

Query: 244 RAPVVGLIGGAACIISIIWSFVGRADGNFGGVTERWEFLIQYLSTERLAYAFIWDICLYT 303
            APVVGLIGGA C++S IW+ +GR DG+FG +T+RW+FLI YL +ERLAYAFIWDIC Y 
Sbjct: 242 GAPVVGLIGGAVCLLSAIWALIGRMDGDFGSITDRWQFLISYLGSERLAYAFIWDICFYI 301

Query: 304 VFQPWLIGENLQNVKESKVGLVSSLRFVPVVGLIVYLLFLKLDEE 341
           +FQPWLIGENLQNV++S+V LV+ L+F+PVVGL+ YLLFL+L+EE
Sbjct: 302 IFQPWLIGENLQNVQKSRVPLVNYLKFIPVVGLVAYLLFLELEEE 343

BLAST of Cla007983 vs. TrEMBL
Match: A0A067KC29_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_18310 PE=4 SV=1)

HSP 1 Score: 437.6 bits (1124), Expect = 1.4e-119
Identity = 221/355 (62.25%), Postives = 267/355 (75.21%), Query Frame = 1

Query: 5   MIASQALLCNHSFPVIKPYALTSSLSPFRLCRPQQPILSPSI---------STTTPFSPS 64
           M A+Q L CN+ F     Y+L ++  P++          PSI            + FS +
Sbjct: 1   MAATQVLTCNNGFC---SYSLVAATIPYKS--------KPSIFHHYQLHKNKEASAFSCN 60

Query: 65  PTRRNFS--------LTRRSARRKPTLVDSAGFSDEGDSD-VRRVLQILLWAAEAVYILW 124
              RN             ++ARRKP+++++A  SD+GD D  R+VLQI+LWA E VYILW
Sbjct: 61  AWPRNQKNRLKLETLFVCQAARRKPSILNAAVSSDKGDGDNARKVLQIILWALEGVYILW 120

Query: 125 LFLLPYAPGDPVWAISSETVNSLVGLSLNFFLILPAMNSVGIRLIDAPVLHPMSEGLFNF 184
           LFLLPYAPGDPVWAIS +T+NSL+GLSLNFF ILP MNSVGI LIDAPVLHPMSEGLFNF
Sbjct: 121 LFLLPYAPGDPVWAISKDTINSLIGLSLNFFFILPFMNSVGISLIDAPVLHPMSEGLFNF 180

Query: 185 VIAWTLMFAPLLFTDRKRDRYSGSLDLLWGLQMFLTNTFLIPYMAIRLNKASKDSAPQPQ 244
           VI WT MFAPLLF+D +RDRY GSLD+LWGLQMFLTNTFLIPYMAIRLN+A  +S PQ  
Sbjct: 181 VIGWTFMFAPLLFSDCRRDRYKGSLDILWGLQMFLTNTFLIPYMAIRLNEADSESTPQKL 240

Query: 245 SKLGSLMTNRAPVVGLIGGAACIISIIWSFVGRADGNFGGVTERWEFLIQYLSTERLAYA 304
           S+LG++MTN AP+VGLIGG AC+IS +W+  GR DGNFG +T+RWEFL+ YL +ERLAYA
Sbjct: 241 SQLGTVMTNGAPIVGLIGGFACLISALWALYGRMDGNFGSITDRWEFLVSYLGSERLAYA 300

Query: 305 FIWDICLYTVFQPWLIGENLQNVKESKVGLVSSLRFVPVVGLIVYLLFLKLDEEL 342
           FIWDICLY +FQPWLIG+NLQNV++S++ +V  LRFVPVVGL+ YLL L LDEEL
Sbjct: 301 FIWDICLYIIFQPWLIGDNLQNVQKSRIDIVKYLRFVPVVGLVAYLLCLNLDEEL 344

BLAST of Cla007983 vs. NCBI nr
Match: gi|449465643|ref|XP_004150537.1| (PREDICTED: uncharacterized protein LOC101210554 isoform X1 [Cucumis sativus])

HSP 1 Score: 609.8 bits (1571), Expect = 3.0e-171
Identity = 301/341 (88.27%), Postives = 319/341 (93.55%), Query Frame = 1

Query: 1   MATAMIASQALLCNHSFPVIKPYALTSSLSPFRLCRPQQPILSPSISTTTPFSPSPTRRN 60
           MA AMIASQ LL NHSFPVIKPY LTS LSPFR  RPQQP+LSP ISTTTPFS SP RRN
Sbjct: 1   MAKAMIASQPLLSNHSFPVIKPYTLTSPLSPFRFFRPQQPLLSPLISTTTPFSLSPNRRN 60

Query: 61  FSLTRRSARRKPTLVDSAGFSDEGDSDVRRVLQILLWAAEAVYILWLFLLPYAPGDPVWA 120
            SLTR +ARRKPT VDS  FS++GDS+VRR+LQ+LLW AEAVYILWLFLLPYAPGDPVWA
Sbjct: 61  CSLTRPAARRKPTFVDSTDFSNDGDSNVRRLLQVLLWGAEAVYILWLFLLPYAPGDPVWA 120

Query: 121 ISSETVNSLVGLSLNFFLILPAMNSVGIRLIDAPVLHPMSEGLFNFVIAWTLMFAPLLFT 180
           ISSETVNSL+GLSLNFF +LPAMNSVGIRLIDAPVLHPMSEGLFNFVIAWTLMFAPLLFT
Sbjct: 121 ISSETVNSLLGLSLNFFFVLPAMNSVGIRLIDAPVLHPMSEGLFNFVIAWTLMFAPLLFT 180

Query: 181 DRKRDRYSGSLDLLWGLQMFLTNTFLIPYMAIRLNKASKDSAPQPQSKLGSLMTNRAPVV 240
           DRKRD+YSGSLDLLWG QMFLTNTFLIPYMAIRLN+AS+DSAPQPQSKLG+LMTN APVV
Sbjct: 181 DRKRDQYSGSLDLLWGFQMFLTNTFLIPYMAIRLNEASEDSAPQPQSKLGTLMTNGAPVV 240

Query: 241 GLIGGAACIISIIWSFVGRADGNFGGVTERWEFLIQYLSTERLAYAFIWDICLYTVFQPW 300
           G+IGGA CIISIIWSFVGRADGNFGGV ERWEFLIQYLS+ERLAYAFIWDICLY+VFQPW
Sbjct: 241 GVIGGAMCIISIIWSFVGRADGNFGGVAERWEFLIQYLSSERLAYAFIWDICLYSVFQPW 300

Query: 301 LIGENLQNVKESKVGLVSSLRFVPVVGLIVYLLFLKLDEEL 342
           LIGENLQNVKESK+G+VSSLRFVPVVGLI YLLFLKLDEEL
Sbjct: 301 LIGENLQNVKESKIGVVSSLRFVPVVGLIAYLLFLKLDEEL 341

BLAST of Cla007983 vs. NCBI nr
Match: gi|659089686|ref|XP_008445644.1| (PREDICTED: uncharacterized protein LOC103488603 [Cucumis melo])

HSP 1 Score: 497.3 bits (1279), Expect = 2.2e-137
Identity = 249/289 (86.16%), Postives = 264/289 (91.35%), Query Frame = 1

Query: 1   MATAMIASQALLCNHSFPVIKPYALTSSLSPFRLCRPQQPILSPSISTTTPFSPSPTRRN 60
           MA AMIASQ LLCN SFPVIKP+ALTS LSPFR  RPQQP+LSP IST TPFS SP RRN
Sbjct: 1   MAKAMIASQPLLCNLSFPVIKPFALTSPLSPFRFFRPQQPLLSPLISTPTPFSLSPNRRN 60

Query: 61  FSLTRRSARRKPTLVDSAGFSDEGDSDVRRVLQILLWAAEAVYILWLFLLPYAPGDPVWA 120
            SLTR +AR+KPT VDSA FS+EG+SDVRR+LQILLW AEAVYILWLFLLPYAPGDPVWA
Sbjct: 61  CSLTRPAARKKPTFVDSADFSNEGNSDVRRLLQILLWGAEAVYILWLFLLPYAPGDPVWA 120

Query: 121 ISSETVNSLVGLSLNFFLILPAMNSVGIRLIDAPVLHPMSEGLFNFVIAWTLMFAPLLFT 180
           ISSETVNSLVGLSLNFF +LPAMNSVGIR+IDAPVLHPMSEGLFNFVIAWTLMFAPLLFT
Sbjct: 121 ISSETVNSLVGLSLNFFFVLPAMNSVGIRVIDAPVLHPMSEGLFNFVIAWTLMFAPLLFT 180

Query: 181 DRKRDRYSGSLDLLWGLQMFLTNTFLIPYMAIRLNKASKDSAPQPQSKLGSLMTNRAPVV 240
           DRKRD+YSGSLDLLWGLQMFLTNTFLIPYMAIRLN+AS+ SAPQPQSKLG+LMTN APVV
Sbjct: 181 DRKRDQYSGSLDLLWGLQMFLTNTFLIPYMAIRLNEASEYSAPQPQSKLGTLMTNGAPVV 240

Query: 241 GLIGGAACIISIIWSFVGRADGNFGGVTERWEFLIQYLSTERLAYAFIW 290
           G+IGGA CIISIIWSFVGRADGNFGGV ERWEFLIQYLS+ERL     W
Sbjct: 241 GVIGGAICIISIIWSFVGRADGNFGGVAERWEFLIQYLSSERLKTLTNW 289

BLAST of Cla007983 vs. NCBI nr
Match: gi|823261890|ref|XP_012463682.1| (PREDICTED: uncharacterized protein LOC105783049 [Gossypium raimondii])

HSP 1 Score: 443.4 bits (1139), Expect = 3.7e-121
Identity = 223/343 (65.01%), Postives = 260/343 (75.80%), Query Frame = 1

Query: 1   MATAMIASQALLCN-HSFPVIKPYALTSSLSPFRLCRPQQPILSPSI--STTTPFSPSPT 60
           MA A++A+QALLC  H   + +P       +PF    P QP   P      T P +    
Sbjct: 1   MAMAVLATQALLCQTHCRALPRP-------TPFSSNPPPQPSRYPCKHHGETWPTTSFSI 60

Query: 61  RRNFSLTRRSARRKPTLVDSAGFSDEGDSDVRRVLQILLWAAEAVYILWLFLLPYAPGDP 120
             +  +  R++RRK T + S+    + D  +RRVL + LWAAEAVYILWLFLLPYAPGDP
Sbjct: 61  LHSTPIVCRASRRKSTALSSSSEESDQDGPLRRVLHLSLWAAEAVYILWLFLLPYAPGDP 120

Query: 121 VWAISSETVNSLVGLSLNFFLILPAMNSVGIRLIDAPVLHPMSEGLFNFVIAWTLMFAPL 180
           VWAISS T+N L+GLSLNFF ILP  N+VGIRLIDAPVLHPMSEGLFNFVI WTLMFAPL
Sbjct: 121 VWAISSNTINELIGLSLNFFFILPLTNAVGIRLIDAPVLHPMSEGLFNFVIGWTLMFAPL 180

Query: 181 LFTDRKRDRYSGSLDLLWGLQMFLTNTFLIPYMAIRLNKASKDSAPQPQSKLGSLMTNRA 240
           LFTDRKRDRY  SLD+LWGLQMFLTNTFLIPYMAIRLN+A  DS P   S LGS+MTN A
Sbjct: 181 LFTDRKRDRYKSSLDVLWGLQMFLTNTFLIPYMAIRLNEADADSRPTKLSPLGSVMTNGA 240

Query: 241 PVVGLIGGAACIISIIWSFVGRADGNFGGVTERWEFLIQYLSTERLAYAFIWDICLYTVF 300
            VVGL GGA C+ S IW+  GR DG FG +T+RW+FL+ YL +ERLAYAFIWDICLYT+F
Sbjct: 241 AVVGLTGGAVCVFSAIWALYGRMDGEFGNITDRWQFLVSYLGSERLAYAFIWDICLYTIF 300

Query: 301 QPWLIGENLQNVKESKVGLVSSLRFVPVVGLIVYLLFLKLDEE 341
           QPWLIGENLQNV++SKVG+VS LRF+PVVGL+ YLLFL L+E+
Sbjct: 301 QPWLIGENLQNVEKSKVGVVSYLRFIPVVGLVAYLLFLNLEED 336

BLAST of Cla007983 vs. NCBI nr
Match: gi|567890355|ref|XP_006437698.1| (hypothetical protein CICLE_v10031676mg [Citrus clementina])

HSP 1 Score: 442.2 bits (1136), Expect = 8.3e-121
Identity = 226/344 (65.70%), Postives = 269/344 (78.20%), Query Frame = 1

Query: 2   ATAMIASQALLCNHSFPVIKPYALTSSLSPFRLCRPQQPILSPSISTTTPFSPSPTRRNF 61
           A A + + ++      P  K +++ S+ S F    P++   + +IS+ +   P   +R  
Sbjct: 70  ADAKMITMSVAAATPLPCFKNHSILSTKS-FNSKPPRRHFDNATISSQS-LPPVQKQRRV 129

Query: 62  SLTR----RSARRKPTLV-DSAGFSDEGDSDVRRVLQILLWAAEAVYILWLFLLPYAPGD 121
            LTR     +ARRKPT+  D++  S EG+ +VRRVLQI+LWAAEAVYILWLFLLPYAPGD
Sbjct: 130 HLTRTVVCHAARRKPTVAADASKASAEGNDNVRRVLQIVLWAAEAVYILWLFLLPYAPGD 189

Query: 122 PVWAISSETVNSLVGLSLNFFLILPAMNSVGIRLIDAPVLHPMSEGLFNFVIAWTLMFAP 181
           PVWAISSETVNSLVGLSLNFF +LP MNSVGIRLIDAPVLHPMSEGLFNFVI WT MFAP
Sbjct: 190 PVWAISSETVNSLVGLSLNFFFVLPLMNSVGIRLIDAPVLHPMSEGLFNFVIGWTFMFAP 249

Query: 182 LLFTDRKRDRYSGSLDLLWGLQMFLTNTFLIPYMAIRLNKASKDSAPQPQSKLGSLMTNR 241
           LLFTD KRDRY GSLD+LWG QMFLTNTFLIPYMAIRLN+A  +  P+ +S+L S+MTN 
Sbjct: 250 LLFTDCKRDRYKGSLDVLWGFQMFLTNTFLIPYMAIRLNEACSEDTPRDRSQLASVMTNG 309

Query: 242 APVVGLIGGAACIISIIWSFVGRADGNFGGVTERWEFLIQYLSTERLAYAFIWDICLYTV 301
           AP+VGLIGGA C++S +W+  GR DG+FGG+TERWEFL+ YL +ERLAYAFIWDI LY +
Sbjct: 310 APIVGLIGGAICLLSTLWALYGRMDGDFGGITERWEFLVSYLGSERLAYAFIWDIFLYII 369

Query: 302 FQPWLIGENLQNVKESKVGLVSSLRFVPVVGLIVYLLFLKLDEE 341
           FQ WLIG+NLQNV+ SKVG V+ LRFVPVVGL  YLLFL LDEE
Sbjct: 370 FQAWLIGDNLQNVQLSKVGTVNYLRFVPVVGLTAYLLFLNLDEE 411

BLAST of Cla007983 vs. NCBI nr
Match: gi|590700900|ref|XP_007046268.1| (Uncharacterized protein TCM_011834 [Theobroma cacao])

HSP 1 Score: 441.8 bits (1135), Expect = 1.1e-120
Identity = 227/345 (65.80%), Postives = 269/345 (77.97%), Query Frame = 1

Query: 4   AMIASQALLCNHS---FPVIKPYALTS---SLSPFRLCRPQQPILS-PSISTTTPFSPSP 63
           A+IA+QALLCN      P  +P +  S   SLS   L   + P  + P+IS + P +P+ 
Sbjct: 2   ALIATQALLCNTHGTFLPGPRPRSFDSDPSSLSRHHLYYCKHPGKTWPAISFSIPTNPTS 61

Query: 64  TRRNFSLTRRSARRKPTLVDSAGFSDEGDSD-VRRVLQILLWAAEAVYILWLFLLPYAPG 123
                 + R S RRK T V  A  S+EGD D +RRV Q+ LW AEAVYI WLFLLPYAPG
Sbjct: 62  QHSTAPVCRES-RRKSTAVSPA--SEEGDGDSLRRVFQVALWTAEAVYISWLFLLPYAPG 121

Query: 124 DPVWAISSETVNSLVGLSLNFFLILPAMNSVGIRLIDAPVLHPMSEGLFNFVIAWTLMFA 183
           DPVWAISSET+N+L+GLSLNF  ILP  N+VGIRLIDAPVLHPMSEGLFNFVI WTLMFA
Sbjct: 122 DPVWAISSETINALIGLSLNFLFILPLTNAVGIRLIDAPVLHPMSEGLFNFVIGWTLMFA 181

Query: 184 PLLFTDRKRDRYSGSLDLLWGLQMFLTNTFLIPYMAIRLNKASKDSAPQPQSKLGSLMTN 243
           PLL+TD KRDRY GSLD+LWGLQMFLTNTFLIPYMAIRLN+A  D  P  +S LGS+MTN
Sbjct: 182 PLLYTDCKRDRYKGSLDVLWGLQMFLTNTFLIPYMAIRLNEADADGPPSKRSPLGSVMTN 241

Query: 244 RAPVVGLIGGAACIISIIWSFVGRADGNFGGVTERWEFLIQYLSTERLAYAFIWDICLYT 303
            APVVGLIGGA C++S IW+ +GR DG+FG +T+RW+FLI YL +ERLAYAFIWDIC Y 
Sbjct: 242 GAPVVGLIGGAVCLLSAIWALIGRMDGDFGSITDRWQFLISYLGSERLAYAFIWDICFYI 301

Query: 304 VFQPWLIGENLQNVKESKVGLVSSLRFVPVVGLIVYLLFLKLDEE 341
           +FQPWLIGENLQNV++S+V LV+ L+F+PVVGL+ YLLFL+L+EE
Sbjct: 302 IFQPWLIGENLQNVQKSRVPLVNYLKFIPVVGLVAYLLFLELEEE 343

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
A0A0A0KC53_CUCSA2.1e-17188.27Uncharacterized protein OS=Cucumis sativus GN=Csa_6G031960 PE=4 SV=1[more]
A0A0D2VCB1_GOSRA2.6e-12165.01Uncharacterized protein OS=Gossypium raimondii GN=B456_013G081400 PE=4 SV=1[more]
V4SSX2_9ROSI5.8e-12165.70Uncharacterized protein OS=Citrus clementina GN=CICLE_v10031676mg PE=4 SV=1[more]
A0A061EAT1_THECC7.5e-12165.80Uncharacterized protein OS=Theobroma cacao GN=TCM_011834 PE=4 SV=1[more]
A0A067KC29_JATCU1.4e-11962.25Uncharacterized protein OS=Jatropha curcas GN=JCGZ_18310 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
gi|449465643|ref|XP_004150537.1|3.0e-17188.27PREDICTED: uncharacterized protein LOC101210554 isoform X1 [Cucumis sativus][more]
gi|659089686|ref|XP_008445644.1|2.2e-13786.16PREDICTED: uncharacterized protein LOC103488603 [Cucumis melo][more]
gi|823261890|ref|XP_012463682.1|3.7e-12165.01PREDICTED: uncharacterized protein LOC105783049 [Gossypium raimondii][more]
gi|567890355|ref|XP_006437698.1|8.3e-12165.70hypothetical protein CICLE_v10031676mg [Citrus clementina][more]
gi|590700900|ref|XP_007046268.1|1.1e-12065.80Uncharacterized protein TCM_011834 [Theobroma cacao][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0019288 isopentenyl diphosphate biosynthetic process, methylerythritol 4-phosphate pathway
biological_process GO:0015031 protein transport
biological_process GO:0008150 biological_process
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0005575 cellular_component
cellular_component GO:0016020 membrane
molecular_function GO:0003674 molecular_function
molecular_function GO:0016874 ligase activity
This gene is associated with the following unigenes:
Unigene NameAnalysis NameSequence type in Unigene
WMU57716watermelon EST collection version 2.0transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla007983Cla007983.1mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature NameUnique NameType
WMU57716WMU57716transcribed_cluster


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR36367FAMILY NOT NAMEDcoord: 9..341
score: 6.3E
NoneNo IPR availablePANTHERPTHR36367:SF1SUBFAMILY NOT NAMEDcoord: 9..341
score: 6.3E

The following gene(s) are paralogous to this gene:

None