Cla97C05G108820 (gene) Watermelon (97103) v2

Name: Cla97C05G108820
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: DUF4378 domain-containing protein
Location: Cla97Chr05 : 35408190 .. 35409142 (-)
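
The location string can be unpacked programmatically. The following is a minimal sketch, assuming the `seqid : start .. end (strand)` layout shown in this record and assuming the coordinates are 1-based and inclusive (the page does not state the convention). The function name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Split a 'seqid : start .. end (strand)' string into its parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so span = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("Cla97Chr05 : 35408190 .. 35409142 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the feature spans 953 bases on the minus strand of Cla97Chr05.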
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTTTTCCTCTACTCTCCACTTGCATTTCAACCTCTCTTTTTCATACCATCCATGACACCAAGATCACCACTGAGACAACTTCTGCAAGAGCAACAAGAACCTTTCGAGTTGGAGGATTACCTTTTTGAAAGAGACAACTACTCGAGGAAGAGCCTGAGTCGGGGCAGTGGCTTTGCATGCAGCGGTGGAAAATTAGACAGCATTGTGAAGTTTGGAAAGGGATTGGTAGAGATCAATAAGGTGCTGAAGAATTCATGTAAGAAGCTTGTTGCCATCAATAGGAAGCAGAAAACTAAAGATTTGGGAAGAAATGGTTGGATTTTTAGTGTTGGCTGCAAGGGAGTTACAGAATCAGACACCTTTCCGTCTCCCTGTAGCACCAGAAAGACTGACCGTTCTGCTTCTGGAAAAGATAATGCAGAAACATCGTCTTCAACGAGAAGACAACATGCTAATTCATCAACTTCAGACACTTCCCAAGCTCCAGAATCCTGCAACAGGCAAGTACTGAAGGTATGGTCATTCTCTTTAAGCCTCAGTCCAAATCTGCTCACGTTGTTTTCAGATTAAATTATGACTCGTCCCAGACACCCAATTCATCATAATCCATTTGAACTGCGGGCGTTTTTGAATCATATGTTCATCAACAGAGATCCAAAATGACCAGTCTTTAAAGGAGACTCCAGATGTCCCATTTCTATTCTTTCATCAATGTGAATCTTAAAATGTGAAATACCAAAGCTACCTGCCGGACTTGCATCTTTTACAACTGTTTGTAAACTTGATGAACCATGTAACATTTTTACAGTTCTTGAGGAAATTTAGTTTTGATTTGAACTAACGACACCGAACAGGCTGCGGCAGGGAAAAAGCTGGTGTCTAGGAGCATGGAACCAAGTAAGCAACTCAGACCTGGATTAGACAATGATTTTCATCTTCCGTGTCCATGA

mRNA sequence

ATGTTTTTCCTCTACTCTCCACTTGCATTTCAACCTCTCTTTTTCATACCATCCATGACACCAAGATCACCACTGAGACAACTTCTGCAAGAGCAACAAGAACCTTTCGAGTTGGAGGATTACCTTTTTGAAAGAGACAACTACTCGAGGAAGAGCCTGAGTCGGGGCAGTGGCTTTGCATGCAGCGGTGGAAAATTAGACAGCATTGTGAAGTTTGGAAAGGGATTGGTAGAGATCAATAAGGTGCTGAAGAATTCATGTAAGAAGCTTGTTGCCATCAATAGGAAGCAGAAAACTAAAGATTTGGGAAGAAATGGTTGGATTTTTAGTGTTGGCTGCAAGGGAGTTACAGAATCAGACACCTTTCCGTCTCCCTGTAGCACCAGAAAGACTGACCGTTCTGCTTCTGGAAAAGATAATGCAGAAACATCGTCTTCAACGAGAAGACAACATGCTAATTCATCAACTTCAGACACTTCCCAAGCTCCAGAATCCTGCAACAGGCAAGTACTGAAGGCTGCGGCAGGGAAAAAGCTGGTGTCTAGGAGCATGGAACCAAGTAAGCAACTCAGACCTGGATTAGACAATGATTTTCATCTTCCGTGTCCATGA

Coding sequence (CDS)

ATGTTTTTCCTCTACTCTCCACTTGCATTTCAACCTCTCTTTTTCATACCATCCATGACACCAAGATCACCACTGAGACAACTTCTGCAAGAGCAACAAGAACCTTTCGAGTTGGAGGATTACCTTTTTGAAAGAGACAACTACTCGAGGAAGAGCCTGAGTCGGGGCAGTGGCTTTGCATGCAGCGGTGGAAAATTAGACAGCATTGTGAAGTTTGGAAAGGGATTGGTAGAGATCAATAAGGTGCTGAAGAATTCATGTAAGAAGCTTGTTGCCATCAATAGGAAGCAGAAAACTAAAGATTTGGGAAGAAATGGTTGGATTTTTAGTGTTGGCTGCAAGGGAGTTACAGAATCAGACACCTTTCCGTCTCCCTGTAGCACCAGAAAGACTGACCGTTCTGCTTCTGGAAAAGATAATGCAGAAACATCGTCTTCAACGAGAAGACAACATGCTAATTCATCAACTTCAGACACTTCCCAAGCTCCAGAATCCTGCAACAGGCAAGTACTGAAGGCTGCGGCAGGGAAAAAGCTGGTGTCTAGGAGCATGGAACCAAGTAAGCAACTCAGACCTGGATTAGACAATGATTTTCATCTTCCGTGTCCATGA

Protein sequence

MFFLYSPLAFQPLFFIPSMTPRSPLRQLLQEQQEPFELEDYLFERDNYSRKSLSRGSGFACSGGKLDSIVKFGKGLVEINKVLKNSCKKLVAINRKQKTKDLGRNGWIFSVGCKGVTESDTFPSPCSTRKTDRSASGKDNAETSSSTRRQHANSSTSDTSQAPESCNRQVLKAAAGKKLVSRSMEPSKQLRPGLDNDFHLPCP
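
The protein sequence is the frame-0 translation of the CDS. A minimal sketch of that translation follows, assuming the standard genetic code (the page does not name the translation table). The names `CODON_TABLE` and `translate` are illustrative and not part of any database tooling.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for pos in range(0, usable, 3):
        aa = CODON_TABLE[cds[pos:pos + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons and last five codons of the CDS listed above.
print(translate("ATGTTTTTCCTCTACTCTCCACTT"))  # MFFLYSPL
print(translate("CTTCCGTGTCCATGA"))           # LPCP
```

The two fragments reproduce the first eight and last four residues of the protein sequence; passing the full CDS string to `translate` would be the way to check the whole record.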
BLAST of Cla97C05G108820 vs. NCBI nr
Match: XP_004141295.1 (PREDICTED: uncharacterized protein LOC101203076 isoform X1 [Cucumis sativus] >KGN55291.1 hypothetical protein Csa_4G644680 [Cucumis sativus])

HSP 1 Score: 241.1 bits (614), Expect = 3.2e-60
Identity = 128/187 (68.45%), Positives = 139/187 (74.33%), Query Frame = 0

Query: 3   FLYSPLAFQPLFFIPSMTPRSPLRQLLQEQQEPFELEDYLFERDNYSRKSLSRGSGFACS 62
           F YSPL FQ LF I SMTPRSPLRQLLQEQQEPF+LE+YL ERD YSRKSLSRGSG ACS
Sbjct: 2   FFYSPLTFQTLFLIASMTPRSPLRQLLQEQQEPFKLEEYLSERDYYSRKSLSRGSGLACS 61

Query: 63  GGKLDSIVKFGKGLVEINKVLKNSCKKLVAINRKQKTKDLGRNGWIFSVGCKGVTESDTF 122
           GGKLDSI+KFGKG +EIN +LKNSCKKLV INRKQ+ KDLG+NGWIFSVGCK V ESD F
Sbjct: 62  GGKLDSIMKFGKGFMEINTLLKNSCKKLVFINRKQQNKDLGKNGWIFSVGCKRVAESDNF 121

Query: 123 PSPCSTRKTDRSASGKDNAEXXXXXXXXXXXXXXXXXXXXXXXXXXXVLKAAAGKKLVSR 182
            SPCS+RKT  S SGKD+ E                           V+KAAAGKKLVSR
Sbjct: 122 LSPCSSRKTVHSVSGKDDEETLSSTRGRHANSSTSNTFEAPEPCNFQVVKAAAGKKLVSR 181

Query: 183 SMEPSKQ 190
           SMEP K+
Sbjct: 182 SMEPRKR 188
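
The percentages in each HSP header are the residue counts divided by the alignment length. A short sketch of that arithmetic, using the counts reported for the first hit (128 identical and 139 positive-scoring positions out of 187 aligned); the helper name `percent` is hypothetical.

```python
def percent(count, aligned_length):
    """Return count / aligned_length as a percentage rounded to two decimals."""
    return round(100 * count / aligned_length, 2)

identity = percent(128, 187)
positives = percent(139, 187)
print(identity, positives)  # 68.45 74.33
```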

BLAST of Cla97C05G108820 vs. NCBI nr
Match: XP_008452675.1 (PREDICTED: uncharacterized protein LOC103493622 [Cucumis melo])

HSP 1 Score: 236.5 bits (602), Expect = 8.0e-59
Identity = 126/187 (67.38%), Positives = 137/187 (73.26%), Query Frame = 0

Query: 3   FLYSPLAFQPLFFIPSMTPRSPLRQLLQEQQEPFELEDYLFERDNYSRKSLSRGSGFACS 62
           F YSPL FQ LF I SMTPRSPLR+LLQEQQEPFELEDYL ERD YSRKSLSR S FACS
Sbjct: 2   FFYSPLTFQTLFLIASMTPRSPLRELLQEQQEPFELEDYLSERDYYSRKSLSRRSDFACS 61

Query: 63  GGKLDSIVKFGKGLVEINKVLKNSCKKLVAINRKQKTKDLGRNGWIFSVGCKGVTESDTF 122
            GKLDSI+KFGKG +EIN +LKNSCKKLV+INRKQ+ K LG+NGWIFSVGCK V ESD F
Sbjct: 62  WGKLDSIMKFGKGFIEINTLLKNSCKKLVSINRKQQNKGLGKNGWIFSVGCKRVAESDNF 121

Query: 123 PSPCSTRKTDRSASGKDNAEXXXXXXXXXXXXXXXXXXXXXXXXXXXVLKAAAGKKLVSR 182
            SPCS+RKT  S SGKDN E                           V+KAAAGKKL+SR
Sbjct: 122 LSPCSSRKTVHSVSGKDNEETFSSTRRRHANSSTSNTFQAPEPCNFQVVKAAAGKKLLSR 181

Query: 183 SMEPSKQ 190
           SMEP K+
Sbjct: 182 SMEPRKR 188

BLAST of Cla97C05G108820 vs. NCBI nr
Match: XP_004141296.1 (PREDICTED: uncharacterized protein LOC101203076 isoform X2 [Cucumis sativus])

HSP 1 Score: 221.9 bits (564), Expect = 2.0e-54
Identity = 117/171 (68.42%), Positives = 128/171 (74.85%), Query Frame = 0

Query: 19  MTPRSPLRQLLQEQQEPFELEDYLFERDNYSRKSLSRGSGFACSGGKLDSIVKFGKGLVE 78
           MTPRSPLRQLLQEQQEPF+LE+YL ERD YSRKSLSRGSG ACSGGKLDSI+KFGKG +E
Sbjct: 1   MTPRSPLRQLLQEQQEPFKLEEYLSERDYYSRKSLSRGSGLACSGGKLDSIMKFGKGFME 60

Query: 79  INKVLKNSCKKLVAINRKQKTKDLGRNGWIFSVGCKGVTESDTFPSPCSTRKTDRSASGK 138
           IN +LKNSCKKLV INRKQ+ KDLG+NGWIFSVGCK V ESD F SPCS+RKT  S SGK
Sbjct: 61  INTLLKNSCKKLVFINRKQQNKDLGKNGWIFSVGCKRVAESDNFLSPCSSRKTVHSVSGK 120

Query: 139 DNAEXXXXXXXXXXXXXXXXXXXXXXXXXXXVLKAAAGKKLVSRSMEPSKQ 190
           D+ E                           V+KAAAGKKLVSRSMEP K+
Sbjct: 121 DDEETLSSTRGRHANSSTSNTFEAPEPCNFQVVKAAAGKKLVSRSMEPRKR 171

BLAST of Cla97C05G108820 vs. NCBI nr
Match: XP_022936977.1 (uncharacterized protein LOC111443406 [Cucurbita moschata])

HSP 1 Score: 206.8 bits (525), Expect = 6.8e-50
Identity = 102/132 (77.27%), Positives = 115/132 (87.12%), Query Frame = 0

Query: 1   MFFLYSPLAFQPLFFIPSMTPRSPLRQLLQEQQEPFELEDYLFERDNYSRKSLSRGSGFA 60
           MFF YSP  FQPLFFI SMTPRSPLRQLLQEQQEPFELEDYLFER+ YSRKS +RGSGF 
Sbjct: 1   MFFFYSPFVFQPLFFIESMTPRSPLRQLLQEQQEPFELEDYLFEREWYSRKSSNRGSGFG 60

Query: 61  CSGGKLDSIVKFGKGLVEINKVLKNSCKKLVAINRK-QKTKDLGRNGWIFSVGCKGVTES 120
           CSGGKL+ ++KFGKG VEINK+L+NSCKKL++I+RK Q+TKD+G NGW FSVGCK V ES
Sbjct: 61  CSGGKLNGVMKFGKGFVEINKMLRNSCKKLLSIHRKQQQTKDVGENGWFFSVGCKRVAES 120

Query: 121 DTFPSPCSTRKT 132
           D F SPCS +KT
Sbjct: 121 DNFLSPCSFQKT 132

BLAST of Cla97C05G108820 vs. NCBI nr
Match: XP_022977058.1 (uncharacterized protein LOC111477240 [Cucurbita maxima])

HSP 1 Score: 205.7 bits (522), Expect = 1.5e-49
Identity = 103/132 (78.03%), Positives = 114/132 (86.36%), Query Frame = 0

Query: 1   MFFLYSPLAFQPLFFIPSMTPRSPLRQLLQEQQEPFELEDYLFERDNYSRKSLSRGSGFA 60
           MF  +SP  FQP FFI SMTPRSPLRQLLQEQQEPFELEDYLFER+ YSRKS SRGSGFA
Sbjct: 1   MFLFHSPFVFQPHFFIESMTPRSPLRQLLQEQQEPFELEDYLFEREYYSRKSSSRGSGFA 60

Query: 61  CSGGKLDSIVKFGKGLVEINKVLKNSCKKLVAINRK-QKTKDLGRNGWIFSVGCKGVTES 120
           CSGGKLD ++KFGKGLVEINK+L+NSCKKL++I+RK Q+TKDLG NGW FSV CK V ES
Sbjct: 61  CSGGKLDGVMKFGKGLVEINKMLRNSCKKLLSIHRKQQQTKDLGENGWFFSVSCKRVAES 120

Query: 121 DTFPSPCSTRKT 132
           D F SPCS +KT
Sbjct: 121 DNFLSPCSFQKT 132

BLAST of Cla97C05G108820 vs. TrEMBL
Match: tr|A0A0A0L558|A0A0A0L558_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G644680 PE=4 SV=1)

HSP 1 Score: 241.1 bits (614), Expect = 2.1e-60
Identity = 128/187 (68.45%), Positives = 139/187 (74.33%), Query Frame = 0

Query: 3   FLYSPLAFQPLFFIPSMTPRSPLRQLLQEQQEPFELEDYLFERDNYSRKSLSRGSGFACS 62
           F YSPL FQ LF I SMTPRSPLRQLLQEQQEPF+LE+YL ERD YSRKSLSRGSG ACS
Sbjct: 2   FFYSPLTFQTLFLIASMTPRSPLRQLLQEQQEPFKLEEYLSERDYYSRKSLSRGSGLACS 61

Query: 63  GGKLDSIVKFGKGLVEINKVLKNSCKKLVAINRKQKTKDLGRNGWIFSVGCKGVTESDTF 122
           GGKLDSI+KFGKG +EIN +LKNSCKKLV INRKQ+ KDLG+NGWIFSVGCK V ESD F
Sbjct: 62  GGKLDSIMKFGKGFMEINTLLKNSCKKLVFINRKQQNKDLGKNGWIFSVGCKRVAESDNF 121

Query: 123 PSPCSTRKTDRSASGKDNAEXXXXXXXXXXXXXXXXXXXXXXXXXXXVLKAAAGKKLVSR 182
            SPCS+RKT  S SGKD+ E                           V+KAAAGKKLVSR
Sbjct: 122 LSPCSSRKTVHSVSGKDDEETLSSTRGRHANSSTSNTFEAPEPCNFQVVKAAAGKKLVSR 181

Query: 183 SMEPSKQ 190
           SMEP K+
Sbjct: 182 SMEPRKR 188

BLAST of Cla97C05G108820 vs. TrEMBL
Match: tr|A0A1S3BV72|A0A1S3BV72_CUCME (uncharacterized protein LOC103493622 OS=Cucumis melo OX=3656 GN=LOC103493622 PE=4 SV=1)

HSP 1 Score: 236.5 bits (602), Expect = 5.3e-59
Identity = 126/187 (67.38%), Positives = 137/187 (73.26%), Query Frame = 0

Query: 3   FLYSPLAFQPLFFIPSMTPRSPLRQLLQEQQEPFELEDYLFERDNYSRKSLSRGSGFACS 62
           F YSPL FQ LF I SMTPRSPLR+LLQEQQEPFELEDYL ERD YSRKSLSR S FACS
Sbjct: 2   FFYSPLTFQTLFLIASMTPRSPLRELLQEQQEPFELEDYLSERDYYSRKSLSRRSDFACS 61

Query: 63  GGKLDSIVKFGKGLVEINKVLKNSCKKLVAINRKQKTKDLGRNGWIFSVGCKGVTESDTF 122
            GKLDSI+KFGKG +EIN +LKNSCKKLV+INRKQ+ K LG+NGWIFSVGCK V ESD F
Sbjct: 62  WGKLDSIMKFGKGFIEINTLLKNSCKKLVSINRKQQNKGLGKNGWIFSVGCKRVAESDNF 121

Query: 123 PSPCSTRKTDRSASGKDNAEXXXXXXXXXXXXXXXXXXXXXXXXXXXVLKAAAGKKLVSR 182
            SPCS+RKT  S SGKDN E                           V+KAAAGKKL+SR
Sbjct: 122 LSPCSSRKTVHSVSGKDNEETFSSTRRRHANSSTSNTFQAPEPCNFQVVKAAAGKKLLSR 181

Query: 183 SMEPSKQ 190
           SMEP K+
Sbjct: 182 SMEPRKR 188

BLAST of Cla97C05G108820 vs. TrEMBL
Match: tr|F6H178|F6H178_VITVI (Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VIT_18s0001g08690 PE=4 SV=1)

HSP 1 Score: 62.8 bits (151), Expect = 1.0e-06
Identity = 49/136 (36.03%), Positives = 67/136 (49.26%), Query Frame = 0

Query: 21  PRSPLRQLLQEQQEPFELEDYLFERDNYSRKSLSRGSGFACSGGKLDSIVK--------- 80
           P   L QLLQEQQEPF LE YL ER    +K+L+   GF+C  G    ++K         
Sbjct: 8   PGKQLGQLLQEQQEPFILETYLLER-GCLKKNLTSEGGFSCCHGNSSKLLKRSASCDLNM 67

Query: 81  FGKGLVEINKVLKNSCKKLVAINRKQKTKDLGRNGWIFSVGCKG-----VTESDTFPSPC 140
            GKG+ + +K+L+    KLV+I    K+K    +G + S    G     V E D+F S  
Sbjct: 68  SGKGIPQCSKILRAIFNKLVSITGNPKSKLCDHDGGVLSFSEVGSSSQKVAEQDSF-SSA 127

Query: 141 STRKTDRSASGKDNAE 143
           ST     S S  D+ +
Sbjct: 128 STTTVFNSCSASDSED 141

BLAST of Cla97C05G108820 vs. TrEMBL
Match: tr|A0A1Q3ALK9|A0A1Q3ALK9_CEPFO (DUF4378 domain-containing protein OS=Cephalotus follicularis OX=3775 GN=CFOL_v3_00179 PE=4 SV=1)

HSP 1 Score: 60.1 bits (144), Expect = 6.8e-06
Identity = 41/99 (41.41%), Positives = 57/99 (57.58%), Query Frame = 0

Query: 21  PRSPLRQLLQEQQEPFELEDYLFERDNYSRKSLSRGSGFACSGGKLDSIVKFGKGLVEIN 80
           P   LR++LQEQQEPF L+ YL ER  Y +KSLS  +G +C   +  ++ K  +G+   +
Sbjct: 8   PAKKLREVLQEQQEPFLLDIYLTER-GYLKKSLSSNAGLSCGHARSSTLNKSRRGIPHHS 67

Query: 81  KVLKNSCKKLVAINRKQKTKDLG-RNGWIFSVGCKGVTE 119
           KV K    KL++I+   K K+ G R G    VG   VTE
Sbjct: 68  KVFKTVSGKLISISESLKIKNSGNRKG---KVGGVCVTE 102

BLAST of Cla97C05G108820 vs. TrEMBL
Match: tr|A0A2N9EMX4|A0A2N9EMX4_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS4050 PE=4 SV=1)

HSP 1 Score: 57.8 bits (138), Expect = 3.4e-05
Identity = 42/124 (33.87%), Positives = 57/124 (45.97%), Query Frame = 0

Query: 21  PRSPLRQLLQEQQEPFELEDYLFERDNYSRKSLSRGSGFACSGGKLDSIVKFG------- 80
           P   L +LLQEQQEPF LE YL ER    + SL+  + F C  G   + +K         
Sbjct: 8   PSKQLGELLQEQQEPFILEVYLLERGYQKKNSLNFLATFGCCHGNSTNFLKKSASCGLNK 67

Query: 81  --KGLVEINKVLKNSCKKLVAINRKQKTKDLGRNGWIFSVGC-------KGVTESDTFPS 129
             KG+   +K+L+  CKK V+I   Q  K        +SV         + + ESD F S
Sbjct: 68  STKGIPHCSKILRTVCKKFVSIKEDQSMKSSDNRNEEYSVTVDEMDWDGQEIVESDRFSS 127

The following BLAST results are available for this feature:
Match Name      E-value  Identity  Description
XP_004141295.1  3.2e-60  68.45     PREDICTED: uncharacterized protein LOC101203076 isoform X1 [Cucumis sativus] >KG...
XP_008452675.1  8.0e-59  67.38     PREDICTED: uncharacterized protein LOC103493622 [Cucumis melo]
XP_004141296.1  2.0e-54  68.42     PREDICTED: uncharacterized protein LOC101203076 isoform X2 [Cucumis sativus]
XP_022936977.1  6.8e-50  77.27     uncharacterized protein LOC111443406 [Cucurbita moschata]
XP_022977058.1  1.5e-49  78.03     uncharacterized protein LOC111477240 [Cucurbita maxima]

Match Name                      E-value  Identity  Description
tr|A0A0A0L558|A0A0A0L558_CUCSA  2.1e-60  68.45     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G644680 PE=4 SV=1
tr|A0A1S3BV72|A0A1S3BV72_CUCME  5.3e-59  67.38     uncharacterized protein LOC103493622 OS=Cucumis melo OX=3656 GN=LOC103493622 PE=...
tr|F6H178|F6H178_VITVI          1.0e-06  36.03     Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VIT_18s0001g08690 PE=4 SV=...
tr|A0A1Q3ALK9|A0A1Q3ALK9_CEPFO  6.8e-06  41.41     DUF4378 domain-containing protein OS=Cephalotus follicularis OX=3775 GN=CFOL_v3_...
tr|A0A2N9EMX4|A0A2N9EMX4_FAGSY  3.4e-05  33.87     Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS4050 PE=4 SV=1
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0016740      transferase activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C05G108820.1  Cla97C05G108820.1  mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term  IPR Description   Source       Source Term  Source Description   Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 138..167
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 119..203
None      No IPR available  PANTHER      PTHR37613    FAMILY NOT NAMED     coord: 21..192