ClCG01G004020 (gene) Watermelon (Charleston Gray)

Name: ClCG01G004020
Type: gene
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: histidine triad nucleotide-binding 4 LENGTH=146
Location: CG_Chr01 : 4359123 .. 4364236 (-)
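
As a rough illustration of how the location field can be read programmatically, the sketch below parses the string into its parts and derives the span length. The function name `parse_location` is hypothetical, and the length assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re

# Hypothetical helper: splits "<seqid> : <start> .. <end> (<strand>)".
LOCATION = re.compile(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")

def parse_location(text):
    """Return seqid, start, end, strand and span length for a location string."""
    m = LOCATION.fullmatch(text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("CG_Chr01 : 4359123 .. 4364236 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 5114 bp on the minus strand of CG_Chr01.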
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCCAATCAAGAGGCTCCGCTGCAAATAGGGAAATTGGTGGGAGAAGTTTGTGAAAAGAAAAAATGGCAGCAACAGTTTCATCTTGTATTTTCTGCAAGAAAGCTTCCAATTCTACCGCCAATAACCTTCTTCATTTTGTCTGCTCCTTTCGTTTCTCTGATAAAAACCATTTTTCTCTGTGTTTCTTCTTCTCTTATACCGGCGATTGAATGAAGGTTCTCTGATTAAATATATGGGTTCTGCAGGATGAGAGAGTTGTTGCTTTTGAAGATATAAACCCTTCTGCTGTCAGGTATTTTGAACCTCGTAATCTAGAATGAACGCTTTTTTTTTTTTCTTTTTTTTTTTCTAGCGTTTGATTCAGTTTTGCGATCGATATGAGGCATCTGAAATATCTTCTGTGTTAATTTCATCTTCATTTCTGCATTTATTTAGATTCTTTAGGAAAATTTCTTCTACTTTTGTATACCCATTTCGTTGGATTGCTTGATATTCAATGTACAATCCACATCTAAGCCTGGATAGCAATGGCGGCAGGAAACTTTAGAAAGAAGATATGTTTTGGATGTGTGAGTAAAAGAAATGCAAAACGGATTTATTGTTATAAGGGTCTTCCACATTTTAGGTTTTGAACTGCCTCAAAATATTTCCATAGTTGTGATCTAAGGTTCACATGTGATTATGAATTGAAGTCCTCGATGATAATCTGTTGAGTTCAAGAGCTTTTGTCGCTTCTTTCAATCATAGTCTTTTAAATACTCTCCCATTTGTTTTTACTTTGTAGTTTGTTGGTGTTTAACATCTATGACCTGTTTATAGTAGGTTTTCAGTGGCCATCGAGAGTGGCCATCTATGACCTTCGCGAGTTGCCATCTCGAAGGTGCTGAGGATGGCCTCTATAAGGCCACTAAATGGTAAGTGTCAATTGCCATTATGGATTTTCGAGAGTGTAGAAAGAGATTTATGCACCACTTACATAGAAACAAAGAAATAGTGGGTTTGTTGTCAATCTTTTGATTTCGACCGTCATTTTTCAATCGTTTGTGTTTCTTCCGTTATGAGTAATTTACATGAAATTGTCCATCTCATTTCTTTCTATTGTTTTTTTTACTAGTAATTATAATGTATTAGGATGCATTTAGAATTCCCACCACAAAAATGGTAGATGCAACTGATCTTCTGTAATTAAAGACATATTTAAAAATGTTCTTACATTGGATGGTAACACTTTATAGGCATTTCTTGGTAATTCCCAAGGAGCATATTCCCACAGTTAGGAATCTCCAGAGAAGAGCTGAAGATTACTCACTGGGTAATGTAACCGTTGTTAATTGCAATTCATTTACGGCTTTTTGTTTAAAGAAAGGGAGAGGAGGAGGCAATAGGCTGGGGGAACTAAGTCCTTAAAAAGCAAGAGGTATAGTTACATGTTATAACTTGATAGAACCTATTGGTATTTTTTGTCTGCATTAAACCTTCAATAGAACCATAATACAATTCCCTCACCCTACCATATTCTCTCCTTCCTTTGACCTCCCTTTTTTAACCTTCTCATTCTCAAATCTAGCCTTCGGGTGGGAAAAACCTGCATGAAAATTAGTAGGGGCATAATGCTTAGAGTGGCCTTGACACTCCCATTTTGAGAAAATAGTAAAGATAGATATCCCAACAACTTTCTTCCTTTCTCTTTGGATGGAAAAAAACCCAGGCGTGCCACATAGGTGCCTTGCACCTAGACTAGAGGCACACCTTCTAAATTAAGGGGTTTTTTTATTGGTAAGAAACTAAACTTTCATTCAAAAATGACAGAACACAATAGCATAATAGAAACAAGCCCCAAAAATAGGAGTCCAACACTAACTACATAAGAAGGCTCCAATCTAAAAGATCAATACCAGTGTCCTAAGTACAAAAAGTTTAATGACTGACGCTCAAATGGAAGCATTAAACTTCATCAGTCCCCATACCTCCTCTTTAGATTTCTCCACGTCTCAAAA
GTCAGACTATTACGCTCCAACCACCCACAAAAGAGCAAAGAAGCTCACTTGCTGTGCTGCAATATTTTTCCCTTCTCCCTGGGTGGACCAAAATCACCTCCTCCGACAAAGAAGCACATACCCATTACGAGCCCCACTAACCCCTAAAGAGCTCCACCAATGATCCCAAAAATAGTTAGTGAAATGACACTCCACAACAAATGACTTAGATCCTCCTGCTGCCTCCCTCAGAGAATGCAATCCTGCATTATTAAATCTTTGTTCACGTTGCTAATTCTGATAATGACTTTGCTCATATCTGTCTTGATTATTTCTATCAGTAAGTCACATGTTAGAGGTGGGGCAGACACTTTTATCCCAAGATTCCCCTCAACTGAAGCACAGGTACTCTGAATGATAAACACTCCTCAAAATCTTTTATGTTGCTAGTCTTCTTCAGGAAAACGACGTTTATATTTTACTAAGTGCTAAAACAGGGTAAAACAAAAGAAAACATGGGCTTCAATGGTTTAGCAATGTCTCAATCCTTTTATTTTGGTTACTAATATTGTACGTGGGGCTGGTTTGTGCCTGTGACCATATTATTGTTTAAACATATAAAAATTTGCTACTATTTGCATTAGTTGATGGTAGTTTATTGTGAACAATTTTATTAATGTTAGAAAACCAATCACCTAAAAAGCTAGGTTTGGGACACAGTGCCTGGTAGCAACTTAGTTCTTCTTTCTTCTGAGGTGGATCAAGTGATGAATAATGTACTGTAGTTGTGAATTCCTACGAGTGCTGTTGTTCAGTTGTACTACAAACTTCATCCACTGCGAAAAGATATCTTGCTTATCTCTTCATTTCCTAGATGGCATGATGGAGAATCAGAGATGGATGGCCTTTCTGTTGGGCAGAATGGATATAGGATAGATACTTGACGAGAGTTGCAGTAATTTTTTGAAAAACATTTTTAGATTTTTTTTTTGGTGGCAAAAAAAGAGAAGTCTTATTTCCAATTGATTTCTGCTAAGAACCAAGTGAAAGTGAGTTAATAAACTTTTACAAAAAAGCTTTTCACATCACATGTGAAGATATCTTCCAGGCATTTCTTGGAATCTGATTGAGCGGAACAGGTGGAATTGTTTTGATGTGAAAAGCTTGTACGACAAGCCGAGGTTTCTTTCCAAGTTGAAGAGTTGTGGATGGCATTGAGAAACTTTAACTCCAATGAAATTCTGGAATGGAATTGTTCTTATTTGGATTTTTTAAAAGGAATTGTTATAGGCATGGTATTAGGATAAGATTAGGAATATTGTTACGCTTCTAGGAGTATATTAGTAATTGGCTTGAGAAGTGGTTATGCTAGTTTGTTATAGATAGCAAGGGTGGAGAAGGAGGAAGTTAGGCATGAAGTCGGTTGGTTTAAGGCTTGAGTGATCTCAAGAAAGAAAGGGTTCAAATTCTGTTGGGGTTCTATCATACTAGAACTTCATCAAAGCTGATCTAGTGAGCTCGCTGCTTGTATTTATGCAGCATACAAGTCTGATTTGATGCTTGGATTAGATTTTTGTCTTATGCAGTCTCACTCTCTTTTTGAAGTTTTAGCCAAATTACTTGCTGGAAGGCTTGAAGTGGTGTGAATGCCACTGTTTCTCCTTATCGTTTGGCCATTATAGGCTTAAGTGTTTACCTAAATTCATTTGGTGTTTTCTTTTGGAAATTTGGTAAGAAGGAACAGTCTCCCAAGGAATAGATAGGAATAGTGATAACATAAGATATGAGCCCACTTATTCTTGAGGCATTTCCATCCTCAAGCCAGCTCTAGAAGACAACCAGAATTTTAAGTCATTAAGTTTTTTCATGTAGAGCTTGAACCTAGATCTTGAAGGTTATTTCCAAATATAACGAGTCCTTACACTAGTAGGGCCAGCCCCTCAAGGATAAAGAGATAGTTGGTTTTTCAGGGTCTCTGTATACATATTTCATTTTAGGTTTAAATATTAGTTTGGTCTTTG
TATTTTCTATTTTGGTTTTTTTTAGTTCCTATACTTTCAAAGTATACATTTTGGTCTACATACTTTCAACTTCTGTTCATTTAGTCTTAATACTTTCAAATTGTCCATTTTGCTCCTTGTATTTTTAAAAAGTGACCATTTTAGTCCCTTCAATTTCATTTTTATCTCATAACTTTCTATCAAAATTTTAGCATGACACTTCATTCACTAAATTTCTTAAAATATAGCTATAAATTTGTATTAAAGGGTTTCCATCGTGTTAAAGTTTTACAATAGGGATCAAGGTAGTCACTTTTTAAAAGTAAAGAGACCAAAACGGAGGTTCTGAATTGGGCTAAATTGAATGACCAAAATGGACAAAAACAGAAATACAAAACCAAAATAGTATTTAAACTTTCATTTTATCAATGAAATGCTTTGCTCCATATTGGAAAAAAAGTACTAAATGCTTCACAGTTCTTCTGACTTCAATTCTACTTTTAGTTCTTATCTTTTACTTTAGAGCAGCATGATTCAAAATTAATTCTTCTCTTCTCTTTCTCTGAAAGGTTTGGCTTCCATCAGCCACCAATGAACAGTGTCAATCACCTACATCTTCATTGTTTTGCCCTTCCTTACACACCCAGGTAACCAGTTTCTCAGTTTTCATATTATTAACACATACTTAGTATAGTGCTCCCATAAATGTCCAATTTGACATACTGTTTATTTCAACAACAAAAATCTGAACCGAAATGTGGTGCAAACAGGTGGAAGTTCGTAAAATATTTATCATTGGGATCGATCGGATTCATTGAAGCTGAGAAGTTGTTGGAGAAAATAAAGCCTTAACTCACACACCAGATCACTTCAGCAGTAGTGAGTATGTGTCCTTAATGATTGTTCACTGTTGTTATAAAGAGCAGAAAAGTTTGATATGTCCAACCAAAATTGAGGTATCAATTATGATGTATTATTTGATTGATTAAAAGTTGTGTCAATCTCTACAGGAATTGTTCTTTTTAGTATTCACTGGAAGAAAGGCTTTGTTTTTATTGCTTCTGATTCAGGAAAGGGACCAATGGCTTTTATTATTATTATTTTAATGGAATGAGAAGTAAAAGATGAATGTGTA

mRNA sequence

ATGCCCAATCAAGAGGCTCCGCTGCAAATAGGGAAATTGGTGGGAGAAGTTTGTGAAAAGAAAAAATGGCAGCAACAGTTTCATCTTGTATTTTCTGCAAGAAAGCTTCCAATTCTACCGCCAATAACCTTCTTCATTTTGATGAGAGAGTTGTTGCTTTTGAAGATATAAACCCTTCTGCTGTCAGTGGCCATCGAGAGTGGCCATCTATGACCTTCGCGAGTTGCCATCTCGAAGGTGCTGAGGATGGCCTCTATAAGGCCACTAAATGGCATTTCTTGGTAATTCCCAAGGAGCATATTCCCACAGTTAGGAATCTCCAGAGAAGAGCTGAAGATTACTCACTGGTAAGTCACATGTTAGAGGTGGGGCAGACACTTTTATCCCAAGATTCCCCTCAACTGAAGCACAGGTTTGGCTTCCATCAGCCACCAATGAACAGTGTCAATCACCTACATCTTCATTGTTTTGCCCTTCCTTACACACCCAGGTGGAAGTTCGTAAAATATTTATCATTGGGATCGATCGGATTCATTGAAGCTGAGAAGTTGTTGGAGAAAATAAAGCCTTAACTCACACACCAGATCACTTCAGCAGTAGTGAGTATGTGTCCTTAATGATTGTTCACTGTTGTTATAAAGAGCAGAAAAGTTTGATATGTCCAACCAAAATTGAGGTATCAATTATGATGTATTATTTGATTGATTAAAAGTTGTGTCAATCTCTACAGGAATTGTTCTTTTTAGTATTCACTGGAAGAAAGGCTTTGTTTTTATTGCTTCTGATTCAGGAAAGGGACCAATGGCTTTTATTATTATTATTTTAATGGAATGAGAAGTAAAAGATGAATGTGTA

Coding sequence (CDS)

ATGGCAGCAACAGTTTCATCTTGTATTTTCTGCAAGAAAGCTTCCAATTCTACCGCCAATAACCTTCTTCATTTTGATGAGAGAGTTGTTGCTTTTGAAGATATAAACCCTTCTGCTGTCAGTGGCCATCGAGAGTGGCCATCTATGACCTTCGCGAGTTGCCATCTCGAAGGTGCTGAGGATGGCCTCTATAAGGCCACTAAATGGCATTTCTTGGTAATTCCCAAGGAGCATATTCCCACAGTTAGGAATCTCCAGAGAAGAGCTGAAGATTACTCACTGGTAAGTCACATGTTAGAGGTGGGGCAGACACTTTTATCCCAAGATTCCCCTCAACTGAAGCACAGGTTTGGCTTCCATCAGCCACCAATGAACAGTGTCAATCACCTACATCTTCATTGTTTTGCCCTTCCTTACACACCCAGGTGGAAGTTCGTAAAATATTTATCATTGGGATCGATCGGATTCATTGAAGCTGAGAAGTTGTTGGAGAAAATAAAGCCTTAA

Protein sequence

MAATVSSCIFCKKASNSTANNLLHFDERVVAFEDINPSAVSGHREWPSMTFASCHLEGAEDGLYKATKWHFLVIPKEHIPTVRNLQRRAEDYSLVSHMLEVGQTLLSQDSPQLKHRFGFHQPPMNSVNHLHLHCFALPYTPRWKFVKYLSLGSIGFIEAEKLLEKIKP
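
The relationship between the coding sequence and the protein sequence above can be checked with a small translation routine. This is a minimal sketch that assumes the standard genetic code and reading frame 1; the function name `translate` is illustrative, and only the first 30 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt of the CDS listed above.
cds_prefix = "ATGGCAGCAACAGTTTCATCTTGTATTTTC"
print(translate(cds_prefix))
```

The result, MAATVSSCIF, matches the first ten residues of the protein sequence listed above.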
BLAST of ClCG01G004020 vs. Swiss-Prot
Match: HINT4_ARATH (Bifunctional adenosine 5'-phosphosulfate phosphorylase/adenylylsulfatase HINT4 OS=Arabidopsis thaliana GN=HINT4 PE=1 SV=1)

HSP 1 Score: 190.3 bits (482), Expect = 1.7e-47
Identity = 97/169 (57.40%), Positives = 110/169 (65.09%), Query Frame = 1

Query: 1   MAATVSSCIFCKKASNSTANNLLHFDERVVAFEDINPSAVSGHREWPSMTFASCHLEGAE 60
           MA    +CIFC+   N T   LLH DE+V+AF+DI P+A                     
Sbjct: 1   MAGVNQACIFCEIVRNPTTTRLLHTDEKVIAFQDIKPAA--------------------- 60

Query: 61  DGLYKATKWHFLVIPKEHIPTVRNLQRRAEDYSLVSHMLEVGQTLLSQDSPQLKHRFGFH 120
                  + H+LVIPKEHIPTV +LQRR EDYSLV HML VGQ LL +D+PQ  HRFGFH
Sbjct: 61  -------QRHYLVIPKEHIPTVNDLQRRDEDYSLVRHMLSVGQQLLQKDAPQSIHRFGFH 120

Query: 121 QPPMNSVNHLHLHCFALPYTPRWKFVKYLSLGSI-GFIEAEKLLEKIKP 169
           QPP NSV+HLHLHCFALPY PRWK +KY SLG + GFIEAE LLEKI+P
Sbjct: 121 QPPFNSVDHLHLHCFALPYVPRWKAIKYKSLGPLGGFIEAETLLEKIRP 141
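
The percentages on the HSP summary line are simply the match counts divided by the alignment length. The sketch below, with a hypothetical helper name `hsp_percentages`, recomputes them from the counts for the HINT4_ARATH hit above; it is an illustration of the arithmetic, not part of any BLAST tooling.

```python
import re

# Matches the summary line of an HSP; "Posi?tives" also accepts the
# misspelled label "Postives" that some report renderers emit.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, pos, pos_len = (int(g) for g in m.groups())
    return (round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2))

line = "Identity = 97/169 (57.40%), Positives = 110/169 (65.09%), Query Frame = 1"
print(hsp_percentages(line))
```

For this hit, 97/169 gives 57.40% identity and 110/169 gives 65.09% positives, in agreement with the reported values.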

BLAST of ClCG01G004020 vs. Swiss-Prot
Match: HINT3_XENTR (Histidine triad nucleotide-binding protein 3 OS=Xenopus tropicalis GN=hint3 PE=2 SV=1)

HSP 1 Score: 72.8 bits (177), Expect = 4.1e-12
Identity = 49/164 (29.88%), Positives = 79/164 (48.17%), Query Frame = 1

Query: 7   SCIFCKKASNSTAN-NLLHFDERVVAFEDINPSAVSGHREWPSMTFASCHLEGAEDGLYK 66
           SCIFC+ A+   +   LLH D+ +V F+DI P                            
Sbjct: 18  SCIFCRIANKQESGAELLHSDDDLVCFKDIRP---------------------------- 77

Query: 67  ATKWHFLVIPKEHIPTVRNLQRRAEDYSLVSHMLEVGQTLLSQDS-PQLKH-RFGFHQPP 126
           A   H+LV+PK+H+ T + L +  +   L+  M+EVG++ L +++   L+  R GFH PP
Sbjct: 78  AVTHHYLVVPKKHVGTCKTLTK--DHVQLIKTMMEVGKSTLQKNNVTDLEDIRLGFHYPP 137

Query: 127 MNSVNHLHLHCFALPYTPRWKFVKYLSLGSIGFIEAEKLLEKIK 168
             S++HLHLH  A      +       + S  FI A++L+++++
Sbjct: 138 FCSISHLHLHVLAPASQLGFLSRMIYRVNSYWFITADELIDQLQ 151

BLAST of ClCG01G004020 vs. Swiss-Prot
Match: HINT3_PONAB (Histidine triad nucleotide-binding protein 3 OS=Pongo abelii GN=HINT3 PE=2 SV=1)

HSP 1 Score: 72.4 bits (176), Expect = 5.3e-12
Identity = 51/166 (30.72%), Positives = 76/166 (45.78%), Query Frame = 1

Query: 6   SSCIFCKKASNST-ANNLLHFD-ERVVAFEDINPSAVSGHREWPSMTFASCHLEGAEDGL 65
           S+C+FC+ A        LLH + E ++ F+DI P+A                        
Sbjct: 46  STCVFCRIAGRQDPGTELLHCENEDLICFKDIKPAATH---------------------- 105

Query: 66  YKATKWHFLVIPKEHIPTVRNLQRRAEDYSLVSHMLEVGQTLLSQDS--PQLKHRFGFHQ 125
                 H+LV+PK+HI   R L  R +   LV +M+ VG+T+L +++       R GFH 
Sbjct: 106 ------HYLVVPKKHIGNCRTL--RKDQVELVENMVTVGKTILERNNFTDFTNVRMGFHM 165

Query: 126 PPMNSVNHLHLHCFALPYTPRWKFVKYLSLGSIGFIEAEKLLEKIK 168
           PP  S++HLHLH  A      +       + S  FI A+ L+EK++
Sbjct: 166 PPFCSISHLHLHVLAPVDQLGFLSKLVYRVNSYWFITADHLIEKLR 181

BLAST of ClCG01G004020 vs. Swiss-Prot
Match: HINT3_HUMAN (Histidine triad nucleotide-binding protein 3 OS=Homo sapiens GN=HINT3 PE=1 SV=1)

HSP 1 Score: 72.4 bits (176), Expect = 5.3e-12
Identity = 51/166 (30.72%), Positives = 76/166 (45.78%), Query Frame = 1

Query: 6   SSCIFCKKASNST-ANNLLHFD-ERVVAFEDINPSAVSGHREWPSMTFASCHLEGAEDGL 65
           S+C+FC+ A        LLH + E ++ F+DI P+A                        
Sbjct: 46  STCVFCRIAGRQDPGTELLHCENEDLICFKDIKPAATH---------------------- 105

Query: 66  YKATKWHFLVIPKEHIPTVRNLQRRAEDYSLVSHMLEVGQTLLSQDS--PQLKHRFGFHQ 125
                 H+LV+PK+HI   R L  R +   LV +M+ VG+T+L +++       R GFH 
Sbjct: 106 ------HYLVVPKKHIGNCRTL--RKDQVELVENMVTVGKTILERNNFTDFTNVRMGFHM 165

Query: 126 PPMNSVNHLHLHCFALPYTPRWKFVKYLSLGSIGFIEAEKLLEKIK 168
           PP  S++HLHLH  A      +       + S  FI A+ L+EK++
Sbjct: 166 PPFCSISHLHLHVLAPVDQLGFLSKLVYRVNSYWFITADHLIEKLR 181

BLAST of ClCG01G004020 vs. Swiss-Prot
Match: HINT3_RAT (Histidine triad nucleotide-binding protein 3 OS=Rattus norvegicus GN=Hint3 PE=2 SV=2)

HSP 1 Score: 65.1 bits (157), Expect = 8.5e-10
Identity = 48/166 (28.92%), Positives = 75/166 (45.18%), Query Frame = 1

Query: 6   SSCIFCKKASNSTANNLLHFDER--VVAFEDINPSAVSGHREWPSMTFASCHLEGAEDGL 65
           S+C+FC+ A+       L + E   +V F+DI P+A+                       
Sbjct: 39  SNCVFCRVAAGQEPETELLYCENKDLVCFKDIKPAALH---------------------- 98

Query: 66  YKATKWHFLVIPKEHIPTVRNLQRRAEDYSLVSHMLEVGQTLLSQDS--PQLKHRFGFHQ 125
                 H+LV+PK+HI + ++L +  +   +V  M+ VG+T+L +++       R GFH 
Sbjct: 99  ------HYLVVPKKHIGSCKDLNK--DHIEMVESMVTVGKTILERNNFTDFTDVRMGFHV 158

Query: 126 PPMNSVNHLHLHCFALPYTPRWKFVKYLSLGSIGFIEAEKLLEKIK 168
           PP  SV+HLHLH  A      +         S  FI  + LLEK++
Sbjct: 159 PPFCSVSHLHLHVIAPAKEFGFLSRVVYRRDSYWFITGDYLLEKLR 174

BLAST of ClCG01G004020 vs. TrEMBL
Match: A0A0A0KKN0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G161990 PE=4 SV=1)

HSP 1 Score: 260.4 bits (664), Expect = 1.5e-66
Identity = 132/167 (79.04%), Positives = 135/167 (80.84%), Query Frame = 1

Query: 2   AATVSSCIFCKKASNSTANNLLHFDERVVAFEDINPSAVSGHREWPSMTFASCHLEGAED 61
           AA +SSCIFC+KA NSTAN+LLHFDERVVAFEDINPSAV                     
Sbjct: 4   AAAISSCIFCQKAYNSTANSLLHFDERVVAFEDINPSAVR-------------------- 63

Query: 62  GLYKATKWHFLVIPKEHIPTVRNLQRRAEDYSLVSHMLEVGQTLLSQDSPQLKHRFGFHQ 121
                   HFLVIPKEHIPTVRNLQRRAEDYSLVSHMLEVGQTLLSQDSPQLKHRFGFHQ
Sbjct: 64  --------HFLVIPKEHIPTVRNLQRRAEDYSLVSHMLEVGQTLLSQDSPQLKHRFGFHQ 123

Query: 122 PPMNSVNHLHLHCFALPYTPRWKFVKYLSLGSIGFIEAEKLLEKIKP 169
           PPMNSVNHLHLHCFALPYTPRWKF KYLSLGSIGFIEAEKLLEKIKP
Sbjct: 124 PPMNSVNHLHLHCFALPYTPRWKFAKYLSLGSIGFIEAEKLLEKIKP 142

BLAST of ClCG01G004020 vs. TrEMBL
Match: W9S6A5_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_011808 PE=4 SV=1)

HSP 1 Score: 204.1 bits (518), Expect = 1.3e-49
Identity = 102/167 (61.08%), Positives = 120/167 (71.86%), Query Frame = 1

Query: 2   AATVSSCIFCKKASNSTANNLLHFDERVVAFEDINPSAVSGHREWPSMTFASCHLEGAED 61
           AA  + CIFC+ AS ST+  +LH D++VVAF DINPSA                      
Sbjct: 8   AAFQADCIFCQIASKSTSTTILHSDDKVVAFPDINPSAFR-------------------- 67

Query: 62  GLYKATKWHFLVIPKEHIPTVRNLQRRAEDYSLVSHMLEVGQTLLSQDSPQLKHRFGFHQ 121
                   H+L+I   HIPTVR+LQR+AEDY LVSHMLEVGQTLL++D+PQ K+RFGFHQ
Sbjct: 68  --------HYLIISVAHIPTVRDLQRKAEDYFLVSHMLEVGQTLLARDAPQCKYRFGFHQ 127

Query: 122 PPMNSVNHLHLHCFALPYTPRWKFVKYLSLGSIGFIEAEKLLEKIKP 169
           PPMNSVNHLHLHC ALPYTPRWK +KYLS GS+GFIEA+KLLEK+KP
Sbjct: 128 PPMNSVNHLHLHCQALPYTPRWKCMKYLSFGSLGFIEADKLLEKLKP 146

BLAST of ClCG01G004020 vs. TrEMBL
Match: K4CIM5_SOLLC (Uncharacterized protein OS=Solanum lycopersicum PE=4 SV=1)

HSP 1 Score: 196.8 bits (499), Expect = 2.1e-47
Identity = 98/166 (59.04%), Positives = 122/166 (73.49%), Query Frame = 1

Query: 5   VSSCIFCKKASNSTANNLLHFDERVVAFEDINPSAVSGHREWPSMTFASCHLEGAEDGLY 64
           +S CIFC+ A++ST+  LLH D++VVAF+DINPSA                         
Sbjct: 6   LSECIFCQIATSSTSTTLLHSDDKVVAFQDINPSAFR----------------------- 65

Query: 65  KATKWHFLVIPKEHIPTVRNLQRRAEDYSLVSHMLEVGQTLLSQDSPQLKH-RFGFHQPP 124
                H+LVIPK+HIPTV+NLQR ++D+SLVSHML+VG++LL +D+PQ KH RFGFHQPP
Sbjct: 66  -----HYLVIPKQHIPTVKNLQRSSDDFSLVSHMLDVGKSLLDRDAPQSKHYRFGFHQPP 125

Query: 125 MNSVNHLHLHCFALPYTPRWKFVKYLSLGSI-GFIEAEKLLEKIKP 169
            NSV+HLHLHCFALPYTP W+F+KYLSLG + GFIE EKLLE+IKP
Sbjct: 126 FNSVDHLHLHCFALPYTPSWRFMKYLSLGPLGGFIEVEKLLERIKP 143

BLAST of ClCG01G004020 vs. TrEMBL
Match: M5WVX2_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa012871mg PE=4 SV=1)

HSP 1 Score: 194.5 bits (493), Expect = 1.0e-46
Identity = 96/168 (57.14%), Positives = 119/168 (70.83%), Query Frame = 1

Query: 2   AATVSSCIFCKKASNSTANNLLHFDERVVAFEDINPSAVSGHREWPSMTFASCHLEGAED 61
           AA  S CIFC+ AS ST+  LLH D++VVAF+DI P+AV                     
Sbjct: 4   AAAASQCIFCQIASKSTSTTLLHTDDKVVAFQDIRPAAVR-------------------- 63

Query: 62  GLYKATKWHFLVIPKEHIPTVRNLQRRAEDYSLVSHMLEVGQTLLSQDSPQL-KHRFGFH 121
                   H+LVIP +HIPTV++LQR+ EDYSLV+HMLEVG+ L+ +D+PQ  ++RFGFH
Sbjct: 64  --------HYLVIPVDHIPTVKDLQRKPEDYSLVNHMLEVGKMLVQRDAPQCHQYRFGFH 123

Query: 122 QPPMNSVNHLHLHCFALPYTPRWKFVKYLSLGSIGFIEAEKLLEKIKP 169
           QPP NSVNHLHLHCFALPYTPRWK +KY+++G  GF+EAEKLL KIKP
Sbjct: 124 QPPFNSVNHLHLHCFALPYTPRWKCMKYIAMGPFGFLEAEKLLGKIKP 143

BLAST of ClCG01G004020 vs. TrEMBL
Match: I1LGP6_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_11G032300 PE=4 SV=1)

HSP 1 Score: 194.1 bits (492), Expect = 1.3e-46
Identity = 98/170 (57.65%), Positives = 121/170 (71.18%), Query Frame = 1

Query: 1   MAATVSSCIFCKKASNSTANN-LLHFDERVVAFEDINPSAVSGHREWPSMTFASCHLEGA 60
           MA    SC+FC  A+ ST++N LL+ D++VVAF+DINPSA                    
Sbjct: 1   MAGATPSCVFCAIAAKSTSSNTLLYSDDKVVAFQDINPSAFR------------------ 60

Query: 61  EDGLYKATKWHFLVIPKEHIPTVRNLQRRAEDYSLVSHMLEVGQTLLSQDSPQLK-HRFG 120
                     H+LV+P  HIPTV+ LQR+ +DYSLVSHMLEVG+TLL++D+PQ + +RFG
Sbjct: 61  ----------HYLVVPVAHIPTVKYLQRKTDDYSLVSHMLEVGKTLLNRDAPQSQQYRFG 120

Query: 121 FHQPPMNSVNHLHLHCFALPYTPRWKFVKYLSLGSIGFIEAEKLLEKIKP 169
           FHQPP+NSVNHLHLHC ALPYTPRW+ +KYLSLG +GFIEAEK LEKIKP
Sbjct: 121 FHQPPLNSVNHLHLHCLALPYTPRWRSIKYLSLGPLGFIEAEKFLEKIKP 142

BLAST of ClCG01G004020 vs. TAIR10
Match: AT4G16566.1 (AT4G16566.1 histidine triad nucleotide-binding 4)

HSP 1 Score: 190.3 bits (482), Expect = 9.8e-49
Identity = 97/169 (57.40%), Positives = 110/169 (65.09%), Query Frame = 1

Query: 1   MAATVSSCIFCKKASNSTANNLLHFDERVVAFEDINPSAVSGHREWPSMTFASCHLEGAE 60
           MA    +CIFC+   N T   LLH DE+V+AF+DI P+A                     
Sbjct: 1   MAGVNQACIFCEIVRNPTTTRLLHTDEKVIAFQDIKPAA--------------------- 60

Query: 61  DGLYKATKWHFLVIPKEHIPTVRNLQRRAEDYSLVSHMLEVGQTLLSQDSPQLKHRFGFH 120
                  + H+LVIPKEHIPTV +LQRR EDYSLV HML VGQ LL +D+PQ  HRFGFH
Sbjct: 61  -------QRHYLVIPKEHIPTVNDLQRRDEDYSLVRHMLSVGQQLLQKDAPQSIHRFGFH 120

Query: 121 QPPMNSVNHLHLHCFALPYTPRWKFVKYLSLGSI-GFIEAEKLLEKIKP 169
           QPP NSV+HLHLHCFALPY PRWK +KY SLG + GFIEAE LLEKI+P
Sbjct: 121 QPPFNSVDHLHLHCFALPYVPRWKAIKYKSLGPLGGFIEAETLLEKIRP 141

BLAST of ClCG01G004020 vs. NCBI nr
Match: gi|659073483|ref|XP_008437084.1| (PREDICTED: histidine triad nucleotide-binding protein 3 isoform X2 [Cucumis melo])

HSP 1 Score: 261.9 bits (668), Expect = 7.5e-67
Identity = 133/167 (79.64%), Positives = 135/167 (80.84%), Query Frame = 1

Query: 2   AATVSSCIFCKKASNSTANNLLHFDERVVAFEDINPSAVSGHREWPSMTFASCHLEGAED 61
           A T+SSCIFC+KASNSTANNLLHFDERVVAFEDINPSAV                     
Sbjct: 3   ATTISSCIFCQKASNSTANNLLHFDERVVAFEDINPSAVR-------------------- 62

Query: 62  GLYKATKWHFLVIPKEHIPTVRNLQRRAEDYSLVSHMLEVGQTLLSQDSPQLKHRFGFHQ 121
                   HFLVIPKEHIPTVRNLQRRAEDYSLVSHML VGQTLLSQDSPQLKHRFGFHQ
Sbjct: 63  --------HFLVIPKEHIPTVRNLQRRAEDYSLVSHMLGVGQTLLSQDSPQLKHRFGFHQ 122

Query: 122 PPMNSVNHLHLHCFALPYTPRWKFVKYLSLGSIGFIEAEKLLEKIKP 169
           PPMNSVNHLHLHCFALPY PRWKFVKYLSLGSIGFIEAEKLLEKIKP
Sbjct: 123 PPMNSVNHLHLHCFALPYAPRWKFVKYLSLGSIGFIEAEKLLEKIKP 141

BLAST of ClCG01G004020 vs. NCBI nr
Match: gi|449469392|ref|XP_004152404.1| (PREDICTED: histidine triad nucleotide-binding protein 3 isoform X2 [Cucumis sativus])

HSP 1 Score: 260.4 bits (664), Expect = 2.2e-66
Identity = 132/167 (79.04%), Positives = 135/167 (80.84%), Query Frame = 1

Query: 2   AATVSSCIFCKKASNSTANNLLHFDERVVAFEDINPSAVSGHREWPSMTFASCHLEGAED 61
           AA +SSCIFC+KA NSTAN+LLHFDERVVAFEDINPSAV                     
Sbjct: 4   AAAISSCIFCQKAYNSTANSLLHFDERVVAFEDINPSAVR-------------------- 63

Query: 62  GLYKATKWHFLVIPKEHIPTVRNLQRRAEDYSLVSHMLEVGQTLLSQDSPQLKHRFGFHQ 121
                   HFLVIPKEHIPTVRNLQRRAEDYSLVSHMLEVGQTLLSQDSPQLKHRFGFHQ
Sbjct: 64  --------HFLVIPKEHIPTVRNLQRRAEDYSLVSHMLEVGQTLLSQDSPQLKHRFGFHQ 123

Query: 122 PPMNSVNHLHLHCFALPYTPRWKFVKYLSLGSIGFIEAEKLLEKIKP 169
           PPMNSVNHLHLHCFALPYTPRWKF KYLSLGSIGFIEAEKLLEKIKP
Sbjct: 124 PPMNSVNHLHLHCFALPYTPRWKFAKYLSLGSIGFIEAEKLLEKIKP 142

BLAST of ClCG01G004020 vs. NCBI nr
Match: gi|659073481|ref|XP_008437083.1| (PREDICTED: histidine triad nucleotide-binding protein 3 isoform X1 [Cucumis melo])

HSP 1 Score: 257.3 bits (656), Expect = 1.9e-65
Identity = 133/168 (79.17%), Positives = 135/168 (80.36%), Query Frame = 1

Query: 2   AATVSSCIFCKKASNSTANNLLHFDERVVAFEDINPSAVSGHREWPSMTFASCHLEGAED 61
           A T+SSCIFC+KASNSTANNLLHFDERVVAFEDINPSAV                     
Sbjct: 3   ATTISSCIFCQKASNSTANNLLHFDERVVAFEDINPSAVR-------------------- 62

Query: 62  GLYKATKWHFLVIPKEHIPTVRNLQRRAEDYSL-VSHMLEVGQTLLSQDSPQLKHRFGFH 121
                   HFLVIPKEHIPTVRNLQRRAEDYSL VSHML VGQTLLSQDSPQLKHRFGFH
Sbjct: 63  --------HFLVIPKEHIPTVRNLQRRAEDYSLAVSHMLGVGQTLLSQDSPQLKHRFGFH 122

Query: 122 QPPMNSVNHLHLHCFALPYTPRWKFVKYLSLGSIGFIEAEKLLEKIKP 169
           QPPMNSVNHLHLHCFALPY PRWKFVKYLSLGSIGFIEAEKLLEKIKP
Sbjct: 123 QPPMNSVNHLHLHCFALPYAPRWKFVKYLSLGSIGFIEAEKLLEKIKP 142

BLAST of ClCG01G004020 vs. NCBI nr
Match: gi|778700040|ref|XP_011654800.1| (PREDICTED: histidine triad nucleotide-binding protein 3 isoform X1 [Cucumis sativus])

HSP 1 Score: 255.8 bits (652), Expect = 5.4e-65
Identity = 132/168 (78.57%), Positives = 135/168 (80.36%), Query Frame = 1

Query: 2   AATVSSCIFCKKASNSTANNLLHFDERVVAFEDINPSAVSGHREWPSMTFASCHLEGAED 61
           AA +SSCIFC+KA NSTAN+LLHFDERVVAFEDINPSAV                     
Sbjct: 4   AAAISSCIFCQKAYNSTANSLLHFDERVVAFEDINPSAVR-------------------- 63

Query: 62  GLYKATKWHFLVIPKEHIPTVRNLQRRAEDYSL-VSHMLEVGQTLLSQDSPQLKHRFGFH 121
                   HFLVIPKEHIPTVRNLQRRAEDYSL VSHMLEVGQTLLSQDSPQLKHRFGFH
Sbjct: 64  --------HFLVIPKEHIPTVRNLQRRAEDYSLAVSHMLEVGQTLLSQDSPQLKHRFGFH 123

Query: 122 QPPMNSVNHLHLHCFALPYTPRWKFVKYLSLGSIGFIEAEKLLEKIKP 169
           QPPMNSVNHLHLHCFALPYTPRWKF KYLSLGSIGFIEAEKLLEKIKP
Sbjct: 124 QPPMNSVNHLHLHCFALPYTPRWKFAKYLSLGSIGFIEAEKLLEKIKP 143

BLAST of ClCG01G004020 vs. NCBI nr
Match: gi|659073491|ref|XP_008437088.1| (PREDICTED: histidine triad nucleotide-binding protein 3 isoform X4 [Cucumis melo])

HSP 1 Score: 246.1 bits (627), Expect = 4.3e-62
Identity = 116/120 (96.67%), Positives = 116/120 (96.67%), Query Frame = 1

Query: 49  MTFASCHLEGAEDGLYKATKWHFLVIPKEHIPTVRNLQRRAEDYSLVSHMLEVGQTLLSQ 108
           MTF SCHLEGAEDGLYKAT WHFLVIPKEHIPTVRNLQRRAEDYSLVSHML VGQTLLSQ
Sbjct: 1   MTFVSCHLEGAEDGLYKATTWHFLVIPKEHIPTVRNLQRRAEDYSLVSHMLGVGQTLLSQ 60

Query: 109 DSPQLKHRFGFHQPPMNSVNHLHLHCFALPYTPRWKFVKYLSLGSIGFIEAEKLLEKIKP 168
           DSPQLKHRFGFHQPPMNSVNHLHLHCFALPY PRWKFVKYLSLGSIGFIEAEKLLEKIKP
Sbjct: 61  DSPQLKHRFGFHQPPMNSVNHLHLHCFALPYAPRWKFVKYLSLGSIGFIEAEKLLEKIKP 120

The following BLAST results are available for this feature:
Swiss-Prot
Match Name    E-value  Identity (%)  Description
HINT4_ARATH   1.7e-47  57.40         Bifunctional adenosine 5'-phosphosulfate phosphorylase/adenylylsulfatase HINT4 O...
HINT3_XENTR   4.1e-12  29.88         Histidine triad nucleotide-binding protein 3 OS=Xenopus tropicalis GN=hint3 PE=2...
HINT3_PONAB   5.3e-12  30.72         Histidine triad nucleotide-binding protein 3 OS=Pongo abelii GN=HINT3 PE=2 SV=1
HINT3_HUMAN   5.3e-12  30.72         Histidine triad nucleotide-binding protein 3 OS=Homo sapiens GN=HINT3 PE=1 SV=1
HINT3_RAT     8.5e-10  28.92         Histidine triad nucleotide-binding protein 3 OS=Rattus norvegicus GN=Hint3 PE=2 ...

TrEMBL
Match Name        E-value  Identity (%)  Description
A0A0A0KKN0_CUCSA  1.5e-66  79.04         Uncharacterized protein OS=Cucumis sativus GN=Csa_5G161990 PE=4 SV=1
W9S6A5_9ROSA      1.3e-49  61.08         Uncharacterized protein OS=Morus notabilis GN=L484_011808 PE=4 SV=1
K4CIM5_SOLLC      2.1e-47  59.04         Uncharacterized protein OS=Solanum lycopersicum PE=4 SV=1
M5WVX2_PRUPE      1.0e-46  57.14         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa012871mg PE=4 SV=1
I1LGP6_SOYBN      1.3e-46  57.65         Uncharacterized protein OS=Glycine max GN=GLYMA_11G032300 PE=4 SV=1

TAIR10
Match Name   E-value  Identity (%)  Description
AT4G16566.1  9.8e-49  57.40         histidine triad nucleotide-binding 4

NCBI nr
Match Name                        E-value  Identity (%)  Description
gi|659073483|ref|XP_008437084.1|  7.5e-67  79.64         PREDICTED: histidine triad nucleotide-binding protein 3 isoform X2 [Cucumis melo...
gi|449469392|ref|XP_004152404.1|  2.2e-66  79.04         PREDICTED: histidine triad nucleotide-binding protein 3 isoform X2 [Cucumis sati...
gi|659073481|ref|XP_008437083.1|  1.9e-65  79.17         PREDICTED: histidine triad nucleotide-binding protein 3 isoform X1 [Cucumis melo...
gi|778700040|ref|XP_011654800.1|  5.4e-65  78.57         PREDICTED: histidine triad nucleotide-binding protein 3 isoform X1 [Cucumis sati...
gi|659073491|ref|XP_008437088.1|  4.3e-62  96.67         PREDICTED: histidine triad nucleotide-binding protein 3 isoform X4 [Cucumis melo...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR011146   HIT-like
Vocabulary: Molecular Function
Term        Definition
GO:0003824  catalytic activity
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
biological_process GO:0009150 purine ribonucleotide metabolic process
biological_process GO:0006790 sulfur compound metabolic process
biological_process GO:0044237 cellular metabolic process
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
cellular_component GO:0005777 peroxisome
molecular_function GO:0003824 catalytic activity
molecular_function GO:0047627 adenylylsulfatase activity
molecular_function GO:0004780 sulfate adenylyltransferase (ADP) activity
molecular_function GO:0016787 hydrolase activity
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
ClCG01G004020.1  ClCG01G004020.1  mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term   IPR Description   Source   Source Term        Source Description  Alignment
IPR011146  HIT-like domain   GENE3D   G3DSA:3.30.428.10  -                   coord: 4..135, score: 5.4
IPR011146  HIT-like domain   unknown  SSF54197           HIT-like            coord: 69..141, score: 6.32E-21; coord: 7..40, score: 6.32
None       No IPR available  PANTHER  PTHR12486          APRATAXIN-RELATED   coord: 1..39, score: 9.8E-46; coord: 68..167, score: 9.8
None       No IPR available  PFAM     PF11969            DcpS_C              coord: 65..142, score: 3.2

The following gene(s) are paralogous to this gene:

None