ClCG01G021480 (gene) Watermelon (Charleston Gray)

Name: ClCG01G021480
Type: gene
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: glycine-rich protein LENGTH=246
Location: CG_Chr01 : 35283412 .. 35285728 (+)
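
The location field packs a sequence ID, a start, an end, and a strand into one string. The sketch below shows one way to parse it and derive the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state, and the function name `parse_location` is hypothetical rather than part of any database API.

```python
import re

def parse_location(text):
    """Parse 'seqid : start .. end (strand)' into its parts.

    Assumption: start and end are 1-based and inclusive,
    so the span length is end - start + 1.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("CG_Chr01 : 35283412 .. 35285728 (+)")
print(loc["seqid"], loc["strand"], loc["length"])  # CG_Chr01 + 2317
```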
   



The following sequences are available for this feature:

Gene sequence (with intron)

GCAAAACAAACTCAAAACCCTGAGGCCAAGGCTAGCGCTCGTTTTTCTTCTCCGACTCTGCCCATCGCATTTGAAAGCGCAGAGGCCGGCGTTTATCGGCCGGGATATGCGAAAACAATGCTTCAGGTTCTCAATCTAAGACCGCCAACTCCGATTCTGGCCCTACCGATCTCGATTTCCGATGACCCAACTTGCTCCGCCCTCCCATTACTTCGCCCTCGTAATGCAACGCACAATTGGGCGCTTTTACAGTCCAACCTCAAGTGCAACGGCAGATTCTCTTGCCTTTTCTCAGACAATAGAAGAGAGGTACTGCCTTTTCTTTTCTTTGTTTCTTTCAACCTTCAACTGATTAGAATTTGTAGCTTGTTCTTTTCTCTGTTGTGTTCGTTGTTAATAACCTAGTTAATTTTGGGAGATTGATATTTCCGGCCGGATTTTCTTTACTTTTTCGAGGGAAGGTGAGTGGGAGGCGAAACTTTCTGCAGTGGATTTGCCTTCTGTGGCTTACTTTTTGGGCGAAATTTTGTAAAACATGTGGAAGGAACACTGAGGTTGGGTAATTACTTCAAAGCTAGGATGGGAGATGGGGATAATTTGGAGTTGAAACTTAATACCCTAGTTACCTTATTCCATAACAAGCATCAAGTGGACGAGAATTGGATAGCTTTCCTTGGTATTGCGAAGCTATTTGTACATTTATTAGAAATTTTAGTGGATACTCATCTTCCTGTACTTGAATTTTTTAACTTAGAAGACCATAGATTTGATCTTTTTTATGAAGAATCCTGGAAGCTTTCTTGTCAAGATTGCATGTACTTTGTTTTTAGTACAATATAGGTGGAGAATTTGAACTACAGTTCTCTTAGTTGCTAGAACATAATTGTATGTGAGATGAGCTATGCTCACTTTAGCAATTGAGTGTAATCTGTTGATGCTGGGTTTCCAACTATATCTGTCTACTTTTGCTGCATAAATTTCGCCCTAAGAGGTAAAAAAGGGGTCAACCAATCTAGCTGACTGGCTGCTGAAAGATTGAAGTATGAATTGTTCTGCTCCTGTTTGAGAGAGGACTTCACCTATATTGTAGGGCCAGACGGGTTGTTTCGTGAAACTAGTCGGAGTGTGTGTAAGTTGGCCTGAACGCTCACAAATATTAAAAAAATAAAATAAAAAAATAAGGAAGCATTCTTCTAACATATCTTTTTTTGTCCATGGGAATCATAAATGACTAGCATTTATCCTTGGTAGCTGTAGCCACCCTGACTCACCCAGTTACTCATTTCCCTTTCCCCAGGGCAGTGTCATTATTGTTAGAGATGCACATCCATCCCTTCCCTTTCCACCCAAAAGAAGAAAAAAAGAAAAAAATAAACAACAATGAGAATTGACTTTATACTGTTGTTTAGTTTTGTGTGATCCAATGTACCTGATATGTTCCATATACAAACTTGCAATGCATTCTTCTTCATTGTTTGGCCCAACAAAGTGTGTTATGGAGCTGGAACAAAAGGAAATCTTGCTTAAATTTTCCTTCTCTCTTTGTTGGTAGCAGGAACAGGCAAGGAAGGCATTAGAAAGTGCACTTGGGGGAAAGAAAAATGAATTTGAGAAATGGAATAATGAAATAAAGAAAAGAGAGGAGATGGGTGGTGGTGGAGGTGGACGAGGAGGTTGGTTTGGATCGGGAGGATGGTTTGGTTGGTCTGATGACCAATTCTGGCCAGAAGCACAACAGACTAGTCTTGCTGTTTTAGGTATAATTGTCATGGTAAGTATTTTCATTTGCCCATCATAAATTTTGCTTATAATTGTTGGGAATGCAATCTAACTAACTAATCTCTCTTTTCACTGCTGCAAATTGTCAATAAAAGTATCTCATAGTTGCGAAAGGTGAACTGTTGCTTGCTGTTGTTTTCAACCCACTGCTTTATGCTTTGCGAGGAACGAGAAATGGATTGACTTTTGTTACTTCAAAAATTTTGAGAAAGACCTCTGTT
AGTAATTCTGCTGAGTTTGAGGCGATTTCAAACAAACAAGTCTCTGGCCATGTCTCTGCCAAAGAGAGAGTTGCAAGGAAATGGGGGAGCGATTGATTTTTCTTCCCATTTACTTTTGGCCTTAATGTTTCAACTATTCAAATAGGTTGCTATGCGAATTTTTGGAAAATGTTTGGCTCCTAGAATTAGTTTTTTCTTTTCTTCTTTTTTTTATGTTGGTTGCTTTCTTTCATTTTAAAGAATAGCTTCCTTAAACGAGATGCTTGAGTTTGTTCCCCTTTTCAACAAAAACCACCCCAAAAGAGACAAAAAAAAAA

mRNA sequence

GCAAAACAAACTCAAAACCCTGAGGCCAAGGCTAGCGCTCGTTTTTCTTCTCCGACTCTGCCCATCGCATTTGAAAGCGCAGAGGCCGGCGTTTATCGGCCGGGATATGCGAAAACAATGCTTCAGGTTCTCAATCTAAGACCGCCAACTCCGATTCTGGCCCTACCGATCTCGATTTCCGATGACCCAACTTGCTCCGCCCTCCCATTACTTCGCCCTCGTAATGCAACGCACAATTGGGCGCTTTTACAGTCCAACCTCAAGTGCAACGGCAGATTCTCTTGCCTTTTCTCAGACAATAGAAGAGAGGAACAGGCAAGGAAGGCATTAGAAAGTGCACTTGGGGGAAAGAAAAATGAATTTGAGAAATGGAATAATGAAATAAAGAAAAGAGAGGAGATGGGTGGTGGTGGAGGTGGACGAGGAGGTTGGTTTGGATCGGGAGGATGGTTTGGTTGGTCTGATGACCAATTCTGGCCAGAAGCACAACAGACTAGTCTTGCTGTTTTAGGTATAATTGTCATGTATCTCATAGTTGCGAAAGGTGAACTGTTGCTTGCTGTTGTTTTCAACCCACTGCTTTATGCTTTGCGAGGAACGAGAAATGGATTGACTTTTGTTACTTCAAAAATTTTGAGAAAGACCTCTGTTAGTAATTCTGCTGAGTTTGAGGCGATTTCAAACAAACAAGTCTCTGGCCATGTCTCTGCCAAAGAGAGAGTTGCAAGGAAATGGGGGAGCGATTGATTTTTCTTCCCATTTACTTTTGGCCTTAATGTTTCAACTATTCAAATAGGTTGCTATGCGAATTTTTGGAAAATGTTTGGCTCCTAGAATTAGTTTTTTCTTTTCTTCTTTTTTTTATGTTGGTTGCTTTCTTTCATTTTAAAGAATAGCTTCCTTAAACGAGATGCTTGAGTTTGTTCCCCTTTTCAACAAAAACCACCCCAAAAGAGACAAAAAAAAAA

Coding sequence (CDS)

ATGCTTCAGGTTCTCAATCTAAGACCGCCAACTCCGATTCTGGCCCTACCGATCTCGATTTCCGATGACCCAACTTGCTCCGCCCTCCCATTACTTCGCCCTCGTAATGCAACGCACAATTGGGCGCTTTTACAGTCCAACCTCAAGTGCAACGGCAGATTCTCTTGCCTTTTCTCAGACAATAGAAGAGAGGAACAGGCAAGGAAGGCATTAGAAAGTGCACTTGGGGGAAAGAAAAATGAATTTGAGAAATGGAATAATGAAATAAAGAAAAGAGAGGAGATGGGTGGTGGTGGAGGTGGACGAGGAGGTTGGTTTGGATCGGGAGGATGGTTTGGTTGGTCTGATGACCAATTCTGGCCAGAAGCACAACAGACTAGTCTTGCTGTTTTAGGTATAATTGTCATGTATCTCATAGTTGCGAAAGGTGAACTGTTGCTTGCTGTTGTTTTCAACCCACTGCTTTATGCTTTGCGAGGAACGAGAAATGGATTGACTTTTGTTACTTCAAAAATTTTGAGAAAGACCTCTGTTAGTAATTCTGCTGAGTTTGAGGCGATTTCAAACAAACAAGTCTCTGGCCATGTCTCTGCCAAAGAGAGAGTTGCAAGGAAATGGGGGAGCGATTGA

Protein sequence

MLQVLNLRPPTPILALPISISDDPTCSALPLLRPRNATHNWALLQSNLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEMGGGGGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSVSNSAEFEAISNKQVSGHVSAKERVARKWGSD
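
The protein sequence corresponds to the CDS read codon by codon. As a minimal sketch, assuming the standard nuclear genetic code (the page does not name the translation table used), the helper below translates a CDS string and stops at the first stop codon. The name `translate` is illustrative only.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)

# First seven codons and last four codons of the CDS listed above.
print(translate("ATGCTTCAGGTTCTCAATCTA"))  # MLQVLNL
print(translate("GGGAGCGATTGA"))           # GSD
```

The two snippets reproduce the first seven and last three residues of the protein sequence shown above.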
BLAST of ClCG01G021480 vs. TrEMBL
Match: A0A061GG86_THECC (Sulfate adenylyltransferase subunit 2 OS=Theobroma cacao GN=TCM_030362 PE=4 SV=1)

HSP 1 Score: 268.1 bits (684), Expect = 9.1e-69
Identity = 143/214 (66.82%), Positives = 161/214 (75.23%), Query Frame = 1

Query: 1   MLQVLNLRPPTPILALPISIS---DDPTCSALPLLRPRNATHNWALLQSNLKCNGRFSCL 60
           M QVLNL P      L  S S   + P   +L   R +N   NW+ L  NLKCNGRFSCL
Sbjct: 1   MAQVLNLNP------LGSSYSTRPESPGFRSLNAARSQNVARNWSSLLQNLKCNGRFSCL 60

Query: 61  FSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEMGGGG-GGRGGWFGSGGWFGWS- 120
           FSDNRREEQARKALESALGGKK+EFEKWN EIK+REE GGG   G GGWFG GG FGWS 
Sbjct: 61  FSDNRREEQARKALESALGGKKSEFEKWNKEIKRREEAGGGDDAGGGGWFGWGGRFGWSN 120

Query: 121 DDQFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRK 180
           DD FW EAQQTSLAVLGIIVMYLI+AKGEL+LAVVFNPLLYALRGTR+GLT+VTS+IL K
Sbjct: 121 DDHFWQEAQQTSLAVLGIIVMYLIIAKGELMLAVVFNPLLYALRGTRSGLTYVTSRILGK 180

Query: 181 TSVSNSAEFEAISNKQVSGHVSAKERVARKWGSD 210
            +     +   +SNK+  G+VSAKE V +KWGS+
Sbjct: 181 RNADGPPDSCNMSNKETHGYVSAKENVLKKWGSN 208
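
Each HSP statistics line reports counts as matches over alignment length, followed by a percentage. The sketch below recomputes those percentages from the counts. The function name `hsp_percentages` is hypothetical, and the pattern simply picks up every `Label = n/m` pair, using whatever label spelling appears on the line.

```python
import re

def hsp_percentages(stats_line):
    """Return {label: percent} for every 'Label = matches/length' pair."""
    result = {}
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", stats_line):
        result[label] = round(100 * int(matches) / int(length), 2)
    return result

line = "Identity = 143/214 (66.82%), Positives = 161/214 (75.23%), Query Frame = 1"
print(hsp_percentages(line))  # {'Identity': 66.82, 'Positives': 75.23}
```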

BLAST of ClCG01G021480 vs. TrEMBL
Match: F6HRM1_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_00s0187g00200 PE=4 SV=1)

HSP 1 Score: 256.9 bits (655), Expect = 2.1e-65
Identity = 133/213 (62.44%), Positives = 158/213 (74.18%), Query Frame = 1

Query: 1   MLQVLNLRPP-TPILALPISISDDPTCSALP-LLRPRNATHNWALLQSNLKCNGRFSCLF 60
           M Q+L LRPP T   + P + +D    S  P  L P+     WA LQ  LKCNGRFSCLF
Sbjct: 1   MAQLLTLRPPITAFRSAPSTRTDSAVLSCCPKTLNPK-----WAFLQQKLKCNGRFSCLF 60

Query: 61  SDNRREEQARKALESALGGKKNEFEKWNNEIKKREEMGGGG-GGRGGWFGSGGWFGWS-D 120
           S+NR++E+ARKALESALGGKK+EFEKWN EIKKREE+GGGG  G GGWFG GG FGWS D
Sbjct: 61  SNNRKQEEARKALESALGGKKDEFEKWNKEIKKREEVGGGGEAGGGGWFGWGGRFGWSND 120

Query: 121 DQFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKT 180
           D FW EAQQ SLA+LGII+MYLI+AKGE+LLAV+FNPLL+ALRGTRNG  F+ S+IL+K 
Sbjct: 121 DHFWQEAQQASLAILGIIIMYLIIAKGEVLLAVIFNPLLFALRGTRNGFNFIISRILQKV 180

Query: 181 SVSNSAEFEAISNKQVSGHVSAKERVARKWGSD 210
           S +     + +  K+V   VSAKE V RKWG +
Sbjct: 181 SPATHTGLDNMPKKEVYTPVSAKESVVRKWGGE 208

BLAST of ClCG01G021480 vs. TrEMBL
Match: M5WV92_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa011534mg PE=4 SV=1)

HSP 1 Score: 256.1 bits (653), Expect = 3.6e-65
Identity = 134/210 (63.81%), Positives = 151/210 (71.90%), Query Frame = 1

Query: 1   MLQVLNLRPPTPILALPISISDDPTCSALPLLRPRNATHNWALLQSNLKCNGRFSCLFSD 60
           M  VLNL PP+ IL    S    PT S+    R    T +W  L   LKC GRFSCLFSD
Sbjct: 1   MTLVLNLIPPSQILLHSSSSHPLPTTSS----RQNETTQDWTALLFKLKCRGRFSCLFSD 60

Query: 61  NRREEQARKALESALGGKKNEFEKWNNEIKKREEMGGGGG-GRGGWFGSGGWFGWSD-DQ 120
           NR+EEQARKALE ALGGKK+EFEKW+ EIK+REE+GGGG  G GGWFG  GWFGWS+ D 
Sbjct: 61  NRKEEQARKALEGALGGKKSEFEKWDKEIKRREEVGGGGSAGGGGWFGWRGWFGWSNGDH 120

Query: 121 FWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSV 180
           FW EAQQ SLAVLGII+MYLI+AKGEL+LAV+FNPLLYALRGTRN   F+TSKILRK   
Sbjct: 121 FWREAQQASLAVLGIILMYLIIAKGELMLAVIFNPLLYALRGTRNVFAFITSKILRKRGP 180

Query: 181 SNSAEFEAISNKQVSGHVSAKERVARKWGS 209
                F+ IS  +    VSAK+ V RKWGS
Sbjct: 181 DGQVVFDNISKNEAYSSVSAKDSVLRKWGS 206

BLAST of ClCG01G021480 vs. TrEMBL
Match: A0A0B2R3Q2_GLYSO (Uncharacterized protein OS=Glycine soja GN=glysoja_045818 PE=4 SV=1)

HSP 1 Score: 255.0 bits (650), Expect = 8.0e-65
Identity = 119/194 (61.34%), Positives = 148/194 (76.29%), Query Frame = 1

Query: 18  ISISDDPTCSALPLLRPRNATHNWALLQSNLKCNGRFSCLFSDNRREEQARKALESALGG 77
           +++   P+ S   L   R+ + NW  LQ  LKCNGRF CLFSDNR+EEQARKALE AL G
Sbjct: 7   LNLGGPPSLSGRKLPPCRSNSQNWTCLQHKLKCNGRFLCLFSDNRKEEQARKALEGALSG 66

Query: 78  KKNEFEKWNNEIKKREEMGGGGG-GRGGWFGSGGWFGWS-DDQFWPEAQQTSLAVLGIIV 137
           KKNEF+KW+ EIK+REE+GGGG  G GGWFG G WFGWS DD FW EA+Q  L +LG++V
Sbjct: 67  KKNEFDKWDKEIKRREELGGGGDTGGGGWFGWGRWFGWSNDDNFWQEAKQAILTILGLVV 126

Query: 138 MYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSVSNSAEFEAISNKQVSGH 197
           +YL++AKG++LLAV+FNPLLYALRG RNG  F++SK+L+ TS SN  +F+ +S K+   H
Sbjct: 127 VYLLIAKGDMLLAVIFNPLLYALRGVRNGFGFISSKVLKNTSTSNQPDFDGLSKKKAYQH 186

Query: 198 VSAKERVARKWGSD 210
            SAKE V RKWGSD
Sbjct: 187 TSAKENVVRKWGSD 200

BLAST of ClCG01G021480 vs. TrEMBL
Match: I1KNI8_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_07G265300 PE=4 SV=1)

HSP 1 Score: 255.0 bits (650), Expect = 8.0e-65
Identity = 119/194 (61.34%), Positives = 148/194 (76.29%), Query Frame = 1

Query: 18  ISISDDPTCSALPLLRPRNATHNWALLQSNLKCNGRFSCLFSDNRREEQARKALESALGG 77
           +++   P+ S   L   R+ + NW  LQ  LKCNGRF CLFSDNR+EEQARKALE AL G
Sbjct: 7   LNLGGPPSLSGRKLPPCRSNSQNWTCLQHKLKCNGRFLCLFSDNRKEEQARKALEGALSG 66

Query: 78  KKNEFEKWNNEIKKREEMGGGGG-GRGGWFGSGGWFGWS-DDQFWPEAQQTSLAVLGIIV 137
           KKNEF+KW+ EIK+REE+GGGG  G GGWFG G WFGWS DD FW EA+Q  L +LG++V
Sbjct: 67  KKNEFDKWDKEIKRREELGGGGDTGGGGWFGWGRWFGWSNDDNFWQEAKQAILTILGLVV 126

Query: 138 MYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSVSNSAEFEAISNKQVSGH 197
           +YL++AKG++LLAV+FNPLLYALRG RNG  F++SK+L+ TS SN  +F+ +S K+   H
Sbjct: 127 VYLLIAKGDMLLAVIFNPLLYALRGVRNGFGFISSKVLKNTSTSNQPDFDGLSKKKAYQH 186

Query: 198 VSAKERVARKWGSD 210
            SAKE V RKWGSD
Sbjct: 187 TSAKENVVRKWGSD 200

BLAST of ClCG01G021480 vs. TAIR10
Match: AT5G20130.1 (AT5G20130.1 unknown protein)

HSP 1 Score: 193.0 bits (489), Expect = 1.9e-49
Identity = 104/175 (59.43%), Positives = 125/175 (71.43%), Query Frame = 1

Query: 41  WALLQSNLKCNGRFSCLFSD-NRREEQARKALESALGGKKNEFEKWNNEIKKREEMGGG- 100
           ++L  S  K  GRFSCLFS  N+REEQARK+LESALGGKKNEFEKW+ EIKKREE GGG 
Sbjct: 33  FSLRTSITKSKGRFSCLFSGGNQREEQARKSLESALGGKKNEFEKWDKEIKKREESGGGN 92

Query: 101 ---GGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLL 160
              GGG GGWFG GGWF  S D FW EAQQ +  +L I+ +Y++VAKGE++ A V NPLL
Sbjct: 93  GGKGGGGGGWFGGGGWF--SGDHFWKEAQQIAFTLLAILAVYMVVAKGEVMAAFVLNPLL 152

Query: 161 YALRGTRNGLTFVTSKIL-RKTSVSNSAEFEAISNKQVSGHVSAKERVARKWGSD 210
           YALRGTR GL+ ++SK++ R+ S  +    E +  K+ S   +AKE V RKWGSD
Sbjct: 153 YALRGTREGLSSLSSKLMGREASKVSGDNSEEMWKKEGS---TAKESVVRKWGSD 202

BLAST of ClCG01G021480 vs. NCBI nr
Match: gi|449434028|ref|XP_004134798.1| (PREDICTED: uncharacterized protein LOC101207146 [Cucumis sativus])

HSP 1 Score: 371.3 bits (952), Expect = 1.1e-99
Identity = 185/212 (87.26%), Positives = 191/212 (90.09%), Query Frame = 1

Query: 1   MLQVLNLRPPTPILALPISISDDPTCSALPLLRPRNATHNWALLQSNLKCNGRFSCLFSD 60
           MLQ LNLRPP PILAL  S+SDDPTCS LPL RPRN  HNWALLQS LKCNGRFSCLFSD
Sbjct: 1   MLQALNLRPPPPILALSFSVSDDPTCSPLPLRRPRNPMHNWALLQSKLKCNGRFSCLFSD 60

Query: 61  NRREEQARKALESALGGKKNEFEKWNNEIKKREEMGGG---GGGRGGWFGSGGWFGWSDD 120
           NR+EEQARKALESALGGKKNEFEKWNNEIKKREE+GGG   GGGRGGWFGSGGWFGWSDD
Sbjct: 61  NRKEEQARKALESALGGKKNEFEKWNNEIKKREEVGGGSGSGGGRGGWFGSGGWFGWSDD 120

Query: 121 QFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTS 180
           QFWPEAQQTSLAVLGIIVMYL+VAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRK+S
Sbjct: 121 QFWPEAQQTSLAVLGIIVMYLLVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSS 180

Query: 181 VSNSAEFEAISNKQVSGHVSAKERVARKWGSD 210
            SN AE E ISNK     VSAK+RVARKWGSD
Sbjct: 181 ASNYAEVEMISNKD----VSAKDRVARKWGSD 208

BLAST of ClCG01G021480 vs. NCBI nr
Match: gi|659079091|ref|XP_008440069.1| (PREDICTED: uncharacterized protein LOC103484656 isoform X2 [Cucumis melo])

HSP 1 Score: 367.5 bits (942), Expect = 1.6e-98
Identity = 183/210 (87.14%), Positives = 190/210 (90.48%), Query Frame = 1

Query: 1   MLQVLNLRPPTPILALPISISDDPTCSALPLLRPRNATHNWALLQSNLKCNGRFSCLFSD 60
           MLQ LNLR   PILAL  S+SDDPTCSALPLLRPRN THNWALL SNLKCNGRFSCLFS+
Sbjct: 1   MLQALNLRTSPPILALSFSVSDDPTCSALPLLRPRNPTHNWALLLSNLKCNGRFSCLFSN 60

Query: 61  NRREEQARKALESALGGKKNEFEKWNNEIKKREEMGGG-GGGRGGWFGSGGWFGWSDDQF 120
           NRREEQARKALESALGGKKNEFEKWNNEIKKREE+GGG GGGRGGWFGSGGWFGWSDDQF
Sbjct: 61  NRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGSGGGRGGWFGSGGWFGWSDDQF 120

Query: 121 WPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSVS 180
           WPEAQQTSLAV GIIVMYLIVAKGELLLAV+FNPLLYALRGTRNGLTFVTSK LRK+S S
Sbjct: 121 WPEAQQTSLAVFGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKFLRKSSAS 180

Query: 181 NSAEFEAISNKQVSGHVSAKERVARKWGSD 210
           N AE E ISNK     V+AK+RVARKWGSD
Sbjct: 181 NYAEVEEISNKD----VTAKDRVARKWGSD 206

BLAST of ClCG01G021480 vs. NCBI nr
Match: gi|659079089|ref|XP_008440068.1| (PREDICTED: uncharacterized protein LOC103484656 isoform X1 [Cucumis melo])

HSP 1 Score: 362.8 bits (930), Expect = 3.9e-97
Identity = 183/211 (86.73%), Positives = 190/211 (90.05%), Query Frame = 1

Query: 1   MLQVLNLRPPTPILALPISISDDPTCSALPLLRPRNATHNWALLQSNLKCNGRFSCLFSD 60
           MLQ LNLR   PILAL  S+SDDPTCSALPLLRPRN THNWALL SNLKCNGRFSCLFS+
Sbjct: 1   MLQALNLRTSPPILALSFSVSDDPTCSALPLLRPRNPTHNWALLLSNLKCNGRFSCLFSN 60

Query: 61  NRRE-EQARKALESALGGKKNEFEKWNNEIKKREEMGGG-GGGRGGWFGSGGWFGWSDDQ 120
           NRRE EQARKALESALGGKKNEFEKWNNEIKKREE+GGG GGGRGGWFGSGGWFGWSDDQ
Sbjct: 61  NRREQEQARKALESALGGKKNEFEKWNNEIKKREEVGGGSGGGRGGWFGSGGWFGWSDDQ 120

Query: 121 FWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSV 180
           FWPEAQQTSLAV GIIVMYLIVAKGELLLAV+FNPLLYALRGTRNGLTFVTSK LRK+S 
Sbjct: 121 FWPEAQQTSLAVFGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKFLRKSSA 180

Query: 181 SNSAEFEAISNKQVSGHVSAKERVARKWGSD 210
           SN AE E ISNK     V+AK+RVARKWGSD
Sbjct: 181 SNYAEVEEISNKD----VTAKDRVARKWGSD 207

BLAST of ClCG01G021480 vs. NCBI nr
Match: gi|590626775|ref|XP_007026263.1| (Sulfate adenylyltransferase subunit 2 [Theobroma cacao])

HSP 1 Score: 268.1 bits (684), Expect = 1.3e-68
Identity = 143/214 (66.82%), Positives = 161/214 (75.23%), Query Frame = 1

Query: 1   MLQVLNLRPPTPILALPISIS---DDPTCSALPLLRPRNATHNWALLQSNLKCNGRFSCL 60
           M QVLNL P      L  S S   + P   +L   R +N   NW+ L  NLKCNGRFSCL
Sbjct: 1   MAQVLNLNP------LGSSYSTRPESPGFRSLNAARSQNVARNWSSLLQNLKCNGRFSCL 60

Query: 61  FSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEMGGGG-GGRGGWFGSGGWFGWS- 120
           FSDNRREEQARKALESALGGKK+EFEKWN EIK+REE GGG   G GGWFG GG FGWS 
Sbjct: 61  FSDNRREEQARKALESALGGKKSEFEKWNKEIKRREEAGGGDDAGGGGWFGWGGRFGWSN 120

Query: 121 DDQFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRK 180
           DD FW EAQQTSLAVLGIIVMYLI+AKGEL+LAVVFNPLLYALRGTR+GLT+VTS+IL K
Sbjct: 121 DDHFWQEAQQTSLAVLGIIVMYLIIAKGELMLAVVFNPLLYALRGTRSGLTYVTSRILGK 180

Query: 181 TSVSNSAEFEAISNKQVSGHVSAKERVARKWGSD 210
            +     +   +SNK+  G+VSAKE V +KWGS+
Sbjct: 181 RNADGPPDSCNMSNKETHGYVSAKENVLKKWGSN 208

BLAST of ClCG01G021480 vs. NCBI nr
Match: gi|1009107237|ref|XP_015878618.1| (PREDICTED: uncharacterized protein LOC107414920 [Ziziphus jujuba])

HSP 1 Score: 258.1 bits (658), Expect = 1.4e-65
Identity = 136/211 (64.45%), Positives = 159/211 (75.36%), Query Frame = 1

Query: 1   MLQVLNLRPPTPILA-LPISISDDPTCSALPLLRPRNATHNWALLQSNLKCNGRFSCLFS 60
           M  VLNL PP    + LP +  +    S     R +N + +W  L   LKC GRFSCLFS
Sbjct: 1   MAPVLNLSPPPLTFSHLPRTRPELTRSSLCATTRRQNLSQDWNSLLLKLKCRGRFSCLFS 60

Query: 61  DNRREEQARKALESALGGKKNEFEKWNNEIKKREEMGGGG-GGRGGWFGSGGWFGWSD-D 120
           DNRR+E+A+KALESALGGKKNEFEKWN EIK+REE+GGGG  G GGWFG GG FGWS+ D
Sbjct: 61  DNRRQEEAKKALESALGGKKNEFEKWNKEIKRREEVGGGGDAGGGGWFGWGGRFGWSNGD 120

Query: 121 QFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTS 180
            FW EAQQTSLAVLGI++MYLIVAKGE++LAV+ NPLLYALRGTRNG TF+TSKILRKTS
Sbjct: 121 HFWQEAQQTSLAVLGIVIMYLIVAKGEVMLAVILNPLLYALRGTRNGFTFITSKILRKTS 180

Query: 181 VSNSAEFEAISNKQVSGHVSAKERVARKWGS 209
             +  +F+  SNK+    VSAK+RV RKW S
Sbjct: 181 PDSFNDFDITSNKE---GVSAKDRVTRKWRS 208

The following BLAST results are available for this feature:

Match Name        E-value  Identity  Description
A0A061GG86_THECC  9.1e-69  66.82     Sulfate adenylyltransferase subunit 2 OS=Theobroma cacao GN=TCM_030362 PE=4 SV=1
F6HRM1_VITVI      2.1e-65  62.44     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_00s0187g00200 PE=4 SV=...
M5WV92_PRUPE      3.6e-65  63.81     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa011534mg PE=4 SV=1
A0A0B2R3Q2_GLYSO  8.0e-65  61.34     Uncharacterized protein OS=Glycine soja GN=glysoja_045818 PE=4 SV=1
I1KNI8_SOYBN      8.0e-65  61.34     Uncharacterized protein OS=Glycine max GN=GLYMA_07G265300 PE=4 SV=1

Match Name   E-value  Identity  Description
AT5G20130.1  1.9e-49  59.43     unknown protein

Match Name                         E-value  Identity  Description
gi|449434028|ref|XP_004134798.1|   1.1e-99  87.26     PREDICTED: uncharacterized protein LOC101207146 [Cucumis sativus]
gi|659079091|ref|XP_008440069.1|   1.6e-98  87.14     PREDICTED: uncharacterized protein LOC103484656 isoform X2 [Cucumis melo]
gi|659079089|ref|XP_008440068.1|   3.9e-97  86.73     PREDICTED: uncharacterized protein LOC103484656 isoform X1 [Cucumis melo]
gi|590626775|ref|XP_007026263.1|   1.3e-68  66.82     Sulfate adenylyltransferase subunit 2 [Theobroma cacao]
gi|1009107237|ref|XP_015878618.1|  1.4e-65  64.45     PREDICTED: uncharacterized protein LOC107414920 [Ziziphus jujuba]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
biological_process  GO:0050790      regulation of catalytic activity
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0005737      cytoplasm
cellular_component  GO:0016021      integral component of membrane
cellular_component  GO:0016020      membrane
molecular_function  GO:0016740      transferase activity
molecular_function  GO:0003824      catalytic activity
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0005094      Rho GDP-dissociation inhibitor activity
molecular_function  GO:0016779      nucleotidyltransferase activity

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
ClCG01G021480.1  ClCG01G021480.1  mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term  IPR Description   Source   Source Term    Source Description   Alignment
None      No IPR available  unknown  Coil           Coil                 coord: 68..88
None      No IPR available  PANTHER  PTHR36393      FAMILY NOT NAMED     coord: 1..209, score: 1.6
None      No IPR available  PANTHER  PTHR36393:SF1  SUBFAMILY NOT NAMED  coord: 1..209, score: 1.6

The following gene(s) are paralogous to this gene:

None