ClCG05G012210 (gene) Watermelon (Charleston Gray)

Name: ClCG05G012210
Type: gene
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: UPF0420 protein C16orf58
Location: CG_Chr05 : 15472610 .. 15473732 (-)
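As a quick check on the Location field, the length of the genomic span can be derived from the two coordinates. The sketch below assumes the coordinates are 1-based and inclusive (a common genome-browser convention, not stated on this page); the helper name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Split a location string such as 'CG_Chr05 : 15472610 .. 15473732 (-)'
    into (chromosome, start, end, strand)."""
    m = re.search(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError("unrecognised location string: %r" % text)
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("CG_Chr05 : 15472610 .. 15473732 (-)")

# Assumption: both endpoints are part of the feature (1-based, inclusive).
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the feature covers 1123 bases on the minus strand of CG_Chr05.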
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATGGGTTGCTGCCGTTCTCTTATCAGCCGCCGGAGCCGATTCCATTACGTCGAGTCTATGCCAATGTTCTAAACTATGTACCAGGCGGCCGTTTTCACCATTTCTCGGATTCTTCTATGCGAAGGTCATGCGCAGCACTAACACCTCTTCTTAGCGTATTTCCCCACCATCTTAAGCCCACAAAACTCGTCCAAGGTTATTTCTCTCCTTGTATTAGAACTAGAATCAAACCTGCTCTCGTTCATTCTCCTTTGCTGGCCGGTGACGGCCATGGGTGTGGTGGAAACAACAATGGCGGTTGGAATTATTCGAATCCTTTTGGGGGTTTTGGATGGTGGCAAAATGACGGTGATTCTCCTCCATGGTCAGACAATGCCTTCCTTGCCTTCTTCTTTACCTCCGTTCTGGGTTGTTTCTGCCTTTTTCAATTGGCAGCAGCGGTAGCACGTAATGAAATGAATTATGAGTCTGTTTGGGAAGTAAAAGGAGGTAAGCGAATCCGCCTCATTCTCGATACGTTTAGGGATGAGTTCCATGTTGCAACTGGCATGCCGTCGTCTTCGTTATCCTTTTCCTTTGTCAATTTTTGGCTTCGTTGCAGCGATGTATTCAGGCGTTTGATGCTTCCGGAGGGTTTTCCAGACAGCGTTACCAGCGACTATCTGGAATATTCTCTTTGGCGAGGAGTCCAGGGGATTGCCAGCCAAGTTAGTGGGGTCCTTGCAACTCAGGTGCCCATTTACGTTCGTCTATGGTTCCTCTTTCCTGTTTCTCATCCAACAAAAGAATAATTGATTGCATTTTGCTTGAACCCCTGCTGGGATGCAGGCACTGCTTTATGCTGTTGGATTGGGAAAAGGAGCTATTCCGACTGCTGCCGCAGTGAATTGGGTACTGAAAGATGGATTTGGATATCTGAGTAAAATTTTACTCTCAAAATATGGACGGCACTTTGATGTTCATCCGAAGGGGTGGAGGTTGTTTGCTGATCTTCTGGAAAACGCTGCCTATGGGATGGAAATGATAACTCCCGCATTTCCCCTCCATTTTGTCATGATCGGTGCTGCTGCTGGGGCTGGACGATCTGCAGCCGCCTTGATTCAGGTTATTGGAAGTTGA

mRNA sequence

ATGGATGGGTTGCTGCCGTTCTCTTATCAGCCGCCGGAGCCGATTCCATTACGTCGAGTCTATGCCAATGTTCTAAACTATGTACCAGGCGGCCGTTTTCACCATTTCTCGGATTCTTCTATGCGAAGGTCATGCGCAGCACTAACACCTCTTCTTAGCGTATTTCCCCACCATCTTAAGCCCACAAAACTCGTCCAAGGTTATTTCTCTCCTTGTATTAGAACTAGAATCAAACCTGCTCTCGTTCATTCTCCTTTGCTGGCCGGTGACGGCCATGGGTGTGGTGGAAACAACAATGGCGGTTGGAATTATTCGAATCCTTTTGGGGGTTTTGGATGGTGGCAAAATGACGGTGATTCTCCTCCATGGTCAGACAATGCCTTCCTTGCCTTCTTCTTTACCTCCGTTCTGGGTTGTTTCTGCCTTTTTCAATTGGCAGCAGCGGTAGCACGTAATGAAATGAATTATGAGTCTGTTTGGGAAGTAAAAGGAGGTAAGCGAATCCGCCTCATTCTCGATACGTTTAGGGATGAGTTCCATGTTGCAACTGGCATGCCGTCGTCTTCGTTATCCTTTTCCTTTGTCAATTTTTGGCTTCGTTGCAGCGATGTATTCAGGCGTTTGATGCTTCCGGAGGGTTTTCCAGACAGCGTTACCAGCGACTATCTGGAATATTCTCTTTGGCGAGGAGTCCAGGGGATTGCCAGCCAAGTTAGTGGGGTCCTTGCAACTCAGGCACTGCTTTATGCTGTTGGATTGGGAAAAGGAGCTATTCCGACTGCTGCCGCAGTGAATTGGGTACTGAAAGATGGATTTGGATATCTGAGTAAAATTTTACTCTCAAAATATGGACGGCACTTTGATGTTCATCCGAAGGGGTGGAGGTTGTTTGCTGATCTTCTGGAAAACGCTGCCTATGGGATGGAAATGATAACTCCCGCATTTCCCCTCCATTTTGTCATGATCGGTGCTGCTGCTGGGGCTGGACGATCTGCAGCCGCCTTGATTCAGGTTATTGGAAGTTGA

Coding sequence (CDS)

ATGGATGGGTTGCTGCCGTTCTCTTATCAGCCGCCGGAGCCGATTCCATTACGTCGAGTCTATGCCAATGTTCTAAACTATGTACCAGGCGGCCGTTTTCACCATTTCTCGGATTCTTCTATGCGAAGGTCATGCGCAGCACTAACACCTCTTCTTAGCGTATTTCCCCACCATCTTAAGCCCACAAAACTCGTCCAAGGTTATTTCTCTCCTTGTATTAGAACTAGAATCAAACCTGCTCTCGTTCATTCTCCTTTGCTGGCCGGTGACGGCCATGGGTGTGGTGGAAACAACAATGGCGGTTGGAATTATTCGAATCCTTTTGGGGGTTTTGGATGGTGGCAAAATGACGGTGATTCTCCTCCATGGTCAGACAATGCCTTCCTTGCCTTCTTCTTTACCTCCGTTCTGGGTTGTTTCTGCCTTTTTCAATTGGCAGCAGCGGTAGCACGTAATGAAATGAATTATGAGTCTGTTTGGGAAGTAAAAGGAGGTAAGCGAATCCGCCTCATTCTCGATACGTTTAGGGATGAGTTCCATGTTGCAACTGGCATGCCGTCGTCTTCGTTATCCTTTTCCTTTGTCAATTTTTGGCTTCGTTGCAGCGATGTATTCAGGCGTTTGATGCTTCCGGAGGGTTTTCCAGACAGCGTTACCAGCGACTATCTGGAATATTCTCTTTGGCGAGGAGTCCAGGGGATTGCCAGCCAAGTTAGTGGGGTCCTTGCAACTCAGGCACTGCTTTATGCTGTTGGATTGGGAAAAGGAGCTATTCCGACTGCTGCCGCAGTGAATTGGGTACTGAAAGATGGATTTGGATATCTGAGTAAAATTTTACTCTCAAAATATGGACGGCACTTTGATGTTCATCCGAAGGGGTGGAGGTTGTTTGCTGATCTTCTGGAAAACGCTGCCTATGGGATGGAAATGATAACTCCCGCATTTCCCCTCCATTTTGTCATGATCGGTGCTGCTGCTGGGGCTGGACGATCTGCAGCCGCCTTGATTCAGGTTATTGGAAGTTGA

Protein sequence

MDGLLPFSYQPPEPIPLRRVYANVLNYVPGGRFHHFSDSSMRRSCAALTPLLSVFPHHLKPTKLVQGYFSPCIRTRIKPALVHSPLLAGDGHGCGGNNNGGWNYSNPFGGFGWWQNDGDSPPWSDNAFLAFFFTSVLGCFCLFQLAAAVARNEMNYESVWEVKGGKRIRLILDTFRDEFHVATGMPSSSLSFSFVNFWLRCSDVFRRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKILLSKYGRHFDVHPKGWRLFADLLENAAYGMEMITPAFPLHFVMIGAAAGAGRSAAALIQVIGS
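The protein sequence above corresponds to a codon-by-codon reading of the CDS. The following is a minimal sketch of that translation, assuming the standard genetic code (the page does not state which table was used); `translate` is an illustrative helper, and only the first and last few codons of the CDS are used as input.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

cds_start = "ATGGATGGGTTGCTGCCG"  # first 18 bases of the CDS above
cds_end = "GGAAGTTGA"             # last 9 bases of the CDS above
print(translate(cds_start), translate(cds_end))
```

The first six codons give MDGLLP and the last three give GS followed by a stop, which agrees with the start and end of the listed protein sequence.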
BLAST of ClCG05G012210 vs. Swiss-Prot
Match: RUS1_ARATH (Protein root UVB sensitive 1, chloroplastic OS=Arabidopsis thaliana GN=RUS1 PE=1 SV=1)

HSP 1 Score: 287.7 bits (735), Expect = 1.6e-76
Identity = 169/299 (56.52%), Positives = 195/299 (65.22%), Query Frame = 1

Query: 50  PLLSVFPHHLKPTKLVQGYFSP-CIRTRIKPALVHSPLLAG-DGHGCGGNNNGGWNYSNP 109
           P  S F   ++    V  +FS   + TR   A V S  L G +G+   GN  GG      
Sbjct: 38  PSGSSFSRCVRLVANVNDHFSKQSLATRNCLASVFSADLGGSNGNNDNGNGGGG------ 97

Query: 110 FGGFGWWQNDGDSPPWSDNAFLAFFFTSVLGCFCLFQLAAAVA---------RNEMNYES 169
            GG G   N  DS    D  +L F     L CF  F+L+AA A           +   E+
Sbjct: 98  -GGDGGGDNSDDSS--FDLRYLCFLLLG-LSCFFHFRLSAASAIAKDQNSDSNGDAVKET 157

Query: 170 VWEVKGGKRIRLILDTFRDEFHVATGMPSSSLSFSFVNFWLRCSDVFRRLMLPEGFPDSV 229
           VWEV+G KR RL+ D  +DEF         S S +  N   +C ++  + +LPEGFP+SV
Sbjct: 158 VWEVRGSKRKRLVPDFVKDEFVSEESAFELSSSLTPENLLAQCRNLLTQFLLPEGFPNSV 217

Query: 230 TSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKI 289
           TSDYL+YSLWRGVQGIASQ+SGVLATQ+LLYAVGLGKGAIPTAAA+NWVLKDG GYLSKI
Sbjct: 218 TSDYLDYSLWRGVQGIASQISGVLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKI 277

Query: 290 LLSKYGRHFDVHPKGWRLFADLLENAAYGMEMITPAFPLHFVMIGAAAGAGRSAAALIQ 338
           +LSKYGRHFDVHPKGWRLFADLLENAA+GMEM+TP FP  FVMIGAAAGAGRSAAALIQ
Sbjct: 278 MLSKYGRHFDVHPKGWRLFADLLENAAFGMEMLTPVFPQFFVMIGAAAGAGRSAAALIQ 326
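The percentages in each HSP header follow directly from the match counts: identical positions divided by the alignment length. A small sketch of that arithmetic is given below; the function name `identity_percent` is hypothetical and the regular expression assumes the header layout shown in this report.

```python
import re

def identity_percent(hsp_line):
    """Recompute the identity percentage from an HSP header line."""
    m = re.search(r"Identity = (\d+)/(\d+)", hsp_line)
    if m is None:
        raise ValueError("no Identity field found")
    matches, length = map(int, m.groups())
    return round(100.0 * matches / length, 2)

header = "Identity = 169/299 (56.52%), Query Frame = 1"
print(identity_percent(header))
```

For the RUS1_ARATH hit, 169 identical positions over an alignment length of 299 give 56.52%, matching the value printed in the header.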

BLAST of ClCG05G012210 vs. Swiss-Prot
Match: RUS1_MOUSE (RUS1 family protein C16orf58 homolog OS=Mus musculus PE=1 SV=1)

HSP 1 Score: 114.4 bits (285), Expect = 2.5e-24
Identity = 62/145 (42.76%), Positives = 86/145 (59.31%), Query Frame = 1

Query: 206 RRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKG-AIPTAAAV 265
           R ++LP+GFPDSV+ DYL Y LW  VQ  AS +SG LATQA+L  +G+G   A  +AA  
Sbjct: 72  RSVLLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATS 131

Query: 266 NWVLKDGFGYLSKILLSKY-GRHFDVHPKGWRLFADLLENAAYGMEMITPAFPLHFVM-- 325
            W++KD  G L +I+L+ + G   D + K WRLFAD+L + A  +E++ P +P+ F M  
Sbjct: 132 TWLVKDSTGMLGRIILAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYPIFFTMTV 191

Query: 326 ---------IGAAAGAGRSAAALIQ 338
                    +G A GA R+A  + Q
Sbjct: 192 STSNLAKCIVGVAGGATRAALTMHQ 216

BLAST of ClCG05G012210 vs. Swiss-Prot
Match: RUS1_RAT (RUS1 family protein C16orf58 homolog OS=Rattus norvegicus PE=2 SV=1)

HSP 1 Score: 114.0 bits (284), Expect = 3.2e-24
Identity = 61/145 (42.07%), Positives = 86/145 (59.31%), Query Frame = 1

Query: 206 RRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKG-AIPTAAAV 265
           R ++LP+GFPDSV+ DYL+Y LW  VQ  AS +SG LATQA+L  +G+G   A  +AA  
Sbjct: 72  RSVLLPQGFPDSVSPDYLQYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATS 131

Query: 266 NWVLKDGFGYLSKILLSKY-GRHFDVHPKGWRLFADLLENAAYGMEMITPAFPLHFVM-- 325
            W++KD  G L +I+ + + G   D + K WRLFAD+L + A  +E++ P +P+ F M  
Sbjct: 132 TWLVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDTAMFLEIMAPMYPIFFTMTV 191

Query: 326 ---------IGAAAGAGRSAAALIQ 338
                    +G A GA R+A  + Q
Sbjct: 192 STSNLAKCIVGVAGGATRAALTMHQ 216

BLAST of ClCG05G012210 vs. Swiss-Prot
Match: RUS1_HUMAN (RUS1 family protein C16orf58 OS=Homo sapiens GN=C16orf58 PE=1 SV=2)

HSP 1 Score: 110.5 bits (275), Expect = 3.6e-23
Identity = 73/197 (37.06%), Positives = 100/197 (50.76%), Query Frame = 1

Query: 160 WEVKGGK-----RIRLILDTFRDEFHV-ATGMPSSSLSFSFVNFWLRCSDVFRRLMLPEG 219
           WEV G +     R   +    RD   V A+G PS  LS              + + LP+G
Sbjct: 34  WEVGGWRWWGLSRAFTVKPEGRDAGEVGASGAPSPPLSG------------LQAVFLPQG 93

Query: 220 FPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKG-AIPTAAAVNWVLKDGF 279
           FPDSV+ DYL Y LW  VQ  AS +SG LATQA+L  +G+G   A  +AA   W++KD  
Sbjct: 94  FPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDST 153

Query: 280 GYLSKILLSKY-GRHFDVHPKGWRLFADLLENAAYGMEMITPAFPLHFVM---------- 338
           G L +I+ + + G   D + K WRLFAD+L + A  +E++ P +P+ F M          
Sbjct: 154 GMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKC 213

BLAST of ClCG05G012210 vs. Swiss-Prot
Match: RUS1_PONAB (RUS1 family protein C16orf58 homolog OS=Pongo abelii PE=2 SV=1)

HSP 1 Score: 109.4 bits (272), Expect = 7.9e-23
Identity = 73/197 (37.06%), Positives = 99/197 (50.25%), Query Frame = 1

Query: 160 WEVKGGK-----RIRLILDTFRDEFHV-ATGMPSSSLSFSFVNFWLRCSDVFRRLMLPEG 219
           WEV G +     R   +    RD   V A G PS  LS              + + LP+G
Sbjct: 34  WEVGGWRWWGLSRAFTVKPEGRDSGEVGAPGAPSPPLSG------------LQAVFLPQG 93

Query: 220 FPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKG-AIPTAAAVNWVLKDGF 279
           FPDSV+ DYL Y LW  VQ  AS +SG LATQA+L  +G+G   A  +AA   W++KD  
Sbjct: 94  FPDSVSPDYLPYQLWDSVQAFASGLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDST 153

Query: 280 GYLSKILLSKY-GRHFDVHPKGWRLFADLLENAAYGMEMITPAFPLHFVM---------- 338
           G L +I+ + + G   D + K WRLFAD+L + A  +E++ P +P+ F M          
Sbjct: 154 GMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKC 213

BLAST of ClCG05G012210 vs. TrEMBL
Match: A0A0A0L6U1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G257050 PE=4 SV=1)

HSP 1 Score: 633.6 bits (1633), Expect = 1.3e-178
Identity = 307/343 (89.50%), Positives = 316/343 (92.13%), Query Frame = 1

Query: 1   MDGLLPFSYQ--PPEPIPLRRVYANVLNYVPGGRFHHFSDSSMRRSCAALTPLLSVFPHH 60
           M G+LPFSYQ  PPEPIP R VY +VLNYVP  RFHH  DSSMRRSC AL P LSVFPH 
Sbjct: 1   MYGVLPFSYQAPPPEPIPFRPVYVDVLNYVPVRRFHHCLDSSMRRSCTALRPSLSVFPHF 60

Query: 61  LKPTKLVQGYFSPCIRTRIKPALVHSPLLAGDGHGCGGNNNGGWNYSNPFGGFGWWQNDG 120
           LKPTKL QGY SPC  TRIKPALVHSPLLAGDGHGC GNNNGGWN SNPFGGFGWWQ DG
Sbjct: 61  LKPTKLFQGYSSPCNGTRIKPALVHSPLLAGDGHGCDGNNNGGWNNSNPFGGFGWWQYDG 120

Query: 121 DSPPWSDNAFLAFFFTSVLGCFCLFQLAAAVARNEMNYESVWEVKGGKRIRLILDTFRDE 180
           DSPPWSDNAFLAFFF+SVLGCFCLFQLA A+ARN MN ES+WEVKGGKRIRLILDT+RDE
Sbjct: 121 DSPPWSDNAFLAFFFSSVLGCFCLFQLAVALARNNMNTESIWEVKGGKRIRLILDTYRDE 180

Query: 181 FHVATGMPSSSLSFSFVNFWLRCSDVFRRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQV 240
           FHVATGMPSSSLSFSFVN WLRCSD+F RLMLPEGFPDSVTSDYLEYSLWRGVQGIASQV
Sbjct: 181 FHVATGMPSSSLSFSFVNVWLRCSDIFTRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQV 240

Query: 241 SGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKILLSKYGRHFDVHPKGWRLFA 300
           SGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKI LSKYGRHFDVHPKGWRLFA
Sbjct: 241 SGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKIFLSKYGRHFDVHPKGWRLFA 300

Query: 301 DLLENAAYGMEMITPAFPLHFVMIGAAAGAGRSAAALIQVIGS 342
           DLLENAAYGMEM+TPAFPLHFV+IGAAAGAGRSAAALIQVIGS
Sbjct: 301 DLLENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQVIGS 343

BLAST of ClCG05G012210 vs. TrEMBL
Match: A0A061G6Z1_THECC (Uncharacterized protein isoform 8 OS=Theobroma cacao GN=TCM_016682 PE=4 SV=1)

HSP 1 Score: 319.7 bits (818), Expect = 4.3e-84
Identity = 172/272 (63.24%), Positives = 197/272 (72.43%), Query Frame = 1

Query: 69  FSPCIRTRIKPALVHSPLLAGDGHGCGGNNNGGWNYSNPFGGFGWWQNDGDSPPWSDNAF 128
           F P I      +L    L  G G GC GNNN   N   PFG   W  ND DS     + F
Sbjct: 47  FKPVIAAATTKSLPFPLLSHGHGGGCDGNNNN--NNDGPFGSDSWRWND-DSSSSHSHPF 106

Query: 129 LAFFFTSVLGCFCLFQLAAAVAR-NEMNYES--VWEVKGGKRIRLILDTFRDEFHVATGM 188
           L  F +S + CFC  QL++A+AR NE + E   VWEVKG K  +LI D   D F  + G+
Sbjct: 107 L-LFLSSFVACFCPSQLSSALARTNEDSQEDDVVWEVKGSKWTKLIPDFSEDAFVASNGI 166

Query: 189 PSSSLSFSFVNFWLRCSDVFRRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQ 248
            + + S S    W +C D+  RL+LPEGFPDSVTSDYL+YSLWRGVQG+ASQ+SGVLATQ
Sbjct: 167 VNLTKSLSLSTVWRQCRDIVMRLLLPEGFPDSVTSDYLDYSLWRGVQGVASQISGVLATQ 226

Query: 249 ALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKILLSKYGRHFDVHPKGWRLFADLLENAA 308
           ALLYAVGLGKGAIPTAAA+NWVLKDG GYLSKI+LSKYGRHFDV+PKGWRLFADLLENAA
Sbjct: 227 ALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAA 286

Query: 309 YGMEMITPAFPLHFVMIGAAAGAGRSAAALIQ 338
           +G+EM+TPAFP  FV IGAAAGAGRSAAALIQ
Sbjct: 287 FGLEMLTPAFPHLFVPIGAAAGAGRSAAALIQ 314

BLAST of ClCG05G012210 vs. TrEMBL
Match: A0A061GE65_THECC (Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_016682 PE=4 SV=1)

HSP 1 Score: 319.7 bits (818), Expect = 4.3e-84
Identity = 172/272 (63.24%), Positives = 197/272 (72.43%), Query Frame = 1

Query: 69  FSPCIRTRIKPALVHSPLLAGDGHGCGGNNNGGWNYSNPFGGFGWWQNDGDSPPWSDNAF 128
           F P I      +L    L  G G GC GNNN   N   PFG   W  ND DS     + F
Sbjct: 47  FKPVIAAATTKSLPFPLLSHGHGGGCDGNNNN--NNDGPFGSDSWRWND-DSSSSHSHPF 106

Query: 129 LAFFFTSVLGCFCLFQLAAAVAR-NEMNYES--VWEVKGGKRIRLILDTFRDEFHVATGM 188
           L  F +S + CFC  QL++A+AR NE + E   VWEVKG K  +LI D   D F  + G+
Sbjct: 107 L-LFLSSFVACFCPSQLSSALARTNEDSQEDDVVWEVKGSKWTKLIPDFSEDAFVASNGI 166

Query: 189 PSSSLSFSFVNFWLRCSDVFRRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQ 248
            + + S S    W +C D+  RL+LPEGFPDSVTSDYL+YSLWRGVQG+ASQ+SGVLATQ
Sbjct: 167 VNLTKSLSLSTVWRQCRDIVMRLLLPEGFPDSVTSDYLDYSLWRGVQGVASQISGVLATQ 226

Query: 249 ALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKILLSKYGRHFDVHPKGWRLFADLLENAA 308
           ALLYAVGLGKGAIPTAAA+NWVLKDG GYLSKI+LSKYGRHFDV+PKGWRLFADLLENAA
Sbjct: 227 ALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAA 286

Query: 309 YGMEMITPAFPLHFVMIGAAAGAGRSAAALIQ 338
           +G+EM+TPAFP  FV IGAAAGAGRSAAALIQ
Sbjct: 287 FGLEMLTPAFPHLFVPIGAAAGAGRSAAALIQ 314

BLAST of ClCG05G012210 vs. TrEMBL
Match: A0A061G662_THECC (Uncharacterized protein isoform 7 OS=Theobroma cacao GN=TCM_016682 PE=4 SV=1)

HSP 1 Score: 319.7 bits (818), Expect = 4.3e-84
Identity = 172/272 (63.24%), Positives = 197/272 (72.43%), Query Frame = 1

Query: 69  FSPCIRTRIKPALVHSPLLAGDGHGCGGNNNGGWNYSNPFGGFGWWQNDGDSPPWSDNAF 128
           F P I      +L    L  G G GC GNNN   N   PFG   W  ND DS     + F
Sbjct: 47  FKPVIAAATTKSLPFPLLSHGHGGGCDGNNNN--NNDGPFGSDSWRWND-DSSSSHSHPF 106

Query: 129 LAFFFTSVLGCFCLFQLAAAVAR-NEMNYES--VWEVKGGKRIRLILDTFRDEFHVATGM 188
           L  F +S + CFC  QL++A+AR NE + E   VWEVKG K  +LI D   D F  + G+
Sbjct: 107 L-LFLSSFVACFCPSQLSSALARTNEDSQEDDVVWEVKGSKWTKLIPDFSEDAFVASNGI 166

Query: 189 PSSSLSFSFVNFWLRCSDVFRRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQ 248
            + + S S    W +C D+  RL+LPEGFPDSVTSDYL+YSLWRGVQG+ASQ+SGVLATQ
Sbjct: 167 VNLTKSLSLSTVWRQCRDIVMRLLLPEGFPDSVTSDYLDYSLWRGVQGVASQISGVLATQ 226

Query: 249 ALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKILLSKYGRHFDVHPKGWRLFADLLENAA 308
           ALLYAVGLGKGAIPTAAA+NWVLKDG GYLSKI+LSKYGRHFDV+PKGWRLFADLLENAA
Sbjct: 227 ALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAA 286

Query: 309 YGMEMITPAFPLHFVMIGAAAGAGRSAAALIQ 338
           +G+EM+TPAFP  FV IGAAAGAGRSAAALIQ
Sbjct: 287 FGLEMLTPAFPHLFVPIGAAAGAGRSAAALIQ 314

BLAST of ClCG05G012210 vs. TrEMBL
Match: A0A061GE69_THECC (Uncharacterized protein isoform 6 OS=Theobroma cacao GN=TCM_016682 PE=4 SV=1)

HSP 1 Score: 319.7 bits (818), Expect = 4.3e-84
Identity = 172/272 (63.24%), Positives = 197/272 (72.43%), Query Frame = 1

Query: 69  FSPCIRTRIKPALVHSPLLAGDGHGCGGNNNGGWNYSNPFGGFGWWQNDGDSPPWSDNAF 128
           F P I      +L    L  G G GC GNNN   N   PFG   W  ND DS     + F
Sbjct: 47  FKPVIAAATTKSLPFPLLSHGHGGGCDGNNNN--NNDGPFGSDSWRWND-DSSSSHSHPF 106

Query: 129 LAFFFTSVLGCFCLFQLAAAVAR-NEMNYES--VWEVKGGKRIRLILDTFRDEFHVATGM 188
           L  F +S + CFC  QL++A+AR NE + E   VWEVKG K  +LI D   D F  + G+
Sbjct: 107 L-LFLSSFVACFCPSQLSSALARTNEDSQEDDVVWEVKGSKWTKLIPDFSEDAFVASNGI 166

Query: 189 PSSSLSFSFVNFWLRCSDVFRRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQ 248
            + + S S    W +C D+  RL+LPEGFPDSVTSDYL+YSLWRGVQG+ASQ+SGVLATQ
Sbjct: 167 VNLTKSLSLSTVWRQCRDIVMRLLLPEGFPDSVTSDYLDYSLWRGVQGVASQISGVLATQ 226

Query: 249 ALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKILLSKYGRHFDVHPKGWRLFADLLENAA 308
           ALLYAVGLGKGAIPTAAA+NWVLKDG GYLSKI+LSKYGRHFDV+PKGWRLFADLLENAA
Sbjct: 227 ALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAA 286

Query: 309 YGMEMITPAFPLHFVMIGAAAGAGRSAAALIQ 338
           +G+EM+TPAFP  FV IGAAAGAGRSAAALIQ
Sbjct: 287 FGLEMLTPAFPHLFVPIGAAAGAGRSAAALIQ 314

BLAST of ClCG05G012210 vs. TAIR10
Match: AT3G45890.1 (AT3G45890.1 Protein of unknown function, DUF647)

HSP 1 Score: 287.7 bits (735), Expect = 9.2e-78
Identity = 169/299 (56.52%), Positives = 195/299 (65.22%), Query Frame = 1

Query: 50  PLLSVFPHHLKPTKLVQGYFSP-CIRTRIKPALVHSPLLAG-DGHGCGGNNNGGWNYSNP 109
           P  S F   ++    V  +FS   + TR   A V S  L G +G+   GN  GG      
Sbjct: 38  PSGSSFSRCVRLVANVNDHFSKQSLATRNCLASVFSADLGGSNGNNDNGNGGGG------ 97

Query: 110 FGGFGWWQNDGDSPPWSDNAFLAFFFTSVLGCFCLFQLAAAVA---------RNEMNYES 169
            GG G   N  DS    D  +L F     L CF  F+L+AA A           +   E+
Sbjct: 98  -GGDGGGDNSDDSS--FDLRYLCFLLLG-LSCFFHFRLSAASAIAKDQNSDSNGDAVKET 157

Query: 170 VWEVKGGKRIRLILDTFRDEFHVATGMPSSSLSFSFVNFWLRCSDVFRRLMLPEGFPDSV 229
           VWEV+G KR RL+ D  +DEF         S S +  N   +C ++  + +LPEGFP+SV
Sbjct: 158 VWEVRGSKRKRLVPDFVKDEFVSEESAFELSSSLTPENLLAQCRNLLTQFLLPEGFPNSV 217

Query: 230 TSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKI 289
           TSDYL+YSLWRGVQGIASQ+SGVLATQ+LLYAVGLGKGAIPTAAA+NWVLKDG GYLSKI
Sbjct: 218 TSDYLDYSLWRGVQGIASQISGVLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKI 277

Query: 290 LLSKYGRHFDVHPKGWRLFADLLENAAYGMEMITPAFPLHFVMIGAAAGAGRSAAALIQ 338
           +LSKYGRHFDVHPKGWRLFADLLENAA+GMEM+TP FP  FVMIGAAAGAGRSAAALIQ
Sbjct: 278 MLSKYGRHFDVHPKGWRLFADLLENAAFGMEMLTPVFPQFFVMIGAAAGAGRSAAALIQ 326

BLAST of ClCG05G012210 vs. TAIR10
Match: AT1G13770.1 (AT1G13770.1 Protein of unknown function, DUF647)

HSP 1 Score: 102.4 bits (254), Expect = 5.5e-22
Identity = 66/172 (38.37%), Positives = 91/172 (52.91%), Query Frame = 1

Query: 179 FHVATGMPSSSLSFS-----FVNFWLRCSDVFRRLMLPEGFPDSVTSDYLEYSLWRGVQG 238
           F  AT   SSSLS       F + W R    F    +PEGFP SVT DY+ + LW  +QG
Sbjct: 26  FKTATITASSSLSIQRSANRFNHVWRRVLQAF----VPEGFPGSVTPDYVGFQLWDTLQG 85

Query: 239 IASQVSGVLATQALLYAVGLG-KGAIPTAAAVNWVLKDGFGYLSKILLSKY-GRHFDVHP 298
           +++    +L+TQALL A+G+G K A    A   W L+D  G L  IL + Y G + D + 
Sbjct: 86  LSTYTKMMLSTQALLSAIGVGEKSATVIGATFQWFLRDFTGMLGGILFTFYQGSNLDSNA 145

Query: 299 KGWRLFADLLENAAYGMEMITPAFPLHFVMI-----------GAAAGAGRSA 333
           K WRL ADL+ +    M++++P FP  F+++           G A+GA R+A
Sbjct: 146 KMWRLVADLMNDIGMLMDLLSPLFPSAFIVVVCLGSLSRSFTGVASGATRAA 193

BLAST of ClCG05G012210 vs. TAIR10
Match: AT5G01510.1 (AT5G01510.1 Protein of unknown function, DUF647)

HSP 1 Score: 100.5 bits (249), Expect = 2.1e-21
Identity = 53/145 (36.55%), Positives = 78/145 (53.79%), Query Frame = 1

Query: 198 WLRCSDVFRRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGK-- 257
           WL   DV R  + P GFP SV+ DYL+Y LW+    I   +  VL T +LL AVG+G   
Sbjct: 110 WL--PDVVRDFVFPSGFPGSVSDDYLDYMLWQFPTNITGWICNVLVTSSLLKAVGVGSFS 169

Query: 258 ------GAIPTAAAVNWVLKDGFGYLSKILL-SKYGRHFDVHPKGWRLFADLLENAAYGM 317
                  A  +AAA+ WV KDG G L ++L+  ++G  FD  PK WR++AD + +A    
Sbjct: 170 GTSAAATAAASAAAIRWVSKDGIGALGRLLIGGRFGSLFDDDPKQWRMYADFIGSAGSFF 229

Query: 318 EMITPAFPLHFVMIGAAAGAGRSAA 334
           ++ T  +P  F+++ +     ++ A
Sbjct: 230 DLATQLYPSQFLLLASTGNLAKAVA 252

BLAST of ClCG05G012210 vs. TAIR10
Match: AT5G49820.1 (AT5G49820.1 Protein of unknown function, DUF647)

HSP 1 Score: 94.7 bits (234), Expect = 1.1e-19
Identity = 50/131 (38.17%), Positives = 71/131 (54.20%), Query Frame = 1

Query: 206 RRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTAA-AV 265
           R  ++PEGFP SV   Y+ Y  WR ++       GV  TQ LL +VG  + +  +AA A+
Sbjct: 109 RSYVVPEGFPGSVNESYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGASRNSSASAAVAI 168

Query: 266 NWVLKDGFGYLSKILLSKYGRHFDVHPKGWRLFADLLENAAYGMEMITPAFPLHFVMIGA 325
           NW+LKDG G + K+L ++ G+ FD   K  R   DLL     G+E+ T A P  F+ +  
Sbjct: 169 NWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLAC 228

Query: 326 AAGAGRSAAAL 336
           AA   ++ AA+
Sbjct: 229 AANVVKNVAAV 239

BLAST of ClCG05G012210 vs. TAIR10
Match: AT2G31190.1 (AT2G31190.1 Protein of unknown function, DUF647)

HSP 1 Score: 91.7 bits (226), Expect = 9.7e-19
Identity = 49/134 (36.57%), Positives = 73/134 (54.48%), Query Frame = 1

Query: 205 FRRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTAAAV 264
           F     P G+P SV   YL Y+ +R +Q  +S    VL+TQ+LL+A GL +     A  V
Sbjct: 67  FLNKFFPSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGL-RPTPAQATVV 126

Query: 265 NWVLKDGFGYLSKILLSKYGRHFDVHPKGWRLFADLLENAAYGMEMITPAFPLHFVMIGA 324
           +W+LKDG  ++ K++ S  G   D  PK WR+ AD+L +   G+E+++P  P  F+ +  
Sbjct: 127 SWILKDGMQHVGKLICSNLGARMDSEPKRWRILADVLYDLGTGLELVSPLCPHLFLEM-- 186

Query: 325 AAGAGRSAAALIQV 339
            AG G  A  +  V
Sbjct: 187 -AGLGNFAKGMATV 196

BLAST of ClCG05G012210 vs. NCBI nr
Match: gi|700202567|gb|KGN57700.1| (hypothetical protein Csa_3G257050 [Cucumis sativus])

HSP 1 Score: 633.6 bits (1633), Expect = 1.9e-178
Identity = 307/343 (89.50%), Positives = 316/343 (92.13%), Query Frame = 1

Query: 1   MDGLLPFSYQ--PPEPIPLRRVYANVLNYVPGGRFHHFSDSSMRRSCAALTPLLSVFPHH 60
           M G+LPFSYQ  PPEPIP R VY +VLNYVP  RFHH  DSSMRRSC AL P LSVFPH 
Sbjct: 1   MYGVLPFSYQAPPPEPIPFRPVYVDVLNYVPVRRFHHCLDSSMRRSCTALRPSLSVFPHF 60

Query: 61  LKPTKLVQGYFSPCIRTRIKPALVHSPLLAGDGHGCGGNNNGGWNYSNPFGGFGWWQNDG 120
           LKPTKL QGY SPC  TRIKPALVHSPLLAGDGHGC GNNNGGWN SNPFGGFGWWQ DG
Sbjct: 61  LKPTKLFQGYSSPCNGTRIKPALVHSPLLAGDGHGCDGNNNGGWNNSNPFGGFGWWQYDG 120

Query: 121 DSPPWSDNAFLAFFFTSVLGCFCLFQLAAAVARNEMNYESVWEVKGGKRIRLILDTFRDE 180
           DSPPWSDNAFLAFFF+SVLGCFCLFQLA A+ARN MN ES+WEVKGGKRIRLILDT+RDE
Sbjct: 121 DSPPWSDNAFLAFFFSSVLGCFCLFQLAVALARNNMNTESIWEVKGGKRIRLILDTYRDE 180

Query: 181 FHVATGMPSSSLSFSFVNFWLRCSDVFRRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQV 240
           FHVATGMPSSSLSFSFVN WLRCSD+F RLMLPEGFPDSVTSDYLEYSLWRGVQGIASQV
Sbjct: 181 FHVATGMPSSSLSFSFVNVWLRCSDIFTRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQV 240

Query: 241 SGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKILLSKYGRHFDVHPKGWRLFA 300
           SGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKI LSKYGRHFDVHPKGWRLFA
Sbjct: 241 SGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKIFLSKYGRHFDVHPKGWRLFA 300

Query: 301 DLLENAAYGMEMITPAFPLHFVMIGAAAGAGRSAAALIQVIGS 342
           DLLENAAYGMEM+TPAFPLHFV+IGAAAGAGRSAAALIQVIGS
Sbjct: 301 DLLENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQVIGS 343

BLAST of ClCG05G012210 vs. NCBI nr
Match: gi|778680559|ref|XP_011651345.1| (PREDICTED: protein root UVB sensitive 1, chloroplastic [Cucumis sativus])

HSP 1 Score: 626.7 bits (1615), Expect = 2.4e-176
Identity = 303/339 (89.38%), Positives = 312/339 (92.04%), Query Frame = 1

Query: 1   MDGLLPFSYQ--PPEPIPLRRVYANVLNYVPGGRFHHFSDSSMRRSCAALTPLLSVFPHH 60
           M G+LPFSYQ  PPEPIP R VY +VLNYVP  RFHH  DSSMRRSC AL P LSVFPH 
Sbjct: 1   MYGVLPFSYQAPPPEPIPFRPVYVDVLNYVPVRRFHHCLDSSMRRSCTALRPSLSVFPHF 60

Query: 61  LKPTKLVQGYFSPCIRTRIKPALVHSPLLAGDGHGCGGNNNGGWNYSNPFGGFGWWQNDG 120
           LKPTKL QGY SPC  TRIKPALVHSPLLAGDGHGC GNNNGGWN SNPFGGFGWWQ DG
Sbjct: 61  LKPTKLFQGYSSPCNGTRIKPALVHSPLLAGDGHGCDGNNNGGWNNSNPFGGFGWWQYDG 120

Query: 121 DSPPWSDNAFLAFFFTSVLGCFCLFQLAAAVARNEMNYESVWEVKGGKRIRLILDTFRDE 180
           DSPPWSDNAFLAFFF+SVLGCFCLFQLA A+ARN MN ES+WEVKGGKRIRLILDT+RDE
Sbjct: 121 DSPPWSDNAFLAFFFSSVLGCFCLFQLAVALARNNMNTESIWEVKGGKRIRLILDTYRDE 180

Query: 181 FHVATGMPSSSLSFSFVNFWLRCSDVFRRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQV 240
           FHVATGMPSSSLSFSFVN WLRCSD+F RLMLPEGFPDSVTSDYLEYSLWRGVQGIASQV
Sbjct: 181 FHVATGMPSSSLSFSFVNVWLRCSDIFTRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQV 240

Query: 241 SGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKILLSKYGRHFDVHPKGWRLFA 300
           SGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKI LSKYGRHFDVHPKGWRLFA
Sbjct: 241 SGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKIFLSKYGRHFDVHPKGWRLFA 300

Query: 301 DLLENAAYGMEMITPAFPLHFVMIGAAAGAGRSAAALIQ 338
           DLLENAAYGMEM+TPAFPLHFV+IGAAAGAGRSAAALIQ
Sbjct: 301 DLLENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQ 339

BLAST of ClCG05G012210 vs. NCBI nr
Match: gi|659098056|ref|XP_008449956.1| (PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis melo])

HSP 1 Score: 614.4 bits (1583), Expect = 1.2e-172
Identity = 297/338 (87.87%), Positives = 311/338 (92.01%), Query Frame = 1

Query: 1   MDGLLPFSYQPP-EPIPLRRVYANVLNYVPGGRFHHFSDSSMRRSCAALTPLLSVFPHHL 60
           M G+LPFSYQPP E IPLRRVY +VL+YVP  RFHH  DSSMRRSC +L P LSVFPH L
Sbjct: 1   MYGVLPFSYQPPPELIPLRRVYVDVLSYVPVRRFHHCLDSSMRRSCKSLRPPLSVFPHFL 60

Query: 61  KPTKLVQGYFSPCIRTRIKPALVHSPLLAGDGHGCGGNNNGGWNYSNPFGGFGWWQNDGD 120
           KP KL +GY SPC  TRIKPALVHSPLLAGDG+GC GNNNGGWN SNPFGGFGWWQ D D
Sbjct: 61  KPAKLFRGYSSPCNGTRIKPALVHSPLLAGDGYGCDGNNNGGWNNSNPFGGFGWWQYDSD 120

Query: 121 SPPWSDNAFLAFFFTSVLGCFCLFQLAAAVARNEMNYESVWEVKGGKRIRLILDTFRDEF 180
           SPPWSDNAFLA FFTSVLGCFCLFQLA A+ARN+M  ES+WEVKGGKRIRLILDT+RDEF
Sbjct: 121 SPPWSDNAFLALFFTSVLGCFCLFQLAVALARNDMKTESIWEVKGGKRIRLILDTYRDEF 180

Query: 181 HVATGMPSSSLSFSFVNFWLRCSDVFRRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQVS 240
           HVATGMPSSSLSFSFVN WLRCSD+F+RLMLPEGFPDSVTSDYLEYSLWRGVQGIASQVS
Sbjct: 181 HVATGMPSSSLSFSFVNVWLRCSDIFKRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQVS 240

Query: 241 GVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKILLSKYGRHFDVHPKGWRLFAD 300
           GVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKI LSKYGRHFDVHPKGWRLFAD
Sbjct: 241 GVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKIFLSKYGRHFDVHPKGWRLFAD 300

Query: 301 LLENAAYGMEMITPAFPLHFVMIGAAAGAGRSAAALIQ 338
           LLENAAYGMEM+TPAFPLHFV+IGAAAGAGRSAAALIQ
Sbjct: 301 LLENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQ 338

BLAST of ClCG05G012210 vs. NCBI nr
Match: gi|470113387|ref|XP_004292905.1| (PREDICTED: protein root UVB sensitive 1, chloroplastic [Fragaria vesca subsp. vesca])

HSP 1 Score: 326.2 bits (835), Expect = 6.6e-86
Identity = 175/272 (64.34%), Positives = 198/272 (72.79%), Query Frame = 1

Query: 71  PCIRTRIKPALVHSPLLAGDGHGCGGNNN---GGWNYSNPFGGFGWWQNDGDSPPWSDNA 130
           P  R+R+ P L H       G   G NNN   GGWN  NPF    WW +D DS   S N 
Sbjct: 50  PQFRSRVLPPLKHFLTAPTGGSDAGNNNNNSGGGWN--NPFDSSSWWWHDDDSGGSSHNL 109

Query: 131 --FLAFFFTSVLGCFCLFQLAAAVARNEMNYESVWEVKGGKRIRLILDTFRDEFHVATGM 190
             F + F  +V  CFC  +LA A+A  E + ESVWEVKGGK  +L  D  RD F    G 
Sbjct: 110 ALFSSIFLAAVACCFCHLRLAYALASEE-DAESVWEVKGGKWTKLAPDFVRDAFVADGGG 169

Query: 191 PSSSLSFSFVNFWLRCSDVFRRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQ 250
              S+SF  +   L+C  +F +LMLPEGFPDSVTSDYL+YSLWR VQG+ASQVSGVLATQ
Sbjct: 170 GLGSISFESLG--LQCKSLFVQLMLPEGFPDSVTSDYLDYSLWRAVQGVASQVSGVLATQ 229

Query: 251 ALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKILLSKYGRHFDVHPKGWRLFADLLENAA 310
           ALLYAVGLGKGAIPTAAA+NWVLKDG GYLSKI+LSKYGRHFDV+PKGWRLFADLLENAA
Sbjct: 230 ALLYAVGLGKGAIPTAAALNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAA 289

Query: 311 YGMEMITPAFPLHFVMIGAAAGAGRSAAALIQ 338
           +GMEM+TP FP HF++IGAAAGAGRSAAALIQ
Sbjct: 290 FGMEMLTPVFPNHFLLIGAAAGAGRSAAALIQ 316

BLAST of ClCG05G012210 vs. NCBI nr
Match: gi|590680335|ref|XP_007040834.1| (Uncharacterized protein isoform 2 [Theobroma cacao])

HSP 1 Score: 319.7 bits (818), Expect = 6.2e-84
Identity = 172/272 (63.24%), Positives = 197/272 (72.43%), Query Frame = 1

Query: 69  FSPCIRTRIKPALVHSPLLAGDGHGCGGNNNGGWNYSNPFGGFGWWQNDGDSPPWSDNAF 128
           F P I      +L    L  G G GC GNNN   N   PFG   W  ND DS     + F
Sbjct: 47  FKPVIAAATTKSLPFPLLSHGHGGGCDGNNNN--NNDGPFGSDSWRWND-DSSSSHSHPF 106

Query: 129 LAFFFTSVLGCFCLFQLAAAVAR-NEMNYES--VWEVKGGKRIRLILDTFRDEFHVATGM 188
           L  F +S + CFC  QL++A+AR NE + E   VWEVKG K  +LI D   D F  + G+
Sbjct: 107 L-LFLSSFVACFCPSQLSSALARTNEDSQEDDVVWEVKGSKWTKLIPDFSEDAFVASNGI 166

Query: 189 PSSSLSFSFVNFWLRCSDVFRRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQ 248
            + + S S    W +C D+  RL+LPEGFPDSVTSDYL+YSLWRGVQG+ASQ+SGVLATQ
Sbjct: 167 VNLTKSLSLSTVWRQCRDIVMRLLLPEGFPDSVTSDYLDYSLWRGVQGVASQISGVLATQ 226

Query: 249 ALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKILLSKYGRHFDVHPKGWRLFADLLENAA 308
           ALLYAVGLGKGAIPTAAA+NWVLKDG GYLSKI+LSKYGRHFDV+PKGWRLFADLLENAA
Sbjct: 227 ALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAA 286

Query: 309 YGMEMITPAFPLHFVMIGAAAGAGRSAAALIQ 338
           +G+EM+TPAFP  FV IGAAAGAGRSAAALIQ
Sbjct: 287 FGLEMLTPAFPHLFVPIGAAAGAGRSAAALIQ 314

The following BLAST results are available for this feature:

Match Name          E-value     Identity   Description
RUS1_ARATH          1.6e-76     56.52      Protein root UVB sensitive 1, chloroplastic OS=Arabidopsis thaliana GN=RUS1 PE=1...
RUS1_MOUSE          2.5e-24     42.76      RUS1 family protein C16orf58 homolog OS=Mus musculus PE=1 SV=1
RUS1_RAT            3.2e-24     42.07      RUS1 family protein C16orf58 homolog OS=Rattus norvegicus PE=2 SV=1
RUS1_HUMAN          3.6e-23     37.06      RUS1 family protein C16orf58 OS=Homo sapiens GN=C16orf58 PE=1 SV=2
RUS1_PONAB          7.9e-23     37.06      RUS1 family protein C16orf58 homolog OS=Pongo abelii PE=2 SV=1

Match Name          E-value     Identity   Description
A0A0A0L6U1_CUCSA    1.3e-178    89.50      Uncharacterized protein OS=Cucumis sativus GN=Csa_3G257050 PE=4 SV=1
A0A061G6Z1_THECC    4.3e-84     63.24      Uncharacterized protein isoform 8 OS=Theobroma cacao GN=TCM_016682 PE=4 SV=1
A0A061GE65_THECC    4.3e-84     63.24      Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_016682 PE=4 SV=1
A0A061G662_THECC    4.3e-84     63.24      Uncharacterized protein isoform 7 OS=Theobroma cacao GN=TCM_016682 PE=4 SV=1
A0A061GE69_THECC    4.3e-84     63.24      Uncharacterized protein isoform 6 OS=Theobroma cacao GN=TCM_016682 PE=4 SV=1

Match Name          E-value     Identity   Description
AT3G45890.1         9.2e-78     56.52      Protein of unknown function, DUF647
AT1G13770.1         5.5e-22     38.37      Protein of unknown function, DUF647
AT5G01510.1         2.1e-21     36.55      Protein of unknown function, DUF647
AT5G49820.1         1.1e-19     38.17      Protein of unknown function, DUF647
AT2G31190.1         9.7e-19     36.57      Protein of unknown function, DUF647

Match Name                          E-value     Identity   Description
gi|700202567|gb|KGN57700.1|         1.9e-178    89.50      hypothetical protein Csa_3G257050 [Cucumis sativus]
gi|778680559|ref|XP_011651345.1|    2.4e-176    89.38      PREDICTED: protein root UVB sensitive 1, chloroplastic [Cucumis sativus]
gi|659098056|ref|XP_008449956.1|    1.2e-172    87.87      PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis melo]
gi|470113387|ref|XP_004292905.1|    6.6e-86     64.34      PREDICTED: protein root UVB sensitive 1, chloroplastic [Fragaria vesca subsp. ve...
gi|590680335|ref|XP_007040834.1|    6.2e-84     63.24      Uncharacterized protein isoform 2 [Theobroma cacao]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR006968   RUS_fam
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0032502 developmental process
biological_process GO:0010224 response to UV-B
biological_process GO:0007155 cell adhesion
biological_process GO:0008150 biological_process
cellular_component GO:0009941 chloroplast envelope
cellular_component GO:0005739 mitochondrion
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
molecular_function GO:0005540 hyaluronic acid binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
ClCG05G012210.1    ClCG05G012210.1    mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term    IPR Description             Source    Source Term      Source Description             Alignment
IPR006968   Root UVB sensitive family   PANTHER   PTHR12770        FAMILY NOT NAMED               coord: 109..337; score: 8.2E
IPR006968   Root UVB sensitive family   PFAM      PF04884          DUF647                         coord: 201..336; score: 9.7
None        No IPR available            PANTHER   PTHR12770:SF7    PROTEIN ROOT UVB SENSITIVE 1   coord: 109..337; score: 8.2E

The following gene(s) are paralogous to this gene:

None