ClCG05G021920 (gene) Watermelon (Charleston Gray)

NameClCG05G021920
Typegene
OrganismCitrullus lanatus (Watermelon (Charleston Gray))
DescriptionGlycosyl transferase, group 1
LocationCG_Chr05 : 33929567 .. 33931021 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGATGACGATCTCCACCATAACCAACAACCCACAGATCCACCATTTCCCAATCCCAACCAATCTCACTCCTTCAGATTCCGCCCTTCTGCGATTCACTTTTCATCCATCCTCATTCTCCTTCTAGCAATTTCCTTCTTCAGTTTCTCCAAAACAGATTTCTTCAAAACCCATTCCTTAAAACTCACCCGTCTTCTCAAAAATTCTAACCAAACCCCATTTCCTAATCCCTTCTGTGTTCTTTGGATGGCTCCATTTGTTTCTGGTGGTGGCTACAGTTCAGAAGCTTGGTCCTACATTTTAGCCCTTCATGATCATATAACAAACCCTGGATTTCGATTGGCCATTCAGCAACATGGCGATTTAGAATCCATCGACTTTTGGGAGGGCTTACCGGATTCTATCAGGAATCTGGCCATTGAACTTCACAGAACAAAATGTAGAATGAATGAAACCGTTGTGATTTGCCACAGTGAACCGGGTGCGTGGAATCCTCCTTTGTTTGAAACTTTGCCTTGCCCACCAGGTGCTTACCAAAATTTCAAGTCAGTGATTGGTAGAACAATGTTTGAAACTGATAGGGTAAGTCAAGAACATGTGAATCGATGTAATAGAATGGATTACGTTTGGGTTCCTTCTGAATTTCATGTCTCTACATTTGTGAAAAGTGGGGTTGATCCTTCTAAGATTGTGAAAGTTGTTCAACCTATTGATGTGAATTTCTTTGATCCATTGAAATACAAACCATTTAGTCTTGAATCTGTAGGAACATTAGTTTTAGGAGCCAAAAACTTGGAAGTAAGCTTAGAGAAGAAGGGATTTGTGTTTCTGAGTATCTTTAAATGGGAATTCAGGAAAGGTTGGGATTTGTTGTTGGAAGCATATTTGAGAGAATTTTGTAAGAAAGATGGAGTTGGGTTGTTTTTGTTGACAAATCCTTACCATACTGATAGTGATTTTGGGAATAAGATTTTGGATTTTGTAGAAAATTCAGACTTACAAATGCCACTTTCTGGTTGGGCTCCTGTTTATGTGGTTGATATTCATATACCTCAAACTGATTTGCCTAGAGTTTACAAGGCTGCTGATGCATTTGTACTTCCATCAAGAGGAGAAGGGTGGGGGAGGCCGCTCGTTGAAGCAATGGCGATGTCGTTGCCAGTGATTGCAACCAACTGGTCGGGGCAAACGGAGTTTTTGACCGATGAGAATAGCTATCCGTTGCCCGTTGAGAGAATGAGTGAAGTAAAGGAAGGGCCATTTGAAGGGCATCTGTGGGCTGAACCATCCATCAGTATACTTCAAGTTCTAATGAGGGAAGTAACAACTAATGTTGATGAAGCTAAGGCTAAAGGACGACGGGCAAGGGAGGACATGGTAAGCCGATTCTCGCCCAACATCGTTGCCGATATCGTTCATAGTCAGATACAAAATATATTCCATGAGAAGAGATGA

mRNA sequence

ATGGATGACGATCTCCACCATAACCAACAACCCACAGATCCACCATTTCCCAATCCCAACCAATCTCACTCCTTCAGATTCCGCCCTTCTGCGATTCACTTTTCATCCATCCTCATTCTCCTTCTAGCAATTTCCTTCTTCAGTTTCTCCAAAACAGATTTCTTCAAAACCCATTCCTTAAAACTCACCCGTCTTCTCAAAAATTCTAACCAAACCCCATTTCCTAATCCCTTCTGTGTTCTTTGGATGGCTCCATTTGTTTCTGGTGGTGGCTACAGTTCAGAAGCTTGGTCCTACATTTTAGCCCTTCATGATCATATAACAAACCCTGGATTTCGATTGGCCATTCAGCAACATGGCGATTTAGAATCCATCGACTTTTGGGAGGGCTTACCGGATTCTATCAGGAATCTGGCCATTGAACTTCACAGAACAAAATGTAGAATGAATGAAACCGTTGTGATTTGCCACAGTGAACCGGGTGCGTGGAATCCTCCTTTGTTTGAAACTTTGCCTTGCCCACCAGGTGCTTACCAAAATTTCAAGTCAGTGATTGGTAGAACAATGTTTGAAACTGATAGGGTAAGTCAAGAACATGTGAATCGATGTAATAGAATGGATTACGTTTGGGTTCCTTCTGAATTTCATGTCTCTACATTTGTGAAAAGTGGGGTTGATCCTTCTAAGATTGTGAAAGTTGTTCAACCTATTGATGTGAATTTCTTTGATCCATTGAAATACAAACCATTTAGTCTTGAATCTGTAGGAACATTAGTTTTAGGAGCCAAAAACTTGGAAGTAAGCTTAGAGAAGAAGGGATTTGTGTTTCTGAGTATCTTTAAATGGGAATTCAGGAAAGGTTGGGATTTGTTGTTGGAAGCATATTTGAGAGAATTTTGTAAGAAAGATGGAGTTGGGTTGTTTTTGTTGACAAATCCTTACCATACTGATAGTGATTTTGGGAATAAGATTTTGGATTTTGTAGAAAATTCAGACTTACAAATGCCACTTTCTGGTTGGGCTCCTGTTTATGTGGTTGATATTCATATACCTCAAACTGATTTGCCTAGAGTTTACAAGGCTGCTGATGCATTTGTACTTCCATCAAGAGGAGAAGGGTGGGGGAGGCCGCTCGTTGAAGCAATGGCGATGTCGTTGCCAGTGATTGCAACCAACTGGTCGGGGCAAACGGAGTTTTTGACCGATGAGAATAGCTATCCGTTGCCCGTTGAGAGAATGAGTGAAGTAAAGGAAGGGCCATTTGAAGGGCATCTGTGGGCTGAACCATCCATCAGTATACTTCAAGTTCTAATGAGGGAAGTAACAACTAATGTTGATGAAGCTAAGGCTAAAGGACGACGGGCAAGGGAGGACATGGTAAGCCGATTCTCGCCCAACATCGTTGCCGATATCGTTCATAGTCAGATACAAAATATATTCCATGAGAAGAGATGA

Coding sequence (CDS)

ATGGATGACGATCTCCACCATAACCAACAACCCACAGATCCACCATTTCCCAATCCCAACCAATCTCACTCCTTCAGATTCCGCCCTTCTGCGATTCACTTTTCATCCATCCTCATTCTCCTTCTAGCAATTTCCTTCTTCAGTTTCTCCAAAACAGATTTCTTCAAAACCCATTCCTTAAAACTCACCCGTCTTCTCAAAAATTCTAACCAAACCCCATTTCCTAATCCCTTCTGTGTTCTTTGGATGGCTCCATTTGTTTCTGGTGGTGGCTACAGTTCAGAAGCTTGGTCCTACATTTTAGCCCTTCATGATCATATAACAAACCCTGGATTTCGATTGGCCATTCAGCAACATGGCGATTTAGAATCCATCGACTTTTGGGAGGGCTTACCGGATTCTATCAGGAATCTGGCCATTGAACTTCACAGAACAAAATGTAGAATGAATGAAACCGTTGTGATTTGCCACAGTGAACCGGGTGCGTGGAATCCTCCTTTGTTTGAAACTTTGCCTTGCCCACCAGGTGCTTACCAAAATTTCAAGTCAGTGATTGGTAGAACAATGTTTGAAACTGATAGGGTAAGTCAAGAACATGTGAATCGATGTAATAGAATGGATTACGTTTGGGTTCCTTCTGAATTTCATGTCTCTACATTTGTGAAAAGTGGGGTTGATCCTTCTAAGATTGTGAAAGTTGTTCAACCTATTGATGTGAATTTCTTTGATCCATTGAAATACAAACCATTTAGTCTTGAATCTGTAGGAACATTAGTTTTAGGAGCCAAAAACTTGGAAGTAAGCTTAGAGAAGAAGGGATTTGTGTTTCTGAGTATCTTTAAATGGGAATTCAGGAAAGGTTGGGATTTGTTGTTGGAAGCATATTTGAGAGAATTTTGTAAGAAAGATGGAGTTGGGTTGTTTTTGTTGACAAATCCTTACCATACTGATAGTGATTTTGGGAATAAGATTTTGGATTTTGTAGAAAATTCAGACTTACAAATGCCACTTTCTGGTTGGGCTCCTGTTTATGTGGTTGATATTCATATACCTCAAACTGATTTGCCTAGAGTTTACAAGGCTGCTGATGCATTTGTACTTCCATCAAGAGGAGAAGGGTGGGGGAGGCCGCTCGTTGAAGCAATGGCGATGTCGTTGCCAGTGATTGCAACCAACTGGTCGGGGCAAACGGAGTTTTTGACCGATGAGAATAGCTATCCGTTGCCCGTTGAGAGAATGAGTGAAGTAAAGGAAGGGCCATTTGAAGGGCATCTGTGGGCTGAACCATCCATCAGTATACTTCAAGTTCTAATGAGGGAAGTAACAACTAATGTTGATGAAGCTAAGGCTAAAGGACGACGGGCAAGGGAGGACATGGTAAGCCGATTCTCGCCCAACATCGTTGCCGATATCGTTCATAGTCAGATACAAAATATATTCCATGAGAAGAGATGA

Protein sequence

MDDDLHHNQQPTDPPFPNPNQSHSFRFRPSAIHFSSILILLLAISFFSFSKTDFFKTHSLKLTRLLKNSNQTPFPNPFCVLWMAPFVSGGGYSSEAWSYILALHDHITNPGFRLAIQQHGDLESIDFWEGLPDSIRNLAIELHRTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHVNRCNRMDYVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGAKNLEVSLEKKGFVFLSIFKWEFRKGWDLLLEAYLREFCKKDGVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFEGHLWAEPSISILQVLMREVTTNVDEAKAKGRRAREDMVSRFSPNIVADIVHSQIQNIFHEKR
BLAST of ClCG05G021920 vs. TrEMBL
Match: A0A0A0KTD9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G613520 PE=4 SV=1)

HSP 1 Score: 859.0 bits (2218), Expect = 2.8e-246
Identity = 418/486 (86.01%), Postives = 442/486 (90.95%), Query Frame = 1

Query: 1   MDDDLHHNQQPTDPPFPNPNQSHSFRFRPSAIHFSSILILLLAISFFSFSKTDFFKTHSL 60
           MD DL H+    D PFP PNQ H F+F  S IHFSSILILLLAISFF+F KT+F+K+ S 
Sbjct: 1   MDADLRHD----DRPFPIPNQPHRFKFHLSPIHFSSILILLLAISFFAFPKTNFYKSQSS 60

Query: 61  KLTRLLKNSNQTPFPNPFCVLWMAPFVSGGGYSSEAWSYILALHDHITNPGFRLAIQQHG 120
           KLT LLK SNQ P  NP CVLWMAPF+SGGGYSSEAWSYILAL  HITNPGFRL I+ HG
Sbjct: 61  KLTNLLKFSNQPPGFNPLCVLWMAPFLSGGGYSSEAWSYILALRHHITNPGFRLVIRHHG 120

Query: 121 DLESIDFWEGLPDSIRNLAIELHRTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQN 180
           DLES+DFWEGLP+S+RNLAIELHRT+CRMNETVVICHSEPGAWNPPLFETLPCPPG YQ 
Sbjct: 121 DLESVDFWEGLPESVRNLAIELHRTRCRMNETVVICHSEPGAWNPPLFETLPCPPGPYQK 180

Query: 181 FKSVIGRTMFETDRVSQEHVNRCNRMDYVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVN 240
           FKSVIGRTMFETDRV++EHVNRCN MDYVWVPSEFHVSTFV+SGVDPSKIVKVVQP+DVN
Sbjct: 181 FKSVIGRTMFETDRVTREHVNRCNVMDYVWVPSEFHVSTFVESGVDPSKIVKVVQPVDVN 240

Query: 241 FFDPLKYKPFSLESVGTLVLGAKNL--EVSLEKKGFVFLSIFKWEFRKGWDLLLEAYLRE 300
           FFDPLKYKP SLESVGTLVLG KN   EV LEKK FVFLSIFKWEFRKGWD+LLEAYL+E
Sbjct: 241 FFDPLKYKPLSLESVGTLVLGGKNFEEEVKLEKKRFVFLSIFKWEFRKGWDVLLEAYLKE 300

Query: 301 FCKKDGVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRV 360
           F KKD VGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRV
Sbjct: 301 FSKKDEVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRV 360

Query: 361 YKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLPVERMSEVKE 420
           YKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLPVERMSEVKE
Sbjct: 361 YKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLPVERMSEVKE 420

Query: 421 GPFEGHLWAEPSISILQVLMREVTTNVDEAKAKGRRAREDMVSRFSPNIVADIVHSQIQN 480
            PF+GH+WAEPSIS LQVLMREVT NVDEAK KGRRAR+DM+ RFSP+IVADIVH QI+N
Sbjct: 421 EPFKGHMWAEPSISKLQVLMREVTVNVDEAKEKGRRARQDMIDRFSPDIVADIVHRQIEN 480

Query: 481 IFHEKR 485
           IFHEKR
Sbjct: 481 IFHEKR 482

BLAST of ClCG05G021920 vs. TrEMBL
Match: M5WH53_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004583mg PE=4 SV=1)

HSP 1 Score: 686.8 bits (1771), Expect = 1.9e-194
Identity = 331/500 (66.20%), Postives = 397/500 (79.40%), Query Frame = 1

Query: 4   DLHHNQQPTD-PPFPNPNQSHSFRFRPSAIHFSSILILLLAISFFSFSKTDFFKTHSLKL 63
           D +  QQP +    PNP    + + +P A + SSILIL+LAISF S +KT++ KT  LK 
Sbjct: 2   DSNETQQPIENQQQPNPTLPFTSKLKPYAFYLSSILILILAISF-SLTKTNYLKTQQLKY 61

Query: 64  T---------------RLLKNSNQTPFPNP-----FCVLWMAPFVSGGGYSSEAWSYILA 123
           T               +  + + QT  PNP     +CVLWMAPF+SGGGYSSE+WSYILA
Sbjct: 62  TFSSQPAIFQALFGFLQPKQKTKQTQVPNPISKPPYCVLWMAPFLSGGGYSSESWSYILA 121

Query: 124 LHDHITNPGFRLAIQQHGDLESIDFWEGLPDSIRNLAIELHRTKCRMNETVVICHSEPGA 183
           LH+H  NP FR+AI+QHGDLES++FW GLP  ++NLA+EL+ T+C M ET+VICHSEPGA
Sbjct: 122 LHEHSKNPNFRMAIEQHGDLESLEFWGGLPKYMKNLAVELYHTQCSMKETIVICHSEPGA 181

Query: 184 WNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHVNRCNRMDYVWVPSEFHVSTFVK 243
           WNPPLFETLPCPP AYQNFKSVIGRTMFETDRV+ EHV RCN+MDYVWVP+EFHVSTFV+
Sbjct: 182 WNPPLFETLPCPPTAYQNFKSVIGRTMFETDRVNPEHVKRCNQMDYVWVPTEFHVSTFVQ 241

Query: 244 SGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGAKNLEVSLEKKGFVFLSIFKW 303
           SGVD SK+VK+VQPIDV FFDPL+Y+P +L S+G  V+G K  + S  KK FVF+SIFKW
Sbjct: 242 SGVDKSKVVKIVQPIDVKFFDPLEYEPLNLASIGKFVMG-KTTQNSKVKKKFVFMSIFKW 301

Query: 304 EFRKGWDLLLEAYLREFCKKDGVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAP 363
           E+RKGWD+LL++YL EF + DGV L+LLTNPYH+D DFGNKI++FVE S +Q P++GWAP
Sbjct: 302 EYRKGWDVLLKSYLEEFSEADGVALYLLTNPYHSDRDFGNKIVEFVEKSGMQKPVTGWAP 361

Query: 364 VYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTD 423
           VYV+D HI Q DLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSG TE+LT+
Sbjct: 362 VYVIDTHIAQIDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGPTEYLTE 421

Query: 424 ENSYPLPVERMSEVKEGPFEGHLWAEPSISILQVLMREVTTNVDEAKAKGRRAREDMVSR 483
           ENSY LPV+RMS++ EGPF GH WAEPS+S L+VLMR V  NV+EAK KG +AREDM++R
Sbjct: 422 ENSYRLPVDRMSDIMEGPFRGHRWAEPSVSKLRVLMRHVLNNVEEAKVKGEKAREDMITR 481

BLAST of ClCG05G021920 vs. TrEMBL
Match: W9QKT8_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_012598 PE=4 SV=1)

HSP 1 Score: 663.3 bits (1710), Expect = 2.3e-187
Identity = 322/502 (64.14%), Postives = 390/502 (77.69%), Query Frame = 1

Query: 4   DLHHNQQPTDPPFPNPNQSHSFRFRPS-AIHFSSILILLLAISFFSFSKTDFFKTHSLKL 63
           D H NQ+P + P  N   S + + + + A +   +LIL LAIS  + + T+++KT  LK 
Sbjct: 2   DSHENQRPLNEPETNQTLSFTSKLKKAFAFYILPLLILFLAISL-TLNNTNYYKTQLLKH 61

Query: 64  T------------RLLKNSNQTP---------FPNPFCVLWMAPFVSGGGYSSEAWSYIL 123
           T             LL  + Q P            P+CVLWMAPF+SGGGYSSE+WSYIL
Sbjct: 62  TFSSNPNLFQTIFALLSPTQQKPRIKQSPKPISKPPYCVLWMAPFLSGGGYSSESWSYIL 121

Query: 124 ALHDHITNPGFRLAIQQHGDLESIDFWEGLPDSIRNLAIELHRTKCRM-NETVVICHSEP 183
           ALH+HI N  F+LAI  HGDLES++FWEGLP   +NLA+EL+ T+C M N+T+VICHSEP
Sbjct: 122 ALHEHIKNSSFKLAIDHHGDLESVEFWEGLPGPTKNLAVELYNTECTMSNKTLVICHSEP 181

Query: 184 GAWNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHVNRCNRMDYVWVPSEFHVSTF 243
           GAW PPLFET PCPPG YQNFK+VIGRTMFETDRV+ EHV RCNRMDYVWVP+EFHVSTF
Sbjct: 182 GAWYPPLFETSPCPPGVYQNFKAVIGRTMFETDRVNSEHVKRCNRMDYVWVPTEFHVSTF 241

Query: 244 VKSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGAKNLEVSLEKKGFVFLSIF 303
           V+SGVDPSK+VK+VQPIDV FFDPL+YKP +L SV +LV+G          + FVFLS+F
Sbjct: 242 VESGVDPSKVVKIVQPIDVKFFDPLEYKPLNLHSVESLVIGGPTRRPKSNSE-FVFLSVF 301

Query: 304 KWEFRKGWDLLLEAYLREFCKKDGVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGW 363
           KWE+RKGWD+LL+AYL EF   DGV L+LLTNPYHTDSDFGNKI++FVENS L+ P++GW
Sbjct: 302 KWEYRKGWDVLLKAYLEEFSGVDGVALYLLTNPYHTDSDFGNKIVEFVENSGLEKPVTGW 361

Query: 364 APVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFL 423
           APVYV+D HI Q DLPR+YKAADAFVLPSRGEGWGRP+VEAMAMSLPVIATNWSG TE++
Sbjct: 362 APVYVIDSHIDQIDLPRLYKAADAFVLPSRGEGWGRPIVEAMAMSLPVIATNWSGPTEYM 421

Query: 424 TDENSYPLPVERMSEVKEGPFEGHLWAEPSISILQVLMREVTTNVDEAKAKGRRAREDMV 483
           T+ENSYPLP ERMSEV EGPF GHLWAEPS+  L+VLMR V  N++EAKA+G++AREDM+
Sbjct: 422 TEENSYPLPPERMSEVTEGPFRGHLWAEPSVGKLRVLMRRVMNNIEEAKARGKKAREDMI 481

BLAST of ClCG05G021920 vs. TrEMBL
Match: A0A068TV54_COFCA (Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00029659001 PE=4 SV=1)

HSP 1 Score: 645.6 bits (1664), Expect = 4.9e-182
Identity = 309/484 (63.84%), Postives = 393/484 (81.20%), Query Frame = 1

Query: 21  QSHSFRFRPSAIHFSSILILLLAISFFSFSKTDFFKTHSLK---------LTRLLKN--- 80
           Q+H F F+    +  S+L+L +AIS  S  KT+ +K+H LK         L +LL++   
Sbjct: 18  QAHPFIFKQKLFYLLSVLVLFVAISL-SIPKTNHYKSHQLKSTLTSHPNFLNKLLRSLHP 77

Query: 81  -SNQT-PFPN------PFCVLWMAPFVSGGGYSSEAWSYILALHDHITNPG-----FRLA 140
            +NQ  P P+      P+C+LWMAPF+SGGGYSSEAWSYIL+L++++         FRL+
Sbjct: 78  IANQNAPNPSSISPTSPYCLLWMAPFLSGGGYSSEAWSYILSLNNYMKKNEPPRFKFRLS 137

Query: 141 IQQHGDLESIDFWEGLPDSIRNLAIELHRTKCRMNETVVICHSEPGAWNPPLFETLPCPP 200
           I+QHGDLE+++FWEGLP  +RNLAIEL+++KCR+NET+VICHSEPGAW PPLF+TLPCPP
Sbjct: 138 IEQHGDLENLEFWEGLPFGMRNLAIELYQSKCRLNETIVICHSEPGAWYPPLFQTLPCPP 197

Query: 201 GAYQNFKSVIGRTMFETDRVSQEHVNRCNRMDYVWVPSEFHVSTFVKSGVDPSKIVKVVQ 260
             + + K VIGRTMFETDRV+ EHV RCN+MDYVWVP+EFHV +FV+SGVDPSK+VK+VQ
Sbjct: 198 TGFGDVKVVIGRTMFETDRVNAEHVKRCNQMDYVWVPTEFHVRSFVQSGVDPSKVVKIVQ 257

Query: 261 PIDVNFFDPLKYKPFSLESVGTLVLGAKNLEVSLEKKGFVFLSIFKWEFRKGWDLLLEAY 320
           P+D+ FFDP+K++P  L S+ +LVLG++   +S+ +  FVFLS+FKWE+RKGWD+LL +Y
Sbjct: 258 PVDLEFFDPVKHEPLELASIRSLVLGSETKNLSMGRN-FVFLSVFKWEYRKGWDVLLRSY 317

Query: 321 LREFCKKDGVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDL 380
           L+EF   D V L+LLTNPYH+D DFGNKI+++VE+SDL+ P++GWAPVYV+D HI Q DL
Sbjct: 318 LKEFSNADDVALYLLTNPYHSDRDFGNKIVEYVEDSDLEKPVNGWAPVYVIDAHIAQVDL 377

Query: 381 PRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLPVERMSE 440
           PR+YKAADAFVLPSRGEGWGRP+VEAMAMSLPVIATNWSG TE+LT++NSYPLPVERMSE
Sbjct: 378 PRLYKAADAFVLPSRGEGWGRPVVEAMAMSLPVIATNWSGPTEYLTEDNSYPLPVERMSE 437

Query: 441 VKEGPFEGHLWAEPSISILQVLMREVTTNVDEAKAKGRRAREDMVSRFSPNIVADIVHSQ 480
           VKEGPF+GHLW+EPS+ +LQVLMR V TN ++AKAKG +AREDM+SRFSP+IVA IV   
Sbjct: 438 VKEGPFKGHLWSEPSVQLLQVLMRHVITNPEKAKAKGMQAREDMISRFSPDIVAQIVTES 497

BLAST of ClCG05G021920 vs. TrEMBL
Match: A0A0S3SP76_PHAAN (Uncharacterized protein OS=Vigna angularis var. angularis GN=Vigan.08G124400 PE=4 SV=1)

HSP 1 Score: 645.6 bits (1664), Expect = 4.9e-182
Identity = 312/480 (65.00%), Postives = 376/480 (78.33%), Query Frame = 1

Query: 17  PNP-NQSHSFRFRPSAIHFSSILILLLAISFFSFSKTDFFKTHSLK-----------LTR 76
           PNP + SH+ + R    H  S+L+LL +I ++ F++T+++  H LK           +  
Sbjct: 13  PNPESHSHTSKCRKFTFHSLSLLVLLTSI-YWGFTRTNYYTVHHLKYSLTSPSIIHAIES 72

Query: 77  LLKNSNQTPFPNPFCVLWMAPFVSGGGYSSEAWSYILALHDHITNPGFRLAIQQHGDLES 136
               S     P+  CVLWMAPF+SGGGYSSE WSYILALH H      RLAI  HGDLES
Sbjct: 73  FFLPSKPVSVPSNHCVLWMAPFLSGGGYSSEGWSYILALHGHRKMQSLRLAIDHHGDLES 132

Query: 137 IDFWEGLPDSIRNLAIELHRTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSV 196
           + FWEGLP  ++NLA EL++ +CRMNETVVICHSEPGAW PPLFET PCPP  Y NFKSV
Sbjct: 133 LAFWEGLPVHMKNLARELYQARCRMNETVVICHSEPGAWFPPLFETTPCPPSFYHNFKSV 192

Query: 197 IGRTMFETDRVSQEHVNRCNRMDYVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDP 256
           +GRTMFETDRV+ +HV RCN M+YVWVP+EFH+STFV+SGVDPSK+VK+VQP+DV FFDP
Sbjct: 193 VGRTMFETDRVNDQHVQRCNTMNYVWVPTEFHMSTFVQSGVDPSKVVKIVQPVDVKFFDP 252

Query: 257 LKYKPFSLESVGT--LVLGAKNLEVSLEKKGFVFLSIFKWEFRKGWDLLLEAYLREFCKK 316
           ++YKPF L S     LVLG++       KK FVFLSIFKWE+RKGWD+LL++YL+EF K 
Sbjct: 253 VRYKPFRLASSSRAKLVLGSR------VKKSFVFLSIFKWEYRKGWDVLLKSYLKEFSKD 312

Query: 317 DGVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAA 376
           DGV L+LLTNPYHTD +FGNKILDFV++S +  P+SGWAPVYV+D HIP +DLPRVYKAA
Sbjct: 313 DGVALYLLTNPYHTDQNFGNKILDFVDSSGMVKPVSGWAPVYVIDSHIPLSDLPRVYKAA 372

Query: 377 DAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFE 436
           DAFVLPSRGEGWGRP+VEAM+M+LPVIATNWSG TE+LTD+NSYPLPV+R+SEV EGPF+
Sbjct: 373 DAFVLPSRGEGWGRPMVEAMSMALPVIATNWSGPTEYLTDDNSYPLPVDRLSEVTEGPFK 432

Query: 437 GHLWAEPSISILQVLMREVTTNVDEAKAKGRRAREDMVSRFSPNIVADIVHSQIQNIFHE 483
           GHLWAEPS + LQVLMR+V  N+ EA A GR+AREDM++RFSP IVADIV   I NI  +
Sbjct: 433 GHLWAEPSENKLQVLMRQVMNNLTEATAIGRKAREDMIARFSPEIVADIVADNILNILRQ 485

BLAST of ClCG05G021920 vs. TAIR10
Match: AT3G10630.1 (AT3G10630.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 601.7 bits (1550), Expect = 4.1e-172
Identity = 294/473 (62.16%), Postives = 354/473 (74.84%), Query Frame = 1

Query: 32  IHFSSILILLLAISF---------------FSFSKTDFFKTHSLKL---TRLLKNSNQTP 91
           ++ SSIL LLL+I                 F+F+   F+      L       K+ ++T 
Sbjct: 18  VYSSSILFLLLSIFLLGFTNTDLYKVQSLRFTFTVNRFYSYLQFLLGFHDGTPKSKSETL 77

Query: 92  FP---NPFCVLWMAPFVSGGGYSSEAWSYILALHDHITNPGFRLAIQQHGDLESIDFWEG 151
            P    P CVLWMAPF+S GGYSSEAWSY+L+L +H+TNP FR+ I+ HGDLES++FW G
Sbjct: 78  NPASSTPHCVLWMAPFLSSGGYSSEAWSYVLSLRNHLTNPRFRITIEHHGDLESVEFWNG 137

Query: 152 LPDSIRNLAIELHRTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGRTMF 211
           L    + +AIE++R +CR NET+V+CHSEPGAW PPLFETLPCPP  Y++F SVIGRTMF
Sbjct: 138 LAKETKEVAIEMYREQCRPNETIVVCHSEPGAWYPPLFETLPCPPTGYEDFLSVIGRTMF 197

Query: 212 ETDRVSQEHVNRCNRMDYVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKYKPF 271
           ETDRV+ EHV RCN+MD+VWVP++FHVS+FV+SGVD SK+VK+VQP+DV FFDP KYKP 
Sbjct: 198 ETDRVNPEHVKRCNQMDHVWVPTDFHVSSFVQSGVDSSKVVKIVQPVDVGFFDPSKYKPL 257

Query: 272 SLESVGTLVLGAKNLEVSLEKKGFVFLSIFKWEFRKGWDLLLEAYLREFCKKDGVGLFLL 331
            L +VG LVLG      S  K GFVFLS+FKWE RKGWD+LL+AYL EF  +D V LFLL
Sbjct: 258 DLMAVGDLVLG------SGMKNGFVFLSVFKWEQRKGWDVLLKAYLSEFSGEDNVALFLL 317

Query: 332 TNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAFVLPSR 391
           TN YH+DSDFGNKILDFVE  +++ P +G+  VYV+D HI Q DLPR+YKAADAFVLP+R
Sbjct: 318 TNAYHSDSDFGNKILDFVEEMNIEEPRNGYPFVYVIDKHIAQVDLPRLYKAADAFVLPTR 377

Query: 392 GEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFEGHLWAEPS 451
           GEGWGRP+VEAMAMSLPVI TNWSG TE+LT+ N YPL VE MSEVKEGPFEGH WAEPS
Sbjct: 378 GEGWGRPIVEAMAMSLPVITTNWSGPTEYLTERNGYPLVVEEMSEVKEGPFEGHQWAEPS 437

Query: 452 ISILQVLMREVTTNVDEAKAKGRRAREDMVSRFSPNIVADIVHSQIQNIFHEK 484
           +  L+VLMR V +N DEAK KG+R R+DMV  F+P +VA +V  QI  IF EK
Sbjct: 438 VDKLRVLMRRVMSNPDEAKVKGKRGRDDMVKNFAPEVVAKVVADQIARIFDEK 484

BLAST of ClCG05G021920 vs. NCBI nr
Match: gi|449435172|ref|XP_004135369.1| (PREDICTED: uncharacterized protein LOC101204678 [Cucumis sativus])

HSP 1 Score: 859.0 bits (2218), Expect = 4.0e-246
Identity = 418/486 (86.01%), Postives = 442/486 (90.95%), Query Frame = 1

Query: 1   MDDDLHHNQQPTDPPFPNPNQSHSFRFRPSAIHFSSILILLLAISFFSFSKTDFFKTHSL 60
           MD DL H+    D PFP PNQ H F+F  S IHFSSILILLLAISFF+F KT+F+K+ S 
Sbjct: 1   MDADLRHD----DRPFPIPNQPHRFKFHLSPIHFSSILILLLAISFFAFPKTNFYKSQSS 60

Query: 61  KLTRLLKNSNQTPFPNPFCVLWMAPFVSGGGYSSEAWSYILALHDHITNPGFRLAIQQHG 120
           KLT LLK SNQ P  NP CVLWMAPF+SGGGYSSEAWSYILAL  HITNPGFRL I+ HG
Sbjct: 61  KLTNLLKFSNQPPGFNPLCVLWMAPFLSGGGYSSEAWSYILALRHHITNPGFRLVIRHHG 120

Query: 121 DLESIDFWEGLPDSIRNLAIELHRTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQN 180
           DLES+DFWEGLP+S+RNLAIELHRT+CRMNETVVICHSEPGAWNPPLFETLPCPPG YQ 
Sbjct: 121 DLESVDFWEGLPESVRNLAIELHRTRCRMNETVVICHSEPGAWNPPLFETLPCPPGPYQK 180

Query: 181 FKSVIGRTMFETDRVSQEHVNRCNRMDYVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVN 240
           FKSVIGRTMFETDRV++EHVNRCN MDYVWVPSEFHVSTFV+SGVDPSKIVKVVQP+DVN
Sbjct: 181 FKSVIGRTMFETDRVTREHVNRCNVMDYVWVPSEFHVSTFVESGVDPSKIVKVVQPVDVN 240

Query: 241 FFDPLKYKPFSLESVGTLVLGAKNL--EVSLEKKGFVFLSIFKWEFRKGWDLLLEAYLRE 300
           FFDPLKYKP SLESVGTLVLG KN   EV LEKK FVFLSIFKWEFRKGWD+LLEAYL+E
Sbjct: 241 FFDPLKYKPLSLESVGTLVLGGKNFEEEVKLEKKRFVFLSIFKWEFRKGWDVLLEAYLKE 300

Query: 301 FCKKDGVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRV 360
           F KKD VGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRV
Sbjct: 301 FSKKDEVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRV 360

Query: 361 YKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLPVERMSEVKE 420
           YKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLPVERMSEVKE
Sbjct: 361 YKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLPVERMSEVKE 420

Query: 421 GPFEGHLWAEPSISILQVLMREVTTNVDEAKAKGRRAREDMVSRFSPNIVADIVHSQIQN 480
            PF+GH+WAEPSIS LQVLMREVT NVDEAK KGRRAR+DM+ RFSP+IVADIVH QI+N
Sbjct: 421 EPFKGHMWAEPSISKLQVLMREVTVNVDEAKEKGRRARQDMIDRFSPDIVADIVHRQIEN 480

Query: 481 IFHEKR 485
           IFHEKR
Sbjct: 481 IFHEKR 482

BLAST of ClCG05G021920 vs. NCBI nr
Match: gi|659091807|ref|XP_008446743.1| (PREDICTED: uncharacterized protein LOC103489373 [Cucumis melo])

HSP 1 Score: 855.1 bits (2208), Expect = 5.8e-245
Identity = 413/479 (86.22%), Postives = 440/479 (91.86%), Query Frame = 1

Query: 8   NQQPTDPPFPNPNQSHSFRFRPSAIHFSSILILLLAISFFSFSKTDFFKTHSLKLTRLLK 67
           + +P D PFPNPNQ H F+   S IHFSSILILLLAISFF+F KT+F+K+ S KLT LLK
Sbjct: 4   DHRPNDRPFPNPNQPHRFKCHLSPIHFSSILILLLAISFFAFPKTNFYKSQSSKLTNLLK 63

Query: 68  NSNQTPFPNPFCVLWMAPFVSGGGYSSEAWSYILALHDHITNPGFRLAIQQHGDLESIDF 127
            SNQ P  NP CVLWMAPF+SGGGYSSEAWSYILAL  HITNPGFRL I+QHGDLES+DF
Sbjct: 64  TSNQPPGLNPSCVLWMAPFLSGGGYSSEAWSYILALRHHITNPGFRLVIRQHGDLESVDF 123

Query: 128 WEGLPDSIRNLAIELHRTKCRMNETVVICHSEPGAWNPPLFETLPCPPGAYQNFKSVIGR 187
           WEGLP+S+RNLAIELHRT+CRMNETVVICHSEPGAWNPPLFETLPCPPGAY+ FKSVIGR
Sbjct: 124 WEGLPESVRNLAIELHRTRCRMNETVVICHSEPGAWNPPLFETLPCPPGAYRKFKSVIGR 183

Query: 188 TMFETDRVSQEHVNRCNRMDYVWVPSEFHVSTFVKSGVDPSKIVKVVQPIDVNFFDPLKY 247
           TMFETDRV+QEHVNRCN MDYVWVPSEFHVSTFV+SGVDPSKIVKVVQP+DVNFFDPLKY
Sbjct: 184 TMFETDRVTQEHVNRCNVMDYVWVPSEFHVSTFVESGVDPSKIVKVVQPVDVNFFDPLKY 243

Query: 248 KPFSLESVGTLVLGAKNLEV--SLEKKGFVFLSIFKWEFRKGWDLLLEAYLREFCKKDGV 307
           KPFSLESVGTLVLG  N E    +EKK FVFLSIFKWEFRKGWDLLLEAYL+EF KKD V
Sbjct: 244 KPFSLESVGTLVLGGNNFEEVRLVEKKRFVFLSIFKWEFRKGWDLLLEAYLKEFSKKDEV 303

Query: 308 GLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAF 367
           GLFLLTNPYHT+SDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAF
Sbjct: 304 GLFLLTNPYHTESDFGNKILDFVENSDLQMPLSGWAPVYVVDIHIPQTDLPRVYKAADAF 363

Query: 368 VLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTDENSYPLPVERMSEVKEGPFEGHL 427
           VLPSRGEGWGRPLVEAMAMSLPVIATNWSG TEFLTDENSYPLPVERMSEVKE PF+GH+
Sbjct: 364 VLPSRGEGWGRPLVEAMAMSLPVIATNWSGPTEFLTDENSYPLPVERMSEVKEEPFKGHM 423

Query: 428 WAEPSISILQVLMREVTTNVDEAKAKGRRAREDMVSRFSPNIVADIVHSQIQNIFHEKR 485
           WAEPSIS LQVLMREVT NV+EAK KGRRAREDM++RFSP+IVADIVH QI+NIFHEKR
Sbjct: 424 WAEPSISKLQVLMREVTINVEEAKDKGRRAREDMINRFSPDIVADIVHRQIENIFHEKR 482

BLAST of ClCG05G021920 vs. NCBI nr
Match: gi|645241876|ref|XP_008227285.1| (PREDICTED: uncharacterized protein LOC103326817 [Prunus mume])

HSP 1 Score: 693.0 bits (1787), Expect = 3.8e-196
Identity = 332/500 (66.40%), Postives = 400/500 (80.00%), Query Frame = 1

Query: 4   DLHHNQQPTD-PPFPNPNQSHSFRFRPSAIHFSSILILLLAISFFSFSKTDFFKTHSLKL 63
           D +  Q+P +  P PNP    + + +P A + SSILIL+LAISF S +KT++ KT  LK 
Sbjct: 2   DSNETQRPIENQPQPNPTLPFTSKLKPYAFYLSSILILILAISF-SLTKTNYLKTQQLKY 61

Query: 64  T---------------RLLKNSNQTPFPNP-----FCVLWMAPFVSGGGYSSEAWSYILA 123
           T               +  + + QT  PNP     +CVLWMAPF+SGGGYSSE+WSYILA
Sbjct: 62  TFSSQPAIFQVLFGFLQPKQKTKQTQVPNPISKPPYCVLWMAPFLSGGGYSSESWSYILA 121

Query: 124 LHDHITNPGFRLAIQQHGDLESIDFWEGLPDSIRNLAIELHRTKCRMNETVVICHSEPGA 183
           LH+H   P FR+AI+QHGDLES++FWEGLP  ++NLA+EL+ T+C M ET+VICHSEPGA
Sbjct: 122 LHEHSKTPNFRMAIEQHGDLESLEFWEGLPKYMKNLAVELYHTQCSMKETIVICHSEPGA 181

Query: 184 WNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHVNRCNRMDYVWVPSEFHVSTFVK 243
           WNPPLFETLPCPP AYQNFKSVIGRTMFETDRV+ EHV RCN+MDYVWVP+EFHVSTF++
Sbjct: 182 WNPPLFETLPCPPTAYQNFKSVIGRTMFETDRVNPEHVKRCNQMDYVWVPTEFHVSTFIQ 241

Query: 244 SGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGAKNLEVSLEKKGFVFLSIFKW 303
           SGVD SK+VK+VQPIDV FFDPL+Y+P +L S+G LV+G K  + S   K FVF+SIFKW
Sbjct: 242 SGVDKSKVVKIVQPIDVKFFDPLEYEPLNLASIGKLVMG-KTTQNSKVMKKFVFMSIFKW 301

Query: 304 EFRKGWDLLLEAYLREFCKKDGVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAP 363
           E+RKGWD+LL++YL EF + DGV L+LLTNPYH+D DFGNKI++FVE S +Q P++GWAP
Sbjct: 302 EYRKGWDVLLKSYLEEFSEADGVALYLLTNPYHSDRDFGNKIVEFVEKSGMQKPVTGWAP 361

Query: 364 VYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTD 423
           VYV+D HI Q DLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSG TE+LT+
Sbjct: 362 VYVIDTHIAQIDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGPTEYLTE 421

Query: 424 ENSYPLPVERMSEVKEGPFEGHLWAEPSISILQVLMREVTTNVDEAKAKGRRAREDMVSR 483
           ENSYPLPV+RMS++ EGPF GH WAEPS+S L+VLMR V  NV+EAK KG +AREDM++R
Sbjct: 422 ENSYPLPVDRMSDIMEGPFRGHRWAEPSVSKLRVLMRHVLNNVEEAKVKGEKAREDMITR 481

BLAST of ClCG05G021920 vs. NCBI nr
Match: gi|595864780|ref|XP_007211835.1| (hypothetical protein PRUPE_ppa004583mg [Prunus persica])

HSP 1 Score: 686.8 bits (1771), Expect = 2.7e-194
Identity = 331/500 (66.20%), Postives = 397/500 (79.40%), Query Frame = 1

Query: 4   DLHHNQQPTD-PPFPNPNQSHSFRFRPSAIHFSSILILLLAISFFSFSKTDFFKTHSLKL 63
           D +  QQP +    PNP    + + +P A + SSILIL+LAISF S +KT++ KT  LK 
Sbjct: 2   DSNETQQPIENQQQPNPTLPFTSKLKPYAFYLSSILILILAISF-SLTKTNYLKTQQLKY 61

Query: 64  T---------------RLLKNSNQTPFPNP-----FCVLWMAPFVSGGGYSSEAWSYILA 123
           T               +  + + QT  PNP     +CVLWMAPF+SGGGYSSE+WSYILA
Sbjct: 62  TFSSQPAIFQALFGFLQPKQKTKQTQVPNPISKPPYCVLWMAPFLSGGGYSSESWSYILA 121

Query: 124 LHDHITNPGFRLAIQQHGDLESIDFWEGLPDSIRNLAIELHRTKCRMNETVVICHSEPGA 183
           LH+H  NP FR+AI+QHGDLES++FW GLP  ++NLA+EL+ T+C M ET+VICHSEPGA
Sbjct: 122 LHEHSKNPNFRMAIEQHGDLESLEFWGGLPKYMKNLAVELYHTQCSMKETIVICHSEPGA 181

Query: 184 WNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHVNRCNRMDYVWVPSEFHVSTFVK 243
           WNPPLFETLPCPP AYQNFKSVIGRTMFETDRV+ EHV RCN+MDYVWVP+EFHVSTFV+
Sbjct: 182 WNPPLFETLPCPPTAYQNFKSVIGRTMFETDRVNPEHVKRCNQMDYVWVPTEFHVSTFVQ 241

Query: 244 SGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGAKNLEVSLEKKGFVFLSIFKW 303
           SGVD SK+VK+VQPIDV FFDPL+Y+P +L S+G  V+G K  + S  KK FVF+SIFKW
Sbjct: 242 SGVDKSKVVKIVQPIDVKFFDPLEYEPLNLASIGKFVMG-KTTQNSKVKKKFVFMSIFKW 301

Query: 304 EFRKGWDLLLEAYLREFCKKDGVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWAP 363
           E+RKGWD+LL++YL EF + DGV L+LLTNPYH+D DFGNKI++FVE S +Q P++GWAP
Sbjct: 302 EYRKGWDVLLKSYLEEFSEADGVALYLLTNPYHSDRDFGNKIVEFVEKSGMQKPVTGWAP 361

Query: 364 VYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLTD 423
           VYV+D HI Q DLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSG TE+LT+
Sbjct: 362 VYVIDTHIAQIDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGPTEYLTE 421

Query: 424 ENSYPLPVERMSEVKEGPFEGHLWAEPSISILQVLMREVTTNVDEAKAKGRRAREDMVSR 483
           ENSY LPV+RMS++ EGPF GH WAEPS+S L+VLMR V  NV+EAK KG +AREDM++R
Sbjct: 422 ENSYRLPVDRMSDIMEGPFRGHRWAEPSVSKLRVLMRHVLNNVEEAKVKGEKAREDMITR 481

BLAST of ClCG05G021920 vs. NCBI nr
Match: gi|657951774|ref|XP_008354398.1| (PREDICTED: uncharacterized protein LOC103418016 [Malus domestica])

HSP 1 Score: 680.2 bits (1754), Expect = 2.6e-192
Identity = 328/501 (65.47%), Postives = 395/501 (78.84%), Query Frame = 1

Query: 4   DLHHNQQPTDPPFPNPNQSHSFRFRPSAIHFSSILILLLAISFFSFSKTDFFKT------ 63
           +L  NQQP + P  N     + + +P A + SSILIL+L IS F+F KT+F KT      
Sbjct: 2   NLRENQQPINRPQAN----RTLKLKPYAFYLSSILILVLTIS-FNFPKTNFLKTQLLKHT 61

Query: 64  ----------------HSLKLTRLLKNSNQTPFPNPFCVLWMAPFVSGGGYSSEAWSYIL 123
                           H +K  +  ++ N TP P P CVLWMAPF+SGGGYSSEAWSYIL
Sbjct: 62  FSSHPTTIVQNLFGFLHPIKKLQQTQSINXTPKP-PNCVLWMAPFLSGGGYSSEAWSYIL 121

Query: 124 ALHDHITNPGFRLAIQQHGDLESIDFWEGLPDSIRNLAIELHRTKCRMNETVVICHSEPG 183
           +L+ H  NP F++AI+QHGD ES++FWEGLP+ +R LAI L+ T+C M +TVVICHSEPG
Sbjct: 122 SLYQHSKNPNFQMAIEQHGDQESLEFWEGLPEYVRKLAIGLYNTQCSMKDTVVICHSEPG 181

Query: 184 AWNPPLFETLPCPPGAYQNFKSVIGRTMFETDRVSQEHVNRCNRMDYVWVPSEFHVSTFV 243
           AWNPPLFETLPCPP AYQ FKSVIGRTMFETDRV+ EHV RCNRMDYVWVP++FHVS+FV
Sbjct: 182 AWNPPLFETLPCPPAAYQKFKSVIGRTMFETDRVNAEHVKRCNRMDYVWVPTQFHVSSFV 241

Query: 244 KSGVDPSKIVKVVQPIDVNFFDPLKYKPFSLESVGTLVLGAKNLEVSLEKKGFVFLSIFK 303
           +SGVDPSK+VK+VQPIDV FFDPL Y+  +L SVG L++G    ++  +KK FVF+S+FK
Sbjct: 242 QSGVDPSKVVKIVQPIDVKFFDPLAYEQLNLASVGKLIMGKATQDLETKKK-FVFMSVFK 301

Query: 304 WEFRKGWDLLLEAYLREFCKKDGVGLFLLTNPYHTDSDFGNKILDFVENSDLQMPLSGWA 363
           WE+RKGWD+LL++YL EF + DGV L+LLTNPYH+D DFGNKI++F E S ++MP++GWA
Sbjct: 302 WEYRKGWDVLLKSYLEEFSEADGVALYLLTNPYHSDRDFGNKIVEFAEKSGIEMPVTGWA 361

Query: 364 PVYVVDIHIPQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGQTEFLT 423
           PVYV+D HI QTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSG TE+LT
Sbjct: 362 PVYVMDTHIAQTDLPRVYKAADAFVLPSRGEGWGRPLVEAMAMSLPVIATNWSGPTEYLT 421

Query: 424 DENSYPLPVERMSEVKEGPFEGHLWAEPSISILQVLMREVTTNVDEAKAKGRRAREDMVS 483
            ENSYPLPV+RMSEV EGPF GHLWAEPS+S L+VLMR V  NV+EAK KG++AREDM+ 
Sbjct: 422 AENSYPLPVDRMSEVMEGPFRGHLWAEPSVSKLRVLMRRVVNNVEEAKVKGKKAREDMIK 481

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
A0A0A0KTD9_CUCSA2.8e-24686.01Uncharacterized protein OS=Cucumis sativus GN=Csa_5G613520 PE=4 SV=1[more]
M5WH53_PRUPE1.9e-19466.20Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004583mg PE=4 SV=1[more]
W9QKT8_9ROSA2.3e-18764.14Uncharacterized protein OS=Morus notabilis GN=L484_012598 PE=4 SV=1[more]
A0A068TV54_COFCA4.9e-18263.84Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00029659001 PE=4 SV=1[more]
A0A0S3SP76_PHAAN4.9e-18265.00Uncharacterized protein OS=Vigna angularis var. angularis GN=Vigan.08G124400 PE=... [more]
Match NameE-valueIdentityDescription
AT3G10630.14.1e-17262.16 UDP-Glycosyltransferase superfamily protein[more]
Match NameE-valueIdentityDescription
gi|449435172|ref|XP_004135369.1|4.0e-24686.01PREDICTED: uncharacterized protein LOC101204678 [Cucumis sativus][more]
gi|659091807|ref|XP_008446743.1|5.8e-24586.22PREDICTED: uncharacterized protein LOC103489373 [Cucumis melo][more]
gi|645241876|ref|XP_008227285.1|3.8e-19666.40PREDICTED: uncharacterized protein LOC103326817 [Prunus mume][more]
gi|595864780|ref|XP_007211835.1|2.7e-19466.20hypothetical protein PRUPE_ppa004583mg [Prunus persica][more]
gi|657951774|ref|XP_008354398.1|2.6e-19265.47PREDICTED: uncharacterized protein LOC103418016 [Malus domestica][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR001296Glyco_trans_1
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0019375 galactolipid biosynthetic process
biological_process GO:0001666 response to hypoxia
biological_process GO:0009058 biosynthetic process
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0016020 membrane
molecular_function GO:0016740 transferase activity
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
ClCG05G021920.1ClCG05G021920.1mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001296Glycosyl transferase, family 1PFAMPF00534Glycos_transf_1coord: 349..404
score: 2.1
NoneNo IPR availableunknownCoilCoilcoord: 438..458
scor
NoneNo IPR availableGENE3DG3DSA:3.40.50.2000coord: 265..404
score: 6.8
NoneNo IPR availablePANTHERPTHR12526GLYCOSYLTRANSFERASEcoord: 421..484
score: 1.4E-110coord: 76..402
score: 1.4E
NoneNo IPR availablePANTHERPTHR12526:SF378TRANSCRIPTIONAL ACTIVATOR PROTEIN UGA3coord: 421..484
score: 1.4E-110coord: 76..402
score: 1.4E
NoneNo IPR availableunknownSSF53756UDP-Glycosyltransferase/glycogen phosphorylasecoord: 79..479
score: 8.9

The following gene(s) are paralogous to this gene:

None