Cla97C05G090050 (gene) Watermelon (97103) v2

NameCla97C05G090050
Typegene
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionO-glucosyltransferase rumi homolog
LocationCla97Chr05 : 8219385 .. 8223695 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGATGATCAGAGACGACGATTCTCGCCGGAAGTTTCAGAAGCACCTGATCTCCGCTCACAAACTGCTCCACTTCTTGAATGCATCGCGTGGTTCATCTGTTATCATTTCCTTCGCCGTTGTCGTCCTCGTCGGCATGTTTCTCTCCGGCCGACTCCTCGGCCTCTTACTTGTATGTTACATAAGCCCACGCAATTTAATTATTTCCTTTCTTCAACTCAGCTATAAATTTAATAACTTATAATAATTTTGTTCTTTTTCAAAAAATTTGCTTATTTCTGTAGGTGTGTTCTCAGATTACAAATGATCTAAGACAGGATTTTCAATTAAAAAAAAAAATGTAGATGTTTGGTCAAAGAAAAAAAAACCACTTTTCAAGTAATTGGATTGGTTAATATTAAGTTTTAGATAGGCTAATAATCATTTCTACAAAATGGTGTGATTTCCAGGCAGCCACTTTGGAAAAATTCAAAGCATGTTGTAATTACACATTTGCCCTAAAATTTAAAGTCATGATTTAACAATTTTAAAATTTTTGATCAAATTCATATTTTATTATGATCTAGTAGCATTACTAGACGATCATTATTTAACCACTAAATTGAATAATTTAAGTCAACAATAATATATGTCATAAGATGGACGTGTATCGAATGAAATATGTGCATAAAGATTTGAATATTATATTAGGATGTTTTTAAATATAACAAAATGAACTAATTTATTTATCAATAGGGTGGTTTTTAAATATAGAAAGATGAATTAAAATATTTACGAATATAGCAAATTTTATTATTTATTTGCAATATATTATAATATTTTGCTATTACTAGTAAATATTTTCAACAGTTTTGTACTTTAAAATAATTTTTCTCATACAAATATAACAAAATGTAACTATCTATTAGTAATAGACAATGATAGACATCTATTAGTGTCATTGCTATTTTTGAAAATATTTTCAACAGTTTTGTTATTTACAATTACTCTATTATATTTGACATTGGAATAATTATACGTTAATTATAGTTGAGTTAGACACATTTTAATGATTTGTGGTAGTTTATTAACAATATGACATCTCCAATTGAATGATTAGTTCTAGGTTTTTTTCTTTCTACAATATGTGGGATGAAAAAATCGAAACTCTAATTTTGAAATAAGTAGTATAAACACTATATCAATTGAGTTATAGTCATTTTGGCAAACCATTTTAGATGTTACAGGAGAGAGTTGAATAAATTAAATAATTAAAATGTTGGAAATAATATAGGGATTGAAATCGAACGTTGTGAATCAACAACCACAAGGAGAAACTAAAAAAGATCCAGATCATCCAATATCCGCGACGTGTCCAGAGTACTTCCGTTGGATTCACGAGGACCTACGACCGTGGGCCGGAAGAGGAATAACAAAGTCGATGTTGGAAGAGGCCCAAAAGAAGGCCCATTTCAGGCTGGTGGTGGTGGAGGGAAAGGCTTACGTGGAGGCGTACGGAAAGGCATATCAAAGCAGAGACAGTCTTACGGTGTGGGGGGTTGTACAGTTGTTACGTAGGTACCCGGGAAAACTGCCCGATCTTGATCTGATGTTTAGCTGTGAAGACCGGCCAGAGATCTATCAAAAGGACTACAAGGGGTCCCAGAAGCCGGCCCCACCTCCTTTGTTTCGGTATAGTGGAGATGATGCCACGTGGGACATCGTGTTTCCTGATTGGTCCTTCTGGGGATGGTAAGTTAAACTAGGTCTAAAGAGGAATTTATTTATAAATTTAAGGTTTAAATATGAAAAGGAATTTGGGTGTAGTAATTAAAACATTTCATAATTTCGTTGTGGTCTTGCACTTTTTTTTTTTTTTTTTTTTTTAAAATAATCAGACGACACCAAGTTGTCATGTGTTTTTTTTTCCCTTAAAAAAAATGATTTTTTAAATTTTAATTTTACTATTTAAAAGCAAAGAAATTAGGTGAAACTTAAGCTGCATTTAATTTTTTCTTTTAAATATTACTTTAGTCTTTATACTCTCGATTTCAGTTCATTTTGACTCTTTTACTTTTAAAATATTTATTTTGATTCCTATATTTTCAATTTTGATTCATTTTGGTTTTTATACTTTCAAAAGTTACCACTTTGATTCTTAAAAAAATGAAATGAAATTAAATGAGAATGTACCAAAATGATCATTTTTTAAAAGTATAAGAATTAAAACGAATAGTTTTAAAATATTAAAATAAAAGTTAAAAGTATATTGACCAAAATGAATTGATACATCAAATAACAAAATGAATTTGTTGGTAGGCCTGAAATCAACATAAAGCCATGGGAAGGGATATTGAAGGATATTAAAGAAGGTAATAAGAAAATGGAGTGGATGAAGAGGGAACCATATGCTTATTGGAAAGGAAATCCAACGGTGGCTTACACAAGGAGAGACCTTCTAAAGTGCAATGTCACCCACAAACAAGATTGGAATGCTCGTTTATATAGGCAGGTATACAAAAATAATTATTTCAACTATAACTCAACAACGTTAATCAATAAACGATCAAATGTCTTGAATTTAATATTTGGATATTCAAATGACCCAACACTAAAATATAATATTTATCTTTAATAATAATATTGATATATTTTTCTGTTCGTGGATGTAACTAACACATTATTAGTAAACTTGGTGTAGAATTGGAATAAAGAGTCCAAAACAGGGTTCAAAGACTCAAAGTTGTGCAACCAATGTGTTTATAGGTGAATAAAGTTGACAACTTTTTAATTTTTTTTTTTTAAGTAGTTGAAATATATATGAATGATAGTGTAATAAATTTGATGCAGGTACAAAATATATATTGAAGGAAAGGCATGGTCAGTAAGTGAAAAATACATTCTAGCATGTGATTCAGTAAGCCTAATTGTGAGGCCTCATTATTATGATTTCTTCACAAGGAGTTTGATCCCAATGAAACACTATTGGCCTATTAGCTCCAACCGCAAGTGTTCTTCTATCAAGTTTGCTGTTGATTGGGGCAACACCCATCATCAAAAGGTATTTTCTTTTTCTCTTTTTAGTTCATGGAACATCTAACGACGTCTCGATCGGATGTACATATTTTATAGTAGTTGAGTTAGGTTTATATTGAAGTAATTTCAATTAGAACACTCAATTTAGCTTAATGTATTAATTAAAACCTATAATTCTTATTAGACTTAATTAATGACATGTTTACAATATTTACAATTATTTTTGATAGGTTCAAAATCACTTTGAGATGTGATTTTAGTCATTCAAAATTAATTTAATGCTCAATTTTATGCTATTAAACGTAATTTTTTTTTAGTATAACACAAATGAGAGTCTTGAAAATTTGAACATTCAAATCTCCGAGGAGTTAATACACATTAATTATCTTTTATCTTAATTTAATTTGGCTACACAAAATGTATAGTTTTTTTGGGAGTTCAATAATAACATGCGTGGATGAAGATTTGATTTCATGATCTATTGGTTGAGATTACAGTATCTTTTTTAATTGAGCTAGGTTGGCAAAATATAATTGGTCTGACTAAAGGACAAATTAAATACTTATTTATTGAATGCATCTAATATATATGAGTTGTCATCCTAAAATCATGATAAATTGATAGATATATGGTGTCAAAGTGAGCATAGTTTAATGATAATCGACATGAGGTCATGTACTACCTTTCCTTGACATTGAAAGTTCAAATTTTTCGTTTGTTGTACAAAAGAAATTAGATGTGTAGTGAAAGTTAAAATGAGATAACTTTAAATAGTTAAATTCAACTCACCATCCTGGTTGATTACATCCTCGAATCCTAATAACAATTTTTCAATGATAAGATATAGTATGATAATTAATCAAGATATTTATAATCTACATGCACTCGCATCGTCGTATTAAATATTTGCATTTAAACTATGAGATTTCAGGCAATGGCCATTGGAAAGGCAGCAAGCAAGTTCATTGAAGAAGAGCTGAAAATGGAGTACATTTACGACTACATGTTTCATCTTCTAAACCAATATTCCAAGCTCTTAACGTTCAAACCGACTGTACCGCCAAACGCGACGGAGCTCTCGTCGGAATCCATGGCTTTGGCCGCAGAAGGGTCGATCAGAAAGTCAATGATGGAGTCGGCGGTGACGAGCCCTGCAGATTCCGGCCCCTGCGCGCTGCAGGCGCCGTACGATCCCCAAACTCTTCAACTTTTAATTAGAAGCAAAGAAGATTCTATTAAACAAGTGGAAAGATGGGAGAGAAGTTTCTCGCAAAATGGTCGGATTGTGCAAAAATGA

mRNA sequence

ATGATGATCAGAGACGACGATTCTCGCCGGAAGTTTCAGAAGCACCTGATCTCCGCTCACAAACTGCTCCACTTCTTGAATGCATCGCGTGGTTCATCTGTTATCATTTCCTTCGCCGTTGTCGTCCTCGTCGGCATGTTTCTCTCCGGCCGACTCCTCGGCCTCTTACTTGGATTGAAATCGAACGTTGTGAATCAACAACCACAAGGAGAAACTAAAAAAGATCCAGATCATCCAATATCCGCGACGTGTCCAGAGTACTTCCGTTGGATTCACGAGGACCTACGACCGTGGGCCGGAAGAGGAATAACAAAGTCGATGTTGGAAGAGGCCCAAAAGAAGGCCCATTTCAGGCTGGTGGTGGTGGAGGGAAAGGCTTACGTGGAGGCGTACGGAAAGGCATATCAAAGCAGAGACAGTCTTACGGTGTGGGGGGTTGTACAGTTGTTACGTAGGTACCCGGGAAAACTGCCCGATCTTGATCTGATGTTTAGCTGTGAAGACCGGCCAGAGATCTATCAAAAGGACTACAAGGGGTCCCAGAAGCCGGCCCCACCTCCTTTGTTTCGGTATAGTGGAGATGATGCCACGTGGGACATCGTGTTTCCTGATTGGTCCTTCTGGGGATGGCCTGAAATCAACATAAAGCCATGGGAAGGGATATTGAAGGATATTAAAGAAGGTAATAAGAAAATGGAGTGGATGAAGAGGGAACCATATGCTTATTGGAAAGGAAATCCAACGGTGGCTTACACAAGGAGAGACCTTCTAAAGTGCAATGTCACCCACAAACAAGATTGGAATGCTCGTTTATATAGGCAGAATTGGAATAAAGAGTCCAAAACAGGGTTCAAAGACTCAAAGTTGTGCAACCAATGTGTTTATAGGTACAAAATATATATTGAAGGAAAGGCATGGTCAGTAAGTGAAAAATACATTCTAGCATGTGATTCAGTAAGCCTAATTGTGAGGCCTCATTATTATGATTTCTTCACAAGGAGTTTGATCCCAATGAAACACTATTGGCCTATTAGCTCCAACCGCAAGTGTTCTTCTATCAAGTTTGCTGTTGATTGGGGCAACACCCATCATCAAAAGGCAATGGCCATTGGAAAGGCAGCAAGCAAGTTCATTGAAGAAGAGCTGAAAATGGAGTACATTTACGACTACATGTTTCATCTTCTAAACCAATATTCCAAGCTCTTAACGTTCAAACCGACTGTACCGCCAAACGCGACGGAGCTCTCGTCGGAATCCATGGCTTTGGCCGCAGAAGGGTCGATCAGAAAGTCAATGATGGAGTCGGCGGTGACGAGCCCTGCAGATTCCGGCCCCTGCGCGCTGCAGGCGCCGTACGATCCCCAAACTCTTCAACTTTTAATTAGAAGCAAAGAAGATTCTATTAAACAAGTGGAAAGATGGGAGAGAAGTTTCTCGCAAAATGGTCGGATTGTGCAAAAATGA

Coding sequence (CDS)

ATGATGATCAGAGACGACGATTCTCGCCGGAAGTTTCAGAAGCACCTGATCTCCGCTCACAAACTGCTCCACTTCTTGAATGCATCGCGTGGTTCATCTGTTATCATTTCCTTCGCCGTTGTCGTCCTCGTCGGCATGTTTCTCTCCGGCCGACTCCTCGGCCTCTTACTTGGATTGAAATCGAACGTTGTGAATCAACAACCACAAGGAGAAACTAAAAAAGATCCAGATCATCCAATATCCGCGACGTGTCCAGAGTACTTCCGTTGGATTCACGAGGACCTACGACCGTGGGCCGGAAGAGGAATAACAAAGTCGATGTTGGAAGAGGCCCAAAAGAAGGCCCATTTCAGGCTGGTGGTGGTGGAGGGAAAGGCTTACGTGGAGGCGTACGGAAAGGCATATCAAAGCAGAGACAGTCTTACGGTGTGGGGGGTTGTACAGTTGTTACGTAGGTACCCGGGAAAACTGCCCGATCTTGATCTGATGTTTAGCTGTGAAGACCGGCCAGAGATCTATCAAAAGGACTACAAGGGGTCCCAGAAGCCGGCCCCACCTCCTTTGTTTCGGTATAGTGGAGATGATGCCACGTGGGACATCGTGTTTCCTGATTGGTCCTTCTGGGGATGGCCTGAAATCAACATAAAGCCATGGGAAGGGATATTGAAGGATATTAAAGAAGGTAATAAGAAAATGGAGTGGATGAAGAGGGAACCATATGCTTATTGGAAAGGAAATCCAACGGTGGCTTACACAAGGAGAGACCTTCTAAAGTGCAATGTCACCCACAAACAAGATTGGAATGCTCGTTTATATAGGCAGAATTGGAATAAAGAGTCCAAAACAGGGTTCAAAGACTCAAAGTTGTGCAACCAATGTGTTTATAGGTACAAAATATATATTGAAGGAAAGGCATGGTCAGTAAGTGAAAAATACATTCTAGCATGTGATTCAGTAAGCCTAATTGTGAGGCCTCATTATTATGATTTCTTCACAAGGAGTTTGATCCCAATGAAACACTATTGGCCTATTAGCTCCAACCGCAAGTGTTCTTCTATCAAGTTTGCTGTTGATTGGGGCAACACCCATCATCAAAAGGCAATGGCCATTGGAAAGGCAGCAAGCAAGTTCATTGAAGAAGAGCTGAAAATGGAGTACATTTACGACTACATGTTTCATCTTCTAAACCAATATTCCAAGCTCTTAACGTTCAAACCGACTGTACCGCCAAACGCGACGGAGCTCTCGTCGGAATCCATGGCTTTGGCCGCAGAAGGGTCGATCAGAAAGTCAATGATGGAGTCGGCGGTGACGAGCCCTGCAGATTCCGGCCCCTGCGCGCTGCAGGCGCCGTACGATCCCCAAACTCTTCAACTTTTAATTAGAAGCAAAGAAGATTCTATTAAACAAGTGGAAAGATGGGAGAGAAGTTTCTCGCAAAATGGTCGGATTGTGCAAAAATGA

Protein sequence

MMIRDDDSRRKFQKHLISAHKLLHFLNASRGSSVIISFAVVVLVGMFLSGRLLGLLLGLKSNVVNQQPQGETKKDPDHPISATCPEYFRWIHEDLRPWAGRGITKSMLEEAQKKAHFRLVVVEGKAYVEAYGKAYQSRDSLTVWGVVQLLRRYPGKLPDLDLMFSCEDRPEIYQKDYKGSQKPAPPPLFRYSGDDATWDIVFPDWSFWGWPEINIKPWEGILKDIKEGNKKMEWMKREPYAYWKGNPTVAYTRRDLLKCNVTHKQDWNARLYRQNWNKESKTGFKDSKLCNQCVYRYKIYIEGKAWSVSEKYILACDSVSLIVRPHYYDFFTRSLIPMKHYWPISSNRKCSSIKFAVDWGNTHHQKAMAIGKAASKFIEEELKMEYIYDYMFHLLNQYSKLLTFKPTVPPNATELSSESMALAAEGSIRKSMMESAVTSPADSGPCALQAPYDPQTLQLLIRSKEDSIKQVERWERSFSQNGRIVQK
BLAST of Cla97C05G090050 vs. NCBI nr
Match: XP_008439224.1 (PREDICTED: protein O-glucosyltransferase 1-like [Cucumis melo])

HSP 1 Score: 862.4 bits (2227), Expect = 7.1e-247
Identity = 419/485 (86.39%), Postives = 442/485 (91.13%), Query Frame = 0

Query: 2   MIR--DDDSRRKFQK-HLISAHKLLHFLNASRGSSVIISFAVVVLVGMFLSGRLLGLLLG 61
           MIR  DD  R KF K H    HKLL F NA   SSVII FA VVL+GMFLS RL  LLL 
Sbjct: 1   MIRASDDSRRTKFPKQHFFGDHKLLSFSNAPPRSSVIIFFAAVVLLGMFLSCRLFSLLLE 60

Query: 62  LKSNVVNQQPQGETKKDPDHPISATCPEYFRWIHEDLRPWAGRGITKSMLEEAQKKAHFR 121
           LKSNV+NQQPQG++K+ PD P+ ATCPEYFRWIHEDL+PWAGRGITKSMLEEAQKKAHFR
Sbjct: 61  LKSNVMNQQPQGQSKQYPDRPMPATCPEYFRWIHEDLKPWAGRGITKSMLEEAQKKAHFR 120

Query: 122 LVVVEGKAYVEAYGKAYQSRDSLTVWGVVQLLRRYPGKLPDLDLMFSCEDRPEIYQKDYK 181
           L+VVEGKAYVEAYGKAYQSRD+LTVWGVVQLLRRYPGK+PDLDLMF+C+DRPEIYQKDY 
Sbjct: 121 LLVVEGKAYVEAYGKAYQSRDNLTVWGVVQLLRRYPGKVPDLDLMFNCDDRPEIYQKDYS 180

Query: 182 GSQKPAPPPLFRYSGDDATWDIVFPDWSFWGWPEINIKPWEGILKDIKEGNKKMEWMKRE 241
           G +KPAPPPLFRYSGDDATWDIVFPDWSFWGWPEINIK WE ILKDIKEGNKKMEWMKR+
Sbjct: 181 GPEKPAPPPLFRYSGDDATWDIVFPDWSFWGWPEINIKAWESILKDIKEGNKKMEWMKRQ 240

Query: 242 PYAYWKGNPTVAYTRRDLLKCNVTHKQDWNARLYRQNWNKESKTGFKDSKLCNQCVYRYK 301
           PYAYWKGNPTVAYTRRDLLKCNVT KQDW+ARLYRQNW+KESK GFKDS L NQCVYRYK
Sbjct: 241 PYAYWKGNPTVAYTRRDLLKCNVTQKQDWSARLYRQNWDKESKAGFKDSNLANQCVYRYK 300

Query: 302 IYIEGKAWSVSEKYILACDSVSLIVRPHYYDFFTRSLIPMKHYWPISSNRKCSSIKFAVD 361
           IYIEGKAWSVSEKYILACDSVSLIVRP YYDFFTRSLIPMKHYWPISSNRKCSSIKFAV 
Sbjct: 301 IYIEGKAWSVSEKYILACDSVSLIVRPRYYDFFTRSLIPMKHYWPISSNRKCSSIKFAVH 360

Query: 362 WGNTHHQKAMAIGKAASKFIEEELKMEYIYDYMFHLLNQYSKLLTFKPTVPPNATELSSE 421
           WGNTHHQKAMAIGKAASK IEEELKMEYIYDYMFHLLNQYSKLLTFKPTVPPNATELSS+
Sbjct: 361 WGNTHHQKAMAIGKAASKLIEEELKMEYIYDYMFHLLNQYSKLLTFKPTVPPNATELSSD 420

Query: 422 SMALAAEGS-IRKSMMESAVTSPADSGPCALQAPYDPQTLQLLIRSKEDSIKQVERWERS 481
           S+A AA+GS IRKSMMES VTSPA+S PCALQ PYDPQ+LQLL R KEDSIKQVE+WERS
Sbjct: 421 SLASAAKGSIIRKSMMESVVTSPAESSPCALQPPYDPQSLQLLFRRKEDSIKQVEKWERS 480

Query: 482 FSQNG 483
           FS+NG
Sbjct: 481 FSKNG 485

BLAST of Cla97C05G090050 vs. NCBI nr
Match: KGN57370.1 (hypothetical protein Csa_3G182060 [Cucumis sativus])

HSP 1 Score: 835.1 bits (2156), Expect = 1.2e-238
Identity = 412/520 (79.23%), Postives = 438/520 (84.23%), Query Frame = 0

Query: 5   DDDSRRKFQKHLISAHKLLHFLNASRGSSVIISFAVVVLVGMFLSGRLLGLLL------- 64
           +D  R KF K   SA KLL F NAS  SSVII FA V+L+ MFLS RLLGLLL       
Sbjct: 2   NDSRRPKFPKQHFSAEKLLSFSNASPRSSVIIFFAAVILLSMFLSSRLLGLLLXXXXXXX 61

Query: 65  ------------------------------GLKSNVVNQQPQGETKKDPDHPISATCPEY 124
                                             NV++Q+PQG+ K+DPD P+ ATCPEY
Sbjct: 62  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNVMSQEPQGQRKQDPDGPMVATCPEY 121

Query: 125 FRWIHEDLRPWAGRGITKSMLEEAQKKAHFRLVVVEGKAYVEAYGKAYQSRDSLTVWGVV 184
           FRWIHEDL+PWAGRGITKSMLEEAQKKAHFR+VVVEGKAYVEAYGKAYQSRD+LTVWGVV
Sbjct: 122 FRWIHEDLKPWAGRGITKSMLEEAQKKAHFRVVVVEGKAYVEAYGKAYQSRDNLTVWGVV 181

Query: 185 QLLRRYPGKLPDLDLMFSCEDRPEIYQKDYKGSQKPAPPPLFRYSGDDATWDIVFPDWSF 244
           QLLRRYPGKLPDLDLMFSC+DRPEIYQKDY G++KP+PPPLFRYSGDDATWDIVFPDWSF
Sbjct: 182 QLLRRYPGKLPDLDLMFSCDDRPEIYQKDYSGAEKPSPPPLFRYSGDDATWDIVFPDWSF 241

Query: 245 WGWPEINIKPWEGILKDIKEGNKKMEWMKREPYAYWKGNPTVAYTRRDLLKCNVTHKQDW 304
           WGWPEINIK WE +LKDIKEGNKKM WMKR+PYAYWKGNP VAYTRRDLLKCNVT KQDW
Sbjct: 242 WGWPEINIKAWESMLKDIKEGNKKMGWMKRQPYAYWKGNPAVAYTRRDLLKCNVTQKQDW 301

Query: 305 NARLYRQNWNKESKTGFKDSKLCNQCVYRYKIYIEGKAWSVSEKYILACDSVSLIVRPHY 364
           +ARLYRQNW+KESK GFKDS L NQC YRYKIYIEGKAWSVSEKYILACDSVSLIVRP Y
Sbjct: 302 SARLYRQNWDKESKAGFKDSNLANQCDYRYKIYIEGKAWSVSEKYILACDSVSLIVRPRY 361

Query: 365 YDFFTRSLIPMKHYWPISSNRKCSSIKFAVDWGNTHHQKAMAIGKAASKFIEEELKMEYI 424
           YDFFTRSLIPMKHYWPISSNRKCSSIKFAV WGNTH Q+AMAIGKAASK IEEELKMEYI
Sbjct: 362 YDFFTRSLIPMKHYWPISSNRKCSSIKFAVHWGNTHSQEAMAIGKAASKLIEEELKMEYI 421

Query: 425 YDYMFHLLNQYSKLLTFKPTVPPNATELSSESMALAAEGSIRKSMMESAVTSPADSGPCA 484
           YDYMFHLLNQYSKLLTFKPTVPPNATEL SES+A AA+GSIRKSMMES VTSPA+SGPCA
Sbjct: 422 YDYMFHLLNQYSKLLTFKPTVPPNATELLSESLASAAKGSIRKSMMESVVTSPAESGPCA 481

Query: 485 LQAPYDPQTLQLLIRSKEDSIKQVERWERS-FSQNGRIVQ 487
           LQ PYDPQ+LQLLIRSKEDSIKQVE+WERS F  NG IVQ
Sbjct: 482 LQPPYDPQSLQLLIRSKEDSIKQVEKWERSFFKNNGPIVQ 521

BLAST of Cla97C05G090050 vs. NCBI nr
Match: XP_022141173.1 (protein O-glucosyltransferase 1-like [Momordica charantia])

HSP 1 Score: 722.2 bits (1863), Expect = 1.1e-204
Identity = 353/494 (71.46%), Postives = 405/494 (81.98%), Query Frame = 0

Query: 3   IRDDDSRRKFQKHLISAHKLLHFLNASRGSSVIISFAVVVLVGMFLSGRLLGLLLGLKSN 62
           +R +DSR KF+K   S  KLL F  +     VI  FAV ++VG  LSGRLL +  GLKS+
Sbjct: 1   MRGEDSRPKFEKQ-FSGEKLLPFAKSPPRFPVIFFFAVALIVGGLLSGRLL-ISSGLKSD 60

Query: 63  VVNQQPQ-------------GETKKDPDHPISATCPEYFRWIHEDLRPWAGRGITKSMLE 122
           V   QP+              +TK+DPD P  ATCPEYFRWI+EDLRPWAG  ITK MLE
Sbjct: 61  VHPPQPRRHVEQLNGTTFNSTKTKQDPDGPPHATCPEYFRWIYEDLRPWAGTRITKGMLE 120

Query: 123 EAQKKAHFRLVVVEGKAYVEAYGKAYQSRDSLTVWGVVQLLRRYPGKLPDLDLMFSCEDR 182
            AQKKAHFRLV+V+GKAYVE Y KAYQSRD+LT+WGV+QLLRRYPGKLPDLDLMF+C+DR
Sbjct: 121 AAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNLTLWGVLQLLRRYPGKLPDLDLMFNCDDR 180

Query: 183 PEIYQKDYKGSQKPAPPPLFRYSGDDATWDIVFPDWSFWGWPEINIKPWEGILKDIKEGN 242
           PEIY+KDY G Q PAPPPLFRYSGDDAT DI FPDWS+WGWPEI IKPWE +LKDIKEGN
Sbjct: 181 PEIYRKDYSGPQAPAPPPLFRYSGDDATLDIAFPDWSYWGWPEIRIKPWEEMLKDIKEGN 240

Query: 243 KKMEWMKREPYAYWKGNPTVAYTRRDLLKCNVTHKQDWNARLYRQNWNKESKTGFKDSKL 302
           K+MEW+KREPYAYWKGNP+V++ R DLLKCN+T KQDWNARLYRQ+W +E K GFK S L
Sbjct: 241 KEMEWVKREPYAYWKGNPSVSHKRTDLLKCNLTRKQDWNARLYRQDWIEEWKGGFKGSNL 300

Query: 303 CNQCVYRYKIYIEGKAWSVSEKYILACDSVSLIVRPHYYDFFTRSLIPMKHYWPISSNRK 362
            +QCVYRYKIYIEGKAWS SEKYI+ACDSV+LIVRPHYYDFF+RSLIPM+HYWPIS NR 
Sbjct: 301 ADQCVYRYKIYIEGKAWSASEKYIMACDSVTLIVRPHYYDFFSRSLIPMQHYWPISPNRN 360

Query: 363 --CSSIKFAVDWGNTHHQKAMAIGKAASKFIEEELKMEYIYDYMFHLLNQYSKLLTFKPT 422
             CSSIKFAVDWGN+HHQKAMAIG+AASKFI+EEL M+Y+YDYMFHLLN+YSKLLTFKPT
Sbjct: 361 SICSSIKFAVDWGNSHHQKAMAIGEAASKFIQEELNMDYVYDYMFHLLNEYSKLLTFKPT 420

Query: 423 VPPNATELSSESMALAAEGSIRKSMMESAVTSPADSGPCALQAPYDPQTLQLLIRSKEDS 482
           VP NATELS ESMA A   S+RK MM+S V SPA S PCA++ PYDPQ+++L + +KE+S
Sbjct: 421 VPSNATELSLESMASAVRRSVRKWMMKSFVKSPAVSDPCAMKPPYDPQSMELWLTTKENS 480

BLAST of Cla97C05G090050 vs. NCBI nr
Match: XP_011652774.1 (PREDICTED: protein O-glucosyltransferase 1-like [Cucumis sativus])

HSP 1 Score: 629.4 bits (1622), Expect = 1.0e-176
Identity = 286/324 (88.27%), Postives = 305/324 (94.14%), Query Frame = 0

Query: 46  MFLSGRLLGLLLGLKSNVVNQQPQGETKKDPDHPISATCPEYFRWIHEDLRPWAGRGITK 105
           MFLS RLLGLLLGLKSNV++Q+PQG+ K+DPD P+ ATCPEYFRWIHEDL+PWAGRGITK
Sbjct: 1   MFLSSRLLGLLLGLKSNVMSQEPQGQRKQDPDGPMVATCPEYFRWIHEDLKPWAGRGITK 60

Query: 106 SMLEEAQKKAHFRLVVVEGKAYVEAYGKAYQSRDSLTVWGVVQLLRRYPGKLPDLDLMFS 165
           SMLEEAQKKAHFR+VVVEGKAYVEAYGKAYQSRD+LTVWGVVQLLRRYPGKLPDLDLMFS
Sbjct: 61  SMLEEAQKKAHFRVVVVEGKAYVEAYGKAYQSRDNLTVWGVVQLLRRYPGKLPDLDLMFS 120

Query: 166 CEDRPEIYQKDYKGSQKPAPPPLFRYSGDDATWDIVFPDWSFWGWPEINIKPWEGILKDI 225
           C+DRPEIYQKDY G++KP+PPPLFRYSGDDATWDIVFPDWSFWGWPEINIK WE +LKDI
Sbjct: 121 CDDRPEIYQKDYSGAEKPSPPPLFRYSGDDATWDIVFPDWSFWGWPEINIKAWESMLKDI 180

Query: 226 KEGNKKMEWMKREPYAYWKGNPTVAYTRRDLLKCNVTHKQDWNARLYRQNWNKESKTGFK 285
           KEGNKKM WMKR+PYAYWKGNP VAYTRRDLLKCNVT KQDW+ARLYRQNW+KESK GFK
Sbjct: 181 KEGNKKMGWMKRQPYAYWKGNPAVAYTRRDLLKCNVTQKQDWSARLYRQNWDKESKAGFK 240

Query: 286 DSKLCNQCVYRYKIYIEGKAWSVSEKYILACDSVSLIVRPHYYDFFTRSLIPMKHYWPIS 345
           DS L NQC YRYKIYIEGKAWSVSEKYILACDSVSLIVRP YYDFFTRSLIPMKHYWPIS
Sbjct: 241 DSNLANQCDYRYKIYIEGKAWSVSEKYILACDSVSLIVRPRYYDFFTRSLIPMKHYWPIS 300

Query: 346 SNRKCSSIKFAVDWGNTHHQKAMA 370
           SNRKCSSIKFAV WGNTH Q+  +
Sbjct: 301 SNRKCSSIKFAVHWGNTHSQEVFS 324

BLAST of Cla97C05G090050 vs. NCBI nr
Match: XP_023552264.1 (protein O-glucosyltransferase 1-like isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 619.0 bits (1595), Expect = 1.4e-173
Identity = 278/406 (68.47%), Postives = 336/406 (82.76%), Query Frame = 0

Query: 73  KKDPDHPISATCPEYFRWIHEDLRPWAGRGITKSMLEEAQKKAHFRLVVVEGKAYVEAYG 132
           ++DP+ P S+TCPEYFRWIHEDLRPWA  GIT++ LE A++ A+FRLV+V G AYVE Y 
Sbjct: 120 EEDPNLPPSSTCPEYFRWIHEDLRPWAQTGITRASLEAAKQTANFRLVIVNGTAYVETYE 179

Query: 133 KAYQSRDSLTVWGVVQLLRRYPGKLPDLDLMFSCEDRPEIYQKDYKGSQKPAPPPLFRYS 192
           K++Q+RD+ T+WG++QLLRRYPGK+PDL++MF C D P I   ++     PAPPPLFRY 
Sbjct: 180 KSFQTRDTFTLWGILQLLRRYPGKVPDLEMMFDCVDWPVILTTNFSDPNGPAPPPLFRYC 239

Query: 193 GDDATWDIVFPDWSFWGWPEINIKPWEGILKDIKEGNKKMEWMKREPYAYWKGNPTVAYT 252
           G+DAT D+VFPDWSFWGW EINIKPWE +LKD+KEGNK+  W  RE YAYWKGNP VA T
Sbjct: 240 GNDATLDVVFPDWSFWGWSEINIKPWEVLLKDLKEGNKRTPWKNREAYAYWKGNPEVAET 299

Query: 253 RRDLLKCNVTHKQDWNARLYRQNWNKESKTGFKDSKLCNQCVYRYKIYIEGKAWSVSEKY 312
           R+DLLKCNV+ +QDWNAR++ Q+W KES+ G+K+S L NQC++RYKIYIEG AWSVSEKY
Sbjct: 300 RKDLLKCNVSDQQDWNARVFAQDWMKESQQGYKESDLANQCLHRYKIYIEGSAWSVSEKY 359

Query: 313 ILACDSVSLIVRPHYYDFFTRSLIPMKHYWPISSNRKCSSIKFAVDWGNTHHQKAMAIGK 372
           ILACDSV+LIV+PHYYDFFTR L+P+ HYWP+  + KC SIKFAVDWGN+H QKA  IGK
Sbjct: 360 ILACDSVALIVKPHYYDFFTRGLMPLHHYWPVKDDDKCKSIKFAVDWGNSHKQKAKTIGK 419

Query: 373 AASKFIEEELKMEYIYDYMFHLLNQYSKLLTFKPTVPPNATELSSESMALAAEGSIRKSM 432
           AAS FI+EELKM+Y+YDYMFHLL++YSKLLTFKPT+P NA EL SE+MA  AEG  +K M
Sbjct: 420 AASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPTIPSNAIELCSEAMACPAEGLTKKFM 479

Query: 433 MESAVTSPADSGPCALQAPYDPQTLQLLIRSKEDSIKQVERWERSF 479
           MES V SPADS PCA+  PYDP +L L++R KEDSIKQVE WE +F
Sbjct: 480 MESLVKSPADSKPCAMPPPYDPASLHLVLRRKEDSIKQVEEWENTF 525

BLAST of Cla97C05G090050 vs. TrEMBL
Match: tr|A0A1S3AYX9|A0A1S3AYX9_CUCME (protein O-glucosyltransferase 1-like OS=Cucumis melo OX=3656 GN=LOC103484074 PE=4 SV=1)

HSP 1 Score: 862.4 bits (2227), Expect = 4.7e-247
Identity = 419/485 (86.39%), Postives = 442/485 (91.13%), Query Frame = 0

Query: 2   MIR--DDDSRRKFQK-HLISAHKLLHFLNASRGSSVIISFAVVVLVGMFLSGRLLGLLLG 61
           MIR  DD  R KF K H    HKLL F NA   SSVII FA VVL+GMFLS RL  LLL 
Sbjct: 1   MIRASDDSRRTKFPKQHFFGDHKLLSFSNAPPRSSVIIFFAAVVLLGMFLSCRLFSLLLE 60

Query: 62  LKSNVVNQQPQGETKKDPDHPISATCPEYFRWIHEDLRPWAGRGITKSMLEEAQKKAHFR 121
           LKSNV+NQQPQG++K+ PD P+ ATCPEYFRWIHEDL+PWAGRGITKSMLEEAQKKAHFR
Sbjct: 61  LKSNVMNQQPQGQSKQYPDRPMPATCPEYFRWIHEDLKPWAGRGITKSMLEEAQKKAHFR 120

Query: 122 LVVVEGKAYVEAYGKAYQSRDSLTVWGVVQLLRRYPGKLPDLDLMFSCEDRPEIYQKDYK 181
           L+VVEGKAYVEAYGKAYQSRD+LTVWGVVQLLRRYPGK+PDLDLMF+C+DRPEIYQKDY 
Sbjct: 121 LLVVEGKAYVEAYGKAYQSRDNLTVWGVVQLLRRYPGKVPDLDLMFNCDDRPEIYQKDYS 180

Query: 182 GSQKPAPPPLFRYSGDDATWDIVFPDWSFWGWPEINIKPWEGILKDIKEGNKKMEWMKRE 241
           G +KPAPPPLFRYSGDDATWDIVFPDWSFWGWPEINIK WE ILKDIKEGNKKMEWMKR+
Sbjct: 181 GPEKPAPPPLFRYSGDDATWDIVFPDWSFWGWPEINIKAWESILKDIKEGNKKMEWMKRQ 240

Query: 242 PYAYWKGNPTVAYTRRDLLKCNVTHKQDWNARLYRQNWNKESKTGFKDSKLCNQCVYRYK 301
           PYAYWKGNPTVAYTRRDLLKCNVT KQDW+ARLYRQNW+KESK GFKDS L NQCVYRYK
Sbjct: 241 PYAYWKGNPTVAYTRRDLLKCNVTQKQDWSARLYRQNWDKESKAGFKDSNLANQCVYRYK 300

Query: 302 IYIEGKAWSVSEKYILACDSVSLIVRPHYYDFFTRSLIPMKHYWPISSNRKCSSIKFAVD 361
           IYIEGKAWSVSEKYILACDSVSLIVRP YYDFFTRSLIPMKHYWPISSNRKCSSIKFAV 
Sbjct: 301 IYIEGKAWSVSEKYILACDSVSLIVRPRYYDFFTRSLIPMKHYWPISSNRKCSSIKFAVH 360

Query: 362 WGNTHHQKAMAIGKAASKFIEEELKMEYIYDYMFHLLNQYSKLLTFKPTVPPNATELSSE 421
           WGNTHHQKAMAIGKAASK IEEELKMEYIYDYMFHLLNQYSKLLTFKPTVPPNATELSS+
Sbjct: 361 WGNTHHQKAMAIGKAASKLIEEELKMEYIYDYMFHLLNQYSKLLTFKPTVPPNATELSSD 420

Query: 422 SMALAAEGS-IRKSMMESAVTSPADSGPCALQAPYDPQTLQLLIRSKEDSIKQVERWERS 481
           S+A AA+GS IRKSMMES VTSPA+S PCALQ PYDPQ+LQLL R KEDSIKQVE+WERS
Sbjct: 421 SLASAAKGSIIRKSMMESVVTSPAESSPCALQPPYDPQSLQLLFRRKEDSIKQVEKWERS 480

Query: 482 FSQNG 483
           FS+NG
Sbjct: 481 FSKNG 485

BLAST of Cla97C05G090050 vs. TrEMBL
Match: tr|A0A0A0L5W0|A0A0A0L5W0_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G182060 PE=4 SV=1)

HSP 1 Score: 835.1 bits (2156), Expect = 8.0e-239
Identity = 412/520 (79.23%), Postives = 438/520 (84.23%), Query Frame = 0

Query: 5   DDDSRRKFQKHLISAHKLLHFLNASRGSSVIISFAVVVLVGMFLSGRLLGLLL------- 64
           +D  R KF K   SA KLL F NAS  SSVII FA V+L+ MFLS RLLGLLL       
Sbjct: 2   NDSRRPKFPKQHFSAEKLLSFSNASPRSSVIIFFAAVILLSMFLSSRLLGLLLXXXXXXX 61

Query: 65  ------------------------------GLKSNVVNQQPQGETKKDPDHPISATCPEY 124
                                             NV++Q+PQG+ K+DPD P+ ATCPEY
Sbjct: 62  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNVMSQEPQGQRKQDPDGPMVATCPEY 121

Query: 125 FRWIHEDLRPWAGRGITKSMLEEAQKKAHFRLVVVEGKAYVEAYGKAYQSRDSLTVWGVV 184
           FRWIHEDL+PWAGRGITKSMLEEAQKKAHFR+VVVEGKAYVEAYGKAYQSRD+LTVWGVV
Sbjct: 122 FRWIHEDLKPWAGRGITKSMLEEAQKKAHFRVVVVEGKAYVEAYGKAYQSRDNLTVWGVV 181

Query: 185 QLLRRYPGKLPDLDLMFSCEDRPEIYQKDYKGSQKPAPPPLFRYSGDDATWDIVFPDWSF 244
           QLLRRYPGKLPDLDLMFSC+DRPEIYQKDY G++KP+PPPLFRYSGDDATWDIVFPDWSF
Sbjct: 182 QLLRRYPGKLPDLDLMFSCDDRPEIYQKDYSGAEKPSPPPLFRYSGDDATWDIVFPDWSF 241

Query: 245 WGWPEINIKPWEGILKDIKEGNKKMEWMKREPYAYWKGNPTVAYTRRDLLKCNVTHKQDW 304
           WGWPEINIK WE +LKDIKEGNKKM WMKR+PYAYWKGNP VAYTRRDLLKCNVT KQDW
Sbjct: 242 WGWPEINIKAWESMLKDIKEGNKKMGWMKRQPYAYWKGNPAVAYTRRDLLKCNVTQKQDW 301

Query: 305 NARLYRQNWNKESKTGFKDSKLCNQCVYRYKIYIEGKAWSVSEKYILACDSVSLIVRPHY 364
           +ARLYRQNW+KESK GFKDS L NQC YRYKIYIEGKAWSVSEKYILACDSVSLIVRP Y
Sbjct: 302 SARLYRQNWDKESKAGFKDSNLANQCDYRYKIYIEGKAWSVSEKYILACDSVSLIVRPRY 361

Query: 365 YDFFTRSLIPMKHYWPISSNRKCSSIKFAVDWGNTHHQKAMAIGKAASKFIEEELKMEYI 424
           YDFFTRSLIPMKHYWPISSNRKCSSIKFAV WGNTH Q+AMAIGKAASK IEEELKMEYI
Sbjct: 362 YDFFTRSLIPMKHYWPISSNRKCSSIKFAVHWGNTHSQEAMAIGKAASKLIEEELKMEYI 421

Query: 425 YDYMFHLLNQYSKLLTFKPTVPPNATELSSESMALAAEGSIRKSMMESAVTSPADSGPCA 484
           YDYMFHLLNQYSKLLTFKPTVPPNATEL SES+A AA+GSIRKSMMES VTSPA+SGPCA
Sbjct: 422 YDYMFHLLNQYSKLLTFKPTVPPNATELLSESLASAAKGSIRKSMMESVVTSPAESGPCA 481

Query: 485 LQAPYDPQTLQLLIRSKEDSIKQVERWERS-FSQNGRIVQ 487
           LQ PYDPQ+LQLLIRSKEDSIKQVE+WERS F  NG IVQ
Sbjct: 482 LQPPYDPQSLQLLIRSKEDSIKQVEKWERSFFKNNGPIVQ 521

BLAST of Cla97C05G090050 vs. TrEMBL
Match: tr|A0A0A0L5W3|A0A0A0L5W3_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G182110 PE=4 SV=1)

HSP 1 Score: 618.2 bits (1593), Expect = 1.5e-173
Identity = 278/408 (68.14%), Postives = 335/408 (82.11%), Query Frame = 0

Query: 72  TKKDPDHPISAT-CPEYFRWIHEDLRPWAGRGITKSMLEEAQKKAHFRLVVVEGKAYVEA 131
           T +D + P S++ CP+YFRWIHEDLRPWA  GIT++ LE  Q+ A+FRL+++ GKAYVE 
Sbjct: 123 TDEDQNPPSSSSACPDYFRWIHEDLRPWARTGITRATLEAGQRTANFRLLILNGKAYVET 182

Query: 132 YGKAYQSRDSLTVWGVVQLLRRYPGKLPDLDLMFSCEDRPEIYQKDYKGSQKPAPPPLFR 191
           Y K++Q+RD+ TVWG++QLLRRYPGK+PDLDLMF C D P I    + G   P PPPLFR
Sbjct: 183 YKKSFQTRDTFTVWGILQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFR 242

Query: 192 YSGDDATWDIVFPDWSFWGWPEINIKPWEGILKDIKEGNKKMEWMKREPYAYWKGNPTVA 251
           Y GDDAT+DIVFPDWSFWGWPEINIKPWE +LKDIKEGNK++ W  REPYAYWKGNP VA
Sbjct: 243 YCGDDATFDIVFPDWSFWGWPEINIKPWEPLLKDIKEGNKRIPWKSREPYAYWKGNPEVA 302

Query: 252 YTRRDLLKCNVTHKQDWNARLYRQNWNKESKTGFKDSKLCNQCVYRYKIYIEGKAWSVSE 311
            TR+DL+KCNV+ +QDWNAR++ Q+W KES+ G+K S L NQC++RYKIYIEG AWSVSE
Sbjct: 303 DTRKDLIKCNVSDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSE 362

Query: 312 KYILACDSVSLIVRPHYYDFFTRSLIPMKHYWPISSNRKCSSIKFAVDWGNTHHQKAMAI 371
           KYILACDSV+LIV+PHYYDFFTR L+P+ HYWP+  + KC SIKFAVDWGN+H QKA AI
Sbjct: 363 KYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDKCKSIKFAVDWGNSHKQKAQAI 422

Query: 372 GKAASKFIEEELKMEYIYDYMFHLLNQYSKLLTFKPTVPPNATELSSESMALAAEGSIRK 431
           GKAAS FI+EELKM+Y+YDYMFHLL++YSKLLTFKPT+PPNA EL SE+MA  AEG  +K
Sbjct: 423 GKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPTLPPNAIELCSEAMACPAEGLTKK 482

Query: 432 SMMESAVTSPADSGPCALQAPYDPQTLQLLIRSKEDSIKQVERWERSF 479
            M ES V  PA+S PC +  PYDP +L  ++  KE+SIKQVE+WE SF
Sbjct: 483 FMTESLVKRPAESNPCTMPPPYDPASLHFVLSRKENSIKQVEKWETSF 530

BLAST of Cla97C05G090050 vs. TrEMBL
Match: tr|A0A1S3AYX8|A0A1S3AYX8_CUCME (O-glucosyltransferase rumi homolog OS=Cucumis melo OX=3656 GN=LOC103484079 PE=4 SV=1)

HSP 1 Score: 616.3 bits (1588), Expect = 5.9e-173
Identity = 277/407 (68.06%), Postives = 330/407 (81.08%), Query Frame = 0

Query: 72  TKKDPDHPISATCPEYFRWIHEDLRPWAGRGITKSMLEEAQKKAHFRLVVVEGKAYVEAY 131
           T +  + P S TCPEYFRWIHEDLRPWA  GI+++ +E  Q+ A+FRLV++ GKAYVE Y
Sbjct: 123 TDEHENRPSSTTCPEYFRWIHEDLRPWARTGISRAAVEAGQRTANFRLVILNGKAYVETY 182

Query: 132 GKAYQSRDSLTVWGVVQLLRRYPGKLPDLDLMFSCEDRPEIYQKDYKGSQKPAPPPLFRY 191
            K++Q+RD+ TVWG++QLLRRYPGK+ DLDLMF C D P I    + G   P PPPLFRY
Sbjct: 183 KKSFQTRDTFTVWGILQLLRRYPGKVADLDLMFDCVDWPVILSSHFSGPDGPTPPPLFRY 242

Query: 192 SGDDATWDIVFPDWSFWGWPEINIKPWEGILKDIKEGNKKMEWMKREPYAYWKGNPTVAY 251
            GDD T DIVFPDWSFWGWPEINIKPWE +LKD+KEGNK++ W  REPYAYWKGNP VA 
Sbjct: 243 CGDDPTLDIVFPDWSFWGWPEINIKPWEPLLKDLKEGNKRILWKSREPYAYWKGNPEVAD 302

Query: 252 TRRDLLKCNVTHKQDWNARLYRQNWNKESKTGFKDSKLCNQCVYRYKIYIEGKAWSVSEK 311
           TR+DLLKCNV+ +QDWNAR++ Q+W KES+ G+K S L NQC++RYKIYIEG AWSVSEK
Sbjct: 303 TRKDLLKCNVSDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEK 362

Query: 312 YILACDSVSLIVRPHYYDFFTRSLIPMKHYWPISSNRKCSSIKFAVDWGNTHHQKAMAIG 371
           YILACDSV+LIV+PHYYDFFTR L+P+ HYWP+  + KC SIKFAVDWGN+H QKA AIG
Sbjct: 363 YILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDKCKSIKFAVDWGNSHKQKAQAIG 422

Query: 372 KAASKFIEEELKMEYIYDYMFHLLNQYSKLLTFKPTVPPNATELSSESMALAAEGSIRKS 431
           KAAS FI+EELKM+Y+YDYMFHLL++YSKLLTFKPTVPP A EL SE+MA  AEG  +K 
Sbjct: 423 KAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPTVPPTAIELCSEAMACPAEGLTKKF 482

Query: 432 MMESAVTSPADSGPCALQAPYDPQTLQLLIRSKEDSIKQVERWERSF 479
           M ES V  PA+S PC +  PYDP +L  ++R KE+SIKQVE+WE SF
Sbjct: 483 MTESLVKRPAESNPCTMPPPYDPASLHFVLRRKENSIKQVEKWETSF 529

BLAST of Cla97C05G090050 vs. TrEMBL
Match: tr|A0A218X3H9|A0A218X3H9_PUNGR (Uncharacterized protein OS=Punica granatum OX=22663 GN=CDL15_Pgr022949 PE=4 SV=1)

HSP 1 Score: 604.7 bits (1558), Expect = 1.8e-169
Identity = 269/411 (65.45%), Postives = 331/411 (80.54%), Query Frame = 0

Query: 71  ETKKDPDHPISATCPEYFRWIHEDLRPWAGRGITKSMLEEAQKKAHFRLVVVEGKAYVEA 130
           E +KDP+ P +  CP+YFRWIHEDL+PWA  GITK M+E+A++ A+FRLV++ GKAYVE 
Sbjct: 120 ELEKDPNRPSTPMCPDYFRWIHEDLKPWARTGITKDMVEKARRTANFRLVILNGKAYVET 179

Query: 131 YGKAYQSRDSLTVWGVVQLLRRYPGKLPDLDLMFSCEDRPEIYQKDYKGSQKPAPPPLFR 190
           Y K +Q+RD  T+WG++QLLRRYPGK+PDL+LMF C D P I  + Y+G     PPPLFR
Sbjct: 180 YEKGFQTRDVFTIWGILQLLRRYPGKVPDLELMFDCVDWPVIRSQLYRGPNVTGPPPLFR 239

Query: 191 YSGDDATWDIVFPDWSFWGWPEINIKPWEGILKDIKEGNKKMEWMKREPYAYWKGNPTVA 250
           Y G+DAT DIVFPDWSFWGWPEINIKPWE +L+D+KEGN++ +W+ REPYAYWKGNP VA
Sbjct: 240 YCGNDATLDIVFPDWSFWGWPEINIKPWEALLEDLKEGNRRTKWVDREPYAYWKGNPEVA 299

Query: 251 YTRRDLLKCNVTHKQDWNARLYRQNWNKESKTGFKDSKLCNQCVYRYKIYIEGKAWSVSE 310
            TR+DLLKCNV+ KQDWNAR+Y QNW KES+ G+K S L +QC++RYKIYIEG AWSVSE
Sbjct: 300 KTRQDLLKCNVSEKQDWNARVYAQNWGKESQEGYKQSNLASQCMHRYKIYIEGSAWSVSE 359

Query: 311 KYILACDSVSLIVRPHYYDFFTRSLIPMKHYWPISSNRKCSSIKFAVDWGNTHHQKAMAI 370
           KYILACDSV+LIV+PHYYDFFTR L+P+ HYWPI  + KC SIKFAVDWGN+H +KA  I
Sbjct: 360 KYILACDSVTLIVKPHYYDFFTRGLLPVHHYWPIREDSKCKSIKFAVDWGNSHKKKAQEI 419

Query: 371 GKAASKFIEEELKMEYIYDYMFHLLNQYSKLLTFKPTVPPNATELSSESMALAAEGSIRK 430
           GKAAS FI ++LKM+ +YDYMFHLLN Y+KLLT+KPT+P  ATEL SE+MA  A+G ++K
Sbjct: 420 GKAASSFIHDDLKMDLVYDYMFHLLNGYAKLLTYKPTIPEKATELCSEAMACEAQGLVKK 479

Query: 431 SMMESAVTSPADSGPCALQAPYDPQTLQLLIRSKEDSIKQVERWERSFSQN 482
            M++S V  P DS PC L  P+DP +L  L R KE+S+KQVE WE  +  N
Sbjct: 480 FMLDSLVKGPRDSDPCNLPPPFDPVSLFSLRRRKENSVKQVETWESMYYDN 530

BLAST of Cla97C05G090050 vs. Swiss-Prot
Match: sp|Q5E9Q1|PGLT1_BOVIN (Protein O-glucosyltransferase 1 OS=Bos taurus OX=9913 GN=POGLUT1 PE=2 SV=1)

HSP 1 Score: 120.6 bits (301), Expect = 5.0e-26
Identity = 91/343 (26.53%), Postives = 156/343 (45.48%), Query Frame = 0

Query: 81  SATCPEYFRWIHEDLRPWAGRGITKSMLEEAQKKAHFRLVVVEGKAYVEAYGKAYQSRDS 140
           S  C  Y   I EDL P+ G    K M E  ++K      +++ + Y E+    + SR S
Sbjct: 51  SPNCSCYHGVIEEDLTPFRGGISRKMMAEVVRRKLGTHYQIIKNRLYRES-DCMFPSRCS 110

Query: 141 LTVWGVVQLLRRYPGKLPDLDLMFSCEDRPEIYQKDYKGSQKPAPPPLFRYSGDDATWDI 200
               GV   +    G+LPD++++ +  D P++ +       +PA  P+F +S      DI
Sbjct: 111 ----GVEHFILEVIGRLPDMEMVINVRDYPQVPK-----WMEPA-IPIFSFSKTLEYHDI 170

Query: 201 VFPDWSFWG-----WP--EINIKPWEGILKDIKEGNKKMEWMKREPYAYWKG-------N 260
           ++P W+FW      WP   + +  W+   +D+     +  W K+   AY++G       +
Sbjct: 171 MYPAWTFWEGGPAVWPIYPMGLGRWDLFREDLVRSAAQWPWKKKNSTAYFRGSRTSPERD 230

Query: 261 PTVAYTRRD--LLKCNVTHKQDWNARLYRQNWNKESKTGFKDSKLCNQCVYRYKIYIEGK 320
           P +  +R++  L+    T  Q W +       +   K   KD  L + C Y+Y     G 
Sbjct: 231 PLILLSRKNPKLVDAEYTKNQAWKSMK-----DTLGKPAAKDVHLVDHCKYKYLFNFRGV 290

Query: 321 AWSVSEKYILACDSVSLIVRPHYYDFFTRSLIPMKHYWPISSNRKCSSIKFAVDWGNTHH 380
           A S   K++  C S+   V   + +FF   L P  HY P+ ++   S+++  + +   + 
Sbjct: 291 AASFRFKHLFLCGSLVFHVGDEWLEFFYPQLKPWVHYIPVKTD--LSNVQELLQFVKAND 350

Query: 381 QKAMAIGKAASKFIEEELKMEYIYDYMFHLLNQYSKLLTFKPT 408
             A  I +  S+FI   LKM+ I  Y  +LL +YSK L++  T
Sbjct: 351 DVAQEIAERGSQFILNHLKMDDITCYWENLLTEYSKFLSYNVT 375

BLAST of Cla97C05G090050 vs. Swiss-Prot
Match: sp|Q8NBL1|PGLT1_HUMAN (Protein O-glucosyltransferase 1 OS=Homo sapiens OX=9606 GN=POGLUT1 PE=1 SV=1)

HSP 1 Score: 118.6 bits (296), Expect = 1.9e-25
Identity = 90/343 (26.24%), Postives = 154/343 (44.90%), Query Frame = 0

Query: 81  SATCPEYFRWIHEDLRPWAGRGITKSMLEEAQKKAHFRLVVVEGKAYVEAYGKAYQSRDS 140
           S  C  Y   I EDL P+ G    K M E  ++K      + + + Y E     + SR S
Sbjct: 51  SQNCSCYHGVIEEDLTPFRGGISRKMMAEVVRRKLGTHYQITKNRLYRE-NDCMFPSRCS 110

Query: 141 LTVWGVVQLLRRYPGKLPDLDLMFSCEDRPEIYQKDYKGSQKPAPPPLFRYSGDDATWDI 200
               GV   +    G+LPD++++ +  D P++ +       +PA  P+F +S      DI
Sbjct: 111 ----GVEHFILEVIGRLPDMEMVINVRDYPQVPK-----WMEPA-IPVFSFSKTSEYHDI 170

Query: 201 VFPDWSFWG-----WP--EINIKPWEGILKDIKEGNKKMEWMKREPYAYWKG-------N 260
           ++P W+FW      WP     +  W+   +D+     +  W K+   AY++G       +
Sbjct: 171 MYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKKKNSTAYFRGSRTSPERD 230

Query: 261 PTVAYTRRD--LLKCNVTHKQDWNARLYRQNWNKESKTGFKDSKLCNQCVYRYKIYIEGK 320
           P +  +R++  L+    T  Q W +       +   K   KD  L + C Y+Y     G 
Sbjct: 231 PLILLSRKNPKLVDAEYTKNQAWKSMK-----DTLGKPAAKDVHLVDHCKYKYLFNFRGV 290

Query: 321 AWSVSEKYILACDSVSLIVRPHYYDFFTRSLIPMKHYWPISSNRKCSSIKFAVDWGNTHH 380
           A S   K++  C S+   V   + +FF   L P  HY P+ ++   S+++  + +   + 
Sbjct: 291 AASFRFKHLFLCGSLVFHVGDEWLEFFYPQLKPWVHYIPVKTD--LSNVQELLQFVKAND 350

Query: 381 QKAMAIGKAASKFIEEELKMEYIYDYMFHLLNQYSKLLTFKPT 408
             A  I +  S+FI   L+M+ I  Y  +LL++YSK L++  T
Sbjct: 351 DVAQEIAERGSQFIRNHLQMDDITCYWENLLSEYSKFLSYNVT 375

BLAST of Cla97C05G090050 vs. Swiss-Prot
Match: sp|Q8BYB9|PGLT1_MOUSE (Protein O-glucosyltransferase 1 OS=Mus musculus OX=10090 GN=Poglut1 PE=1 SV=2)

HSP 1 Score: 113.2 bits (282), Expect = 8.0e-24
Identity = 90/345 (26.09%), Postives = 150/345 (43.48%), Query Frame = 0

Query: 81  SATCPEYFRWIHEDLRPWAGRGITKSMLEEAQKKAHFRLVVVEGKAYVEAYGKAYQSRDS 140
           S  C  Y   I EDL P+ G    K M E  ++K      +++ + + E     + SR S
Sbjct: 51  SQNCSCYHGVIEEDLTPFRGGISRKMMAEVVRRKLGTHYQIIKNRLFRED-DCMFPSRCS 110

Query: 141 LTVWGVVQLLRRYPGKLPDLDLMFSCEDRPEIYQKDYKGSQKPAPP--PLFRYSGDDATW 200
               GV   +     +LPD++++ +  D P++         K   P  P+F +S      
Sbjct: 111 ----GVEHFILEVIHRLPDMEMVINVRDYPQV--------PKWMEPTIPVFSFSKTSEYH 170

Query: 201 DIVFPDWSFWG-----WP--EINIKPWEGILKDIKEGNKKMEWMKREPYAYWKG------ 260
           DI++P W+FW      WP     +  W+   +D+     +  W K+   AY++G      
Sbjct: 171 DIMYPAWTFWEGGPAVWPLYPTGLGRWDLFREDLLRSAAQWPWEKKNSTAYFRGSRTSPE 230

Query: 261 -NPTVAYTRRD--LLKCNVTHKQDWNARLYRQNWNKESKTGFKDSKLCNQCVYRYKIYIE 320
            +P +  +R++  L+    T  Q W +       +   K   KD  L + C YRY     
Sbjct: 231 RDPLILLSRKNPKLVDAEYTKNQAWKSMK-----DTLGKPAAKDVHLIDHCKYRYLFNFR 290

Query: 321 GKAWSVSEKYILACDSVSLIVRPHYYDFFTRSLIPMKHYWPISSNRKCSSIKFAVDWGNT 380
           G A S   K++  C S+   V   + +FF   L P  HY P+ ++   S+++  + +   
Sbjct: 291 GVAASFRFKHLFLCGSLVFHVGDEWVEFFYPQLKPWVHYIPVKTD--LSNVQELLQFVKA 350

Query: 381 HHQKAMAIGKAASKFIEEELKMEYIYDYMFHLLNQYSKLLTFKPT 408
           +   A  I K  S+FI   L+M+ I  Y  +LL  YSK L++  T
Sbjct: 351 NDDIAQEIAKRGSQFIINHLQMDDITCYWENLLTDYSKFLSYNVT 375

BLAST of Cla97C05G090050 vs. Swiss-Prot
Match: sp|G3V9D0|PGLT1_RAT (Protein O-glucosyltransferase 1 OS=Rattus norvegicus OX=10116 GN=Poglut1 PE=3 SV=1)

HSP 1 Score: 112.5 bits (280), Expect = 1.4e-23
Identity = 88/345 (25.51%), Postives = 153/345 (44.35%), Query Frame = 0

Query: 81  SATCPEYFRWIHEDLRPWAGRGITKSMLEEAQKKAHFRLVVVEGKAYVEAYGKAYQSRDS 140
           S  C  Y   I EDL P+ G    K M E  +++      +++ + + E     + SR S
Sbjct: 51  SQNCSCYHGVIEEDLTPFRGGISRKMMAEVVRRRLGTHYQIIKHRLFRED-DCMFPSRCS 110

Query: 141 LTVWGVVQLLRRYPGKLPDLDLMFSCEDRPEIYQKDYKGSQKPAPP--PLFRYSGDDATW 200
                +++++RR    LPD++++ +  D P++         K   P  P+F +S      
Sbjct: 111 GVEHFILEVIRR----LPDMEMVINVRDYPQV--------PKWMEPTIPVFSFSKTSEYH 170

Query: 201 DIVFPDWSFWG-----WP--EINIKPWEGILKDIKEGNKKMEWMKREPYAYWKG------ 260
           DI++P W+FW      WP     +  W+   +D+     +  W K+   AY++G      
Sbjct: 171 DIMYPAWTFWEGGPAVWPLYPTGLGRWDLFREDLLRSAAQWPWEKKNSTAYFRGSRTSPE 230

Query: 261 -NPTVAYTRRD--LLKCNVTHKQDWNARLYRQNWNKESKTGFKDSKLCNQCVYRYKIYIE 320
            +P +  +R++  L+    T  Q W +       +   K   KD  L + C Y+Y     
Sbjct: 231 RDPLILLSRKNPKLVDAEYTKNQAWKSMK-----DTLGKPAAKDVHLIDHCKYKYLFNFR 290

Query: 321 GKAWSVSEKYILACDSVSLIVRPHYYDFFTRSLIPMKHYWPISSNRKCSSIKFAVDWGNT 380
           G A S   K++  C S+   V   + +FF   L P  HY P+ ++   S ++  + +   
Sbjct: 291 GVAASFRFKHLFLCGSLVFHVGDEWVEFFYPQLKPWVHYIPVKTD--LSDVQELLQFVKA 350

Query: 381 HHQKAMAIGKAASKFIEEELKMEYIYDYMFHLLNQYSKLLTFKPT 408
           +   A  I K  S+FI   L+M+ I  Y  +LL +YSK L++  T
Sbjct: 351 NDDLAQEIAKRGSQFIINHLQMDDITCYWENLLTEYSKFLSYNVT 375

BLAST of Cla97C05G090050 vs. Swiss-Prot
Match: sp|B0X1Q4|RUMI_CULQU (O-glucosyltransferase rumi homolog OS=Culex quinquefasciatus OX=7176 GN=CPIJ013394 PE=3 SV=1)

HSP 1 Score: 112.5 bits (280), Expect = 1.4e-23
Identity = 88/343 (25.66%), Postives = 154/343 (44.90%), Query Frame = 0

Query: 81  SATCPEYFRWIHEDLRPWAGRGITKSMLEEAQKKAHFRLVVVEGKAYVEAYGKAYQSRDS 140
           S+ C  +   +  DLRP+   GIT+ ++E A+           G  Y     + ++ RD 
Sbjct: 68  SSNCSCHLDVLKTDLRPFRS-GITQDLIELARS---------YGTKYQIIGHRMFRQRDC 127

Query: 141 L---TVWGVVQLLRRYPGKLPDLDLMFSCEDRPEIYQKDYKGSQKPAPPPLFRYSGDDAT 200
           +      GV   +R    KLPD++L+ +C D P+I  + +  S++P   P+  +S  +  
Sbjct: 128 MFPARCSGVEHFIRPNLPKLPDMELIINCRDWPQI-SRHWNASREPL--PVLSFSKTNDY 187

Query: 201 WDIVFPDWSFW-GWPEINIKP-----WEGILKDIKEGNKKMEWMKREPYAYWKGNPT--- 260
            DI++P W FW G P I++ P     W+     +++  K   W K+   A+++G+ T   
Sbjct: 188 LDIMYPTWGFWEGGPAISLYPTGLGRWDQHRVSVRKAAKVWPWEKKLQQAFFRGSRTSDE 247

Query: 261 ------VAYTRRDLLKCNVTHKQDWNARLYRQNWNKESKTGFKDSKLCNQCVYRYKIYIE 320
                 ++  R +L+    T  Q W  R  +   + E     ++ +L + C Y+Y     
Sbjct: 248 RDPLVLLSRMRPELVDAQYTKNQAW--RSPKDTLHAEPA---QEVRLEDHCQYKYLFNFR 307

Query: 321 GKAWSVSEKYILACDSVSLIVRPHYYDFFTRSLIPMKHYWPISSNRKCSSIKFAVDWGNT 380
           G A S   K++  C S+   V   + +FF  SL P  HY P+        ++  + +   
Sbjct: 308 GVAASFRFKHLFLCKSLVFHVGQEWQEFFYDSLKPWVHYVPVPVGINEWELEHLIQFFRE 367

Query: 381 HHQKAMAIGKAASKFIEEELKMEYIYDYMFHLLNQYSKLLTFK 406
           H Q A  I     + I   L+ME +  Y   LL +Y KL+ ++
Sbjct: 368 HDQLAQEIANRGYEHIWNHLRMEDVECYWKRLLRRYGKLVKYE 392

BLAST of Cla97C05G090050 vs. TAIR10
Match: AT5G23850.1 (Arabidopsis thaliana protein of unknown function (DUF821))

HSP 1 Score: 569.3 bits (1466), Expect = 2.3e-162
Identity = 249/404 (61.63%), Postives = 317/404 (78.47%), Query Frame = 0

Query: 75  DPDHPISATCPEYFRWIHEDLRPWAGRGITKSMLEEAQKKAHFRLVVVEGKAYVEAYGKA 134
           D +HP +ATCP+YFRWIHEDLRPW+  GIT+  LE A+K A FRL +V GK YVE +  A
Sbjct: 130 DTNHPPTATCPDYFRWIHEDLRPWSRTGITREALERAKKTATFRLAIVGGKIYVEKFQDA 189

Query: 135 YQSRDSLTVWGVVQLLRRYPGKLPDLDLMFSCEDRPEIYQKDYKGSQKPAPPPLFRYSGD 194
           +Q+RD  T+WG +QLLR+YPGK+PDL+LMF C D P +   ++ G+  P+PPPLFRY G+
Sbjct: 190 FQTRDVFTIWGFLQLLRKYPGKIPDLELMFDCVDWPVVRATEFAGANAPSPPPLFRYCGN 249

Query: 195 DATWDIVFPDWSFWGWPEINIKPWEGILKDIKEGNKKMEWMKREPYAYWKGNPTVAYTRR 254
           + T DIVFPDWSFWGW E+NIKPWE +LK+++EGN++ +W+ REPYAYWKGNP VA TR+
Sbjct: 250 EETLDIVFPDWSFWGWAEVNIKPWESLLKELREGNERTKWINREPYAYWKGNPMVAETRQ 309

Query: 255 DLLKCNVTHKQDWNARLYRQNWNKESKTGFKDSKLCNQCVYRYKIYIEGKAWSVSEKYIL 314
           DL+KCNV+ + +WNARLY Q+W KESK G+K S L +QC +RYKIYIEG AWSVSEKYIL
Sbjct: 310 DLMKCNVSEEHEWNARLYAQDWIKESKEGYKQSDLASQCHHRYKIYIEGSAWSVSEKYIL 369

Query: 315 ACDSVSLIVRPHYYDFFTRSLIPMKHYWPISSNRKCSSIKFAVDWGNTHHQKAMAIGKAA 374
           ACDSV+L+V+PHYYDFFTR L+P  HYWP+  + KC SIKFAVDWGN+H QKA  IGKAA
Sbjct: 370 ACDSVTLLVKPHYYDFFTRGLLPAHHYWPVREHDKCRSIKFAVDWGNSHIQKAQDIGKAA 429

Query: 375 SKFIEEELKMEYIYDYMFHLLNQYSKLLTFKPTVPPNATELSSESMALAAEGSIRKSMME 434
           S FI+++LKM+Y+YDYM+HLL +YSKLL FKP +P NA E+ SE+MA    G+ RK M E
Sbjct: 430 SDFIQQDLKMDYVYDYMYHLLTEYSKLLQFKPEIPRNAVEICSETMACLRSGNERKFMTE 489

Query: 435 SAVTSPADSGPCALQAPYDPQTLQLLIRSKEDSIKQVERWERSF 479
           S V  PADSGPCA+  PYDP T   +++ K+ +  ++ +WE  +
Sbjct: 490 SLVKQPADSGPCAMPPPYDPATYYEVVKRKQSTNMRILQWEMKY 533

BLAST of Cla97C05G090050 vs. TAIR10
Match: AT3G48980.1 (Arabidopsis thaliana protein of unknown function (DUF821))

HSP 1 Score: 560.5 bits (1443), Expect = 1.0e-159
Identity = 246/410 (60.00%), Postives = 316/410 (77.07%), Query Frame = 0

Query: 69  QGETKKDPDHPISATCPEYFRWIHEDLRPWAGRGITKSMLEEAQKKAHFRLVVVEGKAYV 128
           +GE+ + P    SATCP+YFRWIHEDLRPW   GIT+  LE A   A FRL ++ G+ YV
Sbjct: 125 EGESDRSP----SATCPDYFRWIHEDLRPWEKTGITREALERANATAIFRLAIINGRIYV 184

Query: 129 EAYGKAYQSRDSLTVWGVVQLLRRYPGKLPDLDLMFSCEDRPEIYQKDYKGSQKPAPPPL 188
           E + +A+Q+RD  T+WG VQLLRRYPGK+PDL+LMF C D P +   ++ G  +P PPPL
Sbjct: 185 EKFREAFQTRDVFTIWGFVQLLRRYPGKIPDLELMFDCVDWPVVKAAEFAGVDQPPPPPL 244

Query: 189 FRYSGDDATWDIVFPDWSFWGWPEINIKPWEGILKDIKEGNKKMEWMKREPYAYWKGNPT 248
           FRY  +D T DIVFPDWS+WGW E+NIKPWE +LK+++EGN++ +W+ REPYAYWKGNPT
Sbjct: 245 FRYCANDETLDIVFPDWSYWGWAEVNIKPWESLLKELREGNQRTKWIDREPYAYWKGNPT 304

Query: 249 VAYTRRDLLKCNVTHKQDWNARLYRQNWNKESKTGFKDSKLCNQCVYRYKIYIEGKAWSV 308
           VA TR DL+KCN++   DW ARLY+Q+W KESK G+K S L +QC +RYKIYIEG AWSV
Sbjct: 305 VAETRLDLMKCNLSEVYDWKARLYKQDWVKESKEGYKQSDLASQCHHRYKIYIEGSAWSV 364

Query: 309 SEKYILACDSVSLIVRPHYYDFFTRSLIPMKHYWPISSNRKCSSIKFAVDWGNTHHQKAM 368
           SEKYILACDSV+L+V+PHYYDFFTR + P  HYWP+  + KC SIKFAVDWGN H +KA 
Sbjct: 365 SEKYILACDSVTLMVKPHYYDFFTRGMFPGHHYWPVKEDDKCRSIKFAVDWGNLHMRKAQ 424

Query: 369 AIGKAASKFIEEELKMEYIYDYMFHLLNQYSKLLTFKPTVPPNATELSSESMALAAEGSI 428
            IGK AS+F+++ELKM+Y+YDYMFHLL QYSKLL FKP +P N+TEL SE+MA   +G+ 
Sbjct: 425 DIGKKASEFVQQELKMDYVYDYMFHLLIQYSKLLRFKPEIPQNSTELCSEAMACPRDGNE 484

Query: 429 RKSMMESAVTSPADSGPCALQAPYDPQTLQLLIRSKEDSIKQVERWERSF 479
           RK MMES V  PA++GPCA+  PYDP +   +++ ++ +  ++E+WE  +
Sbjct: 485 RKFMMESLVKRPAETGPCAMPPPYDPASFYSVLKRRQSTTSRIEQWESKY 530

BLAST of Cla97C05G090050 vs. TAIR10
Match: AT2G45830.1 (downstream target of AGL15 2)

HSP 1 Score: 533.9 bits (1374), Expect = 1.1e-151
Identity = 255/513 (49.71%), Postives = 345/513 (67.25%), Query Frame = 0

Query: 2   MIRDDDSRRKFQKHLISAHKLLHFLNASRGSSVIISFAVVVLVGMFLSGRLLGLL----- 61
           M++    +R     L + +K ++   AS  +  I    + ++  +F+S  LL LL     
Sbjct: 1   MLQRKSMKRNNNNLLTNKNKYVYLKTASHPAKSIAKATLFLVTSLFISAGLLDLLGCFDF 60

Query: 62  ---LGLKS-------------------NVVNQQ----PQGETKKDPDHPIS-----ATCP 121
               GLK                     VV  Q    PQ  + ++ D P S     +TCP
Sbjct: 61  TTFTGLKQVTTSIRKSPITSQRFPNQCGVVQNQTQLFPQNGSSRNNDKPRSSHSRISTCP 120

Query: 122 EYFRWIHEDLRPWAGRGITKSMLEEAQKKAHFRLVVVEGKAYVEAYGKAYQSRDSLTVWG 181
            YFRWIHEDLRPW   G+T+ MLE+A++ AHFR+V+++G+ YV+ Y K+ Q+RD  T+WG
Sbjct: 121 SYFRWIHEDLRPWKETGVTRGMLEKARRTAHFRVVILDGRVYVKKYRKSIQTRDVFTLWG 180

Query: 182 VVQLLRRYPGKLPDLDLMFSCEDRPEIYQKDYKGSQKPAPPPLFRYSGDDATWDIVFPDW 241
           +VQLLR YPG+LPDL+LMF  +DRP +  KD++G Q PAPPPLFRY  DDA+ DIVFPDW
Sbjct: 181 IVQLLRWYPGRLPDLELMFDPDDRPTVRSKDFQGQQHPAPPPLFRYCSDDASLDIVFPDW 240

Query: 242 SFWGWPEINIKPWEGILKDIKEGNKKMEWMKREPYAYWKGNPTVAYTRRDLLKCNVTHKQ 301
           SFWGW E+NIKPW+  L  I+EGNK  +W  R  YAYW+GNP VA TRRDLL+CNV+ ++
Sbjct: 241 SFWGWAEVNIKPWDKSLVAIEEGNKMTQWKDRVAYAYWRGNPNVAPTRRDLLRCNVSAQE 300

Query: 302 DWNARLYRQNWNKESKTGFKDSKLCNQCVYRYKIYIEGKAWSVSEKYILACDSVSLIVRP 361
           DWN RLY Q+W++ES+ GFK+S L NQC +RYKIYIEG AWSVSEKYI+ACDS++L VRP
Sbjct: 301 DWNTRLYIQDWDRESREGFKNSNLENQCTHRYKIYIEGWAWSVSEKYIMACDSMTLYVRP 360

Query: 362 HYYDFFTRSLIPMKHYWPISSNRKCSSIKFAVDWGNTHHQKAMAIGKAASKFIEEELKME 421
            +YDF+ R ++P++HYWPI    KC+S+KFAV WGNTH  +A  IG+  S+FI EE+KME
Sbjct: 361 MFYDFYVRGMMPLQHYWPIRDTSKCTSLKFAVHWGNTHLDQASKIGEEGSRFIREEVKME 420

Query: 422 YIYDYMFHLLNQYSKLLTFKPTVPPNATELSSESMALAAEGSIRKSMMESAVTSPADSGP 479
           Y+YDYMFHL+N+Y+KLL FKP +P  ATE++ + M  +A G  R  M ES V  P++  P
Sbjct: 421 YVYDYMFHLMNEYAKLLKFKPEIPWGATEITPDIMGCSATGRWRDFMEESMVMFPSEESP 480

BLAST of Cla97C05G090050 vs. TAIR10
Match: AT3G61270.1 (Arabidopsis thaliana protein of unknown function (DUF821))

HSP 1 Score: 513.1 bits (1320), Expect = 1.9e-145
Identity = 243/464 (52.37%), Postives = 316/464 (68.10%), Query Frame = 0

Query: 34  VIISFAVVVLVGMFLSGRLLGLLLGLKS------------NVVNQQPQGETKKDPDHP-- 93
           + IS A++ L+G        GL L  K+            N  +Q P  + +K   +P  
Sbjct: 31  LFISAAILDLLGYLDFNAFAGLKLTTKTKEPNPYGCDFVQNQSSQTPISQNRKSRLNPNN 90

Query: 94  --ISATCPEYFRWIHEDLRPWAGRGITKSMLEEAQKKAHFRLVVVEGKAYVEAYGKAYQS 153
              S+TCP YFRWIHEDLRPW   GIT+ M+EEA + AHFRLV+  GKAYV+ Y K+ Q+
Sbjct: 91  SSKSSTCPSYFRWIHEDLRPWKQTGITRGMIEEASRTAHFRLVIRNGKAYVKRYKKSIQT 150

Query: 154 RDSLTVWGVVQLLRRYPGKLPDLDLMFSCEDRPEIYQKDYKGSQKPAPPPLFRYSGDDAT 213
           RD  T+WG++QLLR YPGKLPDL+LMF  +DRP +   D+ G QK  PPP+FRY  DDA+
Sbjct: 151 RDEFTLWGILQLLRWYPGKLPDLELMFDADDRPVVRSVDFIGQQK-EPPPVFRYCSDDAS 210

Query: 214 WDIVFPDWSFWGWPEINIKPWEGILKDIKEGNKKMEWMKREPYAYWKGNPTVAYTRRDLL 273
            DIVFPDWSFWGW E+N+KPW   L+ IKEGN   +W  R  YAYW+GNP V   R DLL
Sbjct: 211 LDIVFPDWSFWGWAEVNVKPWGKSLEAIKEGNSMTQWKDRVAYAYWRGNPYVDPGRGDLL 270

Query: 274 KCNVTHKQDWNARLYRQNWNKESKTGFKDSKLCNQCVYRYKIYIEGKAWSVSEKYILACD 333
           KCN T  ++WN RLY Q+W+KE+K GFK+S L NQC +RYKIYIEG AWSVSEKYI+ACD
Sbjct: 271 KCNATEHEEWNTRLYIQDWDKETKEGFKNSNLENQCTHRYKIYIEGWAWSVSEKYIMACD 330

Query: 334 SVSLIVRPHYYDFFTRSLIPMKHYWPISSNRKCSSIKFAVDWGNTHHQKAMAIGKAASKF 393
           S++L V+P +YDF+ R ++P++HYWPI  + KC+S+KFAV WGNTH  KA  IG+  S+F
Sbjct: 331 SMTLYVKPRFYDFYIRGMMPLQHYWPIRDDSKCTSLKFAVHWGNTHEDKAREIGEVGSRF 390

Query: 394 IEEELKMEYIYDYMFHLLNQYSKLLTFKPTVPPNATELSSESMALAAEGSIRKSMMESAV 453
           I EE+ M+Y+YDYMFHLL +Y+ LL FKP +P +A E++ +SM   A    R    ES +
Sbjct: 391 IREEVNMQYVYDYMFHLLKEYATLLKFKPEIPLDAEEITPDSMGCPATERWRDFKAESMI 450

Query: 454 TSPADSGPCALQAPYDPQTLQLLIRSKEDSIKQVERWERSFSQN 482
            SP++  PC +  PYDP  L+ ++  K +  +QVE WE  + QN
Sbjct: 451 ISPSEESPCEMLPPYDPLALKEVLERKANLTRQVELWENQYFQN 493

BLAST of Cla97C05G090050 vs. TAIR10
Match: AT1G63420.1 (Arabidopsis thaliana protein of unknown function (DUF821))

HSP 1 Score: 503.8 bits (1296), Expect = 1.2e-142
Identity = 233/407 (57.25%), Postives = 303/407 (74.45%), Query Frame = 0

Query: 83  TCPEYFRWIHEDLRPWAGRGITKSMLEEAQKKAHFRLVVVEGKAYVEAYGKAYQSRDSLT 142
           +CP+YF+WIHEDL+PW   GITK M+E  +  AHFRLV++ GK +VE Y K+ Q+RD+ T
Sbjct: 169 SCPDYFKWIHEDLKPWRETGITKEMVERGKTTAHFRLVILNGKVFVENYKKSIQTRDAFT 228

Query: 143 VWGVVQLLRRYPGKLPDLDLMFSCEDRPEIYQKDY---KGSQKPAPPPLFRYSGDDATWD 202
           +WG++QLLR+YPGKLPD+DLMF C+DRP I    Y     + + APPPLFRY GD  T D
Sbjct: 229 LWGILQLLRKYPGKLPDVDLMFDCDDRPVIRSDGYNILNRTVENAPPPLFRYCGDRWTVD 288

Query: 203 IVFPDWSFWGWPEINIKPWEGILKDIKEGNKKMEWMKREPYAYWKGNPTVAY-TRRDLLK 262
           IVFPDWSFWGW EINI+ W  +LK+++EG KK ++M+R+ YAYWKGNP VA  +R DLL 
Sbjct: 289 IVFPDWSFWGWQEINIREWSKVLKEMEEGKKKKKFMERDAYAYWKGNPFVASPSREDLLT 348

Query: 263 CNVTHKQDWNARLYRQNWNKESKTGFKDSKLCNQCVYRYKIYIEGKAWSVSEKYILACDS 322
           CN++   DWNAR++ Q+W  E + GF++S + NQC YRYKIYIEG AWSVSEKYILACDS
Sbjct: 349 CNLSSLHDWNARIFIQDWISEGQRGFENSNVANQCTYRYKIYIEGYAWSVSEKYILACDS 408

Query: 323 VSLIVRPHYYDFFTRSLIPMKHYWPISSNRKCSSIKFAVDWGNTHHQKAMAIGKAASKFI 382
           V+L+V+P+YYDFF+R+L P++HYWPI    KC SIKFAVDW N H QKA  IG+ AS+F+
Sbjct: 409 VTLMVKPYYYDFFSRTLQPLQHYWPIRDKDKCRSIKFAVDWLNNHTQKAQEIGREASEFM 468

Query: 383 EEELKMEYIYDYMFHLLNQYSKLLTFKPTVPPNATELSSESMALAAEGS-----IRKSMM 442
           + +L ME +YDYMFHLLN+YSKLL +KP VP N+ EL +E++   +EG       +K M+
Sbjct: 469 QRDLSMENVYDYMFHLLNEYSKLLKYKPQVPKNSVELCTEALVCPSEGEDVNGVDKKFMI 528

Query: 443 ESAVTSPADSGPCALQAPYDPQTLQLLIRSKEDSIKQVERWERSFSQ 481
            S V+ P  SGPC+L  P+D   L+   R K + I+QVE+WE S+ Q
Sbjct: 529 GSLVSRPHASGPCSLPPPFDSNGLEKFHRKKLNLIRQVEKWEDSYWQ 575

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_008439224.17.1e-24786.39PREDICTED: protein O-glucosyltransferase 1-like [Cucumis melo][more]
KGN57370.11.2e-23879.23hypothetical protein Csa_3G182060 [Cucumis sativus][more]
XP_022141173.11.1e-20471.46protein O-glucosyltransferase 1-like [Momordica charantia][more]
XP_011652774.11.0e-17688.27PREDICTED: protein O-glucosyltransferase 1-like [Cucumis sativus][more]
XP_023552264.11.4e-17368.47protein O-glucosyltransferase 1-like isoform X1 [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
tr|A0A1S3AYX9|A0A1S3AYX9_CUCME4.7e-24786.39protein O-glucosyltransferase 1-like OS=Cucumis melo OX=3656 GN=LOC103484074 PE=... [more]
tr|A0A0A0L5W0|A0A0A0L5W0_CUCSA8.0e-23979.23Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G182060 PE=4 SV=1[more]
tr|A0A0A0L5W3|A0A0A0L5W3_CUCSA1.5e-17368.14Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G182110 PE=4 SV=1[more]
tr|A0A1S3AYX8|A0A1S3AYX8_CUCME5.9e-17368.06O-glucosyltransferase rumi homolog OS=Cucumis melo OX=3656 GN=LOC103484079 PE=4 ... [more]
tr|A0A218X3H9|A0A218X3H9_PUNGR1.8e-16965.45Uncharacterized protein OS=Punica granatum OX=22663 GN=CDL15_Pgr022949 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
sp|Q5E9Q1|PGLT1_BOVIN5.0e-2626.53Protein O-glucosyltransferase 1 OS=Bos taurus OX=9913 GN=POGLUT1 PE=2 SV=1[more]
sp|Q8NBL1|PGLT1_HUMAN1.9e-2526.24Protein O-glucosyltransferase 1 OS=Homo sapiens OX=9606 GN=POGLUT1 PE=1 SV=1[more]
sp|Q8BYB9|PGLT1_MOUSE8.0e-2426.09Protein O-glucosyltransferase 1 OS=Mus musculus OX=10090 GN=Poglut1 PE=1 SV=2[more]
sp|G3V9D0|PGLT1_RAT1.4e-2325.51Protein O-glucosyltransferase 1 OS=Rattus norvegicus OX=10116 GN=Poglut1 PE=3 SV... [more]
sp|B0X1Q4|RUMI_CULQU1.4e-2325.66O-glucosyltransferase rumi homolog OS=Culex quinquefasciatus OX=7176 GN=CPIJ0133... [more]
Match NameE-valueIdentityDescription
AT5G23850.12.3e-16261.63Arabidopsis thaliana protein of unknown function (DUF821)[more]
AT3G48980.11.0e-15960.00Arabidopsis thaliana protein of unknown function (DUF821)[more]
AT2G45830.11.1e-15149.71downstream target of AGL15 2[more]
AT3G61270.11.9e-14552.37Arabidopsis thaliana protein of unknown function (DUF821)[more]
AT1G63420.11.2e-14257.25Arabidopsis thaliana protein of unknown function (DUF821)[more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR006598LipoPS_modifying
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0016020 membrane
molecular_function GO:0016740 transferase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C05G090050.1Cla97C05G090050.1mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR006598Lipopolysaccharide-modifying proteinSMARTSM00672cap10coord: 156..405
e-value: 1.4E-137
score: 473.0
IPR006598Lipopolysaccharide-modifying proteinPFAMPF05686Glyco_transf_90coord: 82..475
e-value: 2.7E-182
score: 606.0
NoneNo IPR availablePANTHERPTHR12203:SF59GLYCOSYLTRANSFERASEcoord: 76..481
NoneNo IPR availablePANTHERPTHR12203KDEL LYS-ASP-GLU-LEU CONTAINING - RELATEDcoord: 76..481

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
Cla97C05G090050Watermelon (97103) v2wmbwmbB023
Cla97C05G090050Watermelon (97103) v2wmbwmbB130
Cla97C05G090050Silver-seed gourdcarwmbB0743
Cla97C05G090050Silver-seed gourdcarwmbB0772
Cla97C05G090050Silver-seed gourdcarwmbB0969
Cla97C05G090050Silver-seed gourdcarwmbB1042
Cla97C05G090050Cucumber (Gy14) v2cgybwmbB196
Cla97C05G090050Cucumber (Gy14) v2cgybwmbB367
Cla97C05G090050Cucumber (Gy14) v1cgywmbB147
Cla97C05G090050Cucumber (Gy14) v1cgywmbB522
Cla97C05G090050Cucurbita maxima (Rimu)cmawmbB346
Cla97C05G090050Cucurbita maxima (Rimu)cmawmbB424
Cla97C05G090050Cucurbita maxima (Rimu)cmawmbB649
Cla97C05G090050Cucurbita maxima (Rimu)cmawmbB651
Cla97C05G090050Cucurbita moschata (Rifu)cmowmbB331
Cla97C05G090050Cucurbita moschata (Rifu)cmowmbB404
Cla97C05G090050Cucurbita moschata (Rifu)cmowmbB621
Cla97C05G090050Cucurbita moschata (Rifu)cmowmbB624
Cla97C05G090050Wild cucumber (PI 183967)cpiwmbB209
Cla97C05G090050Wild cucumber (PI 183967)cpiwmbB404
Cla97C05G090050Cucumber (Chinese Long) v3cucwmbB206
Cla97C05G090050Cucumber (Chinese Long) v3cucwmbB400
Cla97C05G090050Cucumber (Chinese Long) v2cuwmbB204
Cla97C05G090050Cucumber (Chinese Long) v2cuwmbB386
Cla97C05G090050Bottle gourd (USVL1VR-Ls)lsiwmbB022
Cla97C05G090050Bottle gourd (USVL1VR-Ls)lsiwmbB334
Cla97C05G090050Melon (DHL92) v3.6.1medwmbB021
Cla97C05G090050Melon (DHL92) v3.6.1medwmbB423
Cla97C05G090050Melon (DHL92) v3.5.1mewmbB024
Cla97C05G090050Melon (DHL92) v3.5.1mewmbB434
Cla97C05G090050Watermelon (Charleston Gray)wcgwmbB229
Cla97C05G090050Watermelon (97103) v1wmwmbB206
Cla97C05G090050Wax gourdwgowmbB075
Cla97C05G090050Wax gourdwgowmbB192