CsaV3_1G036190.1 (mRNA) Cucumber (Chinese Long) v3

Overview
Name: CsaV3_1G036190.1
Type: mRNA
Organism: Cucumis sativus L. var. sativus cv. Chinese Long (Cucumber (Chinese Long) v3)
Description: O-glucosyltransferase rumi homolog
Location: chr1: 22157915 .. 22160970 (+)
Sequence length: 1419
RNA-Seq Expression: CsaV3_1G036190.1
Synteny: CsaV3_1G036190.1
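The location and sequence-length fields above can be cross-checked with a little arithmetic. The sketch below is a minimal illustration and not part of the database record: `parse_location` is a hypothetical helper, and it assumes the coordinates are 1-based and inclusive, which is the usual convention for this kind of record but is not stated on the page.

```python
import re

def parse_location(text):
    """Parse a string like 'chr1: 22157915 .. 22160970 (+)'.

    Hypothetical helper; returns (chrom, start, end, strand).
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("chr1: 22157915 .. 22160970 (+)")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
genomic_span = end - start + 1

# The reported sequence length (1419 nt) should be a whole number of codons.
sequence_length = 1419
codons, remainder = divmod(sequence_length, 3)

# One codon is the stop codon, so the protein has one residue fewer.
residues = codons - 1

print(chrom, strand, genomic_span, codons, remainder, residues)
```

Under these assumptions the transcript's 1419 nt correspond to 473 codons, that is 472 residues plus a stop codon, which agrees with the 472-residue alignments in the homology section.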
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTCCGGCACCCAGACCTCCCTCCCACCTCCTCCCCTCCGTCGTCGCCATCTGCTTCCTCTCCCTCACTTTCCTCCTTTGTTACAAGGTCTCTATAACTCTATTTCTTCATTGCTGCATTTCTTAGACACCATTTTCTTATACTTACTTAGTTCTATTTCCTATTTCTTTGCCCACTTTCAGGTAGATGATTTCGCTGCTCAAACCAAAACTGTTGCTGGTCACAACTTGGATCCAACTCCATGGCATTTGTTCCCTCCCAAGACATTCAGTGATGAGACTCGCCATGCCAGAGCTGTTAAGATCATCCACTGTTCTTACCTCACCTGTCGCTATGCCACCAACAATGCCACTAAATTCCCTTTCCATTCCGCTGTATCAGCTCCCAAATGTCCTGAATTCTTCCGGTGGATTCATCACGATCTGGATCCTTGGGCTCGTACTCGAATCTCGATGACCCAGTTGGAAGAATCTCAGAAATTTGCGGCGTTTCGTGTAGTGATCGTGGAAGGTAGGCTTTATGTTGATATGTACTATGCTTGTGTGCAGAGCAGGGCGATTTTCACGATCTGGGGTTTGGTTCAAATGCTTAGAAGGTACCCTGGAATGGTGCCGGATGTGGATATGATGTTTGATTGTATGGATAAACCGAGTATCAATCGGACTGAGAATAAGGCCATGCCGCTGCCTCTGTTTCGGTATTGCACGACGGAGGCTCACTTCGACATTCCTTTTCCTGATTGGTCTTTCTGGGGATGGTATGTATGGCATCGAATTAAGATCCCTTCTCTGCTTTCATAGTTTGTTCTGTCCCATCCTTGAATTGCAACTTTTTCACCTAAAGTCTTGGCTTTCATTTTTCTTCCAACAGTTGTATGTTATGTGATGAGGATTAGGAAACTAATTATGTTATATTTTTCATCAATGGATGTGGGGATGTTATGCAGGCCAGAAGTGAACTTAAGATCATGGAGGGAAGAGTTTGAAGATATAAAGAAAGGGTCGAAAAATTTAAGTTGGTTCAACAAATTTCCTCGAGCTTATTGGAAGGGAAATCCAGATGTTGATTCACCTGCTCGTGAAGAGTTGCTGAAATGCAATCACTCAAGAATGTGGGGAGCTCAGATCATGCGTCAGGTTCAATCTGGAATCCTCAAAGCTCTTTCTTTCTTTCTATGAATTTCTTTTGTATTATACGCAAATCACTTGTTGTTGAGCTGGGAAAACTCAATGTGCTTCATGGCAGGACTGGGCACAAGAAGCAAAAGATGGTTATGAGCAGTCTAAGCTATCCAACCAATGCAACCACCGGTGAGGATGGCTCTTTCCTTTGCGGATCCGAACTTGAGATTTTAATTGAATGGAATTGGTTGGCTTGGTTTATGTTTTGTAAAACCAAAATAAACCGGACCAAACAAGAAAATCAATTACCTCATAGAAAAAAGAGGGTAAAAACCAACATTTATTTAGCTCAAATTACTCCAATGTTTGAATGCTTAATTTAATTTAGGGCTGAAAATAGGTAATAAAAAATTATGGCAACCTGAATCAAGAAAATTAAATCAAATTGATTGGTTTGATTAGAAGGAATTGATTAGAATAGTTGGTTCGGTTCATGGTCTAATAAGAGCAAACAAAACCAAACCCTAAACTCCTCAAAGGAGGTTTGGCCCACGGACTTAAGTCAGTCTTAAACAACTCACGTTTATACCTTTATTATTGTTGTGTGTTAACTGGCTAGCTGAGGCCACCGTCTCTATCTGTCTCTGCAGGTATAAAATCTATGCCGAAGGGTTTGCTTGGTCTGTGAGCTTGAAGTACATTCTTTCATGTGGTTCAATGTCTTTGATTATTTCACCTCAATATGAAGATTTCTTCAGCCGTGGTCTTGATCCTTTGAAGAACTATTGGCCCATCCCCTTCACTAACATGTGTGAGTCTATTAAGCATGCTGTTGACTGGGGAAATACTCATTTCCCTGAGGTATATAATTCTTAA
CTTCCTTATCGTAATTCTTTTTCTTGGTTCATGTTTATTACTATAAAATCTCTCTTCTTCCTACAACAGTTCCATATAGAGAAGAATATAGGTTTGTTTGAACCTGTGAATTTGGAAGTTGGTGATTGTTTTTGTTTCTCAACACTGAACTCGAAGGTGTATGATGTTAATTTGACTAACTTATGAAAGCTACCTTTACTCTGAGTACCAAATTTTACAATTTATTATATATTAACTCTAAAAGTTTAAGCCGCTGAAATATGTTTGGTTCATAAATTCGATATGAACAATGAAAGGAAAATGCTGCTTTCATGTATAACAAACTTACAGCAATATATTAAAATCATGAACCAAATCTGCCCATGGAATATGTTAACTTTTCTCTTCCATACGTTCTAAAGAAGGATTCGTTGAAAACAGGCCGAGACTATAGGACGGCAGGGACAGAAATTCATGGAGAACTTGAGCATGGATACAGTCTATTCTTACATGTTTCACCTCATCACAGAATACTCAAAGCTTCAGGACTTCAAGCCAACCCCGCCGCCATCGGCTTTAGAAGTATGTACTGATTCCTTGCTTTGCATTGCGGACGAGAAGCAGATGCAGTTCCTTGAGAAGTCAGCTGCCTCGGTTTCGTCAGTCCCTCCGTGCTCACTCAACCGTGGTGGTAGTGATATCATTTATAGTTGGCTGCAGCAAAAGTAGAGGAGGAAGGCGATGTAGGAGGAAGAAATGGCTGCACAAAGAGCCTCAAAGTAGAAGTTATGTATGTTTTTTTTTGTTCCACTTTAGATTATCGAGGAGAGTAGTGTTTATGCTATAGTGATTTAATTAAAGTAATTTTACGCAAAAAAAATATGTCTAATACTTTGTGTGAATCTGACCAAAGTTCTACATTGGTTAGATAAAAAGATGAGCTTAGATATACAAGTGAGAACATCTATCTCTTTTGGGATGATACTAAAAACAAAGTCATGAGAGTGTATACCGAAAGTGAACAATATCATACTATTGTAGAGACTCTGATCAGTCTATCGAATGAGCAAAGAGCTT

mRNA sequence

ATGGCTCCGGCACCCAGACCTCCCTCCCACCTCCTCCCCTCCGTCGTCGCCATCTGCTTCCTCTCCCTCACTTTCCTCCTTTGTTACAAGGTAGATGATTTCGCTGCTCAAACCAAAACTGTTGCTGGTCACAACTTGGATCCAACTCCATGGCATTTGTTCCCTCCCAAGACATTCAGTGATGAGACTCGCCATGCCAGAGCTGTTAAGATCATCCACTGTTCTTACCTCACCTGTCGCTATGCCACCAACAATGCCACTAAATTCCCTTTCCATTCCGCTGTATCAGCTCCCAAATGTCCTGAATTCTTCCGGTGGATTCATCACGATCTGGATCCTTGGGCTCGTACTCGAATCTCGATGACCCAGTTGGAAGAATCTCAGAAATTTGCGGCGTTTCGTGTAGTGATCGTGGAAGGTAGGCTTTATGTTGATATGTACTATGCTTGTGTGCAGAGCAGGGCGATTTTCACGATCTGGGGTTTGGTTCAAATGCTTAGAAGGTACCCTGGAATGGTGCCGGATGTGGATATGATGTTTGATTGTATGGATAAACCGAGTATCAATCGGACTGAGAATAAGGCCATGCCGCTGCCTCTGTTTCGGTATTGCACGACGGAGGCTCACTTCGACATTCCTTTTCCTGATTGGTCTTTCTGGGGATGGCCAGAAGTGAACTTAAGATCATGGAGGGAAGAGTTTGAAGATATAAAGAAAGGGTCGAAAAATTTAAGTTGGTTCAACAAATTTCCTCGAGCTTATTGGAAGGGAAATCCAGATGTTGATTCACCTGCTCGTGAAGAGTTGCTGAAATGCAATCACTCAAGAATGTGGGGAGCTCAGATCATGCGTCAGGACTGGGCACAAGAAGCAAAAGATGGTTATGAGCAGTCTAAGCTATCCAACCAATGCAACCACCGGTATAAAATCTATGCCGAAGGGTTTGCTTGGTCTGTGAGCTTGAAGTACATTCTTTCATGTGGTTCAATGTCTTTGATTATTTCACCTCAATATGAAGATTTCTTCAGCCGTGGTCTTGATCCTTTGAAGAACTATTGGCCCATCCCCTTCACTAACATGTGTGAGTCTATTAAGCATGCTGTTGACTGGGGAAATACTCATTTCCCTGAGGCCGAGACTATAGGACGGCAGGGACAGAAATTCATGGAGAACTTGAGCATGGATACAGTCTATTCTTACATGTTTCACCTCATCACAGAATACTCAAAGCTTCAGGACTTCAAGCCAACCCCGCCGCCATCGGCTTTAGAAGTATGTACTGATTCCTTGCTTTGCATTGCGGACGAGAAGCAGATGCAGTTCCTTGAGAAGTCAGCTGCCTCGGTTTCGTCAGTCCCTCCGTGCTCACTCAACCGTGGTGGTAGTGATATCATTTATAGTTGGCTGCAGCAAAAGTAG

Coding sequence (CDS)

ATGGCTCCGGCACCCAGACCTCCCTCCCACCTCCTCCCCTCCGTCGTCGCCATCTGCTTCCTCTCCCTCACTTTCCTCCTTTGTTACAAGGTAGATGATTTCGCTGCTCAAACCAAAACTGTTGCTGGTCACAACTTGGATCCAACTCCATGGCATTTGTTCCCTCCCAAGACATTCAGTGATGAGACTCGCCATGCCAGAGCTGTTAAGATCATCCACTGTTCTTACCTCACCTGTCGCTATGCCACCAACAATGCCACTAAATTCCCTTTCCATTCCGCTGTATCAGCTCCCAAATGTCCTGAATTCTTCCGGTGGATTCATCACGATCTGGATCCTTGGGCTCGTACTCGAATCTCGATGACCCAGTTGGAAGAATCTCAGAAATTTGCGGCGTTTCGTGTAGTGATCGTGGAAGGTAGGCTTTATGTTGATATGTACTATGCTTGTGTGCAGAGCAGGGCGATTTTCACGATCTGGGGTTTGGTTCAAATGCTTAGAAGGTACCCTGGAATGGTGCCGGATGTGGATATGATGTTTGATTGTATGGATAAACCGAGTATCAATCGGACTGAGAATAAGGCCATGCCGCTGCCTCTGTTTCGGTATTGCACGACGGAGGCTCACTTCGACATTCCTTTTCCTGATTGGTCTTTCTGGGGATGGCCAGAAGTGAACTTAAGATCATGGAGGGAAGAGTTTGAAGATATAAAGAAAGGGTCGAAAAATTTAAGTTGGTTCAACAAATTTCCTCGAGCTTATTGGAAGGGAAATCCAGATGTTGATTCACCTGCTCGTGAAGAGTTGCTGAAATGCAATCACTCAAGAATGTGGGGAGCTCAGATCATGCGTCAGGACTGGGCACAAGAAGCAAAAGATGGTTATGAGCAGTCTAAGCTATCCAACCAATGCAACCACCGGTATAAAATCTATGCCGAAGGGTTTGCTTGGTCTGTGAGCTTGAAGTACATTCTTTCATGTGGTTCAATGTCTTTGATTATTTCACCTCAATATGAAGATTTCTTCAGCCGTGGTCTTGATCCTTTGAAGAACTATTGGCCCATCCCCTTCACTAACATGTGTGAGTCTATTAAGCATGCTGTTGACTGGGGAAATACTCATTTCCCTGAGGCCGAGACTATAGGACGGCAGGGACAGAAATTCATGGAGAACTTGAGCATGGATACAGTCTATTCTTACATGTTTCACCTCATCACAGAATACTCAAAGCTTCAGGACTTCAAGCCAACCCCGCCGCCATCGGCTTTAGAAGTATGTACTGATTCCTTGCTTTGCATTGCGGACGAGAAGCAGATGCAGTTCCTTGAGAAGTCAGCTGCCTCGGTTTCGTCAGTCCCTCCGTGCTCACTCAACCGTGGTGGTAGTGATATCATTTATAGTTGGCTGCAGCAAAAGTAG

Protein sequence

MAPAPRPPSHLLPSVVAICFLSLTFLLCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKTFSDETRHARAVKIIHCSYLTCRYATNNATKFPFHSAVSAPKCPEFFRWIHHDLDPWARTRISMTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQMLRRYPGMVPDVDMMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFWGWPEVNLRSWREEFEDIKKGSKNLSWFNKFPRAYWKGNPDVDSPAREELLKCNHSRMWGAQIMRQDWAQEAKDGYEQSKLSNQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFTNMCESIKHAVDWGNTHFPEAETIGRQGQKFMENLSMDTVYSYMFHLITEYSKLQDFKPTPPPSALEVCTDSLLCIADEKQMQFLEKSAASVSSVPPCSLNRGGSDIIYSWLQQK*
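The protein sequence above is the conceptual translation of the coding sequence. As a minimal sketch of that relationship, the snippet below translates short fragments of the CDS with the standard genetic code. The `translate` function and the table construction are illustrative assumptions, not the pipeline the database used; only the first 30 and last 12 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDDDEEEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 30 nt and last 12 nt of the CDS shown above.
cds_start = "ATGGCTCCGGCACCCAGACCTCCCTCCCAC"
cds_end = "CAGCAAAAGTAG"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first ten residues and the final residues (including the stop) of the protein sequence listed above. Passing the full CDS to `translate` should reproduce the whole protein in the same way.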
Homology
BLAST of CsaV3_1G036190.1 vs. NCBI nr
Match: XP_004145318.2 (O-glucosyltransferase rumi homolog [Cucumis sativus] >KAE8653283.1 hypothetical protein Csa_023214 [Cucumis sativus])

HSP 1 Score: 1015.4 bits (2624), Expect = 1.6e-292
Identity = 472/472 (100.00%), Positives = 472/472 (100.00%), Query Frame = 0

Query: 1   MAPAPRPPSHLLPSVVAICFLSLTFLLCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKTFS 60
           MAPAPRPPSHLLPSVVAICFLSLTFLLCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKTFS
Sbjct: 1   MAPAPRPPSHLLPSVVAICFLSLTFLLCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKTFS 60

Query: 61  DETRHARAVKIIHCSYLTCRYATNNATKFPFHSAVSAPKCPEFFRWIHHDLDPWARTRIS 120
           DETRHARAVKIIHCSYLTCRYATNNATKFPFHSAVSAPKCPEFFRWIHHDLDPWARTRIS
Sbjct: 61  DETRHARAVKIIHCSYLTCRYATNNATKFPFHSAVSAPKCPEFFRWIHHDLDPWARTRIS 120

Query: 121 MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQMLRRYPGMVPDVDMMF 180
           MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQMLRRYPGMVPDVDMMF
Sbjct: 121 MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQMLRRYPGMVPDVDMMF 180

Query: 181 DCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFWGWPEVNLRSWREEFEDIKKG 240
           DCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFWGWPEVNLRSWREEFEDIKKG
Sbjct: 181 DCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFWGWPEVNLRSWREEFEDIKKG 240

Query: 241 SKNLSWFNKFPRAYWKGNPDVDSPAREELLKCNHSRMWGAQIMRQDWAQEAKDGYEQSKL 300
           SKNLSWFNKFPRAYWKGNPDVDSPAREELLKCNHSRMWGAQIMRQDWAQEAKDGYEQSKL
Sbjct: 241 SKNLSWFNKFPRAYWKGNPDVDSPAREELLKCNHSRMWGAQIMRQDWAQEAKDGYEQSKL 300

Query: 301 SNQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFTNM 360
           SNQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFTNM
Sbjct: 301 SNQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFTNM 360

Query: 361 CESIKHAVDWGNTHFPEAETIGRQGQKFMENLSMDTVYSYMFHLITEYSKLQDFKPTPPP 420
           CESIKHAVDWGNTHFPEAETIGRQGQKFMENLSMDTVYSYMFHLITEYSKLQDFKPTPPP
Sbjct: 361 CESIKHAVDWGNTHFPEAETIGRQGQKFMENLSMDTVYSYMFHLITEYSKLQDFKPTPPP 420

Query: 421 SALEVCTDSLLCIADEKQMQFLEKSAASVSSVPPCSLNRGGSDIIYSWLQQK 473
           SALEVCTDSLLCIADEKQMQFLEKSAASVSSVPPCSLNRGGSDIIYSWLQQK
Sbjct: 421 SALEVCTDSLLCIADEKQMQFLEKSAASVSSVPPCSLNRGGSDIIYSWLQQK 472

BLAST of CsaV3_1G036190.1 vs. NCBI nr
Match: XP_008457372.1 (PREDICTED: O-glucosyltransferase rumi homolog [Cucumis melo])

HSP 1 Score: 976.9 bits (2524), Expect = 6.4e-281
Identity = 452/471 (95.97%), Positives = 461/471 (97.88%), Query Frame = 0

Query: 1   MAPAPRPPSHLLPSVVAICFLSLTFLLCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKTFS 60
           MAPAPRPPSHLLP+VVAI FLSLTFLLCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKTF+
Sbjct: 1   MAPAPRPPSHLLPAVVAISFLSLTFLLCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKTFN 60

Query: 61  DETRHARAVKIIHCSYLTCRYATNNATKFPFHSAVSAPKCPEFFRWIHHDLDPWARTRIS 120
           DETRHARAVKIIHCSYLTCRY TNNATKFPFHSAVSAPKCPEFFRWIHHDLDPWA+TRIS
Sbjct: 61  DETRHARAVKIIHCSYLTCRYVTNNATKFPFHSAVSAPKCPEFFRWIHHDLDPWAQTRIS 120

Query: 121 MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQMLRRYPGMVPDVDMMF 180
           MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQ+LRR+PGMVPDVDMMF
Sbjct: 121 MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDMMF 180

Query: 181 DCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFWGWPEVNLRSWREEFEDIKKG 240
           DCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFWGWPEVNLRSWREEFEDIKKG
Sbjct: 181 DCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFWGWPEVNLRSWREEFEDIKKG 240

Query: 241 SKNLSWFNKFPRAYWKGNPDVDSPAREELLKCNHSRMWGAQIMRQDWAQEAKDGYEQSKL 300
           SKNLSW NKFPRAYWKGNPDVDSPAR ELLKCNHSR WGAQIMRQDWAQEA+DGYEQSKL
Sbjct: 241 SKNLSWLNKFPRAYWKGNPDVDSPARTELLKCNHSRKWGAQIMRQDWAQEARDGYEQSKL 300

Query: 301 SNQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFTNM 360
           SNQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPF+NM
Sbjct: 301 SNQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFSNM 360

Query: 361 CESIKHAVDWGNTHFPEAETIGRQGQKFMENLSMDTVYSYMFHLITEYSKLQDFKPTPPP 420
           CESIKHAVDWGNTHFPEAETIG+QGQ FME+LSMDTVYSYMFHLITEYSKL DFKPTPPP
Sbjct: 361 CESIKHAVDWGNTHFPEAETIGQQGQNFMESLSMDTVYSYMFHLITEYSKLLDFKPTPPP 420

Query: 421 SALEVCTDSLLCIADEKQMQFLEKSAASVSSVPPCSLNRGGSDIIYSWLQQ 472
           SALEVC DSLLCIADEKQ QFLEKSAASVSSVPPCSLNR GSDIIYSWLQQ
Sbjct: 421 SALEVCADSLLCIADEKQRQFLEKSAASVSSVPPCSLNRAGSDIIYSWLQQ 471
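The identity and positives percentages in each HSP summary are the number of identical (or conservatively substituted) positions divided by the alignment length. The sketch below recomputes them for the two hits above; `hsp_percentages` and its regular expression are hypothetical helpers written for this page's summary-line layout, not part of any BLAST tooling.

```python
import re

# Matches the "Identity = a/b (x%), Positives = c/d (y%)" summary line.
# The optional "i" also tolerates the misspelling "Postives" seen in some exports.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, length_a, positive, length_b = map(int, m.groups())
    return 100.0 * identical / length_a, 100.0 * positive / length_b

summaries = [
    "Identity = 472/472 (100.00%), Positives = 472/472 (100.00%), Query Frame = 0",
    "Identity = 452/471 (95.97%), Positives = 461/471 (97.88%), Query Frame = 0",
]

for line in summaries:
    identity, positives = hsp_percentages(line)
    print(f"identity {identity:.2f}%  positives {positives:.2f}%")
```

The recomputed values agree with the percentages printed in the summaries to two decimal places, so the same function can be used to sanity-check the remaining hits in this section.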

BLAST of CsaV3_1G036190.1 vs. NCBI nr
Match: XP_038893964.1 (O-glucosyltransferase rumi homolog [Benincasa hispida])

HSP 1 Score: 943.3 bits (2437), Expect = 7.8e-271
Identity = 438/477 (91.82%), Positives = 449/477 (94.13%), Query Frame = 0

Query: 1   MAPAPRP-----PSHLLPSVVAICFLSLTFLLCYKVDDFAAQTKTVAGHNLDPTPWHLFP 60
           MAP PRP     PSHLLPSV AI FLSLTFLLCYKVDDFAAQTKTVAGHNLDPTPWHLFP
Sbjct: 1   MAPPPRPSSSRSPSHLLPSVFAISFLSLTFLLCYKVDDFAAQTKTVAGHNLDPTPWHLFP 60

Query: 61  PKTFSDETRHARAVKIIHCSYLTCRYATNNATKFPFHSAVSAPKCPEFFRWIHHDLDPWA 120
           PKTFSDETRHAR VKIIHCSYL CRY TNNATKFP HSAVSAPKCPEFFRW+HHDLDPWA
Sbjct: 61  PKTFSDETRHARTVKIIHCSYLACRYVTNNATKFPLHSAVSAPKCPEFFRWVHHDLDPWA 120

Query: 121 RTRISMTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQMLRRYPGMVPD 180
           RTRISMT L+E+QKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQ+LRR+PGMVPD
Sbjct: 121 RTRISMTHLDEAQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPD 180

Query: 181 VDMMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFWGWPEVNLRSWREEFE 240
           VDMMFDCMDKPSINRTENK MPLPLFRYCTTEAHFDIPFPDWSFWGWPEVNLRSWREEFE
Sbjct: 181 VDMMFDCMDKPSINRTENKDMPLPLFRYCTTEAHFDIPFPDWSFWGWPEVNLRSWREEFE 240

Query: 241 DIKKGSKNLSWFNKFPRAYWKGNPDVDSPAREELLKCNHSRMWGAQIMRQDWAQEAKDGY 300
           DIKKGSKNLSW +K+PRAYWKGNPDVDSPAR ELL CNHSR WGAQIMRQDW QEAK G+
Sbjct: 241 DIKKGSKNLSWSDKYPRAYWKGNPDVDSPARTELLNCNHSRKWGAQIMRQDWEQEAKAGF 300

Query: 301 EQSKLSNQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPI 360
           EQSKLSNQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPI
Sbjct: 301 EQSKLSNQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPI 360

Query: 361 PFTNMCESIKHAVDWGNTHFPEAETIGRQGQKFMENLSMDTVYSYMFHLITEYSKLQDFK 420
           PFTNMCESIKHAVDWGNTH PEAE IGRQ Q FME+L+MDTVYSYMFHLITEYSKL DF+
Sbjct: 361 PFTNMCESIKHAVDWGNTHLPEAEVIGRQAQNFMESLNMDTVYSYMFHLITEYSKLLDFR 420

Query: 421 PTPPPSALEVCTDSLLCIADEKQMQFLEKSAASVSSVPPCSLNRGGSDIIYSWLQQK 473
           PTPPPSALEVC DSLLCIADEKQ QFLEKSAASVSSVPPCSLNR GSDIIYSWLQQK
Sbjct: 421 PTPPPSALEVCADSLLCIADEKQRQFLEKSAASVSSVPPCSLNRAGSDIIYSWLQQK 477

BLAST of CsaV3_1G036190.1 vs. NCBI nr
Match: KAG6583363.1 (O-glucosyltransferase rumi-like protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 865.9 bits (2236), Expect = 1.6e-247
Identity = 395/471 (83.86%), Positives = 427/471 (90.66%), Query Frame = 0

Query: 2   APAPRPPSHLLPSVVAICFLSLTFLLCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKTFSD 61
           AP+ R PS++LPSVVA+ FL+LTFL+CYKVDDFAAQTKTVAGHNLDPTPWHLFPPK FS+
Sbjct: 12  APSSRTPSNILPSVVALSFLALTFLICYKVDDFAAQTKTVAGHNLDPTPWHLFPPKIFSE 71

Query: 62  ETRHARAVKIIHCSYLTCRYATNNATKFPFHSAVSAPKCPEFFRWIHHDLDPWARTRISM 121
           +TRHAR VKIIHCSYL CRYA N AT+ P HSAVS  +CPE FRWIHHDLDPWAR+RISM
Sbjct: 72  DTRHARTVKIIHCSYLACRYANNTATRLPLHSAVSTHQCPEIFRWIHHDLDPWARSRISM 131

Query: 122 TQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQMLRRYPGMVPDVDMMFD 181
             L+ES+KFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQ+LRR+PGMVPDVDMMFD
Sbjct: 132 EHLDESKKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDMMFD 191

Query: 182 CMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFWGWPEVNLRSWREEFEDIKKGS 241
           CMD+P+INRTENK MPLPLFRYCTT+AHFDIPFPDWSFWGWPEVN+RSW EEF+DIKK S
Sbjct: 192 CMDRPTINRTENKDMPLPLFRYCTTDAHFDIPFPDWSFWGWPEVNIRSWGEEFKDIKKSS 251

Query: 242 KNLSWFNKFPRAYWKGNPDVDSPAREELLKCNHSRMWGAQIMRQDWAQEAKDGYEQSKLS 301
           K+ +W +K PRAYWKGNPDV SP R ELL CNHS  WGAQIMRQDW QEA+DG+EQSKLS
Sbjct: 252 KSSNWSSKLPRAYWKGNPDVASPVRTELLTCNHSIKWGAQIMRQDWDQEARDGFEQSKLS 311

Query: 302 NQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFTNMC 361
           NQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISP Y+DFFSRGLDPLKNYWPIPF NMC
Sbjct: 312 NQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPLYQDFFSRGLDPLKNYWPIPFDNMC 371

Query: 362 ESIKHAVDWGNTHFPEAETIGRQGQKFMENLSMDTVYSYMFHLITEYSKLQDFKPTPPPS 421
           ESIKHAVDWGN H  EAE IG+QGQ FME+LSMDTVY+YMF LITEYSKL DFKPTPPPS
Sbjct: 372 ESIKHAVDWGNDHLSEAEAIGQQGQNFMESLSMDTVYAYMFQLITEYSKLLDFKPTPPPS 431

Query: 422 ALEVCTDSLLCIADEKQMQFLEKSAASVSSVPPCSLNRGGSDIIYSWLQQK 473
           ALEVC +SLLCIADEKQ QFLEKSA S S VPPCSLNR GSD +YSWLQQ+
Sbjct: 432 ALEVCPESLLCIADEKQRQFLEKSATSASLVPPCSLNRAGSDSVYSWLQQE 482

BLAST of CsaV3_1G036190.1 vs. NCBI nr
Match: XP_022964856.1 (O-glucosyltransferase rumi homolog [Cucurbita moschata])

HSP 1 Score: 865.9 bits (2236), Expect = 1.6e-247
Identity = 395/471 (83.86%), Positives = 427/471 (90.66%), Query Frame = 0

Query: 2   APAPRPPSHLLPSVVAICFLSLTFLLCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKTFSD 61
           AP+ R PS++LPSVVA+ FL+LTFL+CYKVDDFAAQTKTVAGHNLDPTPWHLFPPK FS+
Sbjct: 12  APSSRTPSNILPSVVALSFLALTFLICYKVDDFAAQTKTVAGHNLDPTPWHLFPPKIFSE 71

Query: 62  ETRHARAVKIIHCSYLTCRYATNNATKFPFHSAVSAPKCPEFFRWIHHDLDPWARTRISM 121
           +TRHAR VKIIHCSYL CRYA N AT+ P HSAVS  +CPE FRWIHHDLDPWAR+RISM
Sbjct: 72  DTRHARTVKIIHCSYLACRYANNTATRLPLHSAVSTHQCPELFRWIHHDLDPWARSRISM 131

Query: 122 TQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQMLRRYPGMVPDVDMMFD 181
             L+ES+KFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQ+LRR+PGMVPDVDMMFD
Sbjct: 132 EHLDESKKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDMMFD 191

Query: 182 CMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFWGWPEVNLRSWREEFEDIKKGS 241
           CMD+P+INRTENK MPLPLFRYCTT+AHFDIPFPDWSFWGWPEVN+RSW EEF+DIKK S
Sbjct: 192 CMDRPTINRTENKDMPLPLFRYCTTDAHFDIPFPDWSFWGWPEVNIRSWGEEFKDIKKSS 251

Query: 242 KNLSWFNKFPRAYWKGNPDVDSPAREELLKCNHSRMWGAQIMRQDWAQEAKDGYEQSKLS 301
           K+ +W +K PRAYWKGNPDV SP R ELL CNHS  WGAQIMRQDW QEA+DG+EQSKLS
Sbjct: 252 KSSNWSSKLPRAYWKGNPDVASPVRTELLTCNHSIKWGAQIMRQDWDQEARDGFEQSKLS 311

Query: 302 NQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFTNMC 361
           NQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISP Y+DFFSRGLDPLKNYWPIPF NMC
Sbjct: 312 NQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPLYQDFFSRGLDPLKNYWPIPFDNMC 371

Query: 362 ESIKHAVDWGNTHFPEAETIGRQGQKFMENLSMDTVYSYMFHLITEYSKLQDFKPTPPPS 421
           ESIKHAVDWGN H  EAE IG+QGQ FME+LSMDTVY+YMF LITEYSKL DFKPTPPPS
Sbjct: 372 ESIKHAVDWGNDHLSEAEAIGQQGQNFMESLSMDTVYAYMFQLITEYSKLLDFKPTPPPS 431

Query: 422 ALEVCTDSLLCIADEKQMQFLEKSAASVSSVPPCSLNRGGSDIIYSWLQQK 473
           ALEVC +SLLCIADEKQ QFLEKSA S S VPPCSLNR GSD +YSWLQQ+
Sbjct: 432 ALEVCPESLLCIADEKQRQFLEKSATSASLVPPCSLNRAGSDSVYSWLQQE 482

BLAST of CsaV3_1G036190.1 vs. ExPASy Swiss-Prot
Match: Q5E9Q1 (Protein O-glucosyltransferase 1 OS=Bos taurus OX=9913 GN=POGLUT1 PE=2 SV=1)

HSP 1 Score: 106.3 bits (264), Expect = 9.6e-22
Identity = 85/339 (25.07%), Positives = 149/339 (43.95%), Query Frame = 0

Query: 96  SAPKCPEFFRWIHHDLDPWARTRISMTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRA 155
           S+P C  +   I  DL P+          E  ++       I++ RLY        +S  
Sbjct: 50  SSPNCSCYHGVIEEDLTPFRGGISRKMMAEVVRRKLGTHYQIIKNRLY-------RESDC 109

Query: 156 IF--TIWGLVQMLRRYPGMVPDVDMMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIP 215
           +F     G+   +    G +PD++M+ +  D P + +    A  +P+F +  T  + DI 
Sbjct: 110 MFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQVPKWMEPA--IPIFSFSKTLEYHDIM 169

Query: 216 FPDWSFWG-----WP--EVNLRSWREEFEDIKKGSKNLSWFNKFPRAYWKGNPDVDSPAR 275
           +P W+FW      WP   + L  W    ED+ + +    W  K   AY++G+    SP R
Sbjct: 170 YPAWTFWEGGPAVWPIYPMGLGRWDLFREDLVRSAAQWPWKKKNSTAYFRGSR--TSPER 229

Query: 276 EE--LLKCNHSRMWGAQIMRQDWAQEAKD--GYEQSK---LSNQCNHRYKIYAEGFAWSV 335
           +   LL   + ++  A+  +    +  KD  G   +K   L + C ++Y     G A S 
Sbjct: 230 DPLILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLVDHCKYKYLFNFRGVAASF 289

Query: 336 SLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFTNMCESIKHAVDWGNTHFPEAE 395
             K++  CGS+   +  ++ +FF   L P  +Y  IP      +++  + +   +   A+
Sbjct: 290 RFKHLFLCGSLVFHVGDEWLEFFYPQLKPWVHY--IPVKTDLSNVQELLQFVKANDDVAQ 349

Query: 396 TIGRQGQKFMEN-LSMDTVYSYMFHLITEYSKLQDFKPT 418
            I  +G +F+ N L MD +  Y  +L+TEYSK   +  T
Sbjct: 350 EIAERGSQFILNHLKMDDITCYWENLLTEYSKFLSYNVT 375

BLAST of CsaV3_1G036190.1 vs. ExPASy Swiss-Prot
Match: Q8T045 (O-glucosyltransferase rumi OS=Drosophila melanogaster OX=7227 GN=rumi PE=1 SV=1)

HSP 1 Score: 105.9 bits (263), Expect = 1.3e-21
Identity = 81/349 (23.21%), Positives = 146/349 (41.83%), Query Frame = 0

Query: 85  NATKFPFHSAVSAPKCPEFFRWIHHDLDPWARTRISMTQLEESQKFAAFRVVIVEGRLYV 144
           NA   P  S      C      +  DL P+  T ++   +E S ++   +  I   RLY 
Sbjct: 58  NADYKPCSSDPQDSDCSCHANVLKRDLAPYKSTGVTRQMIESSARYGT-KYKIYGHRLYR 117

Query: 145 D---MYYACVQSRAIFTIWGLVQMLRRYPGMVPDVDMMFDCMDKPSINRTENKAMPLPLF 204
           D   M+ A  +        G+   L      +PD+D++ +  D P +N     A   P+F
Sbjct: 118 DANCMFPARCE--------GIEHFLLPLVATLPDMDLIINTRDYPQLNAAWGNAAGGPVF 177

Query: 205 RYCTTEAHFDIPFPDWSFW-GWPEVNLR-----SWREEFEDIKKGSKNLSWFNKFPRAYW 264
            +  T+ + DI +P W+FW G P   L       W +  E ++K +  + W  K    ++
Sbjct: 178 SFSKTKEYRDIMYPAWTFWAGGPATKLHPRGIGRWDQMREKLEKRAAAIPWSQKRSLGFF 237

Query: 265 KGNPDVDSPAREELLKCNHSRMWGAQIMRQDWAQEAKD-----GYEQSKLSNQCNHRYKI 324
           +G+   D      LL   +  +  AQ  +    +  KD       ++    + C ++Y  
Sbjct: 238 RGSRTSDERDSLILLSRRNPELVEAQYTKNQGWKSPKDTLDAPAADEVSFEDHCKYKYLF 297

Query: 325 YAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFTNMCESIKHAVDW 384
              G A S  LK++  C S+   +  ++++FF   L P  +Y P+      +  +H + +
Sbjct: 298 NFRGVAASFRLKHLFLCKSLVFHVGDEWQEFFYDQLKPWVHYVPLKSYPSQQEYEHILSF 357

Query: 385 GNTHFPEAETIGRQGQKFM-ENLSMDTVYSYMFHLITEYSKLQDFKPTP 419
              +   A+ I ++G  F+ E+L M  +  Y   L+  Y KL  ++  P
Sbjct: 358 FKKNDALAQEIAQRGYDFIWEHLRMKDIKCYWRKLLKRYVKLLQYEVKP 397

BLAST of CsaV3_1G036190.1 vs. ExPASy Swiss-Prot
Match: Q29AU6 (O-glucosyltransferase rumi OS=Drosophila pseudoobscura pseudoobscura OX=46245 GN=rumi PE=3 SV=1)

HSP 1 Score: 105.9 bits (263), Expect = 1.3e-21
Identity = 88/374 (23.53%), Positives = 154/374 (41.18%), Query Frame = 0

Query: 67  RAVKIIHCSYLTCRYATNNATKFPFHSAVSAPKCPEFFRWIHHDLDPWARTRISMTQLEE 126
           R +K    SY  C    N+A     H+AV           I  DL P+  T +S   +E 
Sbjct: 50  RKIKKALASYQPCSSDANDA-NCSCHAAV-----------IKSDLAPYKATGVSRQMIES 109

Query: 127 SQKFAAFRVVIVEGRLYVD---MYYACVQSRAIFTIWGLVQMLRRYPGMVPDVDMMFDCM 186
           S ++   R  I E RLY +   M+ A  Q        G+   L      +PD+D++ +  
Sbjct: 110 SARYGT-RYKIYEKRLYREENCMFPARCQ--------GIEHFLLPLVATLPDMDLVINTR 169

Query: 187 DKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFW-GWPEVNLR-----SWREEFEDI 246
           D P IN         P+  +  T+ H DI +P W+FW G P   L       W    E +
Sbjct: 170 DYPQINMAWGNGAQGPILSFSKTKDHRDIMYPAWTFWAGGPATKLHPRGIGRWDLMREKL 229

Query: 247 KKGSKNLSWFNKFPRAYWKGNPDVDSPAREELLKCNHSRMWGAQIMRQDWAQEAKDGYE- 306
           +K +  + W  K    +++G+   D      LL   +  +  AQ  +    +  KD  + 
Sbjct: 230 EKRAAAIPWSQKRELGFFRGSRTSDERDSLILLSRRNPELVEAQYTKNQGWKSPKDTLDA 289

Query: 307 ----QSKLSNQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNY 366
               +    + C ++Y     G A S  LK++  C S+   +  ++++FF   L P  +Y
Sbjct: 290 PPAGEVSFEDHCKYKYLFNFRGVAASFRLKHLFLCQSLVFHVGDEWQEFFYDQLKPWVHY 349

Query: 367 WPIPFTNMCESIKHAVDWGNTHFPEAETIGRQGQKFM-ENLSMDTVYSYMFHLITEYSKL 426
            P+      +  +  + +   +   A+ I ++G+ F+ ++L M  +  Y   L+  Y KL
Sbjct: 350 VPLKNYPSQQEYEELLTFFRKNDALAQEIAQRGRDFIWQHLRMKDIKCYWRRLLKSYVKL 402

BLAST of CsaV3_1G036190.1 vs. ExPASy Swiss-Prot
Match: Q16QY8 (O-glucosyltransferase rumi homolog OS=Aedes aegypti OX=7159 GN=AAEL011121 PE=3 SV=1)

HSP 1 Score: 103.6 bits (257), Expect = 6.3e-21
Identity = 67/265 (25.28%), Positives = 120/265 (45.28%), Query Frame = 0

Query: 173 VPDVDMMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFW-GWPEVN----- 232
           +PD++++ +C D P INR   K   LP+  +  T+ + DI +P W FW G P ++     
Sbjct: 138 LPDMELIINCRDWPQINR-HWKQEKLPVLSFSKTDDYLDIMYPTWGFWEGGPAISLYPTG 197

Query: 233 LRSWREEFEDIKKGSKNLSWFNKFPRAYWKGNPDVDSPAREELLKCNHSRMWGAQIMRQD 292
           L  W +    IKK + +  W  K  +A+++G+   D      LL      +  AQ  +  
Sbjct: 198 LGRWDQHRVSIKKAADSWKWEKKKAKAFFRGSRTSDERDPLVLLSRRKPELVDAQYTKNQ 257

Query: 293 WAQEAKDGY-----EQSKLSNQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDF 352
             +  KD       ++ +L + C ++Y     G A S   K++  C S+   +  ++++F
Sbjct: 258 AWKSPKDTLNAKPAQEVRLEDHCQYKYLFNFRGVAASFRFKHLFLCRSLVFHVGSEWQEF 317

Query: 353 FSRGLDPLKNYWPIPFTNMCESIKHAVDWGNTHFPEAETIGRQG-QKFMENLSMDTVYSY 412
           F   L P  +Y P+      E ++  +++   H   A  I  +G +   ++L M  V  Y
Sbjct: 318 FYPSLKPWVHYVPVRVGATQEELEELIEFFAEHDDLAREIADRGFEHVWKHLRMKDVECY 377

Query: 413 MFHLITEYSKLQDFKPTPPPSALEV 426
              L+  Y KL  ++     S +EV
Sbjct: 378 WRKLLRRYGKLVKYEVKRDHSLVEV 401

BLAST of CsaV3_1G036190.1 vs. ExPASy Swiss-Prot
Match: Q8NBL1 (Protein O-glucosyltransferase 1 OS=Homo sapiens OX=9606 GN=POGLUT1 PE=1 SV=1)

HSP 1 Score: 101.3 bits (251), Expect = 3.1e-20
Identity = 70/272 (25.74%), Positives = 125/272 (45.96%), Query Frame = 0

Query: 161 GLVQMLRRYPGMVPDVDMMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFW 220
           G+   +    G +PD++M+ +  D P + +    A  +P+F +  T  + DI +P W+FW
Sbjct: 110 GVEHFILEVIGRLPDMEMVINVRDYPQVPKWMEPA--IPVFSFSKTSEYHDIMYPAWTFW 169

Query: 221 G-----WP--EVNLRSWREEFEDIKKGSKNLSWFNKFPRAYWKGNPDVDSPAREE--LLK 280
                 WP     L  W    ED+ + +    W  K   AY++G+    SP R+   LL 
Sbjct: 170 EGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKKKNSTAYFRGSR--TSPERDPLILLS 229

Query: 281 CNHSRMWGAQIMRQDWAQEAKD--GYEQSK---LSNQCNHRYKIYAEGFAWSVSLKYILS 340
             + ++  A+  +    +  KD  G   +K   L + C ++Y     G A S   K++  
Sbjct: 230 RKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLVDHCKYKYLFNFRGVAASFRFKHLFL 289

Query: 341 CGSMSLIISPQYEDFFSRGLDPLKNYWPIPFTNMCESIKHAVDWGNTHFPEAETIGRQGQ 400
           CGS+   +  ++ +FF   L P  +Y  IP      +++  + +   +   A+ I  +G 
Sbjct: 290 CGSLVFHVGDEWLEFFYPQLKPWVHY--IPVKTDLSNVQELLQFVKANDDVAQEIAERGS 349

Query: 401 KFMEN-LSMDTVYSYMFHLITEYSKLQDFKPT 418
           +F+ N L MD +  Y  +L++EYSK   +  T
Sbjct: 350 QFIRNHLQMDDITCYWENLLSEYSKFLSYNVT 375

BLAST of CsaV3_1G036190.1 vs. ExPASy TrEMBL
Match: A0A0A0LY89 (CAP10 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G531170 PE=4 SV=1)

HSP 1 Score: 1012.3 bits (2616), Expect = 6.6e-292
Identity = 470/472 (99.58%), Positives = 472/472 (100.00%), Query Frame = 0

Query: 1   MAPAPRPPSHLLPSVVAICFLSLTFLLCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKTFS 60
           MAPAPRPPSHLLPSVVAICFLSLTFLLCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKTFS
Sbjct: 1   MAPAPRPPSHLLPSVVAICFLSLTFLLCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKTFS 60

Query: 61  DETRHARAVKIIHCSYLTCRYATNNATKFPFHSAVSAPKCPEFFRWIHHDLDPWARTRIS 120
           DETRHARAVKIIHCSYLTCRYATNNATKFPFHSAVSAPKCPEFFRWIHHDLDPWARTRIS
Sbjct: 61  DETRHARAVKIIHCSYLTCRYATNNATKFPFHSAVSAPKCPEFFRWIHHDLDPWARTRIS 120

Query: 121 MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQMLRRYPGMVPDVDMMF 180
           MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQMLRRYPGMVPDVDMMF
Sbjct: 121 MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQMLRRYPGMVPDVDMMF 180

Query: 181 DCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFWGWPEVNLRSWREEFEDIKKG 240
           DCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFWGWPEVNLRSWREEFEDIKKG
Sbjct: 181 DCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFWGWPEVNLRSWREEFEDIKKG 240

Query: 241 SKNLSWFNKFPRAYWKGNPDVDSPAREELLKCNHSRMWGAQIMRQDWAQEAKDGYEQSKL 300
           SKNLSWFNKFPRAYWKGNPDVDSPAREELLKCNHSRMWGAQIMRQDWAQEA+DGYEQSKL
Sbjct: 241 SKNLSWFNKFPRAYWKGNPDVDSPAREELLKCNHSRMWGAQIMRQDWAQEARDGYEQSKL 300

Query: 301 SNQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFTNM 360
           SNQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFTNM
Sbjct: 301 SNQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFTNM 360

Query: 361 CESIKHAVDWGNTHFPEAETIGRQGQKFMENLSMDTVYSYMFHLITEYSKLQDFKPTPPP 420
           CESIKHAVDWGNTHFPEAETIGRQGQKFME+LSMDTVYSYMFHLITEYSKLQDFKPTPPP
Sbjct: 361 CESIKHAVDWGNTHFPEAETIGRQGQKFMESLSMDTVYSYMFHLITEYSKLQDFKPTPPP 420

Query: 421 SALEVCTDSLLCIADEKQMQFLEKSAASVSSVPPCSLNRGGSDIIYSWLQQK 473
           SALEVCTDSLLCIADEKQMQFLEKSAASVSSVPPCSLNRGGSDIIYSWLQQK
Sbjct: 421 SALEVCTDSLLCIADEKQMQFLEKSAASVSSVPPCSLNRGGSDIIYSWLQQK 472

BLAST of CsaV3_1G036190.1 vs. ExPASy TrEMBL
Match: A0A1S3C5H2 (O-glucosyltransferase rumi homolog OS=Cucumis melo OX=3656 GN=LOC103497080 PE=4 SV=1)

HSP 1 Score: 976.9 bits (2524), Expect = 3.1e-281
Identity = 452/471 (95.97%), Positives = 461/471 (97.88%), Query Frame = 0

Query: 1   MAPAPRPPSHLLPSVVAICFLSLTFLLCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKTFS 60
           MAPAPRPPSHLLP+VVAI FLSLTFLLCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKTF+
Sbjct: 1   MAPAPRPPSHLLPAVVAISFLSLTFLLCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKTFN 60

Query: 61  DETRHARAVKIIHCSYLTCRYATNNATKFPFHSAVSAPKCPEFFRWIHHDLDPWARTRIS 120
           DETRHARAVKIIHCSYLTCRY TNNATKFPFHSAVSAPKCPEFFRWIHHDLDPWA+TRIS
Sbjct: 61  DETRHARAVKIIHCSYLTCRYVTNNATKFPFHSAVSAPKCPEFFRWIHHDLDPWAQTRIS 120

Query: 121 MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQMLRRYPGMVPDVDMMF 180
           MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQ+LRR+PGMVPDVDMMF
Sbjct: 121 MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDMMF 180

Query: 181 DCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFWGWPEVNLRSWREEFEDIKKG 240
           DCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFWGWPEVNLRSWREEFEDIKKG
Sbjct: 181 DCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFWGWPEVNLRSWREEFEDIKKG 240

Query: 241 SKNLSWFNKFPRAYWKGNPDVDSPAREELLKCNHSRMWGAQIMRQDWAQEAKDGYEQSKL 300
           SKNLSW NKFPRAYWKGNPDVDSPAR ELLKCNHSR WGAQIMRQDWAQEA+DGYEQSKL
Sbjct: 241 SKNLSWLNKFPRAYWKGNPDVDSPARTELLKCNHSRKWGAQIMRQDWAQEARDGYEQSKL 300

Query: 301 SNQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFTNM 360
           SNQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPF+NM
Sbjct: 301 SNQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFSNM 360

Query: 361 CESIKHAVDWGNTHFPEAETIGRQGQKFMENLSMDTVYSYMFHLITEYSKLQDFKPTPPP 420
           CESIKHAVDWGNTHFPEAETIG+QGQ FME+LSMDTVYSYMFHLITEYSKL DFKPTPPP
Sbjct: 361 CESIKHAVDWGNTHFPEAETIGQQGQNFMESLSMDTVYSYMFHLITEYSKLLDFKPTPPP 420

Query: 421 SALEVCTDSLLCIADEKQMQFLEKSAASVSSVPPCSLNRGGSDIIYSWLQQ 472
           SALEVC DSLLCIADEKQ QFLEKSAASVSSVPPCSLNR GSDIIYSWLQQ
Sbjct: 421 SALEVCADSLLCIADEKQRQFLEKSAASVSSVPPCSLNRAGSDIIYSWLQQ 471

BLAST of CsaV3_1G036190.1 vs. ExPASy TrEMBL
Match: A0A6J1HK39 (O-glucosyltransferase rumi homolog OS=Cucurbita moschata OX=3662 GN=LOC111464838 PE=4 SV=1)

HSP 1 Score: 865.9 bits (2236), Expect = 7.7e-248
Identity = 395/471 (83.86%), Positives = 427/471 (90.66%), Query Frame = 0

Query: 2   APAPRPPSHLLPSVVAICFLSLTFLLCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKTFSD 61
           AP+ R PS++LPSVVA+ FL+LTFL+CYKVDDFAAQTKTVAGHNLDPTPWHLFPPK FS+
Sbjct: 12  APSSRTPSNILPSVVALSFLALTFLICYKVDDFAAQTKTVAGHNLDPTPWHLFPPKIFSE 71

Query: 62  ETRHARAVKIIHCSYLTCRYATNNATKFPFHSAVSAPKCPEFFRWIHHDLDPWARTRISM 121
           +TRHAR VKIIHCSYL CRYA N AT+ P HSAVS  +CPE FRWIHHDLDPWAR+RISM
Sbjct: 72  DTRHARTVKIIHCSYLACRYANNTATRLPLHSAVSTHQCPELFRWIHHDLDPWARSRISM 131

Query: 122 TQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQMLRRYPGMVPDVDMMFD 181
             L+ES+KFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQ+LRR+PGMVPDVDMMFD
Sbjct: 132 EHLDESKKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDMMFD 191

Query: 182 CMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFWGWPEVNLRSWREEFEDIKKGS 241
           CMD+P+INRTENK MPLPLFRYCTT+AHFDIPFPDWSFWGWPEVN+RSW EEF+DIKK S
Sbjct: 192 CMDRPTINRTENKDMPLPLFRYCTTDAHFDIPFPDWSFWGWPEVNIRSWGEEFKDIKKSS 251

Query: 242 KNLSWFNKFPRAYWKGNPDVDSPAREELLKCNHSRMWGAQIMRQDWAQEAKDGYEQSKLS 301
           K+ +W +K PRAYWKGNPDV SP R ELL CNHS  WGAQIMRQDW QEA+DG+EQSKLS
Sbjct: 252 KSSNWSSKLPRAYWKGNPDVASPVRTELLTCNHSIKWGAQIMRQDWDQEARDGFEQSKLS 311

Query: 302 NQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFTNMC 361
           NQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISP Y+DFFSRGLDPLKNYWPIPF NMC
Sbjct: 312 NQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPLYQDFFSRGLDPLKNYWPIPFDNMC 371

Query: 362 ESIKHAVDWGNTHFPEAETIGRQGQKFMENLSMDTVYSYMFHLITEYSKLQDFKPTPPPS 421
           ESIKHAVDWGN H  EAE IG+QGQ FME+LSMDTVY+YMF LITEYSKL DFKPTPPPS
Sbjct: 372 ESIKHAVDWGNDHLSEAEAIGQQGQNFMESLSMDTVYAYMFQLITEYSKLLDFKPTPPPS 431

Query: 422 ALEVCTDSLLCIADEKQMQFLEKSAASVSSVPPCSLNRGGSDIIYSWLQQK 473
           ALEVC +SLLCIADEKQ QFLEKSA S S VPPCSLNR GSD +YSWLQQ+
Sbjct: 432 ALEVCPESLLCIADEKQRQFLEKSATSASLVPPCSLNRAGSDSVYSWLQQE 482

BLAST of CsaV3_1G036190.1 vs. ExPASy TrEMBL
Match: A0A6J1HZ33 (O-glucosyltransferase rumi homolog OS=Cucurbita maxima OX=3661 GN=LOC111469414 PE=4 SV=1)

HSP 1 Score: 859.8 bits (2220), Expect = 5.5e-246
Identity = 393/471 (83.44%), Positives = 425/471 (90.23%), Query Frame = 0

Query: 2   APAPRPPSHLLPSVVAICFLSLTFLLCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKTFSD 61
           A + R PS++LPSVVA+ FL+LTFL+CYKVDDFAAQTKTVAGHNLDPTPWHLFPPK FS+
Sbjct: 12  ALSSRTPSNILPSVVALSFLALTFLVCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKIFSE 71

Query: 62  ETRHARAVKIIHCSYLTCRYATNNATKFPFHSAVSAPKCPEFFRWIHHDLDPWARTRISM 121
           +TRHAR VKIIHCSYL CRYA N AT+ P HSAVS  +CPE FRWIHHDLDPWAR+RISM
Sbjct: 72  DTRHARTVKIIHCSYLACRYANNTATRLPLHSAVSTHQCPELFRWIHHDLDPWARSRISM 131

Query: 122 TQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQMLRRYPGMVPDVDMMFD 181
             L+ES KFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQ+LRR+PGMVPDVDMMFD
Sbjct: 132 KHLDESMKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDMMFD 191

Query: 182 CMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFWGWPEVNLRSWREEFEDIKKGS 241
           CMD+P+INRTENK MPLPLFRYCTT+AHFDIPFPDWSFWGWPEVN+RSW EEF+DIKK S
Sbjct: 192 CMDRPTINRTENKDMPLPLFRYCTTDAHFDIPFPDWSFWGWPEVNIRSWGEEFKDIKKSS 251

Query: 242 KNLSWFNKFPRAYWKGNPDVDSPAREELLKCNHSRMWGAQIMRQDWAQEAKDGYEQSKLS 301
           K+ +W +K PRAYWKGNPDV SP R ELL CNHS  WGAQIMRQDW QEA+DG+EQSKLS
Sbjct: 252 KSSNWSSKLPRAYWKGNPDVASPVRTELLTCNHSIKWGAQIMRQDWDQEARDGFEQSKLS 311

Query: 302 NQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFTNMC 361
            QCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISP Y+DFFSRGLDPLKNYWPIPF NMC
Sbjct: 312 KQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPLYQDFFSRGLDPLKNYWPIPFDNMC 371

Query: 362 ESIKHAVDWGNTHFPEAETIGRQGQKFMENLSMDTVYSYMFHLITEYSKLQDFKPTPPPS 421
           ESIKHAVDWGN H  EAE IG+QGQ FME+LSMDTVY+YMF LITEYSKL DFKPTPPPS
Sbjct: 372 ESIKHAVDWGNDHLSEAEAIGQQGQNFMESLSMDTVYAYMFQLITEYSKLLDFKPTPPPS 431

Query: 422 ALEVCTDSLLCIADEKQMQFLEKSAASVSSVPPCSLNRGGSDIIYSWLQQK 473
           ALEVC++SLLCIADEKQ QFLEKSA S S VPPCSLNR GSD +YSWLQQ+
Sbjct: 432 ALEVCSESLLCIADEKQRQFLEKSATSASLVPPCSLNRAGSDSVYSWLQQE 482
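
The percentages on each HSP statistics line follow directly from the match counts: identical (or positive-scoring) positions divided by the aligned length. The following is a minimal Python sketch, not part of the database output, that re-derives them from such a line. The helper name `parse_hsp_stats` is hypothetical, and the pattern assumes the line layout shown in this record (it accepts both the "Postives" and "Positives" spellings).

```python
import re

# Assumed layout: "Identity = a/b (p%), Positives = c/d (q%)".
# "Posi?tives" tolerates the "Postives" spelling used in some exports.
STAT_RE = re.compile(r"(Identity|Posi?tives) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp_stats(line):
    """Return {label: (matches, aligned_length, reported_pct, recomputed_pct)}."""
    stats = {}
    for label, matches, length, pct in STAT_RE.findall(line):
        matches, length = int(matches), int(length)
        # Percentage = matching positions / aligned length, to two decimals.
        recomputed = round(100.0 * matches / length, 2)
        key = "Identity" if label == "Identity" else "Positives"
        stats[key] = (matches, length, float(pct), recomputed)
    return stats


line = "Identity = 393/471 (83.44%), Positives = 425/471 (90.23%), Query Frame = 0"
stats = parse_hsp_stats(line)
for label, (matches, length, reported, recomputed) in stats.items():
    print(label, matches, length, reported, recomputed)
```

For the A0A6J1HZ33 hit above, the recomputed values agree with the reported ones, which is a quick consistency check when alignment statistics are copied between tables.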

BLAST of CsaV3_1G036190.1 vs. ExPASy TrEMBL
Match: A0A6J1DBK2 (O-glucosyltransferase rumi OS=Momordica charantia OX=3673 GN=LOC111018978 PE=4 SV=1)

HSP 1 Score: 849.0 bits (2192), Expect = 9.7e-243
Identity = 390/475 (82.11%), Positives = 424/475 (89.26%), Query Frame = 0

Query: 2   APAPRPPSHLLPSVVAICFLSLTFLLCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKTFSD 61
           +P  R PS+LLPSV+A+ FLSL FL+ YKVDDFAAQTKTV GHNLDPTPWHLFPP++F D
Sbjct: 94  SPRSRSPSYLLPSVLALSFLSLAFLVFYKVDDFAAQTKTVVGHNLDPTPWHLFPPRSFDD 153

Query: 62  ETRHARAVKIIHCSYLTCRYATNN----ATKFPFHSAVSAPKCPEFFRWIHHDLDPWART 121
           ETRHAR +KII CSYLTCRYATN     A+  P   A S+ KCP+FFRWIHHDLDPWAR+
Sbjct: 154 ETRHARDLKIIRCSYLTCRYATNTNTSAASSRPSRRAQSSSKCPDFFRWIHHDLDPWARS 213

Query: 122 RISMTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQMLRRYPGMVPDVD 181
           RIS  QL E+QKFAAFRVVIVEG+LYVDMYYACVQSRA+FTIWGLVQ+L R+PGMVPDVD
Sbjct: 214 RISTAQLAEAQKFAAFRVVIVEGKLYVDMYYACVQSRAVFTIWGLVQLLERFPGMVPDVD 273

Query: 182 MMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFWGWPEVNLRSWREEFEDI 241
           MMFDCMDKPSINRTE+  MPLPLFRYCTTEAHFDIPFPDWSFWGWPEVNLRSW EEFEDI
Sbjct: 274 MMFDCMDKPSINRTEHHDMPLPLFRYCTTEAHFDIPFPDWSFWGWPEVNLRSWSEEFEDI 333

Query: 242 KKGSKNLSWFNKFPRAYWKGNPDVDSPAREELLKCNHSRMWGAQIMRQDWAQEAKDGYEQ 301
           KKGSK  SW +K P AYWKGNPDVDSPAR ELLKCN +R WGAQIMRQ+W +EA+ G+EQ
Sbjct: 334 KKGSKKSSWSSKLPLAYWKGNPDVDSPARTELLKCNDTRQWGAQIMRQNWVEEARAGFEQ 393

Query: 302 SKLSNQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPF 361
           SKLSNQCN+RYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPF
Sbjct: 394 SKLSNQCNYRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPF 453

Query: 362 TNMCESIKHAVDWGNTHFPEAETIGRQGQKFMENLSMDTVYSYMFHLITEYSKLQDFKPT 421
           TNMC+SIKHAVDWGN+H PE E +G++GQ FME+LSMDTVYSYMFHLI EYSKLQDFKPT
Sbjct: 454 TNMCQSIKHAVDWGNSHLPETEAVGQRGQDFMESLSMDTVYSYMFHLIREYSKLQDFKPT 513

Query: 422 PPPSALEVCTDSLLCIADEKQMQFLEKSAASVSSVPPCSLNRGGSDIIYSWLQQK 473
           PPPSALEVC +SLLCIADE Q  FLEKSA S SSVPPCSL+R GSDI+YSWLQQK
Sbjct: 514 PPPSALEVCAESLLCIADEMQRSFLEKSATSASSVPPCSLDRAGSDIVYSWLQQK 568

BLAST of CsaV3_1G036190.1 vs. TAIR 10
Match: AT1G07220.1 (Arabidopsis thaliana protein of unknown function (DUF821) )

HSP 1 Score: 674.5 bits (1739), Expect = 6.3e-194
Identity = 310/475 (65.26%), Positives = 376/475 (79.16%), Query Frame = 0

Query: 4   APRPPSHLLPSVVAICFLSLTFLLCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKTFSDET 63
           +PR PS+LL  V+A+ F S T LL YKVDDF AQTKT+AGHNL+PTPWH+FP K+FS  T
Sbjct: 14  SPRSPSYLLLCVLALSFFSFTALLFYKVDDFIAQTKTLAGHNLEPTPWHIFPRKSFSAAT 73

Query: 64  RHARAVKIIHCSYLTCRYATNNATKFPFHSAVSA------PKCPEFFRWIHHDLDPWART 123
           +H++A +I+ CSY +C Y      K   HS   +      P+CP+FFRWIH DL+PWA+T
Sbjct: 74  KHSQAYRILQCSYFSCPYKAVVQPK-SLHSESGSGRQTHQPQCPDFFRWIHRDLEPWAKT 133

Query: 124 RISMTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQMLRRYPGMVPDVD 183
            ++   ++ ++  AAFRVVI+ G+LYVD+YYACVQSR +FTIWG++Q+L +YPGMVPDVD
Sbjct: 134 GVTKEHVKRAKANAAFRVVILSGKLYVDLYYACVQSRMMFTIWGILQLLTKYPGMVPDVD 193

Query: 184 MMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFWGWPEVNLRSWREEFEDI 243
           MMFDCMDKP IN+TE ++ P+PLFRYCT EAH DIPFPDWSFWGW E NLR W EEF DI
Sbjct: 194 MMFDCMDKPIINQTEYQSFPVPLFRYCTNEAHLDIPFPDWSFWGWSETNLRPWEEEFGDI 253

Query: 244 KKGSKNLSWFNKFPRAYWKGNPDVDSPAREELLKCNHSRMWGAQIMRQDWAQEAKDGYEQ 303
           K+GS+  SW+NK PRAYWKGNPDV SP R EL+KCNHSR+WGAQIMRQDWA+EAK G+EQ
Sbjct: 254 KQGSRRRSWYNKQPRAYWKGNPDVVSPIRLELMKCNHSRLWGAQIMRQDWAEEAKGGFEQ 313

Query: 304 SKLSNQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPF 363
           SKLSNQCNHRYKIYAEG+AWSVSLKYILSCGSM+LIISP+YEDFFSRGL P +NYWPI  
Sbjct: 314 SKLSNQCNHRYKIYAEGYAWSVSLKYILSCGSMTLIISPEYEDFFSRGLLPKENYWPISP 373

Query: 364 TNMCESIKHAVDWGNTHFPEAETIGRQGQKFMENLSMDTVYSYMFHLITEYSKLQDFKPT 423
           T++C SIK+AVDWGN++  EAETIG++GQ +ME+LSM+ VY YMFHLITEYSKLQ FKP 
Sbjct: 374 TDLCRSIKYAVDWGNSNPSEAETIGKRGQGYMESLSMNRVYDYMFHLITEYSKLQKFKPE 433

Query: 424 PPPSALEVCTDSLLCIADEKQMQFLEKSAASVSSVPPCSLNRGGSDIIYSWLQQK 473
            P SA EVC  SLLCIA++K+ + LE+S    S   PC       + +   +QQK
Sbjct: 434 KPASANEVCAGSLLCIAEQKERELLERSRVVPSLDQPCKFPVEDRNRLEWLIQQK 487

BLAST of CsaV3_1G036190.1 vs. TAIR 10
Match: AT5G23850.1 (Arabidopsis thaliana protein of unknown function (DUF821) )

HSP 1 Score: 409.8 bits (1052), Expect = 2.9e-114
Identity = 199/464 (42.89%), Positives = 292/464 (62.93%), Query Frame = 0

Query: 11  LLPSVVAICFLSLTFLLCYKV--DDFAAQTKTVAGHNLDPTPWHLFPPKTFSDETRHARA 70
           LL  ++   F+S   LL   V  +  AA T T        TP +   P+  +  T+  + 
Sbjct: 44  LLILLIVGAFISTRLLLDTTVLLEKKAATTTTTKTQTQTITPKY---PRPTTVITQSPKP 103

Query: 71  VKIIHCS----YLTC---RYATNNATKFPFHSAVSAPKCPEFFRWIHHDLDPWARTRISM 130
              +HCS      +C   +Y T  + +    +      CP++FRWIH DL PW+RT I+ 
Sbjct: 104 EFTLHCSANETTASCPSNKYPTTTSFEDDDTNHPPTATCPDYFRWIHEDLRPWSRTGITR 163

Query: 131 TQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQMLRRYPGMVPDVDMMFD 190
             LE ++K A FR+ IV G++YV+ +    Q+R +FTIWG +Q+LR+YPG +PD+++MFD
Sbjct: 164 EALERAKKTATFRLAIVGGKIYVEKFQDAFQTRDVFTIWGFLQLLRKYPGKIPDLELMFD 223

Query: 191 CMDKPSINRTE----NKAMPLPLFRYCTTEAHFDIPFPDWSFWGWPEVNLRSWREEFEDI 250
           C+D P +  TE    N   P PLFRYC  E   DI FPDWSFWGW EVN++ W    +++
Sbjct: 224 CVDWPVVRATEFAGANAPSPPPLFRYCGNEETLDIVFPDWSFWGWAEVNIKPWESLLKEL 283

Query: 251 KKGSKNLSWFNKFPRAYWKGNPDVDSPAREELLKCNHS--RMWGAQIMRQDWAQEAKDGY 310
           ++G++   W N+ P AYWKGNP V +  R++L+KCN S    W A++  QDW +E+K+GY
Sbjct: 284 REGNERTKWINREPYAYWKGNPMV-AETRQDLMKCNVSEEHEWNARLYAQDWIKESKEGY 343

Query: 311 EQSKLSNQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPI 370
           +QS L++QC+HRYKIY EG AWSVS KYIL+C S++L++ P Y DFF+RGL P  +YWP+
Sbjct: 344 KQSDLASQCHHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLLPAHHYWPV 403

Query: 371 PFTNMCESIKHAVDWGNTHFPEAETIGRQGQKFM-ENLSMDTVYSYMFHLITEYSKLQDF 430
              + C SIK AVDWGN+H  +A+ IG+    F+ ++L MD VY YM+HL+TEYSKL  F
Sbjct: 404 REHDKCRSIKFAVDWGNSHIQKAQDIGKAASDFIQQDLKMDYVYDYMYHLLTEYSKLLQF 463

Query: 431 KPTPPPSALEVCTDSLLCIADEKQMQFLEKS-AASVSSVPPCSL 458
           KP  P +A+E+C++++ C+    + +F+ +S     +   PC++
Sbjct: 464 KPEIPRNAVEICSETMACLRSGNERKFMTESLVKQPADSGPCAM 503

BLAST of CsaV3_1G036190.1 vs. TAIR 10
Match: AT3G48980.1 (Arabidopsis thaliana protein of unknown function (DUF821) )

HSP 1 Score: 394.0 bits (1011), Expect = 1.6e-109
Identity = 171/366 (46.72%), Positives = 252/366 (68.85%), Query Frame = 0

Query: 100 CPEFFRWIHHDLDPWARTRISMTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTI 159
           CP++FRWIH DL PW +T I+   LE +   A FR+ I+ GR+YV+ +    Q+R +FTI
Sbjct: 136 CPDYFRWIHEDLRPWEKTGITREALERANATAIFRLAIINGRIYVEKFREAFQTRDVFTI 195

Query: 160 WGLVQMLRRYPGMVPDVDMMFDCMDKPSINRTE----NKAMPLPLFRYCTTEAHFDIPFP 219
           WG VQ+LRRYPG +PD+++MFDC+D P +   E    ++  P PLFRYC  +   DI FP
Sbjct: 196 WGFVQLLRRYPGKIPDLELMFDCVDWPVVKAAEFAGVDQPPPPPLFRYCANDETLDIVFP 255

Query: 220 DWSFWGWPEVNLRSWREEFEDIKKGSKNLSWFNKFPRAYWKGNPDVDSPAREELLKCNHS 279
           DWS+WGW EVN++ W    +++++G++   W ++ P AYWKGNP V +  R +L+KCN S
Sbjct: 256 DWSYWGWAEVNIKPWESLLKELREGNQRTKWIDREPYAYWKGNPTV-AETRLDLMKCNLS 315

Query: 280 RM--WGAQIMRQDWAQEAKDGYEQSKLSNQCNHRYKIYAEGFAWSVSLKYILSCGSMSLI 339
            +  W A++ +QDW +E+K+GY+QS L++QC+HRYKIY EG AWSVS KYIL+C S++L+
Sbjct: 316 EVYDWKARLYKQDWVKESKEGYKQSDLASQCHHRYKIYIEGSAWSVSEKYILACDSVTLM 375

Query: 340 ISPQYEDFFSRGLDPLKNYWPIPFTNMCESIKHAVDWGNTHFPEAETIGRQGQKFM-ENL 399
           + P Y DFF+RG+ P  +YWP+   + C SIK AVDWGN H  +A+ IG++  +F+ + L
Sbjct: 376 VKPHYYDFFTRGMFPGHHYWPVKEDDKCRSIKFAVDWGNLHMRKAQDIGKKASEFVQQEL 435

Query: 400 SMDTVYSYMFHLITEYSKLQDFKPTPPPSALEVCTDSLLCIADEKQMQFLEKSAAS-VSS 458
            MD VY YMFHL+ +YSKL  FKP  P ++ E+C++++ C  D  + +F+ +S     + 
Sbjct: 436 KMDYVYDYMFHLLIQYSKLLRFKPEIPQNSTELCSEAMACPRDGNERKFMMESLVKRPAE 495

BLAST of CsaV3_1G036190.1 vs. TAIR 10
Match: AT1G63420.1 (Arabidopsis thaliana protein of unknown function (DUF821) )

HSP 1 Score: 392.9 bits (1008), Expect = 3.7e-109
Identity = 182/375 (48.53%), Positives = 255/375 (68.00%), Query Frame = 0

Query: 100 CPEFFRWIHHDLDPWARTRISMTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTI 159
           CP++F+WIH DL PW  T I+   +E  +  A FR+VI+ G+++V+ Y   +Q+R  FT+
Sbjct: 170 CPDYFKWIHEDLKPWRETGITKEMVERGKTTAHFRLVILNGKVFVENYKKSIQTRDAFTL 229

Query: 160 WGLVQMLRRYPGMVPDVDMMFDCMDKPSI--------NRTENKAMPLPLFRYCTTEAHFD 219
           WG++Q+LR+YPG +PDVD+MFDC D+P I        NRT   A P PLFRYC      D
Sbjct: 230 WGILQLLRKYPGKLPDVDLMFDCDDRPVIRSDGYNILNRTVENAPP-PLFRYCGDRWTVD 289

Query: 220 IPFPDWSFWGWPEVNLRSWREEFEDIKKGSKNLSWFNKFPRAYWKGNPDVDSPAREELLK 279
           I FPDWSFWGW E+N+R W +  +++++G K   +  +   AYWKGNP V SP+RE+LL 
Sbjct: 290 IVFPDWSFWGWQEINIREWSKVLKEMEEGKKKKKFMERDAYAYWKGNPFVASPSREDLLT 349

Query: 280 CNHSRM--WGAQIMRQDWAQEAKDGYEQSKLSNQCNHRYKIYAEGFAWSVSLKYILSCGS 339
           CN S +  W A+I  QDW  E + G+E S ++NQC +RYKIY EG+AWSVS KYIL+C S
Sbjct: 350 CNLSSLHDWNARIFIQDWISEGQRGFENSNVANQCTYRYKIYIEGYAWSVSEKYILACDS 409

Query: 340 MSLIISPQYEDFFSRGLDPLKNYWPIPFTNMCESIKHAVDWGNTHFPEAETIGRQGQKFM 399
           ++L++ P Y DFFSR L PL++YWPI   + C SIK AVDW N H  +A+ IGR+  +FM
Sbjct: 410 VTLMVKPYYYDFFSRTLQPLQHYWPIRDKDKCRSIKFAVDWLNNHTQKAQEIGREASEFM 469

Query: 400 E-NLSMDTVYSYMFHLITEYSKLQDFKPTPPPSALEVCTDSLLCIADEKQMQFLEKS--A 458
           + +LSM+ VY YMFHL+ EYSKL  +KP  P +++E+CT++L+C ++ + +  ++K    
Sbjct: 470 QRDLSMENVYDYMFHLLNEYSKLLKYKPQVPKNSVELCTEALVCPSEGEDVNGVDKKFMI 529

BLAST of CsaV3_1G036190.1 vs. TAIR 10
Match: AT2G45830.1 (downstream target of AGL15 2 )

HSP 1 Score: 386.3 bits (991), Expect = 3.4e-107
Identity = 177/382 (46.34%), Positives = 252/382 (65.97%), Query Frame = 0

Query: 84  NNATKFPFHSAVSAPKCPEFFRWIHHDLDPWARTRISMTQLEESQKFAAFRVVIVEGRLY 143
           NN      HS +S   CP +FRWIH DL PW  T ++   LE++++ A FRVVI++GR+Y
Sbjct: 105 NNDKPRSSHSRIST--CPSYFRWIHEDLRPWKETGVTRGMLEKARRTAHFRVVILDGRVY 164

Query: 144 VDMYYACVQSRAIFTIWGLVQMLRRYPGMVPDVDMMFDCMDKPSIN----RTENKAMPLP 203
           V  Y   +Q+R +FT+WG+VQ+LR YPG +PD+++MFD  D+P++     + +    P P
Sbjct: 165 VKKYRKSIQTRDVFTLWGIVQLLRWYPGRLPDLELMFDPDDRPTVRSKDFQGQQHPAPPP 224

Query: 204 LFRYCTTEAHFDIPFPDWSFWGWPEVNLRSWREEFEDIKKGSKNLSWFNKFPRAYWKGNP 263
           LFRYC+ +A  DI FPDWSFWGW EVN++ W +    I++G+K   W ++   AYW+GNP
Sbjct: 225 LFRYCSDDASLDIVFPDWSFWGWAEVNIKPWDKSLVAIEEGNKMTQWKDRVAYAYWRGNP 284

Query: 264 DVDSPAREELLKCNHSRM--WGAQIMRQDWAQEAKDGYEQSKLSNQCNHRYKIYAEGFAW 323
           +V +P R +LL+CN S    W  ++  QDW +E+++G++ S L NQC HRYKIY EG+AW
Sbjct: 285 NV-APTRRDLLRCNVSAQEDWNTRLYIQDWDRESREGFKNSNLENQCTHRYKIYIEGWAW 344

Query: 324 SVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFTNMCESIKHAVDWGNTHFPE 383
           SVS KYI++C SM+L + P + DF+ RG+ PL++YWPI  T+ C S+K AV WGNTH  +
Sbjct: 345 SVSEKYIMACDSMTLYVRPMFYDFYVRGMMPLQHYWPIRDTSKCTSLKFAVHWGNTHLDQ 404

Query: 384 AETIGRQGQKFM-ENLSMDTVYSYMFHLITEYSKLQDFKPTPPPSALEVCTDSLLCIADE 443
           A  IG +G +F+ E + M+ VY YMFHL+ EY+KL  FKP  P  A E+  D + C A  
Sbjct: 405 ASKIGEEGSRFIREEVKMEYVYDYMFHLMNEYAKLLKFKPEIPWGATEITPDIMGCSATG 464

Query: 444 KQMQFLEKSAASV-SSVPPCSL 458
           +   F+E+S     S   PC +
Sbjct: 465 RWRDFMEESMVMFPSEESPCEM 483

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_004145318.2  1.6e-292  100.00        O-glucosyltransferase rumi homolog [Cucumis sativus] >KAE8653283.1 hypothetical ...
XP_008457372.1  6.4e-281  95.97         PREDICTED: O-glucosyltransferase rumi homolog [Cucumis melo]
XP_038893964.1  7.8e-271  91.82         O-glucosyltransferase rumi homolog [Benincasa hispida]
KAG6583363.1    1.6e-247  83.86         O-glucosyltransferase rumi-like protein, partial [Cucurbita argyrosperma subsp. ...
XP_022964856.1  1.6e-247  83.86         O-glucosyltransferase rumi homolog [Cucurbita moschata]

Match Name      E-value   Identity (%)  Description
Q5E9Q1          9.6e-22   25.07         Protein O-glucosyltransferase 1 OS=Bos taurus OX=9913 GN=POGLUT1 PE=2 SV=1
Q8T045          1.3e-21   23.21         O-glucosyltransferase rumi OS=Drosophila melanogaster OX=7227 GN=rumi PE=1 SV=1
Q29AU6          1.3e-21   23.53         O-glucosyltransferase rumi OS=Drosophila pseudoobscura pseudoobscura OX=46245 GN...
Q16QY8          6.3e-21   25.28         O-glucosyltransferase rumi homolog OS=Aedes aegypti OX=7159 GN=AAEL011121 PE=3 S...
Q8NBL1          3.1e-20   25.74         Protein O-glucosyltransferase 1 OS=Homo sapiens OX=9606 GN=POGLUT1 PE=1 SV=1

Match Name      E-value   Identity (%)  Description
A0A0A0LY89      6.6e-292  99.58         CAP10 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G531170 PE=4 ...
A0A1S3C5H2      3.1e-281  95.97         O-glucosyltransferase rumi homolog OS=Cucumis melo OX=3656 GN=LOC103497080 PE=4 ...
A0A6J1HK39      7.7e-248  83.86         O-glucosyltransferase rumi homolog OS=Cucurbita moschata OX=3662 GN=LOC111464838...
A0A6J1HZ33      5.5e-246  83.44         O-glucosyltransferase rumi homolog OS=Cucurbita maxima OX=3661 GN=LOC111469414 P...
A0A6J1DBK2      9.7e-243  82.11         O-glucosyltransferase rumi OS=Momordica charantia OX=3673 GN=LOC111018978 PE=4 S...

Match Name      E-value   Identity (%)  Description
AT1G07220.1     6.3e-194  65.26         Arabidopsis thaliana protein of unknown function (DUF821)
AT5G23850.1     2.9e-114  42.89         Arabidopsis thaliana protein of unknown function (DUF821)
AT3G48980.1     1.6e-109  46.72         Arabidopsis thaliana protein of unknown function (DUF821)
AT1G63420.1     3.7e-109  48.53         Arabidopsis thaliana protein of unknown function (DUF821)
AT2G45830.1     3.4e-107  46.34         downstream target of AGL15 2
InterPro
Analysis Name: InterPro Annotations of Cucumber (Chinese Long) v3
Date Performed: 2021-10-25
IPR Term   IPR Description                    Source   Source Term      Source Description                         Alignment
IPR006598  Glycosyl transferase CAP10 domain  SMART    SM00672          cap10                                      coord: 172..415; e-value: 2.0E-123; score: 426.0
IPR006598  Glycosyl transferase CAP10 domain  PFAM     PF05686          Glyco_transf_90                            coord: 97..457; e-value: 2.4E-147; score: 491.0
None       No IPR available                   PANTHER  PTHR12203:SF100  BNAC05G05020D PROTEIN                      coord: 5..471
None       No IPR available                   PANTHER  PTHR12203        KDEL LYS-ASP-GLU-LEU CONTAINING - RELATED  coord: 5..471
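
The `coord` ranges in the InterPro annotations can be compared directly to see how the domain hits relate to one another. The short Python sketch below is illustrative only; it assumes the ranges are 1-based and inclusive (the usual convention for protein coordinates, but not stated on this page), and the helper names `parse_coord` and `span_length` are hypothetical.

```python
import re


def parse_coord(text):
    """Extract (start, end) from a string such as 'coord: 172..415'."""
    match = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"no coord range found in {text!r}")
    return int(match.group(1)), int(match.group(2))


def span_length(start, end):
    """Length of a range, assuming 1-based inclusive coordinates."""
    return end - start + 1


smart = parse_coord("coord: 172..415")  # SMART SM00672 (cap10)
pfam = parse_coord("coord: 97..457")    # Pfam PF05686 (Glyco_transf_90)

smart_len = span_length(*smart)
pfam_len = span_length(*pfam)
# True when the SMART hit lies entirely inside the Pfam hit.
nested = pfam[0] <= smart[0] and smart[1] <= pfam[1]

print(smart_len, pfam_len, nested)
```

Under that assumption the SMART cap10 hit falls entirely within the longer Pfam Glyco_transf_90 hit, consistent with both being assigned to the same InterPro entry (IPR006598).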

Relationships

This mRNA is a part of the following gene feature(s):

Feature Name    Unique Name     Type
CsaV3_1G036190  CsaV3_1G036190  gene


The following exon feature(s) are a part of this mRNA:

Feature Name            Unique Name             Type
CsaV3_1G036190.1.exon1  CsaV3_1G036190.1.exon1  exon
CsaV3_1G036190.1.exon2  CsaV3_1G036190.1.exon2  exon
CsaV3_1G036190.1.exon3  CsaV3_1G036190.1.exon3  exon
CsaV3_1G036190.1.exon4  CsaV3_1G036190.1.exon4  exon
CsaV3_1G036190.1.exon5  CsaV3_1G036190.1.exon5  exon
CsaV3_1G036190.1.exon6  CsaV3_1G036190.1.exon6  exon


The following CDS feature(s) are a part of this mRNA:

Feature Name           Unique Name            Type
CsaV3_1G036190.1.cds1  CsaV3_1G036190.1.cds1  CDS
CsaV3_1G036190.1.cds2  CsaV3_1G036190.1.cds2  CDS
CsaV3_1G036190.1.cds3  CsaV3_1G036190.1.cds3  CDS
CsaV3_1G036190.1.cds4  CsaV3_1G036190.1.cds4  CDS
CsaV3_1G036190.1.cds5  CsaV3_1G036190.1.cds5  CDS
CsaV3_1G036190.1.cds6  CsaV3_1G036190.1.cds6  CDS


The following polypeptide feature(s) derive from this mRNA:

Feature Name      Unique Name               Type
CsaV3_1G036190.1  CsaV3_1G036190.1-protein  polypeptide