Cp4.1LG04g10080 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG04g10080
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: Exostosin family protein
Location: Cp4.1LG04: 9599728 .. 9605530 (-)
RNA-Seq Expression: Cp4.1LG04g10080
Synteny: Cp4.1LG04g10080
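
The Location value in the overview packs a sequence ID, a coordinate range, and a strand into one string. The sketch below shows one way to split it and derive the genomic span. The function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text):
    """Split a location string such as 'Cp4.1LG04: 9599728 .. 9605530 (-)'.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return {
        "seqid": match.group(1),
        "start": start,
        "end": end,
        "strand": match.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG04: 9599728 .. 9605530 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the feature spans 5803 bp on the minus strand of Cp4.1LG04.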
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CGGACTCGACCACACAGTTCTCAGAAGTCTGAATCACTTCCAAAACAAGCGATGAAGATGATCACCGTTGTACGCACGGTACCGACGAAGCTTGAAAGTTGAAATGCTGCTTACGTAGCTCATAGAATTGAAGAGGCTACAGCTTCCGAGGTAAAGAAAGCTCGAGCTTAGCTGAACATCCAACTTTCTTTTTGATTTCGCAAACAGTTCGTTCTCTTCTGTTTTGGTCGGAGTAACACTGAGTTTCGAATTCTCTGTCCGTTTGATGTTTTAGCTGCAATTTTGAATGTTTGTGATTTTCATCTCATTCCCCTGCGATCGATAAGGTTTCTGAGAAATGTCTTCTCAAAATTGGTGCTAATTGCTGGTAACGACTGAATTAGCTCATAATTTTTCGCCATGATTTAGCTGTCTATTGTCAACCGTAGTTCATAAGCTAATTCAGAGTTGTTTGATAATTGTGGGGTTTTTTCTTCACGCATTGGTATCCAGTTGCTGCAAGGTGCATGATGATCGTATGCGACTGCTCCAATTTGATGTGGTTTTTTAAAATTGGGAGTCCAATGTGATCCCGAGCTTCTTCAAGTTATTGGAATATTTTATAGATATTGGTTGCTTCCCACGGTTTCTTATATGGCTATTCATGTTTGTACAAACTTGTTTCATGGTATCAAAATCCGGAGTCTGCTTATTATGATAGCCATCATAATTTCAATTCTCATTGTTTCCCAGTGCTACGTTTATCCTTATGCGAAAAAATCTTTCCTACCACTTGATGTTAAGAGCTCAGACATTATGAGTCTTCAAAATATCACTAGTTTAAACCATTCAGAAGTTCATTTCCTGTATACTGTCACTCATGTGAAAAATAGGAAGGAAAGAACTGAGTACATTACTGAAAAGAAGGGAGAAAGAGGATTTGGTTTGACGTTGGATGCTGCTAATAGCATGCCATATGAGAATGGTACACCATTTGAAGAGACTTCGGCAATGCCAGATGGAAATTCTACTGTTGATAATGACATTGGGAGTGGGACGGTAGAGTTTGGTTATAATCCCCCCATGAAGGAAAAAATTTTAGACAACAGTTACAAGAGAGTTGTTGAAGGTGAAGACAGCAGCAATCTAAATATGAGTAAAATGAGAAACCATATTTCTTTTGTCTCAAATCAATCTCAAGAGTTAATTGTAGATCCAAGAAAATCTGACTTGTCTTCTGCTCAAAACACATCTTCCATTCCAGAAGACCGTTTCGGTAGAACCGAGGAAATAGTTACGAAGGATACAAGGTCTGAGCAAGGGAAGAATGTTTCCGATACCTTGGATGGACTTGCACGGTATGACATATCGACTTTGAAGAGTCCTGAGATGCCATCAATATCAATATCTCAAATGAACGCATTGTTGTCTCTAAGTCATACTTCTCCTTGTTCGAAGGTATGGATTTCAACTTAAGGTTCCAATTAGATCATTGACAAAAACTCGACTCTAACACTTGCACTGCTGATTTTACTTGGTAGTAGAAGCCACAGTGTCGTTTGTCTTCTTCACGTGATCGTGAACTTCTACATGCGAGACTGGAGATTGAGAAAGCCACTGCTGCTGTGAACAGCCCAGGAATTATTTCTGTTTTCCGAGATGTTTCTATGTTCAAGAGGTAATATTTTCTAAACATTGTTTGGTTCATACTGCCTCGACATGTAGCAGTGGCCTGGCATATAGTCTATCCCATCAGCTATCATCTTCGTTGATCCTGTGTCTTCAATGTCAATTAGCTTATTGTGAAAATTTGGATACTGTGACCAGTACCAACATTAACCAGGATGAAGCGTATATTGCGTGTTCCCTTTGCAGGAGTTATGACTTGATGGAAAAAACGCTTAAAGTTTATATCTACAAGGAAGGAGAAAAGCCTATTTTCCATCAACCTCGGATGAGAGGGATATATGCCTCAGAAGGATGGTTTATGAAATTGATGAAAGAGAATAAAAAATTTGTTG
CAAAGAATCCCAAGAAGGCACACTTGTTCTATTTACCTTTCAGTTCGCAGTTACTAAGGAGTGCACTTTCTGAACAAAATTCCCAAGGTCGAAAGAACCTAGAGGAACGTCTAGGGAACTATGTCAACTTAATTAGGAGAAACCACCAATTCTGGAACAGAACTGGAGGTGCTGATCATTTTCTTGTTGCTTGTCACGACTGGGTATGTATTCTATACCGAATTTAGATTTATCGACAATGTCTTGTTCGAGTATTTTGTTGCTTATCTTGGAAACTCTTCTCATGTTATCCTTTGAAGTAGCTGAGGCTTGTTTGCTTTGTCTTCTGTGGACATTGTTCCGTGTGTCCGAAACCCTGACTTGAGCATAAATAGTTTAGTTACAATTTGTTGTCATCTTAAATGTATTGTTGGAGTCTCTTTCCTGGTAGTCAAGTTATCATTTATTTTCATTACATTCAATTTCAGGCCTCCAAACTCACAAGGAAGTATATGAAGAGCTGCATCAGAGCTCTCTGCAATGCAAACGCTGCTAGAGGCTTTCAAATTGGGAAGGACACTAGCTTACCAGTTACAAATATACATTTGACAAAGGACCCTGATATAACTACTGGAGCAAAACCTCCTTCAGACAGGACTACATTAGCCTTCTTTGCTGGGGGTATGCACGGTTATCTCCGACCAATACTGCTTCATTACTGGGAAAATAAAGAACCTGACATGAAAATTTTTGGCCCAATGCCACGCGATGCTGAAGGGAAAAGAATCTATAGGGAGCACATGAAAAATAGTAAGTACTGCATATGTGCGAGGGGATATGAAGTTCATACTCCTCGAGTGGTTGAGGCCATTCTAAACGCATGTGTTCCAGTTTTCCTATCAGATAATTACGTGCCTCCTTTCTTTGAGGTATTAAACTGGGAATCATTCTCAGTATTTGTTCAAGAGAAAGAGATCTCTAATTTGAGAAATATTCTGCTCTCAATTCCTGAGAAGGACTACCTTGTCATGCATGCAAGACTGAAAATAGTTCAAAAGCATTTCATTTGGAACAAAATTCCGGTAAAGTATGATTTATTTCATATGATCCTTCACTCAGTATGGTATACTCGAGTTTTTCAGATGAAAACCAGTTGATTTAGGTAGCATCGGATTCTGGTGTCGGAAAGCCACAAATGTCGAATGAATAAGAAGACTGTGAAGGGGAAGTAAAAGGATGTAGTGTAAATATGCCGAGCTTCCATAACATCGTACACCCTCCAGATAGAGATGAAGCTTAGCTTGAATCTCTCTAATCATATGGTGGAATTCTTGAGGGATATATTTGCAAATGTTTCCATCTTGTAAATGAGTTCGATGACTCATCAGTTTCATATTCCATACCAAGTTTTGAGTTCCCCTTAGAGCAGTTTGTGAAACCAAATGATACCGTCACTTAAGGGAACGAGTTAGAAGATTTTGGTGCAATGGATTGGATTTTGGTGAGTTAGAAGAATTTAAAAACTTATCATTGCGAGAGGATATAGATATGACTTTCAATTCTTCAACTGCCATGCTACCGATCCCAGCTTCACCTGTTAACACAGTTCATTCAGAGAACTTGATATCAAATATAAGCTCATCAGTCTCAGAAACCGATTCTAAAAGCACAACTAAACGGAAGAAGATGAAGAGTGAAATGCCACCAAAGTCCATAACTTCACTAGAAGAGACGAACAGTATGTTTGTGTTAGTTTTGAATTGTTCGTTTGATGATCTTATATTATGCTGGGTTTTAATCTTGTTCTTGCTTTGCGTGGAGACCAATGAGTTCCTCTTTACGTGATCAGGAAAATTATTCTGCCAAGTCCTATATGTGCGTAATTCTCATAACAGGACAAATTACGTCAATTTTTTGAAGGAATACTCGGAAAATATTGCAGCCAAATATCCATACTGGAATAGAACTGGTGGAGCAGATCATTTTCTTGTTGCATGCCATGATTGAGCGCTGCTTAAAAG
CACTTTGCAATACTGATGTAACAGTTGCCTTCAAAATTGGATGAGATATGTCTCTTCCAGAAACTTATGTAGCGAGAAATCTTCTTAGAGATATTGGGGGAAAACCTGCTTCACAAAGACATGTTCATGCCTTTTATGGTGGGAATATACATGGTTATGTCTTAACCCTGATATGAAGATCTTTGGTCCAATGCCTCATGGTATTGCAAGCAAAATGAGTTACATTCAGTATAAGAAGAGCAGCAAATACTGCATCTGCCCAAAGGGTTACGAGCTCAATAGTCCACGGGTTGTTGACGCAATCTTTTATGAGTGTGTACCTGTGATCATATCAGACAATTTCGTGCCACCATTTTTTGAGGTGTTGAATTGGGAAGCATTTTCAGTGATTGTTGCAGAAAAGGATATTCCCTACTTACAACAACATACTGCTCTTTCAATACTAAATGACAGATATCTCGGATGCAACTCCGAGTCAGGAAAGTACATAAGCGCTCCCTTTGGCATGCCAAGCCCTTGAAGTATGACATTACCCTTCATTCATTCTGGTATAACATAGTGTTTCAGATAAAGATGTTGTAAGTGCTCCATTGAAGATATGAAGGAAATCCATGAAACTCTAAAATAACTGCACAGAACAAAATGACTCCTTTGCAGCTTCTCTTTGGAAAAAATCTGCTTAAACCTCGGTGGGATCATGTGAAGGACCTTGAAAATGGACTGCAACTGTGGCAGTGCTCTTGTATATAGCAGACTGGTAAAACAAAACTATGATTATTTGTCAATATTGCAGTAGAAATCCTGATTTCCTTTCAGCAACCTCTGCATCTGGCTGTGAGACTTCAGGTTTCAAATCATACTGCCCTGTACCTCGTCGACGTAGGAAGCCATTTTCCAGGACCACAATGGAAGTTATTACTCGTTCTTTGGGTCATCATCAACCACATCCATCCAAAGCAATAAATGTTTTCACTCACTGGGTCCATCCAGACCAATTGAAGAGAGACCCACATTCAAGAACCGTATCTCTGAGATCGGAATCAGATACTCAGGCATATATGATCTCTAATATGGCACCACAACACGAAGCTTCCTATGAAGAACAGGGGCAATGTGGAATGGGCGATGATGTTAATGAACTTGGTGGGTTGCTGTCAACAAACAAAGGGACCCTGATTATCAGATATTTAAGATCTTCTACAGGCTCTACTACTCTCATGAAACTTCAGATGATCATACAGGTACCAACAATAAACTATAATCAAAATATTAGCGATTGTGGGTCCCTGTGAATTTAGTGATCTTTGCTTCAAAATGGAAAAAAAAAATGGTGGGGATTGCAAAATTAGGTGTTCTGAATGAGAGCATATGGTTATAATGATGGCCTGATGAGGTTCTTCCTTCTGGGTCCTTCCTTCTTTTCTCATAACTCCAATAACCATAATAAATTTTGACAGATTTGAAGTGTCCCTATTTCATTAAAGGATGGAAATTTGCCAGCCTACCAGGATTAAGTTTCCCAGTCCCATATCCACCATTATTGGATAGGGTGAATTTCAAGTCCAATCAGAACTCATTTTTTACTATGTTTTCAATGATTCTGATCCAAATTTGTCGTCTTGAAGGACCATTTTGTGGTAAATAAATGCACAATTGACTGTTGAGCTACCATATCTGTTTGCTTTTTTACTTCACAGACATGCATGATCCATAAATAAAAAAGAAAAACTCAAATTTAAAGCCTTTGGCAGCTATTTGTACTCATTCCACAATAACCCTGTTGCTTTCTGGTTGTAAATAATT

mRNA sequence

CGGACTCGACCACACAGTTCTCAGAAGTCTGAATCACTTCCAAAACAAGCGATGAAGATGATCACCGTTGTACGCACGGTACCGACGAAGCTTGAAAGTTGAAATGCTGCTTACGTAGCTCATAGAATTGAAGAGGCTACAGCTTCCGAGTTGCTGCAAGGTGCATGATGATCGTATGCGACTGCTCCAATTTGATGTGGTTTTTTAAAATTGGGAGTCCAATGTGATCCCGAGCTTCTTCAAGTTATTGGAATATTTTATAGATATTGGTTGCTTCCCACGGTTTCTTATATGGCTATTCATGTTTGTACAAACTTGTTTCATGGTATCAAAATCCGGAGTCTGCTTATTATGATAGCCATCATAATTTCAATTCTCATTGTTTCCCAGTGCTACGTTTATCCTTATGCGAAAAAATCTTTCCTACCACTTGATGTTAAGAGCTCAGACATTATGAGTCTTCAAAATATCACTAGTTTAAACCATTCAGAAGTTCATTTCCTGTATACTGTCACTCATGTGAAAAATAGGAAGGAAAGAACTGAGTACATTACTGAAAAGAAGGGAGAAAGAGGATTTGGTTTGACGTTGGATGCTGCTAATAGCATGCCATATGAGAATGGTACACCATTTGAAGAGACTTCGGCAATGCCAGATGGAAATTCTACTGTTGATAATGACATTGGGAGTGGGACGGTAGAGTTTGGTTATAATCCCCCCATGAAGGAAAAAATTTTAGACAACAGTTACAAGAGAGTTGTTGAAGGTGAAGACAGCAGCAATCTAAATATGAGTAAAATGAGAAACCATATTTCTTTTGTCTCAAATCAATCTCAAGAGTTAATTGTAGATCCAAGAAAATCTGACTTGTCTTCTGCTCAAAACACATCTTCCATTCCAGAAGACCGTTTCGGTAGAACCGAGGAAATAGTTACGAAGGATACAAGGTCTGAGCAAGGGAAGAATGTTTCCGATACCTTGGATGGACTTGCACGGTATGACATATCGACTTTGAAGAGTCCTGAGATGCCATCAATATCAATATCTCAAATGAACGCATTGTTGTCTCTAAGTCATACTTCTCCTTGTTCGAAGAAGCCACAGTGTCGTTTGTCTTCTTCACGTGATCGTGAACTTCTACATGCGAGACTGGAGATTGAGAAAGCCACTGCTGCTGTGAACAGCCCAGGAATTATTTCTGTTTTCCGAGATGTTTCTATGTTCAAGAGGAGTTATGACTTGATGGAAAAAACGCTTAAAGTTTATATCTACAAGGAAGGAGAAAAGCCTATTTTCCATCAACCTCGGATGAGAGGGATATATGCCTCAGAAGGATGGTTTATGAAATTGATGAAAGAGAATAAAAAATTTGTTGCAAAGAATCCCAAGAAGGCACACTTGTTCTATTTACCTTTCAGTTCGCAGTTACTAAGGAGTGCACTTTCTGAACAAAATTCCCAAGGTCGAAAGAACCTAGAGGAACGTCTAGGGAACTATGTCAACTTAATTAGGAGAAACCACCAATTCTGGAACAGAACTGGAGGTGCTGATCATTTTCTTGTTGCTTGTCACGACTGGGCCTCCAAACTCACAAGGAAGTATATGAAGAGCTGCATCAGAGCTCTCTGCAATGCAAACGCTGCTAGAGGCTTTCAAATTGGGAAGGACACTAGCTTACCAGTTACAAATATACATTTGACAAAGGACCCTGATATAACTACTGGAGCAAAACCTCCTTCAGACAGGACTACATTAGCCTTCTTTGCTGGGGGTATGCACGGTTATCTCCGACCAATACTGCTTCATTACTGGGAAAATAAAGAACCTGACATGAAAATTTTTGGCCCAATGCCACGCGATGCTGAAGGGAAAAGAATCTATAGGGAGCACATGAAAAATAGTAAGTACTGCATATGTGCGAGGGGATATGAAGTTCATACTCCTCGAGTGGTTGAGGCCATTCTAAACGCATGTGTTCCAGTTTTCCTATCAGATAATTA
CGTGCCTCCTTTCTTTGAGGTATTAAACTGGGAATCATTCTCAGTATTTGTTCAAGAGAAAGAGATCTCTAATTTGAGAAATATTCTGCTCTCAATTCCTGAGAAGGACTACCTTGTCATGCATGCAAGACTGAAAATAGTTCAAAAGCATTTCATTTGGAACAAAATTCCGGTAAAGTATGATTTATTTCATATGATCCTTCACTCAGTATGGTATACTCGAGTTTTTCAGATGAAAACCAGTTGATTTAGGTAGCATCGGATTCTGGTGTCGGAAAGCCACAAATGTCGAATGAATAAGAAGACTGTGAAGGGGAAGTAAAAGGATGTAGTGTAAATATGCCGAGCTTCCATAACATCGTACACCCTCCAGATAGAGATGAAGCTTAGCTTGAATCTCTCTAATCATATGGTGGAATTCTTGAGGGATATATTTGCAAATGTTTCCATCTTGTAAATGAGTTCGATGACTCATCAGTTTCATATTCCATACCAAGTTTTGAGTTCCCCTTAGAGCAGTTTGTGAAACCAAATGATACCGTCACTTAAGGGAACGAGTTAGAAGATTTTGGTGCAATGGATTGGATTTTGGTGAGTTAGAAGAATTTAAAAACTTATCATTGCGAGAGGATATAGATATGACTTTCAATTCTTCAACTGCCATGCTACCGATCCCAGCTTCACCTGTTAACACAGTTCATTCAGAGAACTTGATATCAAATATAAGCTCATCAGTCTCAGAAACCGATTCTAAAAGCACAACTAAACGGAAGAAGATGAAGAGTGAAATGCCACCAAAGTCCATAACTTCACTAGAAGAGACGAACAGTATGTTTGTGTTAGTTTTGAATTGTTCGTTTGATGATCTTATATTATGCTGGGTTTTAATCTTGTTCTTGCTTTGCGTGGAGACCAATGAGTTCCTCTTTACGTGATCAGGAAAATTATTCTGCCAAGTCCTATATGTGCGTAATTCTCATAACAGGACAAATTACGTCAATTTTTTGAAGGAATACTCGGAAAATATTGCAGCCAAATATCCATACTGGAATAGAACTGGTGGAGCAGATCATTTTCTTGTTGCATGCCATGATTGAGCGCTGCTTAAAAGCACTTTGCAATACTGATGTAACAGTTGCCTTCAAAATTGGATGAGATATGTCTCTTCCAGAAACTTATGTAGCGAGAAATCTTCTTAGAGATATTGGGGGAAAACCTGCTTCACAAAGACATGTTCATGCCTTTTATGGTGGGAATATACATGGTTATGTCTTAACCCTGATATGAAGATCTTTGGTCCAATGCCTCATGGTATTGCAAGCAAAATGAGTTACATTCAGTATAAGAAGAGCAGCAAATACTGCATCTGCCCAAAGGGTTACGAGCTCAATAGTCCACGGGTTGTTGACGCAATCTTTTATGAGTGTGTACCTGTGATCATATCAGACAATTTCGTGCCACCATTTTTTGAGGTGTTGAATTGGGAAGCATTTTCAGTGATTGTTGCAGAAAAGGATATTCCCTACTTACAACAACATACTGCTCTTTCAATACTAAATGACAGATATCTCGGATGCAACTCCGAGTCAGGAAAGTACATAAGCGCTCCCTTTGGCATGCCAAGCCCTTGAAGTATGACATTACCCTTCATTCATTCTGGTATAACATAGTGTTTCAGATAAAGATGTTGTAAGTGCTCCATTGAAGATATGAAGGAAATCCATGAAACTCTAAAATAACTGCACAGAACAAAATGACTCCTTTGCAGCTTCTCTTTGGAAAAAATCTGCTTAAACCTCGGTGGGATCATGTGAAGGACCTTGAAAATGGACTGCAACTGTGGCAGTGCTCTTGTATATAGCAGACTGGTAAAACAAAACTATGATTATTTGTCAATATTGCAGTAGAAATCCTGATTTCCTTTCAGCAACCTCTGCATCTGGCTGTGAGACTTCAGGTTTCAAATCATACTGCCCTGTACCTCGTCGACGTAGGAAGCC
ATTTTCCAGGACCACAATGGAAGTTATTACTCGTTCTTTGGGTCATCATCAACCACATCCATCCAAAGCAATAAATGTTTTCACTCACTGGGTCCATCCAGACCAATTGAAGAGAGACCCACATTCAAGAACCGTATCTCTGAGATCGGAATCAGATACTCAGGCATATATGATCTCTAATATGGCACCACAACACGAAGCTTCCTATGAAGAACAGGGGCAATGTGGAATGGGCGATGATGTTAATGAACTTGGTGGGTTGCTGTCAACAAACAAAGGGACCCTGATTATCAGATATTTAAGATCTTCTACAGGCTCTACTACTCTCATGAAACTTCAGATGATCATACAGATTTGAAGTGTCCCTATTTCATTAAAGGATGGAAATTTGCCAGCCTACCAGGATTAAGTTTCCCAGTCCCATATCCACCATTATTGGATAGGGTGAATTTCAAGTCCAATCAGAACTCATTTTTTACTATGTTTTCAATGATTCTGATCCAAATTTGTCGTCTTGAAGGACCATTTTGTGGTAAATAAATGCACAATTGACTGTTGAGCTACCATATCTGTTTGCTTTTTTACTTCACAGACATGCATGATCCATAAATAAAAAAGAAAAACTCAAATTTAAAGCCTTTGGCAGCTATTTGTACTCATTCCACAATAACCCTGTTGCTTTCTGGTTGTAAATAATT

Coding sequence (CDS)

ATGGCTATTCATGTTTGTACAAACTTGTTTCATGGTATCAAAATCCGGAGTCTGCTTATTATGATAGCCATCATAATTTCAATTCTCATTGTTTCCCAGTGCTACGTTTATCCTTATGCGAAAAAATCTTTCCTACCACTTGATGTTAAGAGCTCAGACATTATGAGTCTTCAAAATATCACTAGTTTAAACCATTCAGAAGTTCATTTCCTGTATACTGTCACTCATGTGAAAAATAGGAAGGAAAGAACTGAGTACATTACTGAAAAGAAGGGAGAAAGAGGATTTGGTTTGACGTTGGATGCTGCTAATAGCATGCCATATGAGAATGGTACACCATTTGAAGAGACTTCGGCAATGCCAGATGGAAATTCTACTGTTGATAATGACATTGGGAGTGGGACGGTAGAGTTTGGTTATAATCCCCCCATGAAGGAAAAAATTTTAGACAACAGTTACAAGAGAGTTGTTGAAGGTGAAGACAGCAGCAATCTAAATATGAGTAAAATGAGAAACCATATTTCTTTTGTCTCAAATCAATCTCAAGAGTTAATTGTAGATCCAAGAAAATCTGACTTGTCTTCTGCTCAAAACACATCTTCCATTCCAGAAGACCGTTTCGGTAGAACCGAGGAAATAGTTACGAAGGATACAAGGTCTGAGCAAGGGAAGAATGTTTCCGATACCTTGGATGGACTTGCACGGTATGACATATCGACTTTGAAGAGTCCTGAGATGCCATCAATATCAATATCTCAAATGAACGCATTGTTGTCTCTAAGTCATACTTCTCCTTGTTCGAAGAAGCCACAGTGTCGTTTGTCTTCTTCACGTGATCGTGAACTTCTACATGCGAGACTGGAGATTGAGAAAGCCACTGCTGCTGTGAACAGCCCAGGAATTATTTCTGTTTTCCGAGATGTTTCTATGTTCAAGAGGAGTTATGACTTGATGGAAAAAACGCTTAAAGTTTATATCTACAAGGAAGGAGAAAAGCCTATTTTCCATCAACCTCGGATGAGAGGGATATATGCCTCAGAAGGATGGTTTATGAAATTGATGAAAGAGAATAAAAAATTTGTTGCAAAGAATCCCAAGAAGGCACACTTGTTCTATTTACCTTTCAGTTCGCAGTTACTAAGGAGTGCACTTTCTGAACAAAATTCCCAAGGTCGAAAGAACCTAGAGGAACGTCTAGGGAACTATGTCAACTTAATTAGGAGAAACCACCAATTCTGGAACAGAACTGGAGGTGCTGATCATTTTCTTGTTGCTTGTCACGACTGGGCCTCCAAACTCACAAGGAAGTATATGAAGAGCTGCATCAGAGCTCTCTGCAATGCAAACGCTGCTAGAGGCTTTCAAATTGGGAAGGACACTAGCTTACCAGTTACAAATATACATTTGACAAAGGACCCTGATATAACTACTGGAGCAAAACCTCCTTCAGACAGGACTACATTAGCCTTCTTTGCTGGGGGTATGCACGGTTATCTCCGACCAATACTGCTTCATTACTGGGAAAATAAAGAACCTGACATGAAAATTTTTGGCCCAATGCCACGCGATGCTGAAGGGAAAAGAATCTATAGGGAGCACATGAAAAATAGTAAGTACTGCATATGTGCGAGGGGATATGAAGTTCATACTCCTCGAGTGGTTGAGGCCATTCTAAACGCATGTGTTCCAGTTTTCCTATCAGATAATTACGTGCCTCCTTTCTTTGAGGTATTAAACTGGGAATCATTCTCAGTATTTGTTCAAGAGAAAGAGATCTCTAATTTGAGAAATATTCTGCTCTCAATTCCTGAGAAGGACTACCTTGTCATGCATGCAAGACTGAAAATAGTTCAAAAGCATTTCATTTGGAACAAAATTCCGGTAAAGTATGATTTATTTCATATGATCCTTCACTCAGTATGGTATACTCGAGTTTTTCAGATGAAAACCAGTTGA

Protein sequence

MAIHVCTNLFHGIKIRSLLIMIAIIISILIVSQCYVYPYAKKSFLPLDVKSSDIMSLQNITSLNHSEVHFLYTVTHVKNRKERTEYITEKKGERGFGLTLDAANSMPYENGTPFEETSAMPDGNSTVDNDIGSGTVEFGYNPPMKEKILDNSYKRVVEGEDSSNLNMSKMRNHISFVSNQSQELIVDPRKSDLSSAQNTSSIPEDRFGRTEEIVTKDTRSEQGKNVSDTLDGLARYDISTLKSPEMPSISISQMNALLSLSHTSPCSKKPQCRLSSSRDRELLHARLEIEKATAAVNSPGIISVFRDVSMFKRSYDLMEKTLKVYIYKEGEKPIFHQPRMRGIYASEGWFMKLMKENKKFVAKNPKKAHLFYLPFSSQLLRSALSEQNSQGRKNLEERLGNYVNLIRRNHQFWNRTGGADHFLVACHDWASKLTRKYMKSCIRALCNANAARGFQIGKDTSLPVTNIHLTKDPDITTGAKPPSDRTTLAFFAGGMHGYLRPILLHYWENKEPDMKIFGPMPRDAEGKRIYREHMKNSKYCICARGYEVHTPRVVEAILNACVPVFLSDNYVPPFFEVLNWESFSVFVQEKEISNLRNILLSIPEKDYLVMHARLKIVQKHFIWNKIPVKYDLFHMILHSVWYTRVFQMKTS
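
The protein sequence above corresponds to the coding sequence read codon by codon. A minimal sketch of that translation follows, assuming the standard nuclear genetic code (NCBI translation table 1); the helper name `translate` is hypothetical and is not part of any tool used to build this page.

```python
import itertools

# Standard genetic code (NCBI translation table 1), codons enumerated
# in T, C, A, G order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons containing characters other than A/C/G/T become 'X'.
    A trailing partial codon is ignored.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First nine codons and last six codons of the CDS listed above.
print(translate("ATGGCTATTCATGTTTGTACAAACTTG"))  # MAIHVCTNL
print(translate("CAGATGAAAACCAGTTGA"))           # QMKTS
```

Passing the full CDS string to `translate` and comparing the result with the listed protein is a quick consistency check on the record.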
Homology
BLAST of Cp4.1LG04g10080 vs. ExPASy Swiss-Prot
Match: Q9FFN2 (Probable glycosyltransferase At5g03795 OS=Arabidopsis thaliana OX=3702 GN=At5g03795 PE=3 SV=2)

HSP 1 Score: 308.1 bits (788), Expect = 2.3e-82
Identity = 165/411 (40.15%), Positives = 251/411 (61.07%), Query Frame = 0

Query: 248 SISISQMNALLSLSH-TSPCSKKPQCRLSSSR----DRELLHARLEIEKATA--AVNSPG 307
           +I ++ +N   + ++ +S  S +P+ R   S     + +L  AR  I+ A+    V+ P 
Sbjct: 106 TIQLNMINVTATSNNVSSTASLEPKKRRVLSNLEKIEFKLQKARASIKAASMDDPVDDPD 165

Query: 308 II---SVFRDVSMFKRSYDLMEKTLKVYIYKEGEKPIFHQPRMRGIYASEGWFMKLMKEN 367
            +    ++ +  +F RSY  MEK  K+Y+YKEGE P+FH    + IY+ EG F+  ++ +
Sbjct: 166 YVPLGPMYWNAKVFHRSYLEMEKQFKIYVYKEGEPPLFHDGPCKSIYSMEGSFIYEIETD 225

Query: 368 KKFVAKNPKKAHLFYLPFSSQLLRSALSEQNSQGRKNLEERLGNYVNLIRRNHQFWNRTG 427
            +F   NP KAH+FYLPFS   +   + E+NS+    +   + +Y+NL+   + +WNR+ 
Sbjct: 226 TRFRTNNPDKAHVFYLPFSVVKMVRYVYERNSRDFSPIRNTVKDYINLVGDKYPYWNRSI 285

Query: 428 GADHFLVACHDWASKLTRKYM---KSCIRALCNANAARGFQIGKDTSLPVTNIHLTKDPD 487
           GADHF+++CHDW  + +  +     + IRALCNAN +  F+  KD S+P  N+  T    
Sbjct: 286 GADHFILSCHDWGPEASFSHPHLGHNSIRALCNANTSERFKPRKDVSIPEINLR-TGSLT 345

Query: 488 ITTGAKPPSDRTTLAFFAGGMHGYLRPILLHYWENKEPDMKIFGPMPRDAEGKRIYREHM 547
              G   PS R  LAFFAGG+HG +RP+LL +WENK+ D+++   +PR       Y + M
Sbjct: 346 GLVGGPSPSSRPILAFFAGGVHGPVRPVLLQHWENKDNDIRVHKYLPRGTS----YSDMM 405

Query: 548 KNSKYCICARGYEVHTPRVVEAILNACVPVFLSDNYVPPFFEVLNWESFSVFVQEKEISN 607
           +NSK+CIC  GYEV +PR+VEA+ + CVPV ++  YVPPF +VLNW SFSV V  ++I N
Sbjct: 406 RNSKFCICPSGYEVASPRIVEALYSGCVPVLINSGYVPPFSDVLNWRSFSVIVSVEDIPN 465

Query: 608 LRNILLSIPEKDYLVMHARLKIVQKHFIWNKIPVKYDLFHMILHSVWYTRV 646
           L+ IL SI  + YL M+ R+  V++HF  N    ++D+FHMILHS+W  R+
Sbjct: 466 LKTILTSISPRQYLRMYRRVLKVRRHFEVNSPAKRFDVFHMILHSIWVRRL 511
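
The percentages in each HSP summary line are the match counts divided by the alignment length. The sketch below recomputes them from the raw fractions; `hsp_percentages` is a hypothetical helper, and the regular expression assumes the `Label = matches/length` layout used in this report.

```python
import re

def hsp_percentages(stat_line):
    """Return {label: percent} for every 'Label = n/m' pair in an HSP line."""
    pairs = re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", stat_line)
    return {
        label: round(100.0 * int(matches) / int(length), 2)
        for label, matches, length in pairs
    }

line = "Identity = 165/411 (40.15%), Positives = 251/411 (61.07%), Query Frame = 0"
print(hsp_percentages(line))
```

For the Q9FFN2 hit this reproduces the reported 40.15% identity and 61.07% positives over 411 aligned columns.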

BLAST of Cp4.1LG04g10080 vs. ExPASy Swiss-Prot
Match: Q9SSE8 (Probable glycosyltransferase At3g07620 OS=Arabidopsis thaliana OX=3702 GN=At3g07620 PE=3 SV=1)

HSP 1 Score: 300.1 bits (767), Expect = 6.3e-80
Identity = 147/346 (42.49%), Positives = 222/346 (64.16%), Query Frame = 0

Query: 304 VFRDVSMFKRSYDLMEKTLKVYIYKEGEKPIFHQPRMRGIYASEGWFMKLMKEN-KKFVA 363
           ++R+   F RSY LMEK  K+Y+Y+EG+ PIFH    + IY+ EG F+  M+ +  K+  
Sbjct: 125 IYRNPYAFHRSYLLMEKMFKIYVYEEGDPPIFHYGLCKDIYSMEGLFLNFMENDVLKYRT 184

Query: 364 KNPKKAHLFYLPFSSQLLRSALSEQNSQGRKNLEERLGNYVNLIRRNHQFWNRTGGADHF 423
           ++P KAH+++LPFS  ++   L +   + +  LE  + +YV +I + + +WN + G DHF
Sbjct: 185 RDPDKAHVYFLPFSVVMILHHLFDPVVRDKAVLERVIADYVQIISKKYPYWNTSDGFDHF 244

Query: 424 LVACHDWASKLT---RKYMKSCIRALCNANAARGFQIGKDTSLPVTNIHLTKDPDITTGA 483
           +++CHDW  + T   +K   + IR LCNAN +  F   KD   P  N+ LT D +  TG 
Sbjct: 245 MLSCHDWGHRATWYVKKLFFNSIRVLCNANISEYFNPEKDAPFPEINL-LTGDINNLTGG 304

Query: 484 KPPSDRTTLAFFAGGMHGYLRPILLHYWENKEPDMKIFGPMPRDAEGKRIYREHMKNSKY 543
             P  RTTLAFFAG  HG +RP+LL++W+ K+ D+ ++  +P   +    Y E M+ S++
Sbjct: 305 LDPISRTTLAFFAGKSHGKIRPVLLNHWKEKDKDILVYENLPDGLD----YTEMMRKSRF 364

Query: 544 CICARGYEVHTPRVVEAILNACVPVFLSDNYVPPFFEVLNWESFSVFVQEKEISNLRNIL 603
           CIC  G+EV +PRV EAI + CVPV +S+NYV PF +VLNWE FSV V  KEI  L+ IL
Sbjct: 365 CICPSGHEVASPRVPEAIYSGCVPVLISENYVLPFSDVLNWEKFSVSVSVKEIPELKRIL 424

Query: 604 LSIPEKDYLVMHARLKIVQKHFIWNKIPVKYDLFHMILHSVWYTRV 646
           + IPE+ Y+ ++  +K V++H + N  P +YD+F+MI+HS+W  R+
Sbjct: 425 MDIPEERYMRLYEGVKKVKRHILVNDPPKRYDVFNMIIHSIWLRRL 465

BLAST of Cp4.1LG04g10080 vs. ExPASy Swiss-Prot
Match: Q3E7Q9 (Probable glycosyltransferase At5g25310 OS=Arabidopsis thaliana OX=3702 GN=At5g25310 PE=3 SV=2)

HSP 1 Score: 284.3 bits (726), Expect = 3.6e-75
Identity = 160/414 (38.65%), Positives = 241/414 (58.21%), Query Frame = 0

Query: 251 ISQMNALLSLSHTSPCSKKPQCRLSSSRDRELLHARLEIEKATAAVNSPGIIS------V 310
           +SQ    +  ++++  SK  +    +  ++ L  AR  I +A++ VN+    S      +
Sbjct: 74  VSQQILTVRSTNSTLQSKPEKLNRRNLVEQGLAKARASILEASSNVNTTLFKSDLPNSEI 133

Query: 311 FRDVSMFKRSYDLMEKTLKVYIYKEGEKPIFHQPRMRGIYASEGWFM-KLMKENKKFVAK 370
           +R+ S   RSY  MEK  KVY+Y+EGE P+ H    + +YA EG F+ ++ K   KF   
Sbjct: 134 YRNPSALYRSYLEMEKRFKVYVYEEGEPPLVHDGPCKSVYAVEGRFITEMEKRRTKFRTY 193

Query: 371 NPKKAHLFYLPFSSQLLRSALSEQNSQGRKNLEERLGNYVNLIRRNHQFWNRTGGADHFL 430
           +P +A++++LPFS   L   L E NS   K L+  + +Y+ L+  NH FWNRT GADHF+
Sbjct: 194 DPNQAYVYFLPFSVTWLVRYLYEGNSDA-KPLKTFVSDYIRLVSTNHPFWNRTNGADHFM 253

Query: 431 VACHDW---ASKLTRKYMKSCIRALCNANAARGFQIGKDTSLPVTNIH---------LTK 490
           + CHDW    S+  R    + IR +CNAN++ GF   KD +LP   ++         L+K
Sbjct: 254 LTCHDWGPLTSQANRDLFNTSIRVMCNANSSEGFNPTKDVTLPEIKLYGGEVDHKLRLSK 313

Query: 491 DPDITTGAKPPSDRTTLAFFAGGMHGYLRPILLHYWENKEPDMKIFGPMPRDAEGKRIYR 550
               T  A P   R  L FFAGG+HG +RPILL +W+ ++ DM ++  +P+       Y 
Sbjct: 314 ----TLSASP---RPYLGFFAGGVHGPVRPILLKHWKQRDLDMPVYEYLPKHLN----YY 373

Query: 551 EHMKNSKYCICARGYEVHTPRVVEAILNACVPVFLSDNYVPPFFEVLNWESFSVFVQEKE 610
           + M++SK+C C  GYEV +PRV+EAI + C+PV LS N+V PF +VL WE+FSV V   E
Sbjct: 374 DFMRSSKFCFCPSGYEVASPRVIEAIYSECIPVILSVNFVLPFTDVLRWETFSVLVDVSE 433

Query: 611 ISNLRNILLSIPEKDYLVMHARLKIVQKHFIWNKIPVKYDLFHMILHSVWYTRV 646
           I  L+ IL+SI  + Y  + + L+ V++HF  N  P ++D FH+ LHS+W  R+
Sbjct: 434 IPRLKEILMSISNEKYEWLKSNLRYVRRHFELNDPPQRFDAFHLTLHSIWLRRL 475

BLAST of Cp4.1LG04g10080 vs. ExPASy Swiss-Prot
Match: Q9LFP3 (Probable glycosyltransferase At5g11130 OS=Arabidopsis thaliana OX=3702 GN=At5g11130/At5g11120 PE=3 SV=2)

HSP 1 Score: 278.9 bits (712), Expect = 1.5e-73
Identity = 139/348 (39.94%), Positives = 220/348 (63.22%), Query Frame = 0

Query: 303 SVFRDVSMFKRSYDLMEKTLKVYIYKEGEKPIFHQPRMRGIYASEGWFM-KLMKENKKFV 362
           SV+ +   F +S+  MEK  K++ Y+EGE P+FH+  +  IYA EG FM ++   N +F 
Sbjct: 130 SVYLNAFTFHQSHKEMEKRFKIWTYREGEAPLFHKGPLNNIYAIEGQFMDEIENGNSRFK 189

Query: 363 AKNPKKAHLFYLPFS-SQLLRSALSEQNSQGRKNLEERLGNYVNLIRRNHQFWNRTGGAD 422
           A +P++A +FY+P     ++R       S  R  L+  + +Y++LI   + +WNR+ GAD
Sbjct: 190 AASPEEATVFYIPVGIVNIIRFVYRPYTSYARDRLQNIVKDYISLISNRYPYWNRSRGAD 249

Query: 423 HFLVACHDWA---SKLTRKYMKSCIRALCNANAARGFQIGKDTSLPVTNIHLTKDPDITT 482
           HF ++CHDWA   S +  +  K  IRALCNAN++ GF   +D SLP  NI  ++   + T
Sbjct: 250 HFFLSCHDWAPDVSAVDPELYKHFIRALCNANSSEGFTPMRDVSLPEINIPHSQLGFVHT 309

Query: 483 GAKPPSDRTTLAFFAGGMHGYLRPILLHYWENKEPDMKIFGPMPRDAEGKRIYREHMKNS 542
           G +PP +R  LAFFAGG HG +R IL  +W+ K+ D+ ++  +P+       Y + M  +
Sbjct: 310 G-EPPQNRKLLAFFAGGSHGDVRKILFQHWKEKDKDVLVYENLPKTMN----YTKMMDKA 369

Query: 543 KYCICARGYEVHTPRVVEAILNACVPVFLSDNYVPPFFEVLNWESFSVFVQEKEISNLRN 602
           K+C+C  G+EV +PR+VE++ + CVPV ++D YV PF +VLNW++FSV +   ++ +++ 
Sbjct: 370 KFCLCPSGWEVASPRIVESLYSGCVPVIIADYYVLPFSDVLNWKTFSVHIPISKMPDIKK 429

Query: 603 ILLSIPEKDYLVMHARLKIVQKHFIWNKIPVKYDLFHMILHSVWYTRV 646
           IL +I E++YL M  R+  V+KHF+ N+    YD+ HMI+HS+W  R+
Sbjct: 430 ILEAITEEEYLNMQRRVLEVRKHFVINRPSKPYDMLHMIMHSIWLRRL 472

BLAST of Cp4.1LG04g10080 vs. ExPASy Swiss-Prot
Match: Q3EAR7 (Probable glycosyltransferase At3g42180 OS=Arabidopsis thaliana OX=3702 GN=At3g42180 PE=2 SV=2)

HSP 1 Score: 264.2 bits (674), Expect = 3.8e-69
Identity = 154/417 (36.93%), Positives = 233/417 (55.88%), Query Frame = 0

Query: 255 NALLSLSHTS-----PCSKKPQCRLSSSRDRELLHARLEIEKATAAVN---SPGIIS--- 314
           NAL S S +S     P + K +  L   R+ EL  AR  I +A    N   +  +I+   
Sbjct: 54  NALQSSSSSSSLYSPPITVKRRSNL-EKREEELRKARAAIRRAVRFKNCTSNEEVITYIP 113

Query: 315 ---VFRDVSMFKRSYDLMEKTLKVYIYKEGEKPIFHQPRMRGIYASEGWFMKLMK----- 374
              ++R+   F +S+  M KT KV+ YKEGE+P+ H   +  IY  EG F+  +      
Sbjct: 114 TGQIYRNSFAFHQSHIEMMKTFKVWSYKEGEQPLVHDGPVNDIYGIEGQFIDELSYVMGG 173

Query: 375 ENKKFVAKNPKKAHLFYLPFS----SQLLRSALSEQNSQGRKNLEERLGNYVNLIRRNHQ 434
            + +F A  P++AH F+LPFS       +   ++      R  L     +YV+++   H 
Sbjct: 174 PSGRFRASRPEEAHAFFLPFSVANIVHYVYQPITSPADFNRARLHRIFNDYVDVVAHKHP 233

Query: 435 FWNRTGGADHFLVACHDWASKL---TRKYMKSCIRALCNANAARGFQIGKDTSLPVTNIH 494
           FWN++ GADHF+V+CHDWA  +     ++ K+ +R LCNAN + GF+   D S+P  NI 
Sbjct: 234 FWNQSNGADHFMVSCHDWAPDVPDSKPEFFKNFMRGLCNANTSEGFRRNIDFSIPEINIP 293

Query: 495 LTKDPDITTGAKPPSDRTTLAFFAGGMHGYLRPILLHYWENKEPDMKIFGPMPRDAEGKR 554
             K      G + P +RT LAFFAG  HGY+R +L  +W+ K+ D++++  + +      
Sbjct: 294 KRKLKPPFMG-QNPENRTILAFFAGRAHGYIREVLFSHWKGKDKDVQVYDHLTKGQN--- 353

Query: 555 IYREHMKNSKYCICARGYEVHTPRVVEAILNACVPVFLSDNYVPPFFEVLNWESFSVFVQ 614
            Y E + +SK+C+C  GYEV +PR VEAI + CVPV +SDNY  PF +VL+W  FSV + 
Sbjct: 354 -YHELIGHSKFCLCPSGYEVASPREVEAIYSGCVPVVISDNYSLPFNDVLDWSKFSVEIP 413

Query: 615 EKEISNLRNILLSIPEKDYLVMHARLKIVQKHFIWNKIPVKYDLFHMILHSVWYTRV 646
             +I +++ IL  IP   YL M+  +  V++HF+ N+    +D+ HMILHSVW  R+
Sbjct: 414 VDKIPDIKKILQEIPHDKYLRMYRNVMKVRRHFVVNRPAQPFDVIHMILHSVWLRRL 464

BLAST of Cp4.1LG04g10080 vs. NCBI nr
Match: XP_023531315.1 (probable glycosyltransferase At3g07620 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1304 bits (3374), Expect = 0.0
Identity = 651/651 (100.00%), Positives = 651/651 (100.00%), Query Frame = 0

Query: 1   MAIHVCTNLFHGIKIRSLLIMIAIIISILIVSQCYVYPYAKKSFLPLDVKSSDIMSLQNI 60
           MAIHVCTNLFHGIKIRSLLIMIAIIISILIVSQCYVYPYAKKSFLPLDVKSSDIMSLQNI
Sbjct: 1   MAIHVCTNLFHGIKIRSLLIMIAIIISILIVSQCYVYPYAKKSFLPLDVKSSDIMSLQNI 60

Query: 61  TSLNHSEVHFLYTVTHVKNRKERTEYITEKKGERGFGLTLDAANSMPYENGTPFEETSAM 120
           TSLNHSEVHFLYTVTHVKNRKERTEYITEKKGERGFGLTLDAANSMPYENGTPFEETSAM
Sbjct: 61  TSLNHSEVHFLYTVTHVKNRKERTEYITEKKGERGFGLTLDAANSMPYENGTPFEETSAM 120

Query: 121 PDGNSTVDNDIGSGTVEFGYNPPMKEKILDNSYKRVVEGEDSSNLNMSKMRNHISFVSNQ 180
           PDGNSTVDNDIGSGTVEFGYNPPMKEKILDNSYKRVVEGEDSSNLNMSKMRNHISFVSNQ
Sbjct: 121 PDGNSTVDNDIGSGTVEFGYNPPMKEKILDNSYKRVVEGEDSSNLNMSKMRNHISFVSNQ 180

Query: 181 SQELIVDPRKSDLSSAQNTSSIPEDRFGRTEEIVTKDTRSEQGKNVSDTLDGLARYDIST 240
           SQELIVDPRKSDLSSAQNTSSIPEDRFGRTEEIVTKDTRSEQGKNVSDTLDGLARYDIST
Sbjct: 181 SQELIVDPRKSDLSSAQNTSSIPEDRFGRTEEIVTKDTRSEQGKNVSDTLDGLARYDIST 240

Query: 241 LKSPEMPSISISQMNALLSLSHTSPCSKKPQCRLSSSRDRELLHARLEIEKATAAVNSPG 300
           LKSPEMPSISISQMNALLSLSHTSPCSKKPQCRLSSSRDRELLHARLEIEKATAAVNSPG
Sbjct: 241 LKSPEMPSISISQMNALLSLSHTSPCSKKPQCRLSSSRDRELLHARLEIEKATAAVNSPG 300

Query: 301 IISVFRDVSMFKRSYDLMEKTLKVYIYKEGEKPIFHQPRMRGIYASEGWFMKLMKENKKF 360
           IISVFRDVSMFKRSYDLMEKTLKVYIYKEGEKPIFHQPRMRGIYASEGWFMKLMKENKKF
Sbjct: 301 IISVFRDVSMFKRSYDLMEKTLKVYIYKEGEKPIFHQPRMRGIYASEGWFMKLMKENKKF 360

Query: 361 VAKNPKKAHLFYLPFSSQLLRSALSEQNSQGRKNLEERLGNYVNLIRRNHQFWNRTGGAD 420
           VAKNPKKAHLFYLPFSSQLLRSALSEQNSQGRKNLEERLGNYVNLIRRNHQFWNRTGGAD
Sbjct: 361 VAKNPKKAHLFYLPFSSQLLRSALSEQNSQGRKNLEERLGNYVNLIRRNHQFWNRTGGAD 420

Query: 421 HFLVACHDWASKLTRKYMKSCIRALCNANAARGFQIGKDTSLPVTNIHLTKDPDITTGAK 480
           HFLVACHDWASKLTRKYMKSCIRALCNANAARGFQIGKDTSLPVTNIHLTKDPDITTGAK
Sbjct: 421 HFLVACHDWASKLTRKYMKSCIRALCNANAARGFQIGKDTSLPVTNIHLTKDPDITTGAK 480

Query: 481 PPSDRTTLAFFAGGMHGYLRPILLHYWENKEPDMKIFGPMPRDAEGKRIYREHMKNSKYC 540
           PPSDRTTLAFFAGGMHGYLRPILLHYWENKEPDMKIFGPMPRDAEGKRIYREHMKNSKYC
Sbjct: 481 PPSDRTTLAFFAGGMHGYLRPILLHYWENKEPDMKIFGPMPRDAEGKRIYREHMKNSKYC 540

Query: 541 ICARGYEVHTPRVVEAILNACVPVFLSDNYVPPFFEVLNWESFSVFVQEKEISNLRNILL 600
           ICARGYEVHTPRVVEAILNACVPVFLSDNYVPPFFEVLNWESFSVFVQEKEISNLRNILL
Sbjct: 541 ICARGYEVHTPRVVEAILNACVPVFLSDNYVPPFFEVLNWESFSVFVQEKEISNLRNILL 600

Query: 601 SIPEKDYLVMHARLKIVQKHFIWNKIPVKYDLFHMILHSVWYTRVFQMKTS 651
           SIPEKDYLVMHARLKIVQKHFIWNKIPVKYDLFHMILHSVWYTRVFQMKTS
Sbjct: 601 SIPEKDYLVMHARLKIVQKHFIWNKIPVKYDLFHMILHSVWYTRVFQMKTS 651

BLAST of Cp4.1LG04g10080 vs. NCBI nr
Match: XP_022933600.1 (probable glycosyltransferase At3g07620 [Cucurbita moschata])

HSP 1 Score: 1279 bits (3310), Expect = 0.0
Identity = 637/651 (97.85%), Positives = 644/651 (98.92%), Query Frame = 0

Query: 1   MAIHVCTNLFHGIKIRSLLIMIAIIISILIVSQCYVYPYAKKSFLPLDVKSSDIMSLQNI 60
           MAIH+CTNLFHGIKIR LLIMIAIIISILIVSQCYVYPYAKKSFLPLDVKSSDIMSLQNI
Sbjct: 1   MAIHICTNLFHGIKIRRLLIMIAIIISILIVSQCYVYPYAKKSFLPLDVKSSDIMSLQNI 60

Query: 61  TSLNHSEVHFLYTVTHVKNRKERTEYITEKKGERGFGLTLDAANSMPYENGTPFEETSAM 120
           TSLNHSEVHFLYTVTHVKNRKERTEYITEKKGERGFGLTLDAANSMPYENGTPFEETSAM
Sbjct: 61  TSLNHSEVHFLYTVTHVKNRKERTEYITEKKGERGFGLTLDAANSMPYENGTPFEETSAM 120

Query: 121 PDGNSTVDNDIGSGTVEFGYNPPMKEKILDNSYKRVVEGEDSSNLNMSKMRNHISFVSNQ 180
           PDGNSTVDNDIGSGTVEFGYNPP+KEKILDNSYKRVVEGEDSSNLN SKMRNHISFVSNQ
Sbjct: 121 PDGNSTVDNDIGSGTVEFGYNPPIKEKILDNSYKRVVEGEDSSNLNTSKMRNHISFVSNQ 180

Query: 181 SQELIVDPRKSDLSSAQNTSSIPEDRFGRTEEIVTKDTRSEQGKNVSDTLDGLARYDIST 240
           SQELIVDPRKSDLSSAQNTSSIPEDRFGRTEEIVTKDTRSEQ KNV DTLDGLARYDIST
Sbjct: 181 SQELIVDPRKSDLSSAQNTSSIPEDRFGRTEEIVTKDTRSEQAKNVFDTLDGLARYDIST 240

Query: 241 LKSPEMPSISISQMNALLSLSHTSPCSKKPQCRLSSSRDRELLHARLEIEKATAAVNSPG 300
           LKSPEMP ISISQMNALLSLSHTSPCSKKPQCRLSSSRDRELLHARLEIEKATA VNSPG
Sbjct: 241 LKSPEMPPISISQMNALLSLSHTSPCSKKPQCRLSSSRDRELLHARLEIEKATAVVNSPG 300

Query: 301 IISVFRDVSMFKRSYDLMEKTLKVYIYKEGEKPIFHQPRMRGIYASEGWFMKLMKENKKF 360
           IISVFR+VSMFKRSYDLMEKTLKVYIYKEGEKPIFHQPRMRGIYASEGWFMKLMKENKKF
Sbjct: 301 IISVFRNVSMFKRSYDLMEKTLKVYIYKEGEKPIFHQPRMRGIYASEGWFMKLMKENKKF 360

Query: 361 VAKNPKKAHLFYLPFSSQLLRSALSEQNSQGRKNLEERLGNYVNLIRRNHQFWNRTGGAD 420
           VAKNPKKAHLFYLPFSSQLLRSALSEQNSQGRKNLEERLGNYVNLIRRNHQFWNRTGGAD
Sbjct: 361 VAKNPKKAHLFYLPFSSQLLRSALSEQNSQGRKNLEERLGNYVNLIRRNHQFWNRTGGAD 420

Query: 421 HFLVACHDWASKLTRKYMKSCIRALCNANAARGFQIGKDTSLPVTNIHLTKDPDITTGAK 480
           HFLVACHDWASKLTRKYMK+CIRALCNANAARGFQIGKDTSLPVTNIHLTKDPDITTGAK
Sbjct: 421 HFLVACHDWASKLTRKYMKNCIRALCNANAARGFQIGKDTSLPVTNIHLTKDPDITTGAK 480

Query: 481 PPSDRTTLAFFAGGMHGYLRPILLHYWENKEPDMKIFGPMPRDAEGKRIYREHMKNSKYC 540
           PPSDRTTLAFFAGGMHGYLRPILLHYWENKEPDMKIFGPM RDAEGKRIYREHMKNSKYC
Sbjct: 481 PPSDRTTLAFFAGGMHGYLRPILLHYWENKEPDMKIFGPMARDAEGKRIYREHMKNSKYC 540

Query: 541 ICARGYEVHTPRVVEAILNACVPVFLSDNYVPPFFEVLNWESFSVFVQEKEISNLRNILL 600
           ICARGYEVHTPRVVEAILNACVPVFLSDNYVPPFFEVLNWESFSVFVQEKEISNLRNILL
Sbjct: 541 ICARGYEVHTPRVVEAILNACVPVFLSDNYVPPFFEVLNWESFSVFVQEKEISNLRNILL 600

Query: 601 SIPEKDYLVMHARLKIVQKHFIWNKIPVKYDLFHMILHSVWYTRVFQMKTS 651
           SIPE+DYLVMHARLKIVQKHFIWNKIPVKYDLFHMILHSVWYTRVFQM+T+
Sbjct: 601 SIPEEDYLVMHARLKIVQKHFIWNKIPVKYDLFHMILHSVWYTRVFQMQTN 651

BLAST of Cp4.1LG04g10080 vs. NCBI nr
Match: KAG6587987.1 (putative glycosyltransferase, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1274 bits (3297), Expect = 0.0
Identity = 633/651 (97.24%), Positives = 644/651 (98.92%), Query Frame = 0

Query: 1   MAIHVCTNLFHGIKIRSLLIMIAIIISILIVSQCYVYPYAKKSFLPLDVKSSDIMSLQNI 60
           MAIH+CTNLFHGIKIR LL+MIAIIISILIVSQ YVYPYAKKSFLPLDVKSSDIMSLQN+
Sbjct: 1   MAIHICTNLFHGIKIRRLLVMIAIIISILIVSQSYVYPYAKKSFLPLDVKSSDIMSLQNV 60

Query: 61  TSLNHSEVHFLYTVTHVKNRKERTEYITEKKGERGFGLTLDAANSMPYENGTPFEETSAM 120
           TSLNHSEVHFLYTVTHVKNRKERTEYITEKKGERGFGLTLDAANSMPYENGTPFEETSAM
Sbjct: 61  TSLNHSEVHFLYTVTHVKNRKERTEYITEKKGERGFGLTLDAANSMPYENGTPFEETSAM 120

Query: 121 PDGNSTVDNDIGSGTVEFGYNPPMKEKILDNSYKRVVEGEDSSNLNMSKMRNHISFVSNQ 180
           PDGNS+VDNDIGSGTVEFGYNPPMKEKILDNSYKRVVE EDSSNLNMSKMRNHISFVSNQ
Sbjct: 121 PDGNSSVDNDIGSGTVEFGYNPPMKEKILDNSYKRVVEDEDSSNLNMSKMRNHISFVSNQ 180

Query: 181 SQELIVDPRKSDLSSAQNTSSIPEDRFGRTEEIVTKDTRSEQGKNVSDTLDGLARYDIST 240
           SQELIVDPRKSDLSSAQNTSSIPEDRFGR+EEIVTKDTRSEQGKNVS+TLDGLARYDIST
Sbjct: 181 SQELIVDPRKSDLSSAQNTSSIPEDRFGRSEEIVTKDTRSEQGKNVSNTLDGLARYDIST 240

Query: 241 LKSPEMPSISISQMNALLSLSHTSPCSKKPQCRLSSSRDRELLHARLEIEKATAAVNSPG 300
           LKSPEMP ISISQMNALLSLSHTSPCSKKPQCR SSSRDRELLHARLEIEKATA VNSPG
Sbjct: 241 LKSPEMPPISISQMNALLSLSHTSPCSKKPQCRFSSSRDRELLHARLEIEKATAVVNSPG 300

Query: 301 IISVFRDVSMFKRSYDLMEKTLKVYIYKEGEKPIFHQPRMRGIYASEGWFMKLMKENKKF 360
           IISVFR+VSMFKRSYDLMEKTLKVYIYKEGEKPIFHQPRMRGIYASEGWFMKLMKENKKF
Sbjct: 301 IISVFRNVSMFKRSYDLMEKTLKVYIYKEGEKPIFHQPRMRGIYASEGWFMKLMKENKKF 360

Query: 361 VAKNPKKAHLFYLPFSSQLLRSALSEQNSQGRKNLEERLGNYVNLIRRNHQFWNRTGGAD 420
           VAKNPKKAHLFYLPFSSQLLRSALSEQNSQGRKNLEERLGNYVNLIRRNHQFWNRTGGAD
Sbjct: 361 VAKNPKKAHLFYLPFSSQLLRSALSEQNSQGRKNLEERLGNYVNLIRRNHQFWNRTGGAD 420

Query: 421 HFLVACHDWASKLTRKYMKSCIRALCNANAARGFQIGKDTSLPVTNIHLTKDPDITTGAK 480
           HFLVACHDWASKLTRKYMK+CIRALCNANAARGFQIGKDTSLPVTNIHLTKDPDITTGAK
Sbjct: 421 HFLVACHDWASKLTRKYMKNCIRALCNANAARGFQIGKDTSLPVTNIHLTKDPDITTGAK 480

Query: 481 PPSDRTTLAFFAGGMHGYLRPILLHYWENKEPDMKIFGPMPRDAEGKRIYREHMKNSKYC 540
           PPSDRTTLAFFAGGMHGYLRPILLHYWENKEPDMKIFGPM RDAEGKRIYREHMKNSKYC
Sbjct: 481 PPSDRTTLAFFAGGMHGYLRPILLHYWENKEPDMKIFGPMARDAEGKRIYREHMKNSKYC 540

Query: 541 ICARGYEVHTPRVVEAILNACVPVFLSDNYVPPFFEVLNWESFSVFVQEKEISNLRNILL 600
           ICARGYEVHTPRVVEAILNACVPVFLSDNYVPPFFEVLNWESFSVFVQEKEISNLRNILL
Sbjct: 541 ICARGYEVHTPRVVEAILNACVPVFLSDNYVPPFFEVLNWESFSVFVQEKEISNLRNILL 600

Query: 601 SIPEKDYLVMHARLKIVQKHFIWNKIPVKYDLFHMILHSVWYTRVFQMKTS 651
           SIPE+DYLVMHARLKIVQKHFIWNKIPVKYDLFHMILHSVWYTRVF+MKT+
Sbjct: 601 SIPEEDYLVMHARLKIVQKHFIWNKIPVKYDLFHMILHSVWYTRVFEMKTN 651

BLAST of Cp4.1LG04g10080 vs. NCBI nr
Match: KAG7021879.1 (putative glycosyltransferase, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1274 bits (3297), Expect = 0.0
Identity = 633/651 (97.24%), Positives = 644/651 (98.92%), Query Frame = 0

Query: 1    MAIHVCTNLFHGIKIRSLLIMIAIIISILIVSQCYVYPYAKKSFLPLDVKSSDIMSLQNI 60
            MAIH+CTNLFHGIKIR LL+MIAIIISILIVSQ YVYPYAKKSFLPLDVKSSDIMSLQN+
Sbjct: 667  MAIHICTNLFHGIKIRRLLVMIAIIISILIVSQSYVYPYAKKSFLPLDVKSSDIMSLQNV 726

Query: 61   TSLNHSEVHFLYTVTHVKNRKERTEYITEKKGERGFGLTLDAANSMPYENGTPFEETSAM 120
            TSLNHSEVHFLYTVTHVKNRKERTEYITEKKGERGFGLTLDAANSMPYENGTPFEETSAM
Sbjct: 727  TSLNHSEVHFLYTVTHVKNRKERTEYITEKKGERGFGLTLDAANSMPYENGTPFEETSAM 786

Query: 121  PDGNSTVDNDIGSGTVEFGYNPPMKEKILDNSYKRVVEGEDSSNLNMSKMRNHISFVSNQ 180
            PDGNS+VDNDIGSGTVEFGYNPPMKEKILDNSYKRVVE EDSSNLNMSKMRNHISFVSNQ
Sbjct: 787  PDGNSSVDNDIGSGTVEFGYNPPMKEKILDNSYKRVVEDEDSSNLNMSKMRNHISFVSNQ 846

Query: 181  SQELIVDPRKSDLSSAQNTSSIPEDRFGRTEEIVTKDTRSEQGKNVSDTLDGLARYDIST 240
            SQELIVDPRKSDLSSAQNTSSIPEDRFGR+EEIVTKDTRSEQGKNVS+TLDGLARYDIST
Sbjct: 847  SQELIVDPRKSDLSSAQNTSSIPEDRFGRSEEIVTKDTRSEQGKNVSNTLDGLARYDIST 906

Query: 241  LKSPEMPSISISQMNALLSLSHTSPCSKKPQCRLSSSRDRELLHARLEIEKATAAVNSPG 300
            LKSPEMP ISISQMNALLSLSHTSPCSKKPQCR SSSRDRELLHARLEIEKATA VNSPG
Sbjct: 907  LKSPEMPPISISQMNALLSLSHTSPCSKKPQCRFSSSRDRELLHARLEIEKATAVVNSPG 966

Query: 301  IISVFRDVSMFKRSYDLMEKTLKVYIYKEGEKPIFHQPRMRGIYASEGWFMKLMKENKKF 360
            IISVFR+VSMFKRSYDLMEKTLKVYIYKEGEKPIFHQPRMRGIYASEGWFMKLMKENKKF
Sbjct: 967  IISVFRNVSMFKRSYDLMEKTLKVYIYKEGEKPIFHQPRMRGIYASEGWFMKLMKENKKF 1026

Query: 361  VAKNPKKAHLFYLPFSSQLLRSALSEQNSQGRKNLEERLGNYVNLIRRNHQFWNRTGGAD 420
            VAKNPKKAHLFYLPFSSQLLRSALSEQNSQGRKNLEERLGNYVNLIRRNHQFWNRTGGAD
Sbjct: 1027 VAKNPKKAHLFYLPFSSQLLRSALSEQNSQGRKNLEERLGNYVNLIRRNHQFWNRTGGAD 1086

Query: 421  HFLVACHDWASKLTRKYMKSCIRALCNANAARGFQIGKDTSLPVTNIHLTKDPDITTGAK 480
            HFLVACHDWASKLTRKYMK+CIRALCNANAARGFQIGKDTSLPVTNIHLTKDPDITTGAK
Sbjct: 1087 HFLVACHDWASKLTRKYMKNCIRALCNANAARGFQIGKDTSLPVTNIHLTKDPDITTGAK 1146

Query: 481  PPSDRTTLAFFAGGMHGYLRPILLHYWENKEPDMKIFGPMPRDAEGKRIYREHMKNSKYC 540
            PPSDRTTLAFFAGGMHGYLRPILLHYWENKEPDMKIFGPM RDAEGKRIYREHMKNSKYC
Sbjct: 1147 PPSDRTTLAFFAGGMHGYLRPILLHYWENKEPDMKIFGPMARDAEGKRIYREHMKNSKYC 1206

Query: 541  ICARGYEVHTPRVVEAILNACVPVFLSDNYVPPFFEVLNWESFSVFVQEKEISNLRNILL 600
            ICARGYEVHTPRVVEAILNACVPVFLSDNYVPPFFEVLNWESFSVFVQEKEISNLRNILL
Sbjct: 1207 ICARGYEVHTPRVVEAILNACVPVFLSDNYVPPFFEVLNWESFSVFVQEKEISNLRNILL 1266

Query: 601  SIPEKDYLVMHARLKIVQKHFIWNKIPVKYDLFHMILHSVWYTRVFQMKTS 651
            SIPE+DYLVMHARLKIVQKHFIWNKIPVKYDLFHMILHSVWYTRVF+MKT+
Sbjct: 1267 SIPEEDYLVMHARLKIVQKHFIWNKIPVKYDLFHMILHSVWYTRVFEMKTN 1317
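
The percentages in each HSP summary line follow directly from the match counts (matches divided by alignment length). The snippet below is a minimal sketch of that arithmetic for the hit above; `parse_hsp_stats` is a hypothetical helper name, not part of the database or of any BLAST toolkit, and the regular expression assumes the summary-line layout shown on this page.

```python
import re

# Hypothetical helper, for illustration only.
# The pattern tolerates a missing "i" in the "Positives" label.
STAT_RE = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")

def parse_hsp_stats(line):
    """Return {label: (matches, aligned_length, reported_pct, recomputed_pct)}."""
    stats = {}
    for label, hits, length, reported in STAT_RE.findall(line):
        hits, length = int(hits), int(length)
        # Percent = matching positions / alignment length, to two decimals.
        recomputed = round(100.0 * hits / length, 2)
        key = "Positives" if label.startswith("Pos") else label
        stats[key] = (hits, length, float(reported), recomputed)
    return stats

line = "Identity = 633/651 (97.24%), Positives = 644/651 (98.92%), Query Frame = 0"
stats = parse_hsp_stats(line)
for name, (hits, length, reported, recomputed) in stats.items():
    print(f"{name}: {hits}/{length} reported {reported}% recomputed {recomputed}%")
```

The same check can be run on any of the other summary lines on this page, for example to confirm that a gapped alignment (such as the 666-column Momordica charantia hit) uses the alignment length rather than the query length as the denominator.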

BLAST of Cp4.1LG04g10080 vs. NCBI nr
Match: XP_022965105.1 (probable glycosyltransferase At3g07620 [Cucurbita maxima] >XP_022965113.1 probable glycosyltransferase At3g07620 [Cucurbita maxima] >XP_022965124.1 probable glycosyltransferase At3g07620 [Cucurbita maxima])

HSP 1 Score: 1264 bits (3270), Expect = 0.0
Identity = 631/651 (96.93%), Positives = 638/651 (98.00%), Query Frame = 0

Query: 1   MAIHVCTNLFHGIKIRSLLIMIAIIISILIVSQCYVYPYAKKSFLPLDVKSSDIMSLQNI 60
           MAIH CTNLFHGIKIR LLIMIAIIIS+LIVSQCYVYPYAKKSFLPLDVKSSDIMSLQNI
Sbjct: 1   MAIHKCTNLFHGIKIRRLLIMIAIIISVLIVSQCYVYPYAKKSFLPLDVKSSDIMSLQNI 60

Query: 61  TSLNHSEVHFLYTVTHVKNRKERTEYITEKKGERGFGLTLDAANSMPYENGTPFEETSAM 120
           TSLNHSEVHFLYTVTHVKNRKERTEYITEKKGERGFGLTLDAA SMPYENGTPFEET AM
Sbjct: 61  TSLNHSEVHFLYTVTHVKNRKERTEYITEKKGERGFGLTLDAAKSMPYENGTPFEETLAM 120

Query: 121 PDGNSTVDNDIGSGTVEFGYNPPMKEKILDNSYKRVVEGEDSSNLNMSKMRNHISFVSNQ 180
           PDGN TVDNDIGSGTVEFG NPPMKEKILDNSYKRVVEGEDSSNLNMSKMRNHISFVSNQ
Sbjct: 121 PDGNFTVDNDIGSGTVEFGSNPPMKEKILDNSYKRVVEGEDSSNLNMSKMRNHISFVSNQ 180

Query: 181 SQELIVDPRKSDLSSAQNTSSIPEDRFGRTEEIVTKDTRSEQGKNVSDTLDGLARYDIST 240
            QELIVDPRKSDLSSAQNTSSIPEDRFGRTEEIVT DTRSEQGKNVS TLDGLARYDIST
Sbjct: 181 PQELIVDPRKSDLSSAQNTSSIPEDRFGRTEEIVTNDTRSEQGKNVSVTLDGLARYDIST 240

Query: 241 LKSPEMPSISISQMNALLSLSHTSPCSKKPQCRLSSSRDRELLHARLEIEKATAAVNSPG 300
           L+SPEMP ISISQMNALLSLSHTSPCSKKPQCRLSSSRDRELLHARLEIEKATA VNSPG
Sbjct: 241 LESPEMPPISISQMNALLSLSHTSPCSKKPQCRLSSSRDRELLHARLEIEKATAVVNSPG 300

Query: 301 IISVFRDVSMFKRSYDLMEKTLKVYIYKEGEKPIFHQPRMRGIYASEGWFMKLMKENKKF 360
           IISVFR+VSMFKRSYDLMEKTLKVYIYKEGEKPIFHQPRMRGIYASEGWFMKLMKENK F
Sbjct: 301 IISVFRNVSMFKRSYDLMEKTLKVYIYKEGEKPIFHQPRMRGIYASEGWFMKLMKENKNF 360

Query: 361 VAKNPKKAHLFYLPFSSQLLRSALSEQNSQGRKNLEERLGNYVNLIRRNHQFWNRTGGAD 420
           VAKNPKKAHLFYLPFSSQLLRSALSEQNSQGRK LEERLGNYVNLIRRNHQFWNRTGGAD
Sbjct: 361 VAKNPKKAHLFYLPFSSQLLRSALSEQNSQGRKILEERLGNYVNLIRRNHQFWNRTGGAD 420

Query: 421 HFLVACHDWASKLTRKYMKSCIRALCNANAARGFQIGKDTSLPVTNIHLTKDPDITTGAK 480
           HFLVACHDWASKLTRKYMK+CIRALCNANAARGFQIGKDTS+PVTNIHLTKDPDITTGAK
Sbjct: 421 HFLVACHDWASKLTRKYMKNCIRALCNANAARGFQIGKDTSVPVTNIHLTKDPDITTGAK 480

Query: 481 PPSDRTTLAFFAGGMHGYLRPILLHYWENKEPDMKIFGPMPRDAEGKRIYREHMKNSKYC 540
           PPSDRTTLAFFAGGMHGYLRPILLHYWENKEPDMKIFGPMPR+AEGKRIYREHMKNSKYC
Sbjct: 481 PPSDRTTLAFFAGGMHGYLRPILLHYWENKEPDMKIFGPMPRNAEGKRIYREHMKNSKYC 540

Query: 541 ICARGYEVHTPRVVEAILNACVPVFLSDNYVPPFFEVLNWESFSVFVQEKEISNLRNILL 600
           ICARGYEVHTPRVVEAILNACVPVFLSDNYVPPFFEVLNWESFSVFVQEKEISNLRNILL
Sbjct: 541 ICARGYEVHTPRVVEAILNACVPVFLSDNYVPPFFEVLNWESFSVFVQEKEISNLRNILL 600

Query: 601 SIPEKDYLVMHARLKIVQKHFIWNKIPVKYDLFHMILHSVWYTRVFQMKTS 651
           SIPEKDYLVMHARLKIVQKHFIWNKIPVKYDLFHMILHSVWYTRVFQMKT+
Sbjct: 601 SIPEKDYLVMHARLKIVQKHFIWNKIPVKYDLFHMILHSVWYTRVFQMKTN 651

BLAST of Cp4.1LG04g10080 vs. ExPASy TrEMBL
Match: A0A6J1F5A9 (probable glycosyltransferase At3g07620 OS=Cucurbita moschata OX=3662 GN=LOC111440979 PE=3 SV=1)

HSP 1 Score: 1279 bits (3310), Expect = 0.0
Identity = 637/651 (97.85%), Positives = 644/651 (98.92%), Query Frame = 0

Query: 1   MAIHVCTNLFHGIKIRSLLIMIAIIISILIVSQCYVYPYAKKSFLPLDVKSSDIMSLQNI 60
           MAIH+CTNLFHGIKIR LLIMIAIIISILIVSQCYVYPYAKKSFLPLDVKSSDIMSLQNI
Sbjct: 1   MAIHICTNLFHGIKIRRLLIMIAIIISILIVSQCYVYPYAKKSFLPLDVKSSDIMSLQNI 60

Query: 61  TSLNHSEVHFLYTVTHVKNRKERTEYITEKKGERGFGLTLDAANSMPYENGTPFEETSAM 120
           TSLNHSEVHFLYTVTHVKNRKERTEYITEKKGERGFGLTLDAANSMPYENGTPFEETSAM
Sbjct: 61  TSLNHSEVHFLYTVTHVKNRKERTEYITEKKGERGFGLTLDAANSMPYENGTPFEETSAM 120

Query: 121 PDGNSTVDNDIGSGTVEFGYNPPMKEKILDNSYKRVVEGEDSSNLNMSKMRNHISFVSNQ 180
           PDGNSTVDNDIGSGTVEFGYNPP+KEKILDNSYKRVVEGEDSSNLN SKMRNHISFVSNQ
Sbjct: 121 PDGNSTVDNDIGSGTVEFGYNPPIKEKILDNSYKRVVEGEDSSNLNTSKMRNHISFVSNQ 180

Query: 181 SQELIVDPRKSDLSSAQNTSSIPEDRFGRTEEIVTKDTRSEQGKNVSDTLDGLARYDIST 240
           SQELIVDPRKSDLSSAQNTSSIPEDRFGRTEEIVTKDTRSEQ KNV DTLDGLARYDIST
Sbjct: 181 SQELIVDPRKSDLSSAQNTSSIPEDRFGRTEEIVTKDTRSEQAKNVFDTLDGLARYDIST 240

Query: 241 LKSPEMPSISISQMNALLSLSHTSPCSKKPQCRLSSSRDRELLHARLEIEKATAAVNSPG 300
           LKSPEMP ISISQMNALLSLSHTSPCSKKPQCRLSSSRDRELLHARLEIEKATA VNSPG
Sbjct: 241 LKSPEMPPISISQMNALLSLSHTSPCSKKPQCRLSSSRDRELLHARLEIEKATAVVNSPG 300

Query: 301 IISVFRDVSMFKRSYDLMEKTLKVYIYKEGEKPIFHQPRMRGIYASEGWFMKLMKENKKF 360
           IISVFR+VSMFKRSYDLMEKTLKVYIYKEGEKPIFHQPRMRGIYASEGWFMKLMKENKKF
Sbjct: 301 IISVFRNVSMFKRSYDLMEKTLKVYIYKEGEKPIFHQPRMRGIYASEGWFMKLMKENKKF 360

Query: 361 VAKNPKKAHLFYLPFSSQLLRSALSEQNSQGRKNLEERLGNYVNLIRRNHQFWNRTGGAD 420
           VAKNPKKAHLFYLPFSSQLLRSALSEQNSQGRKNLEERLGNYVNLIRRNHQFWNRTGGAD
Sbjct: 361 VAKNPKKAHLFYLPFSSQLLRSALSEQNSQGRKNLEERLGNYVNLIRRNHQFWNRTGGAD 420

Query: 421 HFLVACHDWASKLTRKYMKSCIRALCNANAARGFQIGKDTSLPVTNIHLTKDPDITTGAK 480
           HFLVACHDWASKLTRKYMK+CIRALCNANAARGFQIGKDTSLPVTNIHLTKDPDITTGAK
Sbjct: 421 HFLVACHDWASKLTRKYMKNCIRALCNANAARGFQIGKDTSLPVTNIHLTKDPDITTGAK 480

Query: 481 PPSDRTTLAFFAGGMHGYLRPILLHYWENKEPDMKIFGPMPRDAEGKRIYREHMKNSKYC 540
           PPSDRTTLAFFAGGMHGYLRPILLHYWENKEPDMKIFGPM RDAEGKRIYREHMKNSKYC
Sbjct: 481 PPSDRTTLAFFAGGMHGYLRPILLHYWENKEPDMKIFGPMARDAEGKRIYREHMKNSKYC 540

Query: 541 ICARGYEVHTPRVVEAILNACVPVFLSDNYVPPFFEVLNWESFSVFVQEKEISNLRNILL 600
           ICARGYEVHTPRVVEAILNACVPVFLSDNYVPPFFEVLNWESFSVFVQEKEISNLRNILL
Sbjct: 541 ICARGYEVHTPRVVEAILNACVPVFLSDNYVPPFFEVLNWESFSVFVQEKEISNLRNILL 600

Query: 601 SIPEKDYLVMHARLKIVQKHFIWNKIPVKYDLFHMILHSVWYTRVFQMKTS 651
           SIPE+DYLVMHARLKIVQKHFIWNKIPVKYDLFHMILHSVWYTRVFQM+T+
Sbjct: 601 SIPEEDYLVMHARLKIVQKHFIWNKIPVKYDLFHMILHSVWYTRVFQMQTN 651

BLAST of Cp4.1LG04g10080 vs. ExPASy TrEMBL
Match: A0A6J1HMX2 (probable glycosyltransferase At3g07620 OS=Cucurbita maxima OX=3661 GN=LOC111465066 PE=3 SV=1)

HSP 1 Score: 1264 bits (3270), Expect = 0.0
Identity = 631/651 (96.93%), Positives = 638/651 (98.00%), Query Frame = 0

Query: 1   MAIHVCTNLFHGIKIRSLLIMIAIIISILIVSQCYVYPYAKKSFLPLDVKSSDIMSLQNI 60
           MAIH CTNLFHGIKIR LLIMIAIIIS+LIVSQCYVYPYAKKSFLPLDVKSSDIMSLQNI
Sbjct: 1   MAIHKCTNLFHGIKIRRLLIMIAIIISVLIVSQCYVYPYAKKSFLPLDVKSSDIMSLQNI 60

Query: 61  TSLNHSEVHFLYTVTHVKNRKERTEYITEKKGERGFGLTLDAANSMPYENGTPFEETSAM 120
           TSLNHSEVHFLYTVTHVKNRKERTEYITEKKGERGFGLTLDAA SMPYENGTPFEET AM
Sbjct: 61  TSLNHSEVHFLYTVTHVKNRKERTEYITEKKGERGFGLTLDAAKSMPYENGTPFEETLAM 120

Query: 121 PDGNSTVDNDIGSGTVEFGYNPPMKEKILDNSYKRVVEGEDSSNLNMSKMRNHISFVSNQ 180
           PDGN TVDNDIGSGTVEFG NPPMKEKILDNSYKRVVEGEDSSNLNMSKMRNHISFVSNQ
Sbjct: 121 PDGNFTVDNDIGSGTVEFGSNPPMKEKILDNSYKRVVEGEDSSNLNMSKMRNHISFVSNQ 180

Query: 181 SQELIVDPRKSDLSSAQNTSSIPEDRFGRTEEIVTKDTRSEQGKNVSDTLDGLARYDIST 240
            QELIVDPRKSDLSSAQNTSSIPEDRFGRTEEIVT DTRSEQGKNVS TLDGLARYDIST
Sbjct: 181 PQELIVDPRKSDLSSAQNTSSIPEDRFGRTEEIVTNDTRSEQGKNVSVTLDGLARYDIST 240

Query: 241 LKSPEMPSISISQMNALLSLSHTSPCSKKPQCRLSSSRDRELLHARLEIEKATAAVNSPG 300
           L+SPEMP ISISQMNALLSLSHTSPCSKKPQCRLSSSRDRELLHARLEIEKATA VNSPG
Sbjct: 241 LESPEMPPISISQMNALLSLSHTSPCSKKPQCRLSSSRDRELLHARLEIEKATAVVNSPG 300

Query: 301 IISVFRDVSMFKRSYDLMEKTLKVYIYKEGEKPIFHQPRMRGIYASEGWFMKLMKENKKF 360
           IISVFR+VSMFKRSYDLMEKTLKVYIYKEGEKPIFHQPRMRGIYASEGWFMKLMKENK F
Sbjct: 301 IISVFRNVSMFKRSYDLMEKTLKVYIYKEGEKPIFHQPRMRGIYASEGWFMKLMKENKNF 360

Query: 361 VAKNPKKAHLFYLPFSSQLLRSALSEQNSQGRKNLEERLGNYVNLIRRNHQFWNRTGGAD 420
           VAKNPKKAHLFYLPFSSQLLRSALSEQNSQGRK LEERLGNYVNLIRRNHQFWNRTGGAD
Sbjct: 361 VAKNPKKAHLFYLPFSSQLLRSALSEQNSQGRKILEERLGNYVNLIRRNHQFWNRTGGAD 420

Query: 421 HFLVACHDWASKLTRKYMKSCIRALCNANAARGFQIGKDTSLPVTNIHLTKDPDITTGAK 480
           HFLVACHDWASKLTRKYMK+CIRALCNANAARGFQIGKDTS+PVTNIHLTKDPDITTGAK
Sbjct: 421 HFLVACHDWASKLTRKYMKNCIRALCNANAARGFQIGKDTSVPVTNIHLTKDPDITTGAK 480

Query: 481 PPSDRTTLAFFAGGMHGYLRPILLHYWENKEPDMKIFGPMPRDAEGKRIYREHMKNSKYC 540
           PPSDRTTLAFFAGGMHGYLRPILLHYWENKEPDMKIFGPMPR+AEGKRIYREHMKNSKYC
Sbjct: 481 PPSDRTTLAFFAGGMHGYLRPILLHYWENKEPDMKIFGPMPRNAEGKRIYREHMKNSKYC 540

Query: 541 ICARGYEVHTPRVVEAILNACVPVFLSDNYVPPFFEVLNWESFSVFVQEKEISNLRNILL 600
           ICARGYEVHTPRVVEAILNACVPVFLSDNYVPPFFEVLNWESFSVFVQEKEISNLRNILL
Sbjct: 541 ICARGYEVHTPRVVEAILNACVPVFLSDNYVPPFFEVLNWESFSVFVQEKEISNLRNILL 600

Query: 601 SIPEKDYLVMHARLKIVQKHFIWNKIPVKYDLFHMILHSVWYTRVFQMKTS 651
           SIPEKDYLVMHARLKIVQKHFIWNKIPVKYDLFHMILHSVWYTRVFQMKT+
Sbjct: 601 SIPEKDYLVMHARLKIVQKHFIWNKIPVKYDLFHMILHSVWYTRVFQMKTN 651

BLAST of Cp4.1LG04g10080 vs. ExPASy TrEMBL
Match: A0A6J1CTZ3 (probable glycosyltransferase At3g07620 OS=Momordica charantia OX=3673 GN=LOC111014547 PE=3 SV=1)

HSP 1 Score: 1023 bits (2646), Expect = 0.0
Identity = 520/666 (78.08%), Positives = 570/666 (85.59%), Query Frame = 0

Query: 1   MAIHVCTNLFHGIKIRSLLIMIAIIISILIVSQCYVYPYAKKSFLPLDVKSSDIMSLQNI 60
           MAIH+ TNLFH IKIR LLIMI+III ILIVSQCYVYPYAK SFLPLD KSS+I +LQN+
Sbjct: 1   MAIHISTNLFHSIKIRRLLIMISIIIPILIVSQCYVYPYAKTSFLPLDFKSSNITTLQNV 60

Query: 61  TSLNHSE------VHFLYTVTHVKNRKERTEYITEKKGERGFGLTLDAANSMPYENGTPF 120
           TSLNHSE      VHF+ T+THVKN KE T+ ITEK+GERG GLT  AA SM YE G  F
Sbjct: 61  TSLNHSEITGFHQVHFMDTITHVKNTKEITDKITEKRGERGLGLTSYAAKSMSYEKGGTF 120

Query: 121 EETSAMPDGNSTVDNDIGSGTVEFGYNPPMKEKILDNSYKRVVEGEDSSNLNMSKMRNHI 180
           E +  MPDG  TVDN +    VEF Y+PPMKE+ L NSY+RVVE EDS+ LN S+ RNH+
Sbjct: 121 EGSLVMPDGKLTVDNGVRKMNVEFRYSPPMKEETLKNSYRRVVEAEDSNYLNASESRNHV 180

Query: 181 SFVSNQSQEL------IVDPRKSDLSSAQNTSSIPEDRFGRTEEIVTKDTRSEQGKNVSD 240
           S VSN+SQEL      IVDPRK DLSSAQN S+IPED F +TEEI+TK T++EQ KNVS 
Sbjct: 181 SIVSNRSQELSRKSVVIVDPRKFDLSSAQNVSTIPEDHFNKTEEIITKRTKTEQRKNVSI 240

Query: 241 TLDGLARYDISTLKSPEMPSISISQMNALLSLSHTSPCSKKPQCRLSSSRDRELLHARLE 300
           TLDGLA+YDIS  KS EMPSISISQMN LLSLSH S C KKPQC  SS RDRELL+ARLE
Sbjct: 241 TLDGLAQYDISNFKSLEMPSISISQMNTLLSLSHNSSCLKKPQCHWSSQRDRELLYARLE 300

Query: 301 IEKATAAVNS--PGII-SVFRDVSMFKRSYDLMEKTLKVYIYKEGEKPIFHQPRMRGIYA 360
           IEKATA VNS  PGI  SVFR+VSMFKRSYDLMEK LKVYIYKEGE PIFHQPR +GIYA
Sbjct: 301 IEKATAVVNSKNPGIATSVFRNVSMFKRSYDLMEKMLKVYIYKEGENPIFHQPRTKGIYA 360

Query: 361 SEGWFMKLMKENKKFVAKNPKKAHLFYLPFSSQLLRSALSEQNSQGRKNLEERLGNYVNL 420
           SEGWFMKL+KENKKFV K+PKKAHLFYLPFSSQLLR  LSEQN    K+LEE LGNYV+L
Sbjct: 361 SEGWFMKLIKENKKFVVKDPKKAHLFYLPFSSQLLRKELSEQNFYKPKDLEEHLGNYVDL 420

Query: 421 IRRNHQFWNRTGGADHFLVACHDWASKLTRKYMKSCIRALCNANAARGFQIGKDTSLPVT 480
           IRR HQFWNRTGG DHFLVACHDWASKLTR++MK+CIRALCN+NAARGFQIGKDTSLPVT
Sbjct: 421 IRRKHQFWNRTGGVDHFLVACHDWASKLTRQHMKNCIRALCNSNAARGFQIGKDTSLPVT 480

Query: 481 NIHLTKDPDITTGAKPPSDRTTLAFFAGGMHGYLRPILLHYWENKEPDMKIFGPMPRDAE 540
            IHL KDPDIT+GAKPPS+RTTLAFFAG +HGYLRP+LLH+WENKEPDMKIFGP+P D E
Sbjct: 481 YIHLKKDPDITSGAKPPSERTTLAFFAGRIHGYLRPVLLHFWENKEPDMKIFGPIPGDIE 540

Query: 541 GKRIYREHMKNSKYCICARGYEVHTPRVVEAILNACVPVFLSDNYVPPFFEVLNWESFSV 600
           GKR+YREHMKNSKYCICARGYEVHTPRVVEAIL+ CVPV +SDNYVPPFFEVLNWESFSV
Sbjct: 541 GKRVYREHMKNSKYCICARGYEVHTPRVVEAILSECVPVIISDNYVPPFFEVLNWESFSV 600

Query: 601 FVQEKEISNLRNILLSIPEKDYLVMHARLKIVQKHFIWNKIPVKYDLFHMILHSVWYTRV 651
           FVQEKEISNLRNILLSIP+K YL MHA+LK+VQKHFIW++ PVKYDLFHMILHSVWY RV
Sbjct: 601 FVQEKEISNLRNILLSIPDKSYLAMHAKLKMVQKHFIWHENPVKYDLFHMILHSVWYNRV 660

BLAST of Cp4.1LG04g10080 vs. ExPASy TrEMBL
Match: A0A6J1CVI7 (probable glycosyltransferase At3g07620 OS=Momordica charantia OX=3673 GN=LOC111014602 PE=3 SV=1)

HSP 1 Score: 645 bits (1665), Expect = 1.87e-223
Identity = 351/650 (54.00%), Positives = 451/650 (69.38%), Query Frame = 0

Query: 14  KIRSLLIMIAIIISILIVSQCYVYPYAKKSFLPLDVKSSDIMSLQNITSLNHSEV---HF 73
           +IR LLI+  +I+ +L V Q +V+ Y K   L  D K S  M + N+  LN S +   H 
Sbjct: 2   EIRRLLIISIMILFVLFVFQYFVFRYTKTLPLSPDDKDSMFMVVHNVCHLNDSGLCRFHP 61

Query: 74  LYTVTHVKNRKERTEYITEKK-GERGFGLTLDAANSMPYENGTPFEETSAMPDGNSTVDN 133
             T   + + KE  +Y T KK  E   G +   + ++  E+   F+E          ++N
Sbjct: 62  TDTGIDILDTKENFDYDTNKKVREETVGSSHLTSENLNKES---FDEKGKTVYEGLVLEN 121

Query: 134 DIGSGTVEFGYNPPMKEKILDNSYKRVVEGEDSSNLNMSKMRNHISFVSNQSQELI---- 193
           D  +   E GY+P MK  +L +S     EG+ SS+L MS + N ++FVSNQSQ  I    
Sbjct: 122 DNQTEDEELGYSPLMKGDVLVDSNMTADEGKGSSSLGMSGIANQVTFVSNQSQGTINNSV 181

Query: 194 --VDPRKSDLSSAQNTSSIPEDRFGRTEEIVTKDTRSEQGKNVSDTL-DGLARYDISTLK 253
             VD   SD+S   NTS   E+   R E++   + R E  K  S  L D +   ++S L 
Sbjct: 182 KKVDQTYSDISVTSNTSGQEENIKNRMEKL-ENNNRIELEKKDSVVLNDKVVGSEVSRLS 241

Query: 254 SPEMPSISISQMNALLSLSHTSPCSKKPQCRLSSSRDRELLHARLEIEKATAAVNSPGI- 313
            P    ISISQM + LS ++ SPC K+PQCR +S  DREL +AR EIE A    ++P I 
Sbjct: 242 GPF---ISISQMYSKLSRAYNSPCLKRPQCRQTSGHDRELHYARQEIENAPVLRSTPEIS 301

Query: 314 ISVFRDVSMFKRSYDLMEKTLKVYIYKEGEKPIFHQPRMRGIYASEGWFMKLMKENKKFV 373
            S+FR++SMF RSY+LMEK LKVY+Y+EGEKP+FHQP + GIYASEGWFMKL++E+ KF+
Sbjct: 302 ASIFRNISMFTRSYELMEKMLKVYVYEEGEKPVFHQPILTGIYASEGWFMKLLEESNKFI 361

Query: 374 AKNPKKAHLFYLPFSSQLLRSALSEQNSQGRKNLEERLGNYVNLIRRNHQFWNRTGGADH 433
            K+P+KAHLFYLPFSSQ LRSA   +  + +++L++ L  +++LI + ++FWNR GG+DH
Sbjct: 362 VKDPEKAHLFYLPFSSQFLRSAFGNK-FRNKRDLQKLLKKFIDLIGKKYRFWNRNGGSDH 421

Query: 434 FLVACHDWASKLTRKYMKSCIRALCNANAARGFQIGKDTSLPVTNIHLTKDPDITTGAKP 493
           FLVACHDWA KLT++ +K+CIRALCNANAA  F+IGKDTSLPVT +H  +D     G KP
Sbjct: 422 FLVACHDWAPKLTKRVVKNCIRALCNANAAADFEIGKDTSLPVTFVHSMEDSIKDIGGKP 481

Query: 494 PSDRTTLAFFAGGMHGYLRPILLHYWENKEPDMKIFGPMPRDAEGKRIYREHMKNSKYCI 553
           PS RT LAFFAG MHGYLRPILLHYWENKE DM I GPMP   EGKR Y   MK+SKYCI
Sbjct: 482 PSGRTALAFFAGSMHGYLRPILLHYWENKELDMMIVGPMPNGIEGKRAYMAQMKSSKYCI 541

Query: 554 CARGYEVHTPRVVEAILNACVPVFLSDNYVPPFFEVLNWESFSVFVQEKEISNLRNILLS 613
           CARGY+VHTPRV+EAILN C+PV LSDNYVPPFFEVLNWESFSVFV+E+EI  LR+ILLS
Sbjct: 542 CARGYQVHTPRVIEAILNECIPVILSDNYVPPFFEVLNWESFSVFVKEREIPKLRDILLS 601

Query: 614 IPEKDYLVMHARLKIVQKHFIWNKIPVKYDLFHMILHSVWYTRVFQMKTS 651
           IPE++YL MH+R+K+VQ+HF+W++ P KYD FHMILHS+WYTRVFQ+KT+
Sbjct: 602 IPEENYLAMHSRVKMVQQHFLWHEKPAKYDAFHMILHSIWYTRVFQIKTN 643

BLAST of Cp4.1LG04g10080 vs. ExPASy TrEMBL
Match: A0A1S3CNV8 (probable glycosyltransferase At3g07620 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103503066 PE=3 SV=1)

HSP 1 Score: 632 bits (1630), Expect = 2.00e-218
Identity = 346/647 (53.48%), Positives = 447/647 (69.09%), Query Frame = 0

Query: 16  RSLLIMIAIIISILIVSQCYVYPYAKKSFLPLDVKSSDIMSLQNITSLNHSEVHFLYTVT 75
           R + I+  +I+ IL   Q  V+ Y KK  L    K+S  M +QN+  LN++ +   + + 
Sbjct: 5   RRVFIITFMILLILFAFQYSVFRYTKKLSLSFGDKASTFMVVQNVCHLNNAGLCRFHPID 64

Query: 76  HVKNR---KERTEYITEKKGERGFGLTLDAANSMPYENGTPFEETSAMPDGNSTVDNDIG 135
              N+   K++ +Y    KG R   + L    S      T   ET+A             
Sbjct: 65  SGINKLDTKKKFDY-DSNKGVRDEVVDL---TSEFLNKDTAKSETNA------------- 124

Query: 136 SGTVEFGYNPPMKEKILDNSYKRVVEGEDSSNLNMSKMRNHISFVSNQSQELI------V 195
               E  YNP MK  +L+NS     E + +S+  M+++RN I  V NQS+  +      V
Sbjct: 125 ----ELSYNPRMKG-VLENSNMTADEAKANSSPGMNEVRNQIMVVPNQSEGTMNNSIKKV 184

Query: 196 DPRKSDLSSAQNTSSIPEDRFGR-TEEIVTKDTRSEQGKNVSDTLDGLARYDISTLKSPE 255
           D   SD+S   NTSS  +++    +EE+   D      K +    D +   D+STL  P 
Sbjct: 185 DQTYSDISVIPNTSSDQKEKIKNISEELENNDRIELVKKGLFVFKDRMVGPDVSTLNGPF 244

Query: 256 MPSISISQMNALLSLSHTSPCSKKPQCRLSSSRDRELLHARLEIEKATAAVNSPGI-ISV 315
              ISISQ+ + LS +H S CSK+ QCR +S RDRELL+ARLEIE A+A  ++P I  SV
Sbjct: 245 ---ISISQIYSKLSRAHKSACSKRLQCRQTSQRDRELLNARLEIENASALRSTPDISASV 304

Query: 316 FRDVSMFKRSYDLMEKTLKVYIYKEGEKPIFHQPRMRGIYASEGWFMKLMKENKKFVAKN 375
           FR++SMF RSY+LMEK LKVYIY EGEKPIFHQP + GIYASEGWFMKL+++NKKFV K+
Sbjct: 305 FRNISMFTRSYELMEKMLKVYIYDEGEKPIFHQPILTGIYASEGWFMKLLEDNKKFVVKD 364

Query: 376 PKKAHLFYLPFSSQLLRSALSEQNSQGRKNLEERLGNYVNLIRRNHQFWNRTGGADHFLV 435
           P+KAHLFYLPFSSQ LRSA   +  + +++L++ L NYV++I + ++FWN+ GG+DHFLV
Sbjct: 365 PEKAHLFYLPFSSQFLRSAFGNK-FRNKRDLQKPLKNYVDVIGKKYRFWNKNGGSDHFLV 424

Query: 436 ACHDWASKLTRKYMKSCIRALCNANAARGFQIGKDTSLPVTNIHLTKDPDITTGAKPPSD 495
           ACHDWA KLT++ +K+CIRALCNANAA  F+IGKDTSLPVT +H T+D     G KPPS+
Sbjct: 425 ACHDWAPKLTKRLVKNCIRALCNANAAGDFEIGKDTSLPVTFVHSTEDLITEIGGKPPSE 484

Query: 496 RTTLAFFAGGMHGYLRPILLHYWENKEPDMKIFGPMPRDAEGKRIYREHMKNSKYCICAR 555
           RTTLAFFAG MHGYLR ILLHYWENKEPDM I GPMP   EGK  Y E MK+SKYCICAR
Sbjct: 485 RTTLAFFAGSMHGYLRSILLHYWENKEPDMMIVGPMPNSIEGKNAYMEQMKSSKYCICAR 544

Query: 556 GYEVHTPRVVEAILNACVPVFLSDNYVPPFFEVLNWESFSVFVQEKEISNLRNILLSIPE 615
           GY+VH+PRV+EAILN C+PV +SDNYVPPFFEVLNW+SFSVFV+E+EI NLR+ILLSIPE
Sbjct: 545 GYQVHSPRVIEAILNECIPVIISDNYVPPFFEVLNWKSFSVFVKEREIPNLRDILLSIPE 604

Query: 616 KDYLVMHARLKIVQKHFIWNKIPVKYDLFHMILHSVWYTRVFQMKTS 651
           ++Y  MH+R+K+VQ+HF+W++ P KYD FHMILHS+WYTRVFQ+K++
Sbjct: 605 ENYRAMHSRVKMVQQHFLWHEKPAKYDAFHMILHSIWYTRVFQIKSN 625

BLAST of Cp4.1LG04g10080 vs. TAIR 10
Match: AT5G37000.1 (Exostosin family protein)

HSP 1 Score: 482.6 bits (1241), Expect = 4.8e-136
Identity = 266/498 (53.41%), Positives = 337/498 (67.67%), Query Frame = 0

Query: 162 SSNLNMSKMRNHISFVSNQSQELIVDPRKSDLSS-----------AQNTSSIPEDRFG-R 221
           +SN+    + N+ +   +  +EL  + +K DL S             N S I       R
Sbjct: 55  NSNVTQVSVMNYTNLSDDDDEEL--ENKKEDLDSENDVVISKEKVEMNVSFIAIGNISLR 114

Query: 222 TEEIVTKDTRSEQGKN-----VSDTLDGLARYDISTLKSPEMPSISISQMNALLSLSHTS 281
             ++V   + SE   N     V D+  G     +S  +  +  +ISISQMN+LL  S +S
Sbjct: 115 NPKMVVVSSESESDPNSVMIRVKDSRKGNV---LSLRRHKQGSAISISQMNSLLIQSLSS 174

Query: 282 PCSKKPQCRLSSSRDRELLHARLEIEKATAAVNSPGIIS-VFRDVSMFK----------- 341
              K P+ R SS+RD E+L AR EIEK +   +  G+   V+R++S F            
Sbjct: 175 --FKSPKPRWSSARDSEMLSARSEIEKVSLVHDFLGLNPLVYRNISKFLRSGDMSRFSMC 234

Query: 342 ---RSYDLMEKTLKVYIYKEGEKPIFHQPRMRGIYASEGWFMKLMKENKKFVAKNPKKAH 401
              RSYDLME+ LK+Y+YKEG KPIFH P  RGIYASEGWFMKLM+ NKKFV K+P+KAH
Sbjct: 235 CLFRSYDLMERKLKIYVYKEGGKPIFHTPMPRGIYASEGWFMKLMESNKKFVVKDPRKAH 294

Query: 402 LFYLPFSSQLLRSALSEQNSQGRKNLEERLGNYVNLIRRNHQFWNRTGGADHFLVACHDW 461
           LFY+P S + LRS+L   + Q  K+L + L  YV+LI   ++FWNRTGGADHFLVACHDW
Sbjct: 295 LFYIPISIKALRSSLG-LDFQTPKSLADHLKEYVDLIAGKYKFWNRTGGADHFLVACHDW 354

Query: 462 ASKLTRKYMKSCIRALCNANAARGFQIGKDTSLPVTNIHLTKDPDITTGAKPPSDRTTLA 521
            +KLT K MK+ +R+LCN+N A+GF+IG DT+LPVT I  ++ P    G K  S+R  LA
Sbjct: 355 GNKLTTKTMKNSVRSLCNSNVAQGFRIGTDTALPVTYIRSSEAPLEYLGGKTSSERKILA 414

Query: 522 FFAGGMHGYLRPILLHYWENKEPDMKIFGPMPRDAEGKRIYREHMKNSKYCICARGYEVH 581
           FFAG MHGYLRPIL+  WENKEPDMKIFGPMPRD + K+ YRE+MK+S+YCICARGYEVH
Sbjct: 415 FFAGSMHGYLRPILVKLWENKEPDMKIFGPMPRDPKSKKQYREYMKSSRYCICARGYEVH 474

Query: 582 TPRVVEAILNACVPVFLSDNYVPPFFEVLNWESFSVFVQEKEISNLRNILLSIPEKDYLV 628
           TPRVVEAI+N CVPV ++DNYVPPFFEVLNWE F+VFV+EK+I NLRNILLSIPE  Y+ 
Sbjct: 475 TPRVVEAIINECVPVIIADNYVPPFFEVLNWEEFAVFVEEKDIPNLRNILLSIPEDRYIG 534

BLAST of Cp4.1LG04g10080 vs. TAIR 10
Match: AT4G32790.1 (Exostosin family protein)

HSP 1 Score: 463.0 bits (1190), Expect = 4.0e-130
Identity = 247/514 (48.05%), Positives = 341/514 (66.34%), Query Frame = 0

Query: 149 LDNSYKRVVEGEDSSNLNMSK-----------MRNHISFVSNQSQELIVDPRKSDLSSA- 208
           L++S  R VE ++  +  + +           ++ H SFV +   +  +D      SS+ 
Sbjct: 86  LNSSSSRSVEVDEEESTGLKEDHVIGFDKNDTVQGHDSFVEDVKDKETLDLLPGTKSSSN 145

Query: 209 QNTSSIPEDRFGRTEEIVTKDTRSEQGKNVSDTLDGLARYDISTLKSPEMPSISISQMNA 268
           ++   I ED     E I   +    +     D L    +  ++   S     +SI++M  
Sbjct: 146 ESYEKIVEDADIAFENIRKMEILESKSDPSVDNLSSEVKKFMNVSNS---GVVSITEMMN 205

Query: 269 LLSLSHTSPCSKKPQCRLSSSRDRELLHARLEIEKATAAVNSPGI-ISVFRDVSMFKRSY 328
           LL  S TS  S K   + SS+ D ELL+AR +IE      N P +   ++ ++SMFKRSY
Sbjct: 206 LLHQSRTSHVSLK--VKRSSTIDHELLYARTQIENPPLIENDPLLHTPLYWNLSMFKRSY 265

Query: 329 DLMEKTLKVYIYKEGEKPIFHQPRMRGIYASEGWFMKLMKENKKFVAKNPKKAHLFYLPF 388
           +LMEK LKVY+Y+EG++P+ H+P ++GIYASEGWFMK +K ++ FV K+P+KAHLFYLPF
Sbjct: 266 ELMEKKLKVYVYREGKRPVLHKPVLKGIYASEGWFMKQLKSSRTFVTKDPRKAHLFYLPF 325

Query: 389 SSQLLRSALSEQNSQGRKNLEERLGNYVNLIRRNHQFWNRTGGADHFLVACHDWASKLTR 448
           SS++L   L    S   KNL + L NY+++I   + FWN+TGG+DHFLVACHDWA   TR
Sbjct: 326 SSKMLEETLYVPGSHSDKNLIQFLKNYLDMISSKYSFWNKTGGSDHFLVACHDWAPSETR 385

Query: 449 KYMKSCIRALCNANAARGFQIGKDTSLPVTNIHLTKDPDITTGAKPPSDRTTLAFFAGGM 508
           +YM  CIRALCN++ + GF  GKD +LP T I + + P    G KP S R  LAFFAGGM
Sbjct: 386 QYMAKCIRALCNSDVSEGFVFGKDVALPETTILVPRRPLRALGGKPVSQRQILAFFAGGM 445

Query: 509 HGYLRPILLHYW-ENKEPDMKIFGPMPRDAEGKRIYREHMKNSKYCICARGYEVHTPRVV 568
           HGYLRP+LL  W  N++PDMKIF  +P+ ++GK+ Y E+MK+SKYCIC +G+EV++PRVV
Sbjct: 446 HGYLRPLLLQNWGGNRDPDMKIFSEIPK-SKGKKSYMEYMKSSKYCICPKGHEVNSPRVV 505

Query: 569 EAILNACVPVFLSDNYVPPFFEVLNWESFSVFVQEKEISNLRNILLSIPEKDYLVMHARL 628
           EA+   CVPV +SDN+VPPFFEVLNWESF+VFV EK+I +L+NIL+SI E+ Y  M  R+
Sbjct: 506 EALFYECVPVIISDNFVPPFFEVLNWESFAVFVLEKDIPDLKNILVSITEERYREMQMRV 565

Query: 629 KIVQKHFIWNKIPVKYDLFHMILHSVWYTRVFQM 649
           K+VQKHF+W+  P ++D+FHMILHS+WY RVFQ+
Sbjct: 566 KMVQKHFLWHSKPERFDIFHMILHSIWYNRVFQI 593

BLAST of Cp4.1LG04g10080 vs. TAIR 10
Match: AT5G19670.1 (Exostosin family protein)

HSP 1 Score: 462.6 bits (1189), Expect = 5.2e-130
Identity = 253/532 (47.56%), Positives = 335/532 (62.97%), Query Frame = 0

Query: 124 NSTVDNDIGSGTVEFGYNPPMKEKILDNSYKRVVEGED----SSNLNMSKMRNHISFVSN 183
           N + D++   G V+F     +K+ I+    K V    D    S    M K     S    
Sbjct: 97  NESEDDEGFVGNVDFESFEDVKDSII---IKEVAGSSDNLFPSETTVMQKESVSTSNNGY 156

Query: 184 QSQELIVDPRKSDLSS-AQNTSSIPEDRFGRTEEIVTKDTRSEQGKNVSDTLDGLARYDI 243
           Q Q + V  +K+  SS     SSI     G +  +V+K    ++            R D+
Sbjct: 157 QVQNVTVQSQKNVKSSILSGGSSIASPASGNSSLLVSKKVSKKK----------KMRCDL 216

Query: 244 STLKSPEMPSISISQMNALLSLSHTSPCSKKPQCRLSSSRDRELLHARLEIEKATAAVNS 303
                P     +I +MN +L+    +  + +P  R SS RD E+L AR EIE A  A   
Sbjct: 217 -----PPKSVTTIDEMNRILARHRRTSRAMRP--RWSSRRDEEILTARKEIENAPVAKLE 276

Query: 304 PGII-SVFRDVSMFKRSYDLMEKTLKVYIYKEGEKPIFHQPRMRGIYASEGWFMKLMKEN 363
             +   +FR+VS+FKRSY+LME+ LKVY+YKEG +PIFH P ++G+YASEGWFMKLM+ N
Sbjct: 277 RELYPPIFRNVSLFKRSYELMERILKVYVYKEGNRPIFHTPILKGLYASEGWFMKLMEGN 336

Query: 364 KKFVAKNPKKAHLFYLPFSSQLLRSALSEQNSQGRKNLEERLGNYVNLIRRNHQFWNRTG 423
           K++  K+P+KAHL+Y+PFS+++L   L  +NS  R NL + L  Y   I   + F+NRT 
Sbjct: 337 KQYTVKDPRKAHLYYMPFSARMLEYTLYVRNSHNRTNLRQFLKEYTEHISSKYPFFNRTD 396

Query: 424 GADHFLVACHDWASKLTRKYMKSCIRALCNANAARGFQIGKDTSLPVTNIHLTKDPDITT 483
           GADHFLVACHDWA   TR +M+ CI+ALCNA+   GF+IG+D SLP T +   K+P    
Sbjct: 397 GADHFLVACHDWAPYETRHHMEHCIKALCNADVTAGFKIGRDISLPETYVRAAKNPLRDL 456

Query: 484 GAKPPSDRTTLAFFAGGMHGYLRPILLHYWENKEPDMKIFGPMPRDAEGKRIYREHMKNS 543
           G KPPS R TLAF+AG MHGYLR ILL +W++K+PDMKIFG MP     K  Y E MK+S
Sbjct: 457 GGKPPSQRRTLAFYAGSMHGYLRQILLQHWKDKDPDMKIFGRMPFGVASKMNYIEQMKSS 516

Query: 544 KYCICARGYEVHTPRVVEAILNACVPVFLSDNYVPPFFEVLNWESFSVFVQEKEISNLRN 603
           KYCIC +GYEV++PRVVE+I   CVPV +SDN+VPPFFEVL+W +FSV V EK+I  L++
Sbjct: 517 KYCICPKGYEVNSPRVVESIFYECVPVIISDNFVPPFFEVLDWSAFSVIVAEKDIPRLKD 576

Query: 604 ILLSIPEKDYLVMHARLKIVQKHFIWNKIPVKYDLFHMILHSVWYTRVFQMK 650
           ILLSIPE  Y+ M   ++  Q+HF+W+  P KYDLFHM+LHS+WY RVFQ K
Sbjct: 577 ILLSIPEDKYVKMQMAVRKAQRHFLWHAKPEKYDLFHMVLHSIWYNRVFQAK 608

BLAST of Cp4.1LG04g10080 vs. TAIR 10
Match: AT5G25820.1 (Exostosin family protein)

HSP 1 Score: 451.8 bits (1161), Expect = 9.2e-127
Identity = 239/440 (54.32%), Positives = 304/440 (69.09%), Query Frame = 0

Query: 219 RSEQGKNVSDTLDGLARYDISTLKSPEMPS---ISISQMNALL---SLSHTSPCSKKPQC 278
           R+   KNV D    + R+     ++ +MP    +SIS+M+  L    +SH +  +KKP  
Sbjct: 218 RNPTKKNVGDA-SPIVRFVPDVKENAKMPGFGVMSISEMSKQLRQNRISH-NRLAKKP-- 277

Query: 279 RLSSSRDRELLHARLEIEKATAAVNSPGIIS-VFRDVSMFKRSYDLMEKTLKVYIYKEGE 338
           +  +  D ELL A+ +IE A      P + + ++R+VSMFKRSY+LMEK LKVY YKEG 
Sbjct: 278 KWVTKPDLELLQAKYDIENAPIDDKDPFLYAPLYRNVSMFKRSYELMEKILKVYAYKEGN 337

Query: 339 KPIFHQPRMRGIYASEGWFMKLMK-ENKKFVAKNPKKAHLFYLPFSSQLLRSALSEQNSQ 398
           KPI H P +RGIYASEGWFM +++  N KFV K+P KAHLFYLPFSS++L   L  Q+S 
Sbjct: 338 KPIMHSPILRGIYASEGWFMNIIESNNNKFVTKDPAKAHLFYLPFSSRMLEVTLYVQDSH 397

Query: 399 GRKNLEERLGNYVNLIRRNHQFWNRTGGADHFLVACHDWASKLTRKYMKSCIRALCNANA 458
             +NL + L +Y++ I   + FWNRT GADHFL ACHDWA   TRK+M   IRALCN++ 
Sbjct: 398 SHRNLIKYLKDYIDFISAKYPFWNRTSGADHFLAACHDWAPSETRKHMAKSIRALCNSDV 457

Query: 459 ARGFQIGKDTSLPVTNIHLTKDPDITTGAKPPSDRTTLAFFAGGM-HGYLRPILLHYW-E 518
             GF  GKDTSLP T +   K P    G K  + R  LAFFAG   HGYLRPILL YW  
Sbjct: 458 KEGFVFGKDTSLPETFVRDPKKPLSNMGGKSANQRPILAFFAGKPDHGYLRPILLSYWGN 517

Query: 519 NKEPDMKIFGPMPRDAEGKRIYREHMKNSKYCICARGYEVHTPRVVEAILNACVPVFLSD 578
           NK+PD+KIFG +PR  +G + Y + MK SKYCICA+G+EV++PRVVEAI   CVPV +SD
Sbjct: 518 NKDPDLKIFGKLPR-TKGNKNYLQFMKTSKYCICAKGFEVNSPRVVEAIFYDCVPVIISD 577

Query: 579 NYVPPFFEVLNWESFSVFVQEKEISNLRNILLSIPEKDYLVMHARLKIVQKHFIWNKIPV 638
           N+VPPFFEVLNWESF++F+ EK+I NL+ IL+SIPE  Y  M  R+K VQKHF+W+  P 
Sbjct: 578 NFVPPFFEVLNWESFAIFIPEKDIPNLKKILMSIPESRYRSMQMRVKKVQKHFLWHAKPE 637

Query: 639 KYDLFHMILHSVWYTRVFQM 649
           KYD+FHMILHS+WY RVFQ+
Sbjct: 638 KYDMFHMILHSIWYNRVFQI 652

BLAST of Cp4.1LG04g10080 vs. TAIR 10
Match: AT5G11610.1 (Exostosin family protein)

HSP 1 Score: 433.0 bits (1112), Expect = 4.4e-121
Identity = 221/419 (52.74%), Positives = 291/419 (69.45%), Query Frame = 0

Query: 235 RYDISTLKSPEMPSISISQMNALLSLSHTSPCSKKPQCRLSSSR-DRELLHARLEIEKAT 294
           +Y   ++  P    ISI QMN ++   H  P  K     L  S+ D+EL  AR +I+KA 
Sbjct: 133 KYPHRSITKPPSIVISIKQMNNMILKRHNDP--KNSLAPLWGSKVDQELKTARDKIKKAA 192

Query: 295 AAVNSPGIIS-VFRDVSMFKRSYDLMEKTLKVYIYKEGEKPIFHQPR--MRGIYASEGWF 354
                  + + ++ ++S+FKRSY+LME+TLKVY+Y EG++PIFHQP   M GIYASEGWF
Sbjct: 193 LVKKDDTLYAPLYHNISIFKRSYELMEQTLKVYVYSEGDRPIFHQPEAIMEGIYASEGWF 252

Query: 355 MKLMKENKKFVAKNPKKAHLFYLPFSSQLLRSALSEQNSQGRKNLEERLGNYVNLIRRNH 414
           MKLM+ + +F+ K+P KAHLFY+PFSS++L+  L   +S  R NL + LGNY++LI  N+
Sbjct: 253 MKLMESSHRFLTKDPTKAHLFYIPFSSRILQQKLYVHDSHSRNNLVKYLGNYIDLIASNY 312

Query: 415 QFWNRTGGADHFLVACHDWASKLTRKYMKSCIRALCNANAARGFQIGKDTSLPVTNIHLT 474
             WNRT G+DHF  ACHDWA   TR    +CIRALCNA+    F +GKD SLP T +   
Sbjct: 313 PSWNRTCGSDHFFTACHDWAPTETRGPYINCIRALCNADVGIDFVVGKDVSLPETKVSSL 372

Query: 475 KDPDITTGAKPPSDRTTLAFFAGGMHGYLRPILLHYWENK-EPDMKIFGPMPRDAEGKRI 534
           ++P+   G   PS RT LAFFAG +HGY+RPILL+ W ++ E DMKIF  +       + 
Sbjct: 373 QNPNGKIGGSRPSKRTILAFFAGSLHGYVRPILLNQWSSRPEQDMKIFNRIDH-----KS 432

Query: 535 YREHMKNSKYCICARGYEVHTPRVVEAILNACVPVFLSDNYVPPFFEVLNWESFSVFVQE 594
           Y  +MK S++C+CA+GYEV++PRVVE+IL  CVPV +SDN+VPPF E+LNWESF+VFV E
Sbjct: 433 YIRYMKRSRFCVCAKGYEVNSPRVVESILYGCVPVIISDNFVPPFLEILNWESFAVFVPE 492

Query: 595 KEISNLRNILLSIPEKDYLVMHARLKIVQKHFIWNK-IPVKYDLFHMILHSVWYTRVFQ 648
           KEI NLR IL+SIP + Y+ M  R+  VQKHF+W+   PV+YD+FHMILHSVWY RVFQ
Sbjct: 493 KEIPNLRKILISIPVRRYVEMQKRVLKVQKHFMWHDGEPVRYDIFHMILHSVWYNRVFQ 544

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
Q9FFN2	2.3e-82	40.15	Probable glycosyltransferase At5g03795 OS=Arabidopsis thaliana OX=3702 GN=At5g03...
Q9SSE8	6.3e-80	42.49	Probable glycosyltransferase At3g07620 OS=Arabidopsis thaliana OX=3702 GN=At3g07...
Q3E7Q9	3.6e-75	38.65	Probable glycosyltransferase At5g25310 OS=Arabidopsis thaliana OX=3702 GN=At5g25...
Q9LFP3	1.5e-73	39.94	Probable glycosyltransferase At5g11130 OS=Arabidopsis thaliana OX=3702 GN=At5g11...
Q3EAR7	3.8e-69	36.93	Probable glycosyltransferase At3g42180 OS=Arabidopsis thaliana OX=3702 GN=At3g42...

Match Name	E-value	Identity	Description
XP_023531315.1	0.0	100.00	probable glycosyltransferase At3g07620 [Cucurbita pepo subsp. pepo]
XP_022933600.1	0.0	97.85	probable glycosyltransferase At3g07620 [Cucurbita moschata]
KAG6587987.1	0.0	97.24	putative glycosyltransferase, partial [Cucurbita argyrosperma subsp. sororia]
KAG7021879.1	0.0	97.24	putative glycosyltransferase, partial [Cucurbita argyrosperma subsp. argyrosperm...
XP_022965105.1	0.0	96.93	probable glycosyltransferase At3g07620 [Cucurbita maxima] >XP_022965113.1 probab...

Match Name	E-value	Identity	Description
A0A6J1F5A9	0.0	97.85	probable glycosyltransferase At3g07620 OS=Cucurbita moschata OX=3662 GN=LOC11144...
A0A6J1HMX2	0.0	96.93	probable glycosyltransferase At3g07620 OS=Cucurbita maxima OX=3661 GN=LOC1114650...
A0A6J1CTZ3	0.0	78.08	probable glycosyltransferase At3g07620 OS=Momordica charantia OX=3673 GN=LOC1110...
A0A6J1CVI7	1.87e-223	54.00	probable glycosyltransferase At3g07620 OS=Momordica charantia OX=3673 GN=LOC1110...
A0A1S3CNV8	2.00e-218	53.48	probable glycosyltransferase At3g07620 isoform X2 OS=Cucumis melo OX=3656 GN=LOC...

Match Name	E-value	Identity	Description
AT5G37000.1	4.8e-136	53.41	Exostosin family protein
AT4G32790.1	4.0e-130	48.05	Exostosin family protein
AT5G19670.1	5.2e-130	47.56	Exostosin family protein
AT5G25820.1	9.2e-127	54.32	Exostosin family protein
AT5G11610.1	4.4e-121	52.74	Exostosin family protein

InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR040911	Exostosin, GT47 domain	PFAM	PF03016	Exostosin	coord: 319..600; e-value: 2.8E-56; score: 191.0
IPR004263	Exostosin-like	PANTHER	PTHR11062	EXOSTOSIN HEPARAN SULFATE GLYCOSYLTRANSFERASE -RELATED	coord: 212..648
None	No IPR available	PANTHER	PTHR11062:SF77	GLYCOSYLTRANSFERASE FAMILY EXOSTOSIN PROTEIN	coord: 212..648
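
The `coord` ranges above are inclusive residue positions, so the length of each annotated region and the fraction of the protein it covers can be worked out directly. The sketch below assumes a protein length of 651 residues (the last query position in the BLAST alignments for this feature); `span_length` is a hypothetical helper used only for illustration.

```python
def span_length(coord):
    """Length of an inclusive 'start..end' residue range."""
    start, end = (int(x) for x in coord.split(".."))
    return end - start + 1

# Assumption: query protein length taken from the alignments on this page.
PROTEIN_LENGTH = 651

regions = {
    "PF03016 (Exostosin, GT47 domain)": "319..600",
    "PTHR11062 (Exostosin-like)": "212..648",
}

coverage = {}
for name, coord in regions.items():
    length = span_length(coord)
    coverage[name] = length / PROTEIN_LENGTH
    print(f"{name}: {length} residues, {coverage[name]:.1%} of the protein")
```

Under that assumption, both annotations fall in the C-terminal part of the protein, which is also the region where the more distant Arabidopsis hits align.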

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG04g10080.1	Cp4.1LG04g10080.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0006486	protein glycosylation
cellular_component	GO:0000139	Golgi membrane
cellular_component	GO:0016021	integral component of membrane
molecular_function	GO:0016757	glycosyltransferase activity