CaUC04G078450 (gene) Watermelon (USVL246-FR2) v1

Overview
NameCaUC04G078450
Typegene
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionNucleotide-diphospho-sugar transferase, nucleotide-diphospho-sugar transferase
LocationCiama_Chr04: 27777137 .. 27789213 (+)
RNA-Seq ExpressionCaUC04G078450
SyntenyCaUC04G078450
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAATTAGTGAATGATACGTAATTAAAATATAAGACTAAAATAGAGTTAGAGATCAAAATGTGGAAATTAATCAATGAATAAATGAAAGCAAAGAAGGAAGAAAAATCAAAATGGGGAGGCTGAGTGGGAGAAAGATTCCCCTTCTAAATTCCAGTCGTTTTCAGAAACACCAATACAAGCAATTTGTGTATATATATATATATATATAAAAGATACAAATTAAGGAGGGGAGGGACCAGACAAAGAAATTATGAAATAGAGAAAGGGTCAGCCGGCAATAAATTAAAAACTCCGTCTCGTATTAATATCTTCGCCCACTGCGTTTCTTTTACAGATTAACCCCATTTTTAATTTCAAGTTTTATTTGTTGCCCCTCTCTGACGTCAACATGAAAATAACGCCATGTTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTATATCTCTTTTCATCCTCTAATTCCCAGCGGCAGCGAGTCAACCATTCCGCTGTCGGCCTTATATGCAATTAAACTTTCCTCTTTTCCTATTTGGCTCCGCCTCCTCACGCAGTTTCAGCTTCCAATGAAGAATAATTCCGCCGCCGACTGCGAACCTGCCGCCAAGTCCTCGGCTCCCTCGGCCGTTCATACGGCGGTGGTTCCATGGAGGACGGTGAGGATCTCGGTTGTGTTGGTGGGCCTTATGTTGGGCCTCCTTGTTCTCTACAACTCAGCCATTAATCCTTTCAAATTTCTTCCTGTTTCCTACACCTACCGTGCTTTTCGATCCTCTTCTCCTCACAAAGATCTTCTTTTGGTTAGTAATTCCATTTCTTATCTCTCTTTTTTTTTTTTTTTTTTTCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTCTCTTTTTTTTTTTTTTTTTTTTCAATTACCCATTTGTTTAATTATTATTAATTTTTGGTAACTGGGGAATGGAATCGGGCGCTAGACTTGAGGAAGGTTGATTGTGTTTACGAGTAAAATCTGAATTAAAACGGAAAATAATTGTTGCTTCCCAATTGTGACGAAACGCAATTCTGGAGTCCTAGGATTGATGAATAAATGGAAGAGAGCCAGCTGGGCTTTGTTTGTGAATTAATATATAAATATTTTTATATAAAGGCAAAGGTTACATCTGCCGTAGTTCTTTAATGGATCATTTAATAATTTTCCTTGTATAAGTTTTTTAAACCCACACACAAAATTATTTTTAAAATAGAATTCAAAACGCAAAATTATTGGTGAATATCATGGCAAGTGATTGGTAAGTTGGCGATAGGCTCAATAGCCGGCTGGTTCTGTTCAGCGCCAGCGCGTCACTCGCACGTGCATTTATTATTGTTTCCCTTCTTTCCATTATTTTTTATATATTTCTTTTTACTTAGTTCAAATCCATGAAAAAGATATTTCAACTTTCATCTTTTTAGTTCATTTTTTTCCAATATTTGTTGTTTGTTTAATTTTAGTGGCTATAATTTCAATAAATTTTAAATTTTGTCTCTAATATTATTTTATTATTGGTTTTTAAAAATTTTTTTTTTATTTTTTAGGATATTTTTACTATAAATTTCGAAAACATATTCACATATTTTATTTTCTTATAGAGAAATTATTATTATTATTTTTATTTAATCAATTTTAGTAAAAAGTAATTTTAAGCGACTTGATTTAAGATTTAATGAAAAGTCCTTTAATCAATTTTATTATTTTTATTTAATCAATTTTAGTAAAAAGTAATTTTAAACTTTAAATTGTTTTGAACAGGCTTCAAATTTGTAAGGTCCTTTTTAACACAAAGTCGAAACCTAAGTACTTAATACACCCAAAATTAAAAATTTAAGAGACTTGTTAGACATTTAAAAAAAAATTCAATAACCAAATAGCTACAAAAAATAAAAAGTCATTATATGACGTGTAATTTCTATAATATCATAACTATTTAGGGATATTTATTTTCTAAGTCCACGTTTTATGTTACTTCTAGTACAATTTAGAATTATTGTAGAGGGTATCAAATAAAATAACAATAAGTTATCATTCATCAAGTTGAAAAGTAAAAGGGTGATTAAATTATAAGTTTAGTGATAGAGTTGTGTTTAATATAAAATTTAGATTAGTTGTAGAGTTGATGAAATAAAATAATTAACAACAACGTGGGATTTATGAATATGAGAAGTAAAGATATAATTAAATTATTATAAGTTAATCTTGGGATGTGAAGCAGGAGGAGTTAGGGAAAGGCAAGAAGGAAGGGGACAGGATTTTACAAAGGAAAGTGGGAGATTGTGGTGTAATTATATGTTTTTGTCCCAAAGTCCAACAATAATTTTCTCATATATGAAAACAATTTCCTCGGCTAGTAATAAATGGTGGGGTTTGATGTGTATTTTTATATATATATATATTTAAAATATCACCATTCAAAATAAAGGAACGTGTGTTTCTAAGAAATTTCGAACATACTTCTTCTTCTTTTTTTTTTTTGGTCGTCAAATCCTTTGAAAATAAGTTAGGTTTAGGAGTAATTTTTTAATAGTTAATATTTTTTAATAATTCTAAATAAATTTAATATTTTATTTTACACTCTATAAATAAAATTTCACAAAAGTTATGCTAGTTTTAAAAATTATCCTTAAATATGTTACTTTTTGGGTCTTTATATAGTCTAGTATACGTTTGATTCATAGATTTCAAAATATTATACATTTAGTCCTTAAATTTTGAGTTAATTTCAATTTAGTCACTAAGGCTCAAAATGTAACAATCTTAAATCCAATAAGGCATTTTTTTTTTCTTTTTAGATTGAGCTAAAAGTGTAGAAATGTCTATCGGGTAAAATAATAGATGTTTGTATGAGTCTATCGCAGTCTATCACAGATAGGAAATAAAATTTTGCTATATTTGTAATTTTATTTTTGAAAACAACAGTTTTTCTATTTTTGAAAACATTCCCTCAAAGAAATGTCTTGGCTTAATTTTTACTTTCATCATGAGTATACTTGCATTGGAAGAGTGTTTGGTTTAAATTTCTAAAATTGAAGTGTTTTTCATACTCTCAAAATAACTTTCAATAAAGAGAGTTTAAATAAAAATAATTTTTTAAGAAAACATTTTTTTTCTCTAGTGATGTTTATTGATCCAAATTCTAAAATCATGGAACCCTTTTGAAATTAGAAAAAGTTATGCTACTGATGGCGAATTATATTAAATCAAACAGAGTATTATAGTGTAAAATTGATGTTAAAACTTATAAAAGTTTGTAAAATGACAAAAATGAAATATAGGAAAAAGTTCTGAAAGAAGCAGCAATGGAAGATGGAACAATAATCTTGACGACGTTGAATGATGCATGGGCAGAGCCAGATTCACTCCTCGATCTGTTTCTTAAAAGCTTCCATATTGGAAACGGAACCCAAAGATTATTGAAGCACTTAGTCATAGTCACGTTGGACCAAAAGGCGTATTCTCGTTGCGTGTCCTTACACCCTCATTGTTATGAATTGGAAACTCAAGGAACCAACTTCTCCAGCGAAGCCTACTTCATGACCTCCGATTACTTGAAAATGATGTGGCGAAGAATCGAATTCCTTATCTCCGTACTCGAGATGGGTTACAGCTTCGTGTTCACCGTAAGTCACTGTTTGTTTGATGATATAATATTAAATTTACTTTTACTCATTAGAACAAGCTACTTTTAACGTTATTTTGTTCGGGAACACAGGATTCTGATATAATGTGGTTGCAAGACCCATTCAATCACTTCTACCCAGAGGCAGATTTTCAAATTGCTTGTGATTTGTTTTTGGGGAACTCAGAAGATTTAAACAATAGTCCCAATGGAGGGTTTGTGTACGTGAAAGCGAATCCAAAAACGGTACAATTCTACAAGTTTTGGTACCAATCAAGGACAATATATCCAGGTCAGCACGACCAAGATGTGCTGAACAAGATCAAACACAGTCCATTGATCCCTAAAATTGGGCTGAAATTAAGGTTTCTGGACACCGCGAATTTCGGAGGGTTCTGTCAGATGGGGAGGGACATGAGCAAGACGGCTACAATGCATGCCAATTGTTGCGTTGGACTAGAGAACAAAGTTCACGATCTCAGGATTTTGCTACAAGATTGGAATAACTTCTTTAATCCAACTGCGAATAACAAAGCTTCCCCTACCCCTTCATGGACTGTTCCTCAAGATTGCAGGTATATATATATATAATATATATATTACATTTATTATTACTATATGCTTTCTTTTTCTTTTCTAGCTAACTATTATAAATCTATGCAAAACCTTTACTTTTCATTCTTTTTTTTTTTTTTTTATTAGAGTTATTAATATTTTTGGATGGCAAATTCAGAACTTCATTTCAAAGAGGGAGGCAACCTAAGAAAACTGGGAACAGAAGGTTATCTTAGGAAGCACAAACTGGAATGATGATTTGAAGAGAGGAACAAAATTAAGAATATGTCTTGCAATCGTTCTTATTTTCCAAGATAAGTGCTGGATAATGTTTGCTTCCAATGCAATGGTTGCATTCCCCCATTCGATTTTTTAGTACACCATTTTTCTGTACATTGTTAAAAAGCCAAGAATGAAACACTGCTTTCCTTTTTTTCCTTTTTTAACTCCAATGGCAAAGGTCACATTTACCAAACCGTAATGAGAATTGGTGTAATATTTGGAATGAGATTTCATGAATGAATGAATTGTAATAATATAATCATCAGATTACTTTTAATTGAATTTACCTAAAATTATGAATATGTACCATTACCACTATCGTTACCGTTACTACTTAACCATAAAAGTAACAGTAAACTTTAATGTAACAATAAATGTGGCATATTTACAATATTTATACTAGCAATATCTATGACGGTAAAGGTAGTAAGTACCATATAATTTTAGACCATGCAGAATTTGGTTACAAATTTCAAAGTAAACCTACAACTCATTACGATTGTTAAATCTAAGTAAATGACAACAAAAATAATCAACTAAACCCACTTTTTATATTACGACTATCTTTCTGGCCATGGTGGTTTCACATTTTCTTAATTTTCTTAGTTTTCCCTTACTCATAACCAATTGAGAGTATTTGACTCATATTGGCTATATTTAATGTACTTGTGTGATTTTTATTTTTTATTTTATTTTTACCCTTAGGACAAAATGAGTGGATTTTTTTTTTTTAAAAAAAAAAAAAAAAAATTTGTGATGTGTCTAATTTTACCAGGCAAGTATGGAACTCGTCAATATGTTGACTTTTTTGTTTTCTAATAACTAATTGAGTATATTATACATATACCTTTGACAATATAACCATCATTATTAAGTAAAAAATCATATTCCGAATGACATCGTAGTTCGATTAAGATAACCCCACTTCTTAGTAGTTTAAAAATATTCTTAGACCGAGTTGGAAAGGGTCTGTTTGGTAATAAATTTAAAAAAAAAAAAAATTGAAAAAAATAAAATAACATTATGTCCAAAGTACTTGATTTAGAAATTAGATTTTAATCTAATTTAAACAATTGTTTTAAATGTATTTGATACTATATTTATAAATCATAGAATTAGTTGTTAGCTACTTGCTAAATATTAGGTTGACTATAAATTGTTTTGACAAACATTCTATAAAAGTTTAATTTGTTGAAAATGAAATGTGTATTCATATTGATTTCAATATAGTGAGTATAATCTCTAATATTTCATCATCTAATTCTTTCTGATCTAGAAAATTAAACTTGTCTTGAGAACTTTACAGCATGTTTTCAAGGTTGATTGTCTCGAAGGCTTTGCAGCTCGTTCTCGAATTGATTTGTCTTAAATGATTTACAACTTGTTCTTAAGAGTTAGGGGTTTTGTTTTCTAATTCTCTCCATAGTTTTCTCCAAAGCCCCTTACTTTTGCTCTGGATTATCTCTGTAGTGTATGACTTTAAAAGCTTTGAAGAGTCTTCTAATCTTCAGTCTTTAGATAATCAAGAGATCATCAAAAGATGATCTTTGGAACTTGAGAATTTCAAAGCTTTAGTATAAGAAACTTCGGAGCTTCATAGCTTCGATTGTCATACTCCACCCTAAACTACCCTTCTGAACCTAAAACGACTACCCTTTTGAACCTAAAATGAGATGTGATGACAGTAGTTTCGAAGTGTGCAATAAAATCTATTGTCGAATTTTTCTTCTTATTTCTAGACATGCGTGCTACCATCAAAATTCTCCTATAACCTTAAACTTGGACTTAATACATCACTTACACATTTCACAATTACTAAAACACCATGCTTCGCTACATGATGCAACAAACTCATGAGTTGAGTGGCAGCTTTTACAAAAATATTTCATATCATAAGTAGTGGTTTATGAATATGCAAATGTTTCGTATCAAAATCATTATGAAATATTGATTTATCAACTCATGATATTTTTTATACAGCAAATTCCATTGCACACTTGGAATCTATTATCGTCCATCTCGTCCTAGATTCAAGAGGGTAATTTAGGGTGGTGTGACATGAATCTTCAAAACTTGAGAGCTTTGGAGCTTTGGTCTTTAGAACTTTAGAGCTTCAAAACTTCCGTTTTTAGAAATTGAGAGCTTTAAAACTTTCTTTTTTAGAATATGAGAGCTTTAAAGTTCCAGTCTTTAGAGTTTGAGGAGAGTTATATTCTTCAAATCGTCAGAGCTTCAATATATGAGATTTAAGATCTTCGGAGCTTCAGTTCTTCCTACCTTTTAAAAATGAAAGGGTAAGGTTCGTATTTATAGAGTTTCCATGGGCTTTAAGTGCTTGGATTTGGTTGGTTCATGGATCTAACCCTTGGGCCCAATATATTAGGTTTTGGAATGGGTTAAGTTTTTGGGGCAAATTAAACCTAATGTTTGGGCTCAATTGATTAATTCAATCCAAAAAGACACACAACATCATTGGGAACTACCAACTTTTATCTTCAATTTTGATTTAGGACACACATCAACCTTTAATTTATAATTTTACCTTTAATTTGAGGTATGATGTAGTGATTTGTGATTTATCCAAAATTTCTCATTTAATAAAACTATTATTAGTTAATTTGGAACATATTTTATGTTTTTTTTTTTTTTTTTTGCATAATTATTATTTTGTAATATTATATAATTCATAATACATATTTTTAATTTTTTTAAAAAATGAATTGAGTTTATAAAATGGAATATTGTACTTCTAATTGTTTTTATCTTTATTTAGTATAATTATGCATTTAAAAATTTTAAATTCAAATTTTGTGTTCACTGAAAACATAAAAATTTTACACCGTTTGGATCCCAGAATCTAAAAATAAATTTAAAAAATATAATTTGAATTATCTGTCAAACACATACTTGTTCAAGAATTTAAAAATAAAAAATAAATCAAGATTCCAACCAAGCATACCTTAAATCTCCCAACTTTTCAATAATATTGTCTTGCTTAATATGGGATAGTTTTCAAAATTTTATGCAGTTTGCTAAAATGGAATTTTATTTCAGTCATGACATAAATTTCGTGTTTTGTAAAAGACTATGATTTAGTTTGATATATAATTGATATATTAAAAAAAATAAAATGTTTTAAAAAACTAAGTTAAGGTTGAAAAATTAAAAGTAGTTTTCAAATATTTGCTTTTATTTTTATAAATTAACTAATAATTCAAGTGTTTCGCTGATGAATGTGAAAGTCATAGTACGAACGTTGAAAAAAAATTATAAAATCTAAAATAAAATATTTATCAAAATAGTGATTCTAAATTAAAAAAAAAAAAAAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAAAAAAAAAAAAACAAAAAAAAAACAAAAATGACGTTTATGTTAGCTGCCCATTACACAAAATTTCCATTAGAAATCTATATTTTATAGATTTCAGAGAGAAACATAGACGAGCGTTCTTTATTTATTTTTATTTTTTATTTTTAATTATTTTTTAAAAAGTGGATTTTGGGTTTAGGGTTATTGCAAATTGGGCCAAGAAAGAACGGAGGATACCCAAACATAGATATCAATATCATCGTCATCTCTGCGGGCATCAACCAACTGGAAAGTACACCAAGCGGCGAACATTTCATCTCTTCCGGTGATCATCAAGCAACCTGAAGGTCTGGGCACTTGGCAGGCAAGGACCTGACCTCAGAGGGCGGTGAAGACGTCCTACGATGTCGGATCCATATGCAGGGGCGAAAGGAGGTAGGCTCACCTTCAAGGGAGGAGCCTTAGCCTCTCGTAGCAAGGATATTGACAAGAAGAAGAAGAAGAAGAAGAAAGAAAAAAGTAAAACCGACGATAACCCTACGGACGAGTCCGAGATTTTGACGTCGGCTGATGGTGTAGAAGGTGGAGACGGAGCCATCTATACTATTGATGCGGCCAAGCGTATGAAGTATGAGGAGCTATTCCCTGTGGAGACCAGGAAGTTTGGTTACGATCCTAACAACTCCAATACCAAGTTCAAGTCTGTGGAGGATGCTCTCGATGACCGTGTCAAGAAGAAGGCGGATCGTTACTGTAAATAATCAGGTCACTGCTCTCGCTATTCTGCTTTAGTTTGTCTCTGCATTCCTTCTCGCAATTCCATAAATGCAGTTTGTATTTCCTGCCTGATTTTGATGGATACTTTAGTTAGGATTTTCAATTAGATGCGCAATTTATTATAAGCATTAAGCATATTAGGTTAAGATTGTACGAATTAGGTTTAGTGTTGGTTTCTAACGAAAGAAATGTTTGTCCAATGTGCCTTATACTCGGAGAATGTCTGAATTTTGATTGCCTCTTTTATGAATCCTTGAATAAAATGCGCAGAAAGGAATCTGTTATATTTTCCTCTAGCCTTATACTGTTGGAGAACAGGAAGTTGGATCTGAACTAGCGAATTTCTGATCCACTCCTATAGAAAGGCCAAGGATTCTCTAGTCTTGTACCGCCAGCTTCTTTTCTGGATAGTAGTTTTGCATACTCATTGTGTATTTTTAGTTCCCAGCCGTTCATGACTTCATTAATTATCGAAGCTGCACTTTAACTGTTCCCTTTGGCCTATGGTGAGGAACATCGTTTGTTTCCATCCAAGTACAAAGGCAAGGCAGTTCAGGTACCTAACTATACTCCAGAAGAAAATGGAGATACGGAAGAAGTTGGACATTGGTCCTCCGACTTGTTAATTTGTAAAACAAGGTAAAATCGTGCTAATCATTTGTGAATGATAGCTGTGAAGTAATATAGACGGTGCAGATATAACCCTGGATTACAAAGGAGGTCTCTGAGTTGTCAGTAACTAGCATATAAGGTTATATAGCAAGTATCCTCCGCAAATGCCATAAAGTGGAGTGGCCCTAAGTGAATCTTTTGCCCAACACGCTTGAAGTCCCCAGATTTATCTGCAGATGTTCGTCTATTATCACCTTCTAGATTCACCAGTAGTGTGCCCCAATAGCATTAGTTGGATTTTTTTAATCTAGCCTCGAGTGTTGGAGATCAACAAGTCTTGGACATCGAACTGATCTCTCTCTCCTTCTCTCTCTTCTGGGGAGAACATGTTTATGACCAGCCTTTGTTTATAACTTGAACCACGTGGAGATGTGCTTAGGCATATGTTAGTTACCTGCTTGACATGATATGCTGATTTAGACTAGTTAACTCATGTTAGGCATAGCCTAAGGCAACAAGAAAAAAAAAGGAGAGCGAAAGGGAGTTAGTTTGGAACTGTAGGCACCTCTAATTTTTTAAGTTTCTTTTTCCTCTATAGACTTCTTGTCTTGGGGGAAGGGCGGAACAGCCTCTGATCATATTGCAGTAGGTTTGCATTTGATTAATGACCTATTCTGTCTTAATGTTCTGTCTGTGCCTGCCCCAATTCCTATAACGAACAATAAATTATTAGAAATAATGGAACTTTTAGCTTTTGTGGTAAAGTGTAGAGAAGGCACCAACCATCATGGGTCAGCTTAGTGGTAAAAAAAGAGACATAGTCTAAGAGGTCATGGGTTCAATCCATGGTGGCCACCTACCTAATAATTAATATCGTACAAGTTTCCTTGACACCCAAATGTTATGGGGTCAGATGGGTTGTCTTGTGAGATTAGTCGAGGTGCGCATAAGTTGGCCTGGACACTCATGGATATAAAAAAAAGTGTTAGTCTAAGAGGTCATGAGTTCAAGGTGGCCACCTACCTAATAATTAATATCCTACGAGTTTCCTTGACACCCAAATGTTGAGGGTCAGGCGGGTTATCTTGTGAGTATAGTCGAGGTGCGCATAAGTTAGCCTGGACACTCATGGATATAAAAAAAGAAAAGTGTAGAGAAGGTGCCACATGGGTAGGTAAATGTTATCTTGGGCAGTGTCGTTGGTGTTATTATTATTGGTAAGAAACCAAACTTTCATTGAGAAAAAATGAAAAGAATACAAGAGTCTACAAAAGAGCTCGACAAGAAAGTAGAAAGCAAGAGGGAGCCAAACGTAACTACAAAAAGAGGTTCCAATCCAACAGAATCAAACTAAGATCATAGTCACTTGTGGTCTTAGTTAAAATCAGGGTACCAGTAATACTTCACTGGATGCCTGATCTGATCTCTACTTCCTCCATATCTTACTATTATTCTCTAGTTCTTTGATAATCTTAATCTCTTCAAGTTTTCTTCAAGTGTGGTTCTTATCCCTTTCTGGGAGAACACCTGCAAACTGCACCGCTAGTGATTTGCGCAATTTATATCATATTTCTATGTTTGAAAAATGGTATATGTAAATTACAAGAAAAATACTTTTTTGCATAGTCTTTAAAAGGGGGAATGCAAAGATTGCCTAATTTGAATGGTAGAATTGAACCTTACGCAGGTTCAAACTAATCTGCGAGTTTGTCTGAGGACAAAATGCATTACACGTGTTGCTTCAATGCGGCAGTGTACATTACCTTTTGCTCTACATTTAATCACTCATGATCTTAGTGCATAAATGTGAAAAACTAGCGTCCTTCCGTACTTCAATGAATTGATGTTTGTTTCACGTTCGTGTGGAAGCTTTTGGTCCTGGAAGTCTTCCTTCATCTTGTTCATGTTCGCCATAACACTCATATCGTAGTTGTTTGTTGCAGGCATCGGTCATAGTAGAGATTCCAGGTATTTCTTGTAAGGATGCTCATCATGCTCCACTAAGCAACGGGTTTAGCATCACTATGCCTAAGATTAACGGGTTTAGCATCACTATGCCTAAGATTAACGGGTTTCATGGTATTGTGATCTATATAGACTCATGGACCCATTACCATTGAAGAAATTTTAGTTTCGTGCTGATTGTAATTACTTCCATCCTCCCCTCTGATTGTTTTGCTATCCCACACCGTGTCTGATGGGTCAAATTTGTACCGGCGAACATGAGAAGATGGCTTAATCTTTCGTTGTCAGTATTTTGTATGGATATTTCTTACAATTCAGGAACCAATTAATATAGCATGGATTGAAATATAAATCTTCCTGACTTGCACAAAAGTCATTGTAATTGGTTAAAGTTAAGAGTATGGAAAAGCAGGCTATACAGGAATACTTTTATTGTTACATACACGTTTACATCAATG

mRNA sequence

ATGGAATTAAATAATTCCGCCGCCGACTGCGAACCTGCCGCCAAGTCCTCGGCTCCCTCGGCCGTTCATACGGCGGTGGTTCCATGGAGGACGGTGAGGATCTCGGTTGTGTTGGTGGGCCTTATGTTGGGCCTCCTTGTTCTCTACAACTCAGCCATTAATCCTTTCAAATTTCTTCCTGTTTCCTACACCTACCGTGCTTTTCGATCCTCTTCTCCTCACAAAGATCTTCTTTTGGAAAAAGTTCTGAAAGAAGCAGCAATGGAAGATGGAACAATAATCTTGACGACGTTGAATGATGCATGGGCAGAGCCAGATTCACTCCTCGATCTGTTTCTTAAAAGCTTCCATATTGGAAACGGAACCCAAAGATTATTGAAGCACTTAGTCATAGTCACGTTGGACCAAAAGGCGTATTCTCGTTGCGTGTCCTTACACCCTCATTGTTATGAATTGGAAACTCAAGGAACCAACTTCTCCAGCGAAGCCTACTTCATGACCTCCGATTACTTGAAAATGATGTGGCGAAGAATCGAATTCCTTATCTCCGTACTCGAGATGGGTTACAGCTTCGTGTTCACCGATTCTGATATAATGTGGTTGCAAGACCCATTCAATCACTTCTACCCAGAGGCAGATTTTCAAATTGCTTGTGATTTGTTTTTGGGGAACTCAGAAGATTTAAACAATAGTCCCAATGGAGGGTTTGTGTACGTGAAAGCGAATCCAAAAACGGTACAATTCTACAAGTTTTGGTACCAATCAAGGACAATATATCCAGGTCAGCACGACCAAGATGTGCTGAACAAGATCAAACACAGTCCATTGATCCCTAAAATTGGGCTGAAATTAAGGTTTCTGGACACCGCGAATTTCGGAGGGTTCTGTCAGATGGGGAGGGACATGAGCAAGACGGCTACAATGCATGCCAATTGTTGCGTTGGACTAGAGAACAAAGTTCACGATCTCAGGATTTTGCTACAAGATTGGAATAACTTCTTTAATCCAACTGCGAATAACAAAGCTTCCCCTACCCCTTCATGGACTGTTCCTCAAGATTGCAGAACTTCATTTCAAAGAGGGAGGCAACCTAAGAAAACTGGGAACAGAAGGGTTATTGCAAATTGGGCCAAGAAAGAACGGAGGATACCCAAACATAGATATCAATATCATCGTCATCTCTGCGGGCATCAACCAACTGGAAAGTACACCAAGCGGCGAACATTTCATCTCTTCCGACGTCCTACGATGTCGGATCCATATGCAGGGGCGAAAGGAGGTAGGCTCACCTTCAAGGGAGGAGCCTTAGCCTCTCGTAGCAAGGATATTGACAAGAAGAAGAAGAAGAAGAAGAAAGAAAAAAGTAAAACCGACGATAACCCTACGGACGAGTCCGAGATTTTGACGTCGGCTGATGGTGTAGAAGGTGGAGACGGAGCCATCTATACTATTGATGCGGCCAAGCGTATGAAGTATGAGGAGCTATTCCCTGTGGAGACCAGGAAGTTTGGTTACGATCCTAACAACTCCAATACCAAGTTCAAGTCTGTGGAGGATGCTCTCGATGACCGTGTCAAGAAGAAGGCGGATCGCATCGGTCATAGTAGAGATTCCAGGTATTTCTTGTAAGGATGCTCATCATGCTCCACTAAGCAACGGGTTTAGCATCACTATGCCTAAGATTAACGGGTTTAGCATCACTATGCCTAAGATTAACGGGTTTCATGGTATTGTGATCTATATAGACTCATGGACCCATTACCATTGAAGAAATTTTAGTTTCGTGCTGATTGTAATTACTTCCATCCTCCCCTCTGATTGTTTTGCTATCCCACACCGTGTCTGATGGGTCAAATTTGTACCGGCGAACATGAGAAGATGGCTTAATCTTTCGTTGTCAGTATTTTGTATGGATATTTCTTACAATTCAGGAACCAATTAATATAGCATGGATTGAAATATAAATCTTCCTGACTTGCACAAAAGTCATTGTAATTGGTTAAAGTTAAGAGTATGGAAAAGCAGGCTATACAGGAATACTTTTATTGTTACATACACGTTTACATCAATG

Coding sequence (CDS)

ATGGAATTAAATAATTCCGCCGCCGACTGCGAACCTGCCGCCAAGTCCTCGGCTCCCTCGGCCGTTCATACGGCGGTGGTTCCATGGAGGACGGTGAGGATCTCGGTTGTGTTGGTGGGCCTTATGTTGGGCCTCCTTGTTCTCTACAACTCAGCCATTAATCCTTTCAAATTTCTTCCTGTTTCCTACACCTACCGTGCTTTTCGATCCTCTTCTCCTCACAAAGATCTTCTTTTGGAAAAAGTTCTGAAAGAAGCAGCAATGGAAGATGGAACAATAATCTTGACGACGTTGAATGATGCATGGGCAGAGCCAGATTCACTCCTCGATCTGTTTCTTAAAAGCTTCCATATTGGAAACGGAACCCAAAGATTATTGAAGCACTTAGTCATAGTCACGTTGGACCAAAAGGCGTATTCTCGTTGCGTGTCCTTACACCCTCATTGTTATGAATTGGAAACTCAAGGAACCAACTTCTCCAGCGAAGCCTACTTCATGACCTCCGATTACTTGAAAATGATGTGGCGAAGAATCGAATTCCTTATCTCCGTACTCGAGATGGGTTACAGCTTCGTGTTCACCGATTCTGATATAATGTGGTTGCAAGACCCATTCAATCACTTCTACCCAGAGGCAGATTTTCAAATTGCTTGTGATTTGTTTTTGGGGAACTCAGAAGATTTAAACAATAGTCCCAATGGAGGGTTTGTGTACGTGAAAGCGAATCCAAAAACGGTACAATTCTACAAGTTTTGGTACCAATCAAGGACAATATATCCAGGTCAGCACGACCAAGATGTGCTGAACAAGATCAAACACAGTCCATTGATCCCTAAAATTGGGCTGAAATTAAGGTTTCTGGACACCGCGAATTTCGGAGGGTTCTGTCAGATGGGGAGGGACATGAGCAAGACGGCTACAATGCATGCCAATTGTTGCGTTGGACTAGAGAACAAAGTTCACGATCTCAGGATTTTGCTACAAGATTGGAATAACTTCTTTAATCCAACTGCGAATAACAAAGCTTCCCCTACCCCTTCATGGACTGTTCCTCAAGATTGCAGAACTTCATTTCAAAGAGGGAGGCAACCTAAGAAAACTGGGAACAGAAGGGTTATTGCAAATTGGGCCAAGAAAGAACGGAGGATACCCAAACATAGATATCAATATCATCGTCATCTCTGCGGGCATCAACCAACTGGAAAGTACACCAAGCGGCGAACATTTCATCTCTTCCGACGTCCTACGATGTCGGATCCATATGCAGGGGCGAAAGGAGGTAGGCTCACCTTCAAGGGAGGAGCCTTAGCCTCTCGTAGCAAGGATATTGACAAGAAGAAGAAGAAGAAGAAGAAAGAAAAAAGTAAAACCGACGATAACCCTACGGACGAGTCCGAGATTTTGACGTCGGCTGATGGTGTAGAAGGTGGAGACGGAGCCATCTATACTATTGATGCGGCCAAGCGTATGAAGTATGAGGAGCTATTCCCTGTGGAGACCAGGAAGTTTGGTTACGATCCTAACAACTCCAATACCAAGTTCAAGTCTGTGGAGGATGCTCTCGATGACCGTGTCAAGAAGAAGGCGGATCGCATCGGTCATAGTAGAGATTCCAGGTATTTCTTGTAA

Protein sequence

MELNNSAADCEPAAKSSAPSAVHTAVVPWRTVRISVVLVGLMLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHKDLLLEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVLEMGYSFVFTDSDIMWLQDPFNHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMGRDMSKTATMHANCCVGLENKVHDLRILLQDWNNFFNPTANNKASPTPSWTVPQDCRTSFQRGRQPKKTGNRRVIANWAKKERRIPKHRYQYHRHLCGHQPTGKYTKRRTFHLFRRPTMSDPYAGAKGGRLTFKGGALASRSKDIDKKKKKKKKEKSKTDDNPTDESEILTSADGVEGGDGAIYTIDAAKRMKYEELFPVETRKFGYDPNNSNTKFKSVEDALDDRVKKKADRIGHSRDSRYFL
Homology
BLAST of CaUC04G078450 vs. NCBI nr
Match: XP_038894961.1 (uncharacterized protein At4g15970-like [Benincasa hispida])

HSP 1 Score: 661.4 bits (1705), Expect = 6.8e-186
Identity = 327/377 (86.74%), Postives = 346/377 (91.78%), Query Frame = 0

Query: 4   NNSAADCEPA--AKSSAPSAVHTAVVPWRTVRISVVLVGLMLGLLVLYNSAINPFKFLPV 63
           NNSAAD  PA   K SAPSAVHT  V WR  R SVV VG++LGLLVLYNS INPFKFLPV
Sbjct: 3   NNSAADGGPAGNGKLSAPSAVHTTAVTWRMARTSVVFVGVILGLLVLYNSTINPFKFLPV 62

Query: 64  SYTYRAFRSSSPHKDLLLEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNG 123
           S TYRAFR S+PHKD LLEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNG
Sbjct: 63  SDTYRAFRFSAPHKDPLLEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNG 122

Query: 124 TQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFL 183
           TQRLLKHLVIVTLD+KAYSRCV+LHPHCYEL TQGTNFSSEAYFMT DYLKMMWRRIEFL
Sbjct: 123 TQRLLKHLVIVTLDEKAYSRCVALHPHCYELNTQGTNFSSEAYFMTPDYLKMMWRRIEFL 182

Query: 184 ISVLEMGYSFVFTDSDIMWLQDPFNHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKA 243
            SVL+MGYSFVFTDSDIMWLQDPFNHFYP+ADFQIACD F+GNSEDLNN+PNGGFVYVKA
Sbjct: 183 TSVLQMGYSFVFTDSDIMWLQDPFNHFYPDADFQIACDFFMGNSEDLNNNPNGGFVYVKA 242

Query: 244 NPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMGRD 303
           NPKTV+FYKFWY+SRTIYPG+HDQDVLNKIKHSPLI KIGLKLRFLDTANFGGFCQMGRD
Sbjct: 243 NPKTVKFYKFWYESRTIYPGKHDQDVLNKIKHSPLISKIGLKLRFLDTANFGGFCQMGRD 302

Query: 304 MSKTATMHANCCVGLENKVHDLRILLQDWNNFFNP-TANNK-ASPTPSWTVPQDCRTSFQ 363
           M+K AT+HANCCVGLENKVHDLRILLQDW+NFFNP TA+NK AS TPSWTVPQDCRTSFQ
Sbjct: 303 MNKMATVHANCCVGLENKVHDLRILLQDWSNFFNPTTADNKLASSTPSWTVPQDCRTSFQ 362

Query: 364 RGRQ---PKKTGNRRVI 374
           RGRQ    K TG+RR++
Sbjct: 363 RGRQRKDNKNTGDRRLL 379

BLAST of CaUC04G078450 vs. NCBI nr
Match: XP_008438689.1 (PREDICTED: uncharacterized protein At4g15970-like [Cucumis melo])

HSP 1 Score: 654.8 bits (1688), Expect = 6.4e-184
Identity = 323/378 (85.45%), Postives = 341/378 (90.21%), Query Frame = 0

Query: 1   MELNNSAADCEPAAKSSAPSAV----HTAVVPWRTVRISVVLVGLMLGLLVLYNSAINPF 60
           M+ NNSAAD E A K S PS V     T+VV WRTVR+SVVLVG+ LGL VLYNSAINPF
Sbjct: 1   MKNNNSAADGEQAGKLSVPSVVPTTTTTSVVTWRTVRVSVVLVGVTLGLFVLYNSAINPF 60

Query: 61  KFLPVSYTYRAFRSSSPHKDLLLEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSF 120
           KFLPVSYTYRAFR SSPHKD +LEKV+KEAAMEDGTII+TTLNDAWAEPDSL DLFLKSF
Sbjct: 61  KFLPVSYTYRAFRFSSPHKDPILEKVVKEAAMEDGTIIITTLNDAWAEPDSLFDLFLKSF 120

Query: 121 HIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWR 180
           H+GNGTQRLLKHLVIVTLDQKAYSRCV+LHPHCY+L+TQGTNFSSEAYFMTSDYLKMMWR
Sbjct: 121 HVGNGTQRLLKHLVIVTLDQKAYSRCVALHPHCYQLDTQGTNFSSEAYFMTSDYLKMMWR 180

Query: 181 RIEFLISVLEMGYSFVFTDSDIMWLQDPFNHFYPEADFQIACDLFLGNSEDLNNSPNGGF 240
           RIEFLI VLEMG+SFVFTD+DIMWLQDPFNHFY EADFQIA D +LGN EDLNN PNGGF
Sbjct: 181 RIEFLIYVLEMGHSFVFTDTDIMWLQDPFNHFYKEADFQIASDSYLGNPEDLNNVPNGGF 240

Query: 241 VYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFC 300
           VYV+ANPKTV+FYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIG+KLRFLDTANFGGFC
Sbjct: 241 VYVRANPKTVKFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGMKLRFLDTANFGGFC 300

Query: 301 QMGRDMSKTATMHANCCVGLENKVHDLRILLQDWNNFFNPT--ANNKASPTPSWTVPQDC 360
           QMGRDMSK AT+HANCCVGLENKVHDLRILLQDWNNFFN T   N   S TPSWTVPQDC
Sbjct: 301 QMGRDMSKMATVHANCCVGLENKVHDLRILLQDWNNFFNRTIAGNKSPSSTPSWTVPQDC 360

Query: 361 RTSFQRGRQ---PKKTGN 370
           RTSFQRGRQ    KKTGN
Sbjct: 361 RTSFQRGRQHKDDKKTGN 378

BLAST of CaUC04G078450 vs. NCBI nr
Match: XP_004137392.1 (uncharacterized protein At4g15970 [Cucumis sativus] >KGN63967.1 hypothetical protein Csa_014220 [Cucumis sativus])

HSP 1 Score: 636.7 bits (1641), Expect = 1.8e-178
Identity = 313/375 (83.47%), Postives = 336/375 (89.60%), Query Frame = 0

Query: 4   NNSAADC-EPAAKSSAPSAVHTAVVPWRTVRISVVLVGLMLGLLVLYNSAINPFKFLPVS 63
           NNSA D  E A K SAPS   T    WRTVR+SVVLVG+ LGL VLYNSAINPFKFLP S
Sbjct: 3   NNSAVDGEEQAGKLSAPSVAPTTGATWRTVRVSVVLVGVTLGLFVLYNSAINPFKFLPAS 62

Query: 64  YTYRAFRSSSPHKDLLLEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGT 123
           Y YRAFR SSPHKD +LEKV+KEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGT
Sbjct: 63  YAYRAFRFSSPHKDPILEKVVKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGT 122

Query: 124 QRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLI 183
           QRLLKHLVIVTLDQKAYSRCV++HPHCY+L+TQGTNFSSEAYFMT+DYLKMMWRRIEFLI
Sbjct: 123 QRLLKHLVIVTLDQKAYSRCVAVHPHCYQLDTQGTNFSSEAYFMTADYLKMMWRRIEFLI 182

Query: 184 SVLEMGYSFVFTDSDIMWLQDPFNHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKAN 243
            VLEMG+SFVFTD+DIMWLQDPFNHFY +ADFQIA DL+LGN E+LNN PNGGFVYV+AN
Sbjct: 183 YVLEMGHSFVFTDTDIMWLQDPFNHFYKDADFQIASDLYLGNPENLNNVPNGGFVYVRAN 242

Query: 244 PKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMGRDM 303
            +TV+FYKFWY+SRTIYPGQHDQDVLNKIKHSPLIPKIG+KLRFLDTANFGGFCQMGRDM
Sbjct: 243 HRTVKFYKFWYESRTIYPGQHDQDVLNKIKHSPLIPKIGMKLRFLDTANFGGFCQMGRDM 302

Query: 304 SKTATMHANCCVGLENKVHDLRILLQDWNNFFNPTANNKASP--TPSWTVPQDCRTSFQR 363
           SK ATMHANCCVGLENKVHDLRILLQDWN+FFN T  +  SP  T SWTVPQDC+TSFQR
Sbjct: 303 SKMATMHANCCVGLENKVHDLRILLQDWNSFFNQTTGDNKSPSSTHSWTVPQDCKTSFQR 362

Query: 364 GRQ---PKKTGNRRV 373
           GRQ    KK GNRR+
Sbjct: 363 GRQHKDDKKPGNRRL 377

BLAST of CaUC04G078450 vs. NCBI nr
Match: KAG6607414.1 (hypothetical protein SDJN03_00756, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 631.3 bits (1627), Expect = 7.5e-177
Identity = 324/485 (66.80%), Postives = 368/485 (75.88%), Query Frame = 0

Query: 6   SAADCEPAAKSSAPSAVHTAVVPWRTVRISVVLVGLMLGLLVLYNSAINPFKFLPVSYTY 65
           ++ D +P+   +AP   HTAVV W+TVR+SV   G++LGL+VLYNSAINPF  LPVSY+Y
Sbjct: 4   TSGDVQPS--GAAPFGAHTAVVRWKTVRLSVAFFGVILGLVVLYNSAINPFNILPVSYSY 63

Query: 66  RAFRSSSPHKDLLLEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRL 125
           RAFRS S  ++ LLEK L +A+ ED T+ILTTLN AWA PDSLLDLFLKSFH GNGTQRL
Sbjct: 64  RAFRSYSSLRNPLLEKALTKASNEDKTVILTTLNAAWAAPDSLLDLFLKSFHSGNGTQRL 123

Query: 126 LKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVL 185
           LKHLVIV LD KAY RCV+ HPHCY+L+T+G NFS EAYFMT+DYLKMMWRRI+FL SVL
Sbjct: 124 LKHLVIVCLDAKAYQRCVASHPHCYQLDTEGANFSGEAYFMTADYLKMMWRRIQFLTSVL 183

Query: 186 EMGYSFVFTDSDIMWLQDPFNHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKT 245
           EMG+SFVFTDSDIMWLQDPFNHF+P+ADFQIACD F G+SEDLNN PNGGFVYVK+N KT
Sbjct: 184 EMGFSFVFTDSDIMWLQDPFNHFHPDADFQIACDTFKGSSEDLNNRPNGGFVYVKSNTKT 243

Query: 246 VQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMGRDMSKT 305
           ++FYKFWY+SRT++PG+HDQDVLNKIKHSPLIP+IGLK+RFLDTANFGGFCQMGRD +K 
Sbjct: 244 IRFYKFWYESRTMFPGRHDQDVLNKIKHSPLIPEIGLKIRFLDTANFGGFCQMGRDFTKV 303

Query: 306 ATMHANCCVGLENKVHDLRILLQDWNNFFNPTANNKASPTPSWTVPQDCRTSFQRGRQPK 365
            T+HANCCVGL+NKVHDLRILL DW+ F     N+KAS  PSW+VPQDCRTSFQRGRQ K
Sbjct: 304 CTVHANCCVGLDNKVHDLRILLNDWSKF----VNHKASSRPSWSVPQDCRTSFQRGRQSK 363

Query: 366 KTGNRRVIANWAKKERRIPKHRYQYHRHLCGHQPTGKYTKRRTFHLFRRPTMSDPYAGAK 425
                                                                    GAK
Sbjct: 364 H--------------------------------------------------------GAK 423

Query: 426 GGRLTFKGGALASRSKDIDKKKKKKKKEKSKTDDNPTDESEILTSADGVEGGDGAIYTID 485
           GGRLTFKGG LASRSK ID KKKKKKK KSK D+NPT E EIL SADG +GG G +YTID
Sbjct: 424 GGRLTFKGGVLASRSKAID-KKKKKKKGKSKGDENPTVEGEILLSADGADGGVGEVYTID 425

Query: 486 AAKRM 491
           AAKRM
Sbjct: 484 AAKRM 425

BLAST of CaUC04G078450 vs. NCBI nr
Match: XP_023524447.1 (uncharacterized protein At4g15970-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 572.4 bits (1474), Expect = 4.2e-159
Identity = 272/360 (75.56%), Postives = 312/360 (86.67%), Query Frame = 0

Query: 6   SAADCEPAAKSSAPSAVHTAVVPWRTVRISVVLVGLMLGLLVLYNSAINPFKFLPVSYTY 65
           ++ D +P+   +AP   HTAVV W+TVR+SV   G++LGLLVLYNSAINPF  LPVSY+Y
Sbjct: 60  TSGDVQPS--GTAPFGAHTAVVRWKTVRLSVAFFGVILGLLVLYNSAINPFNILPVSYSY 119

Query: 66  RAFRSSSPHKDLLLEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRL 125
           RAFRS S  ++ LLEK L +A+ ED T+ILTTLN AWAEPDSLLDLFLKSFH GNGTQRL
Sbjct: 120 RAFRSYSSLRNPLLEKALTKASNEDKTVILTTLNAAWAEPDSLLDLFLKSFHSGNGTQRL 179

Query: 126 LKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVL 185
           LKHLVIV LD KAY RCV+ HPHCY+L+T+G NFS EAYFMT+DYLKMMWRRI+FL SVL
Sbjct: 180 LKHLVIVCLDAKAYQRCVASHPHCYQLDTKGANFSGEAYFMTADYLKMMWRRIQFLTSVL 239

Query: 186 EMGYSFVFTDSDIMWLQDPFNHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKT 245
           EMG+SFVFTDSDIMWLQDPFNHF+P+ADFQIACD F G+SEDLNN PNGGFVYVK+N KT
Sbjct: 240 EMGFSFVFTDSDIMWLQDPFNHFHPDADFQIACDTFKGSSEDLNNRPNGGFVYVKSNTKT 299

Query: 246 VQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMGRDMSKT 305
           ++FYKFWY+SRT++PG+HDQDVLNKIKHSPLIP+IGLK+RFLDTANFGGFCQMGRD +K 
Sbjct: 300 IRFYKFWYESRTMFPGRHDQDVLNKIKHSPLIPEIGLKIRFLDTANFGGFCQMGRDFTKV 359

Query: 306 ATMHANCCVGLENKVHDLRILLQDWNNFFNPTANNKASPTPSWTVPQDCRTSFQRGRQPK 365
            T+HANCCVGL+NKVHDLRILL DW+ F     N+KAS  PSW+VPQDCRTSFQRGRQ K
Sbjct: 360 CTVHANCCVGLDNKVHDLRILLNDWSKF----VNHKASSRPSWSVPQDCRTSFQRGRQSK 413

BLAST of CaUC04G078450 vs. ExPASy Swiss-Prot
Match: P0C042 (Uncharacterized protein At4g15970 OS=Arabidopsis thaliana OX=3702 GN=At4g15970 PE=2 SV=1)

HSP 1 Score: 319.7 bits (818), Expect = 6.4e-86
Identity = 149/277 (53.79%), Postives = 196/277 (70.76%), Query Frame = 0

Query: 79  LEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKA 138
           L K+L EAA ED T+I+TTLN AW+EP+S  DLFL SFH+G GT+ LL+HLV+  LD++A
Sbjct: 42  LGKILTEAATEDKTVIITTLNKAWSEPNSTFDLFLHSFHVGKGTKPLLRHLVVACLDEEA 101

Query: 139 YSRCVSLHPH-CYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVLEMGYSFVFTDSD 198
           YSRC  +HPH CY ++T G +F+ +  FMT DYLKMMWRRIEFL ++L++ Y+F+FT   
Sbjct: 102 YSRCSEVHPHRCYFMKTPGIDFAGDKMFMTPDYLKMMWRRIEFLGTLLKLRYNFIFT--- 161

Query: 199 IMWLQDPFNHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRT 258
                 PF     E DFQIACD + G+ +D++N+ NGGF +VKAN +T+ FY +WY SR 
Sbjct: 162 -----IPFPRLSKEVDFQIACDRYSGDDKDIHNAVNGGFAFVKANQRTIDFYNYWYMSRL 221

Query: 259 IYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMGRDMSKTATMHANCCVGLE 318
            YP +HDQDVL++IK      KIGLK+RFLDT  FGGFC+  RD+ K  TMHANCCVGLE
Sbjct: 222 RYPDRHDQDVLDQIKGGGYPAKIGLKMRFLDTKYFGGFCEPSRDLDKVCTMHANCCVGLE 281

Query: 319 NKVHDLRILLQDWNNFFNPTANNKASPTPSWTVPQDC 355
           NK+ DLR ++ DW N+ +  A        +W  P++C
Sbjct: 282 NKIKDLRQVIVDWENYVS-AAKTTDGQIMTWRDPENC 309

BLAST of CaUC04G078450 vs. ExPASy Swiss-Prot
Match: Q3E6Y3 (Uncharacterized protein At1g28695 OS=Arabidopsis thaliana OX=3702 GN=At1g28695 PE=2 SV=1)

HSP 1 Score: 206.8 bits (525), Expect = 6.0e-52
Identity = 103/264 (39.02%), Postives = 152/264 (57.58%), Query Frame = 0

Query: 86  AAMEDGTIILTTLNDAWAEP----DSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSR 145
           AA  + T+I+T +N A+ +      ++LDLFL+SF  G GT  LL HL++V +DQ AY R
Sbjct: 53  AAGNNKTVIITMVNKAYVKEVGRGSTMLDLFLESFWEGEGTLPLLDHLMVVAVDQTAYDR 112

Query: 146 CVSLHPHCYELETQ-GTNFSSEAYFMTSDYLKMMWRRIEFLISVLEMGYSFVFTDSDIMW 205
           C     HCY++ET+ G +   E  FM+ D+++MMWRR   ++ VL  GY+ +FTD+D+MW
Sbjct: 113 CRFKRLHCYKMETEDGVDLEGEKVFMSKDFIEMMWRRTRLILDVLRRGYNVIFTDTDVMW 172

Query: 206 LQDPFNHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRTIYP 265
           L+ P +      D QI+ D      + +N     GF +V++N KT+  ++ WY  R    
Sbjct: 173 LRSPLSRLNMSLDMQISVDRINVGGQLINT----GFYHVRSNNKTISLFQKWYDMRLNST 232

Query: 266 GQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMGRDMSKTATMHANCCVGLENKV 325
           G  +QDVL  +  S    ++GL + FL T  F GFCQ    M    T+HANCC+ +  KV
Sbjct: 233 GMKEQDVLKNLLDSGFFNQLGLNVGFLSTTEFSGFCQDSPHMGVVTTVHANCCLHIPAKV 292

Query: 326 HDLRILLQDWNNFFNPTANNKASP 345
            DL  +L+DW  +     N+K SP
Sbjct: 293 FDLTRVLRDWKRYKASHVNSKWSP 312

BLAST of CaUC04G078450 vs. ExPASy Swiss-Prot
Match: Q9FXA7 (UDP-D-xylose:L-fucose alpha-1,3-D-xylosyltransferase 3 OS=Arabidopsis thaliana OX=3702 GN=RGXT3 PE=1 SV=1)

HSP 1 Score: 53.5 bits (127), Expect = 8.5e-06
Identity = 41/180 (22.78%), Postives = 72/180 (40.00%), Query Frame = 0

Query: 112 FLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYL 171
           FL ++ I    Q+  + ++++  D     +     P    L     +  S   F +  + 
Sbjct: 97  FLNNWLISISRQKHQEKVLVIAEDYATLYKVNEKWPGHAVLIPPALDPQSAHKFGSQGFF 156

Query: 172 KMMWRRIEFLISVLEMGYSFVFTDSDIMWLQDPFNHFYPEADFQIACDLF----LGNSED 231
            +  RR + L+++LE+GY+ ++ D D++WLQDPF++     D     D+     L +S D
Sbjct: 157 NLTSRRPQHLLNILELGYNVMYNDVDMVWLQDPFDYLQGSYDAYFMDDMIAIKPLNHSHD 216

Query: 232 LNNSPNGGFVYV-------KANPKTVQFYKFWYQSRTIYPGQ-------HDQDVLNKIKH 274
           L      G  YV       ++        K W +     P         HDQ   N+  H
Sbjct: 217 LPPLSRSGVTYVCSCMIFLRSTDGGKLLMKTWVEEIQAQPWNNTQAKKPHDQPAFNRALH 276

BLAST of CaUC04G078450 vs. ExPASy Swiss-Prot
Match: Q9M146 (UDP-D-xylose:L-fucose alpha-1,3-D-xylosyltransferase MGP4 OS=Arabidopsis thaliana OX=3702 GN=MGP4 PE=2 SV=1)

HSP 1 Score: 49.7 bits (117), Expect = 1.2e-04
Identity = 40/174 (22.99%), Postives = 76/174 (43.68%), Query Frame = 0

Query: 70  SSSPHKDLLLEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHL 129
           S S  +D  L + +K  A ++GT+I+  ++  +         FL ++ I    Q+    +
Sbjct: 74  SQSKWRDYSLPQAVKFVA-KNGTVIVCAVSYPYLP-------FLNNWLISVSRQKHQDQV 133

Query: 130 VIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVLEMGY 189
           +++  D     +     P    L     +  +   F +  +     RR + L+ +LE+GY
Sbjct: 134 LVIAEDYATLYKVNEKWPGHAVLIPPALDSQTAHKFGSQGFFNFTARRPQHLLEILELGY 193

Query: 190 SFVFTDSDIMWLQDPFNHFYPEADFQIACDLF----LGNSEDLNNSPNGGFVYV 240
           + ++ D D++WLQDPF +   + D     D+     L +S DL      G  Y+
Sbjct: 194 NVMYNDVDMVWLQDPFQYLEGKHDAYFMDDMTAIKPLDHSHDLPPPGKKGRTYI 239

BLAST of CaUC04G078450 vs. ExPASy Swiss-Prot
Match: Q9ZSJ0 (UDP-D-xylose:L-fucose alpha-1,3-D-xylosyltransferase OS=Arabidopsis thaliana OX=3702 GN=RGXT2 PE=1 SV=1)

HSP 1 Score: 48.5 bits (114), Expect = 2.7e-04
Identity = 51/213 (23.94%), Postives = 90/213 (42.25%), Query Frame = 0

Query: 36  VVLVGLMLGLLVLYNSAINPFKFLPVSYTYRAFRSS-SPH-KDLLLEKVLKEAA---MED 95
           +VL+ L L L V      +P    P   +  ++ SS SPH K       L +AA     +
Sbjct: 39  LVLLALFLLLGVFLPWPGSPLLLFPNKVSSPSYASSLSPHAKSEWRNYTLAQAAKFVATN 98

Query: 96  GTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPHCY 155
           GT+I+  ++  +         FL ++ I    Q+  + ++++  D     +     P   
Sbjct: 99  GTVIVCAVSSPFLP-------FLNNWLISVSRQKHQEKVLVIAEDYITLYKVNEKWPGHA 158

Query: 156 ELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVLEMGYSFVFTDSDIMWLQDPFNHFYP 215
            L     +  +   F +  +     RR + L+ +LE+GY+ ++ D D++WLQDPF +   
Sbjct: 159 VLIPPALDSKTAYSFGSQGFFNFTARRPQHLLQILELGYNVMYNDVDMVWLQDPFQYLEG 218

Query: 216 EADFQIACDL----FLGNSEDLNNSPNGGFVYV 240
             D     D+     L +S DL      G  Y+
Sbjct: 219 SHDAYFTDDMPQIKPLNHSHDLPAPDQNGETYI 244

BLAST of CaUC04G078450 vs. ExPASy TrEMBL
Match: A0A1S3AWN4 (Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103483692 PE=3 SV=1)

HSP 1 Score: 654.8 bits (1688), Expect = 3.1e-184
Identity = 323/378 (85.45%), Postives = 341/378 (90.21%), Query Frame = 0

Query: 1   MELNNSAADCEPAAKSSAPSAV----HTAVVPWRTVRISVVLVGLMLGLLVLYNSAINPF 60
           M+ NNSAAD E A K S PS V     T+VV WRTVR+SVVLVG+ LGL VLYNSAINPF
Sbjct: 1   MKNNNSAADGEQAGKLSVPSVVPTTTTTSVVTWRTVRVSVVLVGVTLGLFVLYNSAINPF 60

Query: 61  KFLPVSYTYRAFRSSSPHKDLLLEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSF 120
           KFLPVSYTYRAFR SSPHKD +LEKV+KEAAMEDGTII+TTLNDAWAEPDSL DLFLKSF
Sbjct: 61  KFLPVSYTYRAFRFSSPHKDPILEKVVKEAAMEDGTIIITTLNDAWAEPDSLFDLFLKSF 120

Query: 121 HIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWR 180
           H+GNGTQRLLKHLVIVTLDQKAYSRCV+LHPHCY+L+TQGTNFSSEAYFMTSDYLKMMWR
Sbjct: 121 HVGNGTQRLLKHLVIVTLDQKAYSRCVALHPHCYQLDTQGTNFSSEAYFMTSDYLKMMWR 180

Query: 181 RIEFLISVLEMGYSFVFTDSDIMWLQDPFNHFYPEADFQIACDLFLGNSEDLNNSPNGGF 240
           RIEFLI VLEMG+SFVFTD+DIMWLQDPFNHFY EADFQIA D +LGN EDLNN PNGGF
Sbjct: 181 RIEFLIYVLEMGHSFVFTDTDIMWLQDPFNHFYKEADFQIASDSYLGNPEDLNNVPNGGF 240

Query: 241 VYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFC 300
           VYV+ANPKTV+FYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIG+KLRFLDTANFGGFC
Sbjct: 241 VYVRANPKTVKFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGMKLRFLDTANFGGFC 300

Query: 301 QMGRDMSKTATMHANCCVGLENKVHDLRILLQDWNNFFNPT--ANNKASPTPSWTVPQDC 360
           QMGRDMSK AT+HANCCVGLENKVHDLRILLQDWNNFFN T   N   S TPSWTVPQDC
Sbjct: 301 QMGRDMSKMATVHANCCVGLENKVHDLRILLQDWNNFFNRTIAGNKSPSSTPSWTVPQDC 360

Query: 361 RTSFQRGRQ---PKKTGN 370
           RTSFQRGRQ    KKTGN
Sbjct: 361 RTSFQRGRQHKDDKKTGN 378

BLAST of CaUC04G078450 vs. ExPASy TrEMBL
Match: A0A0A0LT78 (Glycosyltransferase OS=Cucumis sativus OX=3659 GN=Csa_1G031880 PE=3 SV=1)

HSP 1 Score: 636.7 bits (1641), Expect = 8.7e-179
Identity = 313/375 (83.47%), Postives = 336/375 (89.60%), Query Frame = 0

Query: 4   NNSAADC-EPAAKSSAPSAVHTAVVPWRTVRISVVLVGLMLGLLVLYNSAINPFKFLPVS 63
           NNSA D  E A K SAPS   T    WRTVR+SVVLVG+ LGL VLYNSAINPFKFLP S
Sbjct: 3   NNSAVDGEEQAGKLSAPSVAPTTGATWRTVRVSVVLVGVTLGLFVLYNSAINPFKFLPAS 62

Query: 64  YTYRAFRSSSPHKDLLLEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGT 123
           Y YRAFR SSPHKD +LEKV+KEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGT
Sbjct: 63  YAYRAFRFSSPHKDPILEKVVKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGT 122

Query: 124 QRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLI 183
           QRLLKHLVIVTLDQKAYSRCV++HPHCY+L+TQGTNFSSEAYFMT+DYLKMMWRRIEFLI
Sbjct: 123 QRLLKHLVIVTLDQKAYSRCVAVHPHCYQLDTQGTNFSSEAYFMTADYLKMMWRRIEFLI 182

Query: 184 SVLEMGYSFVFTDSDIMWLQDPFNHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKAN 243
            VLEMG+SFVFTD+DIMWLQDPFNHFY +ADFQIA DL+LGN E+LNN PNGGFVYV+AN
Sbjct: 183 YVLEMGHSFVFTDTDIMWLQDPFNHFYKDADFQIASDLYLGNPENLNNVPNGGFVYVRAN 242

Query: 244 PKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMGRDM 303
            +TV+FYKFWY+SRTIYPGQHDQDVLNKIKHSPLIPKIG+KLRFLDTANFGGFCQMGRDM
Sbjct: 243 HRTVKFYKFWYESRTIYPGQHDQDVLNKIKHSPLIPKIGMKLRFLDTANFGGFCQMGRDM 302

Query: 304 SKTATMHANCCVGLENKVHDLRILLQDWNNFFNPTANNKASP--TPSWTVPQDCRTSFQR 363
           SK ATMHANCCVGLENKVHDLRILLQDWN+FFN T  +  SP  T SWTVPQDC+TSFQR
Sbjct: 303 SKMATMHANCCVGLENKVHDLRILLQDWNSFFNQTTGDNKSPSSTHSWTVPQDCKTSFQR 362

Query: 364 GRQ---PKKTGNRRV 373
           GRQ    KK GNRR+
Sbjct: 363 GRQHKDDKKPGNRRL 377

BLAST of CaUC04G078450 vs. ExPASy TrEMBL
Match: A0A6J1KAZ2 (Glycosyltransferase OS=Cucurbita maxima OX=3661 GN=LOC111493282 PE=3 SV=1)

HSP 1 Score: 569.3 bits (1466), Expect = 1.7e-158
Identity = 271/360 (75.28%), Postives = 310/360 (86.11%), Query Frame = 0

Query: 6   SAADCEPAAKSSAPSAVHTAVVPWRTVRISVVLVGLMLGLLVLYNSAINPFKFLPVSYTY 65
           S+ D +P    +AP + HTAVV W+TVR+SV   G++LGLLVLYNSAINPF  LPVSY+Y
Sbjct: 4   SSGDVQPG--GAAPFSAHTAVVRWKTVRLSVAFFGVILGLLVLYNSAINPFNILPVSYSY 63

Query: 66  RAFRSSSPHKDLLLEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRL 125
           RAFRS S  ++ LLEK L +A+ ED T+ILTTLN AWAEP+SLLDLFLKSFH GNGTQRL
Sbjct: 64  RAFRSYSSLRNPLLEKTLTKASNEDKTVILTTLNAAWAEPESLLDLFLKSFHAGNGTQRL 123

Query: 126 LKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVL 185
           LKHLVIV LD KAY RC + HPHCY+L+T+G NFS EAYFMT+DYLKMMWRRI+FL SVL
Sbjct: 124 LKHLVIVCLDAKAYQRCGASHPHCYQLDTEGANFSGEAYFMTADYLKMMWRRIQFLTSVL 183

Query: 186 EMGYSFVFTDSDIMWLQDPFNHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKT 245
           EMG+SFVFTDSDIMWLQDPFNHF+P+ADFQIACD F G+SEDLNN PNGGFVYVK+N KT
Sbjct: 184 EMGFSFVFTDSDIMWLQDPFNHFHPDADFQIACDTFKGSSEDLNNRPNGGFVYVKSNTKT 243

Query: 246 VQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMGRDMSKT 305
           ++FYKFWY+SRT++PG+HDQDVLNKIKHSPLIP+IGLK+RFLDTANFGGFCQMGRD +K 
Sbjct: 244 IRFYKFWYESRTMFPGRHDQDVLNKIKHSPLIPEIGLKIRFLDTANFGGFCQMGRDFTKV 303

Query: 306 ATMHANCCVGLENKVHDLRILLQDWNNFFNPTANNKASPTPSWTVPQDCRTSFQRGRQPK 365
            T+HANCCVGL NKVHDLRILL DW+ F     N+KAS  PSW+VPQDCRTSFQRGRQ K
Sbjct: 304 CTVHANCCVGLNNKVHDLRILLNDWSKF----VNHKASSRPSWSVPQDCRTSFQRGRQSK 357

BLAST of CaUC04G078450 vs. ExPASy TrEMBL
Match: A0A6J1GCE5 (Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111452702 PE=3 SV=1)

HSP 1 Score: 568.2 bits (1463), Expect = 3.8e-158
Identity = 269/360 (74.72%), Postives = 311/360 (86.39%), Query Frame = 0

Query: 6   SAADCEPAAKSSAPSAVHTAVVPWRTVRISVVLVGLMLGLLVLYNSAINPFKFLPVSYTY 65
           ++ D +P+   +AP   HTA+V W+TVR+SV   G++LGL+VLYNSAI PF  LPVSY+Y
Sbjct: 4   TSGDVQPS--GAAPFGAHTALVRWKTVRLSVAFFGVILGLVVLYNSAIKPFNILPVSYSY 63

Query: 66  RAFRSSSPHKDLLLEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRL 125
           RAFRS S  ++ LLEK L +A+ ED T+ILTTLN AWAEPDSLLDLFLKSFH GNGTQRL
Sbjct: 64  RAFRSYSSLRNPLLEKALTKASNEDKTVILTTLNAAWAEPDSLLDLFLKSFHSGNGTQRL 123

Query: 126 LKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVL 185
           LKHLVIV LD KAY RCV+ HPHCY+L+T+G NFS EAYFMT+DYLKMMWRRI+FL SVL
Sbjct: 124 LKHLVIVCLDAKAYQRCVASHPHCYQLDTEGANFSGEAYFMTADYLKMMWRRIQFLTSVL 183

Query: 186 EMGYSFVFTDSDIMWLQDPFNHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKT 245
           EMG+SFVFTDSDIMWLQDPFNHF+P+ADFQIACD F G+SEDLNN PNGGFVYVK+N KT
Sbjct: 184 EMGFSFVFTDSDIMWLQDPFNHFHPDADFQIACDTFKGSSEDLNNRPNGGFVYVKSNTKT 243

Query: 246 VQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMGRDMSKT 305
           ++FYKFWY+SRT++PG+HDQDVLNKIKHSPLIP+IGLK+RFLDTANFGGFCQMGRD +K 
Sbjct: 244 IRFYKFWYESRTMFPGRHDQDVLNKIKHSPLIPEIGLKIRFLDTANFGGFCQMGRDFTKV 303

Query: 306 ATMHANCCVGLENKVHDLRILLQDWNNFFNPTANNKASPTPSWTVPQDCRTSFQRGRQPK 365
            T+HANCCVGL+NKVHDLRILL DW+ F     N+KAS  PSW+VPQDCRTSFQRGRQ K
Sbjct: 304 CTVHANCCVGLDNKVHDLRILLNDWSKF----VNHKASSRPSWSVPQDCRTSFQRGRQSK 357

BLAST of CaUC04G078450 vs. ExPASy TrEMBL
Match: A0A6J1C6T6 (uncharacterized protein At4g15970-like OS=Momordica charantia OX=3673 GN=LOC111008782 PE=4 SV=1)

HSP 1 Score: 566.6 bits (1459), Expect = 1.1e-157
Identity = 275/356 (77.25%), Postives = 310/356 (87.08%), Query Frame = 0

Query: 1   MELNNSAADCEPAAKSSAPSAVHTAVVPWRTVRISVVLVGLMLGLLVLYNSAINPFKFLP 60
           M+ +NSAAD + AA   APS     +VP RTVRIS VL+G+ L +LVLYNSAINPF+FLP
Sbjct: 1   MKDSNSAADLQIAA---APS----TIVPRRTVRISFVLLGVALAILVLYNSAINPFRFLP 60

Query: 61  VSY-TYRAFRSSSPHKDLLLEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIG 120
           VSY TYR   S S   D LLEK+LK A+ EDGT+ILTTLNDAWAEP SLLDLFL+SFHIG
Sbjct: 61  VSYTTYRPSASPSLTTDPLLEKILKNASTEDGTVILTTLNDAWAEPGSLLDLFLESFHIG 120

Query: 121 NGTQRLLKHLVIVTLDQKAYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIE 180
           NGT+RLLKHLVIVT+D+KAY+RCV+LHPHCYEL+TQG NFSSEAYFMTSDYL+MMWRRIE
Sbjct: 121 NGTERLLKHLVIVTMDKKAYARCVALHPHCYELDTQGINFSSEAYFMTSDYLQMMWRRIE 180

Query: 181 FLISVLEMGYSFVFTDSDIMWLQDPFNHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYV 240
           FL SVL MG+SFVFTDSDIMWLQDPFNHF+P+ADFQIACD FLGNSEDLNN PNGGF YV
Sbjct: 181 FLTSVLRMGFSFVFTDSDIMWLQDPFNHFHPDADFQIACDYFLGNSEDLNNRPNGGFTYV 240

Query: 241 KANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMG 300
           K+NPKT++FYKFWYQSRTIYPGQHDQDVLNKIK SPLI KIGLK+RFLDTANFGGFCQ  
Sbjct: 241 KSNPKTIKFYKFWYQSRTIYPGQHDQDVLNKIKTSPLISKIGLKIRFLDTANFGGFCQPS 300

Query: 301 RDMSKTATMHANCCVGLENKVHDLRILLQDWNNFFNPTANNKASPTPSWTVPQDCR 356
           RD ++ +TMHANCCVGL+NKVHDL+ILL DWN FF  T  +KA+ TPSW+VPQDC+
Sbjct: 301 RDFNRVSTMHANCCVGLDNKVHDLKILLHDWNTFFTQTPRDKAASTPSWSVPQDCK 349

BLAST of CaUC04G078450 vs. TAIR 10
Match: AT1G14590.1 (Nucleotide-diphospho-sugar transferase family protein )

HSP 1 Score: 370.9 bits (951), Expect = 1.7e-102
Identity = 176/322 (54.66%), Postives = 229/322 (71.12%), Query Frame = 0

Query: 33  RISVVLVGLMLGLLVLYNSAINPFKFLPVSYTYRAFRSSSPHKDLLLEKVLKEAAMEDGT 92
           R ++ L  + +   VLY +A +   F P  +   ++  +   K   LE VL +AA  D T
Sbjct: 43  RAALFLAAISISCFVLYRAA-DSLSFSPPIFDLSSYLDNEEPK---LEDVLSKAATRDRT 102

Query: 93  IILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVSLHPHCYEL 152
           ++LTTLN AWA P S++DLF +SF IG  T ++L HLVIV LD KAYSRC+ LH HC+ L
Sbjct: 103 VVLTTLNAAWAAPGSVIDLFFESFRIGEETSQILDHLVIVALDAKAYSRCLELHKHCFSL 162

Query: 153 ETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVLEMGYSFVFTDSDIMWLQDPFNHFYPEA 212
            T+G +FS EAYFMT  YLKMMWRRI+ L SVLEMGY+FVFTD+D+MW ++PF  FY  A
Sbjct: 163 VTEGVDFSREAYFMTRSYLKMMWRRIDLLRSVLEMGYNFVFTDADVMWFRNPFPRFYMYA 222

Query: 213 DFQIACDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRTIYPGQHDQDVLNKIK 272
           DFQIACD +LG S DL+N PNGGF +V++N +T+ FYK+WY SR  +PG HDQDVLN +K
Sbjct: 223 DFQIACDHYLGRSNDLHNRPNGGFNFVRSNNRTILFYKYWYASRLRFPGYHDQDVLNFLK 282

Query: 273 HSPLIPKIGLKLRFLDTANFGGFCQMGRDMSKTATMHANCCVGLENKVHDLRILLQDWNN 332
             P + +IGLK+RFL+TA FGG C+  RD++   TMHANCC G+E+K+HDLRI+LQDW +
Sbjct: 283 AEPFVFRIGLKMRFLNTAYFGGLCEPSRDLNLVRTMHANCCYGMESKLHDLRIMLQDWKD 342

Query: 333 FFNPTANNKASPTPSWTVPQDC 355
           F +   + K S   SW VPQ+C
Sbjct: 343 FMSLPLHLKQSSGFSWKVPQNC 360

BLAST of CaUC04G078450 vs. TAIR 10
Match: AT2G02061.1 (Nucleotide-diphospho-sugar transferase family protein )

HSP 1 Score: 369.0 bits (946), Expect = 6.5e-102
Identity = 165/277 (59.57%), Postives = 212/277 (76.53%), Query Frame = 0

Query: 79  LEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKA 138
           LE+VL+ AA +DGT+ILTTLN+AWA P S++DLF +SF IG GT+RLLKHLVI+ LD KA
Sbjct: 108 LEEVLRRAATKDGTVILTTLNEAWAAPGSVIDLFFESFRIGKGTRRLLKHLVIIALDAKA 167

Query: 139 YSRCVSLHPHCYELETQGTNFS-SEAYFMTSDYLKMMWRRIEFLISVLEMGYSFVFTDSD 198
           YSRC  LH HC+ LET+G +FS  EAYFMT  YL MMWRRI FL SVLE GY+FVFTD+D
Sbjct: 168 YSRCQELHKHCFRLETEGVDFSGGEAYFMTPSYLTMMWRRISFLRSVLEKGYNFVFTDAD 227

Query: 199 IMWLQDPFNHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRT 258
           +MW ++PF  FY + DFQIACD ++G   D  N PNGGF +V+AN +++ FYKFWY SRT
Sbjct: 228 VMWFRNPFRRFYEDGDFQIACDHYIGRPNDFRNRPNGGFTFVRANNRSIGFYKFWYDSRT 287

Query: 259 IYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMGRDMSKTATMHANCCVGLE 318
            YP  HDQDVLN IK  P + K+ +++RFL+T  FGGFC+  +D++   TMHANCC GL+
Sbjct: 288 KYPKNHDQDVLNFIKTDPFLWKLRIRIRFLNTVYFGGFCEPSKDLNLVCTMHANCCFGLD 347

Query: 319 NKVHDLRILLQDWNNFFNPTANNKASPTPSWTVPQDC 355
           +K+HDLRI+LQDW +F +   ++  S   +W+VPQ+C
Sbjct: 348 SKLHDLRIMLQDWRDFKSLPLHSNQSSGFTWSVPQNC 384

BLAST of CaUC04G078450 vs. TAIR 10
Match: AT4G19970.1 (CONTAINS InterPro DOMAIN/s: Nucleotide-diphospho-sugar transferase, predicted (InterPro:IPR005069); BEST Arabidopsis thaliana protein match is: Nucleotide-diphospho-sugar transferase family protein (TAIR:AT5G44820.1); Has 801 Blast hits to 466 proteins in 35 species: Archae - 0; Bacteria - 0; Metazoa - 2; Fungi - 0; Plants - 750; Viruses - 0; Other Eukaryotes - 49 (source: NCBI BLink). )

HSP 1 Score: 359.8 bits (922), Expect = 3.9e-99
Identity = 174/333 (52.25%), Postives = 230/333 (69.07%), Query Frame = 0

Query: 30  RTVRISVVLVGLMLGLLVLYNSAINPFKFLPVSYTYRA-----FRSSSP---HKDLLLEK 89
           + V+  +VLV  +   L+LY +A    + L V+            SSSP    K +   +
Sbjct: 385 KEVKKILVLVLGLAACLLLYKTAYPLHQELDVNNLSSRPLLDHTSSSSPLTRSKSISFRE 444

Query: 90  VLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSR 149
           VL+ A+ E+ T+I+TTLN AWAEP+SL DLFL+SF IG GT++LL+H+V+V LD KA++R
Sbjct: 445 VLENASTENRTVIVTTLNQAWAEPNSLFDLFLESFRIGQGTKKLLQHVVVVCLDSKAFAR 504

Query: 150 CVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVLEMGYSFVFTDSDIMWL 209
           C  LHP+CY L+T GT+FS E  F T DYLKMMWRRIE L  VLEMGY+F+FTD+DIMWL
Sbjct: 505 CSQLHPNCYYLKTTGTDFSGEKLFATPDYLKMMWRRIELLTQVLEMGYNFIFTDADIMWL 564

Query: 210 QDPFNHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRTIYPG 269
           +DPF   YP+ DFQ+ACD F G+  D +N  NGGF YVK+N ++++FYKFWY SR  YP 
Sbjct: 565 RDPFPRLYPDGDFQMACDRFFGDPHDSDNWVNGGFTYVKSNHRSIEFYKFWYNSRLDYPK 624

Query: 270 QHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMGRDMSKTATMHANCCVGLENKVH 329
            HDQDV N+IKH  L+ +IG+++RF DT  FGGFCQ  RD++   TMHANCCVGL  K+H
Sbjct: 625 MHDQDVFNQIKHKALVSEIGIQMRFFDTVYFGGFCQTSRDINLVCTMHANCCVGLAKKLH 684

Query: 330 DLRILLQDWNNFFNPTANNKASPTPSWTVPQDC 355
           DL ++L DW N+ + +   K     +W+VP  C
Sbjct: 685 DLNLVLDDWRNYLSLSEPVK---NTTWSVPMKC 714

BLAST of CaUC04G078450 vs. TAIR 10
Match: AT5G44820.1 (Nucleotide-diphospho-sugar transferase family protein )

HSP 1 Score: 352.4 bits (903), Expect = 6.3e-97
Identity = 169/340 (49.71%), Postives = 226/340 (66.47%), Query Frame = 0

Query: 33  RISVVLVGLMLGLLVLYNSAINPFKFLPVSYTYRAFRSSSP---------------HKDL 92
           RI ++ +GL    LVLY +A  P + L VS       S SP                  L
Sbjct: 32  RILILFLGLTASCLVLYKTAY-PLQRLNVSNLTSLQASPSPLLPNLNSSEISPETTKPKL 91

Query: 93  LLEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQK 152
             +++L+ A+ ++ T+I+TTLN AWAEP+SL DLFL+SF IG GTQ+LLKH+V+V LD K
Sbjct: 92  SFKEILENASTKNNTVIITTLNQAWAEPNSLFDLFLESFRIGQGTQQLLKHVVVVCLDIK 151

Query: 153 AYSRCVSLHPHCYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVLEMGYSFVFTDSD 212
           A+ RC  LH +CY +ET  T+FS E  + T DYLKMMW RI+ L  VLEMG++F+FTD+D
Sbjct: 152 AFERCSQLHTNCYHIETSETDFSGEKVYNTPDYLKMMWARIDLLTQVLEMGFNFIFTDAD 211

Query: 213 IMWLQDPFNHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRT 272
           IMWL+DPF   YP+ DFQ+ACD F GN  D +N  NGGF YV++N ++++FYKFW++SR 
Sbjct: 212 IMWLRDPFPRLYPDGDFQMACDRFFGNPYDSDNWVNGGFTYVRSNNRSIEFYKFWHKSRL 271

Query: 273 IYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMGRDMSKTATMHANCCVGLE 332
            YP  HDQDV N+IKH P I +IG+++RF DT  FGGFCQ  RD++   TMHANCC+GL+
Sbjct: 272 DYPDLHDQDVFNRIKHEPFISEIGIQMRFFDTVYFGGFCQTSRDINLVCTMHANCCIGLD 331

Query: 333 NKVHDLRILLQDWNNFFN---PTANNKASPTPSWTVPQDC 355
            K+HDL ++L DW  + +   P  N       +W+VP  C
Sbjct: 332 KKLHDLNLVLDDWRKYLSLSEPVQNT------TWSVPMKC 364

BLAST of CaUC04G078450 vs. TAIR 10
Match: AT4G15970.1 (Nucleotide-diphospho-sugar transferase family protein )

HSP 1 Score: 319.7 bits (818), Expect = 4.5e-87
Identity = 149/277 (53.79%), Postives = 196/277 (70.76%), Query Frame = 0

Query: 79  LEKVLKEAAMEDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKA 138
           L K+L EAA ED T+I+TTLN AW+EP+S  DLFL SFH+G GT+ LL+HLV+  LD++A
Sbjct: 33  LGKILTEAATEDKTVIITTLNKAWSEPNSTFDLFLHSFHVGKGTKPLLRHLVVACLDEEA 92

Query: 139 YSRCVSLHPH-CYELETQGTNFSSEAYFMTSDYLKMMWRRIEFLISVLEMGYSFVFTDSD 198
           YSRC  +HPH CY ++T G +F+ +  FMT DYLKMMWRRIEFL ++L++ Y+F+FT   
Sbjct: 93  YSRCSEVHPHRCYFMKTPGIDFAGDKMFMTPDYLKMMWRRIEFLGTLLKLRYNFIFT--- 152

Query: 199 IMWLQDPFNHFYPEADFQIACDLFLGNSEDLNNSPNGGFVYVKANPKTVQFYKFWYQSRT 258
                 PF     E DFQIACD + G+ +D++N+ NGGF +VKAN +T+ FY +WY SR 
Sbjct: 153 -----IPFPRLSKEVDFQIACDRYSGDDKDIHNAVNGGFTFVKANQRTIDFYNYWYMSRL 212

Query: 259 IYPGQHDQDVLNKIKHSPLIPKIGLKLRFLDTANFGGFCQMGRDMSKTATMHANCCVGLE 318
            YP +HDQDVL++IK      KIGLK+RFLDT  FGGFC+  RD+ K  TMHANCCVGLE
Sbjct: 213 RYPDRHDQDVLDQIKGGGYPAKIGLKMRFLDTKYFGGFCEPSRDLDKVCTMHANCCVGLE 272

Query: 319 NKVHDLRILLQDWNNFFNPTANNKASPTPSWTVPQDC 355
           NK+ DLR ++ DW N+ +  A        +W  P++C
Sbjct: 273 NKIKDLRQVIVDWENYVS-AAKTTDGQIMTWRDPENC 300

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038894961.16.8e-18686.74uncharacterized protein At4g15970-like [Benincasa hispida][more]
XP_008438689.16.4e-18485.45PREDICTED: uncharacterized protein At4g15970-like [Cucumis melo][more]
XP_004137392.11.8e-17883.47uncharacterized protein At4g15970 [Cucumis sativus] >KGN63967.1 hypothetical pro... [more]
KAG6607414.17.5e-17766.80hypothetical protein SDJN03_00756, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_023524447.14.2e-15975.56uncharacterized protein At4g15970-like [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
P0C0426.4e-8653.79Uncharacterized protein At4g15970 OS=Arabidopsis thaliana OX=3702 GN=At4g15970 P... [more]
Q3E6Y36.0e-5239.02Uncharacterized protein At1g28695 OS=Arabidopsis thaliana OX=3702 GN=At1g28695 P... [more]
Q9FXA78.5e-0622.78UDP-D-xylose:L-fucose alpha-1,3-D-xylosyltransferase 3 OS=Arabidopsis thaliana O... [more]
Q9M1461.2e-0422.99UDP-D-xylose:L-fucose alpha-1,3-D-xylosyltransferase MGP4 OS=Arabidopsis thalian... [more]
Q9ZSJ02.7e-0423.94UDP-D-xylose:L-fucose alpha-1,3-D-xylosyltransferase OS=Arabidopsis thaliana OX=... [more]
Match NameE-valueIdentityDescription
A0A1S3AWN43.1e-18485.45Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103483692 PE=3 SV=1[more]
A0A0A0LT788.7e-17983.47Glycosyltransferase OS=Cucumis sativus OX=3659 GN=Csa_1G031880 PE=3 SV=1[more]
A0A6J1KAZ21.7e-15875.28Glycosyltransferase OS=Cucurbita maxima OX=3661 GN=LOC111493282 PE=3 SV=1[more]
A0A6J1GCE53.8e-15874.72Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111452702 PE=3 SV=1[more]
A0A6J1C6T61.1e-15777.25uncharacterized protein At4g15970-like OS=Momordica charantia OX=3673 GN=LOC1110... [more]
Match NameE-valueIdentityDescription
AT1G14590.11.7e-10254.66Nucleotide-diphospho-sugar transferase family protein [more]
AT2G02061.16.5e-10259.57Nucleotide-diphospho-sugar transferase family protein [more]
AT4G19970.13.9e-9952.25CONTAINS InterPro DOMAIN/s: Nucleotide-diphospho-sugar transferase, predicted (I... [more]
AT5G44820.16.3e-9749.71Nucleotide-diphospho-sugar transferase family protein [more]
AT4G15970.14.5e-8753.79Nucleotide-diphospho-sugar transferase family protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL246-FR2) v1
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR005069Nucleotide-diphospho-sugar transferasePFAMPF03407Nucleotid_transcoord: 125..323
e-value: 6.0E-63
score: 212.6
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 423..472
NoneNo IPR availablePANTHERPTHR46038:SF13NUCLEOTIDE-DIPHOSPHO-SUGAR TRANSFERASE FAMILY PROTEINcoord: 22..365
IPR044821Putative nucleotide-diphospho-sugar transferase At1g28695/At4g15970-likePANTHERPTHR46038EXPRESSED PROTEIN-RELATEDcoord: 22..365
IPR029044Nucleotide-diphospho-sugar transferasesSUPERFAMILY53448Nucleotide-diphospho-sugar transferasescoord: 129..273

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CaUC04G078450.1CaUC04G078450.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0071555 cell wall organization
cellular_component GO:0000139 Golgi membrane
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0016757 glycosyltransferase activity