CmUC05G081580 (gene) Watermelon (USVL531) v1

Overview
NameCmUC05G081580
Typegene
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
DescriptionArabinogalactan protein 20
LocationCmU531Chr05: 1143689 .. 1158638 (-)
RNA-Seq ExpressionCmUC05G081580
SyntenyCmUC05G081580
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GGAGGTCTTAATTTCAGAGAGTGTTTTTTAACCTAAAAAAAAAAAATTGATTCTCCGTTTCTGCCACTGCCGCAATCTTCCATCGTTTTCCTTTCCGCCTCCTCGTTCTCCTTGCAGTAGTGGCCGCTCCGCGATCTGTGGTTTGCTTAACTTTATCTTGTATTTCTTGGATTCAATGGTGTAAATTGTTCTAATCCGATCAACATTTTGAATGAGCCTCTTTGATTTGTGTCTGTTCTTCTAAGTTTCTAGATTGGTGCCATTTCCTCCCTTTATCTTCAATTGGGTCATCATCAGCTATTATATCTGCGGAGAAGCGGAAATTCTACCTATGAGAATCAGAGGAACAGTGGTTTATGGCTAATATCTCATCTCTTATGATCTCCATTCGCCAAAACTCTCGATTTTTCAAGAACCTTAGGATCCACATCCGGAATCTCTCCGTGGAAACGAATGGAGGCAATAATGGATTTGAAGCGATTGAACCGAGTGAGAAACTATTGAGCCGTACTCACCGCCAAGACGTCAGTGAGATTGCGGAAAATGTTTGTAAGGTCATTAGGAGCAAACCCAGATGGGAGCAGACTCTGCTTTCTGATTACCCTTCTTTCAATTTCCATGACCCGTCTTTTTTTCGCGAGCTTTTGAAGCAGTTGAACAATGTTTTGCTTTCTTTGAGGTTTTTTCTTTGGTTGAGTTCGCAGCCTGAGTTCTTGCCCCATCCAGTCAGTTGCAACACGCTTTTTGATGCTCTTTTGGAGGCCAGGGCTTGTGTTCCCGCTAAATCTTTTCTTCATTCTTTCGGTTTTAGTCCCGAGCCTGCCTCTTTGGAGAATTACATTCGATGTGTTTGCGAGGGTGGTTTGGTTGAGGAAGCTGTTAATGTATTTGATGTGTTAAAAGAGACTGGATATCGTCCATCCATTGAGACGTGGAACTTTGCTTTACAGAGTTGTCTTAAGTTTGGGAGGACTGATCTTATTTGGAAACTGTATGAAGAGATGATTGAAGCTGGTGTGCAGAAGGATGTGGGGATAGAGACTGTGGGGTATCTTATCCAGGCATTTTGCAGCGATAACAAGGTTTCAAGAGCTTATGAACTTCTAAGACAGGCTTTAGAGGATGGATTGGCCCCTTGTAATGATGCTTTCAACAAATTGATTTCTGGGTTCTGCGAGGAGGAGGATTATGATAGAGTATCAGAACTTCTCCACACAATGATAGCTAGGAAACGTACTCCCGATATTTTTACCTACCAGAAAATCATTAACGGGCTCTGCAAGAAAGGGAAGCAGCTGGAGGCATTTGAGGTTTTCAATGTCCTTAAGGATCGGGGATATGCTCCTGATAAGGTCATGTATACAACAATGATTGTTGGCCTTTGTAAGATGAGGTGGCTTGGAGATGCTAAAAAGCTGTGGTTTGAGATGATTGATAAGGGATTTCTTCCAAATGAGTATACGTACAACACGTTGATTAATGGATTTTGTAAGATTGGAAAGCTGGATGAGGCCTCTAAGCTATGTAAGGAAATGCATGATAGAGGTTATAAAAAAACCACTCTCAGCTGCAACATAGTGTCTACACGGAAGGACAGATGAAGCATAGGACTTCTATTGAGAAATGCCTTCCAAGGATGTAAATTCGCGATGTTCTAACATTCAATACCCTGATTCATGGATTTTGGAGAAATAAGATATTGCAGAGGACAGACCTATTCAAAGAACTGCTAGAGCAGGGGTTGCAGCCTTCAACTGCGTTTTGTACTCATCACATTAAAAAGCTTTGCCAATTAGGTAGAGTGAAAGAATCAAAGAAAATGTGGAATGACTTGCATAATAGAGGTCTTCAGCCGATGGTCTGCACTCATGAGCACATAATTAATGGATTATGTAAACAAGGATGTGTGGTAGAGGAGATGGATGGGTTGATAATTATGTTGAAGAGCAATCTCAAGCTTCAAAAGCGGACTTTTGTTAAGGTGGTTCAGAGTTTCATTCAAATGGCTAAATTAGATGATTCTTTATCAGTCTTAGGCTCAATGCTTAGAGTACGTTATAAACTAGAAAAAGGCACTTCAGTTATCTCTTGGAAAAACTATGTGGAAACAAGTATCAGTTTGTTGAATCACAATTACAGGAAATCATAAATTCCAACCAGTAGGAGATTCTCTATATATTCTAGGGAGGCTAGTTTAAGATGGTTGTACTCCTGTCATCGATGTTCCTAAAGGCAGATCCGTTGAATGCTTCTTTTGTGGGGGGAGATGGAAGCATGTTAGAAGGGATCTTTTTAACCCAGAAAGCTGTTCTTGCATCGACCGAATAGCCTCTCTGATTCATCTCGTCAGCTGGGAAAAATGGTTTCAACTTGGAAGAAGATTCAATGTGGAAACAGTGTTTATAACCTAGTGTATGTTGTTGGTGGAGAATTCTGTAGCCGAGTTGCAGTGGTTCTTAGGTCCAGGAGTCAATATTTCATTTATGAGAAGTTGATTGTTTAGATTTTCTCCACGCACGCATTGGGTTATAGAATTTCCTTGTCATGTAACTTATCTCCTACCATACGGGCTGCGTTATTGCTGAATAGTTTTAAATTTCACACTAAGTTTAATTCTCATTCCAAATATTAAAATCAGTATCAAAGCAATACATCTCATATGAAGTCTGGTTTAATAAAATTTTATAAATTTCATACACTGTTTCCTTTTTCTGCATTCTTGACCCCTTCCCCTTCCATTTCCAAAACCGTTTCTTTGGGCGAAAACAGCTGCTTCATCATCTTCTATTGTGGTCTGCAGAACTCCCGCATTTGGCAGGGCAGATAGGGAATCTCTTCACAACTTACAGTGTTTGCTTTGCTTTCATTCAGGCATTCATTCGCTGCAGTAATTTTTAAGATAGACAGCAAAACTTTGAAAAAAGTATATACCATTTGGCAGTGACTAAATCCTCTAGCAATTCCCCCCACTTTTTCTGATTGGTCAACTGCTCTTTGCTAAAGACAGGATATCCCTAGTTACAAAATATGTTGGATCATTCTCATCTTCGAAACACAGCTCCATATTTCCTTTTCCACCCTCCCATCAAGGAAGAAAAAGAACTTGTTTGCGAAGCCCATAGTTCATACAGGCCATCATCAGGGAACATTTATGAGCAATGCCTTTAAATCACTGAGGTCGTCAGCTACATCAATACTTCTTCCTTGGTTTTTAGGGTAGCTCAGAATCTGAGGCATGGCCAGATGATCTTCTAACCTTTGGACTCATTGCCATAAGCTGGCTTTCAGAATTCATTCGCATACTTCGCCCACGAGGACTAATTTGTGGAGGCCTCGAAAATCTTCCTTTCGGTGTACCTGTGTAATGGAGTCAGGATGAAGTTAGATTTAGAACATAGGAGCATTTTAACCATCTGTCAGTACCAATTTCAGTTTATATATTCATAAAGACTAGAGTGATATTCTTCAAAAATTACGACCAGATTGAGTTAAAGGTACATGTTCCTTGCCAGCCTAAATATTTCATGCTGGTGAATCTAGAAGATCGATTGAAGATTATGAGGGTGTTACTTCCTCCATTCTTGCACAACTGTTCATTGTTTGGTCATTTTTATCTGTTAATTCTAAGTGCAGTGTTCAATTAGTTCTAGCTTTAGTGTTTATAATTTTTGAACCAGAAAGTACAACTGATATTTCAAGAAAAATGGTAAGCATGACCATTAACGAGAATAACCTGGTAGCTTGTTCAGTTTAATATCAGAATTGTTCTTAGTCAATGAAAGCCTCCTGGTCGTTCCTTCACGCGAAGCTGGTAGTCTTTTTTCAACGGGCGTCGGGAAAGGATCGTCTATTGATGATGTCGTTTCATCCCCACATGCTGAGGCATCACTTCCTGATTCTGAGGTATCTTTAATGCTGGCACCAGCTTTCTCTTTTTCAACGACTCTGTAAGCAAACATCTGAAATGGGATATCTGTAATTGGTGACGAGGCCTCACGTGTGGAAGATGGCAGCTTCCTCAATAAACCTGACTCAATGTCACAGTAATCTACTGTCGGCGTAGGAGGATCTGTCTGGCTATTGTAATCAGGATGCATCGCAACAAATCGCTTTGTATACCGAAGAATCCTACAAATAGACGAAAAGCTTGGCCTTTGATTCGGGTCGGTATGCCAACATCTCTTTGTCAAGTTTGTCACATATTTTGGCATACTATGAGGAAAAAGTGGCCTCTCTCCTGCTCGAATGTTTCGACTCATTTTGTCCCCTTGAAGATGGCTATCCTCAAAAGGGACTTTCCCAGTCAGAACTTCAAAACAAACCATGCCAAAGCTGTACACATCAGACTTCTCTGTATACTTAGAACTCTCAGCACTTCCTGATTGATCCTGTTCTTCTAAAACTTCTGGAGCATACCATATGAATGGAAGGGATTCATTCTGATTGGAAGAGTTCTTAAACTTAACAACAGGTAGGCCAAACCCTGAGACCTGAGCATGTATATAGCCGTCTGTAGAATAAGCTCTTGGCTTAACCAGAATGTTACAAGGATTTAAATCCCCATGGTAGATCTTCTTTGAGTGGAGATATTCCATTCCTCTTGCAATCTGAAGCATCAAATCTAGAGCTACAGGAAGGGGAAATGGAATTCTCTTGCGGGGGCCACAAATCTCCTTTACATAGCTCGATAAATCTCTACTCATAAGTTCCATAATCAAGAAACACTCTTTCTTTTCCTCATCAGTAAACCCACAAAGAAATCTTGTGATGTTTGGATGAGAAAGAGATAACAGCATCGATATCTCTGGAATTAAAGATTCAATTTCCCCAAAAAAGTGTCTCATTGCAAAACTTTCCCCTAACCAGAGTATCTCTTTGTATTGACTTCCACCCCCTAATCTTCTCCTCACCTGGTAATCCCTTGATCCTACTAGCATGGAACTTGGCAATAGCTTTCCATTCAAAGATTCTGACCCATCTAAGTTTTTCAGAAGATGATCCGTTAGTCTTTGTTCATACTTGGATGAAGCTGTGGATTTCTTCTCCCGGAACTTATTGAGTAGAACCCATCTATCTTCATCCCAAACCGCATCCATCCGATTGCAAAAATCTTGAGTAATAAGGTATTGTTTCCCAAATTTCCATTTGAAGATCTTCAAATCCTTGTGTTCTCTCCTGTATTTGATGGAGTTGATTAGTCTTTTCTTCAGCATCTCATCGTGATCACTACCAGAATTCTCCCCAGCTGTTTCAATGGCTTCAACAACAACCGTAATGCTGTACAATAGATTGTGAATGTGGAATTCCATGCAATCAGTGTTTTGATATAACATAATGGTTTTTGCCCACCAATCTTTTGTTTCCAAACACTGCCTAATGTACCATTCTGCTTCTTTGAATACCCTGTTTAGGTCTCTCAATGGCTGATCCAAAACCTTCCATCTAGTGTGCTTCTCTTCGAATCGAAGATTTTGTTTCATCTCCTCCGCCACTGAATCAAACGCAAAGCTCAAAACATCGAACAGCAAACAGCATTGCCTCTGATTTATATGAACGCTATCTTTGAATAGCATGAGGGCTTTTATACTTCCCAATGCCTCACCAAGCTGCCGAAATTGCTCCATTTCAGAACCCAAAATTTCATATACCGATTGTGATTTCTTCAGGGGAAATTGATCTTCTCGATCAATCTCAGATCAAGGGAGTGTGGGAATTGAACGATCAACATAACAGCAATGCAGTGGATTAAAAACCAGGATTCTTCTGTAGGGCCTCAGAATTGGAACGGAATGAAGCAGAAAAAACCACCGATCGAAGTTTTAGCGGCGACCCAACAATCCAGTGAAGGAGGAGAAAGGAATTTTGGAGCAGAATAGGAAAGAGGGGAAATTTTTTTGAAATGGAGTTGAGAGAGGAACAGAAATGGGTTATTGGAGGAATGGCCGAAAATGAAGAGAGAAGAGAGAAGAAATGGAGGAAAGTTGAAGATTGAGGTTGGAAAATGGGTTTAAGAAGTGGGGAAAGGTGAGAACCAAACACTTGTCCAGAAATGGTAAGGTTGAAGAATGGTGGAGGATGGTCTGCAACGAAGGGTTTTGCCGCGAATTTCGATGGCCATATAAGTTTGGAATTTGCAGCAGCTCTTATAATAGTGTATTTTAACTGTTTTCTTTCTTATTTAAGCTCCCTTTCCTTCTTCCTTTCTCTTTTACTCATCTTCATATCTGGAATTAAAATATACAATCATGAAAATCTATTTATTTACATTCAAATATATATATCCAAACACATTAAAAAGTCTAAAAAATGACAAATACTTGATGTGAAGTTGAAGGTTGTACAATTTTTTAAATTTAGAAACTTATAGATTGCAATCTTTAATGTTGTTTGATAATTATTTGGTTTTTTATTTTTTAAAATTAAATTTATAAACATTATTTTTATTTTATTTTTATTTTTAATTTTGTTATCTATGGTTTAGGTATGTTTTCAAATTTTAAGACTTGAAAACTAGAAAAAGAAGCTGTTTTGGAAACTTGTTTGTTTTTTTGAAATTTGAATAATAGTTAATATGTTTTTTTAGAAAATATAAAAACCATGGTAAAAAATGTGAGAAAATTAGTATAATTTTCAAAAAAAAAAAAAAAAATCCACTAGTTATCAAATTGACCTTAAGAATTAAATTTATGATTTAATATATTTAAAACAATAATTTAAATGTTAATATTTTTAAGCAAGATATGAATTGTGAATATTTGATGTTTTGGACTAGCTAGGTAATACACTAAATTAAATCATTTCAAATGTCGATAACTCGATCTTTTACAATGTAATGGTAGAGGTTGAACTTAAAAAAAATATAAAAGATTAGAAAAAGGGTATAAATATGGTTATATTGCAAATTTGACCTTTATAGTTTGATAAAATTTTGGTAAATAATCTCTATAATAATTTATTTTTATAAAAGTTCTATCCAAACATAAAAAGTTATAAATATAAATCATGGGCTAAATTCTAAATTTTTCCATAATTTAACCGGTAAATACTGGCCATTTTTCAGGCTTCTTTTGGAGGCGGCCCAATACAGCGCAGGCTTTAATTGAGGCCCAATTCACAGTTCAAGTGGAATTGAAAGCCCAATTACTTATACTAAGCCCAAGTTATCTATGGCGTAAGTTAGCATCGGTTGCTCTCAGCCGGTTAGAACAGAAGAACTTCAGCTTTAGCTTCCTTCCATTCCACTCTCATTTTTACTCCAAATAGAAAACCATTCACTTTCCGAGTCCTTTCCTTTCCCTCTCTTTCTTCTTCCCATTCAAAAAGGGTTCCGCGCCATACCCGGATTCTTGATTACAGTATCCTCTCTTCCGCCTACCTTCATTTCTCTCCTTTCCATCCTTCTGCTTAGCCCTAATTTCCCAAGGAATATGCCGAGAGAGATCATCACTCTGCAAGTTGGACAATGTGGGAACCAGATCGGAATGGAGTTCTGGAAGCAGCTCTGCCTCGAGCATGGAATCAGCAAAGAGGGCATTCTTGAAGATTTTGCTACTCAGGTCCTTGCTTTGCTTTTCTTCTCTTCTTCATGCACTGTTGTTTTTTCTTTCAATTTGTTTATACATGTAATATATTATGCACGCACGAGCGTCAGTTCGTTTGGTTGTTTGGAAAATTGGGGGAGAGATGAGGAAATGGACTGTTTATGCTGTATGATAATATATAAACACATGACCGTGCTTTTCCAATGTGTTCGTCCCGTTGTAAGATTTTGGAAGATAAGTGGGAAATTGCCTAATAAGGGTTTTAACTAGAAAGAATTGTAGGGCCATTGGGGAATTCGCTTGTAGTTTCACCCGGTTTGGAGTTTGCTGGCTTCTTTTGAGTGATCTTGTGGAAATGAGCTTACGCGAAGGGGTTATTGACAGTACTTGGTCACTGGGGCGTTGATGGGAGAGATGTGCTTAGTTTCTGGCGCAATCCACTACGTAACTTTATCATAACTTTGGATCAGACTCTCTGGTCTTCGTTTTAGCTCCCCCTTTCAACATGTACGATATTGCTCCACTTTTATCTGCACCTTAGCTTCTTCTGAGCAATGCTTTACTATGCTCTCAATTTAAACAGTGTCCAGCTCTTATCATGTGGCAGACGTTTTATTAAGAGCTGAGAGATTATGGATTATGTATAGTAGTAGTAACTGGGCGACAATGAACCGGGCCACTCCTTTGATCTTATTCATATATAATATATTGCATACATAATCTGCAAGTACATTTCAGCGGAAGGATGTAAACATATTGTATAATATGCAGGGAGGAGACCGGAAAGATGTATTCTTCTATCAAGCTGATGATCAGCACTACATACCGAGAGCTTTGCTTATTGACCTGGAGCCCAGGGTCATCAATGGTATCCAGAACAGTGAATATCGAAATCTCTACAACCATGAGAACATCTTTGTTTCTGATCATGGAGGTGGTGCTGGAAATAACTGGGCCAGTGGATATCATCAGGTTTTCCTTTTAACAATATTTTTCCTTTTTAGTTTTGATATTCTCATTAGAGAAACATGCCGATATTCTTTCTATGACCTAAAATTTGTACACAACAAAGTTATGTGCCATGCTTCTTAGAATTCTATGCTATCCATGTGGTATCTTTCTGACATTGATTTGAAGACACGATCAGGGAAAAGGTGTTGAAGAGGATATCATGGATATGATTGACAGAGAAGCAGATGGAAGTGACAGCCTTGAGGGTTTTGTTCTATGCCACTCAATTGCCGGAGGGACAGGATCAGGTATCTTCTGCACTTCCTTCTAACCTTTCCTGATCTCTTTATTATTTGTCTCATTTCTATACTGGGGAAATGATAATGGGAAGGAATTGCAGTAAGGTAAGATTCAGACAGATTTTCAATCTTCTAAAACAAGTTTTCCAGACAGATTTTTGTACTATGTGTGCACATGTATTTTTAAGTAAAAGAAACTGTCCAAATTGATTCAGCATTTGAAGCATTGAAGAGTCCTAATTTTAGTTTATGTTATTGATTCTTCAGAACTTGGATGGTTGATAGTGTTTTGTTCACAGATAAATTTGAAGAAAAAAAAAGAAAAAAGATGAAAGAGGAGGTCTAATTCTCAATATTAGAGAATATATTAGAATAAGTAATTGAATTTTATCACTTGATATATTGCAGGCATGGGTTCATATCTCCTGGAGACTCTGAATGATCGCTACAGCAAAAAACTGGTTCAGACGTACAGTGTTTTTCCTAATCAGATGGAAACAAGTGATGTTGTAGTCCAACCTTACAACTCACTTTTGACTTTAAAGCGACTAACACTCAATGCTGATTGTGTTGTAGTTCTTGATAATACTGCCCTAAACAGAATAGCTGTAGAACGCCTTCATTTATCAAATCCAACCTTTGCACAAACAAACTCCTTAGTGTCTACAGTAATGTCAGCTAGCACAACCACTTTGAGATACCCAGGATATATGAACAATGACTTGGTTGGACTCCTGGCCTCTCTAATTCCAACACCAAGATGCCATTTTCTAATGACAGGATACACACCACTCACGGTTGAGCGCCAGGTATTTTTTTTGTGCTTGTAACTTATTTGTATTATACAGTTCCATTATCTCTGAACTAAAATTCTTTGAACATCATCAGTGATTTTGTTGATTGTACAAATCCACAACTTTCAGTTGAATGCTTTGACCTAGTTAAAGTTGGGTCTTGTTTGCACGATTTGTGTTTATCATTTGGCATGATTTGCTTGTAAGGAAGTTTCTAGCAATGGATAATCAAAGTTGTGCATGTTCTTTGCAGGCCAATGTGATAAGGAAAACCACTGTCCTTGACGTCATGAGAAGACTTCTGCAGGTAAGAGTTTTTGGTAGATTTTATTTTTAAGGGAAAAAAAAAAAACCCAATTTGTGCCTTTCATTTGACATTATGACATTATGCTCATTTACAGACAAAAAATATTATGGTCTCCTCATATGCTCGAACAAAAGAAGCTAGTCAAGCGAAATACATATCAATATTGAATATCATACAGGGAGAAGTGGACCCTACACAGGTACCGATGATTAAAACACCTGAGAAAAGAAAAAAGAAAATAATTGGTTTACTGCACGTTTGGTATGGATGTATTTTGACGTTTCCATTATGCTACAGGTTCACGAAAGCTTGCAGAGGATACGTGAAAGAAAGCTTGTGAATTTTATTGAGTGGGGCCCTGCCAGCATTCAGGTTTCTTTGAACAGTAAAATCAACATGTGTTGATATTCCAATTTGCCGTAGTCACTTTTCTGTGTAGAATCATATGAAATATATGCTGGCTTGTTTCAGGTTGCACTCTCACGGAAATCACCATATGTTCAAACTGCCCATAGGGTAAGACACTTCAAATCCCTTTTTAACTTCCTCTCTGCTATTAATTTCATACCTTGGCTTTTCTTTATTGCTAAATTTCAAACTGATGTCATCCGTTTTATTTCTAAATCCTCACTAGAAGAAATAATTTTCAGGTCAGTGGTCTTATGTTAGCTAGCCATACCAGCATCCGGCACCTTTTCAGCAAGTGTTTGAGCCAGTATGAGAAGTTGAGAAAGAAGCAAGCCTTTCTTGACAACTATCGGAAGTTCCCAATGTTTGCTGTAAGATTTCTGTTTCCATATTTTCTGCCCCAATTGACGATCAATTGTTTATTCATTCTAGTTGAACATCTTGTGTCAGGAGAAAATATTGTAACGAATATTGGTTAATCATGTATAAGTTAAGACAATCCATGTTGTTCCTTCCTCATGAATTAATCAATTGGTCAAAGTCTAAAATTGAAAGGAACAATTTCCATTGGACAGGACAATGACCTCTCTGAGTTTGATGAATCAAGGGATATAATCGAAAGTTTGGTTGATGAATATAAGGCTTGTGAGTCGCCAGATTATATCAAATGGGGAATGGAGGTCAGTGTGCCTGTATATCTTTGAACTGCTTCATTTCCTACACAGAATAAGAATCACTTATATGTGCTTTTACGATGCAGGATCCCGACCAGATTCTTACAGGTGATGGAAATGCATCAGGAACAGCAGACCCAAGCTTGGCCGTCTGACAAGGCAAATAAGGTGAAATGGATTTTATTTTTCTGTTGCAACGGGGATGATTTATCTTCTAATTTTGTTCCATGCATTTTTAACCCTAGTAGTCTCCTTAGAACTCAAGGATACAGTTGTACAACTTGTAGAATAGATCTTGTTCAACCAAGGTCATTTTTCACCCTAGTAGTCGGATATGATTATTGGCATCCTCCAACCTTAGCATGGCCTTGACCACAAAATATCTTTTAAAATACGTGCAAATTTATAAGGTGACAAATTGCCAGCTGATCTTATATCATTTCAACTTCAATTTCTATTTTTTGTGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAATATGATCAAGCTGGAAAAGCATTATTTAATTCTTGGAAGGAATATGAGTATATGGCACTAATATAAATATAAATTTCATATGAAAATTCGCCCCCTTCCAACACGGCAATACCCACCCCCAAATAAGCAATTGCACATATATGTACACACACACACACACAAGAAAAAAAAGAAAAGACAATTAATTGTAGATTTTGTATTACGCTGTCACTTAAATAATGCTCCTACCATATTTGCCTTTAGTCAGTGACATCATGACAAGCCCATTCATTTGTGAGAGACCTTTTCTTTATACTGCGCTAGATAATTGACTGCTTGACTTTCCTGATTGCAGGTCATGTTAATGGAGGTTGTACCCGTGTGATTTTTTGTCTTCTATTTTTATTGAGTAAATATCCAATCTGAAAATACAGATACTCTCTGTATTCGTTCGTTAACCTAAGGGTCAGACTCCCACTATTCTAATCCATACGCAATCCATTTAGGTGTCTTCCATTTGCGTCAATGAGGTGTCTTTTGTTTTTATGTTTTGGGACCGTTTGTGTGAGGAAGAGTGAGGCATGTGGCATTGGCATCATAGTTGAGAACAGCATTGTTCATCCTACCATCCACATGTGGTAGGCCTTTCTTACTTTAACATTCCATGTGGTGGGATTTTGTGCCATATTCTACAGACCCATGCTTGATTGAGAATCTTACTTCAACTTTTCCTTCAAGTTGTTAACAGTGTTTGAGGATTATTCCATTGTTGCAGATTACTGGCTTTCTGTTCCAACCTAAAACATTCAGTTATAGAGTGATTGCCAGGTATTACCAATTCCCAATCCTCCATTGATTGATAATATTTCTTCAAACGGCACGTCGGTCTGGTTGTCAATATATTAGTTTGAATTTTGAATTTCGAATTGTGTTATTTTATAGAATCCATAAATGATAATACTGACAGAGGCTGCCTGGCTGTGGGGTTAAAGAGATTGTTTGGCTTAGATTGTCTTGTTGTTTTCTTCAGTGTAATAGAACATTTCTTCATGTAAAATCTATTCAAAATGCAGGGTTGGTTGGAGGAAGACCAGTCTACGTCCATGATTGAAACACATACTGATTTTGATGGACCATTATTTTCAAATTGCATATTGAATGAATAGGATGGTGGGAGTATCTCTAAGTGCTTTTTGAGATTTATGATTGAGTGGGTTTGTAAAGAGTGTTTGAAACTCATCAACTACTGATTAGTAATGGAAGAATGATTTTAGTTATTTTCAAAATTTTCATCTATTAAAGTAAATTAAAGAAAAGAATTCTAAATGTAAATTTTAACTTGTATGATAAATTAGAATAGTTTCCATTCATATAAAGTGAATTTGCTACTTAAGATTACTTTTTGATCCTATTCAAGTGGATGGGATTCTAATCTATAGTACAATAACCAAGAAGATGAGATTGTTTGAAATCAGATCTCAACAATATTTCTCAATTAAAAAAAAAGTGAAATTTAATGATAACATTAAGAAGAAAGATCTTTATCAATATAAGAAGATTATTATTTATCTTTCATATGAAGTAAAAGAAAGAGCCTTTTTACATTCATTATTTTAATCTCTATACTTTCTATACTTTCAATTGGTATTTAAATGTTATTTTTTATTCATGTATTTTAAAATTTGACCGTTTTAGGGTGCAAAAATGGACGTTTTATAATTATATGAATAAACCGAAATTGAAAGTATTTAAATCTTTGAAAATACATGTATAATGCAACAAGAAAGCTTTATATAGCCGTTAATATTAGTTGGAAACTTGAAATTGTAAACGTTAAGGTTACAATGGGACCCACCATTGTACAGTTTTGATCAACTAATGAATGGGTGCCAACTAAATCGACAGACTGTCAGTTTGGGCCCATCACATGGCCTTCTAATCAAATATCTTCAATCAAAATAATTCCGAAAATCAAATACAATTTTATGTGGTTTTAATAACAAAATAAATTAGAAGCCGAACCCCCAAGGAAAAAAAAAAAAAAAAATTAGAAGCCAAAGTGCAAAAATGAGAATAGCTCACATAATTTAATTGGTAGAAGTAAAATAAAATTGAATACTCTCGATGAGATTAGTAATTTAGGTAGAATTTTGAATTCTTTAGATTGAACCACTGAACTTTTAAAATTATTATTTTACTATCTATTATTGGTTTCTTTCAGGACAAATATGGTGAGAGAGATTTGAATGTTGAATTTCAAAAGAAGTAAATATTTAAAAGCATTTTTTTATAAAAAAAAATCAATAACATGTAAGATAATTGTTAGTGATATGTTATTAAATTTGCCTTAATCCATCAATTTAAGTTTTTGAGTCAAATTAAAGCAAGTTGGTTCGAAAGGTCATGTGTTTAAGCACCTATATTGTTGTTTATTTCCTAATTAATATTGATAGCTTAATAAAGTAGGAGTGTTAATAATATGATATTAAATTTACCTTCATTGGATTAATTAAACAATTAGGAAACTTGATCTTTCATCTCTAAAATATGCATGGAAGTTATGGAAAAAAAAAAAAAAACTATAGTTTGTAAGAATAAGTAATTAATTTGCTATATAACTTAGGAAATAAAGAAATAAAGAATAGAAATTGTGGAGCCCACCCATATTGGGATTGAGGAATACGAAGGCAATTGGCAAAGTGGGTTTTGCCGGTTTCTTCTCTATTCTCTCTCTCTTTTATTTATAAAACGAAATCACCACATTATCCTCCGCCCCACTACTCTCCTTCTCTCTCCTCTCTCCTTCTTCAACCTTTGATTTTGATCGGAGCTTCCACTCAATTTGTTCCTTTGGAGAGGGAGACCACCATCAATGGCGTCCTCCAATTCTACTTCTAGGGTTTTCTCCGGCCTCTTCACCTTCTTCGCTCTCATCTTCTTCATTCTTTCGCCACTCGTTGACGCCGCCGCCCCCGCCCCCGCTCCCGCTCCCGCTCCCGCTAGCGACGGTATCCACATTTCCGATTTCTCTACTTCTCTCCCTTCGCATTCTCCTCTTCTTCTCTATCCTTCGTTGATTGAAACCGCATAATCTAGGTCTTTCTTTCAAATTAGGGCACGGATTAATAGGATTCTTAATTTTGTAATTTTGTAATTTTGTGTGAAAACAGGCACCTCCATAGACCAGGGGATTGCGTACGTCCTGATGCTGGTGGCGTTGGTTCTCACGTACCTTATTCACCCTCTCGATGCATCTTCCTACAACTTTTTTCTGAATTGAGTTCCTAGGACTGTAGCAATGTAGGCGCTGTTTGTGGGCGTTTGAGAAGGAACGAGGCCCAACTTTCTTTCTCTGTTTTGTGATAAATGTGTATATTGGGTTTCGCAGATTTTGATCAATTGCTCCATTTATTCATTGTTTATTCGGTTGCTTTATTTTTGTCATAATTTATTCCATGGATGGATCCATTTTGCAAAGCTTTTCAAGTCTG

mRNA sequence

GGAGGTCTTAATTTCAGAGAGTGTTTTTTAACCTAAAAAAAAAAAATTGATTCTCCGTTTCTGCCACTGCCGCAATCTTCCATCGTTTTCCTTTCCGCCTCCTCGTTCTCCTTGCAGTAGTGGCCGCTCCGCGATCTGTGGCTTCTTTTGGAGGCGGCCCAATACAGCGCAGGCTTTAATTGAGGCCCAATTCACAGTTCAAGTGGAATTGAAAGCCCAATTACTTATACTAAGCCCAAGTTATCTATGGCGTAAGTTAGCATCGGTTGCTCTCAGCCGGTTAGAACAGAAGAACTTCAGCTTTAGCTTCCTTCCATTCCACTCTCATTTTTACTCCAAATAGAAAACCATTCACTTTCCGAGTCCTTTCCTTTCCCTCTCTTTCTTCTTCCCATTCAAAAAGGGTTCCGCGCCATACCCGGATTCTTGATTACAGTATCCTCTCTTCCGCCTACCTTCATTTCTCTCCTTTCCATCCTTCTGCTTAGCCCTAATTTCCCAAGGAATATGCCGAGAGAGATCATCACTCTGCAAGTTGGACAATGTGGGAACCAGATCGGAATGGAGTTCTGGAAGCAGCTCTGCCTCGAGCATGGAATCAGCAAAGAGGGCATTCTTGAAGATTTTGCTACTCAGGGAGGAGACCGGAAAGATGTATTCTTCTATCAAGCTGATGATCAGCACTACATACCGAGAGCTTTGCTTATTGACCTGGAGCCCAGGGTCATCAATGGTATCCAGAACAGTGAATATCGAAATCTCTACAACCATGAGAACATCTTTGTTTCTGATCATGGAGGTGGTGCTGGAAATAACTGGGCCAGTGGATATCATCAGGTTTTCCTTTTAACAATATTTTTCCTTTTTAGTTTTGATATTCTCATTAGAGAAACATGCCGATATTCTTTCTATGACCTAAAATTTGTACACAACAAAGTTATGTGCCATGCTTCTTAGAATTCTATGCTATCCATGTGGTATCTTTCTGACATTGATTTGAAGACACGATCAGGGAAAAGGTGTTGAAGAGGATATCATGGATATGATTGACAGAGAAGCAGATGGAAGTGACAGCCTTGAGGGTTTTGTTCTATGCCACTCAATTGCCGGAGGGACAGGATCAGGCATGGGTTCATATCTCCTGGAGACTCTGAATGATCGCTACAGCAAAAAACTGGTTCAGACGTACAGTGTTTTTCCTAATCAGATGGAAACAAGTGATGTTGTAGTCCAACCTTACAACTCACTTTTGACTTTAAAGCGACTAACACTCAATGCTGATTGTGTTGTAGTTCTTGATAATACTGCCCTAAACAGAATAGCTGTAGAACGCCTTCATTTATCAAATCCAACCTTTGCACAAACAAACTCCTTAGTGTCTACAGTAATGTCAGCTAGCACAACCACTTTGAGATACCCAGGATATATGAACAATGACTTGGTTGGACTCCTGGCCTCTCTAATTCCAACACCAAGATGCCATTTTCTAATGACAGGATACACACCACTCACGGTTGAGCGCCAGGCCAATGTGATAAGGAAAACCACTGTCCTTGACGTCATGAGAAGACTTCTGCAGACAAAAAATATTATGGTCTCCTCATATGCTCGAACAAAAGAAGCTAGTCAAGCGAAATACATATCAATATTGAATATCATACAGGGAGAAGTGGACCCTACACAGGTTCACGAAAGCTTGCAGAGGATACGTGAAAGAAAGCTTGTGAATTTTATTGAGTGGGGCCCTGCCAGCATTCAGGTTGCACTCTCACGGAAATCACCATATGTTCAAACTGCCCATAGGGTCAGTGGTCTTATGTTAGCTAGCCATACCAGCATCCGGCACCTTTTCAGCAAGTGTTTGAGCCAGTATGAGAAGTTGAGAAAGAAGCAAGCCTTTCTTGACAACTATCGGAAGTTCCCAATGTTTGCTGACAATGACCTCTCTGAGTTTGATGAATCAAGGGATATAATCGAAAGTTTGGTTGATGAATATAAGGCTTGTGAGTCGCCAGATTATATCAAATGGGGAATGGAGGATCCCGACCAGATTCTTACAGGTGATGGAAATGCATCAGGAACAGCAGACCCAAGCTTGGCCGTCTGACAAGGCAAATAAGGTCATGTTAATGGAGGTTGTACCCGTGTGATTTTTTGTCTTCTATTTTTATTGAGTAAATATCCAATCTGAAAATACAGATACTCTCTGTATTCGTTCGTTAACCTAAGGGTCAGACTCCCACTATTCTAATCCATACGCAATCCATTTAGGTGTCTTCCATTTGCGTCAATGAGGTGTCTTTTGTTTTTATGTTTTGGGACCGTTTGTGTGAGGAAGAGTGAGGCATGTGGCATTGGCATCATAGTTGAGAACAGCATTGTTCATCCTACCATCCACATGTGTTTGTTCCTTTGGAGAGGGAGACCACCATCAATGGCGTCCTCCAATTCTACTTCTAGGGTTTTCTCCGGCCTCTTCACCTTCTTCGCTCTCATCTTCTTCATTCTTTCGCCACTCGTTGACGCCGCCGCCCCCGCCCCCGCTCCCGCTCCCGCTCCCGCTAGCGACGGCACCTCCATAGACCAGGGGATTGCGTACGTCCTGATGCTGGTGGCGTTGGTTCTCACGTACCTTATTCACCCTCTCGATGCATCTTCCTACAACTTTTTTCTGAATTGAGTTCCTAGGACTGTAGCAATGTAGGCGCTGTTTGTGGGCGTTTGAGAAGGAACGAGGCCCAACTTTCTTTCTCTGTTTTGTGATAAATGTGTATATTGGGTTTCGCAGATTTTGATCAATTGCTCCATTTATTCATTGTTTATTCGGTTGCTTTATTTTTGTCATAATTTATTCCATGGATGGATCCATTTTGCAAAGCTTTTCAAGTCTG

Coding sequence (CDS)

ATGAGGTGTCTTTTGTTTTTATGTTTTGGGACCGTTTGTGTGAGGAAGAGTGAGGCATGTGGCATTGGCATCATAGTTGAGAACAGCATTGTTCATCCTACCATCCACATGTGTTTGTTCCTTTGGAGAGGGAGACCACCATCAATGGCGTCCTCCAATTCTACTTCTAGGGTTTTCTCCGGCCTCTTCACCTTCTTCGCTCTCATCTTCTTCATTCTTTCGCCACTCGTTGACGCCGCCGCCCCCGCCCCCGCTCCCGCTCCCGCTCCCGCTAGCGACGGCACCTCCATAGACCAGGGGATTGCGTACGTCCTGATGCTGGTGGCGTTGGTTCTCACGTACCTTATTCACCCTCTCGATGCATCTTCCTACAACTTTTTTCTGAATTGA

Protein sequence

MRCLLFLCFGTVCVRKSEACGIGIIVENSIVHPTIHMCLFLWRGRPPSMASSNSTSRVFSGLFTFFALIFFILSPLVDAAAPAPAPAPAPASDGTSIDQGIAYVLMLVALVLTYLIHPLDASSYNFFLN
Homology
BLAST of CmUC05G081580 vs. NCBI nr
Match: XP_022924337.1 (arabinogalactan peptide 16-like [Cucurbita moschata] >KAG6582645.1 Arabinogalactan protein 16, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 140.2 bits (352), Expect = 1.3e-29
Identity = 75/83 (90.36%), Postives = 77/83 (92.77%), Query Frame = 0

Query: 49  MASSNSTSRVFSGLFTFFALIFFILSPLVDA--AAPAPAPAPAPASDGTSIDQGIAYVLM 108
           MASSNSTSR FSGLFTFF LI FILSPLVDA  + PAPAPAPAPASDGTSIDQG+AYVLM
Sbjct: 1   MASSNSTSRAFSGLFTFFTLILFILSPLVDAHSSVPAPAPAPAPASDGTSIDQGVAYVLM 60

Query: 109 LVALVLTYLIHPLDASSYNFFLN 130
           LVALVLTYLIHPLDASSYNFFLN
Sbjct: 61  LVALVLTYLIHPLDASSYNFFLN 83

BLAST of CmUC05G081580 vs. NCBI nr
Match: XP_023528757.1 (arabinogalactan peptide 16-like [Cucurbita pepo subsp. pepo] >KAG7019041.1 Arabinogalactan peptide 16 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 139.8 bits (351), Expect = 1.6e-29
Identity = 73/81 (90.12%), Postives = 75/81 (92.59%), Query Frame = 0

Query: 49  MASSNSTSRVFSGLFTFFALIFFILSPLVDAAAPAPAPAPAPASDGTSIDQGIAYVLMLV 108
           MASSNSTSR FSGLFTFF LI FILSPLVDA +  PAPAPAPASDGTSIDQG+AYVLMLV
Sbjct: 1   MASSNSTSRAFSGLFTFFTLILFILSPLVDAHSSVPAPAPAPASDGTSIDQGVAYVLMLV 60

Query: 109 ALVLTYLIHPLDASSYNFFLN 130
           ALVLTYLIHPLDASSYNFFLN
Sbjct: 61  ALVLTYLIHPLDASSYNFFLN 81

BLAST of CmUC05G081580 vs. NCBI nr
Match: XP_038905636.1 (arabinogalactan protein 41-like [Benincasa hispida])

HSP 1 Score: 137.9 bits (346), Expect = 6.2e-29
Identity = 75/81 (92.59%), Postives = 76/81 (93.83%), Query Frame = 0

Query: 49  MASSNSTSRVFSGLFTFFALIFFILSPLVDAAAPAPAPAPAPASDGTSIDQGIAYVLMLV 108
           M SSNSTSR FSGLFTFFALIFFILSPLVDA   APAPAPAPASDGTSIDQGIAYVLMLV
Sbjct: 1   MESSNSTSRAFSGLFTFFALIFFILSPLVDA---APAPAPAPASDGTSIDQGIAYVLMLV 60

Query: 109 ALVLTYLIHPLDASSYNFFLN 130
           ALVLTYLIHPLDASSY+FFLN
Sbjct: 61  ALVLTYLIHPLDASSYSFFLN 78

BLAST of CmUC05G081580 vs. NCBI nr
Match: XP_022147463.1 (arabinogalactan peptide 20-like [Momordica charantia])

HSP 1 Score: 134.4 bits (337), Expect = 6.9e-28
Identity = 72/81 (88.89%), Postives = 75/81 (92.59%), Query Frame = 0

Query: 49  MASSNSTSRVFSGLFTFFALIFFILSPLVDAAAPAPAPAPAPASDGTSIDQGIAYVLMLV 108
           MASS STSR FS LFT FALIFFI+SPLVDA + APAPAPAPASDGTSIDQGIAYVLML+
Sbjct: 1   MASSISTSRAFSSLFTAFALIFFIVSPLVDAHSSAPAPAPAPASDGTSIDQGIAYVLMLL 60

Query: 109 ALVLTYLIHPLDASSYNFFLN 130
           ALVLTYLIHPLDASSYNFFLN
Sbjct: 61  ALVLTYLIHPLDASSYNFFLN 81

BLAST of CmUC05G081580 vs. NCBI nr
Match: XP_022979386.1 (arabinogalactan peptide 20-like [Cucurbita maxima])

HSP 1 Score: 132.9 bits (333), Expect = 2.0e-27
Identity = 71/83 (85.54%), Postives = 75/83 (90.36%), Query Frame = 0

Query: 49  MASSNSTSRVFSGLFTFFALIFFILSPLVDA--AAPAPAPAPAPASDGTSIDQGIAYVLM 108
           MASSNST R FSGLFTFF LI FILSPL+DA  + P+PAPAPAPASDGTSIDQG+AYVLM
Sbjct: 1   MASSNSTFRAFSGLFTFFTLILFILSPLIDAHSSVPSPAPAPAPASDGTSIDQGVAYVLM 60

Query: 109 LVALVLTYLIHPLDASSYNFFLN 130
           LVALVLTYLIHPLDASS NFFLN
Sbjct: 61  LVALVLTYLIHPLDASSNNFFLN 83

BLAST of CmUC05G081580 vs. ExPASy Swiss-Prot
Match: Q9M373 (Arabinogalactan protein 20 OS=Arabidopsis thaliana OX=3702 GN=AGP20 PE=1 SV=1)

HSP 1 Score: 82.8 bits (203), Expect = 3.1e-15
Identity = 50/81 (61.73%), Postives = 59/81 (72.84%), Query Frame = 0

Query: 49  MASSNSTSRVFSGLFTFFALIFFILSPLVDAAAPAPAPAPAPASDGTSIDQGIAYVLMLV 108
           MAS NS +     +   FA +F ++SP   A A + APAP+P SDGTSIDQGIAY+LM+V
Sbjct: 1   MASRNSVA-----VIALFAFVFAVISPF--AGAQSLAPAPSPTSDGTSIDQGIAYLLMVV 60

Query: 109 ALVLTYLIHPLDA--SSYNFF 128
           ALVLTYLIHPLDA  SSY FF
Sbjct: 61  ALVLTYLIHPLDASSSSYTFF 74

BLAST of CmUC05G081580 vs. ExPASy Swiss-Prot
Match: O82337 (Arabinogalactan protein 16 OS=Arabidopsis thaliana OX=3702 GN=AGP16 PE=1 SV=1)

HSP 1 Score: 81.3 bits (199), Expect = 9.1e-15
Identity = 50/80 (62.50%), Postives = 59/80 (73.75%), Query Frame = 0

Query: 49  MASSNSTSRVFSGLFTFFALIFFILSPLVDAAAPAPAPAPAPASDGTSIDQGIAYVLMLV 108
           MAS NS +      F  F+ +F ++  L  A A + APAPAP SDGTSIDQGIAY+LM+V
Sbjct: 1   MASRNSVTG-----FALFSFVFAVILSL--AGAQSLAPAPAPTSDGTSIDQGIAYLLMVV 60

Query: 109 ALVLTYLIHPLDA-SSYNFF 128
           ALVLTYLIHPLDA SSY+FF
Sbjct: 61  ALVLTYLIHPLDASSSYSFF 73

BLAST of CmUC05G081580 vs. ExPASy Swiss-Prot
Match: Q8L9T8 (Arabinogalactan protein 41 OS=Arabidopsis thaliana OX=3702 GN=AGP41 PE=1 SV=1)

HSP 1 Score: 74.7 bits (182), Expect = 8.5e-13
Identity = 44/64 (68.75%), Postives = 51/64 (79.69%), Query Frame = 0

Query: 54  STSRVFSGLFTFFALIFFILSPLVDAAAPAPAPAPAPASDGTSIDQGIAYVLMLVALVLT 113
           S SR+F G+ T  ++IF IL P+  A A + APAPAP SDGT+IDQGIAYVLMLVALVLT
Sbjct: 2   SGSRLFFGVSTIVSIIFAILLPM--AHAQSAAPAPAPTSDGTTIDQGIAYVLMLVALVLT 61

Query: 114 YLIH 118
           YLIH
Sbjct: 62  YLIH 63

BLAST of CmUC05G081580 vs. ExPASy Swiss-Prot
Match: Q9FK16 (Arabinogalactan protein 22 OS=Arabidopsis thaliana OX=3702 GN=AGP22 PE=1 SV=1)

HSP 1 Score: 59.7 bits (143), Expect = 2.8e-08
Identity = 34/56 (60.71%), Postives = 41/56 (73.21%), Query Frame = 0

Query: 62  LFTFFALIFFILSPLVDAAAPAPAPAPAPASDGTSIDQGIAYVLMLVALVLTYLIH 118
           +   F +I  IL P+  A + + +PAPAP SDGTSIDQGIAYVLM+VAL LTY IH
Sbjct: 10  ILAVFVIISVILLPI--AQSHSSSPAPAPTSDGTSIDQGIAYVLMMVALALTYFIH 63

BLAST of CmUC05G081580 vs. ExPASy TrEMBL
Match: A0A6J1E8M6 (arabinogalactan peptide 16-like OS=Cucurbita moschata OX=3662 GN=LOC111431859 PE=4 SV=1)

HSP 1 Score: 140.2 bits (352), Expect = 6.1e-30
Identity = 75/83 (90.36%), Postives = 77/83 (92.77%), Query Frame = 0

Query: 49  MASSNSTSRVFSGLFTFFALIFFILSPLVDA--AAPAPAPAPAPASDGTSIDQGIAYVLM 108
           MASSNSTSR FSGLFTFF LI FILSPLVDA  + PAPAPAPAPASDGTSIDQG+AYVLM
Sbjct: 1   MASSNSTSRAFSGLFTFFTLILFILSPLVDAHSSVPAPAPAPAPASDGTSIDQGVAYVLM 60

Query: 109 LVALVLTYLIHPLDASSYNFFLN 130
           LVALVLTYLIHPLDASSYNFFLN
Sbjct: 61  LVALVLTYLIHPLDASSYNFFLN 83

BLAST of CmUC05G081580 vs. ExPASy TrEMBL
Match: A0A6J1D1D7 (arabinogalactan peptide 20-like OS=Momordica charantia OX=3673 GN=LOC111016381 PE=4 SV=1)

HSP 1 Score: 134.4 bits (337), Expect = 3.3e-28
Identity = 72/81 (88.89%), Postives = 75/81 (92.59%), Query Frame = 0

Query: 49  MASSNSTSRVFSGLFTFFALIFFILSPLVDAAAPAPAPAPAPASDGTSIDQGIAYVLMLV 108
           MASS STSR FS LFT FALIFFI+SPLVDA + APAPAPAPASDGTSIDQGIAYVLML+
Sbjct: 1   MASSISTSRAFSSLFTAFALIFFIVSPLVDAHSSAPAPAPAPASDGTSIDQGIAYVLMLL 60

Query: 109 ALVLTYLIHPLDASSYNFFLN 130
           ALVLTYLIHPLDASSYNFFLN
Sbjct: 61  ALVLTYLIHPLDASSYNFFLN 81

BLAST of CmUC05G081580 vs. ExPASy TrEMBL
Match: A0A6J1IQM6 (arabinogalactan peptide 20-like OS=Cucurbita maxima OX=3661 GN=LOC111479127 PE=4 SV=1)

HSP 1 Score: 132.9 bits (333), Expect = 9.7e-28
Identity = 71/83 (85.54%), Postives = 75/83 (90.36%), Query Frame = 0

Query: 49  MASSNSTSRVFSGLFTFFALIFFILSPLVDA--AAPAPAPAPAPASDGTSIDQGIAYVLM 108
           MASSNST R FSGLFTFF LI FILSPL+DA  + P+PAPAPAPASDGTSIDQG+AYVLM
Sbjct: 1   MASSNSTFRAFSGLFTFFTLILFILSPLIDAHSSVPSPAPAPAPASDGTSIDQGVAYVLM 60

Query: 109 LVALVLTYLIHPLDASSYNFFLN 130
           LVALVLTYLIHPLDASS NFFLN
Sbjct: 61  LVALVLTYLIHPLDASSNNFFLN 83

BLAST of CmUC05G081580 vs. ExPASy TrEMBL
Match: A0A0A0L7A0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G120410 PE=4 SV=1)

HSP 1 Score: 128.6 bits (322), Expect = 1.8e-26
Identity = 69/81 (85.19%), Postives = 72/81 (88.89%), Query Frame = 0

Query: 49  MASSNSTSRVFSGLFTFFALIFFILSPLVDAAAPAPAPAPAPASDGTSIDQGIAYVLMLV 108
           M S NSTSR F  LFTFF+LIFFILSPLVDA    PAPAPAP+SDGTSIDQGIAYVLML+
Sbjct: 1   MESFNSTSRAFKALFTFFSLIFFILSPLVDA---TPAPAPAPSSDGTSIDQGIAYVLMLL 60

Query: 109 ALVLTYLIHPLDASSYNFFLN 130
           ALVLTYLIHPLDASSYNFFLN
Sbjct: 61  ALVLTYLIHPLDASSYNFFLN 78

BLAST of CmUC05G081580 vs. ExPASy TrEMBL
Match: A0A6J1IAU1 (arabinogalactan peptide 16-like OS=Cucurbita maxima OX=3661 GN=LOC111473327 PE=4 SV=1)

HSP 1 Score: 120.2 bits (300), Expect = 6.5e-24
Identity = 65/80 (81.25%), Postives = 69/80 (86.25%), Query Frame = 0

Query: 50  ASSNSTSRVFSGLFTFFALIFFILSPLVDAAAPAPAPAPAPASDGTSIDQGIAYVLMLVA 109
           +SS+ST   FSG FT FALIFFIL PLV A + A APAPAPASDGTSIDQGIAYVLML+A
Sbjct: 3   SSSDSTPTAFSGFFTCFALIFFILLPLVHAHSSASAPAPAPASDGTSIDQGIAYVLMLMA 62

Query: 110 LVLTYLIHPLDASSYNFFLN 130
           LVLTYLIHPLDASSY FFLN
Sbjct: 63  LVLTYLIHPLDASSYQFFLN 82

BLAST of CmUC05G081580 vs. TAIR 10
Match: AT3G61640.1 (arabinogalactan protein 20 )

HSP 1 Score: 82.8 bits (203), Expect = 2.2e-16
Identity = 50/81 (61.73%), Postives = 59/81 (72.84%), Query Frame = 0

Query: 49  MASSNSTSRVFSGLFTFFALIFFILSPLVDAAAPAPAPAPAPASDGTSIDQGIAYVLMLV 108
           MAS NS +     +   FA +F ++SP   A A + APAP+P SDGTSIDQGIAY+LM+V
Sbjct: 1   MASRNSVA-----VIALFAFVFAVISPF--AGAQSLAPAPSPTSDGTSIDQGIAYLLMVV 60

Query: 109 ALVLTYLIHPLDA--SSYNFF 128
           ALVLTYLIHPLDA  SSY FF
Sbjct: 61  ALVLTYLIHPLDASSSSYTFF 74

BLAST of CmUC05G081580 vs. TAIR 10
Match: AT2G46330.1 (arabinogalactan protein 16 )

HSP 1 Score: 81.3 bits (199), Expect = 6.4e-16
Identity = 50/80 (62.50%), Postives = 59/80 (73.75%), Query Frame = 0

Query: 49  MASSNSTSRVFSGLFTFFALIFFILSPLVDAAAPAPAPAPAPASDGTSIDQGIAYVLMLV 108
           MAS NS +      F  F+ +F ++  L  A A + APAPAP SDGTSIDQGIAY+LM+V
Sbjct: 1   MASRNSVTG-----FALFSFVFAVILSL--AGAQSLAPAPAPTSDGTSIDQGIAYLLMVV 60

Query: 109 ALVLTYLIHPLDA-SSYNFF 128
           ALVLTYLIHPLDA SSY+FF
Sbjct: 61  ALVLTYLIHPLDASSSYSFF 73

BLAST of CmUC05G081580 vs. TAIR 10
Match: AT5G24105.1 (arabinogalactan protein 41 )

HSP 1 Score: 74.7 bits (182), Expect = 6.0e-14
Identity = 44/64 (68.75%), Postives = 51/64 (79.69%), Query Frame = 0

Query: 54  STSRVFSGLFTFFALIFFILSPLVDAAAPAPAPAPAPASDGTSIDQGIAYVLMLVALVLT 113
           S SR+F G+ T  ++IF IL P+  A A + APAPAP SDGT+IDQGIAYVLMLVALVLT
Sbjct: 2   SGSRLFFGVSTIVSIIFAILLPM--AHAQSAAPAPAPTSDGTTIDQGIAYVLMLVALVLT 61

Query: 114 YLIH 118
           YLIH
Sbjct: 62  YLIH 63

BLAST of CmUC05G081580 vs. TAIR 10
Match: AT5G53250.1 (arabinogalactan protein 22 )

HSP 1 Score: 59.7 bits (143), Expect = 2.0e-09
Identity = 34/56 (60.71%), Postives = 41/56 (73.21%), Query Frame = 0

Query: 62  LFTFFALIFFILSPLVDAAAPAPAPAPAPASDGTSIDQGIAYVLMLVALVLTYLIH 118
           +   F +I  IL P+  A + + +PAPAP SDGTSIDQGIAYVLM+VAL LTY IH
Sbjct: 10  ILAVFVIISVILLPI--AQSHSSSPAPAPTSDGTSIDQGIAYVLMMVALALTYFIH 63

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022924337.11.3e-2990.36arabinogalactan peptide 16-like [Cucurbita moschata] >KAG6582645.1 Arabinogalact... [more]
XP_023528757.11.6e-2990.12arabinogalactan peptide 16-like [Cucurbita pepo subsp. pepo] >KAG7019041.1 Arabi... [more]
XP_038905636.16.2e-2992.59arabinogalactan protein 41-like [Benincasa hispida][more]
XP_022147463.16.9e-2888.89arabinogalactan peptide 20-like [Momordica charantia][more]
XP_022979386.12.0e-2785.54arabinogalactan peptide 20-like [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
Q9M3733.1e-1561.73Arabinogalactan protein 20 OS=Arabidopsis thaliana OX=3702 GN=AGP20 PE=1 SV=1[more]
O823379.1e-1562.50Arabinogalactan protein 16 OS=Arabidopsis thaliana OX=3702 GN=AGP16 PE=1 SV=1[more]
Q8L9T88.5e-1368.75Arabinogalactan protein 41 OS=Arabidopsis thaliana OX=3702 GN=AGP41 PE=1 SV=1[more]
Q9FK162.8e-0860.71Arabinogalactan protein 22 OS=Arabidopsis thaliana OX=3702 GN=AGP22 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A6J1E8M66.1e-3090.36arabinogalactan peptide 16-like OS=Cucurbita moschata OX=3662 GN=LOC111431859 PE... [more]
A0A6J1D1D73.3e-2888.89arabinogalactan peptide 20-like OS=Momordica charantia OX=3673 GN=LOC111016381 P... [more]
A0A6J1IQM69.7e-2885.54arabinogalactan peptide 20-like OS=Cucurbita maxima OX=3661 GN=LOC111479127 PE=4... [more]
A0A0A0L7A01.8e-2685.19Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G120410 PE=4 SV=1[more]
A0A6J1IAU16.5e-2481.25arabinogalactan peptide 16-like OS=Cucurbita maxima OX=3661 GN=LOC111473327 PE=4... [more]
Match NameE-valueIdentityDescription
AT3G61640.12.2e-1661.73arabinogalactan protein 20 [more]
AT2G46330.16.4e-1662.50arabinogalactan protein 16 [more]
AT5G24105.16.0e-1468.75arabinogalactan protein 41 [more]
AT5G53250.12.0e-0960.71arabinogalactan protein 22 [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL531) v1
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR009424Arabinogalactan protein 16/20/22/41PFAMPF06376AGPcoord: 83..117
e-value: 1.1E-18
score: 66.8
IPR009424Arabinogalactan protein 16/20/22/41PANTHERPTHR33374ARABINOGALACTAN PROTEIN 20coord: 55..124
NoneNo IPR availablePANTHERPTHR33374:SF38ARABINOGALACTAN PROTEIN 41coord: 55..124

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmUC05G081580.1CmUC05G081580.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane