CaUC10G187120 (gene) Watermelon (USVL246-FR2) v1

Overview
Name: CaUC10G187120
Type: gene
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: PWWP domain-containing protein
Location: Ciama_Chr10: 17431011 .. 17450574 (-)
RNA-Seq Expression: CaUC10G187120
Synteny: CaUC10G187120
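The Location field in the overview packs a sequence ID, a start, an end, and a strand into one string. The sketch below shows one way to split it apart and compute the span length. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive (as in GFF3), which this page does not state.

```python
import re


def parse_location(text):
    """Split 'SEQID: START .. END (STRAND)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Ciama_Chr10: 17431011 .. 17450574 (-)")
print(seqid, strand, span_length(start, end))  # Ciama_Chr10 - 19564
```

Under that assumption the gene spans 19,564 bp on the minus strand of Ciama_Chr10.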
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAACCCTAACCATATCTCGCATCGATCTTTGCGGATCCTCCATCTGTGCGCACTGTCAGATCATTCTTGCACCTCACTCTTCTTTGCTGGCGGTTCACTTGATATTGCTTGCGTGTTGCTTTACTGGAACTGGGGGTTTGGGGGGTGAGAGAAAAGGGCTTAGATAGGTGAAGAGAGATAAAAAAGTTGGACAGAGAATGGGGAGTCCAGGATCTGGTGCGGTTGATTATGCTGTGGGATCGATCGTGTGGGTCCGAAGGAGGAATGGTTCATGGTGGCCCGGTAAAATCCTGGGTTCTGATGAGCTTTCATCTTCGCACCTTACATCACCTCGATCTGGAACTCCAGTCAAGCTCCTTGGAAGAGAAGATGCCAGTGTGTAAGTTAGGATCCTCCTTACGATCTTTGAATTCTCTACTTATTTCCTCATTTGTTGTTTAGTGAGCTTCCTTTGTTTCATTGGTTATGCTTATATCTGTTTCTTTGTTTGAAGCTTGATTACTGAAGAAAGATGGGAAATATCCAATGCGTCCAATAACAAGTTGAGTTTTAGATGACAGAAATGCATATTTAGGGATAGAACTTTATGCCTTAGTATCCAAGATCAGACTAGATTATGTCACTATTTGCTATGGGAAATTGCGAATTACGAAGAATTTGGAATATGTTTATTGGCTAGAAGATGACTTTATTGGTTGTTTCTGTTCAGGTCGAGAAACACATTGTTAAGAAAGCATGCTAGGAAGGTTGTGAATGAGTTGTAAACTTGTCTCAATTTATAGAGGGTAGATTGTTTAGAACTAGAGCCAATTGTGTTAGTGAAGGAAATCCTTGTGTGAGAAGATAAAGTATAAACTCTGTGCTTGTTATCAGTATGAATTAGGATATACCATTTCATGATTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGATAAGAAACCATTTCATTGATAAATAAAATACCCAAAGATTACTACAAAGAATACCACGTTGGCCTTTAGTTTAGAGGAAAGGAACCAGAGTCTGGTCTGGGATTTGGGAAGACCAGTGACGGCTCATTTGTTTGAGCCTTTTTGCTATCGCTTGTAAGATAAATTGGCGGCGGAGGATTTTGGGATTCTAATGATTTACATTGGATCTAAATTTAACAAGGAATTACAAGATAGAGAGTTTGAAGAGTGGCTGATTTTAGATAGATTGGAAAATGTGGCCTCTAATCACATCAAGATAGCTTAATTTGGAAAGGAGATTCCAGTAAATACTCCTGTAAATTAGGCACCCTTTTGCTAGAGCAAAATGGCATGCTTGAAGCAAACCTAAGAACAGACTTGGGGATTTAAAGTTCATGAAAAGGTAAAGTTGATACAAAAATCCTAGCCAATTGGATTTAGGAACTTGATGTAATAGGATTAGGGTATATTAGGCTATCTTTGTCAAATCCTAATAAATTAGGATTAGTATTAGTTTTCTTGGTTTATTAGGATTAGATTATTTTCTTAGATTAATTAGGACTAGGATTAGGATTAGTTTGGTTTATTCCAATTCTCTATAAAGTGAGAAATTGCTTTTGTTAAATCACCAATTGACCCAAAAGCTTAAGTTGATGGGTGAAGACAAATTTAATTATATAATCTAACACTCCCCCACTTGTGGACTTGAAATATAAGGCAAACTCAACAACACTCTCCCACTTGTGGGCTTGAATTGTGGGCTTGAAATATGAAGAAGACCGAACAAGTGGAAATCAATATTAATTGGGGAGGAATAACATTGGAAGGACTTCAACACATGACCTCCTTGGACCACCTACCCTGATACTATGTTAAATTGCCAATTGACCCAAAAGCTTAAGTTGATGGGTGAAGGTAAATTTAATTATATCATCTAATAGGCTTCTTGTATGGACTACTAATTGATAACAAAGACTTTGATTTATTCTTGGAGTTTTTCTCCTTTTATCATTTAGGCTACATTAGAGCCCTTTGGGTATGTGAACCCCTGAATCTTAAACACTAAAACCCCCCATCAACTCGATTTCAACCCCAAGGATTGGATTAGGCATGGATCGCCTTGAGATCAAATCTCTTGAGACGAAGAATAAGTTCCTTGGTCCAAGATTTGAACACTCCACAAGATAGGATGATCAATTTGATTTCAATGTTCTTGGCATGCAACTCAATACATGCATTGCAAAATTCAAGAAGCTCTCTTGGCTAAAATTGAATGATCAAGTTTAAAGGTGGAAATTCAATTCATTGTAATCTGTTCAAACTTCAAGCAACCTGAAGGCAAGACTTTAGATAGCCTTCCAATTAAAACCCTAATGCCATGTCATAGTTCATAGTAAAATGACAAAAATACCCTACTTAATATATCATAAAAATGATAAAATTGAAAAATACTAAAATTACATAAAAAATATTAAATAAAATATAAATAATAATTTGATGCCTTGGACGGTTTATATCAAAAGTTTTTTGCAGTCCCTTACTGGAAAGGCATAAATCCTCATGAGATCCAAAGGAAGAACCTGCATATCTACCTCCTTCCATGAGCCTGCTTCAACTGCTCGAGGAAGGATGAGTTCATTAGTTACCTAACTTACCTTTTTTGGTGCTCTCTTACGATGGGCTGTAGATTGTTCAAGCTTTTTGGCTGGAATAGTTGTTTACTTGGAAGAGTTTAAGGTGATGCTTCTAGATTTTTTGTTGGTTGGGTTGCTAACTTGTTGTGGGACAAGATTTTCAGCTAATTTGATTTACTGTACTATCGGAACCAAATCTGTGTAGGCAGAAGTTGACGGGGCAGGAATTCTTTTGGAAAGTT
GTCCCAGAGTTAGATATTTGGACACTTGGAAACGAAGGGCTTGTAAAGTGCTATGCTAGGCTCTCTTATTCTTTTGTTACTACTGTAACCAAACCTGGTTTTTGAGGAAACCCCTTTGCTTTATCCACCAGTTTCATGCCGTTTAATAAATTTCCTGTCATGTGTTTGGTAATTTATTGGGTTGAATATCATTTATAAGCTAGAAGGTTGTATACATAGGCTTTGTACTTTTCTTTTTCTGGGAAAAAATGTTATTATTATCATTTTTTATTCTAAAAAATCATTCTTATCATCAGTATACAGTATTTGATGTCTTCATCTACTTCTCAGGGATTGGTACAATTTAGAGAAGTCCAAGCGAGTAAAACCATTCAGATGTGGTGAGTTTGATGATTGTATTGAAAGGGCAGAGTCCTCGCAAGGCATGCCAATAAAGAAGAGAGAGAAATATGCACGGAGGGAGGATGCTATCCTTCATGCACTTGAACTTGAGAAGGAACTACTGAAGAAGCAAGGAAAACTTAATTTATATTCTGATCAAACGACTATTGAATCACTTGATGCCACTGCAAAGAAGGGAATAATTTCTTCAGAACATATAGGAGCTGATGATATCAACGATGGTCCTTCTGAATCCTACCAATTTTCTAAGATAAGAGATGTAAATTATGACAATGAAATTATGGAACCATGTCTTAAAGCAAGTGAAGGAGCTCAACTGAGTGGTGAGGATGACCATTCTGAAGCAAGACCGAGAATGAGAGGCTTGCAGGACTTTGGGCTCAAAATTACTCCTTCAAAAAGAAAGGTTCTATCTTCTTCTGTTGTCTCAAATGGTTTTGAAATGCTTGCAACGAATACCAATCCTCTGGCTCCTGCTCCTCTCGATGGTGTTTGTAACATAGGAAATGATAGCGATGCAAATGGTAAGCACTTTTTGGTTTTAATATATCGAATTGTTCATTGAGGAACATTGGGCAAGGGATGTCAAACTGTTCTTCTTGCTCAAGCATTCCTGAACGATCCCAGAGTAGACTGCCACAGCTAGGTTGACTGCATCATATTTTTTAATTCTTCAGGGATGCAGCAGATTGATCGTGCAAAGAGGAGCAAGTGTATGTATCTTCCAGCTGATTCTAGTGATTCATTGGAATGCAGAGAATCTTCTCTAGGTCAGGTTGAGATGTCAACACCTCATTTAGCAGCCGGGGTTATGCCTTCTCGGCCTGATTCCCTGGTTGAAGAGAATGCTTCTGGTTCATCTGAAAATGATTCTTCTGACTCAGAGACTGATTCTGATTCTTCCAGGTCAGATCAGGACATGGATAATGACATGGCTGCACTTTCAGGTTATTATGTGTTTCTTATATAGATGGATGTTTGTAGCATAGATATTGCACTACTAATTTGTGTGCTACAGATTTCAATGTGTTTGACTTGAATGCGTATATCCTCATGAATAGAACAGTCTCATTTTTCTTAAATGATCTAGAGGCACGTTCTTTAGCTGTTTTTATTAAATGTTTTGGGCTTCCACACCTCCAATATTGTTATTGTGAATGGAGTCTAGCACAATTGATGTGCTTATCTAGGTATATTTCTAGGCTACGTTGTGATCCTTCCACCAGATGCTTTGGGTGATGATGTGTTGTGTGGTAGAAGACCTCCTGTTTTAGTTTCAGAATAGAATATGAAATAGCAAGTCCAGGCATTCAAATCTTTTTTTCAGATTACATCAATCCTGAAGCTCACCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTTAAATTAAGAGACGAGAAGTTCAGTTCTGAAGCTCAAGTATCAATGCTTGTATTGCTTTTAAATAGAAGGTCCAATGATGTTTTTTTCCTTCTCTTTGGTATTCTGTACTAGATGTACTTGTTTACATACAAAAATAAATTTATCACTGAGATACGGTATACCAAACCACTATATATAATCTTAACTTGTACTGCATAAAGGACAGAGAAAAAAAAAACTTAACATTCCTTGAGTTGTTCTCCATGGTTAGATCTCTGAACTGTATAACTGGTTTGCAGTTAGTATTCCTTGGCTTTTGTGCTTAGTGTTTCAATTAATTTTTTCCATATGTTTGGTTTTTTCAGATTCTACTTTGCCTTCAGAAAAGGAGCCGAGTACATTTGAAAGAATGGACACACAAGAGCAAGGGAATATGAGCAGCGAGGAGCCTGATGATTCTGCGCATTCTGGTGATACGTCTCACCTTTATCATCACGACCCTGTATCTACTAACGAAGCAGTGTCTAAGTGGCAATTGAAGGGAAAACGGAATGTTCGTAATTTTTCTAAAAAACCTGTTGGAGTAGATGATGAACCATCAAGCCACCTATGGGTACATGGGCAAACAAGACTTAGTAATAGGAATGATTATTTTGATGACAGCATGGAGGGAGCTGATGCATTGGAAGAGGAATATTACTTGACATCTAAAATGGTACCAAAAGATCAATATATTGTCAGAAATTATATGCCTGACTGGGAAGGCCAGCCTGCTTTGAAAGGATATTGGGATGTCAAAAATCCCTTATATGGTATGCGTCATCATTTTGGTGGGAGGCCAAGAACCATATTAATAGATGTTGATCTGAAGGTTCATGCAAGTTACCAGAAAGAACCTGTTCCTATCGTATCACTTATGAGCAAGTTAAATGGGCAAGCTATAATTGGGCATCCTATTCAAATCGAAACTTTAGAAGATGGTTTTTCTGAAACTATTCTTTCTGATAGTCTAGGCAATACACCCAGTGAAAATGATGGAAGCACAGCGCTTCAACCAGCTTGGAGGACTGCAAGGAGGACAGCAAATGTTCGCATCCCTCGCCCTCATTTACTGACAGTCTTCGATGGTGAAGAAGCTGGCTATGATTCTCCTTTTGCTGATCAAGAAAGGAAATCATCAAGATTCAAAAGAGTAAAAACTGGGGTCTACAATCAGAAGGCAGGCCAGAGTGGGGGCCAG
CCTCACATTCCCCGACCTTCTCATGATAGAAGGCTCCCAAAAAAGCTGGCAAAGAAAGTAAGCTTATCATCTAACCAAAAGACTAGAACATTGTCTTCAATAGCTGTTGAGCAAAATTTTAGTAACATGCCAATACATGATAGTGTAACTTGTCATATGAATGGATCTATGAAACCAGAATCATCTGGGCCCCCAACTGTAGCATGCATACCAGTAAAATTAGTATTCAGTAGATTATTAGAGAAGATCAATAGGCCACCCTCAAAAGCTACTAATAATATGGTATTGTTAAATAATAATTCTAATAGAGATCATTGACAAGTTACGTGTATACTGTTGACAACCCCAAATAAATGTATATCCCTGGTGCCAATTGATTCTAATATCTCCTGACCACCTGAACCTGTTTTGTCAGCCTTGATTCTTTTGGTCAATCAAGTTACTGAGTATGAAGTTGTAATAAGTAATTAGGAACAGTATTCTTTAAGTAGGTAATTGTGGAATTTATATAGGAGAATGTATATTTTTGTTCAATACAAGATAGTTACTTTGAGCTGTTTTACTTTGTTAGAATTACAGAGCTTGAAAAATTACAAGAAATCTGGAGACTGAATAACTGTAAATCTTATATAGTTGTTATTTAAGGGTGGAAAGTCTTCAGTTGCAATGGGCAATTTATTTGTAAATAATTCTGGTCCCTGTACTTTTTTGTATGATGATCATGCTACGGGCATGGTTTTAGTTTGTGATCCTTTTTACATTATGCAGTTGTTTGAGAATATGTTTGTATGCAGAGCTAGTATTTATTGTAGTGCTATTGTCGTTTGCTTAAAGATTTCGTTTTTTGGTTTTTAAAAATTAAACCTATTTTCTTTTCTTTTTTCTTACCATGGTTTTTGAGTCAAATTCCCAGTTAAATTTCAAAAAAAAAANNNNNNNNNNAAAAAAAAAAAAAAAAAAAAAAAAAAATAAAATACATTTTTAGTTTTTAATCAATAGTATTTATTTATTTCTAATATAATAATAATAATATATATGAAAAATGCATGTGAGAGAAGTTATTTGTGGCTATGAAGTTTTCAAAATGGAGTTTTTCAAATTCTATACAAGTAAAGACAGTTTCTCTAAAATCTAAAATTATTACTCTTTATAGATGAAAGCAAAAATTGATAAAATCTAAAATTATTGCTCTTATAGATGAAAGCAAAAATTGATAAAGGAGCTTTAGAAAACAAAAAGTATTGCTCTTATATGAAAGCAAAAATTGATAAGGAGCTTTAGAAAACAAGGTACATTGCTTAACAAAGAGGAGATCTATAGAAGGATATCATAGCCTGGAGATCCACAGAAGGATATCATATAGTCCCAACCCTTGACGTCATCGTAGAAGAAGGAAAATGCACAAAAAAAAAAAAAAAAAAAAAAAAAAAAGAAAAAGAAAAAAAAAATTCATGAAAAGATGGTTGATGAGCAACCAAATTTTCAGATCTCACATAAGGAAAACTCACAATAAAGAAAGATTTAATAATGATACATAGATGGTAACAATACATGCCAAATTAGTCGTTTTTTATTGTTAATATGTTTGCTTTCAATTTTTGCGTTTTCTCTACTCTCGTTTTAGTCAATCAAATTTTAATTACTCAAAACTAATGTACACTTAACCAACCAAACTAATGTTTAAATTTTATTTGTCCATAAACGGTGTAAAATGTCCGTTGTTAGTTGTTATTTTGGTGATAAACAAAGGTCCGTATGAGCAAATATTTGTTAACCAAAATCAATTAGACTTCGTTCAATAATCTATCATTTTTTCTTTTCCATTTCTAGTTTTTAGAAATTGTGTTTGTATTCTCCATGATGTCTTTGCCAAAGTTTTCATCATTTTATACTAAGCACGAATCATATAGTCCGATATCATAAACAAAATTACATTTTTTTTTTTTTTAAATAGGGTTGTTTCCAAATATAGTCAAATAAGTCAAACTACTATGATAGAT
TTCTATTGCTATAGACGTTTATCATAGTTTATCGCTGTTAGACAGTAAATATTTTTTTATATTTGTAAATAGTTTGATATTTTTCTATTTATATTAATTTTCTAAAAAACAAAATCCCTTTTAATTTTTAAAACTTTGCTGCGATATATGAGTTTAATTTAAAGAAAATTAAAAACAAAAAATCAAATTGTTCGAACTTAAATTTACATTACACAATTAACTAAATTTTAAAATATTATGTAAAAAATTTAGAAAAAGTATCAATTTATATCCCTAAACTTTAGGAGTTGTATCAATTTAAACCTTAAACTTTTGTATTGGCCGATTGGTCTTCCAATATGCTAGTGTTTCAACAACTCCCGTTCTTTACATTTCTTCCTAGCTGCTTGGCCTGTAGTAGGTATCCAGTTCACAGCTTTAGGTATTAGCACTATGGCTTTCAACTTAAATTGTTGAAATTTCAACCAATTTGTAGTTGATAGTCAAGGTCATGTAATTCATACATGCCTAGATATTATTAACCGTGCTAATCTTGGAATGGAAGTTATGCATGAACGTAATGCGCAAAACTTCCCTCTAGCCCTAGCTACTGTTGAAGTTCCATCTATCAATGGATAAGACTTCTGTCTTAGCGTATACAAGTTCTTGAAAGTAAAGGAGCATCTTTCGGTAGCAGCATCCATAGCAAGAAAGGTAGTCTCCCATGGTCTAGGGGCCTATGTATCAATTTACACCCTCAATTATATTCTTTTTAAAATAATTTCATTCAAACTTACAATTATATCAATTAAACCCCTAAACTTTTATAATTGAATCAATTTGAACTCTTACACAAAATTTTCTTTAAAAACCATTCATGCATTTATTCTAATAGTTCATTTTGTAAAACAAACATTTGAAGAACTCTACATACATTTGAGAAGTAATGGTTGATTGTCAAAGGTAATCATAATGAAGGGTCTAAGTTTATTAGCTTATGAAAATTTAAGAGTTTACTTGACTTAAATTATAAGTTTTACATAATATTGGTCGAATGAAATTTGAAGGAGGTGTAAATTGATACACTCGTGAAAATTTAGGTTTTAATTGAAAATCCCAAAATTTAGGGACATAAATTGATATTTGTCCAAAGATTTATTTTATTAAATAAAGTTTAAACAATAAAACAAATTTGATCTTAAAACTTTATAAAGGGAAATAAACAATTCGCCTTAAGGTCCGACCTTAGCTCAGTTGGTAGAGCGGAGGACTGTAGTTTGAAAATCCAATTAGAAATCCTTAGGTCGCTGGTTCGATTCCGGCAGGTCGGAATTTTCATTTTACTTTAAAATTATTCCTTATCCAAATATATTCGTTATTCCTCCCTTTTTTTTTCCCGGAAAAACTACTTTATTCCATTTTCTCACTTTATTTTTCTTTTATTTCCATTTCTTCATCTTTCTCTAACCATGAACTCTCTGTTTCCTTTCTCCAGAGCTCAGACCTCTCAACAATGGAGTTTTTTCACTCTCTCATCTTTACCTTCCACATTCTCGCCTCCACTTCTATCTCCGCCCAGCGCACCGCCGCCGCCGCCCCTCCCTGCCGAACTACATGCGGCGCGCTCACCGTCAAGTATCCGTTTGGCACAGGCTACGGCTGCGGTTCTCCGCGGTTTTCCTCTCACGTGACTTGCTCCTCGGACGACCGACTCCTCCTAAACACACACACCGGCGATTATCCAATCACATCGATTTCATATTCCGATTCTACCGTCGTAATCTCGCCGCCGTCCATGTCGACTTGCTCTAAAATGCACGAATTTAGAACCCTAGGCATCGACTGGACTGGCCCCTTCCAACTCGGATCATCGACGTTTCTCCTCCTCGATTGTGAATCCCCTTCCGATTCCCTCTCGATTCGAGGCTCTGCTATTTGCGATTTGTCGTACGCTCATCTCTGCGCTTCGATCTACTCCTGTCCGTCCGTGGTCGACCTCGGCCTGCCGCTGTTCGCGCCGA
CGAATTCCTGCTGCGTGTACTCGCCAGCGAACTTCGACGGCAACGGAGAACTGGACCTGCGGGAGCTCAAGTGCGGGGGATTCTCGTCGGTGGTGAGGCTGGGAGAGTATGAGACGGATCCGATGCGGTGGGAGTATGGAGTGGAATTGAAGTACGGATATGGAGCGTTGGAGAACAGTGTGATGGAAACTAAATGTAAAGGATGTGAGATGAGCGGCGGCGCGTGTGGATTTACTCCGCCGGAGAATTTGTTCGTTTGTGTGTGTGAAAGAGGGTTCAATACGTCGACGGATTGCAAAAGTAATGATCTCAATCAGGAGTTCTTCTGGAGCTCTGCTTCCTCTCCGCTTACCGTTTTGTTTTGTAAGTTAGCCTTCCCGCCACTCATATTTATTTATTTTTATTTGAACTTAAGTTTTTAAACGGTTAAATTACAAATTTAGCATAAAAGAGAGCTGGCTTCAAGTTTTTCCAATGCCTTCCATTACAAGTTGGAAGTGTATGAATTAAACCCTGGATTTGAAATAAACTAACCAAAGAATAAGGATGAATTCACAAGATCCCTGGCTAATTCAACTTTAAGAAAAGAATAAAATACAAAAGTATCGAAGTAAAATAATCGACGTTAGTGAATTAAAAGTTGACATATATAATGCAGAAGTAGACATAAAGAGAATCTCATCTAATTGTCTCATCCATCGAATCTTCAAAGTGGCTTTATTAAAGGAGTTTTAATAAAAGTTTGATCGGGTTATCATCTCCTGACTTCAAAATAATCCTTCTATTATTTATCTGGTATGTAGGGTAGATATGTTACACAAACCATGGTTTTACCCCACCTCTCATGACTTTGACATTGTTCGTCTTTGATATACATATAATGGCTTATGGGGAGAATATTTACGAAATAATACTGGAATACTTAACAAGAATTCTTATACAAAAGTAGAATCTACAAACCATTTTTTTTTTTCACAAAATAATTTTATGACTATTTTGGTGTCTTTTTGATTATCTAATTATTTGTAGCTTAACGACTCTTCAGTCATTTTTTGAGTATTCAAGAGGATTGTGAATCCAAAAAATAAAATTCATCATCATCATCATCTTCTTCGAGGAATACAATCAAAGAAGGTATTTGAGAAGTAGTAGCTTGCTGAAAAAGCTTTTGAAAATTTGCCTCTTATAGTAGTCTGTAATAAGGATGCCAAGAGGGTTGACTTGTTCTCGTTGAAAACTTGTCCAGCTATTTTTGATATATCTTGCGCTGATAGGCGAAACCATTCTTTTACTGACTTAAGATCAGTAGCATGGTCAAACTTATATTTTTTCCACCATATTTGATATTGGGTATTTTTACGAGGCACTTTACTTTGTTAGTTTTAGTTTTGGAGATATTCCAACAGAATATCCACGATATTTGAAAATGAAGGACAAACCAAAAACTTAAAGCAAATACCATAAATATTTGAAATAGAAAGATTCTTGAAATGAGACTATAAATATTTGTTCGGAAAGTCCAAAATATTTCCACCAAGTGATAGGAAAAAAGGTTTTTTTTTTTTTTTTTTTTTTTTTTGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGGGTATTTTAACAGAATTGATTTAACCATGAATAAGCTCAATACCATAAAATATGTGTTCAAGCTTTGATATAATCATGGTAGGAATAGGATGGAGGGTTGAAACTGATAGATAATTTTTTCGCTGTGAAGATTTTTTGATTCCATTTTGTTGCAATAAGAACCTGATAATTTTTAATTTTGAATAGACTATAACAGAGGGGTTATTTCTGTCTTTGATGTGAGTAATCTTTGTTGATTTTGAGTCAATAAGAAGTAGGTCATAGAATTTATGAGATTTGAGTATTGGTTGGGGAATAAATTTGAAGTTATTGGGACTTTTAAATT
GGATTCCAATAGCACGACAAGGGTAAACTCCTCATCAGAGAGAGCATATTGCTCTAACCTCATCAGAGCAGGTTTTTGATAAGAGACTGCGGCTTTAATTGCTCTTTCGATAAGAAGAGGAGGCTTGTGGTTGTAAGATAGAAGAAAAAGAGTTTTCAGAGGAAGATTATTTACACCTTGGAGAGGTGTTCTCTTGAGAGTGAGAGATTATCCCCTAGGTAATTCTCTCATTTCTTGGCGAAGTAAGAAACAAAGTATTGTCTCTCATTCTAGCACTAAATCTGAATATCGTGTCCTTGCTAATGCTACTTCAAAATTATTATGGCTCCGATGGTTGCTTGCTAACATGGGAGTGCCTCAACAGTCGGCTACTAAACTTCATTGTGATAATCACAGTGCTATTCAGATTTCTCATAATGATGTCTTTCATGAACGCACAAAACATATTAAAAATGATTGTCACTTTGTTTGCCATCACCTCTTGAGTAACACTCTTCTCTTACAATCAATTTCCACTACTGAGCAACCTGCTGATATTTTCACCAAAACCTTCTCATCTAATCAGTTCAATCAATTACTTACCAAACTCAAGTTGGTTGCTACTCTACCACCTTGAGTTTGAGAGATAAGATGTGAGCATAAATATATTGTTTGGAGATTTTTATGAATTAGGAAAATCAAGATCCTCAATTTTGCCTTTTTTTCCTTATTTGTATTAACCACAATTTTGTATTTAACCTCCTTTTGTACAATATACAAAATACATAAAAGATATTCAATATGGTATCAATCAATATTAAAATAAGATATTCTTTTGAAACCTCCTTGACGTTTAGGAGCAGGTTTATCTCCTTCTTTATTTCCTTTTTTTTTTTTTTTTTTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTAATGCAACACTTATGCCTTTGAAAATTCTTTGATTTCGTTTGGATCACAATCCTCCAACAATAGCTTCGATTATATTGATGTACAACATTTTAATGTTGTTTTTAATCCATGCTTACCTAGCAATTGATAGATATTTTCCTTTGGGTAAGATCGAAGATCCCAGAATATCCAAAGAGGTGAAGTATTGCTCCAAACATCTTACACTATATGAACAATGAAAGAAAGACAGCTTCATCCTCCATTGCTTGCAAACAAAGCACGTAACAATTTGAACTTAACAGTAGCTATGTCACTTTATTTCTACTTTATTTGTCATACCTGAGTCTCCCAAAGAAGAATCCATGTTAAGACATTTCATCTTCCATAAAGCAATTGATAGATTCCCTGGGATCAAAGAATTTTTTTTTACTGGGAAATGACCCAAGAAGGCCAATTTCCAAAGTCTTGATTGATGTGTTATTTTTAATTAACTTGCAAGGCTGTAATATATTTTAGAATTTCATATGGTGGTTTGATAATCTGAACTTTTTTTTAAATAATATTATTTTAATAAATAATAATAGAAATAAGGTAGAAGGAAAATAAATGACAAAAATAGGTAGAAAAATCGTGGACCCAGTACACGATTTCAGGAAATCGTGGACCCCGTCCATGATTTTCATCTGGTTAAACCTGCCTGCCACGCTGGAGCATGTTTGCCATTTGGCAGGGATAAGAATCGTGGATCGGGTCCACGATTTTCGCCTCCTTTAAACACCCGACGCCCCTTTCTTCTTCACCACATCCTCTCCTCCCCACTTTTCTTCTTTCACCCAAGATTTGTTCTTCCCGTCGCCGTCGTCACCGCCGTTCGTTCCAATTGTCTCCACCGCCACCCCGAGCTCTCACCGAGAGCTAGAAGGTATTATCTTTCTCTCTTAACATTGTATAACATTGTTTAAATTTATGATTATTTTTTATAAAAATATGATTATAGTATTGTTTAGATTTAGATTTTGAATTATTTTTTACTAGGATAATCGTGGACTGGGTCCACGACTATGTAAAATATTTCTTAAATTTATTTTAAGTGATAATATTGTTTAGATTTGATTTATAAGATAAATGATTATTACTGTAATGAACTTTCTATGTTATAATTGTAATTGTAATTTTTATGTTTTAATTGTAATGAACTTTTTATCTTGCAATTGTAATGAACTTTGTATTATAATGAAATAATTTTGAATGATCATTTATGTTTATTAAATTGCAGTGTCAATTATGGACCCTGGTCCTTCTAATTCGGTGCAATTATATCTACAACCAATCCATCGTTCGCAATCAGTATGGGATACTCAATCTATTGTAGTTTTGGGATGTAGGAGAGAGCAGGTAGTTGATCATAGTATAATGTTAGACCCACAAATCGTATCCTACTTAGAACAAGCTGATTTTCTCGGGATTGCACAAATTGGGTTTATTCAGCTAGATTGACATCTGATTACTACTTTAGTGGAGCGATGGAGATCGAAAACCCGTGTGTTTCACATGCCTGGTGGGGAGTGTACCATTACATTGCAAGGCCTTGCCCTCCAGTTTGGGTTACCAGTTTATTCAGCTAGATTGATATCTGATTATTGCTTTAGTAGAGCGATGGAGACCGAAAACCCATACGTTTCACATGCCTGGTGGGGAGTGTACCATTACATTGCAAGACATTGCCCTCCAGTTTGGGTTACCAGTTGATGGAGAACTGTAACCGGATCGTTGGTGTACGACTGGAAACAGGTTTGTGAAGATTTTTTTGGGTGTTCGACCCCCATAATTGAAAGGCTCTAGGTTGAGTATCCCTTGGTTGTCATCCCAATTTATAGAACTATCTTCTGATGTCGACGATATTA
CCATACAGAGGTATGCACGTGCGTATATCATGCAGCTCATTGGAGGATTTTTGTTCGCTGACAAGTCGAACACATTAGTTCACCTGATGTTTCTTTCTTTACTGGGAAGCTTTGAGACTGCTGGTAGGTACTCATGGGGTAGTGCATGTCTTGCATGGCTATATAGGGAATTATGTTGGGTTAGTACTGCAACTGCTTTAGAGATAGCAGACCCATTAATATTATTGCAAGTATGGGCTTATGATTGGTTCTACACCATCGCACCACAAGTTGAGCTACTGTCTCCCAGTCATTTTATTAGACGTCCATTTAATGCCAGGTATTACAGTCCTTACATCTTCATTTCATACTAAATTTATATAATAAACCGTTCTAACTATATTACATATATATGTAGATGGAGCAGTATTCTAGCTGCCTCTGAGCATTCTGTAAACATGTTACTAGTGTACATATTATCATTTGATCGACTAATGCACACTCAGGTAAGACCATATATCTTTTATTAAAAAATAAATATTGCAAAATTGAGTTCTACTTATTAACTTTTTTCAATTTTAGATCAATTGGCATTGTACACACAAGAGATCATGGCATCGTTGTCGGATTACTGCTATAATGGTGAAGATATCTGGTTGACTGTGAGCCCTCTAATATGCTTTCATATTATAGAGTGACATTAGCCAAACCGAGTGTTAGGACAATTTGGAATGCGACAAATGGTGCTACAAATTTGTTATACAGACTGTGACCTGCACCAAATTAATTTAAGGGGTAAGCATGACCAAGATTGGCAACGAATTCATGTAAACCATTTATCCCATTGGCGAGCACATCGTGACCATTGCGTAGGAGGTGACATAGTATATGGACTAGTTGTATCAGATGACTATCTACCTTGGTACGACTCAATCACAAGACGATATATCACACTAGAGGGCACTTATTATTATTGTATGGTAAGAATTTTATTATTTATTTTATTCAATTTTTTTTGTTTCCATATTATTTTAGATTGTTTATTTTTTTTCTTTTGTCCATACAGAATAATTTTGTTCAGGATGTACAACAATATTCAGTGCATCATAATTTACTAGAGCTAAGTTCGATCTGTGATCAATGTGTTGTAAATATCGAAGATATCGTTCAACAGACATGTAGACTTAATGTTGCCAACACTAATAGAAGACGCGTTCGTCGTCGACGTCAATGACAACAAGCCGAAAATATTCCAGAGCAACACATTGACCCATAACCTCAAAGAGATATCGCATCGTTCTTGTCATCAAAATGAGTATTATGCACCTGATGCATTTCCATCTGTTGTCACCCAAAAAGACATCGATGATGTAATTCATCATCAAGAACGTTATAAAGTTCCTGTACCTTGTCGTGGCCGACGTGAGAGAAGCTGTCCACTATGTGGTACATGAGTTTGCATTAACGTGTAATTGACAATACTCACTTTACGTGTTGTATTTAAGAATATTTATCATTATTACGCTTATTATGTGTCAAACTTTATACTTTTCATGTTTAAATTGAAAAACCTAATTTTATGTTA

mRNA sequence

ATGGAAACCCTAACCATATCTCGCATCGATCTTTGCGGATCCTCCATCTGTGCGCACTGTCAGATCATTCTTGCACCTCACTCTTCTTTGCTGGCGGTTCACTTGATATTGCTTGCGTGTTGCTTTACTGGAACTGGGGGATCTGGTGCGGTTGATTATGCTGTGGGATCGATCGTGTGGGTCCGAAGGAGGAATGGTTCATGGTGGCCCGGTAAAATCCTGGGTTCTGATGAGCTTTCATCTTCGCACCTTACATCACCTCGATCTGGAACTCCAGTCAAGCTCCTTGGAAGAGAAGATGCCAGTGTGGATTGGTACAATTTAGAGAAGTCCAAGCGAGTAAAACCATTCAGATGTGGTGAGTTTGATGATTGTATTGAAAGGGCAGAGTCCTCGCAAGGCATGCCAATAAAGAAGAGAGAGAAATATGCACGGAGGGAGGATGCTATCCTTCATGCACTTGAACTTGAGAAGGAACTACTGAAGAAGCAAGGAAAACTTAATTTATATTCTGATCAAACGACTATTGAATCACTTGATGCCACTGCAAAGAAGGGAATAATTTCTTCAGAACATATAGGAGCTGATGATATCAACGATGGTCCTTCTGAATCCTACCAATTTTCTAAGATAAGAGATGTAAATTATGACAATGAAATTATGGAACCATGTCTTAAAGCAAGTGAAGGAGCTCAACTGAGTGGTGAGGATGACCATTCTGAAGCAAGACCGAGAATGAGAGGCTTGCAGGACTTTGGGCTCAAAATTACTCCTTCAAAAAGAAAGGTTCTATCTTCTTCTGTTGTCTCAAATGGTTTTGAAATGCTTGCAACGAATACCAATCCTCTGGCTCCTGCTCCTCTCGATGGTGTTTGTAACATAGGAAATGATAGCGATGCAAATGGGATGCAGCAGATTGATCGTGCAAAGAGGAGCAAGTGTATGTATCTTCCAGCTGATTCTAGTGATTCATTGGAATGCAGAGAATCTTCTCTAGGTCAGGTTGAGATGTCAACACCTCATTTAGCAGCCGGGGTTATGCCTTCTCGGCCTGATTCCCTGGTTGAAGAGAATGCTTCTGGTTCATCTGAAAATGATTCTTCTGACTCAGAGACTGATTCTGATTCTTCCAGGTCAGATCAGGACATGGATAATGACATGGCTGCACTTTCAGATTCTACTTTGCCTTCAGAAAAGGAGCCGAGTACATTTGAAAGAATGGACACACAAGAGCAAGGGAATATGAGCAGCGAGGAGCCTGATGATTCTGCGCATTCTGGTGATACGTCTCACCTTTATCATCACGACCCTGTATCTACTAACGAAGCAGTGTCTAAGTGGCAATTGAAGGGAAAACGGAATGTTCGTAATTTTTCTAAAAAACCTGTTGGAGTAGATGATGAACCATCAAGCCACCTATGGGTACATGGGCAAACAAGACTTAGTAATAGGAATGATTATTTTGATGACAGCATGGAGGGAGCTGATGCATTGGAAGAGGAATATTACTTGACATCTAAAATGGTACCAAAAGATCAATATATTGTCAGAAATTATATGCCTGACTGGGAAGGCCAGCCTGCTTTGAAAGGATATTGGGATGTCAAAAATCCCTTATATGGTATGCGTCATCATTTTGGTGGGAGGCCAAGAACCATATTAATAGATGTTGATCTGAAGGTTCATGCAAGTTACCAGAAAGAACCTGTTCCTATCGTATCACTTATGAGCAAGTTAAATGGGCAAGCTATAATTGGGCATCCTATTCAAATCGAAACTTTAGAAGATGGTTTTTCTGAAACTATTCTTTCTGATAGTCTAGGCAATACACCCAGTGAAAATGATGGAAGCACAGCGCTTCAACCAGCTTGGAGGACTGCAAGGAGGACAGCAAATGTTCGCATCCCTCGCCCTCATTTACTGACAGTCTTCGATGGTGAAGAAGCTGGCTATGATTCTCCTTTTGCTGATCAAGAAAGGAAATCATCAAGATTCAAAAG
AGTAAAAACTGGGGTCTACAATCAGAAGGCAGGCCAGAGTGGGGGCCAGCCTCACATTCCCCGACCTTCTCATGATAGAAGGCTCCCAAAAAAGCTGGCAAAGAAAGTAAGCTTATCATCTAACCAAAAGACTAGAACATTGTCTTCAATAGCTGTTGAGCAAAATTTTAGTAACATGCCAATACATGATAGTGTAACTTGTCATATGAATGGATCTATGAAACCAGAATCATCTGGGCCCCCAACTGTAGCATGCATACCAGTAAAATTAGTATTCAGTAGATTATTAGAGAAGATCAATAGGCCACCCTCAAAAGCTACTAATAATATGAGCTCAGACCTCTCAACAATGGAGTTTTTTCACTCTCTCATCTTTACCTTCCACATTCTCGCCTCCACTTCTATCTCCGCCCAGCGCACCGCCGCCGCCGCCCCTCCCTGCCGAACTACATGCGGCGCGCTCACCGTCAAGTATCCGTTTGGCACAGGCTACGGCTGCGGTTCTCCGCGGTTTTCCTCTCACGTGACTTGCTCCTCGGACGACCGACTCCTCCTAAACACACACACCGGCGATTATCCAATCACATCGATTTCATATTCCGATTCTACCGTCACTGGCCCCTTCCAACTCGGATCATCGACGTTTCTCCTCCTCGATTGTGAATCCCCTTCCGATTCCCTCTCGATTCGAGGCTCTGCTATTTGCGATTTGTCGTACGCTCATCTCTGCGCTTCGATCTACTCCTGTCCGTCCGTGGTCGACCTCGGCCTGCCGCTGTTCGCGCCGACGAATTCCTGCTGCGTGTACTCGCCAGCGAACTTCGACGGCAACGGAGAACTGGACCTGCGGGAGCTCAAGTGCGGGGGATTCTCGTCGGTGGTGAGGCTGGGAGAGTATGAGACGGATCCGATGCGGTGGGAGTATGGAGTGGAATTGAAGTACGGATATGGAGCGTTGGAGAACAGTGTGATGGAAACTAAATGTAAAGGATGTGAGATGAGCGGCGGCGCGTGTGGATTTACTCCGCCGGAGAATTTGTTCGTTTGTGTGTGTGAAAGAGGGTTCAATACGTCGACGGATTGCAAAAGTAATGATCTCAATCAGGAGTTCTTCTGGAGCTCTGCTTCCTCTCCGCTTACCGTTTTGTTTTCACGACAAGGCACTAAATCTGAATATCGTGTCCTTGCTAATGCTACTTCAAAATTATTATGGCTCCGATGGTTGCTTGCTAACATGGGAGTGCCTCAACAGTCGGCTACTAAACTTCATTGTGATAATCACAGTGCTATTCAGATTTCTCATAATGATGTCTTTCATGAACGCACAAAACATATTAAAAATGATTGTCACTTTGTTTGCCATCACCTCTTGAGTAACACTCTTCTCTTACAATCAATTTCCACTACTGAGCAACCTGCTGATATTTTCACCAAAACCTTCTCATCTAATCAGTTCAATCAATTACTTACCAAACTCAAGGATAAGAATCGTGGATCGGGTCCACGATTTTCGCCTCCTTTAAACACCCGACGCCCCTTTCTTCTTCACCACATCCTCTCCTCCCCACTTTTCTTCTTTCACCCAAGATTTGTTCTTCCCGTCGCCGTCGTCACCGCCGTTCGTTCCAATTGTCTCCACCGCCACCCCGAGCTCTCACCGAGAGCTAGAAGGTATTATCTTTCTCTCTTAACATTGTATAACATTGTTTAAATTTATGATTATTTTTTATAAAAATATGATTATAGTATTGTTTAGATTTAGATTTTGAATTATTTTTTACTAGGATAATCGTGGACTGGGTCCACGACTATGTAAAATATTTCTTAAATTTATTTTAAGTGATAATATTGTTTAGATTTGATTTATAAGATAAATGATTATTACTGTAATGAACTTTCTATGTTATAATTGTAATTGTAATTTTTATGTTTTAATTGTAATGAACTTTTTATCTTGCAATTGTAATGAACTTTGTATTATAATGAAATAATTTTGAATG
ATCATTTATGTTTATTAAATTGCAGTGTCAATTATGGACCCTGGTCCTTCTAATTCGGTGCAATTATATCTACAACCAATCCATCGTTCGCAATCAGTATGGGATACTCAATCTATTGTAGTTTTGGGATGTAGGAGAGAGCAGGTAGTTGATCATAGTATAATGTTAGACCCACAAATCGTATCCTACTTAGAACAAGCTGATTTTCTCGGGATTGCACAAATTGGGTTTATTCAGCTAGATTGACATCTGATTACTACTTTAGTGGAGCGATGGAGATCGAAAACCCGTGTGTTTCACATGCCTGGTGGGGAGTGTACCATTACATTGCAAGGCCTTGCCCTCCAGTTTGGGTTACCAGTTTATTCAGCTAGATTGATATCTGATTATTGCTTTAGTAGAGCGATGGAGACCGAAAACCCATACGTTTCACATGCCTGGTGGGGAGTGTACCATTACATTGCAAGACATTGCCCTCCAGTTTGGGTTACCAGTTGATGGAGAACTGTAACCGGATCGTTGGTGTACGACTGGAAACAGGTTTGTGAAGATTTTTTTGGGTGTTCGACCCCCATAATTGAAAGGCTCTAGGTTGAGTATCCCTTGGTTGTCATCCCAATTTATAGAACTATCTTCTGATGTCGACGATATTACCATACAGAGGTATGCACGTGCGTATATCATGCAGCTCATTGGAGGATTTTTGTTCGCTGACAAGTCGAACACATTAGTTCACCTGATGTTTCTTTCTTTACTGGGAAGCTTTGAGACTGCTGGTAGGTACTCATGGGGTAGTGCATGTCTTGCATGGCTATATAGGGAATTATGTTGGGTTAGTACTGCAACTGCTTTAGAGATAGCAGACCCATTAATATTATTGCAAGTATGGGCTTATGATTGGTTCTACACCATCGCACCACAAGTTGAGCTACTGTCTCCCAGTCATTTTATTAGACGTCCATTTAATGCCAGGTATTACAGTCCTTACATCTTCATTTCATACTAAATTTATATAATAAACCGTTCTAACTATATTACATATATATGTAGATGGAGCAGTATTCTAGCTGCCTCTGAGCATTCTGTAAACATGTTACTAGTGTACATATTATCATTTGATCGACTAATGCACACTCAGATCAATTGGCATTGTACACACAAGAGATCATGGCATCGTTGTCGGATTACTGCTATAATGGTGAAGATATCTGGTTGACTGTGAGCCCTCTAATATGCTTTCATATTATAGAGTGACATTAGCCAAACCGAGTGTTAGGACAATTTGGAATGCGACAAATGGTGCTACAAATTTGTTATACAGACTGTGACCTGCACCAAATTAATTTAAGGGGTAAGCATGACCAAGATTGGCAACGAATTCATGTAAACCATTTATCCCATTGGCGAGCACATCGTGACCATTGCGTAGGAGGTGACATAGTATATGGACTAGTTGTATCAGATGACTATCTACCTTGAATAATTTTGTTCAGGATGTACAACAATATTCAGTGCATCATAATTTACTAGAGCTAAGTTCGATCTGTGATCAATGTGTTGTAAATATCGAAGATATCGTTCAACAGACATGTAGACTTAATGTTGCCAACACTAATAGAAGACGCGTTCGTCGTCGACGTCAATGACAACAAGCCGAAAATATTCCAGAGCAACACATTGACCCATAACCTCAAAGAGATATCGCATCGTTCTTGTCATCAAAATGAGTATTATGCACCTGATGCATTTCCATCTGTTGTCACCCAAAAAGACATCGATGATGTAATTCATCATCAAGAACGTTATAAAGTTCCTGTACCTTGTCGTGGCCGACGTGAGAGAAGCTGTCCACTATGTGGTACATGAGTTTGCATTAACGTGTAATTGACAATACTCACTTTACGTGTTGTATTTAAGAATATTTATCATTATTACGCTTATTATGTGTCAAACTTTATACTTTTCATGTTTAAATTGAAAAACCTAATTTTATGTTA

Coding sequence (CDS)

ATGGAAACCCTAACCATATCTCGCATCGATCTTTGCGGATCCTCCATCTGTGCGCACTGTCAGATCATTCTTGCACCTCACTCTTCTTTGCTGGCGGTTCACTTGATATTGCTTGCGTGTTGCTTTACTGGAACTGGGGGATCTGGTGCGGTTGATTATGCTGTGGGATCGATCGTGTGGGTCCGAAGGAGGAATGGTTCATGGTGGCCCGGTAAAATCCTGGGTTCTGATGAGCTTTCATCTTCGCACCTTACATCACCTCGATCTGGAACTCCAGTCAAGCTCCTTGGAAGAGAAGATGCCAGTGTGGATTGGTACAATTTAGAGAAGTCCAAGCGAGTAAAACCATTCAGATGTGGTGAGTTTGATGATTGTATTGAAAGGGCAGAGTCCTCGCAAGGCATGCCAATAAAGAAGAGAGAGAAATATGCACGGAGGGAGGATGCTATCCTTCATGCACTTGAACTTGAGAAGGAACTACTGAAGAAGCAAGGAAAACTTAATTTATATTCTGATCAAACGACTATTGAATCACTTGATGCCACTGCAAAGAAGGGAATAATTTCTTCAGAACATATAGGAGCTGATGATATCAACGATGGTCCTTCTGAATCCTACCAATTTTCTAAGATAAGAGATGTAAATTATGACAATGAAATTATGGAACCATGTCTTAAAGCAAGTGAAGGAGCTCAACTGAGTGGTGAGGATGACCATTCTGAAGCAAGACCGAGAATGAGAGGCTTGCAGGACTTTGGGCTCAAAATTACTCCTTCAAAAAGAAAGGTTCTATCTTCTTCTGTTGTCTCAAATGGTTTTGAAATGCTTGCAACGAATACCAATCCTCTGGCTCCTGCTCCTCTCGATGGTGTTTGTAACATAGGAAATGATAGCGATGCAAATGGGATGCAGCAGATTGATCGTGCAAAGAGGAGCAAGTGTATGTATCTTCCAGCTGATTCTAGTGATTCATTGGAATGCAGAGAATCTTCTCTAGGTCAGGTTGAGATGTCAACACCTCATTTAGCAGCCGGGGTTATGCCTTCTCGGCCTGATTCCCTGGTTGAAGAGAATGCTTCTGGTTCATCTGAAAATGATTCTTCTGACTCAGAGACTGATTCTGATTCTTCCAGGTCAGATCAGGACATGGATAATGACATGGCTGCACTTTCAGATTCTACTTTGCCTTCAGAAAAGGAGCCGAGTACATTTGAAAGAATGGACACACAAGAGCAAGGGAATATGAGCAGCGAGGAGCCTGATGATTCTGCGCATTCTGGTGATACGTCTCACCTTTATCATCACGACCCTGTATCTACTAACGAAGCAGTGTCTAAGTGGCAATTGAAGGGAAAACGGAATGTTCGTAATTTTTCTAAAAAACCTGTTGGAGTAGATGATGAACCATCAAGCCACCTATGGGTACATGGGCAAACAAGACTTAGTAATAGGAATGATTATTTTGATGACAGCATGGAGGGAGCTGATGCATTGGAAGAGGAATATTACTTGACATCTAAAATGGTACCAAAAGATCAATATATTGTCAGAAATTATATGCCTGACTGGGAAGGCCAGCCTGCTTTGAAAGGATATTGGGATGTCAAAAATCCCTTATATGGTATGCGTCATCATTTTGGTGGGAGGCCAAGAACCATATTAATAGATGTTGATCTGAAGGTTCATGCAAGTTACCAGAAAGAACCTGTTCCTATCGTATCACTTATGAGCAAGTTAAATGGGCAAGCTATAATTGGGCATCCTATTCAAATCGAAACTTTAGAAGATGGTTTTTCTGAAACTATTCTTTCTGATAGTCTAGGCAATACACCCAGTGAAAATGATGGAAGCACAGCGCTTCAACCAGCTTGGAGGACTGCAAGGAGGACAGCAAATGTTCGCATCCCTCGCCCTCATTTACTGACAGTCTTCGATGGTGAAGAAGCTGGCTATGATTCTCCTTTTGCTGATCAAGAAAGGAAATCATCAAGATTCAAAAG
AGTAAAAACTGGGGTCTACAATCAGAAGGCAGGCCAGAGTGGGGGCCAGCCTCACATTCCCCGACCTTCTCATGATAGAAGGCTCCCAAAAAAGCTGGCAAAGAAAGTAAGCTTATCATCTAACCAAAAGACTAGAACATTGTCTTCAATAGCTGTTGAGCAAAATTTTAGTAACATGCCAATACATGATAGTGTAACTTGTCATATGAATGGATCTATGAAACCAGAATCATCTGGGCCCCCAACTGTAGCATGCATACCAGTAAAATTAGTATTCAGTAGATTATTAGAGAAGATCAATAGGCCACCCTCAAAAGCTACTAATAATATGAGCTCAGACCTCTCAACAATGGAGTTTTTTCACTCTCTCATCTTTACCTTCCACATTCTCGCCTCCACTTCTATCTCCGCCCAGCGCACCGCCGCCGCCGCCCCTCCCTGCCGAACTACATGCGGCGCGCTCACCGTCAAGTATCCGTTTGGCACAGGCTACGGCTGCGGTTCTCCGCGGTTTTCCTCTCACGTGACTTGCTCCTCGGACGACCGACTCCTCCTAAACACACACACCGGCGATTATCCAATCACATCGATTTCATATTCCGATTCTACCGTCACTGGCCCCTTCCAACTCGGATCATCGACGTTTCTCCTCCTCGATTGTGAATCCCCTTCCGATTCCCTCTCGATTCGAGGCTCTGCTATTTGCGATTTGTCGTACGCTCATCTCTGCGCTTCGATCTACTCCTGTCCGTCCGTGGTCGACCTCGGCCTGCCGCTGTTCGCGCCGACGAATTCCTGCTGCGTGTACTCGCCAGCGAACTTCGACGGCAACGGAGAACTGGACCTGCGGGAGCTCAAGTGCGGGGGATTCTCGTCGGTGGTGAGGCTGGGAGAGTATGAGACGGATCCGATGCGGTGGGAGTATGGAGTGGAATTGAAGTACGGATATGGAGCGTTGGAGAACAGTGTGATGGAAACTAAATGTAAAGGATGTGAGATGAGCGGCGGCGCGTGTGGATTTACTCCGCCGGAGAATTTGTTCGTTTGTGTGTGTGAAAGAGGGTTCAATACGTCGACGGATTGCAAAAGTAATGATCTCAATCAGGAGTTCTTCTGGAGCTCTGCTTCCTCTCCGCTTACCGTTTTGTTTTCACGACAAGGCACTAAATCTGAATATCGTGTCCTTGCTAATGCTACTTCAAAATTATTATGGCTCCGATGGTTGCTTGCTAACATGGGAGTGCCTCAACAGTCGGCTACTAAACTTCATTGTGATAATCACAGTGCTATTCAGATTTCTCATAATGATGTCTTTCATGAACGCACAAAACATATTAAAAATGATTGTCACTTTGTTTGCCATCACCTCTTGAGTAACACTCTTCTCTTACAATCAATTTCCACTACTGAGCAACCTGCTGATATTTTCACCAAAACCTTCTCATCTAATCAGTTCAATCAATTACTTACCAAACTCAAGGATAAGAATCGTGGATCGGGTCCACGATTTTCGCCTCCTTTAAACACCCGACGCCCCTTTCTTCTTCACCACATCCTCTCCTCCCCACTTTTCTTCTTTCACCCAAGATTTGTTCTTCCCGTCGCCGTCGTCACCGCCGTTCGTTCCAATTGTCTCCACCGCCACCCCGAGCTCTCACCGAGAGCTAGAAGGTATTATCTTTCTCTCTTAACATTGTATAACATTGTTTAA

Protein sequence

METLTISRIDLCGSSICAHCQIILAPHSSLLAVHLILLACCFTGTGGSGAVDYAVGSIVWVRRRNGSWWPGKILGSDELSSSHLTSPRSGTPVKLLGREDASVDWYNLEKSKRVKPFRCGEFDDCIERAESSQGMPIKKREKYARREDAILHALELEKELLKKQGKLNLYSDQTTIESLDATAKKGIISSEHIGADDINDGPSESYQFSKIRDVNYDNEIMEPCLKASEGAQLSGEDDHSEARPRMRGLQDFGLKITPSKRKVLSSSVVSNGFEMLATNTNPLAPAPLDGVCNIGNDSDANGMQQIDRAKRSKCMYLPADSSDSLECRESSLGQVEMSTPHLAAGVMPSRPDSLVEENASGSSENDSSDSETDSDSSRSDQDMDNDMAALSDSTLPSEKEPSTFERMDTQEQGNMSSEEPDDSAHSGDTSHLYHHDPVSTNEAVSKWQLKGKRNVRNFSKKPVGVDDEPSSHLWVHGQTRLSNRNDYFDDSMEGADALEEEYYLTSKMVPKDQYIVRNYMPDWEGQPALKGYWDVKNPLYGMRHHFGGRPRTILIDVDLKVHASYQKEPVPIVSLMSKLNGQAIIGHPIQIETLEDGFSETILSDSLGNTPSENDGSTALQPAWRTARRTANVRIPRPHLLTVFDGEEAGYDSPFADQERKSSRFKRVKTGVYNQKAGQSGGQPHIPRPSHDRRLPKKLAKKVSLSSNQKTRTLSSIAVEQNFSNMPIHDSVTCHMNGSMKPESSGPPTVACIPVKLVFSRLLEKINRPPSKATNNMSSDLSTMEFFHSLIFTFHILASTSISAQRTAAAAPPCRTTCGALTVKYPFGTGYGCGSPRFSSHVTCSSDDRLLLNTHTGDYPITSISYSDSTVTGPFQLGSSTFLLLDCESPSDSLSIRGSAICDLSYAHLCASIYSCPSVVDLGLPLFAPTNSCCVYSPANFDGNGELDLRELKCGGFSSVVRLGEYETDPMRWEYGVELKYGYGALENSVMETKCKGCEMSGGACGFTPPENLFVCVCERGFNTSTDCKSNDLNQEFFWSSASSPLTVLFSRQGTKSEYRVLANATSKLLWLRWLLANMGVPQQSATKLHCDNHSAIQISHNDVFHERTKHIKNDCHFVCHHLLSNTLLLQSISTTEQPADIFTKTFSSNQFNQLLTKLKDKNRGSGPRFSPPLNTRRPFLLHHILSSPLFFFHPRFVLPVAVVTAVRSNCLHRHPELSPRARRYYLSLLTLYNIV
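The protein sequence above corresponds to the translation of the coding sequence. As a minimal sketch, the snippet below translates the first 30 nucleotides and the last 12 nucleotides of the CDS shown on this page. It assumes the standard genetic code; `translate` and `CODON_TABLE` are illustrative names, not part of any tool used to build this record.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3].upper(), "X")  # X = unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGAAACCCTAACCATATCTCGCATCGAT"))  # METLTISRID
print(translate("AACATTGTTTAA"))                    # NIV
```

Both results agree with the start (METLTISRID...) and the end (...NIV) of the protein sequence listed above.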
Homology
BLAST of CaUC10G187120 vs. NCBI nr
Match: XP_038897417.1 (uncharacterized protein At1g51745 isoform X6 [Benincasa hispida])

HSP 1 Score: 1274.6 bits (3297), Expect = 0.0e+00
Identity = 662/736 (89.95%), Positives = 684/736 (92.93%), Query Frame = 0
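The percentages in the HSP summary follow directly from the reported counts: matches divided by the alignment length of 736 columns. A small sketch of that arithmetic (the helper name `percent` is hypothetical):

```python
def percent(matches, aligned_length):
    """Share of alignment columns that match, as a percentage with 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)


identity = percent(662, 736)   # identical residues
positives = percent(684, 736)  # identical or conservatively substituted residues
print(identity, positives)     # 89.95 92.93
```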

Query: 44  GTGGSGAVDYAVGSIVWVRRRNGSWWPGKILGSDELSSSHLTSPRSGTPVKLLGREDASV 103
           G+ GSGAVDYAVGSIVWVRRRNGSWWPGKILGSDELSSSHLTSPRSGTPVKLLGREDASV
Sbjct: 2   GSPGSGAVDYAVGSIVWVRRRNGSWWPGKILGSDELSSSHLTSPRSGTPVKLLGREDASV 61

Query: 104 DWYNLEKSKRVKPFRCGEFDDCIERAESSQGMPIKKREKYARREDAILHALELEKELLKK 163
           DWYNLEKSKRVKPFRCGEFDDCIERAESSQGMPIKKREKYARREDAILHALELEKELL K
Sbjct: 62  DWYNLEKSKRVKPFRCGEFDDCIERAESSQGMPIKKREKYARREDAILHALELEKELLNK 121

Query: 164 QGKLNLYSDQTTIESLDATAKKGIISSEHIGADDINDGPSESYQFSKIRDVNYDNEIMEP 223
           QGKLNLY DQT I S  ATAKKGIISS+HIG  DINDG SES QFSKI DVNYDNEI +P
Sbjct: 122 QGKLNLYFDQTIIGSPGATAKKGIISSDHIGTGDINDGHSESLQFSKIIDVNYDNEI-DP 181

Query: 224 CLKASEGAQLSGEDDHSEARPRMRGLQDFGLKITPSKRKVLSSSVVSNGFEMLATNTNPL 283
           CLKA+EGAQ SGEDDHSEARPRMRGLQDFGL+IT SKRKVLSSSVVSNGFEMLAT+T+ L
Sbjct: 182 CLKANEGAQRSGEDDHSEARPRMRGLQDFGLRITSSKRKVLSSSVVSNGFEMLATDTSVL 241

Query: 284 APAPLDGVCNIGNDS-DANGMQQIDRAKRSKCMYLPADSSDSLECRESSLGQVEMSTPHL 343
           AP P+ GVCNIGNDS DANGMQQID AKRSKCMYLPADSSDSLECRESSLGQVE+STPHL
Sbjct: 242 AP-PV-GVCNIGNDSGDANGMQQIDLAKRSKCMYLPADSSDSLECRESSLGQVEVSTPHL 301

Query: 344 AAGVMPSRPDSLVEENASGSSENDSSDSETDSDSSRSDQDMDNDMAALSDSTLPSEKEPS 403
            +GVMPSRPDSLVEENASGSSENDSS SETDSDSSRSDQDMDNDMAALSDSTLPSEKEP+
Sbjct: 302 GSGVMPSRPDSLVEENASGSSENDSSGSETDSDSSRSDQDMDNDMAALSDSTLPSEKEPN 361

Query: 404 TFERMDTQEQGNMSSEEPDDSAHSGDTSHLYHHDPVSTNEAVSKWQLKGKRNVRNFSKKP 463
           TFE+ DTQE GN+SSEE DDS HSGD SHLYHHDPVSTNEAVSKWQLKGKRNVRNFSKK 
Sbjct: 362 TFEKTDTQEHGNVSSEEHDDSVHSGDMSHLYHHDPVSTNEAVSKWQLKGKRNVRNFSKKL 421

Query: 464 VGVDDEPSSHLWVHGQTRLSNRNDYFDDSMEGADALEEEYYLTSKMVPKDQYIVRNYMPD 523
            GV DEPSSHLWVHGQT  SNRNDYFDDS+EG DALEEEYYLTSKMV KDQY VRNYM D
Sbjct: 422 GGV-DEPSSHLWVHGQTTFSNRNDYFDDSIEGVDALEEEYYLTSKMVSKDQYFVRNYMHD 481

Query: 524 WEGQPALKGYWDVKNPLYGMRHHFGGRPRTILIDVDLKVHASYQKEPVPIVSLMSKLNGQ 583
           WEGQPALKGYWDVKNPLYG+ HHFGG PRTILIDVD+KVHASYQKEPVPIVSLMSKLNGQ
Sbjct: 482 WEGQPALKGYWDVKNPLYGLHHHFGGMPRTILIDVDVKVHASYQKEPVPIVSLMSKLNGQ 541

Query: 584 AIIGHPIQIETLEDGFSETILSDSLGNTPSENDGSTALQPAWRTARRTANVRIPRPHLLT 643
           AIIGHPIQIETLEDGFSETILSDSLGN  SENDG+TA QP+WRTARRTANVRIPRPHL T
Sbjct: 542 AIIGHPIQIETLEDGFSETILSDSLGNALSENDGNTAFQPSWRTARRTANVRIPRPHLPT 601

Query: 644 VFDGEEAGYDSPFADQERKSSRFKRVKTGVYNQKAGQSGGQPHIPR-PSHDRRLPKKLAK 703
           V DGEEAGYDSPF DQERK SRFKRVKTGVY QKAGQ   QPHIPR PS+DRRLPKK+AK
Sbjct: 602 VLDGEEAGYDSPFGDQERK-SRFKRVKTGVYRQKAGQGREQPHIPRGPSNDRRLPKKMAK 661

Query: 704 KVSLSSNQKTRTLSSIAVEQNFSNMPIHDSVTCHMNGSMKPESSGPPTVACIPVKLVFSR 763
           KVSLSS QKTRTLSSIAVEQNFSNMPIHDSVTC +NGS+KPESSGPPTVACIPVKLVFSR
Sbjct: 662 KVSLSSIQKTRTLSSIAVEQNFSNMPIHDSVTCQINGSIKPESSGPPTVACIPVKLVFSR 721

Query: 764 LLEKINRPPSKATNNM 778
           LLEKINRPPSKATNNM
Sbjct: 722 LLEKINRPPSKATNNM 732

BLAST of CaUC10G187120 vs. NCBI nr
Match: XP_038897415.1 (uncharacterized protein At1g51745 isoform X4 [Benincasa hispida])

HSP 1 Score: 1265.4 bits (3273), Expect = 0.0e+00
Identity = 661/742 (89.08%), Positives = 683/742 (92.05%), Query Frame = 0

Query: 44  GTGGSGAVDYAVGSIVWVRRRNGSWWPGKILGSDELSSSHLTSPRSGTPVKLLGREDASV 103
           G+ GSGAVDYAVGSIVWVRRRNGSWWPGKILGSDELSSSHLTSPRSGTPVKLLGREDASV
Sbjct: 2   GSPGSGAVDYAVGSIVWVRRRNGSWWPGKILGSDELSSSHLTSPRSGTPVKLLGREDASV 61

Query: 104 DWYNLEKSKRVKPFRCGEFDDCIERAESSQGMPIKKREKYARREDAILHALELEKELLKK 163
           DWYNLEKSKRVKPFRCGEFDDCIERAESSQGMPIKKREKYARREDAILHALELEKELL K
Sbjct: 62  DWYNLEKSKRVKPFRCGEFDDCIERAESSQGMPIKKREKYARREDAILHALELEKELLNK 121

Query: 164 QGKLNLYSDQTTIESLDATAKKGIISSEHIGADDINDGPSESYQFSKIRDVNYDNEIMEP 223
           QGKLNLY DQT I S  ATAKKGIISS+HIG  DINDG SES QFSKI DVNYDNEI +P
Sbjct: 122 QGKLNLYFDQTIIGSPGATAKKGIISSDHIGTGDINDGHSESLQFSKIIDVNYDNEI-DP 181

Query: 224 CLKASEGAQLSGEDDHSEARPRMRGLQDFGLKITPSKRKVLSSSVVSNGFEMLATNTNPL 283
           CLKA+EGAQ SGEDDHSEARPRMRGLQDFGL+IT SKRKVLSSSVVSNGFEMLAT+T+ L
Sbjct: 182 CLKANEGAQRSGEDDHSEARPRMRGLQDFGLRITSSKRKVLSSSVVSNGFEMLATDTSVL 241

Query: 284 APAPLDGVCNIGNDS-DANGMQQIDRAKRSKCMYLPADSSDSLECRESSLGQVEMSTPHL 343
           AP P+ GVCNIGNDS DANGMQQID AKRSKCMYLPADSSDSLECRESSLGQVE+STPHL
Sbjct: 242 AP-PV-GVCNIGNDSGDANGMQQIDLAKRSKCMYLPADSSDSLECRESSLGQVEVSTPHL 301

Query: 344 AAGVMPSRPDSLVEENASGSSENDSSDSETDSDSSRSDQDMDNDMAALSD------STLP 403
            +GVMPSRPDSLVEENASGSSENDSS SETDSDSSRSDQDMDNDMAALS       STLP
Sbjct: 302 GSGVMPSRPDSLVEENASGSSENDSSGSETDSDSSRSDQDMDNDMAALSGYYLYRYSTLP 361

Query: 404 SEKEPSTFERMDTQEQGNMSSEEPDDSAHSGDTSHLYHHDPVSTNEAVSKWQLKGKRNVR 463
           SEKEP+TFE+ DTQE GN+SSEE DDS HSGD SHLYHHDPVSTNEAVSKWQLKGKRNVR
Sbjct: 362 SEKEPNTFEKTDTQEHGNVSSEEHDDSVHSGDMSHLYHHDPVSTNEAVSKWQLKGKRNVR 421

Query: 464 NFSKKPVGVDDEPSSHLWVHGQTRLSNRNDYFDDSMEGADALEEEYYLTSKMVPKDQYIV 523
           NFSKK  GV DEPSSHLWVHGQT  SNRNDYFDDS+EG DALEEEYYLTSKMV KDQY V
Sbjct: 422 NFSKKLGGV-DEPSSHLWVHGQTTFSNRNDYFDDSIEGVDALEEEYYLTSKMVSKDQYFV 481

Query: 524 RNYMPDWEGQPALKGYWDVKNPLYGMRHHFGGRPRTILIDVDLKVHASYQKEPVPIVSLM 583
           RNYM DWEGQPALKGYWDVKNPLYG+ HHFGG PRTILIDVD+KVHASYQKEPVPIVSLM
Sbjct: 482 RNYMHDWEGQPALKGYWDVKNPLYGLHHHFGGMPRTILIDVDVKVHASYQKEPVPIVSLM 541

Query: 584 SKLNGQAIIGHPIQIETLEDGFSETILSDSLGNTPSENDGSTALQPAWRTARRTANVRIP 643
           SKLNGQAIIGHPIQIETLEDGFSETILSDSLGN  SENDG+TA QP+WRTARRTANVRIP
Sbjct: 542 SKLNGQAIIGHPIQIETLEDGFSETILSDSLGNALSENDGNTAFQPSWRTARRTANVRIP 601

Query: 644 RPHLLTVFDGEEAGYDSPFADQERKSSRFKRVKTGVYNQKAGQSGGQPHIPR-PSHDRRL 703
           RPHL TV DGEEAGYDSPF DQERK SRFKRVKTGVY QKAGQ   QPHIPR PS+DRRL
Sbjct: 602 RPHLPTVLDGEEAGYDSPFGDQERK-SRFKRVKTGVYRQKAGQGREQPHIPRGPSNDRRL 661

Query: 704 PKKLAKKVSLSSNQKTRTLSSIAVEQNFSNMPIHDSVTCHMNGSMKPESSGPPTVACIPV 763
           PKK+AKKVSLSS QKTRTLSSIAVEQNFSNMPIHDSVTC +NGS+KPESSGPPTVACIPV
Sbjct: 662 PKKMAKKVSLSSIQKTRTLSSIAVEQNFSNMPIHDSVTCQINGSIKPESSGPPTVACIPV 721

Query: 764 KLVFSRLLEKINRPPSKATNNM 778
           KLVFSRLLEKINRPPSKATNNM
Sbjct: 722 KLVFSRLLEKINRPPSKATNNM 738

BLAST of CaUC10G187120 vs. NCBI nr
Match: XP_008451676.1 (PREDICTED: uncharacterized protein At1g51745 isoform X2 [Cucumis melo])

HSP 1 Score: 1264.2 bits (3270), Expect = 0.0e+00
Identity = 653/735 (88.84%), Positives = 679/735 (92.38%), Query Frame = 0

Query: 44  GTGGSGAVDYAVGSIVWVRRRNGSWWPGKILGSDELSSSHLTSPRSGTPVKLLGREDASV 103
           G+ GSGAVDYAVGSIVWVRRRNGSWWPGKILGSDELSSSHLTSPRSGTPVKLLGREDASV
Sbjct: 2   GSPGSGAVDYAVGSIVWVRRRNGSWWPGKILGSDELSSSHLTSPRSGTPVKLLGREDASV 61

Query: 104 DWYNLEKSKRVKPFRCGEFDDCIERAESSQGMPIKKREKYARREDAILHALELEKELLKK 163
           DWYNLEKSKRVKPFRCGEFDDCIERAESSQGMPIKKREKYARREDAILHALELEKELL K
Sbjct: 62  DWYNLEKSKRVKPFRCGEFDDCIERAESSQGMPIKKREKYARREDAILHALELEKELLNK 121

Query: 164 QGKLNLYSDQTTIESLDATAKKGIISSEHIGADDINDGPSESYQFSKIRDVNYDNEIMEP 223
           QGKLNLYSDQ TIES  ATAK+GI+ SE+IG DD N G SES+QFSK   V+YDNEI EP
Sbjct: 122 QGKLNLYSDQMTIESPGATAKEGILFSEYIGTDDHNYGHSESHQFSKTIHVSYDNEITEP 181

Query: 224 CLKASEGAQLSGEDDHSEARPRMRGLQDFGLKITPSKRKVLSSSVVSNGFEMLATNTNPL 283
           CLKA+EGAQ SGED+HSE+RPRMRGLQDFGL+IT SKRKVLSSSVVSNGFEMLAT+TN L
Sbjct: 182 CLKANEGAQRSGEDEHSESRPRMRGLQDFGLRITSSKRKVLSSSVVSNGFEMLATDTNVL 241

Query: 284 APAPLDGVCNIGNDSDANGMQQIDRAKRSKCMYLPADSSDSLECRESSLGQVEMSTPHLA 343
            P    GVCNIGNDSDANGMQQIDRAKRSKCMYLPADSSDSLECRESSLGQVEMSTP L 
Sbjct: 242 VP---PGVCNIGNDSDANGMQQIDRAKRSKCMYLPADSSDSLECRESSLGQVEMSTPDLG 301

Query: 344 AGVMPSRPDSLVEENASGSSENDSSDSETDSDSSRSDQDMDNDMAALSDSTLPSEKEPST 403
            GVMPSRPDSL+EENASGSSENDSSD ETDSDSSRSDQDMDN+M ALSDSTLPSEKEPST
Sbjct: 302 PGVMPSRPDSLLEENASGSSENDSSDLETDSDSSRSDQDMDNEMTALSDSTLPSEKEPST 361

Query: 404 FERMDTQEQGNMSSEEPDDSAHSGDTSHLYHHDPVSTNEAVSKWQLKGKRNVRNFSKKPV 463
           FER DT+E  NMSSEEPDDS HSGD SHLYHHDPVSTNEAVSKW+LKGKRNVRNFSKK V
Sbjct: 362 FERTDTREHENMSSEEPDDSVHSGDMSHLYHHDPVSTNEAVSKWKLKGKRNVRNFSKKLV 421

Query: 464 GVDDEPSSHLWVHGQTRLSNRNDYFDDSMEGADALEEEYYLTSKMVPKDQYIVRNYMPDW 523
           GVDDEPSSHLWVH QTRL+NRNDYFDDSM+G DALEEEYYLTSKMV KDQY VRNY+PDW
Sbjct: 422 GVDDEPSSHLWVHAQTRLNNRNDYFDDSMDGVDALEEEYYLTSKMVSKDQYFVRNYLPDW 481

Query: 524 EGQPALKGYWDVKNPLYGMRHHFGGRPRTILIDVDLKVHASYQKEPVPIVSLMSKLNGQA 583
           EGQPALKGYWDVKNPLYG+ HHFGGRPRTILIDVDLKVHASYQKEPVPIVSLMSKLNGQA
Sbjct: 482 EGQPALKGYWDVKNPLYGIPHHFGGRPRTILIDVDLKVHASYQKEPVPIVSLMSKLNGQA 541

Query: 584 IIGHPIQIETLEDGFSETILSDSLGNTPSENDGSTALQPAWRTARRTANVRIPRPHLLTV 643
           IIGHPIQIETLEDGFSETILSDSLGN PSENDGSTALQPAWRTARRTANVRIPRPHL TV
Sbjct: 542 IIGHPIQIETLEDGFSETILSDSLGNAPSENDGSTALQPAWRTARRTANVRIPRPHLPTV 601

Query: 644 FDGEEAGYDSPFADQERKSSRFKRVKTGVYNQKAGQSGGQPHIPR-PSHDRRLPKKLAKK 703
            DGEEAGYDS    QERK SR K+VKTGVY  KA    GQPHIPR PS+DRRLPKK+AKK
Sbjct: 602 PDGEEAGYDS----QERK-SRLKKVKTGVYLSKA----GQPHIPRGPSNDRRLPKKMAKK 661

Query: 704 VSLSSNQKTRTLSSIAVEQNFSNMPIHDSVTCHMNGSMKPESSGPPTVACIPVKLVFSRL 763
           VSLSSNQKTRTLSSI VEQNFSNMPIHDSV+C +NGS+KPESSGPPTVACIPVKLVFSRL
Sbjct: 662 VSLSSNQKTRTLSSIDVEQNFSNMPIHDSVSCQINGSIKPESSGPPTVACIPVKLVFSRL 721

Query: 764 LEKINRPPSKATNNM 778
           LEKINRPPSKATNN+
Sbjct: 722 LEKINRPPSKATNNL 724

BLAST of CaUC10G187120 vs. NCBI nr
Match: XP_038897413.1 (uncharacterized protein At1g51745 isoform X2 [Benincasa hispida])

HSP 1 Score: 1264.2 bits (3270), Expect = 0.0e+00
Identity = 662/752 (88.03%), Positives = 684/752 (90.96%), Query Frame = 0

Query: 44  GTGGSGAVDYAVGSIVWVRRRNGSWWPGKILGSDELSSSHLTSPRSGTPVKLLGREDASV 103
           G+ GSGAVDYAVGSIVWVRRRNGSWWPGKILGSDELSSSHLTSPRSGTPVKLLGREDASV
Sbjct: 2   GSPGSGAVDYAVGSIVWVRRRNGSWWPGKILGSDELSSSHLTSPRSGTPVKLLGREDASV 61

Query: 104 ----------------DWYNLEKSKRVKPFRCGEFDDCIERAESSQGMPIKKREKYARRE 163
                           DWYNLEKSKRVKPFRCGEFDDCIERAESSQGMPIKKREKYARRE
Sbjct: 62  LNIEERWKLSNVSNNKDWYNLEKSKRVKPFRCGEFDDCIERAESSQGMPIKKREKYARRE 121

Query: 164 DAILHALELEKELLKKQGKLNLYSDQTTIESLDATAKKGIISSEHIGADDINDGPSESYQ 223
           DAILHALELEKELL KQGKLNLY DQT I S  ATAKKGIISS+HIG  DINDG SES Q
Sbjct: 122 DAILHALELEKELLNKQGKLNLYFDQTIIGSPGATAKKGIISSDHIGTGDINDGHSESLQ 181

Query: 224 FSKIRDVNYDNEIMEPCLKASEGAQLSGEDDHSEARPRMRGLQDFGLKITPSKRKVLSSS 283
           FSKI DVNYDNEI +PCLKA+EGAQ SGEDDHSEARPRMRGLQDFGL+IT SKRKVLSSS
Sbjct: 182 FSKIIDVNYDNEI-DPCLKANEGAQRSGEDDHSEARPRMRGLQDFGLRITSSKRKVLSSS 241

Query: 284 VVSNGFEMLATNTNPLAPAPLDGVCNIGNDS-DANGMQQIDRAKRSKCMYLPADSSDSLE 343
           VVSNGFEMLAT+T+ LAP P+ GVCNIGNDS DANGMQQID AKRSKCMYLPADSSDSLE
Sbjct: 242 VVSNGFEMLATDTSVLAP-PV-GVCNIGNDSGDANGMQQIDLAKRSKCMYLPADSSDSLE 301

Query: 344 CRESSLGQVEMSTPHLAAGVMPSRPDSLVEENASGSSENDSSDSETDSDSSRSDQDMDND 403
           CRESSLGQVE+STPHL +GVMPSRPDSLVEENASGSSENDSS SETDSDSSRSDQDMDND
Sbjct: 302 CRESSLGQVEVSTPHLGSGVMPSRPDSLVEENASGSSENDSSGSETDSDSSRSDQDMDND 361

Query: 404 MAALSDSTLPSEKEPSTFERMDTQEQGNMSSEEPDDSAHSGDTSHLYHHDPVSTNEAVSK 463
           MAALSDSTLPSEKEP+TFE+ DTQE GN+SSEE DDS HSGD SHLYHHDPVSTNEAVSK
Sbjct: 362 MAALSDSTLPSEKEPNTFEKTDTQEHGNVSSEEHDDSVHSGDMSHLYHHDPVSTNEAVSK 421

Query: 464 WQLKGKRNVRNFSKKPVGVDDEPSSHLWVHGQTRLSNRNDYFDDSMEGADALEEEYYLTS 523
           WQLKGKRNVRNFSKK  GV DEPSSHLWVHGQT  SNRNDYFDDS+EG DALEEEYYLTS
Sbjct: 422 WQLKGKRNVRNFSKKLGGV-DEPSSHLWVHGQTTFSNRNDYFDDSIEGVDALEEEYYLTS 481

Query: 524 KMVPKDQYIVRNYMPDWEGQPALKGYWDVKNPLYGMRHHFGGRPRTILIDVDLKVHASYQ 583
           KMV KDQY VRNYM DWEGQPALKGYWDVKNPLYG+ HHFGG PRTILIDVD+KVHASYQ
Sbjct: 482 KMVSKDQYFVRNYMHDWEGQPALKGYWDVKNPLYGLHHHFGGMPRTILIDVDVKVHASYQ 541

Query: 584 KEPVPIVSLMSKLNGQAIIGHPIQIETLEDGFSETILSDSLGNTPSENDGSTALQPAWRT 643
           KEPVPIVSLMSKLNGQAIIGHPIQIETLEDGFSETILSDSLGN  SENDG+TA QP+WRT
Sbjct: 542 KEPVPIVSLMSKLNGQAIIGHPIQIETLEDGFSETILSDSLGNALSENDGNTAFQPSWRT 601

Query: 644 ARRTANVRIPRPHLLTVFDGEEAGYDSPFADQERKSSRFKRVKTGVYNQKAGQSGGQPHI 703
           ARRTANVRIPRPHL TV DGEEAGYDSPF DQERK SRFKRVKTGVY QKAGQ   QPHI
Sbjct: 602 ARRTANVRIPRPHLPTVLDGEEAGYDSPFGDQERK-SRFKRVKTGVYRQKAGQGREQPHI 661

Query: 704 PR-PSHDRRLPKKLAKKVSLSSNQKTRTLSSIAVEQNFSNMPIHDSVTCHMNGSMKPESS 763
           PR PS+DRRLPKK+AKKVSLSS QKTRTLSSIAVEQNFSNMPIHDSVTC +NGS+KPESS
Sbjct: 662 PRGPSNDRRLPKKMAKKVSLSSIQKTRTLSSIAVEQNFSNMPIHDSVTCQINGSIKPESS 721

Query: 764 GPPTVACIPVKLVFSRLLEKINRPPSKATNNM 778
           GPPTVACIPVKLVFSRLLEKINRPPSKATNNM
Sbjct: 722 GPPTVACIPVKLVFSRLLEKINRPPSKATNNM 748

BLAST of CaUC10G187120 vs. NCBI nr
Match: XP_016901178.1 (PREDICTED: uncharacterized protein At1g51745 isoform X1 [Cucumis melo])

HSP 1 Score: 1257.7 bits (3253), Expect = 0.0e+00
Identity = 653/741 (88.12%), Positives = 679/741 (91.63%), Query Frame = 0

Query: 44  GTGGSGAVDYAVGSIVWVRRRNGSWWPGKILGSDELSSSHLTSPRSGTPVKLLGREDASV 103
           G+ GSGAVDYAVGSIVWVRRRNGSWWPGKILGSDELSSSHLTSPRSGTPVKLLGREDASV
Sbjct: 2   GSPGSGAVDYAVGSIVWVRRRNGSWWPGKILGSDELSSSHLTSPRSGTPVKLLGREDASV 61

Query: 104 ------DWYNLEKSKRVKPFRCGEFDDCIERAESSQGMPIKKREKYARREDAILHALELE 163
                 DWYNLEKSKRVKPFRCGEFDDCIERAESSQGMPIKKREKYARREDAILHALELE
Sbjct: 62  LNTEERDWYNLEKSKRVKPFRCGEFDDCIERAESSQGMPIKKREKYARREDAILHALELE 121

Query: 164 KELLKKQGKLNLYSDQTTIESLDATAKKGIISSEHIGADDINDGPSESYQFSKIRDVNYD 223
           KELL KQGKLNLYSDQ TIES  ATAK+GI+ SE+IG DD N G SES+QFSK   V+YD
Sbjct: 122 KELLNKQGKLNLYSDQMTIESPGATAKEGILFSEYIGTDDHNYGHSESHQFSKTIHVSYD 181

Query: 224 NEIMEPCLKASEGAQLSGEDDHSEARPRMRGLQDFGLKITPSKRKVLSSSVVSNGFEMLA 283
           NEI EPCLKA+EGAQ SGED+HSE+RPRMRGLQDFGL+IT SKRKVLSSSVVSNGFEMLA
Sbjct: 182 NEITEPCLKANEGAQRSGEDEHSESRPRMRGLQDFGLRITSSKRKVLSSSVVSNGFEMLA 241

Query: 284 TNTNPLAPAPLDGVCNIGNDSDANGMQQIDRAKRSKCMYLPADSSDSLECRESSLGQVEM 343
           T+TN L P    GVCNIGNDSDANGMQQIDRAKRSKCMYLPADSSDSLECRESSLGQVEM
Sbjct: 242 TDTNVLVP---PGVCNIGNDSDANGMQQIDRAKRSKCMYLPADSSDSLECRESSLGQVEM 301

Query: 344 STPHLAAGVMPSRPDSLVEENASGSSENDSSDSETDSDSSRSDQDMDNDMAALSDSTLPS 403
           STP L  GVMPSRPDSL+EENASGSSENDSSD ETDSDSSRSDQDMDN+M ALSDSTLPS
Sbjct: 302 STPDLGPGVMPSRPDSLLEENASGSSENDSSDLETDSDSSRSDQDMDNEMTALSDSTLPS 361

Query: 404 EKEPSTFERMDTQEQGNMSSEEPDDSAHSGDTSHLYHHDPVSTNEAVSKWQLKGKRNVRN 463
           EKEPSTFER DT+E  NMSSEEPDDS HSGD SHLYHHDPVSTNEAVSKW+LKGKRNVRN
Sbjct: 362 EKEPSTFERTDTREHENMSSEEPDDSVHSGDMSHLYHHDPVSTNEAVSKWKLKGKRNVRN 421

Query: 464 FSKKPVGVDDEPSSHLWVHGQTRLSNRNDYFDDSMEGADALEEEYYLTSKMVPKDQYIVR 523
           FSKK VGVDDEPSSHLWVH QTRL+NRNDYFDDSM+G DALEEEYYLTSKMV KDQY VR
Sbjct: 422 FSKKLVGVDDEPSSHLWVHAQTRLNNRNDYFDDSMDGVDALEEEYYLTSKMVSKDQYFVR 481

Query: 524 NYMPDWEGQPALKGYWDVKNPLYGMRHHFGGRPRTILIDVDLKVHASYQKEPVPIVSLMS 583
           NY+PDWEGQPALKGYWDVKNPLYG+ HHFGGRPRTILIDVDLKVHASYQKEPVPIVSLMS
Sbjct: 482 NYLPDWEGQPALKGYWDVKNPLYGIPHHFGGRPRTILIDVDLKVHASYQKEPVPIVSLMS 541

Query: 584 KLNGQAIIGHPIQIETLEDGFSETILSDSLGNTPSENDGSTALQPAWRTARRTANVRIPR 643
           KLNGQAIIGHPIQIETLEDGFSETILSDSLGN PSENDGSTALQPAWRTARRTANVRIPR
Sbjct: 542 KLNGQAIIGHPIQIETLEDGFSETILSDSLGNAPSENDGSTALQPAWRTARRTANVRIPR 601

Query: 644 PHLLTVFDGEEAGYDSPFADQERKSSRFKRVKTGVYNQKAGQSGGQPHIPR-PSHDRRLP 703
           PHL TV DGEEAGYDS    QERK SR K+VKTGVY  KA    GQPHIPR PS+DRRLP
Sbjct: 602 PHLPTVPDGEEAGYDS----QERK-SRLKKVKTGVYLSKA----GQPHIPRGPSNDRRLP 661

Query: 704 KKLAKKVSLSSNQKTRTLSSIAVEQNFSNMPIHDSVTCHMNGSMKPESSGPPTVACIPVK 763
           KK+AKKVSLSSNQKTRTLSSI VEQNFSNMPIHDSV+C +NGS+KPESSGPPTVACIPVK
Sbjct: 662 KKMAKKVSLSSNQKTRTLSSIDVEQNFSNMPIHDSVSCQINGSIKPESSGPPTVACIPVK 721

Query: 764 LVFSRLLEKINRPPSKATNNM 778
           LVFSRLLEKINRPPSKATNN+
Sbjct: 722 LVFSRLLEKINRPPSKATNNL 730

BLAST of CaUC10G187120 vs. ExPASy Swiss-Prot
Match: P59278 (Uncharacterized protein At1g51745 OS=Arabidopsis thaliana OX=3702 GN=At1g51745 PE=2 SV=2)

HSP 1 Score: 163.7 bits (413), Expect = 1.3e-38
Identity = 173/599 (28.88%), Positives = 251/599 (41.90%), Query Frame = 0

Query: 50  AVDYAVGSIVWVRRRNGSWWPGKILGSDELSSSHLTSPRSGTPVKLLGREDASVDWYNLE 109
           A++ +VG +VWVRRRNGSWWPG+ L  D++  + L  P+ GTP+KLLGR+D SVDWY LE
Sbjct: 11  AINASVGRLVWVRRRNGSWWPGQTLVHDQVPDNSLVGPKVGTPIKLLGRDDVSVDWYILE 70

Query: 110 KSKRVKPFRCGEFDDCIERAESSQGMPIKKREKYARREDAILHALELEKELLKKQ--GKL 169
            SK VK FRCGE+D CIE+A++S     K+  K   REDAI +AL++E E L K+     
Sbjct: 71  NSKTVKAFRCGEYDTCIEKAKASSSK--KRSGKCTLREDAINNALKIENEHLAKEDDNLC 130

Query: 170 NLYSDQTTIESLDATAKK--GIISSEHIGADDINDGPSESYQFSKIRDVNYDNEIMEPCL 229
           NL  ++ +   L     +  G   +E    D++   P +    S I     +N       
Sbjct: 131 NLSGEEDSKRCLSGKEDEDSGSSDAEETEDDELASAPEQLQ--SSISSQEMNNVGASKVQ 190

Query: 230 KASEGAQLSGEDDHSEARPRMRGLQDFGLK----ITPSKR------KVLSSSVVSNGFEM 289
                     EDD +E   RMRGL+D G +    I   K+       V  S  VSNG   
Sbjct: 191 SKRRRTPNDSEDDGTEGVKRMRGLEDIGKEQAGGIVEHKQDLDLICAVGLSDSVSNG-NT 250

Query: 290 LATNTNPLAPAPLDGVCNIGNDSD-ANGMQQIDRAKRSKCMY---LPADSSDSLECR--- 349
           +A      +P+ L    N+   S   N  +Q+ +   S  M    +  D   SL+C+   
Sbjct: 251 IANGNKVCSPSSLKR--NVSECSKRKNRRRQLTKVLESTAMVSVPVTCDQGVSLDCQGIY 310

Query: 350 -ESSLGQVEMSTPHLAAGVMPSRPDSL------VEENASGSSENDSSDSETDSDSSRSDQ 409
                G   + +    + V+ +  DS         EN  G+S N+ +     S  S S +
Sbjct: 311 DSKVSGMESVESMKSVSVVINNNSDSTGVSCEDAYENVVGASHNNKAKDSEISSISVSAE 370

Query: 410 DMDNDMAALSDSTLPSEKEPSTFERMDTQEQGNMSSEEPDDSAHSGDTSH-LYHHDPVST 469
           D  +D   L D  L  E+  S       +      +   D +   G  SH ++  +  S 
Sbjct: 371 DDSSD--RLFDVPLTGEENHSEGFPAACRISSPRKALVTDLTRRCGRNSHNVFVKNEASN 430

Query: 470 NEA--------------------VSKWQLKGKRNVRNFSKKPVGVDDEPSSHLWVHGQTR 529
             A                     SKWQLKGKRN R  SKK                  +
Sbjct: 431 GSACTSPPASEPVNCILSGIEKNTSKWQLKGKRNSRQMSKK------------------Q 490

Query: 530 LSNRNDYFDDSMEGADALEEEYYLTSKMVPKDQYIVRNYMPDWEGQPALKGYWDVKNPLY 589
              RN Y +++   +                                             
Sbjct: 491 EERRNVYGEEANNNSST------------------------------------------- 530

Query: 590 GMRHHFGGRPRTILIDVDLKVHASYQKEPVPIVSLMSKLNGQAIIGHPIQIETLEDGFS 600
                    P + L +V ++V ASY K  VP+VS MS+L+G+AI+GHP+ +E LE+ +S
Sbjct: 551 ---------PHSTLYEVKIEVKASYTKPRVPLVSRMSELSGKAIVGHPLSVEILEEDYS 530

BLAST of CaUC10G187120 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 77.4 bits (189), Expect = 1.3e-12
Identity = 43/121 (35.54%), Positives = 63/121 (52.07%), Query Frame = 0

Query: 1039 WSSASSPLTVLFSRQGTKSEYRVLANATSKLLWLRWLLANMGVPQQSATKLHCDNHSAIQ 1098
            WSS      V   R  T++EYR +AN +S++ W+  LL  +G+       ++CDN  A  
Sbjct: 1341 WSSKKQKGVV---RSSTEAEYRSVANTSSEMQWICSLLTELGIRLTRPPVIYCDNVGATY 1400

Query: 1099 ISHNDVFHERTKHIKNDCHFVCHHLLSNTLLLQSISTTEQPADIFTKTFSSNQFNQLLTK 1158
            +  N VFH R KHI  D HF+ + + S  L +  +ST +Q AD  TK  S   F    +K
Sbjct: 1401 LCANPVFHSRMKHIAIDYHFIRNQVQSGALRVVHVSTHDQLADTLTKPLSRTAFQNFASK 1458

Query: 1159 L 1160
            +
Sbjct: 1461 I 1458

BLAST of CaUC10G187120 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 76.6 bits (187), Expect = 2.1e-12
Identity = 44/121 (36.36%), Positives = 62/121 (51.24%), Query Frame = 0

Query: 1039 WSSASSPLTVLFSRQGTKSEYRVLANATSKLLWLRWLLANMGVPQQSATKLHCDNHSAIQ 1098
            WSS      V   R  T++EYR +AN +S+L W+  LL  +G+       ++CDN  A  
Sbjct: 1324 WSSKKQKGVV---RSSTEAEYRSVANTSSELQWICSLLTELGIQLSHPPVIYCDNVGATY 1383

Query: 1099 ISHNDVFHERTKHIKNDCHFVCHHLLSNTLLLQSISTTEQPADIFTKTFSSNQFNQLLTK 1158
            +  N VFH R KHI  D HF+ + + S  L +  +ST +Q AD  TK  S   F     K
Sbjct: 1384 LCANPVFHSRMKHIALDYHFIRNQVQSGALRVVHVSTHDQLADTLTKPLSRVAFQNFSRK 1441

Query: 1159 L 1160
            +
Sbjct: 1444 I 1441

BLAST of CaUC10G187120 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 72.4 bits (176), Expect = 4.0e-11
Identity = 38/105 (36.19%), Positives = 60/105 (57.14%), Query Frame = 0

Query: 1055 TKSEYRVLANATSKLLWLRWLLANMGVPQQSATKLHCDNHSAIQISHNDVFHERTKHIKN 1114
            T++EY  L  A  + LWL++LL ++ +  ++  K++ DN   I I++N   H+R KHI  
Sbjct: 1294 TEAEYMALFEAVREALWLKFLLTSINIKLENPIKIYEDNQGCISIANNPSCHKRAKHIDI 1353

Query: 1115 DCHFVCHHLLSNTLLLQSISTTEQPADIFTKTFSSNQFNQLLTKL 1160
              HF    + +N + L+ I T  Q ADIFTK   + +F +L  KL
Sbjct: 1354 KYHFAREQVQNNVICLEYIPTENQLADIFTKPLPAARFVELRDKL 1398

BLAST of CaUC10G187120 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 66.6 bits (161), Expect = 2.2e-09
Identity = 34/98 (34.69%), Positives = 56/98 (57.14%), Query Frame = 0

Query: 1055 TKSEYRVLANATSKLLWLRWLLANMGVPQQSATKLHCDNHSAIQISHNDVFHERTKHIKN 1114
            T++EY        +++WL+  L  +G+ Q+    ++CD+ SAI +S N ++H RTKHI  
Sbjct: 1220 TEAEYIAATETGKEMIWLKRFLQELGLHQKEYV-VYCDSQSAIDLSKNSMYHARTKHIDV 1279

Query: 1115 DCHFVCHHLLSNTLLLQSISTTEQPADIFTKTFSSNQF 1153
              H++   +   +L +  IST E PAD+ TK    N+F
Sbjct: 1280 RYHWIREMVDDESLKVLKISTNENPADMLTKVVPRNKF 1316

BLAST of CaUC10G187120 vs. ExPASy TrEMBL
Match: A0A1S3BSV5 (uncharacterized protein At1g51745 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103492907 PE=4 SV=1)

HSP 1 Score: 1264.2 bits (3270), Expect = 0.0e+00
Identity = 653/735 (88.84%), Positives = 679/735 (92.38%), Query Frame = 0

Query: 44  GTGGSGAVDYAVGSIVWVRRRNGSWWPGKILGSDELSSSHLTSPRSGTPVKLLGREDASV 103
           G+ GSGAVDYAVGSIVWVRRRNGSWWPGKILGSDELSSSHLTSPRSGTPVKLLGREDASV
Sbjct: 2   GSPGSGAVDYAVGSIVWVRRRNGSWWPGKILGSDELSSSHLTSPRSGTPVKLLGREDASV 61

Query: 104 DWYNLEKSKRVKPFRCGEFDDCIERAESSQGMPIKKREKYARREDAILHALELEKELLKK 163
           DWYNLEKSKRVKPFRCGEFDDCIERAESSQGMPIKKREKYARREDAILHALELEKELL K
Sbjct: 62  DWYNLEKSKRVKPFRCGEFDDCIERAESSQGMPIKKREKYARREDAILHALELEKELLNK 121

Query: 164 QGKLNLYSDQTTIESLDATAKKGIISSEHIGADDINDGPSESYQFSKIRDVNYDNEIMEP 223
           QGKLNLYSDQ TIES  ATAK+GI+ SE+IG DD N G SES+QFSK   V+YDNEI EP
Sbjct: 122 QGKLNLYSDQMTIESPGATAKEGILFSEYIGTDDHNYGHSESHQFSKTIHVSYDNEITEP 181

Query: 224 CLKASEGAQLSGEDDHSEARPRMRGLQDFGLKITPSKRKVLSSSVVSNGFEMLATNTNPL 283
           CLKA+EGAQ SGED+HSE+RPRMRGLQDFGL+IT SKRKVLSSSVVSNGFEMLAT+TN L
Sbjct: 182 CLKANEGAQRSGEDEHSESRPRMRGLQDFGLRITSSKRKVLSSSVVSNGFEMLATDTNVL 241

Query: 284 APAPLDGVCNIGNDSDANGMQQIDRAKRSKCMYLPADSSDSLECRESSLGQVEMSTPHLA 343
            P    GVCNIGNDSDANGMQQIDRAKRSKCMYLPADSSDSLECRESSLGQVEMSTP L 
Sbjct: 242 VP---PGVCNIGNDSDANGMQQIDRAKRSKCMYLPADSSDSLECRESSLGQVEMSTPDLG 301

Query: 344 AGVMPSRPDSLVEENASGSSENDSSDSETDSDSSRSDQDMDNDMAALSDSTLPSEKEPST 403
            GVMPSRPDSL+EENASGSSENDSSD ETDSDSSRSDQDMDN+M ALSDSTLPSEKEPST
Sbjct: 302 PGVMPSRPDSLLEENASGSSENDSSDLETDSDSSRSDQDMDNEMTALSDSTLPSEKEPST 361

Query: 404 FERMDTQEQGNMSSEEPDDSAHSGDTSHLYHHDPVSTNEAVSKWQLKGKRNVRNFSKKPV 463
           FER DT+E  NMSSEEPDDS HSGD SHLYHHDPVSTNEAVSKW+LKGKRNVRNFSKK V
Sbjct: 362 FERTDTREHENMSSEEPDDSVHSGDMSHLYHHDPVSTNEAVSKWKLKGKRNVRNFSKKLV 421

Query: 464 GVDDEPSSHLWVHGQTRLSNRNDYFDDSMEGADALEEEYYLTSKMVPKDQYIVRNYMPDW 523
           GVDDEPSSHLWVH QTRL+NRNDYFDDSM+G DALEEEYYLTSKMV KDQY VRNY+PDW
Sbjct: 422 GVDDEPSSHLWVHAQTRLNNRNDYFDDSMDGVDALEEEYYLTSKMVSKDQYFVRNYLPDW 481

Query: 524 EGQPALKGYWDVKNPLYGMRHHFGGRPRTILIDVDLKVHASYQKEPVPIVSLMSKLNGQA 583
           EGQPALKGYWDVKNPLYG+ HHFGGRPRTILIDVDLKVHASYQKEPVPIVSLMSKLNGQA
Sbjct: 482 EGQPALKGYWDVKNPLYGIPHHFGGRPRTILIDVDLKVHASYQKEPVPIVSLMSKLNGQA 541

Query: 584 IIGHPIQIETLEDGFSETILSDSLGNTPSENDGSTALQPAWRTARRTANVRIPRPHLLTV 643
           IIGHPIQIETLEDGFSETILSDSLGN PSENDGSTALQPAWRTARRTANVRIPRPHL TV
Sbjct: 542 IIGHPIQIETLEDGFSETILSDSLGNAPSENDGSTALQPAWRTARRTANVRIPRPHLPTV 601

Query: 644 FDGEEAGYDSPFADQERKSSRFKRVKTGVYNQKAGQSGGQPHIPR-PSHDRRLPKKLAKK 703
            DGEEAGYDS    QERK SR K+VKTGVY  KA    GQPHIPR PS+DRRLPKK+AKK
Sbjct: 602 PDGEEAGYDS----QERK-SRLKKVKTGVYLSKA----GQPHIPRGPSNDRRLPKKMAKK 661

Query: 704 VSLSSNQKTRTLSSIAVEQNFSNMPIHDSVTCHMNGSMKPESSGPPTVACIPVKLVFSRL 763
           VSLSSNQKTRTLSSI VEQNFSNMPIHDSV+C +NGS+KPESSGPPTVACIPVKLVFSRL
Sbjct: 662 VSLSSNQKTRTLSSIDVEQNFSNMPIHDSVSCQINGSIKPESSGPPTVACIPVKLVFSRL 721

Query: 764 LEKINRPPSKATNNM 778
           LEKINRPPSKATNN+
Sbjct: 722 LEKINRPPSKATNNL 724

BLAST of CaUC10G187120 vs. ExPASy TrEMBL
Match: A0A1S4DYX8 (uncharacterized protein At1g51745 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103492907 PE=4 SV=1)

HSP 1 Score: 1257.7 bits (3253), Expect = 0.0e+00
Identity = 653/741 (88.12%), Positives = 679/741 (91.63%), Query Frame = 0

Query: 44  GTGGSGAVDYAVGSIVWVRRRNGSWWPGKILGSDELSSSHLTSPRSGTPVKLLGREDASV 103
           G+ GSGAVDYAVGSIVWVRRRNGSWWPGKILGSDELSSSHLTSPRSGTPVKLLGREDASV
Sbjct: 2   GSPGSGAVDYAVGSIVWVRRRNGSWWPGKILGSDELSSSHLTSPRSGTPVKLLGREDASV 61

Query: 104 ------DWYNLEKSKRVKPFRCGEFDDCIERAESSQGMPIKKREKYARREDAILHALELE 163
                 DWYNLEKSKRVKPFRCGEFDDCIERAESSQGMPIKKREKYARREDAILHALELE
Sbjct: 62  LNTEERDWYNLEKSKRVKPFRCGEFDDCIERAESSQGMPIKKREKYARREDAILHALELE 121

Query: 164 KELLKKQGKLNLYSDQTTIESLDATAKKGIISSEHIGADDINDGPSESYQFSKIRDVNYD 223
           KELL KQGKLNLYSDQ TIES  ATAK+GI+ SE+IG DD N G SES+QFSK   V+YD
Sbjct: 122 KELLNKQGKLNLYSDQMTIESPGATAKEGILFSEYIGTDDHNYGHSESHQFSKTIHVSYD 181

Query: 224 NEIMEPCLKASEGAQLSGEDDHSEARPRMRGLQDFGLKITPSKRKVLSSSVVSNGFEMLA 283
           NEI EPCLKA+EGAQ SGED+HSE+RPRMRGLQDFGL+IT SKRKVLSSSVVSNGFEMLA
Sbjct: 182 NEITEPCLKANEGAQRSGEDEHSESRPRMRGLQDFGLRITSSKRKVLSSSVVSNGFEMLA 241

Query: 284 TNTNPLAPAPLDGVCNIGNDSDANGMQQIDRAKRSKCMYLPADSSDSLECRESSLGQVEM 343
           T+TN L P    GVCNIGNDSDANGMQQIDRAKRSKCMYLPADSSDSLECRESSLGQVEM
Sbjct: 242 TDTNVLVP---PGVCNIGNDSDANGMQQIDRAKRSKCMYLPADSSDSLECRESSLGQVEM 301

Query: 344 STPHLAAGVMPSRPDSLVEENASGSSENDSSDSETDSDSSRSDQDMDNDMAALSDSTLPS 403
           STP L  GVMPSRPDSL+EENASGSSENDSSD ETDSDSSRSDQDMDN+M ALSDSTLPS
Sbjct: 302 STPDLGPGVMPSRPDSLLEENASGSSENDSSDLETDSDSSRSDQDMDNEMTALSDSTLPS 361

Query: 404 EKEPSTFERMDTQEQGNMSSEEPDDSAHSGDTSHLYHHDPVSTNEAVSKWQLKGKRNVRN 463
           EKEPSTFER DT+E  NMSSEEPDDS HSGD SHLYHHDPVSTNEAVSKW+LKGKRNVRN
Sbjct: 362 EKEPSTFERTDTREHENMSSEEPDDSVHSGDMSHLYHHDPVSTNEAVSKWKLKGKRNVRN 421

Query: 464 FSKKPVGVDDEPSSHLWVHGQTRLSNRNDYFDDSMEGADALEEEYYLTSKMVPKDQYIVR 523
           FSKK VGVDDEPSSHLWVH QTRL+NRNDYFDDSM+G DALEEEYYLTSKMV KDQY VR
Sbjct: 422 FSKKLVGVDDEPSSHLWVHAQTRLNNRNDYFDDSMDGVDALEEEYYLTSKMVSKDQYFVR 481

Query: 524 NYMPDWEGQPALKGYWDVKNPLYGMRHHFGGRPRTILIDVDLKVHASYQKEPVPIVSLMS 583
           NY+PDWEGQPALKGYWDVKNPLYG+ HHFGGRPRTILIDVDLKVHASYQKEPVPIVSLMS
Sbjct: 482 NYLPDWEGQPALKGYWDVKNPLYGIPHHFGGRPRTILIDVDLKVHASYQKEPVPIVSLMS 541

Query: 584 KLNGQAIIGHPIQIETLEDGFSETILSDSLGNTPSENDGSTALQPAWRTARRTANVRIPR 643
           KLNGQAIIGHPIQIETLEDGFSETILSDSLGN PSENDGSTALQPAWRTARRTANVRIPR
Sbjct: 542 KLNGQAIIGHPIQIETLEDGFSETILSDSLGNAPSENDGSTALQPAWRTARRTANVRIPR 601

Query: 644 PHLLTVFDGEEAGYDSPFADQERKSSRFKRVKTGVYNQKAGQSGGQPHIPR-PSHDRRLP 703
           PHL TV DGEEAGYDS    QERK SR K+VKTGVY  KA    GQPHIPR PS+DRRLP
Sbjct: 602 PHLPTVPDGEEAGYDS----QERK-SRLKKVKTGVYLSKA----GQPHIPRGPSNDRRLP 661

Query: 704 KKLAKKVSLSSNQKTRTLSSIAVEQNFSNMPIHDSVTCHMNGSMKPESSGPPTVACIPVK 763
           KK+AKKVSLSSNQKTRTLSSI VEQNFSNMPIHDSV+C +NGS+KPESSGPPTVACIPVK
Sbjct: 662 KKMAKKVSLSSNQKTRTLSSIDVEQNFSNMPIHDSVSCQINGSIKPESSGPPTVACIPVK 721

Query: 764 LVFSRLLEKINRPPSKATNNM 778
           LVFSRLLEKINRPPSKATNN+
Sbjct: 722 LVFSRLLEKINRPPSKATNNL 730

BLAST of CaUC10G187120 vs. ExPASy TrEMBL
Match: A0A6J1E6S8 (uncharacterized protein At1g51745-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111430485 PE=4 SV=1)

HSP 1 Score: 1240.7 bits (3209), Expect = 0.0e+00
Identity = 644/740 (87.03%), Positives = 669/740 (90.41%), Query Frame = 0

Query: 44  GTGGSGAVDYAVGSIVWVRRRNGSWWPGKILGSDELSSSHLTSPRSGTPVKLLGREDASV 103
           G+ GSGAVDYAVGSIVWVRRRNGSWWPGKILGSDELSSSHLTSPRSGTPVKLLGREDASV
Sbjct: 2   GSPGSGAVDYAVGSIVWVRRRNGSWWPGKILGSDELSSSHLTSPRSGTPVKLLGREDASV 61

Query: 104 DWYNLEKSKRVKPFRCGEFDDCIERAESSQGMPIKKREKYARREDAILHALELEKELLKK 163
           DWYNLEKSKRVKPFRCGEFDDCIERAESSQGMPIKKREKYARREDAILHALELEKELLKK
Sbjct: 62  DWYNLEKSKRVKPFRCGEFDDCIERAESSQGMPIKKREKYARREDAILHALELEKELLKK 121

Query: 164 QGKLNLYSDQTTIESLDATAKKGIISSEHIGADDINDGPSESYQFSKIRDVNYDNEIMEP 223
           QGKLNL SDQ TIES   TAKK I+SSEHIG DD+NDG SES+QFSKI DVNYD++I +P
Sbjct: 122 QGKLNLCSDQMTIESSGVTAKKEILSSEHIGTDDMNDGHSESHQFSKIVDVNYDSKITDP 181

Query: 224 CLKASEGAQLSGEDDHSEARPRMRGLQDFGLKITPSKRKVLSSSVVSNGFEMLATNTNPL 283
           C K SEGAQLSGEDDHSEARPRMRGLQDFGL+ITPSKRKV SSSVVSNG EMLAT+TN L
Sbjct: 182 CHKTSEGAQLSGEDDHSEARPRMRGLQDFGLRITPSKRKVPSSSVVSNGSEMLATDTNAL 241

Query: 284 APAPLDGVCNIGNDSDANGMQQIDRAKRSKCMYLPADSSDSLECRESSLGQVEMSTPHLA 343
           AP   DGVC+IGNDSDANGMQQIDR KRSKCMYLPADSSDSLE RE SLGQVEMSTPH  
Sbjct: 242 APR--DGVCSIGNDSDANGMQQIDRVKRSKCMYLPADSSDSLEYREPSLGQVEMSTPHSG 301

Query: 344 AGVMPSRPDSLVEENASGSSENDSSDSETDSDSSRSDQDMDNDMAALSDSTLPSEKEPST 403
             VMPSRPDSLVEENASGS ENDSSDSETDSDSSRSDQD+DND AALSDSTLPSEKEPST
Sbjct: 302 TRVMPSRPDSLVEENASGSYENDSSDSETDSDSSRSDQDVDNDTAALSDSTLPSEKEPST 361

Query: 404 FERMDTQEQGNMSSEEPDDSAHSGDTSHLYHHDPVSTNEAVSKWQLKGKRNVRNFSKKPV 463
           FER D QE  NMSSEEPDDS HSGD SHLYHH+PVSTNEAVSKWQLKGKRNVRN SK+PV
Sbjct: 362 FERTDAQEHVNMSSEEPDDSVHSGDMSHLYHHEPVSTNEAVSKWQLKGKRNVRNLSKRPV 421

Query: 464 GVDDEPSSHLWVHGQTRLSNRNDYFDDSMEG-ADALEEEYYLTSKMVPKDQYIVRNYMPD 523
           GVDDEPSSHLWVHG+ RL+N+N YFDDSMEG ADALEEEYYL SK V KDQY+ RNYMPD
Sbjct: 422 GVDDEPSSHLWVHGKARLNNKNYYFDDSMEGDADALEEEYYLASKRVSKDQYLARNYMPD 481

Query: 524 WEGQPALKGYWDVKNPLYGMRHHFGGRPRTILIDVDLKVHASYQKEPVPIVSLMSKLNGQ 583
           WEGQPALKGYWDVKNPLYG+RHHFGGR RTILIDVDLKVHASYQKEPVPIVSLMSKLNGQ
Sbjct: 482 WEGQPALKGYWDVKNPLYGIRHHFGGRTRTILIDVDLKVHASYQKEPVPIVSLMSKLNGQ 541

Query: 584 AIIGHPIQIETLEDGFSETILSDSLGNTPSENDGSTALQPAWRTARRTANVRIPRPHLLT 643
           AIIGHPIQIETLEDGFSET+LSD LGN PSENDGSTALQPAWRTARRTANVRIPRPHL T
Sbjct: 542 AIIGHPIQIETLEDGFSETLLSDGLGNGPSENDGSTALQPAWRTARRTANVRIPRPHLPT 601

Query: 644 VFDGEEAGYDSPFADQERKSSRFKRVKTGVYNQK--AGQSGGQPHIPR---PSHDRRLPK 703
           V DGEEAGYDSPF DQERK +R KRVKTGV + K  AGQ  GQP IPR    SH+RRLP+
Sbjct: 602 VLDGEEAGYDSPFGDQERK-TRCKRVKTGVNSHKTGAGQGRGQPQIPRASSSSHERRLPR 661

Query: 704 KLAKKVSLSSNQ---KTRTLSSIAVEQNFSNMPIHDSVTCHMNGSMKPESSGPPTVACIP 763
           K+ KKVS+SSN    KTRTLSSI VEQN SNM IHDSVTC MNG MKPESSGPPTVACIP
Sbjct: 662 KMVKKVSISSNNQKTKTRTLSSIGVEQNHSNMAIHDSVTCQMNGLMKPESSGPPTVACIP 721

Query: 764 VKLVFSRLLEKINRPPSKAT 775
           VKLVFSRLLEKINRPPSKAT
Sbjct: 722 VKLVFSRLLEKINRPPSKAT 738

BLAST of CaUC10G187120 vs. ExPASy TrEMBL
Match: A0A6J1J1L6 (uncharacterized protein At1g51745-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111482596 PE=4 SV=1)

HSP 1 Score: 1240.3 bits (3208), Expect = 0.0e+00
Identity = 644/739 (87.14%), Positives = 669/739 (90.53%), Query Frame = 0

Query: 44  GTGGSGAVDYAVGSIVWVRRRNGSWWPGKILGSDELSSSHLTSPRSGTPVKLLGREDASV 103
           G+ GSGAVDYAVGSIVWVRRRNGSWWPGKILGSDELSSSHLTSPRSGTPVKLLGREDASV
Sbjct: 2   GSPGSGAVDYAVGSIVWVRRRNGSWWPGKILGSDELSSSHLTSPRSGTPVKLLGREDASV 61

Query: 104 DWYNLEKSKRVKPFRCGEFDDCIERAESSQGMPIKKREKYARREDAILHALELEKELLKK 163
           DWYNLEKSKRVKPFRCGEFDDCIERAESSQGMPIKKREKYARREDAILHALELEKELLKK
Sbjct: 62  DWYNLEKSKRVKPFRCGEFDDCIERAESSQGMPIKKREKYARREDAILHALELEKELLKK 121

Query: 164 QGKLNLYSDQTTIESLDATAKKGIISSEHIGADDINDGPSESYQFSKIRDVNYDNEIMEP 223
           QGKLNL SDQ TIES   TAKK I+SSEHIG DD+NDG SES+QFSKI DVNYD++IM+P
Sbjct: 122 QGKLNLCSDQMTIESSGVTAKKEILSSEHIGTDDMNDGHSESHQFSKIVDVNYDSKIMDP 181

Query: 224 CLKASEGAQLSGEDDHSEARPRMRGLQDFGLKITPSKRKVLSSSVVSNGFEMLATNTNPL 283
           C KASEGAQLSGEDDHSEARPRMRGLQDFGL+ITPSKRKV SSSVVSNG EMLAT+TN L
Sbjct: 182 CHKASEGAQLSGEDDHSEARPRMRGLQDFGLRITPSKRKVPSSSVVSNGSEMLATDTNAL 241

Query: 284 APAPLDGVCNIGNDSDANGMQQIDRAKRSKCMYLPADSSDSLECRESSLGQVEMSTPHLA 343
           AP   DGVC+IGNDSDANGMQQIDR KRSKCMYLPADS DSLE  E SLGQVE STPH  
Sbjct: 242 APR--DGVCSIGNDSDANGMQQIDRVKRSKCMYLPADSCDSLEYIEPSLGQVETSTPHSG 301

Query: 344 AGVMPSRPDSLVEENASGSSENDSSDSETDSDSSRSDQDMDNDMAALSDSTLPSEKEPST 403
             VMPSRPDSLVEENASGS ENDSSDSETDSDSSRSDQD+DND AALSDSTLPSEKEPST
Sbjct: 302 TRVMPSRPDSLVEENASGSYENDSSDSETDSDSSRSDQDVDNDTAALSDSTLPSEKEPST 361

Query: 404 FERMDTQEQGNMSSEEPDDSAHSGDTSHLYHHDPVSTNEAVSKWQLKGKRNVRNFSKKPV 463
           FER D QE  NMSSEEPDDS HSGD SHLYHH+PVSTNEAVSKWQLKGKRNVRN SK+PV
Sbjct: 362 FERTDAQEHVNMSSEEPDDSVHSGDMSHLYHHEPVSTNEAVSKWQLKGKRNVRNLSKRPV 421

Query: 464 GVDDEPSSHLWVHGQTRLSNRNDYFDDSMEG-ADALEEEYYLTSKMVPKDQYIVRNYMPD 523
           GVDDEPSSHLWVHG+ RL+N+N YFDDSMEG ADALEEEYYL SK V KDQY+ RNYMPD
Sbjct: 422 GVDDEPSSHLWVHGKPRLNNKNYYFDDSMEGDADALEEEYYLASKRVSKDQYLARNYMPD 481

Query: 524 WEGQPALKGYWDVKNPLYGMRHHFGGRPRTILIDVDLKVHASYQKEPVPIVSLMSKLNGQ 583
           WEGQPALKGYWDVKNPLYG+RHHFGGR RTILIDVDLKVHASYQKEPVPIVSLMSKLNGQ
Sbjct: 482 WEGQPALKGYWDVKNPLYGIRHHFGGRTRTILIDVDLKVHASYQKEPVPIVSLMSKLNGQ 541

Query: 584 AIIGHPIQIETLEDGFSETILSDSLGNTPSENDGSTALQPAWRTARRTANVRIPRPHLLT 643
           AIIGHPIQIETLEDGFSET+LSD LGN PSENDGSTALQPAWRTARRTANVRIPRPHL T
Sbjct: 542 AIIGHPIQIETLEDGFSETLLSDGLGNGPSENDGSTALQPAWRTARRTANVRIPRPHLPT 601

Query: 644 VFDGEEAGYDSPFADQERKSSRFKRVKTGVYNQK--AGQSGGQPHIPR--PSHDRRLPKK 703
           V DGEEAGYDSPF DQERK +R KRVKTGV + K  AGQ  GQPHIPR   SH+RRLP+K
Sbjct: 602 VLDGEEAGYDSPFGDQERK-TRCKRVKTGVNSHKAGAGQGRGQPHIPRASSSHERRLPRK 661

Query: 704 LAKKVSLSSNQ---KTRTLSSIAVEQNFSNMPIHDSVTCHMNGSMKPESSGPPTVACIPV 763
           + KKVS+SSN    KTRTLSSI VEQN SNM IHDSVTC MNG MKPESSGPPTVACIPV
Sbjct: 662 MVKKVSISSNNQKTKTRTLSSIGVEQNHSNMAIHDSVTCQMNGLMKPESSGPPTVACIPV 721

Query: 764 KLVFSRLLEKINRPPSKAT 775
           KLVFSRLLEKINRPPSKAT
Sbjct: 722 KLVFSRLLEKINRPPSKAT 737

BLAST of CaUC10G187120 vs. ExPASy TrEMBL
Match: A0A6J1E3F4 (uncharacterized protein At1g51745-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111430485 PE=4 SV=1)

HSP 1 Score: 1230.3 bits (3182), Expect = 0.0e+00
Identity = 644/756 (85.19%), Positives = 669/756 (88.49%), Query Frame = 0

Query: 44  GTGGSGAVDYAVGSIVWVRRRNGSWWPGKILGSDELSSSHLTSPRSGTPVKLLGREDASV 103
           G+ GSGAVDYAVGSIVWVRRRNGSWWPGKILGSDELSSSHLTSPRSGTPVKLLGREDASV
Sbjct: 2   GSPGSGAVDYAVGSIVWVRRRNGSWWPGKILGSDELSSSHLTSPRSGTPVKLLGREDASV 61

Query: 104 ----------------DWYNLEKSKRVKPFRCGEFDDCIERAESSQGMPIKKREKYARRE 163
                           DWYNLEKSKRVKPFRCGEFDDCIERAESSQGMPIKKREKYARRE
Sbjct: 62  LNTEESWEISNASNNKDWYNLEKSKRVKPFRCGEFDDCIERAESSQGMPIKKREKYARRE 121

Query: 164 DAILHALELEKELLKKQGKLNLYSDQTTIESLDATAKKGIISSEHIGADDINDGPSESYQ 223
           DAILHALELEKELLKKQGKLNL SDQ TIES   TAKK I+SSEHIG DD+NDG SES+Q
Sbjct: 122 DAILHALELEKELLKKQGKLNLCSDQMTIESSGVTAKKEILSSEHIGTDDMNDGHSESHQ 181

Query: 224 FSKIRDVNYDNEIMEPCLKASEGAQLSGEDDHSEARPRMRGLQDFGLKITPSKRKVLSSS 283
           FSKI DVNYD++I +PC K SEGAQLSGEDDHSEARPRMRGLQDFGL+ITPSKRKV SSS
Sbjct: 182 FSKIVDVNYDSKITDPCHKTSEGAQLSGEDDHSEARPRMRGLQDFGLRITPSKRKVPSSS 241

Query: 284 VVSNGFEMLATNTNPLAPAPLDGVCNIGNDSDANGMQQIDRAKRSKCMYLPADSSDSLEC 343
           VVSNG EMLAT+TN LAP   DGVC+IGNDSDANGMQQIDR KRSKCMYLPADSSDSLE 
Sbjct: 242 VVSNGSEMLATDTNALAPR--DGVCSIGNDSDANGMQQIDRVKRSKCMYLPADSSDSLEY 301

Query: 344 RESSLGQVEMSTPHLAAGVMPSRPDSLVEENASGSSENDSSDSETDSDSSRSDQDMDNDM 403
           RE SLGQVEMSTPH    VMPSRPDSLVEENASGS ENDSSDSETDSDSSRSDQD+DND 
Sbjct: 302 REPSLGQVEMSTPHSGTRVMPSRPDSLVEENASGSYENDSSDSETDSDSSRSDQDVDNDT 361

Query: 404 AALSDSTLPSEKEPSTFERMDTQEQGNMSSEEPDDSAHSGDTSHLYHHDPVSTNEAVSKW 463
           AALSDSTLPSEKEPSTFER D QE  NMSSEEPDDS HSGD SHLYHH+PVSTNEAVSKW
Sbjct: 362 AALSDSTLPSEKEPSTFERTDAQEHVNMSSEEPDDSVHSGDMSHLYHHEPVSTNEAVSKW 421

Query: 464 QLKGKRNVRNFSKKPVGVDDEPSSHLWVHGQTRLSNRNDYFDDSMEG-ADALEEEYYLTS 523
           QLKGKRNVRN SK+PVGVDDEPSSHLWVHG+ RL+N+N YFDDSMEG ADALEEEYYL S
Sbjct: 422 QLKGKRNVRNLSKRPVGVDDEPSSHLWVHGKARLNNKNYYFDDSMEGDADALEEEYYLAS 481

Query: 524 KMVPKDQYIVRNYMPDWEGQPALKGYWDVKNPLYGMRHHFGGRPRTILIDVDLKVHASYQ 583
           K V KDQY+ RNYMPDWEGQPALKGYWDVKNPLYG+RHHFGGR RTILIDVDLKVHASYQ
Sbjct: 482 KRVSKDQYLARNYMPDWEGQPALKGYWDVKNPLYGIRHHFGGRTRTILIDVDLKVHASYQ 541

Query: 584 KEPVPIVSLMSKLNGQAIIGHPIQIETLEDGFSETILSDSLGNTPSENDGSTALQPAWRT 643
           KEPVPIVSLMSKLNGQAIIGHPIQIETLEDGFSET+LSD LGN PSENDGSTALQPAWRT
Sbjct: 542 KEPVPIVSLMSKLNGQAIIGHPIQIETLEDGFSETLLSDGLGNGPSENDGSTALQPAWRT 601

Query: 644 ARRTANVRIPRPHLLTVFDGEEAGYDSPFADQERKSSRFKRVKTGVYNQK--AGQSGGQP 703
           ARRTANVRIPRPHL TV DGEEAGYDSPF DQERK +R KRVKTGV + K  AGQ  GQP
Sbjct: 602 ARRTANVRIPRPHLPTVLDGEEAGYDSPFGDQERK-TRCKRVKTGVNSHKTGAGQGRGQP 661

Query: 704 HIPR---PSHDRRLPKKLAKKVSLSSNQ---KTRTLSSIAVEQNFSNMPIHDSVTCHMNG 763
            IPR    SH+RRLP+K+ KKVS+SSN    KTRTLSSI VEQN SNM IHDSVTC MNG
Sbjct: 662 QIPRASSSSHERRLPRKMVKKVSISSNNQKTKTRTLSSIGVEQNHSNMAIHDSVTCQMNG 721

Query: 764 SMKPESSGPPTVACIPVKLVFSRLLEKINRPPSKAT 775
            MKPESSGPPTVACIPVKLVFSRLLEKINRPPSKAT
Sbjct: 722 LMKPESSGPPTVACIPVKLVFSRLLEKINRPPSKAT 754
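
The percentages in an HSP summary line are simply the identity and positive counts divided by the alignment length. As a minimal sketch (the function names `parse_hsp_stats` and `percent` are illustrative and not part of any BLAST tool), the following Python snippet parses a summary line in the format used on this page and recomputes the two percentages for the A0A6J1E3F4 hit above:

```python
import re

# Pattern for the HSP summary line as it appears on this page.
# Some report renderings spell "Positives" as "Postives", so both are accepted.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract counts and reported percentages from one HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, length, ident_pct, pos, _pos_length, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "aligned_length": int(length),
        "identity_pct": float(ident_pct),
        "positive_pct": float(pos_pct),
    }


def percent(count, aligned_length):
    """Percentage of aligned columns, rounded to two decimals as in the report."""
    return round(100.0 * count / aligned_length, 2)


stats = parse_hsp_stats(
    "Identity = 644/756 (85.19%), Positives = 669/756 (88.49%), Query Frame = 0"
)
recomputed_identity = percent(stats["identities"], stats["aligned_length"])
recomputed_positive = percent(stats["positives"], stats["aligned_length"])
print(recomputed_identity, recomputed_positive)
```

The recomputed values agree with the reported ones, which is a quick sanity check when alignment summaries are copied between tools.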

BLAST of CaUC10G187120 vs. TAIR 10
Match: AT3G03140.1 (Tudor/PWWP/MBT superfamily protein )

HSP 1 Score: 474.6 bits (1220), Expect = 2.5e-133
Identity = 337/781 (43.15%), Positives = 448/781 (57.36%), Query Frame = 0

Query: 47  GSGAVDYAVGSIVWVRRRNGSWWPGKILGSDELSSSHLTSPRSGTPVKLLGREDASVDWY 106
           GSGAVD+ VGSIVWVRRRNGSWWPG+ILG ++L S+H+TSPRSGTPVKLLGREDASVDWY
Sbjct: 5   GSGAVDWTVGSIVWVRRRNGSWWPGRILGQEDLDSTHITSPRSGTPVKLLGREDASVDWY 64

Query: 107 NLEKSKRVKPFRCGEFDDCIERAESSQGMPIKKREKYARREDAILHALELEKELLKKQGK 166
           NLEKSKRVKPFRCG+FD+CIER ESSQ M IKKREKYARREDAILHALELEKE+LK++GK
Sbjct: 65  NLEKSKRVKPFRCGDFDECIERVESSQAMIIKKREKYARREDAILHALELEKEMLKREGK 124

Query: 167 LNLYSDQTTIESLDATAKKGIISSEHIGADDINDGPSESYQFSKIRDVNYDNEIMEPCLK 226
             L  ++   +SLDAT ++  I    +   D ++G  ES  + +    N+  ++M   L+
Sbjct: 125 --LVPEKARDDSLDATKERMAI----VRVQDTSNGTRESTDYLR---TNHVGDVMH-LLR 184

Query: 227 ASEGAQLSGEDDHSEARPRMRGLQDFGLKITPSKRKVLSSSVVSNGFEMLATNTNPLAPA 286
             E  Q S ED   EA PRMRGLQDFGL+   SKRK+  S+     F+ LA  +N  A +
Sbjct: 185 DKEEDQPSCED---EAVPRMRGLQDFGLRTASSKRKISCSNGPDTSFKYLA-RSNSSASS 244

Query: 287 PLD-----GVCNIGNDSDANGMQQIDRAKRSKCMYLPADSSDSLECRESSLGQVEMSTPH 346
             D      +  +G +   +  +    AKR+K M+ P++S+D  +  E+ L   +     
Sbjct: 245 SGDHSMERPIYTLGKEKTKSRAE----AKRTKYMFTPSESNDVSDLHENLLSHRDAMHSS 304

Query: 347 LAAGVMPSRPDSLVEENASGSSENDSSDSETDSDSSRSDQDMDNDMAALSDSTLPSEKEP 406
            A G   +R       N     E+D S+SET  DSS  ++D D+D+  LS +   SE+  
Sbjct: 305 FAGG--DTRYSDYDPPNFLEDMESDYSESET--DSSDMEEDTDDDIPLLSGAGRHSERR- 364

Query: 407 STFERMDTQEQGNMSSEEPD-DSAHSGDTSHLYHHDPVSTNEAVSKWQLKGKRNVRNFSK 466
           +TF R  + E  + SSEE   +S+ SGD+S+LY  +P +    VS WQ KGKRN R   +
Sbjct: 365 NTFSRHTSGEDESTSSEEDHYESSISGDSSYLYSQNPNNEASTVSNWQHKGKRNFRTLPR 424

Query: 467 KPVGVDDEPSSHL----------WVHGQTRLSNRNDY--FDDSMEGADALE--EEYYLTS 526
           +         + L             GQ  +    D+   +D  +G D  +  E  +   
Sbjct: 425 RSARKRKLHRNRLEDGRYCEYKRRAFGQKPMGYGLDFNGINDMSDGTDDTDPNERQFGDR 484

Query: 527 KMVPKDQYIVRN---------YMPD--------WEGQPALKGYWDVKNPLYGM-----RH 586
            +VP D Y + N         Y  D        WEG+  +K   + K    G        
Sbjct: 485 MIVPGDDYQLSNVVASRCKNIYSHDMLDWDDDPWEGRIGMKKRGEEKLEGLGQEFDVSER 544

Query: 587 HFGGRPRTILIDVDLKVHASYQKEPVPIVSLMSKLNGQAIIGHPIQIETLEDGFSETILS 646
           HFG +  + L+DVDL+V  SYQK PVPIVSLMSKLNG+AIIGHP+++E L DG SE+ + 
Sbjct: 545 HFGRKTYSSLMDVDLEVRGSYQKGPVPIVSLMSKLNGRAIIGHPVEVEVLADGSSESYIQ 604

Query: 647 --DSLGNTPSENDGSTALQPAWRTARRTANVRIPR--PHLLTVFDGEEAGYDSPFADQER 706
             D  GN  +  D +  L  AW+TARR +N R+PR  P   +V   ++A YD   ADQ R
Sbjct: 605 TIDYFGNETTYQDKTFLLPSAWKTARR-SNSRVPRLQPFSSSVEADDDATYDYSLADQGR 664

Query: 707 KSSRFKRVKTGVY--NQKAGQSGGQPHIPRPSHDRR-----LPKKLAKKVSLSSNQKTRT 766
           K    K++  G +  +  + +      IPRP  +R+       KKL K  + +++QKTR 
Sbjct: 665 K-PLVKKLGLGHFSNDDNSVRRNSSLRIPRPPAERKQQHQQQQKKLLKNTNATASQKTRA 724

Query: 767 LSSIAVEQNFSNMPIHDSVTCHMNGSMKPESSGPPTVACIPVKLVFSRLLEKINRPPSKA 775
           LSS + EQ  + M      T  +  S +    GPPTVACIPVKLV+SRLLEKINRPPSK 
Sbjct: 725 LSSFSGEQGHNGMKASRDRTHEL--SNRRVLPGPPTVACIPVKLVYSRLLEKINRPPSKP 758

BLAST of CaUC10G187120 vs. TAIR 10
Match: AT3G21295.1 (Tudor/PWWP/MBT superfamily protein )

HSP 1 Score: 219.5 bits (558), Expect = 1.5e-56
Identity = 226/762 (29.66%), Positives = 323/762 (42.39%), Query Frame = 0

Query: 50  AVDYAVGSIVWVRRRNGSWWPGKILGSDELSSSHLTSPRSGTPVKLLGREDASVDWYNLE 109
           A+D +VG +VWVRRRNG+WWPG+I+   E+    + SP+SGTP+KLLGR+DASVDWYNLE
Sbjct: 11  AIDASVGGLVWVRRRNGAWWPGRIMAHHEVPDGTIVSPKSGTPIKLLGRDDASVDWYNLE 70

Query: 110 KSKRVKPFRCGEFDDCIERAESSQGMPIKKREKYARREDAILHALELEKELLKKQGKLNL 169
           KSKRVK FRCGE+D CI  A+++     KK  KYARREDAI HALE+E   L K     +
Sbjct: 71  KSKRVKAFRCGEYDACIATAKATASTTGKKAVKYARREDAIAHALEIENAHLAKDHPPCI 130

Query: 170 YSDQTTIESLDATAKKGIISSEHIGADDINDGPSESYQFSKIRDVNYDNEIMEPCLKASE 229
               T+ E     ++KGI  S  +   ++    + S + +K    N     ++P  +   
Sbjct: 131 EKASTSGE----VSRKGIEDSGDVAETEVALQSTMSLKKTK----NGKASKVQPLSEKRR 190

Query: 230 GAQLSGEDDHSEARPRMRGLQDFGLKITPSKRKV----LSSSVVSNGFEMLATNTNPLAP 289
                 EDD ++   RMRGL+D G+  T SK KV    L      NGF+    N N    
Sbjct: 191 RTPNDSEDDGTQTNKRMRGLEDIGMG-TGSKGKVQVGALLEDTQENGFKSDTNNIN---- 250

Query: 290 APLDGVCNIGNDSDANGMQQIDRAKRSKCMYLPADSSDSLECRESSLGQVEMSTPHLAAG 349
              D V N G+ S+ +        KR +   + A+     + R  +L +V  ST   A  
Sbjct: 251 ---DSVSN-GSLSNGSSRDCSPSMKRKRSPVVIANDYSKRKNRRRTLTKVLEST---ATV 310

Query: 350 VMPSRPDSLVEENASGSSENDSSDSETDSDSSRSDQDMDNDMAALSDSTLPSE------- 409
            +P   D LV  +         SD+ +DS+   S+   +N +  ++D    SE       
Sbjct: 311 SIPGTCDKLVNSDCLSLPGVSESDNNSDSNEVFSENVSENIVEVINDKGKESEVSNISVL 370

Query: 410 -KEPSTFERMDTQEQGN-----------MSSEEPDDSAHSGDTSHL--YHHDPVSTNE-- 469
            K+ S+    D    G+            +S  P  +  SG T       HD V  +E  
Sbjct: 371 AKDDSSNGLFDVPLNGDEKYPSGISTVPFTSSSPRKALVSGPTRRFGQSSHDDVVKSEGS 430

Query: 470 ------------------AVSKWQLKGKRNVRNFSKKPVGVDDEPSSHLWVHGQTRLSNR 529
                             + SKWQLKGKRN R  SKK V                    R
Sbjct: 431 NGSPSTSPAATLFNGIKKSTSKWQLKGKRNSRQMSKKQV------------------ERR 490

Query: 530 NDYFDDSMEGADALEEEYYLTSKMVPKDQYIVRNYMPDWEGQPALKGYWDVKNPLYGMRH 589
           N Y +++                          N +P W               L+ +  
Sbjct: 491 NAYAEEAN------------------------NNALPHWSVSD------QKPRSLFSVGT 550

Query: 590 HFGGRPRTILIDVDLKVHASYQKEPVPIVSLMSKLNGQAIIGHPIQIETLEDGFSETILS 649
              GR  + L DV ++V A+Y+   VP++SL SKLNG+AI+GHP  +E LEDG    I+S
Sbjct: 551 QAMGR-NSELYDVKIEVKANYKPRNVPLISLRSKLNGEAIVGHPSVVEVLEDGSCGHIVS 610

Query: 650 DSLGNTPSENDGSTALQPAWRTARRTANVRIPRPHLLTVFDGEEAGYDSPFADQERKSSR 709
                                 + R  + + P+P                     +K S+
Sbjct: 611 ----------------------SHRIDDAK-PKP-------------------SSKKKSK 627

Query: 710 FKRVKTGVYNQKAGQSGGQPHIPRPSHDRRLPKKLAKKVSLSSNQKTRTLSSIAVEQNFS 767
            K+                PH P        P+    K S S   KTR LS+++ ++   
Sbjct: 671 KKK----------------PHFP--------PQASKSKKSSSLAIKTRCLSALSGQK--- 627

BLAST of CaUC10G187120 vs. TAIR 10
Match: AT1G10380.1 (Putative membrane lipoprotein )

HSP 1 Score: 165.2 bits (417), Expect = 3.2e-40
Identity = 109/310 (35.16%), Positives = 153/310 (49.35%), Query Frame = 0

Query: 777  MSSDLSTMEFFHSLIFTFHILASTSISAQRTAAAAPPCRTTCGALTVKYPFGTGYGCGSP 836
            MS   S+  FF   +F+F  L S+ +S+Q        C+ TCG + +KYP GTG GCG P
Sbjct: 1    MSLKRSSSSFF-IFLFSFFFL-SSHVSSQ-------ACQKTCGQIPIKYPLGTGSGCGDP 60

Query: 837  RFSSHVTCSSDDR-LLLNTHTGDYPITSISYSDSTV-----------------------T 896
            RF+ ++TC  D + L L THTG YPITS+ Y+   +                        
Sbjct: 61   RFTRYITCDPDQQTLTLTTHTGSYPITSVDYAKQEIYVTDPSMSTCACTRPSHGFGLDWD 120

Query: 897  GPFQLGSST-FLLLDC---ESPSDSLSIRGS---AICDLSYAHLCASIYS-CPSVVDLGL 956
             PF     T F LLDC   ESP  +    GS   ++CD   + +C  +YS C ++  + L
Sbjct: 121  APFSFHDDTVFTLLDCSVDESPVFTPLSNGSGRVSLCDRQSSSICTFLYSNCRAISLINL 180

Query: 957  PLFAPTNSCCVYSPANFDGNGELDLRELKCGGFSSVVRLGE-YETDPMRWEYGVELKYGY 1016
             +    ++CCVY P +   + E+DL +LKC  +S    LG   E+ P  W YG+ LKY +
Sbjct: 181  QV----STCCVYVPLDLGPSFEMDLNKLKCSSYSGFYNLGPGQESHPENWNYGIALKYKF 240

Query: 1017 GALENSVMETKCKGCEMSGGACGFTPPENLFVCVCERGFNTSTDCKSNDLNQEFFWSSAS 1054
               +       C  CE S GACGF    + FVC C  G NT++DC        FF  +++
Sbjct: 241  NVFDE--YPGVCGSCERSNGACGFNTQSSSFVCNCPGGINTTSDC--------FFLYNSA 287

BLAST of CaUC10G187120 vs. TAIR 10
Match: AT1G51745.1 (Tudor/PWWP/MBT superfamily protein )

HSP 1 Score: 163.7 bits (413), Expect = 9.5e-40
Identity = 173/599 (28.88%), Positives = 251/599 (41.90%), Query Frame = 0

Query: 50  AVDYAVGSIVWVRRRNGSWWPGKILGSDELSSSHLTSPRSGTPVKLLGREDASVDWYNLE 109
           A++ +VG +VWVRRRNGSWWPG+ L  D++  + L  P+ GTP+KLLGR+D SVDWY LE
Sbjct: 11  AINASVGRLVWVRRRNGSWWPGQTLVHDQVPDNSLVGPKVGTPIKLLGRDDVSVDWYILE 70

Query: 110 KSKRVKPFRCGEFDDCIERAESSQGMPIKKREKYARREDAILHALELEKELLKKQ--GKL 169
            SK VK FRCGE+D CIE+A++S     K+  K   REDAI +AL++E E L K+     
Sbjct: 71  NSKTVKAFRCGEYDTCIEKAKASSSK--KRSGKCTLREDAINNALKIENEHLAKEDDNLC 130

Query: 170 NLYSDQTTIESLDATAKK--GIISSEHIGADDINDGPSESYQFSKIRDVNYDNEIMEPCL 229
           NL  ++ +   L     +  G   +E    D++   P +    S I     +N       
Sbjct: 131 NLSGEEDSKRCLSGKEDEDSGSSDAEETEDDELASAPEQLQ--SSISSQEMNNVGASKVQ 190

Query: 230 KASEGAQLSGEDDHSEARPRMRGLQDFGLK----ITPSKR------KVLSSSVVSNGFEM 289
                     EDD +E   RMRGL+D G +    I   K+       V  S  VSNG   
Sbjct: 191 SKRRRTPNDSEDDGTEGVKRMRGLEDIGKEQAGGIVEHKQDLDLICAVGLSDSVSNG-NT 250

Query: 290 LATNTNPLAPAPLDGVCNIGNDSD-ANGMQQIDRAKRSKCMY---LPADSSDSLECR--- 349
           +A      +P+ L    N+   S   N  +Q+ +   S  M    +  D   SL+C+   
Sbjct: 251 IANGNKVCSPSSLKR--NVSECSKRKNRRRQLTKVLESTAMVSVPVTCDQGVSLDCQGIY 310

Query: 350 -ESSLGQVEMSTPHLAAGVMPSRPDSL------VEENASGSSENDSSDSETDSDSSRSDQ 409
                G   + +    + V+ +  DS         EN  G+S N+ +     S  S S +
Sbjct: 311 DSKVSGMESVESMKSVSVVINNNSDSTGVSCEDAYENVVGASHNNKAKDSEISSISVSAE 370

Query: 410 DMDNDMAALSDSTLPSEKEPSTFERMDTQEQGNMSSEEPDDSAHSGDTSH-LYHHDPVST 469
           D  +D   L D  L  E+  S       +      +   D +   G  SH ++  +  S 
Sbjct: 371 DDSSD--RLFDVPLTGEENHSEGFPAACRISSPRKALVTDLTRRCGRNSHNVFVKNEASN 430

Query: 470 NEA--------------------VSKWQLKGKRNVRNFSKKPVGVDDEPSSHLWVHGQTR 529
             A                     SKWQLKGKRN R  SKK                  +
Sbjct: 431 GSACTSPPASEPVNCILSGIEKNTSKWQLKGKRNSRQMSKK------------------Q 490

Query: 530 LSNRNDYFDDSMEGADALEEEYYLTSKMVPKDQYIVRNYMPDWEGQPALKGYWDVKNPLY 589
              RN Y +++   +                                             
Sbjct: 491 EERRNVYGEEANNNSST------------------------------------------- 530

Query: 590 GMRHHFGGRPRTILIDVDLKVHASYQKEPVPIVSLMSKLNGQAIIGHPIQIETLEDGFS 600
                    P + L +V ++V ASY K  VP+VS MS+L+G+AI+GHP+ +E LE+ +S
Sbjct: 551 ---------PHSTLYEVKIEVKASYTKPRVPLVSRMSELSGKAIVGHPLSVEILEEDYS 530

BLAST of CaUC10G187120 vs. TAIR 10
Match: AT3G17350.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 19 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G50290.1); Has 203 Blast hits to 203 proteins in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 203; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 103.6 bits (257), Expect = 1.2e-21
Identity = 78/273 (28.57%), Positives = 112/273 (41.03%), Query Frame = 0

Query: 786  FFHSLIFTFHILASTSISAQRTAAAAPPCRTTCGALTVKYPFGTGYGCGSPRFSSHVTCS 845
            FF S ++T   L    +    T +AA  CRT CG + + YPFG   GCGSP++     CS
Sbjct: 7    FFFSFLYTITTLTFPPL----TTSAATSCRTLCGNIPINYPFGIDGGCGSPQYRGMFNCS 66

Query: 846  SDDRLLLNTHTGDYPITSISYSDSTVT---------------GPFQLG-----------S 905
            +D  L   T +G Y + SI Y   T+                  F++             
Sbjct: 67   TD--LYFTTPSGSYKVQSIDYEKKTMVIFDPAMSTCSILQPHHDFKMADIQNTLIRPSYD 126

Query: 906  STFLLLDCESPSDSLSIRGSAICDLSYAHLCASIY-SCPSVVDLGLPLFAPTNSCCVYSP 965
            + F L +C + S  +  R   +C  +  H C  +Y SC S            NS    +P
Sbjct: 127  TVFALFNCSNDS-PVHNRYRNLCFNAAGHSCDELYSSCTSFRIFNTTSPYGNNSTVHTTP 186

Query: 966  ----ANFDGNGELDLRELKCGGFSSVVRLGEYE-TDPMRWEYGVELKYGYGALENSVMET 1025
                 N+D    + +  L C  +++V+  G+     P+ W YG+EL Y       SV E 
Sbjct: 187  YCCFTNYDTVRVMSMNILDCSHYTTVIDNGKMRGVGPLDWSYGIELSY-------SVTEI 246

Query: 1026 KCKGCEMSGGACGFTPPENLFVCVCERGFNTST 1027
             C  C  SGG CGF     +F+C C    N  T
Sbjct: 247  GCDRCRKSGGTCGFDAETEIFLCQCSGSNNNPT 265

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_038897417.1 | 0.0e+00 | 89.95 | uncharacterized protein At1g51745 isoform X6 [Benincasa hispida]
XP_038897415.1 | 0.0e+00 | 89.08 | uncharacterized protein At1g51745 isoform X4 [Benincasa hispida]
XP_008451676.1 | 0.0e+00 | 88.84 | PREDICTED: uncharacterized protein At1g51745 isoform X2 [Cucumis melo]
XP_038897413.1 | 0.0e+00 | 88.03 | uncharacterized protein At1g51745 isoform X2 [Benincasa hispida]
XP_016901178.1 | 0.0e+00 | 88.12 | PREDICTED: uncharacterized protein At1g51745 isoform X1 [Cucumis melo]
Match Name | E-value | Identity | Description
P59278 | 1.3e-38 | 28.88 | Uncharacterized protein At1g51745 OS=Arabidopsis thaliana OX=3702 GN=At1g51745 P...
Q94HW2 | 1.3e-12 | 35.54 | Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
Q9ZT94 | 2.1e-12 | 36.36 | Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
P04146 | 4.0e-11 | 36.19 | Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3
P10978 | 2.2e-09 | 34.69 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
Match Name | E-value | Identity | Description
A0A1S3BSV5 | 0.0e+00 | 88.84 | uncharacterized protein At1g51745 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10349...
A0A1S4DYX8 | 0.0e+00 | 88.12 | uncharacterized protein At1g51745 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10349...
A0A6J1E6S8 | 0.0e+00 | 87.03 | uncharacterized protein At1g51745-like isoform X2 OS=Cucurbita moschata OX=3662 ...
A0A6J1J1L6 | 0.0e+00 | 87.14 | uncharacterized protein At1g51745-like isoform X2 OS=Cucurbita maxima OX=3661 GN...
A0A6J1E3F4 | 0.0e+00 | 85.19 | uncharacterized protein At1g51745-like isoform X1 OS=Cucurbita moschata OX=3662 ...
Match Name | E-value | Identity | Description
AT3G03140.1 | 2.5e-133 | 43.15 | Tudor/PWWP/MBT superfamily protein
AT3G21295.1 | 1.5e-56 | 29.66 | Tudor/PWWP/MBT superfamily protein
AT1G10380.1 | 3.2e-40 | 35.16 | Putative membrane lipoprotein
AT1G51745.1 | 9.5e-40 | 28.88 | Tudor/PWWP/MBT superfamily protein
AT3G17350.1 | 1.2e-21 | 28.57 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
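
Hit tables like these are usually ranked by E-value rather than by percent identity, because a short, well-conserved alignment can show higher identity than a longer, more significant one. The sketch below uses the five TAIR 10 hits listed above; the cutoff of 1e-50 and the function name `filter_hits` are arbitrary choices for illustration, not thresholds used by the database.

```python
# (match name, E-value as reported, percent identity) for the TAIR 10 hits.
TAIR_HITS = [
    ("AT3G03140.1", "2.5e-133", 43.15),
    ("AT3G21295.1", "1.5e-56", 29.66),
    ("AT1G10380.1", "3.2e-40", 35.16),
    ("AT1G51745.1", "9.5e-40", 28.88),
    ("AT3G17350.1", "1.2e-21", 28.57),
]


def filter_hits(hits, max_evalue):
    """Keep hits at or below max_evalue, most significant (smallest) first."""
    kept = [hit for hit in hits if float(hit[1]) <= max_evalue]
    return sorted(kept, key=lambda hit: float(hit[1]))


# Hypothetical cutoff chosen only to illustrate the filtering step.
strong = filter_hits(TAIR_HITS, max_evalue=1e-50)
for name, evalue, identity in strong:
    print(name, evalue, identity)

# Ranking by identity gives a different order from ranking by E-value.
by_identity = sorted(TAIR_HITS, key=lambda hit: hit[2], reverse=True)
print([name for name, _, _ in by_identity])
```

Note that AT1G10380.1 has the second-highest identity (35.16%) but a weaker E-value than AT3G21295.1 (29.66%), so the two rankings disagree.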
InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL246-FR2) v1
Date Performed: 2022-01-31
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR032872 | Wall-associated receptor kinase, C-terminal | PFAM | PF14380 | WAK_assoc | coord: 974..1020; e-value: 1.6E-5; score: 25.5
None | No IPR available | GENE3D | 2.30.30.140 | | coord: 48..168; e-value: 1.9E-10; score: 42.7
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 675..707
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 383..398
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 343..437
None | No IPR available | PANTHER | PTHR33697:SF2 | T17B22.17 PROTEIN | coord: 46..778
None | No IPR available | CDD | cd09272 | RNase_HI_RT_Ty1 | coord: 1039..1145; e-value: 1.5481E-38; score: 138.755
None | No IPR available | SUPERFAMILY | 63748 | Tudor/PWWP/MBT | coord: 52..159
IPR025287 | Wall-associated receptor kinase, galacturonan-binding domain | PFAM | PF13947 | GUB_WAK_bind | coord: 814..872; e-value: 8.7E-11; score: 42.1
IPR000313 | PWWP domain | PFAM | PF00855 | PWWP | coord: 53..136; e-value: 3.8E-10; score: 40.1
IPR000313 | PWWP domain | PROSITE | PS50812 | PWWP | coord: 55..110; score: 9.915161
IPR044679 | PWWP domain containing protein PWWP2-like | PANTHER | PTHR33697 | T17B22.17 PROTEIN-RELATED | coord: 46..778
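
The `coord` values above are 1-based, inclusive residue ranges on the predicted protein, so a domain's length is `end - start + 1`, and nested or separate predictions can be checked by comparing ranges. The following is a minimal sketch using a subset of the coordinates from the table; the `Domain` class and the helper functions are illustrative names, not part of InterProScan.

```python
from typing import NamedTuple


class Domain(NamedTuple):
    source: str
    name: str
    start: int  # 1-based, inclusive
    end: int    # 1-based, inclusive

    @property
    def length(self):
        return self.end - self.start + 1


# Subset of the InterPro hits listed above.
PFAM_PWWP = Domain("PFAM", "PWWP", 53, 136)
PROSITE_PWWP = Domain("PROSITE", "PWWP", 55, 110)
GENE3D = Domain("GENE3D", "2.30.30.140", 48, 168)
GUB_WAK = Domain("PFAM", "GUB_WAK_bind", 814, 872)
WAK_ASSOC = Domain("PFAM", "WAK_assoc", 974, 1020)
RNASE_H = Domain("CDD", "RNase_HI_RT_Ty1", 1039, 1145)


def overlaps(a, b):
    """True if the two inclusive ranges share at least one residue."""
    return a.start <= b.end and b.start <= a.end


def contains(outer, inner):
    """True if inner lies entirely within outer."""
    return outer.start <= inner.start and inner.end <= outer.end


for dom in (PFAM_PWWP, PROSITE_PWWP, GENE3D, GUB_WAK, WAK_ASSOC, RNASE_H):
    print(dom.source, dom.name, dom.length)

print(contains(GENE3D, PFAM_PWWP))   # PWWP Pfam hit sits inside the Gene3D fold
print(overlaps(GUB_WAK, WAK_ASSOC))  # the two WAK-related hits do not overlap
```

Under these coordinates the N-terminal PWWP predictions are nested, while the WAK-related and RNase H hits occupy separate regions towards the C-terminus.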

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CaUC10G187120.1 | CaUC10G187120.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0030247 | polysaccharide binding