CmoCh01G021140 (gene) Cucurbita moschata (Rifu)

Name: CmoCh01G021140
Type: gene
Organism: Cucurbita moschata (Rifu)
Description: Hydroxyproline-rich glycoprotein family protein, putative
Location: Cmo_Chr01 : 14614189 .. 14626416 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exon, CDS, three_prime_UTR
ATGGGGTCCGAGCAGAATAGAATCCCTCAACAAGAACGGGTATGCATCCCCTCCCCACAAATCATTATGCTTGTTTATTGCTCGAGAATATTGTTGTTTTTGCACATTGTTTCTTATACCAGTAGATTTAGAGTGCTCCAATCCCCATTTTGCTTGTTTAGGATTTGGATTTACTGCGTAAGCAATGGCATTCTTTTTCACGGCCTTTTCTTTCCTAGATCTAGGGTTCACCCCTCTATTTCCTAGCAATGTACCAAATCCATGTTTCTACCAAATTATTTTACAAATGTGGATTCCATGTACTTGATAGTTTTTTATATAGATCTTTGTTAGTTTTACAGCTTTGACTAATGGGCATCACCTTCTTGTTGTGATTAGGTTTGTGGTCATTCCACTTTATGCTTATTATTCCTTTGTTTTAGTTTCTTTTCATGTAAGGAGCCACTTTTCGAGGGACTTGAGCCACTGACCGTTACTAACATGGAAAATAAGTAAAAAATGCGTAAAATCTAAAACATTTGAACACGTGCATGCACTTTGAAATTTGATTAAAATAATAATTAAAATTTTTGCTTTTATAAAGAAACAAGAAAAGACCACTAAAATACTACACTACAAGATCGAACTATTTAAAATACATGAAATACAAGATTTAAAGGGAAGTATAAAGGAACTTCCGACTCAAGTGTGATGAGACTCCCTGTAGCCAACAACCCATGTGTGCCTCTGCCTCGAACTATGTCAAGCTCTGCCTGAAAAGAAAGTTGAAGTGGTAAGGATGAGTATATAAAATATTCTCAGTAAGCAGCTTGTTGGTAATTCATATTTTGTTATTAAGTCTTATTGATCTTGTTCATACATTCGGTATCTTATTCTCTTTTGGGGATTCGTTCTCCCGTATAAGCTTTTCCTAGACCTCCCGCCCTTGGAATGTTCTTAGGTCTTCTAAGTCTCCTAATTAGTCGTTTATGCTAGTCTCTCATCATCATTGTCATATCTTAGAGTCCTTGAGGCGCGACCTCGGTATAAATCTCGTAGGGCTCATGCATATTAAGGCTCGACCAACTCTAAGGTGCAACTATGGACCCCTATTGCTCATGATAGGTTCACCAAAAGGAAGCTAACGGACAAGCTTCCTAATAGTCCCCACAGGGCACAAATCATAAAAAATCATATGCCAGTTCTATGGTTATCTGAACCCTTGACAATTGAGTCAGGTTTATGATGAGGGGCTAGTTGGCGAGCTCCTACCGTCCCTGTGTAATTGCATAAAATCACATGCATGAACAATAATTATAACCAATTTACAAGGTGCAACTGTGGACCACTGCTGCTTATGGCCGGCTCACTGCAGGAAGCTAACTGGCCAACTTCACAGTAGTCCCCAAATGGTACAATCATAAATCATAAAATATCATAAGGCAGTTCTATGGTAATCTAAACCCCTGACAATCGAGTCAGGTTCACAACTAGGGGTTTGTTGGCGTACTCCTAACTGTCCTCGTATAACCGCATACATCAGAAACATATGAAATAGAGACATAGCATAGCGATCTAAGCATGGATACCTATTCATAAATTTCAGTTCATAAGATAGTGCGTACGCGTTTTAGTAACATACATAATAACATTCCACATATGTTATCTTAGCTCATTTTATGCACAACCATTTATAAGCGGCTTCCTATAGCATGCTTCATATCATACTAAACAGTTCATTCCTCGATAAATGTGCACCATAACGTTTAATGTGCTCTTACATAGATCAAACCAACTCATTGCATGCTCAAATTTATATGACATGCTTCTCATATCATATTTTCATGCATGCTATGAGCATAGCTAAATCCTAAATTCTGAGGTGCCACTTACTTGGAAAGCGTAACATTTGGGTTTGCGTCTTTAGGGGATTCCAAAAATTGTCAAACTCCTTAAATAACTCTTACATTGATATAAATCGGTCAAAACGGTGCCAAAGTCGGAGGTAAAACGAGTAAATT
ACGCAAAAAGAGGCAAAATCGGGTTGGACTGCCCGAGTTGGGTTGGGCCGCCGAGTTGACCCGCCCACGGGTCGGCCACTCGGGTTGGGCTATCCGCGGGTCGGGCCTGTGGGTTGGACCCATCCTTTTCCTCCAAATTTTTTTTTCTTTCTTTCTCTTTCTTTATCTTTTTCGCTTCTTTTCTCTTTTTTTTTTCTTTTCTCTTTCCTCCTCTCCTTCTCCTCCCTCTTCCTCTCTTGAGAACCTCCTTCTTCCATTTCTTCTCTTTTGCTTCTTCTTCTTCCCTTTTTTTCTTCTTCTTCTTTCATAATCATTTTTTTCCCTCTTATTCTTTTTCTTTCCCTCATCTCTGCGCTTGCCGGATTTCCTCCTCCCTTTCTCGCACTCGCCCTTTGTCGTACAAGTCAAAACTCCAGCTGTTTTCTTCAACAAAGTCTAGATTGTCCTCTTGCGAACCTATAGCAATTGAAATTGGGTGAACTCATGGAGGGTAGGGGACGAATGTAACTGTAGAAGCTCTCAAAACTCATGAATGTACCTTGGCGTTCTCCCTTCGTTCGACTTTTCGTTTTGGCTTGCTTCCATCCAAAATTGCCAACGAATGCCCGTGGATCTAGAGCTTAAGTTCTCCCAACAAGCGTGGGTTTTGGTCTTCAACCTCCGGTGCCTTCTCCCTTTGACAAATTTGAGTCCTTCTTCCTTCCGCAGCCAGCAAGCCATCTTAAATCCACCCTTACCTCCCTCTGTTCTCACTCAAAGAGGGATGGGACGGAGGTCTTACTCTTTCATATATGCACACACTTTGAACTTTGAACTTGCTCTTTCGGTTAATCATACTTTGAACTTACTCATTCGGTTAATTATACTTTGAACTTACTCTTTCATATATGCACACACTTGTAATGTACGTGAATCCTTCAAGGTGAGATGCATTTGACTTATTCGAGAATCAAGTTCAGTCGTTGGTCCTTTTACTATTAGCAAATATTTAATTTTGCATTAATTACATTATAAAGTAATTAATTCTTCCTACATATGAATCCGACAAGCACGTACATTTAAGTATTATTTTAGCATCACAATTGTTTTTTTTTAATATATAAACCAAGCTTTTATTAAGAAAACATGAAAGAATATGAGGGGTAACCCCATGAGTCGAGATAAAGGACAGGGCTCAAATCAAGCTAGATGAACCCTGCAGATAATTACTAAAGTTCTTATGCATTGATGCCTAGAGGGAGACGTGAAATATGAAAAGAGACCAAACTTGCTTCCTAGTTTGACCCTTCCCCTCTAAACATATGATTGTTTATCTCTTCTCGAACACCTCACAAGGGTTAAGAATTCAAGTACGAAGATCCCTCCTCTTGGGACAAGCAAATGAATCCCCTCAATAAAGATAAAATGGTCAAAATATCCACATTAGAAAGCAAATCGTTAAGGTGACCCCTAACCTAAGCAAGAGACTAACACGGATCCCTCATGAAGAAACAAGTCATTTCTCATCCCTGCTCATGCTTGTTATAACAGTTCCTTATGGGAGGCTACTAGACTGGTCTTTCCCTTTTAGGTAATGGTTAGTTTAACCCAAGGTCAAGAGAGGTGGAACCTTCCAAGGAGGAAAGGACGTAAGCCATTAAATGAAGCCTCATGGAAGATAGATGGTAAAGATGAGAAAATAAAGAGTAAAGAGATCTATCATCCAAACTAACGTCCTTCCCATAAGTGTTCACCCCACCACATTCTCAAACATTGAGAGAAAAGAAAGGGAACCCCTAAAAGAATAACTTCCCACAAATGTTTGTAAGTGCCTTTTGAATGCTACTCGAGAGGGTGCTAACATCACAGTTAATCAATGTTCATTATATTCTTCATACGAATCTTTATTCATCATATGAAGAAGCTTCAAGAGAATATTCCACGCTAAACAAGTAGCATAACATAATTGTCTTATGATCTTGTTGTTAGGAACCTGAAGTAGCGGACTAGTTTGACTTTTG
GGTAATTTCAAGAATGCCCGTGTGTCCTAGTGGGATTTATATTTCAAAGAAGTTGCTTGTGGTACTTTGTGATTGTTGAGTATTTATGATGGCGAAAGCCTTCTTTTGAATTGGTTGGTTGGAAATTGATTAGAGAAGTTTTTGAAGGATGCTAGCTATTAAACCGTATTTGGGAGAAATTAGAATTATAGGCGTCTCTATCTAATATTTGTTATGCCTTTTTGTATATACAATATTCTTTCATCCATTCTAATTAGAGTAGTTCTTTTACAACCCTTGTGTTTTTATGTTTTCTATAATTTTTTTCTTTCCTTTTTTGGTTGAAAAAAGAAACATAAGAGATACATATCATGTATCTAGGAAAAAGTGAAATTCAAGGATGAGAAGAAGAGTATGAATTTTTTTTAAAAACTAAATAACTTCCCATTGATGTATAGAAAGGAAGGAAAAACGTTAAGGATACAAGTTGCCAGTGGGAACACGACAGACCAGATAAGGAAAAATAAGCCCTAAAATACCGGAGGATAAAATGATTAAAAAAACTAGTGAGACAATAAAAAATACAGCAAAAGATTTGAAATAAAGAGCTCCCACTTTGGCTAAAAATGTAGGAACTTCTTCGGAGTGGAAGTATAACAAATACTCAAGAACTTGAAACTTAGAAAAGCAACTCGAGAAAATGAAGCTGAGAAAAGATATTTTTTGAAGAAACACTTCTTGCATTGAAAGAGTTATTAGACTAATGCAAGCCTTTGAAACCTTCCAAAATCCTCTAGAATGCCAAAGAAGGATCTTCGTTGAGGTGTATAAAATTTTACCAATACCCGCCAATATAACCAAATCGAGAACACAATTCAATCAAATTCAAACTCATCCATATTCAGTTGTTAATAAAACTGATTTTCTTATTCTGAAAAGCACATTGGACTGGAAAAACTTAAAACCGAATGCTTTGAAGCTTTCCGCAACCAAAAGCTATAAAAGAATCTTGAATAAGGGAATAAACAAAAAGAACAACTTAACCTTTTTCTGAGTTGGGAGTAGGGGAAATTTCATGCATATGACGACTGCAAATCTCATTAAGAGGTGAAATTTTTAGGAATCAAGATGGTGGACTCCACCAGAAAATCTTCTTTGACCGGGTCATCCATAAACTTTGAAGAAGCTATAACATTTTTCGAAAAAAGGAGGGTCGAGGATTGTTCAAGATTGTCATGGATTTTCTGATCAATTTCCAATTTCTCATTTCTTAATGATTCAGCATCAGAGTCGATAAATGGTGAAAAAAATGATAGAGCATCTGAAATCTTAGAGGAAAACGGTACACCATTTTCTGAATTAAGTTTGAACTTGGAATGGAGAATGATGAATTAGCAAAACCATGAGCAAACATCTTCAATAAAATTTGGATTAGTCTCTAAAATTTATGTACTTATTTTATTTGATCATAAGACCAAATGAGAATCAGTTGCATTTATTGAGATAATAGGATCTGACTGCATAGGCATACAAAATTCCATTGACTTGATAAAATTTGAAGAGTCTGGAACCATGAAAGACAAATAGTTAAATGCATTTTTTCTTTTATAAAATTTCGAATAGTGGCCTCGTTAATAACCTTGACTATTAGGTGTTCACTTCCAATATTTCAATCTCAATCATTTCCATTTAAAGCTGGTCCAATAAAACCGGGGGAAATAACACCTTTGAAGATGAGGACTGTTACTGATCATGCAAGAAACCGCCATTTAATCCTTTGTGTCCCACAAAGTTATTGTTGCTTGGAGAACCAAGGCCATCCACATCATTGATCGTATCATGAACGGAAGGGAACTTTGAATAATGGACCCCTGCCATCAAATCATCTTTGTTTGTGATAGTGAATCCTTCATCTTCCATCACCTGCTTAATTTTCCTTATATCCACGAAGTTTGAAGAGTCTCTAATGGACAAAACACCATGATTTCTTGGGAAATTAATACAATGGCTGAGGAAATTAG
TTGTTTGGGAAATTAATGTTTACCAGACCACTGAAAGTGTTGCCCTATTGCTTCAAAAAAGAACGTTTTCAATATGACAAACGTAAATTCTTTAAAAATATTCATCCTCTGTAGCTCTTTATACACTCAAGGTGGGAGTGTTTCTTGTTGTACAAGCTTTCTTCCACCTGGGTAAGGCCAAATAGTACATTTTGAATGCCAATCATTGGATGAACCAAATTTAGACCGCTAGTTGATCCCAAAATCCTTTCTAAATTTCTTATGAAAAATCGAGCAAACTGGAATTTGTCAAAGCTCAACCATAGAGCATTCAAACCGGAATAAGAGGGAGGTTAGAAGAAGAATCTTCTTTTTGTTCACATTTTCGATGAAAAACTTGTCATTTTTACATTCAATACGAAAATCAATATTATGCATGTACTAACTCATCACTACCGTTGAACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGAAAAAAGAAAAAAGAAAATCGATCGACTTTTAATTATGAAATCTTGATAAGAAAAATCTATGAGAATATCACATACTAGGAGAGCTGATAGCTGGAAGGTTGAACGAAGAAAATGCAGATAACTATAAAAGGGCAAGCAACGGTTGCAGTCAATGAGTTGGTGCTGTGTGGTTAGGGAAGGAAGAGCAATTGAGTGATGGGAGAATTAGAAGAAGAATAGAAAGAAAAAGAAGAAAAAGGGGTTAATGAACAAAGGTAGAGTGGTCGATGGAATGGATGGGGAATGGGTGAGGAAAGAAAGACGGGAAAGACGTTGACATGAGTCATGGGAATGGTAGAGATGTCCATTTAATCCTAAAGGCCTGAGTACTCATCTCGTATAGGAGGGGGAATCTTGTACACAAGGAATGGAGGAGGGAGTAGGGATGGAGGAGTTGAGTTGATGTTGTATTATTGAAATTTAAGTATAAAAATTTATGTTACTTGAACATTATAATTGTATTTTTGCTTATGGTATATTATTATTTTTATTGTGATAATTTGTAACTTTTCTTTAAATAGTTATGTTGAGGATAATCCATTTAGAAGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGGAAAGAATGGTTATTAGAGTTGAAAATAAATTTAAAAGATAGAACAGATAAAAAAAAAAAAAGAAATAAAAGACAAAAAAAGTTTTCTTGTAGTGAACTTGCTCTCGGACAAATAAATGAAAATTTTAGTGAGAATAGAGATTCAAATTGACGGCAGGAATGTTTCTATTCTTGCCCTGTAGACATCTCTAAAGTAGGGTAGGGGCAGGGGGGAGCGATTGATGTTTAGGAAAAGGAGCATAAGTGACACAAATGGGGTTTAGAAAAAGGATTGAAGGCGGCACAAGAACAAACAAGGAAGGTACGATTGTCAGAATCGTCAAAGAATTCTTCCTTATGCTTAACCCTAAAAATTGTAGAAGACCCAACAAGTGGAAATCAATATTAATATTGGAGAGGAAATGACGTTACCGGAGTTCGAACATAGGACCTCTTGCTTTGATATCATTTTAAAACACCATTCCACCTCTAGGTTCTATGTTTCTCTTTGGGTGTCAGTTTCTAAGATCGTTTGTAATTATTCACTTGATTGAAATTCTTTTTTTTTATAACGAAAAAACAAGATTTTTCGTTGACATAATGA
AAATAGACTAATGCTCTAAGAATGCATAGAAGTGAACAAAAGTAACAAAAAAAAAAAAAAATCAAAACTAAAAAGTTACGATAAGCAAAACACTATAAATAATATATAAATGCATTCCAATTCAAACATATGCCCCGAAGTGAAAATCCCACGAACAACTTTGTCAAAGAACACCAAGAATTGGTTTTGAGATGAGCTGTTTTCCTTGAAAAAATTCTTTGATTCCGTTCCAACCATCGTTCCGAAAGAAAAGTTTTAACCGCATTAATCCATAACAACTTGGCCTTAGAATTGAGAAGTGTTGGACCATGTAGTGTGGGAGATACTGCGATGAGACATTAGAAAAAATTGAAAATTGAGAACAGCTTGAACCAACCAAAGCTTGCATAAAAGCAATTAAAATACACATGGTTGGGAGAGCATCCTGAAGGCTGAGGAGCAGAAACTGATGGAAACAGACGGTGAGCTGGATTCTTCTTTTTCCAAACATTGGACGGATTAAAATTCCATCAAGCATTATCCACAAAAAAATATAAGTTCTCCTAGGACTCTTGGATTTCCGTGAGGCTAATTATAACTCTTTCTCCAAAGGTGAAAAAGATACCAGGTGATCATTTAAAGATTTTGCGGAGAAAATACCTGAAGGATCATTTGACCAATGCCTAGAATCTTCATTTGTATTAAGACTCGGAACAAAAATATTGTGCAGTAGAGTAGAGTCTGAAAATCGGCGATCTCCTCATCTTTCAAGCATCAACAAAAAGATAGATTCCAAGAACCAGCGTGCATATCCCAAAATGCTCAAACAGAGGCATTGGGAGAAGATGAAATAGCATACAGCCGAGTAAAAAAAGTCCCTAAAACATAAAATTTCAACCAAGGATCTTCCCAAAATCGAATCTTCCTTCCCTTGCCCAAATTAAAGTGAGCAAGTTAATCAAAGTTCTGCCATAATTTTGAAATTGAATCCATGGACTTCGAAGACTTAAATCTCTTCCAAGCCTGAATTCCATCCATTATTACAGAACCATAGATGCTAACAATTACTTTCCTCCATAAAAATTGAGGTTCCATCATGAAAGCCAGTCCCATTTGGCCAATAAAGCCATATTTCTCTGCTTCAATGCACCAATTTCAAGTCCACCTTCTCTTTGAGTCTTTGAAGCAAGACTCCATCTAACCAAATGTTACGAAACTGAAAAGGGGAAGGTCCCTAAGAAAAACTACCAACTCCAAAAAGAGGAGGGAAGTGATAAAAAATGAGGACCTTTTCTGAGACTCTAGAATTATCAAACATCACATCCCATTCCTTGGAAATCAAAGAACGATCTATGAGAAATTGAGAAGAATCATCTCCTGGTCTTGACCTCGTTACTTTCCATTAGATAAAGGCAACTCCCAAAGGCCTTATTCATCAATTCAATTATTAAACTCTCATTCCCTCTGGTTGTTCAACCAACTGGAATTCTTTCTTGAATAGAGTATCTTCAAGAAATGATTTTTTGCTGAAAAGAATATCTTCAAAAGAAGTTAGAGATTCTTCTTCAATACAAACAAGAGCTCAATGAGGCGTACTCCACCATTTCTGAAAAATACAGAAGTAAAATCTACAGTCGCAAATTTGAAAGCCCTCCCTAGCGTTTTCAACTTTCTCAAGATGGTAGCAAAATGTCAATAATAGTCATTTTTTTTGGGTGTTTTGTTAGGTCTATGAAAAGTTTTTGTTCATGGTTATTCTCTATTTGAGTGTCTTTTATATCATTAATTTTTTCTTTTTCACTCCCTCTGGGATTTTGTATCCCTTGAACTTTTAATCCTTTTCATTATATCTATGAGAATTTTCTTGTTAAAAAAACCTAGTAAATTTTTATTTCTGGAAGACAACGTAGTTAAAGGATTTGGTATTCTGCACAAGACAAACTTTTGTGAACAATAGCATTTGATTTTTCCTTCTTTTTGTTTTTTGTTTTTTATTTTAATGATGTTTGATTAAATATTTG
TATTTATTGCGGTTGATATTTGATTTATTGATCCCCACATTCAGGCTGCTCCTTTGAAGCGCTTAAGTGACTTACATCCATTAACAACTTGCTCAGATCTAAAAGGCACTTTAACTAATGCTAAAACTGGATATTGGACATTTTTTATGGATAATTTTGGAATGGCCATCTCTGCTAGCAAGAATTTGTACGAAAATTGTGAAAAGGGATGGTTCTAAACTTTTTGGTTCTTCATTGACTATTCTTTAATGAGCTGTTGTGATAACCTGTTAAGTTCTGGTACTTTCTTCTGTACCAGGGAAAGAGATGGGGTGGATGTTGGGGTGCATTATCTTGTTTTCACTCTCAGAAAGGAGAAAAGCGCATTGTGCCTGCATCTCGTTTACCTGAGGGCAATGTCGTGACAACCCAGCCAAATGGACCTCATGCAGCAGGAATAGCCAATCAGGCTACTGTGATAGCTCCATCCCTTTTAGCCCCACCTTCTTCACCAGCATCATTTACAAATTCTGCACTCCCTTCAACAGCCCAATCACCTAGCTGTTTCTTGTCGTTGTCTGCCAACTCACCTGGAGGTCCTTCATCCACAATGTTCGCTACAGGGCCATATGCGCATGAAACACAACCGGTTTCTCCTCCTGTTTTCTCAGCCTTCACCACTGAACCGTCAACTGCTCCACTCACTCCCCCACCCGAACTAGCTCACCTAACCACTCCTTCTTCCCCTGATGTGCCTTTTGCTCAGTTCCTATCCTCATCGATGGATCTCAAAGGAACTGGAAAGGCAAATTACGTTGCTTCCAATGATCTTCAAGCAGCATATTCTCTCTACCCTGGAAGTCCTGCCAGTAGCCTCGTGTCACCAATTTCAAGGACCTCCGGCGATTGCTTATCATCTTCTTTTCCTGAGAGGGACTTTCCACCACAGTGGAATCCTTCAGCTTCTCTCCAAGATGGAAAATATCCAAGAAGTGGTTCTGGTCGGCTATTTGGACATGAGAAAACTGGCACACCTCTTGCTTCTCAGGATTCTAATTTCTTCTGCCCTGCTACATTTGCACAATTCTATCTGGACAATCCACCATTCCCTCATACTGGTGGGAGGTTAAGTGTATCAAAGGATTCAGATGTTTACGCTTCTGGTGGGAATGGATACCAAAACCGGCACAGTAAGTCTCCAAAACAAGATGTGGAGGAAATAGAAGCTTACCGAGCATCTTTTGGTTTCAGTGCCGATGAAATTATAACTACCACACAATACGTGGAGATATCTGATGTAATGGAAGATTCCTTTACTATGAGACCTTTTACTTCAACTAGTCTGTCTGCAGAAGAAAGTATTCAACCTCCATTAGTGGGTGAAAAACTGAAATCCACGCAGGCAACATTACAGAGTCAAAGAAGTATTAAATCAGCATCTGACGTTGAAAAAGAAACCTGCTCTGAAGTCCTGGCATTATGCAATGGCTGTAAAGGTGAGCCACTGTTCCCTGTTACAAGCTCACACTTGTTTCATGAGAAATCAAAAGAAAATAAAGACAAATACATTTTGGGGAACTTTTTCAAAATGTTGCAACTTTTCTTTCCGTTTCATTATTTCCTTTTTATATACACTGACTCACAATTCTGATGCTAGTAGTCAGGTTAAATTACATTTTTGGGATGTGGTTTTAGGAAAGCTACAATTTAGTTATGGTTACAATTGTCACCTGTTTGTCACCTGTTTGATGCAGACGATAAATTGCAAAGACAACCTGGTAACTTGCCAGGATCAAGTACTTCCCAAGGTGAAACAGAAGACCTATTCTCAAGAATAGGGTCGTCCAAAAATAGCCGCAAGTATAATCATGCTTTATCCTGCTCTGATGCAGAAGTTGACTACAGAAGAGGAAGGAGCCTGAGGGGGGAGGTCAAGGGAGATTTTTTATGGCATGACTAAGAGCCATTTCTGAAATAGTTTTTAGTATGTGTATCTGTTCTTTGCTTTGCGGGTTTCCATG
GAATGTCTAACCTATGACCTAACCTGTTCTCTGAGTTGACTTGATCAGTGGATGGATGCATATAGTTTGTGTTCTTTATCGTTCCAGTTTAGAAATGGAATGGGTCAAATCTCTGTATTTGGTAGACGAAGTTTGTCTTTTTTCCAATGAATTATATAATAGCATGTACAATCTTGCCCATCTCGTGCATACGTCATCATTATGCATATCCAAACTCCCGTTAACAGT

mRNA sequence

ATGGGGTCCGAGCAGAATAGAATCCCTCAACAAGAACGGGGAAAGAGATGGGGTGGATGTTGGGGTGCATTATCTTGTTTTCACTCTCAGAAAGGAGAAAAGCGCATTGTGCCTGCATCTCGTTTACCTGAGGGCAATGTCGTGACAACCCAGCCAAATGGACCTCATGCAGCAGGAATAGCCAATCAGGCTACTGTGATAGCTCCATCCCTTTTAGCCCCACCTTCTTCACCAGCATCATTTACAAATTCTGCACTCCCTTCAACAGCCCAATCACCTAGCTGTTTCTTGTCGTTGTCTGCCAACTCACCTGGAGGTCCTTCATCCACAATGTTCGCTACAGGGCCATATGCGCATGAAACACAACCGGTTTCTCCTCCTGTTTTCTCAGCCTTCACCACTGAACCGTCAACTGCTCCACTCACTCCCCCACCCGAACTAGCTCACCTAACCACTCCTTCTTCCCCTGATGTGCCTTTTGCTCAGTTCCTATCCTCATCGATGGATCTCAAAGGAACTGGAAAGGCAAATTACGTTGCTTCCAATGATCTTCAAGCAGCATATTCTCTCTACCCTGGAAGTCCTGCCAGTAGCCTCGTGTCACCAATTTCAAGGACCTCCGGCGATTGCTTATCATCTTCTTTTCCTGAGAGGGACTTTCCACCACAGTGGAATCCTTCAGCTTCTCTCCAAGATGGAAAATATCCAAGAAGTGGTTCTGGTCGGCTATTTGGACATGAGAAAACTGGCACACCTCTTGCTTCTCAGGATTCTAATTTCTTCTGCCCTGCTACATTTGCACAATTCTATCTGGACAATCCACCATTCCCTCATACTGGTGGGAGGTTAAGTGTATCAAAGGATTCAGATGTTTACGCTTCTGGTGGGAATGGATACCAAAACCGGCACAGTAAGTCTCCAAAACAAGATGTGGAGGAAATAGAAGCTTACCGAGCATCTTTTGGTTTCAGTGCCGATGAAATTATAACTACCACACAATACGTGGAGATATCTGATGTAATGGAAGATTCCTTTACTATGAGACCTTTTACTTCAACTAGTCTGTCTGCAGAAGAAAGTATTCAACCTCCATTAGTGGGTGAAAAACTGAAATCCACGCAGGCAACATTACAGAGTCAAAGAAGTATTAAATCAGCATCTGACGTTGAAAAAGAAACCTGCTCTGAAGTCCTGGCATTATGCAATGGCTGTAAAGACGATAAATTGCAAAGACAACCTGGTAACTTGCCAGGATCAAGTACTTCCCAAGGTGAAACAGAAGACCTATTCTCAAGAATAGGGTCGTCCAAAAATAGCCGCAAGTATAATCATGCTTTATCCTGCTCTGATGCAGAAGTTGACTACAGAAGAGGAAGGAGCCTGAGGGGGGAGGTCAAGGGAGATTTTTTATGGCATGACTAAGAGCCATTTCTGAAATAGTTTTTAGTATGTGTATCTGTTCTTTGCTTTGCGGGTTTCCATGGAATGTCTAACCTATGACCTAACCTGTTCTCTGAGTTGACTTGATCAGTGGATGGATGCATATAGTTTGTGTTCTTTATCGTTCCAGTTTAGAAATGGAATGGGTCAAATCTCTGTATTTGGTAGACGAAGTTTGTCTTTTTTCCAATGAATTATATAATAGCATGTACAATCTTGCCCATCTCGTGCATACGTCATCATTATGCATATCCAAACTCCCGTTAACAGT

Coding sequence (CDS)

ATGGGGTCCGAGCAGAATAGAATCCCTCAACAAGAACGGGGAAAGAGATGGGGTGGATGTTGGGGTGCATTATCTTGTTTTCACTCTCAGAAAGGAGAAAAGCGCATTGTGCCTGCATCTCGTTTACCTGAGGGCAATGTCGTGACAACCCAGCCAAATGGACCTCATGCAGCAGGAATAGCCAATCAGGCTACTGTGATAGCTCCATCCCTTTTAGCCCCACCTTCTTCACCAGCATCATTTACAAATTCTGCACTCCCTTCAACAGCCCAATCACCTAGCTGTTTCTTGTCGTTGTCTGCCAACTCACCTGGAGGTCCTTCATCCACAATGTTCGCTACAGGGCCATATGCGCATGAAACACAACCGGTTTCTCCTCCTGTTTTCTCAGCCTTCACCACTGAACCGTCAACTGCTCCACTCACTCCCCCACCCGAACTAGCTCACCTAACCACTCCTTCTTCCCCTGATGTGCCTTTTGCTCAGTTCCTATCCTCATCGATGGATCTCAAAGGAACTGGAAAGGCAAATTACGTTGCTTCCAATGATCTTCAAGCAGCATATTCTCTCTACCCTGGAAGTCCTGCCAGTAGCCTCGTGTCACCAATTTCAAGGACCTCCGGCGATTGCTTATCATCTTCTTTTCCTGAGAGGGACTTTCCACCACAGTGGAATCCTTCAGCTTCTCTCCAAGATGGAAAATATCCAAGAAGTGGTTCTGGTCGGCTATTTGGACATGAGAAAACTGGCACACCTCTTGCTTCTCAGGATTCTAATTTCTTCTGCCCTGCTACATTTGCACAATTCTATCTGGACAATCCACCATTCCCTCATACTGGTGGGAGGTTAAGTGTATCAAAGGATTCAGATGTTTACGCTTCTGGTGGGAATGGATACCAAAACCGGCACAGTAAGTCTCCAAAACAAGATGTGGAGGAAATAGAAGCTTACCGAGCATCTTTTGGTTTCAGTGCCGATGAAATTATAACTACCACACAATACGTGGAGATATCTGATGTAATGGAAGATTCCTTTACTATGAGACCTTTTACTTCAACTAGTCTGTCTGCAGAAGAAAGTATTCAACCTCCATTAGTGGGTGAAAAACTGAAATCCACGCAGGCAACATTACAGAGTCAAAGAAGTATTAAATCAGCATCTGACGTTGAAAAAGAAACCTGCTCTGAAGTCCTGGCATTATGCAATGGCTGTAAAGACGATAAATTGCAAAGACAACCTGGTAACTTGCCAGGATCAAGTACTTCCCAAGGTGAAACAGAAGACCTATTCTCAAGAATAGGGTCGTCCAAAAATAGCCGCAAGTATAATCATGCTTTATCCTGCTCTGATGCAGAAGTTGACTACAGAAGAGGAAGGAGCCTGAGGGGGGAGGTCAAGGGAGATTTTTTATGGCATGACTAA
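
As a quick consistency check, the start of the coding sequence above can be translated and compared with the query protein that appears in the BLAST alignments further down. The following is a minimal sketch that assumes the standard genetic code. Only the first 60 nucleotides of the CDS are copied in, and `translate` is a hypothetical helper written for this illustration, not a function from any particular library.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# First 60 nucleotides of the CDS listed in this record.
CDS_PREFIX = "ATGGGGTCCGAGCAGAATAGAATCCCTCAACAAGAACGGGGAAAGAGATGGGGTGGATGT"


def translate(dna):
    """Translate DNA in frame 1; unknown codons (e.g. with N) become 'X'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3)
    )


print(translate(CDS_PREFIX))
```

The result should match the first 20 residues of the `Query` lines in the alignments (`MGSEQNRIPQQERGKRWGGC`), consistent with `Query Frame = 1` in the reports.
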
BLAST of CmoCh01G021140 vs. Swiss-Prot
Match: Y1666_ARATH (Uncharacterized protein At1g76660 OS=Arabidopsis thaliana GN=At1g76660 PE=2 SV=1)

HSP 1 Score: 461.1 bits (1185), Expect = 1.5e-128
Identity = 276/468 (58.97%), Positives = 321/468 (68.59%), Query Frame = 1

Query: 1   MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEG-NVVTTQPNGPHAAG 60
           MGSEQ      ++ KRWGGC G  SCF SQKG KRIVPASR+PEG NV  +QPNG H AG
Sbjct: 1   MGSEQ------DQRKRWGGCLGVFSCFKSQKGGKRIVPASRIPEGGNVSASQPNGAHQAG 60

Query: 61  IANQATV--IAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPY 120
           + N      I  SLLAPPSSPASFTNSALPST QSP+C+LSL+ANSPGGPSS+M+ATGPY
Sbjct: 61  VLNNQAAGGINLSLLAPPSSPASFTNSALPSTTQSPNCYLSLAANSPGGPSSSMYATGPY 120

Query: 121 AHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKAN 180
           AHETQ VSPPVFS FTTEPSTAP TPPPELA LT PSSPDVP+A+FL+SSMDLK +GK +
Sbjct: 121 AHETQLVSPPVFSTFTTEPSTAPFTPPPELARLTAPSSPDVPYARFLTSSMDLKNSGKGH 180

Query: 181 YVASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPR 240
           Y   NDLQA YSLYPGSPAS+L SPISR SGD L S                 Q+GK  R
Sbjct: 181 Y---NDLQATYSLYPGSPASALRSPISRASGDGLLSP----------------QNGKCSR 240

Query: 241 SGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYASG- 300
           S SG  FG++  G     Q+SNFFCP TFA+FYLD +P  P  GGRLSVSKDSDVY +  
Sbjct: 241 SDSGNTFGYDTNGVSTPLQESNFFCPETFAKFYLDHDPSVPQNGGRLSVSKDSDVYPTNG 300

Query: 301 -GNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTS 360
            GNG QNR ++SPKQD+EE+EAYRASFGFSADEIITT+QYVEI+DVM+ SF    ++   
Sbjct: 301 YGNGNQNRQNRSPKQDMEELEAYRASFGFSADEIITTSQYVEITDVMDGSFNTSAYS--- 360

Query: 361 LSAEESIQPPLVGEKLKSTQATLQSQRSIKSASDVEKETCSEVLALCNGCKDDKLQRQPG 420
                    P  G+KL   +A L SQ S KS +D++ +         +    D  QR   
Sbjct: 361 ---------PSDGQKLLRREANLLSQTSPKSEADLDSQVVDFQSPKSSNSYKDHKQR--- 420

Query: 421 NLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLR 463
                +    + E L SR+GS K SR Y+  +S SDAEV+YRRGRSLR
Sbjct: 421 -----NRIHADEEALLSRVGSVKGSRSYH--ISSSDAEVEYRRGRSLR 421
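
The percentages in the HSP header above are the match counts divided by the alignment length. The following is a minimal sketch of that arithmetic using the counts reported for this Swiss-Prot hit. `percent` is a hypothetical helper, and rounding to two decimals is an assumption chosen to mirror how the figures are displayed here.

```python
def percent(matches, alignment_length):
    """Share of aligned columns that match, as a percentage with 2 decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * matches / alignment_length, 2)


# Counts taken from the HSP header of the Y1666_ARATH hit.
identity = percent(276, 468)
positives = percent(321, 468)
print(identity, positives)
```

Identity counts exact residue matches. Positives also counts substitutions that score above zero in the scoring matrix, so it is never lower than identity.
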

BLAST of CmoCh01G021140 vs. TrEMBL
Match: A0A0A0L1G3_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G665140 PE=4 SV=1)

HSP 1 Score: 807.0 bits (2083), Expect = 1.2e-230
Identity = 417/473 (88.16%), Positives = 437/473 (92.39%), Query Frame = 1

Query: 1   MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGI 60
           MGSEQNR PQ ERGKRWGGCWGALSCFHSQKG+KRIVPASRLPEGNVVTTQPNGP AAG+
Sbjct: 1   MGSEQNRFPQHERGKRWGGCWGALSCFHSQKGDKRIVPASRLPEGNVVTTQPNGPQAAGM 60

Query: 61  ANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHE 120
            NQATVI PSLLAPPSSPASFTNSALPST QSPSCFLSLSANSPGGPSSTM+ATGPYAH+
Sbjct: 61  TNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMYATGPYAHD 120

Query: 121 TQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVA 180
           TQ VSPPVFSAF TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS DLKGTGKANY+A
Sbjct: 121 TQLVSPPVFSAFNTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSEDLKGTGKANYIA 180

Query: 181 SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGS 240
           SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDF PQWN SASLQDGKYPRSGS
Sbjct: 181 SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGS 240

Query: 241 GRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ 300
           GRLFG+EK GT LASQDSNFFCPATFAQFYLDN  FPHTGGRLSVSKDSDVY+S GNGYQ
Sbjct: 241 GRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ 300

Query: 301 NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES 360
           NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES
Sbjct: 301 NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES 360

Query: 361 IQPPLVGEKLKSTQATLQSQRSIKSASDVEKETCSEVLALCNGCKDDKLQRQPGNLPGSS 420
            +PPL+GEKLKS+  TLQSQRSIKSA +   ETC+E+ ALCNG KD+KLQRQPG++ GSS
Sbjct: 361 TEPPLLGEKLKSSHTTLQSQRSIKSAPE---ETCTEMPALCNGYKDNKLQRQPGDISGSS 420

Query: 421 TSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD 474
           TS    +D+FSRIGSSKNSRKY+  LSCSDAEVDYRRGRSLR E KG+  WHD
Sbjct: 421 TSNQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLR-EAKGNGSWHD 469

BLAST of CmoCh01G021140 vs. TrEMBL
Match: V4TGC4_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10000992mg PE=4 SV=1)

HSP 1 Score: 668.7 bits (1724), Expect = 5.2e-189
Identity = 349/480 (72.71%), Positives = 400/480 (83.33%), Query Frame = 1

Query: 1   MGSEQNR--IPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAA 60
           MGSEQNR  +PQQER KRWGGC GA SCF SQKG KRIVPASR+PEGN    QPNGP AA
Sbjct: 1   MGSEQNRFPLPQQERRKRWGGCLGAFSCFRSQKGGKRIVPASRMPEGNAPAAQPNGPQAA 60

Query: 61  GIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYA 120
           G+ NQ T +APSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYA
Sbjct: 61  GLPNQTTTLAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYA 120

Query: 121 HETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANY 180
           HETQ VSPPVFS FTTEPSTAPLTPPPELAHLTTPSSPDVPFA+FL+SSMDL GT KANY
Sbjct: 121 HETQLVSPPVFSTFTTEPSTAPLTPPPELAHLTTPSSPDVPFARFLTSSMDLNGTDKANY 180

Query: 181 VASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRS 240
           +A+NDLQA YSLYPGSP SSL+SPISRTSG+CLSSSFPER+FPPQW+P+ S Q+GKY RS
Sbjct: 181 IAANDLQATYSLYPGSPPSSLISPISRTSGECLSSSFPEREFPPQWDPTVSPQNGKYSRS 240

Query: 241 GSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYASGGN 300
           GSGRL+ H+ TG    SQD+NFFCPATFAQFYLD + PFPHTGGRLSVSKDSDVY +G N
Sbjct: 241 GSGRLYTHDTTGGSRVSQDTNFFCPATFAQFYLDHDSPFPHTGGRLSVSKDSDVYPNGAN 300

Query: 301 GYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSA 360
           G QNRH+KSPKQDVEE+EAYRASFGFSADEIITT QYVEI+DVM+DSFTM PFTS   + 
Sbjct: 301 GNQNRHTKSPKQDVEELEAYRASFGFSADEIITTPQYVEITDVMDDSFTMMPFTSDKPAF 360

Query: 361 EESIQPPLVGEKLKSTQATLQSQRSIKSASDVEKETC-SEVLALCNGCKDDKLQRQPGNL 420
           EES+   + G+K +  ++ L + +++KS SD+       E+    +GC+D+K +RQ G++
Sbjct: 361 EESLPASMDGQKPQGRESNLLNPKNLKSDSDLMNGGIHHELTESSDGCEDNKPKRQSGDV 420

Query: 421 PGSSTSQGET----EDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWH 473
            G+ST   +     ED+FS++ +S+NSRKY+  LSCSDAE+DYRRGRSLR E KGDF WH
Sbjct: 421 SGASTPGNQVLTDEEDIFSKMRTSRNSRKYHQGLSCSDAEIDYRRGRSLR-EGKGDFSWH 479

BLAST of CmoCh01G021140 vs. TrEMBL
Match: A0A067KKH5_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_08002 PE=4 SV=1)

HSP 1 Score: 666.4 bits (1718), Expect = 2.6e-188
Identity = 350/478 (73.22%), Positives = 390/478 (81.59%), Query Frame = 1

Query: 1   MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGI 60
           MGSEQNR PQQER KRWGGC GA SCF SQKG KRIVPASR+P+GN   +QPNGP A  +
Sbjct: 1   MGSEQNRFPQQERRKRWGGCLGAFSCFGSQKGGKRIVPASRIPDGNATASQPNGPQAGVL 60

Query: 61  ANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHE 120
            NQAT +APSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHE
Sbjct: 61  TNQATQLAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHE 120

Query: 121 TQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVA 180
           TQ VSPPVFS FTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLK T K NY+A
Sbjct: 121 TQLVSPPVFSTFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKSTEKTNYIA 180

Query: 181 SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGS 240
           + DLQ  YSLYPGSPASSL+SPISRTSGDCLSSSFPERDFPPQW+PS S Q+GKY R+GS
Sbjct: 181 AGDLQTTYSLYPGSPASSLISPISRTSGDCLSSSFPERDFPPQWDPSVSPQNGKYSRNGS 240

Query: 241 GRLFGHEKTGTPLASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYASGGNGY 300
           GRLFGH+ TG  + SQD+NFFCPATFA+FYLD NPPFPHTGGRLSVSKDSDVY +GGNG+
Sbjct: 241 GRLFGHDTTGASMVSQDTNFFCPATFARFYLDHNPPFPHTGGRLSVSKDSDVYPAGGNGH 300

Query: 301 QNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEE 360
           Q+RH+++PKQDVEEIEAYRASFGFSADEIITT QYVEISDVM+DSFTM PFTS   + E 
Sbjct: 301 QSRHNRNPKQDVEEIEAYRASFGFSADEIITTQQYVEISDVMDDSFTMTPFTSNKPTIEG 360

Query: 361 SIQPPLVGEKLKSTQATLQSQRSIKSASDVEKETCSEVLALCNGCKDDKLQRQPGNLPGS 420
           S +       L  +Q    +  ++K  SD     C E    C+  +D K +RQ G++ GS
Sbjct: 361 STE----AASLSDSQKAQTNLPTLKLKSD---RVCGEAPVSCDRYEDSKSRRQTGDVSGS 420

Query: 421 ST----SQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD 474
           ST    +  + +D+FS++ SSK SRKYN   SCSDAE+DYRRGRSL GE K DF WHD
Sbjct: 421 STPGIHALTDDDDIFSKMTSSKISRKYNLGSSCSDAEIDYRRGRSL-GEGKADFAWHD 470

BLAST of CmoCh01G021140 vs. TrEMBL
Match: B9R8W7_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1602580 PE=4 SV=1)

HSP 1 Score: 658.7 bits (1698), Expect = 5.4e-186
Identity = 345/477 (72.33%), Positives = 385/477 (80.71%), Query Frame = 1

Query: 1   MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGI 60
           MGSEQNR PQQER KRWGGCWGA SCF SQKG KRIVPASR+PEGN    QPNGP   G+
Sbjct: 1   MGSEQNRFPQQERRKRWGGCWGAFSCFSSQKGGKRIVPASRIPEGNATAAQPNGPQVGGL 60

Query: 61  ANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHE 120
            NQAT +APSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHE
Sbjct: 61  TNQATTLAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHE 120

Query: 121 TQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVA 180
           TQ VSPPVFS FTTEPSTAPLTPPPELAHLTTPSSPDVPFA FLSSS+DLK T KANY+A
Sbjct: 121 TQLVSPPVFSTFTTEPSTAPLTPPPELAHLTTPSSPDVPFAHFLSSSVDLKSTEKANYIA 180

Query: 181 SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGS 240
           +NDLQA YSLYPGSPASSL+SPISRTSGDCLSSSFP R+FPP W+P+ S Q+GKY RS S
Sbjct: 181 ANDLQATYSLYPGSPASSLISPISRTSGDCLSSSFPGREFPPHWDPTVSPQNGKYSRSNS 240

Query: 241 GRLFGHEKTGTPLASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYASGGNGY 300
           GRLF H+ TG  + SQD+NFFCPATFA+FYLD NPPFPH GGRLSVSKDSD Y +GGNG+
Sbjct: 241 GRLFVHDTTGGSMVSQDTNFFCPATFARFYLDHNPPFPHNGGRLSVSKDSDAYPAGGNGH 300

Query: 301 QNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEE 360
           QNR S+SPKQD EE+EAYRASFGFSADEIITT QYVEISDVM+DSFTM PF S   + EE
Sbjct: 301 QNRSSRSPKQDAEELEAYRASFGFSADEIITTQQYVEISDVMDDSFTMTPFASNKSTVEE 360

Query: 361 SIQPPLVGEKLKSTQATLQSQRSIKSASDVEKETCSEVLALCNGCKDDKLQRQPGNLPGS 420
           +++   + E  K+ Q    +  SIK   D+    C EV   C+  +D K +RQ G++ GS
Sbjct: 361 TVEAASISESEKA-QRIQPNLPSIKLKLDL---ACGEVPVSCDRYEDPKSRRQAGDVSGS 420

Query: 421 STS----QGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWH 473
           ST       +  D+F ++ SS+ SRKY+   SCSDAE+DYRRGRSLR E KGDF WH
Sbjct: 421 STPGIHVLADDSDIFPKMTSSRISRKYHLGSSCSDAEIDYRRGRSLR-EGKGDFAWH 472

BLAST of CmoCh01G021140 vs. TrEMBL
Match: F6I6Y3_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_18s0122g00140 PE=4 SV=1)

HSP 1 Score: 648.7 bits (1672), Expect = 5.6e-183
Identity = 353/489 (72.19%), Positives = 395/489 (80.78%), Query Frame = 1

Query: 1   MGSEQNRIPQQERG------KRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNG 60
           MGSEQNR PQQER       KRWGGCWG LSCF +QKG KRIVPASR+PEGN   TQPNG
Sbjct: 1   MGSEQNRFPQQERERERERQKRWGGCWGGLSCFGTQKGGKRIVPASRIPEGNASATQPNG 60

Query: 61  PHAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFAT 120
           P A G+ NQ T +APSLLAPPSSPASFTNSALPSTAQSPSCFLS+SANSP GPSSTMFAT
Sbjct: 61  PQAVGLTNQTTALAPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPEGPSSTMFAT 120

Query: 121 GPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTG 180
           GPYAHETQ VSPPVFS FTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLK  G
Sbjct: 121 GPYAHETQLVSPPVFSTFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKSAG 180

Query: 181 KANYVASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGK 240
           K NY+A+NDLQA YSLYPGSPASSL+SPISRTSGDCLSSSFPER+FPP+W+PS S Q+ K
Sbjct: 181 KTNYIAANDLQATYSLYPGSPASSLISPISRTSGDCLSSSFPEREFPPRWDPSISPQNAK 240

Query: 241 YPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDN-----PPFPHTGGRLSVSKDS 300
           YPR+GSGRLFG + T +   SQDSNFFCPATFAQFYLD+     PPFP +GGRLS+S++S
Sbjct: 241 YPRNGSGRLFGLD-TASSSISQDSNFFCPATFAQFYLDHTQQSYPPFP-SGGRLSLSRES 300

Query: 301 DVYASGGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRP 360
           DVY+SGGNG+QNRH+K+ KQDVEEIEAYRASFGFSADEIITTTQYVEISDV+EDSFTM P
Sbjct: 301 DVYSSGGNGHQNRHNKNCKQDVEEIEAYRASFGFSADEIITTTQYVEISDVLEDSFTMTP 360

Query: 361 FTSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASD-VEKETCSEVLALCNGCKDDK 420
           FTS     EE++ P +V E  K  Q  L ++ S+KS S  V++  C E L  C   +D K
Sbjct: 361 FTSNKPDMEENVVPAVVHEGPKD-QTNLLNEESLKSESGLVDEGGCCEGLPSCKTFEDHK 420

Query: 421 LQRQPGNLPGSSTS----QGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGE 474
            +RQ GN  GSST       + E++F + G+SK  RKY+  LS SDAE+DYRRGRSLR E
Sbjct: 421 SERQSGNESGSSTPGKHILTDEEEIFPK-GASKIGRKYHLGLSSSDAEIDYRRGRSLR-E 480

BLAST of CmoCh01G021140 vs. TAIR10
Match: AT1G76660.1 (AT1G76660.1 FUNCTIONS IN: molecular_function unknown)

HSP 1 Score: 461.1 bits (1185), Expect = 8.4e-130
Identity = 276/468 (58.97%), Positives = 321/468 (68.59%), Query Frame = 1

Query: 1   MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEG-NVVTTQPNGPHAAG 60
           MGSEQ      ++ KRWGGC G  SCF SQKG KRIVPASR+PEG NV  +QPNG H AG
Sbjct: 1   MGSEQ------DQRKRWGGCLGVFSCFKSQKGGKRIVPASRIPEGGNVSASQPNGAHQAG 60

Query: 61  IANQATV--IAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPY 120
           + N      I  SLLAPPSSPASFTNSALPST QSP+C+LSL+ANSPGGPSS+M+ATGPY
Sbjct: 61  VLNNQAAGGINLSLLAPPSSPASFTNSALPSTTQSPNCYLSLAANSPGGPSSSMYATGPY 120

Query: 121 AHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKAN 180
           AHETQ VSPPVFS FTTEPSTAP TPPPELA LT PSSPDVP+A+FL+SSMDLK +GK +
Sbjct: 121 AHETQLVSPPVFSTFTTEPSTAPFTPPPELARLTAPSSPDVPYARFLTSSMDLKNSGKGH 180

Query: 181 YVASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPR 240
           Y   NDLQA YSLYPGSPAS+L SPISR SGD L S                 Q+GK  R
Sbjct: 181 Y---NDLQATYSLYPGSPASALRSPISRASGDGLLSP----------------QNGKCSR 240

Query: 241 SGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYASG- 300
           S SG  FG++  G     Q+SNFFCP TFA+FYLD +P  P  GGRLSVSKDSDVY +  
Sbjct: 241 SDSGNTFGYDTNGVSTPLQESNFFCPETFAKFYLDHDPSVPQNGGRLSVSKDSDVYPTNG 300

Query: 301 -GNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTS 360
            GNG QNR ++SPKQD+EE+EAYRASFGFSADEIITT+QYVEI+DVM+ SF    ++   
Sbjct: 301 YGNGNQNRQNRSPKQDMEELEAYRASFGFSADEIITTSQYVEITDVMDGSFNTSAYS--- 360

Query: 361 LSAEESIQPPLVGEKLKSTQATLQSQRSIKSASDVEKETCSEVLALCNGCKDDKLQRQPG 420
                    P  G+KL   +A L SQ S KS +D++ +         +    D  QR   
Sbjct: 361 ---------PSDGQKLLRREANLLSQTSPKSEADLDSQVVDFQSPKSSNSYKDHKQR--- 420

Query: 421 NLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLR 463
                +    + E L SR+GS K SR Y+  +S SDAEV+YRRGRSLR
Sbjct: 421 -----NRIHADEEALLSRVGSVKGSRSYH--ISSSDAEVEYRRGRSLR 421

BLAST of CmoCh01G021140 vs. TAIR10
Match: AT5G52430.1 (AT5G52430.1 hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 143.7 bits (361), Expect = 3.0e-34
Identity = 111/260 (42.69%), Positives = 141/260 (54.23%), Query Frame = 1

Query: 9   PQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPE----GNVVTTQPNGPHAAGIANQA 68
           P   +  RWG CW   SCF +QK  KRI  A  +PE    G  V T  N       A   
Sbjct: 28  PSSSQKGRWGKCWSLYSCFGTQKNNKRIGNAVLVPEPVTSGVPVVTVQNS------ATST 87

Query: 69  TVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSAN--SPGGPSSTMFATGPYAHETQ 128
           TV+ P  +APPSSPASF  S   S + SP   LSL++N  SP  P S +F  GPYA+ETQ
Sbjct: 88  TVVLP-FIAPPSSPASFLQSDPSSVSHSPVGPLSLTSNTFSPKEPQS-VFTVGPYANETQ 147

Query: 129 PVSPPVFSAFTTEPSTAPLTPPPELA-HLTTPSSPDVPFAQFLSSSMDLKGTGKANYVAS 188
           PV+PPVFSAF TEPSTAP TPPPE + H+TTPSSP+VPFAQ L+SS++L      + +  
Sbjct: 148 PVTPPVFSAFITEPSTAPYTPPPESSVHITTPSSPEVPFAQLLTSSLELTRRDSTSGMNQ 207

Query: 189 NDLQAAY-----SLYPGSP-ASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKY 248
               + Y      + PGSP   +L+SP S  S    SS +P +      +P    + G+ 
Sbjct: 208 KFSSSHYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSPYPGK------SPMVEFRIGEP 267

Query: 249 PRSGSGRLFGHEKTGTPLAS 256
           P+      F   K G+   S
Sbjct: 268 PKFLGFEHFTARKWGSRFGS 273

BLAST of CmoCh01G021140 vs. TAIR10
Match: AT4G25620.1 (AT4G25620.1 hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 127.5 bits (319), Expect = 2.2e-29
Identity = 94/212 (44.34%), Positives = 121/212 (57.08%), Query Frame = 1

Query: 3   SEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIAN 62
           S ++R       K+ G  W    CF S+K  KRI  A  +PE    +     P     +N
Sbjct: 21  SAESRTQPSSVQKKRGSWWSLYWCFGSKKNNKRIGHAVLVPEP-AASGAAVAPVQNSSSN 80

Query: 63  QATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFL--SLSANSPGGPSSTMFATGPYAHE 122
             ++  P  +APPSSPASF  S  PS + +P   L  SL+ N P  PS+  F  GPYAHE
Sbjct: 81  STSIFMP-FIAPPSSPASFLPSGPPSASHTPDPGLLCSLTVNEP--PSA--FTIGPYAHE 140

Query: 123 TQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLK-----GTGK 182
           TQPV+PPVFSAFTTEPSTAP TPPPE     +PSSP+VPFAQ L+SS++       G   
Sbjct: 141 TQPVTPPVFSAFTTEPSTAPFTPPPE-----SPSSPEVPFAQLLTSSLERARRNSGGGMN 200

Query: 183 ANYVASNDLQAAYSLYPGSPASSLVSPISRTS 208
             + A++    +  +YPGSP  +L+SP S TS
Sbjct: 201 QKFSAAHYEFKSCQVYPGSPGGNLISPGSGTS 221

BLAST of CmoCh01G021140 vs. TAIR10
Match: AT1G63720.1 (AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1))

HSP 1 Score: 120.9 bits (302), Expect = 2.1e-27
Identity = 89/224 (39.73%), Positives = 124/224 (55.36%), Query Frame = 1

Query: 1   MGSEQNRIPQQE---RGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHA 60
           + S  +R+ Q     + ++W   W  L CF S +  KRI  +  +PE   +++  +    
Sbjct: 21  IASSDDRLHQSSPIHKKRKWWNRWSLLKCFGSSRQRKRIGNSVLVPEPVSMSSSNSTTSN 80

Query: 61  AGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPY 120
           +G  +  T +    +APPSSPASF  S  PS  QSP   LS S   P     ++FA GPY
Sbjct: 81  SGYRSVITTLP--FIAPPSSPASFFQSEPPSATQSPVGILSFSP-LPCNNRPSIFAIGPY 140

Query: 121 AHETQPVSPPVFSAFTTEPSTAPLTPPPELAHL----TTPSSPDVPFAQFLSSSMDLKGT 180
           AHETQ VSPPVFS +TTEPS+AP+TPP + + +    TTPSSP+VPFAQ  +S+      
Sbjct: 141 AHETQLVSPPVFSTYTTEPSSAPITPPLDDSSIYLTTTTPSSPEVPFAQLFNSNHQTGSY 200

Query: 181 GKANYVASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPE 218
           G    ++S+     Y L PGSP   L+SP   + G   +S FP+
Sbjct: 201 GYKFPMSSSYEFQFYQLPPGSPLGQLISP---SPGSGPTSPFPD 238

BLAST of CmoCh01G021140 vs. NCBI nr
Match: gi|659105232|ref|XP_008453041.1| (PREDICTED: uncharacterized protein At1g76660 [Cucumis melo])

HSP 1 Score: 820.1 bits (2117), Expect = 2.0e-234
Identity = 424/474 (89.45%), Positives = 443/474 (93.46%), Query Frame = 1

Query: 1   MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGI 60
           MGSEQNR PQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGP AAG+
Sbjct: 1   MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGM 60

Query: 61  ANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHE 120
            NQATVI PSLLAPPSSPASFTNSALPST QSPSCFLSLSANSPGGPSST++ATGPYAHE
Sbjct: 61  TNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTIYATGPYAHE 120

Query: 121 TQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVA 180
           TQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS DLKGTGKANY+A
Sbjct: 121 TQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDLKGTGKANYIA 180

Query: 181 SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGS 240
           SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDF PQWN SASLQDGKYPRSGS
Sbjct: 181 SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGS 240

Query: 241 GRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ 300
           GRLFG+EK GT LASQDSNFFCPATFAQFYLDN  FPHTGGRLSVSKDSDVY+S GNGYQ
Sbjct: 241 GRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ 300

Query: 301 NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES 360
           NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES
Sbjct: 301 NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES 360

Query: 361 IQPPLVGEKLKSTQATLQSQRSIKSASD-VEKETCSEVLALCNGCKDDKLQRQPGNLPGS 420
            +PPL+GEKLKS+  TLQ+QRSIKSA + VEKETC+EV ALCNG KD+KLQRQPG++ GS
Sbjct: 361 TEPPLLGEKLKSSHTTLQNQRSIKSAPEVVEKETCTEVPALCNGYKDNKLQRQPGDILGS 420

Query: 421 STSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD 474
           STS    +D+FSRIGSSKNSRKY+  LSCSDAEVDYRRGRSLR E KG+  WHD
Sbjct: 421 STSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLR-EAKGNGSWHD 473

BLAST of CmoCh01G021140 vs. NCBI nr
Match: gi|449466510|ref|XP_004150969.1| (PREDICTED: uncharacterized protein At1g76660 [Cucumis sativus])

HSP 1 Score: 807.0 bits (2083), Expect = 1.8e-230
Identity = 417/473 (88.16%), Positives = 437/473 (92.39%), Query Frame = 1

Query: 1   MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGI 60
           MGSEQNR PQ ERGKRWGGCWGALSCFHSQKG+KRIVPASRLPEGNVVTTQPNGP AAG+
Sbjct: 1   MGSEQNRFPQHERGKRWGGCWGALSCFHSQKGDKRIVPASRLPEGNVVTTQPNGPQAAGM 60

Query: 61  ANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHE 120
            NQATVI PSLLAPPSSPASFTNSALPST QSPSCFLSLSANSPGGPSSTM+ATGPYAH+
Sbjct: 61  TNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMYATGPYAHD 120

Query: 121 TQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVA 180
           TQ VSPPVFSAF TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS DLKGTGKANY+A
Sbjct: 121 TQLVSPPVFSAFNTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSEDLKGTGKANYIA 180

Query: 181 SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGS 240
           SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDF PQWN SASLQDGKYPRSGS
Sbjct: 181 SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGS 240

Query: 241 GRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ 300
           GRLFG+EK GT LASQDSNFFCPATFAQFYLDN  FPHTGGRLSVSKDSDVY+S GNGYQ
Sbjct: 241 GRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ 300

Query: 301 NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES 360
           NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES
Sbjct: 301 NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES 360

Query: 361 IQPPLVGEKLKSTQATLQSQRSIKSASDVEKETCSEVLALCNGCKDDKLQRQPGNLPGSS 420
            +PPL+GEKLKS+  TLQSQRSIKSA +   ETC+E+ ALCNG KD+KLQRQPG++ GSS
Sbjct: 361 TEPPLLGEKLKSSHTTLQSQRSIKSAPE---ETCTEMPALCNGYKDNKLQRQPGDISGSS 420

Query: 421 TSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD 474
           TS    +D+FSRIGSSKNSRKY+  LSCSDAEVDYRRGRSLR E KG+  WHD
Sbjct: 421 TSNQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLR-EAKGNGSWHD 469

BLAST of CmoCh01G021140 vs. NCBI nr
Match: gi|645255247|ref|XP_008233411.1| (PREDICTED: uncharacterized protein At1g76660 isoform X1 [Prunus mume])

HSP 1 Score: 682.6 bits (1760), Expect = 5.0e-193
Identity = 361/481 (75.05%), Positives = 398/481 (82.74%), Query Frame = 1

Query: 1   MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGI 60
           MGSEQNR PQQER KRWGGCWGA SCF S KG KRIVPASR+PEGN   TQPNGP A G+
Sbjct: 1   MGSEQNRFPQQERRKRWGGCWGAFSCFDSHKGGKRIVPASRIPEGNASATQPNGPQAVGL 60

Query: 61  ANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHE 120
            NQAT +APSLLAPPSSPASFTNSALPSTAQSPSC L LSANSPGGPSSTM+ATGPYA+E
Sbjct: 61  TNQATSLAPSLLAPPSSPASFTNSALPSTAQSPSCSLLLSANSPGGPSSTMYATGPYANE 120

Query: 121 TQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVA 180
           TQ VSPPVFS FTTEPSTAPLTPPPELAHLTTPSSPDVPFA+FLSSS+D+K T K NY+A
Sbjct: 121 TQLVSPPVFSTFTTEPSTAPLTPPPELAHLTTPSSPDVPFARFLSSSVDIKTTDKTNYIA 180

Query: 181 SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGS 240
           +NDLQA YSLYPGSPASSL SPISR S DC SSSFPERDFP QW+PS S Q+G YPRSGS
Sbjct: 181 ANDLQATYSLYPGSPASSLRSPISRASNDC-SSSFPERDFPRQWDPSVSPQNGTYPRSGS 240

Query: 241 GRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ 300
            RLFG++ TG   ASQDSNFFCPATFAQFYLDNPPFPH GGRLSVSKDSDVY++GGNG Q
Sbjct: 241 ARLFGYDTTGASAASQDSNFFCPATFAQFYLDNPPFPHAGGRLSVSKDSDVYSTGGNGSQ 300

Query: 301 NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES 360
           NRH++SPKQDVEE+EAYRASFGFSADEIITTTQYVEISDVM+DSFTM PFTS  L  EE 
Sbjct: 301 NRHNRSPKQDVEELEAYRASFGFSADEIITTTQYVEISDVMDDSFTMTPFTSHKLPTEEH 360

Query: 361 IQPPLVGEKLKS--TQATLQSQRSIKSASDVEKETCSEVLALCNGCKDDKLQRQPGNLPG 420
           I+P  V E LK+  T+  LQSQ + KS SD+++   S++   CNG +D K  RQPG++  
Sbjct: 361 IEPISVTEGLKAQKTKTILQSQDTTKSESDLDEGGSSDLPISCNGYEDHKSWRQPGDVSR 420

Query: 421 SSTS------QGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWH 474
           SST         + ED+FS+IGSSK SRKY   LS SDAE+DYRRGRSLR E KG+F WH
Sbjct: 421 SSTPGPGIRVLADEEDIFSKIGSSKLSRKYQLGLSSSDAEIDYRRGRSLR-ERKGEFAWH 479

BLAST of CmoCh01G021140 vs. NCBI nr
Match: gi|1009126185|ref|XP_015880014.1| (PREDICTED: uncharacterized protein At1g76660 [Ziziphus jujuba])

HSP 1 Score: 676.8 bits (1745), Expect = 2.8e-191
Identity = 350/481 (72.77%), Positives = 397/481 (82.54%), Query Frame = 1

Query: 1   MGSEQNRIPQQERG--KRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAA 60
           MGSEQNR PQQER   KRWGGCW ALSCF +QKG KRIVPASR+PEGN    QPNGP A 
Sbjct: 1   MGSEQNRFPQQERERRKRWGGCWSALSCFGTQKGGKRIVPASRIPEGNASAAQPNGPQAV 60

Query: 61  GIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYA 120
           G+ NQ T +APSLLAPPSSPASFTNSALPST QSPSCFLSLSANSPGGPSSTMFATGPYA
Sbjct: 61  GLTNQGTALAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMFATGPYA 120

Query: 121 HETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANY 180
           HETQ VSPPVFS FTTEPSTAPLTPPPELAHLTTPSSPDVPFA FLSSS+DLK T K+NY
Sbjct: 121 HETQLVSPPVFSTFTTEPSTAPLTPPPELAHLTTPSSPDVPFAHFLSSSVDLKSTDKSNY 180

Query: 181 VASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRS 240
           +A NDL + YSLYPGSP SS++SPISRTS +C SSSFPER+FP QW+ S S ++GKYPR+
Sbjct: 181 IAVNDLHSTYSLYPGSPPSSIISPISRTSNECSSSSFPEREFPTQWDSSVSPKNGKYPRN 240

Query: 241 GSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNG 300
            SGRLF H+ TG P+ SQDSNFFCPATFAQFY+DNPPFPH GGRLSVSKDSD Y++GGNG
Sbjct: 241 DSGRLFEHDATGGPMTSQDSNFFCPATFAQFYVDNPPFPHAGGRLSVSKDSDAYSTGGNG 300

Query: 301 YQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAE 360
           +QNRHSKSPKQDVEEIEAYRASFGFSADEIITT+QYVEISDVMEDSFTM PFTS  L  +
Sbjct: 301 HQNRHSKSPKQDVEEIEAYRASFGFSADEIITTSQYVEISDVMEDSFTMTPFTSNKLPMD 360

Query: 361 ESIQPPLV-GEKLKSTQATLQSQRSIKSASD-VEKETCSEVLALCNGCKDDKLQRQPGNL 420
           ESI+P  + G K   TQ + QSQ+S++S  D ++   C EV AL NG +D K  + PG++
Sbjct: 361 ESIEPASISGLKAIKTQTSAQSQKSLESELDLIDGGRCCEVPALSNGFEDHKSWKPPGDI 420

Query: 421 PGSSTSQG----ETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWH 474
            GSST       + +D+FS++GSS+ S+KY   LSCSDAE+DYR GRS++ E KGDF WH
Sbjct: 421 SGSSTPGNRILTDEDDIFSKVGSSRMSKKYQLGLSCSDAEIDYRSGRSVK-EGKGDFKWH 480

BLAST of CmoCh01G021140 vs. NCBI nr
Match: gi|567885895|ref|XP_006435506.1| (hypothetical protein CICLE_v10000992mg [Citrus clementina])

HSP 1 Score: 668.7 bits (1724), Expect = 7.5e-189
Identity = 349/480 (72.71%), Positives = 400/480 (83.33%), Query Frame = 1

Query: 1   MGSEQNR--IPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAA 60
           MGSEQNR  +PQQER KRWGGC GA SCF SQKG KRIVPASR+PEGN    QPNGP AA
Sbjct: 1   MGSEQNRFPLPQQERRKRWGGCLGAFSCFRSQKGGKRIVPASRMPEGNAPAAQPNGPQAA 60

Query: 61  GIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYA 120
           G+ NQ T +APSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYA
Sbjct: 61  GLPNQTTTLAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYA 120

Query: 121 HETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANY 180
           HETQ VSPPVFS FTTEPSTAPLTPPPELAHLTTPSSPDVPFA+FL+SSMDL GT KANY
Sbjct: 121 HETQLVSPPVFSTFTTEPSTAPLTPPPELAHLTTPSSPDVPFARFLTSSMDLNGTDKANY 180

Query: 181 VASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRS 240
           +A+NDLQA YSLYPGSP SSL+SPISRTSG+CLSSSFPER+FPPQW+P+ S Q+GKY RS
Sbjct: 181 IAANDLQATYSLYPGSPPSSLISPISRTSGECLSSSFPEREFPPQWDPTVSPQNGKYSRS 240

Query: 241 GSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYASGGN 300
           GSGRL+ H+ TG    SQD+NFFCPATFAQFYLD + PFPHTGGRLSVSKDSDVY +G N
Sbjct: 241 GSGRLYTHDTTGGSRVSQDTNFFCPATFAQFYLDHDSPFPHTGGRLSVSKDSDVYPNGAN 300

Query: 301 GYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSA 360
           G QNRH+KSPKQDVEE+EAYRASFGFSADEIITT QYVEI+DVM+DSFTM PFTS   + 
Sbjct: 301 GNQNRHTKSPKQDVEELEAYRASFGFSADEIITTPQYVEITDVMDDSFTMMPFTSDKPAF 360

Query: 361 EESIQPPLVGEKLKSTQATLQSQRSIKSASDVEKETC-SEVLALCNGCKDDKLQRQPGNL 420
           EES+   + G+K +  ++ L + +++KS SD+       E+    +GC+D+K +RQ G++
Sbjct: 361 EESLPASMDGQKPQGRESNLLNPKNLKSDSDLMNGGIHHELTESSDGCEDNKPKRQSGDV 420

Query: 421 PGSSTSQGET----EDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWH 473
            G+ST   +     ED+FS++ +S+NSRKY+  LSCSDAE+DYRRGRSLR E KGDF WH
Sbjct: 421 SGASTPGNQVLTDEEDIFSKMRTSRNSRKYHQGLSCSDAEIDYRRGRSLR-EGKGDFSWH 479
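
The Identity and Positives percentages in the HSP headers above are plain ratios over the alignment length (for example, 111/260 = 42.69%). The following is a minimal Python sketch for extracting those statistics from a header, assuming the two-line header layout shown on this page. The function name `parse_hsp_stats` and the returned field names are hypothetical and are not part of any BLAST tool or of this database.

```python
import re

# Pattern for the two-line HSP header layout used on this page (an assumption;
# other BLAST report formats differ). "Pos\w*" tolerates spelling variants of
# the "Positives" label.
HSP_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)\s*"
    r"Identity\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s*\([\d.]+%\),\s*"
    r"Pos\w*\s*=\s*(?P<pos>\d+)/\d+"
)


def parse_hsp_stats(text):
    """Return score, E-value and recomputed percentages for one HSP header."""
    m = HSP_RE.search(text)
    if m is None:
        raise ValueError("no HSP header found")
    length = int(m["length"])
    return {
        "bits": float(m["bits"]),
        "raw_score": int(m["raw"]),
        "evalue": float(m["evalue"]),
        "alignment_length": length,
        # Percentages are recomputed from the counts, not read from the text.
        "identity_pct": round(100 * int(m["ident"]) / length, 2),
        "positives_pct": round(100 * int(m["pos"]) / length, 2),
    }


# Header values copied from the AT5G52430.1 hit above.
header = (
    "HSP 1 Score: 143.7 bits (361), Expect = 3.0e-34\n"
    "Identity = 111/260 (42.69%), Positives = 141/260 (54.23%), Query Frame = 1"
)
stats = parse_hsp_stats(header)
print(stats)
```

Recomputing the percentages from the raw counts, rather than trusting the printed values, gives a quick consistency check on each header.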

The following BLAST results are available for this feature:
Match Name   E-value   Identity (%)  Description
Y1666_ARATH  1.5e-128  58.97         Uncharacterized protein At1g76660 OS=Arabidopsis thaliana GN=At1g76660 PE=2 SV=1

Match Name        E-value   Identity (%)  Description
A0A0A0L1G3_CUCSA  1.2e-230  88.16         Uncharacterized protein OS=Cucumis sativus GN=Csa_4G665140 PE=4 SV=1
V4TGC4_9ROSI      5.2e-189  72.71         Uncharacterized protein OS=Citrus clementina GN=CICLE_v10000992mg PE=4 SV=1
A0A067KKH5_JATCU  2.6e-188  73.22         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_08002 PE=4 SV=1
B9R8W7_RICCO      5.4e-186  72.33         Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1602580 PE=4 SV=1
F6I6Y3_VITVI      5.6e-183  72.19         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_18s0122g00140 PE=4 SV=...

Match Name   E-value   Identity (%)  Description
AT1G76660.1  8.4e-130  58.97         FUNCTIONS IN: molecular_function unknown
AT5G52430.1  3.0e-34   42.69         hydroxyproline-rich glycoprotein family protein
AT4G25620.1  2.2e-29   44.34         hydroxyproline-rich glycoprotein family protein
AT1G63720.1  2.1e-27   39.73         BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glyc...

Match Name                         E-value   Identity (%)  Description
gi|659105232|ref|XP_008453041.1|   2.0e-234  89.45         PREDICTED: uncharacterized protein At1g76660 [Cucumis melo]
gi|449466510|ref|XP_004150969.1|   1.8e-230  88.16         PREDICTED: uncharacterized protein At1g76660 [Cucumis sativus]
gi|645255247|ref|XP_008233411.1|   5.0e-193  75.05         PREDICTED: uncharacterized protein At1g76660 isoform X1 [Prunus mume]
gi|1009126185|ref|XP_015880014.1|  2.8e-191  72.77         PREDICTED: uncharacterized protein At1g76660 [Ziziphus jujuba]
gi|567885895|ref|XP_006435506.1|   7.5e-189  72.71         hypothetical protein CICLE_v10000992mg [Citrus clementina]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005886      plasma membrane
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh01G021140.1  CmoCh01G021140.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term  IPR Description   Source   Source Term    Source Description   Alignment
None      No IPR available  PANTHER  PTHR31798      FAMILY NOT NAMED     coord: 1..473, score: 6.4E
None      No IPR available  PANTHER  PTHR31798:SF3  SUBFAMILY NOT NAMED  coord: 1..473, score: 6.4E

The following gene(s) are paralogous to this gene:
Gene            Paralogue       Organism                  Block
CmoCh01G021140  CmoCh09G000010  Cucurbita moschata (Rifu)  cmocmoB017
The following block(s) are covering this gene:

None