CmaCh01G020580 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh01G020580
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein.
Location: Cma_Chr01: 13066918 .. 13078429 (+)
RNA-Seq Expression: CmaCh01G020580
Synteny: CmaCh01G020580
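
The Location field in the Overview can be read programmatically. The sketch below is illustrative only: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

# Location string copied from the Overview block.
LOCATION = "Cma_Chr01: 13066918 .. 13078429 (+)"

def parse_location(text):
    """Split a 'seqid: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based, inclusive coordinates, hence the +1.
        "length": end - start + 1,
    }

loc = parse_location(LOCATION)
print(loc)
```

Under that assumption the gene spans 11512 bp on the plus strand of Cma_Chr01.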
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGGTCCGAGCAGAATAGAATCCCTCAACAAGAACGGGTATGCATCCCCTCCCCACAAATCATTATGCTTGTTTATTGCTCGAGAATATTGTTGTTTTTGCACATTGTTTCTTATACCAGTAGATTTAGGGTGCTCCAAGCACCATTTTGCTTGTTTAGGATTTGGATTTACTGCGTAAGCAATGACATTCTTTTTCACGGCCTTTTCTTTCCTAGATCCAGGGTTCACCCCTCTATTTCCTAGCAATGTACCAAATCCATGTTTCTGCCAAATTATTTTATAATGTGGATTCCATGTACTTGATAGTTTTTTATATAGATCTTCGTTAGTTTTACAGCTTTGGCTAATGGGCATCAACTTCTTGTTGTGATTTGGTTTGTGGTCATTCCACTTTATGTTTATTATTCCTTTGTTTTAGTTTCTTTTCGGGTAAGGACCCACTTTTCGAGGGACTCGAGCCACTGACCGTTACTAACATGGAAAATAAGTAAAAAATGCGTAAAATCTAAAACATTTGAACACGTGCATGCACTTTGAAATTTGATTAAAATAATAATAAAATTTTTTGTTTTTATAAAGAAACAAGAAAAGACCACTAAAATTACTACACTACAAGATCGAACTATTTAAAATACATGAAATACAAGATTTAAAGGTAAGTATAAAGGAACTTCAGACTCAAGTGTGATGAGACTCCCTATAGCCAACAACCCATGTGTGCCTCTACCTCGAACTACGTTAAGCTCCGCCTGAAAAGAAAGTTGAAGTGGTAGGGATGAGTATATAAAATATTCTTAGTAAGCAGCTTATTGGTAGTTCATATTTCGTTATTAAGTCTTATTGATCTTGTTCATACATTCGGTATCTTATTCTCTTTTGGGGATTCGTTCCCCCGTATAAGCTTTTCCTAGGCTACACTTTAGGTTTACTCATTCTACTTTCTCTTGGTTCTTGGTCCTATCTTGACCTCCAGCCCTTGGAATGTTCTTAGGTCTTTTAGGTCTCCTAATTAGTCGTTTCTGCTAGTCTCTCATCATCATTGTCATATCTCAGAGTCCTTGAGGCGCGACCTCGGTATAAATCTCGTAGGGCTCATGTAAATTAAGGCTCGACCAACTCTAAGGTGCAACTATGGACCCCTATTGCTCATGATAGGTTCACCAAAAGGAAGCTAACGGACAAACTTCCTAATAGTCTCCATAGGGCACAAATCATAAAACATCATATGCCAATTCTATGGTTATCTGAACTCCTGACAATTGAGTCAGATTTGTGGTGAGGGGCTAGTTGGCGAGCCCCTACCGTCCCTGTGTAATTGCATAAATCACATGCATGAACAATAATTATAACCAATTTACAAGGTACAACTGTGGACCTCTGCTGCTTATGGTCGGCTCACTGCAGGAAGCTAACTGGCCAACTTCACAGTAGCCCCCACATGGTACAATCATAAATCATAAAATATCATAAGCAGTTCTATGGTTATCTGAACCCCCGACAATCGAGTTAGGTTCACAACTAGGGGTTTGTTGGCGTACCCCTAATTGTCCTCATATAACTGCATACATCAGAAACATATGAAATAGAGACATAGCATAACAATCTAAACATTGATACCTGTTCATGAATTTCAGTTCATAAGATAGTGCATACGCGTTTTAGTAACATACATAATAACATTCCACATATGTTATCTTAGCTCATTTTATGCACAACCATTTATAAGCGGCTTCCTATAGCAGGCTTCATATCATACTAAAGGCATACTCTCAATTTCGGTTAAATCATAACTCAACAGTTCATTCCTCGATAAATGTGCACCATAACGTTTAACGTGCTCTTACATAGATCAAACCAACTCATTTCATGCTCAAATTTATATGACATGCTTCTCATATCATACTTTCATGCATGCTATGAGCATAGCTAAATCTTAGGTTCTAAGGTGCCACTTACCTGGAAAGCATGGCACTTTTTGTTTGGGTTTGCGTCTTTAG
GGGATTCCAAAAATTGTCAAACTCCTCAAATAACTCTTACATTGATATAAATTGGTGAAAATGGTGCCGAAGTCAGAGTTAAAACGAGTAAATTATGCGAAAAGAGGCAAAATCGGGTTGTACCGCCTGTGTTTGGATGCTCGTGAGTTAGACCGCTCGAGTTGTCCCGTCCACGAGTCGGCCGCTCGGGTTGGGCTATCTGCGGGTCGGGCCTGTGGGTTGGACCCATCCTTTTCCTCCAAATTTTTTTTTCTTTCTTTCTCTTTCTTTATCTTTTTCGTTTCTTTTCTCTTTTTTTTTCTTTTCTCTTTTTTTTTCTTTTCTCTTTCCTCCTCTCCTTCTCCTCCCTCTCCCTCTCTCCAGAACCTTCTTCCTCCATTTCTTCTCTTTTGCTTCTTTCTTCTTCCCTTTTTTTCTTCTTCTTCTTTCATAATCATTTTTTTTCCCTCTTATTCTTTTTCTTTCCCTCTTAATTCTTTTTCTTTCCCTCATCCCTGCGCTTGCCAGATTTCCTCCTTCCTTTCTCGCACTCACCCTTCGTTGTACAAGTCAAAACTCCAACTGTTTTCTTCAACAAAGTCTAGATTGTCCTCTCCCGAACCTATAGCAATTGAAATCGGGTCAACTCATGGAGGGTAGGGGACAAATGTAACTATAGAAGCTCTCAAAACTCATGAATGTACCTTGGCGTTCTCCCTTCGTTCGACTTTTCGTTTTGGCTTGCTTCCATCCGAAATTGCCGACGTATGCCCGTGGATCTAGAGCTTAAGTTCTCCCAACAAGCGTGGGTTTTGGTCTTCAACCTCCGTTGCCTTTTCCCTTTAACAAATCCGAGTCCTTCCTTCTGCAGCAATCCACCCTTGCCTCCCTCTGTTCTCGCTCAAAGAGGGATGGGACGGAGGTCTTACTCTTTCATATATGCACACACTTTGAACTTTGAACTTACTCTTTCGGTTAATCATACTTTGAACTTACTCTTTCATATATGCATACACTTGTAATGTACGTAAATCCTTCAAGGTGAGATGCATTTGACTGATTCGAGAATCAAGTTCAGTCGTTGGTCCTTTTGCTATTAGCAAATATTTAATTTTGCATTAATTACCTTATAAAGTAATTAATTCTTCCTATATATGAATTCGACAAGCACGTACATTTAAGTGTTATTTTAGCATCACAATTGTTTTTTTTTAATATATAAACCAAGCTTTTATTAAGAAAACATGAAATAATATGAGGGGTAACCCCATGAGTTGAGATAAAGGACAGGGCTCAAATCAAGCTAGATGAACCCTGCGGATAATTACTAAAGTTCTTATGCATTGATGCCTAGAGGGAGACGTGAAATATGAAAAGAGACCAAACTTGCTCCCTAGTTTGACCCTTCCCCTCTAAACATAGGATTGTTTATCTCTTCTCAAACACCTCACAAGGGTTAAGAATTCAAGTACGAAGATCCCTCCTCTTGGGACAAGCAAATGAATCCCCTCAGTAAAGATAAAAGGGTCAAAATATCCACGTTAGAAAGCAGATCGTTAAGGTGACCCCTAACCTAAGCAGGAGACTAACACAGATCCCTCATGAAGAAACAAGTCATTTCTCATCCCCGCTCATGCTTTCTATAACAGTTCCTTACGGGTTCCTTACGGGAGGCTACTAGACTGGTCTTTCCCTTTTAGGTAATGATTAGTGCAACCCAAGGTCAAGGGAGGTGGAACCTTCCAAGGAGGAAAGGACGTAAGCCATTAAATGAAGCCTCATGGAAGATAGATGGTAAAGATGAGAAAATAAAGAGTAAAGAGATCTATCATCCAAACTAACGTCCTTCCCATAAGTGTTCACCCCACCCCATCTCCCACCACATTCTCAAACATTGAGAGAAAAGAAAGGGAACCCCTAAAAGAATAACTTCCCACAAATGTTTGGAAGTGCCTTTTGAATGCTACTCGAGAGGGTGCTAACATCATAGTTAATCAATGTTCATTATATTCTTCATACGA
ATCTTTATTCATCATATGAAGAAGCTTCAAGAGAATATTCCACGCTAAACAAGTAGCATAACATAATTGTCTTACGATCTTGTTGTTAGGAACCTGAAGTAGCGGACTAGTTTGACTTTTGGGTAATTTCAAGAATGCCCTTGTGTCCTAGTGGGATTGATATTTCAAAGAAGTTGCTTGTGGTACTTTGTGATTGTTGAGTATTTATGATGGCGAAAGCCTTCTTTTGAATTGGTTGGTTGGAAATTGATTAGAGAAGTTTTTGAAGGATGCTCGCTATTAGACTGTATTTGGGAGCAATTAGAATTATAGGCGTCTCTATCTAATGTTTGTTATGCCTTTTTGTATATACAATATTCTTTCATCCATTCTAATTAGAGTAGTTCTTTTACAACCCTTGTGTTTTTATGTTTTCTATAATTTTTTTCTTTCCTTTTTTTGGTTGAAAAAAGAAACATAAGAGATACATATCATGTATCTAGGAAGAAGTGAAATTCAAGGATGAGAAGAAGAGTATGAATTTTTTTTAAAAAATAAATAACTTTCCGTTGATGTATAGAAAGGAAGGAAAAACGTTAAGGATACAAGTTGCCAGTGGGAACACAACAGACCAGATAAGAAAAAATAAGCCCTAAAATACCGGAGGATAAAATGATGAAAAACAACTAGTGAGACAATAAAAAATACAGCAAAAGATTTGAAATAAAGAGCTCCCACTTTGGCTAAAAATGTAGGAACTTCTTCGGAATGGAAGTAAACAAATACTCAAGAACTTGAAACTTAGAAAAGCAACTCGAGAAAATGAAGCCGAGAAAAGATATTTTTTGAAGAAACACTTCTTGCATTGAAAGAGTTATTAGACTAATGCAAGCCTTTGAAACCTTCCAAAATCCTCTAGAATGCCAAAGGAGGGATCTTCGTTGAGGTTTATAAAATTTTACCAATACCCGCCAATATAACCAAATCGAGAACACAATTCGATCAAATTCAAACTCATCCATATTCAATTGTTAATAAAACTGAATTTCTTATTCTAAAAAGCACATTGGACTGGAAAAACTTAAAACCGAATGCTTTGAAGCTTTCCACAACCAAAAGCTATAAAAGAATCTTGAATAAGGGAATAAACAAAAAGAACAACTTAACCTTTTTCTGAGTTGGAGTAGGGGAAATTTCATGCATATGACGACTGCAAATCTCATTAAGAGGTGAAATTTTCAGGAATCAAGATGGACTCCACCAGAAAATCTTCTTTGACCGGGTCATCCATAAACTTTGAAGAAGCTATAATATTTTTCGAAAAAAGGAGAGTCGAGGATTGTTCAAGATTGTCATGGATTTTCTGATCAATTTCCAGTTTCTCATTTCTTAATGATTCAGCATCAGAGTCGATAAATGGTGAAAAAAATGATAGAGCATCTGAAATTTTAGAGGAAAACAGTACACCATTTTCTGAATTAAGTTTGAACTTGGAATGGAGAATGATGAATTAACAAAACCATGAGCAAACATCTTCAATAAAATTTGGATTAGTCTCTAAAATTTATGTACTTATTTTATTTGATCATAAGACCAAATGAGAATCAGTTGCATTTATTGAGATAATAGGATCTGACTGCATATGCATACAAAATTCTATTGACTTGATAAAATTTGAAGAGTCTGGAACCATGAAAGACAAATAGTTAAATGCATTTTTTCTTTTATAAAATTTCGAATAGTGGCCTCGTTAATAACCTTGACTATTAGGTGTTCACTTCCAATATTTCAATCTCAATCACTTCCATTTAAAGCTGGTCCAATAAAACCGGGGGAAATAACACCTTTGAAGATGAGGACTGTTCTGATCATGCAAGAAACCGCCATTTAATCCTTTGTGTCCCACAAAGTTATTGTTGCTTGGAGGACCAAGGCCATCCACATTATTGATCGTATCATGAACGGAAGGGAACTTTGAATAATGGACCCCTGCCATCAAATCATCTTTGTTTGTGATAGTG
AACCCTTCATCTTCCATTACCTTTGAACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAATTGAAAATCAATCGACTTTTAATTATGAAATCTTGATAAGAAAAATCTATGAGAATATCACATACTAGGAGAGCCGATAGCTGGAAGGTTGAACGAAGAAAATGCAGATAACAATAAAAGGGCAAGCAACGGTTGCAGTCAATGAGTTGATGCTGTGTGGTTAGGGAAGGAAGAGCAATTGAGTGGTGGGAGAATGAGAAGAAGAATAGAAAGAAAAAGAAGAAAAAGGGGCTAATGGACAAAGGTAGAGTGGTCGATGGAATGGATGGGGAATGGGTGAGGAAAGAAGGATGGGAAAGTCGTTGACATGAGTCATGTGAATGGTAGAGATGTCCATTTAATCGTAAGGGCCCGAGTCTTTAGAGGACTCATCTCGTATAGGATGGGGAGTCTCGTACACAAGGAATGGAGGAGGGAGTAGGGAGAGTTTTCTCCCCAATGACTAAACGGGGACCATGTGTCGAAGGCTATGTTGTAATATTGAAATTTGTATTTTTAGAAATTATGTTCGTTGAGTTGATGTTGCATTATTGAAATTTAGGTATAAAAATTTATGTTAGTTGAACATTATAAGTGTATTTTTGCTTATGGTATATTATTATTTTTATTTTTATTGTGATTATTTGTAATTTTTCTTTAAATAGTTATGTTGAGGACAATCCATTTAGAAGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGGAAAGAATGGTTTTTAGAGTTNNNNTTAAAAGATAGAATAGATAAAAAAAGAAATAAAAGACGGAAAAAGTTTTCTTGTAGTGAACTTGCTCTCAGACAAATAAATTAAAATTTTTGTGAGAATAGAGATTCAAATTGGTGGGAGGAATGTTTCTATTCTTACCTTGCCTTGTGGACATCTCTAACGAAGGGGCAGGGGAGATAAGTGGCACAAATGGGGTTTAGAAAAAGGATTGAAGGTGGCACAAGAACAAACAAGGAAGGTACGATTGTCAGAATCGTCAAAGAATTCTTCCCTACTTATCCCTAAAATTGTAGAAGACCCAACAAGTGGAAATCAATATTAATTGGGGAGGAAATGACATTACCCGAGTTCGAACATAGGACCTCTTGCTTTGATATCATGTTAAAACACCATTCCACCTCTAGGTTCTATGTTTCTCTTTGGGTGTCAGTTTCTAAGATCGTTTTTAATTATTCACTTAGATTTTTCGTTGACATAATGAAAATAGACTAATGCTTCAAGTATGAATAGAAGTGAACAAAAGTAACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAANAAAAAAGAAAAAGAAAAAAAAACAATTCAAACTAAAAAGTTACAATAAGCAAAGCACTATAAATAATATATAAATGCATTCCAATTCAAACATATGTTCCGAAGTGAAAATCCCACGAACAACTTTGTCAAAGAACACCAAGAAGAGGTTTTGAGATGAGCTGTTTTCCTTGAAAAAATTCTTTGATTCCGTTCCAACCATAGTTCTGAAAGAAAAGTTTTAACCGCATTAATCCATAACCACTTGGCCTTAGAATTGAGAAGTGTTGGACCATGTAGTTGTGGGAGATACTGCGATAAGACATTAGAAAAAATTGAAAATTGAGAACAGCTTGAACCAACCAAAGCTTGCATGAGAGCAACTAAAATACACATGGTTGGGAGAGCATCCTGAAGGCTGAGGAGAAGAAACTGATGGAAACAGACGGTGAGCTGGATTCTTCTTTTGTAAAACATTGGACGGATTAAAATTCCCATCAAGCATTATCCACAAAAAATATAAGTTCTCTTGGGACTCTTGGATTTCCGTGAGGCTAATTATAACTCTTTCTCCAAAGGTGAAGAAGATACCAGGTGATCATTTAAAGATTTTGCGG
AGAAAATACCTGAAGGATCATTTGACCAATGCCTAGAATCTTCATTGTATTAAGACTCGGAACAAAAATAGTGTGCAGTAGAGTCTGAAAATCGGCAATCTCCTCATCTTTCAAGCATCGACAAAAAGATAGATTCCAAGAACCAGCGTGCATATCCCAAAACGCTCAAACAGAGGCATTGGGAGAAGATGAAATAGCATACAGCCAAGGAAGGAAGTCCTTAAAACATAAAATTTCAACCAAGGATCTTCCCAAAATCGAATCTTCCTTCCCTTGTCCAAATTAAAGTGAGCAAGTGATTCAAAGTTCTGCCATAATTTTGAAATGGACATCCATGGACTTCGAAGACTTAAATCTCTTCCAAGCCTGAATTCCATCCATTATTACAGAACCATAGATGCTAACAATTACTTTCCTCCATAAAAATTGAGGTTCCATCATGAAAGCCAGTTCCATCTGGCCAATAAAGCCATATGTCTCTGCTTCCATGCACCAATTTCAAGTCCACCATCTCTTTGAGTCTTTGAAGCAAGACTCCATCTAACCAAATGTTACGAAACTGACAAGGGGAAGGTCCCTAAGAAAAACTACCAGCTTCAAAAAGAGGAGGGAAGTGATAAAAACGAGGACCTGTTCTGAGACTCTAGAATTATCAAACATCACATCCCATTCCTTGGAAATCAAAGAACGATCTATGAGAAATTGAGAAGAATCATCTCCTGGTCTTGACCTCGTTACTTTCCATTAGATAAAGGCAACTCCCAAAGGCCTTAATCATCAATTCAATTATTAAACTGTCTCATTCCCTCTGATTGTTCAACCAACTGGAATTCTTTCTTGAATAGAGTATCTTCATGAAATGATTTTTTGCTGAAAAGAATATCTTCAAAAGAAGTTAGACGTACTCCACCATTTCTGAAAAATACGAAGTAAAGTCTACAGTCGCAAATTTGAAAGCCCTCCCTAGCGTTTTCAACTTTCTCAAGATGGTAGCAAAAAGTCAATAATAGTCATTTTTTTTGGGTGTTTTGTTAGGTCTATGAAAAGTTTTGGTTCATGGTTATTCTCTATTTGAGTGTCTTTTATATCATTACTTTTTTCTTTTTCACTCCCTCGGGGATTTTGTATCCCTTGAACTTTTAATCCTTTTCATTATATCTATGAGAATTTTCTTGTTAAAAAAACCTAGTAAATTGTTATTTATGGAAGACAACGTAGTCAAAGGATTTGGTATTCTGCACAAGACAAACTTTTGTGAACAATAGCATTTGATTTTTCCTTCTTTTTGTTTTTTCTTTTTTATTTTAATGATGTTTGATTAAATATTTGTATTTATTGGGGTTGATATTTGATTTATTGATCTCCACATTCAGGCTGCTCCTTTGAAGCGCTTAAGCGACTTACATCCATTAACAACTTGCTCAGATCTAAAAGGCACTTTAACTAATGCTAAAACTGGATATTGGACATTTTTTATGGATAATTTTGAAATGGCCATCTCTGCTAGCAAGAATTTGTACGAAAACTGTGCAAAGGGATGGTTCTAAACTTTTTGGTTCTTCATTGACTATTCTTTAATGAGCTGTTGTGATAACCTGTTAAGTTCTGGTACTTTCTTCTGTACCAGGGAAAGAGATGGGGTGGATGTTGGGGTGCATTATCTTGTTTTCTCTCTCAGAAAGGAGAAAAGCGCATTGTGCCTGCATCTCGTTTACCTGAGGGCAATGTCGTAACAACCCAGCCAAATGGACCTCATGCAGCAGGAATAGCCAATCAGGCTACTGTGATAGCTCCATCCCTTTTAGCCCCACCTTCTTCACCAGCATCCTTTACAAATTCTGCACTCCCTTCAACAGCCCAATCACCTAGCTGTTTCTTGTCGTTGTCTGCCAACTCACCTGGAGGTCCTTCATCCACAATGTTCGCTACAGGGCCATATGCGCATGAAACACAACCGGTTTCTCCTCCTGTTTTCTCAGCCTTCACCACTGAACCGTCAA
CTGCTCCACTCACTCCCCCACCCGAACTAGCTCACCTAACCACTCCTTCTTCCCCTGATGTGCCTTTTGCTCAGTTCCTATCCTCATCGGTGGATCTCAAAGGAACTGGAAAGGCAAATTACGTTGCTTCCAATGATCTTCAAGCAGCATATTCTCTCTACCCTGGAAGTCCTGCCAGTAGCCTCGTGTCACCAATTTCAAGGACCTCCGGCGATTGCTTATCATCTTCTTTTCCTGAGAGGGACTTTCCACCACAGTGGAATCCTTCAGCTTCTCTCCAAGATGGAAAATATCCAAGAAGTGGTTCTGGTCGGCTATTTGGACATGAGAAAACTGGCACACCTCTTGCTTCTCAGGATTCTAATTTCTTCTGCCCTGCTACATTTGCACAATTCTATCTGGATAATCCACCATTCCCTCATACTGGTGGGAGGTTAAGTGTATCAAAGGATTCAGATGTTTACGCTTCTGGTGGGAATGGATACCAAAACCGGCACAGTAAGTCTCCAAAACAAGATGTGGAGGAAATAGAAGCTTACCGAGCATCTTTTGGTTTCAGTGCCGATGAAATTATAACTACCACACAATACGTGGAGATATCTGATGTAATGGAAGATTCCTTTACTATGAGACCTTTTACTTCAACTAGTCTGTCTGCAGAAGAAAGTATTCAACCTCCATTAGTGGGTGAAAAACCGAAATCCACGCAGGCAACTTTACACAGTCAAAGAAGTATTAAATCAGCATCTGACATTGTTGAAAAAGAAACCTGCTCTGAAGTCCTGGCATTATGCAATGGCTGTAAAGGTGAGCCACTGTTCCCTGTTACAAGCTCACACTTGTTTCATGAGAAATCAAAAGAAAATAAAGACAAATACATTTTGGGGAACTTTTTCAAAATGTTGCAACTTTTCTTTCCGTTTCATTATTTCCTTTTTATATACACTGACTCACAATTCTGATGCTAGTAGTCAGGTTAAATTACATTTTTGGGGGTGTGGTTTTAGGAAAGCTACAATTTAGTTATGGTTACAATTGTCACTTGTTTGTCACCTGTTTGATGCAGACAATAAATTGCAAAGACAACCTGGTAACTTGCCAGGATCAAGTACTTCCCAAGGTGAAACAGAAGACCTATTCTCAAGAATAGGGTCGTCCAAAAATAGCCGCAAGTATAATCATGCTTTATCCTGCTCTGATGCAGAAGTTGACTACAGAAGAGGAAGGAGCCTGAGGGGGGAGGTCAAGGGAGATTTTTTATGGCATGACTAAGAGCCATTTCTGAAATAGTTTGTAGTATGTGTATCTGTTCTTTGCTTTGCGGGTTTCCATGGAATGTCTAACCTATGACCTAACCTGTTCTCTGAGTTGACTTGATCAGTGGATGGATGCATATAGTTTATGTTCTTTATCGTTCCAGTTTAGAAATGGAATGGGTCAAATCTCTGTATTTGGTAGACGAAGTTTGTCTTGTTTCCAATGAATTATATAATAGCATGTACAATCTTGCCC

mRNA sequence

ATGGGGTCCGAGCAGAATAGAATCCCTCAACAAGAACGGGGAAAGAGATGGGGTGGATGTTGGGGTGCATTATCTTGTTTTCTCTCTCAGAAAGGAGAAAAGCGCATTGTGCCTGCATCTCGTTTACCTGAGGGCAATGTCGTAACAACCCAGCCAAATGGACCTCATGCAGCAGGAATAGCCAATCAGGCTACTGTGATAGCTCCATCCCTTTTAGCCCCACCTTCTTCACCAGCATCCTTTACAAATTCTGCACTCCCTTCAACAGCCCAATCACCTAGCTGTTTCTTGTCGTTGTCTGCCAACTCACCTGGAGGTCCTTCATCCACAATGTTCGCTACAGGGCCATATGCGCATGAAACACAACCGGTTTCTCCTCCTGTTTTCTCAGCCTTCACCACTGAACCGTCAACTGCTCCACTCACTCCCCCACCCGAACTAGCTCACCTAACCACTCCTTCTTCCCCTGATGTGCCTTTTGCTCAGTTCCTATCCTCATCGGTGGATCTCAAAGGAACTGGAAAGGCAAATTACGTTGCTTCCAATGATCTTCAAGCAGCATATTCTCTCTACCCTGGAAGTCCTGCCAGTAGCCTCGTGTCACCAATTTCAAGGACCTCCGGCGATTGCTTATCATCTTCTTTTCCTGAGAGGGACTTTCCACCACAGTGGAATCCTTCAGCTTCTCTCCAAGATGGAAAATATCCAAGAAGTGGTTCTGGTCGGCTATTTGGACATGAGAAAACTGGCACACCTCTTGCTTCTCAGGATTCTAATTTCTTCTGCCCTGCTACATTTGCACAATTCTATCTGGATAATCCACCATTCCCTCATACTGGTGGGAGGTTAAGTGTATCAAAGGATTCAGATGTTTACGCTTCTGGTGGGAATGGATACCAAAACCGGCACAGTAAGTCTCCAAAACAAGATGTGGAGGAAATAGAAGCTTACCGAGCATCTTTTGGTTTCAGTGCCGATGAAATTATAACTACCACACAATACGTGGAGATATCTGATGTAATGGAAGATTCCTTTACTATGAGACCTTTTACTTCAACTAGTCTGTCTGCAGAAGAAAGTATTCAACCTCCATTAGTGGGTGAAAAACCGAAATCCACGCAGGCAACTTTACACAGTCAAAGAAGTATTAAATCAGCATCTGACATTGTTGAAAAAGAAACCTGCTCTGAAGTCCTGGCATTATGCAATGGCTGTAAAGACAATAAATTGCAAAGACAACCTGGTAACTTGCCAGGATCAAGTACTTCCCAAGGTGAAACAGAAGACCTATTCTCAAGAATAGGGTCGTCCAAAAATAGCCGCAAGTATAATCATGCTTTATCCTGCTCTGATGCAGAAGTTGACTACAGAAGAGGAAGGAGCCTGAGGGGGGAGGTCAAGGGAGATTTTTTATGGCATGACTAAGAGCCATTTCTGAAATAGTTTGTAGTATGTGTATCTGTTCTTTGCTTTGCGGGTTTCCATGGAATGTCTAACCTATGACCTAACCTGTTCTCTGAGTTGACTTGATCAGTGGATGGATGCATATAGTTTATGTTCTTTATCGTTCCAGTTTAGAAATGGAATGGGTCAAATCTCTGTATTTGGTAGACGAAGTTTGTCTTGTTTCCAATGAATTATATAATAGCATGTACAATCTTGCCC

Coding sequence (CDS)

ATGGGGTCCGAGCAGAATAGAATCCCTCAACAAGAACGGGGAAAGAGATGGGGTGGATGTTGGGGTGCATTATCTTGTTTTCTCTCTCAGAAAGGAGAAAAGCGCATTGTGCCTGCATCTCGTTTACCTGAGGGCAATGTCGTAACAACCCAGCCAAATGGACCTCATGCAGCAGGAATAGCCAATCAGGCTACTGTGATAGCTCCATCCCTTTTAGCCCCACCTTCTTCACCAGCATCCTTTACAAATTCTGCACTCCCTTCAACAGCCCAATCACCTAGCTGTTTCTTGTCGTTGTCTGCCAACTCACCTGGAGGTCCTTCATCCACAATGTTCGCTACAGGGCCATATGCGCATGAAACACAACCGGTTTCTCCTCCTGTTTTCTCAGCCTTCACCACTGAACCGTCAACTGCTCCACTCACTCCCCCACCCGAACTAGCTCACCTAACCACTCCTTCTTCCCCTGATGTGCCTTTTGCTCAGTTCCTATCCTCATCGGTGGATCTCAAAGGAACTGGAAAGGCAAATTACGTTGCTTCCAATGATCTTCAAGCAGCATATTCTCTCTACCCTGGAAGTCCTGCCAGTAGCCTCGTGTCACCAATTTCAAGGACCTCCGGCGATTGCTTATCATCTTCTTTTCCTGAGAGGGACTTTCCACCACAGTGGAATCCTTCAGCTTCTCTCCAAGATGGAAAATATCCAAGAAGTGGTTCTGGTCGGCTATTTGGACATGAGAAAACTGGCACACCTCTTGCTTCTCAGGATTCTAATTTCTTCTGCCCTGCTACATTTGCACAATTCTATCTGGATAATCCACCATTCCCTCATACTGGTGGGAGGTTAAGTGTATCAAAGGATTCAGATGTTTACGCTTCTGGTGGGAATGGATACCAAAACCGGCACAGTAAGTCTCCAAAACAAGATGTGGAGGAAATAGAAGCTTACCGAGCATCTTTTGGTTTCAGTGCCGATGAAATTATAACTACCACACAATACGTGGAGATATCTGATGTAATGGAAGATTCCTTTACTATGAGACCTTTTACTTCAACTAGTCTGTCTGCAGAAGAAAGTATTCAACCTCCATTAGTGGGTGAAAAACCGAAATCCACGCAGGCAACTTTACACAGTCAAAGAAGTATTAAATCAGCATCTGACATTGTTGAAAAAGAAACCTGCTCTGAAGTCCTGGCATTATGCAATGGCTGTAAAGACAATAAATTGCAAAGACAACCTGGTAACTTGCCAGGATCAAGTACTTCCCAAGGTGAAACAGAAGACCTATTCTCAAGAATAGGGTCGTCCAAAAATAGCCGCAAGTATAATCATGCTTTATCCTGCTCTGATGCAGAAGTTGACTACAGAAGAGGAAGGAGCCTGAGGGGGGAGGTCAAGGGAGATTTTTTATGGCATGACTAA

Protein sequence

MGSEQNRIPQQERGKRWGGCWGALSCFLSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIQPPLVGEKPKSTQATLHSQRSIKSASDIVEKETCSEVLALCNGCKDNKLQRQPGNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD
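
The relationship between the coding sequence and the protein sequence above can be spot-checked by translating a few codons. This is a minimal sketch, assuming the standard nuclear genetic code; the helper `translate` and the constants `CDS_HEAD` and `CDS_TAIL` are illustrative names, and only the first 13 and last 7 codons of the CDS are copied in rather than the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence in frame 0; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))

# First 13 codons and last 7 codons of the CDS listed above.
CDS_HEAD = "ATGGGGTCCGAGCAGAATAGAATCCCTCAACAAGAACGG"
CDS_TAIL = "GATTTTTTATGGCATGACTAA"

print(translate(CDS_HEAD))  # start of the protein
print(translate(CDS_TAIL))  # end of the protein, including the stop codon '*'
```

The two fragments translate to `MGSEQNRIPQQER` and `DFLWHD*`, matching the start and end of the listed protein sequence (the stop codon is not shown in the protein record).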
Homology
BLAST of CmaCh01G020580 vs. ExPASy Swiss-Prot
Match: Q9SRE5 (Uncharacterized protein At1g76660 OS=Arabidopsis thaliana OX=3702 GN=At1g76660 PE=2 SV=1)

HSP 1 Score: 458.8 bits (1179), Expect = 7.7e-128
Identity = 276/469 (58.85%), Positives = 323/469 (68.87%), Query Frame = 0

Query: 1   MGSEQNRIPQQERGKRWGGCWGALSCFLSQKGEKRIVPASRLPE-GNVVTTQPNGPHAAG 60
           MGSE      Q++ KRWGGC G  SCF SQKG KRIVPASR+PE GNV  +QPNG H AG
Sbjct: 1   MGSE------QDQRKRWGGCLGVFSCFKSQKGGKRIVPASRIPEGGNVSASQPNGAHQAG 60

Query: 61  IANQ--ATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPY 120
           + N   A  I  SLLAPPSSPASFTNSALPST QSP+C+LSL+ANSPGGPSS+M+ATGPY
Sbjct: 61  VLNNQAAGGINLSLLAPPSSPASFTNSALPSTTQSPNCYLSLAANSPGGPSSSMYATGPY 120

Query: 121 AHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKAN 180
           AHETQ VSPPVFS FTTEPSTAP TPPPELA LT PSSPDVP+A+FL+SS+DLK +GK +
Sbjct: 121 AHETQLVSPPVFSTFTTEPSTAPFTPPPELARLTAPSSPDVPYARFLTSSMDLKNSGKGH 180

Query: 181 YVASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPR 240
           Y   NDLQA YSLYPGSPAS+L SPISR SGD L S                 Q+GK  R
Sbjct: 181 Y---NDLQATYSLYPGSPASALRSPISRASGDGLLSP----------------QNGKCSR 240

Query: 241 SGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYASG- 300
           S SG  FG++  G     Q+SNFFCP TFA+FYLD +P  P  GGRLSVSKDSDVY +  
Sbjct: 241 SDSGNTFGYDTNGVSTPLQESNFFCPETFAKFYLDHDPSVPQNGGRLSVSKDSDVYPTNG 300

Query: 301 -GNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTS 360
            GNG QNR ++SPKQD+EE+EAYRASFGFSADEIITT+QYVEI+DVM+ SF    ++   
Sbjct: 301 YGNGNQNRQNRSPKQDMEELEAYRASFGFSADEIITTSQYVEITDVMDGSFNTSAYS--- 360

Query: 361 LSAEESIQPPLVGEKPKSTQATLHSQRSIKSASDIVEKETCSEVLALCNGCKDNKLQRQP 420
                    P  G+K    +A L SQ S KS +D+  +    +     N  KD+K + + 
Sbjct: 361 ---------PSDGQKLLRREANLLSQTSPKSEADLDSQVVDFQSPKSSNSYKDHKQRNR- 420

Query: 421 GNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLR 464
                      + E L SR+GS K SR Y+  +S SDAEV+YRRGRSLR
Sbjct: 421 --------IHADEEALLSRVGSVKGSRSYH--ISSSDAEVEYRRGRSLR 421
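
The percentages in the HSP statistics line of this Swiss-Prot hit follow from the counts shown next to them. The sketch below recomputes them; it assumes, as is usual for BLAST reports, that the denominator is the alignment length including gap columns. The function name `check_percentages` is hypothetical.

```python
import re

# Statistics line of the Swiss-Prot HSP (Q9SRE5).
LINE = "Identity = 276/469 (58.85%), Positives = 323/469 (68.87%), Query Frame = 0"

def check_percentages(line):
    """Return (count, alignment_length, reported_pct, recomputed_pct) tuples."""
    results = []
    for num, den, shown in re.findall(r"(\d+)/(\d+) \((\d+\.\d+)%\)", line):
        recomputed = round(100.0 * int(num) / int(den), 2)
        results.append((int(num), int(den), float(shown), recomputed))
    return results

for count, length, reported, recomputed in check_percentages(LINE):
    print(f"{count}/{length}: reported {reported}%, recomputed {recomputed}%")
```

The same check can be applied to the statistics line of any other hit on this page by replacing `LINE`.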

BLAST of CmaCh01G020580 vs. ExPASy TrEMBL
Match: A0A6J1IZ74 (uncharacterized protein At1g76660-like OS=Cucurbita maxima OX=3661 GN=LOC111480495 PE=4 SV=1)

HSP 1 Score: 936.4 bits (2419), Expect = 4.6e-269
Identity = 474/474 (100.00%), Positives = 474/474 (100.00%), Query Frame = 0

Query: 1   MGSEQNRIPQQERGKRWGGCWGALSCFLSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGI 60
           MGSEQNRIPQQERGKRWGGCWGALSCFLSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGI
Sbjct: 1   MGSEQNRIPQQERGKRWGGCWGALSCFLSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGI 60

Query: 61  ANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHE 120
           ANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHE
Sbjct: 61  ANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHE 120

Query: 121 TQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVA 180
           TQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVA
Sbjct: 121 TQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVA 180

Query: 181 SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGS 240
           SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGS
Sbjct: 181 SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGS 240

Query: 241 GRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ 300
           GRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ
Sbjct: 241 GRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ 300

Query: 301 NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES 360
           NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES
Sbjct: 301 NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES 360

Query: 361 IQPPLVGEKPKSTQATLHSQRSIKSASDIVEKETCSEVLALCNGCKDNKLQRQPGNLPGS 420
           IQPPLVGEKPKSTQATLHSQRSIKSASDIVEKETCSEVLALCNGCKDNKLQRQPGNLPGS
Sbjct: 361 IQPPLVGEKPKSTQATLHSQRSIKSASDIVEKETCSEVLALCNGCKDNKLQRQPGNLPGS 420

Query: 421 STSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD 475
           STSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD
Sbjct: 421 STSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD 474

BLAST of CmaCh01G020580 vs. ExPASy TrEMBL
Match: A0A6J1FRS7 (uncharacterized protein At1g76660 OS=Cucurbita moschata OX=3662 GN=LOC111446304 PE=4 SV=1)

HSP 1 Score: 917.5 bits (2370), Expect = 2.2e-263
Identity = 468/474 (98.73%), Positives = 470/474 (99.16%), Query Frame = 0

Query: 1   MGSEQNRIPQQERGKRWGGCWGALSCFLSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGI 60
           MGSEQNRIPQQERGKRWGGCWGALSCF SQKGEKRIVPASRLPEGNVVTTQPNGPHAAGI
Sbjct: 1   MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGI 60

Query: 61  ANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHE 120
           ANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHE
Sbjct: 61  ANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHE 120

Query: 121 TQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVA 180
           TQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS+DLKGTGKANYVA
Sbjct: 121 TQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVA 180

Query: 181 SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGS 240
           SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGS
Sbjct: 181 SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGS 240

Query: 241 GRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ 300
           GRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ
Sbjct: 241 GRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ 300

Query: 301 NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES 360
           NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES
Sbjct: 301 NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES 360

Query: 361 IQPPLVGEKPKSTQATLHSQRSIKSASDIVEKETCSEVLALCNGCKDNKLQRQPGNLPGS 420
           IQPPLVGEK KSTQATL SQRSIKSASD VEKETCSEVLALCNGCKD+KLQRQPGNLPGS
Sbjct: 361 IQPPLVGEKLKSTQATLQSQRSIKSASD-VEKETCSEVLALCNGCKDDKLQRQPGNLPGS 420

Query: 421 STSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD 475
           STSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD
Sbjct: 421 STSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD 473

BLAST of CmaCh01G020580 vs. ExPASy TrEMBL
Match: A0A6J1BVS7 (uncharacterized protein At1g76660 OS=Momordica charantia OX=3673 GN=LOC111005956 PE=4 SV=1)

HSP 1 Score: 843.2 bits (2177), Expect = 5.3e-241
Identity = 432/476 (90.76%), Positives = 447/476 (93.91%), Query Frame = 0

Query: 1   MGSEQNRIPQQERGKRWGGCWGALSCFLSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGI 60
           MGSEQNR PQQER KRWGGCWGALSCF SQKG KRIVPASRLPEGN VTTQPNGP AAG+
Sbjct: 1   MGSEQNRFPQQERRKRWGGCWGALSCFHSQKGGKRIVPASRLPEGNAVTTQPNGPQAAGM 60

Query: 61  ANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHE 120
            NQATVIAPSLLAPPSSPASFTNSALPST QSPSCFLSLSANSPGGPSSTMFATGPYAHE
Sbjct: 61  TNQATVIAPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMFATGPYAHE 120

Query: 121 TQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVA 180
            Q VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGK NY+A
Sbjct: 121 PQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKTNYIA 180

Query: 181 SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGS 240
           SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPS+S QDGKYPRSGS
Sbjct: 181 SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSSSPQDGKYPRSGS 240

Query: 241 GRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ 300
           GRLFGHEKTGT LASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVY+SGGNGYQ
Sbjct: 241 GRLFGHEKTGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSGGNGYQ 300

Query: 301 NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES 360
           NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES
Sbjct: 301 NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES 360

Query: 361 IQPPLVGEKPKSTQATLHSQRSIKSASDIVEKETCSEVLALCNGCKDNKLQRQPGNLPGS 420
           I+PPL+GEK KST+ T+ SQRS+K ASD+VEKETC+EVL LCNGC+DNKLQRQPGN+ GS
Sbjct: 361 IEPPLLGEKLKSTKTTIQSQRSMKPASDVVEKETCAEVLPLCNGCEDNKLQRQPGNMSGS 420

Query: 421 STS--QGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD 475
           S+S  Q ETED+FSRI   KNSRKYN  LSCSDAEVDYRRGRSLR EVKGDF WHD
Sbjct: 421 SSSFNQVETEDVFSRIVPPKNSRKYNLGLSCSDAEVDYRRGRSLR-EVKGDFSWHD 475

BLAST of CmaCh01G020580 vs. ExPASy TrEMBL
Match: A0A1S3BV86 (uncharacterized protein At1g76660 OS=Cucumis melo OX=3656 GN=LOC103493867 PE=4 SV=1)

HSP 1 Score: 820.5 bits (2118), Expect = 3.7e-234
Identity = 422/474 (89.03%), Positives = 441/474 (93.04%), Query Frame = 0

Query: 1   MGSEQNRIPQQERGKRWGGCWGALSCFLSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGI 60
           MGSEQNR PQQERGKRWGGCWGALSCF SQKGEKRIVPASRLPEGNVVTTQPNGP AAG+
Sbjct: 1   MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGM 60

Query: 61  ANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHE 120
            NQATVI PSLLAPPSSPASFTNSALPST QSPSCFLSLSANSPGGPSST++ATGPYAHE
Sbjct: 61  TNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTIYATGPYAHE 120

Query: 121 TQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVA 180
           TQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS DLKGTGKANY+A
Sbjct: 121 TQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDLKGTGKANYIA 180

Query: 181 SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGS 240
           SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDF PQWN SASLQDGKYPRSGS
Sbjct: 181 SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGS 240

Query: 241 GRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ 300
           GRLFG+EK GT LASQDSNFFCPATFAQFYLDN  FPHTGGRLSVSKDSDVY+S GNGYQ
Sbjct: 241 GRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ 300

Query: 301 NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES 360
           NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES
Sbjct: 301 NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES 360

Query: 361 IQPPLVGEKPKSTQATLHSQRSIKSASDIVEKETCSEVLALCNGCKDNKLQRQPGNLPGS 420
            +PPL+GEK KS+  TL +QRSIKSA ++VEKETC+EV ALCNG KDNKLQRQPG++ GS
Sbjct: 361 TEPPLLGEKLKSSHTTLQNQRSIKSAPEVVEKETCTEVPALCNGYKDNKLQRQPGDILGS 420

Query: 421 STSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD 475
           STS    +D+FSRIGSSKNSRKY+  LSCSDAEVDYRRGRSLR E KG+  WHD
Sbjct: 421 STSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLR-EAKGNGSWHD 473

BLAST of CmaCh01G020580 vs. ExPASy TrEMBL
Match: A0A0A0L1G3 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G665140 PE=4 SV=1)

HSP 1 Score: 800.8 bits (2067), Expect = 3.0e-228
Identity = 415/474 (87.55%), Positives = 434/474 (91.56%), Query Frame = 0

Query: 1   MGSEQNRIPQQERGKRWGGCWGALSCFLSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGI 60
           MGSEQNR PQ ERGKRWGGCWGALSCF SQKG+KRIVPASRLPEGNVVTTQPNGP AAG+
Sbjct: 1   MGSEQNRFPQHERGKRWGGCWGALSCFHSQKGDKRIVPASRLPEGNVVTTQPNGPQAAGM 60

Query: 61  ANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHE 120
            NQATVI PSLLAPPSSPASFTNSALPST QSPSCFLSLSANSPGGPSSTM+ATGPYAH+
Sbjct: 61  TNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMYATGPYAHD 120

Query: 121 TQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVA 180
           TQ VSPPVFSAF TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS DLKGTGKANY+A
Sbjct: 121 TQLVSPPVFSAFNTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSEDLKGTGKANYIA 180

Query: 181 SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGS 240
           SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDF PQWN SASLQDGKYPRSGS
Sbjct: 181 SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGS 240

Query: 241 GRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ 300
           GRLFG+EK GT LASQDSNFFCPATFAQFYLDN  FPHTGGRLSVSKDSDVY+S GNGYQ
Sbjct: 241 GRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ 300

Query: 301 NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES 360
           NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES
Sbjct: 301 NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES 360

Query: 361 IQPPLVGEKPKSTQATLHSQRSIKSASDIVEKETCSEVLALCNGCKDNKLQRQPGNLPGS 420
            +PPL+GEK KS+  TL SQRSIKSA +    ETC+E+ ALCNG KDNKLQRQPG++ GS
Sbjct: 361 TEPPLLGEKLKSSHTTLQSQRSIKSAPE----ETCTEMPALCNGYKDNKLQRQPGDISGS 420

Query: 421 STSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD 475
           STS    +D+FSRIGSSKNSRKY+  LSCSDAEVDYRRGRSLR E KG+  WHD
Sbjct: 421 STSNQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLR-EAKGNGSWHD 469

BLAST of CmaCh01G020580 vs. NCBI nr
Match: XP_022981333.1 (uncharacterized protein At1g76660-like [Cucurbita maxima] >XP_022981334.1 uncharacterized protein At1g76660-like [Cucurbita maxima] >XP_022981335.1 uncharacterized protein At1g76660-like [Cucurbita maxima])

HSP 1 Score: 936.4 bits (2419), Expect = 9.6e-269
Identity = 474/474 (100.00%), Positives = 474/474 (100.00%), Query Frame = 0

Query: 1   MGSEQNRIPQQERGKRWGGCWGALSCFLSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGI 60
           MGSEQNRIPQQERGKRWGGCWGALSCFLSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGI
Sbjct: 1   MGSEQNRIPQQERGKRWGGCWGALSCFLSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGI 60

Query: 61  ANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHE 120
           ANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHE
Sbjct: 61  ANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHE 120

Query: 121 TQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVA 180
           TQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVA
Sbjct: 121 TQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVA 180

Query: 181 SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGS 240
           SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGS
Sbjct: 181 SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGS 240

Query: 241 GRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ 300
           GRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ
Sbjct: 241 GRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ 300

Query: 301 NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES 360
           NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES
Sbjct: 301 NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES 360

Query: 361 IQPPLVGEKPKSTQATLHSQRSIKSASDIVEKETCSEVLALCNGCKDNKLQRQPGNLPGS 420
           IQPPLVGEKPKSTQATLHSQRSIKSASDIVEKETCSEVLALCNGCKDNKLQRQPGNLPGS
Sbjct: 361 IQPPLVGEKPKSTQATLHSQRSIKSASDIVEKETCSEVLALCNGCKDNKLQRQPGNLPGS 420

Query: 421 STSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD 475
           STSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD
Sbjct: 421 STSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD 474

BLAST of CmaCh01G020580 vs. NCBI nr
Match: XP_023524439.1 (uncharacterized protein At1g76660 [Cucurbita pepo subsp. pepo] >XP_023524440.1 uncharacterized protein At1g76660 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 922.9 bits (2384), Expect = 1.1e-264
Identity = 468/474 (98.73%), Positives = 471/474 (99.37%), Query Frame = 0

Query: 1   MGSEQNRIPQQERGKRWGGCWGALSCFLSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGI 60
           MGSEQNRIPQQERGKRWGGCWGALSCF SQKGEKRIVPASRLPEGNVVTTQPNGPHAAGI
Sbjct: 1   MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGI 60

Query: 61  ANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHE 120
           ANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHE
Sbjct: 61  ANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHE 120

Query: 121 TQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVA 180
           TQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVA
Sbjct: 121 TQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVA 180

Query: 181 SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGS 240
           SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGS
Sbjct: 181 SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGS 240

Query: 241 GRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ 300
           GRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ
Sbjct: 241 GRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ 300

Query: 301 NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES 360
           NRHSKSPKQDVEEIEAYRASFGFSADEII+TTQYVEISDVMEDSFTMRPFTSTSLSAEES
Sbjct: 301 NRHSKSPKQDVEEIEAYRASFGFSADEIISTTQYVEISDVMEDSFTMRPFTSTSLSAEES 360

Query: 361 IQPPLVGEKPKSTQATLHSQRSIKSASDIVEKETCSEVLALCNGCKDNKLQRQPGNLPGS 420
           IQPPLVGEK KSTQATL SQRSIKSASD+VEKETCSEVLALCNGCKD+KLQRQPGNLPGS
Sbjct: 361 IQPPLVGEKLKSTQATLQSQRSIKSASDVVEKETCSEVLALCNGCKDDKLQRQPGNLPGS 420

Query: 421 STSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD 475
           STSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD
Sbjct: 421 STSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD 474

BLAST of CmaCh01G020580 vs. NCBI nr
Match: XP_022940824.1 (uncharacterized protein At1g76660 [Cucurbita moschata] >XP_022940825.1 uncharacterized protein At1g76660 [Cucurbita moschata])

HSP 1 Score: 917.5 bits (2370), Expect = 4.6e-263
Identity = 468/474 (98.73%), Positives = 470/474 (99.16%), Query Frame = 0

Query: 1   MGSEQNRIPQQERGKRWGGCWGALSCFLSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGI 60
           MGSEQNRIPQQERGKRWGGCWGALSCF SQKGEKRIVPASRLPEGNVVTTQPNGPHAAGI
Sbjct: 1   MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGI 60

Query: 61  ANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHE 120
           ANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHE
Sbjct: 61  ANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHE 120

Query: 121 TQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVA 180
           TQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS+DLKGTGKANYVA
Sbjct: 121 TQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVA 180

Query: 181 SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGS 240
           SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGS
Sbjct: 181 SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGS 240

Query: 241 GRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ 300
           GRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ
Sbjct: 241 GRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ 300

Query: 301 NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES 360
           NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES
Sbjct: 301 NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES 360

Query: 361 IQPPLVGEKPKSTQATLHSQRSIKSASDIVEKETCSEVLALCNGCKDNKLQRQPGNLPGS 420
           IQPPLVGEK KSTQATL SQRSIKSASD VEKETCSEVLALCNGCKD+KLQRQPGNLPGS
Sbjct: 361 IQPPLVGEKLKSTQATLQSQRSIKSASD-VEKETCSEVLALCNGCKDDKLQRQPGNLPGS 420

Query: 421 STSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD 475
           STSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD
Sbjct: 421 STSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD 473

BLAST of CmaCh01G020580 vs. NCBI nr
Match: KAG6608673.1 (hypothetical protein SDJN03_02015, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 896.7 bits (2316), Expect = 8.4e-257
Identity = 457/468 (97.65%), Positives = 462/468 (98.72%), Query Frame = 0

Query: 1   MGSEQNRIPQQERGKRWGGCWGALSCFLSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGI 60
           MGSEQNRIPQQERGKRWGGCWGALSCF SQKGEKRIVPASRLPEGNVVTTQPNGPHAAGI
Sbjct: 1   MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGI 60

Query: 61  ANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHE 120
           ANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHE
Sbjct: 61  ANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHE 120

Query: 121 TQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVA 180
           TQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS+DLKGTGKANYVA
Sbjct: 121 TQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVA 180

Query: 181 SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGS 240
           SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGS
Sbjct: 181 SNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGS 240

Query: 241 GRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ 300
           GRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ
Sbjct: 241 GRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ 300

Query: 301 NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES 360
           NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSL AEES
Sbjct: 301 NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLPAEES 360

Query: 361 IQPPLVGEKPKSTQATLHSQRSIKSASDIVEKETCSEVLALCNGCKDNKLQRQPGNLPGS 420
           IQPPLVGEK KSTQATL SQRSIKSAS++VEKETCSEVLALCNGCK++KLQRQPGNLPGS
Sbjct: 361 IQPPLVGEKLKSTQATLQSQRSIKSASEVVEKETCSEVLALCNGCKEDKLQRQPGNLPGS 420

Query: 421 STSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKG 469
           STSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGE  G
Sbjct: 421 STSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEGLG 468

BLAST of CmaCh01G020580 vs. NCBI nr
Match: KAG7037989.1 (hypothetical protein SDJN02_01622, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 895.6 bits (2313), Expect = 1.9e-256
Identity = 454/462 (98.27%), Positives = 458/462 (99.13%), Query Frame = 0

Query: 13  RGKRWGGCWGALSCFLSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIANQATVIAPSLL 72
           +GKRWGGCWGALSCF SQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIANQATVIAPSLL
Sbjct: 9   QGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIANQATVIAPSLL 68

Query: 73  APPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHETQPVSPPVFSAF 132
           APPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHETQPVSPPVFSAF
Sbjct: 69  APPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHETQPVSPPVFSAF 128

Query: 133 TTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVASNDLQAAYSLYP 192
           TTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS+DLKGTGKANYVASNDLQAAYSLYP
Sbjct: 129 TTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLQAAYSLYP 188

Query: 193 GSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTP 252
           GSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTP
Sbjct: 189 GSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTP 248

Query: 253 LASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQNRHSKSPKQDVE 312
           LASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQNRHSKSPKQDVE
Sbjct: 249 LASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQNRHSKSPKQDVE 308

Query: 313 EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIQPPLVGEKPKS 372
           EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSL AEESIQPPLVGEK KS
Sbjct: 309 EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLPAEESIQPPLVGEKLKS 368

Query: 373 TQATLHSQRSIKSASDIVEKETCSEVLALCNGCKDNKLQRQPGNLPGSSTSQGETEDLFS 432
           TQATL SQRSIKSASD+VEKETCSEVLALCNGCKD+KLQRQPGNLPGSSTSQGETEDLFS
Sbjct: 369 TQATLQSQRSIKSASDVVEKETCSEVLALCNGCKDDKLQRQPGNLPGSSTSQGETEDLFS 428

Query: 433 RIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD 475
           RIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD
Sbjct: 429 RIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD 470

BLAST of CmaCh01G020580 vs. TAIR 10
Match: AT1G76660.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1); Has 353 Blast hits to 231 proteins in 60 species: Archae - 0; Bacteria - 6; Metazoa - 57; Fungi - 22; Plants - 125; Viruses - 4; Other Eukaryotes - 139 (source: NCBI BLink). )

HSP 1 Score: 458.8 bits (1179), Expect = 5.5e-129
Identity = 276/469 (58.85%), Positives = 323/469 (68.87%), Query Frame = 0

Query: 1   MGSEQNRIPQQERGKRWGGCWGALSCFLSQKGEKRIVPASRLPE-GNVVTTQPNGPHAAG 60
           MGSE      Q++ KRWGGC G  SCF SQKG KRIVPASR+PE GNV  +QPNG H AG
Sbjct: 1   MGSE------QDQRKRWGGCLGVFSCFKSQKGGKRIVPASRIPEGGNVSASQPNGAHQAG 60

Query: 61  IANQ--ATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPY 120
           + N   A  I  SLLAPPSSPASFTNSALPST QSP+C+LSL+ANSPGGPSS+M+ATGPY
Sbjct: 61  VLNNQAAGGINLSLLAPPSSPASFTNSALPSTTQSPNCYLSLAANSPGGPSSSMYATGPY 120

Query: 121 AHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKAN 180
           AHETQ VSPPVFS FTTEPSTAP TPPPELA LT PSSPDVP+A+FL+SS+DLK +GK +
Sbjct: 121 AHETQLVSPPVFSTFTTEPSTAPFTPPPELARLTAPSSPDVPYARFLTSSMDLKNSGKGH 180

Query: 181 YVASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPR 240
           Y   NDLQA YSLYPGSPAS+L SPISR SGD L S                 Q+GK  R
Sbjct: 181 Y---NDLQATYSLYPGSPASALRSPISRASGDGLLSP----------------QNGKCSR 240

Query: 241 SGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYASG- 300
           S SG  FG++  G     Q+SNFFCP TFA+FYLD +P  P  GGRLSVSKDSDVY +  
Sbjct: 241 SDSGNTFGYDTNGVSTPLQESNFFCPETFAKFYLDHDPSVPQNGGRLSVSKDSDVYPTNG 300

Query: 301 -GNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTS 360
            GNG QNR ++SPKQD+EE+EAYRASFGFSADEIITT+QYVEI+DVM+ SF    ++   
Sbjct: 301 YGNGNQNRQNRSPKQDMEELEAYRASFGFSADEIITTSQYVEITDVMDGSFNTSAYS--- 360

Query: 361 LSAEESIQPPLVGEKPKSTQATLHSQRSIKSASDIVEKETCSEVLALCNGCKDNKLQRQP 420
                    P  G+K    +A L SQ S KS +D+  +    +     N  KD+K + + 
Sbjct: 361 ---------PSDGQKLLRREANLLSQTSPKSEADLDSQVVDFQSPKSSNSYKDHKQRNR- 420

Query: 421 GNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLR 464
                      + E L SR+GS K SR Y+  +S SDAEV+YRRGRSLR
Sbjct: 421 --------IHADEEALLSRVGSVKGSRSYH--ISSSDAEVEYRRGRSLR 421

BLAST of CmaCh01G020580 vs. TAIR 10
Match: AT5G52430.1 (hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 142.9 bits (359), Expect = 6.6e-34
Identity = 111/260 (42.69%), Positives = 141/260 (54.23%), Query Frame = 0

Query: 9   PQQERGKRWGGCWGALSCFLSQKGEKRIVPASRLPE----GNVVTTQPNGPHAAGIANQA 68
           P   +  RWG CW   SCF +QK  KRI  A  +PE    G  V T  N       A   
Sbjct: 28  PSSSQKGRWGKCWSLYSCFGTQKNNKRIGNAVLVPEPVTSGVPVVTVQNS------ATST 87

Query: 69  TVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSAN--SPGGPSSTMFATGPYAHETQ 128
           TV+ P  +APPSSPASF  S   S + SP   LSL++N  SP  P S +F  GPYA+ETQ
Sbjct: 88  TVVLP-FIAPPSSPASFLQSDPSSVSHSPVGPLSLTSNTFSPKEPQS-VFTVGPYANETQ 147

Query: 129 PVSPPVFSAFTTEPSTAPLTPPPELA-HLTTPSSPDVPFAQFLSSSVDLKGTGKANYVAS 188
           PV+PPVFSAF TEPSTAP TPPPE + H+TTPSSP+VPFAQ L+SS++L      + +  
Sbjct: 148 PVTPPVFSAFITEPSTAPYTPPPESSVHITTPSSPEVPFAQLLTSSLELTRRDSTSGMNQ 207

Query: 189 NDLQAAY-----SLYPGSP-ASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKY 248
               + Y      + PGSP   +L+SP S  S    SS +P +      +P    + G+ 
Sbjct: 208 KFSSSHYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSPYPGK------SPMVEFRIGEP 267

Query: 249 PRSGSGRLFGHEKTGTPLAS 256
           P+      F   K G+   S
Sbjct: 268 PKFLGFEHFTARKWGSRFGS 273

BLAST of CmaCh01G020580 vs. TAIR 10
Match: AT4G25620.1 (hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 126.3 bits (316), Expect = 6.4e-29
Identity = 95/215 (44.19%), Positives = 121/215 (56.28%), Query Frame = 0

Query: 3   SEQNRIPQQERGKRWGGCWGALSCFLSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIAN 62
           S ++R       K+ G  W    CF S+K  KRI  A  +PE        +G   A + N
Sbjct: 21  SAESRTQPSSVQKKRGSWWSLYWCFGSKKNNKRIGHAVLVPE-----PAASGAAVAPVQN 80

Query: 63  ---QATVIAPSLLAPPSSPASFTNSALPSTAQS--PSCFLSLSANSPGGPSSTMFATGPY 122
               +T I    +APPSSPASF  S  PS + +  P    SL+ N P  PS+  F  GPY
Sbjct: 81  SSSNSTSIFMPFIAPPSSPASFLPSGPPSASHTPDPGLLCSLTVNEP--PSA--FTIGPY 140

Query: 123 AHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLK-----G 182
           AHETQPV+PPVFSAFTTEPSTAP TPPPE     +PSSP+VPFAQ L+SS++       G
Sbjct: 141 AHETQPVTPPVFSAFTTEPSTAPFTPPPE-----SPSSPEVPFAQLLTSSLERARRNSGG 200

Query: 183 TGKANYVASNDLQAAYSLYPGSPASSLVSPISRTS 208
                + A++    +  +YPGSP  +L+SP S TS
Sbjct: 201 GMNQKFSAAHYEFKSCQVYPGSPGGNLISPGSGTS 221

BLAST of CmaCh01G020580 vs. TAIR 10
Match: AT1G63720.1 (BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1); Has 490 Blast hits to 394 proteins in 96 species: Archae - 0; Bacteria - 2; Metazoa - 132; Fungi - 88; Plants - 175; Viruses - 14; Other Eukaryotes - 79 (source: NCBI BLink). )

HSP 1 Score: 120.2 bits (300), Expect = 4.6e-27
Identity = 89/224 (39.73%), Positives = 124/224 (55.36%), Query Frame = 0

Query: 1   MGSEQNRIPQQ---ERGKRWGGCWGALSCFLSQKGEKRIVPASRLPEGNVVTTQPNGPHA 60
           + S  +R+ Q     + ++W   W  L CF S +  KRI  +  +PE   +++  +    
Sbjct: 21  IASSDDRLHQSSPIHKKRKWWNRWSLLKCFGSSRQRKRIGNSVLVPEPVSMSSSNSTTSN 80

Query: 61  AGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPY 120
           +G  +  T +    +APPSSPASF  S  PS  QSP   LS S   P     ++FA GPY
Sbjct: 81  SGYRSVITTL--PFIAPPSSPASFFQSEPPSATQSPVGILSFSP-LPCNNRPSIFAIGPY 140

Query: 121 AHETQPVSPPVFSAFTTEPSTAPLTPPPELAHL----TTPSSPDVPFAQFLSSSVDLKGT 180
           AHETQ VSPPVFS +TTEPS+AP+TPP + + +    TTPSSP+VPFAQ  +S+      
Sbjct: 141 AHETQLVSPPVFSTYTTEPSSAPITPPLDDSSIYLTTTTPSSPEVPFAQLFNSNHQTGSY 200

Query: 181 GKANYVASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPE 218
           G    ++S+     Y L PGSP   L+SP   + G   +S FP+
Sbjct: 201 GYKFPMSSSYEFQFYQLPPGSPLGQLISP---SPGSGPTSPFPD 238

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
Q9SRE5          7.7e-128  58.85     Uncharacterized protein At1g76660 OS=Arabidopsis thaliana OX=3702 GN=At1g76660 P...

Match Name      E-value   Identity  Description
A0A6J1IZ74      4.6e-269  100.00    uncharacterized protein At1g76660-like OS=Cucurbita maxima OX=3661 GN=LOC1114804...
A0A6J1FRS7      2.2e-263  98.73     uncharacterized protein At1g76660 OS=Cucurbita moschata OX=3662 GN=LOC111446304 ...
A0A6J1BVS7      5.3e-241  90.76     uncharacterized protein At1g76660 OS=Momordica charantia OX=3673 GN=LOC111005956...
A0A1S3BV86      3.7e-234  89.03     uncharacterized protein At1g76660 OS=Cucumis melo OX=3656 GN=LOC103493867 PE=4 S...
A0A0A0L1G3      3.0e-228  87.55     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G665140 PE=4 SV=1

Match Name      E-value   Identity  Description
XP_022981333.1  9.6e-269  100.00    uncharacterized protein At1g76660-like [Cucurbita maxima] >XP_022981334.1 unchar...
XP_023524439.1  1.1e-264  98.73     uncharacterized protein At1g76660 [Cucurbita pepo subsp. pepo] >XP_023524440.1 u...
XP_022940824.1  4.6e-263  98.73     uncharacterized protein At1g76660 [Cucurbita moschata] >XP_022940825.1 uncharact...
KAG6608673.1    8.4e-257  97.65     hypothetical protein SDJN03_02015, partial [Cucurbita argyrosperma subsp. sorori...
KAG7037989.1    1.9e-256  98.27     hypothetical protein SDJN02_01622, partial [Cucurbita argyrosperma subsp. argyro...

Match Name      E-value   Identity  Description
AT1G76660.1     5.5e-129  58.85     FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...
AT5G52430.1     6.6e-34   42.69     hydroxyproline-rich glycoprotein family protein
AT4G25620.1     6.4e-29   44.19     hydroxyproline-rich glycoprotein family protein
AT1G63720.1     4.6e-27   39.73     BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein fam...

InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term   IPR Description                         Source       Source Term    Source Description                     Alignment
None       No IPR available                        MOBIDB_LITE  mobidb-lite    disorder_prediction                    coord: 210..234
None       No IPR available                        MOBIDB_LITE  mobidb-lite    disorder_prediction                    coord: 410..429
None       No IPR available                        MOBIDB_LITE  mobidb-lite    disorder_prediction                    coord: 356..380
None       No IPR available                        MOBIDB_LITE  mobidb-lite    disorder_prediction                    coord: 210..249
None       No IPR available                        PANTHER      PTHR31798:SF3  OS01G0103800 PROTEIN                   coord: 1..473
IPR040420  Uncharacterized protein At1g76660-like  PANTHER      PTHR31798      HYDROXYPROLINE-RICH GLYCOPROTEIN-LIKE  coord: 1..473

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh01G020580.1  CmaCh01G020580.1  mRNA
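
Note on the HSP statistics: each HSP header in the BLAST results gives its counts as "matches/alignment length (percent)". The percentage can be recomputed from the two counts as a consistency check. The following is a minimal sketch, assuming the header layout shown on this page. The function names parse_hsp_stats and recomputed_percent are hypothetical helpers written for this illustration and are not part of BLAST or of this database.

```python
import re

# Matches fields such as "Identity = 468/474 (98.73%)" in an HSP header line.
# The label is captured generically, so any "Name = a/b (p%)" field is parsed.
FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp_stats(line):
    """Return {label: (matches, aligned_length, reported_percent)}."""
    return {
        label: (int(num), int(den), float(pct))
        for label, num, den, pct in FIELD.findall(line)
    }


def recomputed_percent(matches, aligned_length):
    """Share of aligned columns that match, as a percentage with two decimals."""
    return round(100.0 * matches / aligned_length, 2)


# Header values copied from the first Cucurbita pepo hit on this page.
line = "Identity = 468/474 (98.73%), Positives = 471/474 (99.37%), Query Frame = 0"
stats = parse_hsp_stats(line)
for label, (num, den, reported) in stats.items():
    print(label, num, den, reported, recomputed_percent(num, den))
```

The denominator is the alignment length including gap columns, so a hit with gaps (such as the TAIR 10 alignments) is scored over more columns than the number of query residues it covers.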