CmaCh19G003970.1 (mRNA) Cucurbita maxima (Rimu)

Name: CmaCh19G003970.1
Type: mRNA
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Ankyrin repeat-containing protein
Location: Cma_Chr19 : 4690471 .. 4705576 (-)
Sequence length: 1092
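
The location field above packs a sequence ID, a start, an end, and a strand into one string. The following is a minimal sketch of how such a string could be split into its parts. The names `Location`, `parse_location`, and `span_length` are hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Illustrative container for a 'seqid : start .. end (strand)' string."""
    seqid: str
    start: int
    end: int
    strand: str


# Whitespace around ':' and '..' is treated as optional.
LOCATION = re.compile(r"^\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text: str) -> Location:
    """Split a location string into sequence ID, coordinates, and strand."""
    match = LOCATION.match(text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Genomic span in bases, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("Cma_Chr19 : 4690471 .. 4705576 (-)")
print(loc.seqid, loc.strand, span_length(loc))
```

Under that coordinate assumption the feature spans 15106 bases on the minus strand of Cma_Chr19, introns included, whereas the "Sequence length" of 1092 refers to the spliced transcript.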
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAACCAAAGGTTGAACGCAGTTGCATCTTCGGGAGATATCGACTCCTTATATTTGATACTTCAAGAAGATGCCTACATTTTAGAACGCATAGATCAGGTATCAACATATTTATTCTTGCATTTGAAATGAAAGAATTCATGGCATTGAATGGTTGGACCATATATTATTTTAAATGAAATAAAAATTACCATCTGAATGGTCAATACTTTAATTTTTCAACCTTATAATACCCATGAAGACGTGACCTAGAATTTCAAACAAGCCTCTAAAGAGGGCAACTGTTGACATGAGGATTCTCTCTTGGGGTCTTTGGCTTTACTCCCGGTATATAAAGCAACGTTTAGGTCCCTCAATTGTCATAGGGGTCTCTCACTCGGGTATTAGGAGGGCATTGTTTGCCTGCCCAAATTGATCCTGTGAGGGTAACACGACTAATTCTTCATCTCTCTTAGGCAGATAAGCTACCTTTATGAAAGACCATTTTGTTTAGATGTTAGGAGAACTAATGAACAAGAAAATGCCCCTTCCAATATCGAGAGATAGACTAAGATACTTGTATCGAGTAGTCATAAAAGTAGGGTCTCAAACAAGTTATCAAACAAGTTATTTGAAGGGAGAAAATAGATACCTTTATATGTTCATATAAATGAAAGAACTTGGGAAGAAACTCAATAAAATATTTAAAGATGCTTGATCATAAAAAATTGATTTTTTTTAAAAAATGAAACATCTAAAAGCTATTTTGTGTACACTATTATACTAATATATTAAATAGTAACAAAAACTCAAATGGTTAAAAGACATTATAGGTCAAATTGTTATCCCACTTATCCATGAACTCAAAACAGCTAAAGATTTTTATTGTTCATATGTTTAGCATCACTATTAGGGTTTGGGATTGATTTGTGTCTTCTTTTTGAAAGGTACCTTTTGTTGATACCCCATTACACATCTCAGCATCAGCAGGACATGTCCCTTTTTCTTTGGAGATCATGAGGTTGAAACCATCCTTAGCCAAGAAGCTTAACCCAGAAGGCTATAGCCCTATCCACTTGGCATTGCAAAATAACCAAACAAAAACTGTGTTGCGGCTCGTCGATATCGATCGAGATCTCGTATGCATACAAGGGAGAGATGGCTTAACTCCATTGCACATTGCAGCCTCGGGAGGCATGGATGATATATTGGCCAAGTTTCTAACTTCATGCCCAAAATCTATTAAACAATTGACCAATCGTAATGAGTCTGCTCTACATATTGCTGTTAAACGTGAAGATATTGAATCGGTTAAAATCTTGCTCCAATGGATCCAACAAACATGCATGACCTCAATACTTAATTGGTGTGATGACGAAGGAAACAATGTCATGCATCGTGTCGCGCTCGGAAATCAAATTGAGGTCAGTATCAAAAAATTTTAAATATAGCATAGCTTAATTGATTAATATATTGTATTTATTATTAAAAGGTAGGACTTGCAATAATCTCATAGAATGATCATCTAACTTGTTGGGATAATCAAAACTCAAAAGAAACCTATTTAATTTTTAGTTTTTGATTTTTTTATAAACTCAACTCCAATATTAAAAAAAAAAAGTAAATATTTTATGATTTATTAACTCTAAGTTTTTGTGGAATCTTACTTGGGTTAGAGTGGGAAAAGCTAAACATTTCCCGTAAAGATGTAAAAATCTCTCTCTAATAGATGTATTTTAAAATAGTGAAGTTGATAGTATATAATGGGTCAAAGCAAACAAAATCTTCTACCGAAAAGCGGATAATATCTTCTATAAAAGTTGACATGTAACAAATGTTTGAACATACGAAAAAAGCTGGCCTTTGGTGCAGATGGTGAAGCTTCTCATAAACAAAGTGGATATGAAAGCAAAGAATTTGGAAGGCAAGACAGCACTAGACATAATGAAAGAACATGGCCTTGTTGAAGACAAAGAGGTAAAAGAGATGCTCCATGGTTCCCACATTCTCAGAGATGTTGC
AAAATTTGTGTGTTTAATATTATTGTTTGTGAAGAGATTAGTGATTACGGACCATCATGAGATCCTTTACATGTCTAAAAAGGATCGAAATGCCATATTAGTAGTTTCTGTGCTCATTGCCACTGCAACTTACGAAGCAGCTCTAAGTCCACCCGAAAAAGATATGGATTTCTTCCCTCCGGAGTGGATCTCATACGTATGCTTCCAACAGTTAAATACAGTTGCATTCGTTGCCTCAATGATAGAGATTTGTGTTCATCTTCCTTCTGGGATTGGCTACGCTCTCCATTTGGTTCTTCCTTTGGTCATTTGCTATGCATTATTGGCGATCGTTTGGAGGTCGTACTTTCCTTTAGTTATCACNGAAAAATGTCTACTCATGGAAACTGCTTTCGAATCTTTCTTTGATGAAGTAGGATTCTATTCGACTATACTAGGTTCTAGGTGCTTGTTTCAATCCGTAGAGAGCCTTTTGGAGTTCGTACACCATCTCCTCACTTCACTTTTTCTCATAGCTTTTTGGTTGCTCGACAAAAATATCTTCTTTTAACTCACCACATAAAAAAGTTGATTTGATATCTAATTGATAGAGTTTCCAACCAACCTTTGCGGAAGGAGAAACTCTCCTGCATGACCCTGCAAAAGTAATGATCATTCTCACAATATCCATTCTGGCCACAGGTGTGTACACTTACGTATAATCAATTTCATACTCTTGTGGGTAGCCTTTTACCACCAGTCTTGCTTTGTATTTATCAACTTCTCTATACTCATAAAGCTTGGTCTTATAAATCCATTCCACTCCTATTTTCTTTGCTCAAACCGGTAAAACCCCCAAGTCCTTATTTGATTTTTTCAATGAATTTGATTTCTTTATCCGTAGTCAATTGTTGAATTCTGCAAAGTTAATTATCTTCCTTTATGTTTCCTTAAACATTTTATAGGCATTCCAATTTTCTTTTCCACCATTATTTTTAAATAATTGGAATGATAAAGTGTTTATGGTTTTTCTACAAGAAAATAACTCCAAAGTCTTCCTTGAAAAATCATCTATAAAGCACAACAAATATTTTTTCTAGCTACTTGAAGTTGGTGAAATTGGACCACATAGATCTACATGAGCAAGGCCTAATTTTTCGGTTGCTCTCCATTTACCTTTTTTTGGAAATGGAGTCCAATGTTACTTGCCTTTCATACATGCCTCACAGGTGGTGTTTGGAACTACAATTTGTGGCTGACTTTTCACCATTTTTTTGTGTTTCAAGGTGCACAAGCCTTTGTAGCCGAGATGACCATATGATGGTGCCAAAGTGTCGACTGGTCCATGTTATAAACTTAGAGGCATCTCCTTCATTTGTTGTGGCATATGGTTCAGCAAGCAAAATGAACAACCAATTTGCACTCATAATTGACTCTACCATTTTTCCTTTCTGTGGATGATAAATGCTACATACACCATCTTTGAACAACACTGCTACTCTTTTTTCTTGTAGTTGTTCAACGCTCAAGAGATTATTTTTGAGTTCAGGTACCCAATACACATTACTAATAGTGTAACGAACCCCAAGCAAAGTTAATTTCACTACACCCTTTTGAGTAACTTTCATACTTATATTATTTCCAAGCTTGACAGTATGAGAAAAGGTCTTGTCTAAGCTTGAAAACATACCTTCATTTGCAAACATATGGTTTGATCAACCAGAATCTAAGAACCAAGCATCACTCCTTTTTGCTTCATATCTTTCCATATGAGCCATTAACCTCATTTCATCCTTTTCTTCCTGCTTTCATAATTCAATTACTTTTGTTCATCTTTGGACATTCATATTGAAAATGACTTAAGTCATGACACCTATAACATTCAACAGTTGCTTTGTTGAAAGTTGATCTTCCTCTTTCACGACTTTGCTCTTGTAAGTTCCTCTTCCTCTATTACTTATTTTCATTCGAGCTTCGACTTTTAGAACTTGCACCTCTTCATTAATGTTGGATCTTTGAAA
TTTTTGTTCATGCACCACTAATAAACTTTGTAATTCATCAATGGACATGTTATCAGTATCTTTAGATTTCTCAATGAATACTACATCAGAAGTAAATTTTCCAAGCAGGGTACGCAAATTTTTTTCTACAATTTTCTTGTTTGGCATATCTTCTCCATACTTGTCATCTTGTTGGAGATTATTATCACTCTTGCAAAATGATCTGTGATACTTTCATCCTTCTTCATCTCTAGGATCTCAAATTCTCTCCTTAATGCATTAAGAAGAGATTTATTCCCTTCTTGGTTTCCACCAAACTTTCGCTTCATTGAGTCCCAGACAATCTTGGTTGTGCAATGATCCAAGATTTGTTCAAATACAATACGATCAATTGCCTGAAAGAGATAATGCTTTACTTGATGATGTTTGAGACTAGCATCATCAAGGTGGCCCTTTAGTGGCATTATTTCTTCCACTGGTTCTAAGAAACCAATCTCTGCACACCATAGACCCTTTGCTTTGAGCAAATTTTCCATCAGCTCACTCCAATGATCATAGCGATCATTGAAGTGAGGGATCTTCNATGGTAGTAAAGCTATATATAGTGGACATATAAATGGTAAACCAAATTGTCATGTCCATATCTATCCCGTAAAGTTATATATAATATATGGTTATATCTGGTAAAGTTATATATACCGCTTTGAAAATGTACTTTTTACATTTTTTCTATCTTTTTCTACGGTACATACATGAAGTCATCAATATCAACTTTTAACCAATATTCTTGTCACGGCCCGACTTTCGAGGCTTCGAAAGACTGAATCGTGACTAAAACGACGAAATCGAAAACAAATAAAAATAAATAGAAGGTAAAAATCCGATAAAATCGGTAAAAATCCGGTAAAATCTAAAACAAGAAAAGGAAATTAAGGACACTTAAATTAAATATAAAATCAAAGTCAAATGTTACAAATGGGAATTACAAATACAAAATGAAATACAAAAGACTCTAAAAGACCGACAAGGGACTGACGCTCCAAAACGATACCCCTCTGTGACAGTGCAGCTCTCGAGCACCCCCCCGTATCGACTGGTACCTGAACAAAAAAAAAGGAGAAAGGGGTGAGTATAAAAATATACTCAGTAAGCAACCAACTTGTAGGCACTTTTCGCATCCTTGTCTTAGTAGGTATTCACGGTCTTATCTCTAGATTCTAGTGAGTGCGAATCACGCTACAACTCTCTTTTTTGTATCTTAGGAGGCTTGCCCATCTTATGCAGTGGAATGATCAGTGTCCTGAGGTGAGACGCGACCTCATAAGTAAGTATGAGGGGATCGTNATTTACTATAATTGATGTTTTAAAAGTGCCTAAAATCCAAATTGCTTCCGTTATATTAAGACGCGTGTCTTTCCAAGCTCGAATTATTTGCCGAAAATGGCTCCAAAAATTCTGAAATTTTCTTGATAGGTTCTAATTCACATAAAAATAGTATCAGATTTGAAATCTGAGGCAAAAAAACAACCAAACGGAGGCCAGACGGGTGAGAGAATGGTGTCCCACACGCCGTGCACACGCCAGTAGGTCTCCACACGCCCACACGCGCAGGGCACACGCGCGGGTAGTGCCGTGTTTCGAACCCCAACGACCGTGAACTTACCTTCGAATTTTATGAGCACGACACGTGGCACACAGTGTTTGGCGGGAAACGTCGGGTTGGACTGCAGGTTTTCGAACCGGTTCGGCGAGCGTACGGGTCGACCCGACTTAGCGCACTAGTGGAACACTGCCATGTGATGGACGCCGACTCCGCCGCGCTCCTCCGTTTGCCAGTCAACGCGTGGCGGTCTCTTTCTCTCTCCTACGTGTGTTTCAAATCTCGAGATGTTAGCATTTTCGTGTTTTCAATCTATTTTCTTCGGATTTTCTTATCTTTCAATCCATGAAAATTGTAGATCTCATATTCTACTTCTACAATATATTTTTACTTTCTTTTTTTTTTTCTTGAACACAGATGA
GTTAAGGCCGTCCAAAGATACGGAAATACTCACATTTTCTGTTTTCGATCGGTTTACCTGAAGTCCAACTAAACTCCGATTTTCGTCCAAGAACTTCAAGAAACCTGTGTGAATCTTATTTTAGTTACATACGTCAAAAACTCCAGATCTAAAACTCAAGAAATTCCTCAGATCTAGATAAAATCGAAACAACAAGTTCGGAACCTTACTTTGTCCGTTCTCGAATTTTCTTGCCGGAATAAATGATTCCACCACCCAAACTCCTTTTCCAGCACGAGAGAAGTATTGCTACGAAGTTTCTTGCCTTAGATTCCCCAACTGCCGTTTGTCTCTTCCTCTCTCTCAAATCTTAACTTCTTCTCCTTTCTCCTTTCTCCTTTCTCCTCTCTCTCTCCCTCTCTCTCTTTTCTTTTCTTTTCTTTTCTTTTCTTTTCTTTTCTTTTCTTTCTCTTTCTTTTCTTAGCCCAAACAACTAGACCCTTAGGCAAGAGTAGGAGCAAAATGACCCTTTTGCCCCCAATGTTATTATTATTATTTTTATTTTTTTATTTGTTTATCTAAAAATTCGGGTATTACAATTCTTCATTGTATTTTCTCTCTCATGTGCTTTGAATTTTGGATTTAAGTACATCATTTATTCTATCTCCAAGTATGGAATTGTTTCAAAATTTCAAATCATAAAATTTCGGGAAGGTGTATCTTGCTGACAGCAAAACGTTGGATTGAAGGGAAGGAAGATATCTTCATACAAACTCCATCAGGAAATCAATGGACATTAGAAGATGCCAAATATCTTCTTGGTCTCAACAAGAATTTGATCTCTATTGGTCAGTTGGATAGCACAGGCTAAGCACTAGAGTTTGGAAAGAGTTGGTAAAAGATTGTGAAGGGTGTCATGCTTGTTGCACGTGACACAAAATCTGGAACCTTATACACTGCTGCAGAGGGTATAAACATGGTTGTTGTGCTCAGAGTGCTTCCAATTCAAGTCTATGGTAGGATAGACTTGGACATATGAGCGTCAAAGAAATGAAGTTGCTGACTGCAAAATAAATTTTAGAAGGCCTAAAATCTATTAAGTTTAATAGGGGTCTTTGTGAGAGTTGTGTTATGGGACAACATAAATGAGTTAGCTTGACAAAGGTTGTCAAAGAATCGAAGAAAGTGCGGTTGGAAATGGTTCAAAAAGACGTTTGGGGACCATCTCCAGTTTCATCACTTGGTGGATCAAAGTTCTACATCACCTTTATAGATGATTTCAGCAGGAATGTATGGGTTTACTTCTTGAGACACAAGTCAGATGTGTTTGCCACCTTCAAGAAAATAACCACAAATTAGTTAAACTAAGTAGCTNCCCGTTGAGTTTGCATGAATGTTTAGATTATGTTTTCCCAGCTTTAAACGACCGATAGGAATATCTCTCTAACAAAGCATACATGAGTGAACCAAGTTAAGATTAGGGCACTAGAATTAGAGTGATTTTCACATTGACCTAGGTCAGTACGTTTTCAGTAGACCGAGCACCCTAGATTTGTGAGCAGCGATCAAAGGTCTAGACCTAAGTGCACCCAGAAATTGGATTAGCCTTGAGTAGCTAACTTCGAGTGTTTTAGAAACGCGAGATGAGGACTAAACCATCTAGATGTCCCTAAAGACCCAGTAAACCTCCACGACATCTATGAATGATTAAGAAAGTTTAAAATGCTTACTAGCAATGTGAATGGTTTCATTAGAACCCTGAGGTTACTTAAGTTACTAATGAGTGTAGGGTTGGATCAAGGATAACGTGCCACTCAAGAAACCAGTCTGCAGGAGGGACTTGAGACTATATGGGTAGTTAGCTTAAGACAAGGACTCCTATTGAATGTAAAGTGGGTCAAAGTTGAAGCCGCGATGCCATAATCGGCCTTGGAAAGGGTTTAAAATTACCCAGGTAGATGACTTAGAGTGAATGTTTCTATTAAGTGTAGGAACAGGACAAAGTCTAAGACAAAAAATAT
GTAAAATGGTCTTGAGAGTGGACTTCGAGCTCATGATTTCTGCCACTAATGAGTGTAGAAACAGTTCTCGGTCTCCTGATTTAAACAACATTGCTTAAAATTCACTATTTTGAACGAAAATCAAGTTTGATACAAAATTAAACTTAGGGAAGATCTATGGATTGAAATGGATATATTACCTAGTTATTTTGATCCTAGGTATACTTTTCAAGTTTTGGGTAATGAATAATGAGAGAGAAATTAACAAGAACATGCGCAGAAATGGACTCGTAATTACGCTCCTCAACCATTCAACCAATAACCTTATCCTTACCTAACAAGTCAGTCAACGCCCTCGTTCATGTCAATAATGGAGCTGCAGAAGCTTGACTATCTTAGCTGCTAAGAAAATCATTGAGAAGACTCCAGGAAATGAACTCGAATTGGCGGTCTCCACACAGCTCTTCATCCATCTCATTTGTAAGTCTAATTTTAGTCTCAGTTTTGGTCCATTTCAGCGCAACATATATATTTATATATAATATATAATCATTGAATTGTGCTCATACTTAAAAAACTACCACAATATTGATCCTTAATCACGCTATTTTGTTCATGCTGCAGTTTTCTCAAAAGAAAAATGAACCAAAGGTTGAACGCAGTTGCATCTTCGGGAGATATCGACTCCTTATATTTGATACTTCAAGAAGATGCCTACATTTTAGAACGCATAGATCAGGTATCAACATATTTATTCTTGCATTTGAAATGAAAGAATTCATGGCATTGAATGGTTGGACCATATATTATTTTAAATGAAATAAAAATTACCATCTGAATGGTCAATACTTTAATTTTTCAACCTTATAATACCCATGAAGACGTGACCTAGAATTTCAAACAAGCCTCTAAAGAGGGCAACTGTTGACATGAGGATTCTCTCTTGGGGTCTTTGGCTTTACTCCCGGTGTATAAAGCAACGTTTAGGTCCCTCAATTGTCATAGGGGTCTCTCACTCGGGTATTAGGAGGGCATTGTTTGCCTGCCCAAATTGATCCTGTGAGGGTAACACGACTAATTCTTCATCTCTCTTAGGCAGATAAGCTACCTTTATGAAAGACCATTTTGTTTAGATGTTAGGAGAACTAATGAACAAGAAAATGCCCCTTCCAATATCGAGAGATAGACTAAGATACTTGTATCGAGTAGTCATAAAAGTAGGGTCTCAAACAAGTTATCAAACAAGTTATTTGAAGGGAGAAAATAGATACCTTTATATGTTCATATAAATGAAAGAACTTGGGAAGAAACTCAATAAAATATTTAAAGATGCTTGATCATAAAAAATTGATTTTTTTTAAAAAATGAAACATCTAAAAGCTATTTTGTGTACACTATTATACTAATATATTAAATAGTAACAAAAACTCAAATGGTTAAAAGACATTATAGGTCAAATTGTTATCCCACTTATCCATGAACTCAAAACAGCTAAAGATTTTTATTGTTCATATGTTTAGCATCACTATTAGGGTTTGGGATTGATTTGTGTCTTNTTCTATAAAATACATCATAATCTACTATACCTCAAGTACCTAAAATTATTTTAGCTGCTGCAAAATGTTGTTGTTTTGGACATAAAATGACTAATCAAACTTACAACAAACATGAGATCAAGACAAGTAGTTATGAGATACATTAGACTTCCCACTATTTGTTTATACAAAGTTACATCTACTTTTGCACCATTTTTATCTTTGCAAATTTTTTGTCTTGGAACAATAGGATTATCAACAGAATTGCAATTCTCGATGCCAAATCGATTTAAAACTTCAACCGCATGCTTCCTTTGACATGTAAAAATACCATCTAATCTTTGCATTACCTCTGTTCCAAGGAAAAATCTCATTTGACCTAAGTAAATCATATCANGTTGCTGACTGCAAAATAAATTTTAGAAGGCCTAAAATCTATTAAGTTTAATAGGGGTCTTTGTGAGAGTTGTGTTATGGGACAACATA
AATGAGTTAGCTTGACAAAGGTTGTCAAAGAATCGAAGAAAGTGCGGTTGGAAATGGTTCAAAAAGACGTTTGGGGACCATCTCCAGTTTCATCACTTGGTGGATCAAAGTTCTACATCACCTTTATAGATGATTTCAGCAGGAATGTATGGGTTTACNGAAAAATGTCTACTCATGGAAACTGCTTTCGAATCTTTCTTTGATGAAGTAGGATTCTATTCGACTAAACTAGGTTCTAGGTGCTTGTTTCAATCCGTAGAGAGCCTTTTGGAGTTCGTACACCATCTCCTCACTTCACTTTTTCTCATAGCTTTTTGGTTGCTCGACAAAAATATCTTCTTTTAACTCACCACATAAAAAAGTTGATTTGATATGTAATTGATAGAGTTTCCAACCAACCTTTGCGGAAGGAGAAACTCTCCTGCATGACCCTGCAAAAGTAATGATCATTCTCACAATATCCATTCTGGCCACAGGTGTGTACACTTACGTATAATCAATTAAAAGTAATGATCATTCTCACAATATCCATTCTGGCCACAGGTGTGTACACTTACGTATAATCAATTTCATACTCTTGTGGGTAGCCTTTTACCACCAGTCTTGCTTTGTATTTATCAACTTCTCTATACTCATAAAGCTTGGTCTTATAAATCTTTTACCACCAGTCTTGCTTTGTATTTATCAACTTCTCTATACTCATAAAGCTTGGTCTTATAAATCAAATCCACTCCTATTTCCTTTGCTCAAACCAGTAAAACCCCCAAGTTCCTTATTTGATTTTTTCAATGAATTTGATTTCTTTATCCGTAGTCAATTGTTGAATTCTGCAAAGTTAATTATCTTCCTTTATGTTTCCTTAAACATTTTATAGGCATTCCAATTTTCTTTTCCACCATTATTTATAAATAATTGGAATGATAAAGTGTTTATGGTTTTTCTACAAGAAAATAACTCCAAAGTCTTCCTTGAAAAATCATCTATAAAGCACAACAAATATTTTTTCTAGCTACTTGAAGTTGGTGAAATTGGACCACATAGATCTACATGAGCAAGACCTAATTTTTCGGTTGCTCTCCATTTACCTTTTTTTGGAAATGGAGTCCAATGTTACTTGCCTTTCATACATGCCTCACAGGTGGTGTTTGGAACTACAATTTGTGGCTGACTTTTCACCATTTTTTTGTGTTTCAAGGTGCACAAGCCTTTGTAGCCGAGATGACCATATGATGGTGCCAAAGTGTTGACTGGTCCATGTTATAAACTTCGAGGCATCTCCTTCATTTGTTGTGGCATATGGTTCAGCAAGCAAAATGAACAACCAATTTGCACTCATAATTGACTCTACCATTTTTCCTTTCTGTCGATGATAAATGCTACATACACCATCTTTGAACAACACTGCTACTCTTTTTTCTTGTAGTTGTTCAACGCTCAAGAGATTATTTTTGAGTTCAGGTACCCAATACACATTACTAATAGTGTAACGAACCCCAAGTAAAGTTAATTTCACTACACCCTTTTGAGTAACTTTCATACTTATGTTATTTCCAAGCTTGACAGTACGAGAAAAGGTCTTGTCTAAGCTTGAAAACATACCTTCATTTGCAAACATATGGTTTGATCAACCAGAATCTAAGAACCAAGCATCACTCCTTTTTGCTTCATATCTTTCCATATGAGCCATTAACCTCATTTCATCCTTTTCTTCCTGCTTTCATAATTCAATTACTTTTGTTCATCTTTGGACATTCATATTGAAAATGACTTAAGTCATGACACCTATAACATTCAACAGTTGCTTTGTTGAAAGTTGATCTTCCTCTTTCACGACTTTGCTCTTGTAAGTTCCTCTTCCTCTATTACTTATTTTCATTCGAGCTTCGACTTTTAGAACTTGCACCTCTTCATTAATGTTGGATCTTTGAAATTTTTGTTCATGCACCACTAATAAACTTTGTAATTCATCAATGGACATGTTATCAGTATCTTTAGATTTC
TCAATGAATACTACATCAGAAGTAAATTTTCCAAGCAGGGTACGCAAATTTTTTTCTACAATTTTCTTGTTTGGCATATCTTCTCCATACTTGTCATCTTGTTGGAGATTATTATCACTCTTGCAAAATGATCTGTGATACTTTCATCCTTCTTCATCTCTAGGATCTCAAATTCTCTCCTTAATGCATTAAGAAGAGATTTATTCCCTTCTTGGTTTCCACCAAACTTTCGCTTCATTGAGTCCCAGACAATCTTGGTTGTGCAATGATCCAAGATTTGTTCAAATACAATACGATCAATTGCCTGAAAGAGATAATGCTTTACTTGATGATGTTTGAGACTAGCATCATCAAGGTGGCCCTTTAGTGGCATTATTTCTTCCACTGGTTCTAAGAAACCAATCTCTGCACACCATAGACCCTTTGCTTTGAGCAAATTTTCCATCAGCTCACTCCAATGATCATAGCGATCATTGAAGTGAGGGATCTTCGTTAAAGTTTTGTCTTCACTCATACTCTTTGTATTTGATCATTCAAAAATTTCTATCTTGTTGCTCTGATACCAATTGTAAGGTTTTGAAAGAAGAATGACTAAGCTAGAAACATAAACAATTGGAGTTTCCAGTAGTGACGATATTATTGAGTCCAAAAATATGACAAATTAGTTAAACTAAATAGCTAAAATTACTCTAACTATCCCAAAAGAAACAATGTGTATTTAATCATATTTGACGTAATCAAATTCTTCTAACTCGTGGTTGACACTAATTTGTGACAACATATTTTTCAAGCATAATGCCACCTCAACTAGCAATACTTCAAAAAAAAAATGTGGGAAGGGGTGAGTATAAAAATACTTAGTAAGCAGCCTAGTTGCAAGCTCTCATCACATCCTACCGCTTGGCAAGGGCACTCCAACGCTAAAGGCATATCAGAGGAAATAGAAGGTACAAACTCTGGTGAAACCGTTACATTGCTATTAGGCTTGTTAAGCTTTCTTAGTCGTCCTTGTTGTAGAGATTTACTAGGTTCTTAGGGGCTAAATCTCATCTCATTGTTTCAAAGATGTTTAGGACTATCCCCATGGGGTTAATCCAATCTCTAGGTGCACTTAAGTATAGCCCTCTGATCGTCACTCACAATCTTAAAATGCTTATTCCATTGAAACATACTAACCTAGGTCAATATGTAAACTGCCCTAACTCTAGTGCCCTAATATTAACTTGATTCACTCATGCATGCTTTATTAGAGAGAGATTTCTATCGATCACATAAAGCACAAAAAATATATTCTACACTCACATTGGTCATCCTTGGGTGTTGAGGACGTCTACTGGGTTCCGGGAGCAGCTAGGTGGTTTAAATGCAAACTTTCAACTCCAGGATACTCTATTTTGACTCCTCAAGTCTTTTTTTTTAACACTAGGGTCTAGACCTTTTGATCGATGCTCACGATCCTAAGATGCTCAATCTACTATAACTGTCTTGACCTAAGTCATCGTCTAGCCAACTTAACTCTAACACCTTAGCTTATCTCTATATAAATAATAAGCTCTATAGGGAGAACACGTAATCAATCACCTAGAGCTAAGCAAACTATATCAAGCATTCATACAAGCTCAATGAGGAGATAACATCTATCAATCGCATAGATCATAATATCACATCCTTCGTTCTAGTACAGTGGAAAGCAATAAAAGTTGGCATGCATCTTAAATTCCCATCCTAGAGACACTTTGGCCATTTTTTATGACATACATTGCATTTTCATCCAAAAATTCCCTTCTAAGGTCCTAAACAAGCATATTATAACCACCTATTTTAAGATATGCTCACATGCTAGCACGTCCTAGCAATTCGTACAAGTATTACTTAAGAAAATTCGCATGGCTACTACAAATTGCTAATTACATGGATGAAGCGTGACTTCCCAAGCATGATTTTTGATTACTAGAAGCCCCAAATTTTTTGAAAGATCCTAACTTGAATCATGATCACAT
CAGTTTTGAAATCAGAGCAAAAAATAGTCGAACGAACTAAAAAGCTGAACTCAAACACCGCCACACGCTTTCATACATGCAATAGGTCGCCTGCATGCAAGTACAGACTCGTATTTGGAGTAACAGTCGTGCTCACGTTGCGACTCATGGAGCCTTCATATTAATGCAATCGAATCGGGTCGAAGCAATGGGCAGGCAGTTTGGGTCAAATGCTGTAAGTTCAGGTCCAATCGATTGGTTCGTGTCACCCTCAAAAGTCTCGACGTGTGGCGCAATCTAAGCCAGACATACTTCCTCTTCGTTAGCCAACGTGTGTCTTCGACCATAGCTTCACACATGTCCTCCACCATCACTTGACCTACATTTCACCTACCATTTGAGAGTTTTCTAGATTCTTAAATCAATTTTCAATTTCTTTCTAATTTATTTGGATTTTGTATATTTTTCTACATAACTTTGATTCAACTTGGTTCCTTCAAGATTTTTAATCCTTTTAAAAAAAAATTATTTCCATCTTAATCCGTTAGCTAGATAGAAAAATAAATGCTTCAAAAATCAGTGTAAAATACATAAATGATAATTTCTTGACGATCGTTGTAACTCCAACTATCGATCTGAATTGACTCGAGGTTAGTTCATGGATAGTGTGTGAAACCTTCCTTGGAATAACATATTAAAATTTCATAATCTAACTCTTCCCAAATCTCTCGAATTTGAAACAAATTGAGACAAAAATTTTGAAATTTATAACACCGTTCTTTCTTGATTCTCGATAAATTTAAGTGATTCAAGTATACAGATTGATTCTCCCCTTGTCTTTGAGCATTTCTTGTGAAGGAATTTTTGCTAAACTCTAACTCTAAATAGTAAGTTCAATAGATTCATTCTTCTTTTTTTAAAGTCCTCAACAAAATCTCGAAACATAGAAATTTAAAGAAAAGTTCAAGAAATTAATTCTGGTAGGGAGTTGGATGTTATCCATATGAATCAAAGGTTAGGAGCATATATTTCCTTGTTATACTAGTTTTTGCTTCTGTGTTTAGTCCCCTCATCTTTCTTTTACTATTATTATGTTTGGGATTGTCAAGCTCAAACGAGTGCTCTGA

mRNA sequence

ATGAACCAAAGGTTGAACGCAGTTGCATCTTCGGGAGATATCGACTCCTTATATTTGATACTTCAAGAAGATGCCTACATTTTAGAACGCATAGATCAGGTACCTTTTGTTGATACCCCATTACACATCTCAGCATCAGCAGGACATGTCCCTTTTTCTTTGGAGATCATGAGGTTGAAACCATCCTTAGCCAAGAAGCTTAACCCAGAAGGCTATAGCCCTATCCACTTGGCATTGCAAAATAACCAAACAAAAACTGTGTTGCGGCTCGTCGATATCGATCGAGATCTCGTATGCATACAAGGGAGAGATGGCTTAACTCCATTGCACATTGCAGCCTCGGGAGGCATGGATGATATATTGGCCAAGTTTCTAACTTCATGCCCAAAATCTATTAAACAATTGACCAATCGTAATGAGTCTGCTCTACATATTGCTGTTAAACGTGAAGATATTGAATCGGTTAAAATCTTGCTCCAATGGATCCAACAAACATGCATGACCTCAATACTTAATTGGTGTGATGACGAAGGAAACAATGTCATGCATCGTGTCGCGCTCGGAAATCAAATTGAGATGGTGAAGCTTCTCATAAACAAAGTGGATATGAAAGCAAAGAATTTGGAAGGCAAGACAGCACTAGACATAATGAAAGAACATGGCCTTGTTGAAGACAAAGAGGTAAAAGAGATGCTCCATGGTTCCCACATTCTCAGAGATGTTGCAAAATTTGTGTGTTTAATATTATTGTTTGTGAAGAGATTAGTGATTACGGACCATCATGAGATCCTTTACATGTCTAAAAAGGATCGAAATGCCATATTAGTAGTTTCTGTGCTCATTGCCACTGCAACTTACGAAGCAGCTCTAAGTCCACCCGAAAAAGATATGGATTTCTTCCCTCCGGAGTGGATCTCATACGTATGCTTCCAACAGTTAAATACAGTTGCATTCGTTGCCTCAATGATAGAGATTTGTGTTCATCTTCCTTCTGGGATTGGCTACGCTCTCCATTTGGTTCTTCCTTTGTCCCCTCATCTTTCTTTTACTATTATTATGTTTGGGATTGTCAAGCTCAAACGAGTGCTCTGA

Coding sequence (CDS)

ATGAACCAAAGGTTGAACGCAGTTGCATCTTCGGGAGATATCGACTCCTTATATTTGATACTTCAAGAAGATGCCTACATTTTAGAACGCATAGATCAGGTACCTTTTGTTGATACCCCATTACACATCTCAGCATCAGCAGGACATGTCCCTTTTTCTTTGGAGATCATGAGGTTGAAACCATCCTTAGCCAAGAAGCTTAACCCAGAAGGCTATAGCCCTATCCACTTGGCATTGCAAAATAACCAAACAAAAACTGTGTTGCGGCTCGTCGATATCGATCGAGATCTCGTATGCATACAAGGGAGAGATGGCTTAACTCCATTGCACATTGCAGCCTCGGGAGGCATGGATGATATATTGGCCAAGTTTCTAACTTCATGCCCAAAATCTATTAAACAATTGACCAATCGTAATGAGTCTGCTCTACATATTGCTGTTAAACGTGAAGATATTGAATCGGTTAAAATCTTGCTCCAATGGATCCAACAAACATGCATGACCTCAATACTTAATTGGTGTGATGACGAAGGAAACAATGTCATGCATCGTGTCGCGCTCGGAAATCAAATTGAGATGGTGAAGCTTCTCATAAACAAAGTGGATATGAAAGCAAAGAATTTGGAAGGCAAGACAGCACTAGACATAATGAAAGAACATGGCCTTGTTGAAGACAAAGAGGTAAAAGAGATGCTCCATGGTTCCCACATTCTCAGAGATGTTGCAAAATTTGTGTGTTTAATATTATTGTTTGTGAAGAGATTAGTGATTACGGACCATCATGAGATCCTTTACATGTCTAAAAAGGATCGAAATGCCATATTAGTAGTTTCTGTGCTCATTGCCACTGCAACTTACGAAGCAGCTCTAAGTCCACCCGAAAAAGATATGGATTTCTTCCCTCCGGAGTGGATCTCATACGTATGCTTCCAACAGTTAAATACAGTTGCATTCGTTGCCTCAATGATAGAGATTTGTGTTCATCTTCCTTCTGGGATTGGCTACGCTCTCCATTTGGTTCTTCCTTTGTCCCCTCATCTTTCTTTTACTATTATTATGTTTGGGATTGTCAAGCTCAAACGAGTGCTCTGA

Protein sequence

MNQRLNAVASSGDIDSLYLILQEDAYILERIDQVPFVDTPLHISASAGHVPFSLEIMRLKPSLAKKLNPEGYSPIHLALQNNQTKTVLRLVDIDRDLVCIQGRDGLTPLHIAASGGMDDILAKFLTSCPKSIKQLTNRNESALHIAVKREDIESVKILLQWIQQTCMTSILNWCDDEGNNVMHRVALGNQIEMVKLLINKVDMKAKNLEGKTALDIMKEHGLVEDKEVKEMLHGSHILRDVAKFVCLILLFVKRLVITDHHEILYMSKKDRNAILVVSVLIATATYEAALSPPEKDMDFFPPEWISYVCFQQLNTVAFVASMIEICVHLPSGIGYALHLVLPLSPHLSFTIIMFGIVKLKRVL
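
The protein sequence above is the conceptual translation of the CDS. As a rough sketch, assuming the standard nuclear genetic code (NCBI translation table 1), the relationship can be reproduced with a small codon lookup. The names `CODON_TABLE` and `translate` are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated with
# bases in T, C, A, G order and the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Codons containing ambiguous bases (e.g. N) are rendered as 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 33 bases of the CDS listed above.
print(translate("ATGAACCAAAGGTTGAACGCAGTTGCATCTTCG"))  # MNQRLNAVASS
```

The output matches the first eleven residues of the protein sequence, and the final codons of the CDS (`AAA CGA GTG CTC TGA`) translate to `KRVL` followed by a stop, which agrees with the protein's C-terminus.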
BLAST of CmaCh19G003970.1 vs. Swiss-Prot
Match: Y3236_ARATH (Ankyrin repeat-containing protein At3g12360 OS=Arabidopsis thaliana GN=At3g12360 PE=2 SV=1)

HSP 1 Score: 80.9 bits (198), Expect = 3.2e-14
Identity = 76/283 (26.86%), Positives = 126/283 (44.52%), Query Frame = 1

Query: 40  PLHISASAGHVPFSLEIMRLKPSLAKKLNPEGYSPIHLALQNNQTKTVLRLVDIDRDLVC 99
           PLHI+A  GH      ++    +L++   P   +P+  A     T+ V +L+    +L+ 
Sbjct: 167 PLHIAAIQGHHAIVEVLLDHDATLSQTFGPSNATPLVSAAMRGHTEVVNQLLSKAGNLLE 226

Query: 100 IQGRDGLTPLHIAASGGMDDILAKFLTSCPKSIKQLTNRNESALHIAVKREDIESVKILL 159
           I   +    LH+AA  G  +++   L+  P+  +++  + ++ALH+AVK +  E VK+LL
Sbjct: 227 ISRSNNKNALHLAARQGHVEVIKALLSKDPQLARRIDKKGQTALHMAVKGQSSEVVKLLL 286

Query: 160 QWIQQTCMTSILNWCDDEGNNVMHRVALGNQIEMVKLLINKVDMKAKNL--EGKTALDIM 219
                    +I+   D   N  +H      + E+V+LL++  D  A  L  + KTALDI 
Sbjct: 287 D-----ADPAIVMQPDKSCNTALHVATRKKRAEIVELLLSLPDTNANTLTRDHKTALDIA 346

Query: 220 KEHGLVEDKE-VKEMLHGSHILR----------------DVAKFVCLILLFVKRLVITDH 279
           +   L E+   +KE L  S  LR                 +   V + L   KR     H
Sbjct: 347 EGLPLSEESSYIKECLARSGALRANELNQPRDELRSTVTQIKNDVHIQLEQTKRTNKNVH 406

Query: 280 HEILYMSKKDR-------NAILVVSVLIATATYEAALSPPEKD 297
           +    + K  R       N++ VV+VL AT  + A  + P  D
Sbjct: 407 NISKELRKLHREGINNATNSVTVVAVLFATVAFAAIFTVPGGD 444
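
Each HSP summary line in these BLAST reports gives identical and positive-scoring positions as counts over the alignment length, followed by a rounded percentage. Below is a small sketch of how such a line could be parsed and the percentages recomputed. The function name `parse_hsp_stats` and the returned keys are hypothetical, and the pattern accepts the positives label with or without its first `i`.

```python
import re

# Matches e.g. "Identity = 76/283 (26.86%), Positives = 126/283 (44.52%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract counts from an HSP summary line and recompute percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, length, _, positives, pos_length, _ = match.groups()
    identical, length = int(identical), int(length)
    positives, pos_length = int(positives), int(pos_length)
    return {
        "identical": identical,
        "positives": positives,
        "alignment_length": length,
        "pct_identity": round(100 * identical / length, 2),
        "pct_positives": round(100 * positives / pos_length, 2),
    }


stats = parse_hsp_stats(
    "Identity = 76/283 (26.86%), Positives = 126/283 (44.52%), Query Frame = 1"
)
print(stats["pct_identity"], stats["pct_positives"])  # 26.86 44.52
```

The percentage is taken over the alignment length, gaps included, not over the query length, which is why a 283-column alignment can cover fewer than 283 query residues.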

BLAST of CmaCh19G003970.1 vs. Swiss-Prot
Match: Y5262_ARATH (Ankyrin repeat-containing protein At5g02620 OS=Arabidopsis thaliana GN=At5g02620 PE=1 SV=1)

HSP 1 Score: 80.5 bits (197), Expect = 4.2e-14
Identity = 53/185 (28.65%), Positives = 93/185 (50.27%), Query Frame = 1

Query: 42  HISASAGHVPFSLEIMRLKPSLAKKLNPEGYSPIHLALQNNQTKTVLRLVDIDRDLVCIQ 101
           HI+A  G++     ++   P L+   +    + +H A      + V  L+D   DL  I 
Sbjct: 96  HIAAKNGNLQVLDVLIEANPELSFTFDSSKTTALHTAASQGHGEIVCFLLDKGVDLAAIA 155

Query: 102 GRDGLTPLHIAASGGMDDILAKFLTSCPKSIKQLTNRNESALHIAVKREDIESVKILLQW 161
             +G T LH AA  G   I+ K +      + ++  + ++ALH+AVK ++ E V +L++ 
Sbjct: 156 RSNGKTALHSAARNGHTVIVKKLIEKKAGMVTRVDKKGQTALHMAVKGQNTEIVDVLME- 215

Query: 162 IQQTCMTSILNWCDDEGNNVMHRVALGNQIEMVKLLIN--KVDMKAKNLEGKTALDIMKE 221
                  S++N  D++GN  +H     N+ E+V+ ++   +V   A N  G+TALDI ++
Sbjct: 216 ----ADGSLINSADNKGNTPLHIAVRKNRAEIVQTVLKYCEVSRVAVNKSGETALDIAEK 275

Query: 222 HGLVE 225
            GL E
Sbjct: 276 TGLHE 275

BLAST of CmaCh19G003970.1 vs. Swiss-Prot
Match: Y2168_ARATH (Ankyrin repeat-containing protein At2g01680 OS=Arabidopsis thaliana GN=At2g01680 PE=2 SV=1)

HSP 1 Score: 73.9 bits (180), Expect = 3.9e-12
Identity = 74/287 (25.78%), Positives = 126/287 (43.90%), Query Frame = 1

Query: 42  HISASAGHVPFSLEIMRLKPSLAKKLNPEGYSPIHLALQNNQTKTVLRLVDIDRDLVCIQ 101
           H++A  GH+    E++RL P L +  +    SP++ A   +  + V  ++D+D     I 
Sbjct: 99  HVAAKRGHLGIVKELLRLWPELCRICDASNTSPLYAAAVQDHLEIVNAMLDVDPSCAMIV 158

Query: 102 GRDGLTPLHIAASGGMDDILAKFLTSCPKSIKQLTNRNESALHIAVKREDIESVKILLQW 161
            ++G T LH A   G+  I+   +      +     + ++ALH+AVK   +E V+ +LQ 
Sbjct: 159 RKNGKTSLHTAGRYGLLRIVKALIEKDAAIVGVKDKKGQTALHMAVKGRSLEVVEEILQ- 218

Query: 162 IQQTCMTSILNWCDDEGNNVMHRVALGNQIEMVKLLI--NKVDMKAKNLEGKTALDIMKE 221
                  +ILN  D +GN  +H      + ++  LL+    +++ A N + +TA+D+  +
Sbjct: 219 ----ADYTILNERDRKGNTALHIATRKARPQITSLLLTFTAIEVNAINNQKETAMDLADK 278

Query: 222 HGLVEDK-EVKEML------HGSHILR-DVAKFVCLILLFVKRLVITDHHEI----LYMS 281
               E   E+ E L      HG  I R D A+        +KR V    HE+    L   
Sbjct: 279 LQYSESALEINEALVEAGAKHGRFIGREDEAR-------ALKRAVSDIKHEVQSQLLQNE 338

Query: 282 KKDR---------------------NAILVVSVLIATATYEAALSPP 294
           K +R                     N+I VV+VL A+  + A  + P
Sbjct: 339 KTNRRVSGIAKELRKLHREAVQNTTNSITVVAVLFASIAFLAIFNLP 373

BLAST of CmaCh19G003970.1 vs. Swiss-Prot
Match: LITA_LATTR (Alpha-latroinsectotoxin-Lt1a (Fragment) OS=Latrodectus tredecimguttatus PE=1 SV=1)

HSP 1 Score: 72.0 bits (175), Expect = 1.5e-11
Identity = 54/209 (25.84%), Positives = 101/209 (48.33%), Query Frame = 1

Query: 5   LNAVASSGDIDSLYLILQEDAYILERIDQVPFVDTPLHISASAGHVPFSLEIMRLKPSLA 64
           ++A AS+G+ D   L+L +D  +LE+ D+  +  TPLHI+A +    F + ++     + 
Sbjct: 504 IHAAASAGNYDVGELLLNKDINLLEKADKNGY--TPLHIAADSNKNDFVMFLIGNNADVN 563

Query: 65  KKLNPEGYSPIHLALQNNQTKTVLRLVDIDRDLVCIQGRDGLTPLHIAASGGMDDILAKF 124
            +   + ++P+HLA + + T     L+DI    +  Q + G TPLH++ S    +  A  
Sbjct: 564 VRTKSDLFTPLHLAARRDLTDVTQTLIDITEIDLNAQDKSGFTPLHLSIS-STSETAAIL 623

Query: 125 LTSCPKSIKQLTNRNESALHIAVKREDIESVKILLQWIQQTCMTSILNWCDDEGNNVMHR 184
           + +    I   +    + LH+A  + ++   K+L          + LN  D  G   +H 
Sbjct: 624 IRNTNAVINIKSKVGLTPLHLATLQNNLSVSKLL------AGKGAYLNDGDANGMTPLHY 683

Query: 185 VALGNQIEMVKLLINK--VDMKAKNLEGK 212
            A+   +EMV  L+N+  +++ A   E K
Sbjct: 684 AAMTGNLEMVDFLLNQQYININAATKEKK 703

BLAST of CmaCh19G003970.1 vs. Swiss-Prot
Match: ANS1B_DANRE (Ankyrin repeat and sterile alpha motif domain-containing protein 1B OS=Danio rerio GN=anks1b PE=3 SV=1)

HSP 1 Score: 68.6 bits (166), Expect = 1.7e-10
Identity = 51/197 (25.89%), Positives = 99/197 (50.25%), Query Frame = 1

Query: 40  PLHISASAGHVPFSLEIMRLKPSLAK--KLNPEGYSPIHLALQNNQTKTVLRLVDIDRDL 99
           PLH++A  G V     ++   PS ++  + N E  + +H A Q   ++ V  L+    D 
Sbjct: 94  PLHLAAWRGDVDIVQILIHHGPSHSRVNEQNLEKETALHCAAQYGHSEVVRVLLQELTDP 153

Query: 100 VCIQGRDGLTPLHIAASGGMDDILAKFLTSCPKSIKQLTNRNESALHIAVKREDIESVKI 159
                R G TPL +AA  G   ++   LT+ P ++     R  + LH+A +     +V++
Sbjct: 154 SMRNSR-GETPLDLAALYGRLQVVRMLLTAHP-NLMSCNTRKHTPLHLAARNGHYATVQV 213

Query: 160 LLQWIQQTCMTSILNWCDDEGNNVMHRVALGNQIEMVKLLINK-VDMKAKNLEGKTALDI 219
           LL+        +       E  + +H  AL  ++++V+LL++  +D   ++ +G+TALDI
Sbjct: 214 LLEADMDVNTQT-------EKGSALHEAALFGKMDVVQLLLDSGIDANIRDCQGRTALDI 273

Query: 220 MKEHGLVEDKEVKEMLH 234
           ++EH   + +++  ++H
Sbjct: 274 LREHPSQKSQQIASLIH 281

BLAST of CmaCh19G003970.1 vs. TrEMBL
Match: W9S5L9_9ROSA (Ankyrin repeat-containing protein OS=Morus notabilis GN=L484_023405 PE=4 SV=1)

HSP 1 Score: 316.6 bits (810), Expect = 3.9e-83
Identity = 174/376 (46.28%), Positives = 246/376 (65.43%), Query Frame = 1

Query: 1   MNQRLNAVASSGDIDSLYLILQEDAYILERIDQVPFVDTPLHISASAGHVPFSLEIMRLK 60
           M  RL+ VA  G+I++LY +++EDA++LE  + VPFVDTPLH++ASAG V F++EIMRLK
Sbjct: 1   MENRLDIVAREGNIEALYYLMKEDAHLLEHFEAVPFVDTPLHVAASAGQVHFAVEIMRLK 60

Query: 61  PSLAKKLNPEGYSPIHLALQNNQTKTVLRLVDIDRDLVCIQGRDGLTPLHIAASGGMDDI 120
           PS A+K N  GYSPIHLA+QN  T  VLRL+DIDRDLV ++GR+G TPLH A   G  D+
Sbjct: 61  PSFARKSNQNGYSPIHLAMQNAHTNLVLRLLDIDRDLVRVRGREGKTPLHFAVECGEIDL 120

Query: 121 LAKFLTSCPKSIKQLTNRNESALHIAVKREDIESVKILLQWIQQTCMTSILNWCDDEGNN 180
           LA+FL  CPKSI+ LT R E+ALH+AVK + +E++K+LL W++      +LNW DDEGN 
Sbjct: 121 LAEFLLVCPKSIQDLTIRKETALHVAVKSDKLEALKVLLGWLEHVGKKGVLNWGDDEGNT 180

Query: 181 VMHRVALGNQIEMVKLLINKVDMKAKNLEGKTALDIMKEHGLVEDKEVKEMLHGSHILR- 240
           ++H  A  NQ +MV+L+I+++D+ AKN  G TALDI  E        +K MLH +  L+ 
Sbjct: 181 ILHIAAARNQTKMVRLIIDRIDLNAKNSAGLTALDISPEQTQPNSGTLKLMLHRAGALKA 240

Query: 241 -------DVAKFVCLILLFVKRLVITDHHEILYMSKKDRNAILVVSVLIATATYEAALSP 300
                   +A  +   + + ++ +I+D+ + LY+S +DRN IL+V+VL ATA Y+AAL  
Sbjct: 241 SALPTVPKLADSLKSKMSWREKWIISDYRKRLYLSNEDRNIILLVAVLFATANYQAALDS 300

Query: 301 PEKDMDFFPPEWISY-------VCFQQLNTVAFVASMIEICVHLPSGIGYALHLVLPLSP 360
             K  D  P  + +Y         F  +N +AF+ASM+ I +HLP+  G  L LVLPL  
Sbjct: 301 FSKGDDDDPSTYFAYYSDGVVRAIFSLVNKIAFLASMVVIYLHLPADCG-ILRLVLPL-- 360

Query: 361 HLSFTIIMFGIVKLKR 362
                I  + ++K+ R
Sbjct: 361 ----VICNYAVMKIYR 369

BLAST of CmaCh19G003970.1 vs. TrEMBL
Match: W9SNY8_9ROSA (Phytosulfokine receptor 1 OS=Morus notabilis GN=L484_002271 PE=4 SV=1)

HSP 1 Score: 302.0 bits (772), Expect = 9.9e-79
Identity = 165/341 (48.39%), Positives = 234/341 (68.62%), Query Frame = 1

Query: 1   MNQRLNAVASSGDIDSLYLILQEDAYILERIDQVPFVDTPLHISASAGHVPFSLEIMRLK 60
           M+ RL   A +GDI++LY +L+ED+Y+LERID VPF+DTPLHI+ASAGHV F+LEIMRLK
Sbjct: 1   MDARLEVAAKAGDIEALYSLLKEDSYLLERIDAVPFIDTPLHIAASAGHVYFALEIMRLK 60

Query: 61  PSLAK-KLNPEGYSPIHLALQNNQTKTVLRLVDIDRDLVCIQGRDGLTPLHIAASGGMDD 120
           PS A  KLN +G+SPIHLALQN +++ VLRLV+++R LV +QGR+G T LH +    M D
Sbjct: 61  PSYAATKLNQDGFSPIHLALQNGKSQMVLRLVEMNRGLVQVQGREGKTRLHFSVEYDMVD 120

Query: 121 ILAKFLTSCPKSIKQLTNRNESALHIAVKREDIESVKILLQWIQQTCMTSILNWCDDEGN 180
           +LAKFL+ CPK+I+ LT R E+ALHIAVK +++E+VK+LL W+       +L   DDEGN
Sbjct: 121 VLAKFLSVCPKAIQILTIRKETALHIAVKNDNLEAVKLLLGWLDHVDKDVVLKLADDEGN 180

Query: 181 NVMHRVALGNQIEMVKLLINKVDMKAKNLEGKTALDIMKEHGLVEDKEVKEMLHGSHILR 240
            V+H     NQIEMV+L IN  D+ AKNL G TALD+++    + +++++ ML     L+
Sbjct: 181 TVLHMATSKNQIEMVRLFINGADVNAKNLVGLTALDVLENQKQLANEKLQRMLLRGGALK 240

Query: 241 DVA--------KFVCLILLFVKRLVITDHHEILYMSKKDRNAILVVSVLIATATYEAALS 300
             +          +   + + ++ +I+D  + LYMS  +RN +LVV+VL ATA Y+A L+
Sbjct: 241 GSSLPSIPTNKDAMKTKMSWHEKFLISDFRKKLYMSNDERNTMLVVAVLFATANYQAILN 300

Query: 301 PPEKDMDFFPPEWISYVCFQQLNTVAFVASMIEICVHLPSG 333
            P      +  E +S++ F  +N VAF+ASM EI ++LP G
Sbjct: 301 NP------YMYEDLSFMLFSFVNHVAFLASMFEIYLYLPKG 335

BLAST of CmaCh19G003970.1 vs. TrEMBL
Match: A0A164V5K6_DAUCA (Uncharacterized protein OS=Daucus carota subsp. sativus GN=DCAR_022796 PE=4 SV=1)

HSP 1 Score: 281.2 bits (718), Expect = 1.8e-72
Identity = 152/366 (41.53%), Positives = 232/366 (63.39%), Query Frame = 1

Query: 4   RLNAVASSGDIDSLYLILQEDAYILERIDQVPFVDTPLHISASAGHVPFSLEIMRLKPSL 63
           +LN VA +GDI+ LY  ++E+ YILE ID++PF+DTPLHI+A AGH+PF++E+MRLKPS 
Sbjct: 381 KLNKVAEAGDIEELYHSIREEPYILENIDKIPFIDTPLHIAAKAGHIPFTVELMRLKPSF 440

Query: 64  AKKLNPEGYSPIHLALQNNQTKTVLRLVDIDRDLVCIQGRDGLTPLHIAASGGMDDILAK 123
           A+KLNP+G SPIHLA+Q +  + V+R++++DR+LV +QG++  TPLH AA  G  +++ +
Sbjct: 441 ARKLNPDGSSPIHLAVQEDHERLVIRMINVDRELVRVQGKNCNTPLHCAADKGNVELIVE 500

Query: 124 FLTSCPKSIKQLTNRNESALHIAVKREDIESVKILLQWIQQTCMTSILNWCDDEGNNVMH 183
           FL +CP+SI  +  R +SALH+AV++ DI +VK++L+W++      IL W DD+ ++++H
Sbjct: 501 FLLACPESILDVNARKQSALHLAVQQNDIHTVKVMLEWLKLLDAQFILGWTDDQSDSILH 560

Query: 184 RVALGNQIEMVKLLINKVDMKAKNLEGKTALDIMKEHG---LVEDKEVKEMLHGSHILRD 243
             A  N +EMVK+LI K+DM A+N + ++A DI +      L  DK+V  M     ++R 
Sbjct: 561 IAARKNNVEMVKMLIPKIDMHARNSDNESARDIFEAQNPDLLPNDKKVTIM---QWLVRQ 620

Query: 244 VAKFVCLILL-----------------FVKRLVITDHHEILYMSKKDRNAILVVSVLIAT 303
            + +   +                   + KR ++++H  I   S  D+N +LVV+VLIAT
Sbjct: 621 KSHYNATLYCDTNDNTSLKESLRKGFPWHKRWILSNHRHI---SLVDKNGVLVVAVLIAT 680

Query: 304 ATYEAALSPPEKDMDFFPPEWISYVC------FQQLNTVAFVASMIEICVHLPSGIGYAL 344
             Y+A +  P      F P  I+Y        FQ  NT AFVA+M  I + LP GI Y L
Sbjct: 681 TAYQAVIQLPP----IFDPASIAYTTHYYFFIFQLFNTTAFVAAMSLIFILLPPGITYVL 736

BLAST of CmaCh19G003970.1 vs. TrEMBL
Match: W9S5J8_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_023404 PE=4 SV=1)

HSP 1 Score: 265.0 bits (676), Expect = 1.3e-67
Identity = 156/369 (42.28%), Positives = 221/369 (59.89%), Query Frame = 1

Query: 2   NQRLNAVASSGDIDSLYLILQEDAYILERIDQVPFVDTPLHISASAGHVPFSLEIMRLKP 61
           +QRL   A  GDID+LY ++QED  IL+RIDQ+ F+D P+HI+ASAGH  F+ E M LKP
Sbjct: 263 DQRLRKAAQEGDIDALYEVIQEDPSILDRIDQIAFIDNPMHIAASAGHAHFARETMMLKP 322

Query: 62  SLAKKLNPEGYSPIHLALQNNQTKTVLRLVDIDRDLVCIQGRDGLTPLHIAASGGMDDIL 121
           + ++K N  G+SP+HLALQN   +TVL L+  DR+LV ++GR+  TPLH AA  G  D+L
Sbjct: 323 TFSRKQNQHGFSPMHLALQNGNDRTVLHLLSADRNLVRVKGREAKTPLHCAAEMGNVDLL 382

Query: 122 AKFLTSCPKSIKQLTNRNESALHIAVKREDIESVKILLQWIQQTCMTSILNWCDDEGNNV 181
           ++FL +CP+SI+ LT RNE+A+H+A K +   +V++LL W+Q   M  +           
Sbjct: 383 SEFLAACPESIRDLTIRNETAIHVAAKNDRFGAVEVLLGWLQHVDMDVV----------- 442

Query: 182 MHRVALGNQIEMVKLLINKVDMKAKNLEGKTALDIMKEHGLVEDKEVKEMLH-----GSH 241
                       VKLL+N++++ AKNLE  TALDI K  G   DK++  +LH     G+ 
Sbjct: 443 ------------VKLLMNRININAKNLEDLTALDISKLQGPACDKDIWNLLHQNGALGAS 502

Query: 242 ILRDVAKFV----CLILLFVKRLVITDHHEILYMSKKDRNAILVVSVLIATATYEAALSP 301
            L  V   V      I +F +R+  T +     MS  +RNA+LVV+VLIAT+T++AALSP
Sbjct: 503 SLPRVTTLVDFLRTKISVFEERITST-YLRRCCMSNDNRNALLVVAVLIATSTFQAALSP 562

Query: 302 P---------------EKDMDFFP---PEWISYVCFQQLNTVAFVASMIEICVHLPSGIG 344
           P               E +    P    +   ++CF   NT+AF+ S+ EIC HLP G+ 
Sbjct: 563 PGGTRQGYGFSTNSTSELNQPASPVAFNDMAPWICFLAFNTMAFLTSVSEICFHLPKGLY 607

BLAST of CmaCh19G003970.1 vs. TrEMBL
Match: A0A061FN51_THECC (Ankyrin repeat-containing protein, putative OS=Theobroma cacao GN=TCM_042913 PE=4 SV=1)

HSP 1 Score: 260.4 bits (664), Expect = 3.3e-66
Identity = 141/308 (45.78%), Positives = 209/308 (67.86%), Query Frame = 1

Query: 3   QRLNAVASSGDIDSLYLILQEDAYILERIDQVPFVDTPLHISASAGHVPFSLEIMRLKPS 62
           +RL + A +G+ID LY +++ DAYILER+DQ+PF DTPLHI+++AGH+ F++EIM LKPS
Sbjct: 16  KRLKSAARAGNIDELYTLIRRDAYILERVDQMPFADTPLHIASAAGHIDFAMEIMNLKPS 75

Query: 63  LAKKLNPEGYSPIHLALQNNQTKTVLRLVDIDRDLVCIQGRDGLTPLHIAASGGMDDILA 122
            A+KLN +G+SPIHLALQ  QT+ VLRL+ ID+DLV ++GR+G TPLH  AS G   +LA
Sbjct: 76  FARKLNQDGFSPIHLALQQGQTEMVLRLLAIDKDLVRVKGREGKTPLHYVASEGNLSLLA 135

Query: 123 KFLTSCPKSIKQLTNRNESALHIAVKREDIESVKILLQWIQQTCM------TSILNWCDD 182
           +FL  CPK I+ +T RNE+ALHIA +  +++++++LL  +++T +        +LN+ D 
Sbjct: 136 QFLLRCPKCIQDVTIRNETALHIAAENNNLKALRVLLLSLKRTNLYGKSSEKKLLNFRDK 195

Query: 183 EGNNVMHRVALGNQIEMVKLLIN-KVDMKAKNLEGKTALDIMKEHGLVEDKEVKEMLH-- 242
           +GN V+H  A  NQ +MVKLLI  KV +   N  G TALDI++    V++++  E+L   
Sbjct: 196 DGNTVLHIAASTNQPQMVKLLIECKVSINKTNSRGLTALDILESMSNVDNRDSSEILRDF 255

Query: 243 GSHILRDVAKFVCLILLFVKRLVITDH------HEILYMSKKDRNAILVVSVLIATATYE 296
           G        +   L ++   ++ + ++      H+I+ MS    NA+LVV VLI TATY+
Sbjct: 256 GGLNASATRRRPSLHVMLGSKITLLENAFGEVFHDIITMSADRSNALLVVLVLILTATYQ 315

BLAST of CmaCh19G003970.1 vs. TAIR10
Match: AT5G54620.1 (AT5G54620.1 Ankyrin repeat family protein)

HSP 1 Score: 213.4 bits (542), Expect = 2.3e-55
Identity = 127/353 (35.98%), Positives = 201/353 (56.94%), Query Frame = 1

Query: 1   MNQRLNAVASSGDIDSLYLILQEDAYILERIDQVPFVDTPLHISASAGHVPFSLEIMRLK 60
           M++RL  V  SG++D+LY ++ +D YIL+ ID +PFV TPLH ++S G    ++E+M LK
Sbjct: 1   MDRRLLWVTDSGNVDALYALIHKDPYILQNIDVLPFVHTPLHEASSTGKTDLAMELMVLK 60

Query: 61  PSLAKKLNPEGYSPIHLALQNNQTKTVLRLVDIDRDLVCIQGRDGLTPLHIAASGGMDDI 120
           P+ AKKLN +G SP+HLA++N+Q +  L LV I+ DLV + GR G+TPLH+    G  ++
Sbjct: 61  PTFAKKLNSDGVSPLHLAVENHQVQLALELVKINPDLVLVAGRKGMTPLHLVVKKGDANL 120

Query: 121 LAKFLTSCPKSIKQLTNRNESALHIAVKREDIESVKILLQWIQQ-------TCMTSILNW 180
           L +FL +CP+SIK      E+ALHIAV  +  E +K+L  WI +       +    +LN 
Sbjct: 121 LTEFLLACPESIKDTNVNGETALHIAVMNDRYEELKVLTGWIHRLHKSDAASTEIHVLNK 180

Query: 181 CDDEGNNVMHRVALGNQIEMVKLLINKVDMK--AKNLEGKTALDIMKEHGLVEDKEVKEM 240
            D +GN ++H  A  N  +  K L+  + +    +N  G TALDI++ +G   + + +++
Sbjct: 181 RDRDGNTILHLAAYKNNHKAFKELLKCISLNRDIQNKGGMTALDILRTNGSHMNIKTEKI 240

Query: 241 LHGS--------HILRDVAKFVCLILLFVKRLVITDHHEILYMSKKDRNAILVVSVLIAT 300
           +  S          ++  + F+   + FV+    T       MS   RNA+LV++ LI T
Sbjct: 241 IRHSGGKSGVSLSKVKTASVFLRSPITFVEYCSTTMTRYKNRMSDGTRNALLVITALIIT 300

Query: 301 ATYEAALSPPEKDMDFFPPEWISYVCF-QQLNTVAFVASMIEICVHLPSGIGY 336
           ATY+ A+ P +KD  ++    +  V F    NT+AF  ++    + LP G  Y
Sbjct: 301 ATYQTAVQPQDKDEIYYTGNIMINVLFVWGFNTIAFCLAIALTFILLPVGKAY 353

BLAST of CmaCh19G003970.1 vs. TAIR10
Match: AT4G10720.1 (AT4G10720.1 Ankyrin repeat family protein)

HSP 1 Score: 206.1 bits (523), Expect = 3.7e-53
Identity = 130/386 (33.68%), Positives = 211/386 (54.66%), Query Frame = 1

Query: 1   MNQRLNAVASSGDIDSLYLILQEDAYILERIDQVPFVDTPLHISASAGHVPFSLEIMRLK 60
           M+ RL      G ID LY  + E+ YILE ID +PF++TPLHI++++G++ F++E+M LK
Sbjct: 1   MDPRLIVATQIGSIDELYAHIHENPYILEIIDAIPFINTPLHIASASGNLSFAMELMNLK 60

Query: 61  PSLAKKLNPEGYSPIHLALQNNQTKTVLRLVDIDRDLVCIQGRDGLTPLHIAASGGMDDI 120
           PS A+KLN  G SP+HLA++  QT+ VL L+ +D DLV ++GR+G+TP H     G  D+
Sbjct: 61  PSFARKLNTYGLSPLHLAIEEGQTRLVLSLLKVDSDLVRLRGREGMTPFHQVVRRGETDL 120

Query: 121 LAKFLTSCPKSIKQLTNRNESALHIAVKREDIESVKILLQWIQQTCMT-------SILNW 180
           + +FL +CP  IK      E+ALHIAV  +  E +++LL W+Q+   T         LN 
Sbjct: 121 MTEFLLACPGCIKDANVNGETALHIAVSNDRYEELEVLLGWVQRLRQTDAESLEMQFLNK 180

Query: 181 CDDEGNNVMHRVALGNQIEMVKLLI--NKVDMKAKNLEGKTALDIM-KEHGLVEDKEVKE 240
            D +GN  +H  A  N+ + VK+L+  + V+    N  G TALDI+  +     +  ++ 
Sbjct: 181 RDQDGNTALHIAAYQNRFKAVKILVKCSAVNRNIHNRTGLTALDILHNQRDHHANSNIEN 240

Query: 241 MLH------GSHI--LRDVAKFVCLILLFVKRLVITDHHEILYMSKKDRNAILVVSVLIA 300
           ++       G+ +   + V++ +   + F + L           S+  R+A+LV++ LI 
Sbjct: 241 IIRKWGGKSGNSLPKSKKVSEILRSPISFTEHLFTQTARYRNQTSEGTRSALLVIAALII 300

Query: 301 TATYEAALSPP------------EKDMDFFPPEWISYVCFQQLNTVAFVASMIEICVHLP 354
           TATY+ AL PP            +K +         +   + +NT+AFV ++      LP
Sbjct: 301 TATYQTALQPPGGVYQENAAEESKKSVGTVVMSHKYFFVLRGVNTMAFVGAIFMAFCLLP 360

BLAST of CmaCh19G003970.1 vs. TAIR10
Match: AT5G54610.1 (AT5G54610.1 ankyrin)

HSP 1 Score: 198.4 bits (503), Expect = 7.8e-51
Identity = 124/373 (33.24%), Positives = 206/373 (55.23%), Query Frame = 1

Query: 1   MNQRLNAVASSGDIDSLYLILQEDAYILERIDQVPFVDTPLHISASAGHVPFSLEIMRLK 60
           M+ +L  V  SG +D LY ++Q    IL+++D +P + TPLH ++SAG +  ++E+M LK
Sbjct: 1   MDSKLLLVTQSGSVDDLYSLIQAAPDILQKVDVLPIIHTPLHEASSAGKLDLAMELMILK 60

Query: 61  PSLAKKLNPEGYSPIHLALQNNQTKTVLRLVDIDRDLVCIQGRDGLTPLHIAASGGMDDI 120
           PS AKKLN  G SP+HLA++N+Q +  L LV +D  LV I+GR G+TPLH+ A  G  D+
Sbjct: 61  PSFAKKLNEYGLSPLHLAVENDQVELALELVKVDPSLVRIRGRGGMTPLHLVAKKGDVDL 120

Query: 121 LAKFLTSCPKSIKQLTNRNESALHIAVKREDIESVKILLQWIQQ-----TCMTSILNWCD 180
           L  FL +CP+SIK +    E+ LHI +  +  E +K+L  W+Q+          +LN  D
Sbjct: 121 LTDFLLACPESIKDVNVNGETILHITIMNDKYEQLKVLTGWMQKMRDSDDVFIDVLNRRD 180

Query: 181 DEGNNVMHRVALGNQIEMVKLLIN--KVDMKAKNLEGKTALDIMKEHGLVEDKEVKEMLH 240
             GN V+H  A  N  ++VK L+    +D   +N  G TALD+++  G   +KE++E++ 
Sbjct: 181 RGGNTVLHLAAYENNDKVVKQLVKCLSLDRNIQNKSGMTALDVLRARGSHMNKEIEEIIQ 240

Query: 241 --GSHILRDVAKFVCLILLFVKRLVITDHHEILY------MSKKDRNAILVVSVLIATAT 300
             G      ++      +   + +   +H +         +S   RNA+LV++ LI +AT
Sbjct: 241 MSGGKTGGSLSGIQEWYIFLREPVTFKEHCKTRIARYRSRISDGSRNALLVIAALIISAT 300

Query: 301 YEAALSPPEKD-MDFFPPEWISYVCFQ--QLNTVAFVASMIEICVHLPSGIGYA-LHLVL 355
           ++ A    +K+ +D      + +  FQ    NTVAF  +++   + LP G  Y   + ++
Sbjct: 301 FQTAAQLLDKEKLDKVKKNGMRFSEFQLWGCNTVAFSIAILFSFILLPVGRAYEWWYFII 360

BLAST of CmaCh19G003970.1 vs. TAIR10
Match: AT5G15500.2 (AT5G15500.2 Ankyrin repeat family protein)

HSP 1 Score: 196.1 bits (497), Expect = 3.9e-50
Identity = 117/315 (37.14%), Positives = 182/315 (57.78%), Query Frame = 1

Query: 1   MNQR-LNAVASSGDIDSLYLILQEDAYILERIDQVPFVDTPLHISASAGHVPFSLEIMRL 60
           M+QR L A A SG+ID LY ++ ED Y+L++ D VPFV+TPLH++A  G   F++E+M L
Sbjct: 1   MDQRSLEAAAKSGNIDLLYELIHEDPYVLDKTDHVPFVNTPLHVAAVNGKTEFAMEMMNL 60

Query: 61  KPSLAKKLNPEGYSPIHLALQNNQTKTVLRLVDIDRDLVCIQGRDGLTPLHIAASGGMDD 120
           KPS A+KLN +G +P+HLA+++     VL +V +D  LV I+GR G+TPL +A S    D
Sbjct: 61  KPSFARKLNADGLTPLHLAVEHGHFWLVLEVVKVDPSLVRIKGRHGMTPLLVAVSRKKID 120

Query: 121 ILAKFLTSCPKSIKQLTNRNESALHIAV----KREDIESVKILLQWIQQTCM-------T 180
           ++++F   CP+SI       E+ALHIAV    +RE +  +K+L+ WI + C        T
Sbjct: 121 LMSEFFLGCPESIVDANVNGENALHIAVNNYDQREGLSVLKVLMGWILRLCQKDAEWIET 180

Query: 181 SILNWCDDEGNNVMHRVALGNQIEMVKLLI--NKVDMKAKNLEGKTALDIMKEHGLVEDK 240
            ++N  D +GN  +H  A     + +KLL+  +K+++  +N  G T  DI   H    ++
Sbjct: 181 RVINRRDKDGNTPLHLAAYEINRQAMKLLLESSKINVNIENKNGLTVFDIAVLH---NNR 240

Query: 241 EVKEML--HGSHILRDVAKFVCLILLFVKRLVITDHHE------ILYMSKKDRNAILVVS 294
           E++ M+  HG      + K      +   +L   +           ++S++ RNA+LVV+
Sbjct: 241 EIERMVKRHGGKRSVSLVKIKTTSDILASQLSWRESRRTKKIRFYSWISEERRNALLVVA 300

BLAST of CmaCh19G003970.1 vs. TAIR10
Match: AT1G14480.1 (AT1G14480.1 Ankyrin repeat family protein)

HSP 1 Score: 181.0 bits (458), Expect = 1.3e-45
Identity = 133/425 (31.29%), Positives = 206/425 (48.47%), Query Frame = 1

Query: 1   MNQRLNAVASSGDIDSLYLILQEDAYILERIDQVPFVDTPLHISASAGHVPFSLEIMRLK 60
           M+ RL   A SG I+ LY ++ E+ YILE ID VPFV TPLH++A  G++ F++E++ LK
Sbjct: 1   MDLRLQQAAESGSINELYALIDENPYILENIDAVPFVSTPLHVAAVFGNIEFAMEMLNLK 60

Query: 61  PSLAKKLNPEGYSPIHLALQNNQTKTVLRLVDIDRDLVCIQGRDGLTPLHIAASGGMDDI 120
           PS A+KLN  GYSP+HLA++  Q+  V  ++  D  L  ++GR+G+TP H+    G DD+
Sbjct: 61  PSFARKLNTSGYSPLHLAVEKEQSDFVSHMLWHDGGLSRVKGRNGVTPFHLLVIRGDDDL 120

Query: 121 LAKFLTSCPKSIKQLTNRNESALHIAVKREDIESVKILLQWIQQTCM-------TSILNW 180
           +A+ L + P+ I+ +    ++ALH+AV  +  E +++L  WIQ+            +LN 
Sbjct: 121 VAECLITSPECIEDVNVDRQNALHLAVMNDRFEVLQVLTGWIQRMSQKDAYYIENRVLNK 180

Query: 181 CDDEGNNVMHRVALGNQIEMVKLLI--NKVDMKAKNLEGKTALDIMK------------- 240
            D + N  +H  A  N  + +KLL+    V+    N++  T +DI++             
Sbjct: 181 RDFDFNTALHLAAYKNDQQALKLLLKCRLVEPNLVNIDDLTFVDILRTQGENAGGGNLDL 240

Query: 241 -----EHGLVEDKEVKEMLHGSHILRDVAKFVCLILLFVKRLVITDHHEILYMSKKDRNA 300
                + G VE   + +    S +L+    F+      +KR+  +        S +DR A
Sbjct: 241 EQAVIKTGCVEAASMPKFKEESDLLKSPINFMTYYSTSMKRMKSS-------TSDQDRGA 300

Query: 301 ILVVSVLIATATYEAALSPP----------------EKDMDFFPPEWISYVCFQQLNTVA 360
            L+V  LI TATY+ AL PP                     FF   WIS       NTV 
Sbjct: 301 FLIVCTLIITATYQMALQPPGGVHQSENANANAGSVVMKQTFFILLWIS-------NTVG 360

Query: 361 FVASMIEICVHLPSG---------------IGYALHLVLPLSPH----LSFTIIMFGIVK 364
           F  ++      +P G               I YAL + + +SPH    LS T  +F +  
Sbjct: 361 FCCAVFYTFCLIPLGQLFTIWFFYIGTCLCISYALAMAV-ISPHPLVFLSATFALFLVFA 410

BLAST of CmaCh19G003970.1 vs. NCBI nr
Match: gi|703153015|ref|XP_010110570.1| (Ankyrin repeat-containing protein [Morus notabilis])

HSP 1 Score: 316.6 bits (810), Expect = 5.6e-83
Identity = 174/376 (46.28%), Positives = 246/376 (65.43%), Query Frame = 1

Query: 1   MNQRLNAVASSGDIDSLYLILQEDAYILERIDQVPFVDTPLHISASAGHVPFSLEIMRLK 60
           M  RL+ VA  G+I++LY +++EDA++LE  + VPFVDTPLH++ASAG V F++EIMRLK
Sbjct: 1   MENRLDIVAREGNIEALYYLMKEDAHLLEHFEAVPFVDTPLHVAASAGQVHFAVEIMRLK 60

Query: 61  PSLAKKLNPEGYSPIHLALQNNQTKTVLRLVDIDRDLVCIQGRDGLTPLHIAASGGMDDI 120
           PS A+K N  GYSPIHLA+QN  T  VLRL+DIDRDLV ++GR+G TPLH A   G  D+
Sbjct: 61  PSFARKSNQNGYSPIHLAMQNAHTNLVLRLLDIDRDLVRVRGREGKTPLHFAVECGEIDL 120

Query: 121 LAKFLTSCPKSIKQLTNRNESALHIAVKREDIESVKILLQWIQQTCMTSILNWCDDEGNN 180
           LA+FL  CPKSI+ LT R E+ALH+AVK + +E++K+LL W++      +LNW DDEGN 
Sbjct: 121 LAEFLLVCPKSIQDLTIRKETALHVAVKSDKLEALKVLLGWLEHVGKKGVLNWGDDEGNT 180

Query: 181 VMHRVALGNQIEMVKLLINKVDMKAKNLEGKTALDIMKEHGLVEDKEVKEMLHGSHILR- 240
           ++H  A  NQ +MV+L+I+++D+ AKN  G TALDI  E        +K MLH +  L+ 
Sbjct: 181 ILHIAAARNQTKMVRLIIDRIDLNAKNSAGLTALDISPEQTQPNSGTLKLMLHRAGALKA 240

Query: 241 -------DVAKFVCLILLFVKRLVITDHHEILYMSKKDRNAILVVSVLIATATYEAALSP 300
                   +A  +   + + ++ +I+D+ + LY+S +DRN IL+V+VL ATA Y+AAL  
Sbjct: 241 SALPTVPKLADSLKSKMSWREKWIISDYRKRLYLSNEDRNIILLVAVLFATANYQAALDS 300

Query: 301 PEKDMDFFPPEWISY-------VCFQQLNTVAFVASMIEICVHLPSGIGYALHLVLPLSP 360
             K  D  P  + +Y         F  +N +AF+ASM+ I +HLP+  G  L LVLPL  
Sbjct: 301 FSKGDDDDPSTYFAYYSDGVVRAIFSLVNKIAFLASMVVIYLHLPADCG-ILRLVLPL-- 360

Query: 361 HLSFTIIMFGIVKLKR 362
                I  + ++K+ R
Sbjct: 361 ----VICNYAVMKIYR 369

BLAST of CmaCh19G003970.1 vs. NCBI nr
Match: gi|703086159|ref|XP_010092930.1| (Phytosulfokine receptor 1 [Morus notabilis])

HSP 1 Score: 302.0 bits (772), Expect = 1.4e-78
Identity = 165/341 (48.39%), Positives = 234/341 (68.62%), Query Frame = 1

Query: 1   MNQRLNAVASSGDIDSLYLILQEDAYILERIDQVPFVDTPLHISASAGHVPFSLEIMRLK 60
           M+ RL   A +GDI++LY +L+ED+Y+LERID VPF+DTPLHI+ASAGHV F+LEIMRLK
Sbjct: 1   MDARLEVAAKAGDIEALYSLLKEDSYLLERIDAVPFIDTPLHIAASAGHVYFALEIMRLK 60

Query: 61  PSLAK-KLNPEGYSPIHLALQNNQTKTVLRLVDIDRDLVCIQGRDGLTPLHIAASGGMDD 120
           PS A  KLN +G+SPIHLALQN +++ VLRLV+++R LV +QGR+G T LH +    M D
Sbjct: 61  PSYAATKLNQDGFSPIHLALQNGKSQMVLRLVEMNRGLVQVQGREGKTRLHFSVEYDMVD 120

Query: 121 ILAKFLTSCPKSIKQLTNRNESALHIAVKREDIESVKILLQWIQQTCMTSILNWCDDEGN 180
           +LAKFL+ CPK+I+ LT R E+ALHIAVK +++E+VK+LL W+       +L   DDEGN
Sbjct: 121 VLAKFLSVCPKAIQILTIRKETALHIAVKNDNLEAVKLLLGWLDHVDKDVVLKLADDEGN 180

Query: 181 NVMHRVALGNQIEMVKLLINKVDMKAKNLEGKTALDIMKEHGLVEDKEVKEMLHGSHILR 240
            V+H     NQIEMV+L IN  D+ AKNL G TALD+++    + +++++ ML     L+
Sbjct: 181 TVLHMATSKNQIEMVRLFINGADVNAKNLVGLTALDVLENQKQLANEKLQRMLLRGGALK 240

Query: 241 DVA--------KFVCLILLFVKRLVITDHHEILYMSKKDRNAILVVSVLIATATYEAALS 300
             +          +   + + ++ +I+D  + LYMS  +RN +LVV+VL ATA Y+A L+
Sbjct: 241 GSSLPSIPTNKDAMKTKMSWHEKFLISDFRKKLYMSNDERNTMLVVAVLFATANYQAILN 300

Query: 301 PPEKDMDFFPPEWISYVCFQQLNTVAFVASMIEICVHLPSG 333
            P      +  E +S++ F  +N VAF+ASM EI ++LP G
Sbjct: 301 NP------YMYEDLSFMLFSFVNHVAFLASMFEIYLYLPKG 335

BLAST of CmaCh19G003970.1 vs. NCBI nr
Match: gi|1021032057|gb|KZM89841.1| (hypothetical protein DCAR_022796 [Daucus carota subsp. sativus])

HSP 1 Score: 281.2 bits (718), Expect = 2.6e-72
Identity = 152/366 (41.53%), Positives = 232/366 (63.39%), Query Frame = 1

Query: 4   RLNAVASSGDIDSLYLILQEDAYILERIDQVPFVDTPLHISASAGHVPFSLEIMRLKPSL 63
           +LN VA +GDI+ LY  ++E+ YILE ID++PF+DTPLHI+A AGH+PF++E+MRLKPS 
Sbjct: 381 KLNKVAEAGDIEELYHSIREEPYILENIDKIPFIDTPLHIAAKAGHIPFTVELMRLKPSF 440

Query: 64  AKKLNPEGYSPIHLALQNNQTKTVLRLVDIDRDLVCIQGRDGLTPLHIAASGGMDDILAK 123
           A+KLNP+G SPIHLA+Q +  + V+R++++DR+LV +QG++  TPLH AA  G  +++ +
Sbjct: 441 ARKLNPDGSSPIHLAVQEDHERLVIRMINVDRELVRVQGKNCNTPLHCAADKGNVELIVE 500

Query: 124 FLTSCPKSIKQLTNRNESALHIAVKREDIESVKILLQWIQQTCMTSILNWCDDEGNNVMH 183
           FL +CP+SI  +  R +SALH+AV++ DI +VK++L+W++      IL W DD+ ++++H
Sbjct: 501 FLLACPESILDVNARKQSALHLAVQQNDIHTVKVMLEWLKLLDAQFILGWTDDQSDSILH 560

Query: 184 RVALGNQIEMVKLLINKVDMKAKNLEGKTALDIMKEHG---LVEDKEVKEMLHGSHILRD 243
             A  N +EMVK+LI K+DM A+N + ++A DI +      L  DK+V  M     ++R 
Sbjct: 561 IAARKNNVEMVKMLIPKIDMHARNSDNESARDIFEAQNPDLLPNDKKVTIM---QWLVRQ 620

Query: 244 VAKFVCLILL-----------------FVKRLVITDHHEILYMSKKDRNAILVVSVLIAT 303
            + +   +                   + KR ++++H  I   S  D+N +LVV+VLIAT
Sbjct: 621 KSHYNATLYCDTNDNTSLKESLRKGFPWHKRWILSNHRHI---SLVDKNGVLVVAVLIAT 680

Query: 304 ATYEAALSPPEKDMDFFPPEWISYVC------FQQLNTVAFVASMIEICVHLPSGIGYAL 344
             Y+A +  P      F P  I+Y        FQ  NT AFVA+M  I + LP GI Y L
Sbjct: 681 TAYQAVIQLPP----IFDPASIAYTTHYYFFIFQLFNTTAFVAAMSLIFILLPPGITYVL 736

BLAST of CmaCh19G003970.1 vs. NCBI nr
Match: gi|645266193|ref|XP_008238508.1| (PREDICTED: ankyrin-1-like [Prunus mume])

HSP 1 Score: 270.4 bits (690), Expect = 4.6e-69
Identity = 158/374 (42.25%), Positives = 223/374 (59.63%), Query Frame = 1

Query: 2   NQRLNAVASSGDIDSLYLILQEDAYILERIDQVPFVDTPLHISASAGHVPFSLEIMRLKP 61
           +QRL   A  GDID  Y ++QED+ ILERIDQVPFV TPLHI+ASAGH  F+LE+MRLKP
Sbjct: 262 DQRLMKAAQEGDIDGFYALIQEDSCILERIDQVPFVHTPLHIAASAGHTHFALEMMRLKP 321

Query: 62  SLAKKLNPEGYSPIHLALQNNQTKTVLRLVDIDRDLVCIQGRDGLTPLHIAASGGMDDIL 121
              +K N EG+S +HLAL++ +T+TVL ++   RD+V ++GR+G T LH  A  G  D+L
Sbjct: 322 QFTRKQNKEGFSALHLALKHGKTQTVLSVLSAYRDIVRVKGREGRTLLHCVAEIGNLDLL 381

Query: 122 AKFLTSCPKSIKQLTNRNESALHIAVKREDIESVKILLQWIQQTCMTSILNWCDDEGNNV 181
           A+FL +CP+SI  LTN+ E+ALHIA K +   ++++LL WIQ   M  +L W D EGN V
Sbjct: 382 AEFLAACPESIIDLTNQKETALHIAAKNDKAGALEVLLGWIQHVDMDEVLQWTDVEGNTV 441

Query: 182 MHRVALGNQIEMVKLLINKVDMKAKNLEGKTALDIMKEHGLVEDKEV--KEMLHGSHILR 241
           +H     NQ ++V+LLI +VD+  KNLEG TALDI       E   +  +    G+  L 
Sbjct: 442 LHIATARNQFQVVRLLIKRVDLNVKNLEGLTALDISLVVNNTEMINLLCRNGALGASSLP 501

Query: 242 DVAKFVCLI---LLFVKRLVITDHHEILYMSKKDRNAILVVSVLIATATYEAALSPP--- 301
            V+     +   +   ++ ++ +H     M  + RNA+LVV+VLIATAT++A L+PP   
Sbjct: 502 RVSSLADSLRKEMSLTEKWILQNHLSKCCMPNEKRNALLVVAVLIATATFQAVLNPPAGI 561

Query: 302 EKDMDFFPPEW------------------------ISYVCFQQLNTVAFVASMIEICVHL 344
           +K     P ++                         + V F   NT+AF+ S+ EI  HL
Sbjct: 562 QKGYSETPHDFKRNSAATNSSAEENVVSNFAFHSAAACVSFLAFNTMAFLTSISEIWFHL 621

BLAST of CmaCh19G003970.1 vs. NCBI nr
Match: gi|985447676|ref|XP_015385479.1| (PREDICTED: ankyrin-1-like [Citrus sinensis])

HSP 1 Score: 270.0 bits (689), Expect = 6.0e-69
Identity = 139/304 (45.72%), Positives = 214/304 (70.39%), Query Frame = 1

Query: 2   NQRLNAVASSGDIDSLYLILQEDAYILERIDQVPFVDTPLHISASAGHVPFSLEIMRLKP 61
           +QRLN  A +G++D+LY ++ EDAY+L++IDQVPFVDTPLHI+AS GHV F+LEIMRLKP
Sbjct: 263 DQRLNEAAQAGNVDALYELIWEDAYLLDQIDQVPFVDTPLHIAASMGHVNFALEIMRLKP 322

Query: 62  SLAKKLNPEGYSPIHLALQNNQTKTVLRLVDIDRDLVCIQGRDGLTPLHIAASGGMDDIL 121
           S ++K N  G+ P+HLALQ   T+ VLRL+D DR+LV +QGR+G+TPLH  A  G  D+L
Sbjct: 323 SFSRKQNQYGFCPLHLALQKTHTQMVLRLIDFDRNLVRVQGREGVTPLHYVAEKGNVDLL 382

Query: 122 AKFLTSCPKSIKQLTNRNESALHIAVKREDIESVKILLQWIQQTCMTSILNWCDDEGNNV 181
            KFL +CP+SI Q+T R E+ALH+A K + +E ++ +L W++   M  ILNW +DEGN +
Sbjct: 383 CKFLAACPESILQVTIRKETALHVAAKYDRLEVLETMLGWLRYVNMDDILNWKNDEGNTL 442

Query: 182 MHRVALGNQIEMVKLLINKV--DMKAKNLEGKTALDIMKEH--GLVEDKEVKEML----- 241
           +H     + I++V+L++ +V   + A+N +  TA+D++K H     E +E+K M+     
Sbjct: 443 LHISISRSHIQIVRLIVKRVRDQINARNSKDNTAMDMVKFHLQTKPEFEELKSMVRKAGG 502

Query: 242 --HGSHILRDVAKFVCLILLFVKRLVITDHHEILYMSKKDRNAILVVSVLIATATYEAAL 295
               S    ++A ++   L + +++++  +   L ++ ++RNA+LVV+VLIATAT++AAL
Sbjct: 503 RERSSLATMEIADYLKRGLTWRRKVLLFFYRSSLCITDENRNALLVVAVLIATATFQAAL 562
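
Each HSP header above reports identity and positives as a count over the alignment length, followed by the same ratio as a percentage. The sketch below is one way to parse such a line and recompute the percentages as a consistency check. The function name `parse_hsp_stats` and the regular expression are illustrative assumptions, not part of the database or of BLAST itself; the pattern accepts both the "Positives" and "Postives" spellings.

```python
import re

# Hypothetical pattern for the statistics line of an HSP header.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return the reported counts and the percentages recomputed from them."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity": (int(ident), int(ident_len)),
        "positives": (int(pos), int(pos_len)),
        "identity_pct_reported": float(ident_pct),
        "positives_pct_reported": float(pos_pct),
        # Percentage = 100 * matches / alignment length, rounded to 2 places.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
    }


# Statistics line of the Citrus sinensis hit (XP_015385479.1).
stats = parse_hsp_stats(
    "Identity = 139/304 (45.72%), Positives = 214/304 (70.39%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positives_pct"])
```

For the Citrus sinensis hit, the recomputed values agree with the reported 45.72% identity and 70.39% positives.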

The following BLAST results are available for this feature:
Match Name    E-value   Identity (%)   Description
Y3236_ARATH   3.2e-14   26.86          Ankyrin repeat-containing protein At3g12360 OS=Arabidopsis thaliana GN=At3g12360...
Y5262_ARATH   4.2e-14   28.65          Ankyrin repeat-containing protein At5g02620 OS=Arabidopsis thaliana GN=At5g02620...
Y2168_ARATH   3.9e-12   25.78          Ankyrin repeat-containing protein At2g01680 OS=Arabidopsis thaliana GN=At2g01680...
LITA_LATTR    1.5e-11   25.84          Alpha-latroinsectotoxin-Lt1a (Fragment) OS=Latrodectus tredecimguttatus PE=1 SV=...
ANS1B_DANRE   1.7e-10   25.89          Ankyrin repeat and sterile alpha motif domain-containing protein 1B OS=Danio rer...

Match Name         E-value   Identity (%)   Description
W9S5L9_9ROSA       3.9e-83   46.28          Ankyrin repeat-containing protein OS=Morus notabilis GN=L484_023405 PE=4 SV=1
W9SNY8_9ROSA       9.9e-79   48.39          Phytosulfokine receptor 1 OS=Morus notabilis GN=L484_002271 PE=4 SV=1
A0A164V5K6_DAUCA   1.8e-72   41.53          Uncharacterized protein OS=Daucus carota subsp. sativus GN=DCAR_022796 PE=4 SV=1
W9S5J8_9ROSA       1.3e-67   42.28          Uncharacterized protein OS=Morus notabilis GN=L484_023404 PE=4 SV=1
A0A061FN51_THECC   3.3e-66   45.78          Ankyrin repeat-containing protein, putative OS=Theobroma cacao GN=TCM_042913 PE=...

Match Name    E-value   Identity (%)   Description
AT5G54620.1   2.3e-55   35.98          Ankyrin repeat family protein
AT4G10720.1   3.7e-53   33.68          Ankyrin repeat family protein
AT5G54610.1   7.8e-51   33.24          ankyrin
AT5G15500.2   3.9e-50   37.14          Ankyrin repeat family protein
AT1G14480.1   1.3e-45   31.29          Ankyrin repeat family protein

Match Name                         E-value   Identity (%)   Description
gi|703153015|ref|XP_010110570.1|   5.6e-83   46.28          Ankyrin repeat-containing protein [Morus notabilis]
gi|703086159|ref|XP_010092930.1|   1.4e-78   48.39          Phytosulfokine receptor 1 [Morus notabilis]
gi|1021032057|gb|KZM89841.1|       2.6e-72   41.53          hypothetical protein DCAR_022796 [Daucus carota subsp. sativus]
gi|645266193|ref|XP_008238508.1|   4.6e-69   42.25          PREDICTED: ankyrin-1-like [Prunus mume]
gi|985447676|ref|XP_015385479.1|   6.0e-69   45.72          PREDICTED: ankyrin-1-like [Citrus sinensis]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term        Definition
IPR002110   Ankyrin_rpt
IPR020683   Ankyrin_rpt-contain_dom
IPR026961   PGG_dom
Vocabulary: Molecular Function
Term         Definition
GO:0005515   protein binding
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding

This mRNA is a part of the following gene feature(s):

Feature Name     Unique Name      Type
CmaCh19G003970   CmaCh19G003970   gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name       Unique Name                Type
CmaCh19G003970.1   CmaCh19G003970.1-protein   polypeptide


The following CDS feature(s) are a part of this mRNA:

Feature Name             Unique Name              Type
CmaCh19G003970.1.CDS.4   CmaCh19G003970.1.CDS.4   CDS
CmaCh19G003970.1.CDS.3   CmaCh19G003970.1.CDS.3   CDS
CmaCh19G003970.1.CDS.2   CmaCh19G003970.1.CDS.2   CDS
CmaCh19G003970.1.CDS.1   CmaCh19G003970.1.CDS.1   CDS


The following exon feature(s) are a part of this mRNA:

Feature Name              Unique Name               Type
CmaCh19G003970.1.exon.4   CmaCh19G003970.1.exon.4   exon
CmaCh19G003970.1.exon.3   CmaCh19G003970.1.exon.3   exon
CmaCh19G003970.1.exon.2   CmaCh19G003970.1.exon.2   exon
CmaCh19G003970.1.exon.1   CmaCh19G003970.1.exon.1   exon


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term    IPR Description                    Source    Source Term            Source Description                      Alignment
IPR002110   Ankyrin repeat                     SMART     SM00248                ANK_2a                                  coord: 104..134, score: 0.32
                                                                                                                        coord: 138..167, score: 540.0
                                                                                                                        coord: 177..205, score: 25.0
                                                                                                                        coord: 36..65, score: 590.0
                                                                                                                        coord: 70..99, score:
IPR002110   Ankyrin repeat                     PROFILE   PS50088                ANK_REPEAT                              coord: 104..125, score: 8
IPR020683   Ankyrin repeat-containing domain   GENE3D    G3DSA:1.25.40.20                                               coord: 7..232, score: 1.5
IPR020683   Ankyrin repeat-containing domain   PFAM      PF12796                Ank_2                                   coord: 104..160, score: 1.
IPR020683   Ankyrin repeat-containing domain   PROFILE   PS50297                ANK_REP_REGION                          coord: 1..200, score: 25
IPR020683   Ankyrin repeat-containing domain   unknown   SSF48403               Ankyrin repeat                          coord: 7..224, score: 1.62
IPR026961   PGG domain                         PFAM      PF13962                PGG                                     coord: 268..333, score: 6.
None        No IPR available                   PANTHER   PTHR24128              FAMILY NOT NAMED                        coord: 1..285, score: 3.8
None        No IPR available                   PANTHER   PTHR24128:SF24         ANKYRIN REPEAT FAMILY PROTEIN-RELATED   coord: 1..285, score: 3.8
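
The alignment column above gives each InterPro hit as a `coord: start..end` range. The sketch below shows one way to pull those ranges out of the text and order them along the protein, using the five SMART ankyrin-repeat hits (SM00248) as input. It assumes the coordinates are 1-based, inclusive residue positions, which the page does not state; the helper name `parse_coords` is illustrative only.

```python
import re

# Matches "coord: 104..134" and captures the start and end positions.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text):
    """Return (start, end) pairs found in the text, sorted by start position."""
    return sorted((int(start), int(end)) for start, end in COORD.findall(text))


# SMART SM00248 hits, in the order they appear in the table.
smart_hits = (
    "coord: 104..134 coord: 138..167 coord: 177..205 "
    "coord: 36..65 coord: 70..99"
)

repeats = parse_coords(smart_hits)
# Assumption: coordinates are inclusive, so length = end - start + 1.
lengths = [end - start + 1 for start, end in repeats]

for (start, end), length in zip(repeats, lengths):
    print(f"{start}..{end}\t{length} residues")
```

Sorting the hits makes the layout easier to read: under the inclusive-coordinate assumption, the five predicted repeats lie between residues 36 and 205 and are each about 30 residues long.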