Lag0022021 (gene) Sponge gourd (AG‐4) v1

Overview
Name: Lag0022021
Type: gene
Organism: Luffa acutangula (Sponge gourd (AG‐4) v1)
Description: Zinc finger BED domain-containing protein RICESLEEPER 3-like
Location: chr7: 16025005 .. 16043763 (-)
RNA-Seq Expression: Lag0022021
Synteny: Lag0022021
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATAATATCCCAGAAGAAAGTTCGGAAGAAAGTTCGATCCATGTTGTGGAGAAATCAAGTGTAGCTAAGAAAAGCAAAGTGAATCATTCGAGTCCAACCAAAAAACGTAAGAATACTAAAATCTCTGTAGTTTGGGGTCATTTTAAGAAAATTAAAGATTGTGATCCTAATGATCATTATGCTAAGTGTAAATACTGTGGGGCTAAGTATGCATGTCATTCCAAGCGTAATGGTACCGACAATTTAAAACACCATTTAGAAAATTGTAAGAAATACCTTTATCAAAGAAAGAAAGATCCAGGACAAAAACAATTGGTTTTCAAGCCTAAAGAAGCGATTGATGATTCTAAACCAAAACTTAGTTGTGAGACTTTCAGTCTGGAGAGTTGTAGAAGAGCTCTTGCTGAGTTGGAGAGTTGTGAGACTTTTAGGTTTGTGGAGAACATAGGTTTTCGAAGATTTATCAACAAATCAATAGCCTTGTTTGCGCCTAATTTTGTCCTTCCATCTCGATTAACTATTGCTAGGGATGTGCTTAAGATATATGTTAGTGAGAGAAAACGTTTGAAAGACATGTTTAAGACAAAGCGTTATAGAGTTTCTCTCACTACCGACTGTTGGACTTCTGGAAAAAACATTAATTATATGGTATTGACGACACACTTTATTGATTCTGGTTGGAATTTACATAAGATGATTCTTAGTTTTTCTACAATTGAGAATCATAGAGGGGAAACCATTGGTAAAACCATCGAAAAGAATTTAAAGAATTGGGGTATTGATAAAGTTATGACTTTGACGGTGGATAATGCTAGTTCAAATGATACTGCAGTTGTTTATCTTATTAAGAGATTTAGTAGTGGGTTGTTATTGAAAGGTGATTTTTTTTGCACGTTAGATGTAGTGCCCACATATTAAATTTTATTGTAACTGATGGTTTCAAGGAGCATTATGAGTGCATTAGTAGGATTTGCAATGTTGTAAGATTTGTGAGGTCTCCTCCCGCACATGCTACGAAATTTAAGAAGTGTATTGAAATTGAAAAGATTACTTCTAAGAGTCTGGTGTGTCTAGATGTTTCCACTAGATGGAACTCTACATATATGATGCTTGAGGCGGCTACAAAGTTCCAAAAAGCTTTTTAGAGATTAGAAGATGGGGCTCAACTTTTCAATGAATGTCAACCAACTAGAGAAGATTGGGAGAATGCTAAGAGTTTAGTTAGGTTTTTGAAAGTGTTTCATAATGTCACTTTAAAGATTTCAGGCTATTTATATACCACGTCTAATGTGATTTTCCACCAAATTAGCACAATTCAAAATTGCATACAATTGACTTTTTATGGATAGTGGAAATCAAATTCTTTGTGATGTGGCTAAGAGCATGAAGTCTAAGTTTGAGAAGTATTGGGAAAATAATGAGAAGACCAATTTGTTGTTGTATGTTGTTGTTGTGTTGGACCCACGACATAAGTTGAAATTTTTGTCGTATTGTTTCAACAACTTGTTAGAGCCAACTTCTGCTAAAAGCATGACAGATAAAGTAGAGCAGGTTTTGAGACAACTCTTTCGTGCATATAAATCTAGTTCATCTTCTTCTGGTGGTAAGTGTCATTACTAAAGAAAATGGTGGGACTCAAACTTTGGATGTTGATGATGAGAGTGAGACATATGGATTTGCACGTTCATTCTCGAGTGTCGAGACTACCACTTTTGATAGAGAGTCAGAGATTGATGTATATTTATTAGAAACTCTTGCTAAGGATGATAATACTTTTGACATTCTTAATTGGTGGAAGCATAATAGTCATCGGTTTGAGATAATCAATGCAATAAGTAGAGATATTTTGGAAATTCCAGTTTCGACCGTAGCATCTGAGTCTGCTTTTAGAACTGGAGGACGTGTAGTTGATTCCTCTCGTTCTTTATTGGCTCCAAAAAACACTAAGTGAGCTATAATAGCCCCCACTATTATAAGCAACACAGGGGGCCAAACAA
CCCCTCAGTGTTTGGGGCAGCAACAAAAATAAGAGATGAAGTGGATTATTTTTACCTCTATGTTGCTGCAGTAACCACTATTCAACCCACCCGGTTATTCTATCCATTCTCTACAGTAACTCTATCCTCTCTTCTTCCATGTGACTCCTTGACTTTTCTGGCGACCACCGACGATTTTTTCTGCGACTGCCTTCGACGTTCTAGCCAATTTTTCGACGACTGCTTTTTTTTTTTTAATTTATTTATCTTTCTCCTCATTTTCATTTCAAAAAATATAAAAAATGAAAATAAAGAAAAAAAAAAGAGAAAAAGGTAGAAGAAAAGAAAGAAAAAAGGGAGGGAAAAAATTTCAGTTGCCGAAAAAGTTGAGAAATTTCTAGGGAGTTAACTAGGTTGTCTTTGTACATATATAAATATGAAGTATTATTACATAATTACTTTTGATAATGAAGTATATGAATGTTTACCTTCATCGAACCATATTTTTCATTCCAAATTATATATTGTGTATTCAAGTTTACTGTGATAATAATAATTAAAAAAAAACACTACTCAAGTTTTTATTTTATGTGATTTCTATTTTATTTGATTCTTTTGATAATGAAGGTAAAATGAGTTTTCAATGAGCATTTACCTTTTCATAATGGAAATACATAAATGAAACAAATATAGTTTTTCTATATGAAATCATAAAATGACTTATTTTGTGAAAAACCATGACTCTAAAACAAAGAAAATCAACCCTTACCTTAGGTTATTTTTGTTTCCTTTTAAGTTAAACACATCAATTAACCCCTATATCCAAAACTTTATTTTTTTACAACTAAACCCAAACATTTGTGTATTGATATTTGCAAAATATGATATTTGACGTACAAAGTTGTATGAGAAAAATAAACAAAACAAGTCGGAAAAAAAAGTCAGCTTGAACCTTTGATTCAGAACCACATATTAAACAGATAAACACTCCAAACACAAACTTATTTAACCTAGACTAAAATAATTTGCATCCCAAACAAAGACTATTTTTGTCCGACTAAAATAATCTACACCCCAACCACAAATTATTTTATCCCAGACTATAATAATTTGCACTCTAAACACAGACTATTATAATTTTCAGACTATTATAACTCCCAACTCGTCCCAAATAACCCTTAATGTTATATCTTTGAGAAATTAATATTTAATAGTGAGTAAAGTTTAAAATATTGCACGTGTGGAAAATCCAATAGAAAGGCAAGGAAAAAAAAAAAAAAATTCAAAAAAAGAAAAAAAAAACGACTGCCCGCAACCAAGGTTGTGGGCCGACCGTTCCATTAGACCGCAAGGAGTACGTTATCAATATCTCAAGGGCTCTATAAATATGATCTCCCTCTATTGATAGCCGATTCATTTCTTCCCTTCCCGTCTCACATTGCCCTAATCGTCGCCAACGGAGTTTTGAGAGGCTACGTTTCCAGGTATTCCGATGCTTTCTTCTGTTAATCCCACGTTATTTTTCTGAGTTCATATGAATTTTAGGGGCCATGTTGTCGTTATAATCATGGGCAGTCCGTCTCTCGTCTACTTTTTCATTTTAACATTCTAATTTTGTCAATGTTTGATCTTCAATTTTTTGTTCTTTTTCAAATTGACGTATATCTCTTTCGAATTGTTGTTCATCTTCTTTCTGATATGTAATTGCCGTTGGAATCATTTGGTAGTTGTCTACTTTAGCGATAGGTTATTGAGGGTTGAATAGATGTTGTCGAACAAATATGTTCTACCTTGATCGCCGTCATTTGCAGTAGCCTCTCATTCTGTATACATTTTATCTGTACTGGTTTAGGATTTCGTACGATTTAACTTTTACTTGTGTCGTATTGATGTTGTTTCTCACAGTCGCTACCCTCTTCCTCTCTTTTTCTGTGAAGGGGTTTTATTCCACTTAGCAGAGCTGCAATTGCAATAAATTTCTTTGAATTTTATCCTGTACTTTATGACATTGGTGCTATGAA
AGTTGAATGAACAGTAGTTATCGTATTAATCTTTCCGTTTATATGTATTTATCCTTTTTTTATTTATTTTTTAATTTTATTTTGGTGTGATTTTCAGTTCATCGGTAGTGCTGCAAGTTAGTATAGTATTCCCGTATCATTAGAAAGGTTAAGAGATGTGAAAAAGTCTACTATTTCTCTCTTTCCTCCTCCTCTCTATTGTTCGTTTAGTTTCTTAACAGGGATGGTTGAGTTTTTTGGAACAATGCCATGCCATTGGCTTGCAAGGTTCATCTTGACTGCGTGCTACACTTGGAATAACAATTAATTCGAGGTGGATGAAAGGTACTTGAGATGAATAAGGTGATCAATTTCATGCCACGACAGACCTTGGTATCGACCACAACCACGTGCATGCATGAAGGGGTTGATACTTGAAACTTGACTCAGAAAGGCAAATTTTAATAAATTTTGGCAGAATATACCTTATAGCACCTCTATTTGTCAAGAAGAGTGAAAGTTTATTCCAATTGGTAGAGATTGGAAAGTGGCATCTTATGTGACACTTAATTTTGACTCAATGGAAAGTTGGAGAAGTTTTGCACTTGGGATGGAAGGAGATCAGTTATAAAAAATACAAGTTTTGGTCAACTTGGATGATGGATAGGTTAACATTGAAGACATTAGCTTCTTCCACGAAGTAGCAAATGTTGTAGAAAATGAGATATTTATCTCTTTTTCTTAAATTCGCAAGCTTTGTGTTGTGTTTGAAAATTTTTCATTTCCAATGTGTTGCATTTGCAATAATGCTTCCTAATATCCTTCTAACATTCTCAGCCAGTTGCTATCCCTGAAAAAGTCACTGAAAAAGTCACAAGGAGTTCTATTCCTTTTCAAGGAAAATTATGAATCATGGTGAACATTGATGACTAGATGAATATTATATTTATTGAATTGACATAAAGGACTTGGACATGTATTTTAAACTAAGGAGTTCCTTTTCCTATTACAAATGGAAAGAATTGGGACCTTCATGTTATGGTTTACTTCAGTATCTAAATTAATTCCACTTAGTTCGAATACCTACCACCAAAGAAAAGAAAGTGCTAGAAGTTTTCTACAACTTGGAGATTTTGATGTTCCATTCTCCCCATTTGTTCTTGTCTCTATATTAAATAGTAGTGCTTATATTTTTTCCTAGGATCTTACATGTTGTGGGCTAGGTTTCTTCATATTTTAATTATAAGGTTGTGGGTAATATAGCTTAAGGTATTGTGATCCATGTTTAGAAGTCGAATCACGTTTTGATGTTTCCTCCTCCAGTAAATCTTGTCGCATGAGGGAGGGGTTAAAACAATGGTTGAGTTCCCCCTTTATGGTGTGTGATATGAGGTGTTTGAATAGCTCAAGGTAGAAATCATATTATGTGTTGTTGTTTCTATCTTTTGGCAATCTTTTGGTGTGTTTAGGTGTTGATAGATTTGTCTCCTTTTTGGCATGAGTGAGATGTTTGGAATGTTGTTATGGATTGAAATATGGCCAAGTCATTTAGATTTTTGGGATTCACGTAGGTTTCACATCATGCGGTCTCCAAAATGCTTCAAACTAGCCCTTTAAGTCAACAAAAAGGTGATGTGACATTTCATGAAATTGATATTATATTACCTTTAAGGTCGATTGACAAAAGATAATGAACTTTATAATAATTTTATTACTTTTTAAGGCGACATGGCCAGAGCTAATTAAAAAATATAACTTAAAACTCTCCTTTTCTTCCCAACCCCTATTCTCTCCCTTCCCTTTTTCTTACTCCCCTTACCTTCTCACTTAGCCCCTCTCTCCCCTGCTTAGCCACCCACTCTCTCCTCTTTCTTACCCAACCAACCACCCCCCTTCTTGCTTCCCCCCCACCCCCCACCCCCACCACGGCTAAGCTTGAAAATTTGTTCTCTAGCACTAGTGTTTGCTCATTGTACCGTTTTAGTGTTGTTTAGTTTTCGTTTCAATTTGGTACATCATTC
TTGAGGTTGCTCCATCGTTATTTTCTGCCACTTATCGTGTTTGAAAGGAAGGTTGGGCAACTCAATTTAGTTTTTCATGCTTAAGAGGTGTGGTGTTGAAGGTGGGGGAGGGGAAGATGGGCAGAGATGAGGGAGGGTGGAAGGTCACCGTGCAAAGAAAAAGAAGTTTCTTAGTGTTGCTATTTGCTCATTATGTGTGTGCTTTGAAAGTTTGCCATTTGGTCATGCTCTAGCTTGGTTCTTTGCTAAATGTTCATTGTTGTTCCTCATGTTGCTTTGTTGTTCTCCAATATTGCCATGTGCCGAGGGTAGGCAACTTAGATCTAGTTTGCTTTAGTGACAATTGGGAAAAGGAAGAAAGAAGGGTAGGGAGGGTTAGTAAGAAAGGAAGATAGGGAGAGAGACATGGGGAAGGTTACCTGGACGAGAAGGGGGAGAGAAGGTGGCCATTGGAAATCCTATTACATCAACCTTGGTGTAACTTAAAAGAAGGGAAGAGAGAGACATGGAGGTGTGAAGAGGAAGTGTATAAAAGGGTTAATAAGAAGGGGAGAGAGAGCGATACTGGGGGAGGTTACTTGGACGAGAAGGTGAGAGAGGGTGGCCATCTAAGAAGCACAGATACTTTAGTGTGACTAGCATGTCCGTGTTCCACACATGTCAGACACTTGGACACTTAACATTTTAGTGTGGCTAGCATGTTTGTGTCTTGTCATATCTGTGTCCTGTATCCGTATCCATGCTTCAGGCCATTGGAAATCCTGTTACATCAACCTTGGTGTTACGTAAAGAAGGGAATATAGGGACATGGAGGTGTGAAGAGGAAGGGTAGAGAGGGATTAATAAGAAGGGAAGAGAGAGAGAGACATGGGGACGTTACCTGGATGAGAAGGGGGTGAGAATGTGGCCATTGGAGATCTTATTACATCAACCTTGGTGTAAATTAAAAGAAAGGAAGAGAGAGACATGGAGGTGTGAAGAGGAAGGTAGAGAAGGATTAATATGAAGAGAAGAGAGAGATATGGGGGATACTTGAACGAAAAAGGGGGAGAGATAGTGGCCATTGGAAATCGTATTACATCAACCTTGGTGTAACATAAAATCATTTAGAAATACCTCATCATCTTATTGGTGAAGTAGTTAGAACACAAAAAATAAACGACACATTATTTTGAATCATTTTAGAAACCTTAGGGATTAAGTTACAAGTTTTTGAAACTCTATACTTTGTTCATTATCAAGAACTTTGAATCAAGTTCGTTTGTGAAACTTCATGGGTTAATGATGTATTTATCCCCATAATTCTTTTTATACTTAGGATTTGTTCTTGTAAATATGCAACCATTTTGATTGTTTAAGCTATGGCAGTCATCCATGGCAAAGAGAAGAGGGAAAGCATCAAAGAGTGAGGATGGACCTTTGAATTCTATCTCAAATAAAGGTGTTAAGAAACATATCAAGAAGAAGAAGAAGCCAAGAAAAACACATGAAACTTCCAATTTGAAAGTTGCCCATTCTACTCAAAGACAAGAACCTGTCAGTGTTCCAACTTATGCCTCGGTTACCGCTTCAATCCCTCTTGGCAACGATCAGAAGAAACAGAAAGAAATGAAGGGAAAGAAGGATGCCTCTGGATTCATTTTTATGTGCAATGGAAAAACAAAGCCGGAATGTTACCAATATCGTGTTTTTGGCTTACCAAAAGGGAAAATTGAAGTTGTGAAGAATATAAGCCCTGATGCGAAGCTATTTTTGTTCGACACAGACTCAAAGCTTCTCTATGGCATATATGAAGCTACTAGTAAGGGTGCATTGGATTTGGAACCAACAGCTTTCAATGGCCAATTTCAAGCACAGGTAAGAAAAAATGCACTTATATGGATAAATTTTCATGGAATCAAGATTTATGCTTTTTTTTTTTTTTTTTTTTTGGTCAAAATGGAGGTTTGAAACTTAAGTGTATAACTCAATAAGTTAAGATATGGTGGCAACCCAAATCG
TTTTTGGGGCATAGTTGTCTTACCTAGTTCAAACATTTCAATCAAGATTTTTGAATCAAGCCATCTTTTTGGCATTATTTATCATGTTAGGCAATGTGTTTACCTTATGGATTAAGCAAATGGTGTAGGTTTTGCAAGCAATTGAATTTTTTTATGCTTTGGGGACTAAACTAGTGATTGCTCTCTAATTAACCATTTAATTCAAATATATCAAGTTGAGAAATAGTTCTAAATGGTTCCCGGGGTTAATTGACCATTAGTTGACCTAACGAAAAAATGATGTGGCAGTTAATTAGTAAAGGACATTTTTTAAATGAGTTGGCAAATAAATATTAATTTTAATTATGTTGCATTTTCTCTCTTCCCCTTTCCTTCTTTCTCATTTTTTCTTTTGGTCTTCCACCTCTTTGGCAAAGGTAGTCGTCGCCCATCCTTCTCCCAATGTATTCTCTCTACAATTTCATTCAAGAGGAATTCCAGATCGTTCTAGATCTAGTTGATGGGTGCCATTGAGAAACTACCGTATCACTAGCGTCCCATTCGAGTAGGTCATTGACCCCATCTACCCAAGTGGAAGCTTCAACCCCGTGGGTCTTGCTGATGACTGAAAGTCCTTTATTGTGCTGAAGGTGAAGGAGCTTCAAGAATGATACTTTGGTTATGTTCTCCATGTTTGGTTCCTTTTTGTTAAGGCCATTGTTACTGAAAGGGTCCTCCGGAGAACTAATTGCTGGCAAAATCAACTACAATTTATCTGGAGAACTCATGCACCAATGAGCAAGGATGGGCTATCATGGCAGTGTGGTGGATATACTGAAGGAGGAGATGGAAGACGAAGAATAGGGAAGAAGGAATAGGGATAAGATAGAGGAGGTTAGAAAGAGAAAATGACAAATAATTAAAATAAATATTATAAAATTTATATTTATTTGCCAACTCATTTAAAACTGCCACTTCAGTGAAGTCTCATCACTAATTAACTGCCAAATCATCTTTCCGTTAAGTCCATTAATGGTTAATTCACCCTAGGGACCATTTAGAACAATTTTTCAAACTTCAGGGACTAAAATGTTGCTTTTGAAACTTCAGGAACCAAATAGATACTAAACCTAAACTTCAGGGACCAAAAATGTAATTTGCCCTTAAATTTATCGACAAACGGGACGGGGAGGGGATCCTTGTTTAGCTTGTAGAGGCCCCCTCCCCAACTCCCCTCTTTGTTCCCATGGGGAAAATGGATACCCCTCTGCATTATGCTAATCTAAGGAACACAAACACTTTAGCTTGATTGACATGTTGCATTTGACATGTGCCAGACAATTGGACATAGGCACTTTTAGATACACGACACACACTTGTTAGCATAATAGATGTGTTAAACACTAGTCATACAACGTGAATAAGGAGGACTGAACATTTGAAAGGCATGTGTTCTATAGGTTGTACAATTGTGTTCTTGAGTTCAAAGGGAGCTCTTTGGAGTTGGGTTCACTTAGTTTTTCCCCTAGTTCGATGAACCCTTTGAAAGAAACCATTGAATCTTAACAATTTCGTGAGGAGTATGAATGAAAAAGAAGGAAGAACATGTTTTTATTTTTATTTTTATTATTTTTAAAAACTTTGTAATAAAGAGTAAAACTAACCTTGAGGTGACGTGTATTAGGGCTCGATGGGTTTGATGGGTTCAAGTCTGGAGGATTGGGAACTTAATAATTTAAAACTTTTGTTGTCGCCTAGATCTAGCCTTGGGCGAGTAAAGCTGTCCCTGCAATAAGTGGGGGTACACATTGTAAAGTGTCCTTGTGAAATTTTCTATTAAATTATGCAGGGATCAGGGTAGGAAATTGTAAGGTATCTCCTTACAAGATGATAGTATGTAGATGTATTATAACAACCAAAAGGCTTTACAAACCCAACAAGTTCAAGAGTGCTGGAATCTCTCCTTCTCTTACAGAGACTTCACACTAAAACAAAGCATAAAAAGAAACTACCTCCCACAA
TACACTTATAAACTACCCTTACCCTTACATAACTTCCTCCCATAACTCCTAGACCATTAACCAACTACTTATTTCCCCTTTTTGCCCCTCCTAGTATATACTTTAGTAATTGGGGGCCTAACATTACTCCCGGCGCGATTAAAAACCTTGTCCTCAAGGTTGAAGTCAGGGAATTGCTGCTGAATGTCGCTCCACTTTTCCCAAGTGGCTTCATGCTCGGGCAAACCTCGCCACCCAATTAAAACCTCTGTAAACTCATGGTTAGCGGCATAGACATCCTCCGGATAAGCTAGCCACTCAAAATCATCTGTCAAAACCGGAGGCCCATTGTGGACAGATTGATGTTCCCCCAAAGCTTTCTTCAACTGGGATACATGAAACACGAGATGAATCATCGAGTCTGGGGGCAATTCCAACCGGTAAGCCACGGATCCGACCTTTGAGGAAATTCGGTAAGGCCCAAAAAACTTAGGAGACAGCTTTTCGTTTCTTTTCTTGGCAACTGATGCTTGACGATAGGGACGAACCTTAAGGAATAACCAGTCCCCCTCCTCAAAACTCACTTCCCGTCGCTTCCGATCAGCAAACTTCTTCATTCTCTCTTGTGCCTTCTGAAGATGATCCTGAGAACTCGGATAATCACATCACGCTCTGTTAATTGCTGATCAAGGTTCGCATTTGTTGTCTTCATATCACCATAAGTCAATAAAGGTGGTGGTGGACGACCATACACAGCCTGGAATGGGGTCATTCCAATGGAACTATTATATGTAGTGTTGTACCAATACTCCGCCCAACAAATCCATTGACTCCACTTCTGAGGCTGCTTGCTGCAAAAGCATCGCAAATAGAGCTCCACAGTTTTGTTCACCACCTCTGTTTGCCCGTCCGACTGTGGATGATAAGCTGTGCTTCGACATAATTTAGTACCCTGCAAACGAAAGAGCTCAGACCAGAAATTGCTAATAAATATCCGATCCCTGTCCGAAACTATGGATTTGGGAAATCCATGAAGGCGAACCACTTCGGTGACAAACATCTCAGCCACGGACTTGGCAGTAAATGGGGGTTTCAAACATAGAAAGTGCGCTGCCTTACTCAGTCTATCCACAACCACCAGGATCGTATCCATCCCTTTTGATCTCGGCAACCCTTCCACGAAATCCATGGAAATCTCCTCCCAAATGCGATCGGGGACCTCTAGAGGCTGCAGCAGACCTGCTGGAGATGCGGCCATGGTTTTGTTTTGTTGACAAACTAAACACTCAGCCACATATCGCTTCACATCACTTTTCATTCCTTCCCAATACAATTCTCTCGTCAAACGCTTATAAGTTCGCAGAAAACCCGAGTGACCACCCAATACTGAATCATGAAAGGTATGTAAGATTGAAGGAATTAGTGAAGAAGTTTTAGAGAGGACTAACCGCTTCTTGTATAGTAACTTCCCTTTACTCCACTCGAATTTAGGGACACTATTAGGATCATCCTGCAGCTTTCGGACGATGCTCATCAGTTTACTGTCTTGAGCCACCTCTCTCCCCACTATTCCCACATCCAAGATGGCTGGTGCCACCAACTGAGTGATGTGAACTGCAGGCAGCACGCGAGACAATCCATCTGCCACCCTATTCTCTGGTCTCGGTCGATAATGAACCTCGAAATTGTAGCCGAGCAGCTTTGCTAACCACTTTTGATATTGCGGTTGAACCACCCGCTGATCCAATAGATATTTCAAGGCTTTTTGATCCGTTTGGACAACAAATCTTTGACCCAACAGGTATGGCCTCCATCGTTGCACAGCCAAGACAATAGCCATGAGTTCTCTTTCATAAACAGATTTAAGCTGGGCACGGGCTGATAACGTGTGACTATAATATGCAATTGGCCTCTTATTCTGAGTCAAAACTGCCCCAAGACCGGTCCCAGAAGCATCGGTTTCTACAACAAAAGCCTGGGAAAAATCTGGTAAAGCTAAGACAGGTAATGTGACCATTGCC
TTCTTCAGTTGCTCAAAAGCCTCCGTGGCTGCCGCCCCCCATTCGAACGCCCCAACCTTGAGCAACTGTGTCAATGGAGCAGCGATTTTCCCATAATTGCACACAAAACGTCTGTAATATCCAGTTAACCCCAAGAAACCCCGCAACTCCCGCAAATTCCGATGGCGCAGGCCACTCCATCATGGCACAGATTTTCTCTGGATCAGCTTCCACACCGCGGGCCGAAATAACGTGCCCCAAATATTCAATTCTCTCCTGCACAAACTGACATTTAGACATGTTGGCCTTCAGCTGGTTGCTCCGCAGAACATCAAACACCACGGTTAAATGTTGCAAGTGGTCCTCATACGTCAAACTATAGATTAATATGTCATCAAAGAAAACCAAGATAAACTTACGGAGATACGGCTTGAAAATGTTGTTCATCAAAGCTTGAAAAGTTGAAGGAGCGTTCGTGAGTCCAAAAGGCATAACTAAGAATTCATAATGGCCCTCATGAGTTCGGAAGGCTGTTTTCTTAGTATCATCAGGGTGGACGCGAATTTGATGATATCCGGCCCGCAAATCAATCTTGGAGTAGACCTTGGACCCATGTAGCTCATCCAACAACTCCTCCACCACCGGGATAGGGAACTTGTCCGGCATCGTGACATTGTTCAAAGCTCGGTAATCAACACAAAACCTCAAACTGCCATCCTTTTTCTTAACGAGGACGGGGCTGGAAAAAGGATTGACACTAGGCTGAATGACCCCAGCAAGAAGCATCTCTGATATAAGCGTTCGATCTCATTCTTCTGAACCTGCGGATAACGATATGGGCAGACATTTACCGGCTGAGCTCCCTCTTTCATGTGTATCTGATGATCAATTCTTCGCTGTGGTGGTAATCCGTCAAGCATCTGGAATATATCGGCATACTGGTCGAGGAGTTGCTGAGTGGAGATAGGGACATGCTCGATTGGAGTAAGAGCAGCAAACTCAGCATAATCGACCAACGCCTCCAGGGATTTGAACTCAACCAAGTAACCCTGGTCATTCTCGTCCCATGCTTTCATCATCCGTTTTAAAGACACACGGGTCAGTCAAACTAGGATCGCCTTTAAGGACCACATCCCCCTCATCTCGCTCAATAGTGATCGTCAAAGTGGTCCAATTCACCCTTGTTTCTCCCAAAGTATGGAGCCAATGCATACCCAAGACCACATCAACGCCCACCAAATCCAATGGAAGGAAATTCTCAACGACGGTAAGATCCCCAAGAGACAAGACCACACCTTTACATACGCCTCGACCCTTGATTGGTGGTCCCGAGCCCAGAAGGATCCCATAATGTGTCGTTTCGGTTAGAGGAAGTTTCAGGTCATCGACCAACCGTTGGGTAATGAAATTGTGGGTAGCCCCACTGTCGATCAAGACCATTACTTACTCCATATCAATCTTGCCCTGCATTTTGATCGTACCAGGTATTTCCACTTTCTTAAACTTGTGACGATCGCTCATGGCATCCACCACAGGATCGGGCTGAAGCTTACCCTCCCCATCGTTGCTTGCGCCCGAGTGCTCCCCTGCTTCTATCGGCTCCTTATGACGGAGACTTGCGCCATCAGTGATGCCGGAAGGCTCAGGTCGACGAAGATTGGCCACTTCCGCCGCAAGAGTAGCAATGGCCTTCCGCACGTCCTGCATCCCTTCTTCAATGCCGGTCTTTAACTCAAACATCTCACGCTCATTCATTGGCCTCTAGCCTTTCCTCCATCTTTTTCTGCGTCATCTTCGACCACCTACCCAGGTTTGATGGCTCTGATACCAATTTGTAAGGTATCTCCTTACAAGATGATAGTATGTAGATGTATTATAACAGCCAAAAGGCTTTACAAACCCAGCAAGTTCAAGAGTGCTGGAATCTCTCCTTCTCTTACAGAGACTTCACACTAAAACGAAGCATAAAAAGAAACTACCTCCCACAATACACTTATAAACTACCCTTACCCTTACATAAC
TTCCTCCCATAACTCCCAGACCATTAACCAACTACTTATTTCCCCTTTTTGCCCCTCCTAGTATATACTTTAGTAATTGGGGGCCTAACAGAAATTCGTAGGGATCGAGTTCCTTGTTGGAAATTATTCCCCTTTTTATATATTATTTATTTCTTACTTTTAATCTCGATGAAGTTAAATGAAATATTTTGAGAAATCTCCATTTTAAAGGTTAATTTCCTACATGGAGTTTCCATTTTCAAATGTTGAACCATTTAATTCAAATATATTAAATTAAGTCTTTTAAACAACAAAACTTTAAATTTGTGTGTGTGTGTGTGTATATAAAGATACAACTAGGACCGAGAGGGGGATCCTTGTTTAGCTCGTAGAATCCCCCCCCCCCACTTCTTCCTTTGTTCCCATGGGGAAAATAGACATCCCCGTGCATTATGTAACCTTATAAGGTGCATAGACACTTTAGCTTGGTTGACATGTCCACGTTTGACACGTGCTAGAGGCACTTTTAGATATGTATCACACACTTATTAGCATAGTAGATGTGTTAAACACTAGTTGTACAAATTGGATAAGGAGGACCAAATATTAGAAAGGCATGTATAGCAGACTTGTTTAAGCATAATAAGTTGGCACAAATAGTAAGGTAGGACAAATAATAATTTTTTTAGGAGCGCTCTATTAGGTACCCAAATAAATACAGAAAACTCACAATAGTAATATATTTCAATATCAAAAGGAAATTACAATATCAATAGCCTTTCGAGGGGTTGTCTCCTAAGAAGCATGGACACACGACACGCTAGCGACAGGGACACAGACACGGTGACACGCCAATTTCTAAAAATATAGGACACTGGACACGACATGTTGGGGACACGTTATTAATTTTTTATTTTTATTTTTATATATTATATGTATATATATTGTCATAGATACTTCTAAATAAATCAACCGTACAAATTTATCATAACAAAACAATTATGCTTCATTAATAATTAATAGTTAATAACTTAATACTTAATACATAAACGATAATGCTAATTGCTAAGCATTGTTGTCCAACATAAAATTACAATACAAATCAAAAGTACAAGTAAGAATTTACATCTTACACATAAAACAAACTAAAACTCTTCAACTCTCAACTCAAACTTCTTAGCTACTTCAAACATTCCAACATCTTCAAAAGAATCAAAAGAATCTCTAGCAACATCCCACAATTTTGTCTCGTGTAAATATTTTGGAGTTCTTGACAAGAGATGCAAATTACTATGAACATATACTAAATCATCTGCACGTTGTGGTAGCATCTTGTTCCGTCTAACAGAGTTAATAAATGAGTACGTGCTCCAATTCCTTTCACAACATGAAGATGAGGAAGGTTGGCCTAGTAGCTTTAAAGCAGTTGATTGAAGGGTTGGTGCATAAACACCATGTATAGTCCACCAATCTTTAGCATTCTCATTATACCTAGCAATAATGGAATCAACATTATTAAAATCTTTTACCTTTGTTGAAAATCTTGCAAACTCTAAGTTCACTTTTGCACGATCTTCTGGATCTGAAAAGTATCTCTTGATACACTTCATTCTCTCACGAGTTACTTCCATGTCTTTATGTGGTGGCACTTGATTAGGATCCTCTTTAAGCCATTACTCACTATAGTACCTAAATATAAGTTTTAAGTAACAAAAGTTGAACTCTTAAACATTATATTATATATATAGAAATAGAATGAACGAAAGATTTAATGATCAGAAAGTACCTTGGATTCAAAGAATGTGCCAAACAATGAAGTGGTGTATTGTTCTTGTTCCAACGATCCGTAAGAATATTATGTATCATGTCATAAAAAGAAGAGACTTCTTCTCCATTTTTTCCTTCATGATTATATATTGTCGCCTTCACCTTTTCAATCATAGAATCCCACGTCATAAACCAAATGAAGGGTAGGTGCATCAGTATCACAAACTCGAATCATGTCATATATAGGTGAAGTG
AATGAAAGGATATAATCAATTTTGTCCCACCAAACATCATCCAAAACCATTTGTTTCACATATCTCGCTTTTCCCACACCATCCTCTCTATAACATGTCCATTTATCACTTATAACCATAGCTTGCAACCCACTTTTAATGAGCTTAAACCTCTTGAGCATAATAATGACGGATGCAAAACGGGTTTCTACCACGGAAAGCAATTTCAAAGATACAAACTCGTTAAAAATAGCCAGCCTCATTGAATGATTCATGATAATGTTCTTCACCGCCATCACATCTCTAGCAATATCAGAAATCCAACACATTCCTCATACACTAATATATTGTTCTCAACATTTTTCGCTGCACATATATTCTTTATTCTTTGAGGCAAGATTTTTCTTTTGATAATAAACCAGGCTTTCATTGAGAATGGGTTAAGTGACGGTGCGCGTAAGCTGGCTCGGACACTCACAGATAAAAAAAAAAAAAAAAAAGAAGATAAGCTGTAGTTTCAAAATAAAGGTGGCAATTTTCACCAATTCAAAGCTGAATAAGTGAGGGATTCAAAAATTAAATGAGTTTGGATTGATTAGTATTTTTTTTGTTGGTTGATTAAAGTGGTGCATAGCTTCAATTGTCTGTTGAGCTGAATATCTTCTGAGGCATTAAGGATATTTCTGCCCTATACTTTCTAATTAATTCATTTTCTTTCTTTTTTAATTTTTATTTTGATATTTTAAGTATATGAAACTGATATGTCATTAATTAATGTTATGCTTTGGCATGGATATCCTTTCATTTGATAAGTTATATCCTTTTATTTTTGATTGATGGTGGATGAGAAAAGGCTATTTTAGTCCTGTACATGCCTCAAGGTCTACTAGGGTGCAGAAAACCTGAAGCAAGTGCCTCTGCTTGTAAGTGGCACTTGCGATGTCTGAATCTTGAGCATTGGCGAGATGTGACTACCATTGTCTTGTTTGTTGCCATTAGTGGGAACTTTGAAGATATTGTTAGATTTGGTGCTTCACGTGTTATTTTATATTTTATATCATATGGCTATTTATATACTTCTGTCATGTAGATGGCTGAGACTTATGCGTTGAAATAGAAAATAGCTAATCTAGCTTCCTGTACTCGTTGAGTTATTTTTTGATTGCTTTGTTTTTCTGATGTAGGTGAAATTTAAGATAACCAAGGATTGCTTACCACTTCCTGAAAGTGCTTTCAAACATGCTATCAAAGACAATTATGAGGGTGGTCGGAAATTTAGACAGGAGCTCAGTAGCACACAGGTTACGTTATCCTAAAACTTCCATGTTGAATTATTGAGTTCTAGGTTTCATATCGTTTTCAAACAAATTTTCATTTTTAGTGAAGATCAGTTATTATAATTTTCTGTGGCTGTCTCTTTTAATATAGGTAAAAAGTCTAATTTCCTTGTTCCGCCCTATTGCCAAGAAAAAACCATCTGCTAAAAAATCGCATGGTCGTCCTAATGTGGCAATTCAGCCGTCATTTAGGTCTACACGAACTAAAAAAGTGGTTAAATCATATCCTCCTGATAACTTGTCATCTGGAGTGCACTACCAGCCGATCCATGAGACAAGACCTCAGCATGATGTCCATCTTGGGCAGTATGATCCGTTTGAACCAGGACTACATGTCTCTCATTCACAAGTGCAGCCTAGGCTTGTGAGGGTAGAAGCCCCTCCCCATCATGTTGAAGTTTATCATCCAGAGCAGGCACATAAAGCATTTTTTACTGAAACATCTTTTAGACACCCTCCAGATTCGTATGAAAGGTATTTTTGTTCTTATTATCGGCTTTTCTTTGAAAATAGATTATAAAGAAATAGCTCGCCAATGAGATGATCTTTAAAATATAACAGTATATATTTATATACCAAATGGGTCCCAAGATGTTTGATTGGTGGGGTGATATTTAAATTCTTTCCTTTTTGCATTGAGAACAATTATCTTAGGTAGTTTTAGGATTTGGACTATTGTATGAC
AGTTTGTACGGGAGGCATTTTTATGATGTTGGTTGATCGAGTTGATCTAAAGCTAAACTTTTTAAATGCGGTCCCTTTTTGTAGTTATAACTATTTCACACTGCACAATGATTTTATTGGTTATAATTTTTATGTTATTCATAACCGTGATAGCTCTGTTGTTGCTAAATCATGTGTGCTTCAAGGTGTTATTCCTCTTGTATATGAAAGATGGAATAAAGATCACTGTGCTTGCAAGGTGTTTAAGTTGATTACCCATGCTATGTTTTTCTTTTTAACTTCTACAGTATTCACAGTACGATTGAGACAAATAATGGGGATCATCGGTTTGTGTATGGGCGTCAGTATCATGCACCACAGTTGCTGTCAAATCGAGAAGATGTTACTCACATAGATTATACTCCTGGTTACTATAGCCAGCGGCTGTCTCCTACGACCTCACATACTGCTACTTTGTCACAAGCCAGATCATCTGCATATTATTCACAGAACAGGTTGCCATCTCCATATCGCCCACAGAGGTTGCCATCTCCATATCGGGCACCAAACAGGTTGCCATCCCCATATCGTGCACAGAACAGGTTGCCATCTCCACATCGCTCTTACTATTCTGCTGTGGCCTCTCAGGATCGAGGCAGGGTGTTGTATGCTGGTTCGCCACAAGGACTGGCTTCTGGTAATCTCAGCTATGGTGAATCAAATTTGCCAATCTCTTCTTATTATTACTCTTCAGCTTCTGCAAGGCTAAGCTATCAGTAA

mRNA sequence

ATGGATAATATCCCAGAAGAAAGTTCGGAAGAAAGTTCGATCCATGTTGTGGAGAAATCAAGTGTAGCTAAGAAAAGCAAAGTGAATCATTCGAGTCCAACCAAAAAACGTAAGAATACTAAAATCTCTGTAGTTTGGGGTCATTTTAAGAAAATTAAAGATTGTGATCCTAATGATCATTATGCTAAGTGTAAATACTGTGGGGCTAAGTATGCATGTCATTCCAAGCGTAATGGTACCGACAATTTAAAACACCATTTAGAAAATTGTAAGAAATACCTTTATCAAAGAAAGAAAGATCCAGGACAAAAACAATTGGTTTTCAAGCCTAAAGAAGCGATTGATGATTCTAAACCAAAACTTAGTTGTGAGACTTTCAGTCTGGAGAGTTGTAGAAGAGCTCTTGCTGAGTTGGAGAGTTGTGAGACTTTTAGGTTTGTGGAGAACATAGGTTTTCGAAGATTTATCAACAAATCAATAGCCTTGTTTGCGCCTAATTTTGTCCTTCCATCTCGATTAACTATTGCTAGGGATGTGCTTAAGATATATGTTAGTGAGAGAAAACGTTTGAAAGACATGTTTAAGACAAAGCGTTATAGAGTTTCTCTCACTACCGACTGTTGGACTTCTGGAAAAAACATTAATTATATGGTATTGACGACACACTTTATTGATTCTGGTTGGAATTTACATAAGATGATTCTTAGTTTTTCTACAATTGAGAATCATAGAGGGGAAACCATTGGTAAAACCATCGAAAAGAATTTAAAGAATTGGGGTATTGATAAAGTTATGACTTTGACGGTGGATAATGCTAGTTCAAATGATACTGCAGTTGTTTATCTTATTAAGAGATTTAGTAGTGGGTTGTTATTGAAAGTTCATCTTCTTCTGGTGGTAAGTGTCATTACTAAAGAAAATGGTGGGACTCAAACTTTGGATGTTGATGATGAGAGTGAGACATATGGATTTGCACGTTCATTCTCGAGTGTCGAGACTACCACTTTTGATAGAGAGTCAGAGATTGATGTATATTTATTAGAAACTCTTGCTAAGGATGATAATACTTTTGACATTCTTAATTGGTGGAAGCATAATAGTCATCGGTTTGAGATAATCAATGCAATAAGTAGAGATATTTTGGAAATTCCAGTTTCGACCGTAGCATCTGAGTCTGCTTTTAGAACTGGAGGACGTGTAGTTGATTCCTCTCGTTCTTTATTGGCTCCAAAAAACACTAACCGATTCATTTCTTCCCTTCCCGTCTCACATTGCCCTAATCGTCGCCAACGGAGTTTTGAGAGGCTACGTTTCCAGTCATCCATGGCAAAGAGAAGAGGGAAAGCATCAAAGAGTGAGGATGGACCTTTGAATTCTATCTCAAATAAAGGTGTTAAGAAACATATCAAGAAGAAGAAGAAGCCAAGAAAAACACATGAAACTTCCAATTTGAAAGTTGCCCATTCTACTCAAAGACAAGAACCTGTCAGTGTTCCAACTTATGCCTCGGTTACCGCTTCAATCCCTCTTGGCAACGATCAGAAGAAACAGAAAGAAATGAAGGGAAAGAAGGATGCCTCTGGATTCATTTTTATGTGCAATGGAAAAACAAAGCCGGAATGTTACCAATATCGTGTTTTTGGCTTACCAAAAGGGAAAATTGAAGTTGTGAAGAATATAAGCCCTGATGCGAAGCTATTTTTGTTCGACACAGACTCAAAGCTTCTCTATGGCATATATGAAGCTACTAGTAAGGGTGCATTGGATTTGGAACCAACAGCTTTCAATGGCCAATTTCAAGCACAGGTGAAATTTAAGATAACCAAGGATTGCTTACCACTTCCTGAAAGTGCTTTCAAACATGCTATCAAAGACAATTATGAGGGTGGTCGGAAATTTAGACAGGAGCTCAGTAGCACACAGGTAAAAAGTCTAATTTCCTTGTTCCGCCCTATTGCCAAGAAAAAACCATCTGCTAAAAAATCGCATGGTCGTCCTAA
TGTGGCAATTCAGCCGTCATTTAGGTCTACACGAACTAAAAAAGTGGTTAAATCATATCCTCCTGATAACTTGTCATCTGGAGTGCACTACCAGCCGATCCATGAGACAAGACCTCAGCATGATGTCCATCTTGGGCAGTATGATCCGTTTGAACCAGGACTACATGTCTCTCATTCACAAGTGCAGCCTAGGCTTGTGAGGGTAGAAGCCCCTCCCCATCATGTTGAAGTTTATCATCCAGAGCAGGCACATAAAGCATTTTTTACTGAAACATCTTTTAGACACCCTCCAGATTCGTATGAAAGTATTCACAGTACGATTGAGACAAATAATGGGGATCATCGGTTTGTGTATGGGCGTCAGTATCATGCACCACAGTTGCTGTCAAATCGAGAAGATGTTACTCACATAGATTATACTCCTGGTTACTATAGCCAGCGGCTGTCTCCTACGACCTCACATACTGCTACTTTGTCACAAGCCAGATCATCTGCATATTATTCACAGAACAGGTTGCCATCTCCATATCGCCCACAGAGGTTGCCATCTCCATATCGGGCACCAAACAGGTTGCCATCCCCATATCGTGCACAGAACAGGTTGCCATCTCCACATCGCTCTTACTATTCTGCTGTGGCCTCTCAGGATCGAGGCAGGGTGTTGTATGCTGGTTCGCCACAAGGACTGGCTTCTGGTAATCTCAGCTATGGTGAATCAAATTTGCCAATCTCTTCTTATTATTACTCTTCAGCTTCTGCAAGGCTAAGCTATCAGTAA

Coding sequence (CDS)

ATGGATAATATCCCAGAAGAAAGTTCGGAAGAAAGTTCGATCCATGTTGTGGAGAAATCAAGTGTAGCTAAGAAAAGCAAAGTGAATCATTCGAGTCCAACCAAAAAACGTAAGAATACTAAAATCTCTGTAGTTTGGGGTCATTTTAAGAAAATTAAAGATTGTGATCCTAATGATCATTATGCTAAGTGTAAATACTGTGGGGCTAAGTATGCATGTCATTCCAAGCGTAATGGTACCGACAATTTAAAACACCATTTAGAAAATTGTAAGAAATACCTTTATCAAAGAAAGAAAGATCCAGGACAAAAACAATTGGTTTTCAAGCCTAAAGAAGCGATTGATGATTCTAAACCAAAACTTAGTTGTGAGACTTTCAGTCTGGAGAGTTGTAGAAGAGCTCTTGCTGAGTTGGAGAGTTGTGAGACTTTTAGGTTTGTGGAGAACATAGGTTTTCGAAGATTTATCAACAAATCAATAGCCTTGTTTGCGCCTAATTTTGTCCTTCCATCTCGATTAACTATTGCTAGGGATGTGCTTAAGATATATGTTAGTGAGAGAAAACGTTTGAAAGACATGTTTAAGACAAAGCGTTATAGAGTTTCTCTCACTACCGACTGTTGGACTTCTGGAAAAAACATTAATTATATGGTATTGACGACACACTTTATTGATTCTGGTTGGAATTTACATAAGATGATTCTTAGTTTTTCTACAATTGAGAATCATAGAGGGGAAACCATTGGTAAAACCATCGAAAAGAATTTAAAGAATTGGGGTATTGATAAAGTTATGACTTTGACGGTGGATAATGCTAGTTCAAATGATACTGCAGTTGTTTATCTTATTAAGAGATTTAGTAGTGGGTTGTTATTGAAAGTTCATCTTCTTCTGGTGGTAAGTGTCATTACTAAAGAAAATGGTGGGACTCAAACTTTGGATGTTGATGATGAGAGTGAGACATATGGATTTGCACGTTCATTCTCGAGTGTCGAGACTACCACTTTTGATAGAGAGTCAGAGATTGATGTATATTTATTAGAAACTCTTGCTAAGGATGATAATACTTTTGACATTCTTAATTGGTGGAAGCATAATAGTCATCGGTTTGAGATAATCAATGCAATAAGTAGAGATATTTTGGAAATTCCAGTTTCGACCGTAGCATCTGAGTCTGCTTTTAGAACTGGAGGACGTGTAGTTGATTCCTCTCGTTCTTTATTGGCTCCAAAAAACACTAACCGATTCATTTCTTCCCTTCCCGTCTCACATTGCCCTAATCGTCGCCAACGGAGTTTTGAGAGGCTACGTTTCCAGTCATCCATGGCAAAGAGAAGAGGGAAAGCATCAAAGAGTGAGGATGGACCTTTGAATTCTATCTCAAATAAAGGTGTTAAGAAACATATCAAGAAGAAGAAGAAGCCAAGAAAAACACATGAAACTTCCAATTTGAAAGTTGCCCATTCTACTCAAAGACAAGAACCTGTCAGTGTTCCAACTTATGCCTCGGTTACCGCTTCAATCCCTCTTGGCAACGATCAGAAGAAACAGAAAGAAATGAAGGGAAAGAAGGATGCCTCTGGATTCATTTTTATGTGCAATGGAAAAACAAAGCCGGAATGTTACCAATATCGTGTTTTTGGCTTACCAAAAGGGAAAATTGAAGTTGTGAAGAATATAAGCCCTGATGCGAAGCTATTTTTGTTCGACACAGACTCAAAGCTTCTCTATGGCATATATGAAGCTACTAGTAAGGGTGCATTGGATTTGGAACCAACAGCTTTCAATGGCCAATTTCAAGCACAGGTGAAATTTAAGATAACCAAGGATTGCTTACCACTTCCTGAAAGTGCTTTCAAACATGCTATCAAAGACAATTATGAGGGTGGTCGGAAATTTAGACAGGAGCTCAGTAGCACACAGGTAAAAAGTCTAATTTCCTTGTTCCGCCCTATTGCCAAGAAAAAACCATCTGCTAAAAAATCGCATGGTCGTCCTAA
TGTGGCAATTCAGCCGTCATTTAGGTCTACACGAACTAAAAAAGTGGTTAAATCATATCCTCCTGATAACTTGTCATCTGGAGTGCACTACCAGCCGATCCATGAGACAAGACCTCAGCATGATGTCCATCTTGGGCAGTATGATCCGTTTGAACCAGGACTACATGTCTCTCATTCACAAGTGCAGCCTAGGCTTGTGAGGGTAGAAGCCCCTCCCCATCATGTTGAAGTTTATCATCCAGAGCAGGCACATAAAGCATTTTTTACTGAAACATCTTTTAGACACCCTCCAGATTCGTATGAAAGTATTCACAGTACGATTGAGACAAATAATGGGGATCATCGGTTTGTGTATGGGCGTCAGTATCATGCACCACAGTTGCTGTCAAATCGAGAAGATGTTACTCACATAGATTATACTCCTGGTTACTATAGCCAGCGGCTGTCTCCTACGACCTCACATACTGCTACTTTGTCACAAGCCAGATCATCTGCATATTATTCACAGAACAGGTTGCCATCTCCATATCGCCCACAGAGGTTGCCATCTCCATATCGGGCACCAAACAGGTTGCCATCCCCATATCGTGCACAGAACAGGTTGCCATCTCCACATCGCTCTTACTATTCTGCTGTGGCCTCTCAGGATCGAGGCAGGGTGTTGTATGCTGGTTCGCCACAAGGACTGGCTTCTGGTAATCTCAGCTATGGTGAATCAAATTTGCCAATCTCTTCTTATTATTACTCTTCAGCTTCTGCAAGGCTAAGCTATCAGTAA

Protein sequence

MDNIPEESSEESSIHVVEKSSVAKKSKVNHSSPTKKRKNTKISVVWGHFKKIKDCDPNDHYAKCKYCGAKYACHSKRNGTDNLKHHLENCKKYLYQRKKDPGQKQLVFKPKEAIDDSKPKLSCETFSLESCRRALAELESCETFRFVENIGFRRFINKSIALFAPNFVLPSRLTIARDVLKIYVSERKRLKDMFKTKRYRVSLTTDCWTSGKNINYMVLTTHFIDSGWNLHKMILSFSTIENHRGETIGKTIEKNLKNWGIDKVMTLTVDNASSNDTAVVYLIKRFSSGLLLKVHLLLVVSVITKENGGTQTLDVDDESETYGFARSFSSVETTTFDRESEIDVYLLETLAKDDNTFDILNWWKHNSHRFEIINAISRDILEIPVSTVASESAFRTGGRVVDSSRSLLAPKNTNRFISSLPVSHCPNRRQRSFERLRFQSSMAKRRGKASKSEDGPLNSISNKGVKKHIKKKKKPRKTHETSNLKVAHSTQRQEPVSVPTYASVTASIPLGNDQKKQKEMKGKKDASGFIFMCNGKTKPECYQYRVFGLPKGKIEVVKNISPDAKLFLFDTDSKLLYGIYEATSKGALDLEPTAFNGQFQAQVKFKITKDCLPLPESAFKHAIKDNYEGGRKFRQELSSTQVKSLISLFRPIAKKKPSAKKSHGRPNVAIQPSFRSTRTKKVVKSYPPDNLSSGVHYQPIHETRPQHDVHLGQYDPFEPGLHVSHSQVQPRLVRVEAPPHHVEVYHPEQAHKAFFTETSFRHPPDSYESIHSTIETNNGDHRFVYGRQYHAPQLLSNREDVTHIDYTPGYYSQRLSPTTSHTATLSQARSSAYYSQNRLPSPYRPQRLPSPYRAPNRLPSPYRAQNRLPSPHRSYYSAVASQDRGRVLYAGSPQGLASGNLSYGESNLPISSYYYSSASARLSYQ
Homology
BLAST of Lag0022021 vs. NCBI nr
Match: XP_038891726.1 (uncharacterized protein LOC120081120 [Benincasa hispida] >XP_038891729.1 uncharacterized protein LOC120081120 [Benincasa hispida])

HSP 1 Score: 666.4 bits (1718), Expect = 3.6e-187
Identity = 374/519 (72.06%), Positives = 403/519 (77.65%), Query Frame = 0

Query: 442 MAKRRGKASKSEDG---PLN-SISNKGVKKHIKKKKKPRKTHETSNLKVAHSTQRQEPVS 501
           M KRRGKASK + G   PLN S+S KGV K + KKKK RKT ETS LKVAHST R   V+
Sbjct: 1   MTKRRGKASKGKKGKEPPLNLSMSKKGVNK-LAKKKKSRKTSETSTLKVAHSTSRSNLVN 60

Query: 502 VPTYASVTASIPLGNDQKKQKEMKGKKDASGFIFMCNGKTKPECYQYRVFGLPKGKIEVV 561
            PT ASV+ S PL NDQKK ++ +G+K  SGFIFMCNGKTKPECYQYRVFGLPKGKIEVV
Sbjct: 61  GPTCASVSTSNPLVNDQKKHEKTEGEKYPSGFIFMCNGKTKPECYQYRVFGLPKGKIEVV 120

Query: 562 KNISPDAKLFLFDTDSKLLYGIYEATSKGALDLEPTAFNGQFQAQVKFKITKDCLPLPES 621
           KNISPD KLFLFDTDSKLLYGIYEATSKGALDLEPTAFNGQFQAQVKFKI KDCLPLPES
Sbjct: 121 KNISPDTKLFLFDTDSKLLYGIYEATSKGALDLEPTAFNGQFQAQVKFKIFKDCLPLPES 180

Query: 622 AFKHAIKDNYEGGRKFRQELSSTQVKSLISLFRPIAKKKPSAKKSHGRPNVAIQPSFRST 681
           AFKHAIKDNYEG RKFRQELS+TQVKSLISLFRPIA KKPSAKKSH RPNV I+PSF+S 
Sbjct: 181 AFKHAIKDNYEGHRKFRQELSNTQVKSLISLFRPIA-KKPSAKKSHVRPNVGIRPSFKSA 240

Query: 682 RTKKVVKSYPPDNLSSGVHYQPIHETRPQ------------HDVHLGQYDPFEPGLHVSH 741
           RTK+VVKSYP +  SSG HY PI ETRPQ            HDVH GQYDPFEPGLHVSH
Sbjct: 241 RTKEVVKSYPLEKPSSGAHYLPILETRPQHDVHHGHDVYHGHDVHHGQYDPFEPGLHVSH 300

Query: 742 SQVQPRLVRVEAP----------------PHHVEVYHPEQAHKA-FFTETSFRHPPDSYE 801
           SQVQPRL+R+EAP                P HVE YHPEQAH+  +F E SFRHPP+SY 
Sbjct: 301 SQVQPRLLRIEAPPGHGEPYHPRHVEPYHPRHVEPYHPEQAHEGKYFPEESFRHPPESYA 360

Query: 802 SIHSTIETNNGDHRFVYGRQYHAPQLLSNREDVTHIDY-TPGYYSQRLSPTTSHTATLSQ 861
           SI + IETNNGD RFVYGRQYH  Q + +R DVTH DY  PGYYSQRLSPT   TA LSQ
Sbjct: 361 SIRNNIETNNGDRRFVYGRQYHTSQFMLDR-DVTHTDYIPPGYYSQRLSPTALRTAPLSQ 420

Query: 862 ARSSAYYSQNRLPSPYRPQ-RLPSPYRAPNRLPSPYRAQNRLPSPHRSYYSAVASQDRGR 921
           AR       NR PSP+R Q RLPSP+R  NRLPSP+R QNRL SPHRSY+SAVASQDRG 
Sbjct: 421 AR-------NRSPSPHRQQNRLPSPHRPQNRLPSPHRPQNRLTSPHRSYHSAVASQDRGG 480

Query: 922 VLYAGSPQGLASGNLSYGESNLPISSYYYSSASARLSYQ 926
           V  A  PQ  AS NL YGESNLP SSYYYSSA+ RL+Y+
Sbjct: 481 VYAASLPQVTASNNLRYGESNLPSSSYYYSSATTRLNYR 509
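
The identity and positives figures in an HSP summary line are ratios of matched columns to aligned columns. The sketch below is a minimal illustration, not part of the database's own tooling: it parses such a line and recomputes the percentages so they can be checked against the reported values. The function name `parse_hsp_summary` and the regular expression are assumptions made for this example; the sample line is taken from the first hit above, and the pattern also accepts the variant label spelling "Postives".

```python
import re

# Hypothetical helper for this example only; not an API of BLAST or of the database.
# Matches fields such as "Identity = 374/519 (72.06%)". The optional "i" lets the
# pattern accept both "Positives" and the variant spelling "Postives".
HSP_FIELD = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")


def parse_hsp_summary(line):
    """Return matched/aligned counts plus reported and recomputed percentages."""
    stats = {}
    for name, matched, aligned, reported in HSP_FIELD.findall(line):
        key = "identity" if name == "Identity" else "positives"
        stats[key] = {
            "matched": int(matched),
            "aligned": int(aligned),
            "reported_pct": float(reported),
            # Percentage = matched columns / aligned columns * 100
            "computed_pct": round(100 * int(matched) / int(aligned), 2),
        }
    return stats


# Summary line of the first hit (XP_038891726.1)
line = "Identity = 374/519 (72.06%), Positives = 403/519 (77.65%), Query Frame = 0"
summary = parse_hsp_summary(line)
for field, values in summary.items():
    print(field, values["matched"], values["aligned"], values["computed_pct"])
```

Both percentages use the alignment length (519 columns, gaps included) as the denominator, not the length of the query or subject protein.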

BLAST of Lag0022021 vs. NCBI nr
Match: XP_022158674.1 (uncharacterized protein LOC111025137 [Momordica charantia])

HSP 1 Score: 619.0 bits (1595), Expect = 6.6e-173
Identity = 342/496 (68.95%), Positives = 393/496 (79.23%), Query Frame = 0

Query: 442 MAKRRGKASKSEDGPLN-SISNKGVKKHIKKKKKPRKTHETSNLKVAHSTQRQEPVSVPT 501
           MAKRRGKASK ++ P N SISNKGVKK IKKKKK +KT+ETSN KV HST +++ VS P 
Sbjct: 1   MAKRRGKASKGKEQPSNSSISNKGVKKIIKKKKKSKKTNETSNSKVVHSTSKRDLVSAP- 60

Query: 502 YASVTASIPLGNDQKKQKEMKGKKDASGFIFMCNGKTKPECYQYRVFGLPKGKIEVVKNI 561
           +ASVT SI LGND++ Q E +GK  +SGFIFMCNG+TKPECY+YRVFGLPKGKI+VVKNI
Sbjct: 61  HASVTPSILLGNDRENQNEKEGKNHSSGFIFMCNGRTKPECYEYRVFGLPKGKIDVVKNI 120

Query: 562 SPDAKLFLFDTDSKLLYGIYEATSKGALDLEPTAFNGQFQAQVKFKITKDCLPLPESAFK 621
           + + KLFLFDTD+KLLYGIY+ATSKGALDLEPTAFNG+FQAQVKFKI K+CLPLPESAF+
Sbjct: 121 NHNTKLFLFDTDAKLLYGIYQATSKGALDLEPTAFNGKFQAQVKFKIFKECLPLPESAFR 180

Query: 622 HAIKDNYEGGRKFRQELSSTQVKSLISLFRPIAKKKPSAKKSHGRPNVAIQPSFRSTRTK 681
            AIKDNYEGG KFRQELSSTQVKSL+SLFRPIA KKPSAKK+H RPNVA+QPSFR TRTK
Sbjct: 181 RAIKDNYEGGPKFRQELSSTQVKSLVSLFRPIA-KKPSAKKAHARPNVAVQPSFRPTRTK 240

Query: 682 KVVKSYPPDNLSSGVHYQPIHETRPQHDVHLGQYDPFEPGLHVSHSQVQPRLVRVEAPPH 741
           +VVKSY P+   SGVHY P+ ETR  H VH  QYDPFEPG+HVSHSQ+QPRLVR++APP 
Sbjct: 241 EVVKSYLPEYSLSGVHYPPMLETRSHHAVHHEQYDPFEPGIHVSHSQMQPRLVRIDAPPR 300

Query: 742 HVEVYHPEQAHKAFFTETSFRHPPDSYESIHSTIETNNGDHRFVYGRQYHAPQLLSNRED 801
           HVE YHPEQ H+A+F   SF HP DSYESI +T+ETN G H F+YGRQY +  L+   +D
Sbjct: 301 HVEPYHPEQTHEAYFHGKSFIHPQDSYESIRNTVETNLGGHPFMYGRQYTSQLLVD--QD 360

Query: 802 VTHIDYTPGYYSQRLSPTTSHTATLSQARSSAYYSQNRLPSPYRPQ-RLPSPYR------ 861
           V  +DYTPGYY +RLSPT +H   LSQA SSA+YSQNRLPSPYRPQ RLPSP+R      
Sbjct: 361 VPRVDYTPGYYRRRLSPTITHIPPLSQAGSSAHYSQNRLPSPYRPQNRLPSPHRPYYSAE 420

Query: 862 --APNRLPSPYRAQNRLPSPHRSYYSAVASQDRGR-VLYAGSPQ-GLASGNLSYGESNLP 921
               NRLP        LPSPH        SQDRGR  +    PQ G  SG +SYG++NL 
Sbjct: 421 ISQMNRLP--------LPSPH-------LSQDRGRGYVDVSLPQGGPGSGKVSYGDANLS 476

Query: 922 ISSYYYSSASARLSYQ 926
           ISS YYSSA+ARLSY+
Sbjct: 481 ISS-YYSSAAARLSYR 476

BLAST of Lag0022021 vs. NCBI nr
Match: XP_022932696.1 (uncharacterized protein LOC111439166 isoform X1 [Cucurbita moschata])

HSP 1 Score: 589.0 bits (1517), Expect = 7.3e-164
Identity = 336/499 (67.33%), Positives = 376/499 (75.35%), Query Frame = 0

Query: 442 MAKRRGKASKSEDGPLNSISNKGVKKHIKKKKKPRKTHETSNLKVAHSTQRQEPVSVPTY 501
           MAK+RGKA K E+   +SISNKGVKKHIKKKK  RK +ETSNLKV+ S+  QEPVS+PT+
Sbjct: 1   MAKKRGKALKVEELLNHSISNKGVKKHIKKKKS-RKANETSNLKVSRSSSSQEPVSIPTH 60

Query: 502 ASVTASIPLGNDQKKQKEMKGKKDASGFIFMCNGKTKPECYQYRVFGLPKGKIEVVKNIS 561
            SVT  IP  ND+KK KE + KK  SGFIFMCNGKTKPECY+YRVFGLPKGKIEVVKNI+
Sbjct: 61  DSVTTLIPPSNDKKKYKETEEKKHVSGFIFMCNGKTKPECYEYRVFGLPKGKIEVVKNIA 120

Query: 562 PDAKLFLFDTDSKLLYGIYEATSKGALDLEPTAFNGQFQAQVKFKITKDCLPLPESAFKH 621
           PDAKLFLFDTD KLLYGIY+AT+KGALDLEP AF+GQFQAQVKFKI KDCLPLPESAF+ 
Sbjct: 121 PDAKLFLFDTDLKLLYGIYQATTKGALDLEPRAFDGQFQAQVKFKIFKDCLPLPESAFRK 180

Query: 622 AIKDNYEGGRKFRQELSSTQVKSLISLFRPIAKKKPSAKKSHGRPNVAIQPSFRSTRTKK 681
           AIKDNY+G RKFRQELS TQVK LISLFRPIAK K S K+SH RPNVA +PSFRSTRT K
Sbjct: 181 AIKDNYDGHRKFRQELSGTQVKHLISLFRPIAKNKTSHKESHVRPNVANRPSFRSTRT-K 240

Query: 682 VVKSYPPDNLSSGVHYQPIHETRPQHDVHLG-------------QYDPFEPGLHVSHSQV 741
           VVK YP +NLSSGVHY P  ETRPQHD +L              QYDPFEPGLH SHSQV
Sbjct: 241 VVKPYPLENLSSGVHYFPDIETRPQHDHYLPDIETRSQHDVRHVQYDPFEPGLHFSHSQV 300

Query: 742 QPRLVRVEAPPHHVEVYHPEQAHKAFFTETSFRHPPDSYESIHSTIET-NNGDHRFVYGR 801
           QPRLVRVEAPP HVE YHPE AH+A+F E S R+P +SYESI + IET +NGD R VYGR
Sbjct: 301 QPRLVRVEAPPRHVEAYHPEHAHEAYFRENSLRYPINSYESIRNPIETYDNGDLRDVYGR 360

Query: 802 QYHAPQLLSNREDVTHIDYTPGYYSQRLSPTTSHTATLSQARSSAYYSQNRLPSPYRPQR 861
           +Y  P L S+RED   ID+ P YYSQ LSPT SHTA LSQA   AY+SQ           
Sbjct: 361 RYRTPHLQSDREDDARIDFIPRYYSQWLSPTASHTAPLSQA---AYHSQ----------- 420

Query: 862 LPSPYRAPNRLPSPYRAQNRLPSPHRSYYSAVASQDRGRVLYAGSPQ-GLASGNLSYGES 921
                   NRLPSPYR+Q+R P           SQD GR  YAG PQ G ASG+LSYGE+
Sbjct: 421 --------NRLPSPYRSQHRFP-----------SQDGGRD-YAGLPQGGPASGSLSYGEA 462

Query: 922 NLPISSYYYSSASARLSYQ 926
           NLP  SYY +S++ RLS++
Sbjct: 481 NLPF-SYYDNSSATRLSFR 462
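
The percentages in an HSP header are the identical (or positively scoring) column counts divided by the alignment length. The following is a minimal sketch, not part of the original record, that parses the header values of the XP_022932696.1 hit above and recomputes both percentages. The function name `parse_hsp_header` and the regular expression are illustrative assumptions, not any BLAST tool's API.

```python
import re

# Header values from the XP_022932696.1 hit above.
HSP_HEADER = """\
HSP 1 Score: 589.0 bits (1517), Expect = 7.3e-164
Identity = 336/499 (67.33%), Positives = 376/499 (75.35%), Query Frame = 0
"""

# The optional "i" lets the pattern match the label whether it is
# spelled "Positives" or "Postives".
PATTERN = re.compile(
    r"Score:\s*(?P<bits>[\d.]+) bits \((?P<raw>\d+)\), Expect = (?P<evalue>\S+)\s+"
    r"Identity = (?P<ident>\d+)/(?P<length>\d+) \((?P<ident_pct>[\d.]+)%\), "
    r"Posi?tives = (?P<pos>\d+)/(?P<pos_len>\d+) \((?P<pos_pct>[\d.]+)%\)"
)


def parse_hsp_header(text):
    """Parse one HSP header and recompute its percentages (hypothetical helper)."""
    m = PATTERN.search(text)
    if m is None:
        raise ValueError("no HSP header found")
    ident, length, pos = int(m["ident"]), int(m["length"]), int(m["pos"])
    return {
        "bits": float(m["bits"]),
        "evalue": float(m["evalue"]),
        # Recomputed from the counts: matches / alignment length.
        "identity_pct": round(100 * ident / length, 2),
        "positives_pct": round(100 * pos / length, 2),
        # Values as printed in the header, for comparison.
        "reported_identity_pct": float(m["ident_pct"]),
        "reported_positives_pct": float(m["pos_pct"]),
    }


if __name__ == "__main__":
    print(parse_hsp_header(HSP_HEADER))
```

The alignment length (499) counts gap columns, which is why it can exceed the number of query residues covered.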

BLAST of Lag0022021 vs. NCBI nr
Match: XP_023539818.1 (uncharacterized protein LOC111800385 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 589.0 bits (1517), Expect = 7.3e-164
Identity = 337/499 (67.54%), Positives = 375/499 (75.15%), Query Frame = 0

Query: 442 MAKRRGKASKSEDGPLNSISNKGVKKHIKKKKKPRKTHETSNLKVAHSTQRQEPVSVPTY 501
           MAK+RGKA K E+   +SISNKGVKKHIKKKK  RK +ET NLKV+ S+  QEPVS+PT+
Sbjct: 1   MAKKRGKALKVEELLNHSISNKGVKKHIKKKKS-RKANETLNLKVSRSSSSQEPVSIPTH 60

Query: 502 ASVTASIPLGNDQKKQKEMKGKKDASGFIFMCNGKTKPECYQYRVFGLPKGKIEVVKNIS 561
            SVT  IP  NDQKK KE + KK  SGFIFMCNGKTKPECY+YRVFGLPKGKIEVVKNI+
Sbjct: 61  DSVTTLIPPSNDQKKYKETEEKKHVSGFIFMCNGKTKPECYEYRVFGLPKGKIEVVKNIA 120

Query: 562 PDAKLFLFDTDSKLLYGIYEATSKGALDLEPTAFNGQFQAQVKFKITKDCLPLPESAFKH 621
           PDAKLFLFDTD KLLYGIY+AT+KGALDLEP AF+GQFQAQVKFKI KDCLPLPESAFK 
Sbjct: 121 PDAKLFLFDTDLKLLYGIYQATTKGALDLEPRAFDGQFQAQVKFKIFKDCLPLPESAFKK 180

Query: 622 AIKDNYEGGRKFRQELSSTQVKSLISLFRPIAKKKPSAKKSHGRPNVAIQPSFRSTRTKK 681
           AIKDNY G RKFRQELS TQVK LISLFRPIAK K S K+SH RPNVA +PSFRSTRT K
Sbjct: 181 AIKDNYSGHRKFRQELSGTQVKHLISLFRPIAKNKTSHKESHVRPNVANRPSFRSTRT-K 240

Query: 682 VVKSYPPDNLSSGVHYQPIHETRPQHDVHLG-------------QYDPFEPGLHVSHSQV 741
           VVK YP +NLSSGVHY P  ETRPQHD +L              QYDPFEPGLH SHSQV
Sbjct: 241 VVKPYPLENLSSGVHYFPDIETRPQHDHYLPDIETRSQHDVRHVQYDPFEPGLHFSHSQV 300

Query: 742 QPRLVRVEAPPHHVEVYHPEQAHKAFFTETSFRHPPDSYESIHSTIET-NNGDHRFVYGR 801
           QPRLVRVEAPP HVE YHPE AH+A+F E S R+P +SYESI + IET +NGD R VYGR
Sbjct: 301 QPRLVRVEAPPRHVEAYHPEHAHEAYFRENSLRYPLNSYESIRNPIETYDNGDLRDVYGR 360

Query: 802 QYHAPQLLSNREDVTHIDYTPGYYSQRLSPTTSHTATLSQARSSAYYSQNRLPSPYRPQR 861
           +YH P L S+RED   ID+ P YYSQ LSPT SHTA LSQA  +AY+SQ           
Sbjct: 361 RYHTPHLQSDREDDARIDFIPRYYSQWLSPTASHTAPLSQA--AAYHSQ----------- 420

Query: 862 LPSPYRAPNRLPSPYRAQNRLPSPHRSYYSAVASQDRGRVLYAGSPQ-GLASGNLSYGES 921
                   NRLPSPYR+Q+R P           SQD GR  YAG PQ G ASG+LSYGE+
Sbjct: 421 --------NRLPSPYRSQHRFP-----------SQDGGRD-YAGLPQGGPASGSLSYGEA 463

Query: 922 NLPISSYYYSSASARLSYQ 926
           NL   SYY +S++ RLS++
Sbjct: 481 NLSF-SYYDNSSATRLSFR 463

BLAST of Lag0022021 vs. NCBI nr
Match: KAG6597699.1 (hypothetical protein SDJN03_10879, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 587.8 bits (1514), Expect = 1.6e-163
Identity = 336/499 (67.33%), Positives = 375/499 (75.15%), Query Frame = 0

Query: 442 MAKRRGKASKSEDGPLNSISNKGVKKHIKKKKKPRKTHETSNLKVAHSTQRQEPVSVPTY 501
           MAK+RGKA K E+   +SISNKGVKKHIKKKK  RK +ETSNLKV+ S+  QEPVS+PT+
Sbjct: 1   MAKKRGKALKVEELLNHSISNKGVKKHIKKKKS-RKANETSNLKVSRSSSSQEPVSIPTH 60

Query: 502 ASVTASIPLGNDQKKQKEMKGKKDASGFIFMCNGKTKPECYQYRVFGLPKGKIEVVKNIS 561
            SVT  I   NDQKK KE + KK  SGFIFMCNGKTKPECY+YRVFGLPKGKIEVVKNI+
Sbjct: 61  DSVTTLILPSNDQKKYKETEEKKHVSGFIFMCNGKTKPECYEYRVFGLPKGKIEVVKNIA 120

Query: 562 PDAKLFLFDTDSKLLYGIYEATSKGALDLEPTAFNGQFQAQVKFKITKDCLPLPESAFKH 621
           PDAKLFLFDTD KLLYGIY+AT+KGALDLEP AF+GQFQAQVKFKI KDCLPLPESAFK 
Sbjct: 121 PDAKLFLFDTDLKLLYGIYQATTKGALDLEPRAFDGQFQAQVKFKIFKDCLPLPESAFKK 180

Query: 622 AIKDNYEGGRKFRQELSSTQVKSLISLFRPIAKKKPSAKKSHGRPNVAIQPSFRSTRTKK 681
           AIKDNY+G RKFRQELS TQVK LISLFRPIAK K S K+SH  PNVA +PSFRSTRT K
Sbjct: 181 AIKDNYDGHRKFRQELSGTQVKHLISLFRPIAKNKTSHKESHVGPNVANRPSFRSTRT-K 240

Query: 682 VVKSYPPDNLSSGVHYQPIHETRPQHDVHLG-------------QYDPFEPGLHVSHSQV 741
           V+K YP +NLSSGVHY P  ETRPQHD +L              QYDPFEPGLH SHSQV
Sbjct: 241 VIKPYPLENLSSGVHYFPDIETRPQHDHYLPDIETRSQHDVRHVQYDPFEPGLHFSHSQV 300

Query: 742 QPRLVRVEAPPHHVEVYHPEQAHKAFFTETSFRHPPDSYESIHSTIET-NNGDHRFVYGR 801
           QPRLVRVEAPP HVE YHPE AH+A+F E S R+P +SYESI + IET +NGD R VYGR
Sbjct: 301 QPRLVRVEAPPRHVEAYHPEHAHEAYFRENSLRYPLNSYESIRNPIETYDNGDLRDVYGR 360

Query: 802 QYHAPQLLSNREDVTHIDYTPGYYSQRLSPTTSHTATLSQARSSAYYSQNRLPSPYRPQR 861
           +YH P L S+RED   ID+ P YYSQ LSPT SHTA LSQA   AY+SQ           
Sbjct: 361 RYHTPHLQSDREDDARIDFIPRYYSQWLSPTASHTAPLSQA---AYHSQ----------- 420

Query: 862 LPSPYRAPNRLPSPYRAQNRLPSPHRSYYSAVASQDRGRVLYAGSPQ-GLASGNLSYGES 921
                   NRLPSPYR+Q+R P           SQD GR  YAG PQ G ASG+LSYGE+
Sbjct: 421 --------NRLPSPYRSQHRFP-----------SQDGGRD-YAGLPQGGPASGSLSYGEA 462

Query: 922 NLPISSYYYSSASARLSYQ 926
           NLP  SYY +S++ RLS++
Sbjct: 481 NLPF-SYYDNSSATRLSFR 462

BLAST of Lag0022021 vs. ExPASy Swiss-Prot
Match: P08770 (Putative AC transposase OS=Zea mays OX=4577 PE=2 SV=2)

HSP 1 Score: 90.1 bits (222), Expect = 1.4e-16
Identity = 77/265 (29.06%), Positives = 126/265 (47.55%), Query Frame = 0

Query: 35  KKRKNTKISVVWGHFKKIK---DCDPNDH---YAKCKY--CGAKYACHSKRNGTDNLKHH 94
           +KR     S VW HF K +   + D   +   +  C +  C AKY      +GT   ++H
Sbjct: 133 QKRAKKCTSDVWQHFTKKEIEVEVDGKKYVQVWGHCNFPNCKAKYRAEG-HHGTSGFRNH 192

Query: 95  LENCKKYL-----YQRKKDPGQKQLVFKPKEAIDDSKPKLSCETFSLESCRRALAELESC 154
           L      +      + +KD G+          I+  +P    E  SL+    A+   E  
Sbjct: 193 LRTSHSLVKGQLCLKSEKDHGKD---------INLIEPYKYDEVVSLKKLHLAIIMHE-- 252

Query: 155 ETFRFVENIGFRRFINKSIALFAPNFVLPSRLTIARDVLKIYVSERKRLKDMFKTKRYRV 214
             F  VE+  F  F+        P+F + SR+T  + ++ +Y+ E+++L    K  + R 
Sbjct: 253 YPFNIVEHEYFVEFVKS----LRPHFPIKSRVTARKYIMDLYLEEKEKLYGKLKDVQSRF 312

Query: 215 SLTTDCWTSGKNINYMVLTTHFIDSGWNLHKMILSFSTIE-NHRGETIGKTIEKNLKNWG 274
           S T D WTS +N +YM +T H+ID  W L K I+ F  +E  H G+ + +T    +  W 
Sbjct: 313 STTMDMWTSCQNKSYMCVTIHWIDDDWCLQKRIVGFFHVEGRHTGQRLSQTFTAIMVKWN 372

Query: 275 ID-KVMTLTVDNASSNDTAVVYLIK 285
           I+ K+  L++DNAS+N+ AV  +I+
Sbjct: 373 IEKKLFALSLDNASANEVAVHDIIE 381


HSP 2 Score: 74.3 bits (181), Expect = 7.9e-12
Identity = 42/94 (44.68%), Positives = 56/94 (59.57%), Query Frame = 0

Query: 316 DDESETY-GFARSFSSVETTTFDRESEIDVYLLETLAKDDNTFDILNWWKHNSHRFEIIN 375
           DDE + Y    + +  VE+      +E+D Y+ E L K    FDIL+WW+     + I+ 
Sbjct: 649 DDEFQNYLHELKDYDQVES------NELDKYMSEPLLKHSGQFDILSWWRGRVAEYPILT 708

Query: 376 AISRDILEIPVSTVASESAFRTGGRVVDSSRSLL 409
            I+RD+L I VSTVASESAF  GGRVVD  R+ L
Sbjct: 709 QIARDVLAIQVSTVASESAFSAGGRVVDPYRNRL 736

BLAST of Lag0022021 vs. ExPASy Swiss-Prot
Match: P03010 (Putative AC9 transposase OS=Zea mays OX=4577 PE=4 SV=1)

HSP 1 Score: 90.1 bits (222), Expect = 1.4e-16
Identity = 77/265 (29.06%), Positives = 126/265 (47.55%), Query Frame = 0

Query: 35  KKRKNTKISVVWGHFKKIK---DCDPNDH---YAKCKY--CGAKYACHSKRNGTDNLKHH 94
           +KR     S VW HF K +   + D   +   +  C +  C AKY      +GT   ++H
Sbjct: 78  QKRAKKCTSDVWQHFTKKEIEVEVDGKKYVQVWGHCNFPNCKAKYRAEG-HHGTSGFRNH 137

Query: 95  LENCKKYL-----YQRKKDPGQKQLVFKPKEAIDDSKPKLSCETFSLESCRRALAELESC 154
           L      +      + +KD G+          I+  +P    E  SL+    A+   E  
Sbjct: 138 LRTSHSLVKGQLCLKSEKDHGKD---------INLIEPYKYDEVVSLKKLHLAIIMHE-- 197

Query: 155 ETFRFVENIGFRRFINKSIALFAPNFVLPSRLTIARDVLKIYVSERKRLKDMFKTKRYRV 214
             F  VE+  F  F+        P+F + SR+T  + ++ +Y+ E+++L    K  + R 
Sbjct: 198 YPFNIVEHEYFVEFVKS----LRPHFPIKSRVTARKYIMDLYLEEKEKLYGKLKDVQSRF 257

Query: 215 SLTTDCWTSGKNINYMVLTTHFIDSGWNLHKMILSFSTIE-NHRGETIGKTIEKNLKNWG 274
           S T D WTS +N +YM +T H+ID  W L K I+ F  +E  H G+ + +T    +  W 
Sbjct: 258 STTMDMWTSCQNKSYMCVTIHWIDDDWCLQKRIVGFFHVEGRHTGQRLSQTFTAIMVKWN 317

Query: 275 ID-KVMTLTVDNASSNDTAVVYLIK 285
           I+ K+  L++DNAS+N+ AV  +I+
Sbjct: 318 IEKKLFALSLDNASANEVAVHDIIE 326


HSP 2 Score: 74.3 bits (181), Expect = 7.9e-12
Identity = 42/94 (44.68%), Positives = 56/94 (59.57%), Query Frame = 0

Query: 316 DDESETY-GFARSFSSVETTTFDRESEIDVYLLETLAKDDNTFDILNWWKHNSHRFEIIN 375
           DDE + Y    + +  VE+      +E+D Y+ E L K    FDIL+WW+     + I+ 
Sbjct: 618 DDEFQNYLHELKDYDQVES------NELDKYMSEPLLKHSGQFDILSWWRGRVAEYPILT 677

Query: 376 AISRDILEIPVSTVASESAFRTGGRVVDSSRSLL 409
            I+RD+L I VSTVASESAF  GGRVVD  R+ L
Sbjct: 678 QIARDVLAIQVSTVASESAFSAGGRVVDPYRNRL 705

BLAST of Lag0022021 vs. ExPASy Swiss-Prot
Match: P37707 (B2 protein OS=Daucus carota OX=4039 PE=2 SV=1)

HSP 1 Score: 84.7 bits (208), Expect = 5.9e-15
Identity = 45/129 (34.88%), Positives = 74/129 (57.36%), Query Frame = 0

Query: 528 GFIFMCNGKTKPECYQYRVFGLPKGKIEVVKNISPDAKLFLFDTDSKLLYGIYEATSKGA 587
           G+IF+CN  T  E  + ++FGLP    + V+ I+P   LFL++  +  L+G++EA S G 
Sbjct: 76  GYIFVCNNDTMQENLKRQLFGLPPRYRDSVRAITPGLPLFLYNYSTHQLHGVFEAASFGG 135

Query: 588 LDLEPTAF-------NGQFQAQVKFKITKDCLPLPESAFKHAIKDNYEGGRKFRQELSST 647
            +++PTA+         +F AQV+    K C PL E +F+  +  ++  G KFR EL+  
Sbjct: 136 TNIDPTAWEDKKNQGESRFPAQVRVMTRKICEPLEEDSFRPIL--HHYDGPKFRLELNIP 195

Query: 648 QVKSLISLF 650
           +  SL+ +F
Sbjct: 196 EAISLLDIF 202

BLAST of Lag0022021 vs. ExPASy Swiss-Prot
Match: Q75HY5 (Zinc finger BED domain-containing protein RICESLEEPER 3 OS=Oryza sativa subsp. japonica OX=39947 GN=Os05g0583200 PE=2 SV=1)

HSP 1 Score: 84.0 bits (206), Expect = 1.0e-14
Identity = 85/357 (23.81%), Positives = 149/357 (41.74%), Query Frame = 0

Query: 28  VNHSSPTKKRKNTKISVVWGHFKKIKDCDPNDHYAKCKYCGA--KYACHSKRNGTDNLKH 87
           V  ++PT  R+  K SVVW HF  I++       A C  C     Y+C SK +GT +LK 
Sbjct: 94  VELTTPTASRRRRKKSVVWEHF-TIEEMPGGVSRASCNLCKQTFAYSCGSKISGTSHLKR 153

Query: 88  HLENCKKYLYQRKKDPGQKQLVFKPKEAIDDSKPKLSCETFSLESCRRALAELE-SCET- 147
           H+      + + +       L        +    +++   +       A+ + + +C   
Sbjct: 154 HITLASCPMLKNEDMKLSLPLATVTNNDGEGCAERVAKRHYRSTGYANAMFDQDRTCSNL 213

Query: 148 ----------FRFVENIGFRRFINKSIALFAPNFVLPSRLTIARDVLKIYVSERKRLKDM 207
                        VE  GF  FI        P F +    TI   V  +Y  ER+ L  +
Sbjct: 214 AKMIILHDYPLHIVEQRGFTAFIGS----LQPRFRVIDVDTIEGQVHSVYQKERENLMHV 273

Query: 208 FKTKRYRVSLTTDCWTSGKNINYMVLTTHFIDSGWNLHKMILSFSTIEN-HRGETIGKTI 267
           F T   R+SLT   W + + + Y+ L   FID+ W +H+ +++F  + + H   ++ + I
Sbjct: 274 FSTVPGRISLTVRLWATSQTLGYISLAAQFIDTEWRVHRRMVNFMMVSSPHSENSLSEAI 333

Query: 268 EKNLKNWGI-DKVMTLTVDN-ASSNDTAVVYLIKRFSS---GLLLKVHLLLVVSVITKEN 327
             +L +W + DK+ T+T+DN  SS+D     +I   S+    +++K  L +V       N
Sbjct: 334 STSLSDWNMKDKLFTITLDNDPSSHDIYSANMINYLSNKKDNIMIKGQLFVVRCYAHILN 393

Query: 328 GGTQTLDVDDESETYGFARSFSSVETTTFDRESEIDVYL-LETLAKDDNTFDILNWW 364
              Q +     S  Y    S   ++ ++   +   ++ L LE  +      D+   W
Sbjct: 394 TVAQDVIASVHSVIYHIRESIKFIKASSVHEDKFAEIALQLEIPSAKTLCLDVTTQW 445


HSP 2 Score: 58.2 bits (139), Expect = 5.9e-07
Identity = 31/76 (40.79%), Positives = 47/76 (61.84%), Query Frame = 0

Query: 340 SEIDVYLLETLAKDDNTFDILNWWKHNSHRFEIINAISRDILEIPVSTVASE----SAFR 399
           SE++ YL E L      F+IL WWK N+ +F  ++ ++RD+L IP+S V+S     SA  
Sbjct: 646 SELEQYLEEALMPRIQDFEILEWWKLNTIKFPTLSKMARDVLAIPMSMVSSGSSIFSATA 705

Query: 400 TGGRVVDSSRSLLAPK 412
           TG +++D  RS L P+
Sbjct: 706 TGSQMLDDYRSSLRPE 721

BLAST of Lag0022021 vs. ExPASy Swiss-Prot
Match: Q5JZR1 (DCD domain-containing protein NRP-A OS=Glycine max OX=3847 GN=NRP-A PE=2 SV=1)

HSP 1 Score: 81.6 bits (200), Expect = 5.0e-14
Identity = 71/257 (27.63%), Positives = 113/257 (43.97%), Query Frame = 0

Query: 412 NTNRFISSLPVSHCPNRRQRSFERLRFQ---SSMAKRRGKASKSEDGPLNSISNKGVKKH 471
           NTN  ISS   ++  N     F +  +    SS           ++  LN  + KG K +
Sbjct: 106 NTNTTISSYHPNNLNNNAFGGFNKGIYSNTTSSPYLNNNHHHLDDNNNLNRNNLKGYKTY 165

Query: 472 I---------KKKKKPRKTHETSNLKVAHSTQRQEPVSVPTYASVTASIPLGNDQKKQKE 531
                     K  KK   T+ T+N K   +T   +              P        + 
Sbjct: 166 FKGEDQFHTPKSAKKKNTTNNTNNKKHGDNTNNNDGTKTGAEKKFKTLPP-------SES 225

Query: 532 MKGKKDASGFIFMCNGKTKPECYQYRVFGLPKGKIEVVKNISPDAKLFLFDTDSKLLYGI 591
           +   +   G+IF+CN  T  E  Q ++FGLP    + V+ I+P   +FL++  +  L+GI
Sbjct: 226 LPKNETIGGYIFVCNNDTMAENLQRQLFGLPPRYRDSVRTITPGLPIFLYNYSTHQLHGI 285

Query: 592 YEATSKGALDLEPTAF-------NGQFQAQVKFKITKDCLPLPESAFKHAIKDNYEGGRK 650
           +EA S G  +++PTA+         +F AQV+    K C PL E +F+  +  ++  G K
Sbjct: 286 FEAASFGGSNIDPTAWEDKKCPGESRFPAQVQVITRKVCEPLEEDSFRPIL--HHYDGPK 345

BLAST of Lag0022021 vs. ExPASy TrEMBL
Match: A0A6J1E036 (uncharacterized protein LOC111025137 OS=Momordica charantia OX=3673 GN=LOC111025137 PE=4 SV=1)

HSP 1 Score: 619.0 bits (1595), Expect = 3.2e-173
Identity = 342/496 (68.95%), Positives = 393/496 (79.23%), Query Frame = 0

Query: 442 MAKRRGKASKSEDGPLN-SISNKGVKKHIKKKKKPRKTHETSNLKVAHSTQRQEPVSVPT 501
           MAKRRGKASK ++ P N SISNKGVKK IKKKKK +KT+ETSN KV HST +++ VS P 
Sbjct: 1   MAKRRGKASKGKEQPSNSSISNKGVKKIIKKKKKSKKTNETSNSKVVHSTSKRDLVSAP- 60

Query: 502 YASVTASIPLGNDQKKQKEMKGKKDASGFIFMCNGKTKPECYQYRVFGLPKGKIEVVKNI 561
           +ASVT SI LGND++ Q E +GK  +SGFIFMCNG+TKPECY+YRVFGLPKGKI+VVKNI
Sbjct: 61  HASVTPSILLGNDRENQNEKEGKNHSSGFIFMCNGRTKPECYEYRVFGLPKGKIDVVKNI 120

Query: 562 SPDAKLFLFDTDSKLLYGIYEATSKGALDLEPTAFNGQFQAQVKFKITKDCLPLPESAFK 621
           + + KLFLFDTD+KLLYGIY+ATSKGALDLEPTAFNG+FQAQVKFKI K+CLPLPESAF+
Sbjct: 121 NHNTKLFLFDTDAKLLYGIYQATSKGALDLEPTAFNGKFQAQVKFKIFKECLPLPESAFR 180

Query: 622 HAIKDNYEGGRKFRQELSSTQVKSLISLFRPIAKKKPSAKKSHGRPNVAIQPSFRSTRTK 681
            AIKDNYEGG KFRQELSSTQVKSL+SLFRPIA KKPSAKK+H RPNVA+QPSFR TRTK
Sbjct: 181 RAIKDNYEGGPKFRQELSSTQVKSLVSLFRPIA-KKPSAKKAHARPNVAVQPSFRPTRTK 240

Query: 682 KVVKSYPPDNLSSGVHYQPIHETRPQHDVHLGQYDPFEPGLHVSHSQVQPRLVRVEAPPH 741
           +VVKSY P+   SGVHY P+ ETR  H VH  QYDPFEPG+HVSHSQ+QPRLVR++APP 
Sbjct: 241 EVVKSYLPEYSLSGVHYPPMLETRSHHAVHHEQYDPFEPGIHVSHSQMQPRLVRIDAPPR 300

Query: 742 HVEVYHPEQAHKAFFTETSFRHPPDSYESIHSTIETNNGDHRFVYGRQYHAPQLLSNRED 801
           HVE YHPEQ H+A+F   SF HP DSYESI +T+ETN G H F+YGRQY +  L+   +D
Sbjct: 301 HVEPYHPEQTHEAYFHGKSFIHPQDSYESIRNTVETNLGGHPFMYGRQYTSQLLVD--QD 360

Query: 802 VTHIDYTPGYYSQRLSPTTSHTATLSQARSSAYYSQNRLPSPYRPQ-RLPSPYR------ 861
           V  +DYTPGYY +RLSPT +H   LSQA SSA+YSQNRLPSPYRPQ RLPSP+R      
Sbjct: 361 VPRVDYTPGYYRRRLSPTITHIPPLSQAGSSAHYSQNRLPSPYRPQNRLPSPHRPYYSAE 420

Query: 862 --APNRLPSPYRAQNRLPSPHRSYYSAVASQDRGR-VLYAGSPQ-GLASGNLSYGESNLP 921
               NRLP        LPSPH        SQDRGR  +    PQ G  SG +SYG++NL 
Sbjct: 421 ISQMNRLP--------LPSPH-------LSQDRGRGYVDVSLPQGGPGSGKVSYGDANLS 476

Query: 922 ISSYYYSSASARLSYQ 926
           ISS YYSSA+ARLSY+
Sbjct: 481 ISS-YYSSAAARLSYR 476

BLAST of Lag0022021 vs. ExPASy TrEMBL
Match: A0A6J1EX39 (uncharacterized protein LOC111439166 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111439166 PE=4 SV=1)

HSP 1 Score: 589.0 bits (1517), Expect = 3.5e-164
Identity = 336/499 (67.33%), Positives = 376/499 (75.35%), Query Frame = 0

Query: 442 MAKRRGKASKSEDGPLNSISNKGVKKHIKKKKKPRKTHETSNLKVAHSTQRQEPVSVPTY 501
           MAK+RGKA K E+   +SISNKGVKKHIKKKK  RK +ETSNLKV+ S+  QEPVS+PT+
Sbjct: 1   MAKKRGKALKVEELLNHSISNKGVKKHIKKKKS-RKANETSNLKVSRSSSSQEPVSIPTH 60

Query: 502 ASVTASIPLGNDQKKQKEMKGKKDASGFIFMCNGKTKPECYQYRVFGLPKGKIEVVKNIS 561
            SVT  IP  ND+KK KE + KK  SGFIFMCNGKTKPECY+YRVFGLPKGKIEVVKNI+
Sbjct: 61  DSVTTLIPPSNDKKKYKETEEKKHVSGFIFMCNGKTKPECYEYRVFGLPKGKIEVVKNIA 120

Query: 562 PDAKLFLFDTDSKLLYGIYEATSKGALDLEPTAFNGQFQAQVKFKITKDCLPLPESAFKH 621
           PDAKLFLFDTD KLLYGIY+AT+KGALDLEP AF+GQFQAQVKFKI KDCLPLPESAF+ 
Sbjct: 121 PDAKLFLFDTDLKLLYGIYQATTKGALDLEPRAFDGQFQAQVKFKIFKDCLPLPESAFRK 180

Query: 622 AIKDNYEGGRKFRQELSSTQVKSLISLFRPIAKKKPSAKKSHGRPNVAIQPSFRSTRTKK 681
           AIKDNY+G RKFRQELS TQVK LISLFRPIAK K S K+SH RPNVA +PSFRSTRT K
Sbjct: 181 AIKDNYDGHRKFRQELSGTQVKHLISLFRPIAKNKTSHKESHVRPNVANRPSFRSTRT-K 240

Query: 682 VVKSYPPDNLSSGVHYQPIHETRPQHDVHLG-------------QYDPFEPGLHVSHSQV 741
           VVK YP +NLSSGVHY P  ETRPQHD +L              QYDPFEPGLH SHSQV
Sbjct: 241 VVKPYPLENLSSGVHYFPDIETRPQHDHYLPDIETRSQHDVRHVQYDPFEPGLHFSHSQV 300

Query: 742 QPRLVRVEAPPHHVEVYHPEQAHKAFFTETSFRHPPDSYESIHSTIET-NNGDHRFVYGR 801
           QPRLVRVEAPP HVE YHPE AH+A+F E S R+P +SYESI + IET +NGD R VYGR
Sbjct: 301 QPRLVRVEAPPRHVEAYHPEHAHEAYFRENSLRYPINSYESIRNPIETYDNGDLRDVYGR 360

Query: 802 QYHAPQLLSNREDVTHIDYTPGYYSQRLSPTTSHTATLSQARSSAYYSQNRLPSPYRPQR 861
           +Y  P L S+RED   ID+ P YYSQ LSPT SHTA LSQA   AY+SQ           
Sbjct: 361 RYRTPHLQSDREDDARIDFIPRYYSQWLSPTASHTAPLSQA---AYHSQ----------- 420

Query: 862 LPSPYRAPNRLPSPYRAQNRLPSPHRSYYSAVASQDRGRVLYAGSPQ-GLASGNLSYGES 921
                   NRLPSPYR+Q+R P           SQD GR  YAG PQ G ASG+LSYGE+
Sbjct: 421 --------NRLPSPYRSQHRFP-----------SQDGGRD-YAGLPQGGPASGSLSYGEA 462

Query: 922 NLPISSYYYSSASARLSYQ 926
           NLP  SYY +S++ RLS++
Sbjct: 481 NLPF-SYYDNSSATRLSFR 462

BLAST of Lag0022021 vs. ExPASy TrEMBL
Match: A0A5A7TQV5 (DCD domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold46G002350 PE=4 SV=1)

HSP 1 Score: 550.4 bits (1417), Expect = 1.4e-152
Identity = 316/502 (62.95%), Positives = 360/502 (71.71%), Query Frame = 0

Query: 439 QSSMAKRRGKASKSEDGPLNSISNKGVKKHIKKKKKPRKTHETSNLKVAHSTQRQEPVSV 498
           +SSM KRRGKA K ++  LN  S KG  K I  KKK RK    S LKVAH+  R++ V+ 
Sbjct: 74  ESSMTKRRGKALKGKE-RLNH-SKKGANKLI-NKKKARKMSRASTLKVAHTASREDLVNG 133

Query: 499 PTYASVTASIPLGNDQKKQKEMKGKKDASGFIFMCNGKTKPECYQYRVFGLPKGKIEVVK 558
           PT ASV+ SIP  +D++K ++++G +  SGFIFMCNGKTKPECYQYRVFGLPKGKIEVV+
Sbjct: 134 PTCASVSTSIPPVDDREKHEKVEGNEGTSGFIFMCNGKTKPECYQYRVFGLPKGKIEVVE 193

Query: 559 NISPDAKLFLFDTDSKLLYGIYEATSKGALDLEPTAFNGQFQAQVKFKITKDCLPLPESA 618
           NI+PD KLFLFDTD KLLYGIY+ATS GALDLEPTAFNGQFQAQVKFKI KDCLPL ESA
Sbjct: 194 NINPDTKLFLFDTDLKLLYGIYQATSNGALDLEPTAFNGQFQAQVKFKIFKDCLPLHESA 253

Query: 619 FKHAIKDNYEGGRKFRQELSSTQVKSLISLFRPIAKKKPSAKKSHGRPNVAIQPSFRSTR 678
           FKHAIKDNYEG RKF+QEL+STQVKSLISLFRPI KK   AK+S+ RPNV IQ SF+S R
Sbjct: 254 FKHAIKDNYEGHRKFKQELNSTQVKSLISLFRPIPKKS-FAKRSYIRPNVGIQSSFKSAR 313

Query: 679 TKKVVKSYPPDNLSSGVHYQPIHETRPQ-----HDVHLGQYDPFEPGLHV--SHSQVQPR 738
           +K+V KSYP +    GVHY PI ET PQ     HDVHLG+Y+PFEPGLHV  SHSQ+QPR
Sbjct: 314 SKEVAKSYPLEKPPFGVHYLPILETGPQHDVRGHDVHLGKYNPFEPGLHVSQSHSQLQPR 373

Query: 739 LVRVEAPPHHV--------EVYHPEQAHKAFFTETSFRHPPDSYESIHSTIETNNGDHRF 798
           L+R EAP  HV        E YHPEQAH+A+F E SFRHPP+SY SI +TIETNN DH F
Sbjct: 374 LLRTEAPTRHVEPYHPRSIEPYHPEQAHEAYFPEGSFRHPPESYASIRNTIETNNADHPF 433

Query: 799 VYGRQYHAPQLLSNREDVTHIDYTPGYYSQRLSPTTSHTATLSQARSSAYYSQNRLPSPY 858
           VYG QYH  Q + +R D    DY PGYYSQR S TT HTA LSQ++S A           
Sbjct: 434 VYGHQYHISQFMLDR-DAARTDYIPGYYSQRPSSTTLHTAPLSQSQSDA----------- 493

Query: 859 RPQRLPSPYRAPNRLPSPYRAQNRLPSPHRSYYSAVASQDRGRVLYAGSPQGLASGNLSY 918
                          PSPYR Q+RL SPH S Y  V    RG +LYA  PQ  A  +L Y
Sbjct: 494 ---------------PSPYRLQDRLTSPHLSCYPTV----RGGLLYASVPQTTAYSDLRY 538

Query: 919 GESNLPISSYYYSSASARLSYQ 926
           GESNLP  SYYYSSA+ RLSY+
Sbjct: 554 GESNLP--SYYYSSAATRLSYR 538

BLAST of Lag0022021 vs. ExPASy TrEMBL
Match: A0A1S4E001 (uncharacterized protein LOC103494640 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103494640 PE=4 SV=1)

HSP 1 Score: 548.9 bits (1413), Expect = 4.1e-152
Identity = 312/502 (62.15%), Positives = 359/502 (71.51%), Query Frame = 0

Query: 439 QSSMAKRRGKASKSEDGPLNSISNKGVKKHIKKKKKPRKTHETSNLKVAHSTQRQEPVSV 498
           +SSM KRRGKA K ++  LN   +K    ++  KKK RK    S LKVAH+  R++ V+ 
Sbjct: 9   KSSMTKRRGKALKGKE-RLN--HSKNGANNLINKKKARKMSRASTLKVAHTASREDLVNG 68

Query: 499 PTYASVTASIPLGNDQKKQKEMKGKKDASGFIFMCNGKTKPECYQYRVFGLPKGKIEVVK 558
           PT ASV+ SIP  +D++K ++++G +  SGFIFMCNGKTKPECYQYRVFGLPKGKIEVV+
Sbjct: 69  PTCASVSTSIPPVDDREKHEKVEGNEGTSGFIFMCNGKTKPECYQYRVFGLPKGKIEVVE 128

Query: 559 NISPDAKLFLFDTDSKLLYGIYEATSKGALDLEPTAFNGQFQAQVKFKITKDCLPLPESA 618
           NI+PD KLFLFDTD KLLYGIY+AT  GALDLEPTAFNGQFQAQVKFKI KDCLPL ESA
Sbjct: 129 NINPDTKLFLFDTDLKLLYGIYQATDNGALDLEPTAFNGQFQAQVKFKIFKDCLPLHESA 188

Query: 619 FKHAIKDNYEGGRKFRQELSSTQVKSLISLFRPIAKKKPSAKKSHGRPNVAIQPSFRSTR 678
           FKHAIKDNYEG RKF+QEL+STQVKSLISLFRPI KK   AK+S+ RPNV IQ SF+S R
Sbjct: 189 FKHAIKDNYEGHRKFKQELNSTQVKSLISLFRPIPKKS-FAKRSYIRPNVGIQSSFKSAR 248

Query: 679 TKKVVKSYPPDNLSSGVHYQPIHETRPQ-----HDVHLGQYDPFEPGLHV--SHSQVQPR 738
           +K+V KSYP +    GVHY PI ET PQ     HDVHLG+Y+PFEPGLHV  SHSQ+QPR
Sbjct: 249 SKEVAKSYPLEKPPFGVHYLPILETGPQHDVRGHDVHLGKYNPFEPGLHVSQSHSQLQPR 308

Query: 739 LVRVEAPPHHV--------EVYHPEQAHKAFFTETSFRHPPDSYESIHSTIETNNGDHRF 798
           L+R EAP  HV        E YHPEQAH+A+F E SFRHPP+SY SI +TIETNN DH F
Sbjct: 309 LLRTEAPTRHVEPYHPRSIEPYHPEQAHEAYFPEGSFRHPPESYASIRNTIETNNADHPF 368

Query: 799 VYGRQYHAPQLLSNREDVTHIDYTPGYYSQRLSPTTSHTATLSQARSSAYYSQNRLPSPY 858
           VYG QYH  Q + +R D    DY PGYYSQR S TT HTA LSQ++S A           
Sbjct: 369 VYGHQYHISQFMLDR-DAARTDYIPGYYSQRPSSTTLHTAPLSQSQSDA----------- 428

Query: 859 RPQRLPSPYRAPNRLPSPYRAQNRLPSPHRSYYSAVASQDRGRVLYAGSPQGLASGNLSY 918
                          PSPYR Q+RL SPH SYY  V    RG +LYA  PQ  A  +L Y
Sbjct: 429 ---------------PSPYRLQDRLTSPHLSYYPTV----RGGLLYASVPQTTAYSDLRY 473

Query: 919 GESNLPISSYYYSSASARLSYQ 926
           GESNLP  SYYYSSA+ RLSY+
Sbjct: 489 GESNLP--SYYYSSAATRLSYR 473

BLAST of Lag0022021 vs. ExPASy TrEMBL
Match: A0A1S3BY22 (uncharacterized protein LOC103494640 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103494640 PE=4 SV=1)

HSP 1 Score: 548.5 bits (1412), Expect = 5.3e-152
Identity = 312/501 (62.28%), Positives = 358/501 (71.46%), Query Frame = 0

Query: 440 SSMAKRRGKASKSEDGPLNSISNKGVKKHIKKKKKPRKTHETSNLKVAHSTQRQEPVSVP 499
           SSM KRRGKA K ++  LN   +K    ++  KKK RK    S LKVAH+  R++ V+ P
Sbjct: 18  SSMTKRRGKALKGKE-RLN--HSKNGANNLINKKKARKMSRASTLKVAHTASREDLVNGP 77

Query: 500 TYASVTASIPLGNDQKKQKEMKGKKDASGFIFMCNGKTKPECYQYRVFGLPKGKIEVVKN 559
           T ASV+ SIP  +D++K ++++G +  SGFIFMCNGKTKPECYQYRVFGLPKGKIEVV+N
Sbjct: 78  TCASVSTSIPPVDDREKHEKVEGNEGTSGFIFMCNGKTKPECYQYRVFGLPKGKIEVVEN 137

Query: 560 ISPDAKLFLFDTDSKLLYGIYEATSKGALDLEPTAFNGQFQAQVKFKITKDCLPLPESAF 619
           I+PD KLFLFDTD KLLYGIY+AT  GALDLEPTAFNGQFQAQVKFKI KDCLPL ESAF
Sbjct: 138 INPDTKLFLFDTDLKLLYGIYQATDNGALDLEPTAFNGQFQAQVKFKIFKDCLPLHESAF 197

Query: 620 KHAIKDNYEGGRKFRQELSSTQVKSLISLFRPIAKKKPSAKKSHGRPNVAIQPSFRSTRT 679
           KHAIKDNYEG RKF+QEL+STQVKSLISLFRPI KK   AK+S+ RPNV IQ SF+S R+
Sbjct: 198 KHAIKDNYEGHRKFKQELNSTQVKSLISLFRPIPKKS-FAKRSYIRPNVGIQSSFKSARS 257

Query: 680 KKVVKSYPPDNLSSGVHYQPIHETRPQ-----HDVHLGQYDPFEPGLHV--SHSQVQPRL 739
           K+V KSYP +    GVHY PI ET PQ     HDVHLG+Y+PFEPGLHV  SHSQ+QPRL
Sbjct: 258 KEVAKSYPLEKPPFGVHYLPILETGPQHDVRGHDVHLGKYNPFEPGLHVSQSHSQLQPRL 317

Query: 740 VRVEAPPHHV--------EVYHPEQAHKAFFTETSFRHPPDSYESIHSTIETNNGDHRFV 799
           +R EAP  HV        E YHPEQAH+A+F E SFRHPP+SY SI +TIETNN DH FV
Sbjct: 318 LRTEAPTRHVEPYHPRSIEPYHPEQAHEAYFPEGSFRHPPESYASIRNTIETNNADHPFV 377

Query: 800 YGRQYHAPQLLSNREDVTHIDYTPGYYSQRLSPTTSHTATLSQARSSAYYSQNRLPSPYR 859
           YG QYH  Q + +R D    DY PGYYSQR S TT HTA LSQ++S A            
Sbjct: 378 YGHQYHISQFMLDR-DAARTDYIPGYYSQRPSSTTLHTAPLSQSQSDA------------ 437

Query: 860 PQRLPSPYRAPNRLPSPYRAQNRLPSPHRSYYSAVASQDRGRVLYAGSPQGLASGNLSYG 919
                         PSPYR Q+RL SPH SYY  V    RG +LYA  PQ  A  +L YG
Sbjct: 438 --------------PSPYRLQDRLTSPHLSYYPTV----RGGLLYASVPQTTAYSDLRYG 481

Query: 920 ESNLPISSYYYSSASARLSYQ 926
           ESNLP  SYYYSSA+ RLSY+
Sbjct: 498 ESNLP--SYYYSSAATRLSYR 481

BLAST of Lag0022021 vs. TAIR 10
Match: AT5G61910.1 (DCD (Development and Cell Death) domain protein )

HSP 1 Score: 162.9 bits (411), Expect = 1.2e-39
Identity = 75/141 (53.19%), Positives = 105/141 (74.47%), Query Frame = 0

Query: 510 LGNDQKKQKEMKGKKDASGFIFMCNGKTKPECYQYRVFGLPKGKIEVVKNISPDAKLFLF 569
           +G ++  ++ +   +   G+IFMCNG+TK +CY+YRVFG+P+G  +VV++I P  KLFL+
Sbjct: 45  IGLEKGIERRLDHHEQLPGYIFMCNGRTKTDCYRYRVFGIPRGGKDVVESIKPGMKLFLY 104

Query: 570 DTDSKLLYGIYEATSKGALDLEPTAFNGQFQAQVKFKITKDCLPLPESAFKHAIKDNYEG 629
           D + +LLYG+YEAT  G LD+EP AF G++ AQV F+I  +CLPL E+ FK AI +NY+G
Sbjct: 105 DFEKRLLYGVYEATVGGRLDIEPEAFEGKYPAQVGFRIVMNCLPLTENTFKSAIYENYKG 164

Query: 630 GRKFRQELSSTQVKSLISLFR 651
             KF+QELS  QV SL+SLFR
Sbjct: 165 S-KFKQELSPHQVMSLLSLFR 184

BLAST of Lag0022021 vs. TAIR 10
Match: AT5G61910.2 (DCD (Development and Cell Death) domain protein )

HSP 1 Score: 162.9 bits (411), Expect = 1.2e-39
Identity = 75/141 (53.19%), Positives = 105/141 (74.47%), Query Frame = 0

Query: 510 LGNDQKKQKEMKGKKDASGFIFMCNGKTKPECYQYRVFGLPKGKIEVVKNISPDAKLFLF 569
           +G ++  ++ +   +   G+IFMCNG+TK +CY+YRVFG+P+G  +VV++I P  KLFL+
Sbjct: 45  IGLEKGIERRLDHHEQLPGYIFMCNGRTKTDCYRYRVFGIPRGGKDVVESIKPGMKLFLY 104

Query: 570 DTDSKLLYGIYEATSKGALDLEPTAFNGQFQAQVKFKITKDCLPLPESAFKHAIKDNYEG 629
           D + +LLYG+YEAT  G LD+EP AF G++ AQV F+I  +CLPL E+ FK AI +NY+G
Sbjct: 105 DFEKRLLYGVYEATVGGRLDIEPEAFEGKYPAQVGFRIVMNCLPLTENTFKSAIYENYKG 164

Query: 630 GRKFRQELSSTQVKSLISLFR 651
             KF+QELS  QV SL+SLFR
Sbjct: 165 S-KFKQELSPHQVMSLLSLFR 184

BLAST of Lag0022021 vs. TAIR 10
Match: AT5G61910.3 (DCD (Development and Cell Death) domain protein )

HSP 1 Score: 162.9 bits (411), Expect = 1.2e-39
Identity = 75/141 (53.19%), Positives = 105/141 (74.47%), Query Frame = 0

Query: 510 LGNDQKKQKEMKGKKDASGFIFMCNGKTKPECYQYRVFGLPKGKIEVVKNISPDAKLFLF 569
           +G ++  ++ +   +   G+IFMCNG+TK +CY+YRVFG+P+G  +VV++I P  KLFL+
Sbjct: 49  IGLEKGIERRLDHHEQLPGYIFMCNGRTKTDCYRYRVFGIPRGGKDVVESIKPGMKLFLY 108

Query: 570 DTDSKLLYGIYEATSKGALDLEPTAFNGQFQAQVKFKITKDCLPLPESAFKHAIKDNYEG 629
           D + +LLYG+YEAT  G LD+EP AF G++ AQV F+I  +CLPL E+ FK AI +NY+G
Sbjct: 109 DFEKRLLYGVYEATVGGRLDIEPEAFEGKYPAQVGFRIVMNCLPLTENTFKSAIYENYKG 168

Query: 630 GRKFRQELSSTQVKSLISLFR 651
             KF+QELS  QV SL+SLFR
Sbjct: 169 S-KFKQELSPHQVMSLLSLFR 188

BLAST of Lag0022021 vs. TAIR 10
Match: AT5G61910.4 (DCD (Development and Cell Death) domain protein )

HSP 1 Score: 162.9 bits (411), Expect = 1.2e-39
Identity = 75/141 (53.19%), Positives = 105/141 (74.47%), Query Frame = 0

Query: 510 LGNDQKKQKEMKGKKDASGFIFMCNGKTKPECYQYRVFGLPKGKIEVVKNISPDAKLFLF 569
           +G ++  ++ +   +   G+IFMCNG+TK +CY+YRVFG+P+G  +VV++I P  KLFL+
Sbjct: 45  IGLEKGIERRLDHHEQLPGYIFMCNGRTKTDCYRYRVFGIPRGGKDVVESIKPGMKLFLY 104

Query: 570 DTDSKLLYGIYEATSKGALDLEPTAFNGQFQAQVKFKITKDCLPLPESAFKHAIKDNYEG 629
           D + +LLYG+YEAT  G LD+EP AF G++ AQV F+I  +CLPL E+ FK AI +NY+G
Sbjct: 105 DFEKRLLYGVYEATVGGRLDIEPEAFEGKYPAQVGFRIVMNCLPLTENTFKSAIYENYKG 164

Query: 630 GRKFRQELSSTQVKSLISLFR 651
             KF+QELS  QV SL+SLFR
Sbjct: 165 S-KFKQELSPHQVMSLLSLFR 184

BLAST of Lag0022021 vs. TAIR 10
Match: AT2G32910.1 (DCD (Development and Cell Death) domain protein )

HSP 1 Score: 134.8 bits (338), Expect = 3.5e-31
Identity = 140/449 (31.18%), Positives = 202/449 (44.99%), Query Frame = 0

Query: 449 ASKSEDGPLNSISNKGVKKHIKKKKKPRKTHETSNLKVAHSTQRQEPVS---VPTYASVT 508
           A K+ DG + + +     K  +K+K+P K    SN K+     RQ+ V+           
Sbjct: 245 AKKAIDGSVEAKTGLTEDKR-RKRKRPTKQVRDSNKKL-----RQDVVAGADTTEQGMEE 304

Query: 509 ASIPLGNDQKKQKEMKGKKDASGFIFMCNGKTKPECYQYRVFGLPKGKIEVVKNISPDAK 568
                 + +K++ +  GK    G IFMCN KT+P+C+++ V G+ + + + VK I P  K
Sbjct: 305 RKEQPVDPEKREMDGPGKVKIGGLIFMCNTKTRPDCFRFSVMGVQEKRKDFVKGIKPGLK 364

Query: 569 LFLFDTDSKLLYGIYEATSKGALDLEPTAFNGQFQAQVKFKITKDCLPLPESAFKHAIKD 628
           LFL+D D KLLYGI+EA+S G + LE  AF G F AQV+FK+  DC+PL ES FK AI +
Sbjct: 365 LFLYDYDLKLLYGIFEASSAGGMKLERNAFGGSFPAQVRFKVFSDCIPLAESQFKKAIIE 424

Query: 629 NYEGGRKFRQELSSTQVKSLISLFRPIAKKKPSAKKSHGRPNVAIQPSFRSTRTKKVVKS 688
           NY    KF+ EL+  QV  L  LFRP A     A+ +H +      P  R T  K+  + 
Sbjct: 425 NYNNKNKFKTELTHKQVFKLKKLFRPAA---IPAQVTHTQQ----IPVPRDTDRKRSDRD 484

Query: 689 -YPPDNLSSGVHYQPIHETRPQHDVHLGQYDPFEPGLHVSHSQVQPRLVRVEAPPHHVEV 748
            Y P   SS  H    HE R        +  P +  L++S  + +   +R      H ++
Sbjct: 485 RYAPG--SSRGHPTRKHERRRASPPPRREEQPRD--LYLSEREYRTYGLRGGETTQHYQI 544

Query: 749 YHPEQAHKAFFTETSFRHPPDSYESI--------HSTIETNNG---DHRFVYGRQYHAPQ 808
             PE +           H  DSY S          + IE ++     H  +  R Y    
Sbjct: 545 PPPESSSSYHIVNRDRVH-LDSYRSSMDHDRLLRQAEIERHDRREVRHPHLSERDYQTYD 604

Query: 809 LLSNREDVTHIDYTPGYYSQRLSPTTSHTA-TLSQARSSAYY-----SQNRLPSPY---- 868
            L++R ++            R SP    +A TL   R   YY     +  R P  Y    
Sbjct: 605 HLTSRREIL----------VRNSPDPPDSAVTLDSYRRDPYYICERHALERPPRTYMVSP 664

Query: 869 --RPQRLPSPYRAPNRLPSPYRAQNRLPS 871
             +   L S Y  P+ L   YR+  R PS
Sbjct: 665 GRQDDDLYSRYVTPDSLAEYYRSSQRYPS 665

The following BLAST results are available for this feature:
BLAST vs. NCBI nr
Match Name      E-value   Identity (%)  Description
XP_038891726.1  3.6e-187  72.06         uncharacterized protein LOC120081120 [Benincasa hispida] >XP_038891729.1 unchara...
XP_022158674.1  6.6e-173  68.95         uncharacterized protein LOC111025137 [Momordica charantia]
XP_022932696.1  7.3e-164  67.33         uncharacterized protein LOC111439166 isoform X1 [Cucurbita moschata]
XP_023539818.1  7.3e-164  67.54         uncharacterized protein LOC111800385 [Cucurbita pepo subsp. pepo]
KAG6597699.1    1.6e-163  67.33         hypothetical protein SDJN03_10879, partial [Cucurbita argyrosperma subsp. sorori...
BLAST vs. ExPASy Swiss-Prot
Match Name  E-value  Identity (%)  Description
P08770      1.4e-16  29.06         Putative AC transposase OS=Zea mays OX=4577 PE=2 SV=2
P03010      1.4e-16  29.06         Putative AC9 transposase OS=Zea mays OX=4577 PE=4 SV=1
P37707      5.9e-15  34.88         B2 protein OS=Daucus carota OX=4039 PE=2 SV=1
Q75HY5      1.0e-14  23.81         Zinc finger BED domain-containing protein RICESLEEPER 3 OS=Oryza sativa subsp. j...
Q5JZR1      5.0e-14  27.63         DCD domain-containing protein NRP-A OS=Glycine max OX=3847 GN=NRP-A PE=2 SV=1
BLAST vs. ExPASy TrEMBL
Match Name  E-value   Identity (%)  Description
A0A6J1E036  3.2e-173  68.95         uncharacterized protein LOC111025137 OS=Momordica charantia OX=3673 GN=LOC111025...
A0A6J1EX39  3.5e-164  67.33         uncharacterized protein LOC111439166 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A5A7TQV5  1.4e-152  62.95         DCD domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_sc...
A0A1S4E001  4.1e-152  62.15         uncharacterized protein LOC103494640 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...
A0A1S3BY22  5.3e-152  62.28         uncharacterized protein LOC103494640 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
BLAST vs. TAIR 10
Match Name   E-value  Identity (%)  Description
AT5G61910.1  1.2e-39  53.19         DCD (Development and Cell Death) domain protein
AT5G61910.2  1.2e-39  53.19         DCD (Development and Cell Death) domain protein
AT5G61910.3  1.2e-39  53.19         DCD (Development and Cell Death) domain protein
AT5G61910.4  1.2e-39  53.19         DCD (Development and Cell Death) domain protein
AT2G32910.1  3.5e-31  31.18         DCD (Development and Cell Death) domain protein
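
Hit tables like the ones above are often screened by E-value and percent identity before further analysis. The following is a minimal sketch under stated assumptions: the hit tuples are copied from the tables above, while the cut-offs (`max_evalue=1e-30`, `min_identity=30.0`) and the function names are hypothetical choices for illustration, not thresholds used by this database.

```python
import math

# (match name, E-value, % identity), a selection of rows from the tables above.
HITS = [
    ("XP_022932696.1", 7.3e-164, 67.33),
    ("P08770", 1.4e-16, 29.06),
    ("Q75HY5", 1.0e-14, 23.81),
    ("A0A5A7TQV5", 1.4e-152, 62.95),
    ("AT5G61910.1", 1.2e-39, 53.19),
    ("AT2G32910.1", 3.5e-31, 31.18),
]


def filter_hits(hits, max_evalue=1e-30, min_identity=30.0):
    """Keep hits passing both (hypothetical) cut-offs, best E-value first."""
    kept = [h for h in hits if h[1] <= max_evalue and h[2] >= min_identity]
    return sorted(kept, key=lambda h: h[1])


def neg_log10(evalue):
    """Convert an E-value to -log10(E), which is easier to compare by eye."""
    return -math.log10(evalue)


if __name__ == "__main__":
    for name, evalue, identity in filter_hits(HITS):
        print(f"{name:16s} -log10(E) = {neg_log10(evalue):6.2f}  identity = {identity}%")
```

With these assumed cut-offs, the two Swiss-Prot transposase-family hits drop out and the cucurbit and TAIR DCD-domain hits remain.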
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                      Source       Source Term    Source Description                                       Alignment
None       No IPR available                     SMART        SM00614        bed5                                                     coord: 40..93; e-value: 1.1E-12; score: 58.1
None       No IPR available                     MOBIDB_LITE  mobidb-lite    disorder_prediction                                      coord: 1..39
None       No IPR available                     MOBIDB_LITE  mobidb-lite    disorder_prediction                                      coord: 484..498
None       No IPR available                     MOBIDB_LITE  mobidb-lite    disorder_prediction                                      coord: 8..25
None       No IPR available                     MOBIDB_LITE  mobidb-lite    disorder_prediction                                      coord: 431..498
None       No IPR available                     MOBIDB_LITE  mobidb-lite    disorder_prediction                                      coord: 811..871
None       No IPR available                     MOBIDB_LITE  mobidb-lite    disorder_prediction                                      coord: 431..452
None       No IPR available                     MOBIDB_LITE  mobidb-lite    disorder_prediction                                      coord: 811..843
None       No IPR available                     MOBIDB_LITE  mobidb-lite    disorder_prediction                                      coord: 464..479
None       No IPR available                     PANTHER      PTHR46444:SF3  DCD (DEVELOPMENT AND CELL DEATH) DOMAIN PROTEIN          coord: 448..914
None       No IPR available                     PANTHER      PTHR46444      DCD (DEVELOPMENT AND CELL DEATH) DOMAIN PROTEIN-RELATED  coord: 448..914
IPR013989  Development/cell death domain        SMART        SM00767        dcd                                                      coord: 524..651; e-value: 8.6E-70; score: 247.9
IPR013989  Development/cell death domain        PFAM         PF10539        Dev_Cell_Death                                           coord: 528..649; e-value: 8.9E-48; score: 161.3
IPR013989  Development/cell death domain        PROSITE      PS51222        DCD                                                      coord: 524..651; score: 58.634716
IPR008906  HAT, C-terminal dimerisation domain  PFAM         PF05699        Dimer_Tnp_hAT                                            coord: 341..416; e-value: 4.2E-18; score: 64.9
IPR003656  Zinc finger, BED-type                PFAM         PF02892        zf-BED                                                   coord: 43..88; e-value: 8.6E-6; score: 25.6
IPR003656  Zinc finger, BED-type                PROSITE      PS50808        ZF_BED                                                   coord: 40..97; score: 10.561487
IPR012337  Ribonuclease H-like superfamily      SUPERFAMILY  53098          Ribonuclease H-like                                      coord: 188..415
IPR036236  Zinc finger C2H2 superfamily         SUPERFAMILY  57667          beta-beta-alpha zinc fingers                             coord: 43..97
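
The domain coordinates above can be compared with the query ranges of the BLAST HSPs to see which domain each hit covers. The following is a minimal sketch, not part of the original record. It takes the printed query coordinates at face value (P08770 HSP 1: 35..285, HSP 2: 316..409; AT5G61910.1 HSP 1: 510..651) and uses the Pfam coordinates from the InterPro table. The helper names `overlap` and `domain_hits` are hypothetical.

```python
def overlap(a, b):
    """Number of residues shared by two inclusive (start, end) ranges."""
    start, end = max(a[0], b[0]), min(a[1], b[1])
    return max(0, end - start + 1)


# Pfam domain coordinates on the query protein, from the InterPro table.
DOMAINS = {
    "zf-BED (PF02892)": (43, 88),
    "Dimer_Tnp_hAT (PF05699)": (341, 416),
    "Dev_Cell_Death (PF10539)": (528, 649),
}

# Query ranges of selected HSPs, as printed in the BLAST sections.
HSPS = {
    "P08770 HSP 1": (35, 285),
    "P08770 HSP 2": (316, 409),
    "AT5G61910.1 HSP 1": (510, 651),
}


def domain_hits(hsps, domains):
    """Map each HSP to the domains it overlaps and the overlap size in residues."""
    table = {}
    for hsp_name, hsp_range in hsps.items():
        covered = {}
        for domain_name, domain_range in domains.items():
            shared = overlap(hsp_range, domain_range)
            if shared > 0:
                covered[domain_name] = shared
        table[hsp_name] = covered
    return table


if __name__ == "__main__":
    for hsp_name, covered in domain_hits(HSPS, DOMAINS).items():
        print(hsp_name, covered)
```

Under these assumptions, the maize transposase HSPs fall on the BED zinc finger and hAT dimerisation regions, while the Arabidopsis hit covers the DCD domain. This is consistent with the two groups of hits matching different parts of the predicted protein.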

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Lag0022021.1  Lag0022021.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0110165      cellular anatomical entity
molecular_function  GO:0003677      DNA binding
molecular_function  GO:0046983      protein dimerization activity