PI0016731 (gene) Melon (PI 482460) v1

Overview
Name: PI0016731
Type: gene
Organism: Cucumis metuliferus (Melon (PI 482460) v1)
Description: protein MET1, chloroplastic
Location: chr05: 22979650 .. 23000953 (-)
RNA-Seq Expression: PI0016731
Synteny: PI0016731
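
The Location field packs the chromosome, start, end and strand into a single string. The following is a minimal Python sketch of how such a string could be split into its parts. The function name `parse_location` is hypothetical, and the span length assumes 1-based inclusive coordinates, which this page does not state.

```python
import re

def parse_location(text):
    """Split a 'chrom: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: coordinates are 1-based and inclusive.
        "length": end - start + 1,
    }

loc = parse_location("chr05: 22979650 .. 23000953 (-)")
print(loc)
```

Under that coordinate assumption, the gene spans 21,304 bp on the minus strand of chr05.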
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAAATAAGAATGAAAAGTGGGACAGGTGTTGGAAGCATATTGGGTGGTTGTGGGAAATCATAAGAGCCACAAATGTGGGTGCAATCCGATTCTGTTATTTAGGACCAAAGAAATCGAACGGAAGTCTCACTCGTCCTTCCTTCCTTCCTTCCTTGGAAATGCATTCATTATTACATCTAATCCCCTCTCGAGCGTGAGCAGTGAGCAGTGAAGAATTTTGATGTTCACCACCATGTCTTTAGCTCCCCATTATCTCTATTCTTCACACTTACTTCCCAAAATTGCCCTCCTAACCATTCCTCTTCCACTTCCTCTTCCTTCACTCTATGCTCTCGATCCCATTCCCATCCCAAACTCTCTCATCTACCTTTCTCCACTTCCTTTTCAACTTCTCATTCCCTTAAACCCTCACCTTTCATCCTCAGAGCATCCGATACAGAATCCAAAACCGACACTGATTCAGATGATCCTAACCAACAACCTTACGAAGAGTATGAAGTGGAGCTGCAGCAGCCTTATGGTTTGAAATTCGCCAAGGGCCGAGATGGTGGAACTTATATCGACGCCATTGCACCTGGTGGTTTTGCCGACAAGACTGGATTGTTCACTGTTGGTGATAAAGTTATTGCTACCAGGTTCCTCTTAATTCAACTTCTTTCTCATCAATAACTATTTATTTCCTACGCTTAGCAATTTGAATTGTCAAATAAAAAAACTATAGGATGTGCTTCCGTCCATTACCTGCGACTTTTCAAAATGGTGTATGCAGGGTCAATGCATAACATTTTCTTTAAACACCGTCCCTGAATGTTTATTTGTGGCGCCTGGATGAACCCCGAGATAACGTGATCAACATCTCCGTTATATCAAACTTGTGCGCTTTCGTTGCTATGTTCAAAATCACATATGAAAGTAAAAGCTTATTTTGCAACATTCATCACTTACGATGCTGAAATTATCACAATCTAGATAGGTTACAATAATACAACATGCAACTCATATCCTTGACACTAAAGACAAAGTTGAATAATGCTTCAAAATTGGATTCTTGTTGATTTGTGAATTCATCCTAATGGAAGTACAATGAAGAAAAGAAAAAGAAAAGATCTAAGCTTCTTAAATCTTCCATGTGTTTATTCCACAGGTTAGGATTTTAATTGGGTCAACTGTTTTAGTGAAGGGTGGATTATAATGCCGTGTGCTCATAACTTCTGTCTTGTTGTTGTTGCGTGGTGTATTTTTGTTTCAAAAGAAAGTTAAAAAAAGAAAAGAAAAAGAAAAGAACTAGTTTGCATGATTTCTACATATCTCCAACAGACGTGGCTTAGCCTAGTATGAAAATGTGACAGAAAAATCCTAGCAGAAACATTTGCAGCTTTTCTTCCATGGATGGGATATGATAAACAAAATCGTTACATCTATGCAGTGCAGTATTTGGGACGGAAATATGGCCCGCTGCTGAATACGGAAGGACAATGTACACAATTCGGCAAAGAATTGGTCCATTACTCATGAAAATGCAGAAGAGATATGGTTAGTTATCAATCACACTCACACATGAATAACCAATAGCAAATCAAAATGAAGGATGATCAGATTTCTTATCTTAAATTGGCTGATGACTTGTATATATCTTAGTTCACTAAAAGACATAATACTCTTAACATCCTCTATAAATATGACCTTTCTAAATTGTTTTTTATTGTTTTTTCAGGAAAAACAGATAGTTTCGGTGAGTTAACGGAAAAGGAGATAATCAGAGCAGAAAGAAACTCGGGCGTAGTCAGTAATAAAGTGAGAGAAATTCAAGTAAGTGAATGTTTTCTAGTAGCTAATAGTATCCTATTGAGGCAAGTAACATCTCTGTTCTCTTTCCTCATTTTTTCTGAGCTTGGCAAGCAAGTACCATGCAATACTTCGCAGTTAGCAGAATGTCACTAGGGACTTGTCTGGAAAATAGGATCTGCAAAACATTGATTTAGCATTTAAAATCTCT
GCTATATTGTTAACATATGATAGGTCTAATTGTTGTTACAGTTGCAAAATGCCTTGAGAATGAAGGAGCAGAAAGCACGCAGAGAAAACGACCTACGCCAAGGACTGCGTCTCTATAAGTATGTGTTGTATATCCAACTTATCCTCTCCCCGTATTGTTTACTTTTGAAGTGCAGACACATCAATGCCCCAACTCCGTCTTTCTTTCTGTTTTTCCCCCTCAAGGAATGCTAAATATGAGGAAGCATTAGAGAAATTCGAATCAGTGTTGGGGTCAAAACCAACTCCAGACGAAGCTTCGGTTGCTAGTTACAATGTTGCATGCTGTTATTCCCAACTAAATCAGGTAATGCTTTTGCTTTAATATGGAATTTGTTGTAACAAAATCTGTAATTGCAATCTCATGTTCAATATGAAACTTCGGCTTAGATTGGGGTTGATGTAGATTATAAAATAATTGTTGCCAATAGTTTATTTAGAAAAATCATTTGTGAAATGAAATAAAACTAGTATTTGACTAAACTTCAATTTTAAATTAGAAAGAAATAATAGAATTTAGTAAATGAATATTAGATTAGTGAGATTTAGCGACAAGGCTTATCATTTCAATGTACACAATCCTATGACTTTGGTTTACCATACTTTGTATAGACCCATTTGTTTGTTTACGTTTCCATCTTTATAATGCTCAAACAGTTAAAAGCTGGGCTATCAGCACTTGAAGACGCCTTGGAAGCAGGATTTGAAGACTTTAAGGTAACAGCCACTTCAAGTTTCGATATTCAAATAATATTACTTGCATTGGGCAATTTAGGCGGAAGTTGTGAAAGTATTACTCGAATGTTTGCCAGAGAGTTCGGACGGATCCTGATCTATCAAACCTAAGGACATTAGAGGAATTTGAGCCTCTTGTGAAAAGGTTTGACGAATCATTTATCAATGAGAATGCCATCAATGCCATCAAGTCGCTATTCGGCTTCAAAACATGAAAACCAGACTCTTCTACAGTAATCTTTTATTAAACAATTCACATTCTTGTTGTCACAACTTGTATAAAAGCATCCTGAGCTTCTAGCTTTTAGACGAGTAACAAAAGAATTTCAAGGGAATTTGTTCCCTTAAAAAAAAACTTTACATATGCATTATTGGTGATCCTATACACAGTATAGCTATGTTGAAGAAACCTGTTTGTGTTCAATTAATTCCAGGATAGCCAGTTTCTCTATCTGGAAGCTGAAACTTGATTTTAAGTTTAGGTTATGAAAGTTAATGTTAAACCGTGGCAACTAAAAAACCCCAACCAAAATCGAAGTCAGTTTGAGAATTGTTTGATTTTCCAGTTCTCTTGAGATCAAAGAGTCAATCCATGGGGATAGCATCCATGAGGGGTGCAAGTTTGGGGTATGGGCAATGGTAAGTTTAGGTTTTGTTTCAATTTTTGTTATTCTATATATATATATATATATACACACACACAAACAGTCCGGTTTCGGTTAGAGAGTCAAACCTAACTGCTTACACCCCTTAGATGATAAGGTAAAAATTCACGTACTATCCTTGCTATATTTGGACTTCTTTTCCGCAAAAAATCCGTGATCCACGATTAAAGAAGAAAAGAACTGCTAAGAAGTAAATTCTGAGAAGGTGAAATGCAAACTTACACATTGCCAATCAGATAGGGTACCCGAAGTCCTATTCATCCACCCCCAATCATCCAGAGCTTTTGAATTACAAAAGTAAATTCATAATAATACATTAATTTTATTATTTCAAAATCTCAATCCAATCCACAGAACTGTAGTGCCCTGTTACGGCAATAAAAAATGGATAACAGAAATGGACGGCAAGCCTCAAATAGCACATCCAGATGCATAAATGTGAACTTAGGCTATCTAACTGTTCCTACTACCCAACAAAACTTGGAATTTAGGAACAAACCTTGAGCCTCTTATGTCACTCGTCCTAAAATTACTGCTGAATTTTAAGCACAGGAAAAGCAAT
TACATCTCGAATGCTTGCAGAGTTTGTCAGAAGCATCACTAATCTATCAATTCCAATTCCCTGAAAGTTAAATTCTTTCATCAGTTACAGCGAGGAAAAGGGATACTACACATCCAAGTACAAGGAAACATCCATCATATGTTCCACAATTTAATTATAGATCAAATATCAGCTCCAGATTCGGTAATTTGTATGCACTTGAAATTTTTAGACTACTTGCTTGCACATGGGAGACTTTAGCAGCTGGTTTTTAAATGTTTGCATGTGGGTAAACTTTAGAGCAGCTATGAAATAGCTATGCAAGACACTTTCATCGCCACTTTCTTATCTGTTTTTTCTGTGATTTTATTGTGTCAAAAGAATGGAGTGATCTAGAGTTTCACTAGCAGTGCAATAAGGATGCCATCCAAAACGTGACACAGAAAAATGGACAGACAGAAAAGATAAGCATACCAGTCCAGAAGCTGGTGGCATTCCATATTCCAGAGCTGTAAGAAAATCATCATCAAGTGTCACTTCATAAGAATCATCCTCATCTTTTTTGCGATTGAAGTCCAAAGAGTCTGTTCCAGAAACAGCTGCGCCATGCTTCTCATTGTGTTGCCTGATCTGGTCTTCAAATCGTCCTCTCTGCCAATTTTGTTTTTTTAAAAGAAAAAGATCATGACAACTATTCCAATTATCATGGAACCGGAACACTATAACTTTCCAATAGAAATTATTATTTCTTGTCATGGCATAAACCAATAATGCACAAGTATACCAAGCAGGAAGAATAGAATGCTCATGTGGAGCTTGGCCGCACATTGGTATGGAGTTAGGTGCAAACCAAAAACCAATTCCTTCGGTAAGCTTTTGACTGAAAGGGTCTCTAGAAGATGGAACGAAGTATAGGAAAGTTATCTTTCAAAAGGTGATAGATTATCCAGTTCAATTTGACTTCAATACTAATGTACTTTCTCTTTCAAATGCCATGAAATTGGCTAAAAAGTTTCATCATGTATACACCCTTCTGACATTAATGATTGACATAATTAACTCTTACCTGATCAATTGGATCGGTCAATTCAGAAAATGCATTGGCCAACTCACGACCACAAATGAAAAGTTCGAATCTTTCTGTGAACCCTGCATGCCTAATAATGAACAAGAAAATGAAAAATTTTAACCCCTCCCAGGTAAGAAGAAAGAAACGATATACAGGCCAAGAAAAGATAGAAAAAAGATATTCAAACAGCTGTTGGTGTAAAAATTTGAGATCATCATGGTCATCCTCCAATAATTTCCGAAAAAGGAGAATAATGGAATATTTTAGGATGGAAAACAAAATTGATAATGCCCCAGAGCTTATTTCTTTGGAGTTCTTGTAGATTCCTAAAGGAATTAGACAATACGGCTTTGTGTTTTAACATCAAGATTGGATATCTTGTTGTAGCACAAACTTAACTTAGGACAGTATGCTCTGTAAATTATGCTTTTATATTTCTGATTGAATAATTAGTTGTTTTTGTGCGTGTACACTCGCACATATATATTTTAAATGGTGATTATCTTTCTTGCCCTACAAATGACTCCTTGGTTACAAATGGAATGGACAAATTTGACATTATAGCACACACAGAAACCTAAAACACTACAGGCACCTACAGTTATGGATGACGCTGATTTACTTGCATTTTGCAGTAAACCCAAATAGAGACAGGCTAGTTAGATAAGAAATAGAAAAGCATTACTCTGGCGTATATAGCTCAGATTGGAATATTGTAATCCCTCTGATATCTCATTACACTAAGAATACATGAAACACAAAGCTAAGAACATAGATATGCATCATGTGGCCTTGATACCTTCTGTGAGGTTTAGCCAGAGGAGATATCTCAATAGGGTAATCCAGAACAAACGTGGGTTGAATGAGTTTCGGTTCCACAACAAGCTCGAAAATCTGGCACATAAAATGCAAAGCATAAGGATTGGATTAAATAAAGACAAACCTTTTTCTT
CTATAATAGGAAACTATATCATTAGAACAAAGGATGGGCATAGGCCAAAAAAATACCCAAACTATGACAAGAAAAATAAAGAAAGCACTCCAATTGAAAGGATATAGGCCAAAAGAAAAAGGGAGAAACGAGAAATCCTCCAACTTACCATATAAAATTTAAAGCTCTTACAAAAGGAAAAAAAGGAAGTTATTGACACCCATATGGAGGCTGCAAAGTGGAGAAGTTATGGCATGACAACAAATGAACCAACACAAAGGGACTGATTGCATCCATAAAGTTTCAAGAGAGAACTAAAATCTTCTCATAACATAATGTACATGTGTAAAGGTAAACCTCCTAGATAGAGATGGAGCATAACCTCAGATAGCAGGTGCCCAATGGATTGGCTTGCTTCAATAGAAGTTGTATCCTTACTATCAAGCGCATCCCCAATGGTCCTCAAAGTAACTTCTTTGGCAACTTTAAGGTCATTTCCAAACTCATTTATATCTATCCCAGTGGTTTGTTTCACGAGGTTGTGCATTGTTTCTCTCCTCCAAGGTCTTTCAAAGCATATATCAGTTCCCTACATATTAACCACAGTATTAGGTCCGTCATCTAATTGACTGATAATCACTGGATGGAGTCTTCTGTAAATGCTGCATCTTTCTATATATATAAAGGGGTTATCAAGGGAGAACCTTTATCTACCTATTTTATCCTTTTTTTTATATCAATATATATCTATTTTACTCTCAAGTTCAATAATGGGTTATTAAAAGTTCAGTTTAAGTTATAACTAATGAGTTGCAAATCGTTTTAACTATATAATTATAATTAAAGTAAAAATCATTTCAGCATTAATGCAATTTTCCAAATCATCTTGAAAAGTGTAAAGGTCACATAATTTTGAAAGGACACAAAAATTTCAATTAATTAAGTAAATAAGTAAATAAATTTAATAAATAAATGGTTATTAGTAGTTCCTTAAAAGTTATAATTAATGTAAAAATCACATTATAGTTAATAAATTGCAACTTTTAAACCATCTTGAAGAGTGTTTGGCAAAAAAAAAAAAAAAAATTAACAAGTTGTTTTCAAATATAGTCAAAAGTAGGGGTGATAATCAGTTCGGTTTTTTGGTCAACCAAGAACCAAACCAAAACAAATGGTTTTACATATTATAAGGCATCCAAACCAATGTCCAATAAAGAACAAAACCAATAGATTGGTTTTATTTGGATCGGTTTATTATTGGTTTGGTTTTCGATTTTTGGCCTTTTTTTTTCCAAATTTTTATTCGTACTTTCCACTTTGTTTCTAATTAGTCCTTTTTGCTAACTTTAAATTCAAATTGGATCTTTTTTTCTTTTTGTAAATTGCAATGTGACATTTTTTAAAAATGTTTTTGCACTTTTCTAAATAAGAAAACATGCACGCTGCATAATAATTATTTTCTTTCTAAACAACAAATATACACGCACCATAATATTTAAAATACAAAAAGTTACCTAATGAAAAATTTATTTTTAAGGGTATATATACCATTGGTTCGGTTTGATCTTCATTGGTTTTTCCAATAAAAATCCGATTTGAGCCAATACTTTTTGGTTTTCTCCAAATACAAACCAAAAAGTCCAATTTGATACTAAAAACCAAACTAAACCATTTGATTGGTTCGGTTTCTCGATTTTTTGTTTCACTCCTAGTCAAAAAAACCAAAATATTTACAAATATAGCAAAATGTCACTGTCTATCCGACATGATAGACACTGATAAACATCTATTAGTGTTTGTCATACACTATCAAGTATCAATATCTATGGTTATTTTTTCCTTTTATATTTTTCTTTTGTATTATATCTTGTATTAAGTTTGGGTTAAGGAAATAAGAAAATGAATTTTCTCCTAAAACCACATGGTATCAGGCAACCCTAACAACCTAACCCTAATTTCTGGTGCGTCTTTCCTTTTACAATGTCTCTTCTGACGTCATCTCTTCTGGCATTAGGGCATCT
CTCTGTCGTGGTTTGCTCCGTCTTCAGCTGTTTTTCGTCTTTGCTTTGTCTCCGGTGGTTGATGCCTCTCCGATCGTTTTTTTCGACGAGTTATGTGGCGTTATTCATGTTTTTCCCCAGATCGACAGTTTCAATGGTTTTTCCTTCGTGGGTTGTTTTCCAAATCATCCTATTGGGTGTTTTTTGTTGTCCGCTCGTCTGGACTTGCCTCACCTGTAGGCTTGAAATATTTGAAAGGCCTAACAAGTTTAAATCAATTTTAATTAGAGAGGAAACAACAATGTAAGGGCTTAAATATAGGACCTCCCTGGACCACCTGCTTTGATACCATCTTCTTGATCTTGTCGGCGCCCCATCCACTAAGGTTATTGGACAATGATAGTGGCACGTAGGGGTGAGCATGCTAGACCGAAAAATCGAGCTGACCAACCAAACCGAAGTCAATCGGTCGGAGAACGAAAGGGGTCAATTGGTGTCAGTTTGGGAAAGTTCCAAATCGAGAAATTTCAAAAAATATTTCAATGAGTTAAACTGACCAACTCGACAAAAACCGGCCGACCGACATTTACTCATCGACGGTTGAGATTGGATTGAACACTTACTAAACCTACCATAGTCGGTTTGGTCGAAAAACCGACTCCGACCGACTGATGATCACTCCTAGTGGCACGTGCCTGTCTAATGAAGGTTGACACGTAGCGAATGGATGACCTGAGAGGAAGGTTCGTGATGGTCTCCCAGTGCCAGTGCCACACAGTAGGCATTGTGGCGTGTACTAGCAAACCGTGAGCTTCGATGATCACCCCATTCCTGGTGCTGAAGGTAGTTAGCGCGATCAACGAAACTGCCTCCATCTTCGGCAACTCATTTTTTATCAATTCACTATGTTTAATTTTACACTTAAAATTAATATTGAATGATTAAAGGAATTTTTTTAGTGATTTTGAAAATGGCAAAAGTGATTTTATTTTTAACCATCATAGAATAGCGCTCCCAAACAAGCCATAAAAGATTACATAAAATAATTTCCGATTGCAAGCTATAACTAAAAGGGGGCAACTATATCATATGTACGAAATATATATTTCACAATAGGGCTCCATGTTGAACTATTCACCCTCTCCGTCTTGGATTGTTCTTCTATTCCTCCAAAGCCAAAATTCCAAGGAACTTTTATAGCATTAGTCTACGAATTCTCTTTAAGATATACCATTTAAAAGGTCCTTTTATTCTATTTCATTGTTTAAGAACAACTTGTTAAATAACAATTTTGCATTTTTCAAAATAAAATAAATAAGTACGGAATAAAATAAAAAGTTGAATTGGAACATACCTGATAATCAACAGAAAGCTTCCCATGGATAGCAAGAGCACAACGAGTAACGATTTCTTCCGCCATATTCATCATACTCTGATAATCGGAATATGCTTCATATATCTATGGAGCAAGCCATAAAGTACTGATTAGAATTCATTTGCAAGATGTAACTTCAAAGAATGACCAAAAGAAAGCTTGCCTCTATAGATGTAAATTCGGGATTATGACGAGTTGAAATTCCTTCATTTCTGAATATGCGTCCAATTTCATATACTTTCTCAAAGCCACCAACCTATTACCAACAAATAATATATTCACGATGATACTATATCTCAAATACAAGTTTTGAGCTGATAAAGAAAGGGTTATCTAACATCTTCTTATATTACATTAGGATTTTCTCAATGCTTTTCAAAGTCAATATGGAAGTTGAAAGAACTTTGCGTTCCTTCCGTCGGGAAGCTTCAAATGGCAGCAAGATTTCTCACTTAGCAAAATGGGATGTAGTTTCTAAAGGTCAATACGGTGATGGTCTTGGCATTGGGGGCTTGAAAATTTATTTTGGAAAAAGATTCTCTTTTTGGAAAGGAAACACGATTTTGAATGTATAGGCTTCTTCAATGATAACTATTTTTTTAGCTTGTTTGGAAAGGGGTCCCCCCACCCCATGCCAGTGCCTTTA
GGCCTTTTCCTTTTGAATAAATTCTTAGTTTCTTATAAAAAATAATGATGACCGCTGTTTGCAAAAATCTTGAGAAAAAGAAACTATCGGGATAAAACAAGGGACCAGTAATTTCGTTCTTTACAACAATGCTAAAAGTATAATATCAATAACTCACAAGCATCCTCTTTAAATGGAGCTCAGTAGCTATTCTTAAATACAGATCTCTCCCAAGGGAATTGTGGTATGTAACAAAGGGCCTTGCTTCAGCTCCCCCAGCTGCTCCCTAGTTGATTAAACAAATCAAAACCAAGTAAAAGGTAGTAGCCATATTTGGACACTTAAATTTATAGAGTAAATACATAGAAGAAAATGGCAATTCATCGATTAATTTAAAAGATGCTTCTTAACATTCAGACAATCAGGACCAAGTCAAAAGTCTCGTATATGTAATAATTTAATAGCTACAAATTTATCAGTTTTAGTTCTCTGTAAATGAGCTGGCTCTGATAATAACAATATAATTTGGATCAGTCTTGCAAGTGATAATTGATATAACGTTTAGCATATAAAGATGTAGAAATGTTTGGTTCCAAAGTTCCTTCAAAATACAATTTTATCAAGCATTTCATACTTCTCAGAAGCAAGATACCAATATAGTTTAGGTCTGCCATACGATAATTTCTCAGTATTAGGTCATATCGCAAGCTATCAAAACTAGTTCATACAAAGTACCATTTAAAGGGCAGTTTGAAAAAGTAAAGGCTAGGCCTCGGATATCCTTCAAAAGAAGTTAACATCCAATTATTAATGCCGTCGGTTTGTTTTTTGCATTTCGAGGCCAAGGATGTTTTTTGTTATCATACATGCTATTATGTATAGTTGCCCATGCCCTTTATTGTATTTGGGTGGAAAGAAATTGGCGGATCTTCAAAGATAATGCTTCATCATATGATCATTTTTTTGAGGCTATAGTTTTCTTTCTTTTTTTCTTTTGGTTTTTGACAATATACAAGGTGGGGGTCAAACCTCATACCTCAAGATCGATAATACAAGGTCATGCTAGCTGGCATATGATCTATTTTTTCGGCTATAGTTTTCTTAGAGGCTTCTTGGTGTAAAAGCCCTCCCCCCTCCCCTCCCATCTTTTATTACTGTAGGCTATAGCTTTTCTTTTTATTACCCAACTGGAGAGGTTTTTGTAACACTTAACGACCCTTTCGTAAATTCAGATGAAATTGCCTCTTCTCCAAAATAAATAAATAATGAAAATTGGTTCCTGCATGTGTGCACGTACACAATATGAATCTCATCAACTAAAGAAAAAAATTGTTCAAAGAGGCAAACAACCAATCTAGCAAAGGAGATATTGGATCTTCGGTGTAAAAATAAAAGGAAAAGCAATTTGAGGCCATGTCGAAAGAACAAGGATGATTAATTAGTTCTAATTGACCCTTAATTTTCAAATTTGTACATTAATCTTCATAGGATTTTTTTTTTTTTTTTGATTTATTAAATTAGATATAAATTTAAACTATGAAAATTTTCACGAACCTGTAGGACTGGAGTCTCAACTTCAACAAACCCTAATGATTCCACAGTCTTGCGAACCTCTGATATTATCTATACAAACAACAGATAGCATGGAGTTCAAAGTATTACAAGACTTTACATTATTTTTTGCAAATATTACATTGTGTGTGCAGGTGGGATTAGTAAGGGTCTTAGTTTGGGATATTAGTATGCTATTAGTAGGGGCATAAGAGTAATTAGCTAGGAAGTTGGTTGGCGTATTTGATTATAAATAGAGGAATGAGGATTGAGGGAAGGGAAACAATTGTTAGCATACCCAAGACAAGAGGTACCAAGTGCATTATACTTGGTTTATCTTGTAGTTCTTGATCCTTTATATTTCACTATATTTCAATTCTTGCTTCTTGTTAGGTGGTATCCTAACAATTGGTATTAGAGCAAGTTCATCCTAGAGTACCATTTTTGGCAATGGGAAAAGAAAAGCAGTG
GCCATGGAGAGCAACTACTATATTGGGTTGATCACAAGAGCGAGATGGCAGAAAGAGGAAGAATTCCATGGTCATGGGAGCCACTGCAAAATTGAGTTAAGGATCTTCAATGACAAATGGTGGAAATAAGAAAAAATGGTGGGAGCAAATGCAACAAACTCAAAAAGAAGAGCAAAAACAGATGAGAAGATCAAGTTTGTAACTGAGACTGAGACCGTCATTAAGGGGGCGTTTGGGCCCCAGACTTCCTAGGAGTGGAGTGAGTTTAATAACTCACTCCTTGTTTGGGGCCCGAATTATGGAAACTCTAGGATTTGGGTTTCCATAATCCCCCAACTCCCCCAAACTCCCCCCCAACTCCCTCCTTATCATTACACTATTATCATAAAAAATTATCAAAAAAATTATCATAAAAAATTATCATAACACTACCTACATAATCCTTCCCCCAAACACATACTATCATAACACTACCTATCATAATCCTTCCCCCAAACACATACTATCATAACACTAGGACTATCATAACACTAGGATTATCATAACACTAACCCCCCATAACTCACTCTTTCCCCCAAATGCCCCCTAAAGATTGTTGATGTCTCAAGCCTGAATAAGTTTTTGGTGTAGGGCAGTGGAGGCGCCAATGATTCCATCATAGGTGACAACCAGACGAGAAATATGTAGTAGAGGAAAGGAGGAAAGAAAAATATACCCACAAAAATAGTTGAGGCAAGCAGAGAAAGAACTCACCAAAGGACGTCAGGTTGAAGAGGCAAAAAGACAAGCAACGGACGTTGAAGAAGGATAAAATTTGCAAGGGCAGGACGAAGCAACAAGGAAAGACACAATATTTACGAAAAGGCGTGGTGACCAGCGGATGAAAATGGAGAAAACACTTGCTAATAGTCACAGGTAGGCAGTTGAACGTATTGGAACTCACAAGCAGGTGTACGAAGAAAATTTTTTTCTTTTCCTCAAGGTTGAAAAGGACAATTGGCCACAATGGGGAGAGAAGACGCAGGAGCTTCATTGGCTAGGTAGGATGGGAAAGAGATTGGTGACAAAAAAACGATTGAAGGAACCAACTCTGGTCACGTAGGTCAAACAAGAAGGATGTGTTGAAAACTACAAAGGAAGGTAAGTTGCGGTGGATGAAGAACGATTGAAGGAACCAACTCTGGTCACGTGTGTTGTCGAAGAAAAGTAAAGTATGGAAGTATGGAAGCAAGTTGGTTAATCTTGGCCCTTGTGCATGGGCTTAAACAAACGTTTTGGAATAGGCAAAAAGCCTAAAATTGAATAGGTTAGCTGAGCTTATTAGGTGTGGGCTAGCCTAGAAAAATTTGGATCGAAAGCCCTACTTCACGAGTAGTTTTTAATCATTGGTCCAAATAGGGTTGGGTTGTTCCCTCTGGTTGCATTAAAAAAGATTTGGCCCATTGTCACACCAATGGGACCATGTGGAGATAGAAAGTTCAACCCTAGCTATCACCTCATTTGTCTTGTCATCCATTGTTTGTCATTGCTTGCCAACCATTGGTCGCCGTTGCACCTCACCATTTGTCACTCTGACCACCGCCGCTGATTATAAATAGCAGGAGGATGAGGCAAGGGAGGCCATTGTTGAGTGATTTTGGGCTTGGGTTAGTATACTCAGGAGGGGAGGTTCCAAGTGCCTTATAATTGAAAAATCTTATAATTCTTGATCATTTATATTTCAATATACTTCAGTTTTTGATTCTTGGTATCCTAACATATTCAAACCTACGACTTTTTAAATGAAGTGAATAGAAGAAGCCACAAGCTTAATTATATAGTTATATGTACAAGCTACAAATAACTATAAGAAATAGCTAAAGCCTACATCACCTTTGCTCTTTTCCGAAATGTATCAGCTACTTCGGGATTTGCAATCATATCTATGTACCTAAAAACAAAATACATCAATTCAGCTATGCAAAATGGCATTTCTAACCTAAAGCATATTTCAAATGTAC
GTTATTTGTACTCATTTGAAGTGCATTGTCTTCGTAGTATTGAAAAATATCTAGATAGTGAAGATTACATGTGAAACAAAAGGTAATATTCCTCCATAGGTAACTGATAGTGATAATATTTTTTTAAAAAAACAATAATAATAAAGGAGAGGAAACACAAAGGAAAAGAATGAAAAATCCCTCAAAGCCAACTTGATTTTCATAGACATTCTAAGGTTTTTTAGAGAAAGGGGGAGGCCCGAATATCCACATCTTGATCACAGGGGGTTTTCACAAAAAGGAAGACCCTTAAATATATTCGTAATCCAATTTTGCAAATTGCTTTCTGTAAGTTTGTAAACCTTCGGTTTATTGATGATATAAAAACCAGGAAAACCTCTATGGCAGAAAATATCGAATAGTTTTCTCCTTGGCAACCAGTGATTGATAAGGTTCAAAAACAAATAGATTGTTGGAAATTAACAAGAGGGGGCAAATTGACTCGATGTAACTCGATGTTATCAAGTAACAAGTACACCTCTCTCTATGTTTCCCCTTTGGGGTTATTTCTCCATTGTTTCCTTAAAATAAGACAAAAAAAATTGATTTTCACAAGCTTAGAGCAAAAGACATATCGAGAAGTGCTTGCCCACAAAAATCAGAACTTTTTCAAATATAAAAGAAAATAAACGGACCAAAAATTACCTTTGGCGGTAGCGCTTATCAACGTCAGTTAAACCATGATATTTGTCAGGAAGTGGAAGTAATGACTTTGTCAGGATTTCAAAAGAATTTACACAAACAGAAAGTTCCCCTGAAGAAAATGAAGGACAAAAATAAAATGAACTCACAACCAATCCAAATGCCTAAGAAGACGCCAATGCAGTTTGGATCCAAGTAGAAATATGCTCTCTAGTCTTAAATTTAATATGTACATAAAGGAACACAGAGAACTAATGCATAGCTAAATGAACTTTACAATTCAAAACCTGTATACCATACCCAAATTATTCGTTTTCATTATGATTCCTAGCCTAAACAATGAGTATGAAAAAGTCGTGTCCATACTTTTTTCTCTTATTGTTCTAATCTAAGTACAATACAAAATACTCCTTATATAGTCTCGAATACCACTCTCTCATTCAAATGGTACAGAAATACTCTTATGGCCCTATGGAACTGTCCAGTGCGTGCTTTTCTTTGCCTATTACAGAAAGAGAGAAATAGTAGGATTTTTCAAGATAAGGAACTCTCCTTGGATTATTTTTGGAAAGCTATCATTGATGTAGTTTTGGAAGGGAGGTTTCTCTATTCTGGCCTTTAGGTTGTTTTTCCTCGGATTGTTTTTGGAAAGCCATCTTTTAGAATCAGGGTGATTAGCCACATTGTATGTGTATGCGTGTGCATGTGTGCGTGTTTGGGTTAATGCGATCAAAGTTTTACTCTCAAAAGGATCGATTTTGGTTGACAGTTTCCTATGCAACAGGTGGATTAAGCTAGGAATAAGATTAATATTTGAGGTTCTCTCCTAAGTCCTAATTTCAAGTAGATTACGAGAAATCAGTTCTTTTTTCTGATAGCTGCATCCAATTAGTATTTCACTTCTAAGACTCTAGCTTCCAGAAATTAACTGCAGAAACTTTCCTTAAGTTGGAATCCCATTCTAACTCAGGATTCTTGTGGTTCTGGGCCGACAGTAAAATGTGACCTAACCTGCTGAAAAAAAAAATCGAAAAATGTGATCTAGTCATTCCAAACAATAGCGTGAGGACTCACCTTTCTCAGTTCGCTTAATTGAACCTCGAACACCAAGAATATCTCCAATATCAACGAGATTCTTTAGCTGATCAAACTGATCATTTACGAGCTTTTCCTTCTCACAGTAAAGCTGAAAGTTCAACCAATTATGTTGATTAAATCGCCCATGCTTAGAAGAGCAGGAGTAAAAAACCTCTTCAAAAGCACCTTGATCATAAGGATATAACTCAAAAGTAACTAAAAGACGTCATTCTCATGGATAT
TACGAGATAACAAAATAAAATAAAACAGACGTCAAATGATCTCCCAATTATGTTCACCGTCATCCAACATCAAAATCAAAATTTCCCCTTCTTAAAAAAGTGTACTCTGCTAAATAAAACTAGTATTTTCTATGCACGCCATAGAGTTTCGCCGATCCTAGCAAATACTTGACTTGACGAGGTAATAACGAGCATATGTGCGTGAAGCAAGCGAGTGTTCGAACCTGAATTGTACCAGAATCATCTCTTAACGTCAGAAATGCAAGCTTTCCAAACGCTCGGCGAGCCACAATTCTTCCAGCTACTGACACACAATCCGCATCGGCATTAGACTCTTCGCCATTACCTAGATGCTTATACAACTCTTGCAGCTGATTAGCAGTATGAGTCCGGTCCCACGTATATGCATAAGGTTCGAAGCCCTTGCTCCTCAATTCTTCAACCTGACACAACAGCATGAATGTCCAGCACGGTCAAGAAAGTCCAAAGAAAGAAGAGGTGTGGAGGGGAAGAGTATGGTGCATTTATAAGTAGATAGAAAAGAACCTTCTTTAAGCGAATTGCACGAACGGCCTCCCTGTCAGACGTGGAAGAAGGTGGCGATGTAGTCGAGGAAGAAGAACGGCGGGTGCGACCTGTGAGCTTAAGCTTTCCGTCTCCGGCAGTGGAGGTTGCAGCGGAGGAACAGCAGCGAAGGAAACGGGTGGAGCGAGAAGATTTGGGAACGATGGCAATGATAGTGGAAGAGGAGGAAGCTAAGCGAATGAGGTGCTTGAAGAAAGGCTGTGAGCATATCGTGCCGTATGGCTTCAAGGCCTCCATTAGCCTAACTCTCGACCCTCACAGTTCCTCTTCCTCTTCCGCTGTCAGTAAAACATAAACTCAAATGACACCTTATCCACTCCCACAGCATGTGCCGGTTGCCGCCACCTCCAATATCAGTGATTCAGTGATATACCGTGGAGCCTATCACTATCCACGAATTATACCATCTTCTTAAAAAAATTTAAATATTGTTATATACTTCAATTATTATTATTGATCTTAAGAAGTTATAATTATGGTTTTATTTTATAGACAAATCCAATCAAAGTTGAAGAATTAGAATCTAAACCTCCCGTATGAAAAATCATCTTTCAATATCAAAACGCAATTGAGAAAAGATGTCAAAGCAAAACAGATAAATATGACAAAAGTAAAAAAGCACCAACAGTCTTTCCAACAAGATTAATGTCATCTATGAAAACTTGAATATCAAGATGAGTTCACAACATTGATGGATTGTTGACTTTAAATACATCGAATTCAACCACCAACAAAGGTTAAGATCATCCCGTTTTTTTAGATTAACTTATTTTAACCCCTACGTAACAGCTTCAGCTTCTATAAGGATAACTCTCCTATTTAGGATTGGGTTTAAAAGTACAAATTTTGAAGGGTAACTATAATTGGCAGTAAATTTATGAATAATAATTTTATAAAATTTTCAAATATAGCAAAGTTTATCGGTGATAGACTCCTATGATTTATTAGTGATAGACCGATATTTACAACATGATCTATCAATGATAGACTTTGACACTTTTATATATTTGCAATATTTTATAAATATTGTTATGTACTTAATTATTTGAATCTAATTATTATATTTGCCGCTATCGCAATTTTAAAATTCAGGAACCAAATTGAAACAAAACTCAAAAGTTAAAATCTAAACGTGTAACGTTTGAAATCTATAAACTTAAAGTCAAACTCAAGACTTTTAAAGTAGAAATGAAACATTTTGAAATTTAAAAACTAAATAGAAATTAAATCCATAATATTGGAACAAAAATATATTTTCATCAATAATTTATTAGCTTCGTGTAAGGAGAGTTAAAATAAATCAATTTTGAATTTTAATACATATAAAATAAAGATATACTCTATTTTGGTCCCTAAACTTTGTGTTTTGTTTTATTTTGGTTTCAAAACTTTAAAAACTCTGAACTTTTAAAAGT
GCATATTTTGGTCATGCTATTAAGATACTGTTAACTCGATGAGGTGGGTTCTATATTTTAATGACTGAGTTGACAAATAATATAATCAATTTTAATAATTTAAAGTTTTAATTTTTAATTAAAGGCTAGATAGAAATTTTATGTTTTATCGAAGAACTCAAAAAAGTAATTGTCTTCAATATTTTGATTGTCGCCATTTGCGCATTTTCATTCTACTTTACTTTCTCAATTCATATGCTAGAATAATTGTCTTCATTTTTTTTTAGGTTTCATAATTCATCGTGAGAATATTTATTTGCAATTTGTAGAGCTTATTTTATAGGTGTGTTGTACCTATTTTATTTTTCTAGTTTATTTTACGTTGTCATGGATGTTAACACCAATTGTAATTTTATTTTATTTTGCTTAGACTCAAAGTCAGTAGATGGGATTGAGAAGTCTATCACATAACCAACTGATAAAGCACCGATTTTTAATCAAAGAGGGGCACACATGTTTTATATTCAAACACTACAAACATTTATTTTTATATTTTATAAAAACACTATATATACTTATAACGACACGAGGTGAGGGGGCACGTGCCTCCCTTAATCCTTCATAGATACGTGTCTCCCTTAACCCTTCATAGATACGTCCTTGATATTAATGTGAGACATCATGCAATCTGTGAAACCCCAAACTTAAGCTTCCAAACTAAAGGAGGAAAGAAAAGCAAGACTTTTAGGAATTTTAGGATGGAATTAAGAAAGGAATTCTTCTTTGGAACTTAAGTGACATGATAAGAGCGTCATGGTTGTGATGACAAGGTTTGATTGTCATCGTCCCCATAATGAAAGAAATTAGCCTAAGTGTTGTGGAAGTCGTAGAGGTTAGCAAGTGGGCAAAATGGCCAAGCTAGGCGTTGAGAAGTGTCAAAATAACCTCCAAAGACAAGGGGTATGCCGCCAAAAGTATACTAGCCGTTTTGAGCTAGGCGGTCAACCATGTCCAAATAACCTCTAAGGGCAAGACATATTATGCAGCCAAGTTAGGCGTTTTGGGTGTTGTGTGTGTGGCCAAATAGTTAAGAAAAACCTTTAAGGAAAAGAGGTTAGGCGGCAAAAGGGTGTCCTAGGCGATAAAGGATTAGGCTATGTGTTTGAGAGTGTCAAAAACCTCTAAAGAGGTTAGCATGTGGCCACAAAGTCCTCCTATGCGAGTTGGCTATGCTATGCTGCCAAAACAAGGTAGGCGAAGAGAGGTTAGGCGACAACCAATGTCCAAGAAAAACCTTTAAAGACAAGGTTGGGCGACCATGAGTAACCAAGGCGTTTTAGGCAATAGATGAGTGATACAAGTGGCATGACATGCAAGACACCTAGGTTTTGCCAAGTAGGAGGTGTAAAGTGATTAGGATGAGGTGGAGGGTGACAAGTAGGATGTGTAAAAGGCAAGAGAAATGTGTCACCATGGGTCATGCAAAGACCCTTATATAAGGAGAACTTTGGATGATGAATGCTCACAACCCAATTGTGCCATTTGGGTAATTCCACTTGGAAAACTTCACAAATTGAGGTTAGGGCTACCAAGCAAAAGGGACGCGAGGAAAGCATGCGTTGAAAAGAAGAAAGTCACAAAGGTCATTTTCGAAGGAAATCTGAACAACGCCAAGTGTTTTGAAGCTACATAACTCAATACTAGGAATCTCTTGCTGAAATTTTGAGGTGAGATAGTTAAAACATTTTAAAGAAAATTTGTAGAAGGAAGTTTCTTAATTTAAATTTTATATTTTAAGTTATTAAGCTCCAAAGTTGAGTAAATATTCTGGATGAAAACTCAATTTTTGGGCAAGGTCCAAGAGTGAGCCAAGGGTTAGGATGACAAGGTAAGTTGGACGTGTTAATTAGGCAAGGGAGGTTTTATGGACCTTGAAGCATGTGAGGGACATGTTTAGCACATGGGCAGGCGACATGAGGCGCGTGCCTAGGCCATGTGAGCAGTCATGCACACGAATTAGG
GGGTCATGCGCCAAATATACAACACGTGGCGTACGGTCTGGAGCGCACGCGGCGTGCATGAGCATGAAAGGTTAGCTCGTGGCCAGGCCCAGGGTCATGCACACACACGTGCAGCCTGCCGCCTATGCCCTGCATGTCAGCCCACCCATATGCCATTTTTGGGCATTTTCTAATGTTTTAGCAATTTTTGCTTAGAGTTTGGGTGTTTTATGAATTTTTGGGACTTAATGGATTATAAATACCCTATTTTCAAGATAAATGAAATTTCTCGAGCCAAGGAGTAAACATAAAGTTTGTAAGTAATTTTCATTTAAATGTTTTACTTAAATGTTTATGATTTCTACGTTGAATTTCAGATTATATAGGCAATAGCTAGTCAGTGAAATATGCATTTAAATAAGTTTCAAGTAATTCTAAGCATTGGTTATTAATCATGATTTCAAAGTTGTTGATAAACATGTTTTGAACTACTGAAACGATCTACGAGTTAAGCATAATGTTTTAAGGTTAAATGTTTTCATCAAACTATATGAGAACTATACTGAGCTACCTCGGAATGTGCACATATGATTCCCAACTACTTATGTGAAGGCTTTATAGAGGGGCCTTTCAGAGAAGACATTAATGCCTATAGGGGAAGGTTCAAGTTATACTGGCCTATGTTATGTTATGGATGTAAAGTTAAAAGCTGATATGTAATAGCTTAATGCACATTCAAATGGAAAATGATTTATGTAATGATTCTAAGCTTTGCTATGTTAAGTTCACAATGATTTAAGTGTTTGAAACAGAGCATTTGACATAACTATTTGGAAAAGTATGTTTTAGAAAACTATTACTCACCGAGCATTTTTAGCTTATTCTTTCAAATGTTTTCTCACTTTTCAGGTAATGATCGTGTACTCCTGAGCTAAGCTGAGCTTATTGCTGTCACGCCAACCAGTCTGATAGAAGGAAACATGTTAAGATAGTACACTAGAGTTTAGTATGTCATGTGGTGGATACAGAGAGTTGATTAATATTGAGTATCTAGTCTGAATAGAGTTTCTGTGAAGTTATGAAAATACTTTTGGCTGCACTTAGTATTTTATTTCTCTATTATGCTTCCGCTAGTTGTTAAGTATAAGGGTCAGCAATTATCGTTGAGAAGAGGATGATATTAGTTGACTTCACGCCGTCTCTCGGGACAAGAGTTAGGGGATCTGGAATCTTCTAAAGAAAAAAAGTTTGTACATTCTTGTATGCTCTTTATCTCTCTTTGCAGACAAAATTAAAATTACTAAATATATTTCTTTAAAATTTTA

mRNA sequence

ATGGAAAATAAGAATGAAAAGTGGGACAGCTCCCCATTATCTCTATTCTTCACACTTACTTCCCAAAATTGCCCTCCTAACCATTCCTCTTCCACTTCCTCTTCCTTCACTCTATGCTCTCGATCCCATTCCCATCCCAAACTCTCTCATCTACCTTTCTCCACTTCCTTTTCAACTTCTCATTCCCTTAAACCCTCACCTTTCATCCTCAGAGCATCCGATACAGAATCCAAAACCGACACTGATTCAGATGATCCTAACCAACAACCTTACGAAGAGTATGAAGTGGAGCTGCAGCAGCCTTATGGTTTGAAATTCGCCAAGGGCCGAGATGGTGGAACTTATATCGACGCCATTGCACCTGGTGGTTTTGCCGACAAGACTGGATTGTTCACTGTTGGTGATAAAGTTATTGCTACCAGTGCAGTATTTGGGACGGAAATATGGCCCGCTGCTGAATACGGAAGGACAATGTACACAATTCGGCAAAGAATTGGTCCATTACTCATGAAAATGCAGAAGAGATATGGAAAAACAGATAGTTTCGGTGAGTTAACGGAAAAGGAGATAATCAGAGCAGAAAGAAACTCGGGCGTAGTCAGTAATAAAGTGAGAGAAATTCAATTGCAAAATGCCTTGAGAATGAAGGAGCAGAAAGCACGCAGAGAAAACGACCTACGCCAAGGACTGCGTCTCTATAAGAATGCTAAATATGAGGAAGCATTAGAGAAATTCGAATCAGTGTTGGGGTCAAAACCAACTCCAGACGAAGCTTCGGTTGCTAGTTACAATGTTGCATGCTGTTATTCCCAACTAAATCAGTTAAAAGCTGGGCTATCAGCACTTGAAGACGCCTTGGAAGCAGGATTTGAAGACTTTAAGAGAGTTCGGACGGATCCTGATCTATCAAACCTAAGGACATTAGAGGAATTTGAGCCTCTTGTGAAAAGGTTTGACGAATCATTTATCAATGAGAATGCCATCAATGCCATCAAGTCGCTATTCGGCTTCAAAACATGAAAACCAGACTCTTCTACAGTAATGATCGTGTACTCCTGAGCTAAGCTGAGCTTATTGCTGTCACGCCAACCAGTCTGATAGAAGGAAACATGTTAAGATAGTACACTAGAGTTTAGTATGTCATGTGGTGGATACAGAGAGTTGATTAATATTGAGTATCTAGTCTGAATAGAGTTTCTGTGAAGTTATGAAAATACTTTTGGCTGCACTTAGTATTTTATTTCTCTATTATGCTTCCGCTAGTTGTTAAGTATAAGGGTCAGCAATTATCGTTGAGAAGAGGATGATATTAGTTGACTTCACGCCGTCTCTCGGGACAAGAGTTAGGGGATCTGGAATCTTCTAAAGAAAAAAAGTTTGTACATTCTTGTATGCTCTTTATCTCTCTTTGCAGACAAAATTAAAATTACTAAATATATTTCTTTAAAATTTTA

Coding sequence (CDS)

ATGGAAAATAAGAATGAAAAGTGGGACAGCTCCCCATTATCTCTATTCTTCACACTTACTTCCCAAAATTGCCCTCCTAACCATTCCTCTTCCACTTCCTCTTCCTTCACTCTATGCTCTCGATCCCATTCCCATCCCAAACTCTCTCATCTACCTTTCTCCACTTCCTTTTCAACTTCTCATTCCCTTAAACCCTCACCTTTCATCCTCAGAGCATCCGATACAGAATCCAAAACCGACACTGATTCAGATGATCCTAACCAACAACCTTACGAAGAGTATGAAGTGGAGCTGCAGCAGCCTTATGGTTTGAAATTCGCCAAGGGCCGAGATGGTGGAACTTATATCGACGCCATTGCACCTGGTGGTTTTGCCGACAAGACTGGATTGTTCACTGTTGGTGATAAAGTTATTGCTACCAGTGCAGTATTTGGGACGGAAATATGGCCCGCTGCTGAATACGGAAGGACAATGTACACAATTCGGCAAAGAATTGGTCCATTACTCATGAAAATGCAGAAGAGATATGGAAAAACAGATAGTTTCGGTGAGTTAACGGAAAAGGAGATAATCAGAGCAGAAAGAAACTCGGGCGTAGTCAGTAATAAAGTGAGAGAAATTCAATTGCAAAATGCCTTGAGAATGAAGGAGCAGAAAGCACGCAGAGAAAACGACCTACGCCAAGGACTGCGTCTCTATAAGAATGCTAAATATGAGGAAGCATTAGAGAAATTCGAATCAGTGTTGGGGTCAAAACCAACTCCAGACGAAGCTTCGGTTGCTAGTTACAATGTTGCATGCTGTTATTCCCAACTAAATCAGTTAAAAGCTGGGCTATCAGCACTTGAAGACGCCTTGGAAGCAGGATTTGAAGACTTTAAGAGAGTTCGGACGGATCCTGATCTATCAAACCTAAGGACATTAGAGGAATTTGAGCCTCTTGTGAAAAGGTTTGACGAATCATTTATCAATGAGAATGCCATCAATGCCATCAAGTCGCTATTCGGCTTCAAAACATGA

Protein sequence

MENKNEKWDSSPLSLFFTLTSQNCPPNHSSSTSSSFTLCSRSHSHPKLSHLPFSTSFSTSHSLKPSPFILRASDTESKTDTDSDDPNQQPYEEYEVELQQPYGLKFAKGRDGGTYIDAIAPGGFADKTGLFTVGDKVIATSAVFGTEIWPAAEYGRTMYTIRQRIGPLLMKMQKRYGKTDSFGELTEKEIIRAERNSGVVSNKVREIQLQNALRMKEQKARRENDLRQGLRLYKNAKYEEALEKFESVLGSKPTPDEASVASYNVACCYSQLNQLKAGLSALEDALEAGFEDFKRVRTDPDLSNLRTLEEFEPLVKRFDESFINENAINAIKSLFGFKT
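
The protein sequence above corresponds to the CDS read codon by codon. As an illustration, here is a minimal Python sketch of that translation. It assumes the standard genetic code (the page does not name the table used), and the helper `translate` is a hypothetical name, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest). '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons of the CDS listed above.
print(translate("ATGGAAAATAAGAATGAAAAGTGGGACAGC"))
# Last four codons of the CDS, ending in the TGA stop.
print(translate("TTCAAAACATGA"))
```

Pasting the full CDS string into `translate` should reproduce the protein sequence if the standard-code assumption holds for this annotation.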
Homology
BLAST of PI0016731 vs. ExPASy Swiss-Prot
Match: Q94BS2 (Protein MET1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=MET1 PE=1 SV=1)

HSP 1 Score: 401.7 bits (1031), Expect = 8.0e-111
Identity = 214/319 (67.08%), Positives = 255/319 (79.94%), Query Frame = 0

Query: 21  SQNCPPNHSSSTSSSFTLCSRSHSHPKLSHLPFSTSFSTSHSLKPSPFILRASDTES--K 80
           +Q    N S  T SSF       S   L     S S   +H  K     L+AS+TES  K
Sbjct: 20  TQQTKQNPSLITQSSFI------SAKSLFLSSNSASLCNTHVAKRRNLALKASETESSAK 79

Query: 81  TDTDSDDPNQQPYEEYEVELQQPYGLKFAKGRDGGTYIDAIAPGGFADKTGLFTVGDKVI 140
            +   D   ++ YE YE+E++QPYGLKF KGRDGGTYIDAI PGG ADKTG FTVGD+VI
Sbjct: 80  AEAGGDGEEEEKYETYEIEVEQPYGLKFRKGRDGGTYIDAILPGGSADKTGKFTVGDRVI 139

Query: 141 ATSAVFGTEIWPAAEYGRTMYTIRQRIGPLLMKMQKRYGKTDSFGELTEKEIIRAERNSG 200
           ATSAVFGTEIWPAAEYGRTMYTIRQRIGPLLM+M+KR GK +  GELTEKEIIRAERN+G
Sbjct: 140 ATSAVFGTEIWPAAEYGRTMYTIRQRIGPLLMQMEKRNGKAEDTGELTEKEIIRAERNAG 199

Query: 201 VVSNKVREIQLQNALRMKEQKARRENDLRQGLRLYKNAKYEEALEKFESVLGSKPTPDEA 260
            +S+++REIQ+QN L+ KEQKA+RE DLR+GL+  KN KYEEALE+FESVLGSKPTP+EA
Sbjct: 200 YISSRLREIQMQNYLKKKEQKAQREKDLREGLQFSKNGKYEEALERFESVLGSKPTPEEA 259

Query: 261 SVASYNVACCYSQLNQLKAGLSALEDALEAGFEDFKRVRTDPDLSNLRTLEEFEPLVKRF 320
           SVASYNVACCYS+LNQ++AGLSALE+AL++G+EDFKR+R+DPDL  LR  ++F+PL+K+F
Sbjct: 260 SVASYNVACCYSKLNQVQAGLSALEEALKSGYEDFKRIRSDPDLETLRKSKDFDPLLKQF 319

Query: 321 DESFINENAINAIKSLFGF 338
           DESFINE+AINAIKSLFGF
Sbjct: 320 DESFINESAINAIKSLFGF 332
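
Each HSP summary reports identities and positives as a count over the alignment length, followed by the percentage. The sketch below shows how such a line could be parsed and the percentages recomputed in Python. The function name `parse_hsp_stats` is hypothetical, and the pattern accepts both spellings of the positives label because the label varies in exported output.

```python
import re

def parse_hsp_stats(line):
    """Extract identity/positive counts and recompute their percentages."""
    pattern = (
        r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
        r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
    )
    m = re.search(pattern, line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, length, _, pos, pos_length, _ = m.groups()
    ident, length, pos, pos_length = map(int, (ident, length, pos, pos_length))
    return {
        "identities": ident,
        "positives": pos,
        "alignment_length": length,
        # Percentage = count / alignment length, rounded to two decimals.
        "identity_pct": round(100 * ident / length, 2),
        "positive_pct": round(100 * pos / pos_length, 2),
    }

stats = parse_hsp_stats(
    "Identity = 214/319 (67.08%), Positives = 255/319 (79.94%), Query Frame = 0"
)
print(stats)
```

For the Swiss-Prot hit above, the recomputed values agree with the reported 67.08% identity and 79.94% positives.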

BLAST of PI0016731 vs. ExPASy TrEMBL
Match: A0A0A0KWR6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G268040 PE=4 SV=1)

HSP 1 Score: 603.2 bits (1554), Expect = 6.7e-169
Identity = 309/316 (97.78%), Positives = 314/316 (99.37%), Query Frame = 0

Query: 24  CPPNHSSSTSSSFTLCSRSHSHPKLSHLPFSTSFSTSHSLKPSPFILRASDTESKTDTDS 83
           CPPNHSSS+SSSF LCS+SHSHPKLSHLPFSTSFSTSHSLK SPFILRASDTESKTDTDS
Sbjct: 22  CPPNHSSSSSSSFILCSQSHSHPKLSHLPFSTSFSTSHSLKHSPFILRASDTESKTDTDS 81

Query: 84  DDPNQQPYEEYEVELQQPYGLKFAKGRDGGTYIDAIAPGGFADKTGLFTVGDKVIATSAV 143
           DDPNQQPYEEYEVEL+QPYGLKFAKGRDGGTYIDAIAPGGFADKTGLFTVGDKV+ATSAV
Sbjct: 82  DDPNQQPYEEYEVELEQPYGLKFAKGRDGGTYIDAIAPGGFADKTGLFTVGDKVLATSAV 141

Query: 144 FGTEIWPAAEYGRTMYTIRQRIGPLLMKMQKRYGKTDSFGELTEKEIIRAERNSGVVSNK 203
           FGTEIWPAAEYGRTMYTIRQRIGPLLMKMQKRYGKTDSFGELTEKEIIRAERNSGVVSNK
Sbjct: 142 FGTEIWPAAEYGRTMYTIRQRIGPLLMKMQKRYGKTDSFGELTEKEIIRAERNSGVVSNK 201

Query: 204 VREIQLQNALRMKEQKARRENDLRQGLRLYKNAKYEEALEKFESVLGSKPTPDEASVASY 263
           VREIQLQNALRMKEQKARRENDLRQGLRLYKNAKYEEALEKFESVLGSKPTPDEASVASY
Sbjct: 202 VREIQLQNALRMKEQKARRENDLRQGLRLYKNAKYEEALEKFESVLGSKPTPDEASVASY 261

Query: 264 NVACCYSQLNQLKAGLSALEDALEAGFEDFKRVRTDPDLSNLRTLEEFEPLVKRFDESFI 323
           NVACCYSQLNQLKAGLSALEDALEAGFEDFKRVRTDPDLSN+RTLEEFEPLVKRFDESFI
Sbjct: 262 NVACCYSQLNQLKAGLSALEDALEAGFEDFKRVRTDPDLSNIRTLEEFEPLVKRFDESFI 321

Query: 324 NENAINAIKSLFGFKT 340
           NENAINAIKSLFGFKT
Sbjct: 322 NENAINAIKSLFGFKT 337

BLAST of PI0016731 vs. ExPASy TrEMBL
Match: A0A5D3DEH5 (Putative tyrosine phosphatase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold859G001110 PE=4 SV=1)

HSP 1 Score: 589.0 bits (1517), Expect = 1.3e-164
Identity = 305/316 (96.52%), Positives = 309/316 (97.78%), Query Frame = 0

Query: 24  CPPNHSSSTSSSFTLCSRSHSHPKLSHLPFSTSFSTSHSLKPSPFILRASDTESKTDTDS 83
           CPPNHSS  SSSF LCS+SHSHPKLSH PFSTSFSTSHSLK SPFILRASDTESKTDTDS
Sbjct: 22  CPPNHSS--SSSFILCSQSHSHPKLSHPPFSTSFSTSHSLKHSPFILRASDTESKTDTDS 81

Query: 84  DDPNQQPYEEYEVELQQPYGLKFAKGRDGGTYIDAIAPGGFADKTGLFTVGDKVIATSAV 143
           DDPNQQPYEEYEVEL+QPYGLKFAKGRDGGTYIDAIAPGGFADKTGLFTVGD+V+ATSAV
Sbjct: 82  DDPNQQPYEEYEVELEQPYGLKFAKGRDGGTYIDAIAPGGFADKTGLFTVGDRVLATSAV 141

Query: 144 FGTEIWPAAEYGRTMYTIRQRIGPLLMKMQKRYGKTDSFGELTEKEIIRAERNSGVVSNK 203
           FGTEIWPAAEYGRTMYTIRQRIGPLLMKMQKRYGKTDSFGELTEKEIIRAERNSGVVSNK
Sbjct: 142 FGTEIWPAAEYGRTMYTIRQRIGPLLMKMQKRYGKTDSFGELTEKEIIRAERNSGVVSNK 201

Query: 204 VREIQLQNALRMKEQKARRENDLRQGLRLYKNAKYEEALEKFESVLGSKPTPDEASVASY 263
           VREIQLQNALRMKEQKARRENDLRQGLRLYKNAKYEEALEKFESVLGSKPTPDEASVASY
Sbjct: 202 VREIQLQNALRMKEQKARRENDLRQGLRLYKNAKYEEALEKFESVLGSKPTPDEASVASY 261

Query: 264 NVACCYSQLNQLKAGLSALEDALEAGFEDFKRVRTDPDLSNLRTLEEFEPLVKRFDESFI 323
           NVACCYSQLNQLKAGLSALEDAL AGFEDFKRVRTDPDLSNLR LEEFEPLVKRFDESFI
Sbjct: 262 NVACCYSQLNQLKAGLSALEDALGAGFEDFKRVRTDPDLSNLRALEEFEPLVKRFDESFI 321

Query: 324 NENAINAIKSLFGFKT 340
           NENAINAIKSLFGFKT
Sbjct: 322 NENAINAIKSLFGFKT 335

BLAST of PI0016731 vs. ExPASy TrEMBL
Match: A0A1S3BMG9 (uncharacterized protein LOC103491636 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103491636 PE=4 SV=1)

HSP 1 Score: 589.0 bits (1517), Expect = 1.3e-164
Identity = 305/316 (96.52%), Positives = 309/316 (97.78%), Query Frame = 0

Query: 24  CPPNHSSSTSSSFTLCSRSHSHPKLSHLPFSTSFSTSHSLKPSPFILRASDTESKTDTDS 83
           CPPNHSS  SSSF LCS+SHSHPKLSH PFSTSFSTSHSLK SPFILRASDTESKTDTDS
Sbjct: 22  CPPNHSS--SSSFILCSQSHSHPKLSHPPFSTSFSTSHSLKHSPFILRASDTESKTDTDS 81

Query: 84  DDPNQQPYEEYEVELQQPYGLKFAKGRDGGTYIDAIAPGGFADKTGLFTVGDKVIATSAV 143
           DDPNQQPYEEYEVEL+QPYGLKFAKGRDGGTYIDAIAPGGFADKTGLFTVGD+V+ATSAV
Sbjct: 82  DDPNQQPYEEYEVELEQPYGLKFAKGRDGGTYIDAIAPGGFADKTGLFTVGDRVLATSAV 141

Query: 144 FGTEIWPAAEYGRTMYTIRQRIGPLLMKMQKRYGKTDSFGELTEKEIIRAERNSGVVSNK 203
           FGTEIWPAAEYGRTMYTIRQRIGPLLMKMQKRYGKTDSFGELTEKEIIRAERNSGVVSNK
Sbjct: 142 FGTEIWPAAEYGRTMYTIRQRIGPLLMKMQKRYGKTDSFGELTEKEIIRAERNSGVVSNK 201

Query: 204 VREIQLQNALRMKEQKARRENDLRQGLRLYKNAKYEEALEKFESVLGSKPTPDEASVASY 263
           VREIQLQNALRMKEQKARRENDLRQGLRLYKNAKYEEALEKFESVLGSKPTPDEASVASY
Sbjct: 202 VREIQLQNALRMKEQKARRENDLRQGLRLYKNAKYEEALEKFESVLGSKPTPDEASVASY 261

Query: 264 NVACCYSQLNQLKAGLSALEDALEAGFEDFKRVRTDPDLSNLRTLEEFEPLVKRFDESFI 323
           NVACCYSQLNQLKAGLSALEDAL AGFEDFKRVRTDPDLSNLR LEEFEPLVKRFDESFI
Sbjct: 262 NVACCYSQLNQLKAGLSALEDALGAGFEDFKRVRTDPDLSNLRALEEFEPLVKRFDESFI 321

Query: 324 NENAINAIKSLFGFKT 340
           NENAINAIKSLFGFKT
Sbjct: 322 NENAINAIKSLFGFKT 335

BLAST of PI0016731 vs. ExPASy TrEMBL
Match: A0A6J1K1D9 (protein MET1, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111491529 PE=4 SV=1)

HSP 1 Score: 501.5 bits (1290), Expect = 2.7e-138
Identity = 270/338 (79.88%), Positives = 291/338 (86.09%), Query Frame = 0

Query: 12  PLSLFFTLTSQNCPPNHSSST-----SSSFTLCSRSHSHPKL------SHLPFSTSFSTS 71
           P +L+   +S +   + SSS       SSF LC    SHPKL       HL F+TSFS+ 
Sbjct: 6   PTTLYLCSSSSSSSSSSSSSQPLPKFCSSFPLCC---SHPKLFLSNSHFHLSFATSFSSL 65

Query: 72  HSLKPSPFILRASDTESKTDTDSDDPNQQPYEEYEVELQQPYGLKFAKGRDGGTYIDAIA 131
           HSLKPS F++RASD +S  D        QPYEEYEVEL+QPYGLKF KGRDGGTYIDAIA
Sbjct: 66  HSLKPSNFVVRASDADSVAD--------QPYEEYEVELEQPYGLKFVKGRDGGTYIDAIA 125

Query: 132 PGGFADKTGLFTVGDKVIATSAVFGTEIWPAAEYGRTMYTIRQRIGPLLMKMQKRYGKTD 191
           PGG ADKTGLFTVGDKVIATSAVFGTEIWPAAEYGRTMYTIRQRIGPLLMKM+KRYGKTD
Sbjct: 126 PGGAADKTGLFTVGDKVIATSAVFGTEIWPAAEYGRTMYTIRQRIGPLLMKMEKRYGKTD 185

Query: 192 SFGELTEKEIIRAERNSGVVSNKVREIQLQNALRMKEQKARRENDLRQGLRLYKNAKYEE 251
            +GELTEKE+IRAERNSGVVSNKVREIQ+QNA+R+KEQKARRE+DLRQGLRLYKNAKYEE
Sbjct: 186 GYGELTEKELIRAERNSGVVSNKVREIQMQNAMRLKEQKARREDDLRQGLRLYKNAKYEE 245

Query: 252 ALEKFESVLGSKPTPDEASVASYNVACCYSQLNQLKAGLSALEDALEAGFEDFKRVRTDP 311
           ALEKFESVLGSKPTPDEASVASYNVACCYSQLN+LKAGLSALEDAL AGFEDFKR+RTDP
Sbjct: 246 ALEKFESVLGSKPTPDEASVASYNVACCYSQLNELKAGLSALEDALAAGFEDFKRIRTDP 305

Query: 312 DLSNLRTLEEFEPLVKRFDESFINENAINAIKSLFGFK 339
           DLSNLRTLEEFE L+KRFDESFINENAINAIKSLFGFK
Sbjct: 306 DLSNLRTLEEFETLLKRFDESFINENAINAIKSLFGFK 332
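
The percentages on each HSP statistics line follow directly from the residue counts (for the alignment above, 270 identical and 291 positive positions over an alignment length of 338). A minimal sketch of that arithmetic in Python is given below; the function name `parse_hsp_stats` and the returned dictionary keys are hypothetical and are not part of the database or of BLAST itself. The pattern accepts both the "Positives" and "Postives" spellings of the label.

```python
import re

# Pattern for an HSP statistics line as printed on this page.
# "Posi?tives" tolerates the "Postives" spelling seen in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Recompute identity and positive percentages from the raw counts.

    Hypothetical helper for illustration only.
    """
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    identities, length, positives, positives_length = map(int, match.groups())
    if length != positives_length:
        raise ValueError("identity and positive counts use different lengths")
    return {
        "identities": identities,
        "positives": positives,
        "length": length,
        # Percent = matching positions / alignment length (gaps included).
        "identity_pct": round(100 * identities / length, 2),
        "positive_pct": round(100 * positives / length, 2),
    }


stats = parse_hsp_stats(
    "Identity = 270/338 (79.88%), Positives = 291/338 (86.09%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positive_pct"])
```

Because the denominator is the alignment length rather than the query length, two hits with similar identity counts can report different percentages when one alignment is shorter or contains more gaps.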

BLAST of PI0016731 vs. ExPASy TrEMBL
Match: A0A6J1H5N2 (protein MET1, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111459797 PE=4 SV=1)

HSP 1 Score: 498.0 bits (1281), Expect = 3.0e-137
Identity = 263/311 (84.57%), Positives = 279/311 (89.71%), Query Frame = 0

Query: 34  SSFTLCSRSHSHPKL------SHLPFSTSFSTSHSLKPSPFILRASDTESKTDTDSDDPN 93
           SSF LC    SHPKL       HL F+TSFS+ HSLKPS F++RASD +S  D       
Sbjct: 30  SSFPLCC---SHPKLFLSNSHFHLSFATSFSSLHSLKPSNFVVRASDADSVAD------- 89

Query: 94  QQPYEEYEVELQQPYGLKFAKGRDGGTYIDAIAPGGFADKTGLFTVGDKVIATSAVFGTE 153
            QPYEEYEVEL+QPYGLKF KGRDGGTYIDAIAPGG ADKTGLFTVGDKVIATSAVFGTE
Sbjct: 90  -QPYEEYEVELEQPYGLKFVKGRDGGTYIDAIAPGGAADKTGLFTVGDKVIATSAVFGTE 149

Query: 154 IWPAAEYGRTMYTIRQRIGPLLMKMQKRYGKTDSFGELTEKEIIRAERNSGVVSNKVREI 213
           IWPAAEYGRTMYTIRQRIGPLLMKM+KRYGKTD +GELTEKE+IRAERNSGVVSNKVREI
Sbjct: 150 IWPAAEYGRTMYTIRQRIGPLLMKMEKRYGKTDGYGELTEKELIRAERNSGVVSNKVREI 209

Query: 214 QLQNALRMKEQKARRENDLRQGLRLYKNAKYEEALEKFESVLGSKPTPDEASVASYNVAC 273
           Q+QNA+R+KEQKARRE+DLRQGLRLYKNAKYEEALEKFESVLGSKPTPDEASVASYNVAC
Sbjct: 210 QMQNAMRLKEQKARREDDLRQGLRLYKNAKYEEALEKFESVLGSKPTPDEASVASYNVAC 269

Query: 274 CYSQLNQLKAGLSALEDALEAGFEDFKRVRTDPDLSNLRTLEEFEPLVKRFDESFINENA 333
           CYSQLN+LKAGLSALEDAL AGFEDFKR+RTD DLSNLRTLEEFE L+KRFDESFINENA
Sbjct: 270 CYSQLNELKAGLSALEDALAAGFEDFKRIRTDSDLSNLRTLEEFETLLKRFDESFINENA 329

Query: 334 INAIKSLFGFK 339
           INAIKSLFGFK
Sbjct: 330 INAIKSLFGFK 329

BLAST of PI0016731 vs. NCBI nr
Match: XP_004149641.1 (protein MET1, chloroplastic [Cucumis sativus] >KGN54020.1 hypothetical protein Csa_021662 [Cucumis sativus])

HSP 1 Score: 603.2 bits (1554), Expect = 1.4e-168
Identity = 309/316 (97.78%), Positives = 314/316 (99.37%), Query Frame = 0

Query: 24  CPPNHSSSTSSSFTLCSRSHSHPKLSHLPFSTSFSTSHSLKPSPFILRASDTESKTDTDS 83
           CPPNHSSS+SSSF LCS+SHSHPKLSHLPFSTSFSTSHSLK SPFILRASDTESKTDTDS
Sbjct: 22  CPPNHSSSSSSSFILCSQSHSHPKLSHLPFSTSFSTSHSLKHSPFILRASDTESKTDTDS 81

Query: 84  DDPNQQPYEEYEVELQQPYGLKFAKGRDGGTYIDAIAPGGFADKTGLFTVGDKVIATSAV 143
           DDPNQQPYEEYEVEL+QPYGLKFAKGRDGGTYIDAIAPGGFADKTGLFTVGDKV+ATSAV
Sbjct: 82  DDPNQQPYEEYEVELEQPYGLKFAKGRDGGTYIDAIAPGGFADKTGLFTVGDKVLATSAV 141

Query: 144 FGTEIWPAAEYGRTMYTIRQRIGPLLMKMQKRYGKTDSFGELTEKEIIRAERNSGVVSNK 203
           FGTEIWPAAEYGRTMYTIRQRIGPLLMKMQKRYGKTDSFGELTEKEIIRAERNSGVVSNK
Sbjct: 142 FGTEIWPAAEYGRTMYTIRQRIGPLLMKMQKRYGKTDSFGELTEKEIIRAERNSGVVSNK 201

Query: 204 VREIQLQNALRMKEQKARRENDLRQGLRLYKNAKYEEALEKFESVLGSKPTPDEASVASY 263
           VREIQLQNALRMKEQKARRENDLRQGLRLYKNAKYEEALEKFESVLGSKPTPDEASVASY
Sbjct: 202 VREIQLQNALRMKEQKARRENDLRQGLRLYKNAKYEEALEKFESVLGSKPTPDEASVASY 261

Query: 264 NVACCYSQLNQLKAGLSALEDALEAGFEDFKRVRTDPDLSNLRTLEEFEPLVKRFDESFI 323
           NVACCYSQLNQLKAGLSALEDALEAGFEDFKRVRTDPDLSN+RTLEEFEPLVKRFDESFI
Sbjct: 262 NVACCYSQLNQLKAGLSALEDALEAGFEDFKRVRTDPDLSNIRTLEEFEPLVKRFDESFI 321

Query: 324 NENAINAIKSLFGFKT 340
           NENAINAIKSLFGFKT
Sbjct: 322 NENAINAIKSLFGFKT 337

BLAST of PI0016731 vs. NCBI nr
Match: XP_008449894.1 (PREDICTED: uncharacterized protein LOC103491636 isoform X1 [Cucumis melo] >KAA0040116.1 putative tyrosine phosphatase [Cucumis melo var. makuwa] >TYK21739.1 putative tyrosine phosphatase [Cucumis melo var. makuwa])

HSP 1 Score: 589.0 bits (1517), Expect = 2.7e-164
Identity = 305/316 (96.52%), Positives = 309/316 (97.78%), Query Frame = 0

Query: 24  CPPNHSSSTSSSFTLCSRSHSHPKLSHLPFSTSFSTSHSLKPSPFILRASDTESKTDTDS 83
           CPPNHSS  SSSF LCS+SHSHPKLSH PFSTSFSTSHSLK SPFILRASDTESKTDTDS
Sbjct: 22  CPPNHSS--SSSFILCSQSHSHPKLSHPPFSTSFSTSHSLKHSPFILRASDTESKTDTDS 81

Query: 84  DDPNQQPYEEYEVELQQPYGLKFAKGRDGGTYIDAIAPGGFADKTGLFTVGDKVIATSAV 143
           DDPNQQPYEEYEVEL+QPYGLKFAKGRDGGTYIDAIAPGGFADKTGLFTVGD+V+ATSAV
Sbjct: 82  DDPNQQPYEEYEVELEQPYGLKFAKGRDGGTYIDAIAPGGFADKTGLFTVGDRVLATSAV 141

Query: 144 FGTEIWPAAEYGRTMYTIRQRIGPLLMKMQKRYGKTDSFGELTEKEIIRAERNSGVVSNK 203
           FGTEIWPAAEYGRTMYTIRQRIGPLLMKMQKRYGKTDSFGELTEKEIIRAERNSGVVSNK
Sbjct: 142 FGTEIWPAAEYGRTMYTIRQRIGPLLMKMQKRYGKTDSFGELTEKEIIRAERNSGVVSNK 201

Query: 204 VREIQLQNALRMKEQKARRENDLRQGLRLYKNAKYEEALEKFESVLGSKPTPDEASVASY 263
           VREIQLQNALRMKEQKARRENDLRQGLRLYKNAKYEEALEKFESVLGSKPTPDEASVASY
Sbjct: 202 VREIQLQNALRMKEQKARRENDLRQGLRLYKNAKYEEALEKFESVLGSKPTPDEASVASY 261

Query: 264 NVACCYSQLNQLKAGLSALEDALEAGFEDFKRVRTDPDLSNLRTLEEFEPLVKRFDESFI 323
           NVACCYSQLNQLKAGLSALEDAL AGFEDFKRVRTDPDLSNLR LEEFEPLVKRFDESFI
Sbjct: 262 NVACCYSQLNQLKAGLSALEDALGAGFEDFKRVRTDPDLSNLRALEEFEPLVKRFDESFI 321

Query: 324 NENAINAIKSLFGFKT 340
           NENAINAIKSLFGFKT
Sbjct: 322 NENAINAIKSLFGFKT 335

BLAST of PI0016731 vs. NCBI nr
Match: XP_038902944.1 (protein MET1, chloroplastic [Benincasa hispida])

HSP 1 Score: 545.8 bits (1405), Expect = 2.6e-151
Identity = 295/318 (92.77%), Positives = 301/318 (94.65%), Query Frame = 0

Query: 25  PPNHSSSTSSSFTLCSRSHSHPKLSHLPFSTSFSTSHSLKPSPFILRASDTESKTDT--- 84
           PPN+S   SSSFTL S+  SHPKL   PF  SFSTS SLKPSPFILRASDTESK+DT   
Sbjct: 22  PPNNS---SSSFTLYSQ--SHPKL--FPFHVSFSTSFSLKPSPFILRASDTESKSDTGLG 81

Query: 85  DSDDPNQ-QPYEEYEVELQQPYGLKFAKGRDGGTYIDAIAPGGFADKTGLFTVGDKVIAT 144
           DSDD +Q QPYEEYEVELQQPYGLKFAKGRDGGTYIDAIAPGGFADKTGLFTVGDKVIAT
Sbjct: 82  DSDDADQPQPYEEYEVELQQPYGLKFAKGRDGGTYIDAIAPGGFADKTGLFTVGDKVIAT 141

Query: 145 SAVFGTEIWPAAEYGRTMYTIRQRIGPLLMKMQKRYGKTDSFGELTEKEIIRAERNSGVV 204
           SAVFGTEIWPAAEYGRTMYTIRQRIGPLLMKMQK+YGKTDSFGELTEKEIIRAERNSGVV
Sbjct: 142 SAVFGTEIWPAAEYGRTMYTIRQRIGPLLMKMQKKYGKTDSFGELTEKEIIRAERNSGVV 201

Query: 205 SNKVREIQLQNALRMKEQKARRENDLRQGLRLYKNAKYEEALEKFESVLGSKPTPDEASV 264
           SNKVREIQLQNALR+KEQKARRENDLRQGLRLYKNAKYEEALEKFESVLGSKPTPDEASV
Sbjct: 202 SNKVREIQLQNALRLKEQKARRENDLRQGLRLYKNAKYEEALEKFESVLGSKPTPDEASV 261

Query: 265 ASYNVACCYSQLNQLKAGLSALEDALEAGFEDFKRVRTDPDLSNLRTLEEFEPLVKRFDE 324
           ASYNVACCYSQLNQLKAGLSALEDALEAGFEDFKRVRTDPDLSNLRTLEEFEPLVKRFDE
Sbjct: 262 ASYNVACCYSQLNQLKAGLSALEDALEAGFEDFKRVRTDPDLSNLRTLEEFEPLVKRFDE 321

Query: 325 SFINENAINAIKSLFGFK 339
           SFINENAINAIKSLFGFK
Sbjct: 322 SFINENAINAIKSLFGFK 332

BLAST of PI0016731 vs. NCBI nr
Match: XP_023532558.1 (protein MET1, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 503.1 bits (1294), Expect = 1.9e-138
Identity = 271/338 (80.18%), Positives = 291/338 (86.09%), Query Frame = 0

Query: 12  PLSLFFTLTSQNCPPNHSSST-----SSSFTLCSRSHSHPKL------SHLPFSTSFSTS 71
           P +L+   +S +   + SSS       SSF LC    SHPKL       HL F+TSFS+ 
Sbjct: 6   PTTLYLCSSSSSSSSSSSSSQPLPKFCSSFPLCC---SHPKLFLSNSHVHLSFATSFSSF 65

Query: 72  HSLKPSPFILRASDTESKTDTDSDDPNQQPYEEYEVELQQPYGLKFAKGRDGGTYIDAIA 131
           HSLKPS F++RASD +S  D        QPYEEYEVEL+QPYGLKF KGRDGGTYIDAIA
Sbjct: 66  HSLKPSNFVVRASDDDSVAD--------QPYEEYEVELEQPYGLKFVKGRDGGTYIDAIA 125

Query: 132 PGGFADKTGLFTVGDKVIATSAVFGTEIWPAAEYGRTMYTIRQRIGPLLMKMQKRYGKTD 191
           PGG ADKTGLFTVGDKVIATSAVFGTEIWPAAEYGRTMYTIRQRIGPLLMKM+KRYGKTD
Sbjct: 126 PGGAADKTGLFTVGDKVIATSAVFGTEIWPAAEYGRTMYTIRQRIGPLLMKMEKRYGKTD 185

Query: 192 SFGELTEKEIIRAERNSGVVSNKVREIQLQNALRMKEQKARRENDLRQGLRLYKNAKYEE 251
            +GELTEKE+IRAERNSGVVSNKVREIQ+QNA+R+KEQKARRENDLRQGLRLYKNAKYEE
Sbjct: 186 GYGELTEKELIRAERNSGVVSNKVREIQMQNAMRLKEQKARRENDLRQGLRLYKNAKYEE 245

Query: 252 ALEKFESVLGSKPTPDEASVASYNVACCYSQLNQLKAGLSALEDALEAGFEDFKRVRTDP 311
           ALEKFESVLGSKPTPDEASVASYNVACCYSQLN+LKAGLSALEDAL AGFEDFKR+RTDP
Sbjct: 246 ALEKFESVLGSKPTPDEASVASYNVACCYSQLNELKAGLSALEDALAAGFEDFKRIRTDP 305

Query: 312 DLSNLRTLEEFEPLVKRFDESFINENAINAIKSLFGFK 339
           DLSNLRTLEEFE L+KRFDESFINENAINAIKSLFGFK
Sbjct: 306 DLSNLRTLEEFETLLKRFDESFINENAINAIKSLFGFK 332

BLAST of PI0016731 vs. NCBI nr
Match: XP_022996247.1 (protein MET1, chloroplastic [Cucurbita maxima])

HSP 1 Score: 501.5 bits (1290), Expect = 5.6e-138
Identity = 270/338 (79.88%), Positives = 291/338 (86.09%), Query Frame = 0

Query: 12  PLSLFFTLTSQNCPPNHSSST-----SSSFTLCSRSHSHPKL------SHLPFSTSFSTS 71
           P +L+   +S +   + SSS       SSF LC    SHPKL       HL F+TSFS+ 
Sbjct: 6   PTTLYLCSSSSSSSSSSSSSQPLPKFCSSFPLCC---SHPKLFLSNSHFHLSFATSFSSL 65

Query: 72  HSLKPSPFILRASDTESKTDTDSDDPNQQPYEEYEVELQQPYGLKFAKGRDGGTYIDAIA 131
           HSLKPS F++RASD +S  D        QPYEEYEVEL+QPYGLKF KGRDGGTYIDAIA
Sbjct: 66  HSLKPSNFVVRASDADSVAD--------QPYEEYEVELEQPYGLKFVKGRDGGTYIDAIA 125

Query: 132 PGGFADKTGLFTVGDKVIATSAVFGTEIWPAAEYGRTMYTIRQRIGPLLMKMQKRYGKTD 191
           PGG ADKTGLFTVGDKVIATSAVFGTEIWPAAEYGRTMYTIRQRIGPLLMKM+KRYGKTD
Sbjct: 126 PGGAADKTGLFTVGDKVIATSAVFGTEIWPAAEYGRTMYTIRQRIGPLLMKMEKRYGKTD 185

Query: 192 SFGELTEKEIIRAERNSGVVSNKVREIQLQNALRMKEQKARRENDLRQGLRLYKNAKYEE 251
            +GELTEKE+IRAERNSGVVSNKVREIQ+QNA+R+KEQKARRE+DLRQGLRLYKNAKYEE
Sbjct: 186 GYGELTEKELIRAERNSGVVSNKVREIQMQNAMRLKEQKARREDDLRQGLRLYKNAKYEE 245

Query: 252 ALEKFESVLGSKPTPDEASVASYNVACCYSQLNQLKAGLSALEDALEAGFEDFKRVRTDP 311
           ALEKFESVLGSKPTPDEASVASYNVACCYSQLN+LKAGLSALEDAL AGFEDFKR+RTDP
Sbjct: 246 ALEKFESVLGSKPTPDEASVASYNVACCYSQLNELKAGLSALEDALAAGFEDFKRIRTDP 305

Query: 312 DLSNLRTLEEFEPLVKRFDESFINENAINAIKSLFGFK 339
           DLSNLRTLEEFE L+KRFDESFINENAINAIKSLFGFK
Sbjct: 306 DLSNLRTLEEFETLLKRFDESFINENAINAIKSLFGFK 332

BLAST of PI0016731 vs. TAIR 10
Match: AT1G55480.1 (protein containing PDZ domain, a K-box domain, and a TPR region)

HSP 1 Score: 401.7 bits (1031), Expect = 5.7e-112
Identity = 214/319 (67.08%), Positives = 255/319 (79.94%), Query Frame = 0

Query: 21  SQNCPPNHSSSTSSSFTLCSRSHSHPKLSHLPFSTSFSTSHSLKPSPFILRASDTES--K 80
           +Q    N S  T SSF       S   L     S S   +H  K     L+AS+TES  K
Sbjct: 20  TQQTKQNPSLITQSSFI------SAKSLFLSSNSASLCNTHVAKRRNLALKASETESSAK 79

Query: 81  TDTDSDDPNQQPYEEYEVELQQPYGLKFAKGRDGGTYIDAIAPGGFADKTGLFTVGDKVI 140
            +   D   ++ YE YE+E++QPYGLKF KGRDGGTYIDAI PGG ADKTG FTVGD+VI
Sbjct: 80  AEAGGDGEEEEKYETYEIEVEQPYGLKFRKGRDGGTYIDAILPGGSADKTGKFTVGDRVI 139

Query: 141 ATSAVFGTEIWPAAEYGRTMYTIRQRIGPLLMKMQKRYGKTDSFGELTEKEIIRAERNSG 200
           ATSAVFGTEIWPAAEYGRTMYTIRQRIGPLLM+M+KR GK +  GELTEKEIIRAERN+G
Sbjct: 140 ATSAVFGTEIWPAAEYGRTMYTIRQRIGPLLMQMEKRNGKAEDTGELTEKEIIRAERNAG 199

Query: 201 VVSNKVREIQLQNALRMKEQKARRENDLRQGLRLYKNAKYEEALEKFESVLGSKPTPDEA 260
            +S+++REIQ+QN L+ KEQKA+RE DLR+GL+  KN KYEEALE+FESVLGSKPTP+EA
Sbjct: 200 YISSRLREIQMQNYLKKKEQKAQREKDLREGLQFSKNGKYEEALERFESVLGSKPTPEEA 259

Query: 261 SVASYNVACCYSQLNQLKAGLSALEDALEAGFEDFKRVRTDPDLSNLRTLEEFEPLVKRF 320
           SVASYNVACCYS+LNQ++AGLSALE+AL++G+EDFKR+R+DPDL  LR  ++F+PL+K+F
Sbjct: 260 SVASYNVACCYSKLNQVQAGLSALEEALKSGYEDFKRIRSDPDLETLRKSKDFDPLLKQF 319

Query: 321 DESFINENAINAIKSLFGF 338
           DESFINE+AINAIKSLFGF
Sbjct: 320 DESFINESAINAIKSLFGF 332

BLAST of PI0016731 vs. TAIR 10
Match: AT5G17170.1 (rubredoxin family protein)

HSP 1 Score: 55.1 bits (131), Expect = 1.3e-07
Identity = 29/70 (41.43%), Positives = 42/70 (60.00%), Query Frame = 0

Query: 95  EVELQQPYGLKFAKGRDGGTYIDAIAPGGFADKTGLFTVGDKVIATSAVFGTEIWPAAEY 154
           EVE+ +P GL   + + GG  I  +  GG A K GL + GD+V+ TS+ FG E+WPA + 
Sbjct: 87  EVEVDKPLGLTLGQKQGGGVVITGVDGGGNAAKAGLKS-GDQVVYTSSFFGDELWPADKL 146

Query: 155 GRTMYTIRQR 165
           G T   I+ +
Sbjct: 147 GFTKTAIQAK 155

BLAST of PI0016731 vs. TAIR 10
Match: AT5G17170.2 (rubredoxin family protein)

HSP 1 Score: 55.1 bits (131), Expect = 1.3e-07
Identity = 29/70 (41.43%), Positives = 42/70 (60.00%), Query Frame = 0

Query: 95  EVELQQPYGLKFAKGRDGGTYIDAIAPGGFADKTGLFTVGDKVIATSAVFGTEIWPAAEY 154
           EVE+ +P GL   + + GG  I  +  GG A K GL + GD+V+ TS+ FG E+WPA + 
Sbjct: 87  EVEVDKPLGLTLGQKQGGGVVITGVDGGGNAAKAGLKS-GDQVVYTSSFFGDELWPADKL 146

Query: 155 GRTMYTIRQR 165
           G T   I+ +
Sbjct: 147 GFTKTAIQAK 155

BLAST of PI0016731 vs. TAIR 10
Match: AT3G01510.1 (like SEX4 1)

HSP 1 Score: 45.8 bits (107), Expect = 7.9e-05
Identity = 25/96 (26.04%), Positives = 47/96 (48.96%), Query Frame = 0

Query: 81  TDSDDPNQQPYEEYEVELQQPYGLKFAKGRDGGTYIDAIAPGGFADKTGLFTVGDKVIAT 140
           + S  P +    EY V L++P G++FA   DG  ++ AI  G  A+K  +  VGD +   
Sbjct: 64  SSSSTPFKMNLNEYMVTLEKPLGIRFALSADGKIFVHAIKKGSNAEKARIIMVGDTLKKA 123

Query: 141 SAVFGTEIWPAAEYGRTMYTIRQRIGPLLMKMQKRY 177
           S   G  +    ++G T   + ++ G   + +++ +
Sbjct: 124 SDSSGGTLVEIKDFGDTKKMLVEKTGSFSLVLERPF 159

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
Q94BS2 | 8.0e-111 | 67.08 | Protein MET1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=MET1 PE=1 SV=1

Match Name | E-value | Identity | Description
A0A0A0KWR6 | 6.7e-169 | 97.78 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G268040 PE=4 SV=1
A0A5D3DEH5 | 1.3e-164 | 96.52 | Putative tyrosine phosphatase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_sc...
A0A1S3BMG9 | 1.3e-164 | 96.52 | uncharacterized protein LOC103491636 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A6J1K1D9 | 2.7e-138 | 79.88 | protein MET1, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111491529 PE=4 SV=...
A0A6J1H5N2 | 3.0e-137 | 84.57 | protein MET1, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111459797 PE=4 S...

Match Name | E-value | Identity | Description
XP_004149641.1 | 1.4e-168 | 97.78 | protein MET1, chloroplastic [Cucumis sativus] >KGN54020.1 hypothetical protein C...
XP_008449894.1 | 2.7e-164 | 96.52 | PREDICTED: uncharacterized protein LOC103491636 isoform X1 [Cucumis melo] >KAA00...
XP_038902944.1 | 2.6e-151 | 92.77 | protein MET1, chloroplastic [Benincasa hispida]
XP_023532558.1 | 1.9e-138 | 80.18 | protein MET1, chloroplastic [Cucurbita pepo subsp. pepo]
XP_022996247.1 | 5.6e-138 | 79.88 | protein MET1, chloroplastic [Cucurbita maxima]

Match Name | E-value | Identity | Description
AT1G55480.1 | 5.7e-112 | 67.08 | protein containing PDZ domain, a K-box domain, and a TPR region
AT5G17170.1 | 1.3e-07 | 41.43 | rubredoxin family protein
AT5G17170.2 | 1.3e-07 | 41.43 | rubredoxin family protein
AT3G01510.1 | 7.9e-05 | 26.04 | like SEX4 1
InterPro
Analysis Name: InterPro Annotations of Melon (PI 482460) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR036034 | PDZ superfamily | GENE3D | 2.30.42.10 | - | coord: 73..146; e-value: 1.2E-6; score: 30.1
IPR036034 | PDZ superfamily | SUPERFAMILY | 50156 | PDZ domain-like | coord: 85..139
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 225..311; e-value: 1.1E-8; score: 37.1
IPR011990 | Tetratricopeptide-like helical domain superfamily | SUPERFAMILY | 48452 | TPR-like | coord: 226..288
None | No IPR available | PANTHER | PTHR47661:SF3 | BINDING PROTEIN, PUTATIVE-RELATED | coord: 33..338
None | No IPR available | PANTHER | PTHR47661 | PHOSPHOGLUCAN PHOSPHATASE LSF1, CHLOROPLASTIC | coord: 33..338
IPR001478 | PDZ domain | PROSITE | PS50106 | PDZ | coord: 102..141; score: 8.747399

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
PI0016731.3 | PI0016731.3 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0005515 | protein binding