MELO3C010097 (gene) Melon (DHL92) v3.5.1

NameMELO3C010097
Typegene
OrganismCucumis melo (Melon (DHL92) v3.5.1)
DescriptionPutative plastid-lipid-associated protein 4
Locationchr2 : 12825165 .. 12846949 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
GCGCAAAAATAGTAACGGTTTGTTTTTCAGCAGATAAAAAATGGCTATGGCTGTGGCCTTATCTTCATCTACAGTTACTCCGACCGCCAAATTCTGGCCTCCTTCAACATCGCAACCACCCAGGTTTCTTCCTATTATTCCCTCAAACCCGTCTTCGAACACTAACCTCCTCACTTCTTCACCTTCTCACAAGTGGAGGCTCAGGATCTCTTTCTTTCCTGCCTTTTTGAACAAAGGAAAAGGAAATAATGTTACTGCTCTTAAACAGGAACTTCTTCAGGCAATTGCCCCCCTTGATCGAGGAGCTGAGGCCACTCCTGAAGATCAGGAAATGGTTGATCAGGTTTGGGTTTCCTCACTTGCCTTTTAATTTCAACAGCCCCACCAGGTGAGTGAGGTTGATAAATAAAGATAATTTGATGTTTCAGAGTCAGGGGATTGGGGCGGGCATATATTTTAAAAAGGAGAAGTTAGGCTGGTAGGGAAATTAAAAATGATGCTATTCTATTTAACCGTATACCTATTCTAAGTTTCAGAGAGGGCAAAGAAAACCACATTGTGGACTGATCATTTTTTGTAGGATAGTCTGATCTGCATATTTTAAGAAAATGGGATTACCTCCTTGAATGAATAAGAGGATGTGAAGTGATGGGAAGAATTTCTTTTGTCATAGTTATGGAGGTTTTATTTGGTCAGTTATCTCTGTGATGGACGTTGGATCCCGATACTATTGGAGGGAAAAGGTGGTTGTGTTTTAATTTTTTTTTTTTTATCCCTTCTGATTGGGCTGCAACTATTTTCGCTAAAGATCTAGTAGACAACAATAGTAACAAATAAGGGTAAGTTAACCAGAGCTTTACCTTTTTTGAAATGGGGTTAATATAAATTATAAGTTTTTCTAGATCATAGTAATGTGAGAGGAATGTATATTGGTTTGTTAAGTGGCATTGAGATCAAGTAGTGAATTTTGCGGGTATTACTAGTGAACATGATAAACCTTGGTACTTCCATAAGATGTTTAGTTTTCCATACGACGTCTAGATTTCTTTCAGCTTTTTCATGTTGTGTTCTAATCTTTCTTCGTCTTCTTTTATTTATTTATTTTTTATTTTTTATATGTATATATCTACTAACATAGCATTAAATGCATTATCAGTCTTCTGGATTTGACTTTTGGTGTCAAGTTGGTTATGTGTACATACATATCTATAACCAGTTTTTGCATGTACAGATACATAAATATGTGCATGCTTCTGTTTTATAAGAAACCACACTTTCATTGAAATAAGTGACTGAATACAAGGGGAATACCAAAATAATTAAGGATCAAGCTAACAAAAGTTGCGTCAATCAAACAAAATAAGATCTAAGAGACTTCATTCCAACAAGTCTAACTAAGACCAAAATGTGAGACTCCCAATTAACCATCTGAACTTGGGAAGGGGTTAAGAAGGGTAATTCACAGGAGACCATTGTTTATGACTATGAAGCACAAACACATATGTTGATAAGACAGGATATGAAAATACGTCAATTTCTAAAAATTAAGGTATTGGATACATTGAAGATCATCATTTTGAAGCTTTCAAGTTCCCCATCGAGGAGGTTGAGTTTATGTAAGGATTTGATGATTTACATTTGCTTGAGGTTGAAGCTGTCATCTTACACAACTACCTAATACTTCAAGTTAAATTAAAGGAGATTTAGATAGTGAAAAAGACAAAAACAAAGCTGGGAAAAATTGATTGACAGCTCGATGACGAGGGTTGTAGTCAAAGAAATGAAATGAAGATCGAGGGGGAGGGCCATGATTTTGGTGTAAAATGAGAAAGAATGTGATTTCTAGGGTTTGTTTTCTTTTACTTTTTTCTGTTATTTATGTATATTATCTTTATATTTGTAAAAGTAACATGAAGCTAGATACTAATTGTAGTCTACTAAAAAATATTAATACCAAGTTGATACTAAATTTTGGAAAGTGAAATACATATTTACATTTGTCCACGAAAAATTCATGATTCAACAAATCAAAACAAAGTTTTCAGTAAAAAGTATTTTCGAAACTAGAAAAATTTGAAGCTTTGAAGTCCCAAATCTCATAACATCTTCAACAACGAAGGAGGAGGTTGAGTTTATGTAAGTATTTTATGATTTACAGTTGCTTGAGGTTGAAGCTGTCATCTTACACAACTACCTAATACTTCAAGTTAAATTAAAGGAGATTTAGAAAGAGTGTAAAAAAAAAAAACAAAAACAAAGTTGGGCAAAATGAAGACAAAAGGGGGAGGGTCATGATTTTGGTGTACAATGAGAAAGAATATGATTTTTGGGATTTATTTTCTTTTATTTTTTTCTGTTATTTATTTATTTTATTTGGGCTACCACATTGAGCTTGTGAAATAAGTGGAGTGTCTTAGATAATACGAGGATCCATCTCAAAACCATTTTGCAATGAGAGGAGTAGTCCATATATCTTATAAGTAATGTATGGTCCCTTCATGTTTGCAATGTCTGATTCTGAATAGGTTGTGGGCTGGGCATTTAAAATACAACATCAAGCCTTCAAAAAATAGCTTCAACATGTCCCCTTCACATATTTGGGATAGGACACATGTATCACGAAAAACAAAATAAATAAATAAAAACTTCAAATCATGTTTTGGGAATGTATCTGTTAGATACCTAGATTAATGTAGGGTTAGGGATATAAAGGTAATTAGATATTTAGACAGTTACATATGGTAGTGGTTATCAATAAGAAGTTGGAGAGGGAAGAAGGGGGATTTTGGTGAGTGGTTTCTGCTAAGTTCCTTGAGAGAAAGAAAAGATAGAGAGGATAGTTTTTACTTGTTCTTAGTTATTTTCTGCATTTTCATCTCAGAATTGATATTCATCAATTCTTCTATGTAGTAGTGAGGTTTATATCAATTGAATAAGAACACATCTTCTTGGTGTTCTAGTTCTATCATTTGGAATCAGAGCTTTTTACTTGGGAAAACGATAGAGAATGGCATGCACACGAATCGAAGAGAAGTTGGAGACCTTTGATTAGGAAATCTCGGGGATACGAGTGGAGTCCCACAAGTTGTCGACAATCAAAGAAACCTTGTTATCCTTGGCAAAGAGCATCGAAAGATTGGGCATCCAGTGGGAAAAACATCAACTACTATTGCTAAAATGCATAGAAGGAGTAGTGAAGAACAATTTGACAATGTTTGGAGGAATGGAAGGATCGTCGAGCAAAGGACCGCCAATTGAGAGTATCATTGAAGGTGGAGTACGACATTGAAGGGAATCAAGCTAGAAGGTAAAACGAACAAGTCTGAAGGGGATGAAAACTTGAACGATCGTAGCAAATTCAAGAAGGTCGAAATACCTATGCAACACTGATCCGGATTCGTGTCGGTTTAGAGCAGATCGGTAACTTCAGATTCATAAATCGATGGATTCCAAGAAGAGGTTGCTGAATCCAGAGAAGAATCGAGGAAAACGCGATTGCAATCACTATATAGCGAAGGTGAAACTGAAGAGGATGGTGAAGAGTGACCAAGAAGAATCTATAGAGGAAGGTCGACGAGGATTACTTGTGAGGCTCAAAATGAAGACTCAGGTGAAGAAAAAATTGCTGCTCAAAGAAGTAGAAAACGGCTGGTTGCGGTGGAGAAAGAAGACGCCGGCAGTTGCCGATCTACGGTGGCCAGAGAAGGTGGTCTGGAATAGGGGTGAACCACGTGAAGAGGAAGATGAAGTTTCTGCCCACGTGATGGTGCCAATGACGTGACAGAAATTGATAGGAAAGAGGGTTGGGGCAAGAGAAATAAACCCTAAAATTTTGGGTTTTACTATTTCAAGTGGGCTTCAACTTATTCTAAATAAAACATTGGAGGCCTATCATTATGTAGATGAAGTGGGTGGTGAGTTCCAAAATGGAAGGGAAGAAAAATGTGCCTAAAGGTGGGTCCCAAAAATGAAATTGACGATAGTATGATGGAAGCTTGTGGTTGTTTATAAGGAGGGTAAAGGTGTGTGAAGGCCTCCAATTATTCATACTTATAAGAGAAAGGGCAGAAAAGTAATTACGAACAATCCAATGATGAGGAAATGGCTGCAGAAGAAGATCGTGATTGATGGGGCCCATGGGTAGGAAAGAGAGTGCTATAAATATGTTTTGTGGGAATTGTGTAGGTATGGTTTCTTTTATTGTGTAAAAGCTGCAGAAAGCTTTGGTAGAGGAATATCTCACCCTCCTTGTAAAGGTGCTGGGTAAGCATTGTTATTTCCTTTTATATTTTGTTGTTGAACTCTCTGTGATCTTATATTTGGCTGAAAGTAATATAAATAATAGAGTGCTGCCTTTGTTTTAGCCCTTTTTGTTAGGATTGCTTGTGATATTTGAGTTAGAATCCTAACAATTGGTATCAGAGCCTAGAAAATCTAGGGGTTGTTACATGTTTGGTGGTTCACAAATAGATGGAAGAGAGGTTAGAAGGGACGGAAAAGGAAGTTTTAGGGCTATAGGAATGATGCTTGATCTGAAGAAGGTCATTGATAGGATGGCAGAGGATATGAGAGAAAACACCAACTACAAACGCAAGGAAGAATCTGGTACATCTGATGGTTCTGTCATGAAGTTGAAAGAAAAAATGGAAGACCTGAAAATTGTAACTGATAATGGCACATTAACAGTGGACTGAAGCAAGTATAAAAGTTAGAAATGCCCATGGTTTTGGGGGAGAATCCAGAGTTTTGGGTGTCCATGGCTGAGCATTTTTTTAAATCAATAACCTCCCAAAATCTGAGAAGGTAAAGGTAACAGTAGTGAGTTTTGGGCAAGATGAGGTTGATTGGTACAGATGGAGCCACAATATAAAAAGGTGGAATCGTGTGAAGATCTAAAAGGGAGAATGTTTGAATATTTTCGTGACACTGGACAAAGAAGTTTGGGAGCAAGATTAATACGCATTCAGCAAGGGCTTTGACAATGATTATATGAAGAAATTTGTGAATTATTCAGCACCCTTACCTGATATGGCAGAGAGTGTCCTCATGGATGACTTCGTGACTGGACTAGAACCAGCCCTATAAGCTGAAGTAGTCGGTAGACAACCCCAAACATTGGAAGCTTGTATGAGAAAGGGGTTAATGATAGAAATTTAACATTAAAATTTGCAAGAGCTGAACTAGGGATACTTGAACTCAAGGGAGGGGATGCTTCATCAACCAAGATTGTGGGAGGTAATGAAAAAGGGAACCCAAGAAAGACAGAATTCCAGATGAAACAAATCACTATCCCTATCAAAGGGAACTACCCAAAAGGAGAGCCACCTGTAAAGAGATTGTTAGATACAAAATTTAGAGCAAGATTAGATAAAGGCCTTTGTTTCATATGCAATGAGATATACTCTCATGGGCACCGATGTAAAGTTAAAGAAAAAAGGGAGCTTATGTTGTTCATTTTTAATGAGGAAGAAGGAGCAGAAGGAGTAATAGTGGAAATCGAAGAAGACATTGTGGAATTGAAGAACTTAGAAGTACCCGTGGAGACTGAAATTGAGTTGAAGACTATTATGGGTTTTGCGGCCTAGGGAACTATGAAATTAAAAGGATCAGTCAAAGAGAAGGAAGTAATTGTGTTATTGGACAGTGGGGCTACTCATAACTTCATACATTAAACATTAGTGGAGGAGATGGGTATAATGATTGTGAGGGGTGCTACCTTTGGGGTAACCATTGGAAAGGGGACAGATAGAAGAGGGAAAGGGCTATGTAAAAGAGTAGAGTTGAAACTTCCCGAATTAACTATCATTGCTGATTTTCTTGTAGTAGAGTTGGGGAAGGTTGTATCATACTAGACATGCAGTGGTTGTGTACCACTGGATTTATGGGAGTCCATTGGCCCTCTCTAACAATGACCTTTATGGTGGGAGAAAAGTAGATTACTCAAAAAGGGGATCCCTCTTTAATAAAAGCTGAATGCTTTTTAAAAACTATACAAAAGACATGAGAAAAAGAAGACCAGGGTTCCTACTTGAGTTACAGAACTTTGAAATTGAAGAGGAGATCAACTATGAAGAAGTGTTAGAAAAAAGGGAGGAGGAAGATTTACCTATGATTCGGTTTCTGTTAAACAATATACTGACATATTTGCACCACCTACTAGGCTACCTCCGAAAAGGGAAGTTGACCACCACATTATGACCTTACTAGGACAGAAACCGATCGATGTACGCCCTTACAAGTATGGACATGCTCAGAAAGAGTATAAATTAAGAAATCAGTGACTGAAATGCTACAAGCAGGGGTAATTCGACCAAGTCACCACCCTTTCTCAAGCCTAGTTCTATTAGTCTGGAAAAAAGATGGAGGGTGGCGGTTGTGTGTCGATTATAGAAAGCTCAATCAAGTCACAATATCAGACAAGTTCCCAATTCCAGTTATAGAACTTTTGGATGAACTTCATGGGACAACGATTTTTTCCAAGCTGGATTTAAAGTCTGGATACCTGTAGATTCGAATGAGGGGAGAAGACATTGAAAAGACAGCTTTCAGGACACATGAGGGTCACTACGAATTTCTGGTAATGCCGTTCGGCTTAACCAATGCTCCGGCCACCTGCCAGCCACCGATGAACCAGGTATTTTGCCCCTTCCTAAGTCTTTGTGTACTTGTATTTTTTGATGATATATTGGTCTATAGTTTTGATGTGACTGTACATGAAAAGCACTTGGGGATGGTGTTTGCTGTTCCTAGAGATAATCAACTGTTTGCAAACAGAAAAAAATGAGTTATAGCCCACTCTAGAATCCAATATCTAGGCCACTGGATGTCAAGTAAAGGGGTGGAAGCTGATGACGAAATAATACGCATCATGGTGAATTGGCCCCAACCTAAAGATGTGATCGGATTAAGAGGATTTTTGGGGTTAATTGGAAAGCTACTGAAGCCTTTCAAAAACTGAAACTAGCAATGACAACCATCCCTGTTTTAGTCTTATTGGACTGGTCACTTCCTTTTATAATTGAGACAGATGTGTTAGGAATTTGGTTAGGGGCAGTCCTCTCACAAAATGGTCATCTCATTGCATTCTTCAATCAGAAATTAGCCCCAAGAGCCCAAGCCAAATCCATCTATGAGAGAGAATTGATGGCTGTTGTTCTTTCAATCCAAAAATGGAGGTATTACCTCTTAGGGAGGAAATTTACAATAATCTCAGACCAAAAATCTCTTAAATTCATATTGAAACAGAGAGAAGTCCAACCCTAATTCCAAAAGTGGCTTACCAAGCTACTTGGGTACGATTTTGAGATTTTGTATCAATTGGGTTTGCAGAATAAAGCAACTGATTCCCTCTCTAGAATAGAGCCACCTCCAAAACTGAATATCATGTCAACCCCTGGGATTGTTGATTTAGAAATTTTGCTAAAAGAAGTGGAAGAGAATAGGGAACTCCAGGAGCTTATTGAAGGCCTAAAGAAAGATCCAAGGGAGTGAAATAAGTATCATCGGGTGAATGGAAGACTTCTATACAAAGGAAGACTAGTTCTTTCCAAGAAGTCAACACAAATTCCTAATCTGCTGCATACATTTCATGACTCTATTTTAGGGGGTCATTTAGGTTTTTCGAGAACTTATAAGAGGATGAGTGGGGAACTTCATTGGAAGGGATGAAGACAAATATCAAGAAATATGTTGAACAATGTGATATTTGTTAACGAAACAAGACAGAAGCTACTCGGCCAACTAGGTTGTTACAACCAATTCCATTTTCTGAGCACATACTGGAAGATTGGTCCATGGCTTTCATGGAAGGATTACCAATGGCAGGGGGAGTTAATGTAATCTTTGTGGTAGTTGATAGATTAAGCAAATATGCTTATTTCATCTCTTTGAAACATCCCTTCTCGACTAAATAGGTGGACGAAAACCTTCATTGATAAGGTGGTGACGAAACATGGCATTCCCAAATCCATTATCATTGACAGAGATAAGATAATCCTCAGTAATTTCTGGAAAGAATTGTTTGCTACCATGGGCACTATACTGAAGAGAAGTACAACATTCCATCCCCAAACGGATGGGCAAACAAAATGGGTAAACAGATGCCTAGAGACCTATTTGAGGTGCTTATGTAATGAGCAACCGAACTGATGACATAAATTCCTTGGCTTTGTGATATAACACCCATTTTTCATGCATCCACCAAGACTACACCTTTCGAACTTGTGTTTGGTAGACCTCCGCCGCCCTTGATATCATATGGAGATAAAAATCCTCCAACAATGAGGTGGAATTGATGTCGAAAGAAAAGGATTTAGCCATAAATGCACTAAAGGAGAATTTAAGCGTGACTCAAAATCGGATGAAGAAAATGGTCGATTTAAAGAGAAGGGAACTAAAGTTCAAAGGGGGAGAAGAAGTGTACTTAAAATTGAGGCCATATAGACAACATTCACTGGCTAGAAAAAGGTGTGAGAAGCTTGCTCCTAAGTTTTATGGACCTTACAAAATAATAGAGGAAATTGGGGAAGTGGCTTACCGGTTACTACCTCCAGAGGTAGTCATCCACAATGTCTTCCATGTCTCTCAGTTGAAGTGGAAACTGGGGAAGCAACAACAAGTACAGCACCAAGCCCCAATTCTAATAGAGGAATTTGAACTGTAGTTGTGGCTCAAGACAGTTATGGGAATTTGTTGGAGTAAAGAGGTGGGAGCAAATGAGTGGTTAATCAAATGGAAGGGCTTACCAAAAAGTGAGGCTACTTGGGAATCAGTATATCAAATGAATAAGCAATTCTCTACGTTTCACCTTGAGGACAAGATGAATTTGGAACCTAGGGGTGTTGTAAGGCCTCTGATTATTCATACTTATAAGAGAAAGGGCAGAAAAGTAATTACCAACAATCCAATGATGAGGGAATGATTGCGGAAGAAGATCGTGATTTATGGGGCCCATGGGTAGGAAAGAGAGAGTGCTATAAATATGTCTTTATGGGAATTGCGTAGGTAGGGTTTCTTTTATTGTGTAAAAGCTGTAGAGAGCTTTGGCTGAAAGTAATATGAATAACAGAGTGCAGCCTCTGTTTTATTTTTGTTAGGATTAATTGTGATATTTGAGTTAGAATCCTAAGATTACTGGCATTTGGCACAACAATTTGATTATGGGCCTAGCCCACTAGAGAAGAAGAGTTTGAAAATTAAACACTTGGTGGTGATATGCAAGGGTTAAGTATGACAACATGTTTGGGAAAAGGAAGATTCAAGAACATTGGGCTGCTTTGTGGGCTGAAGGAAGATGATTCTGGGATATGGGCTGAAGTTAATGAACCTTATTGGAGCCAAGAGTTTTTCAAAGCTGAAAGGGGAGAACTTTTTTTTCAAATTAAAGAGCAAACTGCTGAGATTGTTGGTAAAAAAAGGGCCCAATTTTGTGATCATGATTGTAAGCACACCTTGAGGACAAGGTGTTTTGAAGGGCCGGGTAATGTTAGATACCTAGATTGGCATAGGGTTAGGAGTATAAAGATAATTAGATATTTAGGTAGTTACATATGATAGTGGTTATAAGTAAGAAGTTGGAGAGGGAAGAAGGAGGAATTTCGGTGAGTGGTTTCTGCAAAGTTCGTTGAGAGAAATGAAAGGTAGAGAGGGTAGTTTTTACTTGTTCTTAATTATTTTCTGCATTTTCATCTCAGAATTGATATTCGTCAGTTCTTCTCTGTAATAGTGAGGTTTATATCAATTGAATAAGAACACATCTTTTTGGTGTTCTATCAGTATCTTCTAGGTATCTGGATATATCAATGTCGGATGTGTATTTGATATTGATTCTCTACATACTTTAAGTATTTGTGCTTCATAGGTTAGTGGGGCAGTTGGTTGAGGGAATATCCTCTAAGTGGACCATGAGAGGTTTTTGAAGGGTTTGGAGTGAAGGTTGGGTATGAATGTTTTTTTGTGGTAGCAAAAGCTTTCGAGGGGAGGATTCGACTCTCTTAAATAGCCAATAGGTTTATGTGTTTTCCTTATCTTGCTTTCGTATATCTCTTTATGAATTTCTTTGCAATCATTGAGGAATTGTTGAACTCAAGATATTTAGTGAATAGATATTGTACATGAAGTTACATCCCCTGCTCTTGTTATATTTATGATCTTTCTACTTGATGGATTAGTTGGTACCTAACCTATAATTACAGAGTGGCCTGTGACTCTTATTGAAGATTAAGGTCTAGTAACTAATGCCTATAAGGAGGTTCACCTAACTGACCTCATGAGATGACCTCTCCAGCTCTCCAAAAATCCAATTATTTTTAAGCTCCTTAGAAACAATAGGACTTTTGAAGTGGTGAAGAGGTCCTTTTTTGTATGTACATACACTTTAATAAATCATAGCATAGTTAATCGTTCTCATTTTGATTATTACCAAAAGATATCACGCAAGCTTGAAGCAGTCAACCCTATAAAGGAGCCACTCAAATCTGATTTACTAAATGGAAAATGGGAGCTCATATATACTACTTCCAGATCAATTCTGCAAACTGAGGTAGCTGCATACTTCATTATGACATGTTTCATCACAACTATTTCTACTCCAACATATATGCACAGTATAGTCATTTATGCTGTTTCTCTATCTCTGGTTTAATTCAAGATTTTGATGGATATTTAAAAATTAGATCTTTCATTAGCTATTATTGATAAGATACCATCATTGATTAAGGTAAATGTGTTAACATAAGGTCATGAATAAGAATACTAGCCTAAAACAAGCTAGGAATCCTCATGAAATGATTAAGGAAGGATTAAGTTATTAAGAATAAAAAACAAGGTAGAATCGCTCACCAAAAAGGCTTGATGCAAGAAACCTTAGGATGGGCTTAATTAATTCTTAACCGTTGTGCTTTGGCAGAGGCCCAAGTTTTTTAGGTCAAGAATAAACTATCAAGGAATCAATGTAGATTCTCTCAGAGCTCAAAATATGGAATCTTGGCCGTTCTTCAACCAGGTATAAGTTACTGCTCCCTTTGTTGTATTTGTTTTCAATGAGATATTATGTGCTTTATCTCTCTCCTACTTCCTAACTTGTGTGCCAATGAGTTATGTTTGTATGTAATGTAATTTCTGTAGATTATATTTACTATTGCCTTTACCCTTTGCTTGCCAGATGCCATATCAAATGAAGTTATTTTGAAACCAAGTTATGAACCAGATTATATCACATCCGTAGATTGAGATGGGTCATTTAATTTTTTTTTTGTTAGATATATAATTAAAAGATAGTTTCTTACCCTTGGAGTTAGGGGTAGTTGATGTATTCATGGAATGCAATGGTTGCACTCGTTGGGTGTAACAGAGGTTGATTGGTGGAATTTGACCATGACATTAATCAAGAAGGTTGGAATGTAGTGTTGAGAGGTGATCCTAGCTAGTTAAGCTAGGGTAAGCTTGAAGAGTTTGATAAGTCTTGGTCAGCATCAATCAAGGATTTCTAATTGAATGTAGAGTTATAGAAGGGGGAGTGTTGTGGGCCGAACCGTATGGAGTTGATGAAGTACCCACAGTTGCTGAATCAGTACCTATGCTGTTAATTTGAAGATGTGTTTGCTTGGCTAGAGGAACTTCCTTCGAGGAGGAGCATTGAACATCATATTCATTTGAAGAAAGGGATTGATCTGGTCAATGTACACTCGTATAGGTATGCTTACCAACAAAAAGAGGAGTTGGAGAAATTAATGGATGAGGTGTTAACATCTGGAGTTATACGACCAAGTACTAGTTTGTATTCTAGTCTTGCTCTATTGGTGAAGAAGAAAGATGGTAGCTGGAGGTTTTGTGTTGATTATCGTGCATTGAATAATGTGACAATTTCAGATAAGTTTCCAATACCTATTATTGAAGAGTTGTTTGACGAATTGAATGGAGCAAGTATGTTTTCGAAGATTGATTTGAAGGCTGGCTATGATCAAATATGGATGCATTTGGGAGATATGGAGAAAACATCTTTTAGAACTCATGAAGGACACTATAAGTTTTTAGTACTGCATTTGGGCTAACTATTTCTCCCTGTACTTTTTCAGTCCTTGATGAATACTATTTTCAAGCTTTACATGGGAAGATTTGTTTTCGTGTTTTATGATGGTGTACTGATTTATAGTAAGAGTCTGGAGGACCACTTGAAAAATTTGGAGTCGGTGTTGGAGATTCAGAGAGAAAATGAATTGTATGCAAACGTAAATGCCACTTTGCTTGAGCAAGAGTGGAATATTTAGGGCATGTCATTTCGGGGAAAGGGGTAGAAGTTGATTCTGAAAAGATCAGAGCAATAAGGGAACGGCCAATACCAACGAATGTGTAGGAAGTAAGGGGATTCTTGGGTCTAATTGTTTACTATAGGTGGTTTGTTCAGAACTATGGTAATATGGCCGCTCCATTGACCCATCTGTTGAAGGAAGGAGCTTACCAGTGGACAAAGAAAACTCCGGAAGCTTTTGAAAAATTAAAGAATGCTATGATGACATTACGTGTATTAGCATTACCCCATTTCAACTTGCCTTTCAAGATTGAAACAGAAGCATTTGGTTATGGAATTGGAGCTGTGTTAGTCCAGTCGAAGAGGCTGATTGCTTATGTCAGTCATACTTTGTCAATGAGGGATCGGGCTAAACTGAGTTACAAATAGGAGCTTCTGGCAGTGGTTTTGGCTTTACATAGGTCGAGACCATATTGGTTAGGAAGGAAGTCTGTGGTGAAGACTGATCAACGCTTGTTGAATTTTTTGTTGAAACAACGAGTGATCTAGCCTCAACATCAAAAATGGATTGCTAAATTGTTGGGTTACACAATTGAAGTAGTCTATAAACCGGGGTTGGAGAATAAGGTTGTGGATGCACTATCTCGAATGCCTCCAACAATGCACTTAAATCATTTCTCCACCCCTGCTCTATTGGATTTGACAATGATTAAAGAAGAAGTGGAGAATGATCCGTGATTAAGAGAAATCATTGAAGAGTTGAATAAAAATGAAGAGAGTGTGGTTGATTTTTCCCTACATCAAGGAGTGTTGAAGTACAAAGATAGAGTGGTGAAATCAAAATCTTCATCTTTATTACCTACTATATTACACACTTATCATGACTCGGTGTACTCCAGAACTTATGAACGTTTGATGGGAGAGCTGTACTGGGAAGGTATGAAGGGTGATCTAAAGAAATACCGCGAAGAATGTTTAGTTTGTCAATGTACTAAGGCTTTGGCGTTATCTCCAATGGAAATTCTAGATGCTATATTGAGTGACATCGATGCATTTTATTGATGGATTACTGATGGATTTGATGTGATATTGGTGATGGTGGACAAACTTAGTAAGTACGCCCACTTTTGGCTTTGAAGCATCAAATACAACGAAGTTTGTGGTTGAAGTTTTTGTCAAGGAAGTGGTTTGACTACATGTACTTCCAAGGTCCATAGTTTCTAACCGTGATAAGGTGTTTGTCAGCAATTTCTGGAGCGAAATGTTTAAATTGTTTGATACTAAATTGACCGAAATTTAGCCTTTCATCCACAATCAGATGGTCAAACTGAGGTGGTTAACTGAAGTGTTGAGGCATATCTTTGTTGCTTTTTTGGAGAAAGACCTAAGAATGGTTGTCTTGGCTACATTGTTTTGAATACTGGTATACATGTATCAATTTCCATTGGAATTACCCCATTCCATGAAGTGTATTGATGATTTCCGCCTTCGTTGATTTATTATGGAGATTTAGAGACTCCTAATTTTACCCTTGATCAGAAACTCAAGGAAAGGGATGTAGCTTTTGGGGGTTTTGAAGGACCATTTACGAATAGCTCAAGGAAAAATGAACTATGCTGATAAAAAGTGGTGACATGTGGAGTTTCAGGTGGGAGATTTGGTATTTGTAAAGATACACCTTTATCGGCAAGTGTCCTTGCAAAAAAGGAGAAATGAAAAACTATCTACAAAATTTTTTGGGTCATAGAGGGTGCTGGAAAGAATTGGACAAGTGGTGTATAAATTGGAACTACCTCCATCAGCTTCTATACATCCGGTTTTTCATGTCTCACAGCTGAATGCAGTGTTGGGTGAGCATTCAGAGGCACATCAGGTGGTTCCTTACTTGTCTGAAAATCATGAATGGATGGCCAAACCAGAGGAAGTGTATGGGTGTCGTGAGAATCCTATGACCAGAGTGTGGGAGGTATTGATCAGTTGGGAAGGAAGGGTTACTTCCTCGTGAAGCCACTTGGGAGGTTTGTTATGATTTTCAACAGCAATTTCCAGATCCGCACCTTGAGGACAAGGTGATTTTGGATAAGGAGAGTAATGTGGTGGCAATACACTACGAGAGGGAGTAAGGGGAGTGCATGTGTAATGGGTAGTATTGCTGCCAAGAATGGGACCCACAATTATTTAGTAATAGGGGAAGGTTTAAATAGTGAGGGAGGTCTTTCAGTGAGGATATGAAAATTATGTGAGACATTTGTCTATTCCTTGAAAGATAGGAAAGGCAGAGAGAGGGTAGGGTTTCCATTTTGTTCTTCTTAATTGTCTTCGTTCTTGTTAAGGATTTGGTATTCTGCCAATTCCTTGTAATCGTTATTGTTATTCCATTGAATAAGAATCATCCAAAGTTGGTGTTCTATCACCAAGCCACTTAAGTACCACATTGGTCATCACATTCTAACCAATGTGGGACAAAAGCATCCCATACTACCTTGGTTCTTAACACCTTCATCTTACGTAGCATTCGAAGCCTGCTCTTAATTGGTCTATTTTCTCATTAATTGTCTGTTAGCTTGGATATGATCCCATGAAGCAGAAGCTCCTGATTTCAACTTGAATGTAACCAGCTGAACCTTTTGATTCTTTGTTGTATTCTAGTCAAAGAAGCTCTCAGCACACTTAACCCAATTGGGAAATGCTTCGAGGTTGATTTTCCTATTAAATGTTGGCATATCAATATTCATCTTATAGTTTTGTGGTTGGTTCTCACGTTCTTGGTCATGATCAACTCTTAGAAGTTGTATATTTTCTAGAAATCTTCTAATAAGGGGTTCCTCATTATCATTTGAGGTTCTGAATCTTGATCTTCTTGCTGGATTTCTTGGATTCTTGGAAGGTTTACCGGTGCTTTATTACTTGTTGAAGGTTGTGAGCACTCCCTATTATTGATTCTAGGACAGATTTGTTGTTTTTCTTGAGTTCGATTTCTGTTTCTTGTGTTTACCTATTTGTCCAAAGGTTGAGGTGGAGGAGCAGCAATGGTTAATTGATCAAGTTGTCTGCTCACTGTAAAAAGAAGTGTCATCAATACTCGATCACATCCCACGAAGTGCACCTTCAATGGACAGCAACCGTGCAGTTGTCTTGGGATAAATTTATAGAGGATTTTTGTCATCCGCTTCACATACCTGATTGGAAATAGCATTTATTTTCCAACCCATGAGGGCCAAAATCTTTGGAGCTCAAATACTAAAACTTGATGTATACAAGTTTGGAAGACAATCTTTACTGAATAATACTTATATACAAGTGTAAGCCTCAAGAATAAGAAGTACGAAGCACCAGATCAGTTCACATGGTAATCAAATGTTTACAGTAAAACAAACATAAAAAGGACTATTAACCACATACCCTGAACAAACCAGAAAACCGATAATGAATAACCAACCCAAAAATTAAAAGGACAAGTGATACATCACCTTAAAAGATAACAATCAGAGGGAAGAGTTGCAGCAGAGGTACATGAAGGATTGTATGACCCTTAGTTAAAATGTTGAATAGGCTTACCCACGAGTGATTCTACAATGGTTAACAACACTTTGAAAATGCTTTTAATCATGCAAGATCAATCTTAATAGGATCAAAATTGCAATTAAAAATGTATAATTAAAGATTGAATCAATCTCGATTGATTAAAGTTATGTTTCAGGGTAGAAAAATGATTTTAACCATTTCAAAATTAATCCCTAACATGTGCACTTAGTCATCTTAATATTTCAAGTTTCTATATGGGTTTAACCGTTGACATTCCAAAAAAGCGTTGCAAAAAGGATGTAACAATATATTAAAGAAGTGCTTCCCATTTTTCTCAATTAGATAATGAGAGATTGGTTTATAGGTCATACTTTATGTTCATGTTTGTCTTTTTTGTAGGTCACAGCAGACTTGAAGCCTTTAAATTCAAGAAAAGTAGCTGTTCAATTTGACACATTCAAAATTCTTGGTCTGGTAAATCATCTGGTTTCTTTGACTATTCTACTTAGTTCCATATGTAGATCAAACTTTATTTTGTGCTTGCATTTTCCAGACTCCGGAATTCATTTTCCTTAAATAATTGAAATGAGAAGTTACATTACTGGATAGTTTGGATGTCTTTTGTAGGACCTATGACTGAAGAAGCTTTTTTACTGGACACCCTGCTTAGAAAGAATCGAGATCCTTAGGCCTACTTTTGATCCTAGGAAACAGGCTTAGCTGTTGCAATTGAATTTTAAGCTTAACTTTTTAAGTTTTAAGTTTTTTTTTTTTTTTTTTTTTTTTTTTGAAAAGGAACTTTTTAAATAAGTTAGATATACTATGTTTCATAAAAGACTTTAAATAATGGAGCATTTTAATTTTATTTCTGTTGGCAATGGAAAAGAAGTTTTATCTTGAGTAAAATTAATTGGTTATGTGTAAGACTGGAATATATTGGGATAATTAGATGATCAACTGTAAGGGGGCATTCTAGTAATTAGACATTAAGTTAGGTTTTTGAGTTAAAAATAGTATGAGAGAGTATGACTAGGGGCGTGAAGAATTTAGAACAGTGTTTGGCCCTTGGGAGAGAATTCTCAACCCTGTAGAATGTTTTGAGGAGTTTGTGCATCTATATTTACTTGTTTTAGTTATTATCTTGATATTGTATTTTCCTTGAAGTAATTATATGTTGAAATTAGTGGGCTTAGCCCATTGGGAGATTTGCCCTAAATATTCTTTCTTTTTTTTTATCTCTTTGTATTTTTCTATTTATTCCCCTCTACCTGTTGTATTATTATGAAAAAATAATAAGAACAAAGTATCGTGGATATGATAAAAAGTTATGAGTAGTTTTAATTAAGCTTCAATTTTGACTCTCTCAAAAGTCATCTCAAATTGGCATCTTTAGAGGATATGATTCAAATCGTCTTTTTCTTTTGACAAGCAATACAACAAAACAACCCTACTAGCAAAGGCAACTTCTTAAAAAAAAATCGATCAAGAGTGTTAACTTGTTCATAAATAACCTGCTAGACGAACTCAAGCATCTCTTGACACCCCATTGAGCAGAAAAGATAAGATCCCTAATGGTAGAAGGGTCCATTAACCAACTAATGAAAGAGGTGAAAAAGAGCCCATATTGCTTAAAAAATGCACATCCCTTCTTTCGTGCCTAAACTCAAATTTCTTGATTTTAAAAAAAAAAAAAGATTTTGTGACATCCATTTTTTTCTTATCAGAAAACAAACTCTAGAAACTCAGCAAAAAGGAAAAAGAGCTCCCAAACCTTACCATAATTTCAGCCATGACTATTTTTTAAGGACAAATGACATGGACGAGGAAACGTAAATGCGTAGAGATTTATTTTCCACCAACTGATCCTTCCAAAAATATGTTTTCTTACCCTCACCCCTACACAATGAACCAAGTAAGAGAAAGAAGGAAGCTCAAGGAAATATCCTTCCACAAACACCTATGTGTGCCTTTAATCCTCTACCCGACAACCACTAAAAGAATGTGGTTTGCACTTGCTTGCGTTAATCCTATACCAGACGGTTTTAGGCTCAAGGAAGTTGAGTATATGAGCTGGTTATGTAGGATGTGACAGGTGTCCCTCTTGGAGGTAATCTCAAATGTGTTTCTTTTTGGAACCCAGTGGTAGATGGTTAAGGAGGAGATTGGCTTCTTGGAGGAAACTATTGTTATCTAAAAGTGGTAGAACAAGTGGCTCGGAGTTGCTTTACAGATTTTGCAACAATGAAGCGAGAGAGCCATGTGAGAATTAGGGACCATAAATGCCAAATATGGAATTCTTGAAAAAAATTGTCTAGATTTGCAAGCTCAGGCAATTTGTATATTGACGTAAAGACAATTTTTAAATTAAACTATGAAATCATTTACTCATTTCCCGGAAACTAAGCTAAAAACTATGGAAGACTGAGATTTCTGGATAGGCTAATCCTCCCGCTAGCTTTGATATTGCCATATATAGCCCCCTCACATGACTTGTACACGGCTACAAAACCTAATGCTTTTCTTTGATGACGAAGTAGGCTGGTTGTGGCTGAATGGAATCATAGTGGCCAAGTGCCAATCATAGAGCAGTCGACTAAGGTCTCACAATAAATCTCTCACCAGTTCTAATTTCATTGATGTAAATCTTTTCATGTCTTGATCAACAGATACCGGTCAAAGCACCTGGAAGAGCCCGTGGTGAACTGGATATCACATATTTGGATGAAGAGTTGCGGTAAGTAGTTATGCTGAATTTTCTTTTTACTGTTTTCTAGTTTTGTTCTAGTTGTAGGAATTCTACTTGATTTTTGGATCACAAGCAGTTAGTTCTTTGAGTGAAACAGATGGTTAGATTAGATGATATTTATATCAAAATTTAATAGTAAAATCTATTAATAAATGAATAATCAATTGTCACGTTCTAAAACAATTTTATTTCTCTCTCTTAATTGATGTCAGAAACTGTTAATATGAGCTATAAAATAATAATTATTTCAAAGATAAACTACAAGTTGATGGATAAACTTCTTATAACGAACATTGATATAAATGAGCTTTATTTACAAAAAAAAAAAAAAATGAAGAAAGTTTGTGTCTAAAAGTTAGTAGTAGTAGGTCCACTCATTATCAACTTCAATCCATTATCTCTAATTGCTTCTGTCCTTAGAATTCGACCAAATCTGCCTCTTTTATTGCAGGCTTTAAGTAACTTTTCTAAAGTTGCGAGAAGGGAGAGATTTTTGTTAAATCACTACTAATTTAAGCTTAAGTCAATAGTGTTAGCGATATATAATTAAATTTGCCTTCAACCACTAGCTTAAGCTCTTGGGTGAATTGGTGATTTAATATTGTATCAGAGTAGGTGGTCTAGGGAGGTCCTGGGGTCTTGTGTTCAAGCCCCTGCATTGTCGTTTCCTCCCCAATTAAAATTGATTTTCACTTGTTGGGCTTTTCAAATATTTCAAGCCCACAAGTGATAGGGAGTGTTAGTGATATATAATTAAATTTGCCTTTTGGTCAGATATTTGGGTTGGAACTTTCCTTACTCTACAGAATAGCCTTACTCCCCTATGGCTCGGTTGCAGATCATTTGGCTTTGGCTACTCTCTCTTGGTCCCTACCATTTAGAAGAAGTTTAAAAGATGCAGAAATTATTGAATTGGAATCATTACTCTCCCTTCTCTCAGTTGAAAAAACTCTTGATTTTGAGGACCACAAGGTTTGGTCAATTGATCTTCAAAGTTGTTTTAATATTTAATCTCTCTCGATGAATTTGCATTCAGCTCCCCAATGGATAATCGTTTATTTAAAGCCTTATGGAAGTCATCCAGCCCCCAAAGAGTTAACATTCTGATTTGGATCATGATTTTTGGATCCCTCAATTGTGCTGATGTCCTCTAAAAGAAGGCCCCCAACAAGTGCCTACTTCTCTCAGTTTGCCCCCTCTGTTTGAAAGCAAATGAGACTTCCTCACATCGTTTTGCTTTGCCCCTTCTTCTCTTTTTGCTGGAACAAATTCTTCTCCATTATCAACATTGTTTGGGTTTTTGATGGCTCTCTTTGCTCCTCAATTCTCAAGCTTCTAGAGGATCCTCCGTTCCCCAAAAAGCCAATATTATTGTGGTTAAATTTGCTAAAAGCTATCCTTTTCGAATTGTAGTTTGAAAGAAATCAGAGGCTCTTTCACGACAAAGAAAAGCCATGCTTAGAAGTCTTTCTCTCTTTATAGAGAAGTGAAGTAGCTTGGGTCTCCCTAAAAAGGAGTTCGGGGCTTACTCAATTCAAGATATTTATCTCAACTGGGCTGCTTTCCTCTCACAACTAGCCCTTTATTCTTCATCCAGTATTGGATCTAGTCTCAGTTTTGAAACATTCAGAAACTGTAGTCTTTCCTTCTGTATTTTCATCAATTTTTAGTTTCTTTCCTTGACTCGGCCTGTCTTTATGCCTGGACTTATACAAGTTGTAATGTTTATTTGCTTTGTTTTGTTTAATGTTATTTCTAGGATATGATGTTTTGGTTCTAAGGGGGTGTCAACCTAGTTGAGATGCCTAGAATTTTCTTTGTTCGAATGCAATCTTCCTTGCCTCCAAATTAACCGACAACCCTTGTCTAGAAATTGGACTCAACTTCCGATCTTTTGAAAGTAATTTGAGGTAAATGGCACGAGTGGATTGTGAGAAAACCTTGTCTGAAGTTGCCTATTTATAGTGAACCCTTGTTTGAAACTGAATTGACACCTTCAACTTCCTTAATTTGCCTTTTAATTTCTTTATTCCACCTTCTGCTTCTTAATCCACCTCTTACTTGGCCGCAAAATGCATGAAACACGCTTAAAACACCTAGATTTTCATTAAACTGGTTTCCCCTAATCAGCCACGTTGCTTACTTACACGCCTTGCTAACATCTATGGCTGCTAATTAACCTCCCTCGCGAGCAAGCCTTCCTCGCCTAGCTCTTGGCCGCCTAATTTTCATGCCTACTAACCTCTTAGTGTGTCATCTCGCCTAGCATATGGCCAATCTTCTATGCATAGCCAACATCAATCTCCTCGTGTAGCCCACAAAAACCTTGCTAACCTCTTAGTCAATCATAACGCACGCCTAGCAAACCTTTGCTCGCTTAGCAACCTTAACACCTATGTCCTTTCTTTAGACGTTATTCCTCCTTATGTGGCCGCCCAAACTTGTCTTTCTCGCCTAACCTTAGAGGTTCTCATCAACATTTGGCCACCTAAAACATGTCATTTAGAGGTTATCTAAAGTCAATTTGCCGACTAGCCTATGATTAAAACGCCTAACCTCTTGTGCTTCAGAGGTTATTTGATCTTCTTTTGGCTACCCACCTAACTTGGCCATTTTTCCCTTATTGACTAAATATCCAAACCTCAATACTTAATCTTTAACCCTTCCAAGAAATTCTCGTTATTTGTCTATTAATTTATTTTCTCTTCCTTCACCTTAGCTTCTCAACGGTTGCTATAAATTTTGGGCTTTTACAAAAGGCTTATATTGTCATGAAATAGTAAATGAGTGGACTTGAAAGTTTAAATAGAGATTAACTGAACCACATTTTGCATGCTTTACATCTTTGAAACGTCATTCTTCATCAATCTAATATTATGAAGCTTTCTTAATATCTTTGTATGTTATATTAATAATATCTATGTTTTCATTCACCAAACATTTGATTGATTTTACTTTTATACATTCTCATCACTGGGGTGTTAAACTGTTCTATATATATCACTGAATATTGGCAGAATATCCAGAGGTGACAAAGGGAATCTCTTTATCTTGAAGATGATCGATCCTTCTTATCGAGTACCTGTATAACACTTTAATCTCTCTACTTTACTAGTAGTATCCTTCTAAAGCCTGATGTCTAAAGCCTGATGAAGCACTAAATCAGAGTGAATATTATGTTATTGTAAATTGATTCTCATGATATCTTCATAAAACTTGTGGCAACCTTCTGATATTTTTTTTGCAATATACCAAATTATGCTTTAA

mRNA sequence

GCGCAAAAATAGTAACGGTTTGTTTTTCAGCAGATAAAAAATGGCTATGGCTGTGGCCTTATCTTCATCTACAGTTACTCCGACCGCCAAATTCTGGCCTCCTTCAACATCGCAACCACCCAGGTTTCTTCCTATTATTCCCTCAAACCCGTCTTCGAACACTAACCTCCTCACTTCTTCACCTTCTCACAAGTGGAGGCTCAGGATCTCTTTCTTTCCTGCCTTTTTGAACAAAGGAAAAGGAAATAATGTTACTGCTCTTAAACAGGAACTTCTTCAGGCAATTGCCCCCCTTGATCGAGGAGCTGAGGCCACTCCTGAAGATCAGGAAATGGTTGATCAGATATCACGCAAGCTTGAAGCAGTCAACCCTATAAAGGAGCCACTCAAATCTGATTTACTAAATGGAAAATGGGAGCTCATATATACTACTTCCAGATCAATTCTGCAAACTGAGAGGCCCAAGTTTTTTAGGTCAAGAATAAACTATCAAGGAATCAATGTAGATTCTCTCAGAGCTCAAAATATGGAATCTTGGCCGTTCTTCAACCAGGTCACAGCAGACTTGAAGCCTTTAAATTCAAGAAAAGTAGCTGTTCAATTTGACACATTCAAAATTCTTGGTCTGATACCGGTCAAAGCACCTGGAAGAGCCCGTGGTGAACTGGATATCACATATTTGGATGAAGAGTTGCGAATATCCAGAGGTGACAAAGGGAATCTCTTTATCTTGAAGATGATCGATCCTTCTTATCGAGTACCTGTATAACACTTTAATCTCTCTACTTTACTAGTAGTATCCTTCTAAAGCCTGATGTCTAAAGCCTGATGAAGCACTAAATCAGAGTGAATATTATGTTATTGTAAATTGATTCTCATGATATCTTCATAAAACTTGTGGCAACCTTCTGATATTTTTTTTGCAATATACCAAATTATGCTTTAA

Coding sequence (CDS)

ATGGCTATGGCTGTGGCCTTATCTTCATCTACAGTTACTCCGACCGCCAAATTCTGGCCTCCTTCAACATCGCAACCACCCAGGTTTCTTCCTATTATTCCCTCAAACCCGTCTTCGAACACTAACCTCCTCACTTCTTCACCTTCTCACAAGTGGAGGCTCAGGATCTCTTTCTTTCCTGCCTTTTTGAACAAAGGAAAAGGAAATAATGTTACTGCTCTTAAACAGGAACTTCTTCAGGCAATTGCCCCCCTTGATCGAGGAGCTGAGGCCACTCCTGAAGATCAGGAAATGGTTGATCAGATATCACGCAAGCTTGAAGCAGTCAACCCTATAAAGGAGCCACTCAAATCTGATTTACTAAATGGAAAATGGGAGCTCATATATACTACTTCCAGATCAATTCTGCAAACTGAGAGGCCCAAGTTTTTTAGGTCAAGAATAAACTATCAAGGAATCAATGTAGATTCTCTCAGAGCTCAAAATATGGAATCTTGGCCGTTCTTCAACCAGGTCACAGCAGACTTGAAGCCTTTAAATTCAAGAAAAGTAGCTGTTCAATTTGACACATTCAAAATTCTTGGTCTGATACCGGTCAAAGCACCTGGAAGAGCCCGTGGTGAACTGGATATCACATATTTGGATGAAGAGTTGCGAATATCCAGAGGTGACAAAGGGAATCTCTTTATCTTGAAGATGATCGATCCTTCTTATCGAGTACCTGTATAA

Protein sequence

MAMAVALSSSTVTPTAKFWPPSTSQPPRFLPIIPSNPSSNTNLLTSSPSHKWRLRISFFPAFLNKGKGNNVTALKQELLQAIAPLDRGAEATPEDQEMVDQISRKLEAVNPIKEPLKSDLLNGKWELIYTTSRSILQTERPKFFRSRINYQGINVDSLRAQNMESWPFFNQVTADLKPLNSRKVAVQFDTFKILGLIPVKAPGRARGELDITYLDEELRISRGDKGNLFILKMIDPSYRVPV*
BLAST of MELO3C010097 vs. Swiss-Prot
Match: PAP4_ARATH (Probable plastid-lipid-associated protein 4, chloroplastic OS=Arabidopsis thaliana GN=PAP4 PE=2 SV=1)

HSP 1 Score: 272.7 bits (696), Expect = 3.9e-72
Identity = 134/207 (64.73%), Postives = 164/207 (79.23%), Query Frame = 1

Query: 41  TNLLTSSPSHKWRLRI----SFFPAFLNK-GKGNNVTALKQELLQAIAPLDRGAEATPED 100
           T L ++    + RLR+    SF PAFL + G+      LKQELL+AI PL+RGA A+P+D
Sbjct: 36  TKLQSTRKGDRERLRVQAIFSFPPAFLTRNGRAEKQKQLKQELLEAIEPLERGATASPDD 95

Query: 101 QEMVDQISRKLEAVNPIKEPLKSDLLNGKWELIYTTSRSILQTERPKFFRSRINYQGINV 160
           Q  +DQ++RK+EAVNP KEPLKSDL+NGKWELIYTTS SILQ ++P+F RS  NYQ INV
Sbjct: 96  QLRIDQLARKVEAVNPTKEPLKSDLVNGKWELIYTTSASILQAKKPRFLRSITNYQSINV 155

Query: 161 DSLRAQNMESWPFFNQVTADLKPLNSRKVAVQFDTFKILGLIPVKAPGRARGELDITYLD 220
           D+L+ QNME+WPF+N VT D+KPLNS+KVAV+   FKILG IP+KAP  ARGEL+ITY+D
Sbjct: 156 DTLKVQNMETWPFYNSVTGDIKPLNSKKVAVKLQVFKILGFIPIKAPDSARGELEITYVD 215

Query: 221 EELRISRGDKGNLFILKMIDPSYRVPV 243
           EELR+SRGDKGNLFILKM DP+YR+P+
Sbjct: 216 EELRLSRGDKGNLFILKMFDPTYRIPL 242

BLAST of MELO3C010097 vs. Swiss-Prot
Match: PAP5_ARATH (Probable plastid-lipid-associated protein 5, chloroplastic OS=Arabidopsis thaliana GN=PAP5 PE=2 SV=1)

HSP 1 Score: 243.0 bits (619), Expect = 3.3e-63
Identity = 124/204 (60.78%), Postives = 153/204 (75.00%), Query Frame = 1

Query: 41  TNLLTSSPSHKWRLRISFFPAFLNKGKG-NNVTALKQELLQAIAPLDRGAEATPEDQEMV 100
           T LL+     + RLRI    +F  +  G      LK EL++AI PL+RGA A+P+DQ ++
Sbjct: 31  TKLLSIRKGDRERLRIQAVFSFPPRNGGAEKRKQLKHELVEAIEPLERGATASPDDQLLI 90

Query: 101 DQISRKLEAVNPIKEPLKSDLLNGKWELIYTTSRSILQTERPKFFRSRINYQGINVDSLR 160
           DQ++RK+EAVNP KEPLKSDL+NGKWELIYTTS +ILQ ++P+F RS  NYQ IN+D+L+
Sbjct: 91  DQLARKVEAVNPTKEPLKSDLINGKWELIYTTSAAILQAKKPRFLRSLTNYQCINMDTLK 150

Query: 161 AQNMESWPFFNQVTADLKPLNSRKVAVQFDTFKILGLIPVKAP-GRARGELDITYLDEEL 220
            Q ME+WPF+N VT DL PLNS+ VAV+   FKILG IPVKAP G ARGEL+ITY+DEEL
Sbjct: 151 VQRMETWPFYNSVTGDLTPLNSKTVAVKLQVFKILGFIPVKAPDGTARGELEITYVDEEL 210

Query: 221 RISRGDKGNLFILKMIDPSYRVPV 243
           RISRG    LFILKM DP+YR+P+
Sbjct: 211 RISRGKGNLLFILKMFDPTYRIPL 234

BLAST of MELO3C010097 vs. Swiss-Prot
Match: PAP8_ARATH (Probable plastid-lipid-associated protein 8, chloroplastic OS=Arabidopsis thaliana GN=PAP8 PE=1 SV=1)

HSP 1 Score: 67.0 bits (162), Expect = 3.2e-10
Identity = 67/248 (27.02%), Postives = 115/248 (46.37%), Query Frame = 1

Query: 5   VALSSSTVTPTAKFWPPSTSQPPRFLPIIPSNPSSNTNLLTSSPSHKWRLRISFFPAFLN 64
           +A ++S++T  + F  P T         I S+   N  L  S P    R R       ++
Sbjct: 1   MAATASSLTIASSFSEPRTQ--------IHSSRRLNLPLQYSIPYKVLRSRSRRLGLVVS 60

Query: 65  KGKGNNVTA------LKQELLQAIAPLDRGAEATPEDQEMVDQISRKLEAVNPIKEPLKS 124
                NV        L   LL  +A  D G   +PE  + V Q++ +L+    +KEP+K+
Sbjct: 61  SVSAPNVELRTGPDDLISTLLSKVANSDGGVTLSPEQHKEVAQVAGELQKYC-VKEPVKN 120

Query: 125 DLLNGKWELIYTTS--------RSILQTERPKFFRSRINYQGINVDSL--RAQNMESWPF 184
            L+ G WE++Y +         RS++      FF+++   Q I+   +     ++ ++ F
Sbjct: 121 PLIFGDWEVVYCSRPTSPGGGYRSVIGR---LFFKTKEMIQAIDAPDIVRNKVSINAFGF 180

Query: 185 FN---QVTADLKPLNSRKVAVQFDTFKI-LGLIPVKAPGRARGELDITYLDEELRISRGD 233
            +    +T  LK L+S  V V F+  +I +G +  K    +  +L ITY+DE+LR+  G 
Sbjct: 181 LDGDVSLTGKLKALDSEWVQVIFEPPEIKVGSLEFKYGFESEVKLRITYVDEKLRLGLGS 236

BLAST of MELO3C010097 vs. Swiss-Prot
Match: PAP6_ARATH (Probable plastid-lipid-associated protein 6, chloroplastic OS=Arabidopsis thaliana GN=PAP6 PE=1 SV=1)

HSP 1 Score: 58.5 bits (140), Expect = 1.1e-07
Identity = 58/202 (28.71%), Postives = 91/202 (45.05%), Query Frame = 1

Query: 66  GKGNNVTALKQELLQAIAPLDRGAEATPEDQEMVDQISRKLE-AVNPIKEPLKSDLLNGK 125
           G    +  LK +LL  ++ L+RG  A+ +D E  +  +++LE A  P+      D L GK
Sbjct: 79  GDDKQIALLKLKLLSVVSGLNRGLVASVDDLERAEVAAKELETAGGPVDLTDDLDKLQGK 138

Query: 126 WELIYTTSRSI--LQTERPKFFRSRIN-------YQGINVDSLRAQNMES------WPFF 185
           W L+Y+++ S   L   RP     R+        +Q I+V S    N+        WPF 
Sbjct: 139 WRLLYSSAFSSRSLGGSRPGLPTGRLIPVTLGQVFQRIDVFSKDFDNIAEVELGAPWPFP 198

Query: 186 N-QVTADL----KPLNSRKVAVQFD--TFKILGLI----------------PVKAPGRAR 229
             + TA L    + L + K+ + F+  T K  G +                P   PG   
Sbjct: 199 PLEATATLAHKFELLGTCKIKITFEKTTVKTSGNLSQIPPFDIPRLPDSFRPSSNPGT-- 258

BLAST of MELO3C010097 vs. Swiss-Prot
Match: PAP12_ARATH (Probable plastid-lipid-associated protein 12, chloroplastic OS=Arabidopsis thaliana GN=PAP12 PE=2 SV=1)

HSP 1 Score: 57.8 bits (138), Expect = 2.0e-07
Identity = 46/166 (27.71%), Postives = 74/166 (44.58%), Query Frame = 1

Query: 87  RGAEATPEDQEMVDQISRKLEAVNPIKEPLKSDLLNGKWELIYTTSRSILQTERPKFFRS 146
           RG  A+P+    V+   + LE +  I+ P  SDL+ G+W L++TT        +  F   
Sbjct: 88  RGKSASPKQLNDVESAVKVLEGLEGIQNPTDSDLIEGRWRLMFTTRPGTASPIQRTFTGV 147

Query: 147 RI--NYQGINVDSLRAQNMESWPFFNQVTADLKP------LNSRKVAVQFDTFKI-LGLI 206
            +   +Q + + +     + +   F+    +LK        + ++V  +FD     L  +
Sbjct: 148 DVFTVFQDVYLKATNDPRVSNIVKFSDFIGELKVEAVASIKDGKRVLFRFDRAAFDLKFL 207

Query: 207 PVKAP---------GRARGELDITYLDE--ELRISRGDKGNLFILK 233
           P K P           A+G LD TYL     LRISRG+KG  F+L+
Sbjct: 208 PFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQ 253

BLAST of MELO3C010097 vs. TrEMBL
Match: A0A0A0M3B4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G665380 PE=4 SV=1)

HSP 1 Score: 463.4 bits (1191), Expect = 1.7e-127
Identity = 233/243 (95.88%), Postives = 236/243 (97.12%), Query Frame = 1

Query: 1   MAMAVALSSSTVTPTAKFWPPSTSQPPRFL-PIIPSNPSSNTNLLTSSPSHKWRLRISFF 60
           MAMAVALSSST TPT K WPPSTSQPPRFL PIIPSNPSSNTNLLTSSPSHKWRLRISFF
Sbjct: 1   MAMAVALSSSTATPTTKLWPPSTSQPPRFLLPIIPSNPSSNTNLLTSSPSHKWRLRISFF 60

Query: 61  PAFLNKGKGNNVTALKQELLQAIAPLDRGAEATPEDQEMVDQISRKLEAVNPIKEPLKSD 120
           PAFLNKGKGNNVTALKQELLQAI PLDRGAEATPEDQEMVDQISRKLEAVNP KEPLKSD
Sbjct: 61  PAFLNKGKGNNVTALKQELLQAIEPLDRGAEATPEDQEMVDQISRKLEAVNPTKEPLKSD 120

Query: 121 LLNGKWELIYTTSRSILQTERPKFFRSRINYQGINVDSLRAQNMESWPFFNQVTADLKPL 180
           LLNGKWELIYTTSRSILQTERPKF RS++NYQGINVDSLRAQNMESWPFFNQVTADLKPL
Sbjct: 121 LLNGKWELIYTTSRSILQTERPKFLRSKLNYQGINVDSLRAQNMESWPFFNQVTADLKPL 180

Query: 181 NSRKVAVQFDTFKILGLIPVKAPGRARGELDITYLDEELRISRGDKGNLFILKMIDPSYR 240
           NSRKVAVQFDTFKILGLIPVKAPGRARGEL+ITYLDEELRISRGDKGNLFILKMIDPSYR
Sbjct: 181 NSRKVAVQFDTFKILGLIPVKAPGRARGELEITYLDEELRISRGDKGNLFILKMIDPSYR 240

Query: 241 VPV 243
           VPV
Sbjct: 241 VPV 243

BLAST of MELO3C010097 vs. TrEMBL
Match: A0A0D2Q9W8_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_002G209300 PE=4 SV=1)

HSP 1 Score: 338.6 bits (867), Expect = 6.4e-90
Identity = 175/238 (73.53%), Postives = 200/238 (84.03%), Query Frame = 1

Query: 7   LSSSTVTPTAKFWPPSTSQPPRFLPII-PSNPSS-NTNLLTSSPSHKWRLRISFFPAFLN 66
           L++S   P+ K  PPS S P RF  +I P +P++ N +  +S+P  KWR+ +SFFPAFLN
Sbjct: 11  LTTSLSAPSPKL-PPSISSPQRFPFLIKPLHPNAFNLSTSSSAPDQKWRVNVSFFPAFLN 70

Query: 67  KGKGNNVTALKQELLQAIAPLDRGAEATPEDQEMVDQISRKLEAVNPIKEPLKSDLLNGK 126
           KGK   V  LKQ+LL  IAPLDRGA+ATPEDQ+ VDQ++ KLEAVNP K+PLKSDLLNGK
Sbjct: 71  KGKDAKV--LKQDLLDCIAPLDRGADATPEDQQRVDQLASKLEAVNPTKQPLKSDLLNGK 130

Query: 127 WELIYTTSRSILQTERPKFFRSRINYQGINVDSLRAQNMESWPFFNQVTADLKPLNSRKV 186
           WELIYTTS+SILQT+RPKF RS  NYQ INVD+LRAQNMESWPFFNQVTADL PLN+RKV
Sbjct: 131 WELIYTTSKSILQTQRPKFLRSSTNYQAINVDTLRAQNMESWPFFNQVTADLTPLNARKV 190

Query: 187 AVQFDTFKILGLIPVKAPGRARGELDITYLDEELRISRGDKGNLFILKMIDPSYRVPV 243
           AV+FD FKI GLIP+KAPGRARGEL+ITYLDEELRISRGD GNLFILKMIDPSYRVPV
Sbjct: 191 AVKFDYFKIGGLIPIKAPGRARGELEITYLDEELRISRGDLGNLFILKMIDPSYRVPV 245

BLAST of MELO3C010097 vs. TrEMBL
Match: A0A0B0PAQ3_GOSAR (Putative plastid-lipid-associated 4, chloroplastic-like protein OS=Gossypium arboreum GN=F383_27514 PE=4 SV=1)

HSP 1 Score: 335.5 bits (859), Expect = 5.4e-89
Identity = 175/244 (71.72%), Postives = 199/244 (81.56%), Query Frame = 1

Query: 2   AMAVALSSSTVTPTAKFWPPSTSQPPRFLPII--PSNPSS-NTNLLTSSPSHKWRLRISF 61
           A    L++S   P+ K  PP  S P RF P +  P  P++ N +  +S+P  KWR+ +SF
Sbjct: 6   AFHTILTTSVSAPSPKL-PPFISSPQRF-PFLTKPLRPNAFNLSTSSSAPDQKWRVNVSF 65

Query: 62  FPAFLNKGKGNNVTALKQELLQAIAPLDRGAEATPEDQEMVDQISRKLEAVNPIKEPLKS 121
           FPAFLNKGK   V  LKQ+LL  IAPLDRGA+ATPEDQ+ VDQ++ KLEAVNP K+PLKS
Sbjct: 66  FPAFLNKGKDAKV--LKQDLLDCIAPLDRGADATPEDQQRVDQLASKLEAVNPTKQPLKS 125

Query: 122 DLLNGKWELIYTTSRSILQTERPKFFRSRINYQGINVDSLRAQNMESWPFFNQVTADLKP 181
           DLLNGKWELIYTTS+SILQT+RPKF RS  NYQ INVD+LRAQNMESWPFFNQVTADL P
Sbjct: 126 DLLNGKWELIYTTSKSILQTQRPKFLRSSTNYQAINVDTLRAQNMESWPFFNQVTADLTP 185

Query: 182 LNSRKVAVQFDTFKILGLIPVKAPGRARGELDITYLDEELRISRGDKGNLFILKMIDPSY 241
           LN+RKVAV+FD FKI GLIP+KAPGRARGEL+ITYLDEELRISRGD GNLFILKMIDPSY
Sbjct: 186 LNARKVAVKFDYFKIGGLIPIKAPGRARGELEITYLDEELRISRGDLGNLFILKMIDPSY 245

Query: 242 RVPV 243
           RVPV
Sbjct: 246 RVPV 245

BLAST of MELO3C010097 vs. TrEMBL
Match: A0A068TNK3_COFCA (Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00014907001 PE=4 SV=1)

HSP 1 Score: 323.9 bits (829), Expect = 1.6e-85
Identity = 171/249 (68.67%), Postives = 197/249 (79.12%), Query Frame = 1

Query: 1   MAMAVALSSSTVTPTAKFWPPSTSQPPRFLPI-IPSNPSSNTNLLTSSP-------SHKW 60
           MA++  L   T+  T+   PP++S      P   P       ++L++SP       S KW
Sbjct: 7   MALSSLLPPQTLHITSSHHPPNSSLKVFSFPTRSPQQNCHPKSILSTSPPSFTWVPSQKW 66

Query: 61  RLRISFFPAFLNKGKGNNVTALKQELLQAIAPLDRGAEATPEDQEMVDQISRKLEAVNPI 120
           R  ISFFPAFL K K  +  A+K+ELL+AIAPLDRGAEATPEDQ+ +DQI+RKLEAV+PI
Sbjct: 67  RTYISFFPAFL-KNKAKDAKAIKEELLEAIAPLDRGAEATPEDQQSIDQITRKLEAVSPI 126

Query: 121 KEPLKSDLLNGKWELIYTTSRSILQTERPKFFRSRINYQGINVDSLRAQNMESWPFFNQV 180
           KEPLKSDLLNGKWELIYTTS+SILQTERPK  RS+ NYQ INVD+LRAQNMES PFFNQV
Sbjct: 127 KEPLKSDLLNGKWELIYTTSQSILQTERPKILRSKTNYQAINVDTLRAQNMESCPFFNQV 186

Query: 181 TADLKPLNSRKVAVQFDTFKILGLIPVKAPGRARGELDITYLDEELRISRGDKGNLFILK 240
           TADL PLN+RKVAV+FD FKI GLIPVKAPGRARGEL+ITYLDEELR+SRGD GNLFILK
Sbjct: 187 TADLTPLNARKVAVKFDYFKIAGLIPVKAPGRARGELEITYLDEELRVSRGDLGNLFILK 246

Query: 241 MIDPSYRVP 242
           M+DPSYRVP
Sbjct: 247 MVDPSYRVP 254

BLAST of MELO3C010097 vs. TrEMBL
Match: M5XGR1_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa011650mg PE=4 SV=1)

HSP 1 Score: 323.6 bits (828), Expect = 2.1e-85
Identity = 160/200 (80.00%), Postives = 179/200 (89.50%), Query Frame = 1

Query: 43  LLTSSPSHKWRLRISFFPAFLNKGKGNNVTALKQELLQAIAPLDRGAEATPEDQEMVDQI 102
           L ++S S KWR ++SFFPAF +KGK  +   LK+ELL AIA LDRGA+ATPEDQ+ VDQI
Sbjct: 5   LSSTSASDKWRAKVSFFPAFSSKGK--DAKTLKEELLDAIASLDRGADATPEDQQTVDQI 64

Query: 103 SRKLEAVNPIKEPLKSDLLNGKWELIYTTSRSILQTERPKFFRSRINYQGINVDSLRAQN 162
           +RKLEAVNP KEPLKSDLLNGKWELIYTTS+SILQT+RPKF RSR+NYQ IN D+LRAQN
Sbjct: 65  ARKLEAVNPTKEPLKSDLLNGKWELIYTTSKSILQTQRPKFLRSRVNYQAINADTLRAQN 124

Query: 163 MESWPFFNQVTADLKPLNSRKVAVQFDTFKILGLIPVKAPGRARGELDITYLDEELRISR 222
           MESWP FNQVTADL PLN+RKVAV+FD FKI GLIPVKAPGRARGEL+ITYLDEELR+SR
Sbjct: 125 MESWPTFNQVTADLTPLNARKVAVKFDYFKIAGLIPVKAPGRARGELEITYLDEELRVSR 184

Query: 223 GDKGNLFILKMIDPSYRVPV 243
           GDKGNLFILKM+DPSYRVPV
Sbjct: 185 GDKGNLFILKMVDPSYRVPV 202

BLAST of MELO3C010097 vs. TAIR10
Match: AT3G26070.1 (AT3G26070.1 Plastid-lipid associated protein PAP / fibrillin family protein)

HSP 1 Score: 272.7 bits (696), Expect = 2.2e-73
Identity = 134/207 (64.73%), Postives = 164/207 (79.23%), Query Frame = 1

Query: 41  TNLLTSSPSHKWRLRI----SFFPAFLNK-GKGNNVTALKQELLQAIAPLDRGAEATPED 100
           T L ++    + RLR+    SF PAFL + G+      LKQELL+AI PL+RGA A+P+D
Sbjct: 36  TKLQSTRKGDRERLRVQAIFSFPPAFLTRNGRAEKQKQLKQELLEAIEPLERGATASPDD 95

Query: 101 QEMVDQISRKLEAVNPIKEPLKSDLLNGKWELIYTTSRSILQTERPKFFRSRINYQGINV 160
           Q  +DQ++RK+EAVNP KEPLKSDL+NGKWELIYTTS SILQ ++P+F RS  NYQ INV
Sbjct: 96  QLRIDQLARKVEAVNPTKEPLKSDLVNGKWELIYTTSASILQAKKPRFLRSITNYQSINV 155

Query: 161 DSLRAQNMESWPFFNQVTADLKPLNSRKVAVQFDTFKILGLIPVKAPGRARGELDITYLD 220
           D+L+ QNME+WPF+N VT D+KPLNS+KVAV+   FKILG IP+KAP  ARGEL+ITY+D
Sbjct: 156 DTLKVQNMETWPFYNSVTGDIKPLNSKKVAVKLQVFKILGFIPIKAPDSARGELEITYVD 215

Query: 221 EELRISRGDKGNLFILKMIDPSYRVPV 243
           EELR+SRGDKGNLFILKM DP+YR+P+
Sbjct: 216 EELRLSRGDKGNLFILKMFDPTYRIPL 242

BLAST of MELO3C010097 vs. TAIR10
Match: AT3G26080.1 (AT3G26080.1 plastid-lipid associated protein PAP / fibrillin family protein)

HSP 1 Score: 243.0 bits (619), Expect = 1.8e-64
Identity = 124/204 (60.78%), Postives = 153/204 (75.00%), Query Frame = 1

Query: 41  TNLLTSSPSHKWRLRISFFPAFLNKGKG-NNVTALKQELLQAIAPLDRGAEATPEDQEMV 100
           T LL+     + RLRI    +F  +  G      LK EL++AI PL+RGA A+P+DQ ++
Sbjct: 31  TKLLSIRKGDRERLRIQAVFSFPPRNGGAEKRKQLKHELVEAIEPLERGATASPDDQLLI 90

Query: 101 DQISRKLEAVNPIKEPLKSDLLNGKWELIYTTSRSILQTERPKFFRSRINYQGINVDSLR 160
           DQ++RK+EAVNP KEPLKSDL+NGKWELIYTTS +ILQ ++P+F RS  NYQ IN+D+L+
Sbjct: 91  DQLARKVEAVNPTKEPLKSDLINGKWELIYTTSAAILQAKKPRFLRSLTNYQCINMDTLK 150

Query: 161 AQNMESWPFFNQVTADLKPLNSRKVAVQFDTFKILGLIPVKAP-GRARGELDITYLDEEL 220
            Q ME+WPF+N VT DL PLNS+ VAV+   FKILG IPVKAP G ARGEL+ITY+DEEL
Sbjct: 151 VQRMETWPFYNSVTGDLTPLNSKTVAVKLQVFKILGFIPVKAPDGTARGELEITYVDEEL 210

Query: 221 RISRGDKGNLFILKMIDPSYRVPV 243
           RISRG    LFILKM DP+YR+P+
Sbjct: 211 RISRGKGNLLFILKMFDPTYRIPL 234

BLAST of MELO3C010097 vs. TAIR10
Match: AT5G19940.1 (AT5G19940.1 Plastid-lipid associated protein PAP / fibrillin family protein)

HSP 1 Score: 67.0 bits (162), Expect = 1.8e-11
Identity = 67/248 (27.02%), Postives = 115/248 (46.37%), Query Frame = 1

Query: 5   VALSSSTVTPTAKFWPPSTSQPPRFLPIIPSNPSSNTNLLTSSPSHKWRLRISFFPAFLN 64
           +A ++S++T  + F  P T         I S+   N  L  S P    R R       ++
Sbjct: 1   MAATASSLTIASSFSEPRTQ--------IHSSRRLNLPLQYSIPYKVLRSRSRRLGLVVS 60

Query: 65  KGKGNNVTA------LKQELLQAIAPLDRGAEATPEDQEMVDQISRKLEAVNPIKEPLKS 124
                NV        L   LL  +A  D G   +PE  + V Q++ +L+    +KEP+K+
Sbjct: 61  SVSAPNVELRTGPDDLISTLLSKVANSDGGVTLSPEQHKEVAQVAGELQKYC-VKEPVKN 120

Query: 125 DLLNGKWELIYTTS--------RSILQTERPKFFRSRINYQGINVDSL--RAQNMESWPF 184
            L+ G WE++Y +         RS++      FF+++   Q I+   +     ++ ++ F
Sbjct: 121 PLIFGDWEVVYCSRPTSPGGGYRSVIGR---LFFKTKEMIQAIDAPDIVRNKVSINAFGF 180

Query: 185 FN---QVTADLKPLNSRKVAVQFDTFKI-LGLIPVKAPGRARGELDITYLDEELRISRGD 233
            +    +T  LK L+S  V V F+  +I +G +  K    +  +L ITY+DE+LR+  G 
Sbjct: 181 LDGDVSLTGKLKALDSEWVQVIFEPPEIKVGSLEFKYGFESEVKLRITYVDEKLRLGLGS 236

BLAST of MELO3C010097 vs. TAIR10
Match: AT3G23400.1 (AT3G23400.1 Plastid-lipid associated protein PAP / fibrillin family protein)

HSP 1 Score: 58.5 bits (140), Expect = 6.5e-09
Identity = 58/202 (28.71%), Postives = 91/202 (45.05%), Query Frame = 1

Query: 66  GKGNNVTALKQELLQAIAPLDRGAEATPEDQEMVDQISRKLE-AVNPIKEPLKSDLLNGK 125
           G    +  LK +LL  ++ L+RG  A+ +D E  +  +++LE A  P+      D L GK
Sbjct: 79  GDDKQIALLKLKLLSVVSGLNRGLVASVDDLERAEVAAKELETAGGPVDLTDDLDKLQGK 138

Query: 126 WELIYTTSRSI--LQTERPKFFRSRIN-------YQGINVDSLRAQNMES------WPFF 185
           W L+Y+++ S   L   RP     R+        +Q I+V S    N+        WPF 
Sbjct: 139 WRLLYSSAFSSRSLGGSRPGLPTGRLIPVTLGQVFQRIDVFSKDFDNIAEVELGAPWPFP 198

Query: 186 N-QVTADL----KPLNSRKVAVQFD--TFKILGLI----------------PVKAPGRAR 229
             + TA L    + L + K+ + F+  T K  G +                P   PG   
Sbjct: 199 PLEATATLAHKFELLGTCKIKITFEKTTVKTSGNLSQIPPFDIPRLPDSFRPSSNPGT-- 258

BLAST of MELO3C010097 vs. TAIR10
Match: AT1G51110.1 (AT1G51110.1 Plastid-lipid associated protein PAP / fibrillin family protein)

HSP 1 Score: 57.8 bits (138), Expect = 1.1e-08
Identity = 46/166 (27.71%), Postives = 74/166 (44.58%), Query Frame = 1

Query: 87  RGAEATPEDQEMVDQISRKLEAVNPIKEPLKSDLLNGKWELIYTTSRSILQTERPKFFRS 146
           RG  A+P+    V+   + LE +  I+ P  SDL+ G+W L++TT        +  F   
Sbjct: 88  RGKSASPKQLNDVESAVKVLEGLEGIQNPTDSDLIEGRWRLMFTTRPGTASPIQRTFTGV 147

Query: 147 RI--NYQGINVDSLRAQNMESWPFFNQVTADLKP------LNSRKVAVQFDTFKI-LGLI 206
            +   +Q + + +     + +   F+    +LK        + ++V  +FD     L  +
Sbjct: 148 DVFTVFQDVYLKATNDPRVSNIVKFSDFIGELKVEAVASIKDGKRVLFRFDRAAFDLKFL 207

Query: 207 PVKAP---------GRARGELDITYLDE--ELRISRGDKGNLFILK 233
           P K P           A+G LD TYL     LRISRG+KG  F+L+
Sbjct: 208 PFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQ 253

BLAST of MELO3C010097 vs. NCBI nr
Match: gi|659086052|ref|XP_008443741.1| (PREDICTED: probable plastid-lipid-associated protein 4, chloroplastic isoform X1 [Cucumis melo])

HSP 1 Score: 483.0 bits (1242), Expect = 3.0e-133
Identity = 242/242 (100.00%), Postives = 242/242 (100.00%), Query Frame = 1

Query: 1   MAMAVALSSSTVTPTAKFWPPSTSQPPRFLPIIPSNPSSNTNLLTSSPSHKWRLRISFFP 60
           MAMAVALSSSTVTPTAKFWPPSTSQPPRFLPIIPSNPSSNTNLLTSSPSHKWRLRISFFP
Sbjct: 1   MAMAVALSSSTVTPTAKFWPPSTSQPPRFLPIIPSNPSSNTNLLTSSPSHKWRLRISFFP 60

Query: 61  AFLNKGKGNNVTALKQELLQAIAPLDRGAEATPEDQEMVDQISRKLEAVNPIKEPLKSDL 120
           AFLNKGKGNNVTALKQELLQAIAPLDRGAEATPEDQEMVDQISRKLEAVNPIKEPLKSDL
Sbjct: 61  AFLNKGKGNNVTALKQELLQAIAPLDRGAEATPEDQEMVDQISRKLEAVNPIKEPLKSDL 120

Query: 121 LNGKWELIYTTSRSILQTERPKFFRSRINYQGINVDSLRAQNMESWPFFNQVTADLKPLN 180
           LNGKWELIYTTSRSILQTERPKFFRSRINYQGINVDSLRAQNMESWPFFNQVTADLKPLN
Sbjct: 121 LNGKWELIYTTSRSILQTERPKFFRSRINYQGINVDSLRAQNMESWPFFNQVTADLKPLN 180

Query: 181 SRKVAVQFDTFKILGLIPVKAPGRARGELDITYLDEELRISRGDKGNLFILKMIDPSYRV 240
           SRKVAVQFDTFKILGLIPVKAPGRARGELDITYLDEELRISRGDKGNLFILKMIDPSYRV
Sbjct: 181 SRKVAVQFDTFKILGLIPVKAPGRARGELDITYLDEELRISRGDKGNLFILKMIDPSYRV 240

Query: 241 PV 243
           PV
Sbjct: 241 PV 242

BLAST of MELO3C010097 vs. NCBI nr
Match: gi|778664229|ref|XP_011660249.1| (PREDICTED: probable plastid-lipid-associated protein 4, chloroplastic isoform X1 [Cucumis sativus])

HSP 1 Score: 463.4 bits (1191), Expect = 2.5e-127
Identity = 233/243 (95.88%), Postives = 236/243 (97.12%), Query Frame = 1

Query: 1   MAMAVALSSSTVTPTAKFWPPSTSQPPRFL-PIIPSNPSSNTNLLTSSPSHKWRLRISFF 60
           MAMAVALSSST TPT K WPPSTSQPPRFL PIIPSNPSSNTNLLTSSPSHKWRLRISFF
Sbjct: 1   MAMAVALSSSTATPTTKLWPPSTSQPPRFLLPIIPSNPSSNTNLLTSSPSHKWRLRISFF 60

Query: 61  PAFLNKGKGNNVTALKQELLQAIAPLDRGAEATPEDQEMVDQISRKLEAVNPIKEPLKSD 120
           PAFLNKGKGNNVTALKQELLQAI PLDRGAEATPEDQEMVDQISRKLEAVNP KEPLKSD
Sbjct: 61  PAFLNKGKGNNVTALKQELLQAIEPLDRGAEATPEDQEMVDQISRKLEAVNPTKEPLKSD 120

Query: 121 LLNGKWELIYTTSRSILQTERPKFFRSRINYQGINVDSLRAQNMESWPFFNQVTADLKPL 180
           LLNGKWELIYTTSRSILQTERPKF RS++NYQGINVDSLRAQNMESWPFFNQVTADLKPL
Sbjct: 121 LLNGKWELIYTTSRSILQTERPKFLRSKLNYQGINVDSLRAQNMESWPFFNQVTADLKPL 180

Query: 181 NSRKVAVQFDTFKILGLIPVKAPGRARGELDITYLDEELRISRGDKGNLFILKMIDPSYR 240
           NSRKVAVQFDTFKILGLIPVKAPGRARGEL+ITYLDEELRISRGDKGNLFILKMIDPSYR
Sbjct: 181 NSRKVAVQFDTFKILGLIPVKAPGRARGELEITYLDEELRISRGDKGNLFILKMIDPSYR 240

Query: 241 VPV 243
           VPV
Sbjct: 241 VPV 243

BLAST of MELO3C010097 vs. NCBI nr
Match: gi|659086054|ref|XP_008443742.1| (PREDICTED: probable plastid-lipid-associated protein 4, chloroplastic isoform X2 [Cucumis melo])

HSP 1 Score: 392.5 bits (1007), Expect = 5.3e-106
Identity = 196/196 (100.00%), Postives = 196/196 (100.00%), Query Frame = 1

Query: 1   MAMAVALSSSTVTPTAKFWPPSTSQPPRFLPIIPSNPSSNTNLLTSSPSHKWRLRISFFP 60
           MAMAVALSSSTVTPTAKFWPPSTSQPPRFLPIIPSNPSSNTNLLTSSPSHKWRLRISFFP
Sbjct: 1   MAMAVALSSSTVTPTAKFWPPSTSQPPRFLPIIPSNPSSNTNLLTSSPSHKWRLRISFFP 60

Query: 61  AFLNKGKGNNVTALKQELLQAIAPLDRGAEATPEDQEMVDQISRKLEAVNPIKEPLKSDL 120
           AFLNKGKGNNVTALKQELLQAIAPLDRGAEATPEDQEMVDQISRKLEAVNPIKEPLKSDL
Sbjct: 61  AFLNKGKGNNVTALKQELLQAIAPLDRGAEATPEDQEMVDQISRKLEAVNPIKEPLKSDL 120

Query: 121 LNGKWELIYTTSRSILQTERPKFFRSRINYQGINVDSLRAQNMESWPFFNQVTADLKPLN 180
           LNGKWELIYTTSRSILQTERPKFFRSRINYQGINVDSLRAQNMESWPFFNQVTADLKPLN
Sbjct: 121 LNGKWELIYTTSRSILQTERPKFFRSRINYQGINVDSLRAQNMESWPFFNQVTADLKPLN 180

Query: 181 SRKVAVQFDTFKILGL 197
           SRKVAVQFDTFKILGL
Sbjct: 181 SRKVAVQFDTFKILGL 196

BLAST of MELO3C010097 vs. NCBI nr
Match: gi|778664232|ref|XP_011660250.1| (PREDICTED: probable plastid-lipid-associated protein 4, chloroplastic isoform X2 [Cucumis sativus])

HSP 1 Score: 374.4 bits (960), Expect = 1.5e-100
Identity = 188/197 (95.43%), Postives = 190/197 (96.45%), Query Frame = 1

Query: 1   MAMAVALSSSTVTPTAKFWPPSTSQPPRFL-PIIPSNPSSNTNLLTSSPSHKWRLRISFF 60
           MAMAVALSSST TPT K WPPSTSQPPRFL PIIPSNPSSNTNLLTSSPSHKWRLRISFF
Sbjct: 1   MAMAVALSSSTATPTTKLWPPSTSQPPRFLLPIIPSNPSSNTNLLTSSPSHKWRLRISFF 60

Query: 61  PAFLNKGKGNNVTALKQELLQAIAPLDRGAEATPEDQEMVDQISRKLEAVNPIKEPLKSD 120
           PAFLNKGKGNNVTALKQELLQAI PLDRGAEATPEDQEMVDQISRKLEAVNP KEPLKSD
Sbjct: 61  PAFLNKGKGNNVTALKQELLQAIEPLDRGAEATPEDQEMVDQISRKLEAVNPTKEPLKSD 120

Query: 121 LLNGKWELIYTTSRSILQTERPKFFRSRINYQGINVDSLRAQNMESWPFFNQVTADLKPL 180
           LLNGKWELIYTTSRSILQTERPKF RS++NYQGINVDSLRAQNMESWPFFNQVTADLKPL
Sbjct: 121 LLNGKWELIYTTSRSILQTERPKFLRSKLNYQGINVDSLRAQNMESWPFFNQVTADLKPL 180

Query: 181 NSRKVAVQFDTFKILGL 197
           NSRKVAVQFDTFKILGL
Sbjct: 181 NSRKVAVQFDTFKILGL 197

BLAST of MELO3C010097 vs. NCBI nr
Match: gi|823135922|ref|XP_012467724.1| (PREDICTED: probable plastid-lipid-associated protein 4, chloroplastic [Gossypium raimondii])

HSP 1 Score: 338.6 bits (867), Expect = 9.2e-90
Identity = 175/238 (73.53%), Postives = 200/238 (84.03%), Query Frame = 1

Query: 7   LSSSTVTPTAKFWPPSTSQPPRFLPII-PSNPSS-NTNLLTSSPSHKWRLRISFFPAFLN 66
           L++S   P+ K  PPS S P RF  +I P +P++ N +  +S+P  KWR+ +SFFPAFLN
Sbjct: 11  LTTSLSAPSPKL-PPSISSPQRFPFLIKPLHPNAFNLSTSSSAPDQKWRVNVSFFPAFLN 70

Query: 67  KGKGNNVTALKQELLQAIAPLDRGAEATPEDQEMVDQISRKLEAVNPIKEPLKSDLLNGK 126
           KGK   V  LKQ+LL  IAPLDRGA+ATPEDQ+ VDQ++ KLEAVNP K+PLKSDLLNGK
Sbjct: 71  KGKDAKV--LKQDLLDCIAPLDRGADATPEDQQRVDQLASKLEAVNPTKQPLKSDLLNGK 130

Query: 127 WELIYTTSRSILQTERPKFFRSRINYQGINVDSLRAQNMESWPFFNQVTADLKPLNSRKV 186
           WELIYTTS+SILQT+RPKF RS  NYQ INVD+LRAQNMESWPFFNQVTADL PLN+RKV
Sbjct: 131 WELIYTTSKSILQTQRPKFLRSSTNYQAINVDTLRAQNMESWPFFNQVTADLTPLNARKV 190

Query: 187 AVQFDTFKILGLIPVKAPGRARGELDITYLDEELRISRGDKGNLFILKMIDPSYRVPV 243
           AV+FD FKI GLIP+KAPGRARGEL+ITYLDEELRISRGD GNLFILKMIDPSYRVPV
Sbjct: 191 AVKFDYFKIGGLIPIKAPGRARGELEITYLDEELRISRGDLGNLFILKMIDPSYRVPV 245

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PAP4_ARATH3.9e-7264.73Probable plastid-lipid-associated protein 4, chloroplastic OS=Arabidopsis thalia... [more]
PAP5_ARATH3.3e-6360.78Probable plastid-lipid-associated protein 5, chloroplastic OS=Arabidopsis thalia... [more]
PAP8_ARATH3.2e-1027.02Probable plastid-lipid-associated protein 8, chloroplastic OS=Arabidopsis thalia... [more]
PAP6_ARATH1.1e-0728.71Probable plastid-lipid-associated protein 6, chloroplastic OS=Arabidopsis thalia... [more]
PAP12_ARATH2.0e-0727.71Probable plastid-lipid-associated protein 12, chloroplastic OS=Arabidopsis thali... [more]
Match NameE-valueIdentityDescription
A0A0A0M3B4_CUCSA1.7e-12795.88Uncharacterized protein OS=Cucumis sativus GN=Csa_1G665380 PE=4 SV=1[more]
A0A0D2Q9W8_GOSRA6.4e-9073.53Uncharacterized protein OS=Gossypium raimondii GN=B456_002G209300 PE=4 SV=1[more]
A0A0B0PAQ3_GOSAR5.4e-8971.72Putative plastid-lipid-associated 4, chloroplastic-like protein OS=Gossypium arb... [more]
A0A068TNK3_COFCA1.6e-8568.67Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00014907001 PE=4 SV=1[more]
M5XGR1_PRUPE2.1e-8580.00Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa011650mg PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT3G26070.12.2e-7364.73 Plastid-lipid associated protein PAP / fibrillin family protein[more]
AT3G26080.11.8e-6460.78 plastid-lipid associated protein PAP / fibrillin family protein[more]
AT5G19940.11.8e-1127.02 Plastid-lipid associated protein PAP / fibrillin family protein[more]
AT3G23400.16.5e-0928.71 Plastid-lipid associated protein PAP / fibrillin family protein[more]
AT1G51110.11.1e-0827.71 Plastid-lipid associated protein PAP / fibrillin family protein[more]
Match NameE-valueIdentityDescription
gi|659086052|ref|XP_008443741.1|3.0e-133100.00PREDICTED: probable plastid-lipid-associated protein 4, chloroplastic isoform X1... [more]
gi|778664229|ref|XP_011660249.1|2.5e-12795.88PREDICTED: probable plastid-lipid-associated protein 4, chloroplastic isoform X1... [more]
gi|659086054|ref|XP_008443742.1|5.3e-106100.00PREDICTED: probable plastid-lipid-associated protein 4, chloroplastic isoform X2... [more]
gi|778664232|ref|XP_011660250.1|1.5e-10095.43PREDICTED: probable plastid-lipid-associated protein 4, chloroplastic isoform X2... [more]
gi|823135922|ref|XP_012467724.1|9.2e-9073.53PREDICTED: probable plastid-lipid-associated protein 4, chloroplastic [Gossypium... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR006843PAP/fibrillin_dom
IPR019825Lectin_legB_Mn/Ca_BS
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
This gene is associated with the following unigenes:
Unigene NameAnalysis NameSequence type in Unigene
MU47414melon EST collection version 4.0transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
MELO3C010097T1MELO3C010097T1mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature NameUnique NameType
MU47414MU47414transcribed_cluster


Analysis Name: InterPro Annotations of melon
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR006843Plastid lipid-associated protein/fibrillin conserved domainPFAMPF04755PAP_fibrillincoord: 73..232
score: 2.8
IPR019825Legume lectin, beta chain, Mn/Ca-binding sitePROSITEPS00307LECTIN_LEGUME_BETAcoord: 184..190
scor
NoneNo IPR availablePANTHERPTHR31906FAMILY NOT NAMEDcoord: 33..242
score: 6.0E
NoneNo IPR availablePANTHERPTHR31906:SF14SUBFAMILY NOT NAMEDcoord: 33..242
score: 6.0E