PI0016811 (gene) Melon (PI 482460) v1

Overview
Name: PI0016811
Type: gene
Organism: Cucumis metuliferus (Melon (PI 482460) v1)
Description: protein ROS1
Location: chr04: 4794232 .. 4805321 (+)
RNA-Seq Expression: PI0016811
Synteny: PI0016811
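
The genomic span of the gene can be derived from the Location field. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (as in GFF3) and that the field always has the form `chrNN: start .. end (strand)`. The function name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Parse a location string such as 'chr04: 4794232 .. 4805321 (+)'.

    Returns (chromosome, start, end, strand). The format is an assumption
    based on the single Location value shown in this record.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

chrom, start, end, strand = parse_location("chr04: 4794232 .. 4805321 (+)")
# Inclusive coordinates: both end points count towards the length.
span = end - start + 1
print(chrom, strand, span)  # chr04 + 11090
```

Under these assumptions the gene, including introns and UTRs, covers 11,090 bp on the forward strand of chr04.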
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GTTCAGTTTCTTTTAACAAAAATATGATATAGACACATCATTCGTGTGTACGTAAAATATTTTTTAAATAATATAAAAATTTAAATGTATCTTAAGTTACACTCATCTTTTAAAAAAATTTCAGTTGCGGTAGTTAAAGTACATGCAGGCCATGCATGCACTCAAGTATCGCTCAAAATTAAAATCTTCTTCCTTTTTAACCTCTTTCTCATTCCCTTCCCTCTCCTCGAGCAAGGTTTGAAGGTGCCACGTCCATTTTCAATGAACATCGCCACCACCGCCGTCGTTCTCTTTTCCCTTTACTCCCCCTTGCTCTTTCCCCATCCATTTTCAATTCTGTCCTTCATCTCTCTTTCTTCCTTCTATTAATTTTTTCTTTTTCCTCTTAGAAAAAATCACTCTAAGTCATCATCTTCTCTCTTCACTTATGCCAGTTATCCCAAATCCAGAACAAAGGGGGATCTAAATGATTCTTGAAAAAATATGCCTTTTTGTTGATTAGAAGTCATACAACAAAGCTTGAGAGGAAGGTGGAAACAATGATACAGTTCGCCGGAGCGAGGTGGCGGCAATATAATCCGACAACGGAGGAAATGAGAATAGGTAAGAAGAGTATTAGGGAGAGTTGAAGGAAGATAGAGATTTATTGGCCTTCCAGTGAGTGGTAACAACTCAGATCTTGCCGACATTGACTCAGTCCTCCGGTAACTATTCCCCTGAAATTATGCTCATTTTTCAGCTTAAATAATTTTGTGATGGAAGGGAAGGAGATAATTTTATGGCTTGTTAGGGTTTTTTTTTCCATCTTCCTCTTCTATTCCAATGATCAATGAAGCTAAAGACTGGGTTGTTCTCTACTTTCATCTACTTTCCCCCATTTTTGAAAGCTGGGTGTTTATGTTGCCATGAAAGAGCAGCAGGGTTTTACAAGGAGATCTTTAATTCAGATGGGCTTTTAATTGTTGAGTATAATTTTGTACTAAATTTCAGTTACGATCTGTTTGAGCTGAAACATAAAACTAGTTTTTTTTTTTTTTTTTGAGGGGGGTTTCACAGGTACTCCATGAATTTTCTAATTTATCGCGTTTTCTCACCGTGAACAGAGAATTAGATTAAGAAACGAAGGCGTACCTTTGCAAGGAAAGGAAGCTGGGATTTCTGGAACAAGAGGGCAAAAATTGGCTCTTCTGCTTGAGTGGTGGAAACCAATCAAATGGATTCCGGCCAACCTGAGGGAAATAAGGCCGATGTCCAAGGTGGTTCTTGGATTCCGGCGACACCCATGAAGCCCATTCTACCAAAACCGCCGCTGCAGCCACTGATCTATGCAAGGATGGACCGGAATCAGCCACGACCATACTGGTTGGGATCAGAGAGACTGTTCTCAAATTCTAACAAGGAAGCTGAGACTAGCAGTGGAGTTGCATGTTACGGTGGAGCTAATTTCATGACAGCTAATGGTTCTAACGACTGGGAAGCAGCTCAGGCTAGGCAGTTTCAAGTGGCCCGTAATGATAATGGAACAGTGACGATACATTCCATGGATGCACTGGGGGGCATTCCTTTTTTGCAGCTAATGGCTTTGGCAGATGCGGCTTCTATTGTGGGTGCTGATGCTGCATTGGGTGGAAATGCAAGCGACTTGTTCGATTCTGGCTCCAGCTATCAAATTGAATTGGAGTCTAGCTCTATGAAGGATCGTCTCAGTGGCAGCTGCATACCTGAAGCAAAAGAGTGTTAGTCCAGCTAATTTGAATAAAGTTAATTGCACTTTAGATTCTTAAATGTGGTATAATTGATAATGCTCTAGTAACATTTAGTGCAAGAATCGTTAGAAAAAATAAGAGCTGATGTGTTAAAAGCAGCACTATAGTATTAAAAGTATTGCTCTTTGGGGAAATTGAGACAATCTAATAGAAAAGGGAATTAGTAGGAGAAAACTGATTGAAAATGGAGAATTATTCATGGATATGCATTTTCTTTATTTCCCATTTTTGTTT
CCTGTGAAGTTCTGCCTTTGTTTTTTGACCACTGCTACATGCTTTAATTAGTTTAAGACAACATTCCTTCTACCTACGTTATGTGAAAAGAAAAAAGGTAGTCTAATAGGTTCCTACAATAAATAACGTGACAAATCTCTTGAACTCTACCGTTGTTGTCATCCATTGTTCTGAAATTATTTCCAAATTTACAGATGGAACATCTGACCATGGCAGCCAGCATGCCTATGACCTCAATTTTCCATCGGGGACAGAGTCAGATGCAGCTGGTATTAGAGTAACCTCCCAATTTGCTCCTCTGACACCAGACATGGGCAAGTGTAAATATACCGAAAGAGGGATGGAATTACAGCAGATACCAATTGAGAACAGCCAAGATGAGAGAGAACGGAACCATAACTGTAATACTTCAATAACAGTTGATGGTGAAAACCTGAGACAAAATCAAGAACTTCTTGAACCTGCAATGCACTCAACAATTAATTGTACCCCAGATGGAAAGGAAGGCAAGAACGATGGTGACCTGAATAAAACACCAGCATCAAGACAGAGAAGGAGAAAACATAGACCTAAGGTCATAGTCGAAGGAAAGACTAACAGAACAAAACAAATTTTGAAGACACCTAGTTCCAATCAAAGTGTGAGAAAGCGTGTCAGGAAAAGTGGACTTACTAAGCCTTCAGCAACTCCCCCAATAGAAGTAACTGGTGAAACCTCAGAACAAGAAATGGTCAAGCATAGAAGAAAGTCATGCAGAAGAGCCATAAATTTTGATTCGCAGGCCCAAACAAGAGATGGATCCCTCGATTCAGGGCCATTGGAACAAGGTTCACTCACTCAAAACATTCAATCAACTACAGGACTAGAGGAAGCGAGGCTTGAAGAGGTAGGCTCTTCTACTGATCCAAATTGGTCCATGAATCAAATGCCAAAAAAATACGAGTCTCTATCTGAGAAACAAGCCCCACCCACCAAACTTTCAGCTGAAAATAATTCTTCTGAGAGAAAACAGCCTTCAAAAAGCCAAATGGAGAACAATATAGAACAAAATGGTAAAATAATTTCCAATTCTGATAAAGAAAACACGGTAGAAACCATCCTAAATGATGATAATCATTCATTACCAGGAAATTCGCATGGTCTTATTTTCTGCAAGAACCCCCCATTGACGTCAATAGAGCAAGCAACTTGTTGCCTTAGGAAACGTCCTCGGGCCATCAAACAAGCACATACTGGGAGCATAAATTTAACAGGAGCCCATTATAATACATTATCTGCATACCAGTCGATGTCCTGGATGCACTTTCCCCACATTTACAAGAAAAAAAGAACTGAGAAGGGGCAGAACCCTGTTCCCTCAAGTGCATTTACTACGGCCACAAATTTCATAAGGCCAGAAAGTGCATGCTCTTTCAATGACCCCCAAAGAGATCACATGGTATCAAAATTCAATGCCTGGATAGCTGGACCTCAGTTTAATATTTGTAAAAGTAAATCTGTAGCTGGGCATGGAGGAAACGATCTCCAGGATAAGTTGCAAACATATGGAGGTATCGTGGGCTTGGATCAGACTGGGAGAACAAAAAAGAAACCTAGAACAGCCAAACGACTTTCTGGCTTGGCTCCACCAGAAAGAATTACTCATTGGGAGAAGCAGCCAATATATCCTACTAATCACCCTCCTCCAGCCGGTTCTGCAAAAAATATCAATACATCGGGAACATGTATAAATGGACTATTCGAGATGATGCATGCAACAGTGGCAAAGAAGAAAAGAACAAAGAAGAAACCTTCAAACTCAGCACTTCTCAATATAAATAAAGATCTTCAGGACCGCAGATTTGTATCCTTCAATCCCTGGCAATTCTTTCCCAAGACATTAGGTACCAAAATTGAAAGTTTCTTTATCAATGGTGAAGATGTTCACCCATTTAAAGATTTTTTTTTTCTTTACCTTTTACCCATTTCCTTTATCTATACAGGCACTGCTTCAGAACACG
GTAATCAAATATGCTTTATTGATCTCTTAGTTGAACAATTAAAGCATCTAGACATCAACAAGGAAAGCAACAATTTGGGATATAGAGAGCAAGCACTTGTACCCTATAACATGCAAAATCAGGAACACAGTGCTATTGTCGTTTATGGAAGAGATGGAACTATTGTACCATTTAATCCTATAAAGAAACGACGGCCACGACCGAAAGTCGAGCTTGATGAAGAGACTGGCAGAGTATGGAAGCTTCTGATGGGAAATATAAACAGTAAAGGCATTGATGGAACAGATGAAGAAAAGATCAAATGGTGGGAAGAAGAGAGGAAGGTGTTTCAAGGACGAGCAGATTCATTTATTGCTCGGATGCATCTTGTTCAAGGTACAAAATAAACTATCCAACTCTTAACCCACTGCTATCAAAGAGTTGCTTCTGGAACTCCATGTAGACATCAAATAAATAACAAAGTTCAGTGAAGTTCATTAGACAATTGGGTTTAAGAGTCATGGAGTATGACATTAATTCAGTAAAACCTTACTTGAGCAAAATATGTTCTATAGGTTGTACAACTGTGTCCTTGAGTTCTAAGAAGACATTAATTCAGTAAAATCAATCTATATGAGAAATGAGGGAGAGAAAGAGCAACAAGAGGGTTTATAGAACAATAACACCTTTGGCATACCTTCTTGTAGGAATTAAACAAGAAAAATCAAATTCCTAGGCTTAGGACATGTTCCAGTGGTTAAAATTACTTGTCAGTTTCAACATCACATCGAAACGTGTTTTTAACCATTCAAAATCAATTTTGATATGAAAAATGCATTTAAAGGGTGATAAAAAACATGTGTCAGAGGGATTTTTAACAAATAACAAAAGTGACTTTAACTAGTTTAAAATCACTTCCAAACATGGCTTTAGAACCCTAGTGCTTCTGCATATCTCTTGTATATCCCTTTTTGGAACAGATTAAACAGTAAATAAAGTTCTTAGGCTTAGAAGTCAAGTGCTTCTCCACGTATATATATGTACTTGTCAGCAATGTGCTCCAGAAAAACACAAATGAAAATTTAACTTGCACGAAGGATAAGTCTAGGCCTAGTAGTAGCCATCCTTACTGTAGCAACTATTTTGCTGATCTGTCTAAGCAGTAACTTGATTTCATTTCTCCAGGAGATAGGCGTTTCTCTCAATGGAAGGGATCAGTCGTGGACTCTGTGGTTGGAGTATTCCTAACTCAGAATGTCTCAGATCACCTTTCTAGGTAAGCTAGATTGTTAAATGGTGTGAGCTAAAAAGTGTAATTCAAAGTAATCTCCATCTTTCCCCTTTCTTTTACTAACTCCTACTGATGTTTGCAGCTCTGCCTTCATGTCTCTTGCTGCACGCTTTCCTCCTAAGCCAAAGTGTCGCCAAGCATCATGCTCCCAAGAGCCAATTATAGAGTTGGATGAACCCGAAGAAGCATGCGTGTTCAATTTAGAGGATAGCATGAAATTGAACAAACAGATAATACATCAGCAAATAAGTGAAGAGGGCTCGTTGATGAAAGATGAAATGGAAAAAAGTGAAGGACGAATAATTGTTGACAACAATGAATCATCCGGAAGTAATGCAGAGGATGGGAGCTCAAACAAAGAACCAGAAAAGAAAAGTTTTAGTTCATCTCATAACATTCTTGAGACATGTAGCAATTCTGTGGGAGAAATATCATTGACCGAAACCAGCTCAATGCAAGCATGTCTCTCCGGAGAGAAAGAAACCTATGATTCATTTTCATTTCAAGACTGTCTGGATTCATCAATTCCTCAAACCAGTGAGAGTATTGAACCATCCTCAGAAGGAAACTCAGAAGATCTACCAAGCTGGTCCACGGAGGCACACATCGACTCTTCATCAGAGGAGCTTATTCAGATGACTGGACCAAATACTTTAAATGCTAATTTTACTACTGATACTTCTGTTGAACAGTCAGAAAATACCACCACCAACAAATTAGTAGAAAAGAA
GTGTGACAATAGAATAGATGACACTTCCCAACCAGATGATCCTGAAATATCCCTAAAAAATTCTGTTTATCATTTGAGTGATTATCAAACGCAGCAAAACCAGACCTCAAAATCATTGGAGGTTGACTGCTGTCAGACAAGCAATGGAGTTCAAACTTCTAATGATTGCCAGAACAAGGACGAACATTTTCATACTGAACAAAGTACACTGACTGTAGAATATGACAATCATGCTAATGTTGAGATGGAGCTCATAGTAGATATCGTTGAAGCACCGTCATCAAGTAGTGAATTAAGCATCAATGCAAAGGAGCCAGGTTTGACCTTACAGTCTCAAAGCAGTGTGATTGAAGACCCTCAAAATGTGGAGTCACCAGCAGAATGTACAAATACTGTGTATGAAATTCCTCCAAATGCTACAGAAATAGCAACAAAGCCAAATCCAAAGGAGTGTAATCTACTTAGTAATGAGTTTAAAGAGTTGAAACCTGCCTCTTCAAGATCTCAGAGAAAGCAAGTTGCAAAGGAAAAAGATAACATTAATTGGGACAACTTACGAAAACAGACAGAAACCAACGGAAAGACACGGCAGAGAACTGAAAGTACGATGGATTCATTGGATTGGGAAGCTATAAGATGTGCGGATGTGAATGAGATTGCACATGCCATCAGAGAACGGGGCATGAACAACATGCTTGCTGAACGAATCAAGGTATCATTATATCATTAATTTTAAACATGATCTAAAATTCTAATCAGTCCCTAGCACTAGCTGAGTACTGTATGAGTAATTTGTACAGGATTTTCTAAACCGTCTGGTGAAAGATCATGGGAGCATTGATCTTGAATGGTTAAGAGATGTGGAACCAGACCATGCAAAGTAAGATTACTTGAGAACTTAAATATCAAATTCTTGAAAATGAAATTGAAGAATATCGAGAATGTTATCTCCTACACTAGAGAATATATTGCATCCACCAAAATTGCATGATTCGTACATTGGGTTTTACACGCAAATGTAAAATATGCAGAGAATATCTACTGAGCATAAGAGGATTGGGACTGAAAAGTGTGGAGTGTGTGCGACTTCTTACTCTCCATCATCTTGCCTTCCCGGTAAGTCAAAATGAAGCATCGACATGAAATCAGATAGATCATATACATGTGCATATATGATTATCACTTGTACCTTCAACAGGTTGATACAAATGTTGGTCGTATAGCTGTGCGACTAGGATGGGTACCTCTTCAACCACTACCAGAGTCCCTACAGTTGCATCTTCTAGAATTGTGAGTATGACAAGTCTATATGTTAGCAAAATTCGTACTTCTTTACCATATTAAAGATATGTCAAGAAAATATTCAGAAAGCCATCTTCCATTATTAAATGTTTAAAGATTTAAATAAAGACAATTTTTCAGTTAATATACTTCGATTGATTAATGATATAAATTATTTCCAATTGTATGGCTGCAATATTTTTGTTTAAACAAGTATCACTGCAATATAAACATAGCTTATCAATAATTTGCATATGAAAAACAGAAAATTCAACATTCGATTTTCATGATGGTATAGAAACTCTAGAACACTATCTGGGGAAGGAAGAATAATATCACTCTTTGAACGTTTTCTTGCTCCCTCTCCAAGGTACCCTGTGCTGGAGTCAATCCAAAAGTATCTATGGCCTCGGCTTTGCAAGCTTGACCAAAGAACACTGTGAGCATGTTTTCAACTTATAGGCCTAGAAATGGATGCATTTTATTATTTTTATCATTTAGTTGTGGTAATTGAGAGAATCCAAACTAATTGCCCACCAATTGGCTATAAATCTGGATGCAGGTATGAGCTGCACTACCAAATGATTACATTTGGAAAGGTAGTTTTACTTGGAAATCTTGGTTAAGGCATAAGCCAACTTATATACGATCTTGACAGGAATGTATGGTGCAGGTCTTTTGTACCAAAAGCAAACCAAATTGTAATGCTTGTCCAATGA
GAGGAGAATGCAGGCATTTTGCAAGCGCATTTGCAAGGTCTGGCCATCTTTCTTCTAGTCCATCAGTTCATGCAGTCTTTCTTTCCCTCTCTTTTTCTTTATTTCCTATCTTTTTTAAAGACAAGAGTATGTGCGCATAAGCAACATGATTTAGGCCCCAACTAATAGCAACTATTTTGAGTTCAATTCTTTTAGAATGTCACTAATCACATGTTAAGGGCAGAGATACTTTAAAGCTTAATTCCTGCAGCAATTCTCTAAGGATGGCGATCTTTAAGTATGGATAAGTGGGGTAACTAATAGACATGCTTCTTACATATGCAGTGCAAGGCTCGGCCTTCCAGCACCAGAAGATAAAAGAATAGTCAGTACAACTGAGTGCAGAGAACCAGACAATAACCAAGCTAGAACAATTGACCAACCAATGTTGTCCCTCCCTCCGTCGACAATATCATCTGAAGAGATCAAGCCATCAGAAAGCCATCAATCTGATGGTAAGACTACAGCTGGTGCATGTGTACCCATTATTGAAGAACCAGCAACACCAGAGCAAGAAACCACCACCCAAGATGCAATCATCGACATTGAGGATGGCTTCTACGAAGACCCTGATGAAATTCCTACAATAAAACTAAACATTGAGGAGTTCTCACAGAACTTACAGAACTTCGTTCAAAAGAATATGGAACTTCAAGAAGGAGACATGTCAAAAGCTTTAATTGCATTAACCCCAGAAGCCGCGTCAATTCCAACACCCAAACTTAAGAATGTCAGCCGGCTGCGAACAGAGCATCAAGTGTAAGCAAAACATTTGTACCGTTACTCATCAAGATCTTTCACATTTCCAAATAGATGGTGCTATATGATATAATTTGATGGTTTAGCAATTGTTTGCCCAGTGGATGACCAACGTCAACAGCTATAGGTATCACATTTCAAACAAATTTTTCTACTTGCATGCTAATTTTTTTCCACCGTTTGCAGCTATGAACTTCCAGATAACCACCCTCTTCTTGAGAAGGTTAGTGATTCTTACATCTAAAGAACGAACAATTTCACCTTCTTTTGGAGTGTTTCATTTATGGGGAATGTGGAGAATTATGACCGTGTGGTAATTCTTATGATACAGTTGAAGTTGGATAGAAGAGAGCCAGATGATCCTTCCTCATACCTTTTGGCTATATGGACACCAGGTAGATGTACCTCCAAAACATTACAAATCTCTATCTATTTCAAGGACAGAAGACTAAACCAGAAGGCAACATTATGAAGTAAACACTCATGAATTCTGTAATAAGTTATATTTTTTATGCTACTCAGGTGAAACAGCAAATTCCATCGAACTACCAGAAAAAAGATGCAGTAATCAAGAACACCATCAATTGTGTTGTGAGGAGGAGTGCCTCTCGTGCAACAGTGTCAGAGAAGCCAACTCCTTCATGGTTCGAGGGACTCTTTTGGTGAGATATGTCTTGTCAACTAAGTTGTTGTGCTCAAATATACCTAGTATACTGAATCCTAATAAATATTTCATACACCGCCCCCCCCCCCCCCCCCCCAAAAAAAAAAACAGATACCATGTCGGACAGCAATGAGAGGAAGCTTTCCACTGAATGGCACTTACTTTCAAGTCAATGAGGTAAAGTTAAACGGGATATGTACCATCTTTGTTCTTCAAGGCTTAGTGCCACGACTTACTATAAATATTTTATTTGTAGGTGTTTGCGGATCACGAATCAAGCCTCAACCCAATAGATGTTCCAAGGGACTGGATATGGAATCTACCTAGACGCACCGTGTATTTTGGGACCTCCATACCAACAATATTCAAAGGTTCCTCATAGAACATATACCTCTTATTGTGTGATTGACTAAATAATAACATTTCTTGTCCACCAGGTTTATCAACACAAGGCATCCAACACTGTTTCTGGAGAGGTACCTATTCTTAAATTTACTATTGTCCTGCCGTCTATATGTGCCATTGAGAAAAACAATGT
AGTTTGCTATGTATGTGTGTCTGTCTGTCTGTGTGTATGTAGATGTATGTATGTGTGATTGCAGATGACATACGTTTACAAAACAAAATACAATGTTAAAAATGTGAGAATAATGTTAACAGACTAACAGTAAACAGAATTTTAGAATTTACTGGCCCTGTGTTTGCACCAACCAATGACTTCGAGTAAACATTAAACGATTATCATAAATCTCTCGACCTTATCATTCTCTACTTTCACTCAAATGTATTCACAACATGCAGGATTCGTCTGTGTTAGAGGTTTTGATCAAAAAACAAGGGCACCCCGACCATTGATGGCCAGGCTTCATTTTCCAGCCAGCAAATTGAACAGAGGAAGAGGTAAAACAGAGGATCAATAAGAAAGCGAGGAGATGAAGCCCAAACCAGGACAGCACAAATAACATAGCAGTAAGAAAAGAACAACATTCGGTAAGACAGCTTCCAACCCATCACAAATAAGTTAATATCCATTTGTAAATTCCTGTTTGACACGTGATTAACAGCATTAGAATTAGTCATTTCAAGCTCAAGTCTGAATCATTCATAAAAAAAACTAGCGTCCGACACTAGCCATGAGATGCTTATTAATTTCTCTATCCAAGATGTGAGATGCCAGTTCCATCCATAGCAGCAAACGGAAGATTACAGCACGTCAAACTAGCGATCATCAGAAATTTAGCTGAGTGTTGAGGAGAAAGTCGTGTTTTTTTTTTCCAGATAGATAATTGATTAGAGATAGAGATGGGCGTAGAGATGCAAAAGACGAGTGTATCAAATTCAGAGGCAGTGTTTAGTGGCTTGTCGGTGTGAAACAAAATATAACTCCTTTATGAATATGAAAGTTAGGAGGTAGTTTGGGAGCTGTATAGAAGAATATATATCTAGATATAGAAATTTGAAAATGCATTTAAAAACAAACTTCTGAGTAATTTTATGTTTGAAAATACTATTCTAAAAGTTCGAGGAAAAGGAGTACTTGGTTATTAAAGAAGCCAAAAAAATAATATAAGTACTCTTAAACTGAAATTTTATTGTAAAAGGAATATATTAGAAATTGTTGTTCAAAC

mRNA sequence

GTTCAGTTTCTTTTAACAAAAATATGATATAGACACATCATTCGTGTGTACGTAAAATATTTTTTAAATAATATAAAAATTTAAATGTATCTTAAGTTACACTCATCTTTTAAAAAAATTTCAGTTGCGGTAGTTAAAGTACATGCAGGCCATGCATGCACTCAAGTATCGCTCAAAATTAAAATCTTCTTCCTTTTTAACCTCTTTCTCATTCCCTTCCCTCTCCTCGAGCAAGGTTTGAAGGTGCCACGTCCATTTTCAATGAACATCGCCACCACCGCCGTCGTTCTCTTTTCCCTTTACTCCCCCTTGCTCTTTCCCCATCCATTTTCAATTCTGTCCTTCATCTCTCTTTCTTCCTTCTATTAATTTTTTCTTTTTCCTCTTAGAAAAAATCACTCTAAGTCATCATCTTCTCTCTTCACTTATGCCAGTTATCCCAAATCCAGAACAAAGGGGGATCTAAATGATTCTTGAAAAAATATGCCTTTTTGTTGATTAGAAGTCATACAACAAAGCTTGAGAGGAAGGTGGAAACAATGATACAGTTCGCCGGAGCGAGGTGGCGGCAATATAATCCGACAACGGAGGAAATGAGAATAGGTAAGAAGAGTATTAGGGAGAGTTGAAGGAAGATAGAGATTTATTGGCCTTCCAGTGAGTGGTAACAACTCAGATCTTGCCGACATTGACTCAGTCCTCCGAGAATTAGATTAAGAAACGAAGGCGTACCTTTGCAAGGAAAGGAAGCTGGGATTTCTGGAACAAGAGGGCAAAAATTGGCTCTTCTGCTTGAGTGGTGGAAACCAATCAAATGGATTCCGGCCAACCTGAGGGAAATAAGGCCGATGTCCAAGGTGGTTCTTGGATTCCGGCGACACCCATGAAGCCCATTCTACCAAAACCGCCGCTGCAGCCACTGATCTATGCAAGGATGGACCGGAATCAGCCACGACCATACTGGTTGGGATCAGAGAGACTGTTCTCAAATTCTAACAAGGAAGCTGAGACTAGCAGTGGAGTTGCATGTTACGGTGGAGCTAATTTCATGACAGCTAATGGTTCTAACGACTGGGAAGCAGCTCAGGCTAGGCAGTTTCAAGTGGCCCGTAATGATAATGGAACAGTGACGATACATTCCATGGATGCACTGGGGGGCATTCCTTTTTTGCAGCTAATGGCTTTGGCAGATGCGGCTTCTATTGTGGGTGCTGATGCTGCATTGGGTGGAAATGCAAGCGACTTGTTCGATTCTGGCTCCAGCTATCAAATTGAATTGGAGTCTAGCTCTATGAAGGATCGTCTCAGTGGCAGCTGCATACCTGAAGCAAAAGAGTATGGAACATCTGACCATGGCAGCCAGCATGCCTATGACCTCAATTTTCCATCGGGGACAGAGTCAGATGCAGCTGGTATTAGAGTAACCTCCCAATTTGCTCCTCTGACACCAGACATGGGCAAGTGTAAATATACCGAAAGAGGGATGGAATTACAGCAGATACCAATTGAGAACAGCCAAGATGAGAGAGAACGGAACCATAACTGTAATACTTCAATAACAGTTGATGGTGAAAACCTGAGACAAAATCAAGAACTTCTTGAACCTGCAATGCACTCAACAATTAATTGTACCCCAGATGGAAAGGAAGGCAAGAACGATGGTGACCTGAATAAAACACCAGCATCAAGACAGAGAAGGAGAAAACATAGACCTAAGGTCATAGTCGAAGGAAAGACTAACAGAACAAAACAAATTTTGAAGACACCTAGTTCCAATCAAAGTGTGAGAAAGCGTGTCAGGAAAAGTGGACTTACTAAGCCTTCAGCAACTCCCCCAATAGAAGTAACTGGTGAAACCTCAGAACAAGAAATGGTCAAGCATAGAAGAAAGTCATGCAGAAGAGCCATAAATTTTGATTCGCAGGCCCAAACAAGAGATGGATCCCTCGATTCAGGGCCATTGGAACAAGGTTCACTCACTCAAAACATTCAATCAACTA
CAGGACTAGAGGAAGCGAGGCTTGAAGAGGTAGGCTCTTCTACTGATCCAAATTGGTCCATGAATCAAATGCCAAAAAAATACGAGTCTCTATCTGAGAAACAAGCCCCACCCACCAAACTTTCAGCTGAAAATAATTCTTCTGAGAGAAAACAGCCTTCAAAAAGCCAAATGGAGAACAATATAGAACAAAATGGTAAAATAATTTCCAATTCTGATAAAGAAAACACGGTAGAAACCATCCTAAATGATGATAATCATTCATTACCAGGAAATTCGCATGGTCTTATTTTCTGCAAGAACCCCCCATTGACGTCAATAGAGCAAGCAACTTGTTGCCTTAGGAAACGTCCTCGGGCCATCAAACAAGCACATACTGGGAGCATAAATTTAACAGGAGCCCATTATAATACATTATCTGCATACCAGTCGATGTCCTGGATGCACTTTCCCCACATTTACAAGAAAAAAAGAACTGAGAAGGGGCAGAACCCTGTTCCCTCAAGTGCATTTACTACGGCCACAAATTTCATAAGGCCAGAAAGTGCATGCTCTTTCAATGACCCCCAAAGAGATCACATGGTATCAAAATTCAATGCCTGGATAGCTGGACCTCAGTTTAATATTTGTAAAAGTAAATCTGTAGCTGGGCATGGAGGAAACGATCTCCAGGATAAGTTGCAAACATATGGAGGTATCGTGGGCTTGGATCAGACTGGGAGAACAAAAAAGAAACCTAGAACAGCCAAACGACTTTCTGGCTTGGCTCCACCAGAAAGAATTACTCATTGGGAGAAGCAGCCAATATATCCTACTAATCACCCTCCTCCAGCCGGTTCTGCAAAAAATATCAATACATCGGGAACATGTATAAATGGACTATTCGAGATGATGCATGCAACAGTGGCAAAGAAGAAAAGAACAAAGAAGAAACCTTCAAACTCAGCACTTCTCAATATAAATAAAGATCTTCAGGACCGCAGATTTGTATCCTTCAATCCCTGGCAATTCTTTCCCAAGACATTAGGCACTGCTTCAGAACACGGTAATCAAATATGCTTTATTGATCTCTTAGTTGAACAATTAAAGCATCTAGACATCAACAAGGAAAGCAACAATTTGGGATATAGAGAGCAAGCACTTGTACCCTATAACATGCAAAATCAGGAACACAGTGCTATTGTCGTTTATGGAAGAGATGGAACTATTGTACCATTTAATCCTATAAAGAAACGACGGCCACGACCGAAAGTCGAGCTTGATGAAGAGACTGGCAGAGTATGGAAGCTTCTGATGGGAAATATAAACAGTAAAGGCATTGATGGAACAGATGAAGAAAAGATCAAATGGTGGGAAGAAGAGAGGAAGGTGTTTCAAGGACGAGCAGATTCATTTATTGCTCGGATGCATCTTGTTCAAGGAGATAGGCGTTTCTCTCAATGGAAGGGATCAGTCGTGGACTCTGTGGTTGGAGTATTCCTAACTCAGAATGTCTCAGATCACCTTTCTAGCTCTGCCTTCATGTCTCTTGCTGCACGCTTTCCTCCTAAGCCAAAGTGTCGCCAAGCATCATGCTCCCAAGAGCCAATTATAGAGTTGGATGAACCCGAAGAAGCATGCGTGTTCAATTTAGAGGATAGCATGAAATTGAACAAACAGATAATACATCAGCAAATAAGTGAAGAGGGCTCGTTGATGAAAGATGAAATGGAAAAAAGTGAAGGACGAATAATTGTTGACAACAATGAATCATCCGGAAGTAATGCAGAGGATGGGAGCTCAAACAAAGAACCAGAAAAGAAAAGTTTTAGTTCATCTCATAACATTCTTGAGACATGTAGCAATTCTGTGGGAGAAATATCATTGACCGAAACCAGCTCAATGCAAGCATGTCTCTCCGGAGAGAAAGAAACCTATGATTCATTTTCATTTCAAGACTGTCTGGATTCATCAATTCCTCAAACCAGTGAGAGTATTGAACCATCCTCAGAAGGAAACTCA
GAAGATCTACCAAGCTGGTCCACGGAGGCACACATCGACTCTTCATCAGAGGAGCTTATTCAGATGACTGGACCAAATACTTTAAATGCTAATTTTACTACTGATACTTCTGTTGAACAGTCAGAAAATACCACCACCAACAAATTAGTAGAAAAGAAGTGTGACAATAGAATAGATGACACTTCCCAACCAGATGATCCTGAAATATCCCTAAAAAATTCTGTTTATCATTTGAGTGATTATCAAACGCAGCAAAACCAGACCTCAAAATCATTGGAGGTTGACTGCTGTCAGACAAGCAATGGAGTTCAAACTTCTAATGATTGCCAGAACAAGGACGAACATTTTCATACTGAACAAAGTACACTGACTGTAGAATATGACAATCATGCTAATGTTGAGATGGAGCTCATAGTAGATATCGTTGAAGCACCGTCATCAAGTAGTGAATTAAGCATCAATGCAAAGGAGCCAGGTTTGACCTTACAGTCTCAAAGCAGTGTGATTGAAGACCCTCAAAATGTGGAGTCACCAGCAGAATGTACAAATACTGTGTATGAAATTCCTCCAAATGCTACAGAAATAGCAACAAAGCCAAATCCAAAGGAGTGTAATCTACTTAGTAATGAGTTTAAAGAGTTGAAACCTGCCTCTTCAAGATCTCAGAGAAAGCAAGTTGCAAAGGAAAAAGATAACATTAATTGGGACAACTTACGAAAACAGACAGAAACCAACGGAAAGACACGGCAGAGAACTGAAAGTACGATGGATTCATTGGATTGGGAAGCTATAAGATGTGCGGATGTGAATGAGATTGCACATGCCATCAGAGAACGGGGCATGAACAACATGCTTGCTGAACGAATCAAGGATTTTCTAAACCGTCTGGTGAAAGATCATGGGAGCATTGATCTTGAATGGTTAAGAGATGTGGAACCAGACCATGCAAAAGAATATCTACTGAGCATAAGAGGATTGGGACTGAAAAGTGTGGAGTGTGTGCGACTTCTTACTCTCCATCATCTTGCCTTCCCGGTTGATACAAATGTTGGTCGTATAGCTGTGCGACTAGGATGGGTACCTCTTCAACCACTACCAGAGTCCCTACAGTTGCATCTTCTAGAATTGTACCCTGTGCTGGAGTCAATCCAAAAGTATCTATGGCCTCGGCTTTGCAAGCTTGACCAAAGAACACTGTATGAGCTGCACTACCAAATGATTACATTTGGAAAGGTCTTTTGTACCAAAAGCAAACCAAATTGTAATGCTTGTCCAATGAGAGGAGAATGCAGGCATTTTGCAAGCGCATTTGCAAGTGCAAGGCTCGGCCTTCCAGCACCAGAAGATAAAAGAATAGTCAGTACAACTGAGTGCAGAGAACCAGACAATAACCAAGCTAGAACAATTGACCAACCAATGTTGTCCCTCCCTCCGTCGACAATATCATCTGAAGAGATCAAGCCATCAGAAAGCCATCAATCTGATGGTAAGACTACAGCTGGTGCATGTGTACCCATTATTGAAGAACCAGCAACACCAGAGCAAGAAACCACCACCCAAGATGCAATCATCGACATTGAGGATGGCTTCTACGAAGACCCTGATGAAATTCCTACAATAAAACTAAACATTGAGGAGTTCTCACAGAACTTACAGAACTTCGTTCAAAAGAATATGGAACTTCAAGAAGGAGACATGTCAAAAGCTTTAATTGCATTAACCCCAGAAGCCGCGTCAATTCCAACACCCAAACTTAAGAATGTCAGCCGGCTGCGAACAGAGCATCAAGTCTATGAACTTCCAGATAACCACCCTCTTCTTGAGAAGTTGAAGTTGGATAGAAGAGAGCCAGATGATCCTTCCTCATACCTTTTGGCTATATGGACACCAGGTGAAACAGCAAATTCCATCGAACTACCAGAAAAAAGATGCAGTAATCAAGAACACCATCAATTGTGTTGTGAGGAGGAGTGCCTCTCGTGCAACAGTGTCAGAGAAGC
CAACTCCTTCATGGTTCGAGGGACTCTTTTGATACCATGTCGGACAGCAATGAGAGGAAGCTTTCCACTGAATGGCACTTACTTTCAAGTCAATGAGGTGTTTGCGGATCACGAATCAAGCCTCAACCCAATAGATGTTCCAAGGGACTGGATATGGAATCTACCTAGACGCACCGTGTATTTTGGGACCTCCATACCAACAATATTCAAAGGTTTATCAACACAAGGCATCCAACACTGTTTCTGGAGAGGATTCGTCTGTGTTAGAGGTTTTGATCAAAAAACAAGGGCACCCCGACCATTGATGGCCAGGCTTCATTTTCCAGCCAGCAAATTGAACAGAGGAAGAGGTAAAACAGAGGATCAATAAGAAAGCGAGGAGATGAAGCCCAAACCAGGACAGCACAAATAACATAGCAGTAAGAAAAGAACAACATTCGATGTGAGATGCCAGTTCCATCCATAGCAGCAAACGGAAGATTACAGCACGTCAAACTAGCGATCATCAGAAATTTAGCTGAGTGTTGAGGAGAAAGTCGTGTTTTTTTTTTCCAGATAGATAATTGATTAGAGATAGAGATGGGCGTAGAGATGCAAAAGACGAGTGTATCAAATTCAGAGGCAGTGTTTAGTGGCTTGTCGGTGTGAAACAAAATATAACTCCTTTATGAATATGAAAGTTAGGAGGTAGTTTGGGAGCTGTATAGAAGAATATATATCTAGATATAGAAATTTGAAAATGCATTTAAAAACAAACTTCTGAGTAATTTTATGTTTGAAAATACTATTCTAAAAGTTCGAGGAAAAGGAGTACTTGGTTATTAAAGAAGCCAAAAAAATAATATAAGTACTCTTAAACTGAAATTTTATTGTAAAAGGAATATATTAGAAATTGTTGTTCAAAC

Coding sequence (CDS)

ATGGATTCCGGCCAACCTGAGGGAAATAAGGCCGATGTCCAAGGTGGTTCTTGGATTCCGGCGACACCCATGAAGCCCATTCTACCAAAACCGCCGCTGCAGCCACTGATCTATGCAAGGATGGACCGGAATCAGCCACGACCATACTGGTTGGGATCAGAGAGACTGTTCTCAAATTCTAACAAGGAAGCTGAGACTAGCAGTGGAGTTGCATGTTACGGTGGAGCTAATTTCATGACAGCTAATGGTTCTAACGACTGGGAAGCAGCTCAGGCTAGGCAGTTTCAAGTGGCCCGTAATGATAATGGAACAGTGACGATACATTCCATGGATGCACTGGGGGGCATTCCTTTTTTGCAGCTAATGGCTTTGGCAGATGCGGCTTCTATTGTGGGTGCTGATGCTGCATTGGGTGGAAATGCAAGCGACTTGTTCGATTCTGGCTCCAGCTATCAAATTGAATTGGAGTCTAGCTCTATGAAGGATCGTCTCAGTGGCAGCTGCATACCTGAAGCAAAAGAGTATGGAACATCTGACCATGGCAGCCAGCATGCCTATGACCTCAATTTTCCATCGGGGACAGAGTCAGATGCAGCTGGTATTAGAGTAACCTCCCAATTTGCTCCTCTGACACCAGACATGGGCAAGTGTAAATATACCGAAAGAGGGATGGAATTACAGCAGATACCAATTGAGAACAGCCAAGATGAGAGAGAACGGAACCATAACTGTAATACTTCAATAACAGTTGATGGTGAAAACCTGAGACAAAATCAAGAACTTCTTGAACCTGCAATGCACTCAACAATTAATTGTACCCCAGATGGAAAGGAAGGCAAGAACGATGGTGACCTGAATAAAACACCAGCATCAAGACAGAGAAGGAGAAAACATAGACCTAAGGTCATAGTCGAAGGAAAGACTAACAGAACAAAACAAATTTTGAAGACACCTAGTTCCAATCAAAGTGTGAGAAAGCGTGTCAGGAAAAGTGGACTTACTAAGCCTTCAGCAACTCCCCCAATAGAAGTAACTGGTGAAACCTCAGAACAAGAAATGGTCAAGCATAGAAGAAAGTCATGCAGAAGAGCCATAAATTTTGATTCGCAGGCCCAAACAAGAGATGGATCCCTCGATTCAGGGCCATTGGAACAAGGTTCACTCACTCAAAACATTCAATCAACTACAGGACTAGAGGAAGCGAGGCTTGAAGAGGTAGGCTCTTCTACTGATCCAAATTGGTCCATGAATCAAATGCCAAAAAAATACGAGTCTCTATCTGAGAAACAAGCCCCACCCACCAAACTTTCAGCTGAAAATAATTCTTCTGAGAGAAAACAGCCTTCAAAAAGCCAAATGGAGAACAATATAGAACAAAATGGTAAAATAATTTCCAATTCTGATAAAGAAAACACGGTAGAAACCATCCTAAATGATGATAATCATTCATTACCAGGAAATTCGCATGGTCTTATTTTCTGCAAGAACCCCCCATTGACGTCAATAGAGCAAGCAACTTGTTGCCTTAGGAAACGTCCTCGGGCCATCAAACAAGCACATACTGGGAGCATAAATTTAACAGGAGCCCATTATAATACATTATCTGCATACCAGTCGATGTCCTGGATGCACTTTCCCCACATTTACAAGAAAAAAAGAACTGAGAAGGGGCAGAACCCTGTTCCCTCAAGTGCATTTACTACGGCCACAAATTTCATAAGGCCAGAAAGTGCATGCTCTTTCAATGACCCCCAAAGAGATCACATGGTATCAAAATTCAATGCCTGGATAGCTGGACCTCAGTTTAATATTTGTAAAAGTAAATCTGTAGCTGGGCATGGAGGAAACGATCTCCAGGATAAGTTGCAAACATATGGAGGTATCGTGGGCTTGGATCAGACTGGGAGAACAAAAAAGAAACCTAGAACAGCCAAACGACTTTCTGGCTTGGCTCCACCAGAAAGAATTACTCATTGGGAGAAGCAGCCAATATATCCTAC
TAATCACCCTCCTCCAGCCGGTTCTGCAAAAAATATCAATACATCGGGAACATGTATAAATGGACTATTCGAGATGATGCATGCAACAGTGGCAAAGAAGAAAAGAACAAAGAAGAAACCTTCAAACTCAGCACTTCTCAATATAAATAAAGATCTTCAGGACCGCAGATTTGTATCCTTCAATCCCTGGCAATTCTTTCCCAAGACATTAGGCACTGCTTCAGAACACGGTAATCAAATATGCTTTATTGATCTCTTAGTTGAACAATTAAAGCATCTAGACATCAACAAGGAAAGCAACAATTTGGGATATAGAGAGCAAGCACTTGTACCCTATAACATGCAAAATCAGGAACACAGTGCTATTGTCGTTTATGGAAGAGATGGAACTATTGTACCATTTAATCCTATAAAGAAACGACGGCCACGACCGAAAGTCGAGCTTGATGAAGAGACTGGCAGAGTATGGAAGCTTCTGATGGGAAATATAAACAGTAAAGGCATTGATGGAACAGATGAAGAAAAGATCAAATGGTGGGAAGAAGAGAGGAAGGTGTTTCAAGGACGAGCAGATTCATTTATTGCTCGGATGCATCTTGTTCAAGGAGATAGGCGTTTCTCTCAATGGAAGGGATCAGTCGTGGACTCTGTGGTTGGAGTATTCCTAACTCAGAATGTCTCAGATCACCTTTCTAGCTCTGCCTTCATGTCTCTTGCTGCACGCTTTCCTCCTAAGCCAAAGTGTCGCCAAGCATCATGCTCCCAAGAGCCAATTATAGAGTTGGATGAACCCGAAGAAGCATGCGTGTTCAATTTAGAGGATAGCATGAAATTGAACAAACAGATAATACATCAGCAAATAAGTGAAGAGGGCTCGTTGATGAAAGATGAAATGGAAAAAAGTGAAGGACGAATAATTGTTGACAACAATGAATCATCCGGAAGTAATGCAGAGGATGGGAGCTCAAACAAAGAACCAGAAAAGAAAAGTTTTAGTTCATCTCATAACATTCTTGAGACATGTAGCAATTCTGTGGGAGAAATATCATTGACCGAAACCAGCTCAATGCAAGCATGTCTCTCCGGAGAGAAAGAAACCTATGATTCATTTTCATTTCAAGACTGTCTGGATTCATCAATTCCTCAAACCAGTGAGAGTATTGAACCATCCTCAGAAGGAAACTCAGAAGATCTACCAAGCTGGTCCACGGAGGCACACATCGACTCTTCATCAGAGGAGCTTATTCAGATGACTGGACCAAATACTTTAAATGCTAATTTTACTACTGATACTTCTGTTGAACAGTCAGAAAATACCACCACCAACAAATTAGTAGAAAAGAAGTGTGACAATAGAATAGATGACACTTCCCAACCAGATGATCCTGAAATATCCCTAAAAAATTCTGTTTATCATTTGAGTGATTATCAAACGCAGCAAAACCAGACCTCAAAATCATTGGAGGTTGACTGCTGTCAGACAAGCAATGGAGTTCAAACTTCTAATGATTGCCAGAACAAGGACGAACATTTTCATACTGAACAAAGTACACTGACTGTAGAATATGACAATCATGCTAATGTTGAGATGGAGCTCATAGTAGATATCGTTGAAGCACCGTCATCAAGTAGTGAATTAAGCATCAATGCAAAGGAGCCAGGTTTGACCTTACAGTCTCAAAGCAGTGTGATTGAAGACCCTCAAAATGTGGAGTCACCAGCAGAATGTACAAATACTGTGTATGAAATTCCTCCAAATGCTACAGAAATAGCAACAAAGCCAAATCCAAAGGAGTGTAATCTACTTAGTAATGAGTTTAAAGAGTTGAAACCTGCCTCTTCAAGATCTCAGAGAAAGCAAGTTGCAAAGGAAAAAGATAACATTAATTGGGACAACTTACGAAAACAGACAGAAACCAACGGAAAGACACGGCAGAGAACTGAAAGTACGATGGATTCATTGGATTGGGAAGCTATAAGATGTGCGGATGTGAATGAGA
TTGCACATGCCATCAGAGAACGGGGCATGAACAACATGCTTGCTGAACGAATCAAGGATTTTCTAAACCGTCTGGTGAAAGATCATGGGAGCATTGATCTTGAATGGTTAAGAGATGTGGAACCAGACCATGCAAAAGAATATCTACTGAGCATAAGAGGATTGGGACTGAAAAGTGTGGAGTGTGTGCGACTTCTTACTCTCCATCATCTTGCCTTCCCGGTTGATACAAATGTTGGTCGTATAGCTGTGCGACTAGGATGGGTACCTCTTCAACCACTACCAGAGTCCCTACAGTTGCATCTTCTAGAATTGTACCCTGTGCTGGAGTCAATCCAAAAGTATCTATGGCCTCGGCTTTGCAAGCTTGACCAAAGAACACTGTATGAGCTGCACTACCAAATGATTACATTTGGAAAGGTCTTTTGTACCAAAAGCAAACCAAATTGTAATGCTTGTCCAATGAGAGGAGAATGCAGGCATTTTGCAAGCGCATTTGCAAGTGCAAGGCTCGGCCTTCCAGCACCAGAAGATAAAAGAATAGTCAGTACAACTGAGTGCAGAGAACCAGACAATAACCAAGCTAGAACAATTGACCAACCAATGTTGTCCCTCCCTCCGTCGACAATATCATCTGAAGAGATCAAGCCATCAGAAAGCCATCAATCTGATGGTAAGACTACAGCTGGTGCATGTGTACCCATTATTGAAGAACCAGCAACACCAGAGCAAGAAACCACCACCCAAGATGCAATCATCGACATTGAGGATGGCTTCTACGAAGACCCTGATGAAATTCCTACAATAAAACTAAACATTGAGGAGTTCTCACAGAACTTACAGAACTTCGTTCAAAAGAATATGGAACTTCAAGAAGGAGACATGTCAAAAGCTTTAATTGCATTAACCCCAGAAGCCGCGTCAATTCCAACACCCAAACTTAAGAATGTCAGCCGGCTGCGAACAGAGCATCAAGTCTATGAACTTCCAGATAACCACCCTCTTCTTGAGAAGTTGAAGTTGGATAGAAGAGAGCCAGATGATCCTTCCTCATACCTTTTGGCTATATGGACACCAGGTGAAACAGCAAATTCCATCGAACTACCAGAAAAAAGATGCAGTAATCAAGAACACCATCAATTGTGTTGTGAGGAGGAGTGCCTCTCGTGCAACAGTGTCAGAGAAGCCAACTCCTTCATGGTTCGAGGGACTCTTTTGATACCATGTCGGACAGCAATGAGAGGAAGCTTTCCACTGAATGGCACTTACTTTCAAGTCAATGAGGTGTTTGCGGATCACGAATCAAGCCTCAACCCAATAGATGTTCCAAGGGACTGGATATGGAATCTACCTAGACGCACCGTGTATTTTGGGACCTCCATACCAACAATATTCAAAGGTTTATCAACACAAGGCATCCAACACTGTTTCTGGAGAGGATTCGTCTGTGTTAGAGGTTTTGATCAAAAAACAAGGGCACCCCGACCATTGATGGCCAGGCTTCATTTTCCAGCCAGCAAATTGAACAGAGGAAGAGGTAAAACAGAGGATCAATAA

Protein sequence

MDSGQPEGNKADVQGGSWIPATPMKPILPKPPLQPLIYARMDRNQPRPYWLGSERLFSNSNKEAETSSGVACYGGANFMTANGSNDWEAAQARQFQVARNDNGTVTIHSMDALGGIPFLQLMALADAASIVGADAALGGNASDLFDSGSSYQIELESSSMKDRLSGSCIPEAKEYGTSDHGSQHAYDLNFPSGTESDAAGIRVTSQFAPLTPDMGKCKYTERGMELQQIPIENSQDERERNHNCNTSITVDGENLRQNQELLEPAMHSTINCTPDGKEGKNDGDLNKTPASRQRRRKHRPKVIVEGKTNRTKQILKTPSSNQSVRKRVRKSGLTKPSATPPIEVTGETSEQEMVKHRRKSCRRAINFDSQAQTRDGSLDSGPLEQGSLTQNIQSTTGLEEARLEEVGSSTDPNWSMNQMPKKYESLSEKQAPPTKLSAENNSSERKQPSKSQMENNIEQNGKIISNSDKENTVETILNDDNHSLPGNSHGLIFCKNPPLTSIEQATCCLRKRPRAIKQAHTGSINLTGAHYNTLSAYQSMSWMHFPHIYKKKRTEKGQNPVPSSAFTTATNFIRPESACSFNDPQRDHMVSKFNAWIAGPQFNICKSKSVAGHGGNDLQDKLQTYGGIVGLDQTGRTKKKPRTAKRLSGLAPPERITHWEKQPIYPTNHPPPAGSAKNINTSGTCINGLFEMMHATVAKKKRTKKKPSNSALLNINKDLQDRRFVSFNPWQFFPKTLGTASEHGNQICFIDLLVEQLKHLDINKESNNLGYREQALVPYNMQNQEHSAIVVYGRDGTIVPFNPIKKRRPRPKVELDEETGRVWKLLMGNINSKGIDGTDEEKIKWWEEERKVFQGRADSFIARMHLVQGDRRFSQWKGSVVDSVVGVFLTQNVSDHLSSSAFMSLAARFPPKPKCRQASCSQEPIIELDEPEEACVFNLEDSMKLNKQIIHQQISEEGSLMKDEMEKSEGRIIVDNNESSGSNAEDGSSNKEPEKKSFSSSHNILETCSNSVGEISLTETSSMQACLSGEKETYDSFSFQDCLDSSIPQTSESIEPSSEGNSEDLPSWSTEAHIDSSSEELIQMTGPNTLNANFTTDTSVEQSENTTTNKLVEKKCDNRIDDTSQPDDPEISLKNSVYHLSDYQTQQNQTSKSLEVDCCQTSNGVQTSNDCQNKDEHFHTEQSTLTVEYDNHANVEMELIVDIVEAPSSSSELSINAKEPGLTLQSQSSVIEDPQNVESPAECTNTVYEIPPNATEIATKPNPKECNLLSNEFKELKPASSRSQRKQVAKEKDNINWDNLRKQTETNGKTRQRTESTMDSLDWEAIRCADVNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEWLRDVEPDHAKEYLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFASARLGLPAPEDKRIVSTTECREPDNNQARTIDQPMLSLPPSTISSEEIKPSESHQSDGKTTAGACVPIIEEPATPEQETTTQDAIIDIEDGFYEDPDEIPTIKLNIEEFSQNLQNFVQKNMELQEGDMSKALIALTPEAASIPTPKLKNVSRLRTEHQVYELPDNHPLLEKLKLDRREPDDPSSYLLAIWTPGETANSIELPEKRCSNQEHHQLCCEEECLSCNSVREANSFMVRGTLLIPCRTAMRGSFPLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVYFGTSIPTIFKGLSTQGIQHCFWRGFVCVRGFDQKTRAPRPLMARLHFPASKLNRGRGKTEDQ
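
The protein sequence above is the conceptual translation of the CDS. As a minimal sketch, assuming the standard genetic code applies, the first and last codons of the CDS can be translated and compared with the ends of the protein. `translate` is a hypothetical helper written for this illustration, and only short excerpts of the CDS are used to keep the example compact.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGATTCCGGCCAACCTGAGGGAAATAAG"  # first 30 nt of the CDS above
cds_end = "GGTAAAACAGAGGATCAATAA"             # last 21 nt of the CDS above

print(translate(cds_start))  # MDSGQPEGNK
print(translate(cds_end))    # GKTEDQ
```

Both results agree with the first ten and last six residues of the protein sequence listed in this record.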
Homology
BLAST of PI0016811 vs. ExPASy Swiss-Prot
Match: Q8LK56 (Transcriptional activator DEMETER OS=Arabidopsis thaliana OX=3702 GN=DME PE=1 SV=2)

HSP 1 Score: 941.4 bits (2432), Expect = 1.5e-272
Identity = 564/1132 (49.82%), Positives = 714/1132 (63.07%), Query Frame = 0

Query: 775  ALVPYNMQN---------QEHSAIVVYGRDGTIVPFNPIKKRRPRPKVELDEETGRVWKL 834
            A + Y MQN         QE +A+V+Y  DG +VP+   KKR+PRPKV++D+ET R+W L
Sbjct: 910  AEIIYRMQNLYLGDKEREQEQNAMVLYKGDGALVPYES-KKRKPRPKVDIDDETTRIWNL 969

Query: 835  LMGNINSK-GIDGTDEEKIKWWEEERKVFQGRADSFIARMHLVQGDRRFSQWKGSVVDSV 894
            LMG  + K G +  D++K KWWEEER+VF+GRADSFIARMHLVQGDRRFS WKGSVVDSV
Sbjct: 970  LMGKGDEKEGDEEKDKKKEKWWEEERRVFRGRADSFIARMHLVQGDRRFSPWKGSVVDSV 1029

Query: 895  VGVFLTQNVSDHLSSSAFMSLAARFPPKPKCRQASCSQEPIIELDEPEEACVFNLEDSMK 954
            +GVFLTQNVSDHLSSSAFMSLAARFPPK    +        + +++P E C+ NL +   
Sbjct: 1030 IGVFLTQNVSDHLSSSAFMSLAARFPPKLSSSREDERNVRSVVVEDP-EGCILNLNEIPS 1089

Query: 955  LNKQIIHQQISEEGSLMKDEMEKSEGRIIVDNNESSGSNAEDGSSNKEPEKKSFSSSHN- 1014
              +++ H    E   +  D   K + R   ++     +  E    N E E  S   S + 
Sbjct: 1090 WQEKVQHPSDMEVSGV--DSGSKEQLRDCSNSGIERFNFLEKSIQNLEEEVLSSQDSFDP 1149

Query: 1015 -ILETCSNSVGEISLTETSSMQACLSGEKETYDSFSFQDCLDSSIPQTSESIEPSSEGNS 1074
             I ++C   VG  S +++ +              F    C   ++  TS+S++  S    
Sbjct: 1150 AIFQSCGR-VGSCSCSKSDA-------------EFPTTRCETKTVSGTSQSVQTGS---- 1209

Query: 1075 EDLPSWSTEAHIDSSSEELIQMTGPNTLNANFTTDTSVEQSENTTTNKLVEKKCDNR-ID 1134
               P+ S E  +  +    +   G   +    TT+ + ++ +   T    +  C  +  +
Sbjct: 1210 ---PNLSDEICLQGNERPHL-YEGSGDVQKQETTNVAQKKPDLEKTMNWKDSVCFGQPRN 1269

Query: 1135 DTSQPDDPEISLKN------SVYHLSDYQTQ-----QNQTSKSLEVDCCQTSN------- 1194
            DT+    P  S +        V  + D+  Q      +  S S  VD  +  N       
Sbjct: 1270 DTNWQTTPSSSYEQCATRQPHVLDIEDFGMQGEGLGYSWMSISPRVDRVKNKNVPRRFFR 1329

Query: 1195 ------------------------GVQTSNDC--QNKDEHFHTEQSTLTVEYDNHANVEM 1254
                                    G+  S+    +++D+  H +Q  +     N A+   
Sbjct: 1330 QGGSVPREFTGQIIPSTPHELPGMGLSGSSSAVQEHQDDTQHNQQDEM-----NKASHLQ 1389

Query: 1255 ELIVDIVEAPSSSSELSINAKEPGLTLQSQSSVIEDPQNVESPAECTNTVYEIPPNAT-- 1314
            +  +D+           +N+ E  LT QS +        +       + V  +  N++  
Sbjct: 1390 KTFLDL-----------LNSSEECLTRQSSTKQNITDGCLPRDRTAEDVVDPLSNNSSLQ 1449

Query: 1315 EIATKPNPKECNLLSNEFKELKPASSRSQRKQVAK-EKDNINWDNLRKQTETNGKTRQRT 1374
             I  + N       + E+KE      R  +  +A  +K    WD+LRK  E N   ++R 
Sbjct: 1450 NILVESNSSNKEQTAVEYKETNATILREMKGTLADGKKPTSQWDSLRKDVEGNEGRQERN 1509

Query: 1375 ESTMDSLDWEAIRCADVNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEWLRDVE 1434
            ++ MDS+D+EAIR A ++EI+ AI+ERGMNNMLA RIKDFL R+VKDHG IDLEWLR+  
Sbjct: 1510 KNNMDSIDYEAIRRASISEISEAIKERGMNNMLAVRIKDFLERIVKDHGGIDLEWLRESP 1569

Query: 1435 PDHAKEYLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLH 1494
            PD AK+YLLSIRGLGLKSVECVRLLTLH+LAFPVDTNVGRIAVR+GWVPLQPLPESLQLH
Sbjct: 1570 PDKAKDYLLSIRGLGLKSVECVRLLTLHNLAFPVDTNVGRIAVRMGWVPLQPLPESLQLH 1629

Query: 1495 LLELYPVLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRH 1554
            LLELYPVLESIQK+LWPRLCKLDQRTLYELHYQ+ITFGKVFCTKS+PNCNACPMRGECRH
Sbjct: 1630 LLELYPVLESIQKFLWPRLCKLDQRTLYELHYQLITFGKVFCTKSRPNCNACPMRGECRH 1689

Query: 1555 FASAFASARLGLPAPEDKRIVSTTECREPDNNQARTIDQPMLSLPPSTISSEEIKPSESH 1614
            FASA+ASARL LPAPE++ + S T    P++     I  PM+ LP        +   +S 
Sbjct: 1690 FASAYASARLALPAPEERSLTSATIPVPPESYPPVAI--PMIELP--------LPLEKSL 1749

Query: 1615 QSDGKTTAGACVPIIEEPATPEQETTTQDAIIDIEDGFY-EDPDEIPTIKLNIEEFSQNL 1674
             S   +    C PIIEEPA+P QE  T+    DIED +Y EDPDEIPTIKLNIE+F   L
Sbjct: 1750 ASGAPSNRENCEPIIEEPASPGQE-CTEITESDIEDAYYNEDPDEIPTIKLNIEQFGMTL 1809

Query: 1675 QNFVQKNMELQEGDMSKALIALTPEAASIPTPKLKNVSRLRTEHQVYELPDNHPLLEKLK 1734
            +  +++NMELQEGDMSKAL+AL P   SIPTPKLKN+SRLRTEHQVYELPD+H LL+   
Sbjct: 1810 REHMERNMELQEGDMSKALVALHPTTTSIPTPKLKNISRLRTEHQVYELPDSHRLLD--G 1869

Query: 1735 LDRREPDDPSSYLLAIWTPGETANSIELPEKRCSNQEHHQLCCEEECLSCNSVREANSFM 1794
            +D+REPDDPS YLLAIWTPGETANS + PE++C  +   ++C +E C  CNS+REANS  
Sbjct: 1870 MDKREPDDPSPYLLAIWTPGETANSAQPPEQKCGGKASGKMCFDETCSECNSLREANSQT 1929

Query: 1795 VRGTLLIPCRTAMRGSFPLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVYFGTS 1846
            VRGTLLIPCRTAMRGSFPLNGTYFQVNE+FADHESSL PIDVPRDWIW+LPRRTVYFGTS
Sbjct: 1930 VRGTLLIPCRTAMRGSFPLNGTYFQVNELFADHESSLKPIDVPRDWIWDLPRRTVYFGTS 1986
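
The percentages in the HSP summary for Q8LK56 are ratios of matching alignment columns to the total alignment length (1132 columns, gaps included). A minimal sketch that recomputes them from the counts reported above; `percent` is a hypothetical helper, not a BLAST function.

```python
def percent(matches, alignment_length):
    """Return matches as a percentage of alignment columns, rounded to 2 decimals."""
    return round(100.0 * matches / alignment_length, 2)

# Counts taken from the HSP summary line of the Q8LK56 (DEMETER) hit.
identity = percent(564, 1132)
positives = percent(714, 1132)
print(identity, positives)  # 49.82 63.07
```

The same calculation applies to any other HSP summary in this record by substituting its counts.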

BLAST of PI0016811 vs. ExPASy Swiss-Prot
Match: Q9SJQ6 (DNA glycosylase/AP lyase ROS1 OS=Arabidopsis thaliana OX=3702 GN=ROS1 PE=1 SV=2)

HSP 1 Score: 937.2 bits (2421), Expect = 2.9e-271
Identity = 565/1171 (48.25%), Positives = 707/1171 (60.38%), Query Frame = 0

Query: 681  TSGTCI-----NGLFEMMHATVAKKKRTKKKPSNSALLNINKDLQDRRFVSFNPWQFFPK 740
            TSG C      N +      TV+KKK TK + S +   N+  +L           +F P 
Sbjct: 409  TSGYCSKPQQNNKILVDTRVTVSKKKPTKSEKSQTKQKNLLPNL----------CRFPPS 468

Query: 741  TLG-TASEHGNQICFIDLLVEQLKHLDINKESNNLGYREQALVPYNMQNQEHSAIVVYGR 800
              G +  E   +   I+ + E L+ LDIN+E     + E ALVPY M +Q    IV++G 
Sbjct: 469  FTGLSPDELWKRRNSIETISELLRLLDINRE-----HSETALVPYTMNSQ----IVLFGG 528

Query: 801  D-GTIVPFNPIKKRRPRPKVELDEETGRVWKLLMGNINSKGIDGTDEEKIKWWEEERKVF 860
              G IVP  P+KK RPRPKV+LD+ET RVWKLL+ NINS+G+DG+DE+K KWWEEER VF
Sbjct: 529  GAGAIVPVTPVKKPRPRPKVDLDDETDRVWKLLLENINSEGVDGSDEQKAKWWEEERNVF 588

Query: 861  QGRADSFIARMHLVQGDRRFSQWKGSVVDSVVGVFLTQNVSDHLSSSAFMSLAARFPPK- 920
            +GRADSFIARMHLVQGDRRF+ WKGSVVDSVVGVFLTQNVSDHLSSSAFMSLA++FP   
Sbjct: 589  RGRADSFIARMHLVQGDRRFTPWKGSVVDSVVGVFLTQNVSDHLSSSAFMSLASQFPVPF 648

Query: 921  -PKCR-QASCSQEPIIELDEPEEACVFNLEDSMKLNKQIIHQQISEEGSLMKDEMEKSEG 980
             P     A  S  P I++         + E++M       H  ++ + +      +  E 
Sbjct: 649  VPSSNFDAGTSSMPSIQI------TYLDSEETMSSPPDHNHSSVTLKNT------QPDEE 708

Query: 981  RIIVDNNESSGSNAEDGSSNKEPEKKSFSSSHNILETCSNSVGEISLTETSSMQACLSGE 1040
            +  V +NE+S S++E   S  E   K+  S   +      S  E+  T+       L   
Sbjct: 709  KDYVPSNETSRSSSEIAISAHESVDKTTDSKEYVDSDRKGSSVEVDKTDEKCRVLNLFPS 768

Query: 1041 KETYDSFSFQDCLDSSIPQTSESIEPSSEGNSEDLPSWSTEAHIDSSSEELIQMTGPNTL 1100
            +++  + + Q  + S  PQ                                      NT 
Sbjct: 769  EDS--ALTCQHSMVSDAPQ--------------------------------------NTE 828

Query: 1101 NANFTTDTSVEQSENTTTNKLVEKKCDNRIDDTSQPDDPEISLKNSVYHLSDYQTQQNQT 1160
             A  +++  +E    T+  KL++                ++SL++S           NQ 
Sbjct: 829  RAGSSSEIDLEGEYRTSFMKLLQ--------------GVQVSLEDS-----------NQV 888

Query: 1161 SKSLEVDCCQTSNGVQTSNDCQNKDEHFHTEQSTLTVEYDNHANVEMELIVDIVEAPSSS 1220
            S ++            +  DC ++ + F +                       ++ P+ S
Sbjct: 889  SPNM------------SPGDCSSEIKGFQS-----------------------MKEPTKS 948

Query: 1221 SELSINAKEPGLTLQSQSSVIEDPQNVESPAECTNTVYEIPPNATEIATKPNPKECNLLS 1280
               S+++ EPG   Q    V+                                  C    
Sbjct: 949  ---SVDSSEPGCCSQQDGDVL---------------------------------SCQ--- 1008

Query: 1281 NEFKELKPASSRSQRKQVAKEKDNINWDNLRKQTETNGKTRQRTESTMDSLDWEAIRCAD 1340
                  KP      +K + +EK   +WD LR++ +     R++T STMD++DW+AIR AD
Sbjct: 1009 ------KPTLKEKGKKVLKEEKKAFDWDCLRREAQARAGIREKTRSTMDTVDWKAIRAAD 1068

Query: 1341 VNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEWLRDVEPDHAKEYLLSIRGLGL 1400
            V E+A  I+ RGMN+ LAERI+ FL+RLV DHGSIDLEWLRDV PD AKEYLLS  GLGL
Sbjct: 1069 VKEVAETIKSRGMNHKLAERIQGFLDRLVNDHGSIDLEWLRDVPPDKAKEYLLSFNGLGL 1128

Query: 1401 KSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYPVLESIQKYLW 1460
            KSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLE+YP+LESIQKYLW
Sbjct: 1129 KSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLEMYPMLESIQKYLW 1188

Query: 1461 PRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFASARLGLPAPE 1520
            PRLCKLDQ+TLYELHYQMITFGKVFCTKSKPNCNACPM+GECRHFASAFASARL LP+ E
Sbjct: 1189 PRLCKLDQKTLYELHYQMITFGKVFCTKSKPNCNACPMKGECRHFASAFASARLALPSTE 1248

Query: 1521 DKRIVSTTECREPDNNQARTIDQPMLSLPPSTISSEEIKPSESHQSDGKTTAGACVPIIE 1580
                        PD N       P+    P     E+      H    K     C PIIE
Sbjct: 1249 -------KGMGTPDKN-------PLPLHLPEPFQREQGSEVVQHSEPAKKVT-CCEPIIE 1308

Query: 1581 EPATPEQETTTQDAIIDIEDGFYEDPDEIPTIKLNIEEFSQNLQNFVQKNMELQEGDMSK 1640
            EPA+PE E T + +I DIE+ F+EDP+EIPTI+LN++ F+ NL+  ++ N ELQ+G+MS 
Sbjct: 1309 EPASPEPE-TAEVSIADIEEAFFEDPEEIPTIRLNMDAFTSNLKKIMEHNKELQDGNMSS 1368

Query: 1641 ALIALTPEAASIPTPKLKNVSRLRTEHQVYELPDNHPLLEKLKLDRREPDDPSSYLLAIW 1700
            AL+ALT E AS+P PKLKN+S+LRTEH+VYELPD HPLL   +L++REPDDP SYLLAIW
Sbjct: 1369 ALVALTAETASLPMPKLKNISQLRTEHRVYELPDEHPLL--AQLEKREPDDPCSYLLAIW 1385

Query: 1701 TPGETANSIELPEKRCSNQEHHQLCCEEECLSCNSVREANSFMVRGTLLIPCRTAMRGSF 1760
            TPGETA+SI+     C  Q +  LC EE C SCNS++E  S +VRGT+LIPCRTAMRGSF
Sbjct: 1429 TPGETADSIQPSVSTCIFQANGMLCDEETCFSCNSIKETRSQIVRGTILIPCRTAMRGSF 1385

Query: 1761 PLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVYFGTSIPTIFKGLSTQGIQHCF 1820
            PLNGTYFQVNEVFADH SSLNPI+VPR+ IW LPRRTVYFGTS+PTIFKGLST+ IQ CF
Sbjct: 1489 PLNGTYFQVNEVFADHASSLNPINVPRELIWELPRRTVYFGTSVPTIFKGLSTEKIQACF 1385

Query: 1821 WRGFVCVRGFDQKTRAPRPLMARLHFPASKL 1842
            W+G+VCVRGFD+KTR P+PL+ARLHFPASKL
Sbjct: 1549 WKGYVCVRGFDRKTRGPKPLIARLHFPASKL 1385

BLAST of PI0016811 vs. ExPASy Swiss-Prot
Match: C7IW64 (Protein ROS1A OS=Oryza sativa subsp. japonica OX=39947 GN=ROS1A PE=1 SV=2)

HSP 1 Score: 921.8 bits (2381), Expect = 1.2e-266
Identity = 569/1162 (48.97%), Positives = 717/1162 (61.70%), Query Frame = 0

Query: 750  IDLLVEQLKHLDINKESNNLGYREQ-ALVPYNMQNQEHSAIVVYGRDGTIVPF-NPIKKR 809
            +D++++++K LDINK  + +      ALVPYN            G  G IVPF   +K++
Sbjct: 814  LDIVIQKIKVLDINKSEDPVTAEPHGALVPYN------------GEFGPIVPFEGKVKRK 873

Query: 810  RPRPKVELDEETGRVWKLLMGNINSKGIDGTDEEKIKWWEEERKVFQGRADSFIARMHLV 869
            R R KV+LD  T  +WKLLMG   S   +G D++K KW  EERK+FQGR DSFIARMHLV
Sbjct: 874  RSRAKVDLDPVTALMWKLLMGPDMSDCAEGMDKDKEKWLNEERKIFQGRVDSFIARMHLV 933

Query: 870  QGDRRFSQWKGSVVDSVVGVFLTQNVSDHLSSSAFMSLAARFPPKPKCRQASCSQEPIIE 929
            QGDRRFS WKGSVVDSVVGVFLTQNVSDHLSSSAFM+LAA+FP KP+  +   +   +  
Sbjct: 934  QGDRRFSPWKGSVVDSVVGVFLTQNVSDHLSSSAFMALAAKFPVKPEASEKPAN--VMFH 993

Query: 930  LDEPEEACVFNLEDSMKLNKQIIHQQISEEGSLMKDEMEKSEGRIIVDNNESSGSNAEDG 989
                   C     +S+KL  +I+ Q+ S   +      +K EG   V+   SS  +  DG
Sbjct: 994  TISENGDCSGLFGNSVKLQGEILVQEASNTAASFITTEDK-EGSNSVELLGSSFGDGVDG 1053

Query: 990  SSN------KEPEKKSFSSSHNILETCSNSVGEISLTETSSMQACLSGEKETYDSFSFQD 1049
            ++       +    +  ++   +++T     G     E  S++  +S E  T  S +  D
Sbjct: 1054 AAGVYSNIYENLPARLHATRRPVVQT-----GNAVEAEDGSLEGVVSSENSTISSQNSSD 1113

Query: 1050 CLDSSIPQTSESIEPSSEGNSEDLPSWSTEAHIDSSSEELIQMTGPNTLNANFTTDTSVE 1109
             L         S+       +ED+ S +      ++  EL++M      +      +   
Sbjct: 1114 YLFHMSDHMFSSM--LLNFTAEDIGSRNMPKATRTTYTELLRMQELKNKSNETIESSEYH 1173

Query: 1110 QSENTTTNKLVEKKCDNRIDDTSQPDDPEISLKNS-------VYHLSDY-QTQQNQTSKS 1169
                + +N +        I    QP    IS   +       + H SD  Q+     ++ 
Sbjct: 1174 GVPVSCSNNIQVLNGIQNIGSKHQPLHSSISYHQTGQVHLPDIVHASDLEQSVYTGLNRV 1233

Query: 1170 LEVDCCQTS------NGVQTSNDCQNKDEHFH-----------TEQSTLTVEYDN-HANV 1229
            L+ +  QTS       G+  +N+ Q  D   +           T  S  T   DN    +
Sbjct: 1234 LDSNVTQTSYYPSPHPGIACNNETQKADSLSNMLYGIDRSDKTTSLSEPTPRIDNCFQPL 1293

Query: 1230 EMELIVDIVEAPSSSSELSINAKEPGLTLQ---------------------SQSSVIEDP 1289
              E +    E  SS + LS N  E     Q                     SQS   +  
Sbjct: 1294 SSEKMSFAREQSSSENYLSRNEAEAAFVKQHGTSNVQGDNTVRTEQNGGENSQSGYSQQD 1353

Query: 1290 QNVESPAECTNTVY--EIPPNATEIATKPNPKECNLLSNEFKELKPA------SSRSQRK 1349
             NV      T+ +Y   +  N    +   +    NL+ N   + K +       S+++R 
Sbjct: 1354 DNVGFQTATTSNLYSSNLCQNQKANSEVLHGVSSNLIENSKDDKKTSPKVPVDGSKAKRP 1413

Query: 1350 QV-AKEKDNINWDNLRKQTETNGKTRQRTESTMDSLDWEAIRCADVNEIAHAIRERGMNN 1409
            +V A +K   +WD LRK+   +   ++R+++  DS+DWE IR A+V EI+  IRERGMNN
Sbjct: 1414 RVGAGKKKTYDWDMLRKEVLYSHGNKERSQNAKDSIDWETIRQAEVKEISDTIRERGMNN 1473

Query: 1410 MLAERIKDFLNRLVKDHGSIDLEWLRDVEPDHAKEYLLSIRGLGLKSVECVRLLTLHHLA 1469
            MLAERIKDFLNRLV+DHGSIDLEWLR V+ D AK+YLLSIRGLGLKSVECVRLLTLHH+A
Sbjct: 1474 MLAERIKDFLNRLVRDHGSIDLEWLRYVDSDKAKDYLLSIRGLGLKSVECVRLLTLHHMA 1533

Query: 1470 FPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELH 1529
            FPVDTNVGRI VRLGWVPLQPLPESLQLHLLE+YP+LE+IQKYLWPRLCKLDQRTLYELH
Sbjct: 1534 FPVDTNVGRICVRLGWVPLQPLPESLQLHLLEMYPMLENIQKYLWPRLCKLDQRTLYELH 1593

Query: 1530 YQMITFGKVFCTKSKPNCNACPMRGECRHFASAFASARLGLPAPEDKRIVSTTECREPDN 1589
            YQMITFGKVFCTKSKPNCNACPMR EC+HFASAFASARL LP PE+K +V++        
Sbjct: 1594 YQMITFGKVFCTKSKPNCNACPMRAECKHFASAFASARLALPGPEEKSLVTS-----GTP 1653

Query: 1590 NQARTIDQPMLSLPPSTISSEEIKPSESHQSDGKTTAGACVPIIEEPATPEQETTTQD-A 1649
              A T  Q  +S  P  +S  E   +  H            PIIEEPA+PE E  T++  
Sbjct: 1654 IAAETFHQTYISSRP-VVSQLEWNSNTCHHGMNNRQ-----PIIEEPASPEPEHETEEMK 1713

Query: 1650 IIDIEDGFYEDPDEIPTIKLNIEEFSQNLQNFVQ-KNMELQEGDMSKALIALTPEAASIP 1709
               IED F +DP+EIPTIKLN EEF+QNL++++Q  N+E+++ DMSKAL+A+TPE ASIP
Sbjct: 1714 ECAIEDSFVDDPEEIPTIKLNFEEFTQNLKSYMQANNIEIEDADMSKALVAITPEVASIP 1773

Query: 1710 TPKLKNVSRLRTEHQVYELPDNHPLLEKLKLDRREPDDPSSYLLAIWTPGETANSIELPE 1769
            TPKLKNVSRLRTEHQVYELPD+HPLLE    ++REPDDP  YLL+IWTPGETA S + P+
Sbjct: 1774 TPKLKNVSRLRTEHQVYELPDSHPLLE--GFNQREPDDPCPYLLSIWTPGETAQSTDAPK 1833

Query: 1770 KRCSNQEHHQLCCEEECLSCNSVREANSFMVRGTLLIPCRTAMRGSFPLNGTYFQVNEVF 1829
              C++QE+ +LC    C SCNS+REA +  VRGTLLIPCRTAMRGSFPLNGTYFQVNEVF
Sbjct: 1834 SVCNSQENGELCASNTCFSCNSIREAQAQKVRGTLLIPCRTAMRGSFPLNGTYFQVNEVF 1893

Query: 1830 ADHESSLNPIDVPRDWIWNLPRRTVYFGTSIPTIFKGLSTQGIQHCFWRGFVCVRGFDQK 1846
            ADH+SS NPIDVPR WIWNLPRRTVYFGTSIPTIFKGL+T+ IQHCFWRGFVCVRGFD+ 
Sbjct: 1894 ADHDSSRNPIDVPRSWIWNLPRRTVYFGTSIPTIFKGLTTEEIQHCFWRGFVCVRGFDRT 1940

BLAST of PI0016811 vs. ExPASy Swiss-Prot
Match: B8YIE8 (Protein ROS1C OS=Oryza sativa subsp. japonica OX=39947 GN=ROS1C PE=2 SV=2)

HSP 1 Score: 875.5 bits (2261), Expect = 1.0e-252
Identity = 558/1221 (45.70%), Positives = 717/1221 (58.72%), Query Frame = 0

Query: 660  EKQPIYPTNHPPPAGSAKNINTSGTCINGLFEMMHATVAKKKRTKKKPSNSALLNINKDL 719
            +K  +   N   P    KN  +S T  +G F  +  +    ++T  +  +     IN D+
Sbjct: 701  QKARLNSPNSIQPNIDQKNRFSSETVFSGGFNGLKRSEETFQKTLPQIPDDK--RINLDI 760

Query: 720  QDRRFVSFNPWQFFPKTLGTASEHGNQICFIDLLVEQLKHLDINKESNNLGYREQALVPY 779
              +  V  +P    P  +       ++  + DL  EQ     ++K   +L     +L   
Sbjct: 761  HCKVPVESSPNTSTPPYMDYLQGVTSKFRYFDLNTEQ-----VHKTEMHLSQTMPSLSSL 820

Query: 780  NMQNQEHSAIVVYGRDGTIVP----FNPIKKRRPRPKVELDEETGRVWKLLMGNINSKGI 839
               N   +A+V Y   G +VP    F+ +KK+RPR KV+LD ET RVW LLMG   +  +
Sbjct: 821  GATNYLPNALVPY-VGGAVVPYQTQFHLVKKQRPRAKVDLDFETTRVWNLLMGKA-ADPV 880

Query: 840  DGTDEEKIKWWEEERKVFQGRADSFIARMHLVQGDRRFSQWKGSVVDSVVGVFLTQNVSD 899
            DGTD +K +WW++ER+VFQGRA+SFIARM LVQGDRRFS WKGSVVDSVVGVFLTQNV+D
Sbjct: 881  DGTDVDKERWWKQEREVFQGRANSFIARMRLVQGDRRFSPWKGSVVDSVVGVFLTQNVAD 940

Query: 900  HLSSSAFMSLAARFP--PKPKCRQASCSQ--EPIIELDEPEEACVF-------------N 959
            HLSSSA+M+LAA FP      C      Q  E II      +   F             N
Sbjct: 941  HLSSSAYMALAASFPTGSHGNCNDGIAGQDNEEIISTSAVGDRGTFEFFYNGSRPDIGLN 1000

Query: 960  LEDSMKLNKQIIHQQISEEGSLMKDEMEKSEGRIIVDNNESSGSNAEDGSSNKEPEKKSF 1019
             E SM   K  IH +  +  ++  +E+ K E    +   ES+GS  +  +          
Sbjct: 1001 FEFSMACEK--IHMEPKDNTTV--NELTKGE-NYSLHCKESAGSLCDHETE--------- 1060

Query: 1020 SSSHNILETCSNSVGEISLTETSSMQACLSGEKETYDSFSFQDCLDSSIPQTSESIEPSS 1079
                  ++  + S+ + S  E   + AC+     T   F  +  L  S+  +   ++P  
Sbjct: 1061 ------IDHKAKSISDFSAVE---LTACMKNLHAT--QFQKEISLSQSVVTSESILQPG- 1120

Query: 1080 EGNSEDLPSWSTEAHIDSSSEELIQMTGPNTLNANFTTDTSVEQSENTTTNKLVEKKCDN 1139
                  LP  S   H   +    I  T    + +NF    S+    + T N+        
Sbjct: 1121 ------LPLSSGMDHARRNFVGSISDTASQQVGSNFDDGKSL-TGNDVTANETEYHGIKA 1180

Query: 1140 RIDDTSQPDDPEISLKNSVY--------HLSDYQTQQNQTSKSLEVDCCQTSNGVQTSND 1199
               +    D+P I   +S+Y        H  D +   + +S S     C  S+  +    
Sbjct: 1181 AATNNYVVDEPGIPSGSSLYPFFSAIDCHQLDGRNDTHVSSTSPNCSICSASSNFKIGT- 1240

Query: 1200 CQNKDEHFHTEQSTLTVEYDNH-ANVEMELIVDI-VEAPSSSSELSINAKEPGLTLQSQS 1259
                      E S+L + +D H A     +IVD  + +   S+EL +     G     ++
Sbjct: 1241 --------IEENSSLFMPFDAHLAQRNGNMIVDTNLSSALESTELPVKLLHCGKRSCYEA 1300

Query: 1260 SVIEDPQNVESPAECTNTVYEIPPNATEIATKPNPKECNLLSNEFKEL-KPASSRSQRKQ 1319
            S  +D +++ +       + E    A +   K      N L +   +  KP  SR+  K 
Sbjct: 1301 SEFQDHESLYATG---GVIPETATKADDSTLKSGFASFNGLPDTAAQASKPKKSRTTSK- 1360

Query: 1320 VAKEKDNINWDNLRKQTETNGKTRQRTESTMDSLDWEAIRCADVNEIAHAIRERGMNNML 1379
              K  +N +WD LR+Q   N + ++R     DS+DWEA+RCADV  I+HAIRERGMNN+L
Sbjct: 1361 --KNSENFDWDKLRRQACGNYQMKERIFDRRDSVDWEAVRCADVQRISHAIRERGMNNVL 1420

Query: 1380 AERIKDFLNRLVKDHGSIDLEWLRDVEPDHAKEYLLSIRGLGLKSVECVRLLTLHHLAFP 1439
            AERI+ FLNRLV DHGSIDLEWLRDV PD AK+YLLSIRGLGLKSVECVRLLTLHHLAFP
Sbjct: 1421 AERIQKFLNRLVTDHGSIDLEWLRDVPPDSAKDYLLSIRGLGLKSVECVRLLTLHHLAFP 1480

Query: 1440 VDTNVGRIAVRLGWVPLQPLPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQ 1499
            VDTNVGRI VRLGWVP+QPLPESLQLHLLELYPVLE+IQKYLWPRLCKLDQ+TLYELHYQ
Sbjct: 1481 VDTNVGRICVRLGWVPIQPLPESLQLHLLELYPVLETIQKYLWPRLCKLDQQTLYELHYQ 1540

Query: 1500 MITFGKVFCTKSKPNCNACPMRGECRHFASAFASARLGLPAPEDKRIVSTTECREPDNNQ 1559
            MITFGKVFCTKSKPNCNACPMR ECRHFASAFASARL LP+P+DKR+V+ +      N  
Sbjct: 1541 MITFGKVFCTKSKPNCNACPMRSECRHFASAFASARLALPSPQDKRLVNLSNQFAFHNGT 1600

Query: 1560 ARTIDQPMLSLPPSTISSEEIKPSESHQSDGKTTAGACVPIIEEPATPEQETTTQDAIID 1619
              T +   L     +I + ++  + ++            PIIEEPA+P +E   +    D
Sbjct: 1601 MPTPNSTPLPQLEGSIHARDVHANNTN------------PIIEEPASPREEECRELLEND 1660

Query: 1620 IEDGFYEDPDEIPTIKLNIEEFSQNLQNFV-QKNMELQEGDMSKALIALTPEAASIPTPK 1679
            IED F ED DEIP IKLN+E FSQNL+N + + N + Q  D++KAL+A++ EAASIP PK
Sbjct: 1661 IED-FDEDTDEIPIIKLNMEAFSQNLENCIKESNKDFQSDDITKALVAISNEAASIPVPK 1720

Query: 1680 LKNVSRLRTEHQVYELPDNHPLLEKLKLDRREPDDPSSYLLAIWTPGETANSIELPEKRC 1739
            LKNV RLRTEH VYELPD+HPL+++L LD+REPDDPS YLLAIWTP E  ++ E P+  C
Sbjct: 1721 LKNVHRLRTEHYVYELPDSHPLMQQLALDQREPDDPSPYLLAIWTPDELKDTREAPKPCC 1780

Query: 1740 SNQEHHQLCCEEECLSCNSVREANSFMVRGTLLIPCRTAMRGSFPLNGTYFQVNEVFADH 1799
            + Q    LC  E C +C S RE     VRGT+L+PCRTAMRGSFPLNGTYFQVNEVFADH
Sbjct: 1781 NPQTEGGLCSNEMCHNCVSERENQYRYVRGTVLVPCRTAMRGSFPLNGTYFQVNEVFADH 1840

Query: 1800 ESSLNPIDVPRDWIWNLPRRTVYFGTSIPTIFKGLSTQGIQHCFWRGFVCVRGFDQKTRA 1848
             SS NPI++PR+ +WNL RR VYFGTS+PTIFKGL+T+ IQHCFWRGFVCVRGF+ +TRA
Sbjct: 1841 SSSHNPINIPREQLWNLHRRMVYFGTSVPTIFKGLTTEEIQHCFWRGFVCVRGFNMETRA 1851

BLAST of PI0016811 vs. ExPASy Swiss-Prot
Match: Q9SR66 (DEMETER-like protein 2 OS=Arabidopsis thaliana OX=3702 GN=DML2 PE=3 SV=2)

HSP 1 Score: 731.9 bits (1888), Expect = 1.8e-209
Identity = 483/1164 (41.49%), Positives = 626/1164 (53.78%), Query Frame = 0

Query: 702  RTKKKPSNSALLNINKDLQDRRFVSFNPWQFFPKTLGTASEHGNQICFIDLLVEQLKHLD 761
            R K+   N      N  + D ++   NP      T  + ++   +   ID + +  + LD
Sbjct: 405  RKKRSQRNRVASQFNARILDLQWRRQNP------TGTSLADIWERSLTIDAITKLFEELD 464

Query: 762  INKESNNLGY-REQALVPYNMQNQEHSAIVVYGRDGTIVPFNPIKKRRPRPKVELDEETG 821
            INKE   L + RE AL+ Y    +E  AIV Y              ++ +PKV+LD ET 
Sbjct: 465  INKEGLCLPHNRETALILYKKSYEEQKAIVKY-------------SKKQKPKVQLDPETS 524

Query: 822  RVWKLLMGNINSKGIDGTDEEKIKWWEEERKVFQGRADSFIARMHLVQGDRRFSQWKGSV 881
            RVWKLLM +I+  G+DG+DEEK KWWEEER +F GRA+SFIARM +VQG+R FS WKGSV
Sbjct: 525  RVWKLLMSSIDCDGVDGSDEEKRKWWEEERNMFHGRANSFIARMRVVQGNRTFSPWKGSV 584

Query: 882  VDSVVGVFLTQNVSDHLSSSAFMSLAARFPPKPKCRQASCSQEPIIELDEPEEACVFNLE 941
            VDSVVGVFLTQNV+DH SSSA+M LAA FP +    + SC +E         +  + NL+
Sbjct: 585  VDSVVGVFLTQNVADHSSSSAYMDLAAEFPVEWNFNKGSCHEE---WGSSVTQETILNLD 644

Query: 942  DSMKLNKQIIHQQISEEGSLMKDEMEKSEGRIIVDNNESSGSNAEDGSSNKEPEKKSFSS 1001
                ++   I                ++  R+I++  +    N  D   ++E  K S   
Sbjct: 645  PRTGVSTPRI----------------RNPTRVIIEEIDDD-ENDIDAVCSQESSKTS--- 704

Query: 1002 SHNILETCSNSVGEISLTETSSMQACLSGEKETYDSFSFQDCLDSSIPQTSESIEPSSEG 1061
                                                       DSSI             
Sbjct: 705  -------------------------------------------DSSI------------- 764

Query: 1062 NSEDLPSWSTEAHIDSSSEELIQMTGPNTLNANFTTDTSVEQSENTTTNKLVEKKCDNRI 1121
                                                 TS +QS+    +        N +
Sbjct: 765  -------------------------------------TSADQSKTMLLDPF------NTV 824

Query: 1122 DDTSQPDDPEISLKNSVYHLSDYQTQQNQTSKSLEVDCCQTSNGVQTSNDCQNKDEHFHT 1181
                Q D   +  K  + +  D     N  S+                            
Sbjct: 825  LMNEQVDSQMVKGKGHIPYTDDL----NDLSQG--------------------------- 884

Query: 1182 EQSTLTVEYDNHANVEMELIVDIVEAPSSSSELSINAKEPGLTLQSQSSVIEDPQNVESP 1241
                                + +V + S+  EL++N   P + L S     E     +  
Sbjct: 885  --------------------ISMVSSASTHCELNLNEVPPEVELCSHQQDPESTIQTQDQ 944

Query: 1242 AECTNTVYEIPPNATEIATKPNPKECNLLSNEFKELKPASSRSQRKQVAKEKDNINWDNL 1301
             E T T  ++  N  +  T   PK+                +S+    + +K +++WD+L
Sbjct: 945  QESTRT-EDVKKNRKK-PTTSKPKK----------------KSKESAKSTQKKSVDWDSL 1004

Query: 1302 RKQTETNGKTRQRTESTMDSLDWEAIRCADVNEIAHAIRERGMNNMLAERIKDFLNRLVK 1361
            RK+ E+ G+ R+RTE TMD++DW+A+RC DV++IA+ I +RGMNNMLAERIK FLNRLVK
Sbjct: 1005 RKEAESGGRKRERTERTMDTVDWDALRCTDVHKIANIIIKRGMNNMLAERIKAFLNRLVK 1064

Query: 1362 DHGSIDLEWLRDVEPDHAKEYLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLG 1421
             HGSIDLEWLRDV PD AKEYLLSI GLGLKSVECVRLL+LH +AFPVDTNVGRIAVRLG
Sbjct: 1065 KHGSIDLEWLRDVPPDKAKEYLLSINGLGLKSVECVRLLSLHQIAFPVDTNVGRIAVRLG 1124

Query: 1422 WVPLQPLPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSK 1481
            WVPLQPLP+ LQ+HLLELYPVLES+QKYLWPRLCKLDQ+TLYELHY MITFGKVFCTK K
Sbjct: 1125 WVPLQPLPDELQMHLLELYPVLESVQKYLWPRLCKLDQKTLYELHYHMITFGKVFCTKVK 1184

Query: 1482 PNCNACPMRGECRHFASAFASARLGLPAPEDKRIVSTTECREPDNNQARTIDQP-MLSLP 1541
            PNCNACPM+ ECRH++SA ASARL LP PE+    S         ++ R+  +P +++  
Sbjct: 1185 PNCNACPMKAECRHYSSARASARLALPEPEESDRTSVM------IHERRSKRKPVVVNFR 1244

Query: 1542 PSTISSEEIKPSESHQSDGKTTAGACVPIIEEPATPEQETTTQDAIIDIED--------G 1601
            PS    +E K  E+ +S        C PIIEEPA+PE E        DIED        G
Sbjct: 1245 PSLFLYQE-KEQEAQRSQN------CEPIIEEPASPEPEYIEH----DIEDYPRDKNNVG 1304

Query: 1602 FYEDP----DEIPTIKLNIEEFSQNLQNFVQKNMELQEGDMSKALIALTPEAASIPTPKL 1661
              EDP    D IPTI LN +E   +    V K     E   S  L+ L+  AA+IP  KL
Sbjct: 1305 TSEDPWENKDVIPTIILN-KEAGTSHDLVVNK-----EAGTSHDLVVLSTYAAAIPRRKL 1332

Query: 1662 KNVSRLRTEHQVYELPDNHPLLEKLKLDRREPDDPSSYLLAIWTPGETANSIELPEKRCS 1721
            K   +LRTEH V+ELPD+H +LE    +RRE +D   YLLAIWTPGET NSI+ P++RC+
Sbjct: 1365 KIKEKLRTEHHVFELPDHHSILE--GFERREAEDIVPYLLAIWTPGETVNSIQPPKQRCA 1332

Query: 1722 -NQEHHQLCCEEECLSCNSVREANSFMVRGTLLIPCRTAMRGSFPLNGTYFQVNEVFADH 1781
              + ++ LC E +C  CN  RE  S  VRGT+LIPCRTAMRG FPLNGTYFQ NEVFADH
Sbjct: 1425 LFESNNTLCNENKCFQCNKTREEESQTVRGTILIPCRTAMRGGFPLNGTYFQTNEVFADH 1332

Query: 1782 ESSLNPIDVPRDWIWNLPRRTVYFGTSIPTIFKGLSTQGIQHCFWRGFVCVRGFDQKTRA 1841
            +SS+NPIDVP + IW+L RR  Y G+S+ +I KGLS + I++ F  G+VCVRGFD++ R 
Sbjct: 1485 DSSINPIDVPTELIWDLKRRVAYLGSSVSSICKGLSVEAIKYNFQEGYVCVRGFDRENRK 1332

Query: 1842 PRPLMARLHFPASKLNRGRGKTED 1851
            P+ L+ RLH     + R + KTE+
Sbjct: 1545 PKSLVKRLHCSHVAI-RTKEKTEE 1332

BLAST of PI0016811 vs. ExPASy TrEMBL
Match: A0A0A0LAQ7 (ENDO3c domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G748840 PE=3 SV=1)

HSP 1 Score: 3489.9 bits (9048), Expect = 0.0e+00
Identity = 1761/1851 (95.14%), Positives = 1792/1851 (96.81%), Query Frame = 0

Query: 1    MDSGQPEGNKADVQGGSWIPATPMKPILPKPPLQPLIYARMDRNQPRPYWLGSERLFSNS 60
            MDSGQPEGNKADVQG SWIPATPMKPILPKPPLQPLIYARMDRNQPRPYWLG ERLFSNS
Sbjct: 1    MDSGQPEGNKADVQGSSWIPATPMKPILPKPPLQPLIYARMDRNQPRPYWLGPERLFSNS 60

Query: 61   NKEAETSSGVACYGGANFMTANGSNDWEAAQARQFQVARNDNGTVTIHSMDALGGIPFLQ 120
            +KEAETSSGVACYGGAN MTANGSNDWEAAQARQFQVA NDNGTVTIHSMDALGGIPFLQ
Sbjct: 61   DKEAETSSGVACYGGANSMTANGSNDWEAAQARQFQVACNDNGTVTIHSMDALGGIPFLQ 120

Query: 121  LMALADAASIVGADAALGGNASDLFDSGSSYQIELESSSMKDRLSGSCIPEAKEYGTSDH 180
            LMALADAASIVGADAALGGNASDLFDSGSSYQIELESSSMKDRLSGSCIPEAKEY TSDH
Sbjct: 121  LMALADAASIVGADAALGGNASDLFDSGSSYQIELESSSMKDRLSGSCIPEAKEYETSDH 180

Query: 181  GSQHAYDLNFPSGTESDAAGIRVTSQFAPLTPDMGKCKYTERGMELQQIPIENSQDERER 240
            GSQHA+DLNFPS TESDAAGIRVTSQFAPLTPDMGK KYTERGMELQQIP ENSQDERE 
Sbjct: 181  GSQHAHDLNFPSRTESDAAGIRVTSQFAPLTPDMGKIKYTERGMELQQIPTENSQDEREL 240

Query: 241  NHNCNTSITVDGENLRQNQELLEPAMHSTINCTPDGKEGKNDGDLNKTPASRQRRRKHRP 300
            NHNCNTSITVDGENLRQNQELLEPAMHSTINCTPDGKEGKNDGDLNKTPASRQRRRKHRP
Sbjct: 241  NHNCNTSITVDGENLRQNQELLEPAMHSTINCTPDGKEGKNDGDLNKTPASRQRRRKHRP 300

Query: 301  KVIVEGKTNRTKQILKTPSSNQSVRKRVRKSGLTKPSATPPIEVTGETSEQEMVKHRRKS 360
            KVIVEGKTNRTKQ LKTPSSN SVRKRVRKSGL KPSATP IEVTGETSEQE+VKHRRKS
Sbjct: 301  KVIVEGKTNRTKQNLKTPSSNPSVRKRVRKSGLAKPSATPSIEVTGETSEQEIVKHRRKS 360

Query: 361  CRRAINFDSQAQTRDGSLDSGPLEQGSLTQNIQSTTGLEEARLEEVGSSTDPNWSMNQMP 420
            CRRAI FDSQAQTRD SLD GPLEQGSLTQNIQSTTGLEE R+EEVGSSTDPNWSMNQM 
Sbjct: 361  CRRAITFDSQAQTRDESLDLGPLEQGSLTQNIQSTTGLEEVRIEEVGSSTDPNWSMNQML 420

Query: 421  KKYESLSEKQAPPTKLSAENNSSERKQPSKSQMENNIEQNGKIISNSDKENTVETILNDD 480
            KKYESLSEK+APPT+LSAEN+SSE+ QPSKSQ EN+ EQNGK+IS+SDKENTVETILND+
Sbjct: 421  KKYESLSEKEAPPTELSAENDSSEQTQPSKSQKENDTEQNGKVISSSDKENTVETILNDE 480

Query: 481  NHSLPGNSHGLIFCKNPPLTSIEQATCCLRKRPRAIKQAHTGSINLTGAHYNTLSAYQSM 540
            NHSLPGNSHGLIFCKNPPLTSIEQATCCLRKRPRAIKQAHTGSINLTGAHYNTLSAYQSM
Sbjct: 481  NHSLPGNSHGLIFCKNPPLTSIEQATCCLRKRPRAIKQAHTGSINLTGAHYNTLSAYQSM 540

Query: 541  SWMHFPHIYKKKRTEKGQNPVPSSAFTTATNFIRPESACSFNDPQRDHMVSKFNAWIAGP 600
            SWMHFPHIYKKKRTEKGQNP+PSSAF TATNF RPESACSFNDPQRDH+VSKFN WI GP
Sbjct: 541  SWMHFPHIYKKKRTEKGQNPIPSSAFATATNFTRPESACSFNDPQRDHVVSKFNTWIPGP 600

Query: 601  QFNICKSKSVAGHGGNDLQDKLQTYGGIVGLDQTGRTKKKPRTAKRLSGLAPPERITHWE 660
            QFNICKSK+VAGH GN+LQDKLQT GGIVGL QTGRTKKKPRTAKRLS  A PERI+HWE
Sbjct: 601  QFNICKSKTVAGHEGNNLQDKLQTCGGIVGLGQTGRTKKKPRTAKRLSSSARPERISHWE 660

Query: 661  KQPIYPTNHPPPAGSAKNINTSGTCINGLFEMMHATVAKKKRTKKKPSNSALLNINKDLQ 720
            KQPIYPTNHPPPAGSAKNINTSGTCINGLFE+MHATVAKKKRTKKKPSNSALLNINKDLQ
Sbjct: 661  KQPIYPTNHPPPAGSAKNINTSGTCINGLFEIMHATVAKKKRTKKKPSNSALLNINKDLQ 720

Query: 721  DRRFVSFNPWQFFPKTLGTASEHGNQICFIDLLVEQLKHLDINKESNNLGYREQALVPYN 780
            DRRFVSF+PWQFFPKTLGT SEHGNQICFIDL+ EQLKHLDINKESNNLGYREQAL+PYN
Sbjct: 721  DRRFVSFSPWQFFPKTLGTDSEHGNQICFIDLIAEQLKHLDINKESNNLGYREQALIPYN 780

Query: 781  MQNQEHSAIVVYGRDGTIVPFNPIKKRRPRPKVELDEETGRVWKLLMGNINSKGIDGTDE 840
            MQNQEH+AIVVYGRDGTIVPFNPIKKRRPRPKVELDEETGRVWKLLMGNINSKGIDGTDE
Sbjct: 781  MQNQEHNAIVVYGRDGTIVPFNPIKKRRPRPKVELDEETGRVWKLLMGNINSKGIDGTDE 840

Query: 841  EKIKWWEEERKVFQGRADSFIARMHLVQGDRRFSQWKGSVVDSVVGVFLTQNVSDHLSSS 900
            E IKWWEEERKVFQGRADSFIARMHLVQGDRRFSQWKGSVVDSVVGVFLTQNVSDHLSSS
Sbjct: 841  ENIKWWEEERKVFQGRADSFIARMHLVQGDRRFSQWKGSVVDSVVGVFLTQNVSDHLSSS 900

Query: 901  AFMSLAARFPPKPKCRQASCSQEPIIELDEPEEACVFNLEDSMKLNKQIIHQQISEEGSL 960
            AFMSLAARFPPK KCRQASCSQEPIIELDEPEEAC+FNLEDSMKLNKQIIHQQISEE  L
Sbjct: 901  AFMSLAARFPPKSKCRQASCSQEPIIELDEPEEACMFNLEDSMKLNKQIIHQQISEEDLL 960

Query: 961  MKDEMEKSEGRIIVDNNESSGSNAEDGSSNKEPEKKSFSSSHNILETCSNSVGEISLTET 1020
            MKDEMEK EGRIIV+NNESSGSN EDGSSNKEPEKKSFSSSHNILETCSNSVGEISLTET
Sbjct: 961  MKDEMEKGEGRIIVENNESSGSNVEDGSSNKEPEKKSFSSSHNILETCSNSVGEISLTET 1020

Query: 1021 SSMQACLSGEKETYDSFSFQDCLDSSIPQTSESIEPSSEGNSEDLPSWSTEAHIDSSSEE 1080
            SSMQACLSGEKETYDSFS QDCLDSSIPQT+ES+EPSSEGNSEDLPSWSTEAHIDSSSEE
Sbjct: 1021 SSMQACLSGEKETYDSFSSQDCLDSSIPQTNESVEPSSEGNSEDLPSWSTEAHIDSSSEE 1080

Query: 1081 LIQMTGPNTLNANFTTDTSVEQSENTTTNKLVEKKCDNRIDDTSQPDDPEISLKNSVYHL 1140
            L QMTG NTLNANFT DT VEQSENT TNKLVE KCDNRIDDTSQP DPEISLKNSVYHL
Sbjct: 1081 LTQMTGLNTLNANFTIDTCVEQSENTITNKLVENKCDNRIDDTSQPVDPEISLKNSVYHL 1140

Query: 1141 SDYQTQQNQTSKSLEVDCCQTSNGVQTSNDCQNKDEHFHTEQSTLTVEYDNHANVEMELI 1200
            S YQTQQNQTSKSLEVDCCQTSNGVQTSNDCQNKDE FHTEQSTLTVE DNHA VEMELI
Sbjct: 1141 SGYQTQQNQTSKSLEVDCCQTSNGVQTSNDCQNKDEQFHTEQSTLTVESDNHAIVEMELI 1200

Query: 1201 VDIVEAPSSSSELSINAKEPGLTLQSQSSVIEDPQNVESPAECTNTVYEIPPNATEIATK 1260
            VDIVEAPSSSSELSINAKEP LTLQSQSSVIEDPQNVESPAECTNTV+EIPPNATEIATK
Sbjct: 1201 VDIVEAPSSSSELSINAKEPCLTLQSQSSVIEDPQNVESPAECTNTVHEIPPNATEIATK 1260

Query: 1261 PNPKECNLLSNEFKELKPASSRSQRKQVAKEKDNINWDNLRKQTETNGKTRQRTESTMDS 1320
            PNPKECNLLSNEFKELKPASSRSQ KQVAKEKDNINWDNLRK+TETNGKTRQRTE TMDS
Sbjct: 1261 PNPKECNLLSNEFKELKPASSRSQSKQVAKEKDNINWDNLRKRTETNGKTRQRTEDTMDS 1320

Query: 1321 LDWEAIRCADVNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEWLRDVEPDHAKE 1380
            LDWEAIRCADVNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEWLRDVEPD AKE
Sbjct: 1321 LDWEAIRCADVNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEWLRDVEPDQAKE 1380

Query: 1381 YLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYP 1440
            YLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYP
Sbjct: 1381 YLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYP 1440

Query: 1441 VLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFA 1500
            VLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFA
Sbjct: 1441 VLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFA 1500

Query: 1501 SARLGLPAPEDKRIVSTTECREPDNNQARTIDQPMLSLPPSTISSEEIKPSESHQSDGKT 1560
            SARLGLPAPEDKRIVSTTECREPDNNQ RTIDQPMLSLPPSTISS EIKPSESHQSDGKT
Sbjct: 1501 SARLGLPAPEDKRIVSTTECREPDNNQPRTIDQPMLSLPPSTISSVEIKPSESHQSDGKT 1560

Query: 1561 TAGACVPIIEEPATPEQETTTQDAIIDIEDGFYEDPDEIPTIKLNIEEFSQNLQNFVQKN 1620
            TAGACVPIIEEPATPEQET TQDAIIDIED FYEDPDEIPTIKLNIEEFSQNLQN+VQKN
Sbjct: 1561 TAGACVPIIEEPATPEQETATQDAIIDIEDAFYEDPDEIPTIKLNIEEFSQNLQNYVQKN 1620

Query: 1621 MELQEGDMSKALIALTPEAASIPTPKLKNVSRLRTEHQVYELPDNHPLLEKLKLDRREPD 1680
            MELQEGDMSKALIALTPEAASIPTPKLKNVSRLRTEHQVYELPDNHPLLEKLKLDRREPD
Sbjct: 1621 MELQEGDMSKALIALTPEAASIPTPKLKNVSRLRTEHQVYELPDNHPLLEKLKLDRREPD 1680

Query: 1681 DPSSYLLAIWTPGETANSIELPEKRCSNQEHHQLCCEEECLSCNSVREANSFMVRGTLLI 1740
            DPSSYLLAIWTPGETANSI+LPEKRCS+QEHHQLCCEEECLSCNSVREANSFMVRGTLLI
Sbjct: 1681 DPSSYLLAIWTPGETANSIQLPEKRCSSQEHHQLCCEEECLSCNSVREANSFMVRGTLLI 1740

Query: 1741 PCRTAMRGSFPLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVYFGTSIPTIFKG 1800
            PCRTAMRGSFPLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVYFGTSIPTIFKG
Sbjct: 1741 PCRTAMRGSFPLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVYFGTSIPTIFKG 1800

Query: 1801 LSTQGIQHCFWRGFVCVRGFDQKTRAPRPLMARLHFPASKLNRGRGKTEDQ 1852
            LSTQGIQHCFWRGFVCVRGFDQKTRAPRPLMARLHFPASKLNRGRGKTEDQ
Sbjct: 1801 LSTQGIQHCFWRGFVCVRGFDQKTRAPRPLMARLHFPASKLNRGRGKTEDQ 1851

BLAST of PI0016811 vs. ExPASy TrEMBL
Match: A0A1S4DUL3 (protein ROS1 OS=Cucumis melo OX=3656 GN=LOC103486570 PE=3 SV=1)

HSP 1 Score: 3458.7 bits (8967), Expect = 0.0e+00
Identity = 1747/1852 (94.33%), Positives = 1783/1852 (96.27%), Query Frame = 0

Query: 1    MDSGQPEGNKADVQGGSWIPATPMKPILPKPPLQPLIYARMDRNQPRPYWLGSERLFSNS 60
            MDSGQPEGNKADVQGGSWIPATPMKPILPKPPLQPLIYARMDRNQPRPYWLGSERLFSNS
Sbjct: 1    MDSGQPEGNKADVQGGSWIPATPMKPILPKPPLQPLIYARMDRNQPRPYWLGSERLFSNS 60

Query: 61   NKEAETSSGVACYGGANFMTANGSNDWEAAQARQFQVARNDNGTVTIHSMDALGGIPFLQ 120
            NKEAETSSGVACYGGAN MTANGSN WEAAQARQFQVARNDNGTVTIHSMDALGGIPFLQ
Sbjct: 61   NKEAETSSGVACYGGANSMTANGSNYWEAAQARQFQVARNDNGTVTIHSMDALGGIPFLQ 120

Query: 121  LMALADAASIVGADAALGGNASDLFDSGSSYQIELESSSMKDRLSGSCIPEAKEYGTSDH 180
            LMALADAASIVGADAALGGNASDLFDSGSSYQIELESSSMKDR  GSC+PEAKEY  S+H
Sbjct: 121  LMALADAASIVGADAALGGNASDLFDSGSSYQIELESSSMKDRFGGSCVPEAKEYVPSEH 180

Query: 181  GSQHAYDLNFPSGTESDAAGIRVTSQFAPLTPDMGKCKYTERGMELQQIPIENSQDERER 240
            GSQ+A++LNFPS  ESDAAGIRVTSQFAPLTPDMGK KY ERG ELQQI IENSQDERE+
Sbjct: 181  GSQYAHELNFPSRIESDAAGIRVTSQFAPLTPDMGKSKYIERGTELQQILIENSQDEREQ 240

Query: 241  NHNCNTSITVDGENLRQNQELLEPAMHSTINCTPDGKEGKNDGDLNKTPASRQRRRKHRP 300
            NHNCNTSITV GEN+ QNQ+LLEPAMHSTINCTPDGKEGKNDGDLNKTPASRQRRRKHRP
Sbjct: 241  NHNCNTSITV-GENVTQNQKLLEPAMHSTINCTPDGKEGKNDGDLNKTPASRQRRRKHRP 300

Query: 301  KVIVEGKTNRTKQILKTPSSNQSVRKRVRKSGLTKPSATPPIEVTGETSEQEMVKHRRKS 360
            KVIVEGKTNRTKQ LKTPSSN S RKRVRKSGL KPSATP IEVTGETSEQEMVKHRRKS
Sbjct: 301  KVIVEGKTNRTKQNLKTPSSNPSARKRVRKSGLAKPSATPSIEVTGETSEQEMVKHRRKS 360

Query: 361  CRRAINFDSQAQTRDGSLDSGPLEQGSLTQNIQSTTGLEEARLEEVGSSTDPNWSMNQMP 420
            CRRAI FDSQAQTRDGSLDSGPLEQGSLTQN QSTTGLE  RLEEVGSSTDPNWSMNQM 
Sbjct: 361  CRRAITFDSQAQTRDGSLDSGPLEQGSLTQNSQSTTGLEVVRLEEVGSSTDPNWSMNQML 420

Query: 421  KKYESLSEKQAPPTKLSAENNSSERKQPSKSQMENNIEQNGKIISNSDKENTVETILNDD 480
            KKYESLSEK+APPT+LSAENNSSE+KQPSKSQMENN EQNGK+ISNSDKENTVE I NDD
Sbjct: 421  KKYESLSEKEAPPTELSAENNSSEQKQPSKSQMENNTEQNGKVISNSDKENTVEAIPNDD 480

Query: 481  NHSLPGNSHGLIFCKNPPLTSIEQATCCLRKRPRAIKQAHTGSINLTGAHYNTLSAYQSM 540
            NHSLPGNSHGLIFCKNPPLTSIEQATCCLRKRPRAIKQAHTGSINLTG HYNTLSAYQSM
Sbjct: 481  NHSLPGNSHGLIFCKNPPLTSIEQATCCLRKRPRAIKQAHTGSINLTGVHYNTLSAYQSM 540

Query: 541  SWMHFPHIYKKKRTEKGQNPVPSSAFTTATNFIRPESACSFNDPQRDHMVSKFNAWIAGP 600
            SWMHFPHIYKKKRTEKGQNPVPSSAFTTATNF RPESACSFNDPQRDHMVSKFNAWI+GP
Sbjct: 541  SWMHFPHIYKKKRTEKGQNPVPSSAFTTATNFTRPESACSFNDPQRDHMVSKFNAWISGP 600

Query: 601  QFNICKSKSVAGHGGNDLQDKLQTYGGIVGLDQTGRTKKKPRTAKRLSGLAPPERITHWE 660
            QFN+CKSK+VAGHGGNDLQDKLQTYGGIVGL QTGRTKKKPRTAKR+S LAPPERI+HWE
Sbjct: 601  QFNVCKSKTVAGHGGNDLQDKLQTYGGIVGLGQTGRTKKKPRTAKRVSSLAPPERISHWE 660

Query: 661  KQPIYPTNHPPPAGSAKNINTSGTCINGLFEMMHATVAKKKRTKKKPSNSALLNINKDLQ 720
            KQP YPTNHPPPAGSAKNINTSGTC+NGLFEMMHATVAKKKRTKKKPSNS LLNINKDL+
Sbjct: 661  KQPTYPTNHPPPAGSAKNINTSGTCVNGLFEMMHATVAKKKRTKKKPSNSTLLNINKDLE 720

Query: 721  DRRFVSFNPWQFFPKTLGTASEHGNQICFIDLLVEQLKHLDINKESNNLGYREQALVPYN 780
            DRRFVSF+PWQFFPKTLGTASEHGNQICFIDL+ EQLKHLDINKESNNLGYREQALVP+N
Sbjct: 721  DRRFVSFSPWQFFPKTLGTASEHGNQICFIDLIAEQLKHLDINKESNNLGYREQALVPFN 780

Query: 781  MQNQEHSAIVVYGRDGTIVPFNPIKKRRPRPKVELDEETGRVWKLLMGNINSKGIDGTDE 840
            MQNQE +AIVVYGRDGTIVPFNPIKKRRPRPKVELDEETGRVWKLLMGNINS GIDGTDE
Sbjct: 781  MQNQELNAIVVYGRDGTIVPFNPIKKRRPRPKVELDEETGRVWKLLMGNINSTGIDGTDE 840

Query: 841  EKIKWWEEERKVFQGRADSFIARMHLVQGDRRFSQWKGSVVDSVVGVFLTQNVSDHLSSS 900
            E IKWWEEERKVFQGRADSFIARMHLVQGDRRFSQWKGSVVDSVVGVFLTQNVSDHLSSS
Sbjct: 841  ENIKWWEEERKVFQGRADSFIARMHLVQGDRRFSQWKGSVVDSVVGVFLTQNVSDHLSSS 900

Query: 901  AFMSLAARFPPKPKCRQASCSQEPIIELDEPEEACVFNLEDSMKLNKQIIHQQISEEGSL 960
            AFMSLAARFPPKPKC QAS SQEPIIEL+EPEE C+FNLEDSMKLNKQIIHQQISEEGSL
Sbjct: 901  AFMSLAARFPPKPKCHQASSSQEPIIELNEPEEVCMFNLEDSMKLNKQIIHQQISEEGSL 960

Query: 961  MKDEMEKSEGRIIVDNNESSGSNAEDGSSNKEPEKKSFSSSHNILETCSNSVGEISLTET 1020
            MKDEMEKSEGRIIVDNNESSGSN EDGSSNKEPEKKSF SSHNILET SNSVGEISLTET
Sbjct: 961  MKDEMEKSEGRIIVDNNESSGSNVEDGSSNKEPEKKSFGSSHNILETFSNSVGEISLTET 1020

Query: 1021 SSMQACLSGEKETYDSFSFQDCLDSSIPQTSESIEPSSEGNSEDLPSWSTEAHIDSSSEE 1080
            SSMQAC SGEKETYDSFS QD LDSSIPQT+ES+EPSSEGNSEDLPSWS EAHIDSSSEE
Sbjct: 1021 SSMQACFSGEKETYDSFSSQDFLDSSIPQTNESMEPSSEGNSEDLPSWSAEAHIDSSSEE 1080

Query: 1081 LIQMTGPNTLNANFTTDTSVEQSENTTTNKLVEKKCDNRIDDTSQPDDPEISLKNSVYHL 1140
            LIQMTG NTLNANFT D SVE SENT TN LVE KCDNRIDDTSQPDDPEIS+KNSVYHL
Sbjct: 1081 LIQMTGLNTLNANFTIDISVEPSENTITNNLVENKCDNRIDDTSQPDDPEISIKNSVYHL 1140

Query: 1141 SDYQTQQNQTSKSLEVDCCQTSNGVQTSNDCQNKDEHFHTEQSTLTVEYDNHANVEMELI 1200
            S YQTQQNQTSKSL+VDCCQTSNGVQTSNDCQNKDE FHTEQSTLTVE DNHANVEMELI
Sbjct: 1141 SGYQTQQNQTSKSLDVDCCQTSNGVQTSNDCQNKDEQFHTEQSTLTVESDNHANVEMELI 1200

Query: 1201 VDIVEAPSSSSELSINAKEPGLTLQSQSSVIEDPQNVESPAECTNTVYEIPPNATEIATK 1260
            VDIVEAPSSSSELSINAK+PGLTLQSQSSVIEDPQNVESPAECTNTV+  PPNATEIATK
Sbjct: 1201 VDIVEAPSSSSELSINAKDPGLTLQSQSSVIEDPQNVESPAECTNTVHGSPPNATEIATK 1260

Query: 1261 PNPKECNLLSNEFKELKPASSRSQRKQVAKEKDNINWDNLRKQTETNGKTRQRTESTMDS 1320
            PNPKE NLLSNEFKELKPASSRSQ KQVAKEKD INWDNLRKQTETNGKTRQRTE+TMDS
Sbjct: 1261 PNPKEYNLLSNEFKELKPASSRSQSKQVAKEKDKINWDNLRKQTETNGKTRQRTENTMDS 1320

Query: 1321 LDWEAIRCADVNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEWLRDVEPDHAKE 1380
            LDWEA+RCADVNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEWLRDVEPD AKE
Sbjct: 1321 LDWEALRCADVNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEWLRDVEPDQAKE 1380

Query: 1381 YLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYP 1440
            YLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYP
Sbjct: 1381 YLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYP 1440

Query: 1441 VLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFA 1500
            VLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFA
Sbjct: 1441 VLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFA 1500

Query: 1501 SARLGLPAPEDKRIVSTTECREPDNNQARTIDQPMLSLPPSTISSEEIKPSESHQSDGKT 1560
            SARLGLPAPEDKRIVSTTECREPDNNQ RTIDQPMLSLPPSTISSEEIKPSESH+ DGKT
Sbjct: 1501 SARLGLPAPEDKRIVSTTECREPDNNQPRTIDQPMLSLPPSTISSEEIKPSESHECDGKT 1560

Query: 1561 TAGACVPIIEEPATPEQETTTQD-AIIDIEDGFYEDPDEIPTIKLNIEEFSQNLQNFVQK 1620
            TAGACVPIIEEPATPEQET TQD  IIDIED FYEDPDEIPTIKLNIEEFSQNLQN+VQK
Sbjct: 1561 TAGACVPIIEEPATPEQETATQDPRIIDIEDAFYEDPDEIPTIKLNIEEFSQNLQNYVQK 1620

Query: 1621 NMELQEGDMSKALIALTPEAASIPTPKLKNVSRLRTEHQVYELPDNHPLLEKLKLDRREP 1680
            NMELQEGDMSKALIALTPEAASIPTPKLKNVSRLRTEHQVYELPDNHPLLEKLKLDRREP
Sbjct: 1621 NMELQEGDMSKALIALTPEAASIPTPKLKNVSRLRTEHQVYELPDNHPLLEKLKLDRREP 1680

Query: 1681 DDPSSYLLAIWTPGETANSIELPEKRCSNQEHHQLCCEEECLSCNSVREANSFMVRGTLL 1740
            DDPSSYLLAIWTPGETANSI+LPEKRCSNQEHHQLCCEEECLSCNSVREANSFMVRGTLL
Sbjct: 1681 DDPSSYLLAIWTPGETANSIQLPEKRCSNQEHHQLCCEEECLSCNSVREANSFMVRGTLL 1740

Query: 1741 IPCRTAMRGSFPLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVYFGTSIPTIFK 1800
            IPCRTAMRGSFPLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVYFGTSIPTIFK
Sbjct: 1741 IPCRTAMRGSFPLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVYFGTSIPTIFK 1800

Query: 1801 GLSTQGIQHCFWRGFVCVRGFDQKTRAPRPLMARLHFPASKLNRGRGKTEDQ 1852
            GLSTQGIQHCFWRGFVCVRGFDQKTRAPRPLMARLHFPASKLNRGRGKTEDQ
Sbjct: 1801 GLSTQGIQHCFWRGFVCVRGFDQKTRAPRPLMARLHFPASKLNRGRGKTEDQ 1851

BLAST of PI0016811 vs. ExPASy TrEMBL
Match: A0A5D3DNK4 (Protein ROS1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G003460 PE=3 SV=1)

HSP 1 Score: 3349.3 bits (8683), Expect = 0.0e+00
Identity = 1697/1803 (94.12%), Positives = 1733/1803 (96.12%), Query Frame = 0

Query: 1    MDSGQPEGNKADVQGGSWIPATPMKPILPKPPLQPLIYARMDRNQPRPYWLGSERLFSNS 60
            MDSGQPEGNKADVQGGSWIPATPMKPILPKPPLQPLIYARMDRNQPRPYWLGSERLFSNS
Sbjct: 1    MDSGQPEGNKADVQGGSWIPATPMKPILPKPPLQPLIYARMDRNQPRPYWLGSERLFSNS 60

Query: 61   NKEAETSSGVACYGGANFMTANGSNDWEAAQARQFQVARNDNGTVTIHSMDALGGIPFLQ 120
            NKEAETSSGVACYGGAN MTANGSN WEAAQARQFQVARNDNGTVTIHSMDALGGIPFLQ
Sbjct: 61   NKEAETSSGVACYGGANSMTANGSNYWEAAQARQFQVARNDNGTVTIHSMDALGGIPFLQ 120

Query: 121  LMALADAASIVGADAALGGNASDLFDSGSSYQIELESSSMKDRLSGSCIPEAKEYGTSDH 180
            LMALADAASIVGADAALGGNASDLFDSGSSYQIELESSSMKDR  GSC+PEAKEY  S+H
Sbjct: 121  LMALADAASIVGADAALGGNASDLFDSGSSYQIELESSSMKDRFGGSCVPEAKEYVPSEH 180

Query: 181  GSQHAYDLNFPSGTESDAAGIRVTSQFAPLTPDMGKCKYTERGMELQQIPIENSQDERER 240
            GSQ+A++LNFPS  ESDAAGIRVTSQFAPLTPDMGK KY ERG ELQQI IENSQDERE+
Sbjct: 181  GSQYAHELNFPSRIESDAAGIRVTSQFAPLTPDMGKSKYIERGTELQQILIENSQDEREQ 240

Query: 241  NHNCNTSITVDGENLRQNQELLEPAMHSTINCTPDGKEGKNDGDLNKTPASRQRRRKHRP 300
            NHNCNTSITV GEN+ QNQ+LLEPAMHSTINCTPDGKEGKNDGDLNKTPASRQRRRKHRP
Sbjct: 241  NHNCNTSITV-GENVTQNQKLLEPAMHSTINCTPDGKEGKNDGDLNKTPASRQRRRKHRP 300

Query: 301  KVIVEGKTNRTKQILKTPSSNQSVRKRVRKSGLTKPSATPPIEVTGETSEQEMVKHRRKS 360
            KVIVEGKTNRTKQ LKTPSSN S RKRVRKSGL KPSATP IEVTGETSEQEMVKHRRKS
Sbjct: 301  KVIVEGKTNRTKQNLKTPSSNPSARKRVRKSGLAKPSATPSIEVTGETSEQEMVKHRRKS 360

Query: 361  CRRAINFDSQAQTRDGSLDSGPLEQGSLTQNIQSTTGLEEARLEEVGSSTDPNWSMNQMP 420
            CRRAI FDSQAQTRDGSLDSGPLEQGSLTQN QSTTGLE  RLEEVGSSTDPNWSMNQM 
Sbjct: 361  CRRAITFDSQAQTRDGSLDSGPLEQGSLTQNSQSTTGLEVVRLEEVGSSTDPNWSMNQML 420

Query: 421  KKYESLSEKQAPPTKLSAENNSSERKQPSKSQMENNIEQNGKIISNSDKENTVETILNDD 480
            KKYESLSEK+APPT+LSAENNSSE+KQPSKSQMENN EQNGK+ISNSDKENTVE I NDD
Sbjct: 421  KKYESLSEKEAPPTELSAENNSSEQKQPSKSQMENNTEQNGKVISNSDKENTVEAIPNDD 480

Query: 481  NHSLPGNSHGLIFCKNPPLTSIEQATCCLRKRPRAIKQAHTGSINLTGAHYNTLSAYQSM 540
            NHSLPGNSHGLIFCKNPPLTSIEQATCCLRKRPRAIKQAHTGSINLTG HYNTLSAYQSM
Sbjct: 481  NHSLPGNSHGLIFCKNPPLTSIEQATCCLRKRPRAIKQAHTGSINLTGVHYNTLSAYQSM 540

Query: 541  SWMHFPHIYKKKRTEKGQNPVPSSAFTTATNFIRPESACSFNDPQRDHMVSKFNAWIAGP 600
            SWMHFPHIYKKKRTEKGQNPVPSSAFTTATNF RPESACSFNDPQRDHMVSKFNAWI+GP
Sbjct: 541  SWMHFPHIYKKKRTEKGQNPVPSSAFTTATNFTRPESACSFNDPQRDHMVSKFNAWISGP 600

Query: 601  QFNICKSKSVAGHGGNDLQDKLQTYGGIVGLDQTGRTKKKPRTAKRLSGLAPPERITHWE 660
            QFN+CKSK+VAGHGGNDLQDKLQTYGGIVGL QTGRTKKKPRTAKR+S LAPPERI+HWE
Sbjct: 601  QFNVCKSKTVAGHGGNDLQDKLQTYGGIVGLGQTGRTKKKPRTAKRVSSLAPPERISHWE 660

Query: 661  KQPIYPTNHPPPAGSAKNINTSGTCINGLFEMMHATVAKKKRTKKKPSNSALLNINKDLQ 720
            KQP YPTNHPPPAGSAKNINTSGTC+NGLFEMMHATVAKKKRTKKKPSNS LLNINKDL+
Sbjct: 661  KQPTYPTNHPPPAGSAKNINTSGTCVNGLFEMMHATVAKKKRTKKKPSNSTLLNINKDLE 720

Query: 721  DRRFVSFNPWQFFPKTLGTASEHGNQICFIDLLVEQLKHLDINKESNNLGYREQALVPYN 780
            DRRFVSF+PWQFFPKTLGTASEHGNQICFIDL+ EQLKHLDINKESNNLGYREQALVP+N
Sbjct: 721  DRRFVSFSPWQFFPKTLGTASEHGNQICFIDLIAEQLKHLDINKESNNLGYREQALVPFN 780

Query: 781  MQNQEHSAIVVYGRDGTIVPFNPIKKRRPRPKVELDEETGRVWKLLMGNINSKGIDGTDE 840
            MQNQE +AIVVYGRDGTIVPFNPIKKRRPRPKVELDEETGRVWKLLMGNINS GIDGTDE
Sbjct: 781  MQNQELNAIVVYGRDGTIVPFNPIKKRRPRPKVELDEETGRVWKLLMGNINSTGIDGTDE 840

Query: 841  EKIKWWEEERKVFQGRADSFIARMHLVQGDRRFSQWKGSVVDSVVGVFLTQNVSDHLSSS 900
            E IKWWEEERKVFQGRADSFIARMHLVQGDRRFSQWKGSVVDSVVGVFLTQNVSDHLSSS
Sbjct: 841  ENIKWWEEERKVFQGRADSFIARMHLVQGDRRFSQWKGSVVDSVVGVFLTQNVSDHLSSS 900

Query: 901  AFMSLAARFPPKPKCRQASCSQEPIIELDEPEEACVFNLEDSMKLNKQIIHQQISEEGSL 960
            AFMSLAARFPPKPKC QAS SQEPIIEL+EPEE C+FNLEDSMKLNKQIIHQQISEEGSL
Sbjct: 901  AFMSLAARFPPKPKCHQASSSQEPIIELNEPEEVCMFNLEDSMKLNKQIIHQQISEEGSL 960

Query: 961  MKDEMEKSEGRIIVDNNESSGSNAEDGSSNKEPEKKSFSSSHNILETCSNSVGEISLTET 1020
            MKDEMEKSEGRIIVDNNESSGSN EDGSSNKEPEKKSF SSHNILET SNSVGEISLTET
Sbjct: 961  MKDEMEKSEGRIIVDNNESSGSNVEDGSSNKEPEKKSFGSSHNILETFSNSVGEISLTET 1020

Query: 1021 SSMQACLSGEKETYDSFSFQDCLDSSIPQTSESIEPSSEGNSEDLPSWSTEAHIDSSSEE 1080
            SSMQAC SGEKETYDSFS QD LDSSIPQT+ES+EPSSEGNSEDLPSWS EAHIDSSSEE
Sbjct: 1021 SSMQACFSGEKETYDSFSSQDFLDSSIPQTNESMEPSSEGNSEDLPSWSAEAHIDSSSEE 1080

Query: 1081 LIQMTGPNTLNANFTTDTSVEQSENTTTNKLVEKKCDNRIDDTSQPDDPEISLKNSVYHL 1140
            LIQMTG NTLNANFT D SVE SENT TN LVE KCDNRIDDTSQPDDPEIS+KNSVYHL
Sbjct: 1081 LIQMTGLNTLNANFTIDISVEPSENTITNNLVENKCDNRIDDTSQPDDPEISIKNSVYHL 1140

Query: 1141 SDYQTQQNQTSKSLEVDCCQTSNGVQTSNDCQNKDEHFHTEQSTLTVEYDNHANVEMELI 1200
            S YQTQQNQTSKSL+VDCCQTSNGVQTSNDCQNKDE FHTEQSTLTVE DNHANVEMELI
Sbjct: 1141 SGYQTQQNQTSKSLDVDCCQTSNGVQTSNDCQNKDEQFHTEQSTLTVESDNHANVEMELI 1200

Query: 1201 VDIVEAPSSSSELSINAKEPGLTLQSQSSVIEDPQNVESPAECTNTVYEIPPNATEIATK 1260
            VDIVEAPSSSSELSINAK+PGLTLQSQSSVIEDPQNVESPAECTNTV+  PPNATEIATK
Sbjct: 1201 VDIVEAPSSSSELSINAKDPGLTLQSQSSVIEDPQNVESPAECTNTVHGSPPNATEIATK 1260

Query: 1261 PNPKECNLLSNEFKELKPASSRSQRKQVAKEKDNINWDNLRKQTETNGKTRQRTESTMDS 1320
            PNPKE NLLSNEFKELKPASSRSQ KQVAKEKD INWDNLRKQTETNGKTRQRTE+TMDS
Sbjct: 1261 PNPKEYNLLSNEFKELKPASSRSQSKQVAKEKDKINWDNLRKQTETNGKTRQRTENTMDS 1320

Query: 1321 LDWEAIRCADVNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEWLRDVEPDHAKE 1380
            LDWEA+RCADVNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEWLRDVEPD AKE
Sbjct: 1321 LDWEALRCADVNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEWLRDVEPDQAKE 1380

Query: 1381 YLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYP 1440
            YLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYP
Sbjct: 1381 YLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYP 1440

Query: 1441 VLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFA 1500
            VLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFA
Sbjct: 1441 VLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFA 1500

Query: 1501 SARLGLPAPEDKRIVSTTECREPDNNQARTIDQPMLSLPPSTISSEEIKPSESHQSDGKT 1560
            SARLGLPAPEDKRIVSTTECREPDNNQ RTIDQPMLSLPPSTISSEEIKPSESH+ DGKT
Sbjct: 1501 SARLGLPAPEDKRIVSTTECREPDNNQPRTIDQPMLSLPPSTISSEEIKPSESHECDGKT 1560

Query: 1561 TAGACVPIIEEPATPEQETTTQD-AIIDIEDGFYEDPDEIPTIKLNIEEFSQNLQNFVQK 1620
            TAGACVPIIEEPATPEQET TQD  IIDIED FYEDPDEIPTIKLNIEEFSQNLQN+VQK
Sbjct: 1561 TAGACVPIIEEPATPEQETATQDPRIIDIEDAFYEDPDEIPTIKLNIEEFSQNLQNYVQK 1620

Query: 1621 NMELQEGDMSKALIALTPEAASIPTPKLKNVSRLRTEHQVYELPDNHPLLEKLKLDRREP 1680
            NMELQEGDMSKALIALTPEAASIPTPKLKNVSRLRTEHQVYELPDNHPLLEKLKLDRREP
Sbjct: 1621 NMELQEGDMSKALIALTPEAASIPTPKLKNVSRLRTEHQVYELPDNHPLLEKLKLDRREP 1680

Query: 1681 DDPSSYLLAIWTPGETANSIELPEKRCSNQEHHQLCCEEECLSCNSVREANSFMVRGTLL 1740
            DDPSSYLLAIWTPGETANSI+LPEKRCSNQEHHQLCCEEECLSCNSVREANSFMVRGTLL
Sbjct: 1681 DDPSSYLLAIWTPGETANSIQLPEKRCSNQEHHQLCCEEECLSCNSVREANSFMVRGTLL 1740

Query: 1741 IPCRTAMRGSFPLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVYFGTSIPTIFK 1800
            IPCRTAMRGSFPLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVYFGTSIPTIFK
Sbjct: 1741 IPCRTAMRGSFPLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVYFGTSIPTIFK 1800

Query: 1801 GLS 1803
            G S
Sbjct: 1801 GSS 1802
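
The percentages on each HSP statistics line are the matched column count divided by the alignment length. A minimal sketch of reading such a line and recomputing those values is shown below; the function name `parse_hsp_stats` and the regular expression are illustrative assumptions, not part of any BLAST tooling.

```python
import re

# Hypothetical pattern for the "Identity = a/b (x%), Positives = c/d (y%)" line.
# "Posi?tives" also tolerates a missing "i" in the label.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages from one HSP statistics line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    identical, length, _, positive, pos_length, _ = match.groups()
    identical, length = int(identical), int(length)
    positive, pos_length = int(positive), int(pos_length)
    return {
        "identical": identical,
        "positive": positive,
        "alignment_length": length,
        # Percentages are recomputed from the counts, not copied from the text.
        "identity_pct": round(100.0 * identical / length, 2),
        "positive_pct": round(100.0 * positive / pos_length, 2),
    }


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 1697/1803 (94.12%), Positives = 1733/1803 (96.12%), Query Frame = 0"
    )
    print(stats)
```

Recomputing from the counts rather than trusting the printed percentage makes it easy to compare hits whose alignment lengths differ, such as 1803 versus 1804 columns for the two Cucumis melo var. makuwa entries.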

BLAST of PI0016811 vs. ExPASy TrEMBL
Match: A0A5A7TKC2 (Protein ROS1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold236G002630 PE=3 SV=1)

HSP 1 Score: 3344.7 bits (8671), Expect = 0.0e+00
Identity = 1697/1804 (94.07%), Positives = 1733/1804 (96.06%), Query Frame = 0

Query: 1    MDSGQPEGNKADVQGGSWIPATPMKPILPKPPLQPLIYARMDRNQPRPYWLGSERLFSNS 60
            MDSGQPEGNKADVQGGSWIPATPMKPILPKPPLQPLIYARMDRNQPRPYWLGSERLFSNS
Sbjct: 1    MDSGQPEGNKADVQGGSWIPATPMKPILPKPPLQPLIYARMDRNQPRPYWLGSERLFSNS 60

Query: 61   NKEAETSSGVACYGGANFMTANGSNDWEAAQARQFQVARNDNGTVTIHSMDALGGIPFLQ 120
            NKEAETSSGVACYGGAN MTANGSN WEAAQARQFQVARNDNGTVTIHSMDALGGIPFLQ
Sbjct: 61   NKEAETSSGVACYGGANSMTANGSNYWEAAQARQFQVARNDNGTVTIHSMDALGGIPFLQ 120

Query: 121  LMALADAASIVGADAALGGNASDLFDSGSSYQIELESSSMKDRLSGSCIPEAKEYGTSDH 180
            LMALADAASIVGADAALGGNASDLFDSGSSYQIELESSSMKDR  GSC+PEAKEY  S+H
Sbjct: 121  LMALADAASIVGADAALGGNASDLFDSGSSYQIELESSSMKDRFGGSCVPEAKEYVPSEH 180

Query: 181  GSQHAYDLNFPSGTESDAAGIRVTSQFAPLTPDMGKCKYTERGMELQQIPIENSQDERER 240
            GSQ+A++LNFPS  ESDAAGIRVTSQFAPLTPDMGK KY ERG ELQQI IENSQDERE+
Sbjct: 181  GSQYAHELNFPSRIESDAAGIRVTSQFAPLTPDMGKSKYIERGTELQQILIENSQDEREQ 240

Query: 241  NHNCNTSITVDGENLRQNQELLEPAMHSTINCTPDGKEGKNDGDLNKTPASRQRRRKHRP 300
            NHNCNTSITV GEN+ QNQ+LLEPAMHSTINCTPDGKEGKNDGDLNKTPASRQRRRKHRP
Sbjct: 241  NHNCNTSITV-GENVTQNQKLLEPAMHSTINCTPDGKEGKNDGDLNKTPASRQRRRKHRP 300

Query: 301  KVIVEGKTNRTKQILKTPSSNQSVRKRVRKSGLTKPSATPPIEVTGETSEQEMVKHRRKS 360
            KVIVEGKTNRTKQ LKTPSSN S RKRVRKSGL KPSATP IEVTGETSEQEMVKHRRKS
Sbjct: 301  KVIVEGKTNRTKQNLKTPSSNPSARKRVRKSGLAKPSATPSIEVTGETSEQEMVKHRRKS 360

Query: 361  CRRAINFDSQAQTRDGSLDSGPLEQGSLTQNIQSTTGLEEARLEEVGSSTDPNWSMNQMP 420
            CRRAI FDSQAQTRDGSLDSGPLEQGSLTQN QSTTGLE  RLEEVGSSTDPNWSMNQM 
Sbjct: 361  CRRAITFDSQAQTRDGSLDSGPLEQGSLTQNSQSTTGLEVVRLEEVGSSTDPNWSMNQML 420

Query: 421  KKYESLSEKQAPPTKLSAENNSSERKQPSKSQMENNIEQNGKIISNSDKENTVETILNDD 480
            KKYESLSEK+APPT+LSAENNSSE+KQPSKSQMENN EQNGK+ISNSDKENTVE I NDD
Sbjct: 421  KKYESLSEKEAPPTELSAENNSSEQKQPSKSQMENNTEQNGKVISNSDKENTVEAIPNDD 480

Query: 481  NHSLPGNSHGLIFCKNPPLTSIEQATCCLRKRPRAIKQAHTGSINLTGAHYNTLSAYQSM 540
            NHSLPGNSHGLIFCKNPPLTSIEQATCCLRKRPRAIKQAHTGSINLTG HYNTLSAYQSM
Sbjct: 481  NHSLPGNSHGLIFCKNPPLTSIEQATCCLRKRPRAIKQAHTGSINLTGVHYNTLSAYQSM 540

Query: 541  SWMHFPHIYKKKRTEKGQNPVPSSAFTTATNFIRPESACSFNDPQRDHMVSKFNAWIAGP 600
            SWMHFPHIYKKKRTEKGQNPVPSSAFTTATNF RPESACSFNDPQRDHMVSKFNAWI+GP
Sbjct: 541  SWMHFPHIYKKKRTEKGQNPVPSSAFTTATNFTRPESACSFNDPQRDHMVSKFNAWISGP 600

Query: 601  QFNICKSKSVAGHGGNDLQDKLQTYGGIVGLDQTGRTKKKPRTAKRLSGLAPPERITHWE 660
            QFN+CKSK+VAGHGGNDLQDKLQTYGGIVGL QTGRTKKKPRTAKR+S LAPPERI+HWE
Sbjct: 601  QFNVCKSKTVAGHGGNDLQDKLQTYGGIVGLGQTGRTKKKPRTAKRVSSLAPPERISHWE 660

Query: 661  KQPIYPTNHPPPAGSAKNINTSGTCINGLFEMMHATVAKKKRTKKKPSNSALLNINKDLQ 720
            KQP YPTNHPPPAGSAKNINTSGTC+NGLFEMMHATVAKKKRTKKKPSNS LLNINKDL+
Sbjct: 661  KQPTYPTNHPPPAGSAKNINTSGTCVNGLFEMMHATVAKKKRTKKKPSNSTLLNINKDLE 720

Query: 721  DRRFVSFNPWQFFPKTLGTASEHGNQICFIDLLVEQLKHLDINKESNNLGYREQALVPYN 780
            DRRFVSF+PWQFFPKTLGTASEHGNQICFIDL+ EQLKHLDINKESNNLGYREQALVP+N
Sbjct: 721  DRRFVSFSPWQFFPKTLGTASEHGNQICFIDLIAEQLKHLDINKESNNLGYREQALVPFN 780

Query: 781  MQNQEHSAIVVYGRDGTIVPFNPIKKRRPRPKVELDEETGRVWKLLMGNINSKGIDGTDE 840
            MQNQE +AIVVYGRDGTIVPFNPIKKRRPRPKVELDEETGRVWKLLMGNINS GIDGTDE
Sbjct: 781  MQNQELNAIVVYGRDGTIVPFNPIKKRRPRPKVELDEETGRVWKLLMGNINSTGIDGTDE 840

Query: 841  EKIKWWEEERKVFQGRADSFIARMHLVQGDRRFSQWKGSVVDSVVGVFLTQNVSDHLSSS 900
            E IKWWEEERKVFQGRADSFIARMHLVQGDRRFSQWKGSVVDSVVGVFLTQNVSDHLSSS
Sbjct: 841  ENIKWWEEERKVFQGRADSFIARMHLVQGDRRFSQWKGSVVDSVVGVFLTQNVSDHLSSS 900

Query: 901  AFMSLAARFPPKPKCRQASCSQEPIIELDEPEEACVFNLEDSMKLNKQIIHQQISEEGSL 960
            AFMSLAARFPPKPKC QAS SQEPIIEL+EPEE C+FNLEDSMKLNKQIIHQQISEEGSL
Sbjct: 901  AFMSLAARFPPKPKCHQASSSQEPIIELNEPEEVCMFNLEDSMKLNKQIIHQQISEEGSL 960

Query: 961  MKDEMEKSEGRIIVDNNESSGSNAEDGSSNKEPEKKSFSSSHNILETCSNSVGEISLTET 1020
            MKDEMEKSEGRIIVDNNESSGSN EDGSSNKEPEKKSF SSHNILET SNSVGEISLTET
Sbjct: 961  MKDEMEKSEGRIIVDNNESSGSNVEDGSSNKEPEKKSFGSSHNILETFSNSVGEISLTET 1020

Query: 1021 SSMQACLSGEKETYDSFSFQDCLDSSIPQTSESIEPSSEGNSEDLPSWSTEAHIDSSSEE 1080
            SSMQAC SGEKETYDSFS QD LDSSIPQT+ES+EPSSEGNSEDLPSWS EAHIDSSSEE
Sbjct: 1021 SSMQACFSGEKETYDSFSSQDFLDSSIPQTNESMEPSSEGNSEDLPSWSAEAHIDSSSEE 1080

Query: 1081 LIQMTGPNTLNANFTTDTSVEQSENTTTNKLVEKKCDNRIDDTSQPDDPEISLKNSVYHL 1140
            LIQMTG NTLNANFT D SVE SENT TN LVE KCDNRIDDTSQPDDPEIS+KNSVYHL
Sbjct: 1081 LIQMTGLNTLNANFTIDISVEPSENTITNNLVENKCDNRIDDTSQPDDPEISIKNSVYHL 1140

Query: 1141 SDYQTQQNQTSKSLEVDCCQTSNGVQTSNDCQNKDEHFHTEQSTLTVEYDNHANVEMELI 1200
            S YQTQQNQTSKSL+VDCCQTSNGVQTSNDCQNKDE FHTEQSTLTVE DNHANVEMELI
Sbjct: 1141 SGYQTQQNQTSKSLDVDCCQTSNGVQTSNDCQNKDEQFHTEQSTLTVESDNHANVEMELI 1200

Query: 1201 VDIVEAPSSSSELSINAKEPGLTLQSQSSVIEDPQNVESPAECTNTVYEIPPNATEIATK 1260
            VDIVEAPSSSSELSINAK+PGLTLQSQSSVIEDPQNVESPAECTNTV+  PPNATEIATK
Sbjct: 1201 VDIVEAPSSSSELSINAKDPGLTLQSQSSVIEDPQNVESPAECTNTVHGSPPNATEIATK 1260

Query: 1261 PNPKECNLLSNEFKELKPASSRSQRKQVAKEKDNINWDNLRKQTETNGKTRQRTESTMDS 1320
            PNPKE NLLSNEFKELKPASSRSQ KQVAKEKD INWDNLRKQTETNGKTRQRTE+TMDS
Sbjct: 1261 PNPKEYNLLSNEFKELKPASSRSQSKQVAKEKDKINWDNLRKQTETNGKTRQRTENTMDS 1320

Query: 1321 LDWEAIRCADVNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEWLRDVEPDHAKE 1380
            LDWEA+RCADVNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEWLRDVEPD AKE
Sbjct: 1321 LDWEALRCADVNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEWLRDVEPDQAKE 1380

Query: 1381 YLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYP 1440
            YLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYP
Sbjct: 1381 YLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYP 1440

Query: 1441 VLESIQKYLWPRLCKLDQRTLYELHYQMITFGK-VFCTKSKPNCNACPMRGECRHFASAF 1500
            VLESIQKYLWPRLCKLDQRTLYELHYQMITFGK VFCTKSKPNCNACPMRGECRHFASAF
Sbjct: 1441 VLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVVFCTKSKPNCNACPMRGECRHFASAF 1500

Query: 1501 ASARLGLPAPEDKRIVSTTECREPDNNQARTIDQPMLSLPPSTISSEEIKPSESHQSDGK 1560
            ASARLGLPAPEDKRIVSTTECREPDNNQ RTIDQPMLSLPPSTISSEEIKPSESH+ DGK
Sbjct: 1501 ASARLGLPAPEDKRIVSTTECREPDNNQPRTIDQPMLSLPPSTISSEEIKPSESHECDGK 1560

Query: 1561 TTAGACVPIIEEPATPEQETTTQD-AIIDIEDGFYEDPDEIPTIKLNIEEFSQNLQNFVQ 1620
            TTAGACVPIIEEPATPEQET TQD  IIDIED FYEDPDEIPTIKLNIEEFSQNLQN+VQ
Sbjct: 1561 TTAGACVPIIEEPATPEQETATQDPRIIDIEDAFYEDPDEIPTIKLNIEEFSQNLQNYVQ 1620

Query: 1621 KNMELQEGDMSKALIALTPEAASIPTPKLKNVSRLRTEHQVYELPDNHPLLEKLKLDRRE 1680
            KNMELQEGDMSKALIALTPEAASIPTPKLKNVSRLRTEHQVYELPDNHPLLEKLKLDRRE
Sbjct: 1621 KNMELQEGDMSKALIALTPEAASIPTPKLKNVSRLRTEHQVYELPDNHPLLEKLKLDRRE 1680

Query: 1681 PDDPSSYLLAIWTPGETANSIELPEKRCSNQEHHQLCCEEECLSCNSVREANSFMVRGTL 1740
            PDDPSSYLLAIWTPGETANSI+LPEKRCSNQEHHQLCCEEECLSCNSVREANSFMVRGTL
Sbjct: 1681 PDDPSSYLLAIWTPGETANSIQLPEKRCSNQEHHQLCCEEECLSCNSVREANSFMVRGTL 1740

Query: 1741 LIPCRTAMRGSFPLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVYFGTSIPTIF 1800
            LIPCRTAMRGSFPLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVYFGTSIPTIF
Sbjct: 1741 LIPCRTAMRGSFPLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVYFGTSIPTIF 1800

Query: 1801 KGLS 1803
            KG S
Sbjct: 1801 KGSS 1803

BLAST of PI0016811 vs. ExPASy TrEMBL
Match: A0A6J1F2E4 (protein ROS1-like OS=Cucurbita moschata OX=3662 GN=LOC111441568 PE=3 SV=1)

HSP 1 Score: 2901.7 bits (7521), Expect = 0.0e+00
Identity = 1519/1862 (81.58%), Positives = 1624/1862 (87.22%), Query Frame = 0

Query: 1    MDSGQPEGNKADVQGGSWIPATPMKPILPKPPLQPLIYARMDRNQPRPYWLGSERLFSNS 60
            MDS QPEGNKA VQGGSWIPATP+KPILPKPPLQPLIYARMD NQ RP WLGSERL SNS
Sbjct: 1    MDSSQPEGNKAHVQGGSWIPATPVKPILPKPPLQPLIYARMDWNQSRPCWLGSERLSSNS 60

Query: 61   NKEAETSSGVACYGGANFMTANGSNDWEAAQARQFQVARNDNGTVTIHSMDALGGIPFLQ 120
            NKEAET+SGVACYGG     ANG+NDWEAAQA QFQVA  DNGTV IHS+DALG IPFLQ
Sbjct: 61   NKEAETTSGVACYGG-----ANGTNDWEAAQAGQFQVACKDNGTVAIHSIDALGSIPFLQ 120

Query: 121  LMALADAASIVGADAALGGNASDLFDSGSSYQIELESSSMKDRLSGSCIPEAKEYGTSDH 180
            LMALADAASIVGADAALGGNASDLFDSGSSYQ+ELESSSM+ RLSG CIPEA  Y  SDH
Sbjct: 121  LMALADAASIVGADAALGGNASDLFDSGSSYQVELESSSMRGRLSGGCIPEATGYEMSDH 180

Query: 181  GSQHAYDLNFPSGTESDAAGIRVTSQFAPLTPDMGKCKYTERGMELQQIPIENSQDERER 240
             SQHAYDLNFPSGTESDAA IR+TSQFAP TPDMGK KYTE   E+QQIP ENS+DERE+
Sbjct: 181  -SQHAYDLNFPSGTESDAAAIRITSQFAPPTPDMGKSKYTESEAEVQQIPTENSRDEREQ 240

Query: 241  NHNCNTSITVDGENLRQNQELLEPAMHSTI--NCTPDGKEGKNDGDLNKTPASRQRRRKH 300
            NHNCNTSIT+DGENL +N+E LEPAM  TI   CTPDGKEGKN  +LNKTP  RQ+RRKH
Sbjct: 241  NHNCNTSITIDGENLGENKE-LEPAMQPTITATCTPDGKEGKNADNLNKTPPPRQKRRKH 300

Query: 301  RPKVIVEGKTNRTKQILKTPSSNQSVRKRVRKSGLTKPSATPPIEVTGETSEQEMVKHRR 360
            RPKVI+EGK NR    LK  S   S RKRVRKSGL+KPSATPPIE+ GETS QEM+KH R
Sbjct: 301  RPKVIIEGKNNRKNPNLK--SHCPSTRKRVRKSGLSKPSATPPIEIIGETSNQEMLKHSR 360

Query: 361  KSCRRAINFDSQAQTRDGSLDSGPLEQGSLTQNIQSTTGLEEARLEEVGSSTDPNWSMNQ 420
            KSCRRAINFDSQAQTRD   DS  LE+  L QNIQST+G  E RLEEVGSSTDPNWSMNQ
Sbjct: 361  KSCRRAINFDSQAQTRDLYFDSRQLEKDPLPQNIQSTSGQMEVRLEEVGSSTDPNWSMNQ 420

Query: 421  MPKKYESLSEKQAPPTKLSAENNSSERKQPSKSQMENNIEQNGKIISNSDKENTVETILN 480
            M K YESL EKQA   ++SAE+NS ER+ PS +QMENN EQNGK+IS+ +K NTVET+LN
Sbjct: 421  MLKSYESLPEKQAQSAEISAEHNSPERRLPSNNQMENNTEQNGKVISSFEKGNTVETMLN 480

Query: 481  DDNHSLPGNSHGLIFCKNPPLTSIEQATCCLRKRPRAIKQAHTGSINLTGAHYNTLSAYQ 540
            D+N SLPG S+GLIFCKN   T+ EQA+C LRKR +AI QA  GSINLTG HYNTLSAYQ
Sbjct: 481  DNNRSLPGGSNGLIFCKNSAFTAREQASCGLRKRSQAIDQAGAGSINLTGVHYNTLSAYQ 540

Query: 541  SMSWMHFPHIYKKKRTEKGQNPVPSSAFT--TATNFIRPESACSFNDPQRDHMVSKFNAW 600
            S+SWMHFP IYKKKRTEK QNPV S+AFT  +AT+F+ PESACSFND QR+HM    N+W
Sbjct: 541  SISWMHFPTIYKKKRTEKRQNPVSSTAFTSASATHFMSPESACSFNDSQRNHMALVSNSW 600

Query: 601  IAGPQFNICKSKSVAGHGGNDLQDKLQTYGGIVGLDQTGRTKKKPRTAKRLSGLAPPERI 660
            IAGPQF+ CKSK  A HG  +LQDKLQTYG I+ L QT R K++PR+ KRL  LA P RI
Sbjct: 601  IAGPQFSTCKSKIAAVHGRQNLQDKLQTYGSIMALGQTERKKRRPRSTKRLRDLALPARI 660

Query: 661  THWEKQPIYPTNHPPPAGSAKNINTSGTCINGLFEMMHATVAKKKRTKK-KPSNSALLNI 720
               EKQPIYPTN P    S KNINTS TCI+ L E M ATVAKKKRTKK  P+ S L N+
Sbjct: 661  VDCEKQPIYPTNQPLVDSSVKNINTSQTCIHALSETMEATVAKKKRTKKNSPTISTLHNM 720

Query: 721  NKDLQDRRFVSFNPWQFFPKTLGTASEHGNQICFIDLLVEQLKHLDINKESNNLGYREQA 780
            NKDLQDRRFVSFNP+QFFPKTLGTASEHGNQ+CFID +VEQLKHLDINKESNNL  RE+A
Sbjct: 721  NKDLQDRRFVSFNPYQFFPKTLGTASEHGNQMCFIDAIVEQLKHLDINKESNNLECRERA 780

Query: 781  LVPYNMQNQEHSAIVVYGRDGTIVPFNPIKKRRPRPKVELDEETGRVWKLLMGNINSKGI 840
            LVPYNMQNQEH+AIVVYGR GTIVPFN  KKR PRPKVELDEET RVWKLLMGNINS+GI
Sbjct: 781  LVPYNMQNQEHNAIVVYGRKGTIVPFNLTKKRYPRPKVELDEETSRVWKLLMGNINSEGI 840

Query: 841  DGTDEEKIKWWEEERKVFQGRADSFIARMHLVQGDRRFSQWKGSVVDSVVGVFLTQNVSD 900
            DGTDEEKIKWWEEERKVF+GRA+SFIARMHLVQGDRRFSQWKGSVVDSVVGVFLTQNVSD
Sbjct: 841  DGTDEEKIKWWEEERKVFRGRAESFIARMHLVQGDRRFSQWKGSVVDSVVGVFLTQNVSD 900

Query: 901  HLSSSAFMSLAARFPPKPKCRQASCSQEPIIELDEPEEACVFNLEDSMKLNKQIIHQQIS 960
            HLSSSAFMSLAARFPPKP C+QASC Q PIIELDEP EA + +LED MKLNKQI+ QQIS
Sbjct: 901  HLSSSAFMSLAARFPPKPNCQQASCYQHPIIELDEP-EAYMLSLEDDMKLNKQIMQQQIS 960

Query: 961  EEGSLMKDEMEKSEGRIIVDNNESSGSNAEDGSSNKEPEKKSFSSSHNILETCSNSVGEI 1020
            EEGSLMK+E+E SEG+IIVD+NESSGSN EDGSSNKEPEK SFSSSHN++ TCSNS  EI
Sbjct: 961  EEGSLMKNEIENSEGQIIVDSNESSGSNVEDGSSNKEPEKISFSSSHNVVGTCSNSEREI 1020

Query: 1021 SLTETSSMQACLSGEKETYDSFSFQDCLDSSIPQTSESIEPSSEGNSEDLPSWSTEAHID 1080
            SL+ T  MQACLSG +E YDSFSFQDCLDSSI QTSE+IEPSSEGNSE LPSW  E HI+
Sbjct: 1021 SLSGTGPMQACLSGAREIYDSFSFQDCLDSSISQTSENIEPSSEGNSEGLPSWLKEVHIN 1080

Query: 1081 SSSEELIQMTGPNTLNANFTTDTSVEQSENTTTNKLVEKKCDNRIDDTSQPDDPEISLKN 1140
            SSSE+L QM G NTLN + T DTS+EQ+E  T   L  KKCDN IDDTSQPDD E ++K+
Sbjct: 1081 SSSEKLNQMAGLNTLNDHVTIDTSIEQTEVHTNINLAGKKCDNGIDDTSQPDDHEKAMKD 1140

Query: 1141 SVYHLSDYQTQQNQTSKSLEVDCCQTSNGVQTSNDCQNKDEHFHTEQSTLTVEYDNHANV 1200
            SV HL+  Q QQN TS+SLEVDC QT NGVQT N   +KD  FH+E+STLTVE  NHANV
Sbjct: 1141 SVNHLNGNQMQQNHTSESLEVDCHQTCNGVQTPN-VYHKDVDFHSEKSTLTVESRNHANV 1200

Query: 1201 EMELIVDIVEAPSSSSELSINAKEPGLTLQSQSSVIEDPQNVESPAECTNTVYEI----P 1260
            E+ELIVDI EAP  S ELSINAKEPGLTLQ Q SVIED QN ESPAECTN V+EI     
Sbjct: 1201 EIELIVDIHEAPLPSRELSINAKEPGLTLQPQGSVIEDAQNAESPAECTNNVHEILPKFS 1260

Query: 1261 PNATEIATKPNPKEC-NLLSNEFKELKPASSRSQRKQVAKEKD-NINWDNLRKQTETNGK 1320
            PN T I T+ NPKE  + LSN F+E+KPA+SRSQRKQVAKEK+ NINWDNLRKQ ETNGK
Sbjct: 1261 PNGTGIVTQSNPKEYDHSLSNGFEEMKPATSRSQRKQVAKEKEGNINWDNLRKQVETNGK 1320

Query: 1321 TRQRTESTMDSLDWEAIRCADVNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEW 1380
            TRQR+E+TMDSLDWEA+RCADVNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEW
Sbjct: 1321 TRQRSENTMDSLDWEAVRCADVNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEW 1380

Query: 1381 LRDVEPDHAKEYLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPE 1440
            LRDV PD AKEYLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPE
Sbjct: 1381 LRDVAPDQAKEYLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPE 1440

Query: 1441 SLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMR 1500
            SLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMR
Sbjct: 1441 SLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMR 1500

Query: 1501 GECRHFASAFASARLGLPAPEDKRIVSTTECREPDNNQARTIDQPMLSLPPSTISSEEIK 1560
            GECRHFASAFASARLGLPAPEDKRIVSTTECREP++NQARTIDQPMLSLPPST  SEEIK
Sbjct: 1501 GECRHFASAFASARLGLPAPEDKRIVSTTECREPEDNQARTIDQPMLSLPPSTKPSEEIK 1560

Query: 1561 PSESHQSDGKTTAGACVPIIEEPATPEQETTTQDAIIDIEDGFYEDPDEIPTIKLNIEEF 1620
            PSE HQSDGKTT G CVPIIEEPATPEQE+TT+DAIIDIED FYEDPDEIPTIKLNIEEF
Sbjct: 1561 PSERHQSDGKTTIGMCVPIIEEPATPEQESTTKDAIIDIEDAFYEDPDEIPTIKLNIEEF 1620

Query: 1621 SQNLQNFVQKNMELQEGDMSKALIALTPEAASIPTPKLKNVSRLRTEHQVYELPDNHPLL 1680
            SQNLQN+VQKNMELQEGDMSKALIALTPEAASIP PKLKNVSRLRTEH VYELPDNHPLL
Sbjct: 1621 SQNLQNYVQKNMELQEGDMSKALIALTPEAASIPMPKLKNVSRLRTEHLVYELPDNHPLL 1680

Query: 1681 EKLKLDRREPDDPSSYLLAIWTPGETANSIELPEKRCSNQEHHQLCCEEECLSCNSVREA 1740
            EKL+LDRREPDDP SY LAIWTPGETANSI+LPEKRCSNQEHHQLC EEECLSCNSVREA
Sbjct: 1681 EKLELDRREPDDPCSYFLAIWTPGETANSIQLPEKRCSNQEHHQLCLEEECLSCNSVREA 1740

Query: 1741 NSFMVRGTLLIPCRTAMRGSFPLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVY 1800
            NS MVRGTLLIPCRTAMRGSFPLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVY
Sbjct: 1741 NSLMVRGTLLIPCRTAMRGSFPLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVY 1800

Query: 1801 FGTSIPTIFKGLSTQGIQHCFWRGFVCVRGFDQKTRAPRPLMARLHFPASKLNRGRGKTE 1852
            FGTSIPTIFKGLSTQGIQHCFWRGFVCVRGFD+K+RAPRPLMARLHFPASKLNRGRGKT 
Sbjct: 1801 FGTSIPTIFKGLSTQGIQHCFWRGFVCVRGFDKKSRAPRPLMARLHFPASKLNRGRGKTV 1851

BLAST of PI0016811 vs. NCBI nr
Match: XP_011651988.1 (transcriptional activator DEMETER [Cucumis sativus] >KGN59055.1 hypothetical protein Csa_001445 [Cucumis sativus])

HSP 1 Score: 3489.9 bits (9048), Expect = 0.0e+00
Identity = 1761/1851 (95.14%), Positives = 1792/1851 (96.81%), Query Frame = 0

Query: 1    MDSGQPEGNKADVQGGSWIPATPMKPILPKPPLQPLIYARMDRNQPRPYWLGSERLFSNS 60
            MDSGQPEGNKADVQG SWIPATPMKPILPKPPLQPLIYARMDRNQPRPYWLG ERLFSNS
Sbjct: 1    MDSGQPEGNKADVQGSSWIPATPMKPILPKPPLQPLIYARMDRNQPRPYWLGPERLFSNS 60

Query: 61   NKEAETSSGVACYGGANFMTANGSNDWEAAQARQFQVARNDNGTVTIHSMDALGGIPFLQ 120
            +KEAETSSGVACYGGAN MTANGSNDWEAAQARQFQVA NDNGTVTIHSMDALGGIPFLQ
Sbjct: 61   DKEAETSSGVACYGGANSMTANGSNDWEAAQARQFQVACNDNGTVTIHSMDALGGIPFLQ 120

Query: 121  LMALADAASIVGADAALGGNASDLFDSGSSYQIELESSSMKDRLSGSCIPEAKEYGTSDH 180
            LMALADAASIVGADAALGGNASDLFDSGSSYQIELESSSMKDRLSGSCIPEAKEY TSDH
Sbjct: 121  LMALADAASIVGADAALGGNASDLFDSGSSYQIELESSSMKDRLSGSCIPEAKEYETSDH 180

Query: 181  GSQHAYDLNFPSGTESDAAGIRVTSQFAPLTPDMGKCKYTERGMELQQIPIENSQDERER 240
            GSQHA+DLNFPS TESDAAGIRVTSQFAPLTPDMGK KYTERGMELQQIP ENSQDERE 
Sbjct: 181  GSQHAHDLNFPSRTESDAAGIRVTSQFAPLTPDMGKIKYTERGMELQQIPTENSQDEREL 240

Query: 241  NHNCNTSITVDGENLRQNQELLEPAMHSTINCTPDGKEGKNDGDLNKTPASRQRRRKHRP 300
            NHNCNTSITVDGENLRQNQELLEPAMHSTINCTPDGKEGKNDGDLNKTPASRQRRRKHRP
Sbjct: 241  NHNCNTSITVDGENLRQNQELLEPAMHSTINCTPDGKEGKNDGDLNKTPASRQRRRKHRP 300

Query: 301  KVIVEGKTNRTKQILKTPSSNQSVRKRVRKSGLTKPSATPPIEVTGETSEQEMVKHRRKS 360
            KVIVEGKTNRTKQ LKTPSSN SVRKRVRKSGL KPSATP IEVTGETSEQE+VKHRRKS
Sbjct: 301  KVIVEGKTNRTKQNLKTPSSNPSVRKRVRKSGLAKPSATPSIEVTGETSEQEIVKHRRKS 360

Query: 361  CRRAINFDSQAQTRDGSLDSGPLEQGSLTQNIQSTTGLEEARLEEVGSSTDPNWSMNQMP 420
            CRRAI FDSQAQTRD SLD GPLEQGSLTQNIQSTTGLEE R+EEVGSSTDPNWSMNQM 
Sbjct: 361  CRRAITFDSQAQTRDESLDLGPLEQGSLTQNIQSTTGLEEVRIEEVGSSTDPNWSMNQML 420

Query: 421  KKYESLSEKQAPPTKLSAENNSSERKQPSKSQMENNIEQNGKIISNSDKENTVETILNDD 480
            KKYESLSEK+APPT+LSAEN+SSE+ QPSKSQ EN+ EQNGK+IS+SDKENTVETILND+
Sbjct: 421  KKYESLSEKEAPPTELSAENDSSEQTQPSKSQKENDTEQNGKVISSSDKENTVETILNDE 480

Query: 481  NHSLPGNSHGLIFCKNPPLTSIEQATCCLRKRPRAIKQAHTGSINLTGAHYNTLSAYQSM 540
            NHSLPGNSHGLIFCKNPPLTSIEQATCCLRKRPRAIKQAHTGSINLTGAHYNTLSAYQSM
Sbjct: 481  NHSLPGNSHGLIFCKNPPLTSIEQATCCLRKRPRAIKQAHTGSINLTGAHYNTLSAYQSM 540

Query: 541  SWMHFPHIYKKKRTEKGQNPVPSSAFTTATNFIRPESACSFNDPQRDHMVSKFNAWIAGP 600
            SWMHFPHIYKKKRTEKGQNP+PSSAF TATNF RPESACSFNDPQRDH+VSKFN WI GP
Sbjct: 541  SWMHFPHIYKKKRTEKGQNPIPSSAFATATNFTRPESACSFNDPQRDHVVSKFNTWIPGP 600

Query: 601  QFNICKSKSVAGHGGNDLQDKLQTYGGIVGLDQTGRTKKKPRTAKRLSGLAPPERITHWE 660
            QFNICKSK+VAGH GN+LQDKLQT GGIVGL QTGRTKKKPRTAKRLS  A PERI+HWE
Sbjct: 601  QFNICKSKTVAGHEGNNLQDKLQTCGGIVGLGQTGRTKKKPRTAKRLSSSARPERISHWE 660

Query: 661  KQPIYPTNHPPPAGSAKNINTSGTCINGLFEMMHATVAKKKRTKKKPSNSALLNINKDLQ 720
            KQPIYPTNHPPPAGSAKNINTSGTCINGLFE+MHATVAKKKRTKKKPSNSALLNINKDLQ
Sbjct: 661  KQPIYPTNHPPPAGSAKNINTSGTCINGLFEIMHATVAKKKRTKKKPSNSALLNINKDLQ 720

Query: 721  DRRFVSFNPWQFFPKTLGTASEHGNQICFIDLLVEQLKHLDINKESNNLGYREQALVPYN 780
            DRRFVSF+PWQFFPKTLGT SEHGNQICFIDL+ EQLKHLDINKESNNLGYREQAL+PYN
Sbjct: 721  DRRFVSFSPWQFFPKTLGTDSEHGNQICFIDLIAEQLKHLDINKESNNLGYREQALIPYN 780

Query: 781  MQNQEHSAIVVYGRDGTIVPFNPIKKRRPRPKVELDEETGRVWKLLMGNINSKGIDGTDE 840
            MQNQEH+AIVVYGRDGTIVPFNPIKKRRPRPKVELDEETGRVWKLLMGNINSKGIDGTDE
Sbjct: 781  MQNQEHNAIVVYGRDGTIVPFNPIKKRRPRPKVELDEETGRVWKLLMGNINSKGIDGTDE 840

Query: 841  EKIKWWEEERKVFQGRADSFIARMHLVQGDRRFSQWKGSVVDSVVGVFLTQNVSDHLSSS 900
            E IKWWEEERKVFQGRADSFIARMHLVQGDRRFSQWKGSVVDSVVGVFLTQNVSDHLSSS
Sbjct: 841  ENIKWWEEERKVFQGRADSFIARMHLVQGDRRFSQWKGSVVDSVVGVFLTQNVSDHLSSS 900

Query: 901  AFMSLAARFPPKPKCRQASCSQEPIIELDEPEEACVFNLEDSMKLNKQIIHQQISEEGSL 960
            AFMSLAARFPPK KCRQASCSQEPIIELDEPEEAC+FNLEDSMKLNKQIIHQQISEE  L
Sbjct: 901  AFMSLAARFPPKSKCRQASCSQEPIIELDEPEEACMFNLEDSMKLNKQIIHQQISEEDLL 960

Query: 961  MKDEMEKSEGRIIVDNNESSGSNAEDGSSNKEPEKKSFSSSHNILETCSNSVGEISLTET 1020
            MKDEMEK EGRIIV+NNESSGSN EDGSSNKEPEKKSFSSSHNILETCSNSVGEISLTET
Sbjct: 961  MKDEMEKGEGRIIVENNESSGSNVEDGSSNKEPEKKSFSSSHNILETCSNSVGEISLTET 1020

Query: 1021 SSMQACLSGEKETYDSFSFQDCLDSSIPQTSESIEPSSEGNSEDLPSWSTEAHIDSSSEE 1080
            SSMQACLSGEKETYDSFS QDCLDSSIPQT+ES+EPSSEGNSEDLPSWSTEAHIDSSSEE
Sbjct: 1021 SSMQACLSGEKETYDSFSSQDCLDSSIPQTNESVEPSSEGNSEDLPSWSTEAHIDSSSEE 1080

Query: 1081 LIQMTGPNTLNANFTTDTSVEQSENTTTNKLVEKKCDNRIDDTSQPDDPEISLKNSVYHL 1140
            L QMTG NTLNANFT DT VEQSENT TNKLVE KCDNRIDDTSQP DPEISLKNSVYHL
Sbjct: 1081 LTQMTGLNTLNANFTIDTCVEQSENTITNKLVENKCDNRIDDTSQPVDPEISLKNSVYHL 1140

Query: 1141 SDYQTQQNQTSKSLEVDCCQTSNGVQTSNDCQNKDEHFHTEQSTLTVEYDNHANVEMELI 1200
            S YQTQQNQTSKSLEVDCCQTSNGVQTSNDCQNKDE FHTEQSTLTVE DNHA VEMELI
Sbjct: 1141 SGYQTQQNQTSKSLEVDCCQTSNGVQTSNDCQNKDEQFHTEQSTLTVESDNHAIVEMELI 1200

Query: 1201 VDIVEAPSSSSELSINAKEPGLTLQSQSSVIEDPQNVESPAECTNTVYEIPPNATEIATK 1260
            VDIVEAPSSSSELSINAKEP LTLQSQSSVIEDPQNVESPAECTNTV+EIPPNATEIATK
Sbjct: 1201 VDIVEAPSSSSELSINAKEPCLTLQSQSSVIEDPQNVESPAECTNTVHEIPPNATEIATK 1260

Query: 1261 PNPKECNLLSNEFKELKPASSRSQRKQVAKEKDNINWDNLRKQTETNGKTRQRTESTMDS 1320
            PNPKECNLLSNEFKELKPASSRSQ KQVAKEKDNINWDNLRK+TETNGKTRQRTE TMDS
Sbjct: 1261 PNPKECNLLSNEFKELKPASSRSQSKQVAKEKDNINWDNLRKRTETNGKTRQRTEDTMDS 1320

Query: 1321 LDWEAIRCADVNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEWLRDVEPDHAKE 1380
            LDWEAIRCADVNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEWLRDVEPD AKE
Sbjct: 1321 LDWEAIRCADVNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEWLRDVEPDQAKE 1380

Query: 1381 YLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYP 1440
            YLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYP
Sbjct: 1381 YLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYP 1440

Query: 1441 VLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFA 1500
            VLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFA
Sbjct: 1441 VLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFA 1500

Query: 1501 SARLGLPAPEDKRIVSTTECREPDNNQARTIDQPMLSLPPSTISSEEIKPSESHQSDGKT 1560
            SARLGLPAPEDKRIVSTTECREPDNNQ RTIDQPMLSLPPSTISS EIKPSESHQSDGKT
Sbjct: 1501 SARLGLPAPEDKRIVSTTECREPDNNQPRTIDQPMLSLPPSTISSVEIKPSESHQSDGKT 1560

Query: 1561 TAGACVPIIEEPATPEQETTTQDAIIDIEDGFYEDPDEIPTIKLNIEEFSQNLQNFVQKN 1620
            TAGACVPIIEEPATPEQET TQDAIIDIED FYEDPDEIPTIKLNIEEFSQNLQN+VQKN
Sbjct: 1561 TAGACVPIIEEPATPEQETATQDAIIDIEDAFYEDPDEIPTIKLNIEEFSQNLQNYVQKN 1620

Query: 1621 MELQEGDMSKALIALTPEAASIPTPKLKNVSRLRTEHQVYELPDNHPLLEKLKLDRREPD 1680
            MELQEGDMSKALIALTPEAASIPTPKLKNVSRLRTEHQVYELPDNHPLLEKLKLDRREPD
Sbjct: 1621 MELQEGDMSKALIALTPEAASIPTPKLKNVSRLRTEHQVYELPDNHPLLEKLKLDRREPD 1680

Query: 1681 DPSSYLLAIWTPGETANSIELPEKRCSNQEHHQLCCEEECLSCNSVREANSFMVRGTLLI 1740
            DPSSYLLAIWTPGETANSI+LPEKRCS+QEHHQLCCEEECLSCNSVREANSFMVRGTLLI
Sbjct: 1681 DPSSYLLAIWTPGETANSIQLPEKRCSSQEHHQLCCEEECLSCNSVREANSFMVRGTLLI 1740

Query: 1741 PCRTAMRGSFPLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVYFGTSIPTIFKG 1800
            PCRTAMRGSFPLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVYFGTSIPTIFKG
Sbjct: 1741 PCRTAMRGSFPLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVYFGTSIPTIFKG 1800

Query: 1801 LSTQGIQHCFWRGFVCVRGFDQKTRAPRPLMARLHFPASKLNRGRGKTEDQ 1852
            LSTQGIQHCFWRGFVCVRGFDQKTRAPRPLMARLHFPASKLNRGRGKTEDQ
Sbjct: 1801 LSTQGIQHCFWRGFVCVRGFDQKTRAPRPLMARLHFPASKLNRGRGKTEDQ 1851

BLAST of PI0016811 vs. NCBI nr
Match: XP_016899677.1 (PREDICTED: protein ROS1 [Cucumis melo])

HSP 1 Score: 3458.7 bits (8967), Expect = 0.0e+00
Identity = 1747/1852 (94.33%), Positives = 1783/1852 (96.27%), Query Frame = 0

Query: 1    MDSGQPEGNKADVQGGSWIPATPMKPILPKPPLQPLIYARMDRNQPRPYWLGSERLFSNS 60
            MDSGQPEGNKADVQGGSWIPATPMKPILPKPPLQPLIYARMDRNQPRPYWLGSERLFSNS
Sbjct: 1    MDSGQPEGNKADVQGGSWIPATPMKPILPKPPLQPLIYARMDRNQPRPYWLGSERLFSNS 60

Query: 61   NKEAETSSGVACYGGANFMTANGSNDWEAAQARQFQVARNDNGTVTIHSMDALGGIPFLQ 120
            NKEAETSSGVACYGGAN MTANGSN WEAAQARQFQVARNDNGTVTIHSMDALGGIPFLQ
Sbjct: 61   NKEAETSSGVACYGGANSMTANGSNYWEAAQARQFQVARNDNGTVTIHSMDALGGIPFLQ 120

Query: 121  LMALADAASIVGADAALGGNASDLFDSGSSYQIELESSSMKDRLSGSCIPEAKEYGTSDH 180
            LMALADAASIVGADAALGGNASDLFDSGSSYQIELESSSMKDR  GSC+PEAKEY  S+H
Sbjct: 121  LMALADAASIVGADAALGGNASDLFDSGSSYQIELESSSMKDRFGGSCVPEAKEYVPSEH 180

Query: 181  GSQHAYDLNFPSGTESDAAGIRVTSQFAPLTPDMGKCKYTERGMELQQIPIENSQDERER 240
            GSQ+A++LNFPS  ESDAAGIRVTSQFAPLTPDMGK KY ERG ELQQI IENSQDERE+
Sbjct: 181  GSQYAHELNFPSRIESDAAGIRVTSQFAPLTPDMGKSKYIERGTELQQILIENSQDEREQ 240

Query: 241  NHNCNTSITVDGENLRQNQELLEPAMHSTINCTPDGKEGKNDGDLNKTPASRQRRRKHRP 300
            NHNCNTSITV GEN+ QNQ+LLEPAMHSTINCTPDGKEGKNDGDLNKTPASRQRRRKHRP
Sbjct: 241  NHNCNTSITV-GENVTQNQKLLEPAMHSTINCTPDGKEGKNDGDLNKTPASRQRRRKHRP 300

Query: 301  KVIVEGKTNRTKQILKTPSSNQSVRKRVRKSGLTKPSATPPIEVTGETSEQEMVKHRRKS 360
            KVIVEGKTNRTKQ LKTPSSN S RKRVRKSGL KPSATP IEVTGETSEQEMVKHRRKS
Sbjct: 301  KVIVEGKTNRTKQNLKTPSSNPSARKRVRKSGLAKPSATPSIEVTGETSEQEMVKHRRKS 360

Query: 361  CRRAINFDSQAQTRDGSLDSGPLEQGSLTQNIQSTTGLEEARLEEVGSSTDPNWSMNQMP 420
            CRRAI FDSQAQTRDGSLDSGPLEQGSLTQN QSTTGLE  RLEEVGSSTDPNWSMNQM 
Sbjct: 361  CRRAITFDSQAQTRDGSLDSGPLEQGSLTQNSQSTTGLEVVRLEEVGSSTDPNWSMNQML 420

Query: 421  KKYESLSEKQAPPTKLSAENNSSERKQPSKSQMENNIEQNGKIISNSDKENTVETILNDD 480
            KKYESLSEK+APPT+LSAENNSSE+KQPSKSQMENN EQNGK+ISNSDKENTVE I NDD
Sbjct: 421  KKYESLSEKEAPPTELSAENNSSEQKQPSKSQMENNTEQNGKVISNSDKENTVEAIPNDD 480

Query: 481  NHSLPGNSHGLIFCKNPPLTSIEQATCCLRKRPRAIKQAHTGSINLTGAHYNTLSAYQSM 540
            NHSLPGNSHGLIFCKNPPLTSIEQATCCLRKRPRAIKQAHTGSINLTG HYNTLSAYQSM
Sbjct: 481  NHSLPGNSHGLIFCKNPPLTSIEQATCCLRKRPRAIKQAHTGSINLTGVHYNTLSAYQSM 540

Query: 541  SWMHFPHIYKKKRTEKGQNPVPSSAFTTATNFIRPESACSFNDPQRDHMVSKFNAWIAGP 600
            SWMHFPHIYKKKRTEKGQNPVPSSAFTTATNF RPESACSFNDPQRDHMVSKFNAWI+GP
Sbjct: 541  SWMHFPHIYKKKRTEKGQNPVPSSAFTTATNFTRPESACSFNDPQRDHMVSKFNAWISGP 600

Query: 601  QFNICKSKSVAGHGGNDLQDKLQTYGGIVGLDQTGRTKKKPRTAKRLSGLAPPERITHWE 660
            QFN+CKSK+VAGHGGNDLQDKLQTYGGIVGL QTGRTKKKPRTAKR+S LAPPERI+HWE
Sbjct: 601  QFNVCKSKTVAGHGGNDLQDKLQTYGGIVGLGQTGRTKKKPRTAKRVSSLAPPERISHWE 660

Query: 661  KQPIYPTNHPPPAGSAKNINTSGTCINGLFEMMHATVAKKKRTKKKPSNSALLNINKDLQ 720
            KQP YPTNHPPPAGSAKNINTSGTC+NGLFEMMHATVAKKKRTKKKPSNS LLNINKDL+
Sbjct: 661  KQPTYPTNHPPPAGSAKNINTSGTCVNGLFEMMHATVAKKKRTKKKPSNSTLLNINKDLE 720

Query: 721  DRRFVSFNPWQFFPKTLGTASEHGNQICFIDLLVEQLKHLDINKESNNLGYREQALVPYN 780
            DRRFVSF+PWQFFPKTLGTASEHGNQICFIDL+ EQLKHLDINKESNNLGYREQALVP+N
Sbjct: 721  DRRFVSFSPWQFFPKTLGTASEHGNQICFIDLIAEQLKHLDINKESNNLGYREQALVPFN 780

Query: 781  MQNQEHSAIVVYGRDGTIVPFNPIKKRRPRPKVELDEETGRVWKLLMGNINSKGIDGTDE 840
            MQNQE +AIVVYGRDGTIVPFNPIKKRRPRPKVELDEETGRVWKLLMGNINS GIDGTDE
Sbjct: 781  MQNQELNAIVVYGRDGTIVPFNPIKKRRPRPKVELDEETGRVWKLLMGNINSTGIDGTDE 840

Query: 841  EKIKWWEEERKVFQGRADSFIARMHLVQGDRRFSQWKGSVVDSVVGVFLTQNVSDHLSSS 900
            E IKWWEEERKVFQGRADSFIARMHLVQGDRRFSQWKGSVVDSVVGVFLTQNVSDHLSSS
Sbjct: 841  ENIKWWEEERKVFQGRADSFIARMHLVQGDRRFSQWKGSVVDSVVGVFLTQNVSDHLSSS 900

Query: 901  AFMSLAARFPPKPKCRQASCSQEPIIELDEPEEACVFNLEDSMKLNKQIIHQQISEEGSL 960
            AFMSLAARFPPKPKC QAS SQEPIIEL+EPEE C+FNLEDSMKLNKQIIHQQISEEGSL
Sbjct: 901  AFMSLAARFPPKPKCHQASSSQEPIIELNEPEEVCMFNLEDSMKLNKQIIHQQISEEGSL 960

Query: 961  MKDEMEKSEGRIIVDNNESSGSNAEDGSSNKEPEKKSFSSSHNILETCSNSVGEISLTET 1020
            MKDEMEKSEGRIIVDNNESSGSN EDGSSNKEPEKKSF SSHNILET SNSVGEISLTET
Sbjct: 961  MKDEMEKSEGRIIVDNNESSGSNVEDGSSNKEPEKKSFGSSHNILETFSNSVGEISLTET 1020

Query: 1021 SSMQACLSGEKETYDSFSFQDCLDSSIPQTSESIEPSSEGNSEDLPSWSTEAHIDSSSEE 1080
            SSMQAC SGEKETYDSFS QD LDSSIPQT+ES+EPSSEGNSEDLPSWS EAHIDSSSEE
Sbjct: 1021 SSMQACFSGEKETYDSFSSQDFLDSSIPQTNESMEPSSEGNSEDLPSWSAEAHIDSSSEE 1080

Query: 1081 LIQMTGPNTLNANFTTDTSVEQSENTTTNKLVEKKCDNRIDDTSQPDDPEISLKNSVYHL 1140
            LIQMTG NTLNANFT D SVE SENT TN LVE KCDNRIDDTSQPDDPEIS+KNSVYHL
Sbjct: 1081 LIQMTGLNTLNANFTIDISVEPSENTITNNLVENKCDNRIDDTSQPDDPEISIKNSVYHL 1140

Query: 1141 SDYQTQQNQTSKSLEVDCCQTSNGVQTSNDCQNKDEHFHTEQSTLTVEYDNHANVEMELI 1200
            S YQTQQNQTSKSL+VDCCQTSNGVQTSNDCQNKDE FHTEQSTLTVE DNHANVEMELI
Sbjct: 1141 SGYQTQQNQTSKSLDVDCCQTSNGVQTSNDCQNKDEQFHTEQSTLTVESDNHANVEMELI 1200

Query: 1201 VDIVEAPSSSSELSINAKEPGLTLQSQSSVIEDPQNVESPAECTNTVYEIPPNATEIATK 1260
            VDIVEAPSSSSELSINAK+PGLTLQSQSSVIEDPQNVESPAECTNTV+  PPNATEIATK
Sbjct: 1201 VDIVEAPSSSSELSINAKDPGLTLQSQSSVIEDPQNVESPAECTNTVHGSPPNATEIATK 1260

Query: 1261 PNPKECNLLSNEFKELKPASSRSQRKQVAKEKDNINWDNLRKQTETNGKTRQRTESTMDS 1320
            PNPKE NLLSNEFKELKPASSRSQ KQVAKEKD INWDNLRKQTETNGKTRQRTE+TMDS
Sbjct: 1261 PNPKEYNLLSNEFKELKPASSRSQSKQVAKEKDKINWDNLRKQTETNGKTRQRTENTMDS 1320

Query: 1321 LDWEAIRCADVNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEWLRDVEPDHAKE 1380
            LDWEA+RCADVNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEWLRDVEPD AKE
Sbjct: 1321 LDWEALRCADVNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEWLRDVEPDQAKE 1380

Query: 1381 YLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYP 1440
            YLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYP
Sbjct: 1381 YLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYP 1440

Query: 1441 VLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFA 1500
            VLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFA
Sbjct: 1441 VLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFA 1500

Query: 1501 SARLGLPAPEDKRIVSTTECREPDNNQARTIDQPMLSLPPSTISSEEIKPSESHQSDGKT 1560
            SARLGLPAPEDKRIVSTTECREPDNNQ RTIDQPMLSLPPSTISSEEIKPSESH+ DGKT
Sbjct: 1501 SARLGLPAPEDKRIVSTTECREPDNNQPRTIDQPMLSLPPSTISSEEIKPSESHECDGKT 1560

Query: 1561 TAGACVPIIEEPATPEQETTTQD-AIIDIEDGFYEDPDEIPTIKLNIEEFSQNLQNFVQK 1620
            TAGACVPIIEEPATPEQET TQD  IIDIED FYEDPDEIPTIKLNIEEFSQNLQN+VQK
Sbjct: 1561 TAGACVPIIEEPATPEQETATQDPRIIDIEDAFYEDPDEIPTIKLNIEEFSQNLQNYVQK 1620

Query: 1621 NMELQEGDMSKALIALTPEAASIPTPKLKNVSRLRTEHQVYELPDNHPLLEKLKLDRREP 1680
            NMELQEGDMSKALIALTPEAASIPTPKLKNVSRLRTEHQVYELPDNHPLLEKLKLDRREP
Sbjct: 1621 NMELQEGDMSKALIALTPEAASIPTPKLKNVSRLRTEHQVYELPDNHPLLEKLKLDRREP 1680

Query: 1681 DDPSSYLLAIWTPGETANSIELPEKRCSNQEHHQLCCEEECLSCNSVREANSFMVRGTLL 1740
            DDPSSYLLAIWTPGETANSI+LPEKRCSNQEHHQLCCEEECLSCNSVREANSFMVRGTLL
Sbjct: 1681 DDPSSYLLAIWTPGETANSIQLPEKRCSNQEHHQLCCEEECLSCNSVREANSFMVRGTLL 1740

Query: 1741 IPCRTAMRGSFPLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVYFGTSIPTIFK 1800
            IPCRTAMRGSFPLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVYFGTSIPTIFK
Sbjct: 1741 IPCRTAMRGSFPLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVYFGTSIPTIFK 1800

Query: 1801 GLSTQGIQHCFWRGFVCVRGFDQKTRAPRPLMARLHFPASKLNRGRGKTEDQ 1852
            GLSTQGIQHCFWRGFVCVRGFDQKTRAPRPLMARLHFPASKLNRGRGKTEDQ
Sbjct: 1801 GLSTQGIQHCFWRGFVCVRGFDQKTRAPRPLMARLHFPASKLNRGRGKTEDQ 1851

BLAST of PI0016811 vs. NCBI nr
Match: TYK25216.1 (protein ROS1 [Cucumis melo var. makuwa])

HSP 1 Score: 3349.3 bits (8683), Expect = 0.0e+00
Identity = 1697/1803 (94.12%), Positives = 1733/1803 (96.12%), Query Frame = 0

Query: 1    MDSGQPEGNKADVQGGSWIPATPMKPILPKPPLQPLIYARMDRNQPRPYWLGSERLFSNS 60
            MDSGQPEGNKADVQGGSWIPATPMKPILPKPPLQPLIYARMDRNQPRPYWLGSERLFSNS
Sbjct: 1    MDSGQPEGNKADVQGGSWIPATPMKPILPKPPLQPLIYARMDRNQPRPYWLGSERLFSNS 60

Query: 61   NKEAETSSGVACYGGANFMTANGSNDWEAAQARQFQVARNDNGTVTIHSMDALGGIPFLQ 120
            NKEAETSSGVACYGGAN MTANGSN WEAAQARQFQVARNDNGTVTIHSMDALGGIPFLQ
Sbjct: 61   NKEAETSSGVACYGGANSMTANGSNYWEAAQARQFQVARNDNGTVTIHSMDALGGIPFLQ 120

Query: 121  LMALADAASIVGADAALGGNASDLFDSGSSYQIELESSSMKDRLSGSCIPEAKEYGTSDH 180
            LMALADAASIVGADAALGGNASDLFDSGSSYQIELESSSMKDR  GSC+PEAKEY  S+H
Sbjct: 121  LMALADAASIVGADAALGGNASDLFDSGSSYQIELESSSMKDRFGGSCVPEAKEYVPSEH 180

Query: 181  GSQHAYDLNFPSGTESDAAGIRVTSQFAPLTPDMGKCKYTERGMELQQIPIENSQDERER 240
            GSQ+A++LNFPS  ESDAAGIRVTSQFAPLTPDMGK KY ERG ELQQI IENSQDERE+
Sbjct: 181  GSQYAHELNFPSRIESDAAGIRVTSQFAPLTPDMGKSKYIERGTELQQILIENSQDEREQ 240

Query: 241  NHNCNTSITVDGENLRQNQELLEPAMHSTINCTPDGKEGKNDGDLNKTPASRQRRRKHRP 300
            NHNCNTSITV GEN+ QNQ+LLEPAMHSTINCTPDGKEGKNDGDLNKTPASRQRRRKHRP
Sbjct: 241  NHNCNTSITV-GENVTQNQKLLEPAMHSTINCTPDGKEGKNDGDLNKTPASRQRRRKHRP 300

Query: 301  KVIVEGKTNRTKQILKTPSSNQSVRKRVRKSGLTKPSATPPIEVTGETSEQEMVKHRRKS 360
            KVIVEGKTNRTKQ LKTPSSN S RKRVRKSGL KPSATP IEVTGETSEQEMVKHRRKS
Sbjct: 301  KVIVEGKTNRTKQNLKTPSSNPSARKRVRKSGLAKPSATPSIEVTGETSEQEMVKHRRKS 360

Query: 361  CRRAINFDSQAQTRDGSLDSGPLEQGSLTQNIQSTTGLEEARLEEVGSSTDPNWSMNQMP 420
            CRRAI FDSQAQTRDGSLDSGPLEQGSLTQN QSTTGLE  RLEEVGSSTDPNWSMNQM 
Sbjct: 361  CRRAITFDSQAQTRDGSLDSGPLEQGSLTQNSQSTTGLEVVRLEEVGSSTDPNWSMNQML 420

Query: 421  KKYESLSEKQAPPTKLSAENNSSERKQPSKSQMENNIEQNGKIISNSDKENTVETILNDD 480
            KKYESLSEK+APPT+LSAENNSSE+KQPSKSQMENN EQNGK+ISNSDKENTVE I NDD
Sbjct: 421  KKYESLSEKEAPPTELSAENNSSEQKQPSKSQMENNTEQNGKVISNSDKENTVEAIPNDD 480

Query: 481  NHSLPGNSHGLIFCKNPPLTSIEQATCCLRKRPRAIKQAHTGSINLTGAHYNTLSAYQSM 540
            NHSLPGNSHGLIFCKNPPLTSIEQATCCLRKRPRAIKQAHTGSINLTG HYNTLSAYQSM
Sbjct: 481  NHSLPGNSHGLIFCKNPPLTSIEQATCCLRKRPRAIKQAHTGSINLTGVHYNTLSAYQSM 540

Query: 541  SWMHFPHIYKKKRTEKGQNPVPSSAFTTATNFIRPESACSFNDPQRDHMVSKFNAWIAGP 600
            SWMHFPHIYKKKRTEKGQNPVPSSAFTTATNF RPESACSFNDPQRDHMVSKFNAWI+GP
Sbjct: 541  SWMHFPHIYKKKRTEKGQNPVPSSAFTTATNFTRPESACSFNDPQRDHMVSKFNAWISGP 600

Query: 601  QFNICKSKSVAGHGGNDLQDKLQTYGGIVGLDQTGRTKKKPRTAKRLSGLAPPERITHWE 660
            QFN+CKSK+VAGHGGNDLQDKLQTYGGIVGL QTGRTKKKPRTAKR+S LAPPERI+HWE
Sbjct: 601  QFNVCKSKTVAGHGGNDLQDKLQTYGGIVGLGQTGRTKKKPRTAKRVSSLAPPERISHWE 660

Query: 661  KQPIYPTNHPPPAGSAKNINTSGTCINGLFEMMHATVAKKKRTKKKPSNSALLNINKDLQ 720
            KQP YPTNHPPPAGSAKNINTSGTC+NGLFEMMHATVAKKKRTKKKPSNS LLNINKDL+
Sbjct: 661  KQPTYPTNHPPPAGSAKNINTSGTCVNGLFEMMHATVAKKKRTKKKPSNSTLLNINKDLE 720

Query: 721  DRRFVSFNPWQFFPKTLGTASEHGNQICFIDLLVEQLKHLDINKESNNLGYREQALVPYN 780
            DRRFVSF+PWQFFPKTLGTASEHGNQICFIDL+ EQLKHLDINKESNNLGYREQALVP+N
Sbjct: 721  DRRFVSFSPWQFFPKTLGTASEHGNQICFIDLIAEQLKHLDINKESNNLGYREQALVPFN 780

Query: 781  MQNQEHSAIVVYGRDGTIVPFNPIKKRRPRPKVELDEETGRVWKLLMGNINSKGIDGTDE 840
            MQNQE +AIVVYGRDGTIVPFNPIKKRRPRPKVELDEETGRVWKLLMGNINS GIDGTDE
Sbjct: 781  MQNQELNAIVVYGRDGTIVPFNPIKKRRPRPKVELDEETGRVWKLLMGNINSTGIDGTDE 840

Query: 841  EKIKWWEEERKVFQGRADSFIARMHLVQGDRRFSQWKGSVVDSVVGVFLTQNVSDHLSSS 900
            E IKWWEEERKVFQGRADSFIARMHLVQGDRRFSQWKGSVVDSVVGVFLTQNVSDHLSSS
Sbjct: 841  ENIKWWEEERKVFQGRADSFIARMHLVQGDRRFSQWKGSVVDSVVGVFLTQNVSDHLSSS 900

Query: 901  AFMSLAARFPPKPKCRQASCSQEPIIELDEPEEACVFNLEDSMKLNKQIIHQQISEEGSL 960
            AFMSLAARFPPKPKC QAS SQEPIIEL+EPEE C+FNLEDSMKLNKQIIHQQISEEGSL
Sbjct: 901  AFMSLAARFPPKPKCHQASSSQEPIIELNEPEEVCMFNLEDSMKLNKQIIHQQISEEGSL 960

Query: 961  MKDEMEKSEGRIIVDNNESSGSNAEDGSSNKEPEKKSFSSSHNILETCSNSVGEISLTET 1020
            MKDEMEKSEGRIIVDNNESSGSN EDGSSNKEPEKKSF SSHNILET SNSVGEISLTET
Sbjct: 961  MKDEMEKSEGRIIVDNNESSGSNVEDGSSNKEPEKKSFGSSHNILETFSNSVGEISLTET 1020

Query: 1021 SSMQACLSGEKETYDSFSFQDCLDSSIPQTSESIEPSSEGNSEDLPSWSTEAHIDSSSEE 1080
            SSMQAC SGEKETYDSFS QD LDSSIPQT+ES+EPSSEGNSEDLPSWS EAHIDSSSEE
Sbjct: 1021 SSMQACFSGEKETYDSFSSQDFLDSSIPQTNESMEPSSEGNSEDLPSWSAEAHIDSSSEE 1080

Query: 1081 LIQMTGPNTLNANFTTDTSVEQSENTTTNKLVEKKCDNRIDDTSQPDDPEISLKNSVYHL 1140
            LIQMTG NTLNANFT D SVE SENT TN LVE KCDNRIDDTSQPDDPEIS+KNSVYHL
Sbjct: 1081 LIQMTGLNTLNANFTIDISVEPSENTITNNLVENKCDNRIDDTSQPDDPEISIKNSVYHL 1140

Query: 1141 SDYQTQQNQTSKSLEVDCCQTSNGVQTSNDCQNKDEHFHTEQSTLTVEYDNHANVEMELI 1200
            S YQTQQNQTSKSL+VDCCQTSNGVQTSNDCQNKDE FHTEQSTLTVE DNHANVEMELI
Sbjct: 1141 SGYQTQQNQTSKSLDVDCCQTSNGVQTSNDCQNKDEQFHTEQSTLTVESDNHANVEMELI 1200

Query: 1201 VDIVEAPSSSSELSINAKEPGLTLQSQSSVIEDPQNVESPAECTNTVYEIPPNATEIATK 1260
            VDIVEAPSSSSELSINAK+PGLTLQSQSSVIEDPQNVESPAECTNTV+  PPNATEIATK
Sbjct: 1201 VDIVEAPSSSSELSINAKDPGLTLQSQSSVIEDPQNVESPAECTNTVHGSPPNATEIATK 1260

Query: 1261 PNPKECNLLSNEFKELKPASSRSQRKQVAKEKDNINWDNLRKQTETNGKTRQRTESTMDS 1320
            PNPKE NLLSNEFKELKPASSRSQ KQVAKEKD INWDNLRKQTETNGKTRQRTE+TMDS
Sbjct: 1261 PNPKEYNLLSNEFKELKPASSRSQSKQVAKEKDKINWDNLRKQTETNGKTRQRTENTMDS 1320

Query: 1321 LDWEAIRCADVNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEWLRDVEPDHAKE 1380
            LDWEA+RCADVNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEWLRDVEPD AKE
Sbjct: 1321 LDWEALRCADVNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEWLRDVEPDQAKE 1380

Query: 1381 YLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYP 1440
            YLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYP
Sbjct: 1381 YLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYP 1440

Query: 1441 VLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFA 1500
            VLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFA
Sbjct: 1441 VLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFA 1500

Query: 1501 SARLGLPAPEDKRIVSTTECREPDNNQARTIDQPMLSLPPSTISSEEIKPSESHQSDGKT 1560
            SARLGLPAPEDKRIVSTTECREPDNNQ RTIDQPMLSLPPSTISSEEIKPSESH+ DGKT
Sbjct: 1501 SARLGLPAPEDKRIVSTTECREPDNNQPRTIDQPMLSLPPSTISSEEIKPSESHECDGKT 1560

Query: 1561 TAGACVPIIEEPATPEQETTTQD-AIIDIEDGFYEDPDEIPTIKLNIEEFSQNLQNFVQK 1620
            TAGACVPIIEEPATPEQET TQD  IIDIED FYEDPDEIPTIKLNIEEFSQNLQN+VQK
Sbjct: 1561 TAGACVPIIEEPATPEQETATQDPRIIDIEDAFYEDPDEIPTIKLNIEEFSQNLQNYVQK 1620

Query: 1621 NMELQEGDMSKALIALTPEAASIPTPKLKNVSRLRTEHQVYELPDNHPLLEKLKLDRREP 1680
            NMELQEGDMSKALIALTPEAASIPTPKLKNVSRLRTEHQVYELPDNHPLLEKLKLDRREP
Sbjct: 1621 NMELQEGDMSKALIALTPEAASIPTPKLKNVSRLRTEHQVYELPDNHPLLEKLKLDRREP 1680

Query: 1681 DDPSSYLLAIWTPGETANSIELPEKRCSNQEHHQLCCEEECLSCNSVREANSFMVRGTLL 1740
            DDPSSYLLAIWTPGETANSI+LPEKRCSNQEHHQLCCEEECLSCNSVREANSFMVRGTLL
Sbjct: 1681 DDPSSYLLAIWTPGETANSIQLPEKRCSNQEHHQLCCEEECLSCNSVREANSFMVRGTLL 1740

Query: 1741 IPCRTAMRGSFPLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVYFGTSIPTIFK 1800
            IPCRTAMRGSFPLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVYFGTSIPTIFK
Sbjct: 1741 IPCRTAMRGSFPLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVYFGTSIPTIFK 1800

Query: 1801 GLS 1803
            G S
Sbjct: 1801 GSS 1802

BLAST of PI0016811 vs. NCBI nr
Match: KAA0043923.1 (protein ROS1 [Cucumis melo var. makuwa])

HSP 1 Score: 3344.7 bits (8671), Expect = 0.0e+00
Identity = 1697/1804 (94.07%), Positives = 1733/1804 (96.06%), Query Frame = 0

Query: 1    MDSGQPEGNKADVQGGSWIPATPMKPILPKPPLQPLIYARMDRNQPRPYWLGSERLFSNS 60
            MDSGQPEGNKADVQGGSWIPATPMKPILPKPPLQPLIYARMDRNQPRPYWLGSERLFSNS
Sbjct: 1    MDSGQPEGNKADVQGGSWIPATPMKPILPKPPLQPLIYARMDRNQPRPYWLGSERLFSNS 60

Query: 61   NKEAETSSGVACYGGANFMTANGSNDWEAAQARQFQVARNDNGTVTIHSMDALGGIPFLQ 120
            NKEAETSSGVACYGGAN MTANGSN WEAAQARQFQVARNDNGTVTIHSMDALGGIPFLQ
Sbjct: 61   NKEAETSSGVACYGGANSMTANGSNYWEAAQARQFQVARNDNGTVTIHSMDALGGIPFLQ 120

Query: 121  LMALADAASIVGADAALGGNASDLFDSGSSYQIELESSSMKDRLSGSCIPEAKEYGTSDH 180
            LMALADAASIVGADAALGGNASDLFDSGSSYQIELESSSMKDR  GSC+PEAKEY  S+H
Sbjct: 121  LMALADAASIVGADAALGGNASDLFDSGSSYQIELESSSMKDRFGGSCVPEAKEYVPSEH 180

Query: 181  GSQHAYDLNFPSGTESDAAGIRVTSQFAPLTPDMGKCKYTERGMELQQIPIENSQDERER 240
            GSQ+A++LNFPS  ESDAAGIRVTSQFAPLTPDMGK KY ERG ELQQI IENSQDERE+
Sbjct: 181  GSQYAHELNFPSRIESDAAGIRVTSQFAPLTPDMGKSKYIERGTELQQILIENSQDEREQ 240

Query: 241  NHNCNTSITVDGENLRQNQELLEPAMHSTINCTPDGKEGKNDGDLNKTPASRQRRRKHRP 300
            NHNCNTSITV GEN+ QNQ+LLEPAMHSTINCTPDGKEGKNDGDLNKTPASRQRRRKHRP
Sbjct: 241  NHNCNTSITV-GENVTQNQKLLEPAMHSTINCTPDGKEGKNDGDLNKTPASRQRRRKHRP 300

Query: 301  KVIVEGKTNRTKQILKTPSSNQSVRKRVRKSGLTKPSATPPIEVTGETSEQEMVKHRRKS 360
            KVIVEGKTNRTKQ LKTPSSN S RKRVRKSGL KPSATP IEVTGETSEQEMVKHRRKS
Sbjct: 301  KVIVEGKTNRTKQNLKTPSSNPSARKRVRKSGLAKPSATPSIEVTGETSEQEMVKHRRKS 360

Query: 361  CRRAINFDSQAQTRDGSLDSGPLEQGSLTQNIQSTTGLEEARLEEVGSSTDPNWSMNQMP 420
            CRRAI FDSQAQTRDGSLDSGPLEQGSLTQN QSTTGLE  RLEEVGSSTDPNWSMNQM 
Sbjct: 361  CRRAITFDSQAQTRDGSLDSGPLEQGSLTQNSQSTTGLEVVRLEEVGSSTDPNWSMNQML 420

Query: 421  KKYESLSEKQAPPTKLSAENNSSERKQPSKSQMENNIEQNGKIISNSDKENTVETILNDD 480
            KKYESLSEK+APPT+LSAENNSSE+KQPSKSQMENN EQNGK+ISNSDKENTVE I NDD
Sbjct: 421  KKYESLSEKEAPPTELSAENNSSEQKQPSKSQMENNTEQNGKVISNSDKENTVEAIPNDD 480

Query: 481  NHSLPGNSHGLIFCKNPPLTSIEQATCCLRKRPRAIKQAHTGSINLTGAHYNTLSAYQSM 540
            NHSLPGNSHGLIFCKNPPLTSIEQATCCLRKRPRAIKQAHTGSINLTG HYNTLSAYQSM
Sbjct: 481  NHSLPGNSHGLIFCKNPPLTSIEQATCCLRKRPRAIKQAHTGSINLTGVHYNTLSAYQSM 540

Query: 541  SWMHFPHIYKKKRTEKGQNPVPSSAFTTATNFIRPESACSFNDPQRDHMVSKFNAWIAGP 600
            SWMHFPHIYKKKRTEKGQNPVPSSAFTTATNF RPESACSFNDPQRDHMVSKFNAWI+GP
Sbjct: 541  SWMHFPHIYKKKRTEKGQNPVPSSAFTTATNFTRPESACSFNDPQRDHMVSKFNAWISGP 600

Query: 601  QFNICKSKSVAGHGGNDLQDKLQTYGGIVGLDQTGRTKKKPRTAKRLSGLAPPERITHWE 660
            QFN+CKSK+VAGHGGNDLQDKLQTYGGIVGL QTGRTKKKPRTAKR+S LAPPERI+HWE
Sbjct: 601  QFNVCKSKTVAGHGGNDLQDKLQTYGGIVGLGQTGRTKKKPRTAKRVSSLAPPERISHWE 660

Query: 661  KQPIYPTNHPPPAGSAKNINTSGTCINGLFEMMHATVAKKKRTKKKPSNSALLNINKDLQ 720
            KQP YPTNHPPPAGSAKNINTSGTC+NGLFEMMHATVAKKKRTKKKPSNS LLNINKDL+
Sbjct: 661  KQPTYPTNHPPPAGSAKNINTSGTCVNGLFEMMHATVAKKKRTKKKPSNSTLLNINKDLE 720

Query: 721  DRRFVSFNPWQFFPKTLGTASEHGNQICFIDLLVEQLKHLDINKESNNLGYREQALVPYN 780
            DRRFVSF+PWQFFPKTLGTASEHGNQICFIDL+ EQLKHLDINKESNNLGYREQALVP+N
Sbjct: 721  DRRFVSFSPWQFFPKTLGTASEHGNQICFIDLIAEQLKHLDINKESNNLGYREQALVPFN 780

Query: 781  MQNQEHSAIVVYGRDGTIVPFNPIKKRRPRPKVELDEETGRVWKLLMGNINSKGIDGTDE 840
            MQNQE +AIVVYGRDGTIVPFNPIKKRRPRPKVELDEETGRVWKLLMGNINS GIDGTDE
Sbjct: 781  MQNQELNAIVVYGRDGTIVPFNPIKKRRPRPKVELDEETGRVWKLLMGNINSTGIDGTDE 840

Query: 841  EKIKWWEEERKVFQGRADSFIARMHLVQGDRRFSQWKGSVVDSVVGVFLTQNVSDHLSSS 900
            E IKWWEEERKVFQGRADSFIARMHLVQGDRRFSQWKGSVVDSVVGVFLTQNVSDHLSSS
Sbjct: 841  ENIKWWEEERKVFQGRADSFIARMHLVQGDRRFSQWKGSVVDSVVGVFLTQNVSDHLSSS 900

Query: 901  AFMSLAARFPPKPKCRQASCSQEPIIELDEPEEACVFNLEDSMKLNKQIIHQQISEEGSL 960
            AFMSLAARFPPKPKC QAS SQEPIIEL+EPEE C+FNLEDSMKLNKQIIHQQISEEGSL
Sbjct: 901  AFMSLAARFPPKPKCHQASSSQEPIIELNEPEEVCMFNLEDSMKLNKQIIHQQISEEGSL 960

Query: 961  MKDEMEKSEGRIIVDNNESSGSNAEDGSSNKEPEKKSFSSSHNILETCSNSVGEISLTET 1020
            MKDEMEKSEGRIIVDNNESSGSN EDGSSNKEPEKKSF SSHNILET SNSVGEISLTET
Sbjct: 961  MKDEMEKSEGRIIVDNNESSGSNVEDGSSNKEPEKKSFGSSHNILETFSNSVGEISLTET 1020

Query: 1021 SSMQACLSGEKETYDSFSFQDCLDSSIPQTSESIEPSSEGNSEDLPSWSTEAHIDSSSEE 1080
            SSMQAC SGEKETYDSFS QD LDSSIPQT+ES+EPSSEGNSEDLPSWS EAHIDSSSEE
Sbjct: 1021 SSMQACFSGEKETYDSFSSQDFLDSSIPQTNESMEPSSEGNSEDLPSWSAEAHIDSSSEE 1080

Query: 1081 LIQMTGPNTLNANFTTDTSVEQSENTTTNKLVEKKCDNRIDDTSQPDDPEISLKNSVYHL 1140
            LIQMTG NTLNANFT D SVE SENT TN LVE KCDNRIDDTSQPDDPEIS+KNSVYHL
Sbjct: 1081 LIQMTGLNTLNANFTIDISVEPSENTITNNLVENKCDNRIDDTSQPDDPEISIKNSVYHL 1140

Query: 1141 SDYQTQQNQTSKSLEVDCCQTSNGVQTSNDCQNKDEHFHTEQSTLTVEYDNHANVEMELI 1200
            S YQTQQNQTSKSL+VDCCQTSNGVQTSNDCQNKDE FHTEQSTLTVE DNHANVEMELI
Sbjct: 1141 SGYQTQQNQTSKSLDVDCCQTSNGVQTSNDCQNKDEQFHTEQSTLTVESDNHANVEMELI 1200

Query: 1201 VDIVEAPSSSSELSINAKEPGLTLQSQSSVIEDPQNVESPAECTNTVYEIPPNATEIATK 1260
            VDIVEAPSSSSELSINAK+PGLTLQSQSSVIEDPQNVESPAECTNTV+  PPNATEIATK
Sbjct: 1201 VDIVEAPSSSSELSINAKDPGLTLQSQSSVIEDPQNVESPAECTNTVHGSPPNATEIATK 1260

Query: 1261 PNPKECNLLSNEFKELKPASSRSQRKQVAKEKDNINWDNLRKQTETNGKTRQRTESTMDS 1320
            PNPKE NLLSNEFKELKPASSRSQ KQVAKEKD INWDNLRKQTETNGKTRQRTE+TMDS
Sbjct: 1261 PNPKEYNLLSNEFKELKPASSRSQSKQVAKEKDKINWDNLRKQTETNGKTRQRTENTMDS 1320

Query: 1321 LDWEAIRCADVNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEWLRDVEPDHAKE 1380
            LDWEA+RCADVNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEWLRDVEPD AKE
Sbjct: 1321 LDWEALRCADVNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEWLRDVEPDQAKE 1380

Query: 1381 YLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYP 1440
            YLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYP
Sbjct: 1381 YLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYP 1440

Query: 1441 VLESIQKYLWPRLCKLDQRTLYELHYQMITFGK-VFCTKSKPNCNACPMRGECRHFASAF 1500
            VLESIQKYLWPRLCKLDQRTLYELHYQMITFGK VFCTKSKPNCNACPMRGECRHFASAF
Sbjct: 1441 VLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVVFCTKSKPNCNACPMRGECRHFASAF 1500

Query: 1501 ASARLGLPAPEDKRIVSTTECREPDNNQARTIDQPMLSLPPSTISSEEIKPSESHQSDGK 1560
            ASARLGLPAPEDKRIVSTTECREPDNNQ RTIDQPMLSLPPSTISSEEIKPSESH+ DGK
Sbjct: 1501 ASARLGLPAPEDKRIVSTTECREPDNNQPRTIDQPMLSLPPSTISSEEIKPSESHECDGK 1560

Query: 1561 TTAGACVPIIEEPATPEQETTTQD-AIIDIEDGFYEDPDEIPTIKLNIEEFSQNLQNFVQ 1620
            TTAGACVPIIEEPATPEQET TQD  IIDIED FYEDPDEIPTIKLNIEEFSQNLQN+VQ
Sbjct: 1561 TTAGACVPIIEEPATPEQETATQDPRIIDIEDAFYEDPDEIPTIKLNIEEFSQNLQNYVQ 1620

Query: 1621 KNMELQEGDMSKALIALTPEAASIPTPKLKNVSRLRTEHQVYELPDNHPLLEKLKLDRRE 1680
            KNMELQEGDMSKALIALTPEAASIPTPKLKNVSRLRTEHQVYELPDNHPLLEKLKLDRRE
Sbjct: 1621 KNMELQEGDMSKALIALTPEAASIPTPKLKNVSRLRTEHQVYELPDNHPLLEKLKLDRRE 1680

Query: 1681 PDDPSSYLLAIWTPGETANSIELPEKRCSNQEHHQLCCEEECLSCNSVREANSFMVRGTL 1740
            PDDPSSYLLAIWTPGETANSI+LPEKRCSNQEHHQLCCEEECLSCNSVREANSFMVRGTL
Sbjct: 1681 PDDPSSYLLAIWTPGETANSIQLPEKRCSNQEHHQLCCEEECLSCNSVREANSFMVRGTL 1740

Query: 1741 LIPCRTAMRGSFPLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVYFGTSIPTIF 1800
            LIPCRTAMRGSFPLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVYFGTSIPTIF
Sbjct: 1741 LIPCRTAMRGSFPLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVYFGTSIPTIF 1800

Query: 1801 KGLS 1803
            KG S
Sbjct: 1801 KGSS 1803

BLAST of PI0016811 vs. NCBI nr
Match: XP_038904008.1 (DNA glycosylase/AP lyase ROS1-like [Benincasa hispida] >XP_038904009.1 DNA glycosylase/AP lyase ROS1-like [Benincasa hispida] >XP_038904010.1 DNA glycosylase/AP lyase ROS1-like [Benincasa hispida] >XP_038904011.1 DNA glycosylase/AP lyase ROS1-like [Benincasa hispida] >XP_038904012.1 DNA glycosylase/AP lyase ROS1-like [Benincasa hispida] >XP_038904013.1 DNA glycosylase/AP lyase ROS1-like [Benincasa hispida] >XP_038904014.1 DNA glycosylase/AP lyase ROS1-like [Benincasa hispida])

HSP 1 Score: 3261.5 bits (8455), Expect = 0.0e+00
Identity = 1660/1854 (89.54%), Positives = 1741/1854 (93.91%), Query Frame = 0

Query: 1    MDSGQPEGNKADVQGGSWIPATPMKPILPKPPLQPLIYARMDRNQPRPYWLGSERLFSNS 60
            M+ GQPEGNKADVQGGSWIPATPMKPILPKPPLQPLIYARMDRNQPRP WLGSERLFSNS
Sbjct: 1    MNFGQPEGNKADVQGGSWIPATPMKPILPKPPLQPLIYARMDRNQPRPCWLGSERLFSNS 60

Query: 61   NKEAETSSGVACYGGANFMTANGSNDWEAAQARQFQVARNDNGTVTIHSMDALGGIPFLQ 120
            NKE ETSS VACYGGAN M A+GS++W AA+A QFQVA N+NGTV IHSMDALGGIPFLQ
Sbjct: 61   NKEVETSSRVACYGGANSMGADGSSEWAAARAGQFQVACNENGTVGIHSMDALGGIPFLQ 120

Query: 121  LMALADAASIVGADAALGGNASDLFDSGSSYQIELESSSMKDRLSGSCIPEAKEYGTSDH 180
            LMALADAASIVGADAALGGNASDLFDSGSSYQIELESSS KDRLSGSCIPEA EYG SDH
Sbjct: 121  LMALADAASIVGADAALGGNASDLFDSGSSYQIELESSSTKDRLSGSCIPEATEYGISDH 180

Query: 181  GSQHAYDLNFPSGTESDAAGIRVTSQFAPLTPDMGKCKYTERGMELQQIPIENSQDERER 240
            G QH YDLNFPSGTES AA IR+TSQFAPLTPDMGK KYTER  E+QQIP EN QDERE+
Sbjct: 181  GGQHTYDLNFPSGTESHAAAIRITSQFAPLTPDMGKIKYTERDTEVQQIPTENKQDEREQ 240

Query: 241  NHNCNTSITVDGENLRQNQELLEPAMHSTI--NCTPDGKEGKNDGDLNKTPASRQRRRKH 300
            NHNCNTSI +DGENL++N+ELLEPAMHSTI   CTPDGKEGKNDGDLNKTPASRQRRRKH
Sbjct: 241  NHNCNTSIIIDGENLKENKELLEPAMHSTITATCTPDGKEGKNDGDLNKTPASRQRRRKH 300

Query: 301  RPKVIVEGKTNRTKQILKTPSSNQSVRKRVRKSGLTKPSATPPIEVTGETSEQEMVKHRR 360
            RPKVI+EGKT RTK  LKTPSSN S+RKRVRKSG++KPSATPPIEV GETS+QEM+KHRR
Sbjct: 301  RPKVIIEGKTKRTKPNLKTPSSNPSMRKRVRKSGVSKPSATPPIEVIGETSDQEMLKHRR 360

Query: 361  KSCRRAINFDSQAQTRDGSLDSGPLEQGSLTQNIQSTTGLEEARLEEVGSSTDPNWSMNQ 420
            KSCRRAINFD+QAQTRDG+ +SGPLEQGSLTQNIQSTTGLEE RLEEVGSSTDPNWSMNQ
Sbjct: 361  KSCRRAINFDTQAQTRDGTFESGPLEQGSLTQNIQSTTGLEEVRLEEVGSSTDPNWSMNQ 420

Query: 421  MPKKYESLSEKQAPPTKLSAENNSSERKQPSKSQMENNIEQNGKIISNSDKENTVETILN 480
            M K+YES+SEKQA  T+LSAE+NSSERKQPSK+QMENN EQ GK+ISNS+K N VET+LN
Sbjct: 421  MLKRYESVSEKQALTTELSAEHNSSERKQPSKTQMENNTEQIGKVISNSEKGNVVETMLN 480

Query: 481  DDNHSLPGNSHGLIFCKNPPLTSIEQATCCLRKRPRAIKQAHTGSINLTGAHYNTLSAYQ 540
            +DN SLPG+SHGLIFCKNP +TS EQATCCLRKR RAIKQAHTGSINLTGAHYNTLSAYQ
Sbjct: 481  NDNRSLPGSSHGLIFCKNPTMTSREQATCCLRKRSRAIKQAHTGSINLTGAHYNTLSAYQ 540

Query: 541  SMSWMHFPHIYKKKRTEKGQNPVPSSAFTTATNFIRPESACSFNDPQRDHMVSKFNAWIA 600
            SMSWMHFPHIYKKKRTEKGQNPV SSAFTTAT+F+RPESACSFNDPQR++MVSK N WIA
Sbjct: 541  SMSWMHFPHIYKKKRTEKGQNPVSSSAFTTATHFMRPESACSFNDPQRNYMVSKSNNWIA 600

Query: 601  GPQFNICKSKSVAGHGGNDLQDKLQTYGGIVGLDQTGRTKKKPRTAKRLSGLAPPERITH 660
            GPQFNICKS++VAGHGGN +QDKLQTYGGI+ L QT +T KKPRTAKRLSGLAP ERI H
Sbjct: 601  GPQFNICKSRTVAGHGGNGVQDKLQTYGGIMALGQTEKTIKKPRTAKRLSGLAPSERIGH 660

Query: 661  WEKQPIYPTNHPPPAGSAKNINTSGTCINGLFEMMHATVAKKKRTKKKPSNSALLNINKD 720
             EKQPIYPTNHPP A SAKNINTSGTCINGLFEMMHATVAKKKRTKKKPSNSALLN+NKD
Sbjct: 661  CEKQPIYPTNHPPLASSAKNINTSGTCINGLFEMMHATVAKKKRTKKKPSNSALLNVNKD 720

Query: 721  LQDRRFVSFNPWQFFPKTLGTASEHGNQICFIDLLVEQLKHLDINKESNNLGYREQALVP 780
            LQDRRFVSF+  QFF KTLGTA EH NQ+CFIDL+VEQLKHLDINKESN+LGYREQALV 
Sbjct: 721  LQDRRFVSFHSHQFFLKTLGTAPEHVNQMCFIDLIVEQLKHLDINKESNHLGYREQALVS 780

Query: 781  YNMQNQEHSAIVVYGRDGTIVPFNPIKKRRPRPKVELDEETGRVWKLLMGNINSKGIDGT 840
            YN+QNQE +AIVVYGRDGTIVPFNPIKKRRPRPKVELDEETGRVWKLLMGNINS+GIDGT
Sbjct: 781  YNIQNQEQNAIVVYGRDGTIVPFNPIKKRRPRPKVELDEETGRVWKLLMGNINSEGIDGT 840

Query: 841  DEEKIKWWEEERKVFQGRADSFIARMHLVQGDRRFSQWKGSVVDSVVGVFLTQNVSDHLS 900
            DEEKIKWWEEERKVF+GRADSFIARMHLVQGDRRFSQWKGSVVDSVVGVFLTQNVSDHLS
Sbjct: 841  DEEKIKWWEEERKVFRGRADSFIARMHLVQGDRRFSQWKGSVVDSVVGVFLTQNVSDHLS 900

Query: 901  SSAFMSLAARFPPKPKCRQASCSQEPIIELDEPEEACVFNLEDSMKLNKQIIHQQISEEG 960
            SSAFMSLAARFPPKPKC QASCSQEPIIELDEPEE C+ NLE+ M LNKQI+HQQISEEG
Sbjct: 901  SSAFMSLAARFPPKPKCHQASCSQEPIIELDEPEE-CMLNLENGMNLNKQILHQQISEEG 960

Query: 961  SLMKDEMEKSEGRIIVDNNESSGSNAEDGSSNKEPEKKSFSSSHNILETCSNSVGEISLT 1020
            S+ K+EM KSEGRIIVDNNESSGSN EDGSSNK PEK S+SSSHNILETCSNSVGEISLT
Sbjct: 961  SMKKNEMRKSEGRIIVDNNESSGSNVEDGSSNKGPEKISYSSSHNILETCSNSVGEISLT 1020

Query: 1021 ETSSMQACLSGEKETYDSFSFQDCLDSSIPQTSESIEPSSEGNSEDLPSWSTEAHIDSSS 1080
             TS MQACL GEKET DSFS QDCLD SIPQTSESIEPSSEGNSEDLPS STEAHID SS
Sbjct: 1021 GTSPMQACLYGEKETVDSFSCQDCLDLSIPQTSESIEPSSEGNSEDLPSCSTEAHID-SS 1080

Query: 1081 EELIQMTGPNTLNANFTTDTSVEQSENTTTNKLVEKKCDNRIDDTSQPDDPEISLKNSVY 1140
            EELIQM   NTLNAN+T DTSV+QSENTTTNKL E KCD RIDDT QPDDPEISLK+S++
Sbjct: 1081 EELIQMARLNTLNANYTIDTSVDQSENTTTNKLAE-KCDGRIDDTFQPDDPEISLKDSIH 1140

Query: 1141 HLSDYQTQQNQTSKSLEVDCCQTSNGVQTSNDCQNKDEHFHTEQSTLTVEYDNHANVEME 1200
            HLS YQ QQNQTSKSLEVDCCQT NGVQTSNDCQNKDEHFHTEQSTLTVE DNH NVE+E
Sbjct: 1141 HLSGYQMQQNQTSKSLEVDCCQTCNGVQTSNDCQNKDEHFHTEQSTLTVESDNHYNVEIE 1200

Query: 1201 LIVDIVEAPSSSSELSINAKEPGLTLQSQSSVIEDPQNVESPAECTNTVYEIPPNATEIA 1260
            L+VDIVEAPS+SSELSINAKEP LTLQSQ SVIEDPQNVESP ECTN V+EIPPNATE+A
Sbjct: 1201 LVVDIVEAPSTSSELSINAKEPDLTLQSQGSVIEDPQNVESPVECTNNVHEIPPNATEMA 1260

Query: 1261 TKPNPKECNLLSNEFKELKPASSRSQRKQVAKEKD-NINWDNLRKQTETNGKTRQRTEST 1320
             + NPKE + LSNEFKE+ PASSRSQRKQVAKEK+ NINWDNLRKQTETNGKTRQRTE+T
Sbjct: 1261 IQSNPKEYDQLSNEFKEMNPASSRSQRKQVAKEKENNINWDNLRKQTETNGKTRQRTENT 1320

Query: 1321 MDSLDWEAIRCADVNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEWLRDVEPDH 1380
            MDSLDWEAIRCADVNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEWLRDVEPD 
Sbjct: 1321 MDSLDWEAIRCADVNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEWLRDVEPDQ 1380

Query: 1381 AKEYLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLE 1440
            AKEYLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLE
Sbjct: 1381 AKEYLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLE 1440

Query: 1441 LYPVLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFAS 1500
            LYPVLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFAS
Sbjct: 1441 LYPVLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFAS 1500

Query: 1501 AFASARLGLPAPEDKRIVSTTECREPDNNQARTIDQPMLSLPPSTISSEEIKPSESHQSD 1560
            AFASARLGLPAPEDKRIVSTTECREPD+NQARTIDQPMLSLPPSTISSEEIKPSE+HQSD
Sbjct: 1501 AFASARLGLPAPEDKRIVSTTECREPDDNQARTIDQPMLSLPPSTISSEEIKPSETHQSD 1560

Query: 1561 GKTTAGACVPIIEEPATPEQETTTQDAIIDIEDGFYEDPDEIPTIKLNIEEFSQNLQNFV 1620
            GKTT   CVPIIEEPATPEQE+TTQDAIIDIED FYEDPDEIPTIKLNIEEFSQNLQN+V
Sbjct: 1561 GKTTGSTCVPIIEEPATPEQESTTQDAIIDIEDAFYEDPDEIPTIKLNIEEFSQNLQNYV 1620

Query: 1621 QKNMELQEGDMSKALIALTPEAASIPTPKLKNVSRLRTEHQVYELPDNHPLLEKLKLDRR 1680
            QKNMELQEGDMSKALIALTPEAASIP PKLKNVSRLRTEHQVYELPD+HPLLEKLKLDRR
Sbjct: 1621 QKNMELQEGDMSKALIALTPEAASIPMPKLKNVSRLRTEHQVYELPDSHPLLEKLKLDRR 1680

Query: 1681 EPDDPSSYLLAIWTPGETANSIELPEKRCSNQEHHQLCCEEECLSCNSVREANSFMVRGT 1740
            EPDDPSSYLLAIWTPGETANSI+LPE+RC NQEHHQLC EEECLSCNSVREANSFMVRGT
Sbjct: 1681 EPDDPSSYLLAIWTPGETANSIQLPERRC-NQEHHQLCHEEECLSCNSVREANSFMVRGT 1740

Query: 1741 LLIPCRTAMRGSFPLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVYFGTSIPTI 1800
            +LIPCRTAMRGSFPLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVYFGTSIPTI
Sbjct: 1741 ILIPCRTAMRGSFPLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVYFGTSIPTI 1800

Query: 1801 FKGLSTQGIQHCFWRGFVCVRGFDQKTRAPRPLMARLHFPASKLNRGRGKTEDQ 1852
            FKGLSTQGIQHCFWRGFVCVRGFDQKTRAPRPLMARLHFPASKLNRGRGKT+DQ
Sbjct: 1801 FKGLSTQGIQHCFWRGFVCVRGFDQKTRAPRPLMARLHFPASKLNRGRGKTDDQ 1850

BLAST of PI0016811 vs. TAIR 10
Match: AT5G04560.1 (HhH-GPD base excision DNA repair family protein)

HSP 1 Score: 941.4 bits (2432), Expect = 1.1e-273
Identity = 564/1132 (49.82%), Positives = 714/1132 (63.07%), Query Frame = 0

Query: 775  ALVPYNMQN---------QEHSAIVVYGRDGTIVPFNPIKKRRPRPKVELDEETGRVWKL 834
            A + Y MQN         QE +A+V+Y  DG +VP+   KKR+PRPKV++D+ET R+W L
Sbjct: 652  AEIIYRMQNLYLGDKEREQEQNAMVLYKGDGALVPYES-KKRKPRPKVDIDDETTRIWNL 711

Query: 835  LMGNINSK-GIDGTDEEKIKWWEEERKVFQGRADSFIARMHLVQGDRRFSQWKGSVVDSV 894
            LMG  + K G +  D++K KWWEEER+VF+GRADSFIARMHLVQGDRRFS WKGSVVDSV
Sbjct: 712  LMGKGDEKEGDEEKDKKKEKWWEEERRVFRGRADSFIARMHLVQGDRRFSPWKGSVVDSV 771

Query: 895  VGVFLTQNVSDHLSSSAFMSLAARFPPKPKCRQASCSQEPIIELDEPEEACVFNLEDSMK 954
            +GVFLTQNVSDHLSSSAFMSLAARFPPK    +        + +++P E C+ NL +   
Sbjct: 772  IGVFLTQNVSDHLSSSAFMSLAARFPPKLSSSREDERNVRSVVVEDP-EGCILNLNEIPS 831

Query: 955  LNKQIIHQQISEEGSLMKDEMEKSEGRIIVDNNESSGSNAEDGSSNKEPEKKSFSSSHN- 1014
              +++ H    E   +  D   K + R   ++     +  E    N E E  S   S + 
Sbjct: 832  WQEKVQHPSDMEVSGV--DSGSKEQLRDCSNSGIERFNFLEKSIQNLEEEVLSSQDSFDP 891

Query: 1015 -ILETCSNSVGEISLTETSSMQACLSGEKETYDSFSFQDCLDSSIPQTSESIEPSSEGNS 1074
             I ++C   VG  S +++ +              F    C   ++  TS+S++  S    
Sbjct: 892  AIFQSCGR-VGSCSCSKSDA-------------EFPTTRCETKTVSGTSQSVQTGS---- 951

Query: 1075 EDLPSWSTEAHIDSSSEELIQMTGPNTLNANFTTDTSVEQSENTTTNKLVEKKCDNR-ID 1134
               P+ S E  +  +    +   G   +    TT+ + ++ +   T    +  C  +  +
Sbjct: 952  ---PNLSDEICLQGNERPHL-YEGSGDVQKQETTNVAQKKPDLEKTMNWKDSVCFGQPRN 1011

Query: 1135 DTSQPDDPEISLKN------SVYHLSDYQTQ-----QNQTSKSLEVDCCQTSN------- 1194
            DT+    P  S +        V  + D+  Q      +  S S  VD  +  N       
Sbjct: 1012 DTNWQTTPSSSYEQCATRQPHVLDIEDFGMQGEGLGYSWMSISPRVDRVKNKNVPRRFFR 1071

Query: 1195 ------------------------GVQTSNDC--QNKDEHFHTEQSTLTVEYDNHANVEM 1254
                                    G+  S+    +++D+  H +Q  +     N A+   
Sbjct: 1072 QGGSVPREFTGQIIPSTPHELPGMGLSGSSSAVQEHQDDTQHNQQDEM-----NKASHLQ 1131

Query: 1255 ELIVDIVEAPSSSSELSINAKEPGLTLQSQSSVIEDPQNVESPAECTNTVYEIPPNAT-- 1314
            +  +D+           +N+ E  LT QS +        +       + V  +  N++  
Sbjct: 1132 KTFLDL-----------LNSSEECLTRQSSTKQNITDGCLPRDRTAEDVVDPLSNNSSLQ 1191

Query: 1315 EIATKPNPKECNLLSNEFKELKPASSRSQRKQVAK-EKDNINWDNLRKQTETNGKTRQRT 1374
             I  + N       + E+KE      R  +  +A  +K    WD+LRK  E N   ++R 
Sbjct: 1192 NILVESNSSNKEQTAVEYKETNATILREMKGTLADGKKPTSQWDSLRKDVEGNEGRQERN 1251

Query: 1375 ESTMDSLDWEAIRCADVNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEWLRDVE 1434
            ++ MDS+D+EAIR A ++EI+ AI+ERGMNNMLA RIKDFL R+VKDHG IDLEWLR+  
Sbjct: 1252 KNNMDSIDYEAIRRASISEISEAIKERGMNNMLAVRIKDFLERIVKDHGGIDLEWLRESP 1311

Query: 1435 PDHAKEYLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLH 1494
            PD AK+YLLSIRGLGLKSVECVRLLTLH+LAFPVDTNVGRIAVR+GWVPLQPLPESLQLH
Sbjct: 1312 PDKAKDYLLSIRGLGLKSVECVRLLTLHNLAFPVDTNVGRIAVRMGWVPLQPLPESLQLH 1371

Query: 1495 LLELYPVLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRH 1554
            LLELYPVLESIQK+LWPRLCKLDQRTLYELHYQ+ITFGKVFCTKS+PNCNACPMRGECRH
Sbjct: 1372 LLELYPVLESIQKFLWPRLCKLDQRTLYELHYQLITFGKVFCTKSRPNCNACPMRGECRH 1431

Query: 1555 FASAFASARLGLPAPEDKRIVSTTECREPDNNQARTIDQPMLSLPPSTISSEEIKPSESH 1614
            FASA+ASARL LPAPE++ + S T    P++     I  PM+ LP        +   +S 
Sbjct: 1432 FASAYASARLALPAPEERSLTSATIPVPPESYPPVAI--PMIELP--------LPLEKSL 1491

Query: 1615 QSDGKTTAGACVPIIEEPATPEQETTTQDAIIDIEDGFY-EDPDEIPTIKLNIEEFSQNL 1674
             S   +    C PIIEEPA+P QE  T+    DIED +Y EDPDEIPTIKLNIE+F   L
Sbjct: 1492 ASGAPSNRENCEPIIEEPASPGQE-CTEITESDIEDAYYNEDPDEIPTIKLNIEQFGMTL 1551

Query: 1675 QNFVQKNMELQEGDMSKALIALTPEAASIPTPKLKNVSRLRTEHQVYELPDNHPLLEKLK 1734
            +  +++NMELQEGDMSKAL+AL P   SIPTPKLKN+SRLRTEHQVYELPD+H LL+   
Sbjct: 1552 REHMERNMELQEGDMSKALVALHPTTTSIPTPKLKNISRLRTEHQVYELPDSHRLLD--G 1611

Query: 1735 LDRREPDDPSSYLLAIWTPGETANSIELPEKRCSNQEHHQLCCEEECLSCNSVREANSFM 1794
            +D+REPDDPS YLLAIWTPGETANS + PE++C  +   ++C +E C  CNS+REANS  
Sbjct: 1612 MDKREPDDPSPYLLAIWTPGETANSAQPPEQKCGGKASGKMCFDETCSECNSLREANSQT 1671

Query: 1795 VRGTLLIPCRTAMRGSFPLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVYFGTS 1846
            VRGTLLIPCRTAMRGSFPLNGTYFQVNE+FADHESSL PIDVPRDWIW+LPRRTVYFGTS
Sbjct: 1672 VRGTLLIPCRTAMRGSFPLNGTYFQVNELFADHESSLKPIDVPRDWIWDLPRRTVYFGTS 1728

BLAST of PI0016811 vs. TAIR 10
Match: AT5G04560.2 (HhH-GPD base excision DNA repair family protein)

HSP 1 Score: 941.4 bits (2432), Expect = 1.1e-273
Identity = 564/1132 (49.82%), Positives = 714/1132 (63.07%), Query Frame = 0

Query: 775  ALVPYNMQN---------QEHSAIVVYGRDGTIVPFNPIKKRRPRPKVELDEETGRVWKL 834
            A + Y MQN         QE +A+V+Y  DG +VP+   KKR+PRPKV++D+ET R+W L
Sbjct: 910  AEIIYRMQNLYLGDKEREQEQNAMVLYKGDGALVPYES-KKRKPRPKVDIDDETTRIWNL 969

Query: 835  LMGNINSK-GIDGTDEEKIKWWEEERKVFQGRADSFIARMHLVQGDRRFSQWKGSVVDSV 894
            LMG  + K G +  D++K KWWEEER+VF+GRADSFIARMHLVQGDRRFS WKGSVVDSV
Sbjct: 970  LMGKGDEKEGDEEKDKKKEKWWEEERRVFRGRADSFIARMHLVQGDRRFSPWKGSVVDSV 1029

Query: 895  VGVFLTQNVSDHLSSSAFMSLAARFPPKPKCRQASCSQEPIIELDEPEEACVFNLEDSMK 954
            +GVFLTQNVSDHLSSSAFMSLAARFPPK    +        + +++P E C+ NL +   
Sbjct: 1030 IGVFLTQNVSDHLSSSAFMSLAARFPPKLSSSREDERNVRSVVVEDP-EGCILNLNEIPS 1089

Query: 955  LNKQIIHQQISEEGSLMKDEMEKSEGRIIVDNNESSGSNAEDGSSNKEPEKKSFSSSHN- 1014
              +++ H    E   +  D   K + R   ++     +  E    N E E  S   S + 
Sbjct: 1090 WQEKVQHPSDMEVSGV--DSGSKEQLRDCSNSGIERFNFLEKSIQNLEEEVLSSQDSFDP 1149

Query: 1015 -ILETCSNSVGEISLTETSSMQACLSGEKETYDSFSFQDCLDSSIPQTSESIEPSSEGNS 1074
             I ++C   VG  S +++ +              F    C   ++  TS+S++  S    
Sbjct: 1150 AIFQSCGR-VGSCSCSKSDA-------------EFPTTRCETKTVSGTSQSVQTGS---- 1209

Query: 1075 EDLPSWSTEAHIDSSSEELIQMTGPNTLNANFTTDTSVEQSENTTTNKLVEKKCDNR-ID 1134
               P+ S E  +  +    +   G   +    TT+ + ++ +   T    +  C  +  +
Sbjct: 1210 ---PNLSDEICLQGNERPHL-YEGSGDVQKQETTNVAQKKPDLEKTMNWKDSVCFGQPRN 1269

Query: 1135 DTSQPDDPEISLKN------SVYHLSDYQTQ-----QNQTSKSLEVDCCQTSN------- 1194
            DT+    P  S +        V  + D+  Q      +  S S  VD  +  N       
Sbjct: 1270 DTNWQTTPSSSYEQCATRQPHVLDIEDFGMQGEGLGYSWMSISPRVDRVKNKNVPRRFFR 1329

Query: 1195 ------------------------GVQTSNDC--QNKDEHFHTEQSTLTVEYDNHANVEM 1254
                                    G+  S+    +++D+  H +Q  +     N A+   
Sbjct: 1330 QGGSVPREFTGQIIPSTPHELPGMGLSGSSSAVQEHQDDTQHNQQDEM-----NKASHLQ 1389

Query: 1255 ELIVDIVEAPSSSSELSINAKEPGLTLQSQSSVIEDPQNVESPAECTNTVYEIPPNAT-- 1314
            +  +D+           +N+ E  LT QS +        +       + V  +  N++  
Sbjct: 1390 KTFLDL-----------LNSSEECLTRQSSTKQNITDGCLPRDRTAEDVVDPLSNNSSLQ 1449

Query: 1315 EIATKPNPKECNLLSNEFKELKPASSRSQRKQVAK-EKDNINWDNLRKQTETNGKTRQRT 1374
             I  + N       + E+KE      R  +  +A  +K    WD+LRK  E N   ++R 
Sbjct: 1450 NILVESNSSNKEQTAVEYKETNATILREMKGTLADGKKPTSQWDSLRKDVEGNEGRQERN 1509

Query: 1375 ESTMDSLDWEAIRCADVNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEWLRDVE 1434
            ++ MDS+D+EAIR A ++EI+ AI+ERGMNNMLA RIKDFL R+VKDHG IDLEWLR+  
Sbjct: 1510 KNNMDSIDYEAIRRASISEISEAIKERGMNNMLAVRIKDFLERIVKDHGGIDLEWLRESP 1569

Query: 1435 PDHAKEYLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLH 1494
            PD AK+YLLSIRGLGLKSVECVRLLTLH+LAFPVDTNVGRIAVR+GWVPLQPLPESLQLH
Sbjct: 1570 PDKAKDYLLSIRGLGLKSVECVRLLTLHNLAFPVDTNVGRIAVRMGWVPLQPLPESLQLH 1629

Query: 1495 LLELYPVLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRH 1554
            LLELYPVLESIQK+LWPRLCKLDQRTLYELHYQ+ITFGKVFCTKS+PNCNACPMRGECRH
Sbjct: 1630 LLELYPVLESIQKFLWPRLCKLDQRTLYELHYQLITFGKVFCTKSRPNCNACPMRGECRH 1689

Query: 1555 FASAFASARLGLPAPEDKRIVSTTECREPDNNQARTIDQPMLSLPPSTISSEEIKPSESH 1614
            FASA+ASARL LPAPE++ + S T    P++     I  PM+ LP        +   +S 
Sbjct: 1690 FASAYASARLALPAPEERSLTSATIPVPPESYPPVAI--PMIELP--------LPLEKSL 1749

Query: 1615 QSDGKTTAGACVPIIEEPATPEQETTTQDAIIDIEDGFY-EDPDEIPTIKLNIEEFSQNL 1674
             S   +    C PIIEEPA+P QE  T+    DIED +Y EDPDEIPTIKLNIE+F   L
Sbjct: 1750 ASGAPSNRENCEPIIEEPASPGQE-CTEITESDIEDAYYNEDPDEIPTIKLNIEQFGMTL 1809

Query: 1675 QNFVQKNMELQEGDMSKALIALTPEAASIPTPKLKNVSRLRTEHQVYELPDNHPLLEKLK 1734
            +  +++NMELQEGDMSKAL+AL P   SIPTPKLKN+SRLRTEHQVYELPD+H LL+   
Sbjct: 1810 REHMERNMELQEGDMSKALVALHPTTTSIPTPKLKNISRLRTEHQVYELPDSHRLLD--G 1869

Query: 1735 LDRREPDDPSSYLLAIWTPGETANSIELPEKRCSNQEHHQLCCEEECLSCNSVREANSFM 1794
            +D+REPDDPS YLLAIWTPGETANS + PE++C  +   ++C +E C  CNS+REANS  
Sbjct: 1870 MDKREPDDPSPYLLAIWTPGETANSAQPPEQKCGGKASGKMCFDETCSECNSLREANSQT 1929

Query: 1795 VRGTLLIPCRTAMRGSFPLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVYFGTS 1846
            VRGTLLIPCRTAMRGSFPLNGTYFQVNE+FADHESSL PIDVPRDWIW+LPRRTVYFGTS
Sbjct: 1930 VRGTLLIPCRTAMRGSFPLNGTYFQVNELFADHESSLKPIDVPRDWIWDLPRRTVYFGTS 1986

BLAST of PI0016811 vs. TAIR 10
Match: AT2G36490.1 (demeter-like 1 )

HSP 1 Score: 937.2 bits (2421), Expect = 2.0e-272
Identity = 565/1171 (48.25%), Positives = 707/1171 (60.38%), Query Frame = 0

Query: 681  TSGTCI-----NGLFEMMHATVAKKKRTKKKPSNSALLNINKDLQDRRFVSFNPWQFFPK 740
            TSG C      N +      TV+KKK TK + S +   N+  +L           +F P 
Sbjct: 409  TSGYCSKPQQNNKILVDTRVTVSKKKPTKSEKSQTKQKNLLPNL----------CRFPPS 468

Query: 741  TLG-TASEHGNQICFIDLLVEQLKHLDINKESNNLGYREQALVPYNMQNQEHSAIVVYGR 800
              G +  E   +   I+ + E L+ LDIN+E     + E ALVPY M +Q    IV++G 
Sbjct: 469  FTGLSPDELWKRRNSIETISELLRLLDINRE-----HSETALVPYTMNSQ----IVLFGG 528

Query: 801  D-GTIVPFNPIKKRRPRPKVELDEETGRVWKLLMGNINSKGIDGTDEEKIKWWEEERKVF 860
              G IVP  P+KK RPRPKV+LD+ET RVWKLL+ NINS+G+DG+DE+K KWWEEER VF
Sbjct: 529  GAGAIVPVTPVKKPRPRPKVDLDDETDRVWKLLLENINSEGVDGSDEQKAKWWEEERNVF 588

Query: 861  QGRADSFIARMHLVQGDRRFSQWKGSVVDSVVGVFLTQNVSDHLSSSAFMSLAARFPPK- 920
            +GRADSFIARMHLVQGDRRF+ WKGSVVDSVVGVFLTQNVSDHLSSSAFMSLA++FP   
Sbjct: 589  RGRADSFIARMHLVQGDRRFTPWKGSVVDSVVGVFLTQNVSDHLSSSAFMSLASQFPVPF 648

Query: 921  -PKCR-QASCSQEPIIELDEPEEACVFNLEDSMKLNKQIIHQQISEEGSLMKDEMEKSEG 980
             P     A  S  P I++         + E++M       H  ++ + +      +  E 
Sbjct: 649  VPSSNFDAGTSSMPSIQI------TYLDSEETMSSPPDHNHSSVTLKNT------QPDEE 708

Query: 981  RIIVDNNESSGSNAEDGSSNKEPEKKSFSSSHNILETCSNSVGEISLTETSSMQACLSGE 1040
            +  V +NE+S S++E   S  E   K+  S   +      S  E+  T+       L   
Sbjct: 709  KDYVPSNETSRSSSEIAISAHESVDKTTDSKEYVDSDRKGSSVEVDKTDEKCRVLNLFPS 768

Query: 1041 KETYDSFSFQDCLDSSIPQTSESIEPSSEGNSEDLPSWSTEAHIDSSSEELIQMTGPNTL 1100
            +++  + + Q  + S  PQ                                      NT 
Sbjct: 769  EDS--ALTCQHSMVSDAPQ--------------------------------------NTE 828

Query: 1101 NANFTTDTSVEQSENTTTNKLVEKKCDNRIDDTSQPDDPEISLKNSVYHLSDYQTQQNQT 1160
             A  +++  +E    T+  KL++                ++SL++S           NQ 
Sbjct: 829  RAGSSSEIDLEGEYRTSFMKLLQ--------------GVQVSLEDS-----------NQV 888

Query: 1161 SKSLEVDCCQTSNGVQTSNDCQNKDEHFHTEQSTLTVEYDNHANVEMELIVDIVEAPSSS 1220
            S ++            +  DC ++ + F +                       ++ P+ S
Sbjct: 889  SPNM------------SPGDCSSEIKGFQS-----------------------MKEPTKS 948

Query: 1221 SELSINAKEPGLTLQSQSSVIEDPQNVESPAECTNTVYEIPPNATEIATKPNPKECNLLS 1280
               S+++ EPG   Q    V+                                  C    
Sbjct: 949  ---SVDSSEPGCCSQQDGDVL---------------------------------SCQ--- 1008

Query: 1281 NEFKELKPASSRSQRKQVAKEKDNINWDNLRKQTETNGKTRQRTESTMDSLDWEAIRCAD 1340
                  KP      +K + +EK   +WD LR++ +     R++T STMD++DW+AIR AD
Sbjct: 1009 ------KPTLKEKGKKVLKEEKKAFDWDCLRREAQARAGIREKTRSTMDTVDWKAIRAAD 1068

Query: 1341 VNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEWLRDVEPDHAKEYLLSIRGLGL 1400
            V E+A  I+ RGMN+ LAERI+ FL+RLV DHGSIDLEWLRDV PD AKEYLLS  GLGL
Sbjct: 1069 VKEVAETIKSRGMNHKLAERIQGFLDRLVNDHGSIDLEWLRDVPPDKAKEYLLSFNGLGL 1128

Query: 1401 KSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYPVLESIQKYLW 1460
            KSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLE+YP+LESIQKYLW
Sbjct: 1129 KSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLEMYPMLESIQKYLW 1188

Query: 1461 PRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFASARLGLPAPE 1520
            PRLCKLDQ+TLYELHYQMITFGKVFCTKSKPNCNACPM+GECRHFASAFASARL LP+ E
Sbjct: 1189 PRLCKLDQKTLYELHYQMITFGKVFCTKSKPNCNACPMKGECRHFASAFASARLALPSTE 1248

Query: 1521 DKRIVSTTECREPDNNQARTIDQPMLSLPPSTISSEEIKPSESHQSDGKTTAGACVPIIE 1580
                        PD N       P+    P     E+      H    K     C PIIE
Sbjct: 1249 -------KGMGTPDKN-------PLPLHLPEPFQREQGSEVVQHSEPAKKVT-CCEPIIE 1308

Query: 1581 EPATPEQETTTQDAIIDIEDGFYEDPDEIPTIKLNIEEFSQNLQNFVQKNMELQEGDMSK 1640
            EPA+PE E T + +I DIE+ F+EDP+EIPTI+LN++ F+ NL+  ++ N ELQ+G+MS 
Sbjct: 1309 EPASPEPE-TAEVSIADIEEAFFEDPEEIPTIRLNMDAFTSNLKKIMEHNKELQDGNMSS 1368

Query: 1641 ALIALTPEAASIPTPKLKNVSRLRTEHQVYELPDNHPLLEKLKLDRREPDDPSSYLLAIW 1700
            AL+ALT E AS+P PKLKN+S+LRTEH+VYELPD HPLL   +L++REPDDP SYLLAIW
Sbjct: 1369 ALVALTAETASLPMPKLKNISQLRTEHRVYELPDEHPLL--AQLEKREPDDPCSYLLAIW 1385

Query: 1701 TPGETANSIELPEKRCSNQEHHQLCCEEECLSCNSVREANSFMVRGTLLIPCRTAMRGSF 1760
            TPGETA+SI+     C  Q +  LC EE C SCNS++E  S +VRGT+LIPCRTAMRGSF
Sbjct: 1429 TPGETADSIQPSVSTCIFQANGMLCDEETCFSCNSIKETRSQIVRGTILIPCRTAMRGSF 1385

Query: 1761 PLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVYFGTSIPTIFKGLSTQGIQHCF 1820
            PLNGTYFQVNEVFADH SSLNPI+VPR+ IW LPRRTVYFGTS+PTIFKGLST+ IQ CF
Sbjct: 1489 PLNGTYFQVNEVFADHASSLNPINVPRELIWELPRRTVYFGTSVPTIFKGLSTEKIQACF 1385

Query: 1821 WRGFVCVRGFDQKTRAPRPLMARLHFPASKL 1842
            W+G+VCVRGFD+KTR P+PL+ARLHFPASKL
Sbjct: 1549 WKGYVCVRGFDRKTRGPKPLIARLHFPASKL 1385

BLAST of PI0016811 vs. TAIR 10
Match: AT3G10010.1 (demeter-like 2 )

HSP 1 Score: 731.9 bits (1888), Expect = 1.3e-210
Identity = 483/1164 (41.49%), Positives = 626/1164 (53.78%), Query Frame = 0

Query: 702  RTKKKPSNSALLNINKDLQDRRFVSFNPWQFFPKTLGTASEHGNQICFIDLLVEQLKHLD 761
            R K+   N      N  + D ++   NP      T  + ++   +   ID + +  + LD
Sbjct: 405  RKKRSQRNRVASQFNARILDLQWRRQNP------TGTSLADIWERSLTIDAITKLFEELD 464

Query: 762  INKESNNLGY-REQALVPYNMQNQEHSAIVVYGRDGTIVPFNPIKKRRPRPKVELDEETG 821
            INKE   L + RE AL+ Y    +E  AIV Y              ++ +PKV+LD ET 
Sbjct: 465  INKEGLCLPHNRETALILYKKSYEEQKAIVKY-------------SKKQKPKVQLDPETS 524

Query: 822  RVWKLLMGNINSKGIDGTDEEKIKWWEEERKVFQGRADSFIARMHLVQGDRRFSQWKGSV 881
            RVWKLLM +I+  G+DG+DEEK KWWEEER +F GRA+SFIARM +VQG+R FS WKGSV
Sbjct: 525  RVWKLLMSSIDCDGVDGSDEEKRKWWEEERNMFHGRANSFIARMRVVQGNRTFSPWKGSV 584

Query: 882  VDSVVGVFLTQNVSDHLSSSAFMSLAARFPPKPKCRQASCSQEPIIELDEPEEACVFNLE 941
            VDSVVGVFLTQNV+DH SSSA+M LAA FP +    + SC +E         +  + NL+
Sbjct: 585  VDSVVGVFLTQNVADHSSSSAYMDLAAEFPVEWNFNKGSCHEE---WGSSVTQETILNLD 644

Query: 942  DSMKLNKQIIHQQISEEGSLMKDEMEKSEGRIIVDNNESSGSNAEDGSSNKEPEKKSFSS 1001
                ++   I                ++  R+I++  +    N  D   ++E  K S   
Sbjct: 645  PRTGVSTPRI----------------RNPTRVIIEEIDDD-ENDIDAVCSQESSKTS--- 704

Query: 1002 SHNILETCSNSVGEISLTETSSMQACLSGEKETYDSFSFQDCLDSSIPQTSESIEPSSEG 1061
                                                       DSSI             
Sbjct: 705  -------------------------------------------DSSI------------- 764

Query: 1062 NSEDLPSWSTEAHIDSSSEELIQMTGPNTLNANFTTDTSVEQSENTTTNKLVEKKCDNRI 1121
                                                 TS +QS+    +        N +
Sbjct: 765  -------------------------------------TSADQSKTMLLDPF------NTV 824

Query: 1122 DDTSQPDDPEISLKNSVYHLSDYQTQQNQTSKSLEVDCCQTSNGVQTSNDCQNKDEHFHT 1181
                Q D   +  K  + +  D     N  S+                            
Sbjct: 825  LMNEQVDSQMVKGKGHIPYTDDL----NDLSQG--------------------------- 884

Query: 1182 EQSTLTVEYDNHANVEMELIVDIVEAPSSSSELSINAKEPGLTLQSQSSVIEDPQNVESP 1241
                                + +V + S+  EL++N   P + L S     E     +  
Sbjct: 885  --------------------ISMVSSASTHCELNLNEVPPEVELCSHQQDPESTIQTQDQ 944

Query: 1242 AECTNTVYEIPPNATEIATKPNPKECNLLSNEFKELKPASSRSQRKQVAKEKDNINWDNL 1301
             E T T  ++  N  +  T   PK+                +S+    + +K +++WD+L
Sbjct: 945  QESTRT-EDVKKNRKK-PTTSKPKK----------------KSKESAKSTQKKSVDWDSL 1004

Query: 1302 RKQTETNGKTRQRTESTMDSLDWEAIRCADVNEIAHAIRERGMNNMLAERIKDFLNRLVK 1361
            RK+ E+ G+ R+RTE TMD++DW+A+RC DV++IA+ I +RGMNNMLAERIK FLNRLVK
Sbjct: 1005 RKEAESGGRKRERTERTMDTVDWDALRCTDVHKIANIIIKRGMNNMLAERIKAFLNRLVK 1064

Query: 1362 DHGSIDLEWLRDVEPDHAKEYLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLG 1421
             HGSIDLEWLRDV PD AKEYLLSI GLGLKSVECVRLL+LH +AFPVDTNVGRIAVRLG
Sbjct: 1065 KHGSIDLEWLRDVPPDKAKEYLLSINGLGLKSVECVRLLSLHQIAFPVDTNVGRIAVRLG 1124

Query: 1422 WVPLQPLPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSK 1481
            WVPLQPLP+ LQ+HLLELYPVLES+QKYLWPRLCKLDQ+TLYELHY MITFGKVFCTK K
Sbjct: 1125 WVPLQPLPDELQMHLLELYPVLESVQKYLWPRLCKLDQKTLYELHYHMITFGKVFCTKVK 1184

Query: 1482 PNCNACPMRGECRHFASAFASARLGLPAPEDKRIVSTTECREPDNNQARTIDQP-MLSLP 1541
            PNCNACPM+ ECRH++SA ASARL LP PE+    S         ++ R+  +P +++  
Sbjct: 1185 PNCNACPMKAECRHYSSARASARLALPEPEESDRTSVM------IHERRSKRKPVVVNFR 1244

Query: 1542 PSTISSEEIKPSESHQSDGKTTAGACVPIIEEPATPEQETTTQDAIIDIED--------G 1601
            PS    +E K  E+ +S        C PIIEEPA+PE E        DIED        G
Sbjct: 1245 PSLFLYQE-KEQEAQRSQN------CEPIIEEPASPEPEYIEH----DIEDYPRDKNNVG 1304

Query: 1602 FYEDP----DEIPTIKLNIEEFSQNLQNFVQKNMELQEGDMSKALIALTPEAASIPTPKL 1661
              EDP    D IPTI LN +E   +    V K     E   S  L+ L+  AA+IP  KL
Sbjct: 1305 TSEDPWENKDVIPTIILN-KEAGTSHDLVVNK-----EAGTSHDLVVLSTYAAAIPRRKL 1332

Query: 1662 KNVSRLRTEHQVYELPDNHPLLEKLKLDRREPDDPSSYLLAIWTPGETANSIELPEKRCS 1721
            K   +LRTEH V+ELPD+H +LE    +RRE +D   YLLAIWTPGET NSI+ P++RC+
Sbjct: 1365 KIKEKLRTEHHVFELPDHHSILE--GFERREAEDIVPYLLAIWTPGETVNSIQPPKQRCA 1332

Query: 1722 -NQEHHQLCCEEECLSCNSVREANSFMVRGTLLIPCRTAMRGSFPLNGTYFQVNEVFADH 1781
              + ++ LC E +C  CN  RE  S  VRGT+LIPCRTAMRG FPLNGTYFQ NEVFADH
Sbjct: 1425 LFESNNTLCNENKCFQCNKTREEESQTVRGTILIPCRTAMRGGFPLNGTYFQTNEVFADH 1332

Query: 1782 ESSLNPIDVPRDWIWNLPRRTVYFGTSIPTIFKGLSTQGIQHCFWRGFVCVRGFDQKTRA 1841
            +SS+NPIDVP + IW+L RR  Y G+S+ +I KGLS + I++ F  G+VCVRGFD++ R 
Sbjct: 1485 DSSINPIDVPTELIWDLKRRVAYLGSSVSSICKGLSVEAIKYNFQEGYVCVRGFDRENRK 1332

Query: 1842 PRPLMARLHFPASKLNRGRGKTED 1851
            P+ L+ RLH     + R + KTE+
Sbjct: 1545 PKSLVKRLHCSHVAI-RTKEKTEE 1332

BLAST of PI0016811 vs. TAIR 10
Match: AT4G34060.1 (demeter-like protein 3 )

HSP 1 Score: 499.6 bits (1285), Expect = 1.1e-140
Identity = 289/644 (44.88%), Positives = 393/644 (61.02%), Query Frame = 0

Query: 1208 SSSSELSINAKEPGLTLQSQSSVIEDPQNVESPAECTNTVYEIPPNATEIATKPNPKECN 1267
            SS++ +S+ AK P    +  S  IE+PQ+ +S                         EC 
Sbjct: 433  SSNAFMSVAAKFPVDAREGLSYYIEEPQDAKS------------------------SECI 492

Query: 1268 LLSNE----FKELKPASSRSQRKQVAKEKDNINWDNLRKQTETNGKTRQRTESTMDSLDW 1327
            +LS+E     ++ +  + R   K    E + ++W+NLR+     G    R E  MDS++W
Sbjct: 493  ILSDESISKVEDHENTAKRKNEKTGIIEDEIVDWNNLRRMYTKEG---SRPEMHMDSVNW 552

Query: 1328 EAIRCADVNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEWLRDVEPDHAKEYLL 1387
              +R +  N +   I++RG   +L+ERI  FLN  V  +G+IDLEWLR+      K YLL
Sbjct: 553  SDVRLSGQNVLETTIKKRGQFRILSERILKFLNDEVNQNGNIDLEWLRNAPSHLVKRYLL 612

Query: 1388 SIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYPVLE 1447
             I G+GLKS ECVRLL L H AFPVDTNVGRIAVRLG VPL+PLP  +Q+H L  YP ++
Sbjct: 613  EIEGIGLKSAECVRLLGLKHHAFPVDTNVGRIAVRLGLVPLEPLPNGVQMHQLFEYPSMD 672

Query: 1448 SIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFASAR 1507
            SIQKYLWPRLCKL Q TLYELHYQMITFGKVFCTK+ PNCNACPM+ EC++FASA+ S++
Sbjct: 673  SIQKYLWPRLCKLPQETLYELHYQMITFGKVFCTKTIPNCNACPMKSECKYFASAYVSSK 732

Query: 1508 LGLPAPEDKRIVSTTECREPD---NNQARTIDQPMLSLPPSTISSEEIKPSESHQSDGKT 1567
            + L +PE+K         EP+   N  ++ +   M S          I   E   S G +
Sbjct: 733  VLLESPEEK-------MHEPNTFMNAHSQDVAVDMTS---------NINLVEECVSSGCS 792

Query: 1568 TAGACV-PIIEEPATPEQETTTQDAIIDIE-DGFYEDPDEIPTIKLNIEEFSQNLQN--F 1627
                C  P++E P++P  E      I D+     Y+    +P I  +++   +++++   
Sbjct: 793  DQAICYKPLVEFPSSPRAEIPESTDIEDVPFMNLYQSYASVPKIDFDLDALKKSVEDALV 852

Query: 1628 VQKNMELQEGDMSKALIALTPEAASIPTP---KLKNVSRLRTEHQVYELPDNHPLLEKLK 1687
            +   M   + ++SKAL+  TPE A IP     K+K  +RLRTEH VY LPDNH LL    
Sbjct: 853  ISGRMSSSDEEISKALVIPTPENACIPIKPPRKMKYYNRLRTEHVVYVLPDNHELLH--D 912

Query: 1688 LDRREPDDPSSYLLAIWTPGETANSIELPEKRCSNQEHHQLCCEEECLSCNSVREANSFM 1747
             +RR+ DDPS YLLAIW PGET++S   P+K+CS+ +  +LC  + C  C ++RE NS +
Sbjct: 913  FERRKLDDPSPYLLAIWQPGETSSSFVPPKKKCSS-DGSKLCKIKNCSYCWTIREQNSNI 972

Query: 1748 VRGTLLIPCRTAMRGSFPLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVYFGTS 1807
             RGT+LIPCRTAMRG+FPLNGTYFQ NEVFADHE+SLNPI   R+    L +R +Y G++
Sbjct: 973  FRGTILIPCRTAMRGAFPLNGTYFQTNEVFADHETSLNPIVFRRELCKGLEKRALYCGST 1030

Query: 1808 IPTIFKGLSTQGIQHCFWRGFVCVRGFDQKTRAPRPLMARLHFP 1838
            + +IFK L T+ I+ CFW GF+C+R FD+K R P+ L+ RLH P
Sbjct: 1033 VTSIFKLLDTRRIELCFWTGFLCLRAFDRKQRDPKELVRRLHTP 1030


HSP 2 Score: 110.9 bits (276), Expect = 1.1e-23
Identity = 83/222 (37.39%), Positives = 121/222 (54.50%), Query Frame = 0

Query: 806  KRRPRPKVELDEETGRVWKLLMGNINSKGIDGTDEEKIKWWEEERKVFQGRADSFIARMH 865
            K+    KV LD ET + W +LM N +S      D+E    W++ER++FQ R D FI RMH
Sbjct: 342  KKLVTAKVNLDPETIKEWDVLMVN-DSPSRSYDDKETEAKWKKEREIFQTRIDLFINRMH 401

Query: 866  LVQGDRRFSQWKGSVVDSVVGVFLTQNVSDHLSSSAFMSLAARFPPKPKCRQASCSQEPI 925
             +QG+R+F QWKGSVVDSVVGVFLTQN +D+LSS+AFMS+AA+FP   +   +   +EP 
Sbjct: 402  RLQGNRKFKQWKGSVVDSVVGVFLTQNTTDYLSSNAFMSVAAKFPVDAREGLSYYIEEP- 461

Query: 926  IELDEPEEACVFNLEDSMKL--NKQIIHQQISEEGSLMKDEMEKSEGRIIVDNNESSGSN 985
               D     C+   ++S+    + +   ++ +E+  +++DE        IVD N      
Sbjct: 462  --QDAKSSECIILSDESISKVEDHENTAKRKNEKTGIIEDE--------IVDWNNLRRMY 521

Query: 986  AEDGSSNKEPEKKSFS--------SSHNILETCSNSVGEISL 1018
             ++GS    PE    S        S  N+LET     G+  +
Sbjct: 522  TKEGS---RPEMHMDSVNWSDVRLSGQNVLETTIKKRGQFRI 548
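
The percentages on each HSP summary line follow from the counts printed next to them (for example, 289 identical positions over an alignment length of 644 gives 44.88%). The sketch below is a hypothetical helper, not part of the database or of BLAST itself, that parses one such line and recomputes both percentages. The function name `parse_hsp_stats` and the returned keys are assumptions made for illustration; the pattern accepts both the "Positives" and "Postives" spellings.

```python
import re

# Matches "Identity = a/b (p%), Positives = c/d (q%)".
# "Posi?tives" also accepts the misspelling "Postives" seen in some reports.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse one HSP summary line and recompute its percentages (illustrative helper)."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    ident, ident_len, pos, pos_len = int(ident), int(ident_len), int(pos), int(pos_len)
    return {
        "identical": ident,
        "positive": pos,
        "length": ident_len,
        # Recomputed from the counts, rounded to two decimals like the report.
        "identity_pct": round(100 * ident / ident_len, 2),
        "positives_pct": round(100 * pos / pos_len, 2),
        # Values as printed in the report, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 289/644 (44.88%), Positives = 393/644 (61.02%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

Comparing the recomputed values with the reported ones is a quick sanity check when alignment summaries have been copied or re-extracted from a web page.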

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
Q8LK56	1.5e-272	49.82	Transcriptional activator DEMETER OS=Arabidopsis thaliana OX=3702 GN=DME PE=1 SV...
Q9SJQ6	2.9e-271	48.25	DNA glycosylase/AP lyase ROS1 OS=Arabidopsis thaliana OX=3702 GN=ROS1 PE=1 SV=2
C7IW64	1.2e-266	48.97	Protein ROS1A OS=Oryza sativa subsp. japonica OX=39947 GN=ROS1A PE=1 SV=2
B8YIE8	1.0e-252	45.70	Protein ROS1C OS=Oryza sativa subsp. japonica OX=39947 GN=ROS1C PE=2 SV=2
Q9SR66	1.8e-209	41.49	DEMETER-like protein 2 OS=Arabidopsis thaliana OX=3702 GN=DML2 PE=3 SV=2

Match Name	E-value	Identity	Description
A0A0A0LAQ7	0.0e+00	95.14	ENDO3c domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G748840 PE=3...
A0A1S4DUL3	0.0e+00	94.33	protein ROS1 OS=Cucumis melo OX=3656 GN=LOC103486570 PE=3 SV=1
A0A5D3DNK4	0.0e+00	94.12	Protein ROS1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G003460 ...
A0A5A7TKC2	0.0e+00	94.07	Protein ROS1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold236G002630 ...
A0A6J1F2E4	0.0e+00	81.58	protein ROS1-like OS=Cucurbita moschata OX=3662 GN=LOC111441568 PE=3 SV=1

Match Name	E-value	Identity	Description
XP_011651988.1	0.0e+00	95.14	transcriptional activator DEMETER [Cucumis sativus] >KGN59055.1 hypothetical pro...
XP_016899677.1	0.0e+00	94.33	PREDICTED: protein ROS1 [Cucumis melo]
TYK25216.1	0.0e+00	94.12	protein ROS1 [Cucumis melo var. makuwa]
KAA0043923.1	0.0e+00	94.07	protein ROS1 [Cucumis melo var. makuwa]
XP_038904008.1	0.0e+00	89.54	DNA glycosylase/AP lyase ROS1-like [Benincasa hispida] >XP_038904009.1 DNA glyco...

Match Name	E-value	Identity	Description
AT5G04560.1	1.1e-273	49.82	HhH-GPD base excision DNA repair family protein
AT5G04560.2	1.1e-273	49.82	HhH-GPD base excision DNA repair family protein
AT2G36490.1	2.0e-272	48.25	demeter-like 1
AT3G10010.1	1.3e-210	41.49	demeter-like 2
AT4G34060.1	1.1e-140	44.88	demeter-like protein 3
InterPro
Analysis Name: InterPro Annotations of Melon (PI 482460) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
None	No IPR available	COILS	Coil	Coil	coord: 1606..1626
None	No IPR available	GENE3D	1.10.340.30	Hypothetical protein; domain 2	coord: 1304..1403; e-value: 5.5E-33; score: 116.4
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1531..1560
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 365..387
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1278..1309
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 271..300
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 423..455
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1..25
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 317..387
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1531..1561
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 972..1002
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1277..1316
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1047..1067
None	No IPR available	PANTHER	PTHR46213:SF14	ROS1, PUTATIVE-RELATED	coord: 1..314; coord: 320..1850
IPR003651	Endonuclease III-like, iron-sulphur cluster loop motif	SMART	SM00525	ccc3	coord: 1475..1495; e-value: 4.9E-4; score: 29.4
IPR003265	HhH-GPD domain	SMART	SM00478	endo3end	coord: 1310..1474; e-value: 9.7E-5; score: 22.7
IPR003265	HhH-GPD domain	CDD	cd00056	ENDO3c	coord: 1322..1440; e-value: 1.36704E-17; score: 79.9782
IPR028924	Permuted single zf-CXXC unit	PFAM	PF15629	Perm-CXXC	coord: 1702..1731; e-value: 6.4E-7; score: 29.5
IPR028925	Demeter, RRM-fold domain	PFAM	PF15628	RRM_DME	coord: 1736..1836; e-value: 2.2E-55; score: 185.1
IPR023170	Helix-hairpin-helix, base-excision DNA repair, C-terminal	GENE3D	1.10.1670.10		coord: 1404..1492; e-value: 5.5E-33; score: 116.4
IPR044811	DNA glycosylase, plant	PANTHER	PTHR46213	TRANSCRIPTIONAL ACTIVATOR DEMETER	coord: 1..314; coord: 320..1850
IPR011257	DNA glycosylase	SUPERFAMILY	48150	DNA-glycosylase	coord: 855..1497

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
PI0016811.1	PI0016811.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006284 base-excision repair
biological_process GO:0080111 DNA demethylation
biological_process GO:0006281 DNA repair
cellular_component GO:0005634 nucleus
molecular_function GO:0051539 4 iron, 4 sulfur cluster binding
molecular_function GO:0003677 DNA binding
molecular_function GO:0035514 DNA demethylase activity
molecular_function GO:0019104 DNA N-glycosylase activity
molecular_function GO:0046872 metal ion binding
molecular_function GO:0003824 catalytic activity
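
For downstream use, the GO assignments above can be grouped by category. The following is a minimal sketch that assumes each row has the form "category accession term name", separated by whitespace, as listed here. The helper name `group_go_terms` is hypothetical and is not part of any database API.

```python
from collections import defaultdict

# Rows copied from the GO table above: category, accession, term name.
GO_ROWS = """\
biological_process GO:0006284 base-excision repair
biological_process GO:0080111 DNA demethylation
biological_process GO:0006281 DNA repair
cellular_component GO:0005634 nucleus
molecular_function GO:0051539 4 iron, 4 sulfur cluster binding
molecular_function GO:0003677 DNA binding
molecular_function GO:0035514 DNA demethylase activity
molecular_function GO:0019104 DNA N-glycosylase activity
molecular_function GO:0046872 metal ion binding
molecular_function GO:0003824 catalytic activity
"""


def group_go_terms(text):
    """Group (accession, term name) pairs by GO category (illustrative helper)."""
    grouped = defaultdict(list)
    for line in text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        # The term name may contain spaces, so split only on the first two gaps.
        category, accession, name = line.split(maxsplit=2)
        grouped[category].append((accession, name))
    return dict(grouped)


grouped = group_go_terms(GO_ROWS)
for category, terms in grouped.items():
    print(category, len(terms))
```

Splitting with `maxsplit=2` keeps multi-word term names such as "4 iron, 4 sulfur cluster binding" intact.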