Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAGCGATCCGCCTGGGGTAAGGTTCGAGCTTGATCCAGAAATCGATAGGACATTCAGGATCAGAAGGAGAGAGCAGCGTAGAAACAAGATGGAGAACGTGTCGCGTCTTCCGCAGGTTCCTGAAGATCCAGCAGACCCCCAGAATCGCTTGCTGCAGTAAAATCCGTCACTGGAGAAAATGAGCAGCAAAAGAATCAGGCTGAGAATCCTATCTTGGTAGCGAACGATAGGACTAGAGCCATTCGAGCGTATGTTGTCCCAATGTTTGATGAGTTGAATCCAGGGATTGCACGCCCCAAATCCAAACGGCAAATTTTGAAATGAAACCGGTAATGTTTCAGAAGTTGCAAACCGTGGGGCAATTCCATGGTTTGTCATCTGAAGATCCTCATTTACATCTTAAGTCTTTTCTAGGAGTAATTGATTCTTTTGTGATTCAAGGAGTGCCTAGAGATGCCCTTAGATTAACTTTGTTCCCGTATTCTTTTAGAGATGGAGAAAGGCGTGGTTAAATTCTTTTGCTCCAGGGTCAATTAGGACATGGAATGAGTTAGCTGAAAAATTTTTGAGTAAATATTTCCCACCAAATAGAAATGCTAAACTAAGAAGTGAAATAGTAGGGTTTAGGCAACTTGAAGATGAGACTTTTAGTGAGGCTTGAGAGAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCACCATGGTTTACCACATTGTGTTCAAATGGAAACATTTTACAATGGGTTAAATACGGTAACCCAGGGAATGGTTGATGTTTCGGCTAGAGGGGCCCTTTTGGCAAAAACTGTTAATGAAACCTATGAAATTTTGAAAGAATATCTACTAATAGTTGTCAGTGGTCGGATGTTAGGGGCATAAATAAAAAGGTTAAAAGTGTATTAGAGGTTGATGGTGTGTCCACCATTAGGACTGATCTTGCAATGATTGCTAACGCTCTTAAGAATGTGACAGGGATTAGTCATCAGCAGCCACCAGCTATGGAGTCAACTGTAGTGGTGAGCCAAGTCACAGAAGAAACATGTGTCTACTATGGAGAAGATCACAACTACGAGTTTTTCCCCAGCAATCCAGCTTCTGTGTTTTTTGTAGGTAATCAGAGGAATAACCCTTATTCTAACTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTATAATCCAGGTTGGCGCAACCACCCCAATTTCTCATGGGGAGGACAAGGAAGTAATGTGCAAGCACAACAAAAGGTGAACCAGTTGGGATTTGCTAAAGCGCAGGTATTGCCCCAACAAAATAAGCAGGCTTTGCCCCAGCAAAATTCGGGGAGTTCTCTTGAGGTGATGATGAAAGAATTTATGGCTCTTACAGATGCCGCAATTCAAAGTAATCAAGCTTCGATGAGAGCACTTGAATTGCAAGTGGGTCAGCTAGCTAATGAGCTGAAGGCAAGGCCTCAAGGGAAACTTCCATCGGATACTGAACACCCTCGAAGGGAAGGTAAGGAGCAGGTAAAGGTGGTAACTCTTAGGAGTGGTAAGCCACTAGAAGAGCCTAGAAAAACTCAGGATATAGAAAAGGATAGTAATAAAAATGCTGTTGTTGAGAAAGAGTTGGAGTCTAGTCAGGGTGCTGGAGGCAGCAATAAAGATGTTGGAGCACCTGGCTCTGTTCCAGATGTGGAACCACCTTATGTGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGCAAAAGCCTAAGAATCAAGATGGTCAATTTAAAAAGTTTTTAGAGATTCTTAAGCAATTGCATATAAATATCCCTTTAGTAGAAGCTATTGAGCAAATGCCTAATTATGCTAAATTTCTTAAGGATATTTTAATTAAAAAGAAGAGGTTAGGTGAGTTTGAAACTGTATCTCTTACTGAGGAGTGTAGTGTTATTCTTAAGAATGGGCTACCACCCAAGGCTAAGGATCCAGGATCATTTACCATACATGTGTCTATAGGTGGAAAAGAGTTAGGTAGAGCACTTTGTGATTTAGGTGCAAGCATTAACCTTATGCCTCTTTCGATCTATCGAAAGCTAGGTATTGGTGAAGCTAGACCTATCACAGTCACACTCCAACTAGCTGATAGGTCTATCACATATCCAGAGGGTAAAATTGAGGATGTCTTAGTAAAGGTAGATAAATTCATATTTCCAGTTGATTTTATTATTTTAGACTATGAGGCTGATAAAGATGTCCAAATTATTCTAGGTCGTCCATTTTTGGCTACTGGTAGGGCGTTAATAGATGTTCAGAAAGGGGAATTAACAATGAGAGTCTGTAATGAGGAGGTAAAATTTAATGTGTTTAAAGCCATGAAATATCCAAACGAAATGGAGGATTGCTCCTTCATTAGGATTCTGGAGAGCACAGTTATTGAGACAGCAATACAGGATTCGGCTGACAAGCATTTGGAAGGTCATGGAGAGGTTAGTGTAGAGGATTTGCAGGTTTGTTTGTTAGAAAGAAAAAACGAAAAAGAGTTGTTTAGGTGTGAGGATGTTTTTGAGTCTTTAGATTTAGATCAAAGAAAGGCTCCTCCTATTAAGCCATCCCTGATTGAGGCACCTACTTTAGATTTGAAGCCCTTGTCGGATCATCTAAAGTATGTGTATCTTGGGGAAGGTGAGACGTTACCCATTATTGTTGCATCAGATTTAATGACGGAGCATGAAGAGGCCTTAATAAAGTGGCTGCAGCAATACCGCAAGGCTATTGGTTGGACATTGGCTGAGATTCAGGGAATTAGCCCATCTTTTTGTATGCACAAAATCACTCTAGAGGAGGGATCCTTGAGGAGTGTTGAGCAACAAAGAAGGCTTAACCCTGCAATGAAAGAGGTTAAAAAGGAGGTGATTAAATGGTTGGATGCTGGGATCATTTATCCAATTGCAGACAGCAATTAGGTGAGCCTTGTCCAATGTGTTCCTAAGAAAGGAGGTGTCACTGTGGTAAGCAATAAAGACAATGAGTTGATCCCAACCAGGACAATAACTGGACTATAGGAGGCTTAATAAAGCCACCCGTAACTGGACTATAGGAGGCTTAATAAAGCCACCCGTAAGGCTTAACCTCTTTGTAACTTATTGTGTACTTATCATGTTTTTTACTTTAATGCGGATTTTAGGAAGGCTTTTGAAACTTTAAAGGCTACTCTAATCTCAGCACCCAATCTTTGTGCACCTAATTGGAATTTACCATTTGAGGTAATGTGTGATGCGAGTGATGCTGCAGTAGGTGCTATGCTGGGGCAAAAACAGGGCAAATTTATCCATCCTATATATTATGCAAGCAAGATTTTAAATGAGGCACAAGTCAACTACACAACTACTGAAAAGGAGTAGTTGGCTGTGGTGTTTGCTTTTGAGAAATTCCGGCCATATTTGGTTGGATCCAAAGTCACGGTGTTCACGGATCATGCAGCAATAAGGTATTTAATGTCTAAGAAAGATGCAAAACCGAGGCTAATTCGTTGGGTTTTATTATTGCAGGAGTTCGACTTGGAAATAAAGGACAAGAAGGGATCAGAAAATGTCATTGCAGATCATTTGTCTCGTCTTGATCCATCATCATCTTTGCTGGAGCAATCTGCCATTTCAGATTCTTTTTCAGATGAACAACTTTTTGCTGTTGAGGTAAAGGTAGTCAGGGATATCCCTTGGTATGCTGATATTGTCAACTTTTTGGTAAAGGGAGTCACTCCTATTGACATGGACTGGAGGCAGAAGAAAAAGTTTAAGCATGATGCAAAATTTTTCTATTGGGATGAGCCATTTATGTATAAGCAATGCTCTGACTGTATTTTTCGTAGGTGTGTTTCAGGTGATGAAGCAAAGGTAATCCTGGAGCAATGACACTCTTCGCCGTATGGAGGTCATTTCAGCGGTCAGAGGACAGCTATGAGGATTTTGCATTGTGGATTCTTCTGGCCTACCTTATTCAAGGATGCCCATTGGTTTTACAAGCAATGTGATGCTTGCCAAAGGAGAGGAAACTTAGGACCTAGAGATTGGAGGCCATTGCATGTCATCAGAGTGATGCCAAGACAGTTGCAAGGTTTCTTCAATCGCACATCTTTGCGCGGTTTGGGACACCTAGGCTCTAGTGAGTGATGAGGGTACACACTTTGTTAATAATATCTTAACTAAGCTTTTAGCTAAGTATGGGATTAAGCATAGGATAGCTACCCCTTATCACCCACAAGCAAATGGTCAAGCTGAAATTAGTAATAGGGAAATTAAATTTATTCTAGAGAAAGTAGTCCATCCATCTAGGAAGGATTTGTCTTTTAGGTTGGATGAGGCTCTTTGGGCTTATAGGACAGCCTATAAGACTCCTCTAGGTATGTCCCCTATAGGTTAGTATATGGGAAAGCTTGCCATTTACCATTAGAGCTTGAGCATAAAAAATTTTGGGCTTTGAAAAAGTTAAATTTTGACCTGAGTCGTGCAGGAGCAATAAGAATGTTGCAGCTTAATGAATTAGAGGAATTTCGCCAATTTTCTTATAAAATTCGAAAATGTATAAAGAAAAGACTAAGTTGTGGCATGACAAGAAAATTAAATCTAAAGAGTTTGTCAAAGGTCAGAAGGTTTTGCTTTATAATTCTAGATTAAAATTGTTTCCTGGGAAACTAAAATCTAAATTGTCAGGGCCGTTTGTTGTGATTGAAGTTTTTCCCCATGGAGCAATTACTTTGTAGGATGAAAAAGATGGGAGAGTGTTCAAAGTGAATGGACAGCGTGTGAAGCATTATTGGGAAGAGGAGTTTCAGTCGAAATATCCTTCCCTAAGGTTGATTGATGATTGAGAAAGCAAAATGTTTGCGGGAGCATTATACAGAGCAATATTTTGAGCTCCTATTGTTTGTTTTTATTTTTTGATGATTCGTTTTTAATTTCGATTAGGTTAGATTTGGTTTATTTAAATTTTGCATGATTTTTTCAGTCTGTTGGTATTCCGTAGAAGTTATTTTAGATTTTATCTATTTCGCTTTGAATTTTATTTGATTTTATTCTGATTTTATTTCCGGTTTATGTAAATTATCTTTGAATTATCTTTTAATTAGTTTAAATTAGGTTTTATTAGATTTTAGTGTAGATTTAGATTTTAATTTCAGTTATTTTAATTTTGTTTAATAGAAATATTCCTAATTAACTCGGCTTCGAAAATAAAATAGAAAAAAGAGAGGAAGGATTTGATTTATGCGGTTGAATTTAAATTTCCTAATTTGATTTCCTATTTTAAATTTGAAATTCGTATCTAAGATTATTTAGGATTTGTTTTGCTCAGACGGTTGTTACGGCAACTTTATGGCTTAAGCAAATCTCTCGTCCGAATAGGAGGGTTTTGTTGGGAGTTAATTATCTTTTACCGTTTGGATTTTCTTTAAATTTTTACAGGTAAATTTGCATGCATCATCTAATGAAGCCACGTGTCACACGGAGATCATCCAAAGGCTTTGATGGTGGACAACGCATAACAATTTACCATGGGCGATTGAAATTCATAAATTTACTGTTGATGGTTTTGAATTTCTACTTTGCATGTGTTTAAAATTAAACGGGAATATTAATAAATGCAGGGCAAATCTTTTCAGTTGGGATTCGGTTTTGCTGAAGCTATTTATTTAGAGACGGTGGTAAGTTTCTTTTTACTTCACATTCCCATTTAATTCCTTTACTGTTCGTTTTTCTTCTCTTCTTTCTTCGTTGCTTTTCTCTGTCAAATCTTTGAGTTTCCATGGCGAAAACACGAGCAAGAAAAGAAAGAGAGAATGAGGAGGAAGAGGTTGCGATTACCCCTGAGGTACAGAAAGTAAAGGCTAAAAAGAAAAAGACCCCGGAGGAGAAAGAAGCCAAGAGAAGAAGGAGGCAACAGAGGGTTGCGGAGCAAGAAGGGGTTCAGGAGGTGGCAGAAGATGTTGCCACTGTAGTGGAGGAAGGAGTTCAAATTCCTGAGGTAGAACCGTTAGTCCCAGATACGGTTCAAGAAGAGAATGTTGAGAAGAATCAAGAACCACAGGCTGACGAAGTTCGGGACGAACATGCCGCAGTTGTGCCTGAAGAAGAGAATGAACAGGAACCGGTGCGATAGGCTAGAGTAGAGGTAGTCATGCCGGAGGTGCCAAAACATCGTCGCATTAAGAGGAAGGCGGGCCGCATCAGGGTGATTTGGAATACCCCATCGCCTCCATCGTCAGATTCTGAGGAAGAGAGAAGGGAACAAGAAAAGAAGGAGGCTGAGAACAAAGCAAGAGAAGAAGAAGTAAATAAAGCTGAGGAAGAGGTTTTGCCCAAGCAGAGGGAAGATAAGGGCAAAGGTATTGTTGAAGCATCGGGTGAAGCTGACGAGATTGAGGAACCAAGATTACCGTACATTCGCTTTGTCAGTAACCTTGCTCGGGAAAAGTACGTTGAGATGCTGAGACGGGACTTCCTGTTTGAACGAGGATTTGGCGATGATTTGCCACAGTTCTTGAGGACTGGAATAACGAACCTCGGTTGGAGTCAATTTTGTGCGAAACCGGAGCCTGTTAATTCCAACATTGTTTGGGAATTTTACGCAAATATTGGCGATCAGGAAGAATATCAGGTTATAGTTCGAGGAGTGCCCGTTGATTGGAGCCCAGGAGCCATTAATGCTTTGTTCAATCTCCAGGATTTTCCGCACGCAGGCTTTAATGAGATGGTGGTTGCAACATCTAGCGACCAACTAAATGTGGCTGTCCGGGAGGTTGGCATCGAGGGGGCCCAGTGGAGGTTGTCGAAGACAGAGAAGCGCACATTTCATGCTGCTTATTTCAAGAGCGAGGCCAATATGTGGATGGGTTTCATTAAGTTGAGCTTACTGCCGACAACTCACGACTCAACGGTGTCTCGAGACCGGGTTTTGCTTGCCTTTGCTATTCTTCGTTCGATGAGTATCGATGTAGGTAAAATAATTTCTTTTGAGATTCTTGATTGCTGGCGGAAAAAAGGTGGGGAAGCTGTTTTTCCCCAACACTATCACGATGTTATGCTCAAGGGCAGAGGTGCCAGAGGATGA
mRNA sequence
ATGAGCGATCCGCCTGGGGTAAGGTTCGAGCTTGATCCAGAAATCGATAGGACATTCAGGATCAGAAGGAGAGAGCAGCGTAGAAACAAGATGGAGAACGTGTCGCGTCTTCCGCAGGTTCCTGAAGATCCAGCAGACCCCCAGAATCGCTTGCTGCAGGATTGCACGCCCCAAATCCAAACGGCAAATTTTGAAATGAAACCGGTAATGTTTCAGAAGTTGCAAACCGTGGGGCAATTCCATGGTTTGTCATCTGAAGATCCTCATTTACATCTTAAGTCTTTTCTAGGAGTAATTGATTCTTTTGTGATTCAAGGAGTGCCTAGAGATGCCCTTAGATTAACTTTGTTCCCGACTGATCTTGCAATGATTGCTAACGCTCTTAAGAATGTGACAGGGATTAGTCATCAGCAGCCACCAGCTATGGAGTCAACTGTAGTGGTGAGCCAAGTCACAGAAGAAACATGTGTCTACTATGGAGAAGATCACAACTACGAGTTTTTCCCCAGCAATCCAGCTTCTGTGTTTTTTGTAGGTTGGCGCAACCACCCCAATTTCTCATGGGGAGGACAAGGAAGTAATGTGCAAGCACAACAAAAGGTGAACCAGTTGGGATTTGCTAAAGCGCAGGTATTGCCCCAACAAAATAAGCAGGCTTTGCCCCAGCAAAATTCGGGGAGTTCTCTTGAGGTGATGATGAAAGAATTTATGGCTCTTACAGATGCCGCAATTCAAAGTAATCAAGCTTCGATGAGAGCACTTGAATTGCAAGTGGGTCAGCTAGCTAATGAGCTGAAGGCAAGGCCTCAAGGGAAACTTCCATCGGATACTGAACACCCTCGAAGGGAAGGTAAGGAGCAGGTAAAGGTGGTAACTCTTAGGAGTGGTAAGCCACTAGAAGAGCCTAGAAAAACTCAGGATATAGAAAAGGATAGTAATAAAAATGCTGTTGTTGAGAAAGAGTTGGAGTCTAGTCAGGGTGCTGGAGGCAGCAATAAAGATGTTGGAGCACCTGGCTCTGTTCCAGATGTGGAACCACCTTATGTGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGCAAAAGCCTAAGAATCAAGATGGTCAATTTAAAAAGTTTTTAGAGATTCTTAAGCAATTGCATATAAATATCCCTTTAGTAGAAGCTATTGAGCAAATGCCTAATTATGCTAAATTTCTTAAGGATATTTTAATTAAAAAGAAGAGGTTAGGTGAGTTTGAAACTGTATCTCTTACTGAGGAGTGTAGTGTTATTCTTAAGAATGGGCTACCACCCAAGGCTAAGGATCCAGGATCATTTACCATACATGTGTCTATAGGTGGAAAAGAGTTAGGTAGAGCACTTTGTGATTTAGGTGCAAGCATTAACCTTATGCCTCTTTCGATCTATCGAAAGCTAGGTATTGGTGAAGCTAGACCTATCACAGTCACACTCCAACTAGCTGATAGGTCTATCACATATCCAGAGGGTAAAATTGAGGATGTCTTAGTAAAGGTAGATAAATTCATATTTCCAGTTGATTTTATTATTTTAGACTATGAGGCTGATAAAGATGTCCAAATTATTCTAGGTCGTCCATTTTTGGCTACTGGTAGGGCGTTAATAGATGTTCAGAAAGGGGAATTAACAATGAGAGTCTGTAATGAGGAGGTAAAATTTAATGTGTTTAAAGCCATGAAATATCCAAACGAAATGGAGGATTGCTCCTTCATTAGGATTCTGGAGAGCACAGTTATTGAGACAGCAATACAGGATTCGGCTGACAAGCATTTGGAAGGTCATGGAGAGGTTAGTGTAGAGGATTTGCAGGTTTGTTTGTTAGAAAGAAAAAACGAAAAAGAGTTGTTTAGGTGTGAGGATGTTTTTGAGTCTTTAGATTTAGATCAAAGAAAGGCTCCTCCTATTAAGCCATCCCTGATTGAGGCACCTACTTTAGATTTGAAGCCCTTGTCGGATCATCTAAAGTATGTGTATCTTGGGGAAGGTGAGACGTTACCCATTATTGTTGCATCAGATTTAATGACGGAGCATGAAGAGGCCTTAATAAAGTGGCTGCAGCAATACCGCAAGGCTATTGGTTGGACATTGGCTGAGATTCAGGGAATTAGCCCATCTTTTTGTATGCACAAAATCACTCTAGAGGAGGGATCCTTGAGGAGTGTTGAGCAACAAAGAAGGCTTAACCCTGCAATGAAAGAGGTTAAAAAGGAGGAGTTCGACTTGGAAATAAAGGACAAGAAGGGATCAGAAAATGTCATTGCAGATCATTTGTCTCGTCTTGATCCATCATCATCTTTGCTGGAGCAATCTGCCATTTCAGATTCTTTTTCAGATGAACAACTTTTTGCTGTTGAGGTAAAGGTAGTCAGGGATATCCCTTGGTATGCTGATATTGTCAACTTTTTGGTAAAGGGAGTCACTCCTATTGACATGGACTGGAGGCAGAAGAAAAAGTTTAAGCATGATGCAAAATTTTTCTATTGGGATGAGCCATTTATGTATAAGCAATGCTCTGACTGTATTTTTCGTAGGTGTGTTTCAGGTGATGAAGCAAAGGATGCCCATTGGTTTTACAAGCAATGTGATGCTTGCCAAAGGAGAGGAAACTTAGGACCTAGAGATTGGAGGCCATTGCATGTCATCAGAGTGATGCCAAGACAGTTGCAAGCTAAGTATGGGATTAAGCATAGGATAGCTACCCCTTATCACCCACAAGCAAATGGTCAAGCTGAAATTAGTAATAGGGAAATTAAATTTATTCTAGAGAAAGTAGTCCATCCATCTAGGAAGGATTTGTCTTTTAGGTTGGATGAGGCTCTTTGGGCTTATAGGACAGCCTATAAGACTCCTCTAGTTTCCATGGCGAAAACACGAGCAAGAAAAGAAAGAGAGAATGAGGAGGAAGAGGTTGCGATTACCCCTGAGGTACAGAAAGTAAAGGCTAAAAAGAAAAAGACCCCGGAGGAGAAAGAAGCCAAGAGAAGAAGGAGGCAACAGAGGGTTGCGGAGCAAGAAGGGGTTCAGGAGGTGGCAGAAGATGTTGCCACTGTAGTGGAGGAAGGAGTTCAAATTCCTGAGGTAGAACCGTTAGTCCCAGATACGGTTCAAGAAGAGAATGTTGAGAAGAATCAAGAACCACAGGCTGACGAAGTTCGGGACGAACATGCCGCAGTTGTGCCTGAAGAAGAGAATGAACAGGAACCGAGGAAGGCGGGCCGCATCAGGGTGATTTGGAATACCCCATCGCCTCCATCGTCAGATTCTGAGGAAGAGAGAAGGGAACAAGAAAAGAAGGAGGCTGAGAACAAAGCAAGAGAAGAAGAAGTAAATAAAGCTGAGGAAGAGGTTTTGCCCAAGCAGAGGGAAGATAAGGGCAAAGGTATTGTTGAAGCATCGGGTGAAGCTGACGAGATTGAGGAACCAAGATTACCGTACATTCGCTTTGTCAGTAACCTTGCTCGGGAAAAGTACGTTGAGATGCTGAGACGGGACTTCCTGTTTGAACGAGGATTTGGCGATGATTTGCCACAGTTCTTGAGGACTGGAATAACGAACCTCGGTTGGAGTCAATTTTGTGCGAAACCGGAGCCTGTTAATTCCAACATTGTTTGGGAATTTTACGCAAATATTGGCGATCAGGAAGAATATCAGGTTATAGTTCGAGGAGTGCCCGTTGATTGGAGCCCAGGAGCCATTAATGCTTTGTTCAATCTCCAGGATTTTCCGCACGCAGGCTTTAATGAGATGGTGGTTGCAACATCTAGCGACCAACTAAATGTGGCTGTCCGGGAGGTTGGCATCGAGGGGGCCCAGTGGAGGTTGTCGAAGACAGAGAAGCGCACATTTCATGCTGCTTATTTCAAGAGCGAGGCCAATATGTGGATGGGTTTCATTAAGTTGAGCTTACTGCCGACAACTCACGACTCAACGGTGTCTCGAGACCGGGTTTTGCTTGCCTTTGCTATTCTTCGTTCGATGAGTATCGATGTAGGTAAAATAATTTCTTTTGAGATTCTTGATTGCTGGCGGAAAAAAGGTGGGGAAGCTGTTTTTCCCCAACACTATCACGATGTTATGCTCAAGGGCAGAGGTGCCAGAGGATGA
Coding sequence (CDS)
ATGAGCGATCCGCCTGGGGTAAGGTTCGAGCTTGATCCAGAAATCGATAGGACATTCAGGATCAGAAGGAGAGAGCAGCGTAGAAACAAGATGGAGAACGTGTCGCGTCTTCCGCAGGTTCCTGAAGATCCAGCAGACCCCCAGAATCGCTTGCTGCAGGATTGCACGCCCCAAATCCAAACGGCAAATTTTGAAATGAAACCGGTAATGTTTCAGAAGTTGCAAACCGTGGGGCAATTCCATGGTTTGTCATCTGAAGATCCTCATTTACATCTTAAGTCTTTTCTAGGAGTAATTGATTCTTTTGTGATTCAAGGAGTGCCTAGAGATGCCCTTAGATTAACTTTGTTCCCGACTGATCTTGCAATGATTGCTAACGCTCTTAAGAATGTGACAGGGATTAGTCATCAGCAGCCACCAGCTATGGAGTCAACTGTAGTGGTGAGCCAAGTCACAGAAGAAACATGTGTCTACTATGGAGAAGATCACAACTACGAGTTTTTCCCCAGCAATCCAGCTTCTGTGTTTTTTGTAGGTTGGCGCAACCACCCCAATTTCTCATGGGGAGGACAAGGAAGTAATGTGCAAGCACAACAAAAGGTGAACCAGTTGGGATTTGCTAAAGCGCAGGTATTGCCCCAACAAAATAAGCAGGCTTTGCCCCAGCAAAATTCGGGGAGTTCTCTTGAGGTGATGATGAAAGAATTTATGGCTCTTACAGATGCCGCAATTCAAAGTAATCAAGCTTCGATGAGAGCACTTGAATTGCAAGTGGGTCAGCTAGCTAATGAGCTGAAGGCAAGGCCTCAAGGGAAACTTCCATCGGATACTGAACACCCTCGAAGGGAAGGTAAGGAGCAGGTAAAGGTGGTAACTCTTAGGAGTGGTAAGCCACTAGAAGAGCCTAGAAAAACTCAGGATATAGAAAAGGATAGTAATAAAAATGCTGTTGTTGAGAAAGAGTTGGAGTCTAGTCAGGGTGCTGGAGGCAGCAATAAAGATGTTGGAGCACCTGGCTCTGTTCCAGATGTGGAACCACCTTATGTGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGCAAAAGCCTAAGAATCAAGATGGTCAATTTAAAAAGTTTTTAGAGATTCTTAAGCAATTGCATATAAATATCCCTTTAGTAGAAGCTATTGAGCAAATGCCTAATTATGCTAAATTTCTTAAGGATATTTTAATTAAAAAGAAGAGGTTAGGTGAGTTTGAAACTGTATCTCTTACTGAGGAGTGTAGTGTTATTCTTAAGAATGGGCTACCACCCAAGGCTAAGGATCCAGGATCATTTACCATACATGTGTCTATAGGTGGAAAAGAGTTAGGTAGAGCACTTTGTGATTTAGGTGCAAGCATTAACCTTATGCCTCTTTCGATCTATCGAAAGCTAGGTATTGGTGAAGCTAGACCTATCACAGTCACACTCCAACTAGCTGATAGGTCTATCACATATCCAGAGGGTAAAATTGAGGATGTCTTAGTAAAGGTAGATAAATTCATATTTCCAGTTGATTTTATTATTTTAGACTATGAGGCTGATAAAGATGTCCAAATTATTCTAGGTCGTCCATTTTTGGCTACTGGTAGGGCGTTAATAGATGTTCAGAAAGGGGAATTAACAATGAGAGTCTGTAATGAGGAGGTAAAATTTAATGTGTTTAAAGCCATGAAATATCCAAACGAAATGGAGGATTGCTCCTTCATTAGGATTCTGGAGAGCACAGTTATTGAGACAGCAATACAGGATTCGGCTGACAAGCATTTGGAAGGTCATGGAGAGGTTAGTGTAGAGGATTTGCAGGTTTGTTTGTTAGAAAGAAAAAACGAAAAAGAGTTGTTTAGGTGTGAGGATGTTTTTGAGTCTTTAGATTTAGATCAAAGAAAGGCTCCTCCTATTAAGCCATCCCTGATTGAGGCACCTACTTTAGATTTGAAGCCCTTGTCGGATCATCTAAAGTATGTGTATCTTGGGGAAGGTGAGACGTTACCCATTATTGTTGCATCAGATTTAATGACGGAGCATGAAGAGGCCTTAATAAAGTGGCTGCAGCAATACCGCAAGGCTATTGGTTGGACATTGGCTGAGATTCAGGGAATTAGCCCATCTTTTTGTATGCACAAAATCACTCTAGAGGAGGGATCCTTGAGGAGTGTTGAGCAACAAAGAAGGCTTAACCCTGCAATGAAAGAGGTTAAAAAGGAGGAGTTCGACTTGGAAATAAAGGACAAGAAGGGATCAGAAAATGTCATTGCAGATCATTTGTCTCGTCTTGATCCATCATCATCTTTGCTGGAGCAATCTGCCATTTCAGATTCTTTTTCAGATGAACAACTTTTTGCTGTTGAGGTAAAGGTAGTCAGGGATATCCCTTGGTATGCTGATATTGTCAACTTTTTGGTAAAGGGAGTCACTCCTATTGACATGGACTGGAGGCAGAAGAAAAAGTTTAAGCATGATGCAAAATTTTTCTATTGGGATGAGCCATTTATGTATAAGCAATGCTCTGACTGTATTTTTCGTAGGTGTGTTTCAGGTGATGAAGCAAAGGATGCCCATTGGTTTTACAAGCAATGTGATGCTTGCCAAAGGAGAGGAAACTTAGGACCTAGAGATTGGAGGCCATTGCATGTCATCAGAGTGATGCCAAGACAGTTGCAAGCTAAGTATGGGATTAAGCATAGGATAGCTACCCCTTATCACCCACAAGCAAATGGTCAAGCTGAAATTAGTAATAGGGAAATTAAATTTATTCTAGAGAAAGTAGTCCATCCATCTAGGAAGGATTTGTCTTTTAGGTTGGATGAGGCTCTTTGGGCTTATAGGACAGCCTATAAGACTCCTCTAGTTTCCATGGCGAAAACACGAGCAAGAAAAGAAAGAGAGAATGAGGAGGAAGAGGTTGCGATTACCCCTGAGGTACAGAAAGTAAAGGCTAAAAAGAAAAAGACCCCGGAGGAGAAAGAAGCCAAGAGAAGAAGGAGGCAACAGAGGGTTGCGGAGCAAGAAGGGGTTCAGGAGGTGGCAGAAGATGTTGCCACTGTAGTGGAGGAAGGAGTTCAAATTCCTGAGGTAGAACCGTTAGTCCCAGATACGGTTCAAGAAGAGAATGTTGAGAAGAATCAAGAACCACAGGCTGACGAAGTTCGGGACGAACATGCCGCAGTTGTGCCTGAAGAAGAGAATGAACAGGAACCGAGGAAGGCGGGCCGCATCAGGGTGATTTGGAATACCCCATCGCCTCCATCGTCAGATTCTGAGGAAGAGAGAAGGGAACAAGAAAAGAAGGAGGCTGAGAACAAAGCAAGAGAAGAAGAAGTAAATAAAGCTGAGGAAGAGGTTTTGCCCAAGCAGAGGGAAGATAAGGGCAAAGGTATTGTTGAAGCATCGGGTGAAGCTGACGAGATTGAGGAACCAAGATTACCGTACATTCGCTTTGTCAGTAACCTTGCTCGGGAAAAGTACGTTGAGATGCTGAGACGGGACTTCCTGTTTGAACGAGGATTTGGCGATGATTTGCCACAGTTCTTGAGGACTGGAATAACGAACCTCGGTTGGAGTCAATTTTGTGCGAAACCGGAGCCTGTTAATTCCAACATTGTTTGGGAATTTTACGCAAATATTGGCGATCAGGAAGAATATCAGGTTATAGTTCGAGGAGTGCCCGTTGATTGGAGCCCAGGAGCCATTAATGCTTTGTTCAATCTCCAGGATTTTCCGCACGCAGGCTTTAATGAGATGGTGGTTGCAACATCTAGCGACCAACTAAATGTGGCTGTCCGGGAGGTTGGCATCGAGGGGGCCCAGTGGAGGTTGTCGAAGACAGAGAAGCGCACATTTCATGCTGCTTATTTCAAGAGCGAGGCCAATATGTGGATGGGTTTCATTAAGTTGAGCTTACTGCCGACAACTCACGACTCAACGGTGTCTCGAGACCGGGTTTTGCTTGCCTTTGCTATTCTTCGTTCGATGAGTATCGATGTAGGTAAAATAATTTCTTTTGAGATTCTTGATTGCTGGCGGAAAAAAGGTGGGGAAGCTGTTTTTCCCCAACACTATCACGATGTTATGCTCAAGGGCAGAGGTGCCAGAGGATGA
Protein sequence
MSDPPGVRFELDPEIDRTFRIRRREQRRNKMENVSRLPQVPEDPADPQNRLLQDCTPQIQTANFEMKPVMFQKLQTVGQFHGLSSEDPHLHLKSFLGVIDSFVIQGVPRDALRLTLFPTDLAMIANALKNVTGISHQQPPAMESTVVVSQVTEETCVYYGEDHNYEFFPSNPASVFFVGWRNHPNFSWGGQGSNVQAQQKVNQLGFAKAQVLPQQNKQALPQQNSGSSLEVMMKEFMALTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKVVTLRSGKPLEEPRKTQDIEKDSNKNAVVEKELESSQGAGGSNKDVGAPGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILIKKKRLGEFETVSLTEECSVILKNGLPPKAKDPGSFTIHVSIGGKELGRALCDLGASINLMPLSIYRKLGIGEARPITVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVQIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPNEMEDCSFIRILESTVIETAIQDSADKHLEGHGEVSVEDLQVCLLERKNEKELFRCEDVFESLDLDQRKAPPIKPSLIEAPTLDLKPLSDHLKYVYLGEGETLPIIVASDLMTEHEEALIKWLQQYRKAIGWTLAEIQGISPSFCMHKITLEEGSLRSVEQQRRLNPAMKEVKKEEFDLEIKDKKGSENVIADHLSRLDPSSSLLEQSAISDSFSDEQLFAVEVKVVRDIPWYADIVNFLVKGVTPIDMDWRQKKKFKHDAKFFYWDEPFMYKQCSDCIFRRCVSGDEAKDAHWFYKQCDACQRRGNLGPRDWRPLHVIRVMPRQLQAKYGIKHRIATPYHPQANGQAEISNREIKFILEKVVHPSRKDLSFRLDEALWAYRTAYKTPLVSMAKTRARKERENEEEEVAITPEVQKVKAKKKKTPEEKEAKRRRRQQRVAEQEGVQEVAEDVATVVEEGVQIPEVEPLVPDTVQEENVEKNQEPQADEVRDEHAAVVPEEENEQEPRKAGRIRVIWNTPSPPSSDSEEERREQEKKEAENKAREEEVNKAEEEVLPKQREDKGKGIVEASGEADEIEEPRLPYIRFVSNLAREKYVEMLRRDFLFERGFGDDLPQFLRTGITNLGWSQFCAKPEPVNSNIVWEFYANIGDQEEYQVIVRGVPVDWSPGAINALFNLQDFPHAGFNEMVVATSSDQLNVAVREVGIEGAQWRLSKTEKRTFHAAYFKSEANMWMGFIKLSLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISFEILDCWRKKGGEAVFPQHYHDVMLKGRGARG
Homology
BLAST of Lag0032147 vs. NCBI nr
Match:
XP_024028757.1 (uncharacterized protein LOC112093792 [Morus notabilis])
HSP 1 Score: 572.0 bits (1473), Expect = 1.4e-158
Identity = 341/673 (50.67%), Postives = 435/673 (64.64%), Query Frame = 0
Query: 119 TDLAMIANALKNVTGISHQQPPAMESTVVVSQVTEETCVYYGEDHNYEFFPSNPASVFFV 178
T L ++L N+ + PA +T V TCVY G +H++E PSNP SV +V
Sbjct: 67 TALTAQVSSLSNILKSLNVAAPANAATPVAL-----TCVYCGAEHSFENCPSNPESVCYV 126
Query: 179 ----------------GWRNHPNFSWGGQGSNVQ--AQQKVNQLGFAKAQVLPQ----QN 238
GW+ HPNFSW Q +N + GF + Q Q Q+
Sbjct: 127 NNFNRNNNPYSNSYNQGWKQHPNFSWSNQEANPMPGPSKPAYPPGFHQHQHQRQPPQEQS 186
Query: 239 KQALPQQNSGSSLEVMMKEFMALTD-------AAIQSNQASMRALELQVGQLANELKARP 298
Q P Q S + +E ++KE+MA D A +QS AS+R LE QVGQLAN L RP
Sbjct: 187 NQRQPHQASSTPMEALLKEYMARNDSLIPGQAALLQSQAASLRTLENQVGQLANVLSNRP 246
Query: 299 QGKLPSDTEHPRREG----KEQVKVVTLRSGKPLEE-PRKTQDIEKDSNKNAVVEKELES 358
QG LPSDT++PRR+G KE K +TL++G+ +E+ R+T E S + V+
Sbjct: 247 QGSLPSDTKNPRRDGKEHCKEHCKAITLQNGREIEQLTRQTAATEHSSIQTQEVQ----- 306
Query: 359 SQGAGGSNKDVGAPGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLH 418
Q S +DV D P PP PFPQR + + QD QF++FL++LKQLH
Sbjct: 307 -QPPAESEQDV----VDQDATAKLKQNKPERPPPPFPQRFQNQKQDKQFRRFLDVLKQLH 366
Query: 419 INIPLVEAIEQMPNYAKFLKDILIKKKRLGEFETVSLTEECSVILKNGLPPKAKDPGSFT 478
INIPLVEA+EQMP+Y KF+KDIL KK+RLGEFETV+LTEECS ILKN LPPK KDPGSFT
Sbjct: 367 INIPLVEALEQMPSYVKFMKDILTKKRRLGEFETVALTEECSAILKNRLPPKLKDPGSFT 426
Query: 479 IHVSIGGKELGRALCDLGASINLMPLSIYRKLGIGEARPITVTLQLADRSITYPEGKIED 538
I SIG + +G+ALCDLGASINLMP+SI+RKLGIGE P TVTLQLADRS +PEGKIED
Sbjct: 427 IPCSIGDQYIGKALCDLGASINLMPMSIFRKLGIGEVSPTTVTLQLADRSYAHPEGKIED 486
Query: 539 VLVKVDKFIFPVDFIILDYEADKDVQIILGRPFLATGRALIDVQKGELTMRVCNEEVKFN 598
VLV+VDKFIFP DFI+LDYEADK+V IILGRPFLATG+ LIDVQKGELTMRV +++V FN
Sbjct: 487 VLVRVDKFIFPADFIVLDYEADKEVPIILGRPFLATGKTLIDVQKGELTMRVHDQQVTFN 546
Query: 599 VFKAMKYPNEMEDCSFIRILESTVIETAIQDSADKHLEGHGEVSVEDLQVCLLERKNEKE 658
VFKAM++ +E+E+CS + +L+S V + A+K + + E + E N+K+
Sbjct: 547 VFKAMRFTDEVEECSAMNVLDSLVAAEFEKTCAEKLMTEEDLIDSE-----INEDNNDKQ 606
Query: 659 LFRCED---------VFESLDLDQRKAPPIKPSLIEAPTLDLKPLSDHLKYVYLGEGETL 718
+ R E FESLDL KPS+ E P L+L+PL HL+Y YLG+ +TL
Sbjct: 607 VSRLEGRHAATKSRRHFESLDLSTEPLRQHKPSVEEPPILELRPLPAHLRYAYLGDSDTL 666
Query: 719 PIIVASDLMTEHEEALIKWLQQYRKAIGWTLAEIQGISPSFCMHKITLEEGSLRSVEQQR 749
P+I+AS L E L++ L+++++AIGWT+A+I+GISPS CMHKI L+E SVEQQR
Sbjct: 667 PVIIASGLNDMQEIQLLEVLKKFKRAIGWTIADIKGISPSICMHKILLQECCSNSVEQQR 719
BLAST of Lag0032147 vs. NCBI nr
Match:
PIN20438.1 (DNA-directed DNA polymerase [Handroanthus impetiginosus])
HSP 1 Score: 571.6 bits (1472), Expect = 1.8e-158
Identity = 444/1308 (33.94%), Postives = 595/1308 (45.49%), Query Frame = 0
Query: 60 QTANFEMKPVMFQKLQTVGQFHGLSSED----PHLHLKSFL--------GVIDSFVIQGV 119
+TA + + F+++ T +HGL+ D HL+ SFL ++++FV
Sbjct: 170 KTAAVRAEIMTFRQVHTF--YHGLTDGDKDKLDHLNGDSFLSGTTAECHNLLNNFVANHY 229
Query: 120 PRDALRLTLFPTDLAMIANALKNVTGISHQQPPAMES--TVVVSQV--TEETCVYYGEDH 179
+ + R T P A + + VT ++ + M+S ++QV T TC YGE H
Sbjct: 230 EKKSERAT--PPKAAGVIE-VDQVTALNAKIDFLMQSIKNFGINQVQHTPVTCKEYGERH 289
Query: 180 NYEFFPSNPASVFFVG-----------------WRNHPNFSWG---GQGSNVQAQQKVNQ 239
+ P + S+ FV WR HPNFSW GQGS + QQ Q
Sbjct: 290 LSDQCPHSVESIQFVSNARKPQNNPYSNTYNPRWRQHPNFSWNNNQGQGSAPRFQQGGQQ 349
Query: 240 LGFAKAQVLPQQNKQALPQQNSGSSLEVMMKEFMALTDAAIQSNQASMRALELQVGQLAN 299
QV ++ L Q FMA S A+ + +E Q+GQLAN
Sbjct: 350 ------QVQQPMQEETLIQ-------------FMA-------STAANFKTMETQIGQLAN 409
Query: 300 ELKARPQGKLPSDTE-HPRREGKEQVKVVTLRSGKPLEEPRKTQDIEKDSNKNAVVEKEL 359
+ +RPQG LPS+TE +PR++G E + EE EKE+
Sbjct: 410 AINSRPQGSLPSNTEPNPRQDGNEVIS----------EEK----------------EKEI 469
Query: 360 ESSQGAGGSNKDVGAPGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQ 419
E+ P V P + P PFPQR + + + QF KFLE+ K+
Sbjct: 470 EA---------------------PLEVSKPTTLQP-PFPQRLQKQKLEKQFLKFLEVFKK 529
Query: 420 LHINIPLVEAIEQMPNYAKFLKDILIKKKRLGEFETVSLTEECSVILKNGLPPKAKDPGS 479
LHINIP EA+EQMP+Y KF+KDIL KK+RLG++ETV LTEECS I++N LPPK KDPGS
Sbjct: 530 LHINIPFAEALEQMPSYVKFMKDILSKKRRLGDYETVVLTEECSAIIQNKLPPKLKDPGS 589
Query: 480 FTIHVSIGGKELGRALCDLGASINLMPLSIYRKLGIGEARPITVTLQLADRSITYPEGKI 539
FTI +IG GRALCDL ASINLMP SIYR LG+GEA+P ++TLQLADRS+TYP+G I
Sbjct: 590 FTIPCTIGTHFSGRALCDLRASINLMPYSIYRTLGLGEAKPTSITLQLADRSLTYPKGVI 649
Query: 540 EDVLVKVDKFIFPVDFIILDYEADKDVQIILGRPFLATGRALIDVQKGELTMRVCNEEVK 599
ED+LVKVDKFIFP DF++LD E D +V IILGRPFLATGR LIDVQKGELTMRV ++++
Sbjct: 650 EDILVKVDKFIFPADFVVLDMEVDIEVPIILGRPFLATGRTLIDVQKGELTMRVQDQQII 709
Query: 600 FNVFKAMKYPNEMEDCSFIRILESTVIETAIQDSADKHLEGHGEVSVEDLQVCLLERKNE 659
FNVFKAMK+PNE ++C + + ++ +I A++ L+ +E + LL+ +NE
Sbjct: 710 FNVFKAMKFPNESDECFAVNLFDNLAGNESI---AEQPLD-----PLERALLDLLDEENE 769
Query: 660 KE-----LFRCEDVFESLDLD--QRKAPP--IKPSLIEAPTLDLKPLSDHLKYVYLGEGE 719
++ + F+S ++ +R AP +KPS+ E PTL LKPL HL YVYLG+ +
Sbjct: 770 EDREVVKMLDASKYFKSRGIESLERTAPSKVLKPSIEEPPTLVLKPLPSHLCYVYLGKSD 829
Query: 720 TLPIIVASDLMTEHEEALIKWLQQYRKAIGWTLAEIQGISPSFCMHKITLEEGSLRSVEQ 779
TLP+I++S L E L++ L+ ++ AIGWT+A+I+GISPSFCMHKI LE+ SVE
Sbjct: 830 TLPVIISSSLSDLQVEKLLRVLRNHKGAIGWTIADIKGISPSFCMHKILLEDDQKPSVES 889
Query: 780 QRRLNPAMKEVKK----------------------------------------------- 839
QRRLNP MKEV K
Sbjct: 890 QRRLNPIMKEVVKKEIIKWLDAGIIYPISDSSWVSPVQCIPKKGGTTVVPNMHNELIPTR 949
Query: 840 ------------------------------------------------------------ 899
Sbjct: 950 TVTGCMMAIFTDMVENCLEVFMDDFSVYGDSFDKCLNNLSCVLKRCEDTNLVLNWKKCHF 1009
Query: 900 ------------------------------------------------------------ 959
Sbjct: 1010 MVQEGIVLGHKISNRDIEVDKAKLETIEKLPPPTSVKGVRSFLGHAGFYRRFIKDFSKIS 1069
Query: 960 ------------------------------------------------------------ 1006
Sbjct: 1070 KPLCNFKTLNDAQLNYTTIEKELLAVVFAFDKFRSYLVGTKVIVYTDHAAIRYLIEKKDA 1129
BLAST of Lag0032147 vs. NCBI nr
Match:
XP_017239676.1 (PREDICTED: uncharacterized protein LOC108212460 [Daucus carota subsp. sativus])
HSP 1 Score: 554.7 bits (1428), Expect = 2.3e-153
Identity = 408/1268 (32.18%), Postives = 537/1268 (42.35%), Query Frame = 0
Query: 140 PAMESTVVVSQVTEETCV-YYGEDHNYEFFPSNP-ASVFFVGWRNHPNFSWGGQGSNVQA 199
P + ++ V + + V Y G N + +NP ++ + GWRNHPNFSW +NV+
Sbjct: 323 PTQQCPLIYHDVAQSSSVNYVGNSSNQQ---NNPFSNTYNPGWRNHPNFSW---NNNVRP 382
Query: 200 QQKVNQLGFAKAQVLPQQNKQALPQQ-NSGSSLEVMMKEFMALTDAAIQSNQASMRALEL 259
Q V P + PQ+ + E ++ ++M TDA IQS ASMRALE+
Sbjct: 383 NMPFKQ------NVPPGFQQNPRPQEMEKKPNTEDLLLQYMQKTDALIQSQSASMRALEM 442
Query: 260 QVGQLANELKARPQGKLPSDTE-HPRREGKEQVKVVTLRSGKPLEEPRKTQDIEKDSNKN 319
QVGQLA+ + RP G LPS+TE +P+ + +E K +TLRSGK +E K D D K
Sbjct: 443 QVGQLASAINNRPSGSLPSNTEPNPKNDKREHCKAITLRSGKEIEGNTKKVDDGGDPEKV 502
Query: 320 AVVEKELESSQGAGGSNKDVGAPGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFKK 379
E + S+ A S P PP PFPQR + + QD QF+K
Sbjct: 503 LNEEPSVLSNPKADAS-----------------TPKKHVYPPPPFPQRLQKQKQDKQFQK 562
Query: 380 FLEILKQLHINIPLVEAIEQMPNYAKFLKDILIKKKRLGEFETVSLTEECSVILKNGLPP 439
F+++ K+L INIP EA+EQM +Y KF+KDIL +K+RL EFETV+LTEECS IL+ LPP
Sbjct: 563 FMDVFKKLSINIPFAEALEQMSSYVKFMKDILSRKRRLEEFETVTLTEECSAILQKKLPP 622
Query: 440 KAKDPGSFTIHVSIGGKELGRALCDLGASINLMPLSIYRKLGIGEARPITVTLQLADRSI 499
K KDPGSFTI +IG + G+ALCDLGAS+NLMPLSI+ KLG+GE +P +V LQLADRS+
Sbjct: 623 KLKDPGSFTIPCTIGNQYFGKALCDLGASVNLMPLSIFVKLGVGEVKPTSVRLQLADRSL 682
Query: 500 TYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVQIILGRPFLATGRALIDVQKGELTMR 559
YP G +EDVLVKVDKFIFP DFI+LD E D D+ ++LGRPFLATGR LIDVQKGELTMR
Sbjct: 683 AYPRGVVEDVLVKVDKFIFPADFIVLDMEEDADIPLLLGRPFLATGRTLIDVQKGELTMR 742
Query: 560 VCNEEVKFNVFKAMKYPNEMEDCSFIRILES-----TVIETAIQDSADKHLEGHGEVSVE 619
V +E+V FNVF AMK+ N+ E C + ++E D + L G+ S E
Sbjct: 743 VQDEQVTFNVFSAMKFSNDEESCFSVSTFTGGDDLPLMLEQHSTDPLELSLREAGDESNE 802
Query: 620 DLQVCLLERKNEKELFRCEDVFESLDLDQRKAPPIKPSLIEAPTLDLKPLSDHLKYVYLG 679
++ C+ E R FES ++ K+ KPS+ E P L+LK L HLKY +LG
Sbjct: 803 EIAECVKELNALPTYRRPFQQFESFEMPV-KSKASKPSIEEPPELELKQLPTHLKYAFLG 862
Query: 680 EGETLPIIVASDLMTEHEEALIKWLQQYRKAIGWTLAEIQGISPSFCMHKITLEEGSLRS 739
E TLP+I++S L EHEE L++ L++Y++AIGW +A+I+GISPSFCMHKI++E+ +
Sbjct: 863 EKSTLPVILSSTLSAEHEEKLLRVLKEYKRAIGWKIADIRGISPSFCMHKISMEDDHKPN 922
Query: 740 VEQQRRLNPAMKEVKK-------------------------------------------- 799
+E QRRLNP MKEV K
Sbjct: 923 IEHQRRLNPVMKEVVKKEIIKWLDAGIIYPISDSSWVSPIQCVPKKGGITVVANEKNELI 982
Query: 800 ------------------------------------------------------------ 859
Sbjct: 983 PTRTVTGWRVCMDYRKLNKATRKDHFPLPFIDQMLDRLAGKEFYCFLDGYSGYHQIAIAP 1042
Query: 860 ------------------------------------------------------------ 919
Sbjct: 1043 EDQEKTTFTCPFGTFAFRKVSFGLCNAPSTFQRCMMAIFSDMIEQGVEVFMDDFSVLGDS 1102
Query: 920 ------------------------------------------------------------ 962
Sbjct: 1103 FDACLMNLARVLQQVDKAKLETIGKLPPPSSVKGVRSFLGHAGFYRRFIKDFSKISKPLC 1162
BLAST of Lag0032147 vs. NCBI nr
Match:
XP_017239676.1 (PREDICTED: uncharacterized protein LOC108212460 [Daucus carota subsp. sativus])
HSP 1 Score: 59.3 bits (142), Expect = 3.0e-04
Identity = 37/121 (30.58%), Postives = 63/121 (52.07%), Query Frame = 0
Query: 11 LDPEIDRTFRIRRR--EQRRNKMENVSRLPQVPEDPADPQNRLLQD--------CTPQIQ 70
LD EI++T + RR ++ RN+ + VP ++ ++ P I
Sbjct: 12 LDLEIEKTAKANRRKAKESRNRSSTMGDDANVPPRRLRVKDYIMPSFDGIHSSIARPAIA 71
Query: 71 TANFEMKPVMFQKLQTVGQFHGLSSEDPHLHLKSFLGVIDSFVIQGVPRDALRLTLFPTD 122
NF + Q ++ +F+GLS+EDP+ HL++FL ++D+F + GVP + +RL LF
Sbjct: 72 ANNFHVDSATMQAIRD-NKFNGLSAEDPNAHLRNFLEIVDNFKVNGVPEETIRLRLFSRS 131
HSP 2 Score: 553.1 bits (1424), Expect = 6.6e-153
Identity = 403/1159 (34.77%), Postives = 530/1159 (45.73%), Query Frame = 0
Query: 170 SNP-ASVFFVGWRNHPNFSWGGQGSNVQAQQKVNQLGFAKAQVLPQQNKQALPQQNSGSS 229
+NP ++ + GWRNHPNFSW SN Q Q+ GF P Q K+ + +++ +
Sbjct: 296 NNPYSNTYNPGWRNHPNFSW----SNTQNVQRPPP-GF------PAQEKK-INLEDALTQ 355
Query: 230 LEVMMKEFMALTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQ 289
L + +FM T Q+ QAS++ LE+QVGQLAN + R QG PS E +P+ + EQ
Sbjct: 356 LTMSTTQFMTETKTQFQNQQASIQNLEVQVGQLANVISGRNQGVFPSQPEVNPKNQ--EQ 415
Query: 290 VKVVTLRSGKPLEEPRKTQDIEKDSNKNAVVEKELESSQGAGGSNKDVGAPGSVPD---V 349
K +TLR GK + D+EK++ +EKE E+ + A P + +
Sbjct: 416 AKAITLRKGK---QVNTAIDLEKEA-----LEKEKEAKKFAAEMGHAFSPPITTTEKSQE 475
Query: 350 EPPYVPPP-----PYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNY 409
E +P P PYVP +PFPQR + DGQF KFLE+ ++L INIP EA+EQMP+Y
Sbjct: 476 EENSIPIPSLQLKPYVPQIPFPQRLRKNKVDGQFAKFLEMFRKLQINIPFAEALEQMPSY 535
Query: 410 AKFLKDILIKKKRLGEFETVSLTEECSVILKNGLPPKAKDPGSFTIHVSIGGKELGRALC 469
AKF+KDIL KK++ GE E + LTEECS IL+ LPPK KD GSF I +IG RALC
Sbjct: 536 AKFMKDILSKKRKFGEHEKIQLTEECSAILQRKLPPKQKDRGSFKIPCTIGNNFFERALC 595
Query: 470 DLGASINLMPLSIYRKLGIGEARPITVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFI 529
DLG+SINL+PLS+ +K+GIGE +P TV+LQ+ADRSITYP+G IEDVLVKVD IFP DF+
Sbjct: 596 DLGSSINLLPLSVAKKIGIGEIKPTTVSLQMADRSITYPDGIIEDVLVKVDTLIFPADFL 655
Query: 530 ILDYEADKDVQIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPNEMEDCS 589
+LD E D D Q+ILGRPFL T R LIDV++G LT+RV NE+ F VF+A+K+P E EDC
Sbjct: 656 VLDMEEDSDTQLILGRPFLITSRTLIDVEEGLLTLRVGNEQATFKVFEAIKFPREAEDCF 715
Query: 590 FIRILESTVIETAIQDSADKHLEG---HGEVSVED----LQVCLLERKNEKELFRCEDVF 649
I +++ +T +++ LE H S +D + L ++ R + F
Sbjct: 716 HIELIDEIASDTFKKENPSHPLESTLVHAATSQDDNPMVAEYALYLDASQPYHPRQRNQF 775
Query: 650 ESLDLDQRKAPP-IKPSLIEAPTLDLKPLSDHLKYVYLGEGETLPIIVASDLMTEHEEAL 709
E L APP PS+I APTL LKPL HL+Y YLG ETLP+I+A++L EE +
Sbjct: 776 EPLG----AAPPKAAPSVIAAPTLTLKPLPTHLRYAYLGTSETLPVIIAANLSETEEEKV 835
Query: 710 IKWLQQYRKAIGWTLAEIQGISPSFCMHKITLEEGSLRSVEQQRRLNPAMKEV------- 769
++ L++++ AIGWT+A+I+GISPS CMH+I +EE SVE QRRLNP MKEV
Sbjct: 836 LRVLRKHKTAIGWTIADIKGISPSMCMHRILMEEEHKPSVEHQRRLNPNMKEVVRAEVLK 895
Query: 770 ------------------------------------------------------------ 829
Sbjct: 896 LLDAGIIYPISDSSWVSPTQVVPKKGGMTVVKNENNELVPTRTVTGWRVDRAKIETIEKL 955
Query: 830 ------------------------------------------------------------ 889
Sbjct: 956 PPPSTVKGIRSFLGHAGFYRRFIKDFSKITKPLCKLLLKDSEFNFDSDCLEAFNLLKKKL 1015
Query: 890 ------------------------------------------------------------ 949
Sbjct: 1016 TTAPVIMAPDWELPFEIMCDASDYAIGAVLGQRKNKLLHVIHYASRTLNDAQLNYATTEK 1075
Query: 950 -----------------------------------KKE-------------EFDLEIKDK 962
KKE EFD+EI+DK
Sbjct: 1076 ELLAVVFALDKFRSYLLGAKVIVYTDHAALKFLLAKKEAKPRLIRWVLLLQEFDIEIRDK 1135
BLAST of Lag0032147 vs. NCBI nr
Match:
XP_008231996.1 (PREDICTED: uncharacterized protein LOC103331166 [Prunus mume])
HSP 1 Score: 62.4 bits (150), Expect = 3.5e-05
Identity = 31/65 (47.69%), Postives = 38/65 (58.46%), Query Frame = 0
Query: 57 PQIQTANFEMKPVMFQKLQTVGQFHGLSSEDPHLHLKSFLGVIDSFVIQGVPRDALRLTL 116
P I NFE+KP M LQ F GL +EDP++HL FL + D+ GV DA+RL L
Sbjct: 25 PAIAANNFEIKPAMITMLQNSSVFCGLPNEDPNIHLAIFLEICDTSKFNGVTDDAIRLRL 84
Query: 117 FPTDL 122
FP L
Sbjct: 85 FPFSL 89
HSP 2 Score: 545.0 bits (1403), Expect = 1.8e-150
Identity = 393/1198 (32.80%), Postives = 558/1198 (46.58%), Query Frame = 0
Query: 57 PQIQTANFEMKPVMFQKLQTVGQFHGLSSEDPHLHLKSFLGVIDSFVIQGVPRDALRLTL 116
P + NFE+KP + Q +Q QF G +EDPH HL +FL + D+ + GV DA+RL L
Sbjct: 31 PTVNANNFEIKPGLIQMVQQ-EQFGGGPAEDPHAHLANFLEICDTIKMNGVSDDAIRLRL 90
Query: 117 FPTDLAMIANALKNVTGISHQQPPAMESTVVVSQVTEETCVYYGED----HNYEFFPSNP 176
FP L A A N + P + +SQ G+ ++ F
Sbjct: 91 FPFSLKDKAKAWLN-----SKAPNSFTIWNALSQAFLSKYFPPGKTAKLRNDITSFAQFD 150
Query: 177 ASVFFVGWRNHPNF--------------SWGGQGSNVQAQQKVNQLGFAKAQVLPQQNKQ 236
+ W + + GG + ++ L + N++
Sbjct: 151 GESLYEAWERFKDLQRKCPHHVRITIDAAAGGTLMSKSIEEAYELLEEMASNNYQWSNER 210
Query: 237 ALPQQNSG--------------SSLEVMMKEFMALTDAAIQSNQA--------------S 296
+P++ G SL M T + QA
Sbjct: 211 GMPKKVPGMYDVDGINMLNAKVDSLVKMFGSSSRTTLIPTHTTQAGGIIPTFHGKIKVTK 270
Query: 297 MRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKVVTLRSGKPLEEPRKTQDIEK 356
+ L+L + ++N+ +G LPS TE KE K VTLRSGK L +
Sbjct: 271 VAVLDLSILLVSNQDHPNQKGNLPSKTE---VNPKEHCKAVTLRSGKQLGQ--------- 330
Query: 357 DSNKNAVVEKELESSQGAGGSNKDVGAPGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQD 416
+ +V E++ + + N++V + P PYVPP+PFPQR K D
Sbjct: 331 -VSGETIVGDEVDYDEVSKKVNEEV---EDLAKTTSPLPLVKPYVPPIPFPQRLKQNKID 390
Query: 417 GQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILIKKKRLGEFETVSLTEECSVILK 476
QF+KFL++ +QLHINIP +A+ Q+P Y KFLK+I+ KK++L +FET++LTEECS I++
Sbjct: 391 QQFEKFLKVFRQLHINIPFADALAQIPAYTKFLKEIMSKKRKLEDFETIALTEECSAIIQ 450
Query: 477 NGLPPKAKDPGSFTIHVSIGGKELGRALCDLGASINLMPLSIYRKLGIGEARPITVTLQL 536
N LPPK +DPGSF+I +IG + RALCDLGAS+ LMPLS+ RKLG+ E +P T++LQL
Sbjct: 451 NKLPPKLRDPGSFSIPCTIGDVDFSRALCDLGASVLLMPLSVSRKLGLKELKPTTISLQL 510
Query: 537 ADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVQIILGRPFLATGRALIDVQKG 596
ADRS+ YP G +E+VL+KV KFI PVDFI+L+ E D ++ IILGRPFLAT A+ID++ G
Sbjct: 511 ADRSVKYPLGVLENVLIKVKKFIIPVDFIVLEMEEDTEIPIILGRPFLATAGAIIDIKNG 570
Query: 597 ELTMRVCNEEVKFNVFKAMKYPNEMEDCSFIRILESTVIETAIQDSADKHLE----GHGE 656
LT++V EEV+FN+F+A KYP+ + + +++ + E ++ + LE G
Sbjct: 571 RLTLKVGEEEVEFNLFEATKYPSFTDHVFRVDVVDESTREVFRAENTKEPLETCLVSAGT 630
Query: 657 VSVEDLQV----CLLERKNEKELFRCEDVFESLDLDQRKAPPIKPSLIEAPTLDLKPLSD 716
++L+V C LE K L + FE D+ + K PP PS ++AP L+LKPL
Sbjct: 631 SKDDNLEVAKVACALEATCPK-LKKRGIYFE--DIGKGKPPP-PPSNVQAPVLELKPLPS 690
Query: 717 HLKYVYLGEGETLPIIVASDLMTEHEEALIKWLQQYRKAIGWTLAEIQGISPSFCMHKIT 776
HL+Y +LGE TLP+IV++ L E + LI+ L+ +KAIGWT+++++GISPS CMH+I
Sbjct: 691 HLRYAFLGENNTLPVIVSTSLSGEQLDKLIRILRLRKKAIGWTISDLRGISPSLCMHRIL 750
Query: 777 LEEGSLRSVEQQRRLNPAMKEVKK------------------------------------ 836
+E+ VE QRRLNP MKEV +
Sbjct: 751 MEDNHKPIVENQRRLNPNMKEVVRAEVLKWLDAGIIYPISDSSWISSVQVLERLAGYAYY 810
Query: 837 ------------------------------------------------------------ 896
Sbjct: 811 YFLDGYSGLKQELVSAPIMEAPDWSLPFELMCDASDFALGAILGQRKDRKLHVIYYASSK 870
Query: 897 ------------------------------EEFDLEIKDKKGSENVIADHLSRLDPSSSL 956
+EFDLEI+DK+G ENV+ADHLSRL+ S
Sbjct: 871 VIVYTDHSAIKYLLKKKDAKPRLIRWVLLLQEFDLEIRDKRGMENVVADHLSRLE-GQSR 930
Query: 957 LEQSAISDSFSDEQLFAVEVKVVRDIPWYADIVNFLVKGVTPIDMDWRQKKKFKHDAKFF 962
++ I++SF DEQL V V IPWYAD VN+LV G+ P D+ + QKKKF D K +
Sbjct: 931 ADEVPINESFPDEQLLVVSV-----IPWYADFVNYLVSGIVPPDLSYHQKKKFLRDVKHY 990
BLAST of Lag0032147 vs. ExPASy TrEMBL
Match:
A0A2G9HSD1 (DNA-directed DNA polymerase OS=Handroanthus impetiginosus OX=429701 GN=CDL12_06869 PE=4 SV=1)
HSP 1 Score: 571.6 bits (1472), Expect = 8.7e-159
Identity = 444/1308 (33.94%), Postives = 595/1308 (45.49%), Query Frame = 0
Query: 60 QTANFEMKPVMFQKLQTVGQFHGLSSED----PHLHLKSFL--------GVIDSFVIQGV 119
+TA + + F+++ T +HGL+ D HL+ SFL ++++FV
Sbjct: 170 KTAAVRAEIMTFRQVHTF--YHGLTDGDKDKLDHLNGDSFLSGTTAECHNLLNNFVANHY 229
Query: 120 PRDALRLTLFPTDLAMIANALKNVTGISHQQPPAMES--TVVVSQV--TEETCVYYGEDH 179
+ + R T P A + + VT ++ + M+S ++QV T TC YGE H
Sbjct: 230 EKKSERAT--PPKAAGVIE-VDQVTALNAKIDFLMQSIKNFGINQVQHTPVTCKEYGERH 289
Query: 180 NYEFFPSNPASVFFVG-----------------WRNHPNFSWG---GQGSNVQAQQKVNQ 239
+ P + S+ FV WR HPNFSW GQGS + QQ Q
Sbjct: 290 LSDQCPHSVESIQFVSNARKPQNNPYSNTYNPRWRQHPNFSWNNNQGQGSAPRFQQGGQQ 349
Query: 240 LGFAKAQVLPQQNKQALPQQNSGSSLEVMMKEFMALTDAAIQSNQASMRALELQVGQLAN 299
QV ++ L Q FMA S A+ + +E Q+GQLAN
Sbjct: 350 ------QVQQPMQEETLIQ-------------FMA-------STAANFKTMETQIGQLAN 409
Query: 300 ELKARPQGKLPSDTE-HPRREGKEQVKVVTLRSGKPLEEPRKTQDIEKDSNKNAVVEKEL 359
+ +RPQG LPS+TE +PR++G E + EE EKE+
Sbjct: 410 AINSRPQGSLPSNTEPNPRQDGNEVIS----------EEK----------------EKEI 469
Query: 360 ESSQGAGGSNKDVGAPGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQ 419
E+ P V P + P PFPQR + + + QF KFLE+ K+
Sbjct: 470 EA---------------------PLEVSKPTTLQP-PFPQRLQKQKLEKQFLKFLEVFKK 529
Query: 420 LHINIPLVEAIEQMPNYAKFLKDILIKKKRLGEFETVSLTEECSVILKNGLPPKAKDPGS 479
LHINIP EA+EQMP+Y KF+KDIL KK+RLG++ETV LTEECS I++N LPPK KDPGS
Sbjct: 530 LHINIPFAEALEQMPSYVKFMKDILSKKRRLGDYETVVLTEECSAIIQNKLPPKLKDPGS 589
Query: 480 FTIHVSIGGKELGRALCDLGASINLMPLSIYRKLGIGEARPITVTLQLADRSITYPEGKI 539
FTI +IG GRALCDL ASINLMP SIYR LG+GEA+P ++TLQLADRS+TYP+G I
Sbjct: 590 FTIPCTIGTHFSGRALCDLRASINLMPYSIYRTLGLGEAKPTSITLQLADRSLTYPKGVI 649
Query: 540 EDVLVKVDKFIFPVDFIILDYEADKDVQIILGRPFLATGRALIDVQKGELTMRVCNEEVK 599
ED+LVKVDKFIFP DF++LD E D +V IILGRPFLATGR LIDVQKGELTMRV ++++
Sbjct: 650 EDILVKVDKFIFPADFVVLDMEVDIEVPIILGRPFLATGRTLIDVQKGELTMRVQDQQII 709
Query: 600 FNVFKAMKYPNEMEDCSFIRILESTVIETAIQDSADKHLEGHGEVSVEDLQVCLLERKNE 659
FNVFKAMK+PNE ++C + + ++ +I A++ L+ +E + LL+ +NE
Sbjct: 710 FNVFKAMKFPNESDECFAVNLFDNLAGNESI---AEQPLD-----PLERALLDLLDEENE 769
Query: 660 KE-----LFRCEDVFESLDLD--QRKAPP--IKPSLIEAPTLDLKPLSDHLKYVYLGEGE 719
++ + F+S ++ +R AP +KPS+ E PTL LKPL HL YVYLG+ +
Sbjct: 770 EDREVVKMLDASKYFKSRGIESLERTAPSKVLKPSIEEPPTLVLKPLPSHLCYVYLGKSD 829
Query: 720 TLPIIVASDLMTEHEEALIKWLQQYRKAIGWTLAEIQGISPSFCMHKITLEEGSLRSVEQ 779
TLP+I++S L E L++ L+ ++ AIGWT+A+I+GISPSFCMHKI LE+ SVE
Sbjct: 830 TLPVIISSSLSDLQVEKLLRVLRNHKGAIGWTIADIKGISPSFCMHKILLEDDQKPSVES 889
Query: 780 QRRLNPAMKEVKK----------------------------------------------- 839
QRRLNP MKEV K
Sbjct: 890 QRRLNPIMKEVVKKEIIKWLDAGIIYPISDSSWVSPVQCIPKKGGTTVVPNMHNELIPTR 949
Query: 840 ------------------------------------------------------------ 899
Sbjct: 950 TVTGCMMAIFTDMVENCLEVFMDDFSVYGDSFDKCLNNLSCVLKRCEDTNLVLNWKKCHF 1009
Query: 900 ------------------------------------------------------------ 959
Sbjct: 1010 MVQEGIVLGHKISNRDIEVDKAKLETIEKLPPPTSVKGVRSFLGHAGFYRRFIKDFSKIS 1069
Query: 960 ------------------------------------------------------------ 1006
Sbjct: 1070 KPLCNFKTLNDAQLNYTTIEKELLAVVFAFDKFRSYLVGTKVIVYTDHAAIRYLIEKKDA 1129
BLAST of Lag0032147 vs. ExPASy TrEMBL
Match:
A0A2G9GK35 (Reverse transcriptase OS=Handroanthus impetiginosus OX=429701 GN=CDL12_21798 PE=4 SV=1)
HSP 1 Score: 539.7 bits (1389), Expect = 3.7e-149
Identity = 382/1045 (36.56%), Postives = 500/1045 (47.85%), Query Frame = 0
Query: 150 QVTEETCVYYGEDHNYEFFPSNPASVFFV-----------------GWRNHPNFSWG--- 209
Q T TC GE H + P + S+ FV GWR HPNFSW
Sbjct: 123 QHTPVTCDECGESHPSDQCPHSVESIQFVSNARKPQNNPYSNTYNPGWRQHPNFSWNNNQ 182
Query: 210 GQGSNVQAQQKVNQLGFAKAQVLPQQNKQALPQQNSGSSLEVMMKEFMALTDAAIQSNQA 269
GQGS + QQ Q + P Q SLE + +FMA S A
Sbjct: 183 GQGSAPRFQQ-------------GGQQQVQQPIQEKKPSLEETLIQFMA-------STAA 242
Query: 270 SMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKVVTLRSGKPLEEPRKTQDI 329
+ + +E Q+GQLAN + +RPQG LPS+TE +PR++GK Q + VTLR+G R+ Q++
Sbjct: 243 NFKTMETQIGQLANAINSRPQGSLPSNTEPNPRQDGKAQCQAVTLRNG------RELQEV 302
Query: 330 EKDSNKNAVVEKELESSQGAGGSNKDVGAPGSVPDVEPPYVPPPPYVPPLPFPQRQKPKN 389
K+ K+ EKE+ S + K+V AP V +P + P PFPQR + +
Sbjct: 303 VKEPTKSK--EKEVISEE----KEKEVEAPLEVS--KPTTLQP-------PFPQRLQKQK 362
Query: 390 QDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILIKKKRLGEFETVSLTEECSVI 449
+ QF KFLE+ K+LHINIP EA+EQMP+Y KF+KDIL KK+RLG++ETV+LTEECS I
Sbjct: 363 LEKQFLKFLEVFKKLHINIPFAEALEQMPSYVKFMKDILSKKRRLGDYETVALTEECSAI 422
Query: 450 LKNGLPPKAKDPGSFTIHVSIGGKELGRALCDLGASINLMPLSIYRKLGIGEARPITVTL 509
++N LPPK KDPGSFTI +IG GRALCDLGASINLMP SIYR LG+GEA+P ++TL
Sbjct: 423 IQNKLPPKLKDPGSFTIPCTIGTHFSGRALCDLGASINLMPYSIYRTLGLGEAKPTSITL 482
Query: 510 QLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVQIILGRPFLATGRALIDVQ 569
QLADRS+TYP+G IED+LVKVDKFIFP DF++LD E D +V IILGRPFLATGR LIDVQ
Sbjct: 483 QLADRSLTYPKGVIEDILVKVDKFIFPADFVVLDMEVDIEVPIILGRPFLATGRTLIDVQ 542
Query: 570 KGELTMRVCNEEVKFNVFKAMKYPNEMEDCSFIRILES-----TVIETAIQDSADKHLEG 629
KGELTMRV ++++ FNVFKAMK+PNE ++C + + ++ ++ E ++ L+
Sbjct: 543 KGELTMRVQDQQITFNVFKAMKFPNESDECFAVNLFDNLAGNESIAEQSLDPLERALLDL 602
Query: 630 HGEVSVEDLQVCLLERKNEKELFRCEDVFESLDLDQRKAPP--IKPSLIEAPTLDLKPLS 689
E + ED +V ++ + + F+ V ESL +R AP +KPS+ E PTL+LKPL
Sbjct: 603 LDEENEEDYEV--VKTLDASKYFKSRGV-ESL---ERIAPSKVLKPSIEEPPTLELKPLP 662
Query: 690 DHLKYVYLGEGETLPIIVASDLMTEHEEALIKWLQQYRKAIGWTLAEIQGISPSFCMHKI 749
HL Y YLGE +TLP+I++S L E L++ L+ ++ AIGWT+ +I+GISPSFCMHKI
Sbjct: 663 SHLCYAYLGESDTLPVIISSSLSDLQVEKLLRVLRNHKGAIGWTIVDIKGISPSFCMHKI 722
Query: 750 TLEEGSLRSVEQQRRLNPAMKEVKK----------------------------------- 809
LE+ SVE QRRLNP MKEV K
Sbjct: 723 LLEDDQKPSVESQRRLNPIMKEVVKKEIIKWLDAGIIYPISDRGITVVPNMHNELIPTRT 782
Query: 810 ------------------------------------------------------------ 869
Sbjct: 783 VTGWRVCMDYRKLNKATRKDHFPLPFIDQMLDRLAGKEFYCFLDGYSGYNQIAIAPEDQE 842
Query: 870 ------------------------------------------------------------ 892
Sbjct: 843 KITFTCPYGTFAFRRMPFGLCNAPATFQRCEDTNLILNWEKCHFMVQEGIVLGHKISNRG 902
BLAST of Lag0032147 vs. ExPASy TrEMBL
Match:
A0A1U7XC36 (uncharacterized protein LOC104232916 OS=Nicotiana sylvestris OX=4096 GN=LOC104232916 PE=4 SV=1)
HSP 1 Score: 519.2 bits (1336), Expect = 5.1e-143
Identity = 343/989 (34.68%), Postives = 477/989 (48.23%), Query Frame = 0
Query: 245 QSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKVVTLRSGKPLEE-PR 304
Q + R LE Q+GQLA RP G LPSDTE QV VTLR+G+ LEE P+
Sbjct: 17 QQLRTDFRNLERQMGQLATTQNTRPAGALPSDTEK-----NPQVNAVTLRNGRELEEVPK 76
Query: 305 KTQDIEKDSNKNAVVEKELESSQGAGGSNKDVGAPGSVPDVEPPYVPPPPYVPPLPFPQR 364
K +D K + ++ K + + ++ V AP PP PFPQR
Sbjct: 77 KNKD--KPIPEGKLIPKVTQEQKNVAEVSEPVEAP----------------KPPPPFPQR 136
Query: 365 QKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILIKKKRLGEFETVSLTE 424
+ KN D F KFL +L Q+ +NIPLV+ ++++P YAK++KDI+ K+RL +F TV+LTE
Sbjct: 137 LQKKNDDRMFTKFLSMLSQVQLNIPLVDVLQEIPKYAKYIKDIVAHKRRLTDFATVALTE 196
Query: 425 ECSVILKNGLPPKAKDPGSFTIHVSIGGKELGRALCDLGASINLMPLSIYRKLGIGEARP 484
E + ++N LP K KDPGSFTI V IG ++G ALCDLGASINLMPLS++++LG+G RP
Sbjct: 197 ESTSRVQNKLPQKLKDPGSFTIPVRIGNVDVGHALCDLGASINLMPLSLFKQLGLGAPRP 256
Query: 485 ITVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVQIILGRPFLATGRA 544
TV LQLADR I +PEG IEDVL ++ KFIFP DFIILDYEAD+ V IILGRP LATG A
Sbjct: 257 TTVMLQLADRLIAHPEGVIEDVLQQIGKFIFPADFIILDYEADELVPIILGRPLLATGDA 316
Query: 545 LIDVQKGELTMRVCNEEVKFNVFKAMKYPNEMEDCSFIRILESTV----IETAIQDSADK 604
+I V++G++ +RV +EE FNV++A++ P E+ S I ++E+ V + DS +K
Sbjct: 317 IIKVREGKMILRVDDEEAVFNVYRAIQLPRHYEELSMISVVEAVVKIHYPSVYLDDSLEK 376
Query: 605 HLEGHGEVSVEDLQVCLLERKNEKELFRCEDVFESLDLDQRKAPPIKPSLIEAPTLDLKP 664
L + ++ +V + E + L++ + P + S+ EAP L+LKP
Sbjct: 377 TLMLLNSLGADE-EVEEMMHILETSCAYLQGTHPFEPLNRPEGPLTRTSIEEAPKLELKP 436
Query: 665 LSDHLKYVYLGEGETLPIIVASDLMTEHEEALIKWLQQYRKAIGWTLAEIQGISPSFCMH 724
L HL+Y YLG+ +TLP+IV+ DL EE L++ L+++++A+GWT+ +I+GISP+FCMH
Sbjct: 437 LPPHLQYAYLGDSDTLPVIVSYDLSKLQEEKLLRVLREHKRALGWTMYDIKGISPAFCMH 496
Query: 725 KITLEEGSLRSVEQQRRLNPAMKEVKKEE------------------------------- 784
KI +E+G VEQQRRLN MKEV ++E
Sbjct: 497 KILMEDGHKPIVEQQRRLNTIMKEVVRKEVIKWLNAGIVFPISDSKWVSPVQCVPKKGGM 556
Query: 785 ---------------------------------------------------------FDL 844
F+L
Sbjct: 557 TVVVNENNDLIPTRTVTGLLEKDVAFKFDDACLKAFEELKGRLVIAPIIIAPDWEQPFEL 616
Query: 845 ------------------------------------------------------------ 904
Sbjct: 617 MCDASDLAVGAVLGQRRNKIFHSIYYASKTLNPAQMNYTVTETKLLAXGHILWGLKSSST 676
Query: 905 ------EIKDKKGSENVIADHLSRLDPSSSLLEQSAISDSFSDEQLFAVEVKVVRDIPWY 962
EI+D+KG+EN + DHLS+L+ + + E AI ++F DEQL A+ V PWY
Sbjct: 677 QIIQPSEIRDRKGTENQVVDHLSKLENRNHVAEGDAIKETFPDEQLLAITSSTV---PWY 736
BLAST of Lag0032147 vs. ExPASy TrEMBL
Match:
A0A2G9GSZ3 (DNA-directed DNA polymerase OS=Handroanthus impetiginosus OX=429701 GN=CDL12_19098 PE=4 SV=1)
HSP 1 Score: 518.1 bits (1333), Expect = 1.1e-142
Identity = 383/1113 (34.41%), Postives = 506/1113 (45.46%), Query Frame = 0
Query: 150 QVTEETCVYYGEDHNYEFFPSNPASVFFV-----------------GWRNHPNFSWG--- 209
Q T TC GE H + P + S+ FV GWR+HPNF+W
Sbjct: 112 QHTPVTCEECGEGHPSDQCPHSIESIQFVSNARKPQNNPYSNTNNPGWRSHPNFAWNNNQ 171
Query: 210 GQGSNVQAQQKVNQLGFAKAQVLPQQNKQALPQQNSGSSLEVMMKEFMALTDAAIQSNQA 269
GQGS + QQ V QQ +Q P Q SLE + +FMA S A
Sbjct: 172 GQGSAPRFQQGVQ-----------QQVQQ--PMQEKKPSLEETLIQFMA-------SIAA 231
Query: 270 SMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKVVTLRSGKPLEEPRKTQDI 329
+ + +E Q+GQLA+ + +RPQG LPS+TE +PR++GK Q + VTLR+G+ L+E K +
Sbjct: 232 NFKMMETQIGQLASAINSRPQGSLPSNTELNPRQDGKAQCQAVTLRNGRELQEVVK--EP 291
Query: 330 EKDSNKNAVVEKELESSQGAGGSNKDVGAPGSVPDVEPPYVPPPPYVPPLPFPQRQKPKN 389
K K + EK+ K+V AP
Sbjct: 292 TKSKGKEVIFEKK----------EKEVEAP------------------------------ 351
Query: 390 QDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILIKKKRLGEFETVSLTEECSVI 449
L++LHINIP EA+EQMP+Y KF++DIL KK+ LG++ET
Sbjct: 352 -----------LEKLHINIPFAEALEQMPSYVKFMEDILSKKRHLGDYET---------- 411
Query: 450 LKNGLPPKAKDPGSFTIHVSIGGKELGRALCDLGASINLMPLSIYRKLGIGEARPITVTL 509
N LPPK KDP SFTI + G LG+ALCDLGASINLMP IYR LG+GEA+P ++TL
Sbjct: 412 --NKLPPKLKDPESFTIPCTSGTHFLGKALCDLGASINLMPYLIYRTLGLGEAKPTSITL 471
Query: 510 QLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVQIILGRPFLATGRALIDVQ 569
QLADRS+TY +G IED+ VKVDKFIFP DF++LD EAD ++ IILGRPFLATG LIDVQ
Sbjct: 472 QLADRSLTYSKGVIEDIFVKVDKFIFPADFVVLDVEADSEIPIILGRPFLATGLTLIDVQ 531
Query: 570 KGELTMRVCNEEVKFNVFKAMKYPNEMEDCSFIRILESTVIETAIQDSADKHLEGHGEVS 629
KGE TMRV ++ + FNVFKAMK+PNE ++C + + ++ + A++ L+
Sbjct: 532 KGEFTMRVQDQHITFNVFKAMKFPNESDECFSVSLFDNLAENKLV---AEQSLD-----P 591
Query: 630 VEDLQVCLLERKNEKE-----LFRCEDVFESLDLD--QRKAPP--IKPSLIEAPTLDLKP 689
+E + LL+ KNE++ + F+S ++ +R AP +KPS+ E PT +LKP
Sbjct: 592 LERALLDLLDEKNEEDREVVKILDASKYFKSRGVESLERIAPSKILKPSIEEPPTFELKP 651
Query: 690 LSDHLKYVYLGEGETLPIIVASDLMTEHEEALIKWLQQYRKAIGWTLAEIQGISPSFCMH 749
L HL Y YL E +TLPII++S L E L++ L+ ++ AIGWT+A+I+GISPSFCMH
Sbjct: 652 LPSHLCYAYLCESDTLPIIISSSLSDLQVEKLLRVLRNHKGAIGWTIADIKGISPSFCMH 711
Query: 750 KITLEEGSLRSVEQQRRLNPAMKEV----------------------------------- 809
KI LE+ VE QRRLNP MKEV
Sbjct: 712 KILLEDDQKPYVESQRRLNPIMKEVVKKEIIKWLDAGIIYPISDSSWVNPIQCVPKKGGI 771
Query: 810 ------------------------------------------------------------ 869
Sbjct: 772 TVVPNIHNELIPTRTITGWRISTFQRCMMAIFTDIVENFLEVFMDDYFVYGNSFDECQNN 831
Query: 870 ------------------------------------------------------------ 929
Sbjct: 832 LSSVLKRCEDTNLVLNWEKCHFMVQEGIVLGHKVSNRGIEVDKTFPFKLMCDASDFAIGA 891
Query: 930 ------------------------------KKE----------------EFDLEIKDKKG 962
KKE EFDLEI+D+KG
Sbjct: 892 VLGQRKDKIFRWIYYANKTLNDAQLNYTTTKKELLAVVFAFDKFRSYLVEFDLEIRDRKG 951
BLAST of Lag0032147 vs. ExPASy TrEMBL
Match:
A0A6A2WLX1 (Reverse transcriptase OS=Hibiscus syriacus OX=106335 GN=F3Y22_tig00116939pilonHSYRG00212 PE=3 SV=1)
HSP 1 Score: 516.9 bits (1330), Expect = 2.5e-142
Identity = 438/1503 (29.14%), Postives = 591/1503 (39.32%), Query Frame = 0
Query: 57 PQIQTANFEMKPVMFQKLQTVGQFHGLSSEDPHLHLKSFLGVIDSFVIQGVPRDALRLTL 116
P+IQ A+FEMKPVMF L ++GQF G+ +ED H+++FL V DSF +GV D L+L L
Sbjct: 432 PEIQAAHFEMKPVMFNMLNSIGQFGGMPTEDVRQHIRNFLEVCDSFRQEGVHEDFLKLKL 491
Query: 117 FPTDL------------------------------------------------------- 176
FP L
Sbjct: 492 FPYSLRDRARAWLSGVPVGSMESWVDLCKSFLLRYNPPNMNTQLRNEISSFRQGDDESMY 551
Query: 177 -----------------------------------AMIANALKNVT-------------- 236
M+ +A N T
Sbjct: 552 ECWDRYKSLLQKCSYHGFHDWTQVVMFYNGVNAPTRMLLDASANGTLLDKSPTEAFAILD 611
Query: 237 --------------GISHQQPPAME------------------------STVVVSQVTEE 296
G + P A E + V + T
Sbjct: 612 RIANNDYQFPSSRLGSGRRAPGAFELEAKDSVSAQLSVITNMLKNLQCSTDVKEVKTTSL 671
Query: 297 TCVYYGEDHNYEFFPSNPASVFFV-----------------GWRNHPNFSWGGQGSN--- 356
C+ +H+ P+N S+ FV GWR HPNFSW QG++
Sbjct: 672 ACLLCQGNHHESECPTNHESINFVGNYNRGSNNPYSNTYNAGWRQHPNFSWENQGAHNAN 731
Query: 357 --VQAQQKVNQLGFAKAQVLPQQNKQALPQQNSGSSLEVMMKEFMALT------------ 416
+ Q G+ A NK+AL S SSLE ++EF++ T
Sbjct: 732 QPTRQQNHNEPQGYQNAMPWHNANKRAL-SSASISSLEATIQEFISTTKTMLQDHSTSIK 791
Query: 417 ---------DAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKVV 476
A IQS+ +S+RALE QVGQ+A L+ R QG+LPSDTE + GKE V+
Sbjct: 792 NQGALLHSQGALIQSHSSSLRALEGQVGQIATALQERQQGRLPSDTEVTKGPGKEHCNVL 851
Query: 477 TLRSGKPLEEPRKTQDIEKDSNKNAVVEKELESSQGAGGSNKDVGAPGSVPDVEPPYVPP 536
TLRSG + K +D K + +A V++ ++P
Sbjct: 852 TLRSGTQINRQDKEEDFAKVPDYDAKVKEN--------------------------FIPA 911
Query: 537 PPYV-PPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILIKK 596
PP PFPQR K N + QFKKF++IL QLHINIPL+EA+EQMP YAKF+KDI KK
Sbjct: 912 AKEARPPPPFPQRLKKHNNEVQFKKFVDILDQLHINIPLLEAVEQMPMYAKFMKDICTKK 971
Query: 597 KRLGEFETV-SLTEECSVILKNGLPPKAKDPGSFTIHVSIGGKELGRALCDLGASINLMP 656
+++ ETV + TE CS K L PK DPGSF I SIG +G+ALCDLG+S+NL+P
Sbjct: 972 RKV---ETVATATEFCSSSSK--LSPKRNDPGSFIIPCSIGANFVGKALCDLGSSVNLIP 1031
Query: 657 LSIYRKLGIGEARPITVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDV 716
SI+ KLGIG+ARP +V LQLAD+S EG++EDV+V+VDKF+F VDF+ILD E D
Sbjct: 1032 KSIFLKLGIGDARPTSVILQLADKSHVKLEGRVEDVIVRVDKFVFTVDFLILDCEVDAKA 1091
Query: 717 QIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPNEMEDCSFIRILESTVI 776
IILGRPFLATGR LID +KGELTMRV ++ V NVF+ +KY ++ E+C I L S +
Sbjct: 1092 PIILGRPFLATGRILIDYEKGELTMRVVDQCVTVNVFRTLKYVDDTEECQGISELNSVIE 1151
Query: 777 ETA--------IQDSADKHLEGHGEVSVEDLQVCLLERKNEKELFRCEDVFESLDLDQRK 836
E IQ + +++L E VE +LE ++ R + FE L+ D+
Sbjct: 1152 EETEHLCQNNFIQLAENEYLV-DDESLVESDDFPILEEQSSLVQVRSDINFEPLNFDEFI 1211
Query: 837 APPIKPSLIEAPTLDLKPLSDHLKYVYLGEGETLPIIVASDLMTEHEEALIKWLQQYRKA 896
+P KPSL+ AP L+LK L HLKYVYLG ETLP+I++++L E++L+ L Q++KA
Sbjct: 1212 SP--KPSLLHAPNLELKTLPGHLKYVYLGSDETLPVIISANLTANQEQSLLSVLMQHKKA 1271
Query: 897 IGWTLAEIQGISPSFCMHKITLEEGSLRSVEQQRRLNPAMKEVK---------------- 956
IGWT+A+++GISP+ CMHKI LE+ S+E QRRLNP MK+VK
Sbjct: 1272 IGWTMADLKGISPTICMHKILLEDCHGNSIEPQRRLNPIMKQVKGGTTVVTNEDNELLPT 1331
Query: 957 ------------------------------------------------------------ 962
Sbjct: 1332 RTVTGWRICMDYRKLNKATKKDHFPLPFIDQMLDRLAGKAFYCFLDGYSGYNQIAIAPED 1391
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_024028757.1 | 1.4e-158 | 50.67 | uncharacterized protein LOC112093792 [Morus notabilis] | [more] |
PIN20438.1 | 1.8e-158 | 33.94 | DNA-directed DNA polymerase [Handroanthus impetiginosus] | [more] |
XP_017239676.1 | 2.3e-153 | 32.18 | PREDICTED: uncharacterized protein LOC108212460 [Daucus carota subsp. sativus] | [more] |
XP_017239676.1 | 3.0e-04 | 30.58 | PREDICTED: uncharacterized protein LOC108212460 [Daucus carota subsp. sativus] | [more] |
XP_008231996.1 | 3.5e-05 | 47.69 | PREDICTED: uncharacterized protein LOC103331166 [Prunus mume] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A2G9HSD1 | 8.7e-159 | 33.94 | DNA-directed DNA polymerase OS=Handroanthus impetiginosus OX=429701 GN=CDL12_068... | [more] |
A0A2G9GK35 | 3.7e-149 | 36.56 | Reverse transcriptase OS=Handroanthus impetiginosus OX=429701 GN=CDL12_21798 PE=... | [more] |
A0A1U7XC36 | 5.1e-143 | 34.68 | uncharacterized protein LOC104232916 OS=Nicotiana sylvestris OX=4096 GN=LOC10423... | [more] |
A0A2G9GSZ3 | 1.1e-142 | 34.41 | DNA-directed DNA polymerase OS=Handroanthus impetiginosus OX=429701 GN=CDL12_190... | [more] |
A0A6A2WLX1 | 2.5e-142 | 29.14 | Reverse transcriptase OS=Hibiscus syriacus OX=106335 GN=F3Y22_tig00116939pilonHS... | [more] |
Match Name | E-value | Identity | Description | |