Lag0035274 (gene) Sponge gourd (AG‐4) v1

Overview
Name: Lag0035274
Type: gene
Organism: Luffa acutangula (Sponge gourd (AG‐4) v1)
Description: Reverse transcriptase
Location: chr3: 17895485 .. 17902703 (-)
RNA-Seq Expression: Lag0035274
Synteny: Lag0035274
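The Location field packs a chromosome name, start and end coordinates, and a strand into one string. The following is a minimal Python sketch of reading such a string. It assumes the coordinates are 1-based and inclusive, which this page does not state; `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re

# Matches strings shaped like "chr3: 17895485 .. 17902703 (-)".
LOCATION_RE = re.compile(r"^\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text):
    """Split a location string into (chromosome, start, end, strand)."""
    match = LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    chrom, start, end, strand = parse_location("chr3: 17895485 .. 17902703 (-)")
    print(chrom, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span is the end coordinate minus the start coordinate plus one; with a half-open convention the `+ 1` would be dropped.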
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAACGAATTCCCTGACTTAGAATTCGGTTTGGACCCTGAAATTGAAAGGACTTTTCATAGGAGGAGAAGGGAACAAAGGGAGAGAAACAATAGGATGGATCCACCACCTCTTTTACCCCCTGTACCTCCTAGAAATCAAGAAAACCAAAACAACCAGCAGCCTAATCTTGAGGTATATCAGCAACCTAGGGTAGAAAATCAAGCTGAAAACCCTGTCCTTATAGCCAATGATAGGGGAAGAGCTATTAGAGCCTATGCCGTCCCTACCTTCAATGAGTTGTACCCTGGGATAGCTAGACCCGAGATACAGGCACACAATTTTGAAATGAAGCCTGTTATGTTCCAAATGTTACAAACGGTGGGACAATTTCATGGATTACCTTCTGAGGACCCACATCTACACCTTAAATCCTTTTTGGGAGTGAGTGACTCATTTAAAATTCAAGGAGTTTCTCAAGATGCTTTGAGGTTAACCTTGTTTCCTTATTCTTTGAGAGATGGTGCCAAAACTTGGTTAAATAACTTTGCTCCTGGCTCCATCACTACGTGGAATGAGTTAGCAGAGAAATTCCTCATTAAGTACTTCCCTCCCACAAGAAATGCCAAGCTTAGATCGGATATAGTGTCTTTTAGACAATTTGATGATGAATCTTTTAGTGAAGCTTGGGAGAGATTTAAGGAATTATTGAGAAAATGCCCTCACCATGGACTCCCCCATTGCATTCAAATGGAGACTTTTTATAATGGGTTGAATTGGGCTACTCAAAACATGGTAGATGCTTCAGCTAATGGTGCTTTATTATCCAAGACTTATGATGAAGCTTATGCCATTTTGGAGAGAATTTCCACAAATAGTTGCCAATGGTCTGATTCTAGAAGTGTGGCTGGGAAGAAAAATAAGGGTATGCTTGAGGTTGACACTGTTACATCCTTTAATGCAAGGTTTGATTCAATGGAAAGCATGATGAAAAATTTAAATTCAAGCTTTGAGAATTTGCAGTTGATGAATTCCAGTGCAAACCAGTCTGCTGCAGCAATCAATATAAATCAGAATGCAGCAGATTCTTGCGTTTACTGTGGAGAGGATCACAAGTTTGAATTTTGCCCTAGAAATCCAGCCTCAGTGTGTTATGTAGGAAACCAAAATGCTCCTAGGAATAATCCCTATTCCAATACCTATAATCCGGGTTGGAGAAATCATCCAAATTTTGCTTGGGGAGGTGATAACTGCCCAAAAGGTAGTTATTTAGGCCTCAATTATATAGGGATTTGTGTGTCCTTTATGCTTAATAGGGTTGGTTTTAGAGAGATATTACATGATTTAACCCATTTGGAAGAATGGAAGCTTTGGAAATCACTTTGTGCAGATTATGTTGCTGAGCGACTGGAAGGAGCAAATTCTATGCTGCAGCAAAACTGGAAGCAGAAACTGCCACATCACAGCTCGTTAGCCAACTTCATGAACCGACTTCTATTGAGATATTTTCGGGATAAAGGATCAAGGAGAGCCTTACACGTGTCCTAGTTGACCTAATTCAAGCCTCCCATCCTTCTATAAAAGGAAGTAGCCGTCAAAGTCAAATGGATTCCATGTTTTTCCCTTGGAAGCCGAATAGTGTAGAGATTGAAAATTGTCAAGTACACCATCATCCGCAAGCTAATTAGGGAGTTTATTTTCCTTCTTAATTTGTGGTTATGGGCATGCATACTCAGATTGTAATTCATATTCCTATGCTTATTATGAACTAAGTAGCTTAGGGTGTTGGTTAGGAATAATGTGATTACTGTGCTTCATCTCTAGTAATAAATTGACTATAGCATTTTGGTTAATTGTTGTGCTTATTCTTATTTTATGCCCGCCTGGCCAGCCGGTATGGATAGCTCTCTCTAGTTTCAGGGATGATTTAGAGAGTGTGTGGCAATTCTCCACATTACGATCATTGTGTATTGGTGAACGCCAGTGACTGAGCTAGGGATAGCCAGTTGCCTAGTGCCTGA
TCGGTCGCGCCTCATTGGGGTTGAAACACATCCCAAAAGATTCGGAGTTTCGGGAGGAATCCGCTAATAATACTGCAGGAAATGCATGATCAATAATATTGCTAGAGCATATTTTATAACAATCACTAAAACCTAACCAGCGCCTGTCTTTAATCTGTCTATCTATTTGTTTACTCGTTGTTCTGTCATTTATTTTGCTTCAGCATAATTCCGATAAAACCCAATCATTTTATTATTCTTGAAACCACTAGAAAAATTGTGCAAAATAGGTAAAGTGGAGATTATTCAGACCTTGGATTGATAATATTACTTGTTGCAGACTGTGTATACTTGCACAGGAATTCTAGAGTATTTTATACTACATTTTTTTACTCACTCTAGGAACCAAACAAGTTTTTGGCGCCGTTGCCGGGGACTTGAATAAGTTTTTATCTGATCTATTTTGTGTATGTGTTTTAGTGGTTTGGTGCTAGAACTTTTTATTGTTTCTTTTGTTTTTGCAGATGTGATTTTGGTGCATGAGCGATCCGCCTGGGGTACGGTTTGAGCTTGATCCAGAAATTGAAAGGACATTTAGGAACAGAAGGAGGGAGCAGCGCAGAAGCCAGATGGAGAACGCCTGATAGCGAGCATAGCTAGCTATCTTTGCTGATTTAATTTAGCTAGTCTTCTATGGTGTTTGGGCCATTTTATTGTGCTTTATTGTTTATTTTGTAGGAAATGGTTGATCGGAGCTATAATGCAATATCAGAGTTAATCGGGTGCTTGGGGCGTGAAAAGATGCAAAGGAATGAAAAGAGTAAAATTGGAGAAAAGTCAAATCTCGGTCAACAGCAGATTAGCGTCGAGACGCTAGCTCTTGAGCGTCTCGACGCTCACATTCCATATCAGATTAGGCGCGTAAAGCTTACAGTGTCGAGACGCTATGATAGGAAGCGTCCCGACGCTTCCGTTTTTCCTTATTCAGAACGCGCGTATAAGAGGCAGCGTCGCGACGCTGTCTTGACAGCGTCTCGACGCTAAGACGAAAAAATCAGAATAAAAGCTTCACTTCGGCTAGGTTTTTAGGGAGTTCAATTTGGGACCTTTTGGAGCCGTAAAGAGGACAGAACAGAGCATTTGGAGGCTGAAACAAAGGGGGAGACTTGGAAATCAACCCATTGTTCGTGGGGATCGTGACGGGGACGTCGGGACTCGGCCTACATTGAGTTTTCTTTTCCCTTTTCTTCCTTTAGTTAGATTTATGGCTTTCATATGTTTAGTTGAGATTGTATTTTCATTCATGAGTAGCTAAACTTGTAGCTTAGGGATTGATGTAGTTGATTAAGTTTATGGATTGATTTATGGTTTAAGTTGTGCAATGTTATTGTGTTTGTAATCTTGATTTAATGCTTTCAATTAATTGGCCATTAGTTGAATGATTTAAGTTTCGAATCTAGCTCGGGAGAGGGGATTTGATCTAGACTTACTCAAGATTGCATGAATAAAGTTTACTTTCTAATTCTTGACGGTTTTAGATTAACTTTAGAATTGGTTAAAAGATCTCATTGCCTTGCATATTAATAAACTTAGATAGGGATATCAAGTTGGTTAATTTGCATGAGAATCAATTATACTCGGGAGAGGTAGTTGTGATTAAAGGGTCTTTGGCTCATAATTTCATTGTGCATAATAACTTATTCTTTAAAATCAATCCTAAACGAGATCGGCGAAATCAATTCCCTAAGTAGTTTTCCCAATTGAATTTCCATTCTTTTTACTTGCTTTGATTTGGTTTTATTATCATCTTTTCATCACTTTTTGATCCCTAAATAAAATTGGGCAAGATAATTCACAATACTTGATTTGCTTAAAAATCCAATCCAGAGGACGACACTCGTATTTACGAACACTTTATTATTACTTGACCCGTGCGCTTGCGGTTAGGAAAAGTCCACATCAACGCCCCGCAACTCCCGTAGGTTCATGAAGGTCCAACAGCAGCAAACCCCCAGCAGAAC
TCGTTGCTGCAGCAAAACCCACTGTTTGAGCAAAATGAGCAGCGAAATAATCAGGCTGAGAATCCTATCTTGATAGCGAACGATAGGACCAGAGCCATTCGAGCATATGCTGTCCCGATGTTTAATGAGTTGAATCCAGGGATTGCACGTCCCCAAATCCAAGCGGCGAATTTTGAAATGAAACCGGTAATGTTTCAGATGTTGCAAACCGTGGGGCAATTCCATGGTTTGTCATCTGAAGACCCTCATTTACATCTTAAGTCTTTTCTAGGAGTTAGTGATTCTTTTGTAATTCAGGGAGTGCCTAGAGATGCTCTTAGATTAACTTTGTTCCCGTATTCTCTTAGAGATGGAGCAAAGTCATGGTTAAACTCTTTTGCTCCATGATCAATTAGGACGTGGGATGAGTTAGCTGAAAATTTTTTGAGTAAATATTTCCCACCTAATAGAAATGCTAAATTAAGGAGTGAAATAGTAGGGTTTAGGCAACTTGAGGATGAGACTTTTAGTGAGGCTTGGGAAAGGTTCAAGGAGCTTTTGCGAAAGTGTACCCACCATGGTTTACCTCATTGTATTCAAATGGAAACATTTTACAATGGTTTAAATGGAGTAACCCAAGGTATGGTCGATGCTTCGGCTGGAGGGGCCCTTTTGGCAAAACCTTTTGATGAAGCCTATGAAATGTTAGAAAGAATATCTATTAATAGTTGTCAGTGGTCGGATGTTAGAGGCAAAAATAAAAAGGTTAAGAGTGTGTTAGAGGTTGATGGTGTGTCCACCATTAGGGCCGATCTTGCTATGATTGCTAACGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCAGCCACCAGCCATGGAGCCTGCAGCAGTGGTGAACCAAGTCACGGATGAAGCATGTGTCTATTGCGGTGAAGACCACAACTACGAGTTTTGCCCCAGCAATCCAGCTTCTGTGTTTTTTGTAGGTAATCAGAGGAACAACCCTTATTCTAACTTTTATAATCCAGGTTGGCGCAACCACCCCAATTTCTCATGGGGAGGTCAAGGAAGTAACGTACAAGCGCAACAGAAGATGAACCAGTCGGGATTTGCTAAAACGCAGGTAATGCCCCAGCAAAATAAGCAGGCTTTGCCCCAGCAAAATTCGGGAAATTCTCTCGAGGGGATGATGAAAGAATTTATGGCTCGCACAGACGCCGCAATTCAAAGTAATCAAGCTTCGATGAGGGCCCTGGAATTGCAAGTGGGTCAGCTAGCTAATGAGCTAAAGGCAAGGCCTCAAGGGAAACTTCCATCCGATACTGAACACCCTCGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTAACTCTTAGGAGTGGTAAGCCACTAGAAGAGTCTAGAAAGACCCAGGATTTAAATAGTAATAGTGATAATATTATTGTTATTGAAAAAGAGTTGGAGTCTGGTCAGGGTGTTGGAGGTAGCAAAGAGAATGCTGGAGCATCTGGTTCGGTGCCAGATGTAGAACCACCATATGTGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGCAAAAGCCTAAGAATCAGGATGGTCAATTTAAAAAGTTTTTAGAGATTCTTAAGCAGTTGCACATAAATATCCCTTTAGTAGAAGCTATTGAGCAAATGCCTAATTATGTTAAATTTCTTAAGGATATTTTGACTAAAAAGAAAAGGTTAGGTGAGTTTGAAACTGTATCTCTTACTGAGGAGTGTAGTGTTATTCTTAAGAATGGGCTACCCCCCAAGGCTAAGGATCCAAGGTCATTTACTATACCTGTGTCTATAGGTGGAAAGGAATTAGGTAGAGCACTCTGTGATTTAGGTGCAAGCATTAACCTTATGCCTCTTTCGGTCTATCGAAAGTTAGGTATTGGTGAAGCTAGGCCTACCACAGTTACACTCCAATTAGCTGATAGGTCTATCACATATCCAGAGGGGAACATTGAGGATGTCTTAGTAAAAGT
TGATAAATTCATATTTCCTGTTGATTTTATTATTTTAGACTATGAGGCTGATAAAGATGTCCCAATTATTCTAGGTCGTCCATTTTGGCTACTGGTAGGACATTAATAGATGTTCAGAAAGGAGAATTAACAATGAGAGTCTGTAATGAGGAAGTAAAATTTAATGTGTTTAAAGCCATGAAATATCCAGACGAAATGGAGGATTGCTCCTTCATTAGGATTCTAGAGAGCACAGTTATTGAGAAAGCAACACAGGATTCGGCTGATAAGCATTCGGAAAAGCATGGAGAGGTTAGTGTAGAAATTTTGAATTTTGTTCTTTAGATAAAAAAAATGATGAAAAGAATTGTTTAGGTGTGAGGATGTTTTTGAATCTTTAGATTTAGATCAAAGAAAGGCTCCCCCAATTAAGCCATCCCTAATTGAGGCACCCACTTTAGATTTGAAACCTTTACCGGATCATCTAAAGTATGTGTATCTTGGGGAAAGTGAGACGTTGCCCATTATTGTTGCATTAGATTTAATGCCGGAGCATGAGGAGGCCTTAATAAAATTGCTGCAGCAATACCGCAAAGCTATAGGTTGGACATTGGCTGATATTCAGGGAATTAGCTCGTCCTTTTGTATGCACAAAATCACTCTAGAGGAGGGATCCTTTAGGAGTATTGAGCAACAAAGAAGGCTTAACCCTGCAATGAAAGAGGTTGTTAAAAAGGAGGTAATTAAATGGTTGGATGCTGGGATCATTTATCCAATTGCCGATAGCAATTGGGTAAGCCCTGTCCAATGTGTTCCTAAGAAAGGAGGTGTCACTGTGGTGTGCAATAAAGATAATGAGTTGATCCCCACAAGGACAGTAACTGGCTGGAGGGTTTGTATGGATTACAGAAGGCTTAATAAAGCCACTCGAAAGGATCATTTCCCTCTACCATTTATCGATCAGATGTTGGATCGATTGGCTGGTCAGACCTATTACTGTTTCTTGGATGGTTATTCTGGGTATAACCAGATTACTATTGCTCCTGAGGATCAGGAAAAAACCACTTTCACTTGCCCTTATGGGACATTCGCTTTTAGGAGAATGCCTTTCAGCCTTTGCAATGCTCAAGCAACATTTCAGCGGTGTATGTTAACAATTTTTTCTGATATGATTGAGTCCACTGTTGAAGTCTTTATGGACGATTTTTCAGTTTTGGAGGGTCTTTTCAGAGTTGTTTAG

mRNA sequence

ATGAACGAATTCCCTGACTTAGAATTCGGTTTGGACCCTGAAATTGAAAGGACTTTTCATAGGAGGAGAAGGGAACAAAGGGAGAGAAACAATAGGATGGATCCACCACCTCTTTTACCCCCTGTACCTCCTAGAAATCAAGAAAACCAAAACAACCAGCAGCCTAATCTTGAGGTATATCAGCAACCTAGGGTAGAAAATCAAGCTGAAAACCCTGTCCTTATAGCCAATGATAGGGGAAGAGCTATTAGAGCCTATGCCGTCCCTACCTTCAATGAGTTGTACCCTGGGATAGCTAGACCCGAGATACAGGCACACAATTTTGAAATGAAGCCTGTTATGTTCCAAATGTTACAAACGGTGGGACAATTTCATGGATTACCTTCTGAGGACCCACATCTACACCTTAAATCCTTTTTGGGAGTGAGTGACTCATTTAAAATTCAAGGAGTTTCTCAAGATGCTTTGAGGTTAACCTTGTTTCCTTATTCTTTGAGAGATGGTGCCAAAACTTGGTTAAATAACTTTGCTCCTGGCTCCATCACTACGTGGAATGAGTTAGCAGAGAAATTCCTCATTAAGTACTTCCCTCCCACAAGAAATGCCAAGCTTAGATCGGATATAGTGTCTTTTAGACAATTTGATGATGAATCTTTTAGTGAAGCTTGGGAGAGATTTAAGGAATTATTGAGAAAATGCCCTCACCATGGACTCCCCCATTGCATTCAAATGGAGACTTTTTATAATGGGTTGAATTGGGCTACTCAAAACATGGTAGATGCTTCAGCTAATGGTGCTTTATTATCCAAGACTTATGATGAAGCTTATGCCATTTTGGAGAGAATTTCCACAAATAGTTGCCAATGGTCTGATTCTAGAAGTGTGGCTGGGAAGAAAAATAAGGGTATGCTTGAGGTTGACACTGTTACATCCTTTAATGCAAGGTTTGATTCAATGGAAAGCATGATGAAAAATTTAAATTCAAGCTTTGAGAATTTGCAGTTGATGAATTCCAGTGCAAACCAGTCTGCTGCAGCAATCAATATAAATCAGAATGCAGCAGATTCTTGCGTTTACTGTGGAGAGGATCACAAGTTTGAATTTTGCCCTAGAAATCCAGCCTCAGTGTGTTATGTAGGAAACCAAAATGCTCCTAGGAATAATCCCTATTCCAATACCTATAATCCGGGTTGGAGAAATCATCCAAATTTTGCTTGGGGAGGTGATAACTGCCCAAAAGATTATGTTGCTGAGCGACTGGAAGGAGCAAATTCTATGCTGCAGCAAAACTGGAAGCAGAAACTGCCACATCACAGCTCGTTAGCCAACTTCATGAACCGACTTCTATTGAGATATTTTCGGGATAAAGGATCAAGGAGAGCCTTACACGAAATGGTTGATCGGAGCTATAATGCAATATCAGAGTTAATCGGGTGCTTGGGGCGTGAAAAGATGCAAAGGAATGAAAAGAGTAAAATTGGAGAAAAGTCAAATCTCGGTCAACAGCAGATTAGCGTCGAGACGCTAGCTCTTGAGCGTCTCGACGCTCACATTCCATATCAGATTAGGCGCGTAAAGCTTACAGTGTCGAGACGCTATGATAGGAAGCGTCCCGACGCTTCCGTTTTTCCTTATTCAGAACGCGCGTATAAGAGGCAGCGTCGCGACGCTGTTCATGAAGGTCCAACAGCAGCAAACCCCCAGCAGAACTCGTTGCTGCAGCAAAACCCACTGTTTGAGCAAAATGAGCAGCGAAATAATCAGGCTGAGAATCCTATCTTGATAGCGAACGATAGGACCAGAGCCATTCGAGCATATGCTGTCCCGATGTTTAATGAGTTGAATCCAGGGATTGCACGTCCCCAAATCCAAGCGGCGAATTTTGAAATGAAACCGGTAATGTTTCAGATGTTGCAAACCGTGGGGCAATTCCATGGTTTGTCATCTGAAGACCCTCATTTACATCTTAAGTCTTTTCTAGGAGTTAGTGATTCTTTTGT
AATTCAGGGAGTGCCTAGAGATGCTCTTAGATTAACTTTGTTCCCGTATTCTCTTAGAGATGGAGCAAAGTCATGGAGTGAAATAGTAGGGTTTAGGCAACTTGAGGATGAGACTTTTAGTGAGGCTTGGGAAAGGTTCAAGGAGCTTTTGCGAAAGTGTACCCACCATGGTTTACCTCATTGTATTCAAATGGAAACATTTTACAATGGTTTAAATGGAGTAACCCAAGGTATGGTCGATGCTTCGGCTGGAGGGGCCCTTTTGGCAAAACCTTTTGATGAAGCCTATGAAATGTTAGAAAGAATATCTATTAATAGTTGTCAGTGGTCGGATGTTAGAGGCAAAAATAAAAAGGTTAAGAGTGTGTTAGAGGTTGATGGTGTGTCCACCATTAGGGCCGATCTTGCTATGATTGCTAACGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCAGCCACCAGCCATGGAGCCTGCAGCAGTGGTGAACCAAGTCACGGATGAAGCATGTGTCTATTGCGGTGAAGACCACAACTACGAGTTTTGCCCCAGCAATCCAGCTTCTGTGTTTTTTGTAGGTAATCAGAGGAACAACCCTTATTCTAACTTTTATAATCCAGGTTGGCGCAACCACCCCAATTTCTCATGGGGAGGTCAAGGAAGTAACGTACAAGCGCAACAGAAGATGAACCAGTCGGGATTTGCTAAAACGCAGGTAATGCCCCAGCAAAATAAGCAGGCTTTGCCCCAGCAAAATTCGGGAAATTCTCTCGAGGGGATGATGAAAGAATTTATGGCTCGCACAGACGCCGCAATTCAAAGTAATCAAGCTTCGATGAGGGCCCTGGAATTGCAAGTGGGTCAGCTAGCTAATGAGCTAAAGGCAAGGCCTCAAGGGAAACTTCCATCCGATACTGAACACCCTCGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTAACTCTTAGGAGTGGTAAGCCACTAGAAGAGTCTAGAAAGACCCAGGATTTAAATAGTAATAGTGATAATATTATTGTTATTGAAAAAGAGTTGGAGTCTGGTCAGGGTGTTGGAGGTAGCAAAGAGAATGCTGGAGCATCTGGTTCGGTGCCAGATGTAGAACCACCATATGTGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGCAAAAGCCTAAGAATCAGGATGGTCAATTTAAAAAGTTTTTAGAGATTCTTAAGCAGTTGCACATAAATATCCCTTTAGTAGAAGCTATTGAGCAAATGCCTAATTATGTTAAATTTCTTAAGGATATTTTGACTAAAAAGAAAAGGTTAGGTGAGTTTGAAACTGTATCTCTTACTGAGGAGTGTAGTGTTATTCTTAAGAATGGGCTACCCCCCAAGGCTAAGGATCCAAGGTCATTTACTATACCTGTGTCTATAGGTGGAAAGGAATTAGGTAGAGCACTCTGTGATTTAGGTGCAAGCATTAACCTTATGCCTCTTTCGGTCTATCGAAAGTTAGGTATTGGTGAAGCTAGGCCTACCACAGTTACACTCCAATTAGCTGATAGGTCGTCCATTTTGGCTACTGGTAGGACATTAATAGATGTTCAGAAAGGAGAATTAACAATGAGAGTCTGTAATGAGGAAGTAAAATTTAATGTGTTTAAAGCCATGAAATATCCAGACGAAATGGAGGATTGCTCCTTCATTAGGATTCTAGAGAGCACAGTTATTGAGAAAGCAACACAGGATTCGGCTGATAAGCATTCGGAAAAGCATGGAGAGGCTCCCCCAATTAAGCCATCCCTAATTGAGGCACCCACTTTAGATTTGAAACCTTTACCGGATCATCTAAAGTATGTGTATCTTGGGGAAAGTGAGACGTTGCCCATTATTGTTGCATTAGATTTAATGCCGGAGCATGAGGAGGCCTTAATAAAATTGCTGCAGCAATACCGCAAAGCTATAGGTTGGACATTGGCTGATATTCAGGGAATTAGCTCGT
CCTTTTGTATGCACAAAATCACTCTAGAGGAGGGATCCTTTAGGAGTATTGAGCAACAAAGAAGGCTTAACCCTGCAATGAAAGAGGTTGTTAAAAAGGAGGTAATTAAATGGTTGGATGCTGGGATCATTTATCCAATTGCCGATAGCAATTGGGTAAGCCCTGTCCAATGTGTTCCTAAGAAAGGAGGTGTCACTGTGGTGTGCAATAAAGATAATGAGTTGATCCCCACAAGGACAGTAACTGGCTGGAGGGTTTGTATGGATTACAGAAGGCTTAATAAAGCCACTCGAAAGGATCATTTCCCTCTACCATTTATCGATCAGATGTTGGATCGATTGGCTGGTCAGACCTATTACTGTTTCTTGGATGGTTATTCTGGGTATAACCAGATTACTATTGCTCCTGAGGATCAGGAAAAAACCACTTTCACTTGCCCTTATGGGACATTCGCTTTTAGGAGAATGCCTTTCAGCCTTTGCAATGCTCAAGCAACATTTCAGCGGTGTATGTTAACAATTTTTTCTGATATGATTGAGTCCACTGTTGAAGTCTTTATGGACGATTTTTCAGTTTTGGAGGGTCTTTTCAGAGTTGTTTAG

Coding sequence (CDS)

ATGAACGAATTCCCTGACTTAGAATTCGGTTTGGACCCTGAAATTGAAAGGACTTTTCATAGGAGGAGAAGGGAACAAAGGGAGAGAAACAATAGGATGGATCCACCACCTCTTTTACCCCCTGTACCTCCTAGAAATCAAGAAAACCAAAACAACCAGCAGCCTAATCTTGAGGTATATCAGCAACCTAGGGTAGAAAATCAAGCTGAAAACCCTGTCCTTATAGCCAATGATAGGGGAAGAGCTATTAGAGCCTATGCCGTCCCTACCTTCAATGAGTTGTACCCTGGGATAGCTAGACCCGAGATACAGGCACACAATTTTGAAATGAAGCCTGTTATGTTCCAAATGTTACAAACGGTGGGACAATTTCATGGATTACCTTCTGAGGACCCACATCTACACCTTAAATCCTTTTTGGGAGTGAGTGACTCATTTAAAATTCAAGGAGTTTCTCAAGATGCTTTGAGGTTAACCTTGTTTCCTTATTCTTTGAGAGATGGTGCCAAAACTTGGTTAAATAACTTTGCTCCTGGCTCCATCACTACGTGGAATGAGTTAGCAGAGAAATTCCTCATTAAGTACTTCCCTCCCACAAGAAATGCCAAGCTTAGATCGGATATAGTGTCTTTTAGACAATTTGATGATGAATCTTTTAGTGAAGCTTGGGAGAGATTTAAGGAATTATTGAGAAAATGCCCTCACCATGGACTCCCCCATTGCATTCAAATGGAGACTTTTTATAATGGGTTGAATTGGGCTACTCAAAACATGGTAGATGCTTCAGCTAATGGTGCTTTATTATCCAAGACTTATGATGAAGCTTATGCCATTTTGGAGAGAATTTCCACAAATAGTTGCCAATGGTCTGATTCTAGAAGTGTGGCTGGGAAGAAAAATAAGGGTATGCTTGAGGTTGACACTGTTACATCCTTTAATGCAAGGTTTGATTCAATGGAAAGCATGATGAAAAATTTAAATTCAAGCTTTGAGAATTTGCAGTTGATGAATTCCAGTGCAAACCAGTCTGCTGCAGCAATCAATATAAATCAGAATGCAGCAGATTCTTGCGTTTACTGTGGAGAGGATCACAAGTTTGAATTTTGCCCTAGAAATCCAGCCTCAGTGTGTTATGTAGGAAACCAAAATGCTCCTAGGAATAATCCCTATTCCAATACCTATAATCCGGGTTGGAGAAATCATCCAAATTTTGCTTGGGGAGGTGATAACTGCCCAAAAGATTATGTTGCTGAGCGACTGGAAGGAGCAAATTCTATGCTGCAGCAAAACTGGAAGCAGAAACTGCCACATCACAGCTCGTTAGCCAACTTCATGAACCGACTTCTATTGAGATATTTTCGGGATAAAGGATCAAGGAGAGCCTTACACGAAATGGTTGATCGGAGCTATAATGCAATATCAGAGTTAATCGGGTGCTTGGGGCGTGAAAAGATGCAAAGGAATGAAAAGAGTAAAATTGGAGAAAAGTCAAATCTCGGTCAACAGCAGATTAGCGTCGAGACGCTAGCTCTTGAGCGTCTCGACGCTCACATTCCATATCAGATTAGGCGCGTAAAGCTTACAGTGTCGAGACGCTATGATAGGAAGCGTCCCGACGCTTCCGTTTTTCCTTATTCAGAACGCGCGTATAAGAGGCAGCGTCGCGACGCTGTTCATGAAGGTCCAACAGCAGCAAACCCCCAGCAGAACTCGTTGCTGCAGCAAAACCCACTGTTTGAGCAAAATGAGCAGCGAAATAATCAGGCTGAGAATCCTATCTTGATAGCGAACGATAGGACCAGAGCCATTCGAGCATATGCTGTCCCGATGTTTAATGAGTTGAATCCAGGGATTGCACGTCCCCAAATCCAAGCGGCGAATTTTGAAATGAAACCGGTAATGTTTCAGATGTTGCAAACCGTGGGGCAATTCCATGGTTTGTCATCTGAAGACCCTCATTTACATCTTAAGTCTTTTCTAGGAGTTAGTGATTCTTTTGT
AATTCAGGGAGTGCCTAGAGATGCTCTTAGATTAACTTTGTTCCCGTATTCTCTTAGAGATGGAGCAAAGTCATGGAGTGAAATAGTAGGGTTTAGGCAACTTGAGGATGAGACTTTTAGTGAGGCTTGGGAAAGGTTCAAGGAGCTTTTGCGAAAGTGTACCCACCATGGTTTACCTCATTGTATTCAAATGGAAACATTTTACAATGGTTTAAATGGAGTAACCCAAGGTATGGTCGATGCTTCGGCTGGAGGGGCCCTTTTGGCAAAACCTTTTGATGAAGCCTATGAAATGTTAGAAAGAATATCTATTAATAGTTGTCAGTGGTCGGATGTTAGAGGCAAAAATAAAAAGGTTAAGAGTGTGTTAGAGGTTGATGGTGTGTCCACCATTAGGGCCGATCTTGCTATGATTGCTAACGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCAGCCACCAGCCATGGAGCCTGCAGCAGTGGTGAACCAAGTCACGGATGAAGCATGTGTCTATTGCGGTGAAGACCACAACTACGAGTTTTGCCCCAGCAATCCAGCTTCTGTGTTTTTTGTAGGTAATCAGAGGAACAACCCTTATTCTAACTTTTATAATCCAGGTTGGCGCAACCACCCCAATTTCTCATGGGGAGGTCAAGGAAGTAACGTACAAGCGCAACAGAAGATGAACCAGTCGGGATTTGCTAAAACGCAGGTAATGCCCCAGCAAAATAAGCAGGCTTTGCCCCAGCAAAATTCGGGAAATTCTCTCGAGGGGATGATGAAAGAATTTATGGCTCGCACAGACGCCGCAATTCAAAGTAATCAAGCTTCGATGAGGGCCCTGGAATTGCAAGTGGGTCAGCTAGCTAATGAGCTAAAGGCAAGGCCTCAAGGGAAACTTCCATCCGATACTGAACACCCTCGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTAACTCTTAGGAGTGGTAAGCCACTAGAAGAGTCTAGAAAGACCCAGGATTTAAATAGTAATAGTGATAATATTATTGTTATTGAAAAAGAGTTGGAGTCTGGTCAGGGTGTTGGAGGTAGCAAAGAGAATGCTGGAGCATCTGGTTCGGTGCCAGATGTAGAACCACCATATGTGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGCAAAAGCCTAAGAATCAGGATGGTCAATTTAAAAAGTTTTTAGAGATTCTTAAGCAGTTGCACATAAATATCCCTTTAGTAGAAGCTATTGAGCAAATGCCTAATTATGTTAAATTTCTTAAGGATATTTTGACTAAAAAGAAAAGGTTAGGTGAGTTTGAAACTGTATCTCTTACTGAGGAGTGTAGTGTTATTCTTAAGAATGGGCTACCCCCCAAGGCTAAGGATCCAAGGTCATTTACTATACCTGTGTCTATAGGTGGAAAGGAATTAGGTAGAGCACTCTGTGATTTAGGTGCAAGCATTAACCTTATGCCTCTTTCGGTCTATCGAAAGTTAGGTATTGGTGAAGCTAGGCCTACCACAGTTACACTCCAATTAGCTGATAGGTCGTCCATTTTGGCTACTGGTAGGACATTAATAGATGTTCAGAAAGGAGAATTAACAATGAGAGTCTGTAATGAGGAAGTAAAATTTAATGTGTTTAAAGCCATGAAATATCCAGACGAAATGGAGGATTGCTCCTTCATTAGGATTCTAGAGAGCACAGTTATTGAGAAAGCAACACAGGATTCGGCTGATAAGCATTCGGAAAAGCATGGAGAGGCTCCCCCAATTAAGCCATCCCTAATTGAGGCACCCACTTTAGATTTGAAACCTTTACCGGATCATCTAAAGTATGTGTATCTTGGGGAAAGTGAGACGTTGCCCATTATTGTTGCATTAGATTTAATGCCGGAGCATGAGGAGGCCTTAATAAAATTGCTGCAGCAATACCGCAAAGCTATAGGTTGGACATTGGCTGATATTCAGGGAATTAGCTCGT
CCTTTTGTATGCACAAAATCACTCTAGAGGAGGGATCCTTTAGGAGTATTGAGCAACAAAGAAGGCTTAACCCTGCAATGAAAGAGGTTGTTAAAAAGGAGGTAATTAAATGGTTGGATGCTGGGATCATTTATCCAATTGCCGATAGCAATTGGGTAAGCCCTGTCCAATGTGTTCCTAAGAAAGGAGGTGTCACTGTGGTGTGCAATAAAGATAATGAGTTGATCCCCACAAGGACAGTAACTGGCTGGAGGGTTTGTATGGATTACAGAAGGCTTAATAAAGCCACTCGAAAGGATCATTTCCCTCTACCATTTATCGATCAGATGTTGGATCGATTGGCTGGTCAGACCTATTACTGTTTCTTGGATGGTTATTCTGGGTATAACCAGATTACTATTGCTCCTGAGGATCAGGAAAAAACCACTTTCACTTGCCCTTATGGGACATTCGCTTTTAGGAGAATGCCTTTCAGCCTTTGCAATGCTCAAGCAACATTTCAGCGGTGTATGTTAACAATTTTTTCTGATATGATTGAGTCCACTGTTGAAGTCTTTATGGACGATTTTTCAGTTTTGGAGGGTCTTTTCAGAGTTGTTTAG

Protein sequence

MNEFPDLEFGLDPEIERTFHRRRREQRERNNRMDPPPLLPPVPPRNQENQNNQQPNLEVYQQPRVENQAENPVLIANDRGRAIRAYAVPTFNELYPGIARPEIQAHNFEMKPVMFQMLQTVGQFHGLPSEDPHLHLKSFLGVSDSFKIQGVSQDALRLTLFPYSLRDGAKTWLNNFAPGSITTWNELAEKFLIKYFPPTRNAKLRSDIVSFRQFDDESFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNWATQNMVDASANGALLSKTYDEAYAILERISTNSCQWSDSRSVAGKKNKGMLEVDTVTSFNARFDSMESMMKNLNSSFENLQLMNSSANQSAAAININQNAADSCVYCGEDHKFEFCPRNPASVCYVGNQNAPRNNPYSNTYNPGWRNHPNFAWGGDNCPKDYVAERLEGANSMLQQNWKQKLPHHSSLANFMNRLLLRYFRDKGSRRALHEMVDRSYNAISELIGCLGREKMQRNEKSKIGEKSNLGQQQISVETLALERLDAHIPYQIRRVKLTVSRRYDRKRPDASVFPYSERAYKRQRRDAVHEGPTAANPQQNSLLQQNPLFEQNEQRNNQAENPILIANDRTRAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWSEIVGFRQLEDETFSEAWERFKELLRKCTHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKPFDEAYEMLERISINSCQWSDVRGKNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQRNNPYSNFYNPGWRNHPNFSWGGQGSNVQAQQKMNQSGFAKTQVMPQQNKQALPQQNSGNSLEGMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNSDNIIVIEKELESGQGVGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYVKFLKDILTKKKRLGEFETVSLTEECSVILKNGLPPKAKDPRSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSSILATGRTLIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTVIEKATQDSADKHSEKHGEAPPIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVALDLMPEHEEALIKLLQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQTYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFSLCNAQATFQRCMLTIFSDMIESTVEVFMDDFSVLEGLFRVV
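
The protein sequence appears to be a standard-genetic-code translation of the CDS: the first codons ATG AAC GAA TTC read as M N E F. The following is a minimal Python sketch of that translation, assuming the standard nuclear code; `translate` is a hypothetical helper written for illustration, not the pipeline that produced this record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon.

    Codons containing unexpected characters are reported as 'X'.
    A trailing partial codon is ignored.
    """
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First ten codons of the CDS listed above.
    print(translate("ATGAACGAATTCCCTGACTTAGAATTCGGT"))
```

Passing the full CDS to `translate` and comparing the result with the listed protein is a quick consistency check on a downloaded record.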
Homology
BLAST of Lag0035274 vs. NCBI nr
Match: XP_017239676.1 (PREDICTED: uncharacterized protein LOC108212460 [Daucus carota subsp. sativus])

HSP 1 Score: 888.3 bits (2294), Expect = 9.7e-254
Identity = 505/1059 (47.69%), Positives = 662/1059 (62.51%), Query Frame = 0

Query: 599  RTRAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKS 658
            R   ++ Y +P F+ ++  IARP I A NF +     Q ++   +F+GLS+EDP+ HL++
Sbjct: 46   RRLRVKDYIMPSFDGIHSSIARPAIAANNFHVDSATMQAIRD-NKFNGLSAEDPNAHLRN 105

Query: 659  FLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSW-------------------------- 718
            FL + D+F + GVP + +RL LF  SL   A+ W                          
Sbjct: 106  FLEIVDNFKVNGVPEETIRLRLFSRSLDGRAREWLDSLPNNSITTWNQLVEKFLNKYFSP 165

Query: 719  -------SEIVGFRQLEDETFSEAWERFKELLRKCTHHGLPHCIQMETFYNGLNGVTQGM 778
                    EI  F+Q + E+  EA+ERFK+LLRKC HHGL    ++ TFYNGL       
Sbjct: 166  AKIERLIKEIQNFQQFDMESLYEAYERFKDLLRKCPHHGLSDQQKIRTFYNGLTIQCATQ 225

Query: 779  VDASAGGALLAKPFDEAYEMLERISINSCQWSDVRGKNKKVKSVLEVDGVSTIRADLAMI 838
            VD +A G+L  +  ++A+E+LE I+ N+C+  D R  +KKV  V EVD +++  A +   
Sbjct: 226  VDGAARGSLTNQYPEDAFEILEDITSNNCRAYD-RSSSKKVAGVHEVDPLTSFSAQVVSQ 285

Query: 839  ANAL-KNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCP------SNPASVFF 898
              AL K +  +S  +   ++ A  V  +    C  CGE H  + CP      +  +SV +
Sbjct: 286  FEALNKKLESLSVDRQQPVQSAHQVQNI-HVFCDMCGEGHPTQQCPLIYHDVAQSSSVNY 345

Query: 899  VG---NQRNNPYSNFYNPGWRNHPNFSWGGQGSNVQAQQKMNQS---GFAKTQVMPQQNK 958
            VG   NQ+NNP+SN YNPGWRNHPNFSW    +NV+      Q+   GF       QQN 
Sbjct: 346  VGNSSNQQNNPFSNTYNPGWRNHPNFSW---NNNVRPNMPFKQNVPPGF-------QQNP 405

Query: 959  QALPQQNSGNSLEGMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDT 1018
            +    +   N+ E ++ ++M +TDA IQS  ASMRALE+QVGQLA+ +  RP G LPS+T
Sbjct: 406  RPQEMEKKPNT-EDLLLQYMQKTDALIQSQSASMRALEMQVGQLASAINNRPSGSLPSNT 465

Query: 1019 E-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNSDNIIVIEKELESGQGVGGSKENAG 1078
            E +P+ + +E  KA+TLRSGK +E + K  D   + + ++  E  + S         N  
Sbjct: 466  EPNPKNDKREHCKAITLRSGKEIEGNTKKVDDGGDPEKVLNEEPSVLS---------NPK 525

Query: 1079 ASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQM 1138
            A  S P     +V PPP     PFPQR + + QD QF+KF+++ K+L INIP  EA+EQM
Sbjct: 526  ADASTP---KKHVYPPP-----PFPQRLQKQKQDKQFQKFMDVFKKLSINIPFAEALEQM 585

Query: 1139 PNYVKFLKDILTKKKRLGEFETVSLTEECSVILKNGLPPKAKDPRSFTIPVSIGGKELGR 1198
             +YVKF+KDIL++K+RL EFETV+LTEECS IL+  LPPK KDP SFTIP +IG +  G+
Sbjct: 586  SSYVKFMKDILSRKRRLEEFETVTLTEECSAILQKKLPPKLKDPGSFTIPCTIGNQYFGK 645

Query: 1199 ALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRS---------------------- 1258
            ALCDLGAS+NLMPLS++ KLG+GE +PT+V LQLADRS                      
Sbjct: 646  ALCDLGASVNLMPLSIFVKLGVGEVKPTSVRLQLADRSLAYPRGVVEDVLVKVDKFIFPA 705

Query: 1259 -------------------SILATGRTLIDVQKGELTMRVCNEEVKFNVFKAMKYPDEME 1318
                                 LATGRTLIDVQKGELTMRV +E+V FNVF AMK+ ++ E
Sbjct: 706  DFIVLDMEEDADIPLLLGRPFLATGRTLIDVQKGELTMRVQDEQVTFNVFSAMKFSNDEE 765

Query: 1319 DCSFIRIL-------------ESTVIEKATQDSADKHSEKHGE----------------- 1378
             C  +                 +  +E + +++ D+ +E+  E                 
Sbjct: 766  SCFSVSTFTGGDDLPLMLEQHSTDPLELSLREAGDESNEEIAECVKELNALPTYRRPFQQ 825

Query: 1379 ---------APPIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVALDLMPEHEEALI 1438
                     +   KPS+ E P L+LK LP HLKY +LGE  TLP+I++  L  EHEE L+
Sbjct: 826  FESFEMPVKSKASKPSIEEPPELELKQLPTHLKYAFLGEKSTLPVILSSTLSAEHEEKLL 885

Query: 1439 KLLQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKW 1498
            ++L++Y++AIGW +ADI+GIS SFCMHKI++E+    +IE QRRLNP MKEVVKKE+IKW
Sbjct: 886  RVLKEYKRAIGWKIADIRGISPSFCMHKISMEDDHKPNIEHQRRLNPVMKEVVKKEIIKW 945

Query: 1499 LDAGIIYPIADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRTVTGWRVCMDYRRLNKATR 1531
            LDAGIIYPI+DS+WVSP+QCVPKKGG+TVV N+ NELIPTRTVTGWRVCMDYR+LNKATR
Sbjct: 946  LDAGIIYPISDSSWVSPIQCVPKKGGITVVANEKNELIPTRTVTGWRVCMDYRKLNKATR 1005
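
Each HSP summary line reports identical and similar positions as a count over the alignment length, followed by the percentage. The following is a minimal Python sketch that recomputes those percentages from the counts; `hsp_ratios` is a hypothetical helper, and the regular expression only assumes the `label = n/m` shape seen in this report.

```python
import re

# Captures "<label> = <matches>/<alignment length>" pairs.
RATIO_RE = re.compile(r"(\w+)\s*=\s*(\d+)/(\d+)")


def hsp_ratios(line):
    """Return {label: (matches, length, percent)} for each n/m pair in the line."""
    ratios = {}
    for label, matches, length in RATIO_RE.findall(line):
        matches, length = int(matches), int(length)
        ratios[label] = (matches, length, round(100.0 * matches / length, 2))
    return ratios


if __name__ == "__main__":
    line = "Identity = 505/1059 (47.69%), Positives = 662/1059 (62.51%), Query Frame = 0"
    for label, (matches, length, percent) in hsp_ratios(line).items():
        print(f"{label}: {matches}/{length} = {percent}%")
```

Fields without a fraction, such as the frame, are skipped because they do not match the `n/m` pattern.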

BLAST of Lag0035274 vs. NCBI nr
Match: XP_016646912.1 (PREDICTED: uncharacterized protein LOC103318979 [Prunus mume])

HSP 1 Score: 857.4 bits (2214), Expect = 1.8e-244
Identity = 501/1069 (46.87%), Positives = 642/1069 (60.06%), Query Frame = 0

Query: 595  IANDRTRAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHL 654
            +A +  RA+  +  P+       I RP I A NFE+KP M  MLQ    F GL +EDP++
Sbjct: 1    MAEEAERAVGEFGPPVATP--SAIRRPAIAANNFEIKPAMITMLQNSSVFCGLPNEDPNI 60

Query: 655  HLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSW---------------------- 714
            HL  FL + D+    GV  DA+RL LFP+SL+D AK W                      
Sbjct: 61   HLAIFLEICDTSKFNGVTDDAIRLRLFPFSLKDKAKLWLLSQPQDSIRTWDDLSKKFLAK 120

Query: 715  -----------SEIVGFRQLEDETFSEAWERFKELLRKCTHHGLPHCIQMETFYNGLNGV 774
                        +I+ F Q + E   EAWERFK+LLRKC HH LP  IQ++TFYNGL+  
Sbjct: 121  FFPPAKTAKFRQDIMSFAQYDKEPLYEAWERFKDLLRKCPHHELPTWIQVQTFYNGLSQT 180

Query: 775  TQGMVDASAGGALLAKPFDEAYEMLERISINSCQWSDVRGKNKKVKSVLEVDGVSTIRAD 834
            ++ +VDA+AGGAL+AK   EA+E+LE ++ N+ QW   R  N K   VLEVD ++ + A 
Sbjct: 181  SRTLVDAAAGGALMAKTATEAFELLETMASNNYQWPSER-MNLKPAGVLEVDAMALLTAQ 240

Query: 835  LAMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPS-NPAS----- 894
            ++ +   + +++V S            +N  T+  C  C   H    C + NP +     
Sbjct: 241  ISNLTKKVDSLSVNS------------INTSTNFGCELCAGPHPSSECTTGNPFASAEQV 300

Query: 895  --VFFVGNQRNNPYSNFYNPGWRNHPNFSWGGQGSNVQAQQKMNQSGFAKTQVMPQQNKQ 954
              V  +  QRNNPYSN YNPGWRNHPNFSW    SN Q  Q+    GF      P Q K+
Sbjct: 301  NQVGELNRQRNNPYSNTYNPGWRNHPNFSW----SNTQNVQR-PPPGF------PAQEKK 360

Query: 955  ALPQQNSGNSLEGMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE 1014
             +  +++   L     +FM  T    Q+ QAS++ LE+QVGQLAN +  R QG  PS  E
Sbjct: 361  -INLEDALTQLTMSTTQFMTETKTQFQNQQASIQNLEVQVGQLANVISGRNQGVFPSQPE 420

Query: 1015 -HPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNSDNIIVIEKELES-----------GQ 1074
             +P+ +  EQ KA+TLR GK +  +    DL   +     +EKE E+             
Sbjct: 421  VNPKNQ--EQAKAITLRKGKQVNTA---IDLEKEA-----LEKEKEAKKFAAEMGHAFSP 480

Query: 1075 GVGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHIN 1134
             +  ++++     S+P    P +   PYVP +PFPQR +    DGQF KFLE+ ++L IN
Sbjct: 481  PITTTEKSQEEENSIP---IPSLQLKPYVPQIPFPQRLRKNKVDGQFAKFLEMFRKLQIN 540

Query: 1135 IPLVEAIEQMPNYVKFLKDILTKKKRLGEFETVSLTEECSVILKNGLPPKAKDPRSFTIP 1194
            IP  EA+EQMP+Y KF+KDIL+KK++ GE E + LTEECS IL+  LPPK KD  SF IP
Sbjct: 541  IPFAEALEQMPSYAKFMKDILSKKRKFGEHEKIQLTEECSAILQRKLPPKQKDRGSFKIP 600

Query: 1195 VSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRS------------ 1254
             +IG     RALCDLG+SINL+PLSV +K+GIGE +PTTV+LQ+ADRS            
Sbjct: 601  CTIGNNFFERALCDLGSSINLLPLSVAKKIGIGEIKPTTVSLQMADRSITYPDGIIEDVL 660

Query: 1255 -----------------------------SILATGRTLIDVQKGELTMRVCNEEVKFNVF 1314
                                           L T RTLIDV++G LT+RV NE+  F VF
Sbjct: 661  VKVDTLIFPADFLVLDMEEDSDTQLILGRPFLITSRTLIDVEEGLLTLRVGNEQATFKVF 720

Query: 1315 KAMKYPDEMEDCSFIRI-----------------LESTVIEKATQDSAD----------- 1374
            +A+K+P E EDC  I +                 LEST++  AT    +           
Sbjct: 721  EAIKFPREAEDCFHIELIDEIASDTFKKENPSHPLESTLVHAATSQDDNPMVAEYALYLD 780

Query: 1375 ----------KHSEKHGEAPP-IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVALD 1434
                         E  G APP   PS+I APTL LKPLP HL+Y YLG SETLP+I+A +
Sbjct: 781  ASQPYHPRQRNQFEPLGAAPPKAAPSVIAAPTLTLKPLPTHLRYAYLGTSETLPVIIAAN 840

Query: 1435 LMPEHEEALIKLLQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMK 1494
            L    EE ++++L++++ AIGWT+ADI+GIS S CMH+I +EE    S+E QRRLNP MK
Sbjct: 841  LSETEEEKVLRVLRKHKTAIGWTIADIKGISPSMCMHRILMEEEHKPSVEHQRRLNPNMK 900

Query: 1495 EVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRTVTGWRVCM 1531
            EVV+ EV+K LDAGIIYPI+DS+WVSP Q VPKKGG+TVV N++NEL+PTRTVTGWRVC+
Sbjct: 901  EVVRAEVLKLLDAGIIYPISDSSWVSPTQVVPKKGGMTVVKNENNELVPTRTVTGWRVCI 960

BLAST of Lag0035274 vs. NCBI nr
Match: XP_038976300.1 (uncharacterized protein LOC120107204 [Phoenix dactylifera])

HSP 1 Score: 855.9 bits (2210), Expect = 5.3e-244
Identity = 479/1055 (45.40%), Positives = 626/1055 (59.34%), Query Frame = 0

Query: 597  NDRTRAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHL 656
            N   R +  YAVP  N   P I RP + A NFE+KP + QM+Q   QF G  SEDPH HL
Sbjct: 7    NQNKRLLSDYAVPNVNGAQPSIVRPTVNANNFEIKPGLIQMVQQ-EQFGGGPSEDPHAHL 66

Query: 657  KSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSW------------------------ 716
             +FL + D+  + GV  DA+RL LFP+SL+D AK+W                        
Sbjct: 67   ANFLEICDTIKMNGVSDDAIRLRLFPFSLKDKAKAWLNSKAPNSFTTWNALSQAFLSKYF 126

Query: 717  ---------SEIVGFRQLEDETFSEAWERFKELLRKCTHHGLPHCIQMETFYNGLNGVTQ 776
                     ++I  F Q + E+  EAWERFK+L RKC HHGLP  + ++TFYNGL    +
Sbjct: 127  PPGKTAKLRNDITSFAQFDGESLYEAWERFKDLQRKCPHHGLPDWLIVQTFYNGLTHSVR 186

Query: 777  GMVDASAGGALLAKPFDEAYEMLERISINSCQWSDVRGKNKKVKSVLEVDGVSTIRADLA 836
              +DA+AGG L++K  +EAYE+LE ++ N+ QWS+ R   KKV  + +VDG++ + A + 
Sbjct: 187  ITIDAAAGGTLMSKSTEEAYELLEEMASNNYQWSNERCMPKKVPGMYDVDGINMLNAKVD 246

Query: 837  MIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGN-- 896
             +      +  ++    P +            +C  CG  H    C      V FV N  
Sbjct: 247  SLVKMFSKLGNVNSVSSPVL------------SCDCCGGAHMSSDC----MQVQFVSNYN 306

Query: 897  ---QRNNPYSNFYNPGWRNHPNFSWGGQGSNVQAQQKMNQSGFAKTQVMPQQNKQALPQQ 956
               Q+NNPYSN YNPGWRNHPNFSW  QG+   + + ++  GF   Q  P Q +     +
Sbjct: 307  RQQQQNNPYSNTYNPGWRNHPNFSWKDQGNQGSSSRPLHPPGF---QPKPSQPESKQSWE 366

Query: 957  NSGNSLEGMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRRE 1016
             +   L     E   R +A +    +S R +E+Q+GQLAN + +R QG LPS TE     
Sbjct: 367  IAIEKLANASSERFERLEAKVDQLASSNRNVEIQLGQLANSINSRGQGNLPSKTE---VN 426

Query: 1017 GKEQVKAVTLRSGKPLEESRKTQDLNSNSDNIIVIEKELESGQGVGGSKENAGASGSVPD 1076
             KE  KAVTLRSGK L +                +  E   G  V   + N   S  V D
Sbjct: 427  PKEHCKAVTLRSGKQLGQ----------------VSGETIVGDKVDYEEVNKKVSEEVED 486

Query: 1077 V---EPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYV 1136
            +     P  P  PYVPP+PFPQR K    D QF+KFL++ +QLHINIP  +A+ Q+P Y 
Sbjct: 487  LAKTPSPLPPVEPYVPPIPFPQRLKQNKIDQQFEKFLKVFRQLHINIPFADALAQIPAYT 546

Query: 1137 KFLKDILTKKKRLGEFETVSLTEECSVILKNGLPPKAKDPRSFTIPVSIGGKELGRALCD 1196
            KFLK+I++KK++L +FET++LTEECS I++N LPPK +DP SF+IP +IG  +  RALCD
Sbjct: 547  KFLKEIMSKKRKLEDFETIALTEECSAIIQNKLPPKLRDPGSFSIPCTIGDVDFSRALCD 606

Query: 1197 LGASINLMPLSVYRKLGIGEARPTTVTLQLADRS-------------------------- 1256
            LGAS++LMPLSV RKLG+ E +PTT++LQLADRS                          
Sbjct: 607  LGASVSLMPLSVSRKLGLKELKPTTISLQLADRSVKYPLGILENVLIKVKKFIIPVDFIV 666

Query: 1257 ---------------SILATGRTLIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSF 1316
                             LAT   +IDV+ G LT++V  EEV+FN+F+A KYP   +    
Sbjct: 667  LEMEEDTEIPIILGRPFLATAGAIIDVKNGRLTLKVGEEEVEFNLFEATKYPSFTDHVFR 726

Query: 1317 IRI-----------------LESTVIEKATQ--------------DSADKHSEKHG---- 1376
            + +                 LE+ ++   T               ++     +K G    
Sbjct: 727  VDVVDESTREFFKAENTKEPLETCLVSAGTSKDDNLEIAKVACALEATCPKPKKRGIYFE 786

Query: 1377 ----EAPPIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVALDLMPEHEEALIKLLQ 1436
                  PP  PS ++AP L+LKPLP HL Y +LGE+ TLP+IV++ L  E  + LI++L+
Sbjct: 787  DIGKGKPPPPPSNVQAPVLELKPLPSHLMYAFLGENNTLPVIVSVSLSAEQLDKLIRILR 846

Query: 1437 QYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAG 1496
              +KAIGWT++D++GIS S CMH+I +E+     +E QRRLNP MKEVV+ EV+KWLDAG
Sbjct: 847  LRKKAIGWTISDLRGISPSLCMHRILMEDNHKPIVENQRRLNPNMKEVVRAEVLKWLDAG 906

Query: 1497 IIYPIADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHF 1531
            IIYPI+DS W+SPVQ VPKKGG+TVV N++NELIPTRTVTGWRVC+DYR+LN  TRKDHF
Sbjct: 907  IIYPISDSLWISPVQVVPKKGGMTVVHNENNELIPTRTVTGWRVCIDYRKLNSVTRKDHF 966

BLAST of Lag0035274 vs. NCBI nr
Match: XP_038972405.1 (uncharacterized protein LOC120104748 [Phoenix dactylifera])

HSP 1 Score: 855.1 bits (2208), Expect = 9.1e-244
Identity = 479/1055 (45.40%), Positives = 626/1055 (59.34%), Query Frame = 0

Query: 597  NDRTRAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHL 656
            N   R +  YAVP  N   P I RP + A NFE+KP + QM+Q   QF G  SEDPH HL
Sbjct: 7    NQNKRLLSDYAVPNVNGAQPSIVRPTVNANNFEIKPGLIQMVQQ-EQFGGGPSEDPHAHL 66

Query: 657  KSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSW------------------------ 716
             +FL + D+  + GV  DA+RL LFP+SL+D AK+W                        
Sbjct: 67   ANFLEICDTIKMNGVSDDAIRLRLFPFSLKDKAKAWLNSKAPNSFTTWNALSQAFLSKYF 126

Query: 717  ---------SEIVGFRQLEDETFSEAWERFKELLRKCTHHGLPHCIQMETFYNGLNGVTQ 776
                     ++I  F Q + E+  EAWERFK+L RKC HHGLP  + ++TFYNGL    +
Sbjct: 127  PPGKTAKLRNDITSFAQFDGESLYEAWERFKDLQRKCPHHGLPDWLIVQTFYNGLTHSVR 186

Query: 777  GMVDASAGGALLAKPFDEAYEMLERISINSCQWSDVRGKNKKVKSVLEVDGVSTIRADLA 836
              +DA+AGG L++K  +EAYE+LE ++ N+ QWS+ R   KKV  + +VDG++ + A + 
Sbjct: 187  ITIDAAAGGTLMSKSTEEAYELLEEMASNNYQWSNERCMPKKVPGMYDVDGINMLNAKVD 246

Query: 837  MIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGN-- 896
             +      +  ++    P +            +C  CG  H    C      V FV N  
Sbjct: 247  SLVKMFGKLGNVNSVSSPVL------------SCDCCGGAHMSSDC----MQVQFVSNYN 306

Query: 897  ---QRNNPYSNFYNPGWRNHPNFSWGGQGSNVQAQQKMNQSGFAKTQVMPQQNKQALPQQ 956
               Q+NNPYSN YNPGWRNHPNFSW  QG+   + + ++  GF   Q  P Q +     +
Sbjct: 307  RQQQQNNPYSNTYNPGWRNHPNFSWKDQGNQGSSSRPLHPPGF---QPKPSQPESKQSWE 366

Query: 957  NSGNSLEGMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRRE 1016
             +   L     E   R +A +    +S R +E+Q+GQLAN + +R QG LPS TE     
Sbjct: 367  IAIEKLANASSERFERLEAKVDQLASSNRNVEIQLGQLANSINSRGQGNLPSKTE---VN 426

Query: 1017 GKEQVKAVTLRSGKPLEESRKTQDLNSNSDNIIVIEKELESGQGVGGSKENAGASGSVPD 1076
             KE  KAVTLRSGK L +                +  E   G  V   + N   S  V D
Sbjct: 427  PKEHCKAVTLRSGKQLGQ----------------VSGETIVGDKVDYEEVNKKVSEEVED 486

Query: 1077 V---EPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYV 1136
            +     P  P  PYVPP+PFPQR K    D QF+KFL++ +QLHINIP  +A+ Q+P Y 
Sbjct: 487  LAKTPSPLPPVEPYVPPIPFPQRLKQNKIDQQFEKFLKVFRQLHINIPFADALAQIPAYT 546

Query: 1137 KFLKDILTKKKRLGEFETVSLTEECSVILKNGLPPKAKDPRSFTIPVSIGGKELGRALCD 1196
            KFLK+I++KK++L +FET++LTEECS I++N LPPK +DP SF+IP +IG  +  RALCD
Sbjct: 547  KFLKEIMSKKRKLEDFETIALTEECSAIIQNKLPPKLRDPGSFSIPCTIGDVDFSRALCD 606

Query: 1197 LGASINLMPLSVYRKLGIGEARPTTVTLQLADRS-------------------------- 1256
            LGAS++LMPLSV RKLG+ E +PTT++LQLADRS                          
Sbjct: 607  LGASVSLMPLSVSRKLGLKELKPTTISLQLADRSVKYPLGILENVLIKVKKFIIPVDFIV 666

Query: 1257 ---------------SILATGRTLIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSF 1316
                             LAT   +IDV+ G LT++V  EEV+FN+F+A KYP   +    
Sbjct: 667  LEMEEDTEIPIILGRPFLATAGAIIDVKNGRLTLKVGEEEVEFNLFEATKYPSFTDHVFR 726

Query: 1317 IRI-----------------LESTVIEKATQ--------------DSADKHSEKHG---- 1376
            + +                 LE+ ++   T               ++     +K G    
Sbjct: 727  VDVVDESTREFFKAENTKEPLETCLVSAGTSKDDNLEIAKVACALEATCPKPKKRGIYFE 786

Query: 1377 ----EAPPIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVALDLMPEHEEALIKLLQ 1436
                  PP  PS ++AP L+LKPLP HL Y +LGE+ TLP+IV++ L  E  + LI++L+
Sbjct: 787  DIGKGKPPPPPSNVQAPVLELKPLPSHLMYAFLGENNTLPVIVSVSLSDEQLDKLIRILR 846

Query: 1437 QYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAG 1496
              +KAIGWT++D++GIS S CMH+I +E+     +E QRRLNP MKEVV+ EV+KWLDAG
Sbjct: 847  LRKKAIGWTISDLRGISPSLCMHRILMEDNHKPIVENQRRLNPNMKEVVRAEVLKWLDAG 906

Query: 1497 IIYPIADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHF 1531
            IIYPI+DS W+SPVQ VPKKGG+TVV N++NELIPTRTVTGWRVC+DYR+LN  TRKDHF
Sbjct: 907  IIYPISDSLWISPVQVVPKKGGMTVVHNENNELIPTRTVTGWRVCIDYRKLNSVTRKDHF 966
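
The percentages in each HSP header follow directly from the residue counts (matched residues divided by alignment length). As a minimal sketch, the figures for this hit can be recomputed in Python; the helper name `hsp_percentages` is hypothetical and is not part of any BLAST tool.

```python
import re

def hsp_percentages(stats_line: str) -> dict:
    """Recompute percentages from the 'count/length' pairs on an HSP statistics line."""
    result = {}
    # Each field looks like 'Identity = 479/1055 (45.40%)'; the label is any run of letters.
    # Fields without a count/length pair (e.g. 'Query Frame = 0') are skipped.
    for label, matched, length in re.findall(r"([A-Za-z]+) = (\d+)/(\d+)", stats_line):
        result[label] = round(100 * int(matched) / int(length), 2)
    return result

line = "Identity = 479/1055 (45.40%), Positives = 626/1055 (59.34%), Query Frame = 0"
print(hsp_percentages(line))
```

For the line above this gives 45.4 and 59.34, in agreement with the values printed in the HSP header.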

BLAST of Lag0035274 vs. NCBI nr
Match: XP_038973683.1 (uncharacterized protein LOC120105384 [Phoenix dactylifera])

HSP 1 Score: 855.1 bits (2208), Expect = 9.1e-244
Identity = 479/1055 (45.40%), Positives = 626/1055 (59.34%), Query Frame = 0

Query: 597  NDRTRAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHL 656
            N   R +  YAVP  N   P I RP + A NFE+KP + QM+Q   QF G  SEDPH HL
Sbjct: 7    NQNKRLLSDYAVPNVNGAQPSIVRPTVNANNFEIKPGLIQMVQQ-EQFGGGPSEDPHAHL 66

Query: 657  KSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSW------------------------ 716
             +FL + D+  + GV  DA+RL LFP+SL+D AK+W                        
Sbjct: 67   ANFLEICDTIKMNGVSDDAIRLRLFPFSLKDKAKAWLNSKAPNSFTTWNALSQAFLSKYF 126

Query: 717  ---------SEIVGFRQLEDETFSEAWERFKELLRKCTHHGLPHCIQMETFYNGLNGVTQ 776
                     ++I  F Q + E+  EAWERFK+L RKC HHGLP  + ++TFYNGL    +
Sbjct: 127  PPGKTAKLRNDITSFAQFDGESLYEAWERFKDLQRKCPHHGLPDWLIVQTFYNGLTHSVR 186

Query: 777  GMVDASAGGALLAKPFDEAYEMLERISINSCQWSDVRGKNKKVKSVLEVDGVSTIRADLA 836
              +DA+AGG L++K  +EAYE+LE ++ N+ QWS+ R   KKV  + +VDG++ + A + 
Sbjct: 187  ITIDAAAGGTLMSKSTEEAYELLEEMASNNYQWSNERCMPKKVPGMYDVDGINMLNAKVD 246

Query: 837  MIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGN-- 896
             +      +  ++    P +            +C  CG  H    C      V FV N  
Sbjct: 247  SLVKMFGKLGNVNSVSSPVL------------SCDCCGGAHMSSDC----MQVQFVSNYN 306

Query: 897  ---QRNNPYSNFYNPGWRNHPNFSWGGQGSNVQAQQKMNQSGFAKTQVMPQQNKQALPQQ 956
               Q+NNPYSN YNPGWRNHPNFSW  QG+   + + ++  GF   Q  P Q +     +
Sbjct: 307  RQQQQNNPYSNTYNPGWRNHPNFSWKDQGNQGSSSRPLHPPGF---QPKPSQPESKQSWE 366

Query: 957  NSGNSLEGMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRRE 1016
             +   L     E   R +A +    +S R +E+Q+GQLAN + +R QG LPS TE     
Sbjct: 367  IAIEKLANASSERFERLEAKVDQLASSNRNVEIQLGQLANSINSRGQGNLPSKTE---VN 426

Query: 1017 GKEQVKAVTLRSGKPLEESRKTQDLNSNSDNIIVIEKELESGQGVGGSKENAGASGSVPD 1076
             KE  KAVTLRSGK L +                +  E   G  V   + N   S  V D
Sbjct: 427  PKEHCKAVTLRSGKQLGQ----------------VSGETIVGDKVDYEEVNKKVSEEVED 486

Query: 1077 V---EPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYV 1136
            +     P  P  PYVPP+PFPQR K    D QF+KFL++ +QLHINIP  +A+ Q+P Y 
Sbjct: 487  LAKTPSPLPPVEPYVPPIPFPQRLKQNKIDQQFEKFLKVFRQLHINIPFADALAQIPAYT 546

Query: 1137 KFLKDILTKKKRLGEFETVSLTEECSVILKNGLPPKAKDPRSFTIPVSIGGKELGRALCD 1196
            KFLK+I++KK++L +FET++LTEECS I++N LPPK +DP SF+IP +IG  +  RALCD
Sbjct: 547  KFLKEIMSKKRKLEDFETIALTEECSAIIQNKLPPKLRDPGSFSIPCTIGDVDFSRALCD 606

Query: 1197 LGASINLMPLSVYRKLGIGEARPTTVTLQLADRS-------------------------- 1256
            LGAS++LMPLSV RKLG+ E +PTT++LQLADRS                          
Sbjct: 607  LGASVSLMPLSVSRKLGLKELKPTTISLQLADRSVKYPLGILENVLIKVKKFIIPVDFIV 666

Query: 1257 ---------------SILATGRTLIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSF 1316
                             LAT   +IDV+ G LT++V  EEV+FN+F+A KYP   +    
Sbjct: 667  LEMEEDTEIPIILGRPFLATAGAIIDVKNGRLTLKVGEEEVEFNLFEATKYPSFTDHVFR 726

Query: 1317 IRI-----------------LESTVIEKATQ--------------DSADKHSEKHG---- 1376
            + +                 LE+ ++   T               ++     +K G    
Sbjct: 727  VDVVDESTREFFKAENTKEPLETCLVSAGTSKDDNLEIAKVACALEATCPKPKKRGIYFE 786

Query: 1377 ----EAPPIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVALDLMPEHEEALIKLLQ 1436
                  PP  PS ++AP L+LKPLP HL Y +LGE+ TLP+IV++ L  E  + LI++L+
Sbjct: 787  DIGKGKPPPPPSNVQAPVLELKPLPSHLMYAFLGENNTLPVIVSVSLSDEQLDKLIRILR 846

Query: 1437 QYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAG 1496
              +KAIGWT++D++GIS S CMH+I +E+     +E QRRLNP MKEVV+ EV+KWLDAG
Sbjct: 847  LRKKAIGWTISDLRGISPSLCMHRILMEDNHKPIVENQRRLNPNMKEVVRAEVLKWLDAG 906

Query: 1497 IIYPIADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHF 1531
            IIYPI+DS W+SPVQ VPKKGG+TVV N++NELIPTRTVTGWRVC+DYR+LN  TRKDHF
Sbjct: 907  IIYPISDSLWISPVQVVPKKGGMTVVHNENNELIPTRTVTGWRVCIDYRKLNSVTRKDHF 966

BLAST of Lag0035274 vs. ExPASy Swiss-Prot
Match: Q99315 (Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY3B-G PE=1 SV=3)

HSP 1 Score: 114.8 bits (286), Expect = 8.8e-24
Identity = 79/217 (36.41%), Positives = 107/217 (49.31%), Query Frame = 0

Query: 1313 LLQQYRKAIGWTL----ADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEV 1372
            L Q+YR+ I   L    ADI  I      H I ++ G+     Q   +    ++ + K V
Sbjct: 560  LQQKYREIIRNDLPPRPADINNIP---VKHDIEIKPGARLPRLQPYHVTEKNEQEINKIV 619

Query: 1373 IKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRTVTGWRVCMDYRRLNK 1432
             K LD   I P + S   SPV  VPKK G                   +R+C+DYR LNK
Sbjct: 620  QKLLDNKFIVP-SKSPCSSPVVLVPKKDGT------------------FRLCVDYRTLNK 679

Query: 1433 ATRKDHFPLPFIDQMLDRLAGQTYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRR 1492
            AT  D FPLP ID +L R+     +  LD +SGY+QI + P+D+ KT F  P G + +  
Sbjct: 680  ATISDPFPLPRIDNLLSRIGNAQIFTTLDLHSGYHQIPMEPKDRYKTAFVTPSGKYEYTV 739

Query: 1493 MPFSLCNAQATFQRCMLTIFSDMIESTVEVFMDDFSV 1526
            MPF L NA +TF R M   F D+    V V++DD  +
Sbjct: 740  MPFGLVNAPSTFARYMADTFRDL--RFVNVYLDDILI 752

BLAST of Lag0035274 vs. ExPASy Swiss-Prot
Match: Q7LHG5 (Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY3B-I PE=1 SV=2)

HSP 1 Score: 114.8 bits (286), Expect = 8.8e-24
Identity = 79/217 (36.41%), Positives = 107/217 (49.31%), Query Frame = 0

Query: 1313 LLQQYRKAIGWTL----ADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEV 1372
            L Q+YR+ I   L    ADI  I      H I ++ G+     Q   +    ++ + K V
Sbjct: 586  LQQKYREIIRNDLPPRPADINNIP---VKHDIEIKPGARLPRLQPYHVTEKNEQEINKIV 645

Query: 1373 IKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRTVTGWRVCMDYRRLNK 1432
             K LD   I P + S   SPV  VPKK G                   +R+C+DYR LNK
Sbjct: 646  QKLLDNKFIVP-SKSPCSSPVVLVPKKDGT------------------FRLCVDYRTLNK 705

Query: 1433 ATRKDHFPLPFIDQMLDRLAGQTYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRR 1492
            AT  D FPLP ID +L R+     +  LD +SGY+QI + P+D+ KT F  P G + +  
Sbjct: 706  ATISDPFPLPRIDNLLSRIGNAQIFTTLDLHSGYHQIPMEPKDRYKTAFVTPSGKYEYTV 765

Query: 1493 MPFSLCNAQATFQRCMLTIFSDMIESTVEVFMDDFSV 1526
            MPF L NA +TF R M   F D+    V V++DD  +
Sbjct: 766  MPFGLVNAPSTFARYMADTFRDL--RFVNVYLDDILI 778

BLAST of Lag0035274 vs. ExPASy Swiss-Prot
Match: P04323 (Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogaster OX=7227 GN=pol PE=4 SV=1)

HSP 1 Score: 98.6 bits (244), Expect = 6.5e-19
Identity = 57/167 (34.13%), Positives = 91/167 (54.49%), Query Frame = 0

Query: 1359 AMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRTVTGWR 1418
            A ++ V+ ++   L+ GII   ++S + SP+  VPKK   +    K            +R
Sbjct: 218  AYEQEVESQIQDMLNQGII-RTSNSPYNSPIWVVPKKQDAS---GKQK----------FR 277

Query: 1419 VCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQTYYCFLDGYSGYNQITIAPEDQEKTTFT 1478
            + +DYR+LN+ T  D  P+P +D++L +L    Y+  +D   G++QI + PE   KT F+
Sbjct: 278  IVIDYRKLNEITVGDRHPIPNMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAFS 337

Query: 1479 CPYGTFAFRRMPFSLCNAQATFQRCMLTIFSDMIESTVEVFMDDFSV 1526
              +G + + RMPF L NA ATFQRCM  I   ++     V++DD  V
Sbjct: 338  TKHGHYEYLRMPFGLKNAPATFQRCMNDILRPLLNKHCLVYLDDIIV 370

BLAST of Lag0035274 vs. ExPASy Swiss-Prot
Match: P31843 (RNA-directed DNA polymerase homolog OS=Oenothera berteroana OX=3950 PE=4 SV=1)

HSP 1 Score: 98.6 bits (244), Expect = 6.5e-19
Identity = 49/108 (45.37%), Positives = 67/108 (62.04%), Query Frame = 0

Query: 1418 RVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQTYYCFLDGYSGYNQITIAPEDQEKTTF 1477
            R+C+DYR L K T K+ +P+P +D + DRLA  T++  LD  SGY Q+ IA  D+ KTT 
Sbjct: 7    RMCIDYRALTKVTIKNKYPIPRVDDLFDRLAQATWFTKLDLRSGYWQVRIAKGDEPKTTC 66

Query: 1478 TCPYGTFAFRRMPFSLCNAQATFQRCMLTIFSDMIESTVEVFMDDFSV 1526
               YG+F FR MPF L NA ATF   M  +  + ++  V V++DD  V
Sbjct: 67   VTRYGSFEFRVMPFGLTNALATFCNLMNNVLYEYLDHFVVVYLDDLVV 114

BLAST of Lag0035274 vs. ExPASy Swiss-Prot
Match: P10394 (Retrovirus-related Pol polyprotein from transposon 412 OS=Drosophila melanogaster OX=7227 GN=POL PE=4 SV=1)

HSP 1 Score: 96.3 bits (238), Expect = 3.2e-18
Identity = 57/165 (34.55%), Positives = 87/165 (52.73%), Query Frame = 0

Query: 1362 EVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRTVTGWRVCM 1421
            E ++ +V K +   I+ P + S + SP+  VPKK              P      WR+ +
Sbjct: 328  EEIQAQVQKLIKDKIVEP-SVSQYNSPLLLVPKKSS------------PNSDKKKWRLVI 387

Query: 1422 DYRRLNKATRKDHFPLPFIDQMLDRLAGQTYYCFLDGYSGYNQITIAPEDQEKTTFTCPY 1481
            DYR++NK    D FPLP ID +LD+L    Y+  LD  SG++QI +    ++ T+F+   
Sbjct: 388  DYRQINKKLLADKFPLPRIDDILDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTSN 447

Query: 1482 GTFAFRRMPFSLCNAQATFQRCMLTIFSDMIESTVEVFMDDFSVL 1527
            G++ F R+PF L  A  +FQR M   FS +  S   ++MDD  V+
Sbjct: 448  GSYRFTRLPFGLKIAPNSFQRMMTIAFSGIEPSQAFLYMDDLIVI 479

BLAST of Lag0035274 vs. ExPASy TrEMBL
Match: A0A2G9HYA0 (Reverse transcriptase OS=Handroanthus impetiginosus OX=429701 GN=CDL12_04802 PE=4 SV=1)

HSP 1 Score: 823.9 bits (2127), Expect = 1.1e-234
Identity = 459/912 (50.33%), Positives = 572/912 (62.72%), Query Frame = 0

Query: 698  FRQLEDETFSEAWERFKELLRKCTHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAK 757
            FRQ   ET  EAW RF+++LR C +H +P  IQ+ TFY+GL    +  +D   G + L+ 
Sbjct: 3    FRQGVSETVYEAWSRFRKMLRNCPNHDIPRHIQVHTFYHGLTEGGKDKLDHLNGDSFLSG 62

Query: 758  PFDEAYEMLERISINSCQWSDVRGKNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISH 817
               E + +L  +  N  +    R    K   V+EVD V+ + A +  +  ++KN  V   
Sbjct: 63   TTAECHNLLNNLVANHYEKKSERATPPKAAGVIEVDQVTALNAKIDFLMQSMKNFGVNQV 122

Query: 818  QQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPG 877
            Q  P               C  CGE H  + CP +  S+ FV N R   NNPYSN YNPG
Sbjct: 123  QHTPV-------------TCEECGEGHPSDQCPHSVESIQFVSNARKPQNNPYSNTYNPG 182

Query: 878  WRNHPNFSWG---GQGSNVQAQQKMNQSGFAKTQVMPQQNKQALPQQNSGNSLEGMMKEF 937
            WR HPNFSW    GQGS  + QQ               Q +   P Q    SLE  + +F
Sbjct: 183  WRQHPNFSWNNNQGQGSAPRFQQ-------------GGQQQVQQPMQEKKPSLEETLIQF 242

Query: 938  MARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRS 997
            MA       S  A+ + +E Q+GQLAN + +RPQG LPS+TE +PR++GK Q +AVTLR+
Sbjct: 243  MA-------STAANFKTMETQIGQLANAINSRPQGSLPSNTEPNPRQDGKAQCQAVTLRN 302

Query: 998  GKPLEESRKTQDLNSNSDNIIVIEKELESGQGVGGSKENAGASGSVPDVEPPYVPPPPYV 1057
            G+ L+E  K +   S    +I  EKE E                    VE P     P  
Sbjct: 303  GRELQEVVK-EPTKSKEKEVISEEKEKE--------------------VEAPLEVSKPTT 362

Query: 1058 PPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYVKFLKDILTKKKRLGE 1117
               PFPQR + +  + QF KFLE+ K+LHINIP  EA+EQMP+YVKF+KDIL+KK+RLG+
Sbjct: 363  LQPPFPQRLQKQKLEKQFLKFLEVFKKLHINIPFAEALEQMPSYVKFMKDILSKKRRLGD 422

Query: 1118 FETVSLTEECSVILKNGLPPKAKDPRSFTIPVSIGGKELGRALCDLGASINLMPLSVYRK 1177
            +ETV+LTEECS I++N LPPK KDP SFTIP +IG    GRALCDLGASINLMP S+YR 
Sbjct: 423  YETVALTEECSAIIQNKLPPKLKDPGSFTIPCTIGTHFSGRALCDLGASINLMPYSIYRT 482

Query: 1178 LGIGEARPTTVTLQLADRS----------------------------------------- 1237
            LG+GEA+PT++TLQLADRS                                         
Sbjct: 483  LGLGEAKPTSITLQLADRSLTYPKGVIEDILVKVDKFIFPADFVVLDMEVDSEVPIILGR 542

Query: 1238 SILATGRTLIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV------- 1297
              LATGRTLIDVQKGELTMRV ++++ FNVFKAMK+P+E ++C  + + +          
Sbjct: 543  PFLATGRTLIDVQKGELTMRVQDQQITFNVFKAMKFPNESDECFAVSLFDKLAGNESIAE 602

Query: 1298 -----IEKATQDSADKHSEKHGE------------------------APPIKPSLIEAPT 1357
                 +E+A  D  D+ +E+  E                        +  +KPS+ + PT
Sbjct: 603  PPLDPLERALLDLLDEENEEDLEVVKTLDASKFLKSRRVESLERTTPSKVLKPSIEDPPT 662

Query: 1358 LDLKPLPDHLKYVYLGESETLPIIVALDLMPEHEEALIKLLQQYRKAIGWTLADIQGISS 1417
            L+LKPLP HL Y YLGES+TLP+I++  L     E L+++L+ ++ AIGWT+ADI+GIS 
Sbjct: 663  LELKPLPSHLCYAYLGESDTLPVIISSSLSDLQVEKLLRVLRNHKGAIGWTIADIKGISP 722

Query: 1418 SFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVP 1477
            SFCMHKI LE+    S+E QRRLNP MKEVVKKE+IKWLDAGIIYPI+DS+WVSPVQCVP
Sbjct: 723  SFCMHKILLEDDQKPSVESQRRLNPIMKEVVKKEIIKWLDAGIIYPISDSSWVSPVQCVP 782

Query: 1478 KKGGVTVVCNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQTYY 1526
            KKGG+TVV N  NELIPTRTVTGWRVCMDYR+LNKATRKDHFPLPFIDQMLDRLAG+ +Y
Sbjct: 783  KKGGITVVPNMHNELIPTRTVTGWRVCMDYRKLNKATRKDHFPLPFIDQMLDRLAGKEFY 842

BLAST of Lag0035274 vs. ExPASy TrEMBL
Match: A0A2G9HH15 (Reverse transcriptase OS=Handroanthus impetiginosus OX=429701 GN=CDL12_10756 PE=4 SV=1)

HSP 1 Score: 813.5 bits (2100), Expect = 1.5e-231
Identity = 480/1072 (44.78%), Positives = 613/1072 (57.18%), Query Frame = 0

Query: 574  QQNPLFEQNEQRNNQAENPILIAND--RTRAIRAYAVPMFNELNPGIARPQIQAANFEMK 633
            ++  L E  EQ     EN I+I  D      +R  A+P   E    +  P++  A  +++
Sbjct: 79   RRRKLAEHVEQEVKVKENQIIIMADNYENTPVRNLALPRARERKSCVVFPEL-PAGVKVE 138

Query: 634  PVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKS 693
              M +M+Q   QF GLS E+P+ H+ +FL + D+   +GV +DALRL LF +SL   A  
Sbjct: 139  IPMIRMIQNTAQFCGLSHENPNRHIDNFLKICDTLRQEGVSKDALRLRLFSFSLLGDALD 198

Query: 694  W---------------------------------SEIVGFRQLEDETFSEAWERFKELLR 753
            W                                 +EI+ FRQ   ET  EAW RF+++LR
Sbjct: 199  WFESLPEDSITTWVQLEEQFISKFFSPEKIAALRAEIMTFRQGVSETVYEAWSRFRKMLR 258

Query: 754  KCTHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKPFDEAYEMLERISINSCQWSD 813
             C +H +P  IQ+ TFY+GL    +  +D   G + L+    E + +L  +  N  +   
Sbjct: 259  NCPNHDIPRHIQVHTFYHGLTNGGKDKLDHLNGDSFLSGTTAECHNLLNNLVANHYEKKL 318

Query: 814  VRGKNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACV 873
             R    K   V+EVD V+ + A +  +  ++KN  V   Q  P               C 
Sbjct: 319  ERATPPKAAGVIEVDQVTALNAKIDFLMQSMKNFGVNQVQHTPV-------------TCE 378

Query: 874  YCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWG---GQGSNVQA 933
             CGE H  + CP +  S+ FV N R   NNPYSN YNPGWR HPNFSW    GQG  ++ 
Sbjct: 379  ECGEGHPSDQCPHSVESIQFVSNARKPQNNPYSNTYNPGWRQHPNFSWNNNQGQGLALRF 438

Query: 934  QQKMNQSGFAKTQVMPQQNKQALPQQNSGNSLEGMMKEFMARTDAAIQSNQASMRALELQ 993
            QQ               Q +   P Q    SLE  + +FMA       S  A+ + +E Q
Sbjct: 439  QQ-------------GGQQQVQQPMQEKKPSLEETLIQFMA-------STAANFKMMETQ 498

Query: 994  VGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNSDNII 1053
            +GQLAN + +RP+  LPS+TE +PR++ K Q +AVTLR+G  L+E  K +   S    +I
Sbjct: 499  IGQLANAINSRPRRSLPSNTEPNPRQDSKAQCQAVTLRNGTELQEVVK-EPTKSKEKEVI 558

Query: 1054 VIEKELESGQGVGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKF 1113
              EK  E                                                     
Sbjct: 559  SEEKGKE----------------------------------------------------- 618

Query: 1114 LEILKQLHINIPL-VEAIEQMPNYVKFLKDILTKKKRLGEFETVSLTEECSVILKNGLPP 1173
                    I  PL V+A+EQMP+YVKF+KDIL+KK+RLG++ETV+LTEECS I++N LPP
Sbjct: 619  --------IEAPLEVKALEQMPSYVKFMKDILSKKRRLGDYETVALTEECSAIIQNKLPP 678

Query: 1174 KAKDPRSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRS- 1233
            K KDP SFTIP +IG    GRALCDLGASINLMP S+YR LG+GEA+PT++TLQLADRS 
Sbjct: 679  KLKDPGSFTIPCTIGTHFSGRALCDLGASINLMPYSIYRTLGLGEAKPTSITLQLADRSL 738

Query: 1234 ----------------------------------------SILATGRTLIDVQKGELTMR 1293
                                                      LATGRTLIDVQKGELTMR
Sbjct: 739  TYPNGVIEDILVKVDKFIFPADFVVLDMEVDIEVPIILGRPFLATGRTLIDVQKGELTMR 798

Query: 1294 VCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV------------IEKATQDSADKHSEK 1353
            V ++++ FNVFKAMK+P+E ++C  + + +               +E+A  D  D+ +E+
Sbjct: 799  VQDQQITFNVFKAMKFPNESDECFAVSLFDKLAGNESIAEQPLDPLERALLDLLDEENEE 858

Query: 1354 HGE----------------------APP--IKPSLIEAPTLDLKPLPDHLKYVYLGESET 1413
              E                      AP   +KPS+ E PTL+LKPLP HL Y YLGES+T
Sbjct: 859  DCEVVKTLDASKYFKSRGVESLERTAPSKVLKPSIEEPPTLELKPLPSHLCYAYLGESDT 918

Query: 1414 LPIIVALDLMPEHEEALIKLLQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQ 1473
            LP+I++  L     E L+++L+ ++ AIGWT+ADI+GIS SFCMHKI LE+G   S+E Q
Sbjct: 919  LPVIISSSLSDLQVEKLLRVLRNHKGAIGWTIADIKGISPSFCMHKILLEDGQKPSVESQ 978

Query: 1474 RRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRT 1526
            RRLNP MKEVVKKE+IKWLDAGIIYPI+DS+WVSPVQCVPKKGG+TVV N  NELIPTRT
Sbjct: 979  RRLNPIMKEVVKKEIIKWLDAGIIYPISDSSWVSPVQCVPKKGGITVVHNMHNELIPTRT 1038

BLAST of Lag0035274 vs. ExPASy TrEMBL
Match: A0A2G9HYD8 (Reverse transcriptase OS=Handroanthus impetiginosus OX=429701 GN=CDL12_04775 PE=4 SV=1)

HSP 1 Score: 800.0 bits (2065), Expect = 1.7e-227
Identity = 451/912 (49.45%), Positives = 568/912 (62.28%), Query Frame = 0

Query: 698  FRQLEDETFSEAWERFKELLRKCTHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAK 757
            FRQ   ET  EAW RF+++LR C +H +P  IQ+ TFY+GL    +  +D   G + L+ 
Sbjct: 3    FRQGVSETVYEAWSRFRKMLRNCPNHDIPRHIQVHTFYHGLTEGGKDKLDHLNGNSFLSG 62

Query: 758  PFDEAYEMLERISINSCQWSDVRGKNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISH 817
               E + +L  +  N  +    R    K   V+EVD V+ + A +  +  ++KN      
Sbjct: 63   TTAECHNLLNNLVANHYKKKSERATPPKAARVIEVDQVTALNAKIDFLMQSMKNF----- 122

Query: 818  QQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPG 877
                                    E H  + CP +  S+ FV N R   NNPYSN YNPG
Sbjct: 123  ------------------------ESHPSDQCPHSVESIQFVSNARKPQNNPYSNTYNPG 182

Query: 878  WRNHPNFSWG---GQGSNVQAQQKMNQSGFAKTQVMPQQNKQALPQQNSGNSLEGMMKEF 937
            WR HPNFSW    GQGS  + QQ   Q         P Q     P Q    SLE  + +F
Sbjct: 183  WRQHPNFSWNNNQGQGSAPRFQQGGQQ---------PVQQ----PMQEKKPSLEETLIQF 242

Query: 938  MARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRS 997
            MA       S  A+ + +E Q+GQLAN + +RPQG LPS+TE +PR++GK Q +AVTLR+
Sbjct: 243  MA-------SIAANFKTMETQIGQLANAINSRPQGSLPSNTEPNPRQDGKAQCQAVTLRN 302

Query: 998  GKPLEESRKTQDLNSNSDNIIVIEKELESGQGVGGSKENAGASGSVPDVEPPYVPPPPYV 1057
            G+ L+E  K +   S    +I  EKE E                    VE P     P  
Sbjct: 303  GRKLQEVVK-KPTKSKEKEVISKEKEKE--------------------VEAPLEVSKPTT 362

Query: 1058 PPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYVKFLKDILTKKKRLGE 1117
               PFPQ+ + +  + QF KFLE+ K+LHINIP  EA+EQMP+YVKF+KDIL+KK+RLG+
Sbjct: 363  LQPPFPQKLQKQKLEKQFLKFLEVFKKLHINIPFAEALEQMPSYVKFMKDILSKKRRLGD 422

Query: 1118 FETVSLTEECSVILKNGLPPKAKDPRSFTIPVSIGGKELGRALCDLGASINLMPLSVYRK 1177
            +ET +LTEEC+ I++N LPPK KDP SFTIP +IG    GRALCDLGASINLMP S+YR 
Sbjct: 423  YETAALTEECNAIIQNKLPPKLKDPGSFTIPCTIGTHFSGRALCDLGASINLMPYSIYRT 482

Query: 1178 LGIGEARPTTVTLQLADRS----------------------------------------- 1237
            LG+GEA+PT++TLQLADRS                                         
Sbjct: 483  LGLGEAKPTSITLQLADRSLTYPKGVIEDILVKVDKFIFPADFVVLDMEVDSEVPIILGR 542

Query: 1238 SILATGRTLIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV------- 1297
              LATGRTLIDVQKGELTMRV ++++ FNVFKAMK+P+E ++C  + + ++         
Sbjct: 543  PFLATGRTLIDVQKGELTMRVQDQQITFNVFKAMKFPNESDECFSVSLFDNLAGNESIAE 602

Query: 1298 -----IEKATQDSADKHSEKHGE------------------------APPIKPSLIEAPT 1357
                 +E+A  D  ++ +E+  E                        +  +KPS+ + PT
Sbjct: 603  QPLDSLERALLDLIEEGNEEDLEVVKTLNASKFFKSRGVESLERTTPSKVLKPSIEDPPT 662

Query: 1358 LDLKPLPDHLKYVYLGESETLPIIVALDLMPEHEEALIKLLQQYRKAIGWTLADIQGISS 1417
            L+LKPLP+HL YVYLGES+TLP+I++  L     E L+++L+ ++ AIGWT+ADI+GIS 
Sbjct: 663  LELKPLPNHLCYVYLGESDTLPVIISSSLSDLQVEKLLRVLRNHKGAIGWTIADIKGISP 722

Query: 1418 SFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVP 1477
            SFCMHKI LE+    S+E QRRLN  MKEVVKKE+IKWLDAGIIYPI+DS+WVSPVQCVP
Sbjct: 723  SFCMHKILLEDDQKPSVESQRRLNSIMKEVVKKEIIKWLDAGIIYPISDSSWVSPVQCVP 782

Query: 1478 KKGGVTVVCNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQTYY 1526
            KKGG+TVV N  NELIPTRTVTGWRVCMDYR+LNKATRKDHFPLPFIDQMLDRLAG+ +Y
Sbjct: 783  KKGGITVVPNMHNELIPTRTVTGWRVCMDYRKLNKATRKDHFPLPFIDQMLDRLAGKEFY 842

BLAST of Lag0035274 vs. ExPASy TrEMBL
Match: A0A6P8DD93 (uncharacterized protein LOC116206453 OS=Punica granatum OX=22663 GN=LOC116206453 PE=4 SV=1)

HSP 1 Score: 797.0 bits (2057), Expect = 1.4e-226
Identity = 468/1046 (44.74%), Positives = 620/1046 (59.27%), Query Frame = 0

Query: 601  RAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFL 660
            RA+R YAVP    +   I RP I A NFE+KP + QM+Q+  QF G  +E P  H+  FL
Sbjct: 52   RALRDYAVPTI--MGSAIRRPTIPANNFELKPALIQMVQS-NQFGGYPNESPDEHIAGFL 111

Query: 661  GVSDSFVIQGVPRDALRLTLFPYSLRDGAKSW---------------------------- 720
               ++  +  V  D +RL LFP+SLRD A++W                            
Sbjct: 112  QYCNTVKMNNVTDDVIRLQLFPFSLRDKARAWFNSLPQESITTWADLSSKFLRRFFPPAR 171

Query: 721  -----SEIVGFRQLEDETFSEAWERFKELLRKCTHHGLPHCIQMETFYNGLNGVTQGMVD 780
                 +EI  F +   E+  EAWERFKE +RKC HHGLP  + +E FY  L+   + +VD
Sbjct: 172  TARLRNEITNFTKFNGESLYEAWERFKEAIRKCPHHGLPDNLLIEVFYLSLDDTLRSLVD 231

Query: 781  ASAGGALLAKPFDEAYEMLERISINSCQWSDVRGKNKKVKSVLEVDGVSTIRADLAMIAN 840
            A+AGGAL+ K +DEA  ++E ++ ++  W + R K+ +V SV ++D ++ +   ++ +  
Sbjct: 232  AAAGGALMGKNYDEASALIEEMASSAHNWQNERSKS-RVASVNDMDTIANLTTQISALTT 291

Query: 841  ALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPS-NPAS------VFFVG 900
             +  +T            +   NQV    C  C   H+   C S NP++      V FV 
Sbjct: 292  QVSKLT---------SAHSFNTNQVA--FCELCSGPHSTLECMSGNPSASPNGEQVNFVN 351

Query: 901  N-QRNN--PYSNFYNPGWRNHPNFSWGGQGSNVQAQQKMNQSGFAKTQVMPQQNKQALPQ 960
            N QR+N  PYSN YNPGWRNHPNFSW  + + ++      + G       P QN    P 
Sbjct: 352  NFQRSNQGPYSNTYNPGWRNHPNFSWRNENNALKPPPGFQKQG-------PAQN---APP 411

Query: 961  QNSGNSLEGMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRR 1020
            Q S + +E +M  +M +TD  +Q+ QA++R LE Q+ Q++ +L  RP G LPS+TE    
Sbjct: 412  QQSQSRMEELMLSYMQKTDTMLQNQQATIRNLEGQISQISQQLSNRPSGSLPSNTE---- 471

Query: 1021 EGKEQVKAVTLRSGKPLE-ESRKTQDLNSNSDNIIVIEKELESGQGVGGSKENAGASGSV 1080
            E  + V A+ LRSGK LE  +RK Q    + +     +K  E  Q   G K         
Sbjct: 472  ENPKGVNAIMLRSGKELEIVNRKAQTQEESPEKDKGKQKVEEPRQKSLGVK--------- 531

Query: 1081 PDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYVK 1140
                       PYVPP+PFP+R K +  D QF KFL++ K+L INIP  EA++QMP+Y +
Sbjct: 532  -----------PYVPPVPFPRRLKQQQLDAQFAKFLDVFKKLQINIPFAEALQQMPSYAR 591

Query: 1141 FLKDILTKKKRLGEFETVSLTEECSVILKN---GLPPKAKDPRSFTIPVSIGGKELGRAL 1200
            F+KD+LTKK++    E V LT ECS+IL+     LP K +D  SFT+P +IG       L
Sbjct: 592  FMKDLLTKKRKFDGSEPVMLTGECSMILQKDLPNLPRKQRDQGSFTVPCTIGNFHFENVL 651

Query: 1201 CDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRS------------------------ 1260
             D GASINLMPLS++RKLG+GE + T VTLQLADRS                        
Sbjct: 652  IDSGASINLMPLSIFRKLGLGECKKTHVTLQLADRSIKYPKGIVENVLVKVDKFIFPVDF 711

Query: 1261 -----------------SILATGRTLIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDC 1320
                               LATG+ LIDV++G+LT+RV NE++ FNV+ A+K  D+ + C
Sbjct: 712  IVLEMEEDREVPMILGRPFLATGKALIDVEQGKLTLRVMNEQITFNVYDAIKKFDDGKSC 771

Query: 1321 SFIRILE----STVIEKATQDSA-------------DKHSEKHGE--------------A 1380
              I I++     +V EKA  D+              D+H E+  E               
Sbjct: 772  YTIDIIDELISESVEEKAGVDTMESVLRDLDDWSDDDEHEEESVEKVSEIKARYYEELGT 831

Query: 1381 PPIKP--SLIEAPTLDLKPLPDHLKYVYLGESETLPIIVALDLMPEHEEALIKLLQQYRK 1440
               KP  SL ++P L+LKPLP HLKY YLG  +TLPII++  L  + E+ L+ +L+++++
Sbjct: 832  SATKPVSSLTQSPVLELKPLPSHLKYAYLGIDDTLPIIISSSLTGDQEQQLLSVLREHKE 891

Query: 1441 AIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYP 1500
            AIGWT+ADI+GIS   C H+I LE      ++ QRRLNP +KEVVKKEV+K LDAGIIYP
Sbjct: 892  AIGWTIADIKGISPLICTHRIMLEAECKPIVQPQRRLNPTLKEVVKKEVLKLLDAGIIYP 951

Query: 1501 IADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPF 1526
            I+DS WVSPVQ VPKKGG+TVV N+ N+LIPTRTVTGWRVC+DYR+LN ATRKDH PLPF
Sbjct: 952  ISDSKWVSPVQVVPKKGGMTVVKNEVNKLIPTRTVTGWRVCIDYRKLNDATRKDHLPLPF 1011

BLAST of Lag0035274 vs. ExPASy TrEMBL
Match: A0A6P8DKJ2 (uncharacterized protein LOC116204231 OS=Punica granatum OX=22663 GN=LOC116204231 PE=4 SV=1)

HSP 1 Score: 796.6 bits (2056), Expect = 1.9e-226
Identity = 469/1046 (44.84%), Positives = 621/1046 (59.37%), Query Frame = 0

Query: 601  RAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFL 660
            RA+R YAVP    +   I RP I A NFE+KP + QM+Q+  QF G  +E P  H+  FL
Sbjct: 158  RALRDYAVPTI--MGSAIRRPTIPANNFELKPALIQMVQS-NQFGGYPNESPDEHIAGFL 217

Query: 661  GVSDSFVIQGVPRDALRLTLFPYSLRDGAKSW---------------------------- 720
               ++  +  V  D +RL LFP+SLRD A++W                            
Sbjct: 218  QYCNTVKMNNVTDDVIRLQLFPFSLRDKARAWFNSLPQESITTWADLSSKFLRRFFPPAR 277

Query: 721  -----SEIVGFRQLEDETFSEAWERFKELLRKCTHHGLPHCIQMETFYNGLNGVTQGMVD 780
                 +EI  F +   E+  EAWERFKE +RKC HHGLP  + +E FY  L+   + +VD
Sbjct: 278  TARLRNEITNFTKFNGESLYEAWERFKEAIRKCPHHGLPDNLLIEVFYLSLDDTLRSLVD 337

Query: 781  ASAGGALLAKPFDEAYEMLERISINSCQWSDVRGKNKKVKSVLEVDGVSTIRADLAMIAN 840
            A+AGGAL+ K +DEA  ++E ++ ++  W + R K+ +V SV ++D ++ +   ++ +  
Sbjct: 338  AAAGGALMGKNYDEASALIEEMASSAHNWQNERSKS-RVASVNDMDTIANLTTQISALTT 397

Query: 841  ALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPS-NPAS------VFFVG 900
             +  +T            +   NQV    C  C   H+   C S NP++      V FV 
Sbjct: 398  QVSKLT---------SAHSFNTNQVA--FCELCSGPHSTLECMSGNPSASPNGEQVNFVN 457

Query: 901  N-QRNN--PYSNFYNPGWRNHPNFSWGGQGSNVQAQQKMNQSGFAKTQVMPQQNKQALPQ 960
            N QR+N  PYSN YNPGWRNHPNFSW  + + ++      + G       P QN    P 
Sbjct: 458  NFQRSNQGPYSNTYNPGWRNHPNFSWRNENNALKPPPGFQKQG-------PAQN---APP 517

Query: 961  QNSGNSLEGMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRR 1020
            Q S + +E +M  +M +TD  +Q+ QA++R LE Q+ Q++ +L  RP G LPS+TE    
Sbjct: 518  QQSQSRMEELMLSYMQKTDTMLQNQQATIRNLEGQISQISQQLSNRPSGSLPSNTE---- 577

Query: 1021 EGKEQVKAVTLRSGKPLE-ESRKTQDLNSNSDNIIVIEKELESGQGVGGSKENAGASGSV 1080
            E  + V A+ LRSGK LE  +RK Q            E+  E  +G    +E    S  V
Sbjct: 578  ENPKGVNAIMLRSGKELEIVNRKAQ----------TQEESPEKDKGKQKVEEPRRKSLGV 637

Query: 1081 PDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYVK 1140
                       PYVPP+PFP R K +  D QF KFL++ K+L INIP  EA++QMP+Y +
Sbjct: 638  ----------KPYVPPVPFPGRLKQQQLDAQFAKFLDVFKKLQINIPFAEALQQMPSYAR 697

Query: 1141 FLKDILTKKKRLGEFETVSLTEECSVILKN---GLPPKAKDPRSFTIPVSIGGKELGRAL 1200
            F+KD+LTKK++    E V LT ECS+IL+     LP K +D  SFT+P +IG       L
Sbjct: 698  FMKDLLTKKRKFDGSEPVMLTGECSMILQKDLPNLPRKQRDQGSFTVPCTIGNFHFENVL 757

Query: 1201 CDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRS------------------------ 1260
             D GASINLMPLS++RKLG+GE + T +TLQLADRS                        
Sbjct: 758  IDSGASINLMPLSIFRKLGLGECKKTHITLQLADRSIKYPKGIVENVLVKVDKFIFPVDF 817

Query: 1261 -----------------SILATGRTLIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDC 1320
                               LATG+ LIDV++G+LT+RV NE++ FNV+ A+K  D+ + C
Sbjct: 818  IVLEMEEDREVPMILGRPFLATGKALIDVEQGKLTLRVMNEQITFNVYDAIKKFDDGKSC 877

Query: 1321 SFIRILE----STVIEKATQDSA-------------DKHSEKHGE--------------A 1380
              I I++     +V EKA  D+              D+H E+  E               
Sbjct: 878  YTIDIIDELISESVEEKAGVDTMESVLRDLDDWSDDDEHEEESVEKVSEIKARYYEELGT 937

Query: 1381 PPIKP--SLIEAPTLDLKPLPDHLKYVYLGESETLPIIVALDLMPEHEEALIKLLQQYRK 1440
               KP  SL ++P L+LKPLP HLKY YLG  +TLPII++  L  + E+ L+ +L+++++
Sbjct: 938  SATKPVSSLTQSPVLELKPLPSHLKYAYLGIDDTLPIIISSSLTGDQEQQLLSVLREHKE 997

Query: 1441 AIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYP 1500
            AIGWT+ADI+GIS   C H+I LE      ++ QRRLNP +KEVVKKEV+K LDAGIIYP
Sbjct: 998  AIGWTIADIKGISPLICTHRIMLEAECKPIVQPQRRLNPTLKEVVKKEVLKLLDAGIIYP 1057

Query: 1501 IADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPF 1526
            I+DS WVSPVQ VPKKGG+TVV N+ N+LIPTRTVTGWRVC+DYR+LN ATRKDHFPLPF
Sbjct: 1058 ISDSKWVSPVQVVPKKGGMTVVKNEVNKLIPTRTVTGWRVCIDYRKLNDATRKDHFPLPF 1117

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
XP_017239676.1	9.7e-254	47.69	PREDICTED: uncharacterized protein LOC108212460 [Daucus carota subsp. sativus]
XP_016646912.1	1.8e-244	46.87	PREDICTED: uncharacterized protein LOC103318979 [Prunus mume]
XP_038976300.1	5.3e-244	45.40	uncharacterized protein LOC120107204 [Phoenix dactylifera]
XP_038972405.1	9.1e-244	45.40	uncharacterized protein LOC120104748 [Phoenix dactylifera]
XP_038973683.1	9.1e-244	45.40	uncharacterized protein LOC120105384 [Phoenix dactylifera]

Match Name	E-value	Identity (%)	Description
Q99315	8.8e-24	36.41	Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...
Q7LHG5	8.8e-24	36.41	Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...
P04323	6.5e-19	34.13	Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogast...
P31843	6.5e-19	45.37	RNA-directed DNA polymerase homolog OS=Oenothera berteroana OX=3950 PE=4 SV=1
P10394	3.2e-18	34.55	Retrovirus-related Pol polyprotein from transposon 412 OS=Drosophila melanogaste...

Match Name	E-value	Identity (%)	Description
A0A2G9HYA0	1.1e-234	50.33	Reverse transcriptase OS=Handroanthus impetiginosus OX=429701 GN=CDL12_04802 PE=...
A0A2G9HH15	1.5e-231	44.78	Reverse transcriptase OS=Handroanthus impetiginosus OX=429701 GN=CDL12_10756 PE=...
A0A2G9HYD8	1.7e-227	49.45	Reverse transcriptase OS=Handroanthus impetiginosus OX=429701 GN=CDL12_04775 PE=...
A0A6P8DD93	1.4e-226	44.74	uncharacterized protein LOC116206453 OS=Punica granatum OX=22663 GN=LOC116206453...
A0A6P8DKJ2	1.9e-226	44.84	uncharacterized protein LOC116204231 OS=Punica granatum OX=22663 GN=LOC116204231...
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
None	No IPR available	GENE3D	3.10.10.10	HIV Type 1 Reverse Transcriptase, subunit A, domain 1	coord: 1338..1495, e-value: 5.3E-48, score: 165.1
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 960..1063
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1248..1267
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 966..1002
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1039..1058
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1..52
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 10..32
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1248..1263
None	No IPR available	PANTHER	PTHR24559	TRANSPOSON TY3-I GAG-POL POLYPROTEIN	coord: 1191..1256; coord: 135..407
None	No IPR available	PANTHER	PTHR24559:SF334	SUBFAMILY NOT NAMED	coord: 1191..1256; coord: 689..1189; coord: 135..407
None	No IPR available	PANTHER	PTHR24559	TRANSPOSON TY3-I GAG-POL POLYPROTEIN	coord: 689..1189
None	No IPR available	CDD	cd00303	retropepsin_like	coord: 1140..1226, e-value: 3.69662E-10, score: 56.1908
None	No IPR available	CDD	cd01647	RT_LTR	coord: 1375..1525, e-value: 3.10598E-57, score: 193.966
IPR005162	Retrotransposon gag domain	PFAM	PF03732	Retrotrans_gag	coord: 160..252, e-value: 1.2E-20, score: 73.6; coord: 692..739, e-value: 1.0E-5, score: 25.7
IPR000477	Reverse transcriptase domain	PFAM	PF00078	RVT_1	coord: 1409..1525, e-value: 6.7E-10, score: 38.9
IPR021109	Aspartic peptidase domain superfamily	GENE3D	2.40.70.10	Acid Proteases	coord: 1118..1232, e-value: 6.1E-12, score: 47.4
IPR043128	Reverse transcriptase/Diguanylate cyclase domain	GENE3D	3.30.70.270		coord: 1435..1525, e-value: 5.3E-48, score: 165.1
IPR043502	DNA/RNA polymerase superfamily	SUPERFAMILY	56672	DNA/RNA polymerases	coord: 1335..1526
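
The domain coordinates listed above are easier to read once the hits are ordered along the protein. The following Python sketch does that for four of the hits, with values copied from the InterPro table. It assumes the coordinates are 1-based and inclusive, which the page does not state, and the helper name `span_length` is hypothetical.

```python
# Domain hits copied from the InterPro table above: (source term, description, start, end).
hits = [
    ("cd01647", "RT_LTR", 1375, 1525),
    ("PF03732", "Retrotrans_gag", 160, 252),
    ("cd00303", "retropepsin_like", 1140, 1226),
    ("PF00078", "RVT_1", 1409, 1525),
]

def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive, so both endpoints are counted.
    return end - start + 1

# Order the hits by their start coordinate on the protein.
ordered = sorted(hits, key=lambda h: h[2])
for term, name, start, end in ordered:
    print(f"{start:>5}..{end:<5} {span_length(start, end):>4} aa  {term}  {name}")
```

Sorted this way, the gag hit comes first, followed by the retropepsin-like hit and then the two reverse transcriptase hits, which overlap at the C-terminal end of the annotated region.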

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Lag0035274.1	Lag0035274.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0009987	cellular process
biological_process	GO:0043170	macromolecule metabolic process
biological_process	GO:0006807	nitrogen compound metabolic process
biological_process	GO:0044238	primary metabolic process
molecular_function	GO:0016740	transferase activity