Lag0042148 (gene) Sponge gourd (AG‐4) v1

Overview
NameLag0042148
Typegene
OrganismLuffa acutangula (Sponge gourd (AG‐4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Locationchr13: 37420494 .. 37433271 (+)
RNA-Seq ExpressionLag0042148
SyntenyLag0042148
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAATAGTTGTGAATGCTCTGAGGTCTCGTGATTTAGAAGTGAAAAAGGTGAGTTCTAAAGAGGGTGAAGCTCATTTCACTTGAGGAAGAACTGAGAAGAAAAATACCAAAACAAACAGCAGCAGAGGAAAGTCATAAACAAATAAGGGTAGATCTAAGTCACAGTCGAGAGTTAAAAGATGCTATCACTGCCATAAGGAAGGTCATCTTCATACTGCTATGAATTGAAGAACAAGAGGAAATCAGAAGGAAAGAATAGTGAAGATGGAAACAATGTCAATGTCACTGAAGGCTACGATTCTGCAGAGGTACTAGTTGTGACTGAGGGAGATGTAGATTCTAAATGGATCCTTGACTCATGGTGCTCTTTCCATATGAAACCAAACCGTCATTGGTTTCAGGGCTTTGAACCAATGGAGGAAGGAAAGGTGCTCTTAGGCAACCCCCATGAGTGCAATGTAAGACGGATCAATTCAATTCAGGTGAAGATGTTTGATAATCAAACAAGGATCATCCCCAGGGTGAGGTACGTACCAGAACTGAAACGTAGCCTGCTGTTTTTGGGTACTTTTGATAAGGCAGGCTATGTTTGTAAGCTTGAGAATGAAGGAGCAATGGTAAAGTTGCGAGGAAAGCTTGCAAATGGATTATATATTTTAGAGGGTTCAACCATCATTGGAACAGCAGCAATAGCTTCTCTTAATGAACAACAAACTACTACCTTATGGCATAGGAGACTGGGCCATGTTAGTGAAAAAGGACTCATGGAACTCCATAAACAGGGCTTGTTGGGAAGTGAAAACTTGGGAACACTTGGTCTCTGCAAACATTGTGTTTATGGTAAGGCAACATGAGTGTCCTTCAGCAAAGGTAAACACACAACTAAAGCCAAAATGGACTATGTCCATATCGACTTATGGGGACCTAAGAAGACTAAGACTTTGGGAGGAGCTAGATTCTTCTTATCTATAGTGGACGACTACTCTAGGAGAGTGTGGGTGTATCCACTAAAGTCAAAGGATGAAACCTTCTTAAAGTTTAAAGAATGGAAAGTGTTGACAGAAAAGAAGACAAACAGAAAACTTAAATGTCTGAGGACGGATAATGGTCTTGAATTCCTTAACCATCAATTTAAGGAAATGTGTTCTAAAGAAGGTATAGAGAGGCATCTCACTGTGAAAGGAACTCCTCAGCAAAATTGCCTAGCTGATCGCATGAACATGACACTTCTTGAGCGAGTAAGATGCCTATTGTCATATGCCAAGCTAAGTAAATTTTTCTGGGGGGAAGCAGTGGTAACTGCAACATACTTAATCAACAGGAGTCCCTCATCAGCAATATGATTTAAAACTCCTATGGAAATGTGGACTGAAAAATCCACCAGACCTTACACACTTGAGGATCTCCCGGTGTGTTGCCTATGCACATATGAAAGAGGGAAAACTGGATAACAGGGCTGAGAAATGCATTCTGTTGGGATATTCTCATGGTATAAAAGGGTACAGATTGTGGGCAGCCAGGCCTAACATATGGTATGCGGTAGGGATCGTCAGTTGATACAAATTTAGTCCAGATTTAGACTACTGGAACGCAGTAAAAGCTCGTGTATAACACTAAGGATCTGATCCTTACAGGATACACTAATTCTTATCTGCACACCGATAAAGATTCAAGGAAATCCACATCAGCATCAGAATTCACCTTTAACAGGGAGCTATAATTTATCGTAGCACCAAGAATACCACTACTTGTGAAGCAGTTTAGAAGTCTGTGTGGCTAAGAAAGTTCTTGAAAAATTTGGAAGTTGTTCCAAATATGAACTTGCTAGCCACTAATGAGGCAAACACGTAGAGCATAAAAAATTATCTCATTTACGAGATCGTGCTTTGAAGAATCGTGACTGTCATAGAGATAACTTCAAAGCACAACATGCTGATCCATTTTCGAAGGCCCTCACGGCTAAAGTGTTCAAGGGACATCTAGATGGTCTAGGTCAACGAGTTCTGTACAAAGGATAATCTAGGGCAAGTGGAAGATATGTAATGGGTATATAGGATGCTCTAGTTTATTGTATTTGTACTATATGAATACTAACCCACTATGTTTCTAGATAATTGTACACCCCCCTAGAGTCTTAGTCCATGTGGGAGTTTGTTGGGTTTTATGTCCTAAAACTCGTGGATAGTAAATGTAACAAAATTAATCAATAAAGCATTATTGAGGTTATTCTATAAGTAGTTATAGAATGGTTGTTTAATTGTGCATTAATTAACCCAAATCCAATAAACTAGAACCCATGGCTATATAATGAACACTTGAACTTTATGTGTAGACATAAAAGCAAATCAAGTTCAAGTGATAGCCAAAACAGTCTATAATATATGGATAGAGGTTGGGTGCCTTATCCTCAGGACACTATGTGACGCAGTCCGCTTTGTATTTATTACAAACGACGTGATCCTGAATCGTTCATGTAACGACATGTGAATGGGGGCGTTCTATGCAAAGAGTTTGCATAAGACCTGGACCGAGAAATAAGTCACTTTCACTTTATAACACCGTTTACTGTGAAACTGACTATTTCATGTGATGACCTAGGTAATTCAATCTTAATCCTGAGCTAACTATGAACTCTTGTTTATTCGGGATTATCCTTTGATTTTCATGGGTGAGGTGACCCAACATCGCCGACTCAATAAGCCTACAATTTTGGGGATAAAACCAGGTAGATAGCTGGGGACATAGTCTTGCAAGATGGAATTCGCACCTACCCGCCTGTAGGGATAGTAGAGGTTGTTCCCTTAAGTGCTGACTCCGGATCTTGAACAAAGGCGCGTCGATTCACTCCTGGCACGAGAGGATTGTATGTTTTATAGTTGGAGCATAAACAAATTGTTTATTATAGGATCAGTGGTAGTTAAGGATCAAGATGTAACTTCAGGGGCAAAACGGACTTTTGACCTGCTGTAGTTACGAACAACCTGTGAAGGGTCGACCTATTAATTATGGTTGTATCGAGTTGACAGAAATATATCTACAGTGAGGGGAGTGCAACTAAGGGCTATAGTGGAGTGTCCCGTTAGTTAACAAATGTTGGTTAACTAGGCTAAAGAGTTTAGACAAGTTAAACTCGAATCATTGGAGCCCATGATCTGTAGGTCCATTAGGTCCCCTTGCTAGCTCATACAGAATTAACCTTAGAACAAAGTGATGAAATGATTTGAAAACGTTCAAAGTCATTTAAGAAAAATATCGTTATATTTAACGCTAAAGTTTAATTACGAATTAAACGATAAGAGAGAGTTGGGAATATTTAAATATTAGAATTGTGTAATTAATACACATTTCGATTTAAATATGAATAAAGATTCATACCGATTTAAATGTGGGGGAAATTATTTGATTTAGTGTTGAACATTAAATTAAATAATTATCATTAATTAAAAAATTTAATTAATGATATTAATATAATATATCAATTTCATTTAAAATTAATTGTCTAATTAATTATAGATGTTGAAATTGATTAAATTGAAATTAATAATTTTAATTTCTTGGTGGAAATTTTTAGCAGGAAAAACTTCATGCAATTCATGGAGATTTGACACCAAATTGGTTGACAAATCAAAATCCATCATCACTTATCCCTTCATATATAAGCATTTGTCTTCTTCTTCAAGAGAGAATAGAAGGTTCTGAATTTTTCATGCAAAAAGGAAGAAGAAATTCTCTCAAAACAGTGGGTTTTCAAGTTATTCAAAAACCCATTCTTTTTCTCCCAAATTCATCATGATTCGGTTCCCACAAGCTCGTTCTAAGGCAGGAGAATAGTGGGGAAGATCTTGATGGTTTTCTACAACTTGTTCGTGATGAGAAACGACCAACGGATTGCAAGGAATCGAGCTCCAAGAGGTATTTATTCATTCCTCTGTTTTTCGGTTTATTTACATGCTTTTAATTAGATAAATTAATGAAATTAAACGCTCAAGATCCGTTAATTGTTCCGCACGAGTTATGTAAACTTCTGCAATTGGTATCAGAGCGTGGTTTAGGCGTTTAATTTGCTTTAATTTCGAGTATGGTTGGTAATTTCTTGAGATGAATGCAGTGGGAGCTAAAGTGGATTTAATCATCTCATGATTTAGAGATTGTTATACTCTATCGGTCTTAAAATCATATAGGTTATTCTTCATTTTGCAGAACAAATATTGGCATCCTTTTTGACATGAACAGTTCATACACTAAAAGGTAATATACTAGCCCTGTTCTAGCAAACAAGATATAGAAATTAAACTTCTTTTAATCTTTTGGAATTATGTACACATTATCAAATAATATTTAATGTTCCCTAGAGAAGAATAATTTTTTATGTCTCCCACTGCCATAACCCAGATGAATTCTCCTGCCTTAGATTTAATTCATAATTTAAAGATGCCAAGATCCCTTTTGTAAAAGGAAGAACATACATGATTGGTGTTACTTGAATTAGGGATCCAAGTTGAATCATTATTCTTCACTAAGCATGTTTTCAAGACTAATAAATCATATTTTAGAGTCTTTGATGTTCTTTTAGGTTAGGCAACTTTCCTATGTTATATTCATTATAAACCTGTTGAGATGAAGTAAACATAGGAAGACGTTCCTCTATTTTAAGACCGTTTCCTAATCATTTACCATGAATTAAGTGTGTTTTCTTGGTTGTATGATGTATACCAATAATGAATTTGGCGTATAGACGAGGCATTCATAGAACTATTACAGCCTGAAAATAAACAAATTAACTAATATCGGTTGTCTAGCACTTTAACTCTTTACTTAATCCAATCAATTTAGCAAAAGTAATTAAGTACTCTATGAATCATAATTCTTTTGCAACGATACCTCAGAGGTTTAGAGCAACAACCACCGAAGGATGATCAACTACTCCTCCTCTAAATTGAGACGTTCTCAACCAAATGCTAACTTTAGAATAACTCATATTCCGTTATTGTTTAGTTGTCGTTTGTTTGGTCAATGATTTCGTTAACAATTTAACTCATCCCCGTAAGTGTAACCCTCCCAATTTCAGATCCCAGAGGTATGCACCAACAGGCTTACCGTTAGGAAAAGACAACTATTGGTGACTGAGCTAAGTGACCCTATCCATTTATGGAGTTTGCGGTGTTCTGAATCCTATGATCAGGCCCTCCGAAGGGGTGGTCGCTCCTAGGGCGACGCACAGGCGGATCATAGGAATCTCACGGTGCAACCTAGGGGTGGAGACCGTAGGATATGTTGACACGTGTCCTTCTCCCACTTACTATGAACATTTTCCCCATTCACCTTGGTATTGACCCATGCGTAATGCTCTCCGAAGGGAGGCCGCTCCCAGGGCGACCCGAAGCTGAGCATGAATCTCACGGTGTGAAACCAGGGGGACGTGAGAGTAAAAACTTGATATAAGAATCTCGTTCTATACTTTTCCTCCCACTGAAGTGTCTTATTAGTATTATAGGGTTTTATTTTGGCTTAGATAAAAATCCGGCTAAATGAAACTATCTAAGTTTATGTCTTATTAAATAAGGTATTTACACTAATGTCTAGGTGTATTAAACCTTTTTAATGACCTAGTGACATTGCATGCTAATAAGACCGTGGATAATTCAACTCAAGTACCGGATTACCGGGAGAGTTCCCTTATAACCAAAGTCTTATTGTCCTACTTAACCATATTTTAATCTAATAAAATACTAAGTAAGACTTTATTCTAATCCTATTAGAAAAAGTGATCTTAGGTCTAATTGATTTTAAACCGTTTTAAAATTAATTAATATTTCCTAAGATCCTACATTAGCATGCAATTCTTTGTTATAGGATTGATGGTTTCTATTAATTAACCCTTATAACTATTATAATTTTTAATGAATTAGTCCTATAACATGCTTCTAATGGTGATGAAAGCATATAACAATCAAATTCTAATTATAAATATATACCATTAGAGAGCATAAACATTTTGAAACAACCTATGTCTAGGTGATTTCCCAAGGCATAACAATTACGCTACAATGTCTAGGTGATAAAACGGTTTAAACCTTACACCACTCACGCATGCTCATAAAACCGGGTTAACATAACTCAAGTACCGACTCGATGCCCAGGTAGGAGGTGCTTGTAAACTGTCAACTTAAATACCTCCAGCCTTAGACAGCATCTCGCCGGGTGAGTTCTCTTGTAACCTTAGCTTTACAAGATATCGACCTATTCTATGAAAAAATTGATCGAAATTGTGGTTCGTTAACCCTAAGTGAGCATGCATTTATCTTTGTTATAGGTTTTAATATCTATCTCGTTTTATAACATTTATAACATAGAAAAACCTATAACATATATCTATATAACGATTATATAATGCTAAGGTGGTTTCATGCTACATGCTTGTTCGTTTTCGCTTAATAAATATAACGCTTATACTAAATTAACTAAACGAAAAATTTAAAACATTCAACATGCATCTAACATGAAACCTAACATGCATTTCTTTTATTATAAGTTTATAACCCTTATAATTAATTACTAATGCATGTAACATGCTTTCCTATGGTGGGATTTTAAATCTATATGACATACTTTATGCATATTATGATCCTATTTAATTTAAACATCCATTCATAATGCATAATAAATTAAACACACTAAAGTCGACTGGTTTGGCATCCTAAGCAAGCAAATTAACTAAATTATTACAAGAATAATTAATATAACGCCTTGGACCTCTCCCGAATGCTTCAAATTTGCTTGAACCAACCAAAACCATGCTGAACCAAGTCAAAAGATGCTTGAACCACCCTTAAATTGAAGAAAAAATTGTTGTACCGGCCAGACCAAGCTAAAAACAAACCATCGAAGAAAAACTGAGCCAAAACCAGTCCATATGTGCACGAAACGAACCGCGGAGGCTTGAACCAGACAGAGGACTCATCAATCTAATAGGAACCGGAAAGAAAACCGTGGAACCGAACCGAATTCGAGCCAAACTGCACGATTTTTTCCTTCACGATTCAGTCTGGTTTTCTTTGCACATCATCTTTTCCCAAATTCCAGAATTTAAAAAAAAATCTACTGCTCTGTTTTTTTCCAACTTTGCGCAGTCCTCAAATTTCTTCAAATTGAAGGCCTAAACAGACTCTATTGACTCTCAAATTTACAGACTTCATATGAACTCATTAATGCCCATAAACATGTAGGCAATTACAATTAATGTCAACGAATTGAAGGAATTAGTTTTTATTGATGATTACTCAAGGTACCCCATCTCTACCTAATACACCGTAAGTCTGAAGGCCTTGAAAAGTTCAAGAAGTATAAGGCCAATGTTAAGAACAATTAGATACGATGATAAACATACTTCGATTTGATCGAAGTAGAGAGTATAAGGACACAAAATTCTAGGACTATTTGATAGAACATGGAATTACGTCCTAAAACTACAGATACACCTCAAGAAATAAATATCGGAAAGAAGAAACATAACATTGTTAAACATGGTTCGTTTAATAATGAGTTTTGCCAAGTTGCCTAAATCGTTTTGGGATATGCAGTGTTGACTGCAGTTTAAAACACAATATGGTTCTCTCAAAGAGTGTCTTTTGAAACACTATTTGAGCTTTGAAAAGAGCATAAAGTAAGTTTATGCCATTTCAGAACTTAGGGATGTCCTGCATATATACAAAAGACATAACCGAAGGAACTGAAACCTTATTCTAAAATATGCCTATTTATAAGTCATGCCAAAGAAACGAGAGGTGGATGTTTCTTCAAAGAAAATATTTGTATCGAAAAATGCAATTTTCATAGAAGCAGACGACGTTACAGATCACAAAACCACGTCGTTGAATAGCGTTAATTAAAATTATCTAGTAAAGCTACAAGAACAATAGTTTTTGATTGAACTAACACATTTGGTCAATCCCATCCTCCTCAATTGTTGAGAGAACCTCGATGTAGTGGGAGGGTTGTTCTTCGACCTGATGAATACTATTATGAGTTTAATTGAAACTCTTGATGTCATACTTGATGACAATATCAAGGATCCATTGACCTTTAAAACAGCCAATGAATGACGTGGACAAGGATGAATGGATTAAATTATGGACCTGCAAAAGGAATCCATGTACTTTAATTCAATCTGGGTCTTGTAGATCAACATGAAGGGGTATGTCCCATAGGGTACAAATGGATGTACAAGAGGAAATGAGACTTAGCTGAAAATGTACGTACATTTAAAATAAACTTATGGCAAAGGTTTTACCCAATGAGAGGGGATTAACTATGAAGAAACTTTCTCCCTTGTTGCCATACTTAAGTCAATTAGGATACTCCCATCCATTACCATGTTTTATAATTATGAAACACGGTAAATGGATGTGAAGATTGCCATTCTGAATGGTAATCTTGAAGAGAATATCAACACGTCTCAACCAGAAGGGTCCATTACTTAGGATCAGGAACAAAAGGTTTGAAAACTTGAATCGATCCATTTATGGGTTGAAACAGGTATCCAGATCTTGAGATATAAGATTTGATACTTCAATCAAGTCTTATGGCTTTGAACAAAACGTTGATGAGTCTTGTGTCCAACAGAAGATCATCAACTACTTAGTAGTTTTACATTGTTTTTTGCATGTATGATTGATATCCTATTCATTGAGTATGACGTAGGTTATCAAAACTGACATTAAAAGAATGCTAGCAACCATGATTCCAAATAAAGGATTTGGAAAATGGCTTCCTATGTTCTTAGAATCCAAAACGTTCGGAATCATAAGAACAAAATGCTAGCCCTGTCTCAAGCTTCTTATATCGACAAACTGTTGGTTTGATATAAGATGCAGAATTCCAAGAATGGTTTTTGTTACCTTTCCAGTACGAAATTCAATTGTCTAAGAAACAGTATTCCAAACGCCCCTCAAGAAATTGAGGACATGAGAAATATTCCCTATGCCTCGACGGTAGGCAACCAAGCCTGGCATATGGTATGCGGTAGGGATCGTCAGTCGATACCAATTGAGTCCAGATTTAGACTACTGGAATGCAGTAAAAGCTCGTGTATAACACTAAGGATCTGATCCTTACGGATACACTAATTCTATTTGCACACCGATAAGGATTCAAGGAAATCTACATCGGCATCAGGATTCATCTTTAACATGGAGCTATAATTTGGCGTAACACCAAGAATACCACTACTTGTGAAGCAGTTTAGGAGTCTGTGTGGCTAAGAAAGTTCTTGAAAAATTTGGAAGTTGTTCCAAATATGGACTTGCTAGCCACTAATGAGGCAAACACATAGAGCGTAAAAATTATCTCATTTACGAGATCGTGCTCTGACGAATCGTGACTGTCACAAAGATAACTTCAAAGCACAACATGCTGATCCATTTACGAAGGTCCTCACGACTAAAGTGTTCGAGGGACATCTAGATGGTCTAGGTCAACGAGTTCTGTACAAAGGATAATCTAGGGCAAGTGGGAGATATGTAATGGGTATATAAGATGCCCTAGTTTATTGTATTTGTACTATATGAATACTAACCCACTATGTTTCTAGATAATTGTACACCCCACTAGAGTTTTATTCCAAGTGGGAGTTTGTTGAGTTTTATGTCCTAAAACTCGTGGATAGTAAATGTAAAAAGATTAATCAATAAAGCGTTATTGAGGTTATTCTATAAGTAGTTATAGAATGGTTGTTTAATTGTGCATTAATTAACCCAAATCCAATAACTAGAACCCATGGCTATATAATGAACACTTGAAATTTATGTGTAGACATAAAAGCGCATCAAGTTCAAGTGATAGCCAAAACAGTCTATAGTATATGGATAGAGGTTGGGTGCCTTATCCTGGGGACACTATGTGACGCAGCCCGCTTTGTATTTATTACAAATGATGTGATCCTAAATCGTTCATGTAACGACATGTGAGTGGGGGCGTCCTATGCAAAAGTTTGCATAAGACCAGGACTGAGAAATAAGTCACTTTCACTTTATAACACGTTTACTGTGAAACTGACTATTTCATGCGATGACCTAGGTAACTCGATCTTAATCCTGAGCTAACTATGAACTCCTGTTTATTCGGGATTATCCTTTGATTTGCATGGGTGAGAGTGGCCCAACATCGCCGACTCAACAAGCCTACCATTTTGGGGATAAGAGGGGTGTCCGTTAGTTAACGAATGTTGGTTAACTAGGCTAAAAACTTTAGACAAATTAATCTTGGATCGTTGGATCCCATGATCTGTAGGTCCATTAGGTCCCCTTGCTAAGTCATACGGAATTAACCTTAGAACAAAGTGATGAAAGGATTTGAAAACGTTCAAAGTCATTTAAGGGAATTATCGTTATATTTAACGCTAAAGTTTAATTATGAATTAAACGATAGGAGAGAGTTGAGAATATTTAAATAAGATTTAAATACTAGAATTGTGTAATTATTACACATTTCGATTTAAATATGAATAAAGATTCATACCGGTTTAAATGTGGGGGAAATTATTTAATTTAGTGTTGAACATTAAATTAAATAATAATCATTAATTAAAAAGTTTAATTAATGATATTAATATAATATATCAATTTCATTTACAATTAATTGTCTAATTAATTATAAATGTTGAAATTGATTAAATTGAAATTAAGAATTTCAATTTCTTGGTGGCAATTTTCAGCATGAAAAACTTCATGCAATTCATGGAGATTTGACACCAAATTGGTTGGCAAATCAAAATCCACCATCACTTATCCCTTCATATATAAGTCTTTGTCTTCTTCTTCAAGAGAGAATGGAAGTTTCTGAATTTTTCATGCAAAAGGAAGATGAAATTCTCTCAAAACAGTGGGTTTTCAAGTTATTCAAAAACCCATTCTTTTTCTCCCAAATTCATCACGATTCGGTTCCCACAAGCCCGTTCTAAGGCCGGAGAATAGCGGGGAAGATCTTGATGGTTGTGTACATGTAAGTCAATTCAAGCAACAAAGCAACTATCTCAAGCACATTAACCAGAAGACAATTCCCAGAAACAATAACAACAGATTTCCAAATTGAAATCAACCAGAAATTAATACTGAAACGCAAGAAGATCAAGAAGAAGAAACAACCCGGATTTATAGTGGTTCGACCCAAAAATTGGTCTACATCCACTTAGTCACTTCCATTAGCTATATTCTTGTCAGTTGCTTCATATACAACACTCTTACATCGATTACAATCAATCGCCCATCCCTCGAACACTCGAGCAAAGAAAAACAGAGTCGAAGAATAGCTACAGCCTTTAGGATTCAAGACTTAGCTAAAAAATTGACCCTCACAGAGAAATCTCTCTAACACCACAGAGGAAATTCAATGCCTCTTTACAAGAAATCACACAGAGACCTCACCATCTCTCAAGAGAACATTTTCCCCAATTCCCTTTCAATCTCAGATTCTTTTTCCTCAGCAAGAATCTCACTTAGTGTTCTCAACTGATTTCCAGTGGCCACATGCATCAATTGTAGCACGCTTCAAATGACGGTTATTTCCTCAGCTGACTAGCACATGAGTTTACTTCAGCTCCACCTACTCCAGATGATTCAGCTTCCACGTCAGCTACATAATACAACTTGGCATAGCACGTCATTATATTTACTCTGAGGGACATAGGACTCTAATAATTTGGAACTTTGACTATCAAAGATCAACAGTACAACTTGTTCATGATGAGAAACGACCAACAGATTGCAAGGAATCGAGCTCCAAGAGGTATTTATTCATTCCTCTTTTTTTCGGTTTATTTACATGCTTTTAATTAGAGAAATTGATGAAATTAAATGCCCAAGATCCGTTAATTGCTTCCGCACGAGTTATGTAAACTGCTGCATGAATGATGAACATCAGAAGGCCGACATGTCAAATATACCTTATGCTAGTGCAGTTGGGAGTCTTATGTACCTTATGGTATGCACCCGTCCAGACTTAGCTCATGCAATGAGTGTAGTGAGTAGATTCATGTCCAATCTGGGTAAAGAACACTGACAAGCTGTAAAATGGATCATGCGGTATGTTAGGGGTACTCTGACATATGGTTTATTATATGATCAATGAGGAATAGACTCAGATATCCTAGAAGGATATGTAGATGCTGATTATGCAGCAGATTGTGACAGGAGGAGGTCATTGTCTGGGTATGTATTTACTTACCTTGGAAATCTGGTGAGTTGGAGAACTACTCTACAGTTAGTTGTGGCTCTTTCCACAACAGAATCTGAGTACATAGCAGCAACTGATGCCATAAAAGAAGGTATATGGCTAAAGGAACTGTCAAGGGAGTTATGAGCTTTGACCGAGTTGTAAAGGTACATTGTGACAGTCGGAGTGTTGTATGTCTTTCAAAGAACTACACTTATCATGACCGAACAAAACATATGGATGTTCGGTACCACTTTATAAGGGATCTCCTAAGTGATGGTGAATTTAAGCTAGAAAAGATAAGTACAGAAGATAACCCAGCAGACGCCTTTACAAAAGCTCTTCCTGTGAGAAAAATTTAGAGTTGTTTATCAACTCTCAAGATATCTCCAGTATGATGGTGATAAGAGATATAGTCACTCGTGTTTTTTTGAAAAGCAGCTGATTTGATGAAGTTTGAAGATGGTATTACCAAGGTGGAGATTGTTATTCAGTGGTAATACACATGGAGGTCAAGCTTCAGTCAAATCCCAATAGACCTTCTGTATTTGATGACATGGAATCTTAG

mRNA sequence

ATGGAAATAGTTGTGAATGCTCTGAGGTCTCGTGATTTAGAAGTGAAAAAGAACAAGAGGAAATCAGAAGGAAAGAATAGTGAAGATGGAAACAATGTCAATGTCACTGAAGGCTACGATTCTGCAGAGGTACTAGTTGTGACTGAGGGAGATGTAGATTCTAAATGGATCCTTGACTCATGGTGCTCTTTCCATATGAAACCAAACCGTCATTGGTTTCAGGGCTTTGAACCAATGGAGGAAGGAAAGGTGCTCTTAGGCAACCCCCATGAGTGCAATGTAAGACGGATCAATTCAATTCAGGTGAAGATGTTTGATAATCAAACAAGGATCATCCCCAGGGTGAGGTACGTACCAGAACTGAAACGTAGCCTGCTGTTTTTGGGTACTTTTGATAAGGCAGGCTATGTTTGTAAGCTTGAGAATGAAGGAGCAATGGTAAAGTTGCGAGGAAAGCTTGCAAATGGATTATATATTTTAGAGGGTTCAACCATCATTGGAACAGCAGCAATAGCTTCTCTTAATGAACAACAAACTACTACCTTATGGCATAGGAGACTGGGCCATGTTAGTGAAAAAGGACTCATGGAACTCCATAAACAGGGCTTGTTGGGAAGTGAAAACTTGGGAACACTTGGTCTCTGCAAACATTGTGTTTATGGAGTCCCTCATCAGCAATATGATTTAAAACTCCTATGGAAATGTGGACTGAAAAATCCACCAGACCTTACACACTTGAGGATCTCCCGGTGTGTTGCCTATGCACATATGAAAGAGGGAAAACTGGATAACAGGGCTGAGAAATGCATTCTGTTGGGATATTCTCATGGTATAAAAGGGTACAGATTGTGGGCAGCCAGGCCTAACATATGGAGAATAGTGGGGAAGATCTTGATGGTTTTCTACAACTTGTTCGTGATGAGAAACGACCAACGGATTGCAAGGAATCGAGCTCCAAGAGTTGTCGTTTGTTTGGTCAATGATTTCGTTAACAATTTAACTCATCCCCGTAAGTGTAACCCTCCCAATTTCAGATCCCAGAGGCCCTCCGAAGGGGTGGTCGCTCCTAGGGCGACGCACAGGCGGATCATAGGAATCTCACGGTGCAACCTAGGGGTGGAGACCGTAGGATATGTTGACACGTGTCCTTCTCCCACTTACTATGAACATTTTCCCCATTCACCTTGTACGAAATTCAATTGTCTAAGAAACAGTATTCCAAACGCCCCTCAAGAAATTGAGGACATGAGAAATATTCCCTATGCCTCGACGGTAGGCAACCAAGCCTGGCATATGAAGGCCGACATGTCAAATATACCTTATGCTAGTGCAGTTGGGAGTCTTATGTACCTTATGGTATGCACCCGTCCAGACTTAGCTCATGCAATGAGTGTAGTGAGTAGATTCATGTCCAATCTGGACTCAGATATCCTAGAAGGATATGTAGATGCTGATTATGCAGCAGATTGTGACAGGAGGAGGTCATTGTCTGGGTATGTATTTACTTACCTTGGAAATCTGGTGAGTTGGAGAACTACTCTACAGTTAGTTGTGGCTCTTTCCACAACAGAATCTGAGTACATAGCAGCAACTGATGCCATAAAAGAAGGTGGAGATTGTTATTCAGTGGTAATACACATGGAGGTCAAGCTTCAGTCAAATCCCAATAGACCTTCTGTATTTGATGACATGGAATCTTAG

Coding sequence (CDS)

ATGGAAATAGTTGTGAATGCTCTGAGGTCTCGTGATTTAGAAGTGAAAAAGAACAAGAGGAAATCAGAAGGAAAGAATAGTGAAGATGGAAACAATGTCAATGTCACTGAAGGCTACGATTCTGCAGAGGTACTAGTTGTGACTGAGGGAGATGTAGATTCTAAATGGATCCTTGACTCATGGTGCTCTTTCCATATGAAACCAAACCGTCATTGGTTTCAGGGCTTTGAACCAATGGAGGAAGGAAAGGTGCTCTTAGGCAACCCCCATGAGTGCAATGTAAGACGGATCAATTCAATTCAGGTGAAGATGTTTGATAATCAAACAAGGATCATCCCCAGGGTGAGGTACGTACCAGAACTGAAACGTAGCCTGCTGTTTTTGGGTACTTTTGATAAGGCAGGCTATGTTTGTAAGCTTGAGAATGAAGGAGCAATGGTAAAGTTGCGAGGAAAGCTTGCAAATGGATTATATATTTTAGAGGGTTCAACCATCATTGGAACAGCAGCAATAGCTTCTCTTAATGAACAACAAACTACTACCTTATGGCATAGGAGACTGGGCCATGTTAGTGAAAAAGGACTCATGGAACTCCATAAACAGGGCTTGTTGGGAAGTGAAAACTTGGGAACACTTGGTCTCTGCAAACATTGTGTTTATGGAGTCCCTCATCAGCAATATGATTTAAAACTCCTATGGAAATGTGGACTGAAAAATCCACCAGACCTTACACACTTGAGGATCTCCCGGTGTGTTGCCTATGCACATATGAAAGAGGGAAAACTGGATAACAGGGCTGAGAAATGCATTCTGTTGGGATATTCTCATGGTATAAAAGGGTACAGATTGTGGGCAGCCAGGCCTAACATATGGAGAATAGTGGGGAAGATCTTGATGGTTTTCTACAACTTGTTCGTGATGAGAAACGACCAACGGATTGCAAGGAATCGAGCTCCAAGAGTTGTCGTTTGTTTGGTCAATGATTTCGTTAACAATTTAACTCATCCCCGTAAGTGTAACCCTCCCAATTTCAGATCCCAGAGGCCCTCCGAAGGGGTGGTCGCTCCTAGGGCGACGCACAGGCGGATCATAGGAATCTCACGGTGCAACCTAGGGGTGGAGACCGTAGGATATGTTGACACGTGTCCTTCTCCCACTTACTATGAACATTTTCCCCATTCACCTTGTACGAAATTCAATTGTCTAAGAAACAGTATTCCAAACGCCCCTCAAGAAATTGAGGACATGAGAAATATTCCCTATGCCTCGACGGTAGGCAACCAAGCCTGGCATATGAAGGCCGACATGTCAAATATACCTTATGCTAGTGCAGTTGGGAGTCTTATGTACCTTATGGTATGCACCCGTCCAGACTTAGCTCATGCAATGAGTGTAGTGAGTAGATTCATGTCCAATCTGGACTCAGATATCCTAGAAGGATATGTAGATGCTGATTATGCAGCAGATTGTGACAGGAGGAGGTCATTGTCTGGGTATGTATTTACTTACCTTGGAAATCTGGTGAGTTGGAGAACTACTCTACAGTTAGTTGTGGCTCTTTCCACAACAGAATCTGAGTACATAGCAGCAACTGATGCCATAAAAGAAGGTGGAGATTGTTATTCAGTGGTAATACACATGGAGGTCAAGCTTCAGTCAAATCCCAATAGACCTTCTGTATTTGATGACATGGAATCTTAG

Protein sequence

MEIVVNALRSRDLEVKKNKRKSEGKNSEDGNNVNVTEGYDSAEVLVVTEGDVDSKWILDSWCSFHMKPNRHWFQGFEPMEEGKVLLGNPHECNVRRINSIQVKMFDNQTRIIPRVRYVPELKRSLLFLGTFDKAGYVCKLENEGAMVKLRGKLANGLYILEGSTIIGTAAIASLNEQQTTTLWHRRLGHVSEKGLMELHKQGLLGSENLGTLGLCKHCVYGVPHQQYDLKLLWKCGLKNPPDLTHLRISRCVAYAHMKEGKLDNRAEKCILLGYSHGIKGYRLWAARPNIWRIVGKILMVFYNLFVMRNDQRIARNRAPRVVVCLVNDFVNNLTHPRKCNPPNFRSQRPSEGVVAPRATHRRIIGISRCNLGVETVGYVDTCPSPTYYEHFPHSPCTKFNCLRNSIPNAPQEIEDMRNIPYASTVGNQAWHMKADMSNIPYASAVGSLMYLMVCTRPDLAHAMSVVSRFMSNLDSDILEGYVDADYAADCDRRRSLSGYVFTYLGNLVSWRTTLQLVVALSTTESEYIAATDAIKEGGDCYSVVIHMEVKLQSNPNRPSVFDDMES
Homology
BLAST of Lag0042148 vs. NCBI nr
Match: KAG8478826.1 (hypothetical protein CXB51_028794 [Gossypium anomalum])

HSP 1 Score: 268.5 bits (685), Expect = 1.3e-67
Identity = 191/597 (31.99%), Postives = 267/597 (44.72%), Query Frame = 0

Query: 14  EVKKNKRKSEGKNSEDGNNVNVTEGYDSAEVLV--VTEGDVDSKWILDSWCSFHMKPNRH 73
           ++K+      GK  E+    +V E Y   E+LV  V    V  +WILDS C+FHM  NR 
Sbjct: 179 KIKREAANQNGKKLENSGKTDVVEDYSDGELLVTSVNNSKVSEEWILDSSCTFHMNLNRD 238

Query: 74  WFQGFEPMEEGKVLLGNPHECNVRRINSIQVKMFDNQTRIIPRVRYVPELKRSLLFLGTF 133
           WF  +E + E  VL+GN   C +  +  I+VKMFD   + +  VR+VPELKR L+ L T 
Sbjct: 239 WFTTYETVSECVVLMGNNTSCKIATVGIIKVKMFDGVVKTLSDVRHVPELKRKLISLSTL 298

Query: 134 DKAGYVCKLENEGAMVKLRGKLANGLYILEGSTIIGTAAI--ASLNEQQTTTLWHRRLGH 193
           D  GY    E+                   GSTI G A +  +SL++   T LWH  LGH
Sbjct: 299 DSKGYRYTAES------------------GGSTITGEAVVVSSSLSDDDITKLWHMHLGH 358

Query: 194 VSEKGLMELHKQGLLGSENLGTLGLCKHCVYG----VPHQQYDLKLL---------WKCG 253
           +SE  + EL K+GLL  + +  L  CKHCV+G    +   + D + L         W+C 
Sbjct: 359 MSENDMAELSKRGLLDGQGICKLNFCKHCVFGKQKRIVQVRRDCETLDSSPYSTAKWRCR 418

Query: 254 LK----------------------------------------------------NPPDLT 313
                                                                 NP + +
Sbjct: 419 TNEQNDHGEGSMYVVKCQLTKVILGRSSLYCMSFDQLSPSVAIEKKTPQKVWSGNPANYS 478

Query: 314 HLRISRCVAYAHMKEGKLDNRAEKCILLGYSHGIKGYRLWAARPNIWRIVGKILMVFYNL 373
            L+I  C AYAH+  GKL+ R+ KC+ L Y  G+KGY+LW   P   ++V    +VF   
Sbjct: 479 DLKIFGCPAYAHVDNGKLEPRSIKCVFLSYKAGVKGYKLWC--PENRKVVISRDVVFDET 538

Query: 374 FVMRN------DQRIARNRAPRVVVCLVNDFVNNLTHPRKCNPPNFRSQRPSEGVVAPRA 433
           + + N         IA+NR  R            +  P+K    N         +VA   
Sbjct: 539 YKIENKVASLPQYSIAKNRTRR-----------EIKPPKKYAEAN---------LVA--- 598

Query: 434 THRRIIGISRCNLGVETVGYVDTCPSPTYYEHFPHSPCTKFNCLRNSIPNAPQEIEDMRN 493
                         +     +D    P+ Y      P      +  S+           +
Sbjct: 599 ------------YALNVAEDIDANQEPSNYSEASAKP------VNTSL---------AAH 658

Query: 494 IPYASTVGNQAWHMKADMSNIPYASAVGSLMYLMVCTRPDLAHAMSVVSRFMSNLDSDIL 536
              +S +  Q+      M ++PY+SAVGSLMY+MVC+RPDL++A+S   R       D +
Sbjct: 659 FKLSSALSTQSNDEIEYMLHVPYSSAVGSLMYVMVCSRPDLSYAVSAFRR-----TRDGI 700

BLAST of Lag0042148 vs. NCBI nr
Match: CAD6269918.1 (unnamed protein product [Miscanthus lutarioriparius])

HSP 1 Score: 254.2 bits (648), Expect = 2.6e-63
Identity = 202/637 (31.71%), Postives = 293/637 (46.00%), Query Frame = 0

Query: 41  SAEVLVVTEGDVDSK--WILDSWCSFHMKPNRHWFQGFEPMEEGKVL-LGNPHECNVRRI 100
           + +VLVV  G V  +  WIL S CSFH+  N+ WF  ++P++ G V+ +G+ +   +  I
Sbjct: 261 NGDVLVVFAGCVAGRDEWILYSACSFHICSNKDWFSSYKPVQSGDVMRMGDNNPREIVGI 320

Query: 101 NSIQVKMFDNQTRIIPRVRYVPELKRSLLFLGTFDKAGY-------VCKLENEGAMVKLR 160
            S+Q+K+ D   R +  VR++P + R+L+ L T D  GY       VCK+ ++G+++ + 
Sbjct: 321 GSVQIKIHDGMIRTLKDVRHIPGMARNLISLSTLDAKGYRHSGSGGVCKV-SKGSLIHMI 380

Query: 161 GKLANG-LYILEGSTIIG--TAAIASLNEQQTTTLWHRRLGHVSEKGLMELHKQGLLGSE 220
           G + +  LY+L GST+ G  TA   S +E   T LWH RLGH+SE G+ EL K+ LL   
Sbjct: 381 GDMNSAMLYVLGGSTLHGSVTATTVSNDEPNKTNLWHMRLGHMSELGMTELIKRDLLDGC 440

Query: 221 NLGTLGLCKHCVYG-----------------VPHQQYDL--------------------- 280
            +G +   +HC++G                 + +   DL                     
Sbjct: 441 TVGRMKFYEHCIFGKHKTVKFNASVHTTKGTLDYVHADLWGPSRKTSYGGVRYILTIIDD 500

Query: 281 --KLLWKCGLKNPPDL----------------THLRISRCVAYAHMKEGKLDNRAEKCIL 340
             + +W   LKN  D                   LR+  C+AYAH+  GKL+ RA KC+ 
Sbjct: 501 YSRKVWPYFLKNKDDTFATFKEWKITIERQTEKKLRVFGCIAYAHVDNGKLEPRAIKCLF 560

Query: 341 LGYSHGIKGYRLWAARPNIWRIVGKILMVFYNLFVMRNDQRIARNRAPRVVVCLVNDFVN 400
           LGY  G KGY+LW           K    F +  V+ N+  +  +     +  + +D   
Sbjct: 561 LGYGSGCKGYKLWNP---------KNKKTFMSRSVLFNESIMFNDSLSTDISLVGSDEEQ 620

Query: 401 NLTHPRKCNPPNFR-SQRPSEGVVAPRATHRRIIGISRCNLGVETVGYVDTCPSPTYYEH 460
               P    P N   + R ++G   PR    R+  I  C++    V Y  +C      E+
Sbjct: 621 EHHSPPVLQPRNQSIADRRTKGNCGPRP---RL--IEECDI----VHYAFSCACAEQVEN 680

Query: 461 FPHSPCTKFN------------------------------CLRNSIPNAPQEIE------ 520
             H P T                                 CL N    + +EI       
Sbjct: 681 I-HEPATYTEAVVFGDREKWIFAMQEEMQSLEKNGTWDVVCLPNH-KKSKKEITTLKKLL 740

Query: 521 ----DMRNIPYASTV-GNQAWHMKAD---MSNIPYASAVGSLMYLMVCTRPDLAHAMSVV 537
               +M+++  A  + G Q   M  D   MS +PY+SAVGSL+Y+MVC+RPDL++AMS+V
Sbjct: 741 SSEFEMKDLGAAKKILGLQCASMDEDFEYMSRVPYSSAVGSLIYVMVCSRPDLSYAMSLV 800

BLAST of Lag0042148 vs. NCBI nr
Match: KAG8474542.1 (hypothetical protein CXB51_031315 [Gossypium anomalum])

HSP 1 Score: 253.1 bits (645), Expect = 5.8e-63
Identity = 199/655 (30.38%), Postives = 291/655 (44.43%), Query Frame = 0

Query: 14  EVKKNKRKSEGKNSEDGNNVNVTEGYDSAEVLV--VTEGDVDSKWILDSWCSFHMKPNRH 73
           ++K+     + K  E+    +V E Y   E+LV  + +  V  +WILDS C+FHM  NRH
Sbjct: 214 KIKREAANQKVKQPENFGEADVIEDYSDGELLVASINDSKVSKEWILDSGCTFHMSLNRH 273

Query: 74  WFQGFEPMEEGKVLLGNPHECNVRRINSIQVKMFDNQTRIIPRVRYVPELKRSLLFLGTF 133
           WF  +E + EG VL+GN   C +  +  I+VKMF+   RI+  VR VPELKR+L+ L T 
Sbjct: 274 WFTTYETVSEGVVLMGNNVSCKIAGVGKIKVKMFNGVVRILSDVRDVPELKRNLISLSTL 333

Query: 134 DKAGYVCKLENE------GAMVKLRGKLANG-LYILEGSTIIGTAAIA--SLNEQQTTTL 193
           D  GY    E+E      G++V ++G+     LY+L+GST+ G AA+A   L++   T L
Sbjct: 334 DSKGYRYTAESEVLKIFKGSLVVMKGQRKTAKLYVLQGSTVTGDAAVAFSFLSDDDITKL 393

Query: 194 WHRRLGHVSEKGLMELHKQGLLGSENLGTLGLCKHCVYGVPHQ--------------QYD 253
           W  RLGH+SE G++EL K+GLL  + +  L  C+HCV+G   +              +Y 
Sbjct: 394 WQMRLGHMSENGMVELSKRGLLDGQGICKLNFCEHCVFGKQKRVRFTRGIHNMKETLEYI 453

Query: 254 LKLLWKCGLKNPPD-------LTHL-----RISRCVAYAHMKEGKLDNRAEKCILLGYSH 313
              LW  G    P        LT +     +I RC+AYAH+  GKL++R+  C+ LGY  
Sbjct: 454 HSDLW--GPFRVPSRGGANYMLTFIDDFSRKIFRCLAYAHVDNGKLESRSITCVFLGYKA 513

Query: 314 GIKGYRLWAARPNIWRIVGKILMVFYNLFVMRN---------------DQRIARNRAPRV 373
            +KGY+LW   P   ++V    +VF    ++ N               + +I     P+ 
Sbjct: 514 SVKGYKLWC--PENRKVVISRDVVFDETVMLPNLSLKDSSNKENQKQVEHQINTELTPQA 573

Query: 374 VVCLVNDFVNNLTHP-------RKCNPP---------------------NFRSQRPSEGV 433
              + N   ++  +        R+  PP                     N      SE +
Sbjct: 574 STKIKNRVASSTQYSIAKNRTRREIKPPKKYVEANLIAYVLNVAEDIDANRELSNYSEAI 633

Query: 434 VAP-------------RATHR-RIIGISRCNLGVETVGY---------VDTCPSPTY--- 493
                            + H+ R     +   G + V Y               P Y   
Sbjct: 634 SCEDSEKWMFAMQEEMESLHKNRTWDFVKLPKGKKVVRYKWVFKKKEGTPRVEEPRYKAR 693

Query: 494 -----YEHFP-------HSPCTKFNCLRNSIPNAPQEIEDMRNIPYASTVGNQ-AWHMKA 537
                Y   P        SP  K++ +R  +        ++  +  A  V    A H + 
Sbjct: 694 LISKGYSQIPGVDFTDVFSPVVKYSWIRALLGIVVMHYLELEQLDSAKPVSTPLAAHFRL 753

BLAST of Lag0042148 vs. NCBI nr
Match: KAG8501848.1 (hypothetical protein CXB51_004653 [Gossypium anomalum])

HSP 1 Score: 249.2 bits (635), Expect = 8.4e-62
Identity = 216/777 (27.80%), Postives = 303/777 (39.00%), Query Frame = 0

Query: 17  KNKRKSEGKNSEDGNNVNVTEGYDSAEVLV--VTEGDVDSKWILDSWCSFHMKPNRHWFQ 76
           K+K +SE  N ED         Y   E+LV  V +  V  +WILDS C+FHM PNR WF 
Sbjct: 224 KSKGRSESSNRED---------YSDGELLVASVNDSKVSEEWILDSGCTFHMSPNRDWFT 283

Query: 77  GFEPMEEGKVLLGNPHECNVRRINSIQVKMFDNQTRIIPRVRYVPELKRSLLFLGTFDKA 136
            +E + EG VL+GN   C +  + +I+VKMFD   R +  VRYVPELKR+L+ L T D  
Sbjct: 284 TYETVSEGVVLMGNNASCKIAGLGTIKVKMFDGVVRTLSDVRYVPELKRNLISLSTLDSK 343

Query: 137 GYVCKLENEGAMVKLRGKLANGLYILEGSTIIGTAAIA--SLNEQQTTTLWHRRLGHVSE 196
           GY    E+                   GST+ G AA+A  SL++   T LWH RLGH+SE
Sbjct: 344 GYRYTAES------------------GGSTVTGDAAVASSSLSDDDITKLWHMRLGHMSE 403

Query: 197 KGLMELHKQGLLGSENLGTLGLCKHCVYGVPHQQ------------YDLKLLWK------ 256
            G++EL K+GLL  + +  L  C+HCV+G   ++            +     WK      
Sbjct: 404 NGMVELSKRGLLDGQGICKLNFCEHCVFGKQKRKSWAFFLKQKSDVFSAFKSWKIMIEKQ 463

Query: 257 ----------------------------------------------------------CG 316
                                                                     C 
Sbjct: 464 TGKQIKYLRTDNCLEFCSDEFNRLCKSEGIVRHLTVRHTPQQNGVAERMNRTIMEKVRCM 523

Query: 317 LK---------------------------------------NPPDLTHLRISRCVAYAHM 376
           L                                        NP + + L+I  C AYAH+
Sbjct: 524 LSNANLPKSFWAEAASTACFLINRSPSVAIEKKTPQEVWSGNPANYSDLKIFGCPAYAHV 583

Query: 377 KEGKLDNRAEKCILLGYSHGIKGYRLWAARPNIWRIVGKILMVFYNLFVMRN-------- 436
             GKL+ R+ KC+ LGY  G+KGY+LW   P   ++V    +VF    ++ N        
Sbjct: 584 SNGKLEPRSIKCVFLGYKAGVKGYKLWC--PENRKVVISRDVVFDETAMLPNLSLKDSSN 643

Query: 437 -------DQRIARNRAPRVVVCLVNDFVNNLTH-------PRKCNPPN------------ 496
                  + +I     P+V   + N   ++  +        R+  PP             
Sbjct: 644 KENQKQVEHQINTESTPQVSTKIENRVASSPQYSIAKNRTKREIKPPKKYAEADLIAYAL 703

Query: 497 -------------------------------------FRSQRPSEGVVAPR--------- 537
                                                F+ +  + GV  P+         
Sbjct: 704 NVAEDIDANQEPSNILRRLAVKTQKNGCKKTVRCKWVFKKKEGTPGVEEPKYKARLVAKG 763

BLAST of Lag0042148 vs. NCBI nr
Match: KAG8472304.1 (hypothetical protein CXB51_034358 [Gossypium anomalum])

HSP 1 Score: 248.8 bits (634), Expect = 1.1e-61
Identity = 228/845 (26.98%), Postives = 322/845 (38.11%), Query Frame = 0

Query: 14   EVKKNKRKSEGKNSEDGNNVNVTEGYDSAEVLV--VTEGDVDSKWILDSWCSFHMKPNRH 73
            ++K      +GK  E+    +V E Y   E+LV  V +  V  +WILDS C+FHM PNR 
Sbjct: 234  KIKGEAANQKGKQPENSGEADVVEDYSDGELLVASVNDSKVSEEWILDSGCTFHMSPNRD 293

Query: 74   WFQGFEPMEEGKVLLGNPHECNVRRINSIQVKMFDNQTRIIPRVRYVPELKRSLLFLGTF 133
            WF  +E + EG VL+GN   C +  + +I+VKMFD   R +  VRYVP+LKR+L+ L T 
Sbjct: 294  WFTTYETVSEGVVLMGNNASCKIAGVGTIKVKMFDGVVRTLSDVRYVPKLKRNLISLSTL 353

Query: 134  DKAGYVCKLE------NEGAMVKLRGKLANG-LYILEGSTIIGTAAIA--SLNEQQTTTL 193
            D  GY    E      ++G++V ++G+     LY+L+GST+ G AA+A  SL++   T L
Sbjct: 354  DSKGYRYTAESGVLKISKGSLVVMKGQRKTAKLYVLQGSTVTGDAAVASSSLSDDDITKL 413

Query: 194  WHRRLGHVSEKGLMELHKQGLLGSENLGTLGLCKHCVYG--------------------- 253
            WH RLGH+SE G++EL K+GLL  + +  L  C+HCV+G                     
Sbjct: 414  WHMRLGHMSENGMVELSKRGLLDGQGICKLNFCEHCVFGKQKRVRFTRGIHNTKETLEYI 473

Query: 254  ---------VPHQ---QYDLKLL------------------------WK----------- 313
                     VP +    Y L  +                        WK           
Sbjct: 474  HSDLWGPSRVPSRGGANYMLTFIDDFSRKVWAFFLKQKSDVFSAFKSWKIMIEKQTGKQI 533

Query: 314  -----------------------------------------------------CGLK--- 373
                                                                 C L    
Sbjct: 534  KYLRTDNGLEFCSDEFNRLCKSEGIVRHLTVRHTPQQNGVAERMNRTIMEKVRCMLSNAN 593

Query: 374  ------------------------------------NPPDLTHLRISRCVAYAHMKEGKL 433
                                                NP + + L+I  C AYAH+  GKL
Sbjct: 594  LPKSFWAEAASTACFLINRSPSVAIEKKTPQEVWSGNPANYSDLKIFGCPAYAHVNNGKL 653

Query: 434  DNRAEKCILLGYSHGIKGYRLWAARPNIWRIVGKILMVFYNLFVMRN------------- 493
            + R+ KC+ LGY  G+KGY+LW   P   ++V    +VF    ++ N             
Sbjct: 654  EPRSIKCVFLGYKAGVKGYKLWC--PENRKVVISRDVVFDETAMLPNLSLKDCSNKENQK 713

Query: 494  --DQRIARNRAPRVVVCLVNDFVNNLTH-------PRKCNPPN----------------- 537
              + +I     P+V   + N   ++  +        R+  PP                  
Sbjct: 714  QVEHQINTESTPQVSTKIENRVASSPQYSIAKNRTKREIKPPKKYAEADLVAYALNVAED 773

BLAST of Lag0042148 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 120.2 bits (300), Expect = 7.7e-26
Identity = 64/131 (48.85%), Postives = 82/131 (62.60%), Query Frame = 0

Query: 433  KADMSNIPYASAVGSLMYLMVCTRPDLAHAMSVVSRFMSNLDSD---------------- 492
            K +M+ +PY+SAVGSLMY MVCTRPD+AHA+ VVSRF+ N   +                
Sbjct: 1103 KGNMAKVPYSSAVGSLMYAMVCTRPDIAHAVGVVSRFLENPGKEHWEAVKWILRYLRGTT 1162

Query: 493  -----------ILEGYVDADYAADCDRRRSLSGYVFTYLGNLVSWRTTLQLVVALSTTES 537
                       IL+GY DAD A D D R+S +GY+FT+ G  +SW++ LQ  VALSTTE+
Sbjct: 1163 GDCLCFGGSDPILKGYTDADMAGDIDNRKSSTGYLFTFSGGAISWQSKLQKCVALSTTEA 1222


HSP 2 Score: 90.1 bits (222), Expect = 8.6e-17
Identity = 61/222 (27.48%), Postives = 106/222 (47.75%), Query Frame = 0

Query: 16  KKNKRKSEGKNSEDG------NNVNVTEGYDSAEVLVVTEGDVDSKWILDSWCSFHMKPN 75
           +K K ++ G+ ++D       NN NV    +  E  +   G  +S+W++D+  S H  P 
Sbjct: 249 RKGKGETSGQKNDDNTAAMVQNNDNVVLFINEEEECMHLSGP-ESEWVVDTAASHHATPV 308

Query: 76  RHWFQGFEPMEEGKVLLGNPHECNVRRINSIQVKMFDNQTRIIPRVRYVPELKRSLLFLG 135
           R  F  +   + G V +GN     +  I  I +K     T ++  VR+VP+L+ +L+   
Sbjct: 309 RDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTLVLKDVRHVPDLRMNLISGI 368

Query: 136 TFDKAGYVCKLENE------GAMVKLRGKLANGLYILEGSTIIGTAAIASLNEQQTTTLW 195
             D+ GY     N+      G++V  +G     LY        G   + +  ++ +  LW
Sbjct: 369 ALDRDGYESYFANQKWRLTKGSLVIAKGVARGTLYRTNAEICQG--ELNAAQDEISVDLW 428

Query: 196 HRRLGHVSEKGLMELHKQGLLGSENLGTLGLCKHCVYGVPHQ 226
           H+R+GH+SEKGL  L K+ L+      T+  C +C++G  H+
Sbjct: 429 HKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYCLFGKQHR 467

BLAST of Lag0042148 vs. ExPASy Swiss-Prot
Match: P0CV72 (Secreted RxLR effector protein 161 OS=Plasmopara viticola OX=143451 GN=RXLR161 PE=2 SV=1)

HSP 1 Score: 94.4 bits (233), Expect = 4.5e-18
Identity = 56/129 (43.41%), Postives = 73/129 (56.59%), Query Frame = 0

Query: 436 MSNIPYASAVGSLMYLMVCTRPDLAHAMSVVSRFMSN----------------------- 495
           M N+PY SAVG++MYLMV TRPDLA A+ V+S+F S+                       
Sbjct: 1   MKNVPYLSAVGAIMYLMVVTRPDLAAAVGVLSQFASDPCPTHWQALKRVLRYLQSTQTYG 60

Query: 496 -----LDSDILEGYVDADYAADCDRRRSLSGYVFTYLGNLVSWRTTLQLVVALSTTESEY 537
                  +  L GY DAD+A D + RRS SGY+F   G  VSWR+  Q  VALS+TE EY
Sbjct: 61  LEFTRAGTAKLVGYSDADWAGDVESRRSTSGYLFKLNGGCVSWRSKKQRTVALSSTEDEY 120

BLAST of Lag0042148 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 77.0 bits (188), Expect = 7.5e-13
Identity = 47/147 (31.97%), Postives = 77/147 (52.38%), Query Frame = 0

Query: 438  NIPYASAVGSLMYLMVCTRPDLAHAMSVVSRFMSNLDSDILE------------------ 497
            N P  S +G LMY+M+CTRPDL  A++++SR+ S  +S++ +                  
Sbjct: 1178 NTPCRSLIGCLMYIMLCTRPDLTTAVNILSRYSSKNNSELWQNLKRVLRYLKGTIDMKLI 1237

Query: 498  ------------GYVDADYAADCDRRRSLSGYVFTYLG-NLVSWRTTLQLVVALSTTESE 554
                        GYVD+D+A     R+S +GY+F     NL+ W T  Q  VA S+TE+E
Sbjct: 1238 FKKNLAFENKIIGYVDSDWAGSEIDRKSTTGYLFKMFDFNLICWNTKRQNSVAASSTEAE 1297

BLAST of Lag0042148 vs. ExPASy Swiss-Prot
Match: P93293 (Uncharacterized mitochondrial protein AtMg00300 OS=Arabidopsis thaliana OX=3702 GN=AtMg00300 PE=4 SV=1)

HSP 1 Score: 67.0 bits (162), Expect = 7.8e-10
Identity = 37/106 (34.91%), Postives = 57/106 (53.77%), Query Frame = 0

Query: 143 EGAMVKLRGKLANGLYILEGSTIIGTAAIASLNEQQTTTLWHRRLGHVSEKGLMELHKQG 202
           +G    L+G   + LYIL+GS   G + +A   + + T LWH RL H+S++G+  L K+G
Sbjct: 33  KGCRTILKGNRHDSLYILQGSVETGESNLAETAKDE-TRLWHSRLAHMSQRGMELLVKKG 92

Query: 203 LLGSENLGTLGLCKHCVYGVPHQQYDLKLLWKCG---LKNPPDLTH 246
            L S  + +L  C+ C+YG  H     ++ +  G    KNP D  H
Sbjct: 93  FLDSSKVSSLKFCEDCIYGKTH-----RVNFSTGQHTTKNPLDYVH 132

BLAST of Lag0042148 vs. ExPASy Swiss-Prot
Match: P25600 (Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY5A PE=5 SV=2)

HSP 1 Score: 47.8 bits (112), Expect = 4.9e-04
Identity = 36/126 (28.57%), Postives = 57/126 (45.24%), Query Frame = 0

Query: 440 PYASAVGSLMYLMVCTRPDLAHAMSVVSRFMSNLDSDILEG------------------- 499
           PY S VG L++     RPD+++ +S++SRF+    +  LE                    
Sbjct: 182 PYQSIVGQLLFCANTGRPDISYPVSLLSRFLREPRAIHLESARRVLRYLYTTRSMCLKYR 241

Query: 500 ---------YVDADYAADCDRRRSLSGYVFTYLGNLVSWRT-TLQLVVALSTTESEYIAA 537
                    Y DA + A  D   S  GYV    G  V+W +  L+ V+ + +TE+EYI A
Sbjct: 242 SGSQLALTVYCDASHGAIHDLPHSTGGYVTLLAGAPVTWSSKKLKGVIPVPSTEAEYITA 301

BLAST of Lag0042148 vs. ExPASy TrEMBL
Match: A0A438FFY5 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_1826 PE=4 SV=1)

HSP 1 Score: 246.9 bits (629), Expect = 2.0e-61
Identity = 202/676 (29.88%), Postives = 291/676 (43.05%), Query Frame = 0

Query: 25   KNSEDGNNVNVTEGYDSAEVLVVTEGDVDSKWILDSWCSFHMKPNRHWFQGFEPMEEGKV 84
            K   +G+   + +GYDSAEVL V E D   +WILDS CSFHM P + WF+ F+  + G V
Sbjct: 433  KTVNEGDAAVILDGYDSAEVLNVAEVDSSKEWILDSGCSFHMCPIKAWFEDFKEADGGYV 492

Query: 85   LLGNPHECNVRRINSIQVKMFDNQTRIIPRVRYVPELKRSLLFLGTFDKAGYVCKLE--- 144
            LLGN     +    ++++K +D   R++  VRY+PELKR+L+ LG  DK+GY  K E   
Sbjct: 493  LLGNNKHYKILGTGTVRIKHYDGIERVLEDVRYIPELKRNLISLGMLDKSGYTFKSEPNS 552

Query: 145  ---NEGAMVKLRGKLANGLYILEGSTIIGTAAIASLNEQQTTTLWHRRLGHVSEKGLMEL 204
                 G++  ++G + NGLY L G T+    +     +  TT LWH+RLGH+S +GL EL
Sbjct: 553  LRVARGSLTVMKGTIKNGLYTLIGQTVTRKVSTVLKEDVGTTKLWHQRLGHISHRGLQEL 612

Query: 205  HKQGLLGSENLGTLGLCKHCVYG----------VPHQQYDLKL----LW----------- 264
             KQ +LG+  L  L  C+HCV+G          +   Q  L      LW           
Sbjct: 613  KKQRVLGNYKLTDLPFCEHCVFGKATRVKFAKAIHETQNQLDYIHSDLWGPSRVPSIGGA 672

Query: 265  -------------KCGLKNP--------PDLTHLRISRCVAYAHMKEGKLDNRAEKCILL 324
                            ++NP         D  HL++  C AY H K  KL+ RA KCI L
Sbjct: 673  RNSCPFDQQEPIISITIQNPSRKWTGKAADYQHLKVFGCTAYVHTKTDKLEPRAVKCIFL 732

Query: 325  GYSHGIKGYRLWAARPNIWRIVGKILMVFYNLFVMR------------------------ 384
            GY  G+KGY+LW       + +    + F    + +                        
Sbjct: 733  GYPKGVKGYKLWIETQGKGKCIISRDVTFNEQDMSKQTPAKDVEGSDQLQFEVEHETSQP 792

Query: 385  ----------------NDQRIARNRAPRVVVC------LVNDFV---------------- 444
                            +++++ + +    ++C      +   FV                
Sbjct: 793  EKSKETSSKTAQKEIVHERQMNQLKGWNPIICQTQLNPIAVGFVAHEDLELDQLDVKTAF 852

Query: 445  --NNLTHPRKCNPPNFRSQRPSEGVV------------APRATHRRI------IGISRC- 504
                L       PP    +   +G V            +PR  ++R       IG +R  
Sbjct: 853  LHGELDELIYMQPPEGFEEGIKDGQVCLLKKSLYGLKQSPRQWYKRFDKYMLDIGFNRSS 912

Query: 505  -------NLGVETVG----YVD----TCPSPTYYEHFPHSPCTKFNC--LRNSIPNAPQE 537
                   NL  +++     YVD     C    + E        +F    L ++      E
Sbjct: 913  HDGCVYFNLTEDSMVYLLLYVDDMLVACKEKRHLEQVKEMLKAEFEMKDLGSAKRILGME 972

BLAST of Lag0042148 vs. ExPASy TrEMBL
Match: A0A438CQ40 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_917 PE=4 SV=1)

HSP 1 Score: 242.7 bits (618), Expect = 3.8e-60
Identity = 175/566 (30.92%), Postives = 272/566 (48.06%), Query Frame = 0

Query: 25  KNSEDGNNVNVTEGYDSAEVLVVTEGDVDSKWILDSWCSFHMKPNRHWFQGFEPMEEGKV 84
           K    G+   + +GYD+ EVL + E D   +WILDS CSFHM P + WF+ F+  + G V
Sbjct: 92  KTVNQGDAAVILDGYDNVEVLNMAEVDSGKEWILDSGCSFHMCPIKAWFEDFKEADGGDV 151

Query: 85  LLGNPHECNVRRINSIQVKMFDNQTRIIPRVRYVPELKRSLLFLGTFDKAGYVCKLE--- 144
           LLGN   C +    ++++K +D+  R++  VRY+PELKR L+ LG  DK+ Y  KLE   
Sbjct: 152 LLGNNKHCKILGTGTVRIKHYDSIERVLEDVRYIPELKRKLISLGMLDKSRYTFKLEPNS 211

Query: 145 ---NEGAMVKLRGKLANGLYILEGSTIIGTAAIASLNEQQTTTLWHRRLGHVSEKGLMEL 204
                G++  ++  +  GLY L G T+    +     +  TT LWH+RLGH+S + L EL
Sbjct: 212 LRVARGSLTVMKETIKIGLYTLIGQTMTSKVSTVLKEDMGTTMLWHQRLGHISHRRLQEL 271

Query: 205 HKQGLLGSENLGTLGLCKHCVYGVPHQQYDLKLLWKCGLKNPPDLTHLRI---SRCVAYA 264
            KQG+LG+  L  L  C+HCV+G   +    K++ +   +N  +  H  +   SR     
Sbjct: 272 EKQGVLGNYKLTNLTFCEHCVFGKTIRVKFAKVVHE--TQNQLEYIHSDLWGPSRFKKKL 331

Query: 265 HMKEGKLDNRAEKCIL---LGYSHGIKGYRLWAARPNIWRIVGKILMVFYNLFVMRNDQR 324
           +MK+     +    I+    G+  GIK               G++ ++  +L+ ++   R
Sbjct: 332 YMKDKMNQLKGWNPIVWQETGFGEGIKD--------------GQVCLLKKSLYGLKQSPR 391

Query: 325 IARNRAPRVVVCLVNDFVNNLTHPRKCNPPNFRSQRPSEGVVAPRATHRRIIGISRCNLG 384
           +   R  +    +++   N  +H               +G V  + T   ++ +      
Sbjct: 392 LWYKRFDKY---MLDIGFNRSSH---------------DGCVYFKLTEDNMVYL------ 451

Query: 385 VETVGYVD----TCPSPTYYEHFPHSPCTKFNC--LRNSIPNAPQEIEDMRN-----IPY 444
              + YVD     C    + E        +F    L ++      EIE  R+     +  
Sbjct: 452 ---LLYVDDMLVACKEKRHLEQVKEMLKAEFEMKDLGSAKRILGMEIERDRSKRVLRLSQ 511

Query: 445 ASTVGNQAWHMKADM-SNIPYASAVGSLMYLMVCTRPDLAHAMSVVSRFMSNL------- 504
            S +      MK  +     YAS VGS+MY MVC+RPDLA+A+S++ R+MS L       
Sbjct: 512 KSYISKHLRLMKRKVYGKDTYASMVGSVMYTMVCSRPDLAYAVSMIRRYMSCLGKPHWQA 571

Query: 505 --------------------DSDI---LEGYVDADYAADCDRRRSLSGYVFTYLGNLVSW 537
                               +S +   L+G+VDADYA + D R+SL+GYVF   G  +SW
Sbjct: 572 IKWLFQYLAGTRSLGLVYGGNSQLGTQLQGFVDADYARNIDTRKSLTGYVFIVFGRAMSW 614

BLAST of Lag0042148 vs. ExPASy TrEMBL
Match: A0A7H4LGA5 (Genome assembly, chromosome: II OS=Triticum aestivum OX=4565 GN=CAMPLR22A2D_LOCUS2253 PE=4 SV=1)

HSP 1 Score: 239.2 bits (609), Expect = 4.2e-59
Identity = 211/744 (28.36%), Postives = 317/744 (42.61%), Query Frame = 0

Query: 17  KNKRKSEG-KNSEDGNNVNVTEGYDSAEVLVVTEGDVDSKWILDSWCSFHMKPNRHWFQG 76
           +NK K +G K SE+  NV   +  D A V++    + + +W+LD+ C+FHM P+R  F  
Sbjct: 242 QNKEKRKGNKQSENFANVARDDSSDDALVVIAGCAETNDEWVLDTACTFHMCPHRDCFNT 301

Query: 77  FEPMEE-GKVLLGNPHECNVRRINSIQVKMFDNQTRIIPRVRYVPELKRSLLFLGTFDKA 136
           F+     G VL  +   C +  I S+++KMFD   + +  VRY+P++KR+L+ + + D  
Sbjct: 302 FDSTTSVGSVLGFDNSPCKIEGIGSVRIKMFDGTIKTLTDVRYIPKMKRNLISVSSLDAM 361

Query: 137 GY-------VCKLENEGAMVKLRGKLA--NGLYILEGSTIIG--TAAIASLNEQQTTTLW 196
           GY       V K+  + +++ ++G L+  NGLY L GST+ G  T  I+  ++     LW
Sbjct: 362 GYQYSGGDSVLKV-TKRSLIVMKGDLSSTNGLYYLRGSTVSGNATPVISKNSDCDAANLW 421

Query: 197 HRRLGHVSEKGLMELHKQGLLGSENLGTLGLCKHCVYG-----------------VPHQQ 256
           H RLGH+SE GL EL+K+GL      G L  C+HC++G                 + +  
Sbjct: 422 HMRLGHMSELGLAELNKRGLRDGFEPGKLKFCEHCIFGKHKRVKFNTSTHTIEGILDYAH 481

Query: 257 YDL-----------------------KLLWKCGLKNPPDL----------------THLR 316
            DL                       + +W   LK+  +                   LR
Sbjct: 482 SDLWGPFRRKSLGGASYMLTIIDDYSRKVWPYFLKHKYEAFSAFKEWKIMIERQTEKKLR 541

Query: 317 ISRCVAYAHMKEGKLDNRAEKCILLGYSHGIKGYRLWAARPNIWRIVGKILMVF------ 376
           +  C  YAH+  GKL+ RA KCI LGY  G+KG++LW   P    +V    ++F      
Sbjct: 542 VFGCTTYAHVDNGKLEPRAVKCIFLGYKSGVKGFKLW--NPETQMVVISRNVIFNESAML 601

Query: 377 -----YNLFVMRNDQRIARNRAPRVVVCL------------------VNDFVNNLTHPRK 436
                 N+ V    Q I +    +V   +                  VND     T  + 
Sbjct: 602 HDVSSTNVPVESEQQPIVQEPTVQVEHVIDSGDTSGNEIVDAHDEPVVNDDNVTPTLNQP 661

Query: 437 CNPP--NFRSQRPSEGVVAP-RATHRRIIGISRCNLGVETVGYVDTCPSPTYYEHFPHSP 496
             PP  N    R   G+  P R      I     ++  E  G V+T    +Y E    S 
Sbjct: 662 IVPPSWNLARDRVRRGINKPDRLIEECNIVSFALSVAEEIEGNVET---SSYSEAIISSD 721

Query: 497 CTKFNCLRN------------SIPNAPQEIEDMR-------------------------- 556
             K+    +             +   P+E + +R                          
Sbjct: 722 SIKWMTAMHDEMKSLEKNGTWDLVRLPREKKPIRCKWVFKRKEGVSPNDETRYKASIVAM 781

Query: 557 -------------------------NIPYASTVGNQAWHMKAD---MSNIPYASAVGSLM 567
                                      P  S +  +   + AD   MS +PY+SAVGSLM
Sbjct: 782 HDFELEQLDVRTAFLHGELEEDIYIEQPEGSVIPGKEKLVYADIEYMSRVPYSSAVGSLM 841

BLAST of Lag0042148 vs. ExPASy TrEMBL
Match: A0A5A7VKC2 (Retrotransposon protein, putative, Ty1-copia subclass OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold174G00310 PE=4 SV=1)

HSP 1 Score: 234.2 bits (596), Expect = 1.4e-57
Identity = 217/788 (27.54%), Postives = 323/788 (40.99%), Query Frame = 0

Query: 16   KKNKRKSEGKNSEDGNN-VNVTEGYDSAEVLVVTEGDVDSKWILDSWCSFHMKPNRHWFQ 75
            K  +  +   N  DG N   +T+GY+SAEVL+V+  D+   WI+DS C+FHM P+R +  
Sbjct: 236  KSREASTSEANVTDGYNFAEITDGYESAEVLMVSHRDIQDAWIMDSGCTFHMTPHRDFLT 295

Query: 76   GFEPMEEGKVLLGNPHECNVRRINSIQVKMFDNQTRIIPRVRYVPELKRSLLFLGTFDKA 135
             F+  + GKVLLG+   C+V+R  S+Q+   D   RI+  VRYVP+LKR+L+ LG  D++
Sbjct: 296  NFQKGDGGKVLLGDNGICDVKRTGSVQIATHDEMVRILTNVRYVPKLKRNLISLGELDRS 355

Query: 136  GYVCKLEN------EGAMVKLRGKLANGLYILEGSTIIGTAAIASLNEQQTTTLWHRRLG 195
            G   K EN      +G++VKLRG L +GLY+LEG+T+ G+ AIAS        LWH RL 
Sbjct: 356  GCTIKSENGVMKVTKGSLVKLRGTLRHGLYVLEGTTVSGSTAIASGKVTYMFMLWHNRLA 415

Query: 196  HVSEKGLMELHKQGLLGSENLGTLGLCKHCVYGV-------------------------- 255
            HVSE+GL  L +QGLL       L  C+HC+ G                           
Sbjct: 416  HVSERGLQALSQQGLLEGVKNVELSFCEHCIMGKCTRVQFGKGKHTTKGILDYVHSDLWG 475

Query: 256  PHQQ----------------------YDLK---------LLWKCGLK----NPPDLTHLR 315
            P ++                      Y LK         L WK  ++      P L HL+
Sbjct: 476  PTKEAFMGGSRYFISIIDDFSRKVWIYPLKQKDEAFGKFLEWKKQVEKQTGKAPSLDHLK 535

Query: 316  ISRCVAYAHMKEGKLDNRAEKCILLGYSHGIKGYR--------------LWAARPNIW-- 375
            +  C  YAH+K+GKL+ RA KC+ +GY  G+K  R                  RP++   
Sbjct: 536  VFGCTTYAHVKDGKLNKRALKCMFIGYPQGVKKERKQQTSDHVVTKVRIASEGRPSVGLY 595

Query: 376  ------RIVGKI-----------------LMVFYNLFVMRN-------DQRIARNRAPR- 435
                   +V KI                 +++    F+  +       + ++ R+RA R 
Sbjct: 596  AFSDQPPLVSKIEDTQQSEFDGIQSQQERILIDEGAFIEESSSNNDLQNYQLTRDRAQRE 655

Query: 436  ---------------VVVC---------------LVND---------------FVNNLTH 495
                            + C               +V+D                  N T 
Sbjct: 656  RHAPIRYGYADLVAYALTCAADGIEEKPLTFEKAIVSDSKKRWKDVMEVELFSLHKNQTW 715

Query: 496  ---PRKCN--------------------PPNFRSQRPSEG-----------VVAPRATHR 537
               P+  N                     P ++++  ++G           V +P   H 
Sbjct: 716  SLVPKPLNQKLIQSKWIYKIKPGTWGNSKPRYKARLVAKGYTQKEGVDFHEVFSPVVRHS 775

BLAST of Lag0042148 vs. ExPASy TrEMBL
Match: A0A5A7TIS5 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold110G00990 PE=4 SV=1)

HSP 1 Score: 233.4 bits (594), Expect = 2.3e-57
Identity = 182/608 (29.93%), Postives = 265/608 (43.59%), Query Frame = 0

Query: 38  GYDSAEVLVVTEGDVDSKWILDSWCSFHMKPNRHWFQGFEPMEEGKVLLGNPHECNVRRI 97
           GY+SAEVL+V+  D+   WI+DS C+FHM P R +   F+  + GKVLLG+   CNV+  
Sbjct: 127 GYESAEVLMVSYRDIQDAWIMDSGCTFHMTPYRDFLTNFQNGDRGKVLLGDNVTCNVKGT 186

Query: 98  NSIQVKMFDNQTRIIPRVRYVPELKRSLLFLGTFDKAGYVCKLEN------EGAMVKLRG 157
            S+Q+   D   RI+    YVP+LKR+L+ L   D++G   K EN      +G++VKLR 
Sbjct: 187 GSVQIATHDGIVRILTNATYVPKLKRNLISLSELDRSGCTIKSENGVMKVTKGSLVKLRE 246

Query: 158 KLANGLYILEGSTIIGTAAIASLNEQQTTTLWHRRLGHVSEKGLMELHKQGLLGSEN--- 217
              +GLY+LEG+TI G+ AIAS      + LWH+RL H  +       K   L ++N   
Sbjct: 247 TWRHGLYVLEGTTISGSVAIASAKVIDMSMLWHKRLAHKKQVENQTSRKVKYLRTDNGLE 306

Query: 218 ---------LGTLGLCKHCVYGVPHQQYDLKLLWKCGL--KNPPDLTHLRISRCVAYAHM 277
                      + G+ +H       QQ  L   +   +  +    L HLR+  C  YAH+
Sbjct: 307 FVNNKFNNFCKSEGITRHFTVTYTPQQNSLAERFNRTIMERKALSLDHLRVFGCTTYAHV 366

Query: 278 KEGKLDNRAEKCILLGYSHGIKGYRLWAARPNIWRIVGKILMVFYNLFVMRNDQ--RIAR 337
           K+GKL+ RA KC+ +GY   IKGY+LW            + M      + R+ Q  RI  
Sbjct: 367 KDGKLNKRALKCMFIGYPQCIKGYKLWC-----------LEMGMNKCIISRDSQQERILI 426

Query: 338 NRAPRVVVCLVNDFVNNLTHPRKCNPPNFRSQRPSEGVVAPRATHRRIIGISRCNLGVET 397
           +    +     N+ + N  +   C+        P     A    +   +  +  N+  E 
Sbjct: 427 DEGAFIEESSSNNDLQN--YQLTCDKAQRERHAPIRYGYADLVAY--ALTCAADNIKAEP 486

Query: 398 VGYVDTCPSPTYYEHFPHSPCTKFNCLRNSI----PNAPQEIEDMRNIPY---ASTVGNQ 457
           + + +   S +  +         F+  +N I    P    +      + Y    ST GN 
Sbjct: 487 LTFEEAIVSDSKKQWKDAMEAEFFSLHKNQIWSLVPKPLNQKLIQSKLIYKIKPSTGGNS 546

Query: 458 AWHMKADMSNIPYASAVG-----------------------SLMYL-------------M 517
               KA +    Y    G                        ++Y+             M
Sbjct: 547 KPRYKARLVAKGYTQKEGVDFHEIFSPVMDVTITFLHGELEEVIYMAQPKGYKVKCKEDM 606

Query: 518 VCTR--------------PDLAHAMSVVSRFMSNL------------------------- 537
           VC                PDL +AM ++SRFMSN                          
Sbjct: 607 VCRLYKSLYGLKQSPRQWPDLGYAMGMISRFMSNPGKQHWKAVKWVLRYLKGSASVSLCY 666

BLAST of Lag0042148 vs. TAIR 10
Match: ATMG00300.1 (Gag-Pol-related retrotransposon family protein )

HSP 1 Score: 67.0 bits (162), Expect = 5.5e-11
Identity = 37/106 (34.91%), Postives = 57/106 (53.77%), Query Frame = 0

Query: 143 EGAMVKLRGKLANGLYILEGSTIIGTAAIASLNEQQTTTLWHRRLGHVSEKGLMELHKQG 202
           +G    L+G   + LYIL+GS   G + +A   + + T LWH RL H+S++G+  L K+G
Sbjct: 33  KGCRTILKGNRHDSLYILQGSVETGESNLAETAKDE-TRLWHSRLAHMSQRGMELLVKKG 92

Query: 203 LLGSENLGTLGLCKHCVYGVPHQQYDLKLLWKCG---LKNPPDLTH 246
            L S  + +L  C+ C+YG  H     ++ +  G    KNP D  H
Sbjct: 93  FLDSSKVSSLKFCEDCIYGKTH-----RVNFSTGQHTTKNPLDYVH 132

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAG8478826.11.3e-6731.99hypothetical protein CXB51_028794 [Gossypium anomalum][more]
CAD6269918.12.6e-6331.71unnamed protein product [Miscanthus lutarioriparius][more]
KAG8474542.15.8e-6330.38hypothetical protein CXB51_031315 [Gossypium anomalum][more]
KAG8501848.18.4e-6227.80hypothetical protein CXB51_004653 [Gossypium anomalum][more]
KAG8472304.11.1e-6126.98hypothetical protein CXB51_034358 [Gossypium anomalum][more]
Match NameE-valueIdentityDescription
P109787.7e-2648.85Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
P0CV724.5e-1843.41Secreted RxLR effector protein 161 OS=Plasmopara viticola OX=143451 GN=RXLR161 P... [more]
P041467.5e-1331.97Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3[more]
P932937.8e-1034.91Uncharacterized mitochondrial protein AtMg00300 OS=Arabidopsis thaliana OX=3702 ... [more]
P256004.9e-0428.57Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain AT... [more]
Match NameE-valueIdentityDescription
A0A438FFY52.0e-6129.88Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX... [more]
A0A438CQ403.8e-6030.92Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX... [more]
A0A7H4LGA54.2e-5928.36Genome assembly, chromosome: II OS=Triticum aestivum OX=4565 GN=CAMPLR22A2D_LOCU... [more]
A0A5A7VKC21.4e-5727.54Retrotransposon protein, putative, Ty1-copia subclass OS=Cucumis melo var. makuw... [more]
A0A5A7TIS52.3e-5729.93Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var.... [more]
Match NameE-valueIdentityDescription
ATMG00300.15.5e-1134.91Gag-Pol-related retrotransposon family protein [more]
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR025724GAG-pre-integrase domainPFAMPF13976gag_pre-integrscoord: 156..221
e-value: 1.9E-12
score: 46.8
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 15..35
NoneNo IPR availableCDDcd09272RNase_HI_RT_Ty1coord: 479..536
e-value: 1.43917E-26
score: 103.317

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lag0042148.1Lag0042148.1mRNA