Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCGTTCAAGACTTCTTCAATGGTCGCTGTCATGAACAAGTCTTACATGGGTTCTACTGCCCATTGTTGCTTCAATAAACTGGCGTTGCAAGAAGATAAAGCTTCTATCGTTGCAGGCCAAGAAACAACCTTGCAAGGGGCATATACTAATGACAAGTTTATTGTTAAGTATAACCCTTTGTTTGAACATGATTCTGATGTAGTGACTGTCATGATGACTGGGACTAGAACTATGGAAGAAAGAATGGTTGAGATGCAGGAGCACATCGACACCTTGATGAAGGCGATTGAAGAAAAAGATTCTCAAATTGCGCAACTGAAGTGCCAAATTGAGAACCAACATATCGCCGAATCAAATCAAACCCAAGTCATAAAAAATCATGACAAAGGAAAGACTATAGTGCAAGATGATCAGCCACAGTGTTCTACTTCGATCGCTTCACTATCCATCCAACAGCTCCAAGATATGATCACAAACTGTATCAGAGCTCAGTACGGTGGACCTACTCAAGATTCCCTCTTGTATTCCAAACCTTATACTAAGAGGATTGATAACTTGAGAATTCCAATCGGGTATCAGCCACCGAAATTTCAGCAGTTTGATGGAAAAGGCAATCCTAAACAACATATTGCCCACTTCGTTGAGACATGCGAGAACGCTGGTACTCGAGGGGACCTACTAGTCAAACAGTTCGTTCGAACACTTAAAGGAAATGCTTTTGACTGGTACACTGATCTAGAACCTGAGTCAGTAGACAGTTGGGAGGAACTCGAAAGAGAGTTTTTGAATCGCTTCTACAGCACTAGACGAACCGTTAGCATGTTCGAGCTCACAAACACTAAACAACGAAAAGGTGAACTCGTTGTTAACTATATAAATCGCTGGAGAGCTATGAGTCTAGATTGCAAAGATCGCCTCACTGAACTCTCTTCCGTTGAGATGTGCATTCAAGGCATGCACTGGGAACTCCTCTACATCCTTAAAGGTATAAAGCCTCGCACCTTTGAGGAACTAGCAACTCGTGCCCACGATATGGAGCTAAGTATTGCTAGTCGAGAAAACCAAGACCTTCTCCTCCCTAATATGAGAAAAGAAGGAAGGAACGACGAAGAGACTATAGAAGAATCTATGGTTGTCAACACAACCCTTCCCAAGTCGTCTTCGAAAGGAAAGCGACAAACAAATGGAGCGCATCACTTAACTTTAAAGGAAAGACAGAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCAACTGATAGAGCTTCCTAAGTGTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAGGATGTTTCGTCCTAAAGGACTTAATTTTAAAGCTGGCTAAGGAAGGCAAAATTGAGCTTGACCTTGATGAAGTAGCCCAATCAAATCTTGCTACAATCAAAGGAAAGAGCAAACATCAAAGAAAGAAGGATCCTAAGAAACTTCAACCCAAGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCAACAACTGGTGATGTTGAATAAATCGTTCTCCAAAAATTTCCACAAAAAGGAAAAAAAGAACCTTGCAACTTCCTACTGCATCGACGTAGAAGAAGTTGACAATTCCAAGAAGAGTGAACAAAGGACTTCCGTCTTTGATCGCATCAAGCCTCCAACTACTCGTCCTTCGGTATTCCAAAGAATGAGTATGGCTGCGACAGAGGAAGAAAATCAATGTTTGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGACCTTCAACATCTGTTTTTGATCGCCTCAAAGTAACAAGCGATCAACCTAAAAGAAAGATGGATAAATTGGAGGTAAAACCTTTCGATGAAGTAAACAGCGACAAGAAGCTTCAAAGTAGCATCCCGTCACGTATGAAGAGGAAGTTATCTGTTCTCATAAATACAGAAGGTTCCTTGAAGGTGAAACCAAATCTCATTATCTTGACCAATCCTACAAGTCAAGGATCTGATCAAGACCATGATGAAGATAAGGGCTTTTAAAATGTAAAAGCTCCTTATCGCAAGAGCCTAAACTGCATGATGCTCCTAGCCCACACGAGCTTAAAAGGTGAGAGCCAAAAAAAAAAAAAAAAAAATCTTGAACTACGTTATGACTTGATCCCTATTCCATAAGGGTACGTAGGCAGCTTAAAGAAAATTTTAAGTTCAGTCCCTATAAACAAAAAAAAAAAAGGTTCTTCGCTGCAGTTCCTTCTCTCCAAGTTCGAGGGTCCTTACACTGTACGCTATTGCGTTGTTCCTTCTCCAAGTTCGAAGGTTCTTCGTTGTATCCTGCTGCGTTGTTCCTTCTCCAAGTTCGAGGGTTCTCAGTTGTACAACTGCTACGTTGTTCCTCCTCCAAGTGCGAATGATCTTATGTGGTGCGTTGTTGCATTGTTCCCTCTTCTCTCAAGTTCGATGGTTCTCACGCAGCTTTGCTGGAGTTTCTTCTCCCCAAGTTCGAAGGTTTTCACGCCCTCCGTTGTAGTTCCTTCTTTCCAAGGTCGAAGGTTCTCACTCGCTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCACACACTTCGCTCCAGTTCCTTCTCCCAAATTCGAAGGTTCTCATGCGCTTCGCGCTGCAGTTCCTTCCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCTTTCCCCCAAGTTCGAAGGTTCTCAGGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCACTTCGCTGCATTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCGGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGCACTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCAAACGCTTCGCTGTCGTTCCTTCCTTACAGTTCGTAGGTTCTCACACGCTTCGTTGCAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCGCTCACGCGTTTCACTGCAGTTCCTTCCTCCAATTTTGAAGGTTCTCACATCGCTTCGCTTCGCTCACGCGCTTCGCTGCGATCCTTCCCCCAAGTTCAAAGGTTCTCACGCACTCCGTTGCAGTTCCTTCTTTCCAAGGTCGAAGGTTCTCACTTGCTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCTCACGGGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATTGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCATCGCCATAGTTCCTTCCTCCGCGCATCGCCATAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCACACGCTTCGCCACAGTTCCTTCCTCCCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCGCCACAGTTCCTTCCTCCAAGTTCAAAGGTTCTCTCGCCACAGTTCCTTCCCTCCAAGTTCGAAGGTTCTCTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTTCGCCACAGTTCCTTCATCCAAGTTCGAAGGTTCTCTCACGCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCATCGCCATAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCCACAGTTCACGCGCACTTCCTCCAAGTTCGAAGGATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTGTTCTCACGTGCATCGCGCGACAGTTCCTTCCTCCAAGTTCGCGCTTCGCCACAGTTCACGCGCACTTCCTCCAAGTTCGAAGGATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTGTTTTCCTCCAAGTTCGAAGGTTGTTCTCACGTGCATCGCGCGACAGTTCCTTCCTCCAAGTTCGAAGGTTGTTCTCACGTGCATCGCCTCCTCCAAGTTTGAAAGGTTTCTAGCTCCTAGGCTTCAAAGGCTCTCATGAGAAATATGAATCTTAAAATTTAAGATTAACAATTTTGGGACTCGAACCCATGACCTTTAAATATGCAGCTTGGCTATTAAACCAATAGACCATTCAAGTCTTTTGATTAACATTTGCTTGTTAATATATACATATATGCTATTCGGTCAACTCATATTATTCTCAATCCAAATTTTGTCAAACCCAAAATCAACCAAAGTCCCAAACTCAAATTCTATTCCAAATGCAAGTTAAAATTGGATAGAATTTTACCTCAAGCCAAAAGTTTGGCATAATCCAAATTTTATCCCAAATGAACCAAAGTCAAAACCTCAAATTCTATTCCAAATGCAAGTTAAAATTGGGTAGGATTTTACCTCAAATCAAAATTTGGGTATAAACCAAAATTTGCCCAAACTTCAAATCAAAGTTTGCTGCCAAATAAGAAAAAAAAATCATCAAAATTTTGACCAAACTTCAAATTTTGATTAATTGGCGATAAAAGTGTGTCAAATAAAAAAATCAAAATTTTGACCAAAATTCATATTTTGATTAAATTGACACACCAAAGGGAGTCAAAATCAAGAACAAATATCTTGCATTTTGATCCCAAAATATCTTTTTCGAGATTCACTCACACATAACTTGTGACTAACTCTCGAAAAGGGGGCATTTGTTGAAGGAGAAATTTTGGCCAATGACAAATTGCCACGTCATTATTTAAAATTAATGACAAAATTATAATTAATTAAATTTGGGACCAATTAGAAATTGACATGTGTCCCAAATTTAATTTGAAGACAAAGTTGGCCAATTATAAATTTTTAGGGTCAAATGTCCATTTGATCAATTAATTAAAAATAAATTATTTTGATCAGATTAATTGATTTGACCAATTAATTAAAGAAAATAATTAATTGGATCAAATTTGGGTCTTTTAAATATTTGGGTCATATGGTTTTGGGTGAAATATAAACCAGACCCAAGACCAATAAAGCCCAAGCCCAAGTTGTTAGGCCCAAAAGTCACCAGGGCCCATCCAGCGAGAACTCTATAAATAGAGGGGTTCTCCATCATTTCAAGGGTTCAGAAATTCTACACTCTCACAAAGACAAGAGTTCAGAGTTTCAAAGCTCTCAAGCAGAACCAAAGAATTCAGAGAGACTCCACCAAGTCTGAAGACCGAAAACTCTCTGCAATCCATAAGTTCAAGTGTTGAACACTTCTTGAAGACCAAACACTCTTCAAGACTTCAACACTCCTTGAAGATCAAAGACTCTTCAGGACATCAACATTTCTTGAAGACCCAACACTCTTCAAGACTTCAACACTCCTTGAAGATCAAAGACTCTTCAGGACATCAACATTTCTTGAAGACCCAACACTCTTCAAGACTTCAACACTCCTTGAAGATCAAAGACTCTTCAGGACATCAACACTTCTTGAAGGCTGAAAACTCCTTCAAGACTAGAAGACTTCAAGCTCCAAGAATCCATTGA
mRNA sequence
ATGTCGTTCAAGACTTCTTCAATGGTCGCTGTCATGAACAAGTCTTACATGGGTTCTACTGCCCATTGTTGCTTCAATAAACTGGCGTTGCAAGAAGATAAAGCTTCTATCGTTGCAGGCCAAGAAACAACCTTGCAAGGGGCATATACTAATGACAAGTTTATTGTTAAGTATAACCCTTTGTTTGAACATGATTCTGATGTAGTGACTGTCATGATGACTGGGACTAGAACTATGGAAGAAAGAATGGTTGAGATGCAGGAGCACATCGACACCTTGATGAAGGCGATTGAAGAAAAAGATTCTCAAATTGCGCAACTGAAGTGCCAAATTGAGAACCAACATATCGCCGAATCAAATCAAACCCAAGTCATAAAAAATCATGACAAAGGAAAGACTATAGTGCAAGATGATCAGCCACAGTGTTCTACTTCGATCGCTTCACTATCCATCCAACAGCTCCAAGATATGATCACAAACTGTATCAGAGCTCAGTACGGTGGACCTACTCAAGATTCCCTCTTGTATTCCAAACCTTATACTAAGAGGATTGATAACTTGAGAATTCCAATCGGGTATCAGCCACCGAAATTTCAGCAGTTTGATGGAAAAGGCAATCCTAAACAACATATTGCCCACTTCGTTGAGACATGCGAGAACGCTGGTACTCGAGGGGACCTACTAGTCAAACAGTTCGTTCGAACACTTAAAGGAAATGCTTTTGACTGGTACACTGATCTAGAACCTGAGTCAGTAGACAGTTGGGAGGAACTCGAAAGAGAGTTTTTGAATCGCTTCTACAGCACTAGACGAACCGTTAGCATGTTCGAGCTCACAAACACTAAACAACGAAAAGGTGAACTCGTTGTTAACTATATAAATCGCTGGAGAGCTATGAGTCTAGATTGCAAAGATCGCCTCACTGAACTCTCTTCCGTTGAGATGTGCATTCAAGGCATGCACTGGGAACTCCTCTACATCCTTAAAGGTATAAAGCCTCGCACCTTTGAGGAACTAGCAACTCGTGCCCACGATATGGAGCTAAGTATTGCTAGTCGAGAAAACCAAGACCTTCTCCTCCCTAATATGAGAAAAGAAGGAAGGAACGACGAAGAGACTATAGAAGAATCTATGGTTGTCAACACAACCCTTCCCAAGTCGTCTTCGAAAGGAAAGCGACAAACAAATGGAGCGCATCACTTAACTTTAAAGGAAAGACAGAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCAACTGATAGAGCTTCCTAAGTGTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAGGATGTTTCGTCCTAAAGGACTTAATTTTAAAGCTGGCTAAGGAAGGCAAAATTGAGCTTGACCTTGATGAAGTAGCCCAATCAAATCTTGCTACAATCAAAGGAAAGAGCAAACATCAAAGAAAGAAGGATCCTAAGAAACTTCAACCCAAGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCAACAACTGGTGATGTTGAATAAATCGTTCTCCAAAAATTTCCACAAAAAGGAAAAAAAGAACCTTGCAACTTCCTACTGCATCGACGTAGAAGAAGTTGACAATTCCAAGAAGAGTGAACAAAGGACTTCCGTCTTTGATCGCATCAAGCCTCCAACTACTCGTCCTTCGGTATTCCAAAGAATGAGTATGGCTGCGACAGAGGAAGAAAATCAATGTTTGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGACCTTCAACATCTGTTTTTGATCGCCTCAAAGTAACAAGCGATCAACCTAAAAGAAAGATGGATAAATTGGAGGTAAAACCTTTCGATGAAGTAAACAGCGACAAGAAGCTTCAAAGTAGCATCCCGTCACGTATGAAGAGGAAGTTATCTGTTCTCATAAATACAGAAGGTTCCTTGAAGTTCCTTCTCTCCAAGTTCGAGGGTCCTTACACTGTACGCTATTGCGTTGTTCCTTCTCCAAGTTCGAAGGTTCTTCGTTGTATCCTGCTGCGTTGTTCCTTCTCCAAGTTCGAGGGTTCTCAGTTGTACAACTGCTACGTTGTTCCTCCTCCAAGTGCGAATGATCTTATGTGGTGCGTTGTTGCATTGTTCCCTCTTCTCTCAAGTTCGATGGTTCTCACGCAGCTTTGCTGGAGTTTCTTCTCCCCAAGTTCGAAGGTTTTCACGCCCTCCGTTGTAGTTCCTTCTTTCCAAGGTCGAAGGTTCTCACTCGCTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCACACACTTCGCTCCAGTTCCTTCTCCCAAATTCGAAGGTTCTCATGCGCTTCGCGCTGCAGTTCCTTCCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCTTTCCCCCAAGTTCGAAGGTTCTCAGGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCACTTCGCTGCATTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCGGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGCACTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCAAACGCTTCGCTGTCGTTCCTTCCTTACAGTTCGTAGGTTCTCACACGCTTCGTTGCAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCGCTCACGCGTTTCACTGCAGTTCCTTCCTCCAATTTTGAAGGTTCTCACATCGCTTCGCTTCGCTCACGCGCTTCGCTGCGATCCTTCCCCCAAGTTCAAAGGTTCTCACGGGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATTGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCATCGCCATAGTTCCTTCCTCCGCGCATCGCCATAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCACACGCTTCGCCACAGTTCCTTCCTCCCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCGCCACAGTTCCTTCCTCCAAGTTCAAAGGTTCTCTCGCCACAGTTCCTTCCCTCCAAGTTCGAAGGTTCTCTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTTCGCCACAGTTCCTTCATCCAAGTTCGAAGGTTCTCTCACGCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCATCGCCATAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCCACAGTTCACGCGCACTTCCTCCAAGTTCGAAGGATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTGTTCTCACGTGCATCGCGCGACAGTTCCTTCCTCCAAGTTCGCGCTTCGCCACAGTTCACGCGCACTTCCTCCAAGTTCGAAGGATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTGTTTTCCTCCAAGTTCGAAGGTTGTTCTCACGTGCATCGCGCGACAGTTCCTTCCTCCAAGTTCGAAGGTTGTTCTCACGTGCATCGCCTCCTCCAAGTTTGAAAGAAATTCTACACTCTCACAAAGACAAGAGTTCAGAGTTTCAAAGCTCTCAAGCAGAACCAAAGAATTCAGAGAGACTCCACCAAGTCTGAAGACCGAAAACTCTCTGCAATCCATAAGTTCAAGTGTTGAACACTTCTTGAAGACCAAACACTCTTCAAGACTTCAACACTCCTTGAAGATCAAAGACTCTTCAGGACATCAACATTTCTTGAAGACCCAACACTCTTCAAGACTTCAACACTCCTTGAAGATCAAAGACTCTTCAGGACATCAACATTTCTTGAAGACCCAACACTCTTCAAGACTTCAACACTCCTTGAAGATCAAAGACTCTTCAGGACATCAACACTTCTTGAAGGCTGAAAACTCCTTCAAGACTAGAAGACTTCAAGCTCCAAGAATCCATTGA
Coding sequence (CDS)
ATGTCGTTCAAGACTTCTTCAATGGTCGCTGTCATGAACAAGTCTTACATGGGTTCTACTGCCCATTGTTGCTTCAATAAACTGGCGTTGCAAGAAGATAAAGCTTCTATCGTTGCAGGCCAAGAAACAACCTTGCAAGGGGCATATACTAATGACAAGTTTATTGTTAAGTATAACCCTTTGTTTGAACATGATTCTGATGTAGTGACTGTCATGATGACTGGGACTAGAACTATGGAAGAAAGAATGGTTGAGATGCAGGAGCACATCGACACCTTGATGAAGGCGATTGAAGAAAAAGATTCTCAAATTGCGCAACTGAAGTGCCAAATTGAGAACCAACATATCGCCGAATCAAATCAAACCCAAGTCATAAAAAATCATGACAAAGGAAAGACTATAGTGCAAGATGATCAGCCACAGTGTTCTACTTCGATCGCTTCACTATCCATCCAACAGCTCCAAGATATGATCACAAACTGTATCAGAGCTCAGTACGGTGGACCTACTCAAGATTCCCTCTTGTATTCCAAACCTTATACTAAGAGGATTGATAACTTGAGAATTCCAATCGGGTATCAGCCACCGAAATTTCAGCAGTTTGATGGAAAAGGCAATCCTAAACAACATATTGCCCACTTCGTTGAGACATGCGAGAACGCTGGTACTCGAGGGGACCTACTAGTCAAACAGTTCGTTCGAACACTTAAAGGAAATGCTTTTGACTGGTACACTGATCTAGAACCTGAGTCAGTAGACAGTTGGGAGGAACTCGAAAGAGAGTTTTTGAATCGCTTCTACAGCACTAGACGAACCGTTAGCATGTTCGAGCTCACAAACACTAAACAACGAAAAGGTGAACTCGTTGTTAACTATATAAATCGCTGGAGAGCTATGAGTCTAGATTGCAAAGATCGCCTCACTGAACTCTCTTCCGTTGAGATGTGCATTCAAGGCATGCACTGGGAACTCCTCTACATCCTTAAAGGTATAAAGCCTCGCACCTTTGAGGAACTAGCAACTCGTGCCCACGATATGGAGCTAAGTATTGCTAGTCGAGAAAACCAAGACCTTCTCCTCCCTAATATGAGAAAAGAAGGAAGGAACGACGAAGAGACTATAGAAGAATCTATGGTTGTCAACACAACCCTTCCCAAGTCGTCTTCGAAAGGAAAGCGACAAACAAATGGAGCGCATCACTTAACTTTAAAGGAAAGACAGAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCAACTGATAGAGCTTCCTAAGTGTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAGGATGTTTCGTCCTAAAGGACTTAATTTTAAAGCTGGCTAAGGAAGGCAAAATTGAGCTTGACCTTGATGAAGTAGCCCAATCAAATCTTGCTACAATCAAAGGAAAGAGCAAACATCAAAGAAAGAAGGATCCTAAGAAACTTCAACCCAAGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCAACAACTGGTGATGTTGAATAAATCGTTCTCCAAAAATTTCCACAAAAAGGAAAAAAAGAACCTTGCAACTTCCTACTGCATCGACGTAGAAGAAGTTGACAATTCCAAGAAGAGTGAACAAAGGACTTCCGTCTTTGATCGCATCAAGCCTCCAACTACTCGTCCTTCGGTATTCCAAAGAATGAGTATGGCTGCGACAGAGGAAGAAAATCAATGTTTGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGACCTTCAACATCTGTTTTTGATCGCCTCAAAGTAACAAGCGATCAACCTAAAAGAAAGATGGATAAATTGGAGGTAAAACCTTTCGATGAAGTAAACAGCGACAAGAAGCTTCAAAGTAGCATCCCGTCACGTATGAAGAGGAAGTTATCTGTTCTCATAAATACAGAAGGTTCCTTGAAGTTCCTTCTCTCCAAGTTCGAGGGTCCTTACACTGTACGCTATTGCGTTGTTCCTTCTCCAAGTTCGAAGGTTCTTCGTTGTATCCTGCTGCGTTGTTCCTTCTCCAAGTTCGAGGGTTCTCAGTTGTACAACTGCTACGTTGTTCCTCCTCCAAGTGCGAATGATCTTATGTGGTGCGTTGTTGCATTGTTCCCTCTTCTCTCAAGTTCGATGGTTCTCACGCAGCTTTGCTGGAGTTTCTTCTCCCCAAGTTCGAAGGTTTTCACGCCCTCCGTTGTAGTTCCTTCTTTCCAAGGTCGAAGGTTCTCACTCGCTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCACACACTTCGCTCCAGTTCCTTCTCCCAAATTCGAAGGTTCTCATGCGCTTCGCGCTGCAGTTCCTTCCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCTTTCCCCCAAGTTCGAAGGTTCTCAGGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCACTTCGCTGCATTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCGGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGCACTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCAAACGCTTCGCTGTCGTTCCTTCCTTACAGTTCGTAGGTTCTCACACGCTTCGTTGCAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCGCTCACGCGTTTCACTGCAGTTCCTTCCTCCAATTTTGAAGGTTCTCACATCGCTTCGCTTCGCTCACGCGCTTCGCTGCGATCCTTCCCCCAAGTTCAAAGGTTCTCACGGGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATTGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCATCGCCATAGTTCCTTCCTCCGCGCATCGCCATAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCACACGCTTCGCCACAGTTCCTTCCTCCCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCGCCACAGTTCCTTCCTCCAAGTTCAAAGGTTCTCTCGCCACAGTTCCTTCCCTCCAAGTTCGAAGGTTCTCTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTTCGCCACAGTTCCTTCATCCAAGTTCGAAGGTTCTCTCACGCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCATCGCCATAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCCACAGTTCACGCGCACTTCCTCCAAGTTCGAAGGATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTGTTCTCACGTGCATCGCGCGACAGTTCCTTCCTCCAAGTTCGCGCTTCGCCACAGTTCACGCGCACTTCCTCCAAGTTCGAAGGATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTGTTTTCCTCCAAGTTCGAAGGTTGTTCTCACGTGCATCGCGCGACAGTTCCTTCCTCCAAGTTCGAAGGTTGTTCTCACGTGCATCGCCTCCTCCAAGTTTGAAAGAAATTCTACACTCTCACAAAGACAAGAGTTCAGAGTTTCAAAGCTCTCAAGCAGAACCAAAGAATTCAGAGAGACTCCACCAAGTCTGAAGACCGAAAACTCTCTGCAATCCATAAGTTCAAGTGTTGAACACTTCTTGAAGACCAAACACTCTTCAAGACTTCAACACTCCTTGAAGATCAAAGACTCTTCAGGACATCAACATTTCTTGAAGACCCAACACTCTTCAAGACTTCAACACTCCTTGAAGATCAAAGACTCTTCAGGACATCAACATTTCTTGAAGACCCAACACTCTTCAAGACTTCAACACTCCTTGAAGATCAAAGACTCTTCAGGACATCAACACTTCTTGAAGGCTGAAAACTCCTTCAAGACTAGAAGACTTCAAGCTCCAAGAATCCATTGA
Protein sequence
MSFKTSSMVAVMNKSYMGSTAHCCFNKLALQEDKASIVAGQETTLQGAYTNDKFIVKYNPLFEHDSDVVTVMMTGTRTMEERMVEMQEHIDTLMKAIEEKDSQIAQLKCQIENQHIAESNQTQVIKNHDKGKTIVQDDQPQCSTSIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRIPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEETIEESMVVNTTLPKSSSKGKRQTNGAHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVEGCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKNFHKKEKKNLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCLMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSDQPKRKMDKLEVKPFDEVNSDKKLQSSIPSRMKRKLSVLINTEGSLKFLLSKFEGPYTVRYCVVPSPSSKVLRCILLRCSFSKFEGSQLYNCYVVPPPSANDLMWCVVALFPLLSSSMVLTQLCWSFFSPSSKVFTPSVVVPSFQGRRFSLAALQFFLPKFEGSHTSLQFLLPNSKVLMRFALQFLPSKFEGSHIASLRSFLQVRRFSRASLCNSFPQVRRFSGASCSSFLQIRRFSRTSLHFLPPKFEGSHALRCGSFPPSSKVLTHFAAVPSSKFEGSQTLRCRSFLTVRRFSHASLQFLPPKFEGSHVASLRSRVSLQFLPPILKVLTSLRFAHALRCDPSPKFKGSHGHRHSSFLQVRRFSRASPQFLPPSSKVLTRIATVPSSKFEGSLTRIAIVPSSAHRHSSFLQVRRFSHTLRHSSFLPRASPQFLPPSSKVLSPQFLPPSSKVLSPQFLPPSSKVLSPQFLPPSSKVLSPQFLPSKFEGSLATVPSSKFEGSLATVPSSKFEGSLATVPSSKFEGSLTRFATVPSSKFEGSLTRFATVPSSKFEGSLTRFATVPSSKFEGSLTRIAIVPSSKFEGSHALRHSSRALPPSSKDRHSSFLQVRRLFSRASPQFLPPSSKVVLTCIARQFLPPSSRFATVHAHFLQVRRIATVPSSKFEGCFPPSSKVVLTCIARQFLPPSSKVVLTCIASSKFERNSTLSQRQEFRVSKLSSRTKEFRETPPSLKTENSLQSISSSVEHFLKTKHSSRLQHSLKIKDSSGHQHFLKTQHSSRLQHSLKIKDSSGHQHFLKTQHSSRLQHSLKIKDSSGHQHFLKAENSFKTRRLQAPRIH
Homology
BLAST of Lag0019130 vs. NCBI nr
Match:
KAA0056121.1 (ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa])
HSP 1 Score: 711.1 bits (1834), Expect = 2.0e-200
Identity = 396/667 (59.37%), Postives = 488/667 (73.16%), Query Frame = 0
Query: 34 KASIVAGQETTLQGAYTNDKFIVKYNPLFEHDSDVVTVMMTGTRTMEERMVEMQEHIDTL 93
K IV + + Y++ K + P ++++VM+T T E RM E+++ ++ L
Sbjct: 51 KGGIVIKENHAIDEHYSSSKPSSEEMP----HPNIMSVMVTNVDTSENRMAELEKKVNML 110
Query: 94 MKAIEEKDSQIAQLKCQIENQHIAESNQTQVIKNHDKGKTIVQDDQPQCSTSIASLSIQQ 153
MK +EE+D +IA LK IE++ AES+ +KN DKGK ++Q+ QPQ STSIASLS+QQ
Sbjct: 111 MKVVEERDYEIAFLKNHIESRDAAESSHKHTVKNTDKGKAVMQESQPQNSTSIASLSVQQ 170
Query: 154 LQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRIPIGYQPPKFQQFDGKGNPKQHIAH 213
LQ+MI + I+ QYGGP Q LY KPYTKRIDNLR+P GYQPPKFQQFDGKGNPKQH+AH
Sbjct: 171 LQEMIASSIKMQYGGPAQTFSLYFKPYTKRIDNLRMPNGYQPPKFQQFDGKGNPKQHVAH 230
Query: 214 FVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTV 273
F++TCE AGTRGDLLVKQFVRTLKGNA DWY DLEPES+D+WE+LER+FLNRFYSTR V
Sbjct: 231 FIKTCETAGTRGDLLVKQFVRTLKGNACDWYIDLEPESIDNWEQLERDFLNRFYSTRHIV 290
Query: 274 SMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKP 333
SM ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKP
Sbjct: 291 SMMELTNTRQQKGELVIDYINRWRALSLDCKDRLTELSAVEMCTQGMHWGLLYILQGIKP 350
Query: 334 RTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPK 393
RTFEELATRAHDMELSIA+R +D L+P R + ++T I+ESMVV+ T K
Sbjct: 351 RTFEELATRAHDMELSIANRGAKDFLIPKSRSDKNELDDTKKIANSVIKESMVVHATPLK 410
Query: 394 SSSKGK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRP 453
S SK K R+ +G TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRP
Sbjct: 411 SFSKRKETKIERKHDGDEKRQSTLKERQEKVYPFPDSDVADMLEQLLENQLIQLPECKRP 470
Query: 454 EEMEKVDDPKYCKYHRVIGHPVEGCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKS 513
E+ KVDDP YCKYHRVI HPVE CFVLK+LILKLA+E KIELD+DEVAQ+N A I+ S
Sbjct: 471 EQAGKVDDPNYCKYHRVISHPVEKCFVLKELILKLAREKKIELDIDEVAQTNHA-IEMTS 530
Query: 514 KHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKNFHKKEKKNLATSYCI 573
+ KD LQ +R RS P++++ + + + + + N +S
Sbjct: 531 NPIKGKDEDFLQLRRSITLAEFLPRSFLEDDPEEILEVTACHAASIVEVD-NNYGSS--- 590
Query: 574 DVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCLMSTSTRPSAFQRLS 633
+EV+NS + QRTSVFDRIKP TTR SVFQR+S+A EEENQC TR S +RLS
Sbjct: 591 --KEVNNSNEINQRTSVFDRIKPSTTRSSVFQRLSVATKEEENQCPTFIYTRTSTSKRLS 650
Query: 634 VSTSKKSRPSTSVFDRLKVTSDQPKRKMDKLEVKPFDEVNSDKKLQSSIPSRMKRKLSVL 679
+ST KK RPSTS FDRLK+T+DQ +R+M + KPF E N D K+ S +PSRMKRKL V
Sbjct: 651 ISTLKKDRPSTSSFDRLKMTNDQQQREMKSSKAKPFREENDDDKIHSCVPSRMKRKLFVD 706
BLAST of Lag0019130 vs. NCBI nr
Match:
TYK03695.1 (retrotransposon gag protein [Cucumis melo var. makuwa])
HSP 1 Score: 696.8 bits (1797), Expect = 4.0e-196
Identity = 411/791 (51.96%), Postives = 509/791 (64.35%), Query Frame = 0
Query: 15 SYMGS--TAHCCFNKLALQEDKASIVAGQETTLQ-GAYTNDKFIVKYNP-LFEHDS---- 74
SY+G H C ++ ED + Q ++K NP + EH+S
Sbjct: 12 SYIGKRPNTHSCSREIQSFEDMPPFEVAKNIWEQISKPPKGGIVIKENPAVDEHNSLSEC 71
Query: 75 --------DVVTVMMTGTRTMEERMVEMQEHIDTLMKAIEEKDSQIAQLKCQIENQHIAE 134
++++VM+T T E RM E+++ ++ LMK +EE+D +IA LK IE++ AE
Sbjct: 72 SNEEVPQPNIMSVMVTNVDTSENRMAELEKKVNMLMKVVEERDYEIAFLKNHIESRDAAE 131
Query: 135 SNQTQVIKNHDKGKTIVQDDQPQCSTSIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSK 194
S+ +KN DKGK ++Q+ QPQ STSIASLS+QQLQ+MI + I+ QYGGP Q LYSK
Sbjct: 132 SSHKHTVKNTDKGKAVMQESQPQNSTSIASLSVQQLQEMIASSIKTQYGGPAQTFSLYSK 191
Query: 195 PYTKRIDNLRIPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKG 254
PYTKRIDNLR+P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKG
Sbjct: 192 PYTKRIDNLRMPNGYQPPKFQQFDGKGNPKQHVAHFIETCETAGTRGDLLVKQFVRTLKG 251
Query: 255 NAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRA 314
NAFD Y DLEPES+D+WE+LER+FLNRFYSTRR VSM ELTNT+Q+KGELV++YINRWRA
Sbjct: 252 NAFDLYMDLEPESIDNWEQLERDFLNRFYSTRRIVSMMELTNTRQQKGELVIDYINRWRA 311
Query: 315 MSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDL 374
+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI +R +D
Sbjct: 312 LSLDCKDRLTELSAVEMCTQGMHWGLLYILQGIKPRTFEELATRAHDMELSIPNRGAKDF 371
Query: 375 LLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKGK-----RQTNG--AHHLTLK 434
L+P R + +T I+ESMVV+ T KS SK K R+ +G TLK
Sbjct: 372 LIPKSRSDKNELNDTKKIANSVIKESMVVHATPLKSFSKRKETKIERKHDGDEKRQSTLK 431
Query: 435 ERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVEGC 494
ERQ+K+YPF D+D+ DMLEQLLE QLI+LPKCKRP++ EKVDDP YCKYHRVI HPVE C
Sbjct: 432 ERQEKVYPFSDSDVADMLEQLLENQLIQLPKCKRPKQAEKVDDPNYCKYHRVISHPVEKC 491
Query: 495 FVLKDLILKLAKEGKIELDLDEVAQSN--------------------------------- 554
FVLK+LILKLA+E KIEL++DEVAQ+N
Sbjct: 492 FVLKELILKLAREKKIELNIDEVAQTNHVAIEMTSNVPPLTQLDDQRKSLIQFGTSILFQ 551
Query: 555 --LATIKGKSKHQRKKD---------------PKKLQ-----------------PKRKRS 614
+ TI ++K KD P +Q K +R+
Sbjct: 552 QRIVTINSQNKEAHGKDDDEGWITVTRQKGRQPNSIQKESQFHQKYAKGSISHKKKGRRN 611
Query: 615 KK-------------FSQPQQLVMLNKSFSKNF---HKKEKKNLATSYCIDVEEVDNSKK 674
KK F Q ++ + L + ++F H +E + T + + EV+N+
Sbjct: 612 KKMWNPKPIKGKDEDFLQLRRSITLAEFLPRSFLEDHPEEILEVTTCHAASIVEVNNNYG 671
Query: 675 S----------EQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCLMSTSTRPSAFQRLS 683
S QRTSVFDRIKP TTR SVFQR+SMA EEENQC TR S F+RLS
Sbjct: 672 SSKEVNNLNEINQRTSVFDRIKPSTTRSSVFQRLSMATKEEENQCPTFIYTRTSTFKRLS 731
BLAST of Lag0019130 vs. NCBI nr
Match:
KAA0032121.1 (ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa])
HSP 1 Score: 631.3 bits (1627), Expect = 2.0e-176
Identity = 383/739 (51.83%), Postives = 486/739 (65.76%), Query Frame = 0
Query: 4 KTSSMVAVMNKSYMGSTAHCCFNKLALQEDKASIVAGQETTLQGAYTNDK--FIVKYNPL 63
K +S + ++ SY+G K ++QE + V ++ +L+ + K +++ NPL
Sbjct: 5 KAASKSSAVSDSYIGLVTQSHL-KRSMQEQEQGFVL-KKKSLEQLIESPKGGIVIRDNPL 64
Query: 64 FEHDS------------DVVTVMMTGTRTMEERMVEMQEHIDTLMKAIEEKDSQIAQLKC 123
F + + +VV+VMM T E M EM+ I+ LMK +EE+D +IA LK
Sbjct: 65 FNNSTPASNLSDKESHLEVVSVMMVDV-TAEATMAEMERKINFLMKVVEERDHEIAALKD 124
Query: 124 QIENQHIAESNQTQVIKNHDKGKTIVQDDQP-QCSTSIASLSIQQLQDMITNCIRAQYGG 183
Q++ +ES+QT V+K DKGK +V+++QP Q S S+ASLS+QQLQDMI N IRAQYGG
Sbjct: 125 QMKACETSESSQTPVVKATDKGKNVVEENQPQQQSVSVASLSVQQLQDMIANSIRAQYGG 184
Query: 184 PTQDSLLYSKPYTKRIDNLRIPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLL 243
P Q S +YSKPYTKRIDNLR+P+GYQP KFQQFDGKGNPKQHI HFVETCENAG+RGD L
Sbjct: 185 PPQTSFMYSKPYTKRIDNLRMPLGYQPLKFQQFDGKGNPKQHIVHFVETCENAGSRGDQL 244
Query: 244 VKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGEL 303
V+QFVR+LKGNAF+ TRR VSM ELTNT QRKGE
Sbjct: 245 VRQFVRSLKGNAFE-------------------------CTRRVVSMMELTNT-QRKGEP 304
Query: 304 VVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMEL 363
V++YINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFEELATRAHDM+L
Sbjct: 305 VIDYINRWRALSLDCKDKLTELSAVEMCTQGMHWELLYILQGIKPRTFEELATRAHDMKL 364
Query: 364 SIASRENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKGKRQTNGAHH- 423
SIA+R +D L+ R + +T + ESM+V T KS SK K + +H
Sbjct: 365 SIANRGVKDFLVQRTRSDKNEINDTKKIANNVLNESMLVQETPLKSFSKRKETKHKRNHD 424
Query: 424 ------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYH 483
TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+LP+CKRPE++ KVDDP YCKYH
Sbjct: 425 GDEKRRPTLRERQKKVYPFPDSDVADMLEQLIEKQLIQLPECKRPEQVGKVDDPNYCKYH 484
Query: 484 RVIGHPVEGCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSK--------HQRKK- 543
RVI H VE CFVLK+LI KLA+E KIELD+DEVAQ+N + S QRK
Sbjct: 485 RVISHLVEKCFVLKELIRKLARENKIELDIDEVAQTNHVAVNMTSSVPLSILLYDQRKSL 544
Query: 544 ------DPKKLQPKRKRSKKFSQPQQ----------LVMLNKSFSKNFHKKEKKNLATSY 603
+P ++ ++K SQ ++ + L +SF ++ H +E + +
Sbjct: 545 IQFGTFEPILVRFQQKTMTSNSQNKEEPSEDEGEEWIEFLPRSFLED-HPEEILEVTACH 604
Query: 604 CIDV----------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCLMS 663
+ EE+DNS + +QRTSVFD IKP TTR SVFQR+SMA +EENQC
Sbjct: 605 TTSIVEVDNNYDSYEEMDNSNEIKQRTSVFDCIKPLTTRSSVFQRLSMATKKEENQCPTF 664
Query: 664 TSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSDQPKRKMDKLEVKPFDEVNSDKKLQSS 679
T + SAF+RLS+S SKK RPST FDRLK+T+DQ +R+M L+ KPF E N D K+ S
Sbjct: 665 TYAQTSAFKRLSISISKKHRPSTYTFDRLKMTNDQQQREMKTLKAKPFQEENDDDKIHSR 713
BLAST of Lag0019130 vs. NCBI nr
Match:
KAA0033746.1 (retrotransposon gag protein [Cucumis melo var. makuwa])
HSP 1 Score: 617.5 bits (1591), Expect = 3.1e-172
Identity = 384/802 (47.88%), Postives = 479/802 (59.73%), Query Frame = 0
Query: 4 KTSSMVAVMNKSYMGSTAHCCFNKLALQEDKASIVAGQETTLQGAYTNDK--FIVKYNPL 63
K +S + + SY+G K ++QE + V ++ +L+ + K I++ NPL
Sbjct: 5 KAASKSSTASDSYIGLVTQSHL-KRSMQEQEQGFVL-KKISLEQLIESPKGGIIIRDNPL 64
Query: 64 F------------EHDSDVVTVMMTGTRTMEERMVEMQEHIDTLMKAIEEKDSQIAQLKC 123
F E +VV+VMM T E M EM+ I+ LMK EE+D +IA LK
Sbjct: 65 FNNSMPASNLSNKESHLEVVSVMMVDV-TAEATMEEMERKINFLMKNFEERDHEIAALKD 124
Query: 124 QIENQHIAESNQTQVIKNHDKGKTIVQDDQP-QCSTSIASLSIQQLQDMITNCIRAQYGG 183
Q++ ES+QT V+K DKGK +VQ++QP Q S S+ASLS+QQLQDMI N IRAQYGG
Sbjct: 125 QMKACKTGESSQTPVVKATDKGKNVVQENQPQQQSVSVASLSVQQLQDMIANSIRAQYGG 184
Query: 184 PTQDSLLYSKPYTKRIDNLRIPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLL 243
P Q S +YSK YTKRIDNLR+P+GYQPPKFQQFDGKGNPKQHIAHFVETCENAG+RGD L
Sbjct: 185 PPQTSFMYSKSYTKRIDNLRMPLGYQPPKFQQFDGKGNPKQHIAHFVETCENAGSRGDQL 244
Query: 244 VKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGEL 303
V+QFVR+LKGNAF+WYTDLEPE GE
Sbjct: 245 VRQFVRSLKGNAFEWYTDLEPE-----------------------------------GEP 304
Query: 304 VVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMEL 363
V++YINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFEEL+TRAHDMEL
Sbjct: 305 VIDYINRWRALSLDCKDKLTELSAVEMCTQGMHWELLYILQGIKPRTFEELSTRAHDMEL 364
Query: 364 SIASRENQDLLLPNMRKEGRND--------EETIEESMVVNTTLPKSSSKGKRQTNGAHH 423
SIA+ +D L+ ++ +N+ + ESM+V T KS SK K + ++
Sbjct: 365 SIANIGAKDFLVQRTKRSDKNEINDTKKIANNILNESMLVQETPLKSFSKRKETKHERNY 424
Query: 424 -------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKY 483
TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+LP+CKRPE+ KVDDP YCKY
Sbjct: 425 DGDEKRRPTLRERQKKVYPFPDSDVADMLEQLIEKQLIQLPECKRPEQAGKVDDPNYCKY 484
Query: 484 HRVIGHPVEGCFVLKDLILKLAKEGKIELDLDEVAQSN---------------------- 543
HRVI HPVE CFVLK+LILKLA+E KIELD+DEVAQ+N
Sbjct: 485 HRVISHPVEKCFVLKELILKLARENKIELDIDEVAQTNHVAVNMTSSVLPSILLYDQRES 544
Query: 544 ------------------------------------------------------------ 603
Sbjct: 545 LIQFGTFEPILVRFQQKTMTSNSQNKEETSEDEGEEWIGVTHKKERQIGSVQTNSNFHQK 604
Query: 604 --LATIKGKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKNFHKKEKKNLA--- 663
I K K +R K K +P + + K F QP++ + L + ++F + + +
Sbjct: 605 HSKGNISHKKKGRRNKKMWKPKPIKGKDKDFFQPRRSINLAEFLPRSFLEDHPEKILEVT 664
Query: 664 ---TSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQC 679
T+ ++V EEVDNS + +QRT VF RIKP T R SVFQR+SMA EEENQC
Sbjct: 665 ACHTTSIVEVDNNYDSYEEVDNSNEIKQRTFVFHRIKPLTIRSSVFQRLSMAMKEEENQC 724
BLAST of Lag0019130 vs. NCBI nr
Match:
XP_031742199.1 (uncharacterized protein LOC105435721 [Cucumis sativus])
HSP 1 Score: 617.1 bits (1590), Expect = 4.0e-172
Identity = 326/523 (62.33%), Postives = 405/523 (77.44%), Query Frame = 0
Query: 2 SFKTSSMVAVMNKSYMGSTAHCCFNKLALQEDKASIVAGQETTLQGAYTNDK--FIVKYN 61
S K +S + + +Y G + +D+ S +A ++ L+ + K ++K N
Sbjct: 3 SKKAASKSSAASDTYTGPITRSRSKGIIQGQDQGSAIA--QSILKQLMESPKAGIVIKEN 62
Query: 62 PLF-EHDS-----------DVVTVMMTGTRTMEERMVEMQEHIDTLMKAIEEKDSQIAQL 121
PL+ ++DS DV++VMM +E M EM+ I+ LMK ++E+D +IA L
Sbjct: 63 PLYNDYDSASSRSLKEAHPDVMSVMMADV-AVETAMAEMERKINLLMKVVDERDHEIAAL 122
Query: 122 KCQIENQHIAESNQTQVIKNHDKGKTIVQDDQP-QCSTSIASLSIQQLQDMITNCIRAQY 181
K Q++ + AES+QT V+K DKGK +VQ++QP Q STS+ASLS+QQLQDMITN IRAQY
Sbjct: 123 KEQMQTRETAESSQTPVVKVDDKGKNVVQENQPQQQSTSVASLSVQQLQDMITNSIRAQY 182
Query: 182 GGPTQDSLLYSKPYTKRIDNLRIPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGD 241
GGP+Q S +YSKPYTKRIDNLR+P+GYQPPKFQQFDGKGNPKQH+AHFVETCENAG+RGD
Sbjct: 183 GGPSQTSFMYSKPYTKRIDNLRMPLGYQPPKFQQFDGKGNPKQHVAHFVETCENAGSRGD 242
Query: 242 LLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKG 301
LV+QFVR+LKGNAF+WYTDLEPES++SWE+LE+EFLNRFYSTRRTVSM ELTNTKQRKG
Sbjct: 243 QLVRQFVRSLKGNAFEWYTDLEPESIESWEQLEKEFLNRFYSTRRTVSMMELTNTKQRKG 302
Query: 302 ELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDM 361
E V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDM
Sbjct: 303 EPVIDYINRWRALSLDCKDRLTELSAVEMCTQGMHWGLLYILQGIKPRTFEELATRAHDM 362
Query: 362 ELSIASRENQDLLLPNMRKEGRN-------DEETIEESMVVNTTLPKSSSKGK-----RQ 421
ELSIASR +D L+P ++K+ + + T +ESMVVNTT P SKGK ++
Sbjct: 363 ELSIASRGTKDFLVPEVKKDKKEMKGAEKIVKSTSKESMVVNTT-PLKFSKGKEARVEKK 422
Query: 422 TNGA--HHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCK 481
+G+ LTLKERQ+K+YPFPD+DI DMLEQLLE QLI+LP+CKRPE+ KVDDP YCK
Sbjct: 423 DDGSERRRLTLKERQEKVYPFPDSDIADMLEQLLEKQLIQLPECKRPEQAGKVDDPNYCK 482
Query: 482 YHRVIGHPVEGCFVLKDLILKLAKEGKIELDLDEVAQSNLATI 496
YHRVI HPVE CFVLK+LIL+LA+E +IELDL+EVAQ+N A +
Sbjct: 483 YHRVISHPVEKCFVLKELILRLAREKRIELDLEEVAQTNHAEV 521
BLAST of Lag0019130 vs. ExPASy TrEMBL
Match:
A0A5A7URH1 (Ty3-gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold17350G00010 PE=4 SV=1)
HSP 1 Score: 711.1 bits (1834), Expect = 9.8e-201
Identity = 396/667 (59.37%), Postives = 488/667 (73.16%), Query Frame = 0
Query: 34 KASIVAGQETTLQGAYTNDKFIVKYNPLFEHDSDVVTVMMTGTRTMEERMVEMQEHIDTL 93
K IV + + Y++ K + P ++++VM+T T E RM E+++ ++ L
Sbjct: 51 KGGIVIKENHAIDEHYSSSKPSSEEMP----HPNIMSVMVTNVDTSENRMAELEKKVNML 110
Query: 94 MKAIEEKDSQIAQLKCQIENQHIAESNQTQVIKNHDKGKTIVQDDQPQCSTSIASLSIQQ 153
MK +EE+D +IA LK IE++ AES+ +KN DKGK ++Q+ QPQ STSIASLS+QQ
Sbjct: 111 MKVVEERDYEIAFLKNHIESRDAAESSHKHTVKNTDKGKAVMQESQPQNSTSIASLSVQQ 170
Query: 154 LQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRIPIGYQPPKFQQFDGKGNPKQHIAH 213
LQ+MI + I+ QYGGP Q LY KPYTKRIDNLR+P GYQPPKFQQFDGKGNPKQH+AH
Sbjct: 171 LQEMIASSIKMQYGGPAQTFSLYFKPYTKRIDNLRMPNGYQPPKFQQFDGKGNPKQHVAH 230
Query: 214 FVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTV 273
F++TCE AGTRGDLLVKQFVRTLKGNA DWY DLEPES+D+WE+LER+FLNRFYSTR V
Sbjct: 231 FIKTCETAGTRGDLLVKQFVRTLKGNACDWYIDLEPESIDNWEQLERDFLNRFYSTRHIV 290
Query: 274 SMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKP 333
SM ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKP
Sbjct: 291 SMMELTNTRQQKGELVIDYINRWRALSLDCKDRLTELSAVEMCTQGMHWGLLYILQGIKP 350
Query: 334 RTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPK 393
RTFEELATRAHDMELSIA+R +D L+P R + ++T I+ESMVV+ T K
Sbjct: 351 RTFEELATRAHDMELSIANRGAKDFLIPKSRSDKNELDDTKKIANSVIKESMVVHATPLK 410
Query: 394 SSSKGK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRP 453
S SK K R+ +G TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRP
Sbjct: 411 SFSKRKETKIERKHDGDEKRQSTLKERQEKVYPFPDSDVADMLEQLLENQLIQLPECKRP 470
Query: 454 EEMEKVDDPKYCKYHRVIGHPVEGCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKS 513
E+ KVDDP YCKYHRVI HPVE CFVLK+LILKLA+E KIELD+DEVAQ+N A I+ S
Sbjct: 471 EQAGKVDDPNYCKYHRVISHPVEKCFVLKELILKLAREKKIELDIDEVAQTNHA-IEMTS 530
Query: 514 KHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKNFHKKEKKNLATSYCI 573
+ KD LQ +R RS P++++ + + + + + N +S
Sbjct: 531 NPIKGKDEDFLQLRRSITLAEFLPRSFLEDDPEEILEVTACHAASIVEVD-NNYGSS--- 590
Query: 574 DVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCLMSTSTRPSAFQRLS 633
+EV+NS + QRTSVFDRIKP TTR SVFQR+S+A EEENQC TR S +RLS
Sbjct: 591 --KEVNNSNEINQRTSVFDRIKPSTTRSSVFQRLSVATKEEENQCPTFIYTRTSTSKRLS 650
Query: 634 VSTSKKSRPSTSVFDRLKVTSDQPKRKMDKLEVKPFDEVNSDKKLQSSIPSRMKRKLSVL 679
+ST KK RPSTS FDRLK+T+DQ +R+M + KPF E N D K+ S +PSRMKRKL V
Sbjct: 651 ISTLKKDRPSTSSFDRLKMTNDQQQREMKSSKAKPFREENDDDKIHSCVPSRMKRKLFVD 706
BLAST of Lag0019130 vs. ExPASy TrEMBL
Match:
A0A5D3BX77 (Retrotransposon gag protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold863G00570 PE=4 SV=1)
HSP 1 Score: 696.8 bits (1797), Expect = 1.9e-196
Identity = 411/791 (51.96%), Postives = 509/791 (64.35%), Query Frame = 0
Query: 15 SYMGS--TAHCCFNKLALQEDKASIVAGQETTLQ-GAYTNDKFIVKYNP-LFEHDS---- 74
SY+G H C ++ ED + Q ++K NP + EH+S
Sbjct: 12 SYIGKRPNTHSCSREIQSFEDMPPFEVAKNIWEQISKPPKGGIVIKENPAVDEHNSLSEC 71
Query: 75 --------DVVTVMMTGTRTMEERMVEMQEHIDTLMKAIEEKDSQIAQLKCQIENQHIAE 134
++++VM+T T E RM E+++ ++ LMK +EE+D +IA LK IE++ AE
Sbjct: 72 SNEEVPQPNIMSVMVTNVDTSENRMAELEKKVNMLMKVVEERDYEIAFLKNHIESRDAAE 131
Query: 135 SNQTQVIKNHDKGKTIVQDDQPQCSTSIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSK 194
S+ +KN DKGK ++Q+ QPQ STSIASLS+QQLQ+MI + I+ QYGGP Q LYSK
Sbjct: 132 SSHKHTVKNTDKGKAVMQESQPQNSTSIASLSVQQLQEMIASSIKTQYGGPAQTFSLYSK 191
Query: 195 PYTKRIDNLRIPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKG 254
PYTKRIDNLR+P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKG
Sbjct: 192 PYTKRIDNLRMPNGYQPPKFQQFDGKGNPKQHVAHFIETCETAGTRGDLLVKQFVRTLKG 251
Query: 255 NAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRA 314
NAFD Y DLEPES+D+WE+LER+FLNRFYSTRR VSM ELTNT+Q+KGELV++YINRWRA
Sbjct: 252 NAFDLYMDLEPESIDNWEQLERDFLNRFYSTRRIVSMMELTNTRQQKGELVIDYINRWRA 311
Query: 315 MSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDL 374
+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI +R +D
Sbjct: 312 LSLDCKDRLTELSAVEMCTQGMHWGLLYILQGIKPRTFEELATRAHDMELSIPNRGAKDF 371
Query: 375 LLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKGK-----RQTNG--AHHLTLK 434
L+P R + +T I+ESMVV+ T KS SK K R+ +G TLK
Sbjct: 372 LIPKSRSDKNELNDTKKIANSVIKESMVVHATPLKSFSKRKETKIERKHDGDEKRQSTLK 431
Query: 435 ERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVEGC 494
ERQ+K+YPF D+D+ DMLEQLLE QLI+LPKCKRP++ EKVDDP YCKYHRVI HPVE C
Sbjct: 432 ERQEKVYPFSDSDVADMLEQLLENQLIQLPKCKRPKQAEKVDDPNYCKYHRVISHPVEKC 491
Query: 495 FVLKDLILKLAKEGKIELDLDEVAQSN--------------------------------- 554
FVLK+LILKLA+E KIEL++DEVAQ+N
Sbjct: 492 FVLKELILKLAREKKIELNIDEVAQTNHVAIEMTSNVPPLTQLDDQRKSLIQFGTSILFQ 551
Query: 555 --LATIKGKSKHQRKKD---------------PKKLQ-----------------PKRKRS 614
+ TI ++K KD P +Q K +R+
Sbjct: 552 QRIVTINSQNKEAHGKDDDEGWITVTRQKGRQPNSIQKESQFHQKYAKGSISHKKKGRRN 611
Query: 615 KK-------------FSQPQQLVMLNKSFSKNF---HKKEKKNLATSYCIDVEEVDNSKK 674
KK F Q ++ + L + ++F H +E + T + + EV+N+
Sbjct: 612 KKMWNPKPIKGKDEDFLQLRRSITLAEFLPRSFLEDHPEEILEVTTCHAASIVEVNNNYG 671
Query: 675 S----------EQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCLMSTSTRPSAFQRLS 683
S QRTSVFDRIKP TTR SVFQR+SMA EEENQC TR S F+RLS
Sbjct: 672 SSKEVNNLNEINQRTSVFDRIKPSTTRSSVFQRLSMATKEEENQCPTFIYTRTSTFKRLS 731
BLAST of Lag0019130 vs. ExPASy TrEMBL
Match:
A0A5A7SRE2 (Ty3-gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold452G00210 PE=4 SV=1)
HSP 1 Score: 631.3 bits (1627), Expect = 9.9e-177
Identity = 383/739 (51.83%), Postives = 486/739 (65.76%), Query Frame = 0
Query: 4 KTSSMVAVMNKSYMGSTAHCCFNKLALQEDKASIVAGQETTLQGAYTNDK--FIVKYNPL 63
K +S + ++ SY+G K ++QE + V ++ +L+ + K +++ NPL
Sbjct: 5 KAASKSSAVSDSYIGLVTQSHL-KRSMQEQEQGFVL-KKKSLEQLIESPKGGIVIRDNPL 64
Query: 64 FEHDS------------DVVTVMMTGTRTMEERMVEMQEHIDTLMKAIEEKDSQIAQLKC 123
F + + +VV+VMM T E M EM+ I+ LMK +EE+D +IA LK
Sbjct: 65 FNNSTPASNLSDKESHLEVVSVMMVDV-TAEATMAEMERKINFLMKVVEERDHEIAALKD 124
Query: 124 QIENQHIAESNQTQVIKNHDKGKTIVQDDQP-QCSTSIASLSIQQLQDMITNCIRAQYGG 183
Q++ +ES+QT V+K DKGK +V+++QP Q S S+ASLS+QQLQDMI N IRAQYGG
Sbjct: 125 QMKACETSESSQTPVVKATDKGKNVVEENQPQQQSVSVASLSVQQLQDMIANSIRAQYGG 184
Query: 184 PTQDSLLYSKPYTKRIDNLRIPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLL 243
P Q S +YSKPYTKRIDNLR+P+GYQP KFQQFDGKGNPKQHI HFVETCENAG+RGD L
Sbjct: 185 PPQTSFMYSKPYTKRIDNLRMPLGYQPLKFQQFDGKGNPKQHIVHFVETCENAGSRGDQL 244
Query: 244 VKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGEL 303
V+QFVR+LKGNAF+ TRR VSM ELTNT QRKGE
Sbjct: 245 VRQFVRSLKGNAFE-------------------------CTRRVVSMMELTNT-QRKGEP 304
Query: 304 VVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMEL 363
V++YINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFEELATRAHDM+L
Sbjct: 305 VIDYINRWRALSLDCKDKLTELSAVEMCTQGMHWELLYILQGIKPRTFEELATRAHDMKL 364
Query: 364 SIASRENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKGKRQTNGAHH- 423
SIA+R +D L+ R + +T + ESM+V T KS SK K + +H
Sbjct: 365 SIANRGVKDFLVQRTRSDKNEINDTKKIANNVLNESMLVQETPLKSFSKRKETKHKRNHD 424
Query: 424 ------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYH 483
TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+LP+CKRPE++ KVDDP YCKYH
Sbjct: 425 GDEKRRPTLRERQKKVYPFPDSDVADMLEQLIEKQLIQLPECKRPEQVGKVDDPNYCKYH 484
Query: 484 RVIGHPVEGCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSK--------HQRKK- 543
RVI H VE CFVLK+LI KLA+E KIELD+DEVAQ+N + S QRK
Sbjct: 485 RVISHLVEKCFVLKELIRKLARENKIELDIDEVAQTNHVAVNMTSSVPLSILLYDQRKSL 544
Query: 544 ------DPKKLQPKRKRSKKFSQPQQ----------LVMLNKSFSKNFHKKEKKNLATSY 603
+P ++ ++K SQ ++ + L +SF ++ H +E + +
Sbjct: 545 IQFGTFEPILVRFQQKTMTSNSQNKEEPSEDEGEEWIEFLPRSFLED-HPEEILEVTACH 604
Query: 604 CIDV----------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCLMS 663
+ EE+DNS + +QRTSVFD IKP TTR SVFQR+SMA +EENQC
Sbjct: 605 TTSIVEVDNNYDSYEEMDNSNEIKQRTSVFDCIKPLTTRSSVFQRLSMATKKEENQCPTF 664
Query: 664 TSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSDQPKRKMDKLEVKPFDEVNSDKKLQSS 679
T + SAF+RLS+S SKK RPST FDRLK+T+DQ +R+M L+ KPF E N D K+ S
Sbjct: 665 TYAQTSAFKRLSISISKKHRPSTYTFDRLKMTNDQQQREMKTLKAKPFQEENDDDKIHSR 713
BLAST of Lag0019130 vs. ExPASy TrEMBL
Match:
A0A5A7SUW1 (Retrotransposon gag protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold239G002250 PE=4 SV=1)
HSP 1 Score: 617.5 bits (1591), Expect = 1.5e-172
Identity = 384/802 (47.88%), Postives = 479/802 (59.73%), Query Frame = 0
Query: 4 KTSSMVAVMNKSYMGSTAHCCFNKLALQEDKASIVAGQETTLQGAYTNDK--FIVKYNPL 63
K +S + + SY+G K ++QE + V ++ +L+ + K I++ NPL
Sbjct: 5 KAASKSSTASDSYIGLVTQSHL-KRSMQEQEQGFVL-KKISLEQLIESPKGGIIIRDNPL 64
Query: 64 F------------EHDSDVVTVMMTGTRTMEERMVEMQEHIDTLMKAIEEKDSQIAQLKC 123
F E +VV+VMM T E M EM+ I+ LMK EE+D +IA LK
Sbjct: 65 FNNSMPASNLSNKESHLEVVSVMMVDV-TAEATMEEMERKINFLMKNFEERDHEIAALKD 124
Query: 124 QIENQHIAESNQTQVIKNHDKGKTIVQDDQP-QCSTSIASLSIQQLQDMITNCIRAQYGG 183
Q++ ES+QT V+K DKGK +VQ++QP Q S S+ASLS+QQLQDMI N IRAQYGG
Sbjct: 125 QMKACKTGESSQTPVVKATDKGKNVVQENQPQQQSVSVASLSVQQLQDMIANSIRAQYGG 184
Query: 184 PTQDSLLYSKPYTKRIDNLRIPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLL 243
P Q S +YSK YTKRIDNLR+P+GYQPPKFQQFDGKGNPKQHIAHFVETCENAG+RGD L
Sbjct: 185 PPQTSFMYSKSYTKRIDNLRMPLGYQPPKFQQFDGKGNPKQHIAHFVETCENAGSRGDQL 244
Query: 244 VKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGEL 303
V+QFVR+LKGNAF+WYTDLEPE GE
Sbjct: 245 VRQFVRSLKGNAFEWYTDLEPE-----------------------------------GEP 304
Query: 304 VVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMEL 363
V++YINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFEEL+TRAHDMEL
Sbjct: 305 VIDYINRWRALSLDCKDKLTELSAVEMCTQGMHWELLYILQGIKPRTFEELSTRAHDMEL 364
Query: 364 SIASRENQDLLLPNMRKEGRND--------EETIEESMVVNTTLPKSSSKGKRQTNGAHH 423
SIA+ +D L+ ++ +N+ + ESM+V T KS SK K + ++
Sbjct: 365 SIANIGAKDFLVQRTKRSDKNEINDTKKIANNILNESMLVQETPLKSFSKRKETKHERNY 424
Query: 424 -------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKY 483
TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+LP+CKRPE+ KVDDP YCKY
Sbjct: 425 DGDEKRRPTLRERQKKVYPFPDSDVADMLEQLIEKQLIQLPECKRPEQAGKVDDPNYCKY 484
Query: 484 HRVIGHPVEGCFVLKDLILKLAKEGKIELDLDEVAQSN---------------------- 543
HRVI HPVE CFVLK+LILKLA+E KIELD+DEVAQ+N
Sbjct: 485 HRVISHPVEKCFVLKELILKLARENKIELDIDEVAQTNHVAVNMTSSVLPSILLYDQRES 544
Query: 544 ------------------------------------------------------------ 603
Sbjct: 545 LIQFGTFEPILVRFQQKTMTSNSQNKEETSEDEGEEWIGVTHKKERQIGSVQTNSNFHQK 604
Query: 604 --LATIKGKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKNFHKKEKKNLA--- 663
I K K +R K K +P + + K F QP++ + L + ++F + + +
Sbjct: 605 HSKGNISHKKKGRRNKKMWKPKPIKGKDKDFFQPRRSINLAEFLPRSFLEDHPEKILEVT 664
Query: 664 ---TSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQC 679
T+ ++V EEVDNS + +QRT VF RIKP T R SVFQR+SMA EEENQC
Sbjct: 665 ACHTTSIVEVDNNYDSYEEVDNSNEIKQRTFVFHRIKPLTIRSSVFQRLSMAMKEEENQC 724
BLAST of Lag0019130 vs. ExPASy TrEMBL
Match:
A0A5A7TZU9 (Ribonuclease H OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold498G00940 PE=4 SV=1)
HSP 1 Score: 612.1 bits (1577), Expect = 6.2e-171
Identity = 311/466 (66.74%), Postives = 374/466 (80.26%), Query Frame = 0
Query: 55 IVKYNP-LFEHDS------------DVVTVMMTGTRTMEERMVEMQEHIDTLMKAIEEKD 114
++K NP + EH+S ++++VM+T T E+RM E+++ ++ LMKA+EE+D
Sbjct: 55 VIKENPAMDEHNSLSERSNEEVPQPNIMSVMVTDVDTSEDRMAELEKKVNMLMKAVEERD 114
Query: 115 SQIAQLKCQIENQHIAESNQTQVIKNHDKGKTIVQDDQPQCSTSIASLSIQQLQDMITNC 174
+IA LK IE++ AES+ T IKN +KGK I+Q+ QPQ STSIASLS+QQLQ+MI N
Sbjct: 115 FEIALLKNHIESRDAAESSHTHTIKNANKGKAIMQESQPQNSTSIASLSVQQLQEMIANS 174
Query: 175 IRAQYGGPTQDSLLYSKPYTKRIDNLRIPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA 234
I+ QYGGP Q LYSKPYTKRIDN+R+P GYQPPKFQQFDGKGNPKQH+AHF+ETCE A
Sbjct: 175 IKTQYGGPAQTFSLYSKPYTKRIDNMRMPHGYQPPKFQQFDGKGNPKQHVAHFIETCETA 234
Query: 235 GTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNT 294
GTRGDLLVKQFVRTLKGNAFDWYTDLEPES+DSWE+LER+FLNRFYSTRR VSM ELT T
Sbjct: 235 GTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEQLERDFLNRFYSTRRIVSMIELTAT 294
Query: 295 KQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELAT 354
KQRKGE V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELAT
Sbjct: 295 KQRKGEPVIDYINRWRALSLDCKDRLTELSAVEMCTQGMHWGLLYILQGIKPRTFEELAT 354
Query: 355 RAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVNTT----LPKSSSK 414
RAHDMELSIA+R N DLL+P +RKE + + T +E+MVV+TT + K
Sbjct: 355 RAHDMELSIANRGNNDLLVPEVRKEKKEVKSTQKALKGVTKEAMVVSTTPLKLVSKEKKM 414
Query: 415 GKRQTNG-AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPK 474
KRQ G TLKERQ+K+YPFPD+D+PDML+QLLE QLI+LP+CKRP EM +V+DP
Sbjct: 415 EKRQDEGEKRRPTLKERQEKVYPFPDSDLPDMLDQLLEKQLIQLPECKRPAEMGRVNDPN 474
Query: 475 YCKYHRVIGHPVEGCFVLKDLILKLAKEGKIELDLDEVAQSNLATI 496
YCKYHRVI HPVE CFVLK+LILKLA + KIEL+LD+VAQ+N A +
Sbjct: 475 YCKYHRVISHPVEKCFVLKELILKLALDKKIELELDDVAQTNHAAV 520
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
KAA0056121.1 | 2.0e-200 | 59.37 | ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] | [more] |
TYK03695.1 | 4.0e-196 | 51.96 | retrotransposon gag protein [Cucumis melo var. makuwa] | [more] |
KAA0032121.1 | 2.0e-176 | 51.83 | ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] | [more] |
KAA0033746.1 | 3.1e-172 | 47.88 | retrotransposon gag protein [Cucumis melo var. makuwa] | [more] |
XP_031742199.1 | 4.0e-172 | 62.33 | uncharacterized protein LOC105435721 [Cucumis sativus] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A5A7URH1 | 9.8e-201 | 59.37 | Ty3-gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C2... | [more] |
A0A5D3BX77 | 1.9e-196 | 51.96 | Retrotransposon gag protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaf... | [more] |
A0A5A7SRE2 | 9.9e-177 | 51.83 | Ty3-gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C2... | [more] |
A0A5A7SUW1 | 1.5e-172 | 47.88 | Retrotransposon gag protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaf... | [more] |
A0A5A7TZU9 | 6.2e-171 | 66.74 | Ribonuclease H OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold498G00940... | [more] |
Pages
Match Name | E-value | Identity | Description | |