Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, polypeptide, CDS, three_prime_UTR
GTAGAAGTTATTTTGAATAATTACTCGAAGAGTAATTAGCCGAACAAATTCAACAAAGAAACACCTCATTGGCCAAACACTCCCTAAGCAACTTGTGTGTCACTCATCCATGGCGGAGCAATTACAGGAGGGGGATGGAAATGGCTAATTGAGCATTTTCTCTCTCGTCTTTCAGTTTCGATCTTGGCGGATCTTCGATCTGCTCGTTTTTCGGAACATTGCTTCAATCCCGTTCTCTCCCTTTCCGCCACTTCACCATGTCTCCTACTCCCGAGGAACCCAACAATTTGCAGAACGGAATCGAAACCGAATCGCACATTTCTTCGGAATCAGAGCGAGCTGACGAACGCAGATCGCACCCAGAAACCCTAGCAGATACAATCCCCAATGCCGGATTACGGCCAGAACAAGAATCGGAATCAGTCAACGAAGAACCAGATTCCGAGCCGGAGAATCGAGGGCAGCAGCCGTCGGAGTCAGTCCGGTTACAGGTTGTGATGGATGTTGCAGATCCGAAGGAAACCTCGACCCCGTCCAACGGCACCGATAACTCGCAACCTGCGCTGCGTAAAGACGAAGGAAGCCGCACGTTTACGATGAGAGAGTTGCTGAACGGATTGAAGGGTGAAGATGGCAGCGACAGCGTTAATGGATCTGAAGGCGACGTGCCCGAGCCCAACTCCGCTTACAGGTTTGCAATCTTGCTCTCTTTTGTTGATTTCTTGAGGATACTGAATTTCGTTGATTGGTGATTGAATCCTTTGTAGTAATGCATTAGGATCTTCAATTTTATAGAATTGAATGCTGAATGTTGAGTTTTCAACCTGATTTTTTTCATCTATTGTTGGAGGGAGTTCTGTCCAGCTTGAAGGTTGAGTACCGAATTTCCATTAAAATAGTAATGGGTTAGTAGTTTGTTTCAAAGAATAATATTTTAGGCGTATAAATCAAGGCTCCCATTTGAACAAGTCAATTTCTGAAAGTCAAAGTTGATCAAAACTTTTCCAACCAGCAAGGTTTCTTCTTTTTATCGACAATTGTTAATTTTCACCCATCTGATAGATTCCTCCGGCCCCTGCGTAGCCAATTCTTTCAAGCCGAAACTCGCTAACTCGAGCAGTTTTCTTCTGTGCAGGGGTCTTTCTTTTTCTTCCTAGAAGGGTAAGATACTCTGATTGATTTGAACGATTCAATGCTGTCAGGAGCTATGTTCATCGTCTCTTGATGTCCTTTGGCTCTAGTTGTTATGGTTTAGTTATGAATTTCTAGCTTCTCTAGGGGACTTTTTTTGTTTTGCATGTGGCAAACCCGATGACCCAATTATATAGGACTGAGCTCGTGTTAAAATTCAGTTTGTTCCCATTCACCATTCATTAAGCCTAGTGGTGAAAAAGGGCTTTGGGAATCTAAGAAGTTATGGATTTAATCCATGGTGGCCACCGCCTAGTCAAAGTGCTCAAAAGCTGACTTGAACACGCATATTATATAAAATAACCATTCAGTTAGCCCGAATTATTTGTCCAATCTGAAAATTTTCCCAATCCAATCCTGAACGGGTTTAATTCTTTCCTTGATCACACTATCCCAAAACATCATATATAACACTTTTCTGTGGTTTGCATATAAAACCTTTTAGTTGTGAAGCAGCTGGGATCCTTTGGGTTATCCAAACGTAAGTATTTCTATGGGAACTTTGTTTGGAAACGAGTTCCATTATGATTTTTATTTTTGCTTTCTTTTTTTCAACTCTGTCTCCATCTCTTTCTCACAAGTTCCATTATTATTTTTATTTTTTATTTGATTTCTTCTCAACTTATGTCGCTATAAATTTGGATTTTGTGGCTTCAAATTTTGGGGATTTCATTTTCTACCCAAATAGGCTTTTATAAAGTGGTAGGAATGTGTAGTTTGGTTTCAATTTTAGTGAAGTCTGGTTGAATTGAATGATATGGCCAATTTCAATTGCACTTTGCCCTTATCCCCTTAGGTATTATTAACTT
GACAGAAATCCCAATTGGGCGTACGAGATTGTTCATGTTTCATCTTTTATGTGCTTTGGACCAACCTTGAAACCGGATTACTGAAATGATAAACTGACCTGTTATCCATTTGGCTTGCTCAATTTTTACACTGGACGCTCTTGGTTTACTTCATTTCAGAAAAAAAAATATCGTGCAGCTTGGTTTTGGGTTAAAATTAACCCAACCTGAACCCACTAACACCCTAATAATTGTGTACGTTGTATACTTATCCAGTTACTGATGCATTTTTATGGATTGCAGTCCCTAAATTTCTTTACCCTATCTTTTTCTCAATTTTAAAAGATACATTACTCCTCTACCAAGTTCATTTCTAGGTGCACCATGACTCATTTTTGGAATTAATTGTTAGTTCTTTCTCTCCATTTGATTTTTATTTGAATCAAGAAAATGCTTGTTATTATTAGTGGATTCTTTAACTGTTATTCATAACAGTTGAGTTTTGATGTTACCACCTTTTGTTCAATAGTTATATTTATTTTTTCTTTATTTCTTTTCCTTACTGTATAAATTTGCAGTCTTAATCAAGATAGCCCACATCAGCCTTATTCTGAACAGAGCAGAGCTGCCATGGAGTTGATCAACAGTGTTACAGGGGTTGATGAAGAAGGCCGTTCTCGCCAACGGATTCTCACATTCGCTGCTAAGAGGTATTGCCAGTATAGTTTTTGTACTTTGTAGTTGTCTATTAACTAGGCCTATTACTTTTAAGTTCTGATAAAACTTGGACTTTGAATTCCTATTAAAATAAGAGCCCAGTTGTACTTCCTCATTTCCCCAATGGGAGAGTTTGTGTTTGAGCTCTAGTTGTGTAATCGTGATGCACTAAAAAGAATGGGAATAAGCTTTTCACTGCAACCAAAAAGAGAAAAAAGAAAAGAAAAGAAAAGCTTGTGTATACATGAGGGTTAGTAAAGAAGACAATATATACTAATTACTAATCTATCTATGTATGCATGGTTATATTTATATTGATAGTGATGGGTATTTTGATTTGGTTTCATGCAAACCAGTAAAGGAATAACGAATTTTGTATCATTTAGAGTTGAAAAACTTATCAGAGAATATTTATATTCCAGTTAAGCGACCACATTGTTTTCTGAATGGGGGTATGCCTAGCCTAAGTGCAAAGCACCTATGGCATAGATGAGATGCAAGTCCATCACTTCAAGACATCTTGGGTTTGTCTGCTTGGAGATAATTCTAGGAGGATAGATCTCATTAAGGCTCACATTTCCCCCGTTGAATCGTGCATGGGCTCTAAAGGATTATTACGCATCAGTGTACTTTGGGCTTTAGAAGAACCATGGGTAGCCACCAAAATTAAATTAGATGTCCAAGACATATGAGTTGTGCTCAACAAGTACATATCAAACTTTATTTTATTTTTTTTCAAAAATGAATACATAGGTTTTTCTGTAAACCTAATTGGTACATATGGAAAAATGAAAACAAAAGAAAGATCTAGAATTAATTTTTGCCTTTTGGATACAATATATTGCATGGTTATTTCTTATCAGTAATCGAATCTATTTCTTTTTAAAGATAAGTATTTGCCACTCCTATGGAGGTTCTTTGTTTCTTTACATAAAAATAAGTATTTGCCACTCTTATGGAGGTTCTTTGTTTCTTAACATCTATTTTTGATTCATATATTGCAGATATGCTAGTGCAATTGAGAGAAATGCCCAAGACTATGATGCTCTATATAATTGGGCTTTGGTTCTGCAGGTCCTAGACTTTATATTATTTATTTTGGGAAACTGAATTTTTCTACCCCTTTATTTCTTGATCAAATACATTGTAGAATGCAGGGAGATATTATTGATGGGTCATTAAAATAATTCCAGCTCAATTGAGTTGTGTTTGGTATAAAAAACTATTCCACAATTTAGCTAACACTTTCCAGTACCTAGTGTCAATATTTTATGTAGGAAGACAAAAAGGATATTTGAATATAA
AATGCAATGAAACATTTAATTTTCTTTCCATCGTACAAGTAGAACTTCGTTATAATAGGACACTATAAATTGTACAGATCTTTTCATTCTGTTCTAGGCAATATCACAATAAACTCTGTATTAATCTGGTTGTTGCAGGAGAGTGCAGACAATGTTAGTCCAGATTCTACTACACCTTCTAAAGATGCATTGCTTGAGGAGGCTTGTAAAAAGTACGATGAGGCTACCCATTTTTGCCCAACACTTCATGATGTATGTACAAAAAGCCTTTTTATAATTTTACTAAGGATTCTGTTGCAAAGAATGTTAACAGAAGATGGGGGAAAAGAATATAAAAAGATGGACAAACTTTGAATAGCTTCTAATGTTGATTCTGCCAGAAGTGAACTTAATTAGATTATAGAGTTCTTTTGTTCTTTTTTCCAAGATGGATCTAAATAAGTTCTCCTTGGGTCCACTTTTCACATCGGTAATTATTTATTTCTGCTTAGGTTTTGATAAACCATGGCTTATACTTGGTACTTGCAGATTAGAAAATAATTGTACGCATTTTTCACATCACCTGCAAAATTCATAGTTGGTGTTGTCTTAATAATTTTATGTATTTGGATTGCATGCCAGGCTTTTTACAATTGGGCTATTGCAATCTCTGATCGAGCCAAAATGCGTGGTCGTACAAAGGAGGCTGAAGAACTGTGGAAGCAGGTTCATACTTCCATCCCTAGACGATTTTTTTCATTACACTTGAGAAAAATTCCCCGCCCTAGTGTATCTATGATTTGGAAATGTCTATGAGTGGAAAGGATGAACCTTATCTTTTTCTGGGATCTGAAAAGATTTACTGTGCTATCGTTATTTCTGTGATTCAATGGACACTCCACGCCTTCTCCATTTTGTGTTATTTTCTTCTCTTTTTGGTAAAATGTTTCTTTTATGAAATAGCTTTTGTTCCGTAAGTTGAAACCATCAATTTCATTACTTGGTGAAGGCATTGGTAGCCTCCTTTATAGGTTGAAAGTTTGAATATACGTCCAAACTCTTACAAATTTATATATTTTTCTAAAAAATTGCCTTCTCTCTCAAAAGTATCATTTAGGCTGTTAACTGTATAGCAGATCTAATTATAATTGATGAAAATTTTGAAATGTCGGTTAACCTCATCAAATACAGTTTCTTTGTTTACCTTAGTTCAATGAAGCTGAAAATAATTGAGGGATTGAATAGTAGGATCTGGCTGTTATCCCGTGAGACTAGTTGAGGCAGCTTGGTGTTGACACTCTCAAATATATTAAAAATAAATAAATAATTGAGGAATTGAGTGGAACTTATTAAAGTTCATCCTCAGAAATAATTGATCTATAAGCTGAGCATGAATAACATATCACAGAATTATAAGTGTAGAAAAAAAAAAATTGAAAGTTAGAAAGAATAAAAGAAGAAAGAACTGTTTGTATTCTTGTTTCCACTACTAATATTCTTATTGTTCTAATTTTCAGTAATTGTGTTTTCCGTTTCTTGTGAACTAGTCATTTCAATTATTCCATCTTTCAGGCTACCAAAAATTATGAAAAAGCTGTCCAACTCAACTGGAATAGTCCCCAGGTACATCTCAAAGAAATTCCTTGAAAACCCACTTTGTGCTTTAAATTTTTCTTTGGATTTTTTTTTCTTTCTTGGCAATGTGTTCACTTCTGTTAAATTGGTTCTAGTATAAGCCTACATAATTTCAAAAAAAAAAAAAAAAAATTCAAATGAAGTGTGCATTTATTTTTGCAAATACATGAAGGCTAACACTTGTGGATATATTTCAGGCGCTTAATAATTGGGGGCTTGCTCTTCAGGTACATGCTATGTTTTTCACATTATATTAATCTTCGAATTGAAAGGCTCTCTTTCCCACCTCCATGTATCCAACCACGAACAGTATGTAAGGAACTTATTTTTATTAAATATCCCGATCCATTTCAGGAACTCAGTGCGATTGTGCCAGCACGAGAAAA
ACAGACAATTGTAAAAACAGCTATCAGCAAGGTAACATTGTTATTTAGAAACATTTAAATGGCAAAGAATTGCAGCCTTTTGTTTGATATTTACTGTTGCATTCTATGCCTTGATAACTCAGTTGTCATCCATTGCAGTTTCGTGGCGCTATACAGTTGCAATTTGATTTTCATCGAGCCATCTACAACCTTGGTACTGTTCTGGTGAGTTTGTTCCTGCCTTATATATGGAAATTGCATGATCTTACATCTGTTCAATGCGCTGCTTTTAGATTGACTTCATCAACTTGTGTTGTAGAGATGAATATAGGAACTTTTTAATTGCAGTTTATATGAATGACAAGGGAACGGAAGACTTAAGTCTTGATTGATTTATATAAAGGATATTAATTTGCCCAAATCTTAACTCATTTCTGGCATCTACAGTATGGACTAGCTGAGGACACATTACGGACTGGTGGAACTGGCAACGTCAAGGATGCTTCCCCCAATGACTTGTACAGCCAATCAGCAATTTATATTGCATCAGCTCATGCTCTAAAACCAAATTACTCTGTAAGCCCAGTTCTTTTGTTTCGGAAAATTTCTTTGTTGGGCCTGATCTAATTACCAAATACTATTTTCAGTTCAGCTTCTCTTACATTTTTACCAATAATTGGACATGTATTTGTGCACAAATATGATAGTGTCTTAAGTCAACTTCTCGTTGTTGTGGAAAAGGAAGGAGGGACATTTCAAATACTGTATTCGAAGGGCTTTAAGTCTTGATTTTCTTCCATTTCTCTTTGTTTGAGTTTGAATATTTGACTTATTTATGTATTTTATTATTATTTCTTTTATAGTGCCATTTTTCGTCTTTTCACTTATTCACAACTAGGACAACATTGAATATCTGCACATATATCTCTTTAATGATAGAAGAGACATTGATTGATTCTCTCAACTTTAGTGTTATCGTGGTATTAGAACTTTAGAGTTTCTTGTCGAGTTAGGGTTGAGAGAGTCTTGGGTTTGGAGGCAAGTCATACACCGCTCTCATCAGGATACTTAGAACGTCGTCAAAATCTATTCGTCGTCAGAATTTAACACATTGTGTCAGAGGTTGTTTTTCGAAGTATTTTTGTCCGTTTTTCTCATAGCCTGCATTGATAGAAAGCTCATTGCCTGATCCATATCTCAATCAAATTTCATCACAAATGGAGGTGCGAGTGGGCTTTATTCTCACAAATAAGTTTCTGACGCTCATTTCATGTGCAGCCGGTGGGTGTGTAACAGAGCGTGCACAGTCCGTTGTTAACAATTTTTGTCGAAATTTTCCCACTGTCTTGTCCTCTTTCCAAGGGTGTATTGTTTCTGTTGTTTCAATGCTTAGAGGGGGGTGTTTTAAGCAGTCGACGACTGTTTGGGTTTGTTCGCTAAGAGTTTTTCTTGTAACTTTTTATCAGATTTTTTTTCTCATATTATTTACTGATTTTGTCTCCCGATTGAAAACGCTTCAAGAAGTTCCGGGCGTTTTGATGACTTCCAGGTGGCTTTGGAAACTTTTCTTTTCCATTTTTTTTAAAGTGATTCCGATTTTTTTTTACTATCTATTTGGTGATTTTGTGCGTGTATGAAAATTATGGCTGACTAGAAATCAACGGTTGTGACTGAGATGGTCTCTAAGATTACCAAACACAAGCTCAAATTATTATGCGTGGTGATCGGATATTCGTCACTATATTCGAAGCATTTAGATGGAAGATTCTTCACCACTAGGAGATTTGAAGAAAAATTGATTACGAGACGACTCAATGATGTTCATGTAACTCAATCAAAGAAATACTCGAAAATATTGAATTTTTGTATTCTAGAAAATGTAATATTAATAAAGTGTTTGATGTTTGCAAGACTCTTTATTAGCTTGGTCAGGGTGAGAAATCTTTCATTACTTATTTCATGGAAGTTAAGAAGACGTATGCATTGATGCCCATTAGTACTGATCCTAAAGTTCAATTGCC
ACAACGAGAACAAATGTTCACGATGAGTTTCTTAGTTGGTCTCTCCTCGAAATATGACATGCCCAAGGATCTAGTTCTGACATCTCATCATTAGAGGCAGCTTATACTCGTATACTTAATATTGAGAAGCCAAGAGTCGTTTTGTCATTTGATTCAAATAGTGCATTGGTTGGATAATCCAATGATTATAGAGGTAATAAGGGAGTTGAATAAATTCATGGGGCCAACGAGGCTGGAGCGAACTATCCTCATTGATATTCTAATACTCAACGATCAAATTGTTGAATAGGACTCAGAGAGCCCCCCATGCAAATGTAGTGTTAGCTCCAGACGATTCAAAGAAGTCAGTTACCAATTTCTATAGAACAAAAATTCATCTCTCTCAGATCTGAGTTTTTTTGTCTTTTTTTCTTTTTGCAATTGTTCCTGTACTTTTTTCATTCTCTTAATGAAATTTCAGTCCCCTTTTTTAAAAAAAAAAAATTCTACAGAACAATTTGCTAAATTTAGCAGTAGCATGAGTCATTGACGACATCATCTACTCCTATCACTACCATCAGTGAGACAAGTAACACATCTAAATGCCATCTTTCCTCCACCACAAAATGGGTCATTGACTCTAGCGCTACCGACTATATGACTGGTAATCCTAGTTTATTCTCTAAATTTTTTCCATCCATGTCTATACCTACAGTTACTATAGCTGATGGAACTGCTAGCTATATCTTAGGTTCAGACACTGCCAATCTTACTGATTCCATCTCGTTATCGTGTGTTTTAACTTTTAAGTTTGCCTTAGTTCTCTTTTAATTTGATTTCTATTAGTAAACTCACTTGCGACCTTCTTTGTTTTGTGTTTTTCTTTCCTGGTTGTTGTTTCAAGTTCTTACGACAAAGAGGACTATTGGTAAAGGGCGTCCTTGGGCAACCAATATCGATACCATTACATGTTCTAGTGTGCCTTCTCCTTTTGAAGAGCATTGTTGTCTGTGTCATCTGTCTATCTCCGCATTGAAGAGACTTCGTCCACAATTTCACCATTTATCTTTATTAGATTGTAAATCGTGTCAGTTTGTTAAGTTTCATCGTTTCAGTTTGTATCCTCGAGTCAATAAGCGAGCTCGTGCTCCATTTGAGTTAGTACATTTGGATTTTTGGGGTCCATGTCCAATAGTGTCCAAGCTAGATTTTCGATATTTCATTATATTTGTTGATGATTATTCTCGTGTAACTTGGTTATGTTTGATGAAAAATTATTCTAAGTTGCTTATCCTTTTTTGCAATTTTCATGCTGAGATTCAAACTCAATTTTGTGGTTGCCTTAAAATTTTTTCTTTTTCTTTTTATTGATATTCGTGATTGTCTGGGCCAGCTTGCGCGCACCTCGACTATTGTCACAGGGCATACGCCTGACCCTACCACATTTGGATGCCAAGGAAACCCGTAAGGAATTAATTCCTAGGTAGGTGGCCACCATGGGAATTGAACCCATACCTTATGCACCCCAAAACCCTCTTTGACCAATTGAGCCATCCCATTATGGTTATGGTTGCCTTAAAATCTTACGCAGTGATAATGTTAAAGAATATTTTTCTCATGCTCTCAGATCTTACCTAGATGCACATGGCATGCTTCATCAATCTTCTTGTGTGATACTCCATCTCATAATGGAGTAGCAAAATGTAAGAATCGTCGTCTTCTTGAAACACTGAAGCATTAATGTTTCAGATACATGTTTCAAACCCTTTTGGGTTGATGGTATTTCAACGGCATGTTTCTTAATAAATCGTATGTCGTCATCGGTTCTCAAGGGTGAGATACTTTTTCGTGCTTTGCATCCCCAATCGTCATGTTCCCCCTTACACCGAAAATTTTTAGGTGTACGTGCTTTGTCCGAGATGTTCGCCCTACTCTCACTAAGCTCACTCCAGTTCTTAAAATGTAATTTCTTGGGTATTCTTAAGTCCAGAAAGAATACAAGTGTAACTGTCCAAGTCTTG
AAAATACTATGTGTCTCATGATGTCACCTTCTTTGGACACTTTTCCTTCTTTTTGTCTTTCTAGTCAAGTACAAGTTAGGGGGAGAATCAAAAGAGTGCTGACGACTTCCTTATATATACCGTTGTGACCCATAGTGACCCTTCTACTAGTCCTCCCTCGGTCCATCCACCTATTACTCAAGTCTATACTCGATGACAGCCTCTTACTGATCCATCCATTCTAGCTGCATGACCACTTTCTGCGGCCTCGGTAGATCTAGGGACAAGCGATGATCTTCCTATCACCCTTTGTAAAGGTAAATGTCAGTGCACTCATCCTATTTCATCCTTTGTTTCGTATAATCATTTATCACCATCGACTTGTTCTTTCATTGCCTCTTTAGAGTCTGTGCCTATTCCTTAAAGAATTCGTTAAGCTTTATCTCACCCCGGTTGGTGTGCATAGATGGTGGAAGAGATGATTGTTTTCGATGACAATTATATTTAGGATTTTGTTTCTCTTCCCACGGGAAAGAAGCCTATTGGTTGTAAATGGTTGTTTGCAGTCAAAGTCAATCCTGATGTCGGTCTATAGCTCGGTTGAAAGCTGTTCTTGTTGCAAAAGTCTATGCACAAATGTACAAAGTTGATTATGATGACAAGTATACTTAGGATTTTATTTCTCTTCCTGCGAGAAAGAAGCCTATTAGTTGTAAATGGTTGTTTGCAGTCAAAGTCAATCCTGATGGGCCTATAGCTCGGTTGAAAGCTCTTCTTGTTGCAAAAGGCTATGCACAAACGTATGAAGTTGATTATGATGATACTTTTTCCCGTTGCGAGAATGGCTTCAATAAGGTTATTTATATCATTGACATCTATTATCATTGGACTTTACATCAGCTTGATATAAAAAATGCATTCCTTCATGACGATCTGAAAGAAGAGGTCTATATGGAGCAACCACCTGGGTTTGTTGCTCAGGGAGAGAATGGTAAGGTATGTTGCCTCCATGAATCCTTGTATGGGCCGAAACAAAGCCTTAGAGCTTGGCTTGGAAAATTCAGTCAGGTGATTGAAGAGTTTGGGATAAAGAAGAGCAAGTCTAATTACTCTGTTTTCTTTAAACGATATGAGACTGGTGTTATCTTATTGATTATGTACGTTGATGACATTGTTATTATGGGTGATGATGCATTGAGTATTCGATCACTCAAGACTTTCCTTCATAGTCAGTTCAATACAAAAGACTTAGGTATGCTGAAGTTTTTTTTAGGTACTGAATAAGAAGCAAGAAGGGAATTGTATTATCATAGAGAAAGTATGTTTTTGACTTGCAGACTGGAACATGGAAGTTAGGGACTAAGCTATGTAGTACTCCAATGATGCCCAATTCATAGTTTACAAAAGAAGAACTGTTGAAAGATCTTGAAAAGTATAAAAGATTAGTAGGAAAATTAAATTATCTGATTGGGTTATGTTAGAACAAATTTTGGATTATTTGAATGCTTCTCCTAGTCGTGGTTTATTATACAATGATAAACCATACTGACATTGAAGGCTTCTCAGATGTTAATTAGGTAGGGTCTAAAGAAGACTAGAGATCGACCTCAGGGTGTTGTGTATTTGTGGGTGGTAACTTGGTTTCTTGGAAGTAAGAAATAGAACGTAGTGTTACGTTCAAGTGCTAAATCAGAATACAAAGTTATGGCACAGTCAGTGTGTGAATTAGTTTGGATACGTCAACTTCTCATTGAGATGGAGTTTGATGTCACGACACCAATGAGGTTATGGTGTGATAACCAAGCAGCCAGTCATATTGCATCTAATTCAGTTTTCCATGAGCGGATGAAAAACATAGAGGTCGATTGTCATTTTATATGATAAAAGATGCATCAAGGTGTGGTTTCCACGGGGTATGCGAAGATTGGAGAAGAGTTAGGAGATACCAAAGCCGTTGAATGGAGTTCGGATAGGTTATCTCTAACAAGTTGCACATGATTAATATATATGCTCCAAATTGA
GGGTCAGTGTTAGAGTTTGAATATTAGACTTGTTTATATATTTATTATTATTTCTTTTATAGTGCCTTTTTCATCTTTTCACTTGTACACAGTATATCTATATATATACATATTCCTTTAATGAAGAGACATCGATTGATTCTCTCGCATTGTCACTCTTGTTCCTTGATTCTTGATTGATTTGTCAATTTTTTTGTTCACTTACAAGCTTCAATTGAAATGTTGACGTTTGTTTTTAATAGGTTTACAGCAGTGCCTTGCGCTTGGTTCGTTCAATGGTTAGTTCTTTCTCAAAAAATGTCCATAATAACAATTGATCATATCTTATGTTTTTTTTTGGATAAGAAACAATTTCATTATAGATATAAAATTATGTGGGCAAGAAGCCAATTACAAAAAGGAGTTCCAACTGTTAATGAGTACATTAAAGCTATAATCATGAAACATAGGTGATAGTTTACACCAAGACACCACAACCGATGTCTTATGTTTTATTTTCTCAATATAAACCAGAAAACTGTCTTTAGCGTTCGTATGGTTTAGTTTAGTATGGATCGTATTTGTCTTTTAATGAGAAGATGAGAAAGATCTGACGAAGTTTTAATATAAATGAAGCTGTCAAAAGTCATAATTGATTCCATAATGTCTCGGAGGGGCGCCCTTGTTGGTAGAGACTTTGGATCTCCAAGGTATGCTCCTTTGGAGGTGTCAAGTTCAACCTACAAGTGTACTTAATTATAAAATCTTGTGTGTCTCCCTGATCCGAGCCTTAAGATGGGAGTTTGTTCCAGTTCTTGATTATAAAAAAAATAGATTCCATATTAAGGATATAATGTGATCAAGAGTTGAAATACCGTATTGATGAAGATTACTTTAGCAGTCAAAATGTAGGTCACATTTGTCATGTCGTTACAACTTTATTATATACGATCAAACATTTTACAGTATTTTTTAAGATTCCAAGTAATCTGTATCCTGCAAATTAGTACCTATCATATATTCAGATTTTACTTTTGTTGTCATCCCTTGATCCAGAAATATATTTTGGTTCAATGAACTTGATGAAACTTGACTTTTTGACAGCTGCCGTTACCGTATTTAAAAGTTGGATACCTGACAGCACCTCCTCTGGGGAGACCACTTGCTCCTCACAGTGATTGGAAACGTTCACAATTTTTTCTAAATCATGAAGTATTGCAAAAGGTACAATTTATATATATGCATACGTACATATTTTATCAGCTAACCATGCAAGATCAGTGTGTTTACGTGATTTGGTTGAAACTGGTTTTGAAGTATTATGCTCAATCCTCTTTCAGCTTAAAATAGGAGGGGAACAAGTACAATCATCTCCTAGTGTTTTAGGAAGATCTGGAAGTACCTTGAATGGCGGCGATAGGACAGTCAAAGTAGAAATTCCAGACATTGTCTCTGTATCGGCATGCGCAGATCTAACCTTACCTCCTGGTGCTGGACTCTGCATTGACACAATCCATGGACCAGTTTTCTTGGTGAGCTTATTCTTGAAGCATTTACATTTTGTTTTTAATAACAAAATTGTTATTGATGGTACGAAATTGTGAAAGAGGGGAAGATCCCCAATGGGATAAGAGTTGCAGAAGATATGGTAAAGAGAAAATGTATATTATAGTAATGAAGAAGAGAATGATAACAAAAATGAAAAAGGAGAGATTTTTTAAAATCCAACCAACAGATTGTGGTTCTTGGTGGTGCATGAATCACACAAAATTCTTTTGTACTTGTATCCTTTCCATTTTTATTAACAATTGGAAAACCTTCCGGTAGTTGTACTTCAGGAAGGGGATCTCTCTACGCTATGGCCTTAGGATGCTCTTTTTAGCTTTTGTGTGAACATTTGGAAACCCACAAATAAATTTAATTATAAAATCTCTCGTCTATTGGATCCGAGCCTTGGAATGGGTGTAGATGCCCCAAGTATGTAGTGAAACGAAGTTGCCATTCTTCAGTTATCAAAAGAA
AAAAAAAACTCCTATTTTATCCCTTTTGTTTCTAAATATGAAAGATTATCCTGTCTGATAACGAGTAATTGCTACTAGAGTGGAGTGCACTGATCACATGGAAATTCCCATAGTTTCTTTTAGTTACATTGTACCATATCCATTGTTGCATGCATTATTGATCATCAACTTTTGTTTTTAATATTCGTGCCTGGGCTAGCTTATACTTCGACTATTCTCACAAGAGATGCCGCCTAATCCTACAACATGAAATCGTAGGAATTTATTTATTTCATAAATTGGCCACTATAAATTGAATCTTAAACTCTCTTTAACCACTATATTTTGGTTAATAAGTATTAACTTATATGATCACCTTATTCATTCTATCAGGTTGCTGACTCATGGGATGCGCTCGATGCATGGCTCGATGCAATTAGACTAGTTTACACTATCTACGCTCGAGGCAAGAACGAAGTTTTGGCTGGCATCTTAACCGGTTGATTGTTACCAAGTATGCGAATGTATTATTGATATTACCTTGATGTTTATATGATGCTTACTCACAGTCGATTGAGTATTCATTTCTCTAAATTGAAACTCCAAATTTTGGGGTGCTTAATACATGTTTTTCTAGTCAGGTTCCTTCTCCATGTGTGTAATAATCTATTATGTACATGTCAGCTTGTTAACAGCACGCACTAGTGGTGCCGTTTTGGACTTTTGAAAAGAGCTTCGGATTAATTTCATTTGAAAATTCAA
mRNA sequence
GTAGAAGTTATTTTGAATAATTACTCGAAGAGTAATTAGCCGAACAAATTCAACAAAGAAACACCTCATTGGCCAAACACTCCCTAAGCAACTTGTGTGTCACTCATCCATGGCGGAGCAATTACAGGAGGGGGATGGAAATGGCTAATTGAGCATTTTCTCTCTCGTCTTTCAGTTTCGATCTTGGCGGATCTTCGATCTGCTCGTTTTTCGGAACATTGCTTCAATCCCGTTCTCTCCCTTTCCGCCACTTCACCATGTCTCCTACTCCCGAGGAACCCAACAATTTGCAGAACGGAATCGAAACCGAATCGCACATTTCTTCGGAATCAGAGCGAGCTGACGAACGCAGATCGCACCCAGAAACCCTAGCAGATACAATCCCCAATGCCGGATTACGGCCAGAACAAGAATCGGAATCAGTCAACGAAGAACCAGATTCCGAGCCGGAGAATCGAGGGCAGCAGCCGTCGGAGTCAGTCCGGTTACAGGTTGTGATGGATGTTGCAGATCCGAAGGAAACCTCGACCCCGTCCAACGGCACCGATAACTCGCAACCTGCGCTGCGTAAAGACGAAGGAAGCCGCACGTTTACGATGAGAGAGTTGCTGAACGGATTGAAGGGTGAAGATGGCAGCGACAGCGTTAATGGATCTGAAGGCGACGTGCCCGAGCCCAACTCCGCTTACAGTCTTAATCAAGATAGCCCACATCAGCCTTATTCTGAACAGAGCAGAGCTGCCATGGAGTTGATCAACAGTGTTACAGGGGTTGATGAAGAAGGCCGTTCTCGCCAACGGATTCTCACATTCGCTGCTAAGAGATATGCTAGTGCAATTGAGAGAAATGCCCAAGACTATGATGCTCTATATAATTGGGCTTTGGTTCTGCAGGAGAGTGCAGACAATGTTAGTCCAGATTCTACTACACCTTCTAAAGATGCATTGCTTGAGGAGGCTTGTAAAAAGTACGATGAGGCTACCCATTTTTGCCCAACACTTCATGATGCTTTTTACAATTGGGCTATTGCAATCTCTGATCGAGCCAAAATGCGTGGTCGTACAAAGGAGGCTGAAGAACTGTGGAAGCAGGCTACCAAAAATTATGAAAAAGCTGTCCAACTCAACTGGAATAGTCCCCAGGCGCTTAATAATTGGGGGCTTGCTCTTCAGGAACTCAGTGCGATTGTGCCAGCACGAGAAAAACAGACAATTGTAAAAACAGCTATCAGCAAGTTTCGTGGCGCTATACAGTTGCAATTTGATTTTCATCGAGCCATCTACAACCTTGGTACTGTTCTGTATGGACTAGCTGAGGACACATTACGGACTGGTGGAACTGGCAACGTCAAGGATGCTTCCCCCAATGACTTGTACAGCCAATCAGCAATTTATATTGCATCAGCTCATGCTCTAAAACCAAATTACTCTGTTTACAGCAGTGCCTTGCGCTTGGTTCGTTCAATGCTGCCGTTACCGTATTTAAAAGTTGGATACCTGACAGCACCTCCTCTGGGGAGACCACTTGCTCCTCACAGTGATTGGAAACGTTCACAATTTTTTCTAAATCATGAAGTATTGCAAAAGCTTAAAATAGGAGGGGAACAAGTACAATCATCTCCTAGTGTTTTAGGAAGATCTGGAAGTACCTTGAATGGCGGCGATAGGACAGTCAAAGTAGAAATTCCAGACATTGTCTCTGTATCGGCATGCGCAGATCTAACCTTACCTCCTGGTGCTGGACTCTGCATTGACACAATCCATGGACCAGTTTTCTTGGTTGCTGACTCATGGGATGCGCTCGATGCATGGCTCGATGCAATTAGACTAGTTTACACTATCTACGCTCGAGGCAAGAACGAAGTTTTGGCTGGCATCTTAACCGGTTGATTGTTACCAAGTATGCGAATGTATTATTGATATTACCTTGATGTTTATATGATGCTTACTCACAGTCGATTGAGTATTCATTTCTCTAAATTGAAACTCCAAATTTTGGGG
TGCTTAATACATGTTTTTCTAGTCAGGTTCCTTCTCCATGTGTGTAATAATCTATTATGTACATGTCAGCTTGTTAACAGCACGCACTAGTGGTGCCGTTTTGGACTTTTGAAAAGAGCTTCGGATTAATTTCATTTGAAAATTCAA
Coding sequence (CDS)
ATGTCTCCTACTCCCGAGGAACCCAACAATTTGCAGAACGGAATCGAAACCGAATCGCACATTTCTTCGGAATCAGAGCGAGCTGACGAACGCAGATCGCACCCAGAAACCCTAGCAGATACAATCCCCAATGCCGGATTACGGCCAGAACAAGAATCGGAATCAGTCAACGAAGAACCAGATTCCGAGCCGGAGAATCGAGGGCAGCAGCCGTCGGAGTCAGTCCGGTTACAGGTTGTGATGGATGTTGCAGATCCGAAGGAAACCTCGACCCCGTCCAACGGCACCGATAACTCGCAACCTGCGCTGCGTAAAGACGAAGGAAGCCGCACGTTTACGATGAGAGAGTTGCTGAACGGATTGAAGGGTGAAGATGGCAGCGACAGCGTTAATGGATCTGAAGGCGACGTGCCCGAGCCCAACTCCGCTTACAGTCTTAATCAAGATAGCCCACATCAGCCTTATTCTGAACAGAGCAGAGCTGCCATGGAGTTGATCAACAGTGTTACAGGGGTTGATGAAGAAGGCCGTTCTCGCCAACGGATTCTCACATTCGCTGCTAAGAGATATGCTAGTGCAATTGAGAGAAATGCCCAAGACTATGATGCTCTATATAATTGGGCTTTGGTTCTGCAGGAGAGTGCAGACAATGTTAGTCCAGATTCTACTACACCTTCTAAAGATGCATTGCTTGAGGAGGCTTGTAAAAAGTACGATGAGGCTACCCATTTTTGCCCAACACTTCATGATGCTTTTTACAATTGGGCTATTGCAATCTCTGATCGAGCCAAAATGCGTGGTCGTACAAAGGAGGCTGAAGAACTGTGGAAGCAGGCTACCAAAAATTATGAAAAAGCTGTCCAACTCAACTGGAATAGTCCCCAGGCGCTTAATAATTGGGGGCTTGCTCTTCAGGAACTCAGTGCGATTGTGCCAGCACGAGAAAAACAGACAATTGTAAAAACAGCTATCAGCAAGTTTCGTGGCGCTATACAGTTGCAATTTGATTTTCATCGAGCCATCTACAACCTTGGTACTGTTCTGTATGGACTAGCTGAGGACACATTACGGACTGGTGGAACTGGCAACGTCAAGGATGCTTCCCCCAATGACTTGTACAGCCAATCAGCAATTTATATTGCATCAGCTCATGCTCTAAAACCAAATTACTCTGTTTACAGCAGTGCCTTGCGCTTGGTTCGTTCAATGCTGCCGTTACCGTATTTAAAAGTTGGATACCTGACAGCACCTCCTCTGGGGAGACCACTTGCTCCTCACAGTGATTGGAAACGTTCACAATTTTTTCTAAATCATGAAGTATTGCAAAAGCTTAAAATAGGAGGGGAACAAGTACAATCATCTCCTAGTGTTTTAGGAAGATCTGGAAGTACCTTGAATGGCGGCGATAGGACAGTCAAAGTAGAAATTCCAGACATTGTCTCTGTATCGGCATGCGCAGATCTAACCTTACCTCCTGGTGCTGGACTCTGCATTGACACAATCCATGGACCAGTTTTCTTGGTTGCTGACTCATGGGATGCGCTCGATGCATGGCTCGATGCAATTAGACTAGTTTACACTATCTACGCTCGAGGCAAGAACGAAGTTTTGGCTGGCATCTTAACCGGTTGA
Protein sequence
MSPTPEEPNNLQNGIETESHISSESERADERRSHPETLADTIPNAGLRPEQESESVNEEPDSEPENRGQQPSESVRLQVVMDVADPKETSTPSNGTDNSQPALRKDEGSRTFTMRELLNGLKGEDGSDSVNGSEGDVPEPNSAYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNVSPDSTTPSKDALLEEACKKYDEATHFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRGAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDASPNDLYSQSAIYIASAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHEVLQKLKIGGEQVQSSPSVLGRSGSTLNGGDRTVKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDAWLDAIRLVYTIYARGKNEVLAGILTG
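The protein sequence above is consistent with a frame-0 translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code (the page does not state which translation table was used) and using a hypothetical helper named `translate`, the relationship can be checked on the first 30 and last 18 bases of the CDS:

```python
# Standard genetic code with bases ordered T, C, A, G.
# Assumption: the page does not say which translation table was applied.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon.

    `translate` is a hypothetical helper written for this illustration.
    """
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for pos in range(0, usable, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 and last 18 bases of the CDS shown above.
cds_start = "ATGTCTCCTACTCCCGAGGAACCCAACAAT"
cds_end = "GGCATCTTAACCGGTTGA"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MSPTPEEPNN and GILTG, which match the first ten and last five residues of the protein sequence; the final TGA codon is a stop and contributes no residue.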
Homology
BLAST of Sed0004618 vs. NCBI nr
Match:
XP_004146133.1 (protein HLB1 isoform X1 [Cucumis sativus] >KGN55671.1 hypothetical protein Csa_010331 [Cucumis sativus])
HSP 1 Score: 929.5 bits (2401), Expect = 1.3e-266
Identity = 482/551 (87.48%), Positives = 509/551 (92.38%), Query Frame = 0
Query: 1 MSPTPEEPNNLQNGIETESHISSESERADERRSHP-ETLADTIPNAGLRPEQESESV-NE 60
MSPTPEEPNNLQNGIE + HISSES++ E RS P E D+IP++ L+ E+ESESV N
Sbjct: 1 MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNG 60
Query: 61 EPDSEPENRGQQPSESVRLQVVMDVADP-----KETSTPSNG-TDNSQPALRKDEGSRTF 120
PDSEPE+ +Q SES+ L VV V DP KETSTPSNG T+N QPALRKDEGSRTF
Sbjct: 61 VPDSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPSNGNTENLQPALRKDEGSRTF 120
Query: 121 TMRELLNGLKGEDGSDSVNGSEGDVPEPNSAYSLNQDSPHQPYSEQSRAAMELINSVTGV 180
TMRELLNGLKGEDGSDS+N SEG+ PE NS YSLNQDSPHQPYSEQSRAAMELINSVTGV
Sbjct: 121 TMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGV 180
Query: 181 DEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNVSPDSTTPSKDALLE 240
DEEGRSRQRILTFAA+RYASAIERN QDYDALYNWALVLQESADNVSPDST+PSKDALLE
Sbjct: 181 DEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLE 240
Query: 241 EACKKYDEATHFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN 300
EACKKYDEATH CPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
Sbjct: 241 EACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN 300
Query: 301 SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRGAIQLQFDFHRAIYNLGTVLYGLA 360
SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFR AIQLQFDFHRAIYNLGTVLYGLA
Sbjct: 301 SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLA 360
Query: 361 EDTLRTGGTGNVKDASPNDLYSQSAIYIASAHALKPNYSVYSSALRLVRSMLPLPYLKVG 420
EDTLRTGG+GNVKD SPN+LYSQSAIYIA+AHALKPNYSVYSSALRLVRSMLPLPYLKVG
Sbjct: 361 EDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVG 420
Query: 421 YLTAPPLGRPLAPHSDWKRSQFFLNHEVLQKLKIGGEQVQSSPSVLGRSGSTLNGGDRTV 480
YLTAPP+GRPLAPHSDWKRSQFFLNH+VLQKL IGGEQ+Q+SPS+LGRSGSTLN GDRT+
Sbjct: 421 YLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLN-GDRTI 480
Query: 481 KVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDAWLDAIRLVYTIYARG 540
KVEIPDIVSVSACADLTLPPGAGLCIDTIHGP+FLVADSWD LD WLDAIRLVYTIYARG
Sbjct: 481 KVEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARG 540
Query: 541 KNEVLAGILTG 544
KNEVLAGI+TG
Sbjct: 541 KNEVLAGIITG 550
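The percentages reported for each hit follow from the two counts given next to them: the number of matching columns divided by the alignment length. A short sketch, using a hypothetical helper named `percent` and the counts reported for the hit above, reproduces those figures:

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of the alignment length, to two decimals.

    `percent` is a hypothetical helper written for this illustration.
    """
    return round(100.0 * matches / alignment_length, 2)


# Counts taken from the XP_004146133.1 hit: 482 identical and
# 509 similar columns out of 551 aligned columns.
print(percent(482, 551))
print(percent(509, 551))
```

This gives 87.48 and 92.38, the same values shown for that hit. The same calculation applies to the other hits listed on this page.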
BLAST of Sed0004618 vs. NCBI nr
Match:
XP_008448563.1 (PREDICTED: uncharacterized protein LOC103490705 isoform X1 [Cucumis melo])
HSP 1 Score: 921.8 bits (2381), Expect = 2.8e-264
Identity = 478/551 (86.75%), Positives = 506/551 (91.83%), Query Frame = 0
Query: 1 MSPTPEEPNNLQNGIETESHISSESERADERRSH-PETLADTIPNAGLRPEQESESV-NE 60
MSPTPEEPNNLQNGIE + HISSES++ E RS E AD+IP++ L+ E+ESESV N
Sbjct: 1 MSPTPEEPNNLQNGIEIQPHISSESDQISEPRSELEEPTADSIPSSELQQERESESVSNG 60
Query: 61 EPDSEPENRGQQPSESVRLQVVMDVADP-----KETSTPSNG-TDNSQPALRKDEGSRTF 120
DSEPE+ +Q SES+ L VV V DP KETSTP NG T+N QPALRKDEGSRTF
Sbjct: 61 VADSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPFNGNTENLQPALRKDEGSRTF 120
Query: 121 TMRELLNGLKGEDGSDSVNGSEGDVPEPNSAYSLNQDSPHQPYSEQSRAAMELINSVTGV 180
TMRELLNGLKGEDGSD +N SEG+ PE NS +SLNQDSPHQPYSEQSRAAMELINS+TGV
Sbjct: 121 TMRELLNGLKGEDGSDGLNESEGERPEGNSGHSLNQDSPHQPYSEQSRAAMELINSITGV 180
Query: 181 DEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNVSPDSTTPSKDALLE 240
DEEGRSRQRILTFAA+RYASAIERN QDYDALYNWALVLQESADNVSPDST+PSKDALLE
Sbjct: 181 DEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLE 240
Query: 241 EACKKYDEATHFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN 300
EACKKYDEATH CPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
Sbjct: 241 EACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN 300
Query: 301 SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRGAIQLQFDFHRAIYNLGTVLYGLA 360
SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFR AIQLQFDFHRAIYNLGTVLYGLA
Sbjct: 301 SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLA 360
Query: 361 EDTLRTGGTGNVKDASPNDLYSQSAIYIASAHALKPNYSVYSSALRLVRSMLPLPYLKVG 420
EDTLRTGGTGN+KD SPN+LYSQSAIYIA+AHALKPNYSVYSSALRLVRSMLPLPYLKVG
Sbjct: 361 EDTLRTGGTGNIKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVG 420
Query: 421 YLTAPPLGRPLAPHSDWKRSQFFLNHEVLQKLKIGGEQVQSSPSVLGRSGSTLNGGDRTV 480
YLTAPP+GRPLAPHSDWKRSQFFLNH+VLQKL IGGEQ+Q+SPS LGRSGSTLN GDRT+
Sbjct: 421 YLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSTLGRSGSTLN-GDRTI 480
Query: 481 KVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDAWLDAIRLVYTIYARG 540
KVEIPDIVSVSACADLTLPPGAGLCIDTIHGP+FLVADSWDALD WLDAIRLVYTIYARG
Sbjct: 481 KVEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDALDGWLDAIRLVYTIYARG 540
Query: 541 KNEVLAGILTG 544
KNEVLAGI+TG
Sbjct: 541 KNEVLAGIITG 550
BLAST of Sed0004618 vs. NCBI nr
Match:
XP_038876586.1 (protein HLB1 [Benincasa hispida])
HSP 1 Score: 919.8 bits (2376), Expect = 1.1e-263
Identity = 477/550 (86.73%), Positives = 503/550 (91.45%), Query Frame = 0
Query: 1 MSPTPEEPNNLQNGIETESHISSESERADERRSHPETLADTIPNAGLRPEQESESVNE-E 60
MSPTPEEPNNLQNGIE + HIS ES++ E RS PE AD I ++ L E+ESESVN
Sbjct: 1 MSPTPEEPNNLQNGIEIQPHISPESDQTSEPRSEPEPTADAILSSELHQERESESVNNGV 60
Query: 61 PDSEPENRGQQPSESVRLQVVMDVADP-----KETSTPSNG-TDNSQPALRKDEGSRTFT 120
DSEP +R +Q ES+ LQV DVADP KETS PSNG T+NS+PALRKDEGSRTFT
Sbjct: 61 ADSEPVSRRKQLPESIHLQVETDVADPRFEEHKETSIPSNGNTENSKPALRKDEGSRTFT 120
Query: 121 MRELLNGLKGEDGSDSVNGSEGDVPEPNSAYSLNQDSPHQPYSEQSRAAMELINSVTGVD 180
MRELLNGLKGEDG+DS+N SEG+ PE N YSLNQDSPHQPYSEQSRAAMELI+SVTGVD
Sbjct: 121 MRELLNGLKGEDGNDSLNESEGERPEGNPGYSLNQDSPHQPYSEQSRAAMELISSVTGVD 180
Query: 181 EEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNVSPDSTTPSKDALLEE 240
EEGRSRQRILTFAA+RYASAIERN QDYDALYNWALVLQESADNVSPDST+PSKDALLEE
Sbjct: 181 EEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEE 240
Query: 241 ACKKYDEATHFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNS 300
ACKKYDEAT CPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNS
Sbjct: 241 ACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNS 300
Query: 301 PQALNNWGLALQELSAIVPAREKQTIVKTAISKFRGAIQLQFDFHRAIYNLGTVLYGLAE 360
PQALNNWGLALQELSAIVPAREKQTIVKTAISKFR AIQLQFDFHRAIYNLGTVLYGLAE
Sbjct: 301 PQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAE 360
Query: 361 DTLRTGGTGNVKDASPNDLYSQSAIYIASAHALKPNYSVYSSALRLVRSMLPLPYLKVGY 420
DTLRTGGTGNVKD SPN+LYSQSAIYIA+AHALKPNYSVYSSALRLVRSMLPLPYLKVGY
Sbjct: 361 DTLRTGGTGNVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGY 420
Query: 421 LTAPPLGRPLAPHSDWKRSQFFLNHEVLQKLKIGGEQVQSSPSVLGRSGSTLNGGDRTVK 480
LTAPP+GRPLAPH DWKRSQFFLNH+VLQKL IGGEQ+Q+SPS+LGRSGSTLN GD T+K
Sbjct: 421 LTAPPVGRPLAPHGDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLN-GDWTIK 480
Query: 481 VEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDAWLDAIRLVYTIYARGK 540
VEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALD WLDAIRLVYTIYARGK
Sbjct: 481 VEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGK 540
Query: 541 NEVLAGILTG 544
NEVLAGI+TG
Sbjct: 541 NEVLAGIITG 549
BLAST of Sed0004618 vs. NCBI nr
Match:
XP_022965252.1 (protein HLB1-like isoform X2 [Cucurbita maxima])
HSP 1 Score: 908.3 bits (2346), Expect = 3.2e-260
Identity = 469/552 (84.96%), Positives = 496/552 (89.86%), Query Frame = 0
Query: 1 MSPTPEEPNNLQNGIETESHISSESERADERRSHPETLADTIPNAGLRPEQESESVNEEP 60
MSP PEEPNNLQNGIE E HIS ES + E +S PE+ AD IP A L+ E+ESESVN
Sbjct: 1 MSPIPEEPNNLQNGIEIEPHISVESNQIGESKSEPESTADVIPTAELQQERESESVNGVA 60
Query: 61 DSEPENRGQQP----SESVRLQVVMDVAD-----PKETSTPSNGTDNSQPALRKDEGSRT 120
DSEP++ P SES+ LQVV DV D PK TS SNG +NSQPALRKDEGSRT
Sbjct: 61 DSEPQSELDSPRKQLSESIELQVVTDVTDPRFEEPKGTSISSNGAENSQPALRKDEGSRT 120
Query: 121 FTMRELLNGLKGEDGSDSVNGSEGDVPEPNSAYSLNQDSPHQPYSEQSRAAMELINSVTG 180
FTMRELLNGLK EDG+DS+N SEG+ PE NS YSLNQDSPHQPYSEQSRAAMELINSVTG
Sbjct: 121 FTMRELLNGLKVEDGNDSLNESEGEKPEANSGYSLNQDSPHQPYSEQSRAAMELINSVTG 180
Query: 181 VDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNVSPDSTTPSKDALL 240
VDEEGRSRQRILTFAA+RYASAIERN QDYDALYNWALVLQESADNVSPDST+PSKDALL
Sbjct: 181 VDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALL 240
Query: 241 EEACKKYDEATHFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNW 300
EEACKKYDEAT CPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQAT+NYEKAVQLNW
Sbjct: 241 EEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATRNYEKAVQLNW 300
Query: 301 NSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRGAIQLQFDFHRAIYNLGTVLYGL 360
NSPQALNNWGLALQELSAIVPAREK TIVKTAISKFR AIQLQFDFHRAIYNLGTVLYGL
Sbjct: 301 NSPQALNNWGLALQELSAIVPAREKPTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGL 360
Query: 361 AEDTLRTGGTGNVKDASPNDLYSQSAIYIASAHALKPNYSVYSSALRLVRSMLPLPYLKV 420
AEDTLRTGGTG VKD SPN+LYSQSAIYIA+AHALKP+YSVYSSALRLVRSMLPLPYLKV
Sbjct: 361 AEDTLRTGGTGTVKDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRLVRSMLPLPYLKV 420
Query: 421 GYLTAPPLGRPLAPHSDWKRSQFFLNHEVLQKLKIGGEQVQSSPSVLGRSGSTLNGGDRT 480
GYLTAPP+GRP APH DWKRSQFFLNH+VLQKL IGGEQ+Q+SP++LGRSGSTLN GDRT
Sbjct: 421 GYLTAPPVGRPFAPHGDWKRSQFFLNHDVLQKLNIGGEQIQTSPTLLGRSGSTLN-GDRT 480
Query: 481 VKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDAWLDAIRLVYTIYAR 540
+KVEIPDIVSVSACADLTLPPGAGLCIDTIHG +FLVADSWDALD WLDAIRLVYTIYAR
Sbjct: 481 MKVEIPDIVSVSACADLTLPPGAGLCIDTIHGQIFLVADSWDALDGWLDAIRLVYTIYAR 540
Query: 541 GKNEVLAGILTG 544
GKNEVLAGI+ G
Sbjct: 541 GKNEVLAGIIAG 551
BLAST of Sed0004618 vs. NCBI nr
Match:
XP_023552571.1 (protein HLB1-like [Cucurbita pepo subsp. pepo])
HSP 1 Score: 903.3 bits (2333), Expect = 1.0e-258
Identity = 472/580 (81.38%), Positives = 500/580 (86.21%), Query Frame = 0
Query: 1 MSPTPEEPNNLQNGIETESHISSESERADERRSHPETLADTIPNAGLRPEQESESVN--- 60
MSPTPEEPNNLQNGIE ESHIS ES + E +S PE+ AD +P A L+ E++SESVN
Sbjct: 1 MSPTPEEPNNLQNGIEIESHISVESNQIGESKSEPESTADVVPTAELQQERQSESVNGVA 60
Query: 61 -----------------------------EEPDSEPENRGQQPSESVRLQVVMDVAD--- 120
EP SE ++ +Q SES+ LQVV DV D
Sbjct: 61 GLEPQSELVIPTAELQQERESESFNGAADSEPQSELDSPRKQLSESIELQVVTDVTDPRF 120
Query: 121 --PKETSTPSNGTDNSQPALRKDEGSRTFTMRELLNGLKGEDGSDSVNGSEGDVPEPNSA 180
PK TS SNGT+NSQPALRKDEGSRTFTMRELLNGLK EDG+DS+N SEG+ PE NS
Sbjct: 121 EEPKGTSISSNGTENSQPALRKDEGSRTFTMRELLNGLKVEDGNDSLNESEGEKPEANSG 180
Query: 181 YSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDA 240
YSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAA+RYASAIERN QDYDA
Sbjct: 181 YSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDA 240
Query: 241 LYNWALVLQESADNVSPDSTTPSKDALLEEACKKYDEATHFCPTLHDAFYNWAIAISDRA 300
LYNWALVLQESADNVSPDST+PSKDALLEEACKKYDEAT CPTLHDAFYNWAIAISDRA
Sbjct: 241 LYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRA 300
Query: 301 KMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTA 360
KMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTA
Sbjct: 301 KMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTA 360
Query: 361 ISKFRGAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDASPNDLYSQSAIYIASA 420
ISKFR AIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTG VKD SPN+LYSQSAIYIA+A
Sbjct: 361 ISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGTVKDVSPNELYSQSAIYIAAA 420
Query: 421 HALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHEVLQK 480
HALKP+YSVYSSALRLVRSMLPLPYLKVGYLTAPP+GRP APH DWKRSQFFLNH+VLQK
Sbjct: 421 HALKPSYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPFAPHGDWKRSQFFLNHDVLQK 480
Query: 481 LKIGGEQVQSSPSVLGRSGSTLNGGDRTVKVEIPDIVSVSACADLTLPPGAGLCIDTIHG 540
L IGGEQ Q+SP++LGRSGSTLN GDRT+KVEIPDIVSVSACADLTLPPGAGLCIDTIHG
Sbjct: 481 LNIGGEQTQTSPTLLGRSGSTLN-GDRTMKVEIPDIVSVSACADLTLPPGAGLCIDTIHG 540
Query: 541 PVFLVADSWDALDAWLDAIRLVYTIYARGKNEVLAGILTG 544
+FLVADSWDALD WLDAIRLVYTIYARGKNEVLAGI+ G
Sbjct: 541 QIFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIIAG 579
BLAST of Sed0004618 vs. ExPASy Swiss-Prot
Match:
Q9FHY8 (Protein HLB1 OS=Arabidopsis thaliana OX=3702 GN=HLB1 PE=1 SV=1)
HSP 1 Score: 614.0 bits (1582), Expect = 1.6e-174
Identity = 352/578 (60.90%), Positives = 417/578 (72.15%), Query Frame = 0
Query: 1 MSPTPEEPNNLQNGI-----ETESHISSESERADERR---SHPETLADTIPN------AG 60
M+ T EEP LQNG ETE + E + E + PE AD P
Sbjct: 1 MADTVEEP-QLQNGAAPAESETEQNPIPEPQLQTEPKLTGEIPEIEADLTPEEVQSEVTD 60
Query: 61 LRPEQESESVNEE------PDSEPEN-RGQQPSESVRLQVV--------MDVADPKETST 120
+PE+ V E D++PE + + E V+ V +D++
Sbjct: 61 AKPEEVQSEVKPEEVKTVVTDAKPEEAQSEVKPEEVQSVVTDTKPDLTDVDLSPGGSEEI 120
Query: 121 PSNGTDNSQPAL-----RKDEGSRTFTMRELLNGLKGEDGSDSVNGSEGDVPEPNSAYSL 180
P T+ Q + + D+G++TFTMRELL+ LK E EGD +SA
Sbjct: 121 PIRSTEVEQESTTSVLKKDDDGNKTFTMRELLSELKSE---------EGDGTPHSSASPF 180
Query: 181 NQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYN 240
+++S QP ++ AM+LIN + DEEGRSRQR+L FAA++YASAIERN D+DALYN
Sbjct: 181 SRESASQP--AENNPAMDLINRIQVNDEEGRSRQRVLAFAARKYASAIERNPDDHDALYN 240
Query: 241 WALVLQESADNVSPDSTTPSKDALLEEACKKYDEATHFCPTLHDAFYNWAIAISDRAKMR 300
WAL+LQESADNVSPDS +PSKD LLEEACKKYDEAT CPTL+DA+YNWAIAISDRAK+R
Sbjct: 241 WALILQESADNVSPDSVSPSKDDLLEEACKKYDEATRLCPTLYDAYYNWAIAISDRAKIR 300
Query: 301 GRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISK 360
GRTKEAEELW+QA NYEKAVQLNWNS QALNNWGL LQELS IVPAREK+ +V+TAISK
Sbjct: 301 GRTKEAEELWEQAADNYEKAVQLNWNSSQALNNWGLVLQELSQIVPAREKEKVVRTAISK 360
Query: 361 FRGAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDASPNDLYSQSAIYIASAHAL 420
FR AI+LQFDFHRAIYNLGTVLYGLAEDTLRTGG+GN KD P +LYSQSAIYIA+AH+L
Sbjct: 361 FRAAIRLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNGKDMPPGELYSQSAIYIAAAHSL 420
Query: 421 KPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHE-VLQKLK 480
KP+YSVYSSALRLVRSMLPLP+LKVGYLTAPP+G LAPHSDWKR++F LNHE +LQ LK
Sbjct: 421 KPSYSVYSSALRLVRSMLPLPHLKVGYLTAPPVGNSLAPHSDWKRTEFELNHERLLQVLK 480
Query: 481 IGGEQVQSSPSVLGRSGSTLNGGDRTVKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPV 540
++ + S + ST N +TVKV I +IVSV+ CADLTLPPGAGLCIDTIHGPV
Sbjct: 481 PEPREMGRNLSGKAETMST-NVERKTVKVNITEIVSVTPCADLTLPPGAGLCIDTIHGPV 540
Query: 541 FLVADSWDALDAWLDAIRLVYTIYARGKNEVLAGILTG 544
FLVADSW++LD WLDAIRLVYTIYARGK++VLAGI+TG
Sbjct: 541 FLVADSWESLDGWLDAIRLVYTIYARGKSDVLAGIITG 565
BLAST of Sed0004618 vs. ExPASy TrEMBL
Match:
A0A0A0L688 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G002900 PE=4 SV=1)
HSP 1 Score: 929.5 bits (2401), Expect = 6.5e-267
Identity = 482/551 (87.48%), Positives = 509/551 (92.38%), Query Frame = 0
Query: 1 MSPTPEEPNNLQNGIETESHISSESERADERRSHP-ETLADTIPNAGLRPEQESESV-NE 60
MSPTPEEPNNLQNGIE + HISSES++ E RS P E D+IP++ L+ E+ESESV N
Sbjct: 1 MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNG 60
Query: 61 EPDSEPENRGQQPSESVRLQVVMDVADP-----KETSTPSNG-TDNSQPALRKDEGSRTF 120
PDSEPE+ +Q SES+ L VV V DP KETSTPSNG T+N QPALRKDEGSRTF
Sbjct: 61 VPDSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPSNGNTENLQPALRKDEGSRTF 120
Query: 121 TMRELLNGLKGEDGSDSVNGSEGDVPEPNSAYSLNQDSPHQPYSEQSRAAMELINSVTGV 180
TMRELLNGLKGEDGSDS+N SEG+ PE NS YSLNQDSPHQPYSEQSRAAMELINSVTGV
Sbjct: 121 TMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGV 180
Query: 181 DEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNVSPDSTTPSKDALLE 240
DEEGRSRQRILTFAA+RYASAIERN QDYDALYNWALVLQESADNVSPDST+PSKDALLE
Sbjct: 181 DEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLE 240
Query: 241 EACKKYDEATHFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN 300
EACKKYDEATH CPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
Sbjct: 241 EACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN 300
Query: 301 SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRGAIQLQFDFHRAIYNLGTVLYGLA 360
SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFR AIQLQFDFHRAIYNLGTVLYGLA
Sbjct: 301 SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLA 360
Query: 361 EDTLRTGGTGNVKDASPNDLYSQSAIYIASAHALKPNYSVYSSALRLVRSMLPLPYLKVG 420
EDTLRTGG+GNVKD SPN+LYSQSAIYIA+AHALKPNYSVYSSALRLVRSMLPLPYLKVG
Sbjct: 361 EDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVG 420
Query: 421 YLTAPPLGRPLAPHSDWKRSQFFLNHEVLQKLKIGGEQVQSSPSVLGRSGSTLNGGDRTV 480
YLTAPP+GRPLAPHSDWKRSQFFLNH+VLQKL IGGEQ+Q+SPS+LGRSGSTLN GDRT+
Sbjct: 421 YLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLN-GDRTI 480
Query: 481 KVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDAWLDAIRLVYTIYARG 540
KVEIPDIVSVSACADLTLPPGAGLCIDTIHGP+FLVADSWD LD WLDAIRLVYTIYARG
Sbjct: 481 KVEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARG 540
Query: 541 KNEVLAGILTG 544
KNEVLAGI+TG
Sbjct: 541 KNEVLAGIITG 550
BLAST of Sed0004618 vs. ExPASy TrEMBL
Match:
A0A1S3BJC9 (uncharacterized protein LOC103490705 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103490705 PE=4 SV=1)
HSP 1 Score: 921.8 bits (2381), Expect = 1.4e-264
Identity = 478/551 (86.75%), Positives = 506/551 (91.83%), Query Frame = 0
Query: 1 MSPTPEEPNNLQNGIETESHISSESERADERRSH-PETLADTIPNAGLRPEQESESV-NE 60
MSPTPEEPNNLQNGIE + HISSES++ E RS E AD+IP++ L+ E+ESESV N
Sbjct: 1 MSPTPEEPNNLQNGIEIQPHISSESDQISEPRSELEEPTADSIPSSELQQERESESVSNG 60
Query: 61 EPDSEPENRGQQPSESVRLQVVMDVADP-----KETSTPSNG-TDNSQPALRKDEGSRTF 120
DSEPE+ +Q SES+ L VV V DP KETSTP NG T+N QPALRKDEGSRTF
Sbjct: 61 VADSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPFNGNTENLQPALRKDEGSRTF 120
Query: 121 TMRELLNGLKGEDGSDSVNGSEGDVPEPNSAYSLNQDSPHQPYSEQSRAAMELINSVTGV 180
TMRELLNGLKGEDGSD +N SEG+ PE NS +SLNQDSPHQPYSEQSRAAMELINS+TGV
Sbjct: 121 TMRELLNGLKGEDGSDGLNESEGERPEGNSGHSLNQDSPHQPYSEQSRAAMELINSITGV 180
Query: 181 DEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNVSPDSTTPSKDALLE 240
DEEGRSRQRILTFAA+RYASAIERN QDYDALYNWALVLQESADNVSPDST+PSKDALLE
Sbjct: 181 DEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLE 240
Query: 241 EACKKYDEATHFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN 300
EACKKYDEATH CPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
Sbjct: 241 EACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN 300
Query: 301 SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRGAIQLQFDFHRAIYNLGTVLYGLA 360
SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFR AIQLQFDFHRAIYNLGTVLYGLA
Sbjct: 301 SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLA 360
Query: 361 EDTLRTGGTGNVKDASPNDLYSQSAIYIASAHALKPNYSVYSSALRLVRSMLPLPYLKVG 420
EDTLRTGGTGN+KD SPN+LYSQSAIYIA+AHALKPNYSVYSSALRLVRSMLPLPYLKVG
Sbjct: 361 EDTLRTGGTGNIKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVG 420
Query: 421 YLTAPPLGRPLAPHSDWKRSQFFLNHEVLQKLKIGGEQVQSSPSVLGRSGSTLNGGDRTV 480
YLTAPP+GRPLAPHSDWKRSQFFLNH+VLQKL IGGEQ+Q+SPS LGRSGSTLN GDRT+
Sbjct: 421 YLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSTLGRSGSTLN-GDRTI 480
Query: 481 KVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDAWLDAIRLVYTIYARG 540
KVEIPDIVSVSACADLTLPPGAGLCIDTIHGP+FLVADSWDALD WLDAIRLVYTIYARG
Sbjct: 481 KVEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDALDGWLDAIRLVYTIYARG 540
Query: 541 KNEVLAGILTG 544
KNEVLAGI+TG
Sbjct: 541 KNEVLAGIITG 550
BLAST of Sed0004618 vs. ExPASy TrEMBL
Match:
A0A6J1HJU5 (protein HLB1-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111465172 PE=4 SV=1)
HSP 1 Score: 908.3 bits (2346), Expect = 1.5e-260
Identity = 469/552 (84.96%), Positives = 496/552 (89.86%), Query Frame = 0
Query: 1 MSPTPEEPNNLQNGIETESHISSESERADERRSHPETLADTIPNAGLRPEQESESVNEEP 60
MSP PEEPNNLQNGIE E HIS ES + E +S PE+ AD IP A L+ E+ESESVN
Sbjct: 1 MSPIPEEPNNLQNGIEIEPHISVESNQIGESKSEPESTADVIPTAELQQERESESVNGVA 60
Query: 61 DSEPENRGQQP----SESVRLQVVMDVAD-----PKETSTPSNGTDNSQPALRKDEGSRT 120
DSEP++ P SES+ LQVV DV D PK TS SNG +NSQPALRKDEGSRT
Sbjct: 61 DSEPQSELDSPRKQLSESIELQVVTDVTDPRFEEPKGTSISSNGAENSQPALRKDEGSRT 120
Query: 121 FTMRELLNGLKGEDGSDSVNGSEGDVPEPNSAYSLNQDSPHQPYSEQSRAAMELINSVTG 180
FTMRELLNGLK EDG+DS+N SEG+ PE NS YSLNQDSPHQPYSEQSRAAMELINSVTG
Sbjct: 121 FTMRELLNGLKVEDGNDSLNESEGEKPEANSGYSLNQDSPHQPYSEQSRAAMELINSVTG 180
Query: 181 VDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNVSPDSTTPSKDALL 240
VDEEGRSRQRILTFAA+RYASAIERN QDYDALYNWALVLQESADNVSPDST+PSKDALL
Sbjct: 181 VDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALL 240
Query: 241 EEACKKYDEATHFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNW 300
EEACKKYDEAT CPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQAT+NYEKAVQLNW
Sbjct: 241 EEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATRNYEKAVQLNW 300
Query: 301 NSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRGAIQLQFDFHRAIYNLGTVLYGL 360
NSPQALNNWGLALQELSAIVPAREK TIVKTAISKFR AIQLQFDFHRAIYNLGTVLYGL
Sbjct: 301 NSPQALNNWGLALQELSAIVPAREKPTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGL 360
Query: 361 AEDTLRTGGTGNVKDASPNDLYSQSAIYIASAHALKPNYSVYSSALRLVRSMLPLPYLKV 420
AEDTLRTGGTG VKD SPN+LYSQSAIYIA+AHALKP+YSVYSSALRLVRSMLPLPYLKV
Sbjct: 361 AEDTLRTGGTGTVKDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRLVRSMLPLPYLKV 420
Query: 421 GYLTAPPLGRPLAPHSDWKRSQFFLNHEVLQKLKIGGEQVQSSPSVLGRSGSTLNGGDRT 480
GYLTAPP+GRP APH DWKRSQFFLNH+VLQKL IGGEQ+Q+SP++LGRSGSTLN GDRT
Sbjct: 421 GYLTAPPVGRPFAPHGDWKRSQFFLNHDVLQKLNIGGEQIQTSPTLLGRSGSTLN-GDRT 480
Query: 481 VKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDAWLDAIRLVYTIYAR 540
+KVEIPDIVSVSACADLTLPPGAGLCIDTIHG +FLVADSWDALD WLDAIRLVYTIYAR
Sbjct: 481 MKVEIPDIVSVSACADLTLPPGAGLCIDTIHGQIFLVADSWDALDGWLDAIRLVYTIYAR 540
Query: 541 GKNEVLAGILTG 544
GKNEVLAGI+ G
Sbjct: 541 GKNEVLAGIIAG 551
BLAST of Sed0004618 vs. ExPASy TrEMBL
Match:
A0A6J1EA05 (protein HLB1-like OS=Cucurbita moschata OX=3662 GN=LOC111431218 PE=4 SV=1)
HSP 1 Score: 898.7 bits (2321), Expect = 1.2e-257
Identity = 470/580 (81.03%), Positives = 498/580 (85.86%), Query Frame = 0
Query: 1 MSPTPEEPNNLQNGIETESHISSESERADERRSHPETLADTIPNAGLRPEQESESVNEEP 60
MSPTPEEPNNLQNGIE E HIS ES + E +S PE+ AD +P A L+ E+E ESVN
Sbjct: 1 MSPTPEEPNNLQNGIEIEPHISVESNQIGESKSEPESTADVVPTAELQQERELESVNGVE 60
Query: 61 DSEPENR--------------------------------GQQPSESVRLQVVMDVAD--- 120
D EP++ +Q SES++LQV DVAD
Sbjct: 61 DLEPQSELVIPTAELQQERESESVNGVADSELQSELDSPRKQLSESIQLQVATDVADPRF 120
Query: 121 --PKETSTPSNGTDNSQPALRKDEGSRTFTMRELLNGLKGEDGSDSVNGSEGDVPEPNSA 180
PK TS SNGT+NSQPALRKDEGSRTFTMRELLNGLK EDG+DS+N SEG+ PE NS
Sbjct: 121 EEPKGTSISSNGTENSQPALRKDEGSRTFTMRELLNGLKVEDGNDSLNESEGEKPEANSG 180
Query: 181 YSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDA 240
YSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAA+RYASAIERN QDYDA
Sbjct: 181 YSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDA 240
Query: 241 LYNWALVLQESADNVSPDSTTPSKDALLEEACKKYDEATHFCPTLHDAFYNWAIAISDRA 300
LYNWALVLQESADNVSPDST+PSKDALLEEACKKYDEAT CPTLHDAFYNWAIAISDRA
Sbjct: 241 LYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRA 300
Query: 301 KMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTA 360
KMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTA
Sbjct: 301 KMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTA 360
Query: 361 ISKFRGAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDASPNDLYSQSAIYIASA 420
ISKFR AIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTG VKD SPN+LYSQSAIYIA+A
Sbjct: 361 ISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGTVKDVSPNELYSQSAIYIAAA 420
Query: 421 HALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHEVLQK 480
HALKP+YSVYSSALRLVRSMLPLPYLKVGYLTAPP+GRP APH DWKRSQFFLNH+VLQK
Sbjct: 421 HALKPSYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPFAPHGDWKRSQFFLNHDVLQK 480
Query: 481 LKIGGEQVQSSPSVLGRSGSTLNGGDRTVKVEIPDIVSVSACADLTLPPGAGLCIDTIHG 540
L IGGEQ Q+SP++LGRSGSTLN GDRT+KVEIPDIVSVSACADLTLPPGAGLCIDTIHG
Sbjct: 481 LNIGGEQTQTSPTLLGRSGSTLN-GDRTMKVEIPDIVSVSACADLTLPPGAGLCIDTIHG 540
Query: 541 PVFLVADSWDALDAWLDAIRLVYTIYARGKNEVLAGILTG 544
+FLVADSWDALD WLDAIRLVYTIYARGKNEVLAGI+ G
Sbjct: 541 QIFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIIAG 579
BLAST of Sed0004618 vs. ExPASy TrEMBL
Match:
A0A6J1KVY8 (protein HLB1-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111498676 PE=4 SV=1)
HSP 1 Score: 897.9 bits (2319), Expect = 2.1e-257
Identity = 468/554 (84.48%), Positives = 498/554 (89.89%), Query Frame = 0
Query: 1 MSPTPEEPNNLQNGIETESHISSESERADERRSHPETLADTIPNAGLRPEQESES----- 60
MS TPEEPNNLQNGI TE ISSESE+ DE RS PE +AD IP A + E+ESES
Sbjct: 1 MSTTPEEPNNLQNGIVTEPQISSESEQTDESRSEPERIADAIPKAESQLERESESESVYV 60
Query: 61 -VNEEPDSEPENRGQQPSESVRLQVVMDVADP-----KETSTPSNGTDNSQPALRKDEGS 120
E +SE +R +Q SES+ LQVV +V+DP K TS PSNG +NSQP LRKDEGS
Sbjct: 61 EAEAEAESELASRRKQLSESLPLQVVTNVSDPKFDESKGTSIPSNGIENSQPTLRKDEGS 120
Query: 121 RTFTMRELLNGLKGEDGSDSVNGSEGDVPEPNSAYSLNQDSPHQPYSEQSRAAMELINSV 180
RTFTMRELLNGLKGEDG+DSVN SEG+ P+ YSLNQDSP QPYSEQSRAAMELI+SV
Sbjct: 121 RTFTMRELLNGLKGEDGNDSVNESEGEKPD---GYSLNQDSPQQPYSEQSRAAMELISSV 180
Query: 181 TGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNVSPDSTTPSKDA 240
TGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNVSPDST+PSKDA
Sbjct: 181 TGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDA 240
Query: 241 LLEEACKKYDEATHFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQL 300
LLEEACKKYDEAT CPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQL
Sbjct: 241 LLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQL 300
Query: 301 NWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRGAIQLQFDFHRAIYNLGTVLY 360
NWNSPQALNNWGLALQELSAIVPAREKQTIV+TAISKFR AIQLQFDFHRAIYNLGTVLY
Sbjct: 301 NWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLY 360
Query: 361 GLAEDTLRTGGTGNVKDASPNDLYSQSAIYIASAHALKPNYSVYSSALRLVRSMLPLPYL 420
GLAEDTLRTGGTGN KD SPN+LYSQSAIYIA+AHALKP+YSVYSSALRLVRSMLPLPYL
Sbjct: 361 GLAEDTLRTGGTGNFKDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRLVRSMLPLPYL 420
Query: 421 KVGYLTAPPLGRPLAPHSDWKRSQFFLNHEVLQKLKIGGEQVQSSPSVLGRSGSTLNGGD 480
KVGYLTAPP+G+PLAPHSDWKRSQ+FLNH+VLQKLKIGGEQ+Q+SP+ LGRSGSTLN GD
Sbjct: 421 KVGYLTAPPVGKPLAPHSDWKRSQYFLNHDVLQKLKIGGEQIQTSPNALGRSGSTLN-GD 480
Query: 481 RTVKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDAWLDAIRLVYTIY 540
+KVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALD WLDA+RLVYTIY
Sbjct: 481 MPIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLDAVRLVYTIY 540
Query: 541 ARGKNEVLAGILTG 544
ARGKN+VLAGI TG
Sbjct: 541 ARGKNDVLAGIATG 550
BLAST of Sed0004618 vs. TAIR 10
Match:
AT5G41950.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 614.0 bits (1582), Expect = 1.2e-175
Identity = 352/578 (60.90%), Positives = 417/578 (72.15%), Query Frame = 0
Query: 1 MSPTPEEPNNLQNGI-----ETESHISSESERADERR---SHPETLADTIPN------AG 60
M+ T EEP LQNG ETE + E + E + PE AD P
Sbjct: 1 MADTVEEP-QLQNGAAPAESETEQNPIPEPQLQTEPKLTGEIPEIEADLTPEEVQSEVTD 60
Query: 61 LRPEQESESVNEE------PDSEPEN-RGQQPSESVRLQVV--------MDVADPKETST 120
+PE+ V E D++PE + + E V+ V +D++
Sbjct: 61 AKPEEVQSEVKPEEVKTVVTDAKPEEAQSEVKPEEVQSVVTDTKPDLTDVDLSPGGSEEI 120
Query: 121 PSNGTDNSQPAL-----RKDEGSRTFTMRELLNGLKGEDGSDSVNGSEGDVPEPNSAYSL 180
P T+ Q + + D+G++TFTMRELL+ LK E EGD +SA
Sbjct: 121 PIRSTEVEQESTTSVLKKDDDGNKTFTMRELLSELKSE---------EGDGTPHSSASPF 180
Query: 181 NQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYN 240
+++S QP ++ AM+LIN + DEEGRSRQR+L FAA++YASAIERN D+DALYN
Sbjct: 181 SRESASQP--AENNPAMDLINRIQVNDEEGRSRQRVLAFAARKYASAIERNPDDHDALYN 240
Query: 241 WALVLQESADNVSPDSTTPSKDALLEEACKKYDEATHFCPTLHDAFYNWAIAISDRAKMR 300
WAL+LQESADNVSPDS +PSKD LLEEACKKYDEAT CPTL+DA+YNWAIAISDRAK+R
Sbjct: 241 WALILQESADNVSPDSVSPSKDDLLEEACKKYDEATRLCPTLYDAYYNWAIAISDRAKIR 300
Query: 301 GRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISK 360
GRTKEAEELW+QA NYEKAVQLNWNS QALNNWGL LQELS IVPAREK+ +V+TAISK
Sbjct: 301 GRTKEAEELWEQAADNYEKAVQLNWNSSQALNNWGLVLQELSQIVPAREKEKVVRTAISK 360
Query: 361 FRGAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDASPNDLYSQSAIYIASAHAL 420
FR AI+LQFDFHRAIYNLGTVLYGLAEDTLRTGG+GN KD P +LYSQSAIYIA+AH+L
Sbjct: 361 FRAAIRLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNGKDMPPGELYSQSAIYIAAAHSL 420
Query: 421 KPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHE-VLQKLK 480
KP+YSVYSSALRLVRSMLPLP+LKVGYLTAPP+G LAPHSDWKR++F LNHE +LQ LK
Sbjct: 421 KPSYSVYSSALRLVRSMLPLPHLKVGYLTAPPVGNSLAPHSDWKRTEFELNHERLLQVLK 480
Query: 481 IGGEQVQSSPSVLGRSGSTLNGGDRTVKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPV 540
++ + S + ST N +TVKV I +IVSV+ CADLTLPPGAGLCIDTIHGPV
Sbjct: 481 PEPREMGRNLSGKAETMST-NVERKTVKVNITEIVSVTPCADLTLPPGAGLCIDTIHGPV 540
Query: 541 FLVADSWDALDAWLDAIRLVYTIYARGKNEVLAGILTG 544
FLVADSW++LD WLDAIRLVYTIYARGK++VLAGI+TG
Sbjct: 541 FLVADSWESLDGWLDAIRLVYTIYARGKSDVLAGIITG 565
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
Q9FHY8 | 1.6e-174 | 60.90 | Protein HLB1 OS=Arabidopsis thaliana OX=3702 GN=HLB1 PE=1 SV=1
Match Name | E-value | Identity | Description
A0A0A0L688 | 6.5e-267 | 87.48 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G002900 PE=4 SV=1
A0A1S3BJC9 | 1.4e-264 | 86.75 | uncharacterized protein LOC103490705 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A6J1HJU5 | 1.5e-260 | 84.96 | protein HLB1-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111465172 PE=4 SV...
A0A6J1EA05 | 1.2e-257 | 81.03 | protein HLB1-like OS=Cucurbita moschata OX=3662 GN=LOC111431218 PE=4 SV=1
A0A6J1KVY8 | 2.1e-257 | 84.48 | protein HLB1-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111498676 PE=4 SV...
Match Name | E-value | Identity | Description
AT5G41950.1 | 1.2e-175 | 60.90 | Tetratricopeptide repeat (TPR)-like superfamily protein
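The Identity column in these tables is the percentage printed in each HSP header, which appears to be the match count divided by the alignment length (for example, 352/578 for Q9FHY8). The following is a minimal Python sketch for re-deriving those percentages from a header line, assuming the line format used on this page. The function name `parse_hsp_stats` is hypothetical and is not part of any BLAST tool.

```python
import re

# Matches an HSP statistics line such as
# "Identity = 352/578 (60.90%), Postives = 417/578 (72.15%), Query Frame = 0".
# "Posi?tives" accepts both the "Postives" and "Positives" spellings.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return the reported and recomputed identity/positives percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    id_n, id_len, id_pct, pos_n, pos_len, pos_pct = m.groups()
    return {
        "identity_reported": float(id_pct),
        # Percent identity = identical positions / alignment length.
        "identity_computed": round(100 * int(id_n) / int(id_len), 2),
        "positives_reported": float(pos_pct),
        # Percent positives = conservative-or-identical positions / alignment length.
        "positives_computed": round(100 * int(pos_n) / int(pos_len), 2),
    }


stats = parse_hsp_stats(
    "Identity = 352/578 (60.90%), Postives = 417/578 (72.15%), Query Frame = 0"
)
print(stats)
```

Comparing the reported and computed values is a quick consistency check when alignment headers are copied by hand into a summary table.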