Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TATATTGTGAAATCTCTAACCATTTGATTATGTGGAAGAGAAATCACCAAAATAAACAAGCATGGCTGGTTTAAAAAAAAAGAAAAGAAAAGAAAAGAAAAGAAAAGAAAAAACTTTGTAAAAGAGAGAGTATGAGCAGAGTAGTAATAAATGTCATTTGGTGGTGGTAAACGAGTTTAGCAGAGTTTGACGGCCAGGAAAGGGATTCCTAATCCTGGTATTGTCTATTTGAGCATTGGGAGTGGTTTTCGCTTCACTTTTGGATTGGGATTGTTGTTGTCGCTTCACTCTCAGGCGATCTCTTCTTCTCTCCCTACTTGTCCCGCCCCCGTCTGATACATTTTGATTTTGCTTCGTCATCATCTTGGATTCCGGGTTAGTTTCAGTTGCAGTTGCAGTTGCTTTGTTTCGTCGTGGGATTCTTCCACCTTTCGAGGTCCCCAAAAGCTCAAAGTTGACGTCTCTCCAGCCGGCTTTGTGGGCGTTGTCCCAATCCCAGCCAAACCTTTCGCTTATCTCCAACAGCTCAGAAATGACGTTGTCCCCAGTTGCCCGTGTGTTGCTCATGCTCCCACTCAGAATGTTTTGGGCGAACTCCCACTGGATGCGGTCTTGGTAGACTGGGTCTTCCAGAACTTGGTCTGCCAGAATTGCCCACTCTTCAAACTCCCTTGCTCTCAGATTTTTGGAGACCCATTTCAGATAGCTTGAAGGCAGACTTCCCAGCATTTTGCCCTTGTGCTTTCCAAACCCAATCACCCGGTCTCTGGCTGGAATAGCAGCCGCACTCCCATTCCTTCTCCGCACACACACAGTCAACGCCTTTGGCTTAGGCTGCAAAAGGAATGAGAGTCTCGTCAAACTCAAACTCATGGGAGAGCACGAAGATAAGATGACCACCAGCTTAACCCTCAACCACAACCAACCAATCTTTAATTCTTAATCCATATATAATTCCTCGACCAAAATCGGATATTTTAAAACCCAAGAAGGATAAAAAAAAAAGCGCCTTCTGAAGTCAATGTCAAACTCTCGACTAGTTTCGTAGTAAAAATTTGATTTTTCCAAAACGAAATCATTATTGATTACTGGCCTGGAACGTCGAGTCTGCACGCGCCTTATCCGATACGATAAAGCTGAGTTGGGTGCACAGTGCACTCAGGCTCAGGCTCAGGCTCAGGTTCGCACGATGCTGTCGTCTTTGCAACGCCCCACTGGCCTCCGCAGTTCATCACATCTCTTCTCTCCAATTCCTCATGCTCCTCGTCAATGTCTTCTTCTTCCATGTTCCTCTTCCCTCACTGGTTTTCCCGAGATGTCCACGCAACCTTTGGAATCCAACGCGGCCGAGGTTTCCACATTCAAGCAGTGGCGGAAGAACGACGACGACATGGCCGACGATGAGTACCAAGATAAGGGCATTTCTAGAATTCCCGTGCCTAGACAGAAATACATACCAGTTTCGAAAGTCCAGTTGTTGGATGCCATTGTTTCGACCTTCTTTAACTCCAACCTTGATGATGATGATGATGATGATGGCGACGCTCAACATTTTCTGCTCCTCTCTTCGTCAGTTCGATCTCTCGTTCTATCCATTTCCTTCCTGTTGCTACTCGAAAATTGAACAAATCAAATTCACTTGACCGCTTTTGCAGGTGCTTGGACTCCATTCTTCACGCTGAACACAAGAAAATTTTAGAAGAAATGCGGAACGATTATTCCCTTTCTCAGTCGCTCAATAATGAGGCTACTTTTGATGAGGTTTCAACTAATACTGATGGCCAGCTTGTTTCTAATGAGAAAGAGGAGTTCATCTCAGCCAAGGATGGCATGACTGGGGTCGGGAGCATGGAGGACCTAGTGCAGAAGATTGGAGTTAGCAATCCGATGCCTTTCAGTTATAGTTTGGACTTTCGTAATCTCATGAGTTCTCCGAGGGGTGGCGCAAATAGTTACATCAACGGGGAGTCTCCGTAGGTCTTGAAGTTTTGAATTTCTTTGCATCTTTTTTCATTTGAGATAAATAAACATACAAACATGCATGTGTTTATGATCAAAAGAACATAAAAACTTCAATCAAGTTGATCTCTTTAGTTCTTTATTGGCTAAATTATAAAAAATAACATTGAACTATGAGATCTGTTTAAAAAATGCCCCTGGACTTTGAAAATTTAAAGTTTCAAAAATACCTTTGAACTTTAAAAAATGTTTCAAAAATACCATGAACCGTTAAAGTTTCAGTTTCAAAAATACCAACTTTCAAACAAATTTCAAAAATACCCTTATTGAAAAATGCTCCCTCTTTTTTCCTCCTTTCACTGTAAATTCTTCCTCCATGAAAATTAATGAAAAACTTTACTTATTTGTTGTGTATTCATAGTCCAGGAGCTTTTCTTCTCTTTCAACATATCATCCCAAGAGCTTGATTAGACTTTCGTGTTAAAACTTCTACTTTTGATGAGGTTTCAACTAATAAGATGTAATGCTTATGTGGTTGTACATTCTCTCCTTTTTCCATTCCTAGACAACTTCAACCTGAATCTTAATGAAATACTTCCTTGTGTATTTTTGTTAGAGTAGCAGTTGCCACTCGTTTCCAGCGTGCTTTTATGAAACTTCTTAAAAATGCTCAATTTGAAGAACTCTCAGCCACGGACCTGGTCTTGACATCAGCATTGAATACAGACTATCTGCTTACTTTGCCAATATACGTCGATTGGAAGAGGGCATCCGAGTCTAATGCAATTATTTTCAGGTTCTGCAATAGACTGTTGTTTTTATTATCTTTGTAAAGTTTGAGAGTTAATGCTTGTTTCTTATTTTTTTGTATGTTCTTGTATTCTTTCATTTGTTCAATGAAAGTTTGGTTTCTTACCAAAAAGAAAAAATGCTGGTTGCTTATCTTAGGTATGTTGTACCTTTTAATTCTTAAAAGGATGCCTTGTTTAGTGAGTGCATTTATTATATTTGCTTGACTTAGTTTATTCCTGATATTTAAAAATGATGTGATACAGTAATGTTTTTTACATGGAACCATAGTTAGAGGTCAGGAGTGTGGTATTTTTCAAATTTGGGAAGATTGTTTTAAAACAGTTCACTGTACCTTTATCTGGTGATTCTACCCCATCTCTTTCCTTTTCCTTTTTTTTATAATTTTCTCATAGCACTTTTCTATAACAATGATGCATCTAAACTCTAAAAGAATACTTGGCAGTTGGTTAGGTATTAATATTGATGCCCTACAGGGCTATTTGATACGAAATTAGAAGGTTAGATATGTCGTTGTTTTGCATCTAATTTGTGGAAGCAATGGCCTAGTCAATTATACTATGACTCATGATATATGGTTTCAGCATATATTTTCCTGTTCCAACGTTGGGCAGGTTCCAATGGTGATTGTATTTTCTTTAACAGGCGAGGATATGCAACTGAGAGGCAGAAAGGCCTGTTAATTGTTGATAAACTAGATTATTTACAGTCTAGACTTCTACGTGGATTCTTTTCCATAATCTCAAAACCAGTGGGGAGACTTGGTACTTGGATAGCTGAGGTTGTCCATTTTGGATATTGGTTGCAAACTTTGATCCTCTTTTCTTAAACATCTAATGTGTCAATACTTCTGTAACTACGTTTAACTTCCAGCATCATTCGTTGTCATTCTTGTCACCTTTTCTTTTTGATCATTAACAGGTTGTACTTGGTGCTCCACAGATGCAAGAAATACAAGAATGGGTTAAGAGGTTGAGGCTTTGGGTGAGCGAACTTCCTCTATCTCAACAATTATTTCGTTATGATGAAGAAGATTCTGATGATCTACTGAGAGACAATCGGATTTTGGATAGAGACCTTCCAATTTGGCTGGCAGCTCAGAGTGCAGTGTCTCGTTATGAAGGAATTCTTTCTTCCATGGGACCTCGTGGAAGACTCTTAAGGAGACTGCTTACATGGATAGGACTTCTTCCTCCTATGCCAGAACAACCATTCAAGCTTAATGATGACAGTAAAGCTTCTGAACCTTATTTAAGGTTTGACCATTCTAATTCTCCCATCTAGTTATTTCTATTTTTTTTTCATATAAAAAAGAAAATTCATGTTTAGTTAACCAGCTACGGTACGACTGATACCTTAATTGCTAATTTGATATAATTTTTGGCAATGTTGCTAAAAGTGCTCATGGCCTTTTTGCTGTTTTGATGGATCAAAAGTGAGCATGAAGCCTTGAAAAATAGCAGATCTTGAAAAACTATTTCACAGCAACGCATCATTTAGCTCCTAGACTTTGTAGCCTAAACTGAGCCAATCAGATGTTATCATCTTCTTTTAGAACTTGGTTCTATTATGATACAAAAGGAAATAAAGAGTAAATTACCTTTCCGGTCCTTGAAGTTTGGGCAGCATGTTGATTTCGTCCCTCGGTTTCAAAATCATACAATCAAATCCCCCAAGTTTAGAAAATAGTTCTTTTGAGTCTCTCGCTTAACGACCTCCGATAATTATTTAATGGAGGTGCTAATGTGACATTTTTTAAAATAAATTTTAATAAAATAATCATAATAATAATTTTTTAAAAATTCTCTCCTCTCTTCCCTTTCCCCTCCCTCCCCCTCTTCTTCTACCTCTCTCCATCGTCGCCCTGCTACAACCTCCTTCAATCCCTCCTTTAGGTCCAGCCTACTATGAGGATCTCGTATCCAACAACCCAACCTAAAAAAATGGACCTCATTGCTATCATCGCCGCCACCCACCATGGCCATCTTCTCCTTCAAAAGCCACCCCGCCTCCAAATCTGAAAACAGAAAAACCAAAACCTCTGTTTCCATATCTAGATTTAGGGATGGGGTGGGTTTTTCAGATCTAAAAACGCAGGTGCTTGGGACTTGCAAGAAACAAGAAATAGAGATGGCAATGATGAGTGGTGGAGCTGCATGAAGAAGAAGATGATGGTGTCGGGCGGAGGGGTTCAGATTGGGATTTTTGTTGGAAGAAGAAGAAGAGGGCGACGGTGAATGGCGGCAACGATGACGACGAGTGGTGGCATTGGCAATGGAGGAGGGTTGAAGGGGAAGGACAGGGAAAGGTAGGAGAAGAGGGGGAAGGGAAGGGAAAAGGGGGGGAAGGAGAGTGAGGGAAAGAATTTTTTAAAATTATTATTATTTTATTTTGAAGAATGTCACATCAGCACTTCCGATAAATAATTAATGGAGGCTTTTTACCGAGGGATTCAATAGAACTATTTTCTAAACTTGAGGGACTTGATTGTACGATTTCTAAACCTAGGGACAAAATTGACTGGTTGCCCAAATCTTAGGAACCAGAAAGGTAATACCCAAGAAAATATTACAATATCCAAACACTTCAAGAGGTGGACTCTCCCTAAATAACTACTCTCAAAATTCTCTCTACCTTCATCCCTTTCGCCCACTCTATTTGGAACCACTATTTGCTAACCAATTCTATTAACTTTTACTTAATGATAATATGCCCTTATAATATCCTACTAATATTCCTAATGTTTCCTTACTGATATTCCTAACAGAGTACTTGTGTTTTAATTTCAAATCTGTTAATAAGTAGATATAGTCATGATTTTATTTCGGATGGTCAATGTCTATGATGTTTGTTACTAGCCTCTTAGCCATGTGCGTCATTTATTGTTGATGTTGTTATTGGTTACAACCACTGCTTGTTTAAATGCTGTTGGATGCTGCTTATTGGTCCCATGTTCACTTTAGCACAATATGATAATTTGTTAGAGCCACCACCTTTATGGGCACGAAAGTTATTCGCTTTGACATTGCTTTGAAATTGTTGCAACCTTAGCTGTCATGAATTTAGGTTGGGTGCACTTTTAGATATACAAATGAAGAAAATTTTAGGCTAATTAACTGTATATTAGGAACCTTTGGTCTCTTCCTCTCTGGAAGTCTTTTTTGGCTGTAGTTACATTAGTAGTTTTCCTCTTGAATAACTTTACAATACAATTGTTTCTAATCCAGGCCTATCTTCATATCACGAATAACGCTCAGTGATATATGGAGGCCTGCAATGAAAAATTGTGGGAATGATATTTGGAAACGGTTGAAAACTTCCATTTCCATCCTCCTCTCTCAATCAGTGCTCCAGGTATTCTTTACTTGCTCCTTTTAATTTTCTAAGTGGTTCTCTTCCCATTAGCTGAGCTGAGCTTGAGCTGTTAAAGTAATAAGGATTCACCCGAACTTTGATGTTTATTTATTTATTTACTTTTAACAATAAGAGATTGGGAATTTTAATAAAAATTGAGAGATGCAACCTACACACCGAGGAAAAAGAAACCCCACCTGGGAGAACTATTGCATGAAGTCTTTCCAATTCAAATTAATCAAAAATTATTTTGATTCAAATTAATCAAAAGGCTTCTCTTTGTCTCTTTTGAGCCAAAAAGACCGTATAAAAGCTCTAACTACATATTTCCGCAATACTTTTACTTTATCTTTCAACCACCTATGCGTGAGGGCTTCATGGAGCCAAATATCTAATGCTATCAGGGAGGGCAAAGGAGAGAATAGAAGATTCCAGCAGAAATAACCATCCTCTAGTGGCAAAAGGGCAATGTAAAAACAAGTGATTTTAAGATTCTGTTTCCTTTAGACAGAAATAGTAGGTATTTGGGGAGATGTACCAGTCAGAATTTTTCATCTGTAACTTTCCATAAGTGTTGAGGATTCTATATCCAAGAGACTACAATTTTCTTAAAGACCTTTATGGGTATTTTTCCTTCCTATATCTGATTGATTAAGAGTGTCTTTAACTTTGTTTGAATGGAGTAGAGATTAAAATAAGTTGACTTTATAGAGAAATCCCCAGAATTTTCTAGTTGTCAAAGAGCCTTATCATCTGAAGAACTGATGCTCGCTGCATCAATCTTGTTTCTAAGCTGGAACAAACTTTCAATTTCTCTATAGAAAGGAATCCTTCTAAGCCCTAAATCCCAATTACATGTCTATACTCCAACAGTCACCAAGCACCAAATCTTTTTTGTTTGATATCACGAAGATGTCTGAGAATGATGTTGCAAGGGGGTAGAATCAATCTAGTACATCTTCCCAACTTTGATGCTTATTTTAGTCATGCCTCAATGCTGTTTATTACCTGGTGAAAGTTATGCTATTGTTTTGATCTTGGTTGTGTTCTCTTCAAGAGAATAATATATTTTCTGCATGATGTGTTAATGTTGTGGCATATACATGCAATGCATTATTTGCAGTTTATTGACAGTCATTGTTCATGTGTTAAAGTTTATTTACCAATTCCTGCCGTCTTTATAACTTGCTGTTTGCTTGTGAACTAGGAGCCAGCTTTCCAAGAATTGATTTTACTTTACACCAAAGAAGAAAGTAGTGGAGATAAGACCGAGGTTCCATCGTTGCAGTTAAAGATCTATGAAGAAATTCCTATTCCAGACTTACCAGTATATACTTCTAACACTTAAATGTCTGTTAATGTTTTCATTCTTTTTTCTCATACAATGTATTTGCCCTTACTTTATCTAGCACCTATTATGTTGCATTTTTTATTTGAGATGTTAGCTCTTCTTATTTTTACTCATGAATTTCAATCTTGGTTAAAATACATGTAGGTGATCTTTCCTGACAAGAAACTATCTTTTCGAATTATCGATGCGGTATGACCTAGATTTGATTTCAAATGCATTTATTAGAATGTATCGAATTATAGCCATGTGCAGCATTGCTGCACAGTGTGTAGGATATCTGTGCATGGTTCTTTTGTAATTGTGTCATTTTATTGGCTTAAATTTTTTCTCTCGTCTTTTTTATACCAAAATATGATATTTACAAAGAGCATGGGATGGTTTTTGCTAGTGGCCAATAAAGCTTTTTGTAGTGTTTCATGCATGGAAGAGAGAAATACTATCTGTCGAGATGGGTCATGATTATTTAATTGAGATCAGGGTTGGAAAAAAATGTCCTAGTTAAGATACCAGGTGATTGAGTGGTAAATAGCACTCATGAAAATGAATGAGAGTTGGCAGATCTGAGAATGAACTAATAGTTGCCTAGTTATCTGTATCATTGCAGCTAATTTAGTAAGTTGATTTATTGGCTATTGGCATGCAAGGATTAACAGAACTTCTGTCATGCACTCTAGTCTTGCTTCTCTAATCGGGATAAAATTATATAGAGTTCCATGTTCAGTAACGAGGTTAGTGTTCACAAGTAAACACTTGATAACTTGTTATGAATGCCTATACAAGAATGATCATGGTCAAAGGCTTTCTTTCCCGAGGGACTTCATGAGTCTAGTGGGCAGCACAAGTAGCTTTAATTCTCAATGGAGAAATGGTTTATTGACGTACCCATTTCCACCATAGTTTTGTATTAACGATCACTGATGCAATTTACATCGATGGCTACTGACTCTCCTTTTGTTTCATTGAGCAATATGAACCCATATGAAATAAGTTTAGTCTTAATTGTGAATTAAGTTTTATGTTCACTCACCTATTTATTTTATTATCTGTATAGTTATTATTAGTTGGATTCTATGAATTGTTGATGGTTCATTTTTATCATTTTTACGTAACCTCTTGATCTAATGCCTCAAAATTGCAGCTACGCTTGGATGCTGCAACAATATTGGGACTTTTGGCCTTCTTTATCAATTACAAGTTTGAGAATGTCTTGTCTTCACCGTATGCTTCTTTTTAACTTGCTACATGATTACAGTCAGTAATTTGTCCAATGAAATTGAATAAAGCTAGAAACTTTGCATATTCTTTTCCCCAGGCCACATTCTATAACATCCTATATGGAAAAAAAATTAAAAGATTTTTGCGTAAGCAGCAGAATTTGAACCTCCACAGGTAGAATAGAACCCACATGATTTTCAGTCATGTATTACTATCTTTAATACAACAGAGAAGTAGGTAATCTGGATATCTGTCACTTAATAAACTCTGGTGGAGTTATATCATAACTGCTTGGTCTTTTTTCAGACGATTCTTTGGGCATTCTTCTAATTTTTCATGTATTTTATATTTGAAAAAGAACATTTAATATTCAATGTGAAATATGCAGTTTACGTGCTTTTATACACTATAATGTTCATGTACTACAATTTTTTTTTTTTGATAATGAACCATGCTTTCATTGAGAAAAAATGAAAGAATACAAGGGCGAACAAAAAGAATCAGCCTACAA
mRNA sequence
TATATTGTGAAATCTCTAACCATTTGATTATGTGGAAGAGAAATCACCAAAATAAACAAGCATGGCTGGTTTAAAAAAAAAGAAAAGAAAAGAAAAGAAAAGAAAAGAAAAAACTTTGTAAAAGAGAGAGTATGAGCAGAGTAGTAATAAATGTCATTTGGTGGTGGTAAACGAGTTTAGCAGAGTTTGACGGCCAGGAAAGGGATTCCTAATCCTGGTATTGTCTATTTGAGCATTGGGAGTGGTTTTCGCTTCACTTTTGGATTGGGATTGTTGTTGTCGCTTCACTCTCAGGCGATCTCTTCTTCTCTCCCTACTTGTCCCGCCCCCGTCTGATACATTTTGATTTTGCTTCGTCATCATCTTGGATTCCGGGTTAGTTTCAGTTGCAGTTGCAGTTGCTTTGTTTCGTCGTGGGATTCTTCCACCTTTCGAGGTCCCCAAAAGCTCAAAGTTGACGTCTCTCCAGCCGGCTTTGTGGGCGTTGTCCCAATCCCAGCCAAACCTTTCGCTTATCTCCAACAGCTCAGAAATGACGTTGTCCCCAGTTGCCCGTGTGTTGCTCATGCTCCCACTCAGAATGTTTTGGGCGAACTCCCACTGGATGCGGTCTTGGTAGACTGGGTCTTCCAGAACTTGGTCTGCCAGAATTGCCCACTCTTCAAACTCCCTTGCTCTCAGATTTTTGGAGACCCATTTCAGATAGCTTGAAGGCAGACTTCCCAGCATTTTGCCCTTGTGCTTTCCAAACCCAATCACCCGGTCTCTGGCTGGAATAGCAGCCGCACTCCCATTCCTTCTCCGCACACACACAGTCAACGCCTTTGGCTTAGGCTGCAAAAGGAATGAGAGTCTCGTCAAACTCAAACTCATGGGAGAGCACGAAGATAAGATGACCACCAGCTTAACCCTCAACCACAACCAACCAATCTTTAATTCTTAATCCATATATAATTCCTCGACCAAAATCGGATATTTTAAAACCCAAGAAGGATAAAAAAAAAAGCGCCTTCTGAAGTCAATGTCAAACTCTCGACTAGTTTCGTAGTAAAAATTTGATTTTTCCAAAACGAAATCATTATTGATTACTGGCCTGGAACGTCGAGTCTGCACGCGCCTTATCCGATACGATAAAGCTGAGTTGGGTGCACAGTGCACTCAGGCTCAGGCTCAGGCTCAGGTTCGCACGATGCTGTCGTCTTTGCAACGCCCCACTGGCCTCCGCAGTTCATCACATCTCTTCTCTCCAATTCCTCATGCTCCTCGTCAATGTCTTCTTCTTCCATGTTCCTCTTCCCTCACTGGTTTTCCCGAGATGTCCACGCAACCTTTGGAATCCAACGCGGCCGAGGTTTCCACATTCAAGCAGTGGCGGAAGAACGACGACGACATGGCCGACGATGAGTACCAAGATAAGGGCATTTCTAGAATTCCCGTGCCTAGACAGAAATACATACCAGTTTCGAAAGTCCAGTTGTTGGATGCCATTGTTTCGACCTTCTTTAACTCCAACCTTGATGATGATGATGATGATGATGGCGACGCTCAACATTTTCTGCTCCTCTCTTCGTGCTTGGACTCCATTCTTCACGCTGAACACAAGAAAATTTTAGAAGAAATGCGGAACGATTATTCCCTTTCTCAGTCGCTCAATAATGAGGCTACTTTTGATGAGGTTTCAACTAATACTGATGGCCAGCTTGTTTCTAATGAGAAAGAGGAGTTCATCTCAGCCAAGGATGGCATGACTGGGGTCGGGAGCATGGAGGACCTAGTGCAGAAGATTGGAGTTAGCAATCCGATGCCTTTCAGTTATAGTTTGGACTTTCGTAATCTCATGAGTTCTCCGAGGGGTGGCGCAAATAGTTACATCAACGGGGAGTCTCCAGTAGCAGTTGCCACTCGTTTCCAGCGTGCTTTTATGAAACTTCTTAAAAATGCTCAATTTGAAGAACTCTCAGCCACGGACCTGGTCTTGACATCAGCATTGAATACAGACTATCTGCTTACTTTGCCAATATACGTCGATTGGAAGAGGGCATCCGAGTCTAATGCAATTATTTTCAGGCGAGGATATGCAACTGAGAGGCAGAAAGGCCTGTTAATTGTTGATAAACTAGATTATTTACAGTCTAGACTTCTACGTGGATTCTTTTCCATAATCTCAAAACCAGTGGGGAGACTTGGTACTTGGATAGCTGAGGTTGTACTTGGTGCTCCACAGATGCAAGAAATACAAGAATGGGTTAAGAGGTTGAGGCTTTGGGTGAGCGAACTTCCTCTATCTCAACAATTATTTCGTTATGATGAAGAAGATTCTGATGATCTACTGAGAGACAATCGGATTTTGGATAGAGACCTTCCAATTTGGCTGGCAGCTCAGAGTGCAGTGTCTCGTTATGAAGGAATTCTTTCTTCCATGGGACCTCGTGGAAGACTCTTAAGGAGACTGCTTACATGGATAGGACTTCTTCCTCCTATGCCAGAACAACCATTCAAGCTTAATGATGACAGTAAAGCTTCTGAACCTTATTTAAGGCCTATCTTCATATCACGAATAACGCTCAGTGATATATGGAGGCCTGCAATGAAAAATTGTGGGAATGATATTTGGAAACGGTTGAAAACTTCCATTTCCATCCTCCTCTCTCAATCAGTGCTCCAGGAGCCAGCTTTCCAAGAATTGATTTTACTTTACACCAAAGAAGAAAGTAGTGGAGATAAGACCGAGGTTCCATCGTTGCAGTTAAAGATCTATGAAGAAATTCCTATTCCAGACTTACCAGTGATCTTTCCTGACAAGAAACTATCTTTTCGAATTATCGATGCGGTATGACCTAGATTTGATTTCAAATGCATTTATTAGAATGTATCGAATTATAGCCATGTGCAGCATTGCTGCACAGTGTGTAGGATATCTGTGCATGGTTCTTTTGTAATTGTGTCATTTTATTGGCTTAAATTTTTTCTCTCGTCTTTTTTATACCAAAATATGATATTTACAAAGAGCATGGGATGGTTTTTGCTAGTGGCCAATAAAGCTTTTTGTAGTGTTTCATGCATGGAAGAGAGAAATACTATCTGTCGAGATGGGTCATGATTATTTAATTGAGATCAGGGTTGGAAAAAAATGTCCTAGTTAAGATACCAGGTGATTGAGTGGTAAATAGCACTCATGAAAATGAATGAGAGTTGGCAGATCTGAGAATGAACTAATAGTTGCCTAGTTATCTGTATCATTGCAGCTAATTTAGTAAGTTGATTTATTGGCTATTGGCATGCAAGGATTAACAGAACTTCTGTCATGCACTCTAGTCTTGCTTCTCTAATCGGGATAAAATTATATAGAGTTCCATGTTCAGTAACGAGGTTAGTGTTCACAAGTAAACACTTGATAACTTGTTATGAATGCCTATACAAGAATGATCATGGTCAAAGGCTTTCTTTCCCGAGGGACTTCATGAGTCTAGTGGGCAGCACAAGTAGCTTTAATTCTCAATGGAGAAATGGTTTATTGACGTACCCATTTCCACCATAGTTTTGTATTAACGATCACTGATGCAATTTACATCGATGGCTACTGACTCTCCTTTTGTTTCATTGAGCAATATGAACCCATATGAAATAAGTTTAGTCTTAATTGTGAATTAAGTTTTATGTTCACTCACCTATTTATTTTATTATCTGTATAGTTATTATTAGTTGGATTCTATGAATTGTTGATGGTTCATTTTTATCATTTTTACGTAACCTCTTGATCTAATGCCTCAAAATTGCAGCTACGCTTGGATGCTGCAACAATATTGGGACTTTTGGCCTTCTTTATCAATTACAAGTTTGAGAATGTCTTGTCTTCACCGTATGCTTCTTTTTAACTTGCTACATGATTACAGTCAGTAATTTGTCCAATGAAATTGAATAAAGCTAGAAACTTTGCATATTCTTTTCCCCAGGCCACATTCTATAACATCCTATATGGAAAAAAAATTAAAAGATTTTTGCGTAAGCAGCAGAATTTGAACCTCCACAGGTAGAATAGAACCCACATGATTTTCAGTCATGTATTACTATCTTTAATACAACAGAGAAGTAGGTAATCTGGATATCTGTCACTTAATAAACTCTGGTGGAGTTATATCATAACTGCTTGGTCTTTTTTCAGACGATTCTTTGGGCATTCTTCTAATTTTTCATGTATTTTATATTTGAAAAAGAACATTTAATATTCAATGTGAAATATGCAGTTTACGTGCTTTTATACACTATAATGTTCATGTACTACAATTTTTTTTTTTTGATAATGAACCATGCTTTCATTGAGAAAAAATGAAAGAATACAAGGGCGAACAAAAAGAATCAGCCTACAA
Coding sequence (CDS)
ATGCTGTCGTCTTTGCAACGCCCCACTGGCCTCCGCAGTTCATCACATCTCTTCTCTCCAATTCCTCATGCTCCTCGTCAATGTCTTCTTCTTCCATGTTCCTCTTCCCTCACTGGTTTTCCCGAGATGTCCACGCAACCTTTGGAATCCAACGCGGCCGAGGTTTCCACATTCAAGCAGTGGCGGAAGAACGACGACGACATGGCCGACGATGAGTACCAAGATAAGGGCATTTCTAGAATTCCCGTGCCTAGACAGAAATACATACCAGTTTCGAAAGTCCAGTTGTTGGATGCCATTGTTTCGACCTTCTTTAACTCCAACCTTGATGATGATGATGATGATGATGGCGACGCTCAACATTTTCTGCTCCTCTCTTCGTGCTTGGACTCCATTCTTCACGCTGAACACAAGAAAATTTTAGAAGAAATGCGGAACGATTATTCCCTTTCTCAGTCGCTCAATAATGAGGCTACTTTTGATGAGGTTTCAACTAATACTGATGGCCAGCTTGTTTCTAATGAGAAAGAGGAGTTCATCTCAGCCAAGGATGGCATGACTGGGGTCGGGAGCATGGAGGACCTAGTGCAGAAGATTGGAGTTAGCAATCCGATGCCTTTCAGTTATAGTTTGGACTTTCGTAATCTCATGAGTTCTCCGAGGGGTGGCGCAAATAGTTACATCAACGGGGAGTCTCCAGTAGCAGTTGCCACTCGTTTCCAGCGTGCTTTTATGAAACTTCTTAAAAATGCTCAATTTGAAGAACTCTCAGCCACGGACCTGGTCTTGACATCAGCATTGAATACAGACTATCTGCTTACTTTGCCAATATACGTCGATTGGAAGAGGGCATCCGAGTCTAATGCAATTATTTTCAGGCGAGGATATGCAACTGAGAGGCAGAAAGGCCTGTTAATTGTTGATAAACTAGATTATTTACAGTCTAGACTTCTACGTGGATTCTTTTCCATAATCTCAAAACCAGTGGGGAGACTTGGTACTTGGATAGCTGAGGTTGTACTTGGTGCTCCACAGATGCAAGAAATACAAGAATGGGTTAAGAGGTTGAGGCTTTGGGTGAGCGAACTTCCTCTATCTCAACAATTATTTCGTTATGATGAAGAAGATTCTGATGATCTACTGAGAGACAATCGGATTTTGGATAGAGACCTTCCAATTTGGCTGGCAGCTCAGAGTGCAGTGTCTCGTTATGAAGGAATTCTTTCTTCCATGGGACCTCGTGGAAGACTCTTAAGGAGACTGCTTACATGGATAGGACTTCTTCCTCCTATGCCAGAACAACCATTCAAGCTTAATGATGACAGTAAAGCTTCTGAACCTTATTTAAGGCCTATCTTCATATCACGAATAACGCTCAGTGATATATGGAGGCCTGCAATGAAAAATTGTGGGAATGATATTTGGAAACGGTTGAAAACTTCCATTTCCATCCTCCTCTCTCAATCAGTGCTCCAGGAGCCAGCTTTCCAAGAATTGATTTTACTTTACACCAAAGAAGAAAGTAGTGGAGATAAGACCGAGGTTCCATCGTTGCAGTTAAAGATCTATGAAGAAATTCCTATTCCAGACTTACCAGTGATCTTTCCTGACAAGAAACTATCTTTTCGAATTATCGATGCGGTATGA
Protein sequence
MLSSLQRPTGLRSSSHLFSPIPHAPRQCLLLPCSSSLTGFPEMSTQPLESNAAEVSTFKQWRKNDDDMADDEYQDKGISRIPVPRQKYIPVSKVQLLDAIVSTFFNSNLDDDDDDDGDAQHFLLLSSCLDSILHAEHKKILEEMRNDYSLSQSLNNEATFDEVSTNTDGQLVSNEKEEFISAKDGMTGVGSMEDLVQKIGVSNPMPFSYSLDFRNLMSSPRGGANSYINGESPVAVATRFQRAFMKLLKNAQFEELSATDLVLTSALNTDYLLTLPIYVDWKRASESNAIIFRRGYATERQKGLLIVDKLDYLQSRLLRGFFSIISKPVGRLGTWIAEVVLGAPQMQEIQEWVKRLRLWVSELPLSQQLFRYDEEDSDDLLRDNRILDRDLPIWLAAQSAVSRYEGILSSMGPRGRLLRRLLTWIGLLPPMPEQPFKLNDDSKASEPYLRPIFISRITLSDIWRPAMKNCGNDIWKRLKTSISILLSQSVLQEPAFQELILLYTKEESSGDKTEVPSLQLKIYEEIPIPDLPVIFPDKKLSFRIIDAV
Homology
BLAST of Lcy05g000130 vs. ExPASy TrEMBL
Match:
A0A6J1GRM1 (uncharacterized protein LOC111456493 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111456493 PE=4 SV=1)
HSP 1 Score: 924.9 bits (2389), Expect = 1.6e-265
Identity = 475/549 (86.52%), Postives = 502/549 (91.44%), Query Frame = 0
Query: 1 MLSSLQRPTGLRSSSHLFSPIPHAPRQCLLLPCSSSLTGFPEMSTQPLESNAAEVSTFKQ 60
ML+SLQ PTGLRSSS LFSPIPHA R LPCSSSLTGFP++STQPLESNAAE S F Q
Sbjct: 1 MLASLQLPTGLRSSSRLFSPIPHAHRHSRPLPCSSSLTGFPDISTQPLESNAAEASRFNQ 60
Query: 61 WRKNDDDMADDEYQDKGISRIPVPRQKYIPVSKVQLLDAIVSTFFNSNLDDDDDDDGDAQ 120
W N+ DMADDE+QDKGISRIPVPR K+IPVSK QLLDAIVSTFFNSN DDDDDDD DAQ
Sbjct: 61 WPNNNGDMADDEFQDKGISRIPVPRHKHIPVSKAQLLDAIVSTFFNSNHDDDDDDDPDAQ 120
Query: 121 HFLLLSSCLDSILHAEHKKILEEMRNDYSLSQSLNNEATFDEVSTNTDGQLVSNEKEEFI 180
HFLLLSSCLDSILHAEHKK LEEMRNDYSL+QSL NEA E STNTDGQ VSNEKEE I
Sbjct: 121 HFLLLSSCLDSILHAEHKKTLEEMRNDYSLTQSLENEANSGESSTNTDGQTVSNEKEESI 180
Query: 181 SAKDGMTGVGSMEDLVQKIGVSNPMPFSYSLDFRNLMSSPRGGANSYINGESPVAVATRF 240
+ KD +TG+GSME+LVQKIGV NPMPFSY+LDFRNL+SS +GG SYINGES VAVATRF
Sbjct: 181 TFKDAITGIGSMEELVQKIGVGNPMPFSYNLDFRNLLSSLKGGVYSYINGESSVAVATRF 240
Query: 241 QRAFMKLLKNAQFEELSATDLVLTSALNTDYLLTLPIYVDWKRASESNAIIFRRGYATER 300
QR+F++LLKNA+FEELSA DL LTSALNTDYLLTLPIYVDWKRASESNAIIFRRGYATER
Sbjct: 241 QRSFIQLLKNAEFEELSAMDLGLTSALNTDYLLTLPIYVDWKRASESNAIIFRRGYATER 300
Query: 301 QKGLLIVDKLDYLQSRLLRGFFSIISKPVGRLGTWIAEVVLGAPQMQEIQEWVKRLRLWV 360
Q GLLIVDKLDY+QSRLLRG FSII+KP+GRLGTWIAEVV GAPQM EIQEWVKRLRLWV
Sbjct: 301 QTGLLIVDKLDYIQSRLLRGLFSIIAKPLGRLGTWIAEVVYGAPQMPEIQEWVKRLRLWV 360
Query: 361 SELPLSQQLFRYDEEDSDDLLRDNRILDRDLPIWLAAQSAVSRYEGILSSMGPRGRLLRR 420
SELP SQQLFRYDEEDSD LL DNRI D+DLPIWLAAQSAVSRYEGILSSMGPRGRLLRR
Sbjct: 361 SELPTSQQLFRYDEEDSDGLLSDNRISDKDLPIWLAAQSAVSRYEGILSSMGPRGRLLRR 420
Query: 421 LLTWIGLLPPMPEQPFKLNDDSKASEPYLRPIFISRITLSDIWRPAMKNCGNDIWKRLKT 480
LLTWIGLLPPMPEQPF NDDSKASEPYLRPIFISRI+LSDIWRPAMKNCGN+IWKRLKT
Sbjct: 421 LLTWIGLLPPMPEQPFTPNDDSKASEPYLRPIFISRISLSDIWRPAMKNCGNNIWKRLKT 480
Query: 481 SISILLSQSVLQEPAFQELILLYTK-EESSGDKTEVPSLQLKIYEEIPIPDLPVIFPDKK 540
SISILLSQSVLQEPAFQELILLYTK +SGDKTEVPSLQLKIYE+IPIPDLPVIFPDKK
Sbjct: 481 SISILLSQSVLQEPAFQELILLYTKGGRNSGDKTEVPSLQLKIYEKIPIPDLPVIFPDKK 540
Query: 541 LSFRIIDAV 549
LSFRIIDA+
Sbjct: 541 LSFRIIDAL 549
BLAST of Lcy05g000130 vs. ExPASy TrEMBL
Match:
A0A6J1JSS6 (uncharacterized protein LOC111488573 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111488573 PE=4 SV=1)
HSP 1 Score: 919.1 bits (2374), Expect = 8.9e-264
Identity = 474/549 (86.34%), Postives = 503/549 (91.62%), Query Frame = 0
Query: 1 MLSSLQRPTGLRSSSHLFSPIPHAPRQCLLLPCSSSLTGFPEMSTQPLESNAAEVSTFKQ 60
ML+SLQ PTGLRSSS LFSPIPHA R L LPCSSSLTGFP+++TQPLESNAAE S F Q
Sbjct: 1 MLASLQLPTGLRSSSRLFSPIPHAHRHSLPLPCSSSLTGFPDITTQPLESNAAEASRFNQ 60
Query: 61 WRKNDDDMADDEYQDKGISRIPVPRQKYIPVSKVQLLDAIVSTFFNSNLDDDDDDDGDAQ 120
W N+ DMADD++QDKGISRIPVPRQK+IPVSK QLLDAIVSTFFNSN DDDDDD DAQ
Sbjct: 61 WPNNNGDMADDDFQDKGISRIPVPRQKHIPVSKAQLLDAIVSTFFNSN-HDDDDDDPDAQ 120
Query: 121 HFLLLSSCLDSILHAEHKKILEEMRNDYSLSQSLNNEATFDEVSTNTDGQLVSNEKEEFI 180
HFLLLSSCLDSILHAEHKK LEEMRNDYSL+QSL NEA E STNTDGQ VSN KEE I
Sbjct: 121 HFLLLSSCLDSILHAEHKKTLEEMRNDYSLTQSLENEANSGESSTNTDGQTVSNGKEESI 180
Query: 181 SAKDGMTGVGSMEDLVQKIGVSNPMPFSYSLDFRNLMSSPRGGANSYINGESPVAVATRF 240
+ KD +TG+GSME+LVQKIGV NPMPFSY+LDFRNL+SS +GG NSYINGES VAVATRF
Sbjct: 181 TFKDAITGIGSMEELVQKIGVGNPMPFSYNLDFRNLLSSLKGGVNSYINGESSVAVATRF 240
Query: 241 QRAFMKLLKNAQFEELSATDLVLTSALNTDYLLTLPIYVDWKRASESNAIIFRRGYATER 300
QR+F++LLKNA+FEELSA DL LTSALNTDYLLTLPIYVDWKRASESNAIIFRRGYATER
Sbjct: 241 QRSFIQLLKNAEFEELSAMDLGLTSALNTDYLLTLPIYVDWKRASESNAIIFRRGYATER 300
Query: 301 QKGLLIVDKLDYLQSRLLRGFFSIISKPVGRLGTWIAEVVLGAPQMQEIQEWVKRLRLWV 360
Q GLLIVDKLDY+QSRLLRG FSII+KP+GRLGTWIAEVV GAPQM EIQEWVKRLRLWV
Sbjct: 301 QTGLLIVDKLDYIQSRLLRGLFSIIAKPLGRLGTWIAEVVYGAPQMPEIQEWVKRLRLWV 360
Query: 361 SELPLSQQLFRYDEEDSDDLLRDNRILDRDLPIWLAAQSAVSRYEGILSSMGPRGRLLRR 420
SELP SQQLFRYDEEDSD LLRDNRI D+DLPIWLAAQSAVSRYEGILSSMGPRGRLLRR
Sbjct: 361 SELPTSQQLFRYDEEDSDGLLRDNRISDKDLPIWLAAQSAVSRYEGILSSMGPRGRLLRR 420
Query: 421 LLTWIGLLPPMPEQPFKLNDDSKASEPYLRPIFISRITLSDIWRPAMKNCGNDIWKRLKT 480
LLTWIGLLPPMPEQPF NDDSKASEPYLRPIFISRI+LSDIWRPAMKN GN+IWKRLKT
Sbjct: 421 LLTWIGLLPPMPEQPFTPNDDSKASEPYLRPIFISRISLSDIWRPAMKNYGNNIWKRLKT 480
Query: 481 SISILLSQSVLQEPAFQELILLYTK-EESSGDKTEVPSLQLKIYEEIPIPDLPVIFPDKK 540
SISILLSQSVLQEPAFQELILLYTK +SGDKTEVPSLQLKIYE+IPIPDLPVIFPDKK
Sbjct: 481 SISILLSQSVLQEPAFQELILLYTKGGRNSGDKTEVPSLQLKIYEKIPIPDLPVIFPDKK 540
Query: 541 LSFRIIDAV 549
LSFRIIDA+
Sbjct: 541 LSFRIIDAL 548
BLAST of Lcy05g000130 vs. ExPASy TrEMBL
Match:
A0A6J1GQ20 (uncharacterized protein LOC111456493 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111456493 PE=4 SV=1)
HSP 1 Score: 906.7 bits (2342), Expect = 4.5e-260
Identity = 469/549 (85.43%), Postives = 496/549 (90.35%), Query Frame = 0
Query: 1 MLSSLQRPTGLRSSSHLFSPIPHAPRQCLLLPCSSSLTGFPEMSTQPLESNAAEVSTFKQ 60
ML+SLQ PTGLRSSS LFSPIPHA R LPCSSSLTGFP++STQPLESNAAE S F Q
Sbjct: 1 MLASLQLPTGLRSSSRLFSPIPHAHRHSRPLPCSSSLTGFPDISTQPLESNAAEASRFNQ 60
Query: 61 WRKNDDDMADDEYQDKGISRIPVPRQKYIPVSKVQLLDAIVSTFFNSNLDDDDDDDGDAQ 120
W N+ DMADDE+QDKGISRIPVPR K+IPVSK QLLDAIVSTFFNSN DDDDDDD DAQ
Sbjct: 61 WPNNNGDMADDEFQDKGISRIPVPRHKHIPVSKAQLLDAIVSTFFNSNHDDDDDDDPDAQ 120
Query: 121 HFLLLSSCLDSILHAEHKKILEEMRNDYSLSQSLNNEATFDEVSTNTDGQLVSNEKEEFI 180
HFLLLSSCLDSILHAEHKK LEEMRNDYSL+QSL NEA E STNTDGQ VSNEKEE I
Sbjct: 121 HFLLLSSCLDSILHAEHKKTLEEMRNDYSLTQSLENEANSGESSTNTDGQTVSNEKEESI 180
Query: 181 SAKDGMTGVGSMEDLVQKIGVSNPMPFSYSLDFRNLMSSPRGGANSYINGESPVAVATRF 240
+ KD +TG+GSME+LVQKIGV NPMPFSY+LDFRNL+SS +GG SYINGES VAVATRF
Sbjct: 181 TFKDAITGIGSMEELVQKIGVGNPMPFSYNLDFRNLLSSLKGGVYSYINGESSVAVATRF 240
Query: 241 QRAFMKLLKNAQFEELSATDLVLTSALNTDYLLTLPIYVDWKRASESNAIIFRRGYATER 300
QR+F++LLKNA+FEELSA DL LTSALNTDYLLTLPIYVDWKRASESNAIIFRRGYATER
Sbjct: 241 QRSFIQLLKNAEFEELSAMDLGLTSALNTDYLLTLPIYVDWKRASESNAIIFRRGYATER 300
Query: 301 QKGLLIVDKLDYLQSRLLRGFFSIISKPVGRLGTWIAEVVLGAPQMQEIQEWVKRLRLWV 360
Q GLLIVDKLDY+QSRLLRG FSII+KP+GRLGTWIAE M EIQEWVKRLRLWV
Sbjct: 301 QTGLLIVDKLDYIQSRLLRGLFSIIAKPLGRLGTWIAE-------MPEIQEWVKRLRLWV 360
Query: 361 SELPLSQQLFRYDEEDSDDLLRDNRILDRDLPIWLAAQSAVSRYEGILSSMGPRGRLLRR 420
SELP SQQLFRYDEEDSD LL DNRI D+DLPIWLAAQSAVSRYEGILSSMGPRGRLLRR
Sbjct: 361 SELPTSQQLFRYDEEDSDGLLSDNRISDKDLPIWLAAQSAVSRYEGILSSMGPRGRLLRR 420
Query: 421 LLTWIGLLPPMPEQPFKLNDDSKASEPYLRPIFISRITLSDIWRPAMKNCGNDIWKRLKT 480
LLTWIGLLPPMPEQPF NDDSKASEPYLRPIFISRI+LSDIWRPAMKNCGN+IWKRLKT
Sbjct: 421 LLTWIGLLPPMPEQPFTPNDDSKASEPYLRPIFISRISLSDIWRPAMKNCGNNIWKRLKT 480
Query: 481 SISILLSQSVLQEPAFQELILLYTK-EESSGDKTEVPSLQLKIYEEIPIPDLPVIFPDKK 540
SISILLSQSVLQEPAFQELILLYTK +SGDKTEVPSLQLKIYE+IPIPDLPVIFPDKK
Sbjct: 481 SISILLSQSVLQEPAFQELILLYTKGGRNSGDKTEVPSLQLKIYEKIPIPDLPVIFPDKK 540
Query: 541 LSFRIIDAV 549
LSFRIIDA+
Sbjct: 541 LSFRIIDAL 542
BLAST of Lcy05g000130 vs. ExPASy TrEMBL
Match:
A0A5D3BXV4 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold347G001880 PE=4 SV=1)
HSP 1 Score: 905.6 bits (2339), Expect = 1.0e-259
Identity = 467/549 (85.06%), Postives = 498/549 (90.71%), Query Frame = 0
Query: 1 MLSSLQRPTGLRSSSHLFSPIPHAPRQCLLLPCSSSLTGFPEMSTQPLESNAAEVSTFKQ 60
MLSSLQ PTGLRSS+ L SPIP RQCL LP SSSLT FP+MSTQPLES AAE STF Q
Sbjct: 1 MLSSLQLPTGLRSSAPLLSPIPQGHRQCLPLPSSSSLTAFPDMSTQPLESIAAEPSTFNQ 60
Query: 61 WRKNDDDMADDEYQDKGISRIPVPRQKYIPVSKVQLLDAIVSTFFNSNLDDDDDDDGDAQ 120
W+ N+ DMADD+Y+DK ISRIPVPR K+IPVSK +LLDAIVSTFFNSN DDD DAQ
Sbjct: 61 WQNNNGDMADDDYEDKDISRIPVPRHKHIPVSKARLLDAIVSTFFNSN--HADDDHNDAQ 120
Query: 121 HFLLLSSCLDSILHAEHKKILEEMRNDYSLSQSLNNEATFDEVSTNTDGQLVSNEKEEFI 180
HF L+SSCLDSILHAEHKKILEEMR+DYSLSQSL NEAT EVSTNTDGQLVSNE E
Sbjct: 121 HFQLISSCLDSILHAEHKKILEEMRSDYSLSQSLENEATPAEVSTNTDGQLVSNETEVST 180
Query: 181 SAKDGMTGVGSMEDLVQKIGVSNPMPFSYSLDFRNLMSSPRGGANSYINGESPVAVATRF 240
+AKD M G+ SMEDLVQKIGVS+ MPF YSLDFRNL+SSP+GG NSYINGES VAVATRF
Sbjct: 181 TAKDAMVGIESMEDLVQKIGVSSAMPFGYSLDFRNLLSSPKGGINSYINGESSVAVATRF 240
Query: 241 QRAFMKLLKNAQFEELSATDLVLTSALNTDYLLTLPIYVDWKRASESNAIIFRRGYATER 300
QR+FMKLLKNAQFEELSA DLVLTSALNTDYLLTLPIYVDWKRASESNAIIFRRGYATER
Sbjct: 241 QRSFMKLLKNAQFEELSAMDLVLTSALNTDYLLTLPIYVDWKRASESNAIIFRRGYATER 300
Query: 301 QKGLLIVDKLDYLQSRLLRGFFSIISKPVGRLGTWIAEVVLGAPQMQEIQEWVKRLRLWV 360
Q+GLLIVDKLDY+QSRLLRG FS+ISKP+ RLGTWIAE LGAPQMQEIQEWVKRLRLWV
Sbjct: 301 QRGLLIVDKLDYIQSRLLRGLFSLISKPLRRLGTWIAEAALGAPQMQEIQEWVKRLRLWV 360
Query: 361 SELPLSQQLFRYDEEDSDDLLRDNRILDRDLPIWLAAQSAVSRYEGILSSMGPRGRLLRR 420
+LP+SQQLFRYDEEDSDDLLRDN+I D+DLPIWLAAQSAVSRYEGILSS GPRGRLLRR
Sbjct: 361 RDLPMSQQLFRYDEEDSDDLLRDNQISDKDLPIWLAAQSAVSRYEGILSSTGPRGRLLRR 420
Query: 421 LLTWIGLLPPMPEQPFKLNDDSKASEPYLRPIFISRITLSDIWRPAMKNCGNDIWKRLKT 480
LLTWIGLLPPMPEQPFKL DDSKA EPYLRPIFISRI+LSDIWRPAMKNCGNDIWK+LKT
Sbjct: 421 LLTWIGLLPPMPEQPFKLTDDSKAFEPYLRPIFISRISLSDIWRPAMKNCGNDIWKQLKT 480
Query: 481 SISILLSQSVLQEPAFQELILLYTKE-ESSGDKTEVPSLQLKIYEEIPIPDLPVIFPDKK 540
SISILLSQSVLQEPAF+ELILLYTK +SGD+TEVPSLQLKIYE+IPIPDLPVIFPDKK
Sbjct: 481 SISILLSQSVLQEPAFEELILLYTKNGRNSGDRTEVPSLQLKIYEKIPIPDLPVIFPDKK 540
Query: 541 LSFRIIDAV 549
LSFRIIDA+
Sbjct: 541 LSFRIIDAL 547
BLAST of Lcy05g000130 vs. ExPASy TrEMBL
Match:
A0A1S3CHX6 (uncharacterized protein LOC103500618 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103500618 PE=4 SV=1)
HSP 1 Score: 904.8 bits (2337), Expect = 1.7e-259
Identity = 470/551 (85.30%), Postives = 499/551 (90.56%), Query Frame = 0
Query: 1 MLSSLQRPTGLRSSSHLFSPIP--HAPRQCLLLPCSSSLTGFPEMSTQPLESNAAEVSTF 60
MLSSLQ PTGLRSSS L SPIP A RQCL LP SSSLT FP+MSTQPLES AAE STF
Sbjct: 1 MLSSLQLPTGLRSSSPLLSPIPQGQARRQCLPLPSSSSLTAFPDMSTQPLESIAAEPSTF 60
Query: 61 KQWRKNDDDMADDEYQDKGISRIPVPRQKYIPVSKVQLLDAIVSTFFNSNLDDDDDDDGD 120
QW+ N+ DMADD+Y+DK ISRIPVPR K+IPVSK QLLDAIVSTFFNSN DDD D
Sbjct: 61 NQWQNNNGDMADDDYEDKDISRIPVPRHKHIPVSKAQLLDAIVSTFFNSN--HADDDHND 120
Query: 121 AQHFLLLSSCLDSILHAEHKKILEEMRNDYSLSQSLNNEATFDEVSTNTDGQLVSNEKEE 180
AQHF L+SSCLDSILHAEHKKILEEMR+DYSLSQSL NEAT EVSTNTDGQLVSNE E
Sbjct: 121 AQHFQLISSCLDSILHAEHKKILEEMRSDYSLSQSLENEATPAEVSTNTDGQLVSNETEV 180
Query: 181 FISAKDGMTGVGSMEDLVQKIGVSNPMPFSYSLDFRNLMSSPRGGANSYINGESPVAVAT 240
+AKD M G+ SMEDLVQKIGVS+ MPF YSLDFRNL+SSP+GG NSYINGES VAVAT
Sbjct: 181 STTAKDAMVGIESMEDLVQKIGVSSAMPFGYSLDFRNLLSSPKGGINSYINGESSVAVAT 240
Query: 241 RFQRAFMKLLKNAQFEELSATDLVLTSALNTDYLLTLPIYVDWKRASESNAIIFRRGYAT 300
RFQR+FMKLLKNAQFEELSA DLVLTSALNTDYLLTLPIYVDWKRASESNAIIFRRGYAT
Sbjct: 241 RFQRSFMKLLKNAQFEELSAMDLVLTSALNTDYLLTLPIYVDWKRASESNAIIFRRGYAT 300
Query: 301 ERQKGLLIVDKLDYLQSRLLRGFFSIISKPVGRLGTWIAEVVLGAPQMQEIQEWVKRLRL 360
ERQ+GLLIVDKLDY+QSRLLRG FS+ISKP+ RLGTWIAE LGAPQMQEIQEWVKRLRL
Sbjct: 301 ERQRGLLIVDKLDYIQSRLLRGLFSLISKPLRRLGTWIAEAALGAPQMQEIQEWVKRLRL 360
Query: 361 WVSELPLSQQLFRYDEEDSDDLLRDNRILDRDLPIWLAAQSAVSRYEGILSSMGPRGRLL 420
WV +LP+SQQLFRYDEEDSDDLLRDN+I D+DLPIWLAAQSAVSRYEGILSS GPRGRLL
Sbjct: 361 WVRDLPMSQQLFRYDEEDSDDLLRDNQISDKDLPIWLAAQSAVSRYEGILSSTGPRGRLL 420
Query: 421 RRLLTWIGLLPPMPEQPFKLNDDSKASEPYLRPIFISRITLSDIWRPAMKNCGNDIWKRL 480
RRLLTWIGLLPPMPEQPFKL DDSKA EPYLRPIFISRI+LSDIWRPAMKNCGNDIWK+L
Sbjct: 421 RRLLTWIGLLPPMPEQPFKLTDDSKAFEPYLRPIFISRISLSDIWRPAMKNCGNDIWKQL 480
Query: 481 KTSISILLSQSVLQEPAFQELILLYTKE-ESSGDKTEVPSLQLKIYEEIPIPDLPVIFPD 540
KTSISILLSQSVLQEPAF+ELILLYTK +SGD+TEVPSLQLKIYE+IPIPDLPVIFPD
Sbjct: 481 KTSISILLSQSVLQEPAFEELILLYTKNGRNSGDRTEVPSLQLKIYEKIPIPDLPVIFPD 540
Query: 541 KKLSFRIIDAV 549
KKLSFRIIDA+
Sbjct: 541 KKLSFRIIDAL 549
BLAST of Lcy05g000130 vs. NCBI nr
Match:
KAG7013933.1 (hypothetical protein SDJN02_24102 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 926.8 bits (2394), Expect = 8.8e-266
Identity = 477/550 (86.73%), Postives = 504/550 (91.64%), Query Frame = 0
Query: 1 MLSSLQRPTGLRSSSHLFSPIPHAPRQCLLLPCSSSLTGFPEMSTQPLESNAAEVSTFKQ 60
ML+SLQ PTGLRSSS LFSPIPHA R L LPCSSSLTGFP++STQPLESNAAE S F Q
Sbjct: 1 MLASLQLPTGLRSSSRLFSPIPHAHRHSLPLPCSSSLTGFPDISTQPLESNAAEASRFNQ 60
Query: 61 WRKNDDDMADDEYQDKGISRIPVPRQKYIPVSKVQLLDAIVSTFFNSNLDDDDDDDG-DA 120
W N+ DMADDE+QDKGISRIPVPR K+IPVSK QLLDAIVSTFFNSN DDDDDDD DA
Sbjct: 61 WPNNNGDMADDEFQDKGISRIPVPRHKHIPVSKAQLLDAIVSTFFNSNHDDDDDDDDPDA 120
Query: 121 QHFLLLSSCLDSILHAEHKKILEEMRNDYSLSQSLNNEATFDEVSTNTDGQLVSNEKEEF 180
QHFLLLSSCLDSILHAEHKK LEEMRNDYSL+QSL NEA E STNTDGQ VSNEKEE
Sbjct: 121 QHFLLLSSCLDSILHAEHKKTLEEMRNDYSLTQSLENEANSGETSTNTDGQAVSNEKEES 180
Query: 181 ISAKDGMTGVGSMEDLVQKIGVSNPMPFSYSLDFRNLMSSPRGGANSYINGESPVAVATR 240
I+ KD +TG+GSME+LVQKIGV NPMPFSY+LDFRNL+SS +GG NSYINGES VAVATR
Sbjct: 181 ITFKDAITGIGSMEELVQKIGVGNPMPFSYNLDFRNLLSSLKGGVNSYINGESSVAVATR 240
Query: 241 FQRAFMKLLKNAQFEELSATDLVLTSALNTDYLLTLPIYVDWKRASESNAIIFRRGYATE 300
FQR+F++LLKNA+FEELSA DL LTSALNTDYLLTLPIYVDWKRASESNAIIFRRGYATE
Sbjct: 241 FQRSFIQLLKNAEFEELSAMDLGLTSALNTDYLLTLPIYVDWKRASESNAIIFRRGYATE 300
Query: 301 RQKGLLIVDKLDYLQSRLLRGFFSIISKPVGRLGTWIAEVVLGAPQMQEIQEWVKRLRLW 360
RQ GLLIVDKLDY+QSRLLRG FSII+KP+GRLGTWIAEVV GAPQM EIQEWVKRLRLW
Sbjct: 301 RQTGLLIVDKLDYIQSRLLRGLFSIIAKPLGRLGTWIAEVVYGAPQMPEIQEWVKRLRLW 360
Query: 361 VSELPLSQQLFRYDEEDSDDLLRDNRILDRDLPIWLAAQSAVSRYEGILSSMGPRGRLLR 420
VSELP SQQLFRYDEEDSD LL DNRI D+DLPIWLAAQSAVSRYEGILSSMGPRGRLLR
Sbjct: 361 VSELPTSQQLFRYDEEDSDGLLSDNRISDKDLPIWLAAQSAVSRYEGILSSMGPRGRLLR 420
Query: 421 RLLTWIGLLPPMPEQPFKLNDDSKASEPYLRPIFISRITLSDIWRPAMKNCGNDIWKRLK 480
RLLTWIGLLPPMPEQPF NDDSKASEPYLRPIFISRI+LSDIWRPAMKNCGN+IWKRLK
Sbjct: 421 RLLTWIGLLPPMPEQPFTPNDDSKASEPYLRPIFISRISLSDIWRPAMKNCGNNIWKRLK 480
Query: 481 TSISILLSQSVLQEPAFQELILLYTK-EESSGDKTEVPSLQLKIYEEIPIPDLPVIFPDK 540
TSISILLSQSVLQEPAFQELILLYTK +SGDKTEVPSLQLKIYE+IPIPDLPVIFPDK
Sbjct: 481 TSISILLSQSVLQEPAFQELILLYTKGGRNSGDKTEVPSLQLKIYEKIPIPDLPVIFPDK 540
Query: 541 KLSFRIIDAV 549
KLSFRIIDA+
Sbjct: 541 KLSFRIIDAL 550
BLAST of Lcy05g000130 vs. NCBI nr
Match:
XP_022954140.1 (uncharacterized protein LOC111456493 isoform X1 [Cucurbita moschata])
HSP 1 Score: 924.9 bits (2389), Expect = 3.3e-265
Identity = 475/549 (86.52%), Postives = 502/549 (91.44%), Query Frame = 0
Query: 1 MLSSLQRPTGLRSSSHLFSPIPHAPRQCLLLPCSSSLTGFPEMSTQPLESNAAEVSTFKQ 60
ML+SLQ PTGLRSSS LFSPIPHA R LPCSSSLTGFP++STQPLESNAAE S F Q
Sbjct: 1 MLASLQLPTGLRSSSRLFSPIPHAHRHSRPLPCSSSLTGFPDISTQPLESNAAEASRFNQ 60
Query: 61 WRKNDDDMADDEYQDKGISRIPVPRQKYIPVSKVQLLDAIVSTFFNSNLDDDDDDDGDAQ 120
W N+ DMADDE+QDKGISRIPVPR K+IPVSK QLLDAIVSTFFNSN DDDDDDD DAQ
Sbjct: 61 WPNNNGDMADDEFQDKGISRIPVPRHKHIPVSKAQLLDAIVSTFFNSNHDDDDDDDPDAQ 120
Query: 121 HFLLLSSCLDSILHAEHKKILEEMRNDYSLSQSLNNEATFDEVSTNTDGQLVSNEKEEFI 180
HFLLLSSCLDSILHAEHKK LEEMRNDYSL+QSL NEA E STNTDGQ VSNEKEE I
Sbjct: 121 HFLLLSSCLDSILHAEHKKTLEEMRNDYSLTQSLENEANSGESSTNTDGQTVSNEKEESI 180
Query: 181 SAKDGMTGVGSMEDLVQKIGVSNPMPFSYSLDFRNLMSSPRGGANSYINGESPVAVATRF 240
+ KD +TG+GSME+LVQKIGV NPMPFSY+LDFRNL+SS +GG SYINGES VAVATRF
Sbjct: 181 TFKDAITGIGSMEELVQKIGVGNPMPFSYNLDFRNLLSSLKGGVYSYINGESSVAVATRF 240
Query: 241 QRAFMKLLKNAQFEELSATDLVLTSALNTDYLLTLPIYVDWKRASESNAIIFRRGYATER 300
QR+F++LLKNA+FEELSA DL LTSALNTDYLLTLPIYVDWKRASESNAIIFRRGYATER
Sbjct: 241 QRSFIQLLKNAEFEELSAMDLGLTSALNTDYLLTLPIYVDWKRASESNAIIFRRGYATER 300
Query: 301 QKGLLIVDKLDYLQSRLLRGFFSIISKPVGRLGTWIAEVVLGAPQMQEIQEWVKRLRLWV 360
Q GLLIVDKLDY+QSRLLRG FSII+KP+GRLGTWIAEVV GAPQM EIQEWVKRLRLWV
Sbjct: 301 QTGLLIVDKLDYIQSRLLRGLFSIIAKPLGRLGTWIAEVVYGAPQMPEIQEWVKRLRLWV 360
Query: 361 SELPLSQQLFRYDEEDSDDLLRDNRILDRDLPIWLAAQSAVSRYEGILSSMGPRGRLLRR 420
SELP SQQLFRYDEEDSD LL DNRI D+DLPIWLAAQSAVSRYEGILSSMGPRGRLLRR
Sbjct: 361 SELPTSQQLFRYDEEDSDGLLSDNRISDKDLPIWLAAQSAVSRYEGILSSMGPRGRLLRR 420
Query: 421 LLTWIGLLPPMPEQPFKLNDDSKASEPYLRPIFISRITLSDIWRPAMKNCGNDIWKRLKT 480
LLTWIGLLPPMPEQPF NDDSKASEPYLRPIFISRI+LSDIWRPAMKNCGN+IWKRLKT
Sbjct: 421 LLTWIGLLPPMPEQPFTPNDDSKASEPYLRPIFISRISLSDIWRPAMKNCGNNIWKRLKT 480
Query: 481 SISILLSQSVLQEPAFQELILLYTK-EESSGDKTEVPSLQLKIYEEIPIPDLPVIFPDKK 540
SISILLSQSVLQEPAFQELILLYTK +SGDKTEVPSLQLKIYE+IPIPDLPVIFPDKK
Sbjct: 481 SISILLSQSVLQEPAFQELILLYTKGGRNSGDKTEVPSLQLKIYEKIPIPDLPVIFPDKK 540
Query: 541 LSFRIIDAV 549
LSFRIIDA+
Sbjct: 541 LSFRIIDAL 549
BLAST of Lcy05g000130 vs. NCBI nr
Match:
XP_023549142.1 (uncharacterized protein LOC111807589 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 922.9 bits (2384), Expect = 1.3e-264
Identity = 475/550 (86.36%), Postives = 504/550 (91.64%), Query Frame = 0
Query: 1 MLSSLQRPTGLRSSSHLFSPIPHAPRQCLLLPCSSSLTGFPEMSTQPLESNAAEVSTFKQ 60
ML+SLQ PTGLRSSS LFSPIPHA R L LPCSSSLTGFP++STQPLESNAAE S F Q
Sbjct: 1 MLASLQLPTGLRSSSRLFSPIPHAHRHSLPLPCSSSLTGFPDISTQPLESNAAEASRFNQ 60
Query: 61 WRKNDDDMADDEYQDKGISRIPVPRQKYIPVSKVQLLDAIVSTFFNSNLDDDDDDDG-DA 120
W N+ DMADD++QDKGISRIPVPR K+IPVSK QLLDAIVSTFFNSN DDDDDDD DA
Sbjct: 61 WPNNNGDMADDDFQDKGISRIPVPRHKHIPVSKAQLLDAIVSTFFNSNHDDDDDDDDPDA 120
Query: 121 QHFLLLSSCLDSILHAEHKKILEEMRNDYSLSQSLNNEATFDEVSTNTDGQLVSNEKEEF 180
QHFLLLSSCLDSILHAEHKK LEEMRNDYSL+QSL NEA E STNTDGQ VSNEKEE
Sbjct: 121 QHFLLLSSCLDSILHAEHKKTLEEMRNDYSLTQSLENEANSGESSTNTDGQTVSNEKEES 180
Query: 181 ISAKDGMTGVGSMEDLVQKIGVSNPMPFSYSLDFRNLMSSPRGGANSYINGESPVAVATR 240
I++KD +TG+GSME+LVQKIGV NPMPFSY+LDFRNL+SS +GG NSYINGES VAVATR
Sbjct: 181 ITSKDAITGIGSMEELVQKIGVGNPMPFSYNLDFRNLLSSLKGGVNSYINGESSVAVATR 240
Query: 241 FQRAFMKLLKNAQFEELSATDLVLTSALNTDYLLTLPIYVDWKRASESNAIIFRRGYATE 300
FQR+F++LLKNA+FEELSA DL LTSALNTDYLLTLPIYVDWKRASESNAIIFRRGYATE
Sbjct: 241 FQRSFIQLLKNAEFEELSAMDLGLTSALNTDYLLTLPIYVDWKRASESNAIIFRRGYATE 300
Query: 301 RQKGLLIVDKLDYLQSRLLRGFFSIISKPVGRLGTWIAEVVLGAPQMQEIQEWVKRLRLW 360
RQ GLLIVDKLDY+QSRLLRG FSII+KP+GRLGTWIAEVV GAPQM EIQEWVKRLRLW
Sbjct: 301 RQTGLLIVDKLDYIQSRLLRGLFSIIAKPLGRLGTWIAEVVYGAPQMPEIQEWVKRLRLW 360
Query: 361 VSELPLSQQLFRYDEEDSDDLLRDNRILDRDLPIWLAAQSAVSRYEGILSSMGPRGRLLR 420
VSELP SQQLFRYDEEDSD LL DNRI D+DLPIWLAAQSAVSRYEGILSSMGPRGRLLR
Sbjct: 361 VSELPTSQQLFRYDEEDSDGLLSDNRISDKDLPIWLAAQSAVSRYEGILSSMGPRGRLLR 420
Query: 421 RLLTWIGLLPPMPEQPFKLNDDSKASEPYLRPIFISRITLSDIWRPAMKNCGNDIWKRLK 480
RLLTWIGLLPPMPEQ F NDDSKASEPYLRPIFISRI+LSDIWRPAMKNCGN+IWKRLK
Sbjct: 421 RLLTWIGLLPPMPEQAFTPNDDSKASEPYLRPIFISRISLSDIWRPAMKNCGNNIWKRLK 480
Query: 481 TSISILLSQSVLQEPAFQELILLYTK-EESSGDKTEVPSLQLKIYEEIPIPDLPVIFPDK 540
TSISILLSQSVLQEPAFQELILLYTK +SGDKTEVPSLQLKIYE+IPIPDLPVIFPDK
Sbjct: 481 TSISILLSQSVLQEPAFQELILLYTKGGRNSGDKTEVPSLQLKIYEKIPIPDLPVIFPDK 540
Query: 541 KLSFRIIDAV 549
KLSFRIIDA+
Sbjct: 541 KLSFRIIDAL 550
BLAST of Lcy05g000130 vs. NCBI nr
Match:
XP_022992156.1 (uncharacterized protein LOC111488573 isoform X1 [Cucurbita maxima])
HSP 1 Score: 919.1 bits (2374), Expect = 1.8e-263
Identity = 474/549 (86.34%), Postives = 503/549 (91.62%), Query Frame = 0
Query: 1 MLSSLQRPTGLRSSSHLFSPIPHAPRQCLLLPCSSSLTGFPEMSTQPLESNAAEVSTFKQ 60
ML+SLQ PTGLRSSS LFSPIPHA R L LPCSSSLTGFP+++TQPLESNAAE S F Q
Sbjct: 1 MLASLQLPTGLRSSSRLFSPIPHAHRHSLPLPCSSSLTGFPDITTQPLESNAAEASRFNQ 60
Query: 61 WRKNDDDMADDEYQDKGISRIPVPRQKYIPVSKVQLLDAIVSTFFNSNLDDDDDDDGDAQ 120
W N+ DMADD++QDKGISRIPVPRQK+IPVSK QLLDAIVSTFFNSN DDDDDD DAQ
Sbjct: 61 WPNNNGDMADDDFQDKGISRIPVPRQKHIPVSKAQLLDAIVSTFFNSN-HDDDDDDPDAQ 120
Query: 121 HFLLLSSCLDSILHAEHKKILEEMRNDYSLSQSLNNEATFDEVSTNTDGQLVSNEKEEFI 180
HFLLLSSCLDSILHAEHKK LEEMRNDYSL+QSL NEA E STNTDGQ VSN KEE I
Sbjct: 121 HFLLLSSCLDSILHAEHKKTLEEMRNDYSLTQSLENEANSGESSTNTDGQTVSNGKEESI 180
Query: 181 SAKDGMTGVGSMEDLVQKIGVSNPMPFSYSLDFRNLMSSPRGGANSYINGESPVAVATRF 240
+ KD +TG+GSME+LVQKIGV NPMPFSY+LDFRNL+SS +GG NSYINGES VAVATRF
Sbjct: 181 TFKDAITGIGSMEELVQKIGVGNPMPFSYNLDFRNLLSSLKGGVNSYINGESSVAVATRF 240
Query: 241 QRAFMKLLKNAQFEELSATDLVLTSALNTDYLLTLPIYVDWKRASESNAIIFRRGYATER 300
QR+F++LLKNA+FEELSA DL LTSALNTDYLLTLPIYVDWKRASESNAIIFRRGYATER
Sbjct: 241 QRSFIQLLKNAEFEELSAMDLGLTSALNTDYLLTLPIYVDWKRASESNAIIFRRGYATER 300
Query: 301 QKGLLIVDKLDYLQSRLLRGFFSIISKPVGRLGTWIAEVVLGAPQMQEIQEWVKRLRLWV 360
Q GLLIVDKLDY+QSRLLRG FSII+KP+GRLGTWIAEVV GAPQM EIQEWVKRLRLWV
Sbjct: 301 QTGLLIVDKLDYIQSRLLRGLFSIIAKPLGRLGTWIAEVVYGAPQMPEIQEWVKRLRLWV 360
Query: 361 SELPLSQQLFRYDEEDSDDLLRDNRILDRDLPIWLAAQSAVSRYEGILSSMGPRGRLLRR 420
SELP SQQLFRYDEEDSD LLRDNRI D+DLPIWLAAQSAVSRYEGILSSMGPRGRLLRR
Sbjct: 361 SELPTSQQLFRYDEEDSDGLLRDNRISDKDLPIWLAAQSAVSRYEGILSSMGPRGRLLRR 420
Query: 421 LLTWIGLLPPMPEQPFKLNDDSKASEPYLRPIFISRITLSDIWRPAMKNCGNDIWKRLKT 480
LLTWIGLLPPMPEQPF NDDSKASEPYLRPIFISRI+LSDIWRPAMKN GN+IWKRLKT
Sbjct: 421 LLTWIGLLPPMPEQPFTPNDDSKASEPYLRPIFISRISLSDIWRPAMKNYGNNIWKRLKT 480
Query: 481 SISILLSQSVLQEPAFQELILLYTK-EESSGDKTEVPSLQLKIYEEIPIPDLPVIFPDKK 540
SISILLSQSVLQEPAFQELILLYTK +SGDKTEVPSLQLKIYE+IPIPDLPVIFPDKK
Sbjct: 481 SISILLSQSVLQEPAFQELILLYTKGGRNSGDKTEVPSLQLKIYEKIPIPDLPVIFPDKK 540
Query: 541 LSFRIIDAV 549
LSFRIIDA+
Sbjct: 541 LSFRIIDAL 548
BLAST of Lcy05g000130 vs. NCBI nr
Match:
XP_022954141.1 (uncharacterized protein LOC111456493 isoform X2 [Cucurbita moschata])
HSP 1 Score: 906.7 bits (2342), Expect = 9.4e-260
Identity = 469/549 (85.43%), Postives = 496/549 (90.35%), Query Frame = 0
Query: 1 MLSSLQRPTGLRSSSHLFSPIPHAPRQCLLLPCSSSLTGFPEMSTQPLESNAAEVSTFKQ 60
ML+SLQ PTGLRSSS LFSPIPHA R LPCSSSLTGFP++STQPLESNAAE S F Q
Sbjct: 1 MLASLQLPTGLRSSSRLFSPIPHAHRHSRPLPCSSSLTGFPDISTQPLESNAAEASRFNQ 60
Query: 61 WRKNDDDMADDEYQDKGISRIPVPRQKYIPVSKVQLLDAIVSTFFNSNLDDDDDDDGDAQ 120
W N+ DMADDE+QDKGISRIPVPR K+IPVSK QLLDAIVSTFFNSN DDDDDDD DAQ
Sbjct: 61 WPNNNGDMADDEFQDKGISRIPVPRHKHIPVSKAQLLDAIVSTFFNSNHDDDDDDDPDAQ 120
Query: 121 HFLLLSSCLDSILHAEHKKILEEMRNDYSLSQSLNNEATFDEVSTNTDGQLVSNEKEEFI 180
HFLLLSSCLDSILHAEHKK LEEMRNDYSL+QSL NEA E STNTDGQ VSNEKEE I
Sbjct: 121 HFLLLSSCLDSILHAEHKKTLEEMRNDYSLTQSLENEANSGESSTNTDGQTVSNEKEESI 180
Query: 181 SAKDGMTGVGSMEDLVQKIGVSNPMPFSYSLDFRNLMSSPRGGANSYINGESPVAVATRF 240
+ KD +TG+GSME+LVQKIGV NPMPFSY+LDFRNL+SS +GG SYINGES VAVATRF
Sbjct: 181 TFKDAITGIGSMEELVQKIGVGNPMPFSYNLDFRNLLSSLKGGVYSYINGESSVAVATRF 240
Query: 241 QRAFMKLLKNAQFEELSATDLVLTSALNTDYLLTLPIYVDWKRASESNAIIFRRGYATER 300
QR+F++LLKNA+FEELSA DL LTSALNTDYLLTLPIYVDWKRASESNAIIFRRGYATER
Sbjct: 241 QRSFIQLLKNAEFEELSAMDLGLTSALNTDYLLTLPIYVDWKRASESNAIIFRRGYATER 300
Query: 301 QKGLLIVDKLDYLQSRLLRGFFSIISKPVGRLGTWIAEVVLGAPQMQEIQEWVKRLRLWV 360
Q GLLIVDKLDY+QSRLLRG FSII+KP+GRLGTWIAE M EIQEWVKRLRLWV
Sbjct: 301 QTGLLIVDKLDYIQSRLLRGLFSIIAKPLGRLGTWIAE-------MPEIQEWVKRLRLWV 360
Query: 361 SELPLSQQLFRYDEEDSDDLLRDNRILDRDLPIWLAAQSAVSRYEGILSSMGPRGRLLRR 420
SELP SQQLFRYDEEDSD LL DNRI D+DLPIWLAAQSAVSRYEGILSSMGPRGRLLRR
Sbjct: 361 SELPTSQQLFRYDEEDSDGLLSDNRISDKDLPIWLAAQSAVSRYEGILSSMGPRGRLLRR 420
Query: 421 LLTWIGLLPPMPEQPFKLNDDSKASEPYLRPIFISRITLSDIWRPAMKNCGNDIWKRLKT 480
LLTWIGLLPPMPEQPF NDDSKASEPYLRPIFISRI+LSDIWRPAMKNCGN+IWKRLKT
Sbjct: 421 LLTWIGLLPPMPEQPFTPNDDSKASEPYLRPIFISRISLSDIWRPAMKNCGNNIWKRLKT 480
Query: 481 SISILLSQSVLQEPAFQELILLYTK-EESSGDKTEVPSLQLKIYEEIPIPDLPVIFPDKK 540
SISILLSQSVLQEPAFQELILLYTK +SGDKTEVPSLQLKIYE+IPIPDLPVIFPDKK
Sbjct: 481 SISILLSQSVLQEPAFQELILLYTKGGRNSGDKTEVPSLQLKIYEKIPIPDLPVIFPDKK 540
Query: 541 LSFRIIDAV 549
LSFRIIDA+
Sbjct: 541 LSFRIIDAL 542
BLAST of Lcy05g000130 vs. TAIR 10
Match:
AT2G46915.1 (Protein of unknown function (DUF3754) )
HSP 1 Score: 449.5 bits (1155), Expect = 3.8e-126
Identity = 254/484 (52.48%), Postives = 330/484 (68.18%), Query Frame = 0
Query: 70 DDEYQDKGISRIPVPRQKYIPVSKVQLLDAIVSTFFNSNLDDDDDDDGDAQHFLLLSSCL 129
DD +++GIS I VPR+KYI VSK L++ IV+ D D GDA FLLLSSCL
Sbjct: 87 DDSEEEEGISSIHVPREKYINVSKSDLVNGIVTKLL------DSQDGGDADIFLLLSSCL 146
Query: 130 DSILHAEHKKILEEMRNDYSLSQSLNNEATFDEVSTNTDGQLVSNEKEEFISAKDGMTGV 189
DSILHAEHK+ILE+MR D+ +QSL E + N + ++ +G++
Sbjct: 147 DSILHAEHKRILEQMRADFVATQSLEEEE-------------LKNSEPRSVNGYEGLS-- 206
Query: 190 GSMEDLVQKIGVSNPMPFSYSLDFRNLMSSPRGGANSYINGESPVAVATRFQRAFMKLLK 249
P + D N + S G ++ V ATRFQR+F++LL
Sbjct: 207 ---------------FPLADGFDIWNFLIS--SGKHAKKRSAESVMAATRFQRSFIQLLD 266
Query: 250 NAQFEELSATDLVLTSALNTDYLLTLPIYVDWKRASESNAIIFRRGYATERQKGLLIVDK 309
NA FEELSA DL LTSALNTDYLLTLP+YVDWK+ASESNAI+FRRG+ATE++KGLL+V+K
Sbjct: 267 NAGFEELSARDLALTSALNTDYLLTLPVYVDWKKASESNAIVFRRGFATEKEKGLLLVEK 326
Query: 310 LDYLQSRLLRGFFSIISKPVGRLGTWIAEVVLGAPQMQEIQEWVKRLRLWVSELPLSQQL 369
LDY+QS++L+ FS I+KP+ ++G I + + A Q QEIQ+ + +++W+ +L L ++
Sbjct: 327 LDYIQSKVLQVIFSTIAKPLRKVGKLINKALSEASQTQEIQDLSEGMKVWLKDLSLFKE- 386
Query: 370 FRYDEEDSDDLLRDNRILDRDLPIWLAAQSAVSRYEGILSSMGPRGRLLRRLLTWIGLLP 429
Y ++ SD+ L+D + D LP+ LAAQ AVSRYEG+L+ +GPR +L R+LL WIG +
Sbjct: 387 -SYLDQTSDNFLKDGFLPDSVLPMQLAAQRAVSRYEGLLTPVGPRAKLFRKLLGWIGFIS 446
Query: 430 PMPEQPFKLNDDSKASEPYLRPIFISRITLSDIWRPAMKN-CGNDIWKRLKTSISILLSQ 489
E P +L +DS +SEPYLRPIF+SR+TL+DIW+PA K CGNDIWKR+KTSISILLS
Sbjct: 447 RDYETPSQLANDSSSSEPYLRPIFLSRMTLADIWKPASKKACGNDIWKRIKTSISILLSP 506
Query: 490 SVLQEPAFQELILLYTKEESSGD---KTEV-PSLQLKIYEEIPIPDLPVIFPDKKLSFRI 549
S LQEPAF+ELILLYTK+ S D K E SLQL+I+E IPIPDLPVIFP KKL FRI
Sbjct: 507 STLQEPAFEELILLYTKDASEKDDKNKDETRSSLQLEIFERIPIPDLPVIFPHKKLYFRI 530
BLAST of Lcy05g000130 vs. TAIR 10
Match:
AT5G13940.1 (aminopeptidases )
HSP 1 Score: 45.8 bits (107), Expect = 1.3e-04
Identity = 21/64 (32.81%), Postives = 42/64 (65.62%), Query Frame = 0
Query: 476 KRLKTSISILLSQSVLQEPAFQELILLYTKEESSGDKTEVPSLQLKIYEEIPIPDLPVIF 535
++LK S+S L+ + +QEP F+ +I++Y + SG K ++ +K ++ IP+ D+ ++
Sbjct: 525 EKLKLSLSNLMKKITIQEPTFERIIVVYRR--VSGKKESERNIYVKHFKTIPMADMEIVL 584
Query: 536 PDKK 540
P+KK
Sbjct: 585 PEKK 586
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1GRM1 | 1.6e-265 | 86.52 | uncharacterized protein LOC111456493 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1JSS6 | 8.9e-264 | 86.34 | uncharacterized protein LOC111488573 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1GQ20 | 4.5e-260 | 85.43 | uncharacterized protein LOC111456493 isoform X2 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A5D3BXV4 | 1.0e-259 | 85.06 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... | [more] |
A0A1S3CHX6 | 1.7e-259 | 85.30 | uncharacterized protein LOC103500618 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
Match Name | E-value | Identity | Description | |
KAG7013933.1 | 8.8e-266 | 86.73 | hypothetical protein SDJN02_24102 [Cucurbita argyrosperma subsp. argyrosperma] | [more] |
XP_022954140.1 | 3.3e-265 | 86.52 | uncharacterized protein LOC111456493 isoform X1 [Cucurbita moschata] | [more] |
XP_023549142.1 | 1.3e-264 | 86.36 | uncharacterized protein LOC111807589 isoform X1 [Cucurbita pepo subsp. pepo] | [more] |
XP_022992156.1 | 1.8e-263 | 86.34 | uncharacterized protein LOC111488573 isoform X1 [Cucurbita maxima] | [more] |
XP_022954141.1 | 9.4e-260 | 85.43 | uncharacterized protein LOC111456493 isoform X2 [Cucurbita moschata] | [more] |