Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAAATAAAAAAAGAAGAAAGAAAAAAACCACAACATTTTGAGCTTTCTTTCGTGTGCTCTGGTCGACATTTTTGGAGCTCCAAATGATAAGATGGCAAAGCAGGCGAATTCTGTATTCCTCGAAGAATGGTTGAATAGCACCAGTGGAACTAGCAGTGCCCTTAACTTCAAAACCACTTCCTCGTCTGCTCGAGAAATCATCCAAGCATGGGCTGAGCTTAGAAGCTCTTTAGAGCATCAATCGTTCGATGATCGCCACATTCAATCGCTCAAAACTCTTGTCAACTCACAATCTTCACTCTTTGTTGCAGACCCACAAGCTAAGCTGGTGATTTCTATACTCTCTTCTCCCAATTTCCCCCTTCCCCATGAATCGTATCCTCTCTTTTTGAGGGTTCTTTATATTTGGGTCAGAAAATCTCTCCGCCCCTCTTTGCTACTTGTCGATTCATTCGTTGAGGTTCTCTCTCAGATTTTTTCATCCAAAATTGAATTGGGAAAAAACCCTTTGTTCTTCTCGGAAGTGGTTTTGGTTTTGGGTGCTATTTCGTACATTCCTTCAGCTTCAGAAAATTCAAAATTATGCTGTTTGGAGTTGCTTTGCAGGGTTTTGGAAGAAGGGTACCTGCTGGTTGGATCAGTAGGAGGGATGATTCCAGGGTTTCTTGCAGGTATTGGTTATGCTTTATCTTCATCAGTGAATGCTCATGTTGTTAGACTGTTGGATTCTTTATTAGGAATTTGGGGTCAGGTAGGTGGCCCTACGGGCAATCTTTCTAGTGGGTTAATGATCCTGCATTTGATTGAGTGGGTGACTTCTGGTTTGATTACCCTTCATTCTTTTGAGAAATTAGATGTTTTTAGCCGAGCTACTTTAGAGTCTTCAAAGGAAAGTTATGCTTCATTTGCTGTTGTTATGGCTGCAGCTGGAATATTGAGGGCTTTTAATACTTGCAATGTTTTGTTGAGTAGTTCAGAAAGAGAAACGATATCTAGAATAAGGATTTCAGCTCAGGATTGTTTAGAATCTATATCGAGAAATTTTATTTCGACTATGGAAGGATCTTCCGTTTCAGGCAATGACCGTAGTAGGAGCCTGTTTCTATTGTGTATTTCATCTGCAATAGCACGTTGTGGCCCTGTTTCATCTCGCCCACCTGTGCTCATTTGCATTATTTATGCTTTGTTGAGTGAAATATTTCCTTTGCAGCGTTTATATGCCAAAATTCTTGAATTCTCTTCTGTTGAGTCGGGTGCGTTGGGGCTACCTCTAGTAAAAGAGCATCTTGATAGTATTCCTTTTAAGGAAGCAGGGGCCATCACGGGTGTTCTTTGCACTCAGTATGCTTCAATTGACGAAGAGGACAAAAAGATTGTGGAGAATCTTGTATGGGATTACTGTCATGACATCTACTCAAGGCACAGACTAGTAGGGTTGGTGCTTCGTGGCAGAGAGGATGAATTACTAGAGGGTATAGAGAAAATTGCAGAGTCTGCTTTTTTCATGGTTGTAGTATTTGCATTAGCTGTCACAAAAGAAAAGTTGGATTCCAAATGTACACCGGAAAGTCAATTTGACGTATCAGTAAGAATACTTGCTTCATTCTCTTGTATGGAATACTTTCGGCATATTCGCTTGCCAGAATATATGGATACTATCCGAGGGGTTGTTGCAAGCATTCAGGAGAATGAGTCTGCCTGTGTCTCTTTCATTGAATCAATGCCCACATACAAAGATCAAACAAACGGGCCAGGTCTTGCATTCAAGTTATTTTACTTGGCCATGAAGATTTAATTATTTTGGCTTTTATGCTAATATTGTTATATGCTCATCAAGCATTTGAAATTTGAAATTCATCTTTTTGTAGATAACTTTATTGGGCAGAAATTAAAATATACATGGACAGAGGATGAGGTGCAAACTGCACGCATGCTGTTTTATATACGAGTCATTCCAACTTGCATCGAACGTGTTCCTACCCAAGTGTATAGGAAAGTGGTTGCTCCAACAATGTTTTTGTATCCTATTCTAGCTACCGCGTTCTTGCAAATTATGCTTTCTTTCTTCTTTCATCTTTTTCCTCATGAACTATTTGTTTCCCATTACCAAGTGAATTAGATATATGGGACATCCTAATGCCAAAGTAGCCCGAGCTTCACACTCGGTGTTTATAGCTTTCATATCAGGGAAGGATGACAGTGAAGAAGAAAAGAGAGTGATATTGAAGGAGGAGCTTGTTTTCTACTACATTGAGAGATCTTTATCAGTATCACTCTTAACAAGATTTTTTTTTCTTTATTTATGTGATCAAACTTTAGATTTACAATTTGAGTTAGATATTGAGCATTCAAATTCTATGATTTTAGGGATATCCTGGCATTACACCATTTGAAGGTATGGCTTCAGGTGTTGCAGCTTTGGTGCGATATCTTCCTGCAGGAAGTCCAGCCATCTTTTATTGTATCGAAAGTCTTACTGTAAAAGCTACAAGCCTTTGCAGTGAAAACTTCATGGAAAATGCTGATCTGTGGAAGTCGTGGCAGGGAGACTTAGAGCCTTCCAAGAAAATTCTTGATATGCTTTTACGGCTCATTTCTCTTGTTGATATACAGGTAACTATCTTAAGATGCTTGTTCTTCCATATTTAGTACTGAATTCATTCTGTTATTTCTCTTTTGAACAATATATTAACTTATCTTTTAATTGAATGTTAATTTTAGGAAATTAGATCTGTAGCCAAGATATGATTCAATTAGATGATTCATGTATTTGATGTTCTTGAATTAGTAGTTACTGACCATCGTGAAGTAACATTTGGGTCTTGGGGAGCAAAGAGGTGATGAGTTTTAATCCATGGTAGCCACCTACCTAGGAATTAATTTTTTGTAGGTTTCCTTGACATCCAAATGTTGTAGGGTGGACGGTATGTCCCTTAACATTAGTCAAGGTGTGCACAAGCTGGCCTGGACAATCACAGATATCTAAAAAAAAGAAGAAGAAGAATTAATAGCTACTGATGGAATAATTACTTTTGAACTTCTATTAGTAGTTATTACAGTTATTCTGTTTTCTATGGAATTCTTCTTTGGATAAAGTACAATTTTAATTGATGCATGGAATCCTGGATTGTGGGTTCTTGCACTGTTCAAAGCAACCGGGACTCTACTTCGCTATTCTGCATAACTAGGAATGATTTTGATCAAGGAAAGAAATCGACCATGTATCATCCTGAATTTCTATCTGTTTTTTTTTTTTTTTTTTAGTTTGGATTTTACATCTGGGATATTTTAGGTACTCCCAAGCTTGATGAAAAATTTAGCACAACTGATCATCAAGCTACCGGCCGAGGGCCAAAATATGGTTCTTGATCAGTTATACTCCCTGGTTTCAGAAGCTGATGATGTCACACGCAAACCCTGGCTAGTTTCATGGTTACAATCATTATCATATCTATGCTGCCAATCTAAAAGTGGAGATACACACTCTAATGAGATGCAAAATATACGGCTTTTAAACTTCTCTTGGATTGTTGACCCATTGAACCGTTTTCGATCCTATGCACGACTCTGAGGTATGAATTTTTAGTTCTTGGTTTAATTTTCTCCCTTCCATGTTAATGTTAAGTTTATTATATGCATTTGCGACAGTAGTTTACTTGCTTACCTATTATTCTGTACATCTAATATTTTCCTTTATGATTTTATCGTAGTTGTGTTTAACAATTAGAAGCTGTGTGCATGTGTTAAAAAGAAAAATTTTATGGTTTGGATGAATATATTTTATGTAATTGTGAAGACTGAAGACAAATACTGTATGTTTCTAATTGTTGATTAGAATAATAGAAGGTTACATATATAGGCATTAAAGGAAAGGAAACCCTAACAAGAAAGTTGAAATTACCATAAAGGACAATAAACTAATAAACCCCTAAAGAATCTTGTTGGGTTCTATCCTCTCGGTTGTTTTTTACATTTTTTCTTTTGCGGATAAGGGGTAGTCTTTCAGAAAAGTGTTAATCTATGGACTATCTTCCAGAAGCTTTCTCAGGAGGTGAAGAACATTCTGATAAGTGGTCAAACCATCTCAAGGAAGTCCTCTACTTTGGTTGCCTAGTCGATGTTCCTCTTTCCAATTTAGGAGTTTGGGGAGGAGAGGAGAGGAAGGAATCTCAACCCCCTTATTTGTTTTTGGTACATCACAACCCCTCCCAACTCCTACTCCACTCATTTTCCAACATCCCATCAAATATCTTATCTAATCCCATCAAATATTATCATAATAAAATATATTAAACTTAATAAATTATAAATGAGTACAAATATACGATGATTACAATAAAAAATAATAAAAATATAAAATAAACATCTTCTCCCCTCAACACCAATCCCTCCTCTCCTCTCATAACACCAATTTCCAAACAAGTATTTCATCCTTCTAAATCACTCCAGTTCCTCTACCTCTTACACTCCCATCCTCTCCTCCCCAAACTCCAAAATCCAATCACCCCCTCAGGATGAAAATGGTCAGATTTATGGAGATGTATGCAGTTAGAGCTCTGTTGTGGTTGTTGCTGATTGATACAAATCAGACAATTTTTTAGGATAGACAGGAACATATAGATTTTGTGATATTTTTCGCATATTGCCTCCTGTTGGAGTGATCTTCTCAAGATCTTCCTAGGAACAGGCTCTTTTCTTGATTTATCTTGATTCAAGGTGTATTTTTTGGTATAGATTCAAGGTTGTTTCCCATTCTTCTCCCCTCAGGATGAAAATGGTCAGATTTATGGAGATGTATGCAGTTAGAGCTCTGTTGTGGTTGTTGCTGATTGATACAAATCAGACAATTTTTTAGGATAGACAGGAACATATAGATTTTGTGATATTTTTCGCATATTGCCTCCTGTTGGAGTGATCTTCTCAAGATCTTCCTAGGAACAGGCTCTTTTCTTGATTTATCTTGATTCAAGGTGTATTTTTTGGTATAGATTCAAGGTTGTTTCCCATTCTTCTCCCCATTTTTTTTTTCTGAATCATGTTGGTTATTAACAAATAGCAAATCAACTAAAAACTAAAGCTCAACTGCTTATCCAGTTATATTTAATACATTATGTTGGAGATATTCACCTTGTGGCAACGGTTCTGTTGGTTGATATAATTAAATTTGCCCTAATCCCTGAGCTTAGGCTTTTGGATTGATTGGTGGTTTAACATAATTCAACATGGTATCAGAGCAGGAGGTCATGAGTTCAAGTCCCTGCCAAGTCGTTTCCTCCCCAATTAAAATTAAATTCCACTTGTAGGGCATTTCTCAAATTTCAAAGCCCACAAGTGAGGGGGGGTGTTGGTTGATATAATTAAATTTTCCCTAATCCATGAGCTTAGGCTTTTGGATTGATCGGTGGTTTAACATAATGATAGAATCTACCACTATTTTATTAAGAACTAAGATAATCTCTCGAGAGGATCTCTCAAAGAGCCCAAATACAAAGACAACAACTGTTTGAATGAATTGCTAAAGGCCTAACATATCCTATATATAGTAAACCTAGAAGATGACCTAATTTAGTAATAACTCTAATAAGGCCCATCATTGATAATACAACTGAATTACAATTAAAGACTATTAAATATAATTAATAAATACTACTTTCATAATGTCTTGTTCCTCCTCGAGTAAACCTTCACGTGGGCCCGGTTCGTATCAATACTTCCCCCAAAGAGCCACCTTGTCCTCAAGGTGAAAGTCGAGAAACTGGAGCTCAAGATCCCCGGCGCCTCCCATGTAGCCTCGTGCAGGTGGACCCCTCCCACTCGAATCAAAACTTGGCGCGAACCCCCTTCGAGCGGGTCCTCCCATGCCCAACACAAACCTTAGGGCAAACCACGACACACAAATCTTCCGTCACCATGGACGGCGTAGCCAGAACTAACACCGAAGAACCCACCGCCTTCCGCAGGGCGGACACATGAAAAACCGGATGAATCTTCACTGATGGAGGCAACTCGAGTCGATATGCCACTCGCCCAACCTGTGCCAAAACACGATACGGACCAATAAATCGGGTGCTAACTTGGGATGTTTAAACCGCGCGAGAGACGATAGTCGGTAAGGTCGGAGTTTAATATAAACCAAGTCATCAACAAAGAAAGCCCGAACATCGCGGCGATTGACATTCGCGCGATCCGACATAGACATTTGTGCACGCGATAAATTAGCCTTCAAAGAATCCAACATTCGATCTCTTTCTAACATCAACGAGTCAATCGCCGCCACGGGGACTAGCTCCAAATCATATCCCGAATCGAGGCGGGCGGCCGTAACGATCGCAAAAAGGCGTCATGCTCGTGGGCGAATGAAACGATGTGTTAAAACTGAATTCACCCAAGACAACCATTTATACCATGCCTTCTGTTGAGTCATCACAAAACATCGTAGGTAGGACTCCAAACAACGATTTACAACCTCGGTTTGGCCATCCGTTTGAGGATGGTATGTAATCTTTGCGGCGAAGTTTCGTTCCCAAGCCTTAAACATTTCCTCCCACAAGAAGCTAGTGAAAATCTTATCCCGATCGACACCACGATGCTTTTAGGTATCCCATGTAAGCGCACCACCTCCTTTATAAAAACACGAGACACGGATAAGGAAGTGAAGGGGTGGCGTAGGGGAATGAAATGCGCATATTTAGACAATCACAACCAACCACGACTAAAATAGTGTCAAACCCCTCCGAACGCGCAAACCCTCAACAAAGTCCATGGAGATGTCCTCCCAAATTTGTTCGGGAATCGGTAATGGTCGTAATAATCCTGCCGGCGATAATGACAGTGTGTTTGGCACGCACACGATCGAACATTCCGCCACAAAGGCGCGAACACGAGCCTTCATCCCTTGCCAATAAACTTCCTTAGCAAGACGCGATATGTTTTTAAGACTCAAAATGTCCACCGACAAATTTCATGGAATTCTAGCAATAATAGAGGATCGTTGGTGACGTCGCAGAATAACACCAATCTGCCTGGTAAAGCAAAACATCGCCTGGCAATCAATACCTGGTGGACCCTCCTCACCGCTTTGAGAGCCGTAAAAATAGGCGATAATTTTGCATCCTCTTTAATCTGCTGCGTAAACACACTTTGTGTTAACCCCGCCACGACACATCGAGCATTCCCAACTCACCCGATAGAGGCATCCGGAAAGAGCATCACCGCTGCTCGATTCTCAAGACCCTTCTTGTACTCAATCTTCGAAATCATACCCCATAAGCTTAGCGATCCATCGATAATAATCACCATCCACCACCCGTTGCTCTAGTAAAAATTTAAGACTTTTACGGTCCGTGCGTACAATAAAATGGTGTCCCAATAAGTACGCCCTCCATCATTTGAGCAGCGAACACGATAGCCATTAACTCCGCTCATAGACAGCTTAACGCGATGAGTGATCGGCAAAGCCCTACTAAAATATGCCATCGGTTGGCCTTGCTTTGCATTAACACCGCACCCACTCCAATTCTGAAGCGTCGGTTTCGACCACAAATCCTGGTTGAAATCCGTGTCGGAGGATCGGAACATCTTTGCTCATGGCATGTTTCATTCTCTGAAGCGATCCTCGGCAGCCTAGCCCCCACTCAAATTTCCCTTCTTGATAGAATCGTGTTAAGGGAAAAGCCATCGATCCATAGTTAGCTACGAATCGCCGTAGTATCCCGTGAGACCTAGGAATCCCCTTAATTCTTTAATATTCCCGGGACTCGGCCATCCCACCATTGCTTCAATCTTTGCCGGGTCCATGACACGCCTTCCTTGAGATATGAAATGCCCCAAGTACTCGATACGGCGCAACCCAAACTGGCATTTCTTGGCGTTGGCCACAAACGCATGTTCAATCAAAATCTCCAAAACCTGAGCCAGGTGCTCCCTATGCTCCCTAATGGTCATACTGTAAATGAGAATGTCATCAAAGAAAACCAACACAAACTTACGCAGATCTGGGGCGCAAAATGTCGTTCATAATAGACTGGAAGGTGGGTGGAGCATTTCTCAATCCGAAAGGCATGACAACGAACTCATAATGGCCTCATGGGGGTTCTGAACGCTGCTTTGTGCACATCCGTAGGATTAAACGAATCTGAATCTGGTAACCGGCCTTCAAATCAATTTTGGAAAAAATCGTCGCTCCATGAAGTTCATCCGAAGCTCATCCACCAAGGGAATGGGATACTTATCTGAGTGCGGTGACATGATTCAGAGCCACAAATCAACACAAAATCGCCAACTACCATCCTTCTTCCTTACCAAAGAAGATCGGACTCGAAAATGCACTCGTGCTAGGGCGGATTATCCTCGCCATAACATTTCTCGCACCAATTTTTCAATCTCATTTTTACGGTATTGCGGATATCGATACGGACGCACATTCACGCGAATCGCTCCCGCCATCAATTCGATGGAGTGATCCTGTTCCTCGGTGGTGGTAATCCTGTCAATGACTCAAACACGGGTGAGCACGAATTAATTAAGGAGTGCGCGATTCACTGGGTACTGAGTTAAGTCGGGAAGGTTTTTCGTCTGACCAGCCTCAGCACCTGTGGTCTCAATCATATTCAACTCCACTAATGACCCCTCGTCCTCGGGTCGTAAGGATTTCATCATAGATTTCAACGACACTGCGCCTTAACCAGCGTGGATCCCCTGTAGTAATTCGGCTTCCCATGAGCCCAATACAAATCGCATCTGCAAGGAGCGGAAATTGAATTCAATCTTCCCAAGGTCTCCAACCAAGCTACCCCTAGGATCACATCAAGACTTCCAGGGGTAAAGGAAGGAAATCGTTAACTACTTTAAGCTCTGCTAAATGGAGTTCCACATTTTTGCAAATTCCTGCTGCCCTTACCGACTTGCCTGTTCCCAGCATAATACCATAATCATGCGATGGTTCCACCTGGAGATTCAAATTTAGCCACGATCACATCGATATAAAGTTATGGGTGGCTCCACCGTCGATAAGAACCACTACCAAGACCTTGAATAGATCCTGTGACTTTCAACGTTTTTGGTGAACTCAACCCCGCCATCGAGTTTAATGACAGTATTTGCTAAGTCCTCACATCCTCTGTGTCACTTGATATCCCCTCATTGTCATTTGCCGAAACTGTCTCTTCGGTAATTCCATCCTCTACAACGCGTATCTCTAGAGCACAGGCTCCTTCTTCTTACAACGGTGCCTGCTCAAATTTCTCATCACAACGAAAACACAATCCTTTATCTTTACGGATCCGAATCTCACTATCCGACAAGCGTTTATAGGGTAATGTGGACGGTGTACTTGTGGTCATCGGCCTGCAGGAAGTTAGGGCTATAGTTCTCAGTTGGTATTTCTGTCGTCCTCGTGCTCGTAGCCCCGATTCCTATCTTTGCTAGTAGATGGCTTAATCTGCTTTGAAGAAGATCCTACCCCTTTTACTTGAGCTCGAAAAACAAGGTCATCCTCAATCACCGAGCCATGAACTTCTTATCCCCGAATACCCACGGTCGCAGCTTCCTCATCTCACTTCGATTTCTTCCTTCAATCCACTTTCCCATTTTCCTCCAACGCACTCGCACTAATATCGCGTATGCCCTTAGCATACTTCTCAAACAGCGCGCCTATACTCTTTCACGGTTCCCACTGTTGTAAACTCATAAGATTTGCATATTTGTTGTCATTGATCGTAGTTTGGAAACGGTGCAACAACAATTCTCGAAACTCCTCCCACGAAGCAATGAAGCACGGTCCTCCTCATCACAGAACCATTCTAGTGCCTCTCCTTCCATACATACACCAAGTCACAGATCCACTCTTTCGTGCCCGCCACACCGATTCACCCGAAATACCTTTCGACGCGGCATAACCATCCGTCCGGATCCTCATCCGTTAGACCCTTGAACACCGGCATTTCCAACTTTCGTAACCTCCGATCAAACATTGGCCCATCCCTACTGCCTATCCCCGGTCCGCCCCTTTCAAATTGATCCTGCCTATCTCCCCAATCCTCGCACTCACTGGCCTCGCGCCATGGATTTCGCCCCGATTGTTCCAACCGTCATGCGCATTCGGCCTTCGTGGCGGCTGCGCACCCTCCGTGCCTCGGGAACCCCGTTTCCCGGTCAAACCCGTCGATCCCAAATTCCGACCTGTTCCGGCCACTGTTCGGCTCAGGCCCGACCTCCCGACCAGACTCGGTCCCCCCACGTCCGGACCGGCCCAGCCCAGCCCGACCTCGCCCGGCCGCGCGCGCGCGCGCGCGCGCCGCCCCGCGCCTGCCTCCACCCGGCCGTCGCTTCACCCGCGCGCACCCGCGCGCCACTTCTCGCGCCCGGCCCGACATCGCCCGGCCGCGCAGCTATCCGCGCGCCCCGCCCTCGCCCGGTCCGACTCCACCCAGCCCGATACCGCCCCAGCCCGCACCGCACCCGCGCGCCCCGCTCGCGCCCGGTCCGACACTGCCCCTGCTCGCGCCGTACCCACGCGCGCCGCCCTCGTCCGGCTCGGTCTCGCCCAGCCCGACACCGCCCCACGCGCGCGTCGCATGCCCTACTCCGCTCACGCGCCAGCGCCATTGTTCCACCCCGACAGCGCCGCTCTGCCCGATGTCGCCTCTGCTTCCTCCAGATTCTCCTCCGCCGTGGCAAAGAATCGCGCCAGCACCTCCCTCCGTCCTCTCGGTCATTAATCCCTTCCCTTTGTCAGCCCGAATCGATTCCATTATTAGTTCTAAATTCCGGGTTATCGACTCAAACTTTAGATCCTGGGAATCTATCTTTTTTCCCAACCGCTCCTCCATCGCTTCCATCTTCCCATTAGTTTCTTGCAACTTCCTCGCCAAGTCGGCCACTCCTTCTTCACATTCTTGTATCCTGGATTCCATCTTCGTAGTAACCATTCCCGGATCGGTAAATGGCTCTGATACCAAAATGATAGAATCTACCACTATTTTATTAAGAACTAAGATAATCTCTCGAGAGGATCTCTCAAAGAGCCCAAATACAAAGACAACAACTGTTTGAATGAATTGCTAAAGGCCTAACATATCCTATATATAGTAAACCTAGAAGATGACCTAATTTAGTAATAACTCTAATAAGGCCCATCAGCCCAATACAACTGAATTACAATTAAAGACTATTAAATATAATTAATAAATACTACTTTCATAATGTCTTGTTCCTCCTCGAGTAAACCTTCAGCGGGCCCGGGTTCGTATCACATAATTCAACAGGTTCTCATATTGGAAAGGTGAGGATTTTACAACACTTTTATAAGGGATATGAGTTACTTCCATCTTAGCCAATTAGTTTTGAGGTGGAATCCCATATTCTCTAATCTAATATCAGAGCTCCACGTGGGTTATTGGTCCCCCAAATTCCAATCCAACCCAAAATGCTTTGAATTCTAAAACGGGTGTGGACCCCACCCAAGCGGGCATAAGTTTAAGCGGGTATTTGATCCCAAAGTTCGGATCTGACCCTAGCAAAAAGTAAGCCAAAGCAGATATTTCGGTCCACAAATTTGGATCCATTTTAAGATGGCCTTTCCCAATATATGGCCACCATCTTGAGCGGGCATATTGGAGATACTCACCTTGTGACAATTGTCTCATATTGAAAATGTGAGGATTCTATAACACCTTTATAAGGGATATGAGTTACTCTCCTCTTAGCTAATTGGTTTTGAGGTGGAACTCCATATTCTCTAATCATATTAGATAAATACGAGGTTCCACCTCAAAACCAATTGGTAATGAAGGGAGTAACTCATATCTCTTATAAATGTGTTAAAACCACAATGTGAGTATCCCCAACACATTCATCATATCATTTAGTGTTGATTTCGGGCTTCTATGTGTATTCATATTTAACGGTTCTCACTTTGATATAATTTAGACCCTGAAAGTACCTATTATGCATTTCTCCAAGTATACTTTTCATCTCACGAACACTAAGTACTGGGATTCTTATATTCAGCAATCGGCGTGAATTTCAGTCCTTTGCATAAACTCCCAAGTATTTTTGTTAGTGCGTCCTACCATCTTACAGTTGCACGTTCTTGGGAATTTAATTTGGCCGTAACTCTGTATTCTGTTTCTATGCAATGACATGAGCTTTTGAAGACAAACTTCTTGTGCTTGAGTTTCAACATAGTTGACGAAACTTGAAATGGAAGACCACTTTTTCTGCTTCTCTATATTCATAGTTGGTTGAACTGCAACTAGAACTAAACCCGACCTGTGTTTTGATTTCCCGACTTTACGCATCGCATCTTATTCCAATCTTACTATGTGTTTTCATAGTTGTTGTCACCCAACCTTGCATGCCTCGAGAACAACATATAGCAGCATTTTTCCACAGTTATGGTCCTGTCCCATGTTTGCAGTTCCAAATTTATCTGAGGATTGCACTCCAATTTTGTCAGAATGCAAAGTAAAAAATGAAAGAAACATTGAAACATTCCTAGTCTGGCCTTCAGTAGAGTTTTGCCCACACTTTGCACCATAAAAGAGATTTATTTTAGAAATGAGCTTCACTAGAAACGCCTAGTCATGACAGACAAAGAAGCTATACATCGTCAACGACTCGACTCCAACATAATTACTCTTATTCATATAGTCGACTCCAACATAATTACTCTACAGCAGTCATCGGTATGAAAACGATGCATCGTTTTCTATCGTATCGAGTTGTTGATCTTCTATTACATCATATATTCCAAAATCTCCCATTGTAATTGTAAGGCTGTGATCGGTCCTTCAGTCATTCATAGTTTACCAAATAAGGAACAAGAGCAATATGCTTGGTTGTTCTCAATAATAAATTTTCATGACTTGGTTGTTTATGTTTATATGCTTGGGGAATATCCTTCATCAATATATTAAACATTTTAAGGATCCATATACTGATGTACTGTATCATGTGTATTGTATAATGTCGATTTTGGAATAGTAGAAATAATTTCTATTGTTTGTATTAATCGTTTATTTACATCTTTAAAATGACCTTTTAAAAAAAAAAATAGGTGAAATTCTATTCACCAAATAAGGTAAATTTAAAACAAGGAATATTATATATGAAATAGTCCATTTAAAGGAGAATATAAAATAGAATAAAATCAAATAATTCAATACTTTCCTTTAAGTTGGTGCAAAGACGTCTGACATCCATTTTGTCAATAAGTCTGACAAATTATAACTTGGGAAGTCCTTTAGTCAAAATATCAATTGTTTGTTCTGTTGTAGGTAAATATGGAATGTAGATTACTCTAGTGTGAATTTTCTTTTTAATAAAATGATTGTCAATTTCAATATGCTTGGTTCTATCATGAAGAAGAGGGTTATGTCAATGAATATAGTAGACTAATTGTCACGATAAACTTGAATAAGAGTTGCTTGTGTGATATTTAGTTCCTCATTGAGTTTTTTTAATCCATATATCTTCACAAATATCATGGGCCAAACCTCTTATATTTGGGCAACCATGAATAAGGGGGAGAATGAGAATGAGAATGATTCCTCGTGTTTGGGCCACTAGGATTTTTTCTTCCATAATCACGTTCTCCAAAAATCACCTATTTTCTTTCTCCTTTTAATTACTTTTCATTCTCCCCACGACAATTTTTTATACATCACTATTTTTCACTCTCATCATATTCATTATTTTATTTTCCTATAATCATTTCATTCACCCCAAACTCGAAATAATCATTCTTTCTATAACTATTCTCTTCATCCCAAATACAAATAATCATTCTCTCCATAATCATTCATTTCATATTCTCTCCCTAATTTTCATTCCCTCTATAATCATTCTAGCTACTGTACTTTGTTTTTTTTTCTTTACTTCTCCAAGCAACAAGATTTCCTCCAACAAAAAAACAATAATTTGATGTAGATCTTATGTCTTTACTGATTCCTGCCCAGTCAGCGTTGATGTAAATTTCAATATCGAAATTCTCATGCTTCTAAAAATAGTATGCTTTTTCTTGGAGGCCCTTTTAGGTATTTCAAGATTCTATAGACAACTTCAAAATGAGTGGGGCATGAAGCTTGCATAAATCTACTTATCATACTTACTGCAAAGGCAATGTCTGGTCTTGTATGACATAGTCTACCCATAAGTCGTTGATAACGTTCCTTATCTCACTTCTTCCTCAATTGTTGCTTGCAGTTTTAGATTTGATTAAATAAGGGTTTCTGTTGTTCTACATCACATTAATATTGTCTTTTTGAGTAAATCAAGAATATATTTTCTCTAATTAAAAAAATACCTTATTTATATCTAGCAAA
mRNA sequence
AAAATAAAAAAAGAAGAAAGAAAAAAACCACAACATTTTGAGCTTTCTTTCGTGTGCTCTGGTCGACATTTTTGGAGCTCCAAATGATAAGATGGCAAAGCAGGCGAATTCTGTATTCCTCGAAGAATGGTTGAATAGCACCAGTGGAACTAGCAGTGCCCTTAACTTCAAAACCACTTCCTCGTCTGCTCGAGAAATCATCCAAGCATGGGCTGAGCTTAGAAGCTCTTTAGAGCATCAATCGTTCGATGATCGCCACATTCAATCGCTCAAAACTCTTGTCAACTCACAATCTTCACTCTTTGTTGCAGACCCACAAGCTAAGCTGGTGATTTCTATACTCTCTTCTCCCAATTTCCCCCTTCCCCATGAATCGTATCCTCTCTTTTTGAGGGTTCTTTATATTTGGGTCAGAAAATCTCTCCGCCCCTCTTTGCTACTTGTCGATTCATTCGTTGAGGTTCTCTCTCAGATTTTTTCATCCAAAATTGAATTGGGAAAAAACCCTTTGTTCTTCTCGGAAGTGGTTTTGGTTTTGGGTGCTATTTCGTACATTCCTTCAGCTTCAGAAAATTCAAAATTATGCTGTTTGGAGTTGCTTTGCAGGGTTTTGGAAGAAGGGTACCTGCTGGTTGGATCAGTAGGAGGGATGATTCCAGGGTTTCTTGCAGGTATTGGTTATGCTTTATCTTCATCAGTGAATGCTCATGTTGTTAGACTGTTGGATTCTTTATTAGGAATTTGGGGTCAGGTAGGTGGCCCTACGGGCAATCTTTCTAGTGGGTTAATGATCCTGCATTTGATTGAGTGGGTGACTTCTGGTTTGATTACCCTTCATTCTTTTGAGAAATTAGATGTTTTTAGCCGAGCTACTTTAGAGTCTTCAAAGGAAAGTTATGCTTCATTTGCTGTTGTTATGGCTGCAGCTGGAATATTGAGGGCTTTTAATACTTGCAATGTTTTGTTGAGTAGTTCAGAAAGAGAAACGATATCTAGAATAAGGATTTCAGCTCAGGATTGTTTAGAATCTATATCGAGAAATTTTATTTCGACTATGGAAGGATCTTCCGTTTCAGGCAATGACCGTAGTAGGAGCCTGTTTCTATTGTGTATTTCATCTGCAATAGCACGTTGTGGCCCTGTTTCATCTCGCCCACCTGTGCTCATTTGCATTATTTATGCTTTGTTGAGTGAAATATTTCCTTTGCAGCGTTTATATGCCAAAATTCTTGAATTCTCTTCTGTTGAGTCGGGTGCGTTGGGGCTACCTCTAGTAAAAGAGCATCTTGATAGTATTCCTTTTAAGGAAGCAGGGGCCATCACGGGTGTTCTTTGCACTCAGTATGCTTCAATTGACGAAGAGGACAAAAAGATTGTGGAGAATCTTGTATGGGATTACTGTCATGACATCTACTCAAGGCACAGACTAGTAGGGTTGGTGCTTCGTGGCAGAGAGGATGAATTACTAGAGGGTATAGAGAAAATTGCAGAGTCTGCTTTTTTCATGGTTGTAGTATTTGCATTAGCTGTCACAAAAGAAAAGTTGGATTCCAAATGTACACCGGAAAGTCAATTTGACGTATCAGTAAGAATACTTGCTTCATTCTCTTGTATGGAATACTTTCGGCATATTCGCTTGCCAGAATATATGGATACTATCCGAGGGGTTGTTGCAAGCATTCAGGAGAATGAGTCTGCCTGTGTCTCTTTCATTGAATCAATGCCCACATACAAAGATCAAACAAACGGGCCAGATAACTTTATTGGGCAGAAATTAAAATATACATGGACAGAGGATGAGGTGCAAACTGCACGCATGCTGTTTTATATACGAGTCATTCCAACTTGCATCGAACGTGTTCCTACCCAAGTGTATAGGAAAGTGGTTGCTCCAACAATGTTTTTATATATGGGACATCCTAATGCCAAAGTAGCCCGAGCTTCACACTCGGTGTTTATAGCTTTCATATCAGGGAAGGATGACAGTGAAGAAGAAAAGAGAGTGATATTGAAGGAGGAGCTTGTTTTCTACTACATTGAGAGATCTTTATCAGGATATCCTGGCATTACACCATTTGAAGGTATGGCTTCAGGTGTTGCAGCTTTGGTGCGATATCTTCCTGCAGGAAGTCCAGCCATCTTTTATTGTATCGAAAGTCTTACTGTAAAAGCTACAAGCCTTTGCAGTGAAAACTTCATGGAAAATGCTGATCTGTGGAAGTCGTGGCAGGGAGACTTAGAGCCTTCCAAGAAAATTCTTGATATGCTTTTACGGCTCATTTCTCTTGTTGATATACAGGTACTCCCAAGCTTGATGAAAAATTTAGCACAACTGATCATCAAGCTACCGGCCGAGGGCCAAAATATGGTTCTTGATCAGTTATACTCCCTGGTTTCAGAAGCTGATGATGTCACACGCAAACCCTGGCTAGTTTCATGGTTACAATCATTATCATATCTATGCTGCCAATCTAAAAGTGGAGATACACACTCTAATGAGATGCAAAATATACGGCTTTTAAACTTCTCTTGGATTGTTGACCCATTGAACCGTTTTCGATCCTATGCACGACTCTGAGGTATTTCAAGATTCTATAGACAACTTCAAAATGAGTGGGGCATGAAGCTTGCATAAATCTACTTATCATACTTACTGCAAAGGCAATGTCTGGTCTTGTATGACATAGTCTACCCATAAGTCGTTGATAACGTTCCTTATCTCACTTCTTCCTCAATTGTTGCTTGCAGTTTTAGATTTGATTAAATAAGGGTTTCTGTTGTTCTACATCACATTAATATTGTCTTTTTGAGTAAATCAAGAATATATTTTCTCTAATTAAAAAAATACCTTATTTATATCTAGCAAA
Coding sequence (CDS)
ATGGCAAAGCAGGCGAATTCTGTATTCCTCGAAGAATGGTTGAATAGCACCAGTGGAACTAGCAGTGCCCTTAACTTCAAAACCACTTCCTCGTCTGCTCGAGAAATCATCCAAGCATGGGCTGAGCTTAGAAGCTCTTTAGAGCATCAATCGTTCGATGATCGCCACATTCAATCGCTCAAAACTCTTGTCAACTCACAATCTTCACTCTTTGTTGCAGACCCACAAGCTAAGCTGGTGATTTCTATACTCTCTTCTCCCAATTTCCCCCTTCCCCATGAATCGTATCCTCTCTTTTTGAGGGTTCTTTATATTTGGGTCAGAAAATCTCTCCGCCCCTCTTTGCTACTTGTCGATTCATTCGTTGAGGTTCTCTCTCAGATTTTTTCATCCAAAATTGAATTGGGAAAAAACCCTTTGTTCTTCTCGGAAGTGGTTTTGGTTTTGGGTGCTATTTCGTACATTCCTTCAGCTTCAGAAAATTCAAAATTATGCTGTTTGGAGTTGCTTTGCAGGGTTTTGGAAGAAGGGTACCTGCTGGTTGGATCAGTAGGAGGGATGATTCCAGGGTTTCTTGCAGGTATTGGTTATGCTTTATCTTCATCAGTGAATGCTCATGTTGTTAGACTGTTGGATTCTTTATTAGGAATTTGGGGTCAGGTAGGTGGCCCTACGGGCAATCTTTCTAGTGGGTTAATGATCCTGCATTTGATTGAGTGGGTGACTTCTGGTTTGATTACCCTTCATTCTTTTGAGAAATTAGATGTTTTTAGCCGAGCTACTTTAGAGTCTTCAAAGGAAAGTTATGCTTCATTTGCTGTTGTTATGGCTGCAGCTGGAATATTGAGGGCTTTTAATACTTGCAATGTTTTGTTGAGTAGTTCAGAAAGAGAAACGATATCTAGAATAAGGATTTCAGCTCAGGATTGTTTAGAATCTATATCGAGAAATTTTATTTCGACTATGGAAGGATCTTCCGTTTCAGGCAATGACCGTAGTAGGAGCCTGTTTCTATTGTGTATTTCATCTGCAATAGCACGTTGTGGCCCTGTTTCATCTCGCCCACCTGTGCTCATTTGCATTATTTATGCTTTGTTGAGTGAAATATTTCCTTTGCAGCGTTTATATGCCAAAATTCTTGAATTCTCTTCTGTTGAGTCGGGTGCGTTGGGGCTACCTCTAGTAAAAGAGCATCTTGATAGTATTCCTTTTAAGGAAGCAGGGGCCATCACGGGTGTTCTTTGCACTCAGTATGCTTCAATTGACGAAGAGGACAAAAAGATTGTGGAGAATCTTGTATGGGATTACTGTCATGACATCTACTCAAGGCACAGACTAGTAGGGTTGGTGCTTCGTGGCAGAGAGGATGAATTACTAGAGGGTATAGAGAAAATTGCAGAGTCTGCTTTTTTCATGGTTGTAGTATTTGCATTAGCTGTCACAAAAGAAAAGTTGGATTCCAAATGTACACCGGAAAGTCAATTTGACGTATCAGTAAGAATACTTGCTTCATTCTCTTGTATGGAATACTTTCGGCATATTCGCTTGCCAGAATATATGGATACTATCCGAGGGGTTGTTGCAAGCATTCAGGAGAATGAGTCTGCCTGTGTCTCTTTCATTGAATCAATGCCCACATACAAAGATCAAACAAACGGGCCAGATAACTTTATTGGGCAGAAATTAAAATATACATGGACAGAGGATGAGGTGCAAACTGCACGCATGCTGTTTTATATACGAGTCATTCCAACTTGCATCGAACGTGTTCCTACCCAAGTGTATAGGAAAGTGGTTGCTCCAACAATGTTTTTATATATGGGACATCCTAATGCCAAAGTAGCCCGAGCTTCACACTCGGTGTTTATAGCTTTCATATCAGGGAAGGATGACAGTGAAGAAGAAAAGAGAGTGATATTGAAGGAGGAGCTTGTTTTCTACTACATTGAGAGATCTTTATCAGGATATCCTGGCATTACACCATTTGAAGGTATGGCTTCAGGTGTTGCAGCTTTGGTGCGATATCTTCCTGCAGGAAGTCCAGCCATCTTTTATTGTATCGAAAGTCTTACTGTAAAAGCTACAAGCCTTTGCAGTGAAAACTTCATGGAAAATGCTGATCTGTGGAAGTCGTGGCAGGGAGACTTAGAGCCTTCCAAGAAAATTCTTGATATGCTTTTACGGCTCATTTCTCTTGTTGATATACAGGTACTCCCAAGCTTGATGAAAAATTTAGCACAACTGATCATCAAGCTACCGGCCGAGGGCCAAAATATGGTTCTTGATCAGTTATACTCCCTGGTTTCAGAAGCTGATGATGTCACACGCAAACCCTGGCTAGTTTCATGGTTACAATCATTATCATATCTATGCTGCCAATCTAAAAGTGGAGATACACACTCTAATGAGATGCAAAATATACGGCTTTTAAACTTCTCTTGGATTGTTGACCCATTGAACCGTTTTCGATCCTATGCACGACTCTGA
Protein sequence
MAKQANSVFLEEWLNSTSGTSSALNFKTTSSSAREIIQAWAELRSSLEHQSFDDRHIQSLKTLVNSQSSLFVADPQAKLVISILSSPNFPLPHESYPLFLRVLYIWVRKSLRPSLLLVDSFVEVLSQIFSSKIELGKNPLFFSEVVLVLGAISYIPSASENSKLCCLELLCRVLEEGYLLVGSVGGMIPGFLAGIGYALSSSVNAHVVRLLDSLLGIWGQVGGPTGNLSSGLMILHLIEWVTSGLITLHSFEKLDVFSRATLESSKESYASFAVVMAAAGILRAFNTCNVLLSSSERETISRIRISAQDCLESISRNFISTMEGSSVSGNDRSRSLFLLCISSAIARCGPVSSRPPVLICIIYALLSEIFPLQRLYAKILEFSSVESGALGLPLVKEHLDSIPFKEAGAITGVLCTQYASIDEEDKKIVENLVWDYCHDIYSRHRLVGLVLRGREDELLEGIEKIAESAFFMVVVFALAVTKEKLDSKCTPESQFDVSVRILASFSCMEYFRHIRLPEYMDTIRGVVASIQENESACVSFIESMPTYKDQTNGPDNFIGQKLKYTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKVVAPTMFLYMGHPNAKVARASHSVFIAFISGKDDSEEEKRVILKEELVFYYIERSLSGYPGITPFEGMASGVAALVRYLPAGSPAIFYCIESLTVKATSLCSENFMENADLWKSWQGDLEPSKKILDMLLRLISLVDIQVLPSLMKNLAQLIIKLPAEGQNMVLDQLYSLVSEADDVTRKPWLVSWLQSLSYLCCQSKSGDTHSNEMQNIRLLNFSWIVDPLNRFRSYARL
Homology
BLAST of Sed0002136.9 vs. NCBI nr
Match:
XP_022944201.1 (uncharacterized protein LOC111448717 isoform X1 [Cucurbita moschata] >XP_022944209.1 uncharacterized protein LOC111448717 isoform X1 [Cucurbita moschata])
HSP 1 Score: 1376.7 bits (3562), Expect = 0.0e+00
Identity = 711/830 (85.66%), Postives = 764/830 (92.05%), Query Frame = 0
Query: 1 MAKQANSVFLEEWLNSTSGTSSALNFKTTSSSAREIIQAWAELRSSLEHQSFDDRHIQSL 60
MAKQANSVFLEEWL S SG SS N K +SSSAREIIQAWAELRSSLEH+ FDDRHIQSL
Sbjct: 1 MAKQANSVFLEEWLKSISGISSGFNSKISSSSAREIIQAWAELRSSLEHRLFDDRHIQSL 60
Query: 61 KTLVNSQSSLFVADPQAKLVISILSSPNFPLPHESYPLFLRVLYIWVRKSLRPSLLLVDS 120
KTLVNSQSSL+VADPQAKLV+SILSSPN LP ESYPLFLR+LYIWVRKSLRPSL+LVDS
Sbjct: 61 KTLVNSQSSLYVADPQAKLVVSILSSPNLSLPDESYPLFLRILYIWVRKSLRPSLVLVDS 120
Query: 121 FVEVLSQIFSSKIELGKNPLFFSEVVLVLGAISYIPSASENSKLCCLELLCRVL-EEGYL 180
VEVLSQIFSSKI L KNPLF SE VL+LGAISY+ SASE SKLCCLELLCR+L EE +L
Sbjct: 121 SVEVLSQIFSSKIGLRKNPLFISEGVLILGAISYVVSASEKSKLCCLELLCRILEEEEWL 180
Query: 181 LVGSVGGMIPGFLAGIGYALSSSVNAHVVRLLDSLLGIWGQVGGPTGNLSSGLMILHLIE 240
L+GSVGG +P F AGIGYALSSS+NAHVVRLLDSLLGIWG++G PTGNLS+GLMILHLIE
Sbjct: 181 LIGSVGGTVPEFFAGIGYALSSSLNAHVVRLLDSLLGIWGKIGSPTGNLSTGLMILHLIE 240
Query: 241 WVTSGLITLHSFEKLDVFSRATLESSKESYASFAVVMAAAGILRAFNTCNVLLSSSERET 300
WVTSGLI+LHSF+KL+ S+ LESSKESYASFAVVMAAAGILRAFN+ LLSSSERET
Sbjct: 241 WVTSGLISLHSFKKLNFLSQTALESSKESYASFAVVMAAAGILRAFNSYKALLSSSERET 300
Query: 301 ISRIRISAQDCLESISRNFISTMEGSSVSGN-DRSRSLFLLCISSAIARCGPVSSRPPVL 360
ISRIRISAQDCLESI++NFISTMEGSS++GN D RSL LLCIS A+ARCGPV+SRPPVL
Sbjct: 301 ISRIRISAQDCLESIAKNFISTMEGSSITGNDDHGRSLLLLCISLAVARCGPVASRPPVL 360
Query: 361 ICIIYALLSEIFPLQRLYAKILEFSSVESGALGLPLVKEHLDSIPFKEAGAITGVLCTQY 420
IC+ YALL+EIFPLQRLYAK+L+FS ESG LGL LVKEHLDSIPFKEAG I GVLC+QY
Sbjct: 361 ICVTYALLTEIFPLQRLYAKLLKFSFGESGVLGLTLVKEHLDSIPFKEAGVIAGVLCSQY 420
Query: 421 ASIDEEDKKIVENLVWDYCHDIYSRHRLVGLVLRGREDELLEGIEKIAESAFFMVVVFAL 480
ASIDE+DKK VENLVWDYC DIYSRHR VGLVLR REDELLE IEKIAESAF MVVVFAL
Sbjct: 421 ASIDEDDKKFVENLVWDYCQDIYSRHRRVGLVLRHREDELLENIEKIAESAFLMVVVFAL 480
Query: 481 AVTKEKLDSKCTPESQFDVSVRILASFSCMEYFRHIRLPEYMDTIRGVVASIQENESACV 540
AVTKEKL+SK TPE+QFDVSVRIL SFSCMEYFR IR+PEYMDTIRGVVAS+QENESACV
Sbjct: 481 AVTKEKLNSKYTPETQFDVSVRILDSFSCMEYFRRIRMPEYMDTIRGVVASVQENESACV 540
Query: 541 SFIESMPTYKDQTNGPDNFIGQKLKYTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKV 600
SFIESMP+Y+DQT+GPD+ IGQKL+YTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKV
Sbjct: 541 SFIESMPSYQDQTHGPDSSIGQKLQYTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKV 600
Query: 601 VAPTMFLYMGHPNAKVARASHSVFIAFISGKDDSEEEKRVILKEELVFYYIERSLSGYPG 660
VAPTMFLYMGHPNAKVARASHSVFIAFISGKDD E+ RV+LKEELVFYYIERSLSGYPG
Sbjct: 601 VAPTMFLYMGHPNAKVARASHSVFIAFISGKDDDEDGNRVMLKEELVFYYIERSLSGYPG 660
Query: 661 ITPFEGMASGVAALVRYLPAGSPAIFYCIESLTVKATSLCSENFMENADLWKSWQGDLEP 720
ITPFEGMASGVAALVRYLPAGSP+IFYCI+SLTVKATSLCSENFM++ADLWK+WQGDLEP
Sbjct: 661 ITPFEGMASGVAALVRYLPAGSPSIFYCIDSLTVKATSLCSENFMDDADLWKTWQGDLEP 720
Query: 721 SKKILDMLLRLISLVDIQVLPSLMKNLAQLIIKLPAEGQNMVLDQLYSLVSEADDVTRKP 780
SKKILDMLLRLISLVDIQVLPSLM NLAQL+IKLP+EGQNMVLDQLYSLVSEADDVTRKP
Sbjct: 721 SKKILDMLLRLISLVDIQVLPSLMTNLAQLVIKLPSEGQNMVLDQLYSLVSEADDVTRKP 780
Query: 781 WLVSWLQSLSYLCCQSKSGDTHSNEMQNIRLLNFSWIVDPLNRFRSYARL 829
LVSWLQSLSYLC QS+S D HSNE Q RL NF+WIVDPLNR RSYARL
Sbjct: 781 LLVSWLQSLSYLCSQSRSADAHSNEKQTTRLSNFAWIVDPLNRIRSYARL 830
BLAST of Sed0002136.9 vs. NCBI nr
Match:
XP_038903921.1 (uncharacterized protein LOC120090375 isoform X1 [Benincasa hispida])
HSP 1 Score: 1373.6 bits (3554), Expect = 0.0e+00
Identity = 716/828 (86.47%), Postives = 758/828 (91.55%), Query Frame = 0
Query: 1 MAKQANSVFLEEWLNSTSGTSSALNFKTTSSSAREIIQAWAELRSSLEHQSFDDRHIQSL 60
MAKQ++S+FLEEWL S GT ALN K TSSSAREIIQAWAELRSSLEHQSFDDRHIQSL
Sbjct: 1 MAKQSSSLFLEEWLKSIGGT--ALNSKLTSSSAREIIQAWAELRSSLEHQSFDDRHIQSL 60
Query: 61 KTLVNSQSSLFVADPQAKLVISILSSPNFPLPHESYPLFLRVLYIWVRKSLRPSLLLVDS 120
K LVNSQSSL+VADPQAKLVISILSSPNF +P ESYPLFLR+LYIWVRKSLRPSL+LVDS
Sbjct: 61 KILVNSQSSLYVADPQAKLVISILSSPNFSIPDESYPLFLRILYIWVRKSLRPSLVLVDS 120
Query: 121 FVEVLSQIFSSKIELGKNPLFFSEVVLVLGAISYIPSASENSKLCCLELLCRVLEEGYLL 180
VEVLS IFSSKIEL KNPLFFSE VLVLGAISY+ SASE SKLCCLELLCRVLEE YLL
Sbjct: 121 SVEVLSHIFSSKIELRKNPLFFSEGVLVLGAISYLLSASEKSKLCCLELLCRVLEEEYLL 180
Query: 181 VGSVGGMIPGFLAGIGYALSSSVNAHVVRLLDSLLGIWGQVGGPTGNLSSGLMILHLIEW 240
VGSVG +IP FLAGIGYALSSSVNAHVVRLLDSLLGIWG +GGP LSSGLMILH+IEW
Sbjct: 181 VGSVGEIIPEFLAGIGYALSSSVNAHVVRLLDSLLGIWGNIGGPIDTLSSGLMILHMIEW 240
Query: 241 VTSGLITLHSFEKLDVFSRATLESSKESYASFAVVMAAAGILRAFNTCNVLLSSSERETI 300
VTSG+I+LHSFEKLDVFS+A L SSKESYASFAVVMAAAGILRAFNT LLSSSERETI
Sbjct: 241 VTSGMISLHSFEKLDVFSQAILVSSKESYASFAVVMAAAGILRAFNTQKGLLSSSERETI 300
Query: 301 SRIRISAQDCLESISRNFISTMEGSSVSGNDRSRSLFLLCISSAIARCGPVSSRPPVLIC 360
SRIRISAQDCLESI+RNFISTMEGSS++GND RS+ LLCIS AIARCGPVSS PPVLIC
Sbjct: 301 SRIRISAQDCLESIARNFISTMEGSSITGNDHRRSVLLLCISLAIARCGPVSSCPPVLIC 360
Query: 361 IIYALLSEIFPLQRLYAKILEFSSVESGALGLPLVKEHLDSIPFKEAGAITGVLCTQYAS 420
++YALL+EIFPLQRLYAKI EFS E GALGL LV EHL SIPFKEAGAITGV C+QYA+
Sbjct: 361 VVYALLTEIFPLQRLYAKINEFSFAELGALGLTLVNEHLGSIPFKEAGAITGVFCSQYAT 420
Query: 421 IDEEDKKIVENLVWDYCHDIYSRHRLVGLVLRGREDELLEGIEKIAESAFFMVVVFALAV 480
++EEDK VENLVWDYC D+YSRHRL GLVLRGREDELLE IEKIAESAF MVVVFALAV
Sbjct: 421 LEEEDKSFVENLVWDYCQDVYSRHRLAGLVLRGREDELLENIEKIAESAFLMVVVFALAV 480
Query: 481 TKEKLDSKCTPESQFDVSVRILASFSCMEYFRHIRLPEYMDTIRGVVASIQENESACVSF 540
TKEKLDSK T ESQFD+SVRIL SFSCMEYFR IRLPEYMDTIRGVVASIQ NESACVSF
Sbjct: 481 TKEKLDSKYTLESQFDISVRILVSFSCMEYFRRIRLPEYMDTIRGVVASIQGNESACVSF 540
Query: 541 IESMPTYKDQTNGPDNFIGQKLKYTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKVVA 600
IESMPTY+DQTNGPDN IG+ KY+WT+DEVQTARMLFY+RVIPTCIERVPTQVY KVVA
Sbjct: 541 IESMPTYQDQTNGPDNSIGRITKYSWTKDEVQTARMLFYVRVIPTCIERVPTQVYGKVVA 600
Query: 601 PTMFLYMGHPNAKVARASHSVFIAFISGKDDSEEEKRVILKEELVFYYIERSLSGYPGIT 660
PTMFLYMGHPNAKVARASHSVFIAF+SGKDD +EKRV LKEELVFYYIERSLSGYPGIT
Sbjct: 601 PTMFLYMGHPNAKVARASHSVFIAFMSGKDDLGDEKRVTLKEELVFYYIERSLSGYPGIT 660
Query: 661 PFEGMASGVAALVRYLPAGSPAIFYCIESLTVKATSLCSENFMENADLWKSWQGDLEPSK 720
PFEGMASGVAALVRYLPAGSPAIFYCI+SLTVKATSLCSENFM++ADLWK+WQGDLEPSK
Sbjct: 661 PFEGMASGVAALVRYLPAGSPAIFYCIDSLTVKATSLCSENFMDDADLWKTWQGDLEPSK 720
Query: 721 KILDMLLRLISLVDIQVLPSLMKNLAQLIIKLPAEGQNMVLDQLYSLVSEADDVTRKPWL 780
KILDMLLRL+SLVDIQVLPSLMKNLAQLII+LP EGQNMVLDQLYSLVSEADDVTRKP L
Sbjct: 721 KILDMLLRLVSLVDIQVLPSLMKNLAQLIIELPTEGQNMVLDQLYSLVSEADDVTRKPML 780
Query: 781 VSWLQSLSYLCCQSKSGDTHSNEMQNIRLLNFSWIVDPLNRFRSYARL 829
VSWLQSLSYLC QSKS D S E Q+ RL NF+WIVDPLNR RSYARL
Sbjct: 781 VSWLQSLSYLCSQSKSTDARSIEKQSTRLTNFAWIVDPLNRIRSYARL 826
BLAST of Sed0002136.9 vs. NCBI nr
Match:
XP_023005293.1 (uncharacterized protein LOC111498339 isoform X1 [Cucurbita maxima] >XP_023005295.1 uncharacterized protein LOC111498339 isoform X1 [Cucurbita maxima] >XP_023005296.1 uncharacterized protein LOC111498339 isoform X1 [Cucurbita maxima])
HSP 1 Score: 1371.7 bits (3549), Expect = 0.0e+00
Identity = 711/830 (85.66%), Postives = 762/830 (91.81%), Query Frame = 0
Query: 1 MAKQANSVFLEEWLNSTSGTSSALNFKTTSSSAREIIQAWAELRSSLEHQSFDDRHIQSL 60
MAKQANSVFLEEWL S SG SS N K +SSSAREIIQAWAELRSSLEHQ FDDRHIQSL
Sbjct: 1 MAKQANSVFLEEWLKSISGISSGFNSKISSSSAREIIQAWAELRSSLEHQLFDDRHIQSL 60
Query: 61 KTLVNSQSSLFVADPQAKLVISILSSPNFPLPHESYPLFLRVLYIWVRKSLRPSLLLVDS 120
KTLVNSQSSL+VADPQAKLVISILSSPN LP ESYPLFLR+LYIWVRKSLRPSL+LVDS
Sbjct: 61 KTLVNSQSSLYVADPQAKLVISILSSPNLSLPDESYPLFLRILYIWVRKSLRPSLVLVDS 120
Query: 121 FVEVLSQIFSSKIELGKNPLFFSEVVLVLGAISYIPSASENSKLCCLELLCRVL-EEGYL 180
VE+LSQIFSSKI L KNPLF SE VL+LGAISY+ SASE KLCCLELLCR+L EE +L
Sbjct: 121 SVEILSQIFSSKIGLRKNPLFISEGVLILGAISYVVSASEKFKLCCLELLCRILEEEEWL 180
Query: 181 LVGSVGGMIPGFLAGIGYALSSSVNAHVVRLLDSLLGIWGQVGGPTGNLSSGLMILHLIE 240
L+GSVGG +P F AGIGYALSSSVNAHVVRLLDSLLGIWG++G PTGNLS+GLMILHLIE
Sbjct: 181 LIGSVGGTVPEFFAGIGYALSSSVNAHVVRLLDSLLGIWGKIGSPTGNLSTGLMILHLIE 240
Query: 241 WVTSGLITLHSFEKLDVFSRATLESSKESYASFAVVMAAAGILRAFNTCNVLLSSSERET 300
WVTSGLI+LHSF+KLD S+A LESSKESYASFAVVMAAAGILRAFN+ LLSSSERET
Sbjct: 241 WVTSGLISLHSFKKLDFLSQAALESSKESYASFAVVMAAAGILRAFNSYKALLSSSERET 300
Query: 301 ISRIRISAQDCLESISRNFISTMEGSSVSGN-DRSRSLFLLCISSAIARCGPVSSRPPVL 360
ISRIRISAQDCLESI++NFISTMEGSS++GN D RSL LLCIS A+ARCGPV+SRPPVL
Sbjct: 301 ISRIRISAQDCLESIAKNFISTMEGSSITGNDDHGRSLLLLCISLAVARCGPVASRPPVL 360
Query: 361 ICIIYALLSEIFPLQRLYAKILEFSSVESGALGLPLVKEHLDSIPFKEAGAITGVLCTQY 420
IC+ YALL+EIFPLQRLYAK+LEFS ESG LGL LVKEHLDSIPFKEAG I GVLC+QY
Sbjct: 361 ICVTYALLTEIFPLQRLYAKLLEFSFGESGVLGLSLVKEHLDSIPFKEAGVIAGVLCSQY 420
Query: 421 ASIDEEDKKIVENLVWDYCHDIYSRHRLVGLVLRGREDELLEGIEKIAESAFFMVVVFAL 480
ASIDE+DKKIVENLVWDYC DIYSRHR VGLVLR REDELLE IEKIAESAF MVVVFAL
Sbjct: 421 ASIDEDDKKIVENLVWDYCQDIYSRHRRVGLVLRHREDELLENIEKIAESAFLMVVVFAL 480
Query: 481 AVTKEKLDSKCTPESQFDVSVRILASFSCMEYFRHIRLPEYMDTIRGVVASIQENESACV 540
AVTKEKL+SK T E+QFDVSVRIL SFSCMEYFR IR+PEYMDTIRGVVAS+QENESACV
Sbjct: 481 AVTKEKLNSKYTLETQFDVSVRILNSFSCMEYFRRIRMPEYMDTIRGVVASVQENESACV 540
Query: 541 SFIESMPTYKDQTNGPDNFIGQKLKYTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKV 600
SFIESMP+Y+DQT+GPD+ IGQKL+Y WTEDEVQTARMLFYIRVIPTCIE VPTQVYRKV
Sbjct: 541 SFIESMPSYQDQTHGPDSSIGQKLQYIWTEDEVQTARMLFYIRVIPTCIELVPTQVYRKV 600
Query: 601 VAPTMFLYMGHPNAKVARASHSVFIAFISGKDDSEEEKRVILKEELVFYYIERSLSGYPG 660
VAPTMFLYMGHPN+KVARASHSVFIAFISGKDD E+ RV+LKEELVFYYIERSLSGYPG
Sbjct: 601 VAPTMFLYMGHPNSKVARASHSVFIAFISGKDDGEDGNRVMLKEELVFYYIERSLSGYPG 660
Query: 661 ITPFEGMASGVAALVRYLPAGSPAIFYCIESLTVKATSLCSENFMENADLWKSWQGDLEP 720
ITPFEGMASGVAALVRYLPAGSP+IFYCI+SLTVKATSLCSENFM++ADLWK+WQGDLEP
Sbjct: 661 ITPFEGMASGVAALVRYLPAGSPSIFYCIDSLTVKATSLCSENFMDDADLWKTWQGDLEP 720
Query: 721 SKKILDMLLRLISLVDIQVLPSLMKNLAQLIIKLPAEGQNMVLDQLYSLVSEADDVTRKP 780
SKKILDMLLRLISLVDIQVLPSLM NLAQL+IKLP+EGQNMVLDQLYSLVSEADDVTRKP
Sbjct: 721 SKKILDMLLRLISLVDIQVLPSLMTNLAQLVIKLPSEGQNMVLDQLYSLVSEADDVTRKP 780
Query: 781 WLVSWLQSLSYLCCQSKSGDTHSNEMQNIRLLNFSWIVDPLNRFRSYARL 829
LVSWLQSLSYLC +S+S D HSNE Q RL NF+WIVDPLNR RSYARL
Sbjct: 781 SLVSWLQSLSYLCSRSRSADAHSNEKQTTRLSNFAWIVDPLNRIRSYARL 830
BLAST of Sed0002136.9 vs. NCBI nr
Match:
XP_023539379.1 (uncharacterized protein LOC111800037 [Cucurbita pepo subsp. pepo] >XP_023539380.1 uncharacterized protein LOC111800037 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1370.1 bits (3545), Expect = 0.0e+00
Identity = 705/829 (85.04%), Postives = 760/829 (91.68%), Query Frame = 0
Query: 1 MAKQANSVFLEEWLNSTSGTSSALNFKTTSSSAREIIQAWAELRSSLEHQSFDDRHIQSL 60
MAKQANSVFLEEWL S SG SS + K +SSSAREIIQAWAELRSSLEH+ FDDRHIQSL
Sbjct: 1 MAKQANSVFLEEWLKSISGISSGFHSKISSSSAREIIQAWAELRSSLEHRLFDDRHIQSL 60
Query: 61 KTLVNSQSSLFVADPQAKLVISILSSPNFPLPHESYPLFLRVLYIWVRKSLRPSLLLVDS 120
KTLVNSQSSL+VADPQAKLVISILSSPN LP ESYPLFLR+LYIWVRKSLRPSL+LVDS
Sbjct: 61 KTLVNSQSSLYVADPQAKLVISILSSPNLSLPDESYPLFLRILYIWVRKSLRPSLVLVDS 120
Query: 121 FVEVLSQIFSSKIELGKNPLFFSEVVLVLGAISYIPSASENSKLCCLELLCRVL-EEGYL 180
VE+LSQIFSSKI L KNPLF SE VL+LGA SY+ SASE SKLCCLELLCR+L EE +L
Sbjct: 121 SVEILSQIFSSKIGLRKNPLFISEGVLILGANSYVVSASEKSKLCCLELLCRILEEEEWL 180
Query: 181 LVGSVGGMIPGFLAGIGYALSSSVNAHVVRLLDSLLGIWGQVGGPTGNLSSGLMILHLIE 240
L+GSVGG +P F AGIGYALSSSVN HVVRLLDSLLGIWG++G PTGNLS+GLMILHLIE
Sbjct: 181 LIGSVGGTVPEFFAGIGYALSSSVNVHVVRLLDSLLGIWGKIGSPTGNLSTGLMILHLIE 240
Query: 241 WVTSGLITLHSFEKLDVFSRATLESSKESYASFAVVMAAAGILRAFNTCNVLLSSSERET 300
WVTSGLI+LHSF+KLD+ S+A LESSKESYASFAVVMAAAGILRAFN+ LL SSERET
Sbjct: 241 WVTSGLISLHSFKKLDLLSQAALESSKESYASFAVVMAAAGILRAFNSYKALLKSSERET 300
Query: 301 ISRIRISAQDCLESISRNFISTMEGSSVSGNDRSRSLFLLCISSAIARCGPVSSRPPVLI 360
ISRIRISA DCLESI++N ISTMEGSS++GND RSL LLCIS A+ARCGPV+SRPPVLI
Sbjct: 301 ISRIRISAVDCLESIAKNLISTMEGSSITGNDHGRSLLLLCISLAVARCGPVASRPPVLI 360
Query: 361 CIIYALLSEIFPLQRLYAKILEFSSVESGALGLPLVKEHLDSIPFKEAGAITGVLCTQYA 420
C+ YALL+EIFPLQRLYAK+LEFS ESG LG LVKEHLDS+PFKEAG I GVLC++YA
Sbjct: 361 CVTYALLTEIFPLQRLYAKLLEFSLGESGVLGHSLVKEHLDSVPFKEAGVIAGVLCSRYA 420
Query: 421 SIDEEDKKIVENLVWDYCHDIYSRHRLVGLVLRGREDELLEGIEKIAESAFFMVVVFALA 480
SIDE+DKK VENLVWDYC DIYSRHR +GLVLR REDELLE IEKIAESAF MVVVFALA
Sbjct: 421 SIDEDDKKFVENLVWDYCQDIYSRHRRIGLVLRHREDELLENIEKIAESAFLMVVVFALA 480
Query: 481 VTKEKLDSKCTPESQFDVSVRILASFSCMEYFRHIRLPEYMDTIRGVVASIQENESACVS 540
VTKEKL+SK TPE+QFDVSVRIL SFSCMEYFR IR+PEYMDTIRGVVAS+QENESACVS
Sbjct: 481 VTKEKLNSKYTPETQFDVSVRILDSFSCMEYFRRIRMPEYMDTIRGVVASVQENESACVS 540
Query: 541 FIESMPTYKDQTNGPDNFIGQKLKYTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKVV 600
FIESMP+Y+DQT+GPD+ IGQKL+YTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKVV
Sbjct: 541 FIESMPSYQDQTHGPDSSIGQKLQYTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKVV 600
Query: 601 APTMFLYMGHPNAKVARASHSVFIAFISGKDDSEEEKRVILKEELVFYYIERSLSGYPGI 660
APTMFLYMGHPNAKVARASHSVFIAFISGKDD E+ RV+LKEELVFYYIERSLSGYPGI
Sbjct: 601 APTMFLYMGHPNAKVARASHSVFIAFISGKDDDEDGNRVMLKEELVFYYIERSLSGYPGI 660
Query: 661 TPFEGMASGVAALVRYLPAGSPAIFYCIESLTVKATSLCSENFMENADLWKSWQGDLEPS 720
TPFEGMASGVAALVRYLPAGSP+IFYCI+SLTVKATSLCSENFM++ADLWK+WQGDLEPS
Sbjct: 661 TPFEGMASGVAALVRYLPAGSPSIFYCIDSLTVKATSLCSENFMDDADLWKTWQGDLEPS 720
Query: 721 KKILDMLLRLISLVDIQVLPSLMKNLAQLIIKLPAEGQNMVLDQLYSLVSEADDVTRKPW 780
KKILDMLLRLISLVDIQVLPSLM NLAQL+IKLP+EGQNMVLDQLYSLVSEADDVTRKP
Sbjct: 721 KKILDMLLRLISLVDIQVLPSLMTNLAQLVIKLPSEGQNMVLDQLYSLVSEADDVTRKPL 780
Query: 781 LVSWLQSLSYLCCQSKSGDTHSNEMQNIRLLNFSWIVDPLNRFRSYARL 829
LVSWLQSLSYLC +S+S D HSNE Q RL NFSWIVDPLNR RSYARL
Sbjct: 781 LVSWLQSLSYLCSRSRSADAHSNEKQTARLSNFSWIVDPLNRIRSYARL 829
BLAST of Sed0002136.9 vs. NCBI nr
Match:
KAG6596825.1 (hypothetical protein SDJN03_10005, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 1354.7 bits (3505), Expect = 0.0e+00
Identity = 701/830 (84.46%), Postives = 756/830 (91.08%), Query Frame = 0
Query: 1 MAKQANSVFLEEWLNSTSGTSSALNFKTTSSSAREIIQAWAELRSSLEHQSFDDRHIQSL 60
MAKQANSVFLEEWL S SG SS N K +SSSAREIIQAWAELRSSLEH+ F DRHIQSL
Sbjct: 1 MAKQANSVFLEEWLKSISGISSGFNSKISSSSAREIIQAWAELRSSLEHRLFADRHIQSL 60
Query: 61 KTLVNSQSSLFVADPQAKLVISILSSPNFPLPHESYPLFLRVLYIWVRKSLRPSLLLVDS 120
KTLVNSQSSL+VADPQAKLV+SILSSPN LP ESYPLFLR+LYIWVRKSLRPSL+LVDS
Sbjct: 61 KTLVNSQSSLYVADPQAKLVVSILSSPNLSLPDESYPLFLRILYIWVRKSLRPSLVLVDS 120
Query: 121 FVEVLSQIFSSKIELGKNPLFFSEVVLVLGAISYIPSASENSKLCCLELLCRVL-EEGYL 180
VE+LSQIFSSKI L KNPLF SE VL+LGAISY+ SASE SKLCCLELLCR+L EE +L
Sbjct: 121 SVEILSQIFSSKIGLRKNPLFISEGVLILGAISYVVSASEKSKLCCLELLCRILEEEEWL 180
Query: 181 LVGSVGGMIPGFLAGIGYALSSSVNAHVVRLLDSLLGIWGQVGGPTGNLSSGLMILHLIE 240
L+GSVGG +P F AGIGYALSSS+NAHVV+LLDSLLGIW ++G PTGNLS+GLMILHLIE
Sbjct: 181 LIGSVGGTVPEFFAGIGYALSSSLNAHVVKLLDSLLGIWCKIGSPTGNLSTGLMILHLIE 240
Query: 241 WVTSGLITLHSFEKLDVFSRATLESSKESYASFAVVMAAAGILRAFNTCNVLLSSSERET 300
WVTSGLI+LHSF+KLD S+ LESSKESYASFAVVMAAAGILRAFN+ LL SSERET
Sbjct: 241 WVTSGLISLHSFKKLDFLSQTALESSKESYASFAVVMAAAGILRAFNSYKALLKSSERET 300
Query: 301 ISRIRISAQDCLESISRNFISTMEGSSVSG-NDRSRSLFLLCISSAIARCGPVSSRPPVL 360
ISRIRISA DCLESI++NFISTMEGSS++G ND RSL LLCIS A+ARCGPV+SRPPVL
Sbjct: 301 ISRIRISAVDCLESIAKNFISTMEGSSIAGNNDHGRSLLLLCISLAVARCGPVASRPPVL 360
Query: 361 ICIIYALLSEIFPLQRLYAKILEFSSVESGALGLPLVKEHLDSIPFKEAGAITGVLCTQY 420
IC+ YALL+EIFPLQRLYAK+LEFS ESG LG LV EHLDSIPFKEAG I GVLC++Y
Sbjct: 361 ICVTYALLTEIFPLQRLYAKLLEFSLGESGVLGHTLVNEHLDSIPFKEAGVIAGVLCSRY 420
Query: 421 ASIDEEDKKIVENLVWDYCHDIYSRHRLVGLVLRGREDELLEGIEKIAESAFFMVVVFAL 480
ASIDE+DKK VENLVWDYC DIYSRHR VGLVLR REDELLE IEKIAESAF MVVVFAL
Sbjct: 421 ASIDEDDKKFVENLVWDYCQDIYSRHRRVGLVLRHREDELLENIEKIAESAFLMVVVFAL 480
Query: 481 AVTKEKLDSKCTPESQFDVSVRILASFSCMEYFRHIRLPEYMDTIRGVVASIQENESACV 540
AVTKEKL+SK T E+QFDVSVRIL SFSCMEYFR IR+PEYMDTIRGVVAS+QENESACV
Sbjct: 481 AVTKEKLNSKYTLETQFDVSVRILDSFSCMEYFRRIRMPEYMDTIRGVVASVQENESACV 540
Query: 541 SFIESMPTYKDQTNGPDNFIGQKLKYTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKV 600
SFIESMP+Y+DQT+GPD+ IGQKL+YTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKV
Sbjct: 541 SFIESMPSYQDQTHGPDSSIGQKLQYTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKV 600
Query: 601 VAPTMFLYMGHPNAKVARASHSVFIAFISGKDDSEEEKRVILKEELVFYYIERSLSGYPG 660
VAPTMFLYMGHPNAKVARASHSVFIAFISGKDD E+ RV+LKEELVFYYIERSLSGYPG
Sbjct: 601 VAPTMFLYMGHPNAKVARASHSVFIAFISGKDDDEDGNRVMLKEELVFYYIERSLSGYPG 660
Query: 661 ITPFEGMASGVAALVRYLPAGSPAIFYCIESLTVKATSLCSENFMENADLWKSWQGDLEP 720
ITPFEGMASGVAALVRYLPAGSP+IFYCI+SLTVKATSLCSENFM++ DLWK+WQGDLEP
Sbjct: 661 ITPFEGMASGVAALVRYLPAGSPSIFYCIDSLTVKATSLCSENFMDDVDLWKTWQGDLEP 720
Query: 721 SKKILDMLLRLISLVDIQVLPSLMKNLAQLIIKLPAEGQNMVLDQLYSLVSEADDVTRKP 780
SKKILDMLLRLISLVDIQVLPSLM NLAQL+IKLP+EGQNMVLDQLYSLVSEADDVTRKP
Sbjct: 721 SKKILDMLLRLISLVDIQVLPSLMTNLAQLVIKLPSEGQNMVLDQLYSLVSEADDVTRKP 780
Query: 781 WLVSWLQSLSYLCCQSKSGDTHSNEMQNIRLLNFSWIVDPLNRFRSYARL 829
LVSWLQSLSYLC +S+S D HSNE Q RL NF+WIVDPLNR RSYARL
Sbjct: 781 LLVSWLQSLSYLCSRSRSADAHSNEKQTTRLSNFAWIVDPLNRIRSYARL 830
BLAST of Sed0002136.9 vs. ExPASy TrEMBL
Match:
A0A6J1FWB1 (uncharacterized protein LOC111448717 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111448717 PE=4 SV=1)
HSP 1 Score: 1376.7 bits (3562), Expect = 0.0e+00
Identity = 711/830 (85.66%), Postives = 764/830 (92.05%), Query Frame = 0
Query: 1 MAKQANSVFLEEWLNSTSGTSSALNFKTTSSSAREIIQAWAELRSSLEHQSFDDRHIQSL 60
MAKQANSVFLEEWL S SG SS N K +SSSAREIIQAWAELRSSLEH+ FDDRHIQSL
Sbjct: 1 MAKQANSVFLEEWLKSISGISSGFNSKISSSSAREIIQAWAELRSSLEHRLFDDRHIQSL 60
Query: 61 KTLVNSQSSLFVADPQAKLVISILSSPNFPLPHESYPLFLRVLYIWVRKSLRPSLLLVDS 120
KTLVNSQSSL+VADPQAKLV+SILSSPN LP ESYPLFLR+LYIWVRKSLRPSL+LVDS
Sbjct: 61 KTLVNSQSSLYVADPQAKLVVSILSSPNLSLPDESYPLFLRILYIWVRKSLRPSLVLVDS 120
Query: 121 FVEVLSQIFSSKIELGKNPLFFSEVVLVLGAISYIPSASENSKLCCLELLCRVL-EEGYL 180
VEVLSQIFSSKI L KNPLF SE VL+LGAISY+ SASE SKLCCLELLCR+L EE +L
Sbjct: 121 SVEVLSQIFSSKIGLRKNPLFISEGVLILGAISYVVSASEKSKLCCLELLCRILEEEEWL 180
Query: 181 LVGSVGGMIPGFLAGIGYALSSSVNAHVVRLLDSLLGIWGQVGGPTGNLSSGLMILHLIE 240
L+GSVGG +P F AGIGYALSSS+NAHVVRLLDSLLGIWG++G PTGNLS+GLMILHLIE
Sbjct: 181 LIGSVGGTVPEFFAGIGYALSSSLNAHVVRLLDSLLGIWGKIGSPTGNLSTGLMILHLIE 240
Query: 241 WVTSGLITLHSFEKLDVFSRATLESSKESYASFAVVMAAAGILRAFNTCNVLLSSSERET 300
WVTSGLI+LHSF+KL+ S+ LESSKESYASFAVVMAAAGILRAFN+ LLSSSERET
Sbjct: 241 WVTSGLISLHSFKKLNFLSQTALESSKESYASFAVVMAAAGILRAFNSYKALLSSSERET 300
Query: 301 ISRIRISAQDCLESISRNFISTMEGSSVSGN-DRSRSLFLLCISSAIARCGPVSSRPPVL 360
ISRIRISAQDCLESI++NFISTMEGSS++GN D RSL LLCIS A+ARCGPV+SRPPVL
Sbjct: 301 ISRIRISAQDCLESIAKNFISTMEGSSITGNDDHGRSLLLLCISLAVARCGPVASRPPVL 360
Query: 361 ICIIYALLSEIFPLQRLYAKILEFSSVESGALGLPLVKEHLDSIPFKEAGAITGVLCTQY 420
IC+ YALL+EIFPLQRLYAK+L+FS ESG LGL LVKEHLDSIPFKEAG I GVLC+QY
Sbjct: 361 ICVTYALLTEIFPLQRLYAKLLKFSFGESGVLGLTLVKEHLDSIPFKEAGVIAGVLCSQY 420
Query: 421 ASIDEEDKKIVENLVWDYCHDIYSRHRLVGLVLRGREDELLEGIEKIAESAFFMVVVFAL 480
ASIDE+DKK VENLVWDYC DIYSRHR VGLVLR REDELLE IEKIAESAF MVVVFAL
Sbjct: 421 ASIDEDDKKFVENLVWDYCQDIYSRHRRVGLVLRHREDELLENIEKIAESAFLMVVVFAL 480
Query: 481 AVTKEKLDSKCTPESQFDVSVRILASFSCMEYFRHIRLPEYMDTIRGVVASIQENESACV 540
AVTKEKL+SK TPE+QFDVSVRIL SFSCMEYFR IR+PEYMDTIRGVVAS+QENESACV
Sbjct: 481 AVTKEKLNSKYTPETQFDVSVRILDSFSCMEYFRRIRMPEYMDTIRGVVASVQENESACV 540
Query: 541 SFIESMPTYKDQTNGPDNFIGQKLKYTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKV 600
SFIESMP+Y+DQT+GPD+ IGQKL+YTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKV
Sbjct: 541 SFIESMPSYQDQTHGPDSSIGQKLQYTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKV 600
Query: 601 VAPTMFLYMGHPNAKVARASHSVFIAFISGKDDSEEEKRVILKEELVFYYIERSLSGYPG 660
VAPTMFLYMGHPNAKVARASHSVFIAFISGKDD E+ RV+LKEELVFYYIERSLSGYPG
Sbjct: 601 VAPTMFLYMGHPNAKVARASHSVFIAFISGKDDDEDGNRVMLKEELVFYYIERSLSGYPG 660
Query: 661 ITPFEGMASGVAALVRYLPAGSPAIFYCIESLTVKATSLCSENFMENADLWKSWQGDLEP 720
ITPFEGMASGVAALVRYLPAGSP+IFYCI+SLTVKATSLCSENFM++ADLWK+WQGDLEP
Sbjct: 661 ITPFEGMASGVAALVRYLPAGSPSIFYCIDSLTVKATSLCSENFMDDADLWKTWQGDLEP 720
Query: 721 SKKILDMLLRLISLVDIQVLPSLMKNLAQLIIKLPAEGQNMVLDQLYSLVSEADDVTRKP 780
SKKILDMLLRLISLVDIQVLPSLM NLAQL+IKLP+EGQNMVLDQLYSLVSEADDVTRKP
Sbjct: 721 SKKILDMLLRLISLVDIQVLPSLMTNLAQLVIKLPSEGQNMVLDQLYSLVSEADDVTRKP 780
Query: 781 WLVSWLQSLSYLCCQSKSGDTHSNEMQNIRLLNFSWIVDPLNRFRSYARL 829
LVSWLQSLSYLC QS+S D HSNE Q RL NF+WIVDPLNR RSYARL
Sbjct: 781 LLVSWLQSLSYLCSQSRSADAHSNEKQTTRLSNFAWIVDPLNRIRSYARL 830
BLAST of Sed0002136.9 vs. ExPASy TrEMBL
Match:
A0A6J1KX18 (uncharacterized protein LOC111498339 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111498339 PE=4 SV=1)
HSP 1 Score: 1371.7 bits (3549), Expect = 0.0e+00
Identity = 711/830 (85.66%), Postives = 762/830 (91.81%), Query Frame = 0
Query: 1 MAKQANSVFLEEWLNSTSGTSSALNFKTTSSSAREIIQAWAELRSSLEHQSFDDRHIQSL 60
MAKQANSVFLEEWL S SG SS N K +SSSAREIIQAWAELRSSLEHQ FDDRHIQSL
Sbjct: 1 MAKQANSVFLEEWLKSISGISSGFNSKISSSSAREIIQAWAELRSSLEHQLFDDRHIQSL 60
Query: 61 KTLVNSQSSLFVADPQAKLVISILSSPNFPLPHESYPLFLRVLYIWVRKSLRPSLLLVDS 120
KTLVNSQSSL+VADPQAKLVISILSSPN LP ESYPLFLR+LYIWVRKSLRPSL+LVDS
Sbjct: 61 KTLVNSQSSLYVADPQAKLVISILSSPNLSLPDESYPLFLRILYIWVRKSLRPSLVLVDS 120
Query: 121 FVEVLSQIFSSKIELGKNPLFFSEVVLVLGAISYIPSASENSKLCCLELLCRVL-EEGYL 180
VE+LSQIFSSKI L KNPLF SE VL+LGAISY+ SASE KLCCLELLCR+L EE +L
Sbjct: 121 SVEILSQIFSSKIGLRKNPLFISEGVLILGAISYVVSASEKFKLCCLELLCRILEEEEWL 180
Query: 181 LVGSVGGMIPGFLAGIGYALSSSVNAHVVRLLDSLLGIWGQVGGPTGNLSSGLMILHLIE 240
L+GSVGG +P F AGIGYALSSSVNAHVVRLLDSLLGIWG++G PTGNLS+GLMILHLIE
Sbjct: 181 LIGSVGGTVPEFFAGIGYALSSSVNAHVVRLLDSLLGIWGKIGSPTGNLSTGLMILHLIE 240
Query: 241 WVTSGLITLHSFEKLDVFSRATLESSKESYASFAVVMAAAGILRAFNTCNVLLSSSERET 300
WVTSGLI+LHSF+KLD S+A LESSKESYASFAVVMAAAGILRAFN+ LLSSSERET
Sbjct: 241 WVTSGLISLHSFKKLDFLSQAALESSKESYASFAVVMAAAGILRAFNSYKALLSSSERET 300
Query: 301 ISRIRISAQDCLESISRNFISTMEGSSVSGN-DRSRSLFLLCISSAIARCGPVSSRPPVL 360
ISRIRISAQDCLESI++NFISTMEGSS++GN D RSL LLCIS A+ARCGPV+SRPPVL
Sbjct: 301 ISRIRISAQDCLESIAKNFISTMEGSSITGNDDHGRSLLLLCISLAVARCGPVASRPPVL 360
Query: 361 ICIIYALLSEIFPLQRLYAKILEFSSVESGALGLPLVKEHLDSIPFKEAGAITGVLCTQY 420
IC+ YALL+EIFPLQRLYAK+LEFS ESG LGL LVKEHLDSIPFKEAG I GVLC+QY
Sbjct: 361 ICVTYALLTEIFPLQRLYAKLLEFSFGESGVLGLSLVKEHLDSIPFKEAGVIAGVLCSQY 420
Query: 421 ASIDEEDKKIVENLVWDYCHDIYSRHRLVGLVLRGREDELLEGIEKIAESAFFMVVVFAL 480
ASIDE+DKKIVENLVWDYC DIYSRHR VGLVLR REDELLE IEKIAESAF MVVVFAL
Sbjct: 421 ASIDEDDKKIVENLVWDYCQDIYSRHRRVGLVLRHREDELLENIEKIAESAFLMVVVFAL 480
Query: 481 AVTKEKLDSKCTPESQFDVSVRILASFSCMEYFRHIRLPEYMDTIRGVVASIQENESACV 540
AVTKEKL+SK T E+QFDVSVRIL SFSCMEYFR IR+PEYMDTIRGVVAS+QENESACV
Sbjct: 481 AVTKEKLNSKYTLETQFDVSVRILNSFSCMEYFRRIRMPEYMDTIRGVVASVQENESACV 540
Query: 541 SFIESMPTYKDQTNGPDNFIGQKLKYTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKV 600
SFIESMP+Y+DQT+GPD+ IGQKL+Y WTEDEVQTARMLFYIRVIPTCIE VPTQVYRKV
Sbjct: 541 SFIESMPSYQDQTHGPDSSIGQKLQYIWTEDEVQTARMLFYIRVIPTCIELVPTQVYRKV 600
Query: 601 VAPTMFLYMGHPNAKVARASHSVFIAFISGKDDSEEEKRVILKEELVFYYIERSLSGYPG 660
VAPTMFLYMGHPN+KVARASHSVFIAFISGKDD E+ RV+LKEELVFYYIERSLSGYPG
Sbjct: 601 VAPTMFLYMGHPNSKVARASHSVFIAFISGKDDGEDGNRVMLKEELVFYYIERSLSGYPG 660
Query: 661 ITPFEGMASGVAALVRYLPAGSPAIFYCIESLTVKATSLCSENFMENADLWKSWQGDLEP 720
ITPFEGMASGVAALVRYLPAGSP+IFYCI+SLTVKATSLCSENFM++ADLWK+WQGDLEP
Sbjct: 661 ITPFEGMASGVAALVRYLPAGSPSIFYCIDSLTVKATSLCSENFMDDADLWKTWQGDLEP 720
Query: 721 SKKILDMLLRLISLVDIQVLPSLMKNLAQLIIKLPAEGQNMVLDQLYSLVSEADDVTRKP 780
SKKILDMLLRLISLVDIQVLPSLM NLAQL+IKLP+EGQNMVLDQLYSLVSEADDVTRKP
Sbjct: 721 SKKILDMLLRLISLVDIQVLPSLMTNLAQLVIKLPSEGQNMVLDQLYSLVSEADDVTRKP 780
Query: 781 WLVSWLQSLSYLCCQSKSGDTHSNEMQNIRLLNFSWIVDPLNRFRSYARL 829
LVSWLQSLSYLC +S+S D HSNE Q RL NF+WIVDPLNR RSYARL
Sbjct: 781 SLVSWLQSLSYLCSRSRSADAHSNEKQTTRLSNFAWIVDPLNRIRSYARL 830
BLAST of Sed0002136.9 vs. ExPASy TrEMBL
Match:
A0A6J1DGY7 (uncharacterized protein LOC111020395 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111020395 PE=4 SV=1)
HSP 1 Score: 1345.9 bits (3482), Expect = 0.0e+00
Identity = 698/828 (84.30%), Postives = 754/828 (91.06%), Query Frame = 0
Query: 1 MAKQANSVFLEEWLNSTSGTSSALNFKTTSSSAREIIQAWAELRSSLEHQSFDDRHIQSL 60
MAK+ANSVFLEEWL S SGTSS+LN K+TS SAREIIQAWA LRSSLE+QSFDDRHIQSL
Sbjct: 1 MAKKANSVFLEEWLRSVSGTSSSLNSKSTSPSAREIIQAWAALRSSLENQSFDDRHIQSL 60
Query: 61 KTLVNSQSSLFVADPQAKLVISILSSPNFPLPHESYPLFLRVLYIWVRKSLRPSLLLVDS 120
KTLVNSQSSL+VADPQAK+VISILSSPNF LP ESYPLFLR+LYIWVRKSLRPSL+L+DS
Sbjct: 61 KTLVNSQSSLYVADPQAKIVISILSSPNFSLPDESYPLFLRILYIWVRKSLRPSLILIDS 120
Query: 121 FVEVLSQIFSSKIELGKNPLFFSEVVLVLGAISYIPSASENSKLCCLELLCRVLEEGYLL 180
VEVLSQIFSS+IEL K+P F SE +LVLGA S++ SASENSKL CLELLC + E+ YLL
Sbjct: 121 SVEVLSQIFSSRIELRKSPSFLSEGILVLGAFSFLFSASENSKLFCLELLCSLFEQEYLL 180
Query: 181 VGSVGGMIPGFLAGIGYALSSSVNAHVVRLLDSLLGIWGQVGGPTGNLSSGLMILHLIEW 240
+GSVGG+IP LAGIGYALSSSVNAH+VRLLDSLLGIWG+VGGP+G++SSGLMILHL EW
Sbjct: 181 IGSVGGIIPEVLAGIGYALSSSVNAHIVRLLDSLLGIWGKVGGPSGSVSSGLMILHLNEW 240
Query: 241 VTSGLITLHSFEKLDVFSRATLESSKESYASFAVVMAAAGILRAFNTCNVLLSSSERETI 300
VTSGLI+LHSFEKLDVFSRATLESSKESYASFAVVMAAAGILRAFNT LLSSSERETI
Sbjct: 241 VTSGLISLHSFEKLDVFSRATLESSKESYASFAVVMAAAGILRAFNTYKALLSSSERETI 300
Query: 301 SRIRISAQDCLESISRNFISTMEGSSVSGNDRSRSLFLLCISSAIARCGPVSSRPPVLIC 360
SRIRI AQDCLESI+RNFIS EG ++GND+ RSL LLCIS A+ARCGP+SSR P+LIC
Sbjct: 301 SRIRILAQDCLESIARNFISFTEGFLITGNDQ-RSLLLLCISLALARCGPLSSRSPLLIC 360
Query: 361 IIYALLSEIFPLQRLYAKILEFSSVESGALGLPLVKEHLDSIPFKEAGAITGVLCTQYAS 420
++YALL+EIFPL+RLYAKILE S ES ALGL LVKEHLDSIPFKE+GA+ GVLC+QYAS
Sbjct: 361 VVYALLTEIFPLRRLYAKILEVSFGESTALGLTLVKEHLDSIPFKESGAVAGVLCSQYAS 420
Query: 421 IDEEDKKIVENLVWDYCHDIYSRHRLVGLVLRGREDELLEGIEKIAESAFFMVVVFALAV 480
IDEE+K VENLVWDYC D+YSRHR VGLVL GREDELLE IEKIAESAF MVVVFALAV
Sbjct: 421 IDEENKTFVENLVWDYCQDVYSRHRQVGLVLCGREDELLENIEKIAESAFLMVVVFALAV 480
Query: 481 TKEKLDSKCTPESQFDVSVRILASFSCMEYFRHIRLPEYMDTIRGVVASIQENESACVSF 540
TKEKLD K T E+QFD+SVRIL SFSCMEYFR IRLPEYMD IRGVVASIQENESACVSF
Sbjct: 481 TKEKLDPKYTLETQFDISVRILDSFSCMEYFRRIRLPEYMDAIRGVVASIQENESACVSF 540
Query: 541 IESMPTYKDQTNGPDNFIGQKLKYTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKVVA 600
IESMPTY+DQTNGPDN G+K+KY WTEDEVQTARMLFY+RVIPTCIERVPTQV+RKVV
Sbjct: 541 IESMPTYQDQTNGPDNSTGRKIKYIWTEDEVQTARMLFYLRVIPTCIERVPTQVFRKVVV 600
Query: 601 PTMFLYMGHPNAKVARASHSVFIAFISGKDDSEEEKRVILKEELVFYYIERSLSGYPGIT 660
PTMFLYMGHPN KVA+ASHSVFIAFISGKDD E+EKRVILKEELVFYY+ERSLSGYPGIT
Sbjct: 601 PTMFLYMGHPNGKVAQASHSVFIAFISGKDDDEDEKRVILKEELVFYYLERSLSGYPGIT 660
Query: 661 PFEGMASGVAALVRYLPAGSPAIFYCIESLTVKATSLCSENFMENADLWKSWQGDLEPSK 720
PFEGMASGVAALVRYLPAGSPAIFYCI SL KAT LCSENFM +ADLWK+WQGDLEPSK
Sbjct: 661 PFEGMASGVAALVRYLPAGSPAIFYCINSLIEKATKLCSENFMSDADLWKTWQGDLEPSK 720
Query: 721 KILDMLLRLISLVDIQVLPSLMKNLAQLIIKLPAEGQNMVLDQLYSLVSEADDVTRKPWL 780
KILDMLLRLISLVDIQVLPSLMKNLAQLII+LP EGQNMVLDQLYSLVSEADDVTRKP L
Sbjct: 721 KILDMLLRLISLVDIQVLPSLMKNLAQLIIELPTEGQNMVLDQLYSLVSEADDVTRKPLL 780
Query: 781 VSWLQSLSYLCCQSKSGDTHSNEMQNIRLLNFSWIVDPLNRFRSYARL 829
VSWLQSLSYLC QSKS D SNE Q+ +LL +WIVDPLNR RSYARL
Sbjct: 781 VSWLQSLSYLCSQSKSADARSNEKQSKQLLKLAWIVDPLNRIRSYARL 827
BLAST of Sed0002136.9 vs. ExPASy TrEMBL
Match:
A0A5D3D7C1 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold443G001100 PE=4 SV=1)
HSP 1 Score: 1331.2 bits (3444), Expect = 0.0e+00
Identity = 696/828 (84.06%), Postives = 743/828 (89.73%), Query Frame = 0
Query: 1 MAKQANSVFLEEWLNSTSGTSSALNFKTTSSSAREIIQAWAELRSSLEHQSFDDRHIQSL 60
MAKQ +SVFLEEWL S SG + N K TSSSAREIIQAWAELRSSLEHQ FDDRHIQSL
Sbjct: 1 MAKQGSSVFLEEWLKSISGIA---NSKPTSSSAREIIQAWAELRSSLEHQLFDDRHIQSL 60
Query: 61 KTLVNSQSSLFVADPQAKLVISILSSPNFPLPHESYPLFLRVLYIWVRKSLRPSLLLVDS 120
K LVNSQSSL+VADPQAKLVIS+LSSPNF + ESYPLFLR+LYIWVRKSLRPSL+L+DS
Sbjct: 61 KILVNSQSSLYVADPQAKLVISLLSSPNFSISDESYPLFLRILYIWVRKSLRPSLVLLDS 120
Query: 121 FVEVLSQIFSSKIELGKNPLFFSEVVLVLGAISYIPSASENSKLCCLELLCRVLEEGYLL 180
VEVLSQIFSSKIEL K PLF SE VLVLGAISY SASE SKLCCLELLCRVLEE YLL
Sbjct: 121 SVEVLSQIFSSKIELRKKPLFISEGVLVLGAISYQLSASEKSKLCCLELLCRVLEEDYLL 180
Query: 181 VGSVGGMIPGFLAGIGYALSSSVNAHVVRLLDSLLGIWGQVGGPTGNLSSGLMILHLIEW 240
VGG++P FLAGIGYALSSSVNAHVVRLLDSLLGIW +V GP LSSGLMILH+IEW
Sbjct: 181 ---VGGIVPEFLAGIGYALSSSVNAHVVRLLDSLLGIWSKVNGPIDTLSSGLMILHMIEW 240
Query: 241 VTSGLITLHSFEKLDVFSRATLESSKESYASFAVVMAAAGILRAFNTCNVLLSSSERETI 300
VTSGLI LHSFEKLDVFS AT SSKESYASFAVVMAAAGILR FNT LL+SSERETI
Sbjct: 241 VTSGLINLHSFEKLDVFSHATFVSSKESYASFAVVMAAAGILRGFNTYKGLLNSSERETI 300
Query: 301 SRIRISAQDCLESISRNFISTMEGSSVSGNDRSRSLFLLCISSAIARCGPVSSRPPVLIC 360
SRIRI+AQDCLESI+RNFISTME SS++GND RS+ LLCIS AIARCGPVS+RPPVLI
Sbjct: 301 SRIRIAAQDCLESIARNFISTMEASSITGNDHRRSVLLLCISLAIARCGPVSARPPVLIS 360
Query: 361 IIYALLSEIFPLQRLYAKILEFSSVESGALGLPLVKEHLDSIPFKEAGAITGVLCTQYAS 420
++Y LL+EIFPLQRLYAKI EFS E G LGL LVKEHL SIPFKEAGAI GVLC+QYAS
Sbjct: 361 VVYGLLTEIFPLQRLYAKINEFSFAELGVLGLTLVKEHLGSIPFKEAGAIAGVLCSQYAS 420
Query: 421 IDEEDKKIVENLVWDYCHDIYSRHRLVGLVLRGREDELLEGIEKIAESAFFMVVVFALAV 480
+ EE++ IVENLVWDYC D+YSRHRLVGLVLRGREDELLE IEKIAESAF MVVVFALAV
Sbjct: 421 LGEEERSIVENLVWDYCRDVYSRHRLVGLVLRGREDELLENIEKIAESAFLMVVVFALAV 480
Query: 481 TKEKLDSKCTPESQFDVSVRILASFSCMEYFRHIRLPEYMDTIRGVVASIQENESACVSF 540
TKEKLDSK T ESQFDVSVRIL SFSCMEYFR IRL EYM+TIRGVVASIQ NESACVSF
Sbjct: 481 TKEKLDSKYTLESQFDVSVRILVSFSCMEYFRRIRLQEYMETIRGVVASIQGNESACVSF 540
Query: 541 IESMPTYKDQTNGPDNFIGQKLKYTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKVVA 600
IESMPTY+DQTNGPDN IGQK+KY+W +DEVQTARMLFYIRVIPTCIE VPTQVY KVVA
Sbjct: 541 IESMPTYQDQTNGPDNSIGQKIKYSWVKDEVQTARMLFYIRVIPTCIEHVPTQVYGKVVA 600
Query: 601 PTMFLYMGHPNAKVARASHSVFIAFISGKDDSEEEKRVILKEELVFYYIERSLSGYPGIT 660
PTMFLYMGHPNAKVARASHSVFIAF+SGKDD ++EKR LKEELVFYY+ERSLSGYPGIT
Sbjct: 601 PTMFLYMGHPNAKVARASHSVFIAFMSGKDDIDDEKRATLKEELVFYYVERSLSGYPGIT 660
Query: 661 PFEGMASGVAALVRYLPAGSPAIFYCIESLTVKATSLCSENFMENADLWKSWQGDLEPSK 720
PFEGMASGVAALVRYLPAGSPAIFYCI+SLTVKATSLCSENFM++ DLWK+WQGDLEPSK
Sbjct: 661 PFEGMASGVAALVRYLPAGSPAIFYCIDSLTVKATSLCSENFMDDGDLWKTWQGDLEPSK 720
Query: 721 KILDMLLRLISLVDIQVLPSLMKNLAQLIIKLPAEGQNMVLDQLYSLVSEADDVTRKPWL 780
KILDMLLRLISLVDIQVLPSLMK+LAQLIIKLP EGQN+VLDQLYSLVSEADDVTRKP L
Sbjct: 721 KILDMLLRLISLVDIQVLPSLMKSLAQLIIKLPTEGQNLVLDQLYSLVSEADDVTRKPML 780
Query: 781 VSWLQSLSYLCCQSKSGDTHSNEMQNIRLLNFSWIVDPLNRFRSYARL 829
VSWLQSLSYLC SKS + S+E Q+ RL NF+W+VDPLNR RSYARL
Sbjct: 781 VSWLQSLSYLCSLSKSAEARSDEKQSTRLANFAWLVDPLNRIRSYARL 822
BLAST of Sed0002136.9 vs. ExPASy TrEMBL
Match:
A0A0A0L5R8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G038110 PE=4 SV=1)
HSP 1 Score: 1328.2 bits (3436), Expect = 0.0e+00
Identity = 693/832 (83.29%), Postives = 743/832 (89.30%), Query Frame = 0
Query: 1 MAKQANSVFLEEWLNSTSGTSSALNFKTTSSSAREIIQAWAELRSSLEHQSFDDRHIQSL 60
MAKQ +SVFLE+WL S G + N K TSSSAREIIQAWAELRSSLEHQ FDDRHIQSL
Sbjct: 1 MAKQGSSVFLEDWLKSIGGIA---NSKPTSSSAREIIQAWAELRSSLEHQFFDDRHIQSL 60
Query: 61 KTLVNSQSSLFVADPQAKLVISILSSPNFPLPHESYPLFLRVLYIWVRKSLRPSLLLVDS 120
K LVNSQSSL+VADPQAKLVIS+LSSPNF + ESYPLFLR+LYIW+RKSLRPSL+LVDS
Sbjct: 61 KILVNSQSSLYVADPQAKLVISLLSSPNFSISDESYPLFLRILYIWLRKSLRPSLVLVDS 120
Query: 121 FVEVLSQIFSSKIELGKNPLFFSEVVLVLGAISYIPSASENSKLCCLELLCRVLEEGYLL 180
VEVLSQIFSSKIEL KNPLF SE VLVLGAISY+PSASE SKLCCLELLCRVLEE YLL
Sbjct: 121 SVEVLSQIFSSKIELRKNPLFISEGVLVLGAISYLPSASEKSKLCCLELLCRVLEEDYLL 180
Query: 181 VGSVGGMIPGFLAGIGYALSSSVNAHVVRLLDSLLGIWGQVGGPTGNLSSGLMILHLIEW 240
VGG++P FLAGIGYA SSSVNAHVVRLLDSLLGIW +V GP LSSGLMILH+I W
Sbjct: 181 ---VGGIVPEFLAGIGYAFSSSVNAHVVRLLDSLLGIWSKVNGPIDTLSSGLMILHMIAW 240
Query: 241 VTSGLITLHSFEKLDVFSRATLESSKESYASFAVVMAAAGILRAFNTCNVLLSSSERETI 300
VTSGLI LHSFEKLDVFS ATL SSKESYASFAVVMAAAGILRAFNT LLSSSERETI
Sbjct: 241 VTSGLINLHSFEKLDVFSHATLVSSKESYASFAVVMAAAGILRAFNTYKGLLSSSERETI 300
Query: 301 SRIRISAQDCLESISRNFISTMEGSSVSGNDRSRSLFLLCISSAIARCGPVSSRPPVLIC 360
SRIRISAQDCLESI+RNFISTMEGSS++GND RS+ LLCIS AIARCGPVS+RPPVLI
Sbjct: 301 SRIRISAQDCLESIARNFISTMEGSSITGNDHRRSVLLLCISLAIARCGPVSARPPVLIS 360
Query: 361 IIYALLSEIFPLQRLYAKILEFSSVESGALGLPLVKEHLDSIPFKEAGAITGVLCTQYAS 420
++YALL+EIFPLQRLYAKI EFS E LGL LVKEHL SIPFKEAGAI GVLC+QYAS
Sbjct: 361 VVYALLTEIFPLQRLYAKINEFSFSELSVLGLTLVKEHLGSIPFKEAGAIAGVLCSQYAS 420
Query: 421 IDEEDKKIVENLVWDYCHDIYSRHRLVGLVLRGREDELLEGIEKIAESAFFMVVVFALAV 480
+ EE+K IVENLVWDYC D+YSRHRLV LVL GREDELLE IEKIAESAF MVVVFALAV
Sbjct: 421 LGEEEKSIVENLVWDYCRDVYSRHRLVNLVLHGREDELLESIEKIAESAFLMVVVFALAV 480
Query: 481 TKEKLDSKCTPESQFDVSVRILASFSCMEYFRHIRLPEYMDTIRGVVASIQENESACVSF 540
TKEKL SK T ESQFDVSV+IL SFSCMEYFR IRLPEYMDTIRGVV SIQ NESACV F
Sbjct: 481 TKEKLGSKYTLESQFDVSVKILVSFSCMEYFRRIRLPEYMDTIRGVVGSIQGNESACVYF 540
Query: 541 IESMPTYKDQTNGPDNFIGQKLKYTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKVVA 600
IESMPTY+DQTNGPDN IGQK++Y+W +DEVQTARMLFYIRV+PTCIE VPTQVY KVVA
Sbjct: 541 IESMPTYQDQTNGPDNSIGQKIQYSWAKDEVQTARMLFYIRVVPTCIEHVPTQVYGKVVA 600
Query: 601 PTMFLYMGHPNAKVARASHSVFIAFISGKDDSEEEKRVILKEELVFYYIERSLSGYPGIT 660
PTMFLYMGHPN+KV RASHSVFIAF+SGKDD ++EKR LKEELVFYYIERSLSGYPGIT
Sbjct: 601 PTMFLYMGHPNSKVVRASHSVFIAFMSGKDDIDDEKRTTLKEELVFYYIERSLSGYPGIT 660
Query: 661 PFEGMASGVAALVRYLPAGSPAIFYCIESLTVKATSLCSENFMENADLWKSWQGDLEPSK 720
PFEGMASGVAALVRYLPAGSPAIFYCI+SLTVKATSLCSENFM++ DLWK+WQGDLEPSK
Sbjct: 661 PFEGMASGVAALVRYLPAGSPAIFYCIDSLTVKATSLCSENFMDDGDLWKTWQGDLEPSK 720
Query: 721 KILDMLLRLISLVDIQVLPSLMKNLAQLIIKLPAEGQNMVLDQLYSLVSEADDVTRKPWL 780
KILDMLLRLISLVDIQVLPSLMK+LAQLIIKLP EGQN++LDQLYSLVSEADDVTRKP L
Sbjct: 721 KILDMLLRLISLVDIQVLPSLMKSLAQLIIKLPTEGQNLILDQLYSLVSEADDVTRKPML 780
Query: 781 VSWLQSLSYLCCQSKSGDTHSNEMQNI----RLLNFSWIVDPLNRFRSYARL 829
VSWLQSLSYLC SKS + HSNE Q++ RL NF+W++DPLNR RSYARL
Sbjct: 781 VSWLQSLSYLCSLSKSAEAHSNEKQSLKQSTRLANFAWLIDPLNRIRSYARL 826
BLAST of Sed0002136.9 vs. TAIR 10
Match:
AT1G73970.1 (unknown protein; Has 34 Blast hits to 33 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 32; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )
HSP 1 Score: 767.7 bits (1981), Expect = 9.6e-222
Identity = 416/799 (52.07%), Postives = 566/799 (70.84%), Query Frame = 0
Query: 1 MAKQA-NSVFLEEWLNSTSGTSSA--LNFKTTSSSAREIIQAWAELRSSLEHQSFDDRHI 60
MA++A NS FLEEWL + SG+S + L + ++ SAR IIQAW+E+R SL++Q+FD R++
Sbjct: 1 MARKANNSFFLEEWLRTVSGSSVSGDLVKQNSAPSARSIIQAWSEIRESLQNQNFDSRYL 60
Query: 61 QSLKTLVNSQSSLFVADPQAKLVISILSSPNFPLPHESYPLFLRVLYIWVRKSLRPSLLL 120
Q+L+ LV+S+S++ VADPQAKL+ISIL+ + LP ESY L LR+LY+W+RK+ RPS L
Sbjct: 61 QALRALVSSESTIHVADPQAKLLISILAFQDVSLPSESYTLVLRLLYVWIRKAFRPSQAL 120
Query: 121 VDSFVEVLSQIFSSKIELGKNPLFFSEVVLVLGAISYIPSASENSKLCCLELLCRVLEEG 180
V V+ + + + L P ++ VLV GA + +PS S + K+ CLELLCR+LEE
Sbjct: 121 VGVAVQAIRGVVDDRRNL--QPALVAQSVLVSGAFACVPSLSGDVKVLCLELLCRLLEEE 180
Query: 181 YLLVGSVGGMIPGFLAGIGYALSSSVNAHVVRLLDSLLGIWGQVGGPTGNLSSGLMILHL 240
Y LVGS ++P LAGIGYALSSS++ H VRLLD L GIW + GP G ++ GLMILHL
Sbjct: 181 YSLVGSQEELVPVVLAGIGYALSSSLDVHYVRLLDLLFGIWLKDEGPRGTVTYGLMILHL 240
Query: 241 IEWVTSGLITLHSFEKLDVFSRATLESSKESYASFAVVMAAAGILRAFNTCNVLLSSSER 300
IEWV SG + +S K+ +F+ LE+SKE YA FAV MAAAG++RA + S ++
Sbjct: 241 IEWVVSGYMRSNSINKMSLFANEVLETSKEKYAVFAVFMAAAGVVRA--STAGFSSGAQS 300
Query: 301 ETISRIRISAQDCLESISRNFISTMEGSSVSGNDRSRSLFLLCISSAIARCGPVSSRPPV 360
IS++R SA+ +E +++ +S ++ R L L C + A+ARCG VSS P+
Sbjct: 301 LEISKLRNSAEKRIEFVAQILVSNGNVVTLPTTQREGPL-LKCFAIALARCGSVSSSAPL 360
Query: 361 LICIIYALLSEIFPLQRLYAKILEFSSVESGALGLPLVKEHLDSIPFKEAGAITGVLCTQ 420
L+C+ ALL+++FPL ++Y E L V+EHL + FKE+GAI+G C Q
Sbjct: 361 LLCLTSALLTQVFPLGQIYESFCNAFGKEPIGPRLIWVREHLSDVLFKESGAISGAFCNQ 420
Query: 421 YASIDEEDKKIVENLVWDYCHDIYSRHRLVGLVLRGREDELLEGIEKIAESAFFMVVVFA 480
Y+S EE+K IVEN++WD+C ++Y +HR + ++L G ED LL IEKIAES+F MVVVFA
Sbjct: 421 YSSASEENKYIVENMIWDFCQNLYLQHRQIAMLLCGIEDTLLGDIEKIAESSFLMVVVFA 480
Query: 481 LAVTKEKLDSKCTPESQFDVSVRILASFSCMEYFRHIRLPEYMDTIRGVVASIQENESAC 540
LAVTK+ L + E + SV+IL SFSC+EYFRHIRLPEYM+TIR V++ +QEN++ C
Sbjct: 481 LAVTKQWLKPIVSKERKMVTSVKILVSFSCVEYFRHIRLPEYMETIREVISCVQENDAPC 540
Query: 541 VSFIESMPTYKDQTNGPDNFIGQKLKYTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRK 600
VSF+ES+P Y TN P + Q++KY W+ D+VQT+R+LFY+RVIPTCI R+ +R
Sbjct: 541 VSFVESIPAYDSLTN-PKDLFTQRIKYEWSRDDVQTSRILFYLRVIPTCIGRLSASAFRG 600
Query: 601 VVAPTMFLYMGHPNAKVARASHSVFIAFISGKDDSEEEKRVILKEELVFYYIERSLSGYP 660
VVA TMFLY+GHPN KVA+ASH++ AF+S +SEE++R KE+LVFYY++RSL YP
Sbjct: 601 VVASTMFLYIGHPNRKVAQASHTLLAAFLSSAKESEEDERTQFKEQLVFYYMQRSLEVYP 660
Query: 661 GITPFEGMASGVAALVRYLPAGSPAIFYCIESLTVKATSLCSENFMENADLWKSWQGDLE 720
ITPFEG+ASGVA LV++LPAGSPAIFY + SL KA++ +E+ +
Sbjct: 661 EITPFEGLASGVATLVQHLPAGSPAIFYSVHSLVEKASTFSTESLQGRKS---------D 720
Query: 721 PSKKILDMLLRLISLVDIQVLPSLMKNLAQLIIKLPAEGQNMVLDQLYSLVSEADDVTRK 780
P +IL++LLRL+SLVDIQVLP LMK+LAQL+IKLP E QN+VL +LY V+E+DDV RK
Sbjct: 721 PGNQILELLLRLVSLVDIQVLPYLMKSLAQLVIKLPKERQNVVLGELYGQVAESDDVIRK 780
Query: 781 PWLVSWLQSLSYLCCQSKS 797
P LVSWLQSL+YLC +++
Sbjct: 781 PSLVSWLQSLNYLCSNNRT 784
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022944201.1 | 0.0e+00 | 85.66 | uncharacterized protein LOC111448717 isoform X1 [Cucurbita moschata] >XP_0229442... | [more] |
XP_038903921.1 | 0.0e+00 | 86.47 | uncharacterized protein LOC120090375 isoform X1 [Benincasa hispida] | [more] |
XP_023005293.1 | 0.0e+00 | 85.66 | uncharacterized protein LOC111498339 isoform X1 [Cucurbita maxima] >XP_023005295... | [more] |
XP_023539379.1 | 0.0e+00 | 85.04 | uncharacterized protein LOC111800037 [Cucurbita pepo subsp. pepo] >XP_023539380.... | [more] |
KAG6596825.1 | 0.0e+00 | 84.46 | hypothetical protein SDJN03_10005, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1FWB1 | 0.0e+00 | 85.66 | uncharacterized protein LOC111448717 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1KX18 | 0.0e+00 | 85.66 | uncharacterized protein LOC111498339 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1DGY7 | 0.0e+00 | 84.30 | uncharacterized protein LOC111020395 isoform X1 OS=Momordica charantia OX=3673 G... | [more] |
A0A5D3D7C1 | 0.0e+00 | 84.06 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... | [more] |
A0A0A0L5R8 | 0.0e+00 | 83.29 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G038110 PE=4 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
AT1G73970.1 | 9.6e-222 | 52.07 | unknown protein; Has 34 Blast hits to 33 proteins in 15 species: Archae - 0; Bac... | [more] |
Relationships
This mRNA is a part of the following gene feature(s):
Feature Name | Unique Name | Type |
Sed0002136 | Sed0002136 | gene |
The following five_prime_UTR feature(s) are a part of this mRNA:
Feature Name | Unique Name | Type |
Sed0002136.9-five_prime_utr | Sed0002136.9-five_prime_utr-LG04:5903924..5904014 | five_prime_UTR |
The following exon feature(s) are a part of this mRNA:
Feature Name | Unique Name | Type |
Sed0002136.9-exon | Sed0002136.9-exon-LG04:5903924..5905677 | exon |
Sed0002136.9-exon | Sed0002136.9-exon-LG04:5905792..5905942 | exon |
Sed0002136.9-exon | Sed0002136.9-exon-LG04:5906043..5906190 | exon |
Sed0002136.9-exon | Sed0002136.9-exon-LG04:5906291..5906536 | exon |
Sed0002136.9-exon | Sed0002136.9-exon-LG04:5907216..5907495 | exon |
Sed0002136.9-exon | Sed0002136.9-exon-LG04:5917960..5918247 | exon |
The following CDS feature(s) are a part of this mRNA:
Feature Name | Unique Name | Type |
Sed0002136.9-cds | Sed0002136.9-cds-LG04:5904015..5905677 | CDS |
Sed0002136.9-cds | Sed0002136.9-cds-LG04:5905792..5905942 | CDS |
Sed0002136.9-cds | Sed0002136.9-cds-LG04:5906043..5906190 | CDS |
Sed0002136.9-cds | Sed0002136.9-cds-LG04:5906291..5906536 | CDS |
Sed0002136.9-cds | Sed0002136.9-cds-LG04:5907216..5907494 | CDS |
The following three_prime_UTR feature(s) are a part of this mRNA:
Feature Name | Unique Name | Type |
Sed0002136.9-three_prime_utr | Sed0002136.9-three_prime_utr-LG04:5907495..5907495 | three_prime_UTR |
Sed0002136.9-three_prime_utr | Sed0002136.9-three_prime_utr-LG04:5917960..5918247 | three_prime_UTR |
The following polypeptide feature(s) derives from this mRNA:
Feature Name | Unique Name | Type |
Sed0002136.9 | Sed0002136.9-protein | polypeptide |