Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAAATAAAAAAAGAAGAAAGAAAAAAACCACAACATTTTGAGCTTTCTTTCGTGTGCTCTGGTCGACATTTTTGGAGCTCCAAATGATAAGATGGCAAAGCAGGCGAATTCTGTATTCCTCGAAGAATGGTTGAATAGCACCAGTGGAACTAGCAGTGCCCTTAACTTCAAAACCACTTCCTCGTCTGCTCGAGAAATCATCCAAGCATGGGCTGAGCTTAGAAGCTCTTTAGAGCATCAATCGTTCGATGATCGCCACATTCAATCGCTCAAAACTCTTGTCAACTCACAATCTTCACTCTTTGTTGCAGACCCACAAGCTAAGCTGGTGATTTCTATACTCTCTTCTCCCAATTTCCCCCTTCCCCATGAATCGTATCCTCTCTTTTTGAGGGTTCTTTATATTTGGGTCAGAAAATCTCTCCGCCCCTCTTTGCTACTTGTCGATTCATTCGTTGAGGTTCTCTCTCAGATTTTTTCATCCAAAATTGAATTGGGAAAAAACCCTTTGTTCTTCTCGGAAGTGGTTTTGGTTTTGGGTGCTATTTCGTACATTCCTTCAGCTTCAGAAAATTCAAAATTATGCTGTTTGGAGTTGCTTTGCAGGGTTTTGGAAGAAGGGTACCTGCTGGTTGGATCAGTAGGAGGGATGATTCCAGGGTTTCTTGCAGGTATTGGTTATGCTTTATCTTCATCAGTGAATGCTCATGTTGTTAGACTGTTGGATTCTTTATTAGGAATTTGGGGTCAGGTAGGTGGCCCTACGGGCAATCTTTCTAGTGGGTTAATGATCCTGCATTTGATTGAGTGGGTGACTTCTGGTTTGATTACCCTTCATTCTTTTGAGAAATTAGATGTTTTTAGCCGAGCTACTTTAGAGTCTTCAAAGGAAAGTTATGCTTCATTTGCTGTTGTTATGGCTGCAGCTGGAATATTGAGGGCTTTTAATACTTGCAATGTTTTGTTGAGTAGTTCAGAAAGAGAAACGATATCTAGAATAAGGATTTCAGCTCAGGATTGTTTAGAATCTATATCGAGAAATTTTATTTCGACTATGGAAGGATCTTCCGTTTCAGGCAATGACCGTAGTAGGAGCCTGTTTCTATTGTGTATTTCATCTGCAATAGCACGTTGTGGCCCTGTTTCATCTCGCCCACCTGTGCTCATTTGCATTATTTATGCTTTGTTGAGTGAAATATTTCCTTTGCAGCGTTTATATGCCAAAATTCTTGAATTCTCTTCTGTTGAGTCGGGTGCGTTGGGGCTACCTCTAGTAAAAGAGCATCTTGATAGTATTCCTTTTAAGGAAGCAGGGGCCATCACGGGTGTTCTTTGCACTCAGTATGCTTCAATTGACGAAGAGGACAAAAAGATTGTGGAGAATCTTGTATGGGATTACTGTCATGACATCTACTCAAGGCACAGACTAGTAGGGTTGGTGCTTCGTGGCAGAGAGGATGAATTACTAGAGGGTATAGAGAAAATTGCAGAGTCTGCTTTTTTCATGGTTGTAGTATTTGCATTAGCTGTCACAAAAGAAAAGTTGGATTCCAAATGTACACCGGAAAGTCAATTTGACGTATCAGTAAGAATACTTGCTTCATTCTCTTGTATGGAATACTTTCGGCATATTCGCTTGCCAGAATATATGGATACTATCCGAGGGGTTGTTGCAAGCATTCAGGAGAATGAGTCTGCCTGTGTCTCTTTCATTGAATCAATGCCCACATACAAAGATCAAACAAACGGGCCAGGTCTTGCATTCAAGTTATTTTACTTGGCCATGAAGATTTAATTATTTTGGCTTTTATGCTAATATTGTTATATGCTCATCAAGCATTTGAAATTTGAAATTCATCTTTTTGTAGATAACTTTATTGGGCAGAAATTAAAATATACATGGACAGAGGATGAGGTGCAAACTGCACGCATGCTGTTTTATATACGAGTCATTCCAACTTGCATCGAACGTGTTCCTACCCAAGTGTATAGGAAAGTGGTTGCTCCAACAATGTTTTTGTATCCTATTCTAGCTACCGCGTTCTTGCAAATTATGCTTTCTTTCTTCTTTCATCTTTTTCCTCATGAACTATTTGTTTCCCATTACCAAGTGAATTAGATATATGGGACATCCTAATGCCAAAGTAGCCCGAGCTTCACACTCGGTGTTTATAGCTTTCATATCAGGGAAGGATGACAGTGAAGAAGAAAAGAGAGTGATATTGAAGGAGGAGCTTGTTTTCTACTACATTGAGAGATCTTTATCAGTATCACTCTTAACAAGATTTTTTTTTCTTTATTTATGTGATCAAACTTTAGATTTACAATTTGAGTTAGATATTGAGCATTCAAATTCTATGATTTTAGGGATATCCTGGCATTACACCATTTGAAGGTATGGCTTCAGGTGTTGCAGCTTTGGTGCGATATCTTCCTGCAGGAAGTCCAGCCATCTTTTATTGTATCGAAAGTCTTACTGTAAAAGCTACAAGCCTTTGCAGTGAAAACTTCATGGAAAATGCTGATCTGTGGAAGTCGTGGCAGGGAGACTTAGAGCCTTCCAAGAAAATTCTTGATATGCTTTTACGGCTCATTTCTCTTGTTGATATACAGGTAACTATCTTAAGATGCTTGTTCTTCCATATTTAGTACTGAATTCATTCTGTTATTTCTCTTTTGAACAATATATTAACTTATCTTTTAATTGAATGTTAATTTTAGGAAATTAGATCTGTAGCCAAGATATGATTCAATTAGATGATTCATGTATTTGATGTTCTTGAATTAGTAGTTACTGACCATCGTGAAGTAACATTTGGGTCTTGGGGAGCAAAGAGGTGATGAGTTTTAATCCATGGTAGCCACCTACCTAGGAATTAATTTTTTGTAGGTTTCCTTGACATCCAAATGTTGTAGGGTGGACGGTATGTCCCTTAACATTAGTCAAGGTGTGCACAAGCTGGCCTGGACAATCACAGATATCTAAAAAAAAGAAGAAGAAGAATTAATAGCTACTGATGGAATAATTACTTTTGAACTTCTATTAGTAGTTATTACAGTTATTCTGTTTTCTATGGAATTCTTCTTTGGATAAAGTACAATTTTAATTGATGCATGGAATCCTGGATTGTGGGTTCTTGCACTGTTCAAAGCAACCGGGACTCTACTTCGCTATTCTGCATAACTAGGAATGATTTTGATCAAGGAAAGAAATCGACCATGTATCATCCTGAATTTCTATCTGTTTTTTTTTTTTTTTTTTAGTTTGGATTTTACATCTGGGATATTTTAGGTACTCCCAAGCTTGATGAAAAATTTAGCACAACTGATCATCAAGCTACCGGCCGAGGGCCAAAATATGGTTCTTGATCAGTTATACTCCCTGGTTTCAGAAGCTGATGATGTCACACGCAAACCCTGGCTAGTTTCATGGTTACAATCATTATCATATCTATGCTGCCAATCTAAAAGTGGAGATACACACTCTAATGAGATGCAAAATATACGGCTTTTAAACTTCTCTTGGATTGTTGACCCATTGAACCGTTTTCGATCCTATGCACGACTCTGAGGTATGAATTTTTAGTTCTTGGTTTAATTTTCTCCCTTCCATGTTAATGTTAAGTTTATTATATGCATTTGCGACAGTAGTTTACTTGCTTACCTATTATTCTGTACATCTAATATTTTCCTTTATGATTTTATCGTAGTTGTGTTTAACAATTAGAAGCTGTGTGCATGTGTTAAAAAGAAAAATTTTATGGTTTGGATGAATATATTTTATGTAATTGTGAAGACTGAAGACAAATACTGTATGTTTCTAATTGTTGATTAGAATAATAGAAGGTTACATATATAGGCATTAAAGGAAAGGAAACCCTAACAAGAAAGTTGAAATTACCATAAAGGACAATAAACTAATAAACCCCTAAAGAATCTTGTTGGGTTCTATCCTCTCGGTTGTTTTTTACATTTTTTCTTTTGCGGATAAGGGGTAGTCTTTCAGAAAAGTGTTAATCTATGGACTATCTTCCAGAAGCTTTCTCAGGAGGTGAAGAACATTCTGATAAGTGGTCAAACCATCTCAAGGAAGTCCTCTACTTTGGTTGCCTAGTCGATGTTCCTCTTTCCAATTTAGGAGTTTGGGGAGGAGAGGAGAGGAAGGAATCTCAACCCCCTTATTTGTTTTTGGTACATCACAACCCCTCCCAACTCCTACTCCACTCATTTTCCAACATCCCATCAAATATCTTATCTAATCCCATCAAATATTATCATAATAAAATATATTAAACTTAATAAATTATAAATGAGTACAAATATACGATGATTACAATAAAAAATAATAAAAATATAAAATAAACATCTTCTCCCCTCAACACCAATCCCTCCTCTCCTCTCATAACACCAATTTCCAAACAAGTATTTCATCCTTCTAAATCACTCCAGTTCCTCTACCTCTTACACTCCCATCCTCTCCTCCCCAAACTCCAAAATCCAATCACCCCCTCAGGATGAAAATGGTCAGATTTATGGAGATGTATGCAGTTAGAGCTCTGTTGTGGTTGTTGCTGATTGATACAAATCAGACAATTTTTTAGGATAGACAGGAACATATAGATTTTGTGATATTTTTCGCATATTGCCTCCTGTTGGAGTGATCTTCTCAAGATCTTCCTAGGAACAGGCTCTTTTCTTGATTTATCTTGATTCAAGGTGTATTTTTTGGTATAGATTCAAGGTTGTTTCCCATTCTTCTCCCCTCAGGATGAAAATGGTCAGATTTATGGAGATGTATGCAGTTAGAGCTCTGTTGTGGTTGTTGCTGATTGATACAAATCAGACAATTTTTTAGGATAGACAGGAACATATAGATTTTGTGATATTTTTCGCATATTGCCTCCTGTTGGAGTGATCTTCTCAAGATCTTCCTAGGAACAGGCTCTTTTCTTGATTTATCTTGATTCAAGGTGTATTTTTTGGTATAGATTCAAGGTTGTTTCCCATTCTTCTCCCCATTTTTTTTTTCTGAATCATGTTGGTTATTAACAAATAGCAAATCAACTAAAAACTAAAGCTCAACTGCTTATCCAGTTATATTTAATACATTATGTTGGAGATATTCACCTTGTGGCAACGGTTCTGTTGGTTGATATAATTAAATTTGCCCTAATCCCTGAGCTTAGGCTTTTGGATTGATTGGTGGTTTAACATAATTCAACATGGTATCAGAGCAGGAGGTCATGAGTTCAAGTCCCTGCCAAGTCGTTTCCTCCCCAATTAAAATTAAATTCCACTTGTAGGGCATTTCTCAAATTTCAAAGCCCACAAGTGAGGGGGGGTGTTGGTTGATATAATTAAATTTTCCCTAATCCATGAGCTTAGGCTTTTGGATTGATCGGTGGTTTAACATAATGATAGAATCTACCACTATTTTATTAAGAACTAAGATAATCTCTCGAGAGGATCTCTCAAAGAGCCCAAATACAAAGACAACAACTGTTTGAATGAATTGCTAAAGGCCTAACATATCCTATATATAGTAAACCTAGAAGATGACCTAATTTAGTAATAACTCTAATAAGGCCCATCATTGATAATACAACTGAATTACAATTAAAGACTATTAAATATAATTAATAAATACTACTTTCATAATGTCTTGTTCCTCCTCGAGTAAACCTTCACGTGGGCCCGGTTCGTATCAATACTTCCCCCAAAGAGCCACCTTGTCCTCAAGGTGAAAGTCGAGAAACTGGAGCTCAAGATCCCCGGCGCCTCCCATGTAGCCTCGTGCAGGTGGACCCCTCCCACTCGAATCAAAACTTGGCGCGAACCCCCTTCGAGCGGGTCCTCCCATGCCCAACACAAACCTTAGGGCAAACCACGACACACAAATCTTCCGTCACCATGGACGGCGTAGCCAGAACTAACACCGAAGAACCCACCGCCTTCCGCAGGGCGGACACATGAAAAACCGGATGAATCTTCACTGATGGAGGCAACTCGAGTCGATATGCCACTCGCCCAACCTGTGCCAAAACACGATACGGACCAATAAATCGGGTGCTAACTTGGGATGTTTAAACCGCGCGAGAGACGATAGTCGGTAAGGTCGGAGTTTAATATAAACCAAGTCATCAACAAAGAAAGCCCGAACATCGCGGCGATTGACATTCGCGCGATCCGACATAGACATTTGTGCACGCGATAAATTAGCCTTCAAAGAATCCAACATTCGATCTCTTTCTAACATCAACGAGTCAATCGCCGCCACGGGGACTAGCTCCAAATCATATCCCGAATCGAGGCGGGCGGCCGTAACGATCGCAAAAAGGCGTCATGCTCGTGGGCGAATGAAACGATGTGTTAAAACTGAATTCACCCAAGACAACCATTTATACCATGCCTTCTGTTGAGTCATCACAAAACATCGTAGGTAGGACTCCAAACAACGATTTACAACCTCGGTTTGGCCATCCGTTTGAGGATGGTATGTAATCTTTGCGGCGAAGTTTCGTTCCCAAGCCTTAAACATTTCCTCCCACAAGAAGCTAGTGAAAATCTTATCCCGATCGACACCACGATGCTTTTAGGTATCCCATGTAAGCGCACCACCTCCTTTATAAAAACACGAGACACGGATAAGGAAGTGAAGGGGTGGCGTAGGGGAATGAAATGCGCATATTTAGACAATCACAACCAACCACGACTAAAATAGTGTCAAACCCCTCCGAACGCGCAAACCCTCAACAAAGTCCATGGAGATGTCCTCCCAAATTTGTTCGGGAATCGGTAATGGTCGTAATAATCCTGCCGGCGATAATGACAGTGTGTTTGGCACGCACACGATCGAACATTCCGCCACAAAGGCGCGAACACGAGCCTTCATCCCTTGCCAATAAACTTCCTTAGCAAGACGCGATATGTTTTTAAGACTCAAAATGTCCACCGACAAATTTCATGGAATTCTAGCAATAATAGAGGATCGTTGGTGACGTCGCAGAATAACACCAATCTGCCTGGTAAAGCAAAACATCGCCTGGCAATCAATACCTGGTGGACCCTCCTCACCGCTTTGAGAGCCGTAAAAATAGGCGATAATTTTGCATCCTCTTTAATCTGCTGCGTAAACACACTTTGTGTTAACCCCGCCACGACACATCGAGCATTCCCAACTCACCCGATAGAGGCATCCGGAAAGAGCATCACCGCTGCTCGATTCTCAAGACCCTTCTTGTACTCAATCTTCGAAATCATACCCCATAAGCTTAGCGATCCATCGATAATAATCACCATCCACCACCCGTTGCTCTAGTAAAAATTTAAGACTTTTACGGTCCGTGCGTACAATAAAATGGTGTCCCAATAAGTACGCCCTCCATCATTTGAGCAGCGAACACGATAGCCATTAACTCCGCTCATAGACAGCTTAACGCGATGAGTGATCGGCAAAGCCCTACTAAAATATGCCATCGGTTGGCCTTGCTTTGCATTAACACCGCACCCACTCCAATTCTGAAGCGTCGGTTTCGACCACAAATCCTGGTTGAAATCCGTGTCGGAGGATCGGAACATCTTTGCTCATGGCATGTTTCATTCTCTGAAGCGATCCTCGGCAGCCTAGCCCCCACTCAAATTTCCCTTCTTGATAGAATCGTGTTAAGGGAAAAGCCATCGATCCATAGTTAGCTACGAATCGCCGTAGTATCCCGTGAGACCTAGGAATCCCCTTAATTCTTTAATATTCCCGGGACTCGGCCATCCCACCATTGCTTCAATCTTTGCCGGGTCCATGACACGCCTTCCTTGAGATATGAAATGCCCCAAGTACTCGATACGGCGCAACCCAAACTGGCATTTCTTGGCGTTGGCCACAAACGCATGTTCAATCAAAATCTCCAAAACCTGAGCCAGGTGCTCCCTATGCTCCCTAATGGTCATACTGTAAATGAGAATGTCATCAAAGAAAACCAACACAAACTTACGCAGATCTGGGGCGCAAAATGTCGTTCATAATAGACTGGAAGGTGGGTGGAGCATTTCTCAATCCGAAAGGCATGACAACGAACTCATAATGGCCTCATGGGGGTTCTGAACGCTGCTTTGTGCACATCCGTAGGATTAAACGAATCTGAATCTGGTAACCGGCCTTCAAATCAATTTTGGAAAAAATCGTCGCTCCATGAAGTTCATCCGAAGCTCATCCACCAAGGGAATGGGATACTTATCTGAGTGCGGTGACATGATTCAGAGCCACAAATCAACACAAAATCGCCAACTACCATCCTTCTTCCTTACCAAAGAAGATCGGACTCGAAAATGCACTCGTGCTAGGGCGGATTATCCTCGCCATAACATTTCTCGCACCAATTTTTCAATCTCATTTTTACGGTATTGCGGATATCGATACGGACGCACATTCACGCGAATCGCTCCCGCCATCAATTCGATGGAGTGATCCTGTTCCTCGGTGGTGGTAATCCTGTCAATGACTCAAACACGGGTGAGCACGAATTAATTAAGGAGTGCGCGATTCACTGGGTACTGAGTTAAGTCGGGAAGGTTTTTCGTCTGACCAGCCTCAGCACCTGTGGTCTCAATCATATTCAACTCCACTAATGACCCCTCGTCCTCGGGTCGTAAGGATTTCATCATAGATTTCAACGACACTGCGCCTTAACCAGCGTGGATCCCCTGTAGTAATTCGGCTTCCCATGAGCCCAATACAAATCGCATCTGCAAGGAGCGGAAATTGAATTCAATCTTCCCAAGGTCTCCAACCAAGCTACCCCTAGGATCACATCAAGACTTCCAGGGGTAAAGGAAGGAAATCGTTAACTACTTTAAGCTCTGCTAAATGGAGTTCCACATTTTTGCAAATTCCTGCTGCCCTTACCGACTTGCCTGTTCCCAGCATAATACCATAATCATGCGATGGTTCCACCTGGAGATTCAAATTTAGCCACGATCACATCGATATAAAGTTATGGGTGGCTCCACCGTCGATAAGAACCACTACCAAGACCTTGAATAGATCCTGTGACTTTCAACGTTTTTGGTGAACTCAACCCCGCCATCGAGTTTAATGACAGTATTTGCTAAGTCCTCACATCCTCTGTGTCACTTGATATCCCCTCATTGTCATTTGCCGAAACTGTCTCTTCGGTAATTCCATCCTCTACAACGCGTATCTCTAGAGCACAGGCTCCTTCTTCTTACAACGGTGCCTGCTCAAATTTCTCATCACAACGAAAACACAATCCTTTATCTTTACGGATCCGAATCTCACTATCCGACAAGCGTTTATAGGGTAATGTGGACGGTGTACTTGTGGTCATCGGCCTGCAGGAAGTTAGGGCTATAGTTCTCAGTTGGTATTTCTGTCGTCCTCGTGCTCGTAGCCCCGATTCCTATCTTTGCTAGTAGATGGCTTAATCTGCTTTGAAGAAGATCCTACCCCTTTTACTTGAGCTCGAAAAACAAGGTCATCCTCAATCACCGAGCCATGAACTTCTTATCCCCGAATACCCACGGTCGCAGCTTCCTCATCTCACTTCGATTTCTTCCTTCAATCCACTTTCCCATTTTCCTCCAACGCACTCGCACTAATATCGCGTATGCCCTTAGCATACTTCTCAAACAGCGCGCCTATACTCTTTCACGGTTCCCACTGTTGTAAACTCATAAGATTTGCATATTTGTTGTCATTGATCGTAGTTTGGAAACGGTGCAACAACAATTCTCGAAACTCCTCCCACGAAGCAATGAAGCACGGTCCTCCTCATCACAGAACCATTCTAGTGCCTCTCCTTCCATACATACACCAAGTCACAGATCCACTCTTTCGTGCCCGCCACACCGATTCACCCGAAATACCTTTCGACGCGGCATAACCATCCGTCCGGATCCTCATCCGTTAGACCCTTGAACACCGGCATTTCCAACTTTCGTAACCTCCGATCAAACATTGGCCCATCCCTACTGCCTATCCCCGGTCCGCCCCTTTCAAATTGATCCTGCCTATCTCCCCAATCCTCGCACTCACTGGCCTCGCGCCATGGATTTCGCCCCGATTGTTCCAACCGTCATGCGCATTCGGCCTTCGTGGCGGCTGCGCACCCTCCGTGCCTCGGGAACCCCGTTTCCCGGTCAAACCCGTCGATCCCAAATTCCGACCTGTTCCGGCCACTGTTCGGCTCAGGCCCGACCTCCCGACCAGACTCGGTCCCCCCACGTCCGGACCGGCCCAGCCCAGCCCGACCTCGCCCGGCCGCGCGCGCGCGCGCGCGCGCCGCCCCGCGCCTGCCTCCACCCGGCCGTCGCTTCACCCGCGCGCACCCGCGCGCCACTTCTCGCGCCCGGCCCGACATCGCCCGGCCGCGCAGCTATCCGCGCGCCCCGCCCTCGCCCGGTCCGACTCCACCCAGCCCGATACCGCCCCAGCCCGCACCGCACCCGCGCGCCCCGCTCGCGCCCGGTCCGACACTGCCCCTGCTCGCGCCGTACCCACGCGCGCCGCCCTCGTCCGGCTCGGTCTCGCCCAGCCCGACACCGCCCCACGCGCGCGTCGCATGCCCTACTCCGCTCACGCGCCAGCGCCATTGTTCCACCCCGACAGCGCCGCTCTGCCCGATGTCGCCTCTGCTTCCTCCAGATTCTCCTCCGCCGTGGCAAAGAATCGCGCCAGCACCTCCCTCCGTCCTCTCGGTCATTAATCCCTTCCCTTTGTCAGCCCGAATCGATTCCATTATTAGTTCTAAATTCCGGGTTATCGACTCAAACTTTAGATCCTGGGAATCTATCTTTTTTCCCAACCGCTCCTCCATCGCTTCCATCTTCCCATTAGTTTCTTGCAACTTCCTCGCCAAGTCGGCCACTCCTTCTTCACATTCTTGTATCCTGGATTCCATCTTCGTAGTAACCATTCCCGGATCGGTAAATGGCTCTGATACCAAAATGATAGAATCTACCACTATTTTATTAAGAACTAAGATAATCTCTCGAGAGGATCTCTCAAAGAGCCCAAATACAAAGACAACAACTGTTTGAATGAATTGCTAAAGGCCTAACATATCCTATATATAGTAAACCTAGAAGATGACCTAATTTAGTAATAACTCTAATAAGGCCCATCAGCCCAATACAACTGAATTACAATTAAAGACTATTAAATATAATTAATAAATACTACTTTCATAATGTCTTGTTCCTCCTCGAGTAAACCTTCAGCGGGCCCGGGTTCGTATCACATAATTCAACAGGTTCTCATATTGGAAAGGTGAGGATTTTACAACACTTTTATAAGGGATATGAGTTACTTCCATCTTAGCCAATTAGTTTTGAGGTGGAATCCCATATTCTCTAATCTAATATCAGAGCTCCACGTGGGTTATTGGTCCCCCAAATTCCAATCCAACCCAAAATGCTTTGAATTCTAAAACGGGTGTGGACCCCACCCAAGCGGGCATAAGTTTAAGCGGGTATTTGATCCCAAAGTTCGGATCTGACCCTAGCAAAAAGTAAGCCAAAGCAGATATTTCGGTCCACAAATTTGGATCCATTTTAAGATGGCCTTTCCCAATATATGGCCACCATCTTGAGCGGGCATATTGGAGATACTCACCTTGTGACAATTGTCTCATATTGAAAATGTGAGGATTCTATAACACCTTTATAAGGGATATGAGTTACTCTCCTCTTAGCTAATTGGTTTTGAGGTGGAACTCCATATTCTCTAATCATATTAGATAAATACGAGGTTCCACCTCAAAACCAATTGGTAATGAAGGGAGTAACTCATATCTCTTATAAATGTGTTAAAACCACAATGTGAGTATCCCCAACACATTCATCATATCATTTAGTGTTGATTTCGGGCTTCTATGTGTATTCATATTTAACGGTTCTCACTTTGATATAATTTAGACCCTGAAAGTACCTATTATGCATTTCTCCAAGTATACTTTTCATCTCACGAACACTAAGTACTGGGATTCTTATATTCAGCAATCGGCGTGAATTTCAGTCCTTTGCATAAACTCCCAAGTATTTTTGTTAGTGCGTCCTACCATCTTACAGTTGCACGTTCTTGGGAATTTAATTTGGCCGTAACTCTGTATTCTGTTTCTATGCAATGACATGAGCTTTTGAAGACAAACTTCTTGTGCTTGAGTTTCAACATAGTTGACGAAACTTGAAATGGAAGACCACTTTTTCTGCTTCTCTATATTCATAGTTGGTTGAACTGCAACTAGAACTAAACCCGACCTGTGTTTTGATTTCCCGACTTTACGCATCGCATCTTATTCCAATCTTACTATGTGTTTTCATAGTTGTTGTCACCCAACCTTGCATGCCTCGAGAACAACATATAGCAGCATTTTTCCACAGTTATGGTCCTGTCCCATGTTTGCAGTTCCAAATTTATCTGAGGATTGCACTCCAATTTTGTCAGAATGCAAAGTAAAAAATGAAAGAAACATTGAAACATTCCTAGTCTGGCCTTCAGTAGAGTTTTGCCCACACTTTGCACCATAAAAGAGATTTATTTTAGAAATGAGCTTCACTAGAAACGCCTAGTCATGACAGACAAAGAAGCTATACATCGTCAACGACTCGACTCCAACATAATTACTCTTATTCATATAGTCGACTCCAACATAATTACTCTACAGCAGTCATCGGTATGAAAACGATGCATCGTTTTCTATCGTATCGAGTTGTTGATCTTCTATTACATCATATATTCCAAAATCTCCCATTGTAATTGTAAGGCTGTGATCGGTCCTTCAGTCATTCATAGTTTACCAAATAAGGAACAAGAGCAATATGCTTGGTTGTTCTCAATAATAAATTTTCATGACTTGGTTGTTTATGTTTATATGCTTGGGGAATATCCTTCATCAATATATTAAACATTTTAAGGATCCATATACTGATGTACTGTATCATGTGTATTGTATAATGTCG
mRNA sequence
AAAATAAAAAAAGAAGAAAGAAAAAAACCACAACATTTTGAGCTTTCTTTCGTGTGCTCTGGTCGACATTTTTGGAGCTCCAAATGATAAGATGGCAAAGCAGGCGAATTCTGTATTCCTCGAAGAATGGTTGAATAGCACCAGTGGAACTAGCAGTGCCCTTAACTTCAAAACCACTTCCTCGTCTGCTCGAGAAATCATCCAAGCATGGGCTGAGCTTAGAAGCTCTTTAGAGCATCAATCGTTCGATGATCGCCACATTCAATCGCTCAAAACTCTTGTCAACTCACAATCTTCACTCTTTGTTGCAGACCCACAAGCTAAGCTGGTGATTTCTATACTCTCTTCTCCCAATTTCCCCCTTCCCCATGAATCGTATCCTCTCTTTTTGAGGGTTCTTTATATTTGGGTCAGAAAATCTCTCCGCCCCTCTTTGCTACTTGTCGATTCATTCGTTGAGGTTCTCTCTCAGATTTTTTCATCCAAAATTGAATTGGGAAAAAACCCTTTGTTCTTCTCGGAAGTGGTTTTGGTTTTGGGTGCTATTTCGTACATTCCTTCAGCTTCAGAAAATTCAAAATTATGCTGTTTGGAGTTGCTTTGCAGGGTTTTGGAAGAAGGGTACCTGCTGGTTGGATCAGTAGGAGGGATGATTCCAGGGTTTCTTGCAGGTATTGGTTATGCTTTATCTTCATCAGTGAATGCTCATGTTGTTAGACTGTTGGATTCTTTATTAGGAATTTGGGGTCAGGTAGGTGGCCCTACGGGCAATCTTTCTAGTGGGTTAATGATCCTGCATTTGATTGAGTGGGTGACTTCTGGTTTGATTACCCTTCATTCTTTTGAGAAATTAGATGTTTTTAGCCGAGCTACTTTAGAGTCTTCAAAGGAAAGTTATGCTTCATTTGCTGTTGTTATGGCTGCAGCTGGAATATTGAGGGCTTTTAATACTTGCAATGTTTTGTTGAGTAGTTCAGAAAGAGAAACGATATCTAGAATAAGGATTTCAGCTCAGGATTGTTTAGAATCTATATCGAGAAATTTTATTTCGACTATGGAAGGATCTTCCGTTTCAGGCAATGACCGTAGTAGGAGCCTGTTTCTATTGTGTATTTCATCTGCAATAGCACGTTGTGGCCCTGTTTCATCTCGCCCACCTGTGCTCATTTGCATTATTTATGCTTTGTTGAGTGAAATATTTCCTTTGCAGCGTTTATATGCCAAAATTCTTGAATTCTCTTCTGTTGAGTCGGGTGCGTTGGGGCTACCTCTAGTAAAAGAGCATCTTGATAGTATTCCTTTTAAGGAAGCAGGGGCCATCACGGGTGTTCTTTGCACTCAGTATGCTTCAATTGACGAAGAGGACAAAAAGATTGTGGAGAATCTTGTATGGGATTACTGTCATGACATCTACTCAAGGCACAGACTAGTAGGGTTGGTGCTTCGTGGCAGAGAGGATGAATTACTAGAGGGTATAGAGAAAATTGCAGAGTCTGCTTTTTTCATGGTTGTAGTATTTGCATTAGCTGTCACAAAAGAAAAGTTGGATTCCAAATGTACACCGGAAAGTCAATTTGACGTATCAGTAAGAATACTTGCTTCATTCTCTTGTATGGAATACTTTCGGCATATTCGCTTGCCAGAATATATGGATACTATCCGAGGGGTTGTTGCAAGCATTCAGGAGAATGAGTCTGCCTGTGTCTCTTTCATTGAATCAATGCCCACATACAAAGATCAAACAAACGGGCCAGATAACTTTATTGGGCAGAAATTAAAATATACATGGACAGAGGATGAGGTGCAAACTGCACGCATGCTGTTTTATATACGAGTCATTCCAACTTGCATCGAACGTGTTCCTACCCAAGTGTATAGGAAAGTGGTTGCTCCAACAATGTTTTTATATATGGGACATCCTAATGCCAAAGTAGCCCGAGCTTCACACTCGGTGTTTATAGCTTTCATATCAGGGAAGGATGACAGTGAAGAAGAAAAGAGAGTGATATTGAAGGAGGAGCTTGTTTTCTACTACATTGAGAGATCTTTATCAGTATCACTCTTAACAAGATTTTTTTTTCTTTATTTATGTGATCAAACTTTAGATTTACAATTTGAGTTAGATATTGAGCATTCAAATTCTATGATTTTAGGGATATCCTGGCATTACACCATTTGAAGGTATGGCTTCAGGTGTTGCAGCTTTGGTGCGATATCTTCCTGCAGGAAGTCCAGCCATCTTTTATTGTATCGAAAGTCTTACTGTAAAAGCTACAAGCCTTTGCAGTGAAAACTTCATGGAAAATGCTGATCTGTGGAAGTCGTGGCAGGGAGACTTAGAGCCTTCCAAGAAAATTCTTGATATGCTTTTACGGCTCATTTCTCTTGTTGATATACAGGTACTCCCAAGCTTGATGAAAAATTTAGCACAACTGATCATCAAGCTACCGGCCGAGGGCCAAAATATGGTTCTTGATCAGTTATACTCCCTGGTTTCAGAAGCTGATGATGTCACACGCAAACCCTGGCTAGTTTCATGGTTACAATCATTATCATATCTATGCTGCCAATCTAAAAGTGGAGATACACACTCTAATGAGATGCAAAATATACGGCTTTTAAACTTCTCTTGGATTGTTGACCCATTGAACCGTTTTCGATCCTATGCACGACTCTGAGTTGTTGTCACCCAACCTTGCATGCCTCGAGAACAACATATAGCAGCATTTTTCCACAGTTATGGTCCTGTCCCATGTTTGCAGTTCCAAATTTATCTGAGGATTGCACTCCAATTTTGTCAGAATGCAAAGTAAAAAATGAAAGAAACATTGAAACATTCCTAGTCTGGCCTTCAGTAGAGTTTTGCCCACACTTTGCACCATAAAAGAGATTTATTTTAGAAATGAGCTTCACTAGAAACGCCTAGTCATGACAGACAAAGAAGCTATACATCGTCAACGACTCGACTCCAACATAATTACTCTTATTCATATAGTCGACTCCAACATAATTACTCTACAGCAGTCATCGGTATGAAAACGATGCATCGTTTTCTATCGTATCGAGTTGTTGATCTTCTATTACATCATATATTCCAAAATCTCCCATTGTAATTGTAAGGCTGTGATCGGTCCTTCAGTCATTCATAGTTTACCAAATAAGGAACAAGAGCAATATGCTTGGTTGTTCTCAATAATAAATTTTCATGACTTGGTTGTTTATGTTTATATGCTTGGGGAATATCCTTCATCAATATATTAAACATTTTAAGGATCCATATACTGATGTACTGTATCATGTGTATTGTATAATGTCG
Coding sequence (CDS)
ATGGCAAAGCAGGCGAATTCTGTATTCCTCGAAGAATGGTTGAATAGCACCAGTGGAACTAGCAGTGCCCTTAACTTCAAAACCACTTCCTCGTCTGCTCGAGAAATCATCCAAGCATGGGCTGAGCTTAGAAGCTCTTTAGAGCATCAATCGTTCGATGATCGCCACATTCAATCGCTCAAAACTCTTGTCAACTCACAATCTTCACTCTTTGTTGCAGACCCACAAGCTAAGCTGGTGATTTCTATACTCTCTTCTCCCAATTTCCCCCTTCCCCATGAATCGTATCCTCTCTTTTTGAGGGTTCTTTATATTTGGGTCAGAAAATCTCTCCGCCCCTCTTTGCTACTTGTCGATTCATTCGTTGAGGTTCTCTCTCAGATTTTTTCATCCAAAATTGAATTGGGAAAAAACCCTTTGTTCTTCTCGGAAGTGGTTTTGGTTTTGGGTGCTATTTCGTACATTCCTTCAGCTTCAGAAAATTCAAAATTATGCTGTTTGGAGTTGCTTTGCAGGGTTTTGGAAGAAGGGTACCTGCTGGTTGGATCAGTAGGAGGGATGATTCCAGGGTTTCTTGCAGGTATTGGTTATGCTTTATCTTCATCAGTGAATGCTCATGTTGTTAGACTGTTGGATTCTTTATTAGGAATTTGGGGTCAGGTAGGTGGCCCTACGGGCAATCTTTCTAGTGGGTTAATGATCCTGCATTTGATTGAGTGGGTGACTTCTGGTTTGATTACCCTTCATTCTTTTGAGAAATTAGATGTTTTTAGCCGAGCTACTTTAGAGTCTTCAAAGGAAAGTTATGCTTCATTTGCTGTTGTTATGGCTGCAGCTGGAATATTGAGGGCTTTTAATACTTGCAATGTTTTGTTGAGTAGTTCAGAAAGAGAAACGATATCTAGAATAAGGATTTCAGCTCAGGATTGTTTAGAATCTATATCGAGAAATTTTATTTCGACTATGGAAGGATCTTCCGTTTCAGGCAATGACCGTAGTAGGAGCCTGTTTCTATTGTGTATTTCATCTGCAATAGCACGTTGTGGCCCTGTTTCATCTCGCCCACCTGTGCTCATTTGCATTATTTATGCTTTGTTGAGTGAAATATTTCCTTTGCAGCGTTTATATGCCAAAATTCTTGAATTCTCTTCTGTTGAGTCGGGTGCGTTGGGGCTACCTCTAGTAAAAGAGCATCTTGATAGTATTCCTTTTAAGGAAGCAGGGGCCATCACGGGTGTTCTTTGCACTCAGTATGCTTCAATTGACGAAGAGGACAAAAAGATTGTGGAGAATCTTGTATGGGATTACTGTCATGACATCTACTCAAGGCACAGACTAGTAGGGTTGGTGCTTCGTGGCAGAGAGGATGAATTACTAGAGGGTATAGAGAAAATTGCAGAGTCTGCTTTTTTCATGGTTGTAGTATTTGCATTAGCTGTCACAAAAGAAAAGTTGGATTCCAAATGTACACCGGAAAGTCAATTTGACGTATCAGTAAGAATACTTGCTTCATTCTCTTGTATGGAATACTTTCGGCATATTCGCTTGCCAGAATATATGGATACTATCCGAGGGGTTGTTGCAAGCATTCAGGAGAATGAGTCTGCCTGTGTCTCTTTCATTGAATCAATGCCCACATACAAAGATCAAACAAACGGGCCAGATAACTTTATTGGGCAGAAATTAAAATATACATGGACAGAGGATGAGGTGCAAACTGCACGCATGCTGTTTTATATACGAGTCATTCCAACTTGCATCGAACGTGTTCCTACCCAAGTGTATAGGAAAGTGGTTGCTCCAACAATGTTTTTATATATGGGACATCCTAATGCCAAAGTAGCCCGAGCTTCACACTCGGTGTTTATAGCTTTCATATCAGGGAAGGATGACAGTGAAGAAGAAAAGAGAGTGATATTGAAGGAGGAGCTTGTTTTCTACTACATTGAGAGATCTTTATCAGTATCACTCTTAACAAGATTTTTTTTTCTTTATTTATGTGATCAAACTTTAGATTTACAATTTGAGTTAGATATTGAGCATTCAAATTCTATGATTTTAGGGATATCCTGGCATTACACCATTTGA
Protein sequence
MAKQANSVFLEEWLNSTSGTSSALNFKTTSSSAREIIQAWAELRSSLEHQSFDDRHIQSLKTLVNSQSSLFVADPQAKLVISILSSPNFPLPHESYPLFLRVLYIWVRKSLRPSLLLVDSFVEVLSQIFSSKIELGKNPLFFSEVVLVLGAISYIPSASENSKLCCLELLCRVLEEGYLLVGSVGGMIPGFLAGIGYALSSSVNAHVVRLLDSLLGIWGQVGGPTGNLSSGLMILHLIEWVTSGLITLHSFEKLDVFSRATLESSKESYASFAVVMAAAGILRAFNTCNVLLSSSERETISRIRISAQDCLESISRNFISTMEGSSVSGNDRSRSLFLLCISSAIARCGPVSSRPPVLICIIYALLSEIFPLQRLYAKILEFSSVESGALGLPLVKEHLDSIPFKEAGAITGVLCTQYASIDEEDKKIVENLVWDYCHDIYSRHRLVGLVLRGREDELLEGIEKIAESAFFMVVVFALAVTKEKLDSKCTPESQFDVSVRILASFSCMEYFRHIRLPEYMDTIRGVVASIQENESACVSFIESMPTYKDQTNGPDNFIGQKLKYTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKVVAPTMFLYMGHPNAKVARASHSVFIAFISGKDDSEEEKRVILKEELVFYYIERSLSVSLLTRFFFLYLCDQTLDLQFELDIEHSNSMILGISWHYTI
Homology
BLAST of Sed0002136.6 vs. NCBI nr
Match:
XP_038903923.1 (uncharacterized protein LOC120090375 isoform X2 [Benincasa hispida])
HSP 1 Score: 1067.8 bits (2760), Expect = 4.1e-308
Identity = 561/654 (85.78%), Postives = 595/654 (90.98%), Query Frame = 0
Query: 1 MAKQANSVFLEEWLNSTSGTSSALNFKTTSSSAREIIQAWAELRSSLEHQSFDDRHIQSL 60
MAKQ++S+FLEEWL S GT ALN K TSSSAREIIQAWAELRSSLEHQSFDDRHIQSL
Sbjct: 1 MAKQSSSLFLEEWLKSIGGT--ALNSKLTSSSAREIIQAWAELRSSLEHQSFDDRHIQSL 60
Query: 61 KTLVNSQSSLFVADPQAKLVISILSSPNFPLPHESYPLFLRVLYIWVRKSLRPSLLLVDS 120
K LVNSQSSL+VADPQAKLVISILSSPNF +P ESYPLFLR+LYIWVRKSLRPSL+LVDS
Sbjct: 61 KILVNSQSSLYVADPQAKLVISILSSPNFSIPDESYPLFLRILYIWVRKSLRPSLVLVDS 120
Query: 121 FVEVLSQIFSSKIELGKNPLFFSEVVLVLGAISYIPSASENSKLCCLELLCRVLEEGYLL 180
VEVLS IFSSKIEL KNPLFFSE VLVLGAISY+ SASE SKLCCLELLCRVLEE YLL
Sbjct: 121 SVEVLSHIFSSKIELRKNPLFFSEGVLVLGAISYLLSASEKSKLCCLELLCRVLEEEYLL 180
Query: 181 VGSVGGMIPGFLAGIGYALSSSVNAHVVRLLDSLLGIWGQVGGPTGNLSSGLMILHLIEW 240
VGSVG +IP FLAGIGYALSSSVNAHVVRLLDSLLGIWG +GGP LSSGLMILH+IEW
Sbjct: 181 VGSVGEIIPEFLAGIGYALSSSVNAHVVRLLDSLLGIWGNIGGPIDTLSSGLMILHMIEW 240
Query: 241 VTSGLITLHSFEKLDVFSRATLESSKESYASFAVVMAAAGILRAFNTCNVLLSSSERETI 300
VTSG+I+LHSFEKLDVFS+A L SSKESYASFAVVMAAAGILRAFNT LLSSSERETI
Sbjct: 241 VTSGMISLHSFEKLDVFSQAILVSSKESYASFAVVMAAAGILRAFNTQKGLLSSSERETI 300
Query: 301 SRIRISAQDCLESISRNFISTMEGSSVSGNDRSRSLFLLCISSAIARCGPVSSRPPVLIC 360
SRIRISAQDCLESI+RNFISTMEGSS++GND RS+ LLCIS AIARCGPVSS PPVLIC
Sbjct: 301 SRIRISAQDCLESIARNFISTMEGSSITGNDHRRSVLLLCISLAIARCGPVSSCPPVLIC 360
Query: 361 IIYALLSEIFPLQRLYAKILEFSSVESGALGLPLVKEHLDSIPFKEAGAITGVLCTQYAS 420
++YALL+EIFPLQRLYAKI EFS E GALGL LV EHL SIPFKEAGAITGV C+QYA+
Sbjct: 361 VVYALLTEIFPLQRLYAKINEFSFAELGALGLTLVNEHLGSIPFKEAGAITGVFCSQYAT 420
Query: 421 IDEEDKKIVENLVWDYCHDIYSRHRLVGLVLRGREDELLEGIEKIAESAFFMVVVFALAV 480
++EEDK VENLVWDYC D+YSRHRL GLVLRGREDELLE IEKIAESAF MVVVFALAV
Sbjct: 421 LEEEDKSFVENLVWDYCQDVYSRHRLAGLVLRGREDELLENIEKIAESAFLMVVVFALAV 480
Query: 481 TKEKLDSKCTPESQFDVSVRILASFSCMEYFRHIRLPEYMDTIRGVVASIQENESACVSF 540
TKEKLDSK T ESQFD+SVRIL SFSCMEYFR IRLPEYMDTIRGVVASIQ NESACVSF
Sbjct: 481 TKEKLDSKYTLESQFDISVRILVSFSCMEYFRRIRLPEYMDTIRGVVASIQGNESACVSF 540
Query: 541 IESMPTYKDQTNGPDNFIGQKLKYTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKVVA 600
IESMPTY+DQTNGPDN IG+ KY+WT+DEVQTARMLFY+RVIPTCIERVPTQVY KVVA
Sbjct: 541 IESMPTYQDQTNGPDNSIGRITKYSWTKDEVQTARMLFYVRVIPTCIERVPTQVYGKVVA 600
Query: 601 PTMFLYMGHPNAKVARASHSVFIAFISGKDDSEEEKRVILKEELVFYYIERSLS 655
PTMFLYMGHPNAKVARASHSVFIAF+SGKDD +EKRV LKEELVFYYIERSLS
Sbjct: 601 PTMFLYMGHPNAKVARASHSVFIAFMSGKDDLGDEKRVTLKEELVFYYIERSLS 652
BLAST of Sed0002136.6 vs. NCBI nr
Match:
XP_038903921.1 (uncharacterized protein LOC120090375 isoform X1 [Benincasa hispida])
HSP 1 Score: 1067.8 bits (2760), Expect = 4.1e-308
Identity = 561/654 (85.78%), Postives = 595/654 (90.98%), Query Frame = 0
Query: 1 MAKQANSVFLEEWLNSTSGTSSALNFKTTSSSAREIIQAWAELRSSLEHQSFDDRHIQSL 60
MAKQ++S+FLEEWL S GT ALN K TSSSAREIIQAWAELRSSLEHQSFDDRHIQSL
Sbjct: 1 MAKQSSSLFLEEWLKSIGGT--ALNSKLTSSSAREIIQAWAELRSSLEHQSFDDRHIQSL 60
Query: 61 KTLVNSQSSLFVADPQAKLVISILSSPNFPLPHESYPLFLRVLYIWVRKSLRPSLLLVDS 120
K LVNSQSSL+VADPQAKLVISILSSPNF +P ESYPLFLR+LYIWVRKSLRPSL+LVDS
Sbjct: 61 KILVNSQSSLYVADPQAKLVISILSSPNFSIPDESYPLFLRILYIWVRKSLRPSLVLVDS 120
Query: 121 FVEVLSQIFSSKIELGKNPLFFSEVVLVLGAISYIPSASENSKLCCLELLCRVLEEGYLL 180
VEVLS IFSSKIEL KNPLFFSE VLVLGAISY+ SASE SKLCCLELLCRVLEE YLL
Sbjct: 121 SVEVLSHIFSSKIELRKNPLFFSEGVLVLGAISYLLSASEKSKLCCLELLCRVLEEEYLL 180
Query: 181 VGSVGGMIPGFLAGIGYALSSSVNAHVVRLLDSLLGIWGQVGGPTGNLSSGLMILHLIEW 240
VGSVG +IP FLAGIGYALSSSVNAHVVRLLDSLLGIWG +GGP LSSGLMILH+IEW
Sbjct: 181 VGSVGEIIPEFLAGIGYALSSSVNAHVVRLLDSLLGIWGNIGGPIDTLSSGLMILHMIEW 240
Query: 241 VTSGLITLHSFEKLDVFSRATLESSKESYASFAVVMAAAGILRAFNTCNVLLSSSERETI 300
VTSG+I+LHSFEKLDVFS+A L SSKESYASFAVVMAAAGILRAFNT LLSSSERETI
Sbjct: 241 VTSGMISLHSFEKLDVFSQAILVSSKESYASFAVVMAAAGILRAFNTQKGLLSSSERETI 300
Query: 301 SRIRISAQDCLESISRNFISTMEGSSVSGNDRSRSLFLLCISSAIARCGPVSSRPPVLIC 360
SRIRISAQDCLESI+RNFISTMEGSS++GND RS+ LLCIS AIARCGPVSS PPVLIC
Sbjct: 301 SRIRISAQDCLESIARNFISTMEGSSITGNDHRRSVLLLCISLAIARCGPVSSCPPVLIC 360
Query: 361 IIYALLSEIFPLQRLYAKILEFSSVESGALGLPLVKEHLDSIPFKEAGAITGVLCTQYAS 420
++YALL+EIFPLQRLYAKI EFS E GALGL LV EHL SIPFKEAGAITGV C+QYA+
Sbjct: 361 VVYALLTEIFPLQRLYAKINEFSFAELGALGLTLVNEHLGSIPFKEAGAITGVFCSQYAT 420
Query: 421 IDEEDKKIVENLVWDYCHDIYSRHRLVGLVLRGREDELLEGIEKIAESAFFMVVVFALAV 480
++EEDK VENLVWDYC D+YSRHRL GLVLRGREDELLE IEKIAESAF MVVVFALAV
Sbjct: 421 LEEEDKSFVENLVWDYCQDVYSRHRLAGLVLRGREDELLENIEKIAESAFLMVVVFALAV 480
Query: 481 TKEKLDSKCTPESQFDVSVRILASFSCMEYFRHIRLPEYMDTIRGVVASIQENESACVSF 540
TKEKLDSK T ESQFD+SVRIL SFSCMEYFR IRLPEYMDTIRGVVASIQ NESACVSF
Sbjct: 481 TKEKLDSKYTLESQFDISVRILVSFSCMEYFRRIRLPEYMDTIRGVVASIQGNESACVSF 540
Query: 541 IESMPTYKDQTNGPDNFIGQKLKYTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKVVA 600
IESMPTY+DQTNGPDN IG+ KY+WT+DEVQTARMLFY+RVIPTCIERVPTQVY KVVA
Sbjct: 541 IESMPTYQDQTNGPDNSIGRITKYSWTKDEVQTARMLFYVRVIPTCIERVPTQVYGKVVA 600
Query: 601 PTMFLYMGHPNAKVARASHSVFIAFISGKDDSEEEKRVILKEELVFYYIERSLS 655
PTMFLYMGHPNAKVARASHSVFIAF+SGKDD +EKRV LKEELVFYYIERSLS
Sbjct: 601 PTMFLYMGHPNAKVARASHSVFIAFMSGKDDLGDEKRVTLKEELVFYYIERSLS 652
BLAST of Sed0002136.6 vs. NCBI nr
Match:
XP_022944217.1 (uncharacterized protein LOC111448717 isoform X2 [Cucurbita moschata])
HSP 1 Score: 1067.4 bits (2759), Expect = 5.3e-308
Identity = 556/656 (84.76%), Postives = 600/656 (91.46%), Query Frame = 0
Query: 1 MAKQANSVFLEEWLNSTSGTSSALNFKTTSSSAREIIQAWAELRSSLEHQSFDDRHIQSL 60
MAKQANSVFLEEWL S SG SS N K +SSSAREIIQAWAELRSSLEH+ FDDRHIQSL
Sbjct: 1 MAKQANSVFLEEWLKSISGISSGFNSKISSSSAREIIQAWAELRSSLEHRLFDDRHIQSL 60
Query: 61 KTLVNSQSSLFVADPQAKLVISILSSPNFPLPHESYPLFLRVLYIWVRKSLRPSLLLVDS 120
KTLVNSQSSL+VADPQAKLV+SILSSPN LP ESYPLFLR+LYIWVRKSLRPSL+LVDS
Sbjct: 61 KTLVNSQSSLYVADPQAKLVVSILSSPNLSLPDESYPLFLRILYIWVRKSLRPSLVLVDS 120
Query: 121 FVEVLSQIFSSKIELGKNPLFFSEVVLVLGAISYIPSASENSKLCCLELLCRVL-EEGYL 180
VEVLSQIFSSKI L KNPLF SE VL+LGAISY+ SASE SKLCCLELLCR+L EE +L
Sbjct: 121 SVEVLSQIFSSKIGLRKNPLFISEGVLILGAISYVVSASEKSKLCCLELLCRILEEEEWL 180
Query: 181 LVGSVGGMIPGFLAGIGYALSSSVNAHVVRLLDSLLGIWGQVGGPTGNLSSGLMILHLIE 240
L+GSVGG +P F AGIGYALSSS+NAHVVRLLDSLLGIWG++G PTGNLS+GLMILHLIE
Sbjct: 181 LIGSVGGTVPEFFAGIGYALSSSLNAHVVRLLDSLLGIWGKIGSPTGNLSTGLMILHLIE 240
Query: 241 WVTSGLITLHSFEKLDVFSRATLESSKESYASFAVVMAAAGILRAFNTCNVLLSSSERET 300
WVTSGLI+LHSF+KL+ S+ LESSKESYASFAVVMAAAGILRAFN+ LLSSSERET
Sbjct: 241 WVTSGLISLHSFKKLNFLSQTALESSKESYASFAVVMAAAGILRAFNSYKALLSSSERET 300
Query: 301 ISRIRISAQDCLESISRNFISTMEGSSVSGN-DRSRSLFLLCISSAIARCGPVSSRPPVL 360
ISRIRISAQDCLESI++NFISTMEGSS++GN D RSL LLCIS A+ARCGPV+SRPPVL
Sbjct: 301 ISRIRISAQDCLESIAKNFISTMEGSSITGNDDHGRSLLLLCISLAVARCGPVASRPPVL 360
Query: 361 ICIIYALLSEIFPLQRLYAKILEFSSVESGALGLPLVKEHLDSIPFKEAGAITGVLCTQY 420
IC+ YALL+EIFPLQRLYAK+L+FS ESG LGL LVKEHLDSIPFKEAG I GVLC+QY
Sbjct: 361 ICVTYALLTEIFPLQRLYAKLLKFSFGESGVLGLTLVKEHLDSIPFKEAGVIAGVLCSQY 420
Query: 421 ASIDEEDKKIVENLVWDYCHDIYSRHRLVGLVLRGREDELLEGIEKIAESAFFMVVVFAL 480
ASIDE+DKK VENLVWDYC DIYSRHR VGLVLR REDELLE IEKIAESAF MVVVFAL
Sbjct: 421 ASIDEDDKKFVENLVWDYCQDIYSRHRRVGLVLRHREDELLENIEKIAESAFLMVVVFAL 480
Query: 481 AVTKEKLDSKCTPESQFDVSVRILASFSCMEYFRHIRLPEYMDTIRGVVASIQENESACV 540
AVTKEKL+SK TPE+QFDVSVRIL SFSCMEYFR IR+PEYMDTIRGVVAS+QENESACV
Sbjct: 481 AVTKEKLNSKYTPETQFDVSVRILDSFSCMEYFRRIRMPEYMDTIRGVVASVQENESACV 540
Query: 541 SFIESMPTYKDQTNGPDNFIGQKLKYTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKV 600
SFIESMP+Y+DQT+GPD+ IGQKL+YTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKV
Sbjct: 541 SFIESMPSYQDQTHGPDSSIGQKLQYTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKV 600
Query: 601 VAPTMFLYMGHPNAKVARASHSVFIAFISGKDDSEEEKRVILKEELVFYYIERSLS 655
VAPTMFLYMGHPNAKVARASHSVFIAFISGKDD E+ RV+LKEELVFYYIERSLS
Sbjct: 601 VAPTMFLYMGHPNAKVARASHSVFIAFISGKDDDEDGNRVMLKEELVFYYIERSLS 656
BLAST of Sed0002136.6 vs. NCBI nr
Match:
XP_022944201.1 (uncharacterized protein LOC111448717 isoform X1 [Cucurbita moschata] >XP_022944209.1 uncharacterized protein LOC111448717 isoform X1 [Cucurbita moschata])
HSP 1 Score: 1067.4 bits (2759), Expect = 5.3e-308
Identity = 556/656 (84.76%), Postives = 600/656 (91.46%), Query Frame = 0
Query: 1 MAKQANSVFLEEWLNSTSGTSSALNFKTTSSSAREIIQAWAELRSSLEHQSFDDRHIQSL 60
MAKQANSVFLEEWL S SG SS N K +SSSAREIIQAWAELRSSLEH+ FDDRHIQSL
Sbjct: 1 MAKQANSVFLEEWLKSISGISSGFNSKISSSSAREIIQAWAELRSSLEHRLFDDRHIQSL 60
Query: 61 KTLVNSQSSLFVADPQAKLVISILSSPNFPLPHESYPLFLRVLYIWVRKSLRPSLLLVDS 120
KTLVNSQSSL+VADPQAKLV+SILSSPN LP ESYPLFLR+LYIWVRKSLRPSL+LVDS
Sbjct: 61 KTLVNSQSSLYVADPQAKLVVSILSSPNLSLPDESYPLFLRILYIWVRKSLRPSLVLVDS 120
Query: 121 FVEVLSQIFSSKIELGKNPLFFSEVVLVLGAISYIPSASENSKLCCLELLCRVL-EEGYL 180
VEVLSQIFSSKI L KNPLF SE VL+LGAISY+ SASE SKLCCLELLCR+L EE +L
Sbjct: 121 SVEVLSQIFSSKIGLRKNPLFISEGVLILGAISYVVSASEKSKLCCLELLCRILEEEEWL 180
Query: 181 LVGSVGGMIPGFLAGIGYALSSSVNAHVVRLLDSLLGIWGQVGGPTGNLSSGLMILHLIE 240
L+GSVGG +P F AGIGYALSSS+NAHVVRLLDSLLGIWG++G PTGNLS+GLMILHLIE
Sbjct: 181 LIGSVGGTVPEFFAGIGYALSSSLNAHVVRLLDSLLGIWGKIGSPTGNLSTGLMILHLIE 240
Query: 241 WVTSGLITLHSFEKLDVFSRATLESSKESYASFAVVMAAAGILRAFNTCNVLLSSSERET 300
WVTSGLI+LHSF+KL+ S+ LESSKESYASFAVVMAAAGILRAFN+ LLSSSERET
Sbjct: 241 WVTSGLISLHSFKKLNFLSQTALESSKESYASFAVVMAAAGILRAFNSYKALLSSSERET 300
Query: 301 ISRIRISAQDCLESISRNFISTMEGSSVSGN-DRSRSLFLLCISSAIARCGPVSSRPPVL 360
ISRIRISAQDCLESI++NFISTMEGSS++GN D RSL LLCIS A+ARCGPV+SRPPVL
Sbjct: 301 ISRIRISAQDCLESIAKNFISTMEGSSITGNDDHGRSLLLLCISLAVARCGPVASRPPVL 360
Query: 361 ICIIYALLSEIFPLQRLYAKILEFSSVESGALGLPLVKEHLDSIPFKEAGAITGVLCTQY 420
IC+ YALL+EIFPLQRLYAK+L+FS ESG LGL LVKEHLDSIPFKEAG I GVLC+QY
Sbjct: 361 ICVTYALLTEIFPLQRLYAKLLKFSFGESGVLGLTLVKEHLDSIPFKEAGVIAGVLCSQY 420
Query: 421 ASIDEEDKKIVENLVWDYCHDIYSRHRLVGLVLRGREDELLEGIEKIAESAFFMVVVFAL 480
ASIDE+DKK VENLVWDYC DIYSRHR VGLVLR REDELLE IEKIAESAF MVVVFAL
Sbjct: 421 ASIDEDDKKFVENLVWDYCQDIYSRHRRVGLVLRHREDELLENIEKIAESAFLMVVVFAL 480
Query: 481 AVTKEKLDSKCTPESQFDVSVRILASFSCMEYFRHIRLPEYMDTIRGVVASIQENESACV 540
AVTKEKL+SK TPE+QFDVSVRIL SFSCMEYFR IR+PEYMDTIRGVVAS+QENESACV
Sbjct: 481 AVTKEKLNSKYTPETQFDVSVRILDSFSCMEYFRRIRMPEYMDTIRGVVASVQENESACV 540
Query: 541 SFIESMPTYKDQTNGPDNFIGQKLKYTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKV 600
SFIESMP+Y+DQT+GPD+ IGQKL+YTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKV
Sbjct: 541 SFIESMPSYQDQTHGPDSSIGQKLQYTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKV 600
Query: 601 VAPTMFLYMGHPNAKVARASHSVFIAFISGKDDSEEEKRVILKEELVFYYIERSLS 655
VAPTMFLYMGHPNAKVARASHSVFIAFISGKDD E+ RV+LKEELVFYYIERSLS
Sbjct: 601 VAPTMFLYMGHPNAKVARASHSVFIAFISGKDDDEDGNRVMLKEELVFYYIERSLS 656
BLAST of Sed0002136.6 vs. NCBI nr
Match:
XP_023005293.1 (uncharacterized protein LOC111498339 isoform X1 [Cucurbita maxima] >XP_023005295.1 uncharacterized protein LOC111498339 isoform X1 [Cucurbita maxima] >XP_023005296.1 uncharacterized protein LOC111498339 isoform X1 [Cucurbita maxima])
HSP 1 Score: 1064.3 bits (2751), Expect = 4.5e-307
Identity = 557/656 (84.91%), Postives = 598/656 (91.16%), Query Frame = 0
Query: 1 MAKQANSVFLEEWLNSTSGTSSALNFKTTSSSAREIIQAWAELRSSLEHQSFDDRHIQSL 60
MAKQANSVFLEEWL S SG SS N K +SSSAREIIQAWAELRSSLEHQ FDDRHIQSL
Sbjct: 1 MAKQANSVFLEEWLKSISGISSGFNSKISSSSAREIIQAWAELRSSLEHQLFDDRHIQSL 60
Query: 61 KTLVNSQSSLFVADPQAKLVISILSSPNFPLPHESYPLFLRVLYIWVRKSLRPSLLLVDS 120
KTLVNSQSSL+VADPQAKLVISILSSPN LP ESYPLFLR+LYIWVRKSLRPSL+LVDS
Sbjct: 61 KTLVNSQSSLYVADPQAKLVISILSSPNLSLPDESYPLFLRILYIWVRKSLRPSLVLVDS 120
Query: 121 FVEVLSQIFSSKIELGKNPLFFSEVVLVLGAISYIPSASENSKLCCLELLCRVL-EEGYL 180
VE+LSQIFSSKI L KNPLF SE VL+LGAISY+ SASE KLCCLELLCR+L EE +L
Sbjct: 121 SVEILSQIFSSKIGLRKNPLFISEGVLILGAISYVVSASEKFKLCCLELLCRILEEEEWL 180
Query: 181 LVGSVGGMIPGFLAGIGYALSSSVNAHVVRLLDSLLGIWGQVGGPTGNLSSGLMILHLIE 240
L+GSVGG +P F AGIGYALSSSVNAHVVRLLDSLLGIWG++G PTGNLS+GLMILHLIE
Sbjct: 181 LIGSVGGTVPEFFAGIGYALSSSVNAHVVRLLDSLLGIWGKIGSPTGNLSTGLMILHLIE 240
Query: 241 WVTSGLITLHSFEKLDVFSRATLESSKESYASFAVVMAAAGILRAFNTCNVLLSSSERET 300
WVTSGLI+LHSF+KLD S+A LESSKESYASFAVVMAAAGILRAFN+ LLSSSERET
Sbjct: 241 WVTSGLISLHSFKKLDFLSQAALESSKESYASFAVVMAAAGILRAFNSYKALLSSSERET 300
Query: 301 ISRIRISAQDCLESISRNFISTMEGSSVSGN-DRSRSLFLLCISSAIARCGPVSSRPPVL 360
ISRIRISAQDCLESI++NFISTMEGSS++GN D RSL LLCIS A+ARCGPV+SRPPVL
Sbjct: 301 ISRIRISAQDCLESIAKNFISTMEGSSITGNDDHGRSLLLLCISLAVARCGPVASRPPVL 360
Query: 361 ICIIYALLSEIFPLQRLYAKILEFSSVESGALGLPLVKEHLDSIPFKEAGAITGVLCTQY 420
IC+ YALL+EIFPLQRLYAK+LEFS ESG LGL LVKEHLDSIPFKEAG I GVLC+QY
Sbjct: 361 ICVTYALLTEIFPLQRLYAKLLEFSFGESGVLGLSLVKEHLDSIPFKEAGVIAGVLCSQY 420
Query: 421 ASIDEEDKKIVENLVWDYCHDIYSRHRLVGLVLRGREDELLEGIEKIAESAFFMVVVFAL 480
ASIDE+DKKIVENLVWDYC DIYSRHR VGLVLR REDELLE IEKIAESAF MVVVFAL
Sbjct: 421 ASIDEDDKKIVENLVWDYCQDIYSRHRRVGLVLRHREDELLENIEKIAESAFLMVVVFAL 480
Query: 481 AVTKEKLDSKCTPESQFDVSVRILASFSCMEYFRHIRLPEYMDTIRGVVASIQENESACV 540
AVTKEKL+SK T E+QFDVSVRIL SFSCMEYFR IR+PEYMDTIRGVVAS+QENESACV
Sbjct: 481 AVTKEKLNSKYTLETQFDVSVRILNSFSCMEYFRRIRMPEYMDTIRGVVASVQENESACV 540
Query: 541 SFIESMPTYKDQTNGPDNFIGQKLKYTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKV 600
SFIESMP+Y+DQT+GPD+ IGQKL+Y WTEDEVQTARMLFYIRVIPTCIE VPTQVYRKV
Sbjct: 541 SFIESMPSYQDQTHGPDSSIGQKLQYIWTEDEVQTARMLFYIRVIPTCIELVPTQVYRKV 600
Query: 601 VAPTMFLYMGHPNAKVARASHSVFIAFISGKDDSEEEKRVILKEELVFYYIERSLS 655
VAPTMFLYMGHPN+KVARASHSVFIAFISGKDD E+ RV+LKEELVFYYIERSLS
Sbjct: 601 VAPTMFLYMGHPNSKVARASHSVFIAFISGKDDGEDGNRVMLKEELVFYYIERSLS 656
BLAST of Sed0002136.6 vs. ExPASy TrEMBL
Match:
A0A6J1FWB1 (uncharacterized protein LOC111448717 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111448717 PE=4 SV=1)
HSP 1 Score: 1067.4 bits (2759), Expect = 2.6e-308
Identity = 556/656 (84.76%), Postives = 600/656 (91.46%), Query Frame = 0
Query: 1 MAKQANSVFLEEWLNSTSGTSSALNFKTTSSSAREIIQAWAELRSSLEHQSFDDRHIQSL 60
MAKQANSVFLEEWL S SG SS N K +SSSAREIIQAWAELRSSLEH+ FDDRHIQSL
Sbjct: 1 MAKQANSVFLEEWLKSISGISSGFNSKISSSSAREIIQAWAELRSSLEHRLFDDRHIQSL 60
Query: 61 KTLVNSQSSLFVADPQAKLVISILSSPNFPLPHESYPLFLRVLYIWVRKSLRPSLLLVDS 120
KTLVNSQSSL+VADPQAKLV+SILSSPN LP ESYPLFLR+LYIWVRKSLRPSL+LVDS
Sbjct: 61 KTLVNSQSSLYVADPQAKLVVSILSSPNLSLPDESYPLFLRILYIWVRKSLRPSLVLVDS 120
Query: 121 FVEVLSQIFSSKIELGKNPLFFSEVVLVLGAISYIPSASENSKLCCLELLCRVL-EEGYL 180
VEVLSQIFSSKI L KNPLF SE VL+LGAISY+ SASE SKLCCLELLCR+L EE +L
Sbjct: 121 SVEVLSQIFSSKIGLRKNPLFISEGVLILGAISYVVSASEKSKLCCLELLCRILEEEEWL 180
Query: 181 LVGSVGGMIPGFLAGIGYALSSSVNAHVVRLLDSLLGIWGQVGGPTGNLSSGLMILHLIE 240
L+GSVGG +P F AGIGYALSSS+NAHVVRLLDSLLGIWG++G PTGNLS+GLMILHLIE
Sbjct: 181 LIGSVGGTVPEFFAGIGYALSSSLNAHVVRLLDSLLGIWGKIGSPTGNLSTGLMILHLIE 240
Query: 241 WVTSGLITLHSFEKLDVFSRATLESSKESYASFAVVMAAAGILRAFNTCNVLLSSSERET 300
WVTSGLI+LHSF+KL+ S+ LESSKESYASFAVVMAAAGILRAFN+ LLSSSERET
Sbjct: 241 WVTSGLISLHSFKKLNFLSQTALESSKESYASFAVVMAAAGILRAFNSYKALLSSSERET 300
Query: 301 ISRIRISAQDCLESISRNFISTMEGSSVSGN-DRSRSLFLLCISSAIARCGPVSSRPPVL 360
ISRIRISAQDCLESI++NFISTMEGSS++GN D RSL LLCIS A+ARCGPV+SRPPVL
Sbjct: 301 ISRIRISAQDCLESIAKNFISTMEGSSITGNDDHGRSLLLLCISLAVARCGPVASRPPVL 360
Query: 361 ICIIYALLSEIFPLQRLYAKILEFSSVESGALGLPLVKEHLDSIPFKEAGAITGVLCTQY 420
IC+ YALL+EIFPLQRLYAK+L+FS ESG LGL LVKEHLDSIPFKEAG I GVLC+QY
Sbjct: 361 ICVTYALLTEIFPLQRLYAKLLKFSFGESGVLGLTLVKEHLDSIPFKEAGVIAGVLCSQY 420
Query: 421 ASIDEEDKKIVENLVWDYCHDIYSRHRLVGLVLRGREDELLEGIEKIAESAFFMVVVFAL 480
ASIDE+DKK VENLVWDYC DIYSRHR VGLVLR REDELLE IEKIAESAF MVVVFAL
Sbjct: 421 ASIDEDDKKFVENLVWDYCQDIYSRHRRVGLVLRHREDELLENIEKIAESAFLMVVVFAL 480
Query: 481 AVTKEKLDSKCTPESQFDVSVRILASFSCMEYFRHIRLPEYMDTIRGVVASIQENESACV 540
AVTKEKL+SK TPE+QFDVSVRIL SFSCMEYFR IR+PEYMDTIRGVVAS+QENESACV
Sbjct: 481 AVTKEKLNSKYTPETQFDVSVRILDSFSCMEYFRRIRMPEYMDTIRGVVASVQENESACV 540
Query: 541 SFIESMPTYKDQTNGPDNFIGQKLKYTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKV 600
SFIESMP+Y+DQT+GPD+ IGQKL+YTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKV
Sbjct: 541 SFIESMPSYQDQTHGPDSSIGQKLQYTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKV 600
Query: 601 VAPTMFLYMGHPNAKVARASHSVFIAFISGKDDSEEEKRVILKEELVFYYIERSLS 655
VAPTMFLYMGHPNAKVARASHSVFIAFISGKDD E+ RV+LKEELVFYYIERSLS
Sbjct: 601 VAPTMFLYMGHPNAKVARASHSVFIAFISGKDDDEDGNRVMLKEELVFYYIERSLS 656
BLAST of Sed0002136.6 vs. ExPASy TrEMBL
Match:
A0A6J1FV65 (uncharacterized protein LOC111448717 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111448717 PE=4 SV=1)
HSP 1 Score: 1067.4 bits (2759), Expect = 2.6e-308
Identity = 556/656 (84.76%), Postives = 600/656 (91.46%), Query Frame = 0
Query: 1 MAKQANSVFLEEWLNSTSGTSSALNFKTTSSSAREIIQAWAELRSSLEHQSFDDRHIQSL 60
MAKQANSVFLEEWL S SG SS N K +SSSAREIIQAWAELRSSLEH+ FDDRHIQSL
Sbjct: 1 MAKQANSVFLEEWLKSISGISSGFNSKISSSSAREIIQAWAELRSSLEHRLFDDRHIQSL 60
Query: 61 KTLVNSQSSLFVADPQAKLVISILSSPNFPLPHESYPLFLRVLYIWVRKSLRPSLLLVDS 120
KTLVNSQSSL+VADPQAKLV+SILSSPN LP ESYPLFLR+LYIWVRKSLRPSL+LVDS
Sbjct: 61 KTLVNSQSSLYVADPQAKLVVSILSSPNLSLPDESYPLFLRILYIWVRKSLRPSLVLVDS 120
Query: 121 FVEVLSQIFSSKIELGKNPLFFSEVVLVLGAISYIPSASENSKLCCLELLCRVL-EEGYL 180
VEVLSQIFSSKI L KNPLF SE VL+LGAISY+ SASE SKLCCLELLCR+L EE +L
Sbjct: 121 SVEVLSQIFSSKIGLRKNPLFISEGVLILGAISYVVSASEKSKLCCLELLCRILEEEEWL 180
Query: 181 LVGSVGGMIPGFLAGIGYALSSSVNAHVVRLLDSLLGIWGQVGGPTGNLSSGLMILHLIE 240
L+GSVGG +P F AGIGYALSSS+NAHVVRLLDSLLGIWG++G PTGNLS+GLMILHLIE
Sbjct: 181 LIGSVGGTVPEFFAGIGYALSSSLNAHVVRLLDSLLGIWGKIGSPTGNLSTGLMILHLIE 240
Query: 241 WVTSGLITLHSFEKLDVFSRATLESSKESYASFAVVMAAAGILRAFNTCNVLLSSSERET 300
WVTSGLI+LHSF+KL+ S+ LESSKESYASFAVVMAAAGILRAFN+ LLSSSERET
Sbjct: 241 WVTSGLISLHSFKKLNFLSQTALESSKESYASFAVVMAAAGILRAFNSYKALLSSSERET 300
Query: 301 ISRIRISAQDCLESISRNFISTMEGSSVSGN-DRSRSLFLLCISSAIARCGPVSSRPPVL 360
ISRIRISAQDCLESI++NFISTMEGSS++GN D RSL LLCIS A+ARCGPV+SRPPVL
Sbjct: 301 ISRIRISAQDCLESIAKNFISTMEGSSITGNDDHGRSLLLLCISLAVARCGPVASRPPVL 360
Query: 361 ICIIYALLSEIFPLQRLYAKILEFSSVESGALGLPLVKEHLDSIPFKEAGAITGVLCTQY 420
IC+ YALL+EIFPLQRLYAK+L+FS ESG LGL LVKEHLDSIPFKEAG I GVLC+QY
Sbjct: 361 ICVTYALLTEIFPLQRLYAKLLKFSFGESGVLGLTLVKEHLDSIPFKEAGVIAGVLCSQY 420
Query: 421 ASIDEEDKKIVENLVWDYCHDIYSRHRLVGLVLRGREDELLEGIEKIAESAFFMVVVFAL 480
ASIDE+DKK VENLVWDYC DIYSRHR VGLVLR REDELLE IEKIAESAF MVVVFAL
Sbjct: 421 ASIDEDDKKFVENLVWDYCQDIYSRHRRVGLVLRHREDELLENIEKIAESAFLMVVVFAL 480
Query: 481 AVTKEKLDSKCTPESQFDVSVRILASFSCMEYFRHIRLPEYMDTIRGVVASIQENESACV 540
AVTKEKL+SK TPE+QFDVSVRIL SFSCMEYFR IR+PEYMDTIRGVVAS+QENESACV
Sbjct: 481 AVTKEKLNSKYTPETQFDVSVRILDSFSCMEYFRRIRMPEYMDTIRGVVASVQENESACV 540
Query: 541 SFIESMPTYKDQTNGPDNFIGQKLKYTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKV 600
SFIESMP+Y+DQT+GPD+ IGQKL+YTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKV
Sbjct: 541 SFIESMPSYQDQTHGPDSSIGQKLQYTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKV 600
Query: 601 VAPTMFLYMGHPNAKVARASHSVFIAFISGKDDSEEEKRVILKEELVFYYIERSLS 655
VAPTMFLYMGHPNAKVARASHSVFIAFISGKDD E+ RV+LKEELVFYYIERSLS
Sbjct: 601 VAPTMFLYMGHPNAKVARASHSVFIAFISGKDDDEDGNRVMLKEELVFYYIERSLS 656
BLAST of Sed0002136.6 vs. ExPASy TrEMBL
Match:
A0A6J1KX18 (uncharacterized protein LOC111498339 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111498339 PE=4 SV=1)
HSP 1 Score: 1064.3 bits (2751), Expect = 2.2e-307
Identity = 557/656 (84.91%), Postives = 598/656 (91.16%), Query Frame = 0
Query: 1 MAKQANSVFLEEWLNSTSGTSSALNFKTTSSSAREIIQAWAELRSSLEHQSFDDRHIQSL 60
MAKQANSVFLEEWL S SG SS N K +SSSAREIIQAWAELRSSLEHQ FDDRHIQSL
Sbjct: 1 MAKQANSVFLEEWLKSISGISSGFNSKISSSSAREIIQAWAELRSSLEHQLFDDRHIQSL 60
Query: 61 KTLVNSQSSLFVADPQAKLVISILSSPNFPLPHESYPLFLRVLYIWVRKSLRPSLLLVDS 120
KTLVNSQSSL+VADPQAKLVISILSSPN LP ESYPLFLR+LYIWVRKSLRPSL+LVDS
Sbjct: 61 KTLVNSQSSLYVADPQAKLVISILSSPNLSLPDESYPLFLRILYIWVRKSLRPSLVLVDS 120
Query: 121 FVEVLSQIFSSKIELGKNPLFFSEVVLVLGAISYIPSASENSKLCCLELLCRVL-EEGYL 180
VE+LSQIFSSKI L KNPLF SE VL+LGAISY+ SASE KLCCLELLCR+L EE +L
Sbjct: 121 SVEILSQIFSSKIGLRKNPLFISEGVLILGAISYVVSASEKFKLCCLELLCRILEEEEWL 180
Query: 181 LVGSVGGMIPGFLAGIGYALSSSVNAHVVRLLDSLLGIWGQVGGPTGNLSSGLMILHLIE 240
L+GSVGG +P F AGIGYALSSSVNAHVVRLLDSLLGIWG++G PTGNLS+GLMILHLIE
Sbjct: 181 LIGSVGGTVPEFFAGIGYALSSSVNAHVVRLLDSLLGIWGKIGSPTGNLSTGLMILHLIE 240
Query: 241 WVTSGLITLHSFEKLDVFSRATLESSKESYASFAVVMAAAGILRAFNTCNVLLSSSERET 300
WVTSGLI+LHSF+KLD S+A LESSKESYASFAVVMAAAGILRAFN+ LLSSSERET
Sbjct: 241 WVTSGLISLHSFKKLDFLSQAALESSKESYASFAVVMAAAGILRAFNSYKALLSSSERET 300
Query: 301 ISRIRISAQDCLESISRNFISTMEGSSVSGN-DRSRSLFLLCISSAIARCGPVSSRPPVL 360
ISRIRISAQDCLESI++NFISTMEGSS++GN D RSL LLCIS A+ARCGPV+SRPPVL
Sbjct: 301 ISRIRISAQDCLESIAKNFISTMEGSSITGNDDHGRSLLLLCISLAVARCGPVASRPPVL 360
Query: 361 ICIIYALLSEIFPLQRLYAKILEFSSVESGALGLPLVKEHLDSIPFKEAGAITGVLCTQY 420
IC+ YALL+EIFPLQRLYAK+LEFS ESG LGL LVKEHLDSIPFKEAG I GVLC+QY
Sbjct: 361 ICVTYALLTEIFPLQRLYAKLLEFSFGESGVLGLSLVKEHLDSIPFKEAGVIAGVLCSQY 420
Query: 421 ASIDEEDKKIVENLVWDYCHDIYSRHRLVGLVLRGREDELLEGIEKIAESAFFMVVVFAL 480
ASIDE+DKKIVENLVWDYC DIYSRHR VGLVLR REDELLE IEKIAESAF MVVVFAL
Sbjct: 421 ASIDEDDKKIVENLVWDYCQDIYSRHRRVGLVLRHREDELLENIEKIAESAFLMVVVFAL 480
Query: 481 AVTKEKLDSKCTPESQFDVSVRILASFSCMEYFRHIRLPEYMDTIRGVVASIQENESACV 540
AVTKEKL+SK T E+QFDVSVRIL SFSCMEYFR IR+PEYMDTIRGVVAS+QENESACV
Sbjct: 481 AVTKEKLNSKYTLETQFDVSVRILNSFSCMEYFRRIRMPEYMDTIRGVVASVQENESACV 540
Query: 541 SFIESMPTYKDQTNGPDNFIGQKLKYTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKV 600
SFIESMP+Y+DQT+GPD+ IGQKL+Y WTEDEVQTARMLFYIRVIPTCIE VPTQVYRKV
Sbjct: 541 SFIESMPSYQDQTHGPDSSIGQKLQYIWTEDEVQTARMLFYIRVIPTCIELVPTQVYRKV 600
Query: 601 VAPTMFLYMGHPNAKVARASHSVFIAFISGKDDSEEEKRVILKEELVFYYIERSLS 655
VAPTMFLYMGHPN+KVARASHSVFIAFISGKDD E+ RV+LKEELVFYYIERSLS
Sbjct: 601 VAPTMFLYMGHPNSKVARASHSVFIAFISGKDDGEDGNRVMLKEELVFYYIERSLS 656
BLAST of Sed0002136.6 vs. ExPASy TrEMBL
Match:
A0A6J1KSR2 (uncharacterized protein LOC111498339 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111498339 PE=4 SV=1)
HSP 1 Score: 1064.3 bits (2751), Expect = 2.2e-307
Identity = 557/656 (84.91%), Postives = 598/656 (91.16%), Query Frame = 0
Query: 1 MAKQANSVFLEEWLNSTSGTSSALNFKTTSSSAREIIQAWAELRSSLEHQSFDDRHIQSL 60
MAKQANSVFLEEWL S SG SS N K +SSSAREIIQAWAELRSSLEHQ FDDRHIQSL
Sbjct: 1 MAKQANSVFLEEWLKSISGISSGFNSKISSSSAREIIQAWAELRSSLEHQLFDDRHIQSL 60
Query: 61 KTLVNSQSSLFVADPQAKLVISILSSPNFPLPHESYPLFLRVLYIWVRKSLRPSLLLVDS 120
KTLVNSQSSL+VADPQAKLVISILSSPN LP ESYPLFLR+LYIWVRKSLRPSL+LVDS
Sbjct: 61 KTLVNSQSSLYVADPQAKLVISILSSPNLSLPDESYPLFLRILYIWVRKSLRPSLVLVDS 120
Query: 121 FVEVLSQIFSSKIELGKNPLFFSEVVLVLGAISYIPSASENSKLCCLELLCRVL-EEGYL 180
VE+LSQIFSSKI L KNPLF SE VL+LGAISY+ SASE KLCCLELLCR+L EE +L
Sbjct: 121 SVEILSQIFSSKIGLRKNPLFISEGVLILGAISYVVSASEKFKLCCLELLCRILEEEEWL 180
Query: 181 LVGSVGGMIPGFLAGIGYALSSSVNAHVVRLLDSLLGIWGQVGGPTGNLSSGLMILHLIE 240
L+GSVGG +P F AGIGYALSSSVNAHVVRLLDSLLGIWG++G PTGNLS+GLMILHLIE
Sbjct: 181 LIGSVGGTVPEFFAGIGYALSSSVNAHVVRLLDSLLGIWGKIGSPTGNLSTGLMILHLIE 240
Query: 241 WVTSGLITLHSFEKLDVFSRATLESSKESYASFAVVMAAAGILRAFNTCNVLLSSSERET 300
WVTSGLI+LHSF+KLD S+A LESSKESYASFAVVMAAAGILRAFN+ LLSSSERET
Sbjct: 241 WVTSGLISLHSFKKLDFLSQAALESSKESYASFAVVMAAAGILRAFNSYKALLSSSERET 300
Query: 301 ISRIRISAQDCLESISRNFISTMEGSSVSGN-DRSRSLFLLCISSAIARCGPVSSRPPVL 360
ISRIRISAQDCLESI++NFISTMEGSS++GN D RSL LLCIS A+ARCGPV+SRPPVL
Sbjct: 301 ISRIRISAQDCLESIAKNFISTMEGSSITGNDDHGRSLLLLCISLAVARCGPVASRPPVL 360
Query: 361 ICIIYALLSEIFPLQRLYAKILEFSSVESGALGLPLVKEHLDSIPFKEAGAITGVLCTQY 420
IC+ YALL+EIFPLQRLYAK+LEFS ESG LGL LVKEHLDSIPFKEAG I GVLC+QY
Sbjct: 361 ICVTYALLTEIFPLQRLYAKLLEFSFGESGVLGLSLVKEHLDSIPFKEAGVIAGVLCSQY 420
Query: 421 ASIDEEDKKIVENLVWDYCHDIYSRHRLVGLVLRGREDELLEGIEKIAESAFFMVVVFAL 480
ASIDE+DKKIVENLVWDYC DIYSRHR VGLVLR REDELLE IEKIAESAF MVVVFAL
Sbjct: 421 ASIDEDDKKIVENLVWDYCQDIYSRHRRVGLVLRHREDELLENIEKIAESAFLMVVVFAL 480
Query: 481 AVTKEKLDSKCTPESQFDVSVRILASFSCMEYFRHIRLPEYMDTIRGVVASIQENESACV 540
AVTKEKL+SK T E+QFDVSVRIL SFSCMEYFR IR+PEYMDTIRGVVAS+QENESACV
Sbjct: 481 AVTKEKLNSKYTLETQFDVSVRILNSFSCMEYFRRIRMPEYMDTIRGVVASVQENESACV 540
Query: 541 SFIESMPTYKDQTNGPDNFIGQKLKYTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKV 600
SFIESMP+Y+DQT+GPD+ IGQKL+Y WTEDEVQTARMLFYIRVIPTCIE VPTQVYRKV
Sbjct: 541 SFIESMPSYQDQTHGPDSSIGQKLQYIWTEDEVQTARMLFYIRVIPTCIELVPTQVYRKV 600
Query: 601 VAPTMFLYMGHPNAKVARASHSVFIAFISGKDDSEEEKRVILKEELVFYYIERSLS 655
VAPTMFLYMGHPN+KVARASHSVFIAFISGKDD E+ RV+LKEELVFYYIERSLS
Sbjct: 601 VAPTMFLYMGHPNSKVARASHSVFIAFISGKDDGEDGNRVMLKEELVFYYIERSLS 656
BLAST of Sed0002136.6 vs. ExPASy TrEMBL
Match:
A0A6J1DGY7 (uncharacterized protein LOC111020395 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111020395 PE=4 SV=1)
HSP 1 Score: 1048.1 bits (2709), Expect = 1.6e-302
Identity = 546/654 (83.49%), Postives = 596/654 (91.13%), Query Frame = 0
Query: 1 MAKQANSVFLEEWLNSTSGTSSALNFKTTSSSAREIIQAWAELRSSLEHQSFDDRHIQSL 60
MAK+ANSVFLEEWL S SGTSS+LN K+TS SAREIIQAWA LRSSLE+QSFDDRHIQSL
Sbjct: 1 MAKKANSVFLEEWLRSVSGTSSSLNSKSTSPSAREIIQAWAALRSSLENQSFDDRHIQSL 60
Query: 61 KTLVNSQSSLFVADPQAKLVISILSSPNFPLPHESYPLFLRVLYIWVRKSLRPSLLLVDS 120
KTLVNSQSSL+VADPQAK+VISILSSPNF LP ESYPLFLR+LYIWVRKSLRPSL+L+DS
Sbjct: 61 KTLVNSQSSLYVADPQAKIVISILSSPNFSLPDESYPLFLRILYIWVRKSLRPSLILIDS 120
Query: 121 FVEVLSQIFSSKIELGKNPLFFSEVVLVLGAISYIPSASENSKLCCLELLCRVLEEGYLL 180
VEVLSQIFSS+IEL K+P F SE +LVLGA S++ SASENSKL CLELLC + E+ YLL
Sbjct: 121 SVEVLSQIFSSRIELRKSPSFLSEGILVLGAFSFLFSASENSKLFCLELLCSLFEQEYLL 180
Query: 181 VGSVGGMIPGFLAGIGYALSSSVNAHVVRLLDSLLGIWGQVGGPTGNLSSGLMILHLIEW 240
+GSVGG+IP LAGIGYALSSSVNAH+VRLLDSLLGIWG+VGGP+G++SSGLMILHL EW
Sbjct: 181 IGSVGGIIPEVLAGIGYALSSSVNAHIVRLLDSLLGIWGKVGGPSGSVSSGLMILHLNEW 240
Query: 241 VTSGLITLHSFEKLDVFSRATLESSKESYASFAVVMAAAGILRAFNTCNVLLSSSERETI 300
VTSGLI+LHSFEKLDVFSRATLESSKESYASFAVVMAAAGILRAFNT LLSSSERETI
Sbjct: 241 VTSGLISLHSFEKLDVFSRATLESSKESYASFAVVMAAAGILRAFNTYKALLSSSERETI 300
Query: 301 SRIRISAQDCLESISRNFISTMEGSSVSGNDRSRSLFLLCISSAIARCGPVSSRPPVLIC 360
SRIRI AQDCLESI+RNFIS EG ++GND+ RSL LLCIS A+ARCGP+SSR P+LIC
Sbjct: 301 SRIRILAQDCLESIARNFISFTEGFLITGNDQ-RSLLLLCISLALARCGPLSSRSPLLIC 360
Query: 361 IIYALLSEIFPLQRLYAKILEFSSVESGALGLPLVKEHLDSIPFKEAGAITGVLCTQYAS 420
++YALL+EIFPL+RLYAKILE S ES ALGL LVKEHLDSIPFKE+GA+ GVLC+QYAS
Sbjct: 361 VVYALLTEIFPLRRLYAKILEVSFGESTALGLTLVKEHLDSIPFKESGAVAGVLCSQYAS 420
Query: 421 IDEEDKKIVENLVWDYCHDIYSRHRLVGLVLRGREDELLEGIEKIAESAFFMVVVFALAV 480
IDEE+K VENLVWDYC D+YSRHR VGLVL GREDELLE IEKIAESAF MVVVFALAV
Sbjct: 421 IDEENKTFVENLVWDYCQDVYSRHRQVGLVLCGREDELLENIEKIAESAFLMVVVFALAV 480
Query: 481 TKEKLDSKCTPESQFDVSVRILASFSCMEYFRHIRLPEYMDTIRGVVASIQENESACVSF 540
TKEKLD K T E+QFD+SVRIL SFSCMEYFR IRLPEYMD IRGVVASIQENESACVSF
Sbjct: 481 TKEKLDPKYTLETQFDISVRILDSFSCMEYFRRIRLPEYMDAIRGVVASIQENESACVSF 540
Query: 541 IESMPTYKDQTNGPDNFIGQKLKYTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRKVVA 600
IESMPTY+DQTNGPDN G+K+KY WTEDEVQTARMLFY+RVIPTCIERVPTQV+RKVV
Sbjct: 541 IESMPTYQDQTNGPDNSTGRKIKYIWTEDEVQTARMLFYLRVIPTCIERVPTQVFRKVVV 600
Query: 601 PTMFLYMGHPNAKVARASHSVFIAFISGKDDSEEEKRVILKEELVFYYIERSLS 655
PTMFLYMGHPN KVA+ASHSVFIAFISGKDD E+EKRVILKEELVFYY+ERSLS
Sbjct: 601 PTMFLYMGHPNGKVAQASHSVFIAFISGKDDDEDEKRVILKEELVFYYLERSLS 653
BLAST of Sed0002136.6 vs. TAIR 10
Match:
AT1G73970.1 (unknown protein; Has 34 Blast hits to 33 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 32; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )
HSP 1 Score: 615.1 bits (1585), Expect = 6.7e-176
Identity = 334/658 (50.76%), Postives = 461/658 (70.06%), Query Frame = 0
Query: 1 MAKQA-NSVFLEEWLNSTSGTSSA--LNFKTTSSSAREIIQAWAELRSSLEHQSFDDRHI 60
MA++A NS FLEEWL + SG+S + L + ++ SAR IIQAW+E+R SL++Q+FD R++
Sbjct: 1 MARKANNSFFLEEWLRTVSGSSVSGDLVKQNSAPSARSIIQAWSEIRESLQNQNFDSRYL 60
Query: 61 QSLKTLVNSQSSLFVADPQAKLVISILSSPNFPLPHESYPLFLRVLYIWVRKSLRPSLLL 120
Q+L+ LV+S+S++ VADPQAKL+ISIL+ + LP ESY L LR+LY+W+RK+ RPS L
Sbjct: 61 QALRALVSSESTIHVADPQAKLLISILAFQDVSLPSESYTLVLRLLYVWIRKAFRPSQAL 120
Query: 121 VDSFVEVLSQIFSSKIELGKNPLFFSEVVLVLGAISYIPSASENSKLCCLELLCRVLEEG 180
V V+ + + + L P ++ VLV GA + +PS S + K+ CLELLCR+LEE
Sbjct: 121 VGVAVQAIRGVVDDRRNL--QPALVAQSVLVSGAFACVPSLSGDVKVLCLELLCRLLEEE 180
Query: 181 YLLVGSVGGMIPGFLAGIGYALSSSVNAHVVRLLDSLLGIWGQVGGPTGNLSSGLMILHL 240
Y LVGS ++P LAGIGYALSSS++ H VRLLD L GIW + GP G ++ GLMILHL
Sbjct: 181 YSLVGSQEELVPVVLAGIGYALSSSLDVHYVRLLDLLFGIWLKDEGPRGTVTYGLMILHL 240
Query: 241 IEWVTSGLITLHSFEKLDVFSRATLESSKESYASFAVVMAAAGILRAFNTCNVLLSSSER 300
IEWV SG + +S K+ +F+ LE+SKE YA FAV MAAAG++RA + S ++
Sbjct: 241 IEWVVSGYMRSNSINKMSLFANEVLETSKEKYAVFAVFMAAAGVVRA--STAGFSSGAQS 300
Query: 301 ETISRIRISAQDCLESISRNFISTMEGSSVSGNDRSRSLFLLCISSAIARCGPVSSRPPV 360
IS++R SA+ +E +++ +S ++ R L L C + A+ARCG VSS P+
Sbjct: 301 LEISKLRNSAEKRIEFVAQILVSNGNVVTLPTTQREGPL-LKCFAIALARCGSVSSSAPL 360
Query: 361 LICIIYALLSEIFPLQRLYAKILEFSSVESGALGLPLVKEHLDSIPFKEAGAITGVLCTQ 420
L+C+ ALL+++FPL ++Y E L V+EHL + FKE+GAI+G C Q
Sbjct: 361 LLCLTSALLTQVFPLGQIYESFCNAFGKEPIGPRLIWVREHLSDVLFKESGAISGAFCNQ 420
Query: 421 YASIDEEDKKIVENLVWDYCHDIYSRHRLVGLVLRGREDELLEGIEKIAESAFFMVVVFA 480
Y+S EE+K IVEN++WD+C ++Y +HR + ++L G ED LL IEKIAES+F MVVVFA
Sbjct: 421 YSSASEENKYIVENMIWDFCQNLYLQHRQIAMLLCGIEDTLLGDIEKIAESSFLMVVVFA 480
Query: 481 LAVTKEKLDSKCTPESQFDVSVRILASFSCMEYFRHIRLPEYMDTIRGVVASIQENESAC 540
LAVTK+ L + E + SV+IL SFSC+EYFRHIRLPEYM+TIR V++ +QEN++ C
Sbjct: 481 LAVTKQWLKPIVSKERKMVTSVKILVSFSCVEYFRHIRLPEYMETIREVISCVQENDAPC 540
Query: 541 VSFIESMPTYKDQTNGPDNFIGQKLKYTWTEDEVQTARMLFYIRVIPTCIERVPTQVYRK 600
VSF+ES+P Y TN P + Q++KY W+ D+VQT+R+LFY+RVIPTCI R+ +R
Sbjct: 541 VSFVESIPAYDSLTN-PKDLFTQRIKYEWSRDDVQTSRILFYLRVIPTCIGRLSASAFRG 600
Query: 601 VVAPTMFLYMGHPNAKVARASHSVFIAFISGKDDSEEEKRVILKEELVFYYIERSLSV 656
VVA TMFLY+GHPN KVA+ASH++ AF+S +SEE++R KE+LVFYY++RSL V
Sbjct: 601 VVASTMFLYIGHPNRKVAQASHTLLAAFLSSAKESEEDERTQFKEQLVFYYMQRSLEV 652
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038903923.1 | 4.1e-308 | 85.78 | uncharacterized protein LOC120090375 isoform X2 [Benincasa hispida] | [more] |
XP_038903921.1 | 4.1e-308 | 85.78 | uncharacterized protein LOC120090375 isoform X1 [Benincasa hispida] | [more] |
XP_022944217.1 | 5.3e-308 | 84.76 | uncharacterized protein LOC111448717 isoform X2 [Cucurbita moschata] | [more] |
XP_022944201.1 | 5.3e-308 | 84.76 | uncharacterized protein LOC111448717 isoform X1 [Cucurbita moschata] >XP_0229442... | [more] |
XP_023005293.1 | 4.5e-307 | 84.91 | uncharacterized protein LOC111498339 isoform X1 [Cucurbita maxima] >XP_023005295... | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1FWB1 | 2.6e-308 | 84.76 | uncharacterized protein LOC111448717 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1FV65 | 2.6e-308 | 84.76 | uncharacterized protein LOC111448717 isoform X2 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1KX18 | 2.2e-307 | 84.91 | uncharacterized protein LOC111498339 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1KSR2 | 2.2e-307 | 84.91 | uncharacterized protein LOC111498339 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1DGY7 | 1.6e-302 | 83.49 | uncharacterized protein LOC111020395 isoform X1 OS=Momordica charantia OX=3673 G... | [more] |
Match Name | E-value | Identity | Description | |
AT1G73970.1 | 6.7e-176 | 50.76 | unknown protein; Has 34 Blast hits to 33 proteins in 15 species: Archae - 0; Bac... | [more] |
Relationships
This mRNA is a part of the following gene feature(s):
Feature Name | Unique Name | Type |
Sed0002136 | Sed0002136 | gene |
The following five_prime_UTR feature(s) are a part of this mRNA:
Feature Name | Unique Name | Type |
Sed0002136.6-five_prime_utr | Sed0002136.6-five_prime_utr-LG04:5903924..5904014 | five_prime_UTR |
The following exon feature(s) are a part of this mRNA:
Feature Name | Unique Name | Type |
Sed0002136.6-exon | Sed0002136.6-exon-LG04:5903924..5905677 | exon |
Sed0002136.6-exon | Sed0002136.6-exon-LG04:5905792..5905942 | exon |
Sed0002136.6-exon | Sed0002136.6-exon-LG04:5906043..5906536 | exon |
Sed0002136.6-exon | Sed0002136.6-exon-LG04:5907216..5907495 | exon |
Sed0002136.6-exon | Sed0002136.6-exon-LG04:5916259..5916895 | exon |
The following CDS feature(s) are a part of this mRNA:
Feature Name | Unique Name | Type |
Sed0002136.6-cds | Sed0002136.6-cds-LG04:5904015..5905677 | CDS |
Sed0002136.6-cds | Sed0002136.6-cds-LG04:5905792..5905942 | CDS |
Sed0002136.6-cds | Sed0002136.6-cds-LG04:5906043..5906316 | CDS |
The following three_prime_UTR feature(s) are a part of this mRNA:
Feature Name | Unique Name | Type |
Sed0002136.6-three_prime_utr | Sed0002136.6-three_prime_utr-LG04:5906317..5906536 | three_prime_UTR |
Sed0002136.6-three_prime_utr | Sed0002136.6-three_prime_utr-LG04:5907216..5907495 | three_prime_UTR |
Sed0002136.6-three_prime_utr | Sed0002136.6-three_prime_utr-LG04:5916259..5916895 | three_prime_UTR |
The following polypeptide feature(s) derives from this mRNA:
Feature Name | Unique Name | Type |
Sed0002136.6 | Sed0002136.6-protein | polypeptide |