Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTR, exon, CDS, polypeptide, three_prime_UTR
CACAGCAAAACGAAACCGGTGAGTTTCGAACTCTCTCATTCTCCCAAAGCGTGCTTGTGACTTAGCTAGGGTTTCACTCCGAGGAGAAAAATGCAGGAGAAGAAATCAGAAACTGCGATTCCGTCTCCGAGGTCATCTCTAAACCCTAATTCTCTTCCTTGAATCGCGATTTCTTGATCGCTTGCGCTGTATTGTATCCTCAATTTTTCTGTAGAAGCTTCGTATACTGAATTGTTTTGAAACTTATATTCTGAAGTTTTTTTAATCCATAGAATCTTGTGCGAGAAAACTCGTTATGGCGTCTGCGCTTTGCTATGGAATTTTGTTTCATGTTCTAGACCTTAATTCAACAACTCTGACCAACTGGTTGGATTATTCATAGAATCTAGTAGTTTTTCTTCACATATTTTAGTTTCATTACCAACTGGTTGGATATGGATTATTGTTCATAACAAGGTTTAAGCTATGAAGCACGGACATAGGCATGGATACGGGATAGGGTTACGACACGATGATACGCCAATTTCTTAAAAAGTAAGTTACGGATACGCTAAGAAGCACGGACACGGACACGAGATACGGACACGACATGACACAGATACGTAGATACGCCATTATTTAAAAATGCTGGATACGGATACGTCGAAGACACGTGATTTATCATTATTTTTTATAATATACTTTAAAAGATGAAATCCAAAATATTTTAGTTAGTCATAAGCCTACTCGTAACGATCTCTAAATATTTTAGGTTTTGTCTCATTTTCTTTCCCTCCATTTAATTTTCTCTCTTTCTTTCCATTTGTCTATCTATCTTCGATAGACATCGTCGTCAACGGCCTCCACCAACACTTCTTGTTGTCTTGTCATCCGTCATTGGAAACAAATAAGTCTTAATTGAAGTGTTCATGAAGTGTCCATAAAGTGTCCGAAATTGAAAATAATAATAATAAAAGAGGACCGGAAATTAGAGTGTTGGACACGTGTCCGACGAGTGTTCGGAAGTATCGGTATCGGACACGAGTACGACACATATACTTTGTCAAAATAGAAGTGTCCGTGCTTCCTAGCGGATACGTTTGAGATACGAAAATTTTACTAAAATAAACCGTATGAAATTGAATTTAGGTTAGTTTTTTTCTTTTTATAATTTTACTACATAAAATATATGAAATTGAATTTCATTTTAGTAGCAATAAAATGGGTTCAATAAAATGAGAATAATGGGCGCGACTCACTAAAATGAGTTTAATTTGGTGAATATAATGAGTTGGGCTCATAAAAACGGCTTTTAAAAAATGCAAAAAAGCATCCTCACGTATCCTGGTTGTTTCCTTGTGTCCAATATTAAAAAAATAAAATAATACATAAGATACTTCAATTGACGTATTTGATACGTATCTTAGGCGTATCTTGCTATATCTGTGTCCGAAACGTATCTAATATAGATTCTTCGCCCAAAATAGTGTGTCCGTGCTTTATAGGGTTTAAGTAGGAGAACAAATCTTGGTTTGCTAGTTTATGGCTCCCGAACTTTGGGATTTCTTTTCCTTCAGTTGATTTTAATGGATCTCTGTAAATAAGTTAAAGTCTCTTTTATAATGGTTATATATGGTAACAAGTGAATTGCCTTTGTGTACAATTCATATCATGATTTTAATTGGCTTTCAGAGTTTGAAATTGTTCATTGGCCTCCTCTATGTTCTGTTGGCAGGAACCTTTAGCAGCACAGGATTATTAGTTTCTTCAAATTTTCCTAGATTTTTCGATCTATTCACACTGACCTTATCCAGTGATGATCCAAAATTAATCTCTCCAAATAGTTTTCAAGGGGTGGCGCAGTGGTTGAAGATTTGGATTTTGAAGGTATGCTCCCTTCAAGGTCCTAGGTTCGAGACTCACCTGTGACATTACTCCTTCGATGTCTCCCAGTGCCTGGCCTAAGGACGGGCGTGGTTATCCTTGTTTAAAAAAAAAATGAATCTCTCCAAATAATCCAG
TTCAGTCTACCAAAAGTGTCATGCATAAACTCTTCTATCACATCATTTTAGAGCATCCAAGTTTTACTTCTTGATTCTTATAAACGTGTAGGTTCACCAAGGGGGGAGGCATTTTGGAGGAGTGGAGGAACGTTTTTGGACAGATGTAGAGGCACTAGTAGATGACTAAGGTATGAGTACTTCTAAATCTTTTGAATTCCATGCTTAGATTATAGTATTGCTATAGAATTTGATTATAAAGTCATGTTTTATTTTGACATAAACTTTCACAAAAATTGCTTTAGCCTTTGGTCTTAAAAAAAGAAACAAACATTTTGGCTTTGAAATTTTAAGATATTCATTCTAGTCTCCGAACTCTTAGAAACACAGTATTATTATTATTTTTTCCATCATCTTTGTTCATTTTAGGCTGATTGTTTTGATCTATGGATAAAATGACCTTGAATGTTTGTATTAACATGATTGTGCACATATATGATATATTTGAGCCAAATAAAAGTTTGTTGAGGGGGGTATTAAGGACCCTTATTCTTTTTTGAAAAATCAAAAGATAAAAATGGTGTAGGTTGGTCAAAGATAGGTTTTAAAACCCAATTTTATTGATCAAATGAATTTTTTTCTTAAAAGTAATGTCGGAAAGGTAGCATCTATTTACTTTTATGCATAATGTCGCTTGTATTCAAACTGGTATGTAGTTGAAATGACATAAAATGAGTATTTTTTATAAAGATGGTAGAGGATTGTAACTCTCACTTGGATGTTGTTGAACTACAAAAAGCTTTATGCATGTTTGTGGCCGCTATCCAAGAATGAATATCCTCCTAAAAATTGTTGTGAAATTATACCTCCTAATATAACTTTTTCCATGGTATGGATTTGATCAGTTTATTCTTATTTCGTTTTTTAGTCTCTAAATTTTTCCCAGTTTGTTCTCGTGTCCAAAAACTTCTCTCTCCTTTGATTTTGGTCCTTAGATATTTATTATTATTATTATTATCTATTGAATGTGAATTTGCTGTGATACTGTGGACACAACAAATAGATAAGTGATGGAATGTTATAAATTAAATATAGAGTATGATTTTTTTAGAGAGAGAAAAAATTGAGCATGTTAAAATTGGAGAGTAAAATCTAGATTTTAATATTTAAAAATATCGAGATAAAAAGTGAGTAAAAATTGGGGGACTAAAAAAGAATCCAATATGTGCAAAGAAAATGTCAAAAGTTGAATTACTAAAATTGTATTTAAATTGAAATAATAACTTCAAAAGCTTAGATTAGATTAGAACGAGCCTAATCGATACCTTTTCCACCGTATTGAAATTGAAGGCTATTTGTTTCAAAATAAATAAATAAATAAATAAAAATTGAAGGCTATTTAAATTATTCCTTTCTTTCTTTCTGACTTTACCTTTCCTTTCGGTCCCGATAAGGCTCTCAAGCGTTTACCAAAGTAAGGTGTGCGGTGATTTGGACCAAACCAAAGGCTGAATAATATATGAGTAAATTGTACAAGATGAACTTTTCACTCAAAATAAGTAAAAATGTGGAAAAGTTTTAAATTATTTATAAATATTAATAATACTAAGAAAGTCTGTGATAATATTTGTTGATATAAAAAATTATATTTGCTATTTTATAAAAAAATTATTATATTTGTAATTATTTTTTTAGATTTGATTAGAAAAATTATATAGATTAAATTCTAATCTTTTTTAAACCTTAAAGATTATTTTTTTAAAATTTGAAATGTTTTAATTATAATTTTATTTGTATTAAATTTATAATTTACTTTCTTTCATTGTATTTTGAATTGTTTTCCCTTATCTATTTAAAAACTAAAAAATAAAAGAGAAAGAAAATAAATGTGTTGCAAAACGACTCATGTGGGACCACCAACGCAAGCCAAATTGTAGATACCAGTTCCCTAATATCTGTGAAACTCAACCGAAATCAAATAATTGATATGGAATGTTGGTCCCCGCTACATATAAATATTA
TATGATAGAAAGTATAACAACCGTGAAATAAGTTTGTAGACTCGGTGGATACATTATTTTATTTTATTTTTGAAGTAGGGGTTAGTTTTATTTTGGTCATTCGATTTTCAACTTATTAATTTATCCTTTCTTTTTTTCTTTAGTAACAAATTTGTTCTTTATGTTGTAAAGTCGTTAGTTTTGGCTCCACTCAATAATTTCTTTAGCTAGCTTGGTAACTAAGAATCACAATTTCACTCTACTACCTATTCTGCCCAGTTGAAGCATTGGAACTATATTGATACTTTTGAACTTAATATATACATATAGTACTTCATTTTTAATTCCCTATAGTATTTTTTCTTCTTCTTAATATCCTTAACAAACAAGTCAAATGTTCTATCATAAAAAAGAAAAAAAAAATTGTACTATACAATTTATCTAAATTTTGTTGTACATGATTACAAAATACGTCAATTTAGCCTAATAATTATATAGGTTATGTAAAAAAAAAGGAGATAGATCAGTAGTATGCAATCAATAGTTTGACCAAACCATTGCAACTATATTCTCTATGAAAACATTCATTCATTTTTTCTTTCGAAGAACATGATTCTATTATACGATGATTCTAAAGTTTAGCCAAATATCATGATGAACGTGGGGGTTTAAGACGGTTTGGCCTATCAAACGATTGATTTATTTTTGGTCGTGGGACCCACATTCCAAATCAATTATTTGATTTGGATTGAGACATAAAAAACAAGGGAAGCTTGCCCCCCAAAGAAATAAAAAATTCAAACCTAGAAAATAATTATATACCGTCATTTAAAAAAAATAGTGAATAAAATTATGATATTGATAATTATTTATAAGGCACATAAATTAGATTTCCACCAAATAATGAATTATGGGTGTATTTGATTTTTGGACATAAGGAGTGGTTAATCATTATTCCTAAAAAATCTTTGTTTGTTTTTTGTAGTAAACAACCACGGTTAACTATTCTTCAACTCCTTTATCAAATCTGACATTAAATTTTACATCAAATTTATCATTCAACCACTATTAATTTCAATATCGAAATAAGTAGTTGATTACTCTATTATTATTAATCTTAATCCTCAAATGAGTAGTTAACCACTTATATCTTCCACTTCATTCTTTTCTATAAAAAACAAACACATCTTATATATATTTTTTGAGTACGTACTGGGAGTGTGGGATAAATATCGATTTTCAAGTTCGAACTTGTGAGATTGATTTTAAATTGATTTATAATTAAATAATAAAATAATATGAATATTCTTAAAATAAGATCCTTACATTTTAATGCAAAATAAAAAAAAAGAAGAAACAGATGATCTTAATTATCGAACCAAAATAAGGGGAAAAAAAAGAAGAAAAAACGAAAGTTTTGTTATGAAATTTGCCGTCGTATCGTCAGTCTCTCGTGGCTTCTCCAGAAAACGGGCCTTCGTCCGGTTTTACGGATAAGAAACGACGTGACAGGTGGGCCCGCGAAAAGGCTGTAAGGATCTGGGAGCTACAATCTGGTGGGGCCCAGCTGATGTGGGCAGTATGCAATACTCACCGGATCTTAAGCCGACTTTATATTCTGAATTTTATATCGCCCTTGAAGTCCGCGAGGTAGGTCCCTTTTACCCCTTCACACCCACAAAAAAATATCTAATCCCTTTCTATATATGTCGGTCATTGCTATGTCATTCCTTTCCCTTCTCTTCATTCTTACCTTTCGCGATTGGAATTGCAATTACCAGTTCCCGTTTTCCGATTGCAAAATTCTCTTTGATTTCTATCCGCTCTCCTTAATCTTCTTGTCCGATTTGTGTTTCGTCGTGTTTGAACCGTTCTGGAATCGAATTCCGGTGCGTGCGAGGAAGACGAAGATCTCTGGACTTTGAATCCACATACGTTAATCTCGAATTCTGTTTTGTTAGGTTTATCGATGGCGAACCCTAGAAAAGAAGAATCCATTTCCAGCAACGTCAGCGATGGAA
CAGATCGGGCCGATCGTGATAACGTTGAGGAATTTAGGGATTCGTCTCGAGTTGGCGTTGTTTCTTCGAATCGTGTGGAGGTTTCGGGTGGTTCGCATGCTTCCACGACGGAGATTAACCTTACGGAGCGGCTGACTGATATACTTGTGGATGAAGGTGATGGCGATCTGATGCTTCAGCAGAGCGATCGGGAGGATAGGGTTATACGGTGGCTTCAGGCGCTGGATTTGCAAGTCATGGGCGCTTGTCGTGCGGATGAAAGGTTGAAGCCGTTGTTGAAGATGACTGCGTCTAATGGCATCGCGGAAGATCACCTTCTTGCTCATTTGAGTCAGGTTTGTGTTTAACTATGTATTGGATTTTTAATATATATATATATATAGTGGATTTTTAATTGTCGACGATAATTCACTGCTTTATATTCCATGCTTGTTCTGCAATTCTACGAAGGACTCCTTGGTGCCGATGGAGATGTCATGGAATCTTTATTTGGCGTGAGTAAAAAAGATTGAATGATTTTATTTTTTGATTTTATTTACATCGCTTCCTCCGTCGTTTTTCCTCTGTAGTGATGATTATTGTTACGAGGTTTTGAAGTTTCTAGTTCTGCGTTTTTTGTTTTCCGTTTTGCTGATCAAATCTGTTTCTGTTTTTGTAATTCAGCATTTCGAGCCTGTTGAAGTTGGAATTCTAGCGAGATGTTTCTGCATACCTCTGGTGTCTATTCGCGTCGGAAAAGTTAACAAGCAAGGGACTCTCCTTTGCCCTACGACCACAAGGTAATTGTTGGCATAAGATTATATTGTCTTTTCATTATCATGTTATCTTTGAATAATATGTATAAGTTCCACTCTGATCAGAAAGAGGATCACTGAGATAGCGACCATATAGGCACTTTGTTGTTTTTTCCTGTATTTTTTGACAATTTTATCAGGTTAGTTGGGTTGAGAACATCCTAATAGTGTATTAGCTTTAGGGGGTGCTATCAATTCCTTTATAATTCGTTTGATTTAATTGAAGGTCAATTGCATTTCATTTTTCTAGAACACAATTCTGAATCTCCTAATTTGCAATGACCATTTGTACTTCAGTTCTTGGATTATCGATTGAAATTCTAGAAGACTTCTAATCCTAATTTCTTTTATTTTTAGTGTAAGACTTGCCATATTAATGCAAGTGTGGACTCTGAGTTCCTTGGTTGTCCTGTTTGTATAATATATTTGTTTCTTGATTGACTTTGTATGACTTCAATGTAATTTCACTCGACTAGTTTAAAGTTAGGGAAGTTGTTATGAAGCTTTTTGTAGAAAGATCACTCATCTGTTCATGGTGTCTTTTTCTCCTTATATAAAAGAATGCTGAGTGTCTTGAGCGGTGGTAACTGTGCTCTCTCTTCACCTCACCGTCGATATTTGTTCACATCATTTTGACTCTACAGCCCCTGATAAATTAATAGTATAAAAACTGTCACACCGGCACTTATAAGGTAGGTAGCACACATGATGCCTGGCGGTAGCTGCTATGCTTCTGTTTCTGGTGTATCTATATTCCGACATTTAGTTCATTAATCAATTTTAGCTGGGTGGGGTTTAACTAGGATTTTTGTTAGTGCTTCTACTTCTACGGTATAAGCATATAATGTTACTTGAAAGTGTGTAATTAGTGATAATTTATAGATTTTAGGACTCTACAATGTTAGCTTGTTAATGGAAACAGCGTACATTATTCCAAATAAATATATCTTTTGGTTTGTAGGTAAAGTGGCAGTTTTCTAATATGATCTCTGAATTATGAGGAAGTAAAGGTAGTTTTTCATGCTCTCATACTATAATAGCTTGCTCTTTTTTTTGTAGTGTAGGATTGCAATTGATTAAAATGGAGTTGTTTGGAATGGGTGTTATGAATGATGATTTTTTTTTGTAATTTCCTGACTTTTCAGAACAACTCTGTACATTTCCCTGCTCTGTTTAATTATTGTTTTTCCACTTTACGTTGAATCA
CTATCCATTTACCAGAGTGTTAGAAGCCTAACTGCTTGTTATCAGACACCAATGATTTTGTTGTTTTCTTGAAGAATAATGTTGTAAAAAGATTGTGTGATTGTTTTGATTGCTCAATTTAACTTAAATTTAAGGTTTTGATTTTTGTGGTCCCTTGTTATGTTTTGCCTTGGACGTTGATAGTATAGTTTTGCTATTATTTCTTAGGTCTTTTTTACTTAACTGGAGTCCATTTTTTGAACTAGGTTTCCTTTTGTGGGTTGTCGTTTTTTGTATGCACTTGCATCCTTTCTTTTTTCTCCTTATTGAAAGCACAGTTCTTATCATAAAAGAAAAAAAACATCATTTTAAAATTTAACAATTGATTATCTCGACCTTTTGATTTGATTGTTGTTCGGGATATCTCAGTGTTTTTGTATTGTTTCTAAAGGTATCCTTTTAGGTGGTTAGTTGTATGTTTTCAATATGAAATTTTCTAATAATGATGGAAGATGTAGCTGTGTACTACTGTACTAGCAGTGCACTAAGAGCTAAGAGGTCGTTACAATAGAGTAATAATTTCTTCATGGTGAGAAAATTTCAAATATTGGTGAGGATCTGGTTGTTAATAAACGAGATAGATATTGGATACAAGATTACTATCTGAATCTTGTATCCCACAAAAATTTTGTTACCTTCTCATCTCGATTGCTTTCCATCTTCCTTACTGTTAACTAGATTCTACTGTTAAGATTTTGATAGTGTTGCTTATTACATTCTTTTTATGTTTCTAAAGGTTTCTCTCTCTTGTTTATTTTCTCAGGGGAAATTTAAATCTAATGGTCCTCCCATCATCAGACTTTCGGCTCTCATTCATTGGGGATAATGGCCATGTAGAGAGACTATTCACTCTGAGTAACAGAATGACTAGTGTTACCATTGCAATTGATGAGATCTCGTCCGATGAGTCTGGACGGTCATTTGTTATTAAAGCAACTGATGAAAATGTATATTTTTGGTGCTCAGAGAAGTCGAAGCTCTTGGGTACTGAGCTACTTGGGAAGGTATTTGTACTTTTTGTATTTAGTTTAATGAAATGACTAGTAATTGTTCGTTTGTTTGACTTATAGCTTCAAAAACTCATGTATATAATAGAATTTTTTATCGTTCATTCATGAAATGACCAGACCATGCTGGAAGGGCACATGCAAAACTTTTGTGATTTATTTTTCATGAGTTGTGGTAGTTATGAATTTTGATACGAAACATTTTGGTAGTTTTTTTTATTATTCAATTGGTTTCTAATGTTACTACTAACACTGGATATTTATATATATATTTAGAACAATGATATGTATTGGATGGTTGGTTGGATATTTCCTTATTATATCGCTTTGCCTTTACCACAGATGAAAGATTTACTTCAGAGGAGGCCCTCTATTGCTGAACTAACTGGAATCAGTGAATCTCGACTTGGTTGCTTTGCAACCCGCCTTCGTGCCTATCTTATGGAGTCAACTGCTGTTAACCAACATACAGCAAGTTCTGCAGATTTGCACTCAGTAGACACTACAAGAGAACTATCCCGTTCATCACATTTTGGACAATCATGTGCATCATCAAAATCTATTCGGTCAAGAAATTTAGGTAGTCAAACAGTCAAAGCAAATTCTGCACATCAGGGTAGTCTTAGCCCTAGGTTGAATTCCTTTAAAGAAGGCCTTCCCAAAACGTTGCTTTCTCTGAGAGATGCTGCTAGGGAAAAGTTCAGGAGGCGTGGAGACAACTTGGCTTTAGACAACCATATTGTTGCTTCATCGATTTCGACCGATGCATTCGGTCTTAATTCTGAAACTCATGCTGCTGATTCAAGTAGACCGTTATCTGCATCGAGTTTTTTGGAATCATTGGGAAAATTAGCTGCCCCAATTCCTGCAAGTTCATCTCATGTTCCTTGTGTGGTTTCACCTCTCTTTAGTCCTTACTATTGCTGGTGTCCTCCTGGTTCATCATCGATTTCG
CAGCGAAGGGAAGAACCGTCTCAATTCCCCATCCCATCCATTAGTGCATCTTCTCTTCCTCCGTTTCCTTCGCTGTTACCGGTTTCTGCAAACTTGTCGGTCTCCGTATCACCTTTGAATTTAGTTGATTCTCTGTCCGTGGATTTCCCTGCCTTATTTCCAGAGCCACTGGTCCGTCTGCCTTTGAAAACCTCCCAGCAGATCCCGACCTTCACTCCTTTGTTCTGCGATCCAATTGTTCACGTTCCAGTAATTGATGTTTGCTCTTCGGGTCAAGGCTACCTTGTTAGTGCAGGCCCTACCATTTCAACCTCCATTCCGCCATTGCATCATCCTAATCTCATGAATCCAATGATACCTGCTACTGATGTGGAAAAGGATGCTCGAGAGACGTTACGCCTGCTCATCAGCGGTTCAAGCCAGGGTAACCCTCAATTGATGAACGTACTCCCTGTTGTTCTAACAGATGCAGAAGCACATCAAAGTTTATTTTTAACAGGAAGCCGCGGTCTGTACAGTAATACTCGAGACATCGATGCAATTGCAAGCAGAATCGCTTCCTTGGGCATTGTGACACTTTCAGGGCAATCCTCAAGCGAGCATATTGGGAAGAAGCATAACGAGCTGAAGTGCCATCCAGCTGACAGCAGTGATTCCGAATGCTCTTGTTTGGATGGCGAGGACGAGCTTTCCCCGTCTCACTTGGAGGAGAAGAAATCAGGTTGATATATAAATTTTAGCAACAGAAATTCATTTCTTCTTTGTAGATCTTTGTCGCTTTTGTTGCCATCTGCCTGCTGCTACTATGGAGATATTTTGTGCTTTTGTATTTTGGGTTTATATAATTAGTGGTAGTTCTTCACCAAAAGAGGTTGGTGAGTTGGTATTTTGGTGTGTGTATTTGTTATAGTTCATATTTTCTTTCCTTCTGATAATAACAAGGCAAATGAAAAAACCTATTCATCCAATACCCAAGCTATTCAAATTAAATTAGCTTATAGGTATGTTGGATTATTATCCCAAGTATATTGAAGTTCCCGCAGGACCTCAGATGGACGACTTAATTAATTACATGTTCATATCTTCTCATTT
mRNA sequence
CACAGCAAAACGAAACCGGTGAGTTTCGAACTCTCTCATTCTCCCAAAGCGTGCTTGTGACTTAGCTAGGGTTTCACTCCGAGGAGAAAAATGCAGGAGAAGAAATCAGAAACTGCGATTCCGTCTCCGAGGTTCACCAAGGGGGGAGGCATTTTGGAGGAGTGGAGGAACGTTTTTGGACAGATGTAGAGGCACTAGTAGATGACTAAGGTTTATCGATGGCGAACCCTAGAAAAGAAGAATCCATTTCCAGCAACGTCAGCGATGGAACAGATCGGGCCGATCGTGATAACGTTGAGGAATTTAGGGATTCGTCTCGAGTTGGCGTTGTTTCTTCGAATCGTGTGGAGGTTTCGGGTGGTTCGCATGCTTCCACGACGGAGATTAACCTTACGGAGCGGCTGACTGATATACTTGTGGATGAAGGTGATGGCGATCTGATGCTTCAGCAGAGCGATCGGGAGGATAGGGTTATACGGTGGCTTCAGGCGCTGGATTTGCAAGTCATGGGCGCTTGTCGTGCGGATGAAAGGTTGAAGCCGTTGTTGAAGATGACTGCGTCTAATGGCATCGCGGAAGATCACCTTCTTGCTCATTTGAGTCAGGTTTGTGTTTAACTATGTATTGGATTTTTAATATATATATATATATAGTGGATTTTTAATTGTCGACGATAATTCACTGCTTTATATTCCATGCTTGTTCTGCAATTCTACGAAGGACTCCTTGGTGCCGATGGAGATGTCATGGAATCTTTATTTGGCCATTTCGAGCCTGTTGAAGTTGGAATTCTAGCGAGATGTTTCTGCATACCTCTGGTGTCTATTCGCGTCGGAAAAGTTAACAAGCAAGGGACTCTCCTTTGCCCTACGACCACAAGGGGAAATTTAAATCTAATGGTCCTCCCATCATCAGACTTTCGGCTCTCATTCATTGGGGATAATGGCCATGTAGAGAGACTATTCACTCTGAGTAACAGAATGACTAGTGTTACCATTGCAATTGATGAGATCTCGTCCGATGAGTCTGGACGGTCATTTGTTATTAAAGCAACTGATGAAAATGTATATTTTTGGTGCTCAGAGAAGTCGAAGCTCTTGGGTACTGAGCTACTTGGGAAGATGAAAGATTTACTTCAGAGGAGGCCCTCTATTGCTGAACTAACTGGAATCAGTGAATCTCGACTTGGTTGCTTTGCAACCCGCCTTCGTGCCTATCTTATGGAGTCAACTGCTGTTAACCAACATACAGCAAGTTCTGCAGATTTGCACTCAGTAGACACTACAAGAGAACTATCCCGTTCATCACATTTTGGACAATCATGTGCATCATCAAAATCTATTCGGTCAAGAAATTTAGGTAGTCAAACAGTCAAAGCAAATTCTGCACATCAGGGTAGTCTTAGCCCTAGGTTGAATTCCTTTAAAGAAGGCCTTCCCAAAACGTTGCTTTCTCTGAGAGATGCTGCTAGGGAAAAGTTCAGGAGGCGTGGAGACAACTTGGCTTTAGACAACCATATTGTTGCTTCATCGATTTCGACCGATGCATTCGGTCTTAATTCTGAAACTCATGCTGCTGATTCAAGTAGACCGTTATCTGCATCGAGTTTTTTGGAATCATTGGGAAAATTAGCTGCCCCAATTCCTGCAAGTTCATCTCATGTTCCTTGTGTGGTTTCACCTCTCTTTAGTCCTTACTATTGCTGGTGTCCTCCTGGTTCATCATCGATTTCGCAGCGAAGGGAAGAACCGTCTCAATTCCCCATCCCATCCATTAGTGCATCTTCTCTTCCTCCGTTTCCTTCGCTGTTACCGGTTTCTGCAAACTTGTCGGTCTCCGTATCACCTTTGAATTTAGTTGATTCTCTGTCCGTGGATTTCCCTGCCTTATTTCCAGAGCCACTGGTCCGTCTGCCTTTGAAAACCTCCCAGCAGATCCCGACCTTCACTCCTTTGTTCTGCGATCCAATTGTTCACGTTCCAGTAATTGATGTTTGCTCT
TCGGGTCAAGGCTACCTTGTTAGTGCAGGCCCTACCATTTCAACCTCCATTCCGCCATTGCATCATCCTAATCTCATGAATCCAATGATACCTGCTACTGATGTGGAAAAGGATGCTCGAGAGACGTTACGCCTGCTCATCAGCGGTTCAAGCCAGGGTAACCCTCAATTGATGAACGTACTCCCTGTTGTTCTAACAGATGCAGAAGCACATCAAAGTTTATTTTTAACAGGAAGCCGCGGTCTGTACAGTAATACTCGAGACATCGATGCAATTGCAAGCAGAATCGCTTCCTTGGGCATTGTGACACTTTCAGGGCAATCCTCAAGCGAGCATATTGGGAAGAAGCATAACGAGCTGAAGTGCCATCCAGCTGACAGCAGTGATTCCGAATGCTCTTGTTTGGATGGCGAGGACGAGCTTTCCCCGTCTCACTTGGAGGAGAAGAAATCAGGTTGATATATAAATTTTAGCAACAGAAATTCATTTCTTCTTTGTAGATCTTTGTCGCTTTTGTTGCCATCTGCCTGCTGCTACTATGGAGATATTTTGTGCTTTTGTATTTTGGGTTTATATAATTAGTGGTAGTTCTTCACCAAAAGAGGTTGGTGAGTTGGTATTTTGGTGTGTGTATTTGTTATAGTTCATATTTTCTTTCCTTCTGATAATAACAAGGCAAATGAAAAAACCTATTCATCCAATACCCAAGCTATTCAAATTAAATTAGCTTATAGGTATGTTGGATTATTATCCCAAGTATATTGAAGTTCCCGCAGGACCTCAGATGGACGACTTAATTAATTACATGTTCATATCTTCTCATTT
Coding sequence (CDS)
ATGCTTGTTCTGCAATTCTACGAAGGACTCCTTGGTGCCGATGGAGATGTCATGGAATCTTTATTTGGCCATTTCGAGCCTGTTGAAGTTGGAATTCTAGCGAGATGTTTCTGCATACCTCTGGTGTCTATTCGCGTCGGAAAAGTTAACAAGCAAGGGACTCTCCTTTGCCCTACGACCACAAGGGGAAATTTAAATCTAATGGTCCTCCCATCATCAGACTTTCGGCTCTCATTCATTGGGGATAATGGCCATGTAGAGAGACTATTCACTCTGAGTAACAGAATGACTAGTGTTACCATTGCAATTGATGAGATCTCGTCCGATGAGTCTGGACGGTCATTTGTTATTAAAGCAACTGATGAAAATGTATATTTTTGGTGCTCAGAGAAGTCGAAGCTCTTGGGTACTGAGCTACTTGGGAAGATGAAAGATTTACTTCAGAGGAGGCCCTCTATTGCTGAACTAACTGGAATCAGTGAATCTCGACTTGGTTGCTTTGCAACCCGCCTTCGTGCCTATCTTATGGAGTCAACTGCTGTTAACCAACATACAGCAAGTTCTGCAGATTTGCACTCAGTAGACACTACAAGAGAACTATCCCGTTCATCACATTTTGGACAATCATGTGCATCATCAAAATCTATTCGGTCAAGAAATTTAGGTAGTCAAACAGTCAAAGCAAATTCTGCACATCAGGGTAGTCTTAGCCCTAGGTTGAATTCCTTTAAAGAAGGCCTTCCCAAAACGTTGCTTTCTCTGAGAGATGCTGCTAGGGAAAAGTTCAGGAGGCGTGGAGACAACTTGGCTTTAGACAACCATATTGTTGCTTCATCGATTTCGACCGATGCATTCGGTCTTAATTCTGAAACTCATGCTGCTGATTCAAGTAGACCGTTATCTGCATCGAGTTTTTTGGAATCATTGGGAAAATTAGCTGCCCCAATTCCTGCAAGTTCATCTCATGTTCCTTGTGTGGTTTCACCTCTCTTTAGTCCTTACTATTGCTGGTGTCCTCCTGGTTCATCATCGATTTCGCAGCGAAGGGAAGAACCGTCTCAATTCCCCATCCCATCCATTAGTGCATCTTCTCTTCCTCCGTTTCCTTCGCTGTTACCGGTTTCTGCAAACTTGTCGGTCTCCGTATCACCTTTGAATTTAGTTGATTCTCTGTCCGTGGATTTCCCTGCCTTATTTCCAGAGCCACTGGTCCGTCTGCCTTTGAAAACCTCCCAGCAGATCCCGACCTTCACTCCTTTGTTCTGCGATCCAATTGTTCACGTTCCAGTAATTGATGTTTGCTCTTCGGGTCAAGGCTACCTTGTTAGTGCAGGCCCTACCATTTCAACCTCCATTCCGCCATTGCATCATCCTAATCTCATGAATCCAATGATACCTGCTACTGATGTGGAAAAGGATGCTCGAGAGACGTTACGCCTGCTCATCAGCGGTTCAAGCCAGGGTAACCCTCAATTGATGAACGTACTCCCTGTTGTTCTAACAGATGCAGAAGCACATCAAAGTTTATTTTTAACAGGAAGCCGCGGTCTGTACAGTAATACTCGAGACATCGATGCAATTGCAAGCAGAATCGCTTCCTTGGGCATTGTGACACTTTCAGGGCAATCCTCAAGCGAGCATATTGGGAAGAAGCATAACGAGCTGAAGTGCCATCCAGCTGACAGCAGTGATTCCGAATGCTCTTGTTTGGATGGCGAGGACGAGCTTTCCCCGTCTCACTTGGAGGAGAAGAAATCAGGTTGA
Protein sequence
MLVLQFYEGLLGADGDVMESLFGHFEPVEVGILARCFCIPLVSIRVGKVNKQGTLLCPTTTRGNLNLMVLPSSDFRLSFIGDNGHVERLFTLSNRMTSVTIAIDEISSDESGRSFVIKATDENVYFWCSEKSKLLGTELLGKMKDLLQRRPSIAELTGISESRLGCFATRLRAYLMESTAVNQHTASSADLHSVDTTRELSRSSHFGQSCASSKSIRSRNLGSQTVKANSAHQGSLSPRLNSFKEGLPKTLLSLRDAAREKFRRRGDNLALDNHIVASSISTDAFGLNSETHAADSSRPLSASSFLESLGKLAAPIPASSSHVPCVVSPLFSPYYCWCPPGSSSISQRREEPSQFPIPSISASSLPPFPSLLPVSANLSVSVSPLNLVDSLSVDFPALFPEPLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVCSSGQGYLVSAGPTISTSIPPLHHPNLMNPMIPATDVEKDARETLRLLISGSSQGNPQLMNVLPVVLTDAEAHQSLFLTGSRGLYSNTRDIDAIASRIASLGIVTLSGQSSSEHIGKKHNELKCHPADSSDSECSCLDGEDELSPSHLEEKKSG
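The protein sequence above should be the translation of the coding sequence (CDS) listed before it. The following is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1), which the page itself does not state. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced only for illustration.

```python
# Assumption: standard genetic code (NCBI translation table 1).
# The 64 codons are enumerated in T, C, A, G order, matching the
# amino-acid string below position by position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGCTTGTTCTGCAATTCTAC"))  # MLVLQFY
# Last four codons of the CDS, ending in the TGA stop codon.
print(translate("AAATCAGGTTGA"))  # KSG
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence extends the same check to the whole record.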
Homology
BLAST of Sed0002029.2 vs. NCBI nr
Match:
XP_022993059.1 (uncharacterized protein LOC111489188 isoform X2 [Cucurbita maxima] >XP_022993060.1 uncharacterized protein LOC111489188 isoform X2 [Cucurbita maxima])
HSP 1 Score: 900.2 bits (2325), Expect = 9.4e-258
Identity = 480/575 (83.48%), Positives = 510/575 (88.70%), Query Frame = 0
Query: 18 MESLFGHFEPVEVGILARCFCIPLVSIRVGKVNKQGTLLCPTTTRGNLNLMVLPSSDFRL 77
MES+FGHFEPVEVGILARCFCIPLVSIRVGK++KQGTLLCPTTTRGNLNLMVLPSSDFRL
Sbjct: 1 MESVFGHFEPVEVGILARCFCIPLVSIRVGKIDKQGTLLCPTTTRGNLNLMVLPSSDFRL 60
Query: 78 SFIGDNGHVERLFTLSNRMTSVTIAIDEISSDESGRSFVIKATDENVYFWCSEKSKLLGT 137
SFIGDNGHVERLFTLSNR +S I IDEI+SD SGRSFVIKA D+N YFWCSEKSKLLGT
Sbjct: 61 SFIGDNGHVERLFTLSNRSSSAAITIDEIASDSSGRSFVIKANDQNTYFWCSEKSKLLGT 120
Query: 138 ELLGKMKDLLQRRPSIAELTGISESRLGCFATRLRAYLMESTAVNQHTASSADLH-SVDT 197
ELL KMKDLLQRRPSIA LTGISESRLGCFATRLRAYL+EST N H ASSAD H SVDT
Sbjct: 121 ELLLKMKDLLQRRPSIAGLTGISESRLGCFATRLRAYLVESTVANHHPASSADSHSSVDT 180
Query: 198 TRELSRSSHFGQSCASSKSIRSRNLGSQTVKANSAHQGSLSPRLNSFKEGLPKTLLSLRD 257
TRELS SSHFGQ SSKS+RSRN GS VKANSAHQGSLSPRLNSFKEGLPKTLLSLRD
Sbjct: 181 TRELSHSSHFGQ---SSKSMRSRNYGSPAVKANSAHQGSLSPRLNSFKEGLPKTLLSLRD 240
Query: 258 AAREKFRRRGDNLALDNHIVASSISTDAFGLNSETHAADSSRPLSASSFLESLGKLAAPI 317
+AREKFRRRGDNLALDNHI SSIS D +NSET D S PLS S+FL+SLGKLAAP
Sbjct: 241 SAREKFRRRGDNLALDNHIATSSISND---VNSETQTGDLSCPLSPSNFLKSLGKLAAPT 300
Query: 318 PASSSHVPCVVSPLFSPYYCWCPPGSSSISQRREEPSQFPIPSISASSLPPFPSLLPVSA 377
PA+SSH PCVVSPLF+PYYCWC PGSSSI QRREEPSQ PIPS SASSLPPFPSL P SA
Sbjct: 301 PANSSHAPCVVSPLFTPYYCWC-PGSSSILQRREEPSQLPIPSFSASSLPPFPSLFPASA 360
Query: 378 --NLSVSVSPLNLVDSLSVDFPALFPEPLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVC 437
NLSV VSPLNLVDS S+DFPALFP+PLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVC
Sbjct: 361 PSNLSVPVSPLNLVDSPSLDFPALFPDPLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVC 420
Query: 438 SSGQGYLVSAGPTISTSIPPLHHPNLMNPMIPATDVEKDARETLRLLISGSSQGNPQLMN 497
SSG GYLVSAGPTI+TSIPPL HP L+NPM+PATDVEKDARETLRLLISGSSQGNPQLMN
Sbjct: 421 SSGPGYLVSAGPTITTSIPPL-HPKLVNPMLPATDVEKDARETLRLLISGSSQGNPQLMN 480
Query: 498 VLPVVLTDAEAHQSLFLTGSRGLYSNTRDIDAIASRIASLGIVTLSGQSSSEHIGKKHN- 557
VLPVVLTD+EA++SLFLTGS GLYSNTRDIDAIA+ IASLGI +LSG+S+SEH+GK+ N
Sbjct: 481 VLPVVLTDSEANRSLFLTGSHGLYSNTRDIDAIANSIASLGIASLSGKSTSEHVGKRFNL 540
Query: 558 -ELKCHPADSSDSECSCLDGEDELSPSHLEEKKSG 588
L HP DSSDSECSC +GED S SH EE+K G
Sbjct: 541 DGLNGHPDDSSDSECSCSEGEDVFSQSHFEERKFG 567
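The Identity and Positives percentages in the hit summary above follow from the reported counts divided by the alignment length (575 columns for this hit, gaps included). The following is a small sketch of that arithmetic. The function name `percent_of_alignment` is hypothetical and is not part of any BLAST tool.

```python
def percent_of_alignment(count: int, alignment_length: int) -> str:
    """Return count / alignment_length as a percentage with two decimals."""
    return f"{100 * count / alignment_length:.2f}"


# Counts taken from the first NCBI nr hit (XP_022993059.1).
print(percent_of_alignment(480, 575))  # 83.48  (identical residues)
print(percent_of_alignment(510, 575))  # 88.70  (identical or similar residues)
```

The same calculation reproduces the values for the other hits, for example 477/580 for the isoform X1 match.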
BLAST of Sed0002029.2 vs. NCBI nr
Match:
XP_038886409.1 (uncharacterized protein LOC120076604 isoform X2 [Benincasa hispida] >XP_038886410.1 uncharacterized protein LOC120076604 isoform X2 [Benincasa hispida])
HSP 1 Score: 897.1 bits (2317), Expect = 8.0e-257
Identity = 480/575 (83.48%), Positives = 513/575 (89.22%), Query Frame = 0
Query: 18 MESLFGHFEPVEVGILARCFCIPLVSIRVGKVNKQGTLLCPTTTRGNLNLMVLPSSDFRL 77
MES+FGHFEPVEVGILARCFCIPLVSIRVGK++KQG+LLCPTTTRGNLNLMV+PSSDFRL
Sbjct: 1 MESIFGHFEPVEVGILARCFCIPLVSIRVGKIDKQGSLLCPTTTRGNLNLMVVPSSDFRL 60
Query: 78 SFIGDNGHVERLFTLSNRMTSVTIAIDEISSDESGRSFVIKATDENVYFWCSEKSKLLGT 137
SFIGDNG VERLFTLSNR +S +I IDEI SD SGRSFVIKA D+N+YFWCSEKSKLLGT
Sbjct: 61 SFIGDNGQVERLFTLSNRSSSASITIDEIESDNSGRSFVIKANDQNIYFWCSEKSKLLGT 120
Query: 138 ELLGKMKDLLQRRPSIAELTGISESRLGCFATRLRAYLMESTAVNQHTASSADLH-SVDT 197
EL+ KMKDLLQRRPSI+ELTGISESRLGCFATRLRAYL+EST N H ASSAD H S DT
Sbjct: 121 ELILKMKDLLQRRPSISELTGISESRLGCFATRLRAYLVESTVANHHPASSADSHSSADT 180
Query: 198 TRELSRSSHFGQSCASSKSIRSRNLGSQTVKANSAHQGSLSPRLNSFKEGLPKTLLSLRD 257
TRE S SSH GQS SSKS+RSRN GS KANSAHQGSLSPRLNSFKEGLPKTLLSLRD
Sbjct: 181 TRESSHSSHCGQSSVSSKSMRSRNSGSPATKANSAHQGSLSPRLNSFKEGLPKTLLSLRD 240
Query: 258 AAREKFRRRGDNLALDNHIVASSISTDAFGLNSETHAADSSRPLSASSFLESLGKLAAPI 317
AAREKFRRRG+NL LDNHIVASSISTDAF LNSET ADSS PLS S+FLESLGKLAAPI
Sbjct: 241 AAREKFRRRGENLGLDNHIVASSISTDAFCLNSETQTADSSCPLSPSNFLESLGKLAAPI 300
Query: 318 PASSSHVPCVVSPLFSPYYCWCPPGSSSISQRREEPSQFPIPSISASSLPPFPSLLPVS- 377
PASSS +PCVVSPLF+PYYCWC PG+SSI QRREE +Q PIPSISASSLPPFPS+LP S
Sbjct: 301 PASSS-LPCVVSPLFTPYYCWC-PGASSILQRREESNQLPIPSISASSLPPFPSMLPAST 360
Query: 378 -ANLSVSVSPLNLVDSLSVDFPALFPEPLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVC 437
+NLSV +SPLNLVDS SVDFPALFPEPLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVC
Sbjct: 361 PSNLSVPISPLNLVDSPSVDFPALFPEPLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVC 420
Query: 438 SSGQGYLVSAGPTISTSIPPLHHPNLMNPMIPATDVEKDARETLRLLISGSSQGNPQLMN 497
SSG GYLVSAGPTISTSIPPL HP L+NPMIP TDVEKDARETLRLLISGSS GN QLMN
Sbjct: 421 SSGPGYLVSAGPTISTSIPPL-HPKLVNPMIPTTDVEKDARETLRLLISGSSPGNSQLMN 480
Query: 498 VLPVVLTDAEAHQSLFLTGSRGLYSNTRDIDAIASRIASLGIVTLSGQSSSEHIGKKHN- 557
VLPVVLTD+EA+QSLFLTGSRGLYSN RDID IA+ IASLGIV+LSGQS+SEH+GK+ N
Sbjct: 481 VLPVVLTDSEANQSLFLTGSRGLYSNARDIDVIANSIASLGIVSLSGQSTSEHVGKRFNI 540
Query: 558 -ELKCHPADSSDSECSCLDGEDELSPSHLEEKKSG 588
L H DS DSE S LDG+D LSPSH +E+KSG
Sbjct: 541 DGLNGHSDDSCDSESSYLDGDDMLSPSHSKERKSG 572
BLAST of Sed0002029.2 vs. NCBI nr
Match:
XP_022939304.1 (uncharacterized protein LOC111445260 isoform X2 [Cucurbita moschata])
HSP 1 Score: 896.0 bits (2314), Expect = 1.8e-256
Identity = 480/575 (83.48%), Positives = 507/575 (88.17%), Query Frame = 0
Query: 18 MESLFGHFEPVEVGILARCFCIPLVSIRVGKVNKQGTLLCPTTTRGNLNLMVLPSSDFRL 77
MES+FGHFEPVEVGILARCFCIPLVSIRVGK++KQGTLLCPTT RGNLNLMVLPSSDFRL
Sbjct: 1 MESVFGHFEPVEVGILARCFCIPLVSIRVGKIDKQGTLLCPTTARGNLNLMVLPSSDFRL 60
Query: 78 SFIGDNGHVERLFTLSNRMTSVTIAIDEISSDESGRSFVIKATDENVYFWCSEKSKLLGT 137
SFIGDNGHVERLFTLSNR +S I IDEI+SD SGRSFVIKA D+N YFWCSEKSKLLGT
Sbjct: 61 SFIGDNGHVERLFTLSNRSSSAAITIDEIASDSSGRSFVIKANDQNTYFWCSEKSKLLGT 120
Query: 138 ELLGKMKDLLQRRPSIAELTGISESRLGCFATRLRAYLMESTAVNQHTASSADLH-SVDT 197
ELL KMKDLLQRRPSIA LTGISESRLGCFATRLRAYL+EST N H ASSAD H SVDT
Sbjct: 121 ELLLKMKDLLQRRPSIAGLTGISESRLGCFATRLRAYLVESTVANHHPASSADSHSSVDT 180
Query: 198 TRELSRSSHFGQSCASSKSIRSRNLGSQTVKANSAHQGSLSPRLNSFKEGLPKTLLSLRD 257
TRELS SSHFGQ SSKSIRSRN GS VKANSAHQGSLSPRLNSFKEGLPKTLLSLRD
Sbjct: 181 TRELSHSSHFGQ---SSKSIRSRNYGSPAVKANSAHQGSLSPRLNSFKEGLPKTLLSLRD 240
Query: 258 AAREKFRRRGDNLALDNHIVASSISTDAFGLNSETHAADSSRPLSASSFLESLGKLAAPI 317
AAREKFRRRGDNLALDNHI SSIS D +NSET D S PLS S+FL+SLGKLAAP
Sbjct: 241 AAREKFRRRGDNLALDNHIATSSISND---VNSETQTGDLSCPLSPSNFLKSLGKLAAPT 300
Query: 318 PASSSHVPCVVSPLFSPYYCWCPPGSSSISQRREEPSQFPIPSISASSLPPFPSLLPVSA 377
PA+SSH PCVVSPLF+PYYCWC PGSSSI QRREEPSQ PIPS SASSLPPFPSL P SA
Sbjct: 301 PANSSHAPCVVSPLFTPYYCWC-PGSSSILQRREEPSQLPIPSFSASSLPPFPSLFPASA 360
Query: 378 --NLSVSVSPLNLVDSLSVDFPALFPEPLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVC 437
NLSV VSPLNLVDS S+DFPALFP+PLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVC
Sbjct: 361 PSNLSVPVSPLNLVDSPSLDFPALFPDPLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVC 420
Query: 438 SSGQGYLVSAGPTISTSIPPLHHPNLMNPMIPATDVEKDARETLRLLISGSSQGNPQLMN 497
SSG GYLVSAGPTI+TSIPPL HP L+NPM+PATDVEKDARETLRLLISGSSQGNPQLMN
Sbjct: 421 SSGPGYLVSAGPTITTSIPPL-HPKLVNPMLPATDVEKDARETLRLLISGSSQGNPQLMN 480
Query: 498 VLPVVLTDAEAHQSLFLTGSRGLYSNTRDIDAIASRIASLGIVTLSGQSSSEHIGKKHN- 557
VLPVVLTD+EA++SLFLTGS GLYSNTRDIDAIA+ IASLGI +LSG+S+SEH+GK+ N
Sbjct: 481 VLPVVLTDSEANRSLFLTGSHGLYSNTRDIDAIANSIASLGIASLSGKSTSEHVGKRFNL 540
Query: 558 -ELKCHPADSSDSECSCLDGEDELSPSHLEEKKSG 588
L HP DSSDSE SC +GED S SH EE K G
Sbjct: 541 DGLNGHPDDSSDSESSCSEGEDVFSQSHFEESKFG 567
BLAST of Sed0002029.2 vs. NCBI nr
Match:
XP_022993058.1 (uncharacterized protein LOC111489188 isoform X1 [Cucurbita maxima])
HSP 1 Score: 892.1 bits (2304), Expect = 2.6e-255
Identity = 477/580 (82.24%), Positives = 509/580 (87.76%), Query Frame = 0
Query: 13 ADGDVMESLFGHFEPVEVGILARCFCIPLVSIRVGKVNKQGTLLCPTTTRGNLNLMVLPS 72
A+ ++ L HFEPVEVGILARCFCIPLVSIRVGK++KQGTLLCPTTTRGNLNLMVLPS
Sbjct: 116 AEDRLLAQLSQHFEPVEVGILARCFCIPLVSIRVGKIDKQGTLLCPTTTRGNLNLMVLPS 175
Query: 73 SDFRLSFIGDNGHVERLFTLSNRMTSVTIAIDEISSDESGRSFVIKATDENVYFWCSEKS 132
SDFRLSFIGDNGHVERLFTLSNR +S I IDEI+SD SGRSFVIKA D+N YFWCSEKS
Sbjct: 176 SDFRLSFIGDNGHVERLFTLSNRSSSAAITIDEIASDSSGRSFVIKANDQNTYFWCSEKS 235
Query: 133 KLLGTELLGKMKDLLQRRPSIAELTGISESRLGCFATRLRAYLMESTAVNQHTASSADLH 192
KLLGTELL KMKDLLQRRPSIA LTGISESRLGCFATRLRAYL+EST N H ASSAD H
Sbjct: 236 KLLGTELLLKMKDLLQRRPSIAGLTGISESRLGCFATRLRAYLVESTVANHHPASSADSH 295
Query: 193 -SVDTTRELSRSSHFGQSCASSKSIRSRNLGSQTVKANSAHQGSLSPRLNSFKEGLPKTL 252
SVDTTRELS SSHFGQ SSKS+RSRN GS VKANSAHQGSLSPRLNSFKEGLPKTL
Sbjct: 296 SSVDTTRELSHSSHFGQ---SSKSMRSRNYGSPAVKANSAHQGSLSPRLNSFKEGLPKTL 355
Query: 253 LSLRDAAREKFRRRGDNLALDNHIVASSISTDAFGLNSETHAADSSRPLSASSFLESLGK 312
LSLRD+AREKFRRRGDNLALDNHI SSIS D +NSET D S PLS S+FL+SLGK
Sbjct: 356 LSLRDSAREKFRRRGDNLALDNHIATSSISND---VNSETQTGDLSCPLSPSNFLKSLGK 415
Query: 313 LAAPIPASSSHVPCVVSPLFSPYYCWCPPGSSSISQRREEPSQFPIPSISASSLPPFPSL 372
LAAP PA+SSH PCVVSPLF+PYYCWC PGSSSI QRREEPSQ PIPS SASSLPPFPSL
Sbjct: 416 LAAPTPANSSHAPCVVSPLFTPYYCWC-PGSSSILQRREEPSQLPIPSFSASSLPPFPSL 475
Query: 373 LPVSA--NLSVSVSPLNLVDSLSVDFPALFPEPLVRLPLKTSQQIPTFTPLFCDPIVHVP 432
P SA NLSV VSPLNLVDS S+DFPALFP+PLVRLPLKTSQQIPTFTPLFCDPIVHVP
Sbjct: 476 FPASAPSNLSVPVSPLNLVDSPSLDFPALFPDPLVRLPLKTSQQIPTFTPLFCDPIVHVP 535
Query: 433 VIDVCSSGQGYLVSAGPTISTSIPPLHHPNLMNPMIPATDVEKDARETLRLLISGSSQGN 492
VIDVCSSG GYLVSAGPTI+TSIPPL HP L+NPM+PATDVEKDARETLRLLISGSSQGN
Sbjct: 536 VIDVCSSGPGYLVSAGPTITTSIPPL-HPKLVNPMLPATDVEKDARETLRLLISGSSQGN 595
Query: 493 PQLMNVLPVVLTDAEAHQSLFLTGSRGLYSNTRDIDAIASRIASLGIVTLSGQSSSEHIG 552
PQLMNVLPVVLTD+EA++SLFLTGS GLYSNTRDIDAIA+ IASLGI +LSG+S+SEH+G
Sbjct: 596 PQLMNVLPVVLTDSEANRSLFLTGSHGLYSNTRDIDAIANSIASLGIASLSGKSTSEHVG 655
Query: 553 KKHN--ELKCHPADSSDSECSCLDGEDELSPSHLEEKKSG 588
K+ N L HP DSSDSECSC +GED S SH EE+K G
Sbjct: 656 KRFNLDGLNGHPDDSSDSECSCSEGEDVFSQSHFEERKFG 687
BLAST of Sed0002029.2 vs. NCBI nr
Match:
XP_023550042.1 (uncharacterized protein LOC111808350 isoform X2 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 890.6 bits (2300), Expect = 7.5e-255
Identity = 477/575 (82.96%), Positives = 507/575 (88.17%), Query Frame = 0
Query: 18 MESLFGHFEPVEVGILARCFCIPLVSIRVGKVNKQGTLLCPTTTRGNLNLMVLPSSDFRL 77
MES+FGHFEPVEVGILARCFCIPLVSIRVGK++KQGTLLCPTTTRGNLNLMVLPSSDFRL
Sbjct: 1 MESVFGHFEPVEVGILARCFCIPLVSIRVGKIDKQGTLLCPTTTRGNLNLMVLPSSDFRL 60
Query: 78 SFIGDNGHVERLFTLSNRMTSVTIAIDEISSDESGRSFVIKATDENVYFWCSEKSKLLGT 137
SFIGDNGHVERLFTLSNR +S I IDEI+SD SGRSFV+KA D+N YFWCSEKSKLLGT
Sbjct: 61 SFIGDNGHVERLFTLSNRPSSAAITIDEIASDCSGRSFVVKANDQNTYFWCSEKSKLLGT 120
Query: 138 ELLGKMKDLLQRRPSIAELTGISESRLGCFATRLRAYLMESTAVNQHTASSADLH-SVDT 197
ELL KMKDLL RRPSIA LTGISESRLGCFATRLRAYL+EST N H ASSAD H SV+T
Sbjct: 121 ELLLKMKDLLLRRPSIAGLTGISESRLGCFATRLRAYLVESTVANHHPASSADSHSSVET 180
Query: 198 TRELSRSSHFGQSCASSKSIRSRNLGSQTVKANSAHQGSLSPRLNSFKEGLPKTLLSLRD 257
TRELS SSHFGQ SSKS+RSRN GS VKANSAHQGSLSPRLNSFKEGLPKTLLSLRD
Sbjct: 181 TRELSHSSHFGQ---SSKSMRSRNYGSPAVKANSAHQGSLSPRLNSFKEGLPKTLLSLRD 240
Query: 258 AAREKFRRRGDNLALDNHIVASSISTDAFGLNSETHAADSSRPLSASSFLESLGKLAAPI 317
AAREKFRRRGDNLALDNHI S IS D +NSET AD S PLS S+FL+SLGKLAAP
Sbjct: 241 AAREKFRRRGDNLALDNHIATSPISND---VNSETQTADLSCPLSPSNFLKSLGKLAAPT 300
Query: 318 PASSSHVPCVVSPLFSPYYCWCPPGSSSISQRREEPSQFPIPSISASSLPPFPSLLPVSA 377
PA+SSH PCVVSPLF+PYYCWC PGSSSI QRREEPSQ PIPS ASSLPPFPSL P SA
Sbjct: 301 PANSSHAPCVVSPLFTPYYCWC-PGSSSILQRREEPSQLPIPSFGASSLPPFPSLFPASA 360
Query: 378 --NLSVSVSPLNLVDSLSVDFPALFPEPLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVC 437
NLSV VSPLNLVDS S+DFPALFP+PLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVC
Sbjct: 361 PSNLSVPVSPLNLVDSPSLDFPALFPDPLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVC 420
Query: 438 SSGQGYLVSAGPTISTSIPPLHHPNLMNPMIPATDVEKDARETLRLLISGSSQGNPQLMN 497
SSG GYLVSAGPTI+TSIPPL HPNL+NPM+PATDVEKDARETLRLLISGSSQGNPQLMN
Sbjct: 421 SSGPGYLVSAGPTITTSIPPL-HPNLVNPMLPATDVEKDARETLRLLISGSSQGNPQLMN 480
Query: 498 VLPVVLTDAEAHQSLFLTGSRGLYSNTRDIDAIASRIASLGIVTLSGQSSSEHIGKKHN- 557
VLPVVLTD+EA++SLFLTGS GLYSNTRDIDAIA+ IASLGI +LSG+S+SEH+GK+ N
Sbjct: 481 VLPVVLTDSEANRSLFLTGSHGLYSNTRDIDAIANSIASLGIASLSGKSTSEHVGKRFNL 540
Query: 558 -ELKCHPADSSDSECSCLDGEDELSPSHLEEKKSG 588
L HP DSSDSE SC +GED S SH EE K G
Sbjct: 541 DGLNGHPDDSSDSESSCSEGEDVFSQSHFEESKFG 567
BLAST of Sed0002029.2 vs. ExPASy TrEMBL
Match:
A0A6J1JXG8 (uncharacterized protein LOC111489188 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111489188 PE=4 SV=1)
HSP 1 Score: 900.2 bits (2325), Expect = 4.6e-258
Identity = 480/575 (83.48%), Positives = 510/575 (88.70%), Query Frame = 0
Query: 18 MESLFGHFEPVEVGILARCFCIPLVSIRVGKVNKQGTLLCPTTTRGNLNLMVLPSSDFRL 77
MES+FGHFEPVEVGILARCFCIPLVSIRVGK++KQGTLLCPTTTRGNLNLMVLPSSDFRL
Sbjct: 1 MESVFGHFEPVEVGILARCFCIPLVSIRVGKIDKQGTLLCPTTTRGNLNLMVLPSSDFRL 60
Query: 78 SFIGDNGHVERLFTLSNRMTSVTIAIDEISSDESGRSFVIKATDENVYFWCSEKSKLLGT 137
SFIGDNGHVERLFTLSNR +S I IDEI+SD SGRSFVIKA D+N YFWCSEKSKLLGT
Sbjct: 61 SFIGDNGHVERLFTLSNRSSSAAITIDEIASDSSGRSFVIKANDQNTYFWCSEKSKLLGT 120
Query: 138 ELLGKMKDLLQRRPSIAELTGISESRLGCFATRLRAYLMESTAVNQHTASSADLH-SVDT 197
ELL KMKDLLQRRPSIA LTGISESRLGCFATRLRAYL+EST N H ASSAD H SVDT
Sbjct: 121 ELLLKMKDLLQRRPSIAGLTGISESRLGCFATRLRAYLVESTVANHHPASSADSHSSVDT 180
Query: 198 TRELSRSSHFGQSCASSKSIRSRNLGSQTVKANSAHQGSLSPRLNSFKEGLPKTLLSLRD 257
TRELS SSHFGQ SSKS+RSRN GS VKANSAHQGSLSPRLNSFKEGLPKTLLSLRD
Sbjct: 181 TRELSHSSHFGQ---SSKSMRSRNYGSPAVKANSAHQGSLSPRLNSFKEGLPKTLLSLRD 240
Query: 258 AAREKFRRRGDNLALDNHIVASSISTDAFGLNSETHAADSSRPLSASSFLESLGKLAAPI 317
+AREKFRRRGDNLALDNHI SSIS D +NSET D S PLS S+FL+SLGKLAAP
Sbjct: 241 SAREKFRRRGDNLALDNHIATSSISND---VNSETQTGDLSCPLSPSNFLKSLGKLAAPT 300
Query: 318 PASSSHVPCVVSPLFSPYYCWCPPGSSSISQRREEPSQFPIPSISASSLPPFPSLLPVSA 377
PA+SSH PCVVSPLF+PYYCWC PGSSSI QRREEPSQ PIPS SASSLPPFPSL P SA
Sbjct: 301 PANSSHAPCVVSPLFTPYYCWC-PGSSSILQRREEPSQLPIPSFSASSLPPFPSLFPASA 360
Query: 378 --NLSVSVSPLNLVDSLSVDFPALFPEPLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVC 437
NLSV VSPLNLVDS S+DFPALFP+PLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVC
Sbjct: 361 PSNLSVPVSPLNLVDSPSLDFPALFPDPLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVC 420
Query: 438 SSGQGYLVSAGPTISTSIPPLHHPNLMNPMIPATDVEKDARETLRLLISGSSQGNPQLMN 497
SSG GYLVSAGPTI+TSIPPL HP L+NPM+PATDVEKDARETLRLLISGSSQGNPQLMN
Sbjct: 421 SSGPGYLVSAGPTITTSIPPL-HPKLVNPMLPATDVEKDARETLRLLISGSSQGNPQLMN 480
Query: 498 VLPVVLTDAEAHQSLFLTGSRGLYSNTRDIDAIASRIASLGIVTLSGQSSSEHIGKKHN- 557
VLPVVLTD+EA++SLFLTGS GLYSNTRDIDAIA+ IASLGI +LSG+S+SEH+GK+ N
Sbjct: 481 VLPVVLTDSEANRSLFLTGSHGLYSNTRDIDAIANSIASLGIASLSGKSTSEHVGKRFNL 540
Query: 558 -ELKCHPADSSDSECSCLDGEDELSPSHLEEKKSG 588
L HP DSSDSECSC +GED S SH EE+K G
Sbjct: 541 DGLNGHPDDSSDSECSCSEGEDVFSQSHFEERKFG 567
BLAST of Sed0002029.2 vs. ExPASy TrEMBL
Match:
A0A6J1FGS2 (uncharacterized protein LOC111445260 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111445260 PE=4 SV=1)
HSP 1 Score: 896.0 bits (2314), Expect = 8.6e-257
Identity = 480/575 (83.48%), Positives = 507/575 (88.17%), Query Frame = 0
Query: 18 MESLFGHFEPVEVGILARCFCIPLVSIRVGKVNKQGTLLCPTTTRGNLNLMVLPSSDFRL 77
MES+FGHFEPVEVGILARCFCIPLVSIRVGK++KQGTLLCPTT RGNLNLMVLPSSDFRL
Sbjct: 1 MESVFGHFEPVEVGILARCFCIPLVSIRVGKIDKQGTLLCPTTARGNLNLMVLPSSDFRL 60
Query: 78 SFIGDNGHVERLFTLSNRMTSVTIAIDEISSDESGRSFVIKATDENVYFWCSEKSKLLGT 137
SFIGDNGHVERLFTLSNR +S I IDEI+SD SGRSFVIKA D+N YFWCSEKSKLLGT
Sbjct: 61 SFIGDNGHVERLFTLSNRSSSAAITIDEIASDSSGRSFVIKANDQNTYFWCSEKSKLLGT 120
Query: 138 ELLGKMKDLLQRRPSIAELTGISESRLGCFATRLRAYLMESTAVNQHTASSADLH-SVDT 197
ELL KMKDLLQRRPSIA LTGISESRLGCFATRLRAYL+EST N H ASSAD H SVDT
Sbjct: 121 ELLLKMKDLLQRRPSIAGLTGISESRLGCFATRLRAYLVESTVANHHPASSADSHSSVDT 180
Query: 198 TRELSRSSHFGQSCASSKSIRSRNLGSQTVKANSAHQGSLSPRLNSFKEGLPKTLLSLRD 257
TRELS SSHFGQ SSKSIRSRN GS VKANSAHQGSLSPRLNSFKEGLPKTLLSLRD
Sbjct: 181 TRELSHSSHFGQ---SSKSIRSRNYGSPAVKANSAHQGSLSPRLNSFKEGLPKTLLSLRD 240
Query: 258 AAREKFRRRGDNLALDNHIVASSISTDAFGLNSETHAADSSRPLSASSFLESLGKLAAPI 317
AAREKFRRRGDNLALDNHI SSIS D +NSET D S PLS S+FL+SLGKLAAP
Sbjct: 241 AAREKFRRRGDNLALDNHIATSSISND---VNSETQTGDLSCPLSPSNFLKSLGKLAAPT 300
Query: 318 PASSSHVPCVVSPLFSPYYCWCPPGSSSISQRREEPSQFPIPSISASSLPPFPSLLPVSA 377
PA+SSH PCVVSPLF+PYYCWC PGSSSI QRREEPSQ PIPS SASSLPPFPSL P SA
Sbjct: 301 PANSSHAPCVVSPLFTPYYCWC-PGSSSILQRREEPSQLPIPSFSASSLPPFPSLFPASA 360
Query: 378 --NLSVSVSPLNLVDSLSVDFPALFPEPLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVC 437
NLSV VSPLNLVDS S+DFPALFP+PLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVC
Sbjct: 361 PSNLSVPVSPLNLVDSPSLDFPALFPDPLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVC 420
Query: 438 SSGQGYLVSAGPTISTSIPPLHHPNLMNPMIPATDVEKDARETLRLLISGSSQGNPQLMN 497
SSG GYLVSAGPTI+TSIPPL HP L+NPM+PATDVEKDARETLRLLISGSSQGNPQLMN
Sbjct: 421 SSGPGYLVSAGPTITTSIPPL-HPKLVNPMLPATDVEKDARETLRLLISGSSQGNPQLMN 480
Query: 498 VLPVVLTDAEAHQSLFLTGSRGLYSNTRDIDAIASRIASLGIVTLSGQSSSEHIGKKHN- 557
VLPVVLTD+EA++SLFLTGS GLYSNTRDIDAIA+ IASLGI +LSG+S+SEH+GK+ N
Sbjct: 481 VLPVVLTDSEANRSLFLTGSHGLYSNTRDIDAIANSIASLGIASLSGKSTSEHVGKRFNL 540
Query: 558 -ELKCHPADSSDSECSCLDGEDELSPSHLEEKKSG 588
L HP DSSDSE SC +GED S SH EE K G
Sbjct: 541 DGLNGHPDDSSDSESSCSEGEDVFSQSHFEESKFG 567
BLAST of Sed0002029.2 vs. ExPASy TrEMBL
Match:
A0A6J1K125 (uncharacterized protein LOC111489188 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111489188 PE=4 SV=1)
HSP 1 Score: 892.1 bits (2304), Expect = 1.2e-255
Identity = 477/580 (82.24%), Positives = 509/580 (87.76%), Query Frame = 0
Query: 13 ADGDVMESLFGHFEPVEVGILARCFCIPLVSIRVGKVNKQGTLLCPTTTRGNLNLMVLPS 72
A+ ++ L HFEPVEVGILARCFCIPLVSIRVGK++KQGTLLCPTTTRGNLNLMVLPS
Sbjct: 116 AEDRLLAQLSQHFEPVEVGILARCFCIPLVSIRVGKIDKQGTLLCPTTTRGNLNLMVLPS 175
Query: 73 SDFRLSFIGDNGHVERLFTLSNRMTSVTIAIDEISSDESGRSFVIKATDENVYFWCSEKS 132
SDFRLSFIGDNGHVERLFTLSNR +S I IDEI+SD SGRSFVIKA D+N YFWCSEKS
Sbjct: 176 SDFRLSFIGDNGHVERLFTLSNRSSSAAITIDEIASDSSGRSFVIKANDQNTYFWCSEKS 235
Query: 133 KLLGTELLGKMKDLLQRRPSIAELTGISESRLGCFATRLRAYLMESTAVNQHTASSADLH 192
KLLGTELL KMKDLLQRRPSIA LTGISESRLGCFATRLRAYL+EST N H ASSAD H
Sbjct: 236 KLLGTELLLKMKDLLQRRPSIAGLTGISESRLGCFATRLRAYLVESTVANHHPASSADSH 295
Query: 193 -SVDTTRELSRSSHFGQSCASSKSIRSRNLGSQTVKANSAHQGSLSPRLNSFKEGLPKTL 252
SVDTTRELS SSHFGQ SSKS+RSRN GS VKANSAHQGSLSPRLNSFKEGLPKTL
Sbjct: 296 SSVDTTRELSHSSHFGQ---SSKSMRSRNYGSPAVKANSAHQGSLSPRLNSFKEGLPKTL 355
Query: 253 LSLRDAAREKFRRRGDNLALDNHIVASSISTDAFGLNSETHAADSSRPLSASSFLESLGK 312
LSLRD+AREKFRRRGDNLALDNHI SSIS D +NSET D S PLS S+FL+SLGK
Sbjct: 356 LSLRDSAREKFRRRGDNLALDNHIATSSISND---VNSETQTGDLSCPLSPSNFLKSLGK 415
Query: 313 LAAPIPASSSHVPCVVSPLFSPYYCWCPPGSSSISQRREEPSQFPIPSISASSLPPFPSL 372
LAAP PA+SSH PCVVSPLF+PYYCWC PGSSSI QRREEPSQ PIPS SASSLPPFPSL
Sbjct: 416 LAAPTPANSSHAPCVVSPLFTPYYCWC-PGSSSILQRREEPSQLPIPSFSASSLPPFPSL 475
Query: 373 LPVSA--NLSVSVSPLNLVDSLSVDFPALFPEPLVRLPLKTSQQIPTFTPLFCDPIVHVP 432
P SA NLSV VSPLNLVDS S+DFPALFP+PLVRLPLKTSQQIPTFTPLFCDPIVHVP
Sbjct: 476 FPASAPSNLSVPVSPLNLVDSPSLDFPALFPDPLVRLPLKTSQQIPTFTPLFCDPIVHVP 535
Query: 433 VIDVCSSGQGYLVSAGPTISTSIPPLHHPNLMNPMIPATDVEKDARETLRLLISGSSQGN 492
VIDVCSSG GYLVSAGPTI+TSIPPL HP L+NPM+PATDVEKDARETLRLLISGSSQGN
Sbjct: 536 VIDVCSSGPGYLVSAGPTITTSIPPL-HPKLVNPMLPATDVEKDARETLRLLISGSSQGN 595
Query: 493 PQLMNVLPVVLTDAEAHQSLFLTGSRGLYSNTRDIDAIASRIASLGIVTLSGQSSSEHIG 552
PQLMNVLPVVLTD+EA++SLFLTGS GLYSNTRDIDAIA+ IASLGI +LSG+S+SEH+G
Sbjct: 596 PQLMNVLPVVLTDSEANRSLFLTGSHGLYSNTRDIDAIANSIASLGIASLSGKSTSEHVG 655
Query: 553 KKHN--ELKCHPADSSDSECSCLDGEDELSPSHLEEKKSG 588
K+ N L HP DSSDSECSC +GED S SH EE+K G
Sbjct: 656 KRFNLDGLNGHPDDSSDSECSCSEGEDVFSQSHFEERKFG 687
BLAST of Sed0002029.2 vs. ExPASy TrEMBL
Match:
A0A6J1FLA2 (uncharacterized protein LOC111445260 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111445260 PE=4 SV=1)
HSP 1 Score: 887.9 bits (2293), Expect = 2.3e-254
Identity = 477/580 (82.24%), Positives = 506/580 (87.24%), Query Frame = 0
Query: 13 ADGDVMESLFGHFEPVEVGILARCFCIPLVSIRVGKVNKQGTLLCPTTTRGNLNLMVLPS 72
A+ ++ L HFEPVEVGILARCFCIPLVSIRVGK++KQGTLLCPTT RGNLNLMVLPS
Sbjct: 116 AEDRLLAQLSQHFEPVEVGILARCFCIPLVSIRVGKIDKQGTLLCPTTARGNLNLMVLPS 175
Query: 73 SDFRLSFIGDNGHVERLFTLSNRMTSVTIAIDEISSDESGRSFVIKATDENVYFWCSEKS 132
SDFRLSFIGDNGHVERLFTLSNR +S I IDEI+SD SGRSFVIKA D+N YFWCSEKS
Sbjct: 176 SDFRLSFIGDNGHVERLFTLSNRSSSAAITIDEIASDSSGRSFVIKANDQNTYFWCSEKS 235
Query: 133 KLLGTELLGKMKDLLQRRPSIAELTGISESRLGCFATRLRAYLMESTAVNQHTASSADLH 192
KLLGTELL KMKDLLQRRPSIA LTGISESRLGCFATRLRAYL+EST N H ASSAD H
Sbjct: 236 KLLGTELLLKMKDLLQRRPSIAGLTGISESRLGCFATRLRAYLVESTVANHHPASSADSH 295
Query: 193 -SVDTTRELSRSSHFGQSCASSKSIRSRNLGSQTVKANSAHQGSLSPRLNSFKEGLPKTL 252
SVDTTRELS SSHFGQ SSKSIRSRN GS VKANSAHQGSLSPRLNSFKEGLPKTL
Sbjct: 296 SSVDTTRELSHSSHFGQ---SSKSIRSRNYGSPAVKANSAHQGSLSPRLNSFKEGLPKTL 355
Query: 253 LSLRDAAREKFRRRGDNLALDNHIVASSISTDAFGLNSETHAADSSRPLSASSFLESLGK 312
LSLRDAAREKFRRRGDNLALDNHI SSIS D +NSET D S PLS S+FL+SLGK
Sbjct: 356 LSLRDAAREKFRRRGDNLALDNHIATSSISND---VNSETQTGDLSCPLSPSNFLKSLGK 415
Query: 313 LAAPIPASSSHVPCVVSPLFSPYYCWCPPGSSSISQRREEPSQFPIPSISASSLPPFPSL 372
LAAP PA+SSH PCVVSPLF+PYYCWC PGSSSI QRREEPSQ PIPS SASSLPPFPSL
Sbjct: 416 LAAPTPANSSHAPCVVSPLFTPYYCWC-PGSSSILQRREEPSQLPIPSFSASSLPPFPSL 475
Query: 373 LPVSA--NLSVSVSPLNLVDSLSVDFPALFPEPLVRLPLKTSQQIPTFTPLFCDPIVHVP 432
P SA NLSV VSPLNLVDS S+DFPALFP+PLVRLPLKTSQQIPTFTPLFCDPIVHVP
Sbjct: 476 FPASAPSNLSVPVSPLNLVDSPSLDFPALFPDPLVRLPLKTSQQIPTFTPLFCDPIVHVP 535
Query: 433 VIDVCSSGQGYLVSAGPTISTSIPPLHHPNLMNPMIPATDVEKDARETLRLLISGSSQGN 492
VIDVCSSG GYLVSAGPTI+TSIPPL HP L+NPM+PATDVEKDARETLRLLISGSSQGN
Sbjct: 536 VIDVCSSGPGYLVSAGPTITTSIPPL-HPKLVNPMLPATDVEKDARETLRLLISGSSQGN 595
Query: 493 PQLMNVLPVVLTDAEAHQSLFLTGSRGLYSNTRDIDAIASRIASLGIVTLSGQSSSEHIG 552
PQLMNVLPVVLTD+EA++SLFLTGS GLYSNTRDIDAIA+ IASLGI +LSG+S+SEH+G
Sbjct: 596 PQLMNVLPVVLTDSEANRSLFLTGSHGLYSNTRDIDAIANSIASLGIASLSGKSTSEHVG 655
Query: 553 KKHN--ELKCHPADSSDSECSCLDGEDELSPSHLEEKKSG 588
K+ N L HP DSSDSE SC +GED S SH EE K G
Sbjct: 656 KRFNLDGLNGHPDDSSDSESSCSEGEDVFSQSHFEESKFG 687
BLAST of Sed0002029.2 vs. ExPASy TrEMBL
Match:
A0A6J1BY44 (uncharacterized protein LOC111006348 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111006348 PE=4 SV=1)
HSP 1 Score: 885.9 bits (2288), Expect = 8.9e-254
Identity = 470/575 (81.74%), Positives = 511/575 (88.87%), Query Frame = 0
Query: 18 MESLFGHFEPVEVGILARCFCIPLVSIRVGKVNKQGTLLCPTTTRGNLNLMVLPSSDFRL 77
MES+FGHFEPVEVGILARCFCIPLVS+RVGK+ K+G+LLCPTTTRGNLNLM++PSSDFRL
Sbjct: 1 MESVFGHFEPVEVGILARCFCIPLVSVRVGKIEKRGSLLCPTTTRGNLNLMIVPSSDFRL 60
Query: 78 SFIGDNGHVERLFTLSNRMTSVTIAIDEISSDESGRSFVIKATDENVYFWCSEKSKLLGT 137
SFIGDNGHVERLFTLSNR++S I IDEI SD+SGRSFVIKA D++VYFWCSEKSKLLG
Sbjct: 61 SFIGDNGHVERLFTLSNRVSSTAIIIDEIRSDQSGRSFVIKANDQDVYFWCSEKSKLLGM 120
Query: 138 ELLGKMKDLLQRRPSIAELTGISESRLGCFATRLRAYLMESTAVNQHTASSADLHS-VDT 197
ELL KMKDLLQRRPSI+ELTGISESRLGCFATRLRAYL+EST V+ H ASSAD HS V+T
Sbjct: 121 ELLLKMKDLLQRRPSISELTGISESRLGCFATRLRAYLVEST-VSHHPASSADSHSLVNT 180
Query: 198 TRELSRSSHFGQSCASSKSIRSRNLGSQTVKANSAHQGSLSPRLNSFKEGLPKTLLSLRD 257
TRELS++SHFGQS ASSKS+RSRN GS VKANSAHQGSLSPR NSFKEGLPKTL+SLRD
Sbjct: 181 TRELSQASHFGQSSASSKSMRSRNSGSPAVKANSAHQGSLSPRSNSFKEGLPKTLVSLRD 240
Query: 258 AAREKFRRRGDNLALDNHIVASSISTDAFGLNSETHAADSSRPLSASSFLESLGKLAAPI 317
AAREKFRRRGDNLALDNHIV SS+ TDAF ++SE DSS PLS S+ LES GKLAAP
Sbjct: 241 AAREKFRRRGDNLALDNHIVGSSLGTDAFCVHSEAPNTDSSNPLSPSNILESFGKLAAPA 300
Query: 318 PASSSHVPCVVSPLFSPYYCWCPPGSSSISQRREEPSQFPIPSISASSLPPFPSLLPV-- 377
PASSSH PCVVSPLF+PYYCWCPPG+SSI QRREEPSQ P SIS+ SLPPFPSLLPV
Sbjct: 301 PASSSHAPCVVSPLFTPYYCWCPPGASSILQRREEPSQLPTSSISSFSLPPFPSLLPVTT 360
Query: 378 SANLSVSVSPLNLVDSLSVDFPALFPEPLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVC 437
SANLSV SPLNLVD+ SVDFPALFPEPLV LPLKTSQQIPTFTPLFCDPIVHVPVIDVC
Sbjct: 361 SANLSVPASPLNLVDAPSVDFPALFPEPLVHLPLKTSQQIPTFTPLFCDPIVHVPVIDVC 420
Query: 438 SSGQGYLVSAGPTISTSIPPLHHPNLMNPMIPATDVEKDARETLRLLISGSSQGNPQLMN 497
SSGQGYLVSAGPTISTSIPPL HP L+NPMIPATDVEKDARETLRLLISGSSQGNPQLMN
Sbjct: 421 SSGQGYLVSAGPTISTSIPPL-HPKLVNPMIPATDVEKDARETLRLLISGSSQGNPQLMN 480
Query: 498 VLPVVLTDAEAHQSLFLTGSRGLYSNTRDIDAIASRIASLGIVTLSGQSSSEHIGKKHN- 557
VLPVVLTD EA Q +FLTGSRGLYSN RDIDAIA+ IAS+GIV+L GQS+SE++GK+ N
Sbjct: 481 VLPVVLTDTEASQGIFLTGSRGLYSNARDIDAIANSIASIGIVSLPGQSTSENVGKRFNI 540
Query: 558 -ELKCHPADSSDSECSCLDGEDELSPSHLEEKKSG 588
+L HP DSSDSE SC DG +E SH +E+ SG
Sbjct: 541 DDLSDHPDDSSDSESSCFDGGNE--QSHSKERMSG 571
BLAST of Sed0002029.2 vs. TAIR 10
Match:
AT2G39950.1 (unknown protein; Has 978 Blast hits to 254 proteins in 81 species: Archae - 0; Bacteria - 8; Metazoa - 109; Fungi - 53; Plants - 41; Viruses - 0; Other Eukaryotes - 767 (source: NCBI BLink). )
HSP 1 Score: 355.5 bits (911), Expect = 8.0e-98
Identity = 253/569 (44.46%), Positives = 333/569 (58.52%), Query Frame = 0
Query: 13 ADGDVMESLFGHFEPVEVGILARCFCIPLVSIRVGKVNKQGTLLCPTTTRGNLNLMVLPS 72
A+ ++ L HFEP E+G+LARCFCIPLVS+RVGK+ K+G L+ PT RGNL+LMVLP+
Sbjct: 107 AEDRLLAHLSQHFEPAEIGMLARCFCIPLVSVRVGKIIKEGILMRPTPIRGNLSLMVLPT 166
Query: 73 SDFRLSFIGDNGHVERLFTLSNRMTSVTIAIDEISSDESGRSFVIKATDENV-YFWCSEK 132
SD RLSFIGDNGH E+LFT +++ ++I+EI+ D SGRSFVI+ + N Y+WCSEK
Sbjct: 167 SDLRLSFIGDNGHSEQLFTYTSKSQCSAVSIEEITVDSSGRSFVIRIANGNAFYYWCSEK 226
Query: 133 SKLLGTELLGKMKDLLQRRPSIAELTGISESRLGCFATRLRAYLMESTAVNQH--TASSA 192
SKLLGTEL KMKDL++++PSI+ELTGI ESRLG A+ LR YLM S N S
Sbjct: 227 SKLLGTELRRKMKDLIKKKPSISELTGIEESRLGSVASHLRLYLMGSVVPNIKGCQVPSP 286
Query: 193 DLHSVDTTRELSRSSHFGQSCASSKSIRSRNLGSQTVKANSAHQGSLSPRLNSFKEGLPK 252
D S E + SS S ASSKS+R+R+ G+Q K QGSLSPR +SFKE +
Sbjct: 287 DSSSSSGFSETADSS----SSASSKSLRARHCGTQQTKT----QGSLSPRASSFKENTLR 346
Query: 253 TLLSLRDAAREKFRRRGD-NLALDNHIVASSISTDAFG-LNSETHAADSS------RPLS 312
SLR ++R+K + + + ++ ++ +SI T+ G + SE +++ R +
Sbjct: 347 N-ASLRISSRDKSKGHSEGHFSIFDNSSITSIPTNVEGFIQSEGEVEEATENYNGIRQII 406
Query: 313 ASSFLESL-GKLAAPIPASSSHVPCVVSPLFSPYYCWCPPGSSSISQRREEPSQFPIPSI 372
A ES + P P P P+FSPYYCWCPP +SS+ QFP SI
Sbjct: 407 AFEEAESTPSTMTGPPPFPLKMGP----PVFSPYYCWCPPTTSSL-HAPSASYQFPPLSI 466
Query: 373 SASSLPPFPSLLPVSAN--LSVSVSPLNLVDSLSVDFPALFPEPLV-RLPL----KTSQQ 432
SLPP SLLP S + + SPL+L D + P PLV +P+ +S Q
Sbjct: 467 ELPSLPPLSSLLPASGSDGFLIPSSPLDLSD--------IPPLPLVHHIPIPGSSSSSSQ 526
Query: 433 IPTFTPLFCDPIVHVPVIDVCSSGQGYLVSAGPT--ISTSIPPLHHPNLMNPMIPATDVE 492
P+ CDPIVH+PVID+ SSGQ YLVSAGPT IST IPPL P+ + VE
Sbjct: 527 QQMMIPIMCDPIVHIPVIDIFSSGQSYLVSAGPTGIISTGIPPL-------PVENDSLVE 586
Query: 493 KDARETLRLLISGSSQGNPQLMNVLPVVLTDAEAHQSLFLTGSRGLYSNTRDIDAIASRI 552
K ARETLRLLISG++ +N H GSRGLYS +RD+ + S
Sbjct: 587 KGARETLRLLISGANATTSTPLN-----------HH-----GSRGLYSVSRDVSGV-SLF 629
Query: 553 ASLGIVTLSGQSSSEHIGKKHNELKCHPA 561
A +G+ S + G+ + + PA
Sbjct: 647 APIGLQQPSSVEGGDGGGESVSSSEAVPA 629
BLAST of Sed0002029.2 vs. TAIR 10
Match:
AT2G39950.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; Has 941 Blast hits to 229 proteins in 79 species: Archae - 0; Bacteria - 8; Metazoa - 89; Fungi - 54; Plants - 41; Viruses - 0; Other Eukaryotes - 749 (source: NCBI BLink). )
HSP 1 Score: 355.5 bits (911), Expect = 8.0e-98
Identity = 253/569 (44.46%), Positives = 333/569 (58.52%), Query Frame = 0
Query: 13 ADGDVMESLFGHFEPVEVGILARCFCIPLVSIRVGKVNKQGTLLCPTTTRGNLNLMVLPS 72
A+ ++ L HFEP E+G+LARCFCIPLVS+RVGK+ K+G L+ PT RGNL+LMVLP+
Sbjct: 26 AEDRLLAHLSQHFEPAEIGMLARCFCIPLVSVRVGKIIKEGILMRPTPIRGNLSLMVLPT 85
Query: 73 SDFRLSFIGDNGHVERLFTLSNRMTSVTIAIDEISSDESGRSFVIKATDENV-YFWCSEK 132
SD RLSFIGDNGH E+LFT +++ ++I+EI+ D SGRSFVI+ + N Y+WCSEK
Sbjct: 86 SDLRLSFIGDNGHSEQLFTYTSKSQCSAVSIEEITVDSSGRSFVIRIANGNAFYYWCSEK 145
Query: 133 SKLLGTELLGKMKDLLQRRPSIAELTGISESRLGCFATRLRAYLMESTAVNQH--TASSA 192
SKLLGTEL KMKDL++++PSI+ELTGI ESRLG A+ LR YLM S N S
Sbjct: 146 SKLLGTELRRKMKDLIKKKPSISELTGIEESRLGSVASHLRLYLMGSVVPNIKGCQVPSP 205
Query: 193 DLHSVDTTRELSRSSHFGQSCASSKSIRSRNLGSQTVKANSAHQGSLSPRLNSFKEGLPK 252
D S E + SS S ASSKS+R+R+ G+Q K QGSLSPR +SFKE +
Sbjct: 206 DSSSSSGFSETADSS----SSASSKSLRARHCGTQQTKT----QGSLSPRASSFKENTLR 265
Query: 253 TLLSLRDAAREKFRRRGD-NLALDNHIVASSISTDAFG-LNSETHAADSS------RPLS 312
SLR ++R+K + + + ++ ++ +SI T+ G + SE +++ R +
Sbjct: 266 N-ASLRISSRDKSKGHSEGHFSIFDNSSITSIPTNVEGFIQSEGEVEEATENYNGIRQII 325
Query: 313 ASSFLESL-GKLAAPIPASSSHVPCVVSPLFSPYYCWCPPGSSSISQRREEPSQFPIPSI 372
A ES + P P P P+FSPYYCWCPP +SS+ QFP SI
Sbjct: 326 AFEEAESTPSTMTGPPPFPLKMGP----PVFSPYYCWCPPTTSSL-HAPSASYQFPPLSI 385
Query: 373 SASSLPPFPSLLPVSAN--LSVSVSPLNLVDSLSVDFPALFPEPLV-RLPL----KTSQQ 432
SLPP SLLP S + + SPL+L D + P PLV +P+ +S Q
Sbjct: 386 ELPSLPPLSSLLPASGSDGFLIPSSPLDLSD--------IPPLPLVHHIPIPGSSSSSSQ 445
Query: 433 IPTFTPLFCDPIVHVPVIDVCSSGQGYLVSAGPT--ISTSIPPLHHPNLMNPMIPATDVE 492
P+ CDPIVH+PVID+ SSGQ YLVSAGPT IST IPPL P+ + VE
Sbjct: 446 QQMMIPIMCDPIVHIPVIDIFSSGQSYLVSAGPTGIISTGIPPL-------PVENDSLVE 505
Query: 493 KDARETLRLLISGSSQGNPQLMNVLPVVLTDAEAHQSLFLTGSRGLYSNTRDIDAIASRI 552
K ARETLRLLISG++ +N H GSRGLYS +RD+ + S
Sbjct: 506 KGARETLRLLISGANATTSTPLN-----------HH-----GSRGLYSVSRDVSGV-SLF 548
Query: 553 ASLGIVTLSGQSSSEHIGKKHNELKCHPA 561
A +G+ S + G+ + + PA
Sbjct: 566 APIGLQQPSSVEGGDGGGESVSSSEAVPA 548
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_022993059.1 | 9.4e-258 | 83.48 | uncharacterized protein LOC111489188 isoform X2 [Cucurbita maxima] >XP_022993060...
XP_038886409.1 | 8.0e-257 | 83.48 | uncharacterized protein LOC120076604 isoform X2 [Benincasa hispida] >XP_03888641...
XP_022939304.1 | 1.8e-256 | 83.48 | uncharacterized protein LOC111445260 isoform X2 [Cucurbita moschata]
XP_022993058.1 | 2.6e-255 | 82.24 | uncharacterized protein LOC111489188 isoform X1 [Cucurbita maxima]
XP_023550042.1 | 7.5e-255 | 82.96 | uncharacterized protein LOC111808350 isoform X2 [Cucurbita pepo subsp. pepo]
Match Name | E-value | Identity (%) | Description
A0A6J1JXG8 | 4.6e-258 | 83.48 | uncharacterized protein LOC111489188 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1FGS2 | 8.6e-257 | 83.48 | uncharacterized protein LOC111445260 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1K125 | 1.2e-255 | 82.24 | uncharacterized protein LOC111489188 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1FLA2 | 2.3e-254 | 82.24 | uncharacterized protein LOC111445260 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1BY44 | 8.9e-254 | 81.74 | uncharacterized protein LOC111006348 isoform X2 OS=Momordica charantia OX=3673 G...
Match Name | E-value | Identity (%) | Description
AT2G39950.1 | 8.0e-98 | 44.46 | unknown protein; Has 978 Blast hits to 254 proteins in 81 species: Archae - 0; B...
AT2G39950.2 | 8.0e-98 | 44.46 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
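The Identity column in these tables matches the percentage on each HSP summary line, which is the number of identical positions divided by the alignment length (for example 480/575 = 83.48%). The following is a minimal sketch of how those summary lines could be parsed and the percentages recomputed. The names `HSP_STATS` and `parse_hsp_stats` are hypothetical and not part of this site or of BLAST. The pattern assumes the line layout shown on this page and accepts both the "Positives" and the "Postives" spelling.

```python
import re

# Hypothetical pattern for the HSP summary line as printed on this page.
# "Posi?tives" accepts both "Positives" and the "Postives" spelling.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract match counts from one HSP summary line and recompute percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, length, _reported_ident, pos, pos_length, _reported_pos = m.groups()
    ident, length, pos, pos_length = int(ident), int(length), int(pos), int(pos_length)
    return {
        "identity": ident,
        "positives": pos,
        "length": length,
        # Percentages are counts over the alignment length, rounded to 2 places.
        "identity_pct": round(100 * ident / length, 2),
        "positives_pct": round(100 * pos / pos_length, 2),
    }


if __name__ == "__main__":
    line = "Identity = 480/575 (83.48%), Positives = 507/575 (88.17%), Query Frame = 0"
    print(parse_hsp_stats(line))
```

Recomputing the percentages from the counts, rather than trusting the printed values, gives a quick consistency check between an alignment block and the corresponding summary-table row.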