Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, polypeptide, CDS, three_prime_UTR
AAAAAAGAAAAAAGAAAAAAGAAAAAAGAAAAAAGGATAAAGAAAGAGAAGGGAAAGGAAAGAAGCAGAAGTGGCATTTCTAATTCACAATACAGTTTACATGCAGCGTTCAGTCCAGTCAAGTCACACACACTGTTCGTTCACTTTCTCCTCCTTCCCTACACTCCACACACATTCCCGCACTATCCACCGCCATTTTCTTCATCTTCTCAACTGCTTTCCCTCGTCTCCAAACCCTAATGTTAGCGTTTCGCTTCCATTCAAGGTACTTCAATCCTTCTTCCACCTTTGCCGAAGCTGCATGTGGTTACGGTTCATCTTCTGGAATTTGGCCCCGTTTTTGTGCCGTTATTGCGTTTTTTCTTCTCTTTGTTTTGGTATGACATGCATTAGGATCAGCCGAAAGTTTGCTGTTCAACGTGTTGTTAGTTTGTGTGAAAAATTGTTCTAGAATTAACCTGCTCTTTGTTTGAATTCATGGTGTAGGATGCGATGATCGTTTATCTTTTGAAACCTTGCTGTTCTGTTAGGCGTTTTTGGTTTTAAATCCGGAGATATTTTTCGTCTCTTTCTTATGGTATAACATGCATTAGTATCAGATAAAACTTTTCTGTTCATCAATCGTTCTAGAGTTGGTCTTTAACAAATTCATGCTCTTCTGTTATGACTTCCAAGATATCCGAAACTTTCTAATTCTTCACTCCTCTGTTTTAGAATTGTCTCATGAATTGTGCATAAAAGAAATAGAAATCGTAAGTCTACGGTTTACTGTATCTTTAATGACAAAGTGTTCAGCTTGTTTGGAGTGTTTTGACCTCATCAAAATTAAAACTTCTTGATGGTTGTTCTTTTTATGATATTTATAAGTAACATACTTGCAATGCTTTGTAACAATCTTCAGCTATGTTTTTTATTGTATTTCTTTTGAACTTGAGTCTCTGAGTTAGTTTTTTTAAGATGAAGTAATTAGTAATAATTATCTAAAATGTTCCGTTTGTCTCATTTACATTAAAAGAAGTAAAAAGTAAAAACTAGAAATGAGAAATAAAAAATGAAATACAAGAAGTTAAAAGTAAGAATTAGAAATAGGTGGCTGACTAAGATTTGAACTCATAAATTTAAACAAAAAAGAAAAAAAAAGAACAATGATATTACTATGACGTATGATCCCTTTTGAGGCAACAAAATTCAAAATGGAACAACTAGAAGGGTTTCTCGTTTCCATTAGATTGAACACGTGATTCTTCGCTCTCACGCACTTGGTGGTACAAGATCAGCGTGGATCTAATTTGAGTAATTTCTTCACAAGCAGTTAATTATCCTGCATGTTTTACTAACAACCAAGCCGCTAAAGTCTACACATTATTTGGTTTTTTCAACTCTATGCCAGTAGTCTGCCTATTCTGACACAATTTTTGTGTTATTATTAGCTTTATGTTAGCGAACATAAAGATTGAAAAATTGATTATCACTTGCTTGCTGGTAAATCTTCATTTTTCTAGGTGTGAGTAGACTAGCGAGGAAGTTCGGTTCCTTGTTAGGTTCCATACTTCCCTTTGTTCTTCATGTGTAAGGACTTATGGAACTATTCGTTAAGTTTAATCTTGTTTAATTGGAACCCTTTTCTCTAGCATGTCTCCTTTTTGGGTTGGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGTATTCTTTCATTAGTTTCTACTCAATGAAGGATCAATTTTTTGTAAAAGAAAAAAATTCTTTGCCTTGTCACAGAGAATATGGTCAATTATGGTTCATTGGATGTGGAAGAAATTCTGACGGGGGTATTAATTTAGCTATTCATCTGGCAGCTCTTTTAGCGGGCTCCCAATTTCTCATTCCAAGCAGCGTCAACCTAAGTCTGAATCCTTCTATGTTATTTTTCACCAATGATGAAAAGTTTGGCGTCTGATTCTCAGAAGAGGATTTCATATATGAGAAATTGAGAAACAGTA
GTGGTCCAAAAGAACATCCTAACCATGAGTGCAACAGGTGTATGCCCAACCGAGGATGCCATATTTACATTATTAGATTATTTAGTTGAACCCATGCTTCCTGCAAAGTCATTGTCGAGAGAAAATCCACCACAATCTCTTCTGCAATCGGTTGCAAAACAGGTACTTCTCGAATTACCTTTCATGTGTTTAATATTTTAATTTCATTCACTTTCAAGCTTGAGAATGCTCAAACAATCATAATCGAGCCAACAATTTTTTGGTTCTGAACTGGGTGATTTGAGTTGATTTACTGGAAACATCAAAGTTGTACAAGCAGCAATCCGTGTTTCAATATTCTTCCATCTTTAGAGAGTACCAATAGTTTAGGATGATACTATTACTCGTTCGAAGAATGTAATTTCTGTGCTCTCTCTTCCTCAATTAACCGTGTGCTTTAATACTTTTGATCTTTTGGAAGTACAGGCTTTTATAAGCATAGAATTACTTTTATAAGCATAGAAGTACAGGCTTTTACTGTTGTATGATTAAAATGAAATATGTTTGAAGTAGGTTAGTTTTAACTGCATGTGACAGTTTATTAGTCACCACTCCTTCTCAGAAGTTGAAAGGCTCGTTTATTGTCTTAAAAAAAACAAAAATAATAAGGTAGCATGTAACTTAGTGTTTTTCCCTTGAATCTACTATGGTAGGTGCATGCCGTTGTTCTGTTGTACAACTACTACCACCGGAAACAACATCCGCACCTTGAATTTCTGAGTTTTGAGGCATTTTGTAAGTTAGCTGTGGTCGTTAAACCAGCTTTGTTGTCTCACATGAAACTCATGCAAAACTCAGATGATATAGAATTGGAAAATCCCGAGAACCAGCTTTCTCCAGCCGAAAAGGCAATTATGGATGCATGTGATATAGCCACTTGTCTACAGGCATCAAAAGATGATGACGTAGAGGGCTGGCCTCTTTCCAAGGTTGCTGTTCTTTTAATTGACTCCAAAAGGGAAAGTTGCCATTTGCTATTTAGTGTCATCACTCAAGGAGTTTGGTCTGTCATTGAACAAGATTTGGATACCTCGGAATGTCAACCAGAAACTGTGGACGAGGAAAAACATGTAAATAAAAAGAAAAGAGTGATCAAGAAACCTTCAAAAGAGGGGCCAGTTGATGAAATTAAAACTCAGCAGCTGGCATATTCAACAGTTAGGAAAGCAACAGGTTTGTATCTTACTTTATGAGCTCATCAGTGCGTTCAATAATTATGTATTGTTCTTATAATCATAAAGTTATAAGTATACATAGAATAATTACTTTTATTTTCTATCATTTTATTTTAGAGGCGATGTCCCATAGTGATTGTGTTTGCTTAAAAGATTCCTTTAAGATTTCTAATGATTGACTGCAAAGTTGATTGTCTCGTATATCATATCTTCCTCAATTAAGGATCATGTATGGTTAACTTTTAGGAAATAAAGATATCCATCATCCATTTTTTGGTCTTTCCGAAATCAGATTATTTTGGTTAAATTGCAAATTTGGTCCCTTCAGTTTGGAGAAAGAATTTAGTTCCTATGGTTTTAAATGTTAAAATTAGTTCCTACAGTTTGCTAAATCCTCATAAACAGTCTCTAATATTTATGAAGTTTTGTCAAAGCATCAGGATAAAATTCTAACTTTTAAAACCACGGAGGCCAAATTCTAACCTTTTCCAAACTAGAGGGACCAAATTTACAATTTAACCATTAATATTTAATTTTGGTATTGGTTGGATGATCTTAAGGTGCCTCATCATATTGGTTGTGTTTATGTCCTCTTTCTATTCTCTTCTGTAATTACCTTAAGAAAAGGGGAAGAACTTAGATCTAGCATGACATTATTTGGAGTATACGTACAAAAATAAACTAGTCTTTTTAAGGGAATACATTGTTTAGAACTCAGATCATCATATGACATTGTTAGGATTAATCTTCAAAACATTGTCTGGACTAAAGTGTTCCTTTTGTAAT
TACTATTTTAGGTATGTGGCTCATAACTTGAAGGAATATATTAATTTCTTCAATGAAAAAGTTCTATTTTTTGTTTTAAAAAATTTATGGAGAACAAATTTAGTAACCAAAGAAAGCAGTATGGTAGCTACCCCCTTCACCCGCATCCATTCCCTTGTTGACCTCATCTAGACATTGTATTTTTTCTTTTTGAGGTCTTAATGATGCAAAGTCAGTAAGAGTGCCTAAAGGATTTGGAACATTCTACGCAGATGGGAGGATGCTTGTTTGCAGGCATAATTTGGGATAACTTGTGAGAGGCTTCGAAGCAAATGGCATGTTGCATGTCATGTCGGTCTATGATTGAAGGGTGACTCTTTGGCAGACTGCTTTCTTTGCGGTCTGAGGGGAGCATCAGCCACTTCGAGGGAAATGCTATATTTGTTGCCCAAATTTTTTACATAGATTTCTTTTCATTTATTCAACAAAAAGTTCAAAAAAAGAAGAAGTTCTATCCTCAAACCACGTCCTTGTTAGGTTTTGACACCCTAATGCAATTCACTTAATGTGACACACCTCCCTCCTCAAACCCTCCCATAGAAATTCCCTTGTGACCATTCCAACTTTTCCTTGATGGATCCTTGAATGTTGAAAAGAAAGAAAGCACAAGGGGTTTCTGATGAGTCTTCCCCTTTAGAGAAGAAAGATTTTCTCCAAGAAGCTCTTCTGAAAATTTCAATCGACTTAAATGAATTGAAGTCTTTCATCACTCTTAAGATTAGTTTTCTTTACATCCAACTCTTAAGTGATTGCTCGTTCAACCATAAAACTGTTCATTCAGTCCAAATATATCTATGCCTTTTAAAGATTTTCAATTATTCTATAGTCCCCTAGAGAGCATATTATTGGAATTCGCCTAGCTTTGTGCAGCAATGTAGTGGAAGCAAAATTCACGTAGGCTATCCTTTTTTGTTGCCATTATTACCCTTGCAGTCCTATATTATACTATTGGCACTTATCTTTGGTGCTTACCAATACTTTCTGATCCTATTTATTTTCATGCAAAACAGGGATTAATCAAACTGATCTCAAAATTTTAGAAAGTCATGTTGTATACTCTCATAGTAAAGCGAAATCAGCAGTCAGCTTTTATGTGATTCAGTGCACTCGATCAGCAACTGAAGATGTAATTCAAGTTCCCATTAAAGATACCATTGACAGGTTTTACTCTTTTCCTTTCCCACCTGTCCCCTTCAACTTTGTGTCTTGATATCTTTTTTTGCTGTCCCACCAGCTTCTAAAATGTTTAGGTTAGTAATGGAGTTGGGTATTTGGTTTGAATGCATATCCCAGGACCTCATGTGCAATGAAGCATGGATGAAGGTCGGTCATCAAATCCATAGATACCTTTTTTGCTCATGATTTTTTTTTTTTTTCCCATGGTATTAGTTTGCAGGATTCGTTGTTTAAAATAAATGGTAGGAGATGGAGCATTACCTCAAAAGTTGAGTACTTCCACATTCTTCCATATGCTAGGATGATGCTAATCTGGTTTCACGGGTATACACTTAATTGCTTTTGATATTAGTATTACCGTTGGTATTTATTTATTCGTCTACATAACATGATAATTTTTATCTTACTTATTGTCTTGTTTATTTCTGCTTCGTTGCTGTATCTAACCACTCGCTCTTGCTCAGTAATGTGAGGGTTAACAATCATTCGTTTAACTTTTACTTCATTCAATATTAATTTTTTCTCCCACTTGATTTCATTCTCTGTTGCTGTATCCAACCTCCTAGCTCATGCTAGTTACCTGTATGGGAGGGTTTATTCCAAGCGCTGATTCTCAGTCATTTATCCTAAATCTCCCATTTTAGTAGTTTGCATGTGTGGGGAATAACCCCTTTTGTAATGAGTATTTTCCTATTATTGAAATATCTATGATAGGGAGTAATCACATACATGAGAACGGAAGTTCTTTTTGCTACATAATGGGAAGCGTTGTTATTCTTTTTTGA
CTTGTATACGGAAAGGTGATTGCATACTAATATCTCTATGCTCTCTTTCACGCAGGGTAACTTCAACCAATAGTTTACGAGTCATAGGTGGAGCAAAGGTTGATGAAAACTTGAACAAGCCTGAGAGAATAGATGTAATGAGGACACTTGAAATTCAAGACAACCAAGATGGTGCTAGTGCAAACAATTTGAATAAAGGGACTAGCACTTATGGTGAAGGATTGGAAAGACTGCCAGATAAAACTAACTATATCAGTAGTTTGAATGATGTGATGTTCAGGCCCCAGAATTCTAATGTGGATGACTTGGTTCCTTCCTATCCAGTGGAGAAGAAAAAGGATGTACCAAATACTAGCCAAGTTTTCTTTTCCTATGCAAAGAAAAAGAATGCTAGGCAAGCTGACAATCGCGATGCAGTGATGATCCCATGTATGGTGAATGAACCAAATGCCTCAGAAAGTGGCATCAAAGTTAAGGTAAGTCGTAGGAGTATCTTCAGATACTTATCTAGTCTTGATTTTATATTTATCTTTAGGTATCTTAAGATATGTAGTCTGTCTAAATTTCATATTTACCAATAGGACTCCTGATGTTTATGGGACGAGATAATTTTTGTTGCGGGATATTTTGTATGATTTATTTTCTTTTTTGGCACTTTGTATGCATGATTTAATCACTCAAAACAAGATTCTATGTTGAGACCCATCGACTAAGGGACAGTCTTGAGGATGAATTGATGTGGTTCCTTTATGCTTTAGACACCACGAAAGATAATACTGTCTTTTTTGTTTTCAAGTTTTTAATGGATTTTATTGAAGAACTTACATATTTTGAAGGAATTGTTAAATCCTTTGAAAAATGATTAATTACTTCCCAATTTACATGGAGTATACTTCTTGTGCAACTTTTTAAATGCTTAAGCAAGGAAACTGGAAAAGAAAAATCTAATTTACAAGGTGCTCCTTCTAAACATAGGGCACAACAAGATAGTTTTTATAAGTCTCAGTAGATTACAAGAGAACCAAGTAATCCTTTGTTATATTTATGAAATGCTATCTGCAGCAACTTTGGTAGAATAAAACAGAAAAAATCTGTTATAGAGAGTTTCTCCTTGGATTTAATGTGATAACAAGTCGAGAACAAGAACTCTCAACCAAGAATCTTTCAAATTATTCATTCAAGTTCATTAAGAAAAATCTATTATGAATGAGAACAAATTAATTCCAACTATTAGTTTAGTTTTGGATTCAGTATTCATTTTACTTGGAACCAATGCAAGAGTTCTTGAATTCAAATTCTGCATGAACACATTTTCACTTCAAAATCAATTATGTATAGGATGAGCTTGTTAATCAAAGGAAAATTTCAATATAATCTGATATCTTTTAATGTAGAAGTATTAGTTTTATTGTCTTTTATTTATAATTCAGCTGAAAGCATGATTTACTTTCAGGATAGGATATTGGCAACGAACCCTTGTCATGCTGAATGCAGTGGTGAAAAGATTGCTTCTGGAAATCTCTCTGACAATATTTCACTTGATCAATATAGGAACGGTGATCATGCTCTTGTCACCTGTCAATCGAACACAGAACATCTTACTAAGTTACAGGAAATTATAATTTCGAAAGAAACAGCATTGTCACAAGCTGCAATTAAAGCTCTAAGTAGGAAGAGGGATAAACTGGTACACATACATTTTATATTGTGGTTAAGTTAGGGCTTATGGTTTTCTGATGAATTTAATTTGAGCTAGAAACTATGTTTCAAAGTTCAACATAATGCAAATAATTAGATTATGTATTCTGTAATAAAGAACTGAGATTATGCTTGCTCTCAAGTTCTCCTTTGCCCAACATTGGATGATGGGTGTAGGAAAACAATTATGCTTGCTCTTATATCTTGTTATATGCACCTTACAATAGTATTCCAGTGTCCCTTTTTCTCTGACCAAGGGTTTCATATAGATTGAGCCCTTTTGAATATTGATATTTCATAAT
TGTTGTTTAGAGAGTGAAGTCCATTCATTCTGTTTGCAGTCTCATCAGCAGCGCATCATTGAAGATAAGATAGCTCGGTGTGATAAAAACATGCAGACAATATTAAGGGGTATGCTCTATTTTCTCTTTTCCCTTTCTGTTTTATTTCTACAGTTTTGTTTATTTTGCATGTTTCATGTACAAGGCCTCTAGAATTTTAGATCATTTCGTTCATCAAATACTAGCAGTCCTCTGTCCTCTCTTTTGTTTCAGTTTTAATTGATGAACCCCTACCCTCAACTTTCTATGATTTATAGTCTTCCGTGGTGCATCCAAGAATAGATAAAAGTATTTTGCAGAGATTTATTTGAGACTAAGGTTTGAATTATTGTTAGATCACTATTCTAAGGATTATACAGATCCTGAGCTACAAATGGAAGTAAAGTTCGAATTCTTAGATTTGATATGTGGCATTTGTCACTGCTTTACTAAATCGTTTGACGCCTATACTGGAGCTAAGTTTTTAATATTAGTGCAAATTGATCAATTGTGTCAAACTTGAAGAATTAATATTACTCGTGCCATATTTAGATGTGCTTTTTCAGCATTTAAAGGCAATGCCTTAAATTTGAACTTGAGTCTTACTTCGTTCCCTGGTTTTGCTTTCTTATTTTATTCCATAATTCTTTTGTAGATTTTATTTTGATCTTCTTTCATCAGATCAAGTTAACCAACAAGTCATCATAATTAGCCAGAGAAAATGTCTAAATTACGTTATATTTTGGTTACTAAAGGTGATGAAGATGGCTTGGTTATAAAGCTGGATTCTGTGATCGAATGTTGTAATGATGTCTGCATAAGAAGTATTGCGGAAGATAGATCTTATCAATGCTTTGAAGAAAACTGCTCATCTCAATATGGCACGAGTAAGAGATTGTCAGAAGCAATTCTCTGCGTACAGAATCCATGTCAGGTGAGTTAACCATGGAAGATAATAATATTATAAACTTTCATACTCAAGGGTAGTTGCATGGTTGTATTTGTGGTGTACATATATTTGTTAATTAATTTTGGTAAGAAACTGAGCTTTCATTGAAAACAAATGAAAGAATGTACACAAGCATACAAAAAACAAGGCAATTAAAGGAGTCCAACGATTAACATCCTCTAAAGACACATAAAACCTCACTCTCTCCCGTACCTCCACCAAAAACCTTACAACCCCTTAAAATAATGGGAAACATGACTTTCATTACAAGAGATGAAAGCAAAGAGAGGCTACAAGAAAGACTAGAGGGGCAAGTAAGGAACTCTTCCAAAGAACAGCCAAGATAAAATTATAAAGCACAAAAACATGAAACACGTTGATCTATTATTTCAAACAGCTGAATTAGAAGAATAAAGTTCTGTTAGATTGCTTTGTTATGGTTTAGTATACAGGTTGGAAATACTTGGACTAAAACAAAAGACACCTGCCATAGGGTAACCCACGGACTGCCACCATCTAAGCTGTTTACTATTTTATCATAGAGATGTAGCTACTATCACCAATATCATTGTTATCATCATCAAGAAAACTTTTAGGTATGATATTTAAGTCCCTATTATAAAAAATTGGAATGAAAGGAAGATGGGAATGAGGGATCAAAAGAGTTGTTCTAGGGGTGAGTTTAAAAAAAATTGAAAGAGACTTGGGAGTTATGGCTATTGCTAAATTATTACTAAACCCAAAACCTTAAACTGATGGATTATGGTAAATTTAATTTGTTTCCGTACTTTTATCCCTTTGCTGATGGGCTTGAATTCTTTTCCTTACAAAAAAAGGTACTAGTTTTGTCTACTTGTCATAATATGTTTTAAACTTCTTTATGGTCCCTCTTTGGATAGTAATGATATCAGCGTTGATACCAAATTTCTTTTGATATACTTAATTGTTTATGGTAATTGTAGAAAAACCCAAATCAAAGCTTAAGGGTGACTTGTCTGCTATTCATTTTTCTTTTACTATTTTTACATTTGTT
TACCTTTACAGTTGTATATGTAATAATGCTCTTTTGGCATGGCAGGAACTGGATGACATATGTCGTAAAAATAATTGGATATTGCCTGTTTACGGAGTCTCGACATCAGATGGTAATGATAATGCTATTTTAGAAAAGTTGTCCATAAACTAACACTAAGGACCAATTTAAGATTTATCGAAAATAAAAGGATTAGAATTTTCGAAGTACAAGGACCCAAATGGTATTTTAACCTTTTACTTTTGATTTCGAGTTATTTGTATTGCCAATCATTTGTATTTTATATTGTCGGTTTTCCCTCTTGCATCTTGAAATGTTCTTGCTAATATGAAGTATTTCCTCAGAAAAATGTACAGTGGGTTTCAAATTTGATGGTGCTCATTCTGCCTCCTGTTTTTTCTGGTTTTTGCACGTCTGATCATGCATTGTCCTGCCGTACATTGCAATTTGAAATCATTTCTCAGTATCTAATATGGAATAGGAAATGTATTGTTTGTTCTTCTGAAGGTGGATTCCAAGCTAATGTGTATGTAAAAGGGATGGATTTTGCATATTCAAGCTGCAGCGAGCTGTGTCCAGACCCTTGTGAAGCCAGGAAATCGGCTGCAACAAAGATGCTAGGTCAACTATGGACGATGGCAAGTCAGACCAAGCAGGTTTAGGTGCCTTAGTATGAGCCTGAGGCTGGTTTTATAGGAGGCTAACAAAACCTAGACAACTATCTAAAGTTGTGTTAAATTTTCGTCTCTTCGCACTCTTTAATATCATCCTTGATATTTATTTTAGGTCTATATTCTATTTTCGTCTCTAAACTTTGATGCACGAAGCCTTTCTTACCAACGACAGCCACAACTTGCACACTTGCAACAGATTGGCCTCGTGCACAACTAGATGGGAATGCCTGTAGAGTCTGACTCGAGAGCATAGCATTGAAAATATGGACAGTGTATGAATAACAATAGTTCATAACTCCATGAGAAAAGCCAAAATCAATAGATCACAACTCCTTGGTGCCATTTAGGACTCCCAACTCCAAAACCATTTCACCTTTCAATGTTTCTGTTACATAATTTTGGACAAGTTTAC
mRNA sequence
AAAAAAGAAAAAAGAAAAAAGAAAAAAGAAAAAAGGATAAAGAAAGAGAAGGGAAAGGAAAGAAGCAGAAGTGGCATTTCTAATTCACAATACAGTTTACATGCAGCGTTCAGTCCAGTCAAGTCACACACACTGTTCGTTCACTTTCTCCTCCTTCCCTACACTCCACACACATTCCCGCACTATCCACCGCCATTTTCTTCATCTTCTCAACTGCTTTCCCTCGTCTCCAAACCCTAATGTTAGCGTTTCGCTTCCATTCAAGAGAATATGGTCAATTATGGTTCATTGGATGTGGAAGAAATTCTGACGGGGGTATTAATTTAGCTATTCATCTGGCAGCTCTTTTAGCGGGCTCCCAATTTCTCATTCCAAGCAGCGTCAACCTAAGTCTGAATCCTTCTATGTTATTTTTCACCAATGATGAAAAGTTTGGCGTCTGATTCTCAGAAGAGGATTTCATATATGAGAAATTGAGAAACAGTAGTGGTCCAAAAGAACATCCTAACCATGAGTGCAACAGGTGTATGCCCAACCGAGGATGCCATATTTACATTATTAGATTATTTAGTTGAACCCATGCTTCCTGCAAAGTCATTGTCGAGAGAAAATCCACCACAATCTCTTCTGCAATCGGTTGCAAAACAGGTGCATGCCGTTGTTCTGTTGTACAACTACTACCACCGGAAACAACATCCGCACCTTGAATTTCTGAGTTTTGAGGCATTTTGTAAGTTAGCTGTGGTCGTTAAACCAGCTTTGTTGTCTCACATGAAACTCATGCAAAACTCAGATGATATAGAATTGGAAAATCCCGAGAACCAGCTTTCTCCAGCCGAAAAGGCAATTATGGATGCATGTGATATAGCCACTTGTCTACAGGCATCAAAAGATGATGACGTAGAGGGCTGGCCTCTTTCCAAGGTTGCTGTTCTTTTAATTGACTCCAAAAGGGAAAGTTGCCATTTGCTATTTAGTGTCATCACTCAAGGAGTTTGGTCTGTCATTGAACAAGATTTGGATACCTCGGAATGTCAACCAGAAACTGTGGACGAGGAAAAACATGTAAATAAAAAGAAAAGAGTGATCAAGAAACCTTCAAAAGAGGGGCCAGTTGATGAAATTAAAACTCAGCAGCTGGCATATTCAACAGTTAGGAAAGCAACAGGGATTAATCAAACTGATCTCAAAATTTTAGAAAGTCATGTTGTATACTCTCATAGTAAAGCGAAATCAGCAGTCAGCTTTTATGTGATTCAGTGCACTCGATCAGCAACTGAAGATGTAATTCAAGTTCCCATTAAAGATACCATTGACAGTTTGCAGGATTCGTTGTTTAAAATAAATGGTAGGAGATGGAGCATTACCTCAAAAGTTGAGTACTTCCACATTCTTCCATATGCTAGGATGATGCTAATCTGGTTTCACGGGGTAACTTCAACCAATAGTTTACGAGTCATAGGTGGAGCAAAGGTTGATGAAAACTTGAACAAGCCTGAGAGAATAGATGTAATGAGGACACTTGAAATTCAAGACAACCAAGATGGTGCTAGTGCAAACAATTTGAATAAAGGGACTAGCACTTATGGTGAAGGATTGGAAAGACTGCCAGATAAAACTAACTATATCAGTAGTTTGAATGATGTGATGTTCAGGCCCCAGAATTCTAATGTGGATGACTTGGTTCCTTCCTATCCAGTGGAGAAGAAAAAGGATGTACCAAATACTAGCCAAGTTTTCTTTTCCTATGCAAAGAAAAAGAATGCTAGGCAAGCTGACAATCGCGATGCAGTGATGATCCCATGTATGGTGAATGAACCAAATGCCTCAGAAAGTGGCATCAAAGTTAAGGATAGGATATTGGCAACGAACCCTTGTCATGCTGAATGCAGTGGTGAAAAGATTGCTTCTGGAAATCTCTCTGACAATATTTCACTTGATCAATATAGGAACGGTGATCATGCTCTTGTCACCTGTCAATCGAACACAGAACATCTTAC
TAAGTTACAGGAAATTATAATTTCGAAAGAAACAGCATTGTCACAAGCTGCAATTAAAGCTCTAAGTAGGAAGAGGGATAAACTGTCTCATCAGCAGCGCATCATTGAAGATAAGATAGCTCGGTGTGATAAAAACATGCAGACAATATTAAGGGGTGATGAAGATGGCTTGGTTATAAAGCTGGATTCTGTGATCGAATGTTGTAATGATGTCTGCATAAGAAGTATTGCGGAAGATAGATCTTATCAATGCTTTGAAGAAAACTGCTCATCTCAATATGGCACGAGTAAGAGATTGTCAGAAGCAATTCTCTGCGTACAGAATCCATGTCAGGAACTGGATGACATATGTCGTAAAAATAATTGGATATTGCCTGTTTACGGAGTCTCGACATCAGATGGTGGATTCCAAGCTAATGTGTATGTAAAAGGGATGGATTTTGCATATTCAAGCTGCAGCGAGCTGTGTCCAGACCCTTGTGAAGCCAGGAAATCGGCTGCAACAAAGATGCTAGGTCAACTATGGACGATGGCAAGTCAGACCAAGCAGGTTTAGGTGCCTTAGTATGAGCCTGAGGCTGGTTTTATAGGAGGCTAACAAAACCTAGACAACTATCTAAAGTTGTGTTAAATTTTCGTCTCTTCGCACTCTTTAATATCATCCTTGATATTTATTTTAGGTCTATATTCTATTTTCGTCTCTAAACTTTGATGCACGAAGCCTTTCTTACCAACGACAGCCACAACTTGCACACTTGCAACAGATTGGCCTCGTGCACAACTAGATGGGAATGCCTGTAGAGTCTGACTCGAGAGCATAGCATTGAAAATATGGACAGTGTATGAATAACAATAGTTCATAACTCCATGAGAAAAGCCAAAATCAATAGATCACAACTCCTTGGTGCCATTTAGGACTCCCAACTCCAAAACCATTTCACCTTTCAATGTTTCTGTTACATAATTTTGGACAAGTTTAC
Coding sequence (CDS)
ATGAGTGCAACAGGTGTATGCCCAACCGAGGATGCCATATTTACATTATTAGATTATTTAGTTGAACCCATGCTTCCTGCAAAGTCATTGTCGAGAGAAAATCCACCACAATCTCTTCTGCAATCGGTTGCAAAACAGGTGCATGCCGTTGTTCTGTTGTACAACTACTACCACCGGAAACAACATCCGCACCTTGAATTTCTGAGTTTTGAGGCATTTTGTAAGTTAGCTGTGGTCGTTAAACCAGCTTTGTTGTCTCACATGAAACTCATGCAAAACTCAGATGATATAGAATTGGAAAATCCCGAGAACCAGCTTTCTCCAGCCGAAAAGGCAATTATGGATGCATGTGATATAGCCACTTGTCTACAGGCATCAAAAGATGATGACGTAGAGGGCTGGCCTCTTTCCAAGGTTGCTGTTCTTTTAATTGACTCCAAAAGGGAAAGTTGCCATTTGCTATTTAGTGTCATCACTCAAGGAGTTTGGTCTGTCATTGAACAAGATTTGGATACCTCGGAATGTCAACCAGAAACTGTGGACGAGGAAAAACATGTAAATAAAAAGAAAAGAGTGATCAAGAAACCTTCAAAAGAGGGGCCAGTTGATGAAATTAAAACTCAGCAGCTGGCATATTCAACAGTTAGGAAAGCAACAGGGATTAATCAAACTGATCTCAAAATTTTAGAAAGTCATGTTGTATACTCTCATAGTAAAGCGAAATCAGCAGTCAGCTTTTATGTGATTCAGTGCACTCGATCAGCAACTGAAGATGTAATTCAAGTTCCCATTAAAGATACCATTGACAGTTTGCAGGATTCGTTGTTTAAAATAAATGGTAGGAGATGGAGCATTACCTCAAAAGTTGAGTACTTCCACATTCTTCCATATGCTAGGATGATGCTAATCTGGTTTCACGGGGTAACTTCAACCAATAGTTTACGAGTCATAGGTGGAGCAAAGGTTGATGAAAACTTGAACAAGCCTGAGAGAATAGATGTAATGAGGACACTTGAAATTCAAGACAACCAAGATGGTGCTAGTGCAAACAATTTGAATAAAGGGACTAGCACTTATGGTGAAGGATTGGAAAGACTGCCAGATAAAACTAACTATATCAGTAGTTTGAATGATGTGATGTTCAGGCCCCAGAATTCTAATGTGGATGACTTGGTTCCTTCCTATCCAGTGGAGAAGAAAAAGGATGTACCAAATACTAGCCAAGTTTTCTTTTCCTATGCAAAGAAAAAGAATGCTAGGCAAGCTGACAATCGCGATGCAGTGATGATCCCATGTATGGTGAATGAACCAAATGCCTCAGAAAGTGGCATCAAAGTTAAGGATAGGATATTGGCAACGAACCCTTGTCATGCTGAATGCAGTGGTGAAAAGATTGCTTCTGGAAATCTCTCTGACAATATTTCACTTGATCAATATAGGAACGGTGATCATGCTCTTGTCACCTGTCAATCGAACACAGAACATCTTACTAAGTTACAGGAAATTATAATTTCGAAAGAAACAGCATTGTCACAAGCTGCAATTAAAGCTCTAAGTAGGAAGAGGGATAAACTGTCTCATCAGCAGCGCATCATTGAAGATAAGATAGCTCGGTGTGATAAAAACATGCAGACAATATTAAGGGGTGATGAAGATGGCTTGGTTATAAAGCTGGATTCTGTGATCGAATGTTGTAATGATGTCTGCATAAGAAGTATTGCGGAAGATAGATCTTATCAATGCTTTGAAGAAAACTGCTCATCTCAATATGGCACGAGTAAGAGATTGTCAGAAGCAATTCTCTGCGTACAGAATCCATGTCAGGAACTGGATGACATATGTCGTAAAAATAATTGGATATTGCCTGTTTACGGAGTCTCGACATCAGATGGTGGATTCCAAGCTAATGTGTATGTAAAAGGGATGGATTTTGCATATTCAAGCTGCAGCGAGCTGTGTCCAGACCCTTGTGAAGCCAGGAAATCGGCTGCAACAAAGAT
GCTAGGTCAACTATGGACGATGGCAAGTCAGACCAAGCAGGTTTAG
Protein sequence
MSATGVCPTEDAIFTLLDYLVEPMLPAKSLSRENPPQSLLQSVAKQVHAVVLLYNYYHRKQHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQNSDDIELENPENQLSPAEKAIMDACDIATCLQASKDDDVEGWPLSKVAVLLIDSKRESCHLLFSVITQGVWSVIEQDLDTSECQPETVDEEKHVNKKKRVIKKPSKEGPVDEIKTQQLAYSTVRKATGINQTDLKILESHVVYSHSKAKSAVSFYVIQCTRSATEDVIQVPIKDTIDSLQDSLFKINGRRWSITSKVEYFHILPYARMMLIWFHGVTSTNSLRVIGGAKVDENLNKPERIDVMRTLEIQDNQDGASANNLNKGTSTYGEGLERLPDKTNYISSLNDVMFRPQNSNVDDLVPSYPVEKKKDVPNTSQVFFSYAKKKNARQADNRDAVMIPCMVNEPNASESGIKVKDRILATNPCHAECSGEKIASGNLSDNISLDQYRNGDHALVTCQSNTEHLTKLQEIIISKETALSQAAIKALSRKRDKLSHQQRIIEDKIARCDKNMQTILRGDEDGLVIKLDSVIECCNDVCIRSIAEDRSYQCFEENCSSQYGTSKRLSEAILCVQNPCQELDDICRKNNWILPVYGVSTSDGGFQANVYVKGMDFAYSSCSELCPDPCEARKSAATKMLGQLWTMASQTKQV
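The protein sequence above should be the translation of the CDS under the standard genetic code. A minimal sketch of that check follows, applied only to the first and last codons of the CDS as listed; the helper name `translate` is hypothetical and not part of this database, and the standard (nuclear) codon table is assumed.

```python
# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First seven codons and last six codons of the CDS listed above.
cds_start = "ATGAGTGCAACAGGTGTATGC"
cds_end = "CAGACCAAGCAGGTTTAG"
print(translate(cds_start))
print(translate(cds_end))
```

The first fragment corresponds to the start of the protein (MSATGVC) and the second to its end (QTKQV) followed by the stop codon.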
Homology
BLAST of CmoCh08G002120 vs. ExPASy TrEMBL
Match:
A0A6J1HAN9 (uncharacterized protein LOC111461089 OS=Cucurbita moschata OX=3662 GN=LOC111461089 PE=4 SV=1)
HSP 1 Score: 1352.4 bits (3499), Expect = 0.0e+00
Identity = 681/681 (100.00%), Positives = 681/681 (100.00%), Query Frame = 0
Query: 1 MSATGVCPTEDAIFTLLDYLVEPMLPAKSLSRENPPQSLLQSVAKQVHAVVLLYNYYHRK 60
MSATGVCPTEDAIFTLLDYLVEPMLPAKSLSRENPPQSLLQSVAKQVHAVVLLYNYYHRK
Sbjct: 1 MSATGVCPTEDAIFTLLDYLVEPMLPAKSLSRENPPQSLLQSVAKQVHAVVLLYNYYHRK 60
Query: 61 QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQNSDDIELENPENQLSPAEKAIMDACDIA 120
QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQNSDDIELENPENQLSPAEKAIMDACDIA
Sbjct: 61 QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQNSDDIELENPENQLSPAEKAIMDACDIA 120
Query: 121 TCLQASKDDDVEGWPLSKVAVLLIDSKRESCHLLFSVITQGVWSVIEQDLDTSECQPETV 180
TCLQASKDDDVEGWPLSKVAVLLIDSKRESCHLLFSVITQGVWSVIEQDLDTSECQPETV
Sbjct: 121 TCLQASKDDDVEGWPLSKVAVLLIDSKRESCHLLFSVITQGVWSVIEQDLDTSECQPETV 180
Query: 181 DEEKHVNKKKRVIKKPSKEGPVDEIKTQQLAYSTVRKATGINQTDLKILESHVVYSHSKA 240
DEEKHVNKKKRVIKKPSKEGPVDEIKTQQLAYSTVRKATGINQTDLKILESHVVYSHSKA
Sbjct: 181 DEEKHVNKKKRVIKKPSKEGPVDEIKTQQLAYSTVRKATGINQTDLKILESHVVYSHSKA 240
Query: 241 KSAVSFYVIQCTRSATEDVIQVPIKDTIDSLQDSLFKINGRRWSITSKVEYFHILPYARM 300
KSAVSFYVIQCTRSATEDVIQVPIKDTIDSLQDSLFKINGRRWSITSKVEYFHILPYARM
Sbjct: 241 KSAVSFYVIQCTRSATEDVIQVPIKDTIDSLQDSLFKINGRRWSITSKVEYFHILPYARM 300
Query: 301 MLIWFHGVTSTNSLRVIGGAKVDENLNKPERIDVMRTLEIQDNQDGASANNLNKGTSTYG 360
MLIWFHGVTSTNSLRVIGGAKVDENLNKPERIDVMRTLEIQDNQDGASANNLNKGTSTYG
Sbjct: 301 MLIWFHGVTSTNSLRVIGGAKVDENLNKPERIDVMRTLEIQDNQDGASANNLNKGTSTYG 360
Query: 361 EGLERLPDKTNYISSLNDVMFRPQNSNVDDLVPSYPVEKKKDVPNTSQVFFSYAKKKNAR 420
EGLERLPDKTNYISSLNDVMFRPQNSNVDDLVPSYPVEKKKDVPNTSQVFFSYAKKKNAR
Sbjct: 361 EGLERLPDKTNYISSLNDVMFRPQNSNVDDLVPSYPVEKKKDVPNTSQVFFSYAKKKNAR 420
Query: 421 QADNRDAVMIPCMVNEPNASESGIKVKDRILATNPCHAECSGEKIASGNLSDNISLDQYR 480
QADNRDAVMIPCMVNEPNASESGIKVKDRILATNPCHAECSGEKIASGNLSDNISLDQYR
Sbjct: 421 QADNRDAVMIPCMVNEPNASESGIKVKDRILATNPCHAECSGEKIASGNLSDNISLDQYR 480
Query: 481 NGDHALVTCQSNTEHLTKLQEIIISKETALSQAAIKALSRKRDKLSHQQRIIEDKIARCD 540
NGDHALVTCQSNTEHLTKLQEIIISKETALSQAAIKALSRKRDKLSHQQRIIEDKIARCD
Sbjct: 481 NGDHALVTCQSNTEHLTKLQEIIISKETALSQAAIKALSRKRDKLSHQQRIIEDKIARCD 540
Query: 541 KNMQTILRGDEDGLVIKLDSVIECCNDVCIRSIAEDRSYQCFEENCSSQYGTSKRLSEAI 600
KNMQTILRGDEDGLVIKLDSVIECCNDVCIRSIAEDRSYQCFEENCSSQYGTSKRLSEAI
Sbjct: 541 KNMQTILRGDEDGLVIKLDSVIECCNDVCIRSIAEDRSYQCFEENCSSQYGTSKRLSEAI 600
Query: 601 LCVQNPCQELDDICRKNNWILPVYGVSTSDGGFQANVYVKGMDFAYSSCSELCPDPCEAR 660
LCVQNPCQELDDICRKNNWILPVYGVSTSDGGFQANVYVKGMDFAYSSCSELCPDPCEAR
Sbjct: 601 LCVQNPCQELDDICRKNNWILPVYGVSTSDGGFQANVYVKGMDFAYSSCSELCPDPCEAR 660
Query: 661 KSAATKMLGQLWTMASQTKQV 682
KSAATKMLGQLWTMASQTKQV
Sbjct: 661 KSAATKMLGQLWTMASQTKQV 681
BLAST of CmoCh08G002120 vs. ExPASy TrEMBL
Match:
A0A6J1KZE5 (uncharacterized protein LOC111497732 OS=Cucurbita maxima OX=3661 GN=LOC111497732 PE=4 SV=1)
HSP 1 Score: 1303.1 bits (3371), Expect = 0.0e+00
Identity = 660/680 (97.06%), Positives = 664/680 (97.65%), Query Frame = 0
Query: 1 MSATGVCPTEDAIFTLLDYLVEPMLPAKSLSRENPPQSLLQSVAKQVHAVVLLYNYYHRK 60
MSA GVCPTEDAIFTLLDYLVEPMLPAKSLSRENPPQSLLQSVAKQVHAVVLLYNYYHRK
Sbjct: 1 MSAPGVCPTEDAIFTLLDYLVEPMLPAKSLSRENPPQSLLQSVAKQVHAVVLLYNYYHRK 60
Query: 61 QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQNSDDIELENPENQLSPAEKAIMDACDIA 120
QHPHLEFLSFE FCKLAVVVKPALLSHMKLMQNSDDIELENPENQLSPAEKAIMDACDIA
Sbjct: 61 QHPHLEFLSFEEFCKLAVVVKPALLSHMKLMQNSDDIELENPENQLSPAEKAIMDACDIA 120
Query: 121 TCLQASKDDDVEGWPLSKVAVLLIDSKRESCHLLFSVITQGVWSVIEQDLDTSECQPETV 180
TCLQASKDDDVEGWPLSKVAVLLIDSKRESCHLLFSVITQGVWSVIEQDLDTSECQPET+
Sbjct: 121 TCLQASKDDDVEGWPLSKVAVLLIDSKRESCHLLFSVITQGVWSVIEQDLDTSECQPETM 180
Query: 181 DEEKHVNKKKRVIKKPSKEGPVDEIKTQQLAYSTVRKATGINQTDLKILESHVVYSHSKA 240
DEEKHVNKKKRVIKKPSKEGPVDEIKTQQLAYSTVRKATGINQ+DLKILESHVVYSHSKA
Sbjct: 181 DEEKHVNKKKRVIKKPSKEGPVDEIKTQQLAYSTVRKATGINQSDLKILESHVVYSHSKA 240
Query: 241 KSAVSFYVIQCTRSATEDVIQVPIKDTIDSLQDSLFKINGRRWSITSKVEYFHILPYARM 300
KSAV FYVIQCTRSATEDVIQVPIKDTIDSLQDSLFKINGRRWSITSKVEYFHILPYARM
Sbjct: 241 KSAVCFYVIQCTRSATEDVIQVPIKDTIDSLQDSLFKINGRRWSITSKVEYFHILPYARM 300
Query: 301 MLIWFHGVTSTNSLRVIGGAKVDENLNKPERIDVMRTLEIQDNQDGASANNLNKGTSTYG 360
MLIWFHGVTSTNSLRVIGGAKVDENLNKPERIDV RTLEIQDNQDGA+A NLNKGTSTYG
Sbjct: 301 MLIWFHGVTSTNSLRVIGGAKVDENLNKPERIDVTRTLEIQDNQDGANAYNLNKGTSTYG 360
Query: 361 EGLERLPDKTNYISSLNDVMFRPQNSNVDDLVPSYPVEKKKDVPNTSQVFFSYAKKKNAR 420
EGLERLPDKTNYISSLNDVM RPQNSNVDDLVPSYPVEKKKDVPNTSQVFFS KKKNAR
Sbjct: 361 EGLERLPDKTNYISSLNDVMCRPQNSNVDDLVPSYPVEKKKDVPNTSQVFFSCTKKKNAR 420
Query: 421 QADNRDAVMIPCMVNEPNASESGIKVKDRILATNPCHAECSGEKIASGNLSDNISLDQYR 480
Q DN AVMIPCMVNE NASESGIKVKDRILA NPC AECSGEKIASGNLSDNISLDQYR
Sbjct: 421 QVDNSYAVMIPCMVNESNASESGIKVKDRILAANPCLAECSGEKIASGNLSDNISLDQYR 480
Query: 481 NGDHALVTCQSNTEHLTKLQEIIISKETALSQAAIKALSRKRDKLSHQQRIIEDKIARCD 540
NGDHALVTCQSNTEHLTKLQEIIISKETALSQAAIKALSRKRDKLSHQQRIIEDKIA+CD
Sbjct: 481 NGDHALVTCQSNTEHLTKLQEIIISKETALSQAAIKALSRKRDKLSHQQRIIEDKIAQCD 540
Query: 541 KNMQTILRGDEDGLVIKLDSVIECCNDVCIRSIAEDRSYQCFEENCSSQYGTSKRLSEAI 600
KNMQTILRGDEDGLVIKLDSVIECC DVCIRSIAEDRSYQCFEENCSSQYGTSKRLSEAI
Sbjct: 541 KNMQTILRGDEDGLVIKLDSVIECCYDVCIRSIAEDRSYQCFEENCSSQYGTSKRLSEAI 600
Query: 601 LCVQNPCQELDDICRKNNWILPVYGVSTSDGGFQANVYVKGMDFAYSSCSELCPDPCEAR 660
LCVQNPCQELDDICRKNNWILPVYGVSTSDGGFQANV VKGMDFAYSSCSELCPDPCEAR
Sbjct: 601 LCVQNPCQELDDICRKNNWILPVYGVSTSDGGFQANVLVKGMDFAYSSCSELCPDPCEAR 660
Query: 661 KSAATKMLGQLWTMASQTKQ 681
KSAATKMLGQLWTMASQTKQ
Sbjct: 661 KSAATKMLGQLWTMASQTKQ 680
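The percentages in each HSP header are the match counts divided by the alignment length. The sketch below recomputes them from a header line; the function name `hsp_percentages` is hypothetical, and the header string is copied from the Cucurbita maxima hit above.

```python
import re

def hsp_percentages(header: str) -> dict:
    """Recompute percentages from 'Name = matches/length' pairs in an HSP header."""
    result = {}
    for name, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", header):
        result[name] = round(100 * int(matches) / int(length), 2)
    return result

header = "Identity = 660/680 (97.06%), Positives = 664/680 (97.65%), Query Frame = 0"
print(hsp_percentages(header))
```

Fields without a `matches/length` pair, such as the query frame, are ignored by the pattern.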
BLAST of CmoCh08G002120 vs. ExPASy TrEMBL
Match:
A0A6J1DAH9 (uncharacterized protein LOC111018541 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111018541 PE=4 SV=1)
HSP 1 Score: 1082.4 bits (2798), Expect = 0.0e+00
Identity = 543/680 (79.85%), Positives = 605/680 (88.97%), Query Frame = 0
Query: 1 MSATGVCPTEDAIFTLLDYLVEPMLPAKSLSRENPPQSLLQSVAKQVHAVVLLYNYYHRK 60
MSA GVCPTEDAI LLDYLVEPMLPAKS SR+NPPQSL QSVAKQVHAVV+LYNYYHRK
Sbjct: 1 MSALGVCPTEDAIHALLDYLVEPMLPAKSSSRDNPPQSLQQSVAKQVHAVVILYNYYHRK 60
Query: 61 QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQNSDDIELENPENQLSPAEKAIMDACDIA 120
QHPHLE LSFEAFCKLAVVVKPALLSHMKLMQ+SDD ELENPE QLSPAEKAIMDACDIA
Sbjct: 61 QHPHLELLSFEAFCKLAVVVKPALLSHMKLMQSSDDTELENPEKQLSPAEKAIMDACDIA 120
Query: 121 TCLQASKDDDVEGWPLSKVAVLLIDSKRESCHLLFSVITQGVWSVIEQDLDTSECQPETV 180
TCL+ASKD++VEGWPLSKVAVLLIDS++E CHLLFS ITQGVWSVIEQDLDTSECQPETV
Sbjct: 121 TCLEASKDENVEGWPLSKVAVLLIDSRKECCHLLFSFITQGVWSVIEQDLDTSECQPETV 180
Query: 181 DEEKHVNKKKRVIKKPSKE-GPVDEIKTQQLAYSTVRKATGINQTDLKILESHVVYSHSK 240
+EEKHVNKK+RVIKKPSKE VDE KTQQLAYS V++ATGINQ DLKIL+ HVVYS SK
Sbjct: 181 EEEKHVNKKRRVIKKPSKEVSVVDEAKTQQLAYSAVKEATGINQRDLKILDGHVVYSLSK 240
Query: 241 AKSAVSFYVIQCTRSATEDVIQVPIKDTIDSLQDSLFKINGRRWSITSKVEYFHILPYAR 300
KSAV FY+IQCT+SATEDVIQVPIKD +DSLQ SLF+ +GRRWSITSKVE+FHILPYA+
Sbjct: 241 EKSAVRFYMIQCTQSATEDVIQVPIKDAMDSLQGSLFRKDGRRWSITSKVEHFHILPYAK 300
Query: 301 MMLIWFHGVTSTNSLRVIGGAKVDENLNKPERIDVMRTLEIQDNQDGASANNLNKGTSTY 360
M+L W TS +SLRV+ G K+DENL+K ERID R LEIQ++QDG SAN+L+KGTS Y
Sbjct: 301 MVLTWLQRETSRDSLRVVSGEKMDENLSKLERIDAPRKLEIQNDQDGDSANDLSKGTSIY 360
Query: 361 GEGLERLPDKTNYISSLNDVMFRPQNSNVDDLVPSYPVEKKKDVPNTSQVFFSYAKKKNA 420
GEGLE+L +KTN++ SL+D + RPQ +NVDDLVPSYPV+KKKDVPNTSQV SY KK+NA
Sbjct: 361 GEGLEKLHNKTNHVGSLHDAICRPQITNVDDLVPSYPVDKKKDVPNTSQVIVSYTKKRNA 420
Query: 421 RQADNRDAVMIPCMVNEPNASESGIKVKDRILATNPCHAECSGEKIASGNLSDNISLDQY 480
RQ DN VMIPC NE NASESGIK+KD +LATNPC AECSGEKIASGN SDN+S DQ
Sbjct: 421 RQVDNGHEVMIPCTGNESNASESGIKIKDGVLATNPCIAECSGEKIASGNFSDNVSFDQN 480
Query: 481 RNGDHALVTCQSNTEHLTKLQEIIISKETALSQAAIKALSRKRDKLSHQQRIIEDKIARC 540
RNGDHAL+TCQSN EHL+KLQ I++SKETALSQAAI+AL RKRDKLSHQQRIIED+IA+C
Sbjct: 481 RNGDHALITCQSNIEHLSKLQAILVSKETALSQAAIRALIRKRDKLSHQQRIIEDEIAQC 540
Query: 541 DKNMQTILRGDEDGLVIKLDSVIECCNDVCIRSIAEDRSYQCFEENCSSQYGTSKRLSEA 600
DK +QTILRGDED LVIKLDSVIECCNDVC+R+ AED SYQCF+ENCSSQY T KRLSEA
Sbjct: 541 DKKVQTILRGDEDDLVIKLDSVIECCNDVCLRNTAEDGSYQCFKENCSSQYVTRKRLSEA 600
Query: 601 ILCVQNPCQELDDICRKNNWILPVYGVSTSDGGFQANVYVKGMDFAYSSCSELCPDPCEA 660
+LCV++PCQELD IC KNNWILPVY +S+SDGGFQANV+VKG+DF YSSCSE C +P EA
Sbjct: 601 VLCVRSPCQELDAICHKNNWILPVYSISSSDGGFQANVFVKGLDFEYSSCSETCSNPREA 660
Query: 661 RKSAATKMLGQLWTMASQTK 680
R SAATKMLGQLW++ASQ K
Sbjct: 661 RASAATKMLGQLWSIASQRK 680
BLAST of CmoCh08G002120 vs. ExPASy TrEMBL
Match:
A0A1S3BE29 (uncharacterized protein LOC103488666 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103488666 PE=4 SV=1)
HSP 1 Score: 1078.5 bits (2788), Expect = 0.0e+00
Identity = 543/684 (79.39%), Positives = 608/684 (88.89%), Query Frame = 0
Query: 1 MSATGVCPTEDAIFTLLDYLVEPMLPAKSLSRENPPQSLLQSVAKQVHAVVLLYNYYHRK 60
MSA GVCPTEDAI LLDYLVEPMLPAKS SRENPP++LLQSVAKQ+HAVVLLYN+YHRK
Sbjct: 1 MSAPGVCPTEDAIHALLDYLVEPMLPAKSSSRENPPEALLQSVAKQMHAVVLLYNFYHRK 60
Query: 61 QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQNSDDIELENPENQLSPAEKAIMDACDIA 120
QHPHLEFLSFEAFCKLAV+VKPALLSHMKLMQ+SDDIELENPE QLSPAEKAIMDACDIA
Sbjct: 61 QHPHLEFLSFEAFCKLAVIVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACDIA 120
Query: 121 TCLQASKDDDVEGWPLSKVAVLLIDSKRESCHLLFSVITQGVWSVIEQDLDTSECQPETV 180
TCL+AS D+++EGWPLSKVAV L+DSK+E C+LLFS ITQGVWSVIEQD+D+SE QPETV
Sbjct: 121 TCLEASPDENIEGWPLSKVAVFLVDSKKEHCYLLFSFITQGVWSVIEQDIDSSEWQPETV 180
Query: 181 DEEKHVNKKKRVIKKPSKEG-PVDEIKTQQLAYSTVRKATGINQTDLKILESHVVYSHSK 240
DEE+HVNKKKRVIKKPSKEG VDE KTQQ+AY+ V++ATGINQ+DLKILESHVVYS SK
Sbjct: 181 DEERHVNKKKRVIKKPSKEGLVVDETKTQQVAYTAVKEATGINQSDLKILESHVVYSLSK 240
Query: 241 AKSAVSFYVIQCTRSATEDVIQVPIKDTIDSLQDSLFKINGRRWSITSKVEYFHILPYAR 300
KSAV FY+IQCTRSATEDVIQVPI+D ++SLQDSLF+ +GRRWSITSKVEYFHILPYA+
Sbjct: 241 EKSAVCFYMIQCTRSATEDVIQVPIRDVVNSLQDSLFRKSGRRWSITSKVEYFHILPYAK 300
Query: 301 MMLIWFHGVTSTNSLRVIGGAKVDENLNKPERIDVMRTLEIQDNQDGASANNLNKGTSTY 360
M L WFH +S++ L VIG KVDENLN+PERIDV+R L++Q+NQ+GASANNLN + Y
Sbjct: 301 MALTWFHRESSSDKLGVIGEEKVDENLNRPERIDVIRRLKVQNNQNGASANNLNIRANIY 360
Query: 361 GEGLERLPDKTNYISSLNDVMFRPQNSNVDDLVPSYPVEKKKDVPNTSQVFFS----YAK 420
G+G ERLPDKTN + SL+D ++RPQ+++VDDLVPSYPVEKKKDVPNTSQ S Y K
Sbjct: 361 GKGFERLPDKTNCVGSLHDAIYRPQSTSVDDLVPSYPVEKKKDVPNTSQAIVSYTKTYTK 420
Query: 421 KKNARQADNRDAVMIPCMVNEPNASESGIKVKDRILATNPCHAECSGEKIASGNLSDNIS 480
K RQ DN +MIPCMVNE +ASESGIK KD ILATNPC AECSGEKIASGNLSDNIS
Sbjct: 421 KITDRQVDNSYELMIPCMVNESDASESGIKAKDGILATNPCIAECSGEKIASGNLSDNIS 480
Query: 481 LDQYRNGDHALVTCQSNTEHLTKLQEIIISKETALSQAAIKALSRKRDKLSHQQRIIEDK 540
DQ RNGDHAL+TCQSN EHL+KLQ II+SKETALSQAAIKAL RKRDKLSHQQR+IED+
Sbjct: 481 FDQNRNGDHALITCQSNAEHLSKLQAIIVSKETALSQAAIKALIRKRDKLSHQQRLIEDE 540
Query: 541 IARCDKNMQTILRGDEDGLVIKLDSVIECCNDVCIRSIAEDRSYQCFEENCSSQYGTSKR 600
IA+CDKNMQTILRGDED LV+KLDSVI+CCND+C +S AED+SYQ FEENCSSQY T KR
Sbjct: 541 IAQCDKNMQTILRGDEDDLVLKLDSVIDCCNDLC-QSTAEDKSYQYFEENCSSQYVTRKR 600
Query: 601 LSEAILCVQNPCQELDDICRKNNWILPVYGVSTSDGGFQANVYVKGMDFAYSSCSELCPD 660
LSEAILC+QNPCQELD IC KNNWILPVYGVS+ DGGFQANV+VKGMDF YSSC ELC D
Sbjct: 601 LSEAILCIQNPCQELDGICHKNNWILPVYGVSSLDGGFQANVFVKGMDFEYSSCGELCSD 660
Query: 661 PCEARKSAATKMLGQLWTMASQTK 680
P +AR+SAA KMLGQLW MA+Q K
Sbjct: 661 PRDARESAAMKMLGQLWRMANQAK 683
BLAST of CmoCh08G002120 vs. ExPASy TrEMBL
Match:
A0A6J1EPE2 (uncharacterized protein LOC111436360 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111436360 PE=4 SV=1)
HSP 1 Score: 1015.8 bits (2625), Expect = 8.6e-293
Identity = 524/694 (75.50%), Positives = 584/694 (84.15%), Query Frame = 0
Query: 1 MSATGVCPTEDAIFTLLDYLVEPMLPAKSLSRENPPQSLLQSVAKQVHAVVLLYNYYHRK 60
MSATGVCPTEDAI LLDYLVEPMLP+KS S ENPP +LLQSVAKQ+HAVVLLYNYYHRK
Sbjct: 1 MSATGVCPTEDAILALLDYLVEPMLPSKSSSIENPPLALLQSVAKQMHAVVLLYNYYHRK 60
Query: 61 QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQNSDDIELENPENQLSPAEKAIMDACDIA 120
QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQ+SDDIELENPE QLSPAEKAIMDAC +A
Sbjct: 61 QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQSSDDIELENPEKQLSPAEKAIMDACGLA 120
Query: 121 TCLQASKDDDVEGWPLSKVAVLLIDSKRESCHLLFSVITQGVWSVIEQDLDTSECQPETV 180
TCL SKD+++EGWPLSKVAV LIDSK+E CHLLFS ITQGVWSVIEQ+LDTSECQP++V
Sbjct: 121 TCLYTSKDENIEGWPLSKVAVFLIDSKKEHCHLLFSSITQGVWSVIEQNLDTSECQPKSV 180
Query: 181 DEEKHVNKKKRVIKKPSKEG-PVDEIKTQQLAYSTVRKATGINQTDLKILESHVVYSHSK 240
+EEKHVNKKKRVIKKPSKEG V KTQQLAYS V++ATGINQ DLKILESHV YS SK
Sbjct: 181 EEEKHVNKKKRVIKKPSKEGLVVVGTKTQQLAYSAVKEATGINQHDLKILESHVAYSLSK 240
Query: 241 AKSAVSFYVIQCTRSATEDVIQVPIKDTIDSLQDSLFKINGRRWSITSKVEYFHILPYAR 300
KSAV FY++QCTRSATEDVIQVPIKD +DSLQDSLFK NGRRWS+TSKVEY+HILPY +
Sbjct: 241 EKSAVYFYMMQCTRSATEDVIQVPIKDAVDSLQDSLFKKNGRRWSVTSKVEYYHILPYVK 300
Query: 301 MMLIWFHGVTSTNSLRVIGGAKVDENLNKPERIDVMRTLEIQDNQDGASANNLNKGTSTY 360
M+L WFH T T++L V+GG K+DENLNKP+R DV R L Q+NQD A+ NN+NKGTS Y
Sbjct: 301 MVLTWFHRETLTDNLGVVGGEKIDENLNKPKRKDVTRKLGTQNNQDDATTNNMNKGTSIY 360
Query: 361 GEGLERLPDKTNYISSLNDVMFRPQNSNVDDLVPSYPVEKKKDVPNTSQVFFSYAKK--- 420
GLERLP+KTN +SSL+D + RPQ+ +VDDLVPS P+EK+K VP +QV SY KK
Sbjct: 361 DAGLERLPNKTNCMSSLHDAICRPQSPSVDDLVPSNPMEKRKGVPTPTQVIMSYVKKIHG 420
Query: 421 --------------KNARQADNRDAVMIPCMVNEPNASESGIKVKDRILATNPCHAECSG 480
RQ N IPC VNE ASESGIKV+D ILATNPC AECSG
Sbjct: 421 SPVYNHYEATIPCSVTGRQVYNHYEATIPCTVNESKASESGIKVEDGILATNPCIAECSG 480
Query: 481 EKIASGNLSDNISLDQYRNGDHALVTCQSNTEHLTKLQEIIISKETALSQAAIKALSRKR 540
EK+ASGNLSDNIS DQ RN DHAL+TCQSNT++L+K+Q IISKETALSQAAIKAL RKR
Sbjct: 481 EKVASGNLSDNIS-DQNRNDDHALITCQSNTKNLSKMQ-AIISKETALSQAAIKALIRKR 540
Query: 541 DKLSHQQRIIEDKIARCDKNMQTILRGDEDGLVIKLDSVIECCNDVCIRSIAEDRSYQCF 600
DKLSHQQRIIED+IA+CDKNMQTILRGDED V+KLDSVIECCNDVC+RS AED+ YQ
Sbjct: 541 DKLSHQQRIIEDEIAQCDKNMQTILRGDEDDFVVKLDSVIECCNDVCLRSAAEDKPYQYS 600
Query: 601 EENCSSQYGTSKRLSEAILCVQNPCQELDDICRKNNWILPVYGVSTSDGGFQANVYVKGM 660
EENCSSQ T KRLSE ILC++NPCQELDDIC KNNWILPVYGVS+SDGGFQANV +KG+
Sbjct: 601 EENCSSQLVTRKRLSEEILCIRNPCQELDDICHKNNWILPVYGVSSSDGGFQANVILKGL 660
Query: 661 DFAYSSCSELCPDPCEARKSAATKMLGQLWTMAS 677
DF YSS E+C +P EAR+SAA KMLGQLW MA+
Sbjct: 661 DFEYSSNGEVCHNPREARESAAMKMLGQLWRMAA 692
BLAST of CmoCh08G002120 vs. NCBI nr
Match:
XP_022960330.1 (uncharacterized protein LOC111461089 [Cucurbita moschata] >XP_022960331.1 uncharacterized protein LOC111461089 [Cucurbita moschata] >XP_022960332.1 uncharacterized protein LOC111461089 [Cucurbita moschata] >XP_022960333.1 uncharacterized protein LOC111461089 [Cucurbita moschata] >XP_022960334.1 uncharacterized protein LOC111461089 [Cucurbita moschata])
HSP 1 Score: 1352.4 bits (3499), Expect = 0.0e+00
Identity = 681/681 (100.00%), Positives = 681/681 (100.00%), Query Frame = 0
Query: 1 MSATGVCPTEDAIFTLLDYLVEPMLPAKSLSRENPPQSLLQSVAKQVHAVVLLYNYYHRK 60
MSATGVCPTEDAIFTLLDYLVEPMLPAKSLSRENPPQSLLQSVAKQVHAVVLLYNYYHRK
Sbjct: 1 MSATGVCPTEDAIFTLLDYLVEPMLPAKSLSRENPPQSLLQSVAKQVHAVVLLYNYYHRK 60
Query: 61 QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQNSDDIELENPENQLSPAEKAIMDACDIA 120
QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQNSDDIELENPENQLSPAEKAIMDACDIA
Sbjct: 61 QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQNSDDIELENPENQLSPAEKAIMDACDIA 120
Query: 121 TCLQASKDDDVEGWPLSKVAVLLIDSKRESCHLLFSVITQGVWSVIEQDLDTSECQPETV 180
TCLQASKDDDVEGWPLSKVAVLLIDSKRESCHLLFSVITQGVWSVIEQDLDTSECQPETV
Sbjct: 121 TCLQASKDDDVEGWPLSKVAVLLIDSKRESCHLLFSVITQGVWSVIEQDLDTSECQPETV 180
Query: 181 DEEKHVNKKKRVIKKPSKEGPVDEIKTQQLAYSTVRKATGINQTDLKILESHVVYSHSKA 240
DEEKHVNKKKRVIKKPSKEGPVDEIKTQQLAYSTVRKATGINQTDLKILESHVVYSHSKA
Sbjct: 181 DEEKHVNKKKRVIKKPSKEGPVDEIKTQQLAYSTVRKATGINQTDLKILESHVVYSHSKA 240
Query: 241 KSAVSFYVIQCTRSATEDVIQVPIKDTIDSLQDSLFKINGRRWSITSKVEYFHILPYARM 300
KSAVSFYVIQCTRSATEDVIQVPIKDTIDSLQDSLFKINGRRWSITSKVEYFHILPYARM
Sbjct: 241 KSAVSFYVIQCTRSATEDVIQVPIKDTIDSLQDSLFKINGRRWSITSKVEYFHILPYARM 300
Query: 301 MLIWFHGVTSTNSLRVIGGAKVDENLNKPERIDVMRTLEIQDNQDGASANNLNKGTSTYG 360
MLIWFHGVTSTNSLRVIGGAKVDENLNKPERIDVMRTLEIQDNQDGASANNLNKGTSTYG
Sbjct: 301 MLIWFHGVTSTNSLRVIGGAKVDENLNKPERIDVMRTLEIQDNQDGASANNLNKGTSTYG 360
Query: 361 EGLERLPDKTNYISSLNDVMFRPQNSNVDDLVPSYPVEKKKDVPNTSQVFFSYAKKKNAR 420
EGLERLPDKTNYISSLNDVMFRPQNSNVDDLVPSYPVEKKKDVPNTSQVFFSYAKKKNAR
Sbjct: 361 EGLERLPDKTNYISSLNDVMFRPQNSNVDDLVPSYPVEKKKDVPNTSQVFFSYAKKKNAR 420
Query: 421 QADNRDAVMIPCMVNEPNASESGIKVKDRILATNPCHAECSGEKIASGNLSDNISLDQYR 480
QADNRDAVMIPCMVNEPNASESGIKVKDRILATNPCHAECSGEKIASGNLSDNISLDQYR
Sbjct: 421 QADNRDAVMIPCMVNEPNASESGIKVKDRILATNPCHAECSGEKIASGNLSDNISLDQYR 480
Query: 481 NGDHALVTCQSNTEHLTKLQEIIISKETALSQAAIKALSRKRDKLSHQQRIIEDKIARCD 540
NGDHALVTCQSNTEHLTKLQEIIISKETALSQAAIKALSRKRDKLSHQQRIIEDKIARCD
Sbjct: 481 NGDHALVTCQSNTEHLTKLQEIIISKETALSQAAIKALSRKRDKLSHQQRIIEDKIARCD 540
Query: 541 KNMQTILRGDEDGLVIKLDSVIECCNDVCIRSIAEDRSYQCFEENCSSQYGTSKRLSEAI 600
KNMQTILRGDEDGLVIKLDSVIECCNDVCIRSIAEDRSYQCFEENCSSQYGTSKRLSEAI
Sbjct: 541 KNMQTILRGDEDGLVIKLDSVIECCNDVCIRSIAEDRSYQCFEENCSSQYGTSKRLSEAI 600
Query: 601 LCVQNPCQELDDICRKNNWILPVYGVSTSDGGFQANVYVKGMDFAYSSCSELCPDPCEAR 660
LCVQNPCQELDDICRKNNWILPVYGVSTSDGGFQANVYVKGMDFAYSSCSELCPDPCEAR
Sbjct: 601 LCVQNPCQELDDICRKNNWILPVYGVSTSDGGFQANVYVKGMDFAYSSCSELCPDPCEAR 660
Query: 661 KSAATKMLGQLWTMASQTKQV 682
KSAATKMLGQLWTMASQTKQV
Sbjct: 661 KSAATKMLGQLWTMASQTKQV 681
BLAST of CmoCh08G002120 vs. NCBI nr
Match:
KAG6592994.1 (hypothetical protein SDJN03_12470, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 1333.2 bits (3449), Expect = 0.0e+00
Identity = 672/681 (98.68%), Positives = 675/681 (99.12%), Query Frame = 0
Query: 1 MSATGVCPTEDAIFTLLDYLVEPMLPAKSLSRENPPQSLLQSVAKQVHAVVLLYNYYHRK 60
MSA GVCPTEDAIFTLLDYLVEPMLPAKSLSRENPPQSLLQSVAKQVHAVVLLYNYYHRK
Sbjct: 1 MSAPGVCPTEDAIFTLLDYLVEPMLPAKSLSRENPPQSLLQSVAKQVHAVVLLYNYYHRK 60
Query: 61 QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQNSDDIELENPENQLSPAEKAIMDACDIA 120
QHPHL+FLSFEAFCKLAVVVKPALLSHMKLMQNSDDIELENPENQLSPAEKAIMDACDIA
Sbjct: 61 QHPHLDFLSFEAFCKLAVVVKPALLSHMKLMQNSDDIELENPENQLSPAEKAIMDACDIA 120
Query: 121 TCLQASKDDDVEGWPLSKVAVLLIDSKRESCHLLFSVITQGVWSVIEQDLDTSECQPETV 180
TCLQASKDDDVEGWPLSKVAVLLIDSKRESCHLLFSVITQGVWSVIEQDLDTSECQPETV
Sbjct: 121 TCLQASKDDDVEGWPLSKVAVLLIDSKRESCHLLFSVITQGVWSVIEQDLDTSECQPETV 180
Query: 181 DEEKHVNKKKRVIKKPSKEGPVDEIKTQQLAYSTVRKATGINQTDLKILESHVVYSHSKA 240
DEEKHVNKKKRVIKKPSKEGPVDEIKTQQLAYSTVRKATGINQTDLKILESHVVYSHSKA
Sbjct: 181 DEEKHVNKKKRVIKKPSKEGPVDEIKTQQLAYSTVRKATGINQTDLKILESHVVYSHSKA 240
Query: 241 KSAVSFYVIQCTRSATEDVIQVPIKDTIDSLQDSLFKINGRRWSITSKVEYFHILPYARM 300
KSAV FYVIQCTRSATEDVIQVPIKDTIDSLQDSLFKINGRRWSITSKVEYFHILPYARM
Sbjct: 241 KSAVCFYVIQCTRSATEDVIQVPIKDTIDSLQDSLFKINGRRWSITSKVEYFHILPYARM 300
Query: 301 MLIWFHGVTSTNSLRVIGGAKVDENLNKPERIDVMRTLEIQDNQDGASANNLNKGTSTYG 360
MLIWFHGVTSTNSLRVIGGAKVDENLNKPERIDV RTLEIQDNQDGASANNLNKGTSTYG
Sbjct: 301 MLIWFHGVTSTNSLRVIGGAKVDENLNKPERIDVTRTLEIQDNQDGASANNLNKGTSTYG 360
Query: 361 EGLERLPDKTNYISSLNDVMFRPQNSNVDDLVPSYPVEKKKDVPNTSQVFFSYAKKKNAR 420
EGLERLPDKTNYISSLNDVM RPQNSNVDDLVPSYPVEKKKDVPNTSQVFFSYAKKKNAR
Sbjct: 361 EGLERLPDKTNYISSLNDVMCRPQNSNVDDLVPSYPVEKKKDVPNTSQVFFSYAKKKNAR 420
Query: 421 QADNRDAVMIPCMVNEPNASESGIKVKDRILATNPCHAECSGEKIASGNLSDNISLDQYR 480
QADNRDAVMIPCMVNEPNASESGIKVKDRILATNPC AECSGEKIASGNLSDNISLDQYR
Sbjct: 421 QADNRDAVMIPCMVNEPNASESGIKVKDRILATNPCLAECSGEKIASGNLSDNISLDQYR 480
Query: 481 NGDHALVTCQSNTEHLTKLQEIIISKETALSQAAIKALSRKRDKLSHQQRIIEDKIARCD 540
NGDHALVTCQSNTEHLTKLQEIIISKETALSQAAIKALSRKRDKLSHQQRIIEDKIARCD
Sbjct: 481 NGDHALVTCQSNTEHLTKLQEIIISKETALSQAAIKALSRKRDKLSHQQRIIEDKIARCD 540
Query: 541 KNMQTILRGDEDGLVIKLDSVIECCNDVCIRSIAEDRSYQCFEENCSSQYGTSKRLSEAI 600
KNMQTILRGDEDGLVIKLDSVIECCNDVCIRSIAEDRSYQCF+ENCSSQYGTSKRLSEAI
Sbjct: 541 KNMQTILRGDEDGLVIKLDSVIECCNDVCIRSIAEDRSYQCFDENCSSQYGTSKRLSEAI 600
Query: 601 LCVQNPCQELDDICRKNNWILPVYGVSTSDGGFQANVYVKGMDFAYSSCSELCPDPCEAR 660
LCVQNPCQELDDICRKNNWILPVYGVSTSDGGFQANV+VKGMDFAYSSCSELCPDPCEAR
Sbjct: 601 LCVQNPCQELDDICRKNNWILPVYGVSTSDGGFQANVFVKGMDFAYSSCSELCPDPCEAR 660
Query: 661 KSAATKMLGQLWTMASQTKQV 682
KSAATKM GQLWTMASQTKQV
Sbjct: 661 KSAATKMFGQLWTMASQTKQV 681
BLAST of CmoCh08G002120 vs. NCBI nr
Match:
KAG7025400.1 (hypothetical protein SDJN02_11895 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 1330.5 bits (3442), Expect = 0.0e+00
Identity = 671/681 (98.53%), Positives = 674/681 (98.97%), Query Frame = 0
Query: 1 MSATGVCPTEDAIFTLLDYLVEPMLPAKSLSRENPPQSLLQSVAKQVHAVVLLYNYYHRK 60
MSA GVCPTEDAIFTLLDYLVEPMLPAKSLSRENPPQSLLQSVAKQVHAVVLLYNYYHRK
Sbjct: 1 MSAPGVCPTEDAIFTLLDYLVEPMLPAKSLSRENPPQSLLQSVAKQVHAVVLLYNYYHRK 60
Query: 61 QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQNSDDIELENPENQLSPAEKAIMDACDIA 120
QHPHL+FLSFEAFCKLAVVVKPALL+HMKLMQNSDDIELENPENQLSPAEKAIMDACDIA
Sbjct: 61 QHPHLDFLSFEAFCKLAVVVKPALLTHMKLMQNSDDIELENPENQLSPAEKAIMDACDIA 120
Query: 121 TCLQASKDDDVEGWPLSKVAVLLIDSKRESCHLLFSVITQGVWSVIEQDLDTSECQPETV 180
TCL ASKDDDVEGWPLSKVAVLLIDSKRESCHLLFSVITQGVWSVIEQDLDTSECQPETV
Sbjct: 121 TCLLASKDDDVEGWPLSKVAVLLIDSKRESCHLLFSVITQGVWSVIEQDLDTSECQPETV 180
Query: 181 DEEKHVNKKKRVIKKPSKEGPVDEIKTQQLAYSTVRKATGINQTDLKILESHVVYSHSKA 240
DEEKHVNKKKRVIKKPSKEGPVDEIKTQQLAYSTVRKATGINQTDLKILESHVVYSHSKA
Sbjct: 181 DEEKHVNKKKRVIKKPSKEGPVDEIKTQQLAYSTVRKATGINQTDLKILESHVVYSHSKA 240
Query: 241 KSAVSFYVIQCTRSATEDVIQVPIKDTIDSLQDSLFKINGRRWSITSKVEYFHILPYARM 300
KSAV FYVIQCTRSATEDVIQVPIKDTIDSLQDSLFKINGRRWSITSKVEYFHILPYARM
Sbjct: 241 KSAVCFYVIQCTRSATEDVIQVPIKDTIDSLQDSLFKINGRRWSITSKVEYFHILPYARM 300
Query: 301 MLIWFHGVTSTNSLRVIGGAKVDENLNKPERIDVMRTLEIQDNQDGASANNLNKGTSTYG 360
MLIWFHGVTSTNSLRVIGGAKVDENLNKPERIDV RTLEIQDNQDGASANNLNKGTSTYG
Sbjct: 301 MLIWFHGVTSTNSLRVIGGAKVDENLNKPERIDVTRTLEIQDNQDGASANNLNKGTSTYG 360
Query: 361 EGLERLPDKTNYISSLNDVMFRPQNSNVDDLVPSYPVEKKKDVPNTSQVFFSYAKKKNAR 420
EGLERLPDKTNYISSLNDVM RPQNSNVDDLVPSYPVEKKKDVPNTSQVFFSYAKKKNAR
Sbjct: 361 EGLERLPDKTNYISSLNDVMCRPQNSNVDDLVPSYPVEKKKDVPNTSQVFFSYAKKKNAR 420
Query: 421 QADNRDAVMIPCMVNEPNASESGIKVKDRILATNPCHAECSGEKIASGNLSDNISLDQYR 480
QADNRDAVMIPCMVNEPNASESGIKVKDRILATNPC AECSGEKIASGNLSDNISLDQYR
Sbjct: 421 QADNRDAVMIPCMVNEPNASESGIKVKDRILATNPCLAECSGEKIASGNLSDNISLDQYR 480
Query: 481 NGDHALVTCQSNTEHLTKLQEIIISKETALSQAAIKALSRKRDKLSHQQRIIEDKIARCD 540
NGDHALVTCQSNTEHLTKLQEIIISKETALSQAAIKALSRKRDKLSHQQRIIEDKIARCD
Sbjct: 481 NGDHALVTCQSNTEHLTKLQEIIISKETALSQAAIKALSRKRDKLSHQQRIIEDKIARCD 540
Query: 541 KNMQTILRGDEDGLVIKLDSVIECCNDVCIRSIAEDRSYQCFEENCSSQYGTSKRLSEAI 600
KNMQTILRGDEDGLVIKLDSVIECCNDVCIRSIAEDRSYQCFEENCSSQYGTSKRLSEAI
Sbjct: 541 KNMQTILRGDEDGLVIKLDSVIECCNDVCIRSIAEDRSYQCFEENCSSQYGTSKRLSEAI 600
Query: 601 LCVQNPCQELDDICRKNNWILPVYGVSTSDGGFQANVYVKGMDFAYSSCSELCPDPCEAR 660
LCVQNPCQELDDICRKNNWILPVYGVSTSDGGFQANV+VKGMDFAYSSCSELCPDPCEAR
Sbjct: 601 LCVQNPCQELDDICRKNNWILPVYGVSTSDGGFQANVFVKGMDFAYSSCSELCPDPCEAR 660
Query: 661 KSAATKMLGQLWTMASQTKQV 682
KSAATKM GQLWTMASQTKQV
Sbjct: 661 KSAATKMFGQLWTMASQTKQV 681
BLAST of CmoCh08G002120 vs. NCBI nr
Match:
XP_023514125.1 (uncharacterized protein LOC111778491 isoform X2 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1328.9 bits (3438), Expect = 0.0e+00
Identity = 672/681 (98.68%), Positives = 674/681 (98.97%), Query Frame = 0
Query: 1 MSATGVCPTEDAIFTLLDYLVEPMLPAKSLSRENPPQSLLQSVAKQVHAVVLLYNYYHRK 60
MSA GVCPTEDAIFTLLDYLVEPMLPAKSLSRENPPQSLLQSVAKQVHAVVLLYNYYHRK
Sbjct: 1 MSAPGVCPTEDAIFTLLDYLVEPMLPAKSLSRENPPQSLLQSVAKQVHAVVLLYNYYHRK 60
Query: 61 QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQNSDDIELENPENQLSPAEKAIMDACDIA 120
QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQNSDDIELENPENQLSPAEKAIMDACDIA
Sbjct: 61 QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQNSDDIELENPENQLSPAEKAIMDACDIA 120
Query: 121 TCLQASKDDDVEGWPLSKVAVLLIDSKRESCHLLFSVITQGVWSVIEQDLDTSECQPETV 180
TCLQASKDDDVEGWPLSKVAVLLIDSKRESCHLLFSVITQGVWSVIEQDLDTSECQPETV
Sbjct: 121 TCLQASKDDDVEGWPLSKVAVLLIDSKRESCHLLFSVITQGVWSVIEQDLDTSECQPETV 180
Query: 181 DEEKHVNKKKRVIKKPSKEGPVDEIKTQQLAYSTVRKATGINQTDLKILESHVVYSHSKA 240
DEEKHVNKKKRVIKKPSKEGPVDEIKTQQLAYSTVRKATGINQ+DLKILESHVVYSHSKA
Sbjct: 181 DEEKHVNKKKRVIKKPSKEGPVDEIKTQQLAYSTVRKATGINQSDLKILESHVVYSHSKA 240
Query: 241 KSAVSFYVIQCTRSATEDVIQVPIKDTIDSLQDSLFKINGRRWSITSKVEYFHILPYARM 300
KSAV FYVIQCTRSATEDVIQVPIKDTIDSLQDSLFKINGRRWSITSKVEYFHILPYARM
Sbjct: 241 KSAVFFYVIQCTRSATEDVIQVPIKDTIDSLQDSLFKINGRRWSITSKVEYFHILPYARM 300
Query: 301 MLIWFHGVTSTNSLRVIGGAKVDENLNKPERIDVMRTLEIQDNQDGASANNLNKGTSTYG 360
MLIWFHGVTSTNSLRVIGGAKVDENLNKPERIDV RTLEIQDNQDGASANNLNKGTSTYG
Sbjct: 301 MLIWFHGVTSTNSLRVIGGAKVDENLNKPERIDVTRTLEIQDNQDGASANNLNKGTSTYG 360
Query: 361 EGLERLPDKTNYISSLNDVMFRPQNSNVDDLVPSYPVEKKKDVPNTSQVFFSYAKKKNAR 420
EGLERLPDKTNYISSLNDVM RPQNSNVDDLVPSYPVEKKKDVPNTSQVFFSYAKKKNAR
Sbjct: 361 EGLERLPDKTNYISSLNDVMCRPQNSNVDDLVPSYPVEKKKDVPNTSQVFFSYAKKKNAR 420
Query: 421 QADNRDAVMIPCMVNEPNASESGIKVKDRILATNPCHAECSGEKIASGNLSDNISLDQYR 480
QADNRDAVMIPCMVNEPNASESGI VKDRILATNPC AECSGEKIASGNLSDNISLDQYR
Sbjct: 421 QADNRDAVMIPCMVNEPNASESGIIVKDRILATNPCLAECSGEKIASGNLSDNISLDQYR 480
Query: 481 NGDHALVTCQSNTEHLTKLQEIIISKETALSQAAIKALSRKRDKLSHQQRIIEDKIARCD 540
NGDHALVTCQSNTEHLTKLQEIIISKETALSQAAIKALSRKRDKLSHQQRIIEDKIA+CD
Sbjct: 481 NGDHALVTCQSNTEHLTKLQEIIISKETALSQAAIKALSRKRDKLSHQQRIIEDKIAQCD 540
Query: 541 KNMQTILRGDEDGLVIKLDSVIECCNDVCIRSIAEDRSYQCFEENCSSQYGTSKRLSEAI 600
KNMQTILRGDEDGLVIKLDSVIECCNDVCIRSIAEDRSYQCFEENCSSQYGTSKRLSEAI
Sbjct: 541 KNMQTILRGDEDGLVIKLDSVIECCNDVCIRSIAEDRSYQCFEENCSSQYGTSKRLSEAI 600
Query: 601 LCVQNPCQELDDICRKNNWILPVYGVSTSDGGFQANVYVKGMDFAYSSCSELCPDPCEAR 660
LCVQNPCQELDDICRKNNWILPVYGVSTSDGGFQANV VKGMDFAYSSCSELCPDPCEAR
Sbjct: 601 LCVQNPCQELDDICRKNNWILPVYGVSTSDGGFQANVLVKGMDFAYSSCSELCPDPCEAR 660
Query: 661 KSAATKMLGQLWTMASQTKQV 682
KSAATKMLGQLWTMASQTKQV
Sbjct: 661 KSAATKMLGQLWTMASQTKQV 681
BLAST of CmoCh08G002120 vs. NCBI nr
Match:
XP_023514123.1 (uncharacterized protein LOC111778491 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023514124.1 uncharacterized protein LOC111778491 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1327.8 bits (3435), Expect = 0.0e+00
Identity = 671/681 (98.53%), Positives = 674/681 (98.97%), Query Frame = 0
Query: 1 MSATGVCPTEDAIFTLLDYLVEPMLPAKSLSRENPPQSLLQSVAKQVHAVVLLYNYYHRK 60
MSA GVCPTEDAIFTLLDYLVEPMLPAKSLSRENPPQSLLQSVAKQVHAVVLLYNYYHRK
Sbjct: 1 MSAPGVCPTEDAIFTLLDYLVEPMLPAKSLSRENPPQSLLQSVAKQVHAVVLLYNYYHRK 60
Query: 61 QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQNSDDIELENPENQLSPAEKAIMDACDIA 120
QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQNSDDIELENPENQLSPAEKAIMDACDIA
Sbjct: 61 QHPHLEFLSFEAFCKLAVVVKPALLSHMKLMQNSDDIELENPENQLSPAEKAIMDACDIA 120
Query: 121 TCLQASKDDDVEGWPLSKVAVLLIDSKRESCHLLFSVITQGVWSVIEQDLDTSECQPETV 180
TCLQASKDDDVEGWPLSKVAVLLIDSKRESCHLLFSVITQGVWSVIEQDLDTSECQPETV
Sbjct: 121 TCLQASKDDDVEGWPLSKVAVLLIDSKRESCHLLFSVITQGVWSVIEQDLDTSECQPETV 180
Query: 181 DEEKHVNKKKRVIKKPSKEGPVDEIKTQQLAYSTVRKATGINQTDLKILESHVVYSHSKA 240
DEEKHVNKKKRVIKKPSKEGPVDEIKTQQLAYSTVRKATGINQ+DLKILESHVVYSHSKA
Sbjct: 181 DEEKHVNKKKRVIKKPSKEGPVDEIKTQQLAYSTVRKATGINQSDLKILESHVVYSHSKA 240
Query: 241 KSAVSFYVIQCTRSATEDVIQVPIKDTIDSLQDSLFKINGRRWSITSKVEYFHILPYARM 300
KSAV FYVIQCTRSATEDVIQVPIKDTIDSLQDSLFKINGRRWSITSKVEYFHILPYARM
Sbjct: 241 KSAVFFYVIQCTRSATEDVIQVPIKDTIDSLQDSLFKINGRRWSITSKVEYFHILPYARM 300
Query: 301 MLIWFHGVTSTNSLRVIGGAKVDENLNKPERIDVMRTLEIQDNQDGASANNLNKGTSTYG 360
MLIWFHGVTSTNSLRVIGGAKVDENLNKPERIDV RTLEIQDNQDGASANNLNKGTSTYG
Sbjct: 301 MLIWFHGVTSTNSLRVIGGAKVDENLNKPERIDVTRTLEIQDNQDGASANNLNKGTSTYG 360
Query: 361 EGLERLPDKTNYISSLNDVMFRPQNSNVDDLVPSYPVEKKKDVPNTSQVFFSYAKKKNAR 420
EGLERLPDKTNYISSLNDVM RPQNSNVDDLVPSYPVEKKKDVPNTSQVFFSYAKKKNAR
Sbjct: 361 EGLERLPDKTNYISSLNDVMCRPQNSNVDDLVPSYPVEKKKDVPNTSQVFFSYAKKKNAR 420
Query: 421 QADNRDAVMIPCMVNEPNASESGIKVKDRILATNPCHAECSGEKIASGNLSDNISLDQYR 480
QADNRDAVMIPCMVNEPNASESGI VKDRILATNPC AECSGEKIASGNLSDNISLDQYR
Sbjct: 421 QADNRDAVMIPCMVNEPNASESGIIVKDRILATNPCLAECSGEKIASGNLSDNISLDQYR 480
Query: 481 NGDHALVTCQSNTEHLTKLQEIIISKETALSQAAIKALSRKRDKLSHQQRIIEDKIARCD 540
NGDHALVTCQSNTEHLTKLQEIIISKETALSQAAIKALSRKRDKLSHQQRIIEDKIA+CD
Sbjct: 481 NGDHALVTCQSNTEHLTKLQEIIISKETALSQAAIKALSRKRDKLSHQQRIIEDKIAQCD 540
Query: 541 KNMQTILRGDEDGLVIKLDSVIECCNDVCIRSIAEDRSYQCFEENCSSQYGTSKRLSEAI 600
KNMQTILRGDEDGLVIKLDSVIECCNDVCIRSIAEDRSYQCFEENCSSQYGTSKRLSEAI
Sbjct: 541 KNMQTILRGDEDGLVIKLDSVIECCNDVCIRSIAEDRSYQCFEENCSSQYGTSKRLSEAI 600
Query: 601 LCVQNPCQELDDICRKNNWILPVYGVSTSDGGFQANVYVKGMDFAYSSCSELCPDPCEAR 660
LCVQNPCQELDDICRKNNWILPVYGVSTSDGGFQANV VKGMDFAYSSCSELCPDPCEAR
Sbjct: 601 LCVQNPCQELDDICRKNNWILPVYGVSTSDGGFQANVLVKGMDFAYSSCSELCPDPCEAR 660
Query: 661 KSAATKMLGQLWTMASQTKQV 682
KSAATKMLGQLWTMASQTKQ+
Sbjct: 661 KSAATKMLGQLWTMASQTKQL 681
BLAST of CmoCh08G002120 vs. TAIR 10
Match:
AT1G05950.1 (unknown protein; Has 50 Blast hits to 45 proteins in 14 species: Archae - 5; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 34; Viruses - 0; Other Eukaryotes - 7 (source: NCBI BLink). )
HSP 1 Score: 332.8 bits (852), Expect = 6.5e-91
Identity = 238/673 (35.36%), Positives = 367/673 (54.53%), Query Frame = 0
Query: 4 TGVCPTEDAIFTLLDYLVEPMLPAKSLSRENPPQSLLQSVAKQVHAVVLLYNYYHRKQHP 63
T CPTEDAI LL+ LV+P+LP+K + + P S+ +SVAKQVHAVVLLYNYYHRK +P
Sbjct: 14 TDSCPTEDAIRALLESLVDPLLPSKP-TDDLPSTSIRESVAKQVHAVVLLYNYYHRKDNP 73
Query: 64 HLEFLSFEAFCKLAVVVKPALLSHMKLMQNSDDIELENPENQLSPAEKAIMDACDIATCL 123
HLE LSFE+F LA V+KPALL H+K E Q EK I+DAC ++ L
Sbjct: 74 HLECLSFESFRSLATVMKPALLQHLK--------EDGGVSGQTVLLEKVIVDACSLSMSL 133
Query: 124 QASKDDDV-EGWPLSKVAVLLIDSKRESCHLLFSVITQGVWSVIEQDLDTSECQPETVDE 183
AS D + P+ +VAVLL+DS+++SC+L S ITQGVWS++E+ ++
Sbjct: 134 DASSDLFILNKCPIRRVAVLLVDSEKKSCYLQHSSITQGVWSLLEKPIEK---------- 193
Query: 184 EKHVNKKKRVIKKPSKEGPVDEIKTQQLAYSTVRKATGINQTDLKILESHVVYSHSKAKS 243
+K ++E +E Q++A++ V++ATG+N D+ ILE H+V S S+ K+
Sbjct: 194 -----------EKAARENQKEEGVFQKVAFAVVKEATGVNHKDIVILERHLVCSLSEEKT 253
Query: 244 AVSFYVIQCTRSATEDVIQVPIKDTIDSLQDSLFKINGRRWSITSKVEYFHILPYARMML 303
AV FY+++CT S + + P+++ + +Q LF+ + W++ S VEYFH+LPYA ++
Sbjct: 254 AVRFYIMKCT-SQDKFSGENPVEEVLSCMQGPLFEKSFSDWTMNSIVEYFHVLPYATLIE 313
Query: 304 IWFHGVTSTNSLRVIGGAKVDENLNKPERIDVMRTLEIQDNQDGASANNLNKGTSTYGEG 363
WF T + V +++ + ++D + E+ D + L + Y
Sbjct: 314 DWFSRRGDTEFVIEKEPEAVCDDI-ESNKVDATKESEVSDIFERREKAALKR---RYEIK 373
Query: 364 LERLPDKTNYISSLNDVMFRPQNSNVDDLVPSYPVEKKKDVPNTSQVFFSYAKKKNARQA 423
+++ ++ + R QN L S K+ +V + + V A +A
Sbjct: 374 AKKVAALLSHPGARGKATTRLQNRY---LKGSMSGAKEPNVHSETVV---------ALKA 433
Query: 424 DNRDAVMIPCMVNEPNASESGIKVKDRILATNPCHAECSGEKIASGNLSDNISLDQYRNG 483
N M PC N N + G +V A++P +++ L ++ N
Sbjct: 434 KNVGNEMSPCKDNYSNGEKGGFEV-----ASDP-------KELKERGLQRKKAVPDRLNS 493
Query: 484 DHAL----VTCQSNTEHLTKLQEIIISKETALSQAAIKALSRKRDKLSHQQRIIEDKIAR 543
H L + ++ +L +LQ ++SK T+LS+ A+K L KRDKL+ QQR IED+IA+
Sbjct: 494 IHKLNSTPASAHNSNPNLEELQTSLLSKATSLSETALKVLLCKRDKLTRQQRNIEDEIAK 553
Query: 544 CDKNMQTILRGDEDGLVIKLDSVIECCNDVCIRSIAEDRSYQCFEENCSSQYGTSKRLSE 603
CDK +Q I +GD + ++L++V+ECCN+ R R+ Q + + Q +LSE
Sbjct: 554 CDKCIQNI-KGDWE---LQLETVLECCNETYPR-----RNLQESLDKSACQSNKRLKLSE 613
Query: 604 AILCVQNPCQELDDICRKNNWILPVYGVSTSDGGFQANVYVKGMDFAYSSCSELCPDPCE 663
+ ++ CQ LDDIC NNW+LP Y V+ SDGG++A V + G A + E D E
Sbjct: 614 TLPSTKSLCQRLDDICLMNNWVLPNYRVAPSDGGYEAEVRITGNHVACTIHGEEKSDAEE 618
Query: 664 ARKSAATKMLGQL 672
AR+SAA +L +L
Sbjct: 674 ARESAAACLLTKL 618
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
A0A6J1HAN9 | 0.0e+00 | 100.00 | uncharacterized protein LOC111461089 OS=Cucurbita moschata OX=3662 GN=LOC1114610...
A0A6J1KZE5 | 0.0e+00 | 97.06 | uncharacterized protein LOC111497732 OS=Cucurbita maxima OX=3661 GN=LOC111497732...
A0A6J1DAH9 | 0.0e+00 | 79.85 | uncharacterized protein LOC111018541 isoform X1 OS=Momordica charantia OX=3673 G...
A0A1S3BE29 | 0.0e+00 | 79.39 | uncharacterized protein LOC103488666 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A6J1EPE2 | 8.6e-293 | 75.50 | uncharacterized protein LOC111436360 isoform X1 OS=Cucurbita moschata OX=3662 GN...
Match Name | E-value | Identity | Description
XP_022960330.1 | 0.0e+00 | 100.00 | uncharacterized protein LOC111461089 [Cucurbita moschata] >XP_022960331.1 unchar...
KAG6592994.1 | 0.0e+00 | 98.68 | hypothetical protein SDJN03_12470, partial [Cucurbita argyrosperma subsp. sorori...
KAG7025400.1 | 0.0e+00 | 98.53 | hypothetical protein SDJN02_11895 [Cucurbita argyrosperma subsp. argyrosperma]
XP_023514125.1 | 0.0e+00 | 98.68 | uncharacterized protein LOC111778491 isoform X2 [Cucurbita pepo subsp. pepo]
XP_023514123.1 | 0.0e+00 | 98.53 | uncharacterized protein LOC111778491 isoform X1 [Cucurbita pepo subsp. pepo] >XP...
Match Name | E-value | Identity | Description
AT1G05950.1 | 6.5e-91 | 35.36 | unknown protein; Has 50 Blast hits to 45 proteins in 14 species: Archae - 5; Bac...