Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, polypeptide, CDS
CGTCATCTGCTCGTTGATTTTGGTTTCATAGTTGTAAACGCTTATGTGATCAGTTTCATTCTTGTGTACTCTCATCGATCTGAAAAGCTCGTAGGAATCTGATTTCTGGTATGAATTTTGCATCTCTGCCTTAAGAACACAAATTTTCCACGAATCGAACGAAGAAGATACGTTGATTTGTTAGGTTTCTGTTTTTTTTCTTTATAAATCGAAGCAGAGGTAGGGTTTTGAGTTGTTGAGCTGCGAAAAGTTGTTGGAGTTTTCTGAAAATTAAATTGGATGTGATTTCGATGTTTAGCTGGTCGAACGCGAATATATCTGACTAAGACGAACAGATTGATTGAAGATCGAAAAATATATTAAGCTTCTGAGATCTCGGAAGCTTGGTTGTACCCGAAGTGAAGAGATTTTTTCATTATTTCTTGTTGTTTCTGTAATTAGAATGTCGTACTCGTTTCTTCTTTTGTGGTTATAATTTTTCTCAGTATCTATTCATTTGGCAGGTTGGGTTTAAGCGTACTGTCATGTCAACTAGTGATTATAATGCAATTGTTCCTATCAAGAAGAGGAGGTTTCCTTCAATGCAATCTCCTCCGCCCAAAGAAATATCTTCTCTTCCATTAGTAGATGATAACATAGCAAAAGTAGACGAGCCTTGCGTATCAGATGGTCCAACAGTTTCCAATTCTAGTACAATCACAACTTCTGAATTTTCAGAGAAGAAGATTTCATTTTCTGAGGACGGTAAACGGAAATCTGATTTATGCAATATGAATATGGTCCAAAGCATTATTGGACCTTCCAGAGTCGAGTTTCAGGAAAATGATGTGTGTTCTACTGGCTGCGTGGAAAATAAAGAAACATGTATGGTGAATGAAAATCATGCGCTTGTTCTGCATGAGAAACCAGAGTTCAAGTTACCACATTCTGATGCTAACTCTAACCCTGGACTTTGTGCCGAAAAGGAAAGTGATGAGATTGACAGAAAACAACTTGATAGATTAGAGTTTTCAACTTCTGTAGCGAAAAAAGAAGCCGAATTATCAATTGGTTCGAAGGAACATCTTGTTCCAGACTCAGTTCTTGAAGGGAGTGACTTGAAATCTCTGAAGCAGATTAATTTAGAACCAGGGTTATTGAACTTAAGTTTAAGCAAGGAAGGAAGTCTCGATCAGCCTCTCACTGTAAATGTTGGGTCTAGTTATGATGGTTCTATTCAGGAGTCAAATAGGGAAAATTGGGATCTAAATACCTCGATGGAGTTTTGGGAAGGCTGTTCAAGTGGCGATCCTCCCGAGCATGTTCCAGCAGTTCAGACAAACACGGTTGTCACTATGCACAGATTCTCAACAGAAATGGTTAATACTGATACTCTGTCAGGAAAGCTAACTCCTTTAGATGACAGTGATCATCTTCATCTAAGTCTTAGTTCATCAGATCATAGGCATGTAATAAGTCAGGAACAAAGTTCATTTGTCAAGTTAGGCTTTAGGAAAACAAGTCCTTCTTTAAGCTCAACAGGAAGAGGTTTGCAGTTTGATGATCTTAACGGTGCACTAAAAGTCGTAAAGCCAGAGCCATTTGTTGAGGCTTCCAAACTCGAGTCTAAAAGTGATGAAGTTAATGTGCTGGGATTATCAGACAGTGCTATTGTGAAGCGCGAATTTCTTCAAATTCCCAATGCTTCAGATATTTATATACCAATGAACACAGTTAAGGCCAAGTCTGTTAACTCTGAATCAAATTACGAAAGCAAACAGGTAGCACTCGAAACATTAGGTGGTAGATTAGATTTGGTAGCTAAGCAAGTTCTTCCAGAGGTAGATAGTTCTTGTCCTGCACCGATGCCTTTTGTGGCAGAGATGACTGAAGCAGCTGGAAACTCTTGTTCAACTGATTTGATCACAGATGGAGGCATGTCAAACCATTCAGAATTGCAAACTCCTACTGAAGAACATCTTAATTTGAAAGTGCACGAAGGAGCATATCGTTGTGGTGGT
GAACTCGTCGATTCAGAAATGACTGATATAAGTAAGGATCCAGGTTCCAAAGATTTCAATAGTCCTATTATAAAGCCTATAGCAATGCCTAGAAATCCTTCTCGTACAAATGATTCAATTATAGAGGCAAACATGTCAAGCCCTTCCGAATTACATATTCCAACTACAGGACCTCTTAATACGAAAGTGCACCAAGCGGGATATGGCTGTGACGGTGGACTTGTGAATTCGGTAATGACAGATGTAAGTAAGGATACATGTTCCAAAGATTCCAGTAGTTCTGTTATAAAACCAGTCATTGTTGAAGATGAAAATCAGAATAACCCGCTCTGGCGTCCTTCGACACACACGAATGAGCAGTGCTCTAGTTTGCAGGGAGGTGAGGAAAGTTCTGTAAATGATGAGGAAAAGATCAGTCTATCAGCCGATTTATTAGAAGAAGATCCATATAGTTCTGAATATGAATCAGATGGTAAGTTGGATGTAAATGAGGCCATGGATGCAGTTGATAACGATATAGAAGAAGATTATGAAGACGGAGAGGTTCGGGAACCGACATTGACGACTCAAGTAGAAAGCAGTATATGCGAGACGAAAAAAGTAAAAAATTTTGATCATGGTGATTCTAGCAATGGACTGCCTGGTTCTGATTGCTGTTCCTCCTTGGTTTCTGTTAAGCAGGAAAATAAATTAGAAATCCTCGATGTTAAACGAGAAGACAATCTTCATTCTGTAACTTCGAATCAATCGTCCGAGCAAGAACGATCGAAAGAGTTGCCTGTCGAAGAGCATACCACTAGAGTGTGTTTGAACAAGGCCAACAAGGCTAAAACATCTGCCTTAGAGGACCAGGAAACTTCCCCTGAGAAAGCCAGTAATGGAATCGAAGAATCGATTACGACAGTTTCTCAGAGCGACGCAGAGAAGGTTAAAACAGTAGATATCGTGCGAAACGACAATCCAGCTTTGCCAAATGTCGAGCCTTTAAATGATGATGATGTGACTGACGATATTACTCGTGGCAGTAAGCATAGCCGCATTGTTAGTCCCTGCAAACCTTCATCGTCTTCACTTCCTAGTAAAACGAAATCGAGTTTGGCGAGGTCGGTTTTAACACAAACTGATAGAGAACGAATACCCGACATGGGGCATGAAGGGGAAAAATTACATCCACAAGGAAGGTGATTATCTATTTCTTAGCTGTGCATTAATTTGATTACATTCTGCATTGCTTTGCTGATTTCTTTCTATTTTCTAGAGATGAACCATATAGGGACGTTTTCCAGAGATTTTATGTGAATAGACATCAGAATCTATCACCCCAAACCAATTTTAGCCGTAGAAGAGGTAGATTCACTATCCGGATTAACTCTGTCCAAGGTGAATGGGATTTTAATCCAACAATCTCTCCAGGAAATTACAATGATCAAGTACCACCACCCTATGATGCCCGTAGACGTAAATACATGCCTGCTGTTTCTGATGATGATATTGATCAAAACCATTATAAAATGAAACCCGATGGTCCATTTCGTAGCGCTGGTGATCATCGAGGTAGACAGATATTAGACGACGAAGGCCCCCTTTTTTGTCATATGGCCTCTAGGAGGAAGTCGCCTGGTCGAAGAGATGGGCCTCCTCCAGTGCGAGGTGTTAAAATGGTACACAGAATGCCTAGAAACATCAGTCCAAGTAGATGTAATCGTGAACGTGGATCGGAACTGGTTGGACCGCGACACGGTGAGAAGTTCATGAGGACATTTGAAGACGAAACTATGGATCCATTATACGCCCACCCTCAACCTTCGTTTGAAGTAGATCGGCCTCCTTTTATCCGAGACCGAAGGAACTTTCCTATTCAAAGAAAAAGTTTTCAAAGAGTTGATTCTAAATCTCCAGGAAGGTCCAGAGGACGCTCTCCTTCCCAATGGTTTCCATCCAAAAGAAAGTCTGAGAGGTTCTTTGGACATCCAGAAATGGCACGGCGAAGTCCTCCACCC
GGTTATAGGATGAGATCGCCCGATCAACCTCCTCAAATCCATGGAGATATGCCAGATCGAAGACATGGTTTCCCGTTTCCGTCACTGCCACCTAATGATTTGAGGGATATGGGTTCTGCTCGTGACCATGGCCATATGAGACCAGGTCTACGAAGTCGAAACCGAACCGACAGAATGTCTTTTAGAAACAGGAGGTTTGAAGATATGGATCCTCGAGATAATAGGATAGAGAGTAACGAATACTTCGATGGACCTGTACATCCTGGTCAAATGAATGAACTGATTGATGATGGCAACGACGACGATCGAAGAAGGTTTTCGGACAGACACGAACATCTTCACCAATTCCGGCCACAATGTAATGATTCAGACGGTGAAAACTATCACAACGATGCAGACGAAAGAGCGAGGCCTTACAGATACTGCACGGAGGATGAAGAAGAGTTCCATGAAAGAGGTAAGATGAGGGAGAGGGAATTTGATAGACGTGTAAAGAACCAACCAGAAAATTTAGGTAGACGAACAGTGATTGAAGAACATGAAGTAGAAGAATACAGACATGGTCATGGTCGGCAGATGTGGAATGAACATCATCACCATCATCATCATCATAGCTTTGAAGACATTTCCCGAATGAAAAGAAAAAGAATTTGAAATTTCCTTTTCAATTCAGCTCAGAAGCCAGGTGAGATTCAGAAGAAGCTGCATTGTTCTTGTGCTTAGGAGGTGTTTGGATCCCTTTTATATAGAAAAATGGCCTGCTGCTCATTGGATGGATGGACTTATCCCATCCATATCTTATTAAATATTAAAACAATTCATTCACTTAAAAATATAATTCAATTCAATTTCCTAATTAGTCTCTAGGTGATTCTGATACGTTCAAGAAGAGAATTTAACCGCAAACTTCAAAATCCATACCTCCTTCCAGGTACTCATGGAGCTTGATGATCAAATAAACAATAATGTAAGATTCTCTCAGGTTTCTTCATATATAATTCAAAAGCGTTGGTTGAATCTTAAGTGCGGAAAAGAGAGAAATCCGGAAGATTCAAGAACAAGCAGATGAACAAAAGACTAAATTATTGAAGTTAACTTTTGATTTCATCTTGCGGCCAAACGCTAACCTTCTCCTTCTCGTATTTGATATACAGAGCAGCCGCCGCCGCTACGAAGTATACTTGAATCAGTATAATCGCTGGCGTGTGGTAATTGATGACGCCTTTTCCAGTGGCTAATTCCACTAGAACCAGTGCCGATAATCCGAGTATCGCAATCCATCCATTGATCCTCTCGCTGTACGGACTGAATCCGAATTCTACCTTAAGTCCGCCGGAAACCGCTTCCTTCAGCGGCACTGGCGGAAGATACACTGACGAGGCGGCGGCGGTTGATCCTTTCCTGGACTTCCGAGCCTTACGAATCTCTGCCTTCTTTCCCGTTCCGCGTCGGAGTTGGAGTCGGGCCAGTCGGTCCTCGAAATCGTCGCCGTCCGATCTAGGTTCCGAATCGGCGGCAGGAGATTCCGGATCTGCTGCTCTGGTTACGAAGGTGAGAGACTTCGGCTTGTGAGGCTTGGAATTGGAAGCGAAAGCGAGGGAGTTTGGAAATTGCAGTGAGGAAGATGGAATTGCCGCCATGGAAGGAGAATGCTGGGCTCGAGGATAAACTCGCCCCTCTGCTGTAACCGTCTGGACCTTATATTTTTTATTTTTCTTTTTATATTTTATTTTTCCGCTCTTTAAAATTAGTGGTTTATTTTTTTCCTTCTTTTTTTGAATTAAATAATTTAGCGGTTTCTATTTGTTCCGTTTTTATATTTTTGTTTATACGGTAATGACTAGAAATGATAATTATTTAAAGTTTAGGGACTAAAAAAATAAATTTAAAAATATTAAGACAAATAAATATTTTAAAAGTATTGGTGTGAAGTTTTAGAAACCAAATTCGTTAGTTCGCTCTCCAGGTCAATGCCATCGCCGATTTGGATTTCTCGTGAT
TCTTCCTTGATTCCATTCTTACTCTCTCTCGGTATCACCAGCAAAACGAGCATGTATTAGATCTGCGTTTCGGTATTGTTCGCTCTCTGCCTTTAGGATAGTAATCTCTTTGAATCGGTTGAGTGTGTTTCTTTCGAGGCAGAGCGGAGTTTTGTGGAAGTTTGAGTCGTTGTTGAGGATCAGCTGTTCAAATACTGGAATGTTTTCTACCAGTTCAATCGATTTCCTTATCAAGTGATTCTGGTGATTTGATGATCGGTTAGATGTGGAATGTGTTTGCGTAGATGGAGGTTTATGCTTCTTTTATTTTGTTCCTTTTAATTTTATCGTCAATTTAGTGTTGTTCGTTACGATTCTGGAATCTTTTGTTCTTTCATGAGATGGATGGAGAAGTTTTTGCGTGGATATTCTGCATAGTTGCTCTGAAATTTATTAACATTATGCCGAAAATTTCACTGCTGGAGTTTGATATACGGTTCATTGTGCTGTGTCTCCCCTGATCCAATCTCGAGGCGTGGATCATCTAAATTTTTGTGGTTAAAATTTGAACACAGGAACCTCTCATATGCTGGATTTAAGATTTGGATGCTGGTTTTATGATATTTTCACGATGCAACAGTGCTGCATCGCTAATATCTTTAGTTTTCGCTTGAGATTCATTGATTATTTCGACACAATGCCTGTCATATGCTATCTCGCATACTGAATCTTTTTTAAGATTACTAATGTATTGGTATCCATACGGTGCAGAAATCGGAAGTGAATTCTAAGAAGTTAGTGAGATTTTGGTGTTGGATTGAAGTGGTAAAAGATCAAGAAACTGTATTAAGCCGAGGTTTTGGAATAATGGCGGTACCAGAAAGTGAAGAGGTGCTCAACATTTTTCTTTGCTGTTTTCTGGATTTTGAAGCTCGTGCAAGTTTCTTCTTATAGGGTTTTATAATTATTCAGTAAAATGTTCCATTGGCACTATGCAGGTTGGTTTTAAGCGCATTGGATTCTCAGTTAGTGATTATGATGCAAATCTTCCTATCAAGAAAAGGAGATTTCCGGTAGTGCAGATCTCTCCATCTCCATCTGAAGGTATATCTTCATTCCATCCAGATGGAAATTTATTGAAGATTGAGCGGCCATCTCCACCTAAAAAGCTATCTTCATTTAATCCCGATGAAAATTCGTTAGAGGTCGAACAGCCGAGTCTATCTGTGACAATAGTTTCAAGTTCTAGTGCAGACACATGTTATGGGTTGTCAAACAGGAACCAGGACTGTGTTTCTAATGAGAATAAACGAAAATCTGATACTCATTCATGCTATGTGGATATGGTCCAGAACGATATTGGGATGCCAGGAGTCGAGTTTCCGGGACCCGGTTTGGGAGGACATGAAGATAAGTCCTTGGTAACTGAAAAACACTCCGTTCATCGATCACCGGAGATCTACGGTGAGTTGAAGTTATCATCAACTGGCGTCGACTCGGATCCTCTTGGTAGTAACAAAGAGGAAGAAATTGATGTAAAAATGCCTGAAGAAAAGTGCAGCTCTTCAATTTGTCAAGTTGAAGGAGGAGCTGAAGTATCAGAGAAATTGGTTTCTTACAAGAGTGACCTGAATAAGCAGAATTCTTTGGAGCCTGTGTTAATGGACTTGTCTTTAAACAAGCAAGGAAGTAGCTGCCATTGTGTCAAAGGTAACGTAGGGTCTGATTGCGATGATTGAATCAGATGGTAGCTGGAATATTGCCGAGGTTGAAGACGACGACGACGATGATAATAACATAGAAGAAGACTATGAAAATGGCGAGGTTCGGGAATCAATGCAAAAAGAGGCCCGTGCTTGTGAGAAAAGAGAAATTGAGCCATTGGATCATGCTGATTGTGATGATAAGAAGATCAATTCTGCTGGATTGCCTGATCATGAATGTTTCACATTAGGCCCTCTGGAACAGGAAACGAAACCTGAAAATCTGGACTCTAAGAGTGAAGACGATGTTCATACTAC
AACTGAAAGTACATCTTGTGAGCAAGAACATGAAGATCTTTGTGTGAAAGAACCACTTGACGTAGAGAATACTATTGGTGAGGATGTAAACAGGCCTATGAAGGCTGCAGGAAGAAGCCAATTATCTCAATATGTTAATAAGGACAAGTTAGAGGGCCACGACACCGCCGATGAAATCGAGGAACTGATTCCGAAATTTTCTCAGGGTGAGATGGAGAAAGCTATTGCTGTAGAGGTAGAGAGCAGAATAGGGATCTAACTTTGCCTACCAATATGTTGGACAAACGATCTGGGGAATGGGACTTTGGTCCCAACTTTTCTCCTGAAACATACAGTGACCAGCAGATAGATTACCATGTTCCTGATCTTGATCACGACCGATATAAAATTATTCCTGATGGTCGATTTGTCGGTGCTAACCGTCGCAGTAGGTCATTGCTGGACAATGAGGGACCTTTTTTTTTTCCATGGACCCTCAAGGAGGAGGTCACCTGGAAGAACTCATGGACCATGGTGGCAAAATGGTTAACAGAATGCCTAGAGATTATAGTCTTAGTAGATGCATGATGAAGTTGGTTCTTTTGATCAACATGTAGGAAAGTTCACTAGGAACTTTGCTGATGACACGGGATCCGATATATCGACGACCTCATCCTGCATACGAATTAGACAGACCTTTGTTCCGGGAAAGAAGGAACTTCTCATTCCAAAGAAGTGATTCTAAGTCTATAGTAAGATCCCGATCTCGCTCTCCGAGCCAATGTCTCTTTGAAAGATCTGATAGGTTTTATGGACGTCCCGACATGACACGTCGAAGATCTCCAAATTATAGGACAGACGGGACGAGATCGCCCGATCAGCATCCTATATGTGCGCATATGACAGGCCAAAGACAAGGATTCTGTTTCCTTTCACCATCTGATGATTTGAGGGATGTTGGTCCTACACCCAACCATGGCCATATGAGATCTATCATTCCTAATAGGAATCAAACTGAAAGATTACCTCTTAGAAACAGAAGTTATGATGCTATAGATCATCAAGTAAGGATAGGGAGCAATGAACTTTTTGATGATCCC
mRNA sequence
CGTCATCTGCTCGTTGATTTTGGTTTCATAGTTGTAAACGCTTATGTGATCAGTTTCATTCTTGTGTACTCTCATCGATCTGAAAAGCTCGTAGGAATCTGATTTCTGGTTGGGTTTAAGCGTACTGTCATGTCAACTAGTGATTATAATGCAATTGTTCCTATCAAGAAGAGGAGGTTTCCTTCAATGCAATCTCCTCCGCCCAAAGAAATATCTTCTCTTCCATTAGTAGATGATAACATAGCAAAAGTAGACGAGCCTTGCGTATCAGATGGTCCAACAGTTTCCAATTCTAGTACAATCACAACTTCTGAATTTTCAGAGAAGAAGATTTCATTTTCTGAGGACGGTAAACGGAAATCTGATTTATGCAATATGAATATGGTCCAAAGCATTATTGGACCTTCCAGAGTCGAGTTTCAGGAAAATGATGTGTGTTCTACTGGCTGCGTGGAAAATAAAGAAACATGTATGGTGAATGAAAATCATGCGCTTGTTCTGCATGAGAAACCAGAGTTCAAGTTACCACATTCTGATGCTAACTCTAACCCTGGACTTTGTGCCGAAAAGGAAAGTGATGAGATTGACAGAAAACAACTTGATAGATTAGAGTTTTCAACTTCTGTAGCGAAAAAAGAAGCCGAATTATCAATTGGTTCGAAGGAACATCTTGTTCCAGACTCAGTTCTTGAAGGGAGTGACTTGAAATCTCTGAAGCAGATTAATTTAGAACCAGGGTTATTGAACTTAAGTTTAAGCAAGGAAGGAAGTCTCGATCAGCCTCTCACTGTAAATGTTGGGTCTAGTTATGATGGTTCTATTCAGGAGTCAAATAGGGAAAATTGGGATCTAAATACCTCGATGGAGTTTTGGGAAGGCTGTTCAAGTGGCGATCCTCCCGAGCATGTTCCAGCAGTTCAGACAAACACGGTTGTCACTATGCACAGATTCTCAACAGAAATGGTTAATACTGATACTCTGTCAGGAAAGCTAACTCCTTTAGATGACAGTGATCATCTTCATCTAAGTCTTAGTTCATCAGATCATAGGCATGTAATAAGTCAGGAACAAAGTTCATTTGTCAAGTTAGGCTTTAGGAAAACAAGTCCTTCTTTAAGCTCAACAGGAAGAGGTTTGCAGTTTGATGATCTTAACGGTGCACTAAAAGTCGTAAAGCCAGAGCCATTTGTTGAGGCTTCCAAACTCGAGTCTAAAAGTGATGAAGTTAATGTGCTGGGATTATCAGACAGTGCTATTGTGAAGCGCGAATTTCTTCAAATTCCCAATGCTTCAGATATTTATATACCAATGAACACAGTTAAGGCCAAGTCTGTTAACTCTGAATCAAATTACGAAAGCAAACAGGTAGCACTCGAAACATTAGGTGGTAGATTAGATTTGGTAGCTAAGCAAGTTCTTCCAGAGGTAGATAGTTCTTGTCCTGCACCGATGCCTTTTGTGGCAGAGATGACTGAAGCAGCTGGAAACTCTTGTTCAACTGATTTGATCACAGATGGAGGCATGTCAAACCATTCAGAATTGCAAACTCCTACTGAAGAACATCTTAATTTGAAAGTGCACGAAGGAGCATATCGTTGTGGTGGTGAACTCGTCGATTCAGAAATGACTGATATAAGTAAGGATCCAGGTTCCAAAGATTTCAATAGTCCTATTATAAAGCCTATAGCAATGCCTAGAAATCCTTCTCGTACAAATGATTCAATTATAGAGGCAAACATGTCAAGCCCTTCCGAATTACATATTCCAACTACAGGACCTCTTAATACGAAAGTGCACCAAGCGGGATATGGCTGTGACGGTGGACTTGTGAATTCGGTAATGACAGATGTAAGTAAGGATACATGTTCCAAAGATTCCAGTAGTTCTGTTATAAAACCAGTCATTGTTGAAGATGAAAATCAGAATAACCCGCTCTGGCGTCCTTCGACACACACGAATGAGCAGTGCTCTAGTTTGCAGGGAGGTGAGGAAAGTTCTGT
AAATGATGAGGAAAAGATCAGTCTATCAGCCGATTTATTAGAAGAAGATCCATATAGTTCTGAATATGAATCAGATGGTAAGTTGGATGTAAATGAGGCCATGGATGCAGTTGATAACGATATAGAAGAAGATTATGAAGACGGAGAGGTTCGGGAACCGACATTGACGACTCAAGTAGAAAGCAGTATATGCGAGACGAAAAAAGTAAAAAATTTTGATCATGGTGATTCTAGCAATGGACTGCCTGGTTCTGATTGCTGTTCCTCCTTGGTTTCTGTTAAGCAGGAAAATAAATTAGAAATCCTCGATGTTAAACGAGAAGACAATCTTCATTCTGTAACTTCGAATCAATCGTCCGAGCAAGAACGATCGAAAGAGTTGCCTGTCGAAGAGCATACCACTAGAGTGTGTTTGAACAAGGCCAACAAGGCTAAAACATCTGCCTTAGAGGACCAGGAAACTTCCCCTGAGAAAGCCAGTAATGGAATCGAAGAATCGATTACGACAGTTTCTCAGAGCGACGCAGAGAAGGTTAAAACAGTAGATATCGTGCGAAACGACAATCCAGCTTTGCCAAATGTCGAGCCTTTAAATGATGATGATGTGACTGACGATATTACTCGTGGCAGTAAGCATAGCCGCATTGTTAGTCCCTGCAAACCTTCATCGTCTTCACTTCCTAGTAAAACGAAATCGAGTTTGGCGAGGTCGGTTTTAACACAAACTGATAGAGAACGAATACCCGACATGGGGCATGAAGGGGAAAAATTACATCCACAAGGAAGAGATGAACCATATAGGGACGTTTTCCAGAGATTTTATGTGAATAGACATCAGAATCTATCACCCCAAACCAATTTTAGCCGTAGAAGAGGTAGATTCACTATCCGGATTAACTCTGTCCAAGGTGAATGGGATTTTAATCCAACAATCTCTCCAGGAAATTACAATGATCAAGTACCACCACCCTATGATGCCCGTAGACGTAAATACATGCCTGCTGTTTCTGATGATGATATTGATCAAAACCATTATAAAATGAAACCCGATGGTCCATTTCGTAGCGCTGGTGATCATCGAGGTAGACAGATATTAGACGACGAAGGCCCCCTTTTTTGTCATATGGCCTCTAGGAGGAAGTCGCCTGGTCGAAGAGATGGGCCTCCTCCAGTGCGAGGTGTTAAAATGGTACACAGAATGCCTAGAAACATCAGTCCAAGTAGATGTAATCGTGAACGTGGATCGGAACTGGTTGGACCGCGACACGGTGAGAAGTTCATGAGGACATTTGAAGACGAAACTATGGATCCATTATACGCCCACCCTCAACCTTCGTTTGAAGTAGATCGGCCTCCTTTTATCCGAGACCGAAGGAACTTTCCTATTCAAAGAAAAAGTTTTCAAAGAGTTGATTCTAAATCTCCAGGAAGGTCCAGAGGACGCTCTCCTTCCCAATGGTTTCCATCCAAAAGAAAGTCTGAGAGGTTCTTTGGACATCCAGAAATGGCACGGCGAAGTCCTCCACCCGGTTATAGGATGAGATCGCCCGATCAACCTCCTCAAATCCATGGAGATATGCCAGATCGAAGACATGGTTTCCCGTTTCCGTCACTGCCACCTAATGATTTGAGGGATATGGGTTCTGCTCGTGACCATGGCCATATGAGACCAGGTCTACGAAGTCGAAACCGAACCGACAGAATGTCTTTTAGAAACAGGAGGTTTGAAGATATGGATCCTCGAGATAATAGGATAGAGAGTAACGAATACTTCGATGGACCTGTACATCCTGGTCAAATGAATGAACTGATTGATGATGGCAACGACGACGATCGAAGAAGGTTTTCGGACAGACACGAACATCTTCACCAATTCCGGCCACAATGTAATGATTCAGACGGTGAAAACTATCACAACGATGCAGACGAAAGAGCGAGGCCTTACAGATACTGCACGGAGGATGAAGAAGAGTTCCATGAAAGAGAGCAGCCGCCGCCGC
TACGAATGGCTAATTCCACTAGAACCAGTGCCGATAATCCGAGTATCGCAATCCATCCATTGATCCTCTCGCTGTACGGACTGAATCCGAATTCTACCTTAAGTCCGCCGGAAACCGCTTCCTTCAGCGGCACTGGCGGAAGATACACTGACGAGGCGGCGGCGGTTGATCCTTTCCTGGACTTCCGAGCCTTACGAATCTCTGCCTTCTTTCCCGTTCCGCGTCGGAGTTGGAGTCGGGCCAGTCGGTCCTCGAAATCGTCGCCGTCCGATCTAGGTTCCGAATCGGCGGCAGGAGATTCCGGATCTGCTGCTCTGGTTACGAAGAGCGGAGTTTTGTGGAAGTTTGAGTCGTTGTTGAGGATCAGCTGTTCAAATACTGGAATGTTTTCTACCATAAAATGTTCCATTGGCACTATGCAGGTTGGTTTTAAGCGCATTGGATTCTCAGTTAGTGATTATGATGCAAATCTTCCTATCAAGAAAAGGAGATTTCCGGTAGTGCAGATCTCTCCATCTCCATCTGAAGGTATATCTTCATTCCATCCAGATGGAAATTTATTGAAGATTGAGCGGCCATCTCCACCTAAAAAGCTATCTTCATTTAATCCCGATGAAAATTCGTTAGAGGTCGAACAGCCGAGTCTATCTGTGACAATAGTTTCAAGTTCTAGTGCAGACACATGTTATGGGTTGTCAAACAGGAACCAGGACTGTGTTTCTAATGAGAATAAACGAAAATCTGATACTCATTCATGCTATGTGGATATGGTCCAGAACGATATTGGGATGCCAGGAGTCGAGTTTCCGGGACCCGGTTTGGGAGGACATGAAGATAAGTCCTTGGTAACTGAAAAACACTCCGTTCATCGATCACCGGAGATCTACGGTGAGTTGAAGTTATCATCAACTGGCGTCGACTCGGATCCTCTTGGTAGTAACAAAGAGGAAGAAATTGATGTAAAAATGCCTGAAGAAAAGTGCAGCTCTTCAATTTGTCAAGTTGAAGGAGGAGCTGAAGTATCAGAGAAATTGGTTTCTTACAAGAGTGACCTGAATAAGCAGAATTCTTTGGAGCCTGTGTTAATGGACTTGTCTTTAAACAAGCAAGGAAGTAGCTGCCATTGTGTCAAAGGTAACGGTCTGATTGCGATGATTGAATCAGATGGTAGCTGGAATATTGCCGAGGTTGAAGACGACGACGACGATGATAATAACATAGAAGAAGACTATGAAAATGGCGAGGTTCGGGAATCAATGCAAAAAGAGGCCCGTGCTTGTGAGAAAAGAGAAATTGAGCCATTGGATCATGCTGATTGTGATGATAAGAAGATCAATTCTGCTGGATTGCCTGATCATGAATGTTTCACATTAGGCCCTCTGGAACAGGAAACGAAACCTGAAAATCTGGACTCTAAGAGTGAAGACGATGTTCATACTACAACTGAAAGTACATCTTGTGAGCAAGAACATGAAGATCTTTGTGTGAAAGAACCACTTGACGTAGAGAATACTATTGGTGAGGATGTAAACAGGCCTATGAAGGCTGCAGGAAGAAGCCAATTATCTCAATATGTTAATAAGGACAAGTTAGAGGGCCACGACACCGCCGATGAAATCGAGGAACTGATTCCGAAATTTTCTCAGGGTGAGATGGAGAAAGCTATTGCTGTAGAGAATAGGGATCTAACTTTGCCTACCAATATGTTGGACAAACGATCTGGGGAATGGGACTTTGGTCCCAACTTTTCTCCTGAAACATACAGTGACCAGCAGATAGATTACCATGTTCCTGATCTTGATCACGACCGATATAAAATTATTCCTGATGGTCGATTTGTCGGTGCTAACCGTCGCAGTAGGTCATTGCTGGACAATGAGGGACCTTTTTTTTTTCCATGGACCCTCAAGGAGGAGGTCACCTGGAAGAACTCATGGACCATGGTGGCAAAATGGTTAACAGAATGCCTAGAGATTATAGTCTTAGAAAGTTCACTAGGA
ACTTTGCTGATGACACGGGATCCGATATATCGACGACCTCATCCTGCATACGAATTAGACAGACCTTTGTTCCGGGAAAGAAGGAACTTCTCATTCCAAAGAAGTGATTCTAAGTCTATAGTAAGATCCCGATCTCGCTCTCCGAGCCAATGTCTCTTTGAAAGATCTGATAGGTTTTATGGACGTCCCGACATGACACGTCGAAGATCTCCAAATTATAGGACAGACGGGACGAGATCGCCCGATCAGCATCCTATATGTGCGCATATGACAGGCCAAAGACAAGGATTCTGTTTCCTTTCACCATCTGATGATTTGAGGGATGTTGGTCCTACACCCAACCATGGCCATATGAGATCTATCATTCCTAATAGGAATCAAACTGAAAGATTACCTCTTAGAAACAGAAGTTATGATGCTATAGATCATCAAGTAAGGATAGGGAGCAATGAACTTTTTGATGATCCC
Coding sequence (CDS)
ATGTCAACTAGTGATTATAATGCAATTGTTCCTATCAAGAAGAGGAGGTTTCCTTCAATGCAATCTCCTCCGCCCAAAGAAATATCTTCTCTTCCATTAGTAGATGATAACATAGCAAAAGTAGACGAGCCTTGCGTATCAGATGGTCCAACAGTTTCCAATTCTAGTACAATCACAACTTCTGAATTTTCAGAGAAGAAGATTTCATTTTCTGAGGACGGTAAACGGAAATCTGATTTATGCAATATGAATATGGTCCAAAGCATTATTGGACCTTCCAGAGTCGAGTTTCAGGAAAATGATGTGTGTTCTACTGGCTGCGTGGAAAATAAAGAAACATGTATGGTGAATGAAAATCATGCGCTTGTTCTGCATGAGAAACCAGAGTTCAAGTTACCACATTCTGATGCTAACTCTAACCCTGGACTTTGTGCCGAAAAGGAAAGTGATGAGATTGACAGAAAACAACTTGATAGATTAGAGTTTTCAACTTCTGTAGCGAAAAAAGAAGCCGAATTATCAATTGGTTCGAAGGAACATCTTGTTCCAGACTCAGTTCTTGAAGGGAGTGACTTGAAATCTCTGAAGCAGATTAATTTAGAACCAGGGTTATTGAACTTAAGTTTAAGCAAGGAAGGAAGTCTCGATCAGCCTCTCACTGTAAATGTTGGGTCTAGTTATGATGGTTCTATTCAGGAGTCAAATAGGGAAAATTGGGATCTAAATACCTCGATGGAGTTTTGGGAAGGCTGTTCAAGTGGCGATCCTCCCGAGCATGTTCCAGCAGTTCAGACAAACACGGTTGTCACTATGCACAGATTCTCAACAGAAATGGTTAATACTGATACTCTGTCAGGAAAGCTAACTCCTTTAGATGACAGTGATCATCTTCATCTAAGTCTTAGTTCATCAGATCATAGGCATGTAATAAGTCAGGAACAAAGTTCATTTGTCAAGTTAGGCTTTAGGAAAACAAGTCCTTCTTTAAGCTCAACAGGAAGAGGTTTGCAGTTTGATGATCTTAACGGTGCACTAAAAGTCGTAAAGCCAGAGCCATTTGTTGAGGCTTCCAAACTCGAGTCTAAAAGTGATGAAGTTAATGTGCTGGGATTATCAGACAGTGCTATTGTGAAGCGCGAATTTCTTCAAATTCCCAATGCTTCAGATATTTATATACCAATGAACACAGTTAAGGCCAAGTCTGTTAACTCTGAATCAAATTACGAAAGCAAACAGGTAGCACTCGAAACATTAGGTGGTAGATTAGATTTGGTAGCTAAGCAAGTTCTTCCAGAGGTAGATAGTTCTTGTCCTGCACCGATGCCTTTTGTGGCAGAGATGACTGAAGCAGCTGGAAACTCTTGTTCAACTGATTTGATCACAGATGGAGGCATGTCAAACCATTCAGAATTGCAAACTCCTACTGAAGAACATCTTAATTTGAAAGTGCACGAAGGAGCATATCGTTGTGGTGGTGAACTCGTCGATTCAGAAATGACTGATATAAGTAAGGATCCAGGTTCCAAAGATTTCAATAGTCCTATTATAAAGCCTATAGCAATGCCTAGAAATCCTTCTCGTACAAATGATTCAATTATAGAGGCAAACATGTCAAGCCCTTCCGAATTACATATTCCAACTACAGGACCTCTTAATACGAAAGTGCACCAAGCGGGATATGGCTGTGACGGTGGACTTGTGAATTCGGTAATGACAGATGTAAGTAAGGATACATGTTCCAAAGATTCCAGTAGTTCTGTTATAAAACCAGTCATTGTTGAAGATGAAAATCAGAATAACCCGCTCTGGCGTCCTTCGACACACACGAATGAGCAGTGCTCTAGTTTGCAGGGAGGTGAGGAAAGTTCTGTAAATGATGAGGAAAAGATCAGTCTATCAGCCGATTTATTAGAAGAAGATCCATATAGTTCTGAATATGAATCAGATGGTAAGTTGGATGTAAATGAGGCCATGGATGCAGTTGATAACGATATAGAAGA
AGATTATGAAGACGGAGAGGTTCGGGAACCGACATTGACGACTCAAGTAGAAAGCAGTATATGCGAGACGAAAAAAGTAAAAAATTTTGATCATGGTGATTCTAGCAATGGACTGCCTGGTTCTGATTGCTGTTCCTCCTTGGTTTCTGTTAAGCAGGAAAATAAATTAGAAATCCTCGATGTTAAACGAGAAGACAATCTTCATTCTGTAACTTCGAATCAATCGTCCGAGCAAGAACGATCGAAAGAGTTGCCTGTCGAAGAGCATACCACTAGAGTGTGTTTGAACAAGGCCAACAAGGCTAAAACATCTGCCTTAGAGGACCAGGAAACTTCCCCTGAGAAAGCCAGTAATGGAATCGAAGAATCGATTACGACAGTTTCTCAGAGCGACGCAGAGAAGGTTAAAACAGTAGATATCGTGCGAAACGACAATCCAGCTTTGCCAAATGTCGAGCCTTTAAATGATGATGATGTGACTGACGATATTACTCGTGGCAGTAAGCATAGCCGCATTGTTAGTCCCTGCAAACCTTCATCGTCTTCACTTCCTAGTAAAACGAAATCGAGTTTGGCGAGGTCGGTTTTAACACAAACTGATAGAGAACGAATACCCGACATGGGGCATGAAGGGGAAAAATTACATCCACAAGGAAGAGATGAACCATATAGGGACGTTTTCCAGAGATTTTATGTGAATAGACATCAGAATCTATCACCCCAAACCAATTTTAGCCGTAGAAGAGGTAGATTCACTATCCGGATTAACTCTGTCCAAGGTGAATGGGATTTTAATCCAACAATCTCTCCAGGAAATTACAATGATCAAGTACCACCACCCTATGATGCCCGTAGACGTAAATACATGCCTGCTGTTTCTGATGATGATATTGATCAAAACCATTATAAAATGAAACCCGATGGTCCATTTCGTAGCGCTGGTGATCATCGAGGTAGACAGATATTAGACGACGAAGGCCCCCTTTTTTGTCATATGGCCTCTAGGAGGAAGTCGCCTGGTCGAAGAGATGGGCCTCCTCCAGTGCGAGGTGTTAAAATGGTACACAGAATGCCTAGAAACATCAGTCCAAGTAGATGTAATCGTGAACGTGGATCGGAACTGGTTGGACCGCGACACGGTGAGAAGTTCATGAGGACATTTGAAGACGAAACTATGGATCCATTATACGCCCACCCTCAACCTTCGTTTGAAGTAGATCGGCCTCCTTTTATCCGAGACCGAAGGAACTTTCCTATTCAAAGAAAAAGTTTTCAAAGAGTTGATTCTAAATCTCCAGGAAGGTCCAGAGGACGCTCTCCTTCCCAATGGTTTCCATCCAAAAGAAAGTCTGAGAGGTTCTTTGGACATCCAGAAATGGCACGGCGAAGTCCTCCACCCGGTTATAGGATGAGATCGCCCGATCAACCTCCTCAAATCCATGGAGATATGCCAGATCGAAGACATGGTTTCCCGTTTCCGTCACTGCCACCTAATGATTTGAGGGATATGGGTTCTGCTCGTGACCATGGCCATATGAGACCAGGTCTACGAAGTCGAAACCGAACCGACAGAATGTCTTTTAGAAACAGGAGGTTTGAAGATATGGATCCTCGAGATAATAGGATAGAGAGTAACGAATACTTCGATGGACCTGTACATCCTGGTCAAATGAATGAACTGATTGATGATGGCAACGACGACGATCGAAGAAGGTTTTCGGACAGACACGAACATCTTCACCAATTCCGGCCACAATGTAATGATTCAGACGGTGAAAACTATCACAACGATGCAGACGAAAGAGCGAGGCCTTACAGATACTGCACGGAGGATGAAGAAGAGTTCCATGAAAGAGAGCAGCCGCCGCCGCTACGAATGGCTAATTCCACTAGAACCAGTGCCGATAATCCGAGTATCGCAATCCATCCATTGATCCTCTCGCTGTACGGACTGAATCCGAATTCTACCTTAAGTCCGCCGGAAACCGCTTCCTTCAGCG
GCACTGGCGGAAGATACACTGACGAGGCGGCGGCGGTTGATCCTTTCCTGGACTTCCGAGCCTTACGAATCTCTGCCTTCTTTCCCGTTCCGCGTCGGAGTTGGAGTCGGGCCAGTCGGTCCTCGAAATCGTCGCCGTCCGATCTAGGTTCCGAATCGGCGGCAGGAGATTCCGGATCTGCTGCTCTGGTTACGAAGAGCGGAGTTTTGTGGAAGTTTGAGTCGTTGTTGAGGATCAGCTGTTCAAATACTGGAATGTTTTCTACCATAAAATGTTCCATTGGCACTATGCAGGTTGGTTTTAAGCGCATTGGATTCTCAGTTAGTGATTATGATGCAAATCTTCCTATCAAGAAAAGGAGATTTCCGGTAGTGCAGATCTCTCCATCTCCATCTGAAGGTATATCTTCATTCCATCCAGATGGAAATTTATTGAAGATTGAGCGGCCATCTCCACCTAAAAAGCTATCTTCATTTAATCCCGATGAAAATTCGTTAGAGGTCGAACAGCCGAGTCTATCTGTGACAATAGTTTCAAGTTCTAGTGCAGACACATGTTATGGGTTGTCAAACAGGAACCAGGACTGTGTTTCTAATGAGAATAAACGAAAATCTGATACTCATTCATGCTATGTGGATATGGTCCAGAACGATATTGGGATGCCAGGAGTCGAGTTTCCGGGACCCGGTTTGGGAGGACATGAAGATAAGTCCTTGGTAACTGAAAAACACTCCGTTCATCGATCACCGGAGATCTACGGTGAGTTGAAGTTATCATCAACTGGCGTCGACTCGGATCCTCTTGGTAGTAACAAAGAGGAAGAAATTGATGTAAAAATGCCTGAAGAAAAGTGCAGCTCTTCAATTTGTCAAGTTGAAGGAGGAGCTGAAGTATCAGAGAAATTGGTTTCTTACAAGAGTGACCTGAATAAGCAGAATTCTTTGGAGCCTGTGTTAATGGACTTGTCTTTAAACAAGCAAGGAAGTAGCTGCCATTGTGTCAAAGGTAACGGTCTGATTGCGATGATTGAATCAGATGGTAGCTGGAATATTGCCGAGGTTGAAGACGACGACGACGATGATAATAACATAGAAGAAGACTATGAAAATGGCGAGGTTCGGGAATCAATGCAAAAAGAGGCCCGTGCTTGTGAGAAAAGAGAAATTGAGCCATTGGATCATGCTGATTGTGATGATAAGAAGATCAATTCTGCTGGATTGCCTGATCATGAATGTTTCACATTAGGCCCTCTGGAACAGGAAACGAAACCTGAAAATCTGGACTCTAAGAGTGAAGACGATGTTCATACTACAACTGAAAGTACATCTTGTGAGCAAGAACATGAAGATCTTTGTGTGAAAGAACCACTTGACGTAGAGAATACTATTGGTGAGGATGTAAACAGGCCTATGAAGGCTGCAGGAAGAAGCCAATTATCTCAATATGTTAATAAGGACAAGTTAGAGGGCCACGACACCGCCGATGAAATCGAGGAACTGATTCCGAAATTTTCTCAGGGTGAGATGGAGAAAGCTATTGCTGTAGAGAATAGGGATCTAACTTTGCCTACCAATATGTTGGACAAACGATCTGGGGAATGGGACTTTGGTCCCAACTTTTCTCCTGAAACATACAGTGACCAGCAGATAGATTACCATGTTCCTGATCTTGATCACGACCGATATAAAATTATTCCTGATGGTCGATTTGTCGGTGCTAACCGTCGCAGTAGGTCATTGCTGGACAATGAGGGACCTTTTTTTTTTCCATGGACCCTCAAGGAGGAGGTCACCTGGAAGAACTCATGGACCATGGTGGCAAAATGGTTAACAGAATGCCTAGAGATTATAGTCTTAGAAAGTTCACTAGGAACTTTGCTGATGACACGGGATCCGATATATCGACGACCTCATCCTGCATACGAATTAGACAGACCTTTGTTCCGGGAAAGAAGGAACTTCTCATTCCAAAGAAGTGATTCTAAGTCTATAGTAAGATCC
CGATCTCGCTCTCCGAGCCAATGTCTCTTTGAAAGATCTGATAGGTTTTATGGACGTCCCGACATGACACGTCGAAGATCTCCAAATTATAGGACAGACGGGACGAGATCGCCCGATCAGCATCCTATATGTGCGCATATGACAGGCCAAAGACAAGGATTCTGTTTCCTTTCACCATCTGATGATTTGAGGGATGTTGGTCCTACACCCAACCATGGCCATATGAGATCTATCATTCCTAATAGGAATCAAACTGAAAGATTACCTCTTAGAAACAGAAGTTATGATGCTATAGATCATCAAGTAAGGATAGGGAGCAATGAACTTTTTGATGATCCC
Protein sequence
MSTSDYNAIVPIKKRRFPSMQSPPPKEISSLPLVDDNIAKVDEPCVSDGPTVSNSSTITTSEFSEKKISFSEDGKRKSDLCNMNMVQSIIGPSRVEFQENDVCSTGCVENKETCMVNENHALVLHEKPEFKLPHSDANSNPGLCAEKESDEIDRKQLDRLEFSTSVAKKEAELSIGSKEHLVPDSVLEGSDLKSLKQINLEPGLLNLSLSKEGSLDQPLTVNVGSSYDGSIQESNRENWDLNTSMEFWEGCSSGDPPEHVPAVQTNTVVTMHRFSTEMVNTDTLSGKLTPLDDSDHLHLSLSSSDHRHVISQEQSSFVKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKLESKSDEVNVLGLSDSAIVKREFLQIPNASDIYIPMNTVKAKSVNSESNYESKQVALETLGGRLDLVAKQVLPEVDSSCPAPMPFVAEMTEAAGNSCSTDLITDGGMSNHSELQTPTEEHLNLKVHEGAYRCGGELVDSEMTDISKDPGSKDFNSPIIKPIAMPRNPSRTNDSIIEANMSSPSELHIPTTGPLNTKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNNPLWRPSTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMDAVDNDIEEDYEDGEVREPTLTTQVESSICETKKVKNFDHGDSSNGLPGSDCCSSLVSVKQENKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKAKTSALEDQETSPEKASNGIEESITTVSQSDAEKVKTVDIVRNDNPALPNVEPLNDDDVTDDITRGSKHSRIVSPCKPSSSSLPSKTKSSLARSVLTQTDRERIPDMGHEGEKLHPQGRDEPYRDVFQRFYVNRHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPTISPGNYNDQVPPPYDARRRKYMPAVSDDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVRGVKMVHRMPRNISPSRCNRERGSELVGPRHGEKFMRTFEDETMDPLYAHPQPSFEVDRPPFIRDRRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFPSKRKSERFFGHPEMARRSPPPGYRMRSPDQPPQIHGDMPDRRHGFPFPSLPPNDLRDMGSARDHGHMRPGLRSRNRTDRMSFRNRRFEDMDPRDNRIESNEYFDGPVHPGQMNELIDDGNDDDRRRFSDRHEHLHQFRPQCNDSDGENYHNDADERARPYRYCTEDEEEFHEREQPPPLRMANSTRTSADNPSIAIHPLILSLYGLNPNSTLSPPETASFSGTGGRYTDEAAAVDPFLDFRALRISAFFPVPRRSWSRASRSSKSSPSDLGSESAAGDSGSAALVTKSGVLWKFESLLRISCSNTGMFSTIKCSIGTMQVGFKRIGFSVSDYDANLPIKKRRFPVVQISPSPSEGISSFHPDGNLLKIERPSPPKKLSSFNPDENSLEVEQPSLSVTIVSSSSADTCYGLSNRNQDCVSNENKRKSDTHSCYVDMVQNDIGMPGVEFPGPGLGGHEDKSLVTEKHSVHRSPEIYGELKLSSTGVDSDPLGSNKEEEIDVKMPEEKCSSSICQVEGGAEVSEKLVSYKSDLNKQNSLEPVLMDLSLNKQGSSCHCVKGNGLIAMIESDGSWNIAEVEDDDDDDNNIEEDYENGEVRESMQKEARACEKREIEPLDHADCDDKKINSAGLPDHECFTLGPLEQETKPENLDSKSEDDVHTTTESTSCEQEHEDLCVKEPLDVENTIGEDVNRPMKAAGRSQLSQYVNKDKLEGHDTADEIEELIPKFSQGEMEKAIAVENRDLTLPTNMLDKRSGEWDFGPNFSPETYSDQQIDYHVPDLDHDRYKIIPDGRFVGANRRSRSLLDNEGPFFFPWTLKEEVTWKNSWTMVAKWLTECLEIIVLESSLGTLLMTRDPIYRRPHPAYELDRPLFRERRNFSFQRSDSKSIVRS
RSRSPSQCLFERSDRFYGRPDMTRRRSPNYRTDGTRSPDQHPICAHMTGQRQGFCFLSPSDDLRDVGPTPNHGHMRSIIPNRNQTERLPLRNRSYDAIDHQVRIGSNELFDDP
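The protein sequence above is the conceptual translation of the coding sequence (CDS), read three bases at a time from the initial ATG. The following is a minimal sketch of that step, assuming the standard nuclear genetic code (NCBI translation table 1); the function name `translate` and the choice to emit `X` for unrecognized codons are illustrative assumptions, not part of this record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), listed in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon.

    Codons that contain anything other than A, C, G, T become 'X'.
    A trailing partial codon is ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the CDS listed above.
print(translate("ATGTCAACTAGTGATTATAATGCAATT"))  # MSTSDYNAI
```

The output matches the first nine residues of the protein sequence (MSTSDYNAI).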
Homology
BLAST of Cp4.1LG13g09640 vs. NCBI nr
Match:
XP_023550091.1 (uncharacterized protein LOC111808389 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 2557 bits (6628), Expect = 0.0
Identity = 1285/1285 (100.00%), Positives = 1285/1285 (100.00%), Query Frame = 0
Query: 1 MSTSDYNAIVPIKKRRFPSMQSPPPKEISSLPLVDDNIAKVDEPCVSDGPTVSNSSTITT 60
MSTSDYNAIVPIKKRRFPSMQSPPPKEISSLPLVDDNIAKVDEPCVSDGPTVSNSSTITT
Sbjct: 1 MSTSDYNAIVPIKKRRFPSMQSPPPKEISSLPLVDDNIAKVDEPCVSDGPTVSNSSTITT 60
Query: 61 SEFSEKKISFSEDGKRKSDLCNMNMVQSIIGPSRVEFQENDVCSTGCVENKETCMVNENH 120
SEFSEKKISFSEDGKRKSDLCNMNMVQSIIGPSRVEFQENDVCSTGCVENKETCMVNENH
Sbjct: 61 SEFSEKKISFSEDGKRKSDLCNMNMVQSIIGPSRVEFQENDVCSTGCVENKETCMVNENH 120
Query: 121 ALVLHEKPEFKLPHSDANSNPGLCAEKESDEIDRKQLDRLEFSTSVAKKEAELSIGSKEH 180
ALVLHEKPEFKLPHSDANSNPGLCAEKESDEIDRKQLDRLEFSTSVAKKEAELSIGSKEH
Sbjct: 121 ALVLHEKPEFKLPHSDANSNPGLCAEKESDEIDRKQLDRLEFSTSVAKKEAELSIGSKEH 180
Query: 181 LVPDSVLEGSDLKSLKQINLEPGLLNLSLSKEGSLDQPLTVNVGSSYDGSIQESNRENWD 240
LVPDSVLEGSDLKSLKQINLEPGLLNLSLSKEGSLDQPLTVNVGSSYDGSIQESNRENWD
Sbjct: 181 LVPDSVLEGSDLKSLKQINLEPGLLNLSLSKEGSLDQPLTVNVGSSYDGSIQESNRENWD 240
Query: 241 LNTSMEFWEGCSSGDPPEHVPAVQTNTVVTMHRFSTEMVNTDTLSGKLTPLDDSDHLHLS 300
LNTSMEFWEGCSSGDPPEHVPAVQTNTVVTMHRFSTEMVNTDTLSGKLTPLDDSDHLHLS
Sbjct: 241 LNTSMEFWEGCSSGDPPEHVPAVQTNTVVTMHRFSTEMVNTDTLSGKLTPLDDSDHLHLS 300
Query: 301 LSSSDHRHVISQEQSSFVKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKLE 360
LSSSDHRHVISQEQSSFVKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKLE
Sbjct: 301 LSSSDHRHVISQEQSSFVKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKLE 360
Query: 361 SKSDEVNVLGLSDSAIVKREFLQIPNASDIYIPMNTVKAKSVNSESNYESKQVALETLGG 420
SKSDEVNVLGLSDSAIVKREFLQIPNASDIYIPMNTVKAKSVNSESNYESKQVALETLGG
Sbjct: 361 SKSDEVNVLGLSDSAIVKREFLQIPNASDIYIPMNTVKAKSVNSESNYESKQVALETLGG 420
Query: 421 RLDLVAKQVLPEVDSSCPAPMPFVAEMTEAAGNSCSTDLITDGGMSNHSELQTPTEEHLN 480
RLDLVAKQVLPEVDSSCPAPMPFVAEMTEAAGNSCSTDLITDGGMSNHSELQTPTEEHLN
Sbjct: 421 RLDLVAKQVLPEVDSSCPAPMPFVAEMTEAAGNSCSTDLITDGGMSNHSELQTPTEEHLN 480
Query: 481 LKVHEGAYRCGGELVDSEMTDISKDPGSKDFNSPIIKPIAMPRNPSRTNDSIIEANMSSP 540
LKVHEGAYRCGGELVDSEMTDISKDPGSKDFNSPIIKPIAMPRNPSRTNDSIIEANMSSP
Sbjct: 481 LKVHEGAYRCGGELVDSEMTDISKDPGSKDFNSPIIKPIAMPRNPSRTNDSIIEANMSSP 540
Query: 541 SELHIPTTGPLNTKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNN 600
SELHIPTTGPLNTKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNN
Sbjct: 541 SELHIPTTGPLNTKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNN 600
Query: 601 PLWRPSTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMDA 660
PLWRPSTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMDA
Sbjct: 601 PLWRPSTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMDA 660
Query: 661 VDNDIEEDYEDGEVREPTLTTQVESSICETKKVKNFDHGDSSNGLPGSDCCSSLVSVKQE 720
VDNDIEEDYEDGEVREPTLTTQVESSICETKKVKNFDHGDSSNGLPGSDCCSSLVSVKQE
Sbjct: 661 VDNDIEEDYEDGEVREPTLTTQVESSICETKKVKNFDHGDSSNGLPGSDCCSSLVSVKQE 720
Query: 721 NKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKAKTSALEDQETSP 780
NKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKAKTSALEDQETSP
Sbjct: 721 NKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKAKTSALEDQETSP 780
Query: 781 EKASNGIEESITTVSQSDAEKVKTVDIVRNDNPALPNVEPLNDDDVTDDITRGSKHSRIV 840
EKASNGIEESITTVSQSDAEKVKTVDIVRNDNPALPNVEPLNDDDVTDDITRGSKHSRIV
Sbjct: 781 EKASNGIEESITTVSQSDAEKVKTVDIVRNDNPALPNVEPLNDDDVTDDITRGSKHSRIV 840
Query: 841 SPCKPSSSSLPSKTKSSLARSVLTQTDRERIPDMGHEGEKLHPQGRDEPYRDVFQRFYVN 900
SPCKPSSSSLPSKTKSSLARSVLTQTDRERIPDMGHEGEKLHPQGRDEPYRDVFQRFYVN
Sbjct: 841 SPCKPSSSSLPSKTKSSLARSVLTQTDRERIPDMGHEGEKLHPQGRDEPYRDVFQRFYVN 900
Query: 901 RHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPTISPGNYNDQVPPPYDARRRKYMPAVS 960
RHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPTISPGNYNDQVPPPYDARRRKYMPAVS
Sbjct: 901 RHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPTISPGNYNDQVPPPYDARRRKYMPAVS 960
Query: 961 DDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVRGVKM 1020
DDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVRGVKM
Sbjct: 961 DDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVRGVKM 1020
Query: 1021 VHRMPRNISPSRCNRERGSELVGPRHGEKFMRTFEDETMDPLYAHPQPSFEVDRPPFIRD 1080
VHRMPRNISPSRCNRERGSELVGPRHGEKFMRTFEDETMDPLYAHPQPSFEVDRPPFIRD
Sbjct: 1021 VHRMPRNISPSRCNRERGSELVGPRHGEKFMRTFEDETMDPLYAHPQPSFEVDRPPFIRD 1080
Query: 1081 RRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFPSKRKSERFFGHPEMARRSPPPGYRMRSP 1140
RRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFPSKRKSERFFGHPEMARRSPPPGYRMRSP
Sbjct: 1081 RRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFPSKRKSERFFGHPEMARRSPPPGYRMRSP 1140
Query: 1141 DQPPQIHGDMPDRRHGFPFPSLPPNDLRDMGSARDHGHMRPGLRSRNRTDRMSFRNRRFE 1200
DQPPQIHGDMPDRRHGFPFPSLPPNDLRDMGSARDHGHMRPGLRSRNRTDRMSFRNRRFE
Sbjct: 1141 DQPPQIHGDMPDRRHGFPFPSLPPNDLRDMGSARDHGHMRPGLRSRNRTDRMSFRNRRFE 1200
Query: 1201 DMDPRDNRIESNEYFDGPVHPGQMNELIDDGNDDDRRRFSDRHEHLHQFRPQCNDSDGEN 1260
DMDPRDNRIESNEYFDGPVHPGQMNELIDDGNDDDRRRFSDRHEHLHQFRPQCNDSDGEN
Sbjct: 1201 DMDPRDNRIESNEYFDGPVHPGQMNELIDDGNDDDRRRFSDRHEHLHQFRPQCNDSDGEN 1260
Query: 1261 YHNDADERARPYRYCTEDEEEFHER 1285
YHNDADERARPYRYCTEDEEEFHER
Sbjct: 1261 YHNDADERARPYRYCTEDEEEFHER 1285
BLAST of Cp4.1LG13g09640 vs. NCBI nr
Match:
XP_023550092.1 (uncharacterized protein LOC111808389 isoform X2 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 2497 bits (6472), Expect = 0.0
Identity = 1263/1285 (98.29%), Positives = 1263/1285 (98.29%), Query Frame = 0
Query: 1 MSTSDYNAIVPIKKRRFPSMQSPPPKEISSLPLVDDNIAKVDEPCVSDGPTVSNSSTITT 60
MSTSDYNAIVPIKKRRFPSMQSPPPKEISSLPLVDDNIAKVDEPCVSDGPTVSNSSTITT
Sbjct: 1 MSTSDYNAIVPIKKRRFPSMQSPPPKEISSLPLVDDNIAKVDEPCVSDGPTVSNSSTITT 60
Query: 61 SEFSEKKISFSEDGKRKSDLCNMNMVQSIIGPSRVEFQENDVCSTGCVENKETCMVNENH 120
SEFSEKKISFSEDGKRKSDLCNMNMVQSIIGPSRVEFQENDVCSTGCVENKETCMVNENH
Sbjct: 61 SEFSEKKISFSEDGKRKSDLCNMNMVQSIIGPSRVEFQENDVCSTGCVENKETCMVNENH 120
Query: 121 ALVLHEKPEFKLPHSDANSNPGLCAEKESDEIDRKQLDRLEFSTSVAKKEAELSIGSKEH 180
ALVLHEKPEFKLPHSDANSNPGLCAEKESDEIDRKQLDRLEFSTSVAKKEAELSIGSKEH
Sbjct: 121 ALVLHEKPEFKLPHSDANSNPGLCAEKESDEIDRKQLDRLEFSTSVAKKEAELSIGSKEH 180
Query: 181 LVPDSVLEGSDLKSLKQINLEPGLLNLSLSKEGSLDQPLTVNVGSSYDGSIQESNRENWD 240
LVPDSVLEGSDLKSLKQINLEPGLLNLSLSKEGSLDQPLTVNVGSSYDGSIQESNRENWD
Sbjct: 181 LVPDSVLEGSDLKSLKQINLEPGLLNLSLSKEGSLDQPLTVNVGSSYDGSIQESNRENWD 240
Query: 241 LNTSMEFWEGCSSGDPPEHVPAVQTNTVVTMHRFSTEMVNTDTLSGKLTPLDDSDHLHLS 300
LNTSMEFWEGCSSGDPPEHVPAVQTNTVVTMHRFSTEMVNTDTLSGKLTPLDDSDHLHLS
Sbjct: 241 LNTSMEFWEGCSSGDPPEHVPAVQTNTVVTMHRFSTEMVNTDTLSGKLTPLDDSDHLHLS 300
Query: 301 LSSSDHRHVISQEQSSFVKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKLE 360
LSSSDHRHVISQEQSSFVKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKLE
Sbjct: 301 LSSSDHRHVISQEQSSFVKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKLE 360
Query: 361 SKSDEVNVLGLSDSAIVKREFLQIPNASDIYIPMNTVKAKSVNSESNYESKQVALETLGG 420
SKSDEVNVLGLSDSAIVKREFLQIPNASDIYIPMNTVKAKSVNSESNYESKQVALETLGG
Sbjct: 361 SKSDEVNVLGLSDSAIVKREFLQIPNASDIYIPMNTVKAKSVNSESNYESKQVALETLGG 420
Query: 421 RLDLVAKQVLPEVDSSCPAPMPFVAEMTEAAGNSCSTDLITDGGMSNHSELQTPTEEHLN 480
RLDLVAKQVLPEVDSSCPAPMPFVAEMTEAAGNSCSTDLITDGGMSNHSELQTPTEEHLN
Sbjct: 421 RLDLVAKQVLPEVDSSCPAPMPFVAEMTEAAGNSCSTDLITDGGMSNHSELQTPTEEHLN 480
Query: 481 LKVHEGAYRCGGELVDSEMTDISKDPGSKDFNSPIIKPIAMPRNPSRTNDSIIEANMSSP 540
LKVHEGAYRCGGELVDSEMTDISKDPGSKDFNSPIIKPIAMPRNPSRTNDSIIEANMSSP
Sbjct: 481 LKVHEGAYRCGGELVDSEMTDISKDPGSKDFNSPIIKPIAMPRNPSRTNDSIIEANMSSP 540
Query: 541 SELHIPTTGPLNTKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNN 600
SELHIPTTGPLNTKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNN
Sbjct: 541 SELHIPTTGPLNTKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNN 600
Query: 601 PLWRPSTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMDA 660
PLWRPSTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMDA
Sbjct: 601 PLWRPSTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMDA 660
Query: 661 VDNDIEEDYEDGEVREPTLTTQVESSICETKKVKNFDHGDSSNGLPGSDCCSSLVSVKQE 720
VDNDIEEDYEDGEVREPTLTTQVESSICETKKVKNFDHGDSSNGLPGSDCCSSLVSVKQE
Sbjct: 661 VDNDIEEDYEDGEVREPTLTTQVESSICETKKVKNFDHGDSSNGLPGSDCCSSLVSVKQE 720
Query: 721 NKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKAKTSALEDQETSP 780
NKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKAKTSALEDQETSP
Sbjct: 721 NKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKAKTSALEDQETSP 780
Query: 781 EKASNGIEESITTVSQSDAEKVKTVDIVRNDNPALPNVEPLNDDDVTDDITRGSKHSRIV 840
EKASNGIEESITTVSQSDAEKVKTVDIVRNDNPALPNVEPLNDDDVTDDITRGSKHSRIV
Sbjct: 781 EKASNGIEESITTVSQSDAEKVKTVDIVRNDNPALPNVEPLNDDDVTDDITRGSKHSRIV 840
Query: 841 SPCKPSSSSLPSKTKSSLARSVLTQTDRERIPDMGHEGEKLHPQGRDEPYRDVFQRFYVN 900
SPCKPSSSSLPSKTKSSLARSVLTQTDRERIPDMGHEGEKLHPQGRDEPYRDVFQRFYVN
Sbjct: 841 SPCKPSSSSLPSKTKSSLARSVLTQTDRERIPDMGHEGEKLHPQGRDEPYRDVFQRFYVN 900
Query: 901 RHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPTISPGNYNDQVPPPYDARRRKYMPAVS 960
RHQNLSPQTNFSRRRG                      NYNDQVPPPYDARRRKYMPAVS
Sbjct: 901 RHQNLSPQTNFSRRRG----------------------NYNDQVPPPYDARRRKYMPAVS 960
Query: 961 DDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVRGVKM 1020
DDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVRGVKM
Sbjct: 961 DDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVRGVKM 1020
Query: 1021 VHRMPRNISPSRCNRERGSELVGPRHGEKFMRTFEDETMDPLYAHPQPSFEVDRPPFIRD 1080
VHRMPRNISPSRCNRERGSELVGPRHGEKFMRTFEDETMDPLYAHPQPSFEVDRPPFIRD
Sbjct: 1021 VHRMPRNISPSRCNRERGSELVGPRHGEKFMRTFEDETMDPLYAHPQPSFEVDRPPFIRD 1080
Query: 1081 RRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFPSKRKSERFFGHPEMARRSPPPGYRMRSP 1140
RRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFPSKRKSERFFGHPEMARRSPPPGYRMRSP
Sbjct: 1081 RRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFPSKRKSERFFGHPEMARRSPPPGYRMRSP 1140
Query: 1141 DQPPQIHGDMPDRRHGFPFPSLPPNDLRDMGSARDHGHMRPGLRSRNRTDRMSFRNRRFE 1200
DQPPQIHGDMPDRRHGFPFPSLPPNDLRDMGSARDHGHMRPGLRSRNRTDRMSFRNRRFE
Sbjct: 1141 DQPPQIHGDMPDRRHGFPFPSLPPNDLRDMGSARDHGHMRPGLRSRNRTDRMSFRNRRFE 1200
Query: 1201 DMDPRDNRIESNEYFDGPVHPGQMNELIDDGNDDDRRRFSDRHEHLHQFRPQCNDSDGEN 1260
DMDPRDNRIESNEYFDGPVHPGQMNELIDDGNDDDRRRFSDRHEHLHQFRPQCNDSDGEN
Sbjct: 1201 DMDPRDNRIESNEYFDGPVHPGQMNELIDDGNDDDRRRFSDRHEHLHQFRPQCNDSDGEN 1260
Query: 1261 YHNDADERARPYRYCTEDEEEFHER 1285
YHNDADERARPYRYCTEDEEEFHER
Sbjct: 1261 YHNDADERARPYRYCTEDEEEFHER 1263
BLAST of Cp4.1LG13g09640 vs. NCBI nr
Match:
XP_022992789.1 (uncharacterized protein LOC111489020 isoform X1 [Cucurbita maxima] >XP_022992790.1 uncharacterized protein LOC111489020 isoform X1 [Cucurbita maxima])
HSP 1 Score: 2467 bits (6395), Expect = 0.0
Identity = 1239/1285 (96.42%), Positives = 1256/1285 (97.74%), Query Frame = 0
Query: 1 MSTSDYNAIVPIKKRRFPSMQSPPPKEISSLPLVDDNIAKVDEPCVSDGPTVSNSSTITT 60
MSTSDYNAIVPIKKRRFP +QSPPPKEISSLPLVDDNIAKVDEPCVSDGPTVSNSSTITT
Sbjct: 1 MSTSDYNAIVPIKKRRFPLIQSPPPKEISSLPLVDDNIAKVDEPCVSDGPTVSNSSTITT 60
Query: 61 SEFSEKKISFSEDGKRKSDLCNMNMVQSIIGPSRVEFQENDVCSTGCVENKETCMVNENH 120
SEFSEKKISFSEDGKRKSDLCNMNMVQ IIGPSRVEFQEND CS GCVENKETCMVNENH
Sbjct: 61 SEFSEKKISFSEDGKRKSDLCNMNMVQRIIGPSRVEFQENDACSAGCVENKETCMVNENH 120
Query: 121 ALVLHEKPEFKLPHSDANSNPGLCAEKESDEIDRKQLDRLEFSTSVAKKEAELSIGSKEH 180
ALVLHEKPEFKLPHSDANSNPGLCAEKESDE+DRKQLDRLEFSTS+AKKEAELS+GSKEH
Sbjct: 121 ALVLHEKPEFKLPHSDANSNPGLCAEKESDEVDRKQLDRLEFSTSLAKKEAELSVGSKEH 180
Query: 181 LVPDSVLEGSDLKSLKQINLEPGLLNLSLSKEGSLDQPLTVNVGSSYDGSIQESNRENWD 240
LVPDSVLEGSDLKSLKQINLEP LLNLSLSKEGSLDQ LTVNVGSSYDGSIQESNRENWD
Sbjct: 181 LVPDSVLEGSDLKSLKQINLEPVLLNLSLSKEGSLDQCLTVNVGSSYDGSIQESNRENWD 240
Query: 241 LNTSMEFWEGCSSGDPPEHVPAVQTNTVVTMHRFSTEMVNTDTLSGKLTPLDDSDHLHLS 300
LNTSMEFWEGCSSGDPPEHVPAVQTNT+VT HRFSTEMVNTDTLSGKLTPLDDSDHLHLS
Sbjct: 241 LNTSMEFWEGCSSGDPPEHVPAVQTNTIVTTHRFSTEMVNTDTLSGKLTPLDDSDHLHLS 300
Query: 301 LSSSDHRHVISQEQSSFVKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKLE 360
LSSSDHRHVISQEQSSF KLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKL
Sbjct: 301 LSSSDHRHVISQEQSSFAKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKLM 360
Query: 361 SKSDEVNVLGLSDSAIVKREFLQIPNASDIYIPMNTVKAKSVNSESNYESKQVALETLGG 420
SKSDEVNVLGLSDSAIVKREFLQIPNASD+YIPMN VKAKSVNSESNYESKQ AL+TLGG
Sbjct: 361 SKSDEVNVLGLSDSAIVKREFLQIPNASDVYIPMNPVKAKSVNSESNYESKQEALKTLGG 420
Query: 421 RLDLVAKQVLPEVDSSCPAPMPFVAEMTEAAGNSCSTDLITDGGMSNHSELQTPTEEHLN 480
RLDLVAKQVLPEVDSSCPAPMPFVAEMTEAAGNSCSTDLITDG MSNH ELQTPT+EHLN
Sbjct: 421 RLDLVAKQVLPEVDSSCPAPMPFVAEMTEAAGNSCSTDLITDGDMSNHPELQTPTKEHLN 480
Query: 481 LKVHEGAYRCGGELVDSEMTDISKDPGSKDFNSPIIKPIAMPRNPSRTNDSIIEANMSSP 540
LKVHEGAY CGGELVDSEMTDISKDPGSKD N PIIKPIAMPRNPS TNDSIIEANMSSP
Sbjct: 481 LKVHEGAYCCGGELVDSEMTDISKDPGSKDSNGPIIKPIAMPRNPSPTNDSIIEANMSSP 540
Query: 541 SELHIPTTGPLNTKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNN 600
SELH PTTGPLN KVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNN
Sbjct: 541 SELHTPTTGPLNMKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNN 600
Query: 601 PLWRPSTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMDA 660
PLWRP THTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMD
Sbjct: 601 PLWRPFTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMDT 660
Query: 661 VDNDIEEDYEDGEVREPTLTTQVESSICETKKVKNFDHGDSSNGLPGSDCCSSLVSVKQE 720
VDNDIEEDYEDGEVREPTLTTQVESSICETKKVK FDHGDSSNGLPGSDCCSSLVSVKQE
Sbjct: 661 VDNDIEEDYEDGEVREPTLTTQVESSICETKKVKIFDHGDSSNGLPGSDCCSSLVSVKQE 720
Query: 721 NKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKAKTSALEDQETSP 780
NKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKAK SALEDQETSP
Sbjct: 721 NKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKAKISALEDQETSP 780
Query: 781 EKASNGIEESITTVSQSDAEKVKTVDIVRNDNPALPNVEPLNDDDVTDDITRGSKHSRIV 840
EKA+NGIEESITTVSQSDAEKVKTVDIVRNDNPALPNVEPLNDDDVTDDITRGSKHSRIV
Sbjct: 781 EKATNGIEESITTVSQSDAEKVKTVDIVRNDNPALPNVEPLNDDDVTDDITRGSKHSRIV 840
Query: 841 SPCKPSSSSLPSKTKSSLARSVLTQTDRERIPDMGHEGEKLHPQGRDEPYRDVFQRFYVN 900
SPCKPS+SSLPSKT+SSLARSVLTQTDR+RIPDM HEGEKLHPQGRDEPYRDVFQRFYVN
Sbjct: 841 SPCKPSTSSLPSKTRSSLARSVLTQTDRKRIPDMAHEGEKLHPQGRDEPYRDVFQRFYVN 900
Query: 901 RHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPTISPGNYNDQVPPPYDARRRKYMPAVS 960
RHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPTISPGNY+DQVPPPYDARRRKYMPAVS
Sbjct: 901 RHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPTISPGNYSDQVPPPYDARRRKYMPAVS 960
Query: 961 DDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVRGVKM 1020
DDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVRGVKM
Sbjct: 961 DDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVRGVKM 1020
Query: 1021 VHRMPRNISPSRCNRERGSELVGPRHGEKFMRTFEDETMDPLYAHPQPSFEVDRPPFIRD 1080
HRMPRNISPSRCNRERGSELVGPRHGEKFMRTFEDETMDPLYAHPQPSFEVDRPPFIRD
Sbjct: 1021 AHRMPRNISPSRCNRERGSELVGPRHGEKFMRTFEDETMDPLYAHPQPSFEVDRPPFIRD 1080
Query: 1081 RRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFPSKRKSERFFGHPEMARRSPPPGYRMRSP 1140
RRNFPIQRKSFQRVDSKSPG SRGRSPSQWFPSKRKSERFFGHPEMARRSPPPGYRMRSP
Sbjct: 1081 RRNFPIQRKSFQRVDSKSPGTSRGRSPSQWFPSKRKSERFFGHPEMARRSPPPGYRMRSP 1140
Query: 1141 DQPPQIHGDMPDRRHGFPFPSLPPNDLRDMGSARDHGHMRPGLRSRNRTDRMSFRNRRFE 1200
DQPPQIHGDMP RRHGFPFPSLPPN+LRDMGSARDHGHMRP LRSRNRTDRMSFRNRRFE
Sbjct: 1141 DQPPQIHGDMPVRRHGFPFPSLPPNNLRDMGSARDHGHMRPSLRSRNRTDRMSFRNRRFE 1200
Query: 1201 DMDPRDNRIESNEYFDGPVHPGQMNELIDDGNDDDRRRFSDRHEHLHQFRPQCNDSDGEN 1260
DMDPRDNRIESNEYFDGPVHPGQ+NELIDDGNDDDRRRF++RHEHLHQFRPQCNDSD EN
Sbjct: 1201 DMDPRDNRIESNEYFDGPVHPGQLNELIDDGNDDDRRRFANRHEHLHQFRPQCNDSDSEN 1260
Query: 1261 YHNDADERARPYRYCTEDEEEFHER 1285
YHNDADERARPYRYCTEDEEEFHER
Sbjct: 1261 YHNDADERARPYRYCTEDEEEFHER 1285
BLAST of Cp4.1LG13g09640 vs. NCBI nr
Match:
XP_022938519.1 (uncharacterized protein LOC111444729 isoform X1 [Cucurbita moschata] >XP_022938520.1 uncharacterized protein LOC111444729 isoform X1 [Cucurbita moschata])
HSP 1 Score: 2459 bits (6372), Expect = 0.0
Identity = 1238/1287 (96.19%), Positives = 1260/1287 (97.90%), Query Frame = 0
Query: 1 MSTSDYNAIVPIKKRRFPSMQSPPPKEISSLPLVDDNIAKVDEPCVSDGPTVSNSSTITT 60
MSTSDYNAIVPIKKRRFP +QSPPPKEISSLPLVDD+IAKVDEPCVSDGPTVSNSSTITT
Sbjct: 1 MSTSDYNAIVPIKKRRFPLIQSPPPKEISSLPLVDDSIAKVDEPCVSDGPTVSNSSTITT 60
Query: 61 SEFSEKKISFSEDGKRKSDLCNMNMVQSIIGPSRVEFQENDVCSTGCVENKETCMVNENH 120
SEFSEKKISFSEDGKRKSDLCNMNMVQSIIGPSRVEFQEND CSTGCVENKETCM+NENH
Sbjct: 61 SEFSEKKISFSEDGKRKSDLCNMNMVQSIIGPSRVEFQENDACSTGCVENKETCMMNENH 120
Query: 121 ALVLHEKPEFKLPHSDANSNPGLCAEKESDEIDRKQLDRLEFSTSVAKKEAELSIGSKEH 180
ALVLHEKPEFKLPHSDANSNPGLCAEKESDEIDRKQLDRLEFSTSVAKKEAELS+GSKEH
Sbjct: 121 ALVLHEKPEFKLPHSDANSNPGLCAEKESDEIDRKQLDRLEFSTSVAKKEAELSVGSKEH 180
Query: 181 LVPDSVLEGSDLKSLKQINLEPGLLNLSLSKEGSLDQPLTVNVGSSYDGSIQESNRENWD 240
LVP+SVLEGSDLKSLKQINLEP LLNLSLSKEGSLDQ LTVNVGSSYDGSIQESNRENWD
Sbjct: 181 LVPNSVLEGSDLKSLKQINLEPVLLNLSLSKEGSLDQRLTVNVGSSYDGSIQESNRENWD 240
Query: 241 LNTSMEFWEGCSSGDPPEHVPAVQTNTVVTMHRFSTEMVNTDTLSGKLTPLDDSDHLHLS 300
LNTSMEFWEGCSSGDPPEHVPAVQTNT+VT HRFSTEMVNTDTL GKLTPLDDSDHLHLS
Sbjct: 241 LNTSMEFWEGCSSGDPPEHVPAVQTNTIVTTHRFSTEMVNTDTLPGKLTPLDDSDHLHLS 300
Query: 301 LSSSDHRHVISQEQSSFVKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKLE 360
LSSSDHRHVISQEQSSFVKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKLE
Sbjct: 301 LSSSDHRHVISQEQSSFVKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKLE 360
Query: 361 SKSDEVNVLGLSDSAIVKREFLQIPNASDIYIPMNTVKAKSVNSESNYESKQVALETLGG 420
SKSD VNVLGLSDSAIVKREFLQIPN SDIYIPMNTVKA+SVNSE NYESKQ AL+TLGG
Sbjct: 361 SKSDGVNVLGLSDSAIVKREFLQIPNVSDIYIPMNTVKARSVNSELNYESKQEALKTLGG 420
Query: 421 RLDLVAKQVLPEVDSSCPAPMPFVAEMTEAAGNSCSTDLITDGGMSNHSELQTPTEEHLN 480
RLDLVAKQVLPEV SSCPAPMPFVAEMTEAA NSCSTDLITDG MSNH ELQTPT+EHLN
Sbjct: 421 RLDLVAKQVLPEVGSSCPAPMPFVAEMTEAARNSCSTDLITDGDMSNHPELQTPTKEHLN 480
Query: 481 LKVHEGAYRCGGELVDSEMTDISKDPGSKDFNSPIIKPIAMPRNPSRTNDSIIEANMSSP 540
L VHEGAYR GEL+DSEMTD+SKDPGSKDFNSPIIKPIAMPRNPSRTNDSIIEANMSSP
Sbjct: 481 LNVHEGAYRFAGELIDSEMTDVSKDPGSKDFNSPIIKPIAMPRNPSRTNDSIIEANMSSP 540
Query: 541 SELHIPTTGPLNTKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNN 600
SELHIPTTGPLNTKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNN
Sbjct: 541 SELHIPTTGPLNTKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNN 600
Query: 601 PLWRPSTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMDA 660
PLWRPSTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMD
Sbjct: 601 PLWRPSTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMDT 660
Query: 661 VDNDIEEDYEDGEVREPTLTTQVESSICETKKVKNFDHGDSSNGLPGSDCCSSLVSVKQE 720
VDND+EEDYEDGEVREPTLTTQVESSICETKKVKNFDH DSSNGLPGSDCCSSLVSVKQE
Sbjct: 661 VDNDVEEDYEDGEVREPTLTTQVESSICETKKVKNFDHADSSNGLPGSDCCSSLVSVKQE 720
Query: 721 NKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKAKTSALEDQETSP 780
NKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKAKTSA+EDQETSP
Sbjct: 721 NKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKAKTSAIEDQETSP 780
Query: 781 EKASNGIEESITTVSQSDAEKVKTVDIVRNDNPALPNVEPLNDDDVTDDITRGSKHSRIV 840
EKA+NGIEESITTVSQSDAEKVKTVD+VRN+NPALPNVEPLNDDDVTDDITRGSKHSRIV
Sbjct: 781 EKATNGIEESITTVSQSDAEKVKTVDMVRNNNPALPNVEPLNDDDVTDDITRGSKHSRIV 840
Query: 841 SPCKPSSSSLPSKTKSSLARSVLTQTDRERIPDMGHEGEKLHPQGRDEPYRDVFQRFYVN 900
SPCKPS+SSLPSKT+SSLARSVLTQTDRERIPDM HEGEKLHPQGRDEPYRDVFQRFYVN
Sbjct: 841 SPCKPSTSSLPSKTRSSLARSVLTQTDRERIPDMAHEGEKLHPQGRDEPYRDVFQRFYVN 900
Query: 901 RHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPTISPGNYND-QVPPP-YDARRRKYMPA 960
RHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPTISPGNY+D QVPPP YDARRRKYMPA
Sbjct: 901 RHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPTISPGNYSDHQVPPPPYDARRRKYMPA 960
Query: 961 VSDDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVRGV 1020
VSDDDIDQNHYKMKPD PFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVRGV
Sbjct: 961 VSDDDIDQNHYKMKPDCPFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVRGV 1020
Query: 1021 KMVHRMPRNISPSRCNRERGSELVGPRHGEKFMRTFEDETMDPLYAHPQPSFEVDRPPFI 1080
KMVHRMPRNISPSRCNRERGSELVGPRHGEKFMRTFEDE MDPLYAHPQPSFEVDR PFI
Sbjct: 1021 KMVHRMPRNISPSRCNRERGSELVGPRHGEKFMRTFEDEAMDPLYAHPQPSFEVDRSPFI 1080
Query: 1081 RDRRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFPSKRKSERFFGHPEMARRSPPPGYRMR 1140
RDRRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFPSKRKSERFFGHPEMARRSPPPGYRMR
Sbjct: 1081 RDRRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFPSKRKSERFFGHPEMARRSPPPGYRMR 1140
Query: 1141 SPDQPPQIHGDMPDRRHGFPFPSLPPNDLRDMGSARDHGHMRPGLRSRNRTDRMSFRNRR 1200
SPDQPPQIHGDMP RRHGFPFPSLPPNDLRDMGSARDHGHMRPG+RSRNRT+RMSFRNRR
Sbjct: 1141 SPDQPPQIHGDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMRPGIRSRNRTERMSFRNRR 1200
Query: 1201 FEDMDPRDNRIESNEYFDGPVHPGQMNELIDDGNDDDRRRFSDRHEHLHQFRPQCNDSDG 1260
FEDMDPRDNRIESNEYFDGPVHPGQ+NELIDDGNDDDRRRFSDRHEHLHQFRPQCNDSDG
Sbjct: 1201 FEDMDPRDNRIESNEYFDGPVHPGQLNELIDDGNDDDRRRFSDRHEHLHQFRPQCNDSDG 1260
Query: 1261 ENYHNDADERARPYRYCTEDEEEFHER 1285
ENY NDADERARPYRYCTEDEEEFHER
Sbjct: 1261 ENYRNDADERARPYRYCTEDEEEFHER 1287
BLAST of Cp4.1LG13g09640 vs. NCBI nr
Match:
XP_022992791.1 (uncharacterized protein LOC111489020 isoform X2 [Cucurbita maxima])
HSP 1 Score: 2407 bits (6239), Expect = 0.0
Identity = 1217/1285 (94.71%), Positives = 1234/1285 (96.03%), Query Frame = 0
Query: 1 MSTSDYNAIVPIKKRRFPSMQSPPPKEISSLPLVDDNIAKVDEPCVSDGPTVSNSSTITT 60
MSTSDYNAIVPIKKRRFP +QSPPPKEISSLPLVDDNIAKVDEPCVSDGPTVSNSSTITT
Sbjct: 1 MSTSDYNAIVPIKKRRFPLIQSPPPKEISSLPLVDDNIAKVDEPCVSDGPTVSNSSTITT 60
Query: 61 SEFSEKKISFSEDGKRKSDLCNMNMVQSIIGPSRVEFQENDVCSTGCVENKETCMVNENH 120
SEFSEKKISFSEDGKRKSDLCNMNMVQ IIGPSRVEFQEND CS GCVENKETCMVNENH
Sbjct: 61 SEFSEKKISFSEDGKRKSDLCNMNMVQRIIGPSRVEFQENDACSAGCVENKETCMVNENH 120
Query: 121 ALVLHEKPEFKLPHSDANSNPGLCAEKESDEIDRKQLDRLEFSTSVAKKEAELSIGSKEH 180
ALVLHEKPEFKLPHSDANSNPGLCAEKESDE+DRKQLDRLEFSTS+AKKEAELS+GSKEH
Sbjct: 121 ALVLHEKPEFKLPHSDANSNPGLCAEKESDEVDRKQLDRLEFSTSLAKKEAELSVGSKEH 180
Query: 181 LVPDSVLEGSDLKSLKQINLEPGLLNLSLSKEGSLDQPLTVNVGSSYDGSIQESNRENWD 240
LVPDSVLEGSDLKSLKQINLEP LLNLSLSKEGSLDQ LTVNVGSSYDGSIQESNRENWD
Sbjct: 181 LVPDSVLEGSDLKSLKQINLEPVLLNLSLSKEGSLDQCLTVNVGSSYDGSIQESNRENWD 240
Query: 241 LNTSMEFWEGCSSGDPPEHVPAVQTNTVVTMHRFSTEMVNTDTLSGKLTPLDDSDHLHLS 300
LNTSMEFWEGCSSGDPPEHVPAVQTNT+VT HRFSTEMVNTDTLSGKLTPLDDSDHLHLS
Sbjct: 241 LNTSMEFWEGCSSGDPPEHVPAVQTNTIVTTHRFSTEMVNTDTLSGKLTPLDDSDHLHLS 300
Query: 301 LSSSDHRHVISQEQSSFVKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKLE 360
LSSSDHRHVISQEQSSF KLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKL
Sbjct: 301 LSSSDHRHVISQEQSSFAKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKLM 360
Query: 361 SKSDEVNVLGLSDSAIVKREFLQIPNASDIYIPMNTVKAKSVNSESNYESKQVALETLGG 420
SKSDEVNVLGLSDSAIVKREFLQIPNASD+YIPMN VKAKSVNSESNYESKQ AL+TLGG
Sbjct: 361 SKSDEVNVLGLSDSAIVKREFLQIPNASDVYIPMNPVKAKSVNSESNYESKQEALKTLGG 420
Query: 421 RLDLVAKQVLPEVDSSCPAPMPFVAEMTEAAGNSCSTDLITDGGMSNHSELQTPTEEHLN 480
RLDLVAKQVLPEVDSSCPAPMPFVAEMTEAAGNSCSTDLITDG MSNH ELQTPT+EHLN
Sbjct: 421 RLDLVAKQVLPEVDSSCPAPMPFVAEMTEAAGNSCSTDLITDGDMSNHPELQTPTKEHLN 480
Query: 481 LKVHEGAYRCGGELVDSEMTDISKDPGSKDFNSPIIKPIAMPRNPSRTNDSIIEANMSSP 540
LKVHEGAY CGGELVDSEMTDISKDPGSKD N PIIKPIAMPRNPS TNDSIIEANMSSP
Sbjct: 481 LKVHEGAYCCGGELVDSEMTDISKDPGSKDSNGPIIKPIAMPRNPSPTNDSIIEANMSSP 540
Query: 541 SELHIPTTGPLNTKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNN 600
SELH PTTGPLN KVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNN
Sbjct: 541 SELHTPTTGPLNMKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNN 600
Query: 601 PLWRPSTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMDA 660
PLWRP THTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMD
Sbjct: 601 PLWRPFTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMDT 660
Query: 661 VDNDIEEDYEDGEVREPTLTTQVESSICETKKVKNFDHGDSSNGLPGSDCCSSLVSVKQE 720
VDNDIEEDYEDGEVREPTLTTQVESSICETKKVK FDHGDSSNGLPGSDCCSSLVSVKQE
Sbjct: 661 VDNDIEEDYEDGEVREPTLTTQVESSICETKKVKIFDHGDSSNGLPGSDCCSSLVSVKQE 720
Query: 721 NKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKAKTSALEDQETSP 780
NKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKAK SALEDQETSP
Sbjct: 721 NKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKAKISALEDQETSP 780
Query: 781 EKASNGIEESITTVSQSDAEKVKTVDIVRNDNPALPNVEPLNDDDVTDDITRGSKHSRIV 840
EKA+NGIEESITTVSQSDAEKVKTVDIVRNDNPALPNVEPLNDDDVTDDITRGSKHSRIV
Sbjct: 781 EKATNGIEESITTVSQSDAEKVKTVDIVRNDNPALPNVEPLNDDDVTDDITRGSKHSRIV 840
Query: 841 SPCKPSSSSLPSKTKSSLARSVLTQTDRERIPDMGHEGEKLHPQGRDEPYRDVFQRFYVN 900
SPCKPS+SSLPSKT+SSLARSVLTQTDR+RIPDM HEGEKLHPQGRDEPYRDVFQRFYVN
Sbjct: 841 SPCKPSTSSLPSKTRSSLARSVLTQTDRKRIPDMAHEGEKLHPQGRDEPYRDVFQRFYVN 900
Query: 901 RHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPTISPGNYNDQVPPPYDARRRKYMPAVS 960
RHQNLSPQTNFSRRRG                      NY+DQVPPPYDARRRKYMPAVS
Sbjct: 901 RHQNLSPQTNFSRRRG----------------------NYSDQVPPPYDARRRKYMPAVS 960
Query: 961 DDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVRGVKM 1020
DDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVRGVKM
Sbjct: 961 DDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVRGVKM 1020
Query: 1021 VHRMPRNISPSRCNRERGSELVGPRHGEKFMRTFEDETMDPLYAHPQPSFEVDRPPFIRD 1080
HRMPRNISPSRCNRERGSELVGPRHGEKFMRTFEDETMDPLYAHPQPSFEVDRPPFIRD
Sbjct: 1021 AHRMPRNISPSRCNRERGSELVGPRHGEKFMRTFEDETMDPLYAHPQPSFEVDRPPFIRD 1080
Query: 1081 RRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFPSKRKSERFFGHPEMARRSPPPGYRMRSP 1140
RRNFPIQRKSFQRVDSKSPG SRGRSPSQWFPSKRKSERFFGHPEMARRSPPPGYRMRSP
Sbjct: 1081 RRNFPIQRKSFQRVDSKSPGTSRGRSPSQWFPSKRKSERFFGHPEMARRSPPPGYRMRSP 1140
Query: 1141 DQPPQIHGDMPDRRHGFPFPSLPPNDLRDMGSARDHGHMRPGLRSRNRTDRMSFRNRRFE 1200
DQPPQIHGDMP RRHGFPFPSLPPN+LRDMGSARDHGHMRP LRSRNRTDRMSFRNRRFE
Sbjct: 1141 DQPPQIHGDMPVRRHGFPFPSLPPNNLRDMGSARDHGHMRPSLRSRNRTDRMSFRNRRFE 1200
Query: 1201 DMDPRDNRIESNEYFDGPVHPGQMNELIDDGNDDDRRRFSDRHEHLHQFRPQCNDSDGEN 1260
DMDPRDNRIESNEYFDGPVHPGQ+NELIDDGNDDDRRRF++RHEHLHQFRPQCNDSD EN
Sbjct: 1201 DMDPRDNRIESNEYFDGPVHPGQLNELIDDGNDDDRRRFANRHEHLHQFRPQCNDSDSEN 1260
Query: 1261 YHNDADERARPYRYCTEDEEEFHER 1285
YHNDADERARPYRYCTEDEEEFHER
Sbjct: 1261 YHNDADERARPYRYCTEDEEEFHER 1263
BLAST of Cp4.1LG13g09640 vs. ExPASy TrEMBL
Match:
A0A6J1JYG4 (uncharacterized protein LOC111489020 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111489020 PE=4 SV=1)
HSP 1 Score: 2467 bits (6395), Expect = 0.0
Identity = 1239/1285 (96.42%), Positives = 1256/1285 (97.74%), Query Frame = 0
Query: 1 MSTSDYNAIVPIKKRRFPSMQSPPPKEISSLPLVDDNIAKVDEPCVSDGPTVSNSSTITT 60
MSTSDYNAIVPIKKRRFP +QSPPPKEISSLPLVDDNIAKVDEPCVSDGPTVSNSSTITT
Sbjct: 1 MSTSDYNAIVPIKKRRFPLIQSPPPKEISSLPLVDDNIAKVDEPCVSDGPTVSNSSTITT 60
Query: 61 SEFSEKKISFSEDGKRKSDLCNMNMVQSIIGPSRVEFQENDVCSTGCVENKETCMVNENH 120
SEFSEKKISFSEDGKRKSDLCNMNMVQ IIGPSRVEFQEND CS GCVENKETCMVNENH
Sbjct: 61 SEFSEKKISFSEDGKRKSDLCNMNMVQRIIGPSRVEFQENDACSAGCVENKETCMVNENH 120
Query: 121 ALVLHEKPEFKLPHSDANSNPGLCAEKESDEIDRKQLDRLEFSTSVAKKEAELSIGSKEH 180
ALVLHEKPEFKLPHSDANSNPGLCAEKESDE+DRKQLDRLEFSTS+AKKEAELS+GSKEH
Sbjct: 121 ALVLHEKPEFKLPHSDANSNPGLCAEKESDEVDRKQLDRLEFSTSLAKKEAELSVGSKEH 180
Query: 181 LVPDSVLEGSDLKSLKQINLEPGLLNLSLSKEGSLDQPLTVNVGSSYDGSIQESNRENWD 240
LVPDSVLEGSDLKSLKQINLEP LLNLSLSKEGSLDQ LTVNVGSSYDGSIQESNRENWD
Sbjct: 181 LVPDSVLEGSDLKSLKQINLEPVLLNLSLSKEGSLDQCLTVNVGSSYDGSIQESNRENWD 240
Query: 241 LNTSMEFWEGCSSGDPPEHVPAVQTNTVVTMHRFSTEMVNTDTLSGKLTPLDDSDHLHLS 300
LNTSMEFWEGCSSGDPPEHVPAVQTNT+VT HRFSTEMVNTDTLSGKLTPLDDSDHLHLS
Sbjct: 241 LNTSMEFWEGCSSGDPPEHVPAVQTNTIVTTHRFSTEMVNTDTLSGKLTPLDDSDHLHLS 300
Query: 301 LSSSDHRHVISQEQSSFVKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKLE 360
LSSSDHRHVISQEQSSF KLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKL
Sbjct: 301 LSSSDHRHVISQEQSSFAKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKLM 360
Query: 361 SKSDEVNVLGLSDSAIVKREFLQIPNASDIYIPMNTVKAKSVNSESNYESKQVALETLGG 420
SKSDEVNVLGLSDSAIVKREFLQIPNASD+YIPMN VKAKSVNSESNYESKQ AL+TLGG
Sbjct: 361 SKSDEVNVLGLSDSAIVKREFLQIPNASDVYIPMNPVKAKSVNSESNYESKQEALKTLGG 420
Query: 421 RLDLVAKQVLPEVDSSCPAPMPFVAEMTEAAGNSCSTDLITDGGMSNHSELQTPTEEHLN 480
RLDLVAKQVLPEVDSSCPAPMPFVAEMTEAAGNSCSTDLITDG MSNH ELQTPT+EHLN
Sbjct: 421 RLDLVAKQVLPEVDSSCPAPMPFVAEMTEAAGNSCSTDLITDGDMSNHPELQTPTKEHLN 480
Query: 481 LKVHEGAYRCGGELVDSEMTDISKDPGSKDFNSPIIKPIAMPRNPSRTNDSIIEANMSSP 540
LKVHEGAY CGGELVDSEMTDISKDPGSKD N PIIKPIAMPRNPS TNDSIIEANMSSP
Sbjct: 481 LKVHEGAYCCGGELVDSEMTDISKDPGSKDSNGPIIKPIAMPRNPSPTNDSIIEANMSSP 540
Query: 541 SELHIPTTGPLNTKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNN 600
SELH PTTGPLN KVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNN
Sbjct: 541 SELHTPTTGPLNMKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNN 600
Query: 601 PLWRPSTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMDA 660
PLWRP THTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMD
Sbjct: 601 PLWRPFTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMDT 660
Query: 661 VDNDIEEDYEDGEVREPTLTTQVESSICETKKVKNFDHGDSSNGLPGSDCCSSLVSVKQE 720
VDNDIEEDYEDGEVREPTLTTQVESSICETKKVK FDHGDSSNGLPGSDCCSSLVSVKQE
Sbjct: 661 VDNDIEEDYEDGEVREPTLTTQVESSICETKKVKIFDHGDSSNGLPGSDCCSSLVSVKQE 720
Query: 721 NKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKAKTSALEDQETSP 780
NKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKAK SALEDQETSP
Sbjct: 721 NKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKAKISALEDQETSP 780
Query: 781 EKASNGIEESITTVSQSDAEKVKTVDIVRNDNPALPNVEPLNDDDVTDDITRGSKHSRIV 840
EKA+NGIEESITTVSQSDAEKVKTVDIVRNDNPALPNVEPLNDDDVTDDITRGSKHSRIV
Sbjct: 781 EKATNGIEESITTVSQSDAEKVKTVDIVRNDNPALPNVEPLNDDDVTDDITRGSKHSRIV 840
Query: 841 SPCKPSSSSLPSKTKSSLARSVLTQTDRERIPDMGHEGEKLHPQGRDEPYRDVFQRFYVN 900
SPCKPS+SSLPSKT+SSLARSVLTQTDR+RIPDM HEGEKLHPQGRDEPYRDVFQRFYVN
Sbjct: 841 SPCKPSTSSLPSKTRSSLARSVLTQTDRKRIPDMAHEGEKLHPQGRDEPYRDVFQRFYVN 900
Query: 901 RHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPTISPGNYNDQVPPPYDARRRKYMPAVS 960
RHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPTISPGNY+DQVPPPYDARRRKYMPAVS
Sbjct: 901 RHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPTISPGNYSDQVPPPYDARRRKYMPAVS 960
Query: 961 DDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVRGVKM 1020
DDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVRGVKM
Sbjct: 961 DDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVRGVKM 1020
Query: 1021 VHRMPRNISPSRCNRERGSELVGPRHGEKFMRTFEDETMDPLYAHPQPSFEVDRPPFIRD 1080
HRMPRNISPSRCNRERGSELVGPRHGEKFMRTFEDETMDPLYAHPQPSFEVDRPPFIRD
Sbjct: 1021 AHRMPRNISPSRCNRERGSELVGPRHGEKFMRTFEDETMDPLYAHPQPSFEVDRPPFIRD 1080
Query: 1081 RRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFPSKRKSERFFGHPEMARRSPPPGYRMRSP 1140
RRNFPIQRKSFQRVDSKSPG SRGRSPSQWFPSKRKSERFFGHPEMARRSPPPGYRMRSP
Sbjct: 1081 RRNFPIQRKSFQRVDSKSPGTSRGRSPSQWFPSKRKSERFFGHPEMARRSPPPGYRMRSP 1140
Query: 1141 DQPPQIHGDMPDRRHGFPFPSLPPNDLRDMGSARDHGHMRPGLRSRNRTDRMSFRNRRFE 1200
DQPPQIHGDMP RRHGFPFPSLPPN+LRDMGSARDHGHMRP LRSRNRTDRMSFRNRRFE
Sbjct: 1141 DQPPQIHGDMPVRRHGFPFPSLPPNNLRDMGSARDHGHMRPSLRSRNRTDRMSFRNRRFE 1200
Query: 1201 DMDPRDNRIESNEYFDGPVHPGQMNELIDDGNDDDRRRFSDRHEHLHQFRPQCNDSDGEN 1260
DMDPRDNRIESNEYFDGPVHPGQ+NELIDDGNDDDRRRF++RHEHLHQFRPQCNDSD EN
Sbjct: 1201 DMDPRDNRIESNEYFDGPVHPGQLNELIDDGNDDDRRRFANRHEHLHQFRPQCNDSDSEN 1260
Query: 1261 YHNDADERARPYRYCTEDEEEFHER 1285
YHNDADERARPYRYCTEDEEEFHER
Sbjct: 1261 YHNDADERARPYRYCTEDEEEFHER 1285
BLAST of Cp4.1LG13g09640 vs. ExPASy TrEMBL
Match:
A0A6J1FEB1 (uncharacterized protein LOC111444729 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111444729 PE=4 SV=1)
HSP 1 Score: 2459 bits (6372), Expect = 0.0
Identity = 1238/1287 (96.19%), Positives = 1260/1287 (97.90%), Query Frame = 0
Query: 1 MSTSDYNAIVPIKKRRFPSMQSPPPKEISSLPLVDDNIAKVDEPCVSDGPTVSNSSTITT 60
MSTSDYNAIVPIKKRRFP +QSPPPKEISSLPLVDD+IAKVDEPCVSDGPTVSNSSTITT
Sbjct: 1 MSTSDYNAIVPIKKRRFPLIQSPPPKEISSLPLVDDSIAKVDEPCVSDGPTVSNSSTITT 60
Query: 61 SEFSEKKISFSEDGKRKSDLCNMNMVQSIIGPSRVEFQENDVCSTGCVENKETCMVNENH 120
SEFSEKKISFSEDGKRKSDLCNMNMVQSIIGPSRVEFQEND CSTGCVENKETCM+NENH
Sbjct: 61 SEFSEKKISFSEDGKRKSDLCNMNMVQSIIGPSRVEFQENDACSTGCVENKETCMMNENH 120
Query: 121 ALVLHEKPEFKLPHSDANSNPGLCAEKESDEIDRKQLDRLEFSTSVAKKEAELSIGSKEH 180
ALVLHEKPEFKLPHSDANSNPGLCAEKESDEIDRKQLDRLEFSTSVAKKEAELS+GSKEH
Sbjct: 121 ALVLHEKPEFKLPHSDANSNPGLCAEKESDEIDRKQLDRLEFSTSVAKKEAELSVGSKEH 180
Query: 181 LVPDSVLEGSDLKSLKQINLEPGLLNLSLSKEGSLDQPLTVNVGSSYDGSIQESNRENWD 240
LVP+SVLEGSDLKSLKQINLEP LLNLSLSKEGSLDQ LTVNVGSSYDGSIQESNRENWD
Sbjct: 181 LVPNSVLEGSDLKSLKQINLEPVLLNLSLSKEGSLDQRLTVNVGSSYDGSIQESNRENWD 240
Query: 241 LNTSMEFWEGCSSGDPPEHVPAVQTNTVVTMHRFSTEMVNTDTLSGKLTPLDDSDHLHLS 300
LNTSMEFWEGCSSGDPPEHVPAVQTNT+VT HRFSTEMVNTDTL GKLTPLDDSDHLHLS
Sbjct: 241 LNTSMEFWEGCSSGDPPEHVPAVQTNTIVTTHRFSTEMVNTDTLPGKLTPLDDSDHLHLS 300
Query: 301 LSSSDHRHVISQEQSSFVKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKLE 360
LSSSDHRHVISQEQSSFVKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKLE
Sbjct: 301 LSSSDHRHVISQEQSSFVKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKLE 360
Query: 361 SKSDEVNVLGLSDSAIVKREFLQIPNASDIYIPMNTVKAKSVNSESNYESKQVALETLGG 420
SKSD VNVLGLSDSAIVKREFLQIPN SDIYIPMNTVKA+SVNSE NYESKQ AL+TLGG
Sbjct: 361 SKSDGVNVLGLSDSAIVKREFLQIPNVSDIYIPMNTVKARSVNSELNYESKQEALKTLGG 420
Query: 421 RLDLVAKQVLPEVDSSCPAPMPFVAEMTEAAGNSCSTDLITDGGMSNHSELQTPTEEHLN 480
RLDLVAKQVLPEV SSCPAPMPFVAEMTEAA NSCSTDLITDG MSNH ELQTPT+EHLN
Sbjct: 421 RLDLVAKQVLPEVGSSCPAPMPFVAEMTEAARNSCSTDLITDGDMSNHPELQTPTKEHLN 480
Query: 481 LKVHEGAYRCGGELVDSEMTDISKDPGSKDFNSPIIKPIAMPRNPSRTNDSIIEANMSSP 540
L VHEGAYR GEL+DSEMTD+SKDPGSKDFNSPIIKPIAMPRNPSRTNDSIIEANMSSP
Sbjct: 481 LNVHEGAYRFAGELIDSEMTDVSKDPGSKDFNSPIIKPIAMPRNPSRTNDSIIEANMSSP 540
Query: 541 SELHIPTTGPLNTKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNN 600
SELHIPTTGPLNTKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNN
Sbjct: 541 SELHIPTTGPLNTKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNN 600
Query: 601 PLWRPSTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMDA 660
PLWRPSTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMD
Sbjct: 601 PLWRPSTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMDT 660
Query: 661 VDNDIEEDYEDGEVREPTLTTQVESSICETKKVKNFDHGDSSNGLPGSDCCSSLVSVKQE 720
VDND+EEDYEDGEVREPTLTTQVESSICETKKVKNFDH DSSNGLPGSDCCSSLVSVKQE
Sbjct: 661 VDNDVEEDYEDGEVREPTLTTQVESSICETKKVKNFDHADSSNGLPGSDCCSSLVSVKQE 720
Query: 721 NKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKAKTSALEDQETSP 780
NKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKAKTSA+EDQETSP
Sbjct: 721 NKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKAKTSAIEDQETSP 780
Query: 781 EKASNGIEESITTVSQSDAEKVKTVDIVRNDNPALPNVEPLNDDDVTDDITRGSKHSRIV 840
EKA+NGIEESITTVSQSDAEKVKTVD+VRN+NPALPNVEPLNDDDVTDDITRGSKHSRIV
Sbjct: 781 EKATNGIEESITTVSQSDAEKVKTVDMVRNNNPALPNVEPLNDDDVTDDITRGSKHSRIV 840
Query: 841 SPCKPSSSSLPSKTKSSLARSVLTQTDRERIPDMGHEGEKLHPQGRDEPYRDVFQRFYVN 900
SPCKPS+SSLPSKT+SSLARSVLTQTDRERIPDM HEGEKLHPQGRDEPYRDVFQRFYVN
Sbjct: 841 SPCKPSTSSLPSKTRSSLARSVLTQTDRERIPDMAHEGEKLHPQGRDEPYRDVFQRFYVN 900
Query: 901 RHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPTISPGNYND-QVPPP-YDARRRKYMPA 960
RHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPTISPGNY+D QVPPP YDARRRKYMPA
Sbjct: 901 RHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPTISPGNYSDHQVPPPPYDARRRKYMPA 960
Query: 961 VSDDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVRGV 1020
VSDDDIDQNHYKMKPD PFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVRGV
Sbjct: 961 VSDDDIDQNHYKMKPDCPFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVRGV 1020
Query: 1021 KMVHRMPRNISPSRCNRERGSELVGPRHGEKFMRTFEDETMDPLYAHPQPSFEVDRPPFI 1080
KMVHRMPRNISPSRCNRERGSELVGPRHGEKFMRTFEDE MDPLYAHPQPSFEVDR PFI
Sbjct: 1021 KMVHRMPRNISPSRCNRERGSELVGPRHGEKFMRTFEDEAMDPLYAHPQPSFEVDRSPFI 1080
Query: 1081 RDRRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFPSKRKSERFFGHPEMARRSPPPGYRMR 1140
RDRRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFPSKRKSERFFGHPEMARRSPPPGYRMR
Sbjct: 1081 RDRRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFPSKRKSERFFGHPEMARRSPPPGYRMR 1140
Query: 1141 SPDQPPQIHGDMPDRRHGFPFPSLPPNDLRDMGSARDHGHMRPGLRSRNRTDRMSFRNRR 1200
SPDQPPQIHGDMP RRHGFPFPSLPPNDLRDMGSARDHGHMRPG+RSRNRT+RMSFRNRR
Sbjct: 1141 SPDQPPQIHGDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMRPGIRSRNRTERMSFRNRR 1200
Query: 1201 FEDMDPRDNRIESNEYFDGPVHPGQMNELIDDGNDDDRRRFSDRHEHLHQFRPQCNDSDG 1260
FEDMDPRDNRIESNEYFDGPVHPGQ+NELIDDGNDDDRRRFSDRHEHLHQFRPQCNDSDG
Sbjct: 1201 FEDMDPRDNRIESNEYFDGPVHPGQLNELIDDGNDDDRRRFSDRHEHLHQFRPQCNDSDG 1260
Query: 1261 ENYHNDADERARPYRYCTEDEEEFHER 1285
ENY NDADERARPYRYCTEDEEEFHER
Sbjct: 1261 ENYRNDADERARPYRYCTEDEEEFHER 1287
BLAST of Cp4.1LG13g09640 vs. ExPASy TrEMBL
Match:
A0A6J1JUI7 (uncharacterized protein LOC111489020 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111489020 PE=4 SV=1)
HSP 1 Score: 2407 bits (6239), Expect = 0.0
Identity = 1217/1285 (94.71%), Positives = 1234/1285 (96.03%), Query Frame = 0
Query: 1 MSTSDYNAIVPIKKRRFPSMQSPPPKEISSLPLVDDNIAKVDEPCVSDGPTVSNSSTITT 60
MSTSDYNAIVPIKKRRFP +QSPPPKEISSLPLVDDNIAKVDEPCVSDGPTVSNSSTITT
Sbjct: 1 MSTSDYNAIVPIKKRRFPLIQSPPPKEISSLPLVDDNIAKVDEPCVSDGPTVSNSSTITT 60
Query: 61 SEFSEKKISFSEDGKRKSDLCNMNMVQSIIGPSRVEFQENDVCSTGCVENKETCMVNENH 120
SEFSEKKISFSEDGKRKSDLCNMNMVQ IIGPSRVEFQEND CS GCVENKETCMVNENH
Sbjct: 61 SEFSEKKISFSEDGKRKSDLCNMNMVQRIIGPSRVEFQENDACSAGCVENKETCMVNENH 120
Query: 121 ALVLHEKPEFKLPHSDANSNPGLCAEKESDEIDRKQLDRLEFSTSVAKKEAELSIGSKEH 180
ALVLHEKPEFKLPHSDANSNPGLCAEKESDE+DRKQLDRLEFSTS+AKKEAELS+GSKEH
Sbjct: 121 ALVLHEKPEFKLPHSDANSNPGLCAEKESDEVDRKQLDRLEFSTSLAKKEAELSVGSKEH 180
Query: 181 LVPDSVLEGSDLKSLKQINLEPGLLNLSLSKEGSLDQPLTVNVGSSYDGSIQESNRENWD 240
LVPDSVLEGSDLKSLKQINLEP LLNLSLSKEGSLDQ LTVNVGSSYDGSIQESNRENWD
Sbjct: 181 LVPDSVLEGSDLKSLKQINLEPVLLNLSLSKEGSLDQCLTVNVGSSYDGSIQESNRENWD 240
Query: 241 LNTSMEFWEGCSSGDPPEHVPAVQTNTVVTMHRFSTEMVNTDTLSGKLTPLDDSDHLHLS 300
LNTSMEFWEGCSSGDPPEHVPAVQTNT+VT HRFSTEMVNTDTLSGKLTPLDDSDHLHLS
Sbjct: 241 LNTSMEFWEGCSSGDPPEHVPAVQTNTIVTTHRFSTEMVNTDTLSGKLTPLDDSDHLHLS 300
Query: 301 LSSSDHRHVISQEQSSFVKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKLE 360
LSSSDHRHVISQEQSSF KLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKL
Sbjct: 301 LSSSDHRHVISQEQSSFAKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKLM 360
Query: 361 SKSDEVNVLGLSDSAIVKREFLQIPNASDIYIPMNTVKAKSVNSESNYESKQVALETLGG 420
SKSDEVNVLGLSDSAIVKREFLQIPNASD+YIPMN VKAKSVNSESNYESKQ AL+TLGG
Sbjct: 361 SKSDEVNVLGLSDSAIVKREFLQIPNASDVYIPMNPVKAKSVNSESNYESKQEALKTLGG 420
Query: 421 RLDLVAKQVLPEVDSSCPAPMPFVAEMTEAAGNSCSTDLITDGGMSNHSELQTPTEEHLN 480
RLDLVAKQVLPEVDSSCPAPMPFVAEMTEAAGNSCSTDLITDG MSNH ELQTPT+EHLN
Sbjct: 421 RLDLVAKQVLPEVDSSCPAPMPFVAEMTEAAGNSCSTDLITDGDMSNHPELQTPTKEHLN 480
Query: 481 LKVHEGAYRCGGELVDSEMTDISKDPGSKDFNSPIIKPIAMPRNPSRTNDSIIEANMSSP 540
LKVHEGAY CGGELVDSEMTDISKDPGSKD N PIIKPIAMPRNPS TNDSIIEANMSSP
Sbjct: 481 LKVHEGAYCCGGELVDSEMTDISKDPGSKDSNGPIIKPIAMPRNPSPTNDSIIEANMSSP 540
Query: 541 SELHIPTTGPLNTKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNN 600
SELH PTTGPLN KVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNN
Sbjct: 541 SELHTPTTGPLNMKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNN 600
Query: 601 PLWRPSTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMDA 660
PLWRP THTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMD
Sbjct: 601 PLWRPFTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMDT 660
Query: 661 VDNDIEEDYEDGEVREPTLTTQVESSICETKKVKNFDHGDSSNGLPGSDCCSSLVSVKQE 720
VDNDIEEDYEDGEVREPTLTTQVESSICETKKVK FDHGDSSNGLPGSDCCSSLVSVKQE
Sbjct: 661 VDNDIEEDYEDGEVREPTLTTQVESSICETKKVKIFDHGDSSNGLPGSDCCSSLVSVKQE 720
Query: 721 NKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKAKTSALEDQETSP 780
NKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKAK SALEDQETSP
Sbjct: 721 NKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKAKISALEDQETSP 780
Query: 781 EKASNGIEESITTVSQSDAEKVKTVDIVRNDNPALPNVEPLNDDDVTDDITRGSKHSRIV 840
EKA+NGIEESITTVSQSDAEKVKTVDIVRNDNPALPNVEPLNDDDVTDDITRGSKHSRIV
Sbjct: 781 EKATNGIEESITTVSQSDAEKVKTVDIVRNDNPALPNVEPLNDDDVTDDITRGSKHSRIV 840
Query: 841 SPCKPSSSSLPSKTKSSLARSVLTQTDRERIPDMGHEGEKLHPQGRDEPYRDVFQRFYVN 900
SPCKPS+SSLPSKT+SSLARSVLTQTDR+RIPDM HEGEKLHPQGRDEPYRDVFQRFYVN
Sbjct: 841 SPCKPSTSSLPSKTRSSLARSVLTQTDRKRIPDMAHEGEKLHPQGRDEPYRDVFQRFYVN 900
Query: 901 RHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPTISPGNYNDQVPPPYDARRRKYMPAVS 960
RHQNLSPQTNFSRRRG                      NY+DQVPPPYDARRRKYMPAVS
Sbjct: 901 RHQNLSPQTNFSRRRG----------------------NYSDQVPPPYDARRRKYMPAVS 960
Query: 961 DDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVRGVKM 1020
DDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVRGVKM
Sbjct: 961 DDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVRGVKM 1020
Query: 1021 VHRMPRNISPSRCNRERGSELVGPRHGEKFMRTFEDETMDPLYAHPQPSFEVDRPPFIRD 1080
HRMPRNISPSRCNRERGSELVGPRHGEKFMRTFEDETMDPLYAHPQPSFEVDRPPFIRD
Sbjct: 1021 AHRMPRNISPSRCNRERGSELVGPRHGEKFMRTFEDETMDPLYAHPQPSFEVDRPPFIRD 1080
Query: 1081 RRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFPSKRKSERFFGHPEMARRSPPPGYRMRSP 1140
RRNFPIQRKSFQRVDSKSPG SRGRSPSQWFPSKRKSERFFGHPEMARRSPPPGYRMRSP
Sbjct: 1081 RRNFPIQRKSFQRVDSKSPGTSRGRSPSQWFPSKRKSERFFGHPEMARRSPPPGYRMRSP 1140
Query: 1141 DQPPQIHGDMPDRRHGFPFPSLPPNDLRDMGSARDHGHMRPGLRSRNRTDRMSFRNRRFE 1200
DQPPQIHGDMP RRHGFPFPSLPPN+LRDMGSARDHGHMRP LRSRNRTDRMSFRNRRFE
Sbjct: 1141 DQPPQIHGDMPVRRHGFPFPSLPPNNLRDMGSARDHGHMRPSLRSRNRTDRMSFRNRRFE 1200
Query: 1201 DMDPRDNRIESNEYFDGPVHPGQMNELIDDGNDDDRRRFSDRHEHLHQFRPQCNDSDGEN 1260
DMDPRDNRIESNEYFDGPVHPGQ+NELIDDGNDDDRRRF++RHEHLHQFRPQCNDSD EN
Sbjct: 1201 DMDPRDNRIESNEYFDGPVHPGQLNELIDDGNDDDRRRFANRHEHLHQFRPQCNDSDSEN 1260
Query: 1261 YHNDADERARPYRYCTEDEEEFHER 1285
YHNDADERARPYRYCTEDEEEFHER
Sbjct: 1261 YHNDADERARPYRYCTEDEEEFHER 1263
BLAST of Cp4.1LG13g09640 vs. ExPASy TrEMBL
Match:
A0A6J1FDD8 (uncharacterized protein LOC111444729 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111444729 PE=4 SV=1)
HSP 1 Score: 2399 bits (6216), Expect = 0.0
Identity = 1216/1287 (94.48%), Positives = 1238/1287 (96.19%), Query Frame = 0
Query: 1 MSTSDYNAIVPIKKRRFPSMQSPPPKEISSLPLVDDNIAKVDEPCVSDGPTVSNSSTITT 60
MSTSDYNAIVPIKKRRFP +QSPPPKEISSLPLVDD+IAKVDEPCVSDGPTVSNSSTITT
Sbjct: 1 MSTSDYNAIVPIKKRRFPLIQSPPPKEISSLPLVDDSIAKVDEPCVSDGPTVSNSSTITT 60
Query: 61 SEFSEKKISFSEDGKRKSDLCNMNMVQSIIGPSRVEFQENDVCSTGCVENKETCMVNENH 120
SEFSEKKISFSEDGKRKSDLCNMNMVQSIIGPSRVEFQEND CSTGCVENKETCM+NENH
Sbjct: 61 SEFSEKKISFSEDGKRKSDLCNMNMVQSIIGPSRVEFQENDACSTGCVENKETCMMNENH 120
Query: 121 ALVLHEKPEFKLPHSDANSNPGLCAEKESDEIDRKQLDRLEFSTSVAKKEAELSIGSKEH 180
ALVLHEKPEFKLPHSDANSNPGLCAEKESDEIDRKQLDRLEFSTSVAKKEAELS+GSKEH
Sbjct: 121 ALVLHEKPEFKLPHSDANSNPGLCAEKESDEIDRKQLDRLEFSTSVAKKEAELSVGSKEH 180
Query: 181 LVPDSVLEGSDLKSLKQINLEPGLLNLSLSKEGSLDQPLTVNVGSSYDGSIQESNRENWD 240
LVP+SVLEGSDLKSLKQINLEP LLNLSLSKEGSLDQ LTVNVGSSYDGSIQESNRENWD
Sbjct: 181 LVPNSVLEGSDLKSLKQINLEPVLLNLSLSKEGSLDQRLTVNVGSSYDGSIQESNRENWD 240
Query: 241 LNTSMEFWEGCSSGDPPEHVPAVQTNTVVTMHRFSTEMVNTDTLSGKLTPLDDSDHLHLS 300
LNTSMEFWEGCSSGDPPEHVPAVQTNT+VT HRFSTEMVNTDTL GKLTPLDDSDHLHLS
Sbjct: 241 LNTSMEFWEGCSSGDPPEHVPAVQTNTIVTTHRFSTEMVNTDTLPGKLTPLDDSDHLHLS 300
Query: 301 LSSSDHRHVISQEQSSFVKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKLE 360
LSSSDHRHVISQEQSSFVKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKLE
Sbjct: 301 LSSSDHRHVISQEQSSFVKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKLE 360
Query: 361 SKSDEVNVLGLSDSAIVKREFLQIPNASDIYIPMNTVKAKSVNSESNYESKQVALETLGG 420
SKSD VNVLGLSDSAIVKREFLQIPN SDIYIPMNTVKA+SVNSE NYESKQ AL+TLGG
Sbjct: 361 SKSDGVNVLGLSDSAIVKREFLQIPNVSDIYIPMNTVKARSVNSELNYESKQEALKTLGG 420
Query: 421 RLDLVAKQVLPEVDSSCPAPMPFVAEMTEAAGNSCSTDLITDGGMSNHSELQTPTEEHLN 480
RLDLVAKQVLPEV SSCPAPMPFVAEMTEAA NSCSTDLITDG MSNH ELQTPT+EHLN
Sbjct: 421 RLDLVAKQVLPEVGSSCPAPMPFVAEMTEAARNSCSTDLITDGDMSNHPELQTPTKEHLN 480
Query: 481 LKVHEGAYRCGGELVDSEMTDISKDPGSKDFNSPIIKPIAMPRNPSRTNDSIIEANMSSP 540
L VHEGAYR GEL+DSEMTD+SKDPGSKDFNSPIIKPIAMPRNPSRTNDSIIEANMSSP
Sbjct: 481 LNVHEGAYRFAGELIDSEMTDVSKDPGSKDFNSPIIKPIAMPRNPSRTNDSIIEANMSSP 540
Query: 541 SELHIPTTGPLNTKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNN 600
SELHIPTTGPLNTKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNN
Sbjct: 541 SELHIPTTGPLNTKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNN 600
Query: 601 PLWRPSTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMDA 660
PLWRPSTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMD
Sbjct: 601 PLWRPSTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMDT 660
Query: 661 VDNDIEEDYEDGEVREPTLTTQVESSICETKKVKNFDHGDSSNGLPGSDCCSSLVSVKQE 720
VDND+EEDYEDGEVREPTLTTQVESSICETKKVKNFDH DSSNGLPGSDCCSSLVSVKQE
Sbjct: 661 VDNDVEEDYEDGEVREPTLTTQVESSICETKKVKNFDHADSSNGLPGSDCCSSLVSVKQE 720
Query: 721 NKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKAKTSALEDQETSP 780
NKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKAKTSA+EDQETSP
Sbjct: 721 NKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKAKTSAIEDQETSP 780
Query: 781 EKASNGIEESITTVSQSDAEKVKTVDIVRNDNPALPNVEPLNDDDVTDDITRGSKHSRIV 840
EKA+NGIEESITTVSQSDAEKVKTVD+VRN+NPALPNVEPLNDDDVTDDITRGSKHSRIV
Sbjct: 781 EKATNGIEESITTVSQSDAEKVKTVDMVRNNNPALPNVEPLNDDDVTDDITRGSKHSRIV 840
Query: 841 SPCKPSSSSLPSKTKSSLARSVLTQTDRERIPDMGHEGEKLHPQGRDEPYRDVFQRFYVN 900
SPCKPS+SSLPSKT+SSLARSVLTQTDRERIPDM HEGEKLHPQGRDEPYRDVFQRFYVN
Sbjct: 841 SPCKPSTSSLPSKTRSSLARSVLTQTDRERIPDMAHEGEKLHPQGRDEPYRDVFQRFYVN 900
Query: 901 RHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPTISPGNYND-QVPPP-YDARRRKYMPA 960
RHQNLSPQTNFSRRRG NY+D QVPPP YDARRRKYMPA
Sbjct: 901 RHQNLSPQTNFSRRRG----------------------NYSDHQVPPPPYDARRRKYMPA 960
Query: 961 VSDDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVRGV 1020
VSDDDIDQNHYKMKPD PFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVRGV
Sbjct: 961 VSDDDIDQNHYKMKPDCPFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVRGV 1020
Query: 1021 KMVHRMPRNISPSRCNRERGSELVGPRHGEKFMRTFEDETMDPLYAHPQPSFEVDRPPFI 1080
KMVHRMPRNISPSRCNRERGSELVGPRHGEKFMRTFEDE MDPLYAHPQPSFEVDR PFI
Sbjct: 1021 KMVHRMPRNISPSRCNRERGSELVGPRHGEKFMRTFEDEAMDPLYAHPQPSFEVDRSPFI 1080
Query: 1081 RDRRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFPSKRKSERFFGHPEMARRSPPPGYRMR 1140
RDRRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFPSKRKSERFFGHPEMARRSPPPGYRMR
Sbjct: 1081 RDRRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFPSKRKSERFFGHPEMARRSPPPGYRMR 1140
Query: 1141 SPDQPPQIHGDMPDRRHGFPFPSLPPNDLRDMGSARDHGHMRPGLRSRNRTDRMSFRNRR 1200
SPDQPPQIHGDMP RRHGFPFPSLPPNDLRDMGSARDHGHMRPG+RSRNRT+RMSFRNRR
Sbjct: 1141 SPDQPPQIHGDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMRPGIRSRNRTERMSFRNRR 1200
Query: 1201 FEDMDPRDNRIESNEYFDGPVHPGQMNELIDDGNDDDRRRFSDRHEHLHQFRPQCNDSDG 1260
FEDMDPRDNRIESNEYFDGPVHPGQ+NELIDDGNDDDRRRFSDRHEHLHQFRPQCNDSDG
Sbjct: 1201 FEDMDPRDNRIESNEYFDGPVHPGQLNELIDDGNDDDRRRFSDRHEHLHQFRPQCNDSDG 1260
Query: 1261 ENYHNDADERARPYRYCTEDEEEFHER 1285
ENY NDADERARPYRYCTEDEEEFHER
Sbjct: 1261 ENYRNDADERARPYRYCTEDEEEFHER 1265
BLAST of Cp4.1LG13g09640 vs. ExPASy TrEMBL
Match:
A0A6J1BWB0 (uncharacterized protein LOC111006113 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111006113 PE=4 SV=1)
HSP 1 Score: 1449 bits (3751), Expect = 0.0
Identity = 835/1318 (63.35%), Positives = 949/1318 (72.00%), Query Frame = 0
Query: 1 MSTSDYNAIVPIKKRRFPSMQSPPP---KEISSLPLVDDNIAKVDEPCVSDGPTVSNSST 60
MS SDYN IVPIKKRRF +QS P KE+SSL L DDN+ KV EP +SDG TVS+S T
Sbjct: 16 MSASDYNVIVPIKKRRFTIVQSSPSSPHKELSSLSL-DDNLVKVAEPGISDGITVSSSVT 75
Query: 61 ITTSEFSEKK-ISFSEDGKRKSDLCNMNMVQSIIGPSRVEFQENDVCSTGCVENKETCMV 120
ITTSE SEKK ISFSE+ +RK DLCN N VQS I PS V FQE+D C VENK +
Sbjct: 76 ITTSELSEKKEISFSEESERKVDLCNSNRVQSNIEPSGVRFQEDDACFNHQVENKAMNVE 135
Query: 121 NENHALVLHEKPEFKLPHSDANSNPGLCAEKESDEIDRKQLDRLEFSTSVAKKEAELSIG 180
NE HAL L EKPE KLP SD NS GLCA K+ IDRK+L++ + TS+ K EAELS+G
Sbjct: 136 NEKHALHLLEKPELKLPTSDPNSKLGLCANKKRVGIDRKELEKCKSLTSLVKTEAELSVG 195
Query: 181 SKEHLVPDSVLEGSDLKSLKQINLEPGLLNLSLSKEGSLDQPLTVNVGSSYDGSIQESNR 240
E LVPD V++GSD K KQ NLEP LNLSLSK+GS Q LT NVGS YDGS+Q+SNR
Sbjct: 196 LNERLVPDLVVKGSDRKWQKQNNLEPVSLNLSLSKQGSYTQCLTSNVGSDYDGSLQQSNR 255
Query: 241 ENWDLNTSMEFWEGCSSGDPPEHVPAVQTNTVVTMHRFSTEMVNTDTLSGKLTPLDDSDH 300
NWDLNTSME WEGC+S DP VP VQTNT+VT HR STEMV D SGK TPLD SD+
Sbjct: 256 GNWDLNTSMESWEGCASDDPSVQVPVVQTNTIVTTHRCSTEMVRADISSGKPTPLDQSDY 315
Query: 301 LHLSLSSSDHRHVISQEQSSFVKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEA 360
LHLSL+SSD R V QEQ S VKL FR T SLSS G +QFDDLN ALKVVK EPFV+
Sbjct: 316 LHLSLNSSDLRPVTKQEQISSVKLDFRSTDSSLSSPGN-MQFDDLNVALKVVKAEPFVKG 375
Query: 361 SKLESKSDEVNVLGLSDSAIVKREF-----LQIPNASDIYIPMNTVKAKSVNSESNYESK 420
S+LESKS+EV LGLS A++ E L++P AS+I PMN VKAKS SE YESK
Sbjct: 376 SELESKSNEVKGLGLSGDALMNGELDDQCNLELPKASNICSPMNIVKAKSFKSEPVYESK 435
Query: 421 QVALETLGGRLDLVAKQVLPEVDSSCPAPMPFVAEMTEAAGN-SCSTDLITDGGMSNHSE 480
+ ALE LGGRL+L++KQVLP+VD+SCP +P VAEM+EAA N SCST L TDG M NHSE
Sbjct: 436 KEALEMLGGRLNLISKQVLPDVDNSCPIAVPVVAEMSEAARNPSCSTYLATDGDMLNHSE 495
Query: 481 LQTPTEEHLNLKVHEGAYRCGGELVDSEMTDISKDPGSKDFNSPIIKPIAMPRNPSRTND 540
L TPT+ +LN CGG LV+SE TDI+KDPG D
Sbjct: 496 LPTPTKGNLN--------ECGGGLVNSEKTDITKDPGLGD-------------------- 555
Query: 541 SIIEANMSSPSELHIPTTGPLNTKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIKP 600
SS S+ KP
Sbjct: 556 ----------------------------------------------------SSISIAKP 615
Query: 601 VIVEDENQNNPLWRPSTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDG 660
EDENQNNP W +NEQCS LQGGEESSV+DEEKISLSAD+LEE PYSSEYESDG
Sbjct: 616 FNAEDENQNNPKWCLLKLSNEQCSGLQGGEESSVSDEEKISLSADILEEYPYSSEYESDG 675
Query: 661 KLDVNEAMDAVDNDIEEDYEDGEVREPTLTTQVESSICETKKVKNFDHGDSSN------- 720
K DV+ AM V NDIEEDYEDGEVREP L TQVESS+C ++V+NFDHGD S
Sbjct: 676 KQDVDGAMAEVHNDIEEDYEDGEVREPLLKTQVESSVCVKREVENFDHGDFSKDKKINSV 735
Query: 721 GLPGSDCCSSLVSVKQENKLEILDVKREDNLHSVTSNQSSEQERS-----KELPVEEHTT 780
GLPG+D S+L+SVKQENKLE DV++ED HSVT+NQSSEQE+ KE+ VEE+ +
Sbjct: 736 GLPGTDF-STLISVKQENKLESHDVRQEDKFHSVTTNQSSEQEKDEASYLKEILVEENAS 795
Query: 781 RVCLNKANKAKT------SALEDQETSPEKASNGIEESITTVSQSDAEKVKTVDIVRNDN 840
+ + + ALEDQ +S +KA++GIEE I TVSQ DAE VKTVD VRN++
Sbjct: 796 NKVIKATGRRQLFHCEERDALEDQNSS-DKATDGIEEPIVTVSQGDAENVKTVDFVRNND 855
Query: 841 PALPNV-EPLNDDDVTDDITRGSKHSRIVSPCKPSSSSLPSKTKSSLARSVLTQTDRERI 900
P LPNV EP+N+DD TDD GS+H ++PC S+SS PSKT+S+ RSVLT+TDRE+I
Sbjct: 856 PVLPNVKEPVNNDDATDDFIHGSRH---INPCHGSTSSSPSKTRSNSLRSVLTRTDREQI 915
Query: 901 PDMGHEGEKLHPQGRDEPYRDVFQRFYVNRHQNLSPQTNFSRRRGRFTIRINSVQGEWDF 960
D+ EG KL PQGRD+ Y V Q+ YVNRHQNLSPQTNF RR RFTIR +S+QGEWDF
Sbjct: 916 LDVALEGGKLQPQGRDDRYSGVSQKIYVNRHQNLSPQTNFHRRE-RFTIRTDSLQGEWDF 975
Query: 961 NPTISPGNYNDQVPPPYDARRRKYMPAVSDDDIDQNHYKMKPDGPFRSAGDHRGRQILDD 1020
NPT+SPG Y+DQ+P YDA RRKY+ AVSDDDIDQNHYK+KP+GPFRSAG +GRQILDD
Sbjct: 976 NPTVSPGIYSDQIP--YDAPRRKYLSAVSDDDIDQNHYKIKPNGPFRSAG-RQGRQILDD 1035
Query: 1021 EGPLFCHMASRRKSPGRRDGPPPVRGVKMVHRMPRNISPSRCNRERGSELVGPRHGEKFM 1080
EGP +CH+ SRRKSPG RDGPP VRGVKMVHRMPRNISPS C RE GSELVGPRHGEKFM
Sbjct: 1036 EGPPYCHIPSRRKSPGIRDGPP-VRGVKMVHRMPRNISPSGCIREAGSELVGPRHGEKFM 1095
Query: 1081 RTFEDETMDPLYAHPQPSFEVDRPPFIRDRRNFPIQRKSFQRVDSKSPGRSRGRSPSQWF 1140
RTFEDETMDP+YAHPQP +EVDRPPFIR+RRNF IQRK+F R+DSKSPGRSRGRSP QW
Sbjct: 1096 RTFEDETMDPIYAHPQPPYEVDRPPFIRERRNFTIQRKTFPRIDSKSPGRSRGRSPGQWV 1155
Query: 1141 PSKRKSERFFGHPEMARRSPPPGYR---MRSPDQPPQIHGDMPDRRHGFPFPSLPPNDLR 1200
P KRKS RF GH M RRS P GYR MRSPDQPP IHGDMP RRHGFPF LP +DLR
Sbjct: 1156 PGKRKSYRFCGHLGMTRRSSP-GYRGDRMRSPDQPPPIHGDMPVRRHGFPFSPLPSSDLR 1215
Query: 1201 DMGSARDHGHMRPGLRSRNRTDRMSFRNRRFEDMDPRDNRIESNEYFDGPVHPGQMNELI 1260
DM SA D GHMR +R RNR+DR+SFRNRRFE MDPRD RIES+EYFDGP Q+NEL
Sbjct: 1216 DMRSAPDQGHMRSDIRCRNRSDRLSFRNRRFEIMDPRD-RIESSEYFDGP---SQLNELS 1236
Query: 1261 DDGNDDDRRRFSDRHEHLHQFRPQCNDSDGENYHNDADERARPYRYCTED-EEEFHER 1285
DGNDDDRRRFSDRHEHLH FRPQ NDSDGENYHN+A++ RP+R+C ED EFHER
Sbjct: 1276 GDGNDDDRRRFSDRHEHLHSFRPQYNDSDGENYHNNAEDSRRPFRFCAEDGPPEFHER 1236
BLAST of Cp4.1LG13g09640 vs. TAIR 10
Match:
AT5G13590.1 (unknown protein; Has 150 Blast hits to 121 proteins in 42 species: Archae - 0; Bacteria - 8; Metazoa - 80; Fungi - 5; Plants - 17; Viruses - 0; Other Eukaryotes - 40 (source: NCBI BLink). )
HSP 1 Score: 68.2 bits (165), Expect = 9.2e-11
Identity = 175/680 (25.74%), Positives = 265/680 (38.97%), Query Frame = 0
Query: 604 RPSTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMDAVDN 663
+P C+S +G +E S NDEEKI+L LEE YS +ESD D++ +
Sbjct: 540 KPCISKELPCNS-RGTDELSRNDEEKITLPGKELEEQLYSYGFESDRGYDLSRVIKEQVG 599
Query: 664 DIEEDYEDGEVREPTLTTQVESSICETKKVKNFDHGDSSNGLPGSDCCSSLVSVKQENKL 723
+DG+V+ P SN + +C S +
Sbjct: 600 K-RNLCDDGKVQGPAAVF------------------TESNEVAHPECGGS--------ET 659
Query: 724 EILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKAKTSALEDQETSPEKA 783
E ++ ++H SN E+ L L + + ++D E +
Sbjct: 660 EQRNINVPCHVHFHNSNHVEEKGSQPAL----------LGYTGETEGRIVQDGEGT---- 719
Query: 784 SNGIEESITTVSQSDAEKVKTVDIVRNDNPALPNVEPLN-DDDVTDDITRGSKHSRIVSP 843
++TVS ++ +IV N +P E D+D + + GS+ SRI++
Sbjct: 720 -----SGVSTVSGG----IENPEIVDNSSPVSLKAEMSTIDNDSPMECSDGSQ-SRIINL 779
Query: 844 CKPSSSSLPSKTKSSLARSVLTQTDRERIPDMGHEGEKLHPQGRDEPYRDVFQRFYVNRH 903
+ S P K + V + +R+R D E + +G DE + F R +
Sbjct: 780 TQVKS---PVKALDASGSFVPPRMERDRFHDFPLEPREYTFRGSDESCK--FSRERYHGR 839
Query: 904 QNLSPQTNFSRRRGRFTIRINSVQGEWDFNPTISPGNYNDQVPPPYDARRRKYMPAVSDD 963
SP+ NF R R P + N +DQ D ++ ++
Sbjct: 840 IMRSPRLNFIPDRRRL--------------PDNTESNLHDQ-----DTKKFEF------- 899
Query: 964 DIDQNHYKMKPDGPFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVRGVKMVH 1023
NH + G F S RGR+ +D + H RR SP
Sbjct: 900 ---DNHGNTRRGGAFMS-NFQRGRRPANDGVTPYAHSFPRR-SPS--------------- 959
Query: 1024 RMPRNISPSRCNRERGSELVGPRHGEKFMRTFEDETMDPLYAHPQPSFEVDRPPFIRDRR 1083
N P+ N+E S G R GEKF R + +PL+ + Q + R F R R
Sbjct: 960 -FSYNRGPT--NKEDTSAFHGFRDGEKFTRGLQCNNTEPLFMNHQRPYR-GRSGFARGRT 1019
Query: 1084 NFPIQ-RKSFQRVDSKSPGRSRGRSPSQWFPSKRKS-ERFFGHPEMARRSPPPGY---RM 1143
F ++ F S+SP RSR RS + +S E F GH + + R P GY RM
Sbjct: 1020 KFVNNPKRDFPGFRSRSPVRSRERSDGSSSSFRNRSQEEFSGHTDFSHRRSPSGYKVERM 1079
Query: 1144 RSPDQPPQIHGDMPDRRHGFPFPSLPPNDLRDMGSARDHGHM------RPGLRSRNRTDR 1203
SPD + R + PF P N R G AR G++ R G R +D
Sbjct: 1080 SSPDHSGYSREMVVRRHNSPPFSHRPSNAGRGRGYARGRGYVRGRGYGRDGNSFRKPSDH 1107
Query: 1204 MSFRNR-RFEDMDPRDNRIESNEYFDGPVHPGQMNELIDDGNDDDRRRFSDRHEHL-HQF 1263
+ RN ++DPR+ S+++F+G +H +E + +RRRF RH+ F
Sbjct: 1140 VVHRNHGNMNNLDPRERVDYSDDFFEGQIH----SERFGVDVNAERRRFGYRHDGTSSSF 1107
Query: 1264 RPQCNDSDG---ENYHNDAD 1267
RP N +DG N ND D
Sbjct: 1200 RPSFN-NDGCAPTNVENDPD 1107
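Each HSP header above reports identities and positives as a count over the alignment length, followed by a percentage. The percentages can be rechecked from the counts. The following is a minimal Python sketch under stated assumptions: the function name `parse_hsp_stats` and the returned dictionary keys are hypothetical and not part of any BLAST tool, and the pattern assumes the summary line has exactly the layout shown on this page.

```python
import re

# Matches the per-HSP summary line, e.g.
# "Identity = 175/680 (25.74%), Positives = 265/680 (38.97%), Query Frame = 0".
# "Posi?tives" also accepts the variant spelling without the first "i".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one HSP summary line.

    Hypothetical helper for illustration only.
    """
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        # Percentages recomputed from the raw counts.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Percentages as printed in the report, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 175/680 (25.74%), Positives = 265/680 (38.97%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

Comparing the recomputed values with the reported ones is a quick sanity check that a summary line survived copying intact.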
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_023550091.1 | 0.0 | 100.00 | uncharacterized protein LOC111808389 isoform X1 [Cucurbita pepo subsp. pepo]
XP_023550092.1 | 0.0 | 98.29 | uncharacterized protein LOC111808389 isoform X2 [Cucurbita pepo subsp. pepo]
XP_022992789.1 | 0.0 | 96.42 | uncharacterized protein LOC111489020 isoform X1 [Cucurbita maxima] >XP_022992790...
XP_022938519.1 | 0.0 | 96.19 | uncharacterized protein LOC111444729 isoform X1 [Cucurbita moschata] >XP_0229385...
XP_022992791.1 | 0.0 | 94.71 | uncharacterized protein LOC111489020 isoform X2 [Cucurbita maxima]
Match Name | E-value | Identity | Description
A0A6J1JYG4 | 0.0 | 96.42 | uncharacterized protein LOC111489020 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1FEB1 | 0.0 | 96.19 | uncharacterized protein LOC111444729 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1JUI7 | 0.0 | 94.71 | uncharacterized protein LOC111489020 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1FDD8 | 0.0 | 94.48 | uncharacterized protein LOC111444729 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1BWB0 | 0.0 | 63.35 | uncharacterized protein LOC111006113 isoform X1 OS=Momordica charantia OX=3673 G...
Match Name | E-value | Identity | Description
AT5G13590.1 | 9.2e-11 | 25.74 | unknown protein; Has 150 Blast hits to 121 proteins in 42 species: Archae - 0; B...
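The summary rows above are pipe-delimited, so they can be loaded into records and filtered by E-value or identity. The following is a minimal Python sketch, assuming each row carries at least the four columns Match Name, E-value, Identity and Description; any further trailing columns are ignored. The function name `parse_hit_table` and the 50% identity cutoff are hypothetical choices for illustration, not values taken from the page.

```python
def parse_hit_table(lines):
    """Parse pipe-delimited BLAST summary rows into a list of dicts.

    Hypothetical helper: skips header rows and rows with fewer than
    four columns, and ignores any columns after the description.
    """
    hits = []
    for line in lines:
        fields = [field.strip() for field in line.split("|")]
        if len(fields) < 4 or fields[0] == "Match Name":
            continue
        name, evalue, identity, description = fields[:4]
        hits.append(
            {
                "name": name,
                "evalue": float(evalue),      # "0.0" and "9.2e-11" both parse
                "identity": float(identity),  # percent identity
                "description": description,
            }
        )
    return hits


rows = [
    "Match Name | E-value | Identity | Description",
    "A0A6J1BWB0 | 0.0 | 63.35 | uncharacterized protein LOC111006113 isoform X1 OS=Momordica charantia OX=3673 G...",
    "AT5G13590.1 | 9.2e-11 | 25.74 | unknown protein; Has 150 Blast hits to 121 proteins in 42 species: Archae - 0; B...",
]
hits = parse_hit_table(rows)

# Illustrative cutoff only: keep hits with at least 50% identity.
strong = [hit["name"] for hit in hits if hit["identity"] >= 50.0]
print(strong)
```

Truncated descriptions (those ending in "...") are kept verbatim, since the full text is not recoverable from the summary table alone.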