Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TGGCGCGAATCTGAATCTCGTCTCTTCTAATATTTCTTCTGTTGGTAGAACTAGCCCTAGATCTCTCTCCCGCCTTTTTCTGTCACTGGATTGCCCTGTTTCGCCTTCTGCTCGTTGATCTCCTTCCTCCTCCGTCGTCTGCCAGCCTCGACGCCAGCGATTGAAAGAGTCCTATCACTGGTTAGTCAGTACTTGCTTTTGCTTTTCTGGTACTATTGTTCGAATCGGAGGCGGTCGATTCATTGATCGGATTTGCAAATATTCCCTTTTAGTTCGTCAGCCGAATTAGTTTGATGCATTTTTTATTTTGATAATGAGATCTAGACATTTGGACGAAACATTGGACATTAGCTCCGGAAGTTTAATTAGCTGAGTTGTTTAGCATGTTCATTAATTCTTATATTTCAGGCTTTTTGGTTGATCGGAATAGCTTTCTCTAACGTTCACGGATTAATAGGTACAACGTCCTACGGATTGATAGAATGTTCATTCATAACGTCAACTAGATTTAGTTCTCTAATGTCCATGGTTAAGTCGACGTGTTTATTTTAGTGAAAATGAGCTGTCTCCTTTCCGAAAGGAGAACCGTTCAATTGGCCGAGTAGTGTTAATGTCGAGATCTGGATTTAACAATTTAATATATGAGTATAAAGTAAATTATGAATGTGTATTGTGGAGGATTTACTTGGCGATGACTGGTATTTTTCACGCCTTGCTAGTTTCCTGGCTACTGACCTTTGTTTCTGCATAAACGTGTCAATCCTTTTGTTCACCCATAATTTGGGCGACGAGTCACGAGGTCACAAGGTTTTCAACTATATCGGACTTGATTGTACCAATGTTCCTCAATTTCCCATTTTGGAGCAGTCTTATTGAAATTTTCCTTGAAATTGAAGATGAATAAGTTTTTTTGGGCGGTGGACCTACACTATTTTGATCCTCAATTTTCACTTTGTTGTTTAGAGTCATGCTGTCCTGATGAATTTATTGCCACCATTACTTAATAACTATTTTTCACGGCAGGAGAAGCATTATCTCTGTTCAGCGAGGGGAAGAATTATACTTATCGAATGGAATTAAGAAGTTTCAGTCATTTGCATTATATCAATGTCACTAAAGGTGGTGCGATGTCAAAAGTTCTAAACGTCAACTCCCATGGAAAGCCTGCAGTCGTATTTAAGAAGCTTACTGACATATATGAATCTATAGATGACAAGACTCAAGAATCACTTCCAAGACGATGGTCGAGAGAAGGTTTGGAAGAAAATATTCCTGATGAATGTGAGTTTAAGGTGGAAACCCAAGTTCTTTACGCAGAAAGAAAACTATTCAATGATGAGCCCGAAGTTTCTGATTCTGACAATAAAAGTGATACCGATGGGGAACAGAGTGATGTAGAAGTTGATAACATGACTTTAAAGCAGATAATGGAAGGCTGCAAGAAAAGAAAGTTGAGGCAGTCGAGATCCGTTGACTCAAGTAAGGAAAAACTGGGGACATGTTCCAGACGAGAACCAGATCATGCATGCTTGTTATCTGATGAGGATGATAGTGATCTTAATGTAGCTCTTAGCATCTGGAAATCCAAACTTTCTAAATGTAGAAAATTGAAAACCAAATGTGACGAAGGTAGAATATCTACTAGTTCACATTGCGGCCAAACAATTGGAAATTCTGACCCGATCAACAGTGATCAAGATCTCCATCCATCTGGTTCAGATCTACCCGTTCCAGTAGACATTAAAGTTGAAACTCCCGAACCCGATGTGACAGAAATCCAAAGCACAAACTACAAAATTGATGAGTGGTCTCTATTTTGTGATGAAAATATAAACTCGTGTCTGAAACACGGGCCTAATGGAGCTGATGAATCAATTTTGTATCCCAAGTTGACAACATCTGAGAAAGAAGCTGAATATTGTGTTCTAAACAGTGCATGTCATGAATATTTGGAAGATGATGAACCCAAAACTCTTCAGATGGTAGGGGAATCTAGCAACGAGTGGATGTATGAAGATAACCTGGAGGTACATAAACCCCATTATTCAGATTTTCCTGCATCAGAGAGCCTGGAGGGGCAATGTACCCCAGGTTATATATCCAATTACAGCATGTCAGAAGCTATTTCCTCGACTAAGGAACAGCTCTCTGGCACTTATATTACAAACGAGGTCATATTTCAGAATAATAGTGAAGATATGTCTGAAGCAATTGCCCCGACCGAGGAACAGTGCTGTGACACTTATATTTCACAATGTATACCCTTTACACACGAGGTCATAAGTCTCAATAATCTGAATAGCCTCAAAGTCCAAGAGACGAGTCCTGAAGGTGAAGTATGTTTAACTGAGATCAGTTATAAAGACAAGTTGGCATTTGTTCATGAAAAAGGTATTCCAACAGAATCTAACAGTAATTGTAACTTGCGCCCTGATCATGGAAAAAGTATTTCAACAAATTCTATCAGTGATGGTAACTTTAGTCCTGATCAGCATTTGATATCTACTGGCGAATGTCCAGCTACGGAGAGACAACCACAAATGTCTAATTATTATGATTCAGAAAGAAATACTCCACCAGATTTTCATCTTGATGGTTCTTTGGACACGTTTAATCAAACTGAAGAACCCAAACGTCATCCAATAAGGCTGCTGTTAAAGAGAACAGTAAGTTGAATCTAATTGATGTAATATTTGGTTTTAAATTATGCGGAGGTTGTTTATATTATTGTGCTTTCACTTTCAGTCCATTTCTCCAACATCTCAGAAAAGATTGTCGAAGGGTATGAGGTCTATGCAGTTACATGACAAAGAATACAAAAGTAAGTTCCTGAAGTTAAACATCGGTTTAGTAAGTTCCTTAGCACTAATACGTGGGAGGTGGGTTCATTTAATAATATATATATATATATATATATATATATATATATAAATTTTTTTTTACTTAATTATTTTATTTTAGAATTTATTTTTATAAGCATATGTGAAATCCTAGCGAAATCCTTAAAGTATATTTATGACTAGTCCTATAGTTTGGGAGTAGTTTGTTCACTTTTCGTTTTCATTTTCCTAATTTTTCTCTTTTCATTTTAGCATGTAGTGGCAAACCATATTCCAACCAAATCAAGTACAGGGATGGCTCAGCTGAAGAGCGTGACCAGATGAAGAGAGTGTATTCGGATACGTATCATAAGCAAAAGATTAGGAAATCAAAGAAGAGAAGTCTTCACTCAGCGAGCACCACTGTAGTTCCTCAAGCTAGCATTAGAAGCACTGCTGTCCAAAATTGTTCAGACAGTGCAATTGCATTCACACAAAGGCAGATGCAGGACATAGAATGTCTTGCTCTGAAACTTACCAATCAGTTAACGTCAATGAAAGCAATTGTTGACGACAGACTTCATGTTGAAGGAAACCAAGCTACAAGTTTCAAGTTTAACACGGATGAGGTAACTAAACATATATGGGAACACAACTTAATTGACAGCATGAGTTCTATTACTTTATTGAGAAATTGAATTTTCCATGTTCCCTTACAATATTCATGTTTAACATGCCGATTTCAGATAATTATTGGACGTGATCTTCGGAGAGCTTTCATTATTAATATCCTTTGGCATTAATTAGCATATAGGAACGTGTTTGTAAAGCCTTTAATCATTTACCCCACTTTCCCTTAGTATTTATTACCTAGGATGTACTAAGCTGTAAGTTTCAATCCTGCACGCCAAGTGTTTAACCAAGTCATGCAAGGGCTTTGGCAATTATTTCGGGGAGATTTATTTTTTCACTTCTCGAGGTTTTTGCTGGTGGGGAATCATGTCCAATAAAATTCTTCCTTTAACCTATTGCCAGCTTCTTTTGGGGGATAACTAGGCGAAAGATAAATGGATTAAAGGGTTAATTGGGTGCTGCTTATGTCAATTGAGGACATTGCTCCTTCCTGTAACTTCTTGTTGAACTTTAGACATTTAGCACAGATTGTTGCTTAGATTTTGGTATGCCGGCAATTAGCACCTGAACTTCCATGCAACTATTATGTTGTCCCCTCATGTATTATTCTGCACTTTGGCAGGTGCGAACGGCCATTGCAGACGCCACGAAAGCGCAAGCGCAAGCAAGAAAATGGCTTTCCATAATGTCGAGGGATTGCAACCGCTTTTGTAAAATAATGGTTAGTACCTCAGCGTTATCCGATTTTTCTTACAGAAGGAAAGTACTGAATTTTCTTCCCTTAACTGCAGAAAACAACCGAGCATGGTTCAAATGTTTCTTCTCTAACTGCAATTCAGAAGGTGAAGAGGAAGATTACATTTGCTGATGAAGCTGGTGGAAAGCTTTGTGAAGTTAGGTTGATCGAGGACGGCATCAAGTAACTTCGTAAAGCCTATCTACTTGGTTATTTACTTATACTGGATCATTGTTCTCTTGAGATTTTATTTTTTTTTGTAAATTCCCATGTCTCATATAAAATAAAGTGCAGCCAATGGATTCTATTGAGGAAATTAAATCATTTTCATCGTTCCTTTTTCCACATTAAACCAAAATGAGTTTGACCATATTAAAGATAAAAGATCCCAGGTTGAGTTGGAGGGTAAACTTCTTTATTCTTTATAATGGTGTAGAAATACTTTTCTAACAGACTCGTATAATATCTGTTAGTGGTAGGTTTGGACGGTTAAAAATGGTATCGGAGTCAAACATCGAGCTGGTGCGAACAAGGATGATGGACCAAAAGGAGGTAAATTGTGAGATGCTTATTGAAGTGGTAGACAGGTTTTAAAACTTATAGGATTATTATAAAATGGTAGGTAGATTGATAGGCCATCACTTGTTTGAGTGGGAATCGTAGAACAGAAACTCCCAAACCTCGGAAAAAGCTCCAACTATAATCCCATGATACTCCGGCGAGTTCAACGCCAAATAACACTGCAAAAGGTCCTCCAATCCCTTCGCCTCAAAAATCTCCTTCTCCAATATCATCTGTACCATTGACCTCTTGAAATCTTCTTGGGGTTCCTTCGATTTCTTGACCTGCACTAAGCTCTCCCCCATTTTCCATTCCACTTGACTTGGACATGACCCGATTTCGCTTCTCTTCGCCGGCGATCCATCGGAGGAGAAACTAATGGAAGAGTTCAACAATGCCTCTGTTCCTTCGCCTTCCCCTCCCTCGTACCCATACGACTTGTGCTTTGCCAGGAATGCCTTGCTTTGAAGCCCCGTTTTCTTTGATTTCTTCTTCTTCTTCCACTTCTTTCTGGTTATCCAAGAAAAGGGGGAACGCCGATTCTCCTCGCCTGAAATTTCATGCTCTGTTTCGGCTCTGTTTTCTTTCACTTGCATTAGTCCGTTTGCCCGAGCGTATGATTTCAATTTGCAGCTGATGCAGACTGATGTGGACTCTTCTGTTGAAAAATCGGCGAAGAAACCCACTGGCGTTGATGGCGGTGGCGATGGTGGATTGTGGTAACAGCTGTAGCAGGATTTGGGGTTGGCCGGCAATTGCTTCTCGTAGTGGAAATTTTCCGGCGAAGGTGTTATGGGGAAAGTTAAAGGGTGTGGGTGTTTGGGGCGGCAGAATCGGAAAGAGGAGATGAAGAGACGGACTTTGAGCTTGAAGCTTCTGGCCATGGAGAGAAGGGAGGAGGATGGCCGGCGAATTGCTCTGTTTCTCTGTGTATAAATAATTGTGACAAGAGACAGTGGCCGGCCGGCAGCTAAGTTGAGCCAATAGGGTTGTGGTTTGGACCAAATCCCAACTGATGACGAGAGGTTTATTGGTACTTAATTGTGTAGAACTTTTCCTAGAAAAGTGCAGTATAATTGGTCGTAATGGCGTGGAAACCTCTCTTAGATGAATTTAAAAAACTCTTAGGGGAATCTCAAAAAAAGAGAACAATATAGGTGAATTTGCGTTGTTATAAATGATATCAAAGTCAACACCGAATAATGTGTCAGTGAGTAAGGACATCAATGTGTGAGTAAGGATACTAGGCTTTTGTTATAAATAGTATCAGAGCAAGATATGATATGCCTGTTATGACGTAGGACTCTGATTACTACCGGTAGCAGAGCTAAACTCCAAACTATGACACAGAAGATGTGACACGGAAGAGGGTGAATCCAGCGATCCCACGTGAACAAAATAACCCTAAAGTTTGAATCCTCCCGAGTGTTGCTCTCTTTTAGCAAGTTATATTCCAATAGGTTATAATATTCCAACCAAGTGGGTCCCCTCCTCTAGTCTCCACTCCCTTTTGTGCGCAGGCAAATTGCAGAAGAAGCTCTGTCCATGGCCTTGGCCCCATTCGCACACAGTCACCATTCATGATCCCAATAAAAGCTTGCATTTACATTACGACAAAACCCTCCATTAATTTCTCATCACTCTCCTCTCTTTCTCACTCTCTACTGTTGAAGAAACACGCCCAAAACAACCCCACTTTCTCCCCCAAAAATGAAACCCGACCTTTCACCTACAATGGCCACACTGAAGAAGGACTCTCCATCAGAAACTGGGGTCTCCTTTTTCCTCTCAAGAAAAGCTCGCTACAAGTTCTGGGTATTGGCCGCCATTCTACTCCTCGCTTTCTGGTCCATGTTCACCGGCTCCGTCTCCCTCAAATGGTCCGCCGGAACTTTCGCCAGATTCTACGACGGCCCCCGCAAGCCGATCTTCGACGATCTTGACATTCTGGTGATCCTTTTAGCTATTTCCACGGTGGGTTCTTGTTTTTACTGTTGTTTTTTTTTTTTTACTGTTTTAGTGTATTGTTTTGCTGAAAATTAGGAAGTTGAAGAGCGGGAGAGGGATGTCCGGCACATGTGGAATCTGTACTCCCATGGAGGCGGCGGCCGGTTGCCGCGATTCTGGTTGGAGGCTTTTGAAGCGGCGTACGAGGATTTGATCGGCGATGTTCCCGCCGTTCGAGATGCTGCCCTTTTGGAGATCGCTAGGATGTCTCTGCAATCTGTTCATGTCGACCCAATCCCGATCAAATCCAAGGTTAGCCGTCGAGATCAAACATTTTCTTGAACCCTGTCTTGTTTGTGCGACTGTTTCTGATTGAAATTGGTTTCTGAAACCAATGGGTTGGATTCTCTTGGCGATGAAAAGGTTCCTTTCTCTATTTTATTTTATTTTATTTGATTTCAAAAATGTCATATCAACATTAAAAATACTTTTAAATAATTACTCTATTAATTAGGAAATGGAAAAAGAAATGTAGGACTTCTCGGACCTGTCTCTCAAAACTCGGGATCTTTAAGACTATTCCTATAAACAAGGATAATTAAGTATATTTTAAAATTTTAGGGATAAATTAATCTATTTCTCAAACTTATTTTAAAATTTTTAGGGATCAAATGAATCCATTTCTGAAATTTTATGGATCAAAGATTCTTTTTCTTTTTCTTCCTATGTCATGTGCATTTAATTGTTATAAAAGTGGCACCAACTTTTTTTTTTTTTTTTTTTTTNNNNNNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGGCCACATGCAGCCGTAACAACTATACTTATCAAAAGTGGATGTCTCGATCATGCTTGTTCCAAACTCCTTCTTACTTGTTTTTAAGTTAGGTATGCACTTACTAATATTGTTGTGTTAATCATGCTTGTTCCAAACTCTTTCTTACTTGTTTTTAAGTTAGGTATGCACTTACTAATATTGTTGTGTTAAAGGTGTATGATTGGGAAGGTGTATGATTGGAAAGGTGTATGATCAATAACCAAAATCATATCTAACGGTGCTTCAATCCGAGTTATATGCTATCAAAACAGTTGTTCAACATATTTCATCGCTCTTACTCTCAAACGTGCCTGAGGCTCAAGCCTATTGTTCGTGAAGACCAAATTTTACACGGCCAACTCTAAAGTAGAACAAGTATTGTACAATCGTTATTACTGATATAGTGAAATAGAATGAAAGGTAAAACAGTGGTTGTATCCAACACCCAACTCTCAAACCACGGCCTCCGAACACCTCCCTAGTAAGCATGGCACTTTACACAACCCATCACAAGCAGCAGCATTATGCATCGAGTTCGAGGCTCCTACACATCATACATAATCAAATACAGCCTTTTGCTAACACTCAATAGACTAGGCCAGGCTATGAAAATGGAGCAGAACCAGCCCAGCTCACAAATTAAACTATACACTAAAAGAAGTCAAAGAATTGAAGTTGCATGCCTAAGCATCGGAACATTCCATGACCACTGCTGCGCTATGCATAAAAAGAAGACCAACAAAGGAAGCGAAACACAAAGAAGAATAAATTACCTCGGTTAGGTTAAATCCAATCCAGCTCAAAAATGTTTGTGACAGACAGACCCAATAAAACAACAAACCTAAGAAGACAAGACAGACAGTTAATTGTCAAAATCGACGAGCAACAGACAAAAATAGTCACTGCTGCTGCATCTCTATGATGTTTTGGTGGTGGTAATTGAGGTACAAAAAAAAATCCATTTACACAAAGTTGGATGCATTTATTTACATTTGCCTCCAAAGCTGCTCTCTTTCTTTCCTACTAATCTTCAACCGTCTGAATCTCACCCATCATGATTCGGTTACTAACTACATTAAACCCCAAAACTGATGGATTGATCTTCTTTCCTTTCTTTCCTTTCTTCTTTCCTCCTCCACTTTTCGCCGACCCATCTCGGCCAGCAGACACGTCCGGGTCCAGATCCCCCCCACCAGCATTGCCGGAATTCACATCTCGGGAAGCCATTGCAGATACCTTCCTCTCGTTCCGACTTCGGAATGCAATTTCAATAACATCTGCTGGGAGCAACTCCTTGTAATTGAGGAATTGATCGATGAACTCGTGGTCGGGATCATATGATCCAAGATTCTCTATCAAAAGCAGCTCTGCCTCAGATCTTGACTGCTTCAAGCAGAATTCCAGGAAACTCGTGTCTAGAACAGGCAAATAGAGTTCATACTCCATTAATAATAAAAGATAAACATAAAGCATGACAAATAATTTTCAGAAAGAATGACACGTTCAAGTATTTATATGCTTAAAAGGTTGCATTTTTAATCGTGATAGATGCGTTTAATATATAATAAAAGAAAACTGTAACGGATGTTTCTAGTGAAACTCTGGAAAAGCAAGTTAGATGTTTCTAGTGATGCAAACATTAACTACAAAGTTAAAAAGAAAACTGTAAATTAGTTTCCCAATGATCTAGAAGTCTCACAAAATGATACCTAACGGATGTTTATCATCTAAAATAAAAAGGCCTTAGTTCTCTCAAATATTTTGTCATGTCCATGTGCCACTCTGTGAGTGATCTAATACTCCAGTGTGACGAAATCTCAGAAATGTCAAATATCAAAAATATCGTGCAAACCATAAGCATCCAATCCTCGTAGGACATTTTTTTTCAAGATAAGAAACGATAATGCATTCACATCCATGCCTCGTCGGACATGAATTCATAAGCAAAATGTCTATGTCTATCTTCAACCAGGCAAGAAAAGATACCTTTTGTCCCAATGAGCCTTACACATTCATTCTCACACCAATCACGGAAGCCCATAGCTTCTACAAGGTGAGGAAAAAACTTTATCATTATACATGCCAAAGAAGATTTTGGCAAGAAAAAGGGTAAAAAAAAAAAAAAAAAAACCCAAAC
mRNA sequence
TGGCGCGAATCTGAATCTCGTCTCTTCTAATATTTCTTCTGTTGGTAGAACTAGCCCTAGATCTCTCTCCCGCCTTTTTCTGTCACTGGATTGCCCTGTTTCGCCTTCTGCTCGTTGATCTCCTTCCTCCTCCGTCGTCTGCCAGCCTCGACGCCAGCGATTGAAAGAGTCCTATCACTGAGTCATGCTGTCCTGATGAATTTATTGCCACCATTACTTAATAACTATTTTTCACGGCAGGAGAAGCATTATCTCTGTTCAGCGAGGGGAAGAATTATACTTATCGAATGGAATTAAGAAGTTTCAGTCATTTGCATTATATCAATGTCACTAAAGGTGGTGCGATGTCAAAAGTTCTAAACGTCAACTCCCATGGAAAGCCTGCAGTCGTATTTAAGAAGCTTACTGACATATATGAATCTATAGATGACAAGACTCAAGAATCACTTCCAAGACGATGGTCGAGAGAAGGTTTGGAAGAAAATATTCCTGATGAATGTGAGTTTAAGGTGGAAACCCAAGTTCTTTACGCAGAAAGAAAACTATTCAATGATGAGCCCGAAGTTTCTGATTCTGACAATAAAAGTGATACCGATGGGGAACAGAGTGATGTAGAAGTTGATAACATGACTTTAAAGCAGATAATGGAAGGCTGCAAGAAAAGAAAGTTGAGGCAGTCGAGATCCGTTGACTCAAGTAAGGAAAAACTGGGGACATGTTCCAGACGAGAACCAGATCATGCATGCTTGTTATCTGATGAGGATGATAGTGATCTTAATGTAGCTCTTAGCATCTGGAAATCCAAACTTTCTAAATGTAGAAAATTGAAAACCAAATGTGACGAAGGTAGAATATCTACTAGTTCACATTGCGGCCAAACAATTGGAAATTCTGACCCGATCAACAGTGATCAAGATCTCCATCCATCTGGTTCAGATCTACCCGTTCCAGTAGACATTAAAGTTGAAACTCCCGAACCCGATGTGACAGAAATCCAAAGCACAAACTACAAAATTGATGAGTGGTCTCTATTTTGTGATGAAAATATAAACTCGTGTCTGAAACACGGGCCTAATGGAGCTGATGAATCAATTTTGTATCCCAAGTTGACAACATCTGAGAAAGAAGCTGAATATTGTGTTCTAAACAGTGCATGTCATGAATATTTGGAAGATGATGAACCCAAAACTCTTCAGATGGTAGGGGAATCTAGCAACGAGTGGATGTATGAAGATAACCTGGAGGTACATAAACCCCATTATTCAGATTTTCCTGCATCAGAGAGCCTGGAGGGGCAATGTACCCCAGGTTATATATCCAATTACAGCATGTCAGAAGCTATTTCCTCGACTAAGGAACAGCTCTCTGGCACTTATATTACAAACGAGGTCATATTTCAGAATAATAGTGAAGATATGTCTGAAGCAATTGCCCCGACCGAGGAACAGTGCTGTGACACTTATATTTCACAATGTATACCCTTTACACACGAGGTCATAAGTCTCAATAATCTGAATAGCCTCAAAGTCCAAGAGACGAGTCCTGAAGGTGAAGTATGTTTAACTGAGATCAGTTATAAAGACAAGTTGGCATTTGTTCATGAAAAAGGTATTCCAACAGAATCTAACAGTAATTGTAACTTGCGCCCTGATCATGGAAAAAGTATTTCAACAAATTCTATCAGTGATGGTAACTTTAGTCCTGATCAGCATTTGATATCTACTGGCGAATGTCCAGCTACGGAGAGACAACCACAAATGTCTAATTATTATGATTCAGAAAGAAATACTCCACCAGATTTTCATCTTGATGGTTCTTTGGACACGTTTAATCAAACTGAAGAACCCAAACGTCATCCAATAAGGCTGCTGTTAAAGAGAACATCCATTTCTCCAACATCTCAGAAAAGATTGTCGAAGGGTATGAGGTCTATGCAGTTACATGACAAAGAATACAAAACATGTAGTGGCAAACCATATTCCAACCAAATCAAGTACAGGGATGGCTCAGCTGAAGAGCGTGACCAGATGAAGAGAGTGTATTCGGATACGTATCATAAGCAAAAGATTAGGAAATCAAAGAAGAGAAGTCTTCACTCAGCGAGCACCACTGTAGTTCCTCAAGCTAGCATTAGAAGCACTGCTGTCCAAAATTGTTCAGACAGTGCAATTGCATTCACACAAAGGCAGATGCAGGACATAGAATGTCTTGCTCTGAAACTTACCAATCAGTTAACGTCAATGAAAGCAATTGTTGACGACAGACTTCATGTTGAAGGAAACCAAGCTACAAGTTTCAAGTTTAACACGGATGAGGTGCGAACGGCCATTGCAGACGCCACGAAAGCGCAAGCGCAAGCAAGAAAATGGCTTTCCATAATGTCGAGGGATTGCAACCGCTTTTGTAAAATAATGAAAACAACCGAGCATGGTTCAAATGTTTCTTCTCTAACTGCAATTCAGAAGGTGAAGAGGAAGATTACATTTGCTGATGAAGCTGGTGGAAAGCTTTGTGAAGTTAGGTTGATCGAGGACGGCATCAAGTAACTTCGTAAAGCCTATCTACTTGGTTATTTACTTATACTGGATCATTGTTCTCTTGAGATTTTATTTTTTTTTGTAAATTCCCATGTCTCATATAAAATAAAGTGCAGCCAATGGATTCTATTGAGGAAATTAAATCATTTTCATCGTTCCTTTTTCCACATTAAACCAAAATGAGTTTGACCATATTAAAGATAAAAGATCCCAGGTTGAGTTGGAGGGTAAACTTCTTTATTCTTTATAATGGTGTAGAAATACTTTTCTAACAGACTCGTATAATATCTGTTAGTGGTAGGTTTGGACGGTTAAAAATGGTATCGGAGTCAAACATCGAGCTGGTGCGAACAAGGATGATGGACCAAAAGGAGGTAAATTGTGAGATGCTTATTGAAGTGGTAGACAGGTTTTAAAACTTATAGGATTATTATAAAATGGTAGGTAGATTGATAGGCCATCACTTGTTTGAGTGGGAATCGTAGAACAGAAACTCCCAAACCTCGGAAAAAGCTCCAACTATAATCCCATGATACTCCGGCGAGTTCAACGCCAAATAACACTGCAAAAGGTCCTCCAATCCCTTCGCCTCAAAAATCTCCTTCTCCAATATCATCTGTACCATTGACCTCTTGAAATCTTCTTGGGGTTCCTTCGATTTCTTGACCTGCACTAAGCTCTCCCCCATTTTCCATTCCACTTGACTTGGACATGACCCGATTTCGCTTCTCTTCGCCGGCGATCCATCGGAGGAGAAACTAATGGAAGAGTTCAACAATGCCTCTGTTCCTTCGCCTTCCCCTCCCTCGTACCCATACGACTTGTGCTTTGCCAGGAATGCCTTGCTTTGAAGCCCCGTTTTCTTTGATTTCTTCTTCTTCTTCCACTTCTTTCTGGTTATCCAAGAAAAGGGGGAACGCCGATTCTCCTCGCCTGAAATTTCATGCTCTGTTTCGGCTCTGTTTTCTTTCACTTGCATTAGTCCGTTTGCCCGAGCGTATGATTTCAATTTGCAGCTGATGCAGACTGATGTGGACTCTTCTGTTGAAAAATCGGCGAAGAAACCCACTGGCGTTGATGGCGGTGGCGATGGTGGATTGTGGTAACAGCTGTAGCAGGATTTGGGGTTGGCCGGCAATTGCTTCTCGTAGTGGAAATTTTCCGGCGAAGGTGTTATGGGGAAAGTTAAAGGGTGTGGGTGTTTGGGGCGGCAGAATCGGAAAGAGGAGATGAAGAGACGGACTTTGAGCTTGAAGCTTCTGGCCATGGAGAGAAGGGAGGAGGATGGCCGGCGAATTGCTCTGTTTCTCTGTGTATAAATAATTGTGACAAGAGACAGTGGCCGGCCGGCAGCTAAGTTGAGCCAATAGGGTTGTGGTTTGGACCAAATCCCAACTGATGACGAGAGGTTTATTGGTACTTAATTGTGTAGAACTTTTCCTAGAAAAGTGCAGTATAATTGGTCGTAATGGCGTGGAAACCTCTCTTAGATGAATTTAAAAAACTCTTAGGGGAATCTCAAAAAAAGAGAACAATATAGGTGAATTTGCGTTGTTATAAATGATATCAAAGTCAACACCGAATAATGTGTCAGTGAGTAAGGACATCAATGTGTGAGTAAGGATACTAGGCTTTTGTTATAAATAGTATCAGAGCAAGATATGATATGCCTGTTATGACGTAGGACTCTGATTACTACCGGTAGCAGAGCTAAACTCCAAACTATGACACAGAAGATGTGACACGGAAGAGGGTGAATCCAGCGATCCCACGTGAACAAAATAACCCTAAAGTTTGAATCCTCCCGAGTGTTGCTCTCTTTTAGCAAGTTATATTCCAATAGGTTATAATATTCCAACCAAGTGGGTCCCCTCCTCTAGTCTCCACTCCCTTTTGTGCGCAGGCAAATTGCAGAAGAAGCTCTGTCCATGGCCTTGGCCCCATTCGCACACAGTCACCATTCATGATCCCAATAAAAGCTTGCATTTACATTACGACAAAACCCTCCATTAATTTCTCATCACTCTCCTCTCTTTCTCACTCTCTACTGTTGAAGAAACACGCCCAAAACAACCCCACTTTCTCCCCCAAAAATGAAACCCGACCTTTCACCTACAATGGCCACACTGAAGAAGGACTCTCCATCAGAAACTGGGGTCTCCTTTTTCCTCTCAAGAAAAGCTCGCTACAAGTTCTGGGTATTGGCCGCCATTCTACTCCTCGCTTTCTGGTCCATGTTCACCGGCTCCGTCTCCCTCAAATGGTCCGCCGGAACTTTCGCCAGATTCTACGACGGCCCCCGCAAGCCGATCTTCGACGATCTTGACATTCTGGAAGTTGAAGAGCGGGAGAGGGATGTCCGGCACATGTGGAATCTGTACTCCCATGGAGGCGGCGGCCGGTTGCCGCGATTCTGGTTGGAGGCTTTTGAAGCGGCGTACGAGGATTTGATCGGCGATGTTCCCGCCGTTCGAGATGCTGCCCTTTTGGAGATCGCTAGGATGTCTCTGCAATCTGTTCATGTCGACCCAATCCCGATCAAATCCAAGGTTAGCCGTCGAGATCAAACATTTTCTTGAACCCTGTCTTGTTTGTGCGACTGTTTCTGATTGAAATTGGTTTCTGAAACCAATGGGTTGGATTCTCTTGGCGATGAAAAGGTTCCTTTCTCTATTTTATTTTATTTTATTTGATTTCAAAAATGTCATATCAACATTAAAAATACTTTTAAATAATTACTCTATTAATTAGGAAATGGAAAAAGAAATGTAGGACTTCTCGGACCTGTCTCTCAAAACTCGGGATCTTTAAGACTATTCCTATAAACAAGGATAATTAAGTATATTTTAAAATTTTAGGGATAAATTAATCTATTTCTCAAACTTATTTTAAAATTTTTAGGGATCAAATGAATCCATTTCTGAAATTTTATGGATCAAAGATTCTTTTTCTTTTTCTTCCTATGTCATGTGCATTTAATTGTTATAAAAGTGGCACCAACTTTTTTTTTTTTTTTTTTTTTNNNNNNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGGCCACATGCAGCCGTAACAACTATACTTATCAAAAGTGGATGTCTCGATCATGCTTGTTCCAAACTCCTTCTTACTTGTTTTTAAGTTAGGTATGCACTTACTAATATTGTTGTGTTAATCATGCTTGTTCCAAACTCTTTCTTACTTGTTTTTAAGTTAGGTATGCACTTACTAATATTGTTGTGTTAAAGGTGTATGATTGGGAAGGTGTATGATTGGAAAGGTGTATGATCAATAACCAAAATCATATCTAACGGTGCTTCAATCCGAGTTATATGCTATCAAAACAGTTGTTCAACATATTTCATCGCTCTTACTCTCAAACGTGCCTGAGGCTCAAGCCTATTGTTCGTGAAGACCAAATTTTACACGGCCAACTCTAAAGTAGAACAAGTATTGTACAATCGTTATTACTGATATAGTGAAATAGAATGAAAGGTAAAACAGTGGTTGTATCCAACACCCAACTCTCAAACCACGGCCTCCGAACACCTCCCTAGTAAGCATGGCACTTTACACAACCCATCACAAGCAGCAGCATTATGCATCGAGTTCGAGGCTCCTACACATCATACATAATCAAATACAGCCTTTTGCTAACACTCAATAGACTAGGCCAGGCTATGAAAATGGAGCAGAACCAGCCCAGCTCACAAATTAAACTATACACTAAAAGAAGTCAAAGAATTGAAGTTGCATGCCTAAGCATCGGAACATTCCATGACCACTGCTGCGCTATGCATAAAAAGAAGACCAACAAAGGAAGCGAAACACAAAGAAGAATAAATTACCTCGGTTAGGTTAAATCCAATCCAGCTCAAAAATGTTTGTGACAGACAGACCCAATAAAACAACAAACCTAAGAAGACAAGACAGACAGTTAATTGTCAAAATCGACGAGCAACAGACAAAAATAGTCACTGCTGCTGCATCTCTATGATGTTTTGGTGGTGGTAATTGAGGTACAAAAAAAAATCCATTTACACAAAGTTGGATGCATTTATTTACATTTGCCTCCAAAGCTGCTCTCTTTCTTTCCTACTAATCTTCAACCGTCTGAATCTCACCCATCATGATTCGGTTACTAACTACATTAAACCCCAAAACTGATGGATTGATCTTCTTTCCTTTCTTTCCTTTCTTCTTTCCTCCTCCACTTTTCGCCGACCCATCTCGGCCAGCAGACACGTCCGGGTCCAGATCCCCCCCACCAGCATTGCCGGAATTCACATCTCGGGAAGCCATTGCAGATACCTTCCTCTCGTTCCGACTTCGGAATGCAATTTCAATAACATCTGCTGGGAGCAACTCCTTGTAATTGAGGAATTGATCGATGAACTCGTGGTCGGGATCATATGATCCAAGATTCTCTATCAAAAGCAGCTCTGCCTCAGATCTTGACTGCTTCAAGCAGAATTCCAGGAAACTCGTGTCTAGAACAGGCAAATAGAGTTCATACTCCATTAATAATAAAAGATAAACATAAAGCATGACAAATAATTTTCAGAAAGAATGACACGTTCAAGTATTTATATGCTTAAAAGGTTGCATTTTTAATCGTGATAGATGCGTTTAATATATAATAAAAGAAAACTGTAACGGATGTTTCTAGTGAAACTCTGGAAAAGCAAGTTAGATGTTTCTAGTGATGCAAACATTAACTACAAAGTTAAAAAGAAAACTGTAAATTAGTTTCCCAATGATCTAGAAGTCTCACAAAATGATACCTAACGGATGTTTATCATCTAAAATAAAAAGGCCTTAGTTCTCTCAAATATTTTGTCATGTCCATGTGCCACTCTGTGAGTGATCTAATACTCCAGTGTGACGAAATCTCAGAAATGTCAAATATCAAAAATATCGTGCAAACCATAAGCATCCAATCCTCGTAGGACATTTTTTTTCAAGATAAGAAACGATAATGCATTCACATCCATGCCTCGTCGGACATGAATTCATAAGCAAAATGTCTATGTCTATCTTCAACCAGGCAAGAAAAGATACCTTTTGTCCCAATGAGCCTTACACATTCATTCTCACACCAATCACGGAAGCCCATAGCTTCTACAAGGTGAGGAAAAAACTTTATCATTATACATGCCAAAGAAGATTTTGGCAAGAAAAAGGGTAAAAAAAAAAAAAAAAAAACCCAAAC
Coding sequence (CDS)
ATGGAATTAAGAAGTTTCAGTCATTTGCATTATATCAATGTCACTAAAGGTGGTGCGATGTCAAAAGTTCTAAACGTCAACTCCCATGGAAAGCCTGCAGTCGTATTTAAGAAGCTTACTGACATATATGAATCTATAGATGACAAGACTCAAGAATCACTTCCAAGACGATGGTCGAGAGAAGGTTTGGAAGAAAATATTCCTGATGAATGTGAGTTTAAGGTGGAAACCCAAGTTCTTTACGCAGAAAGAAAACTATTCAATGATGAGCCCGAAGTTTCTGATTCTGACAATAAAAGTGATACCGATGGGGAACAGAGTGATGTAGAAGTTGATAACATGACTTTAAAGCAGATAATGGAAGGCTGCAAGAAAAGAAAGTTGAGGCAGTCGAGATCCGTTGACTCAAGTAAGGAAAAACTGGGGACATGTTCCAGACGAGAACCAGATCATGCATGCTTGTTATCTGATGAGGATGATAGTGATCTTAATGTAGCTCTTAGCATCTGGAAATCCAAACTTTCTAAATGTAGAAAATTGAAAACCAAATGTGACGAAGGTAGAATATCTACTAGTTCACATTGCGGCCAAACAATTGGAAATTCTGACCCGATCAACAGTGATCAAGATCTCCATCCATCTGGTTCAGATCTACCCGTTCCAGTAGACATTAAAGTTGAAACTCCCGAACCCGATGTGACAGAAATCCAAAGCACAAACTACAAAATTGATGAGTGGTCTCTATTTTGTGATGAAAATATAAACTCGTGTCTGAAACACGGGCCTAATGGAGCTGATGAATCAATTTTGTATCCCAAGTTGACAACATCTGAGAAAGAAGCTGAATATTGTGTTCTAAACAGTGCATGTCATGAATATTTGGAAGATGATGAACCCAAAACTCTTCAGATGGTAGGGGAATCTAGCAACGAGTGGATGTATGAAGATAACCTGGAGGTACATAAACCCCATTATTCAGATTTTCCTGCATCAGAGAGCCTGGAGGGGCAATGTACCCCAGGTTATATATCCAATTACAGCATGTCAGAAGCTATTTCCTCGACTAAGGAACAGCTCTCTGGCACTTATATTACAAACGAGGTCATATTTCAGAATAATAGTGAAGATATGTCTGAAGCAATTGCCCCGACCGAGGAACAGTGCTGTGACACTTATATTTCACAATGTATACCCTTTACACACGAGGTCATAAGTCTCAATAATCTGAATAGCCTCAAAGTCCAAGAGACGAGTCCTGAAGGTGAAGTATGTTTAACTGAGATCAGTTATAAAGACAAGTTGGCATTTGTTCATGAAAAAGGTATTCCAACAGAATCTAACAGTAATTGTAACTTGCGCCCTGATCATGGAAAAAGTATTTCAACAAATTCTATCAGTGATGGTAACTTTAGTCCTGATCAGCATTTGATATCTACTGGCGAATGTCCAGCTACGGAGAGACAACCACAAATGTCTAATTATTATGATTCAGAAAGAAATACTCCACCAGATTTTCATCTTGATGGTTCTTTGGACACGTTTAATCAAACTGAAGAACCCAAACGTCATCCAATAAGGCTGCTGTTAAAGAGAACATCCATTTCTCCAACATCTCAGAAAAGATTGTCGAAGGGTATGAGGTCTATGCAGTTACATGACAAAGAATACAAAACATGTAGTGGCAAACCATATTCCAACCAAATCAAGTACAGGGATGGCTCAGCTGAAGAGCGTGACCAGATGAAGAGAGTGTATTCGGATACGTATCATAAGCAAAAGATTAGGAAATCAAAGAAGAGAAGTCTTCACTCAGCGAGCACCACTGTAGTTCCTCAAGCTAGCATTAGAAGCACTGCTGTCCAAAATTGTTCAGACAGTGCAATTGCATTCACACAAAGGCAGATGCAGGACATAGAATGTCTTGCTCTGAAACTTACCAATCAGTTAACGTCAATGAAAGCAATTGTTGACGACAGACTTCATGTTGAAGGAAACCAAGCTACAAGTTTCAAGTTTAACACGGATGAGGTGCGAACGGCCATTGCAGACGCCACGAAAGCGCAAGCGCAAGCAAGAAAATGGCTTTCCATAATGTCGAGGGATTGCAACCGCTTTTGTAAAATAATGAAAACAACCGAGCATGGTTCAAATGTTTCTTCTCTAACTGCAATTCAGAAGGTGAAGAGGAAGATTACATTTGCTGATGAAGCTGGTGGAAAGCTTTGTGAAGTTAGGTTGATCGAGGACGGCATCAAGTAA
Protein sequence
MELRSFSHLHYINVTKGGAMSKVLNVNSHGKPAVVFKKLTDIYESIDDKTQESLPRRWSREGLEENIPDECEFKVETQVLYAERKLFNDEPEVSDSDNKSDTDGEQSDVEVDNMTLKQIMEGCKKRKLRQSRSVDSSKEKLGTCSRREPDHACLLSDEDDSDLNVALSIWKSKLSKCRKLKTKCDEGRISTSSHCGQTIGNSDPINSDQDLHPSGSDLPVPVDIKVETPEPDVTEIQSTNYKIDEWSLFCDENINSCLKHGPNGADESILYPKLTTSEKEAEYCVLNSACHEYLEDDEPKTLQMVGESSNEWMYEDNLEVHKPHYSDFPASESLEGQCTPGYISNYSMSEAISSTKEQLSGTYITNEVIFQNNSEDMSEAIAPTEEQCCDTYISQCIPFTHEVISLNNLNSLKVQETSPEGEVCLTEISYKDKLAFVHEKGIPTESNSNCNLRPDHGKSISTNSISDGNFSPDQHLISTGECPATERQPQMSNYYDSERNTPPDFHLDGSLDTFNQTEEPKRHPIRLLLKRTSISPTSQKRLSKGMRSMQLHDKEYKTCSGKPYSNQIKYRDGSAEERDQMKRVYSDTYHKQKIRKSKKRSLHSASTTVVPQASIRSTAVQNCSDSAIAFTQRQMQDIECLALKLTNQLTSMKAIVDDRLHVEGNQATSFKFNTDEVRTAIADATKAQAQARKWLSIMSRDCNRFCKIMKTTEHGSNVSSLTAIQKVKRKITFADEAGGKLCEVRLIEDGIK
Homology
BLAST of Cp4.1LG11g01830 vs. NCBI nr
Match:
XP_023545220.1 (uncharacterized protein LOC111804698 [Cucurbita pepo subsp. pepo] >XP_023545221.1 uncharacterized protein LOC111804698 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1494 bits (3868), Expect = 0.0
Identity = 751/751 (100.00%), Postives = 751/751 (100.00%), Query Frame = 0
Query: 1 MELRSFSHLHYINVTKGGAMSKVLNVNSHGKPAVVFKKLTDIYESIDDKTQESLPRRWSR 60
MELRSFSHLHYINVTKGGAMSKVLNVNSHGKPAVVFKKLTDIYESIDDKTQESLPRRWSR
Sbjct: 1 MELRSFSHLHYINVTKGGAMSKVLNVNSHGKPAVVFKKLTDIYESIDDKTQESLPRRWSR 60
Query: 61 EGLEENIPDECEFKVETQVLYAERKLFNDEPEVSDSDNKSDTDGEQSDVEVDNMTLKQIM 120
EGLEENIPDECEFKVETQVLYAERKLFNDEPEVSDSDNKSDTDGEQSDVEVDNMTLKQIM
Sbjct: 61 EGLEENIPDECEFKVETQVLYAERKLFNDEPEVSDSDNKSDTDGEQSDVEVDNMTLKQIM 120
Query: 121 EGCKKRKLRQSRSVDSSKEKLGTCSRREPDHACLLSDEDDSDLNVALSIWKSKLSKCRKL 180
EGCKKRKLRQSRSVDSSKEKLGTCSRREPDHACLLSDEDDSDLNVALSIWKSKLSKCRKL
Sbjct: 121 EGCKKRKLRQSRSVDSSKEKLGTCSRREPDHACLLSDEDDSDLNVALSIWKSKLSKCRKL 180
Query: 181 KTKCDEGRISTSSHCGQTIGNSDPINSDQDLHPSGSDLPVPVDIKVETPEPDVTEIQSTN 240
KTKCDEGRISTSSHCGQTIGNSDPINSDQDLHPSGSDLPVPVDIKVETPEPDVTEIQSTN
Sbjct: 181 KTKCDEGRISTSSHCGQTIGNSDPINSDQDLHPSGSDLPVPVDIKVETPEPDVTEIQSTN 240
Query: 241 YKIDEWSLFCDENINSCLKHGPNGADESILYPKLTTSEKEAEYCVLNSACHEYLEDDEPK 300
YKIDEWSLFCDENINSCLKHGPNGADESILYPKLTTSEKEAEYCVLNSACHEYLEDDEPK
Sbjct: 241 YKIDEWSLFCDENINSCLKHGPNGADESILYPKLTTSEKEAEYCVLNSACHEYLEDDEPK 300
Query: 301 TLQMVGESSNEWMYEDNLEVHKPHYSDFPASESLEGQCTPGYISNYSMSEAISSTKEQLS 360
TLQMVGESSNEWMYEDNLEVHKPHYSDFPASESLEGQCTPGYISNYSMSEAISSTKEQLS
Sbjct: 301 TLQMVGESSNEWMYEDNLEVHKPHYSDFPASESLEGQCTPGYISNYSMSEAISSTKEQLS 360
Query: 361 GTYITNEVIFQNNSEDMSEAIAPTEEQCCDTYISQCIPFTHEVISLNNLNSLKVQETSPE 420
GTYITNEVIFQNNSEDMSEAIAPTEEQCCDTYISQCIPFTHEVISLNNLNSLKVQETSPE
Sbjct: 361 GTYITNEVIFQNNSEDMSEAIAPTEEQCCDTYISQCIPFTHEVISLNNLNSLKVQETSPE 420
Query: 421 GEVCLTEISYKDKLAFVHEKGIPTESNSNCNLRPDHGKSISTNSISDGNFSPDQHLISTG 480
GEVCLTEISYKDKLAFVHEKGIPTESNSNCNLRPDHGKSISTNSISDGNFSPDQHLISTG
Sbjct: 421 GEVCLTEISYKDKLAFVHEKGIPTESNSNCNLRPDHGKSISTNSISDGNFSPDQHLISTG 480
Query: 481 ECPATERQPQMSNYYDSERNTPPDFHLDGSLDTFNQTEEPKRHPIRLLLKRTSISPTSQK 540
ECPATERQPQMSNYYDSERNTPPDFHLDGSLDTFNQTEEPKRHPIRLLLKRTSISPTSQK
Sbjct: 481 ECPATERQPQMSNYYDSERNTPPDFHLDGSLDTFNQTEEPKRHPIRLLLKRTSISPTSQK 540
Query: 541 RLSKGMRSMQLHDKEYKTCSGKPYSNQIKYRDGSAEERDQMKRVYSDTYHKQKIRKSKKR 600
RLSKGMRSMQLHDKEYKTCSGKPYSNQIKYRDGSAEERDQMKRVYSDTYHKQKIRKSKKR
Sbjct: 541 RLSKGMRSMQLHDKEYKTCSGKPYSNQIKYRDGSAEERDQMKRVYSDTYHKQKIRKSKKR 600
Query: 601 SLHSASTTVVPQASIRSTAVQNCSDSAIAFTQRQMQDIECLALKLTNQLTSMKAIVDDRL 660
SLHSASTTVVPQASIRSTAVQNCSDSAIAFTQRQMQDIECLALKLTNQLTSMKAIVDDRL
Sbjct: 601 SLHSASTTVVPQASIRSTAVQNCSDSAIAFTQRQMQDIECLALKLTNQLTSMKAIVDDRL 660
Query: 661 HVEGNQATSFKFNTDEVRTAIADATKAQAQARKWLSIMSRDCNRFCKIMKTTEHGSNVSS 720
HVEGNQATSFKFNTDEVRTAIADATKAQAQARKWLSIMSRDCNRFCKIMKTTEHGSNVSS
Sbjct: 661 HVEGNQATSFKFNTDEVRTAIADATKAQAQARKWLSIMSRDCNRFCKIMKTTEHGSNVSS 720
Query: 721 LTAIQKVKRKITFADEAGGKLCEVRLIEDGI 751
LTAIQKVKRKITFADEAGGKLCEVRLIEDGI
Sbjct: 721 LTAIQKVKRKITFADEAGGKLCEVRLIEDGI 751
BLAST of Cp4.1LG11g01830 vs. NCBI nr
Match:
XP_022962613.1 (uncharacterized protein LOC111463010 [Cucurbita moschata] >XP_022962614.1 uncharacterized protein LOC111463010 [Cucurbita moschata])
HSP 1 Score: 1459 bits (3778), Expect = 0.0
Identity = 733/752 (97.47%), Postives = 738/752 (98.14%), Query Frame = 0
Query: 1 MELRSFSHLHYINVTKGGAMSKVLNVNSHGKPAVVFKKLTDIYESIDDKTQESLPRRWSR 60
MELRSFSHLHYIN TKGGAMSKVLNVNSHGKPAVVFK+LTDIYESIDDKTQESLPRRWSR
Sbjct: 1 MELRSFSHLHYINDTKGGAMSKVLNVNSHGKPAVVFKRLTDIYESIDDKTQESLPRRWSR 60
Query: 61 EGLEENIPDECEFKVETQVLYAERKLFNDEPEVSDSDNKSDTDGEQSDVEVDNMTLKQIM 120
EGLEENIPDECEFKVETQVLYAERKLFNDEPEVSDSDNK DTDGEQSDVEVDNMTLKQIM
Sbjct: 61 EGLEENIPDECEFKVETQVLYAERKLFNDEPEVSDSDNKGDTDGEQSDVEVDNMTLKQIM 120
Query: 121 EGCKKRKLRQSRSVDSSKEKLGTCSRREPDHACLLSDEDDSDLNVALSIWKSKLSKCRKL 180
EGCKKRKLRQSRSVDSSKEK GTCSR EPDHACLLSDEDDSDLNVALSIWKSKLSKCRKL
Sbjct: 121 EGCKKRKLRQSRSVDSSKEKPGTCSRLEPDHACLLSDEDDSDLNVALSIWKSKLSKCRKL 180
Query: 181 KTKCDEGRISTSSHCGQTIGNSDPINSDQDLHPSGSDLPVPVDIKVETPEPDVTEIQSTN 240
KTKCDEGRISTSSHCGQTIGNSDPINSDQDLHPSGSDLPVPVDIKVETPEPDVTEIQSTN
Sbjct: 181 KTKCDEGRISTSSHCGQTIGNSDPINSDQDLHPSGSDLPVPVDIKVETPEPDVTEIQSTN 240
Query: 241 YKIDEWSLFCDENINSCLKHGPNGADESILYPKLTTSEKEAEYCVLNSACHEYLEDDEPK 300
YKIDEWSLFCDENINSCLKHGPNGADESI YPKLTTSEKEAEYCVLNSACHEYLEDDEPK
Sbjct: 241 YKIDEWSLFCDENINSCLKHGPNGADESIFYPKLTTSEKEAEYCVLNSACHEYLEDDEPK 300
Query: 301 TLQMVGESSNEWMYEDNLEVHKPHYSDFPASESLEGQCTPGYISNYSMSEAISSTKEQLS 360
TLQMVGESSNEWMYEDNLEVHKPHYSDFPASESLEGQCTPGYISNYSMSEAISSTKEQLS
Sbjct: 301 TLQMVGESSNEWMYEDNLEVHKPHYSDFPASESLEGQCTPGYISNYSMSEAISSTKEQLS 360
Query: 361 GTYITNEVIFQNNSEDMSEAIAPTEEQCCDTYISQCIPFTHEVISLNNLNSLKVQETSPE 420
GTYITNEVIFQNNSEDMSEAIAPTEEQCCDTYISQCIPFTHEVISLNNLNSLKVQETSPE
Sbjct: 361 GTYITNEVIFQNNSEDMSEAIAPTEEQCCDTYISQCIPFTHEVISLNNLNSLKVQETSPE 420
Query: 421 GEVCLTEISYKDKLAFVHEKGIPTESNSNCNLRPDHGKSISTNSISDGNFSPDQHLISTG 480
GEVCLTEISYKDK AFVHEKGIPTESNSNCNLRPDHGKSISTNS+SDGN SPDQHLISTG
Sbjct: 421 GEVCLTEISYKDKSAFVHEKGIPTESNSNCNLRPDHGKSISTNSVSDGNLSPDQHLISTG 480
Query: 481 ECPATERQPQMSNYYDSERNTPPDFHLDGSLDTFNQTEEPKRHPIRLLLKRTSISPTSQK 540
ECPATERQPQMSNYYDSERNTPPDFHLDGSLDTFNQTEEPKRHPIRLLLKRTSISPTSQK
Sbjct: 481 ECPATERQPQMSNYYDSERNTPPDFHLDGSLDTFNQTEEPKRHPIRLLLKRTSISPTSQK 540
Query: 541 RLSKGMRSMQLHDKEYKTCSGKPYSNQIKYRDGSAEERDQMKRVYSDTYHKQKIRKSKKR 600
RLSKGMRSMQLHDKEYKTCSGKPY NQIKYRDGS EE DQMKRVYSD YHK+ IRKSKKR
Sbjct: 541 RLSKGMRSMQLHDKEYKTCSGKPYFNQIKYRDGSTEECDQMKRVYSDIYHKKNIRKSKKR 600
Query: 601 SLHSASTTVVPQASIRSTAVQNCSDSAIAFTQRQMQDIECLALKLTNQLTSMKAIVDDRL 660
SLHSAST VPQAS+RSTAVQNCSDSAIAFTQRQMQDIECLALKLTNQLTSMKAIVDDRL
Sbjct: 601 SLHSASTAKVPQASMRSTAVQNCSDSAIAFTQRQMQDIECLALKLTNQLTSMKAIVDDRL 660
Query: 661 HVEGNQATSFKFNTDEVRTAIADATKAQAQARKWLSIMSRDCNRFCKIMKTTEHGSNVSS 720
HVEGNQATSFKFNTDEVRTAIADATKA+AQARKWLSIMSRDCNRFCKIMKTTEHGSNVSS
Sbjct: 661 HVEGNQATSFKFNTDEVRTAIADATKAEAQARKWLSIMSRDCNRFCKIMKTTEHGSNVSS 720
Query: 721 LTAIQKVKRKITFADEAGGKLCEVRLIEDGIK 752
LTAIQKVKRKITFADEAGGKLCEVRLIEDGIK
Sbjct: 721 LTAIQKVKRKITFADEAGGKLCEVRLIEDGIK 752
BLAST of Cp4.1LG11g01830 vs. NCBI nr
Match:
KAG6598431.1 (hypothetical protein SDJN03_08209, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 1434 bits (3713), Expect = 0.0
Identity = 723/752 (96.14%), Postives = 728/752 (96.81%), Query Frame = 0
Query: 1 MELRSFSHLHYINVTKGGAMSKVLNVNSHGKPAVVFKKLTDIYESIDDKTQESLPRRWSR 60
MELRSF HLHYINVTKGGAMSKVLNVNSHGKPAVVFKKLTDIYESIDDKTQESLPRRWSR
Sbjct: 1 MELRSFGHLHYINVTKGGAMSKVLNVNSHGKPAVVFKKLTDIYESIDDKTQESLPRRWSR 60
Query: 61 EGLEENIPDECEFKVETQVLYAERKLFNDEPEVSDSDNKSDTDGEQSDVEVDNMTLKQIM 120
EGLEENIPDECEFKVETQVLYAERKLFNDEPEVSDSDNK DTDGEQSDVEVDNMTLKQIM
Sbjct: 61 EGLEENIPDECEFKVETQVLYAERKLFNDEPEVSDSDNKGDTDGEQSDVEVDNMTLKQIM 120
Query: 121 EGCKKRKLRQSRSVDSSKEKLGTCSRREPDHACLLSDEDDSDLNVALSIWKSKLSKCRKL 180
EGCKKRKLRQSRSVDSSKEK GTCSRREPDHACLLSDEDDSDLNVALSIWKSKLSKCRKL
Sbjct: 121 EGCKKRKLRQSRSVDSSKEKPGTCSRREPDHACLLSDEDDSDLNVALSIWKSKLSKCRKL 180
Query: 181 KTKCDEGRISTSSHCGQTIGNSDPINSDQDLHPSGSDLPVPVDIKVETPEPDVTEIQSTN 240
KTKCDEGRISTSSHCGQTIGNSDPINSDQDLHPSGSDLPVPVDIKVETPEPDVTEIQSTN
Sbjct: 181 KTKCDEGRISTSSHCGQTIGNSDPINSDQDLHPSGSDLPVPVDIKVETPEPDVTEIQSTN 240
Query: 241 YKIDEWSLFCDENINSCLKHGPNGADESILYPKLTTSEKEAEYCVLNSACHEYLEDDEPK 300
YKIDEWSLFCDENINSCLKHGPNGADESI YPKLTTSEKEAEYCVLNSACHEYLEDDEPK
Sbjct: 241 YKIDEWSLFCDENINSCLKHGPNGADESIFYPKLTTSEKEAEYCVLNSACHEYLEDDEPK 300
Query: 301 TLQMVGESSNEWMYEDNLEVHKPHYSDFPASESLEGQCTPGYISNYSMSEAISSTKEQLS 360
TLQMVGESSNEWMYEDNLEVHKPHYSDFPASESLEGQCTPGYISNYSMSEAISSTKEQLS
Sbjct: 301 TLQMVGESSNEWMYEDNLEVHKPHYSDFPASESLEGQCTPGYISNYSMSEAISSTKEQLS 360
Query: 361 GTYITNEVIFQNNSEDMSEAIAPTEEQCCDTYISQCIPFTHEVISLNNLNSLKVQETSPE 420
GTYITNEVIFQNNSEDMSEAIAPTEEQCCDTYISQCIPFTHEVISLNNLNSLKVQETSPE
Sbjct: 361 GTYITNEVIFQNNSEDMSEAIAPTEEQCCDTYISQCIPFTHEVISLNNLNSLKVQETSPE 420
Query: 421 GEVCLTEISYKDKLAFVHEKGIPTESNSNCNLRPDHGKSISTNSISDGNFSPDQHLISTG 480
GEVCLTEISYKDK AFVHEKGIPTESNSNCNLRPDHGKSISTNS+SDGN SPDQHLISTG
Sbjct: 421 GEVCLTEISYKDKSAFVHEKGIPTESNSNCNLRPDHGKSISTNSVSDGNLSPDQHLISTG 480
Query: 481 ECPATERQPQMSNYYDSERNTPPDFHLDGSLDTFNQTEEPKRHPIRLLLKRTSISPTSQK 540
ECPATERQPQMSNYYDSERNTPPDFHLDGSLDTFNQTEEPKRHPIRLLLKRTSISPTSQK
Sbjct: 481 ECPATERQPQMSNYYDSERNTPPDFHLDGSLDTFNQTEEPKRHPIRLLLKRTSISPTSQK 540
Query: 541 RLSKGMRSMQLHDKEYKTCSGKPYSNQIKYRDGSAEERDQMKRVYSDTYHKQKIRKSKKR 600
RLSKGMRSMQLHDKEYKTCSGKPY NQIKYRD +D YHK+ IRKSKKR
Sbjct: 541 RLSKGMRSMQLHDKEYKTCSGKPYFNQIKYRD-------------ADIYHKKNIRKSKKR 600
Query: 601 SLHSASTTVVPQASIRSTAVQNCSDSAIAFTQRQMQDIECLALKLTNQLTSMKAIVDDRL 660
SLHSASTT VPQAS+RSTAVQNCSDSAIAFTQRQMQDIECLALKLTNQLTSMKAIVDDRL
Sbjct: 601 SLHSASTTKVPQASMRSTAVQNCSDSAIAFTQRQMQDIECLALKLTNQLTSMKAIVDDRL 660
Query: 661 HVEGNQATSFKFNTDEVRTAIADATKAQAQARKWLSIMSRDCNRFCKIMKTTEHGSNVSS 720
HVEGNQATSFKFN DEVRTAIADATKA+AQARKWLSIMSRDCNRFCKIMKTTEHGSNVSS
Sbjct: 661 HVEGNQATSFKFNKDEVRTAIADATKAEAQARKWLSIMSRDCNRFCKIMKTTEHGSNVSS 720
Query: 721 LTAIQKVKRKITFADEAGGKLCEVRLIEDGIK 752
LTAIQKVKRKITFADEAGGKLCEVRLIEDGIK
Sbjct: 721 LTAIQKVKRKITFADEAGGKLCEVRLIEDGIK 739
BLAST of Cp4.1LG11g01830 vs. NCBI nr
Match:
KAG7029375.1 (hypothetical protein SDJN02_07713, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 1432 bits (3707), Expect = 0.0
Identity = 723/752 (96.14%), Postives = 729/752 (96.94%), Query Frame = 0
Query: 1 MELRSFSHLHYINVTKGGAMSKVLNVNSHGKPAVVFKKLTDIYESIDDKTQESLPRRWSR 60
MELRSFSHLHYINVTKGGAMSKVLNVNSHGKPAVVFKKLTDIY+SIDDKTQESLPRRWSR
Sbjct: 19 MELRSFSHLHYINVTKGGAMSKVLNVNSHGKPAVVFKKLTDIYKSIDDKTQESLPRRWSR 78
Query: 61 EGLEENIPDECEFKVETQVLYAERKLFNDEPEVSDSDNKSDTDGEQSDVEVDNMTLKQIM 120
EGLEENIPDECEFKVETQVLYAERKLFNDEPEVSDSDNK DTDGEQSDVEVDNMTLKQIM
Sbjct: 79 EGLEENIPDECEFKVETQVLYAERKLFNDEPEVSDSDNKGDTDGEQSDVEVDNMTLKQIM 138
Query: 121 EGCKKRKLRQSRSVDSSKEKLGTCSRREPDHACLLSDEDDSDLNVALSIWKSKLSKCRKL 180
EGCKKRKLRQSRSVDSSKEK GTCSRREPDHACLLSDEDDSDLNVALSIWKSKLSKCRKL
Sbjct: 139 EGCKKRKLRQSRSVDSSKEKPGTCSRREPDHACLLSDEDDSDLNVALSIWKSKLSKCRKL 198
Query: 181 KTKCDEGRISTSSHCGQTIGNSDPINSDQDLHPSGSDLPVPVDIKVETPEPDVTEIQSTN 240
KTKCDEGRISTSSHCGQTIGNSDPINSDQDLHPSGSDLPVPVDIKVETPEPDVTEIQSTN
Sbjct: 199 KTKCDEGRISTSSHCGQTIGNSDPINSDQDLHPSGSDLPVPVDIKVETPEPDVTEIQSTN 258
Query: 241 YKIDEWSLFCDENINSCLKHGPNGADESILYPKLTTSEKEAEYCVLNSACHEYLEDDEPK 300
YKIDEWSLFCDENINSCLKHGPNGADESI YPKLTTSEKEAEYCVLNSACHEYLEDDEPK
Sbjct: 259 YKIDEWSLFCDENINSCLKHGPNGADESIFYPKLTTSEKEAEYCVLNSACHEYLEDDEPK 318
Query: 301 TLQMVGESSNEWMYEDNLEVHKPHYSDFPASESLEGQCTPGYISNYSMSEAISSTKEQLS 360
TLQMVGESSNEWMYEDNLEVHKPHYSDFPASESLEGQCTPGYISNYSMSEAISSTKEQLS
Sbjct: 319 TLQMVGESSNEWMYEDNLEVHKPHYSDFPASESLEGQCTPGYISNYSMSEAISSTKEQLS 378
Query: 361 GTYITNEVIFQNNSEDMSEAIAPTEEQCCDTYISQCIPFTHEVISLNNLNSLKVQETSPE 420
GTYITNEVIFQNNSEDMSEAIAPTEEQCCDTYISQCIPFTHEVISLNNLNSLKVQETSPE
Sbjct: 379 GTYITNEVIFQNNSEDMSEAIAPTEEQCCDTYISQCIPFTHEVISLNNLNSLKVQETSPE 438
Query: 421 GEVCLTEISYKDKLAFVHEKGIPTESNSNCNLRPDHGKSISTNSISDGNFSPDQHLISTG 480
GEVCLTEISYKDK AFVHEKGIPTESNSNCNLRPDHGKSISTNS+SDGN SPDQHLISTG
Sbjct: 439 GEVCLTEISYKDKSAFVHEKGIPTESNSNCNLRPDHGKSISTNSVSDGNLSPDQHLISTG 498
Query: 481 ECPATERQPQMSNYYDSERNTPPDFHLDGSLDTFNQTEEPKRHPIRLLLKRTSISPTSQK 540
ECPATERQPQMSNYYDSERNTPPDFHLDGSLDTFNQTEEPKRHPIRLLLKRTSISPTSQK
Sbjct: 499 ECPATERQPQMSNYYDSERNTPPDFHLDGSLDTFNQTEEPKRHPIRLLLKRTSISPTSQK 558
Query: 541 RLSKGMRSMQLHDKEYKTCSGKPYSNQIKYRDGSAEERDQMKRVYSDTYHKQKIRKSKKR 600
RLSKGMRSMQLHDKEYKTCSGKPY NQIKYRD +D YHK+ IRKSKKR
Sbjct: 559 RLSKGMRSMQLHDKEYKTCSGKPYFNQIKYRD-------------ADIYHKKNIRKSKKR 618
Query: 601 SLHSASTTVVPQASIRSTAVQNCSDSAIAFTQRQMQDIECLALKLTNQLTSMKAIVDDRL 660
SLHSASTT VPQAS+RSTAVQNCSDSAIAFTQRQMQDIECLALKLTNQLTSMKAIVDDRL
Sbjct: 619 SLHSASTTKVPQASMRSTAVQNCSDSAIAFTQRQMQDIECLALKLTNQLTSMKAIVDDRL 678
Query: 661 HVEGNQATSFKFNTDEVRTAIADATKAQAQARKWLSIMSRDCNRFCKIMKTTEHGSNVSS 720
HVEGNQATSFKFNTDEVRTAIADATKA+AQARKWLSIMSRDCNRF KIMKTTEHGSNVSS
Sbjct: 679 HVEGNQATSFKFNTDEVRTAIADATKAEAQARKWLSIMSRDCNRFGKIMKTTEHGSNVSS 738
Query: 721 LTAIQKVKRKITFADEAGGKLCEVRLIEDGIK 752
LTAIQKVKRKITFADEAGGKLCEVRLIEDGIK
Sbjct: 739 LTAIQKVKRKITFADEAGGKLCEVRLIEDGIK 757
BLAST of Cp4.1LG11g01830 vs. NCBI nr
Match:
XP_022997327.1 (uncharacterized protein LOC111492272 isoform X1 [Cucurbita maxima] >XP_022997328.1 uncharacterized protein LOC111492272 isoform X1 [Cucurbita maxima])
HSP 1 Score: 1426 bits (3692), Expect = 0.0
Identity = 718/751 (95.61%), Postives = 732/751 (97.47%), Query Frame = 0
Query: 1 MELRSFSHLHYINVTKGGAMSKVLNVNSHGKPAVVFKKLTDIYESIDDKTQESLPRRWSR 60
MELRSFSHLHYINVTKGGAMSKVLNVNSHGKPAVVFKKLTDIYESIDDKTQESLPRRWSR
Sbjct: 1 MELRSFSHLHYINVTKGGAMSKVLNVNSHGKPAVVFKKLTDIYESIDDKTQESLPRRWSR 60
Query: 61 EGLEENIPDECEFKVETQVLYAERKLFNDEPEVSDSDNKSDTDGEQSDVEVDNMTLKQIM 120
EGLEENIPDECEFKVETQVLYAERKLFN+EPEVSDSD+K DTDG++SDVEVD+MTLKQI
Sbjct: 61 EGLEENIPDECEFKVETQVLYAERKLFNNEPEVSDSDSKGDTDGQKSDVEVDSMTLKQIT 120
Query: 121 EGCKKRKLRQSRSVDSSKEKLGTCSRREPDHACLLSDEDDSDLNVALSIWKSKLSKCRKL 180
EGCKKRKLRQSRSVDSSKEKL TCSRRE DHACLLSDEDDSDLNVAL+IWKSKLSK RKL
Sbjct: 121 EGCKKRKLRQSRSVDSSKEKLRTCSRRELDHACLLSDEDDSDLNVALNIWKSKLSKRRKL 180
Query: 181 KTKCDEGRISTSSHCGQTIGNSDPINSDQDLHPSGSDLPVPVDIKVETPEPDVTEIQSTN 240
KTKCDE RISTSSHCGQTIGNSDPINSDQDLHPSGSDLPVPVDIKVETPEPDV+EIQSTN
Sbjct: 181 KTKCDESRISTSSHCGQTIGNSDPINSDQDLHPSGSDLPVPVDIKVETPEPDVSEIQSTN 240
Query: 241 YKIDEWSLFCDENINSCLKHGPNGADESILYPKLTTSEKEAEYCVLNSACHEYLEDDEPK 300
YKIDEWSLFCDENINSCLKHGPNGADESI YPKLTTSEKEAEYCVLNSACHEYLEDDEPK
Sbjct: 241 YKIDEWSLFCDENINSCLKHGPNGADESIFYPKLTTSEKEAEYCVLNSACHEYLEDDEPK 300
Query: 301 TLQMVGESSNEWMYEDNLEVHKPHYSDFPASESLEGQCTPGYISNYSMSEAISSTKEQLS 360
TLQMVGESSNEWMYEDNLE HKPHYSDFPASESLEGQCTPGYISNYSMSEAISSTKEQLS
Sbjct: 301 TLQMVGESSNEWMYEDNLEEHKPHYSDFPASESLEGQCTPGYISNYSMSEAISSTKEQLS 360
Query: 361 GTYITNEVIFQNNSEDMSEAIAPTEEQCCDTYISQCIPFTHEVISLNNLNSLKVQETSPE 420
GTYITNEVIFQNNSEDMSEAIAPTEEQCCDTYISQCIPFTH+VI LNNLNSLKVQETSPE
Sbjct: 361 GTYITNEVIFQNNSEDMSEAIAPTEEQCCDTYISQCIPFTHDVICLNNLNSLKVQETSPE 420
Query: 421 GEVCLTEISYKDKLAFVHEKGIPTESNSNCNLRPDHGKSISTNSISDGNFSPDQHLISTG 480
EVCLTEISYKDKLAFVHEKG PTESNSNCNLRPDHGK ISTNSISDGN SPDQHLISTG
Sbjct: 421 AEVCLTEISYKDKLAFVHEKGTPTESNSNCNLRPDHGKRISTNSISDGNLSPDQHLISTG 480
Query: 481 ECPATERQPQMSNYYDSERNTPPDFHLDGSLDTFNQTEEPKRHPIRLLLKRTSISPTSQK 540
ECPATERQPQMSNYYDSERNTPPDFHLDGSLD F QTEEPKRHP RLLLKRTSISPTSQK
Sbjct: 481 ECPATERQPQMSNYYDSERNTPPDFHLDGSLDKFYQTEEPKRHPTRLLLKRTSISPTSQK 540
Query: 541 RLSKGMRSMQLHDKEYKTCSGKPYSNQIKYRDGSAEERDQMKRVYSDTYHKQKIRKSKKR 600
RLSKGMRSMQLHDKEYKTCSGKPY NQIKYRDGSAEE DQMK V+SDTYHKQKIRKSKKR
Sbjct: 541 RLSKGMRSMQLHDKEYKTCSGKPYFNQIKYRDGSAEECDQMKIVHSDTYHKQKIRKSKKR 600
Query: 601 SLHSASTTVVPQASIRSTAVQNCSDSAIAFTQRQMQDIECLALKLTNQLTSMKAIVDDRL 660
SLHSASTTVVPQAS+RSTAVQNCSDSAIAFTQRQMQDIECLALKLTNQLTSMKAIVDDRL
Sbjct: 601 SLHSASTTVVPQASMRSTAVQNCSDSAIAFTQRQMQDIECLALKLTNQLTSMKAIVDDRL 660
Query: 661 HVEGNQATSFKFNTDEVRTAIADATKAQAQARKWLSIMSRDCNRFCKIMKTTEHGSNVSS 720
HVEGNQATSFKFNTDEVRTA+ADATKA+AQARKWLSIMSRDC+RFCKIMKTTEHGSNVSS
Sbjct: 661 HVEGNQATSFKFNTDEVRTAVADATKAEAQARKWLSIMSRDCSRFCKIMKTTEHGSNVSS 720
Query: 721 LTAIQKVKRKITFADEAGGKLCEVRLIEDGI 751
LTAIQK+KRKITFADEAGGKLCEVRLIEDGI
Sbjct: 721 LTAIQKLKRKITFADEAGGKLCEVRLIEDGI 751
BLAST of Cp4.1LG11g01830 vs. ExPASy TrEMBL
Match:
A0A6J1HHL2 (uncharacterized protein LOC111463010 OS=Cucurbita moschata OX=3662 GN=LOC111463010 PE=4 SV=1)
HSP 1 Score: 1459 bits (3778), Expect = 0.0
Identity = 733/752 (97.47%), Postives = 738/752 (98.14%), Query Frame = 0
Query: 1 MELRSFSHLHYINVTKGGAMSKVLNVNSHGKPAVVFKKLTDIYESIDDKTQESLPRRWSR 60
MELRSFSHLHYIN TKGGAMSKVLNVNSHGKPAVVFK+LTDIYESIDDKTQESLPRRWSR
Sbjct: 1 MELRSFSHLHYINDTKGGAMSKVLNVNSHGKPAVVFKRLTDIYESIDDKTQESLPRRWSR 60
Query: 61 EGLEENIPDECEFKVETQVLYAERKLFNDEPEVSDSDNKSDTDGEQSDVEVDNMTLKQIM 120
EGLEENIPDECEFKVETQVLYAERKLFNDEPEVSDSDNK DTDGEQSDVEVDNMTLKQIM
Sbjct: 61 EGLEENIPDECEFKVETQVLYAERKLFNDEPEVSDSDNKGDTDGEQSDVEVDNMTLKQIM 120
Query: 121 EGCKKRKLRQSRSVDSSKEKLGTCSRREPDHACLLSDEDDSDLNVALSIWKSKLSKCRKL 180
EGCKKRKLRQSRSVDSSKEK GTCSR EPDHACLLSDEDDSDLNVALSIWKSKLSKCRKL
Sbjct: 121 EGCKKRKLRQSRSVDSSKEKPGTCSRLEPDHACLLSDEDDSDLNVALSIWKSKLSKCRKL 180
Query: 181 KTKCDEGRISTSSHCGQTIGNSDPINSDQDLHPSGSDLPVPVDIKVETPEPDVTEIQSTN 240
KTKCDEGRISTSSHCGQTIGNSDPINSDQDLHPSGSDLPVPVDIKVETPEPDVTEIQSTN
Sbjct: 181 KTKCDEGRISTSSHCGQTIGNSDPINSDQDLHPSGSDLPVPVDIKVETPEPDVTEIQSTN 240
Query: 241 YKIDEWSLFCDENINSCLKHGPNGADESILYPKLTTSEKEAEYCVLNSACHEYLEDDEPK 300
YKIDEWSLFCDENINSCLKHGPNGADESI YPKLTTSEKEAEYCVLNSACHEYLEDDEPK
Sbjct: 241 YKIDEWSLFCDENINSCLKHGPNGADESIFYPKLTTSEKEAEYCVLNSACHEYLEDDEPK 300
Query: 301 TLQMVGESSNEWMYEDNLEVHKPHYSDFPASESLEGQCTPGYISNYSMSEAISSTKEQLS 360
TLQMVGESSNEWMYEDNLEVHKPHYSDFPASESLEGQCTPGYISNYSMSEAISSTKEQLS
Sbjct: 301 TLQMVGESSNEWMYEDNLEVHKPHYSDFPASESLEGQCTPGYISNYSMSEAISSTKEQLS 360
Query: 361 GTYITNEVIFQNNSEDMSEAIAPTEEQCCDTYISQCIPFTHEVISLNNLNSLKVQETSPE 420
GTYITNEVIFQNNSEDMSEAIAPTEEQCCDTYISQCIPFTHEVISLNNLNSLKVQETSPE
Sbjct: 361 GTYITNEVIFQNNSEDMSEAIAPTEEQCCDTYISQCIPFTHEVISLNNLNSLKVQETSPE 420
Query: 421 GEVCLTEISYKDKLAFVHEKGIPTESNSNCNLRPDHGKSISTNSISDGNFSPDQHLISTG 480
GEVCLTEISYKDK AFVHEKGIPTESNSNCNLRPDHGKSISTNS+SDGN SPDQHLISTG
Sbjct: 421 GEVCLTEISYKDKSAFVHEKGIPTESNSNCNLRPDHGKSISTNSVSDGNLSPDQHLISTG 480
Query: 481 ECPATERQPQMSNYYDSERNTPPDFHLDGSLDTFNQTEEPKRHPIRLLLKRTSISPTSQK 540
ECPATERQPQMSNYYDSERNTPPDFHLDGSLDTFNQTEEPKRHPIRLLLKRTSISPTSQK
Sbjct: 481 ECPATERQPQMSNYYDSERNTPPDFHLDGSLDTFNQTEEPKRHPIRLLLKRTSISPTSQK 540
Query: 541 RLSKGMRSMQLHDKEYKTCSGKPYSNQIKYRDGSAEERDQMKRVYSDTYHKQKIRKSKKR 600
RLSKGMRSMQLHDKEYKTCSGKPY NQIKYRDGS EE DQMKRVYSD YHK+ IRKSKKR
Sbjct: 541 RLSKGMRSMQLHDKEYKTCSGKPYFNQIKYRDGSTEECDQMKRVYSDIYHKKNIRKSKKR 600
Query: 601 SLHSASTTVVPQASIRSTAVQNCSDSAIAFTQRQMQDIECLALKLTNQLTSMKAIVDDRL 660
SLHSAST VPQAS+RSTAVQNCSDSAIAFTQRQMQDIECLALKLTNQLTSMKAIVDDRL
Sbjct: 601 SLHSASTAKVPQASMRSTAVQNCSDSAIAFTQRQMQDIECLALKLTNQLTSMKAIVDDRL 660
Query: 661 HVEGNQATSFKFNTDEVRTAIADATKAQAQARKWLSIMSRDCNRFCKIMKTTEHGSNVSS 720
HVEGNQATSFKFNTDEVRTAIADATKA+AQARKWLSIMSRDCNRFCKIMKTTEHGSNVSS
Sbjct: 661 HVEGNQATSFKFNTDEVRTAIADATKAEAQARKWLSIMSRDCNRFCKIMKTTEHGSNVSS 720
Query: 721 LTAIQKVKRKITFADEAGGKLCEVRLIEDGIK 752
LTAIQKVKRKITFADEAGGKLCEVRLIEDGIK
Sbjct: 721 LTAIQKVKRKITFADEAGGKLCEVRLIEDGIK 752
BLAST of Cp4.1LG11g01830 vs. ExPASy TrEMBL
Match:
A0A6J1K4P1 (uncharacterized protein LOC111492272 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111492272 PE=4 SV=1)
HSP 1 Score: 1426 bits (3692), Expect = 0.0
Identity = 718/751 (95.61%), Postives = 732/751 (97.47%), Query Frame = 0
Query: 1 MELRSFSHLHYINVTKGGAMSKVLNVNSHGKPAVVFKKLTDIYESIDDKTQESLPRRWSR 60
MELRSFSHLHYINVTKGGAMSKVLNVNSHGKPAVVFKKLTDIYESIDDKTQESLPRRWSR
Sbjct: 1 MELRSFSHLHYINVTKGGAMSKVLNVNSHGKPAVVFKKLTDIYESIDDKTQESLPRRWSR 60
Query: 61 EGLEENIPDECEFKVETQVLYAERKLFNDEPEVSDSDNKSDTDGEQSDVEVDNMTLKQIM 120
EGLEENIPDECEFKVETQVLYAERKLFN+EPEVSDSD+K DTDG++SDVEVD+MTLKQI
Sbjct: 61 EGLEENIPDECEFKVETQVLYAERKLFNNEPEVSDSDSKGDTDGQKSDVEVDSMTLKQIT 120
Query: 121 EGCKKRKLRQSRSVDSSKEKLGTCSRREPDHACLLSDEDDSDLNVALSIWKSKLSKCRKL 180
EGCKKRKLRQSRSVDSSKEKL TCSRRE DHACLLSDEDDSDLNVAL+IWKSKLSK RKL
Sbjct: 121 EGCKKRKLRQSRSVDSSKEKLRTCSRRELDHACLLSDEDDSDLNVALNIWKSKLSKRRKL 180
Query: 181 KTKCDEGRISTSSHCGQTIGNSDPINSDQDLHPSGSDLPVPVDIKVETPEPDVTEIQSTN 240
KTKCDE RISTSSHCGQTIGNSDPINSDQDLHPSGSDLPVPVDIKVETPEPDV+EIQSTN
Sbjct: 181 KTKCDESRISTSSHCGQTIGNSDPINSDQDLHPSGSDLPVPVDIKVETPEPDVSEIQSTN 240
Query: 241 YKIDEWSLFCDENINSCLKHGPNGADESILYPKLTTSEKEAEYCVLNSACHEYLEDDEPK 300
YKIDEWSLFCDENINSCLKHGPNGADESI YPKLTTSEKEAEYCVLNSACHEYLEDDEPK
Sbjct: 241 YKIDEWSLFCDENINSCLKHGPNGADESIFYPKLTTSEKEAEYCVLNSACHEYLEDDEPK 300
Query: 301 TLQMVGESSNEWMYEDNLEVHKPHYSDFPASESLEGQCTPGYISNYSMSEAISSTKEQLS 360
TLQMVGESSNEWMYEDNLE HKPHYSDFPASESLEGQCTPGYISNYSMSEAISSTKEQLS
Sbjct: 301 TLQMVGESSNEWMYEDNLEEHKPHYSDFPASESLEGQCTPGYISNYSMSEAISSTKEQLS 360
Query: 361 GTYITNEVIFQNNSEDMSEAIAPTEEQCCDTYISQCIPFTHEVISLNNLNSLKVQETSPE 420
GTYITNEVIFQNNSEDMSEAIAPTEEQCCDTYISQCIPFTH+VI LNNLNSLKVQETSPE
Sbjct: 361 GTYITNEVIFQNNSEDMSEAIAPTEEQCCDTYISQCIPFTHDVICLNNLNSLKVQETSPE 420
Query: 421 GEVCLTEISYKDKLAFVHEKGIPTESNSNCNLRPDHGKSISTNSISDGNFSPDQHLISTG 480
EVCLTEISYKDKLAFVHEKG PTESNSNCNLRPDHGK ISTNSISDGN SPDQHLISTG
Sbjct: 421 AEVCLTEISYKDKLAFVHEKGTPTESNSNCNLRPDHGKRISTNSISDGNLSPDQHLISTG 480
Query: 481 ECPATERQPQMSNYYDSERNTPPDFHLDGSLDTFNQTEEPKRHPIRLLLKRTSISPTSQK 540
ECPATERQPQMSNYYDSERNTPPDFHLDGSLD F QTEEPKRHP RLLLKRTSISPTSQK
Sbjct: 481 ECPATERQPQMSNYYDSERNTPPDFHLDGSLDKFYQTEEPKRHPTRLLLKRTSISPTSQK 540
Query: 541 RLSKGMRSMQLHDKEYKTCSGKPYSNQIKYRDGSAEERDQMKRVYSDTYHKQKIRKSKKR 600
RLSKGMRSMQLHDKEYKTCSGKPY NQIKYRDGSAEE DQMK V+SDTYHKQKIRKSKKR
Sbjct: 541 RLSKGMRSMQLHDKEYKTCSGKPYFNQIKYRDGSAEECDQMKIVHSDTYHKQKIRKSKKR 600
Query: 601 SLHSASTTVVPQASIRSTAVQNCSDSAIAFTQRQMQDIECLALKLTNQLTSMKAIVDDRL 660
SLHSASTTVVPQAS+RSTAVQNCSDSAIAFTQRQMQDIECLALKLTNQLTSMKAIVDDRL
Sbjct: 601 SLHSASTTVVPQASMRSTAVQNCSDSAIAFTQRQMQDIECLALKLTNQLTSMKAIVDDRL 660
Query: 661 HVEGNQATSFKFNTDEVRTAIADATKAQAQARKWLSIMSRDCNRFCKIMKTTEHGSNVSS 720
HVEGNQATSFKFNTDEVRTA+ADATKA+AQARKWLSIMSRDC+RFCKIMKTTEHGSNVSS
Sbjct: 661 HVEGNQATSFKFNTDEVRTAVADATKAEAQARKWLSIMSRDCSRFCKIMKTTEHGSNVSS 720
Query: 721 LTAIQKVKRKITFADEAGGKLCEVRLIEDGI 751
LTAIQK+KRKITFADEAGGKLCEVRLIEDGI
Sbjct: 721 LTAIQKLKRKITFADEAGGKLCEVRLIEDGI 751
BLAST of Cp4.1LG11g01830 vs. ExPASy TrEMBL
Match:
A0A6J1KB44 (uncharacterized protein LOC111492272 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111492272 PE=4 SV=1)
HSP 1 Score: 1426 bits (3692), Expect = 0.0
Identity = 718/751 (95.61%), Postives = 732/751 (97.47%), Query Frame = 0
Query: 1 MELRSFSHLHYINVTKGGAMSKVLNVNSHGKPAVVFKKLTDIYESIDDKTQESLPRRWSR 60
MELRSFSHLHYINVTKGGAMSKVLNVNSHGKPAVVFKKLTDIYESIDDKTQESLPRRWSR
Sbjct: 1 MELRSFSHLHYINVTKGGAMSKVLNVNSHGKPAVVFKKLTDIYESIDDKTQESLPRRWSR 60
Query: 61 EGLEENIPDECEFKVETQVLYAERKLFNDEPEVSDSDNKSDTDGEQSDVEVDNMTLKQIM 120
EGLEENIPDECEFKVETQVLYAERKLFN+EPEVSDSD+K DTDG++SDVEVD+MTLKQI
Sbjct: 61 EGLEENIPDECEFKVETQVLYAERKLFNNEPEVSDSDSKGDTDGQKSDVEVDSMTLKQIT 120
Query: 121 EGCKKRKLRQSRSVDSSKEKLGTCSRREPDHACLLSDEDDSDLNVALSIWKSKLSKCRKL 180
EGCKKRKLRQSRSVDSSKEKL TCSRRE DHACLLSDEDDSDLNVAL+IWKSKLSK RKL
Sbjct: 121 EGCKKRKLRQSRSVDSSKEKLRTCSRRELDHACLLSDEDDSDLNVALNIWKSKLSKRRKL 180
Query: 181 KTKCDEGRISTSSHCGQTIGNSDPINSDQDLHPSGSDLPVPVDIKVETPEPDVTEIQSTN 240
KTKCDE RISTSSHCGQTIGNSDPINSDQDLHPSGSDLPVPVDIKVETPEPDV+EIQSTN
Sbjct: 181 KTKCDESRISTSSHCGQTIGNSDPINSDQDLHPSGSDLPVPVDIKVETPEPDVSEIQSTN 240
Query: 241 YKIDEWSLFCDENINSCLKHGPNGADESILYPKLTTSEKEAEYCVLNSACHEYLEDDEPK 300
YKIDEWSLFCDENINSCLKHGPNGADESI YPKLTTSEKEAEYCVLNSACHEYLEDDEPK
Sbjct: 241 YKIDEWSLFCDENINSCLKHGPNGADESIFYPKLTTSEKEAEYCVLNSACHEYLEDDEPK 300
Query: 301 TLQMVGESSNEWMYEDNLEVHKPHYSDFPASESLEGQCTPGYISNYSMSEAISSTKEQLS 360
TLQMVGESSNEWMYEDNLE HKPHYSDFPASESLEGQCTPGYISNYSMSEAISSTKEQLS
Sbjct: 301 TLQMVGESSNEWMYEDNLEEHKPHYSDFPASESLEGQCTPGYISNYSMSEAISSTKEQLS 360
Query: 361 GTYITNEVIFQNNSEDMSEAIAPTEEQCCDTYISQCIPFTHEVISLNNLNSLKVQETSPE 420
GTYITNEVIFQNNSEDMSEAIAPTEEQCCDTYISQCIPFTH+VI LNNLNSLKVQETSPE
Sbjct: 361 GTYITNEVIFQNNSEDMSEAIAPTEEQCCDTYISQCIPFTHDVICLNNLNSLKVQETSPE 420
Query: 421 GEVCLTEISYKDKLAFVHEKGIPTESNSNCNLRPDHGKSISTNSISDGNFSPDQHLISTG 480
EVCLTEISYKDKLAFVHEKG PTESNSNCNLRPDHGK ISTNSISDGN SPDQHLISTG
Sbjct: 421 AEVCLTEISYKDKLAFVHEKGTPTESNSNCNLRPDHGKRISTNSISDGNLSPDQHLISTG 480
Query: 481 ECPATERQPQMSNYYDSERNTPPDFHLDGSLDTFNQTEEPKRHPIRLLLKRTSISPTSQK 540
ECPATERQPQMSNYYDSERNTPPDFHLDGSLD F QTEEPKRHP RLLLKRTSISPTSQK
Sbjct: 481 ECPATERQPQMSNYYDSERNTPPDFHLDGSLDKFYQTEEPKRHPTRLLLKRTSISPTSQK 540
Query: 541 RLSKGMRSMQLHDKEYKTCSGKPYSNQIKYRDGSAEERDQMKRVYSDTYHKQKIRKSKKR 600
RLSKGMRSMQLHDKEYKTCSGKPY NQIKYRDGSAEE DQMK V+SDTYHKQKIRKSKKR
Sbjct: 541 RLSKGMRSMQLHDKEYKTCSGKPYFNQIKYRDGSAEECDQMKIVHSDTYHKQKIRKSKKR 600
Query: 601 SLHSASTTVVPQASIRSTAVQNCSDSAIAFTQRQMQDIECLALKLTNQLTSMKAIVDDRL 660
SLHSASTTVVPQAS+RSTAVQNCSDSAIAFTQRQMQDIECLALKLTNQLTSMKAIVDDRL
Sbjct: 601 SLHSASTTVVPQASMRSTAVQNCSDSAIAFTQRQMQDIECLALKLTNQLTSMKAIVDDRL 660
Query: 661 HVEGNQATSFKFNTDEVRTAIADATKAQAQARKWLSIMSRDCNRFCKIMKTTEHGSNVSS 720
HVEGNQATSFKFNTDEVRTA+ADATKA+AQARKWLSIMSRDC+RFCKIMKTTEHGSNVSS
Sbjct: 661 HVEGNQATSFKFNTDEVRTAVADATKAEAQARKWLSIMSRDCSRFCKIMKTTEHGSNVSS 720
Query: 721 LTAIQKVKRKITFADEAGGKLCEVRLIEDGI 751
LTAIQK+KRKITFADEAGGKLCEVRLIEDGI
Sbjct: 721 LTAIQKLKRKITFADEAGGKLCEVRLIEDGI 751
BLAST of Cp4.1LG11g01830 vs. ExPASy TrEMBL
Match:
A0A6J1CT21 (uncharacterized protein LOC111014058 OS=Momordica charantia OX=3673 GN=LOC111014058 PE=4 SV=1)
HSP 1 Score: 993 bits (2566), Expect = 0.0
Identity = 535/794 (67.38%), Postives = 611/794 (76.95%), Query Frame = 0
Query: 1 MELRSFSHLHYINVTKGGAMSKVLNVNSHGKPAVVFKKLTDIYESIDDKTQESLPRRWSR 60
MELRS++H HYI+ KGG M+KVLN+NS GKPAVVFKKLTD+YE ID+K Q SLP + R
Sbjct: 1 MELRSYNHFHYIHAIKGGLMTKVLNINSRGKPAVVFKKLTDVYEFIDEKDQNSLPTQLLR 60
Query: 61 EGLEENIPDECEFKVETQVLYAERKLFNDEPEVSDSDNKSDTDGEQSDVEVDNMTLKQIM 120
E LEENIP+ +FKVET+ YAERKLF DEP VSDS + DTDG++SDVEVD+MT++QIM
Sbjct: 61 ERLEENIPEGYKFKVETEDFYAERKLFKDEPTVSDSGSGGDTDGQKSDVEVDSMTIQQIM 120
Query: 121 EGCKKRKLRQSRSVDSSKEKLGTCSRREPDHACLLSDEDD-SDLNVALSIWKSKLSKCRK 180
EGCKKRK RQS+SVDSSKEKL TCS++E + +CLLSDEDD SDLNVALS+WKSKLS+ +K
Sbjct: 121 EGCKKRKSRQSKSVDSSKEKLRTCSKQELERSCLLSDEDDDSDLNVALSVWKSKLSRRKK 180
Query: 181 LKTKCDEGRISTSSHCGQTIGNSDPINSDQDLHPSGSDLPVPVDIKVETPEPDVTEIQST 240
LKTKC+ RISTSS C Q GNSDPINSDQDL PS +DLP+PVD+KVETPE DVTEIQ+T
Sbjct: 181 LKTKCNGSRISTSSQCSQITGNSDPINSDQDLLPSSADLPIPVDVKVETPETDVTEIQNT 240
Query: 241 NYKIDEWSLFCDENINSCLKHG----------------PNGADESILYPKLTTSEKEAEY 300
NY ID+ SL CDEN+NSCL P GADE L TTS KEAEY
Sbjct: 241 NYIIDDLSLLCDENVNSCLSSELSLLCDENVNLCLSSEPIGADELFLNRGSTTSNKEAEY 300
Query: 301 CVLNSACHEYLEDDEPKTLQMVGESSNEWMYEDNLEVHKPHYSDFPASESLEGQCTPGYI 360
CVLNSACHEYL D+P+ LQMVGES+ EWM +DNLE+ KP+YSDFPASES+EG+ P +
Sbjct: 301 CVLNSACHEYLVGDDPEFLQMVGESNTEWMKKDNLEIQKPNYSDFPASESMEGRYAPRCL 360
Query: 361 SNYSMSEAISSTKEQLSGTYI------TNEVIFQNNSEDMSEAIAPTEEQCCDTYISQCI 420
SN SMSE IS T+EQ SGTYI T+E I QNN EDMSE I+PTEEQC DTYIS+
Sbjct: 361 SNDSMSEEISLTEEQCSGTYISQGKSITHEAICQNNCEDMSEEISPTEEQCTDTYISE-- 420
Query: 421 PFTHEVISLNNLNSLKVQETSPEGEVCLTEISYKDKLAFVHE-KGIPTESNSNCNLRPDH 480
E S EVCLTE YKD L HE KGI TE+ S+C+LR DH
Sbjct: 421 ------------------EMSFGAEVCLTENGYKDTLELDHERKGISTEATSDCDLRADH 480
Query: 481 G------------------KSISTNSISDGNFSPDQHLISTGECPATERQPQMSNYYDSE 540
G KSIST+S SDGN SPDQHLIS G+CPA E +PQ+SN+ DSE
Sbjct: 481 GESISTKSTTDCNLSPDHEKSISTSSTSDGNLSPDQHLISIGKCPAQEIEPQISNFSDSE 540
Query: 541 RNTPPDFHLDGSLDTFNQTEEPKRHPIRLLLKRTSISPTSQKRLSKGMRSMQLHDKEYKT 600
RNT PDFHLD S+D FNQ EEPKRHP RLL RT+ISPTSQ+RLSK M+SM+LHDKE KT
Sbjct: 541 RNTSPDFHLDDSMDKFNQFEEPKRHPTRLLSTRTTISPTSQERLSKAMKSMRLHDKECKT 600
Query: 601 CSGKPYSNQIKYRDGSAEERDQMKRVYSDTYHKQKIRKSKKRSLHSASTTVVPQASIRST 660
C GKPY Q Y+ G+AEE DQMKRVYSD +H+Q IRKSKKRSLHS S T VP RST
Sbjct: 601 CGGKPYFKQANYKVGTAEECDQMKRVYSDIFHEQNIRKSKKRSLHSTSNTKVPHGRTRST 660
Query: 661 AVQNCSDSAIAFTQRQMQDIECLALKLTNQLTSMKAIVDDRLHVEGNQATSFKFNTDEVR 720
AVQ+CSD+AIAFTQRQMQDIE +ALKLTNQL SMKAIV+DRLHVEGN+AT FKFNTDEVR
Sbjct: 661 AVQSCSDNAIAFTQRQMQDIESIALKLTNQLKSMKAIVEDRLHVEGNKATGFKFNTDEVR 720
Query: 721 TAIADATKAQAQARKWLSIMSRDCNRFCKIMKTTEHGSNVSSLTAIQKVKRKITFADEAG 752
TAI+DATKA+A A+KWLS+MSRDCNRFCKIMKTTE+GS S +AIQK+KRKITFADEAG
Sbjct: 721 TAISDATKAEASAKKWLSMMSRDCNRFCKIMKTTENGSTASP-SAIQKIKRKITFADEAG 773
BLAST of Cp4.1LG11g01830 vs. ExPASy TrEMBL
Match:
A0A5A7VDU2 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold82G005880 PE=4 SV=1)
HSP 1 Score: 989 bits (2557), Expect = 0.0
Identity = 531/769 (69.05%), Postives = 609/769 (79.19%), Query Frame = 0
Query: 1 MELRSFSHLHYINVTKGGAMSKVLNVNSHGKPAVVFKKLTDIYESIDDKTQESLPRRWSR 60
MELRSF HLHYI+ KGG ++K+LN+N GKPAVVFKKLTDIY SIDD+ +E LP R SR
Sbjct: 1 MELRSFCHLHYIHAIKGGVVNKILNIN-RGKPAVVFKKLTDIYGSIDDEAREPLPTRGSR 60
Query: 61 EGLEENIPDECEFKVETQVLYAERKLFNDEPEVSDSDNKSDTDGEQSDVEVDNMTLKQIM 120
EGLEEN CEFKV++QVLYAER L NDEPE+SDSD+K D+D ++SD EVD+MTLKQIM
Sbjct: 61 EGLEENTLHGCEFKVDSQVLYAERTLCNDEPEISDSDSKGDSDVQKSDFEVDSMTLKQIM 120
Query: 121 EGCKKRKLRQSRSVDSSKEKLGTCSRREPDHACLLSDEDDSDLNVALSIWKSKLSKCRKL 180
EGCKKRKL QSR VDSSKEK TC ++E DH+ +L++EDDSDLN+ALSIWKSKLSK RKL
Sbjct: 121 EGCKKRKLSQSRFVDSSKEKTRTCFKQELDHSFMLTEEDDSDLNIALSIWKSKLSKRRKL 180
Query: 181 KTKCDEGRISTSSHCGQTIGNSDPINSDQDLHPSGSDLPVPVDIKVETPEPDVTEIQSTN 240
KTKC+E RISTSS C QTIG+SDP +SD+DL PSGS+LP+PVD+KVETPE DVTEIQ+TN
Sbjct: 181 KTKCEESRISTSSQCDQTIGSSDPTDSDEDLLPSGSNLPLPVDVKVETPEIDVTEIQNTN 240
Query: 241 YKIDEWSLFCDENINSCLKHGPNGADESILYPKLTTSEKEAEYCVLNSACHEYLEDDEPK 300
+ I+E SLFCDENIN CL +GP G ++ L LT SEKEAEYCVLNSAC+EY E EP
Sbjct: 241 HTIEECSLFCDENINFCLSYGPVGPNDLNLDIGLTASEKEAEYCVLNSACYEYFEGYEPG 300
Query: 301 TLQMVGESSNEWMYEDNLEVHKPHYSDFPASESLEGQCTPGYISNYSMSEAISSTKEQLS 360
T QMVGESS +WM ED LEVHK H+SDF AS+S++GQ TP YISN S+SEAI TKEQ S
Sbjct: 301 TFQMVGESSTKWMNEDKLEVHKTHHSDFSASDSMKGQYTPSYISNSSISEAIPLTKEQCS 360
Query: 361 GTYI------TNEVIFQNNSEDMSEAIAPTEEQCCDTYISQCIPFTHEVISLNN------ 420
G+YI TN + QN+S+ MSE IA TEEQCCDTYIS+ P THE LNN
Sbjct: 361 GSYISPDNSITNVAMCQNSSKGMSEVIALTEEQCCDTYISEGKPLTHEATCLNNGEGSTH 420
Query: 421 ------LNSLKVQETSPEGEVCLTEISYKDKLAFVHEKGIPTESNSNCNLRPDHGKSIST 480
LNSL+ E S EVCLTE SYKD+LA E+ IP +S+ + NL PDHGK IST
Sbjct: 421 MHALTNLNSLEAPEMSHGAEVCLTENSYKDELAVDDERSIPKKSSCDSNLSPDHGKYIST 480
Query: 481 NSISDGNFSPDQHLISTGECPATERQPQMSNYYDSERNTPPDFHLDGSLDTFNQTEEPKR 540
+SISD N DQ LIS ECPA ERQPQMS+ DSERNT P+ HLDGS+D FNQ EEPKR
Sbjct: 481 SSISDRNSGSDQRLISDDECPAKERQPQMSDCSDSERNTSPNSHLDGSVDKFNQFEEPKR 540
Query: 541 HPIRLLLKRTSISPTSQKRLSKGMRSMQLHDKEYKTCSGKPYSNQIKYRDGSAEERDQMK 600
HP RLL RT+ISPTSQ+RLSK M+SM+LHDKE KT KP+ Q KY G+AEE DQ K
Sbjct: 541 HPTRLLSTRTTISPTSQERLSKAMKSMRLHDKECKTYGVKPHFKQTKYAVGAAEECDQTK 600
Query: 601 RVYSDTYHKQKIRKSKKRSLHSASTTVVPQASIRSTAVQNCSDSAIAFTQRQMQDIECLA 660
+V+SD Y ++ IRKSKKRS HS+STT VPQA+ VQNCS+SAI FTQRQMQDIECLA
Sbjct: 601 QVHSDIYQEKNIRKSKKRSFHSSSTTKVPQAT-----VQNCSESAITFTQRQMQDIECLA 660
Query: 661 LKLTNQLTSMKAIVDDRLHVEGNQATSFKFNTDEVRTAIADATKAQAQARKWLSIMSRDC 720
LKLT+QL SMKAIV+DRLHVEGN++TSFKFN DEVRTAIADATKA+ A+KWLSIMSRDC
Sbjct: 661 LKLTSQLKSMKAIVEDRLHVEGNKSTSFKFNADEVRTAIADATKAETSAKKWLSIMSRDC 720
Query: 721 NRFCKIMKTTEHGSNVSSLTAIQKVKRKITFADEAGGKLCEVRLIEDGI 751
NRFCKIM TTEH SN S AIQK KRK+TFADEAGGKLCEVRLIED +
Sbjct: 721 NRFCKIMNTTEHYSNASP-AAIQKAKRKVTFADEAGGKLCEVRLIEDDV 762
BLAST of Cp4.1LG11g01830 vs. TAIR 10
Match:
AT3G56870.1 (unknown protein; Has 204 Blast hits to 201 proteins in 58 species: Archae - 0; Bacteria - 10; Metazoa - 72; Fungi - 8; Plants - 41; Viruses - 0; Other Eukaryotes - 73 (source: NCBI BLink). )
HSP 1 Score: 139.4 bits (350), Expect = 1.2e-32
Identity = 182/685 (26.57%), Postives = 286/685 (41.75%), Query Frame = 0
Query: 93 VSDSDNKSD-TDGEQSDVEVDNMTLKQIMEGCKKRKLRQSRSVDSSKEKLGTCSRREPDH 152
V D DN SD T G +S+ + TL+ I + CK+RK + D++ E ++
Sbjct: 63 VRDLDNGSDFTQGSESE-DFSMTTLEMIQKQCKERKRKLRNCRDTTTETFSNVEVKKE-- 122
Query: 153 ACLLSDEDDSDLNVALSIWKSKLSKCRKLKTKCDEGRISTSSHCGQTIGNSDPINSDQDL 212
++ ++ D+ LS W +K SK RK K + + + TS+ S P
Sbjct: 123 --YVTQDEGCDIEEPLSSWDTKFSKRRKKKQERKKAKCGTST--------SSP------- 182
Query: 213 HPSGSDLPVPVDIKVETPEPDVTEIQSTNYKIDEWSLFCDEN----INSCLKHG--PNGA 272
PS + +P+ + P E+ +Y + E ++ C E IN+ L + +
Sbjct: 183 -PSVEKVDLPLVLFHVKP-----EVWDDSYSVSE-AMDCSEKSESPINTVLVEEIMLDSS 242
Query: 273 DESILYPKLTTSEKEAEYCVLNSACHEYLE-----DDEPKTLQMVGESSNEWMYEDNLEV 332
+ L P + + A + E D + K + + S E E L+V
Sbjct: 243 SDMRLVPYCSAEPNFPGVVAIEEAFEDASEEFSDADFQNKQIVLYSSVSRE---EMELDV 302
Query: 333 HKPHYSDFPASESLEGQCTPGYISNYSMSEAISSTKEQLSGTYITNEVIFQNNSEDMSEA 392
+ H SE C +IS Y+ S KE I + ++ +
Sbjct: 303 NPQH------SEYENIGCVKNFISAYTSSGCEEEDKEDEESNDIKANLDMSVTGLEIVKI 362
Query: 393 IAPTEEQCCDTYISQCIPFTHEVISLNNLNSLKVQETSPEGEVCLTEISYKDKLAFVHEK 452
AP E+++++ L + E + E K F
Sbjct: 363 EAP------------------EILAIDYPGCLSIINFCAEDSEIVWETEDITKDNFPEAT 422
Query: 453 GI---PTESNSNCNLRPDHGKSISTNSISDGNFSPDQHLISTGECPATERQPQMSNYYDS 512
I NS NL+P S S+ Q L S E A + ++S Y
Sbjct: 423 DILQLTNCCNSLDNLQPVPEDSTSSKEEDHLTERLQQSLYSKHEDEAGDH--KLSQLY-- 482
Query: 513 ERNTPPDFHLDGSLDTFNQTEEPKRHPIRLLLKRTSISPTSQKRLSKGMRSMQLHDKEYK 572
P + D+ Q ++P P LL R ++SPTSQ++L K M +K K
Sbjct: 483 --KEPDEVQKVAETDSIQQ-QQPHHQPENLLSGRKALSPTSQEKLRKAMEHPDSPEKRSK 542
Query: 573 TCSGKPY-SNQIKYRDGSAEERDQMKRVYSDTYHKQKIRK---SKKRSLHSASTTVVPQA 632
GK Y S+Q +R A+ D + RV KQ I+K + ++ + +T P+
Sbjct: 543 KSRGKLYFSSQNSHRILKAQGLDNIDRVEIIPSSKQAIQKATNNTRQMKYQRATHKFPRR 602
Query: 633 SIRS----------TAVQNCSDSAIAFTQRQMQDIECLALKLTNQLTSMKAIVDDRLHVE 692
+ ++ T++Q CS AIAF+Q QM+D + +A +LT +L SM+ I L E
Sbjct: 603 NTQAAKAQPFSTGGTSIQGCSQKAIAFSQGQMRDFQNVAARLTKELKSMRQITKRCLQAE 662
Query: 693 GNQATSFKFNTDEVRTAIADATKAQAQARKWLSIMSRDCNRFCKIMKTTEHGSNVSSLTA 749
N + N DEV+T I +A K + +KWLSI+ RDCNRFCK+M S +
Sbjct: 663 SNTSNMSDCNLDEVKTVIGNAEKTEESCKKWLSIIERDCNRFCKLMSMVREDSPATE--N 684
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
XP_023545220.1 | 0.0 | 100.00 | uncharacterized protein LOC111804698 [Cucurbita pepo subsp. pepo] >XP_023545221.... | [more] |
XP_022962613.1 | 0.0 | 97.47 | uncharacterized protein LOC111463010 [Cucurbita moschata] >XP_022962614.1 unchar... | [more] |
KAG6598431.1 | 0.0 | 96.14 | hypothetical protein SDJN03_08209, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
KAG7029375.1 | 0.0 | 96.14 | hypothetical protein SDJN02_07713, partial [Cucurbita argyrosperma subsp. argyro... | [more] |
XP_022997327.1 | 0.0 | 95.61 | uncharacterized protein LOC111492272 isoform X1 [Cucurbita maxima] >XP_022997328... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1HHL2 | 0.0 | 97.47 | uncharacterized protein LOC111463010 OS=Cucurbita moschata OX=3662 GN=LOC1114630... | [more] |
A0A6J1K4P1 | 0.0 | 95.61 | uncharacterized protein LOC111492272 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1KB44 | 0.0 | 95.61 | uncharacterized protein LOC111492272 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1CT21 | 0.0 | 67.38 | uncharacterized protein LOC111014058 OS=Momordica charantia OX=3673 GN=LOC111014... | [more] |
A0A5A7VDU2 | 0.0 | 69.05 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold... | [more] |
Match Name | E-value | Identity | Description | |
AT3G56870.1 | 1.2e-32 | 26.57 | unknown protein; Has 204 Blast hits to 201 proteins in 58 species: Archae - 0; B... | [more] |