Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, CDS, polypeptide, three_prime_UTR
ATATAATCGTAGCACCCAAATCCATCGTCATCAACCAGAAACCCCTACATCCGTTTTCTCTTCTTGAATCTTGATGTCCCTTCAATTCCGTTTATATTATGCTATTTCATTCCAGTCCAATCGAATCCAATCCTATTCGATATAATCAATCCTCTCCTACTCTTCACCAGATTTGCTTCAGGAGGACGGCGTTATTTCGATCCGCCGACGATCTGTTAGTATTTGTTTTTTTCTTCTAATTACCTGTTGCGATTGTAATTCTTAGCTGCTATTCCATCTTGGCTTGCTTTTTGCCTCCCGATCAGATTGGGTTGATTTCGGGTTGTGAGACAATGGAAGAATTGAGAAATGTAACTATGAGAGTTTGAAAGATGCCTGGGTTAACGCAAAAAAATGACCATTTAAATGGTGGGTCATCGGCTGTATACTCGCTCTCCGCCAATGGCTTTTGGTCCCAGCATCGCGACGATGTTAGCTACGTTCAGCTCCAGAAGGTATTCATTTCTTTCGTTGCCAGACGATTTAATTCCAGTTTTCGTTTCATCATTTTTTTTATAGGAATAATTGGCACTAGATTAGATCACTTTCCTGCTCTGTATTTTTTCCCGAATTATACTTCATAGAACTCTGGATTTACTACTGCTGCCGATGTAATCTAGGGAAGGAGTGTCTAGCATTTTAGATGTGCGAGTTCAACACTTTAGACCTTTCATCTTTTACTCAAAATACAGTTACCGAATTTTCGTATTTTTCCTCTACATGTTGCTCCCTGTGACCAGCAAATCTGCTTTGTAGGGAACTAAAGTATCACATTGTTTAATTATGACATTTGAGTCGTTAGTTGGAATAAGTATGATCTAATTCCCGCATTTCTTAGAACTTATGTCATGAACTGACCATTGACTGATAAAGATGTGTTTACTCTCAATGTAGTTTTGGAGTGAACTGCTGCCCCAAGCTAGGCAGAAACTCCTGAGAATTGACAAGCAAACTCTCTTTGAGCAAGCTCGTAAGAATATGTACTGCTCTCGATGTAATGGTTTGCTGCTTGAAGGATTTTTGCAGATTGTCATGTATGGGAAGTCTTTACAACAAGGAAACACATGTGTGAATCACACTTGCAACAGATTAGGGGGTTCAAAAAGTCAAACTTGCGATGGGTCATTGGCAGTTAATGGGTTTCATGATGAAATTCAAGACCCATCTGTCCATCCTTGGGGTGGTTTGACCACAACGCGCGAGGGGTTGCTGACACTTTTGGGCTGCTATTTGTATTCAAAGTCTTTCCTGGGTCTCCAAAATGTAAGTGCTTATTTTTAGTTTTCTTAGCTGTTAAGCACAACATATTAAGTCATATTTAATGGTTAAGGTTTTCTTTGGAACAGGTATTTGACAGTGCACGAGCTAGGGAGCGAGAGCGTGAATTGCTTTATCCTGATGCTTGTGGTGGGGGAGGTCGAGGCTGGATAAGTCAAGGAACAGCGGGCTATGGCAGGGGACATGGTACAAGGGAAACATGCGCCCTGCACACTGCTAGGCTTTCTTGTGATACATTGGTGGATTTCTGGTCAGCATTAGGAGAAGAAACTCGACAATCTCTTCTAAGGATGAAAGAAGAAGATTTTATTGAGAGACTAATGTACAGGTCTCTAGCAAATGCTCATCCCTTGTTTCATGCATTGCATTTAAATAAATGGAAGGAAAATATGATAGTTGAGGCTTGATGGAGATTTGAGAAGATAATGTGACCTTTAGAGATATGAAGTATCATGAACACATTTTGGAGTAAACTAACATAAACAAATAAAAGGTTGATTTCTTGGTAAGGATGAATCTGATTTTTGTTTGCATCTTTTTTAGTAATCTTTTATCTGCATTTTGAAGATTCAAGCTAGTCTAGAATTCTGCTGTCTAGATTTGTTGAATAGATATAATTTTCAGTTAGAGAACCTATAGCTCAAGTAAATTATATGGTTTACAAGCCAAGCAACATCCAGGTT
GTGTATTTTGCTCTCATTGGTTAAAAACTCCTTGCTCCCTAAGATATTAAAAATCTTAAGTTTTCTAACTATCAAGAATTGTAGATTTTATGCCAACCTAAACATAGTTCAACTGGTTAAGACATTAATACATTTTTCCTAAAGGTCACATGCTTGAATCTCCACCCCTTGATGATCTCAAAAGAAAATTGTCGAGTTAGATGCCAAGAGGGTAGATTTGGGAGATTCTTGGACATAAAAGAATAACATAGCTTCTAATATTCTCCTCGATGGACAGAAGAAAGGTCTTTATTTGAGCAAGGATATTGCTTGAGAGAATTAAAAGGTACGAATCCACAATAAGTTTGGATTTGCACTAAAAGCAATTGGAGTTTTTTTTGCAAGTATTGAACGTCATAATTTGGAGGTTCGCAACCTAGAAAGAATTGTACAAATTTGAGTACTTCTAACTTATAGCTGACTGTTTGTATTTTGACTTACCCATGGTTATGGAACCAAATCCATTAATATAGAACATTTACATTTCACTGTGAAAAAGTGCTTTATTGTAGTGGAATAAGATGCTTTTGTACATTGATGTATATATTTAATTTATTTGGAGCTATTTGAGCGGGATCCACTTTTTGGAGAATTTGGATCAATCAATAACTTATTACTTATTGACCCTTCTTTGAATACTAAGGCCAATCTAATTTGGGTCAATGTGGCGAAAGCTATTTTTTTGGGAATCATGGATGGAAAGAATCAAAGAATTTTTCGTGGCAAGAATCTTTCTTGGTTCGAAAGATTGATTACGCTCATCTCGAAACCTGTTGGAATTCTTTATCAAAGATTTTTGTTGGTTTTTCACTTAGGATATTTGCCTCGATTGGAATGCCTTTGTATATCCATTTTAGGTTATTTTGCTTACTTGTTTGTTTCCTTTTATTCCCTTTCTTTTATTGTTTTCTTATTTTCACTCCTTAGCCTAGTGGAGTTTGTATCTTTGAGCATTAGTCTCTTTTCATTCCCTTAATGATTAGTTTTGTTTCTTGTTAAGAAAAACAATAATAAAAATATTTCTTGTGGAACATCTTTCACAATCCCTAGAGAGATAGTTCATTAGTAATAATCGTTGTACGGTTCACTGAGCACCATGTCTTAAACTAGTTATTTCTTCAAGTGAGGAGGATAAGAATCTCGTAGGGCTTTTGTGTAGGAAGGGGAGAACATTCCTAACTCATTTGCAGTTTGTTGATAATACATTTTTTTTAGGTGAGGATGATTGAAGTGTGATTGACAATCTCTTTAAAAGTGGTTAAGACTTTTGAGATGGCATTAGGATTGAAGGTGAATCTAGAATAAACAACCATGGGTTATGTAATTAATAAACTACTTGTTTGTGTTTAGAATTCCTATTTTTGATCATAAAAGCTAAGTGAAGTTGTTGAATGATTTCTTGTTGGAGGGGGATGAGGAGGGTGGAAGTCTCGATTTGATTGGTTGGGAGGTAATGCCAAGGCCGGTTGACCTTGGAGTGCTGGGCATAGGCAATATGGGATTTTGTAATGAGAGGCTTTTTGACTAAATGGTTGTGATGTTTATCTTTGGAGTTGAACGCTTTATGTTATAAGGTCATAGCTAGTTAGTGTAGTCCTCATCTTTTTGAATGGAACTCAGTTGGGGATTTTAAAGACGTTCAAAAACCCCTAGAAAGTTCTCTCTTTAGATGGTCCCTTCTGTAAATTTGTTAAAATGCTTTGTGGGGAACAACTTAAGAACTTATTTCTAGGAAAGCTCAATGGTTTGATAGTCCCATAAGTATTTTTCATCTTGTTTGTACTACCTCTACCTCTCCTCTTTGAGGATTTGTCTTGTGGCATCAATCATTTCGTACTTCGAAAATTCTTATGCGTTTAATTAAAGTTTTTGCTGCTGTCATTAGATAGGGAGGTGTGGGATGTGTTTGCATTCCTCTCTCTGTTGGGAAACTTAGTCTTACCTTGGGTAGTAGTGATATT
GGTGTTTGGCTTCCCAATCCCTTTAAAGAGTTTCTTGTCGTTCATTTCTTACCCTTCTAGAAACCACTATGCACAATTTTTTTCCTTTTTATTTTCCTCGGTGTGGAAGCTAATCATTTCCAAAAAGGTTAAACTTTTTGCATCGCAGGTCTTACATGGGAGAATCAACACTATGGATCGTATTCAGTGGTTCTTCTTTTTTCCTTCATTAGGGTCACAGTGGTAGACTCTATGTAGCAAGGCATATGAAGATTTATGCCATATCCTTTAAAGATGCATGTTTCTCTCATGGTTTAGGACTCCTTTTGGAGACATTTGGTATGTGCATGGCTTGAAACAAGGATTGTTGTCTTGTCTTATGATGGAGGACGTTTTGTTCCACCTAACCTTTTGAGTTAAGGGCCATACTCTTTGGCACGCTGGTTTTTTTTTTTTTTTTTTTTTTTTTCNCTTTGCTAATTTATGGACCATTTTGGTTAGTGAGGAGTGAGAAATTTTTTGAGAGTTAGAGAGGTTGTGGGAAGAGGTGTGAGCTTTTGCCAAGTTTAATGTTTCTCTTTGTGCACTGGTGTTTCTATAAGCTAAAGATTTTTGGAACTACCTTCTTGGTCTTATTGCGTTTGATCAGGTTGGCTGGGTCTTCCTTGATGTGATCCCTAGAGAGAAAGGATCTGGTGCACAGTGGAGAAAATGGATTTGAGGTTGTATATCAAGTGCTAATTACTCAATCATCATCAAGGGGAGACCCAATGGTAAAATAATTCCCACGAGGTTCAAGGCATGGAGATCCGCTCTCACCTTTCCTCTTTCATCATTGTGGCTGGTTTTCTAAGTAGGCTTTTATGCCATGGGGCAGCAATGGGTCTGATCTAGGTGAATGGAATAGGTCAATTCTATCTTCTCATGAATCACCTCCAATTTCTACACACTACTTTTCTTCTCTATGGATCCCGTTGCAATTGTTTGATATTGTCACTATCCCTGAACGGGCAACTTGTCTAAACATTGATTATAGAAAAAACTGAAGTTTTGTGGGTCAATGTTGAAGATGCGGTTTTGGATGAATTAACCATCACTTTTGGTTGCAAAAAGGGGTTGTGGCCTGCTTCATACTTGGATCTTCCTTTGGGAGTAATCCTAGAATTTCTCTTCTGGGAGCCCACATTAGAGAGAATTCGACAGAAGCTAAACAATTGGCTGCATTCATATATATCTAGAGGATGGATACTCTCATTGAAGCAACTCTATCTAATATGATGCCTATTTATTATCTATCTTATTTGATGCCATAAAAATTGGTTCAAATCATGGAAAATATTTTAAGAGATTTTCTTTGGGAAGGATCTCAACTTAATGGAGGAATGCACAACATCAATTTGAGGAAAACTGTAAAATCCTTCGCATTAGGAAGTTTGGGAGTTGATAATATATCACAAAGGAATTCAGCTCTTCTTGCGAAATGGATAGGCATCATATTAAATTTTAGTCTTGTAATTCTTGAAATCCACCTTTTAGAAATCCACCAAACAGACAAAAACAATTTAACTTGATCCCCAAAACCATCAAGAATAGTAGAGTCACTTGCCTACAAAGTCCTTTTGTTTCTTTCATACCACAATCTTCACAAAAGTGCTACCACCTAATTCTACTACAAAATACTAGAATTTTCCCAAAAAAGGGCTTATGAAACACCCAGCTGATGTTAAGAGTTCTGAAAAGGTTGGATGGACCTGTGAGAGGAAGCAAACGGGGCAAAGAAATGTGGAAGAATTTCAATCCTTGAAAGGGAGGATGCCTGACTTTGGATGATGCTGGTGCTTTCTAAACTCCCAACGTTATAGATCTACATTTTTTATCCAAAGAAAGTAGTAACCTGAAATAATTTTGATGGACTTCATTTGGATGGAGGAAAAGAAATTACGAGACTTTGCCTGATAAATTGCACTGGAGATTTCCTTTGGAGAAAGGGGAACTGTAGAAGGATATAGTTTGTAGCATCT
ATGTAATGAACGGTTGTGGTTTCAATCTCTTTTGGAGGTTTTTTTTTTCTTCTTCTAGTATTAAAAGTCGCTTTGGTAAAGAATTCAAAATAGATGGTCTACTATAATAATTCTGCCAAGTTGTTCATTTATATAAATGTGAAATGGAGGATATAACAAATTAATATTATTTTACTAATCATTTTAGTCTTTCTGCCTGCTTGGTGCAACTCATGGAACCATAAGCCTTTTCTTTACAGGACATAGCCTCCTCTAATCCAATGATGCATAAGAATCGAACCTAAAAATATGAGCCTAATTTTGAAGTGTGCCTCAAGGTGAAACCTCAAAAGTCTTGTAAAAATCGTTCTACTTACTACGTTTTTCACTTAGTTGAACAAATTTAAATATTATTTCAGATTATCATAGTAATTGAGGAATATTGGAAACATGCCTGATTTATTCTAATAAGGCAAGCAATACTTTGTGTATCTATCCTTATGTTAAGTTCTCTTGTCTGCATTTTTCTTGCCTTGTAATTATTAGTTTTTTGCTTCTTACATTATATTGATTCTTTCAGGTTTGACAGCAAGAGGTTTTGTAGAGATTGCAGAAGAAATGTGATCCGTGAGTTCAAGGAGCTGAAGGAACTGAAGCGCATGAGGAGAGAGCCTTGCTGCACTAGTTGGTTTTGTGTTGCAGATATGGCTTTTCATTACGAGGTGCATCCTTGCACTTATCTTCAACTTTAAGCGCTCTTCTACCATCATGGATTATAATTTTAACTTTTGGATGAATATGAATCGAGACATGATGACTATGTAGACCCTGTGAGGTTTTTCAAGAACGCGTGCTTGTTGCAATTGGCATGAACGAAATTATATTTGCTATTGTCTCAGGTCTCAGATGATACAATCCAGGCCGATTGGCATCAAACCTTTGCTGACTCCGTGGAGACATATCATTATTTTGAGTGGGCTGTTGGATCAGGAGAAGGAAAATCTGACATTCTGGAATTTGAAAATGTTGGCATGAACGGAAGTGTCAAAATGAATGGCCTAGATCTTGGTGGTTTGAATTCATGCTTTATCACCCTCAGAGCTTGGAAATTAGATGGACGCTGCACAGAGCTATCAGTGAAAGCTCATGCATTAAAAGGTCAACAATGTGTTCATCGAAGACTTATAGTTGGTGATGGATTTGTTACAATCACTAGAGGGGAAAATATTAGGAGGTTTTTTGAGCATGCTGAAGAGGCTGAAGAGGAGGAGGTTGTTCTCTTTCTTGTAATTGTCTCTCTAGGATTTTATGCCTGCTTAGAATTTTGCCCGATATTCGTATGCAATATTATATTTCACGAACTTAAAATTCTTTGTGAGGTGACAAATATTTCCTTTTCTGTTGTAAAAAAAGAAAAAGTTTCCTCTTCTAAATCTTGGCAACTCTAATTGGTTCTCTGAATTTTTTGGCTTTTAGGAGGATGATTCGATGGATAAGGACGCAAATGATTTGGATGGAGATTGCTCTCGTCCTCAAAAGCATGCGAAGAGTCCAGAACTTGCTCGGGAGTTTCTTTTGGATGCTGCAACTGTTATCTTTAAAGAACAGGTATGTTGTTTGCAACCTTTTCGGCCTGTGCTTCTAGCTAATTTGCATTTGAGATATTGTTCTTTCTATAGTACAGATCTCATGTTCTTGTTGTAAGAAATTTCATTAAGCACCAGGGATATAATTACCAGATAGGCACATTGGTACAAGGAAAAACCAGCTAAAAGTTAAATGTCTGCATCTATAGAATCAGATGAAACGTTTTCCATTATTTTTAAATGAGATTCTTTATACGTGTGTGTGTGTGTGTGTGATTGTATTTTTTCTTTTGTTTTTGTTTTTGGTTTTCCAAATCTTTTCGATTTTATTTAATGAGATGTAGAAAAGATGTCTAATTTGCAGTTAATTTCTTCCAAGACTTTCAGAAAATAGTGAAGATTATGTTAGTCAGTTCAAATAGTTTGTTTTACCCG
AGATAGAGTGGATTCAAGAAATGAGTCAGTGCAAGACACGGAGAATTGGAGAGGATTAGGGAGAAGAAAGATAGGAGAAATTCTCAAAACATTTAAACTAACTTCAACGTTGAACTTTGCTGTGGTATACTAGTAAGTCAATTAGTTTGTTGGGATTCTAAAAAATCCAACATCTTTGGATATTTAGCATCCAGAATGAAATATAAATGTTCAGCATCATTATGTAGCTGTTTTTGAATTACACTTAACATTTTTTATGTTTAGAGATATTTGGTTGTTGTATTCATTTGTAATACATGGAAATGAGCATTCAACACTCTAGAGTTTAAATTTTGTATTTAACTACATTAAAGTTTAAAGTATACTGATCTATTTATTTAGAAATTTAATTATATATCTATGTAATTTGGGATATTATACAGTTTAATAAAAAAAAATATTTTGTTATGAAGTAATATATAAATATCTATTACTTGAAGGCCATCAAGCAATCAATTGCGGGTGTAGAACAATGGATAAGTGCAGTGGAGAGTTATTCTGTTGTTTCCTAAATGATTGATACATCTCTTATTTGAAAGAAATTTCTCTTGATCCCTCGGGTTCTTTTTCGACCGAATTTCTCTTGATCCCTTGGGCTGTTTTTCAACCAAATCCCTCTTCTTTAGATTGAGAACCGAAGACCGGCCTTAATGTGGCTCTTTGTCGAAGTTAATTTGGGATTTCTCCCCAAGAAGGTGAAGATCTTCCTTCTGTCGTTGGCACATAGAAGTCTGAGTATGCACGAAAGTCTCTAAAGGAGGAGTTTTTCTCTTGTTCTTGGGTTTTTGGTTTGTACCCTGTCCTGGTGCAGTGAGGAATTTGCAGACCGTCTTTTTCTGCTTTGCCCTTTTGCTAGACGTGGTTGACTTAAGCTGCTGGATTTTTTTTCTTCGTACCTTCTCTTCCCAATAAGGTGGATGGTTGGTTGTGGGAATCATTGGGTGGTTGGAAGCTGAAAGACAAAGCTGGGATCCTTTCGGGGTTTGCTTCTAGAGCTCTTTTGTGGAGTTTATGGCTGGAGAGTAATAGAAGGCCTTTTGAAGATAAGTCTTCGTCTTTTTGAGTTTTTTTGGGATTGTGTACAGTTAAATACCTCTTGGTGGTGTCATAGCTATAAAAAATTCTTTTATCATCTACCTCTATCCATGATTATTTCGGACTGGCTAGGTAATTGTAAGTAGTTCCTTGGGTGGGGGCTCCCTCATCCCCCAAGCCTTTAAGTTGTTGTCTCTGCCCTTTCGGTTGTGCGTTACTCGATGTTCTTATAAGAAAAAAGAAAAAAAAAAAAAAGGACCTTTTGAATTTTTGCTTGTTGGATTTGCCCATCTGCTTATTAAAATCTGATTGAAAGCTCAAAGTTCTTTTGTTTTGTCTTCTCCTCACAGTCCAAGGAACTTCCCCAATGATAGTTTATTTAGATTGCATTGCAAAATTTGATACCAGAACTGATCCAAGTTTTCCCTAGGGAATTGTGGATTGTTTTTATTCTTATACTTTGTGGTCTTTGTCAGGTTGAAAAAGCATTCAGAGAAGGAACAGCTCGCCAAAATGCTCATAGCATTTTTGTTTGTCTTGCACTAAAATTATTGGAAGAACGAGTTCACATAGCATGCAAAGAAATCATTACTCTAGAAAAGCAGGTTTCAAGCGAGGCTATGTCATTACTCACAAATCCATTTACAAGAATCGAGATTTCTCTTTACTTCTAACCTATGTGAATTTTGCTTCAGATGAAACTTCTTGAAGAAGAAGAGAAGGAAAAGCGTGAAGAAAAGGAACGCAAAGAGCGGAAAAGGACAAAAGAACGAGAGAAGAAGCTCCGAAGAAAAGAAAGATTAAAAGGGAAGGAAAAGGATAAAGATAAGATAAGTTCTGAATCAGCTGAAGTATGTTCTCATTCTGATATCTTAGAGGACTTATCCCCATGTGTTTTGGAGCAAAATTCCATCTCTGTCGATGAA
ACATGTGATGCCAGCATTCCTGAATCCTCTGATACTCTGGACGAGCAATTTTTAAATGAATCCATTGTTTCAGAAGTGCAAAGTTCATATGATGATGGCCTTGCCGGGAAACCTACTGATGGGAATGATGGAAATGAACCTTTCATGGTTGATTCATCAAAGTTTTCTCGCTGGAGATTAAAATTTCCAAAGGAAGTTCAAGATCATTCTTCCAAGTGGTCTGAGAGGCGACGATTTTCAGTTTCCGAAAATGGGGCAGGGGCTAGCAGATCTGAGCAAAGATATTATGGTGATAGTTTGGAGACTCCTTCGAGGACCATGAATGGATCAAACAGGAAACTAAGAACAAATTCATTAAAGGCATATGGTCGACACATCTCTAAGTTCAATGAAAAGTCGCACTCTTCCAACAACCGGGTATCTTACGACTACCGTTCCTGCATCTGTAACCAAAATAATGAATTAAACAAAAAGGCGGAGGCGTTTGTTTCTTCAGTTAGAGTTAATCGAGATGTCAAATCTGCGAGCACATCAGAATCGTCATTTGATATGTCCAAGCAGTGTTCTCATTCTAGCAGGTACAGCTATGGAGATCATTCTCGTGATGGTGGAAGACTGAAAAACAAAAACAATTCTCCTGGTAAAGATTACGTTTATTCAAAGAAAGTTTGGGAGCCCATGGAATCACAGAAGAAATATCCTAGAAGTAACTCAGATTCAAATGTAGCAATGAAGTCTTCAACTTGTAAGTTTAATGTTGAACCCGATTTTGACCTTGTGAAGTCGGGGCATGATGCCTGTAGGGGTGAAGTTGCTGTTACTTCGAGTACAGTTGATCAAGAAGAGAGTAATTCGACTGAGTCGACCTCTGGTGTTGAATCAGATGAAGTCTCCCAAAATGGACTCGAATGGAAGGATCATAAAAACATAGAAGAAGATGCATGCGAGGTAACAAAGCGTTCGGTAAATTCAACAGACACGACATTGACATCGAGTGGGACTAATAACCGAGTAGGAACTAGGTCTTTAAATTCTGATAGCTGCTCATCATGCCCGAGTGAAGGAGACAGTAATAATATCTGCTTGAACCATGGAAATCTGGAATCGTTGTCCACATCGGACTCCGAAGATGCTAGTCATCACTCAGAAGGAAAAGAATCTTCAGCATCCATTCAGAATGGCTTCTCTGAACATCGTGAGACAAGGATGGATAAAGTAGTTGAAGTTGATGCCTTGGGGATCAGGAACCATTCCGGTCTTTCGCAAGAGATCGAGGGATGTAAAGTTCAAGGAAATGCACCGAACCGAGTTCCCCGGGACTTCGAAGCAGGATTCTCTGCTGTTAGTTTGGACTCCTCCCCATGTCAAGTGACACTTCCTCCAACTCAGAATCAAACTCAAAATATTCACTTTCCAGTGTTTCAGGTTTCTCCAGCGATGGGTTATTACCATCAAAACTCAGTTTCATGGCCTGCAGCAGCAGCTCATGCTACTAATGGGATAATACCTTTCTCCTATTCAAATCCCTGTCTGTATGCCAATCCTCTTGGGTATGGTTTAAGTGATAACCCACGCTTCTGTATGCAGTATGGCCATTTGCATCATCTAGCTGCTCCCGTCTTCAACCCGAGCCCGGTTCCTATTTATCAGCCAGCTTCCAAAGCCAACAATGGTGTATATACTGAAGAACGAAGTCAGGTCCCCATATCAGAAAGCTCAGATGTTGTAGCTAATCCCGACATCATCGGTACCACTGGACTCCCATACGCAATCAGTTCACCACCAGGCAGAGATCGCAAGCAAAACGACACTTCCATATTCCCAAAGGATAGCTCAAGCTTTTCATTGTTCCATTTTGGAGGGCCTGTTGCATTTTCAACAGGAGGTAACTTAAACCCCATGCCTTCCAAGGAAGACGATATTGTCGGGGATTTTTCGAGAAATAACGAAGCAGCGGATGTTGTTGACGATGTCCATGCTTTCAATAAGAAGGAAACTGC
CATTGAAGAATACAACTTGTTTGCAGCAAGCAATGGCATGAGGTTCTCGTTCTTCTGAATGGGAGCAAGATGATATATGAGGGGACTGTTTTCGAGTTTCTAGTCCCTATCTTCATATTAATTTTTTTTTTTTTTAATGGAATTTTACAGTACTTCTTACATAATAAATTTTCTTTTCTTTTATTTTTTTTTTATTTTTTTTATTATTATTTTTTTTTTTTTAATGTAGTTGATTGCTCCCAAAATTTGTCTCATTATTTTCGAGATGTAGAGAACACACAAGAAAAAAAGAAAAAAAGAAAAAAAAAAGCAAAATTGGATGAAGGAGTGGTATGATGTTTGAAAGTTTTCTAAATGAGATGAGTCTTCTTATAGATGAAGAATGTGTTTATATGAACATTTTGTTGCCCATATTTGCTT
mRNA sequence
ATATAATCGTAGCACCCAAATCCATCGTCATCAACCAGAAACCCCTACATCCGTTTTCTCTTCTTGAATCTTGATGTCCCTTCAATTCCGTTTATATTATGCTATTTCATTCCAGTCCAATCGAATCCAATCCTATTCGATATAATCAATCCTCTCCTACTCTTCACCAGATTTGCTTCAGGAGGACGGCGTTATTTCGATCCGCCGACGATCTATTGGGTTGATTTCGGGTTGTGAGACAATGGAAGAATTGAGAAATGTAACTATGAGAGTTTGAAAGATGCCTGGGTTAACGCAAAAAAATGACCATTTAAATGGTGGGTCATCGGCTGTATACTCGCTCTCCGCCAATGGCTTTTGGTCCCAGCATCGCGACGATGTTAGCTACGTTCAGCTCCAGAAGTTTTGGAGTGAACTGCTGCCCCAAGCTAGGCAGAAACTCCTGAGAATTGACAAGCAAACTCTCTTTGAGCAAGCTCGTAAGAATATGTACTGCTCTCGATGTAATGGTTTGCTGCTTGAAGGATTTTTGCAGATTGTCATGTATGGGAAGTCTTTACAACAAGGAAACACATGTGTGAATCACACTTGCAACAGATTAGGGGGTTCAAAAAGTCAAACTTGCGATGGGTCATTGGCAGTTAATGGGTTTCATGATGAAATTCAAGACCCATCTGTCCATCCTTGGGGTGGTTTGACCACAACGCGCGAGGGGTTGCTGACACTTTTGGGCTGCTATTTGTATTCAAAGTCTTTCCTGGGTCTCCAAAATGTATTTGACAGTGCACGAGCTAGGGAGCGAGAGCGTGAATTGCTTTATCCTGATGCTTGTGGTGGGGGAGGTCGAGGCTGGATAAGTCAAGGAACAGCGGGCTATGGCAGGGGACATGGTACAAGGGAAACATGCGCCCTGCACACTGCTAGGCTTTCTTGTGATACATTGGTGGATTTCTGGTCAGCATTAGGAGAAGAAACTCGACAATCTCTTCTAAGGATGAAAGAAGAAGATTTTATTGAGAGACTAATGTACAGGTTTGACAGCAAGAGGTTTTGTAGAGATTGCAGAAGAAATGTGATCCGTGAGTTCAAGGAGCTGAAGGAACTGAAGCGCATGAGGAGAGAGCCTTGCTGCACTAGTTGGTTTTGTGTTGCAGATATGGCTTTTCATTACGAGGTCTCAGATGATACAATCCAGGCCGATTGGCATCAAACCTTTGCTGACTCCGTGGAGACATATCATTATTTTGAGTGGGCTGTTGGATCAGGAGAAGGAAAATCTGACATTCTGGAATTTGAAAATGTTGGCATGAACGGAAGTGTCAAAATGAATGGCCTAGATCTTGGTGGTTTGAATTCATGCTTTATCACCCTCAGAGCTTGGAAATTAGATGGACGCTGCACAGAGCTATCAGTGAAAGCTCATGCATTAAAAGGTCAACAATGTGTTCATCGAAGACTTATAGTTGGTGATGGATTTGTTACAATCACTAGAGGGGAAAATATTAGGAGGTTTTTTGAGCATGCTGAAGAGGCTGAAGAGGAGGAGGAGGATGATTCGATGGATAAGGACGCAAATGATTTGGATGGAGATTGCTCTCGTCCTCAAAAGCATGCGAAGAGTCCAGAACTTGCTCGGGAGTTTCTTTTGGATGCTGCAACTGTTATCTTTAAAGAACAGGTTGAAAAAGCATTCAGAGAAGGAACAGCTCGCCAAAATGCTCATAGCATTTTTGTTTGTCTTGCACTAAAATTATTGGAAGAACGAGTTCACATAGCATGCAAAGAAATCATTACTCTAGAAAAGCAGATGAAACTTCTTGAAGAAGAAGAGAAGGAAAAGCGTGAAGAAAAGGAACGCAAAGAGCGGAAAAGGACAAAAGAACGAGAGAAGAAGCTCCGAAGAAAAGAAAGATTAAAAGGGAAGGAAAAGGATAAAGATAAGATAAGTTCTGAATCAGCTGAAGTATGTTCTCATTCTGATATCTTAGAGGACTTATCCC
CATGTGTTTTGGAGCAAAATTCCATCTCTGTCGATGAAACATGTGATGCCAGCATTCCTGAATCCTCTGATACTCTGGACGAGCAATTTTTAAATGAATCCATTGTTTCAGAAGTGCAAAGTTCATATGATGATGGCCTTGCCGGGAAACCTACTGATGGGAATGATGGAAATGAACCTTTCATGGTTGATTCATCAAAGTTTTCTCGCTGGAGATTAAAATTTCCAAAGGAAGTTCAAGATCATTCTTCCAAGTGGTCTGAGAGGCGACGATTTTCAGTTTCCGAAAATGGGGCAGGGGCTAGCAGATCTGAGCAAAGATATTATGGTGATAGTTTGGAGACTCCTTCGAGGACCATGAATGGATCAAACAGGAAACTAAGAACAAATTCATTAAAGGCATATGGTCGACACATCTCTAAGTTCAATGAAAAGTCGCACTCTTCCAACAACCGGGTATCTTACGACTACCGTTCCTGCATCTGTAACCAAAATAATGAATTAAACAAAAAGGCGGAGGCGTTTGTTTCTTCAGTTAGAGTTAATCGAGATGTCAAATCTGCGAGCACATCAGAATCGTCATTTGATATGTCCAAGCAGTGTTCTCATTCTAGCAGGTACAGCTATGGAGATCATTCTCGTGATGGTGGAAGACTGAAAAACAAAAACAATTCTCCTGGTAAAGATTACGTTTATTCAAAGAAAGTTTGGGAGCCCATGGAATCACAGAAGAAATATCCTAGAAGTAACTCAGATTCAAATGTAGCAATGAAGTCTTCAACTTGTAAGTTTAATGTTGAACCCGATTTTGACCTTGTGAAGTCGGGGCATGATGCCTGTAGGGGTGAAGTTGCTGTTACTTCGAGTACAGTTGATCAAGAAGAGAGTAATTCGACTGAGTCGACCTCTGGTGTTGAATCAGATGAAGTCTCCCAAAATGGACTCGAATGGAAGGATCATAAAAACATAGAAGAAGATGCATGCGAGGTAACAAAGCGTTCGGTAAATTCAACAGACACGACATTGACATCGAGTGGGACTAATAACCGAGTAGGAACTAGGTCTTTAAATTCTGATAGCTGCTCATCATGCCCGAGTGAAGGAGACAGTAATAATATCTGCTTGAACCATGGAAATCTGGAATCGTTGTCCACATCGGACTCCGAAGATGCTAGTCATCACTCAGAAGGAAAAGAATCTTCAGCATCCATTCAGAATGGCTTCTCTGAACATCGTGAGACAAGGATGGATAAAGTAGTTGAAGTTGATGCCTTGGGGATCAGGAACCATTCCGGTCTTTCGCAAGAGATCGAGGGATGTAAAGTTCAAGGAAATGCACCGAACCGAGTTCCCCGGGACTTCGAAGCAGGATTCTCTGCTGTTAGTTTGGACTCCTCCCCATGTCAAGTGACACTTCCTCCAACTCAGAATCAAACTCAAAATATTCACTTTCCAGTGTTTCAGGTTTCTCCAGCGATGGGTTATTACCATCAAAACTCAGTTTCATGGCCTGCAGCAGCAGCTCATGCTACTAATGGGATAATACCTTTCTCCTATTCAAATCCCTGTCTGTATGCCAATCCTCTTGGGTATGGTTTAAGTGATAACCCACGCTTCTGTATGCAGTATGGCCATTTGCATCATCTAGCTGCTCCCGTCTTCAACCCGAGCCCGGTTCCTATTTATCAGCCAGCTTCCAAAGCCAACAATGGTGTATATACTGAAGAACGAAGTCAGGTCCCCATATCAGAAAGCTCAGATGTTGTAGCTAATCCCGACATCATCGGTACCACTGGACTCCCATACGCAATCAGTTCACCACCAGGCAGAGATCGCAAGCAAAACGACACTTCCATATTCCCAAAGGATAGCTCAAGCTTTTCATTGTTCCATTTTGGAGGGCCTGTTGCATTTTCAACAGGAGGTAACTTAAACCCCATGCCTTCCAAGGAAGACGATATTGTCGGGGATTTTTCGAGAAATAACGAAGCAGCGGATGTT
GTTGACGATGTCCATGCTTTCAATAAGAAGGAAACTGCCATTGAAGAATACAACTTGTTTGCAGCAAGCAATGGCATGAGGTTCTCGTTCTTCTGAATGGGAGCAAGATGATATATGAGGGGACTGTTTTCGAGTTTCTAGTCCCTATCTTCATATTAATTTTTTTTTTTTTTAATGGAATTTTACAGTACTTCTTACATAATAAATTTTCTTTTCTTTTATTTTTTTTTTATTTTTTTTATTATTATTTTTTTTTTTTTAATGTAGTTGATTGCTCCCAAAATTTGTCTCATTATTTTCGAGATGTAGAGAACACACAAGAAAAAAAGAAAAAAAGAAAAAAAAAAGCAAAATTGGATGAAGGAGTGGTATGATGTTTGAAAGTTTTCTAAATGAGATGAGTCTTCTTATAGATGAAGAATGTGTTTATATGAACATTTTGTTGCCCATATTTGCTT
Coding sequence (CDS)
ATGCCTGGGTTAACGCAAAAAAATGACCATTTAAATGGTGGGTCATCGGCTGTATACTCGCTCTCCGCCAATGGCTTTTGGTCCCAGCATCGCGACGATGTTAGCTACGTTCAGCTCCAGAAGTTTTGGAGTGAACTGCTGCCCCAAGCTAGGCAGAAACTCCTGAGAATTGACAAGCAAACTCTCTTTGAGCAAGCTCGTAAGAATATGTACTGCTCTCGATGTAATGGTTTGCTGCTTGAAGGATTTTTGCAGATTGTCATGTATGGGAAGTCTTTACAACAAGGAAACACATGTGTGAATCACACTTGCAACAGATTAGGGGGTTCAAAAAGTCAAACTTGCGATGGGTCATTGGCAGTTAATGGGTTTCATGATGAAATTCAAGACCCATCTGTCCATCCTTGGGGTGGTTTGACCACAACGCGCGAGGGGTTGCTGACACTTTTGGGCTGCTATTTGTATTCAAAGTCTTTCCTGGGTCTCCAAAATGTATTTGACAGTGCACGAGCTAGGGAGCGAGAGCGTGAATTGCTTTATCCTGATGCTTGTGGTGGGGGAGGTCGAGGCTGGATAAGTCAAGGAACAGCGGGCTATGGCAGGGGACATGGTACAAGGGAAACATGCGCCCTGCACACTGCTAGGCTTTCTTGTGATACATTGGTGGATTTCTGGTCAGCATTAGGAGAAGAAACTCGACAATCTCTTCTAAGGATGAAAGAAGAAGATTTTATTGAGAGACTAATGTACAGGTTTGACAGCAAGAGGTTTTGTAGAGATTGCAGAAGAAATGTGATCCGTGAGTTCAAGGAGCTGAAGGAACTGAAGCGCATGAGGAGAGAGCCTTGCTGCACTAGTTGGTTTTGTGTTGCAGATATGGCTTTTCATTACGAGGTCTCAGATGATACAATCCAGGCCGATTGGCATCAAACCTTTGCTGACTCCGTGGAGACATATCATTATTTTGAGTGGGCTGTTGGATCAGGAGAAGGAAAATCTGACATTCTGGAATTTGAAAATGTTGGCATGAACGGAAGTGTCAAAATGAATGGCCTAGATCTTGGTGGTTTGAATTCATGCTTTATCACCCTCAGAGCTTGGAAATTAGATGGACGCTGCACAGAGCTATCAGTGAAAGCTCATGCATTAAAAGGTCAACAATGTGTTCATCGAAGACTTATAGTTGGTGATGGATTTGTTACAATCACTAGAGGGGAAAATATTAGGAGGTTTTTTGAGCATGCTGAAGAGGCTGAAGAGGAGGAGGAGGATGATTCGATGGATAAGGACGCAAATGATTTGGATGGAGATTGCTCTCGTCCTCAAAAGCATGCGAAGAGTCCAGAACTTGCTCGGGAGTTTCTTTTGGATGCTGCAACTGTTATCTTTAAAGAACAGGTTGAAAAAGCATTCAGAGAAGGAACAGCTCGCCAAAATGCTCATAGCATTTTTGTTTGTCTTGCACTAAAATTATTGGAAGAACGAGTTCACATAGCATGCAAAGAAATCATTACTCTAGAAAAGCAGATGAAACTTCTTGAAGAAGAAGAGAAGGAAAAGCGTGAAGAAAAGGAACGCAAAGAGCGGAAAAGGACAAAAGAACGAGAGAAGAAGCTCCGAAGAAAAGAAAGATTAAAAGGGAAGGAAAAGGATAAAGATAAGATAAGTTCTGAATCAGCTGAAGTATGTTCTCATTCTGATATCTTAGAGGACTTATCCCCATGTGTTTTGGAGCAAAATTCCATCTCTGTCGATGAAACATGTGATGCCAGCATTCCTGAATCCTCTGATACTCTGGACGAGCAATTTTTAAATGAATCCATTGTTTCAGAAGTGCAAAGTTCATATGATGATGGCCTTGCCGGGAAACCTACTGATGGGAATGATGGAAATGAACCTTTCATGGTTGATTCATCAAAGTTTTCTCGCTGGAGATTAAAATTTCCAAAGGAAGTTCAAGATCATTCTTCCAAGTGGTCTGAGAGGCGACGATTTTCAGT
TTCCGAAAATGGGGCAGGGGCTAGCAGATCTGAGCAAAGATATTATGGTGATAGTTTGGAGACTCCTTCGAGGACCATGAATGGATCAAACAGGAAACTAAGAACAAATTCATTAAAGGCATATGGTCGACACATCTCTAAGTTCAATGAAAAGTCGCACTCTTCCAACAACCGGGTATCTTACGACTACCGTTCCTGCATCTGTAACCAAAATAATGAATTAAACAAAAAGGCGGAGGCGTTTGTTTCTTCAGTTAGAGTTAATCGAGATGTCAAATCTGCGAGCACATCAGAATCGTCATTTGATATGTCCAAGCAGTGTTCTCATTCTAGCAGGTACAGCTATGGAGATCATTCTCGTGATGGTGGAAGACTGAAAAACAAAAACAATTCTCCTGGTAAAGATTACGTTTATTCAAAGAAAGTTTGGGAGCCCATGGAATCACAGAAGAAATATCCTAGAAGTAACTCAGATTCAAATGTAGCAATGAAGTCTTCAACTTGTAAGTTTAATGTTGAACCCGATTTTGACCTTGTGAAGTCGGGGCATGATGCCTGTAGGGGTGAAGTTGCTGTTACTTCGAGTACAGTTGATCAAGAAGAGAGTAATTCGACTGAGTCGACCTCTGGTGTTGAATCAGATGAAGTCTCCCAAAATGGACTCGAATGGAAGGATCATAAAAACATAGAAGAAGATGCATGCGAGGTAACAAAGCGTTCGGTAAATTCAACAGACACGACATTGACATCGAGTGGGACTAATAACCGAGTAGGAACTAGGTCTTTAAATTCTGATAGCTGCTCATCATGCCCGAGTGAAGGAGACAGTAATAATATCTGCTTGAACCATGGAAATCTGGAATCGTTGTCCACATCGGACTCCGAAGATGCTAGTCATCACTCAGAAGGAAAAGAATCTTCAGCATCCATTCAGAATGGCTTCTCTGAACATCGTGAGACAAGGATGGATAAAGTAGTTGAAGTTGATGCCTTGGGGATCAGGAACCATTCCGGTCTTTCGCAAGAGATCGAGGGATGTAAAGTTCAAGGAAATGCACCGAACCGAGTTCCCCGGGACTTCGAAGCAGGATTCTCTGCTGTTAGTTTGGACTCCTCCCCATGTCAAGTGACACTTCCTCCAACTCAGAATCAAACTCAAAATATTCACTTTCCAGTGTTTCAGGTTTCTCCAGCGATGGGTTATTACCATCAAAACTCAGTTTCATGGCCTGCAGCAGCAGCTCATGCTACTAATGGGATAATACCTTTCTCCTATTCAAATCCCTGTCTGTATGCCAATCCTCTTGGGTATGGTTTAAGTGATAACCCACGCTTCTGTATGCAGTATGGCCATTTGCATCATCTAGCTGCTCCCGTCTTCAACCCGAGCCCGGTTCCTATTTATCAGCCAGCTTCCAAAGCCAACAATGGTGTATATACTGAAGAACGAAGTCAGGTCCCCATATCAGAAAGCTCAGATGTTGTAGCTAATCCCGACATCATCGGTACCACTGGACTCCCATACGCAATCAGTTCACCACCAGGCAGAGATCGCAAGCAAAACGACACTTCCATATTCCCAAAGGATAGCTCAAGCTTTTCATTGTTCCATTTTGGAGGGCCTGTTGCATTTTCAACAGGAGGTAACTTAAACCCCATGCCTTCCAAGGAAGACGATATTGTCGGGGATTTTTCGAGAAATAACGAAGCAGCGGATGTTGTTGACGATGTCCATGCTTTCAATAAGAAGGAAACTGCCATTGAAGAATACAACTTGTTTGCAGCAAGCAATGGCATGAGGTTCTCGTTCTTCTGA
Protein sequence
MPGLTQKNDHLNGGSSAVYSLSANGFWSQHRDDVSYVQLQKFWSELLPQARQKLLRIDKQTLFEQARKNMYCSRCNGLLLEGFLQIVMYGKSLQQGNTCVNHTCNRLGGSKSQTCDGSLAVNGFHDEIQDPSVHPWGGLTTTREGLLTLLGCYLYSKSFLGLQNVFDSARARERERELLYPDACGGGGRGWISQGTAGYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMKEEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRMRREPCCTSWFCVADMAFHYEVSDDTIQADWHQTFADSVETYHYFEWAVGSGEGKSDILEFENVGMNGSVKMNGLDLGGLNSCFITLRAWKLDGRCTELSVKAHALKGQQCVHRRLIVGDGFVTITRGENIRRFFEHAEEAEEEEEDDSMDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNAHSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEKERKERKRTKEREKKLRRKERLKGKEKDKDKISSESAEVCSHSDILEDLSPCVLEQNSISVDETCDASIPESSDTLDEQFLNESIVSEVQSSYDDGLAGKPTDGNDGNEPFMVDSSKFSRWRLKFPKEVQDHSSKWSERRRFSVSENGAGASRSEQRYYGDSLETPSRTMNGSNRKLRTNSLKAYGRHISKFNEKSHSSNNRVSYDYRSCICNQNNELNKKAEAFVSSVRVNRDVKSASTSESSFDMSKQCSHSSRYSYGDHSRDGGRLKNKNNSPGKDYVYSKKVWEPMESQKKYPRSNSDSNVAMKSSTCKFNVEPDFDLVKSGHDACRGEVAVTSSTVDQEESNSTESTSGVESDEVSQNGLEWKDHKNIEEDACEVTKRSVNSTDTTLTSSGTNNRVGTRSLNSDSCSSCPSEGDSNNICLNHGNLESLSTSDSEDASHHSEGKESSASIQNGFSEHRETRMDKVVEVDALGIRNHSGLSQEIEGCKVQGNAPNRVPRDFEAGFSAVSLDSSPCQVTLPPTQNQTQNIHFPVFQVSPAMGYYHQNSVSWPAAAAHATNGIIPFSYSNPCLYANPLGYGLSDNPRFCMQYGHLHHLAAPVFNPSPVPIYQPASKANNGVYTEERSQVPISESSDVVANPDIIGTTGLPYAISSPPGRDRKQNDTSIFPKDSSSFSLFHFGGPVAFSTGGNLNPMPSKEDDIVGDFSRNNEAADVVDDVHAFNKKETAIEEYNLFAASNGMRFSFF
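The protein sequence above should correspond to the in-frame translation of the coding sequence. The following is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The function name `translate_cds` is hypothetical and not part of any database tool, and only the first and last codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1).
# Codons are enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons containing ambiguous bases (for example N) become 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First 12 codons and last 7 codons of the CDS listed above
# (spaces are for readability only and are removed before translating).
cds_start = "ATG CCT GGG TTA ACG CAA AAA AAT GAC CAT TTA AAT".replace(" ", "")
cds_end = "ATG AGG TTC TCG TTC TTC TGA".replace(" ", "")

print(translate_cds(cds_start))  # MPGLTQKNDHLN
print(translate_cds(cds_end))    # MRFSFF
```

Both fragments agree with the start (MPGLTQKNDHLN) and end (MRFSFF) of the protein sequence listed above. Passing the full CDS string to the same function would be the complete check.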
Homology
BLAST of Cp4.1LG19g00810 vs. NCBI nr
Match:
XP_023517706.1 (uncharacterized protein LOC111781381 [Cucurbita pepo subsp. pepo] >XP_023517707.1 uncharacterized protein LOC111781381 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 2497 bits (6471), Expect = 0.0
Identity = 1271/1271 (100.00%), Positives = 1271/1271 (100.00%), Query Frame = 0
Query: 1 MPGLTQKNDHLNGGSSAVYSLSANGFWSQHRDDVSYVQLQKFWSELLPQARQKLLRIDKQ 60
MPGLTQKNDHLNGGSSAVYSLSANGFWSQHRDDVSYVQLQKFWSELLPQARQKLLRIDKQ
Sbjct: 1 MPGLTQKNDHLNGGSSAVYSLSANGFWSQHRDDVSYVQLQKFWSELLPQARQKLLRIDKQ 60
Query: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVMYGKSLQQGNTCVNHTCNRLGGSKSQTCDGSLA 120
TLFEQARKNMYCSRCNGLLLEGFLQIVMYGKSLQQGNTCVNHTCNRLGGSKSQTCDGSLA
Sbjct: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVMYGKSLQQGNTCVNHTCNRLGGSKSQTCDGSLA 120
Query: 121 VNGFHDEIQDPSVHPWGGLTTTREGLLTLLGCYLYSKSFLGLQNVFDSARARERERELLY 180
VNGFHDEIQDPSVHPWGGLTTTREGLLTLLGCYLYSKSFLGLQNVFDSARARERERELLY
Sbjct: 121 VNGFHDEIQDPSVHPWGGLTTTREGLLTLLGCYLYSKSFLGLQNVFDSARARERERELLY 180
Query: 181 PDACGGGGRGWISQGTAGYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
PDACGGGGRGWISQGTAGYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK
Sbjct: 181 PDACGGGGRGWISQGTAGYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
Query: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRMRREPCCTSWFCVADMAFHYEVS 300
EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRMRREPCCTSWFCVADMAFHYEVS
Sbjct: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRMRREPCCTSWFCVADMAFHYEVS 300
Query: 301 DDTIQADWHQTFADSVETYHYFEWAVGSGEGKSDILEFENVGMNGSVKMNGLDLGGLNSC 360
DDTIQADWHQTFADSVETYHYFEWAVGSGEGKSDILEFENVGMNGSVKMNGLDLGGLNSC
Sbjct: 301 DDTIQADWHQTFADSVETYHYFEWAVGSGEGKSDILEFENVGMNGSVKMNGLDLGGLNSC 360
Query: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLIVGDGFVTITRGENIRRFFEHAEEAEE 420
FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLIVGDGFVTITRGENIRRFFEHAEEAEE
Sbjct: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLIVGDGFVTITRGENIRRFFEHAEEAEE 420
Query: 421 EEEDDSMDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
EEEDDSMDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA
Sbjct: 421 EEEDDSMDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
Query: 481 HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEKERKERKRTKEREKKLR 540
HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEKERKERKRTKEREKKLR
Sbjct: 481 HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEKERKERKRTKEREKKLR 540
Query: 541 RKERLKGKEKDKDKISSESAEVCSHSDILEDLSPCVLEQNSISVDETCDASIPESSDTLD 600
RKERLKGKEKDKDKISSESAEVCSHSDILEDLSPCVLEQNSISVDETCDASIPESSDTLD
Sbjct: 541 RKERLKGKEKDKDKISSESAEVCSHSDILEDLSPCVLEQNSISVDETCDASIPESSDTLD 600
Query: 601 EQFLNESIVSEVQSSYDDGLAGKPTDGNDGNEPFMVDSSKFSRWRLKFPKEVQDHSSKWS 660
EQFLNESIVSEVQSSYDDGLAGKPTDGNDGNEPFMVDSSKFSRWRLKFPKEVQDHSSKWS
Sbjct: 601 EQFLNESIVSEVQSSYDDGLAGKPTDGNDGNEPFMVDSSKFSRWRLKFPKEVQDHSSKWS 660
Query: 661 ERRRFSVSENGAGASRSEQRYYGDSLETPSRTMNGSNRKLRTNSLKAYGRHISKFNEKSH 720
ERRRFSVSENGAGASRSEQRYYGDSLETPSRTMNGSNRKLRTNSLKAYGRHISKFNEKSH
Sbjct: 661 ERRRFSVSENGAGASRSEQRYYGDSLETPSRTMNGSNRKLRTNSLKAYGRHISKFNEKSH 720
Query: 721 SSNNRVSYDYRSCICNQNNELNKKAEAFVSSVRVNRDVKSASTSESSFDMSKQCSHSSRY 780
SSNNRVSYDYRSCICNQNNELNKKAEAFVSSVRVNRDVKSASTSESSFDMSKQCSHSSRY
Sbjct: 721 SSNNRVSYDYRSCICNQNNELNKKAEAFVSSVRVNRDVKSASTSESSFDMSKQCSHSSRY 780
Query: 781 SYGDHSRDGGRLKNKNNSPGKDYVYSKKVWEPMESQKKYPRSNSDSNVAMKSSTCKFNVE 840
SYGDHSRDGGRLKNKNNSPGKDYVYSKKVWEPMESQKKYPRSNSDSNVAMKSSTCKFNVE
Sbjct: 781 SYGDHSRDGGRLKNKNNSPGKDYVYSKKVWEPMESQKKYPRSNSDSNVAMKSSTCKFNVE 840
Query: 841 PDFDLVKSGHDACRGEVAVTSSTVDQEESNSTESTSGVESDEVSQNGLEWKDHKNIEEDA 900
PDFDLVKSGHDACRGEVAVTSSTVDQEESNSTESTSGVESDEVSQNGLEWKDHKNIEEDA
Sbjct: 841 PDFDLVKSGHDACRGEVAVTSSTVDQEESNSTESTSGVESDEVSQNGLEWKDHKNIEEDA 900
Query: 901 CEVTKRSVNSTDTTLTSSGTNNRVGTRSLNSDSCSSCPSEGDSNNICLNHGNLESLSTSD 960
CEVTKRSVNSTDTTLTSSGTNNRVGTRSLNSDSCSSCPSEGDSNNICLNHGNLESLSTSD
Sbjct: 901 CEVTKRSVNSTDTTLTSSGTNNRVGTRSLNSDSCSSCPSEGDSNNICLNHGNLESLSTSD 960
Query: 961 SEDASHHSEGKESSASIQNGFSEHRETRMDKVVEVDALGIRNHSGLSQEIEGCKVQGNAP 1020
SEDASHHSEGKESSASIQNGFSEHRETRMDKVVEVDALGIRNHSGLSQEIEGCKVQGNAP
Sbjct: 961 SEDASHHSEGKESSASIQNGFSEHRETRMDKVVEVDALGIRNHSGLSQEIEGCKVQGNAP 1020
Query: 1021 NRVPRDFEAGFSAVSLDSSPCQVTLPPTQNQTQNIHFPVFQVSPAMGYYHQNSVSWPAAA 1080
NRVPRDFEAGFSAVSLDSSPCQVTLPPTQNQTQNIHFPVFQVSPAMGYYHQNSVSWPAAA
Sbjct: 1021 NRVPRDFEAGFSAVSLDSSPCQVTLPPTQNQTQNIHFPVFQVSPAMGYYHQNSVSWPAAA 1080
Query: 1081 AHATNGIIPFSYSNPCLYANPLGYGLSDNPRFCMQYGHLHHLAAPVFNPSPVPIYQPASK 1140
AHATNGIIPFSYSNPCLYANPLGYGLSDNPRFCMQYGHLHHLAAPVFNPSPVPIYQPASK
Sbjct: 1081 AHATNGIIPFSYSNPCLYANPLGYGLSDNPRFCMQYGHLHHLAAPVFNPSPVPIYQPASK 1140
Query: 1141 ANNGVYTEERSQVPISESSDVVANPDIIGTTGLPYAISSPPGRDRKQNDTSIFPKDSSSF 1200
ANNGVYTEERSQVPISESSDVVANPDIIGTTGLPYAISSPPGRDRKQNDTSIFPKDSSSF
Sbjct: 1141 ANNGVYTEERSQVPISESSDVVANPDIIGTTGLPYAISSPPGRDRKQNDTSIFPKDSSSF 1200
Query: 1201 SLFHFGGPVAFSTGGNLNPMPSKEDDIVGDFSRNNEAADVVDDVHAFNKKETAIEEYNLF 1260
SLFHFGGPVAFSTGGNLNPMPSKEDDIVGDFSRNNEAADVVDDVHAFNKKETAIEEYNLF
Sbjct: 1201 SLFHFGGPVAFSTGGNLNPMPSKEDDIVGDFSRNNEAADVVDDVHAFNKKETAIEEYNLF 1260
Query: 1261 AASNGMRFSFF 1271
AASNGMRFSFF
Sbjct: 1261 AASNGMRFSFF 1271
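The percentages in each HSP summary line are the matched positions divided by the alignment length, which counts gap columns as well as residues. As a hedged illustration, the sketch below recomputes them from the `count/length` fractions. The helper name `hsp_percentages` and the regular expression are assumptions made for this example, not part of BLAST itself.

```python
import re

# Matches "<label> = <matches>/<alignment length>" pairs in an HSP summary line.
HSP_FRACTION = re.compile(r"(\w+)\s*=\s*(\d+)/(\d+)")


def hsp_percentages(line: str) -> dict:
    """Return {label: percentage} for every 'label = n/total' pair in the line."""
    return {
        label: round(100.0 * int(matches) / int(total), 2)
        for label, matches, total in HSP_FRACTION.findall(line)
    }


# Values taken from the hits reported on this page.
print(hsp_percentages("Identity = 1271/1271 (100.00%)"))  # {'Identity': 100.0}
print(hsp_percentages("Identity = 1240/1275 (97.25%)"))   # {'Identity': 97.25}
```

An alignment length of 1275 for a 1271-residue query is consistent with four gap columns having been inserted into the query row of that alignment.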
BLAST of Cp4.1LG19g00810 vs. NCBI nr
Match:
KAG7027601.1 (hypothetical protein SDJN02_11615 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 2426 bits (6288), Expect = 0.0
Identity = 1240/1275 (97.25%), Positives = 1248/1275 (97.88%), Query Frame = 0
Query: 1 MPGLTQKNDHLNGGSSAVYSLSANGFWSQHRDDVSYVQLQKFWSELLPQARQKLLRIDKQ 60
MPGLTQKNDHLNGGSSAVYSLSANGFWSQH DDVSYVQLQKFWSELLPQARQKLLRIDKQ
Sbjct: 1 MPGLTQKNDHLNGGSSAVYSLSANGFWSQHGDDVSYVQLQKFWSELLPQARQKLLRIDKQ 60
Query: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVMYGKSLQQGNTCVNHTCNRLGGSKSQTCDGSLA 120
TLFEQARKNMYCSRCNGLLLEGFLQIVMYGKSLQQGNTCVNHTCNRLG SKSQTCDGSLA
Sbjct: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVMYGKSLQQGNTCVNHTCNRLGVSKSQTCDGSLA 120
Query: 121 VNGFHDEIQDPSVHPWGGLTTTREGLLTLLGCYLYSKSFLGLQNVFDSARARERERELLY 180
VNGFHDEIQDPSVHPWGGLTTTREGLLTLLGCYLYSKSFLGLQNVFDSARARERERELLY
Sbjct: 121 VNGFHDEIQDPSVHPWGGLTTTREGLLTLLGCYLYSKSFLGLQNVFDSARARERERELLY 180
Query: 181 PDACGGGGRGWISQGTAGYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
PDACGGGGRGWISQGT GYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK
Sbjct: 181 PDACGGGGRGWISQGTVGYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
Query: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRMRREPCCTSWFCVADMAFHYEVS 300
EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKR+RREPCCTSWFCVADMAFHYEVS
Sbjct: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFHYEVS 300
Query: 301 DDTIQADWHQTFADSVETYHYFEWAVGSGEGKSDILEFENVGMNGSVKMNGLDLGGLNSC 360
DDTIQADWHQTFADSVETYHYFEWAVGSGEGKSDILEFENVGMNGSVKMNGLDLGGLNSC
Sbjct: 301 DDTIQADWHQTFADSVETYHYFEWAVGSGEGKSDILEFENVGMNGSVKMNGLDLGGLNSC 360
Query: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLIVGDGFVTITRGENIRRFFEHAEEAEE 420
FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLIVGDGFVTITRGENIRRFFEHAEEAEE
Sbjct: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLIVGDGFVTITRGENIRRFFEHAEEAEE 420
Query: 421 EEEDDSMDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
EEEDDSMDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA
Sbjct: 421 EEEDDSMDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
Query: 481 HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEKERKERKRTKEREKKLR 540
HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEKERKERKRTKEREKKLR
Sbjct: 481 HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEKERKERKRTKEREKKLR 540
Query: 541 RKERLKGKEKDKDKISSESAEVCSHSDILEDLSPCVLEQNSISVDETCDASIPESSDTLD 600
RKERLKGKEKDKDKISSESAEVCSHSDILEDLSPCVLEQ SISVDETCDASIPESSDTLD
Sbjct: 541 RKERLKGKEKDKDKISSESAEVCSHSDILEDLSPCVLEQTSISVDETCDASIPESSDTLD 600
Query: 601 EQFLNESIVSEVQSSYDDGLAGKPTDGNDGNEPFMVDSSKFSRWRLKFPKEVQDHSSKWS 660
EQFL+ESI+SEVQSSYDDGLAGKPTDGNDGNE FMVDSSK SRWRLKFPKEVQDHS KWS
Sbjct: 601 EQFLDESIISEVQSSYDDGLAGKPTDGNDGNESFMVDSSKCSRWRLKFPKEVQDHSFKWS 660
Query: 661 ERRRFSVSENGAGASRSEQRYYGDSLETPSRTMNGSNRKLRTNSLKAYGRHISKFNEKSH 720
ERRRFSVSENGAGASRSEQRYYGDSLE PSRTMNGSNRKLRTNSLKAYGRHISKFNEKSH
Sbjct: 661 ERRRFSVSENGAGASRSEQRYYGDSLENPSRTMNGSNRKLRTNSLKAYGRHISKFNEKSH 720
Query: 721 SSNNRVSYDYRSCICNQNNELNKKAEAFVSSVRVNRDVKSASTSESSFDMSKQCSHSSRY 780
SSNNRVSYDYRSC+CNQNNELNKKAEAFVSSVRVNRDVKSASTSESSFDMSKQCSHS+RY
Sbjct: 721 SSNNRVSYDYRSCVCNQNNELNKKAEAFVSSVRVNRDVKSASTSESSFDMSKQCSHSNRY 780
Query: 781 SYGDHSRDGGRLKNKNNSPGKDYVYSKKVWEPMESQKKYPRSNSDSNVAMKSSTCKFNVE 840
SYGDHSRDGGRLKNKNNSPGKDYVYSKKVWEPMESQKKYPRSNSDSNVAMKSSTCKFNVE
Sbjct: 781 SYGDHSRDGGRLKNKNNSPGKDYVYSKKVWEPMESQKKYPRSNSDSNVAMKSSTCKFNVE 840
Query: 841 PDFDLVKSGHDACRGEVAVTSSTVDQEESNSTESTSGVESDEVSQNGLEWKDHKNIEEDA 900
PDFDLVKSGHDAC GEVAVTSSTVDQEESNSTESTSGVESDEVSQNGLEWKDHKNIEEDA
Sbjct: 841 PDFDLVKSGHDACSGEVAVTSSTVDQEESNSTESTSGVESDEVSQNGLEWKDHKNIEEDA 900
Query: 901 CEVTKRSVNSTDTTLTSSGTNNRVGTRSLNSDSCSSCPSEGDSNNICLNHGNLESLSTSD 960
CEVTKRSVNSTDTTLTSSGTNNRVGTRSLNSDSCSSCPSEGDS+NICLNHGNLESLSTSD
Sbjct: 901 CEVTKRSVNSTDTTLTSSGTNNRVGTRSLNSDSCSSCPSEGDSSNICLNHGNLESLSTSD 960
Query: 961 SEDASHHSEGKESSASIQNGFSEHRETRMDKVVEVDALGIRNHSGLSQEIEGCKVQGNAP 1020
SEDASHHSEGKESSASIQNGFSEH ETRMDKVVEVDALGIRNHSGL QEIEGCKVQGNAP
Sbjct: 961 SEDASHHSEGKESSASIQNGFSEHHETRMDKVVEVDALGIRNHSGLLQEIEGCKVQGNAP 1020
Query: 1021 NRVPRDFEAGFSAVSLDSSPCQVTLPPTQNQTQNIHFPVFQVSPAMGYYHQNSVSWPAAA 1080
NRVPRDFEAGFSAVSLDSSPCQVTLPPTQNQTQNIHFPVFQVSPAMGYYHQNSVSWPAAA
Sbjct: 1021 NRVPRDFEAGFSAVSLDSSPCQVTLPPTQNQTQNIHFPVFQVSPAMGYYHQNSVSWPAAA 1080
Query: 1081 AHATNGIIPFSYSNPCLYANPLGYGLSDNPRFCMQYGHLHHLAAPVFNPSPVPIYQPASK 1140
A NGIIPFSYSN CLYANPLGYGLSDNPRFCMQY HLHHL PVFNPSPVPIYQPASK
Sbjct: 1081 AAHANGIIPFSYSNHCLYANPLGYGLSDNPRFCMQYSHLHHLGTPVFNPSPVPIYQPASK 1140
Query: 1141 ANNGVYTEERSQVP----ISESSDVVANPDIIGTTGLPYAISSPPGRDRKQNDTSIFPKD 1200
ANNGVYT+ERSQVP + ESSDVVANPD I TTGLPYAISSPPGRDRKQNDTSIFPKD
Sbjct: 1141 ANNGVYTDERSQVPKTGSMVESSDVVANPDAISTTGLPYAISSPPGRDRKQNDTSIFPKD 1200
Query: 1201 SSSFSLFHFGGPVAFSTGGNLNPMPSKEDDIVGDFSRNNEAADVVDDVHAFNKKETAIEE 1260
SSSFSLFHFGGPVAFSTGGNLNPMPSKEDDIV DFSRNNEA DVVDDVHAFNKKETAIEE
Sbjct: 1201 SSSFSLFHFGGPVAFSTGGNLNPMPSKEDDIVRDFSRNNEATDVVDDVHAFNKKETAIEE 1260
Query: 1261 YNLFAASNGMRFSFF 1271
YNLFAASNGMRFSFF
Sbjct: 1261 YNLFAASNGMRFSFF 1275
BLAST of Cp4.1LG19g00810 vs. NCBI nr
Match:
XP_022925078.1 (uncharacterized protein LOC111432432 [Cucurbita moschata])
HSP 1 Score: 2425 bits (6286), Expect = 0.0
Identity = 1240/1275 (97.25%), Positives = 1249/1275 (97.96%), Query Frame = 0
Query: 1 MPGLTQKNDHLNGGSSAVYSLSANGFWSQHRDDVSYVQLQKFWSELLPQARQKLLRIDKQ 60
MPGLTQKNDHLNGGSSAVYSLSANGFWSQH DDVSYVQLQKFWSELLPQARQKLLRIDKQ
Sbjct: 1 MPGLTQKNDHLNGGSSAVYSLSANGFWSQHGDDVSYVQLQKFWSELLPQARQKLLRIDKQ 60
Query: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVMYGKSLQQGNTCVNHTCNRLGGSKSQTCDGSLA 120
TLFEQARKNMYCSRCNGLLLEGFLQIVMYGKSLQQGNTCVNHTCNRLG SKSQTCDGSLA
Sbjct: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVMYGKSLQQGNTCVNHTCNRLGVSKSQTCDGSLA 120
Query: 121 VNGFHDEIQDPSVHPWGGLTTTREGLLTLLGCYLYSKSFLGLQNVFDSARARERERELLY 180
VNGFHDEIQDPSVHPWGGLTTTREGLLTLLGCYLYSKSFLGLQNVFDSARARERERELLY
Sbjct: 121 VNGFHDEIQDPSVHPWGGLTTTREGLLTLLGCYLYSKSFLGLQNVFDSARARERERELLY 180
Query: 181 PDACGGGGRGWISQGTAGYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
PDACGGGGRGWISQGT GYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK
Sbjct: 181 PDACGGGGRGWISQGTVGYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
Query: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRMRREPCCTSWFCVADMAFHYEVS 300
EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRMRREPCCTSWFCVADMAFHYEVS
Sbjct: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRMRREPCCTSWFCVADMAFHYEVS 300
Query: 301 DDTIQADWHQTFADSVETYHYFEWAVGSGEGKSDILEFENVGMNGSVKMNGLDLGGLNSC 360
DDTIQADWHQTFADSVETYHYFEWAVGSGEGKSDILEFENVGMNGSVKMNGLDLGGLNSC
Sbjct: 301 DDTIQADWHQTFADSVETYHYFEWAVGSGEGKSDILEFENVGMNGSVKMNGLDLGGLNSC 360
Query: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLIVGDGFVTITRGENIRRFFEHAEEAEE 420
FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLIVGDGFVTITRGENIRRFFEHAEEAEE
Sbjct: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLIVGDGFVTITRGENIRRFFEHAEEAEE 420
Query: 421 EEEDDSMDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
EEEDDSMDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA
Sbjct: 421 EEEDDSMDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
Query: 481 HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEKERKERKRTKEREKKLR 540
HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEKERKERKRTKEREKKLR
Sbjct: 481 HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEKERKERKRTKEREKKLR 540
Query: 541 RKERLKGKEKDKDKISSESAEVCSHSDILEDLSPCVLEQNSISVDETCDASIPESSDTLD 600
RKERLKGKEKDKDKISSESAEVCSHSDILEDLSPCVLEQNSISVDETCDASIPESSDTLD
Sbjct: 541 RKERLKGKEKDKDKISSESAEVCSHSDILEDLSPCVLEQNSISVDETCDASIPESSDTLD 600
Query: 601 EQFLNESIVSEVQSSYDDGLAGKPTDGNDGNEPFMVDSSKFSRWRLKFPKEVQDHSSKWS 660
EQFL+ESI+SEVQSSYDDGLAGKPTDGNDGNE FMVDSSKFSRWRLKFPKEVQDHS KWS
Sbjct: 601 EQFLDESIISEVQSSYDDGLAGKPTDGNDGNESFMVDSSKFSRWRLKFPKEVQDHSFKWS 660
Query: 661 ERRRFSVSENGAGASRSEQRYYGDSLETPSRTMNGSNRKLRTNSLKAYGRHISKFNEKSH 720
ERRRFSVSENGAGASRSEQRYYGDSLE PSRTMNGSNRKLRTNSLKAYGRHISKFNEKSH
Sbjct: 661 ERRRFSVSENGAGASRSEQRYYGDSLENPSRTMNGSNRKLRTNSLKAYGRHISKFNEKSH 720
Query: 721 SSNNRVSYDYRSCICNQNNELNKKAEAFVSSVRVNRDVKSASTSESSFDMSKQCSHSSRY 780
SSNNRVSYDYRSC+CNQNNELNKKAEAFVSSVRVNRDVKSASTSESSFDMSKQCSHS+RY
Sbjct: 721 SSNNRVSYDYRSCVCNQNNELNKKAEAFVSSVRVNRDVKSASTSESSFDMSKQCSHSNRY 780
Query: 781 SYGDHSRDGGRLKNKNNSPGKDYVYSKKVWEPMESQKKYPRSNSDSNVAMKSSTCKFNVE 840
SYGDHSRDGGRLKNKNNSPGKDYVYSKKVWEPMESQKKYPRSNSDSNVAMKSSTCKFNVE
Sbjct: 781 SYGDHSRDGGRLKNKNNSPGKDYVYSKKVWEPMESQKKYPRSNSDSNVAMKSSTCKFNVE 840
Query: 841 PDFDLVKSGHDACRGEVAVTSSTVDQEESNSTESTSGVESDEVSQNGLEWKDHKNIEEDA 900
PDFDLVKSGHDAC GEVAVTSSTVDQEESNSTESTSGVESDEVSQNGLEWKDHKNIEEDA
Sbjct: 841 PDFDLVKSGHDACSGEVAVTSSTVDQEESNSTESTSGVESDEVSQNGLEWKDHKNIEEDA 900
Query: 901 CEVTKRSVNSTDTTLTSSGTNNRVGTRSLNSDSCSSCPSEGDSNNICLNHGNLESLSTSD 960
CEVTKRSVNST TTLTSSGTNNRVGTRSLNSDSCSSCPSEGDS+NICLNHGNLESLSTSD
Sbjct: 901 CEVTKRSVNSTGTTLTSSGTNNRVGTRSLNSDSCSSCPSEGDSSNICLNHGNLESLSTSD 960
Query: 961 SEDASHHSEGKESSASIQNGFSEHRETRMDKVVEVDALGIRNHSGLSQEIEGCKVQGNAP 1020
SEDASHHSEGKESSASIQNGFSEH ETRMDKVVEVDALGIRNHSGL QEIEGCKVQ NAP
Sbjct: 961 SEDASHHSEGKESSASIQNGFSEHHETRMDKVVEVDALGIRNHSGLLQEIEGCKVQVNAP 1020
Query: 1021 NRVPRDFEAGFSAVSLDSSPCQVTLPPTQNQTQNIHFPVFQVSPAMGYYHQNSVSWPAAA 1080
NRVPRDFEAGFSAVSLDSSPCQVTLPPTQNQTQNIHFPVFQVSPAMGYYHQNSVSWPAAA
Sbjct: 1021 NRVPRDFEAGFSAVSLDSSPCQVTLPPTQNQTQNIHFPVFQVSPAMGYYHQNSVSWPAAA 1080
Query: 1081 AHATNGIIPFSYSNPCLYANPLGYGLSDNPRFCMQYGHLHHLAAPVFNPSPVPIYQPASK 1140
AHA NGIIPFSYSN CLY NPLGYGLSDNPRFCMQY HLHHL PVFNPSPVPIYQPASK
Sbjct: 1081 AHA-NGIIPFSYSNHCLYTNPLGYGLSDNPRFCMQYSHLHHLTTPVFNPSPVPIYQPASK 1140
Query: 1141 ANNGVYTEERSQVP----ISESSDVVANPDIIGTTGLPYAISSPPGRDRKQNDTSIFPKD 1200
ANNGVYTEERS+VP + ESSDVVANPD+IGTTGLPYAISSPPGRDRKQNDTSI P D
Sbjct: 1141 ANNGVYTEERSEVPKTGSMVESSDVVANPDVIGTTGLPYAISSPPGRDRKQNDTSILPND 1200
Query: 1201 SSSFSLFHFGGPVAFSTGGNLNPMPSKEDDIVGDFSRNNEAADVVDDVHAFNKKETAIEE 1260
SSSFSLFHFGGPVAFSTGGNL PMPSKEDDIVGDFSRNNEA DVVD+VHAFNKKETAIEE
Sbjct: 1201 SSSFSLFHFGGPVAFSTGGNLKPMPSKEDDIVGDFSRNNEATDVVDNVHAFNKKETAIEE 1260
Query: 1261 YNLFAASNGMRFSFF 1271
YNLFAASNGMRFSFF
Sbjct: 1261 YNLFAASNGMRFSFF 1274
BLAST of Cp4.1LG19g00810 vs. NCBI nr
Match:
KAG6595628.1 (hypothetical protein SDJN03_12181, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 2424 bits (6281), Expect = 0.0
Identity = 1238/1274 (97.17%), Positives = 1247/1274 (97.88%), Query Frame = 0
Query: 1 MPGLTQKNDHLNGGSSAVYSLSANGFWSQHRDDVSYVQLQKFWSELLPQARQKLLRIDKQ 60
MPGLTQKNDHLNGGSSAVYSLSANGFWSQH DDVSYVQLQKFWSELLPQARQKLLRIDKQ
Sbjct: 1 MPGLTQKNDHLNGGSSAVYSLSANGFWSQHGDDVSYVQLQKFWSELLPQARQKLLRIDKQ 60
Query: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVMYGKSLQQGNTCVNHTCNRLGGSKSQTCDGSLA 120
TLFEQARKNMYCSRCNGLLLEGFLQIVMYGKSLQQGNTCVNHTCNRLG SKSQTCDGSLA
Sbjct: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVMYGKSLQQGNTCVNHTCNRLGVSKSQTCDGSLA 120
Query: 121 VNGFHDEIQDPSVHPWGGLTTTREGLLTLLGCYLYSKSFLGLQNVFDSARARERERELLY 180
VNGFHDEIQDPSVHPWGGLTTTREGLLTLLGCYLYSKSFLGLQNVFDSARARERERELLY
Sbjct: 121 VNGFHDEIQDPSVHPWGGLTTTREGLLTLLGCYLYSKSFLGLQNVFDSARARERERELLY 180
Query: 181 PDACGGGGRGWISQGTAGYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
PDACGGGGRGWISQGT GYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK
Sbjct: 181 PDACGGGGRGWISQGTVGYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
Query: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRMRREPCCTSWFCVADMAFHYEVS 300
EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRMRREPCCTSWFCVADMAFHYEVS
Sbjct: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRMRREPCCTSWFCVADMAFHYEVS 300
Query: 301 DDTIQADWHQTFADSVETYHYFEWAVGSGEGKSDILEFENVGMNGSVKMNGLDLGGLNSC 360
DDTIQADWHQTFADSVETY+YFEWAVGSGEGKSDILEFENVGMNGSVKMNGLDLGGLNSC
Sbjct: 301 DDTIQADWHQTFADSVETYNYFEWAVGSGEGKSDILEFENVGMNGSVKMNGLDLGGLNSC 360
Query: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLIVGDGFVTITRGENIRRFFEHAEEAEE 420
FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLIVGDGFVTITRGENIRRFFEHAEEAEE
Sbjct: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLIVGDGFVTITRGENIRRFFEHAEEAEE 420
Query: 421 EEEDDSMDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
EEEDDSMDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA
Sbjct: 421 EEEDDSMDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
Query: 481 HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEKERKERKRTKEREKKLR 540
HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEKERKERKRTKEREKKLR
Sbjct: 481 HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEKERKERKRTKEREKKLR 540
Query: 541 RKERLKGKEKDKDKISSESAEVCSHSDILEDLSPCVLEQNSISVDETCDASIPESSDTLD 600
RKERLKGKEKDKDKISSESAEVCSHSDILEDLSPCVLEQNSISVDETCDASIPESSDTLD
Sbjct: 541 RKERLKGKEKDKDKISSESAEVCSHSDILEDLSPCVLEQNSISVDETCDASIPESSDTLD 600
Query: 601 EQFLNESIVSEVQSSYDDGLAGKPTDGNDGNEPFMVDSSKFSRWRLKFPKEVQDHSSKWS 660
EQFL+ESI+SEVQSSYDDGLAGKPTDGNDGNE FMVDSSKFSRWRLKFPKEVQDHS KWS
Sbjct: 601 EQFLDESIISEVQSSYDDGLAGKPTDGNDGNESFMVDSSKFSRWRLKFPKEVQDHSFKWS 660
Query: 661 ERRRFSVSENGAGASRSEQRYYGDSLETPSRTMNGSNRKLRTNSLKAYGRHISKFNEKSH 720
ERRRFSVSENGAGASRSEQRYYGDSLE PSRTMNGSNRKLRTNSLKAYGRHISKFNEKSH
Sbjct: 661 ERRRFSVSENGAGASRSEQRYYGDSLENPSRTMNGSNRKLRTNSLKAYGRHISKFNEKSH 720
Query: 721 SSNNRVSYDYRSCICNQNNELNKKAEAFVSSVRVNRDVKSASTSESSFDMSKQCSHSSRY 780
SSNNRVSYDYRSC+CNQNNELNKKAEAFVSSVRVNRDVKSASTSESSFDMSKQCSHS+RY
Sbjct: 721 SSNNRVSYDYRSCVCNQNNELNKKAEAFVSSVRVNRDVKSASTSESSFDMSKQCSHSNRY 780
Query: 781 SYGDHSRDGGRLKNKNNSPGKDYVYSKKVWEPMESQKKYPRSNSDSNVAMKSSTCKFNVE 840
SYGDHSRDGGRLKNKNNSPGKDYVYSKKVWEPMESQKKYPRSNSDSNVAMKSSTCKFNVE
Sbjct: 781 SYGDHSRDGGRLKNKNNSPGKDYVYSKKVWEPMESQKKYPRSNSDSNVAMKSSTCKFNVE 840
Query: 841 PDFDLVKSGHDACRGEVAVTSSTVDQEESNSTESTSGVESDEVSQNGLEWKDHKNIEEDA 900
PDFDLVKSGHDAC GEVAVTSSTVDQEESNSTESTSGVESDEVSQNGLEWKDHKNIEEDA
Sbjct: 841 PDFDLVKSGHDACSGEVAVTSSTVDQEESNSTESTSGVESDEVSQNGLEWKDHKNIEEDA 900
Query: 901 CEVTKRSVNSTDTTLTSSGTNNRVGTRSLNSDSCSSCPSEGDSNNICLNHGNLESLSTSD 960
CEVTKRSVNSTD TLTSSGTNNRVGTRSLNSDSCSSCPSEGDS+NICLNHGNLESLSTSD
Sbjct: 901 CEVTKRSVNSTDMTLTSSGTNNRVGTRSLNSDSCSSCPSEGDSSNICLNHGNLESLSTSD 960
Query: 961 SEDASHHSEGKESSASIQNGFSEHRETRMDKVVEVDALGIRNHSGLSQEIEGCKVQGNAP 1020
SEDASHHSEGKESSASIQNGFSEH ETRMDKVVEVDALGIRNHSGL QEIEGCKVQGNAP
Sbjct: 961 SEDASHHSEGKESSASIQNGFSEHHETRMDKVVEVDALGIRNHSGLLQEIEGCKVQGNAP 1020
Query: 1021 NRVPRDFEAGFSAVSLDSSPCQVTLPPTQNQTQNIHFPVFQVSPAMGYYHQNSVSWPAAA 1080
NRVPRDFEAGFSAVSLDSSPCQVTLPPTQNQTQNIHFPVFQVSPAMGYYHQNSVSWPAAA
Sbjct: 1021 NRVPRDFEAGFSAVSLDSSPCQVTLPPTQNQTQNIHFPVFQVSPAMGYYHQNSVSWPAAA 1080
Query: 1081 AHATNGIIPFSYSNPCLYANPLGYGLSDNPRFCMQYGHLHHLAAPVFNPSPVPIYQPASK 1140
A NGIIPFSYSN CLYANPLGYGLSDNPRFCMQY HLHHL PVFNPSPVPIYQPASK
Sbjct: 1081 AAHANGIIPFSYSNHCLYANPLGYGLSDNPRFCMQYSHLHHLGTPVFNPSPVPIYQPASK 1140
Query: 1141 ANNGVYTEERSQVP----ISESSDVVANPDIIGTTGLPYAISSPPGRDRKQNDTSIFPKD 1200
ANNGVYT+ERSQVP + ESSDV+ANPD I TTGLPYAISSPPGRDRKQNDTSIFPKD
Sbjct: 1141 ANNGVYTDERSQVPKTGSMVESSDVLANPDAISTTGLPYAISSPPGRDRKQNDTSIFPKD 1200
Query: 1201 SSSFSLFHFGGPVAFSTGGNLNPMPSKEDDIVGDFSRNNEAADVVDDVHAFNKKETAIEE 1260
SSSFSLFHFGGPVAFSTGGNLNPMPSKEDDIVGDFSRNNEA DVVDDVHAFNKKETAIEE
Sbjct: 1201 SSSFSLFHFGGPVAFSTGGNLNPMPSKEDDIVGDFSRNNEATDVVDDVHAFNKKETAIEE 1260
Query: 1261 YNLFAASNGMRFSF 1270
YNLFAASNGM F F
Sbjct: 1261 YNLFAASNGMSFLF 1274
BLAST of Cp4.1LG19g00810 vs. NCBI nr
Match:
XP_022966143.1 (uncharacterized protein LOC111465909 [Cucurbita maxima])
HSP 1 Score: 2419 bits (6269), Expect = 0.0
Identity = 1242/1275 (97.41%), Positives = 1248/1275 (97.88%), Query Frame = 0
Query: 1 MPGLTQKNDHLNGGSSAVYSLSANGFWSQHRDDVSYVQLQKFWSELLPQARQKLLRIDKQ 60
MPGLTQKNDHLNGGSSAVYSLSANGFWSQHRDDVSYVQLQKFWSELLPQARQKLLRIDKQ
Sbjct: 1 MPGLTQKNDHLNGGSSAVYSLSANGFWSQHRDDVSYVQLQKFWSELLPQARQKLLRIDKQ 60
Query: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVMYGKSLQQGNTCVNHTCNRLGGSKSQTCDGSLA 120
TLFEQARKNMYCSRCNGLLLEGFLQIVMYGKSLQQGNTCVNHTCNRLG SKSQTCDGSLA
Sbjct: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVMYGKSLQQGNTCVNHTCNRLGVSKSQTCDGSLA 120
Query: 121 VNGFHDEIQDPSVHPWGGLTTTREGLLTLLGCYLYSKSFLGLQNVFDSARARERERELLY 180
VNGFHDEIQDPSVHPWGGLTTTREGLLTLLGCYLYSKSFLGLQNVFDSARARERERELLY
Sbjct: 121 VNGFHDEIQDPSVHPWGGLTTTREGLLTLLGCYLYSKSFLGLQNVFDSARARERERELLY 180
Query: 181 PDACGGGGRGWISQGTAGYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
PDACGGGGRGWISQG AGYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK
Sbjct: 181 PDACGGGGRGWISQGAAGYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
Query: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRMRREPCCTSWFCVADMAFHYEVS 300
EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRMRREPCCTSWFCVADMAFHYEVS
Sbjct: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRMRREPCCTSWFCVADMAFHYEVS 300
Query: 301 DDTIQADWHQTFADSVETYHYFEWAVGSGEGKSDILEFENVGMNGSVKMNGLDLGGLNSC 360
DDTIQADWHQTFADSVETYHYFEWAVGSGEGKSDILEFENVGMNGSVKMNGLDLGGLNSC
Sbjct: 301 DDTIQADWHQTFADSVETYHYFEWAVGSGEGKSDILEFENVGMNGSVKMNGLDLGGLNSC 360
Query: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLIVGDGFVTITRGENIRRFFEHAEEAEE 420
FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLIVGDGFVTITRGENIRRFFEHAEEAEE
Sbjct: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLIVGDGFVTITRGENIRRFFEHAEEAEE 420
Query: 421 EEEDDSMDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
EEEDDSMDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA
Sbjct: 421 EEEDDSMDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
Query: 481 HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEKERKERKRTKEREKKLR 540
HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEKERKERKRTKEREKKLR
Sbjct: 481 HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEKERKERKRTKEREKKLR 540
Query: 541 RKERLKGKEKDKDKISSESAEVCSHSDILEDLSPCVLEQNSISVDETCDASIPESSDTLD 600
RKERLKGKEKDKDKISSESAEVCSHSDILEDLSPCVLEQNSISV ETCDASIPESSDTLD
Sbjct: 541 RKERLKGKEKDKDKISSESAEVCSHSDILEDLSPCVLEQNSISVGETCDASIPESSDTLD 600
Query: 601 EQFLNESIVSEVQSSYDDGLAGKPTDGNDGNEPFMVDSSKFSRWRLKFPKEVQDHSSKWS 660
EQFLNESI+SEVQSSYDDGL GKPTDGNDGNE FMVDSSKFSRWRLKFPKEVQDHS KWS
Sbjct: 601 EQFLNESIISEVQSSYDDGLGGKPTDGNDGNESFMVDSSKFSRWRLKFPKEVQDHSFKWS 660
Query: 661 ERRRFSVSENGAGASRSEQRYYGDSLETPSRTMNGSNRKLRTNSLKAYGRHISKFNEKSH 720
ERRRFSVSENGAGASRSEQRYYGDSLETPSRTMNGSNRKLRTNSLKAYGRHISKFNEKSH
Sbjct: 661 ERRRFSVSENGAGASRSEQRYYGDSLETPSRTMNGSNRKLRTNSLKAYGRHISKFNEKSH 720
Query: 721 SSNNRVSYDYRSCICNQNNELNKKAEAFVSSVRVNRDVKSASTSESSFDMSKQCSHSSRY 780
SSNNRVSYDYRSCICNQNNELNKKAEAFVSSVRVNRDVKSASTSESSFDMSKQCSHSSRY
Sbjct: 721 SSNNRVSYDYRSCICNQNNELNKKAEAFVSSVRVNRDVKSASTSESSFDMSKQCSHSSRY 780
Query: 781 SYGDHSRDGGRLKNKNNSPGKDYVYSKKVWEPMESQKKYPRSNSDSNVAMKSSTCKFNVE 840
SYGDHSRDGGRLKNKNNSPGKDYVYSKKVWEPMESQKKYPRSNSDSNVAMKSSTCKFNVE
Sbjct: 781 SYGDHSRDGGRLKNKNNSPGKDYVYSKKVWEPMESQKKYPRSNSDSNVAMKSSTCKFNVE 840
Query: 841 PDFDLVKSGHDACRGEVAVTSSTVDQEESNSTESTSGVESDEVSQNGLEWKDHKNIEEDA 900
PDFD VKSGHDAC GEV VTSSTVDQEESNSTESTSGVESDEVSQNGLEWKDHK IE DA
Sbjct: 841 PDFDHVKSGHDACSGEVTVTSSTVDQEESNSTESTSGVESDEVSQNGLEWKDHKTIE-DA 900
Query: 901 CEVTKRSVNSTDTTLTSSGTNNRVGTRSLNSDSCSSCPSEGDSNNICLNHGNLESLSTSD 960
CEVTKRSVNSTDTTLTSSGTNNRVGTRSLNSDSCSSCPSEG SNNICLNHGNLESLSTSD
Sbjct: 901 CEVTKRSVNSTDTTLTSSGTNNRVGTRSLNSDSCSSCPSEGYSNNICLNHGNLESLSTSD 960
Query: 961 SEDASHHSEGKESSASIQNGFSEHRETRMDKVVEVDALGIRNHSGLSQEIEGCKVQGNAP 1020
SE++SHHSEGKESSASIQNGFSEH ETRMDKVVEVDALGIRNHSGLSQEIEGCKVQ NAP
Sbjct: 961 SEESSHHSEGKESSASIQNGFSEHHETRMDKVVEVDALGIRNHSGLSQEIEGCKVQANAP 1020
Query: 1021 NRVPRDFEAGFSAVSLDSSPCQVTLPPTQNQTQNIHFPVFQVSPAMGYYHQNSVSWPAAA 1080
NRVPRDFEAGFSAVSLDSSPCQVTLPPTQNQTQNIHFPVFQVSP MGYYHQNSVSWPAAA
Sbjct: 1021 NRVPRDFEAGFSAVSLDSSPCQVTLPPTQNQTQNIHFPVFQVSPTMGYYHQNSVSWPAAA 1080
Query: 1081 AHATNGIIPFSYSNPCLYANPLGYGLSDNPRFCMQYGHLHHLAAPVFNPSPVPIYQPASK 1140
AHA NGIIPFSYSN CLYANPLGYGLSDNPRFCMQY HLHHLA PV NPSPVPIYQPASK
Sbjct: 1081 AHA-NGIIPFSYSNHCLYANPLGYGLSDNPRFCMQYSHLHHLATPVINPSPVPIYQPASK 1140
Query: 1141 ANNGVYTEERSQVPIS----ESSDVVANPDIIGTTGLPYAISSPPGRDRKQNDTSIFPKD 1200
ANNGVYTEERSQVP S ESSD+VANPD+IGTTGLPYAI+SPPGRDRKQNDTSIFPKD
Sbjct: 1141 ANNGVYTEERSQVPKSCSMVESSDIVANPDVIGTTGLPYAINSPPGRDRKQNDTSIFPKD 1200
Query: 1201 SSSFSLFHFGGPVAFSTGGNLNPMPSKEDDIVGDFSRNNEAADVVDDVHAFNKKETAIEE 1260
SSSFSLFHFGGPVAFSTG NLNPMPSKEDDIVGDF RNNEAADVVDDVHAFNKKETAIEE
Sbjct: 1201 SSSFSLFHFGGPVAFSTG-NLNPMPSKEDDIVGDFLRNNEAADVVDDVHAFNKKETAIEE 1260
Query: 1261 YNLFAASNGMRFSFF 1271
YNLFAASNGMRFSFF
Sbjct: 1261 YNLFAASNGMRFSFF 1272
BLAST of Cp4.1LG19g00810 vs. ExPASy TrEMBL
Match:
A0A6J1EE83 (uncharacterized protein LOC111432432 OS=Cucurbita moschata OX=3662 GN=LOC111432432 PE=4 SV=1)
HSP 1 Score: 2425 bits (6286), Expect = 0.0
Identity = 1240/1275 (97.25%), Positives = 1249/1275 (97.96%), Query Frame = 0
Query: 1 MPGLTQKNDHLNGGSSAVYSLSANGFWSQHRDDVSYVQLQKFWSELLPQARQKLLRIDKQ 60
MPGLTQKNDHLNGGSSAVYSLSANGFWSQH DDVSYVQLQKFWSELLPQARQKLLRIDKQ
Sbjct: 1 MPGLTQKNDHLNGGSSAVYSLSANGFWSQHGDDVSYVQLQKFWSELLPQARQKLLRIDKQ 60
Query: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVMYGKSLQQGNTCVNHTCNRLGGSKSQTCDGSLA 120
TLFEQARKNMYCSRCNGLLLEGFLQIVMYGKSLQQGNTCVNHTCNRLG SKSQTCDGSLA
Sbjct: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVMYGKSLQQGNTCVNHTCNRLGVSKSQTCDGSLA 120
Query: 121 VNGFHDEIQDPSVHPWGGLTTTREGLLTLLGCYLYSKSFLGLQNVFDSARARERERELLY 180
VNGFHDEIQDPSVHPWGGLTTTREGLLTLLGCYLYSKSFLGLQNVFDSARARERERELLY
Sbjct: 121 VNGFHDEIQDPSVHPWGGLTTTREGLLTLLGCYLYSKSFLGLQNVFDSARARERERELLY 180
Query: 181 PDACGGGGRGWISQGTAGYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
PDACGGGGRGWISQGT GYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK
Sbjct: 181 PDACGGGGRGWISQGTVGYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
Query: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRMRREPCCTSWFCVADMAFHYEVS 300
EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRMRREPCCTSWFCVADMAFHYEVS
Sbjct: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRMRREPCCTSWFCVADMAFHYEVS 300
Query: 301 DDTIQADWHQTFADSVETYHYFEWAVGSGEGKSDILEFENVGMNGSVKMNGLDLGGLNSC 360
DDTIQADWHQTFADSVETYHYFEWAVGSGEGKSDILEFENVGMNGSVKMNGLDLGGLNSC
Sbjct: 301 DDTIQADWHQTFADSVETYHYFEWAVGSGEGKSDILEFENVGMNGSVKMNGLDLGGLNSC 360
Query: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLIVGDGFVTITRGENIRRFFEHAEEAEE 420
FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLIVGDGFVTITRGENIRRFFEHAEEAEE
Sbjct: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLIVGDGFVTITRGENIRRFFEHAEEAEE 420
Query: 421 EEEDDSMDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
EEEDDSMDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA
Sbjct: 421 EEEDDSMDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
Query: 481 HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEKERKERKRTKEREKKLR 540
HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEKERKERKRTKEREKKLR
Sbjct: 481 HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEKERKERKRTKEREKKLR 540
Query: 541 RKERLKGKEKDKDKISSESAEVCSHSDILEDLSPCVLEQNSISVDETCDASIPESSDTLD 600
RKERLKGKEKDKDKISSESAEVCSHSDILEDLSPCVLEQNSISVDETCDASIPESSDTLD
Sbjct: 541 RKERLKGKEKDKDKISSESAEVCSHSDILEDLSPCVLEQNSISVDETCDASIPESSDTLD 600
Query: 601 EQFLNESIVSEVQSSYDDGLAGKPTDGNDGNEPFMVDSSKFSRWRLKFPKEVQDHSSKWS 660
EQFL+ESI+SEVQSSYDDGLAGKPTDGNDGNE FMVDSSKFSRWRLKFPKEVQDHS KWS
Sbjct: 601 EQFLDESIISEVQSSYDDGLAGKPTDGNDGNESFMVDSSKFSRWRLKFPKEVQDHSFKWS 660
Query: 661 ERRRFSVSENGAGASRSEQRYYGDSLETPSRTMNGSNRKLRTNSLKAYGRHISKFNEKSH 720
ERRRFSVSENGAGASRSEQRYYGDSLE PSRTMNGSNRKLRTNSLKAYGRHISKFNEKSH
Sbjct: 661 ERRRFSVSENGAGASRSEQRYYGDSLENPSRTMNGSNRKLRTNSLKAYGRHISKFNEKSH 720
Query: 721 SSNNRVSYDYRSCICNQNNELNKKAEAFVSSVRVNRDVKSASTSESSFDMSKQCSHSSRY 780
SSNNRVSYDYRSC+CNQNNELNKKAEAFVSSVRVNRDVKSASTSESSFDMSKQCSHS+RY
Sbjct: 721 SSNNRVSYDYRSCVCNQNNELNKKAEAFVSSVRVNRDVKSASTSESSFDMSKQCSHSNRY 780
Query: 781 SYGDHSRDGGRLKNKNNSPGKDYVYSKKVWEPMESQKKYPRSNSDSNVAMKSSTCKFNVE 840
SYGDHSRDGGRLKNKNNSPGKDYVYSKKVWEPMESQKKYPRSNSDSNVAMKSSTCKFNVE
Sbjct: 781 SYGDHSRDGGRLKNKNNSPGKDYVYSKKVWEPMESQKKYPRSNSDSNVAMKSSTCKFNVE 840
Query: 841 PDFDLVKSGHDACRGEVAVTSSTVDQEESNSTESTSGVESDEVSQNGLEWKDHKNIEEDA 900
PDFDLVKSGHDAC GEVAVTSSTVDQEESNSTESTSGVESDEVSQNGLEWKDHKNIEEDA
Sbjct: 841 PDFDLVKSGHDACSGEVAVTSSTVDQEESNSTESTSGVESDEVSQNGLEWKDHKNIEEDA 900
Query: 901 CEVTKRSVNSTDTTLTSSGTNNRVGTRSLNSDSCSSCPSEGDSNNICLNHGNLESLSTSD 960
CEVTKRSVNST TTLTSSGTNNRVGTRSLNSDSCSSCPSEGDS+NICLNHGNLESLSTSD
Sbjct: 901 CEVTKRSVNSTGTTLTSSGTNNRVGTRSLNSDSCSSCPSEGDSSNICLNHGNLESLSTSD 960
Query: 961 SEDASHHSEGKESSASIQNGFSEHRETRMDKVVEVDALGIRNHSGLSQEIEGCKVQGNAP 1020
SEDASHHSEGKESSASIQNGFSEH ETRMDKVVEVDALGIRNHSGL QEIEGCKVQ NAP
Sbjct: 961 SEDASHHSEGKESSASIQNGFSEHHETRMDKVVEVDALGIRNHSGLLQEIEGCKVQVNAP 1020
Query: 1021 NRVPRDFEAGFSAVSLDSSPCQVTLPPTQNQTQNIHFPVFQVSPAMGYYHQNSVSWPAAA 1080
NRVPRDFEAGFSAVSLDSSPCQVTLPPTQNQTQNIHFPVFQVSPAMGYYHQNSVSWPAAA
Sbjct: 1021 NRVPRDFEAGFSAVSLDSSPCQVTLPPTQNQTQNIHFPVFQVSPAMGYYHQNSVSWPAAA 1080
Query: 1081 AHATNGIIPFSYSNPCLYANPLGYGLSDNPRFCMQYGHLHHLAAPVFNPSPVPIYQPASK 1140
AHA NGIIPFSYSN CLY NPLGYGLSDNPRFCMQY HLHHL PVFNPSPVPIYQPASK
Sbjct: 1081 AHA-NGIIPFSYSNHCLYTNPLGYGLSDNPRFCMQYSHLHHLTTPVFNPSPVPIYQPASK 1140
Query: 1141 ANNGVYTEERSQVP----ISESSDVVANPDIIGTTGLPYAISSPPGRDRKQNDTSIFPKD 1200
ANNGVYTEERS+VP + ESSDVVANPD+IGTTGLPYAISSPPGRDRKQNDTSI P D
Sbjct: 1141 ANNGVYTEERSEVPKTGSMVESSDVVANPDVIGTTGLPYAISSPPGRDRKQNDTSILPND 1200
Query: 1201 SSSFSLFHFGGPVAFSTGGNLNPMPSKEDDIVGDFSRNNEAADVVDDVHAFNKKETAIEE 1260
SSSFSLFHFGGPVAFSTGGNL PMPSKEDDIVGDFSRNNEA DVVD+VHAFNKKETAIEE
Sbjct: 1201 SSSFSLFHFGGPVAFSTGGNLKPMPSKEDDIVGDFSRNNEATDVVDNVHAFNKKETAIEE 1260
Query: 1261 YNLFAASNGMRFSFF 1271
YNLFAASNGMRFSFF
Sbjct: 1261 YNLFAASNGMRFSFF 1274
BLAST of Cp4.1LG19g00810 vs. ExPASy TrEMBL
Match:
A0A6J1HR26 (uncharacterized protein LOC111465909 OS=Cucurbita maxima OX=3661 GN=LOC111465909 PE=4 SV=1)
HSP 1 Score: 2419 bits (6269), Expect = 0.0
Identity = 1242/1275 (97.41%), Positives = 1248/1275 (97.88%), Query Frame = 0
Query: 1 MPGLTQKNDHLNGGSSAVYSLSANGFWSQHRDDVSYVQLQKFWSELLPQARQKLLRIDKQ 60
MPGLTQKNDHLNGGSSAVYSLSANGFWSQHRDDVSYVQLQKFWSELLPQARQKLLRIDKQ
Sbjct: 1 MPGLTQKNDHLNGGSSAVYSLSANGFWSQHRDDVSYVQLQKFWSELLPQARQKLLRIDKQ 60
Query: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVMYGKSLQQGNTCVNHTCNRLGGSKSQTCDGSLA 120
TLFEQARKNMYCSRCNGLLLEGFLQIVMYGKSLQQGNTCVNHTCNRLG SKSQTCDGSLA
Sbjct: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVMYGKSLQQGNTCVNHTCNRLGVSKSQTCDGSLA 120
Query: 121 VNGFHDEIQDPSVHPWGGLTTTREGLLTLLGCYLYSKSFLGLQNVFDSARARERERELLY 180
VNGFHDEIQDPSVHPWGGLTTTREGLLTLLGCYLYSKSFLGLQNVFDSARARERERELLY
Sbjct: 121 VNGFHDEIQDPSVHPWGGLTTTREGLLTLLGCYLYSKSFLGLQNVFDSARARERERELLY 180
Query: 181 PDACGGGGRGWISQGTAGYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
PDACGGGGRGWISQG AGYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK
Sbjct: 181 PDACGGGGRGWISQGAAGYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
Query: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRMRREPCCTSWFCVADMAFHYEVS 300
EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRMRREPCCTSWFCVADMAFHYEVS
Sbjct: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRMRREPCCTSWFCVADMAFHYEVS 300
Query: 301 DDTIQADWHQTFADSVETYHYFEWAVGSGEGKSDILEFENVGMNGSVKMNGLDLGGLNSC 360
DDTIQADWHQTFADSVETYHYFEWAVGSGEGKSDILEFENVGMNGSVKMNGLDLGGLNSC
Sbjct: 301 DDTIQADWHQTFADSVETYHYFEWAVGSGEGKSDILEFENVGMNGSVKMNGLDLGGLNSC 360
Query: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLIVGDGFVTITRGENIRRFFEHAEEAEE 420
FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLIVGDGFVTITRGENIRRFFEHAEEAEE
Sbjct: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLIVGDGFVTITRGENIRRFFEHAEEAEE 420
Query: 421 EEEDDSMDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
EEEDDSMDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA
Sbjct: 421 EEEDDSMDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
Query: 481 HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEKERKERKRTKEREKKLR 540
HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEKERKERKRTKEREKKLR
Sbjct: 481 HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEKERKERKRTKEREKKLR 540
Query: 541 RKERLKGKEKDKDKISSESAEVCSHSDILEDLSPCVLEQNSISVDETCDASIPESSDTLD 600
RKERLKGKEKDKDKISSESAEVCSHSDILEDLSPCVLEQNSISV ETCDASIPESSDTLD
Sbjct: 541 RKERLKGKEKDKDKISSESAEVCSHSDILEDLSPCVLEQNSISVGETCDASIPESSDTLD 600
Query: 601 EQFLNESIVSEVQSSYDDGLAGKPTDGNDGNEPFMVDSSKFSRWRLKFPKEVQDHSSKWS 660
EQFLNESI+SEVQSSYDDGL GKPTDGNDGNE FMVDSSKFSRWRLKFPKEVQDHS KWS
Sbjct: 601 EQFLNESIISEVQSSYDDGLGGKPTDGNDGNESFMVDSSKFSRWRLKFPKEVQDHSFKWS 660
Query: 661 ERRRFSVSENGAGASRSEQRYYGDSLETPSRTMNGSNRKLRTNSLKAYGRHISKFNEKSH 720
ERRRFSVSENGAGASRSEQRYYGDSLETPSRTMNGSNRKLRTNSLKAYGRHISKFNEKSH
Sbjct: 661 ERRRFSVSENGAGASRSEQRYYGDSLETPSRTMNGSNRKLRTNSLKAYGRHISKFNEKSH 720
Query: 721 SSNNRVSYDYRSCICNQNNELNKKAEAFVSSVRVNRDVKSASTSESSFDMSKQCSHSSRY 780
SSNNRVSYDYRSCICNQNNELNKKAEAFVSSVRVNRDVKSASTSESSFDMSKQCSHSSRY
Sbjct: 721 SSNNRVSYDYRSCICNQNNELNKKAEAFVSSVRVNRDVKSASTSESSFDMSKQCSHSSRY 780
Query: 781 SYGDHSRDGGRLKNKNNSPGKDYVYSKKVWEPMESQKKYPRSNSDSNVAMKSSTCKFNVE 840
SYGDHSRDGGRLKNKNNSPGKDYVYSKKVWEPMESQKKYPRSNSDSNVAMKSSTCKFNVE
Sbjct: 781 SYGDHSRDGGRLKNKNNSPGKDYVYSKKVWEPMESQKKYPRSNSDSNVAMKSSTCKFNVE 840
Query: 841 PDFDLVKSGHDACRGEVAVTSSTVDQEESNSTESTSGVESDEVSQNGLEWKDHKNIEEDA 900
PDFD VKSGHDAC GEV VTSSTVDQEESNSTESTSGVESDEVSQNGLEWKDHK IE DA
Sbjct: 841 PDFDHVKSGHDACSGEVTVTSSTVDQEESNSTESTSGVESDEVSQNGLEWKDHKTIE-DA 900
Query: 901 CEVTKRSVNSTDTTLTSSGTNNRVGTRSLNSDSCSSCPSEGDSNNICLNHGNLESLSTSD 960
CEVTKRSVNSTDTTLTSSGTNNRVGTRSLNSDSCSSCPSEG SNNICLNHGNLESLSTSD
Sbjct: 901 CEVTKRSVNSTDTTLTSSGTNNRVGTRSLNSDSCSSCPSEGYSNNICLNHGNLESLSTSD 960
Query: 961 SEDASHHSEGKESSASIQNGFSEHRETRMDKVVEVDALGIRNHSGLSQEIEGCKVQGNAP 1020
SE++SHHSEGKESSASIQNGFSEH ETRMDKVVEVDALGIRNHSGLSQEIEGCKVQ NAP
Sbjct: 961 SEESSHHSEGKESSASIQNGFSEHHETRMDKVVEVDALGIRNHSGLSQEIEGCKVQANAP 1020
Query: 1021 NRVPRDFEAGFSAVSLDSSPCQVTLPPTQNQTQNIHFPVFQVSPAMGYYHQNSVSWPAAA 1080
NRVPRDFEAGFSAVSLDSSPCQVTLPPTQNQTQNIHFPVFQVSP MGYYHQNSVSWPAAA
Sbjct: 1021 NRVPRDFEAGFSAVSLDSSPCQVTLPPTQNQTQNIHFPVFQVSPTMGYYHQNSVSWPAAA 1080
Query: 1081 AHATNGIIPFSYSNPCLYANPLGYGLSDNPRFCMQYGHLHHLAAPVFNPSPVPIYQPASK 1140
AHA NGIIPFSYSN CLYANPLGYGLSDNPRFCMQY HLHHLA PV NPSPVPIYQPASK
Sbjct: 1081 AHA-NGIIPFSYSNHCLYANPLGYGLSDNPRFCMQYSHLHHLATPVINPSPVPIYQPASK 1140
Query: 1141 ANNGVYTEERSQVPIS----ESSDVVANPDIIGTTGLPYAISSPPGRDRKQNDTSIFPKD 1200
ANNGVYTEERSQVP S ESSD+VANPD+IGTTGLPYAI+SPPGRDRKQNDTSIFPKD
Sbjct: 1141 ANNGVYTEERSQVPKSCSMVESSDIVANPDVIGTTGLPYAINSPPGRDRKQNDTSIFPKD 1200
Query: 1201 SSSFSLFHFGGPVAFSTGGNLNPMPSKEDDIVGDFSRNNEAADVVDDVHAFNKKETAIEE 1260
SSSFSLFHFGGPVAFSTG NLNPMPSKEDDIVGDF RNNEAADVVDDVHAFNKKETAIEE
Sbjct: 1201 SSSFSLFHFGGPVAFSTG-NLNPMPSKEDDIVGDFLRNNEAADVVDDVHAFNKKETAIEE 1260
Query: 1261 YNLFAASNGMRFSFF 1271
YNLFAASNGMRFSFF
Sbjct: 1261 YNLFAASNGMRFSFF 1272
BLAST of Cp4.1LG19g00810 vs. ExPASy TrEMBL
Match:
A0A6J1DQ45 (uncharacterized protein LOC111022059 OS=Momordica charantia OX=3673 GN=LOC111022059 PE=4 SV=1)
HSP 1 Score: 2071 bits (5366), Expect = 0.0
Identity = 1092/1285 (84.98%), Positives = 1150/1285 (89.49%), Query Frame = 0
Query: 1 MPGLTQKNDHLNGGSSAVYSLSANGFWSQHRDDVSYVQLQKFWSELLPQARQKLLRIDKQ 60
MPGLTQKND LNGGSSA+YSLS NGFWSQ RDDVSY QLQKFWSEL P RQKLLRIDKQ
Sbjct: 1 MPGLTQKNDQLNGGSSAIYSLSPNGFWSQQRDDVSYNQLQKFWSELPPHTRQKLLRIDKQ 60
Query: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVMYGKSLQQGNTCVNHTCNRLGGSKSQTCDGSLA 120
TLFEQARKNMYCSRCNGLLLEGFLQIV+YGKSLQQG TCVNH+CNRLG SK+ TCDGSL+
Sbjct: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCVNHSCNRLGVSKNHTCDGSLS 120
Query: 121 VNGFHDEIQDPSVHPWGGLTTTREGLLTLLGCYLYSKSFLGLQNVFDSARARERERELLY 180
VNGF DEIQDPSVHPWGGLTTTR+GLLTLL CYL SKSFLGLQNVFDSARARERERELLY
Sbjct: 121 VNGFQDEIQDPSVHPWGGLTTTRDGLLTLLDCYLCSKSFLGLQNVFDSARARERERELLY 180
Query: 181 PDACGGGGRGWISQGTAGYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
PDACGGGGRGWISQGTAG+GRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK
Sbjct: 181 PDACGGGGRGWISQGTAGFGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
Query: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRMRREPCCTSWFCVADMAFHYEVS 300
EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKR+RREPCCTSWFCVADMAFHYEVS
Sbjct: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFHYEVS 300
Query: 301 DDTIQADWHQTFADSVETYHYFEWAVGSGEGKSDILEFENVGMNGSVKMNGLDLGGLNSC 360
DDT+QADW QTFADSVETYHYFEWAVG+GEGKSDILEFENVGMNGSVKMNGLDLGGLNSC
Sbjct: 301 DDTVQADWRQTFADSVETYHYFEWAVGTGEGKSDILEFENVGMNGSVKMNGLDLGGLNSC 360
Query: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLIVGDGFVTITRGENIRRFFEHAEEAEE 420
FITLRAWKLDGRCTELSVKAHALKGQQCVHRRL VGDGFVTITRGENIRRFFEHAEEAEE
Sbjct: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLSVGDGFVTITRGENIRRFFEHAEEAEE 420
Query: 421 EEEDDSMDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
EEEDDSMDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA
Sbjct: 421 EEEDDSMDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
Query: 481 HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEKERKERKRTKEREKKLR 540
HSIFVCLALKLLEER+HIACKEIITLEKQMKLLEEEEKEKREEKERKERKRTKEREKKLR
Sbjct: 481 HSIFVCLALKLLEERIHIACKEIITLEKQMKLLEEEEKEKREEKERKERKRTKEREKKLR 540
Query: 541 RKERLKGKEKDKDKISSESAEVCSHSDILEDLSPCVLEQNSISVDETCDASIPESSDTLD 600
RKERLKGKEKDKDK SESAEVC+HSD+LEDLSPCVLE NS SV + CDAS+PESSD LD
Sbjct: 541 RKERLKGKEKDKDKTCSESAEVCAHSDVLEDLSPCVLEPNSDSVGDACDASMPESSDMLD 600
Query: 601 EQFLNESIVSEVQSSYDDGLAGKPTDGNDGNEPFMVDSSKFSRWRLKFPKEVQDHSSKWS 660
EQFL+ESI+SEVQ+SYDD GKPTDGNDGNE F+VD SKFSRWRLKFPKEVQD S KWS
Sbjct: 601 EQFLDESIISEVQNSYDDSFDGKPTDGNDGNESFIVDQSKFSRWRLKFPKEVQDQSFKWS 660
Query: 661 ERRRFSV-SENGAGASRSEQRYYGDSLETPSRTMNGSNRKLRTNSLKAYGRHISKFNEKS 720
ERRRF+V SENGA +RSEQRYYGDSLE PSR+MNG+NRKLR+NS+KAYGRH SKFNEK
Sbjct: 661 ERRRFTVVSENGALVNRSEQRYYGDSLENPSRSMNGTNRKLRSNSIKAYGRHGSKFNEKL 720
Query: 721 HSSNNRVSYDYRSCICNQNNELNKKAEAFVSSVRVNRDVKSASTSESSFDMSKQCSHSSR 780
HSSNNRVS DYRSCIC+QNNE NKK E FVSSVRVNRD KS S SESSFDMSKQ S++
Sbjct: 721 HSSNNRVSXDYRSCICSQNNEFNKKVEXFVSSVRVNRDAKSVSKSESSFDMSKQSYRSNK 780
Query: 781 YSYGDHSRDGGRLKNK----NNSPGKDYVYSKKVWEPMESQKKYPRSNSDSNVAMKSSTC 840
Y YGD SRD GRLKNK NNSPGKD+VYSKKVWEPMESQKKYPRSNSD NVAMKSST
Sbjct: 781 YGYGDQSRDSGRLKNKAALSNNSPGKDFVYSKKVWEPMESQKKYPRSNSDPNVAMKSSTF 840
Query: 841 KFNVEPDFDLVKSGHDACRGEVAVTSSTVDQEESNSTESTSGVESDEVSQNGL--EWKDH 900
KF VEPD+DL KS HD C GEV+V S VDQEESNSTESTSG+ESDEV QNGL E KDH
Sbjct: 841 KFGVEPDYDLAKSRHDVCSGEVSVASGKVDQEESNSTESTSGIESDEVFQNGLPTEPKDH 900
Query: 901 KNIEEDACE-VTKRSVNST-DTTLTSSGTNNRVGTRSLNSDSCSSCPSEGDSNNICLNHG 960
KN+EEDACE T+ S+NST ++TL SSG NN VGT SL+SD+CSSC SEGDSN IC NHG
Sbjct: 901 KNVEEDACEEATQCSINSTINSTLRSSGKNNHVGTSSLSSDNCSSCLSEGDSNXICSNHG 960
Query: 961 NLESLSTSDSEDASHHS-EGKESSASIQNGFSEHRETRMDKVVEVDALGIRNHSGLSQEI 1020
NLES STSDSEDASH S EGKESSASIQNGFSE E RMDKV +++G R H GL Q+
Sbjct: 961 NLESSSTSDSEDASHQSSEGKESSASIQNGFSERHEIRMDKVNGGESMGTRIHFGLPQDN 1020
Query: 1021 EGCKVQGNAPNRVPRDFEAGFSAVSLDSSPCQVTLPPTQNQTQNIHFPVFQVSPAMGYYH 1080
EGCKV GNAP VP +FEAGFSAVSLDS PCQVTLP QNQ NIHFPVFQV P+MGYYH
Sbjct: 1021 EGCKVLGNAPMNVPHNFEAGFSAVSLDS-PCQVTLPSIQNQ--NIHFPVFQVPPSMGYYH 1080
Query: 1081 QNSVSWPAAAAHATNGIIPFSYSNPCLYANPLGYGLSDNPRFCMQYGHLHHLAAPVFNPS 1140
QNSVSWPAA A NG++PFSYSN CLYANPLGYGL NPRFCMQYGHLHHLA PVFNPS
Sbjct: 1081 QNSVSWPAAHA---NGMMPFSYSNHCLYANPLGYGLDGNPRFCMQYGHLHHLATPVFNPS 1140
Query: 1141 PVPIYQPASKANNGVYTEERSQVP----ISESSDVVANPDIIGTTGLPYAISSPPGRDRK 1200
PVPIYQPA+KA+NG+Y E+RSQV I+ESSDV ANPD++ T GLPYA+ SPP D K
Sbjct: 1141 PVPIYQPAAKASNGIYVEDRSQVSKAGAIAESSDV-ANPDVVVTAGLPYALGSPPSGDCK 1200
Query: 1201 QNDTSIFPKDSSSFSLFHFGGPVAFSTGGNLNPMPSKEDDIVGDFSRNNEAADVVDDVHA 1260
QNDTS K SSSFSLFHFGGPVA STGG LN MPSKEDD G F RN+EA DVVD+ HA
Sbjct: 1201 QNDTSKLQKGSSSFSLFHFGGPVALSTGGKLNLMPSKEDD-TGVFPRNSEA-DVVDNGHA 1260
Query: 1261 FNKKETAIEEYNLFAASNGMRFSFF 1271
FNKK+TAIEEYNLFAASNGMRFSFF
Sbjct: 1261 FNKKDTAIEEYNLFAASNGMRFSFF 1276
BLAST of Cp4.1LG19g00810 vs. ExPASy TrEMBL
Match:
A0A1S3B599 (uncharacterized protein LOC103486163 OS=Cucumis melo OX=3656 GN=LOC103486163 PE=4 SV=1)
HSP 1 Score: 2042 bits (5290), Expect = 0.0
Identity = 1081/1290 (83.80%), Positives = 1149/1290 (89.07%), Query Frame = 0
Query: 1 MPGLTQKNDHLNGGSSAVYSLSANGFWSQHRDDVSYVQLQKFWSELLPQARQKLLRIDKQ 60
MPGLTQKNDHLNGGSSA+YSLSA+GFWSQHRDDVSY QLQKFWS+LLPQARQKLLRIDKQ
Sbjct: 1 MPGLTQKNDHLNGGSSAIYSLSAHGFWSQHRDDVSYNQLQKFWSDLLPQARQKLLRIDKQ 60
Query: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVMYGKSLQQGNTCVNHTCNRLGGSKSQTCDGSLA 120
TLFEQARKNMYCSRCNGLLLEGFLQIV+YGKSLQQG TCVNH+CNRLG SK+Q CDGSL+
Sbjct: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVIYGKSLQQGKTCVNHSCNRLGVSKNQACDGSLS 120
Query: 121 VNGFHDEIQDPSVHPWGGLTTTREGLLTLLGCYLYSKSFLGLQNVFDSARARERERELLY 180
VNGF DEIQDPSVHPWGGLTTTR+G+LTLL CYL+SKSFLGLQNVFDSARARERERELLY
Sbjct: 121 VNGFQDEIQDPSVHPWGGLTTTRDGVLTLLDCYLHSKSFLGLQNVFDSARARERERELLY 180
Query: 181 PDACGGGGRGWISQGTAGYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
PDACGGGGRGWISQGTA YGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK
Sbjct: 181 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
Query: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRMRREPCCTSWFCVADMAFHYEVS 300
EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKR+RREPCCTSWFCVADMAF+YEVS
Sbjct: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 300
Query: 301 DDTIQADWHQTFADSVETYHYFEWAVGSGEGKSDILEFENVGMNGSVKMNGLDLGGLNSC 360
DDTIQADWHQTFADSVETYHYFEW+VG+GEGKSDILEFENVGMNGSVK+NGLDLGGLNSC
Sbjct: 301 DDTIQADWHQTFADSVETYHYFEWSVGTGEGKSDILEFENVGMNGSVKINGLDLGGLNSC 360
Query: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLIVGDGFVTITRGENIRRFFEHAEEAEE 420
FITLRAWKLDGRCTELSVKAHALKGQQCVHRRL VGDGFVTITRGENIRRFFEHAEEAEE
Sbjct: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGENIRRFFEHAEEAEE 420
Query: 421 EEEDDSMDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
EEEDDS+DKD+NDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA
Sbjct: 421 EEEDDSIDKDSNDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
Query: 481 HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEKERKERKRTKEREKKLR 540
HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREE+ERKERKRTKEREKKLR
Sbjct: 481 HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEQERKERKRTKEREKKLR 540
Query: 541 RKERLKGKEKDKDKISSESAEVCSHSDILEDLSPCVLEQNSISVDETCDASIPESSDTLD 600
RKERLKGK DKDK+SSESAEVC+ SD+LEDLSPCVLE S +V E CD S+PESSD LD
Sbjct: 541 RKERLKGK--DKDKLSSESAEVCARSDVLEDLSPCVLEPTSNAVGEVCDTSVPESSDILD 600
Query: 601 EQFLNESIVSEVQSSYDDGLAGKPTDGNDGNEPFMVDSSKFSRWRLKFPKEVQDHSSKWS 660
E FLNESI+SE Q+S+DD L GK TDGNDGNE F+ D SK SRWRLKFPKEVQDH KWS
Sbjct: 601 ELFLNESIISEGQNSFDDSLDGKFTDGNDGNESFISDQSKVSRWRLKFPKEVQDHPFKWS 660
Query: 661 ERRRFSV-SENGAGASRSEQRYYGDSLETPSRTMNGSNRKLRTNSLKAYGRHISKFNEKS 720
ERRRF V SENG ++SEQRY+ DS E PSR+MNGSNRKLRTNSLKAYGRH+SKFNEK
Sbjct: 661 ERRRFMVVSENGMLVNKSEQRYHPDSSENPSRSMNGSNRKLRTNSLKAYGRHVSKFNEKL 720
Query: 721 HSSNNRVSYDYRSCICNQNNELNKKAEAFVSSVRVNRDVKSASTSESSFDMSKQCSHSSR 780
HSSNNRVSYDYRSCICNQ NE NKKAE FVSSVRVNRDVKS S SESSFDMSKQ S++
Sbjct: 721 HSSNNRVSYDYRSCICNQTNEFNKKAEPFVSSVRVNRDVKSVSKSESSFDMSKQSYRSNK 780
Query: 781 YSYGDHSRDGGRLKNK----NNSPGKDYVYSKKVWEPMESQKKYPRSNSDSNVAMKSSTC 840
YSYGDHSRD GRLK K NNSPGKD+VYSKKVWEPMESQKKYPRSNSDSNVA+KSST
Sbjct: 781 YSYGDHSRDNGRLKTKAALLNNSPGKDFVYSKKVWEPMESQKKYPRSNSDSNVALKSSTF 840
Query: 841 KFNVEPDFDLVKSGHDA-------CRGEVAVTSSTVDQEESNSTESTSGVESDEVSQN-- 900
KF+ EPD+D+VKS C GEV+VTS VDQEESNSTESTSG+ESD+VSQN
Sbjct: 841 KFDAEPDYDVVKSRDGVVKSRDGFCSGEVSVTSGAVDQEESNSTESTSGIESDDVSQNEN 900
Query: 901 GLEWKDHKNIEEDACEVTKRSVNST-DTTLTSSGTNNRVGTRSLNSDSCSSCPSEGDSNN 960
+E KDHKN+EED CEV + S NS DTTLTSSGT+N+VGT SLNSD+CSSC SEGDSN
Sbjct: 901 SIESKDHKNVEEDVCEVKQCSANSAIDTTLTSSGTSNQVGTSSLNSDNCSSCLSEGDSNT 960
Query: 961 ICLNHGNLESLSTSDSEDASHHSEGKESSASIQNGFSEHRETRMDKVVEVDALGIRNHSG 1020
I NHGNLES STSDSE ASH SEGKESSASIQNGFSEH E R+DK + +A G R++SG
Sbjct: 961 IGSNHGNLESSSTSDSEYASHQSEGKESSASIQNGFSEHHEIRIDKGIGGEARGSRSYSG 1020
Query: 1021 LSQEIEGCKVQGNAPNRVPRDFEAGFSAVSLDSSPCQVTLPPTQNQTQNIHFPVFQVSPA 1080
L Q+ EGC VQ NAP VP +FEAGFSAVSLDS PCQVTLP QNQ NIHFPVFQV P+
Sbjct: 1021 LPQDNEGCNVQVNAPKNVPHNFEAGFSAVSLDS-PCQVTLPSIQNQ--NIHFPVFQVPPS 1080
Query: 1081 MGYYHQNSVSWPAAAAHATNGIIPFSYSNPCLYANPLGYGLSDNPRFCMQYGHLHHLAAP 1140
M YYHQNSVSWPAAA HA NGI+PFSYSN CLYANPLGYGL+ NPRFCMQYGHLHHL+ P
Sbjct: 1081 MNYYHQNSVSWPAAA-HA-NGIMPFSYSNHCLYANPLGYGLNGNPRFCMQYGHLHHLSNP 1140
Query: 1141 VFNPSPVPIYQPASKANNGVYTEERSQVP----ISESSDVVANPDIIGTTGLPYAISSPP 1200
VFNPSPVPIY PASKA+NG+Y E+R+QV ISESS VAN D+ TTG YA+SSPP
Sbjct: 1141 VFNPSPVPIYHPASKASNGIYAEDRTQVSKSGAISESS--VANSDVAVTTGHQYALSSPP 1200
Query: 1201 GRDRKQNDTSIFPKDSSSFSLFHFGGPVAFSTGGNLNPMPSKEDDIVGDFSRNNEAADVV 1260
D KQNDTS +DSSSFSLFHFGGPVA STGG LN PSKEDD VGDFSRNNE +VV
Sbjct: 1201 SGDLKQNDTSKLQQDSSSFSLFHFGGPVALSTGGKLNLTPSKEDD-VGDFSRNNEV-EVV 1260
Query: 1261 DDVHAFNKKETAIEEYNLFAASNGMRFSFF 1271
D+ HAFN KETAIEEYNLFAASNGMRFSFF
Sbjct: 1261 DNGHAFNMKETAIEEYNLFAASNGMRFSFF 1279
BLAST of Cp4.1LG19g00810 vs. ExPASy TrEMBL
Match:
A0A6J1HVV4 (uncharacterized protein LOC111467149 OS=Cucurbita maxima OX=3661 GN=LOC111467149 PE=4 SV=1)
HSP 1 Score: 2033 bits (5267), Expect = 0.0
Identity = 1079/1293 (83.45%), Positives = 1141/1293 (88.24%), Query Frame = 0
Query: 1 MPGLTQKNDHLNGGSSAVYSLSANGFWSQHRDDVSYVQLQKFWSELLPQARQKLLRIDKQ 60
MPGLTQKN HLN GSSA+YSLSANGFWSQHRDDVSY QLQKFW ELLPQARQKLLRIDKQ
Sbjct: 1 MPGLTQKNYHLNCGSSAIYSLSANGFWSQHRDDVSYNQLQKFWIELLPQARQKLLRIDKQ 60
Query: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVMYGKSLQQGNTCVNHTCNRLGGSKSQTCDGSLA 120
TLFEQARKNMYCSRCNGLLLEGFLQIV+YGKSLQQG T VNH CNRLG SK+Q DG+L
Sbjct: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVLYGKSLQQGKTRVNHACNRLGVSKNQAGDGALT 120
Query: 121 VNGFHDEIQDPSVHPWGGLTTTREGLLTLLGCYLYSKSFLGLQNVFDSARARERERELLY 180
VNGF DEIQDPSVHPWGGLTTTR+GLLTLL CYL SKSFL LQNVFDSARARERERELLY
Sbjct: 121 VNGFEDEIQDPSVHPWGGLTTTRDGLLTLLDCYLCSKSFLDLQNVFDSARARERERELLY 180
Query: 181 PDACGGGGRGWISQGTAGYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
PDACGGGGRGWISQGTA YGRGHGTRETCALHTARLSCDTLVDFWSALGEETR SLLRMK
Sbjct: 181 PDACGGGGRGWISQGTASYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRLSLLRMK 240
Query: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRMRREPCCTSWFCVADMAFHYEVS 300
EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKR+RREPCCTSWFCVADMAF+YEVS
Sbjct: 241 EEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRIRREPCCTSWFCVADMAFNYEVS 300
Query: 301 DDTIQADWHQTFADSVETYHYFEWAVGSGEGKSDILEFENVGMNGSVKMNGLDLGGLNSC 360
DDTIQADWH TFADSVETYHYFEWAVG+GEGKSDILEFENVGMNGSVKMNGLDLGGLNSC
Sbjct: 301 DDTIQADWHHTFADSVETYHYFEWAVGTGEGKSDILEFENVGMNGSVKMNGLDLGGLNSC 360
Query: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLIVGDGFVTITRGENIRRFFEHAEEAEE 420
FITLRAWKLDGRCTELSVKAHALKGQQCVHRRL VGDGFVTITRGE+IRRFFEHAEEAEE
Sbjct: 361 FITLRAWKLDGRCTELSVKAHALKGQQCVHRRLTVGDGFVTITRGESIRRFFEHAEEAEE 420
Query: 421 EEEDDSMDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
EEEDDSMDKDAN LDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA
Sbjct: 421 EEEDDSMDKDANGLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQNA 480
Query: 481 HSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEKERKERKRTKEREKKLR 540
HSIFVCLALKLLEERVHIACKEIITLEKQ+KLLEEEEKEKREEKERKERKRTKEREKKLR
Sbjct: 481 HSIFVCLALKLLEERVHIACKEIITLEKQVKLLEEEEKEKREEKERKERKRTKEREKKLR 540
Query: 541 RKERLKGKEKDKDKISSESAEVCSHSDILEDLSPCVLEQNSISVDETCDASIPESSDTLD 600
RKERLKGKEKDKDKISSESAE C+HSD+LEDLSPC LE NS +V E CDAS+PESSDT +
Sbjct: 541 RKERLKGKEKDKDKISSESAEACAHSDVLEDLSPCDLEPNSDAVGEVCDASVPESSDTFN 600
Query: 601 EQFLNESIVSEVQSSYDDGLAGK---------PTDGNDGNEPFMVDSSKFSRWRLKFPKE 660
E FLN+SI+SE Q+SYDD GK DGNDGNE F+ D SK SRWRLKFPKE
Sbjct: 601 ELFLNQSIISEGQNSYDDSFDGKLGDGNDGNDGNDGNDGNESFIGDQSKVSRWRLKFPKE 660
Query: 661 VQDHSSKWSERRRFSVSENGAGASRSEQRYYGDSLETPSRTMNGSNRKLRTNSLKAYGRH 720
VQDHS KWSERRR VSENGA A+RSEQRYY DSLE PSR+MN SNRKLRTNSLKAYGRH
Sbjct: 661 VQDHSFKWSERRRSMVSENGALANRSEQRYYADSLENPSRSMNASNRKLRTNSLKAYGRH 720
Query: 721 ISKFNEKSHSSNNRVSYDYRSCICNQNNELNKKAEAFVSSVRVNRDVKSASTSESSFDMS 780
+SKFNEK HSSNN VSYDYRSC+CNQNNE NKKAE FVSSVRVNRD KSAS SES FDMS
Sbjct: 721 VSKFNEKMHSSNNWVSYDYRSCVCNQNNEFNKKAEPFVSSVRVNRDAKSASKSESLFDMS 780
Query: 781 KQCSHSSRYSYGDHSRDGGRLKNK----NNSPGKDYVYSKKVWEPMESQKKYPRSNSDSN 840
KQ +++SYGD+SRD GRLKNK NNSPGKD+VYSKKVWEPMESQKKYPRSNSDSN
Sbjct: 781 KQSYRPNKFSYGDYSRDSGRLKNKAALLNNSPGKDFVYSKKVWEPMESQKKYPRSNSDSN 840
Query: 841 VAMKSSTCKFNVEPDFDLVKSGHDACRGEVAVTSSTVDQEESNSTESTSGVESDEVSQNG 900
VA+KSST KF VEPD++LVKS H+ C GEV+V S TVDQEESNSTESTS +ESDEV QNG
Sbjct: 841 VALKSSTFKFGVEPDYELVKSRHECCSGEVSVASGTVDQEESNSTESTSVIESDEVFQNG 900
Query: 901 L--EWKDHKNIEEDACE-VTKRSVNST-DTTLTSSGTNNRVGTRSLNSDSCSSCPSEGDS 960
L E KDHKN+E+DACE VT SVN T D +TSSGT+N+ GT SLNSD+CSSCPSEGDS
Sbjct: 901 LPIESKDHKNVEDDACEEVTPCSVNLTVDMKMTSSGTSNQAGTSSLNSDNCSSCPSEGDS 960
Query: 961 NNICLNHGNLESLSTSDSEDASHHSEGKESSASIQNGFSEHRETRMDKVVEVDALGIRNH 1020
N IC NHGNLES STSDSE ASH SEGKESSASIQ GFSEH E RMDK + DALG N
Sbjct: 961 NTICSNHGNLESSSTSDSEYASHQSEGKESSASIQYGFSEHHEIRMDKAIGGDALGSTNS 1020
Query: 1021 SGLSQEIEGCKVQGNAPNRVPRDFEAGFSAVSLDSSPCQVTLPPTQNQTQNIHFPVFQVS 1080
SGLSQ+ EGCKVQGNAP VP++FEAGFSAV+LDS PC VTLP QNQ N+HFPVFQV
Sbjct: 1021 SGLSQDNEGCKVQGNAPKNVPQNFEAGFSAVNLDS-PCHVTLPSVQNQ--NVHFPVFQVP 1080
Query: 1081 PAMGYYHQNSVSWPAAAAHATNGIIPFSYSNPCLYANPLGYGLSDNPRFCMQYGHLHHLA 1140
P+MGYYHQNSVSWPAA HA NGI+PFSYSN CLYANPLGYGL+ NPRFCM+YGHLHHLA
Sbjct: 1081 PSMGYYHQNSVSWPAAV-HA-NGIMPFSYSNHCLYANPLGYGLNGNPRFCMRYGHLHHLA 1140
Query: 1141 APVFNPSPVPIYQPASKANNGVYTEERSQVP----ISESSDVVANPDIIGTTGLPYAISS 1200
PVFNPSPVPIYQPA+KA+NG++ E+R+QV I+ESS VANPD++ TTGLPYA+SS
Sbjct: 1141 NPVFNPSPVPIYQPAAKASNGIFVEDRTQVSKSGAITESS--VANPDVVVTTGLPYALSS 1200
Query: 1201 PPGRDRKQNDTSI-FPKDSSSFSLFHFGGPVAFSTGGNLNPMPSKEDDIVGDFSRNNEAA 1260
PP D KQNDTS KDSSSFSLFHFGGPVA STGG LNPMPSKED NNE
Sbjct: 1201 PPSGDCKQNDTSSKLQKDSSSFSLFHFGGPVALSTGGKLNPMPSKED--------NNEV- 1260
Query: 1261 DVVDDVHAFNKKETAIEEYNLFAASNGMRFSFF 1271
+VV + H FNKKETAIEEYNLFAASNGMRFSFF
Sbjct: 1261 EVVGNGHGFNKKETAIEEYNLFAASNGMRFSFF 1277
BLAST of Cp4.1LG19g00810 vs. TAIR 10
Match:
AT3G58050.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G41960.1); Has 13384 Blast hits to 8116 proteins in 546 species: Archae - 41; Bacteria - 766; Metazoa - 5596; Fungi - 1431; Plants - 589; Viruses - 46; Other Eukaryotes - 4915 (source: NCBI BLink). )
HSP 1 Score: 1104.0 bits (2854), Expect = 0.0e+00
Identity = 691/1325 (52.15%), Positives = 842/1325 (63.55%), Query Frame = 0
Query: 1 MPGLTQKNDHLNGGSSAVYSLSANGFWSQHRDDVSYVQLQKFWSELLPQARQKLLRIDKQ 60
MPGL Q+N+ YS GFWS+ D VSY QLQKFWSEL P+ARQ+LL+IDKQ
Sbjct: 1 MPGLAQRNND-------QYSF---GFWSKEIDGVSYNQLQKFWSELSPKARQELLKIDKQ 60
Query: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVMYGKSLQQGNTCVNHTCNRLGGSKSQTCDGSLA 120
TLFEQARKNMYCSRCNGLLLEGFLQIVM+GKSL + N CN+ GGSK Q ++
Sbjct: 61 TLFEQARKNMYCSRCNGLLLEGFLQIVMHGKSLHPEGSLGNSPCNKSGGSKYQYDCNAVV 120
Query: 121 VNGFHDEIQDPSVHPWGGLTTTREGLLTLLGCYLYSKSFLGLQNVFDSARARERERELLY 180
NG DE+QDPSVHPWGGLTTTR+G LTLL CYLY+KS GLQNVFDSA ARERERELLY
Sbjct: 121 SNGCADEMQDPSVHPWGGLTTTRDGSLTLLDCYLYAKSLKGLQNVFDSAPARERERELLY 180
Query: 181 PDACGGGGRGWISQGTAGYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRMK 240
PDACGGGGRGWISQG A +GRGHGTRETCALHTARLSCDTLVDFWSAL E+TRQSLLRMK
Sbjct: 181 PDACGGGGRGWISQGIASFGRGHGTRETCALHTARLSCDTLVDFWSALSEDTRQSLLRMK 240
Query: 241 EEDFIERLMYR-----------------------------FDSKRFCRDCRRNVIREFKE 300
EEDF+ERL YR FDSKRFCRDCRRNVIREFKE
Sbjct: 241 EEDFMERLRYRICYHSSYHILNCKMNRHFVVWTIQDVLTKFDSKRFCRDCRRNVIREFKE 300
Query: 301 LKELKRMRREPCCTSWFCVADMAFHYEVSDDTIQADWHQTFADSVETYHYFEWAVGSGEG 360
LKELKRMRREP CT+WFCVA+ F YEVS D+++ADW +TF+++ YH+FEWA+GSGEG
Sbjct: 301 LKELKRMRREPRCTTWFCVANTTFQYEVSIDSVKADWRETFSENAGKYHHFEWAIGSGEG 360
Query: 361 KSDILEFENVGMNGSVKMNGLDLGGLNSCFITLRAWKLDGRCTELSVKAHALKGQQCVHR 420
K DIL+FENVGMNG V++NGL+L GLNSC+ITLRA+KLDGR +E+S KAHALKGQ CVH
Sbjct: 361 KCDILKFENVGMNGRVQVNGLNLRGLNSCYITLRAYKLDGRWSEVSAKAHALKGQNCVHG 420
Query: 421 RLIVGDGFVTITRGENIRRFFEHAEEAEEEEEDDSMDKDANDLDGDCSRPQKHAKSPELA 480
RL+VGDGFV+I RGE+IRRFFEHAEEAEEEE++D MDKD N+LDG+CSRPQKHAKSPELA
Sbjct: 421 RLVVGDGFVSIKRGESIRRFFEHAEEAEEEEDEDMMDKDGNELDGECSRPQKHAKSPELA 480
Query: 481 REFLLDAATVIFKEQVEKAFREGTARQNAHSIFVCLALKLLEERVHIACKEIITLEKQMK 540
REFLLDAATVIFKEQVEKAFREGTARQNAHSIFVCL LKLLE+ +H+ACKEIITLEKQ+K
Sbjct: 481 REFLLDAATVIFKEQVEKAFREGTARQNAHSIFVCLTLKLLEQHLHVACKEIITLEKQVK 540
Query: 541 LLEEEEKEKREEKERKERKRTKEREKKLRRKERLKGKEKDKDKISSESAEVCSHSDIL-- 600
LLEEEEKEKREE+ERKE+KR+KEREKKLR+KERLK K+K K+K + E CS D+L
Sbjct: 541 LLEEEEKEKREEEERKEKKRSKEREKKLRKKERLKEKDKGKEKKNPE----CSDKDMLLN 600
Query: 601 -----EDLSPCVLE-QNSISVDET------CDASIPESSDTLDEQFLNESIVSEVQSSYD 660
EDL E N+I+ +E+ D S P S D + Q L+ ++ Y
Sbjct: 601 SSREEEDLPNLYDETNNTINSEESEIETGYADLSPPGSPDVQERQCLDGCPSPRAENHYC 660
Query: 661 DGLAGKPTDGNDGNEPFMVDSSKFSRWRLKFPKEVQ-DHSSKWSERRRFSVSENGAGASR 720
D D D N F D K ++ KEVQ D++ +WS++RR+ S+N + SR
Sbjct: 661 DRPDRDIKDLEDENVYFTNDHQKPVHQNARYWKEVQSDNALRWSDKRRY--SDNASFVSR 720
Query: 721 SEQRYYGDSLETPSRTMNGSNRKLRTNSLKAYGRHISKFNEKSHSSNNRVS--YDYRSCI 780
SE RY D LE PSR NGSNR+LR N+ K G + K +EK +NR+S +D+ SC
Sbjct: 721 SEARYRNDRLEVPSRGFNGSNRQLRVNASKTGGLNGIKSHEKFQCCDNRISERFDFSSCS 780
Query: 781 CNQNNELNKKAEAFVSSVRVNRDVKSASTSESSFDMSKQCSHSSRYSYGDHSRDGGRLKN 840
C + E K E + R R+ K+ S S+S+ D SK +RY+ D++R+ RLK+
Sbjct: 781 CKPSCEYRAKVEPKTAGSRSTREPKTISNSDSALDASKPVFQGNRYTQPDYTRE-LRLKS 840
Query: 841 K-----NNSPGKDYVYSKKVWEPMESQKKYPRSNSDSNVAMKSSTCKFNVEPDFDLVKSG 900
K N S +D ++SK+VWEPME KKYPRSNS S V ++ ST F E D + +
Sbjct: 841 KVGVGPNPSTTRDSLHSKQVWEPME-PKKYPRSNSYSEVTVRCST--FKAEEIEDAIVA- 900
Query: 901 HDACRGEVAVTSSTVDQEESNSTESTSGVESDEVSQNGLEWKDHKNIEEDACEVTKRSVN 960
NS++ S + E N ++ KD ++E TK +
Sbjct: 901 -------------------ENSSDLLSQCKVTEKLDN-IKLKDENSMESGE---TKNGWH 960
Query: 961 STDTTLTSSGTNNRVGTRSLNSDSCSSCPSEGDSNNICLNHGNLESLSTSDSEDASHHSE 1020
D ++S+ +SD+CSSC SEG+SN + N+GN ES STSDSEDAS SE
Sbjct: 961 LKDPMMSSTS----------SSDNCSSCLSEGESNTVSSNNGNTESSSTSDSEDASQQSE 1020
Query: 1021 GKES-SASIQNGFSEHRETRMDKVVEVDALGIRNHSGLSQEIEGCKVQGNAPNRVPRDFE 1080
G+ES QN T K+ E + + G + N+ N +
Sbjct: 1021 GRESIVVGTQNDILIPDTTGKSKIPETPIV-----------VTGNNMDNNSNNNMVHGL- 1080
Query: 1081 AGFSAVSLDSSPCQVTLPPTQNQTQNIHFPVFQVSPAMGYYHQ-NSVSWPAAAAHATNGI 1140
+D P P TQN+ +PVFQ + MGY+HQ VSWP A NG+
Sbjct: 1081 -------VDVQPQGGMFP--HLLTQNLQYPVFQTASPMGYFHQAPPVSWPTGPA---NGL 1140
Query: 1141 IPFSYSNPCLYANPLGYGLSDNPRFCMQYGH-LHHLAAPVFNPSPVPIYQPASKANNGVY 1200
IPF + NP LY PLGY ++ +P C+QYG L+H A P FNP PVP++ P SK N
Sbjct: 1141 IPFPHPNPYLYTGPLGYSMNGDPPLCLQYGSPLNHAATPFFNPGPVPVFHPFSKTN---- 1200
Query: 1201 TEERSQVPISESSDVVANPDIIGTTGLPYAISSPPGRDRKQNDTSIFPKDSSSFSLFHFG 1260
TE+++Q N + L +PP D SFSLFHF
Sbjct: 1201 TEDQAQ-----------NLE----PPLELNCLAPPETQTVNED---------SFSLFHFS 1209
Query: 1261 GPVAFSTGGNLNPMPSKEDDIVGDFSRNNEAADVVDDVHAFNKKETAIEEYNLFAASNGM 1272
GPV STG P SK+ + DVV +++ K+ +EEYNLFA NG+
Sbjct: 1261 GPVGLSTGSKSKPAHSKDGIL----------RDVVGNIYTKAKESKEVEEYNLFATGNGL 1209
BLAST of Cp4.1LG19g00810 vs. TAIR 10
Match:
AT2G41960.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G58050.1); Has 11991 Blast hits to 7260 proteins in 458 species: Archae - 17; Bacteria - 481; Metazoa - 5028; Fungi - 1325; Plants - 615; Viruses - 38; Other Eukaryotes - 4487 (source: NCBI BLink). )
HSP 1 Score: 928.7 bits (2399), Expect = 5.0e-270
Identity = 624/1301 (47.96%), Positives = 786/1301 (60.42%), Query Frame = 0
Query: 1 MPGL-TQKNDHLNGGSSAVYSLSANGFWSQHRDDVSYVQLQKFWSELLPQARQKLLRIDK 60
MPGL T N+H S++GFWS+ D ++Y QL +FWSEL +AR +LLRIDK
Sbjct: 9 MPGLTTHMNEH----------YSSSGFWSEDDDGLTYDQLDQFWSELSSKARHELLRIDK 68
Query: 61 QTLFEQARKNMYCSRCNGLLLEGFLQIVMYGKSLQQGNTCVNHTCNRLGGSKSQTCDGSL 120
QTLFEQARKNM CSRC GLLLEGF QI+ G++ + R+ G C S
Sbjct: 69 QTLFEQARKNMCCSRCLGLLLEGFAQILSAGRAAYE---------KRMMGPSKDNCK-SN 128
Query: 121 AVNGFHDEIQDPSVHPWGGLTTTREGLLTLLGCYLYSKSFLGLQNVFDSARARERERELL 180
Q P VH WGGLTTTR G +TLL C+L +K+F GLQNVF+S RARERERELL
Sbjct: 129 GTRKCTVAYQSPPVHRWGGLTTTRSGCITLLDCFLTAKTFKGLQNVFESNRARERERELL 188
Query: 181 YPDACGGGGRGWISQGTAGYGRGHGTRETCALHTARLSCDTLVDFWSALGEETRQSLLRM 240
YPDACGGGGR W+SQG AG+G+GHGTRETC LHT RLSCDTLVDFWSAL E +RQSLLRM
Sbjct: 189 YPDACGGGGRVWLSQGIAGFGKGHGTRETCNLHTTRLSCDTLVDFWSALEEHSRQSLLRM 248
Query: 241 KEEDFIERLMYRFDSKRFCRDCRRNVIREFKELKELKRMRREPCCTSWFCVADMAFHYEV 300
KEEDF+ERL YRFD K+FCRDCRRNVIREFKELKELKR++R+P CT WFCVAD AF YEV
Sbjct: 249 KEEDFVERLTYRFDCKKFCRDCRRNVIREFKELKELKRIQRDPRCTDWFCVADTAFQYEV 308
Query: 301 SDDTIQADWHQTFADSVETYHYFEWAVGSGEGKSDILEFENVGMNGSVKMNGLDLGGLNS 360
D+++ADW Q F ++ YH+FEWA+G+GEG+SDILEF+ VG + S ++NGLDL GL+
Sbjct: 309 DIDSVRADWSQYFTENA-GYHHFEWAIGTGEGESDILEFKYVGNDRSARVNGLDLRGLHE 368
Query: 361 CFITLRAWKLDGRCTELSVKAHALKGQQCVHRRLIVGDGFVTITRGENIRRFFEHAEEAE 420
C+ITLRA+K +GR +E+SVKAHAL+GQQCVH RL+VGDGFV+I RGE IR FFEHAEEAE
Sbjct: 369 CYITLRAFKKNGRPSEISVKAHALRGQQCVHSRLVVGDGFVSIKRGECIRMFFEHAEEAE 428
Query: 421 EEEEDDSMDKDANDLDGDCSRPQKHAKSPELAREFLLDAATVIFKEQVEKAFREGTARQN 480
EEE++ +DKD N+LDG+C RPQKHAKSPELAREFLLDAATVIFKEQVEKAFR+GTARQN
Sbjct: 429 EEEDEVLIDKDGNELDGECLRPQKHAKSPELAREFLLDAATVIFKEQVEKAFRDGTARQN 488
Query: 481 AHSIFVCLALKLLEERVHIACKEIITLEKQMKLLEEEEKEKREEKERKERKRTKEREKKL 540
AHSIFVCL+ +LLE+RVHIACKEI+TLEKQ KLLEEEEKEKREE+ERKERKR KEREKKL
Sbjct: 489 AHSIFVCLSSELLEQRVHIACKEIVTLEKQNKLLEEEEKEKREEEERKERKRIKEREKKL 548
Query: 541 RRKERLKGKEKDKD----KISSE------SAEVCSHSDILEDLSPCVLEQNSISVDETCD 600
RRKERLK KE++K+ K S + S E ++ ED + + + S + D
Sbjct: 549 RRKERLKEKEREKEQKNPKFSDKAILPIMSREEEGSRNLDEDTNNTIRCEESGIENGDVD 608
Query: 601 ASIPESSDTLDEQFLNESIVSEVQSSYDDGLAGKPTDGNDGNEPFMVDSSKFSRWRLKFP 660
S P S D DE+ L+ I V++ D + D D N F + + + +
Sbjct: 609 LSSPGSPDDQDEECLDGCISPRVETHSCDSTDKEIIDHEDENGCF---TPRPAHKTARLW 668
Query: 661 KEVQ-DHSSKWSERRRFSVSENGAGASRSEQRYYGDSLETPSRTMNGSNRKLRTNSLKAY 720
KEVQ DHS + SE+RRF +E + S SE Y D LE S NGS++ +R + KA
Sbjct: 669 KEVQTDHSLRLSEKRRF--TEKTSFVSSSEAGYCNDRLEMSSGHFNGSDKNVRVKASKAG 728
Query: 721 GR-HISKFNEKSHSSNNRVS--YDYRSCICNQNNELNKKAEAFVSSVRVNRDVKSASTSE 780
G + S+ +E+ S+ R YDY SC C N +K E+ S+ R R+ KS S+
Sbjct: 729 GSPNSSRSHEEFQCSDGRTGERYDYHSCSCKPINGYREKVESNTSATRGMREPKSVFKSD 788
Query: 781 SSFDMSKQCSHSSRYSYGDHSRD-GGRLKNKNNSPGKDYVYSKKVWEPMESQKKYPRSNS 840
S D+SK + ++RY+ + R+ ++ N N+ D V +KV + +E K+ R++S
Sbjct: 789 SDLDVSK-LNRANRYTQSGYRREIRSKMNNSRNACKMDPVNVRKVLDSVE--PKHSRNSS 848
Query: 841 DSNVAMKSSTCKFNVEPDFDLVKSGHDACRGEVAVTSSTVDQEESNSTESTSGVESDEVS 900
S+V S + E E+ S TV +G S
Sbjct: 849 TSDVL---SLTTYKAE---------------EIKDVSPTV---------KPAGTPS---- 908
Query: 901 QNGLEWKDHKNIEEDACEVTKRSVNSTDTTLTSSGTNNRVGTRSLNSDSCSSCPSEGDSN 960
C+ T + N + T V N S P S+
Sbjct: 909 ---------------LCKATDKLGNGSFNNSTEVDKKMEVHITLKNDYLYSKDPMMSRSS 968
Query: 961 NICLNHGNLESLSTSDSEDASHHSEGKESSASIQNGFSEHRETRMDKVVEV-----DALG 1020
+ N+GN+ES S SDSE AS SEG+E+ QN + E ++KV E+ D L
Sbjct: 969 S--SNNGNIESSSMSDSEVASQQSEGRENLVDTQNDMPDCHEKMVEKVTEMSMDERDVLK 1028
Query: 1021 IRNHSGLSQEIEGCKVQGN---APNRVPRDFEAGFSAVSLDSSPCQVTLPPTQNQTQNIH 1080
I+N S L + K+ G P++ + G + S S P + LP N Q+I
Sbjct: 1029 IKNISNLPADNGESKLSGTPFMVPSQNMENMVPGLNTGSYLSQPQNMILPQMLN--QSIP 1088
Query: 1081 FPVFQVSPAMGYYHQNSVSWPAAAAHATNGIIPFSYSNPCLYANPLGYGLSDNPRFCMQY 1140
PVFQ MGYYHQ VSW +A +TNG++ F + N +Y PLGY L+ CMQY
Sbjct: 1089 LPVFQAPSTMGYYHQAPVSWSSA---STNGLMQFPHPNHYVYTGPLGYSLNGESPLCMQY 1148
Query: 1141 G-HLHHLAAPVFNPSPVPIYQPASKANNGVYTEERSQ--VPISESSDVVANPDIIGTTGL 1200
G L+H AAP FN PVPI+ P ++ N + T +++Q P+ S AN L
Sbjct: 1149 GTPLNHSAAPFFNSGPVPIFHPFAETNT-MNTVDQAQPLEPLEHSFLKEANERRFNEMPL 1208
Query: 1201 PYAISSPPGRDRKQNDTSIFPKDSSSFSLFHFGGPVAFSTGGNLNPMPSKEDDIVGDFSR 1260
P + Q D+ +FSLFHFGGPVA STG NP SK D I+ DFS
Sbjct: 1209 ----METPRKRCPQTDS------DENFSLFHFGGPVALSTGSKANPARSK-DGILEDFSL 1215
Query: 1261 NNEAADVVDDVHAFNKKE---TAIEEYNLFAASNGMRFSFF 1272
V D +KKE T EEYNLFA SN +RFS F
Sbjct: 1269 QFSGDHVFGDPTGNSKKEKENTVGEEYNLFATSNSLRFSIF 1215
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_023517706.1 | 0.0 | 100.00 | uncharacterized protein LOC111781381 [Cucurbita pepo subsp. pepo] >XP_023517707....
KAG7027601.1 | 0.0 | 97.25 | hypothetical protein SDJN02_11615 [Cucurbita argyrosperma subsp. argyrosperma]
XP_022925078.1 | 0.0 | 97.25 | uncharacterized protein LOC111432432 [Cucurbita moschata]
KAG6595628.1 | 0.0 | 97.17 | hypothetical protein SDJN03_12181, partial [Cucurbita argyrosperma subsp. sorori...
XP_022966143.1 | 0.0 | 97.41 | uncharacterized protein LOC111465909 [Cucurbita maxima]
Match Name | E-value | Identity (%) | Description
A0A6J1EE83 | 0.0 | 97.25 | uncharacterized protein LOC111432432 OS=Cucurbita moschata OX=3662 GN=LOC1114324...
A0A6J1HR26 | 0.0 | 97.41 | uncharacterized protein LOC111465909 OS=Cucurbita maxima OX=3661 GN=LOC111465909...
A0A6J1DQ45 | 0.0 | 84.98 | uncharacterized protein LOC111022059 OS=Momordica charantia OX=3673 GN=LOC111022...
A0A1S3B599 | 0.0 | 83.80 | uncharacterized protein LOC103486163 OS=Cucumis melo OX=3656 GN=LOC103486163 PE=...
A0A6J1HVV4 | 0.0 | 83.45 | uncharacterized protein LOC111467149 OS=Cucurbita maxima OX=3661 GN=LOC111467149...
Match Name | E-value | Identity (%) | Description
AT3G58050.1 | 0.0e+00 | 52.15 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT2G41960.1 | 5.0e-270 | 47.96 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
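
The Identity values in the summary tables are the same percentages printed on each HSP statistics line (identical positions divided by alignment length). As a minimal sketch, assuming the statistics line keeps the layout shown on this page, the percentages can be re-derived as below. The function name `parse_hsp_stats` and the dictionary keys are hypothetical, chosen only for illustration; they are not part of any BLAST tool.

```python
import re

# Pattern for the HSP statistics line as it is printed on this page.
# "Posi?tives" also accepts the misspelling "Postives" seen in some viewers.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract counts from one HSP statistics line and recompute percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        # Percentages exactly as reported in the line.
        "identity_pct_reported": float(ident_pct),
        "positives_pct_reported": float(pos_pct),
        # Percentages recomputed from the raw counts, rounded to two decimals.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
    }


if __name__ == "__main__":
    # Statistics line of the AT3G58050.1 hit from this page.
    line = "Identity = 691/1325 (52.15%), Positives = 842/1325 (63.55%), Query Frame = 0"
    stats = parse_hsp_stats(line)
    print(stats["identity_pct"], stats["positives_pct"])
```

Comparing the recomputed and reported values is a quick consistency check when hit tables like the ones above are copied into other files.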