Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GAGAGATATATATAGAGAGAGAGAGAGATATATATAGAGAGAGAGAGAGAGAGCACATAAATCATAAGCGCCTATTTGTGATTTATGATCGGAGAGGATGAGACAAGAGCGATGATCCCCAACCTCCATTACTCTTGATTCCGTCATCTTTCCACCAATGAATCCCTACTCCGAGGAAAGACTCACCGAAGAGGTTCTCTATCTTCACTCTCTGTGGCGGCGAGGTCCGCCGAGGGGCCCTAAGCCCACTCGCTATTATTTATCCACCGCCGTCGCCGCTGCTACGAATAAGAGACCCAGAGACACAAAGAATCGAAAGCAAAAGAAGAAGAAGCCACGCCTCGAGCCATTACAAGACACCGGCCCCGAATGGCCCTGCCCGGAGCCAGTGCAAAATCAGCCCTCGACGTCATCTGGGTGGCCGCCAATGCCCTGTGCTACTCCGGCGGCTCGGCTGGTGTCGTCTGAAGAGCGAGCAAATCGTGTGGCGTTGCAATTGCAGTACAAGGGTATCGAGGCTTGCCGGAGATTTCTCATTAGAAATGCCGATTCAGGGAGTGATGAAGAGGTGGAGGAGGAAGAGGGGAATGATGGGGAGATTATGGAAAGTGAAGAGTACAAATTCTTTTTGAATCTGTTTATGGAGAATGATGAACTTAGGGGCTATTACGAGAAGAATTCTGAAGATGGGTTGTTTTGTTGCTTGGTTTGTGATGGAATGGGGAAGAAGAAATCTGGGAAAAGGTTTAAGAACTGCATTGGGCTTGTTCATCATTCGAATTCGATATCTAGAACGAAGAAGAAGGTGGCTCATAGGGCTTTTGGACAGGCCGTATGCAGGGTTTTTGGTTGGGATATTGATCGACTTCCAACCATTGTGTTGAATGGCGAGCCTCTCAGTCGATCATTAGCCAATTCTGGAGATTTCAAGGTACTTTACTGGTCTGCCCATGTGTTTGAACTTGAACTTGTTTATAATGGCATTCAATACAAAATTGCTTTCGATTATAATGTATTGTAGGATCAGCCAGAGGAAAATCAGGTGGCTGAAGAACATGATTCTTGGGTTCATAATGAAAATGTAGCCATTTTGAATGATGAAATTGATATGAAGAATGAACAGAAATGGGAGGAAGAAAAGACAGCTGAAGATTTGATTTCTGGCGAGGTTAGTTCATTTACTTCCTTTATATGAATCAGTTGAGGTGTTCTTGATGATCATGCAATTGTCTTGTCTTATGTTCATTAGAAAACGAAGAACAATGATTCCTCGGCAGTCGTAACCGAATGCCGAAAACATGTAGTTTCTGCTGATGAGCTGATACAGTTGAATGTGTTGCAGGTACCCGAGTCGATTATGGAAGCATGTGAAGAATTTTTTGCTGCCTCCTTGACATCTATGGCTGACGACGATGTTAGTGAAAACAACGCAATCGAGGAACGCGAAGAGTTCAAATTCTTTTTAAAGCTGTTCATTGAGAATGAAAGCTTGAGAAGATATTACAAGAACAAGTATGATGATGGAGAATTTTCGTGTTTAGTTTGTGAAGGAGCGGGAAAGAAAACGTTGAGGAGTTTTAAGACGTGCGTTCGCCTTCTCCGACATACAACTTATCCTGGGAAGAACAAAACAGGGAAAAAACGGGTTAAGCCTCACATTGCTAAGATGTTGAAAGTAAAGATGCTGGCTCATAGAGCATATAGTTTAGTTATATGCCAGGTTCTTGGTTGGGACATAGAAAAGCTTCCTGCAATCGTGTTAAAAGGCGAAGGCCATGGTTGTTCGTTAAAGAAGCTAGACGTGTTGAAAGTATGCTAATTCATTTGATCAATCTCTTCCTATGCTCAAACTTTCATGTTCTTGATTCATACTAACAATGCTGGAATTGCAGGACGACCCGGTTGGCAATGCAGGTGATAATACGAACGAAGTAGATGATCCTGTGAGAGATGACTCTACTGAGATCGACTAAGTTCACAACCAATCCGTCGGTGCAGTCAGGATGATACCGGAGAAGATGACTCGAAAAAGGTTAGTGAAGTGTTCTTGTTCTTGTTGTTTCTGTGTTAGACTGGTAGATGAAGGTTTTGTGGCATTTTGAGTGTGAGATATGATATTTAGGCCTATTCAAAAGAAGCCCTGTTGGCTCCCATTTTCTCCCCCCATTCCCTTCATTCTATTGGTTTCTCATGAATCTTTAGGTTGTTGATAAATGCATAGCTAGGAGTCTCATGTTGGTTTCTCGTGGTCACCTTTAAAACTCCGATCCTGCTGCTAGGGAGAGGGAGAGGTTTCCATATCCTTATAAGGAATGTTCTGTTTCCCTTTCCAACTGACGTAGGACCTCACAATTCAAATCTCTTTTTGCCCACCGTCCTCATTGGCATACTGCCCGGTGTGTAGCTCTGATATCATTTGTAACCGTTCAAGCCCACCGCTAGCAAATATTGTTCATTTGGCGTCAGCCTCACGGTTTTCAAACATGTCTATTAGGGAGAGGTTTCCACACCCTTATAAGAAATGCTCTGTTCCCCTGTTCCCCTCTCCAACTGATGTGGGACCTCATAATCCCCCCCCCNTCGGTGTCTGGCTGCGATACCATTTGTAACAGCCTAAGCCTGCCGCTAGCAGATATTGTCTGCTTTGGCATCAACCTCACGGTTTTAAAATGCGTCCATTAGAGAGAGGTTTCCACATCCTTATAAGGAATGCTTCGTTCCCCTCTCTATCTGACGTGGGACCTCACAATCCACCCTTCTTGGGACCCAGTGTCTGGCTCTAATACCATTTGTAACAGCCCAAACTCACCGTTAGCAGATAATGTCCGCTTTGGTGTCAGCCTCACGGTTTTAAAATGCGTCTACTAAGAAGAAGTTTCCACATCCTTATAAGGAATGTTTCGTTCCCCCCTCTAACCGACAAGGGACTTCATAATCTACCCCCTTGGATCCCAGCGTTCTCGCTGGCACATTGCCCCGTGTGTAGCTCTAATACTATTTGTAACAGCCCAAACCCACTACTAACAGATATTGTCTGTTTTGGCCCGTAACGTATCACCGTCAGCCTCACAGTTTTAAAACGCTTCTACTAAGGAGAGGTTTCCCCACGCTGTTCCACTCTCCAACCGATGTGGGACCTCACAGAACTAATCACATGGATAATACTACATAAGAAGGCACCATGTTTGCCATCTAGCTAAGAACGATAACGTTAAATTCCCTATACCATGAACATTTCAATCGATTCATTGAATAATTGTATCATATCAACTTTCAATACTCAACTCTACAAGGGAGATCACCAGAATTAACACACCTCATCTCTACAGGTTTTGGCTCAGCCCCAAGCTCGTTATAACCAAGTCGGTCTCTAATGGCGACGGAAGATGAGAGGAGCAGGGCAAGGATGATTCACAACGTAATAGAGAGGAGAGTTTCGGTTTCTTTCATTTCTAATTTTCTTTTTAAGTTCATTTCATTTATATCGAGTCGAATCAAATTACATGAATTTCAAACAAATTTAGATTGAAATTCATCGACTCCCGTTAATTTCTGGTCAAAATTCATGAATCCCGACCTTTTTCGATGCATATTTATAAATTTTGATCGAGATTATGGTAGTGAACAAACATAGAACAGATATCTTGACGGAGACACCGACATTATTTTATTTTCAAATGGGTTTCAAATTTAAAATTTTGATTAGTGATTTTTTTTTTCATTTAATAATAATTTTTAAAATAACTTAATTATTACACAAAAAAATATTATTATTATTATTATTATTTATCATTTTTTCAAATACAACCTTGACTATAATTCTTATAAATAATTTTAAAATAAAAACTATTACTTTTCATCATGAAAATTATTAAACTAATTATTTAATTAAAAAAAATTATTTTCTAAAAAAAAAATAATAATAAAAATAAAGGAAGAATCATTTCTAAGTTTGGGAGCCAAAAACCGTTTTTTTAGAATCTGAAATAATATTATAATTTAGAGATTAAACGTCTCTGCTTTTGCCTCTGCCATTTAAAGATAAACTAAAACCCTGTTCCTGTGCAGCTGCCGATCAAGAACACAACCCATCCAACCCCATGATTATGGACTGACCTCCGCGGCACCTCAGGTTGATTTCTTCTTCCAACCCCTTTTTGATTCTGCTGTTCGCTTTCTGTGAATGTTGCGTAAATATTCAATTTCGCAAAACAATGTGATTCTGCTGATCAAATTTGAGAATCTGCTCCTCGAATGCCTTAAATCTCTATATTTTCGCTCCATCTTCAGTTTGGTTCTATGATCGTTTAAGAACACTGAGAACTTCGAGGATCTTTTGTTATTTTTTTTTCTAACAAAACTCTCATTCAATGAGTTGAAAGATTAGAAAATCATTCCTTGTGGTACTGTTCTTTGGTTATAAGGATTATGAATCTTGCAGGTGAACAAAAATGGATTTGAAACATAAGGGTATATCATGGGTTGGAAACATGTTCCAAAAATTTGAAGCAGTGTGCCAGGAAGTGGATAATATTATAAACCAGGTTATCCTCATTTCCTTCTCTTTTTTACGTTATGTGATCCAACATTTTGACAATATAGTTCATGTGGTAGCTTTCTTATTGGGTTGATTGTGTCACTGAATCGTTCATCTTTGTCGTGCAATTTAGTAATTTTTCGCTGAGGTTTGCAGTAGGAATAAACTGTTAAACTGATCTTATGCACAGGATAAGGTTGAATATGTTGAAAATCGGGTTAGTTCAGCAAGTGTTAATGTGAAGAGATTAGATGTTGTTCAAGGTTTACTTCCTCCTACAGAGGGTTCTGTGAAATATGAAGCTAAAGCAGTGGCTCCGAGGGGACGTACATATTTCAAGTCACTGTCATACAATGAAGAAAAATCTGCACATAATGTTGCTGATAAATCATCTGTGGGGCATGGTACTATCAATCATCAAGCTTCTTGTAAAGTTCTCTTTGTAAATGAAGAAGTTGCTCGAGTTCCTAATCGTTCTTCTCTTCGGTTGAATGCTGGTTTACATGAGAACAAAAAAGAAAAACCTGTTAATGAACTACTTTCGGAGAAAAGTGATGGCTCATTGACTGATAAGTTTGCGTTCGTGGAGTCGGATGCTATTGATCCTTTGAATCGATCACTGAGAAATGTAAGTCGTGAAGTTAATGAAATTAATAAAAGTTGTTCTCCGGTTTTTGATGACTCCGATCTGCAATTGGTGGATAATGTACTCTTAGTAGGGAACAACAATGGGGCTTTGACAAATAATGATGCAAGTAAGAGTTCTAAAGAGGATACGACCATAGAGTTCAATGCTAGTGATCCGTTGAACCATACGGCTAATCATAAATCTTGTCAAGTTAAAGTTACAAATGGAGAAGAATTTTTTATTTTGGATAACTCTCATCTGCCAATGGAATCTTCCAGATTCTCGTCGAAGGACGACGACTTGTCAAATGAAAACACCAATGAGTTTGTAAAGAAGGTTGGGATCATGGAACCTAATGCTGCTGATCATTTGAACGACAAACATCTTAGTCATGTATGGAGCAGTACAAACTTCGTAAGTAAAGAAGCTGATAATTCTAATATGCTTTTGAAGTCCGAGGTACCTTCAAGCAGAATCGATCATGCCTTGATAGATAAAGATTTCAATGAGGGTCCTGTAAAGGATGCTATCTTTGAGGATGATCTTGAAAGTTATTTATTGAATCTTCCCAGTGAAGAAGCTATGATTTCTAATGGAAACCATCTGCAAATGGAGCCTGAACTACTTGCTAGAAACAATGATGATGCTTTGACAGATGCATACTCTAATGAAAGTTTAGAAAAGGATACCATTTTGGAGTTGGAGTATGATGCAAGTTATCCTTTAAAGAACCAGCCAAGACGTATATCAAGCAGCGTAAAATATAAAAATGAAGAAGTTTCTTCAGTTTCAATAGATAGGGCATCAGATGCAAGTTGTAAAGAACAAGACAATTTAGAATTATCAACTGAGTTAACTTTGCATTGTGATGAAGAGTCGATTAAGGGCAGTTCGTGCATTTATGGTAATGAACGTGACGGGGATATTGCGACCTCAACTCGAAATCCACAGGAAACTTCGGTTCATGGTGCTGATGTTGAATCCATCCATAAAGTAGGAGAACCTCCTAGCATCTCGTTGAACAATTTAGTTGACTTATCACCTAGGATGGAGACACATTTGAGGTACTTCGAAAATGTTCCACATGCTACTTCTTCTGAACTGGCTTCTGTAGTTTTAGCTAGTGGAGAAACTGTAAAAGAGACAAAGTCAGTCTCCTCTCTGAAACCGCTACCGAAGGGTCCGTTTTCTGCTTCCAGAAGTTCGGTCGACAACTTTTCTAGTACCACCGTTCATGAAAAACCAGTCGATCAGCGTGCATACATTGAGTGTAGATCTCATCCATCTTTCGAAGTGGTCACTCGTGCATCTAATGGAAACAAGGCTTCGGAGACGAGATTTAACTCCTCCAGAAGCTCCTTATCATCATTTGAATCGCTTGGTCTGTACTCTTATCTTCTATCCTAGGATAAGTGTGTTTTCTTATGATTTTCTTAAACAATTCATGATTCAGCAGGAACTCATGCCAGTAGCCAGGTTGAGTTTTCCAAATCTACTGGTTCTGGGATTCTAAGTTTCTCTACTGAAGTAGGTATGTCCCAAAGCTTGTGTTCATACAATTTTGTAAATTCCCGGCTGTTATCTATAATTCTTATGATGTGGTGGTAGTTCAATCTGTTGGCTATGGAGAAAATCTTTCATTAGGAATAAAGCAACAATCTTAAAAACTTTTCTCAGGTTGTCTGTATGATTCGAGTGGCCATATTCTGGATTTTGAAATGGAAACAGTGGATTTGGGACATAAGGTGACCGTCGAAGACGAGTGTGGCGTTATTGACTATAAAGCTCTCCATGCTGTCTCTCGCCGAACCCAAAAGCTCCATTCTTACAAGGTCCATATATTATCTAAAGTTACATATATGAGCTTCATCTTGCAATGTTAAGTTTATTTAAAACCTGGAAACACCTCTCGTTCTTTAAGATGATAATGTTTAACGATAACTTTTACCGTCCGTTTCTGATCTATTTTTCTTTTCTTTCTGAAAATAGAAGAGAATCCAGGATGCTTTTACTACCAAAAAGAGGTTGGCAAAGGATTATGAACAGCTAGCAATCTGGTATGGAGATACTGATCTGGACTCCATCACAGACAGTTCCCAGAAGTCGGACAAGAAGAACGCATCCGATTCCGAGTGGGAGCTCCTGTAAATAAGACAGCTAATTCACTTCGTCTCGGCAATCAAACTTGTTTCCAGGTGGAGGAGAATCTTATATGCTGGAGATGAAGAGGAAGCTCGTCTGTTAATACCTACTCAAGAATAAGGTTCCTCACTTTATCTATTGAAGTGCATAAGTTACCTTGGAAATTCCTAAATAAACGAGTTGCAAGAAATTTTGCACATATTGGCACTCTTTCTGGGCTATGAACCTTTGATACTTTAATTAATTTCATATCCTTGGTAAATAAAGCTGTCTTGTTTTATCTTTTTTGGAAAGATCTACTCTTTGTTTGACCTTTTGAACAGAACACTCGTGGGCTTGTCGAGTTCAACAATTAGAAGCGCTTTTGGTGTTCGTTCTTCCCATCCAGACACTTGGACGACATTGCATTCCACGTTCGACCACTCTCAAGCTTACAGGCTCCCATTCTTTGCTCATGATGTCAATGAAATAGAAAGTGACAAACAGTAAGTCTTAACTTTGCATTTTGTGGCTCTGTCGTATACTTTTTATTGTCATCTGAACTTTTACTACAAAGATTAGCATCTGACTGCCATCATATTGGAATGATTGCCTCTGTCTCTTCGTGTCGGGACAGCCTGGAACTGCTATGTAGTATAGTTTTACTTCTCATTGCTCCAATAAACAAAATTCAAACTCAAATCACATGACTTTGGGGATCTGGTTTTTTTGAAGTGAAGAAGATCATCACTGTGTTTTTGCCAAACTAATGTGAAAGAGTGTTGATTCAATGGTCTCATTCAGGATTCTGTTTTTGGATATCTTTATTAAAGTAGTGGACTAGCCATATCCTCTTGAGATTCTAAAATAGTGTAGCTTTACAGGAGCACGAGAATTACTGTAGGAGTTTGAAAAGGATTGTCTTTTGCCTATCTCCAAAGGCTGATA
mRNA sequence
GAGAGATATATATAGAGAGAGAGAGAGATATATATAGAGAGAGAGAGAGAGAGCACATAAATCATAAGCGCCTATTTGTGATTTATGATCGGAGAGGATGAGACAAGAGCGATGATCCCCAACCTCCATTACTCTTGATTCCGTCATCTTTCCACCAATGAATCCCTACTCCGAGGAAAGACTCACCGAAGAGGTTCTCTATCTTCACTCTCTGTGGCGGCGAGGTCCGCCGAGGGGCCCTAAGCCCACTCGCTATTATTTATCCACCGCCGTCGCCGCTGCTACGAATAAGAGACCCAGAGACACAAAGAATCGAAAGCAAAAGAAGAAGAAGCCACGCCTCGAGCCATTACAAGACACCGGCCCCGAATGGCCCTGCCCGGAGCCAGTGCAAAATCAGCCCTCGACGTCATCTGGGTGGCCGCCAATGCCCTGTGCTACTCCGGCGGCTCGGCTGGTGTCGTCTGAAGAGCGAGCAAATCGTGTGGCGTTGCAATTGCAGTACAAGGGTATCGAGGCTTGCCGGAGATTTCTCATTAGAAATGCCGATTCAGGGAGTGATGAAGAGGTGGAGGAGGAAGAGGGGAATGATGGGGAGATTATGGAAAGTGAAGAGTACAAATTCTTTTTGAATCTGTTTATGGAGAATGATGAACTTAGGGGCTATTACGAGAAGAATTCTGAAGATGGGTTGTTTTGTTGCTTGGTTTGTGATGGAATGGGGAAGAAGAAATCTGGGAAAAGGTTTAAGAACTGCATTGGGCTTGTTCATCATTCGAATTCGATATCTAGAACGAAGAAGAAGGTGGCTCATAGGGCTTTTGGACAGGCCGTATGCAGGGTTTTTGGTTGGGATATTGATCGACTTCCAACCATTGTGTTGAATGGCGAGCCTCTCAGTCGATCATTAGCCAATTCTGGAGATTTCAAGGATCAGCCAGAGGAAAATCAGGTGGCTGAAGAACATGATTCTTGGGTTCATAATGAAAATGTAGCCATTTTGAATGATGAAATTGATATGAAGAATGAACAGAAATGGGAGGAAGAAAAGACAGCTGAAGATTTGATTTCTGGCGAGAAAACGAAGAACAATGATTCCTCGGCAGTCGTAACCGAATGCCGAAAACATGTAGTTTCTGCTGATGAGCTGATACAGTTGAATGTGTTGCAGGTACCCGAGTCGATTATGGAAGCATGTGAAGAATTTTTTGCTGCCTCCTTGACATCTATGGCTGACGACGATGTTAGTGAAAACAACGCAATCGAGGAACGCGAAGAGTTCAAATTCTTTTTAAAGCTGTTCATTGAGAATGAAAGCTTGAGAAGATATTACAAGAACAAGTATGATGATGGAGAATTTTCGTGTTTAGTTTGTGAAGGAGCGGGAAAGAAAACGTTGAGGAGTTTTAAGACGTGCGTTCGCCTTCTCCGACATACAACTTATCCTGGGAAGAACAAAACAGGGAAAAAACGGGTTAAGCCTCACATTGCTAAGATGTTGAAAGTAAAGATGCTGGCTCATAGAGCATATAGTTTAGTTATATGCCAGGTTCTTGGTTGGGACATAGAAAAGCTTCCTGCAATCGTGTTAAAAGGCGAAGGCCATGGTTGTTCGTTAAAGAAGCTAGACGTGTTGAAAGACGACCCGGTTGGCAATGCAGGTGATAATACGAACGAAGTAGATGATCCTGTGAGAGATGACTCTACTGAGATCGACTAAGTTCACAACCAATCCGTCGGTGCAGTCAGGATGATACCGGAGAAGATGACTCGAAAAAGGTTTTGGCTCAGCCCCAAGCTCGTTATAACCAAGTCGGTCTCTAATGGCGACGGAAGATGAGAGGAGCAGGGCAAGGATGATTCACAACGTAATAGAGAGGAGAGTTTCGGTTTCTTTCATTTCTAATTTTCTTTTTAAGTTCATTTCATTTATATCGAGTCGAATCAAATTACATGAATTTCAAACAAATTTAGATTGAAATTCATCGACTCCCGTTAATTTCTGGTCAAAATTCATGAATCCCGACCTTTTTCGATGCATATTTATAAATTTTGATCGAGATTATGGTAGTGAACAAACATAGAACAGATATCTTGACGGAGACACCGACATTATTTTATTTTCAAATGGGTTTCAAATTTAAAATTTTGATTAGTGATTTTTTTTTTCATTTAATAATAATTTTTAAAATAACTTAATTATTACACAAAAAAATATTATTATTATTATTATTATTTATCATTTTTTCAAATACAACCTTGACTATAATTCTTATAAATAATTTTAAAATAAAAACTATTACTTTTCATCATGAAAATTATTAAACTAATTATTTAATTAAAAAAAATTATTTTCTAAAAAAAAAATAATAATAAAAATAAAGGAAGAATCATTTCTAAGTTTGGGAGCCAAAAACCGTTTTTTTAGAATCTGAAATAATATTATAATTTAGAGATTAAACGTCTCTGCTTTTGCCTCTGCCATTTAAAGATAAACTAAAACCCTGTTCCTGTGCAGCTGCCGATCAAGAACACAACCCATCCAACCCCATGATTATGGACTGACCTCCGCGGCACCTCAGGTGAACAAAAATGGATTTGAAACATAAGGGTATATCATGGGTTGGAAACATGTTCCAAAAATTTGAAGCAGTGTGCCAGGAAGTGGATAATATTATAAACCAGGATAAGGTTGAATATGTTGAAAATCGGGTTAGTTCAGCAAGTGTTAATGTGAAGAGATTAGATGTTGTTCAAGGTTTACTTCCTCCTACAGAGGGTTCTGTGAAATATGAAGCTAAAGCAGTGGCTCCGAGGGGACGTACATATTTCAAGTCACTGTCATACAATGAAGAAAAATCTGCACATAATGTTGCTGATAAATCATCTGTGGGGCATGGTACTATCAATCATCAAGCTTCTTGTAAAGTTCTCTTTGTAAATGAAGAAGTTGCTCGAGTTCCTAATCGTTCTTCTCTTCGGTTGAATGCTGGTTTACATGAGAACAAAAAAGAAAAACCTGTTAATGAACTACTTTCGGAGAAAAGTGATGGCTCATTGACTGATAAGTTTGCGTTCGTGGAGTCGGATGCTATTGATCCTTTGAATCGATCACTGAGAAATGTAAGTCGTGAAGTTAATGAAATTAATAAAAGTTGTTCTCCGGTTTTTGATGACTCCGATCTGCAATTGGTGGATAATGTACTCTTAGTAGGGAACAACAATGGGGCTTTGACAAATAATGATGCAAGTAAGAGTTCTAAAGAGGATACGACCATAGAGTTCAATGCTAGTGATCCGTTGAACCATACGGCTAATCATAAATCTTGTCAAGTTAAAGTTACAAATGGAGAAGAATTTTTTATTTTGGATAACTCTCATCTGCCAATGGAATCTTCCAGATTCTCGTCGAAGGACGACGACTTGTCAAATGAAAACACCAATGAGTTTGTAAAGAAGGTTGGGATCATGGAACCTAATGCTGCTGATCATTTGAACGACAAACATCTTAGTCATGTATGGAGCAGTACAAACTTCGTAAGTAAAGAAGCTGATAATTCTAATATGCTTTTGAAGTCCGAGGTACCTTCAAGCAGAATCGATCATGCCTTGATAGATAAAGATTTCAATGAGGGTCCTGTAAAGGATGCTATCTTTGAGGATGATCTTGAAAGTTATTTATTGAATCTTCCCAGTGAAGAAGCTATGATTTCTAATGGAAACCATCTGCAAATGGAGCCTGAACTACTTGCTAGAAACAATGATGATGCTTTGACAGATGCATACTCTAATGAAAGTTTAGAAAAGGATACCATTTTGGAGTTGGAGTATGATGCAAGTTATCCTTTAAAGAACCAGCCAAGACGTATATCAAGCAGCGTAAAATATAAAAATGAAGAAGTTTCTTCAGTTTCAATAGATAGGGCATCAGATGCAAGTTGTAAAGAACAAGACAATTTAGAATTATCAACTGAGTTAACTTTGCATTGTGATGAAGAGTCGATTAAGGGCAGTTCGTGCATTTATGGTAATGAACGTGACGGGGATATTGCGACCTCAACTCGAAATCCACAGGAAACTTCGGTTCATGGTGCTGATGTTGAATCCATCCATAAAGTAGGAGAACCTCCTAGCATCTCGTTGAACAATTTAGTTGACTTATCACCTAGGATGGAGACACATTTGAGGTACTTCGAAAATGTTCCACATGCTACTTCTTCTGAACTGGCTTCTGTAGTTTTAGCTAGTGGAGAAACTGTAAAAGAGACAAAGTCAGTCTCCTCTCTGAAACCGCTACCGAAGGGTCCGTTTTCTGCTTCCAGAAGTTCGGTCGACAACTTTTCTAGTACCACCGTTCATGAAAAACCAGTCGATCAGCGTGCATACATTGAGTGTAGATCTCATCCATCTTTCGAAGTGGTCACTCGTGCATCTAATGGAAACAAGGCTTCGGAGACGAGATTTAACTCCTCCAGAAGCTCCTTATCATCATTTGAATCGCTTGGAACTCATGCCAGTAGCCAGGTTGAGTTTTCCAAATCTACTGGTTCTGGGATTCTAAGTTTCTCTACTGAAGTAGGTTGTCTGTATGATTCGAGTGGCCATATTCTGGATTTTGAAATGGAAACAGTGGATTTGGGACATAAGGTGACCGTCGAAGACGAGTGTGGCGTTATTGACTATAAAGCTCTCCATGCTGTCTCTCGCCGAACCCAAAAGCTCCATTCTTACAAGAAGAGAATCCAGGATGCTTTTACTACCAAAAAGAGGTTGGCAAAGGATTATGAACAGCTAGCAATCTGGTATGGAGATACTGATCTGGACTCCATCACAGACAGTTCCCAGAAGTCGGACAAGAAGAACGCATCCGATTCCGAGTGGGAGCTCCTGTAAATAAGACAGCTAATTCACTTCGTCTCGGCAATCAAACTTGTTTCCAGGTGGAGGAGAATCTTATATGCTGGAGATGAAGAGGAAGCTCGTCTGTTAATACCTACTCAAGAATAAGGTTCCTCACTTTATCTATTGAAGTGCATAAGTTACCTTGGAAATTCCTAAATAAACGAGTTGCAAGAAATTTTGCACATATTGGCACTCTTTCTGGGCTATGAACCTTTGATACTTTAATTAATTTCATATCCTTGGTAAATAAAGCTGTCTTGTTTTATCTTTTTTGGAAAGATCTACTCTTTGTTTGACCTTTTGAACAGAACACTCGTGGGCTTGTCGAGTTCAACAATTAGAAGCGCTTTTGGTGTTCGTTCTTCCCATCCAGACACTTGGACGACATTGCATTCCACGTTCGACCACTCTCAAGCTTACAGGCTCCCATTCTTTGCTCATGATGTCAATGAAATAGAAAGTGACAAACAGAGCACGAGAATTACTGTAGGAGTTTGAAAAGGATTGTCTTTTGCCTATCTCCAAAGGCTGATA
Coding sequence (CDS)
ATGAATCCCTACTCCGAGGAAAGACTCACCGAAGAGGTTCTCTATCTTCACTCTCTGTGGCGGCGAGGTCCGCCGAGGGGCCCTAAGCCCACTCGCTATTATTTATCCACCGCCGTCGCCGCTGCTACGAATAAGAGACCCAGAGACACAAAGAATCGAAAGCAAAAGAAGAAGAAGCCACGCCTCGAGCCATTACAAGACACCGGCCCCGAATGGCCCTGCCCGGAGCCAGTGCAAAATCAGCCCTCGACGTCATCTGGGTGGCCGCCAATGCCCTGTGCTACTCCGGCGGCTCGGCTGGTGTCGTCTGAAGAGCGAGCAAATCGTGTGGCGTTGCAATTGCAGTACAAGGGTATCGAGGCTTGCCGGAGATTTCTCATTAGAAATGCCGATTCAGGGAGTGATGAAGAGGTGGAGGAGGAAGAGGGGAATGATGGGGAGATTATGGAAAGTGAAGAGTACAAATTCTTTTTGAATCTGTTTATGGAGAATGATGAACTTAGGGGCTATTACGAGAAGAATTCTGAAGATGGGTTGTTTTGTTGCTTGGTTTGTGATGGAATGGGGAAGAAGAAATCTGGGAAAAGGTTTAAGAACTGCATTGGGCTTGTTCATCATTCGAATTCGATATCTAGAACGAAGAAGAAGGTGGCTCATAGGGCTTTTGGACAGGCCGTATGCAGGGTTTTTGGTTGGGATATTGATCGACTTCCAACCATTGTGTTGAATGGCGAGCCTCTCAGTCGATCATTAGCCAATTCTGGAGATTTCAAGGATCAGCCAGAGGAAAATCAGGTGGCTGAAGAACATGATTCTTGGGTTCATAATGAAAATGTAGCCATTTTGAATGATGAAATTGATATGAAGAATGAACAGAAATGGGAGGAAGAAAAGACAGCTGAAGATTTGATTTCTGGCGAGAAAACGAAGAACAATGATTCCTCGGCAGTCGTAACCGAATGCCGAAAACATGTAGTTTCTGCTGATGAGCTGATACAGTTGAATGTGTTGCAGGTACCCGAGTCGATTATGGAAGCATGTGAAGAATTTTTTGCTGCCTCCTTGACATCTATGGCTGACGACGATGTTAGTGAAAACAACGCAATCGAGGAACGCGAAGAGTTCAAATTCTTTTTAAAGCTGTTCATTGAGAATGAAAGCTTGAGAAGATATTACAAGAACAAGTATGATGATGGAGAATTTTCGTGTTTAGTTTGTGAAGGAGCGGGAAAGAAAACGTTGAGGAGTTTTAAGACGTGCGTTCGCCTTCTCCGACATACAACTTATCCTGGGAAGAACAAAACAGGGAAAAAACGGGTTAAGCCTCACATTGCTAAGATGTTGAAAGTAAAGATGCTGGCTCATAGAGCATATAGTTTAGTTATATGCCAGGTTCTTGGTTGGGACATAGAAAAGCTTCCTGCAATCGTGTTAAAAGGCGAAGGCCATGGTTGTTCGTTAAAGAAGCTAGACGTGTTGAAAGACGACCCGGTTGGCAATGCAGGTGATAATACGAACGAAGTAGATGATCCTGTGAGAGATGACTCTACTGAGATCGACTAA
Protein sequence
MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKPRLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIEACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNSEDGLFCCLVCDGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLANSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTAEDLISGEKTKNNDSSAVVTECRKHVVSADELIQLNVLQVPESIMEACEEFFAASLTSMADDDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTCVRLLRHTTYPGKNKTGKKRVKPHIAKMLKVKMLAHRAYSLVICQVLGWDIEKLPAIVLKGEGHGCSLKKLDVLKDDPVGNAGDNTNEVDDPVRDDSTEID
Homology
BLAST of Cp4.1LG06g06470 vs. NCBI nr
Match:
KAG6591921.1 (hypothetical protein SDJN03_14267, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 998 bits (2580), Expect = 0.0
Identity = 497/516 (96.32%), Postives = 502/516 (97.29%), Query Frame = 0
Query: 1 MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP 60
MNPYSEERLTEEVLYLHSLW RGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP
Sbjct: 1 MNPYSEERLTEEVLYLHSLWWRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP 60
Query: 61 RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE 120
RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEER NRVALQLQYKGIE
Sbjct: 61 RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERGNRVALQLQYKGIE 120
Query: 121 ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNSEDGLF 180
ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKN EDGLF
Sbjct: 121 ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNCEDGLF 180
Query: 181 CCLVCDGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI 240
CCLVC GMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI
Sbjct: 181 CCLVCGGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI 240
Query: 241 VLNGEPLSRSLANSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA 300
VLNGEPLSRSLANSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA
Sbjct: 241 VLNGEPLSRSLANSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA 300
Query: 301 EDLISGEKTKNNDSSAVVTECRKHVVSADELIQLNVLQVPESIMEACEEFFAASLTSMAD 360
EDLISGEKTKN+DSS VVTECRKHVVS+DELIQL+VL VPESI EACEEFFAA LTSMAD
Sbjct: 301 EDLISGEKTKNDDSSVVVTECRKHVVSSDELIQLDVLHVPESITEACEEFFAAFLTSMAD 360
Query: 361 DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC 420
DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC
Sbjct: 361 DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC 420
Query: 421 VRLLRHTTYPGKNKTGKKRVKPHIAKMLKVKMLAHRAYSLVICQVLGWDIEKLPAIVLKG 480
VRLLRHTTYPGKNKTGKKRVKPHIAKMLK+KMLAHRAYSLVICQVLGWDIEKLPAIVLK
Sbjct: 421 VRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKD 480
Query: 481 EGHGCSLKKLDVLK---DDPVGNAGDNTNEVDDPVR 513
EGHGCSL KLDVLK DDPVGNAGDN NEVDDPV+
Sbjct: 481 EGHGCSLTKLDVLKELQDDPVGNAGDNMNEVDDPVK 516
BLAST of Cp4.1LG06g06470 vs. NCBI nr
Match:
XP_023535254.1 (uncharacterized protein LOC111796743 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 972 bits (2513), Expect = 0.0
Identity = 489/520 (94.04%), Postives = 489/520 (94.04%), Query Frame = 0
Query: 1 MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP 60
MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP
Sbjct: 1 MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP 60
Query: 61 RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE 120
RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE
Sbjct: 61 RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE 120
Query: 121 ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNSEDGLF 180
ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNSEDGLF
Sbjct: 121 ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNSEDGLF 180
Query: 181 CCLVCDGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI 240
CCLVCDGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI
Sbjct: 181 CCLVCDGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI 240
Query: 241 VLNGEPLSRSLANSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA 300
VLNGEPLSRSLANSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA
Sbjct: 241 VLNGEPLSRSLANSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA 300
Query: 301 EDLISGEKTKNNDSSAVVTECRKHVVSADELIQLNVLQVPESIMEACEEFFAASLTSMAD 360
EDLISGE VPESIMEACEEFFAASLTSMAD
Sbjct: 301 EDLISGE-------------------------------VPESIMEACEEFFAASLTSMAD 360
Query: 361 DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC 420
DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC
Sbjct: 361 DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC 420
Query: 421 VRLLRHTTYPGKNKTGKKRVKPHIAKMLKVKMLAHRAYSLVICQVLGWDIEKLPAIVLKG 480
VRLLRHTTYPGKNKTGKKRVKPHIAKMLKVKMLAHRAYSLVICQVLGWDIEKLPAIVLKG
Sbjct: 421 VRLLRHTTYPGKNKTGKKRVKPHIAKMLKVKMLAHRAYSLVICQVLGWDIEKLPAIVLKG 480
Query: 481 EGHGCSLKKLDVLKDDPVGNAGDNTNEVDDPVRDDSTEID 520
EGHGCSLKKLDVLKDDPVGNAGDNTNEVDDPVRDDSTEID
Sbjct: 481 EGHGCSLKKLDVLKDDPVGNAGDNTNEVDDPVRDDSTEID 489
BLAST of Cp4.1LG06g06470 vs. NCBI nr
Match:
KAG7024795.1 (hypothetical protein SDJN02_13614, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 971 bits (2510), Expect = 0.0
Identity = 480/494 (97.17%), Postives = 485/494 (98.18%), Query Frame = 0
Query: 1 MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP 60
MNPYSEERLTEEVLYLHSLW RGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP
Sbjct: 1 MNPYSEERLTEEVLYLHSLWWRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP 60
Query: 61 RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE 120
RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEER NRVALQLQYKGIE
Sbjct: 61 RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERGNRVALQLQYKGIE 120
Query: 121 ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNSEDGLF 180
ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKN EDGLF
Sbjct: 121 ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNCEDGLF 180
Query: 181 CCLVCDGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI 240
CCLVC GMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI
Sbjct: 181 CCLVCGGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI 240
Query: 241 VLNGEPLSRSLANSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA 300
VLNGEPLSRSLANSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA
Sbjct: 241 VLNGEPLSRSLANSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA 300
Query: 301 EDLISGEKTKNNDSSAVVTECRKHVVSADELIQLNVLQVPESIMEACEEFFAASLTSMAD 360
EDLISGEKTKN+DSS VVTECRKHVVS+DELIQL+VL VPESI EACEEFFAA LTSMAD
Sbjct: 301 EDLISGEKTKNDDSSVVVTECRKHVVSSDELIQLDVLHVPESITEACEEFFAAFLTSMAD 360
Query: 361 DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC 420
DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC
Sbjct: 361 DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC 420
Query: 421 VRLLRHTTYPGKNKTGKKRVKPHIAKMLKVKMLAHRAYSLVICQVLGWDIEKLPAIVLKG 480
VRLLRHTTYPGKNKTGKKRVKPHIAKMLK+KMLAHRAYSLVICQVLGWDIEKLPAI+LKG
Sbjct: 421 VRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIMLKG 480
Query: 481 EGHGCSLKKLDVLK 494
EGHGCSL KLDVLK
Sbjct: 481 EGHGCSLTKLDVLK 494
BLAST of Cp4.1LG06g06470 vs. NCBI nr
Match:
XP_023535255.1 (uncharacterized protein LOC111796743 isoform X2 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 963 bits (2490), Expect = 0.0
Identity = 487/520 (93.65%), Postives = 487/520 (93.65%), Query Frame = 0
Query: 1 MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP 60
MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP
Sbjct: 1 MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP 60
Query: 61 RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE 120
RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE
Sbjct: 61 RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE 120
Query: 121 ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNSEDGLF 180
ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNSEDGLF
Sbjct: 121 ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNSEDGLF 180
Query: 181 CCLVCDGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI 240
CCLVCDGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI
Sbjct: 181 CCLVCDGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI 240
Query: 241 VLNGEPLSRSLANSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA 300
VLNGEPLSRSLANSGDFK PEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA
Sbjct: 241 VLNGEPLSRSLANSGDFK--PEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA 300
Query: 301 EDLISGEKTKNNDSSAVVTECRKHVVSADELIQLNVLQVPESIMEACEEFFAASLTSMAD 360
EDLISGE VPESIMEACEEFFAASLTSMAD
Sbjct: 301 EDLISGE-------------------------------VPESIMEACEEFFAASLTSMAD 360
Query: 361 DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC 420
DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC
Sbjct: 361 DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC 420
Query: 421 VRLLRHTTYPGKNKTGKKRVKPHIAKMLKVKMLAHRAYSLVICQVLGWDIEKLPAIVLKG 480
VRLLRHTTYPGKNKTGKKRVKPHIAKMLKVKMLAHRAYSLVICQVLGWDIEKLPAIVLKG
Sbjct: 421 VRLLRHTTYPGKNKTGKKRVKPHIAKMLKVKMLAHRAYSLVICQVLGWDIEKLPAIVLKG 480
Query: 481 EGHGCSLKKLDVLKDDPVGNAGDNTNEVDDPVRDDSTEID 520
EGHGCSLKKLDVLKDDPVGNAGDNTNEVDDPVRDDSTEID
Sbjct: 481 EGHGCSLKKLDVLKDDPVGNAGDNTNEVDDPVRDDSTEID 487
BLAST of Cp4.1LG06g06470 vs. NCBI nr
Match:
XP_022937203.1 (uncharacterized protein LOC111443568 isoform X1 [Cucurbita moschata])
HSP 1 Score: 954 bits (2467), Expect = 0.0
Identity = 480/520 (92.31%), Postives = 483/520 (92.88%), Query Frame = 0
Query: 1 MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP 60
MNPYSEERLTEEVLYLHSLW+RGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP
Sbjct: 1 MNPYSEERLTEEVLYLHSLWQRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP 60
Query: 61 RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE 120
RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE
Sbjct: 61 RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE 120
Query: 121 ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNSEDGLF 180
ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKN EDGLF
Sbjct: 121 ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNCEDGLF 180
Query: 181 CCLVCDGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI 240
CCLVC GMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI
Sbjct: 181 CCLVCGGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI 240
Query: 241 VLNGEPLSRSLANSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA 300
VLNGEPLSRSLA SGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA
Sbjct: 241 VLNGEPLSRSLATSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA 300
Query: 301 EDLISGEKTKNNDSSAVVTECRKHVVSADELIQLNVLQVPESIMEACEEFFAASLTSMAD 360
EDLISGE VPESI EACEEFFAA LTSMAD
Sbjct: 301 EDLISGE-------------------------------VPESITEACEEFFAAFLTSMAD 360
Query: 361 DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC 420
DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC
Sbjct: 361 DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC 420
Query: 421 VRLLRHTTYPGKNKTGKKRVKPHIAKMLKVKMLAHRAYSLVICQVLGWDIEKLPAIVLKG 480
VRLLRHTTYPGKNKTGKKRVKPHIAKMLK+KMLAHRAYSLVICQVLGWDIEKLPAIVLKG
Sbjct: 421 VRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKG 480
Query: 481 EGHGCSLKKLDVLKDDPVGNAGDNTNEVDDPVRDDSTEID 520
EGHGCSL KLDVLKD+PVGNAGDNTNEVDDPVRDDSTEID
Sbjct: 481 EGHGCSLTKLDVLKDNPVGNAGDNTNEVDDPVRDDSTEID 489
BLAST of Cp4.1LG06g06470 vs. ExPASy TrEMBL
Match:
A0A6J1FFD4 (uncharacterized protein LOC111443568 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111443568 PE=4 SV=1)
HSP 1 Score: 954 bits (2467), Expect = 0.0
Identity = 480/520 (92.31%), Postives = 483/520 (92.88%), Query Frame = 0
Query: 1 MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP 60
MNPYSEERLTEEVLYLHSLW+RGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP
Sbjct: 1 MNPYSEERLTEEVLYLHSLWQRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP 60
Query: 61 RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE 120
RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE
Sbjct: 61 RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE 120
Query: 121 ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNSEDGLF 180
ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKN EDGLF
Sbjct: 121 ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNCEDGLF 180
Query: 181 CCLVCDGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI 240
CCLVC GMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI
Sbjct: 181 CCLVCGGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI 240
Query: 241 VLNGEPLSRSLANSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA 300
VLNGEPLSRSLA SGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA
Sbjct: 241 VLNGEPLSRSLATSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA 300
Query: 301 EDLISGEKTKNNDSSAVVTECRKHVVSADELIQLNVLQVPESIMEACEEFFAASLTSMAD 360
EDLISGE VPESI EACEEFFAA LTSMAD
Sbjct: 301 EDLISGE-------------------------------VPESITEACEEFFAAFLTSMAD 360
Query: 361 DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC 420
DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC
Sbjct: 361 DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC 420
Query: 421 VRLLRHTTYPGKNKTGKKRVKPHIAKMLKVKMLAHRAYSLVICQVLGWDIEKLPAIVLKG 480
VRLLRHTTYPGKNKTGKKRVKPHIAKMLK+KMLAHRAYSLVICQVLGWDIEKLPAIVLKG
Sbjct: 421 VRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKG 480
Query: 481 EGHGCSLKKLDVLKDDPVGNAGDNTNEVDDPVRDDSTEID 520
EGHGCSL KLDVLKD+PVGNAGDNTNEVDDPVRDDSTEID
Sbjct: 481 EGHGCSLTKLDVLKDNPVGNAGDNTNEVDDPVRDDSTEID 489
BLAST of Cp4.1LG06g06470 vs. ExPASy TrEMBL
Match:
A0A6J1FAI7 (uncharacterized protein LOC111443568 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111443568 PE=4 SV=1)
HSP 1 Score: 946 bits (2444), Expect = 0.0
Identity = 478/520 (91.92%), Postives = 481/520 (92.50%), Query Frame = 0
Query: 1 MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP 60
MNPYSEERLTEEVLYLHSLW+RGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP
Sbjct: 1 MNPYSEERLTEEVLYLHSLWQRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP 60
Query: 61 RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE 120
RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE
Sbjct: 61 RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE 120
Query: 121 ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNSEDGLF 180
ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKN EDGLF
Sbjct: 121 ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNCEDGLF 180
Query: 181 CCLVCDGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI 240
CCLVC GMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI
Sbjct: 181 CCLVCGGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI 240
Query: 241 VLNGEPLSRSLANSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA 300
VLNGEPLSRSLA SGDFK PEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA
Sbjct: 241 VLNGEPLSRSLATSGDFK--PEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA 300
Query: 301 EDLISGEKTKNNDSSAVVTECRKHVVSADELIQLNVLQVPESIMEACEEFFAASLTSMAD 360
EDLISGE VPESI EACEEFFAA LTSMAD
Sbjct: 301 EDLISGE-------------------------------VPESITEACEEFFAAFLTSMAD 360
Query: 361 DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC 420
DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC
Sbjct: 361 DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC 420
Query: 421 VRLLRHTTYPGKNKTGKKRVKPHIAKMLKVKMLAHRAYSLVICQVLGWDIEKLPAIVLKG 480
VRLLRHTTYPGKNKTGKKRVKPHIAKMLK+KMLAHRAYSLVICQVLGWDIEKLPAIVLKG
Sbjct: 421 VRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKG 480
Query: 481 EGHGCSLKKLDVLKDDPVGNAGDNTNEVDDPVRDDSTEID 520
EGHGCSL KLDVLKD+PVGNAGDNTNEVDDPVRDDSTEID
Sbjct: 481 EGHGCSLTKLDVLKDNPVGNAGDNTNEVDDPVRDDSTEID 487
BLAST of Cp4.1LG06g06470 vs. ExPASy TrEMBL
Match:
A0A6J1IMA4 (uncharacterized protein LOC111476868 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111476868 PE=4 SV=1)
HSP 1 Score: 916 bits (2367), Expect = 0.0
Identity = 463/520 (89.04%), Postives = 472/520 (90.77%), Query Frame = 0
Query: 1 MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP 60
MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAATNKRPRD KNR+QKKKK
Sbjct: 1 MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAATNKRPRDPKNRRQKKKKS 60
Query: 61 RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE 120
R EPLQDTGPEWP PEPVQNQP TSSGWPPMPCATPAARLVSSEERANRVALQLQY GIE
Sbjct: 61 RPEPLQDTGPEWPFPEPVQNQPLTSSGWPPMPCATPAARLVSSEERANRVALQLQYNGIE 120
Query: 121 ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNSEDGLF 180
ACRRFL RNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNSEDGLF
Sbjct: 121 ACRRFLTRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNSEDGLF 180
Query: 181 CCLVCDGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI 240
CCLVC GMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQA+CRVFGWDIDRLPTI
Sbjct: 181 CCLVCGGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAICRVFGWDIDRLPTI 240
Query: 241 VLNGEPLSRSLANSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA 300
VLNGEPLSRSLA+SGDFKDQPEE+QVAEEHDSWV ENVAI ND+IDMKNEQKWEEEKTA
Sbjct: 241 VLNGEPLSRSLAHSGDFKDQPEEDQVAEEHDSWVQIENVAISNDDIDMKNEQKWEEEKTA 300
Query: 301 EDLISGEKTKNNDSSAVVTECRKHVVSADELIQLNVLQVPESIMEACEEFFAASLTSMAD 360
E+ ISGE VPESIMEACEEFFAA LTSMAD
Sbjct: 301 EESISGE-------------------------------VPESIMEACEEFFAAFLTSMAD 360
Query: 361 DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC 420
DDVSENNAIEE EEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVC+GAGKKTLRSFKTC
Sbjct: 361 DDVSENNAIEECEEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCKGAGKKTLRSFKTC 420
Query: 421 VRLLRHTTYPGKNKTGKKRVKPHIAKMLKVKMLAHRAYSLVICQVLGWDIEKLPAIVLKG 480
VRLLRHTTY GKNKTG KRVKPHIAKMLK+KMLAHRAYSLVICQVLGWDIEKLPAIVLKG
Sbjct: 421 VRLLRHTTYTGKNKTGNKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKG 480
Query: 481 EGHGCSLKKLDVLKDDPVGNAGDNTNEVDDPVRDDSTEID 520
EGHGCSL KLDVLKDDPVGNAGDNTNEVDDPV+DDSTEID
Sbjct: 481 EGHGCSLTKLDVLKDDPVGNAGDNTNEVDDPVKDDSTEID 489
BLAST of Cp4.1LG06g06470 vs. ExPASy TrEMBL
Match:
A0A6J1INL5 (uncharacterized protein LOC111476868 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111476868 PE=4 SV=1)
HSP 1 Score: 907 bits (2344), Expect = 0.0
Identity = 461/520 (88.65%), Postives = 470/520 (90.38%), Query Frame = 0
Query: 1 MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKP 60
MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAATNKRPRD KNR+QKKKK
Sbjct: 1 MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAATNKRPRDPKNRRQKKKKS 60
Query: 61 RLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRVALQLQYKGIE 120
R EPLQDTGPEWP PEPVQNQP TSSGWPPMPCATPAARLVSSEERANRVALQLQY GIE
Sbjct: 61 RPEPLQDTGPEWPFPEPVQNQPLTSSGWPPMPCATPAARLVSSEERANRVALQLQYNGIE 120
Query: 121 ACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNSEDGLF 180
ACRRFL RNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNSEDGLF
Sbjct: 121 ACRRFLTRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNSEDGLF 180
Query: 181 CCLVCDGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTI 240
CCLVC GMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQA+CRVFGWDIDRLPTI
Sbjct: 181 CCLVCGGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAICRVFGWDIDRLPTI 240
Query: 241 VLNGEPLSRSLANSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA 300
VLNGEPLSRSLA+SGDFK PEE+QVAEEHDSWV ENVAI ND+IDMKNEQKWEEEKTA
Sbjct: 241 VLNGEPLSRSLAHSGDFK--PEEDQVAEEHDSWVQIENVAISNDDIDMKNEQKWEEEKTA 300
Query: 301 EDLISGEKTKNNDSSAVVTECRKHVVSADELIQLNVLQVPESIMEACEEFFAASLTSMAD 360
E+ ISGE VPESIMEACEEFFAA LTSMAD
Sbjct: 301 EESISGE-------------------------------VPESIMEACEEFFAAFLTSMAD 360
Query: 361 DDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTC 420
DDVSENNAIEE EEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVC+GAGKKTLRSFKTC
Sbjct: 361 DDVSENNAIEECEEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCKGAGKKTLRSFKTC 420
Query: 421 VRLLRHTTYPGKNKTGKKRVKPHIAKMLKVKMLAHRAYSLVICQVLGWDIEKLPAIVLKG 480
VRLLRHTTY GKNKTG KRVKPHIAKMLK+KMLAHRAYSLVICQVLGWDIEKLPAIVLKG
Sbjct: 421 VRLLRHTTYTGKNKTGNKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKG 480
Query: 481 EGHGCSLKKLDVLKDDPVGNAGDNTNEVDDPVRDDSTEID 520
EGHGCSL KLDVLKDDPVGNAGDNTNEVDDPV+DDSTEID
Sbjct: 481 EGHGCSLTKLDVLKDDPVGNAGDNTNEVDDPVKDDSTEID 487
BLAST of Cp4.1LG06g06470 vs. ExPASy TrEMBL
Match:
A0A1S3CJZ0 (uncharacterized protein LOC103501816 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103501816 PE=4 SV=1)
HSP 1 Score: 587 bits (1514), Expect = 2.63e-204
Identity = 336/543 (61.88%), Postives = 392/543 (72.19%), Query Frame = 0
Query: 1 MNPYSEERLTEEVLYLHSLWRRGPPRGPKPTRYYLSTAVAAA--TNKRPRDT---KNRKQ 60
M+PYS+ERLT+EVLYLHSLW RGPPR PKPT + STAVA +NKRP D KN+ +
Sbjct: 1 MDPYSDERLTKEVLYLHSLWHRGPPRNPKPTHDHSSTAVADPNPSNKRPIDPDRRKNKNK 60
Query: 61 KKKKPRLEPLQDTGPEWPCPEPVQNQPSTSSGWPPM-PCATPAARLVSSEERANRVALQL 120
KKKKPR +P QD+GPEWPCPEPVQNQPSTSSGWPP+ P ATPAA+LVSSEER N ALQL
Sbjct: 61 KKKKPRSDPPQDSGPEWPCPEPVQNQPSTSSGWPPIQPVATPAAQLVSSEERKNLAALQL 120
Query: 121 QYKGIEACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKN 180
QYKG +ACR+F RNADSGSDEE EEEE +DGE+MES+EY FFL +F+EN+ELR YYEKN
Sbjct: 121 QYKGSDACRKFFARNADSGSDEEEEEEEEDDGEMMESKEYTFFLKMFVENEELRVYYEKN 180
Query: 181 SEDGLFCCLVCDGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDI 240
E GLFCCLVC GMGKKK GK+FKNC+ LV HS SIS TKKK AHRAFG V RVFGWDI
Sbjct: 181 CESGLFCCLVCVGMGKKKFGKKFKNCLALVQHSISISGTKKKRAHRAFGHVVSRVFGWDI 240
Query: 241 DRLPTIVLNGEPLSRSLANSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKW 300
DRLPTIVL GEPLSRSLANSGD K QPEE V + NE V++ +E +EQK
Sbjct: 241 DRLPTIVLKGEPLSRSLANSGDLKVQPEEIHVDNK------NEVVSVSVNE----DEQKL 300
Query: 301 EEEKTAED-------LISGEKTKNNDSSAVVTECRKHVVSADELI--------QLNVLQV 360
EE KTAED LISGE ND + T+ + V +AD I +++ L V
Sbjct: 301 EEVKTAEDPTSNSKDLISGE----NDDAYKDTDVKLQVENADNSISGMGESNGEMDNLHV 360
Query: 361 PESIMEACEEFFAASLTSMADDDVSENNAI---EEREEFKFFLKLFIENESLRRYYKNKY 420
+I+ AC+EF AA SM DDDVSE + EEREEFKFFLKLF ENE+LRRYY+N Y
Sbjct: 361 --TILRACKEFQAAFFRSMNDDDVSEKESTDGAEEREEFKFFLKLFTENENLRRYYENHY 420
Query: 421 DDGEFSCLVCEGAGKKTLRSFKTCVRLLRHTTYPGKNKTGKKRVKPHIAKMLKVKMLAHR 480
DGEF+CL CE AG+K ++ FKTC RLL+H+T GKN K+ KP K+LK+ MLAHR
Sbjct: 421 GDGEFTCLACEVAGRK-VKCFKTCSRLLQHSTQLGKNNIEKQGQKPQKTKVLKMGMLAHR 480
Query: 481 AYSLVICQVLGWDIEKLPAIVLKGEGHGCSLKKLDVLKDDPVGNAGDNTNEVDDPVRDDS 519
AY+ V+C+VLG DI+ LPAIVL GE G SL K DV K + ++ DD V DDS
Sbjct: 481 AYTSVVCKVLGCDIKMLPAIVLNGEALGLSLTKSDVSKLQDKSDVQMQSSNADDIVEDDS 526
BLAST of Cp4.1LG06g06470 vs. TAIR 10
Match:
AT1G78810.1 (unknown protein; Has 75 Blast hits to 52 proteins in 16 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 2; Plants - 66; Viruses - 0; Other Eukaryotes - 3 (source: NCBI BLink). )
HSP 1 Score: 207.2 bits (526), Expect = 3.1e-53
Identity = 161/520 (30.96%), Postives = 241/520 (46.35%), Query Frame = 0
Query: 1 MNPYSEERLTEEVLYLHSLWRRGPP-RGPKPTRYY---------------------LSTA 60
MN Y +E L +EV+YLHSLW +GPP R P P+ + L +
Sbjct: 2 MNIYDDESLKQEVIYLHSLWHQGPPTRKPIPSPNFNLIHDPIQRPRPNYIPPSDLQLLSR 61
Query: 61 VAAAT----NKRPRDTKNRKQKKKKPRLEPLQDTGPEWPCPEPVQNQPSTSSGWPP-MPC 120
A T ++ P + +N K+PR D+G EWP + V PST SGWP PC
Sbjct: 62 YGAVTPQIISRNPNNPQNLYNNNKRPR----PDSGREWPVND-VPQPPSTGSGWPEYRPC 121
Query: 121 ATPAARLVSSEERANRVALQLQYKGIEACRRFLIRNAD------SGSDEEVEEEEGNDGE 180
R +S+EE+ A LQ CR F R + +G DE E +EG++ +
Sbjct: 122 --KKTRPISAEEKEKLAANMLQRDIHRTCREFFGRKSGEEDSSVAGGDES-EIDEGDEDQ 181
Query: 181 IME------SEEYKFFLNLFMENDELRGYYEKNSEDGLFCCLVCDGMGKKKSGKRFKNCI 240
+E S+E++F +F EN +L+ YYEKN+ +G F CLVC G+G +KS ++FK+C+
Sbjct: 182 SLEKEESSSSKEFQFLSRVFEENVKLKEYYEKNTGNGEFWCLVCGGIG-EKSCRKFKSCL 241
Query: 241 GLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLANSGDFKDQP 300
L+ HS +I +T K+ HRA Q VC V GWD+
Sbjct: 242 ALIQHSLTIHKTDLKIQHRALAQVVCNVLGWDV--------------------------- 301
Query: 301 EENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTAEDLISGEKTKNNDSSAVVTEC 360
N +K ++ ++ G +DS + +
Sbjct: 302 ----------------------------NNPVVSSQKDSQTVVEGASEPPSDSK--IPQE 361
Query: 361 RKHVVSADELIQLNVLQVPESIMEACEEFFAASLTSMADDDVSENNAIEEREEFKFFLKL 420
++ V+S +E + VLQ+ ++ EA ++ F T A D EN EE + K+
Sbjct: 362 KQQVMSVEEHAKAAVLQMQQNASEALKDIFVKDGTGAA-DGTEENGDENLSEELELISKV 421
Query: 421 FIENESLRRYYKNKYDDGEFSCLVCEGA-GKKTLRSFKTCVRLLRHTTYPGKNKTGKKRV 480
F EN L+ YY+ Y+ G F CLVC A KK L+ FK C +++H T
Sbjct: 422 FSENVELKSYYEKNYEGGAFICLVCCAATDKKMLKRFKHCYGVVQHCT------------ 437
BLAST of Cp4.1LG06g06470 vs. TAIR 10
Match:
AT1G78810.2 (unknown protein; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )
HSP 1 Score: 207.2 bits (526), Expect = 3.1e-53
Identity = 161/520 (30.96%), Postives = 241/520 (46.35%), Query Frame = 0
Query: 1 MNPYSEERLTEEVLYLHSLWRRGPP-RGPKPTRYY---------------------LSTA 60
MN Y +E L +EV+YLHSLW +GPP R P P+ + L +
Sbjct: 2 MNIYDDESLKQEVIYLHSLWHQGPPTRKPIPSPNFNLIHDPIQRPRPNYIPPSDLQLLSR 61
Query: 61 VAAAT----NKRPRDTKNRKQKKKKPRLEPLQDTGPEWPCPEPVQNQPSTSSGWPP-MPC 120
A T ++ P + +N K+PR D+G EWP + V PST SGWP PC
Sbjct: 62 YGAVTPQIISRNPNNPQNLYNNNKRPR----PDSGREWPVND-VPQPPSTGSGWPEYRPC 121
Query: 121 ATPAARLVSSEERANRVALQLQYKGIEACRRFLIRNAD------SGSDEEVEEEEGNDGE 180
R +S+EE+ A LQ CR F R + +G DE E +EG++ +
Sbjct: 122 --KKTRPISAEEKEKLAANMLQRDIHRTCREFFGRKSGEEDSSVAGGDES-EIDEGDEDQ 181
Query: 181 IME------SEEYKFFLNLFMENDELRGYYEKNSEDGLFCCLVCDGMGKKKSGKRFKNCI 240
+E S+E++F +F EN +L+ YYEKN+ +G F CLVC G+G +KS ++FK+C+
Sbjct: 182 SLEKEESSSSKEFQFLSRVFEENVKLKEYYEKNTGNGEFWCLVCGGIG-EKSCRKFKSCL 241
Query: 241 GLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLANSGDFKDQP 300
L+ HS +I +T K+ HRA Q VC V GWD+
Sbjct: 242 ALIQHSLTIHKTDLKIQHRALAQVVCNVLGWDV--------------------------- 301
Query: 301 EENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTAEDLISGEKTKNNDSSAVVTEC 360
N +K ++ ++ G +DS + +
Sbjct: 302 ----------------------------NNPVVSSQKDSQTVVEGASEPPSDSK--IPQE 361
Query: 361 RKHVVSADELIQLNVLQVPESIMEACEEFFAASLTSMADDDVSENNAIEEREEFKFFLKL 420
++ V+S +E + VLQ+ ++ EA ++ F T A D EN EE + K+
Sbjct: 362 KQQVMSVEEHAKAAVLQMQQNASEALKDIFVKDGTGAA-DGTEENGDENLSEELELISKV 421
Query: 421 FIENESLRRYYKNKYDDGEFSCLVCEGA-GKKTLRSFKTCVRLLRHTTYPGKNKTGKKRV 480
F EN L+ YY+ Y+ G F CLVC A KK L+ FK C +++H T
Sbjct: 422 FSENVELKSYYEKNYEGGAFICLVCCAATDKKMLKRFKHCYGVVQHCT------------ 437
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
KAG6591921.1 | 0.0 | 96.32 | hypothetical protein SDJN03_14267, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
XP_023535254.1 | 0.0 | 94.04 | uncharacterized protein LOC111796743 isoform X1 [Cucurbita pepo subsp. pepo] | [more] |
KAG7024795.1 | 0.0 | 97.17 | hypothetical protein SDJN02_13614, partial [Cucurbita argyrosperma subsp. argyro... | [more] |
XP_023535255.1 | 0.0 | 93.65 | uncharacterized protein LOC111796743 isoform X2 [Cucurbita pepo subsp. pepo] | [more] |
XP_022937203.1 | 0.0 | 92.31 | uncharacterized protein LOC111443568 isoform X1 [Cucurbita moschata] | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1FFD4 | 0.0 | 92.31 | uncharacterized protein LOC111443568 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1FAI7 | 0.0 | 91.92 | uncharacterized protein LOC111443568 isoform X2 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1IMA4 | 0.0 | 89.04 | uncharacterized protein LOC111476868 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1INL5 | 0.0 | 88.65 | uncharacterized protein LOC111476868 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A1S3CJZ0 | 2.63e-204 | 61.88 | uncharacterized protein LOC103501816 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
Match Name | E-value | Identity | Description | |
AT1G78810.1 | 3.1e-53 | 30.96 | unknown protein; Has 75 Blast hits to 52 proteins in 16 species: Archae - 0; Bac... | [more] |
AT1G78810.2 | 3.1e-53 | 30.96 | unknown protein; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae ... | [more] |