Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TAATTTAAATTTTATACAACGACTTTGAGAACCTTTAAAAAAGGCCACGTTTGATTGACCCATTCTAATCCGCCATTGAAATCTGAATGTCAAGCTTTTTTTCCCCTCTCCCTCTCTCTCTCCTCCAGACAGATTCACAATTCCAATCAATCTCCGGGTTCAATCTTTCTCTCAATCGCACGAATTTCTCTGAAGAAGGAGAAGGAGAAGAAGAAAAAGGTTTTTGTTGTTTCCTTTGGATCCAGGAAAATGGACGTGGTATTCTTCTGTAGATACATCTTTAATTTTCAGTTGCTTCCTCGTATTTAACGTTTTAACGGTGAAGAATTATCTTTCTATTGCAGGAATCTCCTAATGTGGATCTTCCGACGACAGACAAAGAAATCGTACCGGAGAAGATTGAGGATGAAGAGATCAAAGAGCCTTTGATTCATTGCGAGCTCTGCGATGCGGAAATTGTTCATAAACTTGCTCAGGTTCTTCTTCCTGGATTGTCCACCGCTTGTGTCGATAATACGAGTGGCGATATTTTTCGAACCCCTGGTTCAGTGGCTGCCGATATGAGGAAAGAAATGGTGGATTATCTTACCATGAGGAGCGAAACTTGTGTTGCCGAATCTGTAATTTTAGAGAATGCGTCCGATGCTGAGGTATCTGATCATCCTTACGATATCATCTCTGATTTTGTTGATGATTTTGCTGCTACGAAGAGAAATTTGTTTAGTAGAGTTTCAGGATGGGTTCTAAGTGAGAAAAGAGAGGATAAGATAGATGATTTTGTTCAAGAAATGGACATCAATGGCTTTTGGCCACTTGATAGGAGAGAAGCAATTGCTCAGGTTTTGCTTAAGAATGTGGACTTTAAGAGTGAGTTTCATTGTGATAAGAAATTTCACTCTACAGAAGAACTAGCTGAGCATGTTGAAAACTGTGGGTTTAGGTCCGTAACTTGCACAAATGAAGGCTGCACTGCGAGATTTTGTGCAAGCCACGCAGAACAACATGATTCCATCTGCCCCTTCAAGATAATTTCATGTGAACAGAAGTGTTCTGCCTTTATTATGAGACGTGAAATGGACCGCCATTGCATAACTGTTTGTCCAATGAAGCTTGTGAATTGCCCCTTCCATAATTTGGGTTGTCAATCCCCTGTTCCTTACTCTTTGATAGCGCAGCATTGTTCAGAGAGTTTTGATTCTCATTTGCTGCATATTCTTCACTCTATACACAAGGAAGCCAATGAAGAGACTCTTATACATAGACGGCAACAGCTTGAAGAGGTGAGGTGTTGCAAAGTTATTTGAATTCGTTTATAATTTTGAAATGGGAATTTACTTTGTTGAATCAATGATAGAGTTATCAGACATTAAAATCATTTAGCTCTGACCAAACTGAGAAAAACATGCATATATCAAAGTGTGATCTGATATTAAATGTGTTCATTTTGGTTTATGCTTTGACTGTGATAAGCATATTGATTTTCTGTAGGCATGAATCTCGTAATGATTTTGTATGGTTTGACTGTATGAGTTTTCGATTCTCACTGATCATGAATGTCAGATTTACTCTTCATCTTGACCATAAGGAGGGGTGCATCTTTCGTGATTTAGCTATGGATTAAATATTTGAGTAGTTAGGGGACCTCAATTGTATTTAATCTACTTAACTGCGTTAGACTTGATGAGACAATAAATGGTGCAGATAGGATTTGGAGTTAAATTTGCTCTTTAGGTGGATGTGAATACAAATTTAATGTTAATGAGCTTGATTGACTCTCGTAATATTTGGATGATAGTAAATTGTATGTTAGTTATTGTTGTTGTTGGTTGCTATTTTTGTAATTAATGGTTTCAATTTGTGTGTGTGTATGTATATATGTATATCTATTTCTTTGAGTTATTGATTTGTTCTTGTAACCATAAGATGCATGCCCTCATTATGATTTAATTTATCCTTCCGTTCATTTATATTGGGTAATATCATTGGAAAACCTTAGTTGCAGATGGGATTTGGTATAATTTCTGTCTTAATCTATAAATAATTGGAGGTTCATTCACTTTATCTTATGTTGCTTCAATCTTTTTAATACCTGTTTTTTTAATATTCGTACAGTCAAAGCATAAATCATATGCTTATCACACACATCCTGACACAATGAAATCCAAGGGCCTGGATGAATAGGACGGAGGTATAGAGAGAAAAGATATAGCTAATTAGCTGCGTCCCCTTCTCTTTCTTTTGTGGAAAAGCGTACTCGTTCAGAACTTGGATTCAACCTTCTATAGATACTTCGAATCCATTGAACTTTCTAATTGAGTCATCGCTAGTGGGTTAAGCTCTTAAGAGTGTAGATTAAGGAATTAAATTTATCCTAAAAGTTAAACTTCTAGGAGTACTGATGTTACCAATACTGTATAATCTGGAGTCTATCTCTTCCATTGTGTATCATATAAGACAATGATTGTACTCCTAAGCTGCATTTTCATCAATTGCATGTGTCTTTGTATCTATTCTAATTACAATTCTAATGGTTCTTCCAGGCATCATCACTTGACCACCTTAGAGGGCTTCAAAACTTGAGATTGTTAACCTCAAAAATCAAAGAAATGGATTCTGGGCTAGGACCATTAGTAGTCATCTACAAGGTTGAGGACACAGAAGAAGCGAAGGATGGCTCTGATGAAAGTGACGAAGAGAAGGAGGCTTCTAATGCAACTGAAGATGCCAAGGATGCCTCTAATGCAACAGAAGAAAAGAAGGAAATGTCTAATGGAACTGAAGAAACGAAGGATGCCTCAATTGCAACTGAAGAAAGGAAGAATGTGTCAACTGCAATTGAAGAAAGGGAGGTCATATCTAGTGCAATTGAGGAAACAAAAGATGCTTCTAAGACAACTGGAGAAACAAAGGACAATTCTGATGACAAAGAAGAAGCCAGTGATGCTTCTAATGAAAAAGAAGAAACAAAAGATGAGTCGCATTCAACTGAAGAAACAAAGGATGCTAATGCAGAGGAAAAGATGAAGGATGATTCTGATTCAGAGGAAGAAATGAAGAATGCTTCTGATGTAAAAGAAGAAATGAAGGATAATTCTGATGGAGAGGAAAAAAAGAAGGATGATTCTGATGGAGAGGAAAAACGGAAGGATGATTCTGATTTAGAGGAAGAAAAGAAGGATGATTCTGATGCAAAGGAGGAAGAAGCAAGGAAAGAAATTGAAGAAAGGAAGGAGGTTTCCAATGGAATTGAAGAAAAGAATGGTACCTCAAATCCACCTGAAGGAGAAACCAAGAGTGCTTCGAATGTAGTAAGAAGTGATCCAGATGAAGCTGAAAGATAAAAATAATAAAAAAAAAAATCGACACACCATTTGAGAAACCCTTGGCTTTGCTGAAGAAAGTCCAACTACAGAATGCTTCCTTCCCATCCTCTTTATCCATTAACATCACATGGTCAGTGTCCCTTTCTTCAACTAAAACAAATGGCCCTTCATTTTTCCCTCTTGTTATTCTTTTGTATACATTATGATTCATATTTCCCAAATGCAAAATTGTAAACTACTTTCATTCTACACCTATAAATTATATAGTGTTCCTCTGAACCTGCCTTCAATTTTGACCCTTTTGAGTGGGATGTAAAACCACCAACATGAATATATCTCAAGTGATTAAGATGTTCGAGAAAAGGTTAGCATTGCTTCTAATTACAAAACAACTGCAATGCAACTGATCTACCAAAAATTGGCCCTTTCTGTTTCTTGTATAAGGTGTGTGAACCATAATACCTAACTTTCCAATAGTAGTATATATATTATAACTCGTGTTCATGTTGGTTAACTTTTTGTATTTATTACATATTTCGGTCTTGAATTTTATGTTTTGCTTAGATTAAGTAGAGGCAAATCAATAATTATTTAGCCAAAAAGAACATTAGGACTGAATTACTTTAAAAGACACTATACAATATATTAACCAAAAGCTTTATACTTCAGTCCAACTAAACTAACATGAAAAGAAAAAGGAAAGTAAGGATAAGTATAGTTATGTATGTTACAAAATTTGCAATTGTGCAAAGTATTAGTATTTTATTAATATATTCGTGGTCTATTTTATTAATCTCATTTGGGAGACGATCCTAGATCAGATTTCTCTGTTAGGTTCTTAGTGAATGCTTGATTTTGGGAAGAATCACATTGAAGGGAAGCTTCCTGAGGGCGTTGGGGCTTTGAAGAATCTTCAAATTCTCAATCTCAGAAGCAACTTGATTTCTGGTACAGTGTTTTCTGTTGCCTTCCATAATCTGACTGAACTTCTTGTTGTTGATTTGTCAGAAAATTCTCATTTGTTGAGTGATATTCTTAGTGAGATTGGGAAGCTTGAGAAGCTTGAGAAGCTTAAGAAGCTATGGCTTTACAACTCTGGTTTCTATGGTGAAATCCCTTCTTCCCACCTTCTTCTTTGTTGGGTTTGAGAAGTTTGAGTGTTTTGGATTTCTCAAAACAATCTCACCGGGGAACGCCCTGAAATGTTGGGTTCTTCTTTAAAGAATTTGGTGTCTTTTCATGTTTCTGAGAATAAGCTTGTGGGATCTTTCCCAAATGGGTTTTGTACTGGAAAAGGCATAGTGGGGAAATCTCCTGATTTGAGCCTATCAGGAAGGGTTATCGCAGGCTTCACTTGGTTGGCATTGTGAGTCTTAGTGTTCATACCAATTTTTTTTTAGTGGGAGTTTGCCTAATTCCTTGAACCAGTGCTTGAATCTTGAGAGGTTTCAAGCCCGTTTGATGATGTTTCCGTTCCCTGTTTCCTGTTTACTGTTTATCATTTTTTAAGAAATGGATTTATTTGATAATTGTTCCCGTTTCTTGTTTCTAAATTTTAAGGAACATTTCTAAAAAATGGGCCAAAATTGAGAAACAACAAAAAAATAGTTTCTTCTGTTATTGTTTCTATTTCTATTTTTCCATTACCATAAGTAATTATTGTCAGTTTTTGGTGTATTGGTATTAGGCGCCAATGGTAACTAATCATGAGTTTGAATCCCGTCTGAAGTGATTTTTTACTTTTATTTTTTTTGAAGAATCTGTTTTTCTCATTACATTTTTTATTTTTATTTTTTCTTTATTAGTTTTGTTATCTACAAAAATAATTCTCCTATTTTTATATTTTATTTTAATTTTGAATTTTAAAGTTCTCCAATATTTAGATTTATATATTATTTTAATTTTTAATGTGTTTTTTTATATTTCAAATTTTAACTCCCAAGATGGGCACTTTTATTTATTATTATTTTTTTTTCCTTTTATATTTGTTTCCATCTTTTGTATTTTATTTATTTTCTTTTTTAACTTTTCTATTTACAAAAATAATTTTGCTATATTTATATTTTATTTTAATTTTGAATTTTAAAATTCTCCAATATTTAGATTTATATATTATTTTAATTTTCAATATGTATTAAGTTTTTTTATATTTCAACTTTTGAGTATTAAATTTTTCAATATGTGTGTTTATATATTATTTTAATTCTAATATTTTTAGTATATCTGAAATCGAGTAGAATTTTACCAAATATACATTTTGCTGACTTTGACATTCATTTAGCTTAACTTTAATATTTTACAACCACAATATTTACATAATTATATAATTAAGTTGGATTTGACTCGACATCATTTAAGACAAATGACTTGAGTTGGAGATGACAATATTGATCAAATTTGGGCAACATATGAAAGATGTGATACAACTTTGTTTTTATATTTAGTTGTCATTGAGATTTACTTCTATTAGATGGCATAAACTTTTGATAATATCTTTTTCGATGTTTGATATATGACATATCTTCTTTTTTTATATTATATTGTTAAAAAAAAAAAAAACAAAAAAAAAAAAAAAACAAGAAACAAGAAATGGTTATCAAACAAATTTATGTTTTTGTTTATTTTTTCAAAAAATAGGAAACAGAAATAGTTATCAAACATAGTCTTGCTTCTTATTCTAAAAAACGAAGAAACAGGAAACGAGAAATAGGAAATAAGAAATTGGAAATGAGAATGTTATCAAATGGGCCCTTCAAGTTCAGAACAATGGGTTTTCTGGGGATTTTACTAAAGGTTTGTGGTCATTGCCTAAGATTAAGCTCATCAGAGCTGAAAACAATGGTTTCTCTGATGAAATTCCATAGTCTATATCTATGGCTGCTCAGCTTGATAACAACAGTTTTTCAAGTAAAATACCTCGGGGTCTCGGGTCTATTCGAAGCTTATATCGATTCTCTGCGTTGCTCAATCGCTTTTATGGTGCATTGCCACCAAACATCTGTGATTTGCCATTAATAATGAGTATAGTTAATCTGTCCCACAGTTCTCTCTCTGGTCAAATTCCCGAGCCGAAAAATTGCAAGAGACTGTCTCTTTGTCCTTAGCAGGCAATATTCTTACTCGAGAAATTCCTACTTCCGTTGCAGATCTACCAGTGTTAAATTATCTTGATCTTTCTGATAATAATCTCACTGGTTTGATCCCTCAAGGACTCGAGAACTTAAAGTTGCACTCTTTAATATTTCATTCTCCTTGATTTCGTGGCTACCAGCTTCATTTCTGCAAGGAAATCCTGATCTTTGTGGCCCTGGTTTGCAAAGTCCTTGTTCTCAAGGCCATCCAACAAACCATATGTATGGACTTAAACAAAATGACATGTGCCCTCGTCTCTCTAGCTTGTGTTTTAGGAGTTCTAAGTTTAGCTGCTGGGTTCATTCTGTATTATTGATCCTACAAACCGATATCCCGAGTCGATAACTGGCAAATCTGCTCAAGGATGTGGGAGTACTAAGTTTAGCCAAGTGTTCATGTTAAGCTTACCGCGCTGTGAACTGATCAACGTAAAGAAACTCGTTAATTTTGGGAGTCATTCGTGGAAGTCGTTGAAAGCTGGGGCCAAGAATTTGGCCAAGATCAGGCACAAAAACGTCATCAAAGGAAGCTTGGCTGACTTGATATGCAGAAATGATTCTTGCCTGAATTGGAATGTGAGACTGAGAATTTCTATTGAGGTTGCTCAAGGACTAGCTTACATTAACAAGGACAATGTCCCACATTTACTTCATCGAAATGCCAAATTGTCGCATATTCTATTGGATGCCGACTTTGTCCTGAAGTTCAAGGATTTTGCTCTTCACCATATCGTTGGAGAGTCGCCACTTCACTCGACAGTAGCTTCGGAACCTTCTCATTCCTGCTGTACTGCATCAGGTACACAACTTTATGATCAACAATGAATTAGATTACTAGTGAACTAGGGTGTTTGTAAAACTTGATATTCCAAAAAATCTATCCATCCAATCAAATCTGTTTTGATCAAGTTGGGTTATATTGGATTCTTTTTGTAACTAAGGCCCCATTAGATAACTATTTGCCTTTTAGTTTGTTAGTTTTTAAAAGTTATGTCTATAAACACTAATTTCACCTCTAAATTTGTATGTCCTACCAATGTTTTTAAAAATCAAGCAAATTTTAAAAACTAAAAAAAGTAGCTTTTAATTCTTGTTTGGTAGTCATTTTGTTTTTGGTTTCTAAAAATTAAGCTTATTTCCTCTTTTATCTCTTGCAATGGTTTTCATCTATGTTAAGTAAAAGAGTTTACTTCTAAGCCAAATTCCAAATTCCGAAAACTTGTTTTTAAGAGTTTTTTTTTTTAGTTTTCAAAACATGGTTTGGTTTTTGAACTCATTGGTAGGAAGTAGATAATAAAGCAAGAAATTTAAAGCTAAAATTAGTGTTTATAAGCTTAAATTTCAAGGACGAAAAAGAAAAAACCAAACAATTACGAAACGGGACATAAGTAGTTTTCTACTAGGATCCATAAGACATTGTTCAAAATATGTTGCTAGTATAGAAGTCAAGGAAGTGAAGTGATGTCTAAAGAGTCTTATTTCATCTATCATTTTTGAATGCAGAATATAAATACAACAAAAAGGCAACAGAGCAAATGGATGTGTACAGCTTTGGTGCAGTGTTGCGAGAGCTGGTGACTGGTAGACAAGCCGAGCACTCGGAATCAACGGACTCTCCCGACGTCGTCCAGTGGGTGAGAAGGAAGGTGAACATAGCCAATGGAGCTTCCCAAGTCTTGGACTCGAGCATCATGGAACATTGTCAACGACAAATGTTGGAAGCTCTAGACATTGCCCTCCAATGCACTTCTATAATGCCTGAAAAACGTCCGTCGATGCTTGAAGTCGCCAAGGCCCTTCAATTGGTTGGATTGACGACGAACCTTCACGATGCAGCCTTCTCGGTTGCAGAGGATAGTTTGGTTTCAAGCGAAAGACCTCCTCTTGCTGCATGACTTGTTTTTGTATATCCAAATTCAGATATCTTTGTAGGAGACTTCTTGATTGATGTTTGGGGCATGGTATTTCTGAGTGTGTTCGAGTTTCCTATTTTATTTTGATTTTGAAATGAAGAAGCTTAAATTTTGGTGTAATGTAATTGCTTTGAGGAAGCATATGATAATTTTTGAAGTTATTTATTTGTACGACATTGGAGTAGCAGTAATATGTACTTTTATAAATATTTGACAATACAGAGCATGAGGGACCAGCTGAAGGGTATTGGTGGGGTTTTGTGGCGAAGGGAAGGGTAGGAATGTTTTTTGAGAGGCGGCTCTCTCGAAATGCCCGGAAGTTCCATTTCCATTTCCTTATTTTCTGTCTATCTGGTTATTTTGATCTTGTGACCTTTGGTGGCTTGTAATATCCTTGAGACCATGGGATATCTGTTACTCCTTATTAAATCAAAGACATTGTTGATTAAATATTAATA
mRNA sequence
TAATTTAAATTTTATACAACGACTTTGAGAACCTTTAAAAAAGGCCACGTTTGATTGACCCATTCTAATCCGCCATTGAAATCTGAATGTCAAGCTTTTTTTCCCCTCTCCCTCTCTCTCTCCTCCAGACAGATTCACAATTCCAATCAATCTCCGGGTTCAATCTTTCTCTCAATCGCACGAATTTCTCTGAAGAAGGAGAAGGAGAAGAAGAAAAAGGTTTTTGTTGTTTCCTTTGGATCCAGGAAAATGGACGTGGAATCTCCTAATGTGGATCTTCCGACGACAGACAAAGAAATCGTACCGGAGAAGATTGAGGATGAAGAGATCAAAGAGCCTTTGATTCATTGCGAGCTCTGCGATGCGGAAATTGTTCATAAACTTGCTCAGGTTCTTCTTCCTGGATTGTCCACCGCTTGTGTCGATAATACGAGTGGCGATATTTTTCGAACCCCTGGTTCAGTGGCTGCCGATATGAGGAAAGAAATGGTGGATTATCTTACCATGAGGAGCGAAACTTGTGTTGCCGAATCTGTAATTTTAGAGAATGCGTCCGATGCTGAGGTATCTGATCATCCTTACGATATCATCTCTGATTTTGTTGATGATTTTGCTGCTACGAAGAGAAATTTGTTTAGTAGAGTTTCAGGATGGGTTCTAAGTGAGAAAAGAGAGGATAAGATAGATGATTTTGTTCAAGAAATGGACATCAATGGCTTTTGGCCACTTGATAGGAGAGAAGCAATTGCTCAGGTTTTGCTTAAGAATGTGGACTTTAAGAGTGAGTTTCATTGTGATAAGAAATTTCACTCTACAGAAGAACTAGCTGAGCATGTTGAAAACTGTGGGTTTAGGTCCGTAACTTGCACAAATGAAGGCTGCACTGCGAGATTTTGTGCAAGCCACGCAGAACAACATGATTCCATCTGCCCCTTCAAGATAATTTCATGTGAACAGAAGTGTTCTGCCTTTATTATGAGACGTGAAATGGACCGCCATTGCATAACTGTTTGTCCAATGAAGCTTGTGAATTGCCCCTTCCATAATTTGGGTTGTCAATCCCCTGTTCCTTACTCTTTGATAGCGCAGCATTGTTCAGAGAGTTTTGATTCTCATTTGCTGCATATTCTTCACTCTATACACAAGGAAGCCAATGAAGAGACTCTTATACATAGACGGCAACAGCTTGAAGAGGCATCATCACTTGACCACCTTAGAGGGCTTCAAAACTTGAGATTGTTAACCTCAAAAATCAAAGAAATGGATTCTGGGCTAGGACCATTAGTAGTCATCTACAAGGTTGAGGACACAGAAGAAGCGAAGGATGGCTCTGATGAAAGTGACGAAGAGAAGGAGGCTTCTAATGCAACTGAAGATGCCAAGGATGCCTCTAATGCAACAGAAGAAAAGAAGGAAATGTCTAATGGAACTGAAGAAACGAAGGATGCCTCAATTGCAACTGAAGAAAGGAAGAATGTGTCAACTGCAATTGAAGAAAGGGAGGTCATATCTAGTGCAATTGAGGAAACAAAAGATGCTTCTAAGACAACTGGAGAAACAAAGGACAATTCTGATGACAAAGAAGAAGCCAGTGATGCTTCTAATGAAAAAGAAGAAACAAAAGATGAGTCGCATTCAACTGAAGAAACAAAGGATGCTAATGCAGAGGAAAAGATGAAGGATGATTCTGATTCAGAGGAAGAAATGAAGAATGCTTCTGATGTAAAAGAAGAAATGAAGGATAATTCTGATGGAGAGGAAAAAAAGAAGGATGATTCTGATGGAGAGGAAAAACGGAAGGATGATTCTGATTTAGAGGAAGAAAAGAAGGATGATTCTGATGCAAAGGAGGAAGAAGCAAGGAAAGAAATTGAAGAAAGGAAGGAGGTTTCCAATGGAATTGAAGAAAAGAATGGTACCTCAAATCCACCTGAAGGAGAAACCAAGAGTGCTTCGAATGTAGTAAGAAGTGATCCAGATGAAGCTGAAAGATAAAAATAATAAAAAAAAAAATCGACACACCATTTGAGAAACCCTTGGCTTTGCTGAAGAAAGTCCAACTACAGAATGCTTCCTTCCCATCCTCTTTATCCATTAACATCACATGAAAATTCTCATTTGTTGAGTGATATTCTTAGTGAGATTGGGAAGCTTGAGAAGCTTGAGAAGCTTAAGAAGCTATGGCTTTACAACTCTGGTTTCTATGGTGAAATCCCTTCTTCCCACCTTCTTCTTTGTTGGGTTTGAGAAGTTTGAGTGTTTTGGATTTCTCAAAACAATCTCACCGGGGAACGCCCTGAAATGTTGGGTTCTTCTTTAAAGAATTTGGTGTCTTTTCATGTTTCTGAGAATAAGCTTGTGGGATCTTTCCCAAATGGGTTTTGTACTGGAAAAGGCATAGTGGGGAAATCTCCTGATTTGAGCCTATCAGGAAGGGTTATCGCAGGCTTCACTTGGTTGGCATTGTGAGTCTTAGTGTTCATACCAATTTTTTTTTAGTGGGAGTTTGCCTAATTCCTTGAACCAGTGCTTGAATCTTGAGAGGTTTCAAGCCCGTTTGATGATGTTTCCGTTCCCTGTTTCCTGTTTACTGTTTATCATTTTTTAAGAAATGGATTTATTTGATAATTGTTCCCGTTTCTTGTTTCTAAATTTTAAGGAACATTTCTAAAAAATGGGCCAAAATTGAGAAACAACAAAAAAATAGTTTCTTCTGTTATTGTTTCTATTTCTATTTTTCCATTACCATAAGTAATTATTGTCAGTTTTTGGTGTATTGGTATTAGGCGCCAATGGTAACTAATCATGAGTTTGAATCCCGTCTGAAGTGATTTTTTACTTTTATTTTTTTTGAAGAATCTGTTTTTCTCATTACATTTTTTATTTTTATTTTTTCTTTATTAGTTTTGTTATCTACAAAAATAATTCTCCTATTTTTATATTTTATTTTAATTTTGAATTTTAAAGTTCTCCAATATTTAGATTTATATATTATTTTAATTTTTAATGTGTTTTTTTATATTTCAAATTTTAACTCCCAAGATGGGCACTTTTATTTATTATTATTTTTTTTTCCTTTTATATTTGTTTCCATCTTTTGTATTTTATTTATTTTCTTTTTTAACTTTTCTATTTACAAAAATAATTTTGCTATATTTATATTTTATTTTAATTTTGAATTTTAAAATTCTCCAATATTTAGATTTATATATTATTTTAATTTTCAATATGTATTAAGTTTTTTTATATTTCAACTTTTGAGTATTAAATTTTTCAATATGTGTGTTTATATATTATTTTAATTCTAATATTTTTAGTATATCTGAAATCGAGTAGAATTTTACCAAATATACATTTTGCTGACTTTGACATTCATTTAGCTTAACTTTAATATTTTACAACCACAATATTTACATAATTATATAATTAAGTTGGATTTGACTCGACATCATTTAAGACAAATGACTTGAGTTGGAGATGACAATATTGATCAAATTTGGGCAACATATGAAAGATGTGATACAACTTTGTTTTTATATTTAGTTGTCATTGAGATTTACTTCTATTAGATGGCATAAACTTTTGATAATATCTTTTTCGATGTTTGATATATGACATATCTTCTTTTTTTATATTATATTGTTAAAAAAAAAAAAAACAAAAAAAAAAAAAAAACAAGAAACAAGAAATGGTTATCAAACAAATTTATGTTTTTGTTTATTTTTTCAAAAAATAGGAAACAGAAATAGTTATCAAACATAGTCTTGCTTCTTATTCTAAAAAACGAAGAAACAGGAAACGAGAAATAGGAAATAAGAAATTGGAAATGAGAATGTTATCAAATGGGCCCTTCAAGTTCAGAACAATGGGTTTTCTGGGGATTTTACTAAAGGTTTGTGGTCATTGCCTAAGATTAAGCTCATCAGAGCTGAAAACAATGGTTTCTCTGATGAAATTCCATAGTCTATATCTATGGCTGCTCAGCTTGATAACAACAGTTTTTCAAGTAAAATACCTCGGGGTCTCGGGTCTATTCGAAGCTTATATCGATTCTCTGCGTTGCTCAATCGCTTTTATGGTGCATTGCCACCAAACATCTGTGATTTGCCATTAATAATGAGTATAGTTAATCTGTCCCACAGTTCTCTCTCTGGTCAAATTCCCGAGCCGAAAAATTGCAAGAGACTGTCTCTTTGTCCTTAGCAGGCAATATTCTTACTCGAGAAATTCCTACTTCCGTTGCAGATCTACCAGTGTTAAATTATCTTGATCTTTCTGATAATAATCTCACTGGTTTGATCCCTCAAGGACTCGAGAACTTAAAGTTGCACTCTTTAATATTTCATTCTCCTTGATTTCGTGGCTACCAGCTTCATTTCTGCAAGGAAATCCTGATCTTTGTGGCCCTGGTTTGCAAAGTCCTTGTTCTCAAGGCCATCCAACAAACCATATGTATGGACTTAAACAAAATGACATGTGCCCTCGTCTCTCTAGCTTGTGTTTTAGGAGTTCTAAGTTTAGCTGCTGGGTTCATTCTGTATTATTGATCCTACAAACCGATATCCCGAGTCGATAACTGGCAAATCTGCTCAAGGATGTGGGAGTACTAAGTTTAGCCAAGTGTTCATGTTAAGCTTACCGCGCTGTGAACTGATCAACGTAAAGAAACTCGTTAATTTTGGGAGTCATTCGTGGAAGTCGTTGAAAGCTGGGGCCAAGAATTTGGCCAAGATCAGGCACAAAAACGTCATCAAAGGAAGCTTGGCTGACTTGATATGCAGAAATGATTCTTGCCTGAATTGGAATGTGAGACTGAGAATTTCTATTGAGGTTGCTCAAGGACTAGCTTACATTAACAAGGACAATGTCCCACATTTACTTCATCGAAATGCCAAATTGTCGCATATTCTATTGGATGCCGACTTTGTCCTGAAGTTCAAGGATTTTGCTCTTCACCATATCGTTGGAGAGTCGCCACTTCACTCGACAGTAGCTTCGGAACCTTCTCATTCCTGCTGTACTGCATCAGAATATAAATACAACAAAAAGGCAACAGAGCAAATGGATGTGTACAGCTTTGGTGCAGTGTTGCGAGAGCTGGTGACTGGTAGACAAGCCGAGCACTCGGAATCAACGGACTCTCCCGACGTCGTCCAGTGGGTGAGAAGGAAGGTGAACATAGCCAATGGAGCTTCCCAAGTCTTGGACTCGAGCATCATGGAACATTGTCAACGACAAATGTTGGAAGCTCTAGACATTGCCCTCCAATGCACTTCTATAATGCCTGAAAAACGTCCGTCGATGCTTGAAGTCGCCAAGGCCCTTCAATTGGTTGGATTGACGACGAACCTTCACGATGCAGCCTTCTCGGTTGCAGAGGATAGTTTGGTTTCAAGCGAAAGACCTCCTCTTGCTGCATGACTTGTTTTTGTATATCCAAATTCAGATATCTTTGTAGGAGACTTCTTGATTGATGTTTGGGGCATGGTATTTCTGAGTGTGTTCGAGTTTCCTATTTTATTTTGATTTTGAAATGAAGAAGCTTAAATTTTGGTGTAATGTAATTGCTTTGAGGAAGCATATGATAATTTTTGAAGTTATTTATTTGTACGACATTGGAGTAGCAGTAATATGTACTTTTATAAATATTTGACAATACAGAGCATGAGGGACCAGCTGAAGGGTATTGGTGGGGTTTTGTGGCGAAGGGAAGGGTAGGAATGTTTTTTGAGAGGCGGCTCTCTCGAAATGCCCGGAAGTTCCATTTCCATTTCCTTATTTTCTGTCTATCTGGTTATTTTGATCTTGTGACCTTTGGTGGCTTGTAATATCCTTGAGACCATGGGATATCTGTTACTCCTTATTAAATCAAAGACATTGTTGATTAAATATTAATA
Coding sequence (CDS)
ATGGACGTGGAATCTCCTAATGTGGATCTTCCGACGACAGACAAAGAAATCGTACCGGAGAAGATTGAGGATGAAGAGATCAAAGAGCCTTTGATTCATTGCGAGCTCTGCGATGCGGAAATTGTTCATAAACTTGCTCAGGTTCTTCTTCCTGGATTGTCCACCGCTTGTGTCGATAATACGAGTGGCGATATTTTTCGAACCCCTGGTTCAGTGGCTGCCGATATGAGGAAAGAAATGGTGGATTATCTTACCATGAGGAGCGAAACTTGTGTTGCCGAATCTGTAATTTTAGAGAATGCGTCCGATGCTGAGGTATCTGATCATCCTTACGATATCATCTCTGATTTTGTTGATGATTTTGCTGCTACGAAGAGAAATTTGTTTAGTAGAGTTTCAGGATGGGTTCTAAGTGAGAAAAGAGAGGATAAGATAGATGATTTTGTTCAAGAAATGGACATCAATGGCTTTTGGCCACTTGATAGGAGAGAAGCAATTGCTCAGGTTTTGCTTAAGAATGTGGACTTTAAGAGTGAGTTTCATTGTGATAAGAAATTTCACTCTACAGAAGAACTAGCTGAGCATGTTGAAAACTGTGGGTTTAGGTCCGTAACTTGCACAAATGAAGGCTGCACTGCGAGATTTTGTGCAAGCCACGCAGAACAACATGATTCCATCTGCCCCTTCAAGATAATTTCATGTGAACAGAAGTGTTCTGCCTTTATTATGAGACGTGAAATGGACCGCCATTGCATAACTGTTTGTCCAATGAAGCTTGTGAATTGCCCCTTCCATAATTTGGGTTGTCAATCCCCTGTTCCTTACTCTTTGATAGCGCAGCATTGTTCAGAGAGTTTTGATTCTCATTTGCTGCATATTCTTCACTCTATACACAAGGAAGCCAATGAAGAGACTCTTATACATAGACGGCAACAGCTTGAAGAGGCATCATCACTTGACCACCTTAGAGGGCTTCAAAACTTGAGATTGTTAACCTCAAAAATCAAAGAAATGGATTCTGGGCTAGGACCATTAGTAGTCATCTACAAGGTTGAGGACACAGAAGAAGCGAAGGATGGCTCTGATGAAAGTGACGAAGAGAAGGAGGCTTCTAATGCAACTGAAGATGCCAAGGATGCCTCTAATGCAACAGAAGAAAAGAAGGAAATGTCTAATGGAACTGAAGAAACGAAGGATGCCTCAATTGCAACTGAAGAAAGGAAGAATGTGTCAACTGCAATTGAAGAAAGGGAGGTCATATCTAGTGCAATTGAGGAAACAAAAGATGCTTCTAAGACAACTGGAGAAACAAAGGACAATTCTGATGACAAAGAAGAAGCCAGTGATGCTTCTAATGAAAAAGAAGAAACAAAAGATGAGTCGCATTCAACTGAAGAAACAAAGGATGCTAATGCAGAGGAAAAGATGAAGGATGATTCTGATTCAGAGGAAGAAATGAAGAATGCTTCTGATGTAAAAGAAGAAATGAAGGATAATTCTGATGGAGAGGAAAAAAAGAAGGATGATTCTGATGGAGAGGAAAAACGGAAGGATGATTCTGATTTAGAGGAAGAAAAGAAGGATGATTCTGATGCAAAGGAGGAAGAAGCAAGGAAAGAAATTGAAGAAAGGAAGGAGGTTTCCAATGGAATTGAAGAAAAGAATGGTACCTCAAATCCACCTGAAGGAGAAACCAAGAGTGCTTCGAATGTAGTAAGAAGTGATCCAGATGAAGCTGAAAGATAA
Protein sequence
MDVESPNVDLPTTDKEIVPEKIEDEEIKEPLIHCELCDAEIVHKLAQVLLPGLSTACVDNTSGDIFRTPGSVAADMRKEMVDYLTMRSETCVAESVILENASDAEVSDHPYDIISDFVDDFAATKRNLFSRVSGWVLSEKREDKIDDFVQEMDINGFWPLDRREAIAQVLLKNVDFKSEFHCDKKFHSTEELAEHVENCGFRSVTCTNEGCTARFCASHAEQHDSICPFKIISCEQKCSAFIMRREMDRHCITVCPMKLVNCPFHNLGCQSPVPYSLIAQHCSESFDSHLLHILHSIHKEANEETLIHRRQQLEEASSLDHLRGLQNLRLLTSKIKEMDSGLGPLVVIYKVEDTEEAKDGSDESDEEKEASNATEDAKDASNATEEKKEMSNGTEETKDASIATEERKNVSTAIEEREVISSAIEETKDASKTTGETKDNSDDKEEASDASNEKEETKDESHSTEETKDANAEEKMKDDSDSEEEMKNASDVKEEMKDNSDGEEKKKDDSDGEEKRKDDSDLEEEKKDDSDAKEEEARKEIEERKEVSNGIEEKNGTSNPPEGETKSASNVVRSDPDEAER
Homology
BLAST of CcUC04G059960 vs. NCBI nr
Match:
XP_038883573.1 (acidic repeat-containing protein [Benincasa hispida])
HSP 1 Score: 880.6 bits (2274), Expect = 7.6e-252
Identity = 506/629 (80.45%), Postives = 533/629 (84.74%), Query Frame = 0
Query: 1 MDVESPNVDLPTTDKEIVPEKIEDEEIKEPLIHCELCDAEIVHKLAQVLLPGLSTACVDN 60
MDV S NVDLPTTDKEI+PEKIEDEEIKEPLIHCELCDAEI+HKLAQVLLPGLSTACVDN
Sbjct: 1 MDVGSANVDLPTTDKEIIPEKIEDEEIKEPLIHCELCDAEIIHKLAQVLLPGLSTACVDN 60
Query: 61 TSGDIFRTPGSVAADMRKEMVDYLTMRSETCVAESVILENASDAEVSDHPYDIISDFVDD 120
TSGDIFRTPGSVAADMRKEMVDYLTMRSETCVAESVILENA DAEVSDHPYDIISDFVDD
Sbjct: 61 TSGDIFRTPGSVAADMRKEMVDYLTMRSETCVAESVILENAPDAEVSDHPYDIISDFVDD 120
Query: 121 FAATKRNLFSRVSGWVLSEKREDKIDDFVQEMDINGFWPLDRREAIAQVLLKNVDFKSEF 180
FAATKRNLFSRVSGWVLSEKREDKIDDFVQEMDINGFWPLDRREAIAQVLLKNVDFKSEF
Sbjct: 121 FAATKRNLFSRVSGWVLSEKREDKIDDFVQEMDINGFWPLDRREAIAQVLLKNVDFKSEF 180
Query: 181 HCDKKFHSTEELAEHVENCGFRSVTCTNEGCTARFCASHAEQHDSICPFKIISCEQKCSA 240
HCDKKFHS EELAEHVENCGFRS+TCTNEGCTARFCASHAEQHDSICPFKIISCEQKCSA
Sbjct: 181 HCDKKFHSAEELAEHVENCGFRSLTCTNEGCTARFCASHAEQHDSICPFKIISCEQKCSA 240
Query: 241 FIMRREMDRHCITVCPMKLVNCPFHNLGCQSPVPYSLIAQHCSESFDSHLLHILHSIHKE 300
FIMRREMDRHCITVCPMKLVNCPFHNLGCQSPVPY +IAQHCSESFDSHLLHILHSIHKE
Sbjct: 241 FIMRREMDRHCITVCPMKLVNCPFHNLGCQSPVPYCMIAQHCSESFDSHLLHILHSIHKE 300
Query: 301 ANEETLIHRRQQLEEASSLDHLRGLQNLRLLTSKIKEMDSGLGPLVVIYKVEDTEEAKDG 360
ANEETLIHRRQQLEEASSLDHLRGLQNLRLLTSKIKEM+S LGPLVVI KVEDTEEAKD
Sbjct: 301 ANEETLIHRRQQLEEASSLDHLRGLQNLRLLTSKIKEMESRLGPLVVICKVEDTEEAKDS 360
Query: 361 SDESDEEKEASNATEDAKDASNATEEKKEMSNGTEETKDASIATEERKNVSTAIEEREVI 420
SD SDEEKEASNATEDAKD SNAT+E K+MSNG EETKDASIATEERK+ STAIEEREV+
Sbjct: 361 SDGSDEEKEASNATEDAKDGSNATKE-KDMSNGNEETKDASIATEERKDASTAIEEREVM 420
Query: 421 SSAIEETKDAS----------------KTTG----------------------------- 480
SSAIEE KDAS K T
Sbjct: 421 SSAIEEMKDASNEKEEDSDVSNEKEEMKDTSNKKEEDRDDSNEKEEAKDASSNEKETKEE 480
Query: 481 --ETKDNSDDKEEASDASNEKEETKDESHSTEETKDA-NAEEKMKDDSDSEEEMKNASDV 540
ETKD S++KEE +ASNEKEET+D S+STEETKDA NA+E+MK+ SDSEEEMKN SD
Sbjct: 481 KEETKDASNEKEETKNASNEKEETRDASNSTEETKDASNAKEEMKNGSDSEEEMKNGSDS 540
Query: 541 KEEMKDNSDGEEKKKDDSDGEEKRKDDSDLEEEKKDDSDAKEEEARKEIEERKEVSNGIE 582
+EEMK++SD +E+KKDDSD +E++KDDSD EEEKKD SDA EEE +KEIEERK
Sbjct: 541 EEEMKNDSDIKEEKKDDSDVKEEKKDDSDSEEEKKDGSDAAEEEVKKEIEERK------- 600
BLAST of CcUC04G059960 vs. NCBI nr
Match:
XP_031739873.1 (FK506-binding protein 5 [Cucumis sativus] >KAE8649769.1 hypothetical protein Csa_012392 [Cucumis sativus])
HSP 1 Score: 800.8 bits (2067), Expect = 7.7e-228
Identity = 485/659 (73.60%), Postives = 523/659 (79.36%), Query Frame = 0
Query: 1 MDVESPNVDLPTTDKEIVPEKIEDEEIKEPLIHCELCDAEIVHKLAQVLLPGLSTACVDN 60
MD ESPNVDLPTTDKEI+PEKIEDEEIKEP IHCELCDAEIVHKLAQVLLPGLSTACVDN
Sbjct: 1 MDAESPNVDLPTTDKEIIPEKIEDEEIKEPFIHCELCDAEIVHKLAQVLLPGLSTACVDN 60
Query: 61 TSGDIFRTPGSVAADMRKEMVDYLTMRSETCVAESVILENASDAEVSDHPYDIISDFVDD 120
TSGDIFRTPGSVAAD+RKEMVDYLTMRSETCVAESVIL+N S+AEVSDHPYDIISDFVDD
Sbjct: 61 TSGDIFRTPGSVAADIRKEMVDYLTMRSETCVAESVILDNPSEAEVSDHPYDIISDFVDD 120
Query: 121 FAATKRNLFSRVSGWVLSEKREDKIDDFVQEMDINGFWPLDRREAIAQVLLKNVDFKSEF 180
F+ATKRNLFSRVSGW+LSEKREDKIDDFVQEMD+NGFWPLDRREAIAQ LLKNVDFKSEF
Sbjct: 121 FSATKRNLFSRVSGWILSEKREDKIDDFVQEMDVNGFWPLDRREAIAQTLLKNVDFKSEF 180
Query: 181 HCDKKFHSTEELAEHVENCGFRSVTCTNEGCTARFCASHAEQHDSICPFKIISCEQKCSA 240
HCDKKFHS EELA HVENCGFRS+TCTNEGCTARFCASHAEQHDSICPFKII CEQKCSA
Sbjct: 181 HCDKKFHSVEELAGHVENCGFRSLTCTNEGCTARFCASHAEQHDSICPFKIILCEQKCSA 240
Query: 241 FIMRREMDRHCITVCPMKLVNCPFHNLGCQSPVPYSLIAQHCSESFDSHLLHILHSIHKE 300
FIMRREMDRHCITVCPMKLVNCPFHNLGCQSPVPY LIAQHCSESFDSHLLHILHS+HKE
Sbjct: 241 FIMRREMDRHCITVCPMKLVNCPFHNLGCQSPVPYCLIAQHCSESFDSHLLHILHSVHKE 300
Query: 301 ANEETLIHRRQQLEEASSLDHLRGLQNLRLLTSKIKEMDSGLGPLVVIYKVEDTEEAKDG 360
ANEETLIHR+QQLEEASSLDHLRGLQNLRLLTSKIKEMDS LGPLVVI +VEDTEEAKD
Sbjct: 301 ANEETLIHRQQQLEEASSLDHLRGLQNLRLLTSKIKEMDSQLGPLVVICRVEDTEEAKDD 360
Query: 361 SDESDEEKEASNATEDAKD-ASNATEE-KKEMSNGTEETKDASIAT-EERKNVS-TAIEE 420
SD+SDEEKEAS TE+ KD ASN T+E K+EM NG+EETKD SIA EERK+ S T IEE
Sbjct: 361 SDKSDEEKEASKVTENTKDAASNVTQETKEEMPNGSEETKDGSIANEEERKDASPTVIEE 420
Query: 421 RE-VISSAIEETKDASKTT----------------------------------------- 480
RE ++SS IEETKDA KTT
Sbjct: 421 REAIMSSVIEETKDAFKTTEETKDASGKKEASDASSSESENEETKEASDKKEASNASSEK 480
Query: 481 -------------------GETKDNSDDKEEASDASNEKE---------ETKDESHSTEE 540
ETKD SD KEEA DAS+EKE ETKD S+STEE
Sbjct: 481 EETKDASDEKEASDASSEKEETKDASDKKEEAHDASSEKEETNDASNEKETKDLSNSTEE 540
Query: 541 TKD-ANAEEKMKDDSDSEEEMKNASDVK-EEMKDNSDGEEKKKDDSD-GEEKRKDDSDL- 582
T D +NA+ KMKDDSDSEEEMKNASDVK EE+K++SD EE+ K+DSD EE+ K+ SD+
Sbjct: 541 TNDGSNAKGKMKDDSDSEEEMKNASDVKEEEVKNDSDSEEEMKNDSDVKEEEMKNASDVK 600
BLAST of CcUC04G059960 vs. NCBI nr
Match:
XP_008442263.1 (PREDICTED: glutamic acid-rich protein [Cucumis melo])
HSP 1 Score: 788.1 bits (2034), Expect = 5.2e-224
Identity = 466/620 (75.16%), Postives = 504/620 (81.29%), Query Frame = 0
Query: 1 MDVESPNVDLPTTDKEIVPEKIEDEEIKEPLIHCELCDAEIVHKLAQVLLPGLSTACVDN 60
MD ESPNVDLPTTDKEIVPEKIEDEEIKEP IHCELCDAEIVHKLAQVLLPGLSTACVDN
Sbjct: 1 MDAESPNVDLPTTDKEIVPEKIEDEEIKEPFIHCELCDAEIVHKLAQVLLPGLSTACVDN 60
Query: 61 TSGDIFRTPGSVAADMRKEMVDYLTMRSETCVAESVILENASDAEVSDHPYDIISDFVDD 120
TSGDIFRTPGSVAAD+RKEMVDYLTMRSETCVAESVIL+N S+AEVSDHPYDI+SDFVDD
Sbjct: 61 TSGDIFRTPGSVAADIRKEMVDYLTMRSETCVAESVILDNPSEAEVSDHPYDIVSDFVDD 120
Query: 121 FAATKRNLFSRVSGWVLSEKREDKIDDFVQEMDINGFWPLDRREAIAQVLLKNVDFKSEF 180
FAATKRNLFSRVSGW+LSEKREDKIDDFVQEMD+NGFWPLDRREAIAQ LLKNVDFKSEF
Sbjct: 121 FAATKRNLFSRVSGWILSEKREDKIDDFVQEMDVNGFWPLDRREAIAQTLLKNVDFKSEF 180
Query: 181 HCDKKFHSTEELAEHVENCGFRSVTCTNEGCTARFCASHAEQHDSICPFKIISCEQKCSA 240
HCDKKFHS EELAEHVENCGFRS+TCTNEGCT+RFCASHAEQHDSICPFKII CEQKCSA
Sbjct: 181 HCDKKFHSVEELAEHVENCGFRSLTCTNEGCTSRFCASHAEQHDSICPFKIILCEQKCSA 240
Query: 241 FIMRREMDRHCITVCPMKLVNCPFHNLGCQSPVPYSLIAQHCSESFDSHLLHILHSIHKE 300
FIMRREMDRHCITVCPMKLVNCPFHNLGCQSPVPY LIAQHCSESFDSHLLHILHSIHKE
Sbjct: 241 FIMRREMDRHCITVCPMKLVNCPFHNLGCQSPVPYCLIAQHCSESFDSHLLHILHSIHKE 300
Query: 301 ANEETLIHRRQQLEEASSLDHLRGLQNLRLLTSKIKEMDSGLGPLVVIYKVEDTEEAKDG 360
ANEETLIHRRQQLEE SSLDHLRGLQNLRLLT KIKEM+S LGPLVVI KVE+ D
Sbjct: 301 ANEETLIHRRQQLEETSSLDHLRGLQNLRLLTLKIKEMESQLGPLVVICKVEE-----DS 360
Query: 361 SDESDEEKEASNATEDAKD-ASNATEEKK-EMSNGTEETKDASIAT-EERKNVS-TAIEE 420
SD+SDEEKEASN TE+AKD ASN +E+K EM NG+EETKD SI T EERK+ S TAIEE
Sbjct: 361 SDKSDEEKEASNVTEEAKDAASNVIQERKEEMPNGSEETKDGSIGTEEERKDASTTAIEE 420
Query: 421 REVI-SSAIEETKDASKTTGETKDNSDDKEEASDASNEKEETKDE----------SHSTE 480
REVI SS IEETKD SKTT E KD SD KEEA+DAS+EK+ETKD S+ E
Sbjct: 421 REVIMSSVIEETKDTSKTTEEMKDASDKKEEANDASSEKQETKDASDKKEEASNVSNENE 480
Query: 481 ETKDANAEEKMKDDSDSEEEMKNASDVKE-------------------EMKDNSDGEEKK 540
ET+DA+ +++ D S +EE K+AS+ KE E KD S+ E+
Sbjct: 481 ETRDASDKKEASDASSEKEETKDASEKKEEASDASSEKEETNDTSNEKETKDLSNSTEET 540
Query: 541 KDDSDGEEKRKDDSDLEEEKKDDSDAKEEEAR---KEIEERKEVSNGIEE--KNGTSNPP 582
D S+ + K KDDSD EEE K+ SD KEEE + + EE + + EE K
Sbjct: 541 NDGSNAKGKMKDDSDSEEEMKNASDVKEEEMKNKDSDSEEENKYDSAKEEEVKKEIEESE 600
BLAST of CcUC04G059960 vs. NCBI nr
Match:
TYK29843.1 (glutamic acid-rich protein [Cucumis melo var. makuwa])
HSP 1 Score: 778.5 bits (2009), Expect = 4.1e-221
Identity = 455/601 (75.71%), Postives = 491/601 (81.70%), Query Frame = 0
Query: 1 MDVESPNVDLPTTDKEIVPEKIEDEEIKEPLIHCELCDAEIVHKLAQVLLPGLSTACVDN 60
MD ESPNVDLPTTDKEIVPEKIEDEEIKEP IHCELCDAEIVHKLAQVLLPGLSTACVDN
Sbjct: 1 MDAESPNVDLPTTDKEIVPEKIEDEEIKEPFIHCELCDAEIVHKLAQVLLPGLSTACVDN 60
Query: 61 TSGDIFRTPGSVAADMRKEMVDYLTMRSETCVAESVILENASDAEVSDHPYDIISDFVDD 120
TSGDIFRTPGSVAAD+RKEMVDYLTMRSETCVAESVIL+N S+AEVSDHPYDI+SDFVDD
Sbjct: 61 TSGDIFRTPGSVAADIRKEMVDYLTMRSETCVAESVILDNPSEAEVSDHPYDIVSDFVDD 120
Query: 121 FAATKRNLFSRVSGWVLSEKREDKIDDFVQEMDINGFWPLDRREAIAQVLLKNVDFKSEF 180
FAATKRNLFSRVSGW+LSEKREDKIDDFVQEMD+NGFWPLDRREAIAQ LLKNVDFKSEF
Sbjct: 121 FAATKRNLFSRVSGWILSEKREDKIDDFVQEMDVNGFWPLDRREAIAQTLLKNVDFKSEF 180
Query: 181 HCDKKFHSTEELAEHVENCGFRSVTCTNEGCTARFCASHAEQHDSICPFKIISCEQKCSA 240
HCDKKFHS EELAEHVENCGFRS+TCTNEGCT+RFCASHAEQHDSICPFKII CEQKCSA
Sbjct: 181 HCDKKFHSVEELAEHVENCGFRSLTCTNEGCTSRFCASHAEQHDSICPFKIILCEQKCSA 240
Query: 241 FIMRREMDRHCITVCPMKLVNCPFHNLGCQSPVPYSLIAQHCSESFDSHLLHILHSIHKE 300
FIMRREMDRHCITVCPMKLVNCPFHNLGCQSPVPY LIAQHCSESFDSHLLHILHSIHKE
Sbjct: 241 FIMRREMDRHCITVCPMKLVNCPFHNLGCQSPVPYCLIAQHCSESFDSHLLHILHSIHKE 300
Query: 301 ANEETLIHRRQQLEEASSLDHLRGLQNLRLLTSKIKEMDSGLGPLVVIYKVEDTEEAKDG 360
ANEETLIHRRQQLEE SSLDHLRGLQNLRLLT KIKEM+S LGPLVVI KVE+ D
Sbjct: 301 ANEETLIHRRQQLEETSSLDHLRGLQNLRLLTLKIKEMESQLGPLVVICKVEE-----DS 360
Query: 361 SDESDEEKEASNATEDAKD-ASNATEEKK-EMSNGTEETKDASIAT-EERKNVS-TAIEE 420
SD+SDEEKEASN TE+AKD ASN +E+K EM NG+EETKD SI T EERK+ S TAIEE
Sbjct: 361 SDKSDEEKEASNVTEEAKDAASNVIQERKEEMPNGSEETKDGSIGTEEERKDASTTAIEE 420
Query: 421 REVI-SSAIEETKDASKTTGETKDNSD--------------------------------- 480
REVI SS IEETKD SKTT E KD SD
Sbjct: 421 REVIMSSVIEETKDTSKTTEEMKDASDKKEEANDASSEKQETKDASDKKEEASNVSNENE 480
Query: 481 ------DKEEASDASNEKEETKDESHSTEETKDANAEEKMKDDSDSEEEMKNASDVKEEM 540
DK+EASDAS+EKEETKD S EE DA++E++ +D+ +E+E K+ S+ EE
Sbjct: 481 ETRDASDKKEASDASSEKEETKDASEKKEEASDASSEKEETNDTSNEKETKDLSNSTEET 540
Query: 541 KDNSDGEEKKKDDSDGEEKRKDDSDLEEE--KKDDSD---------AKEEEARKEIEERK 547
D S+ + K KDDSD EE+ K+ SD++EE K DSD AKEEE +KEIEE +
Sbjct: 541 NDGSNAKGKMKDDSDSEEEMKNASDVKEEEMKNKDSDSEEENKYDSAKEEEVKKEIEESE 596
BLAST of CcUC04G059960 vs. NCBI nr
Match:
KAG7033547.1 (hypothetical protein SDJN02_03269 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 753.4 bits (1944), Expect = 1.4e-213
Identity = 438/605 (72.40%), Postives = 493/605 (81.49%), Query Frame = 0
Query: 1 MDVESPNVDLPTTDKEIVPEKIEDEEIKEPLIHCELCDAEIVHKLAQVLLPGLSTACVDN 60
M+VESPNVDLPTT+KEIVPEKIEDEEIK+PL HC+LCDAE+VHKLAQ+LLPGLSTACVDN
Sbjct: 1 MEVESPNVDLPTTEKEIVPEKIEDEEIKDPLCHCDLCDAELVHKLAQLLLPGLSTACVDN 60
Query: 61 TSGDIFRTPGSVAADMRKEMVDYLTMRSETCVAESVILENASDAEVSDHPYDIISDFVDD 120
T+G IFRTPGSVAADMRKEMVDYLT+RSET VAESVILENA DAE+SDHPYDIISDFV+D
Sbjct: 61 TTGGIFRTPGSVAADMRKEMVDYLTLRSETFVAESVILENAPDAELSDHPYDIISDFVED 120
Query: 121 FAATKRNLFSRVSGWVLSEKREDKIDDFVQEMDINGFWPLDRREAIAQVLLKNVDFKSEF 180
F+ +KRN FSRVS WVLSEKREDKIDDFVQEMD+NGFWPLDRR+AIAQ LLKNVDFKSE+
Sbjct: 121 FSLSKRNFFSRVSAWVLSEKREDKIDDFVQEMDVNGFWPLDRRQAIAQALLKNVDFKSEY 180
Query: 181 HCDKKFHSTEELAEHVENCGFRSVTCTNEGCTARFCASHAEQHDSICPFKIISCEQKCSA 240
HCDKKFHS EELAEHVE CGFR++TCTNEGCTA FCA+H EQHDSICPFKII CEQKCSA
Sbjct: 181 HCDKKFHSAEELAEHVEICGFRTLTCTNEGCTALFCANHTEQHDSICPFKIIQCEQKCSA 240
Query: 241 FIMRREMDRHCITVCPMKLVNCPFHNLGCQSPVPYSLIAQHCSESFDSHLLHILHSIHKE 300
FIMRREMDRHCITVCPMKLVNCPFHN+GCQ PVPYSLIAQHC+ESFDSHLLHILHSIHKE
Sbjct: 241 FIMRREMDRHCITVCPMKLVNCPFHNMGCQFPVPYSLIAQHCTESFDSHLLHILHSIHKE 300
Query: 301 ANEETLIHRRQQLEEASSLDHLRGLQNLRLLTSKIKEMDSGLGPLVVIYKVEDTEEAKDG 360
ANEETL HRRQQLEEA+SLDHLRGLQNLRLLT KIKEM+S LGPLV+I +VE+TEEAKD
Sbjct: 301 ANEETLKHRRQQLEEAASLDHLRGLQNLRLLTKKIKEMESELGPLVIIAEVEETEEAKDA 360
Query: 361 SDESDEEKEASNATEDAKDASNATEEKKEMSNGTEETKDASIATEERKNVSTAIEEREVI 420
S+E++E KE SNATE+AKDASNATEEKK++SNGTEETKDASIA EE K+ TAI+EREV+
Sbjct: 361 SNETEEGKETSNATEEAKDASNATEEKKDVSNGTEETKDASIAAEETKDAPTAIQEREVV 420
Query: 421 SSAIEETKDASKTTGETKDNSD-DKEEASDA--------------------SNEKEETKD 480
S+AIEETK T+GE ++ +D +KEE DA S+ EETKD
Sbjct: 421 STAIEETKG---TSGEGEEANDAEKEETKDAIIATEEGNAVPTAIHERDVVSSAIEETKD 480
Query: 481 ES-HSTEETKDA-NAEEKMKDDSDSEEEMKNASDVKEEMKDNSDGEEKKKDDSDGEEKRK 540
S TEE DA N +E+ D S+ +EE ++AS V EE KD SD E K EE+ K
Sbjct: 481 VSPKETEEPNDASNEKEEANDASNEKEETEDASTVIEETKDGSDSIETKNASDAAEEETK 540
Query: 541 DDSDLEEEKKDD-SDAKEEEARKEIEERKEVSNGIEEKNGTSNPPEGETKSASNVVRSDP 582
D ++ EEE+ D S +EEEA+KE EE KE NG EE+ G S E + P
Sbjct: 541 DATNAEEEETTDASKEEEEEAKKETEESKEGDNGSEERKGPSKGMEER--------KDYP 594
BLAST of CcUC04G059960 vs. ExPASy TrEMBL
Match:
A0A0A0L1F8 (TRAF-type domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G572340 PE=4 SV=1)
HSP 1 Score: 796.6 bits (2056), Expect = 7.0e-227
Identity = 470/642 (73.21%), Postives = 514/642 (80.06%), Query Frame = 0
Query: 1 MDVESPNVDLPTTDKEIVPEKIEDEEIKEPLIHCELCDAEIVHKLAQVLLPGLSTACVDN 60
MD ESPNVDLPTTDKEI+PEKIEDEEIKEP IHCELCDAEIVHKLAQVLLPGLSTACVDN
Sbjct: 1 MDAESPNVDLPTTDKEIIPEKIEDEEIKEPFIHCELCDAEIVHKLAQVLLPGLSTACVDN 60
Query: 61 TSGDIFRTPGSVAADMRKEMVDYLTMRSETCVAESVILENASDAEVSDHPYDIISDFVDD 120
TSGDIFRTPGSVAAD+RKEMVDYLTMRSETCVAESVIL+N S+AEVSDHPYDIISDFVDD
Sbjct: 61 TSGDIFRTPGSVAADIRKEMVDYLTMRSETCVAESVILDNPSEAEVSDHPYDIISDFVDD 120
Query: 121 FAATKRNLFSRVSGWVLSEKREDKIDDFVQEMDINGFWPLDRREAIAQVLLKNVDFKSEF 180
F+ATKRNLFSRVSGW+LSEKREDKIDDFVQEMD+NGFWPLDRREAIAQ LLKNVDFKSEF
Sbjct: 121 FSATKRNLFSRVSGWILSEKREDKIDDFVQEMDVNGFWPLDRREAIAQTLLKNVDFKSEF 180
Query: 181 HCDKKFHSTEELAEHVENCGFRSVTCTNEGCTARFCASHAEQHDSICPFKIISCEQKCSA 240
HCDKKFHS EELA HVENCGFRS+TCTNEGCTARFCASHAEQHDSICPFKII CEQKCSA
Sbjct: 181 HCDKKFHSVEELAGHVENCGFRSLTCTNEGCTARFCASHAEQHDSICPFKIILCEQKCSA 240
Query: 241 FIMRREMDRHCITVCPMKLVNCPFHNLGCQSPVPYSLIAQHCSESFDSHLLHILHSIHKE 300
FIMRREMDRHCITVCPMKLVNCPFHNLGCQSPVPY LIAQHCSESFDSHLLHILHS+HKE
Sbjct: 241 FIMRREMDRHCITVCPMKLVNCPFHNLGCQSPVPYCLIAQHCSESFDSHLLHILHSVHKE 300
Query: 301 ANEETLIHRRQQLEEASSLDHLRGLQNLRLLTSKIKEMDSGLGPLVVIYKVEDTEEAKDG 360
ANEETLIHR+QQLEEASSLDHLRGLQNLRLLTSKIKEMDS LGPLVVI +VEDTEEAKD
Sbjct: 301 ANEETLIHRQQQLEEASSLDHLRGLQNLRLLTSKIKEMDSQLGPLVVICRVEDTEEAKDD 360
Query: 361 SDESDEEKEASNATEDAKD-ASNATEE-KKEMSNGTEETKDASIAT-EERKNVS-TAIEE 420
SD+SDEEKEAS TE+ KD ASN T+E K+EM NG+EETKD SIA EERK+ S T IEE
Sbjct: 361 SDKSDEEKEASKVTENTKDAASNVTQETKEEMPNGSEETKDGSIANEEERKDASPTVIEE 420
Query: 421 RE-VISSAIEETKDASKTTGETKD------------------------------------ 480
RE ++SS IEETKDA KTT ETKD
Sbjct: 421 REAIMSSVIEETKDAFKTTEETKDASGKKEASDASSSESENEETKEASDKKEASNASSEK 480
Query: 481 ----NSDDKEEASDASNEKEETKDESHSTEETKDANAEEKMKDDSDSEEEMKNASDVKEE 540
++ D++EASDAS+EKEETKD S EE DA++E++ +D+ +E+E K+ S+ EE
Sbjct: 481 EETKDASDEKEASDASSEKEETKDASDKKEEAHDASSEKEETNDASNEKETKDLSNSTEE 540
Query: 541 MKDNSDGEEKKKDDSDGEEKRK-DDSDLEEEKKDDSDAKEEEARK----EIEERKEVSNG 582
D S+ + K KDDSD EE+ K +DSD EEE K+DSD KEEE + + EE K S+
Sbjct: 541 TNDGSNAKGKMKDDSDSEEEMKNNDSDSEEEMKNDSDVKEEEMKNASDVKEEEVKNDSDS 600
BLAST of CcUC04G059960 vs. ExPASy TrEMBL
Match:
A0A1S3B4U5 (glutamic acid-rich protein OS=Cucumis melo OX=3656 GN=LOC103486171 PE=4 SV=1)
HSP 1 Score: 788.1 bits (2034), Expect = 2.5e-224
Identity = 466/620 (75.16%), Postives = 504/620 (81.29%), Query Frame = 0
Query: 1 MDVESPNVDLPTTDKEIVPEKIEDEEIKEPLIHCELCDAEIVHKLAQVLLPGLSTACVDN 60
MD ESPNVDLPTTDKEIVPEKIEDEEIKEP IHCELCDAEIVHKLAQVLLPGLSTACVDN
Sbjct: 1 MDAESPNVDLPTTDKEIVPEKIEDEEIKEPFIHCELCDAEIVHKLAQVLLPGLSTACVDN 60
Query: 61 TSGDIFRTPGSVAADMRKEMVDYLTMRSETCVAESVILENASDAEVSDHPYDIISDFVDD 120
TSGDIFRTPGSVAAD+RKEMVDYLTMRSETCVAESVIL+N S+AEVSDHPYDI+SDFVDD
Sbjct: 61 TSGDIFRTPGSVAADIRKEMVDYLTMRSETCVAESVILDNPSEAEVSDHPYDIVSDFVDD 120
Query: 121 FAATKRNLFSRVSGWVLSEKREDKIDDFVQEMDINGFWPLDRREAIAQVLLKNVDFKSEF 180
FAATKRNLFSRVSGW+LSEKREDKIDDFVQEMD+NGFWPLDRREAIAQ LLKNVDFKSEF
Sbjct: 121 FAATKRNLFSRVSGWILSEKREDKIDDFVQEMDVNGFWPLDRREAIAQTLLKNVDFKSEF 180
Query: 181 HCDKKFHSTEELAEHVENCGFRSVTCTNEGCTARFCASHAEQHDSICPFKIISCEQKCSA 240
HCDKKFHS EELAEHVENCGFRS+TCTNEGCT+RFCASHAEQHDSICPFKII CEQKCSA
Sbjct: 181 HCDKKFHSVEELAEHVENCGFRSLTCTNEGCTSRFCASHAEQHDSICPFKIILCEQKCSA 240
Query: 241 FIMRREMDRHCITVCPMKLVNCPFHNLGCQSPVPYSLIAQHCSESFDSHLLHILHSIHKE 300
FIMRREMDRHCITVCPMKLVNCPFHNLGCQSPVPY LIAQHCSESFDSHLLHILHSIHKE
Sbjct: 241 FIMRREMDRHCITVCPMKLVNCPFHNLGCQSPVPYCLIAQHCSESFDSHLLHILHSIHKE 300
Query: 301 ANEETLIHRRQQLEEASSLDHLRGLQNLRLLTSKIKEMDSGLGPLVVIYKVEDTEEAKDG 360
ANEETLIHRRQQLEE SSLDHLRGLQNLRLLT KIKEM+S LGPLVVI KVE+ D
Sbjct: 301 ANEETLIHRRQQLEETSSLDHLRGLQNLRLLTLKIKEMESQLGPLVVICKVEE-----DS 360
Query: 361 SDESDEEKEASNATEDAKD-ASNATEEKK-EMSNGTEETKDASIAT-EERKNVS-TAIEE 420
SD+SDEEKEASN TE+AKD ASN +E+K EM NG+EETKD SI T EERK+ S TAIEE
Sbjct: 361 SDKSDEEKEASNVTEEAKDAASNVIQERKEEMPNGSEETKDGSIGTEEERKDASTTAIEE 420
Query: 421 REVI-SSAIEETKDASKTTGETKDNSDDKEEASDASNEKEETKDE----------SHSTE 480
REVI SS IEETKD SKTT E KD SD KEEA+DAS+EK+ETKD S+ E
Sbjct: 421 REVIMSSVIEETKDTSKTTEEMKDASDKKEEANDASSEKQETKDASDKKEEASNVSNENE 480
Query: 481 ETKDANAEEKMKDDSDSEEEMKNASDVKE-------------------EMKDNSDGEEKK 540
ET+DA+ +++ D S +EE K+AS+ KE E KD S+ E+
Sbjct: 481 ETRDASDKKEASDASSEKEETKDASEKKEEASDASSEKEETNDTSNEKETKDLSNSTEET 540
Query: 541 KDDSDGEEKRKDDSDLEEEKKDDSDAKEEEAR---KEIEERKEVSNGIEE--KNGTSNPP 582
D S+ + K KDDSD EEE K+ SD KEEE + + EE + + EE K
Sbjct: 541 NDGSNAKGKMKDDSDSEEEMKNASDVKEEEMKNKDSDSEEENKYDSAKEEEVKKEIEESE 600
BLAST of CcUC04G059960 vs. ExPASy TrEMBL
Match:
A0A5D3E181 (Glutamic acid-rich protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold208G00810 PE=4 SV=1)
HSP 1 Score: 778.5 bits (2009), Expect = 2.0e-221
Identity = 455/601 (75.71%), Postives = 491/601 (81.70%), Query Frame = 0
Query: 1 MDVESPNVDLPTTDKEIVPEKIEDEEIKEPLIHCELCDAEIVHKLAQVLLPGLSTACVDN 60
MD ESPNVDLPTTDKEIVPEKIEDEEIKEP IHCELCDAEIVHKLAQVLLPGLSTACVDN
Sbjct: 1 MDAESPNVDLPTTDKEIVPEKIEDEEIKEPFIHCELCDAEIVHKLAQVLLPGLSTACVDN 60
Query: 61 TSGDIFRTPGSVAADMRKEMVDYLTMRSETCVAESVILENASDAEVSDHPYDIISDFVDD 120
TSGDIFRTPGSVAAD+RKEMVDYLTMRSETCVAESVIL+N S+AEVSDHPYDI+SDFVDD
Sbjct: 61 TSGDIFRTPGSVAADIRKEMVDYLTMRSETCVAESVILDNPSEAEVSDHPYDIVSDFVDD 120
Query: 121 FAATKRNLFSRVSGWVLSEKREDKIDDFVQEMDINGFWPLDRREAIAQVLLKNVDFKSEF 180
FAATKRNLFSRVSGW+LSEKREDKIDDFVQEMD+NGFWPLDRREAIAQ LLKNVDFKSEF
Sbjct: 121 FAATKRNLFSRVSGWILSEKREDKIDDFVQEMDVNGFWPLDRREAIAQTLLKNVDFKSEF 180
Query: 181 HCDKKFHSTEELAEHVENCGFRSVTCTNEGCTARFCASHAEQHDSICPFKIISCEQKCSA 240
HCDKKFHS EELAEHVENCGFRS+TCTNEGCT+RFCASHAEQHDSICPFKII CEQKCSA
Sbjct: 181 HCDKKFHSVEELAEHVENCGFRSLTCTNEGCTSRFCASHAEQHDSICPFKIILCEQKCSA 240
Query: 241 FIMRREMDRHCITVCPMKLVNCPFHNLGCQSPVPYSLIAQHCSESFDSHLLHILHSIHKE 300
FIMRREMDRHCITVCPMKLVNCPFHNLGCQSPVPY LIAQHCSESFDSHLLHILHSIHKE
Sbjct: 241 FIMRREMDRHCITVCPMKLVNCPFHNLGCQSPVPYCLIAQHCSESFDSHLLHILHSIHKE 300
Query: 301 ANEETLIHRRQQLEEASSLDHLRGLQNLRLLTSKIKEMDSGLGPLVVIYKVEDTEEAKDG 360
ANEETLIHRRQQLEE SSLDHLRGLQNLRLLT KIKEM+S LGPLVVI KVE+ D
Sbjct: 301 ANEETLIHRRQQLEETSSLDHLRGLQNLRLLTLKIKEMESQLGPLVVICKVEE-----DS 360
Query: 361 SDESDEEKEASNATEDAKD-ASNATEEKK-EMSNGTEETKDASIAT-EERKNVS-TAIEE 420
SD+SDEEKEASN TE+AKD ASN +E+K EM NG+EETKD SI T EERK+ S TAIEE
Sbjct: 361 SDKSDEEKEASNVTEEAKDAASNVIQERKEEMPNGSEETKDGSIGTEEERKDASTTAIEE 420
Query: 421 REVI-SSAIEETKDASKTTGETKDNSD--------------------------------- 480
REVI SS IEETKD SKTT E KD SD
Sbjct: 421 REVIMSSVIEETKDTSKTTEEMKDASDKKEEANDASSEKQETKDASDKKEEASNVSNENE 480
Query: 481 ------DKEEASDASNEKEETKDESHSTEETKDANAEEKMKDDSDSEEEMKNASDVKEEM 540
DK+EASDAS+EKEETKD S EE DA++E++ +D+ +E+E K+ S+ EE
Sbjct: 481 ETRDASDKKEASDASSEKEETKDASEKKEEASDASSEKEETNDTSNEKETKDLSNSTEET 540
Query: 541 KDNSDGEEKKKDDSDGEEKRKDDSDLEEE--KKDDSD---------AKEEEARKEIEERK 547
D S+ + K KDDSD EE+ K+ SD++EE K DSD AKEEE +KEIEE +
Sbjct: 541 NDGSNAKGKMKDDSDSEEEMKNASDVKEEEMKNKDSDSEEENKYDSAKEEEVKKEIEESE 596
BLAST of CcUC04G059960 vs. ExPASy TrEMBL
Match:
A0A6J1EKK8 (uncharacterized protein LOC111435517 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111435517 PE=4 SV=1)
HSP 1 Score: 750.4 bits (1936), Expect = 5.8e-213
Identity = 438/604 (72.52%), Postives = 493/604 (81.62%), Query Frame = 0
Query: 4 ESPNVDLPTTDKEIVPEKIEDEEIKEPLIHCELCDAEIVHKLAQVLLPGLSTACVDNTSG 63
ESPNVDLPTT+KEIVPEKIEDEEIK+PL HC+LCDAE+VHKLAQ+LLPGLSTACVDNT+G
Sbjct: 3 ESPNVDLPTTEKEIVPEKIEDEEIKDPLCHCDLCDAELVHKLAQLLLPGLSTACVDNTTG 62
Query: 64 DIFRTPGSVAADMRKEMVDYLTMRSETCVAESVILENASDAEVSDHPYDIISDFVDDFAA 123
IFRTPGSVAADMRKEMVDYLT+RSET VAESVILENA DAE+SDHPYDIISDFV+DF+
Sbjct: 63 GIFRTPGSVAADMRKEMVDYLTLRSETFVAESVILENAPDAELSDHPYDIISDFVEDFSL 122
Query: 124 TKRNLFSRVSGWVLSEKREDKIDDFVQEMDINGFWPLDRREAIAQVLLKNVDFKSEFHCD 183
+KRN FSRVS WVLSEKREDKIDDFVQEMD+NGFWPLDRR+AIAQ LLKNVDFKSE+HCD
Sbjct: 123 SKRNFFSRVSAWVLSEKREDKIDDFVQEMDVNGFWPLDRRQAIAQALLKNVDFKSEYHCD 182
Query: 184 KKFHSTEELAEHVENCGFRSVTCTNEGCTARFCASHAEQHDSICPFKIISCEQKCSAFIM 243
KKFHS EELAEHVE CGFR++TCTNEGCTA FCA+H EQHDSICPFKII CEQKCSAFIM
Sbjct: 183 KKFHSAEELAEHVEICGFRTLTCTNEGCTALFCANHTEQHDSICPFKIIQCEQKCSAFIM 242
Query: 244 RREMDRHCITVCPMKLVNCPFHNLGCQSPVPYSLIAQHCSESFDSHLLHILHSIHKEANE 303
RREMDRHCITVCPMKLVNCPFHN+GCQ PVPYSLIAQHC+ESFDSHLLHILHSIHKEANE
Sbjct: 243 RREMDRHCITVCPMKLVNCPFHNMGCQFPVPYSLIAQHCTESFDSHLLHILHSIHKEANE 302
Query: 304 ETLIHRRQQLEEASSLDHLRGLQNLRLLTSKIKEMDSGLGPLVVIYKVEDTEEAKDGSDE 363
ETL HRRQQLEEA+SLDHLRGLQNLRLLT KIKEM+S LGPLV+I +VE+TEEAKD S++
Sbjct: 303 ETLKHRRQQLEEAASLDHLRGLQNLRLLTKKIKEMESELGPLVIIAEVEETEEAKDASNK 362
Query: 364 SDEEKEASNATEDAKDASNATEEKKEMSNGTEETKDASIATEERKNVSTAIEEREVISSA 423
++E KE SNATE+AKDASNATEEKK++SNGTEETKDASIA EE K+ TAI+EREV+S+A
Sbjct: 363 TEEGKETSNATEEAKDASNATEEKKDVSNGTEETKDASIAAEETKDAPTAIQEREVVSTA 422
Query: 424 IEETKDASKTTGETKDNSD-DKEEASDA--------------------SNEKEETKDES- 483
IEETK T+GE ++ +D +KEE DA S+ EETKD S
Sbjct: 423 IEETKG---TSGEGEEANDAEKEETKDAIIATEEGNAVPTAIHERDVVSSAIEETKDVSP 482
Query: 484 HSTEETKDA-NAEEKMKDDSDSEEEMKNASDVKEEMKDNSDGEEKKKDDSDGEEKRKDDS 543
TEE DA N +E+ D S+ +EE ++AS V EE KD SD E K EE+ KD +
Sbjct: 483 KETEEPNDASNEKEEANDASNEKEETEDASTVIEETKDGSDSIETKNASDAAEEETKDAT 542
Query: 544 DLEEEKKDD-SDAKE--EEARKEIEERKEVSNGIEEKNGTSNPPEGETKSASNVVRSDPD 582
+ EEE+ D S+AKE EEA+KE EE KE NG EE+ G S E + DPD
Sbjct: 543 NAEEEETTDASNAKEEGEEAKKETEESKEGDNGSEERKGPSKGMEER--------KDDPD 595
BLAST of CcUC04G059960 vs. ExPASy TrEMBL
Match:
A0A6J1HXM1 (uncharacterized protein LOC111467177 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111467177 PE=4 SV=1)
HSP 1 Score: 748.8 bits (1932), Expect = 1.7e-212
Identity = 435/602 (72.26%), Postives = 485/602 (80.56%), Query Frame = 0
Query: 4 ESPNVDLPTTDKEIVPEKIEDEEIKEPLIHCELCDAEIVHKLAQVLLPGLSTACVDNTSG 63
ESPNVDLPTT+KEIVPEKIEDEEIK+PL HC+LCDAE+VHKLAQ+LLPGLSTACVDNT+G
Sbjct: 3 ESPNVDLPTTEKEIVPEKIEDEEIKDPLCHCDLCDAELVHKLAQLLLPGLSTACVDNTTG 62
Query: 64 DIFRTPGSVAADMRKEMVDYLTMRSETCVAESVILENASDAEVSDHPYDIISDFVDDFAA 123
IFRTPGSVAADMRKEMVDYLT+RSET VAESVILENA DAE+SDHPYDIISDFV+DF+
Sbjct: 63 GIFRTPGSVAADMRKEMVDYLTLRSETFVAESVILENAPDAELSDHPYDIISDFVEDFSL 122
Query: 124 TKRNLFSRVSGWVLSEKREDKIDDFVQEMDINGFWPLDRREAIAQVLLKNVDFKSEFHCD 183
+KRN FSRVS WVLSEKREDKIDDFVQEMD+NGFWPLDRR+AIAQ LLKNVDFKSE+HCD
Sbjct: 123 SKRNFFSRVSAWVLSEKREDKIDDFVQEMDVNGFWPLDRRQAIAQALLKNVDFKSEYHCD 182
Query: 184 KKFHSTEELAEHVENCGFRSVTCTNEGCTARFCASHAEQHDSICPFKIISCEQKCSAFIM 243
KKFHS EELAEHVE CGFR++TCTNEGCTA FCA+H EQHDSICPFKII CEQKCSAFIM
Sbjct: 183 KKFHSAEELAEHVEICGFRTLTCTNEGCTALFCANHTEQHDSICPFKIIQCEQKCSAFIM 242
Query: 244 RREMDRHCITVCPMKLVNCPFHNLGCQSPVPYSLIAQHCSESFDSHLLHILHSIHKEANE 303
RREMDRHCITVCPMKLVNCPFHN+GCQ PVPYSLIAQHC+ESFDSHLLHILHSIHKEANE
Sbjct: 243 RREMDRHCITVCPMKLVNCPFHNMGCQFPVPYSLIAQHCTESFDSHLLHILHSIHKEANE 302
Query: 304 ETLIHRRQQLEEASSLDHLRGLQNLRLLTSKIKEMDSGLGPLVVIYKVEDTEEAKDGSDE 363
ETL HRRQQLEEASSLDHLRGLQNLRLLT KIKEM+S LGPLV+I +VE+TEEAKD S+E
Sbjct: 303 ETLKHRRQQLEEASSLDHLRGLQNLRLLTKKIKEMESELGPLVIISEVEETEEAKDASNE 362
Query: 364 SDEEKEASNATEDAKDASNATEEKKEMSNGTEETKDASIATEERKNVSTAIEEREVISSA 423
++E KE SNATE+AKDASNATEEKK++SNGTEETKDASIA EE K+ STAI+E EV+S+A
Sbjct: 363 TEEGKETSNATEEAKDASNATEEKKDVSNGTEETKDASIAAEETKDASTAIQESEVVSAA 422
Query: 424 IEETKDASKTTGETKDNSDDKEEASDASNEKEETKDE--------------------SHS 483
I+ ETK D+ EEA+DA EKEETKD S +
Sbjct: 423 IK----------ETKGTLDEGEEANDA--EKEETKDAIIATEEGNAVPTAIHERDVMSSA 482
Query: 484 TEETKDAN--AEEKMKDDSDSEEEMKNASDVKEEMKDNSDGEEKKKDDSDGEEKRKDDSD 543
EETKD + A E+ D S+ +EE + S+ KEE +D S E+ KD SD E +
Sbjct: 483 IEETKDVSPMATEEPNDASNEKEEANDTSNEKEETEDASTVIEETKDGSDSIETKNASDA 542
Query: 544 LEEEKKDDSDAKE--EEARKEIEERKEVSNGIEEKNGTSNPPEGETKSASNVVRSDPDEA 582
EEE KD S+AKE EEA+KE EE KE NG EE+ G S E + DPDEA
Sbjct: 543 AEEETKDASNAKEEGEEAKKETEESKEGDNGSEERKGPSKGMEER--------KDDPDEA 584
BLAST of CcUC04G059960 vs. TAIR 10
Match:
AT3G11950.1 (TRAF-like superfamily protein )
HSP 1 Score: 375.6 bits (963), Expect = 7.4e-104
Identity = 264/583 (45.28%), Postives = 356/583 (61.06%), Query Frame = 0
Query: 8 VDLPTTDKEIVPEKIEDEEIKEPLIHCELCDAEIVHKLAQVLLPGLSTACVDNTSGDIFR 67
+D P +D E IED++ P HC+L D ++VHK+AQV LPGL+TACVDNT+GDIFR
Sbjct: 1 MDPPVSDL----ESIEDQKEGGPSFHCDLYDTQVVHKIAQVFLPGLATACVDNTTGDIFR 60
Query: 68 TPGSVAADMRKEMVDYLTMRSETCVAESVILENASDAEVSDHPYDIISDFVDDFAATKRN 127
+PGSVAAD+RKEM++YLT RSET VAE ++L+ S+ E S P+DIISDF+DDFA +KRN
Sbjct: 61 SPGSVAADIRKEMIEYLTRRSETFVAEHIVLQGGSEIEASHDPFDIISDFIDDFATSKRN 120
Query: 128 LFSRVSGWVLSEKREDKIDDFVQEMDINGFWPLDRREAIAQVLLKNVDFKSEFHCDKKFH 187
LFSRVSGW+LSE+RED IDDF QEM+I+GFW D RE IAQ LLKNVDFKS HC+ KF
Sbjct: 121 LFSRVSGWMLSERREDNIDDFAQEMEISGFWLTDHREGIAQTLLKNVDFKSSAHCEMKFQ 180
Query: 188 STEELAEHVENCGFRSVTCTNEGCTARFCASHAEQHDSICPFKIISCEQKCSAFIMRREM 247
+ ELAEH NCG+R++ C NEGCTA FCA+ E HDS+CPFKII CEQ CS IMRR+M
Sbjct: 181 TEGELAEHAMNCGYRTMNCENEGCTAVFCANQMENHDSVCPFKIIPCEQNCSESIMRRDM 240
Query: 248 DRHCITVCPMKLVNCPFHNLGCQSPVPYSLIAQHCSESFDSHLLHILHSIHKEANEETLI 307
DRHCITVCPMKLVNCPFH++GC S V + QH ++ SHL++IL SI+KEA+ + L
Sbjct: 241 DRHCITVCPMKLVNCPFHSVGCLSDVHQCEVQQHHLDNVSSHLMYILRSIYKEASLDDLK 300
Query: 308 HRRQQLEEASSLDHLRGLQNLRLLTSKIKEMDSGLGPLVVIYKV---------EDTEEAK 367
R +Q+++ S+ L +N R LT+ +KE+D LGPL + K+ E+TE+
Sbjct: 301 PRAEQIQQLST--RLSEARNARSLTNLVKEIDGKLGPLEIKPKIVTDSESDKPENTEKKA 360
Query: 368 DGSDESDEEKEASN--ATEDAKDASNATEEKKEMSNGTEETKDASIATEERKNVSTA--- 427
E E+ E SN A + A A E+K + DA++ E K VS A
Sbjct: 361 LEEAEIKEKPETSNLKAVTLEQTAREAPEDKL-----VSKEVDAAMVKEAAKKVSEAEIA 420
Query: 428 ---IEEREVISSAIEETKDASKTTGETKDNS-DDKEEASDASNEKEETKDESHSTE---E 487
EE E+ + + E + K E +NS DD E ++ + DE+ E E
Sbjct: 421 DNVNEEGELKAQKLLEIGEFIK---EGDNNSADDLSERTETKAPEVVVMDEAREEEDSVE 480
Query: 488 TKDANAEE-----KMKDDSDSEEEMKNASDVKEEMKD----NSDGEEKKKDDSDGEEKRK 547
TKD E +++ + +EE K +++ K E + +G+E+ K ++ E +
Sbjct: 481 TKDTRTYETIRGLEIEANEMIDEETKKSTETKTEAPSRIVMDKEGDEETKKSTETETEAP 540
Query: 548 DDSDLEEEKKDDSDAKEEEARKEIEERKEVSNGIEEKNGTSNP 561
+E EK +++ A E E + S G + P
Sbjct: 541 SRIVMETEKDEETMNSRARASDEAEALSKSSQGFSTVQAETVP 569
BLAST of CcUC04G059960 vs. TAIR 10
Match:
AT3G11950.2 (TRAF-like superfamily protein )
HSP 1 Score: 375.6 bits (963), Expect = 7.4e-104
Identity = 264/583 (45.28%), Postives = 356/583 (61.06%), Query Frame = 0
Query: 8 VDLPTTDKEIVPEKIEDEEIKEPLIHCELCDAEIVHKLAQVLLPGLSTACVDNTSGDIFR 67
+D P +D E IED++ P HC+L D ++VHK+AQV LPGL+TACVDNT+GDIFR
Sbjct: 1 MDPPVSDL----ESIEDQKEGGPSFHCDLYDTQVVHKIAQVFLPGLATACVDNTTGDIFR 60
Query: 68 TPGSVAADMRKEMVDYLTMRSETCVAESVILENASDAEVSDHPYDIISDFVDDFAATKRN 127
+PGSVAAD+RKEM++YLT RSET VAE ++L+ S+ E S P+DIISDF+DDFA +KRN
Sbjct: 61 SPGSVAADIRKEMIEYLTRRSETFVAEHIVLQGGSEIEASHDPFDIISDFIDDFATSKRN 120
Query: 128 LFSRVSGWVLSEKREDKIDDFVQEMDINGFWPLDRREAIAQVLLKNVDFKSEFHCDKKFH 187
LFSRVSGW+LSE+RED IDDF QEM+I+GFW D RE IAQ LLKNVDFKS HC+ KF
Sbjct: 121 LFSRVSGWMLSERREDNIDDFAQEMEISGFWLTDHREGIAQTLLKNVDFKSSAHCEMKFQ 180
Query: 188 STEELAEHVENCGFRSVTCTNEGCTARFCASHAEQHDSICPFKIISCEQKCSAFIMRREM 247
+ ELAEH NCG+R++ C NEGCTA FCA+ E HDS+CPFKII CEQ CS IMRR+M
Sbjct: 181 TEGELAEHAMNCGYRTMNCENEGCTAVFCANQMENHDSVCPFKIIPCEQNCSESIMRRDM 240
Query: 248 DRHCITVCPMKLVNCPFHNLGCQSPVPYSLIAQHCSESFDSHLLHILHSIHKEANEETLI 307
DRHCITVCPMKLVNCPFH++GC S V + QH ++ SHL++IL SI+KEA+ + L
Sbjct: 241 DRHCITVCPMKLVNCPFHSVGCLSDVHQCEVQQHHLDNVSSHLMYILRSIYKEASLDDLK 300
Query: 308 HRRQQLEEASSLDHLRGLQNLRLLTSKIKEMDSGLGPLVVIYKV---------EDTEEAK 367
R +Q+++ S+ L +N R LT+ +KE+D LGPL + K+ E+TE+
Sbjct: 301 PRAEQIQQLST--RLSEARNARSLTNLVKEIDGKLGPLEIKPKIVTDSESDKPENTEKKA 360
Query: 368 DGSDESDEEKEASN--ATEDAKDASNATEEKKEMSNGTEETKDASIATEERKNVSTA--- 427
E E+ E SN A + A A E+K + DA++ E K VS A
Sbjct: 361 LEEAEIKEKPETSNLKAVTLEQTAREAPEDKL-----VSKEVDAAMVKEAAKKVSEAEIA 420
Query: 428 ---IEEREVISSAIEETKDASKTTGETKDNS-DDKEEASDASNEKEETKDESHSTE---E 487
EE E+ + + E + K E +NS DD E ++ + DE+ E E
Sbjct: 421 DNVNEEGELKAQKLLEIGEFIK---EGDNNSADDLSERTETKAPEVVVMDEAREEEDSVE 480
Query: 488 TKDANAEE-----KMKDDSDSEEEMKNASDVKEEMKD----NSDGEEKKKDDSDGEEKRK 547
TKD E +++ + +EE K +++ K E + +G+E+ K ++ E +
Sbjct: 481 TKDTRTYETIRGLEIEANEMIDEETKKSTETKTEAPSRIVMDKEGDEETKKSTETETEAP 540
Query: 548 DDSDLEEEKKDDSDAKEEEARKEIEERKEVSNGIEEKNGTSNP 561
+E EK +++ A E E + S G + P
Sbjct: 541 SRIVMETEKDEETMNSRARASDEAEALSKSSQGFSTVQAETVP 569
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038883573.1 | 7.6e-252 | 80.45 | acidic repeat-containing protein [Benincasa hispida] | [more] |
XP_031739873.1 | 7.7e-228 | 73.60 | FK506-binding protein 5 [Cucumis sativus] >KAE8649769.1 hypothetical protein Csa... | [more] |
XP_008442263.1 | 5.2e-224 | 75.16 | PREDICTED: glutamic acid-rich protein [Cucumis melo] | [more] |
TYK29843.1 | 4.1e-221 | 75.71 | glutamic acid-rich protein [Cucumis melo var. makuwa] | [more] |
KAG7033547.1 | 1.4e-213 | 72.40 | hypothetical protein SDJN02_03269 [Cucurbita argyrosperma subsp. argyrosperma] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A0A0L1F8 | 7.0e-227 | 73.21 | TRAF-type domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G572340 P... | [more] |
A0A1S3B4U5 | 2.5e-224 | 75.16 | glutamic acid-rich protein OS=Cucumis melo OX=3656 GN=LOC103486171 PE=4 SV=1 | [more] |
A0A5D3E181 | 2.0e-221 | 75.71 | Glutamic acid-rich protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaff... | [more] |
A0A6J1EKK8 | 5.8e-213 | 72.52 | uncharacterized protein LOC111435517 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1HXM1 | 1.7e-212 | 72.26 | uncharacterized protein LOC111467177 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... | [more] |