Cp4.1LG12g02770 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG12g02770
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionChaperone protein dnaJ, putative
LocationCp4.1LG12 : 1860001 .. 1876466 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAAGATTCGAACACAACTAGAGACGTCGGCGATTTATCGCGACCGACCTCCGAACTTCACACTTCACAGCCATGGAGGAAGCCGTTTCTCGAATCCTCACAGAGCTCGAAGAAGCTCGCTGTTTCGACGGCTCTACAAATCTCCATTCTCAGCCTCCGCCGCCGCTCTCCGATTCCGCTCTCTTCGACCTTCAAAGCTTGTTGGACAACTCAATCGGTACCGATGAACAACAACCAGTCGATCGCCTCTACGAAGACCTCTCTGCCAAATCCCTGTCTCCGTCCTCCCTCATACGCGCCATCGTCTCCGCCATGGATGAGCCTTCCCCTCGCATTTCAATCTTAGCCTCTAAAGTCTATTTGTCCCTTCTTCTGGCCCCAAATGCACCAGTCTTCACGCTGTTCAATCCGATGGATTTTCTCTCGTTTCTCAGGTCTATGAGACGATTCTTGAAGCAGCGACCACGGACTACGCCGAATCAGGATGATTCAAATCAGGAGTCCATTGCTCCCAAACGGAAGAGGAAAGGCGGTGTTAAGGGTAAGGGTTTGAGGAATTGTGCGCAGAGGCAGAGTTCTAATGAAGGATATCATGATGGTGAATTCGATGCAAGAGTATTGTATCCTGTGCTTGAGAGGTTAGGGATATTAATGAGTTTGATTCACTTGGATCGATTCCCTGATAGTTTGAAATCTTTGGTTGAAACTGTAATTGATATTCCGGTTTTGGCACTAGAAGTATGCACTAATTTAAGTATCTATAGTAAGTTAACCAATTTATGTTCGCGGATTTTGAGTGCTACATTGCGTCCTGAGCACGGGGATCTGGTGAGTATTGCCGCTGAGGTGATTAAATCTCTATCACCATTGATTCTTCATCATAAAGATCAGGCGCGAGCGTTTGCGCTGGAGTTTGTCACTATTCAAATAGCGAATGCAGCAAAGGAATCAGATGGTGTTAAAAGCGCTCTGGTGAATCTTCCAAGGTATTTGGTTCAGAAGGCACCTGAGAAATCCGAGCCTCGTGCTTTAGCTGTTGATTCAATAATGGAGGTTGTTAAAGTCATGGAATTCAAGGATCAAATTGGGTTTGTGGATTATGTAGTGAAGATGACCCGAGGAAAGTCTAATCTTAGGCTGTTAGCTAGCGATCTTATCTCAACGCTGATAATGTCCTTGAGTGATCCAGTGGCCGTTGATTCCGAAAGTGAATTGAAGGATTCGTGGAGATTTGGGTGCTTGGTGGCATTGGTCCAACGATGTTCAGATGCTGGTGCTACAATTCGTGCCCGGGCACTATCCAACTTAGCTCACCTTGTGGTGTTCTTGTCTGACAATGACAAGAATAAGGCCTTACTGAAGGAAGTGTTGGGGTCTGGTGACAGACATTGCAAAAAGAATGGAAGTGAAATTCATGCTCTTTTGCGAAAAAGGTGTGTGGATGAGAAGGCGGCAGTTAGGAAGGCTGCGTTATTTCTGGTTACCAAGTGTACAGCCCTCCTTGGTGGATCAATGGATGGAGATATGCTGAAGACCGTTGGAATTGCTTGTTCCGACCCACTTGTTAGCATACGGAAAGCTGCAATGTCGGCTCTTTCTGAGGTGAATCACTAACGTTTTTAACAAGTTTAAAGTTTATATAATGTGTTTAGTTGGTACGAGGTGTTGAGGATTGTTGGGACGGAGTCCCAACATTGACTAACTTAGAGAATGATCATAGGTTTATAAGTAAAGAATACATCTTCATTAGTATGAGTAGCCTTTTGGAGAAACCAAAAGTAAAGCCATGAGTGCTTATGCTCAAAGTAGACAATATCATCACTGCGGAGGCTCGTAATTCCTAACATAGTATTAGAGTCATGTCCTTAAATGAATAATGTCAATAGAATCCTCAAGTGTCGAACAAAGAAATTGTGAGCCTCGAAGGTGTAGTCAAAAGCGACTCAAGTGTTGAACAAATGGTGTATTTTGTTCGAGGACTCTAGAAAAGGAGTCGAACCTCGATTATGGAGAGGTTGTTCGAGGGCTCCATAGGCCTTGGGGGAAACTCTATAGTGTACTTGGTTCTAGAAGAGGATTGTTGAGAATTGTTGGGAGGGAGTCCCACGTTCACTAATTTTCTCATGAAGTTGAACGTTGTTTTCTCCTGTCTCTTTCCTTTCATTCCTAGCTGATAACCGTGCATATCCACATAAAAATTAAAATAAAGTTGTCGTCTTAAATGAGGTAAATGAAACTAAGTGATCATTGAGCCTATAGTGGTTGTATGATATATTAAATTCTCTTTTACGTTTGAATCTTTATTTACTCTTAGGCCTTTCGAAGATTCCCAGATGAAAGTGTCACTGTCGAGTGGCTACATTCCGTTCCACGTTTGATTGCTGATAATGAATCTAGCATTCAAGAAGAGTGTGAAAACTTGTTTCAAGAACTAGTACTAGACCGGCTATCTAGAGTTGGATCTTCAAGTTTACCCCGTGATGGATCAAAAACCCTTGATTTAAAGAGGCAATTTGAGTTGTTATTTCCTGGAGTATTGGATCTTCTTAAAGAGATAAGTCATGCAGAGGTAATGCCGTGGGTGAAGAAAGTTTGTGCTAACTTGGGAAAAAAGAAACGATTAAAACCCTCAATTGCCGTTTCACTTCAGAAGATTATAGGGACATCTGAATCCTTATGGCTAAGCCAATCCTTACCACCGGAGAAGTGGACAGCTCCCCCGGGTGCCTGGTTCCTGCTGTCGGAAGTGTCAGCTTATCTTGTGAAAGAAGTGGACTGGGAGTTTCTTCATCATCACTGGAAAATCCTGGATGATCATGGTAGGGCTGAGTTTGGGAGTCCAGTGGCTCAAGTAGGTTTATTCGAAGATGAAAACAACTCAGAGTCAAATACCATCGCTTGGGCTCAAGATCGAGTTTTTCTCTTGCAAACCATCTCTAATGTTTCTCTTGAGCTGCCTCCAGAACCTGCAGAAAAGTTGGCTCACGAATTGATTAAAAAGGTTGAAGAATTCAGCATGCATTCTACCGAGGTAAGCATCAAAAGGATTGTTTTCAGACTTTGTTTTACAGTCTATAATTGTTGGTTTATTTCTCCAATGCTGTTGATTAGATAATCAAGATTAAATTCACCAATTAGTAGTTTCTTAACACACTTCAGTGGACACTGTTCATCATGGGTACAGTTATATGCTGTATAATATTTCTCTCCCATTTATGCATGCATAGTTTCTTGTGGGACTTGTTTGAGAATGATTTTGAAATAGTTGTCATGTTCAAAATTACGCCCACTCTTTTATCATTCAAAGTCAATATGTTTTTGTCGTTTAAAACACAAGTTTTATACCATCAAAATTAATTTAAAATGATTAAAGAAATGTTTGAAATCGTTTCAAAATTATTTCCAAGCATTTTTAGCATGTATGTGTACTGTTTCACTGAAATGTCATTAAAGTGGGGGAAAATTCTTCTTGGAATTAGCTAACAACTCTTCTGCAGTGAACGTGAATAGATGGCCGGTGAGCGTGGGTAGTTGGCTGTTCATGATAAGTTCATCTTATCAGCCAATGCGTGTGAGTGCATCTGCATGTGTTTTGCAATTGGGATTTTTTCTCATTTTTTATTTTGCTTCATCATTCTTGTTTCAAACATCTTTAAAATGGGAAAAAAGGCCTGTAAACATCCAGAATATTTGAAAATTTTATGGAAAACAGTGATCAAGTAATAATCTAAATACAAGCCAAAAATGTTCTGGGTTTGGATTTTCAAAAATGTTCTGGGTTTGGATTTTCTTATAGTAATTTGCCTGCCAAGCATCAATTTTGATATCAATACTTACTATTCCAGTCTCTTTTTCTTGATATCTCACACGCATTTGTATTTAATGGTGATCTCTTGGTAGGTGAATGCGCATGTAAAAACACTTAAAACATTGTGCAAACGGCAGGCTATGCAATCTGCTGAAGCCGATACGCTGATCCTAACATGGGTAAACCAGCTGCTCTCCAAAGCCTCTCAAATATTGGAGAAGTATATATCCAAGCATAGAAAAGCAAAAAAAGATGTGAACTTCATAACACCACCTCCTAAAAGTGGTAGTAGATTGGGAAAGCGAGCAACTGCCCCGAGCAAGTCCCTGTCACGAGCAATCACTGCTGCTTACACCATTGGATCTTTGATTATCATCTGTCCATCTGCTGACATAACTACCATTATACCACTTCTCCACACCATTATCACTTCTGGAAACCGTGATCCCAAATCGAATAAACTACCTATTCAAACGGTATCTATAAAGGAAACTGCACCTTCTTTATACGTTCAAGCATGGTTAACAATGGGGAAGATTTGCCTTGCTGACGAGAAGCGTGCAAAGAGCTACATTCCTCTATTTGTACAGGTTAGGTTTTGATTCTTCAGCATTGTGCCACTTATATTGTGCCACTTATATTGTTGAGGATGGTTGGGAGGGAGTCCCACATTGGCTAATTTAGGGAATGATCACGAGTTTATAAGTTAGGAATACATCTCCATTGGTATGAGGTCTTTTGGGGAAGCCCAAAGTAAAGCCATGAGAGCTTATGCTCAAAGTGAACAATATCACACTATTGTGGAGAGTCATGAAGATGTACCAGTAGATTTCTGGCATTTGTCTGGTTGAAGGAATAATACCACGATAACTACATGAGTTGAATCATATACTCTTTTTGAAATATCATCCATTGTAGAATTAAAGTGTTCCCCCTACCAATAATTATAATCGATCTGTTTGTTATTGAAGTTCTTTCTATATATATATATATATATATATATATATAATAAATAATTAAAGTTCTTCCTCCATGTCTTATAATAGATATCAAAGCTATAATCGTGGTACATACTACTTTATCAATGTCTTTTATGCAATCAAAATTTACTTTTGATACACCTCTAGTGGCCTAAAATTTTTATGTTAGTTCAGGAGCTTGAAAATAGTGACTGTGCAGCTCTCCGCAACAACCTTGTTATCACGATGGCAGATTTTTGTGTACGCTATACTGCTCTAGTTGATTGGTAAGACTAATGCACGATGTGAGTGTTTGGCAATGGCAATGATAGTGAAGTCTTTTAATGTTTATTCCATGTTACTATCTAACTATCTTTGAATTGAACAGCTACCTTACGAAGATCACGAAATGCCTTCGCGACCCTTGTGAACTTGTCAGAAGACATACATTCATACTTCTATCGAGATTACTACAGGTAATTCTTCATTCCCCAACCCCTAAAAACTTCTCGGAGATTTTCACTTATCCATGTTCATGAGTTAAATACTGTAACGACCCAAGCTCACCGCTAGCAGATATTGTCCTATTTGAGCTTTCCCTCAAGATTTTTAAAACGCGTCTGCTAGGGAGAGGTTTCCACATTCTTTAAAGAATGTTTCATTCTCCTCCCCAACGGACGTGGATCTCACAAAAACCATTTTAATTTTCAGAGAGACTACGTGAAATGGAGAGGGGTTTTGTTTCTTCGGTTTCTCTTGTCACTTGTCGACGAATCAGAGAAGATTCGTCAACTAGCAGATTTCCTTTTTGGGAACATTTTAAAAGGTTAGCATTACTAGCCGTTCTCATGGTTAATATTGATCACATCATTACAACATGCCAATACATTTGAATTGTGAAACTGAACTTGCAGTAAAGGCGCCACTTTTGGCTTACAACAGTTTTGTAGAAGCAATTTATGTTCTGAATGACTGTCGTGCCCATCCTGGACATAACGATTCGAAGGCTTCGAGAGCTGAAAGCCGATTATTCTCCATTCGGTAAGGAATGTTTGAACTTCGTGACATGTTATCACAATTATCTGTAATGGAAAATGATTAAAGTTCATATTCTTAGAGGTAATGATGAAAGTTCGAGGCGCAAGAGAATGCACATTTATGTTTCTTTGTTAAAACAAATGGCTCCCGAACATCTCCTGGCCACCTTTGCAAAGCTATGTTCAGAGATTCTTGGTGAAGCTTCAGATGGTAAACTCAGTATGGATGATACCACTGCACGGTCTGTTTTACAGGTTCAAATTCAAAAGCTTTCAACAATCCTTGTATTTTTTCTTCTCTTCTTCTCTTCTTTAGTGTTTACATACTAGAATCTTATTAGTTCTCCTTGGCTGAGATCACAGGACACCTTTGATATTCTTGCTTGTAAAGAGATTCGATTATCGATAAATCGAGGGTCGTCGTCGGAATCTGGTGATGTAGATGAGGAAGGAGGCGAGAGCGGAGGGGTGTCTGCTGCTAGAGGACAGGTCATCACTCATGTTGTGAGAAAAAGTCTCATTCAAAACTCCCTTCCCATCTTCATTGAGCTAAAAAGGCTAATGGAAAGCAAGAATAGCCCTCTTATAGGTGTGAAATCACTTCTTTAAATGCTTTTTAAGTAGAAAGTCGTTCCAAACCGACCCTGCGATATGTTTAACCTTTTATACTTTGTGAAAATAGGTTCTCTTATGGAATGTCTTCGAGTTCTTGTCAAAGACTACAAGAACGACATCGATGACATGTTGGTAGCTGATAAGCAACTTCAGAAAGAGCTCGTCTATGACATTCAAAAGTACGAAGCTACCAAAGCTAAGTCGGCCGCGGCCGAGGCGGTCAACGAGATGCAAAAGTCAACTAATTATCTTTCTCCTGAGGCTCCTCCTCATGTTAGAAACTCCATTAACAAGCTAACCTCCAAACTCCAAAAGGACTCAAGAGTTGCTTCAGCCATTGCTGATGTAGCCGCTGCAGCCACTGCTAAGTCTGTGCTTAGAGAAGTGAATAGAGGGACTTCGACGCCGCCTCTTGGTTCTCTAAGTTTGCCTAAACTCAAGTCTCGTACTGATGGAAACAATGGCGCAAATACCTCACGCTTGAATGTGATTGAATCTGTGAGGAAACGGCAGTCTTTTGATTCTGATGAAGAAAACTAATGTGCATAGGCTAACCAGGTAACGTAGTTTGATAATTTGTGAAGCTTTCTATGTTGTAATCTTGTTCTCTTTTGTTTATGTATTTAATTTCTTCTTCCATGTGAAATAGAAGCTTATTAGAATGGTTATACCTTAGAAGTATGAGAGAATGTGTTGAGAATGGCTCGTGAATGTGAGAAATTTTGTGTACAATATGAAATCCTCTTGATTCGGGATTCGAGAGTTTTAACATTTGTGATCTAGGAGTTTGTATGTTCGTAAATGTTCTTAGATCCAATTAGTTAAGGTTTTCATATCAAAATGATGTCTAGGTGGAAAACCTCTTGATTCGGGATTCAAGAGTTTTAACATTTGTGATCTAGGATTTTGTATGTTCATAAACGTTCTTAGATCCAATTAGTTAAGGTTTTCATATCAAAATGACGTCTAGGTGGAAATCCTCTTGATTCGAGATTCAAGAGTTTTAACATTTGTGATCTAGGAGTTTGTATGTTCATAAACGTTCTTAGATCCAATTAGTTAAGATTTTCATATCAAAATGTCATCTAGGTGGAAATCCTCCATTTGATCGACAGATCTAAATTAAGAATAAACCTTTAAAGTTTAATTATCAATTTCTTTACTTTTAAAGTTGTGTCTATTTGGTTAACATGAATATAGATTCATTCTTATGGATCACCCCGCTAATTTTTAAATTTTTAAATTTTATATTTAATAAATTTGTAAATCTTAAAAAGATTAAATTTATAATTTAAAAAATATTGCGATTAATACTTGTAATTTAACTAACTGTATATCTAGTTAAAGTTGAAGTAGGTGTGATCCATGTATTATTTATTTTAATTCCTATTAAAAAATTATTTATAATTTCATATAGTTTATTAGTAACGTGATGGTTGAATCGATAATTATATTTTATTACATAAAATGGATTGAACTCATATTTAAAAAAAAAAAAGGTGAAATACTTAATTTTTAAGATATGTGTACATGTAATCCTATTTTATCCAAATAATGGTAATTATTTAAATAAACAATTTTTTTATTAGAATAAATTTAAATTCAGGGTATATATATATATATATATATATTAAGTTTATCTAGATGTTATACTTTAATGTAATTTCAAGGAATTTAATTTAACTTTAAATTGTTACCTAATTTAGATTTAAATTTAGAAGTGTTTATTTTAGGGTTTTAAATTTAATAAATATTGTGATTATCTGGAATGAGAATTAAGCCATCCATCCTTATTAAAATTATAATTAATACCTTATTTATTTATTTAAAAAAAAAATTATTGCTTAAATCTCTACCTTAAATCTCAGAAAGTCTCCACGGCTTTTAATGGAAATGAAAAGTCCTGTCTGCTGACGAACGAATGCCATTGCAGCAGGCCAACTCCGAGCTCGGTGTTCATGGGGAAAATTTTTCTATTTGGAGTTCAATCTTCTGAATGAATCCCGATCCTCATCTTTCTCCGAGTTTGAAGCCGTCAATCTTCTTTTATCAGGCACGACTCTCATTCCTCCTCCTTTTCTATCTATCGTGTATATATAATGTTGATGACAATGATGTATACTCTAGTTTAGCCTTGCTTATGTGATTTTGTTGATCCGCTTACTGCCAGTATACTGTTGCGAAATTTGATAGTGGATTTCTATTTTTTTTTTCTGTAATTTAGGTCTTATCTGTCTCGAATTTGTTATCAGTATAGTTGGAATTGTATATGAAATTGTGCTGAAGGAAATCCGGCGGTTTAGGGTTTTCTTTGAAAGATTGTGTCGGTGCGATTATGGATGGGAATAAGGACGAAGCTTTAAGATGTATTCGCATAGCGGAAGAAGCAATTGCATCGGGGAGTAAAGAGCGAGCACTCAAATTCATTAAAATTGCTTGTCGCCTTAATCGGAGTTTGGAAGTGAACGAGTTATTGTCGGCGTGTGAGGATATTGACTCGAAATCACCTTCATCTTCCTCCGATGGGAAACGCGCGGGAAAAGTTCATAGTGTTTCTGGTTCGGCGAATCATGTTGACGGTTTGAATGGTGAACGTAATTACAGTATTGAACATGTTCAATTGATCAGGCAGATAAAAACAGCTAAGGACTATTACAAAATTCTTGGTGTGGAGAAAACTAGTTCGGCTGAAGAAATTAAGAGGGCTTATAAGAAATTGTCCTTGAAAGTTCACCCTGATAAGAACAAGGCTCCTGGTTCAGATGAAGCATTCAAGAAATTATCCAAGGCTTTCATGTGTTTGAGCGATGACACGTCGAGAAGGCAATATGACCACACAGCTTTGGTTGATCAATATGAGTACAACCAACAGCATAATGTAAGGCGCAGGAGAACGGGTCGTGATTTGTTTGAAGAAAATTTTGATCCTGATGAGATATTCAGGGCGTTCTTTGGGCAAGGAAATATGTTTCAGACGAGTCGTGCTTATACTTATAGCACTGGAGGTGCGAGAAGCCAACGGAGGACAGAATCTGATGGGGGAGGTCCCAACTTGTTGATTTTTCTTCTAATGTTACCATTCTTGTTAATTGTCTTGTTGGCTTATATGCCCTTTCCAGAGCCAGACTATTCTTTGCATAAAAATCTATCCTACAGTATTCCCATGGCTACGGAGAAACATGGAGTGGAGTTTTTTGTCAAATCATCAGATTTTGATGAAAGGTATCCTCTTGGAAGCGGAGCTCGATCTGAAATAGAGAGTAGTGTGATTAGGGATTACAAAAACATGGTTTGGCGTTATTGTCATGTAGAACTCCAGAGACGCAAGTGGAATAAAAATCTCCCAACTCCTCATTGCGAGAAGATGAATAATCTCGGGTTTGCATAAAGGTAAACAGGGGTTCTCTCATTTCTTAGGCGTTATTATCCAAGTCTTGTTGTTTGCTACTTCAGTTGTTAACTTTGCTTGAATATAAAGACTGCAATTTTAAGTTTTGTTATTTTCTGAAGCTTTTTATCCCATTTGATCTAGCCTTTTGTGTCTCTCATGTGTTAGGTGGTTTAGACTCAGATTGTTTAATGAACCTGTCTCTATATGATCCCTGAAAAAGAAATCATTATTACTGAAGTCTCTGTCTCNTGCCTTGTAAATCTGCCTTGGAAAAATAAAAAAAATAAAATCATACATCTATACCGTTTTTCCTGTGAAAGTATGATATTCATAACTCGCCAACTGATGGGAAACCGATCCTGTTTTAAATTAAAGCAGGTTATATTGATGGATTTTGAAGTTGTGAGACTCTATGCTCAGATGCAGAACAGGACACTCATGGCTGGATCACGAGGGCTAAGACCCAAAGGTTTATCGGAGTTGTGCTGAGCCAATCACTTGGCACCGTTGGCCATCAATGACGCTGGAGTGGTACGACTCTGTAATTTTTCTATATCACTTCCGCATGAAGTTTTTATGATCCATTTTTCTAATAAAGTCATAACCGTTGTACAAACTGAAAGAAGATACTGATTTGCTTCTTTACATTCTTCAAATATTGAAGTGAAGCTCAAGTCTATACGTAAATGTCCTTTGGATCATCATTCCATCTTTTCCTTCCCTTAAATTTTTTCGTTTTTATGCACATCCTTGCCCCATTATTGTTTTGAATCATTTGAGTATATAATACAGGATTCATGAACATGTGAAAATGAGATTCAAATATTGCTTCTAACATAGCAGATTATTTGGCATATCCTGTTTTAGTTCAAATTTGATGAAAATAGTTATGAAATCTGTGAAGTTTATCTCCCGGAAAACTCTGTACTCGACTTCCTTGTTTGTACTTTGTAGACCCAATAACTTACAATTAGATAGCATTTAAGGTATTGGGTGACAATAATGGGATTGATAGGAGGGATACATTTTGAAAAATTTCTAGAATCTAAACGAATGTAAACTCCCAAAAGCTCCATCAAAAGAGAAAATGATAAAGAATGCTACCAATTGAAATATTGCTCCTACTCAAACTCCTTCAAAAATAAATGTCTTCCATAGGTACAACCTCTACCTCAATACATTATTCGTCCAAGCCCAATCCGAGGAATCTCATCTAGATTTTCAGAAGAAACACAAAAAAACTTCCAGTTGAGTTGATCAATCTTATATTCGAGGTTTAGCATCACCAGCAGGAGCACTCTGCATTCAAGCCAAAAATAAGCACCACTTTGAGAAGCAATTATCATCATAAATATGTAACTAGGATTGGCAATTCAGGAGTGTTTCTTTCCATTCTCAATAACTCTGGGCAATATCATCCGGAGTCTCAAAGTGATGCAGCGTAGTATGTAATATGGATGAATAATCGTGACTTAGTAATTTTGTTTTAGTTTGTTGTTGGTTATTTGGTTGTGAGGCTTAAATTCTATCAGAATTGATATCTGGAAAGTTCATTAGTCCCTTAGAAGTTGGCAAGCGATCTTGTATTCCATAGGATATTTGGATGCTCCGAGCTTTTGCCTCGAAACTTATTCCATCACATCTTTTGTTTGCCAAAAATAGACTTAATCTGTTTCTTAGCTTGCTGTTTTGGCTAGAGTTCAGCTCTTGATTTATGTGGGAGATGTATAAGAGATGAGGATTTAGGTGCACGGAAAATAAGCTCGGTTAAATTAGGATGTCGATTTCTCACCAGAGTTGCACCCTGAGTAGCTCTTTCAGCCAAGTATCCACAAGACTTGAAAAATCCACCATACTGTTTCGACCATTCCTCCAATCTTGAATAGATGTAATTTGATCCAAGAGAATCCGCCCAGAACATGAGTCCTCCCCTGCTTACGGACACAAAATTTTATACTCTAGTTTCGGAACAAAAGAGATATATTAAAATGAGATCAACTGGTCAAGTTAACAAACCTGTAAGAGGGGAAACCCATTCCCATTACACCAGCAATGTCCAGGTCGGCTGCTTTGACCGCTATGCCTTCTGCTAGGACACGGCATGCCTCATTCACCACCGGGAATAATATCATCTCCACAATGTCCTTTTCCGATAACTTCATTAGCTACATTACCGAAATCGTTTAGAGATACCTCAACAATAATCACTTTCCCCCAGTTTCTTATATTTCTAAGGTCATAAAGAAATGCTAAATCAAGAGAAGTGAGCGAACATATGTTTTGTGAGGAGACATAGGGGAGAAAGAGACCTTAGGATCAACTGAAATACCGGAAGTGCTCCGAGCCTTCTCGATATAACTCTTTAACTCCGGATTTGGCCCAGCTTTACGATTCTGGTCATAGACATAGAAACCTTTACGAGTGGATTCACCTGAGATGTGTAATGTACACAAATGATTCTTGTAAGGAATGTAACACTGGTGAAGATCATCTAACATTACATAATAAAGGTAGCAATATATTCAATAATAGATGAAGCGTGTGGAGCGTTCTTTGTATGAGGTTTGGAAGGAAGTGAGGTATAACGCCTCGCTTTGGGCATCCGTTTGTTTGACCGTTTGTAACTATTAGCTTCACATTGGAGCCCATTCTTGTCAGGTAGTTGTAGGATTTAAGCTTATTTTGGAAGGAAAAAGCTACCTGCATTCTTATCCTCTTGCATAAGAGGAATTAGCACAGATTTATAAGTTCTTTCTGGAAAAACTTGAACAAGCTGACTGGTAGCTGCTTCTGCCACACCGAAACCAACGAGGTCACACAACCTGAAGGAAAAAACAAGAGCAAATAGGATTAAAAAATTCAATAGGATTCAATGAGAACAGAATATGGTGCAAACTAGCAAACTTCTTAACCATTATAAACCTGAAGGGTCCCATTGGCATTCCAAACTTGGAAATCGCCCTATCAATTCGATAGGGATCTACTCCACGTTCGGCAAGTAAAATGGCAGCCTGAGAGTAGGGGAAAAACATTCTGTTCACTGCAAATCCTGTGCAATTTCCAATGATAATAGGCGTTTTCTTTATAGTCTTCCCAACATCTAGCAGATCAACAATTACTTGTTCATCTGTATGCTTGGTACGAACAATTTCCAATAGAGGCATGACATGGGCCGGACTGTATCAGAAGCAAAAGCTCAACGCTACGCCAAATANGGGGGGGGGGGGGGGGGGGGAATGTGGATATACCTGAAGAAATGGGCTCCAATAATTCTGTCACGAGACTTCGTTCTCTCTCCAATCAACTCCAAATCTATTGTGGAAGTATTAGTAGCAAGCATACAATGTGGTGAGCAATATTTCTCAAGATCAGCAAAGATCTGTTGCTTCAAAGAAACATTCTCGGTAACAGCCTGTGTCGAGAACACGAAAATCAACATTAGCTATTAGCTATTGCCCTCGACCTGTCTAGAAAAGAGAAAAGCAGCCCAAAGTACCTCTATCACCATATCCACATCTTTAAAACTTTCATAGTCAAGAACTCCCTTGAGTAAAGAAATAGTCTTCTCAAAATTCTCTTTAGTCATATTCCCTTTTTCAACTCGGCTTTGTAGGTTCGCTGTAGGAGAATACATCAAATCATGAGTTCAAAAAAAGCTGCAAATTGAATTGTGGAGTACAAAAGCATGTCAAAATCTTTAGTTTCTAGGGGTACCCCTGACTCTATCAATGCCAGCCTGCAAGAACTCATCGTTCACTTCTTTAAGTATTACAGGATAGTTGCTAAGAATCAATGCTGTAGCTATTCCAGATCCCATTAATCCTCCTCCCACGATAGCAACTTTCGTGATTCTTCTCGGTGCCAAACCAAGATCGGTAACCCCGGGTACCTGTCGAACATGAGGAGATAATAAATTTTTACGTGCAGGGGATAATATGGCCAGCCACTAACAAATTAGCTAGAACTACTGAGCTATAGTATTCTGGTTCAATCTTGAGCTATAGAATTTCTCGACTTCAGTATCGAACATTGTTGTTATTCTAGTTCTAGCCAGTGAGATCGAGCCCCCATAGATAATAAAACCATGAACATCCCAAGATCAAGAGTTCAAGAACAATTCTTCTGAATTAGTTAGCAGAATACTCATTAATGTGCTGCCACTTCTATAAACCCGATCGAACTTTCATGTCCGTCCTTTATTTTCTAACCTTTGTTGTCGAACGCTGGGCAAAGAAGATATGAATTAAGCTTTTACAAGTATCAGTATGTAGGAGTCCCTGAAATTCTTCAACCTCCTGCAATGTCCAGAACAAGATATAGAGAGAGAGATGAAGCGACTACGATTTAAAGATGAAAAGGAATGAAATGGAAGATAAGAAACCTTCCGAAGTCCAGCACGAGGGCCAAAGACGACACCCGTTTCAATGACGTCAATGCAGGCGAACGGGTGCTTAAGATTCGGTGACTGTTTCCTTGCCTGAACTCTGGCAGTCTCAAATATTTCCTTAGCCTCAGCAAGGGACTCTAACTTGTCAGTCCTATGGAGACTATGAACCCATGGCCTTCTCCGCTCTAGGATTTCGAGAGCCCGTTTACGTGCAGTCATGATCAACTCTTCAGGAGGGGCAATGTCATCGACAAGCCCCAAAGAATGAGCTTCATGTCCTTTAATTAGCTTTGATGTCTGTTTACACATGAGAATATAATACATGATATTTTATACTCATGTTACAAAAAGTAATCCATCTAGCTTACCAACATCATTTCTAGAGCCTTTGAGAGACCAACGAGACGTGGAAGCCGTTGTGTTCCTGAGTAGTTTAGAGCCTCGAGATCAAGATAGGCGAACCAAAACACCCCAACCATTTTAAGAATGCGTTTATTTTAAGTCTTAAGGAACTAATCAACACGTTAAACTGAAATCGTACCTCCAAAACCAGGAATTATTCCAAGCTGAAGTTCGGGCAAACCTAGTAGAGCGGTAGGAGTTGATATTCGAGCATGACAGGCCTGAGAAATGAAGGCAAATAAGAGATTGTAACAGGATTAATCCTATGAACATAGCTAGACACTTACCATTGCAACCTCTAAACCTCCACCCAAAGCAAGCCCATGTATTGCTGCAACTGCAGGTTTTCGGGCAGCTGACGGATGAGATTTAAGGAATAAAAACCCAGTAGGTCAGCATAGAATCAAACCTCTCAAAAGTAATATAACAAAACTATCATAGGAACTCCTTACCTTCAAAAATTTCAGTGATAACTTCAATTGATATGTTGCTAACACTTGGTTGGTCCCCTGAGACACATCTTACATGAGGTCAATAACGTCTCAAGCCCACCACTAGTAGATATTGTTCGCTATGATCCGTTACGTATCGCCGTGAACCTCGTGGTTTTAAGACACGTCTGCTAGGGAGAGGTTTCCACACCCTTATAAAGAATGTTTCATTCCCCTCTCCAACCAATGTGGGATCTCACAATCCACCCCCCTTCAGACCCCAACGTCCTCACTAGTACACCGTTCAGCACCTAGCTCTGATACCATTTGTAATAGCCCAAGCCAACCACTAGCAGATATTGTCGCCAACCTCACGGTTTTATAACGCATATGCTAGGGAGAGATTTTCACACTTTTATAAAGAATGTTTTTTTCTCTTCTCCAACCGATGTGGGATCTCACAATCCACCTCATCGTGGCCCAGCGTCCTCGCTGGCACACCGCTTAGTACCTAGCTCTGATACCCTTTGTAACAGCCCAAGCCCACCACTAACAGATATTATCCGCTTTAGCTCGTTACGTATCTCCGTCAGCCTTACGGTTTTAAAATGCGTCTGCTAAGGAGAGGTTTCCACACCCTTATATATAGAATGTTTCGTTTCACCTTCCAACCAACGTGGAATCTCATATCTGGTAAAATCCTTCTGCATAGATATAATGAACTGTTAATGAGGCCTACCCTTTCCTCCTTGGAGTACACCAAAAGCAGCTATATCAAAGCCACCAGAAAACTTCCCCTTCGCACCTACATTCCAAACAATAAAACAATCAGTTCCTTCGTCCCTTACCGGGATCCTAACGTACCTAAGTATAGACAAAACACCAATTACCATCCTAAAACGTCAATGGTTCGATCGAACAAAATTGGTTTAACGTAATACTAACCTGTGACAACAATTGCCTTCACATCATCTCTTTTCAAGGCTTGTTCATAGTTATCTCTTAGGCTGAATAATACTGTATATTGTGTGTAAATGTGCAGAAAACAAAGACAAAAATGGATGGACATGTTAGAATCTCAGTAGAAATAGAGATTTGATAGAAGAAGAAGAAGATGATGATGAAAAAGTGTACCATCAAAAGACAAAGCGTTAACTGGAGGGTTGATGATGGTGATTATAGCCACTCCATCATCTCCCACCTCCATCTCTGTTCTTCCTTTTGCATTGCTTCCCATTTTGCCCCTCTCTCTCTCTCTCTCTAAGATCAAACAATGTAAGGGGAAATTTTTGTAGGAGAAAGTGAAGAGAAATGGAGGAGGAGGTGCATGAAGAGTTGTGATTTCCGCACGGTTTCCGACGAAGGAACAAATATTATTCAAGGTAATCGTTATGTATCACTTTCATTTTGATATGAAACGTTCTTACTCTTATATCT

mRNA sequence

AAAGATTCGAACACAACTAGAGACGTCGGCGATTTATCGCGACCGACCTCCGAACTTCACACTTCACAGCCATGGAGGAAGCCGTTTCTCGAATCCTCACAGAGCTCGAAGAAGCTCGCTGTTTCGACGGCTCTACAAATCTCCATTCTCAGCCTCCGCCGCCGCTCTCCGATTCCGCTCTCTTCGACCTTCAAAGCTTGTTGGACAACTCAATCGGTACCGATGAACAACAACCAGTCGATCGCCTCTACGAAGACCTCTCTGCCAAATCCCTGTCTCCGTCCTCCCTCATACGCGCCATCGTCTCCGCCATGGATGAGCCTTCCCCTCGCATTTCAATCTTAGCCTCTAAAGTCTATTTGTCCCTTCTTCTGGCCCCAAATGCACCAGTCTTCACGCTGTTCAATCCGATGGATTTTCTCTCGTTTCTCAGGTCTATGAGACGATTCTTGAAGCAGCGACCACGGACTACGCCGAATCAGGATGATTCAAATCAGGAGTCCATTGCTCCCAAACGGAAGAGGAAAGGCGGTGTTAAGGGTAAGGGTTTGAGGAATTGTGCGCAGAGGCAGAGTTCTAATGAAGGATATCATGATGGTGAATTCGATGCAAGAGTATTGTATCCTGTGCTTGAGAGGTTAGGGATATTAATGAGTTTGATTCACTTGGATCGATTCCCTGATAGTTTGAAATCTTTGGTTGAAACTGTAATTGATATTCCGGTTTTGGCACTAGAAGTATGCACTAATTTAAGTATCTATAGTAAGTTAACCAATTTATGTTCGCGGATTTTGAGTGCTACATTGCGTCCTGAGCACGGGGATCTGGTGAGTATTGCCGCTGAGGTGATTAAATCTCTATCACCATTGATTCTTCATCATAAAGATCAGGCGCGAGCGTTTGCGCTGGAGTTTGTCACTATTCAAATAGCGAATGCAGCAAAGGAATCAGATGGTGTTAAAAGCGCTCTGGTGAATCTTCCAAGGTATTTGGTTCAGAAGGCACCTGAGAAATCCGAGCCTCGTGCTTTAGCTGTTGATTCAATAATGGAGGTTGTTAAAGTCATGGAATTCAAGGATCAAATTGGGTTTGTGGATTATGTAGTGAAGATGACCCGAGGAAAGTCTAATCTTAGGCTGTTAGCTAGCGATCTTATCTCAACGCTGATAATGTCCTTGAGTGATCCAGTGGCCGTTGATTCCGAAAGTGAATTGAAGGATTCGTGGAGATTTGGGTGCTTGGTGGCATTGGTCCAACGATGTTCAGATGCTGGTGCTACAATTCGTGCCCGGGCACTATCCAACTTAGCTCACCTTGTGGTGTTCTTGTCTGACAATGACAAGAATAAGGCCTTACTGAAGGAAGTGTTGGGGTCTGGTGACAGACATTGCAAAAAGAATGGAAGTGAAATTCATGCTCTTTTGCGAAAAAGGTGTGTGGATGAGAAGGCGGCAGTTAGGAAGGCTGCGTTATTTCTGGTTACCAAGTGTACAGCCCTCCTTGGTGGATCAATGGATGGAGATATGCTGAAGACCGTTGGAATTGCTTGTTCCGACCCACTTGTTAGCATACGGAAAGCTGCAATGTCGGCTCTTTCTGAGGCCTTTCGAAGATTCCCAGATGAAAGTGTCACTGTCGAGTGGCTACATTCCGTTCCACGTTTGATTGCTGATAATGAATCTAGCATTCAAGAAGAGTGTGAAAACTTGTTTCAAGAACTAGTACTAGACCGGCTATCTAGAGTTGGATCTTCAAGTTTACCCCGTGATGGATCAAAAACCCTTGATTTAAAGAGGCAATTTGAGTTGTTATTTCCTGGAGTATTGGATCTTCTTAAAGAGATAAGTCATGCAGAGGTAATGCCGTGGGTGAAGAAAGTTTGTGCTAACTTGGGAAAAAAGAAACGATTAAAACCCTCAATTGCCGTTTCACTTCAGAAGATTATAGGGACATCTGAATCCTTATGGCTAAGCCAATCCTTACCACCGGAGAAGTGGACAGCTCCCCCGGGTGCCTGGTTCCTGCTGTCGGAAGTGTCAGCTTATCTTGTGAAAGAAGTGGACTGGGAGTTTCTTCATCATCACTGGAAAATCCTGGATGATCATGGTAGGGCTGAGTTTGGGAGTCCAGTGGCTCAAGTAGGTTTATTCGAAGATGAAAACAACTCAGAGTCAAATACCATCGCTTGGGCTCAAGATCGAGTTTTTCTCTTGCAAACCATCTCTAATGTTTCTCTTGAGCTGCCTCCAGAACCTGCAGAAAAGTTGGCTCACGAATTGATTAAAAAGGTTGAAGAATTCAGCATGCATTCTACCGAGGTGAATGCGCATGTAAAAACACTTAAAACATTGTGCAAACGGCAGGCTATGCAATCTGCTGAAGCCGATACGCTGATCCTAACATGGGTAAACCAGCTGCTCTCCAAAGCCTCTCAAATATTGGAGAAGTATATATCCAAGCATAGAAAAGCAAAAAAAGATGTGAACTTCATAACACCACCTCCTAAAAGTGGTAGTAGATTGGGAAAGCGAGCAACTGCCCCGAGCAAGTCCCTGTCACGAGCAATCACTGCTGCTTACACCATTGGATCTTTGATTATCATCTGTCCATCTGCTGACATAACTACCATTATACCACTTCTCCACACCATTATCACTTCTGGAAACCGTGATCCCAAATCGAATAAACTACCTATTCAAACGGTATCTATAAAGGAAACTGCACCTTCTTTATACGTTCAAGCATGGTTAACAATGGGGAAGATTTGCCTTGCTGACGAGAAGCGTGCAAAGAGCTACATTCCTCTATTTGTACAGGAGCTTGAAAATAGTGACTGTGCAGCTCTCCGCAACAACCTTGTTATCACGATGGCAGATTTTTGTGTACGCTATACTGCTCTAGTTGATTGCTACCTTACGAAGATCACGAAATGCCTTCGCGACCCTTGTGAACTTGTCAGAAGACATACATTCATACTTCTATCGAGATTACTACAGAGAGACTACGTGAAATGGAGAGGGGTTTTGTTTCTTCGGTTTCTCTTGTCACTTGTCGACGAATCAGAGAAGATTCGTCAACTAGCAGATTTCCTTTTTGGGAACATTTTAAAAGTAAAGGCGCCACTTTTGGCTTACAACAGTTTTGTAGAAGCAATTTATGTTCTGAATGACTGTCGTGCCCATCCTGGACATAACGATTCGAAGGCTTCGAGAGCTGAAAGCCGATTATTCTCCATTCGAGGTAATGATGAAAGTTCGAGGCGCAAGAGAATGCACATTTATGTTTCTTTGTTAAAACAAATGGCTCCCGAACATCTCCTGGCCACCTTTGCAAAGCTATGTTCAGAGATTCTTGGTGAAGCTTCAGATGGTAAACTCAGTATGGATGATACCACTGCACGGTCTGTTTTACAGGACACCTTTGATATTCTTGCTTGTAAAGAGATTCGATTATCGATAAATCGAGGGTCGTCGTCGGAATCTGGTGATGTAGATGAGGAAGGAGGCGAGAGCGGAGGGGTGTCTGCTGCTAGAGGACAGGTCATCACTCATGTTGTGAGAAAAAGTCTCATTCAAAACTCCCTTCCCATCTTCATTGAGCTAAAAAGGCTAATGGAAAGCAAGAATAGCCCTCTTATAGGTTCTCTTATGGAATGTCTTCGAGTTCTTGTCAAAGACTACAAGAACGACATCGATGACATGTTGGTAGCTGATAAGCAACTTCAGAAAGAGCTCGTCTATGACATTCAAAAGTACGAAGCTACCAAAGCTAAGTCGGCCGCGGCCGAGGCGGTCAACGAGATGCAAAAGTCAACTAATTATCTTTCTCCTGAGGCTCCTCCTCATGTTAGAAACTCCATTAACAAGCTAACCTCCAAACTCCAAAAGGACTCAAGAGTTGCTTCAGCCATTGCTGATGTAGCCGCTGCAGCCACTGCTAAGTCTGTGCTTAGAGAAGTGAATAGAGGGACTTCGACGCCGCCTCTTGGTTCTCTAAGTTTGCCTAAACTCAAGTCTCGTACTGATGGAAACAATGGCGCAAATACCTCACGCTTGAATGTGATTGAATCTGTGAGGAAACGGCAGTCTTTTGATTCTGATGAAGAAAACTAATGTGCATAGGCTAACCAGAAAGTCTCCACGGCTTTTAATGGAAATGAAAAGTCCTGTCTGCTGACGAACGAATGCCATTGCAGCAGGCCAACTCCGAGCTCGGTGTTCATGGGGAAAATTTTTCTATTTGGAGTTCAATCTTCTGAATGAATCCCGATCCTCATCTTTCTCCGAGTTTGAAGCCGTCAATCTTCTTTTATCAGGTCTTATCTGTCTCGAATTTGTTATCAGTATAGTTGGAATTGTATATGAAATTGTGCTGAAGGAAATCCGGCGGTTTAGGGTTTTCTTTGAAAGATTGTGTCGGTGCGATTATGGATGGGAATAAGGACGAAGCTTTAAGATGTATTCGCATAGCGGAAGAAGCAATTGCATCGGGGAGTAAAGAGCGAGCACTCAAATTCATTAAAATTGCTTGTCGCCTTAATCGGAGTTTGGAAGTGAACGAGTTATTGTCGGCGTGTGAGGATATTGACTCGAAATCACCTTCATCTTCCTCCGATGGGAAACGCGCGGGAAAAGTTCATAGTGTTTCTGGTTCGGCGAATCATGTTGACGGTTTGAATGGTGAACGTAATTACAGTATTGAACATGTTCAATTGATCAGGCAGATAAAAACAGCTAAGGACTATTACAAAATTCTTGGTGTGGAGAAAACTAGTTCGGCTGAAGAAATTAAGAGGGCTTATAAGAAATTGTCCTTGAAAGTTCACCCTGATAAGAACAAGGCTCCTGGTTCAGATGAAGCATTCAAGAAATTATCCAAGGCTTTCATGTGTTTGAGCGATGACACGTCGAGAAGGCAATATGACCACACAGCTTTGGTTGATCAATATGAGTACAACCAACAGCATAATGTAAGGCGCAGGAGAACGGGTCGTGATTTGTTTGAAGAAAATTTTGATCCTGATGAGATATTCAGGGCGTTCTTTGGGCAAGGAAATATGTTTCAGACGAGTCGTGCTTATACTTATAGCACTGGAGGTGCGAGAAGCCAACGGAGGACAGAATCTGATGGGGGAGGTCCCAACTTGTTGATTTTTCTTCTAATGTTACCATTCTTGTTAATTGTCTTGTTGGCTTATATGCCCTTTCCAGAGCCAGACTATTCTTTGCATAAAAATCTATCCTACAGTATTCCCATGGCTACGGAGAAACATGGAGTGGAGTTTTTTGTCAAATCATCAGATTTTGATGAAAGGTATCCTCTTGGAAGCGGAGCTCGATCTGAAATAGAGAGTAGTGTGATTAGGGATTACAAAAACATGGTTTGGCGTTATTGTCATGTAGAACTCCAGAGACGCAAGTGGAATAAAAATCTCCCAACTCCTCATTGCGAGAAGATGAATAATCTCGGGTTTGCATAAGTTATATTGATGGATTTTGAAGTTGTGAGACTCTATGCTCAGATGCAGAACAGGACACTCATGGCTGGATCACGAGGGCTAAGACCCAAAGGTTTATCGGAGTTGTGCTGAGCCAATCACTTGGCACCGTTGGCCATCAATGACGCTGGAGTGAAAACAAAGACAAAAATGGATGGACATGTTAGAATCTCAGTAGAAATAGAGATTTGATAGAAGAAGAAGAAGATGATGATGAAAAAGTGTACCATCAAAAGACAAAGCGTTAACTGGAGGGTTGATGATGGTGATTATAGCCACTCCATCATCTCCCACCTCCATCTCTGTTCTTCCTTTTGCATTGCTTCCCATTTTGCCCCTCTCTCTCTCTCTCTCTAAGATCAAACAATGTAAGGGGAAATTTTTGTAGGAGAAAGTGAAGAGAAATGGAGGAGGAGGTGCATGAAGAGTTGTGATTTCCGCACGGTTTCCGACGAAGGAACAAATATTATTCAAGGTAATCGTTATGTATCACTTTCATTTTGATATGAAACGTTCTTACTCTTATATCT

Coding sequence (CDS)

ATGGATGGGAATAAGGACGAAGCTTTAAGATGTATTCGCATAGCGGAAGAAGCAATTGCATCGGGGAGTAAAGAGCGAGCACTCAAATTCATTAAAATTGCTTGTCGCCTTAATCGGAGTTTGGAAGTGAACGAGTTATTGTCGGCGTGTGAGGATATTGACTCGAAATCACCTTCATCTTCCTCCGATGGGAAACGCGCGGGAAAAGTTCATAGTGTTTCTGGTTCGGCGAATCATGTTGACGGTTTGAATGGTGAACGTAATTACAGTATTGAACATGTTCAATTGATCAGGCAGATAAAAACAGCTAAGGACTATTACAAAATTCTTGGTGTGGAGAAAACTAGTTCGGCTGAAGAAATTAAGAGGGCTTATAAGAAATTGTCCTTGAAAGTTCACCCTGATAAGAACAAGGCTCCTGGTTCAGATGAAGCATTCAAGAAATTATCCAAGGCTTTCATGTGTTTGAGCGATGACACGTCGAGAAGGCAATATGACCACACAGCTTTGGTTGATCAATATGAGTACAACCAACAGCATAATGTAAGGCGCAGGAGAACGGGTCGTGATTTGTTTGAAGAAAATTTTGATCCTGATGAGATATTCAGGGCGTTCTTTGGGCAAGGAAATATGTTTCAGACGAGTCGTGCTTATACTTATAGCACTGGAGGTGCGAGAAGCCAACGGAGGACAGAATCTGATGGGGGAGGTCCCAACTTGTTGATTTTTCTTCTAATGTTACCATTCTTGTTAATTGTCTTGTTGGCTTATATGCCCTTTCCAGAGCCAGACTATTCTTTGCATAAAAATCTATCCTACAGTATTCCCATGGCTACGGAGAAACATGGAGTGGAGTTTTTTGTCAAATCATCAGATTTTGATGAAAGGTATCCTCTTGGAAGCGGAGCTCGATCTGAAATAGAGAGTAGTGTGATTAGGGATTACAAAAACATGGTTTGGCGTTATTGTCATGTAGAACTCCAGAGACGCAAGTGGAATAAAAATCTCCCAACTCCTCATTGCGAGAAGATGAATAATCTCGGGTTTGCATAA

Protein sequence

MDGNKDEALRCIRIAEEAIASGSKERALKFIKIACRLNRSLEVNELLSACEDIDSKSPSSSSDGKRAGKVHSVSGSANHVDGLNGERNYSIEHVQLIRQIKTAKDYYKILGVEKTSSAEEIKRAYKKLSLKVHPDKNKAPGSDEAFKKLSKAFMCLSDDTSRRQYDHTALVDQYEYNQQHNVRRRRTGRDLFEENFDPDEIFRAFFGQGNMFQTSRAYTYSTGGARSQRRTESDGGGPNLLIFLLMLPFLLIVLLAYMPFPEPDYSLHKNLSYSIPMATEKHGVEFFVKSSDFDERYPLGSGARSEIESSVIRDYKNMVWRYCHVELQRRKWNKNLPTPHCEKMNNLGFA
BLAST of Cp4.1LG12g02770 vs. Swiss-Prot
Match: DNJ49_ARATH (Chaperone protein dnaJ 49 OS=Arabidopsis thaliana GN=ATJ49 PE=2 SV=2)

HSP 1 Score: 347.4 bits (890), Expect = 1.8e-94
Identity = 185/355 (52.11%), Postives = 256/355 (72.11%), Query Frame = 1

Query: 1   MDGNKDEALRCIRIAEEAIASGSKERALKFIKIACRLNRSLEVNELLSACEDIDSKSPSS 60
           MDGNKD+A RC+RIAE+AI SG KERALKFI +A RLN SL V+EL++AC+++DS S +S
Sbjct: 1   MDGNKDDASRCLRIAEDAIVSGDKERALKFINMAKRLNPSLSVDELVAACDNLDSVSRNS 60

Query: 61  SSDGKRAGKVHSVSGSANHVDGLNGERNYSIEHVQLIRQIKTAKDYYKILGVEKTSSAEE 120
           S     + K+ ++ G  + ++   G+  Y+ E+V L+R I    DYY ILG+EK  S +E
Sbjct: 61  SV----SEKLKTMDGDDDKLE--TGKMKYTEENVDLVRNIIRNNDYYAILGLEKNCSVDE 120

Query: 121 IKRAYKKLSLKVHPDKNKAPGSDEAFKKLSKAFMCLSDDTSRRQYDHTALVDQYEYNQQH 180
           I++AY+KLSLKVHPDKNKAPGS+EAFKK+SKAF CLSD  SRRQ+D   +VD++++ Q+ 
Sbjct: 121 IRKAYRKLSLKVHPDKNKAPGSEEAFKKVSKAFTCLSDGNSRRQFDQVGIVDEFDHVQRR 180

Query: 181 NVRRRR---TGRDLFEENFDPDEIFRAFFGQGN-MFQTSRAYTYSTGGARSQ-RRTESDG 240
           N R RR   T  D F++ FDP+EIFR  FGQ   +F+ S AY   T   R+Q R  E + 
Sbjct: 181 NRRPRRRYNTRNDFFDDEFDPEEIFRTVFGQQREVFRASHAYR--TRQPRNQFREEEINV 240

Query: 241 GGPNLLIFLLMLPFLLIVLLAYMPFPEPDYSLHKNLSYSIPMATEKHGVEFFVKS-SDFD 300
            GP+ L  + +LPF L++LLAY+PF EPDYSLHKN SY IP  T+   + F+V+S S FD
Sbjct: 241 AGPSCLTIIQILPFFLLLLLAYLPFSEPDYSLHKNQSYQIPKTTQNTEISFYVRSASAFD 300

Query: 301 ERYPLGSGARSEIESSVIRDYKNMVWRYCHVELQRRKWNKNLPTPHCEKMNNLGF 350
           E++PL S AR+ +E +VI++YK+ +++ C +ELQ+R+WNK +PTPHC ++ + GF
Sbjct: 301 EKFPLSSSARANLEGNVIKEYKHFLFQSCRIELQKRRWNKKIPTPHCIELQDRGF 347

BLAST of Cp4.1LG12g02770 vs. Swiss-Prot
Match: DJB14_XENTR (DnaJ homolog subfamily B member 14 OS=Xenopus tropicalis GN=dnajb14 PE=2 SV=1)

HSP 1 Score: 160.2 bits (404), Expect = 4.0e-38
Identity = 117/339 (34.51%), Postives = 173/339 (51.03%), Query Frame = 1

Query: 1   MDGNKDEALRCIRIAEEAIASGSKERALKFIKIACRLNRSLEVNELLSACEDIDS--KSP 60
           M+ N+DEA RC+RIA+ AI +G KE+A +F+  A RL  S E   LL A E  D+    P
Sbjct: 1   MESNRDEAERCVRIAKAAIEAGDKEKAKRFLSKAERLYPSSEARALLQAFEKNDTAGNGP 60

Query: 61  SSS--SDGKRAGKVHSVSGSANHVDGLNGERNYSIEHVQLIRQIKTAKDYYKILGVEKTS 120
            S+  + G    K    S ++   D   G     ++ VQ I++ KT   YY++LGV   +
Sbjct: 61  QSAKMAKGTEQPKAEKDSNASASSDTGKGHTQDQLDGVQRIKKCKT---YYEVLGVSTDA 120

Query: 121 SAEEIKRAYKKLSLKVHPDKNKAPGSDEAFKKLSKAFMCLSDDTSRRQYDHTALVDQYEY 180
             E++K+AY+KL+LK HPDKN APG+ EAFKK+  A+  LS+   R+QYD T   DQ + 
Sbjct: 121 GEEDLKKAYRKLALKFHPDKNHAPGATEAFKKIGNAYAVLSNPEKRKQYDLTGSEDQMQN 180

Query: 181 NQQHNVRRRRTGRDLFEENFDPDEIFRAFFGQGNMFQTSRAYTYSTGGAR---------- 240
           N ++       G   FE +  P+++F  FFG G  F +   +T+S G AR          
Sbjct: 181 NHRNGGFDYHRG---FEADITPEDLFNMFFGGG--FPSGSVHTFSNGRARYSHHQHHHHS 240

Query: 241 -SQRRTESDGGGPNLLIFLL-MLPFLLIVLLAYMPFPEPDYSLHKNLSYSIPMATEKHGV 300
              R  E   GG ++ I L+ ++  +L+ LL+      P YSL+     +    TE   +
Sbjct: 241 GHDREDERADGGFSMFIQLMPIIVLILVSLLSQFMVSNPPYSLYPRSGQATKRVTENLQI 300

Query: 301 EFFVKSSDFDERYP--LGSGARSEIESSVIRDYKNMVWR 322
            ++V S DF   Y   L       IE   + + +N  WR
Sbjct: 301 AYYV-SKDFQSEYSGILLQKLEKNIEEDYVANVRNNCWR 330

BLAST of Cp4.1LG12g02770 vs. Swiss-Prot
Match: DJB14_XENLA (DnaJ homolog subfamily B member 14 OS=Xenopus laevis GN=dnajb14 PE=2 SV=1)

HSP 1 Score: 156.8 bits (395), Expect = 4.5e-37
Identity = 107/337 (31.75%), Postives = 171/337 (50.74%), Query Frame = 1

Query: 1   MDGNKDEALRCIRIAEEAIASGSKERALKFIKIACRLNRSLEVNELLSACEDIDSKSPSS 60
           M+ N+DEA RC+RI + AI +G KE+A +F   A RL  S E   LL A E  D     +
Sbjct: 1   MESNRDEAERCVRIGKAAIEAGDKEKARRFFSKAERLYPSSEARVLLDALEKND-----T 60

Query: 61  SSDGKRAGKVHSVSGSANHVDGLNGE--RNYSIEHVQLIRQIKTAKDYYKILGVEKTSSA 120
           + +G ++ K+   +         +G+  + ++ + V  +++IK  K YY++LGV   +  
Sbjct: 61  AGNGPQSEKMSKSTEQPKAEKDSSGDTGKGHTQDQVDGVQRIKKCKTYYEVLGVSPDAGE 120

Query: 121 EEIKRAYKKLSLKVHPDKNKAPGSDEAFKKLSKAFMCLSDDTSRRQYDHTALVDQYEYNQ 180
           E++K+AY+KL+LK HPDKN APG+ EAFKK+  A+  LS+   R+QYD T   D  + N 
Sbjct: 121 EDLKKAYRKLALKFHPDKNHAPGATEAFKKIGNAYAVLSNPEKRKQYDLTGSEDNVQNNH 180

Query: 181 QHNVRRRRTGRDLFEENFDPDEIFRAFFGQGNMFQTSRAYTYSTGGAR------------ 240
           ++       G   FE +  P+++F  FFG G  F +   +T+S G  R            
Sbjct: 181 RNGGFDYHRG---FEADITPEDLFNMFFGGG--FPSGSVHTFSNGRTRYSHHQHHHHSGH 240

Query: 241 SQRRTESDGGGPNLLIFLLMLPFLLIVLLAYMPFPEPDYSLHKNLSYSIPMATEKHGVEF 300
            +    +DGG    +  + ++  +L+ LL+ +    P YSL+     +I   TE   + +
Sbjct: 241 DREEERADGGFSMFIQLMPIIVLILVSLLSQLMVSNPPYSLYPRSGQTIKRVTENLQISY 300

Query: 301 FVKSSDFDERY--PLGSGARSEIESSVIRDYKNMVWR 322
           +V S DF   Y   L       IE   + + +N  WR
Sbjct: 301 YV-SKDFKSEYNGMLLQKLEKNIEEDYVANVRNNCWR 326

BLAST of Cp4.1LG12g02770 vs. Swiss-Prot
Match: DJB12_MOUSE (DnaJ homolog subfamily B member 12 OS=Mus musculus GN=Dnajb12 PE=1 SV=2)

HSP 1 Score: 154.8 bits (390), Expect = 1.7e-36
Identity = 115/345 (33.33%), Postives = 183/345 (53.04%), Query Frame = 1

Query: 1   MDGNKDEALRCIRIAEEAIASGSKERALKFIKIACRLNRSLEVNELLSACEDIDSKSPSS 60
           M+ NKDEA RCI IA +AI S   ERAL+F++ A RL  +  V+ L+   E ++ K  S+
Sbjct: 1   MESNKDEAERCISIALKAIQSNQPERALRFLEKAQRLYPTPRVSALI---ESLNQKPQST 60

Query: 61  SSDGKRAGKVHSVSGSANHVD--GLNGE-------RNYSIEHVQLIRQIKTAKDYYKILG 120
               +     H+ +  A   +    NGE       + Y+ E V  ++++K  KDYY+ILG
Sbjct: 61  GDHPQPTDTTHTTTKKAGGTETPSANGEAGGGESAKGYTSEQVAAVKRVKQCKDYYEILG 120

Query: 121 VEKTSSAEEIKRAYKKLSLKVHPDKNKAPGSDEAFKKLSKAFMCLSDDTSRRQYDHTALV 180
           V +++S E++K+AY+KL+LK HPDKN APG+ EAFK +  A+  LS+   R+QY      
Sbjct: 121 VSRSASDEDLKKAYRKLALKFHPDKNHAPGATEAFKAIGTAYAVLSNPEKRKQY------ 180

Query: 181 DQYEYNQQHNVRRRRTGRDL---FEENFDPDEIFRAFFGQGNMFQTSRAYTYSTGGAR-- 240
           DQ+  ++    R   +  D    FE +  P+++F  FFG G  F +S  + YS G  R  
Sbjct: 181 DQFGDDKSQAARHGHSHGDFHRGFEADISPEDLFNMFFGGG--FPSSNVHVYSNGRMRYT 240

Query: 241 -SQRRTESDG-GGPNLLIFLLMLPFLLIVL---LAYMPFPEPDYSL--HKNLSYSIPMAT 300
             QR+   D  G   L +F+ ++P L+++L   L+ +    P YSL    ++ +     T
Sbjct: 241 YQQRQDRRDNQGDGGLGVFVQLMPILILILVSALSQLMVSSPPYSLSPRPSVGHIHKRVT 300

Query: 301 EKHGVEFFVKSSDFDERYPLGSGARS---EIESSVIRDYKNMVWR 322
           +   V ++V +  F E Y  GS  ++    +E   I + +N  W+
Sbjct: 301 DHLNVAYYV-ADTFSEEY-TGSSLKTVERNVEDDYIANLRNNCWK 332

BLAST of Cp4.1LG12g02770 vs. Swiss-Prot
Match: DJB12_HUMAN (DnaJ homolog subfamily B member 12 OS=Homo sapiens GN=DNAJB12 PE=1 SV=4)

HSP 1 Score: 151.0 bits (380), Expect = 2.4e-35
Identity = 114/344 (33.14%), Postives = 178/344 (51.74%), Query Frame = 1

Query: 1   MDGNKDEALRCIRIAEEAIASGSKERALKFIKIACRLNRSLEVNELLSACEDIDSKSPSS 60
           M+ NKDEA RCI IA +AI S   +RAL+F++ A RL  +  V  L+   E ++ K  ++
Sbjct: 1   MESNKDEAERCISIALKAIQSNQPDRALRFLEKAQRLYPTPRVRALI---ESLNQKPQTA 60

Query: 61  SSDGKRAGKVHSVSGSANHVDG--LNGE------RNYSIEHVQLIRQIKTAKDYYKILGV 120
                     H+    A   D    NGE      + Y+ E V  ++++K  KDYY+ILGV
Sbjct: 61  GDQPPPTDTTHATHRKAGGTDAPSANGEAGGESTKGYTAEQVAAVKRVKQCKDYYEILGV 120

Query: 121 EKTSSAEEIKRAYKKLSLKVHPDKNKAPGSDEAFKKLSKAFMCLSDDTSRRQYDHTALVD 180
            + +S E++K+AY++L+LK HPDKN APG+ EAFK +  A+  LS+   R+QY      D
Sbjct: 121 SRGASDEDLKKAYRRLALKFHPDKNHAPGATEAFKAIGTAYAVLSNPEKRKQY------D 180

Query: 181 QYEYNQQHNVRRRRTGRDL---FEENFDPDEIFRAFFGQGNMFQTSRAYTYSTGGAR--- 240
           Q+  ++    R      D    FE +  P+++F  FFG G  F +S  + YS G  R   
Sbjct: 181 QFGDDKSQAARHGHGHGDFHRGFEADISPEDLFNMFFGGG--FPSSNVHVYSNGRMRYTY 240

Query: 241 SQRRTESDG-GGPNLLIFLLMLPFLLIVL---LAYMPFPEPDYSL--HKNLSYSIPMATE 300
            QR+   D  G   L +F+ ++P L+++L   L+ +    P YSL    ++ +     T+
Sbjct: 241 QQRQDRRDNQGDGGLGVFVQLMPILILILVSALSQLMVSSPPYSLSPRPSVGHIHRRVTD 300

Query: 301 KHGVEFFVKSSDFDERYPLGSGARS---EIESSVIRDYKNMVWR 322
             GV ++V    F E Y  GS  ++    +E   I + +N  W+
Sbjct: 301 HLGVVYYV-GDTFSEEY-TGSSLKTVERNVEDDYIANLRNNCWK 331

BLAST of Cp4.1LG12g02770 vs. TrEMBL
Match: A0A0A0KG46_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G306340 PE=4 SV=1)

HSP 1 Score: 581.3 bits (1497), Expect = 8.1e-163
Identity = 294/352 (83.52%), Postives = 318/352 (90.34%), Query Frame = 1

Query: 1   MDGNKDEALRCIRIAEEAIASGSKERALKFIKIACRLNRSLEVNELLSACEDIDSKSPSS 60
           MDGNKDEALRCIRIAEE+IASG+KERAL+FIKIA RLN+S++V+ELL+ACE+I S     
Sbjct: 1   MDGNKDEALRCIRIAEESIASGNKERALRFIKIARRLNQSVQVDELLAACEEIGS----G 60

Query: 61  SSDGKRAGKVHSVSGSANHVDGLNGERNYSIEHVQLIRQIKTAKDYYKILGVEKTSSAEE 120
           SS+ KRAGK  SVSGS  H DGLNGERNYS+EHVQLIRQIKT KDYY ILGVEKTSSAEE
Sbjct: 61  SSEEKRAGKGESVSGSVKHGDGLNGERNYSMEHVQLIRQIKTTKDYYGILGVEKTSSAEE 120

Query: 121 IKRAYKKLSLKVHPDKNKAPGSDEAFKKLSKAFMCLSDDTSRRQYDHTALVDQYEYNQQH 180
           IKRAY+KLSLKVHPDKNKAPGS+EAFKKLSKAF CLSDDT RRQYDHT LVDQYEYNQQH
Sbjct: 121 IKRAYRKLSLKVHPDKNKAPGSEEAFKKLSKAFSCLSDDTLRRQYDHTPLVDQYEYNQQH 180

Query: 181 NVR--RRRTGRDLFEENFDPDEIFRAFFGQGNMFQTSRAYTYSTGGARSQRRTESDGGGP 240
           NVR  RRR G DLFEENFDPDEIFRAFFGQGNMFQTSRAYTY TGGA SQ+RTES GGGP
Sbjct: 181 NVRQRRRRNGHDLFEENFDPDEIFRAFFGQGNMFQTSRAYTYRTGGAGSQQRTESYGGGP 240

Query: 241 NLLIFLLMLPFLLIVLLAYMPFPEPDYSLHKNLSYSIPMATEKHGVEFFVKSSDFDERYP 300
           N LI LLMLPFLLI LLAYMPFPEP+Y+LHK+LSYSIPMATEKHGVEFFVKSSDFDERYP
Sbjct: 241 NFLIILLMLPFLLICLLAYMPFPEPEYALHKSLSYSIPMATEKHGVEFFVKSSDFDERYP 300

Query: 301 LGSGARSEIESSVIRDYKNMVWRYCHVELQRRKWNKNLPTPHCEKMNNLGFA 351
           LGS  R E+E+SV+RDY+NMVWRYCH+ELQRR+WNKNLPTPHCEK+N L  A
Sbjct: 301 LGSPGRVELENSVLRDYRNMVWRYCHIELQRRQWNKNLPTPHCEKLNTLAVA 348

BLAST of Cp4.1LG12g02770 vs. TrEMBL
Match: M5XBK5_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa007951mg PE=4 SV=1)

HSP 1 Score: 464.5 bits (1194), Expect = 1.1e-127
Identity = 237/350 (67.71%), Postives = 282/350 (80.57%), Query Frame = 1

Query: 1   MDGNKDEALRCIRIAEEAIASGSKERALKFIKIACRLNRSLEVNELLSACEDIDSKSPSS 60
           MDGNKDEAL+C+RIAEEAIASG+K RALKFIKIA RLN+SL+VNELL+ACE IDS SP+S
Sbjct: 1   MDGNKDEALKCVRIAEEAIASGNKGRALKFIKIARRLNQSLQVNELLAACEKIDSGSPAS 60

Query: 61  SSDGKRAGKVHSVSGSANHVDGLNGERNYSIEHVQLIRQIKTAKDYYKILGVEKTSSAEE 120
           S   K A ++ +  G      GLNGE +Y+ EHVQLIR+IK  KDYY ILGVEKT S E+
Sbjct: 61  SIGEKGATEIKNEPGVEKLGQGLNGEVSYTEEHVQLIRKIKRNKDYYAILGVEKTCSVED 120

Query: 121 IKRAYKKLSLKVHPDKNKAPGSDEAFKKLSKAFMCLSDDTSRRQYDHTALVDQYEYNQQH 180
           I++AY+KLSLKVHPDKNKAPGS+EAFK +SKAF CLSD  SRRQYD T LVD++EYNQQH
Sbjct: 121 IRKAYRKLSLKVHPDKNKAPGSEEAFKIVSKAFKCLSDGDSRRQYDQTGLVDEFEYNQQH 180

Query: 181 NV--RRRRTGRDLFEENFDPDEIFRAFFGQGNMFQTSRAYTYSTGGARSQRRTESDGGGP 240
           NV  RRRR G DLF+++FDPDEIFRAFFGQ +MF+TS  + Y T      +R E  GGGP
Sbjct: 181 NVRRRRRRAGHDLFDDDFDPDEIFRAFFGQSDMFRTS--HVYRTSRTAGHQREEVQGGGP 240

Query: 241 NLLIFLLMLPFLLIVLLAYMPFPEPDYSLHKNLSYSIPMATEKHGVEFFVKSSDFDERYP 300
           N+++ + +LPFL+IVLLAY+PF EP+YSL K  +Y IP  TEKHGVEF+VKS  FDE YP
Sbjct: 241 NIMVLIQLLPFLVIVLLAYLPFSEPNYSLQKTYNYQIPKTTEKHGVEFYVKSEAFDENYP 300

Query: 301 LGSGARSEIESSVIRDYKNMVWRYCHVELQRRKWNKNLPTPHCEKMNNLG 349
           LGS ARS IE+ VI+DYKN++  YC VELQRR W+KNLPTPHC+K+NNLG
Sbjct: 301 LGSVARSNIENHVIKDYKNVLLHYCRVELQRRHWSKNLPTPHCDKLNNLG 348

BLAST of Cp4.1LG12g02770 vs. TrEMBL
Match: A0A061EQA7_THECC (Heat shock protein DnaJ OS=Theobroma cacao GN=TCM_021223 PE=4 SV=1)

HSP 1 Score: 461.1 bits (1185), Expect = 1.2e-126
Identity = 237/353 (67.14%), Postives = 278/353 (78.75%), Query Frame = 1

Query: 1   MDGNKDEALRCIRIAEEAIASGSKERALKFIKIACRLNRSLEVNELLSACEDIDS-KSPS 60
           MDGNKDEALRC+ IAEEAIASG+KERALKFIKIA RLN SL V++LL+ACE++DS  SP+
Sbjct: 1   MDGNKDEALRCVHIAEEAIASGNKERALKFIKIAQRLNHSLSVDQLLAACENLDSGSSPA 60

Query: 61  SSSDGKRAGKVHSVSGSANHVDGLNGERNYSIEHVQLIRQIKTAKDYYKILGVEKTSSAE 120
           S    K      +  GS     GLNGER+Y+ EHVQLIRQIK  KDYY ILGVEKT SA+
Sbjct: 61  SPVVEKCVSSNKNRGGSTKLDKGLNGERSYTEEHVQLIRQIKRHKDYYAILGVEKTCSAD 120

Query: 121 EIKRAYKKLSLKVHPDKNKAPGSDEAFKKLSKAFMCLSDDTSRRQYDHTALVDQYEYNQQ 180
           E++RAYKKLSLKVHPDKNKAPGS+EAFKK+ KAF CLS D SRRQYD   LVD++EYNQQ
Sbjct: 121 EVRRAYKKLSLKVHPDKNKAPGSEEAFKKVCKAFKCLSVDDSRRQYDQVGLVDEFEYNQQ 180

Query: 181 HNVR--RRRTGRDLFEENFDPDEIFRAFFGQGNMFQTSRAYTYSTGGARSQRRTESDGGG 240
           HNVR  RRR G DLF++ FDPDEIFRAFFGQG+MF+TS  + Y T G    +R +  GGG
Sbjct: 181 HNVRQRRRRYGNDLFDDEFDPDEIFRAFFGQGDMFRTS--HVYRTRGMGGHQREQRHGGG 240

Query: 241 PNLLIFLLMLPFLLIVLLAYMPFPEPDYSLHKNLSYSIPMATEKHGVEFFVKSSDFDERY 300
           PN L+ L +LPFLLI LLAY+P  EP+YSL +N SY IP  TEK+GVEF+VKSS FD  +
Sbjct: 241 PNFLVLLQILPFLLIFLLAYLPISEPEYSLFRNYSYQIPKTTEKYGVEFYVKSSAFDVNF 300

Query: 301 PLGSGARSEIESSVIRDYKNMVWRYCHVELQRRKWNKNLPTPHCEKMNNLGFA 351
           PLGS AR+  E +VI+DY++M+WRYCHVE Q+R WNKNLPTPHC K+ NLG A
Sbjct: 301 PLGSPARANFEDNVIKDYRHMLWRYCHVERQKRHWNKNLPTPHCNKLQNLGLA 351

BLAST of Cp4.1LG12g02770 vs. TrEMBL
Match: B9RT65_RICCO (Chaperone protein dnaJ, putative OS=Ricinus communis GN=RCOM_0681300 PE=4 SV=1)

HSP 1 Score: 454.1 bits (1167), Expect = 1.5e-124
Identity = 230/358 (64.25%), Postives = 282/358 (78.77%), Query Frame = 1

Query: 1   MDGNKDEALRCIRIAEEAIASGSKERALKFIKIACRLNRSLEVNELLSACEDIDSKSPSS 60
           MDGNKDEALRCIRIAEEAIAS +KERALKFI+IA RLN  L VN+LL+ACE + S   +S
Sbjct: 1   MDGNKDEALRCIRIAEEAIASRNKERALKFIRIAQRLNHDLSVNDLLTACEKLGSSGSNS 60

Query: 61  ---SSDGKRA--GKVHSVSGSANHVDGLNGERNYSIEHVQLIRQIKTAKDYYKILGVEKT 120
              S D K    G   +        +GLNGE+NY+ EHV+LIRQ+K  KDYY ILGVEKT
Sbjct: 61  NPPSLDEKCVLNGDAKNKPSHGKIDEGLNGEKNYTEEHVELIRQVKINKDYYSILGVEKT 120

Query: 121 SSAEEIKRAYKKLSLKVHPDKNKAPGSDEAFKKLSKAFMCLSDDTSRRQYDHTALVDQYE 180
           SS E+I+RAY+KLSLKVHPDKNKAPGS+EAFKK+ KAF CLSDD SRRQYD T LVD++E
Sbjct: 121 SSVEDIRRAYRKLSLKVHPDKNKAPGSEEAFKKVCKAFKCLSDDNSRRQYDQTGLVDEFE 180

Query: 181 YNQQHNVRR---RRTGRDLFEENFDPDEIFRAFFGQGNMFQTSRAYTYSTGGARSQRRTE 240
           YNQQ+NVRR   RR   D ++++FDP+EIFR+FFGQ +MF+    + Y +G    Q+R E
Sbjct: 181 YNQQYNVRRTRRRRNVHDFYDDDFDPNEIFRSFFGQTDMFRAH--HVYRSGATAGQQRGE 240

Query: 241 SDGGGPNLLIFLLMLPFLLIVLLAYMPFPEPDYSLHKNLSYSIPMATEKHGVEFFVKSSD 300
             GGGP+LL+ L +LPFLLI LLAY+PF EPDYSLHKN SY IP  TEKHG+EFFVKS+ 
Sbjct: 241 FHGGGPSLLLLLQILPFLLIFLLAYLPFSEPDYSLHKNYSYQIPKTTEKHGLEFFVKSAS 300

Query: 301 FDERYPLGSGARSEIESSVIRDYKNMVWRYCHVELQRRKWNKNLPTPHCEKMNNLGFA 351
           FD+ YP+GS AR+ IE +VI+DY+N++WR+CH+ELQRR W+KN+PTPHC+K++NLG A
Sbjct: 301 FDDNYPIGSTARANIEDNVIKDYRNVLWRHCHIELQRRHWSKNMPTPHCDKLHNLGLA 356

BLAST of Cp4.1LG12g02770 vs. TrEMBL
Match: A0A0K9QDC8_SPIOL (Uncharacterized protein OS=Spinacia oleracea GN=SOVF_198660 PE=4 SV=1)

HSP 1 Score: 453.4 bits (1165), Expect = 2.6e-124
Identity = 224/354 (63.28%), Postives = 279/354 (78.81%), Query Frame = 1

Query: 1   MDGNKDEALRCIRIAEEAIASGSKERALKFIKIACRLNRSLEVNELLSACEDIDSKSPSS 60
           MDGNKDEAL+CI IA+EAIASG KERALKFI+IA RLN +L V++L++ACE +DS +   
Sbjct: 1   MDGNKDEALKCIGIAKEAIASGRKERALKFIRIAQRLNSTLSVDDLIAACEKMDSSASGP 60

Query: 61  SSDGKRAGKVHS-VSGSANHVDGLNGERNYSIEHVQLIRQIKTAKDYYKILGVEKTSSAE 120
           S +G    +  S VS  A   +G N ERNY+ EHV+LI+Q+   KDYY +LGVEKT S E
Sbjct: 61  SENGSHVDRSQSGVSSGAKSAEGSNVERNYTEEHVKLIKQVNKNKDYYAVLGVEKTCSVE 120

Query: 121 EIKRAYKKLSLKVHPDKNKAPGSDEAFKKLSKAFMCLSDDTSRRQYDHTALVDQYEYNQQ 180
           EI++AY+KLSLKVHPDKN+APG++EAFKK+ KAF CLS++ SR+QYDHT LVD +EYNQQ
Sbjct: 121 EIRKAYRKLSLKVHPDKNQAPGAEEAFKKVCKAFKCLSEEDSRKQYDHTGLVDDFEYNQQ 180

Query: 181 HNV---RRRRTGRDLFEENFDPDEIFRAFFGQGNMFQTSRAYTYSTGGARSQRRTESDGG 240
           HN    RRRRT  D F++ FDPDEIFRAFFGQ NMF TS  + Y T G  +Q+RTE +GG
Sbjct: 181 HNNVRRRRRRTNNDFFDDEFDPDEIFRAFFGQTNMFHTS--HVYRTRGMGTQQRTEGNGG 240

Query: 241 GPNLLIFLLMLPFLLIVLLAYMPFPEPDYSLHKNLSYSIPMATEKHGVEFFVKSSDFDER 300
           GPNL++FL ++PFLLI+LLAY+PF EP+YSL +N SY  P  TEK GVEF+VKS +FD  
Sbjct: 241 GPNLMVFLQLVPFLLILLLAYLPFSEPEYSLQRNYSYQFPRVTEKFGVEFYVKSQEFDRS 300

Query: 301 YPLGSGARSEIESSVIRDYKNMVWRYCHVELQRRKWNKNLPTPHCEKMNNLGFA 351
           YPLGS AR  IE +VI+DYKN++ RYCH+ELQRR+W++NLPTPHC+K+ N G A
Sbjct: 301 YPLGSAARENIEENVIQDYKNLLGRYCHIELQRRQWSRNLPTPHCDKLQNFGVA 352

BLAST of Cp4.1LG12g02770 vs. TAIR10
Match: AT5G49060.1 (AT5G49060.1 Heat shock protein DnaJ, N-terminal with domain of unknown function (DUF1977))

HSP 1 Score: 347.4 bits (890), Expect = 1.0e-95
Identity = 185/355 (52.11%), Postives = 256/355 (72.11%), Query Frame = 1

Query: 1   MDGNKDEALRCIRIAEEAIASGSKERALKFIKIACRLNRSLEVNELLSACEDIDSKSPSS 60
           MDGNKD+A RC+RIAE+AI SG KERALKFI +A RLN SL V+EL++AC+++DS S +S
Sbjct: 1   MDGNKDDASRCLRIAEDAIVSGDKERALKFINMAKRLNPSLSVDELVAACDNLDSVSRNS 60

Query: 61  SSDGKRAGKVHSVSGSANHVDGLNGERNYSIEHVQLIRQIKTAKDYYKILGVEKTSSAEE 120
           S     + K+ ++ G  + ++   G+  Y+ E+V L+R I    DYY ILG+EK  S +E
Sbjct: 61  SV----SEKLKTMDGDDDKLE--TGKMKYTEENVDLVRNIIRNNDYYAILGLEKNCSVDE 120

Query: 121 IKRAYKKLSLKVHPDKNKAPGSDEAFKKLSKAFMCLSDDTSRRQYDHTALVDQYEYNQQH 180
           I++AY+KLSLKVHPDKNKAPGS+EAFKK+SKAF CLSD  SRRQ+D   +VD++++ Q+ 
Sbjct: 121 IRKAYRKLSLKVHPDKNKAPGSEEAFKKVSKAFTCLSDGNSRRQFDQVGIVDEFDHVQRR 180

Query: 181 NVRRRR---TGRDLFEENFDPDEIFRAFFGQGN-MFQTSRAYTYSTGGARSQ-RRTESDG 240
           N R RR   T  D F++ FDP+EIFR  FGQ   +F+ S AY   T   R+Q R  E + 
Sbjct: 181 NRRPRRRYNTRNDFFDDEFDPEEIFRTVFGQQREVFRASHAYR--TRQPRNQFREEEINV 240

Query: 241 GGPNLLIFLLMLPFLLIVLLAYMPFPEPDYSLHKNLSYSIPMATEKHGVEFFVKS-SDFD 300
            GP+ L  + +LPF L++LLAY+PF EPDYSLHKN SY IP  T+   + F+V+S S FD
Sbjct: 241 AGPSCLTIIQILPFFLLLLLAYLPFSEPDYSLHKNQSYQIPKTTQNTEISFYVRSASAFD 300

Query: 301 ERYPLGSGARSEIESSVIRDYKNMVWRYCHVELQRRKWNKNLPTPHCEKMNNLGF 350
           E++PL S AR+ +E +VI++YK+ +++ C +ELQ+R+WNK +PTPHC ++ + GF
Sbjct: 301 EKFPLSSSARANLEGNVIKEYKHFLFQSCRIELQKRRWNKKIPTPHCIELQDRGF 347

BLAST of Cp4.1LG12g02770 vs. TAIR10
Match: AT3G57340.1 (AT3G57340.1 Heat shock protein DnaJ, N-terminal with domain of unknown function (DUF1977))

HSP 1 Score: 243.0 bits (619), Expect = 2.7e-64
Identity = 141/367 (38.42%), Postives = 214/367 (58.31%), Query Frame = 1

Query: 1   MDGNKDEALRCIRIAEEAIASGSKERALKFIKIACRLNRSLEVNELLSACEDIDSKSPSS 60
           MDGNKD+AL+C++I + A+ +G + RALKF+  A RL+ +L +++L+S   +  S  P S
Sbjct: 1   MDGNKDDALKCLKICKSAMEAGDRPRALKFLAKARRLDPNLPIDDLVSELNNNKSDEPGS 60

Query: 61  S-SDGKRAGKVHSVSGS-------ANHVDGLNGERNYSIEHVQLIRQIKTAKDYYKILGV 120
           + S G  A K  S S          +     +   +Y+ E + ++R+IK+ KDYY+ILG+
Sbjct: 61  AKSPGSAAAKDSSNSSDRPSLRQRGSSTTSSSSSMSYTEEQISIVRKIKSKKDYYEILGL 120

Query: 121 EKTSSAEEIKRAYKKLSLKVHPDKNKAPGSDEAFKKLSKAFMCLSDDTSRRQYDHTALVD 180
           E   S +++++AY+KLSLKVHPDKN+APGS+EAFK +SKAF CLS+D +R++YD +   D
Sbjct: 121 ESNCSVDDVRKAYRKLSLKVHPDKNQAPGSEEAFKSVSKAFQCLSNDEARKKYDVSG-SD 180

Query: 181 QYEYNQQHNVRRRR-TGRDLFEENFDPDEIFRAFFGQGNM--------FQTSRAYTYSTG 240
           +  Y  + + R     G   +E+ FDP+EIFR+FFG G              R++ +   
Sbjct: 181 EPIYQPRRSARSNGFNGGYYYEDEFDPNEIFRSFFGGGGFGGGGMPPATAQFRSFNFGAT 240

Query: 241 GARSQRRTESDGGGPNLLIFLLMLPFLLIVLLAYMPFPEPDYSLHKNLSYSIPMATEKHG 300
             R+    ++   G N  I L +LP + I+LL +MP  +P Y L     Y     T+K G
Sbjct: 241 RQRTANNNQAPDAGFNARILLQLLPVVFILLLNFMPSSQPVYQLSATYPYQYKFTTQK-G 300

Query: 301 VEFFVKSSDFDERYPLGSGARSEIESSVIRDYKNMVWRYCHVELQRRKWNKNLPTPHCEK 351
           V +FVKSS F++ YP  S  R  +E  V RDY +++ + C  E+QR++W     TPHC+ 
Sbjct: 301 VNYFVKSSKFEQDYPRDSNDRHTLEEQVERDYVSILSQNCRYEMQRKQWGFVRETPHCDM 360

BLAST of Cp4.1LG12g02770 vs. TAIR10
Match: AT5G05750.1 (AT5G05750.1 DNAJ heat shock N-terminal domain-containing protein)

HSP 1 Score: 191.4 bits (485), Expect = 9.2e-49
Identity = 113/286 (39.51%), Postives = 178/286 (62.24%), Query Frame = 1

Query: 1   MDGNKDEALRCIRIAEEAIASGSKERALKFIKIACRLNRSLEVNELLSACE-DIDSKSPS 60
           MDGNKD+AL+C++I ++AI +G + RALKF++ A RL+ +L ++ L+S  +   D  +  
Sbjct: 1   MDGNKDDALKCLKIGKDAIEAGDRSRALKFLEKARRLDPNLPIDGLVSDLKKQSDEPAAE 60

Query: 61  SSSDGKRAGKVHSVS--------GSANHVDGLNGERNYSIEHVQLIRQIKTAKDYYKILG 120
             S G  A +    S        GS++   G +   + + E   ++R+IK+ KDYY+ILG
Sbjct: 61  EDSPGSAANESSKPSDRPSLRQRGSSSSAAGSSSSSSSTEEQRTIVREIKSKKDYYEILG 120

Query: 121 VEKTSSAEEIKRAYKKLSLKVHPDKNKAPGSDEAFKKLSKAFMCLSDDTSRRQYDHTALV 180
           ++   S E+++++Y+KLSLKVHPDKNKAPGS+EAFK +SKAF CLS++ +RR+YD +   
Sbjct: 121 LKSNCSVEDLRKSYRKLSLKVHPDKNKAPGSEEAFKSVSKAFQCLSNEDTRRKYDGSG-S 180

Query: 181 DQYEYNQQHNVRRRRTGRDLFEENFDPDEIFRAFFGQGNMFQTS---RAYTYSTGGARSQ 240
           D+  Y  + + RR       +++ FD DEIFR+FFG G M   +   R++ +  GG R+ 
Sbjct: 181 DEPAYQPRRDARRNNGFNGFYDDEFDADEIFRSFFGGGEMNPATTQFRSFNFG-GGTRTA 240

Query: 241 RRTESDGGGPNLLIFLLMLPFLLIVLLAYMPFPEPDYSLHKNLSYS 275
            +    G  P +L  L +LP + I+LL ++P P+P YSL   ++ S
Sbjct: 241 NQASDTGFNPRVL--LQILPVVFILLLNFLPSPQPIYSLSHRITTS 282

BLAST of Cp4.1LG12g02770 vs. TAIR10
Match: AT3G06778.1 (AT3G06778.1 Chaperone DnaJ-domain superfamily protein)

HSP 1 Score: 70.5 bits (171), Expect = 2.4e-12
Identity = 32/74 (43.24%), Postives = 50/74 (67.57%), Query Frame = 1

Query: 93  HVQLIRQIKTAKDYYKILGVEKTSSAEEIKRAYKKLSLKVHPDKNKAPGSDEAFKKLSKA 152
           H+  I       D+Y ILG+++ +  + I++ Y KL+LKVHPDKN  P +D AFK + +A
Sbjct: 30  HINRISSGSCFIDWYLILGIQEDAEVKVIRKRYHKLALKVHPDKNNHPKADIAFKLIHEA 89

Query: 153 FMCLSDDTSRRQYD 167
           ++CLSD+T RR ++
Sbjct: 90  YLCLSDETKRRSFN 103

BLAST of Cp4.1LG12g02770 vs. TAIR10
Match: AT2G22360.1 (AT2G22360.1 DNAJ heat shock family protein)

HSP 1 Score: 68.2 bits (165), Expect = 1.2e-11
Identity = 44/119 (36.97%), Postives = 59/119 (49.58%), Query Frame = 1

Query: 100 IKTAKDYYKILGVEKTSSAEEIKRAYKKLSLKVHPDKNKAPGSDEAFKKLSKAFMCLSDD 159
           ++   DYY +LGV K ++  EIK AY+KL+   HPD NK PG++E FK++S A+  LSDD
Sbjct: 81  VRADADYYSVLGVSKNATKAEIKSAYRKLARNYHPDVNKDPGAEEKFKEISNAYEVLSDD 140

Query: 160 TSRRQYDHTALVDQYEYNQQHNVRRRRTGRDLFEENFDP-DEIFRAFFGQGNMFQTSRA 218
             +  YD         Y +         G   F   FD  D +F  F G       SRA
Sbjct: 141 EKKSLYD--------RYGEAGLKGAAGFGNGDFSNPFDLFDSLFEGFGGGMGRGSRSRA 191

BLAST of Cp4.1LG12g02770 vs. NCBI nr
Match: gi|449460955|ref|XP_004148209.1| (PREDICTED: chaperone protein dnaJ 49 [Cucumis sativus])

HSP 1 Score: 581.3 bits (1497), Expect = 1.2e-162
Identity = 294/352 (83.52%), Postives = 318/352 (90.34%), Query Frame = 1

Query: 1   MDGNKDEALRCIRIAEEAIASGSKERALKFIKIACRLNRSLEVNELLSACEDIDSKSPSS 60
           MDGNKDEALRCIRIAEE+IASG+KERAL+FIKIA RLN+S++V+ELL+ACE+I S     
Sbjct: 1   MDGNKDEALRCIRIAEESIASGNKERALRFIKIARRLNQSVQVDELLAACEEIGS----G 60

Query: 61  SSDGKRAGKVHSVSGSANHVDGLNGERNYSIEHVQLIRQIKTAKDYYKILGVEKTSSAEE 120
           SS+ KRAGK  SVSGS  H DGLNGERNYS+EHVQLIRQIKT KDYY ILGVEKTSSAEE
Sbjct: 61  SSEEKRAGKGESVSGSVKHGDGLNGERNYSMEHVQLIRQIKTTKDYYGILGVEKTSSAEE 120

Query: 121 IKRAYKKLSLKVHPDKNKAPGSDEAFKKLSKAFMCLSDDTSRRQYDHTALVDQYEYNQQH 180
           IKRAY+KLSLKVHPDKNKAPGS+EAFKKLSKAF CLSDDT RRQYDHT LVDQYEYNQQH
Sbjct: 121 IKRAYRKLSLKVHPDKNKAPGSEEAFKKLSKAFSCLSDDTLRRQYDHTPLVDQYEYNQQH 180

Query: 181 NVR--RRRTGRDLFEENFDPDEIFRAFFGQGNMFQTSRAYTYSTGGARSQRRTESDGGGP 240
           NVR  RRR G DLFEENFDPDEIFRAFFGQGNMFQTSRAYTY TGGA SQ+RTES GGGP
Sbjct: 181 NVRQRRRRNGHDLFEENFDPDEIFRAFFGQGNMFQTSRAYTYRTGGAGSQQRTESYGGGP 240

Query: 241 NLLIFLLMLPFLLIVLLAYMPFPEPDYSLHKNLSYSIPMATEKHGVEFFVKSSDFDERYP 300
           N LI LLMLPFLLI LLAYMPFPEP+Y+LHK+LSYSIPMATEKHGVEFFVKSSDFDERYP
Sbjct: 241 NFLIILLMLPFLLICLLAYMPFPEPEYALHKSLSYSIPMATEKHGVEFFVKSSDFDERYP 300

Query: 301 LGSGARSEIESSVIRDYKNMVWRYCHVELQRRKWNKNLPTPHCEKMNNLGFA 351
           LGS  R E+E+SV+RDY+NMVWRYCH+ELQRR+WNKNLPTPHCEK+N L  A
Sbjct: 301 LGSPGRVELENSVLRDYRNMVWRYCHIELQRRQWNKNLPTPHCEKLNTLAVA 348

BLAST of Cp4.1LG12g02770 vs. NCBI nr
Match: gi|659117133|ref|XP_008458441.1| (PREDICTED: chaperone protein dnaJ 49-like [Cucumis melo])

HSP 1 Score: 576.6 bits (1485), Expect = 2.9e-161
Identity = 292/352 (82.95%), Postives = 317/352 (90.06%), Query Frame = 1

Query: 1   MDGNKDEALRCIRIAEEAIASGSKERALKFIKIACRLNRSLEVNELLSACEDIDSKSPSS 60
           MDGNKDEALRCIRIAEE+IASG+KERAL+FIKIA RLN+SL+++ELL+ACE++ S     
Sbjct: 1   MDGNKDEALRCIRIAEESIASGNKERALRFIKIARRLNQSLQIDELLTACEEMGS----G 60

Query: 61  SSDGKRAGKVHSVSGSANHVDGLNGERNYSIEHVQLIRQIKTAKDYYKILGVEKTSSAEE 120
           SS  KRAGK  SVSGS  HVDGLNGERNYS+EHVQLIRQIKT  DYY ILGVEKTSSAEE
Sbjct: 61  SSAEKRAGKGESVSGSVKHVDGLNGERNYSMEHVQLIRQIKTTMDYYGILGVEKTSSAEE 120

Query: 121 IKRAYKKLSLKVHPDKNKAPGSDEAFKKLSKAFMCLSDDTSRRQYDHTALVDQYEYNQQH 180
           IKRAY+KLSLKVHPDKNKAPGS+EAFKKLSKAF CLSDDT RR+YD TALVDQYEYNQQH
Sbjct: 121 IKRAYRKLSLKVHPDKNKAPGSEEAFKKLSKAFSCLSDDTLRRRYDQTALVDQYEYNQQH 180

Query: 181 NVR--RRRTGRDLFEENFDPDEIFRAFFGQGNMFQTSRAYTYSTGGARSQRRTESDGGGP 240
           NVR  RRR G DLFEENFDPDEIFRAFFGQGNMFQTSRAYTY TGGA SQ+RTESDGGGP
Sbjct: 181 NVRHRRRRNGHDLFEENFDPDEIFRAFFGQGNMFQTSRAYTYRTGGAGSQQRTESDGGGP 240

Query: 241 NLLIFLLMLPFLLIVLLAYMPFPEPDYSLHKNLSYSIPMATEKHGVEFFVKSSDFDERYP 300
           + LI LLMLPFLLI LLAYMPFPEP+Y+LHKNLSYSIPMATEKHGVEFFVKSSDF+ RYP
Sbjct: 241 SFLIILLMLPFLLICLLAYMPFPEPEYALHKNLSYSIPMATEKHGVEFFVKSSDFNVRYP 300

Query: 301 LGSGARSEIESSVIRDYKNMVWRYCHVELQRRKWNKNLPTPHCEKMNNLGFA 351
           LGS  R EIE+SV+RDY+NMVWRYCH+ELQRR+WNKNLPTPHCEK+N L  A
Sbjct: 301 LGSPGRVEIENSVLRDYRNMVWRYCHIELQRRQWNKNLPTPHCEKLNTLAVA 348

BLAST of Cp4.1LG12g02770 vs. NCBI nr
Match: gi|645233143|ref|XP_008223201.1| (PREDICTED: chaperone protein dnaJ 49 [Prunus mume])

HSP 1 Score: 467.6 bits (1202), Expect = 1.9e-128
Identity = 239/352 (67.90%), Postives = 282/352 (80.11%), Query Frame = 1

Query: 1   MDGNKDEALRCIRIAEEAIASGSKERALKFIKIACRLNRSLEVNELLSACEDIDSKSPSS 60
           MDGNKDEALRC+RIAEEAIASG+K RALKFIKIA RLN+SL+VNELL+ACE IDS SP+S
Sbjct: 1   MDGNKDEALRCVRIAEEAIASGNKGRALKFIKIAQRLNQSLQVNELLAACEKIDSGSPAS 60

Query: 61  SSDGKRAGKVHSVSGSANHVDGLNGERNYSIEHVQLIRQIKTAKDYYKILGVEKTSSAEE 120
           S   K A  + +  G      GLNGE +Y+ EHVQLIR+IK  KDYY ILGVEKT S E+
Sbjct: 61  SIGEKGATVIKNEPGVGKLGQGLNGEVSYTEEHVQLIRKIKRNKDYYAILGVEKTCSVED 120

Query: 121 IKRAYKKLSLKVHPDKNKAPGSDEAFKKLSKAFMCLSDDTSRRQYDHTALVDQYEYNQQH 180
           I++AY+KLSLKVHPDKNKAPGS+EAFK +SKAF CLSD  SRRQYD T LVD++EYNQQH
Sbjct: 121 IRKAYRKLSLKVHPDKNKAPGSEEAFKIVSKAFKCLSDGDSRRQYDQTGLVDEFEYNQQH 180

Query: 181 NV--RRRRTGRDLFEENFDPDEIFRAFFGQGNMFQTSRAYTYSTGGARSQRRTESDGGGP 240
           NV  RRRR G DLF+++FDPDEIFRAFFGQ +MF+TS  + Y T      +R E  GGGP
Sbjct: 181 NVRRRRRRAGHDLFDDDFDPDEIFRAFFGQSDMFRTS--HVYRTSRTAGHQREEVQGGGP 240

Query: 241 NLLIFLLMLPFLLIVLLAYMPFPEPDYSLHKNLSYSIPMATEKHGVEFFVKSSDFDERYP 300
           N+++ + +LPFL+IVLLAY+PF EP+YSL K  +Y IP  TEKHGVEF+VKS  FDE YP
Sbjct: 241 NIMVLIQLLPFLVIVLLAYLPFSEPNYSLQKTYNYQIPKTTEKHGVEFYVKSETFDENYP 300

Query: 301 LGSGARSEIESSVIRDYKNMVWRYCHVELQRRKWNKNLPTPHCEKMNNLGFA 351
           LGS ARS IE+ VI+DYKN++  YC VELQRR W+KNLPTPHC+K+NNLG A
Sbjct: 301 LGSVARSNIENHVIKDYKNVLLHYCRVELQRRHWSKNLPTPHCDKLNNLGIA 350

BLAST of Cp4.1LG12g02770 vs. NCBI nr
Match: gi|596134454|ref|XP_007222268.1| (hypothetical protein PRUPE_ppa007951mg [Prunus persica])

HSP 1 Score: 464.5 bits (1194), Expect = 1.6e-127
Identity = 237/350 (67.71%), Postives = 282/350 (80.57%), Query Frame = 1

Query: 1   MDGNKDEALRCIRIAEEAIASGSKERALKFIKIACRLNRSLEVNELLSACEDIDSKSPSS 60
           MDGNKDEAL+C+RIAEEAIASG+K RALKFIKIA RLN+SL+VNELL+ACE IDS SP+S
Sbjct: 1   MDGNKDEALKCVRIAEEAIASGNKGRALKFIKIARRLNQSLQVNELLAACEKIDSGSPAS 60

Query: 61  SSDGKRAGKVHSVSGSANHVDGLNGERNYSIEHVQLIRQIKTAKDYYKILGVEKTSSAEE 120
           S   K A ++ +  G      GLNGE +Y+ EHVQLIR+IK  KDYY ILGVEKT S E+
Sbjct: 61  SIGEKGATEIKNEPGVEKLGQGLNGEVSYTEEHVQLIRKIKRNKDYYAILGVEKTCSVED 120

Query: 121 IKRAYKKLSLKVHPDKNKAPGSDEAFKKLSKAFMCLSDDTSRRQYDHTALVDQYEYNQQH 180
           I++AY+KLSLKVHPDKNKAPGS+EAFK +SKAF CLSD  SRRQYD T LVD++EYNQQH
Sbjct: 121 IRKAYRKLSLKVHPDKNKAPGSEEAFKIVSKAFKCLSDGDSRRQYDQTGLVDEFEYNQQH 180

Query: 181 NV--RRRRTGRDLFEENFDPDEIFRAFFGQGNMFQTSRAYTYSTGGARSQRRTESDGGGP 240
           NV  RRRR G DLF+++FDPDEIFRAFFGQ +MF+TS  + Y T      +R E  GGGP
Sbjct: 181 NVRRRRRRAGHDLFDDDFDPDEIFRAFFGQSDMFRTS--HVYRTSRTAGHQREEVQGGGP 240

Query: 241 NLLIFLLMLPFLLIVLLAYMPFPEPDYSLHKNLSYSIPMATEKHGVEFFVKSSDFDERYP 300
           N+++ + +LPFL+IVLLAY+PF EP+YSL K  +Y IP  TEKHGVEF+VKS  FDE YP
Sbjct: 241 NIMVLIQLLPFLVIVLLAYLPFSEPNYSLQKTYNYQIPKTTEKHGVEFYVKSEAFDENYP 300

Query: 301 LGSGARSEIESSVIRDYKNMVWRYCHVELQRRKWNKNLPTPHCEKMNNLG 349
           LGS ARS IE+ VI+DYKN++  YC VELQRR W+KNLPTPHC+K+NNLG
Sbjct: 301 LGSVARSNIENHVIKDYKNVLLHYCRVELQRRHWSKNLPTPHCDKLNNLG 348

BLAST of Cp4.1LG12g02770 vs. NCBI nr
Match: gi|590661215|ref|XP_007035611.1| (Heat shock protein DnaJ [Theobroma cacao])

HSP 1 Score: 461.1 bits (1185), Expect = 1.8e-126
Identity = 237/353 (67.14%), Postives = 278/353 (78.75%), Query Frame = 1

Query: 1   MDGNKDEALRCIRIAEEAIASGSKERALKFIKIACRLNRSLEVNELLSACEDIDS-KSPS 60
           MDGNKDEALRC+ IAEEAIASG+KERALKFIKIA RLN SL V++LL+ACE++DS  SP+
Sbjct: 1   MDGNKDEALRCVHIAEEAIASGNKERALKFIKIAQRLNHSLSVDQLLAACENLDSGSSPA 60

Query: 61  SSSDGKRAGKVHSVSGSANHVDGLNGERNYSIEHVQLIRQIKTAKDYYKILGVEKTSSAE 120
           S    K      +  GS     GLNGER+Y+ EHVQLIRQIK  KDYY ILGVEKT SA+
Sbjct: 61  SPVVEKCVSSNKNRGGSTKLDKGLNGERSYTEEHVQLIRQIKRHKDYYAILGVEKTCSAD 120

Query: 121 EIKRAYKKLSLKVHPDKNKAPGSDEAFKKLSKAFMCLSDDTSRRQYDHTALVDQYEYNQQ 180
           E++RAYKKLSLKVHPDKNKAPGS+EAFKK+ KAF CLS D SRRQYD   LVD++EYNQQ
Sbjct: 121 EVRRAYKKLSLKVHPDKNKAPGSEEAFKKVCKAFKCLSVDDSRRQYDQVGLVDEFEYNQQ 180

Query: 181 HNVR--RRRTGRDLFEENFDPDEIFRAFFGQGNMFQTSRAYTYSTGGARSQRRTESDGGG 240
           HNVR  RRR G DLF++ FDPDEIFRAFFGQG+MF+TS  + Y T G    +R +  GGG
Sbjct: 181 HNVRQRRRRYGNDLFDDEFDPDEIFRAFFGQGDMFRTS--HVYRTRGMGGHQREQRHGGG 240

Query: 241 PNLLIFLLMLPFLLIVLLAYMPFPEPDYSLHKNLSYSIPMATEKHGVEFFVKSSDFDERY 300
           PN L+ L +LPFLLI LLAY+P  EP+YSL +N SY IP  TEK+GVEF+VKSS FD  +
Sbjct: 241 PNFLVLLQILPFLLIFLLAYLPISEPEYSLFRNYSYQIPKTTEKYGVEFYVKSSAFDVNF 300

Query: 301 PLGSGARSEIESSVIRDYKNMVWRYCHVELQRRKWNKNLPTPHCEKMNNLGFA 351
           PLGS AR+  E +VI+DY++M+WRYCHVE Q+R WNKNLPTPHC K+ NLG A
Sbjct: 301 PLGSPARANFEDNVIKDYRHMLWRYCHVERQKRHWNKNLPTPHCNKLQNLGLA 351

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
DNJ49_ARATH1.8e-9452.11Chaperone protein dnaJ 49 OS=Arabidopsis thaliana GN=ATJ49 PE=2 SV=2[more]
DJB14_XENTR4.0e-3834.51DnaJ homolog subfamily B member 14 OS=Xenopus tropicalis GN=dnajb14 PE=2 SV=1[more]
DJB14_XENLA4.5e-3731.75DnaJ homolog subfamily B member 14 OS=Xenopus laevis GN=dnajb14 PE=2 SV=1[more]
DJB12_MOUSE1.7e-3633.33DnaJ homolog subfamily B member 12 OS=Mus musculus GN=Dnajb12 PE=1 SV=2[more]
DJB12_HUMAN2.4e-3533.14DnaJ homolog subfamily B member 12 OS=Homo sapiens GN=DNAJB12 PE=1 SV=4[more]
Match NameE-valueIdentityDescription
A0A0A0KG46_CUCSA8.1e-16383.52Uncharacterized protein OS=Cucumis sativus GN=Csa_6G306340 PE=4 SV=1[more]
M5XBK5_PRUPE1.1e-12767.71Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa007951mg PE=4 SV=1[more]
A0A061EQA7_THECC1.2e-12667.14Heat shock protein DnaJ OS=Theobroma cacao GN=TCM_021223 PE=4 SV=1[more]
B9RT65_RICCO1.5e-12464.25Chaperone protein dnaJ, putative OS=Ricinus communis GN=RCOM_0681300 PE=4 SV=1[more]
A0A0K9QDC8_SPIOL2.6e-12463.28Uncharacterized protein OS=Spinacia oleracea GN=SOVF_198660 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G49060.11.0e-9552.11 Heat shock protein DnaJ, N-terminal with domain of unknown function ... [more]
AT3G57340.12.7e-6438.42 Heat shock protein DnaJ, N-terminal with domain of unknown function ... [more]
AT5G05750.19.2e-4939.51 DNAJ heat shock N-terminal domain-containing protein[more]
AT3G06778.12.4e-1243.24 Chaperone DnaJ-domain superfamily protein[more]
AT2G22360.11.2e-1136.97 DNAJ heat shock family protein[more]
Match NameE-valueIdentityDescription
gi|449460955|ref|XP_004148209.1|1.2e-16283.52PREDICTED: chaperone protein dnaJ 49 [Cucumis sativus][more]
gi|659117133|ref|XP_008458441.1|2.9e-16182.95PREDICTED: chaperone protein dnaJ 49-like [Cucumis melo][more]
gi|645233143|ref|XP_008223201.1|1.9e-12867.90PREDICTED: chaperone protein dnaJ 49 [Prunus mume][more]
gi|596134454|ref|XP_007222268.1|1.6e-12767.71hypothetical protein PRUPE_ppa007951mg [Prunus persica][more]
gi|590661215|ref|XP_007035611.1|1.8e-12667.14Heat shock protein DnaJ [Theobroma cacao][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR015399DUF1977_DnaJ-like
IPR001623DnaJ_domain
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG12g02770.1Cp4.1LG12g02770.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001623DnaJ domainPRINTSPR00625JDOMAINcoord: 141..161
score: 3.2E-15coord: 107..125
score: 3.2E-15coord: 125..140
score: 3.2
IPR001623DnaJ domainGENE3DG3DSA:1.10.287.110coord: 97..209
score: 1.5
IPR001623DnaJ domainPFAMPF00226DnaJcoord: 105..166
score: 7.4
IPR001623DnaJ domainSMARTSM00271dnaj_3coord: 104..161
score: 6.9
IPR001623DnaJ domainPROFILEPS50076DNAJ_2coord: 105..169
score: 19
IPR001623DnaJ domainunknownSSF46565Chaperone J-domaincoord: 101..212
score: 1.57
IPR015399Domain of unknown function DUF1977, DnaJ-likePFAMPF09320DUF1977coord: 261..331
score: 5.
NoneNo IPR availablePANTHERPTHR24078DNAJ HOMOLOG SUBFAMILY C MEMBERcoord: 10..46
score: 9.6E-32coord: 92..230
score: 9.6
NoneNo IPR availablePANTHERPTHR24078:SF193DNAJ HOMOLOG SUBFAMILY C MEMBER 2coord: 92..230
score: 9.6E-32coord: 10..46
score: 9.6

The following gene(s) are paralogous to this gene:

None