Cp4.1LG01g17370 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g17370
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: DnaJ domain-containing protein
Location: Cp4.1LG01 : 12189671 .. 12199155 (-)
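
As a quick check on the location above, the span of the feature can be computed from its start and end positions. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (a common convention for genome-browser locations, but not stated on this page); the function name `feature_span` is illustrative only.

```python
def feature_span(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must be >= start")
    return end - start + 1


# Coordinates taken from the Location field: 12189671 .. 12199155 (-)
span = feature_span(12189671, 12199155)
print(span)  # 9485
```

The strand sign (-) does not change the span; it only indicates which strand the gene is annotated on.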
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
TCCACTAATCCCTCTTCGTTGATTCTATTCTACCATCCATTTCTTCTCTCGATCCTCCACCGTATGCCCCCTGCACTTCATTGCAGGAGGTCTCTCTCTCTCTCTCCCTCTCCCCCCTCCGAATTCAGATCTCTTCATCTCCGTTCCTTCTCTTCCTCGGCGATCCCTTTTCCCTCTCCCAATTTTTACTTTTCTTCTCTCTGTATCGTTGTGTTTTTGTTTCTCAATCACTCTGTATTTGTATTTTAACTCATCGTCATGAACGACTTTGAGGGTCTTCTAGCTACCAATTATGGATTCAAGCCCCAAGGTAAAGCCGCCCCAATGGCTGCTTCAAAGGGTACTTCCAATATCAACAACCCCACCACCTCCCCGAATTTCGATCTTGGATCTCGTGCCTCCTTCCGATCCACTAAGAACTCCAATTCCCTATCTGGGTCTCTCGCCGATGACCATGATTCGCTTAATCGATCGGTGAGTGCTCATGAGAATCGCGAATTCGGTGGTCTCGATGATCTGCTTGGGGAGTCGTCCAGATTCTCCAGAAAATCGGAGAGTCGTGCTGGCGAATCCGATGTTAATTTCGACTCTCTGTTTCATGGGGCTGGTAATTCTGGTCAACCGGTGGCTTCGAACTTACCGGTTTATGATAAACCCGTGTATGATGATGATATCTTTGATGGCATTCCGGGCTTGAAAAACTCGTCTAAAGTTCAGTATGATGATGTGTTTTCATCAATGTCTTCGCCGCCTAAAGCAGATTCAGCATTCGATGATCTTCTTGGGGGATTTAGCAAATCGGGAGGTGTATCGAAGAGTAAGGATAAAGAGATTCCTGCTTTTGATGATTTGATACCTGGGTTTAGAGGCAGCAGCTCCCCTGGTGACAGGTATAACATGGTGATGGTGTGCAATCTTCAAATGGGCTTCTGTAGTTTAGTTAAGCTTCTTAATTGTTTACTGATCAATTTGAGTTTGGATTGCTTTCTTTCATACTTATTGTTTAACCCTTTGATGTTAAATTGTGCAGTTCGGTTGCTTTTTTTGAACTGATGTTTCATTGCAGCAATCATTGTATTCAGCATTTATGTTTGGATTTATAGACATGGCACTTTAGAATGAATTGTTGCTGTTTCCTGGGTTTATGGGGCTGCTTTGTAGCTAATACATGGAGTTCTTGTAAGAATCTGTGCTAGTCAATCCCCTCGAGAACTATCGCTAAATTTTAATAAGAAGCGAAACGTTTCATCGAAAATATGAAAAGAGACTAATGCTGTTGCCTGGTTTTCCATTGGAAAATTATACTCAAAAGCTGGATTCCTAATCCCTGTTATTGAGAGAGTCTCTAAATGGTTAGAGATATGGAAACGATGCTATCGTTTCCGGTGAGGGAGATTTAGCCTTGCGGACTCTTTTTTCGTCAAGTTTGCCTACCTAATTTTTGTTCTTGTTCAACCTCCTTTCCAAGTTCCTAACCTCCTTGAAAGGTAAAATGGGATTCATGGTCTCACATTTACTCTTGTTCATCTATTTAACCAAGAAAAATTTGGTATCTAATTACAATGTCTCAAACAAGGAGGAAATTTGCTCCTTTTCCTTTGCAGCGAACTACGACCAAGAACTAACTGATATGGATTAAGGCTGTTCAAGGTATAAGGCCAAGAAGATTAATAAGACATTGCAAGGAATTCATTTCACTTGTTTTATACCAAGAAGATCTTTAGATTTGAGTTGCATTAAATGCACCCGGGCTTAATTTATGACGAACCAAAATTGAACTCAAGTTCATTAAATGAAGTTGAGTTGCATTAAAGTGAATTCTAGTTGCATGGTAGGAATTTAAGCTGTATTAAATTAACACAAGTGTATCTCAATGTTCGTTTAGTACAAGTCCAAGTCAAGTATCTACCAGTGAAACCACCTTTGCAACTACGTCAAGTGGAGCTTCATTTATACACGAGAGATTAGAAAGGCTTTTAAGGCTGATCGATGGATTTTAA
TCTCGGATTTATTGCTTAGTTTCTTGTAATTTTAGTTTTAAATCCTTCATCTATGTTCTTTAGGTAGGTTACTTCGAAGTTAGCATTGTTATAACATTCATGCAGGTAGATTAGCTAATTCCTTTGTAGATGGGAGACTTGGAAAATGCAATAAAAAGAGCCTTTGTGCTTTCTTTTACCGTAGAATTAAATTTCTTTGTAATTTTATGTAATTTAGCATGATTAGAATTATTCAAGTTTGACTGGATTATTTTTTTCTGTGGAGTGATTCATATCTCGAACACGGAACAATCTTCTTTCCCTCGAGATATTTGATCTTCAATGTAATTCTGTATTTACATTTCTTGGGTTCTTTTACTGGTTCGATTTTTAAATTTGTTTTAGGATTTTTGAAGCTGACTTCTATGGATCTTGTTCTTCAATAAGTTGTTTAGCCCCAATCATCAAACTCGTTGAATGCTTTGCAATGCCTTATAAATCTTGGCTTTAGTCCTTGAATAGCCTTAGGATTCATATCACTATTGACCCTCCCTATTAGATGATCACTCAATATTGGGAAAAGAATTTAGGCTTGTGTTTCTATTCGGGGCCTTTTCCATTTAAAGCTGAGAACTTTTGGCTTTCCCTTCACAACTTCTCAATTTAGAAGAGAAGATGGCATTATAGTCAAGTTTTAAGTCGAAAAGCTCAAAATTTTGAAGCAAGATTTGAAAGGACGGAATATGGAAATTTATTGACATCTTACTGTAAAAAAGCTATAAATTACTGAGTCAATTAATCTTTAATACAAGCTTTTCAATCTTGTAAAGTTAACTAAGAGGTGAAGTTTGTCTTTTTGAGGCAAATGGATCTGTTCTCTCTATTTTTATCTGTTATTGTGATTGATTTCTTTAATAGATTACTGAGGTGTAAGGATTTCTTCTCTTCCAGATTTCTCCGTGAAGTGTGTGCGTTTTTGAAACTTTGTGTAGTGCCCTAGGCCAATCGTTCAAATTTTGAATTTTGCACTTGACGGTCCTAACATTCTTCTGCATCTCGGGCTTAGATGTGTCATATACCTTACACGAGTTACTTATGACAATGAAGACTATCCCTATAGACCAACGTAGGTCTTTTAGCACACTCTACCCTCATTGACGTGATCGTTAGAATTTTTTTTCATGAAGTCATCCAATATAGGATTGTTTTGAGCTAAGAATGTTCAAAGTGAACATTTTGAGTAGGAGAGTTGTGGAAGATCTTGATGATATTCTTTAGGGTTATCACTTTGCTTATGTAGTTTGAAACTTATTTTTAGGAGTTTGTGTGGGATGGACTAGACATAGAGATTGTATAGAAATGTTGAAGGAGTTCCTACCCCATCCTCGAATGGAAGACTTTTGTGCCAAGCTGGGGTTTGTCATTATAGGATGCTTTGGGGAGAAAAAAGGATCTACCTTCTTCATATAGATAGTAACTATCAACGTTTTAAGCATTTCTTAACCATGTTTGCATAGCCATAGGATCCCCTAATTTGGACGTGATGCCAACTCATTTATGTACCTCTCTAGTTGCGTGTCACATGTCCACCCACTTTTTCCTTGATTGATCTCGACTTCATCATACAAGGAGAGATTCCTTTCAAATGCCACCGGTAACACATTAAGGGCGATGCAGGCATGGTTAAAGAAAAACTTAAAAGAATTGTTAGTTATACCATAAAAAGATGCACCTTCATGTTCAGTGACCTAAAACGTTTGATATTTTTGGTTTGTAGCAATTTTAACTCTTAGGAAAAATTTCTAAAAAACATGTTAGTAAGGAAAATCATGTTGAAAGGACCCATGTTGGTACGTAGGACTGTCTTCATTCTCACAAGTAGCTTAGGACAAGTAGATGGTGCATCGAGGTCGAGGAGGATCTTGGAAAATGAAGGCCATCTGAATCTTGGGGCAATCCTATTTGTTCCTAGGATCTTTGAAAAGGAGGCTTTGTGTGCAGTTTTCTTTTGAGAGGACAAC
CGTTTGACTTGAAAATTGTAGTGGCTCACTAATGGCACAGTGTTTCTTGGTGTGCTCAAAAGGAGTCGGTTCGAAATTCATGAAACTCATTTTTAAGCGTATAAATCCATGAACTTATTGACTTTGAATTTCCTTCTTGTATAACAATAATAAAATGATATATATTTTAATAAATGCGTCCTTGTCGTGTTAGTGTCCTAGATTTTAATAAACGCGTCCTTATCTTTTAAAAATCTTCCTGTAATATTACTTTCAGGTCGAACTCAGGAGGCAGCTGGTCATCAGAGCCAAATTCAGTTAAAAGTAAAACGTCCTCCATGCCAATGGAAAACCCGTTTGGCGTGTCAAGGGAGCGTGATGATCCACACCATGAAGAAGCTAGTGACGTCGGTAATTTTAAGAGCCCAAAGTTTGATGGTTACCCTTCCTCTGGTGCAAACAATAAAGGTTTTGATGATATGGACCCATTTGCTAGCCTTGGTAAATCTGTCCCTGCATTTTCATCAGAAGGGAATAATAGAGGTAAGGCTAGGAGTCCTCCTAAGGTAGATGGAAGTTCTGCTGGGCCTCAGAATTCTAAGAGTAAAGATACTATGGAGAAACTATCTGGAAAAACTTCTGGACAACCTTTGAAGAAGGAGGTGTCGGCTAAGAATGACAGACAATTTGATGAGCCGGTGTTCGACATTCCTGCAGTTCCAACTAGTTCTCATAAATTTGTTCCTCAATCAACATCTCCTCCAGCTCCGAAAGATGAAAACGTGATGGGCGATACGTCTAGATTTGAAGATAGTGTTGAATCAGATGAAATATGGCTCACTGTATCAGAGATTCCTCTTTTTACACAACCCACTGTTGCTCCGCCCCCTTCAAGACCGCCACCTCCTATACCGCAGCAAGTCCCAAAAGAAGGAACGAGTCCGTATGGGTCTCGAAGTTCAAAGATGAATGCTAATGACTTTTCTTCAGTTCCCAATTCTACACATCATTATCAGATTCCTAAATCTGCTTCTGCTTCTGCTTCTGCTTCAACGAGAGATCAAGTATCTTCGGTGGATGAACTCGAGGAATTTGCCATGGGAAGGAATCAGAGTAATGCTGAAGATCTAGTAAATGGTCTTTCTAATGAAGAAGCAGAGATGAACTCTGCTGCTGCAGCCATGAAGGAGGCAATGGATAGAGCAGAGGCTAAGTTTAAGCATGCTAAGGAAGTGAGGGAAAGAGAGAGTACAAGAACTTCTAAAAGTAAAGAATCTGTATATTGGGATAGAGATGAGAAAGCCATGCGAAATAATAGAGTTGAAGATGAGGAGAGCATGGATCATGAACGGTTCCAACGAGAAAGGGAAAGGGAAGAGAAGGAGAAACGGAGAGCTGAACGAGAGAAAGAAAGAGCGAGGGAGCTTGAGAGGGAAAGAGAGGAGAAAGAGAAAGAACAAAGGAGGTTAGAGAAGGAAAGGGAAAGAGCTAGGGAGATTGAGATGGAAAGGATAAAAGCCAGACAGGCTGTTGAAAGGGCCACAAGGGAAGCACGTGAGAGGGCAGCTACTGAAGCTCGCTTAAAAGCTGAAAGGGCTGCTGTTGAAAAGGTAAACGCAGAGGCTCGAGGACGTGCTGAAAGGGTTGCAGTTCAAAGAGTTCAAGCTGAGGCCCGTGAAAGGGCTGCAGGAGAGGCCCGAGAAAGAGCGGAAAGGGCTGCAGCAGAAGCACGGGAAAGAGTAGAAAAGGCAGCTGCAGAAGCAAAAGAGAAGGAAGCACGAGAGAGAGCTTCTGTTGCAAGGGCAGAAGCTGAAGCAAGGAGCAGGGCAGAGCGAGTTGCCGTGGAAAGAGTTGCTGCAGAGGCTCGAGAAAGGGCTGCTGCAGATGCTCGAGAGAGGGCTGCAGCAGCAGCCAGGGCAAGCCAACAGAAGAACGAAAACGACCTTGAGTCATTTTTCAGCATGGGTAGAGCCAGTAGTGCACCTAGGCCCAGGGCCAACCCTATGGTGAGGACTTAAGTT
CGTAAATCTTGTAACCCGGTCCCCTTTGCTTTCTTTTCAATCTCTGGTTTCTCATCAAGGACTGTTTTCAGGACAACTTCTTTGATTCCCAATCACCAAGTAGACCAGAGCCGCAGACAACTAAACAGCCTTCTACCACTTCATCAAGTATGAGGAAAGCATCTTCAACCACTAATATTGTTGATGATCTTTCATCTATCTTTGGAGGTACTTTTACATCTTAATACACTTCAATAGATGTTGGACTTCGGTCTCGAATGTTGACTTTAACTATTTATTTTGGAAAAGGTCCTCCACCTTCTGGAGAATTCCAGGAAGTCGATGGGGAGAGCGAGGAAAGACGAAGAGCCAGATTGGAACGCCACCAGAGGGTACAAACGCGTGCGGTAAGCTTCTATGTTCTGTTTAAATATGCATTACAAAACATATTGATGCTGAGAATAGAGATACAATGTGCACTGCTTTTATGAGGCTTTTGAACTTTGTGTGCATTGTTTCAGGCAAAAGCTCTGGCTGAGAAAAATGAGCGAGATCTTCAAATGCAAAGGGAACAAGCCGAGAGACATGTGAGTGCATTTTCTTGACACATGTTGCCTTTCTTTCTATTACACCTAGGTCCACAGCTAGCAGATATTGTCCTCTTTGGACTTTCTCTTTCGGGCTTCACCTCAAGTTTTTTAAAACGCGTTGGCTAGGGAAAGGTTTCCACACCCTTATAAAAGAGTGTTTCGTTCCCCTCCCCAACCAATTTGGGATCTCATAATCTACCCCCCTTCAGGGCCCAGCGTCCTCGCTGGCACTCGTTCCTTTCTCCAATCGATGTGCGACCTTTACCAAATCCACCCCCTTTGGGTCCCAGTTCCTTATTGGCACACCTCCTCGTGTCTACCCTCCTTCGAGGAACAGCCTCCTCGCTGGCACATCGCTCGGTGTCTGGCTCTGATACCATTTGCAACAGCCCAGGCCCACCATTAGCAGATATTGTCCTCTTTGGGCTTTCCCTTTCGGGCTTCCCTTCAAGGTTTTTAAAACGCTTCTGCTAGGGAAAGGTTTCCACACCCTTATAAAGGGTGTTTCGTTCTCCTCCCCAACCGATGTGGGATCTCACACTATTCCATTTCTAAGTTCATCCATTTCTCTTCTGTGCGTGCTGGAACTATGATTTTATTGCTTTGAAGCTATGTTCCTTGAGTGGATTGGAGAAAACCTGGTTTTTCATATTGTCGTTATTTGTTTCTTCCTGTTTTCATTTATTTTTCTTAAATTTTCTGACTGAACGTAGATTTGTCAATGAAATAATGAAGTACTCAATGTTGTCTGATAGGAAAAATGGTGCTCAGTTGTATTTCTGTAGATTTTCCCTCATAGGCAAAGCTATCTTGTATCTATGTATTGTTCTTTTCTTTTGTAATTTATTTGAGGGTTATAATAGAGAATTTCACCCTCATTTAGGGAGGGGGTTTTAGCAAATATTGTCATGGTATCAGAGCCCTTTCGGTTTAGCAACCCAGATCTTTTATTTTCATGGCCTTTGAATCCTCTACTCATCTTCTCCACCACCACTGGCGGCTTCGACGTCGCAGGGGCAGTCTCCATGGCTGCCGCAAGTGGAGCCGATTGATTATTCTTCGTCTCCATCTTTCGCCTTTCTCTGTACATACAACCTCACCTTCTCCAACACAATCGGTTTTTCCGTGCAGAGAATTGCTGAAACACTGGATGCTGAGATCAAGCGTTGGGCTGCAGGGAAGGAGGGCAATTTACGTGCTCTTTTATCAACTTTGCAATACGTAAGTGTTGTGAATTTAGTAGCTAAACAATTTTGCGAGCAAGTGTTGTATGTTGTGATGGGCTGAGTAGTTTGTTTCACTGTTAAATGATACAATTAGGTGCTCTGGCCTGAGTGTGGGTGGCAGCCAGTTTCTTTAACTGAGATGGTTATTCCAAATTCAGTCAAGAAAGTATACAGGAAAGCCACGCTATGCATTCACCCTGATAAG
GTGCAGCAAAAAGGTGCCACTCTTCAGCAAAAGTACGTTGCTGAGAAGGTGTTTGACATTCTAAAGGTAAGAACATTTCCAGTTTGTTATCGATGTTATACGAGCACTTGCGCTTGTTAACCGTTTCTATCTCGTAACAACAGGAAGCATGGAACAAATTTAACTCGGAGGAACTTTTCTAAGTTCAATGGATGATTTTTCGAAGACGAAACGGGAGAATGAATGTTTGGTTCAGATGGCCAACGCAACCGATAAAGGCGTGGCTTCGCAATTTGGTGGACGAGACGACGTCCAACTGGTAATATGATTTCATTGTTTCTCATTCTTTTTCTGCTCATCCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNATAATTGATGATCTTCCTGAGTCTATAAAAGAAAGAAAGATCAATGAATGAATGGTAAATGATGGGGAGAGTTAGATATAAAGTAGAAATACGATGAAGTTGTGTGGTTTTTGCTTATCTATATTCTTGGTTGTGATGTGTACAAACTGGGTGAATATATGTAGAAGTTGTATTTATAAGTCACACATTAACTTCTTTGCCATTATATTATTAGTAACATGAAATCTTCAATGCATTTGTTCAATGTTTCGAGTATATGGAAGCAAAAGTACATTAGCGTTCAACGATCCCAGCTCAACGTTGTTCTCGCTCATCAAATTTCTTTTCCACTGTGAATGCGATTAACTTGGAATTGATAGGCTACAGTTTCTATCGATTTTTGTTGAGATTTTTTATCTTAAAAACTCTATCAGTGAATGAATTTTTCAAATATCTTGAGTCAAACTCATTGTTCAGTGTTATTATATTATTGGATTTTTCAAGATCTTTTTTTTAGCGAGAGTTTCAACCTGAATTTAGTGGCATTACGTTTCTTCCCTTTTTTCTATTCTTTTTTGAGGTACAATCTTGATTTTCTTGGATAGTCTTCGGTTGTCACTATCCTGGAATTCATTGGAGACAAAGATTATTTGCTACCTATTGTTTGGTGTAAATACTTAATTTATACCACGATAAATTTTTAAAATTTAAACATAAATTTTATGTAGTCGTTATTGTAAATATCTTTTGTCCCTATCAATTGTATCGAATATCCATTTAATATCGGAAGTTTAAAAAATGGTTCTAAATAATTTGAATATTAACTTGATTATTTGCTAACTAAAGATCAGGTACTCGTGGTTGTAATGGATGGATGATGATTTTGGTTTAGGTGGAGAATGAGGGAGAAAGTGAAGAGGAAAAAATCTGTAAGCAAATTGGAGTTCATTCAGGGAATGCATTTTCGGACCTCACCTCATTTGATTGCTTGTTCCCTTTATTTATTTATATTTATAACTTGTAACACCTCCCCATTCAAAAGCTTAAGTCACTGTTCATTACTATTGACTCGGAGTACGTTTTGTATAACTCTTACATATATTTAAAAAAGAGTTAATTA

mRNA sequence

TCCACTAATCCCTCTTCGTTGATTCTATTCTACCATCCATTTCTTCTCTCGATCCTCCACCGTATGCCCCCTGCACTTCATTGCAGGAGGTCTCTCTCTCTCTCTCCCTCTCCCCCCTCCGAATTCAGATCTCTTCATCTCCGTTCCTTCTCTTCCTCGGCGATCCCTTTTCCCTCTCCCAATTTTTACTTTTCTTCTCTCTGTATCGTTGTGTTTTTGTTTCTCAATCACTCTGTATTTGTATTTTAACTCATCGTCATGAACGACTTTGAGGGTCTTCTAGCTACCAATTATGGATTCAAGCCCCAAGGTAAAGCCGCCCCAATGGCTGCTTCAAAGGGTACTTCCAATATCAACAACCCCACCACCTCCCCGAATTTCGATCTTGGATCTCGTGCCTCCTTCCGATCCACTAAGAACTCCAATTCCCTATCTGGGTCTCTCGCCGATGACCATGATTCGCTTAATCGATCGGTGAGTGCTCATGAGAATCGCGAATTCGGTGGTCTCGATGATCTGCTTGGGGAGTCGTCCAGATTCTCCAGAAAATCGGAGAGTCGTGCTGGCGAATCCGATGTTAATTTCGACTCTCTGTTTCATGGGGCTGGTAATTCTGGTCAACCGGTGGCTTCGAACTTACCGGTTTATGATAAACCCGTGTATGATGATGATATCTTTGATGGCATTCCGGGCTTGAAAAACTCGTCTAAAGTTCAGTATGATGATGTGTTTTCATCAATGTCTTCGCCGCCTAAAGCAGATTCAGCATTCGATGATCTTCTTGGGGGATTTAGCAAATCGGGAGGTGTATCGAAGAGTAAGGATAAAGAGATTCCTGCTTTTGATGATTTGATACCTGGGTTTAGAGGCAGCAGCTCCCCTGGTGACAGTAAAACGTCCTCCATGCCAATGGAAAACCCGTTTGGCGTGTCAAGGGAGCGTGATGATCCACACCATGAAGAAGCTAGTGACGTCGGTAATTTTAAGAGCCCAAAGTTTGATGGTTACCCTTCCTCTGGTGCAAACAATAAAGGTTTTGATGATATGGACCCATTTGCTAGCCTTGGTAAATCTGTCCCTGCATTTTCATCAGAAGGGAATAATAGAGGTAAGGCTAGGAGTCCTCCTAAGGTAGATGGAAGTTCTGCTGGGCCTCAGAATTCTAAGAGTAAAGATACTATGGAGAAACTATCTGGAAAAACTTCTGGACAACCTTTGAAGAAGGAGGTGTCGGCTAAGAATGACAGACAATTTGATGAGCCGGTGTTCGACATTCCTGCAGTTCCAACTAGTTCTCATAAATTTGTTCCTCAATCAACATCTCCTCCAGCTCCGAAAGATGAAAACGTGATGGGCGATACGTCTAGATTTGAAGATAGTGTTGAATCAGATGAAATATGGCTCACTGTATCAGAGATTCCTCTTTTTACACAACCCACTGTTGCTCCGCCCCCTTCAAGACCGCCACCTCCTATACCGCAGCAAGTCCCAAAAGAAGGAACGAGTCCGTATGGGTCTCGAAGTTCAAAGATGAATGCTAATGACTTTTCTTCAGTTCCCAATTCTACACATCATTATCAGATTCCTAAATCTGCTTCTGCTTCTGCTTCTGCTTCAACGAGAGATCAAGTATCTTCGGTGGATGAACTCGAGGAATTTGCCATGGGAAGGAATCAGAGTAATGCTGAAGATCTAGTAAATGGTCTTTCTAATGAAGAAGCAGAGATGAACTCTGCTGCTGCAGCCATGAAGGAGGCAATGGATAGAGCAGAGGCTAAGTTTAAGCATGCTAAGGAAGTGAGGGAAAGAGAGAGTACAAGAACTTCTAAAAGTAAAGAATCTGTATATTGGGATAGAGATGAGAAAGCCATGCGAAATAATAGAGTTGAAGATGAGGAGAGCATGGATCATGAACGGTTCCAACGAGAAAGGGAAAGGGAAGAGAAGGAGAAACGGAGAGCTGAACGAGAGAAAGAAAGAGCGAGGGAGCTTGAGAGGGA
AAGAGAGGAGAAAGAGAAAGAACAAAGGAGGTTAGAGAAGGAAAGGGAAAGAGCTAGGGAGATTGAGATGGAAAGGATAAAAGCCAGACAGGCTGTTGAAAGGGCCACAAGGGAAGCACGTGAGAGGGCAGCTACTGAAGCTCGCTTAAAAGCTGAAAGGGCTGCTGTTGAAAAGGTAAACGCAGAGGCTCGAGGACGTGCTGAAAGGGTTGCAGTTCAAAGAGTTCAAGCTGAGGCCCGTGAAAGGGCTGCAGGAGAGGCCCGAGAAAGAGCGGAAAGGGCTGCAGCAGAAGCACGGGAAAGAGTAGAAAAGGCAGCTGCAGAAGCAAAAGAGAAGGAAGCACGAGAGAGAGCTTCTGTTGCAAGGGCAGAAGCTGAAGCAAGGAGCAGGGCAGAGCGAGTTGCCGTGGAAAGAGTTGCTGCAGAGGCTCGAGAAAGGGCTGCTGCAGATGCTCGAGAGAGGGCTGCAGCAGCAGCCAGGGCAAGCCAACAGAAGAACGAAAACGACCTTGAGTCATTTTTCAGCATGGGTAGAGCCAGTAGTGCACCTAGGCCCAGGGCCAACCCTATGGACAACTTCTTTGATTCCCAATCACCAAGTAGACCAGAGCCGCAGACAACTAAACAGCCTTCTACCACTTCATCAAGTATGAGGAAAGCATCTTCAACCACTAATATTGTTGATGATCTTTCATCTATCTTTGGAGGTCCTCCACCTTCTGGAGAATTCCAGGAAGTCGATGGGGAGAGCGAGGAAAGACGAAGAGCCAGATTGGAACGCCACCAGAGGGTACAAACGCGTGCGGTGCTCTGGCCTGAGTGTGGGTGGCAGCCAGTTTCTTTAACTGAGATGGTTATTCCAAATTCAGTCAAGAAAGTATACAGGAAAGCCACGCTATGCATTCACCCTGATAAGGTGCAGCAAAAAGGTGCCACTCTTCAGCAAAAGTACGTTGCTGAGAAGGTGTTTGACATTCTAAAGACGAAACGGGAGAATGAATGTTTGGTTCAGATGGCCAACGCAACCGATAAAGGCGTGGCTTCGCAATTTGGTGGACGAGACGACGTCCAACTGGTGGAGAATGAGGGAGAAAGTGAAGAGGAAAAAATCTGTAAGCAAATTGGAGTTCATTCAGGGAATGCATTTTCGGACCTCACCTCATTTGATTGCTTGTTCCCTTTATTTATTTATATTTATAACTTGTAACACCTCCCCATTCAAAAGCTTAAGTCACTGTTCATTACTATTGACTCGGAGTACGTTTTGTATAACTCTTACATATATTTAAAAAAGAGTTAATTA

Coding sequence (CDS)

ATGAACGACTTTGAGGGTCTTCTAGCTACCAATTATGGATTCAAGCCCCAAGGTAAAGCCGCCCCAATGGCTGCTTCAAAGGGTACTTCCAATATCAACAACCCCACCACCTCCCCGAATTTCGATCTTGGATCTCGTGCCTCCTTCCGATCCACTAAGAACTCCAATTCCCTATCTGGGTCTCTCGCCGATGACCATGATTCGCTTAATCGATCGGTGAGTGCTCATGAGAATCGCGAATTCGGTGGTCTCGATGATCTGCTTGGGGAGTCGTCCAGATTCTCCAGAAAATCGGAGAGTCGTGCTGGCGAATCCGATGTTAATTTCGACTCTCTGTTTCATGGGGCTGGTAATTCTGGTCAACCGGTGGCTTCGAACTTACCGGTTTATGATAAACCCGTGTATGATGATGATATCTTTGATGGCATTCCGGGCTTGAAAAACTCGTCTAAAGTTCAGTATGATGATGTGTTTTCATCAATGTCTTCGCCGCCTAAAGCAGATTCAGCATTCGATGATCTTCTTGGGGGATTTAGCAAATCGGGAGGTGTATCGAAGAGTAAGGATAAAGAGATTCCTGCTTTTGATGATTTGATACCTGGGTTTAGAGGCAGCAGCTCCCCTGGTGACAGTAAAACGTCCTCCATGCCAATGGAAAACCCGTTTGGCGTGTCAAGGGAGCGTGATGATCCACACCATGAAGAAGCTAGTGACGTCGGTAATTTTAAGAGCCCAAAGTTTGATGGTTACCCTTCCTCTGGTGCAAACAATAAAGGTTTTGATGATATGGACCCATTTGCTAGCCTTGGTAAATCTGTCCCTGCATTTTCATCAGAAGGGAATAATAGAGGTAAGGCTAGGAGTCCTCCTAAGGTAGATGGAAGTTCTGCTGGGCCTCAGAATTCTAAGAGTAAAGATACTATGGAGAAACTATCTGGAAAAACTTCTGGACAACCTTTGAAGAAGGAGGTGTCGGCTAAGAATGACAGACAATTTGATGAGCCGGTGTTCGACATTCCTGCAGTTCCAACTAGTTCTCATAAATTTGTTCCTCAATCAACATCTCCTCCAGCTCCGAAAGATGAAAACGTGATGGGCGATACGTCTAGATTTGAAGATAGTGTTGAATCAGATGAAATATGGCTCACTGTATCAGAGATTCCTCTTTTTACACAACCCACTGTTGCTCCGCCCCCTTCAAGACCGCCACCTCCTATACCGCAGCAAGTCCCAAAAGAAGGAACGAGTCCGTATGGGTCTCGAAGTTCAAAGATGAATGCTAATGACTTTTCTTCAGTTCCCAATTCTACACATCATTATCAGATTCCTAAATCTGCTTCTGCTTCTGCTTCTGCTTCAACGAGAGATCAAGTATCTTCGGTGGATGAACTCGAGGAATTTGCCATGGGAAGGAATCAGAGTAATGCTGAAGATCTAGTAAATGGTCTTTCTAATGAAGAAGCAGAGATGAACTCTGCTGCTGCAGCCATGAAGGAGGCAATGGATAGAGCAGAGGCTAAGTTTAAGCATGCTAAGGAAGTGAGGGAAAGAGAGAGTACAAGAACTTCTAAAAGTAAAGAATCTGTATATTGGGATAGAGATGAGAAAGCCATGCGAAATAATAGAGTTGAAGATGAGGAGAGCATGGATCATGAACGGTTCCAACGAGAAAGGGAAAGGGAAGAGAAGGAGAAACGGAGAGCTGAACGAGAGAAAGAAAGAGCGAGGGAGCTTGAGAGGGAAAGAGAGGAGAAAGAGAAAGAACAAAGGAGGTTAGAGAAGGAAAGGGAAAGAGCTAGGGAGATTGAGATGGAAAGGATAAAAGCCAGACAGGCTGTTGAAAGGGCCACAAGGGAAGCACGTGAGAGGGCAGCTACTGAAGCTCGCTTAAAAGCTGAAAGGGCTGCTGTTGAAAAGGTAAACGCAGAGGCTCGAGGACGTGCTGAAAGGGTTGCAGTTCAAAGAGTTCAAGCTGAGGCCCGTGAAAGGGCTGCAGGAGA
GGCCCGAGAAAGAGCGGAAAGGGCTGCAGCAGAAGCACGGGAAAGAGTAGAAAAGGCAGCTGCAGAAGCAAAAGAGAAGGAAGCACGAGAGAGAGCTTCTGTTGCAAGGGCAGAAGCTGAAGCAAGGAGCAGGGCAGAGCGAGTTGCCGTGGAAAGAGTTGCTGCAGAGGCTCGAGAAAGGGCTGCTGCAGATGCTCGAGAGAGGGCTGCAGCAGCAGCCAGGGCAAGCCAACAGAAGAACGAAAACGACCTTGAGTCATTTTTCAGCATGGGTAGAGCCAGTAGTGCACCTAGGCCCAGGGCCAACCCTATGGACAACTTCTTTGATTCCCAATCACCAAGTAGACCAGAGCCGCAGACAACTAAACAGCCTTCTACCACTTCATCAAGTATGAGGAAAGCATCTTCAACCACTAATATTGTTGATGATCTTTCATCTATCTTTGGAGGTCCTCCACCTTCTGGAGAATTCCAGGAAGTCGATGGGGAGAGCGAGGAAAGACGAAGAGCCAGATTGGAACGCCACCAGAGGGTACAAACGCGTGCGGTGCTCTGGCCTGAGTGTGGGTGGCAGCCAGTTTCTTTAACTGAGATGGTTATTCCAAATTCAGTCAAGAAAGTATACAGGAAAGCCACGCTATGCATTCACCCTGATAAGGTGCAGCAAAAAGGTGCCACTCTTCAGCAAAAGTACGTTGCTGAGAAGGTGTTTGACATTCTAAAGACGAAACGGGAGAATGAATGTTTGGTTCAGATGGCCAACGCAACCGATAAAGGCGTGGCTTCGCAATTTGGTGGACGAGACGACGTCCAACTGGTGGAGAATGAGGGAGAAAGTGAAGAGGAAAAAATCTGTAAGCAAATTGGAGTTCATTCAGGGAATGCATTTTCGGACCTCACCTCATTTGATTGCTTGTTCCCTTTATTTATTTATATTTATAACTTGTAA

Protein sequence

MNDFEGLLATNYGFKPQGKAAPMAASKGTSNINNPTTSPNFDLGSRASFRSTKNSNSLSGSLADDHDSLNRSVSAHENREFGGLDDLLGESSRFSRKSESRAGESDVNFDSLFHGAGNSGQPVASNLPVYDKPVYDDDIFDGIPGLKNSSKVQYDDVFSSMSSPPKADSAFDDLLGGFSKSGGVSKSKDKEIPAFDDLIPGFRGSSSPGDSKTSSMPMENPFGVSRERDDPHHEEASDVGNFKSPKFDGYPSSGANNKGFDDMDPFASLGKSVPAFSSEGNNRGKARSPPKVDGSSAGPQNSKSKDTMEKLSGKTSGQPLKKEVSAKNDRQFDEPVFDIPAVPTSSHKFVPQSTSPPAPKDENVMGDTSRFEDSVESDEIWLTVSEIPLFTQPTVAPPPSRPPPPIPQQVPKEGTSPYGSRSSKMNANDFSSVPNSTHHYQIPKSASASASASTRDQVSSVDELEEFAMGRNQSNAEDLVNGLSNEEAEMNSAAAAMKEAMDRAEAKFKHAKEVRERESTRTSKSKESVYWDRDEKAMRNNRVEDEESMDHERFQREREREEKEKRRAEREKERARELEREREEKEKEQRRLEKERERAREIEMERIKARQAVERATREARERAATEARLKAERAAVEKVNAEARGRAERVAVQRVQAEARERAAGEARERAERAAAEARERVEKAAAEAKEKEARERASVARAEAEARSRAERVAVERVAAEARERAAADARERAAAAARASQQKNENDLESFFSMGRASSAPRPRANPMDNFFDSQSPSRPEPQTTKQPSTTSSSMRKASSTTNIVDDLSSIFGGPPPSGEFQEVDGESEERRRARLERHQRVQTRAVLWPECGWQPVSLTEMVIPNSVKKVYRKATLCIHPDKVQQKGATLQQKYVAEKVFDILKTKRENECLVQMANATDKGVASQFGGRDDVQLVENEGESEEEKICKQIGVHSGNAFSDLTSFDCLFPLFIYIYNL
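
The relationship between the coding sequence and the protein sequence above can be illustrated with a small translation sketch. It assumes the standard genetic code (the page does not state which table was used), and the names `CODON_TABLE` and `translate` are illustrative. Only the first 30 bases of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering
# (assumed here; the page does not say which code table was applied).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS listed above
cds_start = "ATGAACGACTTTGAGGGTCTTCTAGCTACC"
print(translate(cds_start))  # MNDFEGLLAT
```

The output matches the first ten residues of the protein sequence shown above.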
BLAST of Cp4.1LG01g17370 vs. Swiss-Prot
Match: AUXI2_ARATH (Auxilin-related protein 2 OS=Arabidopsis thaliana GN=At4g12770 PE=1 SV=1)

HSP 1 Score: 379.0 bits (972), Expect = 1.5e-103
Identity = 385/902 (42.68%), Positives = 497/902 (55.10%), Query Frame = 1

Query: 1   MNDFEGLLATNYGFKPQGKAAPMAASKGTSNINNPTTSPNFDLGSRASFRSTKNSNSLSG 60
           M+DF GLLA ++G KPQGK+APMA+   +S  +  T + ++   S A  +S  +S  +  
Sbjct: 1   MDDFTGLLARDFGLKPQGKSAPMASQSNSSAADFNTFASSYSFASAAGKKS--DSLPVFD 60

Query: 61  SLADDHDSLN-RSVSAHENREFGGLDDLLGESSRFSRKSESRAGESDVNFDSLFHGAGNS 120
            L  D D L  R V +    ++G   D        SR   + A + DV F          
Sbjct: 61  DLGRDGDDLLFRDVFSGPPPKYGSSGD--------SRSPSAPAFDYDVMFKE-------P 120

Query: 121 GQPVASNLPVYDKPVYDD-DIFDGIPGLK----NSSKVQYDDVFSSMSSPP-----KADS 180
               AS++PVYDKPVYDD D+F+ IP LK    +S   ++++VFSS+SS P     +  S
Sbjct: 121 KSKSASSMPVYDKPVYDDEDVFESIPELKIPSTSSQSARFENVFSSISSSPTKHRKQNSS 180

Query: 181 AFDDLLGGFSKSGGVSKSKDKEIPAFDDLIPGFRGSSSPG------------------DS 240
            FDDL+G  +     S  ++K    FDDLIPGF  +SSP                    +
Sbjct: 181 PFDDLMGN-NLGKKESDREEKGSSIFDDLIPGFGRTSSPPAKRTTSETTSQSQKPPYRTA 240

Query: 241 KTSSMPMENPFGVSRERDDPHHEEAS--------DVGNFKSPKFDGYPSSGANNKGFDDM 300
           +TSS   E+PF V  E      E ++        ++G F S K D    S  +   F D 
Sbjct: 241 ETSSNVKEDPFVVLEESTSTLREPSTGGFTDPLEEIGKFNSRKTD---HSSVHGGVFVDT 300

Query: 301 DPFASLGKSVPAFSSEGNNRGKARSPPKVDGSSAGPQNSKSKDTMEKLSGKTSGQPLKKE 360
           DP  SLGKS P  +S G +    R P  + GS +  ++S              G    K 
Sbjct: 301 DPLDSLGKSGPDMNSRGKSH--LRPPGNISGSQSPVESS--------------GLYHSKN 360

Query: 361 VSAKNDRQFDEPVFDIPAVPTSSHKFVPQSTSPPAPKDENVMGDTSRFEDSVESDEIWLT 420
           VS      FD+ V              PQ+TS P P + +       FE S   D++WLT
Sbjct: 361 VS------FDDVV-------------EPQNTSTPPPTNSD-----GSFESS---DDVWLT 420

Query: 421 VSEIPLFTQPTVAPPPSRPPPPIPQQVPKEGTSPYGSRSSKMNANDFSSVPNSTHHYQIP 480
           VSEIPLFTQPT APPP+RPPPP P            +R  K   N+  S+P S +H  +P
Sbjct: 421 VSEIPLFTQPTSAPPPTRPPPPRP------------TRPIKKKVNE-PSIPTSAYHSHVP 480

Query: 481 KSASASASASTRDQVSSVDELEEFAMGRNQSNAEDLVNGLSNEEAEMNS----AAAAMKE 540
            S  AS ++ T  Q+   DEL++F++GRNQ+ A    +  S E++++ S    +AAAMK+
Sbjct: 481 SSGRASVNSPTASQM---DELDDFSIGRNQTAANGYPDPSSGEDSDVFSTAAASAAAMKD 540

Query: 541 AMDRAEAKFKHAKEVRERESTRTSKSKESVYWDRDEKAMRNNRVEDEESMDHERFQRERE 600
           AMD+AEAKF+HAKE RE+ES + S+S+E             +  E+ +S       RERE
Sbjct: 541 AMDKAEAKFRHAKERREKESLKASRSREG------------DHTENYDS-------RERE 600

Query: 601 REEKEKR----RAEREKERARELEREREEKEKEQRRLEKERERAREIEMERIKARQAVER 660
             EK+ R    RAERE E  +   REREE+E+EQ+R+E+ERER        + ARQAVER
Sbjct: 601 LREKQVRLDRERAEREAEMEKTQAREREEREREQKRIERERER--------LLARQAVER 660

Query: 661 ATREARERAATEARLKAERAAVEKVNAEARGRAERVAVQRVQAEARERAAGEARERAERA 720
           ATREARERAATEA  K +RAAV KV  +AR RAER AVQR  AEARERAA  ARE+AE+A
Sbjct: 661 ATREARERAATEAHAKVQRAAVGKVT-DARERAERAAVQRAHAEARERAAAGAREKAEKA 720

Query: 721 AAEARERVEKAAAEAKEKEARERASVARAEAEARSRAERVAVERVAAEARERAAADARER 780
           AAEARER   A AE +EK             EA+ RAER AVER AAEAR RAAA A+ +
Sbjct: 721 AAEARER---ANAEVREK-------------EAKVRAERAAVERAAAEARGRAAAQAKAK 768

Query: 781 AAAAARASQQKNENDLESFF-SMGRASSAPRPRANPMDNFFDSQS------PSRPEPQTT 840
                   QQ+N NDL+SFF S+ R SS PR R NP D F DS +       SRP   ++
Sbjct: 781 -------QQQENNNDLDSFFNSVSRPSSVPRQRTNPPDPFQDSWNKGGSFESSRP---SS 768

Query: 841 KQPSTTSSSMRKASSTTNIVDDLSSIFGGP-PPSGEFQEVDGESEERRRARLERHQRVQT 850
           + PS  + ++RKASS TNIVDDLSSIFG P   SG FQ+VDGE+EERRRARLERHQR Q 
Sbjct: 841 RVPSGPTENLRKASSATNIVDDLSSIFGAPASQSGGFQDVDGETEERRRARLERHQRTQE 768
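
The percentages in each HSP summary line are the match counts divided by the alignment length. The following minimal sketch recomputes them from the fractions in such a line; the regular expression and the function name `hsp_percentages` are illustrative and are not part of any BLAST tool.

```python
import re

# Matches fields of the form "<Name> = <count>/<alignment length>"
HSP_STATS = re.compile(r"(\w+) = (\d+)/(\d+)")


def hsp_percentages(line: str) -> dict:
    """Return {field name: percentage} for every count/length pair in the line."""
    return {
        name: round(100 * int(count) / int(length), 2)
        for name, count, length in HSP_STATS.findall(line)
    }


# Summary line of the AUXI2_ARATH hit above
line = "Identity = 385/902 (42.68%), Positives = 497/902 (55.10%), Query Frame = 1"
print(hsp_percentages(line))
```

For this hit, 385/902 and 497/902 reproduce the reported 42.68% and 55.10%.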

BLAST of Cp4.1LG01g17370 vs. Swiss-Prot
Match: AUXI1_ARATH (Auxilin-related protein 1 OS=Arabidopsis thaliana GN=AUXI1 PE=1 SV=2)

HSP 1 Score: 375.6 bits (963), Expect = 1.7e-102
Identity = 377/902 (41.80%), Positives = 493/902 (54.66%), Query Frame = 1

Query: 1   MNDFEGLLATNYGFKPQGKAAPMAASKGTSNINNPTTSPNFDLGSRASFRSTKNSNSL-- 60
           M+DF GLLA ++G KPQGK+APMA+   +S  +  T + ++   + A     K S+SL  
Sbjct: 1   MDDFTGLLARDFGLKPQGKSAPMASQSNSSAADFNTFASSYSFATAAG----KKSDSLPV 60

Query: 61  -SGSLADDHDSLNRSVSAHENREFGGLDDLLGESSRFSRKSESRAGESDVNFDSLFHGAG 120
                 D  D L + V +      G      G SS  SR   + A     ++D++F    
Sbjct: 61  FDDPGRDGDDLLFKDVFS------GPPPPKYGSSSGDSRSPSAPA----FDYDAMFKEPK 120

Query: 121 NSGQPVASNLPVYDKPVYDD-DIFDGIPGLK----NSSKVQYDDVFSSMSSPP-----KA 180
           +     AS++PVYDKPVYDD D+F+ IP LK    +S   ++++VFSS+SS P     + 
Sbjct: 121 SKS---ASSMPVYDKPVYDDEDVFESIPELKIPSTSSQSARFENVFSSISSSPTKHRKQN 180

Query: 181 DSAFDDLLGG-FSKSGGVSKSKDKEIPAFDDLIPGFRGSSSPGDSKT------------- 240
            S FDDL+G    K G  S  ++K    FDDLIPGF  +SSP   +T             
Sbjct: 181 SSPFDDLMGNNLGKKGADSDREEKGSSIFDDLIPGFGRTSSPPSKRTTSETTNQSEKAPY 240

Query: 241 -----SSMPMENPFGVSRERDDPHHEEA-----SDVGNFKSPKFDGYPSSGANNKGFDDM 300
                SS   E+PF V  E +    E +      D+G F S K D    S  +   F D+
Sbjct: 241 RTAETSSNVEEDPFVVLEESESTPREPSRTDPLDDIGKFNSRKTD---HSSVHGGVFVDI 300

Query: 301 DPFASLGKSVPAFSSEGNNRGKARSPPKVDGSSAGPQNSKSKDTME---KLSGKTSGQPL 360
           DP  +LGK  P                          NSK K  +     +SG  S  P+
Sbjct: 301 DPLDNLGKPGP------------------------DMNSKGKSHLRPPGNISGSQS-PPV 360

Query: 361 KKEVSAKNDRQFDEPVFDIPAVPTSSHKFVPQSTSPPAPKDENVMGDTSRFEDSVESDEI 420
           +   S  + +   E   +            P + S P P + N       FE S   D++
Sbjct: 361 ESPGSYHSKKVSFEDFLE------------PHNMSTPPPTNSN-----GSFESS---DDV 420

Query: 421 WLTVSEIPLFTQPTVAPPPSRPPPPIPQQVPKEGTSPYGSRSSKMNANDFSSVPNSTHHY 480
           WLTVSEIPLFTQPT APPP+RPPPP P            +R  K   N+  S+P S +H 
Sbjct: 421 WLTVSEIPLFTQPTSAPPPTRPPPPRP------------TRPIKKKVNE-PSIPTSAYHS 480

Query: 481 QIPKSASASASASTRDQVSSVDELEEFAMGRNQSNAEDLVNGLSNEEAEMNS----AAAA 540
            +P S  AS ++ T  Q+   DEL++F++GRNQ+ A    +  S E++++ S    +AAA
Sbjct: 481 HVPSSGRASVNSPTASQM---DELDDFSIGRNQTAANGYPDPSSGEDSDVFSTAAASAAA 540

Query: 541 MKEAMDRAEAKFKHAKEVRERESTRTSKSKESVYWDRDEKAMRNNRVEDEESMDHERFQR 600
           MK+AMD+AEAKF+HAKE RE+E+ + S+S+E             +  E+ +S       R
Sbjct: 541 MKDAMDKAEAKFRHAKERREKENLKASRSREG------------DHTENYDS-------R 600

Query: 601 EREREEKEKR----RAEREKERARELEREREEKEKEQRRLEKERERAREIEMERIKARQA 660
           ERE  EK+ R    RAERE E  +  ERE+EE+E+EQ+R+E+ERER        + ARQA
Sbjct: 601 ERELREKQVRLDRERAEREAEMEKAQEREKEEREREQKRIERERER--------LVARQA 660

Query: 661 VERATREARERAATEARLKAERAAVEKVNAEARGRAERVAVQRVQAEARERAAGEARERA 720
           VERATREARERAATEA  K +RAAV K   +AR RAER AVQR  AEARERAA  AR++A
Sbjct: 661 VERATREARERAATEAHAKVQRAAVGKAT-DARERAERAAVQRAHAEARERAAAGARDKA 720

Query: 721 ERAAAEARERVEKAAAEAKEKEARERASVARAEAEARSRAERVAVERVAAEARERAAADA 780
            +AAAEARE+ EKAAAEAKE     RA+    E E R RAER AVER AAEAR RAAA A
Sbjct: 721 AKAAAEAREKAEKAAAEAKE-----RANAEAREKETRVRAERAAVERAAAEARGRAAAQA 780

Query: 781 RERAAAAARASQQKNENDLESFFS-MGRASSAPRPRANPMDNFFDSQSPS---RPEPQTT 840
           + +        QQ+N NDL+SFFS + R +SAPR R NP+D F DS +         ++ 
Sbjct: 781 KAK-------QQQENTNDLDSFFSSISRPNSAPRQRTNPLDPFQDSWNKGGSFESSRESL 781

Query: 841 KQPSTTSSSMRKASSTTNIVDDLSSIFG-GPPPSGEFQEVDGESEERRRARLERHQRVQT 850
           + P     ++RK SS TNIVDDLSSIFG     SG FQ+VDGE+EERRRARLERHQR Q 
Sbjct: 841 RVPPGQPENLRKTSSVTNIVDDLSSIFGASASQSGGFQDVDGETEERRRARLERHQRTQE 781

BLAST of Cp4.1LG01g17370 vs. Swiss-Prot
Match: JAC1_ARATH (J domain-containing protein required for chloroplast accumulation response 1 OS=Arabidopsis thaliana GN=JAC1 PE=1 SV=1)

HSP 1 Score: 68.9 bits (167), Expect = 3.4e-10
Identity = 47/145 (32.41%), Positives = 80/145 (55.17%), Query Frame = 1

Query: 776 DSQSPSRPEPQTTKQPSTTSSSMRKASSTTNIVDDLSSIFGGPPPSGEFQEVDGESEERR 835
           D+    R EP TT    TTS  + +       V+D++          + +E + ++EE +
Sbjct: 504 DTVQEERQEPSTTH---TTSEDIDEPFHVNFDVEDITQ------DENKMEEANKDAEEIK 563

Query: 836 R--ARLERHQRVQT----------RAVLWPECGWQPVSLTEMVIPNSVKKVYRKATLCIH 895
              A++ +    ++          + +LW   GW+PV L +M+  N+V+K Y++A L +H
Sbjct: 564 NIDAKIRKWSSGKSGNIRSLLSTLQYILWSGSGWKPVPLMDMIEGNAVRKSYQRALLILH 623

Query: 896 PDKVQQKGATLQQKYVAEKVFDILK 909
           PDK+QQKGA+  QKY+AEKVF++L+
Sbjct: 624 PDKLQQKGASANQKYMAEKVFELLQ 639

BLAST of Cp4.1LG01g17370 vs. Swiss-Prot
Match: UCP7_SCHPO (UBA domain-containing protein 7 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=ucp7 PE=3 SV=1)

HSP 1 Score: 57.4 bits (137), Expect = 1.0e-06
Identity = 42/130 (32.31%), Positives = 63/130 (48.46%), Query Frame = 1

Query: 778 QSPSRPEPQTTKQPSTTSSSMRKASSTTNIVDDLSSIFGGPPPSGEFQEVDGESEERRRA 837
           Q  S P     K  S     +R A      +D+  S    P      Q++  + +E + +
Sbjct: 562 QPKSTPNHTNIKVKSERLQHVRMAQQKAEQLDEERSRLREP-----VQQIVNKWKEGKES 621

Query: 838 RLERHQRVQTRAVLWPECGWQPVSLTEMVIPNSVKKVYRKATLCIHPDKVQQKGATLQQK 897
            L R        +LWPEC WQ VSL+E+V+P  VK  Y KA   +HPDK+ Q+  +++ +
Sbjct: 622 NL-RALLASLDTILWPECRWQKVSLSELVLPKKVKIAYMKAVSRVHPDKLPQQ-TSVEHQ 681

Query: 898 YVAEKVFDIL 908
            +AE  F IL
Sbjct: 682 LIAESAFSIL 684

BLAST of Cp4.1LG01g17370 vs. TrEMBL
Match: A0A0A0KYM6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G496260 PE=4 SV=1)

HSP 1 Score: 1320.4 bits (3416), Expect = 0.0e+00
Identity = 786/984 (79.88%), Positives = 828/984 (84.15%), Query Frame = 1

Query: 1   MNDFEGLLATNYGFKPQGKAAPMAASKGTSNINNPTTSPNFDLGSRASFRSTKNSNSLSG 60
           MNDFEGLLATNYGFKPQGKAAPMAASKGTSNIN PT+SPNFDLGSR SFRS+K SNSLSG
Sbjct: 1   MNDFEGLLATNYGFKPQGKAAPMAASKGTSNIN-PTSSPNFDLGSRPSFRSSKTSNSLSG 60

Query: 61  SLADDHDSLNRSVSAHENREFGGLDDLLGESSRFSRKSESRAGESDVNFDSLFHGAGNSG 120
           SLADD DSLNRS+SAH+NREF GLDDLLG S RFSRKSE+RAG+SDVNFDSLF+G GNS 
Sbjct: 61  SLADDRDSLNRSMSAHDNREFDGLDDLLGGSGRFSRKSEARAGDSDVNFDSLFNGVGNSS 120

Query: 121 QPVASNLPVYDKPVYDDDIFDGIPGLKNSSKVQYDDVFSSMSSPPKADSAFDDLLGGFSK 180
           QP ASNLPVYDKPVYDDDIFDGIPGL+NSSKVQYDDVFSSMSSPPKA+SAFDDLLGGF K
Sbjct: 121 QPPASNLPVYDKPVYDDDIFDGIPGLRNSSKVQYDDVFSSMSSPPKAESAFDDLLGGFGK 180

Query: 181 SGGVSKSK--------DKEIPAFDDLIPGFRGSSSPGD--------------SKTSSMPM 240
           S  V KSK        D+EIPAFDDLIPGFRG S PGD              S TSS  M
Sbjct: 181 SDSVPKSKGGKGTQSKDREIPAFDDLIPGFRGGSPPGDRSNSSWSSEPTSVKSTTSSKAM 240

Query: 241 ENPFGVSRERDDPHHEEASDVGNFKSPKFDGYPSSGANNKGFDDMDPFASLGKSVPAFSS 300
           ENPFGVSRE +D H EEASD+GNFKSPKFDGYPSS ANNK FDDMDPFASLGKSVPAFSS
Sbjct: 241 ENPFGVSREHNDLH-EEASDIGNFKSPKFDGYPSSDANNKAFDDMDPFASLGKSVPAFSS 300

Query: 301 EGNNRGKARSPPKVDGSSAGPQNSKSKDTMEKLSGKTSGQPLKKEVSAKNDRQFDEPVFD 360
           EGNNR KARSPP+VDG++AGPQNS SKD MEK S KTS QPLKK+V AKNDR FD+PVFD
Sbjct: 301 EGNNRAKARSPPRVDGTAAGPQNSNSKDAMEKPSTKTSVQPLKKDVPAKNDRHFDQPVFD 360

Query: 361 IPAVPTSSHKFVPQSTSPPAPKDENVMGDTSRFEDSVESDEIWLTVSEIPLFTQPTVAPP 420
           IP V T+SHKFVPQSTSPPA  D NVMG+TSRFEDSVE DEIWLTVSEIPLFTQPTVAPP
Sbjct: 361 IPTVSTNSHKFVPQSTSPPASDDANVMGETSRFEDSVEPDEIWLTVSEIPLFTQPTVAPP 420

Query: 421 PSRPPPPIPQQVPKEGTSPYGSRSSKMNANDFSSVPNSTHHYQIPKSASASASASTRDQV 480
           PSRPPPPIPQQVPKEG  PYG RSSKMNANDFSS P+STHH+QIPKS S S     RDQV
Sbjct: 421 PSRPPPPIPQQVPKEGMGPYGLRSSKMNANDFSSFPSSTHHFQIPKSTSPSM----RDQV 480

Query: 481 SSVDELEEFAMGRNQSNAEDLVNGLSNEEAEMNSAAAAMKEAMDRAEAKFKHAKEVRERE 540
           SSVDELE+FAMGRN SNA++ VN LSNEEAEMNSAAAAMKEAMDRAEAKFKHAKEVRERE
Sbjct: 481 SSVDELEQFAMGRNPSNADEQVNSLSNEEAEMNSAAAAMKEAMDRAEAKFKHAKEVRERE 540

Query: 541 STRTSKSKESVYWDRDEKAMRNNRVEDEESMDHERFQREREREEKEK--RRAEREKERAR 600
           STRTSK KE+VYWDRDEKA R++RVEDEE++D ERFQREREREEKEK  R+AER+KERAR
Sbjct: 541 STRTSKIKEAVYWDRDEKATRSDRVEDEEAIDRERFQREREREEKEKEKRKAERDKERAR 600

Query: 601 ELEREREEKEKEQRRLEKERERAREIEMERIKARQAVERATREARERAATEARLKAERAA 660
           ELEREREEKEKE RRLEKERERARE+EMERIK RQAVERATREARERAA EARLKAERAA
Sbjct: 601 ELEREREEKEKELRRLEKERERARELEMERIKVRQAVERATREARERAAIEARLKAERAA 660

Query: 661 VEKVNAEARGRAERVAVQRVQAEARERAAGEARERAERAAAEARERVEKAAAEAKEKEAR 720
           VEKVNAEAR RAER AVQR Q+EARERAA EARERAERAA EARER EKAAAEAKE+EAR
Sbjct: 661 VEKVNAEARERAERAAVQRAQSEARERAAAEARERAERAATEARERAEKAAAEAKEREAR 720

Query: 721 ERASVARAEAEARSRAERVAVERVAAEARERAAADARERAAAAARASQQKNENDLESFFS 780
           ERASVARAE+EARSRAER AVER AAEARERAA DARERAAAAARASQQKNENDLESFFS
Sbjct: 721 ERASVARAESEARSRAERAAVERAAAEARERAAVDARERAAAAARASQQKNENDLESFFS 780

Query: 781 MGRASSAPRPRANPMDNFFDSQSPSRPEPQTTKQPSTTSSSMRKASSTTNIVDDLSSIFG 840
           MGR SS P+ RANPMDNF D+QSP+RPE  TTK   T  ++MRKASS TNIVDDLSSIFG
Sbjct: 781 MGRPSSVPKHRANPMDNF-DAQSPNRPE--TTKPSPTPPTNMRKASSATNIVDDLSSIFG 840

Query: 841 GPPPSGEFQEVDGESEERRRARLERHQRVQTRA--------------------------- 900
           GPP SGEFQEVDGE+EERRRARLERHQRVQTRA                           
Sbjct: 841 GPPSSGEFQEVDGETEERRRARLERHQRVQTRAAKALAEKNERDLQMQREQAERHRIAET 900

Query: 901 -------------------------VLWPECGWQPVSLTEMVIPNSVKKVYRKATLCIHP 909
                                    VLWPECGWQPVSLTEMVIPN+VKKVYRKATLCIHP
Sbjct: 901 LDAEIKRWAAGKEGNLRALLSTLQYVLWPECGWQPVSLTEMVIPNAVKKVYRKATLCIHP 960

BLAST of Cp4.1LG01g17370 vs. TrEMBL
Match: M5X738_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa000822mg PE=4 SV=1)

HSP 1 Score: 678.3 bits (1749), Expect = 1.4e-191
Identity = 502/897 (55.96%), Positives = 617/897 (68.78%), Query Frame = 1

Query: 1   MNDFEGLLATNYGFKPQGKAAPMAASKGTSNINNPTTSPNFDLGSRASFRSTKNSNSLSG 60
           MNDFEGLLA+++GFKP GK+APM+AS       N + +PNFDLGS    RST+ +NS SG
Sbjct: 1   MNDFEGLLASDFGFKPSGKSAPMSASSA-----NSSKAPNFDLGSSGPSRSTRATNSFSG 60

Query: 61  SLADDHDSLNRSVSAHENREFGGLDDLLGESSRFSRKSES-----RAGESDVNFDSLFHG 120
           SLADD DS+       + +EFG   D+ G S+R+S KSES     R  ++  NFDS+F G
Sbjct: 61  SLADDRDSI---FGPSKTQEFG---DIFGGSARYSTKSESTKSPSRGEDAAFNFDSMFGG 120

Query: 121 AGNSGQPVASNLPVYDKPVYDDDIFDGIPGLKN-SSKVQYDDVFSSMSSPPK--ADSAFD 180
           + +S    ++  PVYDKPVYDDDIFDG+PGLK+ SSKV+Y+DVFS+++SPP   + S FD
Sbjct: 121 STDSVPKSSNPGPVYDKPVYDDDIFDGVPGLKSTSSKVKYEDVFSTVTSPPSKGSSSGFD 180

Query: 181 DLLGGFSKSGGVSKSK--------DKEIPAFDDLIPGFRGSSSPGD-------------- 240
           DLLGGF K+    KS         +K +P  DDL+PGF GS+   +              
Sbjct: 181 DLLGGFGKAEPQLKSSGSRGSDRAEKVVPGLDDLLPGFGGSNPASERSTSEANWPPETNA 240

Query: 241 ---SKTSSMPMENPFGVSRERDDPHHEEASDVGNFKSPKFDGYPSSGANNKGFDDMDPFA 300
              SKT+S  ME+PF V  +  DP  EE S +    S K D    S  N + FDD+DPF 
Sbjct: 241 NNLSKTTSKVMEDPFVVPGQFKDPL-EEISRLSKSASSKVDS--PSVDNGRAFDDIDPFD 300

Query: 301 SLGKSVPAFSSEGNNRGKARSPPKVDGSSAGPQNSKSKDTMEKLSGKTSGQPLKKEVSAK 360
            LGKSVP FSS  N RGK     + D S+   + S +K++ EK S K+     +K+V   
Sbjct: 301 GLGKSVPVFSSGRNYRGKDSGNLRADTSTNNSRASTAKESTEKPSVKSPDNQSQKKVPVG 360

Query: 361 NDRQFDEPVFDIPAVPTSSHKFVPQSTSPPA-----PKDENVMGDTS-RFEDSVES-DEI 420
           N     + +FD+P V T S K   Q+ SPP+     PK+ NV  D S R E++++S D +
Sbjct: 361 NHWDSHQTLFDMPTVSTDSQKSAGQTMSPPSYVNVSPKEANVQVDRSPRSEENLDSSDFL 420

Query: 421 WLTVSEIPLFTQPTVAPPPSRPPPPIPQQVPKEGTSPYGSRSSKMNANDFSSVPNSTHHY 480
           WLTVSEIPL TQPT APPPSRPPPP P QV K       + +++  A++     +ST  +
Sbjct: 421 WLTVSEIPLMTQPTSAPPPSRPPPPRPVQVSKTRMGSPATTNARKKASE-----SSTQFF 480

Query: 481 QIPKSASASASASTRDQVSSVDELEEFAMGRNQSNAEDLVNGLSNEEAEMNSAAAAMKEA 540
           Q PKSA A+A       VSS+DELE+FAMG++QSN ++  NGL  EE EMNS AAAMKEA
Sbjct: 481 QAPKSAPAAARGPG---VSSIDELEDFAMGKSQSNFDEHANGLPGEELEMNSVAAAMKEA 540

Query: 541 MDRAEAKFKHAKEVRERESTRTSKSKESVYWDRDEKAMRNNRV--EDEESMDHERFQRER 600
           MDRAEAKF+HAKEVRER ST+ ++SKE+   ++DEKAM++ +V  E +E +D ER QRE 
Sbjct: 541 MDRAEAKFRHAKEVRERGSTKAARSKEA-QLEKDEKAMQDEKVLREKQERLDSERLQRES 600

Query: 601 EREEKEKRRAEREKERARELEREREEKEKEQRRLEKERERAREIEMERIKARQAVERATR 660
           E E+ E+ R E+E    RE+ER  EEKE+EQR+LE+ERER RE+E ER KARQAVERATR
Sbjct: 601 EEEDMEQSRVEKE----REIERVWEEKEREQRKLERERERTREMERERDKARQAVERATR 660

Query: 661 EARERAATEARLKA-----ERAAVEKVNAEARGRAERVAVQRVQAEARERAAGEARERAE 720
           EARERAA EAR+KA     ERAAV+KV AEAR RAER AVQR QAEARERAA EA+ERAE
Sbjct: 661 EARERAAAEARIKAERAKSERAAVDKVTAEARERAERAAVQRAQAEARERAAAEAKERAE 720

Query: 721 RAAAEARERVEKAAAEAKEKEARERASVARAEAEARSRAERVAVERVAAEARERAAADAR 780
           +AAAEARER   A AEAKE+EARERA+ ARA AEAR+RAER AVER AAEARERAAA+AR
Sbjct: 721 KAAAEARER---ANAEAKEREARERAAAARAGAEARTRAERAAVERAAAEARERAAAEAR 780

Query: 781 ER-AAAAARASQQKNENDLESFFSMGRASSAPRPRANPMDNFFDSQSPSRPEPQTTKQPS 840
           ER AAAAARA+QQK+ENDLE FFSMGRASSAPRPRAN     F     +R EP+  K  S
Sbjct: 781 ERAAAAAARANQQKSENDLEHFFSMGRASSAPRPRANSSMALFQDPFQNRQEPEVPKTSS 840

Query: 841 TTSSSMRKASSTTNIVDDLSSIFGGPPPS-GEFQEVDGESEERRRARLERHQRVQTR 849
           T SS++RKA+STTNIVDDLS+IFG  P S GEFQEV+GE+EERRRARLERHQR Q R
Sbjct: 841 TASSNIRKANSTTNIVDDLSAIFGAAPSSGGEFQEVEGETEERRRARLERHQRTQER 867

BLAST of Cp4.1LG01g17370 vs. TrEMBL
Match: A0A059B8F0_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_H05116 PE=4 SV=1)

HSP 1 Score: 659.8 bits (1701), Expect = 5.1e-186
Identity = 518/1032 (50.19%), Positives = 632/1032 (61.24%), Query Frame = 1

Query: 1   MNDFEGLLATNYGFKPQGKAAPM---AASKGTSNINNPTTSPNFDLGSRASFRSTKNSNS 60
           MNDF+ LL  ++GFKPQGK+APM    A  G      P  + +FDLGSR           
Sbjct: 1   MNDFDALLTADFGFKPQGKSAPMKPSGAGPGPGPSPGPMPAFDFDLGSR----------- 60

Query: 61  LSGSLADDHDSLNRSVSAHENREFGGLDDLLGESSRFSRKSESRAGE-----SDVNFDSL 120
            SG +A    +     +A +++      DL G ++R + +SE   G      S  +FDS+
Sbjct: 61  -SGRVASSDHAFGSKPNA-KSQTLDDFGDLFGGNARGATRSEGGGGGGGGGGSGFDFDSM 120

Query: 121 FHGAGNSGQPVASNLPVYDKPVYDDDIFDGIPGLKNSSKVQYDDVFSSMS-SPPKA---- 180
           F  +  S        PVYDKPVYDDDIF+GIPG+K+SS V+Y+DVF S+S SPP+     
Sbjct: 121 FASSAKS-----PGSPVYDKPVYDDDIFEGIPGIKSSSGVKYEDVFRSISDSPPRRKGNE 180

Query: 181 DSAFDDLLGGFSKSGGVSKSK--DKEIPAFDD-LIPGFRGSSSPGDSKTSSMPM------ 240
           DS FDDLLGGF K    SKSK  D+ +PAFDD LIPGF GS +P  ++    P       
Sbjct: 181 DSGFDDLLGGFGKKESESKSKGYDRSVPAFDDDLIPGFGGSGAPAKNRPMPEPTWSSEPT 240

Query: 241 -------ENPFGVSRERDDPHH-----------EEASDVGNFKSPKFDGYPSSGANNKGF 300
                  E+PF  S     PH            EE S +G  +S K D  PS  +    +
Sbjct: 241 HHTSKMTEDPFA-SLGSTSPHVTSSSGLSTDPLEEFSKLGKSESMKTDA-PSVSSGGV-Y 300

Query: 301 DDMDPFASLGKSVPAFSSEGNNRGKARSPPKVDGSSAGPQNSKSKDTMEKLSGKTSGQPL 360
           DDMDPF  LGKSVP+FSSE  N GK RSP + + SS      +S D             +
Sbjct: 301 DDMDPFDGLGKSVPSFSSERTNSGKDRSPLRREHSSR-----RSPDNY-----------V 360

Query: 361 KKEVSAKNDRQFDEPVFDIPAVPTSSHKFVPQSTSPP-----APKDENVMGDTS-RFEDS 420
           ++     ND + ++  FDIP     +     Q+ S P     +P D +   D S R+E+ 
Sbjct: 361 EESFPDDNDLKSNKTFFDIPPASRGTPVSAGQTASDPPYAQASPNDASSPVDMSPRYEEH 420

Query: 421 VES-DEIWLTVSEIPLFTQPTVAPPPSRPPPPIPQQVPKEGTSPYGSRSSKMNANDFSSV 480
           V S DE+WLTVSEIPLFT PT  PPPSRPPPP P +V + G+S +GS ++K N N+FSS 
Sbjct: 421 VSSSDEVWLTVSEIPLFTPPTSVPPPSRPPPPRPARVIRAGSSSFGSANAKRNVNEFSSF 480

Query: 481 PNSTHHYQIPKSASASASASTRDQVSSVDELEEFAMGRNQSNAEDLVNGLSNEEAEMNS- 540
           PNST +++  KS   +  +S     S +DEL +FAMGR+++N +D  NG  +EE  MNS 
Sbjct: 481 PNSTQNFEDSKSVPPTFRSSAS---SPIDELGDFAMGRSRNNVDDHSNGFYSEEMGMNSD 540

Query: 541 ---AAAAMKEAMDRAEAKFKHAKEVRERESTRTSKSKESV-------YWDRDEKAMRNNR 600
              +AAAMKEAM+RAEAKF+HAKEVRERES++T++S+++V       + D DE+A+R   
Sbjct: 541 AAASAAAMKEAMERAEAKFRHAKEVRERESSKTARSRDTVHLEKDEDFLDADERALR--- 600

Query: 601 VEDEESMDHERFQREREREEKEKRRAEREKERARELEREREEKEKEQRRLEKERERAREI 660
            E +E +D ER QRE+E EE+E+RR E+E        REREEKE+EQRRLEKERER    
Sbjct: 601 -EKQERLDRERMQREKEEEEREQRRLEKE--------REREEKEREQRRLEKERERR--- 660

Query: 661 EMERIKARQAVERATREARERAA--------TEARLKAERAAVEKVNAEARGRAERVAVQ 720
           E+ER KARQAV RATREARERAA        T+AR KAERAAV+KVNAEAR RAER AV 
Sbjct: 661 EIEREKARQAVGRATREARERAAAEARERAATDARQKAERAAVDKVNAEARERAERAAVF 720

Query: 721 RVQAEARERAAGEARERAERAAAEARERVEKAAAEAKEKEARERASVARAEAEARSRAER 780
           R QAEARERAA EARERAE+AAAEAR+R           EA+ RA      AEAR +AER
Sbjct: 721 RAQAEARERAAVEARERAEKAAAEARDR-----------EAQGRA------AEARMKAER 780

Query: 781 VAVERVAAEARERAAADARERAAA-----AARASQQKNENDLESFFSMGRASSAPRPRAN 840
            AVER AAEARERAA +ARERAAA     AARA+QQKN+NDLESFFSMGRASSAPRPRAN
Sbjct: 781 AAVERAAAEARERAAVEARERAAAEAREKAARANQQKNDNDLESFFSMGRASSAPRPRAN 840

Query: 841 PMDNFFDSQSPSRPEPQTTKQPSTTSSSMRKASSTTNIVDDLSSIFGGP-PPSGEFQEVD 900
             D FF++QS SR  P   K   +TSSSM+KASS TNIVDDLSSIFGG    SG+FQEV+
Sbjct: 841 STDPFFETQSQSRKGPDVAKTSESTSSSMKKASSNTNIVDDLSSIFGGGVASSGDFQEVE 900

Query: 901 GESEERRRARLERHQRVQTRA--------------------------------------- 909
           GESEERR+ARLER QR Q RA                                       
Sbjct: 901 GESEERRKARLERQQRAQERAAKALAEKNQRDLQVQREQAERHRISETLDVEIKRWAAGK 960

BLAST of Cp4.1LG01g17370 vs. TrEMBL
Match: A0A059B9Z8_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_H05116 PE=4 SV=1)

HSP 1 Score: 657.5 bits (1695), Expect = 2.5e-185
Identity = 515/1027 (50.15%), Positives = 630/1027 (61.34%), Query Frame = 1

Query: 1   MNDFEGLLATNYGFKPQGKAAPM---AASKGTSNINNPTTSPNFDLGSRASFRSTKNSNS 60
           MNDF+ LL  ++GFKPQGK+APM    A  G      P  + +FDLGSR           
Sbjct: 1   MNDFDALLTADFGFKPQGKSAPMKPSGAGPGPGPSPGPMPAFDFDLGSR----------- 60

Query: 61  LSGSLADDHDSLNRSVSAHENREFGGLDDLLGESSRFSRKSESRAGE-----SDVNFDSL 120
            SG +A    +     +A +++      DL G ++R + +SE   G      S  +FDS+
Sbjct: 61  -SGRVASSDHAFGSKPNA-KSQTLDDFGDLFGGNARGATRSEGGGGGGGGGGSGFDFDSM 120

Query: 121 FHGAGNSGQPVASNLPVYDKPVYDDDIFDGIPGLKNSSKVQYDDVFSSMS-SPPKA---- 180
           F  +  S        PVYDKPVYDDDIF+GIPG+K+SS V+Y+DVF S+S SPP+     
Sbjct: 121 FASSAKS-----PGSPVYDKPVYDDDIFEGIPGIKSSSGVKYEDVFRSISDSPPRRKGNE 180

Query: 181 DSAFDDLLGGFSKSGGVSKSK--DKEIPAFDD-LIPGFRGSSSPGDSKTSSMPM------ 240
           DS FDDLLGGF K    SKSK  D+ +PAFDD LIPGF GS +P  ++    P       
Sbjct: 181 DSGFDDLLGGFGKKESESKSKGYDRSVPAFDDDLIPGFGGSGAPAKNRPMPEPTWSSEPT 240

Query: 241 -------ENPFGVSRERDDPHH-----------EEASDVGNFKSPKFDGYPSSGANNKGF 300
                  E+PF  S     PH            EE S +G  +S K D  PS  +    +
Sbjct: 241 HHTSKMTEDPFA-SLGSTSPHVTSSSGLSTDPLEEFSKLGKSESMKTDA-PSVSSGGV-Y 300

Query: 301 DDMDPFASLGKSVPAFSSEGNNRGKARSPPKVDGSSAGPQNSKSKDTMEKLSGKTSGQPL 360
           DDMDPF  LGKSVP+FSSE  N GK RSP + + SS      +S D             +
Sbjct: 301 DDMDPFDGLGKSVPSFSSERTNSGKDRSPLRREHSSR-----RSPDNY-----------V 360

Query: 361 KKEVSAKNDRQFDEPVFDIPAVPTSSHKFVPQSTSPP-----APKDENVMGDTS-RFEDS 420
           ++     ND + ++  FDIP     +     Q+ S P     +P D +   D S R+E+ 
Sbjct: 361 EESFPDDNDLKSNKTFFDIPPASRGTPVSAGQTASDPPYAQASPNDASSPVDMSPRYEEH 420

Query: 421 VES-DEIWLTVSEIPLFTQPTVAPPPSRPPPPIPQQVPKEGTSPYGSRSSKMNANDFSSV 480
           V S DE+WLTVSEIPLFT PT  PPPSRPPPP P +V + G+S +GS ++K N N+FSS 
Sbjct: 421 VSSSDEVWLTVSEIPLFTPPTSVPPPSRPPPPRPARVIRAGSSSFGSANAKRNVNEFSSF 480

Query: 481 PNSTHHYQIPKSASASASASTRDQVSSVDELEEFAMGRNQSNAEDLVNGLSNEEAEMNS- 540
           PNST +++  KS   +  +S     S +DEL +FAMGR+++N +D  NG  +EE  MNS 
Sbjct: 481 PNSTQNFEDSKSVPPTFRSSAS---SPIDELGDFAMGRSRNNVDDHSNGFYSEEMGMNSD 540

Query: 541 ---AAAAMKEAMDRAEAKFKHAKEVRERESTRTSKSKESV-------YWDRDEKAMRNNR 600
              +AAAMKEAM+RAEAKF+HAKEVRERES++T++S+++V       + D DE+A+R   
Sbjct: 541 AAASAAAMKEAMERAEAKFRHAKEVRERESSKTARSRDTVHLEKDEDFLDADERALR--- 600

Query: 601 VEDEESMDHERFQREREREEKEKRRAEREKERARELEREREEKEKEQRRLEKERERAREI 660
            E +E +D ER QRE+E EE+E+RR E+E        REREEKE+EQRRLEKERER    
Sbjct: 601 -EKQERLDRERMQREKEEEEREQRRLEKE--------REREEKEREQRRLEKERERR--- 660

Query: 661 EMERIKARQAVERATREARERAA--------TEARLKAERAAVEKVNAEARGRAERVAVQ 720
           E+ER KARQAV RATREARERAA        T+AR KAERAAV+KVNAEAR RAER AV 
Sbjct: 661 EIEREKARQAVGRATREARERAAAEARERAATDARQKAERAAVDKVNAEARERAERAAVF 720

Query: 721 RVQAEARERAAGEARERAERAAAEARERVEKAAAEAKEKEARERASVARAEAEARSRAER 780
           R QAEARERAA EARERAE+AAAEAR+R           EA+ RA      AEAR +AER
Sbjct: 721 RAQAEARERAAVEARERAEKAAAEARDR-----------EAQGRA------AEARMKAER 780

Query: 781 VAVERVAAEARERAAADARERAAAAARASQQKNENDLESFFSMGRASSAPRPRANPMDNF 840
            AVER AAEARERAAA+ARE+AA   RA+QQKN+NDLESFFSMGRASSAPRPRAN  D F
Sbjct: 781 AAVERAAAEARERAAAEAREKAA---RANQQKNDNDLESFFSMGRASSAPRPRANSTDPF 840

Query: 841 FDSQSPSRPEPQTTKQPSTTSSSMRKASSTTNIVDDLSSIFGGP-PPSGEFQEVDGESEE 900
           F++QS SR  P   K   +TSSSM+KASS TNIVDDLSSIFGG    SG+FQEV+GESEE
Sbjct: 841 FETQSQSRKGPDVAKTSESTSSSMKKASSNTNIVDDLSSIFGGGVASSGDFQEVEGESEE 900

Query: 901 RRRARLERHQRVQTRA-------------------------------------------- 909
           RR+ARLER QR Q RA                                            
Sbjct: 901 RRKARLERQQRAQERAAKALAEKNQRDLQVQREQAERHRISETLDVEIKRWAAGKEGNLR 952

BLAST of Cp4.1LG01g17370 vs. TrEMBL
Match: A0A0R0KHB7_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_04G209400 PE=4 SV=1)

HSP 1 Score: 654.4 bits (1687), Expect = 2.1e-184
Identity = 498/969 (51.39%), Positives = 617/969 (63.67%), Query Frame = 1

Query: 1   MNDFEGLLATNYGFKPQGKAAPMAASKGTSNINNPTTSPNFDLGSRASFRSTKNSNSLSG 60
           MNDF+GLLAT++GFKPQGK+APMAASK +SN  N T+S NFDLGSR    ST+ SNS   
Sbjct: 1   MNDFDGLLATDFGFKPQGKSAPMAASKVSSN--NKTSSLNFDLGSR----STRTSNS--- 60

Query: 61  SLADDHDSLNRSVSAHENREFGGLDDLLGESSRFSRKSESRAGESDVNFDSLFHGAGNSG 120
                  S+N + +A         DDL G+SS F   ++ R+   +V             
Sbjct: 61  -------SVNAAAAA------ASFDDLFGDSSVFRASADFRSKSVNV------------- 120

Query: 121 QPVASNLPVYDKPVYDDD--IFDGIPGLKNSSKVQYDDVFSSMSSPPKADSAFDDLLGGF 180
            PV    PV+DKPVYDDD  IFDG+PGLK+SSKV YDDVF+   S   A +AFDDLLG  
Sbjct: 121 -PVHDG-PVFDKPVYDDDDDIFDGVPGLKSSSKVSYDDVFAPGGSAAAA-AAFDDLLGRL 180

Query: 181 SKSGGVSKSKDKEIPAFDDLIPGFRGSSSPGD----------------SKTSSMPMENPF 240
            KS  V K        FDDLIPGFR S +  D                SKT+S   ++PF
Sbjct: 181 GKSEKVEKGAAD----FDDLIPGFRSSKASTDGTIPDINLSPEPTIDASKTASSTTDDPF 240

Query: 241 GVSRERDDPHH----------EEASDVGNFKSPKFDGYPSSGANNKGFDDMDPFASLGKS 300
            V      P            EE S   + +S K D   SS +N + +DD+DPF  LG S
Sbjct: 241 KVFESTSAPVDSSSDYFTDPLEEISKFTSSRSTKNDR--SSNSNGEVYDDIDPFGGLGNS 300

Query: 301 VPAFSSEGNNRGKARSP-PKVDGSSAGPQNSKSKDTMEKLSGKTSGQPLKKEVSAKNDRQ 360
           VPAFS+E N+   + SP P+ + SS+  ++ +S D     S ++  +    ++   +D++
Sbjct: 301 VPAFSAERNSMKGSSSPTPRSNTSSSWTRDKESNDIS---SVRSPDRKTPNKILVDHDQE 360

Query: 361 FDEPVFDIPAVPTSSHKFVPQSTSPPAP-----KDENVMGDTS-RFEDSVES-DEIWLTV 420
           F +  FD+P   + S+K V Q ++ P+      K  N + D S ++E+ +ES D+IWL V
Sbjct: 361 FHQAAFDMPTYSSDSYKPVGQRSTFPSYDNNGFKQANTLEDMSPKYEEKLESNDDIWLMV 420

Query: 421 SEIPLFTQPTVAPPPSRPPPPIPQQVPKEGTSPYGSRSSKMNANDFSSVPNSTHHYQIPK 480
           SEIPLFTQPT APPPSRPPPP P  + K G     S + +   NDFS  P+ST   Q PK
Sbjct: 421 SEIPLFTQPTAAPPPSRPPPPRPVHILKSGAGSSASANVRKKDNDFSYFPSSTQFSQGPK 480

Query: 481 SASASASASTRDQVSSVDELEEFAMGRNQSNAEDLVNGLSNEEAEMNSAAAAMKEAMDRA 540
           SA A+A  S+  Q    DELE+FAMG+++ N ++ VNGL+++E EMNSAAAAMKEAMDRA
Sbjct: 481 SAPAAAKFSSASQF---DELEDFAMGKSRDNDDEGVNGLADKELEMNSAAAAMKEAMDRA 540

Query: 541 EAKFKHAKEVRERESTRTSKSKESVYWDRDEKAMRNNRVEDEESMDHERFQREREREEKE 600
           EAKF+HAK VRERE+T+ +KSKE V  D+D K +  +R + +E +DHE   +++EREEKE
Sbjct: 541 EAKFRHAKGVRERENTKVAKSKEPVQLDKDGKVVSEDRGK-QERLDHE--WQQKEREEKE 600

Query: 601 KRRAEREKERARELEREREEKEKEQRRLEKERERAREIEMERIKARQAVERATREARERA 660
           +RR E+E          REEKE+EQ+RLE+ERERAR          QAVERATREARERA
Sbjct: 601 QRRCEKE----------REEKEREQQRLERERERAR----------QAVERATREARERA 660

Query: 661 ATEARLKAERAAVEKVNAEARGRAERVAVQRVQAEARERAAGEARERAERAAAEARERVE 720
           A EAR +AERAAVEK NAEAR RAER AVQR QAEARERAA                   
Sbjct: 661 AAEARQRAERAAVEKANAEARKRAERTAVQRAQAEARERAA------------------- 720

Query: 721 KAAAEAKEKEARERASVARAEAEARSRAERVAVERVAAEARERAAADARERAAAAARASQ 780
              AEAKE+E RERA+ ARAE EAR +AER AVER AAEARERA A ARERAAAAAR SQ
Sbjct: 721 ---AEAKEREVRERAAAARAEPEARVKAERAAVERAAAEARERAVAQARERAAAAARMSQ 780

Query: 781 QKNENDLESFFSM-GRASSAPRP-RANPMDNFFDSQSPSRPEPQTTKQPSTTSSSMRKAS 840
           Q+N+NDLESFFS   RA+SAPRP R +  D+ FD+Q  S      T++ +  SSSM+KAS
Sbjct: 781 QQNDNDLESFFSTDARANSAPRPPRPSSSDSVFDAQFQS----DVTRKSTGVSSSMKKAS 840

Query: 841 STTNIVDDLSSIFG-GPPPSGEFQEVDGESEERRRARLERHQRVQTRA------------ 900
           S+TNIVDDLSSIFG  P  SGEFQE++GE+EERRRARLERHQR Q RA            
Sbjct: 841 SSTNIVDDLSSIFGAAPSSSGEFQEIEGETEERRRARLERHQRTQERAAKALAEKNQRDL 870

Query: 901 ----------VLWPECGWQPVSLTEMVIPNSVKKVYRKATLCIHPDKVQQKGATLQQKYV 909
                     VLWPECGWQPVSLT+++   +V+KVYRKATLC HPDKVQQKGAT+QQKY+
Sbjct: 901 QTQRDQAERHVLWPECGWQPVSLTDLITAAAVRKVYRKATLCTHPDKVQQKGATIQQKYI 870

BLAST of Cp4.1LG01g17370 vs. TAIR10
Match: AT4G12770.1 (AT4G12770.1 Chaperone DnaJ-domain superfamily protein)

HSP 1 Score: 384.0 bits (985), Expect = 2.7e-106
Identity = 383/901 (42.51%), Positives = 496/901 (55.05%), Query Frame = 1

Query: 1   MNDFEGLLATNYGFKPQGKAAPMAASKGTSNINNPTTSPNFDLGSRASFRSTKNSNSLSG 60
           M+DF GLLA ++G KPQGK+APMA+   +S  +  T + ++   S A  +S  +S  +  
Sbjct: 1   MDDFTGLLARDFGLKPQGKSAPMASQSNSSAADFNTFASSYSFASAAGKKS--DSLPVFD 60

Query: 61  SLADDHDSLN-RSVSAHENREFGGLDDLLGESSRFSRKSESRAGESDVNFDSLFHGAGNS 120
            L  D D L  R V +    ++G   D        SR   + A + DV F          
Sbjct: 61  DLGRDGDDLLFRDVFSGPPPKYGSSGD--------SRSPSAPAFDYDVMFKE-------P 120

Query: 121 GQPVASNLPVYDKPVYDD-DIFDGIPGLK----NSSKVQYDDVFSSMSSPP-----KADS 180
               AS++PVYDKPVYDD D+F+ IP LK    +S   ++++VFSS+SS P     +  S
Sbjct: 121 KSKSASSMPVYDKPVYDDEDVFESIPELKIPSTSSQSARFENVFSSISSSPTKHRKQNSS 180

Query: 181 AFDDLLGGFSKSGGVSKSKDKEIPAFDDLIPGFRGSSSPG------------------DS 240
            FDDL+G  +     S  ++K    FDDLIPGF  +SSP                    +
Sbjct: 181 PFDDLMGN-NLGKKESDREEKGSSIFDDLIPGFGRTSSPPAKRTTSETTSQSQKPPYRTA 240

Query: 241 KTSSMPMENPFGVSRERDDPHHEEAS--------DVGNFKSPKFDGYPSSGANNKGFDDM 300
           +TSS   E+PF V  E      E ++        ++G F S K D    S  +   F D 
Sbjct: 241 ETSSNVKEDPFVVLEESTSTLREPSTGGFTDPLEEIGKFNSRKTD---HSSVHGGVFVDT 300

Query: 301 DPFASLGKSVPAFSSEGNNRGKARSPPKVDGSSAGPQNSKSKDTMEKLSGKTSGQPLKKE 360
           DP  SLGKS P  +S G +    R P  + GS +  ++S              G    K 
Sbjct: 301 DPLDSLGKSGPDMNSRGKSH--LRPPGNISGSQSPVESS--------------GLYHSKN 360

Query: 361 VSAKNDRQFDEPVFDIPAVPTSSHKFVPQSTSPPAPKDENVMGDTSRFEDSVESDEIWLT 420
           VS      FD+ V              PQ+TS P P + +       FE S   D++WLT
Sbjct: 361 VS------FDDVV-------------EPQNTSTPPPTNSD-----GSFESS---DDVWLT 420

Query: 421 VSEIPLFTQPTVAPPPSRPPPPIPQQVPKEGTSPYGSRSSKMNANDFSSVPNSTHHYQIP 480
           VSEIPLFTQPT APPP+RPPPP P            +R  K   N+  S+P S +H  +P
Sbjct: 421 VSEIPLFTQPTSAPPPTRPPPPRP------------TRPIKKKVNE-PSIPTSAYHSHVP 480

Query: 481 KSASASASASTRDQVSSVDELEEFAMGRNQSNAEDLVNGLSNEEAEMNS----AAAAMKE 540
            S  AS ++ T  Q+   DEL++F++GRNQ+ A    +  S E++++ S    +AAAMK+
Sbjct: 481 SSGRASVNSPTASQM---DELDDFSIGRNQTAANGYPDPSSGEDSDVFSTAAASAAAMKD 540

Query: 541 AMDRAEAKFKHAKEVRERESTRTSKSKESVY---WDRDEKAMRNNRVEDEESMDHERFQR 600
           AMD+AEAKF+HAKE RE+ES + S+S+E  +   +D  E+ +R  +V         R  R
Sbjct: 541 AMDKAEAKFRHAKERREKESLKASRSREGDHTENYDSRERELREKQV---------RLDR 600

Query: 601 EREREEKEKRRAEREKERARELEREREEKEKEQRRLEKERERAREIEMERIKARQAVERA 660
           E         RAERE E  +   REREE+E+EQ+R+E+ER        ER+ ARQAVERA
Sbjct: 601 E---------RAEREAEMEKTQAREREEREREQKRIERER--------ERLLARQAVERA 660

Query: 661 TREARERAATEARLKAERAAVEKVNAEARGRAERVAVQRVQAEARERAAGEARERAERAA 720
           TREARERAATEA  K +RAAV KV  +AR RAER AVQR  AEARERAA  ARE+AE+AA
Sbjct: 661 TREARERAATEAHAKVQRAAVGKV-TDARERAERAAVQRAHAEARERAAAGAREKAEKAA 720

Query: 721 AEARERVEKAAAEAKEKEARERASVARAEAEARSRAERVAVERVAAEARERAAADARERA 780
           AEARER   A AE +EK             EA+ RAER AVER AAEAR RAAA A+ + 
Sbjct: 721 AEARER---ANAEVREK-------------EAKVRAERAAVERAAAEARGRAAAQAKAK- 768

Query: 781 AAAARASQQKNENDLESFF-SMGRASSAPRPRANPMDNFFDSQS------PSRPEPQTTK 840
                  QQ+N NDL+SFF S+ R SS PR R NP D F DS +       SRP   +++
Sbjct: 781 ------QQQENNNDLDSFFNSVSRPSSVPRQRTNPPDPFQDSWNKGGSFESSRP---SSR 768

Query: 841 QPSTTSSSMRKASSTTNIVDDLSSIFGGP-PPSGEFQEVDGESEERRRARLERHQRVQTR 850
            PS  + ++RKASS TNIVDDLSSIFG P   SG FQ+VDGE+EERRRARLERHQR Q R
Sbjct: 841 VPSGPTENLRKASSATNIVDDLSSIFGAPASQSGGFQDVDGETEERRRARLERHQRTQER 768

BLAST of Cp4.1LG01g17370 vs. TAIR10
Match: AT4G12780.1 (AT4G12780.1 Chaperone DnaJ-domain superfamily protein)

HSP 1 Score: 375.6 bits (963), Expect = 9.6e-104
Identity = 377/902 (41.80%), Positives = 493/902 (54.66%), Query Frame = 1

Query: 1   MNDFEGLLATNYGFKPQGKAAPMAASKGTSNINNPTTSPNFDLGSRASFRSTKNSNSL-- 60
           M+DF GLLA ++G KPQGK+APMA+   +S  +  T + ++   + A     K S+SL  
Sbjct: 1   MDDFTGLLARDFGLKPQGKSAPMASQSNSSAADFNTFASSYSFATAAG----KKSDSLPV 60

Query: 61  -SGSLADDHDSLNRSVSAHENREFGGLDDLLGESSRFSRKSESRAGESDVNFDSLFHGAG 120
                 D  D L + V +      G      G SS  SR   + A     ++D++F    
Sbjct: 61  FDDPGRDGDDLLFKDVFS------GPPPPKYGSSSGDSRSPSAPA----FDYDAMFKEPK 120

Query: 121 NSGQPVASNLPVYDKPVYDD-DIFDGIPGLK----NSSKVQYDDVFSSMSSPP-----KA 180
           +     AS++PVYDKPVYDD D+F+ IP LK    +S   ++++VFSS+SS P     + 
Sbjct: 121 SKS---ASSMPVYDKPVYDDEDVFESIPELKIPSTSSQSARFENVFSSISSSPTKHRKQN 180

Query: 181 DSAFDDLLGG-FSKSGGVSKSKDKEIPAFDDLIPGFRGSSSPGDSKT------------- 240
            S FDDL+G    K G  S  ++K    FDDLIPGF  +SSP   +T             
Sbjct: 181 SSPFDDLMGNNLGKKGADSDREEKGSSIFDDLIPGFGRTSSPPSKRTTSETTNQSEKAPY 240

Query: 241 -----SSMPMENPFGVSRERDDPHHEEA-----SDVGNFKSPKFDGYPSSGANNKGFDDM 300
                SS   E+PF V  E +    E +      D+G F S K D    S  +   F D+
Sbjct: 241 RTAETSSNVEEDPFVVLEESESTPREPSRTDPLDDIGKFNSRKTD---HSSVHGGVFVDI 300

Query: 301 DPFASLGKSVPAFSSEGNNRGKARSPPKVDGSSAGPQNSKSKDTME---KLSGKTSGQPL 360
           DP  +LGK  P                          NSK K  +     +SG  S  P+
Sbjct: 301 DPLDNLGKPGP------------------------DMNSKGKSHLRPPGNISGSQS-PPV 360

Query: 361 KKEVSAKNDRQFDEPVFDIPAVPTSSHKFVPQSTSPPAPKDENVMGDTSRFEDSVESDEI 420
           +   S  + +   E   +            P + S P P + N       FE S   D++
Sbjct: 361 ESPGSYHSKKVSFEDFLE------------PHNMSTPPPTNSN-----GSFESS---DDV 420

Query: 421 WLTVSEIPLFTQPTVAPPPSRPPPPIPQQVPKEGTSPYGSRSSKMNANDFSSVPNSTHHY 480
           WLTVSEIPLFTQPT APPP+RPPPP P            +R  K   N+  S+P S +H 
Sbjct: 421 WLTVSEIPLFTQPTSAPPPTRPPPPRP------------TRPIKKKVNE-PSIPTSAYHS 480

Query: 481 QIPKSASASASASTRDQVSSVDELEEFAMGRNQSNAEDLVNGLSNEEAEMNS----AAAA 540
            +P S  AS ++ T  Q+   DEL++F++GRNQ+ A    +  S E++++ S    +AAA
Sbjct: 481 HVPSSGRASVNSPTASQM---DELDDFSIGRNQTAANGYPDPSSGEDSDVFSTAAASAAA 540

Query: 541 MKEAMDRAEAKFKHAKEVRERESTRTSKSKESVYWDRDEKAMRNNRVEDEESMDHERFQR 600
           MK+AMD+AEAKF+HAKE RE+E+ + S+S+E             +  E+ +S       R
Sbjct: 541 MKDAMDKAEAKFRHAKERREKENLKASRSREG------------DHTENYDS-------R 600

Query: 601 EREREEKEKR----RAEREKERARELEREREEKEKEQRRLEKERERAREIEMERIKARQA 660
           ERE  EK+ R    RAERE E  +  ERE+EE+E+EQ+R+E+ERER        + ARQA
Sbjct: 601 ERELREKQVRLDRERAEREAEMEKAQEREKEEREREQKRIERERER--------LVARQA 660

Query: 661 VERATREARERAATEARLKAERAAVEKVNAEARGRAERVAVQRVQAEARERAAGEARERA 720
           VERATREARERAATEA  K +RAAV K   +AR RAER AVQR  AEARERAA  AR++A
Sbjct: 661 VERATREARERAATEAHAKVQRAAVGKAT-DARERAERAAVQRAHAEARERAAAGARDKA 720

Query: 721 ERAAAEARERVEKAAAEAKEKEARERASVARAEAEARSRAERVAVERVAAEARERAAADA 780
            +AAAEARE+ EKAAAEAKE     RA+    E E R RAER AVER AAEAR RAAA A
Sbjct: 721 AKAAAEAREKAEKAAAEAKE-----RANAEAREKETRVRAERAAVERAAAEARGRAAAQA 780

Query: 781 RERAAAAARASQQKNENDLESFFS-MGRASSAPRPRANPMDNFFDSQSPS---RPEPQTT 840
           + +        QQ+N NDL+SFFS + R +SAPR R NP+D F DS +         ++ 
Sbjct: 781 KAK-------QQQENTNDLDSFFSSISRPNSAPRQRTNPLDPFQDSWNKGGSFESSRESL 781

Query: 841 KQPSTTSSSMRKASSTTNIVDDLSSIFG-GPPPSGEFQEVDGESEERRRARLERHQRVQT 850
           + P     ++RK SS TNIVDDLSSIFG     SG FQ+VDGE+EERRRARLERHQR Q 
Sbjct: 841 RVPPGQPENLRKTSSVTNIVDDLSSIFGASASQSGGFQDVDGETEERRRARLERHQRTQE 781

BLAST of Cp4.1LG01g17370 vs. TAIR10
Match: AT1G21660.1 (AT1G21660.1 Chaperone DnaJ-domain superfamily protein)

HSP 1 Score: 101.3 bits (251), Expect = 3.5e-21
Identity = 44/59 (74.58%), Positives = 54/59 (91.53%), Query Frame = 1

Query: 850 VLWPECGWQPVSLTEMVIPNSVKKVYRKATLCIHPDKVQQKGATLQQKYVAEKVFDILK 909
           VLWP CGW+ VS+T+++  ++VKKVYRKATL +HPDKVQQKGATL+QKY+AEKVFDILK
Sbjct: 453 VLWPGCGWEAVSITDLITSSAVKKVYRKATLYVHPDKVQQKGATLEQKYIAEKVFDILK 511

BLAST of Cp4.1LG01g17370 vs. TAIR10
Match: AT4G36520.1 (AT4G36520.1 Chaperone DnaJ-domain superfamily protein)

HSP 1 Score: 91.3 bits (225), Expect = 3.6e-18
Identity = 109/308 (35.39%), Positives = 153/308 (49.68%), Query Frame = 1

Query: 631  KAERAAVEKVNAEARGRAERVAVQRVQAEARERAAGEARERA-----------ERAAAEA 690
            K ER    +V+ +    AER+  +R     + R   E RER            +RA A+A
Sbjct: 1118 KVERPLPSRVSVQREKEAERLKRERDLEMEQLRKVEEEREREREREKDRMAFDQRALADA 1177

Query: 691  RERVEKAAAEAKEKEARERASVARAEAEARSRAERVAVERVAAEARERAAADARERAAAA 750
            RER+EKA AEA+EK   ++ S+     EAR RAER AVER  +EAR+RAA    E+AA  
Sbjct: 1178 RERLEKACAEAREKSLPDKLSM-----EARLRAERAAVERATSEARDRAA----EKAAFE 1237

Query: 751  ARASQQKNENDLES-----FFSMGRASSAPRPRANPM----DNFFDSQSPSRPEPQ---- 810
            AR   +++ +D +S     F      S + +   N +      + DS       PQ    
Sbjct: 1238 ARERMERSVSDKQSQSSGFFGERMEISLSDKQFQNSVSFGASRYQDSHGTEGESPQRYTS 1297

Query: 811  TTKQPSTTSSSMRKASSTTNIVD------DLSSIFGGPPPSGEFQEVDGESEERRRARLE 870
              ++   T+  + KA +  N+ D          I        E +      E   RA L 
Sbjct: 1298 RLERHQRTADRVAKALAEKNMRDLVAQREQAERIRIAETLDTEVKRWSSGKEGNIRALLS 1357

Query: 871  RHQRVQTRAVLWPECGWQPVSLTEMVIPNSVKKVYRKATLCIHPDKVQQKGATLQQKYVA 909
              Q      +L PE GWQP+ LTE++   +VK+ YRKATLC+HPDK+QQ+GA + QKY+ 
Sbjct: 1358 TLQ-----YILGPESGWQPLPLTEVITSAAVKRAYRKATLCVHPDKLQQRGANIHQKYIC 1411

BLAST of Cp4.1LG01g17370 vs. TAIR10
Match: AT1G75100.1 (AT1G75100.1 J-domain protein required for chloroplast accumulation response 1)

HSP 1 Score: 68.9 bits (167), Expect = 1.9e-11
Identity = 47/145 (32.41%), Positives = 80/145 (55.17%), Query Frame = 1

Query: 776 DSQSPSRPEPQTTKQPSTTSSSMRKASSTTNIVDDLSSIFGGPPPSGEFQEVDGESEERR 835
           D+    R EP TT    TTS  + +       V+D++          + +E + ++EE +
Sbjct: 504 DTVQEERQEPSTTH---TTSEDIDEPFHVNFDVEDITQ------DENKMEEANKDAEEIK 563

Query: 836 R--ARLERHQRVQT----------RAVLWPECGWQPVSLTEMVIPNSVKKVYRKATLCIH 895
              A++ +    ++          + +LW   GW+PV L +M+  N+V+K Y++A L +H
Sbjct: 564 NIDAKIRKWSSGKSGNIRSLLSTLQYILWSGSGWKPVPLMDMIEGNAVRKSYQRALLILH 623

Query: 896 PDKVQQKGATLQQKYVAEKVFDILK 909
           PDK+QQKGA+  QKY+AEKVF++L+
Sbjct: 624 PDKLQQKGASANQKYMAEKVFELLQ 639

BLAST of Cp4.1LG01g17370 vs. NCBI nr
Match: gi|778694906|ref|XP_011653895.1| (PREDICTED: auxilin-related protein 2 [Cucumis sativus])

HSP 1 Score: 1320.4 bits (3416), Expect = 0.0e+00
Identity = 786/984 (79.88%), Positives = 828/984 (84.15%), Query Frame = 1

Query: 1   MNDFEGLLATNYGFKPQGKAAPMAASKGTSNINNPTTSPNFDLGSRASFRSTKNSNSLSG 60
           MNDFEGLLATNYGFKPQGKAAPMAASKGTSNIN PT+SPNFDLGSR SFRS+K SNSLSG
Sbjct: 1   MNDFEGLLATNYGFKPQGKAAPMAASKGTSNIN-PTSSPNFDLGSRPSFRSSKTSNSLSG 60

Query: 61  SLADDHDSLNRSVSAHENREFGGLDDLLGESSRFSRKSESRAGESDVNFDSLFHGAGNSG 120
           SLADD DSLNRS+SAH+NREF GLDDLLG S RFSRKSE+RAG+SDVNFDSLF+G GNS 
Sbjct: 61  SLADDRDSLNRSMSAHDNREFDGLDDLLGGSGRFSRKSEARAGDSDVNFDSLFNGVGNSS 120

Query: 121 QPVASNLPVYDKPVYDDDIFDGIPGLKNSSKVQYDDVFSSMSSPPKADSAFDDLLGGFSK 180
           QP ASNLPVYDKPVYDDDIFDGIPGL+NSSKVQYDDVFSSMSSPPKA+SAFDDLLGGF K
Sbjct: 121 QPPASNLPVYDKPVYDDDIFDGIPGLRNSSKVQYDDVFSSMSSPPKAESAFDDLLGGFGK 180

Query: 181 SGGVSKSK--------DKEIPAFDDLIPGFRGSSSPGD--------------SKTSSMPM 240
           S  V KSK        D+EIPAFDDLIPGFRG S PGD              S TSS  M
Sbjct: 181 SDSVPKSKGGKGTQSKDREIPAFDDLIPGFRGGSPPGDRSNSSWSSEPTSVKSTTSSKAM 240

Query: 241 ENPFGVSRERDDPHHEEASDVGNFKSPKFDGYPSSGANNKGFDDMDPFASLGKSVPAFSS 300
           ENPFGVSRE +D H EEASD+GNFKSPKFDGYPSS ANNK FDDMDPFASLGKSVPAFSS
Sbjct: 241 ENPFGVSREHNDLH-EEASDIGNFKSPKFDGYPSSDANNKAFDDMDPFASLGKSVPAFSS 300

Query: 301 EGNNRGKARSPPKVDGSSAGPQNSKSKDTMEKLSGKTSGQPLKKEVSAKNDRQFDEPVFD 360
           EGNNR KARSPP+VDG++AGPQNS SKD MEK S KTS QPLKK+V AKNDR FD+PVFD
Sbjct: 301 EGNNRAKARSPPRVDGTAAGPQNSNSKDAMEKPSTKTSVQPLKKDVPAKNDRHFDQPVFD 360

Query: 361 IPAVPTSSHKFVPQSTSPPAPKDENVMGDTSRFEDSVESDEIWLTVSEIPLFTQPTVAPP 420
           IP V T+SHKFVPQSTSPPA  D NVMG+TSRFEDSVE DEIWLTVSEIPLFTQPTVAPP
Sbjct: 361 IPTVSTNSHKFVPQSTSPPASDDANVMGETSRFEDSVEPDEIWLTVSEIPLFTQPTVAPP 420

Query: 421 PSRPPPPIPQQVPKEGTSPYGSRSSKMNANDFSSVPNSTHHYQIPKSASASASASTRDQV 480
           PSRPPPPIPQQVPKEG  PYG RSSKMNANDFSS P+STHH+QIPKS S S     RDQV
Sbjct: 421 PSRPPPPIPQQVPKEGMGPYGLRSSKMNANDFSSFPSSTHHFQIPKSTSPSM----RDQV 480

Query: 481 SSVDELEEFAMGRNQSNAEDLVNGLSNEEAEMNSAAAAMKEAMDRAEAKFKHAKEVRERE 540
           SSVDELE+FAMGRN SNA++ VN LSNEEAEMNSAAAAMKEAMDRAEAKFKHAKEVRERE
Sbjct: 481 SSVDELEQFAMGRNPSNADEQVNSLSNEEAEMNSAAAAMKEAMDRAEAKFKHAKEVRERE 540

Query: 541 STRTSKSKESVYWDRDEKAMRNNRVEDEESMDHERFQREREREEKEK--RRAEREKERAR 600
           STRTSK KE+VYWDRDEKA R++RVEDEE++D ERFQREREREEKEK  R+AER+KERAR
Sbjct: 541 STRTSKIKEAVYWDRDEKATRSDRVEDEEAIDRERFQREREREEKEKEKRKAERDKERAR 600

Query: 601 ELEREREEKEKEQRRLEKERERAREIEMERIKARQAVERATREARERAATEARLKAERAA 660
           ELEREREEKEKE RRLEKERERARE+EMERIK RQAVERATREARERAA EARLKAERAA
Sbjct: 601 ELEREREEKEKELRRLEKERERARELEMERIKVRQAVERATREARERAAIEARLKAERAA 660

Query: 661 VEKVNAEARGRAERVAVQRVQAEARERAAGEARERAERAAAEARERVEKAAAEAKEKEAR 720
           VEKVNAEAR RAER AVQR Q+EARERAA EARERAERAA EARER EKAAAEAKE+EAR
Sbjct: 661 VEKVNAEARERAERAAVQRAQSEARERAAAEARERAERAATEARERAEKAAAEAKEREAR 720

Query: 721 ERASVARAEAEARSRAERVAVERVAAEARERAAADARERAAAAARASQQKNENDLESFFS 780
           ERASVARAE+EARSRAER AVER AAEARERAA DARERAAAAARASQQKNENDLESFFS
Sbjct: 721 ERASVARAESEARSRAERAAVERAAAEARERAAVDARERAAAAARASQQKNENDLESFFS 780

Query: 781 MGRASSAPRPRANPMDNFFDSQSPSRPEPQTTKQPSTTSSSMRKASSTTNIVDDLSSIFG 840
           MGR SS P+ RANPMDNF D+QSP+RPE  TTK   T  ++MRKASS TNIVDDLSSIFG
Sbjct: 781 MGRPSSVPKHRANPMDNF-DAQSPNRPE--TTKPSPTPPTNMRKASSATNIVDDLSSIFG 840

Query: 841 GPPPSGEFQEVDGESEERRRARLERHQRVQTRA--------------------------- 900
           GPP SGEFQEVDGE+EERRRARLERHQRVQTRA                           
Sbjct: 841 GPPSSGEFQEVDGETEERRRARLERHQRVQTRAAKALAEKNERDLQMQREQAERHRIAET 900

Query: 901 -------------------------VLWPECGWQPVSLTEMVIPNSVKKVYRKATLCIHP 909
                                    VLWPECGWQPVSLTEMVIPN+VKKVYRKATLCIHP
Sbjct: 901 LDAEIKRWAAGKEGNLRALLSTLQYVLWPECGWQPVSLTEMVIPNAVKKVYRKATLCIHP 960

BLAST of Cp4.1LG01g17370 vs. NCBI nr
Match: gi|659083018|ref|XP_008442142.1| (PREDICTED: auxilin-related protein 2-like [Cucumis melo])

HSP 1 Score: 1319.3 bits (3413), Expect = 0.0e+00
Identity = 787/984 (79.98%), Positives = 821/984 (83.43%), Query Frame = 1

Query: 1   MNDFEGLLATNYGFKPQGKAAPMAASKGTSNINNPTTSPNFDLGSRASFRSTKNSNSLSG 60
           MNDFEGLLATNYGFKPQGKAAPMAASKG SNIN PT+SPNFDLGSR SFRSTK SNSLSG
Sbjct: 1   MNDFEGLLATNYGFKPQGKAAPMAASKGNSNIN-PTSSPNFDLGSRPSFRSTKTSNSLSG 60

Query: 61  SLADDHDSLNRSVSAHENREFGGLDDLLGESSRFSRKSESRAGESDVNFDSLFHGAGNSG 120
           SLADD DSLNRS+SAH+NREF GLDDLLG S RFSRK E+RAG+SDVNFDSLF+G GNS 
Sbjct: 61  SLADDRDSLNRSMSAHDNREFDGLDDLLGGSGRFSRKPEARAGDSDVNFDSLFNGVGNSS 120

Query: 121 QPVASNLPVYDKPVYDDDIFDGIPGLKNSSKVQYDDVFSSMSSPPKADSAFDDLLGGFSK 180
           QP ASNLPVYDKPVYDDDIFDGIPGL+NSSKVQYDDVFSSMSSPPKA+SAFDDLLGGF K
Sbjct: 121 QPPASNLPVYDKPVYDDDIFDGIPGLRNSSKVQYDDVFSSMSSPPKAESAFDDLLGGFGK 180

Query: 181 SGGVSKSK--------DKEIPAFDDLIPGFRGSSSPGD----------------SKTSSM 240
           S  V KSK        D+EIPAFDDLIPGFRG S PGD                S TSS 
Sbjct: 181 SDSVPKSKGGKGTQSKDQEIPAFDDLIPGFRGGSPPGDRSNSRASWSSEPTSVKSTTSSK 240

Query: 241 PMENPFGVSRERDDPHHEEASDVGNFKSPKFDGYPSSGANNKGFDDMDPFASLGKSVPAF 300
            MENPFGVSRE +D H EEASD+GNFKSPKFDGY SS ANNK FDDMDPFASL KSVPAF
Sbjct: 241 AMENPFGVSREHNDIH-EEASDIGNFKSPKFDGYSSSDANNKAFDDMDPFASLSKSVPAF 300

Query: 301 SSEGNNRGKARSPPKVDGSSAGPQNSKSKDTMEKLSGKTSGQPLKKEVSAKNDRQFDEPV 360
           SSEGNNRGKARSPP+VDG++AGPQN   KD MEK S K S QPLKK+V AKNDR FD+PV
Sbjct: 301 SSEGNNRGKARSPPRVDGTAAGPQNPNGKDAMEKPSIKNSVQPLKKDVPAKNDRHFDQPV 360

Query: 361 FDIPAVPTSSHKFVPQSTSPPAPKDENVMGDTSRFEDSVESDEIWLTVSEIPLFTQPTVA 420
           FDIP V T+SHKF PQSTSPPA  D NVMGDTSRFEDSVESDEIWLTVSEIPLFTQPTVA
Sbjct: 361 FDIPTVSTNSHKFGPQSTSPPASNDANVMGDTSRFEDSVESDEIWLTVSEIPLFTQPTVA 420

Query: 421 PPPSRPPPPIPQQVPKEGTSPYGSRSSKMNANDFSSVPNSTHHYQIPKSASASASASTRD 480
           PPPSRPPPPIPQQVPKEG   YG RSSKMNANDFSS P+STHH+QIPKS S S     RD
Sbjct: 421 PPPSRPPPPIPQQVPKEGMGSYGLRSSKMNANDFSSFPSSTHHFQIPKSTSPSM----RD 480

Query: 481 QVSSVDELEEFAMGRNQSNAEDLVNGLSNEEAEMNSAAAAMKEAMDRAEAKFKHAKEVRE 540
           QVSSVDELE+FAMGRN SNA++ VN LSNEEAEMNSAAAAMKEAMDRAEAKFKHAKEVRE
Sbjct: 481 QVSSVDELEQFAMGRNPSNADEQVNSLSNEEAEMNSAAAAMKEAMDRAEAKFKHAKEVRE 540

Query: 541 RESTRTSKSKESVYWDRDEKAMRNNRVEDEESMDHERFQREREREEKEKRRAEREKERAR 600
           RESTRTSK KE+VYWDRDEKA R+ RVEDEES+D ERFQRERE +EKEKR+AEREKER R
Sbjct: 541 RESTRTSKIKEAVYWDRDEKATRSERVEDEESIDRERFQREREEKEKEKRKAEREKERVR 600

Query: 601 ELEREREEKEKEQRRLEKERERAREIEMERIKARQAVERATREARERAATEARLKAERAA 660
           ELEREREEKEKEQRRLEKERERARE+EMERIK RQAVERATREARERAATEARLKAERAA
Sbjct: 601 ELEREREEKEKEQRRLEKERERARELEMERIKVRQAVERATREARERAATEARLKAERAA 660

Query: 661 VEKVNAEARGRAERVAVQRVQAEARERAAGEARERAERAAAEARERVEKAAAEAKEKEAR 720
           VEKVNAEAR RAER AVQR QAEARERAA EARERAERAA EARER EKAAAEAKE+EAR
Sbjct: 661 VEKVNAEARERAERAAVQRAQAEARERAAAEARERAERAATEARERAEKAAAEAKEREAR 720

Query: 721 ERASVARAEAEARSRAERVAVERVAAEARERAAADARERAAAAARASQQKNENDLESFFS 780
           ERASVARAEAEARSRAER AVER AAEARERAA DARERAAAAARASQQKNENDLESFFS
Sbjct: 721 ERASVARAEAEARSRAERAAVERAAAEARERAAVDARERAAAAARASQQKNENDLESFFS 780

Query: 781 MGRASSAPRPRANPMDNFFDSQSPSRPEPQTTKQPSTTSSSMRKASSTTNIVDDLSSIFG 840
           MGR SS P+ RANPMDNF D+QSP+RPE  TTK  ST  S+MRKASS TNIVDDLSSIFG
Sbjct: 781 MGRPSSVPKHRANPMDNF-DAQSPNRPE--TTKPSSTPPSNMRKASSATNIVDDLSSIFG 840

Query: 841 GPPPSGEFQEVDGESEERRRARLERHQRVQTRA--------------------------- 900
           GPP SGEFQEVDGESEERRRARLERHQRVQTRA                           
Sbjct: 841 GPPSSGEFQEVDGESEERRRARLERHQRVQTRAAKALAEKNERDLQMQREQAERHRIAET 900

Query: 901 -------------------------VLWPECGWQPVSLTEMVIPNSVKKVYRKATLCIHP 909
                                    VLWPECGWQPVSLTEMVIPN+VKKVYRKATLCIHP
Sbjct: 901 LDAEIKRWAAGKEGNLRALLSTLQYVLWPECGWQPVSLTEMVIPNAVKKVYRKATLCIHP 960

BLAST of Cp4.1LG01g17370 vs. NCBI nr
Match: gi|1009146358|ref|XP_015890839.1| (PREDICTED: auxilin-related protein 2-like [Ziziphus jujuba])

HSP 1 Score: 790.4 bits (2040), Expect = 3.6e-225
Identity = 557/1004 (55.48%), Positives = 675/1004 (67.23%), Query Frame = 1

Query: 1   MNDFEGLLATNYGFKPQGKAAPMAASKGTSNINNPTT-SPNFDLGSRASFRSTKNSNSLS 60
           MNDFEGLLAT+YGFKP GK+APM+AS   S+  + T  +PNFDLGSR   RS ++SNS  
Sbjct: 1   MNDFEGLLATDYGFKPSGKSAPMSASAAASSKGSATNHNPNFDLGSRGPPRSARSSNSFG 60

Query: 61  GSLADDHDSLNRSVSAHENREFGGLDDLLGESSRFSRKSESRAGESDVNFDSLF---HGA 120
           GS +DD DS+  S  + +  +F  L D+ G S+R++ KSE+R  +S  +FDS+F    G 
Sbjct: 61  GSPSDDRDSVFGSSGSQKPHDFSDLGDIFGGSARYASKSETRGADSAFDFDSMFGDTSGG 120

Query: 121 GNSGQPVASNL-PVYDKPVYDDDIFDGIPGLKNSSKVQYDDVFSSM---SSPPKADSAFD 180
           G   +  +SN  PVY KPVYDDDIFDG+PGLK+SSKV+Y+DVF+S    SSP  + SAFD
Sbjct: 121 GGGAKSSSSNSEPVYHKPVYDDDIFDGVPGLKSSSKVKYEDVFASSGASSSPKGSSSAFD 180

Query: 181 DLLGGFSKSGGVSKSK--------DKEIPAFDDLIPGFRGSSSPGD-------------- 240
           DLLGGF K+   SKS         DK +P+FDDLIPGF GSS P +              
Sbjct: 181 DLLGGFGKAEPRSKSFGSKGQEKIDKGVPSFDDLIPGFGGSSPPTERSASETWPPETTAS 240

Query: 241 -SKTSSMPMENPFGVSRERDDPHHEEASDVGNFKSPKFDGYPSSGANNKGFDDMDPFASL 300
            SKT+S  ME+PF V  + DDP  EE S + N  S K D       N + FDD+DPF  L
Sbjct: 241 VSKTTSKVMEDPFVVPGKFDDPL-EEISKLSNSGSTKSD-------NGRVFDDVDPFDGL 300

Query: 301 GKSVPAFSSEGNNRGKARSPPKVDGSSAGPQNSKSKDTMEKLSGKTSGQPLKKEVSAKND 360
           GKSVPAFSS+  ++ K RS  + D   +G +    +++ EK S ++     +K+    ND
Sbjct: 301 GKSVPAFSSD-RSKEKDRSASRSDSIGSGTRAYSGRESTEKPSVRSPDSQYQKKTPVDND 360

Query: 361 RQFDEPVFDIPAVPTSSHKFVPQSTSPPAPKDENVMGDTS------RFEDSVESDE-IWL 420
               + +FD+P   T SHK   Q  SPP+  + +V  +T+      RFE+S +S E IWL
Sbjct: 361 WDSQKTMFDMPTASTYSHKSSGQPMSPPSYANSSVKDETTQMDGSLRFEESSDSPEDIWL 420

Query: 421 TVSEIPLFTQPTVAPPPSRPPPPIPQQVPKEGTSPYGSRSSKMNANDFSSVPNSTHHYQI 480
           TVSEIPL+T PT APPPSRPPPP P QV K  T    S +++  AN++SS PNST ++Q+
Sbjct: 421 TVSEIPLYTLPTSAPPPSRPPPPRPVQVSKARTRSPASTNARRTANEYSSYPNSTQNFQV 480

Query: 481 PKSASASASASTRDQVSSVDELEEFAMGRNQSNAEDLVNGLSNEEAEMNSAAAAMKEAMD 540
           PKSA A+  ++T    S +DELE+FA  R++SNA+D  NGL  EE EMNS AAAMKEAMD
Sbjct: 481 PKSAPAAKGSAT----SPIDELEDFATSRSRSNADDHANGLFGEELEMNSVAAAMKEAMD 540

Query: 541 RAEAKFKHAKEVRERESTRTSKSKESVYWDRDEKAMRNNRVEDEESMDHERFQREREREE 600
           +AEAKF+HAKEVRERES++ ++S+E    ++DEKA+        E ++ E+ QRERE EE
Sbjct: 541 KAEAKFRHAKEVRERESSKAARSREPQP-EKDEKAVH-------ERLEREQLQREREEEE 600

Query: 601 KEKRRAEREKERARELEREREEKEKEQRRLEKERERAREIEMERIKARQAVERATREARE 660
            E+       +R RE+EREREEKE+EQRRLEKER+RAREIE ER KARQAVERATREARE
Sbjct: 601 MEQ-------QRLREIEREREEKEREQRRLEKERDRAREIEREREKARQAVERATREARE 660

Query: 661 RAATEARL-----KAERAAVEKVNAEARGRAERVAVQRVQAEARERAAGEARERAERAAA 720
           RAA EAR      KAERAAV K NAEAR RAER AVQR QAEARERAA EA+ERAE+AAA
Sbjct: 661 RAAVEARARAERAKAERAAVGKANAEARERAERAAVQRAQAEARERAAAEAKERAEKAAA 720

Query: 721 EARERVEKAAAEAKEKEARERASVARAEAEARSRAERVAVERVAAEARERAAADARERAA 780
           EARER   + +EAKE+EAR+RA      AEAR+RAER AVER AAEARERAAA+ARERAA
Sbjct: 721 EARER---SNSEAKEREARDRA------AEARARAERAAVERAAAEARERAAAEARERAA 780

Query: 781 AAA-RASQQKNENDLESFFSMGRASSAPRPRANPMDNFFDSQSPSRPEPQTTKQPSTTSS 840
           AAA RA+Q KNENDLE+FFSMGRASSAPRPR+N  D F D+Q  SR EP+  K    TSS
Sbjct: 781 AAAARANQPKNENDLEAFFSMGRASSAPRPRSNSSDPF-DTQFQSRKEPEVAKTYGGTSS 840

Query: 841 SMRKASSTTNIVDDLSSIFGGPPPSGEFQEVDGESEERRRARLERHQRVQTRA------- 900
           +MRKASSTTNIVDDLSSIFG  P SGEFQ+V+GE+EERRRARLERHQR Q RA       
Sbjct: 841 NMRKASSTTNIVDDLSSIFGAAPSSGEFQDVEGETEERRRARLERHQRTQERAAKALAEK 900

Query: 901 ---------------------------------------------VLWPECGWQPVSLTE 909
                                                        VLWPECGW PVSLT+
Sbjct: 901 NERDLQIQRDQAERHRIAETLDVEIKRWAAGKEGNLRALLSTLQYVLWPECGWSPVSLTD 960

BLAST of Cp4.1LG01g17370 vs. NCBI nr
Match: gi|694374646|ref|XP_009364182.1| (PREDICTED: auxilin-related protein 2-like [Pyrus x bretschneideri])

HSP 1 Score: 751.5 bits (1939), Expect = 1.8e-213
Identity = 543/1003 (54.14%), Positives = 667/1003 (66.50%), Query Frame = 1

Query: 1   MNDFEGLLATNYGFKPQGKAAPMAASKGTSNINNPTTSPNFDLGSRASFRSTKNSNSLSG 60
           MNDFEGLLA++YGFK  GK+APM+AS      N+ + S NFDL S  + +S + +NS SG
Sbjct: 1   MNDFEGLLASDYGFKSSGKSAPMSASSA----NSSSKSSNFDLRSTGTSQSARATNSFSG 60

Query: 61  SLADDHDSLNRSVSAHENREFGGLDDLLGESSRFSRKSESRAGESDVNFDSLFHGAGNSG 120
           S+ DD DS+       + ++FG   D+ G S+R+  K ESR  ++  NFDS+F G  +S 
Sbjct: 61  SVVDDRDSI---FGPSKGQDFG---DIFGGSARYPGKLESRGDDAAFNFDSMFGGPADSV 120

Query: 121 QPVASNLPVYDKPVYDDDIFDGIPGLKNSS-KVQYDDVFSSMSSPPKADSA----FDDLL 180
              ++  PVYDKPVYDDDIFDG+PGLK+++ KV+Y+DVFSS++SPP   S+    FDDLL
Sbjct: 121 PKSSNPGPVYDKPVYDDDIFDGVPGLKSTAAKVKYEDVFSSVTSPPSKGSSRSSGFDDLL 180

Query: 181 GGFSKSGGVSKSK--------DKEIPAFDDLIPGFRGSSSPGD----------------- 240
           GGF K+   SKS         +K +P FDDL+PGF G  SP +                 
Sbjct: 181 GGFGKAEPQSKSSGSRGSVKTEKGVPGFDDLLPGFGGGISPRERSTSEANWPPETAANNV 240

Query: 241 SKTSSMPMENPFGVSRERDDPHHEEASDVGNFKSPKFDGYPSSGANNKGFDDMDPFASLG 300
           SKT+S  ME+PF VS +  DP  EE S +    S K D  PS   N + FDD+DPF  LG
Sbjct: 241 SKTNSKVMEDPFVVSGQYGDPL-EEISRLSKSGSSKVDS-PSVN-NGRAFDDIDPFDGLG 300

Query: 301 KSVPAFSSEGNNRGKARSPPKVDGSSAGPQNSKSKDTMEKLSGKTSGQPLKKEVSAKNDR 360
           +SVPAFSS  NNRG      K D S    + S  K++ EK S ++     +K+V  ++  
Sbjct: 301 QSVPAFSSGRNNRGNDSGNLKADTSVNNSRASTGKESTEKPSVRSPDNLSQKKVPIEDHW 360

Query: 361 QFDEPVFDIPAVPTSSHKFVPQSTSPPA-----PKDENVMGDTS-RFEDSVES-DEIWLT 420
              + +FD+P+V + S K   Q+ SPP+     PK+ NV  D S R E++++S D IWLT
Sbjct: 361 DSHQTLFDMPSVSSDSQKSSGQTMSPPSYVNPSPKEVNVQVDRSPRSEENMDSSDNIWLT 420

Query: 421 VSEIPLFTQPTVAPPPSRPPPPIPQQVPKEGTSPYGSRSSKMNANDFSSVPNSTHHYQIP 480
           VSEIPL TQPT APPPSRPPPP P Q+ K       S +++  A++     +ST  +Q P
Sbjct: 421 VSEIPLMTQPTSAPPPSRPPPPRPVQLSKARMGSPASTNARRKASE-----SSTQFFQAP 480

Query: 481 KSASASASASTRDQVSSVDELEEFAMGRNQSNAEDLVNGLSNEEAEMNSAAAAMKEAMDR 540
           KSA A+A  S+    S++DELE+FA G+NQ+N  +  NGL+ EE EMNS AAAMKEAMDR
Sbjct: 481 KSAPAAARGSS---ASTIDELEDFARGKNQNNFSEHANGLAGEELEMNSVAAAMKEAMDR 540

Query: 541 AEAKFKHAKEVRERESTRTSKSKESVYWDRDEKAMRNNRVEDEESMDHERFQREREREEK 600
           AEAKF+HAKEVRER STR ++SKE+   ++DEKA++ N     E  D ER QRE E E  
Sbjct: 541 AEAKFRHAKEVRERGSTRAARSKEAQL-EKDEKALQENL----ERFDSERLQREGEEENM 600

Query: 601 EKRRAEREKERARELEREREEKEKEQRRLEKERERAREIEMERIKARQAVERATREARER 660
           E+ RAE+EKE    +ER  EEKE+EQRRLEKERER RE+E+ER KARQAVERATREARER
Sbjct: 601 EQSRAEKEKE----IERVWEEKEREQRRLEKERERTREMELERDKARQAVERATREARER 660

Query: 661 AATEARLKAERA-----AVEKVNAEARGRAERVAVQRVQAEARERAAGEARERAERAAAE 720
           AATEAR+KAERA     AV+K  AEAR RAER AVQR Q+EARERAA EA+ERAE+AAAE
Sbjct: 661 AATEARMKAERAKAERAAVDKATAEARERAERAAVQRAQSEARERAAAEAKERAEKAAAE 720

Query: 721 ARERVEKAAAEAKEKEARERASVARAEAEARSRAERVAVERVAAEARERAAADARERAAA 780
           ARER   A AE KE+EARERA+ ARA AEAR+RAER AVER AAEARERAAA+ARERAAA
Sbjct: 721 ARER---ANAEVKEREARERAAAARAGAEARTRAERAAVERAAAEARERAAAEARERAAA 780

Query: 781 AARASQQKNENDLESFFSMGRASSAPRPRANPMDNFFDSQSPSRPEPQTTKQPSTTSSSM 840
           AARA+QQ+++NDLE FF+MGRASSAPRPRAN  D F      +R EP+  K  ST SS++
Sbjct: 781 AARANQQRSDNDLEHFFNMGRASSAPRPRANSSDPF-----QNRQEPEVPKASSTASSNI 840

Query: 841 RKASSTTNIVDDLSSIFGGPPPSG-EFQEVDGESEERRRARLERHQRVQTRA-------- 900
           RKA+STTNIVDDLS+IFG  P SG EFQ+V+GE+EERRRARLERHQR Q R         
Sbjct: 841 RKANSTTNIVDDLSAIFGAAPSSGGEFQDVEGETEERRRARLERHQRTQERTLKALAEKN 900

Query: 901 --------------------------------------------VLWPECGWQPVSLTEM 909
                                                       VLWPECGWQPVSLT+M
Sbjct: 901 ERDLQAQREQAERHRIAESLDVEIKRWAVGKEGNLRALLSTMQYVLWPECGWQPVSLTDM 960

BLAST of Cp4.1LG01g17370 vs. NCBI nr
Match: gi|658060031|ref|XP_008365846.1| (PREDICTED: auxilin-related protein 1-like [Malus domestica])

HSP 1 Score: 737.3 bits (1902), Expect = 3.6e-209
Identity = 539/1003 (53.74%), Positives = 662/1003 (66.00%), Query Frame = 1

Query: 1   MNDFEGLLATNYGFKPQGKAAPMAASKGTSNINNPTTSPNFDLGSRASFRSTKNSNSLSG 60
           MNDFEGLLA++YGFKP GK+APM+AS      N+ + S NFDLGS  + RS + +NS SG
Sbjct: 1   MNDFEGLLASDYGFKPSGKSAPMSASSA----NSSSKSSNFDLGSTGTSRSARATNSFSG 60

Query: 61  SLADDHDSLNRSVSAHENREFGGLDDLLGESSRFSRKSESRAGESDVNFDSLFHGAGNSG 120
           S+ +D DS+       + ++FGG       S+R+  K ESR  ++  NFDS+F G  +S 
Sbjct: 61  SVVNDRDSI---FGPSKGQDFGG-------SARYPGKLESRGEDAAFNFDSMFGGPADSV 120

Query: 121 QPVASNLPVYDKPVYDDDIFDGIPGLKNSS-KVQYDDVFSSMSSPPKADSA----FDDLL 180
              ++  PV+DKPVYDDDIFDG+PGLK+++ KV+Y+DVFSS++S P   S+    FDDLL
Sbjct: 121 PKSSNPGPVFDKPVYDDDIFDGVPGLKSTAAKVKYEDVFSSVTSLPSKGSSRTSGFDDLL 180

Query: 181 GGFSKSGGVSKSK--------DKEIPAFDDLIPGFRGSSSPGD----------------- 240
           GGF K+   SKS         +K +P  DDL+PGF G  SP +                 
Sbjct: 181 GGFGKAEPQSKSSGSRGSVKTEKGVPGLDDLLPGFGGGISPSERSTSEANWPPETTANNV 240

Query: 241 SKTSSMPMENPFGVSRERDDPHHEEASDVGNFKSPKFDGYPSSGANNKGFDDMDPFASLG 300
           SKT+S  ME+PF V  +  DP  +E S      S K D  PS   N + FDD+DPF  LG
Sbjct: 241 SKTNSKVMEDPFVVLGQYSDPL-KEISRRSKSGSSKVDS-PSVN-NGRAFDDIDPFDGLG 300

Query: 301 KSVPAFSSEGNNRGKARSPPKVDGSSAGPQNSKSKDTMEKLSGKTSGQPLKKEVSAKNDR 360
           KSVPAFSS  NNRG      + D S    + S SK++ EK S ++     +K+V  ++  
Sbjct: 301 KSVPAFSSGRNNRGNDSGNLRADTSVNNSRASTSKESTEKPSVRSPDNLSQKKVPIEDHW 360

Query: 361 QFDEPVFDIPAVPTSSHKFVPQSTSPPA-----PKDENVMGDTS-RFEDSVES-DEIWLT 420
              + +FD+P V + SHK   Q  SPP+     PK+ N+  D S R E++++S D +WLT
Sbjct: 361 DSHQTLFDMPTVSSDSHKSSGQMMSPPSYVNPSPKEVNLQVDRSPRSEENMDSSDHVWLT 420

Query: 421 VSEIPLFTQPTVAPPPSRPPPPIPQQVPKEGTSPYGSRSSKMNANDFSSVPNSTHHYQIP 480
           VSEIPL TQPT APPPSRPPPP P QV K       S +++ NA+D     +ST  +Q  
Sbjct: 421 VSEIPLMTQPTSAPPPSRPPPPRPVQVSKARMGSPASINARRNASD-----SSTQFFQAL 480

Query: 481 KSASASASASTRDQVSSVDELEEFAMGRNQSNAEDLVNGLSNEEAEMNSAAAAMKEAMDR 540
           KSA A+A  S+    S++DELE+FA G+NQ+N  +  NGL+ EE EMNS AAAMKEAMD+
Sbjct: 481 KSAPAAARGSS---ASTIDELEDFARGKNQNNFSEHANGLAGEELEMNSVAAAMKEAMDK 540

Query: 541 AEAKFKHAKEVRERESTRTSKSKESVYWDRDEKAMRNNRVEDEESMDHERFQREREREEK 600
           AEAKF+HAKEVRER ST+ ++SKE+   ++DEKAM+    E  E  D ER QRE E E  
Sbjct: 541 AEAKFRHAKEVRERGSTKAARSKEAQL-EKDEKAMQ----EKLERFDSERLQREGEEENM 600

Query: 601 EKRRAEREKERARELEREREEKEKEQRRLEKERERAREIEMERIKARQAVERATREARER 660
           E+ R E+EKE    +ER  EEKE+EQRRLEKERER RE+E+ER KARQAVERATREARER
Sbjct: 601 EQSRVEKEKE----IERVWEEKEREQRRLEKERERTREMELERDKARQAVERATREARER 660

Query: 661 AATEARLKAERA-----AVEKVNAEARGRAERVAVQRVQAEARERAAGEARERAERAAAE 720
           AATEAR+KAERA     AV+K  AEAR RAER AVQR Q+EARERAA EA+ERAE+AAAE
Sbjct: 661 AATEARMKAERAKAERAAVDKATAEARERAERAAVQRAQSEARERAAAEAKERAEKAAAE 720

Query: 721 ARERVEKAAAEAKEKEARERASVARAEAEARSRAERVAVERVAAEARERAAADARERAAA 780
           ARER   A AEAKE+EARERA+ ARA AEAR+RAER AVER AAEARERAAA+ARERAAA
Sbjct: 721 ARER---ANAEAKEREARERAAAARAGAEARTRAERAAVERAAAEARERAAAEARERAAA 780

Query: 781 AARASQQKNENDLESFFSMGRASSAPRPRANPMDNFFDSQSPSRPEPQTTKQPSTTSSSM 840
           AARA+QQ+++NDLE FF+MGRASSAPRPRAN  D F       R EP+  K  ST SS++
Sbjct: 781 AARANQQRSDNDLEHFFNMGRASSAPRPRANSSDPF-----QYRQEPEVPKASSTXSSNI 840

Query: 841 RKASSTTNIVDDLSSIFGGPPPSG-EFQEVDGESEERRRARLERHQRVQTRA-------- 900
           RKA+S+TNIVDDLS+IFG  P SG EFQ+V+GE+EERRRARLERHQR Q R         
Sbjct: 841 RKANSSTNIVDDLSAIFGAAPSSGGEFQDVEGETEERRRARLERHQRTQERTLKALAEKN 900

Query: 901 --------------------------------------------VLWPECGWQPVSLTEM 909
                                                       VLWPECGWQPVSLT+M
Sbjct: 901 ERDLQAQREQAERHRIAETLDVEIKRWAVGKEGNLRALLSTMQYVLWPECGWQPVSLTDM 960
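
The percentages in each HSP summary follow directly from the reported counts (for example, 557 identical positions over an alignment length of 1004). The sketch below is not part of the database output; it assumes the two summary lines have the layout shown in the reports above, and the function name `parse_hsp` and the returned dictionary keys are hypothetical.

```python
import re

# Patterns for the two HSP summary lines as they appear in the reports above.
# "Posi?tives" also accepts the spelling "Postives".
HSP_SCORE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = (\S+)")
HSP_IDENT = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp(score_line, identity_line):
    """Parse one HSP summary and recompute the percentages from the counts."""
    s = HSP_SCORE.search(score_line)
    i = HSP_IDENT.search(identity_line)
    if s is None or i is None:
        raise ValueError("lines do not look like an HSP summary")
    identical, length = int(i.group(1)), int(i.group(2))
    positives = int(i.group(4))
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "expect": float(s.group(3)),
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / length, 2),
        "reported_identity_pct": float(i.group(3)),
        "reported_positives_pct": float(i.group(6)),
    }


# Example: the Ziziphus jujuba hit reported above.
hsp = parse_hsp(
    "HSP 1 Score: 790.4 bits (2040), Expect = 3.6e-225",
    "Identity = 557/1004 (55.48%), Positives = 675/1004 (67.23%), Query Frame = 1",
)
print(hsp["identity_pct"], hsp["positives_pct"])
```

Comparing the recomputed values with the reported ones is a quick sanity check when these summaries are copied between files.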

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
AUXI2_ARATH	1.5e-103	42.68	Auxilin-related protein 2 OS=Arabidopsis thaliana GN=At4g12770 PE=1 SV=1
AUXI1_ARATH	1.7e-102	41.80	Auxilin-related protein 1 OS=Arabidopsis thaliana GN=AUXI1 PE=1 SV=2
JAC1_ARATH	3.4e-10	32.41	J domain-containing protein required for chloroplast accumulation response 1 OS=...
UCP7_SCHPO	1.0e-06	32.31	UBA domain-containing protein 7 OS=Schizosaccharomyces pombe (strain 972 / ATCC ...
Match Name	E-value	Identity	Description
A0A0A0KYM6_CUCSA	0.0e+00	79.88	Uncharacterized protein OS=Cucumis sativus GN=Csa_4G496260 PE=4 SV=1
M5X738_PRUPE	1.4e-191	55.96	Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa000822mg PE=4 SV=1
A0A059B8F0_EUCGR	5.1e-186	50.19	Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_H05116 PE=4 SV=1
A0A059B9Z8_EUCGR	2.5e-185	50.15	Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_H05116 PE=4 SV=1
A0A0R0KHB7_SOYBN	2.1e-184	51.39	Uncharacterized protein OS=Glycine max GN=GLYMA_04G209400 PE=4 SV=1
Match Name	E-value	Identity	Description
AT4G12770.1	2.7e-106	42.51	Chaperone DnaJ-domain superfamily protein
AT4G12780.1	9.6e-104	41.80	Chaperone DnaJ-domain superfamily protein
AT1G21660.1	3.5e-21	74.58	Chaperone DnaJ-domain superfamily protein
AT4G36520.1	3.6e-18	35.39	Chaperone DnaJ-domain superfamily protein
AT1G75100.1	1.9e-11	32.41	J-domain protein required for chloroplast accumulation response 1
Match Name	E-value	Identity	Description
gi|778694906|ref|XP_011653895.1|	0.0e+00	79.88	PREDICTED: auxilin-related protein 2 [Cucumis sativus]
gi|659083018|ref|XP_008442142.1|	0.0e+00	79.98	PREDICTED: auxilin-related protein 2-like [Cucumis melo]
gi|1009146358|ref|XP_015890839.1|	3.6e-225	55.48	PREDICTED: auxilin-related protein 2-like [Ziziphus jujuba]
gi|694374646|ref|XP_009364182.1|	1.8e-213	54.14	PREDICTED: auxilin-related protein 2-like [Pyrus x bretschneideri]
gi|658060031|ref|XP_008365846.1|	3.6e-209	53.74	PREDICTED: auxilin-related protein 1-like [Malus domestica]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR001623	DnaJ_domain
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG01g17370.1	Cp4.1LG01g17370.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR001623	DnaJ domain	GENE3D	G3DSA:1.10.287.110		coord: 847..908, score: 4.1
IPR001623	DnaJ domain	unknown	SSF46565	Chaperone J-domain	coord: 838..908, score: 6.93
None	No IPR available	unknown	Coil	Coil	coord: 658..707, score: -; coord: 483..514, score: -; coord: 544..619, score: -; coord: 623..645, scor
None	No IPR available	PANTHER	PTHR23172	AUXILIN/CYCLIN G-ASSOCIATED KINASE-RELATED	coord: 647..757, score: 1.3E-138; coord: 789..908, score: 1.3E-138; coord: 306..631, score: 1.3E

The following gene(s) are orthologous to this gene:
Gene	Orthologue	Organism	Block
Cp4.1LG01g17370	CmaCh03G001980	Cucurbita maxima (Rimu)	cmacpeB666
Cp4.1LG01g17370	CmoCh03G001800	Cucurbita moschata (Rifu)	cmocpeB617
Cp4.1LG01g17370	Carg27664	Silver-seed gourd	carcpeB0314
The following gene(s) are paralogous to this gene:

None