Cp4.1LG01g17650 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG01g17650
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionCysteine--tRNA ligase
LocationCp4.1LG01 : 13292866 .. 13313441 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CTCCGCGTTCTCCCCAGCCCTCTCTCGTTCTTCTTCTGCAACGCCGACGCCGGAGTCGGCTCCCCTTTTTCTTTTACCAACTCCCGCCGCTCCCTAATTCCTTTCCTGCACGTGCCGACGCTGTTGGTACTCTCTCGATCGTGCCGCCGCCGCCACCGCCGCCTCCCTCATTGGAGTTTGCACTGTTTTTCTATTCTCTTGTTACCGGTATTCTCTCGTTCTCAATCTATGTCTTTCTCTTTTCCAATCGAATTGACCGATTAGAAAAACTCAAAATTAAGACTTGGGAGTGACATAAATGAGGAGTAGCTTGAAAGAACAGTCTCAGCATCGATGAGTTTTTAGATTAAAATCGTGGTAGACATAAATTTGAATAATTCCCTCTTTGTTATCGTATCCAGGTTCAGTTTCTTGTCTGCCGTCGAAATGGCGAAAGAGGAATTTAGGGTGTACAATTCGATGACGCAGCAAAAGGAAGTATTCACCACGAAGGAGCCTGGTAAGGTTGGCATGTACGTCTGTGGCATCACAGCCTATGATTTCAGCCATATCGGCCATGCCCGTGCCGCTGTCAACTTTGACGTCCTCTACAGGTTTACTACTTTCGTGCATTCGTATCTCTATGTCAAGTGATTGCATTTGCGAGTTTTTTTTGGTCTCTTGGATAATGCGATTGTAGACAAAGCCTGGATTTCCAGGTTCTGGGAGTTCTATGTATATTGTTTGCATACAATACTGCTTGCATTGGTTTTCAGAATATACAAGAAAACCTCGTAGTTTTCCATAATCACAAGAGAAAACAAACTTATTAGCTTTCAGCTTGAACAATGTTACCTAGAGGCAGAGGAAAAAAGAAACAATTGCAGGAAAAGGTAAAGGAATAAGTGAATCCTCTGTAAGAAAGATGTTAAGTTAGATCTGGTGCATATATATGTAGTGTTTGGAGTCATTTCCTGAATAGGGAAGTTCAAACAAACAGACACGGCCCTTGAAACCACCCAAGCACGCAATACGCCACAACTTTCTCCATTCTCCAAATTGAATGATTGATGGACTAACTATATTTTCCAAATCTAATATCAGCGTGATATTTTCTAGATAGTCTCCCATTAGAGGCTTGTTCCAAAAATCATATAGTATTCCACCCCATGAAGCCATTGGATCAAAATTAATCCTATGAAGGAACAATGCGGTACATTTTAGAATGGAAAACAGTGGAGAATTGATGACCATTGGATCTCTCACCCAAAATGGCTGTAGCAATAATTTTTGAGTAGATGCACCTCGAAGTATTTCACATTTGAGTAGATGCACTTCAAAGCATTCCACCTTGTGGCCCTTAACGAAAAACAATTGCTTCATGAAAATTTCAATTGGAAATACGTACATGTATGGGATAGTGGCTAGGAGAAAACAATCCTTTCTTGTGGATTTGATTCTTAAAACAATTCAAAATTGGGCCATTTTGTGAGAAATAACTCTCTGCCGTGTTATCTAGTTCATTTGGAGTTCCAATCACATTGCATGGATGTGTTTAGATGCTGAGGGTTCTTCAAGTGGTATTATTATCTTTCGTATTTTGTGTTTGATTTTTCTTATCGTTGAAGAAGTTGTGGAAGGCACTTTTCCTTGTATATGAAATTCTCTTTGGTCGTCACATCTTCTTCATAGATTGTAGGATTTACGGTCTTCTTGTTAGTTTTTGGCTTGAGTTGGGCGCTCTTACTTGGTCTTTGTTCAGATGCTTGAGTGAAAATAAAGGATCACAATGCAGTTAGGTGGCCATTTAAGAAGCCTTCTAGGTGTTCCTTTTCTTCTTAACAAAAGTATGAAACACTTAATTAGGCTGATTTGTGACTTCTTTTTTGATTGATTTGTCAAGCAGTGGACTAATTTGAACCCGTATGTGGAAATGGAAACACTTCCTTTTGTTGGCTTTTGAGGAGTAGATAGAAAACATTCACAGCCCCAAAATAGAAAAGCCTTGATAGACCCACATTTGATGACTTCCTCGTGAATCCCACTTCTAATGACTCCCTCATGAATCTTTGGAGGTTGATCCTATGGTTATCCTCCCCTCTCACGTATCAAACCAAAAATCCTTTTCCTTTTGGAAGATTATTATTGAAAAAATAGAAAGAAGGTTGTCAACACGGTCATCACATTATACTTCAAAAGGCGGAAGACTCACCCTAATACAAGCCACATTATCCAACTTCCCTACTTACTACATGTCTCTATTTGAAATGCCACAAAAAGTGGCCGCAGATATAGGAAGATTATTCATAAACTACTTGTGGAAAGATGGCACACTGTTTGGTGGAATATTATAAACCTCCCAACAGAAAAGGGAGGCCTCGGTCTTCTCTCGATAAGGAAGAAGAACAAAGCTTTCCTCGCCAAATGGATATGGAGATATCATCATGAAGAAAAGGCCTTATGGATAAATCTTATAAAGGCTAAATATACTCCTACATCAAACAAAAATCTGCCCCCTCCATCTTTTCTACAAAAGGGCCTTGGAAGTTCATAAAGAAACATCAAAACCTCATCACAATCGAACTCGCCATAAGGTGGGTGATAGGGGAAGCACATTATTTTGGACAGACCTATGGATTGAAAACACCATGCTAGCCTTGAAGTATCAACTTTTGTACAGGCTTTCTCATAGCAAAAAAGCCATGATTAAAGAAACGTGGAATGTTGTCAACAAATTTTGGGACCTGAAGCTTGGTAGGAATTTAAAGGATAATGAAGCAACAGAGTGGGCCAAATTAAGCCTTGACCTTGCCCCTGTGGTATTGTCAAACAAAGCAGACTCACTAACTTGGCTCCCTAGCGCTGACAAAGTCTTCTCTACAAAATCCTTGATGATGGACATGGGGAAAAAGTNTTTTCCTTGTATATGAAATTCTCTTTGGTCGTCACATCTTCTTCATAGATTGTAGGATTTACGGTCTTCTTGTTAGTTTTTGGCTTGAGTTGGGCGCTCTTACTTGGTCTTTGTTCAGATGCTTGAGTGAAAATAAAGGATCACAATGCAGTTAGGTGGCCATTTAAGAAGCCTTCTAGGTGTTCCTTTTCTTCTTAACAAAAGTATGAAACACTTAATTAGGCTGATTTGTGACTTCTTTTTTGATTGATTTGTCAAGCAGTGGACTAATTTGAACCCGTATGTGGAAATGGAAACACTTCCTTTTGTTGGCTTTTGAGGAGTAGATAGAAAACATTCACAGCCCCAAAATAGAAAAGCCTTGATAGACCCACATTTGATGACTTCCTCGTGAATCCCACTTCTAATGACTCCCTCATGAATCTTTGGAGGTTGATCCTATGGTTATCCTCCCCTCTCACGTATCAAACCAAAAATCCTTTTCCTTTTGGAAGATTATTATTGAAAAAATAGAAAGAAGGTTGTCAACACGGTCATCACATTATACTTCAAAAGGCGGAAGACTCACCCTAATACAAGCCACATTATCCAACTTCCCTACTTACTACATGTCTCTATTTGAAATGCCACAAAAAGTGGCCGCAGATATAGGAAGATTATTCATAAACTACTTGTGGAAAGATGGCACACTGTTTGGTGGAATATTATAAACCTCCCAACAGAAAAGGGAGGCCTCGGTCTTCTCTCGATAAGGAAGAAGAACAAAGCTTTCCTCGCCAAATGGATATGGAGATATCATCATGAAGAAAAGGCCTTATGGATAAATCTTATAAAGGCTAAATATACTCCTACATCAAACAAAAATCTGCCCCCTCCATCTTTTCTACAAAAGGGCCTTGGAAGTTCATAAAGAAACATCAAAACCTCATCACAATCGAACTCGCCATAAGGTGGGTGATAGGGGAAGCACATTATTTTGGACAGACCTATGGATTGAAAACACCATGCTAGCCTTGAAGTATCAACTTTTGTACAGGCTTTCTCATAGCAAAAAAGCCATGATTAAAGAAACGTGGAATGTTGTCAACAAATTTTGGGACCTGAAGCTTGGTAGGAATTTAAAGGATAATGAAGCAACAGAGTGGGCCAAATTAAGCCTTGACCTTGCCCCTGTGGTATTGTCAAACAAAGCAGACTCACTAACTTGGCTCCCTAGCGCTGACAAAGTCTTCTCTACAAAATCCTTGATGATGGACATGGGGAAAAAGTAGAAGCAATAAATCCCATGCTAGCAAAGACAGTATGGAAAGGACATCAACCTAAAAAGGTGAAGTTCTTCCTTTGGGAAATAGCGCATAAAGCCATTAGCACAAGTGAAAATCTCCAAAAAAGAATGCCTTACATCACTCTTTCTCCAAATTGGTGTCCATTATGCAAGAAAGAAAACGAATCATAAGACCACTTGTTTATGTAGTGCACATACCCTCAGAATTCCTAGACAACGATTCTCAATATATTCGGATGACNATAAATCTTATAAAGGCTAAATATACTCCTACATCAAACAAAAATCTGCCCCCTCCATCTTTTCTACAAAAGGGCCTTGGAAGTTCATAAAGAAACATCAAAACCTCATCACAATCGAACTCGCCATAAGGTGGGTGATAGGGGAAGCACATTATTTTGGACAGACCTATGGATTGAAAACACCATGCTAGCCTTGAAGTATCAACTTTTGTACAGGCTTTCTCATAGCAAAAAAGCCATGATTAAAGAAACGTGGAATGTTGTCAACAAATTTTGGGACCTGAAGCTTGGTAGGAATTTAAAGGATAATGAAGCAACAGAGTGGGCCAAATTAAGCCTTGACCTTGCCCCTGTGGTATTGTCAAACAAAGCAGACTCACTAACTTGGCTCCCTAGCGCTGACAAAGTCTTCTCTACAAAATCCTTGATGATGGACATGGGGAAAAAGTAGAAGCAATAAATCCCATGCTAGCAAAGACAGTATGGAAAGGACATCAACCTAAAAAGGTGAAGTTCTTCCTTTGGGAAATAGCGCATAAAGCCATTAGCACAAGTGAAAATCTCCAAAAAAGAATGCCTTACATCACTCTTTCTCCAAATTGGTGTCCATTATGCAAGAAAGAAAACGAATCATAAGACCACTTGTTTATGTAGTGCACATACCCTCAGAATTCCTAGACAACGATTCTCAATATATTCGGATGACATCTCACATTTCCTAGGGAGGTAAAGAAAATTTGGATATGGCCTTAACGTCCACCCTTTCAAGAATGCAAAAGCCCCATTATGGAAAATCTTCATCATGACTTTCTTTTGGAATTTGTGGAAAGGAAGAAATCAGAGCATATTTGCAGAAAAAACAGGTACCTACATAAAATTTTTCGACAATTTTGTTTACCAAGCTGTATCTTGGTATAAACTATCTAATACTTTTACTTCCTATAGTTATACCTCCCTTATTGCAAATTGGGAAGGTCTTTTGCAAACATCATGAATTATACATCCCTTTTGTAAATTTCAATCATCAATGAAATTGTCTCTTATAAAAAAAAGTATCAAAATACGTGGCTTAGTCATAAGGAATATTTTTCCTTCTATTGTTGGTTATATGATTGAGGGATGTTGATGGTGTCAGATAGTCAAGTTATAACTCATGCTAGTATGTTAAAATTATAGTTTTCAAAGAAAAAATGGCAATAGCTCAAAGGGTATCTTTATTCTTTGAACTTCCTATGTTGAATGGTATGGATTAGGTAATTCATCTTCTCTTGAGTGAAAAGGGCTAAGGAAAGAAATACATGGCTTGCTCATAAGGATTCTCTACTACTGTTCGATGTATGATGGAAGGATGTTGATGGTGATAGGGTGGTTAAGTCATTCCTTGGAGTCTCTTAGATAGAAAAAAAATACCCTTGACTTTGCCATATGTTTAAAAGATACCCTTTTTTTTTCCAGAAGGTTGCAATATTGCTGTCGACCTTTCATAAATGTCTGGAAACTATCGTTGGAGTAGAAAATTGATGTGAACTCAGACATCTCAATTATACAATATGAACTTCAAAGAAGATTTCAAGAATGATGTCCTAAAAGTACGGACATAGACATTACACAGACTAGAAAACAGATACAACATGACATAGACATAGTGACATGCCAATTCATAAAAATGTAGGACATAGACACATCGGGGATACATTACTTTTTCTTGAACTTTTTTATTAAATAATATTAAAAATCAAATTCAAAATATTTTAGTTTTGATAACTATTAAGAATAATCAAAACTAAAAAAAAGTGAACAAAATTTTTATTGGTCGTTGATTTTTGTTTAAAATTTAAATCTACATAAAATCCAACAAGTCATTATGAAGAACAAAGCCCAAACCACGAAGAACCCAATAAAAGCCCACAAAATAAGATTTATAAAAAAGTTTATAAATAAAGAAAAAGAATGCTAACCCCAAAACACTCTTTTCCTCCCCTCCTACTTTCTACTTTCCTTCCATCTCTCCCCACCTTCGTTCAATCTCTCTCCCTTTACCACTGATGGACTTCACCGTTGACTACTTTTTCTAACACGACTGTTTTGCCATCAACTACTTGAAAGGTTTGGGTTTTTGAATGACATGTTCATGATGTGTTCTATCACGCATCCAAATTTTTTAAAAATAATAAAAAGGGGCACACAATTTTTTATGTCGGACACGTGTTCGGATGTATTCGTGTCCAACATAGACACTTTGTCAAATACAAAGTGTTCGTGCTTCCTAGATGTTGTCCAAGTTTACAACAAAATTGAGTTGCATTAAATGCACTCGAACTTCATTCATGACAAAATGAAGTTAAACTCAAGTTACATTAAATGAAGTCAAGTCACATTTAAAGTTGAAATTAGGTTGCATTTAATGAACTCAAGTGCAACTCATGCTTCACTAAAGAAGAATCAAACTTCATTTATGGAACAATGGCTTCAAAATATTATAAGTTGGACTTCATTTTCTCTTGGATGGTCACAAGATTTTGAAGGCTGACCAAGAGACTTCTATCTAGGATTTATGGCTCTTTTTATTGTAACTTCAGCTTTAAACCTTCCATTTATAATTTTAGGTTACTGTAGAGTTAATGGTTGATATGGCATTCATGTACTTGGAATTACAATCCTTAAATTGCCAATAGGTCACAATTTAGGGTTTATTTGGAGGCTATATATAGCTTAGCTAATTCCTTTTGTAAAAGGGGACTTGAGAATGGAAATTTAGTAAAAAGAGCCTTTGTGCTTTTGTTTAGCTTGAAACTATGATTCTTGAAAATTCTATGTGGTTTAGGTTGTATGCATAAATTCATTTATGCTTGACTTGATCAAACTTTCTTATGGAGTGATTGAAATGTCAAATAAGGAATGATCTTATTTACAGCGATATTCGATTTTTACTTTCCTTTGGTTGATTTTCAAATGCGTTTTTATGGATTTTGAGGTTGAGTTCTATGAGTTCCGTTCTTCCACATGGTTTTTAGATCCCAAAACTGGTTTCATAACATTTTGGCATTGGAGCTACTTTGATTTCAATATCATTTGGTACCTGAGCTACTTTGATTTCATACCATTTTGGTGTCCAAACGCTGCACCAATGACTAGGATGGAAAAATGGACGTGGCTATGGCTAGAACAACCTGGCAATGAGTAGTACACATGCTTGGTTAAGTTTGAAGGAACGAAATTTAGACACTTCAATTTTTTGTTCAACATTCTAATTGGTTTTTTACTCCATTGGAAGTTTTGAAACGTCTATGAAAGGTTGGGGTAATATTGTAACTTTTGAAAGAAGAGGGTATTTTTGAAACAGAGGGAAAAACTCAAGGGTATTTTTTATAATTTAGCCTAAATGGGGATGAAAAATCTACTTGTGTTTGGGTTTCCAGTTGAAAAAAGGAATCCTAGTGATCTGCTCTGAAGGATTACCGACTAATCAATCATTTGGAGGTGGAAAAGGAGTTCATCAACTTCTGTCAAAAGCTTCATTGATGTGAGGGGGCATTCTTTAAAATATTTTGTGGCTGTGTTTAGTTCTATAGGGGGCTTATTCCTCGGTAAATTTTATCTTGTCCACAAAATCTTTTGTTTCAAATGAAAAATAATAATAATAATAATAAATAAATAAATAAATAAATAAAGAGAGAAATTATTCCAGCTTGTTTTAACTTCAACAACCGTCAGGACAAGCTACCATATGTTGTATTGCCAAGTAGTCCAAGATTTGAACTTCATAAGCAAACAGTGGTCGAAGTCTTTCAAATACTAAAATAAAGATTGCAACTTGGATTAGATTTCAAATACTAAAACAAAATTGCAACTTTGGTTGCTTTTATATTAAACATAATCAGATTTTTCGCACTGCAGATCTTCAAGTACAAATCCTAGATATTTGAGACTGAACACCAAAATTGACGCCTTATTGGAACTGTTTGGTTATTATTTTTTGTTTTTGTTTAGCTAAACATTGGAGAAGTAGTAGAACAACTGTTCCTGGTAACCTGGTTCTTTGTTGTTCTATAAAATAGTCTGTGGTGCTTGTAGTATTGAAGAAACTGTGTAAGTCCAAGCGGGGAGGGATGAAGAAGCGAAGATACACCTTTGCATCTTTTTAGTGGTGATTTAATTTGTACAAACGAATTATGTAACAAATTAGGTGCCACACTAAAATTTCAATGATGCTTGATTGGAACGTAAAGATATTGGAGTGGGAATATGAAAGTGAGATGCTAATGTGAGGCTACATGATAAGAACAATATGAACTTCAAGGAAGATGAGTTTCATTTATTCTCAGATGGTTACAAATTTTGGGGGGACGATAATTTAAAGTTATTGTATTTGATTAATGTATTTTAGGACTTTTTATTATTATCTTTTTAAGGTTACATTTACCTAGTGGCTAATTATGATCATATGATATGAAAGTTACAACTCTTGGAATGCCTTATGGAAAGTTATGGGTCTTCATTTTGAGGCTATATAAAGCCATGTCTGATGTCTTTGTAAAGTAGACTTAAAAAATGTAGTAAAAAGAGCATTTGTGTGTTCTTTAGCCAAGACCTAAGTTTCTTTGCAATTCTATGTAGTTTAGGTTGCATGTTTAGAATCATTCAAACTTGAATGATCAATCTTGCTTGTGGAGTGATTCGAATCTGAAACAAGAATTTTTGTCTTGGTAATCTGAATCTTACTTTCCCTGGAGTGATTCCTTATCACTATAAGCCTTTGACGATGTGAATGAGAACTCTTCTTGCTAAATGCATGTTTTCTTTGGCATTCTGTAAGGCTCAATAATTTCTGATAAACTTCCTGAATGTGCAGATCTTTTCTTCCTCATAAACGTTTCTTAAAATGGTGAATGTTTTTCTAACTGAGATAATTGATTATCTTTGGGCCTTGGTGGTTATGTCCTTTTGTAATTGCCCGTTGGGTCTTATATTGCCTACTCAAAGCTCCTTTTTGTAGCTTGACTCTTTCTGTTTGGGCTTTATTTTTGTATGCTTTTGTATTCTTTCATTTTTTTTTTTTTTTGTAATGACACCTCGAAGAAAAAAGAGACTGGGATATTTGATTTCTCACATATTGATCTATTGACATGAAATATTTTATATACCTATGCAAAGCGAATCGTTTCGGTTGTTTTTTTTTTTTGTCTGAATATTTATAATAAAACAGAGATTTCAAATTTTAGTATTTTCTGATTCGTCATGAAAGTGAAGCATGACCTATAATTTAAACTGCTGATTGAATGTTGGACTCTGTTTTCATGTTGGACTTTTTCTGCATTTTGGGTGATTTGTTTTGCAGTTCTGACTTTATTTTTTATTGAATGGTTCAAAATTGTCTGTAACTTGGTAACAATATCCCTGCAGATATCTGATGCACCTGGGATATGAAGTTACTTATGTGCGGAACTTTACTGATGTCGATGACAAGGTGAGTGTAGAATTACACAAGCACTCGTCTTACATTGAACAAGAATTACACACACATGCTCAATGATTTACTTGTGCAGGGATGATTTCTTAGAGTTCTATCTTTTTGTATTCTTCTATTAGAGAATTTTCCACCTGATACAGTCTTGGGATAAGCCAATTTGAAGTGAGATTTCCCTTACAGCTGATCATCCCGAAACTTAATCTTCTCTTTGCTCTCGAATCTAAACCATGCCATGTATCTCTGCTGGATTTAGTTTGTCTATTCTGATACACCCAGTAAGGCCATATGATTGGAGGCCTTATATTTTAGTTTCTGATTCGTCACTCGTGATGGAAAATGTCAGTAGTGACGGAAAATGTCAGTTTTTCCGGTTCCTGCATAGGTAGTGAACACATTGCAGACCTAAAAAGCACAAAAAGTTGAATCAAAGGAAAAAAAAAAAGCTTAAATTTAGCAGCTTGAGAGAAAGTGAGAGAGAGTCACGACATCTGAAGCCGGCCTCTTGGTGCCACTGTCGCTGCCTCCTTAAGGAAGGCAAAGAGTCGCGACCTCCTTAAGACAAAGAAGGCGAAGAGTCGCGACCTCCGGGGTTGTGCCAAGGTGGGGAAGACGAAGGTTTGCGAACAACACTTCGGAAGACAAAGAGTTGCGTCTGAATCGCTGCACAATGACAGAACCCGCTAGCCACTCCTCCGACTGAATAAGACCATGATGACAGCCGCTCCTTCTACGAAGACACCTGCACATCCCATGATCCCCCCCACCACCCACTAAAATAGAAAAAACTAAAATCATTGACGACACATGAAACTATTCCTTGCTGTCCACTAAAATACCAAACCATCAACTAAACTCAAGACCTCAGAGAAGAAGACAAGACCATAGACATGTTCATCGACCTACACAACCCTTGCCTGATACTAAAAAGTCGCCACAAATGTCAGAAGATAGAAAGACAGATGTAGAAACATACCTTTCGATTATAAACAAAAAGGAAAAGACAATCAACAAGTTTTCCCTCTCCCCCATTCATACTGTATGGGATATGATGGTGATTGTCACTCGGAAACATTTTCACGATGACTGGTTTATGATAAACCGGAGCATCCAATCCATGCTCAATGACTTTAGCCAACTAAACCCCTACAGAGCAGACAAGACGATTCTTCGGTACACGTCAAAGGAGCAAGCCTCATTAGTTGATAATAAAAAAGGTTGGTAAAAAGTTGGTAACTAGGCATTGAAAATAGAACCATGGTCTGATGAAAAGCACCTGAATGATCCCGTGGTTCCCTCTTATGGAGGTTGGATTAAAATAAAAAATCTTCCTCTAAATCGTTGGGACATCGATACTTTGAAACACATAAGAGATGCCTGTGGAGGTTTTATTGAGATTGCTAAAAAAAACCCTCATCGAGAGTGGACCTCATGGAAGCCACCATTTGTATTGAGAAAAATACACAGGTTTCATACCAGCAGAAATCTCAATACCATCATTATCCAAGATTCCCCTTATGGTCAATATAGACCCTTGCTTTTATGGAGAATCGTTTGTTGACGTTATGACTGGCTACCAACGCACTGAACAAGCACAGAAAAGATTTTGTACCCCAGAGGATACCAAATTTATTATAGATACTTCATGCGGCATTCTTGGACCACATCGTTGGTCAAACCACCGAGAAACAACACCTAGAGGGGCCTCTCAAGTACAACAAGACTCAACACCAAATTTACACATAACAGAGTCTATGGTCATACCACAAAGCCCCAAAAAGGTAACGACCATACCCACCTGCCATGGACAGCCCCATACCCTCCAAGCCAACTCCTTAAAAAAAAGGCCAACCCCCCAACCAAACCATACTCCGCCACCCTCCCCATCTTCCATGCCCCATAAACCCATAGCCTCCCCACACATAAACCAATGGCCACCAACCCCTCACCCTACCTTACACCATACTCCAAAACTCCATCTACTATACCACTTCTTATCCCCGACCTACAAGTTGTGACAATATCACCAGAAAAACATGCCATCATCATAAATAACCAAAAAACCTTCCTCACCCCAAGAACAAAATTCTCCACGGTTGGCATTGACCAAGAATCAAACGATGAATATGCATCTAGCCGTTTACCCCTCTCCACCGCCCCACCATGCCCCGCTGCAGAAAAGGCCAACATTTCTCACCTATTGCTAGAGATCAACCTTGCGGATACAAAGCTTGGCACTCTTTTTCAGGGAAAGCAGGATGAAACTGAGGTAGAAGAGGCCTCCTCGATATCAGCAACACTCACATCAGTAACACTCACACCAACAAGAGCCATCCAAATCTTGGCACCATGGCTAAAAGAGCACCACATGTGCATCATGGCTATACCATTTGGGACAAAGAAGCTCACTACGGCCAAGAAATCGAGTAAACTAACTAGGGAGATTGCTAGCCTTCTTTCAAACATCAATTATGAGAAATCGAGCAAATCACAATTGGGAGGTCAATACTTCTTATGATTATTATCTCTTGGAATGTTCAGGGATTGGGAGATTGGCAAAAGAGAGCCTTAATCAAGTCCTTTCTCCTCAAATACAATCCCACTATGGTTATTCTTCAAGAAACAAAATTGAATTATGTTGATCGCTGGATACTTAAATCCCTTTGGAGTGCCACACATATTGGTTGGTCAGCTTAGGATGCTTCCGTTTATCAGGGGGGATCTTAATCATGTGGAACGACCTAGCTTTGACCATTTCAGAAGTCACCAAAGGCTTATAAACCTTAACCACACATATTACACTGGCTGATAATTAGAGTGTTTGGATTATAGGAATCTATGGCCCCTGTGGATATCAAGAAAGACATTTTGGCAGGAATTATAGGATCTATGCTTATGTAAAGAGAATTGACTTTTGGCAAGCGATTTCAATACTTCTTGATGGGCGCATGAAAAATCCAGTGGAAAAGCCACCAAAAGCATGAGACACTTCAACAATTTCATAAGCTTTCATAAGAGAAGCTGAACTCATTGACCTTCCCCTTAGTAATGGACTCTATACATGGTTAAACAATAGATCTCCACCCACCTTCAACAATTTCATAAGAGAAGCTGAACTCATTGACCTTTCCCTTAGTAATGGACTCTATACATGGTTAAACAATAGATCTCCACCCACCTTGATACTCATCGATAGATTTCTTGCCACAGAAGGCCTCCTTGGAAAATTCAGTAATGCTGGATTCAAAGACGGAATAGACCTACATCTGATCATTATCCCATTCTACTTTCCATGGGTTGCTGTAAATGGGGCCCCTCACCCTTTCGATTTGAGAATACGCGGCTGAACCATTCTAAATTTTTGCCAATGGTAGAGTACTGGTGGAAGAATACTCCCCTCTGTGGATGGTTAGGGCACGGTTTCATCAACAAACTTAAAAGACTAAAAGAAGTCCTAAAGAAATGGAATAAGGAGGTTTTTGGTTGCATTTCCACAAAAAGGAACCAACTGTTAATCGAAATTGTCCTCCTTGATCAGATGGAGGAAACTGGCTCCATCACACTAGTTCAACACACAGAAAAAGCTCTTGAATGCAGAACTTATTTCGTTGGCAGTAAATGAAAAGCAATCTTGGAGACAAAAATGTAAAAAACAATGGGTAGAGGAAGGTGACAAACTCCAAGTTTTTCCATCGTATCATGGCTACAAAGAAACGAAAGAGTACCATCATGGAATTCTATCCGCACAAGGCAGAAGTTTAGTAAACGAAGAAGAAATTGTATTAGAATTTGTGTCCTTCTACACGTCCTTATACGCAAGGGACAATGCCCTCTGCGAATTTCCTCACGATCTTGACTGGAGTCCTATTGATCAACAGCAAGCTGCCTCCCTCGAGGTCATTTTCACTGAAGAAGAGGTGGGGAAGGCAGTTTGGCACTTGGGCTCCGACAAAACTCTAGGATCAGATGGTTTTACCTCAGATCATTATGATAGCTGATATCATGAGAGTATTCCAAGATTTTTTTCGAAACAGAGTTATTAATGGCAACCTGAATGAAACATACATATGTTTGATCTCAAAGAAACTGGATGCTCGCATTGTTACAGACTTTCATCCCATAAGCCTCACGACCGGTCTATATAAAATCATAGCAAGGGTATTATCTGAACGTCTTAAAAAGGTTCCCCCCTTCACAATCACTGGGAAACAAATAGGTTTTGTTGAGGGGCGACAAATTCTTGATGCATCCCTCATGCCCAACGAACTCATTGATGAAAGGGAAAGAAAGAAACAAAAATGTGTAGTCATTAAACTTGACATGGAAAAGGCTTCGACAAGGTTGATTGGACTTTCCTCGAAAACATTCTTGTGGCAAAAGGCTTTAGCCCAAAATGGCGAAGGTGGATTAAAGGATGCATCTCACCCACAAATTTCTCCATCATCATCAATGGAAGACTGAGAGGGAAGATTTATGCCACATGAGGTCTGAGACAAGGAGATCCACTATCCCCTTTTCTCTTCATCATGGTAATGGACGGTCTCAGTTGCCTACTGACAAAAGCAGAATACGAAGGGCAGATAAAAGGTTTTTAGATTGGTAATGAAGGCTTGAGCATCAACCACCTTCCGTTCGCAGATGACACAATCCTCTTTTCTGATTTGGTTGAGACGGTAAAAACCTTTGAAGGATTCTCTGGACAAAATATCAATCTCCAAAAAACAGAGATCATGGGCATCAATATTAGCACAGAGATCATGGAAGAATTTGCAGCATATATGGTTGTAAAAAGGGAGAATGACCGAATATGTACCTAGGGCTACCTTTAAAAGAAAACCACAAATCCTTTTCCTTTTTGAAGATTAATATTGAGAAAATAGAAAGAAGGTCGTCAACATGGTCATCACGTTATAGTTCAAAAGGCGGAAGACTTACCCTAATACAAACCACATTATCCAACTTCCCTACTTACTACATGTCTATTTGAAATGCCACAAAAAGTGGCCGCGGATATAGAAAGACTTTTCAGAAACTACTTGTGGAAAGATGTTGCACACCTTGTTCATTGGAATATTATAAACCTCCCAACAGAAAAGGGAGGCCTCGGTCTTCTCTCGATAAGGAAGAAGAACAAAGCTCTCCTCGCCAAATGGATATGGAGATATCATCATGAAGAAAAGGCCTTATGGCGAAATCTTATAAAGGCTAAATATACGCCTACATCAAACAAAAATCACCCCCTCCATCTTCTACAAAATAGCACATATAGCCATCAGCACAAGTGAAAATCTTCAAAAAAGAATGTCCTATATCACTCTATCTCCAAATTGGTGTCCATTATGCAAGAAAGAAAACGAATCACAAAATCACTTGTTTATACAATGCACATACGCTCAGAATTTCTGGATAACGATTCTCAATATATTTGAATGTCATCTCACATTTCCTAGGGAGGTAAAGGATTTTTTGGATATGGCCTTAACGTACCACCCTTTCAAGAACGCAAAAGCCCTATTATGGGAAAACCTCATCATGGCTTTCTTTTGGAATCGGTGGAAAAAAAGAAATAAGAGAATATTTGTAGAAAAGACACAGACCTACACAAAACTTTTCGACAATGTTGTTTACCAAGCTATATCTTGGTGTAAACTAAGGTGGATCTATCGCTCTTCTTGGTTCAGATCTAATTTATTTTTAATTTTACTCCCTTTTTACTTGAGCTCAACATGGGACAGCTTTTTATTTGCTTCTTACTGAAAGAGTAAATGTTTTCGTTTGTATTTTTCATTTAAAGGACCCTTGTCCAGCAACAATGACCATAATTAATAAATGCACAAGGGGTAACATCCCACGATGTTTTTTCATTGACTAATGAGTGATTAAGTTTCTCCTTTGCTTTAGTAATTCGAAGCCAGTGATGTAACTTACAAACTGTCTAATTATAGTTAGTCTTGGAGACAATTACTATGAAGTTAGATTGCCTAATTTGTTTATTTTTACATTTTCCTAGTAGTAAAGTCGTGGCCTTCTCTAGTCGTTATATTTTTCTGTATTTGAGGTTGCTATTCTTACTATTAACAGATAATTCGACGGGCAAATGAGTCAGGGGAGAATCCACTGTCATTAAGTGAACGCTTTTGCCAGGAATATCTTGCTGACATGGATGATCTTCAGTGCCTATCTCCAACACACCAACCTCGTGTGTCAGATCACTTGGAACAAATTAAGGACATGATAACTCAGGTATGGATCTTTATACTCACTCAACAATGAAAGCAAAATGCACCAATCCAGGTCAACAGCAAACTTTATCTATTTTTGTTCCTGTAAATTGCAGAATGCATCCATATCTCGTATGTATATTTATATTTTAGAATACGTGCAACTTTTGTCCATACCTAGCCAATTGTGGTTTAGAAATTCGCTGTAAGAAGGCATAATGATTGGTCAGAGATCCGGTGTTACTGCAGTAGTGTGTTTACCACCTGCTCTTTAAATGTGTTTTCTTCCAAATATAACTATGTTTGCTTTAATGACTATTTTTGTAATTATTGTGGGAAAAAGAAATGTGCATGGATTGTGTGTGTTGAAATTTTTGCTATGGTTGTCTATCCACGTGACACCAATGAGGGTGAATTAAATATTAGAAGACTTAGAGAGAATGAGTTCAAGTCATGGTTGGCAACTATCTATAATATAGTTCCTATGAGTTTTTTTTATTTTTGACAATGAAATGTAGGAATTTTATTCCCGTCAGTTTTCTTGACTGTTAAATGTAGGAGAGTTAAGTGGTTGTCCTTAAGTCTTCTCAAGGTGCACGCTTGGATGGATACTCATGGATATAAAATTGAATTGGCTTTTCATGTCAATGATACCAATAAGTTCGAGGGTATTTTCTAAAATGAATTTATTGATTAATCTAGGGAGCATCAAATGTAGGAGAGTTTGGTGGTTGTCCGTAAGTATTCTCAAGGTGCACGCCTGATGGATACAAAAATGGAATTGGTTTTCATGTGAATGATACCAATAAGTTGGAGGATATTTTCTAACATGATTTATTGAACGCTTGGATGGATACTGATGGATATAAAAGTGGAATTTGCTTTTCATGTGAATGATACCAATAAGTTGGAGGATATTTTCTAAAATGAATTCATTGATTAATATAGGGAGCATTAAATGTAGGAGTGTTAGGTGGTTGTTCTCAAGTATTCTCAAGGTGCACGCTTGGATGGATACTAATGGATATAAAAATGGAATTGGCTTTTCATATGAATGATACCAATAAGTTGGAGGCTATTTTCTAAAATGAATTTAGGGAGTGTTCTATTTAGAAGACACAATGGAGTCTTGCATTTTATGAACTTTTAGTTATCTTGCTGTTTCATATGAATATTTCTTTGTTCCAGATGTGAATGAAGGGCTAACGTTTACTCATCTGACTGTGATTTGGATTGTCTTATTTGATTTTTTGTTGATAAATTTGCAGATTATAAATAATGAATATGGCTATGTGGTTGATGGAGACGTGTTCTTTTCTGTGGAAAAATTTCCAAATTATGGCCAATTGTCAGGACAAAAACTTGAAAATCATAGAGCAGGTGAACGGGTTGCTGTAGATCCAAGAAAGCGTAGTCCTTTTGACTTTGCATTGTGGAAGGTTAAAAGCGGATATCAATTTGGCTATAAAGTAGGAATAGGAATTAATTGTGTAGTGTTTTTAATAAATATAATTTTTGTAGGCTGCAAAACCTGGTGAGCCAAGTTGGGAAAGTCCGTGGGGTCATGGAAGACCAGGATGGCATATAGAATGCAGTGCTATGAGTGCTCATTATCTTACTTTTAAGTTTGATATCCATGGAGGTGGCATTGATTTGATCTTTCCACATCATGAAAACGAGGTCGCACAAAGCTGTGCTGCCTGCCAAGAGAGTAAAATCAGTTATTGGATGCACAATGGGCATGTTACAAATAATAATGAAAAGATGTCCAAATCACTGGGTAACTTTTTCACCATTCGCCAGGTAGTATGGGCATAAAGGAGCTATTGAAGTTTCTTTACCTTTTGTTTAGTTTTTGTTATTACCTACTACTTTTTTCTTGTACTTGCAGATTACGGAAAGGTATCATCCATTAGTTTTGAGACACTTTTTAATAAGTGCTCACTATCGGTCTCCTCTCAACTACACTGTCTCCCAGTTGGATAGTGCATCAGATACTATTTATTATATATACCAGGTTTGTGATCCCATCAATGTCTGAGAATACCCTCTACCGTCCCATGGTCTGTCATTTATTATTATCAGTTTTTTTAATGAAACTCATGGTCGCTGTTTATAATCGGTGATGCAAGGTGCCATAATTAACTCATGATAGATAATTCCTAAATGATCAGTTCATTTATTTAGTTGATACTCATGGCCCATTTTTGTTTAATTGTTAGACTTTCCAAGATTGTGAAGATGCTTTATTACAACATCAAGGGGAAATCCTGACTGAGGGTTTGGGAAAAACAGCCAAAAAAGATCCTGTTTCCTCTGCTGCTGAAGAATGCATCAATAATCTACGCTCCGAATTTCAAACTAGAATGGCAGATGACTTGAACACGGCACATATATTGACTGGAGCTTTCCAGGAAGCTCTGAAGTTCATAAATAGTACTTTAACCTCGCTGAAGGTACAGTTTTTTGCACGCAGGCACTTCTCCAATGCTGGGAATTTGTTTATACCACTGTTTAGAAGTTTTGATTTAATCAAATTGAATTTTGAACTTGAATAAGCAGTGGAGTGAATACTTTCATTTAAAGGAATTCAATCCATTGAGTCGATTTTAATTTTTTCTTTTATTTTTATTTTTGTTGAGACTTGCTACTTGCTGTAAGCATTTTTGTGATTGGGCTAGACTAGGGGGCTGGTTTTTATGATTATATTGGAGTTTCTTTTGGTTTTGGTAATTTAGTATCAAGTGTTGAGTTGGTAGTATTTTTGTGGTCAGGTTAGATGCTCTGTTTTTATGGCTTTGATTGTAGTTGGGTTGTTTTCATAGTTGGGAGTTTTTATGTTGTTTTTATTAGTGAGATGTTGGGTGATCTATTGGTTGGTTTTATTTTCTAAATTTGTTCTTTCTTTCATTTCTCTCGATCACAACTTGGTTCTGTATATGTTAATTTTACCATTTATTGAAGTGTATTGTTAAATTGATGAAAATAATTATAATCTACTAAACTGACTACAGCGGGAAAATATATTCTCGTTTTACATTTTGTTATCCCGCTGCCTCTTCTCGTGTTCTTTTCAATGTAAAATAACCATTCATGTTTTTGATTTTATCAGAAGAAGCAACCAAAGAAACAACAGCTCTCAATGATTCAGGCCCTTATCAGTGTGAAGAAGGAATTGCGAGAAGTTCTAGATGTTCTAGGATTACTATCCTCTTCTACTTCCTCTGAAGTAAATATAATCTGTCTTGGTGTTTTATTTATTTTGCTCCTTTTATTTATCTTCACACTTTTATTGTCTTGGTCCCTCACATGTCAATAGACTTTTATTTATGTTTTTGGGAAAAGAAAACTGAAATAGGAATAAGAAATAAGAGATTAGACCTTGTTGTTTCTAAAACTTATAAACAGTTACCAAATGCTATCGTTTCTAGAAATTAGAAACAAGAAACAAAGACTGTTATTAAATCAACTCTGAATTTGTCTATGTAAGTTAAAAGATCAAATGAAATATTCAAAGCATTGGGATCATAAGAGAACAAACCTTCTGTCTTCTCTACATAAATAAATAAATGAATGAATATGAGAACGAGATGAAGGTTTGTATATTATAGACTGGAATTCAACTAGTTGAACATGCAATTCTGATGTGAGGTGTTTTCCCTGTGTAGGTTTTGCAGCAGTTGAAAGACAAAGCAGTTAAGAGGGCAGGCATGGTGGAAGATGATATACTGAAATTGATCGACGATAGAACACAAGCAAGGAAAAACAAAGATTTTGCGAGAAGTGACCAAATCCGAGCCGACCTATCTGCCTTGGGCATAGCTCTCATGGATGTTGGCAAAGAGACCGTCTGGAGACCGTGTGTTCCTGTAGAACTGGCACCACCGCCTCCTCCGTCTGAAGAGAACAAGTCTACTCCATTGGCCGAGAAAAAACCAACAGAGCAGCAGGTTGCTCAGCCACCAGCAGTTGAACAGAACAAGGCAGCTCAAGAACAAACAGTACAGGATGCTCAGCCACATGAGGAAAATAAGGCGACCGTGCCCGTCTGATGTTGCATCCGCTTTGTTGAGGAGAGGGATGAAGGATGAGATTTACATTACATTGTTGGCCAGTTGTAAACACTCGTTGGCCCTTAATATAGAAACCTGAAGGATTTTGGAAGAGCATGTGTAGCATACAGGTATGAACTCTACTTCCATTTACTTTAATTTTTGTGAACTGTGATACCGAGAGAAATTTTTATCTTATACGGCTGTTTTGGGGACCAAAAATTAATCCCAGCTGACTGCCTGTTGGGTTTTTCATTTGAAATATTTGATTATGAACCAAGAAGGAGACTGGATTGCTCAATCACCGTCATTCTCTTTATGGTTGAGTGTTTTATCAAATCTTAAGGACGAGAATGAGAACACACAAATTTGAGCAAAGGAAATACACCTGATTTACACTGATTTAATGAGAATTG

mRNA sequence

CTCCGCGTTCTCCCCAGCCCTCTCTCGTTCTTCTTCTGCAACGCCGACGCCGGAGTCGGCTCCCCTTTTTCTTTTACCAACTCCCGCCGCTCCCTAATTCCTTTCCTGCACGTGCCGACGCTGTTGGTACTCTCTCGATCGTGCCGCCGCCGCCACCGCCGCCTCCCTCATTGGAGTTTGCACTGTTTTTCTATTCTCTTGTTACCGGTTCAGTTTCTTGTCTGCCGTCGAAATGGCGAAAGAGGAATTTAGGGTGTACAATTCGATGACGCAGCAAAAGGAAGTATTCACCACGAAGGAGCCTGGTAAGGTTGGCATGTACGTCTGTGGCATCACAGCCTATGATTTCAGCCATATCGGCCATGCCCGTGCCGCTGTCAACTTTGACGTCCTCTACAGATATCTGATGCACCTGGGATATGAAGTTACTTATGTGCGGAACTTTACTGATGTCGATGACAAGATAATTCGACGGGCAAATGAGTCAGGGGAGAATCCACTGTCATTAAGTGAACGCTTTTGCCAGGAATATCTTGCTGACATGGATGATCTTCAGTGCCTATCTCCAACACACCAACCTCGTGTGTCAGATCACTTGGAACAAATTAAGGACATGATAACTCAGATTATAAATAATGAATATGGCTATGTGGTTGATGGAGACGTGTTCTTTTCTGTGGAAAAATTTCCAAATTATGGCCAATTGTCAGGACAAAAACTTGAAAATCATAGAGCAGGTGAACGGGTTGCTGTAGATCCAAGAAAGCGTAGTCCTTTTGACTTTGCATTGTGGAAGGCTGCAAAACCTGGTGAGCCAAGTTGGGAAAGTCCGTGGGGTCATGGAAGACCAGGATGGCATATAGAATGCAGTGCTATGAGTGCTCATTATCTTACTTTTAAGTTTGATATCCATGGAGGTGGCATTGATTTGATCTTTCCACATCATGAAAACGAGGTCGCACAAAGCTGTGCTGCCTGCCAAGAGAGTAAAATCAGTTATTGGATGCACAATGGGCATGTTACAAATAATAATGAAAAGATGTCCAAATCACTGGGTAACTTTTTCACCATTCGCCAGATTACGGAAAGGTATCATCCATTAGTTTTGAGACACTTTTTAATAAGTGCTCACTATCGGTCTCCTCTCAACTACACTGTCTCCCAGTTGGATAGTGCATCAGATACTATTTATTATATATACCAGACTTTCCAAGATTGTGAAGATGCTTTATTACAACATCAAGGGGAAATCCTGACTGAGGGTTTGGGAAAAACAGCCAAAAAAGATCCTGTTTCCTCTGCTGCTGAAGAATGCATCAATAATCTACGCTCCGAATTTCAAACTAGAATGGCAGATGACTTGAACACGGCACATATATTGACTGGAGCTTTCCAGGAAGCTCTGAAGTTCATAAATAGTACTTTAACCTCGCTGAAGAAGAAGCAACCAAAGAAACAACAGCTCTCAATGATTCAGGCCCTTATCAGTGTGAAGAAGGAATTGCGAGAAGTTCTAGATGTTCTAGGATTACTATCCTCTTCTACTTCCTCTGAAGTTTTGCAGCAGTTGAAAGACAAAGCAGTTAAGAGGGCAGGCATGGTGGAAGATGATATACTGAAATTGATCGACGATAGAACACAAGCAAGGAAAAACAAAGATTTTGCGAGAAGTGACCAAATCCGAGCCGACCTATCTGCCTTGGGCATAGCTCTCATGGATGTTGGCAAAGAGACCGTCTGGAGACCGTGTGTTCCTGTAGAACTGGCACCACCGCCTCCTCCGTCTGAAGAGAACAAGTCTACTCCATTGGCCGAGAAAAAACCAACAGAGCAGCAGGTTGCTCAGCCACCAGCAGTTGAACAGAACAAGGCAGCTCAAGAACAAACAGTACAGGATGCTCAGCCACATGAGGAAAATAAGGCGACCGTGCCCGTCTGATGTTGCATCCGCTTTGTTGAGGAGAGGGATGAAGGATGAGATTTACATTACATTGTTGGCCAGTTGTAAACACTCGTTGGCCCTTAATATAGAAACCTGAAGGATTTTGGAAGAGCATGTGTAGCATACAGGTATGAACTCTACTTCCATTTACTTTAATTTTTGTGAACTGTGATACCGAGAGAAATTTTTATCTTATACGGCTGTTTTGGGGACCAAAAATTAATCCCAGCTGACTGCCTGTTGGGTTTTTCATTTGAAATATTTGATTATGAACCAAGAAGGAGACTGGATTGCTCAATCACCGTCATTCTCTTTATGGTTGAGTGTTTTATCAAATCTTAAGGACGAGAATGAGAACACACAAATTTGAGCAAAGGAAATACACCTGATTTACACTGATTTAATGAGAATTG

Coding sequence (CDS)

ATGGCGAAAGAGGAATTTAGGGTGTACAATTCGATGACGCAGCAAAAGGAAGTATTCACCACGAAGGAGCCTGGTAAGGTTGGCATGTACGTCTGTGGCATCACAGCCTATGATTTCAGCCATATCGGCCATGCCCGTGCCGCTGTCAACTTTGACGTCCTCTACAGATATCTGATGCACCTGGGATATGAAGTTACTTATGTGCGGAACTTTACTGATGTCGATGACAAGATAATTCGACGGGCAAATGAGTCAGGGGAGAATCCACTGTCATTAAGTGAACGCTTTTGCCAGGAATATCTTGCTGACATGGATGATCTTCAGTGCCTATCTCCAACACACCAACCTCGTGTGTCAGATCACTTGGAACAAATTAAGGACATGATAACTCAGATTATAAATAATGAATATGGCTATGTGGTTGATGGAGACGTGTTCTTTTCTGTGGAAAAATTTCCAAATTATGGCCAATTGTCAGGACAAAAACTTGAAAATCATAGAGCAGGTGAACGGGTTGCTGTAGATCCAAGAAAGCGTAGTCCTTTTGACTTTGCATTGTGGAAGGCTGCAAAACCTGGTGAGCCAAGTTGGGAAAGTCCGTGGGGTCATGGAAGACCAGGATGGCATATAGAATGCAGTGCTATGAGTGCTCATTATCTTACTTTTAAGTTTGATATCCATGGAGGTGGCATTGATTTGATCTTTCCACATCATGAAAACGAGGTCGCACAAAGCTGTGCTGCCTGCCAAGAGAGTAAAATCAGTTATTGGATGCACAATGGGCATGTTACAAATAATAATGAAAAGATGTCCAAATCACTGGGTAACTTTTTCACCATTCGCCAGATTACGGAAAGGTATCATCCATTAGTTTTGAGACACTTTTTAATAAGTGCTCACTATCGGTCTCCTCTCAACTACACTGTCTCCCAGTTGGATAGTGCATCAGATACTATTTATTATATATACCAGACTTTCCAAGATTGTGAAGATGCTTTATTACAACATCAAGGGGAAATCCTGACTGAGGGTTTGGGAAAAACAGCCAAAAAAGATCCTGTTTCCTCTGCTGCTGAAGAATGCATCAATAATCTACGCTCCGAATTTCAAACTAGAATGGCAGATGACTTGAACACGGCACATATATTGACTGGAGCTTTCCAGGAAGCTCTGAAGTTCATAAATAGTACTTTAACCTCGCTGAAGAAGAAGCAACCAAAGAAACAACAGCTCTCAATGATTCAGGCCCTTATCAGTGTGAAGAAGGAATTGCGAGAAGTTCTAGATGTTCTAGGATTACTATCCTCTTCTACTTCCTCTGAAGTTTTGCAGCAGTTGAAAGACAAAGCAGTTAAGAGGGCAGGCATGGTGGAAGATGATATACTGAAATTGATCGACGATAGAACACAAGCAAGGAAAAACAAAGATTTTGCGAGAAGTGACCAAATCCGAGCCGACCTATCTGCCTTGGGCATAGCTCTCATGGATGTTGGCAAAGAGACCGTCTGGAGACCGTGTGTTCCTGTAGAACTGGCACCACCGCCTCCTCCGTCTGAAGAGAACAAGTCTACTCCATTGGCCGAGAAAAAACCAACAGAGCAGCAGGTTGCTCAGCCACCAGCAGTTGAACAGAACAAGGCAGCTCAAGAACAAACAGTACAGGATGCTCAGCCACATGAGGAAAATAAGGCGACCGTGCCCGTCTGA

Protein sequence

MAKEEFRVYNSMTQQKEVFTTKEPGKVGMYVCGITAYDFSHIGHARAAVNFDVLYRYLMHLGYEVTYVRNFTDVDDKIIRRANESGENPLSLSERFCQEYLADMDDLQCLSPTHQPRVSDHLEQIKDMITQIINNEYGYVVDGDVFFSVEKFPNYGQLSGQKLENHRAGERVAVDPRKRSPFDFALWKAAKPGEPSWESPWGHGRPGWHIECSAMSAHYLTFKFDIHGGGIDLIFPHHENEVAQSCAACQESKISYWMHNGHVTNNNEKMSKSLGNFFTIRQITERYHPLVLRHFLISAHYRSPLNYTVSQLDSASDTIYYIYQTFQDCEDALLQHQGEILTEGLGKTAKKDPVSSAAEECINNLRSEFQTRMADDLNTAHILTGAFQEALKFINSTLTSLKKKQPKKQQLSMIQALISVKKELREVLDVLGLLSSSTSSEVLQQLKDKAVKRAGMVEDDILKLIDDRTQARKNKDFARSDQIRADLSALGIALMDVGKETVWRPCVPVELAPPPPPSEENKSTPLAEKKPTEQQVAQPPAVEQNKAAQEQTVQDAQPHEENKATVPV
BLAST of Cp4.1LG01g17650 vs. Swiss-Prot
Match: SYCC2_ARATH (Cysteine--tRNA ligase 2, cytoplasmic OS=Arabidopsis thaliana GN=At5g38830 PE=2 SV=1)

HSP 1 Score: 682.9 bits (1761), Expect = 2.9e-195
Identity = 332/506 (65.61%), Postives = 417/506 (82.41%), Query Frame = 1

Query: 3   KEEFRVYNSMTQQKEVFTTKEPGKVGMYVCGITAYDFSHIGHARAAVNFDVLYRYLMHLG 62
           K E ++YN+MTQQKEV     PGK+G+YVCGITAYDFSHIGHARAAV+FDVLYRYL HL 
Sbjct: 5   KMELKLYNTMTQQKEVLIPITPGKIGLYVCGITAYDFSHIGHARAAVSFDVLYRYLKHLD 64

Query: 63  YEVTYVRNFTDVDDKIIRRANESGENPLSLSERFCQEYLADMDDLQCLSPTHQPRVSDHL 122
           Y+VT+VRNFTDVDDKII RAN++GE+PL LS RFC EYL DM  LQCL PTHQPRVS+H+
Sbjct: 65  YDVTFVRNFTDVDDKIIDRANKNGEDPLDLSNRFCDEYLVDMGALQCLPPTHQPRVSEHM 124

Query: 123 EQIKDMITQIINNEYGYVVDGDVFFSVEKFPNYGQLSGQKLENHRAGERVAVDPRKRSPF 182
           + I  MI +II  + GYVV+GDVFFSV+K PNYG+LSGQ LE+ RAGERVAVD RKR+P 
Sbjct: 125 DNIIKMIEKIIEKDCGYVVEGDVFFSVDKSPNYGKLSGQLLEHTRAGERVAVDSRKRNPA 184

Query: 183 DFALWKAAKPGEPSWESPWGHGRPGWHIECSAMSAHYLTFKFDIHGGGIDLIFPHHENEV 242
           DFALWKAAKP EPSWESPWG GRPGWHIECSAMS HYL+ KFDIHGGG DL FPHHENE+
Sbjct: 185 DFALWKAAKPDEPSWESPWGPGRPGWHIECSAMSVHYLSPKFDIHGGGADLKFPHHENEI 244

Query: 243 AQSCAACQESKISYWMHNGHVTNNNEKMSKSLGNFFTIRQITERYHPLVLRHFLISAHYR 302
           AQ+CAAC++S ++YW+HNGHVT NNEKM+KS  NF TIR+IT  YHPL LRHFL+SA YR
Sbjct: 245 AQTCAACEDSGVNYWLHNGHVTINNEKMAKSKHNFKTIREITASYHPLALRHFLMSAQYR 304

Query: 303 SPLNYTVSQLDSASDTIYYIYQTFQDCEDALLQHQGEILTEGLGKTAKKDPVSSAAEECI 362
           SPL++T SQL+S+S+ +YY+YQT QD ++ L  +Q + L+E  GK+ +    ++  ++ I
Sbjct: 305 SPLSFTASQLESSSEALYYVYQTLQDLDEGLSPYQ-DALSEDGGKSEQ----TAEGKDII 364

Query: 363 NNLRSEFQTRMADDLNTAHILTGAFQEALKFINSTLTSLKKKQPKKQQLSMIQALISVKK 422
             L++EF+++M DDLNTAHILTGA+Q+ALKFIN++L+ LKK Q KKQ++SM+ +L+ ++K
Sbjct: 365 KKLKTEFESKMLDDLNTAHILTGAYQDALKFINASLSKLKKMQ-KKQRMSMLVSLVEIEK 424

Query: 423 ELREVLDVLGLLSSSTSSEVLQQLKDKAVKRAGMVEDDILKLIDDRTQARKNKDFARSDQ 482
             REVLDVLGLL++ + +E+L+++K K + RA + E+ I +LI++R  ARKNKDFA+SD+
Sbjct: 425 AAREVLDVLGLLTTLSYAEILKEMKLKTLIRAEIGEEGISQLIEERITARKNKDFAKSDE 484

Query: 483 IRADLSALGIALMDVGKETVWRPCVP 509
           IR  L+  GIALMD+GKETVWRPC P
Sbjct: 485 IREKLTRKGIALMDIGKETVWRPCFP 504

BLAST of Cp4.1LG01g17650 vs. Swiss-Prot
Match: SYCM_ARATH (Cysteine--tRNA ligase, chloroplastic/mitochondrial OS=Arabidopsis thaliana GN=SYCO PE=2 SV=1)

HSP 1 Score: 606.3 bits (1562), Expect = 3.4e-172
Identity = 303/511 (59.30%), Postives = 383/511 (74.95%), Query Frame = 1

Query: 4   EEFRVYNSMTQQKEVFTTKEPGKVGMYVCGITAYDFSHIGHARAAVNFDVLYRYLMHLGY 63
           +E  ++N+M+++KE+F  K  GKVGMYVCG+TAYD SHIGHAR  V FDVL RYL HLGY
Sbjct: 63  KELWLHNTMSRKKELFKPKVEGKVGMYVCGVTAYDLSHIGHARVYVTFDVLLRYLKHLGY 122

Query: 64  EVTYVRNFTDVDDKIIRRANESGENPLSLSERFCQEYLADMDDLQCLSPTHQPRVSDHLE 123
           EV+YVRNFTDVDDKII RA E  E+P+SLS RFC+E+  DM+ LQCL P+ QPRVSDH+ 
Sbjct: 123 EVSYVRNFTDVDDKIIARAKELEEDPISLSRRFCEEFNRDMEQLQCLDPSVQPRVSDHIP 182

Query: 124 QIKDMITQIINNEYGYVVDGDVFFSVEKFPNYGQLSGQKLENHRAGERVAVDPRKRSPFD 183
           QI D+I QI++N Y Y VDGD++FSV+KFP YG+LSG+KLE++RAGERVAVD RK+ P D
Sbjct: 183 QIIDLIKQILDNGYAYKVDGDIYFSVDKFPTYGKLSGRKLEDNRAGERVAVDTRKKHPAD 242

Query: 184 FALWKAAKPGEPSWESPWGHGRPGWHIECSAMSAHYLTFKFDIHGGGIDLIFPHHENEVA 243
           FALWKAAK GEP WESPWG GRPGWHIECSAMSA YL + FDIHGGG+DL+FPHHENE+A
Sbjct: 243 FALWKAAKEGEPFWESPWGRGRPGWHIECSAMSAAYLGYSFDIHGGGMDLVFPHHENEIA 302

Query: 244 QSCAACQESKISYWMHNGHVTNNNEKMSKSLGNFFTIRQITERYHPLVLRHFLISAHYRS 303
           QSCAAC  S ISYW+HNG VT ++EKMSKSLGNFFTIRQ+ + YHPL LR FL+  HYRS
Sbjct: 303 QSCAACDSSNISYWIHNGFVTVDSEKMSKSLGNFFTIRQVIDLYHPLALRLFLMGTHYRS 362

Query: 304 PLNYTVSQLDSASDTIYYIYQTFQDCEDALLQHQGEILTEGLGKTAKKDPVSSAAEECIN 363
           P+NY+   L+SAS+ I+YIYQT  DCE AL +            T +   V S     IN
Sbjct: 363 PINYSDFLLESASERIFYIYQTLHDCESALGEKDS---------TFENGSVPSDTLTSIN 422

Query: 364 NLRSEFQTRMADDLNTAHILTGAFQEALKFINSTLTSLKKKQPKKQQLSMIQALISVKKE 423
             R+EF   M+DDL T  +   A  E LK IN  + + K K+  +++    ++L +++  
Sbjct: 423 TFRTEFVASMSDDLLTP-VTLAAMSEPLKTINDLIHTRKGKKQARRE----ESLKALETT 482

Query: 424 LREVLDVLGLLSSSTSSEVLQQLKDKAVKRAGMVEDDILKLIDDRTQARKNKDFARSDQI 483
           +R+VL +LGL+ +S S EVL+QLK+KA+KRAG+ E+D+L+ + +RT ARKNK++ RSD I
Sbjct: 483 IRDVLTILGLMPTSYS-EVLEQLKEKALKRAGLKEEDVLQRVQERTDARKNKEYERSDAI 542

Query: 484 RADLSALGIALMDVGKETVWRPCVPVELAPP 515
           R DL+ +GIALMD  + T WRP +P+ L  P
Sbjct: 543 RKDLAKVGIALMDSPEGTTWRPAIPLALQEP 558

BLAST of Cp4.1LG01g17650 vs. Swiss-Prot
Match: SYCC1_ARATH (Cysteine--tRNA ligase 1, cytoplasmic OS=Arabidopsis thaliana GN=At3g56300 PE=3 SV=1)

HSP 1 Score: 567.4 bits (1461), Expect = 1.8e-160
Identity = 296/507 (58.38%), Postives = 367/507 (72.39%), Query Frame = 1

Query: 3   KEEFRVYNSMTQQKEVFTTKEPGKVGMYVCGITAYDFSHIGHARAAVNFDVLYRYLMHLG 62
           K +  +YN+MTQ KEV+    PGK+G+YVCGITAYD+SHIGHARAAV+FD+LYRYL HLG
Sbjct: 8   KPDLTLYNTMTQLKEVYKPMNPGKIGIYVCGITAYDYSHIGHARAAVSFDLLYRYLRHLG 67

Query: 63  YEVTYVRNFTDVDDKIIRRANESGENPLSLSERFCQEYLADMDDLQCLSPTHQPRVSDHL 122
           Y+VTYVRNFTDVDDK    A   GE PL LS RFC+EYL DM  LQCL PTHQPRVSDH+
Sbjct: 68  YQVTYVRNFTDVDDK----AKNCGEKPLDLSNRFCEEYLLDMAALQCLLPTHQPRVSDHM 127

Query: 123 EQIKDMITQIINNEYGYVVDGDVFFSVEKFPNYGQLSGQKLENHRAGERVAVDPRKRSPF 182
           EQI  MI +II N  GY V GDVFFSV+K P+YGQLSGQ+L++ +AG+RVAVD RKR+P 
Sbjct: 128 EQIIKMIEKIIENGCGYAVGGDVFFSVDKSPSYGQLSGQRLDHTQAGKRVAVDSRKRNPA 187

Query: 183 DFALWKAAKPGEPSWESPWGHGRPGWHIECSAMSAHYLTFKFDIHGGGIDLIFPHHENEV 242
           DFAL KAAK GEPSWESPWGHGRPGWHIE            FDIHGGG DL FPHHENE+
Sbjct: 188 DFALRKAAKSGEPSWESPWGHGRPGWHIE------------FDIHGGGADLKFPHHENEI 247

Query: 243 AQSCAACQESKISYWMHNGHVTNNNEKMSKSLGNFFTIRQITERYHPLVLRHFLISAHYR 302
           AQ+CAAC++S ++YW+HNGHVTNNN KM KSL NFFTIRQI   YHPL LRHFL+SA YR
Sbjct: 248 AQTCAACEDSGVNYWLHNGHVTNNNVKMGKSLNNFFTIRQIAANYHPLALRHFLMSAQYR 307

Query: 303 SPLNYTVSQLDSASDTIYYIYQTFQDCEDALLQHQGEILTEGLGKTAKKDPVSSAAEECI 362
           SPLNY+VSQL+S+SD +Y             L    E ++  +GKT +    S+ A+E I
Sbjct: 308 SPLNYSVSQLESSSDALY------------SLSPYREEMSGDVGKTQQ----SAEAKEMI 367

Query: 363 NNLRSEFQTRMADDLNTAHILTGAFQEALKFINSTLTSLKKKQPKKQQLSMIQALISVKK 422
             +++                      ALKFIN +++ LKK Q KKQ++S++ +L+ V+K
Sbjct: 368 KKVKN----------------------ALKFINVSISKLKKMQ-KKQRMSLVVSLVEVEK 427

Query: 423 ELREVLDVLGLLSSSTSSEVLQQLKDKAVKRAGMVEDDILKLIDDRTQARKNKDFARSDQ 482
            +REVLDVLGLL++ +  E+L+ +K KA+ RAGM E+++L+ I++R  ARK+KDF RSD+
Sbjct: 428 AVREVLDVLGLLTTLSYGELLKDMKQKALTRAGMGEEEVLQRIEERNMARKSKDFRRSDR 459

Query: 483 IRADLSALGIALMDVGKETVWRPCVPV 510
           IR  L+  GI L DV  +TVWRP  P+
Sbjct: 488 IRELLAFKGIFLEDVPGDTVWRPSTPL 459

BLAST of Cp4.1LG01g17650 vs. Swiss-Prot
Match: SYC_GEODF (Cysteine--tRNA ligase OS=Geobacter daltonii (strain DSM 22248 / JCM 15807 / FRC-32) GN=cysS PE=3 SV=1)

HSP 1 Score: 444.9 bits (1143), Expect = 1.3e-123
Identity = 241/503 (47.91%), Postives = 316/503 (62.82%), Query Frame = 1

Query: 7   RVYNSMTQQKEVFTTKEPGKVGMYVCGITAYDFSHIGHARAAVNFDVLYRYLMHLGYEVT 66
           RVYN+++  KE F   EPGKV MYVCG+T YD  HIGHARA V FDV+YRY  HLG +VT
Sbjct: 4   RVYNTLSGNKEEFVPVEPGKVKMYVCGVTVYDHCHIGHARANVVFDVIYRYFCHLGLDVT 63

Query: 67  YVRNFTDVDDKIIRRANESGENPLSLSERFCQEYLADMDDLQCLSPTHQPRVSDHLEQIK 126
           YVRN+TD+DDKII RAN  G     +SERF +E+  DM+ L    PT QP+ ++H+++I 
Sbjct: 64  YVRNYTDIDDKIINRANREGVTYDLISERFIKEFDRDMERLGLKLPTCQPKATEHIDEII 123

Query: 127 DMITQIINNEYGYVVDGDVFFSVEKFPNYGQLSGQKLENHRAGERVAVDPRKRSPFDFAL 186
            ++  +I+ ++ Y   GDV F VEKF +Y +LSG+ LE+ +AG R+ VD RKR P DFAL
Sbjct: 124 SLVQTLIDKDFAYQAGGDVNFCVEKFDSYLKLSGRTLEDMQAGARIEVDERKRHPMDFAL 183

Query: 187 WKAAKPGEPSWESPWGHGRPGWHIECSAMSAHYLTFKFDIHGGGIDLIFPHHENEVAQSC 246
           WK AKPGEP WESPWG GRPGWHIECSAMS  YL   FDIHGGG DLIFPHHENE+AQS 
Sbjct: 184 WKEAKPGEPFWESPWGKGRPGWHIECSAMSMKYLGTTFDIHGGGKDLIFPHHENEIAQSE 243

Query: 247 AACQESKISYWMHNGHVTNNNEKMSKSLGNFFTIRQITERYHPLVLRHFLISAHYRSPLN 306
           AA  +  ++YW+HNG V  N+EKMSKSLGNFFTI+++ +RY   VLR FL+SAHYRSP++
Sbjct: 244 AATGKPFVNYWLHNGFVNINSEKMSKSLGNFFTIKEVLDRYDNEVLRFFLLSAHYRSPID 303

Query: 307 YTVSQLDSASDTIYYIYQTFQDCEDALLQHQGEILTEGLGKTAKKDPVSSAAE------E 366
           ++   L  A   +  IY+     E        E L  G G T      SS  E      +
Sbjct: 304 FSDQNLTEAEAGLERIYKALAAVE--------ETLAAGNGCTGAPVDASSLNEAEGELFD 363

Query: 367 CINNLRSEFQTRMADDLNTAHILTGAFQEALKFINSTLTSLKKKQPKKQQLSMIQALISV 426
              ++ + F   M DD NTA  +   F + ++ +N  L+           L  +     +
Sbjct: 364 KTTSISARFGEAMDDDFNTALAMAHVF-DLVRCVNRVLSETAGASDNICSLCTL-----I 423

Query: 427 KKELREVLDVLGLLSSSTSSEVLQQLKDKAVKRAGMVEDDILKLIDDRTQARKNKDFARS 486
           K E+ ++  VLG+ SS  +S  L++LK +      +  D+I +LI +RT ARK KDF RS
Sbjct: 424 KAEVAKIAGVLGIFSSKPAS-FLERLKSRKAGNLDIAVDEIERLIAERTAARKAKDFKRS 483

Query: 487 DQIRADLSALGIALMDVGKETVW 504
           D+IR  L+A  I L+D  + T W
Sbjct: 484 DEIRDQLAAKNIVLLDSQQGTTW 491

BLAST of Cp4.1LG01g17650 vs. Swiss-Prot
Match: SYC_PELPD (Cysteine--tRNA ligase OS=Pelobacter propionicus (strain DSM 2379) GN=cysS PE=3 SV=1)

HSP 1 Score: 443.4 bits (1139), Expect = 3.9e-123
Identity = 235/497 (47.28%), Postives = 316/497 (63.58%), Query Frame = 1

Query: 7   RVYNSMTQQKEVFTTKEPGKVGMYVCGITAYDFSHIGHARAAVNFDVLYRYLMHLGYEVT 66
           R+YN++T +K+ F    PGK GMYVCG+T YD+ HIGHARA V FDV+YRYL + GY VT
Sbjct: 4   RIYNTLTGEKDTFVPLHPGKAGMYVCGVTVYDYCHIGHARANVVFDVIYRYLGYSGYAVT 63

Query: 67  YVRNFTDVDDKIIRRANESGENPLSLSERFCQEYLADMDDLQCLSPTHQPRVSDHLEQIK 126
           YVRNFTD+DDKII RAN+ G +  ++SER+ + +  DM  L    PT +P+ +DH+  I 
Sbjct: 64  YVRNFTDIDDKIINRANQEGVDYTTISERYIEAFNQDMARLGLAKPTVEPKATDHMGGII 123

Query: 127 DMITQIINNEYGYVVDGDVFFSVEKFPNYGQLSGQKLENHRAGERVAVDPRKRSPFDFAL 186
            +I  +I   + Y  DGDV+++VE FP+Y +LSG+ LE+  AG RV VD RKR+P DFAL
Sbjct: 124 SVIETLIAKGHAYESDGDVYYAVESFPSYLRLSGRNLEDMLAGARVEVDDRKRNPMDFAL 183

Query: 187 WKAAKPGEPSWESPWGHGRPGWHIECSAMSAHYLTFKFDIHGGGIDLIFPHHENEVAQSC 246
           WK +KPGEPSW+SPWG GRPGWHIECSAMS  YL   FDIHGGG DL+FPHHENE+AQS 
Sbjct: 184 WKGSKPGEPSWDSPWGAGRPGWHIECSAMSMEYLGKTFDIHGGGKDLVFPHHENEIAQSE 243

Query: 247 AACQESKISYWMHNGHVTNNNEKMSKSLGNFFTIRQITERYHPLVLRHFLISAHYRSPLN 306
           AA     + YWMHNG V  N+EKMSKSLGNFFTIR++ E+Y P  LR F++SAHYRSP++
Sbjct: 244 AANGCQFVRYWMHNGFVNINSEKMSKSLGNFFTIREVLEQYDPETLRFFILSAHYRSPID 303

Query: 307 YTVSQLDSASDTIYYIYQTFQDCEDALLQHQGEILTEGLGKTAKKDPVSSAAEECINNLR 366
           ++   L+ A   +  IY      + A+   +G+ +     + A   P  +   E + +L 
Sbjct: 304 FSDQNLNDAQAGLERIYSCLAAVDGAM---EGQDVPNQPVEGAPLPPAGAELHEKLQSLI 363

Query: 367 SEFQTRMADDLNTAHILTGAFQEALKFINSTLTSLKKKQPKKQQLSMIQALISVKKELRE 426
           S F+  M DD NTA  L G   EA++  N  +     + P     + +  L  V++   E
Sbjct: 364 SRFREAMDDDFNTAQAL-GVLFEAVRATNRFMAESGDQTP-----ATLALLGQVRRLFAE 423

Query: 427 VLDVLGLLSSSTSSEVLQQLKDKAVKRAGMVEDDILKLIDDRTQARKNKDFARSDQIRAD 486
             DVLGL +S  ++  L+ +K     +  +   +I +LI +R  AR N+DF R D+IR  
Sbjct: 424 TGDVLGLFTSQPAA-WLESIKQAKSDQMEISPQEIEQLIAERAAARTNRDFKRGDEIRDL 483

Query: 487 LSALGIALMDVGKETVW 504
           L   GI L+D  + T W
Sbjct: 484 LLQKGIQLLDSPQGTTW 490

BLAST of Cp4.1LG01g17650 vs. TrEMBL
Match: M5XD29_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004241mg PE=3 SV=1)

HSP 1 Score: 818.9 bits (2114), Expect = 3.8e-234
Identity = 396/515 (76.89%), Postives = 445/515 (86.41%), Query Frame = 1

Query: 2   AKEEFRVYNSMTQQKEVFTTKEPGKVGMYVCGITAYDFSHIGHARAAVNFDVLYRYLMHL 61
           +KEEF VYNSMT+QKE F  K PGKVGMYVCG+TAY  SH+GH RAAVNFDVLYRYL HL
Sbjct: 4   SKEEFMVYNSMTRQKESFKPKVPGKVGMYVCGVTAYALSHLGHGRAAVNFDVLYRYLQHL 63

Query: 62  GYEVTYVRNFTDVDDKIIRRANESGENPLSLSERFCQEYLADMDDLQCLSPTHQPRVSDH 121
           GYEVTYVRNFTDVDDKII RANE GE+PLSLS RFCQEYL DM DLQCL PTHQPRVSDH
Sbjct: 64  GYEVTYVRNFTDVDDKIINRANEVGEDPLSLSNRFCQEYLKDMGDLQCLLPTHQPRVSDH 123

Query: 122 LEQIKDMITQIINNEYGYVVDGDVFFSVEKFPNYGQLSGQKLENHRAGERVAVDPRKRSP 181
           +E IKD+ITQIIN +Y Y VDGDVFF+VEKFPNYGQLSGQ+LE++RAGERVAVD RKR+P
Sbjct: 124 MEHIKDLITQIINKDYAYAVDGDVFFAVEKFPNYGQLSGQRLEHNRAGERVAVDSRKRNP 183

Query: 182 FDFALWKAAKPGEPSWESPWGHGRPGWHIECSAMSAHYLTFKFDIHGGGIDLIFPHHENE 241
            DFALWK+AKPGEPSW+SPWG GRPGWHIECSAMSAHYLTF FDIHGGGIDLIFPHHENE
Sbjct: 184 ADFALWKSAKPGEPSWDSPWGPGRPGWHIECSAMSAHYLTFNFDIHGGGIDLIFPHHENE 243

Query: 242 VAQSCAACQESKISYWMHNGHVTNNNEKMSKSLGNFFTIRQITERYHPLVLRHFLISAHY 301
           +AQSCAACQES +SYWMHNGHVTNNNEKMSKSLGNFFTI +ITERYHPL LRHFLISAHY
Sbjct: 244 IAQSCAACQESSVSYWMHNGHVTNNNEKMSKSLGNFFTISEITERYHPLALRHFLISAHY 303

Query: 302 RSPLNYTVSQLDSASDTIYYIYQTFQDCEDALLQHQGEILTEGLGKTAKKDPVSSAAEEC 361
           RSPLNYTVSQL+ +SD +YYIYQT QDCEDAL   Q   L EG  K  +   ++ AA+EC
Sbjct: 304 RSPLNYTVSQLEGSSDAVYYIYQTLQDCEDALSPFQEGSLKEGTEKNGRTVKITPAAQEC 363

Query: 362 INNLRSEFQTRMADDLNTAHILTGAFQEALKFINSTLTSLKKKQPKKQQLSMIQALISVK 421
           I+ L +EF+T+M DDLNTAHILTGAFQ+ALK INS+L  LKKKQ ++QQL MIQ+L+ +K
Sbjct: 364 ISKLHNEFETKMCDDLNTAHILTGAFQDALKLINSSLNLLKKKQQRQQQLLMIQSLVEIK 423

Query: 422 KELREVLDVLGLLSSSTSSEVLQQLKDKAVKRAGMVEDDILKLIDDRTQARKNKDFARSD 481
           KE++E+L++LGLLSS T SEVLQQ K KA+KRAG+VEDD+L  I +RT ARKNKDFA+SD
Sbjct: 424 KEVQELLNILGLLSSDTYSEVLQQFKMKALKRAGLVEDDVLDQIKERTLARKNKDFAKSD 483

Query: 482 QIRADLSALGIALMDVGKETVWRPCVPVELAPPPP 517
           QIRA L+  GIALMD+GKET+WRPCVP    P  P
Sbjct: 484 QIRAYLTTKGIALMDLGKETIWRPCVPAGEQPSLP 518

BLAST of Cp4.1LG01g17650 vs. TrEMBL
Match: B9HWF9_POPTR (tRNA synthetase class 1 family protein OS=Populus trichocarpa GN=POPTR_0010s13240g PE=3 SV=1)

HSP 1 Score: 803.5 bits (2074), Expect = 1.6e-229
Identity = 390/531 (73.45%), Postives = 450/531 (84.75%), Query Frame = 1

Query: 4   EEFRVYNSMTQQKEVFTTKEPGKVGMYVCGITAYDFSHIGHARAAVNFDVLYRYLMHLGY 63
           EE ++YNSMTQQKEVF ++ PGKV MYVCG+T+YDFSH+GHARAAV FD+L+RYL HLGY
Sbjct: 5   EELKLYNSMTQQKEVFKSRIPGKVSMYVCGVTSYDFSHLGHARAAVAFDILFRYLQHLGY 64

Query: 64  EVTYVRNFTDVDDKIIRRANESGENPLSLSERFCQEYLADMDDLQCLSPTHQPRVSDHLE 123
           EVTYVRNFTD+DDKIIRRANE GE+PLSLS RFC+EYL DM DLQCL PTHQPRV+DH+E
Sbjct: 65  EVTYVRNFTDIDDKIIRRANEIGEDPLSLSSRFCEEYLVDMTDLQCLIPTHQPRVTDHVE 124

Query: 124 QIKDMITQIINNEYGYVVDGDVFFSVEKFPNYGQLSGQKLENHRAGERVAVDPRKRSPFD 183
           QIKDMITQII  +  Y V+GDVFF+V K PNYGQLSGQ+LEN+RAGERVAVD RKR+P D
Sbjct: 125 QIKDMITQIIEKDCAYAVEGDVFFAVNKSPNYGQLSGQRLENNRAGERVAVDSRKRNPAD 184

Query: 184 FALWKAAKPGEPSWESPWGHGRPGWHIECSAMSAHYLTFKFDIHGGGIDLIFPHHENEVA 243
           FALWKAAKPGEPSWESPWG GRPGWHIECSAMSA YLTFKFDIHGGGIDLIFPHHENE+A
Sbjct: 185 FALWKAAKPGEPSWESPWGPGRPGWHIECSAMSAQYLTFKFDIHGGGIDLIFPHHENEIA 244

Query: 244 QSCAACQESKISYWMHNGHVTNNNEKMSKSLGNFFTIRQITERYHPLVLRHFLISAHYRS 303
           QSCAAC+ES +SYWMHNGHVTNNNEKMSKSLGNFFTIRQITERYHPL LRHFLISAHYRS
Sbjct: 245 QSCAACEESSVSYWMHNGHVTNNNEKMSKSLGNFFTIRQITERYHPLALRHFLISAHYRS 304

Query: 304 PLNYTVSQLDSASDTIYYIYQTFQDCEDALLQHQGEILTEGLGKTAKKDPVSSAAEECIN 363
           PLNY+VSQL+S+SD ++YIYQT QDCEDALL  Q   L EG G+ A    +++ A++CI+
Sbjct: 305 PLNYSVSQLESSSDAVFYIYQTLQDCEDALLPFQEGSLKEGAGQNANLVAITADAQKCIS 364

Query: 364 NLRSEFQTRMADDLNTAHILTGAFQEALKFINSTLTSLKKKQPKKQQLSMIQALISVKKE 423
            L  +F+T+M+DDLNT+ +LTGAFQEALK +N +L  LKKKQ KKQQLS+I+++  VKKE
Sbjct: 365 RLHEDFETKMSDDLNTSPLLTGAFQEALKVVNGSLGMLKKKQQKKQQLSLIRSVTEVKKE 424

Query: 424 LREVLDVLGLLSSSTSSEVLQQLKDKAVKRAGMVEDDILKLIDDRTQARKNKDFARSDQI 483
           + EVL +LGL    T +EVLQQLK KA+KRAG+ EDD++ LI+DR  ARK++DF +SDQI
Sbjct: 425 VTEVLRILGLFPPCTCAEVLQQLKGKALKRAGLTEDDVMSLIEDRAVARKSQDFKKSDQI 484

Query: 484 RADLSALGIALMDVGKETVWRPCVPVELAPPPPPSEENKSTPLAEKKPTEQ 535
           R DLSA GIALMDVGKETVWRPCVPVE       +EE   T + E  P  Q
Sbjct: 485 RTDLSARGIALMDVGKETVWRPCVPVE-------NEEKAKTVVEEPTPPPQ 528

BLAST of Cp4.1LG01g17650 vs. TrEMBL
Match: V4W6G1_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10014843mg PE=3 SV=1)

HSP 1 Score: 802.7 bits (2072), Expect = 2.8e-229
Identity = 390/516 (75.58%), Postives = 450/516 (87.21%), Query Frame = 1

Query: 7   RVYNSMTQQKEVFTTKEPGKVGMYVCGITAYDFSHIGHARAAVNFDVLYRYLMHLGYEVT 66
           ++YNSMTQQKE+FT   PGKVGMY+CG+TAYD SH+GHARAAV+FD+LYRYL HL YEVT
Sbjct: 20  KIYNSMTQQKELFTPIVPGKVGMYICGVTAYDLSHLGHARAAVSFDLLYRYLEHLKYEVT 79

Query: 67  YVRNFTDVDDKIIRRANESGENPLSLSERFCQEYLADMDDLQCLSPTHQPRVSDHLEQIK 126
           YVRNFTDVDDKIIRRAN+ GENPLSLS R+CQEYL DM DLQCL PT+QPRVSDH+ QIK
Sbjct: 80  YVRNFTDVDDKIIRRANDLGENPLSLSNRYCQEYLVDMADLQCLPPTYQPRVSDHMGQIK 139

Query: 127 DMITQIINNEYGYVVDGDVFFSVEKFPNYGQLSGQKLENHRAGERVAVDPRKRSPFDFAL 186
           DMITQIINN+  YVV+GDVFF+VEK PNYG+LSGQ+LEN+RAGERVAVD RKR+P DFAL
Sbjct: 140 DMITQIINNDCAYVVEGDVFFAVEKSPNYGRLSGQRLENNRAGERVAVDSRKRNPADFAL 199

Query: 187 WKAAKPGEPSWESPWGHGRPGWHIECSAMSAHYLTFKFDIHGGGIDLIFPHHENEVAQSC 246
           WKAAK GEPSW+SPWG GRPGWHIECSAMSAHYL+ KFDIHGGGIDLIFPHHENE+AQSC
Sbjct: 200 WKAAKAGEPSWDSPWGPGRPGWHIECSAMSAHYLSSKFDIHGGGIDLIFPHHENEIAQSC 259

Query: 247 AACQESKISYWMHNGHVTNNNEKMSKSLGNFFTIRQITERYHPLVLRHFLISAHYRSPLN 306
           AACQ+S +SYWMHNGHVTNNNEKMSKSLGNFFTIRQITERYHPL LRHFLISAHYRSPLN
Sbjct: 260 AACQDSNVSYWMHNGHVTNNNEKMSKSLGNFFTIRQITERYHPLALRHFLISAHYRSPLN 319

Query: 307 YTVSQLDSASDTIYYIYQTFQDCEDALLQHQGEILTEGLGKTAKKDPVSSAAEECINNLR 366
           Y+V QLDSASD ++YIYQT QDCE AL   Q        GKTA+   ++SAAE+CIN LR
Sbjct: 320 YSVLQLDSASDAVFYIYQTLQDCEVALSPFQEH------GKTAR---INSAAEDCINKLR 379

Query: 367 SEFQTRMADDLNTAHILTGAFQEALKFINSTLTSLKKKQPKKQQLSMIQALISVKKELRE 426
            EF  RM+DDLNT+HILTGAFQ+ALKFINS+L  LKKKQPK+QQLS+I++L  ++ E++E
Sbjct: 380 DEFHARMSDDLNTSHILTGAFQDALKFINSSLNMLKKKQPKQQQLSLIESLRKIENEVKE 439

Query: 427 VLDVLGLLSSSTSSEVLQQLKDKAVKRAGMVEDDILKLIDDRTQARKNKDFARSDQIRAD 486
           VL +LGLL     SEVLQQLKDKA+KRA ++E+D+L+LI++R  ARKNKDF++SDQIRAD
Sbjct: 440 VLRILGLLPPGAYSEVLQQLKDKALKRAELMEEDVLRLIEERATARKNKDFSKSDQIRAD 499

Query: 487 LSALGIALMDVGKETVWRPCVPVELAPPPPPSEENK 523
           L+  GIALMD+GKET+WRPCVPVE     PP+E+ +
Sbjct: 500 LTRNGIALMDMGKETIWRPCVPVEQEQEAPPAEKEQ 526

BLAST of Cp4.1LG01g17650 vs. TrEMBL
Match: A0A061GQF5_THECC (Cysteinyl-tRNA synthetase OS=Theobroma cacao GN=TCM_038835 PE=3 SV=1)

HSP 1 Score: 798.9 bits (2062), Expect = 4.0e-228
Identity = 389/514 (75.68%), Postives = 452/514 (87.94%), Query Frame = 1

Query: 5   EFRVYNSMTQQKEVFTTKEPGKVGMYVCGITAYDFSHIGHARAAVNFDVLYRYLMHLGYE 64
           +F VY++MTQQKEVF  K PGKVGMYVCG+TAYDFSH+GHARAAV FDVLYRYL HLGYE
Sbjct: 26  QFVVYSTMTQQKEVFKPKIPGKVGMYVCGVTAYDFSHLGHARAAVAFDVLYRYLQHLGYE 85

Query: 65  VTYVRNFTDVDDKIIRRANESGENPLSLSERFCQEYLADMDDLQCLSPTHQPRVSDHLEQ 124
           VTYVRNFTDVDDKIIRRANE+GE+PLSLS+R+C+EY  DM DLQCLSPTH+PRVSDHLEQ
Sbjct: 86  VTYVRNFTDVDDKIIRRANETGEDPLSLSDRYCKEYNVDMADLQCLSPTHEPRVSDHLEQ 145

Query: 125 IKDMITQIINNEYGYVVDGDVFFSVEKFPNYGQLSGQKLENHRAGERVAVDPRKRSPFDF 184
           IKDMITQIIN ++GYVVDGDVFF+V+KFPNYG+LSGQKLEN+RAGERVAVD RKR+P DF
Sbjct: 146 IKDMITQIINKDFGYVVDGDVFFAVDKFPNYGKLSGQKLENNRAGERVAVDSRKRNPSDF 205

Query: 185 ALWKAAKPGEPSWESPWGHGRPGWHIECSAMSAHYLTFKFDIHGGGIDLIFPHHENEVAQ 244
           ALWKAAKPGEPSW+SPWGHGRPGWHIECSAMSAHYL+FKFDIHGGG+DLIFPHHENE+AQ
Sbjct: 206 ALWKAAKPGEPSWDSPWGHGRPGWHIECSAMSAHYLSFKFDIHGGGLDLIFPHHENEIAQ 265

Query: 245 SCAACQESKISYWMHNGHVTNNNEKMSKSLGNFFTIRQITERYHPLVLRHFLISAHYRSP 304
           SCAACQES +SYWMHNGHVTNNNEKMSKSLGNFFTIRQITERYHPL LR+FLISAHYRSP
Sbjct: 266 SCAACQESDVSYWMHNGHVTNNNEKMSKSLGNFFTIRQITERYHPLALRYFLISAHYRSP 325

Query: 305 LNYTVSQLDSASDTIYYIYQTFQDCEDALLQHQGEILTEGLGKTAKKDPVSSAAEECINN 364
           LNY+V QL+ AS+ ++YIYQT +DC+DALLQ Q E   +  GK A+  P    A+ECI+ 
Sbjct: 326 LNYSVVQLEGASEAVFYIYQTLKDCQDALLQLQEERPKD--GKPARTTP---DAQECISK 385

Query: 365 LRSEFQTRMADDLNTAHILTGAFQEALKFINSTLTSLKKKQPKKQQLSMIQALISVKKEL 424
           L SEFQ +M+DDL+T+ ILTGAF EALK IN+ LT LKKKQ K+Q+LS+IQ+L  V+KE+
Sbjct: 386 LCSEFQAKMSDDLSTSLILTGAFLEALKLINNLLTMLKKKQQKQQRLSVIQSLTEVEKEV 445

Query: 425 REVLDVLGLLSSSTSSEVLQQLKDKAVKRAGMVEDDILKLIDDRTQARKNKDFARSDQIR 484
           ++VLDVLGL    + +EVL QL+D+A+ RAG+VEDD+L+LI +R + R+NKDF +SDQ+R
Sbjct: 446 KKVLDVLGLQPPCSYAEVLLQLRDRALTRAGLVEDDVLRLISERVEVRRNKDFLKSDQMR 505

Query: 485 ADLSALGIALMDVGKETVWRPCVPV----ELAPP 515
           ADL A GIALMDVG ET+WRPCVPV    E+ PP
Sbjct: 506 ADLQAKGIALMDVGTETIWRPCVPVQQELEVVPP 534

BLAST of Cp4.1LG01g17650 vs. TrEMBL
Match: A0A067GNP2_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g0093292mg PE=3 SV=1)

HSP 1 Score: 798.5 bits (2061), Expect = 5.3e-228
Identity = 391/516 (75.78%), Postives = 447/516 (86.63%), Query Frame = 1

Query: 7   RVYNSMTQQKEVFTTKEPGKVGMYVCGITAYDFSHIGHARAAVNFDVLYRYLMHLGYEVT 66
           ++YNSMTQQKE+FT   PGKVGMYVCG+TAYD SH+GHARAAV+FD+LYRYL HL  EVT
Sbjct: 20  KIYNSMTQQKELFTPIVPGKVGMYVCGVTAYDLSHLGHARAAVSFDLLYRYLEHLKCEVT 79

Query: 67  YVRNFTDVDDKIIRRANESGENPLSLSERFCQEYLADMDDLQCLSPTHQPRVSDHLEQIK 126
           YVRNFTDVDDKIIRRAN+ GENPLSLS R+CQEYL DM DLQCL PT+QPRVSDH+EQIK
Sbjct: 80  YVRNFTDVDDKIIRRANDLGENPLSLSNRYCQEYLVDMADLQCLPPTYQPRVSDHMEQIK 139

Query: 127 DMITQIINNEYGYVVDGDVFFSVEKFPNYGQLSGQKLENHRAGERVAVDPRKRSPFDFAL 186
           DMITQIINN+  YVV+GDVFF+VEK PNYG+LSGQ+LEN+RAGERVAVD RKR+P DFAL
Sbjct: 140 DMITQIINNDCAYVVEGDVFFAVEKSPNYGRLSGQRLENNRAGERVAVDSRKRNPADFAL 199

Query: 187 WKAAKPGEPSWESPWGHGRPGWHIECSAMSAHYLTFKFDIHGGGIDLIFPHHENEVAQSC 246
           WKAAK GEPSW+SPWG GRPGWHIECSAMSAHYL+ KFDIHGGGIDLIFPHHENE+AQSC
Sbjct: 200 WKAAKAGEPSWDSPWGPGRPGWHIECSAMSAHYLSSKFDIHGGGIDLIFPHHENEIAQSC 259

Query: 247 AACQESKISYWMHNGHVTNNNEKMSKSLGNFFTIRQITERYHPLVLRHFLISAHYRSPLN 306
           AACQ+S +SYWMHNGHVTNNNEKMSKSLGNFFTIRQITERYHPL LRHFLISAHYRSPLN
Sbjct: 260 AACQDSNVSYWMHNGHVTNNNEKMSKSLGNFFTIRQITERYHPLALRHFLISAHYRSPLN 319

Query: 307 YTVSQLDSASDTIYYIYQTFQDCEDALLQHQGEILTEGLGKTAKKDPVSSAAEECINNLR 366
           Y+V QLDSASD ++YIYQT QDCE AL   Q        GKTA+ +P   AAE+CIN LR
Sbjct: 320 YSVLQLDSASDAVFYIYQTLQDCEVALSPFQEH------GKTARINP---AAEDCINKLR 379

Query: 367 SEFQTRMADDLNTAHILTGAFQEALKFINSTLTSLKKKQPKKQQLSMIQALISVKKELRE 426
            EF  RM+DDLNT+HILTGAFQ+ALKFINS+L  LKKKQPK+QQLS+I++L  ++ E++E
Sbjct: 380 DEFHARMSDDLNTSHILTGAFQDALKFINSSLNMLKKKQPKQQQLSLIESLRKIENEVKE 439

Query: 427 VLDVLGLLSSSTSSEVLQQLKDKAVKRAGMVEDDILKLIDDRTQARKNKDFARSDQIRAD 486
           VL +LGLL     SEVLQQLKDKA+KRA + E+D+L+LI++R  ARKNKDF++SDQIRAD
Sbjct: 440 VLRILGLLPPGAYSEVLQQLKDKALKRAELTEEDVLQLIEERAAARKNKDFSKSDQIRAD 499

Query: 487 LSALGIALMDVGKETVWRPCVPVELAPPPPPSEENK 523
           L+  GIALMD+GKET+WRPCV VE     PP+E+ K
Sbjct: 500 LTRKGIALMDMGKETIWRPCVLVEQEQEAPPAEKEK 526

BLAST of Cp4.1LG01g17650 vs. TAIR10
Match: AT5G38830.1 (AT5G38830.1 Cysteinyl-tRNA synthetase, class Ia family protein)

HSP 1 Score: 682.9 bits (1761), Expect = 1.6e-196
Identity = 332/506 (65.61%), Postives = 417/506 (82.41%), Query Frame = 1

Query: 3   KEEFRVYNSMTQQKEVFTTKEPGKVGMYVCGITAYDFSHIGHARAAVNFDVLYRYLMHLG 62
           K E ++YN+MTQQKEV     PGK+G+YVCGITAYDFSHIGHARAAV+FDVLYRYL HL 
Sbjct: 5   KMELKLYNTMTQQKEVLIPITPGKIGLYVCGITAYDFSHIGHARAAVSFDVLYRYLKHLD 64

Query: 63  YEVTYVRNFTDVDDKIIRRANESGENPLSLSERFCQEYLADMDDLQCLSPTHQPRVSDHL 122
           Y+VT+VRNFTDVDDKII RAN++GE+PL LS RFC EYL DM  LQCL PTHQPRVS+H+
Sbjct: 65  YDVTFVRNFTDVDDKIIDRANKNGEDPLDLSNRFCDEYLVDMGALQCLPPTHQPRVSEHM 124

Query: 123 EQIKDMITQIINNEYGYVVDGDVFFSVEKFPNYGQLSGQKLENHRAGERVAVDPRKRSPF 182
           + I  MI +II  + GYVV+GDVFFSV+K PNYG+LSGQ LE+ RAGERVAVD RKR+P 
Sbjct: 125 DNIIKMIEKIIEKDCGYVVEGDVFFSVDKSPNYGKLSGQLLEHTRAGERVAVDSRKRNPA 184

Query: 183 DFALWKAAKPGEPSWESPWGHGRPGWHIECSAMSAHYLTFKFDIHGGGIDLIFPHHENEV 242
           DFALWKAAKP EPSWESPWG GRPGWHIECSAMS HYL+ KFDIHGGG DL FPHHENE+
Sbjct: 185 DFALWKAAKPDEPSWESPWGPGRPGWHIECSAMSVHYLSPKFDIHGGGADLKFPHHENEI 244

Query: 243 AQSCAACQESKISYWMHNGHVTNNNEKMSKSLGNFFTIRQITERYHPLVLRHFLISAHYR 302
           AQ+CAAC++S ++YW+HNGHVT NNEKM+KS  NF TIR+IT  YHPL LRHFL+SA YR
Sbjct: 245 AQTCAACEDSGVNYWLHNGHVTINNEKMAKSKHNFKTIREITASYHPLALRHFLMSAQYR 304

Query: 303 SPLNYTVSQLDSASDTIYYIYQTFQDCEDALLQHQGEILTEGLGKTAKKDPVSSAAEECI 362
           SPL++T SQL+S+S+ +YY+YQT QD ++ L  +Q + L+E  GK+ +    ++  ++ I
Sbjct: 305 SPLSFTASQLESSSEALYYVYQTLQDLDEGLSPYQ-DALSEDGGKSEQ----TAEGKDII 364

Query: 363 NNLRSEFQTRMADDLNTAHILTGAFQEALKFINSTLTSLKKKQPKKQQLSMIQALISVKK 422
             L++EF+++M DDLNTAHILTGA+Q+ALKFIN++L+ LKK Q KKQ++SM+ +L+ ++K
Sbjct: 365 KKLKTEFESKMLDDLNTAHILTGAYQDALKFINASLSKLKKMQ-KKQRMSMLVSLVEIEK 424

Query: 423 ELREVLDVLGLLSSSTSSEVLQQLKDKAVKRAGMVEDDILKLIDDRTQARKNKDFARSDQ 482
             REVLDVLGLL++ + +E+L+++K K + RA + E+ I +LI++R  ARKNKDFA+SD+
Sbjct: 425 AAREVLDVLGLLTTLSYAEILKEMKLKTLIRAEIGEEGISQLIEERITARKNKDFAKSDE 484

Query: 483 IRADLSALGIALMDVGKETVWRPCVP 509
           IR  L+  GIALMD+GKETVWRPC P
Sbjct: 485 IREKLTRKGIALMDIGKETVWRPCFP 504

BLAST of Cp4.1LG01g17650 vs. TAIR10
Match: AT2G31170.1 (AT2G31170.1 Cysteinyl-tRNA synthetase, class Ia family protein)

HSP 1 Score: 606.3 bits (1562), Expect = 1.9e-173
Identity = 303/511 (59.30%), Postives = 383/511 (74.95%), Query Frame = 1

Query: 4   EEFRVYNSMTQQKEVFTTKEPGKVGMYVCGITAYDFSHIGHARAAVNFDVLYRYLMHLGY 63
           +E  ++N+M+++KE+F  K  GKVGMYVCG+TAYD SHIGHAR  V FDVL RYL HLGY
Sbjct: 63  KELWLHNTMSRKKELFKPKVEGKVGMYVCGVTAYDLSHIGHARVYVTFDVLLRYLKHLGY 122

Query: 64  EVTYVRNFTDVDDKIIRRANESGENPLSLSERFCQEYLADMDDLQCLSPTHQPRVSDHLE 123
           EV+YVRNFTDVDDKII RA E  E+P+SLS RFC+E+  DM+ LQCL P+ QPRVSDH+ 
Sbjct: 123 EVSYVRNFTDVDDKIIARAKELEEDPISLSRRFCEEFNRDMEQLQCLDPSVQPRVSDHIP 182

Query: 124 QIKDMITQIINNEYGYVVDGDVFFSVEKFPNYGQLSGQKLENHRAGERVAVDPRKRSPFD 183
           QI D+I QI++N Y Y VDGD++FSV+KFP YG+LSG+KLE++RAGERVAVD RK+ P D
Sbjct: 183 QIIDLIKQILDNGYAYKVDGDIYFSVDKFPTYGKLSGRKLEDNRAGERVAVDTRKKHPAD 242

Query: 184 FALWKAAKPGEPSWESPWGHGRPGWHIECSAMSAHYLTFKFDIHGGGIDLIFPHHENEVA 243
           FALWKAAK GEP WESPWG GRPGWHIECSAMSA YL + FDIHGGG+DL+FPHHENE+A
Sbjct: 243 FALWKAAKEGEPFWESPWGRGRPGWHIECSAMSAAYLGYSFDIHGGGMDLVFPHHENEIA 302

Query: 244 QSCAACQESKISYWMHNGHVTNNNEKMSKSLGNFFTIRQITERYHPLVLRHFLISAHYRS 303
           QSCAAC  S ISYW+HNG VT ++EKMSKSLGNFFTIRQ+ + YHPL LR FL+  HYRS
Sbjct: 303 QSCAACDSSNISYWIHNGFVTVDSEKMSKSLGNFFTIRQVIDLYHPLALRLFLMGTHYRS 362

Query: 304 PLNYTVSQLDSASDTIYYIYQTFQDCEDALLQHQGEILTEGLGKTAKKDPVSSAAEECIN 363
           P+NY+   L+SAS+ I+YIYQT  DCE AL +            T +   V S     IN
Sbjct: 363 PINYSDFLLESASERIFYIYQTLHDCESALGEKDS---------TFENGSVPSDTLTSIN 422

Query: 364 NLRSEFQTRMADDLNTAHILTGAFQEALKFINSTLTSLKKKQPKKQQLSMIQALISVKKE 423
             R+EF   M+DDL T  +   A  E LK IN  + + K K+  +++    ++L +++  
Sbjct: 423 TFRTEFVASMSDDLLTP-VTLAAMSEPLKTINDLIHTRKGKKQARRE----ESLKALETT 482

Query: 424 LREVLDVLGLLSSSTSSEVLQQLKDKAVKRAGMVEDDILKLIDDRTQARKNKDFARSDQI 483
           +R+VL +LGL+ +S S EVL+QLK+KA+KRAG+ E+D+L+ + +RT ARKNK++ RSD I
Sbjct: 483 IRDVLTILGLMPTSYS-EVLEQLKEKALKRAGLKEEDVLQRVQERTDARKNKEYERSDAI 542

Query: 484 RADLSALGIALMDVGKETVWRPCVPVELAPP 515
           R DL+ +GIALMD  + T WRP +P+ L  P
Sbjct: 543 RKDLAKVGIALMDSPEGTTWRPAIPLALQEP 558

BLAST of Cp4.1LG01g17650 vs. TAIR10
Match: AT3G56300.1 (AT3G56300.1 Cysteinyl-tRNA synthetase, class Ia family protein)

HSP 1 Score: 567.4 bits (1461), Expect = 1.0e-161
Identity = 296/507 (58.38%), Postives = 367/507 (72.39%), Query Frame = 1

Query: 3   KEEFRVYNSMTQQKEVFTTKEPGKVGMYVCGITAYDFSHIGHARAAVNFDVLYRYLMHLG 62
           K +  +YN+MTQ KEV+    PGK+G+YVCGITAYD+SHIGHARAAV+FD+LYRYL HLG
Sbjct: 8   KPDLTLYNTMTQLKEVYKPMNPGKIGIYVCGITAYDYSHIGHARAAVSFDLLYRYLRHLG 67

Query: 63  YEVTYVRNFTDVDDKIIRRANESGENPLSLSERFCQEYLADMDDLQCLSPTHQPRVSDHL 122
           Y+VTYVRNFTDVDDK    A   GE PL LS RFC+EYL DM  LQCL PTHQPRVSDH+
Sbjct: 68  YQVTYVRNFTDVDDK----AKNCGEKPLDLSNRFCEEYLLDMAALQCLLPTHQPRVSDHM 127

Query: 123 EQIKDMITQIINNEYGYVVDGDVFFSVEKFPNYGQLSGQKLENHRAGERVAVDPRKRSPF 182
           EQI  MI +II N  GY V GDVFFSV+K P+YGQLSGQ+L++ +AG+RVAVD RKR+P 
Sbjct: 128 EQIIKMIEKIIENGCGYAVGGDVFFSVDKSPSYGQLSGQRLDHTQAGKRVAVDSRKRNPA 187

Query: 183 DFALWKAAKPGEPSWESPWGHGRPGWHIECSAMSAHYLTFKFDIHGGGIDLIFPHHENEV 242
           DFAL KAAK GEPSWESPWGHGRPGWHIE            FDIHGGG DL FPHHENE+
Sbjct: 188 DFALRKAAKSGEPSWESPWGHGRPGWHIE------------FDIHGGGADLKFPHHENEI 247

Query: 243 AQSCAACQESKISYWMHNGHVTNNNEKMSKSLGNFFTIRQITERYHPLVLRHFLISAHYR 302
           AQ+CAAC++S ++YW+HNGHVTNNN KM KSL NFFTIRQI   YHPL LRHFL+SA YR
Sbjct: 248 AQTCAACEDSGVNYWLHNGHVTNNNVKMGKSLNNFFTIRQIAANYHPLALRHFLMSAQYR 307

Query: 303 SPLNYTVSQLDSASDTIYYIYQTFQDCEDALLQHQGEILTEGLGKTAKKDPVSSAAEECI 362
           SPLNY+VSQL+S+SD +Y             L    E ++  +GKT +    S+ A+E I
Sbjct: 308 SPLNYSVSQLESSSDALY------------SLSPYREEMSGDVGKTQQ----SAEAKEMI 367

Query: 363 NNLRSEFQTRMADDLNTAHILTGAFQEALKFINSTLTSLKKKQPKKQQLSMIQALISVKK 422
             +++                      ALKFIN +++ LKK Q KKQ++S++ +L+ V+K
Sbjct: 368 KKVKN----------------------ALKFINVSISKLKKMQ-KKQRMSLVVSLVEVEK 427

Query: 423 ELREVLDVLGLLSSSTSSEVLQQLKDKAVKRAGMVEDDILKLIDDRTQARKNKDFARSDQ 482
            +REVLDVLGLL++ +  E+L+ +K KA+ RAGM E+++L+ I++R  ARK+KDF RSD+
Sbjct: 428 AVREVLDVLGLLTTLSYGELLKDMKQKALTRAGMGEEEVLQRIEERNMARKSKDFRRSDR 459

Query: 483 IRADLSALGIALMDVGKETVWRPCVPV 510
           IR  L+  GI L DV  +TVWRP  P+
Sbjct: 488 IRELLAFKGIFLEDVPGDTVWRPSTPL 459

BLAST of Cp4.1LG01g17650 vs. NCBI nr
Match: gi|449464950|ref|XP_004150192.1| (PREDICTED: cysteine--tRNA ligase, cytoplasmic isoform X1 [Cucumis sativus])

HSP 1 Score: 984.6 bits (2544), Expect = 7.4e-284
Identity = 500/568 (88.03%), Postives = 526/568 (92.61%), Query Frame = 1

Query: 1   MAKEEFRVYNSMTQQKEVFTTKEPGKVGMYVCGITAYDFSHIGHARAAVNFDVLYRYLMH 60
           MAKEEFRVYNSMTQQKEVFTTKEPGKVGMYVCGITAYDFSHIGHARAAVNFDVLYRYL H
Sbjct: 1   MAKEEFRVYNSMTQQKEVFTTKEPGKVGMYVCGITAYDFSHIGHARAAVNFDVLYRYLKH 60

Query: 61  LGYEVTYVRNFTDVDDKIIRRANESGENPLSLSERFCQEYLADMDDLQCLSPTHQPRVSD 120
           LGYEVTYVRNFTDVDDKIIRRANESGENP +LS+RFCQEYL+DM DLQCLSPTHQPRVSD
Sbjct: 61  LGYEVTYVRNFTDVDDKIIRRANESGENPFALSDRFCQEYLSDMADLQCLSPTHQPRVSD 120

Query: 121 HLEQIKDMITQIINNEYGYVVDGDVFFSVEKFPNYGQLSGQKLENHRAGERVAVDPRKRS 180
           HLEQIKDMITQII N YGY VDGDVFFSV+KFPNYGQLSGQKLENHRAGERVAVD RK +
Sbjct: 121 HLEQIKDMITQIIKNGYGYAVDGDVFFSVDKFPNYGQLSGQKLENHRAGERVAVDSRKNN 180

Query: 181 PFDFALWKAAKPGEPSWESPWGHGRPGWHIECSAMSAHYLTFKFDIHGGGIDLIFPHHEN 240
           P DFALWKAAKPGEPSWESPWG GRPGWHIECSAMSAHYLTFKFDIHGGGIDLIFPHHEN
Sbjct: 181 PADFALWKAAKPGEPSWESPWGPGRPGWHIECSAMSAHYLTFKFDIHGGGIDLIFPHHEN 240

Query: 241 EVAQSCAACQESKISYWMHNGHVTNNNEKMSKSLGNFFTIRQITERYHPLVLRHFLISAH 300
           EVAQSCAACQESKISYWMHNGHVTNNNEKMSKSLGNFFTIRQITERYHPLVLRHFLISAH
Sbjct: 241 EVAQSCAACQESKISYWMHNGHVTNNNEKMSKSLGNFFTIRQITERYHPLVLRHFLISAH 300

Query: 301 YRSPLNYTVSQLDSASDTIYYIYQTFQDCEDALLQHQGEILTEGLGKTAKKDPVSSAAEE 360
           YRSPLNYTVSQLDSASDT+YYIYQT QDCEDAL QHQGE L++GLGKT KKDPVSSAAEE
Sbjct: 301 YRSPLNYTVSQLDSASDTVYYIYQTMQDCEDALSQHQGENLSDGLGKTGKKDPVSSAAEE 360

Query: 361 CINNLRSEFQTRMADDLNTAHILTGAFQEALKFINSTLTSLKKKQPKKQQLSMIQALISV 420
           CI NLRSEFQTRMADDLNTAHILTGAFQEALKFINSTLTSLKKKQPKKQQLSMIQ+L++V
Sbjct: 361 CIINLRSEFQTRMADDLNTAHILTGAFQEALKFINSTLTSLKKKQPKKQQLSMIQSLVNV 420

Query: 421 KKELREVLDVLGLLSSSTSSEVLQQLKDKAVKRAGMVEDDILKLIDDRTQARKNKDFARS 480
           KKELREVLDVLGLLS+ST SEVLQQLKDKA+KRAGMVEDDIL LI++RTQARK+K+F +S
Sbjct: 421 KKELREVLDVLGLLSTSTYSEVLQQLKDKALKRAGMVEDDILHLIEERTQARKDKNFGKS 480

Query: 481 DQIRADLSALGIALMDVGKETVWRPCVPVELAPPPPPSEENKSTPLAEKKPTEQQVAQPP 540
           D IRA LS+LGIALMDVG+ET+WRPCVPVE A   PP E+  +   AE+KP EQQVAQPP
Sbjct: 481 DNIRAKLSSLGIALMDVGRETIWRPCVPVEPAVLVPPGEQKATQ--AEQKPMEQQVAQPP 540

Query: 541 AVEQNKAAQEQTVQDAQPHE-ENKATVP 568
           A EQ      QTV +AQ +E E KATVP
Sbjct: 541 AGEQ------QTVDNAQTNEAEKKATVP 560

BLAST of Cp4.1LG01g17650 vs. NCBI nr
Match: gi|659072340|ref|XP_008465226.1| (PREDICTED: LOW QUALITY PROTEIN: cysteine--tRNA ligase, cytoplasmic-like [Cucumis melo])

HSP 1 Score: 980.3 bits (2533), Expect = 1.4e-282
Identity = 498/568 (87.68%), Postives = 522/568 (91.90%), Query Frame = 1

Query: 1   MAKEEFRVYNSMTQQKEVFTTKEPGKVGMYVCGITAYDFSHIGHARAAVNFDVLYRYLMH 60
           MAKEEFRVYNSMTQQKEVFTTKEPGKVGMYVCGITAYDFSHIGHARAAVNFDVLYRYL H
Sbjct: 1   MAKEEFRVYNSMTQQKEVFTTKEPGKVGMYVCGITAYDFSHIGHARAAVNFDVLYRYLKH 60

Query: 61  LGYEVTYVRNFTDVDDKIIRRANESGENPLSLSERFCQEYLADMDDLQCLSPTHQPRVSD 120
           LGYEVTYVRNFTDVDDKIIRRANESGENP +LS+RFCQEYL+DM DLQCLSPTHQPRVSD
Sbjct: 61  LGYEVTYVRNFTDVDDKIIRRANESGENPFALSDRFCQEYLSDMADLQCLSPTHQPRVSD 120

Query: 121 HLEQIKDMITQIINNEYGYVVDGDVFFSVEKFPNYGQLSGQKLENHRAGERVAVDPRKRS 180
           HLEQIKDMITQII N YGY VDGDVFFSV+KFPNYGQLSGQKLENHRAGERVAVD RKR+
Sbjct: 121 HLEQIKDMITQIIKNGYGYAVDGDVFFSVDKFPNYGQLSGQKLENHRAGERVAVDSRKRN 180

Query: 181 PFDFALWKAAKPGEPSWESPWGHGRPGWHIECSAMSAHYLTFKFDIHGGGIDLIFPHHEN 240
           P DFALWKAAKPGEPSWESPWG GRPGWHIECSAMSAHYLTFKFDIHGGGIDLIFPHHEN
Sbjct: 181 PADFALWKAAKPGEPSWESPWGPGRPGWHIECSAMSAHYLTFKFDIHGGGIDLIFPHHEN 240

Query: 241 EVAQSCAACQESKISYWMHNGHVTNNNEKMSKSLGNFFTIRQITERYHPLVLRHFLISAH 300
           EVAQSCAACQESKISYWMHNGHVTNNNEKMSKSLGNFFTIRQITERYHPLVLRHFLISAH
Sbjct: 241 EVAQSCAACQESKISYWMHNGHVTNNNEKMSKSLGNFFTIRQITERYHPLVLRHFLISAH 300

Query: 301 YRSPLNYTVSQLDSASDTIYYIYQTFQDCEDALLQHQGEILTEGLGKTAKKDPVSSAAEE 360
           YRSPLNYTVSQLDSASDT+YYIYQT QDCEDAL QHQGE L++GLGKT KKDPVSSAAEE
Sbjct: 301 YRSPLNYTVSQLDSASDTVYYIYQTLQDCEDALSQHQGENLSDGLGKTGKKDPVSSAAEE 360

Query: 361 CINNLRSEFQTRMADDLNTAHILTGAFQEALKFINSTLTSLKKKQPKKQQLSMIQALISV 420
           CI NLRSEFQTRMADDLNTAHILTGAFQEALKFINSTLTSLKKKQPKKQQLSMIQ+L++V
Sbjct: 361 CIINLRSEFQTRMADDLNTAHILTGAFQEALKFINSTLTSLKKKQPKKQQLSMIQSLVNV 420

Query: 421 KKELREVLDVLGLLSSSTSSEVLQQLKDKAVKRAGMVEDDILKLIDDRTQARKNKDFARS 480
           KKELREVLDVLGLLS+ST SEVLQQLKDKA+KRAGMVEDDIL LI++RTQA KNKDF +S
Sbjct: 421 KKELREVLDVLGLLSTSTYSEVLQQLKDKALKRAGMVEDDILHLIEERTQAXKNKDFGKS 480

Query: 481 DQIRADLSALGIALMDVGKETVWRPCVPVELAPPPPPSEENKSTPLAEKKPTEQQVAQPP 540
           D IRA+LS+LGIALMDVGKET+WRPCVPVE A   PP E+  +   AE+KP EQQVAQPP
Sbjct: 481 DNIRAELSSLGIALMDVGKETIWRPCVPVEPAVLVPPCEQKATQ--AEQKPMEQQVAQPP 540

Query: 541 AVEQNKAAQEQTVQDAQPHE-ENKATVP 568
           A +           DAQ +E E KAT P
Sbjct: 541 AGD-----------DAQTNEAEKKATGP 555

BLAST of Cp4.1LG01g17650 vs. NCBI nr
Match: gi|596156994|ref|XP_007222813.1| (hypothetical protein PRUPE_ppa004241mg [Prunus persica])

HSP 1 Score: 818.9 bits (2114), Expect = 5.4e-234
Identity = 396/515 (76.89%), Postives = 445/515 (86.41%), Query Frame = 1

Query: 2   AKEEFRVYNSMTQQKEVFTTKEPGKVGMYVCGITAYDFSHIGHARAAVNFDVLYRYLMHL 61
           +KEEF VYNSMT+QKE F  K PGKVGMYVCG+TAY  SH+GH RAAVNFDVLYRYL HL
Sbjct: 4   SKEEFMVYNSMTRQKESFKPKVPGKVGMYVCGVTAYALSHLGHGRAAVNFDVLYRYLQHL 63

Query: 62  GYEVTYVRNFTDVDDKIIRRANESGENPLSLSERFCQEYLADMDDLQCLSPTHQPRVSDH 121
           GYEVTYVRNFTDVDDKII RANE GE+PLSLS RFCQEYL DM DLQCL PTHQPRVSDH
Sbjct: 64  GYEVTYVRNFTDVDDKIINRANEVGEDPLSLSNRFCQEYLKDMGDLQCLLPTHQPRVSDH 123

Query: 122 LEQIKDMITQIINNEYGYVVDGDVFFSVEKFPNYGQLSGQKLENHRAGERVAVDPRKRSP 181
           +E IKD+ITQIIN +Y Y VDGDVFF+VEKFPNYGQLSGQ+LE++RAGERVAVD RKR+P
Sbjct: 124 MEHIKDLITQIINKDYAYAVDGDVFFAVEKFPNYGQLSGQRLEHNRAGERVAVDSRKRNP 183

Query: 182 FDFALWKAAKPGEPSWESPWGHGRPGWHIECSAMSAHYLTFKFDIHGGGIDLIFPHHENE 241
            DFALWK+AKPGEPSW+SPWG GRPGWHIECSAMSAHYLTF FDIHGGGIDLIFPHHENE
Sbjct: 184 ADFALWKSAKPGEPSWDSPWGPGRPGWHIECSAMSAHYLTFNFDIHGGGIDLIFPHHENE 243

Query: 242 VAQSCAACQESKISYWMHNGHVTNNNEKMSKSLGNFFTIRQITERYHPLVLRHFLISAHY 301
           +AQSCAACQES +SYWMHNGHVTNNNEKMSKSLGNFFTI +ITERYHPL LRHFLISAHY
Sbjct: 244 IAQSCAACQESSVSYWMHNGHVTNNNEKMSKSLGNFFTISEITERYHPLALRHFLISAHY 303

Query: 302 RSPLNYTVSQLDSASDTIYYIYQTFQDCEDALLQHQGEILTEGLGKTAKKDPVSSAAEEC 361
           RSPLNYTVSQL+ +SD +YYIYQT QDCEDAL   Q   L EG  K  +   ++ AA+EC
Sbjct: 304 RSPLNYTVSQLEGSSDAVYYIYQTLQDCEDALSPFQEGSLKEGTEKNGRTVKITPAAQEC 363

Query: 362 INNLRSEFQTRMADDLNTAHILTGAFQEALKFINSTLTSLKKKQPKKQQLSMIQALISVK 421
           I+ L +EF+T+M DDLNTAHILTGAFQ+ALK INS+L  LKKKQ ++QQL MIQ+L+ +K
Sbjct: 364 ISKLHNEFETKMCDDLNTAHILTGAFQDALKLINSSLNLLKKKQQRQQQLLMIQSLVEIK 423

Query: 422 KELREVLDVLGLLSSSTSSEVLQQLKDKAVKRAGMVEDDILKLIDDRTQARKNKDFARSD 481
           KE++E+L++LGLLSS T SEVLQQ K KA+KRAG+VEDD+L  I +RT ARKNKDFA+SD
Sbjct: 424 KEVQELLNILGLLSSDTYSEVLQQFKMKALKRAGLVEDDVLDQIKERTLARKNKDFAKSD 483

Query: 482 QIRADLSALGIALMDVGKETVWRPCVPVELAPPPP 517
           QIRA L+  GIALMD+GKET+WRPCVP    P  P
Sbjct: 484 QIRAYLTTKGIALMDLGKETIWRPCVPAGEQPSLP 518

BLAST of Cp4.1LG01g17650 vs. NCBI nr
Match: gi|645276382|ref|XP_008243262.1| (PREDICTED: cysteine--tRNA ligase, cytoplasmic [Prunus mume])

HSP 1 Score: 815.5 bits (2105), Expect = 6.0e-233
Identity = 395/515 (76.70%), Postives = 445/515 (86.41%), Query Frame = 1

Query: 2   AKEEFRVYNSMTQQKEVFTTKEPGKVGMYVCGITAYDFSHIGHARAAVNFDVLYRYLMHL 61
           +KEEF VYNSMT+QKE+F  K PGKVGMYVCG+TAY  SH+GHARAAVNFDVLYRYL HL
Sbjct: 4   SKEEFMVYNSMTRQKEIFNPKVPGKVGMYVCGVTAYALSHLGHARAAVNFDVLYRYLQHL 63

Query: 62  GYEVTYVRNFTDVDDKIIRRANESGENPLSLSERFCQEYLADMDDLQCLSPTHQPRVSDH 121
           GYEVTYVRNFTDVDDKII RANE GE+PLSLS RFCQEYL DM DLQCL PTHQPRVSDH
Sbjct: 64  GYEVTYVRNFTDVDDKIINRANEVGEDPLSLSNRFCQEYLKDMGDLQCLLPTHQPRVSDH 123

Query: 122 LEQIKDMITQIINNEYGYVVDGDVFFSVEKFPNYGQLSGQKLENHRAGERVAVDPRKRSP 181
           +E IKD+ITQIIN +Y Y VDGDVFF+VEKFPNYGQLSGQ+LE++RAGERVAVD RKR+P
Sbjct: 124 MEHIKDLITQIINKDYAYSVDGDVFFAVEKFPNYGQLSGQRLEHNRAGERVAVDSRKRNP 183

Query: 182 FDFALWKAAKPGEPSWESPWGHGRPGWHIECSAMSAHYLTFKFDIHGGGIDLIFPHHENE 241
            DFALWK+AKPGEPSW+SPWG GRPGWHIECSAMSAHYLTF FDIHGGGIDLIFPHHENE
Sbjct: 184 ADFALWKSAKPGEPSWDSPWGPGRPGWHIECSAMSAHYLTFNFDIHGGGIDLIFPHHENE 243

Query: 242 VAQSCAACQESKISYWMHNGHVTNNNEKMSKSLGNFFTIRQITERYHPLVLRHFLISAHY 301
           +AQSCAACQ S +SYWMHNGHVTNNNEKMSKSLGNFFTI +ITERYHPL LRHFLISAHY
Sbjct: 244 IAQSCAACQGSSVSYWMHNGHVTNNNEKMSKSLGNFFTISEITERYHPLALRHFLISAHY 303

Query: 302 RSPLNYTVSQLDSASDTIYYIYQTFQDCEDALLQHQGEILTEGLGKTAKKDPVSSAAEEC 361
           RSPLNYTVSQL+ +SD +YYIYQT QD EDAL   Q   L EG  K  +   ++ AA+EC
Sbjct: 304 RSPLNYTVSQLEGSSDAVYYIYQTLQDSEDALSPFQEGSLKEGTEKNGRTVKITPAAQEC 363

Query: 362 INNLRSEFQTRMADDLNTAHILTGAFQEALKFINSTLTSLKKKQPKKQQLSMIQALISVK 421
           I+ L +EF+T+M DDLNTAHILTGAFQ+ALK INS+L  LKKKQ ++QQL MIQ+L+ +K
Sbjct: 364 ISKLHNEFETKMCDDLNTAHILTGAFQDALKLINSSLNILKKKQQRQQQLLMIQSLVEIK 423

Query: 422 KELREVLDVLGLLSSSTSSEVLQQLKDKAVKRAGMVEDDILKLIDDRTQARKNKDFARSD 481
           KE++E+L++LGLLSS T SEVLQQ K KA+KRAG+VEDD+L  I +RT ARKNKDFA+SD
Sbjct: 424 KEVQELLNILGLLSSDTYSEVLQQFKVKALKRAGLVEDDVLDQIKERTLARKNKDFAKSD 483

Query: 482 QIRADLSALGIALMDVGKETVWRPCVPVELAPPPP 517
           QIRA L+  GIALMD+GKET+WRPCVP    P  P
Sbjct: 484 QIRAYLTTKGIALMDLGKETIWRPCVPAGEQPSLP 518

BLAST of Cp4.1LG01g17650 vs. NCBI nr
Match: gi|224111652|ref|XP_002315932.1| (tRNA synthetase class 1 family protein [Populus trichocarpa])

HSP 1 Score: 803.5 bits (2074), Expect = 2.4e-229
Identity = 390/531 (73.45%), Postives = 450/531 (84.75%), Query Frame = 1

Query: 4   EEFRVYNSMTQQKEVFTTKEPGKVGMYVCGITAYDFSHIGHARAAVNFDVLYRYLMHLGY 63
           EE ++YNSMTQQKEVF ++ PGKV MYVCG+T+YDFSH+GHARAAV FD+L+RYL HLGY
Sbjct: 5   EELKLYNSMTQQKEVFKSRIPGKVSMYVCGVTSYDFSHLGHARAAVAFDILFRYLQHLGY 64

Query: 64  EVTYVRNFTDVDDKIIRRANESGENPLSLSERFCQEYLADMDDLQCLSPTHQPRVSDHLE 123
           EVTYVRNFTD+DDKIIRRANE GE+PLSLS RFC+EYL DM DLQCL PTHQPRV+DH+E
Sbjct: 65  EVTYVRNFTDIDDKIIRRANEIGEDPLSLSSRFCEEYLVDMTDLQCLIPTHQPRVTDHVE 124

Query: 124 QIKDMITQIINNEYGYVVDGDVFFSVEKFPNYGQLSGQKLENHRAGERVAVDPRKRSPFD 183
           QIKDMITQII  +  Y V+GDVFF+V K PNYGQLSGQ+LEN+RAGERVAVD RKR+P D
Sbjct: 125 QIKDMITQIIEKDCAYAVEGDVFFAVNKSPNYGQLSGQRLENNRAGERVAVDSRKRNPAD 184

Query: 184 FALWKAAKPGEPSWESPWGHGRPGWHIECSAMSAHYLTFKFDIHGGGIDLIFPHHENEVA 243
           FALWKAAKPGEPSWESPWG GRPGWHIECSAMSA YLTFKFDIHGGGIDLIFPHHENE+A
Sbjct: 185 FALWKAAKPGEPSWESPWGPGRPGWHIECSAMSAQYLTFKFDIHGGGIDLIFPHHENEIA 244

Query: 244 QSCAACQESKISYWMHNGHVTNNNEKMSKSLGNFFTIRQITERYHPLVLRHFLISAHYRS 303
           QSCAAC+ES +SYWMHNGHVTNNNEKMSKSLGNFFTIRQITERYHPL LRHFLISAHYRS
Sbjct: 245 QSCAACEESSVSYWMHNGHVTNNNEKMSKSLGNFFTIRQITERYHPLALRHFLISAHYRS 304

Query: 304 PLNYTVSQLDSASDTIYYIYQTFQDCEDALLQHQGEILTEGLGKTAKKDPVSSAAEECIN 363
           PLNY+VSQL+S+SD ++YIYQT QDCEDALL  Q   L EG G+ A    +++ A++CI+
Sbjct: 305 PLNYSVSQLESSSDAVFYIYQTLQDCEDALLPFQEGSLKEGAGQNANLVAITADAQKCIS 364

Query: 364 NLRSEFQTRMADDLNTAHILTGAFQEALKFINSTLTSLKKKQPKKQQLSMIQALISVKKE 423
            L  +F+T+M+DDLNT+ +LTGAFQEALK +N +L  LKKKQ KKQQLS+I+++  VKKE
Sbjct: 365 RLHEDFETKMSDDLNTSPLLTGAFQEALKVVNGSLGMLKKKQQKKQQLSLIRSVTEVKKE 424

Query: 424 LREVLDVLGLLSSSTSSEVLQQLKDKAVKRAGMVEDDILKLIDDRTQARKNKDFARSDQI 483
           + EVL +LGL    T +EVLQQLK KA+KRAG+ EDD++ LI+DR  ARK++DF +SDQI
Sbjct: 425 VTEVLRILGLFPPCTCAEVLQQLKGKALKRAGLTEDDVMSLIEDRAVARKSQDFKKSDQI 484

Query: 484 RADLSALGIALMDVGKETVWRPCVPVELAPPPPPSEENKSTPLAEKKPTEQ 535
           R DLSA GIALMDVGKETVWRPCVPVE       +EE   T + E  P  Q
Sbjct: 485 RTDLSARGIALMDVGKETVWRPCVPVE-------NEEKAKTVVEEPTPPPQ 528

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
SYCC2_ARATH2.9e-19565.61Cysteine--tRNA ligase 2, cytoplasmic OS=Arabidopsis thaliana GN=At5g38830 PE=2 S... [more]
SYCM_ARATH3.4e-17259.30Cysteine--tRNA ligase, chloroplastic/mitochondrial OS=Arabidopsis thaliana GN=SY... [more]
SYCC1_ARATH1.8e-16058.38Cysteine--tRNA ligase 1, cytoplasmic OS=Arabidopsis thaliana GN=At3g56300 PE=3 S... [more]
SYC_GEODF1.3e-12347.91Cysteine--tRNA ligase OS=Geobacter daltonii (strain DSM 22248 / JCM 15807 / FRC-... [more]
SYC_PELPD3.9e-12347.28Cysteine--tRNA ligase OS=Pelobacter propionicus (strain DSM 2379) GN=cysS PE=3 S... [more]
Match NameE-valueIdentityDescription
M5XD29_PRUPE3.8e-23476.89Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004241mg PE=3 SV=1[more]
B9HWF9_POPTR1.6e-22973.45tRNA synthetase class 1 family protein OS=Populus trichocarpa GN=POPTR_0010s1324... [more]
V4W6G1_9ROSI2.8e-22975.58Uncharacterized protein OS=Citrus clementina GN=CICLE_v10014843mg PE=3 SV=1[more]
A0A061GQF5_THECC4.0e-22875.68Cysteinyl-tRNA synthetase OS=Theobroma cacao GN=TCM_038835 PE=3 SV=1[more]
A0A067GNP2_CITSI5.3e-22875.78Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g0093292mg PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT5G38830.11.6e-19665.61 Cysteinyl-tRNA synthetase, class Ia family protein[more]
AT2G31170.11.9e-17359.30 Cysteinyl-tRNA synthetase, class Ia family protein[more]
AT3G56300.11.0e-16158.38 Cysteinyl-tRNA synthetase, class Ia family protein[more]
Match NameE-valueIdentityDescription
gi|449464950|ref|XP_004150192.1|7.4e-28488.03PREDICTED: cysteine--tRNA ligase, cytoplasmic isoform X1 [Cucumis sativus][more]
gi|659072340|ref|XP_008465226.1|1.4e-28287.68PREDICTED: LOW QUALITY PROTEIN: cysteine--tRNA ligase, cytoplasmic-like [Cucumis... [more]
gi|596156994|ref|XP_007222813.1|5.4e-23476.89hypothetical protein PRUPE_ppa004241mg [Prunus persica][more]
gi|645276382|ref|XP_008243262.1|6.0e-23376.70PREDICTED: cysteine--tRNA ligase, cytoplasmic [Prunus mume][more]
gi|224111652|ref|XP_002315932.1|2.4e-22973.45tRNA synthetase class 1 family protein [Populus trichocarpa][more]
The following terms have been associated with this gene:
Vocabulary: Biological Process
TermDefinition
GO:0006423cysteinyl-tRNA aminoacylation
GO:0006418tRNA aminoacylation for protein translation
Vocabulary: Cellular Component
TermDefinition
GO:0005737cytoplasm
Vocabulary: Molecular Function
TermDefinition
GO:0004817cysteine-tRNA ligase activity
GO:0005524ATP binding
GO:0004812aminoacyl-tRNA ligase activity
GO:0000166nucleotide binding
Vocabulary: INTERPRO
TermDefinition
IPR024909Cys-tRNA/MSH_ligase
IPR015803Cys-tRNA-ligase
IPR015273Cys-tRNA-synt_Ia_DALR
IPR014729Rossmann-like_a/b/a_fold
IPR009080tRNAsynth_Ia_anticodon-bd
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006534 cysteine metabolic process
biological_process GO:0006423 cysteinyl-tRNA aminoacylation
biological_process GO:0006418 tRNA aminoacylation for protein translation
cellular_component GO:0005737 cytoplasm
cellular_component GO:0005575 cellular_component
molecular_function GO:0005524 ATP binding
molecular_function GO:0004817 cysteine-tRNA ligase activity
molecular_function GO:0004812 aminoacyl-tRNA ligase activity
molecular_function GO:0000166 nucleotide binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG01g17650.1Cp4.1LG01g17650.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR009080Aminoacyl-tRNA synthetase, class Ia, anticodon-bindingunknownSSF47323Anticodon-binding domain of a subclass of class I aminoacyl-tRNA synthetasescoord: 351..505
score: 9.94
IPR014729Rossmann-like alpha/beta/alpha sandwich foldGENE3DG3DSA:3.40.50.620coord: 22..308
score: 1.1E
IPR015273Cysteinyl-tRNA synthetase, class Ia, DALRPFAMPF09190DALR_2coord: 368..438
score: 2.
IPR015803Cysteine-tRNA ligaseHAMAPMF_00041Cys_tRNA_synthcoord: 6..505
score: 3
IPR015803Cysteine-tRNA ligaseTIGRFAMsTIGR00435TIGR00435coord: 7..504
score: 1.5E
IPR024909Cysteinyl-tRNA synthetase/mycothiol ligasePRINTSPR00983TRNASYNTHCYScoord: 65..74
score: 8.7E-30coord: 29..40
score: 8.7E-30coord: 225..246
score: 8.7E-30coord: 194..212
score: 8.7
IPR024909Cysteinyl-tRNA synthetase/mycothiol ligasePANTHERPTHR10890CYSTEINYL-TRNA SYNTHETASEcoord: 3..545
score:
NoneNo IPR availableGENE3DG3DSA:1.20.120.640coord: 311..432
score: 1.
NoneNo IPR availablePANTHERPTHR10890:SF10CYSTEINE-TRNA LIGASEcoord: 3..545
score:
NoneNo IPR availableunknownSSF52374Nucleotidylyl transferasecoord: 5..316
score: 4.29