Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, polypeptide, CDS, three_prime_UTR
CCAATCTCCACTACACCCCTATTATTTTGACATTGTGTAGGAATGTCAGAATGTCAACATGGACGATATTTGGATAATTAAAGATCAAATTAGTTTTCAAGTGTGAAATTTTCTTAAGGTGATTTATGCAAAATTTCCCTCTTTTATAAAAGCCTCAGGTCCCCAAACTTCATTACAATGACACACACCAAGTTTTGGCGCAAGCCGTGTGGAAGGAGCTCTCACTTCTCTGCGGTCTCCGTCTTTCTCTCTCTATAACTATATTCATATAGCGCGTACACTTGAATGTCGGCGTCGCAGAGAATTTTCTGCGCTATAACTCTTCCTCACCCTCGTTTGTATTCATCTTGGGCCTTCCCTTTCATTTGCCACCCTCTATCCCACAATATCTTACCACGCTCTCTCACTCTCGCACCTTTAACTTCCTCCCCCTTTTCCCTTTCTCCTGATTCTCGATTCATCATGCCTTACAATCAGGTGCCCATTTTTTTTTTTTTGTTTGATGTTTTCGGGTTGTGAATTTATGTGTCTGTGAAGAGATGTCGATTTTTGTATCTGGGTATCTCAATTTGTTTTCAATAATGCTTTTGATGTTCATAACTGTTTCGTTTTGAATGCGGCAATCTGTTTTCTATTGATATTGAGGATGGACGGGCTTTGTATTTGTTATTTGTTATTTGTTATCTCTTTTCGACGTTTGATGATTAATGTTTTTACAATGCTAGCATGTAAAAAGGATGCCGATATGGTAATTGGTGTTTTTTTTATACGCTTATCATATATGAAACACTTCAATTCTATTGCTTATGTCTCAAGGCATCGATTATATGTAATTAAAAATTTATCTTTAGGATTATCTATGTTGAGTTTGATCTTGATTTGCAGCGAAGGGGTGGCCATAGAGAACAGAAGTGGAAAGAGAAGGCAAAGGTTGACAGAAATTCTACGGAGTCAGAGGCTGCTGCTGACGTTGTTACTAATGCACTCGGTAAATTGAGGGTCACTGAAAGTGATCAACCTCATGTTCTTACTTCTAGTGCGCAGTTTGGAAATGCCCAGCTGACAAATCAGGCCACCCCTGGGCTTGCTCATAGAGCAATTTGGAAACCAAAAGCGTATGGAACAACCAGTGGGGCCGCAGTGGTTGAAGGTGAAAAAGAACCTACCAATGGAACGTCAACCGAAAACAAAGGGAGTAAAGCTGAGCTGGCAGCACAGAATGGCGCCGTTAGCTTGAGTCAATTATTCAAGGGCAATCAGATTGAAAAGTTTACCGTGGATAATTCTACTTACACACAAGCACAAATAAGAGCTACGTTTTACCCTAAATTTGAGAATGAGAAGTCGGATCAGGAGGTATTTCTTTACTTGATGTGTGTCTTTTTGAAGTTAAACTTGGACTCTGGATTTGCTTTATTACACAGAATTATTTACTTTTAGCTCATTTATTAGCTTGACATTATTCTACCTCCAAGATGTTTTTCCCATTCACCAAGTCATTGGCTTGTTTGGGTATGTGTTTCTAGGTGGATGCTTGTGGTTGTGTACTTGTTTTTGAGTCACTTGACCATCATCTTTGGGTCATGGGCGAAATTTTGCTGTAGCTTTTGGGCCATTGGTGTTTCTTTAAAATCCAGGTTCCCAAGGCTTTTTAATATCTCCCCGGTAACAAATGTGGTTGTTAGTTCTTGTTGGGATTTATCAACTTCTTCATGGAGTTTCTCTTTCAGACAAGCATTGAAGGATGAAGAGGTGGTCAATTTTCAGTCTCTTTCTTCATGCCTCTCTAAAATTTCACTATATTCGTGTGAGGATGTCCAAATTTGGTCCCTTGAATCATCAGGTTTTTTCTTGATTAAGTTTCTCTCAAATCGTTTGGCTTCTTCTTCTTTGATGCCCAATGGTCTGTTCAAATCATTATGGAAGTTGAAAAGCCCGAAGAAAATTATTATTATTGTTTGGATATTGCTTAATGGAAGTCTTAATTCAAAGAAAGTTTT
GCAAAGGAAGAATACGTCTCTATGGCTTAGGCCCTCCATCTGTTTTATATGTTTTAAGGATGAGGAATCTCTCATTTCTTTGGCTGGAATTCATCCTATTGTCTTGAACCTCAGTGTTCGTGGGGGGACTTTAACTTTTTTTGGTGGGCCTAAGTTCACTACAACCTTGGGAGTTCAATACTAGGTTTGTAACCTAATATTATCACAGAAATTGCGGGCCTTTTGGATGTTGCTATTTTGTTGCAAGCCTTGTAGTATTCTAAGTTCACTACAACCTTTAGAGACTTGCCCTTATTCATTTTGTCCTTAGCGGCAACCTATTTTATGTACCTATTTAGAATCCCCAATGTGGTTAGCAAGATTTTGGAGAAGTTGGCAAGGGATTTCTTGCGGGAAACTCGAAAGGTTGACAAGAGGAAAGGCTTGCTGTTGGACCTAGGATTAGGCATAGGCAATGTAAGAGCGAGAAACAAGGTCTTGTTGGCTAAATGGCTTATGCAAATTCATCATGATTATGACACCTTGTGGCATAAGTTATTGTTACCAAGTATGGGCTTCACCTTGTTGAGTCCACCTTCGCCTTCTTTGGGTTTCCACCACCTTTTGACGATTAGGAAAACGACATGGAGATGTCCCATCATTAACTAGGAGGTTTTTTTGCATATGCCCATAAAGGTTGAATTTTGTGGTAGGTTGGTTTCTTCACAACTTTTGGGGATTTGGACTGAGAGAAACAATAGAGTTTTAAAGATATGGAGAGATCTTGGGAGGAGGTTTTGCCAATTGCTACGTTTAAAGCATCTATTTTGGGTGTCAGTTTCCAAGGAATCTTGTAATTACATGCAAAGTCTTATTATTTTGAGTTGGAACCCTTTTTTGTATAATTAGTTTCATGGCTCCTTTTAATAAACTATCTTGTTTACCCCTTGTTTTCCCTTTTGTAGTCGTTTAGAAAAATGGGAGCATGGTCTCTCATTTTTCTGTGGATTAATGGATGGGGAAATTCTCTTGTGAACCTTTTACTTTTCTTTTGGCTTCTATTGGACCTTTGTCTTTGTGAAACGTGCATCATGCCACCATATTTAATAAACCGTTCCTTTATGGAAAACAAAGAAGGTCATATGGTCATTCTGGGGCAATTTTTAAACTACTAAGTTTTGATAAGGTGATCGGATTTAATTGTTTATTTTTGTATTAGATTGATATTTATTTCGTTGTCAATGATTTGAGAATTTCGTTTCCTGGTAATATGTTTAATGTGTATATGATTTTATGTCAGATTAGAACAAGGATGATAGAGATGGTATCGAAAGGCTTGGCTACATTGGAGGTATAGAGATACCTGCAATTTTTGACAAGAATGCTGTAATGTTTATGGTGTTTAATGTTCCATTTATGATGTAGTTACTATGTATTATTTAATGTCATTCACTCCCCATTATGGATTTTAAAACCTTCAATGATGGCGTTTTAGTTTCGAAGTAGAAGTATATTGAACAATTTGGATGTTGATTCAATTTGGGTGTTCTTTTGGTGACTACTCATTTTTTGAATTGTTATGCTTCGGTACTGCAAAAGTGGATCCATACTACAACCTTTCTTGTACAATATTTGTTCTATAGGCTGTACAACTGTGTCCTGAGTTCTGAAAAAGGATTCTGTACTTGTGAGTCTAAATTCATATCTTTTTGTGAAAAATTTCATTGCTTATATAACAAAACAGTAGGTAAAATAGTTTCCTCAGTTCTTCATTGCTCATACCTTCTGGTTATTGTCTCTAATGGTAAAAGTAAATTGATCTTGTTCTGCATTCTATAAGTTGTTATTAGGTTATTCAACTTGGAAATTTCTACAGGTTTCACTAAAACACTCAGGGTCTTTGTTTATGTACGCTGGCCATGAAGGTGGAGCATATGCAAAAAACAGCTTCGGGAATATGTATGCCGATATATGCCTTTTCTTTTTAATTCTTTTTTGAAGGTGTATTGTATGCCAATTTT
CCTAAATCCATTTTGATGAAGCCGGTAAGGAATTGAATTTGAAGTGGTACTCATATCTTTTCGGTATTTTTTCTTGAACAAGGATATTTACATATGGTCATTGCGTGTAACTGTTCTGCTCCCACTACCTTCAATATCATTTGCCATCATTATTATTTCATTTGTATTTCTTAGTTTTAGAAGAGGAGGATGCCATAACTTCAGACTTCTTTTTGGAACTTGTTACCATTTCCATTCTATTTCATCACGTTTTTGAAGCTTTTTCTGAATAAAATTCCACTTTCAGCTACACTGCTGTTGGTGTCTTTGTTCTGGGAAGGATGTTTCGAGAGGCTTGGGGAGCTGGAGCAGCAAAAAAGCAGGCAGAATTCAATGATTTCCTTGAGGTATTCTTAACATTAGAAAAACAATGATTTTGTTATCTAAAGATTTTAGCTTTGATCTGATAAGCGCCCACAATGCTCGTTGGATGCAAAACTAAAGGTCATTTTAGGATCCTTGCAAACAGTCACAAAATAAAACTAACTTAACAATCGATTTCAACCAAGTGCTTAGTTAGAATGATGAATAAACTCTTAGATCGAACATAGAATCTATAATTGTCACTCTTAGATCCTAAGACAACTCACTCCAGAATTAGCTTGATCAAAACTAATCCAATAAACTGATTTAAACATCAAATCTAAAGCAAGATTTGAAAGAAAAGACAACACTAAAGGGGTTGAATAAGCACAAGCTTTTTTAATATCAATAATTTGAGTTTTGAAGAACATTAAAAACCATTTTAAAGGCCAAATGGTCGTGCATAAGACCAAAAACTCAAAAGTCATTCAAATACAATTAATTGCAAATTGTAGAAAATAAGACCTAGGAACTTAAAACTTAATTGCATACTAATTGATAAATCAAAATGCAAACTAAAATGTAGAAAGTCTTCAAATTGTCTTAAATAATTCCTCTAGAAGGCGGCAAGCTTGATTTGCACGTTTGACAAGGTTTAACATCATGTTTTCCCATGCAGTACACACTTGACAGTATTTTCTTCACAAAATCTATGCACTCTTTTTTTACCATTGTGCCATCTTCCAAACTATAAGACTTTTCCCGTGCCATCTTCATGTCATTCTCTTCTTGACTCGTTGAACTAACATTATATATAAATTTGGGCTCAAAATTATTGGTCGGTTCTTGTGAATTTGCTAGTTTTTGAAGATGTAGTGTGAAAGCCTCTAGAATCTTCTTTGGGTTGCTTCTTGTAATTGCTCCTTCGGGTACATGCAATGGTTCAAATGGTTGAATTCACATCATGATCCTTGCATCTTTTTTATGCCTTGTTGATTGGTGCCCCATTTTTGTTTATATTCTACAAGTTTTATTTTTGGTTCTTTTGGTTGTGGGTTTTCTTCCTTTACGTCCTTTTGTTTCTTTCATCTTTCTCTATTAAAGGACGGCTTTCCATGAAAAATAAAATGCTATGAGAATCTTGAACATGTTTATATTTGTTTTGTCATTTCACATCTCCTGGCAGCACTCAAATCATCTTGCTACCTTTCTTCTTTTAGAGTAACCGCATGTGCATATCAATGGAGTTAGTAACTGCTGTTTTGGGAGATCATGGCCAGAGACCACGTGAGGATTATGGTAATATCAATTTAGAATGTTTTATTGTTTTGCTTCCTTTTAAAAGAAAAAAAAAAAAAGAAAGAAAGAAAAAGAAAAATAGAACTTCTAATTTATTACCCTTTCGTCTTAAATTCAGAAGGAGCTCCAAAATCTTTAAATATTCAAGTTGTACTAATTGTTTTTGGATTTAATTAAGAGGGAGTGCCAGAAGGAATTGTATGAACAATCTTTCCTTTGATACGGATGGAATTCTCACTCAAGATTTTTTCCCTTAGTTCTTTTTAGTTATCTTTTGTATGACTGTTGATTTGAAACTGGTGGAAGTCCTTACTTGAGCGTCCTTTTTTTATAGATTTTCAATCCTTAAGCTGTTCAGC
TGTTTGATTTGATTTTTGAGTTTTTATATATAATATTTCTCTTGACATGTTTCCTGCCCTCCAATTAAACTTATGGTTACTTTTTTCAACATTAGTCAGTTATGTTTCCGTTGAAGTTATTATTATTTTTATATTGTGCTTCTCTCTCAGTATATTGCATTAGGTTTTAAGTAGTGCGCTCTTTCATTTTGTGCAGTGGTAGTTACAGCCGTTACAGAACTGGGCAAGGGAAAGCCGAAGTTCTATTCAACTGCAGAGATAATAGCCTTTTGTAGAAAATGGCGGTTACCAACTAATCATGTTTGGTTATTCTCAAGCAGGTGATGGCCCATAACTTCATTCACATTTTTTTTTTCCATTTGAAGATTTTTTTAAATTATCTATATCTATCTATCTATATATATAGATAGATAGATAGATGTTATGTAATTTCTAATTTTTGCATGACATGGAAAATTAGATAAAAGATTCTATTTCAAAAAAAAGTGATGATATCTTCACAGTTGCTGAACTCAGTAAAACCTTCCATGCTTCTCTGATTTTTGACACTGAAATGTTACCATAACCTGTGTGTTAAATTGCCGATTGACCCAAAAGCTTAAGCTAGTGAGTCAATTGGCGATTTAACATGATATCAGAGCAGGTGGTCCAGGAAGGTCCCATGTTCAAACTCCTACAATGTTGTTTACTCCCTAATTAAAATCAATTTCCACTTGTTGGGTTTTTTTCATAATTCAAGCCTACAAGGGAGGAGGATTGTTCGATGATATAATTAAATTTTCCTTCACCCATCAGCTTAAGCTTTTGAGTCAATCAATAATTTAACATTGTGCTTCATCATTTTTTTTTTGTCCCAAAAAAATGACTGAAGATCTTTTAAGATTGTTAGTCTTATTTGGCAGCAATGAATTGTCTGAGAGTTATTTTATTCGTCATTAACATTTAGTGCTTTATTAAATGTGTGTGTCATCTAGAGTTCCAAGTATTTCTTGCATGCAGCTACTTCTCTACTTTTGTTTACTTTCGTGCAGGTTTGATTTTGATTATATGACTTGAGTGCTCAAGTGATTTCTCTTTTAAATTCGGACGGTGTACTATGAATATAATGTGAATTTTACATTTTTTAAAATCTTGTAGTGTCTACTTACTGTATGTTAATGTTAAACAACTATCTTCTGGCTCCTGATCCTTTTCTTTTCTAACTTCTGAAGGAAGTCGGTGACTTCTTTTTTTGCTGCATTTGATGCCCTATGTGAAGAAGGAACTGCTACTTCAGTATGTAAGGCTCTCGATGAAGTTGCAGAAATATCTGTACCAGGTATAGTTTTTGTCAGTTTAAGCTTTTAATAGTCAGGTACAGTTGTTAGCTGCATGTTGTCCTTTTTCCAATTTGTATTCCTATATACCCTGTTTGCTTTTGAGACCTTTGTCAAAGTTAATACACTATCCCATTGAACAAATCCGTTCAATGGATAGGTCCCTTTGCTTTATGCCTGTTCTATTATTCCATGACTTGCAGCTATAAACGTTAACAAGTTGTTCCTACTTTCTTGCTTGTTGCTTATAAATTATAATCTTCTCATTTTGATGGACCCTGCAGTGTTTACTCACTTTGTGTTAAATCAGTTGAGAAGTTGATGATGATTGTCAACTGGAATTGGTGTGCAGGCTCAAAAGATCACATAAAAGTGCAGGGTGAAATTCTTGAGGGTCTTGTGGCCCGTATGGTGAGCCATGAGAGTTCAAAACACATGGAGAAAGTATTGGAAGAATTTCCTGCTCTGCCGGACAACGAAGGAGGTGTGGTGTTGCTACCGTTATTTTTTTTTAATGCTTTTCCATTGGTTACCTTCATTTTTTGTTTGTATACCCTGTTTTGTCTTAGGTGGACTTGATTTAGGACCAAGCCTGAGGGAAATTTGTGCTGCAAATAGGTCAGATGAAAAACAGGTAACAAAAATTCCTCGGCATGAAATATGATGGTTTTGTGTGTTGATTTT
TCACTTTGCGTCTTCTTGGAACAGAAATGGAATGATTTTCACCTGTTAGGCATCGTTTTTTAACTGTTATCTCTGCTACTGTAAAATACAATTGCAGCAAATAAAAGCACTTCTTCAAAACGTTGGTACTGCCTTTTGCCCTGACCATTCTGACTGGTACGGTGATTCCTACTCAAGAAATGCTGACAGATCTGTTTTATCAAAATTCTTACAAGCCAACCCAGCTGATTTTTCAACCTTCAAATTACAGGTTAGTTGTCACTTCATGTTATGAAAATGAATTTTTATCTCTATTTGTGTTTTCTCTTAATACAAATGTTGCTTTATCTGCTCCAGATCATTTGAGAACTGTTTTCACTACCACTGTTTCACGTGTTGGTTATTTTGAAATTCCTTTTGAAGAATTTGTTTTGGTGGAAAATATCATTCCCTCTTCCTCTGTTTTTTATTGATTTAGTTGGACATGCCACCTTCTTTATTAGCATCTTTCACTCTCCTCGAAATGAGTTGATCATGAAGTACTTCTAATCTGTCCAGGAAATGATTCGTCTAATGAGAGAAAAACGTCTTCCAGCTGCATTCAAATGCTACCATAATTTCCACAAAGTTGGTTCCATATCAAATGACAACCTTTTCTATAAAATGGTCATTCATGTTCACAGCGACTCTGCTTTTCGGCGATATCAAAAAGAAATGAGGTACTGCTTGGGACTTGCATTCTATTTACATTTCCAGTGTTCAAAATAATTATTTCTGTAGAGTTGAGCTACTGTCGTGTTTTTATTAATTGTTGCGTCAAATGTGTTGAACATGTTTCAGGCACAAGCCAGGTTTGTGGCCATTATATCGAGGTAATAGTCAACCTTTCATGATGTGAATTCCTTTATCTTTAGATTTCAACTACTCTGATTCAAATTATTTTAGGCCTTCACACCTTTTTAGATGATGATCATGATTCCTAAATCTAATGTTTTACTCTGGTTTGTTATGTTTATTGTGCTTTAGCAGGGAAAACATTGAACCCAAACAAAAATTTCACTAATGGTAGAAACGTTTCTAAAAAGATAAAGAAGAAAAGCCTACTAGTGGTAGACAAAAGAAAAGTAAGAAACGTAGAGCAATTTAAATAGGGCTAAGTTACAAGTGTAGTCTTTTTGAACTTTTAAGGTTATGCCTTCTAGTTAGAAAGCTTAGAGTGTGCTTGATTTGTGATTTGGTGACATAAATATGAAAAAAAAACAAGGGACTCAATGGAAAACAAAGTTGTATTTCATGTTTCCAATGTTTTTGGATGTGTTCAATAGCAAATTTCAAAATTTGACTCCAAGTTGAACAATGTTTCATAATATATTTATAGACCATAAAAATAACTTTACTTACCTGCTAAATATTAAATTAAATCTCAAATATTATTTATTAATCTAATGAACACGTGTTTTTTGGTTATAAATTGTGTAGGATCAATATTTTACAAAATCATATCATCTATATTATATTATACATATTTTACTTTAACAAAAGTAAAACGAGTTTACAATATGAAACACTATATTTAATTTTTATTTGTTAATGTAAAAATTCAAAAGGGAAATTATTATAAATGGAAAAAGTATCAAACTATTTACAAATATAGAAAGATTTCACTGTCTATCATCAATAAACCGCTATTATCACTGAGCCATAGAAGTCTATCGCGGTCTATCGCCCAGTGATAGAAGTCTATCGCGGTCTATCACGGTCTGTCGTCTATTAGCAGTGAATTTTTTTTATATTTGTAGATATTTTGGCTCATTTTGCTATATTTGAAAACAACCCAATTCAAAATTTGTATACACTATTTTATAATTTCTACTATATGGATAACGAAATTCAACTGTGGTAAAAAGAATAAATCTACTAAATAGATATTTATTGAAACCATTTGATGCATTGAATCTGAAAATATAAAACAAAATATAGATTGCATACCAGACAAGCCCTTAATATATATTTGAAAGTTA
AAAATATATTAAATATAACCTTGAAAGTTGGTAATGTAACCCATAAAGAACTAAAAACTTTAAATTTGTGAAAAGGCTCCCCAATTATTATGATTGAAGCAGTGATAAAATATTAAATTTGTTGTAATCCATCCACTTAGACTTTTGTTAGTGGTGATTTAATATAGAACTTTTAGCCAACAAGTGAAAGAGAGTGTTTATTAGTAGAAGATTAACCCATCAACTTAATCTTTTGGATTTTGTGGTCATTCAATAATCATAAATATCATTGCAAGACTCGTCAACAGAAATAGAGGATAGAGAACACTATACTCGAGGATTTAAATTTAATCAAATCAGTGTATTCTCAGCTGGGAATTTGGCTTCAAATTTTCAATGATTTACTCCACACCATATTTATGGAAGGGTGATTTTAATTGTGTTTATCTTAATCATCTTGGCTTTAAGAGGAAGAATAGGAAAGTGAGGGATGTGGCCTCTCTTTCGGTCTTGCTTGAGGGGGTTGTCTTTAGACAAGAAGGAAAGGATGTCAGGGTGTGGATTCCCAATCCTTAGGAGGAATTCTTGTGTAACTCTTTTTTCGTAGCCTTAGTTGAGATGTCCGAGTGTACCTTCTGATCCCAATTTTGTTCTCATGTGTTGTGTCTAGTTATCTTGCTATGTATTGAAACCTAAACTATTGTAACATGAGCTTTGTCTCATTTCATTTTATCAATGAAAGAGATTGTTTCCTTTTAAAAAACCTCTTTTTCATAGTCTTCTTGATCCTTCTCCCTTAGGTGAGTTGGTCATTTCTACTCTCTAGAGGATTAAGATTCCTAGAAAAGTTAAGTTCTTTACCTAGCAAGTTCTACATGGAATATGGAATAATTAACACTTTGGATCCACTTGTTTGGAAAATATCCTTGTTAGTCAGTCTGTTTTGCTATATTTTCAGTCGGAAGGCGGGGAAAAACTTGAAGCACATTCTTTGAAGTTGTGAGTTCGCGAGAACTGTTTGGAGTGACTTTTTTCAGATGTTTGGTTGCTTGTCATAGGGAACCATGGATGAAGAGATAAGACTTTTTGAGACAAATTTCTTATCAGTATAGATGGGTCCATGAGGAGTTTGATAATGTACATGTGGGTGGAGTTTATAGATGAATTGATTTTTGGTGAAGTAATAGAATAATATTTATGATGTATATAGGGATTAATTTATATTTTTTTACTTTGTCATAAGTCTTTCCCAGTTTCCTCTTTCCTCATTTGCACTTTGTAAACTCTCCAAGCTAGCCTCAATATTGTAACATCTTCATCCTCTTTAGTAAAAATTTGCTTCTTTAAGAAGAAATAAGAAATGAAAATAAAAAAGAAAAAGGAAAGAAGTAAGTAAGATATGCTGTATTGAGCTTTTTGACTGTTCTGAACTGTATAAAAGACTTATTGCTCACAGTTTCATTTGTTCCCAGGCTTTTTTGTTGACATCAATTTATTCAAAGAAAACAAGGACAAGGCTGCTGAATTAGTGAAAAGTAAGAGCAATTTGATGGAGATTGAAGGCAATGGGACCTTAGGAAGAGATGGATTTGCTGATGAAGATGCAAATCTGATGATAAAACTGAAATTTCTTACATATAAGGTAATTTTGCTTTAATTTGTCTAGTTTAAGTTATCTTAAAAGTATAAGAAGGAAATAGTTTACCTCATACAAGAGCAGGACTTTTTTGTAGGTTCTTTTTCTTTTGTTCCTTCCCGGTTCATTTCATATTGGGCCAGTTTTGTTGTTCACAATATGGTTCCCCCACCACCTTTTTGGAAAATAATATACGGTTCAATAGATATTGGCCATTGGTGTGCCATCCTTTTGGGTCATGGTGTAAACATCACTAGCCTTTCATTTTTAATTCTTAAAACCCTAATCAATTGATGTAGTAATGCTAATTTTAGTTGTGTATAGTGGTCACAAGGCTCTTTGATTTAATCAAGACTCATGAGGTCTGAAATCTGAGATATTAAAT
CAACAAGTTTCAGAGCATCGCATGTTGTAGGGTTTAGGTTGGAATTGGATGAATGTGGAGACTGAGTAAAAAACAAATTGCTTAGGTGTACTTTTTGTTGTGATTGCTATTAACAAATTGCTTAATTAAAATAGTCTGAACGATTTATATTTCTCTCTTAGCTGCGGACTTTTCTGATTCGTAATGGCTTGTCAATTCTCTTCAAAGAAGGTCCAGCTGCATACAAGGCCTATTACTTGAGGTATGATTAATTGATTTGATGAGGCATGCCATACACGTAAATTTAGAAAATGAATCTATAATTGAGGGAGGGAAATTGTAGACTAATAATCTTATGCATGGTGCAAGTGGATGAATGGGATTACTGTCCTTCACTCCTCCACTCAGTTCAAATACACAACCAGGTTGCAAACATGGAAATCTTTTTTCTTTTCCAGATCATTTCTTCTAGAATTGTTTTATTCATAATTAAGTATAAGGTCTCTTGTGCAAGCATTTTTTTCTATGAGAAACGTACTTTCATTATCAAGAGGAAAAATACAAAAAGAAGGGAGATGAGATATCACCACACACCGACATGTTACGAAAAATAAGCCCAACTGGTGCAAATATGAGGTAGGCTATAAGTACAAAAGAATTTAGAAGACCATGAAAAAGAAAAAAAATGTTACCAAATCCCAAAAGATGTCACCGTTAGTTTCTTGACCATAAAAAGTTTTAATATTTCTTTCAGACTAGATCCTTCATAGGAGGGCAAAGACTGCATTTGACCTAAGGATTTTGGTCTGGGCAAAAAACTGAGTAATCTGAAGAACCATCTGAAGCTTTGTTGGGCCCAACACCACGGTAAAATGAGCAAATTGAACAACCACCCCAGCAAATAACAAAGTGCAAAAGAAGGAAGGTGCATAAAAGATTCCTCACTGTTTATACAAAGAACACAAAAATTGGGAGAAAGGGCAGAATTTCGGAGAATTCTTTGGATTCTACAGTATTCAGCCTCTCAAAAGAGAATATCAACAATTTAACCTTCACCTTCTTAGGACATGAAGCTTTCCATATAAGGTTAGAATTTTGCCTGTTGAAAGCTGCATGTTTCACAATTTGCTTAGGAGAAAAGGAGTTAACGGAAAAGGAGCCATCATTTTCCAGCATCCACAGATTCACATCATCCCTGTTATTTGGCCTAAAAGAATGAAGCTTCATCAAGAGAGCAGCCCATTCATCAAACTCCCGCATCCAAGTGCTATGTACTACAGAGAAGGGGAATAGGTTTACCATTGTTCAATATGGAGCCAGATTGTAGTTTTAAGGCAATTCCACATCCAAGATCTTCATTTCTACCCAAGGCCTTTTCTTCCCCAACAATACATCCCAAAAGAAAAAAATCCATCTATTGATCATAGGCTTTCACAAAACTGATCTTAGATATAAGACCCTCCAACATCAAATTCTTATAATCTTTAATAGTCTTGTTCACAACGAGGATTTCGTCCATAATTCTTCTCCATACCACATAGGCGATATTCCAAGGGTCTATGGAGAACTTCCCTAAGTCATTCGGGCCAGAAGCTTAACAATGATGTTATAAAGGATAATGATAAGGCTAATAGGTCTAAAATCTTTGACCTAATCTTGGGGATGAGGCAAATATAACATCCATTCATGGACTAATCGATGACCCTCCTCTTGTAGAACTTTGACCAATTCTTTCTTTAAGTTTTTGTGCTGATATTTGAGAAATTTGAATTGAAATTGATAGTAAGAGTTTAAAAATGTTGAAAAGGACTTAGAATTTCCTTAGTATATTACCGTATTTTGTCTGAACACTCATTATTACATTAGTATTAATGTTGAGCGTGTGTTTTTTTTCTCTTCTGTATACTTTATTTTAATGATGTAGGCAAATGAAGTTGTGGGGTACATCAGCCGGAAAACAAAGGGAGCTCAGCAAGATGCTTGATGAATGGTAAGTATACAATTATTGGTTCTTGGGCTGC
TTTCTTGTTCCACTAGAACTGTAGGTACTTCACAGGAAAAAATATTGAAGTTGATGATATACTTTTAACATGATAGGGCTGTATACTTGAGGAGGAAGTATGGAAATAAACAACTGTCATCGGCTACCTATCTTAGTGAAGCCGAACCTTTTCTTGAACAGTATGCTAAACGCAGTCCTCAGAATCAGGCTCTTATTGGATCTGCTGGAAATTTAGTTAGAGCAGAAGATTTCTTGGCCATTGTCGAGGAAGGAATGGACGAAGAGGGTGATCTTCAGAAGGAGCAGGAGGCAGCACCATCAAGTCCAATGCTCTCTGGGAAGGATGCTGTGCCAAAAGCAGAGGGTTTAATTGTGTTTTTTCCAGGTTACTATTTGGCTAATTGGCTAGGCTTTGTTTATTTCCAATGTTGTGAAGATTTTTGTGGTCGAGTTTTTACATGTGCAAATTTTATTAGCTAGGTAAGGATTATGAATTATCTTGGTGGAGGAACTTCATACTTCCATAATATGTGTGTGTTAGATGATATAATATTAAATTTACCTCACCCATCAACTTAAGCTTTTGGGTCAATTGGTGATTTAAGATGGTGTTAAAGCAAGTGGTCCAGGAGGTCTTGTGTTCAAACTCCTACAATGCCTATTTCCTCTCCAATTAATATTGATTTCCACTTGGTGGGTCTTCTACATATTTCAAGCCCACAAGTGAAGGAGAGTATTAGATATACTATTAAATTTATCTTCACCCATCAGCTCAAGCTTTTGGGTCAATTGGTAATTTAAGAGAGGACATGTATTTTTGTCATGTTTTAGATAAGGCTACCTGAACCTAGTTCTTCTTGTAGCCATGTCATTTTATAACAGTTGTACGGGTATGCTGTCTAGTACTTTGAATTTGTGGTAAAAGAATTAAGGAACTATTCTACCCCATTTGACGTTTCAACAATATTTAATCTCTTCAAGTGAAGACATTAAAGTATGGGTTGAATGCAGTTTGAGTTAAACAAGTACATTTATGGTTGGTTGTCATTTTCCCACATCCTCCAATTATAAAGCTACAAAAGTTAAGTTGTTACACTGTATCACATTGAGGTTCAAATAAATTTATTGTTTTCATGTGGAAATTCTTTGGCAGTTTTTTTTTTTTTTCTTTTTGCATTTTGACTCTTACAAGGTGCTGTTGTTTTGGATCTTGAGGTGGCCCTCAGATGCATTACACATTTTTGTAGCATTCAGTTAATTTTAGCTTGGTATGGTTTATGAAATACATAGATCTTCTCTAATGTATCATGTACCCCTTTTTATGGTTTAACCAATTTCTCTGGTTATAGATTTTTTATAGGTCAATTTTATATGTTGCTTTGAACCTCAATTTACAGGAATTCCAGGCTGTGCAAAGTCTGCTCTTTGCAGAGAGATACTGAATGCTCCAGGAGCACTTGGAGATGATCGACCAGTCAATACTCTGATGGGGGACCTGATTAAAGGTATTACTTGAATTCTTTGACTCTTCACAGATTCTAAGCTAGCAATTTCAGATGTTCCCCGAACACACACACACACAACCAGAAAAGAAAGAAATAAAGGGTATACAAAAGTTTCTCCAATTTGTAACAAGGGAAGTTAAGTTGTTAAAAGGCTGTAAATGTTTACACCAGTTCAAAATAAAGTTCAATTATTTATTTATTTATTTATTTTTGATAAAGCTTCTTAGCCTTTGGAAGTTTGAGTTTTTTTCTTGTCGAGAGAAAAATGTATAGCCCACAGGTTCCAAGATTGGGTTTTTGGGCCTTCTCAAGGTGTTTAATTTGTAATTCATTCTAACTTCCTAATTGCTTTGGTCATTCTAAGCTTGCTTTTCCTTCTTGAAGCTAGTGAGAATATGGATGTATTGCATTGAAAATATGGGCATGCACTTTTGGATTTGGTAGAATGTAATTGCTATCGGGAGACGCATTTTTCTAATGCCCAAAATGCTTGATTAAGAGTGTTAGACATATA
ATTTTGAACAATTAAACTTGTTTCTTATATATTTAATATGCACAGGAAGATATTGGCAGAAGGTTGCTGATGAGCGTAGGAGAAAACCATACTCCATAATGCTTGCAGACAAAAATGCACCAAATGAAGAAGTGTGGAGACAGGTAATGTTCTTGTTCTTTACTCTGAATAGTAAACATTGATTCAGTGAACAGAGCAGGAGGAATATAAAATGAAGTGTCTAATGTTAGCCTCTTCTATCAAATAATTTGGTTGCAAATTCAATTCCCTTGCTTCAGATATTGCACAGTATTTTAGCATTTTGTCTCTATCTCTATTATCAACTTTTTATTTGGAAATGAGACTTCAGATTATCAAAAAATATACCAGAAGAAAGTAGGACGAGTCCTTAGAATGTTCTAGTTGCTGCTTGTAATACTCAGGTTCCTTGTTTCTCTGTGGCCTACAGTACGAATATTCCAGTGATATGTAACTTGTGGGGACAGCATTTGATTTTAATGTTGGAGGATATTCTTTGGTTCTTTTCTTGTCATCTTAATCCTTCTGTCCCTTGCATTTTCTTTTCAGTTTTACATGAGCATTGTTTCATTTAGAAGAAACTTATTGAGACTGTTATGAAACTGGTTTTTGATATATACCTAGGAATGGTCCAATTTATTCGAACTTCTTTGGTGCATGTAACTGGACCATTTCTTCACTAGCATATTACATTTCAAAATTATTTATCGTCCATTTTGGTGCTATTCTCAAACATCTATATAATGTCACTATTGATGTTATACCTATTTCATATTTACAGCTTGTGGATGAACCTGATAAGGCCACTTTTATCACATCTTCAACCTGATGGACCTTAGTCCTTGCAACCAAATCTGCACCTTGGAATAGGAAAATTGAATCTATTTCAGAATTTTGTTGATAGGCCACTTTTATTATAACTTAAATATCACATTTGTTAAAGGAATTTTAGAAAGCTATCCAATCAATTGAAGTTAGCATATGTTACAACTTAGTTAACTTTTTTCATTTTGACATTTGAATATATTTGTTGCTTTACTTTCTTATAGAATTGATTATTTAAACAAAGAATATGTCAATCACGGTCGAATGTCTTATTTTCTGTTATATCTTGTATTCCTTTTAAAATAGATGAGCATTTTCTTGTAGTACTTACTATTGCTCCTATTTTCGAAAAGAGCCCCATCTATTTTTTGATGGGAGAAGCCAAAAGGTCGTGTTAGTTTCTGAGGATGTTCTGTTTTTTAACTCTATGTTTCTATTAAACTCCTTATAAGGAACTGCTAAATATGTGCAGATTGAAGATATGTGTTGTAGCACAAGAGCCTCTGCAGTTCCAGTTGTACCTGATTCTGAAGGTTAGTCATTTGGATGCTTATAACGTTCAGTTCCTTGTCTCATGATATGTGGGGACAAGTTTTTCTTTTAATTGTTTGAAACTTGCTTTTTCAACAGCAAGTTCATGGACTATACTAATATTGTGATACTGATGGCTTTTCTCATTCACCAATGAATTCAACAATTTCCTTCTTGATAATTTGATGGGGATGGCTTTTGAAAAAAGTTGATTTTAAACCTAGATGCAATTCAGCCTCAGTTTAGTTTGGTTCTAATTAGTTTGCTTTTGATCTTGTATTTTGTTTTTGGAACCTTGCCCTTGGTATTGTTTTTTATCTTGATTAGTAGATCTGATGTATTGGATATGATGAGAGTGCTAAGGGGATGTCAACCTAGTTGAGATGTCCGGATGCACTCGCTGATCCATTGGTTTTAGCTTTCTTGACTCTTCATTATATTGTTATTTTATACTCTTTACTGTCCCATTTCATTATCTTAATGAAGAGCTCGTTTCCTTTTTCATAAAAAAAGAAAATTCAGGCAATTGAACTTGAACTTGTGAGGAAGAACTGAGAAATGATTTGTATGAATCATAAATAATTTGGTGGTTCTGGTTCTGGTTTAATAAGTTTCCAGCCTCTTAG
ACGGTTCGGCATGCTTTTTGAGGACTTCCTACTAAATTCTGTGAACATCTAAACTATAAAAATGAATTAAATAAAAATATGCTGCGGTGTAATGGAACAGGATCAGAGATTATGCTGTTTGAAAATGCTTTGTTGTGGAATAGAAAACAGGGGGAAGTAAACATTGGAGAGAGAAATTAATGATAATAAAGGAAAAAAGTTTAAAGAAAATTAGTGATACCTTAAGGTTGGCTTTTCTTGCAACTGTACACATCTTCATGCTCGTAGACACATAAACTGTGAAGAAAAAGAAAATAGAAGGAAATTATGAAGAGCATGACTGATCTGAATATAAAGGAAAGTCGAGAAAATCAGGAACGAATCTGGAAGAAAACACAATAATTGCCTCTAGCAAAGCCTCAGCTTCCAACTCATTGTCCCTTCCAATAGTGTTTTTTCATGGCATTAGGGAGGTTATGTTCAAAATTCTGTTGTATGTAGATTATGTTGCGAATGTTGTGTTATTAAAAGGGGATTGTGCCTATTGACATCATACTATGCCTTTGCTGCAAACTGTTGTTTTGGTTACTTACCATCCACACTTCACAGAATAATTATCATTTATTGTTGCTGAATGTGCAGGAACTGATTCTAACCCCTTCTCTCTTGACGCACTGGCTGTCTTCATGTTCCGTGTGCTGCAAAGAGTTAACCATCCAGTATGCGCTCTATGTTGATATTTCAAACTGTAGACGTGCCTCATTCATCATTGATTAACTAATGATTCTACTAATGATCTGTAATTCTGAAGGGAAATCTTGACAAGGCATCCCCAAATGCAGGCTATGTCCTACTAATGTTTTACCACCTTTACGAGGGCAAGGTGTGAACACTTCTCATTGCCCATTTCATGCTGGGTTTCACTTTTAGTATCTTGATTATTTTTCTCTAATGCTTTTCAGAGTCGCAGAGAGTTTGAAGGTGAGCTTATTGATCGTTTTGGGTCTTTGGCTAAGATGCCATTGCTGAAATCTGATAGGTACAAGAACTCATCCTCAGCACTGGTCTTATTTTTCATTATTTCTCTGTCCCTATATGTTAGAATCTCTTTGAAGTTTCACCCTTTACTTGTCTTAATGATGAACACTTATAATGATTATTTTTCTTTTCCATTATTTAAATCCTTTTGCAGTTTTCTTTCTTTGGTTTTGTCTTCTCATTTCCAATTATTGTTTTTAAACAAATTAACAAATTAAAATTAAAGAGGGAATCAAAATCTATACCATGAAAAGCCAACCTATTTTCTAGAAATTTCATAAGAGAGCAAAATTTACTCATATTTTTGACATGTGATCTTTGCACGAAAAGGTTTCCACCCTGAGCCATTTTGCTTTAATGATCTCTCTTAAATCGGTGTAACTAGCATCTTTATTTTATAGATTAAAGTTTATATGACAAATTGCACTGCTTTTTTTCCCCTTTTTTCCAATGCAAAATATTGAGATTGTTTTTGCAGTTTGTACATCTAATATTTTGCTGTTCTTGGACTGCAGGAATCCTTTACCTGATAATTTGAAGACTATCTTAGAGGAAGGATTAAGTCTGTATAAGCTCCATACTAGTAGACATGGAAGGTTGGTGGCTTGTTCAAGTATTTATAGATGTTATTGTTTAAGGAGGGTAAGGATAAGATAGTAAATGAGGGGGAGACAAATCTGTTACGTGAGGAGGAGGGATGAGGGAATGTTGTATCTGGGGACCATTTAGTTGTTAAATCTGAGGGAGGGCGAGAGGAGTATGGAAAAGTTTTGCTGTAGTTTTCATTTCATGCTTTTGGTTTTCAAGAAGTTTCTCGCAGAGAGTCTCATAGGCTGTTAGTTTTGTAAAGCCTTAGGGCTATTTCCATTTTTGATATATCGATACATAGCAAGGAGATACCTTACAATTACTTTGTTGACTGGAAGTTTCTAGCATAGGAATATCAATGAATAATCTGCAATGTGTTTTCTTGTTAGCCACCT
CTTATGCAGTTGTCTGGAATCCTTCGCAGGGTGGACTCCACCAAAGGTTCCTATGCGAAAGAATGGGCCAAATGGGAAAAGAAATTGCGAGAAACTTTGTTTGGTAACACCGAGTATCTCAATGCTATTCAGGTACATTTTACTTCTCTTTAAAAGATCAAATTTTTCTCCATCCTTCAAACTCAATTTAGTTTGTATCAAACATTTTTGAAGCTTTAAAAAGCTTTTATTTTATTTTATATTTTTGAGTCAAACAATTGTTTGATGACCTTATTCACGTATAAAAGTGTTTTGGAAATTTCAACTAAATTTCATACACCACAAAATTTGAACTTTGGAAGTTAATTTCTGTTTTGCCTTTAGTCTATCGGTTGTTTTTTCTCTTATTATTTTTTACTAAGATTTCATCAACAATTCAAAAAAATAAAAACAATGCTCATCTTTTAAATAATTGAAAAACTGAAGTAGAATGGGTATCGTGGACCTAAAGGTGCAAGTATTGGCAACGCTAGTTTCTATGTTAACCATTTTTTGATCAAGACCTTAATAAAATAATCAATGTTAATTATTCAGGTTCCATTTGAGTTTGCTGTTCAAGATGTGTTGGAGCAATTAAAGAAGATCTCGAAAGGTGACTTTAAAAGCCCTATTACAGAGAGGAGGAAGTCTGGGGCCATAGTGTTTGCTGCCGTCAGTCTCCCTGTTCAGGAGATCCAAAATCTTCTTGGCACTGTAAGTGAATATGTGTATGAGCCACTCCGGCTCCCACCCCAAAGCTTAAACTAGCTCATAGGTTATGGGATAGTTAGTCTTTTATATATACTCATTCTTAACACTCCCTTTCACTCACTTGGAAATTGGTCCAAGACCGATCGGGTGATTATGAACACAAGACTTCTCTTCTACAGTACCTTGTCAAACTTCAACTAAATCCAAAAACTCAAGCTGATAGGATGTAGTATATTTAATTTTTTATACTTCCTTTTCTGTAAAGTTTGCTAAATTAGGTTCTAAAGCAACCTCCAGTTTTCTAACAGACCATTGCACGTTATTTGATGTGTCTTGTGCACAGTTGCACAGTTTATTACTGAAACAGTGGGATAAAGATAATGTGGGAGGAAAAAGAATAGTTGTTTGCATTAAAAATTTCTGGTTACAGCTTGACATATTCCTAAGACTTTTACCTGCTGAAAAATTTGTTTACCGCATTGGACAATGGACAGTCACACAGGGATTAAGTCATGGTGTAATTAGCAATTATCGAAATCATTATCAACTTCCCTTTCAATTTTATTTGTGATCTGCTTCGATATCATGCAATAAACCACATACAACAATACCATTTAAATTTGTATAATATATGTAGTGTCATCATATGCACAAGACAAATCTCACTATGTGAATCCTTACATTTTCCACCATTATTGTACAGTTGGGCAAGAAAAATCCCCGTGTTGAAGCATTCCTTAAAGAACACTTCAAGGATTATACACTTAAAGGGGCTCACGTCACACTCGCACACAAGAGAAGCCATGGTGTTAAAGGTGTAGCTGACTACGGCATCTTTGAAAACAAAGAAGTTCCAGTTGAGCTGACAGCCCTACTTTTCTCAGATAACATGGCTGCCTTTGAAGCCCGCCTAGGCAGCATTGAGAATGAAAGAGTGATTTCCAAAAATGAGTGGCCCCATGTAACCTTATGGACTAGAGAAGGGGTTGCAGCAAAAGAAGCTAACGCCTTACCACAGTTAGTATCAGAGGGCAAAGCAACTCTTGTTGAAATCAACCCTCCCATTATCATTTCAGGCATGGTGAGATTCTTTTAGCCTTTGTCCCTCAATCCATATCTCTGCATTCTCAGGTTGATAGGATAGAAATAAATGGATAGGAAATGTCCAAGGCATTAAATTTTGCTCGCTGAGGTTTTTAGATGCAGGTAATGAGAAGTTTTGTAGAATTTGTCTATGACAAAATTTTGTATGTACAGGCCACTTTGATCTA
CCTCAAAACAGGTTGAACTTTGTTTATATGATTAAGTTGGAAATTTTGTCTATTCAATACTGTTTTCTTTTTAGATATGGGATGAGTGTTGGGAGGTTGTTAAGTATTGAAGTCTCTTTTTTCATCCTTAAGGGTACATTTTTTATGCCCAAGAGGTGGCCAAGGTCTCTTTTAACCATGTTCGACATTGGAATAATGTCGTGGCCAATGGGCTTCTAAGGCATTGGAAGAGCTATCTTCTCCTCTTTTTGTTTGGTTCCCTTATTTCTCTTGATGTAGTTTAATTGTTTGTAGTGATGAATTGAACCATAGATTTCTAGAATGGTAAGTGGTGTCTTATCTATTAAGATACATTTAAAATATACTATAATATAGGTTTTGTTTTATAGCATCTAAAAGCATTTTTTAGGTTTGAGAGGATAAAAACATGTAAATAAGGATCTACCTTTCCATCAAACCTGCGGCTGCTCCCTTAAATCCTGTAGCTCATAATAACAATATTTTAAATTTGCAAACAATCCAAATTTGAACAGGAAGGTGCTTTGTTTTCTTTGTTTGTTGCTTTGTAAGTAGAAATTATTGAATGTGCTTAACAATTTTTTACGGAGTATTTTAAGCTTAAAGATTTTTTTGGCAATCTAGACAAAAGAAGAAGCATTGAACCTTAGCCGATTGCTACTAAAGAACTAGGAGTTGATTCCAACAAAGAGAAGTGCATTTATGTCATTCATGATGATTTGTGTTTTGTGAATGAGCTAATTTTGAGTAATGGATCATCTAGTTAACCAGTCTTCACAACACTTAGTCTTCCACATGATGACGTATGTGTTTTGTGAGTGAGCTAGCTTTAGTTTGGCAATGAATGGTAGCTAAATGTAACTTGAATAGGAAAGCCTTTTCGATGCACAAAAGTATAGAGACTAAAATTGATCAAACGTGACTAAAATGTTATTTAAACTTAAATTTTTTTATCTACATCTAAACCAGAATTGAGAGGCTCTATTGATTTTGAGGGATTGAACA
mRNA sequence
CCAATCTCCACTACACCCCTATTATTTTGACATTGTGTAGGAATGTCAGAATGTCAACATGGACGATATTTGGATAATTAAAGATCAAATTAGTTTTCAAGTGTGAAATTTTCTTAAGGTGATTTATGCAAAATTTCCCTCTTTTATAAAAGCCTCAGGTCCCCAAACTTCATTACAATGACACACACCAAGTTTTGGCGCAAGCCGTGTGGAAGGAGCTCTCACTTCTCTGCGGTCTCCGTCTTTCTCTCTCTATAACTATATTCATATAGCGCGTACACTTGAATGTCGGCGTCGCAGAGAATTTTCTGCGCTATAACTCTTCCTCACCCTCGTTTGTATTCATCTTGGGCCTTCCCTTTCATTTGCCACCCTCTATCCCACAATATCTTACCACGCTCTCTCACTCTCGCACCTTTAACTTCCTCCCCCTTTTCCCTTTCTCCTGATTCTCGATTCATCATGCCTTACAATCAGCGAAGGGGTGGCCATAGAGAACAGAAGTGGAAAGAGAAGGCAAAGGTTGACAGAAATTCTACGGAGTCAGAGGCTGCTGCTGACGTTGTTACTAATGCACTCGGTAAATTGAGGGTCACTGAAAGTGATCAACCTCATGTTCTTACTTCTAGTGCGCAGTTTGGAAATGCCCAGCTGACAAATCAGGCCACCCCTGGGCTTGCTCATAGAGCAATTTGGAAACCAAAAGCGTATGGAACAACCAGTGGGGCCGCAGTGGTTGAAGGTGAAAAAGAACCTACCAATGGAACGTCAACCGAAAACAAAGGGAGTAAAGCTGAGCTGGCAGCACAGAATGGCGCCGTTAGCTTGAGTCAATTATTCAAGGGCAATCAGATTGAAAAGTTTACCGTGGATAATTCTACTTACACACAAGCACAAATAAGAGCTACGTTTTACCCTAAATTTGAGAATGAGAAGTCGGATCAGGAGATTAGAACAAGGATGATAGAGATGGTATCGAAAGGCTTGGCTACATTGGAGGTTTCACTAAAACACTCAGGGTCTTTGTTTATGTACGCTGGCCATGAAGGTGGAGCATATGCAAAAAACAGCTTCGGGAATATCTACACTGCTGTTGGTGTCTTTGTTCTGGGAAGGATGTTTCGAGAGGCTTGGGGAGCTGGAGCAGCAAAAAAGCAGGCAGAATTCAATGATTTCCTTGAGAGTAACCGCATGTGCATATCAATGGAGTTAGTAACTGCTGTTTTGGGAGATCATGGCCAGAGACCACGTGAGGATTATGTGGTAGTTACAGCCGTTACAGAACTGGGCAAGGGAAAGCCGAAGTTCTATTCAACTGCAGAGATAATAGCCTTTTGTAGAAAATGGCGGTTACCAACTAATCATGTTTGGTTATTCTCAAGCAGGAAGTCGGTGACTTCTTTTTTTGCTGCATTTGATGCCCTATGTGAAGAAGGAACTGCTACTTCAGTATGTAAGGCTCTCGATGAAGTTGCAGAAATATCTGTACCAGGCTCAAAAGATCACATAAAAGTGCAGGGTGAAATTCTTGAGGGTCTTGTGGCCCGTATGGTGAGCCATGAGAGTTCAAAACACATGGAGAAAGTATTGGAAGAATTTCCTGCTCTGCCGGACAACGAAGGAGGTGGACTTGATTTAGGACCAAGCCTGAGGGAAATTTGTGCTGCAAATAGGTCAGATGAAAAACAGCAAATAAAAGCACTTCTTCAAAACGTTGGTACTGCCTTTTGCCCTGACCATTCTGACTGGTACGGTGATTCCTACTCAAGAAATGCTGACAGATCTGTTTTATCAAAATTCTTACAAGCCAACCCAGCTGATTTTTCAACCTTCAAATTACAGGAAATGATTCGTCTAATGAGAGAAAAACGTCTTCCAGCTGCATTCAAATGCTACCATAATTTCCACAAAGTTGGTTCCATATCAAATGACAACCTTTTCTATAAAATGGTCATTCATGTTCACAGCGACTCTGCTTTTCGGCGATATCAAAAAGAAAT
GAGGCACAAGCCAGGTTTGTGGCCATTATATCGAGGCTTTTTTGTTGACATCAATTTATTCAAAGAAAACAAGGACAAGGCTGCTGAATTAGTGAAAAGTAAGAGCAATTTGATGGAGATTGAAGGCAATGGGACCTTAGGAAGAGATGGATTTGCTGATGAAGATGCAAATCTGATGATAAAACTGAAATTTCTTACATATAAGCTGCGGACTTTTCTGATTCGTAATGGCTTGTCAATTCTCTTCAAAGAAGGTCCAGCTGCATACAAGGCCTATTACTTGAGGCAAATGAAGTTGTGGGGTACATCAGCCGGAAAACAAAGGGAGCTCAGCAAGATGCTTGATGAATGGGCTGTATACTTGAGGAGGAAGTATGGAAATAAACAACTGTCATCGGCTACCTATCTTAGTGAAGCCGAACCTTTTCTTGAACAGTATGCTAAACGCAGTCCTCAGAATCAGGCTCTTATTGGATCTGCTGGAAATTTAGTTAGAGCAGAAGATTTCTTGGCCATTGTCGAGGAAGGAATGGACGAAGAGGGTGATCTTCAGAAGGAGCAGGAGGCAGCACCATCAAGTCCAATGCTCTCTGGGAAGGATGCTGTGCCAAAAGCAGAGGGTTTAATTGTGTTTTTTCCAGGAATTCCAGGCTGTGCAAAGTCTGCTCTTTGCAGAGAGATACTGAATGCTCCAGGAGCACTTGGAGATGATCGACCAGTCAATACTCTGATGGGGGACCTGATTAAAGGAAGATATTGGCAGAAGGTTGCTGATGAGCGTAGGAGAAAACCATACTCCATAATGCTTGCAGACAAAAATGCACCAAATGAAGAAGTGTGGAGACAGATTGAAGATATGTGTTGTAGCACAAGAGCCTCTGCAGTTCCAGTTGTACCTGATTCTGAAGGAACTGATTCTAACCCCTTCTCTCTTGACGCACTGGCTGTCTTCATGTTCCGTGTGCTGCAAAGAGTTAACCATCCAGGAAATCTTGACAAGGCATCCCCAAATGCAGGCTATGTCCTACTAATGTTTTACCACCTTTACGAGGGCAAGAGTCGCAGAGAGTTTGAAGGTGAGCTTATTGATCGTTTTGGGTCTTTGGCTAAGATGCCATTGCTGAAATCTGATAGGAATCCTTTACCTGATAATTTGAAGACTATCTTAGAGGAAGGATTAAGTCTGTATAAGCTCCATACTAGTAGACATGGAAGGGTGGACTCCACCAAAGGTTCCTATGCGAAAGAATGGGCCAAATGGGAAAAGAAATTGCGAGAAACTTTGTTTGGTAACACCGAGTATCTCAATGCTATTCAGGTTCCATTTGAGTTTGCTGTTCAAGATGTGTTGGAGCAATTAAAGAAGATCTCGAAAGGTGACTTTAAAAGCCCTATTACAGAGAGGAGGAAGTCTGGGGCCATAGTGTTTGCTGCCGTCAGTCTCCCTGTTCAGGAGATCCAAAATCTTCTTGGCACTTTGGGCAAGAAAAATCCCCGTGTTGAAGCATTCCTTAAAGAACACTTCAAGGATTATACACTTAAAGGGGCTCACGTCACACTCGCACACAAGAGAAGCCATGGTGTTAAAGGTGTAGCTGACTACGGCATCTTTGAAAACAAAGAAGTTCCAGTTGAGCTGACAGCCCTACTTTTCTCAGATAACATGGCTGCCTTTGAAGCCCGCCTAGGCAGCATTGAGAATGAAAGAGTGATTTCCAAAAATGAGTGGCCCCATGTAACCTTATGGACTAGAGAAGGGGTTGCAGCAAAAGAAGCTAACGCCTTACCACAGTTAGTATCAGAGGGCAAAGCAACTCTTGTTGAAATCAACCCTCCCATTATCATTTCAGGCATGGTGAGATTCTTTTAGCCTTTGTCCCTCAATCCATATCTCTGCATTCTCAGGTTGATAGGATAGAAATAAATGGATAGGAAATGTCCAAGGCATTAAATTTTGCTCGCTGAGGTTTTTAGATGCAGGTAATGAGAAGTTTTGTAGA
ATTTGTCTATGACAAAATTTTGTATGTACAGGCCACTTTGATCTACCTCAAAACAGGTTGAACTTTGTTTATATGATTAAGTTGGAAATTTTGTCTATTCAATACTGTTTTCTTTTTAGATATGGGATGAGTGTTGGGAGGTTGTTAAGTATTGAAGTCTCTTTTTTCATCCTTAAGGGTACATTTTTTATGCCCAAGAGGTGGCCAAGGTCTCTTTTAACCATGTTCGACATTGGAATAATGTCGTGGCCAATGGGCTTCTAAGGCATTGGAAGAGCTATCTTCTCCTCTTTTTGTTTGGTTCCCTTATTTCTCTTGATGTAGTTTAATTGTTTGTAGTGATGAATTGAACCATAGATTTCTAGAATGGTAAGTGGTGTCTTATCTATTAAGATACATTTAAAATATACTATAATATAGGTTTTGTTTTATAGCATCTAAAAGCATTTTTTAGGTTTGAGAGGATAAAAACATGTAAATAAGGATCTACCTTTCCATCAAACCTGCGGCTGCTCCCTTAAATCCTGTAGCTCATAATAACAATATTTTAAATTTGCAAACAATCCAAATTTGAACAGGAAGGTGCTTTGTTTTCTTTGTTTGTTGCTTTGTAAGTAGAAATTATTGAATGTGCTTAACAATTTTTTACGGAGTATTTTAAGCTTAAAGATTTTTTTGGCAATCTAGACAAAAGAAGAAGCATTGAACCTTAGCCGATTGCTACTAAAGAACTAGGAGTTGATTCCAACAAAGAGAAGTGCATTTATGTCATTCATGATGATTTGTGTTTTGTGAATGAGCTAATTTTGAGTAATGGATCATCTAGTTAACCAGTCTTCACAACACTTAGTCTTCCACATGATGACGTATGTGTTTTGTGAGTGAGCTAGCTTTAGTTTGGCAATGAATGGTAGCTAAATGTAACTTGAATAGGAAAGCCTTTTCGATGCACAAAAGTATAGAGACTAAAATTGATCAAACGTGACTAAAATGTTATTTAAACTTAAATTTTTTTATCTACATCTAAACCAGAATTGAGAGGCTCTATTGATTTTGAGGGATTGAACA
Coding sequence (CDS)
ATGTCGGCGTCGCAGAGAATTTTCTGCGCTATAACTCTTCCTCACCCTCGTTTGTATTCATCTTGGGCCTTCCCTTTCATTTGCCACCCTCTATCCCACAATATCTTACCACGCTCTCTCACTCTCGCACCTTTAACTTCCTCCCCCTTTTCCCTTTCTCCTGATTCTCGATTCATCATGCCTTACAATCAGCGAAGGGGTGGCCATAGAGAACAGAAGTGGAAAGAGAAGGCAAAGGTTGACAGAAATTCTACGGAGTCAGAGGCTGCTGCTGACGTTGTTACTAATGCACTCGGTAAATTGAGGGTCACTGAAAGTGATCAACCTCATGTTCTTACTTCTAGTGCGCAGTTTGGAAATGCCCAGCTGACAAATCAGGCCACCCCTGGGCTTGCTCATAGAGCAATTTGGAAACCAAAAGCGTATGGAACAACCAGTGGGGCCGCAGTGGTTGAAGGTGAAAAAGAACCTACCAATGGAACGTCAACCGAAAACAAAGGGAGTAAAGCTGAGCTGGCAGCACAGAATGGCGCCGTTAGCTTGAGTCAATTATTCAAGGGCAATCAGATTGAAAAGTTTACCGTGGATAATTCTACTTACACACAAGCACAAATAAGAGCTACGTTTTACCCTAAATTTGAGAATGAGAAGTCGGATCAGGAGATTAGAACAAGGATGATAGAGATGGTATCGAAAGGCTTGGCTACATTGGAGGTTTCACTAAAACACTCAGGGTCTTTGTTTATGTACGCTGGCCATGAAGGTGGAGCATATGCAAAAAACAGCTTCGGGAATATCTACACTGCTGTTGGTGTCTTTGTTCTGGGAAGGATGTTTCGAGAGGCTTGGGGAGCTGGAGCAGCAAAAAAGCAGGCAGAATTCAATGATTTCCTTGAGAGTAACCGCATGTGCATATCAATGGAGTTAGTAACTGCTGTTTTGGGAGATCATGGCCAGAGACCACGTGAGGATTATGTGGTAGTTACAGCCGTTACAGAACTGGGCAAGGGAAAGCCGAAGTTCTATTCAACTGCAGAGATAATAGCCTTTTGTAGAAAATGGCGGTTACCAACTAATCATGTTTGGTTATTCTCAAGCAGGAAGTCGGTGACTTCTTTTTTTGCTGCATTTGATGCCCTATGTGAAGAAGGAACTGCTACTTCAGTATGTAAGGCTCTCGATGAAGTTGCAGAAATATCTGTACCAGGCTCAAAAGATCACATAAAAGTGCAGGGTGAAATTCTTGAGGGTCTTGTGGCCCGTATGGTGAGCCATGAGAGTTCAAAACACATGGAGAAAGTATTGGAAGAATTTCCTGCTCTGCCGGACAACGAAGGAGGTGGACTTGATTTAGGACCAAGCCTGAGGGAAATTTGTGCTGCAAATAGGTCAGATGAAAAACAGCAAATAAAAGCACTTCTTCAAAACGTTGGTACTGCCTTTTGCCCTGACCATTCTGACTGGTACGGTGATTCCTACTCAAGAAATGCTGACAGATCTGTTTTATCAAAATTCTTACAAGCCAACCCAGCTGATTTTTCAACCTTCAAATTACAGGAAATGATTCGTCTAATGAGAGAAAAACGTCTTCCAGCTGCATTCAAATGCTACCATAATTTCCACAAAGTTGGTTCCATATCAAATGACAACCTTTTCTATAAAATGGTCATTCATGTTCACAGCGACTCTGCTTTTCGGCGATATCAAAAAGAAATGAGGCACAAGCCAGGTTTGTGGCCATTATATCGAGGCTTTTTTGTTGACATCAATTTATTCAAAGAAAACAAGGACAAGGCTGCTGAATTAGTGAAAAGTAAGAGCAATTTGATGGAGATTGAAGGCAATGGGACCTTAGGAAGAGATGGATTTGCTGATGAAGATGCAAATCTGATGATAAAACTGAAATTTCTTACATATAAGCTGCGGACTTTTCTGATTCGTAATGGCTTGTCAATTCTCTTCAAAGAAGGTCCAGCTGCATACAAGGCCTATTACTTGAG
GCAAATGAAGTTGTGGGGTACATCAGCCGGAAAACAAAGGGAGCTCAGCAAGATGCTTGATGAATGGGCTGTATACTTGAGGAGGAAGTATGGAAATAAACAACTGTCATCGGCTACCTATCTTAGTGAAGCCGAACCTTTTCTTGAACAGTATGCTAAACGCAGTCCTCAGAATCAGGCTCTTATTGGATCTGCTGGAAATTTAGTTAGAGCAGAAGATTTCTTGGCCATTGTCGAGGAAGGAATGGACGAAGAGGGTGATCTTCAGAAGGAGCAGGAGGCAGCACCATCAAGTCCAATGCTCTCTGGGAAGGATGCTGTGCCAAAAGCAGAGGGTTTAATTGTGTTTTTTCCAGGAATTCCAGGCTGTGCAAAGTCTGCTCTTTGCAGAGAGATACTGAATGCTCCAGGAGCACTTGGAGATGATCGACCAGTCAATACTCTGATGGGGGACCTGATTAAAGGAAGATATTGGCAGAAGGTTGCTGATGAGCGTAGGAGAAAACCATACTCCATAATGCTTGCAGACAAAAATGCACCAAATGAAGAAGTGTGGAGACAGATTGAAGATATGTGTTGTAGCACAAGAGCCTCTGCAGTTCCAGTTGTACCTGATTCTGAAGGAACTGATTCTAACCCCTTCTCTCTTGACGCACTGGCTGTCTTCATGTTCCGTGTGCTGCAAAGAGTTAACCATCCAGGAAATCTTGACAAGGCATCCCCAAATGCAGGCTATGTCCTACTAATGTTTTACCACCTTTACGAGGGCAAGAGTCGCAGAGAGTTTGAAGGTGAGCTTATTGATCGTTTTGGGTCTTTGGCTAAGATGCCATTGCTGAAATCTGATAGGAATCCTTTACCTGATAATTTGAAGACTATCTTAGAGGAAGGATTAAGTCTGTATAAGCTCCATACTAGTAGACATGGAAGGGTGGACTCCACCAAAGGTTCCTATGCGAAAGAATGGGCCAAATGGGAAAAGAAATTGCGAGAAACTTTGTTTGGTAACACCGAGTATCTCAATGCTATTCAGGTTCCATTTGAGTTTGCTGTTCAAGATGTGTTGGAGCAATTAAAGAAGATCTCGAAAGGTGACTTTAAAAGCCCTATTACAGAGAGGAGGAAGTCTGGGGCCATAGTGTTTGCTGCCGTCAGTCTCCCTGTTCAGGAGATCCAAAATCTTCTTGGCACTTTGGGCAAGAAAAATCCCCGTGTTGAAGCATTCCTTAAAGAACACTTCAAGGATTATACACTTAAAGGGGCTCACGTCACACTCGCACACAAGAGAAGCCATGGTGTTAAAGGTGTAGCTGACTACGGCATCTTTGAAAACAAAGAAGTTCCAGTTGAGCTGACAGCCCTACTTTTCTCAGATAACATGGCTGCCTTTGAAGCCCGCCTAGGCAGCATTGAGAATGAAAGAGTGATTTCCAAAAATGAGTGGCCCCATGTAACCTTATGGACTAGAGAAGGGGTTGCAGCAAAAGAAGCTAACGCCTTACCACAGTTAGTATCAGAGGGCAAAGCAACTCTTGTTGAAATCAACCCTCCCATTATCATTTCAGGCATGGTGAGATTCTTTTAG
Protein sequence
MSASQRIFCAITLPHPRLYSSWAFPFICHPLSHNILPRSLTLAPLTSSPFSLSPDSRFIMPYNQRRGGHREQKWKEKAKVDRNSTESEAAADVVTNALGKLRVTESDQPHVLTSSAQFGNAQLTNQATPGLAHRAIWKPKAYGTTSGAAVVEGEKEPTNGTSTENKGSKAELAAQNGAVSLSQLFKGNQIEKFTVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEVSLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGAGAAKKQAEFNDFLESNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRKWRLPTNHVWLFSSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLVARMVSHESSKHMEKVLEEFPALPDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGTAFCPDHSDWYGDSYSRNADRSVLSKFLQANPADFSTFKLQEMIRLMREKRLPAAFKCYHNFHKVGSISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKDKAAELVKSKSNLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFKEGPAAYKAYYLRQMKLWGTSAGKQRELSKMLDEWAVYLRRKYGNKQLSSATYLSEAEPFLEQYAKRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKAEGLIVFFPGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRRKPYSIMLADKNAPNEEVWRQIEDMCCSTRASAVPVVPDSEGTDSNPFSLDALAVFMFRVLQRVNHPGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSLAKMPLLKSDRNPLPDNLKTILEEGLSLYKLHTSRHGRVDSTKGSYAKEWAKWEKKLRETLFGNTEYLNAIQVPFEFAVQDVLEQLKKISKGDFKSPITERRKSGAIVFAAVSLPVQEIQNLLGTLGKKNPRVEAFLKEHFKDYTLKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSDNMAAFEARLGSIENERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGMVRFF
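The CDS and protein sequences above should be related by the standard genetic code. The following is a minimal sketch of how that relationship could be checked, assuming the standard (nuclear) codon table applies to this gene. The function name `translate` and the constants are illustrative and not part of the database. Only the first and last few codons of the CDS listed above are used here.

```python
# Minimal sketch: translate a CDS with the standard genetic code.
# Assumption: the standard nuclear codon table applies to this gene.
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, in TCAG x TCAG x TCAG order ('*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS (hypothetical helper), stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGTCGGCGTCGCAGAGAATTTTC"))
# Last five codons of the CDS listed above (ends with the stop codon TAG).
print(translate("GTGAGATTCTTTTAG"))
```

Running the same function over the full CDS and comparing the result with the protein sequence is a quick consistency check for the two records.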
Homology
BLAST of Clc03G02590 vs. NCBI nr
Match:
XP_038894223.1 (tRNA ligase 1 isoform X1 [Benincasa hispida])
HSP 1 Score: 2288.5 bits (5929), Expect = 0.0e+00
Identity = 1151/1199 (96.00%), Positives = 1171/1199 (97.66%), Query Frame = 0
Query: 1 MSASQRIFCAITLPHPRLYSS-----WAFPFICHPLSHNILPRSLTLAPLTSSPFSLSPD 60
MSASQRIFCAITLPHPRLY+ AFPFICHPLSH ILPRSLTLAPLTSSPF LS D
Sbjct: 1 MSASQRIFCAITLPHPRLYAPSAFNYRAFPFICHPLSHFILPRSLTLAPLTSSPFPLSRD 60
Query: 61 SRFIMPYNQRRGGHREQKWKEKAKVDRNSTESEAAADVVTNALGKLRVTESDQPHVLTSS 120
SRFIMPYNQR+GG REQKWKEKAKVDRNSTESEAAA+VVTNALGKLRVTE+DQPHVLTSS
Sbjct: 61 SRFIMPYNQRKGGRREQKWKEKAKVDRNSTESEAAAEVVTNALGKLRVTENDQPHVLTSS 120
Query: 121 AQFGNAQLTNQATPGLAHRAIWKPKAYGTTSGAAVVEGEKEPTNGTSTENKGSKAELAAQ 180
AQFGNAQLTNQ TPGLAHRA+WKPKAYGTTSGAA VEGEK PTNGTSTENKGS AELAAQ
Sbjct: 121 AQFGNAQLTNQVTPGLAHRAVWKPKAYGTTSGAAEVEGEKAPTNGTSTENKGSNAELAAQ 180
Query: 181 NGAVSLSQLFKGNQIEKFTVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLA 240
NGAV LSQLFKGNQIEKFTVDNSTYT+AQIRATFYPKFENEKSDQEIRTRMIEMVSKGLA
Sbjct: 181 NGAVGLSQLFKGNQIEKFTVDNSTYTRAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLA 240
Query: 241 TLEVSLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGAGAAKKQAEFN 300
TLEVSLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMF+EAWGA AAKKQAEFN
Sbjct: 241 TLEVSLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFQEAWGAAAAKKQAEFN 300
Query: 301 DFLESNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRKWR 360
DFLESNRM ISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRKWR
Sbjct: 301 DFLESNRMSISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRKWR 360
Query: 361 LPTNHVWLFSSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEIL 420
LPTNHVWLFSSRKS TSFFAAFDALCEEGTATSVCKALDEVAEISVPG+KDHIKVQGEIL
Sbjct: 361 LPTNHVWLFSSRKSATSFFAAFDALCEEGTATSVCKALDEVAEISVPGTKDHIKVQGEIL 420
Query: 421 EGLVARMVSHESSKHMEKVLEEFPALPDNEGGGLDLGPSLREICAANRSDEKQQIKALLQ 480
EGLVAR+VSHESSKHMEKVLE+FPALPDNE GGLDLGPSLREICAANRSDEKQQIKALLQ
Sbjct: 421 EGLVARIVSHESSKHMEKVLEDFPALPDNEVGGLDLGPSLREICAANRSDEKQQIKALLQ 480
Query: 481 NVGTAFCPDHSDWYGDSYSRNADRSVLSKFLQANPADFSTFKLQEMIRLMREKRLPAAFK 540
NVG+AFCPDHSDWYGDS+SRNADRSVLSKFLQANPADFST KLQEMIRLMREKRLPAAFK
Sbjct: 481 NVGSAFCPDHSDWYGDSHSRNADRSVLSKFLQANPADFSTSKLQEMIRLMREKRLPAAFK 540
Query: 541 CYHNFHKVGSISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKEN 600
CYHNFHKVGSISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKEN
Sbjct: 541 CYHNFHKVGSISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKEN 600
Query: 601 KDKAAELVKSKSNLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFK 660
KDK AELVKSK+NLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFK
Sbjct: 601 KDK-AELVKSKNNLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFK 660
Query: 661 EGPAAYKAYYLRQMKLWGTSAGKQRELSKMLDEWAVYLRRKYGNKQLSSATYLSEAEPFL 720
EGPAAYKAYYLRQMKLWGTSAGKQRELSKMLDEWAVYLRRKYGNKQLSSATYLSEAEPFL
Sbjct: 661 EGPAAYKAYYLRQMKLWGTSAGKQRELSKMLDEWAVYLRRKYGNKQLSSATYLSEAEPFL 720
Query: 721 EQYAKRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVP 780
EQYAKRSPQNQALIGSAGNLV+AEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVP
Sbjct: 721 EQYAKRSPQNQALIGSAGNLVKAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVP 780
Query: 781 KAEGLIVFFPGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRRK 840
KAEGLIVFFPGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRRK
Sbjct: 781 KAEGLIVFFPGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRRK 840
Query: 841 PYSIMLADKNAPNEEVWRQIEDMCCSTRASAVPVVPDSEGTDSNPFSLDALAVFMFRVLQ 900
PYSIMLADKNAPNEEVWRQIEDMC STRASAVPVVPDSEGTDSNPFSLDALAVFMFRVLQ
Sbjct: 841 PYSIMLADKNAPNEEVWRQIEDMCRSTRASAVPVVPDSEGTDSNPFSLDALAVFMFRVLQ 900
Query: 901 RVNHPGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSLAKMPLLKSDRNPLPD 960
RVNHPGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSL K+PLLKSDRNPLP+
Sbjct: 901 RVNHPGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSLVKIPLLKSDRNPLPN 960
Query: 961 NLKTILEEGLSLYKLHTSRHGRVDSTKGSYAKEWAKWEKKLRETLFGNTEYLNAIQVPFE 1020
NLKTILEEGLSLYKLHTSRHGRVDSTKGSYAKEW KWEK+LRETLFGNTEYLNAIQVPFE
Sbjct: 961 NLKTILEEGLSLYKLHTSRHGRVDSTKGSYAKEWTKWEKQLRETLFGNTEYLNAIQVPFE 1020
Query: 1021 FAVQDVLEQLKKISKGDFKSPITERRKSGAIVFAAVSLPVQEIQNLLGTLGKKNPRVEAF 1080
FAVQDVLEQLKKISKGDFKSPITERRKSGAIVFAAV+LPVQEIQNLLGTLGKKNPRVEAF
Sbjct: 1021 FAVQDVLEQLKKISKGDFKSPITERRKSGAIVFAAVNLPVQEIQNLLGTLGKKNPRVEAF 1080
Query: 1081 LKEHFKDYTLKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSDNMAAFEARLG 1140
LKEH+KDYTLKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSD MAAFEARLG
Sbjct: 1081 LKEHYKDYTLKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSDKMAAFEARLG 1140
Query: 1141 SIENERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGMVRFF 1195
SIENERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPI ISG V+FF
Sbjct: 1141 SIENERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPINISGTVKFF 1198
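The percentages in an HSP summary line follow directly from the match counts: identical (or positive) positions divided by the alignment length. The sketch below parses one summary line and recomputes both values. The function name `hsp_percentages` and the regular expression are assumptions made for this illustration, not part of any BLAST tool.

```python
import re

# Matches the "Identity = a/b ... Positives = c/d" fields of an HSP summary.
# The optional "i" also tolerates a misspelled "Postives" label.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)

def hsp_percentages(summary: str) -> dict:
    """Return identity and positives percentages from an HSP summary line."""
    match = HSP_STATS.search(summary)
    if match is None:
        raise ValueError("no Identity/Positives fields found")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return {
        "identity_pct": 100.0 * ident / ident_len,
        "positives_pct": 100.0 * pos / pos_len,
    }

# Summary line of the Benincasa hispida hit above.
line = "Identity = 1151/1199 (96.00%), Positives = 1171/1199 (97.66%), Query Frame = 0"
stats = hsp_percentages(line)
print(f"{stats['identity_pct']:.2f} {stats['positives_pct']:.2f}")  # -> 96.00 97.66
```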
BLAST of Clc03G02590 vs. NCBI nr
Match:
XP_004147268.2 (tRNA ligase 1 isoform X1 [Cucumis sativus] >KGN64758.2 hypothetical protein Csa_013879 [Cucumis sativus])
HSP 1 Score: 2251.9 bits (5834), Expect = 0.0e+00
Identity = 1128/1195 (94.39%), Positives = 1155/1195 (96.65%), Query Frame = 0
Query: 1 MSASQRIFCAITLPHPRLYSSW-AFPFICHPLSHNILPRSLTLAPLTSSPFSLSPDSRFI 60
MSA QRIFCA TLPHP SS+ FPFI HPLSH ILPRSLTLAPLTSSP +S DSRF+
Sbjct: 1 MSALQRIFCAKTLPHPPFSSSYRVFPFISHPLSHYILPRSLTLAPLTSSPLPISCDSRFV 60
Query: 61 MPYNQRRGGHREQKWKEKAKVDRNSTESEAAADVVTNALGKLRVTESDQPHVLTSSAQFG 120
MPYNQRRG EQKWKEKAK DRNSTESEAAA+VVTNALGKLRVTESDQPHVLTSSAQFG
Sbjct: 61 MPYNQRRGSRGEQKWKEKAKADRNSTESEAAAEVVTNALGKLRVTESDQPHVLTSSAQFG 120
Query: 121 NAQLTNQATPGLAHRAIWKPKAYGTTSGAAVVEGEKEPTNGTSTENKGSKAELAAQNGAV 180
NAQLTNQATPGLAHRAIWKPKAYGTTSGAAV+EGEK PTN TSTENKGS A +AAQ+G V
Sbjct: 121 NAQLTNQATPGLAHRAIWKPKAYGTTSGAAVIEGEKAPTNETSTENKGSNAGVAAQDGVV 180
Query: 181 SLSQLFKGNQIEKFTVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEV 240
SLSQLFK NQIEKFTVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEV
Sbjct: 181 SLSQLFKSNQIEKFTVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEV 240
Query: 241 SLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGAGAAKKQAEFNDFLE 300
SLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGA AAKKQAEFNDFLE
Sbjct: 241 SLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGAEAAKKQAEFNDFLE 300
Query: 301 SNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRKWRLPTN 360
SNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCR WRLPTN
Sbjct: 301 SNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRNWRLPTN 360
Query: 361 HVWLFSSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLV 420
HVWLFSSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLV
Sbjct: 361 HVWLFSSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLV 420
Query: 421 ARMVSHESSKHMEKVLEEFPALPDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGT 480
ARMVSHESSKHM+KVLEEFPALPDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGT
Sbjct: 421 ARMVSHESSKHMQKVLEEFPALPDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGT 480
Query: 481 AFCPDHSDWYGDSYSRNADRSVLSKFLQANPADFSTFKLQEMIRLMREKRLPAAFKCYHN 540
AFCPDHSDWYGDS+SRNADRSVLSKFLQANPADFST KLQEMIRLMRE+RLPAAFKCYHN
Sbjct: 481 AFCPDHSDWYGDSHSRNADRSVLSKFLQANPADFSTSKLQEMIRLMRERRLPAAFKCYHN 540
Query: 541 FHKVGSISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKDKA 600
FHKV SISNDNLFYKMVIHVHSDSAFRRYQKE+RHKP LWPLYRGFFVDINLFKENKDKA
Sbjct: 541 FHKVASISNDNLFYKMVIHVHSDSAFRRYQKELRHKPSLWPLYRGFFVDINLFKENKDKA 600
Query: 601 AELVKSKSNLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFKEGPA 660
AELVKSKSNLM+ EGNGTLGRDGFADED+NLMIKLKFLTYKLRTFLIRNGLSILFKEG
Sbjct: 601 AELVKSKSNLMDTEGNGTLGRDGFADEDSNLMIKLKFLTYKLRTFLIRNGLSILFKEGAV 660
Query: 661 AYKAYYLRQMKLWGTSAGKQRELSKMLDEWAVYLRRKYGNKQLSSATYLSEAEPFLEQYA 720
AYKAYYLRQMKLWGTSAGKQRELSKMLDEWAVY+RRKYGNKQLSSATYLSEAEPFLEQYA
Sbjct: 661 AYKAYYLRQMKLWGTSAGKQRELSKMLDEWAVYMRRKYGNKQLSSATYLSEAEPFLEQYA 720
Query: 721 KRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKAEG 780
KRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKE EAAPSSPMLSGKDAVPKAEG
Sbjct: 721 KRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKELEAAPSSPMLSGKDAVPKAEG 780
Query: 781 LIVFFPGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRRKPYSI 840
LIVFFPGIPGCAKSALC+EIL APGALGDDRPVNTLMGDLIKGRYWQKVAD+RRRKPYSI
Sbjct: 781 LIVFFPGIPGCAKSALCKEILKAPGALGDDRPVNTLMGDLIKGRYWQKVADDRRRKPYSI 840
Query: 841 MLADKNAPNEEVWRQIEDMCCSTRASAVPVVPDSEGTDSNPFSLDALAVFMFRVLQRVNH 900
MLADKNAPNEEVWRQIEDMC STRASAVPV+PDSEGTDSNPFSLDALAVFMFRVLQRVNH
Sbjct: 841 MLADKNAPNEEVWRQIEDMCRSTRASAVPVIPDSEGTDSNPFSLDALAVFMFRVLQRVNH 900
Query: 901 PGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSLAKMPLLKSDRNPLPDNLKT 960
PGNLDKASPNAGYVLLMFYHLY+GKSRREFEGELIDRFGSL KMPLLKSDRNPLPD+LKT
Sbjct: 901 PGNLDKASPNAGYVLLMFYHLYDGKSRREFEGELIDRFGSLVKMPLLKSDRNPLPDDLKT 960
Query: 961 ILEEGLSLYKLHTSRHGRVDSTKGSYAKEWAKWEKKLRETLFGNTEYLNAIQVPFEFAVQ 1020
ILEEG+SLYKLHTSRHGRVDSTKGSYAKEWAKWEK+LRETLF NTEYLNAIQVPFE AVQ
Sbjct: 961 ILEEGISLYKLHTSRHGRVDSTKGSYAKEWAKWEKQLRETLFSNTEYLNAIQVPFELAVQ 1020
Query: 1021 DVLEQLKKISKGDFKSPITERRKSGAIVFAAVSLPVQEIQNLLGTLGKKNPRVEAFLKEH 1080
DVLEQLKK+SKGD+KSPITERRKSGAIVFAAVSLPVQEIQNLLGTL KKN R+EAFL+EH
Sbjct: 1021 DVLEQLKKVSKGDYKSPITERRKSGAIVFAAVSLPVQEIQNLLGTLAKKNSRIEAFLREH 1080
Query: 1081 FKDYTLKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSDNMAAFEARLGSIEN 1140
+KDY LKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSD MAAFEARLGSIEN
Sbjct: 1081 YKDYKLKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSDKMAAFEARLGSIEN 1140
Query: 1141 ERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGMVRFF 1195
ERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGMV+FF
Sbjct: 1141 ERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGMVKFF 1195
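The identity count reported for a hit is the number of aligned columns in which query and subject carry the same residue. As a minimal sketch, the final block of the Cucumis sativus alignment above (query positions 1141 to 1195) can be checked by hand. The helper name `count_identities` is an assumption introduced for this example.

```python
from typing import Tuple

def count_identities(query: str, subject: str) -> Tuple[int, int]:
    """Return (identical columns, aligned columns) for one alignment block."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    # A column counts as identical only if both rows hold the same residue;
    # gap characters ('-') never count as matches.
    matches = sum(1 for q, s in zip(query, subject) if q == s and q != "-")
    return matches, len(query)

# Last Query/Sbjct rows of the alignment above.
query_block = "ERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGMVRFF"
sbjct_block = "ERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGMVKFF"
print(count_identities(query_block, sbjct_block))  # -> (54, 55)
```

The single mismatch in this block is the R/K substitution near the C-terminus, shown as `+` in the match line because it is a conservative replacement. Summing such per-block counts over all blocks of a hit gives the numerator of its Identity field.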
BLAST of Clc03G02590 vs. NCBI nr
Match:
XP_008463605.1 (PREDICTED: uncharacterized protein LOC103501711 isoform X1 [Cucumis melo])
HSP 1 Score: 2242.2 bits (5809), Expect = 0.0e+00
Identity = 1124/1195 (94.06%), Positives = 1152/1195 (96.40%), Query Frame = 0
Query: 1 MSASQRIFCAITLPHPRLYSSW-AFPFICHPLSHNILPRSLTLAPLTSSPFSLSPDSRFI 60
MSA QRIF A LPHP SS+ FPFICHPLSH ILPRSLTLAPLTSSPF LS DSRF+
Sbjct: 1 MSALQRIFYAKILPHPPFSSSYKVFPFICHPLSHFILPRSLTLAPLTSSPFPLSCDSRFV 60
Query: 61 MPYNQRRGGHREQKWKEKAKVDRNSTESEAAADVVTNALGKLRVTESDQPHVLTSSAQFG 120
MPYNQRRGG EQKWKEKAKVD++ TESEAA +VVTNALGKLRVTESDQ HVLTSSAQFG
Sbjct: 61 MPYNQRRGGRGEQKWKEKAKVDKSPTESEAAVEVVTNALGKLRVTESDQSHVLTSSAQFG 120
Query: 121 NAQLTNQATPGLAHRAIWKPKAYGTTSGAAVVEGEKEPTNGTSTENKGSKAELAAQNGAV 180
NAQLTNQA PGLAHRAIWKPKAYGTTSGAAV+EGEK TNGTSTENKGS A LA Q GAV
Sbjct: 121 NAQLTNQAIPGLAHRAIWKPKAYGTTSGAAVIEGEKASTNGTSTENKGSNAGLAVQGGAV 180
Query: 181 SLSQLFKGNQIEKFTVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEV 240
LSQLFK NQIEKF VDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEV
Sbjct: 181 GLSQLFKSNQIEKFIVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEV 240
Query: 241 SLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGAGAAKKQAEFNDFLE 300
SLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGA AAKKQAEFNDFL+
Sbjct: 241 SLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGAEAAKKQAEFNDFLQ 300
Query: 301 SNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRKWRLPTN 360
SNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCR WRLPTN
Sbjct: 301 SNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRNWRLPTN 360
Query: 361 HVWLFSSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLV 420
HVWLFSSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLV
Sbjct: 361 HVWLFSSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLV 420
Query: 421 ARMVSHESSKHMEKVLEEFPALPDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGT 480
ARMVSHESSKHM+KVLEEFPA+PDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGT
Sbjct: 421 ARMVSHESSKHMQKVLEEFPAVPDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGT 480
Query: 481 AFCPDHSDWYGDSYSRNADRSVLSKFLQANPADFSTFKLQEMIRLMREKRLPAAFKCYHN 540
AFCPDHSDWYGDS+SRNADRSVLSKFLQANPADFST KLQEMIRLMRE+RLPAAFKCYHN
Sbjct: 481 AFCPDHSDWYGDSHSRNADRSVLSKFLQANPADFSTSKLQEMIRLMRERRLPAAFKCYHN 540
Query: 541 FHKVGSISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKDKA 600
FHKV SISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKDKA
Sbjct: 541 FHKVASISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKDKA 600
Query: 601 AELVKSKSNLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFKEGPA 660
A LVKSKSNLM+ EGNGTLGRDGFADED+NLMIKLKFLTYKLRTFLIRNGLSILFKEGP
Sbjct: 601 AGLVKSKSNLMDTEGNGTLGRDGFADEDSNLMIKLKFLTYKLRTFLIRNGLSILFKEGPV 660
Query: 661 AYKAYYLRQMKLWGTSAGKQRELSKMLDEWAVYLRRKYGNKQLSSATYLSEAEPFLEQYA 720
AYKAYYLRQMKLWGTSAGKQRELSKMLDEWAVY+RRKYGNKQLSSATYLSEAEPFLEQYA
Sbjct: 661 AYKAYYLRQMKLWGTSAGKQRELSKMLDEWAVYIRRKYGNKQLSSATYLSEAEPFLEQYA 720
Query: 721 KRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKAEG 780
KRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKAEG
Sbjct: 721 KRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKAEG 780
Query: 781 LIVFFPGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRRKPYSI 840
LIVFFPGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERR+KPYSI
Sbjct: 781 LIVFFPGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRKKPYSI 840
Query: 841 MLADKNAPNEEVWRQIEDMCCSTRASAVPVVPDSEGTDSNPFSLDALAVFMFRVLQRVNH 900
MLADKNAPNEEVWRQIEDMC STRASAVPV+PDSEGTDSNPFSLDALAVFMFRVLQRVNH
Sbjct: 841 MLADKNAPNEEVWRQIEDMCRSTRASAVPVIPDSEGTDSNPFSLDALAVFMFRVLQRVNH 900
Query: 901 PGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSLAKMPLLKSDRNPLPDNLKT 960
PGNLDKASPNAGYVLLMFYHLY+GKSRREFEGELIDRFGSL KMPLLK DRNPLPD+LK+
Sbjct: 901 PGNLDKASPNAGYVLLMFYHLYDGKSRREFEGELIDRFGSLVKMPLLKPDRNPLPDDLKS 960
Query: 961 ILEEGLSLYKLHTSRHGRVDSTKGSYAKEWAKWEKKLRETLFGNTEYLNAIQVPFEFAVQ 1020
ILEEG+SLYKLHTSRHGRVDSTKGSYAKEWAKWEK+LRETLF NTEYLNAIQVPFE AVQ
Sbjct: 961 ILEEGISLYKLHTSRHGRVDSTKGSYAKEWAKWEKQLRETLFSNTEYLNAIQVPFESAVQ 1020
Query: 1021 DVLEQLKKISKGDFKSPITERRKSGAIVFAAVSLPVQEIQNLLGTLGKKNPRVEAFLKEH 1080
DVLEQLKKIS+GD+KSPITERRKSGAIVFAAVSLPVQEIQN+LGTLGKKN R+EAFLKEH
Sbjct: 1021 DVLEQLKKISEGDYKSPITERRKSGAIVFAAVSLPVQEIQNVLGTLGKKNSRIEAFLKEH 1080
Query: 1081 FKDYTLKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSDNMAAFEARLGSIEN 1140
+KDY LKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSD MA FEARLGSIEN
Sbjct: 1081 YKDYKLKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSDKMAGFEARLGSIEN 1140
Query: 1141 ERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGMVRFF 1195
ERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISG+V+FF
Sbjct: 1141 ERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGIVKFF 1195
BLAST of Clc03G02590 vs. NCBI nr
Match:
XP_008463612.1 (PREDICTED: uncharacterized protein LOC103501711 isoform X2 [Cucumis melo])
HSP 1 Score: 2149.0 bits (5567), Expect = 0.0e+00
Identity = 1073/1130 (94.96%), Positives = 1099/1130 (97.26%), Query Frame = 0
Query: 65 RRGGHREQKWKEKAKVDRNSTESEAAADVVTNALGKLRVTESDQPHVLTSSAQFGNAQLT 124
RRGG EQKWKEKAKVD++ TESEAA +VVTNALGKLRVTESDQ HVLTSSAQFGNAQLT
Sbjct: 11 RRGGRGEQKWKEKAKVDKSPTESEAAVEVVTNALGKLRVTESDQSHVLTSSAQFGNAQLT 70
Query: 125 NQATPGLAHRAIWKPKAYGTTSGAAVVEGEKEPTNGTSTENKGSKAELAAQNGAVSLSQL 184
NQA PGLAHRAIWKPKAYGTTSGAAV+EGEK TNGTSTENKGS A LA Q GAV LSQL
Sbjct: 71 NQAIPGLAHRAIWKPKAYGTTSGAAVIEGEKASTNGTSTENKGSNAGLAVQGGAVGLSQL 130
Query: 185 FKGNQIEKFTVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEVSLKHS 244
FK NQIEKF VDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEVSLKHS
Sbjct: 131 FKSNQIEKFIVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEVSLKHS 190
Query: 245 GSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGAGAAKKQAEFNDFLESNRMC 304
GSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGA AAKKQAEFNDFL+SNRMC
Sbjct: 191 GSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGAEAAKKQAEFNDFLQSNRMC 250
Query: 305 ISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRKWRLPTNHVWLF 364
ISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCR WRLPTNHVWLF
Sbjct: 251 ISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRNWRLPTNHVWLF 310
Query: 365 SSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLVARMVS 424
SSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLVARMVS
Sbjct: 311 SSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLVARMVS 370
Query: 425 HESSKHMEKVLEEFPALPDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGTAFCPD 484
HESSKHM+KVLEEFPA+PDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGTAFCPD
Sbjct: 371 HESSKHMQKVLEEFPAVPDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGTAFCPD 430
Query: 485 HSDWYGDSYSRNADRSVLSKFLQANPADFSTFKLQEMIRLMREKRLPAAFKCYHNFHKVG 544
HSDWYGDS+SRNADRSVLSKFLQANPADFST KLQEMIRLMRE+RLPAAFKCYHNFHKV
Sbjct: 431 HSDWYGDSHSRNADRSVLSKFLQANPADFSTSKLQEMIRLMRERRLPAAFKCYHNFHKVA 490
Query: 545 SISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKDKAAELVK 604
SISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKDKAA LVK
Sbjct: 491 SISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKDKAAGLVK 550
Query: 605 SKSNLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFKEGPAAYKAY 664
SKSNLM+ EGNGTLGRDGFADED+NLMIKLKFLTYKLRTFLIRNGLSILFKEGP AYKAY
Sbjct: 551 SKSNLMDTEGNGTLGRDGFADEDSNLMIKLKFLTYKLRTFLIRNGLSILFKEGPVAYKAY 610
Query: 665 YLRQMKLWGTSAGKQRELSKMLDEWAVYLRRKYGNKQLSSATYLSEAEPFLEQYAKRSPQ 724
YLRQMKLWGTSAGKQRELSKMLDEWAVY+RRKYGNKQLSSATYLSEAEPFLEQYAKRSPQ
Sbjct: 611 YLRQMKLWGTSAGKQRELSKMLDEWAVYIRRKYGNKQLSSATYLSEAEPFLEQYAKRSPQ 670
Query: 725 NQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKAEGLIVFF 784
NQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKAEGLIVFF
Sbjct: 671 NQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKAEGLIVFF 730
Query: 785 PGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRRKPYSIMLADK 844
PGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERR+KPYSIMLADK
Sbjct: 731 PGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRKKPYSIMLADK 790
Query: 845 NAPNEEVWRQIEDMCCSTRASAVPVVPDSEGTDSNPFSLDALAVFMFRVLQRVNHPGNLD 904
NAPNEEVWRQIEDMC STRASAVPV+PDSEGTDSNPFSLDALAVFMFRVLQRVNHPGNLD
Sbjct: 791 NAPNEEVWRQIEDMCRSTRASAVPVIPDSEGTDSNPFSLDALAVFMFRVLQRVNHPGNLD 850
Query: 905 KASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSLAKMPLLKSDRNPLPDNLKTILEEG 964
KASPNAGYVLLMFYHLY+GKSRREFEGELIDRFGSL KMPLLK DRNPLPD+LK+ILEEG
Sbjct: 851 KASPNAGYVLLMFYHLYDGKSRREFEGELIDRFGSLVKMPLLKPDRNPLPDDLKSILEEG 910
Query: 965 LSLYKLHTSRHGRVDSTKGSYAKEWAKWEKKLRETLFGNTEYLNAIQVPFEFAVQDVLEQ 1024
+SLYKLHTSRHGRVDSTKGSYAKEWAKWEK+LRETLF NTEYLNAIQVPFE AVQDVLEQ
Sbjct: 911 ISLYKLHTSRHGRVDSTKGSYAKEWAKWEKQLRETLFSNTEYLNAIQVPFESAVQDVLEQ 970
Query: 1025 LKKISKGDFKSPITERRKSGAIVFAAVSLPVQEIQNLLGTLGKKNPRVEAFLKEHFKDYT 1084
LKKIS+GD+KSPITERRKSGAIVFAAVSLPVQEIQN+LGTLGKKN R+EAFLKEH+KDY
Sbjct: 971 LKKISEGDYKSPITERRKSGAIVFAAVSLPVQEIQNVLGTLGKKNSRIEAFLKEHYKDYK 1030
Query: 1085 LKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSDNMAAFEARLGSIENERVIS 1144
LKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSD MA FEARLGSIENERVIS
Sbjct: 1031 LKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSDKMAGFEARLGSIENERVIS 1090
Query: 1145 KNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGMVRFF 1195
KNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISG+V+FF
Sbjct: 1091 KNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGIVKFF 1140
BLAST of Clc03G02590 vs. NCBI nr
Match:
KAG7019255.1 (tRNA ligase 1 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 2131.7 bits (5522), Expect = 0.0e+00
Identity = 1071/1197 (89.47%), Positives = 1124/1197 (93.90%), Query Frame = 0
Query: 1 MSASQRIFCAITLP---HPRLYSSWAFPFICHPLSHNILPRSLTLAPLTSSPFSLSPDSR 60
MSA+ RIFCAITLP P L+S AFPF+ LSH IL SLTL P + PF++ DSR
Sbjct: 1 MSATYRIFCAITLPLSSSPALHSR-AFPFVSCSLSHFILHPSLTL-PASVFPFTVCRDSR 60
Query: 61 FIMPYNQRRGGHREQKWKEKAKVDRNSTESEAAADVVTNALGKLRVTESDQPHVLTSSAQ 120
F MPYNQRRGG REQKWKEKAKV+ STESE A++VVTNAL LRVTES+QPH+ +S Q
Sbjct: 61 FTMPYNQRRGGRREQKWKEKAKVEGISTESETASEVVTNALSNLRVTESNQPHIPITSVQ 120
Query: 121 FGNAQLTNQATPGLAHRAIWKPKAYGTTSGAAVVEGEKEPTNGTSTENKGSKAELAAQNG 180
FGNAQ TN ATPGL HRAIWKPKAYGTTSGAAVVEGEK P GTSTENKGS AE+AA +
Sbjct: 121 FGNAQPTNLATPGLGHRAIWKPKAYGTTSGAAVVEGEKAPAVGTSTENKGSNAEIAANSS 180
Query: 181 AVSLSQLFKGNQIEKFTVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATL 240
A++LSQL KGNQIE+FTVDNS YTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATL
Sbjct: 181 AIALSQLLKGNQIEQFTVDNSAYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATL 240
Query: 241 EVSLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGAGAAKKQAEFNDF 300
EVSLKHSGSLFMYAGH+GGAYAKNSFGNIYTAVGVFVLGRMFREAWG+ A KKQAEFNDF
Sbjct: 241 EVSLKHSGSLFMYAGHQGGAYAKNSFGNIYTAVGVFVLGRMFREAWGSVAPKKQAEFNDF 300
Query: 301 LESNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRKWRLP 360
LESNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELG GKPKFYST+EIIAFCRKWRLP
Sbjct: 301 LESNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGNGKPKFYSTSEIIAFCRKWRLP 360
Query: 361 TNHVWLFSSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEG 420
TNHVWLFSSRKSVTSFFA+FDALCEEGTAT+VCKALDEVAEISVPGSKDHIKVQGEILEG
Sbjct: 361 TNHVWLFSSRKSVTSFFASFDALCEEGTATTVCKALDEVAEISVPGSKDHIKVQGEILEG 420
Query: 421 LVARMVSHESSKHMEKVLEEFPALPDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNV 480
LVARMVSHESSKHMEKVLEEFPALP NEGGGLDLGPSLREICAANRSDEKQQIKALLQNV
Sbjct: 421 LVARMVSHESSKHMEKVLEEFPALPYNEGGGLDLGPSLREICAANRSDEKQQIKALLQNV 480
Query: 481 GTAFCPDHSDWYGDSYSRNADRSVLSKFLQANPADFSTFKLQEMIRLMREKRLPAAFKCY 540
G+AFCPDHSDWYGDS+SRNADRSV+SKFLQA PADFST KLQEM+RLMRE+RLPAAFKCY
Sbjct: 481 GSAFCPDHSDWYGDSHSRNADRSVVSKFLQAKPADFSTSKLQEMVRLMRERRLPAAFKCY 540
Query: 541 HNFHKVGSISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKD 600
HNFHK+GSISNDNLFYKMVIHV SDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENK+
Sbjct: 541 HNFHKIGSISNDNLFYKMVIHVQSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKE 600
Query: 601 KAAELVKSKSNLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFKEG 660
K AE+VKSK+NLME EGNGT+GRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFKEG
Sbjct: 601 KTAEIVKSKNNLMETEGNGTVGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFKEG 660
Query: 661 PAAYKAYYLRQMKLWGTSAGKQRELSKMLDEWAVYLRRKYGNKQLSSATYLSEAEPFLEQ 720
AAYKAYYLRQMKLWGTS GKQRELSKMLDEWAVYLRRKYGNKQLSS+ YLSEAEPFLEQ
Sbjct: 661 SAAYKAYYLRQMKLWGTSFGKQRELSKMLDEWAVYLRRKYGNKQLSSSIYLSEAEPFLEQ 720
Query: 721 YAKRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKA 780
YAKRSPQNQALIGSAGNLVRAEDFLA+VEEGMDEEGDLQKE +AAPSSPMLS KD VPKA
Sbjct: 721 YAKRSPQNQALIGSAGNLVRAEDFLAVVEEGMDEEGDLQKE-DAAPSSPMLSRKDVVPKA 780
Query: 781 EGLIVFFPGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRRKPY 840
EGLIVFFPGIPGCAKSALCREILNAPG LGDDRPVNTLMGDLIKGRYWQKVADERRRKPY
Sbjct: 781 EGLIVFFPGIPGCAKSALCREILNAPGGLGDDRPVNTLMGDLIKGRYWQKVADERRRKPY 840
Query: 841 SIMLADKNAPNEEVWRQIEDMCCSTRASAVPVVPDSEGTDSNPFSLDALAVFMFRVLQRV 900
SIMLADKNAPNEEVWRQIEDMC STRASAVPV+PDSEGTDSNPFSLDALAVFMFRVL RV
Sbjct: 841 SIMLADKNAPNEEVWRQIEDMCHSTRASAVPVIPDSEGTDSNPFSLDALAVFMFRVLHRV 900
Query: 901 NHPGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSLAKMPLLKSDRNPLPDNL 960
NHPGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSL K+PLLKSDR+PLPDNL
Sbjct: 901 NHPGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSLVKIPLLKSDRSPLPDNL 960
Query: 961 KTILEEGLSLYKLHTSRHGRVDSTKGSYAKEWAKWEKKLRETLFGNTEYLNAIQVPFEFA 1020
KTILEEGLSLYKLHTSRHGR DSTKGSYAKEWAKWEK+LRETLFGN EYLNAIQVPFEFA
Sbjct: 961 KTILEEGLSLYKLHTSRHGRADSTKGSYAKEWAKWEKQLRETLFGNAEYLNAIQVPFEFA 1020
Query: 1021 VQDVLEQLKKISKGDFKSPITERRKSGAIVFAAVSLPVQEIQNLLGTLGKKNPRVEAFLK 1080
VQ+VLEQLKKISKGD+KSPITERRKS IV+AAVSLPVQ+IQ+ L TLG KNP+VEAF+K
Sbjct: 1021 VQNVLEQLKKISKGDYKSPITERRKSATIVYAAVSLPVQDIQDALDTLGNKNPQVEAFIK 1080
Query: 1081 EHFKDYTLKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSDNMAAFEARLGSI 1140
E +KDYTLK AHVTLAHKRSHG+K VADYGIFENKEVPVELTALLFSD MAAFEAR+GSI
Sbjct: 1081 EGYKDYTLKSAHVTLAHKRSHGIKAVADYGIFENKEVPVELTALLFSDKMAAFEARVGSI 1140
Query: 1141 ENERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGMVRFF 1195
E+ERVISKNEWPHVTLWTREG+AAKEAN LPQLVSEGKATLVE+NPPIIISG V+FF
Sbjct: 1141 EDERVISKNEWPHVTLWTREGIAAKEANTLPQLVSEGKATLVELNPPIIISGKVQFF 1194
BLAST of Clc03G02590 vs. ExPASy Swiss-Prot
Match:
Q0WL81 (tRNA ligase 1 OS=Arabidopsis thaliana OX=3702 GN=RNL PE=1 SV=1)
HSP 1 Score: 1538.1 bits (3981), Expect = 0.0e+00
Identity = 770/1120 (68.75%), Positives = 904/1120 (80.71%), Query Frame = 0
Query: 78 AKVDRNSTESEAAADVVTNALGKLRVTESDQPHVLTSSAQFGNAQLTNQATPGLAHRAIW 137
A + + + A+ V N G L + ES+ + S N ++ N +W
Sbjct: 3 APFESGDSSATVVAEAVNNQFGGLSLKESNTNAPVLPSQTTSNHRVQN---------LVW 62
Query: 138 KPKAYGTTSGAAVVEGEKEPTNGTSTENKGSKAELAAQNGAVSLSQLFKGNQIEKFTVDN 197
KPK+YGT SG++ E ++ GS + ++LS++F GN +EKF+VD
Sbjct: 63 KPKSYGTVSGSS---SATEVGKTSAVSQIGSSGDTKV---GLNLSKIFGGNLLEKFSVDK 122
Query: 198 STYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEVSLKHSGSLFMYAGHEGGA 257
STY AQIRATFYPKFENEK+DQEIRTRMIEMVSKGLATLEVSLKHSGSLFMYAGH+GGA
Sbjct: 123 STYCHAQIRATFYPKFENEKTDQEIRTRMIEMVSKGLATLEVSLKHSGSLFMYAGHKGGA 182
Query: 258 YAKNSFGNIYTAVGVFVLGRMFREAWGAGAAKKQAEFNDFLESNRMCISMELVTAVLGDH 317
YAKNSFGNIYTAVGVFVL RMFREAWG A KK+AEFNDFLE NRMCISMELVTAVLGDH
Sbjct: 183 YAKNSFGNIYTAVGVFVLSRMFREAWGTKAPKKEAEFNDFLEKNRMCISMELVTAVLGDH 242
Query: 318 GQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRKWRLPTNHVWLFSSRKSVTSFFAAF 377
GQRP +DYVVVTAVTELG GKP+FYST+EII+FCRKWRLPTNHVWLFS+RKSVTSFFAAF
Sbjct: 243 GQRPLDDYVVVTAVTELGNGKPQFYSTSEIISFCRKWRLPTNHVWLFSTRKSVTSFFAAF 302
Query: 378 DALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLVARMVSHESSKHMEKVLEE 437
DALCEEG ATSVC+ALDEVA+ISVP SKDH+KVQGEILEGLVAR+VS +SS+ ME VL +
Sbjct: 303 DALCEEGIATSVCRALDEVADISVPASKDHVKVQGEILEGLVARIVSSQSSRDMENVLRD 362
Query: 438 FPALPDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGTAFCPDHSDWYGD-SYSRN 497
P P +G LDLG SLREICAA+RS+EKQQ++ALL++VG +FCP +W+GD S+ ++
Sbjct: 363 HPP-PPCDGANLDLGLSLREICAAHRSNEKQQMRALLRSVGPSFCPSDVEWFGDESHPKS 422
Query: 498 ADRSVLSKFLQANPADFSTFKLQEMIRLMREKRLPAAFKCYHNFHKVGSISNDNLFYKMV 557
AD+SV++KFLQ+ PAD+ST KLQEM+RLM+EKRLPAAFKCYHNFH+ IS DNLFYK+V
Sbjct: 423 ADKSVITKFLQSQPADYSTSKLQEMVRLMKEKRLPAAFKCYHNFHRAEDISPDNLFYKLV 482
Query: 558 IHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKDKAAELVKSKSNLMEIEGNG 617
+HVHSDS FRRY KEMRH P LWPLYRGFFVDINLFK NK + +KS N E +G G
Sbjct: 483 VHVHSDSGFRRYHKEMRHMPSLWPLYRGFFVDINLFKSNKGRDLMALKSIDNASENDGRG 542
Query: 618 TLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFKEGPAAYKAYYLRQMKLWGTSA 677
+DG AD+DANLMIK+KFLTYKLRTFLIRNGLSILFK+G AAYK YYLRQMK+WGTS
Sbjct: 543 E--KDGLADDDANLMIKMKFLTYKLRTFLIRNGLSILFKDGAAAYKTYYLRQMKIWGTSD 602
Query: 678 GKQRELSKMLDEWAVYLRRKYGNKQLSSATYLSEAEPFLEQYAKRSPQNQALIGSAGNLV 737
GKQ+EL KMLDEWA Y+RRK GN QLSS+TYLSEAEPFLEQYAKRSP+N LIGSAGNLV
Sbjct: 603 GKQKELCKMLDEWAAYIRRKCGNDQLSSSTYLSEAEPFLEQYAKRSPKNHILIGSAGNLV 662
Query: 738 RAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKAEGLIVFFPGIPGCAKSALC 797
R EDFLAIV+ +DEEGDL K+Q P++P + K+AV K EGLIVFFPGIPG AKSALC
Sbjct: 663 RTEDFLAIVDGDLDEEGDLVKKQGVTPATPEPAVKEAVQKDEGLIVFFPGIPGSAKSALC 722
Query: 798 REILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRRKPYSIMLADKNAPNEEVWRQIE 857
+E+LNAPG GDDRPV+TLMGDL+KG+YW KVADERR+KP SIMLADKNAPNE+VWRQIE
Sbjct: 723 KELLNAPGGFGDDRPVHTLMGDLVKGKYWPKVADERRKKPQSIMLADKNAPNEDVWRQIE 782
Query: 858 DMCCSTRASAVPVVPDSEGTDSNPFSLDALAVFMFRVLQRVNHPGNLDKASPNAGYVLLM 917
DMC TRASAVP+V DSEGTD+NP+SLDALAVFMFRVLQRVNHPG LDK S NAGYVLLM
Sbjct: 783 DMCRRTRASAVPIVADSEGTDTNPYSLDALAVFMFRVLQRVNHPGKLDKESSNAGYVLLM 842
Query: 918 FYHLYEGKSRREFEGELIDRFGSLAKMPLLKSDRNPLPDNLKTILEEGLSLYKLHTSRHG 977
FYHLYEGK+R EFE ELI+RFGSL KMPLLKSDR PLPD +K++LEEG+ L+ LH+ RHG
Sbjct: 843 FYHLYEGKNRNEFESELIERFGSLIKMPLLKSDRTPLPDPVKSVLEEGIDLFNLHSRRHG 902
Query: 978 RVDSTKGSYAKEWAKWEKKLRETLFGNTEYLNAIQVPFEFAVQDVLEQLKKISKGDFKSP 1037
R++STKG+YA EW KWEK+LR+TL N+EYL++IQVPFE V V E+LK I+KGD+K P
Sbjct: 903 RLESTKGTYAAEWTKWEKQLRDTLVANSEYLSSIQVPFESMVHQVREELKTIAKGDYKPP 962
Query: 1038 ITERRKSGAIVFAAVSLPVQEIQNLLGTLGKKNPRVEAFLKEHFKDY--TLKGAHVTLAH 1097
+E+RK G+IVFAA++LP ++ +LL L NP + +FL+ K L+ +HVTLAH
Sbjct: 963 SSEKRKHGSIVFAAINLPATQVHSLLEKLAAANPTMRSFLEGKKKSIQEKLERSHVTLAH 1022
Query: 1098 KRSHGVKGVADYGIFENKEVPVELTALLFSDNMAAFEARLGSIENERVISKNEWPHVTLW 1157
KRSHGV VA Y N+EVPVELT L+++D MAA A +GS++ E V+SKNEWPHVTLW
Sbjct: 1023 KRSHGVATVASYSQHLNREVPVELTELIYNDKMAALTAHVGSVDGETVVSKNEWPHVTLW 1082
Query: 1158 TREGVAAKEANALPQLVSEGKATLVEINPPIIISGMVRFF 1195
T EGV AKEAN LPQL EGKA+ + I+PP+ ISG + FF
Sbjct: 1083 TAEGVTAKEANTLPQLYLEGKASRLVIDPPVSISGPLEFF 1104
BLAST of Clc03G02590 vs. ExPASy TrEMBL
Match:
A0A1S3CK49 (uncharacterized protein LOC103501711 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103501711 PE=4 SV=1)
HSP 1 Score: 2242.2 bits (5809), Expect = 0.0e+00
Identity = 1124/1195 (94.06%), Positives = 1152/1195 (96.40%), Query Frame = 0
Query: 1 MSASQRIFCAITLPHPRLYSSW-AFPFICHPLSHNILPRSLTLAPLTSSPFSLSPDSRFI 60
MSA QRIF A LPHP SS+ FPFICHPLSH ILPRSLTLAPLTSSPF LS DSRF+
Sbjct: 1 MSALQRIFYAKILPHPPFSSSYKVFPFICHPLSHFILPRSLTLAPLTSSPFPLSCDSRFV 60
Query: 61 MPYNQRRGGHREQKWKEKAKVDRNSTESEAAADVVTNALGKLRVTESDQPHVLTSSAQFG 120
MPYNQRRGG EQKWKEKAKVD++ TESEAA +VVTNALGKLRVTESDQ HVLTSSAQFG
Sbjct: 61 MPYNQRRGGRGEQKWKEKAKVDKSPTESEAAVEVVTNALGKLRVTESDQSHVLTSSAQFG 120
Query: 121 NAQLTNQATPGLAHRAIWKPKAYGTTSGAAVVEGEKEPTNGTSTENKGSKAELAAQNGAV 180
NAQLTNQA PGLAHRAIWKPKAYGTTSGAAV+EGEK TNGTSTENKGS A LA Q GAV
Sbjct: 121 NAQLTNQAIPGLAHRAIWKPKAYGTTSGAAVIEGEKASTNGTSTENKGSNAGLAVQGGAV 180
Query: 181 SLSQLFKGNQIEKFTVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEV 240
LSQLFK NQIEKF VDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEV
Sbjct: 181 GLSQLFKSNQIEKFIVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEV 240
Query: 241 SLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGAGAAKKQAEFNDFLE 300
SLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGA AAKKQAEFNDFL+
Sbjct: 241 SLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGAEAAKKQAEFNDFLQ 300
Query: 301 SNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRKWRLPTN 360
SNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCR WRLPTN
Sbjct: 301 SNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRNWRLPTN 360
Query: 361 HVWLFSSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLV 420
HVWLFSSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLV
Sbjct: 361 HVWLFSSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLV 420
Query: 421 ARMVSHESSKHMEKVLEEFPALPDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGT 480
ARMVSHESSKHM+KVLEEFPA+PDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGT
Sbjct: 421 ARMVSHESSKHMQKVLEEFPAVPDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGT 480
Query: 481 AFCPDHSDWYGDSYSRNADRSVLSKFLQANPADFSTFKLQEMIRLMREKRLPAAFKCYHN 540
AFCPDHSDWYGDS+SRNADRSVLSKFLQANPADFST KLQEMIRLMRE+RLPAAFKCYHN
Sbjct: 481 AFCPDHSDWYGDSHSRNADRSVLSKFLQANPADFSTSKLQEMIRLMRERRLPAAFKCYHN 540
Query: 541 FHKVGSISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKDKA 600
FHKV SISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKDKA
Sbjct: 541 FHKVASISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKDKA 600
Query: 601 AELVKSKSNLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFKEGPA 660
A LVKSKSNLM+ EGNGTLGRDGFADED+NLMIKLKFLTYKLRTFLIRNGLSILFKEGP
Sbjct: 601 AGLVKSKSNLMDTEGNGTLGRDGFADEDSNLMIKLKFLTYKLRTFLIRNGLSILFKEGPV 660
Query: 661 AYKAYYLRQMKLWGTSAGKQRELSKMLDEWAVYLRRKYGNKQLSSATYLSEAEPFLEQYA 720
AYKAYYLRQMKLWGTSAGKQRELSKMLDEWAVY+RRKYGNKQLSSATYLSEAEPFLEQYA
Sbjct: 661 AYKAYYLRQMKLWGTSAGKQRELSKMLDEWAVYIRRKYGNKQLSSATYLSEAEPFLEQYA 720
Query: 721 KRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKAEG 780
KRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKAEG
Sbjct: 721 KRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKAEG 780
Query: 781 LIVFFPGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRRKPYSI 840
LIVFFPGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERR+KPYSI
Sbjct: 781 LIVFFPGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRKKPYSI 840
Query: 841 MLADKNAPNEEVWRQIEDMCCSTRASAVPVVPDSEGTDSNPFSLDALAVFMFRVLQRVNH 900
MLADKNAPNEEVWRQIEDMC STRASAVPV+PDSEGTDSNPFSLDALAVFMFRVLQRVNH
Sbjct: 841 MLADKNAPNEEVWRQIEDMCRSTRASAVPVIPDSEGTDSNPFSLDALAVFMFRVLQRVNH 900
Query: 901 PGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSLAKMPLLKSDRNPLPDNLKT 960
PGNLDKASPNAGYVLLMFYHLY+GKSRREFEGELIDRFGSL KMPLLK DRNPLPD+LK+
Sbjct: 901 PGNLDKASPNAGYVLLMFYHLYDGKSRREFEGELIDRFGSLVKMPLLKPDRNPLPDDLKS 960
Query: 961 ILEEGLSLYKLHTSRHGRVDSTKGSYAKEWAKWEKKLRETLFGNTEYLNAIQVPFEFAVQ 1020
ILEEG+SLYKLHTSRHGRVDSTKGSYAKEWAKWEK+LRETLF NTEYLNAIQVPFE AVQ
Sbjct: 961 ILEEGISLYKLHTSRHGRVDSTKGSYAKEWAKWEKQLRETLFSNTEYLNAIQVPFESAVQ 1020
Query: 1021 DVLEQLKKISKGDFKSPITERRKSGAIVFAAVSLPVQEIQNLLGTLGKKNPRVEAFLKEH 1080
DVLEQLKKIS+GD+KSPITERRKSGAIVFAAVSLPVQEIQN+LGTLGKKN R+EAFLKEH
Sbjct: 1021 DVLEQLKKISEGDYKSPITERRKSGAIVFAAVSLPVQEIQNVLGTLGKKNSRIEAFLKEH 1080
Query: 1081 FKDYTLKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSDNMAAFEARLGSIEN 1140
+KDY LKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSD MA FEARLGSIEN
Sbjct: 1081 YKDYKLKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSDKMAGFEARLGSIEN 1140
Query: 1141 ERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGMVRFF 1195
ERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISG+V+FF
Sbjct: 1141 ERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGIVKFF 1195
BLAST of Clc03G02590 vs. ExPASy TrEMBL
Match:
A0A1S3CL84 (uncharacterized protein LOC103501711 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103501711 PE=4 SV=1)
HSP 1 Score: 2149.0 bits (5567), Expect = 0.0e+00
Identity = 1073/1130 (94.96%), Positives = 1099/1130 (97.26%), Query Frame = 0
Query: 65 RRGGHREQKWKEKAKVDRNSTESEAAADVVTNALGKLRVTESDQPHVLTSSAQFGNAQLT 124
RRGG EQKWKEKAKVD++ TESEAA +VVTNALGKLRVTESDQ HVLTSSAQFGNAQLT
Sbjct: 11 RRGGRGEQKWKEKAKVDKSPTESEAAVEVVTNALGKLRVTESDQSHVLTSSAQFGNAQLT 70
Query: 125 NQATPGLAHRAIWKPKAYGTTSGAAVVEGEKEPTNGTSTENKGSKAELAAQNGAVSLSQL 184
NQA PGLAHRAIWKPKAYGTTSGAAV+EGEK TNGTSTENKGS A LA Q GAV LSQL
Sbjct: 71 NQAIPGLAHRAIWKPKAYGTTSGAAVIEGEKASTNGTSTENKGSNAGLAVQGGAVGLSQL 130
Query: 185 FKGNQIEKFTVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEVSLKHS 244
FK NQIEKF VDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEVSLKHS
Sbjct: 131 FKSNQIEKFIVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEVSLKHS 190
Query: 245 GSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGAGAAKKQAEFNDFLESNRMC 304
GSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGA AAKKQAEFNDFL+SNRMC
Sbjct: 191 GSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGAEAAKKQAEFNDFLQSNRMC 250
Query: 305 ISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRKWRLPTNHVWLF 364
ISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCR WRLPTNHVWLF
Sbjct: 251 ISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRNWRLPTNHVWLF 310
Query: 365 SSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLVARMVS 424
SSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLVARMVS
Sbjct: 311 SSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLVARMVS 370
Query: 425 HESSKHMEKVLEEFPALPDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGTAFCPD 484
HESSKHM+KVLEEFPA+PDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGTAFCPD
Sbjct: 371 HESSKHMQKVLEEFPAVPDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGTAFCPD 430
Query: 485 HSDWYGDSYSRNADRSVLSKFLQANPADFSTFKLQEMIRLMREKRLPAAFKCYHNFHKVG 544
HSDWYGDS+SRNADRSVLSKFLQANPADFST KLQEMIRLMRE+RLPAAFKCYHNFHKV
Sbjct: 431 HSDWYGDSHSRNADRSVLSKFLQANPADFSTSKLQEMIRLMRERRLPAAFKCYHNFHKVA 490
Query: 545 SISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKDKAAELVK 604
SISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKDKAA LVK
Sbjct: 491 SISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKDKAAGLVK 550
Query: 605 SKSNLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFKEGPAAYKAY 664
SKSNLM+ EGNGTLGRDGFADED+NLMIKLKFLTYKLRTFLIRNGLSILFKEGP AYKAY
Sbjct: 551 SKSNLMDTEGNGTLGRDGFADEDSNLMIKLKFLTYKLRTFLIRNGLSILFKEGPVAYKAY 610
Query: 665 YLRQMKLWGTSAGKQRELSKMLDEWAVYLRRKYGNKQLSSATYLSEAEPFLEQYAKRSPQ 724
YLRQMKLWGTSAGKQRELSKMLDEWAVY+RRKYGNKQLSSATYLSEAEPFLEQYAKRSPQ
Sbjct: 611 YLRQMKLWGTSAGKQRELSKMLDEWAVYIRRKYGNKQLSSATYLSEAEPFLEQYAKRSPQ 670
Query: 725 NQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKAEGLIVFF 784
NQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKAEGLIVFF
Sbjct: 671 NQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKAEGLIVFF 730
Query: 785 PGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRRKPYSIMLADK 844
PGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERR+KPYSIMLADK
Sbjct: 731 PGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRKKPYSIMLADK 790
Query: 845 NAPNEEVWRQIEDMCCSTRASAVPVVPDSEGTDSNPFSLDALAVFMFRVLQRVNHPGNLD 904
NAPNEEVWRQIEDMC STRASAVPV+PDSEGTDSNPFSLDALAVFMFRVLQRVNHPGNLD
Sbjct: 791 NAPNEEVWRQIEDMCRSTRASAVPVIPDSEGTDSNPFSLDALAVFMFRVLQRVNHPGNLD 850
Query: 905 KASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSLAKMPLLKSDRNPLPDNLKTILEEG 964
KASPNAGYVLLMFYHLY+GKSRREFEGELIDRFGSL KMPLLK DRNPLPD+LK+ILEEG
Sbjct: 851 KASPNAGYVLLMFYHLYDGKSRREFEGELIDRFGSLVKMPLLKPDRNPLPDDLKSILEEG 910
Query: 965 LSLYKLHTSRHGRVDSTKGSYAKEWAKWEKKLRETLFGNTEYLNAIQVPFEFAVQDVLEQ 1024
+SLYKLHTSRHGRVDSTKGSYAKEWAKWEK+LRETLF NTEYLNAIQVPFE AVQDVLEQ
Sbjct: 911 ISLYKLHTSRHGRVDSTKGSYAKEWAKWEKQLRETLFSNTEYLNAIQVPFESAVQDVLEQ 970
Query: 1025 LKKISKGDFKSPITERRKSGAIVFAAVSLPVQEIQNLLGTLGKKNPRVEAFLKEHFKDYT 1084
LKKIS+GD+KSPITERRKSGAIVFAAVSLPVQEIQN+LGTLGKKN R+EAFLKEH+KDY
Sbjct: 971 LKKISEGDYKSPITERRKSGAIVFAAVSLPVQEIQNVLGTLGKKNSRIEAFLKEHYKDYK 1030
Query: 1085 LKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSDNMAAFEARLGSIENERVIS 1144
LKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSD MA FEARLGSIENERVIS
Sbjct: 1031 LKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSDKMAGFEARLGSIENERVIS 1090
Query: 1145 KNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGMVRFF 1195
KNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISG+V+FF
Sbjct: 1091 KNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGIVKFF 1140
BLAST of Clc03G02590 vs. ExPASy TrEMBL
Match:
A0A6J1HM92 (tRNA ligase 1 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111464863 PE=4 SV=1)
HSP 1 Score: 2131.3 bits (5521), Expect = 0.0e+00
Identity = 1071/1197 (89.47%), Positives = 1124/1197 (93.90%), Query Frame = 0
Query: 1 MSASQRIFCAITLP---HPRLYSSWAFPFICHPLSHNILPRSLTLAPLTSSPFSLSPDSR 60
MSA+ RIFCAITLP P L+S AFPF+ LSH IL SLTL P + PF++ DSR
Sbjct: 1 MSATYRIFCAITLPLSSSPALHSR-AFPFVSCSLSHFILHPSLTL-PASVFPFTVCRDSR 60
Query: 61 FIMPYNQRRGGHREQKWKEKAKVDRNSTESEAAADVVTNALGKLRVTESDQPHVLTSSAQ 120
F MPYNQRRGG REQKWKEKAKV+ STESE A++VVTNAL LRVTES+QPH+ +S Q
Sbjct: 61 FTMPYNQRRGGRREQKWKEKAKVEGISTESETASEVVTNALSNLRVTESNQPHIPITSVQ 120
Query: 121 FGNAQLTNQATPGLAHRAIWKPKAYGTTSGAAVVEGEKEPTNGTSTENKGSKAELAAQNG 180
FGNAQ TN ATPGL HRAIWKPKAYGTTSGAAVVEGEK P GTS ENKGS AE+AA +
Sbjct: 121 FGNAQPTNLATPGLGHRAIWKPKAYGTTSGAAVVEGEKAPAVGTSIENKGSNAEIAANSS 180
Query: 181 AVSLSQLFKGNQIEKFTVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATL 240
A++LSQL KGNQIE+FTVDNS YTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATL
Sbjct: 181 AIALSQLLKGNQIEQFTVDNSAYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATL 240
Query: 241 EVSLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGAGAAKKQAEFNDF 300
EVSLKHSGSLFMYAGH+GGAYAKNSFGNIYTAVGVFVLGRMFREAWG+ A KKQAEFNDF
Sbjct: 241 EVSLKHSGSLFMYAGHQGGAYAKNSFGNIYTAVGVFVLGRMFREAWGSVAPKKQAEFNDF 300
Query: 301 LESNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRKWRLP 360
LESNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELG GKPKFYST+EIIAFCRKWRLP
Sbjct: 301 LESNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGNGKPKFYSTSEIIAFCRKWRLP 360
Query: 361 TNHVWLFSSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEG 420
TNHVWLFSSRKSVTSFFA+FDALCEEGTAT+VCKALDEVAEISVPGSKDHIKVQGEILEG
Sbjct: 361 TNHVWLFSSRKSVTSFFASFDALCEEGTATTVCKALDEVAEISVPGSKDHIKVQGEILEG 420
Query: 421 LVARMVSHESSKHMEKVLEEFPALPDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNV 480
LVARMVSHESSKHMEKVLEEFPALP NEGGGLDLGPSLREICAANRSDEKQQIKALLQNV
Sbjct: 421 LVARMVSHESSKHMEKVLEEFPALPYNEGGGLDLGPSLREICAANRSDEKQQIKALLQNV 480
Query: 481 GTAFCPDHSDWYGDSYSRNADRSVLSKFLQANPADFSTFKLQEMIRLMREKRLPAAFKCY 540
G+AFCPDHSDWYGDS+SRNADRSV+SKFLQA PADFST KLQEM+RLMRE+RLPAAFKCY
Sbjct: 481 GSAFCPDHSDWYGDSHSRNADRSVVSKFLQAKPADFSTSKLQEMVRLMRERRLPAAFKCY 540
Query: 541 HNFHKVGSISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKD 600
HNFHK+GSISNDNLFYKMVIHV SDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENK+
Sbjct: 541 HNFHKIGSISNDNLFYKMVIHVQSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKE 600
Query: 601 KAAELVKSKSNLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFKEG 660
K AE+VKSK+NLME EGNGT+GRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFKEG
Sbjct: 601 KTAEIVKSKNNLMETEGNGTVGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFKEG 660
Query: 661 PAAYKAYYLRQMKLWGTSAGKQRELSKMLDEWAVYLRRKYGNKQLSSATYLSEAEPFLEQ 720
AAYKAYYLRQMKLWGTS GKQRELSKMLDEWAVYLRRKYGNKQLSS+ YLSEAEPFLEQ
Sbjct: 661 SAAYKAYYLRQMKLWGTSFGKQRELSKMLDEWAVYLRRKYGNKQLSSSIYLSEAEPFLEQ 720
Query: 721 YAKRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKA 780
YAKRSPQNQALIGSAGNLVRAEDFLA+VEEGMDEEGDLQKE +AAPSSPMLS KD VPKA
Sbjct: 721 YAKRSPQNQALIGSAGNLVRAEDFLAVVEEGMDEEGDLQKE-DAAPSSPMLSRKDVVPKA 780
Query: 781 EGLIVFFPGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRRKPY 840
EGLIVFFPGIPGCAKSALCREILNAPG LGDDRPVNTLMGDLIKGRYWQKVADERRRKPY
Sbjct: 781 EGLIVFFPGIPGCAKSALCREILNAPGGLGDDRPVNTLMGDLIKGRYWQKVADERRRKPY 840
Query: 841 SIMLADKNAPNEEVWRQIEDMCCSTRASAVPVVPDSEGTDSNPFSLDALAVFMFRVLQRV 900
SIMLADKNAPNEEVWRQIEDMC STRASAVPV+PDSEGTDSNPFSLDALAVFMFRVLQRV
Sbjct: 841 SIMLADKNAPNEEVWRQIEDMCHSTRASAVPVIPDSEGTDSNPFSLDALAVFMFRVLQRV 900
Query: 901 NHPGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSLAKMPLLKSDRNPLPDNL 960
NHPGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSL K+PLLKSDR+PLPDNL
Sbjct: 901 NHPGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSLVKIPLLKSDRSPLPDNL 960
Query: 961 KTILEEGLSLYKLHTSRHGRVDSTKGSYAKEWAKWEKKLRETLFGNTEYLNAIQVPFEFA 1020
KTILEEGLSLYKLHTSRHGR DSTKGSYAKEWAKWEK+LRETLFGN EYLNAIQVPFEFA
Sbjct: 961 KTILEEGLSLYKLHTSRHGRADSTKGSYAKEWAKWEKQLRETLFGNAEYLNAIQVPFEFA 1020
Query: 1021 VQDVLEQLKKISKGDFKSPITERRKSGAIVFAAVSLPVQEIQNLLGTLGKKNPRVEAFLK 1080
VQ+VLEQLKKISKGD+KSPITERRKS IV+AAVSLPVQ+IQ+ L TLG KNP+VEAF+K
Sbjct: 1021 VQNVLEQLKKISKGDYKSPITERRKSATIVYAAVSLPVQDIQDALDTLGNKNPQVEAFIK 1080
Query: 1081 EHFKDYTLKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSDNMAAFEARLGSI 1140
E +KDYTLK AHVTLAHKRSHG+K VADYGIFENKEVPVELTALLFSD MAAFEAR+GSI
Sbjct: 1081 EGYKDYTLKSAHVTLAHKRSHGIKAVADYGIFENKEVPVELTALLFSDKMAAFEARVGSI 1140
Query: 1141 ENERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGMVRFF 1195
E+ERVISKNEWPHVTLWTREG+AAKEAN LPQLVSEGKATLVE+NPPIIISG V+FF
Sbjct: 1141 EDERVISKNEWPHVTLWTREGIAAKEANTLPQLVSEGKATLVELNPPIIISGKVQFF 1194
BLAST of Clc03G02590 vs. ExPASy TrEMBL
Match:
A0A6J1DUP6 (tRNA ligase 1 OS=Momordica charantia OX=3673 GN=LOC111024537 PE=4 SV=1)
HSP 1 Score: 2127.1 bits (5510), Expect = 0.0e+00
Identity = 1073/1201 (89.34%), Positives = 1113/1201 (92.67%), Query Frame = 0
Query: 1 MSASQRIFCAITLPHP------RLYSSWAFPFICHPLSHNILPRSLTLAPLTSSPFSLSP 60
MSAS RIFCAITLPHP L++S AF SH I PRSL L PL SSPF LSP
Sbjct: 1 MSASHRIFCAITLPHPPRFSPSSLFNSRAF----LSTSHFIFPRSLALPPLISSPFHLSP 60
Query: 61 DSRFIMPYNQRRGGHREQKWKEKAKVDRNSTESEAAADVVTNALGKLRVTESDQPHVLTS 120
SR IMPYNQR G REQKWKEKAK+DR STESEAAA+VVTNALGKLRV+ES QPHV S
Sbjct: 61 HSRSIMPYNQRSDGRREQKWKEKAKLDRTSTESEAAAEVVTNALGKLRVSESGQPHVPIS 120
Query: 121 SAQFGNAQLTNQATPGLAHRAIWKPKAYGTTS-GAAVVEGEKEPTNGTSTENKGSKAELA 180
S +FGNAQLTNQ GL +R IWKPKAYGTTS GAAVVE EK P GTS ENKG+ A LA
Sbjct: 121 SREFGNAQLTNQVPSGLGNRGIWKPKAYGTTSGGAAVVEAEKAPAVGTSIENKGNTAGLA 180
Query: 181 AQNGAVSLSQLFKGNQIEKFTVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKG 240
AQNG V LSQLFKGNQIE FTVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKG
Sbjct: 181 AQNGTVGLSQLFKGNQIENFTVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKG 240
Query: 241 LATLEVSLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGAGAAKKQAE 300
LATLEVSLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWG+ AAKKQAE
Sbjct: 241 LATLEVSLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGSKAAKKQAE 300
Query: 301 FNDFLESNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRK 360
FN+FLESNRMCISMELVTAVLGDHGQRPREDYVVVTAVT+LG GKPKFYSTAEII FCR+
Sbjct: 301 FNNFLESNRMCISMELVTAVLGDHGQRPREDYVVVTAVTDLGNGKPKFYSTAEIIVFCRE 360
Query: 361 WRLPTNHVWLFSSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGE 420
WRLPTNHVWLFSSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGE
Sbjct: 361 WRLPTNHVWLFSSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGE 420
Query: 421 ILEGLVARMVSHESSKHMEKVLEEFPALPDNEGGGLDLGPSLREICAANRSDEKQQIKAL 480
ILEGLVAR+VSHESSKHMEKVLEEFP+LPD EGGGLDLG SLREICAANRSDEKQQIKAL
Sbjct: 421 ILEGLVARIVSHESSKHMEKVLEEFPSLPDEEGGGLDLGRSLREICAANRSDEKQQIKAL 480
Query: 481 LQNVGTAFCPDHSDWYGDSYSRNADRSVLSKFLQANPADFSTFKLQEMIRLMREKRLPAA 540
LQNVG++FCPDHSDW GDS+SR ADRSVLSKFLQ +P DFST KLQEMIRLMREKRLPAA
Sbjct: 481 LQNVGSSFCPDHSDWSGDSHSRTADRSVLSKFLQTSPTDFSTSKLQEMIRLMREKRLPAA 540
Query: 541 FKCYHNFHKVGSISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFK 600
FKCYHNFHKVGSISND+LFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFK
Sbjct: 541 FKCYHNFHKVGSISNDDLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFK 600
Query: 601 ENKDKAAELVKSKSNLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSIL 660
NKDKAAE++KSKSNLME+EGNG LGRDG ADEDANLMIKLKFLTYKLRTFLIRNGLSIL
Sbjct: 601 ANKDKAAEIMKSKSNLMEVEGNGILGRDGLADEDANLMIKLKFLTYKLRTFLIRNGLSIL 660
Query: 661 FKEGPAAYKAYYLRQMKLWGTSAGKQRELSKMLDEWAVYLRRKYGNKQLSSATYLSEAEP 720
FKEGPAAYKAYYLRQMKLWGTS GKQRELSKMLDEWAVYLRRKYGN+QLSSATYLSEAEP
Sbjct: 661 FKEGPAAYKAYYLRQMKLWGTSVGKQRELSKMLDEWAVYLRRKYGNRQLSSATYLSEAEP 720
Query: 721 FLEQYAKRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDA 780
FLEQYAKRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQE APSSPML GKD
Sbjct: 721 FLEQYAKRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEVAPSSPMLPGKDT 780
Query: 781 VPKAEGLIVFFPGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERR 840
V KAEGLIVFFPGIPGCAKSALCREILNAPG LGDDRPV +LMGDLIKGRYWQKV DERR
Sbjct: 781 VSKAEGLIVFFPGIPGCAKSALCREILNAPGGLGDDRPVKSLMGDLIKGRYWQKVVDERR 840
Query: 841 RKPYSIMLADKNAPNEEVWRQIEDMCCSTRASAVPVVPDSEGTDSNPFSLDALAVFMFRV 900
RKPYSIMLADKNAPNEEVWRQIEDMC STRASAVPVVPDSEGTD NPFSLDALAVFMFRV
Sbjct: 841 RKPYSIMLADKNAPNEEVWRQIEDMCHSTRASAVPVVPDSEGTDGNPFSLDALAVFMFRV 900
Query: 901 LQRVNHPGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSLAKMPLLKSDRNPL 960
LQRVNHPGNLDKASPNAGYVLLMFYHLYEGKSRREFE ELIDRFGSL KMPLLK DR+PL
Sbjct: 901 LQRVNHPGNLDKASPNAGYVLLMFYHLYEGKSRREFEDELIDRFGSLVKMPLLKCDRSPL 960
Query: 961 PDNLKTILEEGLSLYKLHTSRHGRVDSTKGSYAKEWAKWEKKLRETLFGNTEYLNAIQVP 1020
PDNLKTILEEGLSLYKLHTSRHGR DSTKGSYAKEWAKWEK+LRETLFGNTEYLN+IQVP
Sbjct: 961 PDNLKTILEEGLSLYKLHTSRHGRADSTKGSYAKEWAKWEKQLRETLFGNTEYLNSIQVP 1020
Query: 1021 FEFAVQDVLEQLKKISKGDFKSPITERRKSGAIVFAAVSLPVQEIQNLLGTLGKKNPRVE 1080
FE AVQDVLEQLKKI+KGD+K+PI+ERRKS IVFAAVSLPVQEIQNLL TLGKKNP VE
Sbjct: 1021 FEVAVQDVLEQLKKIAKGDYKTPISERRKSATIVFAAVSLPVQEIQNLLDTLGKKNPHVE 1080
Query: 1081 AFLKEHFKDYTLKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSDNMAAFEAR 1140
+FLK+ +KDYTLK AHVTLAHKRSHGVK VADYGIF+NKEVPVELTALLFSD MAAFEA
Sbjct: 1081 SFLKQDYKDYTLKAAHVTLAHKRSHGVKAVADYGIFQNKEVPVELTALLFSDKMAAFEAH 1140
Query: 1141 LGSIENERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGMVRF 1195
LGS+E+ERV+SKNEWPHVTLWTREGVAAKEAN LPQLVSEGKATLVE+NPP IISG V+F
Sbjct: 1141 LGSVEDERVVSKNEWPHVTLWTREGVAAKEANTLPQLVSEGKATLVELNPPTIISGTVKF 1197
BLAST of Clc03G02590 vs. ExPASy TrEMBL
Match:
A0A6J1I3R5 (tRNA ligase 1 OS=Cucurbita maxima OX=3661 GN=LOC111469400 PE=4 SV=1)
HSP 1 Score: 2110.1 bits (5466), Expect = 0.0e+00
Identity = 1061/1200 (88.42%), Positives = 1116/1200 (93.00%), Query Frame = 0
Query: 1 MSASQRIFCAITLPHPRLYSSWA------FPFICHPLSHNILPRSLTLAPLTSSPFSLSP 60
MSA RIFCAITLP RL S A FPFI + SH IL SLT+ S P ++S
Sbjct: 1 MSAPHRIFCAITLPRHRLSYSSAFNYRVFFPFIPYSFSHRILSPSLTITDSISFPSTVSS 60
Query: 61 DSRFIMPYNQRRGGHREQKWKEKAKVDRNSTESEAAADVVTNALGKLRVTESDQPHVLTS 120
D RF+MPYNQRRGG REQKWKEKAKV+ STESEAA+ VVTNAL LRVTES+QPH+ +
Sbjct: 61 DFRFMMPYNQRRGGRREQKWKEKAKVEGISTESEAASQVVTNALSNLRVTESNQPHIPIT 120
Query: 121 SAQFGNAQLTNQATPGLAHRAIWKPKAYGTTSGAAVVEGEKEPTNGTSTENKGSKAELAA 180
S QFGNAQ TN ATPGL HRAIWKPKAYGTT GAAVVEGEK GTS ENKGS AE+AA
Sbjct: 121 SVQFGNAQPTNLATPGLGHRAIWKPKAYGTTIGAAVVEGEKASAVGTSIENKGSNAEIAA 180
Query: 181 QNGAVSLSQLFKGNQIEKFTVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGL 240
+ A++L+QL KGNQIEKFTVDNS YTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGL
Sbjct: 181 NSSAIALNQLLKGNQIEKFTVDNSAYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGL 240
Query: 241 ATLEVSLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGAGAAKKQAEF 300
ATLEVSLKHSGSLFMYAGH+GGAYAKNSFGNIYTAVGVFVLGRMF+EAWG+ A KKQAEF
Sbjct: 241 ATLEVSLKHSGSLFMYAGHQGGAYAKNSFGNIYTAVGVFVLGRMFQEAWGSVAPKKQAEF 300
Query: 301 NDFLESNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRKW 360
NDFLESNRMCISMELVTAVLGDHGQRP+EDYVVVTAVTELG GKPKFYST+EIIAFCRKW
Sbjct: 301 NDFLESNRMCISMELVTAVLGDHGQRPQEDYVVVTAVTELGNGKPKFYSTSEIIAFCRKW 360
Query: 361 RLPTNHVWLFSSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEI 420
RLPTNHVWLFSSRKSVTSFFA+FDALCEEGTAT+VCKALDEVAEISVPGSKDHIKVQGEI
Sbjct: 361 RLPTNHVWLFSSRKSVTSFFASFDALCEEGTATTVCKALDEVAEISVPGSKDHIKVQGEI 420
Query: 421 LEGLVARMVSHESSKHMEKVLEEFPALPDNEGGGLDLGPSLREICAANRSDEKQQIKALL 480
LEGLVARMVSHESSKHMEKVLEEFPALP NEGGGLDL PSLREICAANRSDEKQQIKALL
Sbjct: 421 LEGLVARMVSHESSKHMEKVLEEFPALPYNEGGGLDLEPSLREICAANRSDEKQQIKALL 480
Query: 481 QNVGTAFCPDHSDWYGDSYSRNADRSVLSKFLQANPADFSTFKLQEMIRLMREKRLPAAF 540
QNVG+AFCPDHSDWYGDS+SRNADRSV+SKFLQA PADFSTFKLQEM+RLMRE+RLPAAF
Sbjct: 481 QNVGSAFCPDHSDWYGDSHSRNADRSVVSKFLQAKPADFSTFKLQEMVRLMRERRLPAAF 540
Query: 541 KCYHNFHKVGSISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKE 600
KCYHNFHKVGSISNDNLFYKMVIHV SDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKE
Sbjct: 541 KCYHNFHKVGSISNDNLFYKMVIHVQSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKE 600
Query: 601 NKDKAAELVKSKSNLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILF 660
NK+KAAE+VKSK+NLME EGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILF
Sbjct: 601 NKEKAAEIVKSKNNLMETEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILF 660
Query: 661 KEGPAAYKAYYLRQMKLWGTSAGKQRELSKMLDEWAVYLRRKYGNKQLSSATYLSEAEPF 720
KEGPAAYKAYYLRQMKLWGTS GKQRELSKMLDEWAVYLRRKYGNKQLSS+ YLSEAEPF
Sbjct: 661 KEGPAAYKAYYLRQMKLWGTSFGKQRELSKMLDEWAVYLRRKYGNKQLSSSIYLSEAEPF 720
Query: 721 LEQYAKRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAV 780
LEQYAKRSPQNQ LIGSAGNLVRAEDFLA+V+EGMDEEGDLQKE + APSSPMLS KD V
Sbjct: 721 LEQYAKRSPQNQTLIGSAGNLVRAEDFLAVVDEGMDEEGDLQKE-DTAPSSPMLSRKDVV 780
Query: 781 PKAEGLIVFFPGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRR 840
PKAEGLIVFFPGIPGCAKS+LCREILNAPGALGDDRPVNTL GDLIKGRYWQKVADERRR
Sbjct: 781 PKAEGLIVFFPGIPGCAKSSLCREILNAPGALGDDRPVNTLTGDLIKGRYWQKVADERRR 840
Query: 841 KPYSIMLADKNAPNEEVWRQIEDMCCSTRASAVPVVPDSEGTDSNPFSLDALAVFMFRVL 900
KPYSIMLADKNAPNEEVWRQIEDMC ST ASAVPV+PDSEGTDSNPFSLDALAVFMFRVL
Sbjct: 841 KPYSIMLADKNAPNEEVWRQIEDMCHSTGASAVPVIPDSEGTDSNPFSLDALAVFMFRVL 900
Query: 901 QRVNHPGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSLAKMPLLKSDRNPLP 960
QRVNHPGNLDKASPNAGYVLLMFYH YEGKSRREFEGELIDRFGSL K+PLLKSDR+PLP
Sbjct: 901 QRVNHPGNLDKASPNAGYVLLMFYHFYEGKSRREFEGELIDRFGSLVKIPLLKSDRSPLP 960
Query: 961 DNLKTILEEGLSLYKLHTSRHGRVDSTKGSYAKEWAKWEKKLRETLFGNTEYLNAIQVPF 1020
DNLKTILEEGLSLYKLHTSRHG DSTKGSYAKEWA+WEK+LRETLFGN EYLNAIQVPF
Sbjct: 961 DNLKTILEEGLSLYKLHTSRHGWTDSTKGSYAKEWAEWEKQLRETLFGNAEYLNAIQVPF 1020
Query: 1021 EFAVQDVLEQLKKISKGDFKSPITERRKSGAIVFAAVSLPVQEIQNLLGTLGKKNPRVEA 1080
EF+VQ+VLEQLKKISKGD+KSPITE RKS IV+AAVSLPVQEIQN L TLG KNP+VEA
Sbjct: 1021 EFSVQNVLEQLKKISKGDYKSPITE-RKSATIVYAAVSLPVQEIQNALDTLGNKNPQVEA 1080
Query: 1081 FLKEHFKDYTLKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSDNMAAFEARL 1140
F+KE +KDYTLK AHVTLAHKRSHG+K VADYGIFENKEVPVELTALLFSD MAAFEAR+
Sbjct: 1081 FIKEGYKDYTLKSAHVTLAHKRSHGIKAVADYGIFENKEVPVELTALLFSDKMAAFEARV 1140
Query: 1141 GSIENERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGMVRFF 1195
GSIE+ERVISKNEWPHVTLWTREG+AAKEAN+LPQLVSEGKATL+E+NPPIIISG V+FF
Sbjct: 1141 GSIEDERVISKNEWPHVTLWTREGIAAKEANSLPQLVSEGKATLLELNPPIIISGKVQFF 1198
BLAST of Clc03G02590 vs. TAIR 10
Match:
AT1G07910.1 (RNAligase)
HSP 1 Score: 1538.1 bits (3981), Expect = 0.0e+00
Identity = 770/1120 (68.75%), Positives = 904/1120 (80.71%), Query Frame = 0
Query: 78 AKVDRNSTESEAAADVVTNALGKLRVTESDQPHVLTSSAQFGNAQLTNQATPGLAHRAIW 137
A + + + A+ V N G L + ES+ + S N ++ N +W
Sbjct: 3 APFESGDSSATVVAEAVNNQFGGLSLKESNTNAPVLPSQTTSNHRVQN---------LVW 62
Query: 138 KPKAYGTTSGAAVVEGEKEPTNGTSTENKGSKAELAAQNGAVSLSQLFKGNQIEKFTVDN 197
KPK+YGT SG++ E ++ GS + ++LS++F GN +EKF+VD
Sbjct: 63 KPKSYGTVSGSS---SATEVGKTSAVSQIGSSGDTKV---GLNLSKIFGGNLLEKFSVDK 122
Query: 198 STYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEVSLKHSGSLFMYAGHEGGA 257
STY AQIRATFYPKFENEK+DQEIRTRMIEMVSKGLATLEVSLKHSGSLFMYAGH+GGA
Sbjct: 123 STYCHAQIRATFYPKFENEKTDQEIRTRMIEMVSKGLATLEVSLKHSGSLFMYAGHKGGA 182
Query: 258 YAKNSFGNIYTAVGVFVLGRMFREAWGAGAAKKQAEFNDFLESNRMCISMELVTAVLGDH 317
YAKNSFGNIYTAVGVFVL RMFREAWG A KK+AEFNDFLE NRMCISMELVTAVLGDH
Sbjct: 183 YAKNSFGNIYTAVGVFVLSRMFREAWGTKAPKKEAEFNDFLEKNRMCISMELVTAVLGDH 242
Query: 318 GQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRKWRLPTNHVWLFSSRKSVTSFFAAF 377
GQRP +DYVVVTAVTELG GKP+FYST+EII+FCRKWRLPTNHVWLFS+RKSVTSFFAAF
Sbjct: 243 GQRPLDDYVVVTAVTELGNGKPQFYSTSEIISFCRKWRLPTNHVWLFSTRKSVTSFFAAF 302
Query: 378 DALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLVARMVSHESSKHMEKVLEE 437
DALCEEG ATSVC+ALDEVA+ISVP SKDH+KVQGEILEGLVAR+VS +SS+ ME VL +
Sbjct: 303 DALCEEGIATSVCRALDEVADISVPASKDHVKVQGEILEGLVARIVSSQSSRDMENVLRD 362
Query: 438 FPALPDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGTAFCPDHSDWYGD-SYSRN 497
P P +G LDLG SLREICAA+RS+EKQQ++ALL++VG +FCP +W+GD S+ ++
Sbjct: 363 HPP-PPCDGANLDLGLSLREICAAHRSNEKQQMRALLRSVGPSFCPSDVEWFGDESHPKS 422
Query: 498 ADRSVLSKFLQANPADFSTFKLQEMIRLMREKRLPAAFKCYHNFHKVGSISNDNLFYKMV 557
AD+SV++KFLQ+ PAD+ST KLQEM+RLM+EKRLPAAFKCYHNFH+ IS DNLFYK+V
Sbjct: 423 ADKSVITKFLQSQPADYSTSKLQEMVRLMKEKRLPAAFKCYHNFHRAEDISPDNLFYKLV 482
Query: 558 IHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKDKAAELVKSKSNLMEIEGNG 617
+HVHSDS FRRY KEMRH P LWPLYRGFFVDINLFK NK + +KS N E +G G
Sbjct: 483 VHVHSDSGFRRYHKEMRHMPSLWPLYRGFFVDINLFKSNKGRDLMALKSIDNASENDGRG 542
Query: 618 TLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFKEGPAAYKAYYLRQMKLWGTSA 677
+DG AD+DANLMIK+KFLTYKLRTFLIRNGLSILFK+G AAYK YYLRQMK+WGTS
Sbjct: 543 E--KDGLADDDANLMIKMKFLTYKLRTFLIRNGLSILFKDGAAAYKTYYLRQMKIWGTSD 602
Query: 678 GKQRELSKMLDEWAVYLRRKYGNKQLSSATYLSEAEPFLEQYAKRSPQNQALIGSAGNLV 737
GKQ+EL KMLDEWA Y+RRK GN QLSS+TYLSEAEPFLEQYAKRSP+N LIGSAGNLV
Sbjct: 603 GKQKELCKMLDEWAAYIRRKCGNDQLSSSTYLSEAEPFLEQYAKRSPKNHILIGSAGNLV 662
Query: 738 RAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKAEGLIVFFPGIPGCAKSALC 797
R EDFLAIV+ +DEEGDL K+Q P++P + K+AV K EGLIVFFPGIPG AKSALC
Sbjct: 663 RTEDFLAIVDGDLDEEGDLVKKQGVTPATPEPAVKEAVQKDEGLIVFFPGIPGSAKSALC 722
Query: 798 REILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRRKPYSIMLADKNAPNEEVWRQIE 857
+E+LNAPG GDDRPV+TLMGDL+KG+YW KVADERR+KP SIMLADKNAPNE+VWRQIE
Sbjct: 723 KELLNAPGGFGDDRPVHTLMGDLVKGKYWPKVADERRKKPQSIMLADKNAPNEDVWRQIE 782
Query: 858 DMCCSTRASAVPVVPDSEGTDSNPFSLDALAVFMFRVLQRVNHPGNLDKASPNAGYVLLM 917
DMC TRASAVP+V DSEGTD+NP+SLDALAVFMFRVLQRVNHPG LDK S NAGYVLLM
Sbjct: 783 DMCRRTRASAVPIVADSEGTDTNPYSLDALAVFMFRVLQRVNHPGKLDKESSNAGYVLLM 842
Query: 918 FYHLYEGKSRREFEGELIDRFGSLAKMPLLKSDRNPLPDNLKTILEEGLSLYKLHTSRHG 977
FYHLYEGK+R EFE ELI+RFGSL KMPLLKSDR PLPD +K++LEEG+ L+ LH+ RHG
Sbjct: 843 FYHLYEGKNRNEFESELIERFGSLIKMPLLKSDRTPLPDPVKSVLEEGIDLFNLHSRRHG 902
Query: 978 RVDSTKGSYAKEWAKWEKKLRETLFGNTEYLNAIQVPFEFAVQDVLEQLKKISKGDFKSP 1037
R++STKG+YA EW KWEK+LR+TL N+EYL++IQVPFE V V E+LK I+KGD+K P
Sbjct: 903 RLESTKGTYAAEWTKWEKQLRDTLVANSEYLSSIQVPFESMVHQVREELKTIAKGDYKPP 962
Query: 1038 ITERRKSGAIVFAAVSLPVQEIQNLLGTLGKKNPRVEAFLKEHFKDY--TLKGAHVTLAH 1097
+E+RK G+IVFAA++LP ++ +LL L NP + +FL+ K L+ +HVTLAH
Sbjct: 963 SSEKRKHGSIVFAAINLPATQVHSLLEKLAAANPTMRSFLEGKKKSIQEKLERSHVTLAH 1022
Query: 1098 KRSHGVKGVADYGIFENKEVPVELTALLFSDNMAAFEARLGSIENERVISKNEWPHVTLW 1157
KRSHGV VA Y N+EVPVELT L+++D MAA A +GS++ E V+SKNEWPHVTLW
Sbjct: 1023 KRSHGVATVASYSQHLNREVPVELTELIYNDKMAALTAHVGSVDGETVVSKNEWPHVTLW 1082
Query: 1158 TREGVAAKEANALPQLVSEGKATLVEINPPIIISGMVRFF 1195
T EGV AKEAN LPQL EGKA+ + I+PP+ ISG + FF
Sbjct: 1083 TAEGVTAKEANTLPQLYLEGKASRLVIDPPVSISGPLEFF 1104
BLAST of Clc03G02590 vs. TAIR 10
Match:
AT1G07910.2 (RNAligase)
HSP 1 Score: 1538.1 bits (3981), Expect = 0.0e+00
Identity = 770/1120 (68.75%), Positives = 904/1120 (80.71%), Query Frame = 0
Query: 78 AKVDRNSTESEAAADVVTNALGKLRVTESDQPHVLTSSAQFGNAQLTNQATPGLAHRAIW 137
A + + + A+ V N G L + ES+ + S N ++ N +W
Sbjct: 3 APFESGDSSATVVAEAVNNQFGGLSLKESNTNAPVLPSQTTSNHRVQN---------LVW 62
Query: 138 KPKAYGTTSGAAVVEGEKEPTNGTSTENKGSKAELAAQNGAVSLSQLFKGNQIEKFTVDN 197
KPK+YGT SG++ E ++ GS + ++LS++F GN +EKF+VD
Sbjct: 63 KPKSYGTVSGSS---SATEVGKTSAVSQIGSSGDTKV---GLNLSKIFGGNLLEKFSVDK 122
Query: 198 STYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEVSLKHSGSLFMYAGHEGGA 257
STY AQIRATFYPKFENEK+DQEIRTRMIEMVSKGLATLEVSLKHSGSLFMYAGH+GGA
Sbjct: 123 STYCHAQIRATFYPKFENEKTDQEIRTRMIEMVSKGLATLEVSLKHSGSLFMYAGHKGGA 182
Query: 258 YAKNSFGNIYTAVGVFVLGRMFREAWGAGAAKKQAEFNDFLESNRMCISMELVTAVLGDH 317
YAKNSFGNIYTAVGVFVL RMFREAWG A KK+AEFNDFLE NRMCISMELVTAVLGDH
Sbjct: 183 YAKNSFGNIYTAVGVFVLSRMFREAWGTKAPKKEAEFNDFLEKNRMCISMELVTAVLGDH 242
Query: 318 GQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRKWRLPTNHVWLFSSRKSVTSFFAAF 377
GQRP +DYVVVTAVTELG GKP+FYST+EII+FCRKWRLPTNHVWLFS+RKSVTSFFAAF
Sbjct: 243 GQRPLDDYVVVTAVTELGNGKPQFYSTSEIISFCRKWRLPTNHVWLFSTRKSVTSFFAAF 302
Query: 378 DALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLVARMVSHESSKHMEKVLEE 437
DALCEEG ATSVC+ALDEVA+ISVP SKDH+KVQGEILEGLVAR+VS +SS+ ME VL +
Sbjct: 303 DALCEEGIATSVCRALDEVADISVPASKDHVKVQGEILEGLVARIVSSQSSRDMENVLRD 362
Query: 438 FPALPDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGTAFCPDHSDWYGD-SYSRN 497
P P +G LDLG SLREICAA+RS+EKQQ++ALL++VG +FCP +W+GD S+ ++
Sbjct: 363 HPP-PPCDGANLDLGLSLREICAAHRSNEKQQMRALLRSVGPSFCPSDVEWFGDESHPKS 422
Query: 498 ADRSVLSKFLQANPADFSTFKLQEMIRLMREKRLPAAFKCYHNFHKVGSISNDNLFYKMV 557
AD+SV++KFLQ+ PAD+ST KLQEM+RLM+EKRLPAAFKCYHNFH+ IS DNLFYK+V
Sbjct: 423 ADKSVITKFLQSQPADYSTSKLQEMVRLMKEKRLPAAFKCYHNFHRAEDISPDNLFYKLV 482
Query: 558 IHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKDKAAELVKSKSNLMEIEGNG 617
+HVHSDS FRRY KEMRH P LWPLYRGFFVDINLFK NK + +KS N E +G G
Sbjct: 483 VHVHSDSGFRRYHKEMRHMPSLWPLYRGFFVDINLFKSNKGRDLMALKSIDNASENDGRG 542
Query: 618 TLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFKEGPAAYKAYYLRQMKLWGTSA 677
+DG AD+DANLMIK+KFLTYKLRTFLIRNGLSILFK+G AAYK YYLRQMK+WGTS
Sbjct: 543 E--KDGLADDDANLMIKMKFLTYKLRTFLIRNGLSILFKDGAAAYKTYYLRQMKIWGTSD 602
Query: 678 GKQRELSKMLDEWAVYLRRKYGNKQLSSATYLSEAEPFLEQYAKRSPQNQALIGSAGNLV 737
GKQ+EL KMLDEWA Y+RRK GN QLSS+TYLSEAEPFLEQYAKRSP+N LIGSAGNLV
Sbjct: 603 GKQKELCKMLDEWAAYIRRKCGNDQLSSSTYLSEAEPFLEQYAKRSPKNHILIGSAGNLV 662
Query: 738 RAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKAEGLIVFFPGIPGCAKSALC 797
R EDFLAIV+ +DEEGDL K+Q P++P + K+AV K EGLIVFFPGIPG AKSALC
Sbjct: 663 RTEDFLAIVDGDLDEEGDLVKKQGVTPATPEPAVKEAVQKDEGLIVFFPGIPGSAKSALC 722
Query: 798 REILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRRKPYSIMLADKNAPNEEVWRQIE 857
+E+LNAPG GDDRPV+TLMGDL+KG+YW KVADERR+KP SIMLADKNAPNE+VWRQIE
Sbjct: 723 KELLNAPGGFGDDRPVHTLMGDLVKGKYWPKVADERRKKPQSIMLADKNAPNEDVWRQIE 782
Query: 858 DMCCSTRASAVPVVPDSEGTDSNPFSLDALAVFMFRVLQRVNHPGNLDKASPNAGYVLLM 917
DMC TRASAVP+V DSEGTD+NP+SLDALAVFMFRVLQRVNHPG LDK S NAGYVLLM
Sbjct: 783 DMCRRTRASAVPIVADSEGTDTNPYSLDALAVFMFRVLQRVNHPGKLDKESSNAGYVLLM 842
Query: 918 FYHLYEGKSRREFEGELIDRFGSLAKMPLLKSDRNPLPDNLKTILEEGLSLYKLHTSRHG 977
FYHLYEGK+R EFE ELI+RFGSL KMPLLKSDR PLPD +K++LEEG+ L+ LH+ RHG
Sbjct: 843 FYHLYEGKNRNEFESELIERFGSLIKMPLLKSDRTPLPDPVKSVLEEGIDLFNLHSRRHG 902
Query: 978 RVDSTKGSYAKEWAKWEKKLRETLFGNTEYLNAIQVPFEFAVQDVLEQLKKISKGDFKSP 1037
R++STKG+YA EW KWEK+LR+TL N+EYL++IQVPFE V V E+LK I+KGD+K P
Sbjct: 903 RLESTKGTYAAEWTKWEKQLRDTLVANSEYLSSIQVPFESMVHQVREELKTIAKGDYKPP 962
Query: 1038 ITERRKSGAIVFAAVSLPVQEIQNLLGTLGKKNPRVEAFLKEHFKDY--TLKGAHVTLAH 1097
+E+RK G+IVFAA++LP ++ +LL L NP + +FL+ K L+ +HVTLAH
Sbjct: 963 SSEKRKHGSIVFAAINLPATQVHSLLEKLAAANPTMRSFLEGKKKSIQEKLERSHVTLAH 1022
Query: 1098 KRSHGVKGVADYGIFENKEVPVELTALLFSDNMAAFEARLGSIENERVISKNEWPHVTLW 1157
KRSHGV VA Y N+EVPVELT L+++D MAA A +GS++ E V+SKNEWPHVTLW
Sbjct: 1023 KRSHGVATVASYSQHLNREVPVELTELIYNDKMAALTAHVGSVDGETVVSKNEWPHVTLW 1082
Query: 1158 TREGVAAKEANALPQLVSEGKATLVEINPPIIISGMVRFF 1195
T EGV AKEAN LPQL EGKA+ + I+PP+ ISG + FF
Sbjct: 1083 TAEGVTAKEANTLPQLYLEGKASRLVIDPPVSISGPLEFF 1104
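The percentages in each HSP summary line follow directly from the counts: identities (or positives) divided by the alignment length. A minimal Python sketch of that arithmetic, assuming the summary line has the shape shown above; the function name `parse_hsp_stats` and the returned field names are illustrative and not part of any BLAST tool:

```python
import re

# Matches the per-HSP summary line. "Pos\w*" is deliberately loose so that
# spelling variants of "Positives" in exported pages still parse.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Pos\w* = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return the counts and the recomputed percentages for one HSP line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    ident, ident_len, pos, pos_len = map(int, (ident, ident_len, pos, pos_len))
    return {
        "identities": ident,
        "positives": pos,
        "alignment_length": ident_len,
        # Recomputed from the counts, rounded the same way the page reports them.
        "identity_pct": round(100 * ident / ident_len, 2),
        "positive_pct": round(100 * pos / pos_len, 2),
        # Values as printed in the report, kept for cross-checking.
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 770/1120 (68.75%), Positives = 904/1120 (80.71%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positive_pct"])
```

For the AT1G07910 hit this recomputes 770/1120 and 904/1120, which agree with the reported 68.75% and 80.71%.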
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038894223.1 | 0.0e+00 | 96.00 | tRNA ligase 1 isoform X1 [Benincasa hispida]
XP_004147268.2 | 0.0e+00 | 94.39 | tRNA ligase 1 isoform X1 [Cucumis sativus] >KGN64758.2 hypothetical protein Csa_...
XP_008463605.1 | 0.0e+00 | 94.06 | PREDICTED: uncharacterized protein LOC103501711 isoform X1 [Cucumis melo]
XP_008463612.1 | 0.0e+00 | 94.96 | PREDICTED: uncharacterized protein LOC103501711 isoform X2 [Cucumis melo]
KAG7019255.1 | 0.0e+00 | 89.47 | tRNA ligase 1 [Cucurbita argyrosperma subsp. argyrosperma]
Match Name | E-value | Identity (%) | Description
Q0WL81 | 0.0e+00 | 68.75 | tRNA ligase 1 OS=Arabidopsis thaliana OX=3702 GN=RNL PE=1 SV=1
Match Name | E-value | Identity (%) | Description
A0A1S3CK49 | 0.0e+00 | 94.06 | uncharacterized protein LOC103501711 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A1S3CL84 | 0.0e+00 | 94.96 | uncharacterized protein LOC103501711 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...
A0A6J1HM92 | 0.0e+00 | 89.47 | tRNA ligase 1 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111464863 PE=4 SV=1
A0A6J1DUP6 | 0.0e+00 | 89.34 | tRNA ligase 1 OS=Momordica charantia OX=3673 GN=LOC111024537 PE=4 SV=1
A0A6J1I3R5 | 0.0e+00 | 88.42 | tRNA ligase 1 OS=Cucurbita maxima OX=3661 GN=LOC111469400 PE=4 SV=1
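The summary rows above are pipe-delimited, so they can be loaded into records with a few lines of Python. This is only a sketch under the assumption that the first four cells are always match name, E-value, identity and description, and that no description contains a `|`; `parse_summary_row` is a hypothetical helper, not part of the database's tooling:

```python
def parse_summary_row(row):
    """Split one pipe-delimited summary row into its four data fields.

    Any cells after the description (for example leftover link columns)
    are ignored.
    """
    fields = [field.strip() for field in row.split("|")]
    if len(fields) < 4:
        raise ValueError("expected at least four pipe-delimited cells")
    name, evalue, identity, description = fields[:4]
    return {
        "match": name,
        "evalue": float(evalue),      # "0.0e+00" parses to 0.0
        "identity": float(identity),  # percent identity, as printed
        "description": description,
    }


row = "Q0WL81 | 0.0e+00 | 68.75 | tRNA ligase 1 OS=Arabidopsis thaliana OX=3702 GN=RNL PE=1 SV=1"
hit = parse_summary_row(row)
print(hit["match"], hit["identity"])
```

Header rows ("Match Name | E-value | ...") are not numeric and would raise a `ValueError` from `float`, so a caller would skip them before parsing.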