Clc03G02590 (gene) Watermelon (cordophanus) v2

Overview
Name: Clc03G02590
Type: gene
Organism: Citrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
Description: tRNA ligase 1
Location: ClcChr03: 2540044 .. 2563066 (-)
RNA-Seq Expression: Clc03G02590
Synteny: Clc03G02590
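The Location field gives the gene's start and end on ClcChr03, so the genomic span can be derived from it. A minimal sketch, assuming the coordinates are 1-based and inclusive (GFF3-style, which this record does not state explicitly); the helper name `span_length` is hypothetical:

```python
def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Location field; the strand is "-".
gene_start, gene_end = 2540044, 2563066
print(span_length(gene_start, gene_end))  # 23023
```

Under that assumption the gene spans 23,023 bp on the minus strand, introns included.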
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CCAATCTCCACTACACCCCTATTATTTTGACATTGTGTAGGAATGTCAGAATGTCAACATGGACGATATTTGGATAATTAAAGATCAAATTAGTTTTCAAGTGTGAAATTTTCTTAAGGTGATTTATGCAAAATTTCCCTCTTTTATAAAAGCCTCAGGTCCCCAAACTTCATTACAATGACACACACCAAGTTTTGGCGCAAGCCGTGTGGAAGGAGCTCTCACTTCTCTGCGGTCTCCGTCTTTCTCTCTCTATAACTATATTCATATAGCGCGTACACTTGAATGTCGGCGTCGCAGAGAATTTTCTGCGCTATAACTCTTCCTCACCCTCGTTTGTATTCATCTTGGGCCTTCCCTTTCATTTGCCACCCTCTATCCCACAATATCTTACCACGCTCTCTCACTCTCGCACCTTTAACTTCCTCCCCCTTTTCCCTTTCTCCTGATTCTCGATTCATCATGCCTTACAATCAGGTGCCCATTTTTTTTTTTTTGTTTGATGTTTTCGGGTTGTGAATTTATGTGTCTGTGAAGAGATGTCGATTTTTGTATCTGGGTATCTCAATTTGTTTTCAATAATGCTTTTGATGTTCATAACTGTTTCGTTTTGAATGCGGCAATCTGTTTTCTATTGATATTGAGGATGGACGGGCTTTGTATTTGTTATTTGTTATTTGTTATCTCTTTTCGACGTTTGATGATTAATGTTTTTACAATGCTAGCATGTAAAAAGGATGCCGATATGGTAATTGGTGTTTTTTTTATACGCTTATCATATATGAAACACTTCAATTCTATTGCTTATGTCTCAAGGCATCGATTATATGTAATTAAAAATTTATCTTTAGGATTATCTATGTTGAGTTTGATCTTGATTTGCAGCGAAGGGGTGGCCATAGAGAACAGAAGTGGAAAGAGAAGGCAAAGGTTGACAGAAATTCTACGGAGTCAGAGGCTGCTGCTGACGTTGTTACTAATGCACTCGGTAAATTGAGGGTCACTGAAAGTGATCAACCTCATGTTCTTACTTCTAGTGCGCAGTTTGGAAATGCCCAGCTGACAAATCAGGCCACCCCTGGGCTTGCTCATAGAGCAATTTGGAAACCAAAAGCGTATGGAACAACCAGTGGGGCCGCAGTGGTTGAAGGTGAAAAAGAACCTACCAATGGAACGTCAACCGAAAACAAAGGGAGTAAAGCTGAGCTGGCAGCACAGAATGGCGCCGTTAGCTTGAGTCAATTATTCAAGGGCAATCAGATTGAAAAGTTTACCGTGGATAATTCTACTTACACACAAGCACAAATAAGAGCTACGTTTTACCCTAAATTTGAGAATGAGAAGTCGGATCAGGAGGTATTTCTTTACTTGATGTGTGTCTTTTTGAAGTTAAACTTGGACTCTGGATTTGCTTTATTACACAGAATTATTTACTTTTAGCTCATTTATTAGCTTGACATTATTCTACCTCCAAGATGTTTTTCCCATTCACCAAGTCATTGGCTTGTTTGGGTATGTGTTTCTAGGTGGATGCTTGTGGTTGTGTACTTGTTTTTGAGTCACTTGACCATCATCTTTGGGTCATGGGCGAAATTTTGCTGTAGCTTTTGGGCCATTGGTGTTTCTTTAAAATCCAGGTTCCCAAGGCTTTTTAATATCTCCCCGGTAACAAATGTGGTTGTTAGTTCTTGTTGGGATTTATCAACTTCTTCATGGAGTTTCTCTTTCAGACAAGCATTGAAGGATGAAGAGGTGGTCAATTTTCAGTCTCTTTCTTCATGCCTCTCTAAAATTTCACTATATTCGTGTGAGGATGTCCAAATTTGGTCCCTTGAATCATCAGGTTTTTTCTTGATTAAGTTTCTCTCAAATCGTTTGGCTTCTTCTTCTTTGATGCCCAATGGTCTGTTCAAATCATTATGGAAGTTGAAAAGCCCGAAGAAAATTATTATTATTGTTTGGATATTGCTTAATGGAAGTCTTAATTCAAAGAAAGTTTT
GCAAAGGAAGAATACGTCTCTATGGCTTAGGCCCTCCATCTGTTTTATATGTTTTAAGGATGAGGAATCTCTCATTTCTTTGGCTGGAATTCATCCTATTGTCTTGAACCTCAGTGTTCGTGGGGGGACTTTAACTTTTTTTGGTGGGCCTAAGTTCACTACAACCTTGGGAGTTCAATACTAGGTTTGTAACCTAATATTATCACAGAAATTGCGGGCCTTTTGGATGTTGCTATTTTGTTGCAAGCCTTGTAGTATTCTAAGTTCACTACAACCTTTAGAGACTTGCCCTTATTCATTTTGTCCTTAGCGGCAACCTATTTTATGTACCTATTTAGAATCCCCAATGTGGTTAGCAAGATTTTGGAGAAGTTGGCAAGGGATTTCTTGCGGGAAACTCGAAAGGTTGACAAGAGGAAAGGCTTGCTGTTGGACCTAGGATTAGGCATAGGCAATGTAAGAGCGAGAAACAAGGTCTTGTTGGCTAAATGGCTTATGCAAATTCATCATGATTATGACACCTTGTGGCATAAGTTATTGTTACCAAGTATGGGCTTCACCTTGTTGAGTCCACCTTCGCCTTCTTTGGGTTTCCACCACCTTTTGACGATTAGGAAAACGACATGGAGATGTCCCATCATTAACTAGGAGGTTTTTTTGCATATGCCCATAAAGGTTGAATTTTGTGGTAGGTTGGTTTCTTCACAACTTTTGGGGATTTGGACTGAGAGAAACAATAGAGTTTTAAAGATATGGAGAGATCTTGGGAGGAGGTTTTGCCAATTGCTACGTTTAAAGCATCTATTTTGGGTGTCAGTTTCCAAGGAATCTTGTAATTACATGCAAAGTCTTATTATTTTGAGTTGGAACCCTTTTTTGTATAATTAGTTTCATGGCTCCTTTTAATAAACTATCTTGTTTACCCCTTGTTTTCCCTTTTGTAGTCGTTTAGAAAAATGGGAGCATGGTCTCTCATTTTTCTGTGGATTAATGGATGGGGAAATTCTCTTGTGAACCTTTTACTTTTCTTTTGGCTTCTATTGGACCTTTGTCTTTGTGAAACGTGCATCATGCCACCATATTTAATAAACCGTTCCTTTATGGAAAACAAAGAAGGTCATATGGTCATTCTGGGGCAATTTTTAAACTACTAAGTTTTGATAAGGTGATCGGATTTAATTGTTTATTTTTGTATTAGATTGATATTTATTTCGTTGTCAATGATTTGAGAATTTCGTTTCCTGGTAATATGTTTAATGTGTATATGATTTTATGTCAGATTAGAACAAGGATGATAGAGATGGTATCGAAAGGCTTGGCTACATTGGAGGTATAGAGATACCTGCAATTTTTGACAAGAATGCTGTAATGTTTATGGTGTTTAATGTTCCATTTATGATGTAGTTACTATGTATTATTTAATGTCATTCACTCCCCATTATGGATTTTAAAACCTTCAATGATGGCGTTTTAGTTTCGAAGTAGAAGTATATTGAACAATTTGGATGTTGATTCAATTTGGGTGTTCTTTTGGTGACTACTCATTTTTTGAATTGTTATGCTTCGGTACTGCAAAAGTGGATCCATACTACAACCTTTCTTGTACAATATTTGTTCTATAGGCTGTACAACTGTGTCCTGAGTTCTGAAAAAGGATTCTGTACTTGTGAGTCTAAATTCATATCTTTTTGTGAAAAATTTCATTGCTTATATAACAAAACAGTAGGTAAAATAGTTTCCTCAGTTCTTCATTGCTCATACCTTCTGGTTATTGTCTCTAATGGTAAAAGTAAATTGATCTTGTTCTGCATTCTATAAGTTGTTATTAGGTTATTCAACTTGGAAATTTCTACAGGTTTCACTAAAACACTCAGGGTCTTTGTTTATGTACGCTGGCCATGAAGGTGGAGCATATGCAAAAAACAGCTTCGGGAATATGTATGCCGATATATGCCTTTTCTTTTTAATTCTTTTTTGAAGGTGTATTGTATGCCAATTTT
CCTAAATCCATTTTGATGAAGCCGGTAAGGAATTGAATTTGAAGTGGTACTCATATCTTTTCGGTATTTTTTCTTGAACAAGGATATTTACATATGGTCATTGCGTGTAACTGTTCTGCTCCCACTACCTTCAATATCATTTGCCATCATTATTATTTCATTTGTATTTCTTAGTTTTAGAAGAGGAGGATGCCATAACTTCAGACTTCTTTTTGGAACTTGTTACCATTTCCATTCTATTTCATCACGTTTTTGAAGCTTTTTCTGAATAAAATTCCACTTTCAGCTACACTGCTGTTGGTGTCTTTGTTCTGGGAAGGATGTTTCGAGAGGCTTGGGGAGCTGGAGCAGCAAAAAAGCAGGCAGAATTCAATGATTTCCTTGAGGTATTCTTAACATTAGAAAAACAATGATTTTGTTATCTAAAGATTTTAGCTTTGATCTGATAAGCGCCCACAATGCTCGTTGGATGCAAAACTAAAGGTCATTTTAGGATCCTTGCAAACAGTCACAAAATAAAACTAACTTAACAATCGATTTCAACCAAGTGCTTAGTTAGAATGATGAATAAACTCTTAGATCGAACATAGAATCTATAATTGTCACTCTTAGATCCTAAGACAACTCACTCCAGAATTAGCTTGATCAAAACTAATCCAATAAACTGATTTAAACATCAAATCTAAAGCAAGATTTGAAAGAAAAGACAACACTAAAGGGGTTGAATAAGCACAAGCTTTTTTAATATCAATAATTTGAGTTTTGAAGAACATTAAAAACCATTTTAAAGGCCAAATGGTCGTGCATAAGACCAAAAACTCAAAAGTCATTCAAATACAATTAATTGCAAATTGTAGAAAATAAGACCTAGGAACTTAAAACTTAATTGCATACTAATTGATAAATCAAAATGCAAACTAAAATGTAGAAAGTCTTCAAATTGTCTTAAATAATTCCTCTAGAAGGCGGCAAGCTTGATTTGCACGTTTGACAAGGTTTAACATCATGTTTTCCCATGCAGTACACACTTGACAGTATTTTCTTCACAAAATCTATGCACTCTTTTTTTACCATTGTGCCATCTTCCAAACTATAAGACTTTTCCCGTGCCATCTTCATGTCATTCTCTTCTTGACTCGTTGAACTAACATTATATATAAATTTGGGCTCAAAATTATTGGTCGGTTCTTGTGAATTTGCTAGTTTTTGAAGATGTAGTGTGAAAGCCTCTAGAATCTTCTTTGGGTTGCTTCTTGTAATTGCTCCTTCGGGTACATGCAATGGTTCAAATGGTTGAATTCACATCATGATCCTTGCATCTTTTTTATGCCTTGTTGATTGGTGCCCCATTTTTGTTTATATTCTACAAGTTTTATTTTTGGTTCTTTTGGTTGTGGGTTTTCTTCCTTTACGTCCTTTTGTTTCTTTCATCTTTCTCTATTAAAGGACGGCTTTCCATGAAAAATAAAATGCTATGAGAATCTTGAACATGTTTATATTTGTTTTGTCATTTCACATCTCCTGGCAGCACTCAAATCATCTTGCTACCTTTCTTCTTTTAGAGTAACCGCATGTGCATATCAATGGAGTTAGTAACTGCTGTTTTGGGAGATCATGGCCAGAGACCACGTGAGGATTATGGTAATATCAATTTAGAATGTTTTATTGTTTTGCTTCCTTTTAAAAGAAAAAAAAAAAAAGAAAGAAAGAAAAAGAAAAATAGAACTTCTAATTTATTACCCTTTCGTCTTAAATTCAGAAGGAGCTCCAAAATCTTTAAATATTCAAGTTGTACTAATTGTTTTTGGATTTAATTAAGAGGGAGTGCCAGAAGGAATTGTATGAACAATCTTTCCTTTGATACGGATGGAATTCTCACTCAAGATTTTTTCCCTTAGTTCTTTTTAGTTATCTTTTGTATGACTGTTGATTTGAAACTGGTGGAAGTCCTTACTTGAGCGTCCTTTTTTTATAGATTTTCAATCCTTAAGCTGTTCAGC
TGTTTGATTTGATTTTTGAGTTTTTATATATAATATTTCTCTTGACATGTTTCCTGCCCTCCAATTAAACTTATGGTTACTTTTTTCAACATTAGTCAGTTATGTTTCCGTTGAAGTTATTATTATTTTTATATTGTGCTTCTCTCTCAGTATATTGCATTAGGTTTTAAGTAGTGCGCTCTTTCATTTTGTGCAGTGGTAGTTACAGCCGTTACAGAACTGGGCAAGGGAAAGCCGAAGTTCTATTCAACTGCAGAGATAATAGCCTTTTGTAGAAAATGGCGGTTACCAACTAATCATGTTTGGTTATTCTCAAGCAGGTGATGGCCCATAACTTCATTCACATTTTTTTTTTCCATTTGAAGATTTTTTTAAATTATCTATATCTATCTATCTATATATATAGATAGATAGATAGATGTTATGTAATTTCTAATTTTTGCATGACATGGAAAATTAGATAAAAGATTCTATTTCAAAAAAAAGTGATGATATCTTCACAGTTGCTGAACTCAGTAAAACCTTCCATGCTTCTCTGATTTTTGACACTGAAATGTTACCATAACCTGTGTGTTAAATTGCCGATTGACCCAAAAGCTTAAGCTAGTGAGTCAATTGGCGATTTAACATGATATCAGAGCAGGTGGTCCAGGAAGGTCCCATGTTCAAACTCCTACAATGTTGTTTACTCCCTAATTAAAATCAATTTCCACTTGTTGGGTTTTTTTCATAATTCAAGCCTACAAGGGAGGAGGATTGTTCGATGATATAATTAAATTTTCCTTCACCCATCAGCTTAAGCTTTTGAGTCAATCAATAATTTAACATTGTGCTTCATCATTTTTTTTTTGTCCCAAAAAAATGACTGAAGATCTTTTAAGATTGTTAGTCTTATTTGGCAGCAATGAATTGTCTGAGAGTTATTTTATTCGTCATTAACATTTAGTGCTTTATTAAATGTGTGTGTCATCTAGAGTTCCAAGTATTTCTTGCATGCAGCTACTTCTCTACTTTTGTTTACTTTCGTGCAGGTTTGATTTTGATTATATGACTTGAGTGCTCAAGTGATTTCTCTTTTAAATTCGGACGGTGTACTATGAATATAATGTGAATTTTACATTTTTTAAAATCTTGTAGTGTCTACTTACTGTATGTTAATGTTAAACAACTATCTTCTGGCTCCTGATCCTTTTCTTTTCTAACTTCTGAAGGAAGTCGGTGACTTCTTTTTTTGCTGCATTTGATGCCCTATGTGAAGAAGGAACTGCTACTTCAGTATGTAAGGCTCTCGATGAAGTTGCAGAAATATCTGTACCAGGTATAGTTTTTGTCAGTTTAAGCTTTTAATAGTCAGGTACAGTTGTTAGCTGCATGTTGTCCTTTTTCCAATTTGTATTCCTATATACCCTGTTTGCTTTTGAGACCTTTGTCAAAGTTAATACACTATCCCATTGAACAAATCCGTTCAATGGATAGGTCCCTTTGCTTTATGCCTGTTCTATTATTCCATGACTTGCAGCTATAAACGTTAACAAGTTGTTCCTACTTTCTTGCTTGTTGCTTATAAATTATAATCTTCTCATTTTGATGGACCCTGCAGTGTTTACTCACTTTGTGTTAAATCAGTTGAGAAGTTGATGATGATTGTCAACTGGAATTGGTGTGCAGGCTCAAAAGATCACATAAAAGTGCAGGGTGAAATTCTTGAGGGTCTTGTGGCCCGTATGGTGAGCCATGAGAGTTCAAAACACATGGAGAAAGTATTGGAAGAATTTCCTGCTCTGCCGGACAACGAAGGAGGTGTGGTGTTGCTACCGTTATTTTTTTTTAATGCTTTTCCATTGGTTACCTTCATTTTTTGTTTGTATACCCTGTTTTGTCTTAGGTGGACTTGATTTAGGACCAAGCCTGAGGGAAATTTGTGCTGCAAATAGGTCAGATGAAAAACAGGTAACAAAAATTCCTCGGCATGAAATATGATGGTTTTGTGTGTTGATTTT
TCACTTTGCGTCTTCTTGGAACAGAAATGGAATGATTTTCACCTGTTAGGCATCGTTTTTTAACTGTTATCTCTGCTACTGTAAAATACAATTGCAGCAAATAAAAGCACTTCTTCAAAACGTTGGTACTGCCTTTTGCCCTGACCATTCTGACTGGTACGGTGATTCCTACTCAAGAAATGCTGACAGATCTGTTTTATCAAAATTCTTACAAGCCAACCCAGCTGATTTTTCAACCTTCAAATTACAGGTTAGTTGTCACTTCATGTTATGAAAATGAATTTTTATCTCTATTTGTGTTTTCTCTTAATACAAATGTTGCTTTATCTGCTCCAGATCATTTGAGAACTGTTTTCACTACCACTGTTTCACGTGTTGGTTATTTTGAAATTCCTTTTGAAGAATTTGTTTTGGTGGAAAATATCATTCCCTCTTCCTCTGTTTTTTATTGATTTAGTTGGACATGCCACCTTCTTTATTAGCATCTTTCACTCTCCTCGAAATGAGTTGATCATGAAGTACTTCTAATCTGTCCAGGAAATGATTCGTCTAATGAGAGAAAAACGTCTTCCAGCTGCATTCAAATGCTACCATAATTTCCACAAAGTTGGTTCCATATCAAATGACAACCTTTTCTATAAAATGGTCATTCATGTTCACAGCGACTCTGCTTTTCGGCGATATCAAAAAGAAATGAGGTACTGCTTGGGACTTGCATTCTATTTACATTTCCAGTGTTCAAAATAATTATTTCTGTAGAGTTGAGCTACTGTCGTGTTTTTATTAATTGTTGCGTCAAATGTGTTGAACATGTTTCAGGCACAAGCCAGGTTTGTGGCCATTATATCGAGGTAATAGTCAACCTTTCATGATGTGAATTCCTTTATCTTTAGATTTCAACTACTCTGATTCAAATTATTTTAGGCCTTCACACCTTTTTAGATGATGATCATGATTCCTAAATCTAATGTTTTACTCTGGTTTGTTATGTTTATTGTGCTTTAGCAGGGAAAACATTGAACCCAAACAAAAATTTCACTAATGGTAGAAACGTTTCTAAAAAGATAAAGAAGAAAAGCCTACTAGTGGTAGACAAAAGAAAAGTAAGAAACGTAGAGCAATTTAAATAGGGCTAAGTTACAAGTGTAGTCTTTTTGAACTTTTAAGGTTATGCCTTCTAGTTAGAAAGCTTAGAGTGTGCTTGATTTGTGATTTGGTGACATAAATATGAAAAAAAAACAAGGGACTCAATGGAAAACAAAGTTGTATTTCATGTTTCCAATGTTTTTGGATGTGTTCAATAGCAAATTTCAAAATTTGACTCCAAGTTGAACAATGTTTCATAATATATTTATAGACCATAAAAATAACTTTACTTACCTGCTAAATATTAAATTAAATCTCAAATATTATTTATTAATCTAATGAACACGTGTTTTTTGGTTATAAATTGTGTAGGATCAATATTTTACAAAATCATATCATCTATATTATATTATACATATTTTACTTTAACAAAAGTAAAACGAGTTTACAATATGAAACACTATATTTAATTTTTATTTGTTAATGTAAAAATTCAAAAGGGAAATTATTATAAATGGAAAAAGTATCAAACTATTTACAAATATAGAAAGATTTCACTGTCTATCATCAATAAACCGCTATTATCACTGAGCCATAGAAGTCTATCGCGGTCTATCGCCCAGTGATAGAAGTCTATCGCGGTCTATCACGGTCTGTCGTCTATTAGCAGTGAATTTTTTTTATATTTGTAGATATTTTGGCTCATTTTGCTATATTTGAAAACAACCCAATTCAAAATTTGTATACACTATTTTATAATTTCTACTATATGGATAACGAAATTCAACTGTGGTAAAAAGAATAAATCTACTAAATAGATATTTATTGAAACCATTTGATGCATTGAATCTGAAAATATAAAACAAAATATAGATTGCATACCAGACAAGCCCTTAATATATATTTGAAAGTTA
AAAATATATTAAATATAACCTTGAAAGTTGGTAATGTAACCCATAAAGAACTAAAAACTTTAAATTTGTGAAAAGGCTCCCCAATTATTATGATTGAAGCAGTGATAAAATATTAAATTTGTTGTAATCCATCCACTTAGACTTTTGTTAGTGGTGATTTAATATAGAACTTTTAGCCAACAAGTGAAAGAGAGTGTTTATTAGTAGAAGATTAACCCATCAACTTAATCTTTTGGATTTTGTGGTCATTCAATAATCATAAATATCATTGCAAGACTCGTCAACAGAAATAGAGGATAGAGAACACTATACTCGAGGATTTAAATTTAATCAAATCAGTGTATTCTCAGCTGGGAATTTGGCTTCAAATTTTCAATGATTTACTCCACACCATATTTATGGAAGGGTGATTTTAATTGTGTTTATCTTAATCATCTTGGCTTTAAGAGGAAGAATAGGAAAGTGAGGGATGTGGCCTCTCTTTCGGTCTTGCTTGAGGGGGTTGTCTTTAGACAAGAAGGAAAGGATGTCAGGGTGTGGATTCCCAATCCTTAGGAGGAATTCTTGTGTAACTCTTTTTTCGTAGCCTTAGTTGAGATGTCCGAGTGTACCTTCTGATCCCAATTTTGTTCTCATGTGTTGTGTCTAGTTATCTTGCTATGTATTGAAACCTAAACTATTGTAACATGAGCTTTGTCTCATTTCATTTTATCAATGAAAGAGATTGTTTCCTTTTAAAAAACCTCTTTTTCATAGTCTTCTTGATCCTTCTCCCTTAGGTGAGTTGGTCATTTCTACTCTCTAGAGGATTAAGATTCCTAGAAAAGTTAAGTTCTTTACCTAGCAAGTTCTACATGGAATATGGAATAATTAACACTTTGGATCCACTTGTTTGGAAAATATCCTTGTTAGTCAGTCTGTTTTGCTATATTTTCAGTCGGAAGGCGGGGAAAAACTTGAAGCACATTCTTTGAAGTTGTGAGTTCGCGAGAACTGTTTGGAGTGACTTTTTTCAGATGTTTGGTTGCTTGTCATAGGGAACCATGGATGAAGAGATAAGACTTTTTGAGACAAATTTCTTATCAGTATAGATGGGTCCATGAGGAGTTTGATAATGTACATGTGGGTGGAGTTTATAGATGAATTGATTTTTGGTGAAGTAATAGAATAATATTTATGATGTATATAGGGATTAATTTATATTTTTTTACTTTGTCATAAGTCTTTCCCAGTTTCCTCTTTCCTCATTTGCACTTTGTAAACTCTCCAAGCTAGCCTCAATATTGTAACATCTTCATCCTCTTTAGTAAAAATTTGCTTCTTTAAGAAGAAATAAGAAATGAAAATAAAAAAGAAAAAGGAAAGAAGTAAGTAAGATATGCTGTATTGAGCTTTTTGACTGTTCTGAACTGTATAAAAGACTTATTGCTCACAGTTTCATTTGTTCCCAGGCTTTTTTGTTGACATCAATTTATTCAAAGAAAACAAGGACAAGGCTGCTGAATTAGTGAAAAGTAAGAGCAATTTGATGGAGATTGAAGGCAATGGGACCTTAGGAAGAGATGGATTTGCTGATGAAGATGCAAATCTGATGATAAAACTGAAATTTCTTACATATAAGGTAATTTTGCTTTAATTTGTCTAGTTTAAGTTATCTTAAAAGTATAAGAAGGAAATAGTTTACCTCATACAAGAGCAGGACTTTTTTGTAGGTTCTTTTTCTTTTGTTCCTTCCCGGTTCATTTCATATTGGGCCAGTTTTGTTGTTCACAATATGGTTCCCCCACCACCTTTTTGGAAAATAATATACGGTTCAATAGATATTGGCCATTGGTGTGCCATCCTTTTGGGTCATGGTGTAAACATCACTAGCCTTTCATTTTTAATTCTTAAAACCCTAATCAATTGATGTAGTAATGCTAATTTTAGTTGTGTATAGTGGTCACAAGGCTCTTTGATTTAATCAAGACTCATGAGGTCTGAAATCTGAGATATTAAAT
CAACAAGTTTCAGAGCATCGCATGTTGTAGGGTTTAGGTTGGAATTGGATGAATGTGGAGACTGAGTAAAAAACAAATTGCTTAGGTGTACTTTTTGTTGTGATTGCTATTAACAAATTGCTTAATTAAAATAGTCTGAACGATTTATATTTCTCTCTTAGCTGCGGACTTTTCTGATTCGTAATGGCTTGTCAATTCTCTTCAAAGAAGGTCCAGCTGCATACAAGGCCTATTACTTGAGGTATGATTAATTGATTTGATGAGGCATGCCATACACGTAAATTTAGAAAATGAATCTATAATTGAGGGAGGGAAATTGTAGACTAATAATCTTATGCATGGTGCAAGTGGATGAATGGGATTACTGTCCTTCACTCCTCCACTCAGTTCAAATACACAACCAGGTTGCAAACATGGAAATCTTTTTTCTTTTCCAGATCATTTCTTCTAGAATTGTTTTATTCATAATTAAGTATAAGGTCTCTTGTGCAAGCATTTTTTTCTATGAGAAACGTACTTTCATTATCAAGAGGAAAAATACAAAAAGAAGGGAGATGAGATATCACCACACACCGACATGTTACGAAAAATAAGCCCAACTGGTGCAAATATGAGGTAGGCTATAAGTACAAAAGAATTTAGAAGACCATGAAAAAGAAAAAAAATGTTACCAAATCCCAAAAGATGTCACCGTTAGTTTCTTGACCATAAAAAGTTTTAATATTTCTTTCAGACTAGATCCTTCATAGGAGGGCAAAGACTGCATTTGACCTAAGGATTTTGGTCTGGGCAAAAAACTGAGTAATCTGAAGAACCATCTGAAGCTTTGTTGGGCCCAACACCACGGTAAAATGAGCAAATTGAACAACCACCCCAGCAAATAACAAAGTGCAAAAGAAGGAAGGTGCATAAAAGATTCCTCACTGTTTATACAAAGAACACAAAAATTGGGAGAAAGGGCAGAATTTCGGAGAATTCTTTGGATTCTACAGTATTCAGCCTCTCAAAAGAGAATATCAACAATTTAACCTTCACCTTCTTAGGACATGAAGCTTTCCATATAAGGTTAGAATTTTGCCTGTTGAAAGCTGCATGTTTCACAATTTGCTTAGGAGAAAAGGAGTTAACGGAAAAGGAGCCATCATTTTCCAGCATCCACAGATTCACATCATCCCTGTTATTTGGCCTAAAAGAATGAAGCTTCATCAAGAGAGCAGCCCATTCATCAAACTCCCGCATCCAAGTGCTATGTACTACAGAGAAGGGGAATAGGTTTACCATTGTTCAATATGGAGCCAGATTGTAGTTTTAAGGCAATTCCACATCCAAGATCTTCATTTCTACCCAAGGCCTTTTCTTCCCCAACAATACATCCCAAAAGAAAAAAATCCATCTATTGATCATAGGCTTTCACAAAACTGATCTTAGATATAAGACCCTCCAACATCAAATTCTTATAATCTTTAATAGTCTTGTTCACAACGAGGATTTCGTCCATAATTCTTCTCCATACCACATAGGCGATATTCCAAGGGTCTATGGAGAACTTCCCTAAGTCATTCGGGCCAGAAGCTTAACAATGATGTTATAAAGGATAATGATAAGGCTAATAGGTCTAAAATCTTTGACCTAATCTTGGGGATGAGGCAAATATAACATCCATTCATGGACTAATCGATGACCCTCCTCTTGTAGAACTTTGACCAATTCTTTCTTTAAGTTTTTGTGCTGATATTTGAGAAATTTGAATTGAAATTGATAGTAAGAGTTTAAAAATGTTGAAAAGGACTTAGAATTTCCTTAGTATATTACCGTATTTTGTCTGAACACTCATTATTACATTAGTATTAATGTTGAGCGTGTGTTTTTTTTCTCTTCTGTATACTTTATTTTAATGATGTAGGCAAATGAAGTTGTGGGGTACATCAGCCGGAAAACAAAGGGAGCTCAGCAAGATGCTTGATGAATGGTAAGTATACAATTATTGGTTCTTGGGCTGC
TTTCTTGTTCCACTAGAACTGTAGGTACTTCACAGGAAAAAATATTGAAGTTGATGATATACTTTTAACATGATAGGGCTGTATACTTGAGGAGGAAGTATGGAAATAAACAACTGTCATCGGCTACCTATCTTAGTGAAGCCGAACCTTTTCTTGAACAGTATGCTAAACGCAGTCCTCAGAATCAGGCTCTTATTGGATCTGCTGGAAATTTAGTTAGAGCAGAAGATTTCTTGGCCATTGTCGAGGAAGGAATGGACGAAGAGGGTGATCTTCAGAAGGAGCAGGAGGCAGCACCATCAAGTCCAATGCTCTCTGGGAAGGATGCTGTGCCAAAAGCAGAGGGTTTAATTGTGTTTTTTCCAGGTTACTATTTGGCTAATTGGCTAGGCTTTGTTTATTTCCAATGTTGTGAAGATTTTTGTGGTCGAGTTTTTACATGTGCAAATTTTATTAGCTAGGTAAGGATTATGAATTATCTTGGTGGAGGAACTTCATACTTCCATAATATGTGTGTGTTAGATGATATAATATTAAATTTACCTCACCCATCAACTTAAGCTTTTGGGTCAATTGGTGATTTAAGATGGTGTTAAAGCAAGTGGTCCAGGAGGTCTTGTGTTCAAACTCCTACAATGCCTATTTCCTCTCCAATTAATATTGATTTCCACTTGGTGGGTCTTCTACATATTTCAAGCCCACAAGTGAAGGAGAGTATTAGATATACTATTAAATTTATCTTCACCCATCAGCTCAAGCTTTTGGGTCAATTGGTAATTTAAGAGAGGACATGTATTTTTGTCATGTTTTAGATAAGGCTACCTGAACCTAGTTCTTCTTGTAGCCATGTCATTTTATAACAGTTGTACGGGTATGCTGTCTAGTACTTTGAATTTGTGGTAAAAGAATTAAGGAACTATTCTACCCCATTTGACGTTTCAACAATATTTAATCTCTTCAAGTGAAGACATTAAAGTATGGGTTGAATGCAGTTTGAGTTAAACAAGTACATTTATGGTTGGTTGTCATTTTCCCACATCCTCCAATTATAAAGCTACAAAAGTTAAGTTGTTACACTGTATCACATTGAGGTTCAAATAAATTTATTGTTTTCATGTGGAAATTCTTTGGCAGTTTTTTTTTTTTTTCTTTTTGCATTTTGACTCTTACAAGGTGCTGTTGTTTTGGATCTTGAGGTGGCCCTCAGATGCATTACACATTTTTGTAGCATTCAGTTAATTTTAGCTTGGTATGGTTTATGAAATACATAGATCTTCTCTAATGTATCATGTACCCCTTTTTATGGTTTAACCAATTTCTCTGGTTATAGATTTTTTATAGGTCAATTTTATATGTTGCTTTGAACCTCAATTTACAGGAATTCCAGGCTGTGCAAAGTCTGCTCTTTGCAGAGAGATACTGAATGCTCCAGGAGCACTTGGAGATGATCGACCAGTCAATACTCTGATGGGGGACCTGATTAAAGGTATTACTTGAATTCTTTGACTCTTCACAGATTCTAAGCTAGCAATTTCAGATGTTCCCCGAACACACACACACACAACCAGAAAAGAAAGAAATAAAGGGTATACAAAAGTTTCTCCAATTTGTAACAAGGGAAGTTAAGTTGTTAAAAGGCTGTAAATGTTTACACCAGTTCAAAATAAAGTTCAATTATTTATTTATTTATTTATTTTTGATAAAGCTTCTTAGCCTTTGGAAGTTTGAGTTTTTTTCTTGTCGAGAGAAAAATGTATAGCCCACAGGTTCCAAGATTGGGTTTTTGGGCCTTCTCAAGGTGTTTAATTTGTAATTCATTCTAACTTCCTAATTGCTTTGGTCATTCTAAGCTTGCTTTTCCTTCTTGAAGCTAGTGAGAATATGGATGTATTGCATTGAAAATATGGGCATGCACTTTTGGATTTGGTAGAATGTAATTGCTATCGGGAGACGCATTTTTCTAATGCCCAAAATGCTTGATTAAGAGTGTTAGACATATA
ATTTTGAACAATTAAACTTGTTTCTTATATATTTAATATGCACAGGAAGATATTGGCAGAAGGTTGCTGATGAGCGTAGGAGAAAACCATACTCCATAATGCTTGCAGACAAAAATGCACCAAATGAAGAAGTGTGGAGACAGGTAATGTTCTTGTTCTTTACTCTGAATAGTAAACATTGATTCAGTGAACAGAGCAGGAGGAATATAAAATGAAGTGTCTAATGTTAGCCTCTTCTATCAAATAATTTGGTTGCAAATTCAATTCCCTTGCTTCAGATATTGCACAGTATTTTAGCATTTTGTCTCTATCTCTATTATCAACTTTTTATTTGGAAATGAGACTTCAGATTATCAAAAAATATACCAGAAGAAAGTAGGACGAGTCCTTAGAATGTTCTAGTTGCTGCTTGTAATACTCAGGTTCCTTGTTTCTCTGTGGCCTACAGTACGAATATTCCAGTGATATGTAACTTGTGGGGACAGCATTTGATTTTAATGTTGGAGGATATTCTTTGGTTCTTTTCTTGTCATCTTAATCCTTCTGTCCCTTGCATTTTCTTTTCAGTTTTACATGAGCATTGTTTCATTTAGAAGAAACTTATTGAGACTGTTATGAAACTGGTTTTTGATATATACCTAGGAATGGTCCAATTTATTCGAACTTCTTTGGTGCATGTAACTGGACCATTTCTTCACTAGCATATTACATTTCAAAATTATTTATCGTCCATTTTGGTGCTATTCTCAAACATCTATATAATGTCACTATTGATGTTATACCTATTTCATATTTACAGCTTGTGGATGAACCTGATAAGGCCACTTTTATCACATCTTCAACCTGATGGACCTTAGTCCTTGCAACCAAATCTGCACCTTGGAATAGGAAAATTGAATCTATTTCAGAATTTTGTTGATAGGCCACTTTTATTATAACTTAAATATCACATTTGTTAAAGGAATTTTAGAAAGCTATCCAATCAATTGAAGTTAGCATATGTTACAACTTAGTTAACTTTTTTCATTTTGACATTTGAATATATTTGTTGCTTTACTTTCTTATAGAATTGATTATTTAAACAAAGAATATGTCAATCACGGTCGAATGTCTTATTTTCTGTTATATCTTGTATTCCTTTTAAAATAGATGAGCATTTTCTTGTAGTACTTACTATTGCTCCTATTTTCGAAAAGAGCCCCATCTATTTTTTGATGGGAGAAGCCAAAAGGTCGTGTTAGTTTCTGAGGATGTTCTGTTTTTTAACTCTATGTTTCTATTAAACTCCTTATAAGGAACTGCTAAATATGTGCAGATTGAAGATATGTGTTGTAGCACAAGAGCCTCTGCAGTTCCAGTTGTACCTGATTCTGAAGGTTAGTCATTTGGATGCTTATAACGTTCAGTTCCTTGTCTCATGATATGTGGGGACAAGTTTTTCTTTTAATTGTTTGAAACTTGCTTTTTCAACAGCAAGTTCATGGACTATACTAATATTGTGATACTGATGGCTTTTCTCATTCACCAATGAATTCAACAATTTCCTTCTTGATAATTTGATGGGGATGGCTTTTGAAAAAAGTTGATTTTAAACCTAGATGCAATTCAGCCTCAGTTTAGTTTGGTTCTAATTAGTTTGCTTTTGATCTTGTATTTTGTTTTTGGAACCTTGCCCTTGGTATTGTTTTTTATCTTGATTAGTAGATCTGATGTATTGGATATGATGAGAGTGCTAAGGGGATGTCAACCTAGTTGAGATGTCCGGATGCACTCGCTGATCCATTGGTTTTAGCTTTCTTGACTCTTCATTATATTGTTATTTTATACTCTTTACTGTCCCATTTCATTATCTTAATGAAGAGCTCGTTTCCTTTTTCATAAAAAAAGAAAATTCAGGCAATTGAACTTGAACTTGTGAGGAAGAACTGAGAAATGATTTGTATGAATCATAAATAATTTGGTGGTTCTGGTTCTGGTTTAATAAGTTTCCAGCCTCTTAG
ACGGTTCGGCATGCTTTTTGAGGACTTCCTACTAAATTCTGTGAACATCTAAACTATAAAAATGAATTAAATAAAAATATGCTGCGGTGTAATGGAACAGGATCAGAGATTATGCTGTTTGAAAATGCTTTGTTGTGGAATAGAAAACAGGGGGAAGTAAACATTGGAGAGAGAAATTAATGATAATAAAGGAAAAAAGTTTAAAGAAAATTAGTGATACCTTAAGGTTGGCTTTTCTTGCAACTGTACACATCTTCATGCTCGTAGACACATAAACTGTGAAGAAAAAGAAAATAGAAGGAAATTATGAAGAGCATGACTGATCTGAATATAAAGGAAAGTCGAGAAAATCAGGAACGAATCTGGAAGAAAACACAATAATTGCCTCTAGCAAAGCCTCAGCTTCCAACTCATTGTCCCTTCCAATAGTGTTTTTTCATGGCATTAGGGAGGTTATGTTCAAAATTCTGTTGTATGTAGATTATGTTGCGAATGTTGTGTTATTAAAAGGGGATTGTGCCTATTGACATCATACTATGCCTTTGCTGCAAACTGTTGTTTTGGTTACTTACCATCCACACTTCACAGAATAATTATCATTTATTGTTGCTGAATGTGCAGGAACTGATTCTAACCCCTTCTCTCTTGACGCACTGGCTGTCTTCATGTTCCGTGTGCTGCAAAGAGTTAACCATCCAGTATGCGCTCTATGTTGATATTTCAAACTGTAGACGTGCCTCATTCATCATTGATTAACTAATGATTCTACTAATGATCTGTAATTCTGAAGGGAAATCTTGACAAGGCATCCCCAAATGCAGGCTATGTCCTACTAATGTTTTACCACCTTTACGAGGGCAAGGTGTGAACACTTCTCATTGCCCATTTCATGCTGGGTTTCACTTTTAGTATCTTGATTATTTTTCTCTAATGCTTTTCAGAGTCGCAGAGAGTTTGAAGGTGAGCTTATTGATCGTTTTGGGTCTTTGGCTAAGATGCCATTGCTGAAATCTGATAGGTACAAGAACTCATCCTCAGCACTGGTCTTATTTTTCATTATTTCTCTGTCCCTATATGTTAGAATCTCTTTGAAGTTTCACCCTTTACTTGTCTTAATGATGAACACTTATAATGATTATTTTTCTTTTCCATTATTTAAATCCTTTTGCAGTTTTCTTTCTTTGGTTTTGTCTTCTCATTTCCAATTATTGTTTTTAAACAAATTAACAAATTAAAATTAAAGAGGGAATCAAAATCTATACCATGAAAAGCCAACCTATTTTCTAGAAATTTCATAAGAGAGCAAAATTTACTCATATTTTTGACATGTGATCTTTGCACGAAAAGGTTTCCACCCTGAGCCATTTTGCTTTAATGATCTCTCTTAAATCGGTGTAACTAGCATCTTTATTTTATAGATTAAAGTTTATATGACAAATTGCACTGCTTTTTTTCCCCTTTTTTCCAATGCAAAATATTGAGATTGTTTTTGCAGTTTGTACATCTAATATTTTGCTGTTCTTGGACTGCAGGAATCCTTTACCTGATAATTTGAAGACTATCTTAGAGGAAGGATTAAGTCTGTATAAGCTCCATACTAGTAGACATGGAAGGTTGGTGGCTTGTTCAAGTATTTATAGATGTTATTGTTTAAGGAGGGTAAGGATAAGATAGTAAATGAGGGGGAGACAAATCTGTTACGTGAGGAGGAGGGATGAGGGAATGTTGTATCTGGGGACCATTTAGTTGTTAAATCTGAGGGAGGGCGAGAGGAGTATGGAAAAGTTTTGCTGTAGTTTTCATTTCATGCTTTTGGTTTTCAAGAAGTTTCTCGCAGAGAGTCTCATAGGCTGTTAGTTTTGTAAAGCCTTAGGGCTATTTCCATTTTTGATATATCGATACATAGCAAGGAGATACCTTACAATTACTTTGTTGACTGGAAGTTTCTAGCATAGGAATATCAATGAATAATCTGCAATGTGTTTTCTTGTTAGCCACCT
CTTATGCAGTTGTCTGGAATCCTTCGCAGGGTGGACTCCACCAAAGGTTCCTATGCGAAAGAATGGGCCAAATGGGAAAAGAAATTGCGAGAAACTTTGTTTGGTAACACCGAGTATCTCAATGCTATTCAGGTACATTTTACTTCTCTTTAAAAGATCAAATTTTTCTCCATCCTTCAAACTCAATTTAGTTTGTATCAAACATTTTTGAAGCTTTAAAAAGCTTTTATTTTATTTTATATTTTTGAGTCAAACAATTGTTTGATGACCTTATTCACGTATAAAAGTGTTTTGGAAATTTCAACTAAATTTCATACACCACAAAATTTGAACTTTGGAAGTTAATTTCTGTTTTGCCTTTAGTCTATCGGTTGTTTTTTCTCTTATTATTTTTTACTAAGATTTCATCAACAATTCAAAAAAATAAAAACAATGCTCATCTTTTAAATAATTGAAAAACTGAAGTAGAATGGGTATCGTGGACCTAAAGGTGCAAGTATTGGCAACGCTAGTTTCTATGTTAACCATTTTTTGATCAAGACCTTAATAAAATAATCAATGTTAATTATTCAGGTTCCATTTGAGTTTGCTGTTCAAGATGTGTTGGAGCAATTAAAGAAGATCTCGAAAGGTGACTTTAAAAGCCCTATTACAGAGAGGAGGAAGTCTGGGGCCATAGTGTTTGCTGCCGTCAGTCTCCCTGTTCAGGAGATCCAAAATCTTCTTGGCACTGTAAGTGAATATGTGTATGAGCCACTCCGGCTCCCACCCCAAAGCTTAAACTAGCTCATAGGTTATGGGATAGTTAGTCTTTTATATATACTCATTCTTAACACTCCCTTTCACTCACTTGGAAATTGGTCCAAGACCGATCGGGTGATTATGAACACAAGACTTCTCTTCTACAGTACCTTGTCAAACTTCAACTAAATCCAAAAACTCAAGCTGATAGGATGTAGTATATTTAATTTTTTATACTTCCTTTTCTGTAAAGTTTGCTAAATTAGGTTCTAAAGCAACCTCCAGTTTTCTAACAGACCATTGCACGTTATTTGATGTGTCTTGTGCACAGTTGCACAGTTTATTACTGAAACAGTGGGATAAAGATAATGTGGGAGGAAAAAGAATAGTTGTTTGCATTAAAAATTTCTGGTTACAGCTTGACATATTCCTAAGACTTTTACCTGCTGAAAAATTTGTTTACCGCATTGGACAATGGACAGTCACACAGGGATTAAGTCATGGTGTAATTAGCAATTATCGAAATCATTATCAACTTCCCTTTCAATTTTATTTGTGATCTGCTTCGATATCATGCAATAAACCACATACAACAATACCATTTAAATTTGTATAATATATGTAGTGTCATCATATGCACAAGACAAATCTCACTATGTGAATCCTTACATTTTCCACCATTATTGTACAGTTGGGCAAGAAAAATCCCCGTGTTGAAGCATTCCTTAAAGAACACTTCAAGGATTATACACTTAAAGGGGCTCACGTCACACTCGCACACAAGAGAAGCCATGGTGTTAAAGGTGTAGCTGACTACGGCATCTTTGAAAACAAAGAAGTTCCAGTTGAGCTGACAGCCCTACTTTTCTCAGATAACATGGCTGCCTTTGAAGCCCGCCTAGGCAGCATTGAGAATGAAAGAGTGATTTCCAAAAATGAGTGGCCCCATGTAACCTTATGGACTAGAGAAGGGGTTGCAGCAAAAGAAGCTAACGCCTTACCACAGTTAGTATCAGAGGGCAAAGCAACTCTTGTTGAAATCAACCCTCCCATTATCATTTCAGGCATGGTGAGATTCTTTTAGCCTTTGTCCCTCAATCCATATCTCTGCATTCTCAGGTTGATAGGATAGAAATAAATGGATAGGAAATGTCCAAGGCATTAAATTTTGCTCGCTGAGGTTTTTAGATGCAGGTAATGAGAAGTTTTGTAGAATTTGTCTATGACAAAATTTTGTATGTACAGGCCACTTTGATCTA
CCTCAAAACAGGTTGAACTTTGTTTATATGATTAAGTTGGAAATTTTGTCTATTCAATACTGTTTTCTTTTTAGATATGGGATGAGTGTTGGGAGGTTGTTAAGTATTGAAGTCTCTTTTTTCATCCTTAAGGGTACATTTTTTATGCCCAAGAGGTGGCCAAGGTCTCTTTTAACCATGTTCGACATTGGAATAATGTCGTGGCCAATGGGCTTCTAAGGCATTGGAAGAGCTATCTTCTCCTCTTTTTGTTTGGTTCCCTTATTTCTCTTGATGTAGTTTAATTGTTTGTAGTGATGAATTGAACCATAGATTTCTAGAATGGTAAGTGGTGTCTTATCTATTAAGATACATTTAAAATATACTATAATATAGGTTTTGTTTTATAGCATCTAAAAGCATTTTTTAGGTTTGAGAGGATAAAAACATGTAAATAAGGATCTACCTTTCCATCAAACCTGCGGCTGCTCCCTTAAATCCTGTAGCTCATAATAACAATATTTTAAATTTGCAAACAATCCAAATTTGAACAGGAAGGTGCTTTGTTTTCTTTGTTTGTTGCTTTGTAAGTAGAAATTATTGAATGTGCTTAACAATTTTTTACGGAGTATTTTAAGCTTAAAGATTTTTTTGGCAATCTAGACAAAAGAAGAAGCATTGAACCTTAGCCGATTGCTACTAAAGAACTAGGAGTTGATTCCAACAAAGAGAAGTGCATTTATGTCATTCATGATGATTTGTGTTTTGTGAATGAGCTAATTTTGAGTAATGGATCATCTAGTTAACCAGTCTTCACAACACTTAGTCTTCCACATGATGACGTATGTGTTTTGTGAGTGAGCTAGCTTTAGTTTGGCAATGAATGGTAGCTAAATGTAACTTGAATAGGAAAGCCTTTTCGATGCACAAAAGTATAGAGACTAAAATTGATCAAACGTGACTAAAATGTTATTTAAACTTAAATTTTTTTATCTACATCTAAACCAGAATTGAGAGGCTCTATTGATTTTGAGGGATTGAACA

mRNA sequence

CCAATCTCCACTACACCCCTATTATTTTGACATTGTGTAGGAATGTCAGAATGTCAACATGGACGATATTTGGATAATTAAAGATCAAATTAGTTTTCAAGTGTGAAATTTTCTTAAGGTGATTTATGCAAAATTTCCCTCTTTTATAAAAGCCTCAGGTCCCCAAACTTCATTACAATGACACACACCAAGTTTTGGCGCAAGCCGTGTGGAAGGAGCTCTCACTTCTCTGCGGTCTCCGTCTTTCTCTCTCTATAACTATATTCATATAGCGCGTACACTTGAATGTCGGCGTCGCAGAGAATTTTCTGCGCTATAACTCTTCCTCACCCTCGTTTGTATTCATCTTGGGCCTTCCCTTTCATTTGCCACCCTCTATCCCACAATATCTTACCACGCTCTCTCACTCTCGCACCTTTAACTTCCTCCCCCTTTTCCCTTTCTCCTGATTCTCGATTCATCATGCCTTACAATCAGCGAAGGGGTGGCCATAGAGAACAGAAGTGGAAAGAGAAGGCAAAGGTTGACAGAAATTCTACGGAGTCAGAGGCTGCTGCTGACGTTGTTACTAATGCACTCGGTAAATTGAGGGTCACTGAAAGTGATCAACCTCATGTTCTTACTTCTAGTGCGCAGTTTGGAAATGCCCAGCTGACAAATCAGGCCACCCCTGGGCTTGCTCATAGAGCAATTTGGAAACCAAAAGCGTATGGAACAACCAGTGGGGCCGCAGTGGTTGAAGGTGAAAAAGAACCTACCAATGGAACGTCAACCGAAAACAAAGGGAGTAAAGCTGAGCTGGCAGCACAGAATGGCGCCGTTAGCTTGAGTCAATTATTCAAGGGCAATCAGATTGAAAAGTTTACCGTGGATAATTCTACTTACACACAAGCACAAATAAGAGCTACGTTTTACCCTAAATTTGAGAATGAGAAGTCGGATCAGGAGATTAGAACAAGGATGATAGAGATGGTATCGAAAGGCTTGGCTACATTGGAGGTTTCACTAAAACACTCAGGGTCTTTGTTTATGTACGCTGGCCATGAAGGTGGAGCATATGCAAAAAACAGCTTCGGGAATATCTACACTGCTGTTGGTGTCTTTGTTCTGGGAAGGATGTTTCGAGAGGCTTGGGGAGCTGGAGCAGCAAAAAAGCAGGCAGAATTCAATGATTTCCTTGAGAGTAACCGCATGTGCATATCAATGGAGTTAGTAACTGCTGTTTTGGGAGATCATGGCCAGAGACCACGTGAGGATTATGTGGTAGTTACAGCCGTTACAGAACTGGGCAAGGGAAAGCCGAAGTTCTATTCAACTGCAGAGATAATAGCCTTTTGTAGAAAATGGCGGTTACCAACTAATCATGTTTGGTTATTCTCAAGCAGGAAGTCGGTGACTTCTTTTTTTGCTGCATTTGATGCCCTATGTGAAGAAGGAACTGCTACTTCAGTATGTAAGGCTCTCGATGAAGTTGCAGAAATATCTGTACCAGGCTCAAAAGATCACATAAAAGTGCAGGGTGAAATTCTTGAGGGTCTTGTGGCCCGTATGGTGAGCCATGAGAGTTCAAAACACATGGAGAAAGTATTGGAAGAATTTCCTGCTCTGCCGGACAACGAAGGAGGTGGACTTGATTTAGGACCAAGCCTGAGGGAAATTTGTGCTGCAAATAGGTCAGATGAAAAACAGCAAATAAAAGCACTTCTTCAAAACGTTGGTACTGCCTTTTGCCCTGACCATTCTGACTGGTACGGTGATTCCTACTCAAGAAATGCTGACAGATCTGTTTTATCAAAATTCTTACAAGCCAACCCAGCTGATTTTTCAACCTTCAAATTACAGGAAATGATTCGTCTAATGAGAGAAAAACGTCTTCCAGCTGCATTCAAATGCTACCATAATTTCCACAAAGTTGGTTCCATATCAAATGACAACCTTTTCTATAAAATGGTCATTCATGTTCACAGCGACTCTGCTTTTCGGCGATATCAAAAAGAAAT
GAGGCACAAGCCAGGTTTGTGGCCATTATATCGAGGCTTTTTTGTTGACATCAATTTATTCAAAGAAAACAAGGACAAGGCTGCTGAATTAGTGAAAAGTAAGAGCAATTTGATGGAGATTGAAGGCAATGGGACCTTAGGAAGAGATGGATTTGCTGATGAAGATGCAAATCTGATGATAAAACTGAAATTTCTTACATATAAGCTGCGGACTTTTCTGATTCGTAATGGCTTGTCAATTCTCTTCAAAGAAGGTCCAGCTGCATACAAGGCCTATTACTTGAGGCAAATGAAGTTGTGGGGTACATCAGCCGGAAAACAAAGGGAGCTCAGCAAGATGCTTGATGAATGGGCTGTATACTTGAGGAGGAAGTATGGAAATAAACAACTGTCATCGGCTACCTATCTTAGTGAAGCCGAACCTTTTCTTGAACAGTATGCTAAACGCAGTCCTCAGAATCAGGCTCTTATTGGATCTGCTGGAAATTTAGTTAGAGCAGAAGATTTCTTGGCCATTGTCGAGGAAGGAATGGACGAAGAGGGTGATCTTCAGAAGGAGCAGGAGGCAGCACCATCAAGTCCAATGCTCTCTGGGAAGGATGCTGTGCCAAAAGCAGAGGGTTTAATTGTGTTTTTTCCAGGAATTCCAGGCTGTGCAAAGTCTGCTCTTTGCAGAGAGATACTGAATGCTCCAGGAGCACTTGGAGATGATCGACCAGTCAATACTCTGATGGGGGACCTGATTAAAGGAAGATATTGGCAGAAGGTTGCTGATGAGCGTAGGAGAAAACCATACTCCATAATGCTTGCAGACAAAAATGCACCAAATGAAGAAGTGTGGAGACAGATTGAAGATATGTGTTGTAGCACAAGAGCCTCTGCAGTTCCAGTTGTACCTGATTCTGAAGGAACTGATTCTAACCCCTTCTCTCTTGACGCACTGGCTGTCTTCATGTTCCGTGTGCTGCAAAGAGTTAACCATCCAGGAAATCTTGACAAGGCATCCCCAAATGCAGGCTATGTCCTACTAATGTTTTACCACCTTTACGAGGGCAAGAGTCGCAGAGAGTTTGAAGGTGAGCTTATTGATCGTTTTGGGTCTTTGGCTAAGATGCCATTGCTGAAATCTGATAGGAATCCTTTACCTGATAATTTGAAGACTATCTTAGAGGAAGGATTAAGTCTGTATAAGCTCCATACTAGTAGACATGGAAGGGTGGACTCCACCAAAGGTTCCTATGCGAAAGAATGGGCCAAATGGGAAAAGAAATTGCGAGAAACTTTGTTTGGTAACACCGAGTATCTCAATGCTATTCAGGTTCCATTTGAGTTTGCTGTTCAAGATGTGTTGGAGCAATTAAAGAAGATCTCGAAAGGTGACTTTAAAAGCCCTATTACAGAGAGGAGGAAGTCTGGGGCCATAGTGTTTGCTGCCGTCAGTCTCCCTGTTCAGGAGATCCAAAATCTTCTTGGCACTTTGGGCAAGAAAAATCCCCGTGTTGAAGCATTCCTTAAAGAACACTTCAAGGATTATACACTTAAAGGGGCTCACGTCACACTCGCACACAAGAGAAGCCATGGTGTTAAAGGTGTAGCTGACTACGGCATCTTTGAAAACAAAGAAGTTCCAGTTGAGCTGACAGCCCTACTTTTCTCAGATAACATGGCTGCCTTTGAAGCCCGCCTAGGCAGCATTGAGAATGAAAGAGTGATTTCCAAAAATGAGTGGCCCCATGTAACCTTATGGACTAGAGAAGGGGTTGCAGCAAAAGAAGCTAACGCCTTACCACAGTTAGTATCAGAGGGCAAAGCAACTCTTGTTGAAATCAACCCTCCCATTATCATTTCAGGCATGGTGAGATTCTTTTAGCCTTTGTCCCTCAATCCATATCTCTGCATTCTCAGGTTGATAGGATAGAAATAAATGGATAGGAAATGTCCAAGGCATTAAATTTTGCTCGCTGAGGTTTTTAGATGCAGGTAATGAGAAGTTTTGTAGA
ATTTGTCTATGACAAAATTTTGTATGTACAGGCCACTTTGATCTACCTCAAAACAGGTTGAACTTTGTTTATATGATTAAGTTGGAAATTTTGTCTATTCAATACTGTTTTCTTTTTAGATATGGGATGAGTGTTGGGAGGTTGTTAAGTATTGAAGTCTCTTTTTTCATCCTTAAGGGTACATTTTTTATGCCCAAGAGGTGGCCAAGGTCTCTTTTAACCATGTTCGACATTGGAATAATGTCGTGGCCAATGGGCTTCTAAGGCATTGGAAGAGCTATCTTCTCCTCTTTTTGTTTGGTTCCCTTATTTCTCTTGATGTAGTTTAATTGTTTGTAGTGATGAATTGAACCATAGATTTCTAGAATGGTAAGTGGTGTCTTATCTATTAAGATACATTTAAAATATACTATAATATAGGTTTTGTTTTATAGCATCTAAAAGCATTTTTTAGGTTTGAGAGGATAAAAACATGTAAATAAGGATCTACCTTTCCATCAAACCTGCGGCTGCTCCCTTAAATCCTGTAGCTCATAATAACAATATTTTAAATTTGCAAACAATCCAAATTTGAACAGGAAGGTGCTTTGTTTTCTTTGTTTGTTGCTTTGTAAGTAGAAATTATTGAATGTGCTTAACAATTTTTTACGGAGTATTTTAAGCTTAAAGATTTTTTTGGCAATCTAGACAAAAGAAGAAGCATTGAACCTTAGCCGATTGCTACTAAAGAACTAGGAGTTGATTCCAACAAAGAGAAGTGCATTTATGTCATTCATGATGATTTGTGTTTTGTGAATGAGCTAATTTTGAGTAATGGATCATCTAGTTAACCAGTCTTCACAACACTTAGTCTTCCACATGATGACGTATGTGTTTTGTGAGTGAGCTAGCTTTAGTTTGGCAATGAATGGTAGCTAAATGTAACTTGAATAGGAAAGCCTTTTCGATGCACAAAAGTATAGAGACTAAAATTGATCAAACGTGACTAAAATGTTATTTAAACTTAAATTTTTTTATCTACATCTAAACCAGAATTGAGAGGCTCTATTGATTTTGAGGGATTGAACA

Coding sequence (CDS)

ATGTCGGCGTCGCAGAGAATTTTCTGCGCTATAACTCTTCCTCACCCTCGTTTGTATTCATCTTGGGCCTTCCCTTTCATTTGCCACCCTCTATCCCACAATATCTTACCACGCTCTCTCACTCTCGCACCTTTAACTTCCTCCCCCTTTTCCCTTTCTCCTGATTCTCGATTCATCATGCCTTACAATCAGCGAAGGGGTGGCCATAGAGAACAGAAGTGGAAAGAGAAGGCAAAGGTTGACAGAAATTCTACGGAGTCAGAGGCTGCTGCTGACGTTGTTACTAATGCACTCGGTAAATTGAGGGTCACTGAAAGTGATCAACCTCATGTTCTTACTTCTAGTGCGCAGTTTGGAAATGCCCAGCTGACAAATCAGGCCACCCCTGGGCTTGCTCATAGAGCAATTTGGAAACCAAAAGCGTATGGAACAACCAGTGGGGCCGCAGTGGTTGAAGGTGAAAAAGAACCTACCAATGGAACGTCAACCGAAAACAAAGGGAGTAAAGCTGAGCTGGCAGCACAGAATGGCGCCGTTAGCTTGAGTCAATTATTCAAGGGCAATCAGATTGAAAAGTTTACCGTGGATAATTCTACTTACACACAAGCACAAATAAGAGCTACGTTTTACCCTAAATTTGAGAATGAGAAGTCGGATCAGGAGATTAGAACAAGGATGATAGAGATGGTATCGAAAGGCTTGGCTACATTGGAGGTTTCACTAAAACACTCAGGGTCTTTGTTTATGTACGCTGGCCATGAAGGTGGAGCATATGCAAAAAACAGCTTCGGGAATATCTACACTGCTGTTGGTGTCTTTGTTCTGGGAAGGATGTTTCGAGAGGCTTGGGGAGCTGGAGCAGCAAAAAAGCAGGCAGAATTCAATGATTTCCTTGAGAGTAACCGCATGTGCATATCAATGGAGTTAGTAACTGCTGTTTTGGGAGATCATGGCCAGAGACCACGTGAGGATTATGTGGTAGTTACAGCCGTTACAGAACTGGGCAAGGGAAAGCCGAAGTTCTATTCAACTGCAGAGATAATAGCCTTTTGTAGAAAATGGCGGTTACCAACTAATCATGTTTGGTTATTCTCAAGCAGGAAGTCGGTGACTTCTTTTTTTGCTGCATTTGATGCCCTATGTGAAGAAGGAACTGCTACTTCAGTATGTAAGGCTCTCGATGAAGTTGCAGAAATATCTGTACCAGGCTCAAAAGATCACATAAAAGTGCAGGGTGAAATTCTTGAGGGTCTTGTGGCCCGTATGGTGAGCCATGAGAGTTCAAAACACATGGAGAAAGTATTGGAAGAATTTCCTGCTCTGCCGGACAACGAAGGAGGTGGACTTGATTTAGGACCAAGCCTGAGGGAAATTTGTGCTGCAAATAGGTCAGATGAAAAACAGCAAATAAAAGCACTTCTTCAAAACGTTGGTACTGCCTTTTGCCCTGACCATTCTGACTGGTACGGTGATTCCTACTCAAGAAATGCTGACAGATCTGTTTTATCAAAATTCTTACAAGCCAACCCAGCTGATTTTTCAACCTTCAAATTACAGGAAATGATTCGTCTAATGAGAGAAAAACGTCTTCCAGCTGCATTCAAATGCTACCATAATTTCCACAAAGTTGGTTCCATATCAAATGACAACCTTTTCTATAAAATGGTCATTCATGTTCACAGCGACTCTGCTTTTCGGCGATATCAAAAAGAAATGAGGCACAAGCCAGGTTTGTGGCCATTATATCGAGGCTTTTTTGTTGACATCAATTTATTCAAAGAAAACAAGGACAAGGCTGCTGAATTAGTGAAAAGTAAGAGCAATTTGATGGAGATTGAAGGCAATGGGACCTTAGGAAGAGATGGATTTGCTGATGAAGATGCAAATCTGATGATAAAACTGAAATTTCTTACATATAAGCTGCGGACTTTTCTGATTCGTAATGGCTTGTCAATTCTCTTCAAAGAAGGTCCAGCTGCATACAAGGCCTATTACTTGAG
GCAAATGAAGTTGTGGGGTACATCAGCCGGAAAACAAAGGGAGCTCAGCAAGATGCTTGATGAATGGGCTGTATACTTGAGGAGGAAGTATGGAAATAAACAACTGTCATCGGCTACCTATCTTAGTGAAGCCGAACCTTTTCTTGAACAGTATGCTAAACGCAGTCCTCAGAATCAGGCTCTTATTGGATCTGCTGGAAATTTAGTTAGAGCAGAAGATTTCTTGGCCATTGTCGAGGAAGGAATGGACGAAGAGGGTGATCTTCAGAAGGAGCAGGAGGCAGCACCATCAAGTCCAATGCTCTCTGGGAAGGATGCTGTGCCAAAAGCAGAGGGTTTAATTGTGTTTTTTCCAGGAATTCCAGGCTGTGCAAAGTCTGCTCTTTGCAGAGAGATACTGAATGCTCCAGGAGCACTTGGAGATGATCGACCAGTCAATACTCTGATGGGGGACCTGATTAAAGGAAGATATTGGCAGAAGGTTGCTGATGAGCGTAGGAGAAAACCATACTCCATAATGCTTGCAGACAAAAATGCACCAAATGAAGAAGTGTGGAGACAGATTGAAGATATGTGTTGTAGCACAAGAGCCTCTGCAGTTCCAGTTGTACCTGATTCTGAAGGAACTGATTCTAACCCCTTCTCTCTTGACGCACTGGCTGTCTTCATGTTCCGTGTGCTGCAAAGAGTTAACCATCCAGGAAATCTTGACAAGGCATCCCCAAATGCAGGCTATGTCCTACTAATGTTTTACCACCTTTACGAGGGCAAGAGTCGCAGAGAGTTTGAAGGTGAGCTTATTGATCGTTTTGGGTCTTTGGCTAAGATGCCATTGCTGAAATCTGATAGGAATCCTTTACCTGATAATTTGAAGACTATCTTAGAGGAAGGATTAAGTCTGTATAAGCTCCATACTAGTAGACATGGAAGGGTGGACTCCACCAAAGGTTCCTATGCGAAAGAATGGGCCAAATGGGAAAAGAAATTGCGAGAAACTTTGTTTGGTAACACCGAGTATCTCAATGCTATTCAGGTTCCATTTGAGTTTGCTGTTCAAGATGTGTTGGAGCAATTAAAGAAGATCTCGAAAGGTGACTTTAAAAGCCCTATTACAGAGAGGAGGAAGTCTGGGGCCATAGTGTTTGCTGCCGTCAGTCTCCCTGTTCAGGAGATCCAAAATCTTCTTGGCACTTTGGGCAAGAAAAATCCCCGTGTTGAAGCATTCCTTAAAGAACACTTCAAGGATTATACACTTAAAGGGGCTCACGTCACACTCGCACACAAGAGAAGCCATGGTGTTAAAGGTGTAGCTGACTACGGCATCTTTGAAAACAAAGAAGTTCCAGTTGAGCTGACAGCCCTACTTTTCTCAGATAACATGGCTGCCTTTGAAGCCCGCCTAGGCAGCATTGAGAATGAAAGAGTGATTTCCAAAAATGAGTGGCCCCATGTAACCTTATGGACTAGAGAAGGGGTTGCAGCAAAAGAAGCTAACGCCTTACCACAGTTAGTATCAGAGGGCAAAGCAACTCTTGTTGAAATCAACCCTCCCATTATCATTTCAGGCATGGTGAGATTCTTTTAG

Protein sequence

MSASQRIFCAITLPHPRLYSSWAFPFICHPLSHNILPRSLTLAPLTSSPFSLSPDSRFIMPYNQRRGGHREQKWKEKAKVDRNSTESEAAADVVTNALGKLRVTESDQPHVLTSSAQFGNAQLTNQATPGLAHRAIWKPKAYGTTSGAAVVEGEKEPTNGTSTENKGSKAELAAQNGAVSLSQLFKGNQIEKFTVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEVSLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGAGAAKKQAEFNDFLESNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRKWRLPTNHVWLFSSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLVARMVSHESSKHMEKVLEEFPALPDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGTAFCPDHSDWYGDSYSRNADRSVLSKFLQANPADFSTFKLQEMIRLMREKRLPAAFKCYHNFHKVGSISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKDKAAELVKSKSNLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFKEGPAAYKAYYLRQMKLWGTSAGKQRELSKMLDEWAVYLRRKYGNKQLSSATYLSEAEPFLEQYAKRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKAEGLIVFFPGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRRKPYSIMLADKNAPNEEVWRQIEDMCCSTRASAVPVVPDSEGTDSNPFSLDALAVFMFRVLQRVNHPGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSLAKMPLLKSDRNPLPDNLKTILEEGLSLYKLHTSRHGRVDSTKGSYAKEWAKWEKKLRETLFGNTEYLNAIQVPFEFAVQDVLEQLKKISKGDFKSPITERRKSGAIVFAAVSLPVQEIQNLLGTLGKKNPRVEAFLKEHFKDYTLKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSDNMAAFEARLGSIENERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGMVRFF
Homology
BLAST of Clc03G02590 vs. NCBI nr
Match: XP_038894223.1 (tRNA ligase 1 isoform X1 [Benincasa hispida])

HSP 1 Score: 2288.5 bits (5929), Expect = 0.0e+00
Identity = 1151/1199 (96.00%), Positives = 1171/1199 (97.66%), Query Frame = 0

Query: 1    MSASQRIFCAITLPHPRLYSS-----WAFPFICHPLSHNILPRSLTLAPLTSSPFSLSPD 60
            MSASQRIFCAITLPHPRLY+       AFPFICHPLSH ILPRSLTLAPLTSSPF LS D
Sbjct: 1    MSASQRIFCAITLPHPRLYAPSAFNYRAFPFICHPLSHFILPRSLTLAPLTSSPFPLSRD 60

Query: 61   SRFIMPYNQRRGGHREQKWKEKAKVDRNSTESEAAADVVTNALGKLRVTESDQPHVLTSS 120
            SRFIMPYNQR+GG REQKWKEKAKVDRNSTESEAAA+VVTNALGKLRVTE+DQPHVLTSS
Sbjct: 61   SRFIMPYNQRKGGRREQKWKEKAKVDRNSTESEAAAEVVTNALGKLRVTENDQPHVLTSS 120

Query: 121  AQFGNAQLTNQATPGLAHRAIWKPKAYGTTSGAAVVEGEKEPTNGTSTENKGSKAELAAQ 180
            AQFGNAQLTNQ TPGLAHRA+WKPKAYGTTSGAA VEGEK PTNGTSTENKGS AELAAQ
Sbjct: 121  AQFGNAQLTNQVTPGLAHRAVWKPKAYGTTSGAAEVEGEKAPTNGTSTENKGSNAELAAQ 180

Query: 181  NGAVSLSQLFKGNQIEKFTVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLA 240
            NGAV LSQLFKGNQIEKFTVDNSTYT+AQIRATFYPKFENEKSDQEIRTRMIEMVSKGLA
Sbjct: 181  NGAVGLSQLFKGNQIEKFTVDNSTYTRAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLA 240

Query: 241  TLEVSLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGAGAAKKQAEFN 300
            TLEVSLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMF+EAWGA AAKKQAEFN
Sbjct: 241  TLEVSLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFQEAWGAAAAKKQAEFN 300

Query: 301  DFLESNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRKWR 360
            DFLESNRM ISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRKWR
Sbjct: 301  DFLESNRMSISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRKWR 360

Query: 361  LPTNHVWLFSSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEIL 420
            LPTNHVWLFSSRKS TSFFAAFDALCEEGTATSVCKALDEVAEISVPG+KDHIKVQGEIL
Sbjct: 361  LPTNHVWLFSSRKSATSFFAAFDALCEEGTATSVCKALDEVAEISVPGTKDHIKVQGEIL 420

Query: 421  EGLVARMVSHESSKHMEKVLEEFPALPDNEGGGLDLGPSLREICAANRSDEKQQIKALLQ 480
            EGLVAR+VSHESSKHMEKVLE+FPALPDNE GGLDLGPSLREICAANRSDEKQQIKALLQ
Sbjct: 421  EGLVARIVSHESSKHMEKVLEDFPALPDNEVGGLDLGPSLREICAANRSDEKQQIKALLQ 480

Query: 481  NVGTAFCPDHSDWYGDSYSRNADRSVLSKFLQANPADFSTFKLQEMIRLMREKRLPAAFK 540
            NVG+AFCPDHSDWYGDS+SRNADRSVLSKFLQANPADFST KLQEMIRLMREKRLPAAFK
Sbjct: 481  NVGSAFCPDHSDWYGDSHSRNADRSVLSKFLQANPADFSTSKLQEMIRLMREKRLPAAFK 540

Query: 541  CYHNFHKVGSISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKEN 600
            CYHNFHKVGSISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKEN
Sbjct: 541  CYHNFHKVGSISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKEN 600

Query: 601  KDKAAELVKSKSNLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFK 660
            KDK AELVKSK+NLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFK
Sbjct: 601  KDK-AELVKSKNNLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFK 660

Query: 661  EGPAAYKAYYLRQMKLWGTSAGKQRELSKMLDEWAVYLRRKYGNKQLSSATYLSEAEPFL 720
            EGPAAYKAYYLRQMKLWGTSAGKQRELSKMLDEWAVYLRRKYGNKQLSSATYLSEAEPFL
Sbjct: 661  EGPAAYKAYYLRQMKLWGTSAGKQRELSKMLDEWAVYLRRKYGNKQLSSATYLSEAEPFL 720

Query: 721  EQYAKRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVP 780
            EQYAKRSPQNQALIGSAGNLV+AEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVP
Sbjct: 721  EQYAKRSPQNQALIGSAGNLVKAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVP 780

Query: 781  KAEGLIVFFPGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRRK 840
            KAEGLIVFFPGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRRK
Sbjct: 781  KAEGLIVFFPGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRRK 840

Query: 841  PYSIMLADKNAPNEEVWRQIEDMCCSTRASAVPVVPDSEGTDSNPFSLDALAVFMFRVLQ 900
            PYSIMLADKNAPNEEVWRQIEDMC STRASAVPVVPDSEGTDSNPFSLDALAVFMFRVLQ
Sbjct: 841  PYSIMLADKNAPNEEVWRQIEDMCRSTRASAVPVVPDSEGTDSNPFSLDALAVFMFRVLQ 900

Query: 901  RVNHPGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSLAKMPLLKSDRNPLPD 960
            RVNHPGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSL K+PLLKSDRNPLP+
Sbjct: 901  RVNHPGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSLVKIPLLKSDRNPLPN 960

Query: 961  NLKTILEEGLSLYKLHTSRHGRVDSTKGSYAKEWAKWEKKLRETLFGNTEYLNAIQVPFE 1020
            NLKTILEEGLSLYKLHTSRHGRVDSTKGSYAKEW KWEK+LRETLFGNTEYLNAIQVPFE
Sbjct: 961  NLKTILEEGLSLYKLHTSRHGRVDSTKGSYAKEWTKWEKQLRETLFGNTEYLNAIQVPFE 1020

Query: 1021 FAVQDVLEQLKKISKGDFKSPITERRKSGAIVFAAVSLPVQEIQNLLGTLGKKNPRVEAF 1080
            FAVQDVLEQLKKISKGDFKSPITERRKSGAIVFAAV+LPVQEIQNLLGTLGKKNPRVEAF
Sbjct: 1021 FAVQDVLEQLKKISKGDFKSPITERRKSGAIVFAAVNLPVQEIQNLLGTLGKKNPRVEAF 1080

Query: 1081 LKEHFKDYTLKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSDNMAAFEARLG 1140
            LKEH+KDYTLKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSD MAAFEARLG
Sbjct: 1081 LKEHYKDYTLKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSDKMAAFEARLG 1140

Query: 1141 SIENERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGMVRFF 1195
            SIENERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPI ISG V+FF
Sbjct: 1141 SIENERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPINISGTVKFF 1198

BLAST of Clc03G02590 vs. NCBI nr
Match: XP_004147268.2 (tRNA ligase 1 isoform X1 [Cucumis sativus] >KGN64758.2 hypothetical protein Csa_013879 [Cucumis sativus])

HSP 1 Score: 2251.9 bits (5834), Expect = 0.0e+00
Identity = 1128/1195 (94.39%), Positives = 1155/1195 (96.65%), Query Frame = 0

Query: 1    MSASQRIFCAITLPHPRLYSSW-AFPFICHPLSHNILPRSLTLAPLTSSPFSLSPDSRFI 60
            MSA QRIFCA TLPHP   SS+  FPFI HPLSH ILPRSLTLAPLTSSP  +S DSRF+
Sbjct: 1    MSALQRIFCAKTLPHPPFSSSYRVFPFISHPLSHYILPRSLTLAPLTSSPLPISCDSRFV 60

Query: 61   MPYNQRRGGHREQKWKEKAKVDRNSTESEAAADVVTNALGKLRVTESDQPHVLTSSAQFG 120
            MPYNQRRG   EQKWKEKAK DRNSTESEAAA+VVTNALGKLRVTESDQPHVLTSSAQFG
Sbjct: 61   MPYNQRRGSRGEQKWKEKAKADRNSTESEAAAEVVTNALGKLRVTESDQPHVLTSSAQFG 120

Query: 121  NAQLTNQATPGLAHRAIWKPKAYGTTSGAAVVEGEKEPTNGTSTENKGSKAELAAQNGAV 180
            NAQLTNQATPGLAHRAIWKPKAYGTTSGAAV+EGEK PTN TSTENKGS A +AAQ+G V
Sbjct: 121  NAQLTNQATPGLAHRAIWKPKAYGTTSGAAVIEGEKAPTNETSTENKGSNAGVAAQDGVV 180

Query: 181  SLSQLFKGNQIEKFTVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEV 240
            SLSQLFK NQIEKFTVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEV
Sbjct: 181  SLSQLFKSNQIEKFTVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEV 240

Query: 241  SLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGAGAAKKQAEFNDFLE 300
            SLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGA AAKKQAEFNDFLE
Sbjct: 241  SLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGAEAAKKQAEFNDFLE 300

Query: 301  SNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRKWRLPTN 360
            SNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCR WRLPTN
Sbjct: 301  SNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRNWRLPTN 360

Query: 361  HVWLFSSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLV 420
            HVWLFSSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLV
Sbjct: 361  HVWLFSSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLV 420

Query: 421  ARMVSHESSKHMEKVLEEFPALPDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGT 480
            ARMVSHESSKHM+KVLEEFPALPDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGT
Sbjct: 421  ARMVSHESSKHMQKVLEEFPALPDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGT 480

Query: 481  AFCPDHSDWYGDSYSRNADRSVLSKFLQANPADFSTFKLQEMIRLMREKRLPAAFKCYHN 540
            AFCPDHSDWYGDS+SRNADRSVLSKFLQANPADFST KLQEMIRLMRE+RLPAAFKCYHN
Sbjct: 481  AFCPDHSDWYGDSHSRNADRSVLSKFLQANPADFSTSKLQEMIRLMRERRLPAAFKCYHN 540

Query: 541  FHKVGSISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKDKA 600
            FHKV SISNDNLFYKMVIHVHSDSAFRRYQKE+RHKP LWPLYRGFFVDINLFKENKDKA
Sbjct: 541  FHKVASISNDNLFYKMVIHVHSDSAFRRYQKELRHKPSLWPLYRGFFVDINLFKENKDKA 600

Query: 601  AELVKSKSNLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFKEGPA 660
            AELVKSKSNLM+ EGNGTLGRDGFADED+NLMIKLKFLTYKLRTFLIRNGLSILFKEG  
Sbjct: 601  AELVKSKSNLMDTEGNGTLGRDGFADEDSNLMIKLKFLTYKLRTFLIRNGLSILFKEGAV 660

Query: 661  AYKAYYLRQMKLWGTSAGKQRELSKMLDEWAVYLRRKYGNKQLSSATYLSEAEPFLEQYA 720
            AYKAYYLRQMKLWGTSAGKQRELSKMLDEWAVY+RRKYGNKQLSSATYLSEAEPFLEQYA
Sbjct: 661  AYKAYYLRQMKLWGTSAGKQRELSKMLDEWAVYMRRKYGNKQLSSATYLSEAEPFLEQYA 720

Query: 721  KRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKAEG 780
            KRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKE EAAPSSPMLSGKDAVPKAEG
Sbjct: 721  KRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKELEAAPSSPMLSGKDAVPKAEG 780

Query: 781  LIVFFPGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRRKPYSI 840
            LIVFFPGIPGCAKSALC+EIL APGALGDDRPVNTLMGDLIKGRYWQKVAD+RRRKPYSI
Sbjct: 781  LIVFFPGIPGCAKSALCKEILKAPGALGDDRPVNTLMGDLIKGRYWQKVADDRRRKPYSI 840

Query: 841  MLADKNAPNEEVWRQIEDMCCSTRASAVPVVPDSEGTDSNPFSLDALAVFMFRVLQRVNH 900
            MLADKNAPNEEVWRQIEDMC STRASAVPV+PDSEGTDSNPFSLDALAVFMFRVLQRVNH
Sbjct: 841  MLADKNAPNEEVWRQIEDMCRSTRASAVPVIPDSEGTDSNPFSLDALAVFMFRVLQRVNH 900

Query: 901  PGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSLAKMPLLKSDRNPLPDNLKT 960
            PGNLDKASPNAGYVLLMFYHLY+GKSRREFEGELIDRFGSL KMPLLKSDRNPLPD+LKT
Sbjct: 901  PGNLDKASPNAGYVLLMFYHLYDGKSRREFEGELIDRFGSLVKMPLLKSDRNPLPDDLKT 960

Query: 961  ILEEGLSLYKLHTSRHGRVDSTKGSYAKEWAKWEKKLRETLFGNTEYLNAIQVPFEFAVQ 1020
            ILEEG+SLYKLHTSRHGRVDSTKGSYAKEWAKWEK+LRETLF NTEYLNAIQVPFE AVQ
Sbjct: 961  ILEEGISLYKLHTSRHGRVDSTKGSYAKEWAKWEKQLRETLFSNTEYLNAIQVPFELAVQ 1020

Query: 1021 DVLEQLKKISKGDFKSPITERRKSGAIVFAAVSLPVQEIQNLLGTLGKKNPRVEAFLKEH 1080
            DVLEQLKK+SKGD+KSPITERRKSGAIVFAAVSLPVQEIQNLLGTL KKN R+EAFL+EH
Sbjct: 1021 DVLEQLKKVSKGDYKSPITERRKSGAIVFAAVSLPVQEIQNLLGTLAKKNSRIEAFLREH 1080

Query: 1081 FKDYTLKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSDNMAAFEARLGSIEN 1140
            +KDY LKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSD MAAFEARLGSIEN
Sbjct: 1081 YKDYKLKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSDKMAAFEARLGSIEN 1140

Query: 1141 ERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGMVRFF 1195
            ERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGMV+FF
Sbjct: 1141 ERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGMVKFF 1195

BLAST of Clc03G02590 vs. NCBI nr
Match: XP_008463605.1 (PREDICTED: uncharacterized protein LOC103501711 isoform X1 [Cucumis melo])

HSP 1 Score: 2242.2 bits (5809), Expect = 0.0e+00
Identity = 1124/1195 (94.06%), Positives = 1152/1195 (96.40%), Query Frame = 0

Query: 1    MSASQRIFCAITLPHPRLYSSW-AFPFICHPLSHNILPRSLTLAPLTSSPFSLSPDSRFI 60
            MSA QRIF A  LPHP   SS+  FPFICHPLSH ILPRSLTLAPLTSSPF LS DSRF+
Sbjct: 1    MSALQRIFYAKILPHPPFSSSYKVFPFICHPLSHFILPRSLTLAPLTSSPFPLSCDSRFV 60

Query: 61   MPYNQRRGGHREQKWKEKAKVDRNSTESEAAADVVTNALGKLRVTESDQPHVLTSSAQFG 120
            MPYNQRRGG  EQKWKEKAKVD++ TESEAA +VVTNALGKLRVTESDQ HVLTSSAQFG
Sbjct: 61   MPYNQRRGGRGEQKWKEKAKVDKSPTESEAAVEVVTNALGKLRVTESDQSHVLTSSAQFG 120

Query: 121  NAQLTNQATPGLAHRAIWKPKAYGTTSGAAVVEGEKEPTNGTSTENKGSKAELAAQNGAV 180
            NAQLTNQA PGLAHRAIWKPKAYGTTSGAAV+EGEK  TNGTSTENKGS A LA Q GAV
Sbjct: 121  NAQLTNQAIPGLAHRAIWKPKAYGTTSGAAVIEGEKASTNGTSTENKGSNAGLAVQGGAV 180

Query: 181  SLSQLFKGNQIEKFTVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEV 240
             LSQLFK NQIEKF VDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEV
Sbjct: 181  GLSQLFKSNQIEKFIVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEV 240

Query: 241  SLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGAGAAKKQAEFNDFLE 300
            SLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGA AAKKQAEFNDFL+
Sbjct: 241  SLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGAEAAKKQAEFNDFLQ 300

Query: 301  SNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRKWRLPTN 360
            SNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCR WRLPTN
Sbjct: 301  SNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRNWRLPTN 360

Query: 361  HVWLFSSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLV 420
            HVWLFSSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLV
Sbjct: 361  HVWLFSSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLV 420

Query: 421  ARMVSHESSKHMEKVLEEFPALPDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGT 480
            ARMVSHESSKHM+KVLEEFPA+PDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGT
Sbjct: 421  ARMVSHESSKHMQKVLEEFPAVPDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGT 480

Query: 481  AFCPDHSDWYGDSYSRNADRSVLSKFLQANPADFSTFKLQEMIRLMREKRLPAAFKCYHN 540
            AFCPDHSDWYGDS+SRNADRSVLSKFLQANPADFST KLQEMIRLMRE+RLPAAFKCYHN
Sbjct: 481  AFCPDHSDWYGDSHSRNADRSVLSKFLQANPADFSTSKLQEMIRLMRERRLPAAFKCYHN 540

Query: 541  FHKVGSISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKDKA 600
            FHKV SISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKDKA
Sbjct: 541  FHKVASISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKDKA 600

Query: 601  AELVKSKSNLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFKEGPA 660
            A LVKSKSNLM+ EGNGTLGRDGFADED+NLMIKLKFLTYKLRTFLIRNGLSILFKEGP 
Sbjct: 601  AGLVKSKSNLMDTEGNGTLGRDGFADEDSNLMIKLKFLTYKLRTFLIRNGLSILFKEGPV 660

Query: 661  AYKAYYLRQMKLWGTSAGKQRELSKMLDEWAVYLRRKYGNKQLSSATYLSEAEPFLEQYA 720
            AYKAYYLRQMKLWGTSAGKQRELSKMLDEWAVY+RRKYGNKQLSSATYLSEAEPFLEQYA
Sbjct: 661  AYKAYYLRQMKLWGTSAGKQRELSKMLDEWAVYIRRKYGNKQLSSATYLSEAEPFLEQYA 720

Query: 721  KRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKAEG 780
            KRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKAEG
Sbjct: 721  KRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKAEG 780

Query: 781  LIVFFPGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRRKPYSI 840
            LIVFFPGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERR+KPYSI
Sbjct: 781  LIVFFPGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRKKPYSI 840

Query: 841  MLADKNAPNEEVWRQIEDMCCSTRASAVPVVPDSEGTDSNPFSLDALAVFMFRVLQRVNH 900
            MLADKNAPNEEVWRQIEDMC STRASAVPV+PDSEGTDSNPFSLDALAVFMFRVLQRVNH
Sbjct: 841  MLADKNAPNEEVWRQIEDMCRSTRASAVPVIPDSEGTDSNPFSLDALAVFMFRVLQRVNH 900

Query: 901  PGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSLAKMPLLKSDRNPLPDNLKT 960
            PGNLDKASPNAGYVLLMFYHLY+GKSRREFEGELIDRFGSL KMPLLK DRNPLPD+LK+
Sbjct: 901  PGNLDKASPNAGYVLLMFYHLYDGKSRREFEGELIDRFGSLVKMPLLKPDRNPLPDDLKS 960

Query: 961  ILEEGLSLYKLHTSRHGRVDSTKGSYAKEWAKWEKKLRETLFGNTEYLNAIQVPFEFAVQ 1020
            ILEEG+SLYKLHTSRHGRVDSTKGSYAKEWAKWEK+LRETLF NTEYLNAIQVPFE AVQ
Sbjct: 961  ILEEGISLYKLHTSRHGRVDSTKGSYAKEWAKWEKQLRETLFSNTEYLNAIQVPFESAVQ 1020

Query: 1021 DVLEQLKKISKGDFKSPITERRKSGAIVFAAVSLPVQEIQNLLGTLGKKNPRVEAFLKEH 1080
            DVLEQLKKIS+GD+KSPITERRKSGAIVFAAVSLPVQEIQN+LGTLGKKN R+EAFLKEH
Sbjct: 1021 DVLEQLKKISEGDYKSPITERRKSGAIVFAAVSLPVQEIQNVLGTLGKKNSRIEAFLKEH 1080

Query: 1081 FKDYTLKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSDNMAAFEARLGSIEN 1140
            +KDY LKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSD MA FEARLGSIEN
Sbjct: 1081 YKDYKLKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSDKMAGFEARLGSIEN 1140

Query: 1141 ERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGMVRFF 1195
            ERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISG+V+FF
Sbjct: 1141 ERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGIVKFF 1195

BLAST of Clc03G02590 vs. NCBI nr
Match: XP_008463612.1 (PREDICTED: uncharacterized protein LOC103501711 isoform X2 [Cucumis melo])

HSP 1 Score: 2149.0 bits (5567), Expect = 0.0e+00
Identity = 1073/1130 (94.96%), Positives = 1099/1130 (97.26%), Query Frame = 0

Query: 65   RRGGHREQKWKEKAKVDRNSTESEAAADVVTNALGKLRVTESDQPHVLTSSAQFGNAQLT 124
            RRGG  EQKWKEKAKVD++ TESEAA +VVTNALGKLRVTESDQ HVLTSSAQFGNAQLT
Sbjct: 11   RRGGRGEQKWKEKAKVDKSPTESEAAVEVVTNALGKLRVTESDQSHVLTSSAQFGNAQLT 70

Query: 125  NQATPGLAHRAIWKPKAYGTTSGAAVVEGEKEPTNGTSTENKGSKAELAAQNGAVSLSQL 184
            NQA PGLAHRAIWKPKAYGTTSGAAV+EGEK  TNGTSTENKGS A LA Q GAV LSQL
Sbjct: 71   NQAIPGLAHRAIWKPKAYGTTSGAAVIEGEKASTNGTSTENKGSNAGLAVQGGAVGLSQL 130

Query: 185  FKGNQIEKFTVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEVSLKHS 244
            FK NQIEKF VDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEVSLKHS
Sbjct: 131  FKSNQIEKFIVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEVSLKHS 190

Query: 245  GSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGAGAAKKQAEFNDFLESNRMC 304
            GSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGA AAKKQAEFNDFL+SNRMC
Sbjct: 191  GSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGAEAAKKQAEFNDFLQSNRMC 250

Query: 305  ISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRKWRLPTNHVWLF 364
            ISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCR WRLPTNHVWLF
Sbjct: 251  ISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRNWRLPTNHVWLF 310

Query: 365  SSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLVARMVS 424
            SSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLVARMVS
Sbjct: 311  SSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLVARMVS 370

Query: 425  HESSKHMEKVLEEFPALPDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGTAFCPD 484
            HESSKHM+KVLEEFPA+PDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGTAFCPD
Sbjct: 371  HESSKHMQKVLEEFPAVPDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGTAFCPD 430

Query: 485  HSDWYGDSYSRNADRSVLSKFLQANPADFSTFKLQEMIRLMREKRLPAAFKCYHNFHKVG 544
            HSDWYGDS+SRNADRSVLSKFLQANPADFST KLQEMIRLMRE+RLPAAFKCYHNFHKV 
Sbjct: 431  HSDWYGDSHSRNADRSVLSKFLQANPADFSTSKLQEMIRLMRERRLPAAFKCYHNFHKVA 490

Query: 545  SISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKDKAAELVK 604
            SISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKDKAA LVK
Sbjct: 491  SISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKDKAAGLVK 550

Query: 605  SKSNLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFKEGPAAYKAY 664
            SKSNLM+ EGNGTLGRDGFADED+NLMIKLKFLTYKLRTFLIRNGLSILFKEGP AYKAY
Sbjct: 551  SKSNLMDTEGNGTLGRDGFADEDSNLMIKLKFLTYKLRTFLIRNGLSILFKEGPVAYKAY 610

Query: 665  YLRQMKLWGTSAGKQRELSKMLDEWAVYLRRKYGNKQLSSATYLSEAEPFLEQYAKRSPQ 724
            YLRQMKLWGTSAGKQRELSKMLDEWAVY+RRKYGNKQLSSATYLSEAEPFLEQYAKRSPQ
Sbjct: 611  YLRQMKLWGTSAGKQRELSKMLDEWAVYIRRKYGNKQLSSATYLSEAEPFLEQYAKRSPQ 670

Query: 725  NQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKAEGLIVFF 784
            NQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKAEGLIVFF
Sbjct: 671  NQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKAEGLIVFF 730

Query: 785  PGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRRKPYSIMLADK 844
            PGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERR+KPYSIMLADK
Sbjct: 731  PGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRKKPYSIMLADK 790

Query: 845  NAPNEEVWRQIEDMCCSTRASAVPVVPDSEGTDSNPFSLDALAVFMFRVLQRVNHPGNLD 904
            NAPNEEVWRQIEDMC STRASAVPV+PDSEGTDSNPFSLDALAVFMFRVLQRVNHPGNLD
Sbjct: 791  NAPNEEVWRQIEDMCRSTRASAVPVIPDSEGTDSNPFSLDALAVFMFRVLQRVNHPGNLD 850

Query: 905  KASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSLAKMPLLKSDRNPLPDNLKTILEEG 964
            KASPNAGYVLLMFYHLY+GKSRREFEGELIDRFGSL KMPLLK DRNPLPD+LK+ILEEG
Sbjct: 851  KASPNAGYVLLMFYHLYDGKSRREFEGELIDRFGSLVKMPLLKPDRNPLPDDLKSILEEG 910

Query: 965  LSLYKLHTSRHGRVDSTKGSYAKEWAKWEKKLRETLFGNTEYLNAIQVPFEFAVQDVLEQ 1024
            +SLYKLHTSRHGRVDSTKGSYAKEWAKWEK+LRETLF NTEYLNAIQVPFE AVQDVLEQ
Sbjct: 911  ISLYKLHTSRHGRVDSTKGSYAKEWAKWEKQLRETLFSNTEYLNAIQVPFESAVQDVLEQ 970

Query: 1025 LKKISKGDFKSPITERRKSGAIVFAAVSLPVQEIQNLLGTLGKKNPRVEAFLKEHFKDYT 1084
            LKKIS+GD+KSPITERRKSGAIVFAAVSLPVQEIQN+LGTLGKKN R+EAFLKEH+KDY 
Sbjct: 971  LKKISEGDYKSPITERRKSGAIVFAAVSLPVQEIQNVLGTLGKKNSRIEAFLKEHYKDYK 1030

Query: 1085 LKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSDNMAAFEARLGSIENERVIS 1144
            LKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSD MA FEARLGSIENERVIS
Sbjct: 1031 LKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSDKMAGFEARLGSIENERVIS 1090

Query: 1145 KNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGMVRFF 1195
            KNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISG+V+FF
Sbjct: 1091 KNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGIVKFF 1140

BLAST of Clc03G02590 vs. NCBI nr
Match: KAG7019255.1 (tRNA ligase 1 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 2131.7 bits (5522), Expect = 0.0e+00
Identity = 1071/1197 (89.47%), Positives = 1124/1197 (93.90%), Query Frame = 0

Query: 1    MSASQRIFCAITLP---HPRLYSSWAFPFICHPLSHNILPRSLTLAPLTSSPFSLSPDSR 60
            MSA+ RIFCAITLP    P L+S  AFPF+   LSH IL  SLTL P +  PF++  DSR
Sbjct: 1    MSATYRIFCAITLPLSSSPALHSR-AFPFVSCSLSHFILHPSLTL-PASVFPFTVCRDSR 60

Query: 61   FIMPYNQRRGGHREQKWKEKAKVDRNSTESEAAADVVTNALGKLRVTESDQPHVLTSSAQ 120
            F MPYNQRRGG REQKWKEKAKV+  STESE A++VVTNAL  LRVTES+QPH+  +S Q
Sbjct: 61   FTMPYNQRRGGRREQKWKEKAKVEGISTESETASEVVTNALSNLRVTESNQPHIPITSVQ 120

Query: 121  FGNAQLTNQATPGLAHRAIWKPKAYGTTSGAAVVEGEKEPTNGTSTENKGSKAELAAQNG 180
            FGNAQ TN ATPGL HRAIWKPKAYGTTSGAAVVEGEK P  GTSTENKGS AE+AA + 
Sbjct: 121  FGNAQPTNLATPGLGHRAIWKPKAYGTTSGAAVVEGEKAPAVGTSTENKGSNAEIAANSS 180

Query: 181  AVSLSQLFKGNQIEKFTVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATL 240
            A++LSQL KGNQIE+FTVDNS YTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATL
Sbjct: 181  AIALSQLLKGNQIEQFTVDNSAYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATL 240

Query: 241  EVSLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGAGAAKKQAEFNDF 300
            EVSLKHSGSLFMYAGH+GGAYAKNSFGNIYTAVGVFVLGRMFREAWG+ A KKQAEFNDF
Sbjct: 241  EVSLKHSGSLFMYAGHQGGAYAKNSFGNIYTAVGVFVLGRMFREAWGSVAPKKQAEFNDF 300

Query: 301  LESNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRKWRLP 360
            LESNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELG GKPKFYST+EIIAFCRKWRLP
Sbjct: 301  LESNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGNGKPKFYSTSEIIAFCRKWRLP 360

Query: 361  TNHVWLFSSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEG 420
            TNHVWLFSSRKSVTSFFA+FDALCEEGTAT+VCKALDEVAEISVPGSKDHIKVQGEILEG
Sbjct: 361  TNHVWLFSSRKSVTSFFASFDALCEEGTATTVCKALDEVAEISVPGSKDHIKVQGEILEG 420

Query: 421  LVARMVSHESSKHMEKVLEEFPALPDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNV 480
            LVARMVSHESSKHMEKVLEEFPALP NEGGGLDLGPSLREICAANRSDEKQQIKALLQNV
Sbjct: 421  LVARMVSHESSKHMEKVLEEFPALPYNEGGGLDLGPSLREICAANRSDEKQQIKALLQNV 480

Query: 481  GTAFCPDHSDWYGDSYSRNADRSVLSKFLQANPADFSTFKLQEMIRLMREKRLPAAFKCY 540
            G+AFCPDHSDWYGDS+SRNADRSV+SKFLQA PADFST KLQEM+RLMRE+RLPAAFKCY
Sbjct: 481  GSAFCPDHSDWYGDSHSRNADRSVVSKFLQAKPADFSTSKLQEMVRLMRERRLPAAFKCY 540

Query: 541  HNFHKVGSISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKD 600
            HNFHK+GSISNDNLFYKMVIHV SDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENK+
Sbjct: 541  HNFHKIGSISNDNLFYKMVIHVQSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKE 600

Query: 601  KAAELVKSKSNLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFKEG 660
            K AE+VKSK+NLME EGNGT+GRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFKEG
Sbjct: 601  KTAEIVKSKNNLMETEGNGTVGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFKEG 660

Query: 661  PAAYKAYYLRQMKLWGTSAGKQRELSKMLDEWAVYLRRKYGNKQLSSATYLSEAEPFLEQ 720
             AAYKAYYLRQMKLWGTS GKQRELSKMLDEWAVYLRRKYGNKQLSS+ YLSEAEPFLEQ
Sbjct: 661  SAAYKAYYLRQMKLWGTSFGKQRELSKMLDEWAVYLRRKYGNKQLSSSIYLSEAEPFLEQ 720

Query: 721  YAKRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKA 780
            YAKRSPQNQALIGSAGNLVRAEDFLA+VEEGMDEEGDLQKE +AAPSSPMLS KD VPKA
Sbjct: 721  YAKRSPQNQALIGSAGNLVRAEDFLAVVEEGMDEEGDLQKE-DAAPSSPMLSRKDVVPKA 780

Query: 781  EGLIVFFPGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRRKPY 840
            EGLIVFFPGIPGCAKSALCREILNAPG LGDDRPVNTLMGDLIKGRYWQKVADERRRKPY
Sbjct: 781  EGLIVFFPGIPGCAKSALCREILNAPGGLGDDRPVNTLMGDLIKGRYWQKVADERRRKPY 840

Query: 841  SIMLADKNAPNEEVWRQIEDMCCSTRASAVPVVPDSEGTDSNPFSLDALAVFMFRVLQRV 900
            SIMLADKNAPNEEVWRQIEDMC STRASAVPV+PDSEGTDSNPFSLDALAVFMFRVL RV
Sbjct: 841  SIMLADKNAPNEEVWRQIEDMCHSTRASAVPVIPDSEGTDSNPFSLDALAVFMFRVLHRV 900

Query: 901  NHPGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSLAKMPLLKSDRNPLPDNL 960
            NHPGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSL K+PLLKSDR+PLPDNL
Sbjct: 901  NHPGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSLVKIPLLKSDRSPLPDNL 960

Query: 961  KTILEEGLSLYKLHTSRHGRVDSTKGSYAKEWAKWEKKLRETLFGNTEYLNAIQVPFEFA 1020
            KTILEEGLSLYKLHTSRHGR DSTKGSYAKEWAKWEK+LRETLFGN EYLNAIQVPFEFA
Sbjct: 961  KTILEEGLSLYKLHTSRHGRADSTKGSYAKEWAKWEKQLRETLFGNAEYLNAIQVPFEFA 1020

Query: 1021 VQDVLEQLKKISKGDFKSPITERRKSGAIVFAAVSLPVQEIQNLLGTLGKKNPRVEAFLK 1080
            VQ+VLEQLKKISKGD+KSPITERRKS  IV+AAVSLPVQ+IQ+ L TLG KNP+VEAF+K
Sbjct: 1021 VQNVLEQLKKISKGDYKSPITERRKSATIVYAAVSLPVQDIQDALDTLGNKNPQVEAFIK 1080

Query: 1081 EHFKDYTLKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSDNMAAFEARLGSI 1140
            E +KDYTLK AHVTLAHKRSHG+K VADYGIFENKEVPVELTALLFSD MAAFEAR+GSI
Sbjct: 1081 EGYKDYTLKSAHVTLAHKRSHGIKAVADYGIFENKEVPVELTALLFSDKMAAFEARVGSI 1140

Query: 1141 ENERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGMVRFF 1195
            E+ERVISKNEWPHVTLWTREG+AAKEAN LPQLVSEGKATLVE+NPPIIISG V+FF
Sbjct: 1141 EDERVISKNEWPHVTLWTREGIAAKEANTLPQLVSEGKATLVELNPPIIISGKVQFF 1194

BLAST of Clc03G02590 vs. ExPASy Swiss-Prot
Match: Q0WL81 (tRNA ligase 1 OS=Arabidopsis thaliana OX=3702 GN=RNL PE=1 SV=1)

HSP 1 Score: 1538.1 bits (3981), Expect = 0.0e+00
Identity = 770/1120 (68.75%), Positives = 904/1120 (80.71%), Query Frame = 0

Query: 78   AKVDRNSTESEAAADVVTNALGKLRVTESDQPHVLTSSAQFGNAQLTNQATPGLAHRAIW 137
            A  +   + +   A+ V N  G L + ES+    +  S    N ++ N          +W
Sbjct: 3    APFESGDSSATVVAEAVNNQFGGLSLKESNTNAPVLPSQTTSNHRVQN---------LVW 62

Query: 138  KPKAYGTTSGAAVVEGEKEPTNGTSTENKGSKAELAAQNGAVSLSQLFKGNQIEKFTVDN 197
            KPK+YGT SG++      E    ++    GS  +       ++LS++F GN +EKF+VD 
Sbjct: 63   KPKSYGTVSGSS---SATEVGKTSAVSQIGSSGDTKV---GLNLSKIFGGNLLEKFSVDK 122

Query: 198  STYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEVSLKHSGSLFMYAGHEGGA 257
            STY  AQIRATFYPKFENEK+DQEIRTRMIEMVSKGLATLEVSLKHSGSLFMYAGH+GGA
Sbjct: 123  STYCHAQIRATFYPKFENEKTDQEIRTRMIEMVSKGLATLEVSLKHSGSLFMYAGHKGGA 182

Query: 258  YAKNSFGNIYTAVGVFVLGRMFREAWGAGAAKKQAEFNDFLESNRMCISMELVTAVLGDH 317
            YAKNSFGNIYTAVGVFVL RMFREAWG  A KK+AEFNDFLE NRMCISMELVTAVLGDH
Sbjct: 183  YAKNSFGNIYTAVGVFVLSRMFREAWGTKAPKKEAEFNDFLEKNRMCISMELVTAVLGDH 242

Query: 318  GQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRKWRLPTNHVWLFSSRKSVTSFFAAF 377
            GQRP +DYVVVTAVTELG GKP+FYST+EII+FCRKWRLPTNHVWLFS+RKSVTSFFAAF
Sbjct: 243  GQRPLDDYVVVTAVTELGNGKPQFYSTSEIISFCRKWRLPTNHVWLFSTRKSVTSFFAAF 302

Query: 378  DALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLVARMVSHESSKHMEKVLEE 437
            DALCEEG ATSVC+ALDEVA+ISVP SKDH+KVQGEILEGLVAR+VS +SS+ ME VL +
Sbjct: 303  DALCEEGIATSVCRALDEVADISVPASKDHVKVQGEILEGLVARIVSSQSSRDMENVLRD 362

Query: 438  FPALPDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGTAFCPDHSDWYGD-SYSRN 497
             P  P  +G  LDLG SLREICAA+RS+EKQQ++ALL++VG +FCP   +W+GD S+ ++
Sbjct: 363  HPP-PPCDGANLDLGLSLREICAAHRSNEKQQMRALLRSVGPSFCPSDVEWFGDESHPKS 422

Query: 498  ADRSVLSKFLQANPADFSTFKLQEMIRLMREKRLPAAFKCYHNFHKVGSISNDNLFYKMV 557
            AD+SV++KFLQ+ PAD+ST KLQEM+RLM+EKRLPAAFKCYHNFH+   IS DNLFYK+V
Sbjct: 423  ADKSVITKFLQSQPADYSTSKLQEMVRLMKEKRLPAAFKCYHNFHRAEDISPDNLFYKLV 482

Query: 558  IHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKDKAAELVKSKSNLMEIEGNG 617
            +HVHSDS FRRY KEMRH P LWPLYRGFFVDINLFK NK +    +KS  N  E +G G
Sbjct: 483  VHVHSDSGFRRYHKEMRHMPSLWPLYRGFFVDINLFKSNKGRDLMALKSIDNASENDGRG 542

Query: 618  TLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFKEGPAAYKAYYLRQMKLWGTSA 677
               +DG AD+DANLMIK+KFLTYKLRTFLIRNGLSILFK+G AAYK YYLRQMK+WGTS 
Sbjct: 543  E--KDGLADDDANLMIKMKFLTYKLRTFLIRNGLSILFKDGAAAYKTYYLRQMKIWGTSD 602

Query: 678  GKQRELSKMLDEWAVYLRRKYGNKQLSSATYLSEAEPFLEQYAKRSPQNQALIGSAGNLV 737
            GKQ+EL KMLDEWA Y+RRK GN QLSS+TYLSEAEPFLEQYAKRSP+N  LIGSAGNLV
Sbjct: 603  GKQKELCKMLDEWAAYIRRKCGNDQLSSSTYLSEAEPFLEQYAKRSPKNHILIGSAGNLV 662

Query: 738  RAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKAEGLIVFFPGIPGCAKSALC 797
            R EDFLAIV+  +DEEGDL K+Q   P++P  + K+AV K EGLIVFFPGIPG AKSALC
Sbjct: 663  RTEDFLAIVDGDLDEEGDLVKKQGVTPATPEPAVKEAVQKDEGLIVFFPGIPGSAKSALC 722

Query: 798  REILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRRKPYSIMLADKNAPNEEVWRQIE 857
            +E+LNAPG  GDDRPV+TLMGDL+KG+YW KVADERR+KP SIMLADKNAPNE+VWRQIE
Sbjct: 723  KELLNAPGGFGDDRPVHTLMGDLVKGKYWPKVADERRKKPQSIMLADKNAPNEDVWRQIE 782

Query: 858  DMCCSTRASAVPVVPDSEGTDSNPFSLDALAVFMFRVLQRVNHPGNLDKASPNAGYVLLM 917
            DMC  TRASAVP+V DSEGTD+NP+SLDALAVFMFRVLQRVNHPG LDK S NAGYVLLM
Sbjct: 783  DMCRRTRASAVPIVADSEGTDTNPYSLDALAVFMFRVLQRVNHPGKLDKESSNAGYVLLM 842

Query: 918  FYHLYEGKSRREFEGELIDRFGSLAKMPLLKSDRNPLPDNLKTILEEGLSLYKLHTSRHG 977
            FYHLYEGK+R EFE ELI+RFGSL KMPLLKSDR PLPD +K++LEEG+ L+ LH+ RHG
Sbjct: 843  FYHLYEGKNRNEFESELIERFGSLIKMPLLKSDRTPLPDPVKSVLEEGIDLFNLHSRRHG 902

Query: 978  RVDSTKGSYAKEWAKWEKKLRETLFGNTEYLNAIQVPFEFAVQDVLEQLKKISKGDFKSP 1037
            R++STKG+YA EW KWEK+LR+TL  N+EYL++IQVPFE  V  V E+LK I+KGD+K P
Sbjct: 903  RLESTKGTYAAEWTKWEKQLRDTLVANSEYLSSIQVPFESMVHQVREELKTIAKGDYKPP 962

Query: 1038 ITERRKSGAIVFAAVSLPVQEIQNLLGTLGKKNPRVEAFLKEHFKDY--TLKGAHVTLAH 1097
             +E+RK G+IVFAA++LP  ++ +LL  L   NP + +FL+   K     L+ +HVTLAH
Sbjct: 963  SSEKRKHGSIVFAAINLPATQVHSLLEKLAAANPTMRSFLEGKKKSIQEKLERSHVTLAH 1022

Query: 1098 KRSHGVKGVADYGIFENKEVPVELTALLFSDNMAAFEARLGSIENERVISKNEWPHVTLW 1157
            KRSHGV  VA Y    N+EVPVELT L+++D MAA  A +GS++ E V+SKNEWPHVTLW
Sbjct: 1023 KRSHGVATVASYSQHLNREVPVELTELIYNDKMAALTAHVGSVDGETVVSKNEWPHVTLW 1082

Query: 1158 TREGVAAKEANALPQLVSEGKATLVEINPPIIISGMVRFF 1195
            T EGV AKEAN LPQL  EGKA+ + I+PP+ ISG + FF
Sbjct: 1083 TAEGVTAKEANTLPQLYLEGKASRLVIDPPVSISGPLEFF 1104

BLAST of Clc03G02590 vs. ExPASy TrEMBL
Match: A0A1S3CK49 (uncharacterized protein LOC103501711 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103501711 PE=4 SV=1)

HSP 1 Score: 2242.2 bits (5809), Expect = 0.0e+00
Identity = 1124/1195 (94.06%), Positives = 1152/1195 (96.40%), Query Frame = 0

Query: 1    MSASQRIFCAITLPHPRLYSSW-AFPFICHPLSHNILPRSLTLAPLTSSPFSLSPDSRFI 60
            MSA QRIF A  LPHP   SS+  FPFICHPLSH ILPRSLTLAPLTSSPF LS DSRF+
Sbjct: 1    MSALQRIFYAKILPHPPFSSSYKVFPFICHPLSHFILPRSLTLAPLTSSPFPLSCDSRFV 60

Query: 61   MPYNQRRGGHREQKWKEKAKVDRNSTESEAAADVVTNALGKLRVTESDQPHVLTSSAQFG 120
            MPYNQRRGG  EQKWKEKAKVD++ TESEAA +VVTNALGKLRVTESDQ HVLTSSAQFG
Sbjct: 61   MPYNQRRGGRGEQKWKEKAKVDKSPTESEAAVEVVTNALGKLRVTESDQSHVLTSSAQFG 120

Query: 121  NAQLTNQATPGLAHRAIWKPKAYGTTSGAAVVEGEKEPTNGTSTENKGSKAELAAQNGAV 180
            NAQLTNQA PGLAHRAIWKPKAYGTTSGAAV+EGEK  TNGTSTENKGS A LA Q GAV
Sbjct: 121  NAQLTNQAIPGLAHRAIWKPKAYGTTSGAAVIEGEKASTNGTSTENKGSNAGLAVQGGAV 180

Query: 181  SLSQLFKGNQIEKFTVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEV 240
             LSQLFK NQIEKF VDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEV
Sbjct: 181  GLSQLFKSNQIEKFIVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEV 240

Query: 241  SLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGAGAAKKQAEFNDFLE 300
            SLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGA AAKKQAEFNDFL+
Sbjct: 241  SLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGAEAAKKQAEFNDFLQ 300

Query: 301  SNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRKWRLPTN 360
            SNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCR WRLPTN
Sbjct: 301  SNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRNWRLPTN 360

Query: 361  HVWLFSSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLV 420
            HVWLFSSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLV
Sbjct: 361  HVWLFSSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLV 420

Query: 421  ARMVSHESSKHMEKVLEEFPALPDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGT 480
            ARMVSHESSKHM+KVLEEFPA+PDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGT
Sbjct: 421  ARMVSHESSKHMQKVLEEFPAVPDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGT 480

Query: 481  AFCPDHSDWYGDSYSRNADRSVLSKFLQANPADFSTFKLQEMIRLMREKRLPAAFKCYHN 540
            AFCPDHSDWYGDS+SRNADRSVLSKFLQANPADFST KLQEMIRLMRE+RLPAAFKCYHN
Sbjct: 481  AFCPDHSDWYGDSHSRNADRSVLSKFLQANPADFSTSKLQEMIRLMRERRLPAAFKCYHN 540

Query: 541  FHKVGSISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKDKA 600
            FHKV SISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKDKA
Sbjct: 541  FHKVASISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKDKA 600

Query: 601  AELVKSKSNLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFKEGPA 660
            A LVKSKSNLM+ EGNGTLGRDGFADED+NLMIKLKFLTYKLRTFLIRNGLSILFKEGP 
Sbjct: 601  AGLVKSKSNLMDTEGNGTLGRDGFADEDSNLMIKLKFLTYKLRTFLIRNGLSILFKEGPV 660

Query: 661  AYKAYYLRQMKLWGTSAGKQRELSKMLDEWAVYLRRKYGNKQLSSATYLSEAEPFLEQYA 720
            AYKAYYLRQMKLWGTSAGKQRELSKMLDEWAVY+RRKYGNKQLSSATYLSEAEPFLEQYA
Sbjct: 661  AYKAYYLRQMKLWGTSAGKQRELSKMLDEWAVYIRRKYGNKQLSSATYLSEAEPFLEQYA 720

Query: 721  KRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKAEG 780
            KRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKAEG
Sbjct: 721  KRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKAEG 780

Query: 781  LIVFFPGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRRKPYSI 840
            LIVFFPGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERR+KPYSI
Sbjct: 781  LIVFFPGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRKKPYSI 840

Query: 841  MLADKNAPNEEVWRQIEDMCCSTRASAVPVVPDSEGTDSNPFSLDALAVFMFRVLQRVNH 900
            MLADKNAPNEEVWRQIEDMC STRASAVPV+PDSEGTDSNPFSLDALAVFMFRVLQRVNH
Sbjct: 841  MLADKNAPNEEVWRQIEDMCRSTRASAVPVIPDSEGTDSNPFSLDALAVFMFRVLQRVNH 900

Query: 901  PGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSLAKMPLLKSDRNPLPDNLKT 960
            PGNLDKASPNAGYVLLMFYHLY+GKSRREFEGELIDRFGSL KMPLLK DRNPLPD+LK+
Sbjct: 901  PGNLDKASPNAGYVLLMFYHLYDGKSRREFEGELIDRFGSLVKMPLLKPDRNPLPDDLKS 960

Query: 961  ILEEGLSLYKLHTSRHGRVDSTKGSYAKEWAKWEKKLRETLFGNTEYLNAIQVPFEFAVQ 1020
            ILEEG+SLYKLHTSRHGRVDSTKGSYAKEWAKWEK+LRETLF NTEYLNAIQVPFE AVQ
Sbjct: 961  ILEEGISLYKLHTSRHGRVDSTKGSYAKEWAKWEKQLRETLFSNTEYLNAIQVPFESAVQ 1020

Query: 1021 DVLEQLKKISKGDFKSPITERRKSGAIVFAAVSLPVQEIQNLLGTLGKKNPRVEAFLKEH 1080
            DVLEQLKKIS+GD+KSPITERRKSGAIVFAAVSLPVQEIQN+LGTLGKKN R+EAFLKEH
Sbjct: 1021 DVLEQLKKISEGDYKSPITERRKSGAIVFAAVSLPVQEIQNVLGTLGKKNSRIEAFLKEH 1080

Query: 1081 FKDYTLKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSDNMAAFEARLGSIEN 1140
            +KDY LKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSD MA FEARLGSIEN
Sbjct: 1081 YKDYKLKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSDKMAGFEARLGSIEN 1140

Query: 1141 ERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGMVRFF 1195
            ERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISG+V+FF
Sbjct: 1141 ERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGIVKFF 1195
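
In each three-line block above, the unlabeled middle row is the match line: a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. As a minimal sketch of how such a block can be tallied (Python is used here only for illustration, and `count_matches` is a hypothetical helper, not part of any BLAST tool), the final block of this alignment can be counted as follows:

```python
def count_matches(query, midline, sbjct):
    """Return (identities, positives, aligned_length) for one alignment block."""
    # An identity is a column where query and subject carry the same residue.
    identities = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
    # Every non-blank midline character (letter or '+') counts as a positive.
    positives = sum(1 for m in midline if m != " ")
    return identities, positives, len(query)

# Rows copied from the last block of the alignment (query residues 1141 onward).
query   = "ERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGMVRFF"
midline = "ERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISG+V+FF"
sbjct   = "ERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGIVKFF"

identities, positives, length = count_matches(query, midline, sbjct)
print(f"identities={identities} positives={positives} length={length}")
```

The two non-identical columns in this block (M/I and R/K) are both marked `+`, so they count toward positives but not identities.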

BLAST of Clc03G02590 vs. ExPASy TrEMBL
Match: A0A1S3CL84 (uncharacterized protein LOC103501711 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103501711 PE=4 SV=1)

HSP 1 Score: 2149.0 bits (5567), Expect = 0.0e+00
Identity = 1073/1130 (94.96%), Positives = 1099/1130 (97.26%), Query Frame = 0

Query: 65   RRGGHREQKWKEKAKVDRNSTESEAAADVVTNALGKLRVTESDQPHVLTSSAQFGNAQLT 124
            RRGG  EQKWKEKAKVD++ TESEAA +VVTNALGKLRVTESDQ HVLTSSAQFGNAQLT
Sbjct: 11   RRGGRGEQKWKEKAKVDKSPTESEAAVEVVTNALGKLRVTESDQSHVLTSSAQFGNAQLT 70

Query: 125  NQATPGLAHRAIWKPKAYGTTSGAAVVEGEKEPTNGTSTENKGSKAELAAQNGAVSLSQL 184
            NQA PGLAHRAIWKPKAYGTTSGAAV+EGEK  TNGTSTENKGS A LA Q GAV LSQL
Sbjct: 71   NQAIPGLAHRAIWKPKAYGTTSGAAVIEGEKASTNGTSTENKGSNAGLAVQGGAVGLSQL 130

Query: 185  FKGNQIEKFTVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEVSLKHS 244
            FK NQIEKF VDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEVSLKHS
Sbjct: 131  FKSNQIEKFIVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEVSLKHS 190

Query: 245  GSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGAGAAKKQAEFNDFLESNRMC 304
            GSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGA AAKKQAEFNDFL+SNRMC
Sbjct: 191  GSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGAEAAKKQAEFNDFLQSNRMC 250

Query: 305  ISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRKWRLPTNHVWLF 364
            ISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCR WRLPTNHVWLF
Sbjct: 251  ISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRNWRLPTNHVWLF 310

Query: 365  SSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLVARMVS 424
            SSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLVARMVS
Sbjct: 311  SSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLVARMVS 370

Query: 425  HESSKHMEKVLEEFPALPDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGTAFCPD 484
            HESSKHM+KVLEEFPA+PDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGTAFCPD
Sbjct: 371  HESSKHMQKVLEEFPAVPDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGTAFCPD 430

Query: 485  HSDWYGDSYSRNADRSVLSKFLQANPADFSTFKLQEMIRLMREKRLPAAFKCYHNFHKVG 544
            HSDWYGDS+SRNADRSVLSKFLQANPADFST KLQEMIRLMRE+RLPAAFKCYHNFHKV 
Sbjct: 431  HSDWYGDSHSRNADRSVLSKFLQANPADFSTSKLQEMIRLMRERRLPAAFKCYHNFHKVA 490

Query: 545  SISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKDKAAELVK 604
            SISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKDKAA LVK
Sbjct: 491  SISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKDKAAGLVK 550

Query: 605  SKSNLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFKEGPAAYKAY 664
            SKSNLM+ EGNGTLGRDGFADED+NLMIKLKFLTYKLRTFLIRNGLSILFKEGP AYKAY
Sbjct: 551  SKSNLMDTEGNGTLGRDGFADEDSNLMIKLKFLTYKLRTFLIRNGLSILFKEGPVAYKAY 610

Query: 665  YLRQMKLWGTSAGKQRELSKMLDEWAVYLRRKYGNKQLSSATYLSEAEPFLEQYAKRSPQ 724
            YLRQMKLWGTSAGKQRELSKMLDEWAVY+RRKYGNKQLSSATYLSEAEPFLEQYAKRSPQ
Sbjct: 611  YLRQMKLWGTSAGKQRELSKMLDEWAVYIRRKYGNKQLSSATYLSEAEPFLEQYAKRSPQ 670

Query: 725  NQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKAEGLIVFF 784
            NQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKAEGLIVFF
Sbjct: 671  NQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKAEGLIVFF 730

Query: 785  PGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRRKPYSIMLADK 844
            PGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERR+KPYSIMLADK
Sbjct: 731  PGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRKKPYSIMLADK 790

Query: 845  NAPNEEVWRQIEDMCCSTRASAVPVVPDSEGTDSNPFSLDALAVFMFRVLQRVNHPGNLD 904
            NAPNEEVWRQIEDMC STRASAVPV+PDSEGTDSNPFSLDALAVFMFRVLQRVNHPGNLD
Sbjct: 791  NAPNEEVWRQIEDMCRSTRASAVPVIPDSEGTDSNPFSLDALAVFMFRVLQRVNHPGNLD 850

Query: 905  KASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSLAKMPLLKSDRNPLPDNLKTILEEG 964
            KASPNAGYVLLMFYHLY+GKSRREFEGELIDRFGSL KMPLLK DRNPLPD+LK+ILEEG
Sbjct: 851  KASPNAGYVLLMFYHLYDGKSRREFEGELIDRFGSLVKMPLLKPDRNPLPDDLKSILEEG 910

Query: 965  LSLYKLHTSRHGRVDSTKGSYAKEWAKWEKKLRETLFGNTEYLNAIQVPFEFAVQDVLEQ 1024
            +SLYKLHTSRHGRVDSTKGSYAKEWAKWEK+LRETLF NTEYLNAIQVPFE AVQDVLEQ
Sbjct: 911  ISLYKLHTSRHGRVDSTKGSYAKEWAKWEKQLRETLFSNTEYLNAIQVPFESAVQDVLEQ 970

Query: 1025 LKKISKGDFKSPITERRKSGAIVFAAVSLPVQEIQNLLGTLGKKNPRVEAFLKEHFKDYT 1084
            LKKIS+GD+KSPITERRKSGAIVFAAVSLPVQEIQN+LGTLGKKN R+EAFLKEH+KDY 
Sbjct: 971  LKKISEGDYKSPITERRKSGAIVFAAVSLPVQEIQNVLGTLGKKNSRIEAFLKEHYKDYK 1030

Query: 1085 LKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSDNMAAFEARLGSIENERVIS 1144
            LKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSD MA FEARLGSIENERVIS
Sbjct: 1031 LKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSDKMAGFEARLGSIENERVIS 1090

Query: 1145 KNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGMVRFF 1195
            KNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISG+V+FF
Sbjct: 1091 KNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGIVKFF 1140
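
The percentages in an HSP summary line are ratios of matched columns to alignment length. The sketch below recomputes them from the counts reported for this match (1073 identities and 1099 positives over 1130 aligned columns). It is an illustrative example only: `hsp_percentages` is a hypothetical helper, and the regular expression assumes the `Label = n/m (p%)` layout used on this page.

```python
import re

def hsp_percentages(summary_line):
    """Recompute percentages from 'Label = n/m (p%)' fields in an HSP summary line."""
    result = {}
    # Fields without an n/m ratio (for example 'Query Frame = 0') are ignored.
    for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", summary_line):
        result[label] = round(100 * int(num) / int(den), 2)
    return result

line = "Identity = 1073/1130 (94.96%), Positives = 1099/1130 (97.26%), Query Frame = 0"
print(hsp_percentages(line))
```

The recomputed values agree with the percentages printed in the summary line, which is a quick way to check that a parsed record has not been shifted or truncated.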

BLAST of Clc03G02590 vs. ExPASy TrEMBL
Match: A0A6J1HM92 (tRNA ligase 1 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111464863 PE=4 SV=1)

HSP 1 Score: 2131.3 bits (5521), Expect = 0.0e+00
Identity = 1071/1197 (89.47%), Positives = 1124/1197 (93.90%), Query Frame = 0

Query: 1    MSASQRIFCAITLP---HPRLYSSWAFPFICHPLSHNILPRSLTLAPLTSSPFSLSPDSR 60
            MSA+ RIFCAITLP    P L+S  AFPF+   LSH IL  SLTL P +  PF++  DSR
Sbjct: 1    MSATYRIFCAITLPLSSSPALHSR-AFPFVSCSLSHFILHPSLTL-PASVFPFTVCRDSR 60

Query: 61   FIMPYNQRRGGHREQKWKEKAKVDRNSTESEAAADVVTNALGKLRVTESDQPHVLTSSAQ 120
            F MPYNQRRGG REQKWKEKAKV+  STESE A++VVTNAL  LRVTES+QPH+  +S Q
Sbjct: 61   FTMPYNQRRGGRREQKWKEKAKVEGISTESETASEVVTNALSNLRVTESNQPHIPITSVQ 120

Query: 121  FGNAQLTNQATPGLAHRAIWKPKAYGTTSGAAVVEGEKEPTNGTSTENKGSKAELAAQNG 180
            FGNAQ TN ATPGL HRAIWKPKAYGTTSGAAVVEGEK P  GTS ENKGS AE+AA + 
Sbjct: 121  FGNAQPTNLATPGLGHRAIWKPKAYGTTSGAAVVEGEKAPAVGTSIENKGSNAEIAANSS 180

Query: 181  AVSLSQLFKGNQIEKFTVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATL 240
            A++LSQL KGNQIE+FTVDNS YTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATL
Sbjct: 181  AIALSQLLKGNQIEQFTVDNSAYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATL 240

Query: 241  EVSLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGAGAAKKQAEFNDF 300
            EVSLKHSGSLFMYAGH+GGAYAKNSFGNIYTAVGVFVLGRMFREAWG+ A KKQAEFNDF
Sbjct: 241  EVSLKHSGSLFMYAGHQGGAYAKNSFGNIYTAVGVFVLGRMFREAWGSVAPKKQAEFNDF 300

Query: 301  LESNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRKWRLP 360
            LESNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELG GKPKFYST+EIIAFCRKWRLP
Sbjct: 301  LESNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGNGKPKFYSTSEIIAFCRKWRLP 360

Query: 361  TNHVWLFSSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEG 420
            TNHVWLFSSRKSVTSFFA+FDALCEEGTAT+VCKALDEVAEISVPGSKDHIKVQGEILEG
Sbjct: 361  TNHVWLFSSRKSVTSFFASFDALCEEGTATTVCKALDEVAEISVPGSKDHIKVQGEILEG 420

Query: 421  LVARMVSHESSKHMEKVLEEFPALPDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNV 480
            LVARMVSHESSKHMEKVLEEFPALP NEGGGLDLGPSLREICAANRSDEKQQIKALLQNV
Sbjct: 421  LVARMVSHESSKHMEKVLEEFPALPYNEGGGLDLGPSLREICAANRSDEKQQIKALLQNV 480

Query: 481  GTAFCPDHSDWYGDSYSRNADRSVLSKFLQANPADFSTFKLQEMIRLMREKRLPAAFKCY 540
            G+AFCPDHSDWYGDS+SRNADRSV+SKFLQA PADFST KLQEM+RLMRE+RLPAAFKCY
Sbjct: 481  GSAFCPDHSDWYGDSHSRNADRSVVSKFLQAKPADFSTSKLQEMVRLMRERRLPAAFKCY 540

Query: 541  HNFHKVGSISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKD 600
            HNFHK+GSISNDNLFYKMVIHV SDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENK+
Sbjct: 541  HNFHKIGSISNDNLFYKMVIHVQSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKE 600

Query: 601  KAAELVKSKSNLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFKEG 660
            K AE+VKSK+NLME EGNGT+GRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFKEG
Sbjct: 601  KTAEIVKSKNNLMETEGNGTVGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFKEG 660

Query: 661  PAAYKAYYLRQMKLWGTSAGKQRELSKMLDEWAVYLRRKYGNKQLSSATYLSEAEPFLEQ 720
             AAYKAYYLRQMKLWGTS GKQRELSKMLDEWAVYLRRKYGNKQLSS+ YLSEAEPFLEQ
Sbjct: 661  SAAYKAYYLRQMKLWGTSFGKQRELSKMLDEWAVYLRRKYGNKQLSSSIYLSEAEPFLEQ 720

Query: 721  YAKRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKA 780
            YAKRSPQNQALIGSAGNLVRAEDFLA+VEEGMDEEGDLQKE +AAPSSPMLS KD VPKA
Sbjct: 721  YAKRSPQNQALIGSAGNLVRAEDFLAVVEEGMDEEGDLQKE-DAAPSSPMLSRKDVVPKA 780

Query: 781  EGLIVFFPGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRRKPY 840
            EGLIVFFPGIPGCAKSALCREILNAPG LGDDRPVNTLMGDLIKGRYWQKVADERRRKPY
Sbjct: 781  EGLIVFFPGIPGCAKSALCREILNAPGGLGDDRPVNTLMGDLIKGRYWQKVADERRRKPY 840

Query: 841  SIMLADKNAPNEEVWRQIEDMCCSTRASAVPVVPDSEGTDSNPFSLDALAVFMFRVLQRV 900
            SIMLADKNAPNEEVWRQIEDMC STRASAVPV+PDSEGTDSNPFSLDALAVFMFRVLQRV
Sbjct: 841  SIMLADKNAPNEEVWRQIEDMCHSTRASAVPVIPDSEGTDSNPFSLDALAVFMFRVLQRV 900

Query: 901  NHPGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSLAKMPLLKSDRNPLPDNL 960
            NHPGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSL K+PLLKSDR+PLPDNL
Sbjct: 901  NHPGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSLVKIPLLKSDRSPLPDNL 960

Query: 961  KTILEEGLSLYKLHTSRHGRVDSTKGSYAKEWAKWEKKLRETLFGNTEYLNAIQVPFEFA 1020
            KTILEEGLSLYKLHTSRHGR DSTKGSYAKEWAKWEK+LRETLFGN EYLNAIQVPFEFA
Sbjct: 961  KTILEEGLSLYKLHTSRHGRADSTKGSYAKEWAKWEKQLRETLFGNAEYLNAIQVPFEFA 1020

Query: 1021 VQDVLEQLKKISKGDFKSPITERRKSGAIVFAAVSLPVQEIQNLLGTLGKKNPRVEAFLK 1080
            VQ+VLEQLKKISKGD+KSPITERRKS  IV+AAVSLPVQ+IQ+ L TLG KNP+VEAF+K
Sbjct: 1021 VQNVLEQLKKISKGDYKSPITERRKSATIVYAAVSLPVQDIQDALDTLGNKNPQVEAFIK 1080

Query: 1081 EHFKDYTLKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSDNMAAFEARLGSI 1140
            E +KDYTLK AHVTLAHKRSHG+K VADYGIFENKEVPVELTALLFSD MAAFEAR+GSI
Sbjct: 1081 EGYKDYTLKSAHVTLAHKRSHGIKAVADYGIFENKEVPVELTALLFSDKMAAFEARVGSI 1140

Query: 1141 ENERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGMVRFF 1195
            E+ERVISKNEWPHVTLWTREG+AAKEAN LPQLVSEGKATLVE+NPPIIISG V+FF
Sbjct: 1141 EDERVISKNEWPHVTLWTREGIAAKEANTLPQLVSEGKATLVELNPPIIISGKVQFF 1194

BLAST of Clc03G02590 vs. ExPASy TrEMBL
Match: A0A6J1DUP6 (tRNA ligase 1 OS=Momordica charantia OX=3673 GN=LOC111024537 PE=4 SV=1)

HSP 1 Score: 2127.1 bits (5510), Expect = 0.0e+00
Identity = 1073/1201 (89.34%), Positives = 1113/1201 (92.67%), Query Frame = 0

Query: 1    MSASQRIFCAITLPHP------RLYSSWAFPFICHPLSHNILPRSLTLAPLTSSPFSLSP 60
            MSAS RIFCAITLPHP       L++S AF       SH I PRSL L PL SSPF LSP
Sbjct: 1    MSASHRIFCAITLPHPPRFSPSSLFNSRAF----LSTSHFIFPRSLALPPLISSPFHLSP 60

Query: 61   DSRFIMPYNQRRGGHREQKWKEKAKVDRNSTESEAAADVVTNALGKLRVTESDQPHVLTS 120
             SR IMPYNQR  G REQKWKEKAK+DR STESEAAA+VVTNALGKLRV+ES QPHV  S
Sbjct: 61   HSRSIMPYNQRSDGRREQKWKEKAKLDRTSTESEAAAEVVTNALGKLRVSESGQPHVPIS 120

Query: 121  SAQFGNAQLTNQATPGLAHRAIWKPKAYGTTS-GAAVVEGEKEPTNGTSTENKGSKAELA 180
            S +FGNAQLTNQ   GL +R IWKPKAYGTTS GAAVVE EK P  GTS ENKG+ A LA
Sbjct: 121  SREFGNAQLTNQVPSGLGNRGIWKPKAYGTTSGGAAVVEAEKAPAVGTSIENKGNTAGLA 180

Query: 181  AQNGAVSLSQLFKGNQIEKFTVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKG 240
            AQNG V LSQLFKGNQIE FTVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKG
Sbjct: 181  AQNGTVGLSQLFKGNQIENFTVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKG 240

Query: 241  LATLEVSLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGAGAAKKQAE 300
            LATLEVSLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWG+ AAKKQAE
Sbjct: 241  LATLEVSLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGSKAAKKQAE 300

Query: 301  FNDFLESNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRK 360
            FN+FLESNRMCISMELVTAVLGDHGQRPREDYVVVTAVT+LG GKPKFYSTAEII FCR+
Sbjct: 301  FNNFLESNRMCISMELVTAVLGDHGQRPREDYVVVTAVTDLGNGKPKFYSTAEIIVFCRE 360

Query: 361  WRLPTNHVWLFSSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGE 420
            WRLPTNHVWLFSSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGE
Sbjct: 361  WRLPTNHVWLFSSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGE 420

Query: 421  ILEGLVARMVSHESSKHMEKVLEEFPALPDNEGGGLDLGPSLREICAANRSDEKQQIKAL 480
            ILEGLVAR+VSHESSKHMEKVLEEFP+LPD EGGGLDLG SLREICAANRSDEKQQIKAL
Sbjct: 421  ILEGLVARIVSHESSKHMEKVLEEFPSLPDEEGGGLDLGRSLREICAANRSDEKQQIKAL 480

Query: 481  LQNVGTAFCPDHSDWYGDSYSRNADRSVLSKFLQANPADFSTFKLQEMIRLMREKRLPAA 540
            LQNVG++FCPDHSDW GDS+SR ADRSVLSKFLQ +P DFST KLQEMIRLMREKRLPAA
Sbjct: 481  LQNVGSSFCPDHSDWSGDSHSRTADRSVLSKFLQTSPTDFSTSKLQEMIRLMREKRLPAA 540

Query: 541  FKCYHNFHKVGSISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFK 600
            FKCYHNFHKVGSISND+LFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFK
Sbjct: 541  FKCYHNFHKVGSISNDDLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFK 600

Query: 601  ENKDKAAELVKSKSNLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSIL 660
             NKDKAAE++KSKSNLME+EGNG LGRDG ADEDANLMIKLKFLTYKLRTFLIRNGLSIL
Sbjct: 601  ANKDKAAEIMKSKSNLMEVEGNGILGRDGLADEDANLMIKLKFLTYKLRTFLIRNGLSIL 660

Query: 661  FKEGPAAYKAYYLRQMKLWGTSAGKQRELSKMLDEWAVYLRRKYGNKQLSSATYLSEAEP 720
            FKEGPAAYKAYYLRQMKLWGTS GKQRELSKMLDEWAVYLRRKYGN+QLSSATYLSEAEP
Sbjct: 661  FKEGPAAYKAYYLRQMKLWGTSVGKQRELSKMLDEWAVYLRRKYGNRQLSSATYLSEAEP 720

Query: 721  FLEQYAKRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDA 780
            FLEQYAKRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQE APSSPML GKD 
Sbjct: 721  FLEQYAKRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEVAPSSPMLPGKDT 780

Query: 781  VPKAEGLIVFFPGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERR 840
            V KAEGLIVFFPGIPGCAKSALCREILNAPG LGDDRPV +LMGDLIKGRYWQKV DERR
Sbjct: 781  VSKAEGLIVFFPGIPGCAKSALCREILNAPGGLGDDRPVKSLMGDLIKGRYWQKVVDERR 840

Query: 841  RKPYSIMLADKNAPNEEVWRQIEDMCCSTRASAVPVVPDSEGTDSNPFSLDALAVFMFRV 900
            RKPYSIMLADKNAPNEEVWRQIEDMC STRASAVPVVPDSEGTD NPFSLDALAVFMFRV
Sbjct: 841  RKPYSIMLADKNAPNEEVWRQIEDMCHSTRASAVPVVPDSEGTDGNPFSLDALAVFMFRV 900

Query: 901  LQRVNHPGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSLAKMPLLKSDRNPL 960
            LQRVNHPGNLDKASPNAGYVLLMFYHLYEGKSRREFE ELIDRFGSL KMPLLK DR+PL
Sbjct: 901  LQRVNHPGNLDKASPNAGYVLLMFYHLYEGKSRREFEDELIDRFGSLVKMPLLKCDRSPL 960

Query: 961  PDNLKTILEEGLSLYKLHTSRHGRVDSTKGSYAKEWAKWEKKLRETLFGNTEYLNAIQVP 1020
            PDNLKTILEEGLSLYKLHTSRHGR DSTKGSYAKEWAKWEK+LRETLFGNTEYLN+IQVP
Sbjct: 961  PDNLKTILEEGLSLYKLHTSRHGRADSTKGSYAKEWAKWEKQLRETLFGNTEYLNSIQVP 1020

Query: 1021 FEFAVQDVLEQLKKISKGDFKSPITERRKSGAIVFAAVSLPVQEIQNLLGTLGKKNPRVE 1080
            FE AVQDVLEQLKKI+KGD+K+PI+ERRKS  IVFAAVSLPVQEIQNLL TLGKKNP VE
Sbjct: 1021 FEVAVQDVLEQLKKIAKGDYKTPISERRKSATIVFAAVSLPVQEIQNLLDTLGKKNPHVE 1080

Query: 1081 AFLKEHFKDYTLKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSDNMAAFEAR 1140
            +FLK+ +KDYTLK AHVTLAHKRSHGVK VADYGIF+NKEVPVELTALLFSD MAAFEA 
Sbjct: 1081 SFLKQDYKDYTLKAAHVTLAHKRSHGVKAVADYGIFQNKEVPVELTALLFSDKMAAFEAH 1140

Query: 1141 LGSIENERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGMVRF 1195
            LGS+E+ERV+SKNEWPHVTLWTREGVAAKEAN LPQLVSEGKATLVE+NPP IISG V+F
Sbjct: 1141 LGSVEDERVVSKNEWPHVTLWTREGVAAKEANTLPQLVSEGKATLVELNPPTIISGTVKF 1197

BLAST of Clc03G02590 vs. ExPASy TrEMBL
Match: A0A6J1I3R5 (tRNA ligase 1 OS=Cucurbita maxima OX=3661 GN=LOC111469400 PE=4 SV=1)

HSP 1 Score: 2110.1 bits (5466), Expect = 0.0e+00
Identity = 1061/1200 (88.42%), Positives = 1116/1200 (93.00%), Query Frame = 0

Query: 1    MSASQRIFCAITLPHPRLYSSWA------FPFICHPLSHNILPRSLTLAPLTSSPFSLSP 60
            MSA  RIFCAITLP  RL  S A      FPFI +  SH IL  SLT+    S P ++S 
Sbjct: 1    MSAPHRIFCAITLPRHRLSYSSAFNYRVFFPFIPYSFSHRILSPSLTITDSISFPSTVSS 60

Query: 61   DSRFIMPYNQRRGGHREQKWKEKAKVDRNSTESEAAADVVTNALGKLRVTESDQPHVLTS 120
            D RF+MPYNQRRGG REQKWKEKAKV+  STESEAA+ VVTNAL  LRVTES+QPH+  +
Sbjct: 61   DFRFMMPYNQRRGGRREQKWKEKAKVEGISTESEAASQVVTNALSNLRVTESNQPHIPIT 120

Query: 121  SAQFGNAQLTNQATPGLAHRAIWKPKAYGTTSGAAVVEGEKEPTNGTSTENKGSKAELAA 180
            S QFGNAQ TN ATPGL HRAIWKPKAYGTT GAAVVEGEK    GTS ENKGS AE+AA
Sbjct: 121  SVQFGNAQPTNLATPGLGHRAIWKPKAYGTTIGAAVVEGEKASAVGTSIENKGSNAEIAA 180

Query: 181  QNGAVSLSQLFKGNQIEKFTVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGL 240
             + A++L+QL KGNQIEKFTVDNS YTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGL
Sbjct: 181  NSSAIALNQLLKGNQIEKFTVDNSAYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGL 240

Query: 241  ATLEVSLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGAGAAKKQAEF 300
            ATLEVSLKHSGSLFMYAGH+GGAYAKNSFGNIYTAVGVFVLGRMF+EAWG+ A KKQAEF
Sbjct: 241  ATLEVSLKHSGSLFMYAGHQGGAYAKNSFGNIYTAVGVFVLGRMFQEAWGSVAPKKQAEF 300

Query: 301  NDFLESNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRKW 360
            NDFLESNRMCISMELVTAVLGDHGQRP+EDYVVVTAVTELG GKPKFYST+EIIAFCRKW
Sbjct: 301  NDFLESNRMCISMELVTAVLGDHGQRPQEDYVVVTAVTELGNGKPKFYSTSEIIAFCRKW 360

Query: 361  RLPTNHVWLFSSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEI 420
            RLPTNHVWLFSSRKSVTSFFA+FDALCEEGTAT+VCKALDEVAEISVPGSKDHIKVQGEI
Sbjct: 361  RLPTNHVWLFSSRKSVTSFFASFDALCEEGTATTVCKALDEVAEISVPGSKDHIKVQGEI 420

Query: 421  LEGLVARMVSHESSKHMEKVLEEFPALPDNEGGGLDLGPSLREICAANRSDEKQQIKALL 480
            LEGLVARMVSHESSKHMEKVLEEFPALP NEGGGLDL PSLREICAANRSDEKQQIKALL
Sbjct: 421  LEGLVARMVSHESSKHMEKVLEEFPALPYNEGGGLDLEPSLREICAANRSDEKQQIKALL 480

Query: 481  QNVGTAFCPDHSDWYGDSYSRNADRSVLSKFLQANPADFSTFKLQEMIRLMREKRLPAAF 540
            QNVG+AFCPDHSDWYGDS+SRNADRSV+SKFLQA PADFSTFKLQEM+RLMRE+RLPAAF
Sbjct: 481  QNVGSAFCPDHSDWYGDSHSRNADRSVVSKFLQAKPADFSTFKLQEMVRLMRERRLPAAF 540

Query: 541  KCYHNFHKVGSISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKE 600
            KCYHNFHKVGSISNDNLFYKMVIHV SDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKE
Sbjct: 541  KCYHNFHKVGSISNDNLFYKMVIHVQSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKE 600

Query: 601  NKDKAAELVKSKSNLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILF 660
            NK+KAAE+VKSK+NLME EGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILF
Sbjct: 601  NKEKAAEIVKSKNNLMETEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILF 660

Query: 661  KEGPAAYKAYYLRQMKLWGTSAGKQRELSKMLDEWAVYLRRKYGNKQLSSATYLSEAEPF 720
            KEGPAAYKAYYLRQMKLWGTS GKQRELSKMLDEWAVYLRRKYGNKQLSS+ YLSEAEPF
Sbjct: 661  KEGPAAYKAYYLRQMKLWGTSFGKQRELSKMLDEWAVYLRRKYGNKQLSSSIYLSEAEPF 720

Query: 721  LEQYAKRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAV 780
            LEQYAKRSPQNQ LIGSAGNLVRAEDFLA+V+EGMDEEGDLQKE + APSSPMLS KD V
Sbjct: 721  LEQYAKRSPQNQTLIGSAGNLVRAEDFLAVVDEGMDEEGDLQKE-DTAPSSPMLSRKDVV 780

Query: 781  PKAEGLIVFFPGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRR 840
            PKAEGLIVFFPGIPGCAKS+LCREILNAPGALGDDRPVNTL GDLIKGRYWQKVADERRR
Sbjct: 781  PKAEGLIVFFPGIPGCAKSSLCREILNAPGALGDDRPVNTLTGDLIKGRYWQKVADERRR 840

Query: 841  KPYSIMLADKNAPNEEVWRQIEDMCCSTRASAVPVVPDSEGTDSNPFSLDALAVFMFRVL 900
            KPYSIMLADKNAPNEEVWRQIEDMC ST ASAVPV+PDSEGTDSNPFSLDALAVFMFRVL
Sbjct: 841  KPYSIMLADKNAPNEEVWRQIEDMCHSTGASAVPVIPDSEGTDSNPFSLDALAVFMFRVL 900

Query: 901  QRVNHPGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSLAKMPLLKSDRNPLP 960
            QRVNHPGNLDKASPNAGYVLLMFYH YEGKSRREFEGELIDRFGSL K+PLLKSDR+PLP
Sbjct: 901  QRVNHPGNLDKASPNAGYVLLMFYHFYEGKSRREFEGELIDRFGSLVKIPLLKSDRSPLP 960

Query: 961  DNLKTILEEGLSLYKLHTSRHGRVDSTKGSYAKEWAKWEKKLRETLFGNTEYLNAIQVPF 1020
            DNLKTILEEGLSLYKLHTSRHG  DSTKGSYAKEWA+WEK+LRETLFGN EYLNAIQVPF
Sbjct: 961  DNLKTILEEGLSLYKLHTSRHGWTDSTKGSYAKEWAEWEKQLRETLFGNAEYLNAIQVPF 1020

Query: 1021 EFAVQDVLEQLKKISKGDFKSPITERRKSGAIVFAAVSLPVQEIQNLLGTLGKKNPRVEA 1080
            EF+VQ+VLEQLKKISKGD+KSPITE RKS  IV+AAVSLPVQEIQN L TLG KNP+VEA
Sbjct: 1021 EFSVQNVLEQLKKISKGDYKSPITE-RKSATIVYAAVSLPVQEIQNALDTLGNKNPQVEA 1080

Query: 1081 FLKEHFKDYTLKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSDNMAAFEARL 1140
            F+KE +KDYTLK AHVTLAHKRSHG+K VADYGIFENKEVPVELTALLFSD MAAFEAR+
Sbjct: 1081 FIKEGYKDYTLKSAHVTLAHKRSHGIKAVADYGIFENKEVPVELTALLFSDKMAAFEARV 1140

Query: 1141 GSIENERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPIIISGMVRFF 1195
            GSIE+ERVISKNEWPHVTLWTREG+AAKEAN+LPQLVSEGKATL+E+NPPIIISG V+FF
Sbjct: 1141 GSIEDERVISKNEWPHVTLWTREGIAAKEANSLPQLVSEGKATLLELNPPIIISGKVQFF 1198

BLAST of Clc03G02590 vs. TAIR 10
Match: AT1G07910.1 (RNAligase)

HSP 1 Score: 1538.1 bits (3981), Expect = 0.0e+00
Identity = 770/1120 (68.75%), Positives = 904/1120 (80.71%), Query Frame = 0

Query: 78   AKVDRNSTESEAAADVVTNALGKLRVTESDQPHVLTSSAQFGNAQLTNQATPGLAHRAIW 137
            A  +   + +   A+ V N  G L + ES+    +  S    N ++ N          +W
Sbjct: 3    APFESGDSSATVVAEAVNNQFGGLSLKESNTNAPVLPSQTTSNHRVQN---------LVW 62

Query: 138  KPKAYGTTSGAAVVEGEKEPTNGTSTENKGSKAELAAQNGAVSLSQLFKGNQIEKFTVDN 197
            KPK+YGT SG++      E    ++    GS  +       ++LS++F GN +EKF+VD 
Sbjct: 63   KPKSYGTVSGSS---SATEVGKTSAVSQIGSSGDTKV---GLNLSKIFGGNLLEKFSVDK 122

Query: 198  STYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEVSLKHSGSLFMYAGHEGGA 257
            STY  AQIRATFYPKFENEK+DQEIRTRMIEMVSKGLATLEVSLKHSGSLFMYAGH+GGA
Sbjct: 123  STYCHAQIRATFYPKFENEKTDQEIRTRMIEMVSKGLATLEVSLKHSGSLFMYAGHKGGA 182

Query: 258  YAKNSFGNIYTAVGVFVLGRMFREAWGAGAAKKQAEFNDFLESNRMCISMELVTAVLGDH 317
            YAKNSFGNIYTAVGVFVL RMFREAWG  A KK+AEFNDFLE NRMCISMELVTAVLGDH
Sbjct: 183  YAKNSFGNIYTAVGVFVLSRMFREAWGTKAPKKEAEFNDFLEKNRMCISMELVTAVLGDH 242

Query: 318  GQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRKWRLPTNHVWLFSSRKSVTSFFAAF 377
            GQRP +DYVVVTAVTELG GKP+FYST+EII+FCRKWRLPTNHVWLFS+RKSVTSFFAAF
Sbjct: 243  GQRPLDDYVVVTAVTELGNGKPQFYSTSEIISFCRKWRLPTNHVWLFSTRKSVTSFFAAF 302

Query: 378  DALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLVARMVSHESSKHMEKVLEE 437
            DALCEEG ATSVC+ALDEVA+ISVP SKDH+KVQGEILEGLVAR+VS +SS+ ME VL +
Sbjct: 303  DALCEEGIATSVCRALDEVADISVPASKDHVKVQGEILEGLVARIVSSQSSRDMENVLRD 362

Query: 438  FPALPDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGTAFCPDHSDWYGD-SYSRN 497
             P  P  +G  LDLG SLREICAA+RS+EKQQ++ALL++VG +FCP   +W+GD S+ ++
Sbjct: 363  HPP-PPCDGANLDLGLSLREICAAHRSNEKQQMRALLRSVGPSFCPSDVEWFGDESHPKS 422

Query: 498  ADRSVLSKFLQANPADFSTFKLQEMIRLMREKRLPAAFKCYHNFHKVGSISNDNLFYKMV 557
            AD+SV++KFLQ+ PAD+ST KLQEM+RLM+EKRLPAAFKCYHNFH+   IS DNLFYK+V
Sbjct: 423  ADKSVITKFLQSQPADYSTSKLQEMVRLMKEKRLPAAFKCYHNFHRAEDISPDNLFYKLV 482

Query: 558  IHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKDKAAELVKSKSNLMEIEGNG 617
            +HVHSDS FRRY KEMRH P LWPLYRGFFVDINLFK NK +    +KS  N  E +G G
Sbjct: 483  VHVHSDSGFRRYHKEMRHMPSLWPLYRGFFVDINLFKSNKGRDLMALKSIDNASENDGRG 542

Query: 618  TLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFKEGPAAYKAYYLRQMKLWGTSA 677
               +DG AD+DANLMIK+KFLTYKLRTFLIRNGLSILFK+G AAYK YYLRQMK+WGTS 
Sbjct: 543  E--KDGLADDDANLMIKMKFLTYKLRTFLIRNGLSILFKDGAAAYKTYYLRQMKIWGTSD 602

Query: 678  GKQRELSKMLDEWAVYLRRKYGNKQLSSATYLSEAEPFLEQYAKRSPQNQALIGSAGNLV 737
            GKQ+EL KMLDEWA Y+RRK GN QLSS+TYLSEAEPFLEQYAKRSP+N  LIGSAGNLV
Sbjct: 603  GKQKELCKMLDEWAAYIRRKCGNDQLSSSTYLSEAEPFLEQYAKRSPKNHILIGSAGNLV 662

Query: 738  RAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKAEGLIVFFPGIPGCAKSALC 797
            R EDFLAIV+  +DEEGDL K+Q   P++P  + K+AV K EGLIVFFPGIPG AKSALC
Sbjct: 663  RTEDFLAIVDGDLDEEGDLVKKQGVTPATPEPAVKEAVQKDEGLIVFFPGIPGSAKSALC 722

Query: 798  REILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRRKPYSIMLADKNAPNEEVWRQIE 857
            +E+LNAPG  GDDRPV+TLMGDL+KG+YW KVADERR+KP SIMLADKNAPNE+VWRQIE
Sbjct: 723  KELLNAPGGFGDDRPVHTLMGDLVKGKYWPKVADERRKKPQSIMLADKNAPNEDVWRQIE 782

Query: 858  DMCCSTRASAVPVVPDSEGTDSNPFSLDALAVFMFRVLQRVNHPGNLDKASPNAGYVLLM 917
            DMC  TRASAVP+V DSEGTD+NP+SLDALAVFMFRVLQRVNHPG LDK S NAGYVLLM
Sbjct: 783  DMCRRTRASAVPIVADSEGTDTNPYSLDALAVFMFRVLQRVNHPGKLDKESSNAGYVLLM 842

Query: 918  FYHLYEGKSRREFEGELIDRFGSLAKMPLLKSDRNPLPDNLKTILEEGLSLYKLHTSRHG 977
            FYHLYEGK+R EFE ELI+RFGSL KMPLLKSDR PLPD +K++LEEG+ L+ LH+ RHG
Sbjct: 843  FYHLYEGKNRNEFESELIERFGSLIKMPLLKSDRTPLPDPVKSVLEEGIDLFNLHSRRHG 902

Query: 978  RVDSTKGSYAKEWAKWEKKLRETLFGNTEYLNAIQVPFEFAVQDVLEQLKKISKGDFKSP 1037
            R++STKG+YA EW KWEK+LR+TL  N+EYL++IQVPFE  V  V E+LK I+KGD+K P
Sbjct: 903  RLESTKGTYAAEWTKWEKQLRDTLVANSEYLSSIQVPFESMVHQVREELKTIAKGDYKPP 962

Query: 1038 ITERRKSGAIVFAAVSLPVQEIQNLLGTLGKKNPRVEAFLKEHFKDY--TLKGAHVTLAH 1097
             +E+RK G+IVFAA++LP  ++ +LL  L   NP + +FL+   K     L+ +HVTLAH
Sbjct: 963  SSEKRKHGSIVFAAINLPATQVHSLLEKLAAANPTMRSFLEGKKKSIQEKLERSHVTLAH 1022

Query: 1098 KRSHGVKGVADYGIFENKEVPVELTALLFSDNMAAFEARLGSIENERVISKNEWPHVTLW 1157
            KRSHGV  VA Y    N+EVPVELT L+++D MAA  A +GS++ E V+SKNEWPHVTLW
Sbjct: 1023 KRSHGVATVASYSQHLNREVPVELTELIYNDKMAALTAHVGSVDGETVVSKNEWPHVTLW 1082

Query: 1158 TREGVAAKEANALPQLVSEGKATLVEINPPIIISGMVRFF 1195
            T EGV AKEAN LPQL  EGKA+ + I+PP+ ISG + FF
Sbjct: 1083 TAEGVTAKEANTLPQLYLEGKASRLVIDPPVSISGPLEFF 1104

BLAST of Clc03G02590 vs. TAIR 10
Match: AT1G07910.2 (RNAligase)

HSP 1 Score: 1538.1 bits (3981), Expect = 0.0e+00
Identity = 770/1120 (68.75%), Positives = 904/1120 (80.71%), Query Frame = 0

Query: 78   AKVDRNSTESEAAADVVTNALGKLRVTESDQPHVLTSSAQFGNAQLTNQATPGLAHRAIW 137
            A  +   + +   A+ V N  G L + ES+    +  S    N ++ N          +W
Sbjct: 3    APFESGDSSATVVAEAVNNQFGGLSLKESNTNAPVLPSQTTSNHRVQN---------LVW 62

Query: 138  KPKAYGTTSGAAVVEGEKEPTNGTSTENKGSKAELAAQNGAVSLSQLFKGNQIEKFTVDN 197
            KPK+YGT SG++      E    ++    GS  +       ++LS++F GN +EKF+VD 
Sbjct: 63   KPKSYGTVSGSS---SATEVGKTSAVSQIGSSGDTKV---GLNLSKIFGGNLLEKFSVDK 122

Query: 198  STYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEVSLKHSGSLFMYAGHEGGA 257
            STY  AQIRATFYPKFENEK+DQEIRTRMIEMVSKGLATLEVSLKHSGSLFMYAGH+GGA
Sbjct: 123  STYCHAQIRATFYPKFENEKTDQEIRTRMIEMVSKGLATLEVSLKHSGSLFMYAGHKGGA 182

Query: 258  YAKNSFGNIYTAVGVFVLGRMFREAWGAGAAKKQAEFNDFLESNRMCISMELVTAVLGDH 317
            YAKNSFGNIYTAVGVFVL RMFREAWG  A KK+AEFNDFLE NRMCISMELVTAVLGDH
Sbjct: 183  YAKNSFGNIYTAVGVFVLSRMFREAWGTKAPKKEAEFNDFLEKNRMCISMELVTAVLGDH 242

Query: 318  GQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRKWRLPTNHVWLFSSRKSVTSFFAAF 377
            GQRP +DYVVVTAVTELG GKP+FYST+EII+FCRKWRLPTNHVWLFS+RKSVTSFFAAF
Sbjct: 243  GQRPLDDYVVVTAVTELGNGKPQFYSTSEIISFCRKWRLPTNHVWLFSTRKSVTSFFAAF 302

Query: 378  DALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILEGLVARMVSHESSKHMEKVLEE 437
            DALCEEG ATSVC+ALDEVA+ISVP SKDH+KVQGEILEGLVAR+VS +SS+ ME VL +
Sbjct: 303  DALCEEGIATSVCRALDEVADISVPASKDHVKVQGEILEGLVARIVSSQSSRDMENVLRD 362

Query: 438  FPALPDNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGTAFCPDHSDWYGD-SYSRN 497
             P  P  +G  LDLG SLREICAA+RS+EKQQ++ALL++VG +FCP   +W+GD S+ ++
Sbjct: 363  HPP-PPCDGANLDLGLSLREICAAHRSNEKQQMRALLRSVGPSFCPSDVEWFGDESHPKS 422

Query: 498  ADRSVLSKFLQANPADFSTFKLQEMIRLMREKRLPAAFKCYHNFHKVGSISNDNLFYKMV 557
            AD+SV++KFLQ+ PAD+ST KLQEM+RLM+EKRLPAAFKCYHNFH+   IS DNLFYK+V
Sbjct: 423  ADKSVITKFLQSQPADYSTSKLQEMVRLMKEKRLPAAFKCYHNFHRAEDISPDNLFYKLV 482

Query: 558  IHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKDKAAELVKSKSNLMEIEGNG 617
            +HVHSDS FRRY KEMRH P LWPLYRGFFVDINLFK NK +    +KS  N  E +G G
Sbjct: 483  VHVHSDSGFRRYHKEMRHMPSLWPLYRGFFVDINLFKSNKGRDLMALKSIDNASENDGRG 542

Query: 618  TLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFKEGPAAYKAYYLRQMKLWGTSA 677
               +DG AD+DANLMIK+KFLTYKLRTFLIRNGLSILFK+G AAYK YYLRQMK+WGTS 
Sbjct: 543  E--KDGLADDDANLMIKMKFLTYKLRTFLIRNGLSILFKDGAAAYKTYYLRQMKIWGTSD 602

Query: 678  GKQRELSKMLDEWAVYLRRKYGNKQLSSATYLSEAEPFLEQYAKRSPQNQALIGSAGNLV 737
            GKQ+EL KMLDEWA Y+RRK GN QLSS+TYLSEAEPFLEQYAKRSP+N  LIGSAGNLV
Sbjct: 603  GKQKELCKMLDEWAAYIRRKCGNDQLSSSTYLSEAEPFLEQYAKRSPKNHILIGSAGNLV 662

Query: 738  RAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAVPKAEGLIVFFPGIPGCAKSALC 797
            R EDFLAIV+  +DEEGDL K+Q   P++P  + K+AV K EGLIVFFPGIPG AKSALC
Sbjct: 663  RTEDFLAIVDGDLDEEGDLVKKQGVTPATPEPAVKEAVQKDEGLIVFFPGIPGSAKSALC 722

Query: 798  REILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRRKPYSIMLADKNAPNEEVWRQIE 857
            +E+LNAPG  GDDRPV+TLMGDL+KG+YW KVADERR+KP SIMLADKNAPNE+VWRQIE
Sbjct: 723  KELLNAPGGFGDDRPVHTLMGDLVKGKYWPKVADERRKKPQSIMLADKNAPNEDVWRQIE 782

Query: 858  DMCCSTRASAVPVVPDSEGTDSNPFSLDALAVFMFRVLQRVNHPGNLDKASPNAGYVLLM 917
            DMC  TRASAVP+V DSEGTD+NP+SLDALAVFMFRVLQRVNHPG LDK S NAGYVLLM
Sbjct: 783  DMCRRTRASAVPIVADSEGTDTNPYSLDALAVFMFRVLQRVNHPGKLDKESSNAGYVLLM 842

Query: 918  FYHLYEGKSRREFEGELIDRFGSLAKMPLLKSDRNPLPDNLKTILEEGLSLYKLHTSRHG 977
            FYHLYEGK+R EFE ELI+RFGSL KMPLLKSDR PLPD +K++LEEG+ L+ LH+ RHG
Sbjct: 843  FYHLYEGKNRNEFESELIERFGSLIKMPLLKSDRTPLPDPVKSVLEEGIDLFNLHSRRHG 902

Query: 978  RVDSTKGSYAKEWAKWEKKLRETLFGNTEYLNAIQVPFEFAVQDVLEQLKKISKGDFKSP 1037
            R++STKG+YA EW KWEK+LR+TL  N+EYL++IQVPFE  V  V E+LK I+KGD+K P
Sbjct: 903  RLESTKGTYAAEWTKWEKQLRDTLVANSEYLSSIQVPFESMVHQVREELKTIAKGDYKPP 962

Query: 1038 ITERRKSGAIVFAAVSLPVQEIQNLLGTLGKKNPRVEAFLKEHFKDY--TLKGAHVTLAH 1097
             +E+RK G+IVFAA++LP  ++ +LL  L   NP + +FL+   K     L+ +HVTLAH
Sbjct: 963  SSEKRKHGSIVFAAINLPATQVHSLLEKLAAANPTMRSFLEGKKKSIQEKLERSHVTLAH 1022

Query: 1098 KRSHGVKGVADYGIFENKEVPVELTALLFSDNMAAFEARLGSIENERVISKNEWPHVTLW 1157
            KRSHGV  VA Y    N+EVPVELT L+++D MAA  A +GS++ E V+SKNEWPHVTLW
Sbjct: 1023 KRSHGVATVASYSQHLNREVPVELTELIYNDKMAALTAHVGSVDGETVVSKNEWPHVTLW 1082

Query: 1158 TREGVAAKEANALPQLVSEGKATLVEINPPIIISGMVRFF 1195
            T EGV AKEAN LPQL  EGKA+ + I+PP+ ISG + FF
Sbjct: 1083 TAEGVTAKEANTLPQLYLEGKASRLVIDPPVSISGPLEFF 1104

The following BLAST results are available for this feature:
Match Name      E-value  Identity  Description
XP_038894223.1  0.0e+00  96.00     tRNA ligase 1 isoform X1 [Benincasa hispida]
XP_004147268.2  0.0e+00  94.39     tRNA ligase 1 isoform X1 [Cucumis sativus] >KGN64758.2 hypothetical protein Csa_...
XP_008463605.1  0.0e+00  94.06     PREDICTED: uncharacterized protein LOC103501711 isoform X1 [Cucumis melo]
XP_008463612.1  0.0e+00  94.96     PREDICTED: uncharacterized protein LOC103501711 isoform X2 [Cucumis melo]
KAG7019255.1    0.0e+00  89.47     tRNA ligase 1 [Cucurbita argyrosperma subsp. argyrosperma]

Match Name      E-value  Identity  Description
Q0WL81          0.0e+00  68.75     tRNA ligase 1 OS=Arabidopsis thaliana OX=3702 GN=RNL PE=1 SV=1

Match Name      E-value  Identity  Description
A0A1S3CK49      0.0e+00  94.06     uncharacterized protein LOC103501711 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A1S3CL84      0.0e+00  94.96     uncharacterized protein LOC103501711 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...
A0A6J1HM92      0.0e+00  89.47     tRNA ligase 1 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111464863 PE=4 SV=1
A0A6J1DUP6      0.0e+00  89.34     tRNA ligase 1 OS=Momordica charantia OX=3673 GN=LOC111024537 PE=4 SV=1
A0A6J1I3R5      0.0e+00  88.42     tRNA ligase 1 OS=Cucurbita maxima OX=3661 GN=LOC111469400 PE=4 SV=1

Match Name      E-value  Identity  Description
AT1G07910.1     0.0e+00  68.75     RNAligase
AT1G07910.2     0.0e+00  68.75     RNAligase

InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR Term   IPR Description                 Source   Source Term    Source Description  Alignment
IPR015965  tRNA ligase, phosphodiesterase  PFAM     PF08302        tRNA_lig_CPD        coord: 1034..1175; e-value: 5.4E-6; score: 26.1
IPR038837  tRNA ligase 1                   PANTHER  PTHR35460      TRNA LIGASE 1       coord: 134..1177
None       No IPR available                PANTHER  PTHR35460:SF1  TRNA LIGASE 1       coord: 134..1177

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Clc03G02590.1  Clc03G02590.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006388      tRNA splicing, via endonucleolytic cleavage and ligation
molecular_function  GO:0005524      ATP binding
molecular_function  GO:0003972      RNA ligase (ATP) activity