Lag0033876 (gene) Sponge gourd (AG‐4) v1

Overview
NameLag0033876
Typegene
OrganismLuffa acutangula (Sponge gourd (AG‐4) v1)
DescriptionPHD domain-containing protein
Locationchr3: 2581747 .. 2600541 (-)
RNA-Seq ExpressionLag0033876
SyntenyLag0033876
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCTCACTGCTCTCTATCTCCATCTTGGTGTGTTATGTGTTCGAATGACAACGAGAACCCAGGCCACTTATTTGGGTGTTGCTCATTTGCTCGTTGTTACTGGTCTTATATCTTGGAAGCTTTTGAGTGGAATTTGGTCGTACCTAACAATGTGTTTGATCTCATATCGTTGATATTTATGGGTCATCCTTTCCATGGTTCAAAGAAGGTTTTGTGGTTGGCTTTTAATAGGGTGTTTTTTTGGTCTCTTTGGTGTGAAAGAAATAGTAGGATCTTCAGGGATACGACCTCCAACTTTGCTACTTTTATAGATTTGATTGTTTTTAATTCTCTTTATTGGTGTAAATGTAAACACCCGTTCTCTGATTATTTTTTTTTTTTGATAATGAACCAGCCTTTCATTGAGAAAAAAATGAAAGAATACAAGGGCATACAAAAAAACCAAGTCCCAAAAGGAAATCCCCTTAGATAAAGGGACGCCAATCTAGCAAAATAAGACCTAAAGGGTAATTACAAAATAAACGCGTCACCGACGCCCATAAAGAGACAGAATATTTAATAATGTCCCAAACATCCTCCGCATTCCTCTCCTTCCCTCTAAAAATTCTGTTATTTCTCTCCCCCCACAAAACCCAATAAACCCGTTCTCTGATTATAGTTTTTCATCTTTAATTGCTAATTGGAAAACATTTATGTAACTCACCTATTGGTGTTTGGAGTGTGAGCTCCTATTTCATTCAATCAATGACATTGTTTCCCTTCTCAAAAAAAATGATGGTCTTGTTTCTCATTAAAAAAATATATTGGGGACAATTTTTGATGACTTTTTAGTGAAGAGGATTTTGGACTCTCAAATTTAGGGGCATGTTCCAAATACATGAGTTGCTTGATGTGCGCAAACACACCCTACTGGAAGCGTATGGGTCAAGTTTTAATACAGTGTATAGTGAATACGAGTGTCGTCCTTTGGATTGGGTTTTTAGTCATTTAATTATTGCGAATACTAGTATTCGATTTTATTTAGAGATTAAAACGTGATGAGTGATGAATCAAATAACAAAAGCTAAAAAGTGGAAATTCAATGAGAGAAACTCTTAGGAAATTGACTTCATCAATTGAGATTATAGATTAATTACTTGAATGTGATTAACAGGCAATGTTCCATAAGTCAAGGAAACTAAGATTATAACTATCTCTCTTGAGCATAATTAATTCTCATGCACATAATGAATCCAATATTCCTATCTAAATTCATTAACATGCAAGGTCTTTAGTCTAATCTTTTTTACAATTTCTAAAGCGATCCTAAATCACTCGATCTTATAACTACTTCAGTCATGCAAGTTAAGATCTAATTTAAACCTCCTCTCCCGAGCAAGATTCAGAATAAACAACAATTAATCTATGGTCGGTAGATTAAAAGCATTAAGAATATAACTTGCATAAATGGAAGCAAAGCAATAAAACATTCAAGAATCAATCCATAAAACTCAAAAGCTACATCAATCCCTAAGATACAAGCTTAGCTACTCATAGTCTTGAAGTTACAATCTAAGCTTGAATTAAAATCCATGAAAACAACACAAGAGTAAGGGAAAGAAAGATAAAAACTCGAGTCTCGTAGATGTCGAACGTCCCCATCACAATCTCCATGAGCTCTCGCCTTGAACCTCGCTCCCCGCTTTTCCTCTAGCTCTCTTAAGGTGTCTTGTATTGAGTTTAGGGTAAAATCTCTCAAATTGAACTCCTTTTTTCCGTAGCAAATCGCATTCTATTTATAGGAAATATTGGACAACGTTTCGACGCTACCCTGTTGTTGCTCAAAATACATATGGAAAATCCAAGAGCATTGAGACGCTCTCAGCCCAGTGTCTCGACACTGTGCCTTCGGGAATGCTAGCTGGGTTGTTGGTTGCAGCAGCATCGAGACGCCAAAAGGGAGCGTCGAGACGCTCTACTCTCGGGTCAATAGGCCATTGCAGCTCTCTTTTCTTCTCTACTTCAAGTCCTTTGCCTCTTTTTGCTTCCCGAACACTCGATTGCTTCCATATTCAACATTTCATTCCAAAACACAATTCCTACAAAATAAACATAAAAGATTATGAAATTGGTCAAATAAATCTCTAAGACTAACAATAAAATACCGGGAAAAATAGCTCATAACGTGAGCTATCATTGCTCAAATGGATTTTTACTGAGGGTTGTTCCTTTGCAAAGATTAGAAGCTGGTTGCAATAGTTCTCAAGTTTAGAGGATTTTGGATTATCAGGAGAGAACTATCTTGTTATAACTCAACCAAATTAATCTTTTTAGGCCTCAAAGTTGGATTGCTTACCTAAGATATTAAAGTTCCCACAAGATTTCTATGCTAAACGTTCAAGAAGTTTATCAAGCTAGCAAGGACATTCGTGTTATCCCATGAAAAAAAAGGAAGAATTACGATGTCAACCCTGCTTGTGTAGTTTAGATTTAAAGGAAGGGTACACGGGAATTATCCATTAACCCAACTTTCCTTAAGATTTTGGTGACACATTCTTCTGCCTTCTTTTGCTTATAATATTTACAAGCTTTCTTCTGTACTTTATTTTTCATATTCTCATAGGCTACCAGACACGCAGAAATGGAAGCTATTGACATCCTGATTGAGGCATGGCAGAGGGATGGACTTTCAACCTCAGAAGTTGCTAATAAATTCTCGAAGTGCAAACTTTATGTTACCTGTGAACCATGTATTATGTGTGCTTCTGCCCTATCAATACTTGGTATGTTTTCTTTGCCCTTTTGTTTTCTTTTTCTCGTATCTCCATGCAGAGGTAACTGTCCGTCCAAAGAGTTTTGGTCATAGTTACCTTTCACTTAACTTCACTGAATATGGTTGACTTATTGCTGTTGGATCTTCTGTTTTAGGTATAAAGGAAGTATATTATGGTTGTGCAAACGATAAATTTGGTGGATGTGGATCTATATTGTCACTTCACTTGGGTAGCCGGGAGGCACATACAAGGTATATTTCTTTATCTACCATTATCAGATTTTAGTTTTGGGGCCAATGAACTCTTTGGTCATTTCGAACACATTGCTTCACTTCCAGATTAAATTAGATTTTTATGTAATTTAAGTCATTTACGTACTTGTAAACTATCAATTCATATGCTTCCTTCTAACTGGGTTGCTTCTTCAAGAAAAGGTGGGATAAAATACACCTACTAAGGGAAATTTCTATATTTTTCCTTTCCTCTTCTTACTTTTTTTTTTTTTTTTTTTTGGAAAAAGAAAACTTAGGACAGTAAACAAAATCAAATCAAGCTAGTAACTAAAAGAAAACAAGGGCCTATAGACAAATCCTCTCCAAACTCCTTTTTAGTTCAACAACATGTTGGGGTAGTGATATTAACCTTTGACCTTTAGGTTTAGGTTGATTGGTTGAGGGTACAATATCTTAACCAGTTATGCAACAAACGAATTGTTACCAGTAACACATTAATGATGGAAAAATAATTTATGCTACAAGTGTACAAGCAACAAGATAAATCCCTTGTTATTAACGTGGATTGATTGATGAGGGGAAGAGTAGGTTGTTAAGCATGGTACCCAAGTAGCTGGACAGTTTCTGATGTAAGCGTGAGAGTGTTGTGACATTGTATGGAGAAAGGTAGACTTTTGGTGAGGGATCTAGAATCTTGGGATAATACGAACTCTCAAAATTCTCCTAGATATGTATCACTTTGGCCTTATCAGTGAGAGCTCTATCATAACAGTTGTCATGTCATGAGTTAGAATACATTTGCTTGGCTCGTAAACTACCCCTAAAAGTTACAAGGCTCTAAATATCCGAAATCATGACTGAGACTACAATTATTACTTGAATTTCAGTTAAGTAGGAAAGAAAGCTTGAAAGCTGGAAAGGGAGTGTCTCCCGTGATCAAGCAAGTTTTGCTCTCCACGCTTATGGCTCCTTTCCTGCTTGTGTGTCCAAATCCTGCTTGTGCCCTTCTCCTATGTTAAGTGCAAAAAATGAAAATGCATATTGTTTCTAAACCAAACTTGCGGAAAGACTGAGGGGAGCAAGCAAAACTTTAATCTGTTTGCTTCTGAAGGAAACTTGCAGGTCATCCAGTGACTAATTGAATCTCTATACCCAGCACAACCTGACACTCAGATATCACCTAGTTCTCAAGGTAATGAATATAAAGTAGCAATGTTCGGTGGAGGCTTCCCAACTTTAATTGATCCAAACATCGACCTTTGGTGAAGGGGTACCACAAAGGTAGCAACTCGTGCCCTTAACACTTGTTCAATTGGCACAAGTAGTGAAAATAGTTATTAGTTACGGCCAAAGAATTCAATATCTCAGCAACGATCATTTCATTTTCCATTTCAGCCTTCGGCTAATATCATGTAGAAAGCTTTCTTTCCCATGTTACTAATACTAGTAGGTTAAGTATTAAGGGAGCCTAAATTCGTCCAACATTATTTTACATTTTGCACTATCCTCCCACACGAAGAAAAGGTTAGTTCAAATGTTCTAGATACTATGAAGTAATATAAATGATATAAACTGGAAAGATAATTACCCAAAGAAAATTAAAATTTTCTTATGGGAATTAGCCCACAAAGCCATCAATACCAATGATGTTCTGCAAAGACGCAATGTCTCTTTCGCCTGGAAGATGCCTTTTATGCAAAGGTGACAACATAAGGCCACATCTTTGCCCTTTGCCCCTTCGCTCAGAGACTTTGGTCCAGAACCAAAAAGTCTTTTGGTTGGCCCCTTGTTATCTCAGGTAGTATGGATGTTCTCCTTGCATCGGTTCTGACAGGCCACCCCTTCAAAAATATTAAAGCTCTCCTATGGATGAATATCAACCAAGCCTTTTTTTGGAATTTATGGAAAGAAAGAAATAGAAGAATCTTCCAGGACAAAGATCTCACTTTTGAAGCCTTTTTCGTCTGTTATCTTCAATGCTATATCTTGTTGTAAACTCTCTCCTATGCTAACTTCATACTCATACGCCTCCCTCATTAATAGTTGGGAGAGTCTTTTGTAATCTCTATGGATCTCGTATCCCTTTTGTAAATTTCATACATCAATGAAATTGTTTCTTATAAAAAAAAATATAAATGATATAAACTAAACTGGGTTGAAAGTGTAGATTTTTCGGAGGGATGATAATTTGAACATGCGAAGGAAGTCCTTTCACATTCTGTCTGGTCTTTTTCTTTTTTTCATTTCTGTATTACTTTTCTCTGTTGTTGCCTAAACCTCTTTAATTTGTTATAATAGTGGTAATGTGCAAGGAAAGGGGTTCAAATGCACTGCAGGAATAATGGCATCAGAAGCAGTTGCTCTTTTTCGAAGTTTTTACGAACAGGGGAATCCCAATGGTATGTGAAGTTATTTTGGTAATCTGATTTGCATGATTTGCTCGTGTATTGCAATTTTATCTTCCAAAAAGACCTGCCTATTTGTTTGGCATAATCTCTTTGTCAGATTTGTCTTTTGGATTAAGAGAGACTATGACATACCCATGCCATTTCTCCCACTTGCCCTAGTAAAATTGTAATACATCTTGGAGGTTTGATTGATAGTGGTGTTCTGTTGCAGCTCCAAAACCTCACAGGCCCCTTGTTCACCATCAGGCAGGTCAATGAGAAGCTGTTGAAATGAGGTTACTTTTGGTTATGATACCGTTTGTGTCAAAACTGACGTTAACAATGATAAGAGCTTTTGAACCAAGCGAAGGCACATGGATGAATTTCAATCCCCATTTAGTTCTTATCTCAAACAATATCAGTTTTTGAGCTCGAATATGCCAACCTCAAGGTCATCGGTATACATTGAGATTAGGATATTTCTGCCCACTTATCATTGGAACGGCTACCTGAACGGTTATATTTATATTGATTCAAACGTTCACTCATATTGTAATTAAATTCTTTGTATCTACTATGACTCAATTTTCAGCATTAACCAAAAACTATTCCTCGAATCAAATGCCTGCAAATAGATTTTGCTGGAAAATCAAATTGGTAGAGGTAGAGTAGAGATTGAAAAGTAGAGATGATGTGTGAGGAATGGGAGAAGATTGAAGAGAAAGAGAAGGCTAGAGGAAAAACTATTCTATTGTTTTCCAATACTGCTTGGTAGATGCATTCTGATTTTATTCAAATATGGCAGAGAATTTTATGACTTTCCTTGAATCTTCCTTTGAACTTGAGTGTCAGTTTCATTTATGCAAAACTATGGTGTTCCAAATCTCAAAAGAAAGACGAATAGAATGAAGCCATTGGAGAGGAAAGTTGATGGAAGTGTTACTTTTTGGCCAAGAGGCAGAGCAATTGGCCAACTGCAAAGCTTGTGCATTGTCCCATTTTACATTTCTGCGTGTCCTTCAATTAGTTGTCATATTGAATGGAAAGCAAAACAAGGTTGCCATCACTATATGAACATAAATTGATATTCAAGTGTCATAGTTTTGAGAGTTATCAAATTGTATTGATTACAATTTGCTTGAGGATGGAATTGCTTCAAGCCAAGAACAAATTGGATGATTAAATGGCATCCATCGGATTTTAGTGTTCAAGAAATTTTCGACAATGTCTTCTATAGAAAGAAAAATCTTAATGGAATTGATGTATCTCAATAATGAACATGTGGATTAGTAAACTAAATGAAGTTGTAACAATCATCAACTAAGTGTCCAACTTGAACATATAGCTTAATGATTAAGATATATTTGCTACTAAAATGTTATAAGTCCAAACTTCCATCCTACGTTAAAAAAAATAGTTTCGAGTTAAAATTTCCATTTCTTTCTATATTAATAGAGATAAATATATCAGGTAAGTTTTATAACAATATAGAAAGGAGTCATTTGTTCGACACAACTTGCATATTCAAGCCAATTCGGAATAGTTCATTGGATAAGACATCAATTACCATTTCAAAGGTTGATGGTTCAATTATTCACCCTTGCAATTATTAAATTAAAAAAAAAAAAAAAAAAAAAAAACTCGCATATTCAAAAGTCTCTTATGTAGACCAATTTGTTGATGAATCAAATATCACTTTTATGCTTCATTACCGACAATTCTAACATGTTACTGAAAATAAATTGACATATGATATATTGTCTACTTTGATGTAAGCTCATTGTCACTATTAATTGTAAAAGTATATGATTGAGAGTTAACATGGATATGTCCAAATTAAGAGACCTATCAATCAATACTTTTAAATGATTATTTTCATCACTTGAACTTTGATCTAAAATATACTGATAGACACTAAAAGAAGACCAGTACGTATTAGTTAAAATAACCTCCATCGATCTAGAAACTCTTTTTTTAACTTTTCTCAATCAATAGCATGTATTTTTTACAATCATTCATTCATAGACATACCGATCATTTTTCTTGGTTGTTCAACCTTTAGATTATGATTTTAAATAAAAAAAGCTTTATTTATTAATAAAAGAGATGCATAATTTCTTATTTGATTCAATATGTTCACCAACATGTTAGAGTGGAGATTCGAATCTCTGACCCTTTGATTAAATATATAATGTCAGCTATATTGAAAGTTGTTTATTTTCTTGATTTAGTTTGAATGATTCATTTTCAAACTTTTCATATGCTTTGTCCACCCGTACAAATACCCAAGAAATAATAGCTGTCAATACACCTATTCGGAATCATCAATTATTTTTGGTCCTGTTTTTTTCTCTTTTTCCTTGTAACACTTGTATTTTACAAACAAAACTTAGTCTAATATACAAATTGAGTGATCTACAAAGGAAAAATGAAAAGTTGTTTCTATTCTAGTGATCGTCAGGGAACCCTCTATGAATCAATTTGAGTGCAAAAACACGGACGTTGTCTTTTCTGAAATATAAAAGAAAAAGGGTAATAAAAACGCATAATTTTGAAATAGATGAATCTGGGATTCTGAGTGATAAGGAGCAGAAGGGAAGCATTTTCACAACTGGGTTGATCATTCAGAAGAACTGGCAGATTGTACGCTTCTTTTGGTTAGTTTAATCTTTCCCTCTCTACTTTGAGAATCGGGAACAAAGCGTAGCATGATGTTGTTCAACTTTTCTTGCCCATTCAATTGCAGGTTGCTTGTGTTGAAACTTCATTTGTTATTGAATGGATTATTTAGCAGTATTTGAGTATATAACTGACCAACTTAAGAGAGATTATTATAAAATGTTTTTTTATGATAGATTTTCTGGTGTGGTTGTGGTAGTCGGCGGAAACTTGAGAATTACATGCCGTAGTCTACTTTTTTCTTCCATGGGTTAGATTCAAACTAAAAGGTTATCTGAGAGGATGATATAAATTTGCACAATATTTGTTTTCCAAGTTGCTCGATAATGTGAAGTTGGATGGACGGTTATTATTTCTTGAGTATGCAAATTTAGGGTTGAATGTTCTGGAGTTGGGAAGGATAGGGACCGAATAGACAGATTTTGGTGAGTTCCATGGTGAGACTTTGTTCGTGTTTTTTTGATTTACCATCGTGTGATTGGCTATGCCAGTTTGTAGGGCTTCCAATCAGAAGGAAGATTGACACAGTAATGGGGTTATTATTCGCTTTTCTTTTCTTTTTTAAGAAAACTAAAAGAAACATTTGAGGCCCTGTCGATAGATTATAAATTTATATGCGATCCACAGTTCAATTTCTTCAGAGAGGCATGCTGTCTTTTGTCTGTCCGAGAGAATATACTGATGCTAGATAAATGATCGGATATGCTCCCCAACTCAAGCAAATCTTCTTTTAGTTCTACGTAATATCTTATGTTGTGCTTGCTCTTGAAATTCACAGATGTGCCCCCATTGTGATGAATTCTCTCATGATGGCTGCAGAAAAGCTGGATCAATCATAGAGGAAAAGAAGAACGATGGTGGCTTGCGTTGCTTAAATTTTTCAAGCTCCTTTTCCCAGATATCAACTGTTAGTACGATGCCTGAAGGTTCAAAATCTAATGTAGTATATAGGAGAAAGAAGCTTCGAGGCAATTCTGATTCCAGGTTGTTGGCTAATGGGACAGATTGTATTTCTTTGATTAGTTGCGATGGTCATTTGGTAGAAGGCGAAGAGCAAGCTGCAGCTTCTCGCCATAACCACAAGAGTGAAATTGTTGGAAGTGTTGTCCCTCCTCTTCCTGTTTACAATGGAAAAACTCATGTCTCTGAACTAGATTCAGTCGATGGTTGTACCATTGGGGAAGGACAAGGTTCTGACGAAACACTCAATAATAACCTGCAAAAAAGTTTGGAGGTTGACAGCATGAATGATAGCTGCTCCTCATCAAAGTCAAACATGGAACTTGTTTCAACTTCCTTGAAGGTTGAAGTGGATGACACAGGTGAGTGCTCCTCTTCTAGTATTCGAGTTATGGAGGATATGGCAGAGGATATATCAGGAAGAGATCTATGCATCTCTATCCTTAGAAGCCATGGGCTTCTGTCTTCTATGGCTCATGCTCCTGGGGAAGAAAGTGATTTTAGGAGTGACAATAATTGTTTTCGATTCTGCAAAACTTGTGGCTCTTCGGAATCAGTCTTGAAGATGTTAATTTGTGATCACTGTGACGATGCATTTCATGTCTCATGTGGCAATCATCGCATGAAGAAAGTGTTAAATGATGAGTGGTATTGCAATTCATGTTTGAAGAAGAAGCATAAAATTTTGAAGGAAACAATTACAAAGAAATTGGCAAACATCTCGAGTAGAAATGGATCTTCTAAGGGTGAATCAAATTCCATAGCATTAATGTTAAAGGACACAGAACCATATACAACTGGTGTTCGGATTGGCAAAGGTTTTCAAGCAGAAGTTCCAGATTGGTCTGGCCCGATTTCTGAGTATGTTATTTTCTTTAATTGCTTATTTTGCTTTCATGCAAACACAAAATGAATGGCTATGTTAGGTAAGCATTTTAGGGGAAAGGATGATTTTTCTAAATAAAAAATAAAATTTGCATATAGCCTGGTGAAAAAAGCTTAGGTCCTAAAATGATATCAGAGATATTAAGTGAAAGGTAGAAGTAGTTGTTGCCCACGGGCCATGACAATCCCTATGAATCATAATGCTTTCATGAAAGGATGGAAAATTATGGACTTGCTTTTGGAATGTTTAGCTTGGGTGAAATAGGAAAATTTTAAGAGGCATGGAGAGTTCTTGGAGTGAGGTTTGGGTCTTGGCTAGGTTCGTTGCCTATTTTATCCTTTATCTCTCATTCTTATGAGTTGGAGGCCTTTCGCGTAATTTTTGAACTTTTGTGGTTTGGTAGCCTTTTCTTTTCTTGCTTGTTTTTGTCTTGTGGCTTTCCATTTTTTGTATTGCCGTTGTATATTTTTGAAAAAGCATTGCCATTGTATATTCTTTAATTTTTCTTATTGAAAGTTAGTATTTCATTTTTAAAAAATAGACATGGTCTTGATAACAAATGAACGATTATTCTACTATCTTGTGGGGATTGTGGCTCGAGAGGAATTGTAGGATATTTGGAGGGGAAGAGAGGTCTTTGAAGGAGGTGTGGGAGGTGGTGAGATTTGATGCATCTTTATGAGCGTTTGTTAACTGATTTTTTTTTTTGTATTTATTAACATGGTATTATTCTTTTGGATTGAAGTCCTTTTTTGTAGTTTAGACTTCTTTTGTGAACTCGTTTTTTGTATGTCCCGTACATGCTTTCATTTTTCTCAATGAAAGCTCGTTCTTACCAAAAAAAAAAACTTAAGGAGGATTATTTTTTTGTTATATCATTAGTATTTAGGCCCATTTAAGTGCACCTTGAGTGTTCTCCTGGACAATTTGTCTAGCCTCAATACATTTGGTTGTTATAGGATTATAATATTCTGGATAGGTGGTTACCGTGGTGCTTGAACCCTTTCTCCAAATATATTCTTACATTTTCACTCATTCCAATGACCATTAGGTCAACCTATGGTGGATTGAAGTACTGAGGACGATAGATTTAGAAAGTAGGTGGGCTTTTAAGTATTTCATATCTCCATGAAATAGTTGTTTCTTATAAAAAAAGGTTTAAAGAACATATGGTAGAGTTGATTGAAATTTTTTGTTAGCGGCTATGAACTAGAAAAGGTATTCAATGACATATGAATGGAGGGATGCATTTATGGGATGAATTTCTCCATTTTTTTTAATGGGAGGCTGAGGGGAAGGGTAGATGCAGCAAAAGATTTGCTACTTGGAATAATTTCTTGTAATCCACCTATAGGTGTGTATCCTTATTTCATTTATCAATGAAATTATGTTTCTCTTCAAAAAAAAAAAAAAAAAAAAGATGCAGCAAAAGATTTGAGATAAATGGACCTTTGTCGTTTTTTCTCCTTTTAATTGCAGGCCATGTCTTAAGTAGTTTAATTGGCCGTTGTATGGCTGTTGCGTCGGTGGTTGGTGGGGAATGCATTGTTTTTTTTTTATAAAGAACCATGCTTTCATTGAGAAAAAAATGAAAGAAGACAAGGGCATACAAAAAAAAAAAAGCTAGCCCAAGAGGGAAAGAGAACCCATTAGATAAAGGGGCTCCAATCCAAAATGATCAGACCTAAGGAATAATTACAAAATAAGCGCGTCACCGATGCCCAAAGAGAAACATTATATTTAACAAGGGACCAAACCTCATGAGAATCCCTCTCCAACCCTCTAAACGTTCTATTGTTTCTCTCACCCCAAAGCCCCCAAATAATGGCGCAAATCCCAGCTTGCCACAAAGACTTCCCCTTCTCCCTAAAGGGAGGATGGGGGAGGAACTCCTCGATCCTTTCTCTAAGGCCTATGGGTCTGGAAAACTGCAAACCAAACAAGGCAAAGAACTCATCCCACACCGCCCTAGCAAAAGCGCAACTCCACAACATATGGTCGAGGTCTTCCTCCGCCATCCGACAGAGAAAACCCCCTGCTCCCTAAAGGGAGGATGGTGGGGAATGCATTGTTGGGACAAAGAAAACCCTATATTCCACTCACAACTTGCAGATGATACTTCACTTTTATGTAAAATTGATGATTCTAGGGTTGGCAATTTATTTGTAATTTCAAAATTATTTGAGACAGTGTTGGGTGATGATAAATTTTGAGATGTTGCAAGCAGAATGGAATTTTTGGGGGAGGTTGTAGGAGCTTAGGATGATATGCTAATGTCCTCCTTTGGCTTACTTGAGAAATGCCCTGGGTGGAGTCTTTGAAAAGGTAGATTTTGGAATCTTATTGAAGAAAGGATATTAGGGAAGTTTGGGAAGTGGAATTTTTTTTGATTGTCATAAGGTGGCTGACTAACACCATTCCAAGTTATTCTATCTAGCCTACCCACTCACTTCATGTTCATTTGTTTTGGATCCCAAGGAAGGTGGCATTGATGATATAGAAGGAAAGCCTTTCCTTTCTTTAGGAAGGAAACAAGACAAGAAGAAGCTTGAGTATTCTTGCTAATGAAATGTATCCCTCCTCTCTAAATGGCTTTAGAGATTTACCATGGAAAAGGAGGCTCTTTGGAAGAAGGTGGTGGCTGTCAAGTATGGTATCCTCAAAAGTGGTTGGTGGTCTCTCCCTATTATCACCACGTTGTCAGGAGGACATTGGATAGATATTCTCAAAGTTAAATAAGTATTGGCTTCTAGAATTGGCTATATTGTGAATAAGGTCTTAAGACCCAAATTTGGGAGGATAGCAAGAACCATGTTATCAGTCAAGAATGCAACCCCTCAAATGCTACTTATGACCTTAATTTCATAAGAAACCTCAATGATTTGGGGAAGGCTGGATTTCCCTCCTGGATTTTCTCAATGACTTTGCTGTCAAACACACTGATGATATAAGGAAGTGGTTATTTTATGTTTCTGGGATTTTGAATGTTAAATTGCTCTGCAGTTATCTTCGCCGAGAAGCCATCATATATGCAAGTGGTTGTTGATGGTTTTGGGATTTTTAATGTTAAATTGCTCCTCCGATGGCCAAGAAGACCCCATCCTTGGGAGCTCCTATAATGGGATTTTTAATGGTTAAATGCTAGCTTGTGCGGATGGGTAGGGAGAATAGAGGCTCAAAGAAAGTCAAAATCCGCATGGCTTAATTTAAAGTGAAAAAAAAAAAAAAAATCTGAATTTGCCCTCTTCAAGCTATTGCCTTATGTGTGTTGAACAATGAATCTCAAAGCTAACTCTTTTCATCCATTTCCCTTTGGCATACTAGTATCAGATTCACCTTAACAACTCTTTTGGATGGTCAGCCTTGATTCATAGAGTTATTCAACAACCCTCTTCCCTCTTTCTCAGCAGACATCTGTTTAAGGACATATCCGAAACCATTTGGCTGAATATTTGAGGAGTGTTTTCTGAACCTCTTGGAATGAACGAAACAATATAATCTTTCAGAATGAAGAAAACTCTCCCTCTTTTTTATGGGATACTAATTTTTTTTGGAGTTGCTACCTGGTGCAAACTTTATTCTTCTCTTTATATAAATAGTTTGAATTTGGTTTGTACTAGTTATCTCTTATAAAAAAGAACAAAGGGTTTCCCAGTACATTTTTACTACTATGGAAGATTCCTGGTGCTTAGGTTGATTTTTCAGGTGCTTTGTGTCTAAAAATTAGTAAAAATGTATTTATTTATTATTATTATTTTTTTGCCAATCGGGACCTTTTCTGCAAATCTTTTTGGTCCTTTGGGGGATTTCTTGCCCTCCTCTTCTTATTCCTCTTCTTCTTCTTTTTTCTTATTTTTTTTTCTTAATAATGTTTGGTTTCCCATAAGAACATATTCTCTTATGCAAAAATAGGTCACCATTGGGCCAATTCCTTGTCCCCCCTTTTCTCATGATGTATGCTATATAATCTCTCAAATTTCTCTAAAATCTTTGTTGTTAAAAGTTCTGTTGTTCCTTTCTAACTAAATCATCCAAAAAAGGCACTCTCAAGTTAAACAAGTTCCAAAGCTGAACCACCAAAAAAACCAGCTAATGAAAGATGGTCTATCATCTTTGGTCTCTTCACTTCCAAGGCAGTTCAGACAGAGCATATGGTGCGGGTTCTTCTTTTGTAGCTTGTATTGGATGTTTGTGCTTTTCTATGCTAGGGTTCACAAAAGAACTTGACCTTTTCAGGAGTTTTGTTCTCCCAAAGCTGCTTGGATAGTTGTCCCTCCAGTTTGGCTGTAGAGCTAAAAAAGAGTTGATGTTGCTGATTCAACTGTGAATCCCTCTCGAGTCTTTGGCCCAAAGCAGCCTATCCTCCTCTTGGTTTGGGGAGAAATCATCCAATCTACTAATAAGCACCATCCATTCTGCAAACTCCTCATTTTTCTTATTCCTTCTTAATTACAGTTTCCAAACAAATGAACATGATGTTCCAAACATCATGAATAGTCTTCTCTTTTGGTAAAGAAATTCTACAGAGCCTTGCAAATAATACACACAAGGGTGAGGCAAGAGGCAAGTGTCCTTCCAAAATCTTACCTATTTTCTTTCTCTATTTTGAAGAGGAAAGAGTTTTCCCATATTGTTTGCATTTAGAAACGAGAACCGAGGGTATTTGCAATGTTTTACTTTTCCTTTTCGAATATTATATTTATGATGTTTTGTAATTATTAGGTCGAGAATATCTTATCTCTGGACCTTTTGCTGTGTACTTTTTTTTATCGATACTAGTTTGTTTCCAATAATATATACCTATGTTTATCTTTTCATTGTTCTTACATGATTACCATAGTGGACCATATCAAAGTTCAAGAAAAAGAGGGTAGCTCTCTATTGAGTTTGTTCAAGTTTATACAGCTGTAACATATTTGATTGCTGAGGCAAAATATTTATCCTGCAGCTGCAAGTTCGGTGGTATTGATGGTGTCAACATATTTGATTGTAGAGGCCAGATTTTAGAACAAAACTTACCTTGATATTGAAGTGGTGCATCTAGGTTCCTCCATAGATTGTGTTTATCTTTTATTTCATACCTTTTTTGTGTAGTTTTTCTTACGTCCAAGTTAGGCTGAAGAAGTATAGGTTTTCGTTATTTTCTTTTTGCACATGCAATTGCAGAAGGTCTTGTTGATAAAAGAGTTAGTCTCGTTACTTATGCCATTTGTGAGTCTTTTCATTAGTTAAGTAAATAATTGTAAAACTACATGCTTTTCAGGTGGTTTAAGACTATGTTTAGCTACCCAAGACACTTCTTTTTGGTGTATTTTATTCTTACATTAGTTCTATTGTGAGATCAATTGAGGTGCATAGTAGCACAGGCTATCTTGAACCTGAACCCTCATAGTTATCTCAAAGAGCAAAGAGAAATGTAAAGATAACAAAAACATTGCTTACAGTAGAACATTTATTTGACATGATAAGGAGAAAGTGTGCAACGCTAGGATGCAGCATTTATTTTTAAATTCCAGGTCACATTTCTTCCTTTTCCTTAAAGCATATATTTCTTTTGGGCTGATGTAGTGATACTGATGCCATCGGTGAGCCACTGGAAATGGATCCTTCAGAATCTTTTCTTGTGCATGTAAGGCATCGATTTGCTTGTGTTAATTTTCATTTAGTAGCAGAGTTTGTAAATCAATGAAAAAATAATGACATGGCTTTCTACTTCCCGACCAACAAAATAATAGGAGCAGAGTACCAATAAATCTAGTAGACTGAGCGCTATTGGAAATTGGCTTCAATGTCAACAAGTTATAGATGGAGTGGGTGGTGTTAATGGAGTCATATGTGGCAAGTGGCGCAGGTAAGGAAAATGATATATCTTATGCTGCACTCTCAGTCTCTCTCTCTCTCTAATTAATATCCTGTTTCCATTTCGGAACTTAAACTTTGTTCTGCATGATCTTGATGCCATTACATTAACACATTCAGTTTCTGCTCAATACTTATAATATAATAGCAAATATTATTGAATGTCAATTGTGGCAGGATTAATATCCTGTTTGTTTTTGTTACAAAAGATGAAAGTTATTGTCATTTTTGTTCTTTAGGTTATTGCGTGTTCTTCTCATAGCTGATTTTTTAAATTTATTTCTTTAAATGTTTTCCAGGAAATCAACTTTATGCTCATGTTTATTCAGTTTTTGTCACCATAATGTTCCAAACTTCTAAGTTAGGGGATGAAATCCTGATCTGTTTTTTTTTTCATTGTTATCTTTGTTGTTCATGATTATGAATCATTTGTTGATTTTGTGAACATGTCAAATCGTGAATTTATTTTTCATTTTAAACTTGGCATCGTAGGGCTCCTCTTTTTGAAGTCCAAACTGATGACTGGGAATGCTTCTGCTCCATCCTCTGGGATCCAACTCATGCTGATTGTGCTGTACCTCAGGTGAAATGACCTCAATGCTGGTCTCTCTCTTCTTTTGATCTTCTGTATACAATTCTTCTGCAGACACAACGTTCTTTTAGGAATGCACGTTATTGTCAAGAGGTCAATGTGAAATCTTGGATGCTTATTCTATGGGCTTCTAACATCTCAAGATACCTACGGAAGGGGCTTTCAAAGACCTTTTATTTAGATTACTTACAACATAGTATGTTAAAAAGTCAACCTGAGAATCAAATATAGTAGGGTAAGACAATCGTTCCATAAGATTGGTTGAGGTGTGAGTAAATAAGCATCAATCCACTCTCTTATATTGAGGATTTTTTTTTTTGGGGTAAGAAACTGAACTATCAAGGAACAAATGAAAAAAATACAAGGGTATACAAAAAACAAATCCAAAAGAAAAAGGAGCCTAAAGCTAACTACAAAAAGAGACTCCAATCCCAAAGAGCAAGACTAAGACCATAATTACAAAATGGAGAAAACTTTTAATTATAACCTTCTCTTCTAATCATAAAGTCTAAAAAAGAGTCCTGAATAAACGAATTTCAAAGGTGTTCCTTTTCTTTAATGAAAGGATGACCCACCAAAATAGAGCAGACTACATTGGTTATATTGTTCAATAACTCTATGAATACTAGAACCTTTGCGTGCGTTTGTCATTTTCCATAGCTATGATACATCACTTAGCCCCATTCATTGTCTATTTAATCCTAAAAGGATAACTTGTCTTTTGTATCTTATTTATCAAAATGGTCTCACCTCCTCATCTTTATCTTGATTCTGTACTACAAAATCGTCATTTATAATTAACAGATTTTCTTTTATAGCAATTCTCCAAATTTGGCATTCTTGCAAAGATAAGAAATTTTGGGTGCACATAAAGATAGAATTTTAATGCACCAGCGTGGGGAACTCTAGATGAATAACTTTCTCTTACGTAGGGAATTATAGGGTAGTTTTGTAATTTCCTTTTTAGTTCAATACTTTTTTTTTTTAATAGAAACTGAGCTTTCGTCTAGAAAAAATGAAATAATGCAAAAGGGATATACAAAAAACACTGTCCAAAAATGGAGCCAAAACAACACCAAGTCATTTCTTTGTTCGTAAATAGTTCTTAGTTAGTTGAACTATTTCTCTATAAATAGAGGATTTGGGCATATGTATTCATTAATCATAATAACCAAATTCATTTGAGTCATATTTCACTTCAGAGAGCTTTAGGTTCCATTATACTCTCATGTGCTTTGTATCATTTGGATTGAGTCCCAAAAGATATTAATCATAGGAGATCGCAATGAATATTATTAAAGCAATTCCTTGTACAAAATTGCAGGAATTGGAGACGGGTCAAGTTTTGAAGCAGTTGAAGTACATTGAGATGGTATTGTCATTTCTCTCACGTGCTAGTCATTACACTACATGTGCTTTTCCTGTAGATTTTTTTAGTTAACCTTTCTTGTATTTTTCTCTCTTTTTCTTGTAGAAAACTTGAAAATAACAGTATTTTTTTTTTTTTACACATTGTCCACAATATCCCTTTATGGACATTGGGTCTCCTTGTTTTCTTCTCTGTAGGCTTTTAATTGCTCTCATGGTTTTCCGTTAATAATCACTAAGGTTCTGGGATCCCAAACAACAGTTAAAATAGTTCTCCATCTTAGTCTAGAAGTACTCTTCATCATTAACACTGAGTTAGGAAATTGCAATCTATGTTGCCTTCTTCAAATCAATTTTGACAACCACACCAACTTGCTGTCTTGCTTTGTATTCTCAAGTCATCAACTGTTTTATTAGCTAATAAAATCAGGTAAAATGTATTTGTTGTTAACTAAGAAAATTTGTATCCTGAAATAATAAAATGAATCACATCCTTAGGAGTTTGGTCTAGCAGCCAATAGTTTGCTATGATCTCATGAAGGCATAACCAGGTTGATTAGTCTATATTCATTCATCCTGATGGAGTTGATGTTAACTAAAAGTTATCGAAAGTGGCTCCAGTCCAGAAGAACAAGACCAAGATCATAATTACAAAAAGGTCTAGTGACATACGCTCACAAGGAAGTATCAAATCTAACTACCTCCCTTTTTTCTTTGGCTTGTTTTTTGTATGTCCTTGTTATCTTTCATCTTTTCAACGAAAGTTAAATTTCTTACCCATAAAAGCAAAAAGAAAAAAGTTATTGAAAGTGTACCTTGGAGGAGGATCCTTCTCACAAAGACTGCTTTAGAACTCAATCATCCCTCCTATCTTCTCTTTTCTCAACAAACTTCTTCTATCCCCAAGTTCCAGCTTGCAAATGATATTTGTCTTTCTACAGTTATTTGCAATAATGTGGAAATATGCAGTGTTGGAGTCGCCTACATTCATCCAATCCACCTTGGTTATTTGCATCCGTCTCATATCCTCCCTGAGAAAATTATTGATTGAAACTGGAACTGGGGATTTGAATTGA

mRNA sequence

ATGTCTCACTGCTCTCTATCTCCATCTTGGTGTGTTATGTGTTCGAATGACAACGAGAACCCAGGCCACTTATTTGGGTGTTGCTCATTTGCTCGTTGTTACTGGTCTTATATCTTGGAAGCTTTTGAGTGGAATTTGGTCGTACCTAACAATGTGTTTGATCTCATATCGTTGATATTTATGGGTCATCCTTTCCATGGTTCAAAGAAGGTTTTGTGGAAATATTGGACAACGTTTCGACGCTACCCTGTTGTTGCTCAAAATACATATGGAAAATCCAAGAGCATTGAGACGCTCTCAGCCCAGTGTCTCGACACTGTGCCTTCGGGAATGCTAGCTGGGTTGTTGGTTGCAGCAGCATCGAGACGCCAAAAGGGAGCGTCGAGACGCTCTACTCTCGGACACGCAGAAATGGAAGCTATTGACATCCTGATTGAGGCATGGCAGAGGGATGGACTTTCAACCTCAGAAGTTGCTAATAAATTCTCGAAGTGCAAACTTTATGTTACCTGTGAACCATGTATTATGTGTGCTTCTGCCCTATCAATACTTGGTATAAAGGAAGTATATTATGGTTGTGCAAACGATAAATTTGGTGGATGTGGATCTATATTGTCACTTCACTTGGGTAGCCGGGAGGCACATACAAGTGGTAATGTGCAAGGAAAGGGGTTCAAATGCACTGCAGGAATAATGGCATCAGAAGCAGTTGCTCTTTTTCGAAGTTTTTACGAACAGGGGAATCCCAATGCTCCAAAACCTCACAGGCCCCTTGTTCACCATCAGGCAGTTTCATTTATGCAAAACTATGGTGTTCCAAATCTCAAAAGAAAGACGAATAGAATGAAGCCATTGGAGAGGAAAGTTGATGGAAGTGTTACTTTTTGGCCAAGAGGCAGAGCAATTGGCCAACTGCAAAGCTTGTGCATTGTCCCATTTTACATTTCTGCGTGTCCTTCAATTAGTTGTCATATTGAATGGAAAGCAAAACAAGGAGCAGAAGGGAAGCATTTTCACAACTGGGTTGATCATTCAGAAGAACTGGCAGATTGTACGCTTCTTTTGATGTGCCCCCATTGTGATGAATTCTCTCATGATGGCTGCAGAAAAGCTGGATCAATCATAGAGGAAAAGAAGAACGATGGTGGCTTGCGTTGCTTAAATTTTTCAAGCTCCTTTTCCCAGATATCAACTGTTAGTACGATGCCTGAAGGTTCAAAATCTAATGTAGTATATAGGAGAAAGAAGCTTCGAGGCAATTCTGATTCCAGGTTGTTGGCTAATGGGACAGATTGTATTTCTTTGATTAGTTGCGATGGTCATTTGGTAGAAGGCGAAGAGCAAGCTGCAGCTTCTCGCCATAACCACAAGAGTGAAATTGTTGGAAGTGTTGTCCCTCCTCTTCCTGTTTACAATGGAAAAACTCATGTCTCTGAACTAGATTCAGTCGATGGTTGTACCATTGGGGAAGGACAAGGTTCTGACGAAACACTCAATAATAACCTGCAAAAAAGTTTGGAGGTTGACAGCATGAATGATAGCTGCTCCTCATCAAAGTCAAACATGGAACTTGTTTCAACTTCCTTGAAGGTTGAAGTGGATGACACAGGTGAGTGCTCCTCTTCTAGTATTCGAGTTATGGAGGATATGGCAGAGGATATATCAGGAAGAGATCTATGCATCTCTATCCTTAGAAGCCATGGGCTTCTGTCTTCTATGGCTCATGCTCCTGGGGAAGAAAGTGATTTTAGGAGTGACAATAATTGTTTTCGATTCTGCAAAACTTGTGGCTCTTCGGAATCAGTCTTGAAGATGTTAATTTGTGATCACTGTGACGATGCATTTCATGTCTCATGTGGCAATCATCGCATGAAGAAAGTGTTAAATGATGAGTGGTATTGCAATTCATGTTTGAAGAAGAAGCATAAAATTTTGAAGGAAACAATTACAAAGAAATTGGCAAACATCTCGAGTAGAAATGGATCTTCTAAGGGTGAATCAAATTCCATAGCATTAATGTTAAAGGACACAGAACCATATACAACTGGTGTTCGGATTGGCAAAGGTTTTCAAGCAGAAGTTCCAGATTGGTCTGGCCCGATTTCTGATGATACTGATGCCATCGGTGAGCCACTGGAAATGGATCCTTCAGAATCTTTTCTTGTGCATGAGCAGAGTACCAATAAATCTAGTAGACTGAGCGCTATTGGAAATTGGCTTCAATGTCAACAAGTTATAGATGGAGTGGGTGGTGTTAATGGAGTCATATGTGGCAAGTGGCGCAGGGCTCCTCTTTTTGAAGTCCAAACTGATGACTGGGAATGCTTCTGCTCCATCCTCTGGGATCCAACTCATGCTGATTGTGCTGTACCTCAGGAATTGGAGACGGGTCAAGTTTTGAAGCAGTTGAAGTACATTGAGATGTTATTTGCAATAATGTGGAAATATGCAGTGTTGGAGTCGCCTACATTCATCCAATCCACCTTGGTTATTTGCATCCGTCTCATATCCTCCCTGAGAAAATTATTGATTGAAACTGGAACTGGGGATTTGAATTGA

Coding sequence (CDS)

ATGTCTCACTGCTCTCTATCTCCATCTTGGTGTGTTATGTGTTCGAATGACAACGAGAACCCAGGCCACTTATTTGGGTGTTGCTCATTTGCTCGTTGTTACTGGTCTTATATCTTGGAAGCTTTTGAGTGGAATTTGGTCGTACCTAACAATGTGTTTGATCTCATATCGTTGATATTTATGGGTCATCCTTTCCATGGTTCAAAGAAGGTTTTGTGGAAATATTGGACAACGTTTCGACGCTACCCTGTTGTTGCTCAAAATACATATGGAAAATCCAAGAGCATTGAGACGCTCTCAGCCCAGTGTCTCGACACTGTGCCTTCGGGAATGCTAGCTGGGTTGTTGGTTGCAGCAGCATCGAGACGCCAAAAGGGAGCGTCGAGACGCTCTACTCTCGGACACGCAGAAATGGAAGCTATTGACATCCTGATTGAGGCATGGCAGAGGGATGGACTTTCAACCTCAGAAGTTGCTAATAAATTCTCGAAGTGCAAACTTTATGTTACCTGTGAACCATGTATTATGTGTGCTTCTGCCCTATCAATACTTGGTATAAAGGAAGTATATTATGGTTGTGCAAACGATAAATTTGGTGGATGTGGATCTATATTGTCACTTCACTTGGGTAGCCGGGAGGCACATACAAGTGGTAATGTGCAAGGAAAGGGGTTCAAATGCACTGCAGGAATAATGGCATCAGAAGCAGTTGCTCTTTTTCGAAGTTTTTACGAACAGGGGAATCCCAATGCTCCAAAACCTCACAGGCCCCTTGTTCACCATCAGGCAGTTTCATTTATGCAAAACTATGGTGTTCCAAATCTCAAAAGAAAGACGAATAGAATGAAGCCATTGGAGAGGAAAGTTGATGGAAGTGTTACTTTTTGGCCAAGAGGCAGAGCAATTGGCCAACTGCAAAGCTTGTGCATTGTCCCATTTTACATTTCTGCGTGTCCTTCAATTAGTTGTCATATTGAATGGAAAGCAAAACAAGGAGCAGAAGGGAAGCATTTTCACAACTGGGTTGATCATTCAGAAGAACTGGCAGATTGTACGCTTCTTTTGATGTGCCCCCATTGTGATGAATTCTCTCATGATGGCTGCAGAAAAGCTGGATCAATCATAGAGGAAAAGAAGAACGATGGTGGCTTGCGTTGCTTAAATTTTTCAAGCTCCTTTTCCCAGATATCAACTGTTAGTACGATGCCTGAAGGTTCAAAATCTAATGTAGTATATAGGAGAAAGAAGCTTCGAGGCAATTCTGATTCCAGGTTGTTGGCTAATGGGACAGATTGTATTTCTTTGATTAGTTGCGATGGTCATTTGGTAGAAGGCGAAGAGCAAGCTGCAGCTTCTCGCCATAACCACAAGAGTGAAATTGTTGGAAGTGTTGTCCCTCCTCTTCCTGTTTACAATGGAAAAACTCATGTCTCTGAACTAGATTCAGTCGATGGTTGTACCATTGGGGAAGGACAAGGTTCTGACGAAACACTCAATAATAACCTGCAAAAAAGTTTGGAGGTTGACAGCATGAATGATAGCTGCTCCTCATCAAAGTCAAACATGGAACTTGTTTCAACTTCCTTGAAGGTTGAAGTGGATGACACAGGTGAGTGCTCCTCTTCTAGTATTCGAGTTATGGAGGATATGGCAGAGGATATATCAGGAAGAGATCTATGCATCTCTATCCTTAGAAGCCATGGGCTTCTGTCTTCTATGGCTCATGCTCCTGGGGAAGAAAGTGATTTTAGGAGTGACAATAATTGTTTTCGATTCTGCAAAACTTGTGGCTCTTCGGAATCAGTCTTGAAGATGTTAATTTGTGATCACTGTGACGATGCATTTCATGTCTCATGTGGCAATCATCGCATGAAGAAAGTGTTAAATGATGAGTGGTATTGCAATTCATGTTTGAAGAAGAAGCATAAAATTTTGAAGGAAACAATTACAAAGAAATTGGCAAACATCTCGAGTAGAAATGGATCTTCTAAGGGTGAATCAAATTCCATAGCATTAATGTTAAAGGACACAGAACCATATACAACTGGTGTTCGGATTGGCAAAGGTTTTCAAGCAGAAGTTCCAGATTGGTCTGGCCCGATTTCTGATGATACTGATGCCATCGGTGAGCCACTGGAAATGGATCCTTCAGAATCTTTTCTTGTGCATGAGCAGAGTACCAATAAATCTAGTAGACTGAGCGCTATTGGAAATTGGCTTCAATGTCAACAAGTTATAGATGGAGTGGGTGGTGTTAATGGAGTCATATGTGGCAAGTGGCGCAGGGCTCCTCTTTTTGAAGTCCAAACTGATGACTGGGAATGCTTCTGCTCCATCCTCTGGGATCCAACTCATGCTGATTGTGCTGTACCTCAGGAATTGGAGACGGGTCAAGTTTTGAAGCAGTTGAAGTACATTGAGATGTTATTTGCAATAATGTGGAAATATGCAGTGTTGGAGTCGCCTACATTCATCCAATCCACCTTGGTTATTTGCATCCGTCTCATATCCTCCCTGAGAAAATTATTGATTGAAACTGGAACTGGGGATTTGAATTGA

Protein sequence

MSHCSLSPSWCVMCSNDNENPGHLFGCCSFARCYWSYILEAFEWNLVVPNNVFDLISLIFMGHPFHGSKKVLWKYWTTFRRYPVVAQNTYGKSKSIETLSAQCLDTVPSGMLAGLLVAAASRRQKGASRRSTLGHAEMEAIDILIEAWQRDGLSTSEVANKFSKCKLYVTCEPCIMCASALSILGIKEVYYGCANDKFGGCGSILSLHLGSREAHTSGNVQGKGFKCTAGIMASEAVALFRSFYEQGNPNAPKPHRPLVHHQAVSFMQNYGVPNLKRKTNRMKPLERKVDGSVTFWPRGRAIGQLQSLCIVPFYISACPSISCHIEWKAKQGAEGKHFHNWVDHSEELADCTLLLMCPHCDEFSHDGCRKAGSIIEEKKNDGGLRCLNFSSSFSQISTVSTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEGEEQAAASRHNHKSEIVGSVVPPLPVYNGKTHVSELDSVDGCTIGEGQGSDETLNNNLQKSLEVDSMNDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRVMEDMAEDISGRDLCISILRSHGLLSSMAHAPGEESDFRSDNNCFRFCKTCGSSESVLKMLICDHCDDAFHVSCGNHRMKKVLNDEWYCNSCLKKKHKILKETITKKLANISSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMDPSESFLVHEQSTNKSSRLSAIGNWLQCQQVIDGVGGVNGVICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEMLFAIMWKYAVLESPTFIQSTLVICIRLISSLRKLLIETGTGDLN
Homology
BLAST of Lag0033876 vs. NCBI nr
Match: KAG6587732.1 (tRNA-specific adenosine deaminase TAD2, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1121.7 bits (2900), Expect = 0.0e+00
Identity = 565/696 (81.18%), Postives = 590/696 (84.77%), Query Frame = 0

Query: 137 EMEAIDILIEAWQRDGLSTSEVANKFSKCKLYVTCEPCIMCASALSILGIKEVYYGCAND 196
           EMEAIDILIEAWQRDGLSTSEVA K SKCKLYVTCEPCIMCASALSILGI EVYYGCAND
Sbjct: 85  EMEAIDILIEAWQRDGLSTSEVAEKCSKCKLYVTCEPCIMCASALSILGINEVYYGCAND 144

Query: 197 KFGGCGSILSLHLGSREAHTSGNVQGKGFKCTAGIMASEAVALFRSFYEQGNPNAPKPHR 256
           KFGGCGSILSLHLGSREAHTSGN QG+GFKCTAGIMASEAVALFRSFYEQGNPNAP PHR
Sbjct: 145 KFGGCGSILSLHLGSREAHTSGNRQGRGFKCTAGIMASEAVALFRSFYEQGNPNAPNPHR 204

Query: 257 PLVHHQAVSFMQNYG-----VPNLKRKTNRMKPLERKVDGSVTFWPRGRAIGQLQSLCIV 316
           PLVH++A +  Q        V  L     R  PL   +     F P GR +  L S    
Sbjct: 205 PLVHNRAGAKQQAICKSLSIVCVLWLGNLRWCPLRTVIH----FLPPGRFVYILHSY--- 264

Query: 317 PFYISACPSISCHIEWKAKQ------------------GAEGKHFHNWVDHSEELADCTL 376
                ACP ISC+IEWKA+Q                   AE KHFHNWV HSEELADCTL
Sbjct: 265 -----ACPLISCYIEWKARQVSVLSRVYKWFRFEQCAREAEEKHFHNWVYHSEELADCTL 324

Query: 377 LLMCPHCDEFSHDGCRKAGSIIEEKKNDGGLRCLNFSSSFSQISTVSTMPEGSKSNVVYR 436
           LLMCPHCDEF HDGCRKAG IIEEKKNDGG RCLNF  +FSQIST+S MP GSKSNVVY+
Sbjct: 325 LLMCPHCDEFFHDGCRKAGQIIEEKKNDGGFRCLNFPRAFSQISTISMMPGGSKSNVVYK 384

Query: 437 RKKLRGNSDSRLLANGTDCISLISCDGHLVEGEEQAAASRHNHKSEIVGSVVPPLPVYNG 496
           RKKLRGNSDSRLLANGTDC SLISCDGHL+E +EQA  SRH HKSEIVG+V+PP PV  G
Sbjct: 385 RKKLRGNSDSRLLANGTDCCSLISCDGHLIEDKEQATTSRHIHKSEIVGNVIPPRPVGYG 444

Query: 497 KTHVSELDSVDGCTIGEGQGSDETLNNNLQKSLEVDSMNDSCSSSKSNMELVSTSLKVEV 556
           K  VSEL+S++GCTIGEG GSDETLNNNLQKSLEVDS+NDSCSSSKSNME VSTSLKVEV
Sbjct: 445 KAQVSELESINGCTIGEGHGSDETLNNNLQKSLEVDSINDSCSSSKSNMERVSTSLKVEV 504

Query: 557 DDTGECSSSSIRVMEDMAEDISGRDLCISILRSHGLLSSMAHAPGEESDFRSDNNCFRFC 616
           DDTGECSSSSIRVMEDM EDISGRDLCISILRS+GLLSSMAHA  +ESD RS+NNCFR C
Sbjct: 505 DDTGECSSSSIRVMEDMVEDISGRDLCISILRSNGLLSSMAHASEQESDLRSENNCFRLC 564

Query: 617 KTCGSSESVLKMLICDHCDDAFHVSCGNHRMKKVLNDEWYCNSCLKKKHKILKETITKKL 676
           KTCGSS+S LKMLICDHC+DAFHV CGNHRMKKV NDEWYCNSCLKKKHKIL E ITKKL
Sbjct: 565 KTCGSSDSALKMLICDHCEDAFHVLCGNHRMKKVSNDEWYCNSCLKKKHKILNEAITKKL 624

Query: 677 ANISSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLE 736
           ANISSRNGSSK ESNSIALML DTEPYTTGVRIGKGFQAEVPDWSGPISDDTDA GEPLE
Sbjct: 625 ANISSRNGSSKSESNSIALMLTDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDATGEPLE 684

Query: 737 MDPSESFLVHEQSTNKSSRLSAIGNWLQCQQVIDGVGGVNGVICGKWRRAPLFEVQTDDW 796
           MDPS SFL+HEQSTNK  RLSAIGNWLQCQQVIDGVGGVNGVICGKWRRAPLFEVQTDDW
Sbjct: 685 MDPSGSFLMHEQSTNKPCRLSAIGNWLQCQQVIDGVGGVNGVICGKWRRAPLFEVQTDDW 744

Query: 797 ECFCSILWDPTHADCAVPQELETGQVLKQLKYIEML 810
           ECFCSILWDP HADCAVPQELETGQVLKQLKYIEML
Sbjct: 745 ECFCSILWDPAHADCAVPQELETGQVLKQLKYIEML 768

BLAST of Lag0033876 vs. NCBI nr
Match: KAG6589690.1 (tRNA-specific adenosine deaminase TAD2, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1017.3 bits (2629), Expect = 7.7e-293
Identity = 520/694 (74.93%), Postives = 564/694 (81.27%), Query Frame = 0

Query: 116 LVAAASRRQKGASRRSTLGHAEMEAIDILIEAWQRDGLSTSEVANKFSKCKLYVTCEPCI 175
           +V A  R +   +R +T  HAEMEAIDILIEAWQRDGLSTSEVA+KFSKCKLYVTCEPCI
Sbjct: 43  MVIATGRNRTNETRNAT-RHAEMEAIDILIEAWQRDGLSTSEVADKFSKCKLYVTCEPCI 102

Query: 176 MCASALSILGIKEVYYGCANDKFGGCGSILSLHLGSREAHTSGNVQGKGFKCTAGIMASE 235
           MCASALSILGIKEVYYGCANDKFGGCGS+LSLHLGSREA TS N QG+GFKCTAGIMASE
Sbjct: 103 MCASALSILGIKEVYYGCANDKFGGCGSVLSLHLGSREAPTSSNGQGRGFKCTAGIMASE 162

Query: 236 AVALFRSFYEQGNPNAPKPHRPLVHHQAVSFMQNYGVPNLKRKTNRMKPLERKVDGSVTF 295
           AVALFRSFYEQGNPNAPKPHR L     ++          K  ++ +K    +++ S   
Sbjct: 163 AVALFRSFYEQGNPNAPKPHRLLEELLKLA----------KACSSYLKQWSFELEAST-- 222

Query: 296 WPRGRAIGQLQSLCIVPFYISACPSISCHIEWKAKQGAEGKHFHNWVDHSEELADCTLLL 355
             R  +  Q+ +  +   +I    S             E KHFHNW DHS+ELAD  LLL
Sbjct: 223 --RKYSTNQMSANRLCWKFILVVGS-----------RREAKHFHNWADHSKELADRMLLL 282

Query: 356 MCPHCDEFSHDGCRKAGSIIEEKKNDGGLRCLNFSSSFSQISTVSTMPEGSKSNVVYRRK 415
           MC HCD FSH+GCRKAG I+EE KNDG   CLN S +F QISTVS MPE SK NVVY R+
Sbjct: 283 MCRHCDGFSHNGCRKAGPIVEENKNDGDFLCLNSSRAFPQISTVSAMPEDSKYNVVYTRR 342

Query: 416 KLRGNSDSRLLANGTDCISLISCDGHLVEGEEQAAASRHNHKSEIVGSVVPPLPVYNGKT 475
           KLRGNSDSRL A  TDCISLISCDG      +QAAASRHNHK +IVG+VVP  PVY GKT
Sbjct: 343 KLRGNSDSRLWAIETDCISLISCDG------QQAAASRHNHKRKIVGNVVPSFPVYEGKT 402

Query: 476 HVSELDSVDGCTIGEGQGSDETLNNNLQKSLEVDSMNDSCSSSKSNMELVSTSLKVEVDD 535
            VSE +SVDGCTIG+G GS+  LNN LQKSLEVDS+NDSCSSSKSNMELVSTS KVEVDD
Sbjct: 403 RVSEWESVDGCTIGDGHGSERMLNNGLQKSLEVDSINDSCSSSKSNMELVSTSKKVEVDD 462

Query: 536 TGECSSSSIRVMEDMAEDISGRDLCISILRSHGLLSSMAHAPGEESDFRSDNNCFRFCKT 595
           TGECSSSSIRVMEDM EDISGRDLCISILRS+GLLSSMAHA  EESDFRS+NNCFR CKT
Sbjct: 463 TGECSSSSIRVMEDMEEDISGRDLCISILRSNGLLSSMAHASEEESDFRSENNCFRLCKT 522

Query: 596 CGSSESVLKMLICDHCDDAFHVSCGNHRMKKVLNDEWYCNSCLKKKHKILKETITKKLAN 655
           CGSS+SVLKMLICDHC+DAFHVSC NHRMKKV+NDEWYCNSCLKK HKILK+ I KKLAN
Sbjct: 523 CGSSDSVLKMLICDHCEDAFHVSCCNHRMKKVMNDEWYCNSCLKKNHKILKDVIMKKLAN 582

Query: 656 ISSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMD 715
            SSRNG SKGESNS+ALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDD DAIGE LEM 
Sbjct: 583 TSSRNGYSKGESNSVALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDNDAIGERLEMH 642

Query: 716 PSESFLVHEQSTNKSSRLSAIGNWLQCQQVIDGVGGVNGVICGKWRRAPLFEVQTDDWEC 775
           PSE  L+HE STNK  R S IGNWLQCQQV++GVGGVN  ICGKWRRAPLFEVQTD+WEC
Sbjct: 643 PSEPLLMHELSTNKPCRSSTIGNWLQCQQVMNGVGGVNSTICGKWRRAPLFEVQTDNWEC 702

Query: 776 FCSILWDPTHADCAVPQELETGQVLKQLKYIEML 810
           FCS+LWDPTHADCAVPQELET QVLKQLKYIEML
Sbjct: 703 FCSMLWDPTHADCAVPQELETVQVLKQLKYIEML 704

BLAST of Lag0033876 vs. NCBI nr
Match: KAG7023369.1 (tRNA-specific adenosine deaminase 2 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 874.8 bits (2259), Expect = 6.1e-250
Identity = 463/694 (66.71%), Postives = 498/694 (71.76%), Query Frame = 0

Query: 116 LVAAASRRQKGASRRSTLGHAEMEAIDILIEAWQRDGLSTSEVANKFSKCKLYVTCEPCI 175
           +V A  R +   +R +T  HAEMEAIDILIEAWQRDGLSTSEVA+KFSKCKLYVTCEPCI
Sbjct: 43  MVIATGRNRTNETRNAT-RHAEMEAIDILIEAWQRDGLSTSEVADKFSKCKLYVTCEPCI 102

Query: 176 MCASALSILGIKEVYYGCANDKFGGCGSILSLHLGSREAHTSGNVQGKGFKCTAGIMASE 235
           MCASALSILGIKEVYYGCANDKFGGCGS+LSLHLGSREA T                   
Sbjct: 103 MCASALSILGIKEVYYGCANDKFGGCGSVLSLHLGSREAPTR------------------ 162

Query: 236 AVALFRSFYEQGNPNAPKPHRPLVHHQAVSFMQNYGVPNLKRKTNRMKPLERKVDGSVTF 295
                                              G+P L+  T  +  + +        
Sbjct: 163 -----------------------------------GIPMLQSLTGSLVTIRQ-------- 222

Query: 296 WPRGRAIGQLQSLCIVPFYISACPSISCHIEWKAKQGAEGKHFHNWVDHSEELADCTLLL 355
                                                         V+   E        
Sbjct: 223 ----------------------------------------------VNERNE-------- 282

Query: 356 MCPHCDEFSHDGCRKAGSIIEEKKNDGGLRCLNFSSSFSQISTVSTMPEGSKSNVVYRRK 415
           MC HCD FSH+GCRKAG I+EE KNDG   CLN S +F QISTVS MPE SK NVVY R+
Sbjct: 283 MCRHCDGFSHNGCRKAGPIVEENKNDGDFLCLNSSRAFPQISTVSAMPEDSKYNVVYTRR 342

Query: 416 KLRGNSDSRLLANGTDCISLISCDGHLVEGEEQAAASRHNHKSEIVGSVVPPLPVYNGKT 475
           KLRGNSDSRL A  TDCISLISCDG      +QAAASRHNHK +IVG+VVP  PVY GKT
Sbjct: 343 KLRGNSDSRLWAIETDCISLISCDG------QQAAASRHNHKRKIVGNVVPSFPVYEGKT 402

Query: 476 HVSELDSVDGCTIGEGQGSDETLNNNLQKSLEVDSMNDSCSSSKSNMELVSTSLKVEVDD 535
            VSE +SVDGCTIG+G GS+  LNN LQKSLEVDS+NDSCSSSKSNMELVSTS KVEVDD
Sbjct: 403 RVSEWESVDGCTIGDGHGSERMLNNGLQKSLEVDSINDSCSSSKSNMELVSTSKKVEVDD 462

Query: 536 TGECSSSSIRVMEDMAEDISGRDLCISILRSHGLLSSMAHAPGEESDFRSDNNCFRFCKT 595
           TGECSSSSIRVMEDM EDISGRDLCISILRS+GLLSS AHA  EESDFRS+NNCFR CKT
Sbjct: 463 TGECSSSSIRVMEDMEEDISGRDLCISILRSNGLLSSTAHASEEESDFRSENNCFRLCKT 522

Query: 596 CGSSESVLKMLICDHCDDAFHVSCGNHRMKKVLNDEWYCNSCLKKKHKILKETITKKLAN 655
           CGSS+SVLKMLICDHC+DAFHVSC NHRMKKV+NDEWYCNSCLKK HKILK+ I KKLAN
Sbjct: 523 CGSSDSVLKMLICDHCEDAFHVSCCNHRMKKVMNDEWYCNSCLKKNHKILKDVIMKKLAN 582

Query: 656 ISSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMD 715
            SSRNG SKGESNS+ALML+DTEPYTTGVRIGKGFQAEVPDWSGPISDD DAIGE LEM 
Sbjct: 583 TSSRNGYSKGESNSVALMLEDTEPYTTGVRIGKGFQAEVPDWSGPISDDNDAIGERLEMH 614

Query: 716 PSESFLVHEQSTNKSSRLSAIGNWLQCQQVIDGVGGVNGVICGKWRRAPLFEVQTDDWEC 775
           PSE  L+HE STNK  R S IGNWLQCQQV++GVGGVN  ICGKWRRAPLFEVQTDDWEC
Sbjct: 643 PSEPLLMHELSTNKPCRSSTIGNWLQCQQVMNGVGGVNSTICGKWRRAPLFEVQTDDWEC 614

Query: 776 FCSILWDPTHADCAVPQELETGQVLKQLKYIEML 810
           FCS+LWDPTHADCAVPQELET QVLKQLKYIEML
Sbjct: 703 FCSMLWDPTHADCAVPQELETVQVLKQLKYIEML 614

BLAST of Lag0033876 vs. NCBI nr
Match: XP_038878482.1 (uncharacterized protein LOC120070708 isoform X1 [Benincasa hispida] >XP_038878483.1 uncharacterized protein LOC120070708 isoform X1 [Benincasa hispida])

HSP 1 Score: 829.7 bits (2142), Expect = 2.3e-236
Identity = 405/454 (89.21%), Postives = 424/454 (93.39%), Query Frame = 0

Query: 356 MCPHCDEFSHDGCRKAGSIIEEKKNDGGLRCLNFSSSFSQISTVSTMPEGSKSNVVYRRK 415
           MCPHCDEFS DGCRKAG IIEEKKN+GG RCLNF  +F QISTVS MPE SKSNVVYRRK
Sbjct: 1   MCPHCDEFSRDGCRKAGPIIEEKKNNGG-RCLNFPRAFPQISTVSMMPESSKSNVVYRRK 60

Query: 416 KLRGNSDSRLLANGTDCISLISCDGHLVEGEEQAAASRHNHKSEIVGSVVPPLPVYNGKT 475
           KLRGNSDSRLLANGTDCISL SCDGHL E +EQAAAS+HNHK+EI+G+ VPP PVYNGKT
Sbjct: 61  KLRGNSDSRLLANGTDCISLTSCDGHLGEDKEQAAASQHNHKNEIIGNPVPPFPVYNGKT 120

Query: 476 HVSELDSVDGCTIGEGQGSDETLNNNLQKSLEVDSMNDSCSSSKSNMELVSTSLKVEVDD 535
            VSEL+SV+GC  GEG GSDET NNNLQKSLEVDS+NDSCSSSKSNMELVSTS+KVEVDD
Sbjct: 121 QVSELESVNGCIFGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELVSTSVKVEVDD 180

Query: 536 TGECSSSSIRVMEDMAEDISGRDLCISILRSHGLLSSMAHAPGEESDFRSDNNCFRFCKT 595
           TGECSSSSI+VMEDM EDISGRDLCI ILRS+GLLSSMAHAP EESDFRSDNNCFR CKT
Sbjct: 181 TGECSSSSIQVMEDMVEDISGRDLCIVILRSNGLLSSMAHAPEEESDFRSDNNCFRLCKT 240

Query: 596 CGSSESVLKMLICDHCDDAFHVSCGNHRMKKVLNDEWYCNSCLKKKHKILKETITKKLAN 655
           CGSSESVLKMLICDHC+DAFHVSC NHRMKKV NDEWYCNSCLKKKHKILKETI+KKLAN
Sbjct: 241 CGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKILKETISKKLAN 300

Query: 656 ISSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMD 715
           ISSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDA GEPLE+D
Sbjct: 301 ISSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDATGEPLEVD 360

Query: 716 PSESFLVHEQSTNKSSRLSAIGNWLQCQQVIDGVGGVNGVICGKWRRAPLFEVQTDDWEC 775
           PSESFL+HE+STNK  RLS IGNWLQCQQVIDG+GGVNG ICGKWRRAPLFEVQTDDWEC
Sbjct: 361 PSESFLMHEESTNKPCRLSTIGNWLQCQQVIDGLGGVNGAICGKWRRAPLFEVQTDDWEC 420

Query: 776 FCSILWDPTHADCAVPQELETGQVLKQLKYIEML 810
           FCSILWDP HADCAVPQELETGQVLKQLKYIEML
Sbjct: 421 FCSILWDPAHADCAVPQELETGQVLKQLKYIEML 453

BLAST of Lag0033876 vs. NCBI nr
Match: XP_023531970.1 (uncharacterized protein LOC111794074 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 828.9 bits (2140), Expect = 3.9e-236
Identity = 402/454 (88.55%), Postives = 420/454 (92.51%), Query Frame = 0

Query: 356 MCPHCDEFSHDGCRKAGSIIEEKKNDGGLRCLNFSSSFSQISTVSTMPEGSKSNVVYRRK 415
           MCPHCDEF HDGCRKAG IIEEKKNDGGLRCLNF  +FSQIST+S MP GSKSNVVY+RK
Sbjct: 1   MCPHCDEFFHDGCRKAGQIIEEKKNDGGLRCLNFPRAFSQISTISMMPGGSKSNVVYKRK 60

Query: 416 KLRGNSDSRLLANGTDCISLISCDGHLVEGEEQAAASRHNHKSEIVGSVVPPLPVYNGKT 475
           KLRGNSDSRLLANGTDC SLISCDGHL+E +EQA  SRH HKSEIVG+V+PP PV +GK 
Sbjct: 61  KLRGNSDSRLLANGTDCCSLISCDGHLIEDKEQATTSRHIHKSEIVGNVIPPRPVCDGKA 120

Query: 476 HVSELDSVDGCTIGEGQGSDETLNNNLQKSLEVDSMNDSCSSSKSNMELVSTSLKVEVDD 535
            VSEL+S++GCTIGEG GSDETLNNNLQKSLEVDSMNDSCSSSKSNME VSTSLKVEVDD
Sbjct: 121 QVSELESINGCTIGEGHGSDETLNNNLQKSLEVDSMNDSCSSSKSNMERVSTSLKVEVDD 180

Query: 536 TGECSSSSIRVMEDMAEDISGRDLCISILRSHGLLSSMAHAPGEESDFRSDNNCFRFCKT 595
           TGECSSSSIRVMEDM EDISGRDLCISILRS+GLLSS+AHA  +ESD RS+NNCFR CKT
Sbjct: 181 TGECSSSSIRVMEDMVEDISGRDLCISILRSNGLLSSLAHASEQESDLRSENNCFRLCKT 240

Query: 596 CGSSESVLKMLICDHCDDAFHVSCGNHRMKKVLNDEWYCNSCLKKKHKILKETITKKLAN 655
           CGSS+S LKMLICDHC+DAFHV CGNHRMKKV NDEWYCNSCLKKKHKIL E ITKKLAN
Sbjct: 241 CGSSDSALKMLICDHCEDAFHVLCGNHRMKKVSNDEWYCNSCLKKKHKILNEAITKKLAN 300

Query: 656 ISSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMD 715
           ISSRNGSSK ESNSIALML DTEPYTTGVRIGKGFQAEVPDWSGPISDDTDA GEPLEMD
Sbjct: 301 ISSRNGSSKSESNSIALMLTDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDATGEPLEMD 360

Query: 716 PSESFLVHEQSTNKSSRLSAIGNWLQCQQVIDGVGGVNGVICGKWRRAPLFEVQTDDWEC 775
           PS SFL+HEQSTNK  RLSAIGNWLQCQQVIDGVGGVNGVICGKWRRAPLFEVQTDDWEC
Sbjct: 361 PSGSFLMHEQSTNKPCRLSAIGNWLQCQQVIDGVGGVNGVICGKWRRAPLFEVQTDDWEC 420

Query: 776 FCSILWDPTHADCAVPQELETGQVLKQLKYIEML 810
           FCSILWDP HADCAVPQELETGQVLKQLKYIEML
Sbjct: 421 FCSILWDPAHADCAVPQELETGQVLKQLKYIEML 454

BLAST of Lag0033876 vs. ExPASy Swiss-Prot
Match: Q6IDB6 (tRNA-specific adenosine deaminase TAD2 OS=Arabidopsis thaliana OX=3702 GN=TAD2 PE=1 SV=1)

HSP 1 Score: 211.1 bits (536), Expect = 5.0e-53
Identity = 102/143 (71.33%), Postives = 113/143 (79.02%), Query Frame = 0

Query: 120 ASRRQKGASRRSTLGHAEMEAIDILIEAWQRDGLSTSEVANKFSKCKLYVTCEPCIMCAS 179
           AS R +    R+   HAEMEAID L+  WQ+DGLS S+VA KFSKC LYVTCEPCIMCAS
Sbjct: 43  ASGRNRTNETRNATRHAEMEAIDQLVGQWQKDGLSPSQVAEKFSKCVLYVTCEPCIMCAS 102

Query: 180 ALSILGIKEVYYGCANDKFGGCGSILSLHLGSREAHTSGNVQGKGFKCTAGIMASEAVAL 239
           ALS LGIKEVYYGC NDKFGGCGSILSLHLGS EA      +GKG+KC  GIMA EAV+L
Sbjct: 103 ALSFLGIKEVYYGCPNDKFGGCGSILSLHLGSEEAQ-----RGKGYKCRGGIMAEEAVSL 162

Query: 240 FRSFYEQGNPNAPKPHRPLVHHQ 263
           F+ FYEQGNPNAPKPHRP+V  +
Sbjct: 163 FKCFYEQGNPNAPKPHRPVVQRE 180

BLAST of Lag0033876 vs. ExPASy Swiss-Prot
Match: Q5E9J7 (tRNA-specific adenosine deaminase 2 OS=Bos taurus OX=9913 GN=DEADC1 PE=2 SV=1)

HSP 1 Score: 114.0 bits (284), Expect = 8.3e-24
Identity = 73/176 (41.48%), Postives = 97/176 (55.11%), Query Frame = 0

Query: 82  YPVVAQNTYGKSKSIETLSAQCLDT--VPSG--MLAGLLVAAASRRQKGASRRSTLGHAE 141
           Y V A+ T    +    ++   LD   VP G  M+    V    R +   ++ +T  HAE
Sbjct: 15  YSVSAEETEKWMEQAMQMAKDALDNTEVPVGCLMVYNNEVVGKGRNEVNQTKNAT-RHAE 74

Query: 142 MEAIDILIEAWQRDGLSTSEVANKFSKCKLYVTCEPCIMCASALSILGIKEVYYGCANDK 201
           M AID  ++  +R G S SEV   F    LYVT EPCIMCA+AL ++ I  V YGC N++
Sbjct: 75  MVAIDQALDWCRRRGRSPSEV---FEHTVLYVTVEPCIMCAAALRLMRIPLVVYGCQNER 134

Query: 202 FGGCGSILSLHLGSREAHTSGNVQGKGFKCTAGIMASEAVALFRSFYEQGNPNAPK 254
           FGGCGS+L +      A       GK F+CT G  A EAV + ++FY+Q NPNAPK
Sbjct: 135 FGGCGSVLDI------ASADLPSTGKPFQCTPGYRAEEAVEMLKTFYKQENPNAPK 180

BLAST of Lag0033876 vs. ExPASy Swiss-Prot
Match: Q7Z6V5 (tRNA-specific adenosine deaminase 2 OS=Homo sapiens OX=9606 GN=ADAT2 PE=1 SV=1)

HSP 1 Score: 108.2 bits (269), Expect = 4.6e-22
Identity = 60/137 (43.80%), Postives = 83/137 (60.58%), Query Frame = 0

Query: 117 VAAASRRQKGASRRSTLGHAEMEAIDILIEAWQRDGLSTSEVANKFSKCKLYVTCEPCIM 176
           V    R +   ++ +T  HAEM AID +++  ++ G S SEV   F    LYVT EPCIM
Sbjct: 54  VVGKGRNEVNQTKNAT-RHAEMVAIDQVLDWCRQSGKSPSEV---FEHTVLYVTVEPCIM 113

Query: 177 CASALSILGIKEVYYGCANDKFGGCGSILSLHLGSREAHTSGNVQGKGFKCTAGIMASEA 236
           CA+AL ++ I  V YGC N++FGGCGS+L++      A       G+ F+C  G  A EA
Sbjct: 114 CAAALRLMKIPLVVYGCQNERFGGCGSVLNI------ASADLPNTGRPFQCIPGYRAEEA 173

Query: 237 VALFRSFYEQGNPNAPK 254
           V + ++FY+Q NPNAPK
Sbjct: 174 VEMLKTFYKQENPNAPK 180

BLAST of Lag0033876 vs. ExPASy Swiss-Prot
Match: Q5RIV4 (tRNA-specific adenosine deaminase 2 OS=Danio rerio OX=7955 GN=adat2 PE=2 SV=2)

HSP 1 Score: 106.7 bits (265), Expect = 1.3e-21
Identity = 58/120 (48.33%), Postives = 74/120 (61.67%), Query Frame = 0

Query: 135 HAEMEAIDILIEAWQRDGLSTSEVANKFSKCKLYVTCEPCIMCASALSILGIKEVYYGCA 194
           HAEM A+D +++ W R  L   +      +  LYVT EPCIMCA+AL +L I  V YGC 
Sbjct: 70  HAEMVALDQVLD-WCR--LREKDCKEVCEQTVLYVTVEPCIMCAAALRLLRIPFVVYGCK 129

Query: 195 NDKFGGCGSILSLHLGSREAHTSGNVQGKGFKCTAGIMASEAVALFRSFYEQGNPNAPKP 254
           N++FGGCGS+L +       HT     G  FKC AG  A EAV + ++FY+Q NPNAPKP
Sbjct: 130 NERFGGCGSVLDVS-SDHLPHT-----GTSFKCIAGYRAEEAVEMLKTFYKQENPNAPKP 180

BLAST of Lag0033876 vs. ExPASy Swiss-Prot
Match: Q6P6J0 (tRNA-specific adenosine deaminase 2 OS=Mus musculus OX=10090 GN=Adat2 PE=1 SV=1)

HSP 1 Score: 105.9 bits (263), Expect = 2.3e-21
Identity = 60/137 (43.80%), Postives = 81/137 (59.12%), Query Frame = 0

Query: 117 VAAASRRQKGASRRSTLGHAEMEAIDILIEAWQRDGLSTSEVANKFSKCKLYVTCEPCIM 176
           V    R +   ++ +T  HAEM AID +++   + G S S V   F    LYVT EPCIM
Sbjct: 54  VVGKGRNEVNQTKNAT-RHAEMVAIDQVLDWCHQHGQSPSTV---FEHTVLYVTVEPCIM 113

Query: 177 CASALSILGIKEVYYGCANDKFGGCGSILSLHLGSREAHTSGNVQGKGFKCTAGIMASEA 236
           CA+AL ++ I  V YGC N++FGGCGS+L++      A       G+ F+C  G  A EA
Sbjct: 114 CAAALRLMKIPLVVYGCQNERFGGCGSVLNI------ASADLPNTGRPFQCIPGYRAEEA 173

Query: 237 VALFRSFYEQGNPNAPK 254
           V L ++FY+Q NPNAPK
Sbjct: 174 VELLKTFYKQENPNAPK 180

BLAST of Lag0033876 vs. ExPASy TrEMBL
Match: A0A6J1F145 (uncharacterized protein LOC111441172 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111441172 PE=4 SV=1)

HSP 1 Score: 823.9 bits (2127), Expect = 6.0e-235
Identity = 400/454 (88.11%), Postives = 418/454 (92.07%), Query Frame = 0

Query: 356 MCPHCDEFSHDGCRKAGSIIEEKKNDGGLRCLNFSSSFSQISTVSTMPEGSKSNVVYRRK 415
           MCPHCDEF HDGCRKAG IIEEKKNDGG RCLNF  +FSQIST+S MP GSKSNVVY+RK
Sbjct: 1   MCPHCDEFFHDGCRKAGQIIEEKKNDGGFRCLNFPRAFSQISTISMMPGGSKSNVVYKRK 60

Query: 416 KLRGNSDSRLLANGTDCISLISCDGHLVEGEEQAAASRHNHKSEIVGSVVPPLPVYNGKT 475
           KLRGNSDSRLLANGTDC SLISCDGHL+E +EQA  S+H HKSEIVG+V+PP PV  GK 
Sbjct: 61  KLRGNSDSRLLANGTDCCSLISCDGHLIEDKEQATTSQHIHKSEIVGNVIPPRPVGYGKA 120

Query: 476 HVSELDSVDGCTIGEGQGSDETLNNNLQKSLEVDSMNDSCSSSKSNMELVSTSLKVEVDD 535
            VSEL+S++GCTIGEG GSDETLNNNLQKSLEVDS+NDSCSSSKSNME VSTSLKVEVDD
Sbjct: 121 QVSELESINGCTIGEGHGSDETLNNNLQKSLEVDSINDSCSSSKSNMERVSTSLKVEVDD 180

Query: 536 TGECSSSSIRVMEDMAEDISGRDLCISILRSHGLLSSMAHAPGEESDFRSDNNCFRFCKT 595
           TGECSSSSIRVMEDM EDISGRDLCISILRS+GLLSSMAHA  +ESD RS+NNCFR CKT
Sbjct: 181 TGECSSSSIRVMEDMVEDISGRDLCISILRSNGLLSSMAHASEQESDLRSENNCFRLCKT 240

Query: 596 CGSSESVLKMLICDHCDDAFHVSCGNHRMKKVLNDEWYCNSCLKKKHKILKETITKKLAN 655
           CGSS+S LKMLICDHC+DAFHV CGNHRMKKV NDEWYCNSCLKKKHKIL E ITKKLAN
Sbjct: 241 CGSSDSALKMLICDHCEDAFHVLCGNHRMKKVSNDEWYCNSCLKKKHKILNEAITKKLAN 300

Query: 656 ISSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMD 715
           ISSRNGSSK ESNSIALML DTEPYTTGVRIGKGFQAEVPDWSGPISDDTDA GEPLEMD
Sbjct: 301 ISSRNGSSKSESNSIALMLTDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDATGEPLEMD 360

Query: 716 PSESFLVHEQSTNKSSRLSAIGNWLQCQQVIDGVGGVNGVICGKWRRAPLFEVQTDDWEC 775
           PS SFL+HEQSTNK  RLSAIGNWLQCQQVIDGVGGVNGVICGKWRRAPLFEVQTDDWEC
Sbjct: 361 PSGSFLMHEQSTNKPCRLSAIGNWLQCQQVIDGVGGVNGVICGKWRRAPLFEVQTDDWEC 420

Query: 776 FCSILWDPTHADCAVPQELETGQVLKQLKYIEML 810
           FCSILWDP HADCAVPQELETGQVLKQLKYIEML
Sbjct: 421 FCSILWDPAHADCAVPQELETGQVLKQLKYIEML 454

BLAST of Lag0033876 vs. ExPASy TrEMBL
Match: A0A6J1IGX7 (uncharacterized protein LOC111472812 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111472812 PE=4 SV=1)

HSP 1 Score: 820.5 bits (2118), Expect = 6.7e-234
Identity = 399/454 (87.89%), Postives = 417/454 (91.85%), Query Frame = 0

Query: 356 MCPHCDEFSHDGCRKAGSIIEEKKNDGGLRCLNFSSSFSQISTVSTMPEGSKSNVVYRRK 415
           MCPHCDEF HDGCRKAG IIEEKKNDGGLRCLNF  +FSQIST+S MP GSKSNVVY+RK
Sbjct: 1   MCPHCDEFFHDGCRKAGQIIEEKKNDGGLRCLNFPRAFSQISTISMMPGGSKSNVVYKRK 60

Query: 416 KLRGNSDSRLLANGTDCISLISCDGHLVEGEEQAAASRHNHKSEIVGSVVPPLPVYNGKT 475
           KLRGNSDSRLLANGTDC SLISCDGHL+E +EQA  SRH HKSEIVG+V+PP PV +GK 
Sbjct: 61  KLRGNSDSRLLANGTDCCSLISCDGHLIEDKEQATTSRHIHKSEIVGNVIPPRPVCDGKA 120

Query: 476 HVSELDSVDGCTIGEGQGSDETLNNNLQKSLEVDSMNDSCSSSKSNMELVSTSLKVEVDD 535
            VS L+S++GCTIGEG GSDETLNNNLQKSLEVDS+NDSCSSSKSNME VSTSLKVEVDD
Sbjct: 121 QVSGLESINGCTIGEGHGSDETLNNNLQKSLEVDSINDSCSSSKSNMERVSTSLKVEVDD 180

Query: 536 TGECSSSSIRVMEDMAEDISGRDLCISILRSHGLLSSMAHAPGEESDFRSDNNCFRFCKT 595
           TGECSSSSIRVMEDM EDISGRDLCISILRS+GLLSSMAHA  +ESD RS+NNCFR CKT
Sbjct: 181 TGECSSSSIRVMEDMVEDISGRDLCISILRSNGLLSSMAHASEQESDLRSENNCFRLCKT 240

Query: 596 CGSSESVLKMLICDHCDDAFHVSCGNHRMKKVLNDEWYCNSCLKKKHKILKETITKKLAN 655
           CGSS+S LKMLICDHC+DAFHV CGNHRMKKV NDEWYCNSCLKKKHKIL E ITKKLAN
Sbjct: 241 CGSSDSALKMLICDHCEDAFHVLCGNHRMKKVSNDEWYCNSCLKKKHKILNEAITKKLAN 300

Query: 656 ISSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMD 715
           ISSRNGSSK ESNSIALML DTEPYTTGVRIGKGFQAEVPDWSG ISDDTDA  EPLEMD
Sbjct: 301 ISSRNGSSKSESNSIALMLTDTEPYTTGVRIGKGFQAEVPDWSGTISDDTDATAEPLEMD 360

Query: 716 PSESFLVHEQSTNKSSRLSAIGNWLQCQQVIDGVGGVNGVICGKWRRAPLFEVQTDDWEC 775
           PS SFL+HEQSTNK  RLSAIGNWLQCQQVIDGVGGVNGVICGKWRRAPLFEVQTDDWEC
Sbjct: 361 PSGSFLMHEQSTNKPCRLSAIGNWLQCQQVIDGVGGVNGVICGKWRRAPLFEVQTDDWEC 420

Query: 776 FCSILWDPTHADCAVPQELETGQVLKQLKYIEML 810
           FCSILWDP HADCAVPQELETGQVLKQLKYIEML
Sbjct: 421 FCSILWDPAHADCAVPQELETGQVLKQLKYIEML 454

BLAST of Lag0033876 vs. ExPASy TrEMBL
Match: A0A1S3BXC2 (uncharacterized protein LOC103494237 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103494237 PE=4 SV=1)

HSP 1 Score: 802.0 bits (2070), Expect = 2.5e-228
Identity = 393/454 (86.56%), Postives = 412/454 (90.75%), Query Frame = 0

Query: 356 MCPHCDEFSHDGCRKAGSIIEEKKNDGGLRCLNFSSSFSQISTVSTMPEGSKSNVVYRRK 415
           MCPHCDEFSHDGCRKAG IIEEKKN+GGLRCLNF  +F    T   M EGSKSNVVYRRK
Sbjct: 1   MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRTF---PTAIMMSEGSKSNVVYRRK 60

Query: 416 KLRGNSDSRLLANGTDCISLISCDGHLVEGEEQAAASRHNHKSEIVGSVVPPLPVYNGKT 475
           KLRG+SDSR LANGTDCISLISCDGHL E +EQAAAS+ NH+ EIVG+ VPP PV +GKT
Sbjct: 61  KLRGSSDSRFLANGTDCISLISCDGHLAEDKEQAAASQRNHEHEIVGNAVPPFPVCDGKT 120

Query: 476 HVSELDSVDGCTIGEGQGSDETLNNNLQKSLEVDSMNDSCSSSKSNMELVSTSLKVEVDD 535
            VSEL+S +GC  GEG GSDET NNNLQKSLEVDS+NDSCSSSKSNMELVSTSLKVEVDD
Sbjct: 121 QVSELESANGCIFGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDD 180

Query: 536 TGECSSSSIRVMEDMAEDISGRDLCISILRSHGLLSSMAHAPGEESDFRSDNNCFRFCKT 595
           TGECSSSSI+VMED  EDISGRDLCISILRS+GLLSSMAH P EESD RSDNNCFR CKT
Sbjct: 181 TGECSSSSIQVMEDTVEDISGRDLCISILRSNGLLSSMAHVPEEESDSRSDNNCFRLCKT 240

Query: 596 CGSSESVLKMLICDHCDDAFHVSCGNHRMKKVLNDEWYCNSCLKKKHKILKETITKKLAN 655
           CGSSESVLKMLICDHC+DAFHVSC NHRMKKV NDEWYCNSCLKKKHK+LKE I+KKL N
Sbjct: 241 CGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKVLKEAISKKLTN 300

Query: 656 ISSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMD 715
             SRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMD
Sbjct: 301 TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMD 360

Query: 716 PSESFLVHEQSTNKSSRLSAIGNWLQCQQVIDGVGGVNGVICGKWRRAPLFEVQTDDWEC 775
            SESFL+HEQSTNK+ RLS IGNWLQCQQV+DGVGG NG ICGKWRRAPLFEVQTDDWEC
Sbjct: 361 SSESFLMHEQSTNKACRLSTIGNWLQCQQVVDGVGGGNGGICGKWRRAPLFEVQTDDWEC 420

Query: 776 FCSILWDPTHADCAVPQELETGQVLKQLKYIEML 810
           FCSILWDPTHADCAVPQELETGQVLKQLKYIEML
Sbjct: 421 FCSILWDPTHADCAVPQELETGQVLKQLKYIEML 451

BLAST of Lag0033876 vs. ExPASy TrEMBL
Match: A0A1S3BWJ2 (uncharacterized protein LOC103494237 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103494237 PE=4 SV=1)

HSP 1 Score: 790.4 bits (2040), Expect = 7.4e-225
Identity = 393/473 (83.09%), Postives = 412/473 (87.10%), Query Frame = 0

Query: 356 MCPHCDEFSHDGCRKAGSIIEEKKNDGGLRCLNFSSSFSQISTVSTMPEGSKSNVVYRRK 415
           MCPHCDEFSHDGCRKAG IIEEKKN+GGLRCLNF  +F    T   M EGSKSNVVYRRK
Sbjct: 1   MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRTF---PTAIMMSEGSKSNVVYRRK 60

Query: 416 KLRGNSDSRLLANGTDCISLISCDGHLVEGEEQAAASRHNHKSEIVGSVVPPLPVYNGKT 475
           KLRG+SDSR LANGTDCISLISCDGHL E +EQAAAS+ NH+ EIVG+ VPP PV +GKT
Sbjct: 61  KLRGSSDSRFLANGTDCISLISCDGHLAEDKEQAAASQRNHEHEIVGNAVPPFPVCDGKT 120

Query: 476 HVSELDSVDGCTIGEGQGSDETLNNNLQKSLEVDSMNDSCSSSKSNMELVSTSLKVEVDD 535
            VSEL+S +GC  GEG GSDET NNNLQKSLEVDS+NDSCSSSKSNMELVSTSLKVEVDD
Sbjct: 121 QVSELESANGCIFGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDD 180

Query: 536 TGECSSSSIRVMEDMAEDISGRDLCISILRSHGLLSSMAHAPGEESDFRSDNNCFRFCKT 595
           TGECSSSSI+VMED  EDISGRDLCISILRS+GLLSSMAH P EESD RSDNNCFR CKT
Sbjct: 181 TGECSSSSIQVMEDTVEDISGRDLCISILRSNGLLSSMAHVPEEESDSRSDNNCFRLCKT 240

Query: 596 CGSSESVLKMLICDHCDDAFHVSCGNHRMKKVLNDEWYCNSCLKKKHKILKETITKKLAN 655
           CGSSESVLKMLICDHC+DAFHVSC NHRMKKV NDEWYCNSCLKKKHK+LKE I+KKL N
Sbjct: 241 CGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKVLKEAISKKLTN 300

Query: 656 ISSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMD 715
             SRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMD
Sbjct: 301 TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMD 360

Query: 716 PSESFLVHEQSTNKSSRLSAIGNWLQCQQVIDGVGGVNGVICGKWRRAPLFEVQTDDWEC 775
            SESFL+HEQSTNK+ RLS IGNWLQCQQV+DGVGG NG ICGKWRRAPLFEVQTDDWEC
Sbjct: 361 SSESFLMHEQSTNKACRLSTIGNWLQCQQVVDGVGGGNGGICGKWRRAPLFEVQTDDWEC 420

Query: 776 FCSILWDPTHADCAVPQ-------------------ELETGQVLKQLKYIEML 810
           FCSILWDPTHADCAVPQ                   ELETGQVLKQLKYIEML
Sbjct: 421 FCSILWDPTHADCAVPQKFYKFCIVAKMRNLKVHIKELETGQVLKQLKYIEML 470

BLAST of Lag0033876 vs. ExPASy TrEMBL
Match: A0A6J1C1R7 (uncharacterized protein LOC111007173 OS=Momordica charantia OX=3673 GN=LOC111007173 PE=4 SV=1)

HSP 1 Score: 786.2 bits (2029), Expect = 1.4e-223
Identity = 388/454 (85.46%), Postives = 414/454 (91.19%), Query Frame = 0

Query: 356 MCPHCDEFSHDGCRKAGSIIEEKKNDGGLRCLNFSSSFSQISTVSTMPEGSKSNVVYRRK 415
           MCPHCDEFSH GCRKAG II+EKKN+ G  CLN   + SQISTVSTMPEGS S VVYRRK
Sbjct: 1   MCPHCDEFSHGGCRKAGPIIQEKKNNDGSHCLNCPRAVSQISTVSTMPEGSNSIVVYRRK 60

Query: 416 KLRGNSDSRLLANGTDCISLISCDGHLVEGEEQAAASRHNHKSEIVGSVVPPLPVYNGKT 475
           KLRGNSDSRL ANGTDCIS ISCDG L E  EQAAAS+H  +S+IVG++VP  PVY+GKT
Sbjct: 61  KLRGNSDSRLSANGTDCISFISCDGDLGEENEQAAASQHVQESDIVGNIVPLPPVYDGKT 120

Query: 476 HVSELDSVDGCTIGEGQGSDETLNNNLQKSLEVDSMNDSCSSSKSNMELVSTSLKVEVDD 535
           HVSEL+SV+GCTIGEG GSDETLNNNLQK+LEVDS+NDSCSSSKSNMELVSTSLKVEVDD
Sbjct: 121 HVSELESVNGCTIGEGHGSDETLNNNLQKNLEVDSINDSCSSSKSNMELVSTSLKVEVDD 180

Query: 536 TGECSSSSIRVMEDMAEDISGRDLCISILRSHGLLSSMAHAPGEESDFRSDNNCFRFCKT 595
           TGECSSSSI+VMEDM EDISGRDLCISILRS+GLLS MAHAP EES+F+SD+NCFR CK 
Sbjct: 181 TGECSSSSIQVMEDMIEDISGRDLCISILRSNGLLSCMAHAPKEESNFQSDSNCFRSCKI 240

Query: 596 CGSSESVLKMLICDHCDDAFHVSCGNHRMKKVLNDEWYCNSCLKKKHKILKETITKKLAN 655
           CGSSESVLKMLICDHC+DAFH+SC NHRMKKV NDEWYCNSCLKKKHK+LKETIT KLAN
Sbjct: 241 CGSSESVLKMLICDHCEDAFHISCCNHRMKKVSNDEWYCNSCLKKKHKMLKETITNKLAN 300

Query: 656 ISSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMD 715
           ISSR+GSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPI+DDTDAIGEPLE+D
Sbjct: 301 ISSRSGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPINDDTDAIGEPLEVD 360

Query: 716 PSESFLVHEQSTNKSSRLSAIGNWLQCQQVIDGVGGVNGVICGKWRRAPLFEVQTDDWEC 775
           PSESF +HEQSTNK  RLSAIGNWLQCQQVI      NG+ICGKWRRAPLFEVQTDDWEC
Sbjct: 361 PSESFQMHEQSTNKPCRLSAIGNWLQCQQVI------NGIICGKWRRAPLFEVQTDDWEC 420

Query: 776 FCSILWDPTHADCAVPQELETGQVLKQLKYIEML 810
           FCSILWDPTHADCAVPQELET QVLKQLKYIEML
Sbjct: 421 FCSILWDPTHADCAVPQELETDQVLKQLKYIEML 448

BLAST of Lag0033876 vs. TAIR 10
Match: AT2G19260.1 (RING/FYVE/PHD zinc finger superfamily protein )

HSP 1 Score: 219.9 bits (559), Expect = 7.7e-57
Identity = 132/302 (43.71%), Postives = 161/302 (53.31%), Query Frame = 0

Query: 509 DSMNDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRVMEDMAEDISGRDLCISILRSHG 568
           D  NDSCSS KS+ E+ STS K   DD   C SS   V                      
Sbjct: 363 DGTNDSCSSLKSSSEVNSTSSKSREDD---CYSSDSGV---------------------- 422

Query: 569 LLSSMAHAPGEESDFRSDNNCFRFCKTCGSSESVLKMLICDHCDDAFHVSCGNHRMKKVL 628
                      E+D    ++ FR CK C    +V KMLICD C++A+H  C   +MK V 
Sbjct: 423 ----------SETDTDGSSSPFRQCKHCDKPGTVEKMLICDECEEAYHTRCCGVQMKDVA 482

Query: 629 N-DEWYCNSCLKKKHKILKETITKKLANISSRNGSSKGESNSIALMLKDTEPYTTGVRIG 688
             DEW C SCLK      + + TK    IS                 + T P+  G+RIG
Sbjct: 483 EIDEWLCPSCLKN-----QSSKTKTKGRISHER------------KWRVTVPFVIGIRIG 542

Query: 689 KGFQAEVPDWSGPISDDTDAIGEPLEMDPSESFLVHEQSTNKSSRLSAIGNWLQCQQVID 748
           K FQA+VPDWSGP   DT  +GEPLE+  SE     +++ N   + SA+ NWLQC++   
Sbjct: 543 KMFQADVPDWSGPTMSDTSFVGEPLEIGQSEYMHDLKKAKNSKKQCSAV-NWLQCRE--- 602

Query: 749 GVGGVNGVICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIE 808
                NGVICGKWRRAP  EVQT DWECFC   WDP+ ADCAVPQELET ++LKQLKYI+
Sbjct: 603 --EDTNGVICGKWRRAPRSEVQTKDWECFCCFSWDPSRADCAVPQELETSEILKQLKYIK 606

Query: 809 ML 810
           ML
Sbjct: 663 ML 606

BLAST of Lag0033876 vs. TAIR 10
Match: AT1G48175.1 (Cytidine/deoxycytidylate deaminase family protein )

HSP 1 Score: 211.1 bits (536), Expect = 3.6e-54
Identity = 102/143 (71.33%), Postives = 113/143 (79.02%), Query Frame = 0

Query: 120 ASRRQKGASRRSTLGHAEMEAIDILIEAWQRDGLSTSEVANKFSKCKLYVTCEPCIMCAS 179
           AS R +    R+   HAEMEAID L+  WQ+DGLS S+VA KFSKC LYVTCEPCIMCAS
Sbjct: 43  ASGRNRTNETRNATRHAEMEAIDQLVGQWQKDGLSPSQVAEKFSKCVLYVTCEPCIMCAS 102

Query: 180 ALSILGIKEVYYGCANDKFGGCGSILSLHLGSREAHTSGNVQGKGFKCTAGIMASEAVAL 239
           ALS LGIKEVYYGC NDKFGGCGSILSLHLGS EA      +GKG+KC  GIMA EAV+L
Sbjct: 103 ALSFLGIKEVYYGCPNDKFGGCGSILSLHLGSEEAQ-----RGKGYKCRGGIMAEEAVSL 162

Query: 240 FRSFYEQGNPNAPKPHRPLVHHQ 263
           F+ FYEQGNPNAPKPHRP+V  +
Sbjct: 163 FKCFYEQGNPNAPKPHRPVVQRE 180

BLAST of Lag0033876 vs. TAIR 10
Match: AT5G24330.1 (ARABIDOPSIS TRITHORAX-RELATED PROTEIN 6 )

HSP 1 Score: 53.9 bits (128), Expect = 7.3e-07
Identity = 26/70 (37.14%), Postives = 36/70 (51.43%), Query Frame = 0

Query: 577 PGEESDFRSDNNCFRFCKTCGSSESVLKMLICDHCDDAFHVSCGNHRMKKVLNDEWYCNS 636
           P   SD  SD++    C+ C S +   K+L+CD CD  FH+ C    +  V    W+C S
Sbjct: 19  PQHMSDHDSDSDWDTVCEECSSGKQPAKLLLCDKCDKGFHLFCLRPILVSVPKGSWFCPS 78

Query: 637 CLKKKHKILK 647
           C   KH+I K
Sbjct: 79  C--SKHQIPK 86

BLAST of Lag0033876 vs. TAIR 10
Match: AT1G77250.1 (RING/FYVE/PHD-type zinc finger family protein )

HSP 1 Score: 47.4 bits (111), Expect = 6.8e-05
Identity = 60/253 (23.72%), Postives = 102/253 (40.32%), Query Frame = 0

Query: 406 SKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEGEEQAAASRHNHKSEIVGSV- 465
           SK    Y+R+KL G S S    +  D  S+        E  E  +  R + ++ + G + 
Sbjct: 48  SKKIQTYKRRKL-GRSYSANKCSENDRFSMEGVSHS--ESRESVSYERFSIRTHLTGELP 107

Query: 466 VPPLPVYNGKTHVSELDSVDGC--TIGEGQGSDE--TLNNNLQKSLEVDSMNDSCSSSKS 525
            PP P        +      GC   +     S E  +LN  L ++L+   ++D  S +  
Sbjct: 108 KPPSPNKPSSESTNHETVTAGCQHVLSHVLASKEFASLNRLLSENLQGVKIDDFTSRT-- 167

Query: 526 NMELVSTSLKVEV-DDTGECSSSSIRVMEDMAEDISGRDLCISILRSHGLLSSMAH---- 585
              L+ T +K  V + +    S+ ++ +    +D+ G D+ + +  S   LS  ++    
Sbjct: 168 ---LIDTRMKEGVYEGSPLLFSTDLQEVWQKMQDV-GNDMAV-LANSLLELSRTSYKEQL 227

Query: 586 --------APGEESDFRSDNNCFRFCKTCGSSESVLKMLICDHCDDAFHVSCGNHRMKKV 641
                    P   ++   +++    CK CG        L CDHC+D +HVSC     K +
Sbjct: 228 KQFYTGESKPCPNAENIRNDSVSDICKLCGEKAEARDCLACDHCEDMYHVSCAQPGGKGM 287

BLAST of Lag0033876 vs. TAIR 10
Match: AT3G01460.1 (methyl-CPG-binding domain 9 )

HSP 1 Score: 47.0 bits (110), Expect = 8.9e-05
Identity = 16/45 (35.56%), Postives = 27/45 (60.00%), Query Frame = 0

Query: 593 CKTCGSSESVLKMLICDHCDDAFHVSCGNHRMKKVLNDEWYCNSC 638
           C  CG  ES+  +++CD C+  FH+SC N  ++   + +W C+ C
Sbjct: 86  CGACGRPESIELVVVCDACERGFHMSCVNDGVEAAPSADWMCSDC 130

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAG6587732.10.0e+0081.18tRNA-specific adenosine deaminase TAD2, partial [Cucurbita argyrosperma subsp. s... [more]
KAG6589690.17.7e-29374.93tRNA-specific adenosine deaminase TAD2, partial [Cucurbita argyrosperma subsp. s... [more]
KAG7023369.16.1e-25066.71tRNA-specific adenosine deaminase 2 [Cucurbita argyrosperma subsp. argyrosperma][more]
XP_038878482.12.3e-23689.21uncharacterized protein LOC120070708 isoform X1 [Benincasa hispida] >XP_03887848... [more]
XP_023531970.13.9e-23688.55uncharacterized protein LOC111794074 isoform X1 [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
Q6IDB65.0e-5371.33tRNA-specific adenosine deaminase TAD2 OS=Arabidopsis thaliana OX=3702 GN=TAD2 P... [more]
Q5E9J78.3e-2441.48tRNA-specific adenosine deaminase 2 OS=Bos taurus OX=9913 GN=DEADC1 PE=2 SV=1[more]
Q7Z6V54.6e-2243.80tRNA-specific adenosine deaminase 2 OS=Homo sapiens OX=9606 GN=ADAT2 PE=1 SV=1[more]
Q5RIV41.3e-2148.33tRNA-specific adenosine deaminase 2 OS=Danio rerio OX=7955 GN=adat2 PE=2 SV=2[more]
Q6P6J02.3e-2143.80tRNA-specific adenosine deaminase 2 OS=Mus musculus OX=10090 GN=Adat2 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A6J1F1456.0e-23588.11uncharacterized protein LOC111441172 isoform X1 OS=Cucurbita moschata OX=3662 GN... [more]
A0A6J1IGX76.7e-23487.89uncharacterized protein LOC111472812 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A1S3BXC22.5e-22886.56uncharacterized protein LOC103494237 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10... [more]
A0A1S3BWJ27.4e-22583.09uncharacterized protein LOC103494237 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... [more]
A0A6J1C1R71.4e-22385.46uncharacterized protein LOC111007173 OS=Momordica charantia OX=3673 GN=LOC111007... [more]
Match NameE-valueIdentityDescription
AT2G19260.17.7e-5743.71RING/FYVE/PHD zinc finger superfamily protein [more]
AT1G48175.13.6e-5471.33Cytidine/deoxycytidylate deaminase family protein [more]
AT5G24330.17.3e-0737.14ARABIDOPSIS TRITHORAX-RELATED PROTEIN 6 [more]
AT1G77250.16.8e-0523.72RING/FYVE/PHD-type zinc finger family protein [more]
AT3G01460.18.9e-0535.56methyl-CPG-binding domain 9 [more]
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001965Zinc finger, PHD-typeSMARTSM00249PHD_3coord: 592..638
e-value: 1.5E-7
score: 41.1
IPR013083Zinc finger, RING/FYVE/PHD-typeGENE3D3.30.40.10Zinc/RING finger domain, C3HC4 (zinc finger)coord: 578..663
e-value: 1.1E-12
score: 49.3
NoneNo IPR availableGENE3D3.30.40.100coord: 737..805
e-value: 2.7E-10
score: 42.2
NoneNo IPR availableGENE3D3.40.140.10Cytidine Deaminase, domain 2coord: 111..257
e-value: 3.6E-35
score: 123.1
NoneNo IPR availablePANTHERPTHR10615HISTONE ACETYLTRANSFERASEcoord: 356..808
NoneNo IPR availablePANTHERPTHR10615:SF127REMODELING AND SPACING FACTOR 1coord: 356..808
NoneNo IPR availableCDDcd01285nucleoside_deaminasecoord: 128..206
e-value: 9.38745E-20
score: 83.4352
IPR002125Cytidine and deoxycytidylate deaminase domainPFAMPF00383dCMP_cyt_deam_1coord: 132..192
e-value: 6.2E-9
score: 35.6
IPR002125Cytidine and deoxycytidylate deaminase domainPROSITEPS51747CYT_DCMP_DEAMINASES_2coord: 94..212
score: 14.269957
IPR000949ELM2 domainPFAMPF01448ELM2coord: 684..791
e-value: 3.5E-5
score: 24.5
IPR019787Zinc finger, PHD-fingerPFAMPF00628PHDcoord: 592..639
e-value: 4.8E-8
score: 32.8
IPR019787Zinc finger, PHD-fingerPROSITEPS50016ZF_PHD_2coord: 590..640
score: 9.4633
IPR016192APOBEC/CMP deaminase, zinc-bindingPROSITEPS00903CYT_DCMP_DEAMINASES_1coord: 135..181
IPR019786Zinc finger, PHD-type, conserved sitePROSITEPS01359ZF_PHD_1coord: 593..637
IPR011124Zinc finger, CW-typePROSITEPS51050ZF_CWcoord: 733..796
score: 11.35385
IPR016193Cytidine deaminase-likeSUPERFAMILY53927Cytidine deaminase-likecoord: 132..249
IPR011011Zinc finger, FYVE/PHD-typeSUPERFAMILY57903FYVE/PHD zinc fingercoord: 581..645

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lag0033876.1Lag0033876.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0046274 lignin catabolic process
cellular_component GO:0048046 apoplast
cellular_component GO:0005634 nucleus
molecular_function GO:0005507 copper ion binding
molecular_function GO:0016787 hydrolase activity
molecular_function GO:0052716 hydroquinone:oxygen oxidoreductase activity
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0003824 catalytic activity