Lsi06G007560 (gene) Bottle gourd (USVL1VR-Ls)

Name: Lsi06G007560
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description: Plastid transcriptionally active 3
Location: chr06 : 14062461 .. 14081687 (+)
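
The location string above can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(text):
    """Parse a string like 'chr06 : 14062461 .. 14081687 (+)'.

    Returns (seqid, start, end, strand). The accepted layout is an
    assumption based on the single example in this record.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of the genomic span, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("chr06 : 14062461 .. 14081687 (+)")
print(seqid, start, end, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 19227 bases on the plus strand of chr06.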
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS, three_prime_UTR
ATGACAGATTACATGAAGCCCGATACTGAGACATATAATTGGGTGATTCAAGCATATACAAGGGCTGAATCTTATGATAGGTATGTTCCTCCTGCTCATAAATTTCTTGTTATGTACCATCCACTTTGAGGTGGCGTTTAATTCTTTTGGGCGGGTGTCTCCTTCCTTTATTGTGGTATGATGGACAAAGTGGTCTTATTTCCTCCATTTCACGAGTGGATATGTGGAACTTGGTGCATTTTAATGCCTCTGCTTGACCCATGTTACAAAGCTTTTCATGATCTTTCTTTGTTTCTGATCACATGCTCTTTGTCATTGCGCTGTTTCAATTTATTTTTGGACGCTCTTCTGCTTAGTTTCAAGCCTATGCTTTTGTGTGGTCATTTTGTACTTTTATTTTTCTTGATGAAAGCAGTTCTTTTAGCTAAACATGCATCCATTTGGTAGTGATAAAAATTGGACTAACTTTCTGTTTCCTCTGTATTTTGGAGGGATTTTTTTTTAAAGAAGAAACAATTTTATTGAAAGATGATGAAATAATGGAGTCGAAATTCTGAACACCTATTGGTGAACTACAACAATGCTCTCCACTAAGAGGGGCTAAGATGAAAAAAAGCTGCTGTAATTTACACCAAAAGAAAGTAGTAGATTAAGTACTTTGGAGGGATTTTAAATCTTGATGGATTTTAAATGAATTCCTTTGTTTGTTTATCTCTTGAAATAGAATGTTTATAATAAAACATTTTGAAGAAATTTTTTGATTTAATGTTTACATATGTTATTTTATTTTTTATCAATAAATCAGGACTGAAAGGTTATGATTTATTTTCTTTTCCTTTTCGGGGACATCCACACAAAATATTTTAAATCCATTCATTCCTAATCCTTCTCATCCCACAAAATATTTTAATTATGTTAGTTTCAAATCTCATTCTAAACTAGATAGAAGACCATTGTTTGAAATCACAAAATAAAAATTCATATCTTGAACCTTCTTGATTCCAATTTCCAAATGCTTTAGAAGGAAGTTTCACATATGGTCGCGAATGTCTTTTCTGCCTTGTTCACTATTTTATGTGACGAATGATATATTTTTTATTTTGTTGAAGTGTATTTTTTTTTCTAAACAAGAAACGAAACTTTTTATTGAAGAAATGAAAACTCAACGGAGACAAACTCCTAAAGAGTGAATAAAAATAAAATAGACTATAAACAACATGTTAGGTACTTAAGCACCTTGGAGAACCACAACCCAAAAGCTAACTATTGAGGTAGGAGAGTCAAGCCACTTAAGTACCACATTGGTCATCCCATTCTAACCGATATGGGACAAAGGTAGCCCACACTACCTTGGTTCCTAACAATACCCCATCCCTAGAAAGCCGACAGTACTTCACTCGACCACACCGGTCAGAACCCTTCGTCGATTCCACTCCGGAACGTCAACAATTGACTTTGATACCATCGTTAGGTACTTAAGCACCTTAGAGAACCACACCCCAAAAGCCGAATATTGAGGTAGGAGAGCCAAGCCACTTAAGTACCATATTGATCATCCCAGCCTAACCGATGTGAGACAAAGGTAGCCCATACTATCTTGGTTCCTAACAATTGTTCATCTGCATCTACCTCCATGGCCAGAATCGAATAGCTCTGATGCCAAATTGATAGAATTCAGATATATTGAATTATAAAAAATAATGGAAATACAAGATAAACCAAGTGTTTGAGGTACTTAGACCCTCCCAATCTTGAGATCACTCTCGAGCCTAATTCACTCACCAAATATTTGTCTCCTTTGCCTATTTTCCTTATCTCTATTTATAACCAAATTCCATAACTAACTCCCCTATCTAATTACTATTATACCCTTAATATCCCTCTAACACCCTAATACTGTTCCTATCACAATAAAAAACAACAGATAGACTACAACAGTGATATGAAAGCATTCCAGTTTTGGAGGGGCCAAAGGGATGCTTTAGATGTGCTGAGTTGAG
ACAATGACACCACAGGACGATAGTTATCTTAAAAAAATCCTCTGATTTCTTTCCTTCCAGATTTTCAAAATCAAAGCTTTGACCACACTAGTCCAAAATAATTTTGTTCTAACAAGAAACAACTTCTCATTTGATATAATGAAAAGCTATAAATGTTCAAGGGCCACAAACTCTCAAAGGGAGTGAATTAAAAATACAAATAACAAATACAAGCCTAAAGAGATTATCTGTGGAAAATAAACAAATAAAGGGCCAAAAACATTTTCAAAATCTTCAACTAATAGACATAGAGTGCTTGCTTAAGAATGAAGTACCTATGAATGAGGCCTCCACCAAACACTAAAAACCGATAGTTGCCTCAAAACTTCCAGATAAAACCAAGAACCAACCTCCAATTAATCAAAAAAAACCTTTTTTTGAAAACTGAAAAGGATATGAGACTTTTTATTAAGAAAATGGAAAGAGACTAATGCTCATAGATACAAGATGAAATGAAAAAGAGCAAAACAAAGAAACAACATAGGAACATAGGGCAAAAAGTTATAAGACAAAAATACAATAAGCTTGCCGCTGCTTATAGTTGAAGTTGTTCTTTTATTTTTTCAGTTTTGTTTAGTCATGAATAGAGGATTAAGCTTTGTAAGTTTTGGATATTTTAGTTGTTCTGAGGTTTTTGTAATTTGTTATTATTGTATTGGCTGTAAGAAGACGCGGATTGGGGTGTGTTTTGCCCTTGGTTTGGTAGGATTTTTATTATTTGTTTTTGTTNCACATTTCCCTTCCCCCCTTACAAAATCTTGTATCTGTTTTCCTCATTCAAGGCCCTTCCCTCTCCACAAAAGGCCCTTCCCTCTCCACAAGCACATTGATTGCTTTCCTGCATTTGTTAAATTATGTATAATGTTCATTGTTGTAGACTGGAACTGTTTGTAGTTGATTTGTTAGATTTTGTGGCTATCACAGTTTCTTATAGGATTCATATGTTTTTTTTTTTCCAAAAACATCTAGAAACCACAAGAAGTGAATCAAAATTCATATCCACAACATCCTGCTCTTTATGAATCCAAACAACCAAACTTTTCTGGTTAATAAGAATTTTGTAGGTAGGATATGATGATGGTGCTAAAGGGGTGTCAACATAGTCGAGATGCCCGGGTGCACCTGCTGATCCTTGGATTGTTTCCTATTGTATCTTGATTTTTTGTATCTTTTCATTTTCTTAATGAAGAGGTTTTGTTTCTTGTTTAAAAAAAAAAAAAAAAAAGACAAAAATACAATAATAATTAAGAGTGTCTTAAAAAGGAAACCCAACAACGATAGATGTCTTGATTGTAGGTCCTTCAAAATATTTGGAGAGAAAGCACCACAGTGCAACCTTTAGACGAACCAAATCAAACCGAACAATAGGACTTTCGGAAGAACTTTATTTTTGGGTTTTGATCTTGGGGAGCTTAATTGCTCAAACTCTCCTAAAAGGAAGCTTCCAAGTTAATCACTCTCGGCCTCAATTTAACCTCTTTGTTCGGCGGTTATTGAAGACCCTCCACATATTTTCTTTGGATGTTGGTTCTCAATAGAATTTAGGAGGATGTTCTCCTTTTCTAATCTTCAATGGGCTTTAAATGTTGTTTTCAAGTCCAATGTTATCCAGATTTTAGTGGGTTTGTCTTATTTTTCTTGTTTTATATTGGCAAATTTGAGTTTTATGCTTTGTTTCCTCTTCTGTACTTCGAGCATTAGTCTCATTTCATTAAGTCCAAGGCCAAGACCAGTTTATAATTATAACTGAAGGACATTGTTTTTTACATTGGAATAAACCACCCAATGACATGCTGGATAATTTGAACCAGCAAGCTGATAAATATGGACACCCAAAAAAAATATGGTTGTGAGCTTCCCCTGTATGTTGTCAGAAACAGAAAATAGAGGGCATGTGGCTTGACTGAGTTAGCTTTTTTGATGAACATCCACAGTATTGAGACGACTATTCACCAAAACCC
AACAAAGAATGTCTACTCTTTTTGGACTTTTAGATTTCCACACCACTGCAAACAAAACTTTTGAGGGTAGAAGATATGTTCAACTTTCTTGTAAGAGATTCTTTATAAGTGAATAAAACTTGTATTGTGGTTATAAGAAATACTTTCATTCTTGTATTGTGGTTATAAAACTTTTAATTTCATGATGGTCACCTACCTAGGATTTAATATCCTACGGGTTTCCTTGACACCCAAATGTTGTAAGGTCAGGCGGGTTGTCCCGTGAGATTAGTCGAGGTGTGCGTAAGCTAGCCTGGACACTCACGGATATCAACAAGAAAAAAAACTTTTAGTTATGTTCTTGGAGCTTCATATTTTGTTTAGTCTTTTGTTATGTTTCCTATCTTGTATTTTGAGCATTAGACTCATTTCATTAATTCAATGAAATGTCTTGTTTTCGTTTAAAAAAAAAGAAATACAAGTTTGTTTCTTTTTTGTTCTCTTATTTATGTAGATGCTTATAATGCTTACTATTTTACTACTTATCGTCTAATTTTTTTGAGTAGGGTGCAAGATGTTGCCGAATTGCTTGGCATGATGGTTGAAGACCACAAGCGTCTACAGCCAAACATGAGAACCTATGCGTAAGTTGCCTTAGTTAATCAGGAAATTAAAGAAATTATTCTCTCATTGCTTTGTTTGAAGTCAAAGATGTATTCTTATATCCACACGCTTAAAATTATTGAATCTTGAGACTACATCGGATCACTTTTTGTACTATTAAATCAAGGACTTGTACACTGTTCTTGGAGGCTGTGATTTTTTTTTGTTTTTTTTCCTTGTGTTTGGGGGACTTGGTCAAAGAGATATAGATTTTTTTTTAAGGGCTTGAGAAGTTGTGGGGGGAGGTGTGGTCCATTACCATAGTCACCGAGTCGTTTTGAGAGTATGTTACCAAGATGTTTTGGTAATTATCCTTTAGGCCTTATCCTTCCGGATAGGAGCTCTATAGTTACTTTAGTTGCTTTGTTATTCTTTCATATCATTCTTTCATGAAAATAAAAGTTGGTTTCCAAATTATAAATTAATTAAATTAATAAAAAAAAATTAAAAAAAAAAAAACTTGGGACACCCCAATAGGTATATCATGTAAAAGTAGAAGGATGAACCTCGTTTGCAACATTTGTAAACTAAGGCATGAAGTTCTTAAGTCTGTCATGTTAAATTTCTCCCGAAGAGAAATATTTGAGTAAATTTACTAATTAAAGCTGTAATTTAAATTAATTCTAACAGTATATTTTGAATTTATCTAATTAATTTCTTCCTTCATCAGAGGAAGCAAACTGAAAATAAAATAATAAAAAAGCTCCCTAGTTGTGTACATCCTCTTGTAGATCTTTTCAATTCTAAATGAGAAATTATTGATTTCTTAAAAAAGAAAAAAAATCAACAAAAATATAAATGGAAACCAAAGAATGACCTTCCAGATAATTTTGCTTATGATAAAAAACTGGATGGCGTTTTGTCTCGATCTTGACTATCCTCTTTCTCAGTGAGCGAATAAGGTCTCTTCTCCAAGAATTGACAAAGGAGAGGTTATCACTAGAATCCATGAGATAGAAAGAAAAGGGAGAGAAGAAGATTGATGAGTCTCATTCACTGATGAGACAGTTAACACTGGAACTTTTCTATTAAGAAATTATGTATTAAAAATTTGTTTTTCTACCTGGATAGCCAGAGCTGGGAGAGCTTTATTGTTTCTTTTTAGTGGATTTTCCTTATTTTATAATTTGATTATGTTTTGTTTTCTTTGTATTTTAGAGCATTATTCTCTTTTCATTTTTTTAATGAAAAGCTCAGAATCCTTTTTCAAAAATAAGTTCGTGTATTTTTTTTTTCAGGCTCTTGGTAGAGTGTTTTACTAAGTATTGTGTTATACGAGAAGCTATTAGGCATTTCCGTGCACTAAAAACCTTTCAAGGTGGAACAAAAGCTTTGCATAATGAAGGGAATTTTGGTGA
TCCACTTTCCTTATATCTTCGAGCTTTATGTAGAGAAGGTATTTATCAATTCGTAAGTGTTAGTGATAATAAAAATTTCTTTGTTTTCTTTTAAATGATTTTGGAGCAGGAACAAGAACTTTGATGGATTTTATTGTCAGACACTTATTCATGGTTTAGTAAGGTCAATCAATTGTCTTGTGAGTTTAGTTAAGGTGCTTATAAGCTAGTCCAAACACTCACAAATATTAAAAATTGTATGTATATCCATTATTCCTAAATATAACGTTAAATATGCCAAATATTTAAGAGTCTAAAAATTGACTCGTAATGTAAATTCAATTTACTGTGAGAGAGGTCATTAAATGTTTTATTTGTAAAAGGTCAATTCATAGTTATAGAACTTCTAAGTTCTAACCAAATAGAAGGATTGTCGTATATTAATTTTCTATTGGATTTATAAAAGTACTATCAGTATGTATATATATCATGCACATGGCTTATAATTTCATAGGGTATGTAGTTTGCTTTGTCGAACATATATACACACACACACACACACACATATATATTTTGAAAAAAGAAACAACTTCTCATTGATATAATGGAAAGGATAAAATTGTAGTAGTGGTAGTACGGGTCGATCCTCAGGGAACCAATGAGTTTCTCAACAAAAACACTTATGATTTCGAAGTAACTGAAGCGGAAGGGGGGGGGGGGGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGTTGGTATGGTTTGCTTGAGAATTTAAAGATGCTTTAAATGTAAATGTGTGAAAAGTAAATAGTAGGCAAGATGAGCGATAGAGAATTGAGAAAAAAATTATCGATTAAAGGCCCTGCTTGGGTCTTTTCGAAGTTTCCCCCAAATAGTTAACTCAATCATCGAACTGGAAAGATTAGATCCTATTCATTGTCTAAGTTTAGCCAATTCGCTTAGTAATACTAAACAAAAGTCCTAATTACCTAATTAGCCAATATGATTACATGAATGTATGCTAACCATTTGAATTAACCACATTAAGATTTAGGGATTACGCTAAGATTTAGGGATTAATCAATGCTCATTGAATGGATGCCAAGCAATTCATAAGTTAACTAACCATTTGCTACTTAGTGATTTAGTGACTTAACGATTACATATTAAGTATGAATCTATTTCGATAGAGCATAAAACCTTGCTAGTTTATCAGATAATCCCAAGATAATATGATCAATAACATTCAAGCAAGAAGGAAAACCCAAAGACAAGAGTAGAAATCAATTTTGATTAAGCCAAAGTCTCACGAATTTAACATAAAAGGGAAGAAACTTCTAAACCCAAGCTAGAACACTTACCTTAGCCAAGAATCATCCTTCAAATCAATGAAGAATGATCAATGTTCATCTATACAACCCCAAATTGAGTAAAGACAGTTTGGCGAATGAAATATAGCAGAGGGAGGATGGGGAAAGTCGGAGGATCCTTAGCTAGACGACTGGCTTTGTATTTATAATAATCGAACTGGCGCTGCAGCGCTGTGGCTTTTCGAATCTTAGCGCCGCGCATAGTGTTGTGACGCTTGCCTTCCGCTTATGTGATGGCGCCACGGTGCTGTCTTCTATTTTGCCACTGTGAGGAGCACCATGCTACAGCGTCGAGGAGCGCTGTGGCGCTCTCGC
CTAGTGTGCCCTCCTGCCTTTGGTGCTGCGTGGGCCTGCCCTTAGGGGCATTTTCGTAATTTTTCTTTTCTTTTTCCGGTTGATGCTTTAGGGCTCTGTTTTAGCTCCATTTATACCCTAACGCTTGGTTCTATCTCTTTGATGCCCTAAACCTACAAATCATGAAATTAACATGTAAAATCAAGGAACAAGCCCGAGTGAGATAGCACATTTTTGGTGCTAAAGAACATTCAAGGATACAAACTCCCAAGGGGAGTGAAAAAGACAAGGAAAAATACAAATCAATAAGCACACTGATAAAGAAATATGAGCATTATGAGCTATTTCAATAACTTCACTGGAAAGACTTGGAGAGCTTGCAAACAAAAACTGAATCTGAGAAAAAAGCCTCCACCAAACTCAAATCCAAAAGCTAATACAATCTGCCAAAGGAAATCATGAACCGGACTTCAACCTCTAAGAAGAAAGGACCAAATAGCTTCTATTACCATGAATAAGCTCTCTTCCGACCGATAAAACTCCATCAAGACACTGTAAAATCCAGATTAGAATTCAAATATAAAAGAGTAAAGATTATCAATTATCAACAATGGACAAGCTCAAAATAAAGAATTTCTTTGCAGATCTTTTGACGACAATCGCAACATATCAAGACTAAATAATAACAACCTCCCTTTGATACCTTCCACATAACTAATGGACAACCGAGTGAAACCTTAAATAAAACTCCATGACCTCTAGGATTCCATGAGATAAATCATTGAAAGGATAATTGGAGAGAATAACTGAACTTTTATCTAAGACGAAGAAGATGGAATTACACGCATTTGATGGCCGCAAACCTTAATAAGAGAGGTGAATTTACTAGGAATTGTTGAAGTGCACACTATCGGGGAAGCTTCTTTGATCTGATCATCAACTGAATTCAATGACGGACCAACAACACCATCTGAAAAAAGGAGAGCAAAAGAAGATTCAATGATATCATCCTCATAATTACCAGTTTGATATGTTTCCCAAACTTGAGAATTCAACAAAGGTGATTTTGGCTCCAAACTACTGACACTAACATCTGATTCCTCAATGAAATCATAGTCCGATGAAGGAAGAACCGGATAGTCAGTCGAAATCGATACCCTTTTACTCGCAACTAAAATTGGACTTGGAAGAGAAAAAATCGAATGATTGAAGTTATTGGAAGACTGAAGCAATGATTTCTTCAAGGACTTTTTTTGTGAGAAACTATTGGTTGAGATGGAGGCAACTGAAGTCGTAAGCAAGACTCTTCAATATATTCCTGTTTCTCCTCAAAATTCTCAAAATTTGTTGGAAAAGAATTTACTTGACCTTTCTTTCGTGAATAAAAGAGGGGATAAGCTTCAATAAAATGAGGAAATGGAGCAATCACTTTTGGCATATTTTTCTGAATAATCCCTTATATATATTTTTAATTTTTCTTTTTAAAAGGGGAAAAATTCATTAATGAAATGAAAAATTACATAGTAGCAGCATCAACTTATATGAGAAAAACCCAGCAAATAGAACAGCAACCTAATTAGTAAACAGAAGAAACCAACAGACCGTGAAACCCTTAAGCTAAAACTATAGACCAGCCTATAGACATGGCAGGAACGAACTAAAAGATTACACAAAGTTCACTGAGCAGATAAAGTAGGCTAATACCTTCTATAGGTAAAATAGGAATAAATGTCAAGGTTAGAAAATTGTGGACCGGAAAAATGCAGACATTATGTATTTATTTGTTTTACATTTGGTGTCATGACTAACTCAATGGTTTGTAAATAGACTTGTGCAATGTATGCCTATTCTCTTTAATCATGATACCATCAATGGGTTGCTGGATGCATTGAATGGGCTTGGGATACAAAGTTGGACCTAAAAATTGGGACACCATCAATTTGTGTTAAGCTTGAGGATGGATGTACCATAAAACCAAAGTCATGAGAGCTATTATGGATTTTTAGCTGTCATTTGTCCTA
AGTTCTAAATGCCCTTATCCACTTTTCTAGCTTCGATGAATGATGTCTAAAGGGCTTTCTTCTTCTTCTTGTTTTTCTTTTTTTCTTTTTTTTCATTGTTTTTGAAATTTGTTATGGTCTACTTTGATGACATTATAATACATAACCCATTGATGACATAATAGCTAATCCATCTTTAGGTCTCTTCTCTCTCCCCCTACACCTTTCAAACCCCTACCTCCCCACCATTCAGCTTCCAACCCCCGTAGATGTCTCTGACCACCACTCTCCTCCACATTCGGTTGCTGCTCCCCATCAATACCGTCCACCACACGCTAGCTATGCTCCCTCTTCTCTTTTTCTTTAAACCATAGCCTTCCCTACGCCCTAACCTGTCAGCTCTCCGCCCCAACCAGTCATGACTGCCGCTCCTTGTTGTCTTCCATCACTCTTCGACAGTCATTTCTCTCTCTTGCTTTTCTTATCTAAACCTTAGGCACTCCTTTTGTTGTTTTGTTCGTCAGCCTTGTCCATTTTCTTCTTTAAACCCTAGCCACTCGCTTTGACATGCACTATTTGTCCTGTCTTTGTTCTCCACCATCCCTTTTCTTACATTCGCTTCGCTGTTAGCAGTAGCTGTCGCTTGGGCTTTTTCATATTTCCTTGGAGATGATGAGCTATTGTTTACTTTTGCATTTGGTATGACAATGACATACGTTTTGTAAGACATGAACAAAATGAAGATTTCTCTCATTTGATTTGGTTTGAAAATTCTATGGTTGAACTGATGCAAAATCTAATGCACACATTTTTTCGTAAGCAAATCAGGAATGCTTTTGGGACTATTCGTTTGGCCAAGTTTAGATCTCCCCAAAGATGGAGTTTAGCCACCTTCTAGAGGCAGAACAATATTACATGTATGATGTACCCATTGGCGTTGATAAGAAATGTTGGTTTGTTTTTTGGGACATGGTATGAGATTTTCTTTTAAAAATTGAGAAAAAATCAGCACTCATCTTTCATTTTTTCCAAATGAAGTTGAGGAAGGAGAAATCTTGCAATTGGAGCTTGTCAAAGATAATCAACTTCACCAAACTTATGCAGATGTGGTACAAAAGAAGAGATCAGATGCTTTTACTTTTTGGGTGAGGAATGAAAAAGTAGTGGAAATGAACTTCAACTCTATCTTGGTGATTGCTAGATTTCTAGCTCACATTTTCTGGAAGGACATTCATAGTACTTTAGAGGTTTATTTTCAGTCTAGACCTCTTCATGGTAGATCAATTGTTGTTGAAATTGAGTGTTGATTTTTCCGCGTTGGAGTGTAATGGCAAGTGGAGGCAAATCGAAAAATTTCACTTAAAATTTGAGAATTGTACCAAGTAGAATATTCTCATCTGAAGTTAATTGAGACTCTTAACCTTATTGGTTGTACAACTGGTATAATTGAAGTGGAGAAAAATATTCATGGTTTCTTTTGGACCGAGATTGGGATAACATATTAGAAGTGACAAATTTTCTCTTCATTTTGGAGATATATATCCTTTAGATTGTCCACACATTGATCATGCTGATTTGTCAATTTTCAATTTTACAAACCAAGTGGATCTTTAGAGAATAAATCAAGTTATGGTGGATGAAGGATTTACTTTGCCATCTAAGAAATTTAAGGGCAAAGATGTTGATTTGAACTTTTTTCCTTTGCATGATTAGTTGTAGAAATCTATTAGGTACTTAAGCACCTTGGAGAACCGCACCCCAAAAGCTAGTTATTGAGGTGAGAGAGCCAAGCCACTTAAGTACCACATTGGTCATCTCATTCTAACCAATGTGGGATAAATGTAGCTCATACTTACCAGGCTCTTAATAATACCTCATCCCCAGAAAGCCGATGTCCCAGCGGCTACTCCAACAGTACTTCACTTGGCCATACCCGGTCATAACCCTCTGCCAATTCCACTTTGGAATGTCAACAACCAGCTCTGTTACCATTGTTAGGTACTTAAGCACCTTGGAGAACC
ACGCCCCAAAAGCCAACTATTGAGGTGGGAGAGGCAAGCCACTTAAGTACCACATTGGTCATCCCATTCTAACCCATGTGGGTCACAGGTAGCCCATACTACCTTGGTTCCTAACAAAATCAAATAGTCTCTTTCCTCAAAATGACCTTCAAGTTACACAACTTTCTTCAGAATTTGCTAAGTCCTCTCAACTATATTATCTGGTTACCTAATCCACATTTCATGCCTCTAAAATAAAAAATAGACTTATTGAAGCTTTACCCTTTCCATTATACTTGAAAGAAAGCCCGTTCTGTTGGTTTCTTGATTTTTAAAGATTAACTAAAGTATTTTGGAATAAGCTTGTTCGGCTCATCACTTCCTCACCAAAGTTACAGCAACCTCTTTCACAACCTTCCATCTCGTACACTCATCTAGGGCCAAATGTTACTTTAACCAAAAGTATCTTAAGCTCTTCAATTACTCATGATGTTGAATTGGATGAAACATCTAATGTTAATATGAGCAGTGAGGAAGCTAGAGCTCATTTAATTGATATAGAAATTGATGAATTTCCTTTCGAAGACTCTTTTGCAGAAGCATTCAAAAGAATTTTTTCTGGTTGATGATAAAAGGAATTCAGATGATGTTTTGCAATCAACAGCATCAGTAGTTAAGTATCCTTCAATTTGTCCTAAATTTCTTTCCTTAATCGAGGTTTGTGGACAATAGATGAAAGAAATCACGCCACAAATCAAAATCTAATTGCCTTTTGCTGGTTTCTAATGATTGTCCTGTAATGTGTGGTTCTTCTTGGCATTTGAGAGGCTTTGGGGATTGTTTTCGTGTTGTCTTGACGATTTTCTTCAGTTTCATTTTGAAGTTTAACAGTTGAAGCCTCTTTTGGAAATCTATGAGGCCTTTTATTTTGATGGTTTGGTGTTTAGTGTAGTTGGGAATCTTCCAAGACCTTTATTCTTTAGTTTCCCTTGGCTCCTGCCTCTTTTGGTAGGTTTTGCTTAGAGCTCAGCACTTTTTGCCTTTATTTTGGTGTTTTTTTGGGATTATTATTTTTTGGTTCTTTTTTAGGATGCTTTAACTTTATTTACATGGTTTTTTCCCCTTTGAATTCGTTCTTCGTTTTTTTATTTCCTTGGTGGAGCATTGTACTTTTTTAACATTAGTTTCTTCTCATTATTTCAATGAAAATTTCTTGTTTCTTGTGTCAAAGAAAAAGTTAATCCATCTTTCTAGGGTGTTGGGTGTTTATCATAACTAGCTTTTGCTTGATTAGCTATTTCGATTATTAAATGTTGCAATTAGAATCCTGTTAGACCCCCAATACTGCATACGTATGCTAGGAGGGGTAGAAGGGGTAATTCACCCACTTAGGGAGAGGTGATTAGACTATAAATATGTTTTTGTATCTTTTGGGGGGTTATCATATTTTGTTATATTCTGGAGTGACCAGAAAGGGAGAGTCCCACCCTCTTCAGTGGTGGAGATCTTGTAACACAGTTGTTTGATAAATAATATATTGTCCTGTGCGAAGTTAGTTTACATATCCTTTCTGGTGTATTCCTCGTAGCCATTAGAAAATCTCTGAACTGATCTCCCTTGTTTATGCAGGTAGGGTTGTAGAACTCTTAGAAGCCTTAGAGGCTATGACTAGAGATAACCAACAGATTCCTCCGAGAGCCATGATCTTGAGCAGAAAGTATCGATCACTGGTGAGCTCATGGATTGAACCTCTACAGGAAGAAGCTGAACATGGATTCGAGATAGACTACATTGCAAGGTGGTATATTATTTTTAGAGTAATCCGCTGCAACTCTGTCTCTCTCTAGCTCTTTTCTTTTGTTCTTTCCAGTAACACCTTGATAAATTTATTTGTCATTGAGTTTATTTATTTTCTTTATTTTGGAAAAGGGTAAAAAATCATGGGTAATAGTCATTGTTTTAAACACAATATCTAAACCAACCTGGAACAAGCCTGTTTTTGGCAATGTCCTTTGGTATTG
GAGGAAAACACTTGAGCAGCATGCACTTTGGTCTGCTGGACGATGATCCACTGTTTACTTATTTGTTTGAACCCTTTTTTCTTCATTGTTTTACATGTCTTTTCTTTCTATTTTTTTTTCCATCCAGTAACATGTCTGTGCCCTTTTGTACTTTTGTGTGTCACCCTTCTAGATGTTGCTTGTGCTTTGAAAAACAGGTCTTATTCCAATCTGTACGTTAAAAATAGTGTGCTTCGGTGGTTTACCCCTAAGATTGTGGTACAACTAGTTTTATTTATTCCCAGCAAAGTGATACACGACCAAGAGAGAATTTTAGGGGTCCCTTTCTCTTCTGAATTCCTAGTTACCTCACCCAATAAAGTGCCCACTCGATAAACTGAACTCCTTAAATACTTACTGACTCCAATCCTCTTCCCACCTTATCCTTTCTGCTAATAGTTATAAGGGGGTTCACTCTTCACAATCACACAAACTACTCAGTAGACTTAATCCTCATTTGCTTTGTACTATCCCACTTCAATAATAGGTATATATCATTATCCATCCTAAAGGGCAGCTTGATGAAAATCTAGAAATTGAATAAGAGTAAAGAATTGTTTCTCACATAGTGAGTAAAAACTACATCCATCTATAGTTTTTCTTTTTTCTTTTGGGACAACCCGCCTGACCCTACAATATTTGGGTGTGAAGGAAATTCGCAGGATATTAAATCCTAGGTAGGTGGCCATCATGGATTGAACCCATGACCTCTTAGCCTTTTATCAAAATCATGCCCTAGATCCATCTATAGTTCAGAACTTCCAATAATATTAGGATCAGACTGTGAAACCTATGAACCTGAGACCTCATATAGTTGGACACAAACAACCATACCTGAATTATGTTGTGGTCTGGGTAATGGCCTTCTTTAAAAGAGGAAAACAGGGTGGATTGATGCGAGATCAAGTAACAACAGTATATCCCCAACGAATCCAATTTTTTATCATTGACATAAAATGGATCAACAACCCATGGTGTCAGCTCAGGATTAGTGTGCCTAGCCAAGGATCCTTGACAATTAGGATGTAGCTTGAGATAGTCCCGATCTCATAGTGTGTTTGGAATAACTTAGGGAAATAATGCTTTTTTAAACCAAATCAATTTTTCCGTAAGCAACTTTGAAAAAATTAATAATACAACTTCTTGAATGAATTTAGATGAAAAAATATTAGCAAATAAAAACAATATGAAAGTTTAATGCTCCAGTTTGTTCGGATCCTATTTGTTTAAAGTGGTATGAACTAGCTCTGCTGTGAATCAGATACATTGAAGAGGGTGGACTCACTGGAGAACGCAAGAGATGGGTCCCTCGAAGAGGAAGAACTCCTCTAGATCCTGATGCAGATGGATTCATCTATTCAAATCCTATGGAGACATCCTTTAAGCAACGATGTCTAGAAGACTGGAAGATGTACCACCGGAAGATTTTGAAAACCTTGCAGAATGAAGGACTTGCTGCTCTTGGGGATGCATCTGAAGCTGATTATCATAGAGTCGTAGAGAGATTGAAGAAAATAATAAAGGGTCCAGACCAAAATGTTTTAAAGCCAAAGGCTGCAAGTAAGATGATTGTATCAGAATTAAAAGAAGAATTAGAAGCACAAGGTTTACCAATTGACGGAACTAGAAATGTTCTTTACCAGCGTGTTCAAAAAGCAAGGAGAATTAACCGGTCTCGTGGTCGGCCCCTTTGGGTTCCTCCAGTGGAGGAGGAGGAAGAAGAGGTTTACAAGTCATTTGATGTTCATTTTTAAAGAAAAATTAATAGCCATTGAAGTTTTCAATTAACAAAAAAGTACTTATATCTTCAGGTTGATGAAGAGCTGGATGAACTAATTTCACGAATAAAGCTACACGAAGGAAATACAGAGTTCTGGAAACGCCGATTTCTTGGAGAAGGCTTGGACAGTAATAATGTTAAACCTTCTGAAGATGATAAATCAGAACCTCTTGATTCTTTG
GATGATGTTGACACTGTAGAAGACGTTGCAAAGGAGATTGAAGAAGAAGAAGCTGAAGAGGAAGAGGAGGTAGAACAAACTGAGAATCAAGATGGTGAAAGAGTTATTAAGAAGGAAGTTGAAGCTAAAAAGCCTCTACAAATGATAGGTGTCCAATTGTTAAAAGATGTTGACCAAACCACAACAACATCCAAAAAGTCAAGGAGGAGAAGTTCTCGGGCATCACTTGAGGTAGAACTCCTAACTTCTGGTTTTTTTCCCCTCATCTATTCTCAATGCCAATGTCCTCGAACTAGTTAACTGAGCTAATTATCTAAGCATAAATATTCAAAGGGTTATTTAAAACGGTTGGGGAAGATATTATGGAAATGAAATTTTAGGAATAGGATGATGTGCCTACTGCATTAGATTTTCATAGACTAAAAAATGAAATTGAGACTGGAAAGGAGAAAAACAAGAAAACTTGTTAGATTGTGTGTTAATACTAATTTGGAAACTAAAAATTAACTTCTCAGTGATGGAAACATATGTCTGCTTTACATCTTTTACTCATTAAATCAAAACATTTCCATGTGTGCTTATATTTTCCTACTTTAATGTGTGGATGCAGGACGATCGTGATGAAGATTGGTTTCCTGAAGATATATTTGAGGCATTTGAGGAGTTGCGAAAGAGGAAAGTCTTTGATGTATCTGACATGTACACAATAGCTGATGTTTGGGGTTGGACTTGGGAGAGAGAGCTTAAGAACAGACCTCCCAGGAGGTGGTCACAGGAATGGGAAGTGGAGCTGGCTATTAAAATTATGCACAAGGCAATATGTCTCGACCCTTATCCATTTTACCTTACATTAGGATTGTCTTTTTATTATCTTATTCTATCTCTCTTTTACCTTACCGTCAACAGTCGACCTAAGAAGCACGGATACAACATGTTGCGGACATGACGACATGTAATTTTTAAAAAAAATCTAGGACACGACACAACAAAGACTCATTTATAAAAATATAAATTTTTAAAAGTATATATCATTTATTGTTATTGTCATATATGTGTCTTTTAGTCTACTCAACAAGTGTTTTGTGCATATCTAACACATTTGTTGCAATAACAAATGTCCGATACGTGTTCAACAAGTGTCGGATTGTCCAAGTGTCTGACACATCCAACAAGTGATGGAGCGTTCAAGTGTCTGACACGTGTAGGACATGGACATGCTAGCCAAACTAAAGTGTTCGTGCTTCTTAGCCGTCGACATTTGAGTAAATTATGAATAAACCTTTATAGATCTACTTGGGAATACATTTTAGGATTAGTTGGTGCTTTGCATCTTCTTGTCTTTTTTATTCAGCCTACACTTATTATTTACTTATTTATATTTTTATAAAATATAAGAAATAGAAGGAACAATTTTATTGATGTATGAAATTTAGAGGGGGAAGTAGTAAAAAAAGTTATAATGAACATTTCCACTTGGACAAAACGAGAGGATATCCTGTAAGAGTTAAAAGAGTATCTATTTTACATCAAGATAAAGTAAGGAAAAGAATACTATAGAAAATGACTAGATAAGTATTTTCCTTTGATTAAAAAATTCTCTGATTTCTTTCCATCCAAGTCAACCATAGAAAAGTTTGGAACAAGATTCATCCAAATGATCTCTTTGCTTCCTCGAGAGGGGTGATCCAAAAAGAGAGGAGATTAAGCATAAATAATGAAGGGATCCCTTTGGAAGACCATCTCCTACCTAAAGGTAGCCATCATGTTATTCCAAAGTGTTGAGCAAAAGAACAGAGATAAAAACGTGCGACTGTGTTTACTCATCATGCATTTCTTGTGTTAGAGATACATAAATGTTTGACACCTAAAACTGTGATAAAGCCCGTCAGAAGAGAGCTTCGATCTCTTTCTGCATGAATGACCGTGAAATCCATGGCATAGCAGGATAATTGGTTGCTGAATGTAGGAGTATTTTCCGTATCCAAAATTTCATTTGGGAA
ATCATTTTCATTAAAATTGTTTGTCTTGACCTCTTGACTTCGACGGTATTTAAATTTGCTTTGTCTAATCTTCATTCAATGTTCCACTTACTATTACAGGTGATTGAATTGGGTGGAACACCAACAATTGGTGACTGTGCCATGATCTTGCGAGCTGCCATCAAGGCTCCTGTACCTTCTGCCTTCTTGAAAATCTTGCAGACAACTCATGGTCTTGGCTATGTATTTGGGAGGTATGTTTATTTTTCTTGAGATAAGAGATTCGAAATTTATCTTCAATAGTCAAAGCCTAACTATCTGTAAATATGACATGAAAAGTAAACTTCATATCATTTGGTGAGGTTTATTTATTTATTTATCTATTTGAATTCTTACAATGTGTGGGGCTATTTGAATTCTTACAATATGTGGGGCTGAAATATTTGAACCAGTGACTTTTTGGTCGATAATATATGCTTAAACTAGTTGAACGGTACTCAGGTTGACATCTTTTAGTGAGTTGTAAATTTACTAATATCTCTTAATGGATTCATATTCTATCGTAACAACCCATGTTGAATCAGGTGGTGTGGAGTGGATATAATGTTTATGTATAGGTGTATGTGTCCAAGTCATTAGATTATATCACAGGTCATAAATTTATAAATAATTGTAAATTCATCCTTCAATAAAGTCATAAATTGTATTTGAATGTTAATTTTAACATGTAATGGCGCCAAGGATTTTTTAAACCCTTAAACTTCTGACCACCGTAATAATTTTGAATTGTCTCTGACAATTTTGGCCACACAAGTATGTATCTATGTTTGAACTCTTGGGTTGTCTTTGTATTGCAGCCCTTTATATGATGAGGTTATCACCCTGTGCCTTGATCTTGGGGAACTAGATGCAGCCATTGCAATTGTAGCAGACCTGGAAACAACAGGAATCTTGGTTCCCGATGAAACGCTCGATCGGGTAATCTCCGCTAGACAGACGAACGATGCAATGCCCAAGCCTGTCTCAGTCATTGATACTACACTAAATGATCACAGTTTAGCCAATGATGAAGCATCATAATGGATCAGTTTGTGTTCTTATTTTTCTTTTGTACAGTTCAGTCGTAGGGTTGCTAGCAAAATGGTTGTAAATTTTGTTTGTAGTTGCTTTATTGGTTTTGTATTTTGAATTTTGGAGTTGAAATATTTGATTTTAGAAACAAGATATTGAGTTTTTTTATTTTCTTTT

mRNA sequence

ATGACAGATTACATGAAGCCCGATACTGAGACATATAATTGGGTGATTCAAGCATATACAAGGGCTGAATCTTATGATAGGGTGCAAGATGTTGCCGAATTGCTTGGCATGATGGTTGAAGACCACAAGCGTCTACAGCCAAACATGAGAACCTATGCGCTCTTGGTAGAGTGTTTTACTAAGTATTGTGTTATACGAGAAGCTATTAGGCATTTCCGTGCACTAAAAACCTTTCAAGGTGGAACAAAAGCTTTGCATAATGAAGGGAATTTTGGTGATCCACTTTCCTTATATCTTCGAGCTTTATGTAGAGAAGGTAGGGTTGTAGAACTCTTAGAAGCCTTAGAGGCTATGACTAGAGATAACCAACAGATTCCTCCGAGAGCCATGATCTTGAGCAGAAAGTATCGATCACTGGTGAGCTCATGGATTGAACCTCTACAGGAAGAAGCTGAACATGGATTCGAGATAGACTACATTGCAAGATACATTGAAGAGGGTGGACTCACTGGAGAACGCAAGAGATGGGTCCCTCGAAGAGGAAGAACTCCTCTAGATCCTGATGCAGATGGATTCATCTATTCAAATCCTATGGAGACATCCTTTAAGCAACGATGTCTAGAAGACTGGAAGATGTACCACCGGAAGATTTTGAAAACCTTGCAGAATGAAGGACTTGCTGCTCTTGGGGATGCATCTGAAGCTGATTATCATAGAGTCGTAGAGAGATTGAAGAAAATAATAAAGGGTCCAGACCAAAATGTTTTAAAGCCAAAGGCTGCAAGTAAGATGATTGTATCAGAATTAAAAGAAGAATTAGAAGCACAAGGTTTACCAATTGACGGAACTAGAAATGTTCTTTACCAGCGTGTTCAAAAAGCAAGGAGAATTAACCGGTCTCGTGGTCGGCCCCTTTGGGTTCCTCCAGTGGAGGAGGAGGAAGAAGAGGTTGATGAAGAGCTGGATGAACTAATTTCACGAATAAAGCTACACGAAGGAAATACAGAGTTCTGGAAACGCCGATTTCTTGGAGAAGGCTTGGACAGTAATAATGTTAAACCTTCTGAAGATGATAAATCAGAACCTCTTGATTCTTTGGATGATGTTGACACTGTAGAAGACGTTGCAAAGGAGATTGAAGAAGAAGAAGCTGAAGAGGAAGAGGAGGTAGAACAAACTGAGAATCAAGATGGTGAAAGAGTTATTAAGAAGGAAGTTGAAGCTAAAAAGCCTCTACAAATGATAGGTGTCCAATTGTTAAAAGATGTTGACCAAACCACAACAACATCCAAAAAGTCAAGGAGGAGAAGTTCTCGGGCATCACTTGAGGACGATCGTGATGAAGATTGGTTTCCTGAAGATATATTTGAGGCATTTGAGGAGTTGCGAAAGAGGAAAGTCTTTGATGTATCTGACATGTACACAATAGCTGATGTTTGGGGTTGGACTTGGGAGAGAGAGCTTAAGAACAGACCTCCCAGGAGGTGGTCACAGGAATGGGAAGTGGAGCTGGCTATTAAAATTATGCACAAGGTGATTGAATTGGGTGGAACACCAACAATTGGTGACTGTGCCATGATCTTGCGAGCTGCCATCAAGGCTCCTGTACCTTCTGCCTTCTTGAAAATCTTGCAGACAACTCATGGTCTTGGCTATGTATTTGGGAGCCCTTTATATGATGAGGTTATCACCCTGTGCCTTGATCTTGGGGAACTAGATGCAGCCATTGCAATTGTAGCAGACCTGGAAACAACAGGAATCTTGGTTCCCGATGAAACGCTCGATCGGGTAATCTCCGCTAGACAGACGAACGATGCAATGCCCAAGCCTGTCTCAGTCATTGATACTACACTAAATGATCACAGTTTAGCCAATGATGAAGCATCATAATGGATCAGTTTGTGTTCTTATTTTTCTTTTGTACAGTTCAGTCGTAGGGTTGCTAGCAAAATGGTTGTAAATTTTGTTTGTAGTTGCTTTATTGGTTTTGTATTTTGAAT
TTTGGAGTTGAAATATTTGATTTTAGAAACAAGATATTGAGTTTTTTTATTTTCTTTT

Coding sequence (CDS)

ATGACAGATTACATGAAGCCCGATACTGAGACATATAATTGGGTGATTCAAGCATATACAAGGGCTGAATCTTATGATAGGGTGCAAGATGTTGCCGAATTGCTTGGCATGATGGTTGAAGACCACAAGCGTCTACAGCCAAACATGAGAACCTATGCGCTCTTGGTAGAGTGTTTTACTAAGTATTGTGTTATACGAGAAGCTATTAGGCATTTCCGTGCACTAAAAACCTTTCAAGGTGGAACAAAAGCTTTGCATAATGAAGGGAATTTTGGTGATCCACTTTCCTTATATCTTCGAGCTTTATGTAGAGAAGGTAGGGTTGTAGAACTCTTAGAAGCCTTAGAGGCTATGACTAGAGATAACCAACAGATTCCTCCGAGAGCCATGATCTTGAGCAGAAAGTATCGATCACTGGTGAGCTCATGGATTGAACCTCTACAGGAAGAAGCTGAACATGGATTCGAGATAGACTACATTGCAAGATACATTGAAGAGGGTGGACTCACTGGAGAACGCAAGAGATGGGTCCCTCGAAGAGGAAGAACTCCTCTAGATCCTGATGCAGATGGATTCATCTATTCAAATCCTATGGAGACATCCTTTAAGCAACGATGTCTAGAAGACTGGAAGATGTACCACCGGAAGATTTTGAAAACCTTGCAGAATGAAGGACTTGCTGCTCTTGGGGATGCATCTGAAGCTGATTATCATAGAGTCGTAGAGAGATTGAAGAAAATAATAAAGGGTCCAGACCAAAATGTTTTAAAGCCAAAGGCTGCAAGTAAGATGATTGTATCAGAATTAAAAGAAGAATTAGAAGCACAAGGTTTACCAATTGACGGAACTAGAAATGTTCTTTACCAGCGTGTTCAAAAAGCAAGGAGAATTAACCGGTCTCGTGGTCGGCCCCTTTGGGTTCCTCCAGTGGAGGAGGAGGAAGAAGAGGTTGATGAAGAGCTGGATGAACTAATTTCACGAATAAAGCTACACGAAGGAAATACAGAGTTCTGGAAACGCCGATTTCTTGGAGAAGGCTTGGACAGTAATAATGTTAAACCTTCTGAAGATGATAAATCAGAACCTCTTGATTCTTTGGATGATGTTGACACTGTAGAAGACGTTGCAAAGGAGATTGAAGAAGAAGAAGCTGAAGAGGAAGAGGAGGTAGAACAAACTGAGAATCAAGATGGTGAAAGAGTTATTAAGAAGGAAGTTGAAGCTAAAAAGCCTCTACAAATGATAGGTGTCCAATTGTTAAAAGATGTTGACCAAACCACAACAACATCCAAAAAGTCAAGGAGGAGAAGTTCTCGGGCATCACTTGAGGACGATCGTGATGAAGATTGGTTTCCTGAAGATATATTTGAGGCATTTGAGGAGTTGCGAAAGAGGAAAGTCTTTGATGTATCTGACATGTACACAATAGCTGATGTTTGGGGTTGGACTTGGGAGAGAGAGCTTAAGAACAGACCTCCCAGGAGGTGGTCACAGGAATGGGAAGTGGAGCTGGCTATTAAAATTATGCACAAGGTGATTGAATTGGGTGGAACACCAACAATTGGTGACTGTGCCATGATCTTGCGAGCTGCCATCAAGGCTCCTGTACCTTCTGCCTTCTTGAAAATCTTGCAGACAACTCATGGTCTTGGCTATGTATTTGGGAGCCCTTTATATGATGAGGTTATCACCCTGTGCCTTGATCTTGGGGAACTAGATGCAGCCATTGCAATTGTAGCAGACCTGGAAACAACAGGAATCTTGGTTCCCGATGAAACGCTCGATCGGGTAATCTCCGCTAGACAGACGAACGATGCAATGCCCAAGCCTGTCTCAGTCATTGATACTACACTAAATGATCACAGTTTAGCCAATGATGAAGCATCATAA

Protein sequence

MTDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMTRDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRRGRTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYHRVVERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSEPLDSLDDVDTVEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRAAIKAPVPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISARQTNDAMPKPVSVIDTTLNDHSLANDEAS
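
The relationship between the coding sequence and the protein sequence can be checked with a short translation routine. This is a sketch under stated assumptions: it uses the standard genetic code (NCBI translation table 1), which the record does not name explicitly, and the helper `translate` is a hypothetical name. Only the first ten and last five codons of the CDS are used so that the example stays short.

```python
from itertools import product

# Standard genetic code (assumed), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Codons containing ambiguous bases (e.g. N) are reported as 'X'.
    Trailing bases that do not form a full codon are ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGACAGATTACATGAAGCCCGATACTGAG"))  # first 10 codons of the CDS
print(translate("GATGAAGCATCATAA"))                 # last 5 codons, ending in TAA
```

The two fragments translate to `MTDYMKPDTE` and `DEAS`, which match the start and end of the protein sequence listed above.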
BLAST of Lsi06G007560 vs. TrEMBL
Match: A0A0A0M091_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G662830 PE=4 SV=1)

HSP 1 Score: 1128.2 bits (2917), Expect = 0.0e+00
Identity = 573/593 (96.63%), Positives = 583/593 (98.31%), Query Frame = 1

Query: 37  MMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLS 96
           MMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRAL+TF+GGT ALHNEGNFGDPLS
Sbjct: 1   MMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTTALHNEGNFGDPLS 60

Query: 97  LYLRALCREGRVVELLEALEAMTRDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGFE 156
           LYLRALCREGRVVELLEALEAM RDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGFE
Sbjct: 61  LYLRALCREGRVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGFE 120

Query: 157 IDYIARYIEEGGLTGERKRWVPRRGRTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRK 216
           IDYIARYIEEGGLTGERKRWVPR+G+TPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRK
Sbjct: 121 IDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRK 180

Query: 217 ILKTLQNEGLAALGDASEADYHRVVERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQ 276
           ILKTLQNEGL AL DASEADYHRVVERL+KIIKGPDQNVLKPKAASKMIVSELKEELEAQ
Sbjct: 181 ILKTLQNEGLVALRDASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQ 240

Query: 277 GLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTE 336
           GLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTE
Sbjct: 241 GLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTE 300

Query: 337 FWKRRFLGEGLDSNNVKPSEDDKSEPLDSLDDVDTVEDVAKEIEEEEAEEEEEVEQTENQ 396
           FWKRRFLGEGL SNNVKPSEDDKS+PLDSLDDVDT+EDVAKEIEEEEAEEEEEVEQTENQ
Sbjct: 301 FWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQ 360

Query: 397 DGERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIF 456
           DGERVIKKEVEAKKPLQMIGVQLLKDVDQ TTTSKKSRRRSSRASLEDDRDEDWFPEDIF
Sbjct: 361 DGERVIKKEVEAKKPLQMIGVQLLKDVDQPTTTSKKSRRRSSRASLEDDRDEDWFPEDIF 420

Query: 457 EAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELG 516
           EAF+EL+KRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELG
Sbjct: 421 EAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELG 480

Query: 517 GTPTIGDCAMILRAAIKAPVPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAI 576
           G PTIGDCAMILRAAIKAP+PSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAI
Sbjct: 481 GIPTIGDCAMILRAAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAI 540

Query: 577 AIVADLETTGILVPDETLDRVISARQTNDAMPKPVSVIDTTLNDHSLANDEAS 630
           AIVADLETTGILV DETLDRVISARQTNDAMPKP S IDTTLNDHSLANDEAS
Sbjct: 541 AIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDEAS 593
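
The percentages in the HSP summary line are the match counts divided by the alignment length. A minimal sketch of that arithmetic follows; the names `parse_hsp_stats` and `percent` are hypothetical, and the regular expression assumes the `label = n/total (p%)` layout seen in this record rather than any formal BLAST output specification.

```python
import re

# Matches fragments such as "Identity = 573/593 (96.63%)".
STAT = re.compile(r"(\w+)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")

def parse_hsp_stats(line):
    """Return {label: (matches, aligned_length, reported_percent)}."""
    return {
        label: (int(n), int(total), float(pct))
        for label, n, total, pct in STAT.findall(line)
    }

def percent(matches, aligned_length):
    """Percentage of aligned columns that match, rounded to two decimals."""
    return round(100.0 * matches / aligned_length, 2)

stats = parse_hsp_stats("Identity = 573/593 (96.63%)")
n, total, reported = stats["Identity"]
print(percent(n, total), reported)
```

For the cucumber hit, 573 identical residues over 593 aligned columns gives 96.63%, and 583 similar residues over the same length gives 98.31%, in agreement with the reported values.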

Match: A0A0D2TZR4_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_013G119100 PE=4 SV=1)

HSP 1 Score: 1045.4 bits (2702), Expect = 2.7e-302
Identity = 525/604 (86.92%), Positives = 565/604 (93.54%), Query Frame = 1

Query: 3   DYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKY 62
           +YMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPN++TYALLVECFTKY
Sbjct: 277 EYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNVKTYALLVECFTKY 336

Query: 63  CVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMTRDN 122
           CV+REAIRHFRALK ++GGT  LHNEGNF DPLSLYLRALCREGRVVEL+EALEAM++DN
Sbjct: 337 CVVREAIRHFRALKNYEGGTIVLHNEGNFDDPLSLYLRALCREGRVVELVEALEAMSKDN 396

Query: 123 QQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRRGR 182
           Q IPPRAMILSRKYR+LVSSWIEPLQEEAE G+EIDYIARYIEEGGLTGERKRWVPRRG+
Sbjct: 397 QPIPPRAMILSRKYRTLVSSWIEPLQEEAELGYEIDYIARYIEEGGLTGERKRWVPRRGK 456

Query: 183 TPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYHRVVE 242
           TPLDPDA GFIYSNPMETSFKQRCLE+WK+YHRK+LKTLQNEGLAALGDA+E+DY RVVE
Sbjct: 457 TPLDPDATGFIYSNPMETSFKQRCLEEWKIYHRKLLKTLQNEGLAALGDATESDYMRVVE 516

Query: 243 RLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRG 302
           RL+KIIKGPDQNVLKPKAASKM+VSELKEELEAQGLP DGTRNVLYQRVQKARRINRSRG
Sbjct: 517 RLRKIIKGPDQNVLKPKAASKMVVSELKEELEAQGLPTDGTRNVLYQRVQKARRINRSRG 576

Query: 303 RPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSEP 362
           RPLWVPPVEEEEEEVDEELDELISRIKL EGNTEFWKRRFLGEGL+ N VK  ++D+SE 
Sbjct: 577 RPLWVPPVEEEEEEVDEELDELISRIKLEEGNTEFWKRRFLGEGLNVNQVKLIDEDESEA 636

Query: 363 L-DSLDDVDTVEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLK 422
             D LD+ D VED  K+IEEEE EEEEEVEQTE+++ +R+  KEVEAKKPLQMIGVQLLK
Sbjct: 637 ADDELDESDVVEDAGKDIEEEEGEEEEEVEQTESREVDRIKDKEVEAKKPLQMIGVQLLK 696

Query: 423 DVDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWG 482
           D DQTTT SKKSRRRSSR S+EDD DEDWFPEDIFEAF+E+R RKVFDV DMYTIAD WG
Sbjct: 697 DSDQTTTRSKKSRRRSSRVSVEDDDDEDWFPEDIFEAFQEMRDRKVFDVEDMYTIADAWG 756

Query: 483 WTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRAAIKAPVPSAFL 542
           WTWERELKN+PPRRWSQEWEVELAI++M KVIELGGTPTIGDCAMILRAAIKAPVPSAFL
Sbjct: 757 WTWERELKNKPPRRWSQEWEVELAIQVMQKVIELGGTPTIGDCAMILRAAIKAPVPSAFL 816

Query: 543 KILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISAR 602
           KILQ TH LG+VFGSPLYDE I+LC+DLGELDAAIAIVADLETTGI VPD+TLDRVISAR
Sbjct: 817 KILQKTHSLGFVFGSPLYDEAISLCIDLGELDAAIAIVADLETTGIAVPDQTLDRVISAR 876

Query: 603 QTND 606
           QT D
Sbjct: 877 QTMD 880

Match: A0A0B0PXA0_GOSAR (Uncharacterized protein OS=Gossypium arboreum GN=F383_10624 PE=4 SV=1)

HSP 1 Score: 1036.2 bits (2678), Expect = 1.7e-299
Identity = 525/604 (86.92%), Positives = 563/604 (93.21%), Query Frame = 1

Query: 3   DYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKY 62
           +YMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPN++TYALLVECFTKY
Sbjct: 277 EYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNVKTYALLVECFTKY 336

Query: 63  CVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMTRDN 122
           CV+REAIRHF ALK ++GGT  LHNEGNF DPLSL+LRALCREGRVVELL+ALEAM++DN
Sbjct: 337 CVVREAIRHFLALKNYEGGTIVLHNEGNFDDPLSLFLRALCREGRVVELLQALEAMSKDN 396

Query: 123 QQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRRGR 182
           Q IPPRAMILSRKYR+LVSSWIEPLQEEAE G+EIDYIARYIEEGGLTGERKRWVPRRG+
Sbjct: 397 QPIPPRAMILSRKYRTLVSSWIEPLQEEAELGYEIDYIARYIEEGGLTGERKRWVPRRGK 456

Query: 183 TPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYHRVVE 242
           TPLDPDA GFIYSNPMETSFKQRCLE+WK+YHRK+LKTLQNEGLAALGDA+E+DY RVVE
Sbjct: 457 TPLDPDATGFIYSNPMETSFKQRCLEEWKIYHRKLLKTLQNEGLAALGDATESDYMRVVE 516

Query: 243 RLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRG 302
           RL+KIIKGPDQNVLKPKAASKM+VSELKEELEAQGLP DGTRNVLYQRVQKARRINRSRG
Sbjct: 517 RLRKIIKGPDQNVLKPKAASKMVVSELKEELEAQGLPTDGTRNVLYQRVQKARRINRSRG 576

Query: 303 RPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSEP 362
           RPLWVPPVEEEEEEVDEELDELISRIKL EGNTEFWKRRFLGEGL+ N VK  ++D+SE 
Sbjct: 577 RPLWVPPVEEEEEEVDEELDELISRIKLEEGNTEFWKRRFLGEGLNVNQVKLIDEDESEA 636

Query: 363 L-DSLDDVDTVEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLK 422
             D LD+ D VED AK+IEEEE EEEEEVEQTE+Q+ +R+  KEVEAKKPLQMIGVQLLK
Sbjct: 637 ADDELDESDVVEDAAKDIEEEEGEEEEEVEQTESQEVDRIKDKEVEAKKPLQMIGVQLLK 696

Query: 423 DVDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWG 482
           D DQTTT SKKSRRRSSR S+EDD DEDWFPEDIFEAF+E+R RKVFDV DMYTIAD WG
Sbjct: 697 DSDQTTTRSKKSRRRSSRVSVEDDDDEDWFPEDIFEAFQEMRDRKVFDVEDMYTIADAWG 756

Query: 483 WTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRAAIKAPVPSAFL 542
           WTWERELKN+PPRRWSQEWEVELAI    +VIELGGTPTIGDCAMILRAAIKAPVPSAFL
Sbjct: 757 WTWERELKNKPPRRWSQEWEVELAI----QVIELGGTPTIGDCAMILRAAIKAPVPSAFL 816

Query: 543 KILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISAR 602
           KILQ TH LGYVFGSPLYDEVI+LC+DLGELDAAIAIVADLETTGI VPD+TLDRVISAR
Sbjct: 817 KILQKTHSLGYVFGSPLYDEVISLCIDLGELDAAIAIVADLETTGIAVPDQTLDRVISAR 876

Query: 603 QTND 606
           QT D
Sbjct: 877 QTMD 876

Match: A0A061F8G9_THECC (Plastid transcriptionally active 3 isoform 2 OS=Theobroma cacao GN=TCM_026112 PE=4 SV=1)

HSP 1 Score: 1035.4 bits (2676), Expect = 2.8e-299
Identity = 521/628 (82.96%), Positives = 574/628 (91.40%), Query Frame = 1

Query: 3   DYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKY 62
           +YMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKR+QPN++TYALLVECFTKY
Sbjct: 152 EYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRVQPNVKTYALLVECFTKY 211

Query: 63  CVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMTRDN 122
           CV++EAIRHFRALK F+GGT+ L NEGNF DPLSLYLRALCREGR+VELLEAL+AM +DN
Sbjct: 212 CVVKEAIRHFRALKKFEGGTRVLQNEGNFDDPLSLYLRALCREGRIVELLEALQAMAKDN 271

Query: 123 QQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRRGR 182
           Q IPPRAMILSRKYR+LVSSWIEPLQEEAE G+EIDYIARYIEEGGLTGERKRWVPRRG+
Sbjct: 272 QPIPPRAMILSRKYRTLVSSWIEPLQEEAELGYEIDYIARYIEEGGLTGERKRWVPRRGK 331

Query: 183 TPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYHRVVE 242
           TPLDPDA GFIYSNPMETSFKQRCLEDWK++HRK+LKTLQNEGLAALG ASE+DY RV E
Sbjct: 332 TPLDPDAAGFIYSNPMETSFKQRCLEDWKLHHRKLLKTLQNEGLAALGGASESDYVRVSE 391

Query: 243 RLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRG 302
           RLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRG
Sbjct: 392 RLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRG 451

Query: 303 RPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSEP 362
           RPLWVPPVEEEEEEVDEE+DELISRIKL EGNTEFWKRRFLGE L+ ++VKP ++ +SEP
Sbjct: 452 RPLWVPPVEEEEEEVDEEVDELISRIKLEEGNTEFWKRRFLGEHLNVDHVKPIDEGESEP 511

Query: 363 L-DSLDDVDTVEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLK 422
             D LDD D VED AK+IE++EA+EEEE EQ E+Q+G+R+  KEVEAKKPLQMIGVQLLK
Sbjct: 512 ADDELDDGDVVEDAAKDIEDDEADEEEEGEQAESQEGDRIKDKEVEAKKPLQMIGVQLLK 571

Query: 423 DVDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWG 482
           D DQTTT SKKSRRRSSR S+EDD D+DWFPEDIFEAF+ELR+RKVFDV DMYTIAD WG
Sbjct: 572 DSDQTTTRSKKSRRRSSRVSVEDDDDDDWFPEDIFEAFQELRERKVFDVEDMYTIADAWG 631

Query: 483 WTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRAAIKAPVPSAFL 542
           WTWE+ELKN+PPR+WSQEWEVELAI++M KVIELGGTPT+GDCAMILRAAIKAP+PSAFL
Sbjct: 632 WTWEKELKNKPPRKWSQEWEVELAIQVMQKVIELGGTPTVGDCAMILRAAIKAPMPSAFL 691

Query: 543 KILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISAR 602
           KILQT H LG+VFGSPLYDEVI++C+DLGELDAAIAIVADLET GI VPD+TLDRVISAR
Sbjct: 692 KILQTAHSLGFVFGSPLYDEVISICVDLGELDAAIAIVADLETAGIAVPDQTLDRVISAR 751

Query: 603 QTNDAMPKPVSVIDTTLNDHSLANDEAS 630
           QT D     VS   ++    S ++   S
Sbjct: 752 QTVDTAGGDVSSSSSSSTTSSSSSSTTS 779

BLAST of Lsi06G007560 vs. TrEMBL
Match: A0A061F1L8_THECC (Plastid transcriptionally active 3 isoform 1 OS=Theobroma cacao GN=TCM_026112 PE=4 SV=1)

HSP 1 Score: 1035.4 bits (2676), Expect = 2.8e-299
Identity = 521/628 (82.96%), Positives = 574/628 (91.40%), Query Frame = 1

Query: 3   DYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKY 62
           +YMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKR+QPN++TYALLVECFTKY
Sbjct: 275 EYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRVQPNVKTYALLVECFTKY 334

Query: 63  CVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMTRDN 122
           CV++EAIRHFRALK F+GGT+ L NEGNF DPLSLYLRALCREGR+VELLEAL+AM +DN
Sbjct: 335 CVVKEAIRHFRALKKFEGGTRVLQNEGNFDDPLSLYLRALCREGRIVELLEALQAMAKDN 394

Query: 123 QQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRRGR 182
           Q IPPRAMILSRKYR+LVSSWIEPLQEEAE G+EIDYIARYIEEGGLTGERKRWVPRRG+
Sbjct: 395 QPIPPRAMILSRKYRTLVSSWIEPLQEEAELGYEIDYIARYIEEGGLTGERKRWVPRRGK 454

Query: 183 TPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYHRVVE 242
           TPLDPDA GFIYSNPMETSFKQRCLEDWK++HRK+LKTLQNEGLAALG ASE+DY RV E
Sbjct: 455 TPLDPDAAGFIYSNPMETSFKQRCLEDWKLHHRKLLKTLQNEGLAALGGASESDYVRVSE 514

Query: 243 RLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRG 302
           RLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRG
Sbjct: 515 RLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRG 574

Query: 303 RPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSEP 362
           RPLWVPPVEEEEEEVDEE+DELISRIKL EGNTEFWKRRFLGE L+ ++VKP ++ +SEP
Sbjct: 575 RPLWVPPVEEEEEEVDEEVDELISRIKLEEGNTEFWKRRFLGEHLNVDHVKPIDEGESEP 634

Query: 363 L-DSLDDVDTVEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLK 422
             D LDD D VED AK+IE++EA+EEEE EQ E+Q+G+R+  KEVEAKKPLQMIGVQLLK
Sbjct: 635 ADDELDDGDVVEDAAKDIEDDEADEEEEGEQAESQEGDRIKDKEVEAKKPLQMIGVQLLK 694

Query: 423 DVDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWG 482
           D DQTTT SKKSRRRSSR S+EDD D+DWFPEDIFEAF+ELR+RKVFDV DMYTIAD WG
Sbjct: 695 DSDQTTTRSKKSRRRSSRVSVEDDDDDDWFPEDIFEAFQELRERKVFDVEDMYTIADAWG 754

Query: 483 WTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRAAIKAPVPSAFL 542
           WTWE+ELKN+PPR+WSQEWEVELAI++M KVIELGGTPT+GDCAMILRAAIKAP+PSAFL
Sbjct: 755 WTWEKELKNKPPRKWSQEWEVELAIQVMQKVIELGGTPTVGDCAMILRAAIKAPMPSAFL 814

Query: 543 KILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISAR 602
           KILQT H LG+VFGSPLYDEVI++C+DLGELDAAIAIVADLET GI VPD+TLDRVISAR
Sbjct: 815 KILQTAHSLGFVFGSPLYDEVISICVDLGELDAAIAIVADLETAGIAVPDQTLDRVISAR 874

Query: 603 QTNDAMPKPVSVIDTTLNDHSLANDEAS 630
           QT D     VS   ++    S ++   S
Sbjct: 875 QTVDTAGGDVSSSSSSSTTSSSSSSTTS 902

BLAST of Lsi06G007560 vs. TAIR10
Match: AT3G04260.1 (AT3G04260.1 plastid transcriptionally active 3)

HSP 1 Score: 935.3 bits (2416), Expect = 2.0e-272
Identity = 471/619 (76.09%), Positives = 543/619 (87.72%), Query Frame = 1

Query: 4   YMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYC 63
           +MKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKR+QPN++TYALLVECFTKYC
Sbjct: 279 FMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRVQPNVKTYALLVECFTKYC 338

Query: 64  VIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMTRDNQ 123
           V++EAIRHFRALK F+GGT  LHN GNF DPLSLYLRALCREGR+VEL++AL+AM +DNQ
Sbjct: 339 VVKEAIRHFRALKNFEGGTVILHNAGNFEDPLSLYLRALCREGRIVELIDALDAMRKDNQ 398

Query: 124 QIPPRAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRRGRT 183
            IPPRAMI+SRKYR+LVSSWIEPLQEEAE G+EIDY+ARYIEEGGLTGERKRWVPRRG+T
Sbjct: 399 PIPPRAMIMSRKYRTLVSSWIEPLQEEAELGYEIDYLARYIEEGGLTGERKRWVPRRGKT 458

Query: 184 PLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYHRVVER 243
           PLDPDA GFIYSNP+ETSFKQRCLEDWK++HRK+L+TLQ+EGL  LGDASE+DY RVVER
Sbjct: 459 PLDPDASGFIYSNPIETSFKQRCLEDWKVHHRKLLRTLQSEGLPVLGDASESDYMRVVER 518

Query: 244 LKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGR 303
           L+ IIKGP  N+LKPKAASKM+VSELKEELEAQGLPIDGTRNVLYQRVQKARRIN+SRGR
Sbjct: 519 LRNIIKGPALNLLKPKAASKMVVSELKEELEAQGLPIDGTRNVLYQRVQKARRINKSRGR 578

Query: 304 PLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSE------- 363
           PLWVPP+EEEEEEVDEE+D+LI RIKLHEG+TEFWKRRFLGEGL   +V+  E       
Sbjct: 579 PLWVPPIEEEEEEVDEEVDDLICRIKLHEGDTEFWKRRFLGEGLIETSVESKETTESVVT 638

Query: 364 -------DDKSEPLDSLDDVDTVEDVAKEIEEEEAEEEEEVEQTENQ-DGERVIK-KEVE 423
                  +D S+  D+ +D D  E    E ++E  EEE  V +TEN+ +GE ++K K  +
Sbjct: 639 GESEKAIEDISKEADNEEDDDEEEQEGDEDDDENEEEEVVVPETENRAEGEDLVKNKAAD 698

Query: 424 AKKPLQMIGVQLLKDVDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKV 483
           AKK LQMIGVQLLK+ D+   T KK  +R+SR +LEDD DEDWFPE+ FEAF+E+R+RKV
Sbjct: 699 AKKHLQMIGVQLLKESDEANRT-KKRGKRASRMTLEDDADEDWFPEEPFEAFKEMRERKV 758

Query: 484 FDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMI 543
           FDV+DMYTIADVWGWTWE++ KN+ PR+WSQEWEVELAI +M KVIELGG PTIGDCA+I
Sbjct: 759 FDVADMYTIADVWGWTWEKDFKNKTPRKWSQEWEVELAIVLMTKVIELGGIPTIGDCAVI 818

Query: 544 LRAAIKAPVPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGI 603
           LRAA++AP+PSAFLKILQTTH LGY FGSPLYDE+ITLCLDLGELDAAIAIVAD+ETTGI
Sbjct: 819 LRAALRAPMPSAFLKILQTTHSLGYSFGSPLYDEIITLCLDLGELDAAIAIVADMETTGI 878

Query: 604 LVPDETLDRVISARQTNDA 607
            VPD+TLD+VISARQ+N++
Sbjct: 879 TVPDQTLDKVISARQSNES 896

BLAST of Lsi06G007560 vs. NCBI nr
Match: gi|659086064|ref|XP_008443746.1| (PREDICTED: uncharacterized protein LOC103487261 isoform X1 [Cucumis melo])

HSP 1 Score: 1202.2 bits (3109), Expect = 0.0e+00
Identity = 607/627 (96.81%), Positives = 618/627 (98.56%), Query Frame = 1

Query: 3   DYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKY 62
           DYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKY
Sbjct: 273 DYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKY 332

Query: 63  CVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMTRDN 122
           CVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRV++LLEALEAM RDN
Sbjct: 333 CVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVLDLLEALEAMARDN 392

Query: 123 QQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRRGR 182
           QQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPR+G+
Sbjct: 393 QQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGK 452

Query: 183 TPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYHRVVE 242
           TPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGL AL DASEADYHRVVE
Sbjct: 453 TPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALRDASEADYHRVVE 512

Query: 243 RLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRG 302
           +LKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRG
Sbjct: 513 KLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRG 572

Query: 303 RPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSEP 362
           RPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKS+ 
Sbjct: 573 RPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSDS 632

Query: 363 LDSLDDVDTVEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKD 422
           LDSLDDVDT+EDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKD
Sbjct: 633 LDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKD 692

Query: 423 VDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGW 482
           VDQ T TSKKSRRRSSRASLEDDRDEDWFPEDIFEAF+EL+KRKVFDVSDMYTIADVWGW
Sbjct: 693 VDQPTATSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGW 752

Query: 483 TWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRAAIKAPVPSAFLK 542
           TWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRAAIKAP+PSAFLK
Sbjct: 753 TWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRAAIKAPLPSAFLK 812

Query: 543 ILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISARQ 602
           ILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVPDETLDRVIS RQ
Sbjct: 813 ILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISTRQ 872

Query: 603 TNDAMPKPVSVIDTTLNDHSLANDEAS 630
           TNDAMPKP S IDTT+NDHSLANDEAS
Sbjct: 873 TNDAMPKPDSAIDTTVNDHSLANDEAS 899

BLAST of Lsi06G007560 vs. NCBI nr
Match: gi|778664211|ref|XP_011660243.1| (PREDICTED: uncharacterized protein LOC101209618 [Cucumis sativus])

HSP 1 Score: 1198.3 bits (3099), Expect = 0.0e+00
Identity = 607/627 (96.81%), Positives = 617/627 (98.41%), Query Frame = 1

Query: 3   DYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKY 62
           DYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKY
Sbjct: 273 DYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKY 332

Query: 63  CVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMTRDN 122
           CVIREAIRHFRAL+TF+GGT ALHNEGNFGDPLSLYLRALCREGRVVELLEALEAM RDN
Sbjct: 333 CVIREAIRHFRALRTFEGGTTALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDN 392

Query: 123 QQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRRGR 182
           QQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPR+G+
Sbjct: 393 QQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGK 452

Query: 183 TPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYHRVVE 242
           TPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGL AL DASEADYHRVVE
Sbjct: 453 TPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALRDASEADYHRVVE 512

Query: 243 RLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRG 302
           RL+KIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRG
Sbjct: 513 RLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRG 572

Query: 303 RPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSEP 362
           RPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGL SNNVKPSEDDKS+P
Sbjct: 573 RPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDP 632

Query: 363 LDSLDDVDTVEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKD 422
           LDSLDDVDT+EDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKD
Sbjct: 633 LDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKD 692

Query: 423 VDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGW 482
           VDQ TTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAF+EL+KRKVFDVSDMYTIADVWGW
Sbjct: 693 VDQPTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGW 752

Query: 483 TWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRAAIKAPVPSAFLK 542
           TWERELKNRPPRRWSQEWEVELAIKIMHKVIELGG PTIGDCAMILRAAIKAP+PSAFLK
Sbjct: 753 TWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRAAIKAPLPSAFLK 812

Query: 543 ILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISARQ 602
           ILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILV DETLDRVISARQ
Sbjct: 813 ILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQ 872

Query: 603 TNDAMPKPVSVIDTTLNDHSLANDEAS 630
           TNDAMPKP S IDTTLNDHSLANDEAS
Sbjct: 873 TNDAMPKPDSAIDTTLNDHSLANDEAS 899

BLAST of Lsi06G007560 vs. NCBI nr
Match: gi|700211623|gb|KGN66719.1| (hypothetical protein Csa_1G662830 [Cucumis sativus])

HSP 1 Score: 1128.2 bits (2917), Expect = 0.0e+00
Identity = 573/593 (96.63%), Positives = 583/593 (98.31%), Query Frame = 1

Query: 37  MMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLS 96
           MMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRAL+TF+GGT ALHNEGNFGDPLS
Sbjct: 1   MMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTTALHNEGNFGDPLS 60

Query: 97  LYLRALCREGRVVELLEALEAMTRDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGFE 156
           LYLRALCREGRVVELLEALEAM RDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGFE
Sbjct: 61  LYLRALCREGRVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGFE 120

Query: 157 IDYIARYIEEGGLTGERKRWVPRRGRTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRK 216
           IDYIARYIEEGGLTGERKRWVPR+G+TPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRK
Sbjct: 121 IDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRK 180

Query: 217 ILKTLQNEGLAALGDASEADYHRVVERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQ 276
           ILKTLQNEGL AL DASEADYHRVVERL+KIIKGPDQNVLKPKAASKMIVSELKEELEAQ
Sbjct: 181 ILKTLQNEGLVALRDASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQ 240

Query: 277 GLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTE 336
           GLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTE
Sbjct: 241 GLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTE 300

Query: 337 FWKRRFLGEGLDSNNVKPSEDDKSEPLDSLDDVDTVEDVAKEIEEEEAEEEEEVEQTENQ 396
           FWKRRFLGEGL SNNVKPSEDDKS+PLDSLDDVDT+EDVAKEIEEEEAEEEEEVEQTENQ
Sbjct: 301 FWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQ 360

Query: 397 DGERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIF 456
           DGERVIKKEVEAKKPLQMIGVQLLKDVDQ TTTSKKSRRRSSRASLEDDRDEDWFPEDIF
Sbjct: 361 DGERVIKKEVEAKKPLQMIGVQLLKDVDQPTTTSKKSRRRSSRASLEDDRDEDWFPEDIF 420

Query: 457 EAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELG 516
           EAF+EL+KRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELG
Sbjct: 421 EAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELG 480

Query: 517 GTPTIGDCAMILRAAIKAPVPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAI 576
           G PTIGDCAMILRAAIKAP+PSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAI
Sbjct: 481 GIPTIGDCAMILRAAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAI 540

Query: 577 AIVADLETTGILVPDETLDRVISARQTNDAMPKPVSVIDTTLNDHSLANDEAS 630
           AIVADLETTGILV DETLDRVISARQTNDAMPKP S IDTTLNDHSLANDEAS
Sbjct: 541 AIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDEAS 593

BLAST of Lsi06G007560 vs. NCBI nr
Match: gi|659086066|ref|XP_008443747.1| (PREDICTED: uncharacterized protein LOC103487261 isoform X2 [Cucumis melo])

HSP 1 Score: 1071.6 bits (2770), Expect = 4.9e-310
Identity = 538/556 (96.76%), Positives = 548/556 (98.56%), Query Frame = 1

Query: 3   DYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKY 62
           DYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKY
Sbjct: 273 DYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKY 332

Query: 63  CVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMTRDN 122
           CVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRV++LLEALEAM RDN
Sbjct: 333 CVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVLDLLEALEAMARDN 392

Query: 123 QQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRRGR 182
           QQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPR+G+
Sbjct: 393 QQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGK 452

Query: 183 TPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYHRVVE 242
           TPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGL AL DASEADYHRVVE
Sbjct: 453 TPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALRDASEADYHRVVE 512

Query: 243 RLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRG 302
           +LKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRG
Sbjct: 513 KLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRG 572

Query: 303 RPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSEP 362
           RPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKS+ 
Sbjct: 573 RPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSDS 632

Query: 363 LDSLDDVDTVEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKD 422
           LDSLDDVDT+EDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKD
Sbjct: 633 LDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKD 692

Query: 423 VDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGW 482
           VDQ T TSKKSRRRSSRASLEDDRDEDWFPEDIFEAF+EL+KRKVFDVSDMYTIADVWGW
Sbjct: 693 VDQPTATSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGW 752

Query: 483 TWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRAAIKAPVPSAFLK 542
           TWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRAAIKAP+PSAFLK
Sbjct: 753 TWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRAAIKAPLPSAFLK 812

Query: 543 ILQTTHGLGYVFGSPL 559
           ILQTTHGLGYVFG  L
Sbjct: 813 ILQTTHGLGYVFGRTL 828

BLAST of Lsi06G007560 vs. NCBI nr
Match: gi|823262880|ref|XP_012464200.1| (PREDICTED: uncharacterized protein LOC105783342 isoform X1 [Gossypium raimondii])

HSP 1 Score: 1045.4 bits (2702), Expect = 3.9e-302
Identity = 525/604 (86.92%), Positives = 565/604 (93.54%), Query Frame = 1

Query: 3   DYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKY 62
           +YMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPN++TYALLVECFTKY
Sbjct: 277 EYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNVKTYALLVECFTKY 336

Query: 63  CVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMTRDN 122
           CV+REAIRHFRALK ++GGT  LHNEGNF DPLSLYLRALCREGRVVEL+EALEAM++DN
Sbjct: 337 CVVREAIRHFRALKNYEGGTIVLHNEGNFDDPLSLYLRALCREGRVVELVEALEAMSKDN 396

Query: 123 QQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRRGR 182
           Q IPPRAMILSRKYR+LVSSWIEPLQEEAE G+EIDYIARYIEEGGLTGERKRWVPRRG+
Sbjct: 397 QPIPPRAMILSRKYRTLVSSWIEPLQEEAELGYEIDYIARYIEEGGLTGERKRWVPRRGK 456

Query: 183 TPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYHRVVE 242
           TPLDPDA GFIYSNPMETSFKQRCLE+WK+YHRK+LKTLQNEGLAALGDA+E+DY RVVE
Sbjct: 457 TPLDPDATGFIYSNPMETSFKQRCLEEWKIYHRKLLKTLQNEGLAALGDATESDYMRVVE 516

Query: 243 RLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRG 302
           RL+KIIKGPDQNVLKPKAASKM+VSELKEELEAQGLP DGTRNVLYQRVQKARRINRSRG
Sbjct: 517 RLRKIIKGPDQNVLKPKAASKMVVSELKEELEAQGLPTDGTRNVLYQRVQKARRINRSRG 576

Query: 303 RPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSEP 362
           RPLWVPPVEEEEEEVDEELDELISRIKL EGNTEFWKRRFLGEGL+ N VK  ++D+SE 
Sbjct: 577 RPLWVPPVEEEEEEVDEELDELISRIKLEEGNTEFWKRRFLGEGLNVNQVKLIDEDESEA 636

Query: 363 L-DSLDDVDTVEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLK 422
             D LD+ D VED  K+IEEEE EEEEEVEQTE+++ +R+  KEVEAKKPLQMIGVQLLK
Sbjct: 637 ADDELDESDVVEDAGKDIEEEEGEEEEEVEQTESREVDRIKDKEVEAKKPLQMIGVQLLK 696

Query: 423 DVDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWG 482
           D DQTTT SKKSRRRSSR S+EDD DEDWFPEDIFEAF+E+R RKVFDV DMYTIAD WG
Sbjct: 697 DSDQTTTRSKKSRRRSSRVSVEDDDDEDWFPEDIFEAFQEMRDRKVFDVEDMYTIADAWG 756

Query: 483 WTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRAAIKAPVPSAFL 542
           WTWERELKN+PPRRWSQEWEVELAI++M KVIELGGTPTIGDCAMILRAAIKAPVPSAFL
Sbjct: 757 WTWERELKNKPPRRWSQEWEVELAIQVMQKVIELGGTPTIGDCAMILRAAIKAPVPSAFL 816

Query: 543 KILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISAR 602
           KILQ TH LG+VFGSPLYDE I+LC+DLGELDAAIAIVADLETTGI VPD+TLDRVISAR
Sbjct: 817 KILQKTHSLGFVFGSPLYDEAISLCIDLGELDAAIAIVADLETTGIAVPDQTLDRVISAR 876

Query: 603 QTND 606
           QT D
Sbjct: 877 QTMD 880
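
The percentages on each HSP summary line follow directly from the two counts printed beside them (matches divided by alignment length). As a minimal sketch, assuming the summary line has the layout shown in the reports above, the values can be recomputed in Python; `hsp_percentages` and `HSP_RE` are hypothetical names introduced here for illustration and are not part of the database or of BLAST.

```python
import re

# Hypothetical pattern for one HSP summary line as printed in this report.
# "Pos\w*" tolerates spelling variants of the "Positives" label.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Pos\w* = (\d+)/(\d+) \(([\d.]+)%\)"
)

def hsp_percentages(line):
    """Recompute identity and positives percentages from the raw counts."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        # matches / alignment length, as a percentage rounded to 2 decimals
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # values as printed in the report, for comparison
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }

line = "Identity = 521/628 (82.96%), Positives = 574/628 (91.40%), Query Frame = 1"
print(hsp_percentages(line))
```

For the Theobroma cacao hit, 521/628 and 574/628 reproduce the reported 82.96% and 91.40%.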

The following BLAST results are available for this feature:
Match Name        E-value   Identity  Description
A0A0A0M091_CUCSA  0.0e+00   96.63     Uncharacterized protein OS=Cucumis sativus GN=Csa_1G662830 PE=4 SV=1
A0A0D2TZR4_GOSRA  2.7e-302  86.92     Uncharacterized protein OS=Gossypium raimondii GN=B456_013G119100 PE=4 SV=1
A0A0B0PXA0_GOSAR  1.7e-299  86.92     Uncharacterized protein OS=Gossypium arboreum GN=F383_10624 PE=4 SV=1
A0A061F8G9_THECC  2.8e-299  82.96     Plastid transcriptionally active 3 isoform 2 OS=Theobroma cacao GN=TCM_026112 PE...
A0A061F1L8_THECC  2.8e-299  82.96     Plastid transcriptionally active 3 isoform 1 OS=Theobroma cacao GN=TCM_026112 PE...

Match Name   E-value   Identity  Description
AT3G04260.1  2.0e-272  76.09     plastid transcriptionally active 3

Match Name                        E-value   Identity  Description
gi|659086064|ref|XP_008443746.1|  0.0e+00   96.81     PREDICTED: uncharacterized protein LOC103487261 isoform X1 [Cucumis melo]
gi|778664211|ref|XP_011660243.1|  0.0e+00   96.81     PREDICTED: uncharacterized protein LOC101209618 [Cucumis sativus]
gi|700211623|gb|KGN66719.1|       0.0e+00   96.63     hypothetical protein Csa_1G662830 [Cucumis sativus]
gi|659086066|ref|XP_008443747.1|  4.9e-310  96.76     PREDICTED: uncharacterized protein LOC103487261 isoform X2 [Cucumis melo]
gi|823262880|ref|XP_012464200.1|  3.9e-302  86.92     PREDICTED: uncharacterized protein LOC105783342 isoform X1 [Gossypium raimondii]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR003034  SAP_dom
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0005515      protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Lsi06G007560.1  Lsi06G007560.1  mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR Term   IPR Description   Source   Source Term        Source Description                  Alignment
IPR003034  SAP domain        GENE3D   G3DSA:1.10.720.30  -                                   coord: 262..295, score: 2.
IPR003034  SAP domain        PFAM     PF02037            SAP                                 coord: 262..294, score: 5.
IPR003034  SAP domain        SMART    SM00513            sap_9                               coord: 261..295, score: 3.
IPR003034  SAP domain        PROFILE  PS50800            SAP                                 coord: 261..295, score: 10
None       No IPR available  unknown  Coil               Coil                                coord: 314..334, score: -; coord: 369..397, scor
None       No IPR available  PANTHER  PTHR31407          FAMILY NOT NAMED                    coord: 184..407, score: 4.1
None       No IPR available  PANTHER  PTHR31407:SF5      PLASTID TRANSCRIPTIONALLY ACTIVE 3  coord: 184..407, score: 4.1
None       No IPR available  unknown  SSF68906           SAP domain                          coord: 261..297, score: 7.6