Lsi06G007550 (gene) Bottle gourd (USVL1VR-Ls)

Name: Lsi06G007550
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description: Pentatricopeptide repeat-containing protein
Location: chr06 : 14036432 .. 14062407 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS
CGTCCCGCCTTTTTAAACAGCAACACTTCACTTCACTTCACTTCCCATTCTTCCCTCTTCCTTCTCGCCGTTTCTTATCCAAACTTCCGCAGCCATTAATGTCCAAATTCCTGCTCTCTCACGCTCACCTTCTCACCCTTCCCCACAAACACCATTCCTTTTCTCTCAACCATGGCGTCGTTCCCATCCGCTCAGTCCTATCTGCTCCGGAGAAGCGAGGTAGAAAGAAGCGGCAGTCGCGGCAGCAACAATTACACCCAAAGGACGACGATTCCACTGCACTTGAGAAGGCCCTCCGCTTCACTTTCATGGAGGAACTCATGGACCGCGCTAGAAACCACGATCCCCTTGGCGTTTCTGATGTCATTTACGATATGGTTGCCGCTGGATTGAGCCCTGGACCTCGCTCGTTCCATGGATTAGTTGTTTCTCATACTCTCAATGGTGATACTGAGGGAGCGGTGAGAACTCAGAAGCTCTTTCTCTCTCTCTCTCTCTGTGTGCGCGCGTTTGTAAATTTGTTGTTGCTGTTGTTCTTGGTTTTCGTGTGTTTATGGGTAATGTAGTCGAATTGTATACTTGCAATTGAAAGTCTAGTGCTTGAACTTATCCCATCTTCCTTCCTTTTCCATTATCGGATATTTTGAAGTATAGGGATCTTTAGCCCTTGTTTTCTCTTTCCCGTTCGAAAAACTTATCTTATCGGTATCATAATTCGATGATGAAGTTTTCTCCTCTAGTATATCCTTCGCATTTTCACTTGGTTAACAGAAGGCAGGGAAACGCATTAACGTGGACTGTGTCGAATCTCTTGTCTATTTGCCTGTATGTGTGTGTCTATTCTTTCATTCGACCAGGTGTTTCTGTACATTAGGCTCCATCCATTGTCATGTAGACGTGTTTTTTTTTAGATGGAAGGGACGGGATACATTTTTATCCTGACAAAGTAAAACTTGTTTTGGTCCCATTGTAATTTTTGTTTACAAATTCCCAGATATGTATTTTTCTAAATCTTATGGAAATGCTGAATGTAGATGCAATCTCTGAGAAGGGAATTAAGTGCTGGACTTCGTCCTCTTCACGAAACGTTTGTTGCATTAGTTCGGTTATTTGGTTCCAAGGGTCTTGCTACTAGAGGCTTAGAAATCCTTGCAGCCATGGAGAAATTGAATTATGACATCCGTCAAGCATGGCTCATTCTTACTGGTAATAGCTTGAACTTCCTGTTGAGAATGCTCCGAATAGATACATTTTAATTATGCTTATATTTTGTCTTAGTTGCTTGAGTTATTTTGACTAGAGGAACTCCTAAGGAACAAATATTTAGAAGACGCCAATGAAGTGTTCTTAAAGGGTGCCAAAGGGGGTCTCAGAGCCACCGACAAGATTTATGATCTTCTGATTGAGGAAGATTGTAAAGCCGGGGATCATTCAAATGCCTTAGAGATCTCATATGAAATGGAGGCTGCCGGGCGGATGGCAACGACCTTTCATTTCAATTGCCTTCTTAGTGTCCAGGTGATGTATCTTACTATTGGATAATGTAATGCATGTCTTTTACATACATATTTGATCTTGTACATGCATAGTGATTTAAGGCTTCTGTTTTAAATTTAGCAATGTAGAAAGTGAACCTTATTATAATGGCTCTGGTAGTGTGCTCATATTTCTGTATGTACGTTCTTCTAGGCTTTACCAATTAAGCTATTATTGGTTGGTGTCTAGCAATAAGAATTTTTAACATTCAAATGTGTTACTTTGCTCTCTTGGTCTTTGTGATTTGTAAAACACAAGGATGATATTTACTTCTTCTTTTTAACTCTGCGATTACTTCTACGCCTCTTCCTTCTTCCTTCTTCCCTCTTCCATTTCTTCTGGTGTGGGGAGGGGGTCACTATAGGAAATGGGGAATTTATTGTTTGATGTGGCCTCACTTTTGTCTGTCCTTGAGGCAGTTATTCTTAGATAGGGGAGAAAGGATTTTAGGATGGGGAGTCCAA
TCCTTCGGAGGAGTTCTTGTGTAATTTTTTTTTTTTTTTTTTTTTAGGCTTATGGATCTTTCTCCCTTAGGCAAGTCAGTTTTTTCAGCTTACTAGAGGATTAAGTTTTCTAAAAAAGTTAAAGTTCTTTCTTTACAAGTTTTACATAGAAGAGTTAACACTTTGGATCGCCTTGTGAGGAAGATGCCTCCATTAGTTGGTCTATTTTGCTATATTCTTCGTTAGAAAACGAAGGAAGATATAGATCACATTCTTTAGAGATGTAAGTTTACGAAAGCCGTGTGGAGTGACTTCTTTCAGACTTTTGGTTCTTTCGTGAGAAGGGTTGTTTTTTTATGGCTTGCTGGAGTGTGCTATTTTGTGGTATCTTTGGAGTGAGCAAAATAATTGAGTGGTTGGAGGGTTGAAAAGGGATATTAGTATTTAATACCTTGTTAGATTTCTTGTTTCCCTTTGGACTTCGATTTCGAAGGTTTTTTTTGTAACTATTCTATAGATTTTATTTTGTTTTGTTTTGTTTTTTTTAAAGAAAACAAGGCCTTTCATTGATGTAATGATAAGAGTCTAATGCTTAAAATATAGAAAACAGAGCCTGAAAAATAAAAGTTTAAATACAAACTAATTTAAATAGAGAAAATGAAACCATTCCAATAAACTAATGTCTTGAATAGTTTATGTAGGTTCTATTTTGTATAGTTGCTAATGTTTTCAAGGTGACCCAAGGCGCATGCCTAAGACGAGAGGCGTTGCGACTTTGGGCCGTTGCGCCTTGAAGGTAGTTGGGGCGAGCTCCTTCAACCAAGCGTGCGCCTAACTATGCCTTTTTTCGCTTTGATGCGCCTTCGCCATATGTTGAGGCGAGCGCCTCTGTTTTTTTGTCTTTTTTTTTCTTTTAGTATATTAAGAGCAGAAGAGTCTGGTGAAAGACAAAAGATGGAAGAGAAGTATAGAGAAAACAAGTGAATAAAAAGAAAGATGAAGAGAGTGGTTGGAAGTAGAGATGGAAAGTAGGAAGAGAATTATAGAAAAAAGGAAAAATAAAAAGGTAGAAGAAGAGTCTCACTTGAGACATATGTGGTTGCTGCATGAAACTGCCAGATGGCAATATGCGTACTCACAAGACCTTTCCTAGAAGAATTAAATAGGGATAGTTGTATGAAACTGCCACCTGGCAATTTCCTCCCATAAATACAAGGATTTATATTTAATGGTTGCGTGTTTAATAGATAAATACTTGGTCAGACTACCTCTTAACCCCACAAAGTAAATTCTATGGCTATTACTTTACTTTTTCTAAAATAGTTTAGTAATTTTTTTCTAAAAAAAAAAATAAATAATTTTGTAATATTTTTATGCTTTTAGCCTTTTAGAATTGAGTAATTTCTTTTAATTTGCAAGCGTTCTAAATAATATTTATTCATTATTTTATTAAGTGCGCCTTGGTTCAATCAGGTGATGCCTTTTTGTCGCCTCTCGCCTTCCGGCGATTTGAAAACTTGCCTTAGTGTGCGCCTTGCGCTTTGAAAACACTGATAGTTTCTAGTTGGGCTCTTGTTTTTGTGAGGGTTTTTTTTGTATATCCATGTATTCTTTCATTTTTATTTCAACGAAAGTTGTTATCTTAAAAAAGAAAAGTAATTGTTTTTTTTTCCTTGATGACAAATGAGCCTTTTATTGAGAACAAATGAAAAATGATACACGAGCATTAAAAAAGATAGGTCCAAAAAAGAAGAGCCAATTGCTTTTACCAAAAAAGAAGAGCCAATTGTAATGATTTGGTCCCATCGTGAAAAATCCTAATTAATGGGGGTTTCTAACGTATGTAGATCTATAAATCATTTAGGGACTAGATGGTTTTCTTTTTTTTGGGATATAAATGATGACTTTCATTGAGATAAAAATGAAAGAATGCAAGGGCATAAAAAAACTAAGCCCTCAAAAGAGAGAAATTCTTATAAAAGATGGGACTCCAGCTAAACAAAATAGAACCTATAGTATAG
TTACAAAAGGTTTTCAAAACTGAAGCCCAGAGAGAAACACGATATCTAATGAGGGATTAGACCTCAAAAGAGTCCCTCTTTAACCCCTTAAACATCATGTTAATCTATTCCCCCCACAAGACCCAAAGAACAACACACGTAATCATGTACTCGAGACTCTTGGAATTTATGTAAAAGGATATGTTGGGGGTGTTATGGGTGTATTTTCCTAGTAAAGAGGCCTAGGTGCATCTATTGGTCCCTCACTTTTATGTTTCCGTGTATTTTGAGTAATAGTCTCTTTTCATTATATCAATGAAAAGTTTGTTTCCTTTTTATAAAAAAAAAACAGCACACACTTCGGCCGTCCAAAACACGCCCCTTCTCCCGAAAAGAATGGAGGAGGAACTCATCGATCATATCACTAGCAACCCTATGCTGAGCATGTGAGAAACCAAACGTCTGAAAGAAATAATCTCGCACAGCTCTCCCAAACTCACAACGTTAGAGAATATGATCCAAGTATTCCTCTGCTTTTCGACTAAGAATACAACAAAACAGACCAACAGCCGATGGTAACTTCCTTACAAGCTGATCCATTGTGGATGCTAAAGACGTAGGGATTAGATGTATTTTTAAACGATAAAGTACTAAATTCTTACATGCTAAAGATCAATTCCTCACTTTAACAAGATTTTTTTGTTTGTTTATTTTTTTTTTGTTTTTTGAAATAGAAATAAAACTTTTCATTGATAAAATAAAAAGAGGCTAATACTCAAATTACAATGAAACAAAAGAGCAAAGAATACAAGAATCTTAGGATCAGTAGGTGCACCCGAATATCTCAACTACATTGACACATCCCTAGCACTCTCATCAAATCCCTATACAAAATAAATCCACTCGAGAGAATTAATGTGTAACGACCCGACTTCCTAGGTCTTGTTCTAGGTCATTACGTCATGCATGCATACACCTCAAAATGACACATTTCATATATTTGACAAAAGTCAAAGAAATTTATAAAATAAATAACTTTATTAAATTTAAATGACTAAAAATCTTGGATCGGGGTACCCCTAGAAGTTATGAAAACAAAACTGTAAATTTTGCATAAATATCTAACAGCGCAAAACCAATATTCTGAATTTAAATATTAAGACAGGAGTTTAAAAACATGAAAACATAAAAACATGAGCGGAAGCATTTGAATGGTCCCAACGGCGCGGTCACGGATTCTTCCTGTTATTCGCCTGTTTATCCTTGCCTTTACCTGAAAGTTGTAACATAAGAAAGAGTGAGTATAAATACTCGGTAAGTGACCCCATTAGCGGGATCACATGCATTCTAATACTGTTATGGGTGAACATCTAAGTAGATAACCTGCCCGCCCAGTCACTCATGTCTAGTGGCCTGAAGACACACCGTCAAAATCATGAAAGTGAACCTGATGGTTCACGATATGGTAACATGGAGATGAACCCGTCAGTTCACACATGTCTAGTGGCCTGATGGCGCACCGTCAAAACATGGAAGTGAACCTGTCGATTCACGGTATCGTAACATGAGGCGTGACGGGGAGGTAAATCACCACTATCTGTCTAACATACAATTCATATATATATATATATATATATATATATCTAATAATAGAGTCTCACACCAACAAATCATATACTCATAAAAACATAAGTGTGCATGCTCAATTTCCATGGATCATAAAAAAAATCACCCAAATTTCACTGATAACTTCGGTAACATGAACTAGGGCGTCTAGCACAAATGCAAGTCGATAATAACGTCACTTACCTCAAAATTAAGTCAACTTCACCACCAATTCATCTTTTTCCTTGCCTTGAGTAACCCTAAACACAAATTAAAATTAACTTAAATATTCAAACTTAAACCAACGAAAATGCCTACACTGACGATCTCCAAAAGCTTACCTAAAACGAGGCTTGAACTAAGATTGAACCCAAACTTTTCTCAACAAACCAAACCTAACACAAGATCAACCGTTC
AGAATGGTACCCCATATGTCTCGAACAGGGTGACAAAATTTCACTACGATCCGACGGTTGAAACTTCGGCAAATGACAAGTTTCCGAGAGCTGTCGCTGAAAAATCGAACTGCACTGCCTCTTTCTCCTTCTTCGCTTTTCGGTGTAATTTTCTCCCTCTTTACCTTATTAGGTCACATAATAATATATACGTACAATCCCAAACCCTAATTCCTTAATAAATTAGGGTTACATATAGATATTTTAAATTTATTTTCCCTCCACCAATTCCACTAAACAAATTAAATCCCCAAATTAAAATTCAATTTTCACCAATTAAATTTAAATCCAAATCTTTTATTTTTAATTTAAATTGAATTATCCTTCCAATATAATCCAATTAAGCCTTAAAAATTTAAAATAACACTCAAATAATTTAAGATAATTAAATTAAAATTACCTAAAATTTTGGATTGTCACATAATGTATCACAAATACATGGTTACATAGAGAAAATAAAAGAACCCTAATTAAAGCAAATATCCTAAATAGAATAATTAGCAAAAGACTTTGATTGGGAACTCCACGAAGATGCTAGTAGATAAGTGGAACCAAAACGGTCTGTCCATTCTGAGGGCTTATCGTGAAAAACTCTCTGATTTCGTTTGAATCAAATCTATGAAATTAAAGCTTTAACGACATTTGACCAAAGAATCTGGGCTTTAGAAGAAAGTGAAGGGCCAACCAAAATTTGGGACATATTGTCTCGAATTGATTGCCCAAAAATCCAAGATAGGTCAAAGTAGGAAAATAACTTCCACCAACACTTGCTTGTGTAGGAACACTCGAACAATAAATGTTGTAGGTTTTCTCCATTTGCTTGACACAGTGGGCAGATAGATGGAGATAAGCTTTGGTCCCTTAGCTTATGTTGTAAAGTAGCCGAACAATTTAGGGATCCAAAAAATAGAATCCATATCAGCACATTGATTCTTTTGGGGCTCTTGGATTTCTGTAACACTTCCTATTGCAAGCAATAACTTGATATTAATTGAAAACGAGATTACAAACTTTTCACTGTTCCGGGAGGGTGGTTCTCTCCCAACGTCGTCAAGACTTTGACTTTCCCAAATGTATCCCCCCAAAAGTAATACAAAATATATATATATGAACACATTCACAAACAGACAAACTAACTAAACGAGTGAATTGCCCTTTTTGCCCCTCCTGCTGTACGTATGCAAAATTGGGGGCCTAACAATACCCCCGGGGATGAAAGCTTCCTTGTCCTCAAGGGAGAAATCAGGAAATTGTTCTCGTATGAATTCCATGCTCTCCCAAGTTGCCTCGTGTTCTGGTAAGCCCGTCCATTGAACTAAGAGTTCCCGTTCCCTGGAATCGTCATTAAATCGATAGGCATAAACTTCGTTTGGGATCACTTTCCATTCAAAATCGTCTGTAAGTTGTGGAGCATGAGGTTGTATGGGCGTAACACTGCTTAAGGCATGTTTCAGTTGGGACACATGAAATACCGGATGGATAGTAGCTTCGTCGGGCAGTTCTAATTTGTAGGCGACCTTGCCAATCCGTTCCGTGATGCGATAAGGTCCGAAGTATTTAGGGGACAATTTTTCGTTACGCCGTTTGGCCTCAAATACTGACTGGAGGTGCCTTTTATGCATATCCATGTCTGGGCTGTAAATAAGAATGTCGTCAAAGAAAACTAATACAAATCGTCAGAGAAAAGGTCGAAAAACCTGATTCATTAAGGGTTGGAAGGTGGCTGGAGCATTCTTCAGGCCGAACGACATAACCACAAAGTCGTAATGTCCCTCATTGTACGAAAGGTTGTTTTCTGGATATCTTTCGGATGAACCCTTATTTGGTGATACCCTGATCTTAGGTCTAATTTAAAAAATACCTTAGCTCCGTGTAGTTCATCTAACAATTCTTCTATGATGGGATTGGAAATTTATCAGATACTGTTAGCTCATTGAGAGCTCTGTAGTCAACACAAAAAC
GCCAACTACCATCCTTCTTCTTCACGAGGAGTATTGGGCTTGCGTACGGCCCATGACTTGGACGAATCACACCAGACGATAGCATTTCCTTAACCAGCTTCTCAATTTCATCTCTCTAATCTACGAAATATCTGTAGGCCCCGAATTCACTGGCTGGCTATCGCCCTTTAACGCTATGTGGTGATCGCACTCTCAGTGGGGTGGTAAACCAACGGGAATCTGAAAAATGTCCATGTTGTGGGACAATACTTCCGACACTTGAGGATTTATGGCTACCACTGTAGCTTCTGGGAGAATTTCTTTGTCAATTTCGGCCGTTAAGGACCTTAGTTCCACAAGGAACCCCTGATCGTCAGACCTCCACGATTTCCTCAAACTTTCAATGACACCTCTCGTCTGAGCAAGGTAGGGTCGCCTTTGATCACTACTTTTCCTTGTCCGGTTCCAATTTTTAACGTTAGATTTTTCCAATCCACCACCGTCATGCCTAACGAATGTAGCCATTGCATGCCTAAAATTACGTCAACCCCTCCCAACTCCAGCGGCAGGAAATCTTCCTTAACCGTCACACCCGGAAGGGAAATCACCACTTCTTTACAAATTCCTCTACCTCTGACCGCTGTCCCTGTTCCCATGATGACCCCATAGTTGGACGTCTCTGTCACGGGAAGCTTGAGTCTTTCTACCACCTTGGGAGAAATGAAATTATGAGTGGCTCTGCAATCTACCAGGACTATAACGTCTTCCGCTTCAATCTTCCCCCTAACTTTGATAGTTCCTGGATTAGATATTCCTACCACAGTATTTAGTGATAGCTCCACTACTTCCTCGACAGTAGGCTCCTCCGGATCTTGCATGTTCCCTTTCTCTACTTCTTCGTTCCAAACTTCGTTGTCTAGCACCATCAATCTCAGCTCCCTGCTCTTGCACTTATGCCCCACCGAAAATTTTTCATCGCAGCGATAACACAAGTCTTTTTCTCATTTCGCCTGAAGTTCGGCGTCCAACAATCTCTTAAATGGGATGTCTTTTCGTTGGTAAGGCTGCGTATTGGACAGAGTGATAGAGCGCACCGGCACCGCCTCAATGGCCCTAATCGGGTTCTTCGAGGTAATCCCATGGGCCGAGTGACTCGTTGGGCCGGTCCTTATGTCTTTCCCCATTTCGAGACGGTTCATCATGATGAGGTTTTTATCCTCAACCCGTTGGGCCGCTTGCATAATCTGATTTAAGCCCAACGGCTCATAGCACATGACCTCAGCCCTGACCTCTGGACTTAATCCATTCAGAAACGTACTTTCCAACACCTCATCAGTCAAGTGGCCCAACAGCGCCGACAAAGCCTTGAACCGCTCCCTGTACTCAGCCACTGTGCTCAGTTGCTGGACGGCCAGGAATCGTGCGCACAGAGACCCCTCCTGAGTTGCACGAAATCGGTCCAACATCCGCTTCTTCAAGTCTCTCCACCCTACAAACTGTTCCCGTCCATCCGCCCAATTATACCAGTGTAGTGCTGCGTCGGTAAAACTAATAGATGTAACAGTGACTTTTTTCGATTCAGATAACTGGTGGATTTCAAAATACCTCTCGGCTCGAAACAACCATGAATCGGGATTTTCTCCATCAAAGATAGGCATTTCTACCTTCTTGAACTTTCCGCGGTCGGTGTAACCTTCTTCGGGTTTAGAATGTTGATTACTCTCTTCTCCTCCCACGTGCACTTCCAACATCTCACCCTCATCTTCGGAATATTTTGCTTAGGTTGAATCACTCGATCTTCAGGAGAATGATTGTTATGCAATCCCTTTACAAGGAATGATATTGCTCTCTGAGTTTCATCCGTTGCATTTCTCTGCTCCTCCAGATTGTGGAACATCACACCCATATTCTTCTGGTGTTCCTCAAAGATTCTCCTTTGGTTATCCATCTGTTGGAAAATCTTATTCATATTTTCGACCAGGTCTCTCATGGAAGCATCTAACCCAGGCAACTTTTGAATT
TCTTCTCTTACCTCACTCAAAGTTCTTTCAGACGCCTCAACTCTCTCTTCGAGACGCTTCTGCGCCATCTTCTTCGGCCTTCCCAGGATGAACGCTCTGATACCAAATGTAACACTTCCTATTGGAAGTGATAACTTGAAATTAATTGAAAACGAGATTACAAACTTTCCATCGTTCCGAGAGGGTGGTTCTCTCCCAACATCAGTCAAGACTTTAACTTTCCCAAAAGTATCCCCCCAAAAGTAATACAAATTATATATATATATATATATATATGAACACATTCGCAAACAGACAAACTAACTAAACGGGTGAATTGCCCTTTTTGGCCCTCCTGCTGTACGTATGCAGAATTGGGGGCCTAACAATTTCCAAAGACTTTGGTATATCTCCTTGTCAATAGGAGAGGAAGAATTTAAGTATTTAGACATAGATTTCACCAAGTACCTTCTTGAAGCATCCAAGGACCACACTCGAGAATCTGAAGAATGAGAAACCCTTCTGTTTTCGACTCTGCATAAAAGCTACTGGAATTCTAGGATCTCCGATTCTTTTAGCTGCCCTTGGAAGATTAAATTCCAAGACCCAGTGGTTGAATCCCAATGGACACTAATACTACCATTAGGAGATGAGACAATGCGAAATAATCTTGGAAAACTCATTTGAAAGGAGACTCCGAACACCATGAATCAGTCCAAAAGTAAACGTTTCTTCCATTCTCAAGTTTAAAGTTTGCAAGGGAACCAAAATTATCCCAAGTCTTTGAGATGCTGATCCATGTGCTTCTCAAACTCAAACTGCGCTTACCGGAGATGTGCCAATGATGAGGGTTAGTACTATGTATGCTTTTAACCACTCTACACCAAAGTGTGTCTTGCTCTTTGATGAATCTCCAATTCCATTTGAACAAAAGTGCTGTGTTTCTATTTTTCAGATTGCCAACACTAATACCTCCATCTTCTAGGGCTTTTGAAGCTGTTTCCCATTCCACTAGGTGATTTAACTTGCTACCTTTTTGTACCTCCCACAAAAATCTACCCATGATTCTCTTAATTTCTGCAGTAACCGAGCAAGGCATTTGGAAAATAGACATATAGTAAGTAGGTCGATTTGCCAAGACTGCATTACAAAGAGTTAGTCCCTCTAGACAAATTGTACCTTTTCCATCTATCAAGTTTCAAGTGCACTTTTTAAATAATTGGCTGCCAAGATGAGTGTTGCTTCGAGTAACCACCAAGAGGCAATCCCAAGTAGATGAAGGTTAATTTACCAGCTTTGCAATTCGGCCTACTTGCGTGGAATTTAAGGTAGCTTCATCAACATTAACTCCATATAAAGCCGACTTGTCCCAATTAACTTTTTGACTTGAGCACCATTCAGAAAAACCAATTGTTTGAATTAAAGTGTCTAGCATGGCCTCATCATACTTGCAAAAAAGAAGGGTATTGTCTGCAAATTGCAAGATGGATAAACGAACTTTGTCTTACCTACCAAGAAGCCTTCAAACAAACCATTACCATGTATTCGAGAGATCAAAGCACCAAGAACTTCGCCAATAATGAGGAATAAAAATGGTGATAATGGGTCAACTTGGTCAAATGCTTACTCAAGATCTAGCTTTAGAATCCACCCTTTCTTATTCTTTGCCCTATAGTCTTCAATTGCTTCATTAGAAATTAGAACCGGATCTAATATTTGCCTTCCCACTATAAATGCACTTTGTGTATTAGAGATGATGCTTGGCATTACCTTCCTTATCCTCTTGGCCAGGACTTTAGAAACTTTCTTGTACATTGATGTTGTTAGACTTATAGGTCAAAATCCTTCACATGTACTGAGTTTTCCTTATTGGGAATGAGACAAATGAAGTTTTCTTTTACACACGCATTTAACCTCCCATTTGTATGGAACTCATCAAACATTTTCTGGAAACTTTCCCTAAAATTCTGCCAATTTTCAATAAAAGACTATACTGTGAAACCAAGGTAGAGATTAAATTG
TTACAAATTTTCAACTATTTGTACATTAGTAACAAATTATAGAATATAGGGATAAAGTTGTTTCATATTAAAGATGGTCATGGTTTGGCCTAGTGGTCAATGGGAGAAATTATTGGAGGAGGTTGGAAGTGGAACAATGTGGATAGTAGTTTTACATCGTTGGGTGTTTGAGTGGTAAATTGCCAAATGTATTTCACAATTGTATAATGTATCCTATAGTTATTATATGAGGCATTTAGTTTTTTATTTTTTTTAATCCATGGTTATTAGGTTAGGATGGCCACATTTTATTAATTTTTGAGTCATTGGGGCCAATAAAAAAAAGTAATTGGCTCAAAGGGAATAAGTTCAAACCATGGTGACTTCAATGAGTTTTCGTGGTAATCAAATGTTGTAGGATTATATAGTTGTCCGCCCATGAGATTAGTTGAGGTGCACGCAAGTGCCTTTAATGAAGCCCACCTCGACACCTAAAGCCTAATCTTTCACTTTTTTTTTTGAATTTTTATGGAATAAGCTAAGAAATCTATATCTACATATAAAAGGGTTATGTGGGGGAGAATCTTTACATCTCCATTTTGCCCGTAAATCTTTTTATAATTCCTATTTTGTCTTTCATTACAACTATTTAGGTTGAGATAGTTACACGCGAAGTGTTGTATACCTTCGTGTAGGTGTGATCTAGACCATAGATATAAACTAAACGGTGGATGTCGTCGTTTGCCCTGTTACAACTATTCGGGTCAAAATGATTACACGACTAAGTGGTATATCCTTTTGTATATGGGTGTGATCTAGACTATAGATATGAAACATCACGATCTAAAGGTGGAGTTATGTTGGAAGTTGAATGGATAGTCACGTTCTTGGGGGTCAGCTAGCCTGTTAGCTTCTTCTTGGAGTTCGCTGTCCAAGTTTTTTAAAGATTATATGTTCAAGATATTTGTCTTAATTGAAATGCTTTTATCTTTTCTAAGTAATCAAGCAATTTAGTATTTTTGTAATCTCTGAATACTTGTATCTCATATGGGAAATGATGAGGGTGCTACGGTGGTGTAACACAATTGAGATGTTTGGGTGCACTTTCTAATCCACAACTTCTAATTGTCTTGGTTTGCTATTTTTGTCTCATTGTAATTTGAGCTATTGCTCTTTTCATTATATAAACAAAAAGTTCTATTTCCGTTTAAAAAAAAAAAAAAGAAAAAAAGACAGTCACATTCTTTGCCCCTTAACATTTCGTAAATTTCTATTTTGTCTCTCTTCAAATTGTTATAAATGTTCGAGACAAACAATGTAAAACACATCTCTTTTTTTATTATTTTTTTTATGTACTCAAATAAAACTATAAATTTCATATGCATCACACGTGTTATGTCTAGTATTCTTAAAATATGTTATTTTTTTCTCTTCAATCTCATAAAAACTTTAATTTCTTAAACTTTTCATCATTTTATTATTCATAATACTATTTTTCTTTGTATATACAAAATATATTAATATATTCCTATTGTGCATCTCACAATGTACTCATGCTTAAGCCTTAGAGGACTATTGCGATTTAGAACGTCATTAGAGTTTTTTTAAACTTGTATATACTTGTGCAATGGCTATGTAATGGATGATATTTATTTCCTCTATTCTAGTAAATTATGAATATAATTATTGTCCATTTGTGTGTGAAGACTTACTATCTGTTAGAGAGATCAAACTGATATGATTTATTTACTTGTTTAGGCTACTTGTGGAATACCTGAAATTGCTTTCTCAACATTTGAGAACATGGAATATGGAGAAGGTTAACATTTTATTTTCCTTTTTTCATGCACTTTGTTTGATATTGTCAATAATGCTTAGGCAATCTATATTTCGTAGTAAATGATTGTTCTTTGGTTGTGTGTGTATGTGCTAGTTCTCCATTTATTTCTTTTCATATACTTCAAAATAATTATGTTATTTAATGATTTTTGTAATGAAAAAAATTAAGATTAGACGTGGAG
TATCAAAATAAAATTAAATGAAAAACAAAAAACAAAAGAACCTCAAGAAGAGTTCAACTAGAAGGGACTGGGCCCAATTGCCTTAGGTGTGGACCGTTGAACATGATCCAACATATTAACTACCATACAAAAGATTTAACCTTCTTCGAAATTTTGATCTTCTAGAACAACCAATGATCCACGACCTCCACATGTTACCAAGGCCTTTCTTGTTCAACACCTTATCCACGAAGTCCCAGTCCACATGATTGTAAGCTTTTTGAAATTTAATCTAATTTGATATTTAATCTAATTTGATTTTTTTTTGTAAGAAACAATTTCATTGATGTATGAAATTTACCAAAAGGATTAACAATCAATAGAATTACAATAAGCTTTTTCAGTTGTTTAGAAGGGAGGCATATGAGTAAGAAGCAAAAGAAATTAGAGCTTTCACACCAGGAAATAACATAATAAAAGGGGGGGGGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAAGGGGGGGGGGGGGAGGTATATGCGTGAGAAGCAAAAAAATTAGAGCTTTTACACCAGGAAATAACATAATAAATGACAAGGTCAAAGATGCTGAGTTGAAGGGGCTTAAGACTCAGATCCAAGAGTGGCTCGAAACCATCATGGGTTGGCTTAGTGGTAAAATATGAGGGCTTGACTTTGATAAAAGGCTAAGAGGCCATGGGTTCAATCCATGGTGGCTACCTACCAAGAATTTAATATCCTATGAGTTTCCTTGACATCCAAATGTTGTAGGGTCAGGCGGGTTGTCAAAAAAAAAAAAAAAATTCAAGAGTTGCTTGATAAGGGTTTCATACGTCCAAGTGTGTCGGCTTGGAGTGCACCAGTTTTGCTTGTGAAGAAAAAGGATGGTTCTCTGCGTTTTTGCATTGACTATAGGGAGCTGATCCAAGTAACCATCAAGAACAAGTACCCTCTCCCTAGGATTTACAACACCTATTTTTTGATTGTCATTATGCTGGAAAATGTTGGCAACGACTGTTCAGCCTTTTGAACTAAGCTGGGTTTTTGGAAGCAATTTCAGGGTCAATGTATTGCGGATTTTGGCTGGTCCTTAGTTGAAATGCAGCCCTTGTTTGCTTTGGAACAATGCTGACAAAGCATTGTTAACGAATTTATGGTTTGAAAGAAATCAAAGAGTGTTCAATGATAAAACAACTCCTTGGTTGGATCGATTCGAGTTAGCAAGACTTAACGCTTCCTCATGGTGTGCTCTTTTCAAATCCTTTGAAGACTTCTCA
GTTCATAATATCGATCTTAATTGGAAGACGTTCATACATGCAGAGTTTTAGTTTTGGGTTTTGGTCTTGCTGTTTAGAGTAATTTCCTTATGTATTAGGGCGCAACTTGTTTGTAGTTTTTACTTCTAGTTGGTTGTTTTGGCTGTTTTTATTTTCGCTTTGCTTTGAAACATTTCGCCCCATTGTATTCGGGGAGTATTAGTTGTTCTGTTGGACTGTCTTGTTCTGACATTGGATTTTTGTTTTGCTCGCTTAGTTGGAGATGATGAGAGTGCTAAGGGGGTGTCAACATAGTTGAGATGTTCGGGTGCATTCACTAATCCCTAGATCTGTTTGCTTTTGTATCCATCTATGTAACTTGAGCACTAGTTTCATTTCATCATTTCAATGAAGAGACTCATTTCCTTTTTCAGAAATAGTTACAGGGAGCCACAGTTTTCTCTAACATTGATCGCTGTTCAAGTAGCCATCGTTTAAGAATAAAAGATAGTAACATCCCCAAGGCAGCGTTCAGGTCTAGCTATGGCATTATGAATTCATTGTAATGTCTTCTAGATTGACTAATGCTTCTGCAGTGTTTATGGACTTGATGAACAAGATATTCAAAGAGTTTCTTGACACTTTTGTGGTAGTTTTTATTGACGATATCTTGGTTTATTCCAAGACAGAGGCAGAACATGAGGAGCATCTCAAGAGAATTCTAGAGACTCTAAGGATGAATTAGTTGTATGCCAAGTTCTCTAAGTGTAAATTTTGGCTGTACCAGGTGTTGTTTCTAGGTCTTTCTGTGTTTTGAACACATGACCTTTTAGCCATTTATCAAGGTCATGCCCCCATATTTACCACTAGGCCAACCCATGATGATTTCAAGCAACTCTTGAATCCGAGTCTTAAGCTCCTTCAACTCAGCTGGAGTCATTATGTATGAGGTTTTAGATATAGGAGTTGTGCCTGACTTTAGCTCAATACCGAAATCTATCTCCCGGAATGGTGGCAACCTCGGAAGATCCTTTAACAAAACATTTGCATATTCTCTAATAGCTGGCACTGACGTCGAGGTGACCTTAATGCATCATGTATCAACCACATTGACCAATATGCCACATGTGCCTTGGTTGATCAACTTTCTTGCTTTTATAGCTGAAACTACTTTAGACATGACCTTGGTCTTGGCCTCTTTAAAGTTAACACTGGTTTTTGCTGGAGGACTAAAAACAACCTCCTTATAAAAGCAATCTATGCTGACATGGTTTTCAGCCAACAAATCCATGCCTCAGATGACATCAAAGTCTCGCATGTCTAGAACTATCAAAGTTACACCTAAAATTTGTTTAGCTAATAAAACCTAGCATGCTTTTATGCTTTCACATGCCAAAACATGTTGTAAAGGTTCCAATTCTAGCCTAGCATGATATAGGATTGAGTTACTGCAGCTGCCGGTGGATGAAGTTGTCCTTGTTCTCTGTTTTAGTTTGTTTTATCATTGATATAGGATTGAGTTAGTAGGCTCTGGTTGTTTTAGTTGTGCTGGTGTTTTTGTTTGTTGTTTTTCTTTGTAATGTTTGTAATGTTTGATGATTGGGGTGTATTTTGCCTTTGGTCTGTCCTGTTTATTTCATCTTTGTTTTATTTAGGGATGTGATGAGGGTGCTAAGGGCATGTCAACCTAGTTGAGATGCCTGTTGATCCTTCACATGTTTCCTCTTGTAACTTGAACTTTTGTATCTTTTCATTATCTTAATGAAAAGGTTTCGTTTCTTGTTTCAAAAAAAAAAAAAGTTCTAGCCTAGCATGCTTTACAAAAGCTGATGATACAAAAGAATGCGAGGAATTAGAATCAAGTAGCATTAGAGCATAGTGACCTAAAATGGGAAGTGTACCTGTCACCACAGGATTAGACTTCTCGGTCTCTCGTTGAGTCGTAGTATAAACCCTGCCTTGCTATTGCTGTTGTATTCTTGAATTTCCTTGATTTGCACTCGAGGGCTGATCCCTACCAC
CTGCAACATTCCTACTAGGCCACCTATCAGTCATATGTGCTTCTTGCCTACAGCTATAATACACTCTTGCTCTGGTCAAGTAGCGCCCTTTGTGACGCTTTTTGCAAGAGCGACACACTAGTCTCTCTTCCTAGGCAGTTGCTATCTTGGCAGCATGTTGCCTGAAGTCGTGTTGCGGTCTTTCTGTGTTTCAGTACCTCTGCTGGGAATTGTAAGACATTTGATATGACTTCCTTTTTTGGCTCGAAAAGGAACCTGGTCCTGAAAATCTCAAATACCCATCCATGACATGAGAGTCTATCCGTGTAGTTGCACGAAGGATAGAGCACTGCTACATGGCTTGATTGTTCAAGGGCTGATACAATGCACCACATCCCTTCCCTCAAACCTTGGATAAATCTTTCCTTCTTATCCTCGTTAGTGGCTACCTGATATGAACGGAGGAACCTAAATCATGGAATATGAACATCAAGAAAGACGCTAAGAATGCACTATATATTGTCCAAATTCACAATGAAGAAAACTTGCATTTATATAAAGTCAAGTAGAATTTATGAAGAATCAAGCTTCATTATGGAACAAAGACTTCAAGTGGGACTTCACAACTCAAAGATGGTTACAAATTGATGGGTTGTTGACTAAAAAGAAATATTTTCGGATTTATTGTAATAATTAAATGTAATTTTCAGTTTTAAGTACTTTTAAGTAATTTTATGTTACATTTAGATCATGGGTGCTAATGGTCATCAATTATGAGAGTTACAACTCTTTAAATGCCTCATACAATGTCTAGAGTTATATTTTGAGGCCTATATAATGCCATGTAATTGTTCAAAAAGGGGAGACTTGGAAAATATTGTAAAAAGAGCCTTTGAGCTTTCTTTTATAGCCTTGAGCTATATTTCCTTTGCAATTCTATGGTGTTTAGGTTGCATGTTCAGAATCATTCAAGCTTGACTTGATCGATCTTGCTTGTGGAGTGATTCGAAGATCAAGGGTCAATGAGAGTCGCAAGCTTTATCTGATCCGTGGTTTTGTTGATCGTCTAAGTAATCCGTCGTCTAGCCATTCTTAGGGGCTGAGATCATCTTGATTGGGAGTCTTTTGCGGTTGAGTTCTAAAGGTCTTATTCGTCAACAAGGTTGTTAGATCCAAAGTTTTGAAGGTTACTTATCACTACCAGCTTAGGAGCAAAACGAGATTGCTTGTTGAATTCCCTTTCGTTCAACAACTCACGTAACTCCCTGCTTAAGGTTTAAAAACTCTATCTGCTAAGGTTTAAAAACTTTGTCTGCTTATTAAACCTTGTCTGGGTGGAAAGTGCTGCACTTGAAAACACGCTTTAAACTGCTCCCAAGTCTCAGGTCTAGCACTAGTGTTTATCGACCTTTCTACAAACTGCCACCAAATCTTGACATCATTCGTGAGCATGAGCACTGTACACTGTAGCTTCTAGTCGTCAAGACATTTCATATATCAGAAGATAGTCTCTATTAAAGAGAACCAAAGTTTTGCCTTTATGTGGTCCTTCAGTGACCTATCAAATGTACGGGGATATATTTCCTCAAATCCTACAAGTGTTTAGCTTTAGGTGACAAATCATGTTAGTTCTGACCCTACACCTTAACTTGCTGAGCTATTGGATCAGCCACAGCTGCTCGAGCCAACTCCTCTATACTCTGGATCATAGCTGTGGTAAAAGTTGTCCTTGCTTGTGCTGCCATAGTAGCGAAATCATTCTGAAAATTGGGTGGGGCCTATGGTTGCAAGGTCACTGGTGGGTCCTAAGGTTAACCTCCTACTGGCACATTCCGATATTTCTTTGTGTTCTTGGCACTGGAAGGTCTTGTGGCATTACAAAAAAGTCAGGGTCTTGTTGCTGTTGTCTTCAGGAAGATGACAACAATAAGTCAGGGTTTATGCTACGAGTCAACGAGAGTCTTAGAAGTCTAATACTGTTGTAACAGGTACACTCCCATTTTGGGTCTCTATGCCAGTGC
TATTTGCTTCTGGTTCCTCGCATTCTTTTGTATCATTAGTTTTTGTAAAGCATACTAGGATAGAATTGGAACCTTTACAACATGTTTTGGTAGTTTCTACTCCATCTAGAGTTATATTTTTTGGCACGTGAAAGAGTAAAAAGATGTCAGGTTTTATTAGCAGAACGAATGTTGGTGTAACTTTGATAGTTCTAGACATGCAAGACTTTGATGTCATCTTAGGCTGAGATTGGTTGGCTGAAAATCATGCCAACATAGATTGTTTAGGTAAGGAGGTTGTTTTTAGTCCTCCGGCTAAGTCTAGTTTTTAATTTATAGGAGCAGACTGAGGTCTTACCTAAAGTAGTTTCAACTATAAAAGGAAGAAAGTTGATCAACGGAGGCGCATGAGGCATATTGGCCAGTGTTGTTGATACTGGCCTGTTGGAATAGAACACATGCACAACGAAAGCAAGGATCTATATACACTCTATAATGCATGAACAATAGGAAAAGCATGCTGAAAATTCAAGAGAAAAACAAAAAGTTTACCCTTGTAGACTATTGAACTTCTCTTCTTCTCCAAAAAGTGGATGCTCTTCAACCCCAAATCTTCTTGGGACTACCACCGGTGTTACCTTGCTATTCTCCAGCCAAGAACACGGTAGTGGGACCGATTTTAGTGAAGGAATAAGGAAGGAAATAGAAATTTGTAGAGTTTGAACAAATGAAAGACTTAGTGAAATTGTCTAGCAAATTCCCACTAAATCACCAACCTTTGTATCATCTTCAACAACTAATTTTATAGAAGATTGACATGCAAAGAGGTGCTTTGCATGTAGGACACTCCAACAAGCACTAGCATAAAATGGGTTAGTTGTCATGATGTGAAGAAGCCAACACAAACTCATCCAAGTTAGTGGAAAGTTCCAACTCTTAAGTTGGAATTTTCCACTAAATGATTTTTTTTCGTTTAAAAACTTGTTTTTATTAAAACTTATTTTTAAAATAACTTTAATTAAAGCAAATTTTAATAATTAATTAATTAAAACTATTTTAATTAATTAATCTTATTTAATATCTAATATTAAATAAAAGACACCAATCCCTTTTAACAACTAATCTTATTGATTGTGCTTGAATCAATATTAAAAAAAAATAATTTAAATCCAATTTAAATATTTGTAACTCTCCAATTTTATTTAATTCTCAATTAAATAAACAAACGTTAATTATATCGCATATAATTGATGTTTTCCCCTAAACCGAATTTAAACATTTTAAATTCATTCTTCACATAGTTCTTTGGTTTTATCCGATTGAGCTAGCAAGGAGACCCAATGTATCTACAGATCATGGACTCCAACGAGCTGCGATTAACCGGTTAAACTCTGTAACCTAGTTAACCAATATTCGTTAACTAATAGGTCTGTCCACTATAGCTTGTAGCTGCACTCCCCTCATTGTAGATATATTTCTGTCCACCCGATATAATTGTCATCAGTAAGTTAATCCTTCACAGATTGTTCGTAATCTTAGCTGGGTCAATTACCGTTTTACCCCCGAGTTACTTCTTTTCTCCTTAAGTACCACTGTCCCTCTAATGAACAATTGATTTGAGATCTAATCATCAAATCGAGTCCCTCTCGGACGAACAATCTCTCTACTATGCCTAAAAGCGGGTAGGAGTGAATTCCATCTTGCAAGTCTATGTCCCCAGCTATCTACCCAGTCTTACCCTTGAAATGGGGGGCTTATTGAGTCGGCGAAGTCAGACCACTCTCACCCATGCAGATCTAAGGATAATCCCGAATAGAGAGGAGTTCATAGTTAGCTCAGGATTAAGATCGAGTTACCTAGATCATCGATTTGAAAATAGTCAGTGTTAATAGTAAACGAAGTTATAAAGAAACAATGACTATTTCGTGGTTTGGTCTTGTGCAAACTCATTGCACAGAATACCTCCACTCACATGTCTCCACATGAATAATTCAGGATCACATCGTTTGTATTAAATACAA
AGTGGGCCGTATTCATTAGCGTTCCTAGGATAAGGCACCTAACCTAATCCTTATACTAAAGACCATTTTGACTATCTACTCGAACTTGATCCACTTTTATGTCTCTACATAAAGTTCAAGTACTCATACAATAGCCATGGGTTCATAGTTTATTGGATTTAGGGTTACATTCACATATTCGATAACAACTTTACTGAATAAGTTCGATAACAACTTTATTGATAATAGAAAGTGTTAAAAGTTTATAAACCACAAGTTTTAGGACATAAAACTCAACATGACCAATATCTAAAGCCCCATACAGAATGGCCCCAGTTGAGTTGAAGGAGCTTAAGACCCAAATTCAAGAGTTGCTTGATAAGGGTTTCATATGTTCAAGTGTGTCACCCTGACTCGATAAGTCTAAATCCTGAACTTCAGGTTATACTAGAAGAAACTTGACTCAAAGGGATCAAGATCAAATAGTTGACTTATCACAGGACCAATCTAATGCCCCGACGAATGATTTTGAAGATCCAGGTATTAGACACTCTACTTCTGTTCCTCCTAGTTCTTTATCTTCTCATAATCCTTTACCTGATGTCTCTGATCTTGATATTCCAATTGCCCATAGGAAAGGTCCCCACAAAAAATCCCATTGCAAACTATCTTTCTTATCATAGATTGTCTGACTGTCCTAAAACCTTCACATCCAAAATAACCAACCTATTTGTTCCAAGCAATATACAGGAGGCCCTAAATGATTTGAATTGGAAATTAGCAGTGATGGAAGAGATGAATGCGATGAAACGAAATTGCACATGTTACATAGTTGAACTACCAAAAAATAAAAAAATAGTGGGATGCAAATGAGTGTTCACTGTAAAATGTAACGTTGATGGTAGTGTTGAAAGGTACAAGGCCATATTGGTTACCAAGGGGTTCACTCACACCTATGGAATTGATTTATCAAGAAACATTTGTTCTAGTTGCTAAGATTAATTCTATCAGAATTCTGTTGCTGTTGCAGTTAATTTTAATTGGTCTCTTTCTCAATGGGGATCTTGAAGAAAAGGTATTTATGGACTTGCCACCTTTTTAAGGTGGATCTCGGGGTTAACAAAGTGTGCAAGTTAAAGAAATCATTATATGACCTTAAACATTCTCCTAGAGTAAGGTTTGAACGTTTTGAAAAGGTAGTCACAAACTATGGATTCAGTTAAAGTCAAGCTAATCATACTATGTTCTATAAACATACTGGAAATAGCAAGGTTGTTGTTTTGATAGTGTATGTTGATGATATCATACTTATAGGTAATGATGAGATAGGTCTGAATATTTTGAAGAAAAACTAGCTAATGATTTCCAAATCAAGGACTTGGGAATCTTAAAGTATTTTCTAGGCATGGAGTTTCCCAGGTTCAAAAGTGACATGCTTGCCAACCAAAGGAAGTATATTCTTGACCTACTGAACGAGACAAGTTTACTTGGTTGCAAGATAGCAGAAACACCCATTGAGTAGAATTTAAAATTGGTAGCTGCAACTAAAAAAGAGGTAAAAGAAAAAGAAAAAGACCAAAGACTTGTGGGGAGACTCATATACTTCTCACACAAGTCCTGACCTCGCTTTTGCAGTTAGTATGGTAAGTCAATTCATGCATGCCCTGGGCCAGTTTACTTTGAAGCAGTTTATAGAATCTTGAGATATTTGAAAGGTACTCCAGGAAAAGGTATACTGTTTAAGAAGCATGACCACCTACATGTTGAGGATGTTGATTGGGCAGACAACACGACTGATAGAAGATCCACTTCAGGCTATTGCTCCTTTGTTGGAGGAAATCTAGTTACTTGGCGTAGTAAAAAAAAAAAGTGTGGTTGCAAGAAGTAGTGCTGAAGCAGAATTTAGGGCGTTAGCCCATGGTATTTGTGAGGCCATATGGATAAGAAGACTATTGGAAGAATTAAAATTCTCTCAGACGATGCCTGCACATTTATTGTGATAACAAGTTAGCAATTTTCAT
TTATTGTATTAAGGATTGGTCTTCTTTATCTTTCATTTTGTTCTTGTGGCTCCATTTTCTTTGTAGTATTTTCTCTTCTCTTTATTTGTTTCATTGTACTTTGAGCATTAGACTCATTTCATTAATTCAATGAAAAAGTCTTGTTTCTGTTTTAAAAAAAAAAACAAGTCATCAATTTTCATTGCCCACAATCCAGTCCTTCATGATAGAATGAAACATATTGAAGTTGATAAACACTTCATAAAGGAGAAGATTGATGCATGAATTATATGCATTACAGAGTAAATTGCAGATGTGTTAACTAAAGGACTTCCAAAGTGGCAATTCAACAATTTGATTGACAAGCTGACCATGAATGATATCTTTAAACCAGCTTGAGGGGGAGTGTTGATTATTTCCTTTTTTTGTTAATATCTACATTGTATTTATTATATTATATTTAATACAATAAATTGTATTATATTTAATACAATAAATACAATGATATCTTTAATATCTCCAGTGCTAAGGAGGTGTCAACTTAGTTGAAATGTCCGAGTGCGCTCCCTGATCCTTAGGTTTTATTGATCTTGTTTCCTTTTGTAAATCAAGATTTTATCTCAATTCATTATATCAACGAAAGAGACTTGTTTCCTTTAAAAAAAAATTTGTTTGTATTTTTATTTATTCCTTATTTGTAATGAGGTATTTCTTCTATTTAAGAAACCCTTTCCTCCTATGAGAAATAAGAGAGAAAATAATATTTTACAACAATCAGTTATAGGGAGCCACAATTTTCTCTAAGATCGACCTCCGTTCGGGTTACCATTAGTTAAGATTAAAAGATAGTGATATCCACAAGATAACGTTTAGGTCTAAATATGGACATTATGAATCACTATAAACAATGATCCTGTAGTGTTTATGGGCCTGATGAACAAAGTGTTCAAAGAGTTTCTTGACACTTCTGTGTTAGATTTTATTGATGATATTTTGATTTATTCAAGACAGAGGCAAAACATGAGTGGCGTACCCGACTTCTAAGATCTCTAACTGGGGTCATAATAAGAAGACAAAACAAAGTAAAAATCTTAGTACAAATAATAAACCAAACCAAACAAAATACCATTGCACATTTTCCAGAGGAATCCACAGCCAAAGGAAAGGAGGTATGCTAGAAATCGGGGTTGACCCACTCTAACAAGATGAACTCAACAACGAAGAAGCACACCAGCCCACCCCCGAATCGAGTGAAGGGAAACGAGAAAAGGGGAAGGCAGAAGTAATAGTGTAAAGCCCGTTAGGGTAAGGGAGAGAGAGAATGTGAGTGGAAAATGTGGGGTTGCTCTGGTGAGCCTGAGATGAAGAAGATTATTATCTATTGAATGGTTTTTGCTGTTAGGAAAAGTAGCTGTGAAGAAGAGTGGTTGTGGAAGAGAAGGGGGGAACGTGGGGATAGCTTAAAAATCAACCACAAAAGAGCTCTATTGTTGATGTTTAAGCTGATTTTCATCCATTCATAATGCTTATAATGCTCAAGACATAAAAGTGATCAATATCAAAAGAGAATTTTATTCAAATCAATTAAAACTTGCCAAAACAGGAAGACATTCTCTCTATTTATAGAGAATTGAAAAGCAAACTAATCCTAATCCTAATTAATTAAAGAAACTAATACTAATCTTAATAAATCAAAGAAACTAATTGTAATACTAATTAATTAAAGAAATTAACCTAATCCTAATAAATCAAAGAAACTAATCCTGATCCTAATAAATTAAAGATTTGCCCATAATACCCTAATCTCCTACATCAAAAAGGGAAAAAAAAACTCAACTCAACAGTACAAATCCTTCTTGAGGGTTTCTATAATTACCGCCTGTTTTGGTTCATTGATCTAGTAACAATATTTTGGTATATCTATATGAAGAATGAAACATTCAATAAATATGGTCGGGTTTCAGCTGGTTTTGCTTTTATGTGCAGTTTGTAACTGA

mRNA sequence

CGTCCCGCCTTTTTAAACAGCAACACTTCACTTCACTTCACTTCCCATTCTTCCCTCTTCCTTCTCGCCGTTTCTTATCCAAACTTCCGCAGCCATTAATGTCCAAATTCCTGCTCTCTCACGCTCACCTTCTCACCCTTCCCCACAAACACCATTCCTTTTCTCTCAACCATGGCGTCGTTCCCATCCGCTCAGTCCTATCTGCTCCGGAGAAGCGAGGTAGAAAGAAGCGGCAGTCGCGGCAGCAACAATTACACCCAAAGGACGACGATTCCACTGCACTTGAGAAGGCCCTCCGCTTCACTTTCATGGAGGAACTCATGGACCGCGCTAGAAACCACGATCCCCTTGGCGTTTCTGATGTCATTTACGATATGGTTGCCGCTGGATTGAGCCCTGGACCTCGCTCGTTCCATGGATTAGTTGTTTCTCATACTCTCAATGGTGATACTGAGGGAGCGATGCAATCTCTGAGAAGGGAATTAAGTGCTGGACTTCGTCCTCTTCACGAAACGTTTGTTGCATTAGTTCGGTTATTTGGTTCCAAGGGTCTTGCTACTAGAGGCTTAGAAATCCTTGCAGCCATGGAGAAATTGAATTATGACATCCGTCAAGCATGGCTCATTCTTACTGAGGAACTCCTAAGGAACAAATATTTAGAAGACGCCAATGAAGTGTTCTTAAAGGGTGCCAAAGGGGGTCTCAGAGCCACCGACAAGATTTATGATCTTCTGATTGAGGAAGATTGTAAAGCCGGGGATCATTCAAATGCCTTAGAGATCTCATATGAAATGGAGGCTGCCGGGCGGATGGCAACGACCTTTCATTTCAATTGCCTTCTTAGTGTCCAGGCTACTTGTGGAATACCTGAAATTGCTTTCTCAACATTTGAGAACATGGAATATGGAGAAGTTTGTAACTGA

Coding sequence (CDS)

ATGTCCAAATTCCTGCTCTCTCACGCTCACCTTCTCACCCTTCCCCACAAACACCATTCCTTTTCTCTCAACCATGGCGTCGTTCCCATCCGCTCAGTCCTATCTGCTCCGGAGAAGCGAGGTAGAAAGAAGCGGCAGTCGCGGCAGCAACAATTACACCCAAAGGACGACGATTCCACTGCACTTGAGAAGGCCCTCCGCTTCACTTTCATGGAGGAACTCATGGACCGCGCTAGAAACCACGATCCCCTTGGCGTTTCTGATGTCATTTACGATATGGTTGCCGCTGGATTGAGCCCTGGACCTCGCTCGTTCCATGGATTAGTTGTTTCTCATACTCTCAATGGTGATACTGAGGGAGCGATGCAATCTCTGAGAAGGGAATTAAGTGCTGGACTTCGTCCTCTTCACGAAACGTTTGTTGCATTAGTTCGGTTATTTGGTTCCAAGGGTCTTGCTACTAGAGGCTTAGAAATCCTTGCAGCCATGGAGAAATTGAATTATGACATCCGTCAAGCATGGCTCATTCTTACTGAGGAACTCCTAAGGAACAAATATTTAGAAGACGCCAATGAAGTGTTCTTAAAGGGTGCCAAAGGGGGTCTCAGAGCCACCGACAAGATTTATGATCTTCTGATTGAGGAAGATTGTAAAGCCGGGGATCATTCAAATGCCTTAGAGATCTCATATGAAATGGAGGCTGCCGGGCGGATGGCAACGACCTTTCATTTCAATTGCCTTCTTAGTGTCCAGGCTACTTGTGGAATACCTGAAATTGCTTTCTCAACATTTGAGAACATGGAATATGGAGAAGTTTGTAACTGA

Protein sequence

MSKFLLSHAHLLTLPHKHHSFSLNHGVVPIRSVLSAPEKRGRKKRQSRQQQLHPKDDDSTALEKALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILTEELLRNKYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEVCN
BLAST of Lsi06G007550 vs. TrEMBL
Match: I1NIL0_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_20G221100 PE=4 SV=1)

HSP 1 Score: 399.1 bits (1024), Expect = 4.5e-108
Identity = 205/274 (74.82%), Positives = 234/274 (85.40%), Query Frame = 1

Query: 1   MSKFLLSHAHLLTLPHKHHSFSLNH---GVVPIRSVLSAPEKRGRKKRQSRQQQLHPKDD 60
           MS  +L + +  T  +    F LN      V +R+ +S+P+KRGRKK+Q+       KDD
Sbjct: 1   MSSLILPYTY--TYGYARFPFKLNRFSPRTVTVRAAVSSPDKRGRKKKQA-------KDD 60

Query: 61  DSTALEKALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGD 120
           DS A+E  LRF+FMEELMDRARN D  GVS+V+YDM+AAGLSPGPRSFHGLVVSH LNGD
Sbjct: 61  DS-AVENGLRFSFMEELMDRARNRDSNGVSEVMYDMIAAGLSPGPRSFHGLVVSHALNGD 120

Query: 121 TEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLIL 180
            E AM+SLRREL+AGLRP+HETF+AL+RLFGSKG ATRGLEILAAMEKLNYDIRQAWLIL
Sbjct: 121 EEAAMESLRRELAAGLRPVHETFLALIRLFGSKGRATRGLEILAAMEKLNYDIRQAWLIL 180

Query: 181 TEELLRNKYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGR 240
            EEL+ NK+LEDANEVFLKGAKGGL+ATD++YDLLIEEDCKAGDHSNAL+I+YEMEAAGR
Sbjct: 181 IEELVWNKHLEDANEVFLKGAKGGLKATDEVYDLLIEEDCKAGDHSNALDIAYEMEAAGR 240

Query: 241 MATTFHFNCLLSVQATCGIPEIAFSTFENMEYGE 272
           MATTFHFNCLLSVQATCGIPEIAF+TFENMEYGE
Sbjct: 241 MATTFHFNCLLSVQATCGIPEIAFATFENMEYGE 264
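The Identity and Positives percentages in each HSP header are the listed counts divided by the alignment length, which here is taken to include gap columns. A short sketch of that arithmetic, using the counts from the first HSP; the function name `percent` is illustrative only.

```python
def percent(count: int, alignment_length: int) -> float:
    """Return count / alignment_length as a percentage rounded to 2 decimals."""
    return round(100.0 * count / alignment_length, 2)


# Counts from the HSP against I1NIL0_SOYBN: 205 identical and
# 234 positive columns out of 274 aligned columns.
identity = percent(205, 274)
positives = percent(234, 274)
print(f"Identity = {identity:.2f}%, Positives = {positives:.2f}%")
# Identity = 74.82%, Positives = 85.40%
```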

BLAST of Lsi06G007550 vs. TrEMBL
Match: I1LBT1_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_10G168600 PE=4 SV=2)

HSP 1 Score: 399.1 bits (1024), Expect = 4.5e-108
Identity = 202/274 (73.72%), Positives = 232/274 (84.67%), Query Frame = 1

Query: 1   MSKFLLSHAHLLTLPHKHHSFSLNH---GVVPIRSVLSAPEKRGRKKRQSRQQQLHPKDD 60
           MS  +L + +  T  +    F LN      V +R+ +SAP+KRGRKK+QS+        D
Sbjct: 1   MSSLILPYTY--TYGYARFPFKLNRFSPRAVTVRAAVSAPDKRGRKKKQSK--------D 60

Query: 61  DSTALEKALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGD 120
           D +A+E  LRF+FMEELMDRARN D  GVS+V+YDM+AAGLSPGPRSFHGLVVSH LNGD
Sbjct: 61  DESAVENGLRFSFMEELMDRARNRDSNGVSEVMYDMIAAGLSPGPRSFHGLVVSHALNGD 120

Query: 121 TEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLIL 180
            E AM+SLRREL+AGLRP+HETF+AL+RLFGSKG ATRGLEILAAMEKLNYDIRQAWLIL
Sbjct: 121 EEAAMESLRRELAAGLRPVHETFLALIRLFGSKGRATRGLEILAAMEKLNYDIRQAWLIL 180

Query: 181 TEELLRNKYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGR 240
            EEL+RN +LEDANEVFLKGAKGGL+ATD++YDLLI+EDCK GDHSNAL+I+YEMEAAGR
Sbjct: 181 IEELVRNMHLEDANEVFLKGAKGGLKATDEVYDLLIQEDCKVGDHSNALDIAYEMEAAGR 240

Query: 241 MATTFHFNCLLSVQATCGIPEIAFSTFENMEYGE 272
           MATTFHFNCLLSVQATCGIPEIAF+TFENMEYGE
Sbjct: 241 MATTFHFNCLLSVQATCGIPEIAFATFENMEYGE 264

BLAST of Lsi06G007550 vs. TrEMBL
Match: A0A0B2PBH9_GLYSO (Pentatricopeptide repeat-containing protein, chloroplastic OS=Glycine soja GN=glysoja_013869 PE=4 SV=1)

HSP 1 Score: 399.1 bits (1024), Expect = 4.5e-108
Identity = 205/274 (74.82%), Positives = 234/274 (85.40%), Query Frame = 1

Query: 1   MSKFLLSHAHLLTLPHKHHSFSLNH---GVVPIRSVLSAPEKRGRKKRQSRQQQLHPKDD 60
           MS  +L + +  T  +    F LN      V +R+ +S+P+KRGRKK+Q+       KDD
Sbjct: 1   MSSLILPYTY--TYGYARFPFKLNRFSPRTVTVRAAVSSPDKRGRKKKQA-------KDD 60

Query: 61  DSTALEKALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGD 120
           DS A+E  LRF+FMEELMDRARN D  GVS+V+YDM+AAGLSPGPRSFHGLVVSH LNGD
Sbjct: 61  DS-AVENGLRFSFMEELMDRARNRDSNGVSEVMYDMIAAGLSPGPRSFHGLVVSHALNGD 120

Query: 121 TEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLIL 180
            E AM+SLRREL+AGLRP+HETF+AL+RLFGSKG ATRGLEILAAMEKLNYDIRQAWLIL
Sbjct: 121 EEAAMESLRRELAAGLRPVHETFLALIRLFGSKGRATRGLEILAAMEKLNYDIRQAWLIL 180

Query: 181 TEELLRNKYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGR 240
            EEL+ NK+LEDANEVFLKGAKGGL+ATD++YDLLIEEDCKAGDHSNAL+I+YEMEAAGR
Sbjct: 181 IEELVWNKHLEDANEVFLKGAKGGLKATDEVYDLLIEEDCKAGDHSNALDIAYEMEAAGR 240

Query: 241 MATTFHFNCLLSVQATCGIPEIAFSTFENMEYGE 272
           MATTFHFNCLLSVQATCGIPEIAF+TFENMEYGE
Sbjct: 241 MATTFHFNCLLSVQATCGIPEIAFATFENMEYGE 264

BLAST of Lsi06G007550 vs. TrEMBL
Match: A0A151U5M8_CAJCA (Uncharacterized protein OS=Cajanus cajan GN=KK1_007275 PE=4 SV=1)

HSP 1 Score: 396.4 bits (1017), Expect = 2.9e-107
Identity = 197/268 (73.51%), Positives = 228/268 (85.07%), Query Frame = 1

Query: 4   FLLSHAHLLTLPHKHHSFSLNHGVVPIRSVLSAPEKRGRKKRQSRQQQLHPKDDDSTALE 63
           +  +H +    P K + FS     V +R+ +S+P+KR RKK+        P  DD TALE
Sbjct: 8   YTCTHGYAPNFPFKFNRFSPR--TVTVRAAVSSPDKRSRKKK--------PAKDDETALE 67

Query: 64  KALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQ 123
             LRF+FMEELMDRAR+ D  GVS+V+YDM+AAGL+PGPRSFHGLVVSH LNGD E AM+
Sbjct: 68  NGLRFSFMEELMDRARSRDSNGVSEVMYDMIAAGLNPGPRSFHGLVVSHALNGDEEAAME 127

Query: 124 SLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILTEELLR 183
           SLRREL+AGLRP+HETF+ALVRLFGSKG ATRGLEILAAMEKLNYDIRQAW++L EEL++
Sbjct: 128 SLRRELAAGLRPVHETFLALVRLFGSKGRATRGLEILAAMEKLNYDIRQAWIVLIEELVQ 187

Query: 184 NKYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFH 243
           NK+LEDAN+VFLKGAKGGLRATD++YDLLIEEDCK GDHSNAL+I+YEMEAAGRMATTFH
Sbjct: 188 NKHLEDANQVFLKGAKGGLRATDEVYDLLIEEDCKVGDHSNALDIAYEMEAAGRMATTFH 247

Query: 244 FNCLLSVQATCGIPEIAFSTFENMEYGE 272
           FNCLLSVQATCGIPEIAF+TFENMEYGE
Sbjct: 248 FNCLLSVQATCGIPEIAFATFENMEYGE 265

BLAST of Lsi06G007550 vs. TrEMBL
Match: M5WJZ9_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001139mg PE=4 SV=1)

HSP 1 Score: 388.7 bits (997), Expect = 6.1e-105
Identity = 196/258 (75.97%), Positives = 220/258 (85.27%), Query Frame = 1

Query: 13  TLPHKHHSFSLNHGVVPIRSVLSAPEKRGRKKRQSRQQQLHPKDDDSTALEKALRFTFME 72
           T P K    +    VV   S +SAPEKR R+KR+  +         S+A EK+LRFTFME
Sbjct: 14  TFPCKFKCPNDTVSVVVRSSAVSAPEKRTRRKRRQTKGDNDSSSPSSSAAEKSLRFTFME 73

Query: 73  ELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAG 132
           ELM RARN D  GVSDVIYDMVAAGL+PGPRSFHGL+V+H LNGDTE AMQSLRRELS+G
Sbjct: 74  ELMGRARNRDANGVSDVIYDMVAAGLTPGPRSFHGLIVAHALNGDTEAAMQSLRRELSSG 133

Query: 133 LRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILTEELLRNKYLEDANE 192
           LRPLHETF+AL+RLFGSKG ATRGLEILAAMEKL+YDIR+AWL+L EEL+R ++LEDAN+
Sbjct: 134 LRPLHETFIALIRLFGSKGRATRGLEILAAMEKLHYDIRRAWLLLVEELVRTRHLEDANK 193

Query: 193 VFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQA 252
           VFLKGAKGGLRATD++YDLLI EDCK GDHSNAL+I+YEMEAAGRMATTFHFNCLLSVQA
Sbjct: 194 VFLKGAKGGLRATDEVYDLLIVEDCKVGDHSNALDIAYEMEAAGRMATTFHFNCLLSVQA 253

Query: 253 TCGIPEIAFSTFENMEYG 271
           TCGIPEIAFSTFENMEYG
Sbjct: 254 TCGIPEIAFSTFENMEYG 271

BLAST of Lsi06G007550 vs. TAIR10
Match: AT3G04260.1 (AT3G04260.1 plastid transcriptionally active 3)

HSP 1 Score: 370.9 bits (951), Expect = 6.6e-103
Identity = 185/255 (72.55%), Positives = 216/255 (84.71%), Query Frame = 1

Query: 26  GVVPIRSVLSAPEKRGRKKRQSRQQQLHPKDDDST--------ALEKALRFTFMEELMDR 85
           G+  IR  +SAPEK+ R++R+ ++      DD  +        ALE++LR TFM+ELM+R
Sbjct: 24  GISSIRCSISAPEKKPRRRRKQKRGDGAENDDSLSFGSGEAVSALERSLRLTFMDELMER 83

Query: 86  ARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLRPLH 145
           ARN D  GVS+VIYDM+AAGLSPGPRSFHGLVV+H LNGD +GAM SLR+EL AG RPL 
Sbjct: 84  ARNRDTSGVSEVIYDMIAAGLSPGPRSFHGLVVAHALNGDEQGAMHSLRKELGAGQRPLP 143

Query: 146 ETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILTEELLRNKYLEDANEVFLKG 205
           ET +ALVRL GSKG ATRGLEILAAMEKL YDIRQAWLIL EEL+R  +LEDAN+VFLKG
Sbjct: 144 ETMIALVRLSGSKGNATRGLEILAAMEKLKYDIRQAWLILVEELMRINHLEDANKVFLKG 203

Query: 206 AKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIP 265
           A+GG+RATD++YDL+IEEDCKAGDHSNAL+ISYEMEAAGRMATTFHFNCLLSVQATCGIP
Sbjct: 204 ARGGMRATDQLYDLMIEEDCKAGDHSNALDISYEMEAAGRMATTFHFNCLLSVQATCGIP 263

Query: 266 EIAFSTFENMEYGEV 273
           E+A++TFENMEYGEV
Sbjct: 264 EVAYATFENMEYGEV 278

BLAST of Lsi06G007550 vs. TAIR10
Match: AT5G18390.1 (AT5G18390.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 52.0 bits (123), Expect = 6.8e-07
Identity = 41/148 (27.70%), Positives = 64/148 (43.24%), Query Frame = 1

Query: 120 GAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDI-RQAWLILT 179
           GA   +RR +  GL+P   T+  LV  + S G      E L  M +  ++   +   +L 
Sbjct: 200 GAYALIRRMIRKGLKPDKRTYAILVNGWCSAGKMKEAQEFLDEMSRRGFNPPARGRDLLI 259

Query: 180 EELLRNKYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRM 239
           E LL   YLE A E+  K  KGG     + +++LIE   K+G+    +E+ Y     G  
Sbjct: 260 EGLLNAGYLESAKEMVSKMTKGGFVPDIQTFNILIEAISKSGEVEFCIEMYYTACKLGLC 319

Query: 240 ATTFHFNCLLSVQATCGIPEIAFSTFEN 267
                +  L+   +  G  + AF    N
Sbjct: 320 VDIDTYKTLIPAVSKIGKIDEAFRLLNN 347

BLAST of Lsi06G007550 vs. TAIR10
Match: AT5G59900.1 (AT5G59900.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 51.6 bits (122), Expect = 8.9e-07
Identity = 40/158 (25.32%), Positives = 69/158 (43.67%), Query Frame = 1

Query: 92  DMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKG 151
           +MV  GL      ++ L+  H   GD   A   +   ++  L P   T+ +L+  + SKG
Sbjct: 427 EMVDTGLKLSVYPYNSLINGHCKFGDISAAEGFMAEMINKKLEPTVVTYTSLMGGYCSKG 486

Query: 152 LATRGLEILAAME-KLNYDIRQAWLILTEELLRNKYLEDANEVFLKGAKGGLRATDKIYD 211
              + L +   M  K        +  L   L R   + DA ++F + A+  ++     Y+
Sbjct: 487 KINKALRLYHEMTGKGIAPSIYTFTTLLSGLFRAGLIRDAVKLFNEMAEWNVKPNRVTYN 546

Query: 212 LLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLL 249
           ++IE  C+ GD S A E   EM   G +  T+ +  L+
Sbjct: 547 VMIEGYCEEGDMSKAFEFLKEMTEKGIVPDTYSYRPLI 584

BLAST of Lsi06G007550 vs. TAIR10
Match: AT4G34830.1 (AT4G34830.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 48.5 bits (114), Expect = 7.5e-06
Identity = 40/182 (21.98%), Positives = 72/182 (39.56%), Query Frame = 1

Query: 88  DVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLF 147
           +V + M  +G+     +F  L+      G    A  +     S  ++P    F AL+   
Sbjct: 523 EVFHQMSNSGVEANLHTFGALIDGCARAGQVAKAFGAYGILRSKNVKPDRVVFNALISAC 582

Query: 148 GSKGLATRGLEILAAMEKLNYDIRQAWL---ILTEELLRNKYLEDANEVFLKGAKGGLRA 207
           G  G   R  ++LA M+   + I    +    L +       +E A EV+    K G+R 
Sbjct: 583 GQSGAVDRAFDVLAEMKAETHPIDPDHISIGALMKACCNAGQVERAKEVYQMIHKYGIRG 642

Query: 208 TDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTF 267
           T ++Y + +    K+GD   A  I  +M+          F+ L+ V     + + AF   
Sbjct: 643 TPEVYTIAVNSCSKSGDWDFACSIYKDMKEKDVTPDEVFFSALIDVAGHAKMLDEAFGIL 702

BLAST of Lsi06G007550 vs. TAIR10
Match: AT2G18940.1 (AT2G18940.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 48.5 bits (114), Expect = 7.5e-06
Identity = 35/122 (28.69%), Positives = 58/122 (47.54%), Query Frame = 1

Query: 130 SAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQA-WLILTEELLRNKYLE 189
           S G  P   T+ AL+++FG  G+ T  L +L  ME+ +       +  L    +R  + +
Sbjct: 309 SCGYEPGTVTYNALLQVFGKAGVYTEALSVLKEMEENSCPADSVTYNELVAAYVRAGFSK 368

Query: 190 DANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLL 249
           +A  V     K G+      Y  +I+   KAG    AL++ Y M+ AG +  T  +N +L
Sbjct: 369 EAAGVIEMMTKKGVMPNAITYTTVIDAYGKAGKEDEALKLFYSMKEAGCVPNTCTYNAVL 428

Query: 250 SV 251
           S+
Sbjct: 429 SL 430

BLAST of Lsi06G007550 vs. NCBI nr
Match: gi|778664211|ref|XP_011660243.1| (PREDICTED: uncharacterized protein LOC101209618 [Cucumis sativus])

HSP 1 Score: 501.5 bits (1290), Expect = 9.2e-139
Identity = 253/272 (93.01%), Positives = 263/272 (96.69%), Query Frame = 1

Query: 1   MSKFLLSHAHLLTLPHKHHSFSLNHGVVPIRSVLSAPEKRGRKKRQSR-QQQLHPKDDDS 60
           MSKFLLSHAHLLTLP  H SFSLNHG++PIRSVLSAP+KRGRKKRQSR QQQL PKD+DS
Sbjct: 1   MSKFLLSHAHLLTLPSNHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQPKDNDS 60

Query: 61  TALEKALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTE 120
           T+LE +LRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTE
Sbjct: 61  TSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTE 120

Query: 121 GAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILTE 180
           GAMQSLRRELSAGL PLHETFVALVRLFGSKGLA RGLEILAAMEKLNYDIRQAWLILTE
Sbjct: 121 GAMQSLRRELSAGLLPLHETFVALVRLFGSKGLANRGLEILAAMEKLNYDIRQAWLILTE 180

Query: 181 ELLRNKYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMA 240
           EL+R+KYLEDAN+VFLKGAK GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMA
Sbjct: 181 ELVRSKYLEDANKVFLKGAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMA 240

Query: 241 TTFHFNCLLSVQATCGIPEIAFSTFENMEYGE 272
           TTFHFNCLLSVQATCGIPEIAFSTFENMEYGE
Sbjct: 241 TTFHFNCLLSVQATCGIPEIAFSTFENMEYGE 272

BLAST of Lsi06G007550 vs. NCBI nr
Match: gi|659086066|ref|XP_008443747.1| (PREDICTED: uncharacterized protein LOC103487261 isoform X2 [Cucumis melo])

HSP 1 Score: 500.7 bits (1288), Expect = 1.6e-138
Identity = 252/272 (92.65%), Positives = 263/272 (96.69%), Query Frame = 1

Query: 1   MSKFLLSHAHLLTLPHKHHSFSLNHGVVPIRSVLSAPEKRGRKKRQSR-QQQLHPKDDDS 60
           MSK LLSHAHLLTLP+ H SFSLNHG++PIRSVLSAP+KRGRKKRQSR QQQL  KDDDS
Sbjct: 1   MSKLLLSHAHLLTLPYNHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQLKDDDS 60

Query: 61  TALEKALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTE 120
           T+LE +LRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTE
Sbjct: 61  TSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTE 120

Query: 121 GAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILTE 180
           GAMQSLRRELS+GLRPLHETFVALVRLFGSKGLA RGLEILAAME+LNYDIRQAWLILTE
Sbjct: 121 GAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMERLNYDIRQAWLILTE 180

Query: 181 ELLRNKYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMA 240
           EL+RNKYLEDAN+VFLKGAK GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMA
Sbjct: 181 ELVRNKYLEDANKVFLKGAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMA 240

Query: 241 TTFHFNCLLSVQATCGIPEIAFSTFENMEYGE 272
           TTFHFNCLLSVQATCGIPEIAFSTFENMEYGE
Sbjct: 241 TTFHFNCLLSVQATCGIPEIAFSTFENMEYGE 272

BLAST of Lsi06G007550 vs. NCBI nr
Match: gi|659086064|ref|XP_008443746.1| (PREDICTED: uncharacterized protein LOC103487261 isoform X1 [Cucumis melo])

HSP 1 Score: 500.7 bits (1288), Expect = 1.6e-138
Identity = 252/272 (92.65%), Positives = 263/272 (96.69%), Query Frame = 1

Query: 1   MSKFLLSHAHLLTLPHKHHSFSLNHGVVPIRSVLSAPEKRGRKKRQSR-QQQLHPKDDDS 60
           MSK LLSHAHLLTLP+ H SFSLNHG++PIRSVLSAP+KRGRKKRQSR QQQL  KDDDS
Sbjct: 1   MSKLLLSHAHLLTLPYNHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQLKDDDS 60

Query: 61  TALEKALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTE 120
           T+LE +LRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTE
Sbjct: 61  TSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTE 120

Query: 121 GAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILTE 180
           GAMQSLRRELS+GLRPLHETFVALVRLFGSKGLA RGLEILAAME+LNYDIRQAWLILTE
Sbjct: 121 GAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMERLNYDIRQAWLILTE 180

Query: 181 ELLRNKYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMA 240
           EL+RNKYLEDAN+VFLKGAK GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMA
Sbjct: 181 ELVRNKYLEDANKVFLKGAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMA 240

Query: 241 TTFHFNCLLSVQATCGIPEIAFSTFENMEYGE 272
           TTFHFNCLLSVQATCGIPEIAFSTFENMEYGE
Sbjct: 241 TTFHFNCLLSVQATCGIPEIAFSTFENMEYGE 272

BLAST of Lsi06G007550 vs. NCBI nr
Match: gi|356533668|ref|XP_003535382.1| (PREDICTED: uncharacterized protein LOC100802355 [Glycine max])

HSP 1 Score: 399.1 bits (1024), Expect = 6.4e-108
Identity = 202/274 (73.72%), Positives = 232/274 (84.67%), Query Frame = 1

Query: 1   MSKFLLSHAHLLTLPHKHHSFSLNH---GVVPIRSVLSAPEKRGRKKRQSRQQQLHPKDD 60
           MS  +L + +  T  +    F LN      V +R+ +SAP+KRGRKK+QS+        D
Sbjct: 1   MSSLILPYTY--TYGYARFPFKLNRFSPRAVTVRAAVSAPDKRGRKKKQSK--------D 60

Query: 61  DSTALEKALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGD 120
           D +A+E  LRF+FMEELMDRARN D  GVS+V+YDM+AAGLSPGPRSFHGLVVSH LNGD
Sbjct: 61  DESAVENGLRFSFMEELMDRARNRDSNGVSEVMYDMIAAGLSPGPRSFHGLVVSHALNGD 120

Query: 121 TEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLIL 180
            E AM+SLRREL+AGLRP+HETF+AL+RLFGSKG ATRGLEILAAMEKLNYDIRQAWLIL
Sbjct: 121 EEAAMESLRRELAAGLRPVHETFLALIRLFGSKGRATRGLEILAAMEKLNYDIRQAWLIL 180

Query: 181 TEELLRNKYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGR 240
            EEL+RN +LEDANEVFLKGAKGGL+ATD++YDLLI+EDCK GDHSNAL+I+YEMEAAGR
Sbjct: 181 IEELVRNMHLEDANEVFLKGAKGGLKATDEVYDLLIQEDCKVGDHSNALDIAYEMEAAGR 240

Query: 241 MATTFHFNCLLSVQATCGIPEIAFSTFENMEYGE 272
           MATTFHFNCLLSVQATCGIPEIAF+TFENMEYGE
Sbjct: 241 MATTFHFNCLLSVQATCGIPEIAFATFENMEYGE 264

BLAST of Lsi06G007550 vs. NCBI nr
Match: gi|734324187|gb|KHN04962.1| (Pentatricopeptide repeat-containing protein, chloroplastic [Glycine soja])

HSP 1 Score: 399.1 bits (1024), Expect = 6.4e-108
Identity = 205/274 (74.82%), Positives = 234/274 (85.40%), Query Frame = 1

Query: 1   MSKFLLSHAHLLTLPHKHHSFSLNH---GVVPIRSVLSAPEKRGRKKRQSRQQQLHPKDD 60
           MS  +L + +  T  +    F LN      V +R+ +S+P+KRGRKK+Q+       KDD
Sbjct: 1   MSSLILPYTY--TYGYARFPFKLNRFSPRTVTVRAAVSSPDKRGRKKKQA-------KDD 60

Query: 61  DSTALEKALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGD 120
           DS A+E  LRF+FMEELMDRARN D  GVS+V+YDM+AAGLSPGPRSFHGLVVSH LNGD
Sbjct: 61  DS-AVENGLRFSFMEELMDRARNRDSNGVSEVMYDMIAAGLSPGPRSFHGLVVSHALNGD 120

Query: 121 TEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLIL 180
            E AM+SLRREL+AGLRP+HETF+AL+RLFGSKG ATRGLEILAAMEKLNYDIRQAWLIL
Sbjct: 121 EEAAMESLRRELAAGLRPVHETFLALIRLFGSKGRATRGLEILAAMEKLNYDIRQAWLIL 180

Query: 181 TEELLRNKYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGR 240
            EEL+ NK+LEDANEVFLKGAKGGL+ATD++YDLLIEEDCKAGDHSNAL+I+YEMEAAGR
Sbjct: 181 IEELVWNKHLEDANEVFLKGAKGGLKATDEVYDLLIEEDCKAGDHSNALDIAYEMEAAGR 240

Query: 241 MATTFHFNCLLSVQATCGIPEIAFSTFENMEYGE 272
           MATTFHFNCLLSVQATCGIPEIAF+TFENMEYGE
Sbjct: 241 MATTFHFNCLLSVQATCGIPEIAFATFENMEYGE 264

The following BLAST results are available for this feature:
Match Name        E-value   Identity (%)  Description
I1NIL0_SOYBN      4.5e-108  74.82         Uncharacterized protein OS=Glycine max GN=GLYMA_20G221100 PE=4 SV=1
I1LBT1_SOYBN      4.5e-108  73.72         Uncharacterized protein OS=Glycine max GN=GLYMA_10G168600 PE=4 SV=2
A0A0B2PBH9_GLYSO  4.5e-108  74.82         Pentatricopeptide repeat-containing protein, chloroplastic OS=Glycine soja GN=gl...
A0A151U5M8_CAJCA  2.9e-107  73.51         Uncharacterized protein OS=Cajanus cajan GN=KK1_007275 PE=4 SV=1
M5WJZ9_PRUPE      6.1e-105  75.97         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001139mg PE=4 SV=1

Match Name   E-value   Identity (%)  Description
AT3G04260.1  6.6e-103  72.55         plastid transcriptionally active 3
AT5G18390.1  6.8e-07   27.70         Pentatricopeptide repeat (PPR) superfamily protein
AT5G59900.1  8.9e-07   25.32         Pentatricopeptide repeat (PPR) superfamily protein
AT4G34830.1  7.5e-06   21.98         Pentatricopeptide repeat (PPR) superfamily protein
AT2G18940.1  7.5e-06   28.69         Tetratricopeptide repeat (TPR)-like superfamily protein

Match Name                        E-value   Identity (%)  Description
gi|778664211|ref|XP_011660243.1|  9.2e-139  93.01         PREDICTED: uncharacterized protein LOC101209618 [Cucumis sativus]
gi|659086066|ref|XP_008443747.1|  1.6e-138  92.65         PREDICTED: uncharacterized protein LOC103487261 isoform X2 [Cucumis melo]
gi|659086064|ref|XP_008443746.1|  1.6e-138  92.65         PREDICTED: uncharacterized protein LOC103487261 isoform X1 [Cucumis melo]
gi|356533668|ref|XP_003535382.1|  6.4e-108  73.72         PREDICTED: uncharacterized protein LOC100802355 [Glycine max]
gi|734324187|gb|KHN04962.1|       6.4e-108  74.82         Pentatricopeptide repeat-containing protein, chloroplastic [Glycine soja]

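
The identity and positives percentages reported for each HSP follow directly from the raw counts on its summary line (for example, 205 identical positions out of 274 aligned gives 74.82%). The following is a minimal sketch of that check, not part of the database output: `parse_hsp` and `HSP_LINE` are hypothetical names introduced here, and the pattern assumes the summary-line layout shown in the alignments above.

```python
import re

# Hypothetical pattern for the HSP summary lines shown above.
# "Posi?tives" also accepts the misspelling "Postives" that some exports contain.
HSP_LINE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp(line):
    """Recompute identity/positives percentages from the raw counts."""
    match = HSP_LINE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        # Percentages recomputed from the counts, rounded to two decimals.
        "identity": round(100 * int(ident) / int(ident_len), 2),
        "positives": round(100 * int(pos) / int(pos_len), 2),
        # Percentages as printed on the line, for comparison.
        "reported_identity": float(ident_pct),
        "reported_positives": float(pos_pct),
    }


if __name__ == "__main__":
    hsp = parse_hsp(
        "Identity = 205/274 (74.82%), Positives = 234/274 (85.40%), Query Frame = 1"
    )
    print(hsp["identity"], hsp["positives"])
```

Comparing the recomputed values with the reported ones is a quick way to catch copy or extraction errors when these records are collected into a spreadsheet.
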
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
biological_process  GO:0055114      oxidation-reduction process
biological_process  GO:0006979      response to oxidative stress
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0009507      chloroplast
cellular_component  GO:0016020      membrane
cellular_component  GO:0009508      plastid chromosome
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0020037      heme binding
molecular_function  GO:0004601      peroxidase activity
molecular_function  GO:0005515      protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Lsi06G007550.1  Lsi06G007550.1  mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR Term  IPR Description   Source   Source Term    Source Description                  Alignment
None      No IPR available  PANTHER  PTHR31407      FAMILY NOT NAMED                    coord: 66..271; score: 5.5
None      No IPR available  PANTHER  PTHR31407:SF5  PLASTID TRANSCRIPTIONALLY ACTIVE 3  coord: 66..271; score: 5.5