Lsi02G002760 (gene) Bottle gourd (USVL1VR-Ls)

Name: Lsi02G002760
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description: DNA-directed RNA polymerases I, II, and III subunit RPABC4
Location: chr02 : 2340063 .. 2354580 (-)
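
The location field can be read programmatically. The sketch below is only an illustration: `parse_location` and `span_length` are hypothetical helper names, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re


def parse_location(text):
    """Split a string such as 'chr02 : 2340063 .. 2354580 (-)' into its parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length of the genomic span, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end, strand = parse_location("chr02 : 2340063 .. 2354580 (-)")
print(chrom, strand, span_length(start, end))
```

The `(-)` strand means the gene is annotated on the reverse strand of chr02.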
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
AGAAAATAATAATAATAAAAGAAAACCCTACTACACGTTCCCTCTCCCCTTTTATTTTTGTTTGACGTTTGATTTTTCCCTCACCCATAATTGTAGCAGCCACAACCAAAGCCCTACGCGACTTTGATTTCTCTCTGCTCTCGTATCTCCCTCGCAGCGATAGGTACGCACCAACTATTTCTTTTCTCTTATCTCCCTCCGCCCTCGCGGTCGTCGGTCGTCGGCACACCCTCACAAACACTTTTCGTTTCTCATTATTCTATTGATGATTGCTTCTTACTTGTGTGAGCAATTATTATTTTGCAGGAAAATTTTTGAAGTTTGACTCATGGATCCTCAACCTGAACCAGTCAGCTACATCTGTGGAGGTACACTTGTTGTATTTACTGATTCCATGATTCAAAATCTTCCGATTGCACAGGTTCTTACGTTCTGTTATCGGATGCATCTTTTAATAAGTATCAAGTTACTTTGATTGTATTAAGACTGTGGGAATTTAGAAGTTTCTTTATTTGAGTCTAGATTTAGAATACTTGCACTAAATATTCACCTACATGACTACTTTTGGGGCTTGAGAACTGAAAATTTTGTGTAGATTGTGGAATGGAGAACACTCTGAAGCAGGGTGATGTTATACAGTGCCGAGAGTGTGGTTATCGTATTCTCTACAAGAAGCGCACCCGTCGCAGTAAGATATATTTCTTCTGGTTTCTTTATATATATTTTTTTTCTTTTTCTTTTTTGATCATGAATATGTCGTCTCTAAATATGATCTTTCCGTTAAAGATGTTTGGAATTTAGAATCTCATCTAATAAGTTGTTTCAGACTTTCATATGATTGACTATAAATTTTATATGAAACAAGAAACAGAGGTGGGAGTATTTATTTAATCGTAGAGTACAAAAGAGTCAGTGGGGTGAGCAAGCTACCTAGTTCAAAACGAATGAGCTTTTTGGTGTGAGTAGATATTGAAGATGGGTAACACATACCGGTCAAACTCTGTGACGGGTGGCTTTGGCCTGCTTTGTACCTAGGCATGTTTGAGAATAATTTTAAAATTGTTAAAATCACTTTGTCATCTTTAAAATTACTCCAAAACATGCTTTTGCTCATTCAAAATCAATTTGGCTATATGAAAGTTGTGTTTAGAAGTTTAAAATAAAAAATTAAATTGATTTTGAACATGACAAAAATAATTTTAACTCTTTCAAAATAACTCTCAAACATCCTCTTAGTTGCTCTCATTCTAAAGTCAATTTGATTTTCATTGCATTTGAGGTCTCTCTTGTTTGAGAAATCTGTACCCACTTCACTACTTCTCATGAGCTCAACGATCAGCTATGTTTTCACTTCTTTTTATCTGTTTTGAATTCCTTTTATCACACAATCAACATCCATTCTCAATGCTGTTCTCTTCCTGATCAAACCTTATTCACCTTCTTTGTAGTTCAGTTGAAGATTCAAGACATGGATCATTTAAAATAATCATTTTATTAACTTAGTTTGTTTGAAGTGTATGGATTGCACATATTTGTAACTTGATTAGCAGGATGGCTGTAGTTTGGGTAATTTACTGTCACGACGTATTGTATTTGAGCAGAATCTATTCTACTCAACCTATGGTGATAATTGACTGGAAAAAGTGTTTCTCTTGGATCCTTGTGATCTTCTATGACTTGGGATGCTCTCTAATTACAAGTCTTGAGATTAAATTTAGATTCTCTCTTTTTATGACGGGTGAAACGACCATTGCAGTTATTGGAGTCCATTCTTGTTGTTTAGCTCTCTCAATGTTTTTGTTATGGTTCGATTCACATTACTTTCTGTGTCATTCTTTCCAAATTTAAAGTCTAATCCGATTCTTGCCTATTGCTGCAGTTGTTCAGTACGAGGCCCGCTGAAATGCTTATTCTGGCGATGGACTTCTAGTGTCATGAATATTTTGTAAGTCTCTTTTAGGAAAACAAGAACGGAACTCGATATGGCTTAAGGACTTGCT
GTAGTACAAACATGTTGATAATGCTGCCACTATAATATGATTAAATATTTTCCATTTCAATTGCCGTTATCCTCGTGAGATAACTTCATATCTAACTGATAATATGAATGGTTAATCTCTTGATAGACTGGATGGTTCACATTTGCATTTCGATATCAGTATTTGATTCCTATTCAAAATTTTCTTGTATTTTTCCTGCGAGGGTGGTAAATTTGTTTTGTTATTATTATTTTTAAGCATCATTGTTAGTGATCTTGATGGTTCTTTTAAGATGGGGTCCTAAAAATTAAAGGCTATGTTTTGATAATTTTTTTTTTTTTTGAAAATTAATGCTTACAATCATTATTTTACCTCTGGTTGTCTTATTTGATAGTACCTACTTTTTAAACTTGTTCTCAAGAGCAATGCTAAAATTTGAAAATTATAAAATTAGTTCTTGAAATTTTGATTACGTTTTGTAATTTTGTTTGTGTTTTTCAAAGTGAAAAACATACCCAAAAAAAAAAAAAAAAGTGTGGAAACAAGCCTAATTTTCAAAAATAAAAAACAAAATGTTTATCAAATGGGTCTAAGTGTTTTTTTTTTTTTTTTTTTGGTTTCATCTAGCGCTTGACTATATATGAGGTAATTCTTATTCAATTGGTTAAAGTTTGTTTGGATTATTAGTAAAAAAAAGTGTTTTTCAAATATTTTTTAAACACTTTTCATATAGAAAAAAACTATTTATAATTAAAACACTCCTATGTTTGGTTACATTTTCGTGAAAATATTATTATATATAAGTTTGTTTGGAATTAATGGCTTCAAATTTATATTTAAAACGATTGACCACTTTTGGTGGTCACGGTCGATAGTAGTTGGTGATGGTTAGCGACGGTAGTTTGAATGTTTTTCTTTTTAAGGCCAAAATTTACTCCGAGAAGCATAATGAAGTAAATTTCAGTATAGGTTAAAAAGTTAAATGTTAGATGATAGTTCCATGAGTTAGATGGAGTAGGCGAAAATGTCTCACTTCTTGTCTAACCTACAAACTTGAGAGATTTCTCATTTTAAGAAAACTAATTCAAAATTAACAAAGTAAATAACTCTATTTTACCAGTTTAATCTAAAAACTCATTGGGTCAAACGAAGAGGGTAGTTATCCACCTCACCTAATTCCTTCTCAAGTACCTCCTAAAAGAATACGGGTATATTCATGATCGTTTAGTTTGAAATGTATTTTTTTCTTTAAATTTTACGAACTTTTTAATATGGGTGCAACTTAAGATGGGATCAAATTTTTTTCAATGTCAAATAAGTTTAATTTTTTTCTTCAAATATCAAAGACTAGAAATCATTTTGAAACTTCAAACACTTAAATAAACAATAAACCGAAATTTAGAATAAATCTATTTTTCATAAAATGAACAATTAAACGAAAAAAATAGAGAATAATACTAGAGTTTTATACCTAATTGAGTGTCAATTATTTAGGGACGTTGCGATATTCTTAGCAAGAATGTCTATTATATTTCAAAGTCGATGTTTTAATTATCAAAATCATCCTTTCCACATCCTCAACAAAAATGGTAAATAATATACATAGTAAACTAACTCTTGAGCCGAGTTTATTTTGTTATTCATGGAAGCTATTATATATGGTAAATTTCTCCCCGTGACAAAAGAAAGTTAACACTTAAAAACGAAAACCAAATTTTAATATTAGATATCTAAAAGGACTTTTTTTAAATATAGAAAAATGATTTTTTTTATATTTGTAAATAATGTGATATTTTTTTTTTGTTTATAATAATTTTAGTTCTAGAATTTTATTAAATATATACAGATTTTTTAGGTAGGCATATATAGTTTTAAGTTAGAGGTGTTTTCTCAAATTTCGAAAACTGAATTATGAAAACTTATAGTCAGTATCGTCTTAGTTTCTGTAAGTTTTGTTCCATGTAGACATCTAATTTTAGACTTTAATACTCTCGATGAATCTTAAAATCAATTTGTTGTTA
AAATTTTCGAAACAAAATCTTTATTAATATTCTATTTATAAATTATGAATATATATTCACACGATATCTTTTCTCACATAAAAATTACTTATTCAAAATATAATGACTAAATTTAAAATTTATTGAGAGTTCCAACACTAAAATTGAGCAATTAAAAGTAGGAAAAAAAAATTACTTTTGGTTCTAATTTCCAACTAGTTCTTACGTTTCAATTATTATACTTTTAGTCTTTATTGAGTTTGATTTCAATTTAGTCCTTAGGTTTCAAAATATTACAATTACATTTTTGAGATTTGAGTTTTGTTACAATTTGGTATTACATTTCAAGATTTCCCTCCATTTTCACTAAATACCTCACTTTTACTATTTAATGTTAATGTATATTAATTAATTTAAAATAATTATAATTAATTAAGTTTCATTATTTTTTCATCACCATCAAAATTAATTTTAAATTTTAATTCATAATTAATTTGAATTAATCAATGAACATCATTTACACCAAATATTAAAAGCGAATATTTAATAAAAAAAATTGAAGTTAAAAAGTAAAAATGTCAAAGCCTAAGAACCAAATTGAAACAAAACTCAAATTGATAAAATTGTAACACTTTGAAACTCGGAGACTAAATCAACATCATACTAAAAAAGGACTAAAAATATAACATTTTAAAATCTATGGACCAAATAAAAACTAAATCTAAAATTTAGGGATAGAAAAAATATTCTTTCCTTAAAGGTATATGTACTAAATTCGAATCGAACTCAAAGTATAGGACCAGTTTTGGGAAATTTGTACGAATAACTAAAAAATTGGGCCGAAAAGGTCAAATGACCCGAATTTTTTAAAAAATGTCAAAGCACGTTTTGCTGATGTACAAGATCAAATGAAATTAACGAAATGAACTCGCGCACCCGAAAAAGCTTATCTCTCTTGCTCTTGCTCTCTCTTTTTTTTTTCTCACCCACAATCCATCTCATCTCGCTCACTCTCTTGTCACTCTCACTCTCTTTTCTTTTTCTCTCTCACAATTTCTCTCATCTCGCACTTCTCACTCTCACTTATTAGAATTGGTTGCGTCGAAGCTGTTCATGTGACTCGTCGTGTCGGAGAAGAAGCAAAATAAGTTGTAGCAAAAGAAAAAGATGGAGAATAAGAAGATGCAGAACAAAAAGAAGAAGAAGAAGAAAAAGGAAAAAAAAAGGGAGTCGTAAAGAAAAAAGAATAAAAAAATTAAGAAAGATAATATTATAATTTCAGGTGATATGAAATATCATTTGACATTTTTTTTCTTTCAAAAATTAGATCATTTACCATTTTCAACTTTTAAAAGGAGTGTTTTATGCAATTATTTCATCAATTTTAACCCAAATTTATTTAGTGAAGTGCACGATAAAATTATTCTTCCCTCTCTAAACTTTCTGTTCCCTTTACTCAATCGAGTGTGACTGCCATAAAATCGAAAAACTCAATCGTCAAATACGAATCAAATTATACTAATTTATCAGTCGCGCTCCTTTTAAAGATAAACAAAAACCTCGAATTTTAAGCAAGCCACCAGCAGATTACGATAGAAAGTAGAAGATAAAGGCTATCGTCCATTGGAAGAACAAGCTTTCCTCCTTGTAATTCCTTCGATACTCTCGCTAATGGAAATGGGTTAGTTTTCTTCTACTGCTTTTAGCTAATCGCTCACTCGCTTACTTTCAATTCGCCAGTTTGTTCTGATTTGTGTGCAGTGAAATGGCTTGACAATCCAATTTTCGCGCTCCTTTTCTAGGAAAGTAACTGGAAATCATTGTATATTTTTGTAGAAGAGTCAATATATAATCTATCTTAATCTATCTACTTGTTTTCTCTATTTTGAGTTGGATATGGCAGGGTAGGAAACATTTCTAATTTTGGCAAATAGTTGATTATTATAGATTTTCTATGTTCCCCTGTAATCTGAATCTTGTGGCTATATTTGGGTGTAGATTTTGTCTAAATTGGTGTAGTT
CTTAAATTGAGATTTTTCATTTTTTCTTCGATATTTCTGGGTCTAGTCAAATTTAGTACTAGGCTTCCTCAATGGTTCTAATTGCCTGATTAATTTCGAGGGAGCAGTACCATGCATATTTTCTTCTTTTTCTTCGTCTTCTTTTCATGATATTTCGTTGAGGTTTTGGCTGTTTTTTTTCTGTCGAACGAGTTTCTTTTGACTGGTAAGTTTTGTGTACTACAGCTTTTTCTTGTCTTGATTGGGGAAAGCCATTTCAGGAGAAGGTTGTTTTTTATATCTGGGTTTTCTCCCTTAATGCATGCACCTTCTTCCTATTCTTTGGGTTTCTGACTATTGGATTGTTTTGGTTATGCAATGTAATGTCGTTAAGTTGGGTTGCCTGCAATTTGATGTTTCAATAAACGACATTTCACTGGAGAAAATGGGTTTAATTTAATGGTCTCTTCATCAAAGCTCTTTTTGTGTTTCTTCCTTCAAGAAGTTTGAGATTTCTGATATTTCTACATTATCGCGGTTTGTTTTGCTAGAATCTGCCTGCATTTGTTTGATGGTCACATGAGAGTTGTTTTATTTGAGGTTCTAATTTCATGGTTTCATGATCACCCTCAAGAATTATTGAGGCAAGCCATGAGCAAATTTATGCGATGAGTGGAGCTCCCGACACTGGTTATAAAAAAGTGTATATTATTGTTGTAGTGGAAAGATTGAGACTCATTTCAATTGGATTAGATATCTTGAACTATTATTTGATAAAGATTATTATTGTTTCAAGTTGCTTACTTATACTCAACAAGGAAGTTGTCATTCTTTTTAGCCAATTGTTATTCTTTGTTTTTCTTTGGCACCGTCCCATATTTGATTTTCATTCTCATATTCTAGTTATATGTTGATATTGCATTTATTGATATTGTTCTGATCAAACGAATGCTATCATGTCAAAGCATTGTATTTAAGACAAAATATGAAAACCTTATCAGACATTTTTAATGAAAATATTCTTTTCAGGTTACTCATTCTTTTTCTTCATATCCTTTCGTGTGTATATTGTTCATATGTGAATGTCAAGATAAAAAATTAGTTTGTTATTTTAGACTCTCAGTATATATTCTATCATACCTTTTCTGGAGTTATTTTTGAACCTCACTTCATGTAAACAACCTTACTGGTTGCCCTTGGTCCTCTAGAATTCTTATCTTTTGAACCGTTTTGATGACAGTTACAGAAAACAATGCAGTGTGCTCTTGTAAGAAGTAGTAATTTTCAGAAAGTTCTAGACAAAGGAAAGGAGTCGTTAGAATTGAGACTCGAGGAAAACAGTTGTTCCAGAGGAATTAAGGTCCTAATTAAGAAAATTTTATTCTCTTTGGTTTTGGTTTTGTTTGCATAAAATTTTGTGTCTGAATATTTTATATTTAAATGTTATCCTTCATTCGTTTCAGGATTCTAAAGTTTCTTCTTTTGCATGGAGGAACTTTTTTGATTACAGGTAATGTTTTAGTCATACAATTGTTGAGCTATGACTTTCATGGAGAAATTATTGATTACTATTTCTCAACTCATTTGGTTTACTTTTTCAGATGTGCTGTCATTAGTTTTCTTACAGTCGAATCTGATGGACTCTGGAGAATTGTTGCACTACCACCACAATACCTAGATAGCTTGGATGTGAGCTGTCTGCCTCAAATGAATCAGTCTACAGCTGAGAGAAAATTGGTGCAGAAAGGCCCTGCCTCTAATGGTACATATTCATTTAATTCATTCAGATGTAGAAGCTTGCTGGAGTCAAATAATAAGTTATTCGATAGTAAAGCAATTAAGTCGTCGAATAAATCCTCTGGCAAGTTCTCGTGTAGGAGTTCATGCTCTGGCTCTGCTTTGATGTCGAGTGACTCTAGTGCAATCTCTGACATCCCCGTTGGTGGAGCTAAAATGCAGAGATATGGGAAGAAAAATCCAAGAAAGAAGGCAAAAAAGAAAGAAATAGAATGTAAGAAGAT
ATCTTCTGATTTTGTCTCTGCTGAAACAGAAGTATCATCCAAGGATTCTGCCCATGGAAGTTTTTTGTCTGAAGCTTGTGGCAATAATGATTCAGATTGTAGAGATGGATCTGTTTTGTGTTCGATTGCACAAGGAACTTTTCTGCCAGATTTTAGGGCCAATAAAAATGATTTTAAACGAGATTCTGAGAGGATTATTCAGCCACTTGGAACCACAGATTCAATATCCTCTAATATTGTTGACGGGAATGCATCTGAGGTTTCATCTTCTGCATCAAAGAATTTTAGTGGGTATTATAAAGTTTGTGGATCCAAAAACCAGGCCCTAATCAAAGTACCTGGTTGTACCCATGTCAATGGGGGAGTAAATTCAAGAGAGAGGTTATTTGCTGGCAGCTACAATGATTTTTGCTCCAAGGATTCTTTGGATAATAATTCCCCAGATTCTAACTGTTTTAGTTCAAACGGTAACTCTGATAATTTTAACTTGAAATTAGATGAAAAGAAATGTTTTGGAGTTGATCTGTTGGAAGAAAGAAGTTCACCTTCTAGAGTGAACTATTGTTCTCATAATTCAGTAAGAGATGAAGTAGATGTGAATGCCAAAGTGGAGAAAGCTAATCGTGGTATTCGGGGATGTACTGTTAGTGAAACTTGTTCGGTTTTACCTGGAAAGAAAACTAAACAAAATAAAAAATTGACCGGGAGTTCAAGGATGAATAGATATGGTGGTCTGGGGAGTTCACAAAGGCGTACGGGGAAGGAAAACAGACTTACTGTCTGGCAAAAGGTTCAAAGGAATAATAGTGGTGAATGTTGTGAACAGTTAGACCAAGTAAGTCCTATCAGCAAACATTTTAAAGGCATCTGTAATCCTGTTGTTGGTGTGCAAATGCCAAAGGTCAAGGATAAAAAAACCGGGAACAGAAAACAGTTGAAAGAAAAATTTCCCAGGAGGTTGAAAAGAAAAAATACTTCAGGACAAGAGAAGATCTATCGTCCTACTAGGAACAATTGTGGTAGTAATACTAGTTCAATGGTTTACAAACCACCAAATGGAAGGTTGGATATTCGATCAGTGGGCTTTGACATAAGAAGATCAAGTGGCGATCCAAGATCTCGTTTTCATAATGATACAACTGATAAATGCACGACTTCTGAATCATTTGAAAGTACACAAGTCTGTCTTGATGGATTGGTGTCAAGCAAACTTATCTCCGATGGTTTGAATAGTAAAAAAGTAGAGAATGACTCTGGCTCATCGCCAAGGTCCTGCAACTCCTTAAATCAGTCAAATCTGGTAGAGGTTCAGTCTCCTGTTTACCTTCCTCATCTTTTCTTTCAAGCAACAAAAGGAAGTTCTCTTGCTGAATGCAGCAAGCACAATAACCAATCTAGATCACCCCTTCATAACTGGTTGCCAAGTGGGGCAGAAGGTTCCAGATTGGCCACCTTGGCCAGACCTGATTTTTCATCTCTGAAAGATGCAAGTACGCGACCTACTGAGTTTGGCACTTCAGAAAAATCAATTCAAGAAAGAGTCAATTGCAACATAGTAGATCCTGTTTCTGTTGTAACTGAGGGGATTCAGCATTCTAGAGATGGGAATCATGGTCCTTTAGAACATGAATGTGAGGTGCCGAAGGTGTATGGTTACAATACAGCTGCACTACAGGATCATAGGTGTGAGTTTGATGTGGATGAGCATTTTAATTCCAAATCCTCATGTGAAGATGCATCTAGAATGGAGCAAGCAGTGAATAATGCATGTAGGGCGCAATTGGTATCTGAAGCTATTCAAATGGAAACCGGTAGTCCAATCGCAGAATTCGAAAGATTCCTTCAATTGTCCTCCCCTGTTATCAACCAGAGACCCAAGTTAAGAAGTAGTGAAATTTACCCAAGAAATCCACCAGGTGATGTGATACCATGTAGCAATGAGACCGCCGACATTTCTTTGGGTTGCCTGTGGCAATGGTATGAAAAACATGGCAACTATG
GCTTAGAAATAAAAGCCAATGGTCATGAAAATTCAAATGGATTTGGCGCTGATAACTCTGCATTCTGTGCATATTTTGTTCCATTTCTTTCAGCTGTTCAACTATTCAAGAGCCATAAAACTCATGCTCCAACTACAGGTCCTGTGGGATTTGATTCATGTGTAAGCGATATAAAAGTGAAGGAGCCGTCGACTTGTCATCTTCCAATATTTTCAGTCCTTTTTCCTAAGCCCTGTACTGATGATGCAAGTGTTCTGCGGGTTTGTAATCAGTTACATGGTTCAGAGCAACATTTGGGTTCTGAGAGGAGCAAATCTTCAGAACAATCTGTCAACTTAAAATCATCTGGAGAATCAGAACTTATTTTTGAATATTTTGAAGGGGAACAACCTCAGCAGAGAAGGCCGTTATTTGATAAGTAATTGCTCCTACGCTTTTTTGGAAATTACTAGTGTGATTTTGTTGGTTATATATTTTCTAACTCAGATATTTTCTTATCATACCTCAACTTATTATTATTATTGTTGTTGTTGTTGTGTGTGTGTGTGTGGGCAAGTAATAGGATACGAAATATAACAAATTAGTATCATCTCAGCTTTCTTCCCAAACCCTAAGTTTCTCCTAAACATCCTCTAATGTCTTTCCAACAAGATCTTTTAAGGACTACCATTTATTGCACGAGACTAGGAATCTTTGCTTTCCCTCGCAATGTTTGATCTTAGTAGTTTTGGCATTCCAGTAGACGCATCATTCGGTAAATGCTAACTCAAACCCAACTTTTGAACTGGTGATACCCACACTTAGAAATATATTTGGAGAAGATGAATAAATGACAGCCCCCGAGGAATCATTTGGTAGGGGTTGGGACTTTTGATGGGGTTAGAGCTCAGGTTCTAGATTTAAGCCTTGGAGTGGAAACTTTAATGCAGGAGTTGATGTTTCTTGGAGTTGGCAAAGGACCATGGAAAATCTCCTTGGATGAGTGGGACGTGCTCGTTACCATGAATCCCTAAAAGAAGAAGTACGAGAGTGAATGATCATCGTTCTCACTACTCTAATAAGAGAGACACACCAATTAGAAGACATTCTACTCTATCCAGCAGTATTTATGTCTTTCAAGACAAGTGTCAGTCATAAAAAATTTATTTTTGGAAATTTCCTTTTCATGATAGTTGGGTCCGATGATTTGACAAAACAACTACGTTAGAAGTAAGTTGGCGAAGTACCAAGTAGTATATTGTATTTACTATATTCACCCAAATTACCCAATCTTCACAACACAGAATAACCATGAGGAATTAGCCACTTAGCTGACGATTCTCCAAAATTTCTAGCTATGAATTTATTTTCTTTTAAATCTTGTTCTGTCTAGTTTAGTCTAAATTGGAAGTTTCACCCTGCCTTGGCCTCATTCCAACAATGGTTCATTCATCTCTTATTGGGGTTGGCCAGTTAGCAGTCTTCCCAAGAAGAAATGCTTTCCCTATTCCTCACTCTAAATTTTACGTAAGCATCTATATAAGCTTAAAAAAAAACAAAAACAGAACTTTTCATTAATGAAATGAAAAGAGGCTAATGCTCAAAATACAATGAAACAAAAGAACAAAAAGACCCGATCATAGAAACTAGGGATCAGTAGGTGCACCCATTTAAGCTTTTCTGTTAAAAACAAATTCCTTTGGCTTCACTGAATCCTTTGTCATTTGTAAACCATCTACCTACATTAGTTCCATATATAATAACCACAGCAAAGTGCCAAATAGTTGCTGTTGCTGGAGGAAAAATCCCATGCTCATTCCAATTCCATGGCCATTTTTCTCTTTCAGGTTACCTCACTCGAGACTGCTGCTTCTTAGGTTAAAGACAAATTTTCCATTAACTAAAACTTCATAATCAACTAAATGAATTCTCTAACATATCTCTCCATTGGTTAGCCGTATTATGAATTTTAAAGAGAGAAAATAGGAAGGAATGTTGGATAGCACAGATTGATAAAGAGT
AGGGTGACCTCTTCTTGAAGGGGAAAAATCGTGTTCTTTCATTTATCCAGTTTCCCCGATACTCATTTGATACTACTGGATTCCAAAACCAACCTTGATTACATGTCCTTCAAGAGGCATACCTAGAGAAGTTGGAGACCAACGTGTAAATGTTGCATTCCATCTCTTGCTGCTTAATGTTCAAGAGGCATTCCTGGAGATTCCAGTTTCTCTAAAACACTGAAAGTATATATAATCCTCCTGTCGACCTCACACAATTTGACTCTTGAAGATAAAAATGTTTGACAATGCCCCTTTCTTCTTCACACTACAAACATTTCATTACAAAATCTTCACAATTTAACAAATAACAACCTATCAGACCCTAGTGAAATTTATGAAGAAAACATTTTCTATTGTCATTATGAACTTCGAGTGGGCACTGGCATCTATTTATATATTTATTCATGATCATGTGATTTGCCTTTAACCCCTTTCTCCTTGATCAATAACTTTATGATAGTATTCTGGTTTTAAGGAATGGTTGCATTTCAAATTCTGCATTCCAGTTACTTAATACAAGGATCAACAAAGTGCTTGATATGTTGGTTATCTTAGAAAGCTGTGATAAGGCTTTACACATCATTCTTAATTGCTAATTGAGAACTATTTTGAAATTGATCTCAGTTTTATTGCTGAATCAGGATACATCAACTGGTTGAGGGAGATGGACGTCCACAGGGAAAAATTTATGGGGATCCGACCATGCTCAATTCCATAACTTTGAATGATCTGCATGCTGGATCATGGTTGGTTGTGACAGACATTTGCAGTGTTTGCATTAGTTCCATAGTCTACTAAAACTAAATCATATATGACCAAATTCTTGATGTATCTTTACATTGTACAAAATAATACGAATGTCATTGACACATCACGTCAGAGAACAAAAAAGTGACTTGTTCGTTTCTGGCATTTCAGGTACTCAGTGGCATGGTATCCCATTTATAGGATACCAGATGGCAACCTTCGAGCTGCATTTTTGACTTACCACTCACTAGGACATTTTGTTTCAAGAACTTCCCAACCTAACTCTCCAGATACAAATTCTTGTTTAGTTTGTCCAGTCGTGGGTCTTCAAAGTTATAATGCACAGGTAAAGTTTTGCTGATTACTACTACTAATTCCAACCTGCTTTAACGTTTTGAAGTGGTTGTGTATATTGTTTAGATATTGCTAAAGGAAACAAAGAAACAACACGTTCTGAAGATTGTATTTGCCTTCTCTCTCTCTCTGTATAAAATATGTGTTAATTGATTAGTACTTTATTCTTGTTTATCTCAGAGGCTAATTACGTATGGATGGAAATGTGTAGGCTTTTGCCATTCAACTTTATTAATTGCGATTAGGGGCTACCTATTCTCTGTGATACTTTCCTATGAGCAATACTTGGGTGCAGCGCAGGTTACACCCATCTTCATGCATCCCCTTCTAATCACAAAGAGAAATAATATATATTAAAAAAAAAATTTAAAAAAGAAAATTTTGCCATTTGGCAGCTTGTGATTGAACACTTAGGTGAGATGCACAAAGATGGGTGCAGCTCGAGATGCACTTAAGTATTTGTCCTTCCTTATAATCCCTGAAAGGAAAAGTTGATAATGAAACACTGATCCTCCGACATTTTATTAGACCACCTCAGGTTTCCTGACATTACTAATGGAAAGTGGTTGATTATAAGATGTCTAGACAAACCATTGATTTTCTCCCTCCAAATCTTTAAGTGAAAGAGGAAACGTTTGAGTTGAAAATGCCTAGCTTCTTCTAAGGGGGATAAGAAAACCACCTTAGATTCAGCATAGTTGTCATCCATCAAGTTTTGAACGGTCGACCTTAATTGATAAATTTCTATTATTGCTTCTGCAGATCTTTAACAAAGTTTCTAAAATTGTTGCATTGCATGAGGATTGTTGTGCATCTTTCATAAACACTCTCTTTGTTCCATGAGCAGAATGAATGCT
GGTTTGAGCCTAGAAACAGTACGCCCACGTTAACCCCTGGCTTGAGTCCTCCTAGAATCCTCGAGGAGCGCCTGAGGACGCTGGAAGAGACTGCATCTCTCATGGCCAGAGCTGTGGTTAAGAAAGGAAATCTGAACTCTGAAAACACGCATCCAGATTACGAGTTCTTCCTCTCACGGCGACTCTAGTTACATATCCAGATGTTTCATGCTTGGAACTTATAAAGGAGATTCCTTCCTTTTGTCAATATTCTTGAGGATTTAGTTTAGGTTAGGACCTAATCTCAAGTAGGATGTAAGAGAGCAGCTGATCTTAGTCTGTAATGTATTTACACAATGTTTTTTGTTCCTTTTTCTGTCTTGCTACGCTAACTCTTCAGGCAGTGCAACTTTCATCTGTATATATTGTTTTTTACACAAATTACCTCATGTAAAAATGATATCTCCTGGATTTATAGAGATTTTGAGCATTTTCTTTTTCTTATTTCGTTTTAAAATTAGTTCGGTTGATACACACCA

mRNA sequence

AGAAAATAATAATAATAAAAGAAAACCCTACTACACGTTCCCTCTCCCCTTTTATTTTTGTTTGACGTTTGATTTTTCCCTCACCCATAATTGTAGCAGCCACAACCAAAGCCCTACGCGACTTTGATTTCTCTCTGCTCTCGTATCTCCCTCGCAGCGATAGGAAAATTTTTGAAGTTTGACTCATGGATCCTCAACCTGAACCAGTCAGCTACATCTGTGGAGATTGTGGAATGGAGAACACTCTGAAGCAGGGTGATGTTATACAGTGCCGAGAGTGTGGTTATCGTATTCTCTACAAGAAGCGCACCCGTCGCACTTTTTCTTGTCTTGATTGGGGAAAGCCATTTCAGGAGAAGTTACAGAAAACAATGCAGTGTGCTCTTGTAAGAAGTAGTAATTTTCAGAAAGTTCTAGACAAAGGAAAGGAGTCGTTAGAATTGAGACTCGAGGAAAACAGTTGTTCCAGAGGAATTAAGGATTCTAAAGTTTCTTCTTTTGCATGGAGGAACTTTTTTGATTACAGATGTGCTGTCATTAGTTTTCTTACAGTCGAATCTGATGGACTCTGGAGAATTGTTGCACTACCACCACAATACCTAGATAGCTTGGATGTGAGCTGTCTGCCTCAAATGAATCAGTCTACAGCTGAGAGAAAATTGGTGCAGAAAGGCCCTGCCTCTAATGGTACATATTCATTTAATTCATTCAGATGTAGAAGCTTGCTGGAGTCAAATAATAAGTTATTCGATAGTAAAGCAATTAAGTCGTCGAATAAATCCTCTGGCAAGTTCTCGTGTAGGAGTTCATGCTCTGGCTCTGCTTTGATGTCGAGTGACTCTAGTGCAATCTCTGACATCCCCGTTGGTGGAGCTAAAATGCAGAGATATGGGAAGAAAAATCCAAGAAAGAAGGCAAAAAAGAAAGAAATAGAATGTAAGAAGATATCTTCTGATTTTGTCTCTGCTGAAACAGAAGTATCATCCAAGGATTCTGCCCATGGAAGTTTTTTGTCTGAAGCTTGTGGCAATAATGATTCAGATTGTAGAGATGGATCTGTTTTGTGTTCGATTGCACAAGGAACTTTTCTGCCAGATTTTAGGGCCAATAAAAATGATTTTAAACGAGATTCTGAGAGGATTATTCAGCCACTTGGAACCACAGATTCAATATCCTCTAATATTGTTGACGGGAATGCATCTGAGGTTTCATCTTCTGCATCAAAGAATTTTAGTGGGTATTATAAAGTTTGTGGATCCAAAAACCAGGCCCTAATCAAAGTACCTGGTTGTACCCATGTCAATGGGGGAGTAAATTCAAGAGAGAGGTTATTTGCTGGCAGCTACAATGATTTTTGCTCCAAGGATTCTTTGGATAATAATTCCCCAGATTCTAACTGTTTTAGTTCAAACGGTAACTCTGATAATTTTAACTTGAAATTAGATGAAAAGAAATGTTTTGGAGTTGATCTGTTGGAAGAAAGAAGTTCACCTTCTAGAGTGAACTATTGTTCTCATAATTCAGTAAGAGATGAAGTAGATGTGAATGCCAAAGTGGAGAAAGCTAATCGTGGTATTCGGGGATGTACTGTTAGTGAAACTTGTTCGGTTTTACCTGGAAAGAAAACTAAACAAAATAAAAAATTGACCGGGAGTTCAAGGATGAATAGATATGGTGGTCTGGGGAGTTCACAAAGGCGTACGGGGAAGGAAAACAGACTTACTGTCTGGCAAAAGGTTCAAAGGAATAATAGTGGTGAATGTTGTGAACAGTTAGACCAAGTAAGTCCTATCAGCAAACATTTTAAAGGCATCTGTAATCCTGTTGTTGGTGTGCAAATGCCAAAGGTCAAGGATAAAAAAACCGGGAACAGAAAACAGTTGAAAGAAAAATTTCCCAGGAGGTTGAAAAGAAAAAATACTTCAGGACAAGAGAAGATCTATCGTCCTACTAGGAACAATTGTGGTAGTAATACTAGTTCAATGGTTTACAAACCACCA
AATGGAAGGTTGGATATTCGATCAGTGGGCTTTGACATAAGAAGATCAAGTGGCGATCCAAGATCTCGTTTTCATAATGATACAACTGATAAATGCACGACTTCTGAATCATTTGAAAGTACACAAGTCTGTCTTGATGGATTGGTGTCAAGCAAACTTATCTCCGATGGTTTGAATAGTAAAAAAGTAGAGAATGACTCTGGCTCATCGCCAAGGTCCTGCAACTCCTTAAATCAGTCAAATCTGGTAGAGGTTCAGTCTCCTGTTTACCTTCCTCATCTTTTCTTTCAAGCAACAAAAGGAAGTTCTCTTGCTGAATGCAGCAAGCACAATAACCAATCTAGATCACCCCTTCATAACTGGTTGCCAAGTGGGGCAGAAGGTTCCAGATTGGCCACCTTGGCCAGACCTGATTTTTCATCTCTGAAAGATGCAAGTACGCGACCTACTGAGTTTGGCACTTCAGAAAAATCAATTCAAGAAAGAGTCAATTGCAACATAGTAGATCCTGTTTCTGTTGTAACTGAGGGGATTCAGCATTCTAGAGATGGGAATCATGGTCCTTTAGAACATGAATGTGAGGTGCCGAAGGTGTATGGTTACAATACAGCTGCACTACAGGATCATAGGTGTGAGTTTGATGTGGATGAGCATTTTAATTCCAAATCCTCATGTGAAGATGCATCTAGAATGGAGCAAGCAGTGAATAATGCATGTAGGGCGCAATTGGTATCTGAAGCTATTCAAATGGAAACCGGTAGTCCAATCGCAGAATTCGAAAGATTCCTTCAATTGTCCTCCCCTGTTATCAACCAGAGACCCAAGTTAAGAAGTAGTGAAATTTACCCAAGAAATCCACCAGGTGATGTGATACCATGTAGCAATGAGACCGCCGACATTTCTTTGGGTTGCCTGTGGCAATGGTATGAAAAACATGGCAACTATGGCTTAGAAATAAAAGCCAATGGTCATGAAAATTCAAATGGATTTGGCGCTGATAACTCTGCATTCTGTGCATATTTTGTTCCATTTCTTTCAGCTGTTCAACTATTCAAGAGCCATAAAACTCATGCTCCAACTACAGGTCCTGTGGGATTTGATTCATGTGTAAGCGATATAAAAGTGAAGGAGCCGTCGACTTGTCATCTTCCAATATTTTCAGTCCTTTTTCCTAAGCCCTGTACTGATGATGCAAGTGTTCTGCGGGTTTGTAATCAGTTACATGGTTCAGAGCAACATTTGGGTTCTGAGAGGAGCAAATCTTCAGAACAATCTGTCAACTTAAAATCATCTGGAGAATCAGAACTTATTTTTGAATATTTTGAAGGGGAACAACCTCAGCAGAGAAGGCCGTTATTTGATAAGATACATCAACTGGTTGAGGGAGATGGACGTCCACAGGGAAAAATTTATGGGGATCCGACCATGCTCAATTCCATAACTTTGAATGATCTGCATGCTGGATCATGGTACTCAGTGGCATGGTATCCCATTTATAGGATACCAGATGGCAACCTTCGAGCTGCATTTTTGACTTACCACTCACTAGGACATTTTGTTTCAAGAACTTCCCAACCTAACTCTCCAGATACAAATTCTTGTTTAGTTTGTCCAGTCGTGGGTCTTCAAAGTTATAATGCACAGAATGAATGCTGGTTTGAGCCTAGAAACAGTACGCCCACGTTAACCCCTGGCTTGAGTCCTCCTAGAATCCTCGAGGAGCGCCTGAGGACGCTGGAAGAGACTGCATCTCTCATGGCCAGAGCTGTGGTTAAGAAAGGAAATCTGAACTCTGAAAACACGCATCCAGATTACGAGTTCTTCCTCTCACGGCGACTCTAGTTACATATCCAGATGTTTCATGCTTGGAACTTATAAAGGAGATTCCTTCCTTTTGTCAATATTCTTGAGGATTTAGTTTAGGTTAGGACCTAATCTCAAGTAGGATGTAAGAGAGCAGCTGATCTTAGTCTGTAATGTATTTACACAATGTTTTTTGT
TCCTTTTTCTGTCTTGCTACGCTAACTCTTCAGGCAGTGCAACTTTCATCTGTATATATTGTTTTTTACACAAATTACCTCATGTAAAAATGATATCTCCTGGATTTATAGAGATTTTGAGCATTTTCTTTTTCTTATTTCGTTTTAAAATTAGTTCGGTTGATACACACCA

Coding sequence (CDS)

ATGGATCCTCAACCTGAACCAGTCAGCTACATCTGTGGAGATTGTGGAATGGAGAACACTCTGAAGCAGGGTGATGTTATACAGTGCCGAGAGTGTGGTTATCGTATTCTCTACAAGAAGCGCACCCGTCGCACTTTTTCTTGTCTTGATTGGGGAAAGCCATTTCAGGAGAAGTTACAGAAAACAATGCAGTGTGCTCTTGTAAGAAGTAGTAATTTTCAGAAAGTTCTAGACAAAGGAAAGGAGTCGTTAGAATTGAGACTCGAGGAAAACAGTTGTTCCAGAGGAATTAAGGATTCTAAAGTTTCTTCTTTTGCATGGAGGAACTTTTTTGATTACAGATGTGCTGTCATTAGTTTTCTTACAGTCGAATCTGATGGACTCTGGAGAATTGTTGCACTACCACCACAATACCTAGATAGCTTGGATGTGAGCTGTCTGCCTCAAATGAATCAGTCTACAGCTGAGAGAAAATTGGTGCAGAAAGGCCCTGCCTCTAATGGTACATATTCATTTAATTCATTCAGATGTAGAAGCTTGCTGGAGTCAAATAATAAGTTATTCGATAGTAAAGCAATTAAGTCGTCGAATAAATCCTCTGGCAAGTTCTCGTGTAGGAGTTCATGCTCTGGCTCTGCTTTGATGTCGAGTGACTCTAGTGCAATCTCTGACATCCCCGTTGGTGGAGCTAAAATGCAGAGATATGGGAAGAAAAATCCAAGAAAGAAGGCAAAAAAGAAAGAAATAGAATGTAAGAAGATATCTTCTGATTTTGTCTCTGCTGAAACAGAAGTATCATCCAAGGATTCTGCCCATGGAAGTTTTTTGTCTGAAGCTTGTGGCAATAATGATTCAGATTGTAGAGATGGATCTGTTTTGTGTTCGATTGCACAAGGAACTTTTCTGCCAGATTTTAGGGCCAATAAAAATGATTTTAAACGAGATTCTGAGAGGATTATTCAGCCACTTGGAACCACAGATTCAATATCCTCTAATATTGTTGACGGGAATGCATCTGAGGTTTCATCTTCTGCATCAAAGAATTTTAGTGGGTATTATAAAGTTTGTGGATCCAAAAACCAGGCCCTAATCAAAGTACCTGGTTGTACCCATGTCAATGGGGGAGTAAATTCAAGAGAGAGGTTATTTGCTGGCAGCTACAATGATTTTTGCTCCAAGGATTCTTTGGATAATAATTCCCCAGATTCTAACTGTTTTAGTTCAAACGGTAACTCTGATAATTTTAACTTGAAATTAGATGAAAAGAAATGTTTTGGAGTTGATCTGTTGGAAGAAAGAAGTTCACCTTCTAGAGTGAACTATTGTTCTCATAATTCAGTAAGAGATGAAGTAGATGTGAATGCCAAAGTGGAGAAAGCTAATCGTGGTATTCGGGGATGTACTGTTAGTGAAACTTGTTCGGTTTTACCTGGAAAGAAAACTAAACAAAATAAAAAATTGACCGGGAGTTCAAGGATGAATAGATATGGTGGTCTGGGGAGTTCACAAAGGCGTACGGGGAAGGAAAACAGACTTACTGTCTGGCAAAAGGTTCAAAGGAATAATAGTGGTGAATGTTGTGAACAGTTAGACCAAGTAAGTCCTATCAGCAAACATTTTAAAGGCATCTGTAATCCTGTTGTTGGTGTGCAAATGCCAAAGGTCAAGGATAAAAAAACCGGGAACAGAAAACAGTTGAAAGAAAAATTTCCCAGGAGGTTGAAAAGAAAAAATACTTCAGGACAAGAGAAGATCTATCGTCCTACTAGGAACAATTGTGGTAGTAATACTAGTTCAATGGTTTACAAACCACCAAATGGAAGGTTGGATATTCGATCAGTGGGCTTTGACATAAGAAGATCAAGTGGCGATCCAAGATCTCGTTTTCATAATGATACAACTGATAAATGCACGACTTCTGAATCATTTGAAAGTACACAAGTCTGTCTTGATGGATTGGTGTCAAGCAAACTTATCTCCGATGGTTTGAATAGTAAAAA
AGTAGAGAATGACTCTGGCTCATCGCCAAGGTCCTGCAACTCCTTAAATCAGTCAAATCTGGTAGAGGTTCAGTCTCCTGTTTACCTTCCTCATCTTTTCTTTCAAGCAACAAAAGGAAGTTCTCTTGCTGAATGCAGCAAGCACAATAACCAATCTAGATCACCCCTTCATAACTGGTTGCCAAGTGGGGCAGAAGGTTCCAGATTGGCCACCTTGGCCAGACCTGATTTTTCATCTCTGAAAGATGCAAGTACGCGACCTACTGAGTTTGGCACTTCAGAAAAATCAATTCAAGAAAGAGTCAATTGCAACATAGTAGATCCTGTTTCTGTTGTAACTGAGGGGATTCAGCATTCTAGAGATGGGAATCATGGTCCTTTAGAACATGAATGTGAGGTGCCGAAGGTGTATGGTTACAATACAGCTGCACTACAGGATCATAGGTGTGAGTTTGATGTGGATGAGCATTTTAATTCCAAATCCTCATGTGAAGATGCATCTAGAATGGAGCAAGCAGTGAATAATGCATGTAGGGCGCAATTGGTATCTGAAGCTATTCAAATGGAAACCGGTAGTCCAATCGCAGAATTCGAAAGATTCCTTCAATTGTCCTCCCCTGTTATCAACCAGAGACCCAAGTTAAGAAGTAGTGAAATTTACCCAAGAAATCCACCAGGTGATGTGATACCATGTAGCAATGAGACCGCCGACATTTCTTTGGGTTGCCTGTGGCAATGGTATGAAAAACATGGCAACTATGGCTTAGAAATAAAAGCCAATGGTCATGAAAATTCAAATGGATTTGGCGCTGATAACTCTGCATTCTGTGCATATTTTGTTCCATTTCTTTCAGCTGTTCAACTATTCAAGAGCCATAAAACTCATGCTCCAACTACAGGTCCTGTGGGATTTGATTCATGTGTAAGCGATATAAAAGTGAAGGAGCCGTCGACTTGTCATCTTCCAATATTTTCAGTCCTTTTTCCTAAGCCCTGTACTGATGATGCAAGTGTTCTGCGGGTTTGTAATCAGTTACATGGTTCAGAGCAACATTTGGGTTCTGAGAGGAGCAAATCTTCAGAACAATCTGTCAACTTAAAATCATCTGGAGAATCAGAACTTATTTTTGAATATTTTGAAGGGGAACAACCTCAGCAGAGAAGGCCGTTATTTGATAAGATACATCAACTGGTTGAGGGAGATGGACGTCCACAGGGAAAAATTTATGGGGATCCGACCATGCTCAATTCCATAACTTTGAATGATCTGCATGCTGGATCATGGTACTCAGTGGCATGGTATCCCATTTATAGGATACCAGATGGCAACCTTCGAGCTGCATTTTTGACTTACCACTCACTAGGACATTTTGTTTCAAGAACTTCCCAACCTAACTCTCCAGATACAAATTCTTGTTTAGTTTGTCCAGTCGTGGGTCTTCAAAGTTATAATGCACAGAATGAATGCTGGTTTGAGCCTAGAAACAGTACGCCCACGTTAACCCCTGGCTTGAGTCCTCCTAGAATCCTCGAGGAGCGCCTGAGGACGCTGGAAGAGACTGCATCTCTCATGGCCAGAGCTGTGGTTAAGAAAGGAAATCTGAACTCTGAAAACACGCATCCAGATTACGAGTTCTTCCTCTCACGGCGACTCTAG

Protein sequence

MDPQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRRTFSCLDWGKPFQEKLQKTMQCALVRSSNFQKVLDKGKESLELRLEENSCSRGIKDSKVSSFAWRNFFDYRCAVISFLTVESDGLWRIVALPPQYLDSLDVSCLPQMNQSTAERKLVQKGPASNGTYSFNSFRCRSLLESNNKLFDSKAIKSSNKSSGKFSCRSSCSGSALMSSDSSAISDIPVGGAKMQRYGKKNPRKKAKKKEIECKKISSDFVSAETEVSSKDSAHGSFLSEACGNNDSDCRDGSVLCSIAQGTFLPDFRANKNDFKRDSERIIQPLGTTDSISSNIVDGNASEVSSSASKNFSGYYKVCGSKNQALIKVPGCTHVNGGVNSRERLFAGSYNDFCSKDSLDNNSPDSNCFSSNGNSDNFNLKLDEKKCFGVDLLEERSSPSRVNYCSHNSVRDEVDVNAKVEKANRGIRGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGLGSSQRRTGKENRLTVWQKVQRNNSGECCEQLDQVSPISKHFKGICNPVVGVQMPKVKDKKTGNRKQLKEKFPRRLKRKNTSGQEKIYRPTRNNCGSNTSSMVYKPPNGRLDIRSVGFDIRRSSGDPRSRFHNDTTDKCTTSESFESTQVCLDGLVSSKLISDGLNSKKVENDSGSSPRSCNSLNQSNLVEVQSPVYLPHLFFQATKGSSLAECSKHNNQSRSPLHNWLPSGAEGSRLATLARPDFSSLKDASTRPTEFGTSEKSIQERVNCNIVDPVSVVTEGIQHSRDGNHGPLEHECEVPKVYGYNTAALQDHRCEFDVDEHFNSKSSCEDASRMEQAVNNACRAQLVSEAIQMETGSPIAEFERFLQLSSPVINQRPKLRSSEIYPRNPPGDVIPCSNETADISLGCLWQWYEKHGNYGLEIKANGHENSNGFGADNSAFCAYFVPFLSAVQLFKSHKTHAPTTGPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVLRVCNQLHGSEQHLGSERSKSSEQSVNLKSSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRPQGKIYGDPTMLNSITLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFVSRTSQPNSPDTNSCLVCPVVGLQSYNAQNECWFEPRNSTPTLTPGLSPPRILEERLRTLEETASLMARAVVKKGNLNSENTHPDYEFFLSRRL
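
As a quick consistency check between the coding sequence and the protein sequence, the first codons of the CDS can be translated with the standard genetic code. This is a minimal sketch: `translate` is a hypothetical helper, the use of the standard nuclear code is an assumption, and only the first 45 nucleotides of the CDS listed above are used.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a DNA coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 45 nucleotides of the CDS shown in this record.
cds_start = "ATGGATCCTCAACCTGAACCAGTCAGCTACATCTGTGGAGATTGT"
print(translate(cds_start))
```

The output can be compared with the first 15 residues of the protein sequence above.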
BLAST of Lsi02G002760 vs. Swiss-Prot
Match: NRPBC_ARATH (DNA-directed RNA polymerases II, IV and V subunit 12 OS=Arabidopsis thaliana GN=NRPB12 PE=1 SV=1)

HSP 1 Score: 92.8 bits (229), Expect = 2.7e-17
Identity = 39/44 (88.64%), Positives = 41/44 (93.18%), Query Frame = 1

Query: 1  MDPQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRR 45
          MDP PEPV+Y+CGDCG ENTLK GDVIQCRECGYRILYKKRTRR
Sbjct: 1  MDPAPEPVTYVCGDCGQENTLKSGDVIQCRECGYRILYKKRTRR 44
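
The identity and positive counts reported for this HSP can be recomputed from the three alignment rows. The sketch below assumes the usual BLAST pairwise convention for the middle row (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch). `alignment_stats` is a hypothetical helper, not part of any BLAST tool.

```python
def alignment_stats(query, midline, sbjct):
    """Return (identities, positives, alignment_length) for one aligned block."""
    if not (len(query) == len(midline) == len(sbjct)):
        raise ValueError("alignment rows must have equal length")
    # Identical residues, ignoring positions where both rows hold a gap.
    identities = sum(q == s and q != "-" for q, s in zip(query, sbjct))
    # Every non-space midline character is an identity or a '+' positive.
    positives = sum(ch != " " for ch in midline)
    return identities, positives, len(query)


query   = "MDPQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRR"
midline = "MDP PEPV+Y+CGDCG ENTLK GDVIQCRECGYRILYKKRTRR"
sbjct   = "MDPAPEPVTYVCGDCGQENTLKSGDVIQCRECGYRILYKKRTRR"

ident, pos, length = alignment_stats(query, midline, sbjct)
print(f"Identity = {ident}/{length} ({100 * ident / length:.2f}%)")
print(f"Positives = {pos}/{length} ({100 * pos / length:.2f}%)")
```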

BLAST of Lsi02G002760 vs. Swiss-Prot
Match: RPBCL_ARATH (DNA-directed RNA polymerase subunit 12-like protein OS=Arabidopsis thaliana GN=NRPB12L PE=3 SV=1)

HSP 1 Score: 69.3 bits (168), Expect = 3.3e-10
Identity = 30/41 (73.17%), Positives = 34/41 (82.93%), Query Frame = 1

Query: 2  DPQPEP-VSYICGDCGMENTLKQGDVIQCRECGYRILYKKR 42
          D QPE  V Y+CGDCG EN LK+GDV QCR+CG+RILYKKR
Sbjct: 10 DKQPEQLVIYVCGDCGQENILKRGDVFQCRDCGFRILYKKR 50

BLAST of Lsi02G002760 vs. Swiss-Prot
Match: RPAB4_MOUSE (DNA-directed RNA polymerases I, II, and III subunit RPABC4 OS=Mus musculus GN=Polr2k PE=3 SV=2)

HSP 1 Score: 66.2 bits (160), Expect = 2.8e-09
Identity = 26/42 (61.90%), Positives = 34/42 (80.95%), Query Frame = 1

Query: 3  PQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRR 45
          P+ +P+ YICG+C  EN +K  D I+CRECGYRI+YKKRT+R
Sbjct: 10 PKQQPMIYICGECHTENEIKSRDPIRCRECGYRIMYKKRTKR 51

BLAST of Lsi02G002760 vs. Swiss-Prot
Match: RPAB4_HUMAN (DNA-directed RNA polymerases I, II, and III subunit RPABC4 OS=Homo sapiens GN=POLR2K PE=1 SV=1)

HSP 1 Score: 66.2 bits (160), Expect = 2.8e-09
Identity = 26/42 (61.90%), Positives = 34/42 (80.95%), Query Frame = 1

Query: 3  PQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRR 45
          P+ +P+ YICG+C  EN +K  D I+CRECGYRI+YKKRT+R
Sbjct: 10 PKQQPMIYICGECHTENEIKSRDPIRCRECGYRIMYKKRTKR 51

BLAST of Lsi02G002760 vs. Swiss-Prot
Match: RPAB4_BOVIN (DNA-directed RNA polymerases I, II, and III subunit RPABC4 OS=Bos taurus GN=POLR2K PE=1 SV=1)

HSP 1 Score: 66.2 bits (160), Expect = 2.8e-09
Identity = 26/42 (61.90%), Positives = 34/42 (80.95%), Query Frame = 1

Query: 3  PQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRR 45
          P+ +P+ YICG+C  EN +K  D I+CRECGYRI+YKKRT+R
Sbjct: 10 PKQQPMIYICGECHTENEIKSRDPIRCRECGYRIMYKKRTKR 51

BLAST of Lsi02G002760 vs. TrEMBL
Match: A0A0A0LT77_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G043170 PE=4 SV=1)

HSP 1 Score: 1789.6 bits (4634), Expect = 0.0e+00
Identity = 945/1195 (79.08%), Positives = 1013/1195 (84.77%), Query Frame = 1

Query: 63   MQCALVRSSNFQKVLDKGKESLELRLEENSCSRGIK-DSKVSSFAWRNFFDYRCAVISFL 122
            MQC LV SS+FQKVLDKGKESLELRLE+NSCSRGI  DSKVSSFAWRNFFDYR A+IS L
Sbjct: 1    MQCTLV-SSDFQKVLDKGKESLELRLEKNSCSRGISTDSKVSSFAWRNFFDYRRAIISCL 60

Query: 123  TVESDGLWRIVALPPQYLDSLDVSCLPQMNQSTAERKLVQKGPASNGTYSFNSFRCRSLL 182
            T+ESDGLWRIVALPPQYLDSL++SCLPQMNQ TA RKLVQKGPASNGTYSFNS RCRSLL
Sbjct: 61   TLESDGLWRIVALPPQYLDSLNLSCLPQMNQFTAGRKLVQKGPASNGTYSFNSLRCRSLL 120

Query: 183  ESNNKLFDSKAIKSSNKSSGKFSCRSSCSGSALMSSDSSAISDIPVGGAKMQRYGKKNPR 242
            ESN KL DSKAIKS  +SSGKF C SSCSGSALMSSDS AISDIPV GAKMQRYGKKNPR
Sbjct: 121  ESNKKLLDSKAIKSPKQSSGKFPCTSSCSGSALMSSDSIAISDIPVDGAKMQRYGKKNPR 180

Query: 243  KKAKKKEIECKKISSDFVSAETEVSSKDSAHGSFLSEACGNNDSDCRDGSVLCSIAQGTF 302
            KKAKKKEIECK ISSDFVSAETEVS +DSA  SFLSEACG+NDSD RD SVLCSIAQ TF
Sbjct: 181  KKAKKKEIECKNISSDFVSAETEVSLQDSARASFLSEACGSNDSDFRDRSVLCSIAQETF 240

Query: 303  LPDFRANKNDFKRDSERIIQPLGTTDSISSNIVDGNASEVSSSASKNFSGYYKVCGSKNQ 362
            LPDF         + + +IQPLGT DS+SS IVDG++S+VSS A KNFSGYYKVCGS+NQ
Sbjct: 241  LPDF---------EQDSVIQPLGTVDSVSSEIVDGHSSKVSSLAIKNFSGYYKVCGSENQ 300

Query: 363  ALIKVPGCTHVNGGVNSRERLFAGSYNDFCSKDSLDNNSPDSNCFSSNGNSDNFNLKLDE 422
            ALI VPGC HV+ G+NSRER  AGS NDFCSKD LDN S DS   S NGN D+ NLKL+E
Sbjct: 301  ALINVPGCIHVDVGLNSRERFIAGSCNDFCSKDYLDNISRDSKWVSLNGNCDDLNLKLNE 360

Query: 423  KKCFGVDLLEERSSPSRVNYCSHNSVRDEVDVNAKVEKANRGIRGCTVSETCSVLPGKKT 482
            K+ FGVDLLEERSSPS+      NS RDEVD+NA+VEKAN GIRGCTVSETCSVLPGKKT
Sbjct: 361  KQGFGVDLLEERSSPSQ------NSARDEVDLNAEVEKANLGIRGCTVSETCSVLPGKKT 420

Query: 483  KQNKKLTGSSRMNRYGGLGSSQRRTGKENRLTVWQKVQRNNSGECCEQLDQVSPISKHFK 542
            KQNKKLTGSSRMNRYGGLGSSQRRTGKENR TVWQKVQR++SG C EQLDQVSPISK FK
Sbjct: 421  KQNKKLTGSSRMNRYGGLGSSQRRTGKENRHTVWQKVQRSSSGGCSEQLDQVSPISKQFK 480

Query: 543  GICNPVVGVQMPKVKDKKTGNRKQLKEKFPRRLKRKNTSGQEKIYRPTRNNCGSNTSSMV 602
            GICNPVVGVQMPKVKDKKTGN+KQLKEK PRRLKRKNTSGQEKIYRPTRN+CGSNTSSMV
Sbjct: 481  GICNPVVGVQMPKVKDKKTGNKKQLKEKCPRRLKRKNTSGQEKIYRPTRNSCGSNTSSMV 540

Query: 603  YKPPNGRLDIRSVGFDIRRSSGDPRSRFHNDTTDKCTTSESFESTQVCLDGLVSSKLISD 662
            +KPPN +LD+RS+GFDIRRSSGDPRS F ND+TDKCT SES ES QV LD L+S+KLI+D
Sbjct: 541  HKPPNEKLDVRSMGFDIRRSSGDPRSCFQNDSTDKCTNSESVESKQVHLDELISNKLIND 600

Query: 663  GLNSKKVENDSGSSPRSCNSLNQSNLVEVQSPVYLPHLFFQ--ATKGSSLAECSKHNNQS 722
            GL+S+KVENDS S P+SCNS NQSN VEV+SPVYLPHLFFQ      SSL +     NQS
Sbjct: 601  GLSSQKVENDSSSLPKSCNSSNQSNPVEVKSPVYLPHLFFQKVGNDSSSLPKSCNSLNQS 660

Query: 723  R----------------------------------SPLHNWLPSGAEGSRLATLARPDFS 782
                                               SPL NWLPSGAEGSR  TLARPDFS
Sbjct: 661  NPVEVKSSVYLPHLFFQATKGSSLDERSKHDTQSRSPLQNWLPSGAEGSRSITLARPDFS 720

Query: 783  SLKDASTRPTEFGTSEKSIQERVNCNIVDPVSVVTEGIQHSRDGNHGPLEHECEVPKVYG 842
            SL+DA+T+P EFGT EKSI+ERVNCN+++PVS V EGIQH RD + GPLEHEC V K+YG
Sbjct: 721  SLRDANTQPAEFGTLEKSIKERVNCNVLNPVSDVIEGIQHYRDRDDGPLEHECGVQKMYG 780

Query: 843  YNTAALQDHRCEFDVDEHFNSKSSCEDASRMEQAVNNACRAQLVSEAIQMETGSPIAEFE 902
            Y+T  LQDH+ EFDVDEHFN KSSCED SRMEQAVNNACRAQL SEAIQMETG PIAEFE
Sbjct: 781  YDTTTLQDHKSEFDVDEHFNCKSSCEDVSRMEQAVNNACRAQLASEAIQMETGCPIAEFE 840

Query: 903  RFLQLSSPVINQRPKLRSSEIYPRNPPGDVIPCSNETADISLGCLWQWYEKHGNYGLEIK 962
            RFL LSSPVI+QRP   SS+I PRN PGDVIPCSNET +ISLGCLWQWYEKHG+YGLEIK
Sbjct: 841  RFLHLSSPVIDQRPN-SSSDICPRNLPGDVIPCSNETTNISLGCLWQWYEKHGSYGLEIK 900

Query: 963  ANGHENSNGFGADNSAFCAYFVPFLSAVQLFKSHKTHAPT-TGPVGFDSCVSDIKVKEPS 1022
            A G ENSNGFGA NSAF AYFVPFLSAVQLFKS KTH  T TGP+GF+SCVSDIKVKEPS
Sbjct: 901  AKGQENSNGFGAVNSAFRAYFVPFLSAVQLFKSRKTHVGTATGPLGFNSCVSDIKVKEPS 960

Query: 1023 TCHLPIFSVLFPKPCTDDASVLRVCNQLHGSEQHLGSERSKSSEQSVNLKSSGESELIFE 1082
            TCHLPIFS+LFPKPCTDD SVLRVCNQ H SEQHL SE+ KSSEQS +L+ SGESELIFE
Sbjct: 961  TCHLPIFSLLFPKPCTDDTSVLRVCNQFHSSEQHLASEKKKSSEQSASLQLSGESELIFE 1020

Query: 1083 YFEGEQPQQRRPLFDKIHQLVEGDGRPQGKIYGDPTMLNSITLNDLHAGSWYSVAWYPIY 1142
            YFEGEQPQ RRPLFDKIHQLVEGDG  QGKIYGDPT+LNSITL+DLHAGSWYSVAWYPIY
Sbjct: 1021 YFEGEQPQLRRPLFDKIHQLVEGDGL-QGKIYGDPTVLNSITLDDLHAGSWYSVAWYPIY 1080

Query: 1143 RIPDGNLRAAFLTYHSLGHFVSRTSQPNSPDTNSCLVCPVVGLQSYNAQNECWFEPRNS- 1202
            RIPDGNLRAAFLTYHSLGHFVSRTSQ    DTNSCLVCPVVGLQSYNAQNECWFEPR+S 
Sbjct: 1081 RIPDGNLRAAFLTYHSLGHFVSRTSQ----DTNSCLVCPVVGLQSYNAQNECWFEPRDST 1140

Query: 1203 -TPTLTPGLSPPRILEERLRTLEETASLMARAVVKKGNLNSENTHPDYEFFLSRR 1218
             T T T  L+PPRIL+ERLRTLEETASLMARAVVKKGNLNS NTHPDYEFFLSRR
Sbjct: 1141 RTSTFTSNLNPPRILQERLRTLEETASLMARAVVKKGNLNSGNTHPDYEFFLSRR 1173

BLAST of Lsi02G002760 vs. TrEMBL
Match: V4TSI5_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10018551mg PE=4 SV=1)

HSP 1 Score: 575.9 bits (1483), Expect = 1.2e-160
Identity = 447/1239 (36.08%), Positives = 642/1239 (51.82%), Query Frame = 1

Query: 60   QKTMQCALVRSS--NFQKVLDKGK-ESLELRLEENSCSRGIKDSKVSSFAWRNFFDYRCA 119
            Q+ M CA VRS+  + QK  + GK  SL    E+++    ++DS+++S   RN  D RCA
Sbjct: 3    QQKMHCA-VRSTYTDNQKFFEGGKFYSLNKSFEKDNFRASLEDSEIASLNSRNS-DNRCA 62

Query: 120  VISFLTVESDGLWRIVALPPQYLD--------------SLDVSCLPQMNQSTAERKLVQK 179
            V++  T ES GLWRIVA+PP  LD               L +     +N    +R+  QK
Sbjct: 63   VMTVCTPESVGLWRIVAVPPPCLDHTNQLGSVAQGNMDGLHLVSPSSINSFKVDRRKAQK 122

Query: 180  GPASNGTYSFNSFRCRSL------LESNNKLFDSKAIKS---SNKSSGKFSCRSSCSGSA 239
            G   + TY  N+   R         +S N+   +K  K    S+ SS + S   S S S 
Sbjct: 123  GSVHDVTYPVNASTLRRSPGSDVQQQSRNRTLANKVTKLNEFSSSSSSQSSIPCSTSSSV 182

Query: 240  LMS-SDSSAISDIPVGGAKMQRYGKKNPRKKAKKKEIECKKISSDFVSAETEVSSKDSAH 299
            +   S+S   S+I V   K+    ++N R  A+KK  + +KIS D VS   E+ S D+ H
Sbjct: 183  IQGRSNSFKSSNIFVENPKVDNIVERNSRSNARKKGKQNRKISCDSVSTGPEILSSDNGH 242

Query: 300  GSFLSEACGNNDSDCRDGSVLCSIAQGTFLPDFRANKNDFKRDSERIIQPLGTTDSISSN 359
            G   S    N D D  DG + C+ +      D R + N  + D+  I     +  + +S 
Sbjct: 243  GILTSGPSDNVDIDRGDGLISCATSLEDLFLDGRNDINHVEEDNNGICNSSESQKTCTSY 302

Query: 360  IVDGNASEVS-SSASKNFSGYYKVCGSKNQALIKVPGCTHVNGGVNSRERLFAGSYNDFC 419
            I + N SE   SS++ +F+G + +  SK    ++  G    +GGV  +  L    Y+   
Sbjct: 303  IDEVNLSEAEVSSSAPSFAGEHPLTDSKMMVQMEDQGSV-TDGGVEEQHPLRISCYDAIH 362

Query: 420  SKDSLD-NNSPDSNCFSSNGNSDNFNLKLDEKKCFG-----------VDLLEERSSPSRV 479
            S    D N+    +  S   NSDN        K +G           VD    + S S +
Sbjct: 363  SNGFSDMNDCRVRDSVSIGSNSDNSTSASFYTKPYGRESNKSSFSESVDSRSRKGSFSPL 422

Query: 480  NYCSHNSVRDEVDVNAKVEKANRGIRGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGL 539
            N  S  SV D  D +      N+G+     S+    +PGK  K+ K + GSS   +  G 
Sbjct: 423  NLLS--SVVDFCDYSEGKRYVNQGLNH---SDMQVAVPGKWNKKAKMVPGSSNALKPRGA 482

Query: 540  GSSQRRTGKENRLTVWQKVQRNNSGECCEQLDQVSPISKHFKGIC---------NPVVGV 599
             +S+   GKEN   VWQKVQ+N++ +C  +  + + +   F G           + +  V
Sbjct: 483  RNSRISAGKENSHCVWQKVQKNDANKCNSESRKANAVCSQFLGTVKESSLLKRNSDMTYV 542

Query: 600  QMPKVKDKKTGNRKQLKEKFPRRLKRKNTSGQEKIYRP------TRNNCGSNTSSMVYKP 659
             +P     K+ ++KQL++K PR+LKRK + G +  Y          +   +N  S +   
Sbjct: 543  NIP----SKSEDKKQLRDKAPRKLKRKISPGSKHEYNSYSQRAMYSSKASANARSKIGSQ 602

Query: 660  PNGRLDIRSVGFDIRRSSGDPRS-----RFHNDTTDKCTTSESFESTQVCLD---GLVSS 719
             N   D+ +   +  R S  P S         +       S + ES+    D    L S+
Sbjct: 603  QNEIRDVSAQLNNQTRVSSAPSSCSDVGSPEFELQSSKVESLNSESSHSSQDCPKNLEST 662

Query: 720  KLISDGLNSKKVENDSGSSPRSCNSLNQSNLVEVQSPVYLPHLFF----QATKGSSLAEC 779
            + +S  +++ K   DS  + +SC SL++ N++EV SP+ LPHL F    Q  K  SLAE 
Sbjct: 663  ERVSGAVSALKEHQDSPLA-KSCYSLDKMNMLEVPSPICLPHLIFNEVAQTEKDESLAEH 722

Query: 780  SKHNNQSRSPLHNWLPSGAEGSRLATLARPDFSSLKDASTRPTEFGTSEKSIQERVNCNI 839
             K ++ S SP+  W+P G + S+    A      L  A  + TE+ T  K+  ++   N 
Sbjct: 723  GKQDHISGSPVQKWIPIGTKNSQSTFSASCGSLQLAHADGKGTEYWTLRKNFDKKSASNS 782

Query: 840  VDPVSVVTEGIQHSRDGNHGPLEHECEVPKVYGYNTAALQDHR-----CEFDVDEHFNSK 899
             + +S +  G+     G +   +   E     G N +  + +      C     E  N  
Sbjct: 783  QNLISSLNVGMMSM--GLNSESKSLQEYKDTRGVNASPFKGNNNVAADCLISESEDQNFS 842

Query: 900  SSCEDASRMEQAVNNACRAQLVSEAIQMETGSPIAEFERFLQLSSPVINQRPKLRSSEIY 959
            +     +++ QAV+NAC  Q  SEA+QM +G  IAEFE+FL  SSPVI+ +  L S +  
Sbjct: 843  TFETGINKILQAVDNACWMQAASEAVQMASGGRIAEFEQFLHFSSPVISCKSNLSSCKNC 902

Query: 960  PRNPPGDVIPCSNETADISLGCLWQWYEKHGNYGLEIKANGHENSNGFGADNSAFCAYFV 1019
              +       C +ET ++SL CLWQWYEK G+YGLEI+A  +E +N  G D  +F AYFV
Sbjct: 903  SEDQVVRASLCRHETPNVSLECLWQWYEKQGSYGLEIRAEDYEQTNRLGVDRFSFRAYFV 962

Query: 1020 PFLSAVQLFKSHKTHAPTTG---PVG--FDSCVSDIKVKEPSTC-HLPIFSVLFPKPCTD 1079
            PFLSAVQLFK+ K+H+ + G   P    F +C +  K++  +   HLPIFS+LFP+P T 
Sbjct: 963  PFLSAVQLFKNRKSHSSSNGHGFPTSGVFGTCETGQKLQSSANIGHLPIFSMLFPQPHTS 1022

Query: 1080 DASVLRVCNQLHGSEQHLGSERSKSSEQSVNLKSSGESELIFEYFEGEQPQQRRPLFDKI 1139
             AS L    +L  SE    S++   S  SV  ++S + EL+FEYFE EQP+QRRPL++KI
Sbjct: 1023 GASSLPPVKELGKSEWSSVSDKEGMSVPSV--ENSNDLELLFEYFESEQPRQRRPLYEKI 1082

Query: 1140 HQLVEGDGRPQGKIYGDPTMLNSITLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSL 1199
             +LV G+G     +YGD T+LN+I L DLH  SWYSVAWYPIYRIPDGN RAAFLTYHSL
Sbjct: 1083 QELVTGEGPSNCSVYGDRTILNTINLCDLHPASWYSVAWYPIYRIPDGNFRAAFLTYHSL 1142

Query: 1200 GHFVSRTSQPNSPDTNSCLVCPVVGLQSYNAQNECWFEPRNSTPTL---TPGLSPPRILE 1218
            GH V R++  +S +  +C+V P VGLQSYNAQ ECWF+ ++ST +    +P +S   IL+
Sbjct: 1143 GHMVHRSANVDSANGKACIVSPAVGLQSYNAQGECWFQLKHSTSSRKAESPTVSSSVILK 1202

BLAST of Lsi02G002760 vs. TrEMBL
Match: A0A067DT06_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g042224mg PE=4 SV=1)

HSP 1 Score: 553.9 bits (1426), Expect = 4.8e-154
Identity = 423/1161 (36.43%), Positives = 602/1161 (51.85%), Query Frame = 1

Query: 122  TVESDGLWRIVALPPQYLDSLDVSCLPQMNQSTAERKLVQKGPASNGTYSFNSFRCRSL- 181
            T ES GLWRIVA+PP  LD  +            +R+  QKG   + TY  N+   R   
Sbjct: 5    TPESVGLWRIVAVPPPCLDHTNQLGSVAQGNMDVDRRKAQKGSVHDVTYPVNASTLRRSP 64

Query: 182  -----LESNNKLFDSKAIKSSNKSSGKFSCRS-SCSGSALM---SSDSSAISDIPVGGAK 241
                  +S N+   +K  K +  SS   S  S  CS S+ +    S+S   S+I V   K
Sbjct: 65   GSDVQQQSRNRTLANKVTKLNEFSSSSSSQSSIPCSNSSSVIQGRSNSFKSSNIFVENPK 124

Query: 242  MQRYGKKNPRKKAKKKEIECKKISSDFVSAETEVSSKDSAHGSFLSEACGNNDSDCRDGS 301
            +    ++N R  A+KK  + +KIS D VS   E+ S D+ HG   S    N D D  DG 
Sbjct: 125  VDNIVERNSRSNARKKGKQNRKISCDSVSTGPEILSSDNGHGILTSGPSDNVDIDRGDGL 184

Query: 302  VLCSIAQGTFLPDFRANKNDFKRDSERIIQPLGTTDSISSNIVDGNASEVS-SSASKNFS 361
            + C+ +      D R + N  + D+  I     +  + +S I + N SE   SS++ +F+
Sbjct: 185  ISCATSLEDLFLDGRNDINHVEEDNNGICNSSESQKTCTSYIDEVNLSEAEVSSSAPSFA 244

Query: 362  GYYKVCGSKNQALIKVPGCTHVNGGVNSRERLFAGSYNDFCSKDSLD-NNSPDSNCFSSN 421
            G + +  SK    ++  G    +GGV  +  L    Y+   S    D N+    +  S  
Sbjct: 245  GEHPLTDSKMMVQMEDQGSV-TDGGVEEQHPLRISCYDAIHSNGFSDMNDCRVRDSVSIG 304

Query: 422  GNSDNFNLKLDEKKCFG-----------VDLLEERSSPSRVNYCSHNSVRDEVDVNAKVE 481
             NSDN        K +G           VD    + S S +N  S  SV D  D +    
Sbjct: 305  SNSDNSTSASFYTKPYGRESNKSSFSESVDSRSRKGSFSPLNLLS--SVVDFCDYSEGKR 364

Query: 482  KANRGIRGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGLGSSQRRTGKENRLTVWQKV 541
              N+G+     S+    +P K  K+ K + GSS   +  G  +S+   GKEN   VWQKV
Sbjct: 365  YVNQGLNH---SDMQVAVPRKWNKKAKMVPGSSNALKPRGARNSRISAGKENSHCVWQKV 424

Query: 542  QRNNSGECCEQLDQVSPISKHFKGIC---------NPVVGVQMPKVKDKKTGNRKQLKEK 601
            Q+N++ +C  +  + + +   F G           + +  V +P     K+ ++KQL++K
Sbjct: 425  QKNDANKCNSESRKANAVCSQFLGTVKESSLLKRNSDMTYVNIP----SKSEDKKQLRDK 484

Query: 602  FPRRLKRKNTSGQEKIYRP------TRNNCGSNTSSMVYKPPNGRLDIRSVGFDIRRSSG 661
             PR+LKRK + G +  Y          +   +N  S +    N   D+ +   +  R S 
Sbjct: 485  APRKLKRKISPGSKHEYNSYSQRAMYSSKASANARSKIGSQQNEIRDVSAQLNNQTRVSS 544

Query: 662  DPRS-----RFHNDTTDKCTTSESFESTQVCLD---GLVSSKLISDGLNSKKVENDSGSS 721
             P S         +       S + ES+    D    L S++ +S  +++ K   DS  +
Sbjct: 545  APSSCSDVGSPEFELQSSKVESLNSESSHSSQDCPKNLESTERVSGAVSALKEHQDSPLA 604

Query: 722  PRSCNSLNQSNLVEVQSPVYLPHLFF----QATKGSSLAECSKHNNQSRSPLHNWLPSGA 781
             +SC SL++ N++EV SP+ LPHL F    Q  K  SLAE  K ++ S SP+  W+P G 
Sbjct: 605  -KSCYSLDKMNMLEVPSPICLPHLIFNEVAQTEKDESLAEHGKQDHISGSPVQKWIPIGT 664

Query: 782  EGSRLATLARPDFSSLKDASTRPTEFGTSEKSIQERVNCNIVDPVSVVTEGIQH-SRDGN 841
            +GS+    A      L  A  + TE+ T  K+I ++   N  + +S +  G+     D  
Sbjct: 665  KGSQSTFSASCGSLQLAHADGKGTEYWTLRKNIDKKSASNSQNLISSLNVGMMSMGLDSE 724

Query: 842  HGPLEHECEVPKVYGYNTAALQDHR-----CEFDVDEHFNSKSSCEDASRMEQAVNNACR 901
               L+   E     G N +  + +      C     E  N  +     +++ QAV+NAC 
Sbjct: 725  SKSLQ---EYKDTRGVNASPFKGNNNVAADCLISESEDQNFSTFETGINKILQAVDNACW 784

Query: 902  AQLVSEAIQMETGSPIAEFERFLQLSSPVINQRPKLRSSEIYPRNPPGDVIPCSNETADI 961
             Q  SEA+QM +G  IAEFE+FL  SSPVI+ +  L S +    +       C +ET ++
Sbjct: 785  MQAASEAVQMASGGRIAEFEQFLHFSSPVISCKSNLSSCKNCSEDQVVRASLCRHETPNV 844

Query: 962  SLGCLWQWYEKHGNYGLEIKANGHENSNGFGADNSAFCAYFVPFLSAVQLFKSHKTHAPT 1021
            SL CLWQWYEK G+YGLEI+A  +E +N  G D  +F AYFVPFLSAVQLFK+ K+H+ +
Sbjct: 845  SLECLWQWYEKQGSYGLEIRAVDYEQTNRLGVDRFSFRAYFVPFLSAVQLFKNRKSHSSS 904

Query: 1022 TG---PVG--FDSCVSDIKVKEPSTC-HLPIFSVLFPKPCTDDASVLRVCNQLHGSEQHL 1081
             G   P    F +C +  K++  +   HLPIFS+LFP+P T  AS L    +L  SE   
Sbjct: 905  NGHGFPTSGVFGTCETGQKLQSSANIGHLPIFSMLFPQPHTSGASSLPPVKELGKSEWSS 964

Query: 1082 GSERSKSSEQSVNLKSSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRPQGKIYGDP 1141
             S++   S  SV  ++S + EL+FEYFE EQP+QRRPL++KI +LV G+G     +YGD 
Sbjct: 965  VSDKEGMSVPSV--ENSNDLELLFEYFESEQPRQRRPLYEKIQELVTGEGPSNCSVYGDR 1024

Query: 1142 TMLNSITLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFVSRTSQPNSPDTNSC 1201
            T+LN+I L DLH  SWYSVAWYPIYRIPDGN RAAFLTYHSLGH V R++  +S +  +C
Sbjct: 1025 TILNTINLCDLHPASWYSVAWYPIYRIPDGNFRAAFLTYHSLGHMVHRSANVDSANGKAC 1084

Query: 1202 LVCPVVGLQSYNAQNECWFEPRNSTPTL---TPGLSPPRILEERLRTLEETASLMARAVV 1218
            +V P VGLQSYNAQ ECWF+ ++ST +    +P +S   IL+ERLRTLEETAS+M+RAVV
Sbjct: 1085 IVSPAVGLQSYNAQGECWFQLKHSTSSRKAESPTVSSSVILKERLRTLEETASVMSRAVV 1144

BLAST of Lsi02G002760 vs. TrEMBL
Match: A0A061EXP5_THECC (Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_025230 PE=4 SV=1)

HSP 1 Score: 539.3 bits (1388), Expect = 1.2e-149
Identity = 446/1246 (35.79%), Positives = 627/1246 (50.32%), Query Frame = 1

Query: 59   LQKTMQCALVRS-SNFQKVLDKGKESLELRLEENSCSRGIKDSKVSSFAWRNFFDYRCAV 118
            +Q+ M CAL ++  + QKV + GK +      + + SR  +DS +SSF  RN    RCA+
Sbjct: 2    IQQKMPCALQQTHQDNQKVSEVGKANCSKNSLQLNDSRRSEDSGISSFNLRNI-GQRCAI 61

Query: 119  ISFLTVESDGLWRIVALPPQYLD--------------SLDVSCLPQMNQSTAERKLVQKG 178
            ++  T+ SDG WRIVA+P QYLD              S+ +   P +N    + +  +KG
Sbjct: 62   LTLPTLGSDGQWRIVAIPLQYLDHNNLFRSGTHLNMNSMHLVSSPLINSVKVDGRKTKKG 121

Query: 179  PASNGTYSFNSFRCRSLLESNNK-LFDSKAIKSSNKSSGKFSCRSSCSGSALMSSDSSAI 238
            P    TYS    R RS   SN +  F ++ + +      + +  SSC  S++  +DSS  
Sbjct: 122  PQPEVTYSAKQCRARSFSGSNMQHQFRTRTVANKMTKLDEVANNSSCQ-SSVTCNDSSVF 181

Query: 239  ----------SDIPVGGAKMQRYGKKNPRKKAKKKEIECKKISSDFVSAETEVSSKDSAH 298
                      S + V  ++  +  K+N RKKAKKK    KK   D  S  +EV S +   
Sbjct: 182  KPKGSTATNPSAMFVDCSEEDKSKKRNSRKKAKKKGKHRKKHLCDVSSTASEVCS-EYTR 241

Query: 299  GSFLSEACGNNDSDCRDGSVLCSIAQGTFLPDFRANKNDFKRDSERIIQPLGTTDSISSN 358
            GS  SE CGNND + +   V C+ +    L     N  DF   S  +I    + +   S+
Sbjct: 242  GSSASEICGNNDMN-QGMVVSCATSPSNGL----LNIADFADSSNGVITSFESPNICISD 301

Query: 359  I--VDGNASEVSSSASKNFSGYY---KVCGSKNQALIK--VPGCTHVNGGVNSRERLFAG 418
            I  VD   S V S   K  S Y       G ++Q   +  V         V S + +   
Sbjct: 302  IDQVDITESIVPSQVQKLPSEYLINDSEIGKEDQQFSRSRVGLERRYPSQVGSLDCIHQE 361

Query: 419  SYNDFCSKDSLDNNSPDSNCFSSNGNSDNFNLKLDEKKCFGVDLLEERSSPSRVNYCSHN 478
             ++D      LD+ S  S+  S    S +  +K  +            S+  + ++   N
Sbjct: 362  DFSDLHDSLVLDSVSVGSS--SEESMSASHIVKPFDNSHENSQSEAPGSNTKKGSFYHQN 421

Query: 479  SVRDEVDVNAKVEKANRGI--RGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGLGSSQ 538
            S+    + +   +    G+    C V    S   GK+ KQ K + GSS   + G +G+  
Sbjct: 422  SLCSISETHDYTQGPKHGLDFSSCDVQMIAS---GKRGKQFKSVPGSSSTCKLGSIGNLH 481

Query: 539  RRTGKENRLTVWQKVQRNNSGECCEQLDQVSPISKHFKGICNPVVGVQMPKVKDKK---- 598
               G EN  +VWQ+VQR+   +C  +L + SPI        + V     P +K       
Sbjct: 482  GGMGTENSHSVWQRVQRHGVEKCNTELKKASPICSG-----SDVTAKDAPLLKRSSNAAN 541

Query: 599  ------TGNRKQLKEKFPRRLKRK--------NTSGQEKIYRPTRNNCGSN--TSSMVYK 658
                  T ++++LK+K PR+LKRK         +S   K   P + N  ++  TSSM   
Sbjct: 542  ETTLSGTNDKRKLKDKVPRKLKRKVSPASKQEKSSCSRKGSHPNKVNLNAHAKTSSM--- 601

Query: 659  PPNGRLDIRSVGFDIRRSSGDPRS----RFHNDTTDKCTTSESFESTQVCLDGLVSSKLI 718
              +  LD+ +   D R      RS     F    T K   SES  + QV    +   + +
Sbjct: 602  QKDEMLDVLTALNDQRVIKNVSRSCAQLGFARVETMK---SESLNNLQVSPGSMEPCESV 661

Query: 719  SD---GLNSKKVENDSGSSPRSCNSLNQSNLVEVQSPVYLPHLFF----QATKGSSLAEC 778
             D   GLN++ +EN      +SC  L+Q NL EV++PVYLPHL      +  K  SLAE 
Sbjct: 662  CDAASGLNNQCIENQDSLLKKSCVPLDQPNLHEVRAPVYLPHLMVNGVARTEKEFSLAEY 721

Query: 779  SKHNNQSRSPLHNWLPSGAEGSRLATLARPDFSSLKDASTRPTEFGTSEKSIQERVNCNI 838
             K ++ S S L  W+P G +     T  R    S + ++    E  T +   +E+V    
Sbjct: 722  GKQSHSSGSVLQKWIPVGIKDPGFTTSVRSASLSTEHSNGPEAEDWTFKNKFEEKVAPCA 781

Query: 839  VDPVSVVTEGIQHS--RDGNH--GPLEHECEVPKVYGYNTAALQDHR----CEFDVDE-- 898
             +  S V  G   S  +D  H     E++  +  +   N    ++        F +DE  
Sbjct: 782  QNLSSSVDAGTMCSIGKDSGHAISSPENDNHIKNLRNLNACINENENKHNGANFLIDETK 841

Query: 899  HFNSKSSCEDASRMEQAVNNACRAQLVSEAIQMETGSPIAEFERFLQLSSPVINQRPKLR 958
              N  +   D +++ +A+N+A RAQ+ SEA+QM  G PIAEFER L  SSPVI       
Sbjct: 842  EQNLSALATDLNKISKALNDAYRAQMASEAVQMAIGGPIAEFERLLHFSSPVICHSYSSV 901

Query: 959  SSEIYPRNPPGDVIPCSNETADISLGCLWQWYEKHGNYGLEIKANGHENSNGFGADNSAF 1018
            + +   ++     + C +ET ++ LGCLWQWYEKHG+YGLEI+A  +EN    G D   F
Sbjct: 902  ACQSCLQDQVPSGLLCRHETPNVPLGCLWQWYEKHGSYGLEIRAEDYENPKRLGVDRFEF 961

Query: 1019 CAYFVPFLSAVQLFKSHKTHAPTTGPV--------GFDSCVSDIKVKEPSTCHLPIFSVL 1078
             AYFVPFLSAVQLF++ K+H+              G+D+  +       S  HLPI SVL
Sbjct: 962  RAYFVPFLSAVQLFRNSKSHSTPNNTTIASPGVSEGYDTGSTSRDFTNVS--HLPILSVL 1021

Query: 1079 FPKPCTDDASVLRVCNQLHGSEQHLGSERSKSSEQSVNLKSSGESELIFEYFEGEQPQQR 1138
             P+P T + S     N +  SE  L S ++  S +SV++  S   E +FEYFE EQPQQR
Sbjct: 1022 VPQPRTSEPSSHLPVNDVVRSEPSLVSSKNGLSAKSVDMAWSDCLEPVFEYFESEQPQQR 1081

Query: 1139 RPLFDKIHQLVEGDGRPQGKIYGDPTMLNSITLNDLHAGSWYSVAWYPIYRIPDGNLRAA 1198
            R L++KI +LV  D   + K+YGDP  LNSI ++DLH  SWYSVAWYPIYRIPDGN RAA
Sbjct: 1082 RALYEKIQELVRDDVSSRCKMYGDPVHLNSINIHDLHPRSWYSVAWYPIYRIPDGNFRAA 1141

Query: 1199 FLTYHSLGHFVSRTSQPNSPDTNSCLVCPVVGLQSYNAQNECWFEPRNSTP---TLTPGL 1218
            FLTYHSLGH V R+S+ + P  ++C+V PVVGLQSYNAQ ECWF+PR+ST    +   GL
Sbjct: 1142 FLTYHSLGHLVRRSSKFDYPSLDACIVSPVVGLQSYNAQGECWFQPRHSTVNDFSEIHGL 1201

BLAST of Lsi02G002760 vs. TrEMBL
Match: M5WX69_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa017129mg PE=4 SV=1)

HSP 1 Score: 516.5 bits (1329), Expect = 8.6e-143
Identity = 373/946 (39.43%), Positives = 506/946 (53.49%), Query Frame = 1

Query: 324  GTTDSISSNIVDGNASEVSSSASKNFSGYYKVCGSKNQALIKVPGCTHVNGGVNSRERLF 383
            G  +S + N    ++ EV   +  NF     +  S      +V G   ++  V+    ++
Sbjct: 145  GPKNSETPNTCTSSSDEVGIPSIGNFENQLLLKDSGFPIFDEVDG---IHTQVSCYSDMY 204

Query: 384  AGSYNDFCSKDSLDNNSPDSNCFSS-NGNSDNFNLKLDEKKCFGVDLLEERS-SPSRVNY 443
               Y+D      LD+ S  SN   S N   D    K  EK+ F +D+ +    S  +  +
Sbjct: 205  TRGYSDMHDSFVLDSMSIGSNSGDSINAGHDE---KHAEKEIFKIDISKPPGLSSGKGRF 264

Query: 444  CSHNSVRDEVDVNAKVEKANRGIRGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGLGS 503
                 + D VD     E+A  GI+GC  ++   V+P K++KQNK    ++ ++++G  G+
Sbjct: 265  SCQRFLNDVVDNYDHTEEARHGIQGCRSNDMQLVVPNKRSKQNKVAPRTANVSKFGSNGN 324

Query: 504  SQRRTGKENRLTVWQKVQRNNSGECCEQLDQVSPI-SKHFKGICNPVVGVQMPKVKD--- 563
               R GKEN  +VWQKVQRN+S +C  +L + S + S+    +    +  +   V D   
Sbjct: 325  LHIRIGKENNHSVWQKVQRNDSSDCTGELKKASSVYSRLDLPLREAPLLKRTSNVADVNA 384

Query: 564  -KKTGNRKQLKEKFPRRLKRKNTSGQEKIYR------PTRNNCGSNTSSMVYKPPNGRLD 623
              K+ ++KQ K+K  ++LKRK     ++ Y          +  G +  +      N  LD
Sbjct: 385  FSKSEDKKQQKDKVSKKLKRKTGPPLKQEYNFYSRKGSHASIAGLDGCAKARMDQNDILD 444

Query: 624  IRSVGFD------IRRSSGDP---RSRFHNDTTDKCTTSESFESTQVCLDGLVSSKLISD 683
            I S   D      + RS   P   R  + +   + C TSES  + ++C + +       D
Sbjct: 445  ISSQLKDKKSLSLVSRSCSPPSCPRGGYQSSKVE-CMTSESVHNMKLCQNEM-------D 504

Query: 684  GLNSKKVENDSGSSPRSCNSLNQSNLVEVQSPVYLPHLFFQAT-----KGSSLAECSKHN 743
               S  V N + S  R  +SL++SNL++VQSPVYLPHL   AT     K  SLAE S+ N
Sbjct: 505  HFESVCVGNKNSSVQRKWDSLSESNLLQVQSPVYLPHLLCNATSQEVQKEVSLAESSRQN 564

Query: 744  NQSRSPL-HNWLPSGAEGSRLATLARPDFSSLKDASTRPTEFGTSEKSIQERVNCNIVDP 803
            + S   L H W+P G++   L +  R   SSL+ +    ++    +   +  V  N  + 
Sbjct: 565  SSSSGSLKHKWMPIGSKNPGLTSSTRSGSSSLEHSDEAASKRWALKDPAKGNVVSNTQNL 624

Query: 804  VSVVTEGI--QHSRDGNHGPLEHECEVPKVYGYNTAALQDHRCEFDVDEHFNSKSSCED- 863
            VS V  G   Q+S D        +  + K       A   H    DV    N  +  +D 
Sbjct: 625  VSKVAVGCTGQNSEDVTCSSDAIDGRLSKSSTIEDLANNKH----DVANCINDSAVSKDL 684

Query: 864  ------ASRMEQAVNNACRAQLVSEAIQMETGSPIAEFERFLQLSSPVINQRPKLRSSEI 923
                  ++R+ +AVNNACRAQL SEA+QM TG PIAEFER L  SSPVI+Q P   S   
Sbjct: 685  NVFEAESNRILEAVNNACRAQLASEAVQMATGRPIAEFERLLYYSSPVIHQSPNSISCHT 744

Query: 924  Y-PRNPP---GDVIPCSNETADISLGCLWQWYEKHGNYGLEIKANGHENSNGFGADNSAF 983
               RN     G V  C +ET   +LGCLWQWYEK+G+YGLEI+A    NS   GAD+ AF
Sbjct: 745  CCSRNQVDQVGGVSLCRHETPHTTLGCLWQWYEKYGSYGLEIRAEEFGNSKRLGADHFAF 804

Query: 984  CAYFVPFLSAVQLFKSHKTHAPTT------GPVGFDSC-VSDIKVKEPSTCHLPIFSVLF 1043
             AYFVP+LS +QLF++ ++                 +C +S    K  S   LPIFSVLF
Sbjct: 805  RAYFVPYLSGIQLFRNGRSTDSVDINNRLHSSQELSTCRISKTPKKSSSIGSLPIFSVLF 864

Query: 1044 PKPCTDDASVLR-VCNQLHGSEQHLGSERSKSSEQSVNLKSSGESELIFEYFEGEQPQQR 1103
            P P   + +V   + NQL  +                    S + EL+FEYFE EQPQ+R
Sbjct: 865  PHPDHKEHAVTPPLVNQLSDTT------------------GSSDLELLFEYFESEQPQER 924

Query: 1104 RPLFDKIHQLVEGDGRPQGKIYGDPTMLNSITLNDLHAGSWYSVAWYPIYRIPDGNLRAA 1163
            RPL+DKI +LV GDG    K+YGDPT L+SI LNDLH  SWYSVAWYPIYRIPDGN RAA
Sbjct: 925  RPLYDKIKELVRGDGLSHSKVYGDPTKLDSINLNDLHPRSWYSVAWYPIYRIPDGNFRAA 984

Query: 1164 FLTYHSLGHFVSRTSQPNSPDTNSCLVCPVVGLQSYNAQNECWFEPRNST---PTLTPGL 1218
            FLTYHSLGH V R ++  S + +SC+V PVVGL+SYNAQ+ECWF+ R ST    T+TPGL
Sbjct: 985  FLTYHSLGHLVHRHAKFESRNVDSCIVSPVVGLRSYNAQDECWFQLRPSTLRQTTVTPGL 1044

BLAST of Lsi02G002760 vs. TAIR10
Match: AT5G41010.1 (AT5G41010.1 DNA directed RNA polymerase, 7 kDa subunit)

HSP 1 Score: 92.8 bits (229), Expect = 1.5e-18
Identity = 39/44 (88.64%), Positives = 41/44 (93.18%), Query Frame = 1

Query: 1  MDPQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRR 45
          MDP PEPV+Y+CGDCG ENTLK GDVIQCRECGYRILYKKRTRR
Sbjct: 1  MDPAPEPVTYVCGDCGQENTLKSGDVIQCRECGYRILYKKRTRR 44

BLAST of Lsi02G002760 vs. TAIR10
Match: AT4G16100.1 (AT4G16100.1 Protein of unknown function (DUF789))

HSP 1 Score: 90.1 bits (222), Expect = 1.0e-17
Identity = 94/340 (27.65%), Positives = 130/340 (38.24%), Query Frame = 1

Query: 831  EDASRMEQAVNNACRAQLVSEAIQMETGSPIAEFERFLQLSSPVIN-QRPKLRSSEIYPR 890
            ++  + E+   + C       +    TG+  +   RFL  ++P+++ Q   L SS+ +  
Sbjct: 56   KEIKQPEECSTSDCSVPSRVSSTTTTTGTTSSNLGRFLDCTTPIVSTQHLPLTSSKGWRT 115

Query: 891  NPPGDVIPCSNETADISLGCLWQWYEKHGNYGLEIKANGHENSNGFGADNSAFCAYFVPF 950
              P              L  LW  +E+   YG+ +        NG      +   Y+VP+
Sbjct: 116  REP-------EYRPYFLLNDLWDSFEEWSAYGVGVPLL----LNGI----DSVVQYYVPY 175

Query: 951  LSAVQLFKSHKTHAPTTGPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVLRVC 1010
            LS +QL++       T   VG +S                      P+  + D S    C
Sbjct: 176  LSGIQLYEDPSRACTTRRRVGEESDGDS------------------PRDMSSDGS--NDC 235

Query: 1011 NQLHGSEQHLGSERSK---SSEQSVNLKSSGESELIFEYFEGEQPQQRRPLFDKIHQLVE 1070
             +L  +      E      SS       S+   EL+FEY EG  P  R PL DKI  L  
Sbjct: 236  RELSQNLYRASLEEKPCIGSSSDESEASSNSPGELVFEYLEGAMPFGREPLTDKISNLSS 295

Query: 1071 GDGRPQGKIYGDPTMLNSITLNDLHAGSWYSVAWYPIYRIPDG----NLRAAFLTYHSLG 1130
                           L +    DL   SW SVAWYPIYRIP G    NL A FLT+HSL 
Sbjct: 296  -----------QFPALRTYRSCDLSPSSWVSVAWYPIYRIPLGQSLQNLDACFLTFHSLS 349

Query: 1131 HFVSRTS----QPNSPDTNSC-LVCPVVGLQSYNAQNECW 1158
                 TS    Q +S    S  L  P  GL SY  +   W
Sbjct: 356  TPCRGTSNEEGQSSSKSVASAKLPLPTFGLASYKFKLSEW 349

BLAST of Lsi02G002760 vs. TAIR10
Match: AT2G01260.1 (AT2G01260.1 Protein of unknown function (DUF789))

HSP 1 Score: 83.2 bits (204), Expect = 1.2e-15
Identity = 74/261 (28.35%), Positives = 104/261 (39.85%), Query Frame = 1

Query: 907  LGCLWQWYEKHGNYGLEIKANGHENSNGFGADNSAFCAYFVPFLSAVQLFK-SHKTHAPT 966
            LG +W  + +   YG  +    + N +           Y+VP LSA+Q++  SH   +  
Sbjct: 105  LGDIWDSFAEWSAYGTGVPLVLNNNKD-------RVIQYYVPSLSAIQIYAHSHALDSSL 164

Query: 967  TGPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVLRVCNQLHGSEQHLGSERSK 1026
                  DS  SD +                    +D   V    + +   +QH   +   
Sbjct: 165  KSRRPGDSSDSDFRDSSSDV-----------SSDSDSERVSARVDCISLRDQH---QEDS 224

Query: 1027 SSEQSVNLKSSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRPQGKIYGDPTMLNSI 1086
            SS+    L S G   L+FEY E + P  R P  DK+  L                 L ++
Sbjct: 225  SSDDGEPLGSQGR--LMFEYLERDLPYIREPFADKVLDLA-----------AQFPELMTL 284

Query: 1087 TLNDLHAGSWYSVAWYPIYRIPDG----NLRAAFLTYHSL-----GHFVSRTSQPNSPDT 1146
               DL   SW+SVAWYPIYRIP G    +L A FLTYHSL     G    ++     P  
Sbjct: 285  RSCDLLRSSWFSVAWYPIYRIPTGPTLKDLDACFLTYHSLHTSFGGEGSEQSMSLTQPRE 331

Query: 1147 NSCLVCPVVGLQSYNAQNECW 1158
            +  +  PV GL SY  +   W
Sbjct: 345  SEKMSLPVFGLASYKFRGSLW 331

BLAST of Lsi02G002760 vs. TAIR10
Match: AT5G23380.1 (AT5G23380.1 Protein of unknown function (DUF789))

HSP 1 Score: 79.7 bits (195), Expect = 1.4e-14
Identity = 79/262 (30.15%), Positives = 118/262 (45.04%), Query Frame = 1

Query: 968  PVGFDSCVSDIK-VKEPSTCHLPIFSVLFPKPCTDDASVLRVCNQLHGSEQHLGSERSKS 1027
            P+  ++  SD+K    PS   + IF++   KP +DD+    +   + G+E       S S
Sbjct: 72   PLSLENFDSDVKQYYNPSLSAIQIFTI---KPFSDDSRSSAI--GIDGTETGSAITDSDS 131

Query: 1028 SEQSVNLKSSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRPQGKIYGDPTMLNSIT 1087
            + +   L +     L F+Y E E+P  R PL  K+  L E           + T L+S+T
Sbjct: 132  NGKLQCLDAGDLGYLYFQYNEVERPFDRFPLTFKMADLAE-----------EHTGLSSLT 191

Query: 1088 LNDLHAGSWYSVAWYPIYRIP-----DGNLRAAFLTYHSL----GHFVSRTSQPNSPDTN 1147
             +DL   SW S+AWYPIY IP     DG + AAFLTYH L       + +  + N    +
Sbjct: 192  SSDLSPNSWISIAWYPIYPIPPVIGVDG-ISAAFLTYHLLKPNFPETIGKDDKGNEQGES 251

Query: 1148 SC--LVCPVVGLQSYNAQNECWFEPRNSTPTLTPGLSPPRILEERLRTLEETASLMARAV 1207
            S   ++ P  G  +Y A    W         + PG S  +  E      EE+A    R  
Sbjct: 252  STPEVLLPPFGAMTYKAFGNLW---------MMPGTSDYQNREMN----EESADSWLR-- 295

Query: 1208 VKKGNLNSENTHPDYEFFLSRR 1218
             K+G      +H D+ FF+SR+
Sbjct: 312  -KRG-----FSHSDFNFFMSRK 295

BLAST of Lsi02G002760 vs. TAIR10
Match: AT1G17830.1 (AT1G17830.1 Protein of unknown function (DUF789))

HSP 1 Score: 77.0 bits (188), Expect = 8.8e-14
Identity = 73/276 (26.45%), Positives = 114/276 (41.30%), Query Frame = 1

Query: 907  LGCLWQWYEKHGNYGLEIKANGHENSNGFGADNSAFCAYFVPFLSAVQLFKSHKTHAPTT 966
            L  LW  +++   YGL  K    + +NG      +   Y+VP+LSA+Q++ +  T     
Sbjct: 61   LSDLWDCFDEPSAYGLGSKV---DLNNG-----ESVMQYYVPYLSAIQIYTNKST----- 120

Query: 967  GPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVLRVCNQLHGSEQHLGSERSKS 1026
                     SD+ V   S C             +DD+ + ++   +      +    S  
Sbjct: 121  ---AISRIHSDV-VDCESECW------------SDDSEIEKLSRSMSSGSSKIWDSVSDD 180

Query: 1027 SEQSVNLKSSGESELI----FEYFEGEQPQQRRPLFDKIHQLVEGDGRPQGKIYGDPTML 1086
            S   ++  SS   + +    F+YFE  +P  R PL  K+++L E         Y   + L
Sbjct: 181  SGYEIDGTSSLMRDKLGSIDFQYFESVKPHLRVPLTAKVNELAEK--------YPGLSTL 240

Query: 1087 NSITLNDLHAGSWYSVAWYPIYRIP----DGNLRAAFLTYHSL-----GHFVSRTSQPN- 1146
             S+   DL   SW ++AWYPIY IP    D +L   FL+YH+L     G+ +    + N 
Sbjct: 241  RSV---DLSPASWLAIAWYPIYHIPSRKTDKDLSTCFLSYHTLSSAFQGNLIEGDDEINE 295

Query: 1147 -----------SPDTNSCLVCPVVGLQSYNAQNECW 1158
                        P T S  + P  GL SY  Q + W
Sbjct: 301  TMKEETLCFDEGPVTKSIPLAP-FGLVSYKLQGDLW 295

BLAST of Lsi02G002760 vs. NCBI nr
Match: gi|778657520|ref|XP_004137638.2| (PREDICTED: uncharacterized protein LOC101212209 [Cucumis sativus])

HSP 1 Score: 1789.6 bits (4634), Expect = 0.0e+00
Identity = 945/1195 (79.08%), Positives = 1013/1195 (84.77%), Query Frame = 1

Query: 63   MQCALVRSSNFQKVLDKGKESLELRLEENSCSRGIK-DSKVSSFAWRNFFDYRCAVISFL 122
            MQC LV SS+FQKVLDKGKESLELRLE+NSCSRGI  DSKVSSFAWRNFFDYR A+IS L
Sbjct: 1    MQCTLV-SSDFQKVLDKGKESLELRLEKNSCSRGISTDSKVSSFAWRNFFDYRRAIISCL 60

Query: 123  TVESDGLWRIVALPPQYLDSLDVSCLPQMNQSTAERKLVQKGPASNGTYSFNSFRCRSLL 182
            T+ESDGLWRIVALPPQYLDSL++SCLPQMNQ TA RKLVQKGPASNGTYSFNS RCRSLL
Sbjct: 61   TLESDGLWRIVALPPQYLDSLNLSCLPQMNQFTAGRKLVQKGPASNGTYSFNSLRCRSLL 120

Query: 183  ESNNKLFDSKAIKSSNKSSGKFSCRSSCSGSALMSSDSSAISDIPVGGAKMQRYGKKNPR 242
            ESN KL DSKAIKS  +SSGKF C SSCSGSALMSSDS AISDIPV GAKMQRYGKKNPR
Sbjct: 121  ESNKKLLDSKAIKSPKQSSGKFPCTSSCSGSALMSSDSIAISDIPVDGAKMQRYGKKNPR 180

Query: 243  KKAKKKEIECKKISSDFVSAETEVSSKDSAHGSFLSEACGNNDSDCRDGSVLCSIAQGTF 302
            KKAKKKEIECK ISSDFVSAETEVS +DSA  SFLSEACG+NDSD RD SVLCSIAQ TF
Sbjct: 181  KKAKKKEIECKNISSDFVSAETEVSLQDSARASFLSEACGSNDSDFRDRSVLCSIAQETF 240

Query: 303  LPDFRANKNDFKRDSERIIQPLGTTDSISSNIVDGNASEVSSSASKNFSGYYKVCGSKNQ 362
            LPDF         + + +IQPLGT DS+SS IVDG++S+VSS A KNFSGYYKVCGS+NQ
Sbjct: 241  LPDF---------EQDSVIQPLGTVDSVSSEIVDGHSSKVSSLAIKNFSGYYKVCGSENQ 300

Query: 363  ALIKVPGCTHVNGGVNSRERLFAGSYNDFCSKDSLDNNSPDSNCFSSNGNSDNFNLKLDE 422
            ALI VPGC HV+ G+NSRER  AGS NDFCSKD LDN S DS   S NGN D+ NLKL+E
Sbjct: 301  ALINVPGCIHVDVGLNSRERFIAGSCNDFCSKDYLDNISRDSKWVSLNGNCDDLNLKLNE 360

Query: 423  KKCFGVDLLEERSSPSRVNYCSHNSVRDEVDVNAKVEKANRGIRGCTVSETCSVLPGKKT 482
            K+ FGVDLLEERSSPS+      NS RDEVD+NA+VEKAN GIRGCTVSETCSVLPGKKT
Sbjct: 361  KQGFGVDLLEERSSPSQ------NSARDEVDLNAEVEKANLGIRGCTVSETCSVLPGKKT 420

Query: 483  KQNKKLTGSSRMNRYGGLGSSQRRTGKENRLTVWQKVQRNNSGECCEQLDQVSPISKHFK 542
            KQNKKLTGSSRMNRYGGLGSSQRRTGKENR TVWQKVQR++SG C EQLDQVSPISK FK
Sbjct: 421  KQNKKLTGSSRMNRYGGLGSSQRRTGKENRHTVWQKVQRSSSGGCSEQLDQVSPISKQFK 480

Query: 543  GICNPVVGVQMPKVKDKKTGNRKQLKEKFPRRLKRKNTSGQEKIYRPTRNNCGSNTSSMV 602
            GICNPVVGVQMPKVKDKKTGN+KQLKEK PRRLKRKNTSGQEKIYRPTRN+CGSNTSSMV
Sbjct: 481  GICNPVVGVQMPKVKDKKTGNKKQLKEKCPRRLKRKNTSGQEKIYRPTRNSCGSNTSSMV 540

Query: 603  YKPPNGRLDIRSVGFDIRRSSGDPRSRFHNDTTDKCTTSESFESTQVCLDGLVSSKLISD 662
            +KPPN +LD+RS+GFDIRRSSGDPRS F ND+TDKCT SES ES QV LD L+S+KLI+D
Sbjct: 541  HKPPNEKLDVRSMGFDIRRSSGDPRSCFQNDSTDKCTNSESVESKQVHLDELISNKLIND 600

Query: 663  GLNSKKVENDSGSSPRSCNSLNQSNLVEVQSPVYLPHLFFQ--ATKGSSLAECSKHNNQS 722
            GL+S+KVENDS S P+SCNS NQSN VEV+SPVYLPHLFFQ      SSL +     NQS
Sbjct: 601  GLSSQKVENDSSSLPKSCNSSNQSNPVEVKSPVYLPHLFFQKVGNDSSSLPKSCNSLNQS 660

Query: 723  R----------------------------------SPLHNWLPSGAEGSRLATLARPDFS 782
                                               SPL NWLPSGAEGSR  TLARPDFS
Sbjct: 661  NPVEVKSSVYLPHLFFQATKGSSLDERSKHDTQSRSPLQNWLPSGAEGSRSITLARPDFS 720

Query: 783  SLKDASTRPTEFGTSEKSIQERVNCNIVDPVSVVTEGIQHSRDGNHGPLEHECEVPKVYG 842
            SL+DA+T+P EFGT EKSI+ERVNCN+++PVS V EGIQH RD + GPLEHEC V K+YG
Sbjct: 721  SLRDANTQPAEFGTLEKSIKERVNCNVLNPVSDVIEGIQHYRDRDDGPLEHECGVQKMYG 780

Query: 843  YNTAALQDHRCEFDVDEHFNSKSSCEDASRMEQAVNNACRAQLVSEAIQMETGSPIAEFE 902
            Y+T  LQDH+ EFDVDEHFN KSSCED SRMEQAVNNACRAQL SEAIQMETG PIAEFE
Sbjct: 781  YDTTTLQDHKSEFDVDEHFNCKSSCEDVSRMEQAVNNACRAQLASEAIQMETGCPIAEFE 840

Query: 903  RFLQLSSPVINQRPKLRSSEIYPRNPPGDVIPCSNETADISLGCLWQWYEKHGNYGLEIK 962
            RFL LSSPVI+QRP   SS+I PRN PGDVIPCSNET +ISLGCLWQWYEKHG+YGLEIK
Sbjct: 841  RFLHLSSPVIDQRPN-SSSDICPRNLPGDVIPCSNETTNISLGCLWQWYEKHGSYGLEIK 900

Query: 963  ANGHENSNGFGADNSAFCAYFVPFLSAVQLFKSHKTHAPT-TGPVGFDSCVSDIKVKEPS 1022
            A G ENSNGFGA NSAF AYFVPFLSAVQLFKS KTH  T TGP+GF+SCVSDIKVKEPS
Sbjct: 901  AKGQENSNGFGAVNSAFRAYFVPFLSAVQLFKSRKTHVGTATGPLGFNSCVSDIKVKEPS 960

Query: 1023 TCHLPIFSVLFPKPCTDDASVLRVCNQLHGSEQHLGSERSKSSEQSVNLKSSGESELIFE 1082
            TCHLPIFS+LFPKPCTDD SVLRVCNQ H SEQHL SE+ KSSEQS +L+ SGESELIFE
Sbjct: 961  TCHLPIFSLLFPKPCTDDTSVLRVCNQFHSSEQHLASEKKKSSEQSASLQLSGESELIFE 1020

Query: 1083 YFEGEQPQQRRPLFDKIHQLVEGDGRPQGKIYGDPTMLNSITLNDLHAGSWYSVAWYPIY 1142
            YFEGEQPQ RRPLFDKIHQLVEGDG  QGKIYGDPT+LNSITL+DLHAGSWYSVAWYPIY
Sbjct: 1021 YFEGEQPQLRRPLFDKIHQLVEGDGL-QGKIYGDPTVLNSITLDDLHAGSWYSVAWYPIY 1080

Query: 1143 RIPDGNLRAAFLTYHSLGHFVSRTSQPNSPDTNSCLVCPVVGLQSYNAQNECWFEPRNS- 1202
            RIPDGNLRAAFLTYHSLGHFVSRTSQ    DTNSCLVCPVVGLQSYNAQNECWFEPR+S 
Sbjct: 1081 RIPDGNLRAAFLTYHSLGHFVSRTSQ----DTNSCLVCPVVGLQSYNAQNECWFEPRDST 1140

Query: 1203 -TPTLTPGLSPPRILEERLRTLEETASLMARAVVKKGNLNSENTHPDYEFFLSRR 1218
             T T T  L+PPRIL+ERLRTLEETASLMARAVVKKGNLNS NTHPDYEFFLSRR
Sbjct: 1141 RTSTFTSNLNPPRILQERLRTLEETASLMARAVVKKGNLNSGNTHPDYEFFLSRR 1173

BLAST of Lsi02G002760 vs. NCBI nr
Match: gi|659066969|ref|XP_008436988.1| (PREDICTED: uncharacterized protein LOC103482551 [Cucumis melo])

HSP 1 Score: 971.8 bits (2511), Expect = 1.1e-279
Identity = 502/691 (72.65%), Positives = 543/691 (78.58%), Query Frame = 1

Query: 600  MVYKPPNGRLDIRSVGFDIRRSSGDPRSRFHNDTTDKCTTSESFESTQV----------C 659
            MV+KPPN RLDIRS+GFDIRRSSG+PRSRF NDTTDKC  SE+ E  QV           
Sbjct: 1    MVHKPPNERLDIRSMGFDIRRSSGNPRSRFQNDTTDKCMNSEAVEGKQVHPDELFSNKLI 60

Query: 660  LDGLVSSKLISDG--------------------------LNSKKVENDSGSSPRSCNSLN 719
             DGL S K+ +D                           L  +KVENDS S P+SCNS N
Sbjct: 61   YDGLSSQKVENDSSSLPKSCNSSNQSNPVEVKSPVYLPHLFFQKVENDSSSLPKSCNSSN 120

Query: 720  QSNLVEVQSPV------------------------------------YLPHLFFQATKGS 779
             SN VEV+SPV                                    YLPHLFFQATKGS
Sbjct: 121  LSNPVEVKSPVYLPHLFFQKVENDSSSLPKSCSSSNLSNTVEVKSPVYLPHLFFQATKGS 180

Query: 780  SLAECSKHNNQSRSPLHNWLPSGAEGSRLATLARPDFSSLKDASTRPTEFGTSEKSIQER 839
            SLAE SKH  QSRSPL NWLPSGAEGSR  TLARPDFSSL+DA+T+P EFGTSEKSI+ER
Sbjct: 181  SLAERSKHETQSRSPLQNWLPSGAEGSRSTTLARPDFSSLRDANTQPAEFGTSEKSIKER 240

Query: 840  VNCNIVDPVSVVTEGIQHSRDGNHGPLEHECEVPKVYGYNTAALQDHRCEFDVDEHFNSK 899
            VNC++++PVS V EGIQH RD +HG LEHECEV K+YG++T  LQ+ +CEF+VDEHFN K
Sbjct: 241  VNCSLLNPVSDVLEGIQHYRDRDHGSLEHECEVQKIYGFDTTTLQNQKCEFNVDEHFNCK 300

Query: 900  SSCEDASRMEQAVNNACRAQLVSEAIQMETGSPIAEFERFLQLSSPVINQRPKLRSSEIY 959
            SSCED SRMEQAVNNAC+AQL SEAIQMETG PIAEFERFL LSSPVI+QRPKLRSSEI 
Sbjct: 301  SSCEDVSRMEQAVNNACKAQLASEAIQMETGCPIAEFERFLHLSSPVIDQRPKLRSSEIC 360

Query: 960  PRNPPGDVIPCSNETADISLGCLWQWYEKHGNYGLEIKANGHENSNGFGADNSAFCAYFV 1019
            PRN PGDVIPCSNET +ISL CLWQWYEKHG+YGLEIKA  HENSNGFG  NSAF AYFV
Sbjct: 361  PRNLPGDVIPCSNETTNISLACLWQWYEKHGSYGLEIKAKSHENSNGFGVVNSAFRAYFV 420

Query: 1020 PFLSAVQLFKSHKTH-APTTGPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVL 1079
            PFLSA+QLFKS KTH   TTGP+GFDSCVSDIKVKEPSTCHLPIFS+LFP+P TDD SVL
Sbjct: 421  PFLSAIQLFKSRKTHVGTTTGPLGFDSCVSDIKVKEPSTCHLPIFSLLFPEPSTDDTSVL 480

Query: 1080 RVCNQLHGSEQHLGSERSKSSEQSVNLKSSGESELIFEYFEGEQPQQRRPLFDKIHQLVE 1139
            RVCN+ H SEQ L SE+ KSS+QS +L+ SGESELIFEYFEGEQPQ RRPLFDKIHQLVE
Sbjct: 481  RVCNRFHSSEQDLASEKRKSSKQSASLQLSGESELIFEYFEGEQPQLRRPLFDKIHQLVE 540

Query: 1140 GDGRPQGKIYGDPTMLNSITLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFVS 1199
            GDG  QGKIYGDPTMLNSITL+DLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFVS
Sbjct: 541  GDGCLQGKIYGDPTMLNSITLDDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFVS 600

Query: 1200 RTSQPNSPDTNSCLVCPVVGLQSYNAQNECWFEPRNSTPTLTPGLSPPRILEERLRTLEE 1218
            RTSQ    DTNSCLVCPVVGLQSYNAQNECWFEPR ST T T  L+PPR+L+ERLRTLEE
Sbjct: 601  RTSQ----DTNSCLVCPVVGLQSYNAQNECWFEPRESTSTFTSDLNPPRVLQERLRTLEE 660

BLAST of Lsi02G002760 vs. NCBI nr
Match: gi|659066971|ref|XP_008436999.1| (PREDICTED: uncharacterized protein LOC103482558 [Cucumis melo])

HSP 1 Score: 739.6 bits (1908), Expect = 8.9e-210
Identity = 402/500 (80.40%), Positives = 430/500 (86.00%), Query Frame = 1

Query: 63  MQCALVRSSNFQKVLDKGKESLELRLEENSCSRGI-KDSKVSSFAWRNFFDYRCAVISFL 122
           MQCALVRSS+FQKVLDKGKESL+LRLE+NSCSRGI KD +VSSFAWRNFFDYRCAVI FL
Sbjct: 1   MQCALVRSSDFQKVLDKGKESLDLRLEKNSCSRGISKDFEVSSFAWRNFFDYRCAVIRFL 60

Query: 123 TVESDGLWRIVALPPQYLDSLDVSCLPQMNQSTAERKLVQKGPASNGTYSFNSFRCRSLL 182
           T+ESDGLWRIVALPPQYLDSL+VSCLPQMNQ TA RKLVQKG ASNGTYSFNS RCRSLL
Sbjct: 61  TLESDGLWRIVALPPQYLDSLNVSCLPQMNQFTAGRKLVQKGSASNGTYSFNSLRCRSLL 120

Query: 183 ESNNKLFDSKAIKSSNKSSGKFSCRSSCSGSALMSSDSSAISDIPVGGAKMQRYGKKNPR 242
           ESN KL DSKAIKS NKSSGK  C SSCS SALMSSDS A SDIP+ GAKMQRYGKKNPR
Sbjct: 121 ESNKKLLDSKAIKSPNKSSGKLLCTSSCSASALMSSDSIATSDIPIDGAKMQRYGKKNPR 180

Query: 243 KKAKKKEIECKKISSDFVSAETEVSSKDSAHGSFLSEACGNNDSDCRDGSVLCSIAQGTF 302
           KKAKKKE+E KKISS+FVSAETEVS +DSA  SFLSEACG+NDSD R+ +VLCSIA  TF
Sbjct: 181 KKAKKKELEYKKISSEFVSAETEVSLQDSARASFLSEACGSNDSDFRNRTVLCSIAPETF 240

Query: 303 LPDFRANKNDFKRDSERIIQPLGTTDSISSNIVDGNASEVSSSASKNFSGYYKVCGSKNQ 362
           LP       DF+RDSE  IQPLGT DS+SS IVDG++S+VSSSA KNFSGY+KVCGS+NQ
Sbjct: 241 LP-------DFERDSE--IQPLGTVDSVSSEIVDGHSSKVSSSAIKNFSGYHKVCGSENQ 300

Query: 363 ALIKVPGCTHVNGGVNSRERLFAGSYNDFCSKDSLDNNSPDSNCFSSNGNSDNFNLKLDE 422
           AL   PGC HV+ G+NSRE L AGS NDFCS DSLDNNS DS   S N N D+ NLKL+E
Sbjct: 301 ALTNAPGCFHVDVGLNSRESLLAGSCNDFCSTDSLDNNSCDSKWVSLNSNCDDLNLKLNE 360

Query: 423 KKCFGVDLLEERSSPSRVNYCSHNSVRDEVDVNAKVEKANRGIRGCTVSETCSVLPGKKT 482
           KK FGVDLLEERSSP R N CS NS RDEVD+N +VEK   GI+GCTVSETCSVLPGKKT
Sbjct: 361 KKGFGVDLLEERSSPYREN-CSQNSARDEVDLNTEVEK---GIQGCTVSETCSVLPGKKT 420

Query: 483 KQNKKLTGSSRMNRYGGLGSSQRRTGKENRLTVWQKVQRNNSGECCEQLDQVSPISKHFK 542
           KQNKKLTGSSRMNRYGGLGSSQRRTGKENR TVWQKVQR+NSG C EQLDQVSPISK FK
Sbjct: 421 KQNKKLTGSSRMNRYGGLGSSQRRTGKENRHTVWQKVQRSNSGGCSEQLDQVSPISKQFK 480

Query: 543 GICNPVVGVQMPKVKDKKTG 562
           GICNPV GVQMPKVKDKK G
Sbjct: 481 GICNPVAGVQMPKVKDKKQG 487
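
The percentages in the HSP summary line above are plain ratios over the aligned length (402/500 identical, 430/500 positive). The sketch below shows one way to parse such a line and recompute them. The function name `parse_hsp_stats` and the returned dictionary keys are illustrative assumptions, not part of any BLAST tool or of this database, and the regular expression only targets the line layout shown on this page.

```python
import re

# Matches the HSP summary line as printed on this page. The page spells the
# second field "Postives"; the pattern accepts that and "Positives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse identity/positive counts and recompute their percentages.

    Hypothetical helper for illustration only.
    """
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_len),
        # Recomputed from the raw counts, rounded to two decimals.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, kept for cross-checking.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 402/500 (80.40%), Postives = 430/500 (86.00%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positives_pct"])
```

Comparing the recomputed values against the reported ones is a cheap sanity check when alignment summaries are copied out of a page like this one.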

BLAST of Lsi02G002760 vs. NCBI nr
Match: gi|645270267|ref|XP_008240381.1| (PREDICTED: probable GPI-anchored adhesin-like protein PGA55 isoform X1 [Prunus mume])

HSP 1 Score: 607.1 bits (1564), Expect = 6.9e-170
Identity = 471/1243 (37.89%), Positives = 655/1243 (52.70%), Query Frame = 1

Query: 63   MQCALVRS-SNFQKVLDKGKESLELRLEENSCSRGIKDSKVSSFAWRNFFDYRCAVISFL 122
            M CAL R+ S+ QK  D  + SL  + E+ S    + D +V  F  RNF D RC ++S L
Sbjct: 1    MHCALQRTNSDIQKNSDTRRYSLSKK-EQKSFRTSLDDCEVPYFTGRNF-DRRCPILSVL 60

Query: 123  TVESDGLWRIVALPPQY--------------LDSLDVSCLPQMNQSTAERKLVQKGPASN 182
              E DG WR VALPP                +D+L +   P +N     R+ +QKGP  +
Sbjct: 61   FREPDGHWRTVALPPLCPDNINHLVSGTLVNMDTLHLVYPPPINPFKVNRQKMQKGPPLD 120

Query: 183  GTYSFNSFRCRSLL------ESNNKLFDSKAIKSSNKSSGKFSCRSSCSGSALMS-SDSS 242
             TYS  SF  R         +S NK   +KA K +  S   F    S S S + + S+S 
Sbjct: 121  FTYSVKSFTGRRFTGSAVRHQSRNKTLANKATKWNELSRKSFHNGCSDSSSTIPNGSNSF 180

Query: 243  AISDIPVGGAKMQRYGKKNPRKKAKKKEIECKKISSDFVSAETEVSSKDSAHGSFLSEAC 302
              S + +G  K+    K++ RKK++KK  +  K+S+     E EV S++ A+GS  SE C
Sbjct: 181  NSSTMSIGNKKINSIAKRSSRKKSRKKGKQSTKVSN-----EPEVLSEEYANGSSASEPC 240

Query: 303  GNNDSDCRDGSVLCSIAQGTFLPDFRANKNDFKRDSERIIQPLGTTDSISSNIVDGNASE 362
            G+ND D   G V  S A    LPD                   G  +S + N    ++ E
Sbjct: 241  GHNDGD---GQVSSSTAPEISLPDS------------------GPKNSETPNTCTSSSDE 300

Query: 363  VSSSASKNFSGYYKVCGSKNQALIKVPG------CTHVNGGVNSRERLFAGSYNDFCSKD 422
            V   ++ NF         +NQ L+K  G         ++  V+    ++   Y+D     
Sbjct: 301  VGIPSAGNF---------ENQLLLKDSGFPIFDDVEGIHTQVSCYSDMYTKGYSDMHDTF 360

Query: 423  SLDNNSPDSNCFSSNGNSDNFNLKLDEKKCFGVDLLEERS-SPSRVNYCSHNSVRDEVDV 482
             LD+ S  SN  S +  +   + K  EK+ F +D+ +    S  +  +     + D VD 
Sbjct: 361  VLDSISIGSN--SGDSTNAGHDEKHAEKEIFKIDISKPPGLSSGKGRFSCQRFLNDVVDN 420

Query: 483  NAKVEKANRGIRGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGLGSSQRRTGKENRLT 542
                E+A  GI+GC  ++   V+P K++KQNK    ++ ++++G  G+   R GKEN  +
Sbjct: 421  YDHTEEARHGIQGCRSNDMQLVVPNKRSKQNKVAPRTANVSKFGSNGNLHIRIGKENNHS 480

Query: 543  VWQKVQRNNSGECCEQLDQVSPI-SKHFKGICNPVVGVQMPKVKD----KKTGNRKQLKE 602
            VWQKVQRN+S +C  +L + S + S+    +    +  +   V D     K+ ++KQ K+
Sbjct: 481  VWQKVQRNDSSDCTGELKKASSVYSRLDLPLREAPLLKRTSNVADVNAFSKSEDKKQQKD 540

Query: 603  KFPRRLKRKNTSGQEKIYR------PTRNNCGSNTSSMVYKPPNGRLDIRSVGFD----- 662
            K  ++LKRK     ++ Y          +  G +  +      N  LDI S   D     
Sbjct: 541  KVSKKLKRKTGPSLKQEYNFYSRKGSHASIAGLDGCAKARMGQNDILDISSQLKDKKSLS 600

Query: 663  -IRRSSGDP---RSRFHNDTTDKCTTSESFESTQVCLDGLVSSKLISDGLNSKKVENDSG 722
             + RS   P   R  + +   + C TSES  + ++C +         D L S  V N + 
Sbjct: 601  LVSRSCSPPSCPRGGYQSSKVE-CMTSESGHNMKLCQNE-------KDHLESVCVGNKNS 660

Query: 723  SSPRSCNSLNQSNLVEVQSPVYLPHLFFQAT-----KGSSLAECSKHNNQSRSPL-HNWL 782
               R  +SL++SNL+++QSPVYLPHL   AT     K  SLAE S+ N+ S   L H W+
Sbjct: 661  LVQRKWDSLSESNLLQLQSPVYLPHLLCNATSQEVQKEVSLAESSRQNSSSSGSLTHKWM 720

Query: 783  PSGAEGSRLATLARPDFSSLKDASTRPTEFGTSEKSIQERVNCNIVDPVSVVTEGI--QH 842
            P G++   L +  R   SSL+ +    ++    + + +  V  N  + VS V  G   Q+
Sbjct: 721  PIGSKNPGLPSSTRSGSSSLEHSDEAASKRWALKDTAKGNVVSNAQNLVSKVAVGCTGQN 780

Query: 843  SRDGNHGPLEHE--CEVPKVYGY--NTAALQD-HRCEFDVDEHFNSKSSCED-------A 902
            S D        +  C    + G    ++ ++D    + DV    N  +  +D       +
Sbjct: 781  SEDVTCSQNSEDVTCSSDAIDGRLSKSSTIEDLANNKLDVANRINDSAVSKDLNVFEAES 840

Query: 903  SRMEQAVNNACRAQLVSEAIQMETGSPIAEFERFLQLSSPVINQRPKLRSS-EIYPRNPP 962
            +R+ +AVNNACRAQL SEA+QM TG PIAEFER L  SSPVI+Q P   S      RN  
Sbjct: 841  NRILEAVNNACRAQLASEAVQMATGRPIAEFERLLYYSSPVIHQSPNSISCYTCCSRNQV 900

Query: 963  ---GDVIPCSNETADISLGCLWQWYEKHGNYGLEIKANGHENSNGFGADNSAFCAYFVPF 1022
               G V  C +ET  I+LGCLWQWYEK+G+YGLEI+A    NS   GAD+ AF AYFVP+
Sbjct: 901  DQVGGVSFCRHETPQITLGCLWQWYEKYGSYGLEIRAEEFGNSKRLGADHFAFRAYFVPY 960

Query: 1023 LSAVQLFKS----------HKTHAPTTGPVGFDSC-VSDIKVKEPSTCHLPIFSVLFPKP 1082
            LS +QLF++          ++ H+         +C +S    K  S   LPIFSVLFP P
Sbjct: 961  LSGIQLFRNGRCTDSVDINNRLHSSQE----LSTCRISKTPKKFSSIGSLPIFSVLFPHP 1020

Query: 1083 CTDDASVLR-VCNQLHGSEQHLGSERSKSSEQSVNLKSSGESELIFEYFEGEQPQQRRPL 1142
               + +V   + NQL  SEQ   + +  S+ Q  +   S + EL+FEYFE EQPQ+RRPL
Sbjct: 1021 DHKEHAVTPPLVNQLCVSEQSSAAAKDVSA-QLADTTGSSDLELLFEYFESEQPQERRPL 1080

Query: 1143 FDKIHQLVEGDGRPQGKIYGDPTMLNSITLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLT 1202
            +DKI +LV GDG    K+YGDPT L+SI LNDLH  SWYSVAWYPIYRIPDGN RAAFLT
Sbjct: 1081 YDKIKELVRGDGLSHSKVYGDPTKLDSINLNDLHPRSWYSVAWYPIYRIPDGNFRAAFLT 1140

Query: 1203 YHSLGHFVSRTSQPNSPDTNSCLVCPVVGLQSYNAQNECWFEPRNST---PTLTPGLSPP 1218
            YHSLGHFV R ++  S + +SC+V PVVGL+SYNAQ+ECWF+ R ST    T+TPGL+P 
Sbjct: 1141 YHSLGHFVHRHAKFESRNVDSCIVSPVVGLRSYNAQDECWFQLRPSTLRQTTVTPGLNPC 1191

BLAST of Lsi02G002760 vs. NCBI nr
Match: gi|694447337|ref|XP_009349819.1| (PREDICTED: uncharacterized protein LOC103941352 isoform X1 [Pyrus x bretschneideri])

HSP 1 Score: 590.1 bits (1520), Expect = 8.8e-165
Identity = 476/1239 (38.42%), Positives = 641/1239 (51.74%), Query Frame = 1

Query: 63   MQCALVRSSN---FQKVLDKGKESLELRLEENSCSRGIKDSKVSSFAWRNFFDYRCAVIS 122
            M CAL R+++    QK+ D+ ++ L L  +  S    ++D +V S  WRN  D RC + +
Sbjct: 1    MHCALPRTTSDTDVQKISDRRRDLL-LWKQRKSSRTSLEDCEVPSVTWRNS-DRRCGIFT 60

Query: 123  FLTVESDGLWRIVALP--------------PQYLDSLDVSCLPQMNQSTAERKLVQKGPA 182
            FL+++ D  WRIVALP              P  +DSL +   P +N     R  VQK   
Sbjct: 61   FLSLKPDEQWRIVALPSQCPYNINQPVSDTPVNMDSLHLLYPPPLNPFKVTRHRVQKVLP 120

Query: 183  SNGTYSFNSFRCRSLLESN------NKLFDSKAIKSSNKSSGKF--SCRSSCSGSALMSS 242
             + TYS NSF  R    S+      NK   +KA K +      F  S  SS S SA+ + 
Sbjct: 121  LDATYSVNSFTSRRFTGSSVRHQPRNKTLTNKATKWNGVPRKSFHKSITSSDSASAIPNG 180

Query: 243  DSSAI--SDIPVGGAKMQRYGKKNPRKKAKKKEIECKKISSDFVSAETEVSSKDSAHGSF 302
             S+AI  S++ +G  K+    K++ RKK +KK  + KK S +  S E+EV S++  +GS 
Sbjct: 181  -SNAINSSNMSIGNQKIDNTTKRSSRKKNRKKGKQNKKFSCNISSNESEVLSEEYPNGSS 240

Query: 303  LSEACGNNDSDCRDGSVLCSIAQGTFLPDFRANKNDFKRDSERIIQPLGTTDSISSNIVD 362
             S+ CGNND D    S   S A  T LPD                   G  +S +SN   
Sbjct: 241  ASKTCGNNDGDRPLSS---STAPDTSLPDD------------------GAKNSETSNTCT 300

Query: 363  GNASEVSSSASKNFSGYYKVCGSKNQALIKVPGCTHVNG--GVNS----RERLFAGSYND 422
             ++ E   S+  NF         +NQ L+K  G    NG  G++     R  ++   Y D
Sbjct: 301  SSSDEAGISSVGNF---------ENQVLLKDSGFPIFNGVEGIHPQTSCRNDMYTKGYYD 360

Query: 423  FCSKDSLDNNSPDSNCFSSNGNSDNFNLKLDEKKCFGVDLLEERSSPSRVNYCS-HNSVR 482
                  LD+ S  S  +S +  +   + K  E +   + + E  S  SR  Y S  +S+ 
Sbjct: 361  IHDSFILDSVSFGS--YSDDSTNAGHDEKHAETEIHEIYISEPPSLSSRKGYFSCQSSLN 420

Query: 483  DEVDVNAKVEKANRGIRGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGLGSSQRRTGK 542
            D VD     E    GI+G + S+   +   K++KQNK    +S ++++G  G+   RTGK
Sbjct: 421  DAVDSYNHTEGTRHGIQGRSNSDVQLIALNKRSKQNKVAPRNSNVSKFGSSGNLHARTGK 480

Query: 543  ENRLTVWQKVQRNNSGECCEQLDQVSPISKHF----------KGICNPVVGVQMPKVKDK 602
            E+  +VWQKVQRN+SG+C  +L + S +   +          K  CN       PK    
Sbjct: 481  ESNQSVWQKVQRNDSGDCTGELKKASSVYSRYDLPLRESYFLKRTCNAADVNAFPK---- 540

Query: 603  KTGNRKQLKEKFPRRLKRKNTSGQEKIY----RPTRNNCGSNTSSMVYKPPNGRLDIR-- 662
             +G+RKQ K+K  ++LKRK+    ++ Y    R   +   S     V K    + DI   
Sbjct: 541  -SGDRKQQKDKVSKKLKRKSDPALKQEYNCYSRKGSHASMSGLDGCV-KDRIEQNDISDQ 600

Query: 663  ---SVGFDIRRSSGDPRSRFH---NDTTDKCTTSESFESTQVCLDGLVSSKLISDGLNSK 722
               + G D+   S  P S        +  +C TSES  S Q+C + +   + + + ++  
Sbjct: 601  AKDNKGLDLASRSCSPPSCLSAGFQSSKVECMTSESVPSMQLCPNEMAHLESVGNSVSHM 660

Query: 723  KVENDSGSSPRSCNSLNQSNLVEVQSPVYLPHLFF-----QATKGSSLAECSKHNNQSRS 782
            K ++    S              +QSPVYLPHL       +  K +SLAE  ++ + S S
Sbjct: 661  KYQSVRNESST------------MQSPVYLPHLHCNTASQEVQKETSLAESRQNYSTSGS 720

Query: 783  PLHNWLPSGAEGSRLATLARPDFSSLK---DASTRPTEFGTSEKSIQERVNCNIVDPVSV 842
              H W+P G +   L    R   SSL+   +A++R      + K        N V  V+V
Sbjct: 721  FTHKWMPIGLKNPGLTNSTRSGSSSLEHSDEAASRRWTLKDTAKGYAAFNTQNPVSDVAV 780

Query: 843  VTEG-----IQHSRDGNHGPLEHECEVPKVYGYNTAALQDHRCEFDVDEHFNSKSSCEDA 902
            V  G     +  S +G  G L       ++   N     ++    DV    N+  +  D+
Sbjct: 781  VCPGQSSGDLTCSSNGFEGRLPKPSTTKELIN-NKLNAANYIKNSDVPRDVNAFEA--DS 840

Query: 903  SRMEQAVNNACRAQLVSEAIQMETGSPIAEFERFLQLSSPVINQRPKLRSSE-IYPRN-- 962
            +R+ +AVNNACRAQL SEAIQM TG PIAEFER L  SSP I+Q P   S      RN  
Sbjct: 841  NRILEAVNNACRAQLASEAIQMATGRPIAEFERLLYHSSPAIHQSPNSVSCHTCCSRNQV 900

Query: 963  -PPGDVIPCSNETADISLGCLWQWYEKHGNYGLEIKANGHENSNGFGADNSAFCAYFVPF 1022
               G V  C +ET DISLG LWQWYEK+G+YGLEI+A    +S   GAD  AF AYFVP+
Sbjct: 901  DQVGGVPLCRHETPDISLGSLWQWYEKYGSYGLEIRAEELGDSKRLGADRFAFRAYFVPY 960

Query: 1023 LSAVQLFKS-HKTHAPTTGPV-GFD----SCVSDIKVKEPSTCHLPIFSVLFPKP-CTDD 1082
            LS +QLFK+ +  +A       G D    S  SD      S    P+FS+L P+P   +D
Sbjct: 961  LSGIQLFKNGNADYADANNRFPGSDAPSASLDSDTSKNSSSIGSFPLFSLLLPQPDHKED 1020

Query: 1083 ASVLRVCNQLHGSEQHLGSERSKSSEQSVNLKSSGESELIFEYFEGEQPQQRRPLFDKIH 1142
            A    + NQ   SEQ   S R   S +  +   SG+ EL+FEYFE EQPQ RRPL+DKI 
Sbjct: 1021 AVTPPLVNQQCISEQSSASARD-VSVRLTDTTGSGDLELLFEYFESEQPQVRRPLYDKIK 1080

Query: 1143 QLVEGDGRPQGKIYGDPTMLNSITLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLG 1202
            +LV+GDG    K YGDPT LNS  LNDLH  SWYSVAWYPIYRIPDGNLRAAFLTYHSLG
Sbjct: 1081 ELVQGDGLSHSKAYGDPTNLNSKNLNDLHPRSWYSVAWYPIYRIPDGNLRAAFLTYHSLG 1140

Query: 1203 HFVSRTSQPNSPDTNSCLVCPVVGLQSYNAQNECWFEPRNSTP---TLTP-GLSPPRILE 1218
            H V R+++  S   ++C+V PVVGLQSYNAQ ECWF+ R S P   T+TP GL+P  +LE
Sbjct: 1141 HLVHRSTKFESHKLDTCIVSPVVGLQSYNAQAECWFKLRPSAPRQTTVTPWGLNPCGVLE 1182

The following BLAST results are available for this feature:
Match Name   E-value  Identity  Description
NRPBC_ARATH  2.7e-17  88.64     DNA-directed RNA polymerases II, IV and V subunit 12 OS=Arabidopsis thaliana GN=...
RPBCL_ARATH  3.3e-10  73.17     DNA-directed RNA polymerase subunit 12-like protein OS=Arabidopsis thaliana GN=N...
RPAB4_MOUSE  2.8e-09  61.90     DNA-directed RNA polymerases I, II, and III subunit RPABC4 OS=Mus musculus GN=Po...
RPAB4_HUMAN  2.8e-09  61.90     DNA-directed RNA polymerases I, II, and III subunit RPABC4 OS=Homo sapiens GN=PO...
RPAB4_BOVIN  2.8e-09  61.90     DNA-directed RNA polymerases I, II, and III subunit RPABC4 OS=Bos taurus GN=POLR...

Match Name        E-value   Identity  Description
A0A0A0LT77_CUCSA  0.0e+00   79.08     Uncharacterized protein OS=Cucumis sativus GN=Csa_1G043170 PE=4 SV=1
V4TSI5_9ROSI      1.2e-160  36.08     Uncharacterized protein OS=Citrus clementina GN=CICLE_v10018551mg PE=4 SV=1
A0A067DT06_CITSI  4.8e-154  36.43     Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g042224mg PE=4 SV=1
A0A061EXP5_THECC  1.2e-149  35.79     Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_025230 PE=4 SV=1
M5WX69_PRUPE      8.6e-143  39.43     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa017129mg PE=4 SV=1

Match Name   E-value  Identity  Description
AT5G41010.1  1.5e-18  88.64     DNA directed RNA polymerase, 7 kDa subunit
AT4G16100.1  1.0e-17  27.65     Protein of unknown function (DUF789)
AT2G01260.1  1.2e-15  28.35     Protein of unknown function (DUF789)
AT5G23380.1  1.4e-14  30.15     Protein of unknown function (DUF789)
AT1G17830.1  8.8e-14  26.45     Protein of unknown function (DUF789)

Match Name                        E-value   Identity  Description
gi|778657520|ref|XP_004137638.2|  0.0e+00   79.08     PREDICTED: uncharacterized protein LOC101212209 [Cucumis sativus]
gi|659066969|ref|XP_008436988.1|  1.1e-279  72.65     PREDICTED: uncharacterized protein LOC103482551 [Cucumis melo]
gi|659066971|ref|XP_008436999.1|  8.9e-210  80.40     PREDICTED: uncharacterized protein LOC103482558 [Cucumis melo]
gi|645270267|ref|XP_008240381.1|  6.9e-170  37.89     PREDICTED: probable GPI-anchored adhesin-like protein PGA55 isoform X1 [Prunus m...
gi|694447337|ref|XP_009349819.1|  8.8e-165  38.42     PREDICTED: uncharacterized protein LOC103941352 isoform X1 [Pyrus x bretschneide...

The following terms have been associated with this gene:

Vocabulary: Molecular Function
Term        Definition
GO:0003899  DNA-directed RNA polymerase activity
GO:0003677  DNA binding

Vocabulary: Biological Process
Term        Definition
GO:0006351  transcription, DNA-templated

Vocabulary: INTERPRO
Term       Definition
IPR008507  DUF789
IPR006591  RNAP_P/RPABC4

GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006144 purine nucleobase metabolic process
biological_process GO:0006206 pyrimidine nucleobase metabolic process
biological_process GO:0006351 transcription, DNA-templated
biological_process GO:0008150 biological_process
biological_process GO:0000394 RNA splicing, via endonucleolytic cleavage and ligation
biological_process GO:0006366 transcription from RNA polymerase II promoter
cellular_component GO:0005730 nucleolus
cellular_component GO:0005575 cellular_component
cellular_component GO:0005665 DNA-directed RNA polymerase II, core complex
cellular_component GO:0000418 DNA-directed RNA polymerase IV complex
cellular_component GO:0000419 DNA-directed RNA polymerase V complex
molecular_function GO:0003677 DNA binding
molecular_function GO:0003899 DNA-directed RNA polymerase activity
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Lsi02G002760.1  Lsi02G002760.1  mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR Term   IPR Description                                              Source   Source Term       Source Description   Alignment
IPR006591  RNA polymerase archaeal subunit P/eukaryotic subunit RPABC4  GENE3D   G3DSA:2.20.28.30  -                    coord: 7..45, score: 1.6
IPR006591  RNA polymerase archaeal subunit P/eukaryotic subunit RPABC4  PFAM     PF03604           DNA_RNApol_7kD       coord: 10..41, score: 2.3
IPR006591  RNA polymerase archaeal subunit P/eukaryotic subunit RPABC4  SMART    SM00659           rpolcxc3             coord: 8..49, score: 5.2
IPR008507  Protein of unknown function DUF789                           PFAM     PF05623           DUF789               coord: 862..1213, score: 9.9
None       No IPR available                                             PRODOM   PD012151          -                    coord: 14..44, score: 1.
None       No IPR available                                             PANTHER  PTHR32010         FAMILY NOT NAMED     coord: 1025..1218, score: 4.4E-198; coord: 73..1004, score: 4.4E
None       No IPR available                                             PANTHER  PTHR32010:SF8     SUBFAMILY NOT NAMED  coord: 1025..1218, score: 4.4E-198; coord: 73..1004, score: 4.4E