Lsi02G002760.1 (mRNA) Bottle gourd (USVL1VR-Ls) v1

Overview
NameLsi02G002760.1
TypemRNA
OrganismLagenaria siceraria (Bottle gourd (USVL1VR-Ls) v1)
DescriptionProtein of unknown function (DUF789)
Locationchr02: 2340063 .. 2354580 (-)
Sequence length4172
RNA-Seq ExpressionLsi02G002760.1
SyntenyLsi02G002760.1
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AGAAAATAATAATAATAAAAGAAAACCCTACTACACGTTCCCTCTCCCCTTTTATTTTTGTTTGACGTTTGATTTTTCCCTCACCCATAATTGTAGCAGCCACAACCAAAGCCCTACGCGACTTTGATTTCTCTCTGCTCTCGTATCTCCCTCGCAGCGATAGGTACGCACCAACTATTTCTTTTCTCTTATCTCCCTCCGCCCTCGCGGTCGTCGGTCGTCGGCACACCCTCACAAACACTTTTCGTTTCTCATTATTCTATTGATGATTGCTTCTTACTTGTGTGAGCAATTATTATTTTGCAGGAAAATTTTTGAAGTTTGACTCATGGATCCTCAACCTGAACCAGTCAGCTACATCTGTGGAGGTACACTTGTTGTATTTACTGATTCCATGATTCAAAATCTTCCGATTGCACAGGTTCTTACGTTCTGTTATCGGATGCATCTTTTAATAAGTATCAAGTTACTTTGATTGTATTAAGACTGTGGGAATTTAGAAGTTTCTTTATTTGAGTCTAGATTTAGAATACTTGCACTAAATATTCACCTACATGACTACTTTTGGGGCTTGAGAACTGAAAATTTTGTGTAGATTGTGGAATGGAGAACACTCTGAAGCAGGGTGATGTTATACAGTGCCGAGAGTGTGGTTATCGTATTCTCTACAAGAAGCGCACCCGTCGCAGTAAGATATATTTCTTCTGGTTTCTTTATATATATTTTTTTTCTTTTTCTTTTTTGATCATGAATATGTCGTCTCTAAATATGATCTTTCCGTTAAAGATGTTTGGAATTTAGAATCTCATCTAATAAGTTGTTTCAGACTTTCATATGATTGACTATAAATTTTATATGAAACAAGAAACAGAGGTGGGAGTATTTATTTAATCGTAGAGTACAAAAGAGTCAGTGGGGTGAGCAAGCTACCTAGTTCAAAACGAATGAGCTTTTTGGTGTGAGTAGATATTGAAGATGGGTAACACATACCGGTCAAACTCTGTGACGGGTGGCTTTGGCCTGCTTTGTACCTAGGCATGTTTGAGAATAATTTTAAAATTGTTAAAATCACTTTGTCATCTTTAAAATTACTCCAAAACATGCTTTTGCTCATTCAAAATCAATTTGGCTATATGAAAGTTGTGTTTAGAAGTTTAAAATAAAAAATTAAATTGATTTTGAACATGACAAAAATAATTTTAACTCTTTCAAAATAACTCTCAAACATCCTCTTAGTTGCTCTCATTCTAAAGTCAATTTGATTTTCATTGCATTTGAGGTCTCTCTTGTTTGAGAAATCTGTACCCACTTCACTACTTCTCATGAGCTCAACGATCAGCTATGTTTTCACTTCTTTTTATCTGTTTTGAATTCCTTTTATCACACAATCAACATCCATTCTCAATGCTGTTCTCTTCCTGATCAAACCTTATTCACCTTCTTTGTAGTTCAGTTGAAGATTCAAGACATGGATCATTTAAAATAATCATTTTATTAACTTAGTTTGTTTGAAGTGTATGGATTGCACATATTTGTAACTTGATTAGCAGGATGGCTGTAGTTTGGGTAATTTACTGTCACGACGTATTGTATTTGAGCAGAATCTATTCTACTCAACCTATGGTGATAATTGACTGGAAAAAGTGTTTCTCTTGGATCCTTGTGATCTTCTATGACTTGGGATGCTCTCTAATTACAAGTCTTGAGATTAAATTTAGATTCTCTCTTTTTATGACGGGTGAAACGACCATTGCAGTTATTGGAGTCCATTCTTGTTGTTTAGCTCTCTCAATGTTTTTGTTATGGTTCGATTCACATTACTTTCTGTGTCATTCTTTCCAAATTTAAAGTCTAATCCGATTCTTGCCTATTGCTGCAGTTGTTCAGTACGAGGCCCGCTGAAATGCTTATTCTGGCGATGGACTTCTAGTGTCATGAATATTTTGTAAGTCTCTTTTAGGAAAACAAGAACGGAACTCGATATGGCTTAAGGACTTGCTGTAGTACAAACATGTTGATAATGCTGCCACTATAATATGATTAAATATTTTCCATTTCAATTGCCGTTATCCTCGTGAGATAACTTCATATCTAACTGATAATATGAATGGTTAATCTCTTGATAGACTGGATGGTTCACATTTGCATTTCGATATCAGTATTTGATTCCTATTCAAAATTTTCTTGTATTTTTCCTGCGAGGGTGGTAAATTTGTTTTGTTATTATTATTTTTAAGCATCATTGTTAGTGATCTTGATGGTTCTTTTAAGATGGGGTCCTAAAAATTAAAGGCTATGTTTTGATAATTTTTTTTTTTTTTGAAAATTAATGCTTACAATCATTATTTTACCTCTGGTTGTCTTATTTGATAGTACCTACTTTTTAAACTTGTTCTCAAGAGCAATGCTAAAATTTGAAAATTATAAAATTAGTTCTTGAAATTTTGATTACGTTTTGTAATTTTGTTTGTGTTTTTCAAAGTGAAAAACATACCCAAAAAAAAAAAAAAAAGTGTGGAAACAAGCCTAATTTTCAAAAATAAAAAACAAAATGTTTATCAAATGGGTCTAAGTGTTTTTTTTTTTTTTTTTTTGGTTTCATCTAGCGCTTGACTATATATGAGGTAATTCTTATTCAATTGGTTAAAGTTTGTTTGGATTATTAGTAAAAAAAAGTGTTTTTCAAATATTTTTTAAACACTTTTCATATAGAAAAAAACTATTTATAATTAAAACACTCCTATGTTTGGTTACATTTTCGTGAAAATATTATTATATATAAGTTTGTTTGGAATTAATGGCTTCAAATTTATATTTAAAACGATTGACCACTTTTGGTGGTCACGGTCGATAGTAGTTGGTGATGGTTAGCGACGGTAGTTTGAATGTTTTTCTTTTTAAGGCCAAAATTTACTCCGAGAAGCATAATGAAGTAAATTTCAGTATAGGTTAAAAAGTTAAATGTTAGATGATAGTTCCATGAGTTAGATGGAGTAGGCGAAAATGTCTCACTTCTTGTCTAACCTACAAACTTGAGAGATTTCTCATTTTAAGAAAACTAATTCAAAATTAACAAAGTAAATAACTCTATTTTACCAGTTTAATCTAAAAACTCATTGGGTCAAACGAAGAGGGTAGTTATCCACCTCACCTAATTCCTTCTCAAGTACCTCCTAAAAGAATACGGGTATATTCATGATCGTTTAGTTTGAAATGTATTTTTTTCTTTAAATTTTACGAACTTTTTAATATGGGTGCAACTTAAGATGGGATCAAATTTTTTTCAATGTCAAATAAGTTTAATTTTTTTCTTCAAATATCAAAGACTAGAAATCATTTTGAAACTTCAAACACTTAAATAAACAATAAACCGAAATTTAGAATAAATCTATTTTTCATAAAATGAACAATTAAACGAAAAAAATAGAGAATAATACTAGAGTTTTATACCTAATTGAGTGTCAATTATTTAGGGACGTTGCGATATTCTTAGCAAGAATGTCTATTATATTTCAAAGTCGATGTTTTAATTATCAAAATCATCCTTTCCACATCCTCAACAAAAATGGTAAATAATATACATAGTAAACTAACTCTTGAGCCGAGTTTATTTTGTTATTCATGGAAGCTATTATATATGGTAAATTTCTCCCCGTGACAAAAGAAAGTTAACACTTAAAAACGAAAACCAAATTTTAATATTAGATATCTAAAAGGACTTTTTTTAAATATAGAAAAATGATTTTTTTTATATTTGTAAATAATGTGATATTTTTTTTTTGTTTATAATAATTTTAGTTCTAGAATTTTATTAAATATATACAGATTTTTTAGGTAGGCATATATAGTTTTAAGTTAGAGGTGTTTTCTCAAATTTCGAAAACTGAATTATGAAAACTTATAGTCAGTATCGTCTTAGTTTCTGTAAGTTTTGTTCCATGTAGACATCTAATTTTAGACTTTAATACTCTCGATGAATCTTAAAATCAATTTGTTGTTAAAATTTTCGAAACAAAATCTTTATTAATATTCTATTTATAAATTATGAATATATATTCACACGATATCTTTTCTCACATAAAAATTACTTATTCAAAATATAATGACTAAATTTAAAATTTATTGAGAGTTCCAACACTAAAATTGAGCAATTAAAAGTAGGAAAAAAAAATTACTTTTGGTTCTAATTTCCAACTAGTTCTTACGTTTCAATTATTATACTTTTAGTCTTTATTGAGTTTGATTTCAATTTAGTCCTTAGGTTTCAAAATATTACAATTACATTTTTGAGATTTGAGTTTTGTTACAATTTGGTATTACATTTCAAGATTTCCCTCCATTTTCACTAAATACCTCACTTTTACTATTTAATGTTAATGTATATTAATTAATTTAAAATAATTATAATTAATTAAGTTTCATTATTTTTTCATCACCATCAAAATTAATTTTAAATTTTAATTCATAATTAATTTGAATTAATCAATGAACATCATTTACACCAAATATTAAAAGCGAATATTTAATAAAAAAAATTGAAGTTAAAAAGTAAAAATGTCAAAGCCTAAGAACCAAATTGAAACAAAACTCAAATTGATAAAATTGTAACACTTTGAAACTCGGAGACTAAATCAACATCATACTAAAAAAGGACTAAAAATATAACATTTTAAAATCTATGGACCAAATAAAAACTAAATCTAAAATTTAGGGATAGAAAAAATATTCTTTCCTTAAAGGTATATGTACTAAATTCGAATCGAACTCAAAGTATAGGACCAGTTTTGGGAAATTTGTACGAATAACTAAAAAATTGGGCCGAAAAGGTCAAATGACCCGAATTTTTTAAAAAATGTCAAAGCACGTTTTGCTGATGTACAAGATCAAATGAAATTAACGAAATGAACTCGCGCACCCGAAAAAGCTTATCTCTCTTGCTCTTGCTCTCTCTTTTTTTTTTCTCACCCACAATCCATCTCATCTCGCTCACTCTCTTGTCACTCTCACTCTCTTTTCTTTTTCTCTCTCACAATTTCTCTCATCTCGCACTTCTCACTCTCACTTATTAGAATTGGTTGCGTCGAAGCTGTTCATGTGACTCGTCGTGTCGGAGAAGAAGCAAAATAAGTTGTAGCAAAAGAAAAAGATGGAGAATAAGAAGATGCAGAACAAAAAGAAGAAGAAGAAGAAAAAGGAAAAAAAAAGGGAGTCGTAAAGAAAAAAGAATAAAAAAATTAAGAAAGATAATATTATAATTTCAGGTGATATGAAATATCATTTGACATTTTTTTTCTTTCAAAAATTAGATCATTTACCATTTTCAACTTTTAAAAGGAGTGTTTTATGCAATTATTTCATCAATTTTAACCCAAATTTATTTAGTGAAGTGCACGATAAAATTATTCTTCCCTCTCTAAACTTTCTGTTCCCTTTACTCAATCGAGTGTGACTGCCATAAAATCGAAAAACTCAATCGTCAAATACGAATCAAATTATACTAATTTATCAGTCGCGCTCCTTTTAAAGATAAACAAAAACCTCGAATTTTAAGCAAGCCACCAGCAGATTACGATAGAAAGTAGAAGATAAAGGCTATCGTCCATTGGAAGAACAAGCTTTCCTCCTTGTAATTCCTTCGATACTCTCGCTAATGGAAATGGGTTAGTTTTCTTCTACTGCTTTTAGCTAATCGCTCACTCGCTTACTTTCAATTCGCCAGTTTGTTCTGATTTGTGTGCAGTGAAATGGCTTGACAATCCAATTTTCGCGCTCCTTTTCTAGGAAAGTAACTGGAAATCATTGTATATTTTTGTAGAAGAGTCAATATATAATCTATCTTAATCTATCTACTTGTTTTCTCTATTTTGAGTTGGATATGGCAGGGTAGGAAACATTTCTAATTTTGGCAAATAGTTGATTATTATAGATTTTCTATGTTCCCCTGTAATCTGAATCTTGTGGCTATATTTGGGTGTAGATTTTGTCTAAATTGGTGTAGTTCTTAAATTGAGATTTTTCATTTTTTCTTCGATATTTCTGGGTCTAGTCAAATTTAGTACTAGGCTTCCTCAATGGTTCTAATTGCCTGATTAATTTCGAGGGAGCAGTACCATGCATATTTTCTTCTTTTTCTTCGTCTTCTTTTCATGATATTTCGTTGAGGTTTTGGCTGTTTTTTTTCTGTCGAACGAGTTTCTTTTGACTGGTAAGTTTTGTGTACTACAGCTTTTTCTTGTCTTGATTGGGGAAAGCCATTTCAGGAGAAGGTTGTTTTTTATATCTGGGTTTTCTCCCTTAATGCATGCACCTTCTTCCTATTCTTTGGGTTTCTGACTATTGGATTGTTTTGGTTATGCAATGTAATGTCGTTAAGTTGGGTTGCCTGCAATTTGATGTTTCAATAAACGACATTTCACTGGAGAAAATGGGTTTAATTTAATGGTCTCTTCATCAAAGCTCTTTTTGTGTTTCTTCCTTCAAGAAGTTTGAGATTTCTGATATTTCTACATTATCGCGGTTTGTTTTGCTAGAATCTGCCTGCATTTGTTTGATGGTCACATGAGAGTTGTTTTATTTGAGGTTCTAATTTCATGGTTTCATGATCACCCTCAAGAATTATTGAGGCAAGCCATGAGCAAATTTATGCGATGAGTGGAGCTCCCGACACTGGTTATAAAAAAGTGTATATTATTGTTGTAGTGGAAAGATTGAGACTCATTTCAATTGGATTAGATATCTTGAACTATTATTTGATAAAGATTATTATTGTTTCAAGTTGCTTACTTATACTCAACAAGGAAGTTGTCATTCTTTTTAGCCAATTGTTATTCTTTGTTTTTCTTTGGCACCGTCCCATATTTGATTTTCATTCTCATATTCTAGTTATATGTTGATATTGCATTTATTGATATTGTTCTGATCAAACGAATGCTATCATGTCAAAGCATTGTATTTAAGACAAAATATGAAAACCTTATCAGACATTTTTAATGAAAATATTCTTTTCAGGTTACTCATTCTTTTTCTTCATATCCTTTCGTGTGTATATTGTTCATATGTGAATGTCAAGATAAAAAATTAGTTTGTTATTTTAGACTCTCAGTATATATTCTATCATACCTTTTCTGGAGTTATTTTTGAACCTCACTTCATGTAAACAACCTTACTGGTTGCCCTTGGTCCTCTAGAATTCTTATCTTTTGAACCGTTTTGATGACAGTTACAGAAAACAATGCAGTGTGCTCTTGTAAGAAGTAGTAATTTTCAGAAAGTTCTAGACAAAGGAAAGGAGTCGTTAGAATTGAGACTCGAGGAAAACAGTTGTTCCAGAGGAATTAAGGTCCTAATTAAGAAAATTTTATTCTCTTTGGTTTTGGTTTTGTTTGCATAAAATTTTGTGTCTGAATATTTTATATTTAAATGTTATCCTTCATTCGTTTCAGGATTCTAAAGTTTCTTCTTTTGCATGGAGGAACTTTTTTGATTACAGGTAATGTTTTAGTCATACAATTGTTGAGCTATGACTTTCATGGAGAAATTATTGATTACTATTTCTCAACTCATTTGGTTTACTTTTTCAGATGTGCTGTCATTAGTTTTCTTACAGTCGAATCTGATGGACTCTGGAGAATTGTTGCACTACCACCACAATACCTAGATAGCTTGGATGTGAGCTGTCTGCCTCAAATGAATCAGTCTACAGCTGAGAGAAAATTGGTGCAGAAAGGCCCTGCCTCTAATGGTACATATTCATTTAATTCATTCAGATGTAGAAGCTTGCTGGAGTCAAATAATAAGTTATTCGATAGTAAAGCAATTAAGTCGTCGAATAAATCCTCTGGCAAGTTCTCGTGTAGGAGTTCATGCTCTGGCTCTGCTTTGATGTCGAGTGACTCTAGTGCAATCTCTGACATCCCCGTTGGTGGAGCTAAAATGCAGAGATATGGGAAGAAAAATCCAAGAAAGAAGGCAAAAAAGAAAGAAATAGAATGTAAGAAGATATCTTCTGATTTTGTCTCTGCTGAAACAGAAGTATCATCCAAGGATTCTGCCCATGGAAGTTTTTTGTCTGAAGCTTGTGGCAATAATGATTCAGATTGTAGAGATGGATCTGTTTTGTGTTCGATTGCACAAGGAACTTTTCTGCCAGATTTTAGGGCCAATAAAAATGATTTTAAACGAGATTCTGAGAGGATTATTCAGCCACTTGGAACCACAGATTCAATATCCTCTAATATTGTTGACGGGAATGCATCTGAGGTTTCATCTTCTGCATCAAAGAATTTTAGTGGGTATTATAAAGTTTGTGGATCCAAAAACCAGGCCCTAATCAAAGTACCTGGTTGTACCCATGTCAATGGGGGAGTAAATTCAAGAGAGAGGTTATTTGCTGGCAGCTACAATGATTTTTGCTCCAAGGATTCTTTGGATAATAATTCCCCAGATTCTAACTGTTTTAGTTCAAACGGTAACTCTGATAATTTTAACTTGAAATTAGATGAAAAGAAATGTTTTGGAGTTGATCTGTTGGAAGAAAGAAGTTCACCTTCTAGAGTGAACTATTGTTCTCATAATTCAGTAAGAGATGAAGTAGATGTGAATGCCAAAGTGGAGAAAGCTAATCGTGGTATTCGGGGATGTACTGTTAGTGAAACTTGTTCGGTTTTACCTGGAAAGAAAACTAAACAAAATAAAAAATTGACCGGGAGTTCAAGGATGAATAGATATGGTGGTCTGGGGAGTTCACAAAGGCGTACGGGGAAGGAAAACAGACTTACTGTCTGGCAAAAGGTTCAAAGGAATAATAGTGGTGAATGTTGTGAACAGTTAGACCAAGTAAGTCCTATCAGCAAACATTTTAAAGGCATCTGTAATCCTGTTGTTGGTGTGCAAATGCCAAAGGTCAAGGATAAAAAAACCGGGAACAGAAAACAGTTGAAAGAAAAATTTCCCAGGAGGTTGAAAAGAAAAAATACTTCAGGACAAGAGAAGATCTATCGTCCTACTAGGAACAATTGTGGTAGTAATACTAGTTCAATGGTTTACAAACCACCAAATGGAAGGTTGGATATTCGATCAGTGGGCTTTGACATAAGAAGATCAAGTGGCGATCCAAGATCTCGTTTTCATAATGATACAACTGATAAATGCACGACTTCTGAATCATTTGAAAGTACACAAGTCTGTCTTGATGGATTGGTGTCAAGCAAACTTATCTCCGATGGTTTGAATAGTAAAAAAGTAGAGAATGACTCTGGCTCATCGCCAAGGTCCTGCAACTCCTTAAATCAGTCAAATCTGGTAGAGGTTCAGTCTCCTGTTTACCTTCCTCATCTTTTCTTTCAAGCAACAAAAGGAAGTTCTCTTGCTGAATGCAGCAAGCACAATAACCAATCTAGATCACCCCTTCATAACTGGTTGCCAAGTGGGGCAGAAGGTTCCAGATTGGCCACCTTGGCCAGACCTGATTTTTCATCTCTGAAAGATGCAAGTACGCGACCTACTGAGTTTGGCACTTCAGAAAAATCAATTCAAGAAAGAGTCAATTGCAACATAGTAGATCCTGTTTCTGTTGTAACTGAGGGGATTCAGCATTCTAGAGATGGGAATCATGGTCCTTTAGAACATGAATGTGAGGTGCCGAAGGTGTATGGTTACAATACAGCTGCACTACAGGATCATAGGTGTGAGTTTGATGTGGATGAGCATTTTAATTCCAAATCCTCATGTGAAGATGCATCTAGAATGGAGCAAGCAGTGAATAATGCATGTAGGGCGCAATTGGTATCTGAAGCTATTCAAATGGAAACCGGTAGTCCAATCGCAGAATTCGAAAGATTCCTTCAATTGTCCTCCCCTGTTATCAACCAGAGACCCAAGTTAAGAAGTAGTGAAATTTACCCAAGAAATCCACCAGGTGATGTGATACCATGTAGCAATGAGACCGCCGACATTTCTTTGGGTTGCCTGTGGCAATGGTATGAAAAACATGGCAACTATGGCTTAGAAATAAAAGCCAATGGTCATGAAAATTCAAATGGATTTGGCGCTGATAACTCTGCATTCTGTGCATATTTTGTTCCATTTCTTTCAGCTGTTCAACTATTCAAGAGCCATAAAACTCATGCTCCAACTACAGGTCCTGTGGGATTTGATTCATGTGTAAGCGATATAAAAGTGAAGGAGCCGTCGACTTGTCATCTTCCAATATTTTCAGTCCTTTTTCCTAAGCCCTGTACTGATGATGCAAGTGTTCTGCGGGTTTGTAATCAGTTACATGGTTCAGAGCAACATTTGGGTTCTGAGAGGAGCAAATCTTCAGAACAATCTGTCAACTTAAAATCATCTGGAGAATCAGAACTTATTTTTGAATATTTTGAAGGGGAACAACCTCAGCAGAGAAGGCCGTTATTTGATAAGTAATTGCTCCTACGCTTTTTTGGAAATTACTAGTGTGATTTTGTTGGTTATATATTTTCTAACTCAGATATTTTCTTATCATACCTCAACTTATTATTATTATTGTTGTTGTTGTTGTGTGTGTGTGTGTGGGCAAGTAATAGGATACGAAATATAACAAATTAGTATCATCTCAGCTTTCTTCCCAAACCCTAAGTTTCTCCTAAACATCCTCTAATGTCTTTCCAACAAGATCTTTTAAGGACTACCATTTATTGCACGAGACTAGGAATCTTTGCTTTCCCTCGCAATGTTTGATCTTAGTAGTTTTGGCATTCCAGTAGACGCATCATTCGGTAAATGCTAACTCAAACCCAACTTTTGAACTGGTGATACCCACACTTAGAAATATATTTGGAGAAGATGAATAAATGACAGCCCCCGAGGAATCATTTGGTAGGGGTTGGGACTTTTGATGGGGTTAGAGCTCAGGTTCTAGATTTAAGCCTTGGAGTGGAAACTTTAATGCAGGAGTTGATGTTTCTTGGAGTTGGCAAAGGACCATGGAAAATCTCCTTGGATGAGTGGGACGTGCTCGTTACCATGAATCCCTAAAAGAAGAAGTACGAGAGTGAATGATCATCGTTCTCACTACTCTAATAAGAGAGACACACCAATTAGAAGACATTCTACTCTATCCAGCAGTATTTATGTCTTTCAAGACAAGTGTCAGTCATAAAAAATTTATTTTTGGAAATTTCCTTTTCATGATAGTTGGGTCCGATGATTTGACAAAACAACTACGTTAGAAGTAAGTTGGCGAAGTACCAAGTAGTATATTGTATTTACTATATTCACCCAAATTACCCAATCTTCACAACACAGAATAACCATGAGGAATTAGCCACTTAGCTGACGATTCTCCAAAATTTCTAGCTATGAATTTATTTTCTTTTAAATCTTGTTCTGTCTAGTTTAGTCTAAATTGGAAGTTTCACCCTGCCTTGGCCTCATTCCAACAATGGTTCATTCATCTCTTATTGGGGTTGGCCAGTTAGCAGTCTTCCCAAGAAGAAATGCTTTCCCTATTCCTCACTCTAAATTTTACGTAAGCATCTATATAAGCTTAAAAAAAAACAAAAACAGAACTTTTCATTAATGAAATGAAAAGAGGCTAATGCTCAAAATACAATGAAACAAAAGAACAAAAAGACCCGATCATAGAAACTAGGGATCAGTAGGTGCACCCATTTAAGCTTTTCTGTTAAAAACAAATTCCTTTGGCTTCACTGAATCCTTTGTCATTTGTAAACCATCTACCTACATTAGTTCCATATATAATAACCACAGCAAAGTGCCAAATAGTTGCTGTTGCTGGAGGAAAAATCCCATGCTCATTCCAATTCCATGGCCATTTTTCTCTTTCAGGTTACCTCACTCGAGACTGCTGCTTCTTAGGTTAAAGACAAATTTTCCATTAACTAAAACTTCATAATCAACTAAATGAATTCTCTAACATATCTCTCCATTGGTTAGCCGTATTATGAATTTTAAAGAGAGAAAATAGGAAGGAATGTTGGATAGCACAGATTGATAAAGAGTAGGGTGACCTCTTCTTGAAGGGGAAAAATCGTGTTCTTTCATTTATCCAGTTTCCCCGATACTCATTTGATACTACTGGATTCCAAAACCAACCTTGATTACATGTCCTTCAAGAGGCATACCTAGAGAAGTTGGAGACCAACGTGTAAATGTTGCATTCCATCTCTTGCTGCTTAATGTTCAAGAGGCATTCCTGGAGATTCCAGTTTCTCTAAAACACTGAAAGTATATATAATCCTCCTGTCGACCTCACACAATTTGACTCTTGAAGATAAAAATGTTTGACAATGCCCCTTTCTTCTTCACACTACAAACATTTCATTACAAAATCTTCACAATTTAACAAATAACAACCTATCAGACCCTAGTGAAATTTATGAAGAAAACATTTTCTATTGTCATTATGAACTTCGAGTGGGCACTGGCATCTATTTATATATTTATTCATGATCATGTGATTTGCCTTTAACCCCTTTCTCCTTGATCAATAACTTTATGATAGTATTCTGGTTTTAAGGAATGGTTGCATTTCAAATTCTGCATTCCAGTTACTTAATACAAGGATCAACAAAGTGCTTGATATGTTGGTTATCTTAGAAAGCTGTGATAAGGCTTTACACATCATTCTTAATTGCTAATTGAGAACTATTTTGAAATTGATCTCAGTTTTATTGCTGAATCAGGATACATCAACTGGTTGAGGGAGATGGACGTCCACAGGGAAAAATTTATGGGGATCCGACCATGCTCAATTCCATAACTTTGAATGATCTGCATGCTGGATCATGGTTGGTTGTGACAGACATTTGCAGTGTTTGCATTAGTTCCATAGTCTACTAAAACTAAATCATATATGACCAAATTCTTGATGTATCTTTACATTGTACAAAATAATACGAATGTCATTGACACATCACGTCAGAGAACAAAAAAGTGACTTGTTCGTTTCTGGCATTTCAGGTACTCAGTGGCATGGTATCCCATTTATAGGATACCAGATGGCAACCTTCGAGCTGCATTTTTGACTTACCACTCACTAGGACATTTTGTTTCAAGAACTTCCCAACCTAACTCTCCAGATACAAATTCTTGTTTAGTTTGTCCAGTCGTGGGTCTTCAAAGTTATAATGCACAGGTAAAGTTTTGCTGATTACTACTACTAATTCCAACCTGCTTTAACGTTTTGAAGTGGTTGTGTATATTGTTTAGATATTGCTAAAGGAAACAAAGAAACAACACGTTCTGAAGATTGTATTTGCCTTCTCTCTCTCTCTGTATAAAATATGTGTTAATTGATTAGTACTTTATTCTTGTTTATCTCAGAGGCTAATTACGTATGGATGGAAATGTGTAGGCTTTTGCCATTCAACTTTATTAATTGCGATTAGGGGCTACCTATTCTCTGTGATACTTTCCTATGAGCAATACTTGGGTGCAGCGCAGGTTACACCCATCTTCATGCATCCCCTTCTAATCACAAAGAGAAATAATATATATTAAAAAAAAAATTTAAAAAAGAAAATTTTGCCATTTGGCAGCTTGTGATTGAACACTTAGGTGAGATGCACAAAGATGGGTGCAGCTCGAGATGCACTTAAGTATTTGTCCTTCCTTATAATCCCTGAAAGGAAAAGTTGATAATGAAACACTGATCCTCCGACATTTTATTAGACCACCTCAGGTTTCCTGACATTACTAATGGAAAGTGGTTGATTATAAGATGTCTAGACAAACCATTGATTTTCTCCCTCCAAATCTTTAAGTGAAAGAGGAAACGTTTGAGTTGAAAATGCCTAGCTTCTTCTAAGGGGGATAAGAAAACCACCTTAGATTCAGCATAGTTGTCATCCATCAAGTTTTGAACGGTCGACCTTAATTGATAAATTTCTATTATTGCTTCTGCAGATCTTTAACAAAGTTTCTAAAATTGTTGCATTGCATGAGGATTGTTGTGCATCTTTCATAAACACTCTCTTTGTTCCATGAGCAGAATGAATGCTGGTTTGAGCCTAGAAACAGTACGCCCACGTTAACCCCTGGCTTGAGTCCTCCTAGAATCCTCGAGGAGCGCCTGAGGACGCTGGAAGAGACTGCATCTCTCATGGCCAGAGCTGTGGTTAAGAAAGGAAATCTGAACTCTGAAAACACGCATCCAGATTACGAGTTCTTCCTCTCACGGCGACTCTAGTTACATATCCAGATGTTTCATGCTTGGAACTTATAAAGGAGATTCCTTCCTTTTGTCAATATTCTTGAGGATTTAGTTTAGGTTAGGACCTAATCTCAAGTAGGATGTAAGAGAGCAGCTGATCTTAGTCTGTAATGTATTTACACAATGTTTTTTGTTCCTTTTTCTGTCTTGCTACGCTAACTCTTCAGGCAGTGCAACTTTCATCTGTATATATTGTTTTTTACACAAATTACCTCATGTAAAAATGATATCTCCTGGATTTATAGAGATTTTGAGCATTTTCTTTTTCTTATTTCGTTTTAAAATTAGTTCGGTTGATACACACCA

mRNA sequence

AGAAAATAATAATAATAAAAGAAAACCCTACTACACGTTCCCTCTCCCCTTTTATTTTTGTTTGACGTTTGATTTTTCCCTCACCCATAATTGTAGCAGCCACAACCAAAGCCCTACGCGACTTTGATTTCTCTCTGCTCTCGTATCTCCCTCGCAGCGATAGGAAAATTTTTGAAGTTTGACTCATGGATCCTCAACCTGAACCAGTCAGCTACATCTGTGGAGATTGTGGAATGGAGAACACTCTGAAGCAGGGTGATGTTATACAGTGCCGAGAGTGTGGTTATCGTATTCTCTACAAGAAGCGCACCCGTCGCACTTTTTCTTGTCTTGATTGGGGAAAGCCATTTCAGGAGAAGTTACAGAAAACAATGCAGTGTGCTCTTGTAAGAAGTAGTAATTTTCAGAAAGTTCTAGACAAAGGAAAGGAGTCGTTAGAATTGAGACTCGAGGAAAACAGTTGTTCCAGAGGAATTAAGGATTCTAAAGTTTCTTCTTTTGCATGGAGGAACTTTTTTGATTACAGATGTGCTGTCATTAGTTTTCTTACAGTCGAATCTGATGGACTCTGGAGAATTGTTGCACTACCACCACAATACCTAGATAGCTTGGATGTGAGCTGTCTGCCTCAAATGAATCAGTCTACAGCTGAGAGAAAATTGGTGCAGAAAGGCCCTGCCTCTAATGGTACATATTCATTTAATTCATTCAGATGTAGAAGCTTGCTGGAGTCAAATAATAAGTTATTCGATAGTAAAGCAATTAAGTCGTCGAATAAATCCTCTGGCAAGTTCTCGTGTAGGAGTTCATGCTCTGGCTCTGCTTTGATGTCGAGTGACTCTAGTGCAATCTCTGACATCCCCGTTGGTGGAGCTAAAATGCAGAGATATGGGAAGAAAAATCCAAGAAAGAAGGCAAAAAAGAAAGAAATAGAATGTAAGAAGATATCTTCTGATTTTGTCTCTGCTGAAACAGAAGTATCATCCAAGGATTCTGCCCATGGAAGTTTTTTGTCTGAAGCTTGTGGCAATAATGATTCAGATTGTAGAGATGGATCTGTTTTGTGTTCGATTGCACAAGGAACTTTTCTGCCAGATTTTAGGGCCAATAAAAATGATTTTAAACGAGATTCTGAGAGGATTATTCAGCCACTTGGAACCACAGATTCAATATCCTCTAATATTGTTGACGGGAATGCATCTGAGGTTTCATCTTCTGCATCAAAGAATTTTAGTGGGTATTATAAAGTTTGTGGATCCAAAAACCAGGCCCTAATCAAAGTACCTGGTTGTACCCATGTCAATGGGGGAGTAAATTCAAGAGAGAGGTTATTTGCTGGCAGCTACAATGATTTTTGCTCCAAGGATTCTTTGGATAATAATTCCCCAGATTCTAACTGTTTTAGTTCAAACGGTAACTCTGATAATTTTAACTTGAAATTAGATGAAAAGAAATGTTTTGGAGTTGATCTGTTGGAAGAAAGAAGTTCACCTTCTAGAGTGAACTATTGTTCTCATAATTCAGTAAGAGATGAAGTAGATGTGAATGCCAAAGTGGAGAAAGCTAATCGTGGTATTCGGGGATGTACTGTTAGTGAAACTTGTTCGGTTTTACCTGGAAAGAAAACTAAACAAAATAAAAAATTGACCGGGAGTTCAAGGATGAATAGATATGGTGGTCTGGGGAGTTCACAAAGGCGTACGGGGAAGGAAAACAGACTTACTGTCTGGCAAAAGGTTCAAAGGAATAATAGTGGTGAATGTTGTGAACAGTTAGACCAAGTAAGTCCTATCAGCAAACATTTTAAAGGCATCTGTAATCCTGTTGTTGGTGTGCAAATGCCAAAGGTCAAGGATAAAAAAACCGGGAACAGAAAACAGTTGAAAGAAAAATTTCCCAGGAGGTTGAAAAGAAAAAATACTTCAGGACAAGAGAAGATCTATCGTCCTACTAGGAACAATTGTGGTAGTAATACTAGTTCAATGGTTTACAAACCACCAAATGGAAGGTTGGATATTCGATCAGTGGGCTTTGACATAAGAAGATCAAGTGGCGATCCAAGATCTCGTTTTCATAATGATACAACTGATAAATGCACGACTTCTGAATCATTTGAAAGTACACAAGTCTGTCTTGATGGATTGGTGTCAAGCAAACTTATCTCCGATGGTTTGAATAGTAAAAAAGTAGAGAATGACTCTGGCTCATCGCCAAGGTCCTGCAACTCCTTAAATCAGTCAAATCTGGTAGAGGTTCAGTCTCCTGTTTACCTTCCTCATCTTTTCTTTCAAGCAACAAAAGGAAGTTCTCTTGCTGAATGCAGCAAGCACAATAACCAATCTAGATCACCCCTTCATAACTGGTTGCCAAGTGGGGCAGAAGGTTCCAGATTGGCCACCTTGGCCAGACCTGATTTTTCATCTCTGAAAGATGCAAGTACGCGACCTACTGAGTTTGGCACTTCAGAAAAATCAATTCAAGAAAGAGTCAATTGCAACATAGTAGATCCTGTTTCTGTTGTAACTGAGGGGATTCAGCATTCTAGAGATGGGAATCATGGTCCTTTAGAACATGAATGTGAGGTGCCGAAGGTGTATGGTTACAATACAGCTGCACTACAGGATCATAGGTGTGAGTTTGATGTGGATGAGCATTTTAATTCCAAATCCTCATGTGAAGATGCATCTAGAATGGAGCAAGCAGTGAATAATGCATGTAGGGCGCAATTGGTATCTGAAGCTATTCAAATGGAAACCGGTAGTCCAATCGCAGAATTCGAAAGATTCCTTCAATTGTCCTCCCCTGTTATCAACCAGAGACCCAAGTTAAGAAGTAGTGAAATTTACCCAAGAAATCCACCAGGTGATGTGATACCATGTAGCAATGAGACCGCCGACATTTCTTTGGGTTGCCTGTGGCAATGGTATGAAAAACATGGCAACTATGGCTTAGAAATAAAAGCCAATGGTCATGAAAATTCAAATGGATTTGGCGCTGATAACTCTGCATTCTGTGCATATTTTGTTCCATTTCTTTCAGCTGTTCAACTATTCAAGAGCCATAAAACTCATGCTCCAACTACAGGTCCTGTGGGATTTGATTCATGTGTAAGCGATATAAAAGTGAAGGAGCCGTCGACTTGTCATCTTCCAATATTTTCAGTCCTTTTTCCTAAGCCCTGTACTGATGATGCAAGTGTTCTGCGGGTTTGTAATCAGTTACATGGTTCAGAGCAACATTTGGGTTCTGAGAGGAGCAAATCTTCAGAACAATCTGTCAACTTAAAATCATCTGGAGAATCAGAACTTATTTTTGAATATTTTGAAGGGGAACAACCTCAGCAGAGAAGGCCGTTATTTGATAAGATACATCAACTGGTTGAGGGAGATGGACGTCCACAGGGAAAAATTTATGGGGATCCGACCATGCTCAATTCCATAACTTTGAATGATCTGCATGCTGGATCATGGTACTCAGTGGCATGGTATCCCATTTATAGGATACCAGATGGCAACCTTCGAGCTGCATTTTTGACTTACCACTCACTAGGACATTTTGTTTCAAGAACTTCCCAACCTAACTCTCCAGATACAAATTCTTGTTTAGTTTGTCCAGTCGTGGGTCTTCAAAGTTATAATGCACAGAATGAATGCTGGTTTGAGCCTAGAAACAGTACGCCCACGTTAACCCCTGGCTTGAGTCCTCCTAGAATCCTCGAGGAGCGCCTGAGGACGCTGGAAGAGACTGCATCTCTCATGGCCAGAGCTGTGGTTAAGAAAGGAAATCTGAACTCTGAAAACACGCATCCAGATTACGAGTTCTTCCTCTCACGGCGACTCTAGTTACATATCCAGATGTTTCATGCTTGGAACTTATAAAGGAGATTCCTTCCTTTTGTCAATATTCTTGAGGATTTAGTTTAGGTTAGGACCTAATCTCAAGTAGGATGTAAGAGAGCAGCTGATCTTAGTCTGTAATGTATTTACACAATGTTTTTTGTTCCTTTTTCTGTCTTGCTACGCTAACTCTTCAGGCAGTGCAACTTTCATCTGTATATATTGTTTTTTACACAAATTACCTCATGTAAAAATGATATCTCCTGGATTTATAGAGATTTTGAGCATTTTCTTTTTCTTATTTCGTTTTAAAATTAGTTCGGTTGATACACACCA

Coding sequence (CDS)

ATGGATCCTCAACCTGAACCAGTCAGCTACATCTGTGGAGATTGTGGAATGGAGAACACTCTGAAGCAGGGTGATGTTATACAGTGCCGAGAGTGTGGTTATCGTATTCTCTACAAGAAGCGCACCCGTCGCACTTTTTCTTGTCTTGATTGGGGAAAGCCATTTCAGGAGAAGTTACAGAAAACAATGCAGTGTGCTCTTGTAAGAAGTAGTAATTTTCAGAAAGTTCTAGACAAAGGAAAGGAGTCGTTAGAATTGAGACTCGAGGAAAACAGTTGTTCCAGAGGAATTAAGGATTCTAAAGTTTCTTCTTTTGCATGGAGGAACTTTTTTGATTACAGATGTGCTGTCATTAGTTTTCTTACAGTCGAATCTGATGGACTCTGGAGAATTGTTGCACTACCACCACAATACCTAGATAGCTTGGATGTGAGCTGTCTGCCTCAAATGAATCAGTCTACAGCTGAGAGAAAATTGGTGCAGAAAGGCCCTGCCTCTAATGGTACATATTCATTTAATTCATTCAGATGTAGAAGCTTGCTGGAGTCAAATAATAAGTTATTCGATAGTAAAGCAATTAAGTCGTCGAATAAATCCTCTGGCAAGTTCTCGTGTAGGAGTTCATGCTCTGGCTCTGCTTTGATGTCGAGTGACTCTAGTGCAATCTCTGACATCCCCGTTGGTGGAGCTAAAATGCAGAGATATGGGAAGAAAAATCCAAGAAAGAAGGCAAAAAAGAAAGAAATAGAATGTAAGAAGATATCTTCTGATTTTGTCTCTGCTGAAACAGAAGTATCATCCAAGGATTCTGCCCATGGAAGTTTTTTGTCTGAAGCTTGTGGCAATAATGATTCAGATTGTAGAGATGGATCTGTTTTGTGTTCGATTGCACAAGGAACTTTTCTGCCAGATTTTAGGGCCAATAAAAATGATTTTAAACGAGATTCTGAGAGGATTATTCAGCCACTTGGAACCACAGATTCAATATCCTCTAATATTGTTGACGGGAATGCATCTGAGGTTTCATCTTCTGCATCAAAGAATTTTAGTGGGTATTATAAAGTTTGTGGATCCAAAAACCAGGCCCTAATCAAAGTACCTGGTTGTACCCATGTCAATGGGGGAGTAAATTCAAGAGAGAGGTTATTTGCTGGCAGCTACAATGATTTTTGCTCCAAGGATTCTTTGGATAATAATTCCCCAGATTCTAACTGTTTTAGTTCAAACGGTAACTCTGATAATTTTAACTTGAAATTAGATGAAAAGAAATGTTTTGGAGTTGATCTGTTGGAAGAAAGAAGTTCACCTTCTAGAGTGAACTATTGTTCTCATAATTCAGTAAGAGATGAAGTAGATGTGAATGCCAAAGTGGAGAAAGCTAATCGTGGTATTCGGGGATGTACTGTTAGTGAAACTTGTTCGGTTTTACCTGGAAAGAAAACTAAACAAAATAAAAAATTGACCGGGAGTTCAAGGATGAATAGATATGGTGGTCTGGGGAGTTCACAAAGGCGTACGGGGAAGGAAAACAGACTTACTGTCTGGCAAAAGGTTCAAAGGAATAATAGTGGTGAATGTTGTGAACAGTTAGACCAAGTAAGTCCTATCAGCAAACATTTTAAAGGCATCTGTAATCCTGTTGTTGGTGTGCAAATGCCAAAGGTCAAGGATAAAAAAACCGGGAACAGAAAACAGTTGAAAGAAAAATTTCCCAGGAGGTTGAAAAGAAAAAATACTTCAGGACAAGAGAAGATCTATCGTCCTACTAGGAACAATTGTGGTAGTAATACTAGTTCAATGGTTTACAAACCACCAAATGGAAGGTTGGATATTCGATCAGTGGGCTTTGACATAAGAAGATCAAGTGGCGATCCAAGATCTCGTTTTCATAATGATACAACTGATAAATGCACGACTTCTGAATCATTTGAAAGTACACAAGTCTGTCTTGATGGATTGGTGTCAAGCAAACTTATCTCCGATGGTTTGAATAGTAAAAAAGTAGAGAATGACTCTGGCTCATCGCCAAGGTCCTGCAACTCCTTAAATCAGTCAAATCTGGTAGAGGTTCAGTCTCCTGTTTACCTTCCTCATCTTTTCTTTCAAGCAACAAAAGGAAGTTCTCTTGCTGAATGCAGCAAGCACAATAACCAATCTAGATCACCCCTTCATAACTGGTTGCCAAGTGGGGCAGAAGGTTCCAGATTGGCCACCTTGGCCAGACCTGATTTTTCATCTCTGAAAGATGCAAGTACGCGACCTACTGAGTTTGGCACTTCAGAAAAATCAATTCAAGAAAGAGTCAATTGCAACATAGTAGATCCTGTTTCTGTTGTAACTGAGGGGATTCAGCATTCTAGAGATGGGAATCATGGTCCTTTAGAACATGAATGTGAGGTGCCGAAGGTGTATGGTTACAATACAGCTGCACTACAGGATCATAGGTGTGAGTTTGATGTGGATGAGCATTTTAATTCCAAATCCTCATGTGAAGATGCATCTAGAATGGAGCAAGCAGTGAATAATGCATGTAGGGCGCAATTGGTATCTGAAGCTATTCAAATGGAAACCGGTAGTCCAATCGCAGAATTCGAAAGATTCCTTCAATTGTCCTCCCCTGTTATCAACCAGAGACCCAAGTTAAGAAGTAGTGAAATTTACCCAAGAAATCCACCAGGTGATGTGATACCATGTAGCAATGAGACCGCCGACATTTCTTTGGGTTGCCTGTGGCAATGGTATGAAAAACATGGCAACTATGGCTTAGAAATAAAAGCCAATGGTCATGAAAATTCAAATGGATTTGGCGCTGATAACTCTGCATTCTGTGCATATTTTGTTCCATTTCTTTCAGCTGTTCAACTATTCAAGAGCCATAAAACTCATGCTCCAACTACAGGTCCTGTGGGATTTGATTCATGTGTAAGCGATATAAAAGTGAAGGAGCCGTCGACTTGTCATCTTCCAATATTTTCAGTCCTTTTTCCTAAGCCCTGTACTGATGATGCAAGTGTTCTGCGGGTTTGTAATCAGTTACATGGTTCAGAGCAACATTTGGGTTCTGAGAGGAGCAAATCTTCAGAACAATCTGTCAACTTAAAATCATCTGGAGAATCAGAACTTATTTTTGAATATTTTGAAGGGGAACAACCTCAGCAGAGAAGGCCGTTATTTGATAAGATACATCAACTGGTTGAGGGAGATGGACGTCCACAGGGAAAAATTTATGGGGATCCGACCATGCTCAATTCCATAACTTTGAATGATCTGCATGCTGGATCATGGTACTCAGTGGCATGGTATCCCATTTATAGGATACCAGATGGCAACCTTCGAGCTGCATTTTTGACTTACCACTCACTAGGACATTTTGTTTCAAGAACTTCCCAACCTAACTCTCCAGATACAAATTCTTGTTTAGTTTGTCCAGTCGTGGGTCTTCAAAGTTATAATGCACAGAATGAATGCTGGTTTGAGCCTAGAAACAGTACGCCCACGTTAACCCCTGGCTTGAGTCCTCCTAGAATCCTCGAGGAGCGCCTGAGGACGCTGGAAGAGACTGCATCTCTCATGGCCAGAGCTGTGGTTAAGAAAGGAAATCTGAACTCTGAAAACACGCATCCAGATTACGAGTTCTTCCTCTCACGGCGACTCTAG

Protein sequence

MDPQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRRTFSCLDWGKPFQEKLQKTMQCALVRSSNFQKVLDKGKESLELRLEENSCSRGIKDSKVSSFAWRNFFDYRCAVISFLTVESDGLWRIVALPPQYLDSLDVSCLPQMNQSTAERKLVQKGPASNGTYSFNSFRCRSLLESNNKLFDSKAIKSSNKSSGKFSCRSSCSGSALMSSDSSAISDIPVGGAKMQRYGKKNPRKKAKKKEIECKKISSDFVSAETEVSSKDSAHGSFLSEACGNNDSDCRDGSVLCSIAQGTFLPDFRANKNDFKRDSERIIQPLGTTDSISSNIVDGNASEVSSSASKNFSGYYKVCGSKNQALIKVPGCTHVNGGVNSRERLFAGSYNDFCSKDSLDNNSPDSNCFSSNGNSDNFNLKLDEKKCFGVDLLEERSSPSRVNYCSHNSVRDEVDVNAKVEKANRGIRGCTVSETCSVLPGKKTKQNKKLTGSSRMNRYGGLGSSQRRTGKENRLTVWQKVQRNNSGECCEQLDQVSPISKHFKGICNPVVGVQMPKVKDKKTGNRKQLKEKFPRRLKRKNTSGQEKIYRPTRNNCGSNTSSMVYKPPNGRLDIRSVGFDIRRSSGDPRSRFHNDTTDKCTTSESFESTQVCLDGLVSSKLISDGLNSKKVENDSGSSPRSCNSLNQSNLVEVQSPVYLPHLFFQATKGSSLAECSKHNNQSRSPLHNWLPSGAEGSRLATLARPDFSSLKDASTRPTEFGTSEKSIQERVNCNIVDPVSVVTEGIQHSRDGNHGPLEHECEVPKVYGYNTAALQDHRCEFDVDEHFNSKSSCEDASRMEQAVNNACRAQLVSEAIQMETGSPIAEFERFLQLSSPVINQRPKLRSSEIYPRNPPGDVIPCSNETADISLGCLWQWYEKHGNYGLEIKANGHENSNGFGADNSAFCAYFVPFLSAVQLFKSHKTHAPTTGPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVLRVCNQLHGSEQHLGSERSKSSEQSVNLKSSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRPQGKIYGDPTMLNSITLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFVSRTSQPNSPDTNSCLVCPVVGLQSYNAQNECWFEPRNSTPTLTPGLSPPRILEERLRTLEETASLMARAVVKKGNLNSENTHPDYEFFLSRRL
Homology
BLAST of Lsi02G002760.1 vs. ExPASy Swiss-Prot
Match: Q9FLM8 (DNA-directed RNA polymerases II, IV and V subunit 12 OS=Arabidopsis thaliana OX=3702 GN=NRPB12 PE=1 SV=1)

HSP 1 Score: 92.8 bits (229), Expect = 2.8e-17
Identity = 39/44 (88.64%), Postives = 41/44 (93.18%), Query Frame = 0

Query: 1  MDPQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRR 45
          MDP PEPV+Y+CGDCG ENTLK GDVIQCRECGYRILYKKRTRR
Sbjct: 1  MDPAPEPVTYVCGDCGQENTLKSGDVIQCRECGYRILYKKRTRR 44

BLAST of Lsi02G002760.1 vs. ExPASy Swiss-Prot
Match: Q9C8M4 (DNA-directed RNA polymerase subunit 12-like protein OS=Arabidopsis thaliana OX=3702 GN=NRPB12L PE=3 SV=1)

HSP 1 Score: 69.3 bits (168), Expect = 3.4e-10
Identity = 30/41 (73.17%), Postives = 34/41 (82.93%), Query Frame = 0

Query: 2  DPQPEP-VSYICGDCGMENTLKQGDVIQCRECGYRILYKKR 42
          D QPE  V Y+CGDCG EN LK+GDV QCR+CG+RILYKKR
Sbjct: 10 DKQPEQLVIYVCGDCGQENILKRGDVFQCRDCGFRILYKKR 50

BLAST of Lsi02G002760.1 vs. ExPASy Swiss-Prot
Match: Q3ZBC0 (DNA-directed RNA polymerases I, II, and III subunit RPABC4 OS=Bos taurus OX=9913 GN=POLR2K PE=1 SV=1)

HSP 1 Score: 66.2 bits (160), Expect = 2.8e-09
Identity = 26/42 (61.90%), Postives = 34/42 (80.95%), Query Frame = 0

Query: 3  PQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRR 45
          P+ +P+ YICG+C  EN +K  D I+CRECGYRI+YKKRT+R
Sbjct: 10 PKQQPMIYICGECHTENEIKSRDPIRCRECGYRIMYKKRTKR 51

BLAST of Lsi02G002760.1 vs. ExPASy Swiss-Prot
Match: P53803 (DNA-directed RNA polymerases I, II, and III subunit RPABC4 OS=Homo sapiens OX=9606 GN=POLR2K PE=1 SV=1)

HSP 1 Score: 66.2 bits (160), Expect = 2.8e-09
Identity = 26/42 (61.90%), Postives = 34/42 (80.95%), Query Frame = 0

Query: 3  PQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRR 45
          P+ +P+ YICG+C  EN +K  D I+CRECGYRI+YKKRT+R
Sbjct: 10 PKQQPMIYICGECHTENEIKSRDPIRCRECGYRIMYKKRTKR 51

BLAST of Lsi02G002760.1 vs. ExPASy Swiss-Prot
Match: Q63871 (DNA-directed RNA polymerases I, II, and III subunit RPABC4 OS=Mus musculus OX=10090 GN=Polr2k PE=3 SV=2)

HSP 1 Score: 66.2 bits (160), Expect = 2.8e-09
Identity = 26/42 (61.90%), Postives = 34/42 (80.95%), Query Frame = 0

Query: 3  PQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRR 45
          P+ +P+ YICG+C  EN +K  D I+CRECGYRI+YKKRT+R
Sbjct: 10 PKQQPMIYICGECHTENEIKSRDPIRCRECGYRIMYKKRTKR 51

BLAST of Lsi02G002760.1 vs. ExPASy TrEMBL
Match: A0A0A0LT77 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G043170 PE=4 SV=1)

HSP 1 Score: 1810.8 bits (4689), Expect = 0.0e+00
Identity = 953/1195 (79.75%), Postives = 1021/1195 (85.44%), Query Frame = 0

Query: 63   MQCALVRSSNFQKVLDKGKESLELRLEENSCSRGIK-DSKVSSFAWRNFFDYRCAVISFL 122
            MQC LV SS+FQKVLDKGKESLELRLE+NSCSRGI  DSKVSSFAWRNFFDYR A+IS L
Sbjct: 1    MQCTLV-SSDFQKVLDKGKESLELRLEKNSCSRGISTDSKVSSFAWRNFFDYRRAIISCL 60

Query: 123  TVESDGLWRIVALPPQYLDSLDVSCLPQMNQSTAERKLVQKGPASNGTYSFNSFRCRSLL 182
            T+ESDGLWRIVALPPQYLDSL++SCLPQMNQ TA RKLVQKGPASNGTYSFNS RCRSLL
Sbjct: 61   TLESDGLWRIVALPPQYLDSLNLSCLPQMNQFTAGRKLVQKGPASNGTYSFNSLRCRSLL 120

Query: 183  ESNNKLFDSKAIKSSNKSSGKFSCRSSCSGSALMSSDSSAISDIPVGGAKMQRYGKKNPR 242
            ESN KL DSKAIKS  +SSGKF C SSCSGSALMSSDS AISDIPV GAKMQRYGKKNPR
Sbjct: 121  ESNKKLLDSKAIKSPKQSSGKFPCTSSCSGSALMSSDSIAISDIPVDGAKMQRYGKKNPR 180

Query: 243  KKAKKKEIECKKISSDFVSAETEVSSKDSAHGSFLSEACGNNDSDCRDGSVLCSIAQGTF 302
            KKAKKKEIECK ISSDFVSAETEVS +DSA  SFLSEACG+NDSD RD SVLCSIAQ TF
Sbjct: 181  KKAKKKEIECKNISSDFVSAETEVSLQDSARASFLSEACGSNDSDFRDRSVLCSIAQETF 240

Query: 303  LPDFRANKNDFKRDSERIIQPLGTTDSISSNIVDGNASEVSSSASKNFSGYYKVCGSKNQ 362
            LPDF         + + +IQPLGT DS+SS IVDG++S+VSS A KNFSGYYKVCGS+NQ
Sbjct: 241  LPDF---------EQDSVIQPLGTVDSVSSEIVDGHSSKVSSLAIKNFSGYYKVCGSENQ 300

Query: 363  ALIKVPGCTHVNGGVNSRERLFAGSYNDFCSKDSLDNNSPDSNCFSSNGNSDNFNLKLDE 422
            ALI VPGC HV+ G+NSRER  AGS NDFCSKD LDN S DS   S NGN D+ NLKL+E
Sbjct: 301  ALINVPGCIHVDVGLNSRERFIAGSCNDFCSKDYLDNISRDSKWVSLNGNCDDLNLKLNE 360

Query: 423  KKCFGVDLLEERSSPSRVNYCSHNSVRDEVDVNAKVEKANRGIRGCTVSETCSVLPGKKT 482
            K+ FGVDLLEERSSPS+      NS RDEVD+NA+VEKAN GIRGCTVSETCSVLPGKKT
Sbjct: 361  KQGFGVDLLEERSSPSQ------NSARDEVDLNAEVEKANLGIRGCTVSETCSVLPGKKT 420

Query: 483  KQNKKLTGSSRMNRYGGLGSSQRRTGKENRLTVWQKVQRNNSGECCEQLDQVSPISKHFK 542
            KQNKKLTGSSRMNRYGGLGSSQRRTGKENR TVWQKVQR++SG C EQLDQVSPISK FK
Sbjct: 421  KQNKKLTGSSRMNRYGGLGSSQRRTGKENRHTVWQKVQRSSSGGCSEQLDQVSPISKQFK 480

Query: 543  GICNPVVGVQMPKVKDKKTGNRKQLKEKFPRRLKRKNTSGQEKIYRPTRNNCGSNTSSMV 602
            GICNPVVGVQMPKVKDKKTGN+KQLKEK PRRLKRKNTSGQEKIYRPTRN+CGSNTSSMV
Sbjct: 481  GICNPVVGVQMPKVKDKKTGNKKQLKEKCPRRLKRKNTSGQEKIYRPTRNSCGSNTSSMV 540

Query: 603  YKPPNGRLDIRSVGFDIRRSSGDPRSRFHNDTTDKCTTSESFESTQVCLDGLVSSKLISD 662
            +KPPN +LD+RS+GFDIRRSSGDPRS F ND+TDKCT SES ES QV LD L+S+KLI+D
Sbjct: 541  HKPPNEKLDVRSMGFDIRRSSGDPRSCFQNDSTDKCTNSESVESKQVHLDELISNKLIND 600

Query: 663  GLNSKKVENDSGSSPRSCNSLNQSNLVEVQSP---------------------------- 722
            GL+S+KVENDS S P+SCNS NQSN VEV+SP                            
Sbjct: 601  GLSSQKVENDSSSLPKSCNSSNQSNPVEVKSPVYLPHLFFQKVGNDSSSLPKSCNSLNQS 660

Query: 723  --------VYLPHLFFQATKGSSLAECSKHNNQSRSPLHNWLPSGAEGSRLATLARPDFS 782
                    VYLPHLFFQATKGSSL E SKH+ QSRSPL NWLPSGAEGSR  TLARPDFS
Sbjct: 661  NPVEVKSSVYLPHLFFQATKGSSLDERSKHDTQSRSPLQNWLPSGAEGSRSITLARPDFS 720

Query: 783  SLKDASTRPTEFGTSEKSIQERVNCNIVDPVSVVTEGIQHSRDGNHGPLEHECEVPKVYG 842
            SL+DA+T+P EFGT EKSI+ERVNCN+++PVS V EGIQH RD + GPLEHEC V K+YG
Sbjct: 721  SLRDANTQPAEFGTLEKSIKERVNCNVLNPVSDVIEGIQHYRDRDDGPLEHECGVQKMYG 780

Query: 843  YNTAALQDHRCEFDVDEHFNSKSSCEDASRMEQAVNNACRAQLVSEAIQMETGSPIAEFE 902
            Y+T  LQDH+ EFDVDEHFN KSSCED SRMEQAVNNACRAQL SEAIQMETG PIAEFE
Sbjct: 781  YDTTTLQDHKSEFDVDEHFNCKSSCEDVSRMEQAVNNACRAQLASEAIQMETGCPIAEFE 840

Query: 903  RFLQLSSPVINQRPKLRSSEIYPRNPPGDVIPCSNETADISLGCLWQWYEKHGNYGLEIK 962
            RFL LSSPVI+QRP   SS+I PRN PGDVIPCSNET +ISLGCLWQWYEKHG+YGLEIK
Sbjct: 841  RFLHLSSPVIDQRPN-SSSDICPRNLPGDVIPCSNETTNISLGCLWQWYEKHGSYGLEIK 900

Query: 963  ANGHENSNGFGADNSAFCAYFVPFLSAVQLFKSHKTHAPT-TGPVGFDSCVSDIKVKEPS 1022
            A G ENSNGFGA NSAF AYFVPFLSAVQLFKS KTH  T TGP+GF+SCVSDIKVKEPS
Sbjct: 901  AKGQENSNGFGAVNSAFRAYFVPFLSAVQLFKSRKTHVGTATGPLGFNSCVSDIKVKEPS 960

Query: 1023 TCHLPIFSVLFPKPCTDDASVLRVCNQLHGSEQHLGSERSKSSEQSVNLKSSGESELIFE 1082
            TCHLPIFS+LFPKPCTDD SVLRVCNQ H SEQHL SE+ KSSEQS +L+ SGESELIFE
Sbjct: 961  TCHLPIFSLLFPKPCTDDTSVLRVCNQFHSSEQHLASEKKKSSEQSASLQLSGESELIFE 1020

Query: 1083 YFEGEQPQQRRPLFDKIHQLVEGDGRPQGKIYGDPTMLNSITLNDLHAGSWYSVAWYPIY 1142
            YFEGEQPQ RRPLFDKIHQLVEGDG  QGKIYGDPT+LNSITL+DLHAGSWYSVAWYPIY
Sbjct: 1021 YFEGEQPQLRRPLFDKIHQLVEGDGL-QGKIYGDPTVLNSITLDDLHAGSWYSVAWYPIY 1080

Query: 1143 RIPDGNLRAAFLTYHSLGHFVSRTSQPNSPDTNSCLVCPVVGLQSYNAQNECWFEPRNS- 1202
            RIPDGNLRAAFLTYHSLGHFVSRTSQ    DTNSCLVCPVVGLQSYNAQNECWFEPR+S 
Sbjct: 1081 RIPDGNLRAAFLTYHSLGHFVSRTSQ----DTNSCLVCPVVGLQSYNAQNECWFEPRDST 1140

Query: 1203 -TPTLTPGLSPPRILEERLRTLEETASLMARAVVKKGNLNSENTHPDYEFFLSRR 1218
             T T T  L+PPRIL+ERLRTLEETASLMARAVVKKGNLNS NTHPDYEFFLSRR
Sbjct: 1141 RTSTFTSNLNPPRILQERLRTLEETASLMARAVVKKGNLNSGNTHPDYEFFLSRR 1173

BLAST of Lsi02G002760.1 vs. ExPASy TrEMBL
Match: A0A5D3BH03 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold248G002740 PE=4 SV=1)

HSP 1 Score: 1802.3 bits (4667), Expect = 0.0e+00
Identity = 944/1193 (79.13%), Postives = 1014/1193 (85.00%), Query Frame = 0

Query: 63   MQCALVRSSNFQKVLDKGKESLELRLEENSCSRGI-KDSKVSSFAWRNFFDYRCAVISFL 122
            MQCALVRSS+FQKVLDKGKESL+LRLE+NSCSRGI KD +VSSFAWRNFFDYRCAVI FL
Sbjct: 1    MQCALVRSSDFQKVLDKGKESLDLRLEKNSCSRGISKDFEVSSFAWRNFFDYRCAVIRFL 60

Query: 123  TVESDGLWRIVALPPQYLDSLDVSCLPQMNQSTAERKLVQKGPASNGTYSFNSFRCRSLL 182
            T+ESDGLWRIVALPPQYLDSL+VSCLPQMNQ TA RKLVQKG ASNGTYSFNS RCRSLL
Sbjct: 61   TLESDGLWRIVALPPQYLDSLNVSCLPQMNQFTAGRKLVQKGSASNGTYSFNSLRCRSLL 120

Query: 183  ESNNKLFDSKAIKSSNKSSGKFSCRSSCSGSALMSSDSSAISDIPVGGAKMQRYGKKNPR 242
            ESN KL DSKAIKS NKSSGK  C SSCS SALMSSDS A SDIP+ GAKMQRYGKKNPR
Sbjct: 121  ESNKKLLDSKAIKSPNKSSGKLLCTSSCSASALMSSDSIATSDIPIDGAKMQRYGKKNPR 180

Query: 243  KKAKKKEIECKKISSDFVSAETEVSSKDSAHGSFLSEACGNNDSDCRDGSVLCSIAQGTF 302
            KKAKKKE+E KKISS+FVSAETEVS +DSA  SFLSEACG+NDSD R+ +VLCSIA  TF
Sbjct: 181  KKAKKKELEYKKISSEFVSAETEVSLQDSARASFLSEACGSNDSDFRNRTVLCSIAPETF 240

Query: 303  LPDFRANKNDFKRDSERIIQPLGTTDSISSNIVDGNASEVSSSASKNFSGYYKVCGSKNQ 362
            LP       DF+RDSE  IQPLGT DS+SS IVDG++S+VSSSA KNFSGY+KVCGS+NQ
Sbjct: 241  LP-------DFERDSE--IQPLGTVDSVSSEIVDGHSSKVSSSAIKNFSGYHKVCGSENQ 300

Query: 363  ALIKVPGCTHVNGGVNSRERLFAGSYNDFCSKDSLDNNSPDSNCFSSNGNSDNFNLKLDE 422
            AL   PGC HV+ G+NSRE L AGS NDFCS DSLDNNS DS   S N N D+ NLKL+E
Sbjct: 301  ALTNAPGCFHVDVGLNSRESLLAGSCNDFCSTDSLDNNSCDSKWVSLNSNCDDLNLKLNE 360

Query: 423  KKCFGVDLLEERSSPSRVNYCSHNSVRDEVDVNAKVEKANRGIRGCTVSETCSVLPGKKT 482
            KK FGVDLLEERSSP R N CS NS RDEVD+N +VEK   GI+GCTVSETCSVLPGKKT
Sbjct: 361  KKGFGVDLLEERSSPYREN-CSQNSARDEVDLNTEVEK---GIQGCTVSETCSVLPGKKT 420

Query: 483  KQNKKLTGSSRMNRYGGLGSSQRRTGKENRLTVWQKVQRNNSGECCEQLDQVSPISKHFK 542
            KQNKKLTGSSRMNRYGGLGSSQRRTGKENR TVWQKVQR+NSG C EQLDQVSPISK FK
Sbjct: 421  KQNKKLTGSSRMNRYGGLGSSQRRTGKENRHTVWQKVQRSNSGGCSEQLDQVSPISKQFK 480

Query: 543  GICNPVVGVQMPKVKDKKTGNRKQLKEKFPRRLKRKNTSGQEKIYRPTRNNCGSNTSSMV 602
            GICNPV GVQMPKVKDKKTGNRKQLKEK  RRLKRKNTSGQEKIYRPTRN+CGSNTSSMV
Sbjct: 481  GICNPVAGVQMPKVKDKKTGNRKQLKEKCSRRLKRKNTSGQEKIYRPTRNSCGSNTSSMV 540

Query: 603  YKPPNGRLDIRSVGFDIRRSSGDPRSRFHNDTTDKCTTSESFESTQVCLDGLVSSKLISD 662
            +KPPN RLDIRS+GFDIRRSSG+PRSRF NDTTDKC  SE+ E  QV  D L S+KLI D
Sbjct: 541  HKPPNERLDIRSMGFDIRRSSGNPRSRFQNDTTDKCMNSEAVEGKQVHPDELFSNKLIYD 600

Query: 663  GLNSKKVENDSGSSPRSCNSLNQ------------------------------------S 722
            GL+S+KVENDS S P+SCNS NQ                                    S
Sbjct: 601  GLSSQKVENDSSSLPKSCNSSNQSNPVEVKSPVYLPHLFFQKVENDSSSLPKSCSSSNLS 660

Query: 723  NLVEVQSPVYLPHLFFQATKGSSLAECSKHNNQSRSPLHNWLPSGAEGSRLATLARPDFS 782
            N VEV+SPVYLPHLFFQATKGSSLAE SKH  QSRSPL NWLPSGAEGSR  TLARPDFS
Sbjct: 661  NTVEVKSPVYLPHLFFQATKGSSLAERSKHETQSRSPLQNWLPSGAEGSRSTTLARPDFS 720

Query: 783  SLKDASTRPTEFGTSEKSIQERVNCNIVDPVSVVTEGIQHSRDGNHGPLEHECEVPKVYG 842
            SL+DA+T+P EFGTSEKSI+ERVNC++++PVS V EGIQH RD +HG LEHECEV K+YG
Sbjct: 721  SLRDANTQPAEFGTSEKSIKERVNCSLLNPVSDVLEGIQHYRDRDHGSLEHECEVQKIYG 780

Query: 843  YNTAALQDHRCEFDVDEHFNSKSSCEDASRMEQAVNNACRAQLVSEAIQMETGSPIAEFE 902
            ++T  LQ+ +CEF+VDEHFN KSSCED SRMEQAVNNAC+AQL SEAIQMETG PIAEFE
Sbjct: 781  FDTTTLQNQKCEFNVDEHFNCKSSCEDVSRMEQAVNNACKAQLASEAIQMETGCPIAEFE 840

Query: 903  RFLQLSSPVINQRPKLRSSEIYPRNPPGDVIPCSNETADISLGCLWQWYEKHGNYGLEIK 962
            RFL LSSPVI+QRPKLRSSEI PRN PGDVIPCSNET +ISL CLWQWYEKHG+YGLEIK
Sbjct: 841  RFLHLSSPVIDQRPKLRSSEICPRNLPGDVIPCSNETTNISLACLWQWYEKHGSYGLEIK 900

Query: 963  ANGHENSNGFGADNSAFCAYFVPFLSAVQLFKSHKTH-APTTGPVGFDSCVSDIKVKEPS 1022
            A  HENSNGFG  NSAF AYFVPFLSA+QLFKS KTH   TTGP+GFDSCVSDIKVKEPS
Sbjct: 901  AKSHENSNGFGVVNSAFRAYFVPFLSAIQLFKSRKTHVGTTTGPLGFDSCVSDIKVKEPS 960

Query: 1023 TCHLPIFSVLFPKPCTDDASVLRVCNQLHGSEQHLGSERSKSSEQSVNLKSSGESELIFE 1082
            TCHLPIFS+LFP+P TDD SVLRVCN+ H SEQ L SE+ KSS+QS +L+ SGESELIFE
Sbjct: 961  TCHLPIFSLLFPEPSTDDTSVLRVCNRFHSSEQDLASEKRKSSKQSASLQLSGESELIFE 1020

Query: 1083 YFEGEQPQQRRPLFDKIHQLVEGDGRPQGKIYGDPTMLNSITLNDLHAGSWYSVAWYPIY 1142
            YFEGEQPQ RRPLFDKIHQLVEGDG  QGKIYGDPTMLNSITL+DLHAGSWYSVAWYPIY
Sbjct: 1021 YFEGEQPQLRRPLFDKIHQLVEGDGCLQGKIYGDPTMLNSITLDDLHAGSWYSVAWYPIY 1080

Query: 1143 RIPDGNLRAAFLTYHSLGHFVSRTSQPNSPDTNSCLVCPVVGLQSYNAQNECWFEPRNST 1202
            RIPDGNLRAAFLTYHSLGHFVSRTSQ    DTNSCLVCPVVGLQSYNAQNECWFEPR ST
Sbjct: 1081 RIPDGNLRAAFLTYHSLGHFVSRTSQ----DTNSCLVCPVVGLQSYNAQNECWFEPREST 1140

Query: 1203 PTLTPGLSPPRILEERLRTLEETASLMARAVVKKGNLNSENTHPDYEFFLSRR 1218
             T T  L+PPR+L+ERLRTLEETASLMARAVVKKGNLNS NTHPDYEFFLSRR
Sbjct: 1141 STFTSDLNPPRVLQERLRTLEETASLMARAVVKKGNLNSGNTHPDYEFFLSRR 1176

BLAST of Lsi02G002760.1 vs. ExPASy TrEMBL
Match: A0A6J1C5T5 (uncharacterized protein LOC111008718 OS=Momordica charantia OX=3673 GN=LOC111008718 PE=4 SV=1)

HSP 1 Score: 1669.1 bits (4321), Expect = 0.0e+00
Identity = 869/1162 (74.78%), Postives = 965/1162 (83.05%), Query Frame = 0

Query: 63   MQCALVRS-SNFQKVLDKGKESLELRLEENSCSRGIKDSKVSSFAWRNFFDYRCAVISFL 122
            MQCAL R  S+ QK+ DKGKE LE+R +E++CSR IKDS+VSS AWRNFFDYRCAV+SFL
Sbjct: 1    MQCALERRISDLQKIPDKGKELLEVRFQEDNCSRRIKDSEVSSLAWRNFFDYRCAVLSFL 60

Query: 123  TVESDGLWRIVALPPQYLDSLDVSCLPQMNQSTAERKLVQKGPASNGTYSFNSFRCRSLL 182
            T+ESDG W+IVA P QYLD L  SCLPQMNQ  AERKLVQKGPASNGTYS NSFRCRSLL
Sbjct: 61   TLESDGPWKIVAPPLQYLDCLHASCLPQMNQFAAERKLVQKGPASNGTYSINSFRCRSLL 120

Query: 183  ESNNKLFDSKAIKSSNKSSGKFSCRSSCSGSALMSSDSSAISDIPVGGAKMQRYGKKNPR 242
            ESN KL DSKAIKS N+ SGKFSCRSSCS SAL+SSDSSAISDIP+GGAKM RYGKKNPR
Sbjct: 121  ESNKKLLDSKAIKSLNELSGKFSCRSSCSSSALISSDSSAISDIPIGGAKMHRYGKKNPR 180

Query: 243  KKAKKKEIECKKISSDFVSAETEVSSKDSAHGSFLSEACGNNDSDCRDGSVLCSIAQGTF 302
            KKAKKK IECKKIS DFV AETEVSS+DSA GS L EACGNND +  DGSV CS AQ TF
Sbjct: 181  KKAKKKGIECKKISCDFVCAETEVSSEDSARGSLLLEACGNNDLNPGDGSVSCSTAQETF 240

Query: 303  LPDFRANKNDFKRDSERIIQPLGTTDSISSNIVDGNASEVSSSASKNFSGYYKVCGSKNQ 362
            LPD RA+KN F  +SERIIQPLGT  SISS  V+G+AS+V  SA++N SG Y VCGS+NQ
Sbjct: 241  LPDIRASKNYFDGNSERIIQPLGTVHSISSETVEGDASQVLPSATQNLSGNYNVCGSENQ 300

Query: 363  ALIKVPGCTHVNGGVNSRERLFAGSYNDFCSKDSLDNNSPDSNCFSSNGNSDNFNLKLDE 422
             L+KV GC+H +GGV+ RERLF G   DF SK   DNNS +S C SSN + D  NLKL+E
Sbjct: 301  PLVKVTGCSHFDGGVDPRERLFVGCCGDFRSKGFSDNNSSESQCVSSNSDYDGLNLKLNE 360

Query: 423  KKCFGVDLLEERSSPSRVNYCS-HNSVRDEVDVNAKVEKANRGIRGCTVSETCSVLPGKK 482
            K+ FGV LLEE++SPSR NYCS H SVRDEVDVNA+VE+A  GI+GCT SET  VLPGKK
Sbjct: 361  KESFGVGLLEEKNSPSRENYCSRHISVRDEVDVNAEVERAKHGIQGCTNSETRLVLPGKK 420

Query: 483  TKQNKKLTGSSRMNRYGGLGSSQRRTGKENRLTVWQKVQRNNSGECCEQLDQVSPISKHF 542
            TKQNKKLTGSS++NR+G +G+SQRRTGKEN  TVWQKVQ+NNSG CC QLDQVSPI K F
Sbjct: 421  TKQNKKLTGSSKINRFGIVGNSQRRTGKENNHTVWQKVQKNNSGGCCAQLDQVSPICKQF 480

Query: 543  KGICNPVVGVQMPKVKDKKTGNRKQLKEKFPRRLKRKNTSGQEKIYRPTRNNCGSNTSSM 602
            KG C P VGVQ+PKVKD+KTGNRKQLK+K  R+L+RKNTS Q+KIYRP ++  G+NTSSM
Sbjct: 481  KGNCKP-VGVQIPKVKDRKTGNRKQLKDKSSRKLRRKNTSVQDKIYRPCKSGIGNNTSSM 540

Query: 603  VYKPPNGRLDIRSVGFDIRRSSGDPRSRFHNDTTDKCTTSESFESTQVCLDGLVSSKLIS 662
            V K PN RLDI S+GFDIRR +   +S+  ND T KC TSESFESTQ CLDGL+S +L+S
Sbjct: 541  VDKQPNERLDIPSMGFDIRRLNSASKSQLQNDNTGKCLTSESFESTQACLDGLMSDELVS 600

Query: 663  DGLNSKKVENDSGSSPRSCNSLNQSNLVEVQSPVYLPHLFF----QATKGSSLAECSKHN 722
            DGLNS++VEN+  SS RSCNSL+QSNL+EV SP+YLPHLFF    Q T+GSSLAE SKHN
Sbjct: 601  DGLNSQRVENEYSSSSRSCNSLDQSNLLEVHSPIYLPHLFFQRIDQVTQGSSLAEHSKHN 660

Query: 723  NQSRSPLHNWLPSGAEGSRLATLARPDFSSLKDASTRPTEFGTSEKSIQERVNCNIVDPV 782
            N SRSPL NW+PSGAEGSRL TLA PD SSLK  +  P E GTSE+SIQERV C++ DPV
Sbjct: 661  NHSRSPLQNWVPSGAEGSRLTTLAGPDSSSLKYVNKLPAELGTSEESIQERVVCDLQDPV 720

Query: 783  SVVTEGIQHSRDGNHGPLEHECEVPKVYGYNTAALQDHRCEFDVDEHFNSKSSCEDASRM 842
            SVVTE  + SRDGNHGPLE ECEV K+  ++   LQDH CE D+DEHFN KSSCEDAS+M
Sbjct: 721  SVVTEVSKSSRDGNHGPLEDECEVQKMCDHDITTLQDHSCELDMDEHFNCKSSCEDASKM 780

Query: 843  EQAVNNACRAQLVSEAIQMETGSPIAEFERFLQLSSPVINQRPKLRSSEIYPRNPPGDVI 902
            EQAVNNACR QL SEA+QMETG PIAEFE FL LSSPVI+QRPKL+S +I PRN  GD I
Sbjct: 781  EQAVNNACRVQLASEAVQMETGCPIAEFETFLHLSSPVISQRPKLKSCKICPRNLLGDAI 840

Query: 903  PCSNETADISLGCLWQWYEKHGNYGLEIKANGHENSNGFGADNSAFCAYFVPFLSAVQLF 962
             CS+E  +ISLGCLWQWYEKHG+YGLEIKA G+EN+N F  DNSAF AYFVPFLSAVQLF
Sbjct: 841  LCSHEIPNISLGCLWQWYEKHGSYGLEIKAKGNENANRFSYDNSAFLAYFVPFLSAVQLF 900

Query: 963  KSHKTHAPTT-GPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVLRVCNQLHGS 1022
            KSHKTHA TT  P G DSCV +IK+KEPSTCHLPIFSVLFPKP TDDAS+  V +Q H S
Sbjct: 901  KSHKTHAGTTANPAGLDSCVRNIKIKEPSTCHLPIFSVLFPKPHTDDASIPLVSSQFHSS 960

Query: 1023 EQHLGSERSKSSEQSVNLKSSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRPQGKI 1082
            EQ L SE++K SEQSV+LK SGESEL+FEYFE E PQQRRPLFDKI QLV GDGR QGKI
Sbjct: 961  EQPLASEKTKISEQSVDLKLSGESELVFEYFEVEPPQQRRPLFDKIQQLVGGDGRLQGKI 1020

Query: 1083 YGDPTMLNSITLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFVSRTSQPNSPD 1142
            YGDPTMLNSITLNDLHA SWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFV RTSQ NS D
Sbjct: 1021 YGDPTMLNSITLNDLHARSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFVCRTSQLNSSD 1080

Query: 1143 TNSCLVCPVVGLQSYNAQNECWFEPRNSTPTLTPGLSPPRILEERLRTLEETASLMARAV 1202
            T+SCLVCPVVGLQSYNAQNECWFEPRN T      + PP ILEERLRTLEETASLMARA+
Sbjct: 1081 TDSCLVCPVVGLQSYNAQNECWFEPRNGTSGFAFNVDPPGILEERLRTLEETASLMARAI 1140

Query: 1203 VKKGNLNSENTHPDYEFFLSRR 1218
            VKKGNLNSENTHPDYEFFLSRR
Sbjct: 1141 VKKGNLNSENTHPDYEFFLSRR 1161

BLAST of Lsi02G002760.1 vs. ExPASy TrEMBL
Match: A0A6J1GS60 (uncharacterized protein LOC111457006 OS=Cucurbita moschata OX=3662 GN=LOC111457006 PE=4 SV=1)

HSP 1 Score: 1635.2 bits (4233), Expect = 0.0e+00
Identity = 850/1159 (73.34%), Postives = 965/1159 (83.26%), Query Frame = 0

Query: 63   MQCALVRSSNFQKVLDKGKESLELRLEENSCSRGIKDSKVSSFAWRNFFDYRCAVISFLT 122
            MQCAL +SS FQKV DKGK+ LE++++E++CSR IKDS+VSSF WRNFFDYR AVIS LT
Sbjct: 1    MQCALEKSSEFQKVPDKGKQLLEVKIQEDNCSRRIKDSEVSSFEWRNFFDYRSAVISILT 60

Query: 123  VESDGLWRIVALPPQYLDSLDVSCLPQMNQSTAERKLVQKGPASNGTYSFNSFRCRSLLE 182
            +ESDGLWRIVALP Q LDSL VSCLPQMNQ TA+RKLV  GPASNGTYS NSFRCRSLLE
Sbjct: 61   LESDGLWRIVALPLQGLDSLHVSCLPQMNQFTADRKLVHNGPASNGTYSVNSFRCRSLLE 120

Query: 183  SNNKLFDSKAIKSSNKSSGKFSCRSSCSGSALMSSDSSAISDIPVGGAKMQRYGKKNPRK 242
            SN  L DSKA KSSNK+S KFS RSSCS SAL+S DSSAISDIP+G AK+QRYGKKN RK
Sbjct: 121  SNKNLLDSKAFKSSNKASSKFSWRSSCSSSALISGDSSAISDIPIGEAKIQRYGKKNSRK 180

Query: 243  KAKKKEIECKKISSDFVSAETEVSSKDSAHGSFLSEACGNNDSDCRDGSVLCSIAQGTFL 302
            KAKK++IECKK SSDFVSAETE+SS+DSA GS L EACGNN SDCRDG VLCS A+ TF 
Sbjct: 181  KAKKRDIECKKTSSDFVSAETEISSEDSARGSSLLEACGNNGSDCRDGPVLCSTARETFP 240

Query: 303  PDFRANKNDFKRDSERIIQPLGTTDSISSNIVDGNASEVSSSASKNFSGYYKVCGSKNQA 362
             D RA+KNDFKRDSERIIQPLGTTDSISS IV+G+ASEV  SA+KN SG Y    S+NQ 
Sbjct: 241  SDTRASKNDFKRDSERIIQPLGTTDSISSEIVEGDASEVPPSATKNSSGDYNGYVSENQP 300

Query: 363  LIKVPGCTHVNGGVNSRERLFAGSYNDFCSKDSLDNNSPDSNCFSSNGNSDNFNLKLDEK 422
            LIK PGCT  +G V+ +ERLF G  NDFCSKDS DNNSPDSNC       D+  LKL E 
Sbjct: 301  LIKAPGCTRFDGEVDRKERLFNGCCNDFCSKDSFDNNSPDSNC-------DSHTLKLTEN 360

Query: 423  KCFGVDLLEERSSPSRVNYCS-HNSVRDEVDVNAKVEKANRGIRGCTVSETCSVLPGKKT 482
            + FG+DLLE ++SPSR N CS HNS+RDEVDVNA+ EKAN GI+GCT SET  +LPGKKT
Sbjct: 361  EGFGIDLLEGQNSPSRENDCSHHNSIRDEVDVNAEEEKANHGIQGCTASETRLILPGKKT 420

Query: 483  KQNKKLTGSSRMNRYGGLGSSQRRTGKENRLTVWQKVQRNNSGECCEQLDQVS-PISKHF 542
            KQNKKL+G+SR NR+GG+GSSQR TGKEN  TVWQKVQ+NNSG CC QLDQVS P+SK  
Sbjct: 421  KQNKKLSGNSRTNRFGGMGSSQRCTGKENSRTVWQKVQKNNSGGCCAQLDQVSPPVSKQL 480

Query: 543  KGICNPVVGVQMPKVKDKKTGNRKQLKEKFPRRLKRKNTSGQEKIYRPTRNNCGSNTSSM 602
            KG+CNP VGVQ PKVKDKKTGNRKQLK+KF +RLK KNTS Q+KIYRP++++ GSNT+SM
Sbjct: 481  KGVCNP-VGVQTPKVKDKKTGNRKQLKDKFSKRLKNKNTSEQDKIYRPSKSSSGSNTNSM 540

Query: 603  VYKPPNGRLDIRSVGFDIRRSSGDPRSRFHNDTTDKCTTSESFESTQVCLDGLVSSKLIS 662
             +  PN RLDI ++GFDI +SSG  R+ F ND+TDKCTTSES ESTQVCLDG +S KLIS
Sbjct: 541  AHNRPNERLDIPAMGFDISKSSGGSRAPFQNDSTDKCTTSESSESTQVCLDGSMSDKLIS 600

Query: 663  DGLNSKKVENDSGSSPRSCNSLNQSNLVEVQSPVYLPHLFFQATKGSSLAECSKHNNQSR 722
            DGLN+++VEN+S +S  SC+SLNQSN ++ QSPVY+PHLFFQATKGSSLAE SKH+NQSR
Sbjct: 601  DGLNNQRVENESSTSLGSCSSLNQSNPLKAQSPVYVPHLFFQATKGSSLAERSKHSNQSR 660

Query: 723  SPLHNWLPSGAEGSRLAT-LARPDFSSLKDASTRPTEFGTSEKSIQERVNCNIVDPVSVV 782
            SPL NW+PS AEGSRL T LARPDFSSLKDA+ +P EFG SEKSIQE V+CN++DPVS  
Sbjct: 661  SPLQNWVPSVAEGSRLTTALARPDFSSLKDANKQPAEFGISEKSIQESVDCNLLDPVSNF 720

Query: 783  TEGIQHSRDGNHGPLEHECEVPKVYGYNTAALQDHRCEFDVDEHFNSKSSCEDASRMEQA 842
             E IQHSRD NH PLE ECE  + +G++T ALQD  CE DVDEHFN KS+C DA+++EQ 
Sbjct: 721  IEAIQHSRDRNHDPLEKECEAQESHGHDTNALQDRSCELDVDEHFNCKSTCGDATKIEQV 780

Query: 843  VNNACRAQLVSEAIQMETGSPIAEFERFLQLSSPVINQRPKLRSSEIYPRNPPGDVIPCS 902
            VN+AC+AQL  +A+       IAEFERFL LSSPVI+QRP LRS +I  +N  GD IPCS
Sbjct: 781  VNSACKAQLPFDAVHQ-----IAEFERFLHLSSPVISQRPNLRSCKICSKNSLGDGIPCS 840

Query: 903  NETADISLGCLWQWYEKHGNYGLEIKANGHENSNGFGADNSAFCAYFVPFLSAVQLFKSH 962
            +ETA+ISL CLWQWYEKHG+YGLE+KANGHE SNGFGADNS F AYFVPFLSAVQLFKSH
Sbjct: 841  HETANISLSCLWQWYEKHGSYGLEVKANGHEGSNGFGADNSEFHAYFVPFLSAVQLFKSH 900

Query: 963  KTHA-PTTGPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVLRVCNQLHGSEQH 1022
            KTH+  TT PVG DS VSDIK  EP T  LPIFSVLFPKPCTDDA+VL+ C+QLH SE+ 
Sbjct: 901  KTHSGATTCPVGLDSRVSDIKANEPPTAQLPIFSVLFPKPCTDDANVLQACSQLHSSEEP 960

Query: 1023 LGSERSKSSEQSVNLKSSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRPQGKIYGD 1082
            L SE+   SEQSV+   SGESELIFEYFE EQPQQRRPLFDKI QLV+GDG  +GKIYGD
Sbjct: 961  LASEKRNFSEQSVDSNLSGESELIFEYFEEEQPQQRRPLFDKIRQLVKGDGCLRGKIYGD 1020

Query: 1083 PTMLNSITLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFVSRTSQPNSPDTNS 1142
            PT+L SITLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFV RTSQ +S +T+S
Sbjct: 1021 PTVLESITLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFVCRTSQSSSSETDS 1080

Query: 1143 CLVCPVVGLQSYNAQNECWFEPRNSTPTLTPGLSPPRILEERLRTLEETASLMARAVVKK 1202
            C+VCPVVGLQS+NAQNECWF+PRNST       +PP +++ERLRTLEETASLMARAVVKK
Sbjct: 1081 CIVCPVVGLQSHNAQNECWFKPRNSTSM----FNPPGVVDERLRTLEETASLMARAVVKK 1140

Query: 1203 GNLNSENTHPDYEFFLSRR 1218
            GNLN+ N HPDYEFFLSRR
Sbjct: 1141 GNLNARNRHPDYEFFLSRR 1142

BLAST of Lsi02G002760.1 vs. ExPASy TrEMBL
Match: A0A6J1K4L4 (uncharacterized protein LOC111490028 OS=Cucurbita maxima OX=3661 GN=LOC111490028 PE=4 SV=1)

HSP 1 Score: 1633.2 bits (4228), Expect = 0.0e+00
Identity = 849/1158 (73.32%), Postives = 964/1158 (83.25%), Query Frame = 0

Query: 63   MQCALVRSSNFQKVLDKGKESLELRLEENSCSRGIKDSKVSSFAWRNFFDYRCAVISFLT 122
            MQCAL +SS FQKV DKGK+ LE++++E++CSR IKDS+VSSF WRNFFDYR AVIS LT
Sbjct: 1    MQCALEKSSEFQKVPDKGKQLLEVKIQEDNCSRRIKDSEVSSFEWRNFFDYRSAVISILT 60

Query: 123  VESDGLWRIVALPPQYLDSLDVSCLPQMNQSTAERKLVQKGPASNGTYSFNSFRCRSLLE 182
            +ESDGLWRIVALP Q LDSL VSCLPQMNQ TA+RKLV  GPASNGTYS NSFRCRSLLE
Sbjct: 61   LESDGLWRIVALPLQGLDSLHVSCLPQMNQFTADRKLVHNGPASNGTYSVNSFRCRSLLE 120

Query: 183  SNNKLFDSKAIKSSNKSSGKFSCRSSCSGSALMSSDSSAISDIPVGGAKMQRYGKKNPRK 242
            SN  L DSKA KSSNK+S KFS RSSCS SAL+S DSSAISDIP+G  K+QRYGKKN RK
Sbjct: 121  SNKNLLDSKAFKSSNKASCKFSWRSSCSSSALISGDSSAISDIPIGEDKIQRYGKKNSRK 180

Query: 243  KAKKKEIECKKISSDFVSAETEVSSKDSAHGSFLSEACGNNDSDCRDGSVLCSIAQGTFL 302
            KAKK++IECKK SSDFVSAETEVSS+DSA  S L E  GNN SDCRDGSVLCS A+ TF 
Sbjct: 181  KAKKRDIECKKTSSDFVSAETEVSSEDSARESSLLEVRGNNGSDCRDGSVLCSTARETFP 240

Query: 303  PDFRANKNDFKRDSERIIQPLGTTDSISSNIVDGNASEVSSSASKNFSGYYKVCGSKNQA 362
             D RA+KNDFKRDSERIIQPLGTTDSISS IV+G+ASE+  SA+KN  G Y   GS+NQ 
Sbjct: 241  SDSRASKNDFKRDSERIIQPLGTTDSISSEIVEGDASEIPPSATKNSIGDYNGYGSENQP 300

Query: 363  LIKVPGCTHVNGGVNSRERLFAGSYNDFCSKDSLDNNSPDSNCFSSNGNSDNFNLKLDEK 422
            LIK PGCT  +G V+ +ERLF G  NDFC+KDS DNNSPDSNC       D+  LKL E 
Sbjct: 301  LIKAPGCTRFDGEVDRKERLFNGCCNDFCTKDSFDNNSPDSNC-------DSHTLKLTEN 360

Query: 423  KCFGVDLLEERSSPSRVNYCS-HNSVRDEVDVNAKVEKANRGIRGCTVSETCSVLPGKKT 482
            + FG+DLLE ++SPSR N CS HNSVRD VDVNA+ EKAN GI+GCT SETC +LPGKKT
Sbjct: 361  EGFGIDLLEGQNSPSRENDCSHHNSVRDGVDVNAEAEKANHGIQGCTASETCLILPGKKT 420

Query: 483  KQNKKLTGSSRMNRYGGLGSSQRRTGKENRLTVWQKVQRNNSGECCEQLDQVSPISKHFK 542
            KQNKKL+G+SR NR+GG+GSSQR TGKEN  TVWQKVQ+NNSG CC QLDQVSPISK  K
Sbjct: 421  KQNKKLSGNSRTNRFGGMGSSQRCTGKENSRTVWQKVQKNNSGGCCAQLDQVSPISKQLK 480

Query: 543  GICNPVVGVQMPKVKDKKTGNRKQLKEKFPRRLKRKNTSGQEKIYRPTRNNCGSNTSSMV 602
            GICNP VGVQ PKVKDKKTGNRKQLK+KF +RLK KN+S Q+KIYRP++++ GSNT+SM 
Sbjct: 481  GICNP-VGVQTPKVKDKKTGNRKQLKDKFSKRLKNKNSSEQDKIYRPSKSSSGSNTNSMA 540

Query: 603  YKPPNGRLDIRSVGFDIRRSSGDPRSRFHNDTTDKCTTSESFESTQVCLDGLVSSKLISD 662
            +  PN RL I ++GFD+ +SS   R+ F ND+TDK  TSES ESTQVCLDG +S KLISD
Sbjct: 541  HNRPNERLVIPAMGFDMSKSSSGSRAPFQNDSTDKFMTSESSESTQVCLDGSMSDKLISD 600

Query: 663  GLNSKKVENDSGSSPRSCNSLNQSNLVEVQSPVYLPHLFFQATKGSSLAECSKHNNQSRS 722
            GLN+++VEN+S +S  SC+S+NQSN ++ QSPVY+PHLFFQATKGSSLAE SKH+NQSRS
Sbjct: 601  GLNNQRVENESSTSLGSCSSVNQSNPLKAQSPVYVPHLFFQATKGSSLAERSKHSNQSRS 660

Query: 723  PLHNWLPSGAEGSRLAT-LARPDFSSLKDASTRPTEFGTSEKSIQERVNCNIVDPVSVVT 782
            PL NW+PS AEGSRL T LARPDFSSLKDA+ +P EFG SEKSIQE VNCN++DPVS V 
Sbjct: 661  PLQNWVPSVAEGSRLTTALARPDFSSLKDANKQPAEFGISEKSIQESVNCNLLDPVSNVI 720

Query: 783  EGIQHSRDGNHGPLEHECEVPKVYGYNTAALQDHRCEFDVDEHFNSKSSCEDASRMEQAV 842
            E IQHSRDGNH PLE ECE  + +G++T ALQDHRCE DVDEHFN K++C DA+R+EQ V
Sbjct: 721  EAIQHSRDGNHDPLEKECEAQESHGHDTNALQDHRCELDVDEHFNCKATCGDATRIEQVV 780

Query: 843  NNACRAQLVSEAIQMETGSPIAEFERFLQLSSPVINQRPKLRSSEIYPRNPPGDVIPCSN 902
            N+AC+AQL  +A+       IAEFERFL LSSPVI+QRP LRS EI  +N  GDVIPCS+
Sbjct: 781  NSACKAQLAFDAVHQ-----IAEFERFLHLSSPVISQRPNLRSCEICSKNSLGDVIPCSH 840

Query: 903  ETADISLGCLWQWYEKHGNYGLEIKANGHENSNGFGADNSAFCAYFVPFLSAVQLFKSHK 962
            ETA+ISLGCLWQWYEKHG+YGLE+KANGHE SNGFGADNS F AYFVPFLSAVQLFKSHK
Sbjct: 841  ETANISLGCLWQWYEKHGSYGLEVKANGHEGSNGFGADNSEFHAYFVPFLSAVQLFKSHK 900

Query: 963  THA-PTTGPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVLRVCNQLHGSEQHL 1022
            TH+  TT PVG DS VSDIK  EP T  LPIFSVLFPKPCTD+A+VL+ C+QLH SE+ L
Sbjct: 901  THSGATTCPVGLDSRVSDIKANEPPTSQLPIFSVLFPKPCTDNANVLQACSQLHSSEESL 960

Query: 1023 GSERSKSSEQSVNLKSSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRPQGKIYGDP 1082
             SE+   SEQSV+   SGESELIFEYFE EQPQQRRPLFDKI QLV+GDG  +GKIYGDP
Sbjct: 961  ASEKRNFSEQSVDSNLSGESELIFEYFEEEQPQQRRPLFDKIRQLVKGDGCLRGKIYGDP 1020

Query: 1083 TMLNSITLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFVSRTSQPNSPDTNSC 1142
            T+L SITLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFV RTSQ +S +T+SC
Sbjct: 1021 TVLESITLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFVCRTSQSSSSETDSC 1080

Query: 1143 LVCPVVGLQSYNAQNECWFEPRNSTPTLTPGLSPPRILEERLRTLEETASLMARAVVKKG 1202
            +VCPVVGLQS+NAQNECWF+PR ST T     +PP +++ERLRTLEETASL+ARAVVKKG
Sbjct: 1081 IVCPVVGLQSHNAQNECWFKPRISTST----FNPPGVVDERLRTLEETASLLARAVVKKG 1140

Query: 1203 NLNSENTHPDYEFFLSRR 1218
            NLNS N HPDYEFFLSRR
Sbjct: 1141 NLNSRNRHPDYEFFLSRR 1141

BLAST of Lsi02G002760.1 vs. NCBI nr
Match: XP_038894653.1 (uncharacterized protein LOC120083142 isoform X1 [Benincasa hispida] >XP_038894654.1 uncharacterized protein LOC120083142 isoform X1 [Benincasa hispida] >XP_038894655.1 uncharacterized protein LOC120083142 isoform X1 [Benincasa hispida])

HSP 1 Score: 1954.9 bits (5063), Expect = 0.0e+00
Identity = 1005/1157 (86.86%), Postives = 1055/1157 (91.18%), Query Frame = 0

Query: 63   MQCALVRSSNFQKVLDKGKESLELRLEENSCSRGIKDSKVSSFAWRNFFDYRCAVISFLT 122
            MQCA + SS+FQKVLDK KESLELRLEEN CSRGIKDSKVSSFAWRNFF YRCAVISFLT
Sbjct: 1    MQCAPL-SSDFQKVLDKRKESLELRLEENGCSRGIKDSKVSSFAWRNFFYYRCAVISFLT 60

Query: 123  VESDGLWRIVALPPQYLDSLDVSCLPQMNQSTAERKLVQKGPASNGTYSFNSFRCRSLLE 182
            VESDGLWRIVALP QYLDS+DVSCLPQMNQ TAERKLVQ+GPAS GTYSFNSFRCRSLLE
Sbjct: 61   VESDGLWRIVALPLQYLDSVDVSCLPQMNQFTAERKLVQEGPASTGTYSFNSFRCRSLLE 120

Query: 183  SNNKLFDSKAIKSSNKSSGKFSCRSSCSGSALMSSDSSAISDIPVGGAKMQRYGKKNPRK 242
            SN KL DSKAIKSS+KSSGKFSC SSCS SALMSSDSSAISDIP G AKMQRYGKKNPRK
Sbjct: 121  SNKKLLDSKAIKSSDKSSGKFSCTSSCSSSALMSSDSSAISDIPNGRAKMQRYGKKNPRK 180

Query: 243  KAKKKEIECKKISSDFVSAETEVSSKDSAHGSFLSEACGNNDSDCRDGSVLCSIAQGTFL 302
            KAKKKEIE KKISS+FVSAETEVSSKDSA GSFLS+ACG+NDSDC D SVLCSIAQ  FL
Sbjct: 181  KAKKKEIESKKISSEFVSAETEVSSKDSACGSFLSKACGSNDSDCSDRSVLCSIAQEIFL 240

Query: 303  PDFRANKNDFKRDSERIIQPLGTTDSISSNIVDGNASEVSSSASKNFSGYYKVCGSKNQA 362
            PDFRA+KN F+RDSERIIQPLGT DSIS  IVD NASEVSSSA KN+S YYKVCGS+NQA
Sbjct: 241  PDFRASKNGFERDSERIIQPLGTADSISFEIVDENASEVSSSAIKNYSEYYKVCGSRNQA 300

Query: 363  LIKVPGCTHVNGGVNSRERLFAGSYNDFCSKDSLDNNSPDSNCFSSNGNSDNFNLKLDEK 422
            LIKVPGC HV+GGVNSRERLFA S  DFC KDSLDNNSPDS C S N N+DNFNLKL EK
Sbjct: 301  LIKVPGCAHVDGGVNSRERLFADSCKDFCFKDSLDNNSPDSKCVSLNSNTDNFNLKLKEK 360

Query: 423  KCFGVDLLEERSSPSRVNYCSHNSVRDEVDVNAKVEKANRGIRGCTVSETCSVLPGKKTK 482
            K FGVDLL+ERSSPS+ NYC  N+VRD VDVNA+VE+AN GIR  TVSET SVLPGKKTK
Sbjct: 361  KGFGVDLLKERSSPSKENYCFRNTVRD-VDVNAEVERANHGIRESTVSETRSVLPGKKTK 420

Query: 483  QNKKLTGSSRMNRYGGLGSSQRRTGKENRLTVWQKVQRNNSGECCEQLDQVSPISKHFKG 542
            QNKKL GS+RMNRYGGL SSQRRTGKENR TVWQKVQRNNSG CCEQLDQVSPISK FKG
Sbjct: 421  QNKKLAGSTRMNRYGGLVSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPISKQFKG 480

Query: 543  ICNPVVGVQMPKVKDKKTGNRKQLKEKFPRRLKRKNTSGQEKIYRPTRNNCGSNTSSMVY 602
            ICNP VGVQMPKVKDK+TGNRKQLKEKFPRRLKRKNTSGQEKIY PTRN+CGSNTSSMV+
Sbjct: 481  ICNPPVGVQMPKVKDKRTGNRKQLKEKFPRRLKRKNTSGQEKIYHPTRNSCGSNTSSMVH 540

Query: 603  KPPNGRLDIRSVGFDIRRSSGDPRSRFHNDTTDKCTTSESFESTQVCLDGLVSSKLISDG 662
            K PN  LDIRS+GFDIRRSS DPRSRF NDTTDKCTTSESFESTQVCL GL+S+KLIS+G
Sbjct: 541  KSPNKSLDIRSMGFDIRRSSDDPRSRFQNDTTDKCTTSESFESTQVCLGGLLSNKLISNG 600

Query: 663  LNSKKVENDSGSSPRSCNSLNQSNLVEVQSPVYLPHLFFQATKGSSLAECSKHNNQSRSP 722
            LNS+KVENDS SSPRSC+SLNQSN VEVQSPVYLPHLFFQATKGSSLAE S HNNQ R P
Sbjct: 601  LNSQKVENDSSSSPRSCDSLNQSNSVEVQSPVYLPHLFFQATKGSSLAERSNHNNQPRLP 660

Query: 723  LHNWLPSGAEGSRLATLARPDFSSLKDASTRPTEFGTSEKSIQERVNCNIVDPVSVVTEG 782
            L NWLPSGAEG  L TLARPDFSS+KDAS +P   GTSEKSIQERVNCN+++PVSVV EG
Sbjct: 661  LQNWLPSGAEG--LTTLARPDFSSMKDASMQPV--GTSEKSIQERVNCNLLNPVSVVIEG 720

Query: 783  IQHSRDGNHGPLEHECEVPKVYGYNTAALQDHRCEFDVDEHFNSKSSCEDASRMEQAVNN 842
            IQHSRDGNHGPLEHECEV K++GY+T  LQDH+ EFDVDEHF+ KSS EDASRMEQAVNN
Sbjct: 721  IQHSRDGNHGPLEHECEVQKMHGYDTTTLQDHKYEFDVDEHFSCKSSREDASRMEQAVNN 780

Query: 843  ACRAQLVSEAIQMETGSPIAEFERFLQLSSPVINQRPKLRSSEIYPRNPPGDVIPCSNET 902
            ACRAQLVSEAIQ+ETGSPIAEFERFL LSSPVINQRPKLR+SEI PRN PGDV+PCSNET
Sbjct: 781  ACRAQLVSEAIQIETGSPIAEFERFLHLSSPVINQRPKLRTSEISPRNLPGDVMPCSNET 840

Query: 903  ADISLGCLWQWYEKHGNYGLEIKANGHENSNGFGADNSAFCAYFVPFLSAVQLFKSHKTH 962
             +ISLGCLWQWYEKHG+YGLEIKANGHENSNGFGADNSAF AYFVPFLSA+QLFKS KTH
Sbjct: 841  DNISLGCLWQWYEKHGSYGLEIKANGHENSNGFGADNSAFRAYFVPFLSAIQLFKSQKTH 900

Query: 963  -APTTGPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVLRVCNQLHGSEQHLGS 1022
               TTGPVGFDSCV+DIKVKEPSTC LPIFSVLFPKPCTDDASVLRVC+Q H SEQHL S
Sbjct: 901  VGTTTGPVGFDSCVNDIKVKEPSTCRLPIFSVLFPKPCTDDASVLRVCDQFHSSEQHLAS 960

Query: 1023 ERSKSSEQSVNLKSSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRPQGKIYGDPTM 1082
            E+ K SEQSVN+K SGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDG PQGKIYGDPTM
Sbjct: 961  EKRKCSEQSVNIKLSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGCPQGKIYGDPTM 1020

Query: 1083 LNSITLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFVSRTSQPNSPDTNSCLV 1142
            LNSITLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFVSRT Q NSPDTNSCLV
Sbjct: 1021 LNSITLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFVSRTPQSNSPDTNSCLV 1080

Query: 1143 CPVVGLQSYNAQNECWFEPRNSTPTLTPGLSPPRILEERLRTLEETASLMARAVVKKGNL 1202
            CPVVGLQSYNAQNECWFEPRN  PT TPGL+PPRILEERLRTLEETASLMARAVVKKGNL
Sbjct: 1081 CPVVGLQSYNAQNECWFEPRNGKPTFTPGLNPPRILEERLRTLEETASLMARAVVKKGNL 1140

Query: 1203 NSENTHPDYEFFLSRRL 1219
            NSENTHPDYEFFLSRRL
Sbjct: 1141 NSENTHPDYEFFLSRRL 1151

BLAST of Lsi02G002760.1 vs. NCBI nr
Match: XP_038894656.1 (uncharacterized protein LOC120083142 isoform X2 [Benincasa hispida])

HSP 1 Score: 1864.4 bits (4828), Expect = 0.0e+00
Identity = 971/1157 (83.92%), Postives = 1021/1157 (88.25%), Query Frame = 0

Query: 63   MQCALVRSSNFQKVLDKGKESLELRLEENSCSRGIKDSKVSSFAWRNFFDYRCAVISFLT 122
            MQCA + SS+FQKVLDK KESLELRLEEN CSRGIKDSKVSSFAWRNFF YRCAVISFLT
Sbjct: 1    MQCAPL-SSDFQKVLDKRKESLELRLEENGCSRGIKDSKVSSFAWRNFFYYRCAVISFLT 60

Query: 123  VESDGLWRIVALPPQYLDSLDVSCLPQMNQSTAERKLVQKGPASNGTYSFNSFRCRSLLE 182
            VESDGLWRIVALP QYLDS+DVSCLPQMNQ TAERKLVQ+GPAS GTYSFNSFRCRSLLE
Sbjct: 61   VESDGLWRIVALPLQYLDSVDVSCLPQMNQFTAERKLVQEGPASTGTYSFNSFRCRSLLE 120

Query: 183  SNNKLFDSKAIKSSNKSSGKFSCRSSCSGSALMSSDSSAISDIPVGGAKMQRYGKKNPRK 242
            SN KL DSKAIKSS+KSSGKFSC SSCS SALMSSDSSAISDIP G AKMQRYGKKNPRK
Sbjct: 121  SNKKLLDSKAIKSSDKSSGKFSCTSSCSSSALMSSDSSAISDIPNGRAKMQRYGKKNPRK 180

Query: 243  KAKKKEIECKKISSDFVSAETEVSSKDSAHGSFLSEACGNNDSDCRDGSVLCSIAQGTFL 302
            KAKKKEIE KKISS+FVSAETEVSSKDSA GSFLS+ACG+NDSDC D SVLCSIAQ  FL
Sbjct: 181  KAKKKEIESKKISSEFVSAETEVSSKDSACGSFLSKACGSNDSDCSDRSVLCSIAQEIFL 240

Query: 303  PDFRANKNDFKRDSERIIQPLGTTDSISSNIVDGNASEVSSSASKNFSGYYKVCGSKNQA 362
            PDFRA+KN F+RDSERIIQPLGT DSIS  IVD NASEVSSSA KN+S YYKVCGS+NQA
Sbjct: 241  PDFRASKNGFERDSERIIQPLGTADSISFEIVDENASEVSSSAIKNYSEYYKVCGSRNQA 300

Query: 363  LIKVPGCTHVNGGVNSRERLFAGSYNDFCSKDSLDNNSPDSNCFSSNGNSDNFNLKLDEK 422
            LIKVPGC HV+GGVNSRERLFA S  DFC KDSLDNNSPDS C S N N+DNFNLKL EK
Sbjct: 301  LIKVPGCAHVDGGVNSRERLFADSCKDFCFKDSLDNNSPDSKCVSLNSNTDNFNLKLKEK 360

Query: 423  KCFGVDLLEERSSPSRVNYCSHNSVRDEVDVNAKVEKANRGIRGCTVSETCSVLPGKKTK 482
            K FGVDLL+ERSSPS+ NYC  N+VRD VDVNA+VE+AN GIR  TVSET SVLPGKKTK
Sbjct: 361  KGFGVDLLKERSSPSKENYCFRNTVRD-VDVNAEVERANHGIRESTVSETRSVLPGKKTK 420

Query: 483  QNKKLTGSSRMNRYGGLGSSQRRTGKENRLTVWQKVQRNNSGECCEQLDQVSPISKHFKG 542
            QNKKL GS+RMNRYGGL SSQRRTGKENR TVWQKVQRNNSG CCEQLDQVSPISK FKG
Sbjct: 421  QNKKLAGSTRMNRYGGLVSSQRRTGKENRHTVWQKVQRNNSGGCCEQLDQVSPISKQFKG 480

Query: 543  ICNPVVGVQMPKVKDKKTGNRKQLKEKFPRRLKRKNTSGQEKIYRPTRNNCGSNTSSMVY 602
            ICNP VGVQMPKVKDK+TGNRKQLKEKFPRRLKRKNTSGQEKIY PTRN+CGSNTSSMV+
Sbjct: 481  ICNPPVGVQMPKVKDKRTGNRKQLKEKFPRRLKRKNTSGQEKIYHPTRNSCGSNTSSMVH 540

Query: 603  KPPNGRLDIRSVGFDIRRSSGDPRSRFHNDTTDKCTTSESFESTQVCLDGLVSSKLISDG 662
            K PN  LDIRS+GFDIRRSS DPRSRF NDTTDKCTTSESFESTQVCL GL+S+KLIS+G
Sbjct: 541  KSPNKSLDIRSMGFDIRRSSDDPRSRFQNDTTDKCTTSESFESTQVCLGGLLSNKLISNG 600

Query: 663  LNSKKVENDSGSSPRSCNSLNQSNLVEVQSPVYLPHLFFQATKGSSLAECSKHNNQSRSP 722
            LNS+KVENDS SSPRSC+SLNQSN VEVQSPVYLPHLFFQATKGSSLAE S HNNQ R P
Sbjct: 601  LNSQKVENDSSSSPRSCDSLNQSNSVEVQSPVYLPHLFFQATKGSSLAERSNHNNQPRLP 660

Query: 723  LHNWLPSGAEGSRLATLARPDFSSLKDASTRPTEFGTSEKSIQERVNCNIVDPVSVVTEG 782
            L NWLPSGAEG  L TLARPDFSS+KDAS +P   GTSEKSIQERVNCN+++PVSVV EG
Sbjct: 661  LQNWLPSGAEG--LTTLARPDFSSMKDASMQPV--GTSEKSIQERVNCNLLNPVSVVIEG 720

Query: 783  IQHSRDGNHGPLEHECEVPKVYGYNTAALQDHRCEFDVDEHFNSKSSCEDASRMEQAVNN 842
            IQHSRDGNHGPLEHECEV K++GY+T  LQDH+ EFDVDEHF+ KSS EDASRMEQAVNN
Sbjct: 721  IQHSRDGNHGPLEHECEVQKMHGYDTTTLQDHKYEFDVDEHFSCKSSREDASRMEQAVNN 780

Query: 843  ACRAQLVSEAIQMETGSPIAEFERFLQLSSPVINQRPKLRSSEIYPRNPPGDVIPCSNET 902
            ACRAQLVSEAIQ+ETGSPIAEFERFL LSSPVINQRPKLR+SEI PRN PGDV+PCSNET
Sbjct: 781  ACRAQLVSEAIQIETGSPIAEFERFLHLSSPVINQRPKLRTSEISPRNLPGDVMPCSNET 840

Query: 903  ADISLGCLWQWYEKHGNYGLEIKANGHENSNGFGADNSAFCAYFVPFLSAVQLFKSHKTH 962
             +ISLGCLWQWYEKHG+YGLEIKANGHENSNGFGADNSAF AYFVPFLSA+QLFKS KTH
Sbjct: 841  DNISLGCLWQWYEKHGSYGLEIKANGHENSNGFGADNSAFRAYFVPFLSAIQLFKSQKTH 900

Query: 963  -APTTGPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVLRVCNQLHGSEQHLGS 1022
               TTGPVGFDSCV+DIKVKEPSTC LPIFSVLFPKPCTDDASVLRVC+Q H SEQHL S
Sbjct: 901  VGTTTGPVGFDSCVNDIKVKEPSTCRLPIFSVLFPKPCTDDASVLRVCDQFHSSEQHLAS 960

Query: 1023 ERSKSSEQSVNLKSSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRPQGKIYGDPTM 1082
            E+ K SEQSVN+K SGESELIFEYFEGEQPQQRRPLFDK                     
Sbjct: 961  EKRKCSEQSVNIKLSGESELIFEYFEGEQPQQRRPLFDK--------------------- 1020

Query: 1083 LNSITLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFVSRTSQPNSPDTNSCLV 1142
                          YSVAWYPIYRIPDGNLRAAFLTYHSLGHFVSRT Q NSPDTNSCLV
Sbjct: 1021 --------------YSVAWYPIYRIPDGNLRAAFLTYHSLGHFVSRTPQSNSPDTNSCLV 1080

Query: 1143 CPVVGLQSYNAQNECWFEPRNSTPTLTPGLSPPRILEERLRTLEETASLMARAVVKKGNL 1202
            CPVVGLQSYNAQNECWFEPRN  PT TPGL+PPRILEERLRTLEETASLMARAVVKKGNL
Sbjct: 1081 CPVVGLQSYNAQNECWFEPRNGKPTFTPGLNPPRILEERLRTLEETASLMARAVVKKGNL 1116

Query: 1203 NSENTHPDYEFFLSRRL 1219
            NSENTHPDYEFFLSRRL
Sbjct: 1141 NSENTHPDYEFFLSRRL 1116

BLAST of Lsi02G002760.1 vs. NCBI nr
Match: XP_004137638.2 (uncharacterized protein LOC101212209 [Cucumis sativus] >KGN64214.1 hypothetical protein Csa_014277 [Cucumis sativus])

HSP 1 Score: 1810.8 bits (4689), Expect = 0.0e+00
Identity = 953/1195 (79.75%), Postives = 1021/1195 (85.44%), Query Frame = 0

Query: 63   MQCALVRSSNFQKVLDKGKESLELRLEENSCSRGIK-DSKVSSFAWRNFFDYRCAVISFL 122
            MQC LV SS+FQKVLDKGKESLELRLE+NSCSRGI  DSKVSSFAWRNFFDYR A+IS L
Sbjct: 1    MQCTLV-SSDFQKVLDKGKESLELRLEKNSCSRGISTDSKVSSFAWRNFFDYRRAIISCL 60

Query: 123  TVESDGLWRIVALPPQYLDSLDVSCLPQMNQSTAERKLVQKGPASNGTYSFNSFRCRSLL 182
            T+ESDGLWRIVALPPQYLDSL++SCLPQMNQ TA RKLVQKGPASNGTYSFNS RCRSLL
Sbjct: 61   TLESDGLWRIVALPPQYLDSLNLSCLPQMNQFTAGRKLVQKGPASNGTYSFNSLRCRSLL 120

Query: 183  ESNNKLFDSKAIKSSNKSSGKFSCRSSCSGSALMSSDSSAISDIPVGGAKMQRYGKKNPR 242
            ESN KL DSKAIKS  +SSGKF C SSCSGSALMSSDS AISDIPV GAKMQRYGKKNPR
Sbjct: 121  ESNKKLLDSKAIKSPKQSSGKFPCTSSCSGSALMSSDSIAISDIPVDGAKMQRYGKKNPR 180

Query: 243  KKAKKKEIECKKISSDFVSAETEVSSKDSAHGSFLSEACGNNDSDCRDGSVLCSIAQGTF 302
            KKAKKKEIECK ISSDFVSAETEVS +DSA  SFLSEACG+NDSD RD SVLCSIAQ TF
Sbjct: 181  KKAKKKEIECKNISSDFVSAETEVSLQDSARASFLSEACGSNDSDFRDRSVLCSIAQETF 240

Query: 303  LPDFRANKNDFKRDSERIIQPLGTTDSISSNIVDGNASEVSSSASKNFSGYYKVCGSKNQ 362
            LPDF         + + +IQPLGT DS+SS IVDG++S+VSS A KNFSGYYKVCGS+NQ
Sbjct: 241  LPDF---------EQDSVIQPLGTVDSVSSEIVDGHSSKVSSLAIKNFSGYYKVCGSENQ 300

Query: 363  ALIKVPGCTHVNGGVNSRERLFAGSYNDFCSKDSLDNNSPDSNCFSSNGNSDNFNLKLDE 422
            ALI VPGC HV+ G+NSRER  AGS NDFCSKD LDN S DS   S NGN D+ NLKL+E
Sbjct: 301  ALINVPGCIHVDVGLNSRERFIAGSCNDFCSKDYLDNISRDSKWVSLNGNCDDLNLKLNE 360

Query: 423  KKCFGVDLLEERSSPSRVNYCSHNSVRDEVDVNAKVEKANRGIRGCTVSETCSVLPGKKT 482
            K+ FGVDLLEERSSPS+      NS RDEVD+NA+VEKAN GIRGCTVSETCSVLPGKKT
Sbjct: 361  KQGFGVDLLEERSSPSQ------NSARDEVDLNAEVEKANLGIRGCTVSETCSVLPGKKT 420

Query: 483  KQNKKLTGSSRMNRYGGLGSSQRRTGKENRLTVWQKVQRNNSGECCEQLDQVSPISKHFK 542
            KQNKKLTGSSRMNRYGGLGSSQRRTGKENR TVWQKVQR++SG C EQLDQVSPISK FK
Sbjct: 421  KQNKKLTGSSRMNRYGGLGSSQRRTGKENRHTVWQKVQRSSSGGCSEQLDQVSPISKQFK 480

Query: 543  GICNPVVGVQMPKVKDKKTGNRKQLKEKFPRRLKRKNTSGQEKIYRPTRNNCGSNTSSMV 602
            GICNPVVGVQMPKVKDKKTGN+KQLKEK PRRLKRKNTSGQEKIYRPTRN+CGSNTSSMV
Sbjct: 481  GICNPVVGVQMPKVKDKKTGNKKQLKEKCPRRLKRKNTSGQEKIYRPTRNSCGSNTSSMV 540

Query: 603  YKPPNGRLDIRSVGFDIRRSSGDPRSRFHNDTTDKCTTSESFESTQVCLDGLVSSKLISD 662
            +KPPN +LD+RS+GFDIRRSSGDPRS F ND+TDKCT SES ES QV LD L+S+KLI+D
Sbjct: 541  HKPPNEKLDVRSMGFDIRRSSGDPRSCFQNDSTDKCTNSESVESKQVHLDELISNKLIND 600

Query: 663  GLNSKKVENDSGSSPRSCNSLNQSNLVEVQSP---------------------------- 722
            GL+S+KVENDS S P+SCNS NQSN VEV+SP                            
Sbjct: 601  GLSSQKVENDSSSLPKSCNSSNQSNPVEVKSPVYLPHLFFQKVGNDSSSLPKSCNSLNQS 660

Query: 723  --------VYLPHLFFQATKGSSLAECSKHNNQSRSPLHNWLPSGAEGSRLATLARPDFS 782
                    VYLPHLFFQATKGSSL E SKH+ QSRSPL NWLPSGAEGSR  TLARPDFS
Sbjct: 661  NPVEVKSSVYLPHLFFQATKGSSLDERSKHDTQSRSPLQNWLPSGAEGSRSITLARPDFS 720

Query: 783  SLKDASTRPTEFGTSEKSIQERVNCNIVDPVSVVTEGIQHSRDGNHGPLEHECEVPKVYG 842
            SL+DA+T+P EFGT EKSI+ERVNCN+++PVS V EGIQH RD + GPLEHEC V K+YG
Sbjct: 721  SLRDANTQPAEFGTLEKSIKERVNCNVLNPVSDVIEGIQHYRDRDDGPLEHECGVQKMYG 780

Query: 843  YNTAALQDHRCEFDVDEHFNSKSSCEDASRMEQAVNNACRAQLVSEAIQMETGSPIAEFE 902
            Y+T  LQDH+ EFDVDEHFN KSSCED SRMEQAVNNACRAQL SEAIQMETG PIAEFE
Sbjct: 781  YDTTTLQDHKSEFDVDEHFNCKSSCEDVSRMEQAVNNACRAQLASEAIQMETGCPIAEFE 840

Query: 903  RFLQLSSPVINQRPKLRSSEIYPRNPPGDVIPCSNETADISLGCLWQWYEKHGNYGLEIK 962
            RFL LSSPVI+QRP   SS+I PRN PGDVIPCSNET +ISLGCLWQWYEKHG+YGLEIK
Sbjct: 841  RFLHLSSPVIDQRPN-SSSDICPRNLPGDVIPCSNETTNISLGCLWQWYEKHGSYGLEIK 900

Query: 963  ANGHENSNGFGADNSAFCAYFVPFLSAVQLFKSHKTHAPT-TGPVGFDSCVSDIKVKEPS 1022
            A G ENSNGFGA NSAF AYFVPFLSAVQLFKS KTH  T TGP+GF+SCVSDIKVKEPS
Sbjct: 901  AKGQENSNGFGAVNSAFRAYFVPFLSAVQLFKSRKTHVGTATGPLGFNSCVSDIKVKEPS 960

Query: 1023 TCHLPIFSVLFPKPCTDDASVLRVCNQLHGSEQHLGSERSKSSEQSVNLKSSGESELIFE 1082
            TCHLPIFS+LFPKPCTDD SVLRVCNQ H SEQHL SE+ KSSEQS +L+ SGESELIFE
Sbjct: 961  TCHLPIFSLLFPKPCTDDTSVLRVCNQFHSSEQHLASEKKKSSEQSASLQLSGESELIFE 1020

Query: 1083 YFEGEQPQQRRPLFDKIHQLVEGDGRPQGKIYGDPTMLNSITLNDLHAGSWYSVAWYPIY 1142
            YFEGEQPQ RRPLFDKIHQLVEGDG  QGKIYGDPT+LNSITL+DLHAGSWYSVAWYPIY
Sbjct: 1021 YFEGEQPQLRRPLFDKIHQLVEGDGL-QGKIYGDPTVLNSITLDDLHAGSWYSVAWYPIY 1080

Query: 1143 RIPDGNLRAAFLTYHSLGHFVSRTSQPNSPDTNSCLVCPVVGLQSYNAQNECWFEPRNS- 1202
            RIPDGNLRAAFLTYHSLGHFVSRTSQ    DTNSCLVCPVVGLQSYNAQNECWFEPR+S 
Sbjct: 1081 RIPDGNLRAAFLTYHSLGHFVSRTSQ----DTNSCLVCPVVGLQSYNAQNECWFEPRDST 1140

Query: 1203 -TPTLTPGLSPPRILEERLRTLEETASLMARAVVKKGNLNSENTHPDYEFFLSRR 1218
             T T T  L+PPRIL+ERLRTLEETASLMARAVVKKGNLNS NTHPDYEFFLSRR
Sbjct: 1141 RTSTFTSNLNPPRILQERLRTLEETASLMARAVVKKGNLNSGNTHPDYEFFLSRR 1173

BLAST of Lsi02G002760.1 vs. NCBI nr
Match: TYJ99070.1 (uncharacterized protein E5676_scaffold248G002740 [Cucumis melo var. makuwa])

HSP 1 Score: 1802.3 bits (4667), Expect = 0.0e+00
Identity = 944/1193 (79.13%), Postives = 1014/1193 (85.00%), Query Frame = 0

Query: 63   MQCALVRSSNFQKVLDKGKESLELRLEENSCSRGI-KDSKVSSFAWRNFFDYRCAVISFL 122
            MQCALVRSS+FQKVLDKGKESL+LRLE+NSCSRGI KD +VSSFAWRNFFDYRCAVI FL
Sbjct: 1    MQCALVRSSDFQKVLDKGKESLDLRLEKNSCSRGISKDFEVSSFAWRNFFDYRCAVIRFL 60

Query: 123  TVESDGLWRIVALPPQYLDSLDVSCLPQMNQSTAERKLVQKGPASNGTYSFNSFRCRSLL 182
            T+ESDGLWRIVALPPQYLDSL+VSCLPQMNQ TA RKLVQKG ASNGTYSFNS RCRSLL
Sbjct: 61   TLESDGLWRIVALPPQYLDSLNVSCLPQMNQFTAGRKLVQKGSASNGTYSFNSLRCRSLL 120

Query: 183  ESNNKLFDSKAIKSSNKSSGKFSCRSSCSGSALMSSDSSAISDIPVGGAKMQRYGKKNPR 242
            ESN KL DSKAIKS NKSSGK  C SSCS SALMSSDS A SDIP+ GAKMQRYGKKNPR
Sbjct: 121  ESNKKLLDSKAIKSPNKSSGKLLCTSSCSASALMSSDSIATSDIPIDGAKMQRYGKKNPR 180

Query: 243  KKAKKKEIECKKISSDFVSAETEVSSKDSAHGSFLSEACGNNDSDCRDGSVLCSIAQGTF 302
            KKAKKKE+E KKISS+FVSAETEVS +DSA  SFLSEACG+NDSD R+ +VLCSIA  TF
Sbjct: 181  KKAKKKELEYKKISSEFVSAETEVSLQDSARASFLSEACGSNDSDFRNRTVLCSIAPETF 240

Query: 303  LPDFRANKNDFKRDSERIIQPLGTTDSISSNIVDGNASEVSSSASKNFSGYYKVCGSKNQ 362
            LP       DF+RDSE  IQPLGT DS+SS IVDG++S+VSSSA KNFSGY+KVCGS+NQ
Sbjct: 241  LP-------DFERDSE--IQPLGTVDSVSSEIVDGHSSKVSSSAIKNFSGYHKVCGSENQ 300

Query: 363  ALIKVPGCTHVNGGVNSRERLFAGSYNDFCSKDSLDNNSPDSNCFSSNGNSDNFNLKLDE 422
            AL   PGC HV+ G+NSRE L AGS NDFCS DSLDNNS DS   S N N D+ NLKL+E
Sbjct: 301  ALTNAPGCFHVDVGLNSRESLLAGSCNDFCSTDSLDNNSCDSKWVSLNSNCDDLNLKLNE 360

Query: 423  KKCFGVDLLEERSSPSRVNYCSHNSVRDEVDVNAKVEKANRGIRGCTVSETCSVLPGKKT 482
            KK FGVDLLEERSSP R N CS NS RDEVD+N +VEK   GI+GCTVSETCSVLPGKKT
Sbjct: 361  KKGFGVDLLEERSSPYREN-CSQNSARDEVDLNTEVEK---GIQGCTVSETCSVLPGKKT 420

Query: 483  KQNKKLTGSSRMNRYGGLGSSQRRTGKENRLTVWQKVQRNNSGECCEQLDQVSPISKHFK 542
            KQNKKLTGSSRMNRYGGLGSSQRRTGKENR TVWQKVQR+NSG C EQLDQVSPISK FK
Sbjct: 421  KQNKKLTGSSRMNRYGGLGSSQRRTGKENRHTVWQKVQRSNSGGCSEQLDQVSPISKQFK 480

Query: 543  GICNPVVGVQMPKVKDKKTGNRKQLKEKFPRRLKRKNTSGQEKIYRPTRNNCGSNTSSMV 602
            GICNPV GVQMPKVKDKKTGNRKQLKEK  RRLKRKNTSGQEKIYRPTRN+CGSNTSSMV
Sbjct: 481  GICNPVAGVQMPKVKDKKTGNRKQLKEKCSRRLKRKNTSGQEKIYRPTRNSCGSNTSSMV 540

Query: 603  YKPPNGRLDIRSVGFDIRRSSGDPRSRFHNDTTDKCTTSESFESTQVCLDGLVSSKLISD 662
            +KPPN RLDIRS+GFDIRRSSG+PRSRF NDTTDKC  SE+ E  QV  D L S+KLI D
Sbjct: 541  HKPPNERLDIRSMGFDIRRSSGNPRSRFQNDTTDKCMNSEAVEGKQVHPDELFSNKLIYD 600

Query: 663  GLNSKKVENDSGSSPRSCNSLNQ------------------------------------S 722
            GL+S+KVENDS S P+SCNS NQ                                    S
Sbjct: 601  GLSSQKVENDSSSLPKSCNSSNQSNPVEVKSPVYLPHLFFQKVENDSSSLPKSCSSSNLS 660

Query: 723  NLVEVQSPVYLPHLFFQATKGSSLAECSKHNNQSRSPLHNWLPSGAEGSRLATLARPDFS 782
            N VEV+SPVYLPHLFFQATKGSSLAE SKH  QSRSPL NWLPSGAEGSR  TLARPDFS
Sbjct: 661  NTVEVKSPVYLPHLFFQATKGSSLAERSKHETQSRSPLQNWLPSGAEGSRSTTLARPDFS 720

Query: 783  SLKDASTRPTEFGTSEKSIQERVNCNIVDPVSVVTEGIQHSRDGNHGPLEHECEVPKVYG 842
            SL+DA+T+P EFGTSEKSI+ERVNC++++PVS V EGIQH RD +HG LEHECEV K+YG
Sbjct: 721  SLRDANTQPAEFGTSEKSIKERVNCSLLNPVSDVLEGIQHYRDRDHGSLEHECEVQKIYG 780

Query: 843  YNTAALQDHRCEFDVDEHFNSKSSCEDASRMEQAVNNACRAQLVSEAIQMETGSPIAEFE 902
            ++T  LQ+ +CEF+VDEHFN KSSCED SRMEQAVNNAC+AQL SEAIQMETG PIAEFE
Sbjct: 781  FDTTTLQNQKCEFNVDEHFNCKSSCEDVSRMEQAVNNACKAQLASEAIQMETGCPIAEFE 840

Query: 903  RFLQLSSPVINQRPKLRSSEIYPRNPPGDVIPCSNETADISLGCLWQWYEKHGNYGLEIK 962
            RFL LSSPVI+QRPKLRSSEI PRN PGDVIPCSNET +ISL CLWQWYEKHG+YGLEIK
Sbjct: 841  RFLHLSSPVIDQRPKLRSSEICPRNLPGDVIPCSNETTNISLACLWQWYEKHGSYGLEIK 900

Query: 963  ANGHENSNGFGADNSAFCAYFVPFLSAVQLFKSHKTH-APTTGPVGFDSCVSDIKVKEPS 1022
            A  HENSNGFG  NSAF AYFVPFLSA+QLFKS KTH   TTGP+GFDSCVSDIKVKEPS
Sbjct: 901  AKSHENSNGFGVVNSAFRAYFVPFLSAIQLFKSRKTHVGTTTGPLGFDSCVSDIKVKEPS 960

Query: 1023 TCHLPIFSVLFPKPCTDDASVLRVCNQLHGSEQHLGSERSKSSEQSVNLKSSGESELIFE 1082
            TCHLPIFS+LFP+P TDD SVLRVCN+ H SEQ L SE+ KSS+QS +L+ SGESELIFE
Sbjct: 961  TCHLPIFSLLFPEPSTDDTSVLRVCNRFHSSEQDLASEKRKSSKQSASLQLSGESELIFE 1020

Query: 1083 YFEGEQPQQRRPLFDKIHQLVEGDGRPQGKIYGDPTMLNSITLNDLHAGSWYSVAWYPIY 1142
            YFEGEQPQ RRPLFDKIHQLVEGDG  QGKIYGDPTMLNSITL+DLHAGSWYSVAWYPIY
Sbjct: 1021 YFEGEQPQLRRPLFDKIHQLVEGDGCLQGKIYGDPTMLNSITLDDLHAGSWYSVAWYPIY 1080

Query: 1143 RIPDGNLRAAFLTYHSLGHFVSRTSQPNSPDTNSCLVCPVVGLQSYNAQNECWFEPRNST 1202
            RIPDGNLRAAFLTYHSLGHFVSRTSQ    DTNSCLVCPVVGLQSYNAQNECWFEPR ST
Sbjct: 1081 RIPDGNLRAAFLTYHSLGHFVSRTSQ----DTNSCLVCPVVGLQSYNAQNECWFEPREST 1140

Query: 1203 PTLTPGLSPPRILEERLRTLEETASLMARAVVKKGNLNSENTHPDYEFFLSRR 1218
             T T  L+PPR+L+ERLRTLEETASLMARAVVKKGNLNS NTHPDYEFFLSRR
Sbjct: 1141 STFTSDLNPPRVLQERLRTLEETASLMARAVVKKGNLNSGNTHPDYEFFLSRR 1176

BLAST of Lsi02G002760.1 vs. NCBI nr
Match: KAG6572995.1 (DNA-directed RNA polymerases II, IV and V subunit 12, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1731.1 bits (4482), Expect = 0.0e+00
Identity = 899/1225 (73.39%), Postives = 1019/1225 (83.18%), Query Frame = 0

Query: 1    MDPQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRRTFSCLDWGKPFQEK-- 60
            MDPQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRR     +W      +  
Sbjct: 1    MDPQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRRIVQS-EWVSLRLSRRL 60

Query: 61   --LQKTMQCALVRSSNFQKVLDKGKESLELRLEENSCSRGIKDSKVSSFAWRNFFDYRCA 120
              +QKTMQCAL +SS FQKV DKGK+ LE++++E++CSR IKDS+VSSF WRNFFDYR A
Sbjct: 61   KWVQKTMQCALEKSSEFQKVPDKGKQLLEVKIQEDNCSRRIKDSEVSSFEWRNFFDYRSA 120

Query: 121  VISFLTVESDGLWRIVALPPQYLDSLDVSCLPQMNQSTAERKLVQKGPASNGTYSFNSFR 180
            VIS LT+ESDGLWRIVALP Q LDSL VSCLPQMNQ TA+RKLV  GPAS+GTYS NSFR
Sbjct: 121  VISILTLESDGLWRIVALPLQGLDSLHVSCLPQMNQFTADRKLVHNGPASSGTYSVNSFR 180

Query: 181  CRSLLESNNKLFDSKAIKSSNKSSGKFSCRSSCSGSALMSSDSSAISDIPVGGAKMQRYG 240
            CRSLLESN  L DSKA KSSNK+S KFS RSSCS SAL+S DSSAISDIP+G AK+QRYG
Sbjct: 181  CRSLLESNKNLLDSKAFKSSNKASSKFSWRSSCSSSALISGDSSAISDIPIGEAKIQRYG 240

Query: 241  KKNPRKKAKKKEIECKKISSDFVSAETEVSSKDSAHGSFLSEACGNNDSDCRDGSVLCSI 300
            KKN RKKAKK++IECKK SSDFVSAETE+SS+DSA GS L EACGNN SDCRDGSVLCS 
Sbjct: 241  KKNSRKKAKKRDIECKKTSSDFVSAETEISSEDSARGSSLLEACGNNGSDCRDGSVLCST 300

Query: 301  AQGTFLPDFRANKNDFKRDSERIIQPLGTTDSISSNIVDGNASEVSSSASKNFSGYYKVC 360
            A+ TF  D RA+KNDFKRDSERIIQPLGTTDSISS IV+G+ASEV  SA+KN SG Y   
Sbjct: 301  ARETFPSDTRASKNDFKRDSERIIQPLGTTDSISSEIVEGDASEVPPSATKNSSGDYNGY 360

Query: 361  GSKNQALIKVPGCTHVNGGVNSRERLFAGSYNDFCSKDSLDNNSPDSNCFSSNGNSDNFN 420
             S+NQ LIK PGCT  +G V+ +ERLF G  NDFCSKDS DNNSPDSNC       D+  
Sbjct: 361  VSENQPLIKAPGCTRFDGEVDRKERLFNGCCNDFCSKDSFDNNSPDSNC-------DSHT 420

Query: 421  LKLDEKKCFGVDLLEERSSPSRVNYCS-HNSVRDEVDVNAKVEKANRGIRGCTVSETCSV 480
            LKL E + FG+DLLE ++SPSR N CS HNS+RDEVDVNA+ EKAN GI+GCT SET  +
Sbjct: 421  LKLTENEGFGIDLLEGQNSPSRENDCSHHNSIRDEVDVNAEEEKANHGIQGCTASETPLI 480

Query: 481  LPGKKTKQNKKLTGSSRMNRYGGLGSSQRRTGKENRLTVWQKVQRNNSGECCEQLDQVS- 540
            LPGKKTKQNKKL+G+SR NR+GG+GSSQR TGKEN  TVWQKVQ+NNSG CC QLDQVS 
Sbjct: 481  LPGKKTKQNKKLSGNSRTNRFGGMGSSQRCTGKENSRTVWQKVQKNNSGGCCAQLDQVSP 540

Query: 541  PISKHFKGICNPVVGVQMPKVKDKKTGNRKQLKEKFPRRLKRKNTSGQEKIYRPTRNNCG 600
            P+SK  KG+CNP VGVQ PKVKDKKTGNRKQLK+KF +RLK KNTS Q+KIYRP++++ G
Sbjct: 541  PVSKQLKGVCNP-VGVQTPKVKDKKTGNRKQLKDKFSKRLKNKNTSEQDKIYRPSKSSSG 600

Query: 601  SNTSSMVYKPPNGRLDIRSVGFDIRRSSGDPRSRFHNDTTDKCTTSESFESTQVCLDGLV 660
            SNT+SM +  PN RLDI ++GFDI +SS   R+ F ND+TDKC TSES ESTQVCLDG +
Sbjct: 601  SNTNSMAHNRPNERLDIPAMGFDISKSSSGSRAPFQNDSTDKCMTSESSESTQVCLDGSM 660

Query: 661  SSKLISDGLNSKKVENDSGSSPRSCNSLNQSNLVEVQSPVYLPHLFFQATKGSSLAECSK 720
            S KLISDGLN+++VEN+S +S RSC+SLNQSN ++ QSPVY+PHLFFQATKGSSLAE SK
Sbjct: 661  SDKLISDGLNNQRVENESSTSLRSCSSLNQSNPLKAQSPVYVPHLFFQATKGSSLAERSK 720

Query: 721  HNNQSRSPLHNWLPSGAEGSRLAT-LARPDFSSLKDASTRPTEFGTSEKSIQERVNCNIV 780
            H+NQSRSPL NW+PS AEGSRL T L RPDFSSLKDA+ +P EFG SEKSIQE V+CN++
Sbjct: 721  HSNQSRSPLQNWVPSVAEGSRLTTALGRPDFSSLKDANKQPAEFGISEKSIQESVDCNLL 780

Query: 781  DPVSVVTEGIQHSRDGNHGPLEHECEVPKVYGYNTAALQDHRCEFDVDEHFNSKSSCEDA 840
            DPVS V E IQHSRDGNH PLE ECE  + +G++T ALQD RCE DVDEHFN KS+C DA
Sbjct: 781  DPVSNVIEAIQHSRDGNHDPLEKECEAQESHGHDTNALQDRRCELDVDEHFNCKSTCGDA 840

Query: 841  SRMEQAVNNACRAQLVSEAIQMETGSPIAEFERFLQLSSPVINQRPKLRSSEIYPRNPPG 900
            +R+EQ VN+AC+AQL  +A+       IAEFERFL LSSPVI+QRP LRS +I  +N  G
Sbjct: 841  TRIEQVVNSACKAQLPFDAVHQ-----IAEFERFLHLSSPVISQRPNLRSCKICSKNSLG 900

Query: 901  DVIPCSNETADISLGCLWQWYEKHGNYGLEIKANGHENSNGFGADNSAFCAYFVPFLSAV 960
            D IPCS++TA+ISL CLWQWYEKHG+YGLE+KANGHE SNGFGADNS F AYFVPFLSAV
Sbjct: 901  DGIPCSHKTANISLSCLWQWYEKHGSYGLEVKANGHEGSNGFGADNSEFHAYFVPFLSAV 960

Query: 961  QLFKSHKTHA-PTTGPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVLRVCNQL 1020
            QLFKSHKTH+  TT PVG DS VSDIK  EP T  LPIFSVLFPKPCTDDA+VL+ C+QL
Sbjct: 961  QLFKSHKTHSGATTCPVGLDSRVSDIKANEPPTAQLPIFSVLFPKPCTDDANVLQACSQL 1020

Query: 1021 HGSEQHLGSERSKSSEQSVNLKSSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRPQ 1080
            H SE+ L SE+   SEQSV+   SGESELIFEYFE EQPQQRRPLFDKI QLV+GDG  +
Sbjct: 1021 HSSEEPLASEKRNFSEQSVDSNLSGESELIFEYFEEEQPQQRRPLFDKICQLVKGDGCLR 1080

Query: 1081 GKIYGDPTMLNSITLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFVSRTSQPN 1140
            GKIYGDPT+L S+TLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFV RTSQ +
Sbjct: 1081 GKIYGDPTVLESVTLNDLHAGSWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFVCRTSQSS 1140

Query: 1141 SPDTNSCLVCPVVGLQSYNAQNECWFEPRNSTPTLTPGLSPPRILEERLRTLEETASLMA 1200
            S +T+SC+VCPVVGLQS+NAQNECWF+PRNST T     +PP +++ERLRTLEETASLMA
Sbjct: 1141 SSETDSCIVCPVVGLQSHNAQNECWFKPRNSTST----FNPPGVVDERLRTLEETASLMA 1200

Query: 1201 RAVVKKGNLNSENTHPDYEFFLSRR 1218
            RAVVKKG+LNS N HPDYEFFLSRR
Sbjct: 1201 RAVVKKGDLNSRNRHPDYEFFLSRR 1207

BLAST of Lsi02G002760.1 vs. TAIR 10
Match: AT5G41010.1 (DNA directed RNA polymerase, 7 kDa subunit )

HSP 1 Score: 92.8 bits (229), Expect = 2.0e-18
Identity = 39/44 (88.64%), Postives = 41/44 (93.18%), Query Frame = 0

Query: 1  MDPQPEPVSYICGDCGMENTLKQGDVIQCRECGYRILYKKRTRR 45
          MDP PEPV+Y+CGDCG ENTLK GDVIQCRECGYRILYKKRTRR
Sbjct: 1  MDPAPEPVTYVCGDCGQENTLKSGDVIQCRECGYRILYKKRTRR 44

BLAST of Lsi02G002760.1 vs. TAIR 10
Match: AT4G16100.1 (Protein of unknown function (DUF789) )

HSP 1 Score: 89.7 bits (221), Expect = 1.7e-17
Identity = 94/340 (27.65%), Postives = 130/340 (38.24%), Query Frame = 0

Query: 831  EDASRMEQAVNNACRAQLVSEAIQMETGSPIAEFERFLQLSSPVIN-QRPKLRSSEIYPR 890
            ++  + E+   + C       +    TG+  +   RFL  ++P+++ Q   L SS+ +  
Sbjct: 56   KEIKQPEECSTSDCSVPSRVSSTTTTTGTTSSNLGRFLDCTTPIVSTQHLPLTSSKGWRT 115

Query: 891  NPPGDVIPCSNETADISLGCLWQWYEKHGNYGLEIKANGHENSNGFGADNSAFCAYFVPF 950
              P              L  LW  +E+   YG+ +        NG      +   Y+VP+
Sbjct: 116  REP-------EYRPYFLLNDLWDSFEEWSAYGVGVPL----LLNGI----DSVVQYYVPY 175

Query: 951  LSAVQLFKSHKTHAPTTGPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVLRVC 1010
            LS +QL++       T   VG +S                      P+  + D S    C
Sbjct: 176  LSGIQLYEDPSRACTTRRRVGEESDGDS------------------PRDMSSDGS--NDC 235

Query: 1011 NQLHGSEQHLGSERSK---SSEQSVNLKSSGESELIFEYFEGEQPQQRRPLFDKIHQLVE 1070
             +L  +      E      SS       S+   EL+FEY EG  P  R PL DKI  L  
Sbjct: 236  RELSQNLYRASLEEKPCIGSSSDESEASSNSPGELVFEYLEGAMPFGREPLTDKISNL-- 295

Query: 1071 GDGRPQGKIYGDPTMLNSITLNDLHAGSWYSVAWYPIYRIPDG----NLRAAFLTYHSLG 1130
                           L +    DL   SW SVAWYPIYRIP G    NL A FLT+HSL 
Sbjct: 296  ---------SSQFPALRTYRSCDLSPSSWVSVAWYPIYRIPLGQSLQNLDACFLTFHSLS 349

Query: 1131 HFVSRTS----QPNSPDTNSC-LVCPVVGLQSYNAQNECW 1158
                 TS    Q +S    S  L  P  GL SY  +   W
Sbjct: 356  TPCRGTSNEEGQSSSKSVASAKLPLPTFGLASYKFKLSEW 349

BLAST of Lsi02G002760.1 vs. TAIR 10
Match: AT2G01260.1 (Protein of unknown function (DUF789) )

HSP 1 Score: 80.9 bits (198), Expect = 7.9e-15
Identity = 67/223 (30.04%), Postives = 91/223 (40.81%), Query Frame = 0

Query: 945  YFVPFLSAVQLF-KSHKTHAPTTGPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDA 1004
            Y+VP LSA+Q++  SH   +        DS  SD +                    + D+
Sbjct: 136  YYVPSLSAIQIYAHSHALDSSLKSRRPGDSSDSDFRDSSSDV--------------SSDS 195

Query: 1005 SVLRVCNQLHGSEQHLGSERSKSSEQSVNLKSSGESELIFEYFEGEQPQQRRPLFDKIHQ 1064
               RV  ++         +   SS+    L S G   L+FEY E + P  R P  DK+  
Sbjct: 196  DSERVSARVDCISLRDQHQEDSSSDDGEPLGSQG--RLMFEYLERDLPYIREPFADKVLD 255

Query: 1065 LVEGDGRPQGKIYGDPTMLNSITLNDLHAGSWYSVAWYPIYRIPDG----NLRAAFLTYH 1124
            L                 L ++   DL   SW+SVAWYPIYRIP G    +L A FLTYH
Sbjct: 256  LA-----------AQFPELMTLRSCDLLRSSWFSVAWYPIYRIPTGPTLKDLDACFLTYH 315

Query: 1125 SL-----GHFVSRTSQPNSPDTNSCLVCPVVGLQSYNAQNECW 1158
            SL     G    ++     P  +  +  PV GL SY  +   W
Sbjct: 316  SLHTSFGGEGSEQSMSLTQPRESEKMSLPVFGLASYKFRGSLW 331

BLAST of Lsi02G002760.1 vs. TAIR 10
Match: AT5G49220.1 (Protein of unknown function (DUF789) )

HSP 1 Score: 80.9 bits (198), Expect = 7.9e-15
Identity = 106/393 (26.97%), Postives = 158/393 (40.20%), Query Frame = 0

Query: 831  EDASRMEQAVNNACRAQLVSEAIQMETGSPIAEFERFLQLSSPVINQR--PKLRSSEIYP 890
            E  SR+  + +  C     S +      S  +  +RFL+ ++PV+  R  P     E+  
Sbjct: 76   ESKSRVVVSGSEVCAGSSDSSSGSGRVLSDGSNLDRFLEHTTPVVPARLFPMRSRWELKT 135

Query: 891  RNPPGDVIPCSNETADISLGCLWQWYEKHGNYGLEIKANGHE-NSNGFGADNSAFCAYFV 950
            R         S+      L  LW+ + +   YG  +    H    +G    N +   Y+V
Sbjct: 136  RE--------SDCHTYFVLEDLWESFAEWSAYGAGVPLEMHPLEMHG----NDSTVQYYV 195

Query: 951  PFLSAVQLFKSHKTHAPTTGPVGFDSCVSDIKVKEPSTCHLPIFSVLFPKPCTDDASVLR 1010
            P+LS +QL+           PVG +   S+      ++  LP+           D SV  
Sbjct: 196  PYLSGIQLYVD--PLKKPRNPVGDNEGSSE---GSSNSRTLPV-----------DLSVGE 255

Query: 1011 VCNQLHGSEQHLGSERSKSSEQSVNLKSSGESELIFEYFEGEQPQQRRPLFDKIHQLVEG 1070
            + N++   +Q +    S S E  +   S+ +  L+FEY E E P  R PL +KI  L   
Sbjct: 256  L-NRISLKDQSITGSLS-SGEAEI---SNPQGRLLFEYLEYEPPFGREPLANKISDL--A 315

Query: 1071 DGRPQGKIYGDPTMLNSITLNDLHAGSWYSVAWYPIYRIPDG----NLRAAFLTYHSLGH 1130
               P+   Y    +L S         SW SV+WYPIYRIP G    NL A FLT+HSL  
Sbjct: 316  SRVPELMTYRSCDLLPS---------SWVSVSWYPIYRIPVGPTLQNLDACFLTFHSLST 375

Query: 1131 FVSRTSQPNSPDTNSC-LVCPVVGLQSYNAQNECWFEPRNSTPTLTPGLSPPRILEERLR 1190
               +++   S    S  L  P  GL SY  +   W                    + R++
Sbjct: 376  APPQSAMGCSDSQPSTKLPLPTFGLASYKLKVSVW-------------------NQNRIQ 403

Query: 1191 TLEETASLMARAVVKKGNLNSENTHPDYEFFLS 1216
              ++  SL+  A   K     +  HPDY FF S
Sbjct: 436  ESQKMTSLLQAA--DKWLKRLQVDHPDYRFFTS 403

BLAST of Lsi02G002760.1 vs. TAIR 10
Match: AT5G23380.1 (Protein of unknown function (DUF789) )

HSP 1 Score: 79.7 bits (195), Expect = 1.8e-14
Identity = 79/262 (30.15%), Postives = 118/262 (45.04%), Query Frame = 0

Query: 968  PVGFDSCVSDIK-VKEPSTCHLPIFSVLFPKPCTDDASVLRVCNQLHGSEQHLGSERSKS 1027
            P+  ++  SD+K    PS   + IF++   KP +DD+    +   + G+E       S S
Sbjct: 72   PLSLENFDSDVKQYYNPSLSAIQIFTI---KPFSDDSRSSAI--GIDGTETGSAITDSDS 131

Query: 1028 SEQSVNLKSSGESELIFEYFEGEQPQQRRPLFDKIHQLVEGDGRPQGKIYGDPTMLNSIT 1087
            + +   L +     L F+Y E E+P  R PL  K+  L E           + T L+S+T
Sbjct: 132  NGKLQCLDAGDLGYLYFQYNEVERPFDRFPLTFKMADLAE-----------EHTGLSSLT 191

Query: 1088 LNDLHAGSWYSVAWYPIYRIP-----DGNLRAAFLTYHSL----GHFVSRTSQPNSPDTN 1147
             +DL   SW S+AWYPIY IP     DG + AAFLTYH L       + +  + N    +
Sbjct: 192  SSDLSPNSWISIAWYPIYPIPPVIGVDG-ISAAFLTYHLLKPNFPETIGKDDKGNEQGES 251

Query: 1148 SC--LVCPVVGLQSYNAQNECWFEPRNSTPTLTPGLSPPRILEERLRTLEETASLMARAV 1207
            S   ++ P  G  +Y A    W         + PG S  +  E      EE+A    R  
Sbjct: 252  STPEVLLPPFGAMTYKAFGNLW---------MMPGTSDYQNREMN----EESADSWLR-- 295

Query: 1208 VKKGNLNSENTHPDYEFFLSRR 1218
             K+G      +H D+ FF+SR+
Sbjct: 312  -KRG-----FSHSDFNFFMSRK 295

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9FLM82.8e-1788.64DNA-directed RNA polymerases II, IV and V subunit 12 OS=Arabidopsis thaliana OX=... [more]
Q9C8M43.4e-1073.17DNA-directed RNA polymerase subunit 12-like protein OS=Arabidopsis thaliana OX=3... [more]
Q3ZBC02.8e-0961.90DNA-directed RNA polymerases I, II, and III subunit RPABC4 OS=Bos taurus OX=9913... [more]
P538032.8e-0961.90DNA-directed RNA polymerases I, II, and III subunit RPABC4 OS=Homo sapiens OX=96... [more]
Q638712.8e-0961.90DNA-directed RNA polymerases I, II, and III subunit RPABC4 OS=Mus musculus OX=10... [more]
Match NameE-valueIdentityDescription
A0A0A0LT770.0e+0079.75Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G043170 PE=4 SV=1[more]
A0A5D3BH030.0e+0079.13Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
A0A6J1C5T50.0e+0074.78uncharacterized protein LOC111008718 OS=Momordica charantia OX=3673 GN=LOC111008... [more]
A0A6J1GS600.0e+0073.34uncharacterized protein LOC111457006 OS=Cucurbita moschata OX=3662 GN=LOC1114570... [more]
A0A6J1K4L40.0e+0073.32uncharacterized protein LOC111490028 OS=Cucurbita maxima OX=3661 GN=LOC111490028... [more]
Match NameE-valueIdentityDescription
XP_038894653.10.0e+0086.86uncharacterized protein LOC120083142 isoform X1 [Benincasa hispida] >XP_03889465... [more]
XP_038894656.10.0e+0083.92uncharacterized protein LOC120083142 isoform X2 [Benincasa hispida][more]
XP_004137638.20.0e+0079.75uncharacterized protein LOC101212209 [Cucumis sativus] >KGN64214.1 hypothetical ... [more]
TYJ99070.10.0e+0079.13uncharacterized protein E5676_scaffold248G002740 [Cucumis melo var. makuwa][more]
KAG6572995.10.0e+0073.39DNA-directed RNA polymerases II, IV and V subunit 12, partial [Cucurbita argyros... [more]
Match NameE-valueIdentityDescription
AT5G41010.12.0e-1888.64DNA directed RNA polymerase, 7 kDa subunit [more]
AT4G16100.11.7e-1727.65Protein of unknown function (DUF789) [more]
AT2G01260.17.9e-1530.04Protein of unknown function (DUF789) [more]
AT5G49220.17.9e-1526.97Protein of unknown function (DUF789) [more]
AT5G23380.11.8e-1430.15Protein of unknown function (DUF789) [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (USVL1VR-Ls) v1
Date Performed: 2021-10-18
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR006591RNA polymerase archaeal subunit P/eukaryotic subunit RPABC4SMARTSM00659rpolcxc3coord: 8..49
e-value: 5.2E-16
score: 69.2
IPR006591RNA polymerase archaeal subunit P/eukaryotic subunit RPABC4PFAMPF03604DNA_RNApol_7kDcoord: 10..41
e-value: 1.9E-15
score: 56.3
NoneNo IPR availableGENE3D2.20.28.30RNA polymerase ii, chain Lcoord: 1..48
e-value: 2.6E-21
score: 76.8
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 480..508
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 585..606
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 585..603
NoneNo IPR availablePANTHERPTHR32010PHOTOSYSTEM II STABILITY/ASSEMBLY FACTOR HCF136, CHLOROPLASTICcoord: 63..1217
NoneNo IPR availablePANTHERPTHR32010:SF21DUF789 FAMILY PROTEINcoord: 63..1217
IPR008507Protein of unknown function DUF789PFAMPF05623DUF789coord: 862..1213
e-value: 4.3E-68
score: 230.1
IPR029040RNA polymerase subunit RPABC4/transcription elongation factor Spt4SUPERFAMILY63393RNA polymerase subunitscoord: 7..45

Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Lsi02G002760Lsi02G002760gene


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
Lsi02G002760.1.exon.1Lsi02G002760.1.exon.1exon
Lsi02G002760.1.exon.2Lsi02G002760.1.exon.2exon
Lsi02G002760.1.exon.3Lsi02G002760.1.exon.3exon
Lsi02G002760.1.exon.4Lsi02G002760.1.exon.4exon
Lsi02G002760.1.exon.5Lsi02G002760.1.exon.5exon
Lsi02G002760.1.exon.6Lsi02G002760.1.exon.6exon
Lsi02G002760.1.exon.7Lsi02G002760.1.exon.7exon
Lsi02G002760.1.exon.8Lsi02G002760.1.exon.8exon
Lsi02G002760.1.exon.9Lsi02G002760.1.exon.9exon
Lsi02G002760.1.exon.10Lsi02G002760.1.exon.10exon


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Lsi02G002760.1.three_prime_UTR.1Lsi02G002760.1.three_prime_UTR.1three_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Lsi02G002760.1.CDS.9Lsi02G002760.1.CDS.9CDS
Lsi02G002760.1.CDS.8Lsi02G002760.1.CDS.8CDS
Lsi02G002760.1.CDS.7Lsi02G002760.1.CDS.7CDS
Lsi02G002760.1.CDS.6Lsi02G002760.1.CDS.6CDS
Lsi02G002760.1.CDS.5Lsi02G002760.1.CDS.5CDS
Lsi02G002760.1.CDS.4Lsi02G002760.1.CDS.4CDS
Lsi02G002760.1.CDS.3Lsi02G002760.1.CDS.3CDS
Lsi02G002760.1.CDS.2Lsi02G002760.1.CDS.2CDS
Lsi02G002760.1.CDS.1Lsi02G002760.1.CDS.1CDS


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Lsi02G002760.1.five_prime_UTR.2Lsi02G002760.1.five_prime_UTR.2five_prime_UTR
Lsi02G002760.1.five_prime_UTR.1Lsi02G002760.1.five_prime_UTR.1five_prime_UTR


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Lsi02G002760.1Lsi02G002760.1-proteinpolypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006351 transcription, DNA-templated
molecular_function GO:0003677 DNA binding
molecular_function GO:0003899 DNA-directed 5'-3' RNA polymerase activity