MC02g0332 (gene) Bitter gourd (Dali-11) v1

Overview
NameMC02g0332
Typegene
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptiontranscriptional elongation regulator MINIYO
LocationMC02: 2929402 .. 2937833 (-)
RNA-Seq ExpressionMC02g0332
SyntenyMC02g0332
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: utr5polypeptideCDSutr3
Hold the cursor over a type above to highlight its positions in the sequence below.
CTATAATTAAAATTTATTTATTTCTCCAAAATTGAATCCAATAAAAAGCCCACCAAAAAAGTCCAGCCCGATTCAAAATCAGTGCAAGCTGCAGCAGCGCTCTGCTAGTTGTGAAGCAGTTGAAATCAACTTCCTTCTCACTATCCAATTTCCAGACCTATTCGTTCGCATGGAGAAGACGACGCAGAGCAGTAGACGAAAGCAGCAGAAAAGCTCGGCTCGAGCAAAGGTATTTGGCGCCAAGGCGCTGCAGCTAAGCGAGGACGATGCCTCGCGCATAGTCGGCGGAATAGTGGAGAAAGGTATTTCAGACTCGCACCAGAACGAGCCCTTCGTCTCACTTGCCCCTCCCAAACCTTCTGTTCTGCCATTTCCTGTCGCTCGGCATCGGTCTCATGGCCCAGTAAGATTCCAGTTTGGCAGTTTCATTTATCTGTAGTGCACCCCATTTCTATGCTTAATTATGGAACTTAGCAGTCGGCGAGAGTAATTTTTATTCTTTTAATAAGTTTGACTATAGAAGCAGGAGGATTTTTTGTTTTATGTCAATATTGTGTAATATGATTGGAAGAAAGAGTATTTGAATATTCGTTAGTAGTTTTTCCAGTTGTAAATGGATTAATTTGGAGAAATATTTAGACTCATTTTGGATTCCCTATCTTATTAACAGTCCAATCTGATTTTGCAGCATTGGGTATCAGTAACTAGCAAAAAGGGTGGTGATAACATTAAACATGATCGGCAGGAGAATGGTGAAGATGATGAAAGTTTGATCGAGATTGAGTCTATAGCAAATTTTGCTAACCCAATACAAAGGAAAAATAAAAATAGCTTAGACTTTGGCAGGTGGAGAGAGGTTGTCCGAGGCCACAACCATGATGCAGCGAATAAAGAGGAAAAGAAAGTTGCAGGGTTAGCAAAAAATGAAAACCTAAATCGTGCTGGGGAAGCAATTAACACTGTTGATGATACCATGATGTCATGCAAGCCCTTATCAGCTGATGTGCTTGCACCTATTCTTATGAATGATGAACATAATTCATCTGGTTTTGTAAATGATCCCACTGGAATGAGGACAAAGGATTCTGGTTCTGATTTCGTGAGTTCAACAAATAATGCAGAAATAAAAGGGCTGGATCAACTATGTCTTTGGAAGGACTTTCAAGATGTTGATGATAGATCAGGACATGTTTCAGAGAGTGTTGAAATTAATGAGGGAATGCCAGTTGACGGTACTTCATTGCCAGATATGGCTATGGGGCTGCACCATTCAAATCCAGAAATGGTTCCTTGTTTTGGGTCCAATATCAAGGGAGAAGATGCATTCTCGACACTGGAGAGCCAGATCAATGCAGAGAACCGTGCACGGATACAAAGAATGTCACCTGAAGAGATTGCCGAAGCACAGACTGAGATAAAGGAGAAGATGAGTCCTGCACTAGTGAAAGCTCTGAAAAGGAGAGGTGAGGAAAAGTTGAAGAAGGGATCTTCAAAGCCTGACGTCAGCAAGAATTCTGAGTTGGATAATCTACAAAAAGAGGGTACATTTAATAGATATGATTCTCTCTGTGTAGAGAATGGCGTAACTTCAGCAAATTCAACCGTAAAGGACACAAAGAGTGGGCTTCAGAATGTTTCAGTGCAGAAATTTGACTTGGGTAGTAGCACATGGAGTGCTTGGAACGAAAGGGTTGAAGCAGTAAGGTTGTTAAGGTTTTCCCTGGAAGGAAATATAGTTGAAAGCTGTTCTTTCCAGCAATCAGAGAATGGTAAGACTTGGCGGTATTTTGTTGCTTTTTCTTTTCCCTGGAGGGGGAGGAGGGGCGATGGTGCATTTGATTAATAGAATTTCCATGGTGCTTTTGTTTATGTTTTCCATGATGTCTGTCATCTCCAATTATGAGATCTTGTGCATGAGCTAGATTTGGGCGTTTAGAACTTTTCCATCAGTTCCAGTTCCATAGAGGACGAACTCTATAGTTATTTAATGGTGATAACTGATCTAATAGTTCAACTAGCTCAAGTATCTTTTAACATTCCAATTGATAGATGGGCTGGGATATTAAATGTCTAGGAACAAAATGTTTATGTTTGTTTCTTAGTATTTATTCCTATGATTATGTCTAATAACTGAACCAAATTAATTTATTGTGAAAGCAATTTTTCTCCAATGTCACGATGTTGAGAGGATTTGGATACCTATGATGGTTGGCTTGCTATTCTCAATTTTCAAAATTGCATAAGAATGAAATCTTGTATGTAAAAACTCATACTCCGTAACCTTACGATGAATCTCATCAACCCAATGGCACTGTAGTATTATAAACATTTAAGGTACATAACTCCCTTTTGGAGTATACAAAGACCCCCTACCCAACTGAAAGTTGCAAGAAAACAGAACTTATAGAGAAACCCAATTTTTTGGGAATTTACTAGAACATAATCCAACTTAGTACTTGCTAAAGGATAGTGTGGACCTTTTTGTCAATTGTCATTTATTTTGAATATTTTAATTTCTTTCAGAAGAACCAAAAAGTATTCTTGTCACTTTCTTTTTATTTAGATCACACGTGAGTCTTGTTTTCTTTTCATTTCTAAAGCTTTCTATCATCACTAGAATGTATAAAAATGTATTATGTCTACAAGGTATACATTTTATTCTCTAATATTGATTAACTCCTCTATATGTGTCATGGCACAGGTGATCTTGCAGTCCATGGGTACAGCACTGAAAATGTTACTTCACGAGATTTTCTTCGAACTGATGGGGATCCTAGTGCTGCAGGTTACACGATTAAAGAAGCTGTGGCACTGACAAGGAGTGTGGTCAGTTTAGTTGCTTCATTGTAGTGACAATTTTTTTCCACCATAAAAGAACTGTTAGTTTTTTCCACTTTTGAGGTCTTATTTTTCTTGTTGCTTAATAATGTGTTGCTAGAGAACAAAGTTACTTCATTTTTTTAATTTCAAATCTTCAACATGTTTATAATTTTAAGCTCCTTAAATCTTCTTTCCTTTTTACTCTAGCTTCTCGTTTGTGTCAGATACCAGGTCAACGTGTCCTTGGGTTGCATGTTATTTCAAACGTGCTTGACAAGGCATTGCTTAATACACACCAAAAACCAGTTGGGTCCGCAATGGTTAAAGATGGTATCTCTATTGATTACAATGCAATTTGGGCTTATACCCTTGGCCCTGAACCAGAGCTTGCTTTGTCACTTAGGTGAGACTCCCTGAGTGCTTGTATTATTGACATTCATTTTTCCTTTTTTAAGAGTCTGCGCCTGCCCCAATCATTGACATTCATTTTAACTTGCTTTGGTATACTCATTGCTTGGCTTGGTTCATGCTACACCTGACTGTGTTGCTGCTACCATATTTTCAAATTTGTGAAAATTGATCCATGGTTTGTCATGGGAAGTTTATGTTACTCTAAGATGTGGTTAATTCATCATATCCTCATTGCTTTACATCTTTTCTAAAATCATCTCTGATTTTCAGTGTCCCAAGTCACTAAATTTTTTGTTTATGTGTAGGATATGCTTGGATGATAATCATAGCTCTGTCGTTCTAGCTTGTGCTGAAGTTATTCAGTGTATATTGGGCTGTAATTTGAACGAAATCTTCTTTGATACCTTACAGGTAGGATGTTTATAGCATGGAACTTTAATAAATTTTCTCTCCAATTTTTATTCTCAGTTAAAGTCTATTTTCTTCAGAAAACATCAACTTATGAGATGGATCTCTACACTGCTCCTATATTCCGAAGTAAACCAGAGATCAATGTTGGTTTCCTCCAAGGTGGATTTTGGAAGTACAATGCTAAACCTTCTAATATTCTTCCTTTCAGTGAAGATGTTGGGAATGTTGAAGATGGAGAGAAATATACAATCCAGGATGATATTGTGGTTGCACAACAAGATATTCTGGCAGGTTTGGTTCGAATGGGAATCCTTCACAGGCTTCGCTATCTTCTGGAGGTATACGTTCTCATTATCTAGATTTTGTACTTTTAATTTGACTATTGGGGATGACTGTCGCACCTGAATGGATCTCAACAGTTTCCCTTTTATTAAATTTGCATGATTAATTGATTAAGAAAGTAAAGATGACCTCAATGTATGGTGAGAATTCCCACTGAAGTTGGTATATTGATTAAAGAATAGCTATCTATTGCCAACACTACACAGGTTAGAAAGATTATAAGTTTATAACTATTGCTACTAATGAAGATTCTACATTGTGGCTTCTAATAAGTAAACAAATTATTGCTTTTGTGACTCTTTAAGGCATGACATGTGAAATATGTTTTTTTTTGGCCCTCTCTCTAATCGCAGCATGTTTAGTATCTACAATTGTTATGATTTAAAAAATGTTATTTCCATTTTTTACTTTGCGTTTTTGCTTCAATTTCCAATATGTGATGATTTTTATGGTTTCAGGCGGGCCCTTCAGTAGCTTTAGAAGAATGCATACTTTCGATACTTATTGCAATAGCAAGGCATTCCCCAACGTGCGCACAAGCAATAATGAAATGTGAAAGGCTTGTTGGGTTGATTATCAACAGGTTTACAATGAGCGACAAAATTGATATCCTTTCCTTTAAAATAAAATCTGTTGTCCTTTTGAAGGTAATTGCTAGTGCTTTGATGTATTCCAATAATTTTCATTTGTCTGTTGTTCGAATAGTTCGTTATTTCCATTATCAGACAGTTGCAGGAACTGATTTTGGGGTTTTGCCTCTGTTTTTCACACGACAGGTCTTAGCTTGTTCAGACAGGAATAATTGTGTTGCATTTGTAAAAACTGGAGCTTTTCCAACTATGATATGGCATTTGTTTCACTATATTACCTCCATTGACCATTGGGTCAAGTCAGGGAAGGAAAAATGTAAACTTTCATCAGCTTTGATGGTTGAACAATTGAGGCTGTGGAAGGTTTGCATTCAGGATGGATACTGTGTATCTTACTTTTCTGATGTTTTCCCTGCCTTGTGCTTATGGTTGAGCCCACCAAATTTTGACAAACTTGTTGAGAATAATGTCCTTCGTGAATTTGCAACCATTTGTACGGAGGTATACCATGTTCTGGAGGCTTTGGCTAGAAGACTTCCTAACTATTTTTCACAGAAACATTTAGACAGTCAAGAGCTAGGACTTGCTGGTAATGAATCTGAAATTTGGTCCTGGAGTTGTGCTGTTCCGATGGTTAATTTAGCTGTGAAATGGTTAGAGTCAAAGAGTGATCCATTCATATCCAAATTATTTGCGTCACAAAAAGAGATTAGGAGTGGCTTTGAGTTTGAAGGTATATCGTTGGCACCATTGTTGTGGGTTTATTCAGCTGTCATGAAGATGCTATCTCAAGTGTTTGAAAGGATTGTCCCGCAGGATATTATGTCCCTGGAAGGAAGTGGTCAGATTGTGCCTTCACTACCAGAGTTTATACCACGAGTTGGACTTGAGATAATTAGGAATGGCTTTTTGAGCTTTCCAGGTGCATATGATAAGAAACCCGAAACCTATCCCTTTGTAGGTAACTCTTTTGTAGAGGATCTTTGTTTTTTGAGAGAACATGGTGAATTTGAAACGTCTCTGGCTTCTGTATGTTGCCTTCATGGGTTGATGCTTAGTATTATGAATATTGATCGTCTGATCCACTTAGCCAAAACCGAGAGGCATGGTTTTCCTTTCAGAGATTACAACGGCTCAAGGGAAGGGGAAATTTTAATGGTTGGGATGTTTAAGGCATCCCTCATCGAACAGAGAAGCGTGCTTAACCTTTTTACAAAGGTAATTGCTTTGGAGTCTGACTCTCTGCAGTTAATAGAGACCTTTGGCAGAGGCGGCCCTGCACCTGGGGTGGGAACTGGCTGGGGCGTGTCTGGAGGTGGATATTGGTCTCCTGCTGTTTTATTAGCTCAAAATGATGCAGCATTTGTCATGTTCCTAATTCAAGCATTTCAGACCGTTCCAACTTTAAATATACTGACTGCTCAGGAATCCTTGACCATTCAAAGCATAAATTCTGCCTTGGCAATATGCTTGGTTCTTGGGCCAAGAGATACGTGTTTGGTCGAGAAAACAATGGAATTTTTGATCCAGGCTCCTATTTTGCATCATTTCAATTTCTATATTCAGAGTTTTATCCAACTCAATGGAAGAGTGAAGCAATTTGGGTGGAAGTATAGTGAAGACGACTGTTTGATCCTTTGTAAAACATTAAGTTCTCACTATAAGGATAGGTGGTTATCGCCAAAGGAATCCAAATCCACAAAGAATAAGAGCAACTTTAGTGACAAAATATTTAAGAAAAGCAGTAATTCTTTGGATACAATATATGAGGAGGAATCAGATGAGACAAATAGGATAGCGCAAGATTGTACTTGTTTGGTAGTTCAGTGGGCTTACCAAAGACTTCCACTTCCAAAGCATTGGTTTTTAAGTCCAGTCTCAACTATCTCTGATAGTAAGTATGTTGGTCTTCAAAAATCTTCTGATGCGCAAAAGATTGTGCAGGACTCTAGTGATGTGCTTGAGGTTGCTAAGAGTGGACTCTTCTTTATTTTAGGAGTTGAAGCATTTTCCACCTTCCTACCTGATTATTTCCCTTCACCTGTCCAGAGTGTGCCATTGATTTGGAAATTGCACTCTTTATCTGTTGTTTTACTCGCAGGCATGGGAGTTTTGGATGATGAGAAGAGTAGGGATGTTTATGAGGTTTTGCAAGACCTGTATGGTCAGTGTCTTAACGAAGCTAGGTATAGTAGACTCTCTGAACGAATCCAGGAGAAGAATGCAACAGATTTACCGTCACAACCAGAAAATAAGAGCAATTTAGAGTTCCTGATGTTTCAATCAGAGATTCATGACAGTTACTCAACATTTATTGAGACTCTAGTGGAGCAGTTTGCTGCCGAATCCTATGGAGATATACTATATGGACGGCAAATTGTACTATATCTTCACCGATGTGTTGAAGCTCCTGTGCGGATTGCTGCCTGGAATGCACTAAATAATGCCCGTGTTCTCGAACTTCTTCCACCTCTTGAGAAGTGCTTTGTTGATGCTGAAGGGTGTCTGGAACCTATTGAGGTGCGTTCCATCTTCACTGCTTTCAAATATGAGTATCAAACACATCTTTACCCGTTTTCTGCTTTCAAATATGAATATCAAACACATCTTTACCGGTTTTGGGTCCAAAATTATGCTCAATTACTACTTCAAGTCCAAAAATATAGAATACAATTCTTTCTTCACGAGATTGTCTTGAAAGTGGGAGGGACTGTATAAGTAGTGATTGTGATTCTTTTATCCATCCTCCTTAGGGAAATTTGCTAGAGTAGTTAAAAATAAATGAACAGAGGCAGTATTGGAGAGTTTTGGCATCAAATTTTCACGAGATTCTCTAGAAAGTTGGTTGTAGTAAGAAGATTGACATGTTTTTCATCGATTTCATTTACTGGAAAATCTAATCTGGAAAAAATGGTTCCTTCACAGGATAATGAAGCCATTCTGGAAGCCTATGTGAAGTCATGGGTTTCAGGTGCCCTTGACAGATCTGCAAGCCGAGGTTCAGTGGCCTATTTACTAGTTCTCCACCACCTCTCATCCTACATATTCCACTCTAACCATGTCGCCAACTTGTTGCTGCGTAACAAGCTCTCAAGGTCTCTTTTGCGAGACTACTCTCAAAAGCATCAGCGCAAGGTATTGAAGTTAATCTTGATAGTTCTAAATTTCAAGTAACAGCCATAACACATTGAACCTAGTGATATTATTATTCCGTTATGATATAACTTATGAGTAAGTAAAAGTTTCCCTCTGATGATATATCATGTTAACTGATGCATTCTGATCAAATACTTAATCGCAGGAAATGATGTCAGATCTTATCTTATATACTGCACCAGCGACGTATCGTGTAGCCGGACAGAAGGGTGTTTGTTCGTCGATCAAAATGAGCACTGTGGAGAAAAGGCTCGAGGTGTTGAAGGAAGCTTGTGAGAGGAACTCCTATCTTTTGACAGTGGTTGAAGAGCTCGGGTCCGCTGCAAAACACAAACTGTCCGCAATGTGCAGTCAGTTAGCCTAAAAAGCTAGCAAAATATATGATGCGTAGAGAAGAGAAGATTGGAGATGGAAGTTCTTGGTAATGAGGATTGAAGGGGGTGATGTGTTCCCTGTTCTCAAACTTCAACCATGGGTTGGGCAGCCGTATAGATCCTTGAATGGGAAAGTTTCTGACATTTGTACATGAATTGTGTAAATTAAACTGAAATGCAACAAAAACGATGGTCCTAACTTGATTTTGACATGAAAGTTTGTTTGATAAGAAACATGTAACTCAGTGAAATGCATATAACTAATCTAGTCTGATTGAATATTAAGTCTATTTAAAATTAATTATGATTCTATTTAGTTTATATTGTC

mRNA sequence

CTATAATTAAAATTTATTTATTTCTCCAAAATTGAATCCAATAAAAAGCCCACCAAAAAAGTCCAGCCCGATTCAAAATCAGTGCAAGCTGCAGCAGCGCTCTGCTAGTTGTGAAGCAGTTGAAATCAACTTCCTTCTCACTATCCAATTTCCAGACCTATTCGTTCGCATGGAGAAGACGACGCAGAGCAGTAGACGAAAGCAGCAGAAAAGCTCGGCTCGAGCAAAGGTATTTGGCGCCAAGGCGCTGCAGCTAAGCGAGGACGATGCCTCGCGCATAGTCGGCGGAATAGTGGAGAAAGGTATTTCAGACTCGCACCAGAACGAGCCCTTCGTCTCACTTGCCCCTCCCAAACCTTCTGTTCTGCCATTTCCTGTCGCTCGGCATCGGTCTCATGGCCCACATTGGGTATCAGTAACTAGCAAAAAGGGTGGTGATAACATTAAACATGATCGGCAGGAGAATGGTGAAGATGATGAAAGTTTGATCGAGATTGAGTCTATAGCAAATTTTGCTAACCCAATACAAAGGAAAAATAAAAATAGCTTAGACTTTGGCAGGTGGAGAGAGGTTGTCCGAGGCCACAACCATGATGCAGCGAATAAAGAGGAAAAGAAAGTTGCAGGGTTAGCAAAAAATGAAAACCTAAATCGTGCTGGGGAAGCAATTAACACTGTTGATGATACCATGATGTCATGCAAGCCCTTATCAGCTGATGTGCTTGCACCTATTCTTATGAATGATGAACATAATTCATCTGGTTTTGTAAATGATCCCACTGGAATGAGGACAAAGGATTCTGGTTCTGATTTCGTGAGTTCAACAAATAATGCAGAAATAAAAGGGCTGGATCAACTATGTCTTTGGAAGGACTTTCAAGATGTTGATGATAGATCAGGACATGTTTCAGAGAGTGTTGAAATTAATGAGGGAATGCCAGTTGACGGTACTTCATTGCCAGATATGGCTATGGGGCTGCACCATTCAAATCCAGAAATGGTTCCTTGTTTTGGGTCCAATATCAAGGGAGAAGATGCATTCTCGACACTGGAGAGCCAGATCAATGCAGAGAACCGTGCACGGATACAAAGAATGTCACCTGAAGAGATTGCCGAAGCACAGACTGAGATAAAGGAGAAGATGAGTCCTGCACTAGTGAAAGCTCTGAAAAGGAGAGGTGAGGAAAAGTTGAAGAAGGGATCTTCAAAGCCTGACGTCAGCAAGAATTCTGAGTTGGATAATCTACAAAAAGAGGGTACATTTAATAGATATGATTCTCTCTGTGTAGAGAATGGCGTAACTTCAGCAAATTCAACCGTAAAGGACACAAAGAGTGGGCTTCAGAATGTTTCAGTGCAGAAATTTGACTTGGGTAGTAGCACATGGAGTGCTTGGAACGAAAGGGTTGAAGCAGTAAGGTTGTTAAGGTTTTCCCTGGAAGGAAATATAGTTGAAAGCTGTTCTTTCCAGCAATCAGAGAATGGTGATCTTGCAGTCCATGGGTACAGCACTGAAAATGTTACTTCACGAGATTTTCTTCGAACTGATGGGGATCCTAGTGCTGCAGGTTACACGATTAAAGAAGCTGTGGCACTGACAAGGAGTGTGATACCAGGTCAACGTGTCCTTGGGTTGCATGTTATTTCAAACGTGCTTGACAAGGCATTGCTTAATACACACCAAAAACCAGTTGGGTCCGCAATGGTTAAAGATGGTATCTCTATTGATTACAATGCAATTTGGGCTTATACCCTTGGCCCTGAACCAGAGCTTGCTTTGTCACTTAGGATATGCTTGGATGATAATCATAGCTCTGTCGTTCTAGCTTGTGCTGAAGTTATTCAGTGTATATTGGGCTGTAATTTGAACGAAATCTTCTTTGATACCTTACAGAAAACATCAACTTATGAGATGGATCTCTACACTGCTCCTATATTCCGAAGTAAACCAGAGATCAATGTTGGTTTCCTCCAAGGTGGATTTTGGAAGTACAATGCTAAACCTTCTAATATTCTTCCTTTCAGTGAAGATGTTGGGAATGTTGAAGATGGAGAGAAATATACAATCCAGGATGATATTGTGGTTGCACAACAAGATATTCTGGCAGGTTTGGTTCGAATGGGAATCCTTCACAGGCTTCGCTATCTTCTGGAGGCGGGCCCTTCAGTAGCTTTAGAAGAATGCATACTTTCGATACTTATTGCAATAGCAAGGCATTCCCCAACGTGCGCACAAGCAATAATGAAATGTGAAAGGCTTGTTGGGTTGATTATCAACAGGTTTACAATGAGCGACAAAATTGATATCCTTTCCTTTAAAATAAAATCTGTTGTCCTTTTGAAGGTCTTAGCTTGTTCAGACAGGAATAATTGTGTTGCATTTGTAAAAACTGGAGCTTTTCCAACTATGATATGGCATTTGTTTCACTATATTACCTCCATTGACCATTGGGTCAAGTCAGGGAAGGAAAAATGTAAACTTTCATCAGCTTTGATGGTTGAACAATTGAGGCTGTGGAAGGTTTGCATTCAGGATGGATACTGTGTATCTTACTTTTCTGATGTTTTCCCTGCCTTGTGCTTATGGTTGAGCCCACCAAATTTTGACAAACTTGTTGAGAATAATGTCCTTCGTGAATTTGCAACCATTTGTACGGAGGTATACCATGTTCTGGAGGCTTTGGCTAGAAGACTTCCTAACTATTTTTCACAGAAACATTTAGACAGTCAAGAGCTAGGACTTGCTGGTAATGAATCTGAAATTTGGTCCTGGAGTTGTGCTGTTCCGATGGTTAATTTAGCTGTGAAATGGTTAGAGTCAAAGAGTGATCCATTCATATCCAAATTATTTGCGTCACAAAAAGAGATTAGGAGTGGCTTTGAGTTTGAAGGTATATCGTTGGCACCATTGTTGTGGGTTTATTCAGCTGTCATGAAGATGCTATCTCAAGTGTTTGAAAGGATTGTCCCGCAGGATATTATGTCCCTGGAAGGAAGTGGTCAGATTGTGCCTTCACTACCAGAGTTTATACCACGAGTTGGACTTGAGATAATTAGGAATGGCTTTTTGAGCTTTCCAGGTGCATATGATAAGAAACCCGAAACCTATCCCTTTGTAGGTAACTCTTTTGTAGAGGATCTTTGTTTTTTGAGAGAACATGGTGAATTTGAAACGTCTCTGGCTTCTGTATGTTGCCTTCATGGGTTGATGCTTAGTATTATGAATATTGATCGTCTGATCCACTTAGCCAAAACCGAGAGGCATGGTTTTCCTTTCAGAGATTACAACGGCTCAAGGGAAGGGGAAATTTTAATGGTTGGGATGTTTAAGGCATCCCTCATCGAACAGAGAAGCGTGCTTAACCTTTTTACAAAGGTAATTGCTTTGGAGTCTGACTCTCTGCAGTTAATAGAGACCTTTGGCAGAGGCGGCCCTGCACCTGGGGTGGGAACTGGCTGGGGCGTGTCTGGAGGTGGATATTGGTCTCCTGCTGTTTTATTAGCTCAAAATGATGCAGCATTTGTCATGTTCCTAATTCAAGCATTTCAGACCGTTCCAACTTTAAATATACTGACTGCTCAGGAATCCTTGACCATTCAAAGCATAAATTCTGCCTTGGCAATATGCTTGGTTCTTGGGCCAAGAGATACGTGTTTGGTCGAGAAAACAATGGAATTTTTGATCCAGGCTCCTATTTTGCATCATTTCAATTTCTATATTCAGAGTTTTATCCAACTCAATGGAAGAGTGAAGCAATTTGGGTGGAAGTATAGTGAAGACGACTGTTTGATCCTTTGTAAAACATTAAGTTCTCACTATAAGGATAGGTGGTTATCGCCAAAGGAATCCAAATCCACAAAGAATAAGAGCAACTTTAGTGACAAAATATTTAAGAAAAGCAGTAATTCTTTGGATACAATATATGAGGAGGAATCAGATGAGACAAATAGGATAGCGCAAGATTGTACTTGTTTGGTAGTTCAGTGGGCTTACCAAAGACTTCCACTTCCAAAGCATTGGTTTTTAAGTCCAGTCTCAACTATCTCTGATAGTAAGTATGTTGGTCTTCAAAAATCTTCTGATGCGCAAAAGATTGTGCAGGACTCTAGTGATGTGCTTGAGGTTGCTAAGAGTGGACTCTTCTTTATTTTAGGAGTTGAAGCATTTTCCACCTTCCTACCTGATTATTTCCCTTCACCTGTCCAGAGTGTGCCATTGATTTGGAAATTGCACTCTTTATCTGTTGTTTTACTCGCAGGCATGGGAGTTTTGGATGATGAGAAGAGTAGGGATGTTTATGAGGTTTTGCAAGACCTGTATGGTCAGTGTCTTAACGAAGCTAGGTATAGTAGACTCTCTGAACGAATCCAGGAGAAGAATGCAACAGATTTACCGTCACAACCAGAAAATAAGAGCAATTTAGAGTTCCTGATGTTTCAATCAGAGATTCATGACAGTTACTCAACATTTATTGAGACTCTAGTGGAGCAGTTTGCTGCCGAATCCTATGGAGATATACTATATGGACGGCAAATTGTACTATATCTTCACCGATGTGTTGAAGCTCCTGTGCGGATTGCTGCCTGGAATGCACTAAATAATGCCCGTGTTCTCGAACTTCTTCCACCTCTTGAGAAGTGCTTTGTTGATGCTGAAGGGTGTCTGGAACCTATTGAGGATAATGAAGCCATTCTGGAAGCCTATGTGAAGTCATGGGTTTCAGGTGCCCTTGACAGATCTGCAAGCCGAGGTTCAGTGGCCTATTTACTAGTTCTCCACCACCTCTCATCCTACATATTCCACTCTAACCATGTCGCCAACTTGTTGCTGCGTAACAAGCTCTCAAGGTCTCTTTTGCGAGACTACTCTCAAAAGCATCAGCGCAAGGAAATGATGTCAGATCTTATCTTATATACTGCACCAGCGACGTATCGTGTAGCCGGACAGAAGGGTGTTTGTTCGTCGATCAAAATGAGCACTGTGGAGAAAAGGCTCGAGGTGTTGAAGGAAGCTTGTGAGAGGAACTCCTATCTTTTGACAGTGGTTGAAGAGCTCGGGTCCGCTGCAAAACACAAACTGTCCGCAATGTGCAGTCAGTTAGCCTAAAAAGCTAGCAAAATATATGATGCGTAGAGAAGAGAAGATTGGAGATGGAAGTTCTTGGTAATGAGGATTGAAGGGGGTGATGTGTTCCCTGTTCTCAAACTTCAACCATGGGTTGGGCAGCCGTATAGATCCTTGAATGGGAAAGTTTCTGACATTTGTACATGAATTGTGTAAATTAAACTGAAATGCAACAAAAACGATGGTCCTAACTTGATTTTGACATGAAAGTTTGTTTGATAAGAAACATGTAACTCAGTGAAATGCATATAACTAATCTAGTCTGATTGAATATTAAGTCTATTTAAAATTAATTATGATTCTATTTAGTTTATATTGTC

Coding sequence (CDS)

ATGGAGAAGACGACGCAGAGCAGTAGACGAAAGCAGCAGAAAAGCTCGGCTCGAGCAAAGGTATTTGGCGCCAAGGCGCTGCAGCTAAGCGAGGACGATGCCTCGCGCATAGTCGGCGGAATAGTGGAGAAAGGTATTTCAGACTCGCACCAGAACGAGCCCTTCGTCTCACTTGCCCCTCCCAAACCTTCTGTTCTGCCATTTCCTGTCGCTCGGCATCGGTCTCATGGCCCACATTGGGTATCAGTAACTAGCAAAAAGGGTGGTGATAACATTAAACATGATCGGCAGGAGAATGGTGAAGATGATGAAAGTTTGATCGAGATTGAGTCTATAGCAAATTTTGCTAACCCAATACAAAGGAAAAATAAAAATAGCTTAGACTTTGGCAGGTGGAGAGAGGTTGTCCGAGGCCACAACCATGATGCAGCGAATAAAGAGGAAAAGAAAGTTGCAGGGTTAGCAAAAAATGAAAACCTAAATCGTGCTGGGGAAGCAATTAACACTGTTGATGATACCATGATGTCATGCAAGCCCTTATCAGCTGATGTGCTTGCACCTATTCTTATGAATGATGAACATAATTCATCTGGTTTTGTAAATGATCCCACTGGAATGAGGACAAAGGATTCTGGTTCTGATTTCGTGAGTTCAACAAATAATGCAGAAATAAAAGGGCTGGATCAACTATGTCTTTGGAAGGACTTTCAAGATGTTGATGATAGATCAGGACATGTTTCAGAGAGTGTTGAAATTAATGAGGGAATGCCAGTTGACGGTACTTCATTGCCAGATATGGCTATGGGGCTGCACCATTCAAATCCAGAAATGGTTCCTTGTTTTGGGTCCAATATCAAGGGAGAAGATGCATTCTCGACACTGGAGAGCCAGATCAATGCAGAGAACCGTGCACGGATACAAAGAATGTCACCTGAAGAGATTGCCGAAGCACAGACTGAGATAAAGGAGAAGATGAGTCCTGCACTAGTGAAAGCTCTGAAAAGGAGAGGTGAGGAAAAGTTGAAGAAGGGATCTTCAAAGCCTGACGTCAGCAAGAATTCTGAGTTGGATAATCTACAAAAAGAGGGTACATTTAATAGATATGATTCTCTCTGTGTAGAGAATGGCGTAACTTCAGCAAATTCAACCGTAAAGGACACAAAGAGTGGGCTTCAGAATGTTTCAGTGCAGAAATTTGACTTGGGTAGTAGCACATGGAGTGCTTGGAACGAAAGGGTTGAAGCAGTAAGGTTGTTAAGGTTTTCCCTGGAAGGAAATATAGTTGAAAGCTGTTCTTTCCAGCAATCAGAGAATGGTGATCTTGCAGTCCATGGGTACAGCACTGAAAATGTTACTTCACGAGATTTTCTTCGAACTGATGGGGATCCTAGTGCTGCAGGTTACACGATTAAAGAAGCTGTGGCACTGACAAGGAGTGTGATACCAGGTCAACGTGTCCTTGGGTTGCATGTTATTTCAAACGTGCTTGACAAGGCATTGCTTAATACACACCAAAAACCAGTTGGGTCCGCAATGGTTAAAGATGGTATCTCTATTGATTACAATGCAATTTGGGCTTATACCCTTGGCCCTGAACCAGAGCTTGCTTTGTCACTTAGGATATGCTTGGATGATAATCATAGCTCTGTCGTTCTAGCTTGTGCTGAAGTTATTCAGTGTATATTGGGCTGTAATTTGAACGAAATCTTCTTTGATACCTTACAGAAAACATCAACTTATGAGATGGATCTCTACACTGCTCCTATATTCCGAAGTAAACCAGAGATCAATGTTGGTTTCCTCCAAGGTGGATTTTGGAAGTACAATGCTAAACCTTCTAATATTCTTCCTTTCAGTGAAGATGTTGGGAATGTTGAAGATGGAGAGAAATATACAATCCAGGATGATATTGTGGTTGCACAACAAGATATTCTGGCAGGTTTGGTTCGAATGGGAATCCTTCACAGGCTTCGCTATCTTCTGGAGGCGGGCCCTTCAGTAGCTTTAGAAGAATGCATACTTTCGATACTTATTGCAATAGCAAGGCATTCCCCAACGTGCGCACAAGCAATAATGAAATGTGAAAGGCTTGTTGGGTTGATTATCAACAGGTTTACAATGAGCGACAAAATTGATATCCTTTCCTTTAAAATAAAATCTGTTGTCCTTTTGAAGGTCTTAGCTTGTTCAGACAGGAATAATTGTGTTGCATTTGTAAAAACTGGAGCTTTTCCAACTATGATATGGCATTTGTTTCACTATATTACCTCCATTGACCATTGGGTCAAGTCAGGGAAGGAAAAATGTAAACTTTCATCAGCTTTGATGGTTGAACAATTGAGGCTGTGGAAGGTTTGCATTCAGGATGGATACTGTGTATCTTACTTTTCTGATGTTTTCCCTGCCTTGTGCTTATGGTTGAGCCCACCAAATTTTGACAAACTTGTTGAGAATAATGTCCTTCGTGAATTTGCAACCATTTGTACGGAGGTATACCATGTTCTGGAGGCTTTGGCTAGAAGACTTCCTAACTATTTTTCACAGAAACATTTAGACAGTCAAGAGCTAGGACTTGCTGGTAATGAATCTGAAATTTGGTCCTGGAGTTGTGCTGTTCCGATGGTTAATTTAGCTGTGAAATGGTTAGAGTCAAAGAGTGATCCATTCATATCCAAATTATTTGCGTCACAAAAAGAGATTAGGAGTGGCTTTGAGTTTGAAGGTATATCGTTGGCACCATTGTTGTGGGTTTATTCAGCTGTCATGAAGATGCTATCTCAAGTGTTTGAAAGGATTGTCCCGCAGGATATTATGTCCCTGGAAGGAAGTGGTCAGATTGTGCCTTCACTACCAGAGTTTATACCACGAGTTGGACTTGAGATAATTAGGAATGGCTTTTTGAGCTTTCCAGGTGCATATGATAAGAAACCCGAAACCTATCCCTTTGTAGGTAACTCTTTTGTAGAGGATCTTTGTTTTTTGAGAGAACATGGTGAATTTGAAACGTCTCTGGCTTCTGTATGTTGCCTTCATGGGTTGATGCTTAGTATTATGAATATTGATCGTCTGATCCACTTAGCCAAAACCGAGAGGCATGGTTTTCCTTTCAGAGATTACAACGGCTCAAGGGAAGGGGAAATTTTAATGGTTGGGATGTTTAAGGCATCCCTCATCGAACAGAGAAGCGTGCTTAACCTTTTTACAAAGGTAATTGCTTTGGAGTCTGACTCTCTGCAGTTAATAGAGACCTTTGGCAGAGGCGGCCCTGCACCTGGGGTGGGAACTGGCTGGGGCGTGTCTGGAGGTGGATATTGGTCTCCTGCTGTTTTATTAGCTCAAAATGATGCAGCATTTGTCATGTTCCTAATTCAAGCATTTCAGACCGTTCCAACTTTAAATATACTGACTGCTCAGGAATCCTTGACCATTCAAAGCATAAATTCTGCCTTGGCAATATGCTTGGTTCTTGGGCCAAGAGATACGTGTTTGGTCGAGAAAACAATGGAATTTTTGATCCAGGCTCCTATTTTGCATCATTTCAATTTCTATATTCAGAGTTTTATCCAACTCAATGGAAGAGTGAAGCAATTTGGGTGGAAGTATAGTGAAGACGACTGTTTGATCCTTTGTAAAACATTAAGTTCTCACTATAAGGATAGGTGGTTATCGCCAAAGGAATCCAAATCCACAAAGAATAAGAGCAACTTTAGTGACAAAATATTTAAGAAAAGCAGTAATTCTTTGGATACAATATATGAGGAGGAATCAGATGAGACAAATAGGATAGCGCAAGATTGTACTTGTTTGGTAGTTCAGTGGGCTTACCAAAGACTTCCACTTCCAAAGCATTGGTTTTTAAGTCCAGTCTCAACTATCTCTGATAGTAAGTATGTTGGTCTTCAAAAATCTTCTGATGCGCAAAAGATTGTGCAGGACTCTAGTGATGTGCTTGAGGTTGCTAAGAGTGGACTCTTCTTTATTTTAGGAGTTGAAGCATTTTCCACCTTCCTACCTGATTATTTCCCTTCACCTGTCCAGAGTGTGCCATTGATTTGGAAATTGCACTCTTTATCTGTTGTTTTACTCGCAGGCATGGGAGTTTTGGATGATGAGAAGAGTAGGGATGTTTATGAGGTTTTGCAAGACCTGTATGGTCAGTGTCTTAACGAAGCTAGGTATAGTAGACTCTCTGAACGAATCCAGGAGAAGAATGCAACAGATTTACCGTCACAACCAGAAAATAAGAGCAATTTAGAGTTCCTGATGTTTCAATCAGAGATTCATGACAGTTACTCAACATTTATTGAGACTCTAGTGGAGCAGTTTGCTGCCGAATCCTATGGAGATATACTATATGGACGGCAAATTGTACTATATCTTCACCGATGTGTTGAAGCTCCTGTGCGGATTGCTGCCTGGAATGCACTAAATAATGCCCGTGTTCTCGAACTTCTTCCACCTCTTGAGAAGTGCTTTGTTGATGCTGAAGGGTGTCTGGAACCTATTGAGGATAATGAAGCCATTCTGGAAGCCTATGTGAAGTCATGGGTTTCAGGTGCCCTTGACAGATCTGCAAGCCGAGGTTCAGTGGCCTATTTACTAGTTCTCCACCACCTCTCATCCTACATATTCCACTCTAACCATGTCGCCAACTTGTTGCTGCGTAACAAGCTCTCAAGGTCTCTTTTGCGAGACTACTCTCAAAAGCATCAGCGCAAGGAAATGATGTCAGATCTTATCTTATATACTGCACCAGCGACGTATCGTGTAGCCGGACAGAAGGGTGTTTGTTCGTCGATCAAAATGAGCACTGTGGAGAAAAGGCTCGAGGTGTTGAAGGAAGCTTGTGAGAGGAACTCCTATCTTTTGACAGTGGTTGAAGAGCTCGGGTCCGCTGCAAAACACAAACTGTCCGCAATGTGCAGTCAGTTAGCCTAA

Protein sequence

MEKTTQSSRRKQQKSSARAKVFGAKALQLSEDDASRIVGGIVEKGISDSHQNEPFVSLAPPKPSVLPFPVARHRSHGPHWVSVTSKKGGDNIKHDRQENGEDDESLIEIESIANFANPIQRKNKNSLDFGRWREVVRGHNHDAANKEEKKVAGLAKNENLNRAGEAINTVDDTMMSCKPLSADVLAPILMNDEHNSSGFVNDPTGMRTKDSGSDFVSSTNNAEIKGLDQLCLWKDFQDVDDRSGHVSESVEINEGMPVDGTSLPDMAMGLHHSNPEMVPCFGSNIKGEDAFSTLESQINAENRARIQRMSPEEIAEAQTEIKEKMSPALVKALKRRGEEKLKKGSSKPDVSKNSELDNLQKEGTFNRYDSLCVENGVTSANSTVKDTKSGLQNVSVQKFDLGSSTWSAWNERVEAVRLLRFSLEGNIVESCSFQQSENGDLAVHGYSTENVTSRDFLRTDGDPSAAGYTIKEAVALTRSVIPGQRVLGLHVISNVLDKALLNTHQKPVGSAMVKDGISIDYNAIWAYTLGPEPELALSLRICLDDNHSSVVLACAEVIQCILGCNLNEIFFDTLQKTSTYEMDLYTAPIFRSKPEINVGFLQGGFWKYNAKPSNILPFSEDVGNVEDGEKYTIQDDIVVAQQDILAGLVRMGILHRLRYLLEAGPSVALEECILSILIAIARHSPTCAQAIMKCERLVGLIINRFTMSDKIDILSFKIKSVVLLKVLACSDRNNCVAFVKTGAFPTMIWHLFHYITSIDHWVKSGKEKCKLSSALMVEQLRLWKVCIQDGYCVSYFSDVFPALCLWLSPPNFDKLVENNVLREFATICTEVYHVLEALARRLPNYFSQKHLDSQELGLAGNESEIWSWSCAVPMVNLAVKWLESKSDPFISKLFASQKEIRSGFEFEGISLAPLLWVYSAVMKMLSQVFERIVPQDIMSLEGSGQIVPSLPEFIPRVGLEIIRNGFLSFPGAYDKKPETYPFVGNSFVEDLCFLREHGEFETSLASVCCLHGLMLSIMNIDRLIHLAKTERHGFPFRDYNGSREGEILMVGMFKASLIEQRSVLNLFTKVIALESDSLQLIETFGRGGPAPGVGTGWGVSGGGYWSPAVLLAQNDAAFVMFLIQAFQTVPTLNILTAQESLTIQSINSALAICLVLGPRDTCLVEKTMEFLIQAPILHHFNFYIQSFIQLNGRVKQFGWKYSEDDCLILCKTLSSHYKDRWLSPKESKSTKNKSNFSDKIFKKSSNSLDTIYEEESDETNRIAQDCTCLVVQWAYQRLPLPKHWFLSPVSTISDSKYVGLQKSSDAQKIVQDSSDVLEVAKSGLFFILGVEAFSTFLPDYFPSPVQSVPLIWKLHSLSVVLLAGMGVLDDEKSRDVYEVLQDLYGQCLNEARYSRLSERIQEKNATDLPSQPENKSNLEFLMFQSEIHDSYSTFIETLVEQFAAESYGDILYGRQIVLYLHRCVEAPVRIAAWNALNNARVLELLPPLEKCFVDAEGCLEPIEDNEAILEAYVKSWVSGALDRSASRGSVAYLLVLHHLSSYIFHSNHVANLLLRNKLSRSLLRDYSQKHQRKEMMSDLILYTAPATYRVAGQKGVCSSIKMSTVEKRLEVLKEACERNSYLLTVVEELGSAAKHKLSAMCSQLA
Homology
BLAST of MC02g0332 vs. ExPASy Swiss-Prot
Match: Q8GYU3 (Transcriptional elongation regulator MINIYO OS=Arabidopsis thaliana OX=3702 GN=IYO PE=1 SV=1)

HSP 1 Score: 1136.3 bits (2938), Expect = 0.0e+00
Identity = 685/1606 (42.65%), Postives = 943/1606 (58.72%), Query Frame = 0

Query: 37   IVGGIVEKGISDSHQNEPFVSLAPPKPSVLPFPVARHRSHGPHWVSVTSKKGGDNIKHDR 96
            +VG IVEKGIS   +N+P     PP+PS+L FPVARHRSHGPH   V S        +D 
Sbjct: 20   LVGSIVEKGIS---ENKPPSKPLPPRPSLLSFPVARHRSHGPHLAPVGSSIAQPKDYNDD 79

Query: 97   QENGEDDESLIEIESIANFANPIQRKNKNSLDFGRWREVVRGHNHDAAN--KEEKKVAGL 156
            QE  E +E  +  +SIA FA P+QRK K  +D GRW+++V G +  + +  ++ +K+  +
Sbjct: 80   QEEEEAEERFMNADSIAAFAKPLQRKEKKDMDLGRWKDMVSGDDPASTHVPQQSRKLKII 139

Query: 157  AKNENLNRAGEAINTVDDTMMSCKPLSADVLAPILMNDEHNSSGFVNDPTGMRTKDSGSD 216
                    + +A  T  +T+++ +            +D+     FV+D            
Sbjct: 140  ETRPPYVASADAATTSSNTLLAAR-----------ASDQRE---FVSDKAPF-------- 199

Query: 217  FVSSTNNAEIKGLDQLCLWKDFQDVDDRSGHVSESVEINEGMPVDGTSLPDMAMGLHHSN 276
                     IK L                    E V +N   P+  ++     +G  H  
Sbjct: 200  ---------IKNLG-----------------TKERVPLNASPPLAVSN----GLGTRH-- 259

Query: 277  PEMVPCFGSNIKGEDAFSTLESQINAENRARIQRMSPEEIAEAQTEIKEKMSPALVKALK 336
                           A S+LES I+ EN A++Q MSP+EIAEAQ E+ +KM PAL+  LK
Sbjct: 260  ---------------ASSSLESDIDVENHAKLQTMSPDEIAEAQAELLDKMDPALLSILK 319

Query: 337  RRGEEKLKKGS-SKPDVSKNSE-LDNLQKEGTFNRYDSLCVENGVTSANSTVKDTKSGLQ 396
            +RGE KLKK   S   VS   E   N + EG F           VT     +   KS +Q
Sbjct: 320  KRGEAKLKKRKHSVQGVSITDETAKNSRTEGHF-----------VTPKVMAIPKEKSVVQ 379

Query: 397  NVSVQKFDLGSSTWSAWNERVEAVRLLRFSLEGNIVESCSFQQSENGDLAVHGYSTENVT 456
               + +       W AW ERVEA R LRFS +GN+VE      +E G         E+  
Sbjct: 380  KPGIAQ----GFVWDAWTERVEAARDLRFSFDGNVVEEDVVSPAETGG---KWSGVESAA 439

Query: 457  SRDFLRTDGDPSAAGYTIKEAVALTRSVIPGQRVLGLHVISNVLDKALLNTHQKPVGSAM 516
             RDFLRT+GDP AAGYTIKEA+AL RSVIPGQR L LH++++VLDKAL    Q  +G A 
Sbjct: 440  ERDFLRTEGDPGAAGYTIKEAIALARSVIPGQRCLALHLLASVLDKALNKLCQSRIGYAR 499

Query: 517  VKDGISIDYNAIWAYTLGPEPELALSLRICLDDNHSSVVLACAEVIQCILGCNLNEIFFD 576
             +   S D+ AIWAY LGPEPEL L+LR+ LDDNH+SVV+AC +VIQC+L C+LNE FF+
Sbjct: 500  EEKDKSTDWEAIWAYALGPEPELVLALRMALDDNHASVVIACVKVIQCLLSCSLNENFFN 559

Query: 577  TLQKTSTYEMDLYTAPIFRSKPEINVGFLQGGFWKYNAKPSNILPFSEDVGNVEDGEKYT 636
             L+    +  D++TA +FRSKPEI++GFL+G +WKY+AKPSNI+ F E++ +    +  T
Sbjct: 560  ILENMGPHGKDIFTASVFRSKPEIDLGFLRGCYWKYSAKPSNIVAFREEILDDGTEDTDT 619

Query: 637  IQDDIVVAQQDILAGLVRMGILHRLRYLLEAGPSVALEECILSILIAIARHSPTCAQAIM 696
            IQ D+ VA QD+ AGLVRM IL R+ +LLE  P+ ALE+ I+S+ IAIARHSP C  AI+
Sbjct: 620  IQKDVFVAGQDVAAGLVRMDILPRIYHLLETEPTAALEDSIISVTIAIARHSPKCTTAIL 679

Query: 697  KCERLVGLIINRFTMSDKIDILSFKIKSVVLLKVLACSDRNNCVAFVKTGAFPTMIWHLF 756
            K  + V  I+ RF ++ ++D+LS +I SV LLKVLA  D++ C+ FVK G F  + WHLF
Sbjct: 680  KYPKFVQTIVKRFQLNKRMDVLSSQINSVRLLKVLARYDQSTCMEFVKNGTFNAVTWHLF 739

Query: 757  HYITSIDHWVKSGKEKCKLSSALMVEQLRLWKVCIQDGYCVSYFSDVFPALCLWLSPPNF 816
             + +S+D WVK GK+ CKLSS LMVEQLR WKVCI  G CVS F ++FPALCLWLS P+F
Sbjct: 740  QFTSSLDSWVKLGKQNCKLSSTLMVEQLRFWKVCIHSGCCVSRFPELFPALCLWLSCPSF 799

Query: 817  DKLVENNVLREFATICTEVYHVLEALARRLPNYFSQKHLDSQELGLAGNESEIWSWSCAV 876
            +KL E N++ EF ++  E Y VLEA A  LPN +SQ         +  NES  W WS   
Sbjct: 800  EKLREKNLISEFTSVSNEAYLVLEAFAETLPNMYSQ--------NIPRNESGTWDWSYVS 859

Query: 877  PMVNLAVKWLESKSDPFISKLFASQKEIRSGFEFEGISLAPLLWVYSAVMKMLSQVFERI 936
            PM++ A+ W+          L     +   G E   +S   LLW+YS VM+ +S+V E+I
Sbjct: 860  PMIDSALSWI---------TLAPQLLKWEKGIESVSVSTTTLLWLYSGVMRTISKVLEKI 919

Query: 937  VPQDIMSLEGSGQIVPSLPEFIPRVGLEIIRNGFLSFPGAYDKKPETYPFVGNSFVEDLC 996
                  S EG  + +P LPEF+P++GL II++  LSF  A   +        +SF+E LC
Sbjct: 920  ------SAEGEEEPLPWLPEFVPKIGLAIIKHKLLSFSVADVSRFGKDSSRCSSFMEYLC 979

Query: 997  FLREHG-EFETSLASVCCLHGLMLSIMNIDRLIHLAKTERHGFPFRDYNGSREGEILMVG 1056
            FLRE   + E +LASV CLHGL  +I++I  LI  A+++    P +    + +  +L  G
Sbjct: 980  FLRERSQDDELALASVNCLHGLTRTIVSIQNLIESARSKMKA-PHQVSISTGDESVLANG 1039

Query: 1057 MFKASLIEQRSVLNLFTKVIALESDSLQLIETFGRGGPAPGVGTGWGVSGGGYWSPAVLL 1116
            +   SL E  SV   F   ++ E   +Q IE   RGG APGVG GWG SGGG+WS  VLL
Sbjct: 1040 ILAESLAELTSVSCSFRDSVSSEWPIVQSIELHKRGGLAPGVGLGWGASGGGFWSTRVLL 1099

Query: 1117 AQNDAAFVMFLIQAFQTVPTLNILTAQESL-TIQSINSALAICLVLGPRDTCLVEKTMEF 1176
            AQ  A     L+  F  +   +    Q S+  +  +NSALA+CL+ GPRD  LVE+  E+
Sbjct: 1100 AQAGAG----LLSLFLNISLSDSQNDQGSVGFMDKVNSALAMCLIAGPRDYLLVERAFEY 1159

Query: 1177 LIQAPILHHFNFYIQSFIQLNGRVKQFGWKYSEDDCLILCKTLSSHYKDRWLSPK-ESKS 1236
            +++   L H    I+S    N +   F W+ SE D   +   L+SH++ RWL  K  S +
Sbjct: 1160 VLRPHALEHLACCIKS----NKKNISFEWECSEGDYHRMSSMLASHFRHRWLQQKGRSIA 1219

Query: 1237 TKNKSNFSDKIFKKSSNSLDTIYEEESDETNRIAQD--CTCLVVQWAYQRLPLPKHWFLS 1296
             +  S       +K +  L+TI+ E+ + +N   QD       ++WA+QR+PLP HWFLS
Sbjct: 1220 EEGVSG-----VRKGTVGLETIH-EDGEMSNSSTQDKKSDSSTIEWAHQRMPLPPHWFLS 1279

Query: 1297 PVSTISDSKYVGLQKSSDAQKIVQDSSDVLEVAKSGLFFILGVEAFSTFLPDYFPSPVQS 1356
             +S +   K               +S+++LEVAK+G+FF+ G+E+ S F     PSPV S
Sbjct: 1280 AISAVHSGK---------TSTGPPESTELLEVAKAGVFFLAGLESSSGF--GSLPSPVVS 1339

Query: 1357 VPLIWKLHSLSVVLLAGMGVLDDEKSRDVYEVLQDLYGQCLNEARYSRLSERIQEKNATD 1416
            VPL+WK H+LS VLL GM +++D+ +R++Y  LQ+LYGQ L+EAR               
Sbjct: 1340 VPLVWKFHALSTVLLVGMDIIEDKNTRNLYNYLQELYGQFLDEARL-------------- 1399

Query: 1417 LPSQPENKSNLEFLMFQSEIHDSYSTFIETLVEQFAAESYGDILYGRQIVLYLHRCVEAP 1476
                  N  + E L F+S+IH++YSTF+E +VEQ+AA SYGD++YGRQ+ +YLH+CVE  
Sbjct: 1400 ------NHRDTELLRFKSDIHENYSTFLEMVVEQYAAVSYGDVVYGRQVSVYLHQCVEHS 1459

Query: 1477 VRIAAWNALNNARVLELLPPLEKCFVDAEGCLEPIEDNEAILEAYVKSWVSGALDRSASR 1536
            VR++AW  L+NARVLELLP L+KC  +A+G LEP+E+NEA+LEAY+KSW  GALDR+A+R
Sbjct: 1460 VRLSAWTVLSNARVLELLPSLDKCLGEADGYLEPVEENEAVLEAYLKSWTCGALDRAATR 1460

Query: 1537 GSVAYLLVLHHLSSYIFHSNHVANLLLRNKLSRSLLRDYSQKHQRKEMMSDLILYTAPAT 1596
            GSVAY LV+HH SS +F +     + LRNK+ ++L+RD S+K  R+ MM DL+       
Sbjct: 1520 GSVAYTLVVHHFSSLVFCNQAKDKVSLRNKIVKTLVRDLSRKRHREGMMLDLL------R 1460

Query: 1597 YRVAGQKGVCSSIKMSTVEKRLEVLKEACERNSYLLTVVEELGSAA 1634
            Y+      +   +  +  EKR+EVLKE CE NS LL  +E+L SAA
Sbjct: 1580 YKKGSANAMEEEVIAAETEKRMEVLKEGCEGNSTLLLELEKLKSAA 1460

BLAST of MC02g0332 vs. ExPASy Swiss-Prot
Match: Q9BWH6 (RNA polymerase II-associated protein 1 OS=Homo sapiens OX=9606 GN=RPAP1 PE=1 SV=3)

HSP 1 Score: 97.4 bits (241), Expect = 1.6e-18
Identity = 141/572 (24.65%), Postives = 239/572 (41.78%), Query Frame = 0

Query: 260 GTSLPDMAMGLHHS--NPEMVPCFGSNIKGEDAFSTLESQINAENRARIQRMSPEEIAEA 319
           G  LP    G  HS   P +V   G  ++ ++A    ++ I+ EN AR+Q M+PEEI + 
Sbjct: 194 GCQLP----GSSHSFQGPNLVT--GKGLRDQEAEQEAQT-IHEENIARLQAMAPEEILQE 253

Query: 320 QTEIKEKMSPALVKALK-------RRGE---EKLKKGSSKPDVSKNSELDNLQKEGTFNR 379
           Q  +  ++ P+LV  L+       + GE   E+ + G    +V+K   L +        R
Sbjct: 254 QQRLLAQLDPSLVAFLRSHSHTQEQTGETASEEQRPGGPSANVTKEEPLMSAFASEPRKR 313

Query: 380 YDSLCVENGVTSANSTVKDTKSGLQNVSVQKF----DLGSSTWSAWNERVEAVRLLRFSL 439
            D L  E    +   T +     +  V ++K     DL         ER++A    RFSL
Sbjct: 314 -DKLEPEAPALALPVTPQKEWLHMDTVELEKLHWTQDLPPVRRQQTQERMQA----RFSL 373

Query: 440 EGNIVESCSFQQSENGDLAVHGYSTENVTSRDFLRTDGDPSAAGYTIKEAVALTRSVIPG 499
           +G ++       +  G L  HG   E                AGY+++E   LTRS +  
Sbjct: 374 QGELLAPDVDLPTHLG-LHHHGEEAER---------------AGYSLQELFHLTRSQVSQ 433

Query: 500 QRVLGLHVISNVLDKALLNTHQKPVGSAMVKDGISIDYNAIWAYTLGPEPELALSLRICL 559
           QR L LHV++ V+ +A         G  +    +S+  +A + +           LR  L
Sbjct: 434 QRALALHVLAQVISRA----QAGEFGDRLAGSVLSLLLDAGFLFL----------LRFSL 493

Query: 560 DDNHSSVVLACAEVIQCILGCNLNEIFFDTLQKTSTYEMDLYTAPIFRSKPEINVGFLQG 619
           DD    V+      ++ +L    +E   + L  T ++     T P+  S+ +        
Sbjct: 494 DDRVDGVIATAIRALRALLVAPGDE---ELLDSTFSWYHGALTFPLMPSQED-------- 553

Query: 620 GFWKYNAKPSNILPFSEDVGNVEDGEKYTIQDDIVVAQQDILAGLVRMGILHRLRYLLEA 679
              K +       P  +      + E     D   +A+ D++ GL+   +L RLRY+LE 
Sbjct: 554 ---KEDEDEDEECPAGKAKRKSPEEESRPPPD---LARHDVIKGLLATSLLPRLRYVLEV 613

Query: 680 ---GPSVALEECILSILIAIARHSPTCAQAIMKCERLVGLIINRFTMSDKIDILS----- 739
              GP+V L+  IL++LI +ARHS   A  +++C RL+  I+  F  +    + +     
Sbjct: 614 TYPGPAVVLD--ILAVLIRLARHSLESATRVLECPRLIETIVREFLPTSWSPVGAGPTPS 673

Query: 740 -FKI---KSVVLLKVLACSDRNNCVAFVKTGAFPTMIWHLFHYITSIDHWVKSGKEKCKL 799
            +K+    ++ LL+VLA + RN     + +    +    L   I      +    E+ ++
Sbjct: 674 LYKVPCATAMKLLRVLASAGRNIAARLLSSFDLRS---RLCRIIAEAPQELALPPEEAEM 698

Query: 800 SSALMVEQLRLWKVCIQDGYCVSYFSDVFPAL 804
              L  E LRLW V    G     + +++P L
Sbjct: 734 ---LSTEALRLWAVAASYGQGGYLYRELYPVL 698

BLAST of MC02g0332 vs. ExPASy Swiss-Prot
Match: A0JN53 (RNA polymerase II-associated protein 1 OS=Bos taurus OX=9913 GN=RPAP1 PE=2 SV=1)

HSP 1 Score: 97.1 bits (240), Expect = 2.0e-18
Identity = 148/635 (23.31%), Postives = 253/635 (39.84%), Query Frame = 0

Query: 282 GSNIKGEDAFSTLESQINAENRARIQRMSPEEIAEAQTEIKEKMSPALVKALKRRG--EE 341
           G  +KG++A    ++ I+ EN AR+Q ++PEEI + Q  +  ++ P+LV  LK      E
Sbjct: 212 GKGLKGQEAEQEAQT-IHEENVARLQALAPEEILQEQQRLLAQLDPSLVAFLKSHSCTRE 271

Query: 342 KLKKGSSKPDVSKNSELDNLQKEGTF-------NRYDSLCVENGVTSANSTVKDTKSGLQ 401
           + ++ +++         + + KE           + + L  E    +   T +     + 
Sbjct: 272 QAEEKATREQRPGRPSAEVIGKEAIAPTSASVPRQENELEPETPALALPVTPQKEWLHMD 331

Query: 402 NVSVQKF----DLGSSTWSAWNERVEAVRLLRFSLEGNIVESCSFQQSENGDLAVHGYST 461
            V ++K     DL         ER++A    RFSL+G ++       +  G L  HG   
Sbjct: 332 TVELEKLHWTQDLPPLRRQQTQERMQA----RFSLQGELLAPDMDLPTHLG-LHHHGEEA 391

Query: 462 ENVTSRDFLRTDGDPSAAGYTIKEAVALTRSVIPGQRVLGLHVISNVLDKALLNTHQKPV 521
           E                AGY+++E   LTRS +  QR L LHV++ V+ +A         
Sbjct: 392 ER---------------AGYSLQELFHLTRSQVSQQRALALHVLAQVIGRA----QAGEF 451

Query: 522 GSAMVKDGISIDYNAIWAYTLGPEPELALSLRICLDDNHSSVVLACAEVIQCILGCNLNE 581
           G  +V   + +  +A + +           LR  LDD    V+ A    ++ +L    +E
Sbjct: 452 GDRLVGSVLHLLLDAGFLFL----------LRFSLDDRVDGVIAAAVRALRALLVAPGDE 511

Query: 582 IFFDTLQKTSTYEMDLYTAPIFRSKPEINVGFLQGGFWKYNAKPSNILPFSEDVGNVEDG 641
              D     ST+                         W + A    ++P  ED  + ++ 
Sbjct: 512 ELLD-----STFS------------------------WYHGALMFALMPSQEDKEDEDED 571

Query: 642 EKYTIQDDIV------------VAQQDILAGLVRMGILHRLRYLLEA---GPSVALEECI 701
           E+   +                +A+ DI+ GL+   +L RLRY+LE    GPSV L+  I
Sbjct: 572 EEPPAEKAKTKSPEEGNRPPSDLARHDIIKGLLATNLLPRLRYVLEVTCPGPSVVLD--I 631

Query: 702 LSILIAIARHSPTCAQAIMKCERLVGLIINRFTMSDKIDILSFKIKS---------VVLL 761
           L++LI +ARHS   A  +++C RLV  ++  F  +    + S    S         + LL
Sbjct: 632 LTVLIRLARHSLESATRVLECPRLVETVVREFLPTSWSPMGSGPTSSLHRVPCAPAMKLL 691

Query: 762 KVLACSDRNNCVAFVKTGAFPTMIWHLFHYITSIDHWVKSGKEKCKL----SSALMVEQL 821
           +VLA + R N  A + +G         F   + +  ++    +   L    +  L  E  
Sbjct: 692 RVLASASR-NIAARLLSG---------FDLRSRLSRFIAEDPQDLALPLEEAETLSTEAF 751

Query: 822 RLWKVCIQDGYCVSYFSDVFPALCLWL--------SPPNFDKLVENNVLREFATICTEVY 868
           RLW V    G     + +++P L   L        SPP     V+       A++ T + 
Sbjct: 752 RLWAVAASYGLGSDLYRELYPVLMQALQDVPKELSSPPPRPLAVQ-----RIASLLTLLT 762

BLAST of MC02g0332 vs. ExPASy Swiss-Prot
Match: Q3T1I9 (RNA polymerase II-associated protein 1 OS=Rattus norvegicus OX=10116 GN=Rpap1 PE=1 SV=1)

HSP 1 Score: 96.3 bits (238), Expect = 3.5e-18
Identity = 139/589 (23.60%), Postives = 240/589 (40.75%), Query Frame = 0

Query: 260 GTSLPDMAMGLHHSNPEMVPCFGSNIKGEDAFSTLESQINAENRARIQRMSPEEIAEAQT 319
           G+ LP  +   H   P ++   G  ++ + A   +++ I+ EN AR+Q M PEEI + Q 
Sbjct: 194 GSQLPGRSHSFH--RPNLIT--GKGLRSQAAVQEVQT-IHEENVARLQAMDPEEILKEQQ 253

Query: 320 EIKEKMSPALVKALKRRGEEK--------LKKGSSKPDVSKNSELDNLQ----KEGTFNR 379
           ++  ++ P+LV  L+     +         ++   +P V  + E   +     + GT ++
Sbjct: 254 QLLAQLDPSLVAFLRAHNHTREQTETKATKEQNPERPSVPVSKEEPIMSTCTGESGTRDK 313

Query: 380 Y-----DSLCVENGVTSANSTVKDTKSGLQNVSVQKF----DLGSSTWSAWNERVEAVRL 439
                 D L           T       +  V ++K     DL         ER++A   
Sbjct: 314 LEDKLEDKLQPRTPALKLPMTPNKEWLHMDTVELEKLHWTQDLPPLRRQQTQERMQA--- 373

Query: 440 LRFSLEGNIVESCSFQQSENGDLAVHGYSTENVTSRDFLRTDGDPSAAGYTIKEAVALTR 499
            RFSL+G ++E      +  G L  HG   E                AGY+++E   LTR
Sbjct: 374 -RFSLQGELLEPDVDLPTHLG-LHHHGEEAER---------------AGYSLQELFHLTR 433

Query: 500 SVIPGQRVLGLHVISNVLDKALLNTHQKPVGSAMVKDGISIDYNAIWAYTLGPEPELALS 559
           S +  QR L LHV+S+++ +A         G  +V   + +  +A + +           
Sbjct: 434 SQVSQQRALALHVLSHIVGRA----QAGEFGDRLVGSVLRLLLDAGFLFL---------- 493

Query: 560 LRICLDDNHSSVVLACAEVIQCILGCNLNEIFFDTLQKTSTYEMDLYTAPIFRSKPEINV 619
           LR  LDD   SV+ A    ++ +L    +E   D     ST+                  
Sbjct: 494 LRFSLDDRIDSVIAAAVRALRALLVAPGDEELLD-----STFS----------------- 553

Query: 620 GFLQGGFWKYNAKPSNILPFSEDVGNVEDGEKYT------------IQDDIVVAQQDILA 679
                  W + A    ++P  +D  + ++ E+ T             +    +A+ D++ 
Sbjct: 554 -------WYHGASVFPMMPSHDDKEDEDEDEELTKEKVNRKTPEEGSRPPPDLARHDVIK 613

Query: 680 GLVRMGILHRLRYLLEA---GPSVALEECILSILIAIARHSPTCAQAIMKCERLVGLIIN 739
           GL+   +L R RY+LE    GPSV L+  IL++LI +ARHS   A  +++C RL+  I+ 
Sbjct: 614 GLLATNLLPRFRYVLEVTCPGPSVVLD--ILAVLIRLARHSLESAMRVLECPRLMETIVR 673

Query: 740 RFTMSDKIDI------LSFKI---KSVVLLKVLACSDRNNCVAFVKTGAFPTMIWHLFHY 799
            F  +    I        +K+    ++ LL+VLA + RN  +A     +F      L  +
Sbjct: 674 EFLPTSWSPIGVGPAPSLYKVPCAAAMKLLRVLASAGRN--IAARLLSSFDVRS-RLCRF 706

Query: 800 ITSIDHWVKSGKEKCKLSSALMVEQLRLWKVCIQDGYCVSYFSDVFPAL 804
           I      +    E+ ++   L  E  RLW V    G     + +++P L
Sbjct: 734 IAEAPRDLALPFEEAEI---LTTEAFRLWAVAASYGQGGDLYRELYPVL 706

BLAST of MC02g0332 vs. ExPASy Swiss-Prot
Match: Q80TE0 (RNA polymerase II-associated protein 1 OS=Mus musculus OX=10090 GN=Rpap1 PE=1 SV=2)

HSP 1 Score: 90.5 bits (223), Expect = 1.9e-16
Identity = 143/578 (24.74%), Postives = 234/578 (40.48%), Query Frame = 0

Query: 263 LPDMAMGLHHSNPEMVPCFGSNIKGEDAFSTLESQINAENRARIQRMSPEEIAEAQTEIK 322
           LP  + G H   P +V   G  ++ + A   +++ I+ EN AR+Q M PEEI + Q ++ 
Sbjct: 197 LPGRSHGFH--RPNLVT--GKGLRSKVAEQEVQT-IHEENVARLQAMDPEEILKEQQQLL 256

Query: 323 EKMSPALVKALK-------RRGEEKLKKGSSK-PDVSKNSEL-------------DNLQK 382
            ++ P+LV  L+       + G +  KK S K P V    E              D L++
Sbjct: 257 AQLDPSLVAFLRSHSQVQEQTGTKATKKQSPKRPSVLVTKEEPVTSTRTREPRTGDKLEE 316

Query: 383 --EGTF--NRYDSLCVENGVTSANSTVKDTKSGLQNVSVQKFDLGSSTWSAWNERVEAVR 442
             E T      D L           T       +  V + K            ++ +   
Sbjct: 317 KPEATVEDKMEDKLQPRTPALKLPMTPSKDWLHMDTVELDKLHWTQDLPPLRRQQTQERM 376

Query: 443 LLRFSLEGNIVESCSFQQSENGDLAVHGYSTENVTSRDFLRTDGDPSAAGYTIKEAVALT 502
             RFSL+G ++       +  G L  HG   E                AGY+++E   LT
Sbjct: 377 QARFSLQGELLAPDVDLPTHLG-LHHHGEEAER---------------AGYSLQELFHLT 436

Query: 503 RSVIPGQRVLGLHVISNVLDKALLNTHQKPVGSAMVKDGISIDYNAIWAYTLGPEPELAL 562
           RS +  QR L L V+S ++ +A         G  +V   + +  +A + +          
Sbjct: 437 RSQVSQQRALALQVLSQIVGRA----QAGEFGDRLVGSVLRLLLDAGFLFL--------- 496

Query: 563 SLRICLDDNHSSVVLACAEVIQCILGCNLNEIFFDTLQKTSTYEMDLYTAPIFRSKPEIN 622
            LR  LDD   SV+ A    ++ +L    +E   + L +T ++       P+  S+ +  
Sbjct: 497 -LRFSLDDRVDSVIAAAVRALRTLLVAPGDE---ELLDRTFSWYHGASVFPLMPSQDD-- 556

Query: 623 VGFLQGGFWKYNAKPSNILPFSEDVGNVEDGEKYTIQDDIVVAQQDILAGLVRMGILHRL 682
                    K +      L  +E V      E      D+  A+ D++ GL+   +L RL
Sbjct: 557 ---------KEDEDEDEELE-TEKVKRKTPEEGSRPPPDL--ARHDVIKGLLATNLLPRL 616

Query: 683 RYLLEA---GPSVALEECILSILIAIARHSPTCAQAIMKCERLVGLIINRFTMSDKIDI- 742
           RY+LE    GPSV L+  IL++LI +ARHS   A  +++C RL+  I+  F  +    I 
Sbjct: 617 RYVLEVTCPGPSVILD--ILAVLIRLARHSLESAMRVLECPRLMETIVQEFLPTSWSPIG 676

Query: 743 -----LSFKI---KSVVLLKVLACSDRNNCVAFVKTGAFPTMIWHLFHYITSIDHWVKSG 802
                  +K+    ++ LL+VLA + RN  +A      F      L  +I    H +   
Sbjct: 677 VGPTPSLYKVPCASAMKLLRVLASAGRN--IAARLLSGFDVRS-RLCRFIAEAPHDLALP 714

Query: 803 KEKCKLSSALMVEQLRLWKVCIQDGYCVSYFSDVFPAL 804
            E+ ++   L  E  RLW V    G     + +++P L
Sbjct: 737 PEEAEI---LTTEAFRLWAVAASYGQGGDLYRELYPVL 714

BLAST of MC02g0332 vs. NCBI nr
Match: XP_022140384.1 (transcriptional elongation regulator MINIYO [Momordica charantia])

HSP 1 Score: 3253 bits (8434), Expect = 0.0
Identity = 1643/1645 (99.88%), Postives = 1644/1645 (99.94%), Query Frame = 0

Query: 1    MEKTTQSSRRKQQKSSARAKVFGAKALQLSEDDASRIVGGIVEKGISDSHQNEPFVSLAP 60
            MEKTTQSSRRKQQKSSARAKVFGAKALQLSEDDASRIVGGIVEKGISDSHQNEPFVSLAP
Sbjct: 1    MEKTTQSSRRKQQKSSARAKVFGAKALQLSEDDASRIVGGIVEKGISDSHQNEPFVSLAP 60

Query: 61   PKPSVLPFPVARHRSHGPHWVSVTSKKGGDNIKHDRQENGEDDESLIEIESIANFANPIQ 120
            PKPSVLPFPVARHRSHGPHWVSVTSKKGGDNIKHDRQENGEDDESLIEIESIANFANPIQ
Sbjct: 61   PKPSVLPFPVARHRSHGPHWVSVTSKKGGDNIKHDRQENGEDDESLIEIESIANFANPIQ 120

Query: 121  RKNKNSLDFGRWREVVRGHNHDAANKEEKKVAGLAKNENLNRAGEAINTVDDTMMSCKPL 180
            RKNKNSLDFGRWREVVRGHNHDAANKEEKKVAGLAKNENLNRAGEAINTVDDTMMSCKPL
Sbjct: 121  RKNKNSLDFGRWREVVRGHNHDAANKEEKKVAGLAKNENLNRAGEAINTVDDTMMSCKPL 180

Query: 181  SADVLAPILMNDEHNSSGFVNDPTGMRTKDSGSDFVSSTNNAEIKGLDQLCLWKDFQDVD 240
            SADVLAPILMNDEHNSSGFVNDPTGMRTKDSGSDFVSSTNNAEIKGLDQLCLWKDFQDVD
Sbjct: 181  SADVLAPILMNDEHNSSGFVNDPTGMRTKDSGSDFVSSTNNAEIKGLDQLCLWKDFQDVD 240

Query: 241  DRSGHVSESVEINEGMPVDGTSLPDMAMGLHHSNPEMVPCFGSNIKGEDAFSTLESQINA 300
            DRSGHVSESVEINEGMPVDGTSLPDMAMGLHHSNPEMVPCFGSNIKGEDAFSTLESQINA
Sbjct: 241  DRSGHVSESVEINEGMPVDGTSLPDMAMGLHHSNPEMVPCFGSNIKGEDAFSTLESQINA 300

Query: 301  ENRARIQRMSPEEIAEAQTEIKEKMSPALVKALKRRGEEKLKKGSSKPDVSKNSELDNLQ 360
            ENRARIQRMSPEEIAEAQTEIKEKMSPALVKALKRRGEEKLKKGSSKPDVSKNSELDNLQ
Sbjct: 301  ENRARIQRMSPEEIAEAQTEIKEKMSPALVKALKRRGEEKLKKGSSKPDVSKNSELDNLQ 360

Query: 361  KEGTFNRYDSLCVENGVTSANSTVKDTKSGLQNVSVQKFDLGSSTWSAWNERVEAVRLLR 420
            KEGTFNRYDSLCVENGVTSANSTVKDTKSGLQNVSVQKFDLGSSTWSAWNERVEAVRLLR
Sbjct: 361  KEGTFNRYDSLCVENGVTSANSTVKDTKSGLQNVSVQKFDLGSSTWSAWNERVEAVRLLR 420

Query: 421  FSLEGNIVESCSFQQSENGDLAVHGYSTENVTSRDFLRTDGDPSAAGYTIKEAVALTRSV 480
            FSLEGNIVESCSFQQSENGDLAVHGYSTENVTSRDFLRTDGDPSAAGYTIKEAVALTRSV
Sbjct: 421  FSLEGNIVESCSFQQSENGDLAVHGYSTENVTSRDFLRTDGDPSAAGYTIKEAVALTRSV 480

Query: 481  IPGQRVLGLHVISNVLDKALLNTHQKPVGSAMVKDGISIDYNAIWAYTLGPEPELALSLR 540
            IPGQRVLGLHVISNVLDKALLNTHQKPVGSAMVKDGISIDYNAIWAYTLGPEPELALSLR
Sbjct: 481  IPGQRVLGLHVISNVLDKALLNTHQKPVGSAMVKDGISIDYNAIWAYTLGPEPELALSLR 540

Query: 541  ICLDDNHSSVVLACAEVIQCILGCNLNEIFFDTLQKTSTYEMDLYTAPIFRSKPEINVGF 600
            ICLDDNHSSVVLACAEVIQCILGCNLNEIFFDTLQKTSTYEMDLYTAPIFRSKPEINVGF
Sbjct: 541  ICLDDNHSSVVLACAEVIQCILGCNLNEIFFDTLQKTSTYEMDLYTAPIFRSKPEINVGF 600

Query: 601  LQGGFWKYNAKPSNILPFSEDVGNVEDGEKYTIQDDIVVAQQDILAGLVRMGILHRLRYL 660
            LQGGFWKYNAKPSNILPFSEDVGNVEDGEKYTIQDDIVVAQQDILAGLVRMGILHRLRYL
Sbjct: 601  LQGGFWKYNAKPSNILPFSEDVGNVEDGEKYTIQDDIVVAQQDILAGLVRMGILHRLRYL 660

Query: 661  LEAGPSVALEECILSILIAIARHSPTCAQAIMKCERLVGLIINRFTMSDKIDILSFKIKS 720
            LEAGPSVALEECILSILIAIARHSPTCAQAIMKCERLVGLIINRFTMSDKIDILSFKIKS
Sbjct: 661  LEAGPSVALEECILSILIAIARHSPTCAQAIMKCERLVGLIINRFTMSDKIDILSFKIKS 720

Query: 721  VVLLKVLACSDRNNCVAFVKTGAFPTMIWHLFHYITSIDHWVKSGKEKCKLSSALMVEQL 780
            VVLLKVLACSDRNNCVAFVKTGAFPTMIWHLFHYITSIDHWVKSGKEKCKLSSALMVEQL
Sbjct: 721  VVLLKVLACSDRNNCVAFVKTGAFPTMIWHLFHYITSIDHWVKSGKEKCKLSSALMVEQL 780

Query: 781  RLWKVCIQDGYCVSYFSDVFPALCLWLSPPNFDKLVENNVLREFATICTEVYHVLEALAR 840
            RLWKVCIQDGYCVSYFSDVFPALCLWLSPPNFDKLVENNVLREFATICTEVYHVLEALAR
Sbjct: 781  RLWKVCIQDGYCVSYFSDVFPALCLWLSPPNFDKLVENNVLREFATICTEVYHVLEALAR 840

Query: 841  RLPNYFSQKHLDSQELGLAGNESEIWSWSCAVPMVNLAVKWLESKSDPFISKLFASQKEI 900
            RLPNYFSQKHLDSQELGLAGNESEIWSWSCAVPMVNLAVKWLESKSDPFISKLFASQKEI
Sbjct: 841  RLPNYFSQKHLDSQELGLAGNESEIWSWSCAVPMVNLAVKWLESKSDPFISKLFASQKEI 900

Query: 901  RSGFEFEGISLAPLLWVYSAVMKMLSQVFERIVPQDIMSLEGSGQIVPSLPEFIPRVGLE 960
            RSGFEFEGISLAPLLWVYSAVMKMLSQVFERIVPQDIMSLEGSGQIVPSLPEFIPRVGLE
Sbjct: 901  RSGFEFEGISLAPLLWVYSAVMKMLSQVFERIVPQDIMSLEGSGQIVPSLPEFIPRVGLE 960

Query: 961  IIRNGFLSFPGAYDKKPETYPFVGNSFVEDLCFLREHGEFETSLASVCCLHGLMLSIMNI 1020
            IIRNGFLSFPGAYDKKPETYPFVGNSFVEDLCFLREHGEFETSLASVCCLHGLMLSIMNI
Sbjct: 961  IIRNGFLSFPGAYDKKPETYPFVGNSFVEDLCFLREHGEFETSLASVCCLHGLMLSIMNI 1020

Query: 1021 DRLIHLAKTERHGFPFRDYNGSREGEILMVGMFKASLIEQRSVLNLFTKVIALESDSLQL 1080
            DRLIHLAKTERHGFPFRDYNGSREGEILMVGMFKASLIEQRSVLNLFTKVIALESDSLQL
Sbjct: 1021 DRLIHLAKTERHGFPFRDYNGSREGEILMVGMFKASLIEQRSVLNLFTKVIALESDSLQL 1080

Query: 1081 IETFGRGGPAPGVGTGWGVSGGGYWSPAVLLAQNDAAFVMFLIQAFQTVPTLNILTAQES 1140
            IETFGRGGPAPGVGTGWGVSGGGYWSPAVLLAQNDAAFVMFLIQAFQTVPTLNILTAQES
Sbjct: 1081 IETFGRGGPAPGVGTGWGVSGGGYWSPAVLLAQNDAAFVMFLIQAFQTVPTLNILTAQES 1140

Query: 1141 LTIQSINSALAICLVLGPRDTCLVEKTMEFLIQAPILHHFNFYIQSFIQLNGRVKQFGWK 1200
            LTIQSINSALAICLVLGPRDTCLVEKTMEFLIQAPILHHFNFYIQSFIQLNGRVKQFGWK
Sbjct: 1141 LTIQSINSALAICLVLGPRDTCLVEKTMEFLIQAPILHHFNFYIQSFIQLNGRVKQFGWK 1200

Query: 1201 YSEDDCLILCKTLSSHYKDRWLSPKESKSTKNKSNFSDKIFKKSSNSLDTIYEEESDETN 1260
            YSEDDCLILCKTLSSHYKDRWLSPKESKSTKNKSNFSDKIFKKSSNSLDTIYEEESDETN
Sbjct: 1201 YSEDDCLILCKTLSSHYKDRWLSPKESKSTKNKSNFSDKIFKKSSNSLDTIYEEESDETN 1260

Query: 1261 RIAQDCTCLVVQWAYQRLPLPKHWFLSPVSTISDSKYVGLQKSSDAQKIVQDSSDVLEVA 1320
            RIAQDCTCLVVQWAYQRLPLPKHWFLSPVSTI DSKYVGLQKSSDAQKIVQDSSDVLEVA
Sbjct: 1261 RIAQDCTCLVVQWAYQRLPLPKHWFLSPVSTICDSKYVGLQKSSDAQKIVQDSSDVLEVA 1320

Query: 1321 KSGLFFILGVEAFSTFLPDYFPSPVQSVPLIWKLHSLSVVLLAGMGVLDDEKSRDVYEVL 1380
            KSGLFFILGVEAFSTFLPDYFPSPVQSVPLIWKLHSLSVVLLAGMGVLDDEKSRDVYEVL
Sbjct: 1321 KSGLFFILGVEAFSTFLPDYFPSPVQSVPLIWKLHSLSVVLLAGMGVLDDEKSRDVYEVL 1380

Query: 1381 QDLYGQCLNEARYSRLSERIQEKNATDLPSQPENKSNLEFLMFQSEIHDSYSTFIETLVE 1440
            QDLYGQCLN+ARYSRLSERIQEKNATDLPSQPENKSNLEFLMFQSEIHDSYSTFIETLVE
Sbjct: 1381 QDLYGQCLNKARYSRLSERIQEKNATDLPSQPENKSNLEFLMFQSEIHDSYSTFIETLVE 1440

Query: 1441 QFAAESYGDILYGRQIVLYLHRCVEAPVRIAAWNALNNARVLELLPPLEKCFVDAEGCLE 1500
            QFAAESYGDILYGRQIVLYLHRCVEAPVRIAAWNALNNARVLELLPPLEKCFVDAEGCLE
Sbjct: 1441 QFAAESYGDILYGRQIVLYLHRCVEAPVRIAAWNALNNARVLELLPPLEKCFVDAEGCLE 1500

Query: 1501 PIEDNEAILEAYVKSWVSGALDRSASRGSVAYLLVLHHLSSYIFHSNHVANLLLRNKLSR 1560
            PIEDNEAILEAYVKSWVSGALDRSASRGSVAYLLVLHHLSSYIFHSNHVANLLLRNKLSR
Sbjct: 1501 PIEDNEAILEAYVKSWVSGALDRSASRGSVAYLLVLHHLSSYIFHSNHVANLLLRNKLSR 1560

Query: 1561 SLLRDYSQKHQRKEMMSDLILYTAPATYRVAGQKGVCSSIKMSTVEKRLEVLKEACERNS 1620
            SLLRDYSQKHQRKEMMSDLILYTAPATYRVAGQKGVCSSIKMSTVEKRLEVLKEACERNS
Sbjct: 1561 SLLRDYSQKHQRKEMMSDLILYTAPATYRVAGQKGVCSSIKMSTVEKRLEVLKEACERNS 1620

Query: 1621 YLLTVVEELGSAAKHKLSAMCSQLA 1645
            YLLTVVEELGSAAKHKLSAMCSQLA
Sbjct: 1621 YLLTVVEELGSAAKHKLSAMCSQLA 1645

BLAST of MC02g0332 vs. NCBI nr
Match: XP_023512432.1 (transcriptional elongation regulator MINIYO [Cucurbita pepo subsp. pepo])

HSP 1 Score: 2548 bits (6605), Expect = 0.0
Identity = 1309/1640 (79.82%), Postives = 1428/1640 (87.07%), Query Frame = 0

Query: 1    MEKTTQSSRRKQQKSSARAKVFGAKALQLSEDDASRIVGGIVEKGISDSHQNEPFVSLAP 60
            MEK TQSSRR Q KSSARAKVFG  ALQLSE DASR+VGGIVEKGISD+ Q++PF S+AP
Sbjct: 1    MEKKTQSSRRSQPKSSARAKVFGTNALQLSEHDASRLVGGIVEKGISDAEQSKPFASVAP 60

Query: 61   PKPSVLPFPVARHRSHGPHWVSVTSKKGGDNIKHDRQENGEDDESLIEIESIANFANPIQ 120
            P+PSVLPFPVARHRSHGPHW SVTSK GGDNI+ DR  + E+DE L+EI+SIANFANPIQ
Sbjct: 61   PRPSVLPFPVARHRSHGPHWESVTSKMGGDNIRDDRPGDEEEDERLMEIDSIANFANPIQ 120

Query: 121  RKNKNSLDFGRWREVVRGHNHDAANKEEKKVAGLAKNENLNRAGEAINTVDDTMMSCKPL 180
            RK K+SLDFGRWRE V GHNHDAA+ EE KVA LAK E+L RAGEA NT D+  MSC+PL
Sbjct: 121  RKKKSSLDFGRWREAVPGHNHDAASGEENKVASLAKTEDLIRAGEANNTRDN--MSCEPL 180

Query: 181  SADVLAPILMNDEHNSSGFVNDPTGMRTKDSGSDFVSSTNNAEIKGLDQLCLWKDFQDVD 240
            SA VLAP LMN EH+SS FVN PTG +T  +G +F  S NN E+KGLD+  + ++ QD  
Sbjct: 181  SAGVLAPSLMNIEHSSSDFVNKPTGNKTNAAGLEFARSMNNVELKGLDKQHIPENLQDDY 240

Query: 241  DRSGHVSESVEINEGMPVDGTSLPDMAMGLHHSNPEMVPCFGSNIKGEDAFSTLESQINA 300
            D+ GH+SES E+ EG+P+DGTSL DMA  LHH N EMVPCF SNIKGEDAFSTLESQI+A
Sbjct: 241  DQWGHISES-EVKEGVPLDGTSLQDMATRLHHLNSEMVPCFESNIKGEDAFSTLESQIDA 300

Query: 301  ENRARIQRMSPEEIAEAQTEIKEKMSPALVKALKRRGEEKLKKGSSKPDVSKNSELDNLQ 360
            EN ARIQRMS EEIAEAQ EI EKMSPAL+K LK RGE KLKKGSSKPDVS + EL NLQ
Sbjct: 301  ENCARIQRMSLEEIAEAQAEIMEKMSPALLKTLKMRGEGKLKKGSSKPDVSNDYELGNLQ 360

Query: 361  KEGTFNRYDSLCVENGVTSANSTVKDTKSGLQNVSVQKFDLGSSTWSAWNERVEAVRLLR 420
            KE T +R  S  +ENGVTS  + +K  KSGLQNV+VQKFD GSS W+AWNERVEAVR LR
Sbjct: 361  KESTHDRNGSPNIENGVTSGTTALKYRKSGLQNVAVQKFDSGSSAWNAWNERVEAVRSLR 420

Query: 421  FSLEGNIVESCSFQQSENGDLAVHGYSTENVTSRDFLRTDGDPSAAGYTIKEAVALTRSV 480
            FSLEGNI+ES SFQQSEN    VHGYSTENV SRDFLRT+GDPSAAGYTIKEAVALTRSV
Sbjct: 421  FSLEGNILESYSFQQSEN----VHGYSTENVASRDFLRTEGDPSAAGYTIKEAVALTRSV 480

Query: 481  IPGQRVLGLHVISNVLDKALLNTHQKPVGSAMVKDGISIDYNAIWAYTLGPEPELALSLR 540
            IPGQRVLGLHVISNVLDKA LNT  K VGS MVKD  S+DYNAIWAY LGPEPELALSLR
Sbjct: 481  IPGQRVLGLHVISNVLDKASLNTRLKQVGSTMVKDSSSVDYNAIWAYILGPEPELALSLR 540

Query: 541  ICLDDNHSSVVLACAEVIQCILGCNLNEIFFDTLQKTSTYEMDLYTAPIFRSKPEINVGF 600
            +CLDDNH+SV+LACAEVIQC+L CNLNE FFDTL+KTSTYE DL TA +FRSKPEIN GF
Sbjct: 541  MCLDDNHNSVILACAEVIQCVLSCNLNETFFDTLEKTSTYEKDLCTAAVFRSKPEINAGF 600

Query: 601  LQGGFWKYNAKPSNILPFSEDVGNVEDGEKYTIQDDIVVAQQDILAGLVRMGILHRLRYL 660
            L GGFWKY+AKPSNILPFSEDV NVEDGEKYTIQDDIVVAQQDI AGLVRMG+L RLRYL
Sbjct: 601  LHGGFWKYSAKPSNILPFSEDVENVEDGEKYTIQDDIVVAQQDIAAGLVRMGLLPRLRYL 660

Query: 661  LEAGPSVALEECILSILIAIARHSPTCAQAIMKCERLVGLIINRFTMSDKIDILSFKIKS 720
            LEAGPSVALE+CILSIL+AIARHSP CA+AIM CERLV LII+RFTMSDKIDILS KIKS
Sbjct: 661  LEAGPSVALEDCILSILVAIARHSPACARAIMICERLVELIIHRFTMSDKIDILSLKIKS 720

Query: 721  VVLLKVLACSDRNNCVAFVKTGAFPTMIWHLFHYITSIDHWVKSGKEKCKLSSALMVEQL 780
            VVLLKVL+ SDR NC+ FVK+GAF TMIWHL+HY +SIDHWVKSGKEKCKLSS LMVEQL
Sbjct: 721  VVLLKVLSRSDRKNCIEFVKSGAFQTMIWHLYHYTSSIDHWVKSGKEKCKLSSTLMVEQL 780

Query: 781  RLWKVCIQDGYCVSYFSDVFPALCLWLSPPNFDKLVENNVLREFATICTEVYHVLEALAR 840
            RLWKVCIQ GYCVSYFSDVFPALCLWLSPPNFDKL+ENNVLREF TI  EVYH+LEALAR
Sbjct: 781  RLWKVCIQYGYCVSYFSDVFPALCLWLSPPNFDKLIENNVLREFTTISMEVYHILEALAR 840

Query: 841  RLPNYFSQKHLDSQELGLAGNESEIWSWSCAVPMVNLAVKWLESKSDPFISKLFASQKEI 900
            RLPN+FSQKHLDSQE G AGNESE+WSWSC VP+V+LA KWLESKSDPFISK F SQK  
Sbjct: 841  RLPNFFSQKHLDSQEPGHAGNESEVWSWSCVVPIVDLATKWLESKSDPFISKFFESQKGT 900

Query: 901  RSGFEFEGISLAPLLWVYSAVMKMLSQVFERIVPQDIMSLEGSGQIVPSLPEFIPRVGLE 960
             +GF F+GISLAPLLWVYSAVMKMLSQV ERI+P DIMS EGSGQIVP +PEFIPR+GL 
Sbjct: 901  MNGFGFKGISLAPLLWVYSAVMKMLSQVVERIIPHDIMSQEGSGQIVPWIPEFIPRIGLV 960

Query: 961  IIRNGFLSFPGAYDKKPETYPFVGNSFVEDLCFLREHGEFETSLASVCCLHGLMLSIMNI 1020
            II++GFLSF  A D KPETYP   NSFVEDLCFLREHGEFETSLASVCCLHGLMLSI++I
Sbjct: 961  IIKHGFLSFADASDMKPETYPSGRNSFVEDLCFLREHGEFETSLASVCCLHGLMLSIVHI 1020

Query: 1021 DRLIHLAKTERHGFPFRDYNGSREGEILMVGMFKASLIEQRSVLNLFTKVIALESDSLQL 1080
            DRLIHLAKTE   +  +DYN SREGEIL VGMFK SLIEQ+SVL+LFTKVIALE DSLQL
Sbjct: 1021 DRLIHLAKTESQDYSPKDYNSSREGEILRVGMFKTSLIEQKSVLDLFTKVIALECDSLQL 1080

Query: 1081 IETFGRGGPAPGVGTGWGVSGGGYWSPAVLLAQNDAAFVMFLIQAFQTVPTLNILTAQES 1140
            IETFGRGGPAPGVGTGWGVSGGGYWSP VLLAQNDAAF+M L++AFQ +PTLNIL AQES
Sbjct: 1081 IETFGRGGPAPGVGTGWGVSGGGYWSPDVLLAQNDAAFLMSLVEAFQAIPTLNILIAQES 1140

Query: 1141 LTIQSINSALAICLVLGPRDTCLVEKTMEFLIQAPILHHFNFYIQSFIQLNGRVKQFGWK 1200
            LT+QSINS+LA+CLVLGP +T LVEKT+ FL QAPILH+FN YIQ+F++LNG VKQFGWK
Sbjct: 1141 LTVQSINSSLAVCLVLGPGNTGLVEKTVNFLTQAPILHNFNLYIQNFLRLNGAVKQFGWK 1200

Query: 1201 YSEDDCLILCKTLSSHYKDRWLSPKESKSTKNKSNFSDKIFKKSSNSLDTIYEEESDETN 1260
            YSEDDCLI CKTLSSHYKDRWL+PKESKS KNKSNFSDK F   + SLDTIYEE SDETN
Sbjct: 1201 YSEDDCLIFCKTLSSHYKDRWLTPKESKSMKNKSNFSDKTFMNGNVSLDTIYEE-SDETN 1260

Query: 1261 RIAQDCTCLVVQWAYQRLPLPKHWFLSPVSTISDSKYVGLQKSSDAQKIVQDSSDVLEVA 1320
             +A+DCTCL+ QWAYQRLPLP HWF SP+STI D K+VGLQ +SDAQ ++QDS D LEVA
Sbjct: 1261 GMAEDCTCLIEQWAYQRLPLPSHWFFSPISTIRDIKHVGLQ-TSDAQILMQDSGDFLEVA 1320

Query: 1321 KSGLFFILGVEAFSTFLPDYFPSPVQSVPLIWKLHSLSVVLLAGMGVLDDEKSRDVYEVL 1380
            KSGLFFILGVEAFSTFLPD FPSPV SVPLIWKLHSLSV+LLAGMGVLDDEKSRDVYEVL
Sbjct: 1321 KSGLFFILGVEAFSTFLPDGFPSPVLSVPLIWKLHSLSVLLLAGMGVLDDEKSRDVYEVL 1380

Query: 1381 QDLYGQCLNEARYSRLSERIQEKNATDLPSQPENKSNLEFLMFQSEIHDSYSTFIETLVE 1440
            QDLYGQ LNEAR  RLS  I +K+A  L SQPENKSNLEFLMFQSEIHDSYSTFIETLVE
Sbjct: 1381 QDLYGQRLNEARSCRLSVNITQKDAKHLVSQPENKSNLEFLMFQSEIHDSYSTFIETLVE 1440

Query: 1441 QFAAESYGDILYGRQIVLYLHRCVEAPVRIAAWNALNNARVLELLPPLEKCFVDAEGCLE 1500
            QF+A SYGD+LYGRQIVLYLH+CVE+P R+A WNALN ARV +LLPPLEKC  DAEG L 
Sbjct: 1441 QFSAVSYGDVLYGRQIVLYLHQCVESPTRLATWNALNGARVFDLLPPLEKCIADAEGYLH 1500

Query: 1501 PIEDNEAILEAYVKSWVSGALDRSASRGSVAYLLVLHHLSSYIFHSNHVANLLLRNKLSR 1560
            PIEDNEAILEAY+KSWVSGALD+SASRGSVAYLLVLHHLSSYIFHS  V NLLLRNKLSR
Sbjct: 1501 PIEDNEAILEAYLKSWVSGALDKSASRGSVAYLLVLHHLSSYIFHSYPVDNLLLRNKLSR 1560

Query: 1561 SLLRDYSQKHQRKEMMSDLILYTAPATYRVAGQKGVCSSIKMSTVEKRLEVLKEACERNS 1620
            SLLRDYSQKHQ K MM +L+LYT P+TY V GQKG+ +SI+ S VEKRLEVLKEACERNS
Sbjct: 1561 SLLRDYSQKHQHKAMMLNLVLYTEPSTYLVTGQKGIGTSIETSAVEKRLEVLKEACERNS 1620

Query: 1621 YLLTVVEELGSAAKHKLSAM 1640
             LLTVVEELG AAK KLS +
Sbjct: 1621 SLLTVVEELGCAAKDKLSTI 1631

BLAST of MC02g0332 vs. NCBI nr
Match: XP_022985672.1 (transcriptional elongation regulator MINIYO [Cucurbita maxima])

HSP 1 Score: 2542 bits (6588), Expect = 0.0
Identity = 1305/1637 (79.72%), Postives = 1427/1637 (87.17%), Query Frame = 0

Query: 1    MEKTTQSSRRKQQKSSARAKVFGAKALQLSEDDASRIVGGIVEKGISDSHQNEPFVSLAP 60
            MEK TQSSRR Q KSSARA VFG  ALQLSE DASR+VGGIVEKGISD+ Q++PF S+AP
Sbjct: 1    MEKKTQSSRRSQPKSSARATVFGTNALQLSEHDASRLVGGIVEKGISDAEQSKPFASVAP 60

Query: 61   PKPSVLPFPVARHRSHGPHWVSVTSKKGGDNIKHDRQENGEDDESLIEIESIANFANPIQ 120
            P+PSVLPFPVARHRSHGPHW SVTSK GGDNI+ DR+ + E+DE L+EI+SIANFANPIQ
Sbjct: 61   PRPSVLPFPVARHRSHGPHWESVTSKMGGDNIRDDRRGDEEEDERLMEIDSIANFANPIQ 120

Query: 121  RKNKNSLDFGRWREVVRGHNHDAANKEEKKVAGLAKNENLNRAGEAINTVDDTMMSCKPL 180
            RK K+SLDFGRWRE V GHNHDAA+ EE KVA LAK +NL RAGEA NT D+  MSC+PL
Sbjct: 121  RKKKSSLDFGRWREAVPGHNHDAASGEENKVASLAKTKNLIRAGEANNTRDN--MSCEPL 180

Query: 181  SADVLAPILMNDEHNSSGFVNDPTGMRTKDSGSDFVSSTNNAEIKGLDQLCLWKDFQDVD 240
            SA VLAP LMN E++SS FVN+PTG +T  +G +F  S NN E+KGLD+  + ++ QD  
Sbjct: 181  SAGVLAPSLMNIENSSSDFVNNPTGNKTNAAGLEFARSMNNVELKGLDKQHIPENLQDDY 240

Query: 241  DRSGHVSESVEINEGMPVDGTSLPDMAMGLHHSNPEMVPCFGSNIKGEDAFSTLESQINA 300
            D+ G +SES E+ EG+P+DGTSL DMA  LHH N EMVPCF SNIKGEDAFSTLESQI+A
Sbjct: 241  DQWGRISES-EVKEGVPLDGTSLQDMATRLHHLNSEMVPCFESNIKGEDAFSTLESQIDA 300

Query: 301  ENRARIQRMSPEEIAEAQTEIKEKMSPALVKALKRRGEEKLKKGSSKPDVSKNSELDNLQ 360
            EN ARIQRMS EEIAEAQ EI EKMSPAL+K LK RG  KLKKGSSKPD S + EL NLQ
Sbjct: 301  ENCARIQRMSQEEIAEAQAEIMEKMSPALLKTLKMRGAGKLKKGSSKPDASNDYELGNLQ 360

Query: 361  KEGTFNRYDSLCVENGVTSANSTVKDTKSGLQNVSVQKFDLGSSTWSAWNERVEAVRLLR 420
            KE T +R  S  +ENGVTS  + +K   SGLQNV+VQKFD GSS W+AWNERVEAVR LR
Sbjct: 361  KESTHDRNGSTNIENGVTSGTTALKYRNSGLQNVAVQKFDSGSSAWNAWNERVEAVRSLR 420

Query: 421  FSLEGNIVESCSFQQSENGDLAVHGYSTENVTSRDFLRTDGDPSAAGYTIKEAVALTRSV 480
            FSLEGNIVES SFQQSEN    VHGYSTENV SRDFLRT+GDPSAAGYTIKEAVALTRSV
Sbjct: 421  FSLEGNIVESYSFQQSEN----VHGYSTENVASRDFLRTEGDPSAAGYTIKEAVALTRSV 480

Query: 481  IPGQRVLGLHVISNVLDKALLNTHQKPVGSAMVKDGISIDYNAIWAYTLGPEPELALSLR 540
            IPGQRVLGLHVISNVLDKA LNTH K VGS MVKDG S+DYNAIWAY LGPEPELALSLR
Sbjct: 481  IPGQRVLGLHVISNVLDKASLNTHLKQVGSTMVKDGSSVDYNAIWAYILGPEPELALSLR 540

Query: 541  ICLDDNHSSVVLACAEVIQCILGCNLNEIFFDTLQKTSTYEMDLYTAPIFRSKPEINVGF 600
            +CLDDNH+SV+LACAEVIQC+L CNLNE FFDTL+KTSTYE DL TA +FRSKPEIN GF
Sbjct: 541  MCLDDNHNSVILACAEVIQCVLSCNLNETFFDTLEKTSTYEKDLCTAAVFRSKPEINAGF 600

Query: 601  LQGGFWKYNAKPSNILPFSEDVGNVEDGEKYTIQDDIVVAQQDILAGLVRMGILHRLRYL 660
            L GGFWKY+AKPSNILPFSEDV NVEDGEKYTIQDDIVVAQQDI AGLVRMG+L RLRYL
Sbjct: 601  LHGGFWKYSAKPSNILPFSEDVENVEDGEKYTIQDDIVVAQQDIAAGLVRMGLLPRLRYL 660

Query: 661  LEAGPSVALEECILSILIAIARHSPTCAQAIMKCERLVGLIINRFTMSDKIDILSFKIKS 720
            LEAGPSVALE+C+LSIL+AIARHSP CA+AIM CERLV LII+RFTMSDKIDILS KIKS
Sbjct: 661  LEAGPSVALEDCLLSILVAIARHSPACARAIMICERLVELIIHRFTMSDKIDILSLKIKS 720

Query: 721  VVLLKVLACSDRNNCVAFVKTGAFPTMIWHLFHYITSIDHWVKSGKEKCKLSSALMVEQL 780
            VVLLKVL+ SDR NC+AFVK+GAF TMIWHL+HY +SIDHWVKSGKEKCKLSS LMVEQL
Sbjct: 721  VVLLKVLSRSDRKNCIAFVKSGAFQTMIWHLYHYTSSIDHWVKSGKEKCKLSSTLMVEQL 780

Query: 781  RLWKVCIQDGYCVSYFSDVFPALCLWLSPPNFDKLVENNVLREFATICTEVYHVLEALAR 840
            RLWKVCIQ GYCVSYFSDVFPALCLWLSPPNFDKL+ENNVLREF TI  EVYHVLEALAR
Sbjct: 781  RLWKVCIQYGYCVSYFSDVFPALCLWLSPPNFDKLIENNVLREFTTISMEVYHVLEALAR 840

Query: 841  RLPNYFSQKHLDSQELGLAGNESEIWSWSCAVPMVNLAVKWLESKSDPFISKLFASQKEI 900
            RLPN+FSQKHLDSQE G AGNESE+WSWSC VP+V+LA KWLESKSDPFISK F SQK  
Sbjct: 841  RLPNFFSQKHLDSQEPGHAGNESEVWSWSCVVPIVDLATKWLESKSDPFISKFFESQKGT 900

Query: 901  RSGFEFEGISLAPLLWVYSAVMKMLSQVFERIVPQDIMSLEGSGQIVPSLPEFIPRVGLE 960
             +GF FEGISLAPLLWVYSAVMKMLSQV ERI+P DIMS EGSGQIVP +PEFIPR+GLE
Sbjct: 901  MNGFGFEGISLAPLLWVYSAVMKMLSQVVERIIPHDIMSQEGSGQIVPWIPEFIPRIGLE 960

Query: 961  IIRNGFLSFPGAYDKKPETYPFVGNSFVEDLCFLREHGEFETSLASVCCLHGLMLSIMNI 1020
            II++GFLSF  A D KPETYP   NSFVE+LCFLREHGEFETSLASVCCLHGLMLSI++I
Sbjct: 961  IIKHGFLSFADASDMKPETYPSGRNSFVENLCFLREHGEFETSLASVCCLHGLMLSILHI 1020

Query: 1021 DRLIHLAKTERHGFPFRDYNGSREGEILMVGMFKASLIEQRSVLNLFTKVIALESDSLQL 1080
            DRLIHLAKTE   +  +DYN SREGEIL VGMFKASLIEQ+SVL+LFTKVI+LE DSLQL
Sbjct: 1021 DRLIHLAKTESPDYSPKDYNFSREGEILRVGMFKASLIEQKSVLDLFTKVISLECDSLQL 1080

Query: 1081 IETFGRGGPAPGVGTGWGVSGGGYWSPAVLLAQNDAAFVMFLIQAFQTVPTLNILTAQES 1140
            IETFGRGGPAPGVGTGWGVSGGGYWSP VLLAQNDAAF+M LI+AFQ +PTLNIL AQES
Sbjct: 1081 IETFGRGGPAPGVGTGWGVSGGGYWSPGVLLAQNDAAFLMSLIEAFQAIPTLNILIAQES 1140

Query: 1141 LTIQSINSALAICLVLGPRDTCLVEKTMEFLIQAPILHHFNFYIQSFIQLNGRVKQFGWK 1200
            LT+QSINSALA+CLVLGP +T LVE+T+ FL QAPILH+FN YIQ+F+QLNG VKQFGW+
Sbjct: 1141 LTVQSINSALAVCLVLGPGNTGLVEQTVNFLTQAPILHNFNLYIQNFLQLNGEVKQFGWE 1200

Query: 1201 YSEDDCLILCKTLSSHYKDRWLSPKESKSTKNKSNFSDKIFKKSSNSLDTIYEEESDETN 1260
            YSEDDCLI CKTLSSHYKD+WL+PKESKS KNKSNFSD+ F   + SLDTIYE  SDETN
Sbjct: 1201 YSEDDCLIFCKTLSSHYKDKWLTPKESKSMKNKSNFSDRTFMNGNVSLDTIYEG-SDETN 1260

Query: 1261 RIAQDCTCLVVQWAYQRLPLPKHWFLSPVSTISDSKYVGLQKSSDAQKIVQDSSDVLEVA 1320
             +A+DCTCL+ QWAYQRLPLP HWF SPVSTI DSK+ GLQKS DAQ ++QDS D LEVA
Sbjct: 1261 GMAEDCTCLIEQWAYQRLPLPGHWFFSPVSTICDSKHAGLQKS-DAQILMQDSGDFLEVA 1320

Query: 1321 KSGLFFILGVEAFSTFLPDYFPSPVQSVPLIWKLHSLSVVLLAGMGVLDDEKSRDVYEVL 1380
            KSGLFFILGVEAFSTFLPD FPSPV SVPLIWKLHSLSV+LL GMGVLDDEKSRDVYEVL
Sbjct: 1321 KSGLFFILGVEAFSTFLPDGFPSPVLSVPLIWKLHSLSVLLLTGMGVLDDEKSRDVYEVL 1380

Query: 1381 QDLYGQCLNEARYSRLSERIQEKNATDLPSQPENKSNLEFLMFQSEIHDSYSTFIETLVE 1440
            QDLYGQ LNEAR  RLS  + +K+A  L SQPENKSNLEFLMFQSEIHDSYSTFIETLVE
Sbjct: 1381 QDLYGQRLNEARSCRLSVHVTQKDAKHLLSQPENKSNLEFLMFQSEIHDSYSTFIETLVE 1440

Query: 1441 QFAAESYGDILYGRQIVLYLHRCVEAPVRIAAWNALNNARVLELLPPLEKCFVDAEGCLE 1500
            QF+A SYGD+LYGRQIVLYLH+CVE+P R+AAWNALN ARV +LLPPLEKC  D EG L+
Sbjct: 1441 QFSAVSYGDVLYGRQIVLYLHQCVESPTRLAAWNALNGARVFDLLPPLEKCIADPEGYLQ 1500

Query: 1501 PIEDNEAILEAYVKSWVSGALDRSASRGSVAYLLVLHHLSSYIFHSNHVANLLLRNKLSR 1560
            PIEDNEAILEAY+KSWVSGALD+SASRGSVAYLLVLHHLSSYIFHS  V NLLLRNKLSR
Sbjct: 1501 PIEDNEAILEAYLKSWVSGALDKSASRGSVAYLLVLHHLSSYIFHSYPVDNLLLRNKLSR 1560

Query: 1561 SLLRDYSQKHQRKEMMSDLILYTAPATYRVAGQKGVCSSIKMSTVEKRLEVLKEACERNS 1620
            SLLRDYSQKHQ K MM DL+LYT P+TY V GQKG+ +SI+ S VEKRLEVLKEACERNS
Sbjct: 1561 SLLRDYSQKHQHKAMMLDLVLYTEPSTYLVTGQKGIGTSIEASVVEKRLEVLKEACERNS 1620

Query: 1621 YLLTVVEELGSAAKHKL 1637
             LLTVV+ELG AAK KL
Sbjct: 1621 SLLTVVKELGCAAKDKL 1628

BLAST of MC02g0332 vs. NCBI nr
Match: KAG7010830.1 (Transcriptional elongation regulator MINIYO [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 2540 bits (6584), Expect = 0.0
Identity = 1307/1640 (79.70%), Postives = 1422/1640 (86.71%), Query Frame = 0

Query: 1    MEKTTQSSRRKQQKSSARAKVFGAKALQLSEDDASRIVGGIVEKGISDSHQNEPFVSLAP 60
            MEK  QSSRR Q KSSAR KVFG  ALQLSE DASR+VGGIVEKGISD+ Q++PF S+AP
Sbjct: 1    MEKKAQSSRRSQPKSSARGKVFGTNALQLSEHDASRLVGGIVEKGISDAEQSKPFASVAP 60

Query: 61   PKPSVLPFPVARHRSHGPHWVSVTSKKGGDNIKHDRQENGEDDESLIEIESIANFANPIQ 120
            P+PSVLPFPVARHRSHGPHW SVTSK GG+NI+ DR+ + E+DE L+EI+SIANFANPIQ
Sbjct: 61   PRPSVLPFPVARHRSHGPHWESVTSKMGGNNIRDDRRGDEEEDERLMEIDSIANFANPIQ 120

Query: 121  RKNKNSLDFGRWREVVRGHNHDAANKEEKKVAGLAKNENLNRAGEAINTVDDTMMSCKPL 180
            RK K+SLDFGRWRE V GHNH AA+ EE KVA LAK E+L RAGEA +T+D+  MSC+PL
Sbjct: 121  RKKKSSLDFGRWREAVPGHNHIAASGEENKVASLAKTEDLIRAGEANSTMDN--MSCEPL 180

Query: 181  SADVLAPILMNDEHNSSGFVNDPTGMRTKDSGSDFVSSTNNAEIKGLDQLCLWKDFQDVD 240
            SA VLAP LMN EH+SS FVN PTG +T  +G +F  S NN E+KGLD+  + ++ QD  
Sbjct: 181  SAGVLAPSLMNIEHSSSDFVNKPTGNKTNAAGLEFARSMNNVELKGLDKQHIPENLQDDY 240

Query: 241  DRSGHVSESVEINEGMPVDGTSLPDMAMGLHHSNPEMVPCFGSNIKGEDAFSTLESQINA 300
            D+ GH+SES E+ EG+P+DGTSL DMA  LHH N EMVPCF SNIKGEDAFSTLESQI+A
Sbjct: 241  DQWGHISES-EVKEGVPLDGTSLQDMATRLHHLNSEMVPCFESNIKGEDAFSTLESQIDA 300

Query: 301  ENRARIQRMSPEEIAEAQTEIKEKMSPALVKALKRRGEEKLKKGSSKPDVSKNSELDNLQ 360
            EN ARIQRMS EEIAEAQ EI EKMSPAL+K LK RGE KLKKGSSKPD S + EL NLQ
Sbjct: 301  ENCARIQRMSQEEIAEAQAEIMEKMSPALLKTLKMRGEGKLKKGSSKPDASNDYELGNLQ 360

Query: 361  KEGTFNRYDSLCVENGVTSANSTVKDTKSGLQNVSVQKFDLGSSTWSAWNERVEAVRLLR 420
            KE T +R  S  +ENGVTS  + +K   SGLQNV+VQKFD GSS W+AWNERVEAVR LR
Sbjct: 361  KESTHDRNGSPNIENGVTSGTTALKYRNSGLQNVAVQKFDSGSSAWNAWNERVEAVRSLR 420

Query: 421  FSLEGNIVESCSFQQSENGDLAVHGYSTENVTSRDFLRTDGDPSAAGYTIKEAVALTRSV 480
            FSLEGNIVES SFQQSEN    VHGYSTENV SRDFLRT+GDPSAAGYTIKEAVALTRSV
Sbjct: 421  FSLEGNIVESYSFQQSEN----VHGYSTENVASRDFLRTEGDPSAAGYTIKEAVALTRSV 480

Query: 481  IPGQRVLGLHVISNVLDKALLNTHQKPVGSAMVKDGISIDYNAIWAYTLGPEPELALSLR 540
            IPGQRVLGLHVISNVLDKA LNT  K VGS MVKD  S+DYNAIWAY LGPEPELALSLR
Sbjct: 481  IPGQRVLGLHVISNVLDKASLNTRLKQVGSTMVKDSSSVDYNAIWAYILGPEPELALSLR 540

Query: 541  ICLDDNHSSVVLACAEVIQCILGCNLNEIFFDTLQKTSTYEMDLYTAPIFRSKPEINVGF 600
            +CLDDNH+SV+LACAEVIQC+L  NLNE FFDTL+KTSTYE DL TA +FRSKPEIN GF
Sbjct: 541  MCLDDNHNSVILACAEVIQCVLSYNLNETFFDTLEKTSTYEKDLCTAAVFRSKPEINAGF 600

Query: 601  LQGGFWKYNAKPSNILPFSEDVGNVEDGEKYTIQDDIVVAQQDILAGLVRMGILHRLRYL 660
            L GGFWKY+AKPSNILP SEDV NVEDGEKYTIQDDIVVAQQDI AGLVRMG+L RLRYL
Sbjct: 601  LHGGFWKYSAKPSNILPISEDVENVEDGEKYTIQDDIVVAQQDIAAGLVRMGLLPRLRYL 660

Query: 661  LEAGPSVALEECILSILIAIARHSPTCAQAIMKCERLVGLIINRFTMSDKIDILSFKIKS 720
            LEAGPSVALE+CILSIL+AIARHSP CA+AIM CERLV LII+RFTMSDKIDILS KIKS
Sbjct: 661  LEAGPSVALEDCILSILVAIARHSPACARAIMICERLVELIIHRFTMSDKIDILSLKIKS 720

Query: 721  VVLLKVLACSDRNNCVAFVKTGAFPTMIWHLFHYITSIDHWVKSGKEKCKLSSALMVEQL 780
            VVLLKVL+ SDR NC+ FVK+GAF TMIWHL+HY +SIDHWVKSGKEKCKLSS LMVEQL
Sbjct: 721  VVLLKVLSRSDRKNCIEFVKSGAFQTMIWHLYHYTSSIDHWVKSGKEKCKLSSTLMVEQL 780

Query: 781  RLWKVCIQDGYCVSYFSDVFPALCLWLSPPNFDKLVENNVLREFATICTEVYHVLEALAR 840
            RLWKVCIQ GYCVSYFSDVFPALCLWLSPPNFDKL+ENNVLREF TI  EVYHVLEAL R
Sbjct: 781  RLWKVCIQYGYCVSYFSDVFPALCLWLSPPNFDKLIENNVLREFTTISMEVYHVLEALTR 840

Query: 841  RLPNYFSQKHLDSQELGLAGNESEIWSWSCAVPMVNLAVKWLESKSDPFISKLFASQKEI 900
            RLPN+FSQKHLDSQE G AGNESE+WSWSC VP+V+LA KWLESKSDPFISK F SQK  
Sbjct: 841  RLPNFFSQKHLDSQEPGHAGNESEVWSWSCVVPIVDLATKWLESKSDPFISKFFESQKGT 900

Query: 901  RSGFEFEGISLAPLLWVYSAVMKMLSQVFERIVPQDIMSLEGSGQIVPSLPEFIPRVGLE 960
             +GF FEGISLAPLLWVYSAVMKMLSQV ERI+P DIMS EGSGQIVP +PEFIPR+GLE
Sbjct: 901  MNGFGFEGISLAPLLWVYSAVMKMLSQVVERIIPHDIMSQEGSGQIVPWIPEFIPRIGLE 960

Query: 961  IIRNGFLSFPGAYDKKPETYPFVGNSFVEDLCFLREHGEFETSLASVCCLHGLMLSIMNI 1020
            II++GFLSF  A D KPETYP   NSFVEDLCFLREHGEFETSLASVCCLHGLMLSI++I
Sbjct: 961  IIKHGFLSFADASDMKPETYPSGRNSFVEDLCFLREHGEFETSLASVCCLHGLMLSIVHI 1020

Query: 1021 DRLIHLAKTERHGFPFRDYNGSREGEILMVGMFKASLIEQRSVLNLFTKVIALESDSLQL 1080
            DRLIHLAKTE   +  +DYN SREGEIL VGMFK SLIEQ+S+L+LFTKVIALE DSLQL
Sbjct: 1021 DRLIHLAKTESQDYSPKDYNSSREGEILRVGMFKTSLIEQKSLLDLFTKVIALECDSLQL 1080

Query: 1081 IETFGRGGPAPGVGTGWGVSGGGYWSPAVLLAQNDAAFVMFLIQAFQTVPTLNILTAQES 1140
            IETFGRGGPAPGVGTGWGVSGGGYWSP VLLAQNDAAF+M LI+AFQ +PTLNIL AQES
Sbjct: 1081 IETFGRGGPAPGVGTGWGVSGGGYWSPDVLLAQNDAAFLMSLIEAFQAIPTLNILIAQES 1140

Query: 1141 LTIQSINSALAICLVLGPRDTCLVEKTMEFLIQAPILHHFNFYIQSFIQLNGRVKQFGWK 1200
            LT+QSINSALA+CLVLGPR+T LVEKT+ FL QAPILH+FN YIQ+F+QLNG VKQFGWK
Sbjct: 1141 LTVQSINSALAVCLVLGPRNTGLVEKTVNFLTQAPILHNFNLYIQNFLQLNGEVKQFGWK 1200

Query: 1201 YSEDDCLILCKTLSSHYKDRWLSPKESKSTKNKSNFSDKIFKKSSNSLDTIYEEESDETN 1260
            YSEDDCLI CKTLSSHYKDRWL+PKESKS KNKSNFSDK F   + SLDTIYEE SDETN
Sbjct: 1201 YSEDDCLIFCKTLSSHYKDRWLTPKESKSMKNKSNFSDKTFMNGNVSLDTIYEE-SDETN 1260

Query: 1261 RIAQDCTCLVVQWAYQRLPLPKHWFLSPVSTISDSKYVGLQKSSDAQKIVQDSSDVLEVA 1320
            R+A+DCTCL+ QWAYQRLPLP HWF SP+STI DSK+VGLQKS DAQ  +QDS D LEVA
Sbjct: 1261 RMAEDCTCLIEQWAYQRLPLPGHWFFSPISTIRDSKHVGLQKS-DAQIFMQDSDDFLEVA 1320

Query: 1321 KSGLFFILGVEAFSTFLPDYFPSPVQSVPLIWKLHSLSVVLLAGMGVLDDEKSRDVYEVL 1380
            KSGLFFILGVEAFSTFLPD FPSPV SVPLIWKLHSLSV+LL GMG LDDEKSRDVYEVL
Sbjct: 1321 KSGLFFILGVEAFSTFLPDGFPSPVLSVPLIWKLHSLSVLLLTGMGFLDDEKSRDVYEVL 1380

Query: 1381 QDLYGQCLNEARYSRLSERIQEKNATDLPSQPENKSNLEFLMFQSEIHDSYSTFIETLVE 1440
            QDLY Q LNEAR  RLS  I +K+A  L SQPENKSNLEFL FQSEIHDSYSTFIETLVE
Sbjct: 1381 QDLYSQRLNEARSCRLSVNITQKDAKHLVSQPENKSNLEFLRFQSEIHDSYSTFIETLVE 1440

Query: 1441 QFAAESYGDILYGRQIVLYLHRCVEAPVRIAAWNALNNARVLELLPPLEKCFVDAEGCLE 1500
            QF+A SYGD+LYGRQIVLYLH+CVE+P R+AAWNALN ARV +LLPPLEKC  DAEG L 
Sbjct: 1441 QFSAVSYGDVLYGRQIVLYLHQCVESPTRLAAWNALNGARVFDLLPPLEKCIADAEGYLH 1500

Query: 1501 PIEDNEAILEAYVKSWVSGALDRSASRGSVAYLLVLHHLSSYIFHSNHVANLLLRNKLSR 1560
            PIEDNEAILEAY+KSWVSGALD+SASRGSVAYLLVLHHLSSYIFHS  V NLLLRNKLSR
Sbjct: 1501 PIEDNEAILEAYLKSWVSGALDKSASRGSVAYLLVLHHLSSYIFHSYPVDNLLLRNKLSR 1560

Query: 1561 SLLRDYSQKHQRKEMMSDLILYTAPATYRVAGQKGVCSSIKMSTVEKRLEVLKEACERNS 1620
            SLLRDYSQKHQ K MM DL+LYT P+TY V GQKG+ +SI+ S VEKRLEVLKEACERNS
Sbjct: 1561 SLLRDYSQKHQHKAMMLDLVLYTEPSTYLVTGQKGIGTSIETSAVEKRLEVLKEACERNS 1620

Query: 1621 YLLTVVEELGSAAKHKLSAM 1640
             LLTVVEELG AAK KLS +
Sbjct: 1621 SLLTVVEELGCAAKDKLSTI 1631

BLAST of MC02g0332 vs. NCBI nr
Match: XP_022943843.1 (transcriptional elongation regulator MINIYO [Cucurbita moschata])

HSP 1 Score: 2529 bits (6556), Expect = 0.0
Identity = 1304/1640 (79.51%), Postives = 1417/1640 (86.40%), Query Frame = 0

Query: 1    MEKTTQSSRRKQQKSSARAKVFGAKALQLSEDDASRIVGGIVEKGISDSHQNEPFVSLAP 60
            MEK  QSSRR Q KSSAR KVFG  ALQLSE DASR+VGGIVEKGISD+ Q++PF S+AP
Sbjct: 1    MEKKAQSSRRSQPKSSARGKVFGTNALQLSEHDASRLVGGIVEKGISDAEQSKPFASVAP 60

Query: 61   PKPSVLPFPVARHRSHGPHWVSVTSKKGGDNIKHDRQENGEDDESLIEIESIANFANPIQ 120
            P+PSVLPFPVARHRSHGPHW SVTSK GG+NI+ DR+ + E+DE L+EIESIANFANPIQ
Sbjct: 61   PRPSVLPFPVARHRSHGPHWESVTSKMGGNNIRDDRRGDEEEDERLMEIESIANFANPIQ 120

Query: 121  RKNKNSLDFGRWREVVRGHNHDAANKEEKKVAGLAKNENLNRAGEAINTVDDTMMSCKPL 180
            RK K+SLDFGRWRE V GHNH AA+ EE KVA LAK E+L RAGEA +T+D+  MSC+PL
Sbjct: 121  RKKKSSLDFGRWREAVPGHNHIAASGEENKVASLAKTEHLIRAGEANSTMDN--MSCEPL 180

Query: 181  SADVLAPILMNDEHNSSGFVNDPTGMRTKDSGSDFVSSTNNAEIKGLDQLCLWKDFQDVD 240
            SA VLAP LMN EH+SS FVN PTG +T  +G +F  S NN E+KGLD+  + ++ QD  
Sbjct: 181  SAGVLAPSLMNIEHSSSDFVNKPTGNKTNAAGLEFARSMNNVELKGLDKQHIPENLQDDY 240

Query: 241  DRSGHVSESVEINEGMPVDGTSLPDMAMGLHHSNPEMVPCFGSNIKGEDAFSTLESQINA 300
            D+ GH+SES E+ EG+P+DGTS  DMA  LHH N EMVPCF SNIKGEDAFSTLESQI+A
Sbjct: 241  DQWGHISES-EVKEGVPLDGTSFQDMATRLHHLNSEMVPCFESNIKGEDAFSTLESQIDA 300

Query: 301  ENRARIQRMSPEEIAEAQTEIKEKMSPALVKALKRRGEEKLKKGSSKPDVSKNSELDNLQ 360
            EN ARIQRMS EEIAEAQ EI EKM PAL K LK RGE KLKKGSSKPD S + EL NLQ
Sbjct: 301  ENCARIQRMSQEEIAEAQAEIMEKMRPALWKTLKMRGEGKLKKGSSKPDASNDYELGNLQ 360

Query: 361  KEGTFNRYDSLCVENGVTSANSTVKDTKSGLQNVSVQKFDLGSSTWSAWNERVEAVRLLR 420
            KE T +R  S  +ENGVTS  + +K   SGLQNV+VQKFD GSS W+AWNERVEAVR LR
Sbjct: 361  KESTHDRNGSPNIENGVTSGTTALKYRNSGLQNVAVQKFDSGSSAWNAWNERVEAVRSLR 420

Query: 421  FSLEGNIVESCSFQQSENGDLAVHGYSTENVTSRDFLRTDGDPSAAGYTIKEAVALTRSV 480
            FSLEGNIVES SFQQSEN    VHGYSTENV SRDFLRT+GDPSAAGYTIKEAVALTRSV
Sbjct: 421  FSLEGNIVESYSFQQSEN----VHGYSTENVASRDFLRTEGDPSAAGYTIKEAVALTRSV 480

Query: 481  IPGQRVLGLHVISNVLDKALLNTHQKPVGSAMVKDGISIDYNAIWAYTLGPEPELALSLR 540
            IPGQRVLGLHVISNVLDKA LNT  K VGS MVKD  S+DYNAIW Y LGPEPELALSLR
Sbjct: 481  IPGQRVLGLHVISNVLDKASLNTRLKQVGSTMVKDSSSVDYNAIWTYILGPEPELALSLR 540

Query: 541  ICLDDNHSSVVLACAEVIQCILGCNLNEIFFDTLQKTSTYEMDLYTAPIFRSKPEINVGF 600
            +CLDDNH+SV+LACAEVIQC+L CNLNE FFDTL+KTSTYE DL TA +FRSKPEIN GF
Sbjct: 541  MCLDDNHNSVILACAEVIQCVLSCNLNETFFDTLEKTSTYEKDLCTAAVFRSKPEINAGF 600

Query: 601  LQGGFWKYNAKPSNILPFSEDVGNVEDGEKYTIQDDIVVAQQDILAGLVRMGILHRLRYL 660
            L GGFWKY+AKPSNILP SEDV NVEDGEKYTIQDDIVVAQQDI AGLVRMG+L RLRYL
Sbjct: 601  LHGGFWKYSAKPSNILPISEDVENVEDGEKYTIQDDIVVAQQDIAAGLVRMGLLPRLRYL 660

Query: 661  LEAGPSVALEECILSILIAIARHSPTCAQAIMKCERLVGLIINRFTMSDKIDILSFKIKS 720
            LEAGPSVALE+CILSIL+AIARHSP CA+AIM CERLV LII+RFTMSDKIDILS KIKS
Sbjct: 661  LEAGPSVALEDCILSILVAIARHSPACARAIMICERLVELIIHRFTMSDKIDILSLKIKS 720

Query: 721  VVLLKVLACSDRNNCVAFVKTGAFPTMIWHLFHYITSIDHWVKSGKEKCKLSSALMVEQL 780
            VVLLKVL+ SDR NC+ FVK+GAF TMIWHL+HY +SIDHWVKSGKEKCKLSS LMVEQL
Sbjct: 721  VVLLKVLSRSDRKNCIEFVKSGAFQTMIWHLYHYTSSIDHWVKSGKEKCKLSSTLMVEQL 780

Query: 781  RLWKVCIQDGYCVSYFSDVFPALCLWLSPPNFDKLVENNVLREFATICTEVYHVLEALAR 840
            RLWKVCIQ GYCVSYFSDVFPALCLWLSPPNFDKL+ENNVLREF TI  EVYHVLEAL R
Sbjct: 781  RLWKVCIQYGYCVSYFSDVFPALCLWLSPPNFDKLIENNVLREFTTISMEVYHVLEALTR 840

Query: 841  RLPNYFSQKHLDSQELGLAGNESEIWSWSCAVPMVNLAVKWLESKSDPFISKLFASQKEI 900
            RLPN+FSQKHLDSQE G AGNESE+WSWSC VP+V+LA KWLESKSDPFISK F SQK  
Sbjct: 841  RLPNFFSQKHLDSQEPGHAGNESEVWSWSCVVPIVDLATKWLESKSDPFISKFFESQKGT 900

Query: 901  RSGFEFEGISLAPLLWVYSAVMKMLSQVFERIVPQDIMSLEGSGQIVPSLPEFIPRVGLE 960
             +GF FEGISLAPLLWVYSAVMKMLSQV ERI+P DIMS EGSGQIVP LPEFIPR+GLE
Sbjct: 901  MNGFGFEGISLAPLLWVYSAVMKMLSQVVERIIPHDIMSQEGSGQIVPWLPEFIPRIGLE 960

Query: 961  IIRNGFLSFPGAYDKKPETYPFVGNSFVEDLCFLREHGEFETSLASVCCLHGLMLSIMNI 1020
            II++GFLS     D KPETYP   NSFVEDLCFLREHGEFETSLASVCCLHGLMLSI++I
Sbjct: 961  IIKHGFLSLS---DNKPETYPSGRNSFVEDLCFLREHGEFETSLASVCCLHGLMLSIVHI 1020

Query: 1021 DRLIHLAKTERHGFPFRDYNGSREGEILMVGMFKASLIEQRSVLNLFTKVIALESDSLQL 1080
            DRLIHLAKTE   +  +DYN SREGEIL VGMFK SLIEQ+S+L+LFTKVIALE DSLQL
Sbjct: 1021 DRLIHLAKTESQDYSPKDYNSSREGEILRVGMFKTSLIEQKSLLDLFTKVIALECDSLQL 1080

Query: 1081 IETFGRGGPAPGVGTGWGVSGGGYWSPAVLLAQNDAAFVMFLIQAFQTVPTLNILTAQES 1140
            IETFGRGGPAPGVGTGWGVSGGGYWSP VLLA+NDAAF+M LI+AFQ VPTLNIL AQES
Sbjct: 1081 IETFGRGGPAPGVGTGWGVSGGGYWSPDVLLAENDAAFLMSLIEAFQAVPTLNILIAQES 1140

Query: 1141 LTIQSINSALAICLVLGPRDTCLVEKTMEFLIQAPILHHFNFYIQSFIQLNGRVKQFGWK 1200
            LT+QSINSALA+CLVLGPR+T LVEKT+ FL QAPILH+FN YIQ+F+QLNG VKQFGWK
Sbjct: 1141 LTVQSINSALAVCLVLGPRNTGLVEKTVNFLTQAPILHNFNLYIQNFLQLNGEVKQFGWK 1200

Query: 1201 YSEDDCLILCKTLSSHYKDRWLSPKESKSTKNKSNFSDKIFKKSSNSLDTIYEEESDETN 1260
            YSEDDCLI CKTLSSHYKDRWL+PKESKS KNKSNFSDK F   + SLDTIYEE SDETN
Sbjct: 1201 YSEDDCLIFCKTLSSHYKDRWLTPKESKSMKNKSNFSDKTFMNGNVSLDTIYEE-SDETN 1260

Query: 1261 RIAQDCTCLVVQWAYQRLPLPKHWFLSPVSTISDSKYVGLQKSSDAQKIVQDSSDVLEVA 1320
            R+A+DCTCL+ QWAYQRLPLP HWF SP+STI DSK+VGLQKS DAQ  +QDS D LEVA
Sbjct: 1261 RMAEDCTCLIEQWAYQRLPLPGHWFFSPISTIRDSKHVGLQKS-DAQIFMQDSDDFLEVA 1320

Query: 1321 KSGLFFILGVEAFSTFLPDYFPSPVQSVPLIWKLHSLSVVLLAGMGVLDDEKSRDVYEVL 1380
            KSGLFFILGVEAFSTFLPD FPSPV SVPLIWKLHSLSV+LL GMGVLDDEKSRDVYEVL
Sbjct: 1321 KSGLFFILGVEAFSTFLPDGFPSPVLSVPLIWKLHSLSVLLLTGMGVLDDEKSRDVYEVL 1380

Query: 1381 QDLYGQCLNEARYSRLSERIQEKNATDLPSQPENKSNLEFLMFQSEIHDSYSTFIETLVE 1440
            QDLY Q LNEAR  RLS  + +K+A  L SQPENKSNLEFL FQSEIHDSYSTFIETLVE
Sbjct: 1381 QDLYSQRLNEARSCRLSVNLTQKDAKHLVSQPENKSNLEFLRFQSEIHDSYSTFIETLVE 1440

Query: 1441 QFAAESYGDILYGRQIVLYLHRCVEAPVRIAAWNALNNARVLELLPPLEKCFVDAEGCLE 1500
            QF+A SYGD+LYGRQIVLYLH+CVE+P R+AAWNALN ARV +LLPPLEKC  DAEG L 
Sbjct: 1441 QFSAVSYGDVLYGRQIVLYLHQCVESPTRLAAWNALNGARVFDLLPPLEKCIADAEGYLH 1500

Query: 1501 PIEDNEAILEAYVKSWVSGALDRSASRGSVAYLLVLHHLSSYIFHSNHVANLLLRNKLSR 1560
            PIEDNEAILEAY+KSWVSGALD+SASRGSVAYLLVLHHLSSYIFHS  V NLLLRNKLSR
Sbjct: 1501 PIEDNEAILEAYLKSWVSGALDKSASRGSVAYLLVLHHLSSYIFHSYPVDNLLLRNKLSR 1560

Query: 1561 SLLRDYSQKHQRKEMMSDLILYTAPATYRVAGQKGVCSSIKMSTVEKRLEVLKEACERNS 1620
            SLLRDYSQKHQ K MM DL+LYT P+TY V GQKG+ +SI+ S VEKRLEVLKEACERNS
Sbjct: 1561 SLLRDYSQKHQHKAMMLDLVLYTEPSTYLVTGQKGIGTSIETSAVEKRLEVLKEACERNS 1620

Query: 1621 YLLTVVEELGSAAKHKLSAM 1640
             LLTVVEELG AAK K S +
Sbjct: 1621 SLLTVVEELGCAAKDKPSTI 1628

BLAST of MC02g0332 vs. ExPASy TrEMBL
Match: A0A6J1CFK3 (transcriptional elongation regulator MINIYO OS=Momordica charantia OX=3673 GN=LOC111011072 PE=3 SV=1)

HSP 1 Score: 3253 bits (8434), Expect = 0.0
Identity = 1643/1645 (99.88%), Postives = 1644/1645 (99.94%), Query Frame = 0

Query: 1    MEKTTQSSRRKQQKSSARAKVFGAKALQLSEDDASRIVGGIVEKGISDSHQNEPFVSLAP 60
            MEKTTQSSRRKQQKSSARAKVFGAKALQLSEDDASRIVGGIVEKGISDSHQNEPFVSLAP
Sbjct: 1    MEKTTQSSRRKQQKSSARAKVFGAKALQLSEDDASRIVGGIVEKGISDSHQNEPFVSLAP 60

Query: 61   PKPSVLPFPVARHRSHGPHWVSVTSKKGGDNIKHDRQENGEDDESLIEIESIANFANPIQ 120
            PKPSVLPFPVARHRSHGPHWVSVTSKKGGDNIKHDRQENGEDDESLIEIESIANFANPIQ
Sbjct: 61   PKPSVLPFPVARHRSHGPHWVSVTSKKGGDNIKHDRQENGEDDESLIEIESIANFANPIQ 120

Query: 121  RKNKNSLDFGRWREVVRGHNHDAANKEEKKVAGLAKNENLNRAGEAINTVDDTMMSCKPL 180
            RKNKNSLDFGRWREVVRGHNHDAANKEEKKVAGLAKNENLNRAGEAINTVDDTMMSCKPL
Sbjct: 121  RKNKNSLDFGRWREVVRGHNHDAANKEEKKVAGLAKNENLNRAGEAINTVDDTMMSCKPL 180

Query: 181  SADVLAPILMNDEHNSSGFVNDPTGMRTKDSGSDFVSSTNNAEIKGLDQLCLWKDFQDVD 240
            SADVLAPILMNDEHNSSGFVNDPTGMRTKDSGSDFVSSTNNAEIKGLDQLCLWKDFQDVD
Sbjct: 181  SADVLAPILMNDEHNSSGFVNDPTGMRTKDSGSDFVSSTNNAEIKGLDQLCLWKDFQDVD 240

Query: 241  DRSGHVSESVEINEGMPVDGTSLPDMAMGLHHSNPEMVPCFGSNIKGEDAFSTLESQINA 300
            DRSGHVSESVEINEGMPVDGTSLPDMAMGLHHSNPEMVPCFGSNIKGEDAFSTLESQINA
Sbjct: 241  DRSGHVSESVEINEGMPVDGTSLPDMAMGLHHSNPEMVPCFGSNIKGEDAFSTLESQINA 300

Query: 301  ENRARIQRMSPEEIAEAQTEIKEKMSPALVKALKRRGEEKLKKGSSKPDVSKNSELDNLQ 360
            ENRARIQRMSPEEIAEAQTEIKEKMSPALVKALKRRGEEKLKKGSSKPDVSKNSELDNLQ
Sbjct: 301  ENRARIQRMSPEEIAEAQTEIKEKMSPALVKALKRRGEEKLKKGSSKPDVSKNSELDNLQ 360

Query: 361  KEGTFNRYDSLCVENGVTSANSTVKDTKSGLQNVSVQKFDLGSSTWSAWNERVEAVRLLR 420
            KEGTFNRYDSLCVENGVTSANSTVKDTKSGLQNVSVQKFDLGSSTWSAWNERVEAVRLLR
Sbjct: 361  KEGTFNRYDSLCVENGVTSANSTVKDTKSGLQNVSVQKFDLGSSTWSAWNERVEAVRLLR 420

Query: 421  FSLEGNIVESCSFQQSENGDLAVHGYSTENVTSRDFLRTDGDPSAAGYTIKEAVALTRSV 480
            FSLEGNIVESCSFQQSENGDLAVHGYSTENVTSRDFLRTDGDPSAAGYTIKEAVALTRSV
Sbjct: 421  FSLEGNIVESCSFQQSENGDLAVHGYSTENVTSRDFLRTDGDPSAAGYTIKEAVALTRSV 480

Query: 481  IPGQRVLGLHVISNVLDKALLNTHQKPVGSAMVKDGISIDYNAIWAYTLGPEPELALSLR 540
            IPGQRVLGLHVISNVLDKALLNTHQKPVGSAMVKDGISIDYNAIWAYTLGPEPELALSLR
Sbjct: 481  IPGQRVLGLHVISNVLDKALLNTHQKPVGSAMVKDGISIDYNAIWAYTLGPEPELALSLR 540

Query: 541  ICLDDNHSSVVLACAEVIQCILGCNLNEIFFDTLQKTSTYEMDLYTAPIFRSKPEINVGF 600
            ICLDDNHSSVVLACAEVIQCILGCNLNEIFFDTLQKTSTYEMDLYTAPIFRSKPEINVGF
Sbjct: 541  ICLDDNHSSVVLACAEVIQCILGCNLNEIFFDTLQKTSTYEMDLYTAPIFRSKPEINVGF 600

Query: 601  LQGGFWKYNAKPSNILPFSEDVGNVEDGEKYTIQDDIVVAQQDILAGLVRMGILHRLRYL 660
            LQGGFWKYNAKPSNILPFSEDVGNVEDGEKYTIQDDIVVAQQDILAGLVRMGILHRLRYL
Sbjct: 601  LQGGFWKYNAKPSNILPFSEDVGNVEDGEKYTIQDDIVVAQQDILAGLVRMGILHRLRYL 660

Query: 661  LEAGPSVALEECILSILIAIARHSPTCAQAIMKCERLVGLIINRFTMSDKIDILSFKIKS 720
            LEAGPSVALEECILSILIAIARHSPTCAQAIMKCERLVGLIINRFTMSDKIDILSFKIKS
Sbjct: 661  LEAGPSVALEECILSILIAIARHSPTCAQAIMKCERLVGLIINRFTMSDKIDILSFKIKS 720

Query: 721  VVLLKVLACSDRNNCVAFVKTGAFPTMIWHLFHYITSIDHWVKSGKEKCKLSSALMVEQL 780
            VVLLKVLACSDRNNCVAFVKTGAFPTMIWHLFHYITSIDHWVKSGKEKCKLSSALMVEQL
Sbjct: 721  VVLLKVLACSDRNNCVAFVKTGAFPTMIWHLFHYITSIDHWVKSGKEKCKLSSALMVEQL 780

Query: 781  RLWKVCIQDGYCVSYFSDVFPALCLWLSPPNFDKLVENNVLREFATICTEVYHVLEALAR 840
            RLWKVCIQDGYCVSYFSDVFPALCLWLSPPNFDKLVENNVLREFATICTEVYHVLEALAR
Sbjct: 781  RLWKVCIQDGYCVSYFSDVFPALCLWLSPPNFDKLVENNVLREFATICTEVYHVLEALAR 840

Query: 841  RLPNYFSQKHLDSQELGLAGNESEIWSWSCAVPMVNLAVKWLESKSDPFISKLFASQKEI 900
            RLPNYFSQKHLDSQELGLAGNESEIWSWSCAVPMVNLAVKWLESKSDPFISKLFASQKEI
Sbjct: 841  RLPNYFSQKHLDSQELGLAGNESEIWSWSCAVPMVNLAVKWLESKSDPFISKLFASQKEI 900

Query: 901  RSGFEFEGISLAPLLWVYSAVMKMLSQVFERIVPQDIMSLEGSGQIVPSLPEFIPRVGLE 960
            RSGFEFEGISLAPLLWVYSAVMKMLSQVFERIVPQDIMSLEGSGQIVPSLPEFIPRVGLE
Sbjct: 901  RSGFEFEGISLAPLLWVYSAVMKMLSQVFERIVPQDIMSLEGSGQIVPSLPEFIPRVGLE 960

Query: 961  IIRNGFLSFPGAYDKKPETYPFVGNSFVEDLCFLREHGEFETSLASVCCLHGLMLSIMNI 1020
            IIRNGFLSFPGAYDKKPETYPFVGNSFVEDLCFLREHGEFETSLASVCCLHGLMLSIMNI
Sbjct: 961  IIRNGFLSFPGAYDKKPETYPFVGNSFVEDLCFLREHGEFETSLASVCCLHGLMLSIMNI 1020

Query: 1021 DRLIHLAKTERHGFPFRDYNGSREGEILMVGMFKASLIEQRSVLNLFTKVIALESDSLQL 1080
            DRLIHLAKTERHGFPFRDYNGSREGEILMVGMFKASLIEQRSVLNLFTKVIALESDSLQL
Sbjct: 1021 DRLIHLAKTERHGFPFRDYNGSREGEILMVGMFKASLIEQRSVLNLFTKVIALESDSLQL 1080

Query: 1081 IETFGRGGPAPGVGTGWGVSGGGYWSPAVLLAQNDAAFVMFLIQAFQTVPTLNILTAQES 1140
            IETFGRGGPAPGVGTGWGVSGGGYWSPAVLLAQNDAAFVMFLIQAFQTVPTLNILTAQES
Sbjct: 1081 IETFGRGGPAPGVGTGWGVSGGGYWSPAVLLAQNDAAFVMFLIQAFQTVPTLNILTAQES 1140

Query: 1141 LTIQSINSALAICLVLGPRDTCLVEKTMEFLIQAPILHHFNFYIQSFIQLNGRVKQFGWK 1200
            LTIQSINSALAICLVLGPRDTCLVEKTMEFLIQAPILHHFNFYIQSFIQLNGRVKQFGWK
Sbjct: 1141 LTIQSINSALAICLVLGPRDTCLVEKTMEFLIQAPILHHFNFYIQSFIQLNGRVKQFGWK 1200

Query: 1201 YSEDDCLILCKTLSSHYKDRWLSPKESKSTKNKSNFSDKIFKKSSNSLDTIYEEESDETN 1260
            YSEDDCLILCKTLSSHYKDRWLSPKESKSTKNKSNFSDKIFKKSSNSLDTIYEEESDETN
Sbjct: 1201 YSEDDCLILCKTLSSHYKDRWLSPKESKSTKNKSNFSDKIFKKSSNSLDTIYEEESDETN 1260

Query: 1261 RIAQDCTCLVVQWAYQRLPLPKHWFLSPVSTISDSKYVGLQKSSDAQKIVQDSSDVLEVA 1320
            RIAQDCTCLVVQWAYQRLPLPKHWFLSPVSTI DSKYVGLQKSSDAQKIVQDSSDVLEVA
Sbjct: 1261 RIAQDCTCLVVQWAYQRLPLPKHWFLSPVSTICDSKYVGLQKSSDAQKIVQDSSDVLEVA 1320

Query: 1321 KSGLFFILGVEAFSTFLPDYFPSPVQSVPLIWKLHSLSVVLLAGMGVLDDEKSRDVYEVL 1380
            KSGLFFILGVEAFSTFLPDYFPSPVQSVPLIWKLHSLSVVLLAGMGVLDDEKSRDVYEVL
Sbjct: 1321 KSGLFFILGVEAFSTFLPDYFPSPVQSVPLIWKLHSLSVVLLAGMGVLDDEKSRDVYEVL 1380

Query: 1381 QDLYGQCLNEARYSRLSERIQEKNATDLPSQPENKSNLEFLMFQSEIHDSYSTFIETLVE 1440
            QDLYGQCLN+ARYSRLSERIQEKNATDLPSQPENKSNLEFLMFQSEIHDSYSTFIETLVE
Sbjct: 1381 QDLYGQCLNKARYSRLSERIQEKNATDLPSQPENKSNLEFLMFQSEIHDSYSTFIETLVE 1440

Query: 1441 QFAAESYGDILYGRQIVLYLHRCVEAPVRIAAWNALNNARVLELLPPLEKCFVDAEGCLE 1500
            QFAAESYGDILYGRQIVLYLHRCVEAPVRIAAWNALNNARVLELLPPLEKCFVDAEGCLE
Sbjct: 1441 QFAAESYGDILYGRQIVLYLHRCVEAPVRIAAWNALNNARVLELLPPLEKCFVDAEGCLE 1500

Query: 1501 PIEDNEAILEAYVKSWVSGALDRSASRGSVAYLLVLHHLSSYIFHSNHVANLLLRNKLSR 1560
            PIEDNEAILEAYVKSWVSGALDRSASRGSVAYLLVLHHLSSYIFHSNHVANLLLRNKLSR
Sbjct: 1501 PIEDNEAILEAYVKSWVSGALDRSASRGSVAYLLVLHHLSSYIFHSNHVANLLLRNKLSR 1560

Query: 1561 SLLRDYSQKHQRKEMMSDLILYTAPATYRVAGQKGVCSSIKMSTVEKRLEVLKEACERNS 1620
            SLLRDYSQKHQRKEMMSDLILYTAPATYRVAGQKGVCSSIKMSTVEKRLEVLKEACERNS
Sbjct: 1561 SLLRDYSQKHQRKEMMSDLILYTAPATYRVAGQKGVCSSIKMSTVEKRLEVLKEACERNS 1620

Query: 1621 YLLTVVEELGSAAKHKLSAMCSQLA 1645
            YLLTVVEELGSAAKHKLSAMCSQLA
Sbjct: 1621 YLLTVVEELGSAAKHKLSAMCSQLA 1645

BLAST of MC02g0332 vs. ExPASy TrEMBL
Match: A0A6J1J5I2 (transcriptional elongation regulator MINIYO OS=Cucurbita maxima OX=3661 GN=LOC111483661 PE=3 SV=1)

HSP 1 Score: 2542 bits (6588), Expect = 0.0
Identity = 1305/1637 (79.72%), Postives = 1427/1637 (87.17%), Query Frame = 0

Query: 1    MEKTTQSSRRKQQKSSARAKVFGAKALQLSEDDASRIVGGIVEKGISDSHQNEPFVSLAP 60
            MEK TQSSRR Q KSSARA VFG  ALQLSE DASR+VGGIVEKGISD+ Q++PF S+AP
Sbjct: 1    MEKKTQSSRRSQPKSSARATVFGTNALQLSEHDASRLVGGIVEKGISDAEQSKPFASVAP 60

Query: 61   PKPSVLPFPVARHRSHGPHWVSVTSKKGGDNIKHDRQENGEDDESLIEIESIANFANPIQ 120
            P+PSVLPFPVARHRSHGPHW SVTSK GGDNI+ DR+ + E+DE L+EI+SIANFANPIQ
Sbjct: 61   PRPSVLPFPVARHRSHGPHWESVTSKMGGDNIRDDRRGDEEEDERLMEIDSIANFANPIQ 120

Query: 121  RKNKNSLDFGRWREVVRGHNHDAANKEEKKVAGLAKNENLNRAGEAINTVDDTMMSCKPL 180
            RK K+SLDFGRWRE V GHNHDAA+ EE KVA LAK +NL RAGEA NT D+  MSC+PL
Sbjct: 121  RKKKSSLDFGRWREAVPGHNHDAASGEENKVASLAKTKNLIRAGEANNTRDN--MSCEPL 180

Query: 181  SADVLAPILMNDEHNSSGFVNDPTGMRTKDSGSDFVSSTNNAEIKGLDQLCLWKDFQDVD 240
            SA VLAP LMN E++SS FVN+PTG +T  +G +F  S NN E+KGLD+  + ++ QD  
Sbjct: 181  SAGVLAPSLMNIENSSSDFVNNPTGNKTNAAGLEFARSMNNVELKGLDKQHIPENLQDDY 240

Query: 241  DRSGHVSESVEINEGMPVDGTSLPDMAMGLHHSNPEMVPCFGSNIKGEDAFSTLESQINA 300
            D+ G +SES E+ EG+P+DGTSL DMA  LHH N EMVPCF SNIKGEDAFSTLESQI+A
Sbjct: 241  DQWGRISES-EVKEGVPLDGTSLQDMATRLHHLNSEMVPCFESNIKGEDAFSTLESQIDA 300

Query: 301  ENRARIQRMSPEEIAEAQTEIKEKMSPALVKALKRRGEEKLKKGSSKPDVSKNSELDNLQ 360
            EN ARIQRMS EEIAEAQ EI EKMSPAL+K LK RG  KLKKGSSKPD S + EL NLQ
Sbjct: 301  ENCARIQRMSQEEIAEAQAEIMEKMSPALLKTLKMRGAGKLKKGSSKPDASNDYELGNLQ 360

Query: 361  KEGTFNRYDSLCVENGVTSANSTVKDTKSGLQNVSVQKFDLGSSTWSAWNERVEAVRLLR 420
            KE T +R  S  +ENGVTS  + +K   SGLQNV+VQKFD GSS W+AWNERVEAVR LR
Sbjct: 361  KESTHDRNGSTNIENGVTSGTTALKYRNSGLQNVAVQKFDSGSSAWNAWNERVEAVRSLR 420

Query: 421  FSLEGNIVESCSFQQSENGDLAVHGYSTENVTSRDFLRTDGDPSAAGYTIKEAVALTRSV 480
            FSLEGNIVES SFQQSEN    VHGYSTENV SRDFLRT+GDPSAAGYTIKEAVALTRSV
Sbjct: 421  FSLEGNIVESYSFQQSEN----VHGYSTENVASRDFLRTEGDPSAAGYTIKEAVALTRSV 480

Query: 481  IPGQRVLGLHVISNVLDKALLNTHQKPVGSAMVKDGISIDYNAIWAYTLGPEPELALSLR 540
            IPGQRVLGLHVISNVLDKA LNTH K VGS MVKDG S+DYNAIWAY LGPEPELALSLR
Sbjct: 481  IPGQRVLGLHVISNVLDKASLNTHLKQVGSTMVKDGSSVDYNAIWAYILGPEPELALSLR 540

Query: 541  ICLDDNHSSVVLACAEVIQCILGCNLNEIFFDTLQKTSTYEMDLYTAPIFRSKPEINVGF 600
            +CLDDNH+SV+LACAEVIQC+L CNLNE FFDTL+KTSTYE DL TA +FRSKPEIN GF
Sbjct: 541  MCLDDNHNSVILACAEVIQCVLSCNLNETFFDTLEKTSTYEKDLCTAAVFRSKPEINAGF 600

Query: 601  LQGGFWKYNAKPSNILPFSEDVGNVEDGEKYTIQDDIVVAQQDILAGLVRMGILHRLRYL 660
            L GGFWKY+AKPSNILPFSEDV NVEDGEKYTIQDDIVVAQQDI AGLVRMG+L RLRYL
Sbjct: 601  LHGGFWKYSAKPSNILPFSEDVENVEDGEKYTIQDDIVVAQQDIAAGLVRMGLLPRLRYL 660

Query: 661  LEAGPSVALEECILSILIAIARHSPTCAQAIMKCERLVGLIINRFTMSDKIDILSFKIKS 720
            LEAGPSVALE+C+LSIL+AIARHSP CA+AIM CERLV LII+RFTMSDKIDILS KIKS
Sbjct: 661  LEAGPSVALEDCLLSILVAIARHSPACARAIMICERLVELIIHRFTMSDKIDILSLKIKS 720

Query: 721  VVLLKVLACSDRNNCVAFVKTGAFPTMIWHLFHYITSIDHWVKSGKEKCKLSSALMVEQL 780
            VVLLKVL+ SDR NC+AFVK+GAF TMIWHL+HY +SIDHWVKSGKEKCKLSS LMVEQL
Sbjct: 721  VVLLKVLSRSDRKNCIAFVKSGAFQTMIWHLYHYTSSIDHWVKSGKEKCKLSSTLMVEQL 780

Query: 781  RLWKVCIQDGYCVSYFSDVFPALCLWLSPPNFDKLVENNVLREFATICTEVYHVLEALAR 840
            RLWKVCIQ GYCVSYFSDVFPALCLWLSPPNFDKL+ENNVLREF TI  EVYHVLEALAR
Sbjct: 781  RLWKVCIQYGYCVSYFSDVFPALCLWLSPPNFDKLIENNVLREFTTISMEVYHVLEALAR 840

Query: 841  RLPNYFSQKHLDSQELGLAGNESEIWSWSCAVPMVNLAVKWLESKSDPFISKLFASQKEI 900
            RLPN+FSQKHLDSQE G AGNESE+WSWSC VP+V+LA KWLESKSDPFISK F SQK  
Sbjct: 841  RLPNFFSQKHLDSQEPGHAGNESEVWSWSCVVPIVDLATKWLESKSDPFISKFFESQKGT 900

Query: 901  RSGFEFEGISLAPLLWVYSAVMKMLSQVFERIVPQDIMSLEGSGQIVPSLPEFIPRVGLE 960
             +GF FEGISLAPLLWVYSAVMKMLSQV ERI+P DIMS EGSGQIVP +PEFIPR+GLE
Sbjct: 901  MNGFGFEGISLAPLLWVYSAVMKMLSQVVERIIPHDIMSQEGSGQIVPWIPEFIPRIGLE 960

Query: 961  IIRNGFLSFPGAYDKKPETYPFVGNSFVEDLCFLREHGEFETSLASVCCLHGLMLSIMNI 1020
            II++GFLSF  A D KPETYP   NSFVE+LCFLREHGEFETSLASVCCLHGLMLSI++I
Sbjct: 961  IIKHGFLSFADASDMKPETYPSGRNSFVENLCFLREHGEFETSLASVCCLHGLMLSILHI 1020

Query: 1021 DRLIHLAKTERHGFPFRDYNGSREGEILMVGMFKASLIEQRSVLNLFTKVIALESDSLQL 1080
            DRLIHLAKTE   +  +DYN SREGEIL VGMFKASLIEQ+SVL+LFTKVI+LE DSLQL
Sbjct: 1021 DRLIHLAKTESPDYSPKDYNFSREGEILRVGMFKASLIEQKSVLDLFTKVISLECDSLQL 1080

Query: 1081 IETFGRGGPAPGVGTGWGVSGGGYWSPAVLLAQNDAAFVMFLIQAFQTVPTLNILTAQES 1140
            IETFGRGGPAPGVGTGWGVSGGGYWSP VLLAQNDAAF+M LI+AFQ +PTLNIL AQES
Sbjct: 1081 IETFGRGGPAPGVGTGWGVSGGGYWSPGVLLAQNDAAFLMSLIEAFQAIPTLNILIAQES 1140

Query: 1141 LTIQSINSALAICLVLGPRDTCLVEKTMEFLIQAPILHHFNFYIQSFIQLNGRVKQFGWK 1200
            LT+QSINSALA+CLVLGP +T LVE+T+ FL QAPILH+FN YIQ+F+QLNG VKQFGW+
Sbjct: 1141 LTVQSINSALAVCLVLGPGNTGLVEQTVNFLTQAPILHNFNLYIQNFLQLNGEVKQFGWE 1200

Query: 1201 YSEDDCLILCKTLSSHYKDRWLSPKESKSTKNKSNFSDKIFKKSSNSLDTIYEEESDETN 1260
            YSEDDCLI CKTLSSHYKD+WL+PKESKS KNKSNFSD+ F   + SLDTIYE  SDETN
Sbjct: 1201 YSEDDCLIFCKTLSSHYKDKWLTPKESKSMKNKSNFSDRTFMNGNVSLDTIYEG-SDETN 1260

Query: 1261 RIAQDCTCLVVQWAYQRLPLPKHWFLSPVSTISDSKYVGLQKSSDAQKIVQDSSDVLEVA 1320
             +A+DCTCL+ QWAYQRLPLP HWF SPVSTI DSK+ GLQKS DAQ ++QDS D LEVA
Sbjct: 1261 GMAEDCTCLIEQWAYQRLPLPGHWFFSPVSTICDSKHAGLQKS-DAQILMQDSGDFLEVA 1320

Query: 1321 KSGLFFILGVEAFSTFLPDYFPSPVQSVPLIWKLHSLSVVLLAGMGVLDDEKSRDVYEVL 1380
            KSGLFFILGVEAFSTFLPD FPSPV SVPLIWKLHSLSV+LL GMGVLDDEKSRDVYEVL
Sbjct: 1321 KSGLFFILGVEAFSTFLPDGFPSPVLSVPLIWKLHSLSVLLLTGMGVLDDEKSRDVYEVL 1380

Query: 1381 QDLYGQCLNEARYSRLSERIQEKNATDLPSQPENKSNLEFLMFQSEIHDSYSTFIETLVE 1440
            QDLYGQ LNEAR  RLS  + +K+A  L SQPENKSNLEFLMFQSEIHDSYSTFIETLVE
Sbjct: 1381 QDLYGQRLNEARSCRLSVHVTQKDAKHLLSQPENKSNLEFLMFQSEIHDSYSTFIETLVE 1440

Query: 1441 QFAAESYGDILYGRQIVLYLHRCVEAPVRIAAWNALNNARVLELLPPLEKCFVDAEGCLE 1500
            QF+A SYGD+LYGRQIVLYLH+CVE+P R+AAWNALN ARV +LLPPLEKC  D EG L+
Sbjct: 1441 QFSAVSYGDVLYGRQIVLYLHQCVESPTRLAAWNALNGARVFDLLPPLEKCIADPEGYLQ 1500

Query: 1501 PIEDNEAILEAYVKSWVSGALDRSASRGSVAYLLVLHHLSSYIFHSNHVANLLLRNKLSR 1560
            PIEDNEAILEAY+KSWVSGALD+SASRGSVAYLLVLHHLSSYIFHS  V NLLLRNKLSR
Sbjct: 1501 PIEDNEAILEAYLKSWVSGALDKSASRGSVAYLLVLHHLSSYIFHSYPVDNLLLRNKLSR 1560

Query: 1561 SLLRDYSQKHQRKEMMSDLILYTAPATYRVAGQKGVCSSIKMSTVEKRLEVLKEACERNS 1620
            SLLRDYSQKHQ K MM DL+LYT P+TY V GQKG+ +SI+ S VEKRLEVLKEACERNS
Sbjct: 1561 SLLRDYSQKHQHKAMMLDLVLYTEPSTYLVTGQKGIGTSIEASVVEKRLEVLKEACERNS 1620

Query: 1621 YLLTVVEELGSAAKHKL 1637
             LLTVV+ELG AAK KL
Sbjct: 1621 SLLTVVKELGCAAKDKL 1628

BLAST of MC02g0332 vs. ExPASy TrEMBL
Match: A0A6J1FXF4 (transcriptional elongation regulator MINIYO OS=Cucurbita moschata OX=3662 GN=LOC111448455 PE=3 SV=1)

HSP 1 Score: 2529 bits (6556), Expect = 0.0
Identity = 1304/1640 (79.51%), Postives = 1417/1640 (86.40%), Query Frame = 0

Query: 1    MEKTTQSSRRKQQKSSARAKVFGAKALQLSEDDASRIVGGIVEKGISDSHQNEPFVSLAP 60
            MEK  QSSRR Q KSSAR KVFG  ALQLSE DASR+VGGIVEKGISD+ Q++PF S+AP
Sbjct: 1    MEKKAQSSRRSQPKSSARGKVFGTNALQLSEHDASRLVGGIVEKGISDAEQSKPFASVAP 60

Query: 61   PKPSVLPFPVARHRSHGPHWVSVTSKKGGDNIKHDRQENGEDDESLIEIESIANFANPIQ 120
            P+PSVLPFPVARHRSHGPHW SVTSK GG+NI+ DR+ + E+DE L+EIESIANFANPIQ
Sbjct: 61   PRPSVLPFPVARHRSHGPHWESVTSKMGGNNIRDDRRGDEEEDERLMEIESIANFANPIQ 120

Query: 121  RKNKNSLDFGRWREVVRGHNHDAANKEEKKVAGLAKNENLNRAGEAINTVDDTMMSCKPL 180
            RK K+SLDFGRWRE V GHNH AA+ EE KVA LAK E+L RAGEA +T+D+  MSC+PL
Sbjct: 121  RKKKSSLDFGRWREAVPGHNHIAASGEENKVASLAKTEHLIRAGEANSTMDN--MSCEPL 180

Query: 181  SADVLAPILMNDEHNSSGFVNDPTGMRTKDSGSDFVSSTNNAEIKGLDQLCLWKDFQDVD 240
            SA VLAP LMN EH+SS FVN PTG +T  +G +F  S NN E+KGLD+  + ++ QD  
Sbjct: 181  SAGVLAPSLMNIEHSSSDFVNKPTGNKTNAAGLEFARSMNNVELKGLDKQHIPENLQDDY 240

Query: 241  DRSGHVSESVEINEGMPVDGTSLPDMAMGLHHSNPEMVPCFGSNIKGEDAFSTLESQINA 300
            D+ GH+SES E+ EG+P+DGTS  DMA  LHH N EMVPCF SNIKGEDAFSTLESQI+A
Sbjct: 241  DQWGHISES-EVKEGVPLDGTSFQDMATRLHHLNSEMVPCFESNIKGEDAFSTLESQIDA 300

Query: 301  ENRARIQRMSPEEIAEAQTEIKEKMSPALVKALKRRGEEKLKKGSSKPDVSKNSELDNLQ 360
            EN ARIQRMS EEIAEAQ EI EKM PAL K LK RGE KLKKGSSKPD S + EL NLQ
Sbjct: 301  ENCARIQRMSQEEIAEAQAEIMEKMRPALWKTLKMRGEGKLKKGSSKPDASNDYELGNLQ 360

Query: 361  KEGTFNRYDSLCVENGVTSANSTVKDTKSGLQNVSVQKFDLGSSTWSAWNERVEAVRLLR 420
            KE T +R  S  +ENGVTS  + +K   SGLQNV+VQKFD GSS W+AWNERVEAVR LR
Sbjct: 361  KESTHDRNGSPNIENGVTSGTTALKYRNSGLQNVAVQKFDSGSSAWNAWNERVEAVRSLR 420

Query: 421  FSLEGNIVESCSFQQSENGDLAVHGYSTENVTSRDFLRTDGDPSAAGYTIKEAVALTRSV 480
            FSLEGNIVES SFQQSEN    VHGYSTENV SRDFLRT+GDPSAAGYTIKEAVALTRSV
Sbjct: 421  FSLEGNIVESYSFQQSEN----VHGYSTENVASRDFLRTEGDPSAAGYTIKEAVALTRSV 480

Query: 481  IPGQRVLGLHVISNVLDKALLNTHQKPVGSAMVKDGISIDYNAIWAYTLGPEPELALSLR 540
            IPGQRVLGLHVISNVLDKA LNT  K VGS MVKD  S+DYNAIW Y LGPEPELALSLR
Sbjct: 481  IPGQRVLGLHVISNVLDKASLNTRLKQVGSTMVKDSSSVDYNAIWTYILGPEPELALSLR 540

Query: 541  ICLDDNHSSVVLACAEVIQCILGCNLNEIFFDTLQKTSTYEMDLYTAPIFRSKPEINVGF 600
            +CLDDNH+SV+LACAEVIQC+L CNLNE FFDTL+KTSTYE DL TA +FRSKPEIN GF
Sbjct: 541  MCLDDNHNSVILACAEVIQCVLSCNLNETFFDTLEKTSTYEKDLCTAAVFRSKPEINAGF 600

Query: 601  LQGGFWKYNAKPSNILPFSEDVGNVEDGEKYTIQDDIVVAQQDILAGLVRMGILHRLRYL 660
            L GGFWKY+AKPSNILP SEDV NVEDGEKYTIQDDIVVAQQDI AGLVRMG+L RLRYL
Sbjct: 601  LHGGFWKYSAKPSNILPISEDVENVEDGEKYTIQDDIVVAQQDIAAGLVRMGLLPRLRYL 660

Query: 661  LEAGPSVALEECILSILIAIARHSPTCAQAIMKCERLVGLIINRFTMSDKIDILSFKIKS 720
            LEAGPSVALE+CILSIL+AIARHSP CA+AIM CERLV LII+RFTMSDKIDILS KIKS
Sbjct: 661  LEAGPSVALEDCILSILVAIARHSPACARAIMICERLVELIIHRFTMSDKIDILSLKIKS 720

Query: 721  VVLLKVLACSDRNNCVAFVKTGAFPTMIWHLFHYITSIDHWVKSGKEKCKLSSALMVEQL 780
            VVLLKVL+ SDR NC+ FVK+GAF TMIWHL+HY +SIDHWVKSGKEKCKLSS LMVEQL
Sbjct: 721  VVLLKVLSRSDRKNCIEFVKSGAFQTMIWHLYHYTSSIDHWVKSGKEKCKLSSTLMVEQL 780

Query: 781  RLWKVCIQDGYCVSYFSDVFPALCLWLSPPNFDKLVENNVLREFATICTEVYHVLEALAR 840
            RLWKVCIQ GYCVSYFSDVFPALCLWLSPPNFDKL+ENNVLREF TI  EVYHVLEAL R
Sbjct: 781  RLWKVCIQYGYCVSYFSDVFPALCLWLSPPNFDKLIENNVLREFTTISMEVYHVLEALTR 840

Query: 841  RLPNYFSQKHLDSQELGLAGNESEIWSWSCAVPMVNLAVKWLESKSDPFISKLFASQKEI 900
            RLPN+FSQKHLDSQE G AGNESE+WSWSC VP+V+LA KWLESKSDPFISK F SQK  
Sbjct: 841  RLPNFFSQKHLDSQEPGHAGNESEVWSWSCVVPIVDLATKWLESKSDPFISKFFESQKGT 900

Query: 901  RSGFEFEGISLAPLLWVYSAVMKMLSQVFERIVPQDIMSLEGSGQIVPSLPEFIPRVGLE 960
             +GF FEGISLAPLLWVYSAVMKMLSQV ERI+P DIMS EGSGQIVP LPEFIPR+GLE
Sbjct: 901  MNGFGFEGISLAPLLWVYSAVMKMLSQVVERIIPHDIMSQEGSGQIVPWLPEFIPRIGLE 960

Query: 961  IIRNGFLSFPGAYDKKPETYPFVGNSFVEDLCFLREHGEFETSLASVCCLHGLMLSIMNI 1020
            II++GFLS     D KPETYP   NSFVEDLCFLREHGEFETSLASVCCLHGLMLSI++I
Sbjct: 961  IIKHGFLSLS---DNKPETYPSGRNSFVEDLCFLREHGEFETSLASVCCLHGLMLSIVHI 1020

Query: 1021 DRLIHLAKTERHGFPFRDYNGSREGEILMVGMFKASLIEQRSVLNLFTKVIALESDSLQL 1080
            DRLIHLAKTE   +  +DYN SREGEIL VGMFK SLIEQ+S+L+LFTKVIALE DSLQL
Sbjct: 1021 DRLIHLAKTESQDYSPKDYNSSREGEILRVGMFKTSLIEQKSLLDLFTKVIALECDSLQL 1080

Query: 1081 IETFGRGGPAPGVGTGWGVSGGGYWSPAVLLAQNDAAFVMFLIQAFQTVPTLNILTAQES 1140
            IETFGRGGPAPGVGTGWGVSGGGYWSP VLLA+NDAAF+M LI+AFQ VPTLNIL AQES
Sbjct: 1081 IETFGRGGPAPGVGTGWGVSGGGYWSPDVLLAENDAAFLMSLIEAFQAVPTLNILIAQES 1140

Query: 1141 LTIQSINSALAICLVLGPRDTCLVEKTMEFLIQAPILHHFNFYIQSFIQLNGRVKQFGWK 1200
            LT+QSINSALA+CLVLGPR+T LVEKT+ FL QAPILH+FN YIQ+F+QLNG VKQFGWK
Sbjct: 1141 LTVQSINSALAVCLVLGPRNTGLVEKTVNFLTQAPILHNFNLYIQNFLQLNGEVKQFGWK 1200

Query: 1201 YSEDDCLILCKTLSSHYKDRWLSPKESKSTKNKSNFSDKIFKKSSNSLDTIYEEESDETN 1260
            YSEDDCLI CKTLSSHYKDRWL+PKESKS KNKSNFSDK F   + SLDTIYEE SDETN
Sbjct: 1201 YSEDDCLIFCKTLSSHYKDRWLTPKESKSMKNKSNFSDKTFMNGNVSLDTIYEE-SDETN 1260

Query: 1261 RIAQDCTCLVVQWAYQRLPLPKHWFLSPVSTISDSKYVGLQKSSDAQKIVQDSSDVLEVA 1320
            R+A+DCTCL+ QWAYQRLPLP HWF SP+STI DSK+VGLQKS DAQ  +QDS D LEVA
Sbjct: 1261 RMAEDCTCLIEQWAYQRLPLPGHWFFSPISTIRDSKHVGLQKS-DAQIFMQDSDDFLEVA 1320

Query: 1321 KSGLFFILGVEAFSTFLPDYFPSPVQSVPLIWKLHSLSVVLLAGMGVLDDEKSRDVYEVL 1380
            KSGLFFILGVEAFSTFLPD FPSPV SVPLIWKLHSLSV+LL GMGVLDDEKSRDVYEVL
Sbjct: 1321 KSGLFFILGVEAFSTFLPDGFPSPVLSVPLIWKLHSLSVLLLTGMGVLDDEKSRDVYEVL 1380

Query: 1381 QDLYGQCLNEARYSRLSERIQEKNATDLPSQPENKSNLEFLMFQSEIHDSYSTFIETLVE 1440
            QDLY Q LNEAR  RLS  + +K+A  L SQPENKSNLEFL FQSEIHDSYSTFIETLVE
Sbjct: 1381 QDLYSQRLNEARSCRLSVNLTQKDAKHLVSQPENKSNLEFLRFQSEIHDSYSTFIETLVE 1440

Query: 1441 QFAAESYGDILYGRQIVLYLHRCVEAPVRIAAWNALNNARVLELLPPLEKCFVDAEGCLE 1500
            QF+A SYGD+LYGRQIVLYLH+CVE+P R+AAWNALN ARV +LLPPLEKC  DAEG L 
Sbjct: 1441 QFSAVSYGDVLYGRQIVLYLHQCVESPTRLAAWNALNGARVFDLLPPLEKCIADAEGYLH 1500

Query: 1501 PIEDNEAILEAYVKSWVSGALDRSASRGSVAYLLVLHHLSSYIFHSNHVANLLLRNKLSR 1560
            PIEDNEAILEAY+KSWVSGALD+SASRGSVAYLLVLHHLSSYIFHS  V NLLLRNKLSR
Sbjct: 1501 PIEDNEAILEAYLKSWVSGALDKSASRGSVAYLLVLHHLSSYIFHSYPVDNLLLRNKLSR 1560

Query: 1561 SLLRDYSQKHQRKEMMSDLILYTAPATYRVAGQKGVCSSIKMSTVEKRLEVLKEACERNS 1620
            SLLRDYSQKHQ K MM DL+LYT P+TY V GQKG+ +SI+ S VEKRLEVLKEACERNS
Sbjct: 1561 SLLRDYSQKHQHKAMMLDLVLYTEPSTYLVTGQKGIGTSIETSAVEKRLEVLKEACERNS 1620

Query: 1621 YLLTVVEELGSAAKHKLSAM 1640
             LLTVVEELG AAK K S +
Sbjct: 1621 SLLTVVEELGCAAKDKPSTI 1628

BLAST of MC02g0332 vs. ExPASy TrEMBL
Match: A0A1S3BKC4 (LOW QUALITY PROTEIN: transcriptional elongation regulator MINIYO OS=Cucumis melo OX=3656 GN=LOC103490563 PE=3 SV=1)

HSP 1 Score: 2404 bits (6229), Expect = 0.0
Identity = 1255/1643 (76.38%), Postives = 1390/1643 (84.60%), Query Frame = 0

Query: 1    MEKTTQSSRRKQQKSSARAKVFGAKALQLSEDDASRIVGGIVEKGISDSHQNEPFVSLAP 60
            MEK TQS RR Q  SSARAKVFG  +LQLSEDDA+R+VGGIVEKG+SDS Q+ PF S AP
Sbjct: 1    MEKKTQSCRRSQSNSSARAKVFGTNSLQLSEDDATRLVGGIVEKGVSDSEQSTPFFSPAP 60

Query: 61   PKPSVLPFPVARHRSHGPHWVSVTSKKGGDNIKHDRQENGEDDESLIEIESIANFANPIQ 120
             +PSVLPFPVARHRSHGPHW SVTSKKGGDNIK DR E+GE+DE+++  +SIANFANPIQ
Sbjct: 61   -RPSVLPFPVARHRSHGPHWESVTSKKGGDNIKADRGEDGEEDETMMVADSIANFANPIQ 120

Query: 121  RKNKNSLDFGRWREVVRGHNHDAANKEEKKVAGLAKNENLNRAGEAINTVDDTMMSCKPL 180
            RK K+SLDFGRWRE    HNH AAN+EEK++  LAK  +L+RAGEA    DD  MSC+P 
Sbjct: 121  RKKKSSLDFGRWREASPDHNHGAANREEKELQSLAKTASLSRAGEANTGTDD--MSCRPF 180

Query: 181  SADVLAPILMNDEHNSSGFVNDPTGMRTKDSGSDFVSSTNNAEIKGLDQLCLWKDFQDVD 240
            SA VLAP LM  E +SS FVND TG +T  +G          E+KG D+  L ++ QDV 
Sbjct: 181  SAHVLAPSLMECERSSSDFVNDSTGNKTNRAGF---------ELKGSDKQHLPENLQDVR 240

Query: 241  DRSGHVSESVEINEGMPVDGTSLPDMAMGLHHSNPEMVPCFGSNIKGEDAFSTLESQINA 300
            D+ G +SES E+NE M +DGTSL DM    HH N EM PCF SNIKG+DAF TL+SQI+A
Sbjct: 241  DQRGDISES-EVNESMQLDGTSLRDMGT-RHHLNSEMTPCFQSNIKGDDAFLTLKSQIDA 300

Query: 301  ENRARIQRMSPEEIAEAQTEIKEKMSPALVKALKRRGEEKLKKGSSKPDVSKNSELDNLQ 360
            ENRAR+Q+MSPEEIAEAQ EI EKMSPALVKALK RGE KLK+GSSKPDVS N EL NLQ
Sbjct: 301  ENRARMQKMSPEEIAEAQAEIMEKMSPALVKALKMRGEGKLKQGSSKPDVSSNYELGNLQ 360

Query: 361  KEGTFNRYDSLCVENGVTSANSTVKDTKSGLQNVSVQKFDLGSSTWSAWNERVEAVRLLR 420
            KE   +   S   ENGVTS  +T+KDTKSGLQ+VSVQK D GSS W+AWNERVEAVR LR
Sbjct: 361  KESRIDGNGSSNKENGVTSVKTTLKDTKSGLQDVSVQKIDSGSSIWNAWNERVEAVRSLR 420

Query: 421  FSLEGNIVESCSFQQSENGDLAVHGYSTENVTSRDFLRTDGDPSAAGYTIKEAVALTRSV 480
            FSLEGN+VES SFQQSEN    VHGYSTENV SRDFLRT+GDPSAAGYTI EAVALTRSV
Sbjct: 421  FSLEGNLVESYSFQQSEN----VHGYSTENVASRDFLRTEGDPSAAGYTINEAVALTRSV 480

Query: 481  IPGQRVLGLHVISNVLDKALLNTHQKPVGSAMVKDGISIDYNAIWAYTLGPEPELALSLR 540
            IPGQRVLGLHVISNVLDKALLNTH   VGS M+K+  S+DYNAIWAY LGPEPELALSLR
Sbjct: 481  IPGQRVLGLHVISNVLDKALLNTHLTQVGSTMIKNRSSVDYNAIWAYILGPEPELALSLR 540

Query: 541  ICLDDNHSSVVLACAEVIQCILGCNLNEIFFDTLQKTSTYEMDLYTAPIFRSKPEINVGF 600
            ICLDDNH+SVVLACAEVIQ +L CNLNE FFD+L+KTSTYE DLYTA +FRSKPEINVGF
Sbjct: 541  ICLDDNHNSVVLACAEVIQSVLSCNLNESFFDSLEKTSTYEKDLYTAAVFRSKPEINVGF 600

Query: 601  LQGGFWKYNAKPSNILPFSEDVGNVEDGEKYTIQDDIVVAQQDILAGLVRMGILHRLRYL 660
            LQGGFWKY+AK SNILP +ED G VEDG KYTIQDDIVVAQQDI AGLVRMGIL RL YL
Sbjct: 601  LQGGFWKYSAKSSNILPITEDFGIVEDGVKYTIQDDIVVAQQDIAAGLVRMGILPRLVYL 660

Query: 661  LEAGPSVALEECILSILIAIARHSPTCAQAIMKCERLVGLIINRFTMSDKIDILSFKIKS 720
            LEA PSVALEECILSIL+AIARHSP CAQAIMKC+RL+ LI+ RFTMS+KIDILS KIKS
Sbjct: 661  LEADPSVALEECILSILVAIARHSPICAQAIMKCDRLIELIVQRFTMSEKIDILSLKIKS 720

Query: 721  VVLLKVLACSDRNNCVAFVKTGAFPTMIWHLFHYITSIDHWVKSGKEKCKLSSALMVEQL 780
            VVLLKVLA SDR NC AFVK+GAF T+IWHL+HY +SID W+KSGKEKCKLSS LMVEQL
Sbjct: 721  VVLLKVLARSDRKNCFAFVKSGAFLTVIWHLYHYTSSIDQWLKSGKEKCKLSSTLMVEQL 780

Query: 781  RLWKVCIQDGYCVSYFSDVFPALCLWLSPPNFDKLVENNVLREFATICTEVYHVLEALAR 840
            RLWKVCIQ GYCVSYFSDVFP+LCLWL+PPNF KL+ENNVLREF TI  E YHVLEALAR
Sbjct: 781  RLWKVCIQYGYCVSYFSDVFPSLCLWLNPPNFGKLIENNVLREFTTISMEAYHVLEALAR 840

Query: 841  RLPNYFSQKHLDSQELGLAGNESEIWSWSCAVPMVNLAVKWLESKSDPFISKLFASQKEI 900
            RLP +F Q+++ +QE G  G+ESE WSWSCAVPMV+LA+KWL SK DPFI K F+SQK I
Sbjct: 841  RLPIFF-QRNIXTQEPGFTGDESEAWSWSCAVPMVDLAIKWLGSKKDPFICKFFSSQKGI 900

Query: 901  RSGFEFEGISLAPLLWVYSAVMKMLSQVFERIVPQDIMSLEGSGQIVPSLPEFIPRVGLE 960
            R+ F FEGISLAPLLWVYSAV KMLS+V ERI PQDI++  GS QIVP +PEFIP+VGLE
Sbjct: 901  RNDFVFEGISLAPLLWVYSAVFKMLSRVVERI-PQDILTQIGSDQIVPWIPEFIPQVGLE 960

Query: 961  IIRNGFLSFPGAYDKKPETYPFVGNSFVEDLCFLREHGEFETSLASVCCLHGLMLSIMNI 1020
            II+NGFL+F  A D  P+T P  GNSFVEDLCF REHGEFE SLASVCCLHGLMLSI+NI
Sbjct: 961  IIKNGFLNFADASDMNPKTSPSGGNSFVEDLCFWREHGEFEMSLASVCCLHGLMLSIVNI 1020

Query: 1021 DRLIHLAKTERHGFPFRDYNGSREGEILMVGMFKASLIEQRSVLNLFTKVIALESDSLQL 1080
            DRLI LAKTE   +P +D N SREGEIL VGMFK SL+EQRS+L+LFTK IALE DSL+L
Sbjct: 1021 DRLILLAKTESQAYPPKDVNSSREGEILRVGMFKTSLVEQRSMLDLFTKKIALECDSLRL 1080

Query: 1081 IETFGRGGPAPGVGTGWGVSGGGYWSPAVLLAQNDAAFVMFLIQAFQTVPTLNILTAQES 1140
            IETFGRGGPAPGVG GWGV GGGYWS AVLLAQND+AF+M LI+AF T+PTLN LTAQES
Sbjct: 1081 IETFGRGGPAPGVGIGWGVCGGGYWSLAVLLAQNDSAFLMSLIEAFHTIPTLNGLTAQES 1140

Query: 1141 LTIQSINSALAICLVLGPRDTCLVEKTMEFLIQAPILHHFNFYIQSFIQLNGRVKQFGWK 1200
            LT+QSINSALA+CLVLGPRD  L+EKTMEFLIQAPIL++FN YIQ F+QLNG+VKQFGWK
Sbjct: 1141 LTLQSINSALAVCLVLGPRDIGLIEKTMEFLIQAPILYNFNLYIQRFLQLNGKVKQFGWK 1200

Query: 1201 YSEDDCLILCKTLSSHYKDRWLSPKESKSTKNKSNFSDKIFKKSSNSLDTIYEEESDETN 1260
            YSEDDCLI C+TLSSHYKDRWL+PK SKS KNKSN SD  FK    SLDTIYEE SDETN
Sbjct: 1201 YSEDDCLIFCRTLSSHYKDRWLTPKGSKSVKNKSNLSDGTFKSGRVSLDTIYEE-SDETN 1260

Query: 1261 RIAQDCTCLVVQWAYQRLPLPKHWFLSPVSTISDSKYVGLQKSSDAQKIVQDSSDVLEVA 1320
            R+ + CTCL+VQWAYQRLPLP HWF SPVSTI DSK+ G QKS DAQ I+Q+SSD+ +VA
Sbjct: 1261 RVVEGCTCLIVQWAYQRLPLPGHWFFSPVSTICDSKHAGRQKS-DAQSIMQESSDLFDVA 1320

Query: 1321 KSGLFFILGVEAFSTFLPDYFPSPVQSVPLIWKLHSLSVVLLAGMGVLDDEKSRDVYEVL 1380
            KSGLFFILG+EAFS+FLPD FP PV SVPLIWKLHSLSVVLL  +GVLDDEKSRDVYEVL
Sbjct: 1321 KSGLFFILGIEAFSSFLPDDFPKPVLSVPLIWKLHSLSVVLLTDIGVLDDEKSRDVYEVL 1380

Query: 1381 QDLYGQCLNEARYSRLSERIQEKNATDLPSQPENK-SNLEFLMFQSEIHDSYSTFIETLV 1440
            QDLYGQ LNEA   R    I EK+A  LPSQ ENK SN+EFLMFQSEIHDSYS FIETLV
Sbjct: 1381 QDLYGQRLNEAMSRRHPADIVEKDAKHLPSQLENKRSNIEFLMFQSEIHDSYSLFIETLV 1440

Query: 1441 EQFAAESYGDILYGRQIVLYLHRCVEAPVRIAAWNALNNARVLELLPPLEKCFVDAEGCL 1500
            EQF++ SYGD+LYGRQIVLYLHRCVE+  R+AAWNALN+ARV ELLPPLEKC  DAEG L
Sbjct: 1441 EQFSSVSYGDVLYGRQIVLYLHRCVESQTRLAAWNALNSARVFELLPPLEKCLADAEGYL 1500

Query: 1501 EPIEDNEAILEAYVKSWVSGALDRSASRGSVAYLLVLHHLSSYIFHSNHVANLLLRNKLS 1560
            +PIEDNEAILEAYVKSWVSGALDRSASRGSVAYLL LHHLSSYIFHS  V NLLLRNKLS
Sbjct: 1501 QPIEDNEAILEAYVKSWVSGALDRSASRGSVAYLLSLHHLSSYIFHSYPVNNLLLRNKLS 1560

Query: 1561 RSLLRDYSQKHQRKEMMSDLILYTAPATYRVAGQKGVCSSIKMSTVEKRLEVLKEACERN 1620
            RSLLRD SQKH RKEMM++LILYT P+T+ +AGQKGV +SI MS VEKRLEVLKEACE+N
Sbjct: 1561 RSLLRDCSQKHHRKEMMTNLILYTKPSTHLIAGQKGVGTSIGMSDVEKRLEVLKEACEKN 1620

Query: 1621 SYLLTVVEELGSAAKHKLSAMCS 1642
            S+LLTVVEELGS+AK +LSAMC+
Sbjct: 1621 SFLLTVVEELGSSAKSELSAMCN 1621

BLAST of MC02g0332 vs. ExPASy TrEMBL
Match: A0A5A7V3U3 (Transcriptional elongation regulator MINIYO OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold89G002670 PE=3 SV=1)

HSP 1 Score: 2351 bits (6093), Expect = 0.0
Identity = 1235/1638 (75.40%), Postives = 1362/1638 (83.15%), Query Frame = 0

Query: 1    MEKTTQSSRRKQQKSSARAKVFGAKALQLSEDDASRIVGGIVEKGISDSHQNEPFVSLAP 60
            MEK TQS RR Q  SSARAKVFG  +LQLSEDDA+R+VGGIVEKG+SDS Q+ PF S AP
Sbjct: 1    MEKKTQSCRRSQSNSSARAKVFGTNSLQLSEDDATRLVGGIVEKGVSDSEQSTPFFSPAP 60

Query: 61   PKPSVLPFPVARHRSHGPHWVSVTSKKGGDNIKHDRQENGEDDESLIEIESIANFANPIQ 120
             +PSVLPFPVARHRSHGPHW SVTSKKGGDNIK DR E+GE+DE+++  +SIANFANPIQ
Sbjct: 61   -RPSVLPFPVARHRSHGPHWESVTSKKGGDNIKADRGEDGEEDETMMVADSIANFANPIQ 120

Query: 121  RKNKNSLDFGRWREVVRGHNHDAANKEEKKVAGLAKNENLNRAGEAINTVDDTMMSCKPL 180
            RK K+SLDFGRWRE    HNH AAN+EEK++  LAK  +L+RAGEA    DD  MSC+P 
Sbjct: 121  RKKKSSLDFGRWREASPDHNHGAANREEKELQSLAKTASLSRAGEANTGTDD--MSCRPF 180

Query: 181  SADVLAPILMNDEHNSSGFVNDPTGMRTKDSGSDFVSSTNNAEIKGLDQLCLWKDFQDVD 240
            S  VLAP LM  E +SS FVND TG +T  +G          E+KG D+  L ++ QDV 
Sbjct: 181  SVHVLAPSLMECERSSSDFVNDSTGNKTNSAGF---------ELKGSDKQHLPENLQDVR 240

Query: 241  DRSGHVSESVEINEGMPVDGTSLPDMAMGLHHSNPEMVPCFGSNIKGEDAFSTLESQINA 300
            D+ G +SES E+NE M +DGTSL DM    HH N EM PCF SNIKG+DAF TL+SQI+A
Sbjct: 241  DQRGDISES-EVNESMQLDGTSLRDMGT-RHHLNSEMTPCFQSNIKGDDAFLTLKSQIDA 300

Query: 301  ENRARIQRMSPEEIAEAQTEIKEKMSPALVKALKRRGEEKLKKGSSKPDVSKNSELDNLQ 360
            ENRAR+Q+MSPEEIAEAQ EI EKMSPALVKALK RGE KLK+GSSKPDVS N EL NLQ
Sbjct: 301  ENRARMQKMSPEEIAEAQAEIMEKMSPALVKALKMRGEGKLKQGSSKPDVSSNYELGNLQ 360

Query: 361  KEGTFNRYDSLCVENGVTSANSTVKDTKSGLQNVSVQKFDLGSSTWSAWNERVEAVRLLR 420
            KE   +   S   ENGVTS  +T+KDTKSGLQ+VSVQK D GSS W+AWNERVEAVR LR
Sbjct: 361  KESRIDGNGSSNKENGVTSVKTTLKDTKSGLQDVSVQKIDSGSSIWNAWNERVEAVRSLR 420

Query: 421  FSLEGNIVESCSFQQSENGDLAVHGYSTENVTSRDFLRTDGDPSAAGYTIKEAVALTRSV 480
            FSLEGN+VES SFQQS+N    VHGYSTENV SRDFLRT+GDPSAAGYTI EAVALTRSV
Sbjct: 421  FSLEGNLVESYSFQQSKN----VHGYSTENVASRDFLRTEGDPSAAGYTINEAVALTRSV 480

Query: 481  IPGQRVLGLHVISNVLDKALLNTHQKPVGSAMVKDGISIDYNAIWAYTLGPEPELALSLR 540
            IPGQRVLGLHVISNVLDKALLNTH   VGS M+K+  S+DYNAIWAY LGPEPELALSLR
Sbjct: 481  IPGQRVLGLHVISNVLDKALLNTHLTQVGSTMIKNRSSVDYNAIWAYILGPEPELALSLR 540

Query: 541  ICLDDNHSSVVLACAEVIQCILGCNLNEIFFDTLQKTSTYEMDLYTAPIFRSKPEINVGF 600
            +CLDDNH+SVVLACAEVIQ +L CNLNE FFD+L+KTSTYE DLYTA +FRSKPEINVGF
Sbjct: 541  MCLDDNHNSVVLACAEVIQSVLSCNLNESFFDSLEKTSTYEKDLYTAAVFRSKPEINVGF 600

Query: 601  LQGGFWKYNAKPSNILPFSEDVGNVEDGEKYTIQDDIVVAQQDILAGLVRMGILHRLRYL 660
            LQGGFWKY+AK SNILP +ED G VEDG KYTIQDDIVVAQQDI AG+VRMGIL RL YL
Sbjct: 601  LQGGFWKYSAKSSNILPITEDFGIVEDGVKYTIQDDIVVAQQDIAAGMVRMGILPRLVYL 660

Query: 661  LEAGPSVALEECILSILIAIARHSPTCAQAIMKCERLVGLIINRFTMSDKIDILSFKIKS 720
            LEA PSVALEECILSIL+AIARHSP CAQAIMKC+RL+ LI+ RFTMS+KIDILS KIKS
Sbjct: 661  LEADPSVALEECILSILVAIARHSPICAQAIMKCDRLIELIVQRFTMSEKIDILSLKIKS 720

Query: 721  VVLLKVLACSDRNNCVAFVKTGAFPTMIWHLFHYITSIDHWVKSGKEKCKLSSALMVEQL 780
            VVLLKVLA SDR NC AFVK+GAF T+IWHL+HY +SID W+KSGKEKCKLSS LMVEQL
Sbjct: 721  VVLLKVLARSDRKNCFAFVKSGAFLTVIWHLYHYTSSIDQWLKSGKEKCKLSSTLMVEQL 780

Query: 781  RLWKVCIQDGYCVSYFSDVFPALCLWLSPPNFDKLVENNVLREFATICTEVYHVLEALAR 840
            RLWKVCIQ GYCVSYFSDVFP+LCLWL+PPNF KL+ENNVLREF TI  E YHVLEALAR
Sbjct: 781  RLWKVCIQYGYCVSYFSDVFPSLCLWLNPPNFGKLIENNVLREFTTISMEAYHVLEALAR 840

Query: 841  RLPNYFSQKHLDSQELGLAGNESEIWSWSCAVPMVNLAVKWLESKSDPFISKLFASQKEI 900
            RLP +FS+KHLDSQE G  G+ESE WSWSCAVPMV+LA+KWL SK DPFI K F+SQK I
Sbjct: 841  RLPIFFSEKHLDSQEPGFTGDESEAWSWSCAVPMVDLAIKWLGSKKDPFICKFFSSQKGI 900

Query: 901  RSGFEFEGISLAPLLWVYSAVMKMLSQVFERIVPQDIMSLEGSGQIVPSLPEFIPRVGLE 960
            R+ F FEGISLAPLLWVYSAV KMLS+V ERI PQDI++  GS QIVP +PEF+P+VGLE
Sbjct: 901  RNDFVFEGISLAPLLWVYSAVFKMLSRVVERI-PQDILTQIGSDQIVPWIPEFVPQVGLE 960

Query: 961  IIRNGFLSFPGAYDKKPETYPFVGNSFVEDLCFLREHGEFETSLASVCCLHGLMLSIMNI 1020
            II+NGFLSF  A D  P+T P  GNSFVEDLCF REHGEFE SLASVCCLHGLMLSI+NI
Sbjct: 961  IIKNGFLSFADASDMNPKTSPSGGNSFVEDLCFWREHGEFEMSLASVCCLHGLMLSIVNI 1020

Query: 1021 DRLIHLAKTERHGFPFRDYNGSREGEILMVGMFKASLIEQRSVLNLFTKVIALESDSLQL 1080
            D LI LAKTE   +P +D N SREGEIL VGMFK SL+EQRS+L+LFTK IALE DSL+L
Sbjct: 1021 DCLILLAKTESQAYPPKDVNSSREGEILRVGMFKTSLVEQRSMLDLFTKKIALECDSLRL 1080

Query: 1081 IETFGRGGPAPGVGTGWGVSGGGYWSPAVLLAQNDAAFVMFLIQAFQTVPTLNILTAQES 1140
            IETFGRGGPAPGVG GWGV GGGYWS AVLLAQND+AF+M LI+AF T+PTLN LTAQES
Sbjct: 1081 IETFGRGGPAPGVGIGWGVCGGGYWSLAVLLAQNDSAFLMSLIEAFHTIPTLNGLTAQES 1140

Query: 1141 LTIQSINSALAICLVLGPRDTCLVEKTMEFLIQAPILHHFNFYIQSFIQLNGRVKQFGWK 1200
            LT+QSINSALA+CLVLGPRD  L+EKTMEFLIQAPIL++FN YIQ F+QLNG VKQFGWK
Sbjct: 1141 LTLQSINSALAVCLVLGPRDIGLIEKTMEFLIQAPILYNFNLYIQRFLQLNGNVKQFGWK 1200

Query: 1201 YSEDDCLILCKTLSSHYKDRWLSPKESKSTKNKSNFSDKIFKKSSNSLDTIYEEESDETN 1260
            YSEDDCLI C+TLSSHYKDRWL+PK SKS KNKSN SD  FK    SLDTIYEE SDETN
Sbjct: 1201 YSEDDCLIFCRTLSSHYKDRWLTPKGSKSVKNKSNLSDGTFKSGRVSLDTIYEE-SDETN 1260

Query: 1261 RIAQDCTCLVVQWAYQRLPLPKHWFLSPVSTISDSKYVGLQKSSDAQKIVQDSSDVLEVA 1320
            R+ + CTCL+VQWAYQRLPLP HWF SPVSTI  SK+   QKS DAQ I+Q+SSD+ +VA
Sbjct: 1261 RVVEGCTCLIVQWAYQRLPLPGHWFFSPVSTICYSKHASRQKS-DAQSIMQESSDLFDVA 1320

Query: 1321 KSGLFFILGVEAFSTFLPDYFPSPVQSVPLIWKLHSLSVVLLAGMGVLDDEKSRDVYEVL 1380
            KSGLFFILG+EAFS+FLPD FP PV SVPLIWKLHSLSVVLL  +GVLDDEKSRDVYEVL
Sbjct: 1321 KSGLFFILGIEAFSSFLPDDFPKPVLSVPLIWKLHSLSVVLLTDIGVLDDEKSRDVYEVL 1380

Query: 1381 QDLYGQCLNEARYSRLSERIQEKNATDLPSQPENK-SNLEFLMFQSEIHDSYSTFIETLV 1440
            QDLYGQ LNEA   R    I EK+A  LPSQ ENK SN+EFLMFQSEIHDSYS FIETLV
Sbjct: 1381 QDLYGQRLNEAMSCRHPADIVEKDAKHLPSQLENKRSNIEFLMFQSEIHDSYSLFIETLV 1440

Query: 1441 EQFAAESYGDILYGRQIVLYLHRCVEAPVRIAAWNALNNARVLELLPPLEKCFVDAEGCL 1500
            EQF++ SYGD+LYGRQIVLYLHRCVE+  R+AAWNALN+ARV ELLPPLEKC  DAEG L
Sbjct: 1441 EQFSSVSYGDVLYGRQIVLYLHRCVESQTRLAAWNALNSARVFELLPPLEKCLADAEGYL 1500

Query: 1501 EPIEDNEAILEAYVKSWVSGALDRSASRGSVAYLLVLHHLSSYIFHSNHVANLLLRNKLS 1560
            +PIEDNEAILEAYVKSWVSGALDRSASRGSVAYLL LHHLSSYIFHS  V NLLLRNKLS
Sbjct: 1501 QPIEDNEAILEAYVKSWVSGALDRSASRGSVAYLLSLHHLSSYIFHSYPVNNLLLRNKLS 1560

Query: 1561 RSLLRDYSQKHQRKEMMSDLILYTAPATYRVAGQKGVCSSIKMSTVEKRLEVLKEACERN 1620
            RSLLRD SQKH RK                     GV +SI MS VEKRLEVLKEACE+N
Sbjct: 1561 RSLLRDCSQKHHRK---------------------GVGTSIGMSDVEKRLEVLKEACEKN 1596

Query: 1621 SYLLTVVEELGSAAKHKL 1637
            S LLTVVEELGS+AK +L
Sbjct: 1621 SSLLTVVEELGSSAKSEL 1596

BLAST of MC02g0332 vs. TAIR 10
Match: AT4G38440.1 (LOCATED IN: chloroplast; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 12 growth stages; CONTAINS InterPro DOMAIN/s: RNA polymerase II-associated protein 1, C-terminal (InterPro:IPR013929), RNA polymerase II-associated protein 1, N-terminal (InterPro:IPR013930); Has 276 Blast hits to 220 proteins in 102 species: Archae - 0; Bacteria - 2; Metazoa - 151; Fungi - 65; Plants - 41; Viruses - 0; Other Eukaryotes - 17 (source: NCBI BLink). )

HSP 1 Score: 1136.3 bits (2938), Expect = 0.0e+00
Identity = 685/1606 (42.65%), Postives = 943/1606 (58.72%), Query Frame = 0

Query: 37   IVGGIVEKGISDSHQNEPFVSLAPPKPSVLPFPVARHRSHGPHWVSVTSKKGGDNIKHDR 96
            +VG IVEKGIS   +N+P     PP+PS+L FPVARHRSHGPH   V S        +D 
Sbjct: 20   LVGSIVEKGIS---ENKPPSKPLPPRPSLLSFPVARHRSHGPHLAPVGSSIAQPKDYNDD 79

Query: 97   QENGEDDESLIEIESIANFANPIQRKNKNSLDFGRWREVVRGHNHDAAN--KEEKKVAGL 156
            QE  E +E  +  +SIA FA P+QRK K  +D GRW+++V G +  + +  ++ +K+  +
Sbjct: 80   QEEEEAEERFMNADSIAAFAKPLQRKEKKDMDLGRWKDMVSGDDPASTHVPQQSRKLKII 139

Query: 157  AKNENLNRAGEAINTVDDTMMSCKPLSADVLAPILMNDEHNSSGFVNDPTGMRTKDSGSD 216
                    + +A  T  +T+++ +            +D+     FV+D            
Sbjct: 140  ETRPPYVASADAATTSSNTLLAAR-----------ASDQRE---FVSDKAPF-------- 199

Query: 217  FVSSTNNAEIKGLDQLCLWKDFQDVDDRSGHVSESVEINEGMPVDGTSLPDMAMGLHHSN 276
                     IK L                    E V +N   P+  ++     +G  H  
Sbjct: 200  ---------IKNLG-----------------TKERVPLNASPPLAVSN----GLGTRH-- 259

Query: 277  PEMVPCFGSNIKGEDAFSTLESQINAENRARIQRMSPEEIAEAQTEIKEKMSPALVKALK 336
                           A S+LES I+ EN A++Q MSP+EIAEAQ E+ +KM PAL+  LK
Sbjct: 260  ---------------ASSSLESDIDVENHAKLQTMSPDEIAEAQAELLDKMDPALLSILK 319

Query: 337  RRGEEKLKKGS-SKPDVSKNSE-LDNLQKEGTFNRYDSLCVENGVTSANSTVKDTKSGLQ 396
            +RGE KLKK   S   VS   E   N + EG F           VT     +   KS +Q
Sbjct: 320  KRGEAKLKKRKHSVQGVSITDETAKNSRTEGHF-----------VTPKVMAIPKEKSVVQ 379

Query: 397  NVSVQKFDLGSSTWSAWNERVEAVRLLRFSLEGNIVESCSFQQSENGDLAVHGYSTENVT 456
               + +       W AW ERVEA R LRFS +GN+VE      +E G         E+  
Sbjct: 380  KPGIAQ----GFVWDAWTERVEAARDLRFSFDGNVVEEDVVSPAETGG---KWSGVESAA 439

Query: 457  SRDFLRTDGDPSAAGYTIKEAVALTRSVIPGQRVLGLHVISNVLDKALLNTHQKPVGSAM 516
             RDFLRT+GDP AAGYTIKEA+AL RSVIPGQR L LH++++VLDKAL    Q  +G A 
Sbjct: 440  ERDFLRTEGDPGAAGYTIKEAIALARSVIPGQRCLALHLLASVLDKALNKLCQSRIGYAR 499

Query: 517  VKDGISIDYNAIWAYTLGPEPELALSLRICLDDNHSSVVLACAEVIQCILGCNLNEIFFD 576
             +   S D+ AIWAY LGPEPEL L+LR+ LDDNH+SVV+AC +VIQC+L C+LNE FF+
Sbjct: 500  EEKDKSTDWEAIWAYALGPEPELVLALRMALDDNHASVVIACVKVIQCLLSCSLNENFFN 559

Query: 577  TLQKTSTYEMDLYTAPIFRSKPEINVGFLQGGFWKYNAKPSNILPFSEDVGNVEDGEKYT 636
             L+    +  D++TA +FRSKPEI++GFL+G +WKY+AKPSNI+ F E++ +    +  T
Sbjct: 560  ILENMGPHGKDIFTASVFRSKPEIDLGFLRGCYWKYSAKPSNIVAFREEILDDGTEDTDT 619

Query: 637  IQDDIVVAQQDILAGLVRMGILHRLRYLLEAGPSVALEECILSILIAIARHSPTCAQAIM 696
            IQ D+ VA QD+ AGLVRM IL R+ +LLE  P+ ALE+ I+S+ IAIARHSP C  AI+
Sbjct: 620  IQKDVFVAGQDVAAGLVRMDILPRIYHLLETEPTAALEDSIISVTIAIARHSPKCTTAIL 679

Query: 697  KCERLVGLIINRFTMSDKIDILSFKIKSVVLLKVLACSDRNNCVAFVKTGAFPTMIWHLF 756
            K  + V  I+ RF ++ ++D+LS +I SV LLKVLA  D++ C+ FVK G F  + WHLF
Sbjct: 680  KYPKFVQTIVKRFQLNKRMDVLSSQINSVRLLKVLARYDQSTCMEFVKNGTFNAVTWHLF 739

Query: 757  HYITSIDHWVKSGKEKCKLSSALMVEQLRLWKVCIQDGYCVSYFSDVFPALCLWLSPPNF 816
             + +S+D WVK GK+ CKLSS LMVEQLR WKVCI  G CVS F ++FPALCLWLS P+F
Sbjct: 740  QFTSSLDSWVKLGKQNCKLSSTLMVEQLRFWKVCIHSGCCVSRFPELFPALCLWLSCPSF 799

Query: 817  DKLVENNVLREFATICTEVYHVLEALARRLPNYFSQKHLDSQELGLAGNESEIWSWSCAV 876
            +KL E N++ EF ++  E Y VLEA A  LPN +SQ         +  NES  W WS   
Sbjct: 800  EKLREKNLISEFTSVSNEAYLVLEAFAETLPNMYSQ--------NIPRNESGTWDWSYVS 859

Query: 877  PMVNLAVKWLESKSDPFISKLFASQKEIRSGFEFEGISLAPLLWVYSAVMKMLSQVFERI 936
            PM++ A+ W+          L     +   G E   +S   LLW+YS VM+ +S+V E+I
Sbjct: 860  PMIDSALSWI---------TLAPQLLKWEKGIESVSVSTTTLLWLYSGVMRTISKVLEKI 919

Query: 937  VPQDIMSLEGSGQIVPSLPEFIPRVGLEIIRNGFLSFPGAYDKKPETYPFVGNSFVEDLC 996
                  S EG  + +P LPEF+P++GL II++  LSF  A   +        +SF+E LC
Sbjct: 920  ------SAEGEEEPLPWLPEFVPKIGLAIIKHKLLSFSVADVSRFGKDSSRCSSFMEYLC 979

Query: 997  FLREHG-EFETSLASVCCLHGLMLSIMNIDRLIHLAKTERHGFPFRDYNGSREGEILMVG 1056
            FLRE   + E +LASV CLHGL  +I++I  LI  A+++    P +    + +  +L  G
Sbjct: 980  FLRERSQDDELALASVNCLHGLTRTIVSIQNLIESARSKMKA-PHQVSISTGDESVLANG 1039

Query: 1057 MFKASLIEQRSVLNLFTKVIALESDSLQLIETFGRGGPAPGVGTGWGVSGGGYWSPAVLL 1116
            +   SL E  SV   F   ++ E   +Q IE   RGG APGVG GWG SGGG+WS  VLL
Sbjct: 1040 ILAESLAELTSVSCSFRDSVSSEWPIVQSIELHKRGGLAPGVGLGWGASGGGFWSTRVLL 1099

Query: 1117 AQNDAAFVMFLIQAFQTVPTLNILTAQESL-TIQSINSALAICLVLGPRDTCLVEKTMEF 1176
            AQ  A     L+  F  +   +    Q S+  +  +NSALA+CL+ GPRD  LVE+  E+
Sbjct: 1100 AQAGAG----LLSLFLNISLSDSQNDQGSVGFMDKVNSALAMCLIAGPRDYLLVERAFEY 1159

Query: 1177 LIQAPILHHFNFYIQSFIQLNGRVKQFGWKYSEDDCLILCKTLSSHYKDRWLSPK-ESKS 1236
            +++   L H    I+S    N +   F W+ SE D   +   L+SH++ RWL  K  S +
Sbjct: 1160 VLRPHALEHLACCIKS----NKKNISFEWECSEGDYHRMSSMLASHFRHRWLQQKGRSIA 1219

Query: 1237 TKNKSNFSDKIFKKSSNSLDTIYEEESDETNRIAQD--CTCLVVQWAYQRLPLPKHWFLS 1296
             +  S       +K +  L+TI+ E+ + +N   QD       ++WA+QR+PLP HWFLS
Sbjct: 1220 EEGVSG-----VRKGTVGLETIH-EDGEMSNSSTQDKKSDSSTIEWAHQRMPLPPHWFLS 1279

Query: 1297 PVSTISDSKYVGLQKSSDAQKIVQDSSDVLEVAKSGLFFILGVEAFSTFLPDYFPSPVQS 1356
             +S +   K               +S+++LEVAK+G+FF+ G+E+ S F     PSPV S
Sbjct: 1280 AISAVHSGK---------TSTGPPESTELLEVAKAGVFFLAGLESSSGF--GSLPSPVVS 1339

Query: 1357 VPLIWKLHSLSVVLLAGMGVLDDEKSRDVYEVLQDLYGQCLNEARYSRLSERIQEKNATD 1416
            VPL+WK H+LS VLL GM +++D+ +R++Y  LQ+LYGQ L+EAR               
Sbjct: 1340 VPLVWKFHALSTVLLVGMDIIEDKNTRNLYNYLQELYGQFLDEARL-------------- 1399

Query: 1417 LPSQPENKSNLEFLMFQSEIHDSYSTFIETLVEQFAAESYGDILYGRQIVLYLHRCVEAP 1476
                  N  + E L F+S+IH++YSTF+E +VEQ+AA SYGD++YGRQ+ +YLH+CVE  
Sbjct: 1400 ------NHRDTELLRFKSDIHENYSTFLEMVVEQYAAVSYGDVVYGRQVSVYLHQCVEHS 1459

Query: 1477 VRIAAWNALNNARVLELLPPLEKCFVDAEGCLEPIEDNEAILEAYVKSWVSGALDRSASR 1536
            VR++AW  L+NARVLELLP L+KC  +A+G LEP+E+NEA+LEAY+KSW  GALDR+A+R
Sbjct: 1460 VRLSAWTVLSNARVLELLPSLDKCLGEADGYLEPVEENEAVLEAYLKSWTCGALDRAATR 1460

Query: 1537 GSVAYLLVLHHLSSYIFHSNHVANLLLRNKLSRSLLRDYSQKHQRKEMMSDLILYTAPAT 1596
            GSVAY LV+HH SS +F +     + LRNK+ ++L+RD S+K  R+ MM DL+       
Sbjct: 1520 GSVAYTLVVHHFSSLVFCNQAKDKVSLRNKIVKTLVRDLSRKRHREGMMLDLL------R 1460

Query: 1597 YRVAGQKGVCSSIKMSTVEKRLEVLKEACERNSYLLTVVEELGSAA 1634
            Y+      +   +  +  EKR+EVLKE CE NS LL  +E+L SAA
Sbjct: 1580 YKKGSANAMEEEVIAAETEKRMEVLKEGCEGNSTLLLELEKLKSAA 1460

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q8GYU30.0e+0042.65Transcriptional elongation regulator MINIYO OS=Arabidopsis thaliana OX=3702 GN=I... [more]
Q9BWH61.6e-1824.65RNA polymerase II-associated protein 1 OS=Homo sapiens OX=9606 GN=RPAP1 PE=1 SV=... [more]
A0JN532.0e-1823.31RNA polymerase II-associated protein 1 OS=Bos taurus OX=9913 GN=RPAP1 PE=2 SV=1[more]
Q3T1I93.5e-1823.60RNA polymerase II-associated protein 1 OS=Rattus norvegicus OX=10116 GN=Rpap1 PE... [more]
Q80TE01.9e-1624.74RNA polymerase II-associated protein 1 OS=Mus musculus OX=10090 GN=Rpap1 PE=1 SV... [more]
Match NameE-valueIdentityDescription
XP_022140384.10.099.88transcriptional elongation regulator MINIYO [Momordica charantia][more]
XP_023512432.10.079.82transcriptional elongation regulator MINIYO [Cucurbita pepo subsp. pepo][more]
XP_022985672.10.079.72transcriptional elongation regulator MINIYO [Cucurbita maxima][more]
KAG7010830.10.079.70Transcriptional elongation regulator MINIYO [Cucurbita argyrosperma subsp. argyr... [more]
XP_022943843.10.079.51transcriptional elongation regulator MINIYO [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
A0A6J1CFK30.099.88transcriptional elongation regulator MINIYO OS=Momordica charantia OX=3673 GN=LO... [more]
A0A6J1J5I20.079.72transcriptional elongation regulator MINIYO OS=Cucurbita maxima OX=3661 GN=LOC11... [more]
A0A6J1FXF40.079.51transcriptional elongation regulator MINIYO OS=Cucurbita moschata OX=3662 GN=LOC... [more]
A0A1S3BKC40.076.38LOW QUALITY PROTEIN: transcriptional elongation regulator MINIYO OS=Cucumis melo... [more]
A0A5A7V3U30.075.40Transcriptional elongation regulator MINIYO OS=Cucumis melo var. makuwa OX=11946... [more]
Match NameE-valueIdentityDescription
AT4G38440.10.0e+0042.65LOCATED IN: chloroplast; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 12... [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (Dali-11) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR013929RNA polymerase II-associated protein 1, C-terminalPFAMPF08620RPAP1_Ccoord: 419..499
e-value: 5.2E-16
score: 58.3
IPR013930RNA polymerase II-associated protein 1, N-terminalPFAMPF08621RPAP1_Ncoord: 297..340
e-value: 5.6E-15
score: 54.8
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..20
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 336..358
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 336..356
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 79..103
NoneNo IPR availablePANTHERPTHR47605TRANSCRIPTIONAL ELONGATION REGULATOR MINIYOcoord: 1..1632
IPR016024Armadillo-type foldSUPERFAMILY48371ARM repeatcoord: 640..854

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
MC02g0332.1MC02g0332.1mRNA