Chy1G021000 (gene) Cucumber (hystrix) v1

Overview
Name: Chy1G021000
Type: gene
Organism: Cucumis hystrix (Cucumber (hystrix) v1)
Description: pre-mRNA-processing protein 40C
Location: chrH01: 25723734 .. 25738528 (-)
RNA-Seq Expression: Chy1G021000
Synteny: Chy1G021000
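
The Location field in the overview can be split into chromosome, start, end, and strand, and the genomic span follows from the two coordinates. The snippet below is a minimal sketch, not part of the database record: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which is common for GFF-style annotations but is not stated on this page.

```python
import re


def parse_location(text):
    """Split a location string such as 'chrH01: 25723734 .. 25738528 (-)'.

    Returns (chromosome, start, end, strand). The field layout is an
    assumption based on how the location appears in this record.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end, strand = parse_location("chrH01: 25723734 .. 25738528 (-)")
print(chrom, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 14795 bp on the minus strand of chrH01; with half-open coordinates the figure would differ by one.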
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTAGGAATATCAGGGCCATCTTCTGTTTTGAATTTGATCAAGAAGAAATTGCAAGACACTGGAACTCCTGTAGCTTCCTCACCTATTTCAGCTCCAACAACAGCTCAATTAGATCTAAATCTACCAAGAGATGTTGATGTTGCACTTAAGGCACTGCAAAAAGAGAACGGCAAAGATAAACGGAAATATGCTAATGCTGATGGAAATGTATCTGACTCCTCTTTGGACTCTGAAGACGTAGAAAGTGGGCCAACTGATGAGCAATTAATCATCCAGTTTAAGGTATCATTTAACTTCCCATTTATTGAGGATCATGTCATACTCTTTTTGACGATGGTAATTCTAGATGGAGTTTTTGTCCTTTTCTTGTCTAAATCTTCATTCTGAATATCAAGTTCATTCTGCATTTGATTCTTTCCTATGCATGCCAGGAAGAGGATGGCACAGATCCTAAGAGTATTGCAGAAGTGGAAGCTGAAGAGGACGAGGATGATTTCATAATGGAGGAGGTAAAGAGGAGACTGAAGGACCTGAGGAGGAACAGTTTCATGGTTTTGATTCCAGAGGAAGAAGAAGAAGAAATCGAAGGAGGAGAAGAAGAAGAAGTAGGTGAAGGGGAGCCTGAGTGGAGAGACGTGGAAGCAGAAGGTCGACAATGGTGGGGAGGGTTTGGTGCTGTTTATGATGATTACTGTGAGAGGATGCGTTTCTTTGATCGCAAGAGCATTGAATCTGGTATGTTCCCAAAATCAACGAATCATTTGAAGCAAGTGAAATTATGTTGCTCTTACATTACATTGCCATTGTCATTGCGTTTTAGGTCCTGCATCAACCTCCCAAAGATCTGCATCGAAAAAGAGTGCATCCCCTCTTCGGTGTCTTTCTCTGAAGAGGATTGAAGAACCTGAAGACGAGATGGAGGATGTCGATCCTTCATTGACTCCGATTGACTCCAATCACCACATAGAAATAGCGTATGTTGCTCACATTTGCTTGTCCTGGGAGGCCCTTCACTGTCAGTACACTCAACTTAACCACTTAATATCATGCCAACCCCAAAACTCTACTACTCATTATAATCTTACTGCTCAGCTCTTTCAGCAATTTCAAGTCCTCTTGCAAAGGTTTATTGAAAATGAACCCTTTCAACAAGCTCTCAGGCCTACAATTTATGCCCGAACCCGTCGAACTTTTCCTAAAATGTTGCATGTTCCAAACATACAAGGTATTGTGAATACGCAGATCCCTCCGAATTTGTTTAGGATACCCTGTATATGATCCGACTCTATTCTTGTTTCTTGTTCAGCTTCAGATCCAAATGGGGTGCAGGAACAGGAATCTGATTCCCTCATCCTCGCTCCTGACCTGCTGTTCATTATTGAGGCTTCAATCTTTACTTTCCACCGCTTCCTGAAGATGGAGAAGAAAACCTCAACTTCTGCTTCTTTATCATTTCGGAACCACACCCAGGATGCTGCTCTGCATGCTCGTGTTCGATCTTCTCTTGACAAGGTATGAAATTCAAGTGTGCTTAACTTTGTTGAAATGACAAATGTGACAATATCAACTCGATGATACCTAAATTGGTTATAAGACAAATACCTATTTTTCTTTTTTCTTTCTTTTTTTTTTTTTATAAGGAAGACAAACACCGTTTTCAAAGAACCAATTTAGTGCCTATAATTTAATTATGCATCGGCACTAGTTATATTCTCTGTAAGAGAAATAAGTGGATTTCAATGACTATGACATGATCATGTATCCAATAATTGAATTTGGCGAATAATATTAAATGGATTTGAGTTTTTGCAGAAGAAAACAAAGCTGAAAGAGGTTAGGAAGAAAAGTAAAGGGTGGAAACAGAAAACGTGTCCCCAAACGTATGAAGACATGCAATTACTTTTTGGAGTTGTGGACATTAAAATCATAAGAAGGCTTCTTAAGATGTCGAGGATTACTAAAGAGCAGCTGCTTTGGTGCGAGGAGAAAATGAACAA
GTTAGATGTGTCTAATGGAAAATTGCGGAGAGATCCGTCTCCCCTTCTTTTTCCATGTTAATTACTTCTTTTTCTGCTGCTGCACTGCATCACAAAGTTATTGTATCTGTGATTATGTTTTATGTAATTGGATTATGGCTGCAGATGGACCCTACTACTAAGATGAATCTGAATTTTTGGGGGTTTTCGAACTCCTTTTGTTTTAGTGTTCCCAATTCATCTTTGGATAAGCTGAAAGACTCTTCGGTTACCAACTCTGACCCCTTTTTTTTGTCTTTACCAGGGGATGAGTTTATTTATTATATTGTTATTATTATTATTATTTTATAAAACCGTGGCAGATGGATTGAAACTCCAACCACCAGGGACGGAGGTCGTGCTAGTGGTACTATTGAGTTAAAGTTCACTTTGACGATGGTGAAAAAACTTAAATGCATGTATGTTATATTTCTTTTCATGGTTAGATTTTTGTCTGGGTTTCTTTGGGTGGATTTCAATCTTGACTATTCATGTGGACTTCAATATTATATTAACAGAGATAATGTAGGTTGTAATTAAAACTAATTTGACAATAAGAAGAGTAACTTTTCTAATTTATAAAGATTCTGAAGCATGTCTTGAATCCCACGATAGAAAGGTAAAGGGATATCACACTTTCTATAACACATTCAATATTGTTCTATTAACAGTCAATTAATTTTAAGATAAAACGTCTTTAAATCTAATATAGTATTATAATCCATGAAACTCAAATGGATATTTGATAAAATAAAACCTGGGATGAGAATCATAATGAATCAAGAAAACATCGTTTCTATAAGATACGTTGAAAATTTCAAGGTTAATATAATAGTAGTTGAGCTTATGATACTAAACTCTAAACTCTCTTATTTAGTTTATTTTCAATGATTAAATGGCTTTTCATATCACTTCAAAAAGCTTAAATTAGCAAAACAATGAATTGAGATTAAAAAAAATGTAAAAAAATGTAAAGTCGGTCTTTTACCACAAAAAGTAAACAAACAAAATTAACTGTCTTAATCACTACAATCTTGGTCATGCATATATATATATATATTTGTCCTTAAAATCATGAGTCAATTTCAAGTTGTCTCTCTATTTTGTATTTAACTCAAAATACAGAGTTTTGTATAAAATATCAAAGTTTGACTGATCTTTAAACTTTGGGCTAAGTTTCATTTTGTTGGGCCATTAGAAAAATCCAACCAATATATGAACCGACCGGCTTTTTTATACTTGAACCGACCGGACCGAACTAGATCTAATTTTAGGTCATCGTGAAACGCTCAGGCCCGAGATCGCCATTCGCCAGAAAGAAAAAAAGAAGGAACAAAAAAAGAAGGAAAAAGAGAAACAAAGGAATTCTCACAAGACGACCACGAATAGGCGTTGCAAAAACGCGAACAGGAGAACGACGACCGGAGAACAGCCATTGCACTATCGTGCGAATATCGGAGGTAGATCGTGAAGTACATACACATGGCTTCTGGGTATTCTTTATGGTTGCAGGTTATCGGGCCTGTTACTGCTTAGCTACATTTCATTTCACCATGACTTCAGCTTCAACTGTCTCCCAATCTGTGTCACTACCTGCTCCTCCTACTTCAAATTCGGCTGCTAATGGTTCTTCAATTCCCAATTTGATCCCCTCCACTTCACCGGTTCCTCCTGCCCCATCTTTCCATATTCATCAACTACCGTCCCTAGCTCCGATGGTTCCTGGTCCACCGGGAATGTCACCGTCGATGCCACTTGTGTCCACGGGTCCGGCAGTTTTGTTTCCGCCCACCGATTCCGCTTCTACTATTCCAGGACCCAATATGCATGCAATTCATAACCCAATTCATCCTTCTGCTCGTCCACAAATTTGTGGCTCCTATCCTTCTCTAACTCCTGTTGTTTCTCCACCTCATGCGATGTGGTTTCAGCCTCCTCAGTTGGGAGTCATGCCCAGGCCTCCCTTTCTGCCGTACTCTACT
TCTTATCATGGCCCTCTTCCTTTTCCTGCTCGTGGAATGCCCCTTCCTTCTGTTCCATTGCCGGATCCTCAACCTCCCGGTGTTACCCCTGTTCCAGTTGCATCTGCCATTGCTGTGCCATCTGGTCATGGAAATCAGCTTATTGGCAATACATTGATTCAGACAGACTCAAATCATCCCGAACTTGGTATGCGGGCTGTCAAATTATTATCTTTATATCATGGTAAAGTTATTTATAATTTGGTTTTTATATGCAGCTAAGCTTTTGTTCTACATTCTTTGTCTGACCATTTAACAGATAGCCAGAAACACGCTCAAGGTGTTGGGCATTCTGAGAACATCTCTTTAAATAAGCACTCGGAGGATTGGACTGCCCACAAAACTGAGGCAGGAATAATTTATTACTATAATGCCTTGACCGGTGAATCTACCTATGAAAAACCTTCAGGTTTCAGAGGGGAGGTATTTTTCGGGAAGTTCTCTATAGCTTCGAATGCAAATAAATTTCGTTTTGCCTTCTTGTTGTTTACCTGTAAATATCCCTACCTATTTCCCACCCCTCTCCGCTTTAGCATCACTCCATTAGTTTTTTTTGTGGAACACGGGTTGAGTAACTGGCTTAAATGAACATTTGTTATCATCGTGGTTCATTATTGAACTTTATGATAGGCTCGTATGTACGTTATTATAAACAATGTAGGAATGACATGACCGTGTTCTGGCTTCTTTGTTTGTTTATGTAGAATTGAAGGTTTTTTTTTCCTATTTTTTATTTTTAATTTTCAATTTTGGTGCATCTCATTCGTTAATAGTTCTGTATACATATATGGAAATTTGATGTAATTGAATTTGTTGCAGGCTGAAAATCTCATGGCGCAGGCAACATCAGTTTCAATGTAAGTAGCCTTATTAACCATAATGGAAACACGATTGTTCCATATGAAACTGAAGCATGTTTACTTATTTTATTAGTTACTTCTACAATAGCAGTAACTACCTGCTTTGCAGCCATGTATTGGAGATCGTTGTTATGCCACACGACTGTAATTTTTTAGGCTGTGTTGTTACAATCTAAATATCAATGCATAAAAATTGTCAGCACTGGTCTATGATGGTTCATTTATGTTTTTGCAGTATTTTGGCATCAAAATTAGACAAGGTCTTTGTAATGGGTTGCTTATATGTCATGTTTAATGCTTTGTTTTGGGCTTCCACCTGTTCAAAATCAAGAAATTCCTTGATCCTTTATATGTAAATACTTGGAGTAGAGGGACCTATATTTTGATGTTCTTTAGGAGGTATCCTTGTTTTGTGGTCGTATCCCCAGTCTGTTGATGAGGATATTGTTGTGGGAGCTTTCTCTATTTCTACGTTGGCAATGATGTCTCTGGTAGAAATGTTCTGTCTCTTGATGGCTTCTAATGAAGGTAGATTAGACTTTTGATTGGACTTTGAAATGATTTAGATATTTGGAGTTGGTTTTAGTGTACGCATGTGAGCATAGGACTGGTAACTAAGGGAGGTTTAGGATTTGGACGCTTAAGACTATGATTGTGGCATTTCTTCGTGGAGCCTTACATGCTGTGGTCTAGTTAACCATGTGTACTACTTGCATGTTCAAGCTTTCATGTTCCATCCATCTAAAATTGTTAAATCTTAAGCCAAGCTTAAGCGGATGAGTGGAGGCAAATTTAATATTACATCATATAAACCCCCCTCACTTTTGGGCTTGCAACATGAAAAAAAGCCAACAAGTGAAAATCAATATTAATTGGGGAGGATTAACACACATGACCTACCTAGACCACTTGCTCTTATACCATGTTAAATCATTGATTTACCCGACAACTTAAATAAATTTAAGCTGATGGGTGAAGGAAAATTTAATAATTATATCATTGAGTAAATAAAATGAGACTATTACTTAGATACAATGAGCAAAACCCAAAAACCAAGGATCAAAATGTGCGCCCGTACATTTCAACTAGGTTGACACTTCC
ATAGCACCATCACATCACCATATCCTAATACATACTATCAAGGTATTTTTATGAGTGTATTTTTTTATATCCATGAGTGTCCAGTCCAGATTACACGTACCTCGGCTAATCTCACGGGACAATCCGCTCAACTTCACAACATTTGGGTGTCAAGGGAACTCGTAGGAAATTAATCCATAGTAGGTCATTGAGACTATGTCTCCTTTTTTACCATTAGGCCAACCATGATGGTTGTGTTCTTATGGATCCTCTTAATTTATACTAGTTCTTCATTTTGGCTTTAGCATAACTTTTGGGCTAATTTTGTTTCCTTCTAGATGTTGACTTTTACTAGTTTTAACAAGATCATTTTCTCCCAAAATTTTTGGATTCATCAATTAAGCTATGCATTAGAGGTTTAGATCATGGGATGATCATTTGTAAGGTGATCAATATATGGTAAATCTCGTCTCAAGGCTTCTTGGCGTTCTTTTTCTAAAGGCTTCATGGGATACTTTTATTTAGGATATTGGTGTTAATTAGATTTGGAATGCTTTTATGTTTCCCTTATAATCCACTTAAGTTACGATGTTTTGATTTGATTGTTGCTTTATTGTTCTCATTACTTCTTGGTTTTGTATTTTGGAACATTAGTCTATTTTTGTTATAACAATGATAAATATTTTTTCGCACATGTGTATATAGGCCAAATTGTGGGTTTCTAGTCTTTCTAGCGATTGAATGTGCTTGTGTTCCATTGTCTATAGGTCAAACTTGTCTGGTACAGATTGGGTTTTGGTTACTATGGGTGATGGTAAAAAGTACTACTACAACAACAAAACGAAGGTATGCTATTTTTCTCCCTCTTCAAAATCACCCCCTCCACCTAATTGGGAACTGAACTCTGTATTGAGCTTTCTTATATGTTTGGAAGTTTCCTGTCCGAACTAAATGTATGGATGTATATCCCCTCCTCTTAACCATTTGTGCATTGTGATAAAATGGTCCCCTGGTGTTAGTTCTTAAAAGCTACTTGAATTCGTGTGTTTGGATCCTAATGAAATGTGATCTCTGGTTGGATTTCATATTTCTTTTGGGCTTCGATTACAAAGACCTTTTGTAACTAGCCTATAGGTTCTATTTTGTTTATATGAACTCATTGGATGAAAAATGAACCTCAAAACCATGATATGGATTCTATGTCTTTTACTCAATGCTGTGGATTCTAAGGCTCTTATTCTGATTTACATCAGTGGTAAATTCTAGCCGTGAACTTAATTTTCATCTTTCAGTTGTGCCAATTTGACTCTCTCTTTGAATCAGATTAGCAGTTGGCAAATCCCAAATGAAGTGTCTGAATTAAGGCAACAGAATGATGAAAAAACAAAAGAACTTTCTGCTCCTTTGCCAAATAACAATGCATCGACCGATCTAGGAACTTCCTCTAGCAGTATCAATACTCCTGCCATAAATACAGGTGGTCGTGAAGCCACACCTCTTAGAACAGTAGGAATATCAGGGTCATCTTCTGCTCTGGATTTGATCAAGAAAAAATTGCAAGACTCTGGAACTCCGGTAGCTTCCTCACCTATTTCTGCACCAACAGTAGCTCAATCAGATGTAAATCTGCCTAGAGATGCTGATGGTACAGTCAAGGCACTGCAGACTGAGAACAAAGATAAGCCAAAAGATGCCAATGCTGATGGAAATGTATCCGACTCTTCCTCAGACTCGGAGGATGTAGACAGTGGGCCAACTAATGAGCAATTAATTATCCAGTTTAAGGTATCATCTAACTTCCCTTTTATTGGGGATCATTAACAAAATCTTTTTAACAGTTTGATAATATTTGGAGTTTTTGTTCCTTTGGTGGAAGTTGCTCATTATGTTGTAATTGCAGGAAATGCTTAAGGAGCGAGGAGTGGCACCATTCTCTAAATGGGACAAGGAATTGCCGAAGATAGTTTTTGATCCCCGTTTTAAGGTTTGTTGAGTATAAAGATTTGAATTACTTTGTTCGG
GAATAAGGGGCTGCTACCAGTGGTACCCCTTGATACAGGTTTGTTTGATTTTGTTTGTCTATGCTAGACTACTTCCATACATTTGAGTTCTTAGTTGACGCATGACTTTCTTCTGTATAGCTTCCAGCAAGTTTGTTCTTTTGTATTACTCGTCTCTTTATGATGGTAAAAAAAAAAAAAACTCTTTTACTTTTTAAGAGTTAAGATGTACTTGTTTCCATCAATCTCTGCAACATTTTGTTATTTGTTTTTTTGGGACAGTGTCTGTGATCTTGGTGGTGAAATTGTTTTATCGTTGGGGATGTTGAGAGTCCCACCTTCGAAAAACCAAGGAGACTCACATTCCATATAAGATAGATTGACTACTCCTCTCATAGCCAATTAGTTTTGGGATGGAACCTCATACTAGCAAACATTCATGATTTTCATTTTTTTAGGGACCAGAACATGGTTTTATTTTTGAGTTTCTTATTGTATAATTCCCAAAGATACAACAGAAAACTGATTTCAAAGCAAAAGGAAAACAGAAAATGATTACACAGGAAATACAAAAGCTCTCAATTCCAATTGATTTCCGGTGGAAATAAACCGCCCAAACAATGAAATGGAGAACTCCAAAGAGAAGTTTTGAGATGAGCTAATCAATGAAATTTGTTTCTTATAAAAAGAATGGTCCAACCAAGAACAATATTTATCTTCAAAATCAATGCCTTCTTAGCACTGACCCATCCTATAGGTTCTATTTTGTATAGTTGGAGTCCCGTCTTATAGTTGGGCTTGTGGGCTGGTTTTTTGGTATGCCTGTGTTTTTTTTTCATTTTATCTCAATGTAAACTTGTTATGAGAAAAAAATGTTAATTGTTTTGAGAATTATTTATTCAATGACAGGCTATTCCCAGTTATTCAGCTAGGCGGTCCTTGTTTGAACACTATGTTAAGACCCGTGCTGAGGAAGAACGCAAGGAAAAGAGAGCTGCTCAGAAGGCTGCAATAGAGGGATTTAAACAGTTACTGGATAGCGCATCTGAGGTTATCTCTTTCCACATTGATTACCCTGTCTTGTATTGCTATATTTGTAGTTGGATCTTTTGAATTAACTTAAAATTGAGATGGTTGAGTTCAAAACCTCTTGAGCAAGCTATGATTATTATTATTATTTTTCGTAAACATAAGTTCAGTTTGTTTCTCTTGTCAGAATTTAAAATCCCTCTAACTAAAGTCATATATGTACTTAATTCTGATTTCTAACCAATTTGTATGTATTATTTATTTTTAGGATATTGATCACACAACCAGTTATCAAACATTAAAAAAGAAATGGGGGAATGACTCGCGGTTTGAAGCTTTGGATCGTAAGGATCGGGAGAATTTATTGAACGAAAGGTGCCATTTAATGAGTAACTAGAGCATTTGCAACTCTTTTTGTTGCTTTCTTCTTGAAATTTTAATGTCATAATTCACTCCTCAAGTTTATAAAGTTTATTAACATGGTTGGCAATTGTTCCTTCAGTCTGTCAAAATATGGCTTTGTCAGGTGATATTTTGATGTCAAAGTTGAGCACTGAAATTTCTATAGAGAAAGTGCATGGTAAATCACATAGAACGTCTTAATCCAAAGCATCTTATTGCCAAACATCATATTCCCGTGCATACTTTAAAGAAACAGCCCCTCAACTTGCAGCTTACCAACTTACTTGCTCTAAGAATCTTTATTTGACCAAAAAAGCTTGCTCACCCTCCTTCACCCCTGGTCATCCGTGAAAGTTTGATCTGTTGGCAACGTGGGAGATTGTTAGAGATCTTAACAATTTTGATGCACGAACTAACAATAAGAGTGTGAAAATAGATGTTCGACTGCGAATTTGACATGTGAAGAGTGATGAAAGTTGTGGAATCATATCATCACAGGAGTAGACAGTCGTGATAGTGTGTTTGATAGACTCCATGGATGACCTTCTGCAAGCCTCAAAGGCCCCCCAAAATGGATTACATTTGGGGTT
TGTTTGGATCCAAAAACTAAAGAAAAGAGGCTTCCTTGAAACCTCAAAACCTTGTTATCTCACCTGGAGAAGACGGGGAAGGAAGGGTGGACTTTAAAAATATCCGTATAATCTTTATGAATGGAGTGGATCCAAATCAGGTCAATGTTGAGGGAATGAAGAAGTAAGTAAGACACCCTCTAATCGAAGTGTGTGAAGATGGACAGAACACTAGCATGTCTTTTACACAATCTAATAAAAGCCTAGTTTTCTCTGTGGTTTAACAGAGTAACGTGAGGAAACCAGTGTTGTGATATTAAATCAAATGGCCAAGCTCAAGACTCAATGAAAATCGGCAAATGGAAGCAAAAAGGAAAGGCAACACAATTGCCCAGTCCCTTAAAGCTAAGTCGAAAGTAAAATGAGCCCATCGCCAAAGGAATTACTTAAAAAGACGGTGCCCATCTGCTCCAGATGTTAAATATGTTTGGATTATCTTGAAAATGACTTCATTGCAGTTAGATCTTTTGTAAAAGATCTATGTTTAGCATGCCTCTTATTTAATTAACTCAGGTTTGACTGTTTGGACCAACATCACAGTCAAACAAATGTTGGATCATACTTATTGTTTATTTTCTGTAAATTGAAACTTTGGCCTTTTTACATATTGTATTAGGAGCTTTTTATTTTAAATGCCCCACGTTGCCATTTATTTTTTTATGATTTTTTATGTTTTGTGATCATGCCTTTTGGAATTTAAAATTTGGTTGCTGTCATGCCTTTGATATGGCTGCCCTCTTCTGATATGGTTTTACTCTAGGTGGATTCATTTCCAAATGTAATATGAACTGTTTTCTACACAGAAAGTGCTTACTATTGCTAACCCCTGAAAAAGCAACTTGAGTCAACAAGGTAGTAGGCTAGTAACCAGAATCTCATATTTATGTTTGATCCGTCCTTTAACTTTAATTCATTGACTTGGGTTTTAGCGGGATAATACCTCTATTGAACTTGTAGGGTCCTTTGTCTGAAGAAGGCTGCCGTCGAAAAGGCACAAGCTTTATGGGCCGCTTCCACCACTAGTTTCAAGTCCATGCTGCAGGAGAGAGAAGATATCAACATCAATTCCCGTTGGTTCAGGGTGGGTGCACCATGCTATCCTTAATTGGTAGTTCATTTTCTGAATAGTATATGATTGTCTATCCAAAATACCAACGGTTTTTGAATTGTAAGATAAGGTTGTAGAGAGAATAAACCTGATAAATTCATAGGTTTTATTAATTTTCAACTATGCTGCACCAGCTTTGGGACCCATAAACTTGATGGAAGATTTTAAGTAATTCTTTTTCTGATCCAATCAGTTAGATTGAGCGAGGTAATAGTCTTTGAAAAGTTAGATGTTCATTTGCAGTAAATAGTAATGAACACGAAAACACTTTGGACTACAACGCTACATTATTCTGAAAGAGGTGTAACTATTTTCCACTCCTCCCCGTCTGTCGTGTTATAGGTAAAAGATAGTCTACGGGAAGATCCAAGATACAGATCTGTTAAGCATGAGGAGCGTGAGATGTTATTCAATGAGTATATATCTGAACTTAAGGCTGCCGTGGAGGAAAAGCAGCGTGAATCAAAAGCTAGAAAGGAGGAGCAGGTACGCTAACTTGAAATTCTGATTCCTTGTTACAAGAACCTGCTTTTATCATTGTATGCTAGTCTATCCATGTTGGTTATTTGGGGTAGTGTTTTCAAGGTTCAATAGTTGCAAGAGCTGAGAGGCACGGTGAGGTGATAGGGTGACGCTCTGGTTGATAAGTGGCACCTAATGGCCGAAGCGCAAGGAACTTCTTTAACATGTTCACCGTCTTCACCTTTTTATTGCCACATATTTGTTCATGGCCACGAGCCCTTGTCCATGTCAAGGATTTTTTAGTCTAGGGCTTTCATCGTCCTTCTTTGACTTTGACCTTTGATATAATCCAACATCTTGTTTTTTTCTGAGAAATGATTTTTGCCGTAGC
TGTTGATTTACATCACAATCCAAGATCTCAACTTAAATTGGAGAGCTTTTATCTTTCCACCTTAATTCAGTTGCTTTTATTTTATTCTGTTGATACCTTCTCTTCTCTTTATTATGGTTACTTCTCTTTGTACTAAACTCCTTACTAGTCATGATCTAGTAAAGATTCATTATCATGTCTTGTCTTAGGTTGAGTTTATTTTTTCCTTTTTGCTTTTTTTCTTCTGGATATGACGAGGTCGCCAAGGAGGCGTCAACCTAGTTGAGATGTTTGGGTGCGACTACTGATTCCTTTTTTAGCTCTTGTATAATCCTCTTGTACTTTAGAGCGATCGTTTCTTGTACTTAGAGAGATCCGTTTCCGTTTAAAGAAAAATGAATGGTAGCCACTTTTCTTTCATTCTCAACTAACATTTTTTTATTTTTTGAAAAGGAAACATAAATTTTCATTATATTAATAGAGACTAATGTTCCAAAATACAAGATCTCCAATTAGAAGACAAATACAAGAGACAAACACGACTCATGATGATAAGTAATAGAAGCCAAATAAGACAGAAAAGTCTTCAACTAAGAAGCACAGAAATTTAGAGAAAGAAGTCGTAAGAATCTAAGAGACCAATAGAAGCCAAACAAACAAGATCAAAACAACAAACCGACAACCGACAAAATTGCATGGTGAAAGACCAAGAAACCCCACGAAATGATGTAGTGGGATTGGGTAATTAGGGTATCTTAGTCAACTCCATGATTCATTAGGATTATGATTAGTTTTCATTGCGAATGAAGATAAGTTTCAATTTCCTTAACTGATCAGGATTAGGATTAGTTTTAGCTTGTTTTTCAATTCTTTACAAATTGAGTATTGTCTTCTTATTGTTCATTACTTTTGTTTCATAATAAATCCTTTTCTCCAAGGAATTTATTTACATCATTCGGTGTGGTAAGCAAAATTAACCACATGGTGAAATGGGATGCTCATACTAAATTTCAATGTTGAATGCTCATACTAAAATAACCGTCTGGTCTATAAAACGTAACTTCCATGTTGATTGCTCATAATAAATTTCAAATGCTTATTTCAGGAGAAACTGAAGGAAAGGGAGAGAGAATGGCGGAAAAGGAAGGAAAGAGAAGAGCAAGAAATGGAAAGGGTACGCCTAAAAGTACGGAAAAAGGAGGCAGTTGCATCTTTTCAAGCATTGCTTGTTGAATCGATCAAAGACCCTCAGGTAAATCTTTTGGACCATGATTACGTGGCTAGTTTTGGAAAAATATAGAAAGGAAAGAGAGATAATGGAGGAGGGTAATGGAGTAAGAAAAATGGAGCGTGGTATGCTTGATCAGACAGAATTTAGTAGTGGTTGAGATGCGCTGGTGAATTTAAAAGCATATAATATGATTATAATTTTAACTGAAATTTGTCTGCCATAAAGTTAAGCATAAGTTATTGGATCATTTATTTTTTTTATTAAACATCCAATCGATTCCTCCCTGCTAATAAACAGTAGTCAAGTACGTTCTGTGGATGGTGAAAGTAGTCAAATACGTTCCAGATTGTTTGATTCTGCAGTGTCAACTATGTAGAACTTTTTTTTTTTTTTTTTGGCTTAGGCCTCTTGGACCGAGTCAAAAGTTAAACTGGAGAAGGATCCACAAGGTCGTGCATCCAATACTGATTTAGATTCATCTGAGACAGAAAAATTATTCAGAGAACATGTAAAGATGCTGCAGGAGGTAATTCATCTTCAACTTGCTGATTATTGATTTTTCAAAATAGGTATATAGTCTGAAATTGAGCAGATCTTTATAACTCAAAGACAGTAGTTCAACTTATAGAACAAATACTGTTGAAGTGAAGTCGTTTAGATGGCTTTTTCCAGCCTTCTAATATAAATAGAGGCAGTCTAATGTTTGTTTCTCTTTCTCCCATCTTTTCCCGTTATCTCCTGCCCATATATTTTAAATTTTATGAAAACTATATATGCCTATCTTATTGTT
GTCCCAAATGTTTTATTATCACGTTATCTTCTTCCATCGATTGGACTCTACAATCTTTGCTTCTCTTTTCAACAGATGCTTGTAATGATACATTCAATCACAATAGAATTGTGGGAGGTTTATGAATCTGGCTCGTCTTATCTTGATGATTTTGCATACTAATAAAAGCTTCTAGATTTTGGAAATAGTTATGGGTTCTGAACTTCTAATTCGAACTGGTATAAAAGTAGGTGGTATAGTCTCCATTCCAAAGCAAATGTACTTGAGAAATAATTTAGGAAACAGTTATCTGTAGATCGTATGAGAAAATAGGTGATCCCTTGCTGTTTTGCTCTGCACTACACCCATCATTCATTCCCAATTTTTGTAGTTAATTTCGTAAATGTTTAAACAAAGTGAATGGGAAAAGATTTAGCATGTGTAACTGGATAACGATTTTCTAGAACTTTTCCCTCACTCATTCTTTTGGTTTGAATTTTTCTGCAGCGGTGTGCAAACGAGTTCCGAAATCTATTATCTGAAGCCTTTACAGCAGAAGTAGTTGCTCAGGTATCGGAAGATGGAAAGACGGTTCTTAATTCATGGACGATGGCTAAACGAATCTTGAAGCCTGATCCCAGATATGGTAAAGTCCCAAGGAAGGAAAGGGAGGCACTTTGGCGTCGATATGCTGATGATACAGTGCGGAAGCAGAAGTTGGCAAATGATCACAAGGGAGAAAAATATAACGACTATAAGAATAGAGCAACCACCGACGCTGGAAAATTCCCTTCTAAACCAAGAATCCATGACTGA

mRNA sequence

ATGGTAGGAATATCAGGGCCATCTTCTGTTTTGAATTTGATCAAGAAGAAATTGCAAGACACTGGAACTCCTGTAGCTTCCTCACCTATTTCAGCTCCAACAACAGCTCAATTAGATCTAAATCTACCAAGAGATGTTGATGTTGCACTTAAGGCACTGCAAAAAGAGAACGGCAAAGATAAACGGAAATATGCTAATGCTGATGGAAATGTATCTGACTCCTCTTTGGACTCTGAAGACGTAGAAAGTGGGCCAACTGATGAGCAATTAATCATCCAGTTTAAGGAAGAGGATGGCACAGATCCTAAGAGTATTGCAGAAGTGGAAGCTGAAGAGGACGAGGATGATTTCATAATGGAGGAGGTAAAGAGGAGACTGAAGGACCTGAGGAGGAACAGTTTCATGGTTTTGATTCCAGAGGAAGAAGAAGAAGAAATCGAAGGAGGAGAAGAAGAAGAAGTAGGTGAAGGGGAGCCTGAGTGGAGAGACGTGGAAGCAGAAGGTCGACAATGGTGGGGAGGGTTTGGTGCTGTTTATGATGATTACTGTGAGAGGATGCGTTTCTTTGATCGCAAGAGCATTGAATCTGGTCCTGCATCAACCTCCCAAAGATCTGCATCGAAAAAGAGTGCATCCCCTCTTCGGTGTCTTTCTCTGAAGAGGATTGAAGAACCTGAAGACGAGATGGAGGATGTCGATCCTTCATTGACTCCGATTGACTCCAATCACCACATAGAAATAGCGTATGTTGCTCACATTTGCTTGTCCTGGGAGGCCCTTCACTGTCAGTACACTCAACTTAACCACTTAATATCATGCCAACCCCAAAACTCTACTACTCATTATAATCTTACTGCTCAGCTCTTTCAGCAATTTCAAGTCCTCTTGCAAAGGTTTATTGAAAATGAACCCTTTCAACAAGCTCTCAGGCCTACAATTTATGCCCGAACCCGTCGAACTTTTCCTAAAATGTTGCATGTTCCAAACATACAAGCTTCAGATCCAAATGGGGTGCAGGAACAGGAATCTGATTCCCTCATCCTCGCTCCTGACCTGCTGTTCATTATTGAGGCTTCAATCTTTACTTTCCACCGCTTCCTGAAGATGGAGAAGAAAACCTCAACTTCTGCTTCTTTATCATTTCGGAACCACACCCAGGATGCTGCTCTGCATGCTCGTGTTCGATCTTCTCTTGACAAGAAGAAAACAAAGCTGAAAGAGGTTAGGAAGAAAAGTAAAGGGTGGAAACAGAAAACGTGTCCCCAAACGTATGAAGACATGCAATTACTTTTTGGAGTTGTGGACATTAAAATCATAAGAAGGCTTCTTAAGATGTCGAGGATTACTAAAGAGCAGCTGCTTTGGTGCGAGGAGAAAATGAACAAGCCCGAGATCGCCATTCGCCAGAAAGAAAAAAAGAAGGAACAAAAAAAGAAGGAAAAAGAGAAACAAAGGAATTCTCACAAGACGACCACGAATAGGCGTTGCAAAAACGCGAACAGGAGAACGACGACCGGAGAACAGCCATTGCACTATCGTGCGAATATCGGAGGTTATCGGGCCTGTTACTGCTTAGCTACATTTCATTTCACCATGACTTCAGCTTCAACTGTCTCCCAATCTGTGTCACTACCTGCTCCTCCTACTTCAAATTCGGCTGCTAATGGTTCTTCAATTCCCAATTTGATCCCCTCCACTTCACCGGTTCCTCCTGCCCCATCTTTCCATATTCATCAACTACCGTCCCTAGCTCCGATGGTTCCTGGTCCACCGGGAATGTCACCGTCGATGCCACTTGTGTCCACGGGTCCGGCAGTTTTGTTTCCGCCCACCGATTCCGCTTCTACTATTCCAGGACCCAATATGCATGCAATTCATAACCCAATTCATCCTTCTGCTCGTCCACAAATTTGTGGCTCCTATCCTTCTCTAACTCCTGTTGTTTCTCCACCTCATGCGATGTGGTTTCAGCCTCCTCAGTTGGGAGTCATGCCCAGGCC
TCCCTTTCTGCCGTACTCTACTTCTTATCATGGCCCTCTTCCTTTTCCTGCTCGTGGAATGCCCCTTCCTTCTGTTCCATTGCCGGATCCTCAACCTCCCGGTGTTACCCCTGTTCCAGTTGCATCTGCCATTGCTGTGCCATCTGGTCATGGAAATCAGCTTATTGGCAATACATTGATTCAGACAGACTCAAATCATCCCGAACTTGATAGCCAGAAACACGCTCAAGGTGTTGGGCATTCTGAGAACATCTCTTTAAATAAGCACTCGGAGGATTGGACTGCCCACAAAACTGAGGCAGGAATAATTTATTACTATAATGCCTTGACCGGTGAATCTACCTATGAAAAACCTTCAGGTTTCAGAGGGGAGGCTGAAAATCTCATGGCGCAGGCAACATCAGTTTCAATGTCAAACTTGTCTGGTACAGATTGGGTTTTGGTTACTATGGGTGATGGTAAAAAGTACTACTACAACAACAAAACGAAGATTAGCAGTTGGCAAATCCCAAATGAAGTGTCTGAATTAAGGCAACAGAATGATGAAAAAACAAAAGAACTTTCTGCTCCTTTGCCAAATAACAATGCATCGACCGATCTAGGAACTTCCTCTAGCAGTATCAATACTCCTGCCATAAATACAGGTGGTCGTGAAGCCACACCTCTTAGAACAGTAGGAATATCAGGGTCATCTTCTGCTCTGGATTTGATCAAGAAAAAATTGCAAGACTCTGGAACTCCGGTAGCTTCCTCACCTATTTCTGCACCAACAGTAGCTCAATCAGATGTAAATCTGCCTAGAGATGCTGATGGTACAGTCAAGGCACTGCAGACTGAGAACAAAGATAAGCCAAAAGATGCCAATGCTGATGGAAATGTATCCGACTCTTCCTCAGACTCGGAGGATGTAGACAGTGGGCCAACTAATGAGCAATTAATTATCCAGTTTAAGGAAATGCTTAAGGAGCGAGGAGTGGCACCATTCTCTAAATGGGACAAGGAATTGCCGAAGATAGTTTTTGATCCCCGTTTTAAGGCTATTCCCAGTTATTCAGCTAGGCGGTCCTTGTTTGAACACTATGTTAAGACCCGTGCTGAGGAAGAACGCAAGGAAAAGAGAGCTGCTCAGAAGGCTGCAATAGAGGGATTTAAACAGTTACTGGATAGCGCATCTGAGGATATTGATCACACAACCAGTTATCAAACATTAAAAAAGAAATGGGGGAATGACTCGCGGTTTGAAGCTTTGGATCGTAAGGATCGGGAGAATTTATTGAACGAAAGGGTCCTTTGTCTGAAGAAGGCTGCCGTCGAAAAGGCACAAGCTTTATGGGCCGCTTCCACCACTAGTTTCAAGTCCATGCTGCAGGAGAGAGAAGATATCAACATCAATTCCCGTTGGTTCAGGGTAAAAGATAGTCTACGGGAAGATCCAAGATACAGATCTGTTAAGCATGAGGAGCGTGAGATGTTATTCAATGAGTATATATCTGAACTTAAGGCTGCCGTGGAGGAAAAGCAGCGTGAATCAAAAGCTAGAAAGGAGGAGCAGGAGAAACTGAAGGAAAGGGAGAGAGAATGGCGGAAAAGGAAGGAAAGAGAAGAGCAAGAAATGGAAAGGGTACGCCTAAAAGTACGGAAAAAGGAGGCAGTTGCATCTTTTCAAGCATTGCTTGTTGAATCGATCAAAGACCCTCAGGCCTCTTGGACCGAGTCAAAAGTTAAACTGGAGAAGGATCCACAAGGTCGTGCATCCAATACTGATTTAGATTCATCTGAGACAGAAAAATTATTCAGAGAACATGTAAAGATGCTGCAGGAGCGGTGTGCAAACGAGTTCCGAAATCTATTATCTGAAGCCTTTACAGCAGAAGTAGTTGCTCAGGTATCGGAAGATGGAAAGACGGTTCTTAATTCATGGACGATGGCTAAACGAATCTTGAAGCCTGATCCCAGATATGGTAAAGTCCCAAGGAAGGAAAGGGAGGCACTTTGGCGTC
GATATGCTGATGATACAGTGCGGAAGCAGAAGTTGGCAAATGATCACAAGGGAGAAAAATATAACGACTATAAGAATAGAGCAACCACCGACGCTGGAAAATTCCCTTCTAAACCAAGAATCCATGACTGA

Coding sequence (CDS)

ATGGTAGGAATATCAGGGCCATCTTCTGTTTTGAATTTGATCAAGAAGAAATTGCAAGACACTGGAACTCCTGTAGCTTCCTCACCTATTTCAGCTCCAACAACAGCTCAATTAGATCTAAATCTACCAAGAGATGTTGATGTTGCACTTAAGGCACTGCAAAAAGAGAACGGCAAAGATAAACGGAAATATGCTAATGCTGATGGAAATGTATCTGACTCCTCTTTGGACTCTGAAGACGTAGAAAGTGGGCCAACTGATGAGCAATTAATCATCCAGTTTAAGGAAGAGGATGGCACAGATCCTAAGAGTATTGCAGAAGTGGAAGCTGAAGAGGACGAGGATGATTTCATAATGGAGGAGGTAAAGAGGAGACTGAAGGACCTGAGGAGGAACAGTTTCATGGTTTTGATTCCAGAGGAAGAAGAAGAAGAAATCGAAGGAGGAGAAGAAGAAGAAGTAGGTGAAGGGGAGCCTGAGTGGAGAGACGTGGAAGCAGAAGGTCGACAATGGTGGGGAGGGTTTGGTGCTGTTTATGATGATTACTGTGAGAGGATGCGTTTCTTTGATCGCAAGAGCATTGAATCTGGTCCTGCATCAACCTCCCAAAGATCTGCATCGAAAAAGAGTGCATCCCCTCTTCGGTGTCTTTCTCTGAAGAGGATTGAAGAACCTGAAGACGAGATGGAGGATGTCGATCCTTCATTGACTCCGATTGACTCCAATCACCACATAGAAATAGCGTATGTTGCTCACATTTGCTTGTCCTGGGAGGCCCTTCACTGTCAGTACACTCAACTTAACCACTTAATATCATGCCAACCCCAAAACTCTACTACTCATTATAATCTTACTGCTCAGCTCTTTCAGCAATTTCAAGTCCTCTTGCAAAGGTTTATTGAAAATGAACCCTTTCAACAAGCTCTCAGGCCTACAATTTATGCCCGAACCCGTCGAACTTTTCCTAAAATGTTGCATGTTCCAAACATACAAGCTTCAGATCCAAATGGGGTGCAGGAACAGGAATCTGATTCCCTCATCCTCGCTCCTGACCTGCTGTTCATTATTGAGGCTTCAATCTTTACTTTCCACCGCTTCCTGAAGATGGAGAAGAAAACCTCAACTTCTGCTTCTTTATCATTTCGGAACCACACCCAGGATGCTGCTCTGCATGCTCGTGTTCGATCTTCTCTTGACAAGAAGAAAACAAAGCTGAAAGAGGTTAGGAAGAAAAGTAAAGGGTGGAAACAGAAAACGTGTCCCCAAACGTATGAAGACATGCAATTACTTTTTGGAGTTGTGGACATTAAAATCATAAGAAGGCTTCTTAAGATGTCGAGGATTACTAAAGAGCAGCTGCTTTGGTGCGAGGAGAAAATGAACAAGCCCGAGATCGCCATTCGCCAGAAAGAAAAAAAGAAGGAACAAAAAAAGAAGGAAAAAGAGAAACAAAGGAATTCTCACAAGACGACCACGAATAGGCGTTGCAAAAACGCGAACAGGAGAACGACGACCGGAGAACAGCCATTGCACTATCGTGCGAATATCGGAGGTTATCGGGCCTGTTACTGCTTAGCTACATTTCATTTCACCATGACTTCAGCTTCAACTGTCTCCCAATCTGTGTCACTACCTGCTCCTCCTACTTCAAATTCGGCTGCTAATGGTTCTTCAATTCCCAATTTGATCCCCTCCACTTCACCGGTTCCTCCTGCCCCATCTTTCCATATTCATCAACTACCGTCCCTAGCTCCGATGGTTCCTGGTCCACCGGGAATGTCACCGTCGATGCCACTTGTGTCCACGGGTCCGGCAGTTTTGTTTCCGCCCACCGATTCCGCTTCTACTATTCCAGGACCCAATATGCATGCAATTCATAACCCAATTCATCCTTCTGCTCGTCCACAAATTTGTGGCTCCTATCCTTCTCTAACTCCTGTTGTTTCTCCACCTCATGCGATGTGGTTTCAGCCTCCTCAGTTGGGAGTCATGCCCAGGCC
TCCCTTTCTGCCGTACTCTACTTCTTATCATGGCCCTCTTCCTTTTCCTGCTCGTGGAATGCCCCTTCCTTCTGTTCCATTGCCGGATCCTCAACCTCCCGGTGTTACCCCTGTTCCAGTTGCATCTGCCATTGCTGTGCCATCTGGTCATGGAAATCAGCTTATTGGCAATACATTGATTCAGACAGACTCAAATCATCCCGAACTTGATAGCCAGAAACACGCTCAAGGTGTTGGGCATTCTGAGAACATCTCTTTAAATAAGCACTCGGAGGATTGGACTGCCCACAAAACTGAGGCAGGAATAATTTATTACTATAATGCCTTGACCGGTGAATCTACCTATGAAAAACCTTCAGGTTTCAGAGGGGAGGCTGAAAATCTCATGGCGCAGGCAACATCAGTTTCAATGTCAAACTTGTCTGGTACAGATTGGGTTTTGGTTACTATGGGTGATGGTAAAAAGTACTACTACAACAACAAAACGAAGATTAGCAGTTGGCAAATCCCAAATGAAGTGTCTGAATTAAGGCAACAGAATGATGAAAAAACAAAAGAACTTTCTGCTCCTTTGCCAAATAACAATGCATCGACCGATCTAGGAACTTCCTCTAGCAGTATCAATACTCCTGCCATAAATACAGGTGGTCGTGAAGCCACACCTCTTAGAACAGTAGGAATATCAGGGTCATCTTCTGCTCTGGATTTGATCAAGAAAAAATTGCAAGACTCTGGAACTCCGGTAGCTTCCTCACCTATTTCTGCACCAACAGTAGCTCAATCAGATGTAAATCTGCCTAGAGATGCTGATGGTACAGTCAAGGCACTGCAGACTGAGAACAAAGATAAGCCAAAAGATGCCAATGCTGATGGAAATGTATCCGACTCTTCCTCAGACTCGGAGGATGTAGACAGTGGGCCAACTAATGAGCAATTAATTATCCAGTTTAAGGAAATGCTTAAGGAGCGAGGAGTGGCACCATTCTCTAAATGGGACAAGGAATTGCCGAAGATAGTTTTTGATCCCCGTTTTAAGGCTATTCCCAGTTATTCAGCTAGGCGGTCCTTGTTTGAACACTATGTTAAGACCCGTGCTGAGGAAGAACGCAAGGAAAAGAGAGCTGCTCAGAAGGCTGCAATAGAGGGATTTAAACAGTTACTGGATAGCGCATCTGAGGATATTGATCACACAACCAGTTATCAAACATTAAAAAAGAAATGGGGGAATGACTCGCGGTTTGAAGCTTTGGATCGTAAGGATCGGGAGAATTTATTGAACGAAAGGGTCCTTTGTCTGAAGAAGGCTGCCGTCGAAAAGGCACAAGCTTTATGGGCCGCTTCCACCACTAGTTTCAAGTCCATGCTGCAGGAGAGAGAAGATATCAACATCAATTCCCGTTGGTTCAGGGTAAAAGATAGTCTACGGGAAGATCCAAGATACAGATCTGTTAAGCATGAGGAGCGTGAGATGTTATTCAATGAGTATATATCTGAACTTAAGGCTGCCGTGGAGGAAAAGCAGCGTGAATCAAAAGCTAGAAAGGAGGAGCAGGAGAAACTGAAGGAAAGGGAGAGAGAATGGCGGAAAAGGAAGGAAAGAGAAGAGCAAGAAATGGAAAGGGTACGCCTAAAAGTACGGAAAAAGGAGGCAGTTGCATCTTTTCAAGCATTGCTTGTTGAATCGATCAAAGACCCTCAGGCCTCTTGGACCGAGTCAAAAGTTAAACTGGAGAAGGATCCACAAGGTCGTGCATCCAATACTGATTTAGATTCATCTGAGACAGAAAAATTATTCAGAGAACATGTAAAGATGCTGCAGGAGCGGTGTGCAAACGAGTTCCGAAATCTATTATCTGAAGCCTTTACAGCAGAAGTAGTTGCTCAGGTATCGGAAGATGGAAAGACGGTTCTTAATTCATGGACGATGGCTAAACGAATCTTGAAGCCTGATCCCAGATATGGTAAAGTCCCAAGGAAGGAAAGGGAGGCACTTTGGCGTC
GATATGCTGATGATACAGTGCGGAAGCAGAAGTTGGCAAATGATCACAAGGGAGAAAAATATAACGACTATAAGAATAGAGCAACCACCGACGCTGGAAAATTCCCTTCTAAACCAAGAATCCATGACTGA

Protein sequence

MVGISGPSSVLNLIKKKLQDTGTPVASSPISAPTTAQLDLNLPRDVDVALKALQKENGKDKRKYANADGNVSDSSLDSEDVESGPTDEQLIIQFKEEDGTDPKSIAEVEAEEDEDDFIMEEVKRRLKDLRRNSFMVLIPEEEEEEIEGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKSIESGPASTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNHHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFIENEPFQQALRPTIYARTRRTFPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSTSASLSFRNHTQDAALHARVRSSLDKKKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIIRRLLKMSRITKEQLLWCEEKMNKPEIAIRQKEKKKEQKKKEKEKQRNSHKTTTNRRCKNANRRTTTGEQPLHYRANIGGYRACYCLATFHFTMTSASTVSQSVSLPAPPTSNSAANGSSIPNLIPSTSPVPPAPSFHIHQLPSLAPMVPGPPGMSPSMPLVSTGPAVLFPPTDSASTIPGPNMHAIHNPIHPSARPQICGSYPSLTPVVSPPHAMWFQPPQLGVMPRPPFLPYSTSYHGPLPFPARGMPLPSVPLPDPQPPGVTPVPVASAIAVPSGHGNQLIGNTLIQTDSNHPELDSQKHAQGVGHSENISLNKHSEDWTAHKTEAGIIYYYNALTGESTYEKPSGFRGEAENLMAQATSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKTKELSAPLPNNNASTDLGTSSSSINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGTPVASSPISAPTVAQSDVNLPRDADGTVKALQTENKDKPKDANADGNVSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTLKKKWGNDSRFEALDRKDRENLLNERVLCLKKAAVEKAQALWAASTTSFKSMLQEREDININSRWFRVKDSLREDPRYRSVKHEEREMLFNEYISELKAAVEEKQRESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIKDPQASWTESKVKLEKDPQGRASNTDLDSSETEKLFREHVKMLQERCANEFRNLLSEAFTAEVVAQVSEDGKTVLNSWTMAKRILKPDPRYGKVPRKEREALWRRYADDTVRKQKLANDHKGEKYNDYKNRATTDAGKFPSKPRIHD*
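
The protein sequence above is expected to be the translation of the CDS. A minimal sketch of that translation is shown here, assuming the standard nuclear genetic code and reading frame 0; the function name `translate` is hypothetical, and the two short inputs are the first 27 and last 12 nucleotides of the CDS as listed on this page.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop codon.

    Codons containing characters other than A, C, G, T become 'X', and
    trailing bases that do not fill a whole codon are ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


print(translate("ATGGTAGGAATATCAGGGCCATCTTCT"))  # start of the CDS
print(translate("ATCCATGACTGA"))                 # end of the CDS
```

The two fragments translate to `MVGISGPSS` and `IHD*`, which agree with the start and end of the listed protein sequence.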
Homology
BLAST of Chy1G021000 vs. ExPASy Swiss-Prot
Match: Q9LT25 (Pre-mRNA-processing protein 40C OS=Arabidopsis thaliana OX=3702 GN=PRP40C PE=1 SV=1)

HSP 1 Score: 693.7 bits (1789), Expect = 4.1e-198
Identity = 427/831 (51.38%), Positives = 554/831 (66.67%), Query Frame = 0

Query: 531  TMTSAST--VSQSVSLPAPPTSNSAANGSSIPNLIPSTSPVPPAPSFHIHQLPSLAPMVP 590
            +M+ AST  VSQSV         + A  SS  N IP  SP+       +   P   P   
Sbjct: 48   SMSIASTGFVSQSVPYSVTAQWGTNAAASSNVNPIPQASPM-------LANAPFGRPGTL 107

Query: 591  GPPGMSPSMPLVSTGPAVLFPPTDSASTIPGPNMHAIHNPIHPSARPQICGSYPSLTPVV 650
             PPG+  S P         FP ++  ST P P M A    ++P   P +   YP    + 
Sbjct: 108  APPGLMTSPP--------AFPGSNPFSTTPRPGMSAGPAQMNPGIHPHM---YPPYHSLP 167

Query: 651  SPPHAMWFQPPQLGVMPRPPFLPYSTSYHGPLPFPARGMPLPSVPLPDPQPPGVTPV-PV 710
              P  MW QPP +G +PR PFL + T++ G  PFP RG+  P++P     P G +P+  V
Sbjct: 168  GTPQGMWLQPPSMGGIPRAPFLSHPTTFPGSYPFPVRGIS-PNLPYSGSHPLGASPMGSV 227

Query: 711  ASAIAVPSGHGNQLIGNTLIQTDSNHPELDSQKHAQGVGHSENISLNKHSEDWTAHKTEA 770
             +  A+P    +   G    +       +D +  +Q VG+          + WTAHK+EA
Sbjct: 228  GNVHALPGRQPDISPGRKTEELSG----IDDRAGSQLVGN--------RLDAWTAHKSEA 287

Query: 771  GIIYYYNALTGESTYEKPSGFRGEAENLMAQATSVSMSNLSGTDWVLVTMGDGKKYYYNN 830
            G++YYYN++TG+STYEKP GF GE + +  Q   VSM +L GTDW LV+  DGKKYYYNN
Sbjct: 288  GVLYYYNSVTGQSTYEKPPGFGGEPDKVPVQPIPVSMESLPGTDWALVSTNDGKKYYYNN 347

Query: 831  KTKISSWQIPNEVSELRQQNDEKTKELSAPLPNNNASTDLGTSSSSINTPAINTGGREAT 890
            KTK+SSWQIP EV +  ++ +E+  E  A +P+ +  T+ G+  +S++ PAI+ GGR+A 
Sbjct: 348  KTKVSSWQIPAEVKDFGKKLEERAMESVASVPSADL-TEKGSDLTSLSAPAISNGGRDAA 407

Query: 891  PLRTVGISGSSSALDLIKKKLQDSGTPVASSPISAPTVAQSDVNLPRDADGTVKALQTEN 950
             L+T      SSALDL+KKKL DSG PV+S+         S+ N  +  + T       +
Sbjct: 408  SLKTTNF--GSSALDLVKKKLHDSGMPVSST-------ITSEANSGKTTEVTPSGESGNS 467

Query: 951  KDKPKDANADGNVSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVF 1010
              K KDA   G +SDSSSDSED DSGP+ E+   QFKEMLKERG+APFSKW+KELPKI+F
Sbjct: 468  TGKVKDAPGAGALSDSSSDSEDEDSGPSKEECSKQFKEMLKERGIAPFSKWEKELPKIIF 527

Query: 1011 DPRFKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSY 1070
            DPRFKAIPS+S RRSLFE YVKTRAEEER+EKRAA KAAIEGF+QLLD AS DID  T Y
Sbjct: 528  DPRFKAIPSHSVRRSLFEQYVKTRAEEERREKRAAHKAAIEGFRQLLDDASTDIDQHTDY 587

Query: 1071 QTLKKKWGNDSRFEALDRKDRENLLNERVLCLKKAAVEKAQALWAASTTSFKSMLQERED 1130
            +  KKKWGND RFEA++RK+RE LLNERVL LK++A +KAQ + AA+ + FK+ML+ERE 
Sbjct: 588  RAFKKKWGNDLRFEAIERKEREGLLNERVLSLKRSAEQKAQEIRAAAASDFKTMLRERE- 647

Query: 1131 ININSRWFRVKDSLREDPRYRSVKHEEREMLFNEYISELKAAVEEKQRESKARKEEQEKL 1190
            I+INS W +VKDSLR +PRYRSV HE+RE+ + EYI+ELKAA      E KAR +E++KL
Sbjct: 648  ISINSHWSKVKDSLRNEPRYRSVAHEDREVFYYEYIAELKAAQRGDDHEMKAR-DEEDKL 707

Query: 1191 KEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIKDPQASWTESKVKLEKDP 1250
            +ERERE RKRKERE QE+ERVR K+R+KEA +S+QALLVE I+DP+ASWTESK  LE+DP
Sbjct: 708  RERERELRKRKEREVQEVERVRQKIRRKEASSSYQALLVEKIRDPEASWTESKPILERDP 767

Query: 1251 QGRASNTDLDSSETEKLFREHVKMLQERCANEFRNLLSEAFTAEVVAQVSEDGKTVLNSW 1310
            Q RASN DL+ ++ EKLFR+HVK L ERC ++F+ LL+EA ++E     +EDGKT LNSW
Sbjct: 768  QKRASNPDLEPADKEKLFRDHVKSLYERCVHDFKALLAEALSSEAATLQTEDGKTALNSW 827

Query: 1311 TMAKRILKPDPRYGKVPRKEREALWRRYADDTVRKQKLANDHKGEKYNDYK 1359
            + AK++LKPD RY K+PR++RE +WRRY +D  RKQ+  N ++ EK  DYK
Sbjct: 828  STAKQVLKPDIRYSKMPRQDREVVWRRYVEDISRKQRHEN-YQEEKQRDYK 834
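
The identity and positives percentages in the HSP summary for the Q9LT25 hit are the match counts divided by the alignment length. The sketch below recomputes them from the summary line; the function name `hsp_percentages` is hypothetical, and the regular expression simply assumes the `label = matches/length` layout used in this report.

```python
import re


def hsp_percentages(summary_line):
    """Recompute percentages from 'label = matches/length' fields.

    Returns a dict mapping each label to a percentage rounded to two
    decimals. Fields without a 'matches/length' value are skipped.
    """
    result = {}
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", summary_line):
        result[label] = round(100.0 * int(matches) / int(length), 2)
    return result


line = "Identity = 427/831 (51.38%), Positives = 554/831 (66.67%), Query Frame = 0"
print(hsp_percentages(line))
```

For this hit the recomputed values are 51.38 and 66.67, matching the percentages printed in the report.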

BLAST of Chy1G021000 vs. ExPASy Swiss-Prot
Match: Q8CGF7 (Transcription elongation regulator 1 OS=Mus musculus OX=10090 GN=Tcerg1 PE=1 SV=2)

HSP 1 Score: 179.1 bits (453), Expect = 3.4e-43
Identity = 232/866 (26.79%), Positives = 369/866 (42.61%), Query Frame = 0

Query: 535  ASTVSQSVSLPAPPTSNSA-ANGSSIPNLIPSTSPVPPAPSFHIHQLPSLAPMVPGPPGM 594
            A   +Q+V  P P TS+ A A  +S P   PS++      +  + Q  S        P  
Sbjct: 251  AQVQAQAVGAPTPTTSSPAPAVSTSTPTSTPSSTTATTTTATSVAQTVSTPTTQDQTPSS 310

Query: 595  SPSMPLVSTGPAVLFPPTDSASTIPGPNMHAIHNPIHPSARPQICGSYPSLTPVVSPPHA 654
            + S+   +   +   P      T+P P+   +  P  P + PQ   + P+  PV+ PP  
Sbjct: 311  AVSVATPTVSVSAPAPTATPVQTVPQPHPQTL-PPAVPHSVPQPAAAIPAFPPVMVPP-- 370

Query: 655  MWFQPPQLGV-MPRPPF-------LPY----STSYHGPLPFPARGMPLPSVPLPDPQPP- 714
              F+ P  G+ +P P          PY    +T+  G LP    GM  P VP+  PQ   
Sbjct: 371  --FRVPLPGMPIPLPGVAMMQIVSCPYVKTVATTKTGVLP----GMAPPIVPMIHPQVAI 430

Query: 715  GVTPVPVASAIAVPSGHGNQLIGNTLIQTDSNHPELDSQKHAQGVGHSENISLNKHSEDW 774
              +P  +A A AV                                             +W
Sbjct: 431  AASPATLAGATAV--------------------------------------------SEW 490

Query: 775  TAHKTEAGIIYYYNALTGESTYEKPSGFR------------------------------- 834
            T +KT  G  YYYN  T EST+EKP   +                               
Sbjct: 491  TEYKTADGKTYYYNNRTLESTWEKPQELKEKEKLDEKIKEPIKEASEEPLPMETEEEDPK 550

Query: 835  ----------------GEAENLMAQATSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKISS 894
                             E E    +A  V+ + + GT W +V  GD + ++YN  T++S 
Sbjct: 551  EEPVKEIKEEPKEEEMTEEEKAAQKAKPVATTPIPGTPWCVVWTGDERVFFYNPTTRLSM 610

Query: 895  WQIPNEVSELRQQNDEKTKELSAPLPNNNASTDLGTSSSSINTPAINTGGREATPLRTVG 954
            W  P+++                      A  D       I  P    G  +   LR   
Sbjct: 611  WDRPDDLI-------------------GRADVD-----KIIQEPPHKKGLEDMKKLR--- 670

Query: 955  ISGSSSALDLIKKKLQDSGTPVASSPISAPTVAQSDVNLPRDADGTVKALQTENKDKPKD 1014
               + + L + K +        + S I        ++N     D  +KA     K + +D
Sbjct: 671  -HPAPTMLSIQKWQF-------SMSAIKEEQELMEEMN----EDEPIKA-----KKRKRD 730

Query: 1015 ANADGNVSDSSSDSEDVDSGPTNEQLII-------QFKEMLKERGVAPFSKWDKELPKIV 1074
             N D  +      + + +     E+ I+       QFK+ML ERGV+ FS W+KEL KIV
Sbjct: 731  DNKD--IDSEKEAAMEAEIKAARERAIVPLEARMKQFKDMLLERGVSAFSTWEKELHKIV 790

Query: 1075 FDPRFKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTS 1134
            FDPR+  + +   R+ +F+ YVKTRAEEER+EK+     A E FK++++ A    +   +
Sbjct: 791  FDPRYLLL-NPKERKQVFDQYVKTRAEEERREKKNKIMQAKEDFKKMMEEAK--FNPRAT 850

Query: 1135 YQTLKKKWGNDSRFEALDR-KDRENLLNERVLCLKKAAVEKAQALWAASTTSFKSMLQER 1194
            +     K   DSRF+A+++ KDRE L NE V   +K   E ++       + F  +L   
Sbjct: 851  FSEFAAKHAKDSRFKAIEKMKDREALFNEFVAAARKKEKEDSKTRGEKIKSDFFELLSNH 910

Query: 1195 EDININSRWFRVKDSLREDPRYRSVKHEE-REMLFNEYISEL-KAAVEEKQRESKARKEE 1254
              ++  SRW +VKD +  DPRY++V     RE LF +YI ++ K    EK++E + +   
Sbjct: 911  H-LDSQSRWSKVKDKVESDPRYKAVDSSSMREDLFKQYIEKIAKNLDSEKEKELERQARI 970

Query: 1255 QEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIKDPQASWTESKVKL 1314
            +  L+ERERE +K +  + +E++R R + +++EA+ +F+ALL + ++    SW++++  L
Sbjct: 971  EASLREREREVQKARSEQTKEIDREREQHKREEAIQNFKALLSDMVRSSDVSWSDTRRTL 1001

Query: 1315 EKDPQGRASNTDLDSSETEKLFREHVKMLQERCANEFRNLLSEAFTAEVVAQVSEDGKTV 1330
             KD +   S + L+  E EKLF EH++ L ++    FR LL E               T+
Sbjct: 1031 RKDHRWE-SGSLLEREEKEKLFNEHIEALTKKKREHFRQLLDET-----------SAITL 1001

BLAST of Chy1G021000 vs. ExPASy Swiss-Prot
Match: O14776 (Transcription elongation regulator 1 OS=Homo sapiens OX=9606 GN=TCERG1 PE=1 SV=2)

HSP 1 Score: 174.5 bits (441), Expect = 8.4e-42
Identity = 236/865 (27.28%), Positives = 368/865 (42.54%), Query Frame = 0

Query: 535  ASTVSQSVSLPAPPTSNSAANGSSIPNLIPSTSPVPPAPSFHIHQLPSLAPMVPGPPGMS 594
            AST + S   PA  TS S++  SS  +   + + V    S      P+     P    +S
Sbjct: 258  ASTPTTSSPAPAVSTSTSSSTPSSTTSTTTTATSVAQTVS-----TPTTQDQTPS-SAVS 317

Query: 595  PSMPLVSTGPAVLFPPTDSASTIPGPNMHAIHNPIHPSARPQICGSYPSLTPVVSPPHAM 654
             + P VS       P      T+P P+   +  P  P + PQ   + P+  PV+ PP   
Sbjct: 318  VATPTVSVSTPA--PTATPVQTVPQPHPQTL-PPAVPHSVPQPTTAIPAFPPVMVPP--- 377

Query: 655  WFQPPQLGV-MPRPPF-------LPY----STSYHGPLPFPARGMPLPSVPLPDPQPP-G 714
             F+ P  G+ +P P          PY    +T+  G LP    GM  P VP+  PQ    
Sbjct: 378  -FRVPLPGMPIPLPGVAMMQIVSCPYVKTVATTKTGVLP----GMAPPIVPMIHPQVAIA 437

Query: 715  VTPVPVASAIAVPSGHGNQLIGNTLIQTDSNHPELDSQKHAQGVGHSENISLNKHSEDWT 774
             +P  +A A AV                                             +WT
Sbjct: 438  ASPATLAGATAV--------------------------------------------SEWT 497

Query: 775  AHKTEAGIIYYYNALTGESTYEKPSGFR-------------------------------- 834
             +KT  G  YYYN  T EST+EKP   +                                
Sbjct: 498  EYKTADGKTYYYNNRTLESTWEKPQELKEKEKLEEKIKEPIKEPSEEPLPMETEEEDPKE 557

Query: 835  ---------------GEAENLMAQATSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKISSW 894
                            E E    +A  V+ + + GT W +V  GD + ++YN  T++S W
Sbjct: 558  EPIKEIKEEPKEEEMTEEEKAAQKAKPVATAPIPGTPWCVVWTGDERVFFYNPTTRLSMW 617

Query: 895  QIPNEVSELRQQNDEKTKELSAPLPNNNASTDLGTSSSSINTPAINTGGREATPLRTVGI 954
              P+++                      A  D       I  P    G  E   LR    
Sbjct: 618  DRPDDLI-------------------GRADVD-----KIIQEPPHKKGMEELKKLR---- 677

Query: 955  SGSSSALDLIKKKLQDSGTPVASSPISAPTVAQSDVNLPRDADGTVKALQTENKDKPKDA 1014
              + + L + K +        + S I        ++N     D  VKA     K + +D 
Sbjct: 678  HPTPTMLSIQKWQF-------SMSAIKEEQELMEEIN----EDEPVKA-----KKRKRDD 737

Query: 1015 NADGNVSDSSSDSEDVDSGPTNEQLII-------QFKEMLKERGVAPFSKWDKELPKIVF 1074
            N D  +      + + +     E+ I+       QFK+ML ERGV+ FS W+KEL KIVF
Sbjct: 738  NKD--IDSEKEAAMEAEIKAARERAIVPLEARMKQFKDMLLERGVSAFSTWEKELHKIVF 797

Query: 1075 DPRFKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSY 1134
            DPR+  + +   R+ +F+ YVKTRAEEER+EK+     A E FK++++ A    +   ++
Sbjct: 798  DPRYLLL-NPKERKQVFDQYVKTRAEEERREKKNKIMQAKEDFKKMMEEAK--FNPRATF 857

Query: 1135 QTLKKKWGNDSRFEALDR-KDRENLLNERVLCLKKAAVEKAQALWAASTTSFKSMLQERE 1194
                 K   DSRF+A+++ KDRE L NE V   +K   E ++       + F  +L    
Sbjct: 858  SEFAAKHAKDSRFKAIEKMKDREALFNEFVAAARKKEKEDSKTRGEKIKSDFFELLSNHH 917

Query: 1195 DININSRWFRVKDSLREDPRYRSVKHEE-REMLFNEYISEL-KAAVEEKQRESKARKEEQ 1254
             ++  SRW +VKD +  DPRY++V     RE LF +YI ++ K    EK++E + +   +
Sbjct: 918  -LDSQSRWSKVKDKVESDPRYKAVDSSSMREDLFKQYIEKIAKNLDSEKEKELERQARIE 977

Query: 1255 EKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIKDPQASWTESKVKLE 1314
              L+ERERE +K +  + +E++R R + +++EA+ +F+ALL + ++    SW++++  L 
Sbjct: 978  ASLREREREVQKARSEQTKEIDREREQHKREEAIQNFKALLSDMVRSSDVSWSDTRRTLR 999

Query: 1315 KDPQGRASNTDLDSSETEKLFREHVKMLQERCANEFRNLLSEAFTAEVVAQVSEDGKTVL 1330
            KD +   S + L+  E EKLF EH++ L ++    FR LL E               T+ 
Sbjct: 1038 KDHRWE-SGSLLEREEKEKLFNEHIEALTKKKREHFRQLLDET-----------SAITLT 999

BLAST of Chy1G021000 vs. ExPASy Swiss-Prot
Match: B6EUA9 (Pre-mRNA-processing protein 40A OS=Arabidopsis thaliana OX=3702 GN=PRP40A PE=1 SV=1)

HSP 1 Score: 107.1 bits (266), Expect = 1.6e-21
Identity = 193/783 (24.65%), Positives = 336/783 (42.91%), Query Frame = 0

Query: 585  PMVPG-------PPGMSPSMPLVSTGPAVLFPPTDSASTIPGPNMHAIH--NPIHPSARP 644
            PMVPG       P    P  P     P V   P   +  I    +  +    P+H ++  
Sbjct: 15   PMVPGQQGQHFVPAASQPFHPYGHVPPNVQSQPPQYSQPIQQQQLFPVRPGQPVHITSSS 74

Query: 645  Q-ICGSYPSLTPVVSPPHAMWFQPPQLGVMPRPPFLPYSTSYHGPLPFPARGMPL---PS 704
            Q +   Y     +++         PQ    P   F      +  P  F     P     S
Sbjct: 75   QAVSVPYIQTNKILTSGSTQ----PQPNAPPMTGFATSGPPFSSPYTFVPSSYPQQQPTS 134

Query: 705  VPLPDPQPPGVTPVPVASAIAVPSGHGNQLIGNTLIQTDSNHPELDSQKHAQGVGHSENI 764
            +  P+ Q       P A+   VP      L+ + + QT    P   S         S   
Sbjct: 135  LVQPNSQMHVAGVPPAANTWPVPVNQSTSLV-SPVQQTGQQTPVAVSTDPGNLTPQS--- 194

Query: 765  SLNKHSEDWTAHKTEAGIIYYYNALTGESTYEKPSGFRGEAENLMAQATSVSMSNLSGTD 824
                 + DW  H +  G  YYYN  T +S +EKP       E   A A++V         
Sbjct: 195  -----ASDWQEHTSADGRKYYYNKRTKQSNWEKPLELMTPLER--ADASTV--------- 254

Query: 825  WVLVTMGDGKKYYYNNKTKISSWQIPNEVSELRQQ---NDEKT---KELSAPLPNNNA-S 884
            W   T  +GKKYYYN  TK S W IP ++   R+Q     EKT   +  S PL ++ A S
Sbjct: 255  WKEFTTPEGKKYYYNKVTKESKWTIPEDLKLAREQAQLASEKTSLSEAGSTPLSHHAASS 314

Query: 885  TDLGTSSSSINTPAINTG--GREATPLRTVGISGSSSALDLIKKKLQDSGTPVASSPISA 944
            +DL  S+ +   P+ ++   G  ++P++  G++   +    +      SG   A S   A
Sbjct: 315  SDLAVSTVTSVVPSTSSALTGHSSSPIQ-AGLAVPVTRPPSVAPVTPTSG---AISDTEA 374

Query: 945  PTVAQSDVNLPRDAD-----GTVKALQTENKDKPKDANADGNVSDSSSDSEDVDSGPTNE 1004
             T+   +++  R AD      T +  + ENK+   +  A+ + +   ++ E+     T +
Sbjct: 375  TTIKGDNLS-SRGADDSNDGATAQNNEAENKEMSVNGKANLSPAGDKANVEEPMVYATKQ 434

Query: 1005 QLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKAIPSYSARRSLFEHYVKTRAEEERK 1064
            +    FK +L+   V     W++ L +IV D R+ A+ +   R+  F  Y+  R + E +
Sbjct: 435  EAKAAFKSLLESVNVHSDWTWEQTLKEIVHDKRYGALRTLGERKQAFNEYLGQRKKVEAE 494

Query: 1065 EKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTLKKKWGNDSRFEALDR-KDRENLLNERV 1124
            E+R  QK A E F ++L+   E++  +  +      + ND RF+A+DR +DRE+L +  +
Sbjct: 495  ERRRRQKKAREEFVKMLEEC-EELSSSLKWSKAMSLFENDQRFKAVDRPRDREDLFDNYI 554

Query: 1125 LCLKKAAVEKAQALWAASTTSFKSMLQEREDININSRWFRVKDSLREDPRYRSVKHEERE 1184
            + L++   EKA          ++  L+  + I   ++W +++D L +D R   ++  +R 
Sbjct: 555  VELERKEREKAAEEHRQYMADYRKFLETCDYIKAGTQWRKIQDRLEDDDRCSCLEKIDRL 614

Query: 1185 MLFNEYISELKAAVEEKQRESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKE 1244
            + F EYI +L              ++E+E+LK  E+E  +R ER+ ++  R  L    +E
Sbjct: 615  IGFEEYILDL--------------EKEEEELKRVEKEHVRRAERKNRDAFRTLL----EE 674

Query: 1245 AVASFQALLVESIKDPQASWTESKVKLEKDPQGRASNTDLDSSETEKLFREHVKMLQERC 1304
             VA+        I   +  W +  ++L+  PQ +A  ++   S  + LF +  + L E+ 
Sbjct: 675  HVAA-------GILTAKTYWLDYCIELKDLPQYQAVASNTSGSTPKDLFEDVTEEL-EKQ 734

Query: 1305 ANEFRNLLSEAFTAEVVAQVS----EDGKTVLNSWTMAKRI------LKPDPRYGKVPRK 1330
             +E ++ + +A  +  ++ VS    ED K+ ++     ++I      L  D   G+V  K
Sbjct: 735  YHEDKSYVKDAMKSRKISMVSSWLFEDFKSAISEDLSTQQISDINLKLIYDDLVGRVKEK 741

BLAST of Chy1G021000 vs. ExPASy Swiss-Prot
Match: Q6NWY9 (Pre-mRNA-processing factor 40 homolog B OS=Homo sapiens OX=9606 GN=PRPF40B PE=1 SV=1)

HSP 1 Score: 96.7 bits (239), Expect = 2.2e-18
Identity = 133/552 (24.09%), Positives = 235/552 (42.57%), Query Frame = 0

Query: 664  MPRPPFLPYSTSYHGPLPFPARGMPLPS--VPLPDPQPPGVTPVPVASAIAVP------S 723
            M  PPF+P       P PFP  G+P  S   P   P PPG+ P P+   +  P       
Sbjct: 1    MMPPPFMPPPGI---PPPFPPMGLPPMSQRPPAIPPMPPGILP-PMLPPMGAPPPLTQIP 60

Query: 724  GHGNQLIGNTLIQ----TDSNHPELDSQKHA-QGVGHSENISLNKHSEDWTAHKTEAGII 783
            G    ++   L+     T +  P  D+   A  G G    +        W+ H    G I
Sbjct: 61   GMVPPMMPGMLMPAVPVTAATAPGADTASSAVAGTGPPRAL--------WSEHVAPDGRI 120

Query: 784  YYYNALTGESTYEKPSGFRGEAENLMAQATSVSMSNLSGTDWVLVTMGDGKKYYYNNKTK 843
            YYYNA   +S +EKPS  + +AE L++Q             W       GK YYYNN++K
Sbjct: 121  YYYNADDKQSVWEKPSVLKSKAELLLSQC-----------PWKEYKSDTGKPYYYNNQSK 180

Query: 844  ISSWQIPNEVSEL-----RQQNDEKTKELSAPLPNNNASTDLGTSSSSINTPAINTGGRE 903
             S W  P ++ +L     ++   ++ ++L   L                    + TG  E
Sbjct: 181  ESRWTRPKDLDDLEVLVKQEAAGKQQQQLPQTLQPQPPQPQPDPPPVPPGPTPVPTGLLE 240

Query: 904  ATPLRTVGISGSSSALDLIKKKLQDSGTPVASSPISAPTVAQSDVNLPRDADGTVKALQT 963
              P       G S   D+++          A+ P+    + Q +      + G  +  Q 
Sbjct: 241  PEP-------GGSEDCDVLE----------ATQPLEQGFLQQLEEG--PSSSGQHQPQQE 300

Query: 964  ENKDKPKDANADGNVSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKI 1023
            E + KP+   +  + S+              E+    FKE+L+++ V   + W++ +  +
Sbjct: 301  EEESKPEPERSGLSWSN-------------REKAKQAFKELLRDKAVPSNASWEQAMKMV 360

Query: 1024 VFDPRFKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTT 1083
            V DPR+ A+P  S ++  F  Y   R +EE++E R   K A +  +  L+   E +  TT
Sbjct: 361  VTDPRYSALPKLSEKKQAFNAYKAQREKEEKEEARLRAKEAKQTLQHFLEQ-HERMTSTT 420

Query: 1084 SYQTLKKKWGNDSRFEALDRKDRENLLNERVLCLKKAAVEKAQALWAASTTSFKSMLQER 1143
             Y+  ++ +G    +  +  +DR+ + ++ +  L K   E+A+ L   +  + KS+L   
Sbjct: 421  RYRRAEQTFGELEVWAVVPERDRKEVYDDVLFFLAKKEKEQAKQLRRRNIQALKSILDGM 480

Query: 1144 EDININSRWFRVKDSLREDPRY------RSVKHEEREMLFNEYISELKAAVEEKQRESKA 1192
              +N  + W + +  L ++P +      +++  E+  + F E+I  L+   EE++RE   
Sbjct: 481  SSVNFQTTWSQAQQYLMDNPSFAQDHQLQNMDKEDALICFEEHIRALERE-EEEERERAR 495

BLAST of Chy1G021000 vs. ExPASy TrEMBL
Match: A0A0A0K978 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G447710 PE=4 SV=1)

HSP 1 Score: 1588.5 bits (4112), Expect = 0.0e+00
Identity = 833/845 (98.58%), Positives = 838/845 (99.17%), Query Frame = 0

Query: 532  MTSASTVSQSVSLPAPPTSNSAANGSSIPNLIPSTSPVPPAPSFHIHQLPSLAPMVPGPP 591
            M+SASTVSQSVSLPAPPTSNSAANGSSIPNLIPSTSPVPPAPSFHIHQLPS+APMVPGPP
Sbjct: 1    MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPSTSPVPPAPSFHIHQLPSVAPMVPGPP 60

Query: 592  GMSPSMPLVSTGPAVLFPPTDSASTIPGPNMHAIHNPIHPSARPQICGSYPSLTPVVSPP 651
            GMSPSMPLVSTGPAVLFPPTDSASTIPGPNMHAIHNPIHPSARPQICGSYPSLTPVVSPP
Sbjct: 61   GMSPSMPLVSTGPAVLFPPTDSASTIPGPNMHAIHNPIHPSARPQICGSYPSLTPVVSPP 120

Query: 652  HAMWFQPPQLGVMPRPPFLPYSTSYHGPLPFPARGMPLPSVPLPDPQPPGVTPVPVASAI 711
            HAMWFQPPQLG MPRPPFLPYSTSYHGPLPFPARGMPLPSVPLPDPQPPGVTPV VAS I
Sbjct: 121  HAMWFQPPQLGAMPRPPFLPYSTSYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASGI 180

Query: 712  AVPSGHGNQLIGNTLIQTDSNHPELDSQKHAQGVGHSENISLNKHSEDWTAHKTEAGIIY 771
            +VPSGHGNQLIGNTLIQTDSNHPELDS KHAQGVGHSENISLNKHSEDWTAHKTEAGIIY
Sbjct: 181  SVPSGHGNQLIGNTLIQTDSNHPELDSHKHAQGVGHSENISLNKHSEDWTAHKTEAGIIY 240

Query: 772  YYNALTGESTYEKPSGFRGEAENLMAQATSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI 831
            YYNALTGESTYEKPSGFRGEAENLMAQATSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI
Sbjct: 241  YYNALTGESTYEKPSGFRGEAENLMAQATSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI 300

Query: 832  SSWQIPNEVSELRQQNDEKTKELSAPLPNNNASTDLGTSSSSINTPAINTGGREATPLRT 891
            SSWQIPNEVSELRQQNDEKTKELSAPLPNNNASTDLGTSS+SINTPAINTGGREATPLRT
Sbjct: 301  SSWQIPNEVSELRQQNDEKTKELSAPLPNNNASTDLGTSSTSINTPAINTGGREATPLRT 360

Query: 892  VGISGSSSALDLIKKKLQDSGTPVASSPISAPTVAQSDVNLPRDADGTVKALQTENKDKP 951
            VGISGSSSALDLIKKKLQDSGTPVASSPISAPTVAQSDVNLPRDAD TVKALQTENKDKP
Sbjct: 361  VGISGSSSALDLIKKKLQDSGTPVASSPISAPTVAQSDVNLPRDADATVKALQTENKDKP 420

Query: 952  KDANADGNVSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF 1011
            KDANADGNVSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF
Sbjct: 421  KDANADGNVSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF 480

Query: 1012 KAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTLK 1071
            KAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQT K
Sbjct: 481  KAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFK 540

Query: 1072 KKWGNDSRFEALDRKDRENLLNERVLCLKKAAVEKAQALWAASTTSFKSMLQEREDININ 1131
            KKWGNDSRFEALDRKDRENLLNERVLCLKKAAVEKAQALWAASTTSFKSMLQEREDININ
Sbjct: 541  KKWGNDSRFEALDRKDRENLLNERVLCLKKAAVEKAQALWAASTTSFKSMLQEREDININ 600

Query: 1132 SRWFRVKDSLREDPRYRSVKHEEREMLFNEYISELKAAVEEKQRESKARKEEQEKLKERE 1191
            SRWFRVKDSLREDPRYRSVKHEEREMLFNEYISELKAA EEKQRESKARKEEQEKLKERE
Sbjct: 601  SRWFRVKDSLREDPRYRSVKHEEREMLFNEYISELKAAEEEKQRESKARKEEQEKLKERE 660

Query: 1192 REWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIKDPQASWTESKVKLEKDPQGRA 1251
            REWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIKDPQASWTESKVKLEKDPQGRA
Sbjct: 661  REWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIKDPQASWTESKVKLEKDPQGRA 720

Query: 1252 SNTDLDSSETEKLFREHVKMLQERCANEFRNLLSEAFTAEVVAQVSEDGKTVLNSWTMAK 1311
            SNTDLDSSETEKLFREHVKMLQERCANEFRNLLSE+FTAEVVAQVSEDGKTVLNSWTMAK
Sbjct: 721  SNTDLDSSETEKLFREHVKMLQERCANEFRNLLSESFTAEVVAQVSEDGKTVLNSWTMAK 780

Query: 1312 RILKPDPRYGKVPRKEREALWRRYADDTVRKQKLANDHKGEKYNDYKNRATTDAGKFPSK 1371
            RILKPDPRYGKVPRKEREALWRRYADDTVRKQKLANDHKGEKYNDYKNRATTDAGKFPSK
Sbjct: 781  RILKPDPRYGKVPRKEREALWRRYADDTVRKQKLANDHKGEKYNDYKNRATTDAGKFPSK 840

Query: 1372 PRIHD 1377
            PRIHD
Sbjct: 841  PRIHD 845

BLAST of Chy1G021000 vs. ExPASy TrEMBL
Match: A0A5A7V0S2 (Pre-mRNA-processing protein 40C OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold242G00550 PE=4 SV=1)

HSP 1 Score: 1557.3 bits (4031), Expect = 0.0e+00
Identity = 820/846 (96.93%), Positives = 828/846 (97.87%), Query Frame = 0

Query: 532  MTSASTVSQSVSLPAPPTSNSAANGSSIPNLIPSTSPVPPAPSFHIHQLPSLAPMVPGPP 591
            M+SASTVSQSVSLPAPPTSNS ANGSSIPNLIPSTSPVPPAPSFHIHQLP +APMVPGPP
Sbjct: 1    MSSASTVSQSVSLPAPPTSNSVANGSSIPNLIPSTSPVPPAPSFHIHQLPPVAPMVPGPP 60

Query: 592  GMSPSMPLVSTGPAVLFPPTDSASTIPGPNMHAIHNPIHPSARPQICGSYPSLTPVVSPP 651
            GMSPS PLVSTGPAVLFPPTDSASTIPGPNMHAIHNPIHPSARPQICGSYPSLTPVVSPP
Sbjct: 61   GMSPSTPLVSTGPAVLFPPTDSASTIPGPNMHAIHNPIHPSARPQICGSYPSLTPVVSPP 120

Query: 652  HAMWFQPPQLGVMPRPPFLPYSTSYHGPLPFPARGMPLPSVPLPDPQPPGVTPVPVASAI 711
            HAMWFQPPQLG MPRPPF+PYS SYHGPLPFPARGMPLPSVPLPDPQPPGVTPV VASAI
Sbjct: 121  HAMWFQPPQLGAMPRPPFIPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASAI 180

Query: 712  AVPSGHGNQLIGNTLIQTDSNHPELDSQKHAQGVGHSENISLNKHSEDWTAHKTEAGIIY 771
             VPSGHGNQLIGN+LIQTDSNHPELDSQKH Q VGHSENISLNKHSEDWTAHKTEAGIIY
Sbjct: 181  PVPSGHGNQLIGNSLIQTDSNHPELDSQKHTQVVGHSENISLNKHSEDWTAHKTEAGIIY 240

Query: 772  YYNALTGESTYEKPSGFRGEAENLMAQATSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI 831
            YYNALTGESTYEKP GFRGEAENL+AQATSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI
Sbjct: 241  YYNALTGESTYEKPPGFRGEAENLVAQATSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI 300

Query: 832  SSWQIPNEVSELRQQNDEKTKELSAPLPNNNASTDLGTSSSSINTPAINTGGREATPLRT 891
            SSWQIPNEVSELRQQNDEKTKELSAPLPNNNA TDLGTSSSSINTPAINTGGREATPLRT
Sbjct: 301  SSWQIPNEVSELRQQNDEKTKELSAPLPNNNALTDLGTSSSSINTPAINTGGREATPLRT 360

Query: 892  VGISGSSSALDLIKKKLQDSGTPVASSPISAPTVAQSDVNLPRDADGTVKALQTE-NKDK 951
            VGI GSSSALDLIKKKLQDSGTPVASSPISA TVAQSDVNLPRDAD TVKALQTE NKDK
Sbjct: 361  VGIPGSSSALDLIKKKLQDSGTPVASSPISATTVAQSDVNLPRDADATVKALQTENNKDK 420

Query: 952  PKDANADGNVSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPR 1011
            PKDANADGNVSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPR
Sbjct: 421  PKDANADGNVSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPR 480

Query: 1012 FKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTL 1071
            FKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQT 
Sbjct: 481  FKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTF 540

Query: 1072 KKKWGNDSRFEALDRKDRENLLNERVLCLKKAAVEKAQALWAASTTSFKSMLQEREDINI 1131
            KKKWGNDSRFEALDRKDRENLLNERVLCLKKAAVEKAQALWAASTTSFKSMLQEREDIN+
Sbjct: 541  KKKWGNDSRFEALDRKDRENLLNERVLCLKKAAVEKAQALWAASTTSFKSMLQEREDINV 600

Query: 1132 NSRWFRVKDSLREDPRYRSVKHEEREMLFNEYISELKAAVEEKQRESKARKEEQEKLKER 1191
            NSRWFRVKDSLREDPRYRSVKHEEREMLFNEYISELKAA EEKQRESKARKEEQEKLKER
Sbjct: 601  NSRWFRVKDSLREDPRYRSVKHEEREMLFNEYISELKAAEEEKQRESKARKEEQEKLKER 660

Query: 1192 EREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIKDPQASWTESKVKLEKDPQGR 1251
            EREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIKDPQASWTESKVKLEKDPQGR
Sbjct: 661  EREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIKDPQASWTESKVKLEKDPQGR 720

Query: 1252 ASNTDLDSSETEKLFREHVKMLQERCANEFRNLLSEAFTAEVVAQVSEDGKTVLNSWTMA 1311
            ASN DLDSSETEKLFREHVKMLQERCANEFRNLLSEAFTAEVVAQVSEDGKTVL+SWTMA
Sbjct: 721  ASNPDLDSSETEKLFREHVKMLQERCANEFRNLLSEAFTAEVVAQVSEDGKTVLSSWTMA 780

Query: 1312 KRILKPDPRYGKVPRKEREALWRRYADDTVRKQKLANDHKGEKYNDYKNRATTDAGKFPS 1371
            KRILKPDPRYGKVPRKEREALWRRYADDT+RKQKLANDHKGEKYNDYKNRATTDAGKFPS
Sbjct: 781  KRILKPDPRYGKVPRKEREALWRRYADDTMRKQKLANDHKGEKYNDYKNRATTDAGKFPS 840

Query: 1372 KPRIHD 1377
            KPRIHD
Sbjct: 841  KPRIHD 846

BLAST of Chy1G021000 vs. ExPASy TrEMBL
Match: A0A1S3CHX0 (pre-mRNA-processing protein 40C OS=Cucumis melo OX=3656 GN=LOC103500614 PE=4 SV=1)

HSP 1 Score: 1557.3 bits (4031), Expect = 0.0e+00
Identity = 820/846 (96.93%), Positives = 828/846 (97.87%), Query Frame = 0

Query: 532  MTSASTVSQSVSLPAPPTSNSAANGSSIPNLIPSTSPVPPAPSFHIHQLPSLAPMVPGPP 591
            M+SASTVSQSVSLPAPPTSNS ANGSSIPNLIPSTSPVPPAPSFHIHQLP +APMVPGPP
Sbjct: 1    MSSASTVSQSVSLPAPPTSNSVANGSSIPNLIPSTSPVPPAPSFHIHQLPPVAPMVPGPP 60

Query: 592  GMSPSMPLVSTGPAVLFPPTDSASTIPGPNMHAIHNPIHPSARPQICGSYPSLTPVVSPP 651
            GMSPS PLVSTGPAVLFPPTDSASTIPGPNMHAIHNPIHPSARPQICGSYPSLTPVVSPP
Sbjct: 61   GMSPSTPLVSTGPAVLFPPTDSASTIPGPNMHAIHNPIHPSARPQICGSYPSLTPVVSPP 120

Query: 652  HAMWFQPPQLGVMPRPPFLPYSTSYHGPLPFPARGMPLPSVPLPDPQPPGVTPVPVASAI 711
            HAMWFQPPQLG MPRPPF+PYS SYHGPLPFPARGMPLPSVPLPDPQPPGVTPV VASAI
Sbjct: 121  HAMWFQPPQLGAMPRPPFIPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASAI 180

Query: 712  AVPSGHGNQLIGNTLIQTDSNHPELDSQKHAQGVGHSENISLNKHSEDWTAHKTEAGIIY 771
             VPSGHGNQLIGN+LIQTDSNHPELDSQKH Q VGHSENISLNKHSEDWTAHKTEAGIIY
Sbjct: 181  PVPSGHGNQLIGNSLIQTDSNHPELDSQKHTQVVGHSENISLNKHSEDWTAHKTEAGIIY 240

Query: 772  YYNALTGESTYEKPSGFRGEAENLMAQATSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI 831
            YYNALTGESTYEKP GFRGEAENL+AQATSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI
Sbjct: 241  YYNALTGESTYEKPPGFRGEAENLVAQATSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI 300

Query: 832  SSWQIPNEVSELRQQNDEKTKELSAPLPNNNASTDLGTSSSSINTPAINTGGREATPLRT 891
            SSWQIPNEVSELRQQNDEKTKELSAPLPNNNA TDLGTSSSSINTPAINTGGREATPLRT
Sbjct: 301  SSWQIPNEVSELRQQNDEKTKELSAPLPNNNALTDLGTSSSSINTPAINTGGREATPLRT 360

Query: 892  VGISGSSSALDLIKKKLQDSGTPVASSPISAPTVAQSDVNLPRDADGTVKALQTE-NKDK 951
            VGI GSSSALDLIKKKLQDSGTPVASSPISA TVAQSDVNLPRDAD TVKALQTE NKDK
Sbjct: 361  VGIPGSSSALDLIKKKLQDSGTPVASSPISATTVAQSDVNLPRDADATVKALQTENNKDK 420

Query: 952  PKDANADGNVSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPR 1011
            PKDANADGNVSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPR
Sbjct: 421  PKDANADGNVSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPR 480

Query: 1012 FKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTL 1071
            FKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQT 
Sbjct: 481  FKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTF 540

Query: 1072 KKKWGNDSRFEALDRKDRENLLNERVLCLKKAAVEKAQALWAASTTSFKSMLQEREDINI 1131
            KKKWGNDSRFEALDRKDRENLLNERVLCLKKAAVEKAQALWAASTTSFKSMLQEREDIN+
Sbjct: 541  KKKWGNDSRFEALDRKDRENLLNERVLCLKKAAVEKAQALWAASTTSFKSMLQEREDINV 600

Query: 1132 NSRWFRVKDSLREDPRYRSVKHEEREMLFNEYISELKAAVEEKQRESKARKEEQEKLKER 1191
            NSRWFRVKDSLREDPRYRSVKHEEREMLFNEYISELKAA EEKQRESKARKEEQEKLKER
Sbjct: 601  NSRWFRVKDSLREDPRYRSVKHEEREMLFNEYISELKAAEEEKQRESKARKEEQEKLKER 660

Query: 1192 EREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIKDPQASWTESKVKLEKDPQGR 1251
            EREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIKDPQASWTESKVKLEKDPQGR
Sbjct: 661  EREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIKDPQASWTESKVKLEKDPQGR 720

Query: 1252 ASNTDLDSSETEKLFREHVKMLQERCANEFRNLLSEAFTAEVVAQVSEDGKTVLNSWTMA 1311
            ASN DLDSSETEKLFREHVKMLQERCANEFRNLLSEAFTAEVVAQVSEDGKTVL+SWTMA
Sbjct: 721  ASNPDLDSSETEKLFREHVKMLQERCANEFRNLLSEAFTAEVVAQVSEDGKTVLSSWTMA 780

Query: 1312 KRILKPDPRYGKVPRKEREALWRRYADDTVRKQKLANDHKGEKYNDYKNRATTDAGKFPS 1371
            KRILKPDPRYGKVPRKEREALWRRYADDT+RKQKLANDHKGEKYNDYKNRATTDAGKFPS
Sbjct: 781  KRILKPDPRYGKVPRKEREALWRRYADDTMRKQKLANDHKGEKYNDYKNRATTDAGKFPS 840

Query: 1372 KPRIHD 1377
            KPRIHD
Sbjct: 841  KPRIHD 846

BLAST of Chy1G021000 vs. ExPASy TrEMBL
Match: A0A5D3BYD0 (Pre-mRNA-processing protein 40C OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold347G001920 PE=4 SV=1)

HSP 1 Score: 1496.5 bits (3873), Expect = 0.0e+00
Identity = 789/846 (93.26%), Positives = 797/846 (94.21%), Query Frame = 0

Query: 532  MTSASTVSQSVSLPAPPTSNSAANGSSIPNLIPSTSPVPPAPSFHIHQLPSLAPMVPGPP 591
            M+SASTVSQSVSLPAPPTSNS ANGSSIPNLIPSTSPVPPAPSFHIHQLP +APMVPGPP
Sbjct: 1    MSSASTVSQSVSLPAPPTSNSVANGSSIPNLIPSTSPVPPAPSFHIHQLPPVAPMVPGPP 60

Query: 592  GMSPSMPLVSTGPAVLFPPTDSASTIPGPNMHAIHNPIHPSARPQICGSYPSLTPVVSPP 651
            GMSPS PLVSTGPAVLFPPTDSASTIPGPNMHAIHNPIHPSARPQICGSYPSLTPVVSPP
Sbjct: 61   GMSPSTPLVSTGPAVLFPPTDSASTIPGPNMHAIHNPIHPSARPQICGSYPSLTPVVSPP 120

Query: 652  HAMWFQPPQLGVMPRPPFLPYSTSYHGPLPFPARGMPLPSVPLPDPQPPGVTPVPVASAI 711
            HAMWFQPPQLG MPRPPF+PYS SYHGPLPFPARGMPLPSVPLPDPQPPGVTPV VASAI
Sbjct: 121  HAMWFQPPQLGAMPRPPFIPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASAI 180

Query: 712  AVPSGHGNQLIGNTLIQTDSNHPELDSQKHAQGVGHSENISLNKHSEDWTAHKTEAGIIY 771
             VPSGHGNQLIGN+LIQTDSNHPELDSQKH Q VGHSENISLNKHSEDWTAHKTEAGIIY
Sbjct: 181  PVPSGHGNQLIGNSLIQTDSNHPELDSQKHTQVVGHSENISLNKHSEDWTAHKTEAGIIY 240

Query: 772  YYNALTGESTYEKPSGFRGEAENLMAQATSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI 831
            YYNALTGESTYEKP GFRGEAENL+AQATSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI
Sbjct: 241  YYNALTGESTYEKPPGFRGEAENLVAQATSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI 300

Query: 832  SSWQIPNEVSELRQQNDEKTKELSAPLPNNNASTDLGTSSSSINTPAINTGGREATPLRT 891
            SSWQIPNEVSELRQQNDEKTKELSAPLPNNNA TDLGTSSSSINTPAINTGGREATPLRT
Sbjct: 301  SSWQIPNEVSELRQQNDEKTKELSAPLPNNNALTDLGTSSSSINTPAINTGGREATPLRT 360

Query: 892  VGISGSSSALDLIKKKLQDSGTPVASSPISAPTVAQSDVNLPRDADGTVKALQTE-NKDK 951
            VGI GSSSALDLIKKKLQDSGTPVASSPISA TVAQSDVNLPRDAD TVKALQTE NKDK
Sbjct: 361  VGIPGSSSALDLIKKKLQDSGTPVASSPISATTVAQSDVNLPRDADATVKALQTENNKDK 420

Query: 952  PKDANADGNVSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPR 1011
            PKDANADGNVSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPR
Sbjct: 421  PKDANADGNVSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPR 480

Query: 1012 FKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTL 1071
            FKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQT 
Sbjct: 481  FKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTF 540

Query: 1072 KKKWGNDSRFEALDRKDRENLLNERVLCLKKAAVEKAQALWAASTTSFKSMLQEREDINI 1131
            KKKWGNDSRFEALDRKDRENLLNERVLCLKKAAVEKAQALWAASTTSFKSMLQEREDIN+
Sbjct: 541  KKKWGNDSRFEALDRKDRENLLNERVLCLKKAAVEKAQALWAASTTSFKSMLQEREDINV 600

Query: 1132 NSRWFRVKDSLREDPRYRSVKHEEREMLFNEYISELKAAVEEKQRESKARKEEQEKLKER 1191
            NSRWFRVKDSLREDPRYRSVKHEEREMLFNEYISELKAA EEKQRESKARKEEQ      
Sbjct: 601  NSRWFRVKDSLREDPRYRSVKHEEREMLFNEYISELKAAEEEKQRESKARKEEQ------ 660

Query: 1192 EREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIKDPQASWTESKVKLEKDPQGR 1251
                                     EAVASFQALLVESIKDPQASWTESKVKLEKDPQGR
Sbjct: 661  -------------------------EAVASFQALLVESIKDPQASWTESKVKLEKDPQGR 720

Query: 1252 ASNTDLDSSETEKLFREHVKMLQERCANEFRNLLSEAFTAEVVAQVSEDGKTVLNSWTMA 1311
            ASN DLDSSETEKLFREHVKMLQERCANEFRNLLSEAFTAEVVAQVSEDGKTVL+SWTMA
Sbjct: 721  ASNPDLDSSETEKLFREHVKMLQERCANEFRNLLSEAFTAEVVAQVSEDGKTVLSSWTMA 780

Query: 1312 KRILKPDPRYGKVPRKEREALWRRYADDTVRKQKLANDHKGEKYNDYKNRATTDAGKFPS 1371
            KRILKPDPRYGKVPRKEREALWRRYADDT+RKQKLANDHKGEKYNDYKNRATTDAGKFPS
Sbjct: 781  KRILKPDPRYGKVPRKEREALWRRYADDTMRKQKLANDHKGEKYNDYKNRATTDAGKFPS 815

Query: 1372 KPRIHD 1377
            KPRIHD
Sbjct: 841  KPRIHD 815

BLAST of Chy1G021000 vs. ExPASy TrEMBL
Match: A0A6J1GNF1 (pre-mRNA-processing protein 40C OS=Cucurbita moschata OX=3662 GN=LOC111456014 PE=4 SV=1)

HSP 1 Score: 1399.4 bits (3621), Expect = 0.0e+00
Identity = 744/848 (87.74%), Positives = 785/848 (92.57%), Query Frame = 0

Query: 532  MTSASTVSQSVSLPAPPTSNSAANGSSIPNLIPSTSPVPPAPSFHIHQLPSLAPMVPGPP 591
            M+SASTVSQS+SLPAPPTSNSAANGSSIPNLIP+TSPVPPA SFHIHQL    PMVPGPP
Sbjct: 1    MSSASTVSQSMSLPAPPTSNSAANGSSIPNLIPATSPVPPAQSFHIHQLAPGTPMVPGPP 60

Query: 592  GMSPSMPLVSTGPAVLFPPTDS--ASTIPGPNMHAIHNPIHPSARPQICGSYPSLTPVVS 651
            GMSPSMP       V+FPP+DS  +STIPGPNMHA  N I+ S RPQICGSYPSL PVVS
Sbjct: 61   GMSPSMP-------VMFPPSDSSASSTIPGPNMHAAPNSINTSVRPQICGSYPSLAPVVS 120

Query: 652  PPHAMWFQPPQLGVMPRPPFLPYSTSYHGPLPFPARGMPLPSVPLPDPQPPGVTPVPVAS 711
            PPHA+WFQPPQLG MPRPPFLPY  SYHGPLPFPARGMPLPSVPLPDPQPPGVTPV V+S
Sbjct: 121  PPHAIWFQPPQLGGMPRPPFLPYPASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVSS 180

Query: 712  AIAVPSGHGNQLIGNTLIQTDSNHPELDSQKHAQGVGHSENISLNKHSEDWTAHKTEAGI 771
            A AVPS HGN L GN+LIQTD NHPELD+QKHAQG+G SE+ISL+KHSE+WTAHKTEAGI
Sbjct: 181  ATAVPSTHGNHLTGNSLIQTDFNHPELDAQKHAQGMGQSESISLSKHSENWTAHKTEAGI 240

Query: 772  IYYYNALTGESTYEKPSGFRGEAENLMAQATSVSMSNLSGTDWVLVTMGDGKKYYYNNKT 831
            +YYYNALTGESTYEKPSGF+GE +NLM Q TSVSMSNLSGTDWVLVTMGDGKKYYYNNKT
Sbjct: 241  MYYYNALTGESTYEKPSGFKGEPDNLMVQPTSVSMSNLSGTDWVLVTMGDGKKYYYNNKT 300

Query: 832  KISSWQIPNEVSELRQQNDEKTKELSAPLPNNNASTDLGTSSSSINTPAINTGGREATPL 891
            KISSWQIPNEV+ELRQQNDEKTKE SAPLPNNNA T+ G+S  S+NTPAINTGGREA PL
Sbjct: 301  KISSWQIPNEVTELRQQNDEKTKEHSAPLPNNNALTEPGSSPISMNTPAINTGGREAMPL 360

Query: 892  RTVGISGSSSALDLIKKKLQDSGTPVASSPISAPTVAQSDVNLPRDADGTVKALQTEN-K 951
            RTVG+SG SSALDLIKKKLQ+SGTPVASSPIS PT+AQSDVNLPRDAD  VKALQTEN K
Sbjct: 361  RTVGVSGPSSALDLIKKKLQESGTPVASSPISVPTIAQSDVNLPRDADAAVKALQTENSK 420

Query: 952  DKPKDANADGNVSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFD 1011
            DKPKDAN DGNVSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFD
Sbjct: 421  DKPKDANGDGNVSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFD 480

Query: 1012 PRFKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQ 1071
            PRFKAIPSYSARRSLFEH+VKTRAEEERKEKRAAQKAAIEGFKQLLD ASEDIDHTTSYQ
Sbjct: 481  PRFKAIPSYSARRSLFEHFVKTRAEEERKEKRAAQKAAIEGFKQLLDRASEDIDHTTSYQ 540

Query: 1072 TLKKKWGNDSRFEALDRKDRENLLNERVLCLKKAAVEKAQALWAASTTSFKSMLQEREDI 1131
            T KKKWGND RFEALDRKDRENLL+ERVLCLKKAAVEKAQALWAASTTSFKSMLQER DI
Sbjct: 541  TFKKKWGNDPRFEALDRKDRENLLSERVLCLKKAAVEKAQALWAASTTSFKSMLQERGDI 600

Query: 1132 NINSRWFRVKDSLREDPRYRSVKHEEREMLFNEYISELKAAVEEKQRESKARKEEQEKLK 1191
            N+NSRW RVKDSLR+DPRYRSVKHE+REMLFNEYISELKA  EEKQRESKARKEEQEKLK
Sbjct: 601  NVNSRWLRVKDSLRDDPRYRSVKHEDREMLFNEYISELKAVEEEKQRESKARKEEQEKLK 660

Query: 1192 EREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIKDPQASWTESKVKLEKDPQ 1251
            EREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIKDPQASW+ESKVKLEKDPQ
Sbjct: 661  EREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIKDPQASWSESKVKLEKDPQ 720

Query: 1252 GRASNTDLDSSETEKLFREHVKMLQERCANEFRNLLSEAFTAEVVAQVSEDGKTVLNSWT 1311
            GRASN DLDSS+TEKLFREHVKMLQERCANEFR LLSEAFTAEVV+QVSEDGKTVLNSWT
Sbjct: 721  GRASNPDLDSSDTEKLFREHVKMLQERCANEFRTLLSEAFTAEVVSQVSEDGKTVLNSWT 780

Query: 1312 MAKRILKPDPRYGKVPRKEREALWRRYADDTVRKQKLANDHKGEKYNDYKNRATTDAGKF 1371
            MAKR LKPDPRY K+PRKEREALWRRYADDT+RKQK AND K EK+++ K+R+T  AGK 
Sbjct: 781  MAKRTLKPDPRYSKLPRKEREALWRRYADDTLRKQKSANDDKVEKHSNSKSRSTNVAGKL 840

Query: 1372 PSKPRIHD 1377
            PSKPRIH+
Sbjct: 841  PSKPRIHE 841

BLAST of Chy1G021000 vs. NCBI nr
Match: XP_011659583.1 (pre-mRNA-processing protein 40C [Cucumis sativus] >KGN45409.1 hypothetical protein Csa_016573 [Cucumis sativus])

HSP 1 Score: 1582 bits (4096), Expect = 0.0
Identity = 833/845 (98.58%), Positives = 838/845 (99.17%), Query Frame = 0

Query: 532  MTSASTVSQSVSLPAPPTSNSAANGSSIPNLIPSTSPVPPAPSFHIHQLPSLAPMVPGPP 591
            M+SASTVSQSVSLPAPPTSNSAANGSSIPNLIPSTSPVPPAPSFHIHQLPS+APMVPGPP
Sbjct: 1    MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPSTSPVPPAPSFHIHQLPSVAPMVPGPP 60

Query: 592  GMSPSMPLVSTGPAVLFPPTDSASTIPGPNMHAIHNPIHPSARPQICGSYPSLTPVVSPP 651
            GMSPSMPLVSTGPAVLFPPTDSASTIPGPNMHAIHNPIHPSARPQICGSYPSLTPVVSPP
Sbjct: 61   GMSPSMPLVSTGPAVLFPPTDSASTIPGPNMHAIHNPIHPSARPQICGSYPSLTPVVSPP 120

Query: 652  HAMWFQPPQLGVMPRPPFLPYSTSYHGPLPFPARGMPLPSVPLPDPQPPGVTPVPVASAI 711
            HAMWFQPPQLG MPRPPFLPYSTSYHGPLPFPARGMPLPSVPLPDPQPPGVTPV VAS I
Sbjct: 121  HAMWFQPPQLGAMPRPPFLPYSTSYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASGI 180

Query: 712  AVPSGHGNQLIGNTLIQTDSNHPELDSQKHAQGVGHSENISLNKHSEDWTAHKTEAGIIY 771
            +VPSGHGNQLIGNTLIQTDSNHPELDS KHAQGVGHSENISLNKHSEDWTAHKTEAGIIY
Sbjct: 181  SVPSGHGNQLIGNTLIQTDSNHPELDSHKHAQGVGHSENISLNKHSEDWTAHKTEAGIIY 240

Query: 772  YYNALTGESTYEKPSGFRGEAENLMAQATSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI 831
            YYNALTGESTYEKPSGFRGEAENLMAQATSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI
Sbjct: 241  YYNALTGESTYEKPSGFRGEAENLMAQATSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI 300

Query: 832  SSWQIPNEVSELRQQNDEKTKELSAPLPNNNASTDLGTSSSSINTPAINTGGREATPLRT 891
            SSWQIPNEVSELRQQNDEKTKELSAPLPNNNASTDLGTSS+SINTPAINTGGREATPLRT
Sbjct: 301  SSWQIPNEVSELRQQNDEKTKELSAPLPNNNASTDLGTSSTSINTPAINTGGREATPLRT 360

Query: 892  VGISGSSSALDLIKKKLQDSGTPVASSPISAPTVAQSDVNLPRDADGTVKALQTENKDKP 951
            VGISGSSSALDLIKKKLQDSGTPVASSPISAPTVAQSDVNLPRDAD TVKALQTENKDKP
Sbjct: 361  VGISGSSSALDLIKKKLQDSGTPVASSPISAPTVAQSDVNLPRDADATVKALQTENKDKP 420

Query: 952  KDANADGNVSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF 1011
            KDANADGNVSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF
Sbjct: 421  KDANADGNVSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF 480

Query: 1012 KAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTLK 1071
            KAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQT K
Sbjct: 481  KAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFK 540

Query: 1072 KKWGNDSRFEALDRKDRENLLNERVLCLKKAAVEKAQALWAASTTSFKSMLQEREDININ 1131
            KKWGNDSRFEALDRKDRENLLNERVLCLKKAAVEKAQALWAASTTSFKSMLQEREDININ
Sbjct: 541  KKWGNDSRFEALDRKDRENLLNERVLCLKKAAVEKAQALWAASTTSFKSMLQEREDININ 600

Query: 1132 SRWFRVKDSLREDPRYRSVKHEEREMLFNEYISELKAAVEEKQRESKARKEEQEKLKERE 1191
            SRWFRVKDSLREDPRYRSVKHEEREMLFNEYISELKAA EEKQRESKARKEEQEKLKERE
Sbjct: 601  SRWFRVKDSLREDPRYRSVKHEEREMLFNEYISELKAAEEEKQRESKARKEEQEKLKERE 660

Query: 1192 REWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIKDPQASWTESKVKLEKDPQGRA 1251
            REWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIKDPQASWTESKVKLEKDPQGRA
Sbjct: 661  REWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIKDPQASWTESKVKLEKDPQGRA 720

Query: 1252 SNTDLDSSETEKLFREHVKMLQERCANEFRNLLSEAFTAEVVAQVSEDGKTVLNSWTMAK 1311
            SNTDLDSSETEKLFREHVKMLQERCANEFRNLLSE+FTAEVVAQVSEDGKTVLNSWTMAK
Sbjct: 721  SNTDLDSSETEKLFREHVKMLQERCANEFRNLLSESFTAEVVAQVSEDGKTVLNSWTMAK 780

Query: 1312 RILKPDPRYGKVPRKEREALWRRYADDTVRKQKLANDHKGEKYNDYKNRATTDAGKFPSK 1371
            RILKPDPRYGKVPRKEREALWRRYADDTVRKQKLANDHKGEKYNDYKNRATTDAGKFPSK
Sbjct: 781  RILKPDPRYGKVPRKEREALWRRYADDTVRKQKLANDHKGEKYNDYKNRATTDAGKFPSK 840

Query: 1372 PRIHD 1376
            PRIHD
Sbjct: 841  PRIHD 845

BLAST of Chy1G021000 vs. NCBI nr
Match: XP_008462197.1 (PREDICTED: pre-mRNA-processing protein 40C [Cucumis melo] >KAA0059331.1 pre-mRNA-processing protein 40C [Cucumis melo var. makuwa])

HSP 1 Score: 1551 bits (4015), Expect = 0.0
Identity = 820/846 (96.93%), Positives = 828/846 (97.87%), Query Frame = 0

Query: 532  MTSASTVSQSVSLPAPPTSNSAANGSSIPNLIPSTSPVPPAPSFHIHQLPSLAPMVPGPP 591
            M+SASTVSQSVSLPAPPTSNS ANGSSIPNLIPSTSPVPPAPSFHIHQLP +APMVPGPP
Sbjct: 1    MSSASTVSQSVSLPAPPTSNSVANGSSIPNLIPSTSPVPPAPSFHIHQLPPVAPMVPGPP 60

Query: 592  GMSPSMPLVSTGPAVLFPPTDSASTIPGPNMHAIHNPIHPSARPQICGSYPSLTPVVSPP 651
            GMSPS PLVSTGPAVLFPPTDSASTIPGPNMHAIHNPIHPSARPQICGSYPSLTPVVSPP
Sbjct: 61   GMSPSTPLVSTGPAVLFPPTDSASTIPGPNMHAIHNPIHPSARPQICGSYPSLTPVVSPP 120

Query: 652  HAMWFQPPQLGVMPRPPFLPYSTSYHGPLPFPARGMPLPSVPLPDPQPPGVTPVPVASAI 711
            HAMWFQPPQLG MPRPPF+PYS SYHGPLPFPARGMPLPSVPLPDPQPPGVTPV VASAI
Sbjct: 121  HAMWFQPPQLGAMPRPPFIPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASAI 180

Query: 712  AVPSGHGNQLIGNTLIQTDSNHPELDSQKHAQGVGHSENISLNKHSEDWTAHKTEAGIIY 771
             VPSGHGNQLIGN+LIQTDSNHPELDSQKH Q VGHSENISLNKHSEDWTAHKTEAGIIY
Sbjct: 181  PVPSGHGNQLIGNSLIQTDSNHPELDSQKHTQVVGHSENISLNKHSEDWTAHKTEAGIIY 240

Query: 772  YYNALTGESTYEKPSGFRGEAENLMAQATSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI 831
            YYNALTGESTYEKP GFRGEAENL+AQATSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI
Sbjct: 241  YYNALTGESTYEKPPGFRGEAENLVAQATSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI 300

Query: 832  SSWQIPNEVSELRQQNDEKTKELSAPLPNNNASTDLGTSSSSINTPAINTGGREATPLRT 891
            SSWQIPNEVSELRQQNDEKTKELSAPLPNNNA TDLGTSSSSINTPAINTGGREATPLRT
Sbjct: 301  SSWQIPNEVSELRQQNDEKTKELSAPLPNNNALTDLGTSSSSINTPAINTGGREATPLRT 360

Query: 892  VGISGSSSALDLIKKKLQDSGTPVASSPISAPTVAQSDVNLPRDADGTVKALQTEN-KDK 951
            VGI GSSSALDLIKKKLQDSGTPVASSPISA TVAQSDVNLPRDAD TVKALQTEN KDK
Sbjct: 361  VGIPGSSSALDLIKKKLQDSGTPVASSPISATTVAQSDVNLPRDADATVKALQTENNKDK 420

Query: 952  PKDANADGNVSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPR 1011
            PKDANADGNVSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPR
Sbjct: 421  PKDANADGNVSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPR 480

Query: 1012 FKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTL 1071
            FKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQT 
Sbjct: 481  FKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTF 540

Query: 1072 KKKWGNDSRFEALDRKDRENLLNERVLCLKKAAVEKAQALWAASTTSFKSMLQEREDINI 1131
            KKKWGNDSRFEALDRKDRENLLNERVLCLKKAAVEKAQALWAASTTSFKSMLQEREDIN+
Sbjct: 541  KKKWGNDSRFEALDRKDRENLLNERVLCLKKAAVEKAQALWAASTTSFKSMLQEREDINV 600

Query: 1132 NSRWFRVKDSLREDPRYRSVKHEEREMLFNEYISELKAAVEEKQRESKARKEEQEKLKER 1191
            NSRWFRVKDSLREDPRYRSVKHEEREMLFNEYISELKAA EEKQRESKARKEEQEKLKER
Sbjct: 601  NSRWFRVKDSLREDPRYRSVKHEEREMLFNEYISELKAAEEEKQRESKARKEEQEKLKER 660

Query: 1192 EREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIKDPQASWTESKVKLEKDPQGR 1251
            EREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIKDPQASWTESKVKLEKDPQGR
Sbjct: 661  EREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIKDPQASWTESKVKLEKDPQGR 720

Query: 1252 ASNTDLDSSETEKLFREHVKMLQERCANEFRNLLSEAFTAEVVAQVSEDGKTVLNSWTMA 1311
            ASN DLDSSETEKLFREHVKMLQERCANEFRNLLSEAFTAEVVAQVSEDGKTVL+SWTMA
Sbjct: 721  ASNPDLDSSETEKLFREHVKMLQERCANEFRNLLSEAFTAEVVAQVSEDGKTVLSSWTMA 780

Query: 1312 KRILKPDPRYGKVPRKEREALWRRYADDTVRKQKLANDHKGEKYNDYKNRATTDAGKFPS 1371
            KRILKPDPRYGKVPRKEREALWRRYADDT+RKQKLANDHKGEKYNDYKNRATTDAGKFPS
Sbjct: 781  KRILKPDPRYGKVPRKEREALWRRYADDTMRKQKLANDHKGEKYNDYKNRATTDAGKFPS 840

Query: 1372 KPRIHD 1376
            KPRIHD
Sbjct: 841  KPRIHD 846

BLAST of Chy1G021000 vs. NCBI nr
Match: TYK03994.1 (pre-mRNA-processing protein 40C [Cucumis melo var. makuwa])

HSP 1 Score: 1489 bits (3856), Expect = 0.0
Identity = 789/846 (93.26%), Positives = 797/846 (94.21%), Query Frame = 0

Query: 532  MTSASTVSQSVSLPAPPTSNSAANGSSIPNLIPSTSPVPPAPSFHIHQLPSLAPMVPGPP 591
            M+SASTVSQSVSLPAPPTSNS ANGSSIPNLIPSTSPVPPAPSFHIHQLP +APMVPGPP
Sbjct: 1    MSSASTVSQSVSLPAPPTSNSVANGSSIPNLIPSTSPVPPAPSFHIHQLPPVAPMVPGPP 60

Query: 592  GMSPSMPLVSTGPAVLFPPTDSASTIPGPNMHAIHNPIHPSARPQICGSYPSLTPVVSPP 651
            GMSPS PLVSTGPAVLFPPTDSASTIPGPNMHAIHNPIHPSARPQICGSYPSLTPVVSPP
Sbjct: 61   GMSPSTPLVSTGPAVLFPPTDSASTIPGPNMHAIHNPIHPSARPQICGSYPSLTPVVSPP 120

Query: 652  HAMWFQPPQLGVMPRPPFLPYSTSYHGPLPFPARGMPLPSVPLPDPQPPGVTPVPVASAI 711
            HAMWFQPPQLG MPRPPF+PYS SYHGPLPFPARGMPLPSVPLPDPQPPGVTPV VASAI
Sbjct: 121  HAMWFQPPQLGAMPRPPFIPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASAI 180

Query: 712  AVPSGHGNQLIGNTLIQTDSNHPELDSQKHAQGVGHSENISLNKHSEDWTAHKTEAGIIY 771
             VPSGHGNQLIGN+LIQTDSNHPELDSQKH Q VGHSENISLNKHSEDWTAHKTEAGIIY
Sbjct: 181  PVPSGHGNQLIGNSLIQTDSNHPELDSQKHTQVVGHSENISLNKHSEDWTAHKTEAGIIY 240

Query: 772  YYNALTGESTYEKPSGFRGEAENLMAQATSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI 831
            YYNALTGESTYEKP GFRGEAENL+AQATSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI
Sbjct: 241  YYNALTGESTYEKPPGFRGEAENLVAQATSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI 300

Query: 832  SSWQIPNEVSELRQQNDEKTKELSAPLPNNNASTDLGTSSSSINTPAINTGGREATPLRT 891
            SSWQIPNEVSELRQQNDEKTKELSAPLPNNNA TDLGTSSSSINTPAINTGGREATPLRT
Sbjct: 301  SSWQIPNEVSELRQQNDEKTKELSAPLPNNNALTDLGTSSSSINTPAINTGGREATPLRT 360

Query: 892  VGISGSSSALDLIKKKLQDSGTPVASSPISAPTVAQSDVNLPRDADGTVKALQTEN-KDK 951
            VGI GSSSALDLIKKKLQDSGTPVASSPISA TVAQSDVNLPRDAD TVKALQTEN KDK
Sbjct: 361  VGIPGSSSALDLIKKKLQDSGTPVASSPISATTVAQSDVNLPRDADATVKALQTENNKDK 420

Query: 952  PKDANADGNVSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPR 1011
            PKDANADGNVSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPR
Sbjct: 421  PKDANADGNVSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPR 480

Query: 1012 FKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTL 1071
            FKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQT 
Sbjct: 481  FKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTF 540

Query: 1072 KKKWGNDSRFEALDRKDRENLLNERVLCLKKAAVEKAQALWAASTTSFKSMLQEREDINI 1131
            KKKWGNDSRFEALDRKDRENLLNERVLCLKKAAVEKAQALWAASTTSFKSMLQEREDIN+
Sbjct: 541  KKKWGNDSRFEALDRKDRENLLNERVLCLKKAAVEKAQALWAASTTSFKSMLQEREDINV 600

Query: 1132 NSRWFRVKDSLREDPRYRSVKHEEREMLFNEYISELKAAVEEKQRESKARKEEQEKLKER 1191
            NSRWFRVKDSLREDPRYRSVKHEEREMLFNEYISELKAA EEKQRESKARKEEQ      
Sbjct: 601  NSRWFRVKDSLREDPRYRSVKHEEREMLFNEYISELKAAEEEKQRESKARKEEQ------ 660

Query: 1192 EREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIKDPQASWTESKVKLEKDPQGR 1251
                                     EAVASFQALLVESIKDPQASWTESKVKLEKDPQGR
Sbjct: 661  -------------------------EAVASFQALLVESIKDPQASWTESKVKLEKDPQGR 720

Query: 1252 ASNTDLDSSETEKLFREHVKMLQERCANEFRNLLSEAFTAEVVAQVSEDGKTVLNSWTMA 1311
            ASN DLDSSETEKLFREHVKMLQERCANEFRNLLSEAFTAEVVAQVSEDGKTVL+SWTMA
Sbjct: 721  ASNPDLDSSETEKLFREHVKMLQERCANEFRNLLSEAFTAEVVAQVSEDGKTVLSSWTMA 780

Query: 1312 KRILKPDPRYGKVPRKEREALWRRYADDTVRKQKLANDHKGEKYNDYKNRATTDAGKFPS 1371
            KRILKPDPRYGKVPRKEREALWRRYADDT+RKQKLANDHKGEKYNDYKNRATTDAGKFPS
Sbjct: 781  KRILKPDPRYGKVPRKEREALWRRYADDTMRKQKLANDHKGEKYNDYKNRATTDAGKFPS 815

Query: 1372 KPRIHD 1376
            KPRIHD
Sbjct: 841  KPRIHD 815

BLAST of Chy1G021000 vs. NCBI nr
Match: XP_038900162.1 (pre-mRNA-processing protein 40C [Benincasa hispida])

HSP 1 Score: 1448 bits (3748), Expect = 0.0
Identity = 773/846 (91.37%), Positives = 796/846 (94.09%), Query Frame = 0

Query: 532  MTSASTVSQSVSLPAPPTSNSAANGSSIPNLIPSTSPVPPAPSFHIHQLPSLAPMVPGPP 591
            M+SASTVSQSVSLPAPPTSNS ANGSSIPNLIP       APSFH HQL    PMVPGPP
Sbjct: 1    MSSASTVSQSVSLPAPPTSNSVANGSSIPNLIP-------APSFHSHQLLPGTPMVPGPP 60

Query: 592  GMSPSMPLVSTGPAVLFPPTDSASTIPGPNMHAIHNPIHPSARPQICGSYPSLTPVVSPP 651
            GMSPS+P+VST PA LFPP DSASTIPGP+MHA  N I+PS RPQICGSYPSLTPVVSPP
Sbjct: 61   GMSPSLPVVSTTPAALFPPNDSASTIPGPHMHATPNSINPSLRPQICGSYPSLTPVVSPP 120

Query: 652  HAMWFQPPQLGVMPRPPFLPYSTSYHGPLPFPARGMPLPSVPLPDPQPPGVTPVPVASAI 711
            HA+WFQPPQLG MPRPPFLPYS SYHGPLPFPARGMPLPSVPLPDPQPPGVTPV VASAI
Sbjct: 121  HAIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASAI 180

Query: 712  AVPSGHGNQLIGNTLIQTDSNHPELDSQKHAQGVGHSENISLNKHSEDWTAHKTEAGIIY 771
            AV SGHGNQL GN+LIQTDSNHP+LDSQKHAQGVG SENI L KHSEDWTAHKTEAGIIY
Sbjct: 181  AVSSGHGNQLSGNSLIQTDSNHPQLDSQKHAQGVGQSENIPLTKHSEDWTAHKTEAGIIY 240

Query: 772  YYNALTGESTYEKPSGFRGEAENLMAQATSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI 831
            YYNALTGESTYEKPSGF+GE EN+MAQ TSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI
Sbjct: 241  YYNALTGESTYEKPSGFKGEPENVMAQPTSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI 300

Query: 832  SSWQIPNEVSELRQQNDEKTKELSAPLPNNNASTDLGTSSSSINTPAINTGGREATPLRT 891
            SSWQIPNEVSELRQQNDEKTKE SAPLPNNNA TDLGTSS SINTPAINTGGREATPLR 
Sbjct: 301  SSWQIPNEVSELRQQNDEKTKEHSAPLPNNNALTDLGTSSISINTPAINTGGREATPLRM 360

Query: 892  VGISGSSSALDLIKKKLQDSGTPVASSPISAPTVAQSDVNLPRDADGTVKALQTEN-KDK 951
            VGISGSSSALDLIKKKLQDSGTPVASSPISAPTVAQ DVNL RDAD TVKALQTEN KDK
Sbjct: 361  VGISGSSSALDLIKKKLQDSGTPVASSPISAPTVAQPDVNLLRDADATVKALQTENNKDK 420

Query: 952  PKDANADGNVSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPR 1011
            PKDA+ DGNVSDSSSDSEDVD+GPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPR
Sbjct: 421  PKDADGDGNVSDSSSDSEDVDNGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPR 480

Query: 1012 FKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTL 1071
            FKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAA+EGFKQLLD ASEDIDHTTSYQT 
Sbjct: 481  FKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAMEGFKQLLDRASEDIDHTTSYQTF 540

Query: 1072 KKKWGNDSRFEALDRKDRENLLNERVLCLKKAAVEKAQALWAASTTSFKSMLQEREDINI 1131
            KKKWGND RFEALDRKDRENLLNERVL LKKAA+EKAQALWAASTTSFKSMLQER DIN+
Sbjct: 541  KKKWGNDPRFEALDRKDRENLLNERVLYLKKAAIEKAQALWAASTTSFKSMLQERGDINV 600

Query: 1132 NSRWFRVKDSLREDPRYRSVKHEEREMLFNEYISELKAAVEEKQRESKARKEEQEKLKER 1191
            NSRWFRVKDSLR+DPRYRSVKHEEREMLFNEYISELKA  EEKQRESKARKEEQEKLKER
Sbjct: 601  NSRWFRVKDSLRDDPRYRSVKHEEREMLFNEYISELKAVEEEKQRESKARKEEQEKLKER 660

Query: 1192 EREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIKDPQASWTESKVKLEKDPQGR 1251
            EREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIKDPQASWTESKVKLEKDPQGR
Sbjct: 661  EREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIKDPQASWTESKVKLEKDPQGR 720

Query: 1252 ASNTDLDSSETEKLFREHVKMLQERCANEFRNLLSEAFTAEVVAQVSEDGKTVLNSWTMA 1311
            ASN DLDSS+TEKLFREHVKMLQERCANEFR LLSEAFTAEVVAQ+SEDGKTVLNSWTMA
Sbjct: 721  ASNPDLDSSDTEKLFREHVKMLQERCANEFRTLLSEAFTAEVVAQLSEDGKTVLNSWTMA 780

Query: 1312 KRILKPDPRYGKVPRKEREALWRRYADDTVRKQKLANDHKGEKYNDYKNRATTDAGKFPS 1371
            KRILKPDPRY KVPRKEREALWRRYADDT+RKQKLANDHKGEK+ND+K+RAT DAGKFPS
Sbjct: 781  KRILKPDPRYSKVPRKEREALWRRYADDTLRKQKLANDHKGEKHNDFKSRATIDAGKFPS 839

Query: 1372 KPRIHD 1376
            KPRIH+
Sbjct: 841  KPRIHE 839

BLAST of Chy1G021000 vs. NCBI nr
Match: XP_023547625.1 (pre-mRNA-processing protein 40C [Cucurbita pepo subsp. pepo] >XP_023547626.1 pre-mRNA-processing protein 40C [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1398 bits (3618), Expect = 0.0
Identity = 745/848 (87.85%), Positives = 787/848 (92.81%), Query Frame = 0

Query: 532  MTSASTVSQSVSLPAPPTSNSAANGSSIPNLIPSTSPVPPAPSFHIHQLPSLAPMVPGPP 591
            M+SASTVSQS+SLPAPPTSNSAANGSSIPNLIP+TSPVPPA SFHIHQL    PMVPGPP
Sbjct: 1    MSSASTVSQSMSLPAPPTSNSAANGSSIPNLIPATSPVPPAQSFHIHQLAPGTPMVPGPP 60

Query: 592  GMSPSMPLVSTGPAVLFPPTDSA--STIPGPNMHAIHNPIHPSARPQICGSYPSLTPVVS 651
            GMSPSMP       V+FPP+DS+  STIPGPNMHA  N I+ S RPQICGSYPSL PVVS
Sbjct: 61   GMSPSMP-------VMFPPSDSSASSTIPGPNMHAAPNSINTSVRPQICGSYPSLAPVVS 120

Query: 652  PPHAMWFQPPQLGVMPRPPFLPYSTSYHGPLPFPARGMPLPSVPLPDPQPPGVTPVPVAS 711
            PPHA+WFQPPQLG MPRPPFLPY  SYHGPLPFPARGMPLPSVPLPDPQPPGVTPV V+S
Sbjct: 121  PPHAIWFQPPQLGGMPRPPFLPYPASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVSS 180

Query: 712  AIAVPSGHGNQLIGNTLIQTDSNHPELDSQKHAQGVGHSENISLNKHSEDWTAHKTEAGI 771
            A AVPS HGN L GN+LIQTD NHPELD+QKHAQG+G SE+ISL+KHSE+WTAHKTEAGI
Sbjct: 181  ATAVPSSHGNHLTGNSLIQTDFNHPELDAQKHAQGMGQSESISLSKHSENWTAHKTEAGI 240

Query: 772  IYYYNALTGESTYEKPSGFRGEAENLMAQATSVSMSNLSGTDWVLVTMGDGKKYYYNNKT 831
            +YYYNALTGESTYEKPSGF+GE +NLM Q TSVSMSNLSGTDWVLVTMGDGKKYYYNNKT
Sbjct: 241  MYYYNALTGESTYEKPSGFKGEPDNLMVQPTSVSMSNLSGTDWVLVTMGDGKKYYYNNKT 300

Query: 832  KISSWQIPNEVSELRQQNDEKTKELSAPLPNNNASTDLGTSSSSINTPAINTGGREATPL 891
            KISSWQIPNEV+ELRQQNDEKTKE S PLPNNNA T+ G+S  S+NTPAINTGGREA PL
Sbjct: 301  KISSWQIPNEVTELRQQNDEKTKEHSGPLPNNNALTEPGSSPISMNTPAINTGGREAMPL 360

Query: 892  RTVGISGSSSALDLIKKKLQDSGTPVASSPISAPTVAQSDVNLPRDADGTVKALQTEN-K 951
            RTVG+SG SSALDLIKKKLQ+SGTPVASSPISAPT+AQSDVNLPRDAD  VKALQTEN K
Sbjct: 361  RTVGVSGPSSALDLIKKKLQESGTPVASSPISAPTIAQSDVNLPRDADAAVKALQTENSK 420

Query: 952  DKPKDANADGNVSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFD 1011
            DKPKDAN DGNVSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFD
Sbjct: 421  DKPKDANGDGNVSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFD 480

Query: 1012 PRFKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQ 1071
            PRFKAIPSYSARRSLFEH+VKTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQ
Sbjct: 481  PRFKAIPSYSARRSLFEHFVKTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQ 540

Query: 1072 TLKKKWGNDSRFEALDRKDRENLLNERVLCLKKAAVEKAQALWAASTTSFKSMLQEREDI 1131
            T KKKWGND RFEALDRKDRENLL+ERVLCLKKAAVEKAQALWAASTTSFKSMLQER DI
Sbjct: 541  TFKKKWGNDPRFEALDRKDRENLLSERVLCLKKAAVEKAQALWAASTTSFKSMLQERGDI 600

Query: 1132 NINSRWFRVKDSLREDPRYRSVKHEEREMLFNEYISELKAAVEEKQRESKARKEEQEKLK 1191
            N+NSRW RVKDSLR+DPRYRSVKHE+REMLFNEYISELKA  EEKQRESKA+KEEQEKLK
Sbjct: 601  NVNSRWLRVKDSLRDDPRYRSVKHEDREMLFNEYISELKAVEEEKQRESKAKKEEQEKLK 660

Query: 1192 EREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIKDPQASWTESKVKLEKDPQ 1251
            EREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIKDPQASW+ESKVKLEKDPQ
Sbjct: 661  EREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIKDPQASWSESKVKLEKDPQ 720

Query: 1252 GRASNTDLDSSETEKLFREHVKMLQERCANEFRNLLSEAFTAEVVAQVSEDGKTVLNSWT 1311
            GRASN DLDSS+TEKLFREHVKMLQERCANEFR LLSEAFTAEVV+QVSEDGKTVLNSWT
Sbjct: 721  GRASNPDLDSSDTEKLFREHVKMLQERCANEFRTLLSEAFTAEVVSQVSEDGKTVLNSWT 780

Query: 1312 MAKRILKPDPRYGKVPRKEREALWRRYADDTVRKQKLANDHKGEKYNDYKNRATTDAGKF 1371
            MAKR LKPDPRY K+PRKEREALWRRYADDT+RKQK ANDHK EK+++ K+R+T  AGK 
Sbjct: 781  MAKRTLKPDPRYSKLPRKEREALWRRYADDTLRKQKSANDHKVEKHSNSKSRSTNVAGKL 840

Query: 1372 PSKPRIHD 1376
            PSKPRIH+
Sbjct: 841  PSKPRIHE 841

BLAST of Chy1G021000 vs. TAIR 10
Match: AT3G19840.1 (pre-mRNA-processing protein 40C )

HSP 1 Score: 693.7 bits (1789), Expect = 2.9e-199
Identity = 427/831 (51.38%), Positives = 554/831 (66.67%), Query Frame = 0

Query: 531  TMTSAST--VSQSVSLPAPPTSNSAANGSSIPNLIPSTSPVPPAPSFHIHQLPSLAPMVP 590
            +M+ AST  VSQSV         + A  SS  N IP  SP+       +   P   P   
Sbjct: 48   SMSIASTGFVSQSVPYSVTAQWGTNAAASSNVNPIPQASPM-------LANAPFGRPGTL 107

Query: 591  GPPGMSPSMPLVSTGPAVLFPPTDSASTIPGPNMHAIHNPIHPSARPQICGSYPSLTPVV 650
             PPG+  S P         FP ++  ST P P M A    ++P   P +   YP    + 
Sbjct: 108  APPGLMTSPP--------AFPGSNPFSTTPRPGMSAGPAQMNPGIHPHM---YPPYHSLP 167

Query: 651  SPPHAMWFQPPQLGVMPRPPFLPYSTSYHGPLPFPARGMPLPSVPLPDPQPPGVTPV-PV 710
              P  MW QPP +G +PR PFL + T++ G  PFP RG+  P++P     P G +P+  V
Sbjct: 168  GTPQGMWLQPPSMGGIPRAPFLSHPTTFPGSYPFPVRGIS-PNLPYSGSHPLGASPMGSV 227

Query: 711  ASAIAVPSGHGNQLIGNTLIQTDSNHPELDSQKHAQGVGHSENISLNKHSEDWTAHKTEA 770
             +  A+P    +   G    +       +D +  +Q VG+          + WTAHK+EA
Sbjct: 228  GNVHALPGRQPDISPGRKTEELSG----IDDRAGSQLVGN--------RLDAWTAHKSEA 287

Query: 771  GIIYYYNALTGESTYEKPSGFRGEAENLMAQATSVSMSNLSGTDWVLVTMGDGKKYYYNN 830
            G++YYYN++TG+STYEKP GF GE + +  Q   VSM +L GTDW LV+  DGKKYYYNN
Sbjct: 288  GVLYYYNSVTGQSTYEKPPGFGGEPDKVPVQPIPVSMESLPGTDWALVSTNDGKKYYYNN 347

Query: 831  KTKISSWQIPNEVSELRQQNDEKTKELSAPLPNNNASTDLGTSSSSINTPAINTGGREAT 890
            KTK+SSWQIP EV +  ++ +E+  E  A +P+ +  T+ G+  +S++ PAI+ GGR+A 
Sbjct: 348  KTKVSSWQIPAEVKDFGKKLEERAMESVASVPSADL-TEKGSDLTSLSAPAISNGGRDAA 407

Query: 891  PLRTVGISGSSSALDLIKKKLQDSGTPVASSPISAPTVAQSDVNLPRDADGTVKALQTEN 950
             L+T      SSALDL+KKKL DSG PV+S+         S+ N  +  + T       +
Sbjct: 408  SLKTTNF--GSSALDLVKKKLHDSGMPVSST-------ITSEANSGKTTEVTPSGESGNS 467

Query: 951  KDKPKDANADGNVSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVF 1010
              K KDA   G +SDSSSDSED DSGP+ E+   QFKEMLKERG+APFSKW+KELPKI+F
Sbjct: 468  TGKVKDAPGAGALSDSSSDSEDEDSGPSKEECSKQFKEMLKERGIAPFSKWEKELPKIIF 527

Query: 1011 DPRFKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSY 1070
            DPRFKAIPS+S RRSLFE YVKTRAEEER+EKRAA KAAIEGF+QLLD AS DID  T Y
Sbjct: 528  DPRFKAIPSHSVRRSLFEQYVKTRAEEERREKRAAHKAAIEGFRQLLDDASTDIDQHTDY 587

Query: 1071 QTLKKKWGNDSRFEALDRKDRENLLNERVLCLKKAAVEKAQALWAASTTSFKSMLQERED 1130
            +  KKKWGND RFEA++RK+RE LLNERVL LK++A +KAQ + AA+ + FK+ML+ERE 
Sbjct: 588  RAFKKKWGNDLRFEAIERKEREGLLNERVLSLKRSAEQKAQEIRAAAASDFKTMLRERE- 647

Query: 1131 ININSRWFRVKDSLREDPRYRSVKHEEREMLFNEYISELKAAVEEKQRESKARKEEQEKL 1190
            I+INS W +VKDSLR +PRYRSV HE+RE+ + EYI+ELKAA      E KAR +E++KL
Sbjct: 648  ISINSHWSKVKDSLRNEPRYRSVAHEDREVFYYEYIAELKAAQRGDDHEMKAR-DEEDKL 707

Query: 1191 KEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIKDPQASWTESKVKLEKDP 1250
            +ERERE RKRKERE QE+ERVR K+R+KEA +S+QALLVE I+DP+ASWTESK  LE+DP
Sbjct: 708  RERERELRKRKEREVQEVERVRQKIRRKEASSSYQALLVEKIRDPEASWTESKPILERDP 767

Query: 1251 QGRASNTDLDSSETEKLFREHVKMLQERCANEFRNLLSEAFTAEVVAQVSEDGKTVLNSW 1310
            Q RASN DL+ ++ EKLFR+HVK L ERC ++F+ LL+EA ++E     +EDGKT LNSW
Sbjct: 768  QKRASNPDLEPADKEKLFRDHVKSLYERCVHDFKALLAEALSSEAATLQTEDGKTALNSW 827

Query: 1311 TMAKRILKPDPRYGKVPRKEREALWRRYADDTVRKQKLANDHKGEKYNDYK 1359
            + AK++LKPD RY K+PR++RE +WRRY +D  RKQ+  N ++ EK  DYK
Sbjct: 828  STAKQVLKPDIRYSKMPRQDREVVWRRYVEDISRKQRHEN-YQEEKQRDYK 834

BLAST of Chy1G021000 vs. TAIR 10
Match: AT3G20260.1 (Protein of unknown function (DUF1666) )

HSP 1 Score: 361.3 bits (926), Expect = 3.4e-99
Identity = 208/374 (55.61%), Positives = 264/374 (70.59%), Query Frame = 0

Query: 109 EAEEDEDDFIMEEVKRRLKDLRRNSFMVLIPEEEEEEIEGG---EEEEVGEGE--PEWRD 168
           E E+D+DDFI  EVKRRLK+LRRNSFMVLIPEEEEEE E     E+++ GE +   EWRD
Sbjct: 49  EIEDDDDDFITNEVKRRLKELRRNSFMVLIPEEEEEEEEESYLDEDDDDGEDKCSSEWRD 108

Query: 169 VEAEGRQWWGGFGAVYDDYCERMRFFDRKS----------IESGPASTSQRSASKKSASP 228
           V AEG QWWGGF AVY+ YCERM FFDR S          I   P++ S RSASKK +SP
Sbjct: 109 VVAEGLQWWGGFDAVYEKYCERMLFFDRLSSQQLKETGIGIAPSPSTPSPRSASKKLSSP 168

Query: 229 LRCLSLKRIEEPEDEMEDVDPSLTPIDSNHH-IEIAYVAHICLSWEALHCQYTQLNHLIS 288
            RCLSLK+ + PE+++E + P  T +D  +  +E AYVA +CL+WEALHCQYTQL+HLIS
Sbjct: 169 FRCLSLKKFDVPEEDIEHLQP--TEVDDPYQDLETAYVAQLCLTWEALHCQYTQLSHLIS 228

Query: 289 CQPQNSTTHYNLTAQLFQQFQVLLQRFIENEPFQQALRPTIYARTRRTFPKMLHVPNIQA 348
           CQP+  T  YN TAQLFQQF VLLQR+IENEPF+Q  R  +YAR R   PK+L  P IQ 
Sbjct: 229 CQPETPTC-YNHTAQLFQQFLVLLQRYIENEPFEQGSRSELYARARNAMPKLLQAPKIQG 288

Query: 349 SDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSTSASLSFRNHTQDAALHA 408
           SD   + E+++  ++LA DL+ +IE+SI TF+ FLKM+KK        F NH  +     
Sbjct: 289 SDKKEM-EKDTGFMVLADDLIKVIESSILTFNVFLKMDKKKPNGGIHLFGNHNNNHVNST 348

Query: 409 R----VRSSLDKKKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIIRRLLKMSRI 463
                V+SS+DKK+ K KE+ KK+KG ++K+ PQT+E +QLLF  +DIK+  R+L+MS+I
Sbjct: 349 TPLLLVQSSIDKKRVKAKELSKKTKGLRKKSWPQTWEGVQLLFAAIDIKLATRVLRMSKI 408

BLAST of Chy1G021000 vs. TAIR 10
Match: AT5G39785.1 (Protein of unknown function (DUF1666) )

HSP 1 Score: 117.1 bits (292), Expect = 1.1e-25
Identity = 128/441 (29.02%), Positives = 207/441 (46.94%), Query Frame = 0

Query: 68  DGNVSDSSLDSEDVESGPTDEQLIIQFKEEDGTDPKSIAEVEAEEDEDDF--------IM 127
           DG +SDS      ++ G        Q ++ D +   S +E E EED + F        ++
Sbjct: 162 DGFLSDSDFAETSLKKG--------QNRKSDNSGSGSDSEEEEEEDTNGFESLWEHQDLI 221

Query: 128 EEVKRRLKDLRRNSFMVLIPEEEEEEIEGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVY 187
           E++K  +K ++    +  I EEEEE+    +  ++ E    WR  E +  +     G V+
Sbjct: 222 EQLKMEMKKVKAIGGLTTILEEEEED---DDCPKIMEDLKPWRIEEEKKFKHVDTIGEVH 281

Query: 188 D---DYCERMRFFDRKSIESGPA-------------STSQRSASKKSASPLRCLSLKRIE 247
                Y ERMR  D  S +   A             ST   + S+ S S +  ++++  +
Sbjct: 282 KFHRSYRERMRKLDILSFQKSYALGLLQSKSPQQATSTLGSNPSQTSFSSVFSVNIRLWK 341

Query: 248 EPEDEMEDVDPSLTPIDSNHHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYN 307
             + E+E +   +  I     +E  YV  +CLSWE LH QY +   L+      S   YN
Sbjct: 342 AKKSEIEPMVQFVKEIQG--ELENVYVGQMCLSWEILHWQYEKAIELLESDVYGS-RRYN 401

Query: 308 LTAQLFQQFQVLLQRFIENEPFQQALRPTIYARTRRTFPKMLHVPNIQ----ASDPNGVQ 367
             A  FQQFQVLLQRF+ENEPF++  R   Y + R     +L +P I+        NG +
Sbjct: 402 EVAGEFQQFQVLLQRFLENEPFEEP-RVQHYIKRRCVLRNLLQIPVIREDGNKDKKNGRR 461

Query: 368 ---EQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSTSASLSFRNHTQ---------- 427
              E+ +D +I +  L+ I+E +I  F RF++ +K TS+      R  +Q          
Sbjct: 462 RDYEENNDGVIKSDQLVEIMEETIRLFWRFVRCDKLTSSIHDQKSRTKSQIEPDHEEDSE 521

Query: 428 DAALHARVRSSLDKKKTKLKEVRKKS-----KGWKQKTCPQTYEDMQLLFGVVDIKIIRR 463
           D  + A V+S L  K+ +L++V K       +  K K    T + +   F  VD+K++ R
Sbjct: 522 DLEMFAEVKSQLQNKEKRLRDVLKSERCIIRRFQKHKEEDSTEDQVLHFFSQVDMKLVTR 581

BLAST of Chy1G021000 vs. TAIR 10
Match: AT5G39785.2 (Protein of unknown function (DUF1666) )

HSP 1 Score: 110.5 bits (275), Expect = 1.1e-23
Identity = 127/442 (28.73%), Positives = 206/442 (46.61%), Query Frame = 0

Query: 68  DGNVSDSSLDSEDVESGPTDEQLIIQFKEEDGTDPKSIAEVEAEEDEDDF--------IM 127
           DG +SDS      ++ G        Q ++ D +   S +E E EED + F        ++
Sbjct: 162 DGFLSDSDFAETSLKKG--------QNRKSDNSGSGSDSEEEEEEDTNGFESLWEHQDLI 221

Query: 128 EEVKRRLKDLRRNSFMVLIPEEEEEEIEGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVY 187
           E++K  +K ++    +  I EEEEE+    +  ++ E    WR  E +  +     G V+
Sbjct: 222 EQLKMEMKKVKAIGGLTTILEEEEED---DDCPKIMEDLKPWRIEEEKKFKHVDTIGEVH 281

Query: 188 D---DYCERMRFFDRKSIESGPA-------------STSQRSASKKSASPLRCLSLKRIE 247
                Y ERMR  D  S +   A             ST   + S+ S S +  ++++  +
Sbjct: 282 KFHRSYRERMRKLDILSFQKSYALGLLQSKSPQQATSTLGSNPSQTSFSSVFSVNIRLWK 341

Query: 248 EPEDEMEDVDPSLTPIDSNHHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYN 307
             + E+E +   +  I     +E  YV  +CLSWE LH QY +   L+      S   YN
Sbjct: 342 AKKSEIEPMVQFVKEIQG--ELENVYVGQMCLSWEILHWQYEKAIELLESDVYGS-RRYN 401

Query: 308 LTAQLFQQFQVLLQRFIENEPFQQALRPTIYARTRRTFPKMLHVPNIQ----ASDPNGVQ 367
             A  FQQFQVLLQRF+ENEPF++  R   Y + R     +L +P I+        NG +
Sbjct: 402 EVAGEFQQFQVLLQRFLENEPFEEP-RVQHYIKRRCVLRNLLQIPVIREDGNKDKKNGRR 461

Query: 368 ---EQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSTSASLSFRNHTQ---------- 427
              E+ +D +I +  L+ I+E +I  F RF++ +K TS+      R  +Q          
Sbjct: 462 RDYEENNDGVIKSDQLVEIMEETIRLFWRFVRCDKLTSSIHDQKSRTKSQIEPDHEEDSE 521

Query: 428 DAALHARVRSSLDK-KKTKLKEVRKKS-----KGWKQKTCPQTYEDMQLLFGVVDIKIIR 463
           D  + A V+S L    + +L++V K       +  K K    T + +   F  VD+K++ 
Sbjct: 522 DLEMFAEVKSQLQNVSEKRLRDVLKSERCIIRRFQKHKEEDSTEDQVLHFFSQVDMKLVT 581

BLAST of Chy1G021000 vs. TAIR 10
Match: AT1G44910.1 (pre-mRNA-processing protein 40A )

HSP 1 Score: 107.1 bits (266), Expect = 1.2e-22
Identity = 193/783 (24.65%), Positives = 336/783 (42.91%), Query Frame = 0

Query: 585  PMVPG-------PPGMSPSMPLVSTGPAVLFPPTDSASTIPGPNMHAIH--NPIHPSARP 644
            PMVPG       P    P  P     P V   P   +  I    +  +    P+H ++  
Sbjct: 15   PMVPGQQGQHFVPAASQPFHPYGHVPPNVQSQPPQYSQPIQQQQLFPVRPGQPVHITSSS 74

Query: 645  Q-ICGSYPSLTPVVSPPHAMWFQPPQLGVMPRPPFLPYSTSYHGPLPFPARGMPL---PS 704
            Q +   Y     +++         PQ    P   F      +  P  F     P     S
Sbjct: 75   QAVSVPYIQTNKILTSGSTQ----PQPNAPPMTGFATSGPPFSSPYTFVPSSYPQQQPTS 134

Query: 705  VPLPDPQPPGVTPVPVASAIAVPSGHGNQLIGNTLIQTDSNHPELDSQKHAQGVGHSENI 764
            +  P+ Q       P A+   VP      L+ + + QT    P   S         S   
Sbjct: 135  LVQPNSQMHVAGVPPAANTWPVPVNQSTSLV-SPVQQTGQQTPVAVSTDPGNLTPQS--- 194

Query: 765  SLNKHSEDWTAHKTEAGIIYYYNALTGESTYEKPSGFRGEAENLMAQATSVSMSNLSGTD 824
                 + DW  H +  G  YYYN  T +S +EKP       E   A A++V         
Sbjct: 195  -----ASDWQEHTSADGRKYYYNKRTKQSNWEKPLELMTPLER--ADASTV--------- 254

Query: 825  WVLVTMGDGKKYYYNNKTKISSWQIPNEVSELRQQ---NDEKT---KELSAPLPNNNA-S 884
            W   T  +GKKYYYN  TK S W IP ++   R+Q     EKT   +  S PL ++ A S
Sbjct: 255  WKEFTTPEGKKYYYNKVTKESKWTIPEDLKLAREQAQLASEKTSLSEAGSTPLSHHAASS 314

Query: 885  TDLGTSSSSINTPAINTG--GREATPLRTVGISGSSSALDLIKKKLQDSGTPVASSPISA 944
            +DL  S+ +   P+ ++   G  ++P++  G++   +    +      SG   A S   A
Sbjct: 315  SDLAVSTVTSVVPSTSSALTGHSSSPIQ-AGLAVPVTRPPSVAPVTPTSG---AISDTEA 374

Query: 945  PTVAQSDVNLPRDAD-----GTVKALQTENKDKPKDANADGNVSDSSSDSEDVDSGPTNE 1004
             T+   +++  R AD      T +  + ENK+   +  A+ + +   ++ E+     T +
Sbjct: 375  TTIKGDNLS-SRGADDSNDGATAQNNEAENKEMSVNGKANLSPAGDKANVEEPMVYATKQ 434

Query: 1005 QLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKAIPSYSARRSLFEHYVKTRAEEERK 1064
            +    FK +L+   V     W++ L +IV D R+ A+ +   R+  F  Y+  R + E +
Sbjct: 435  EAKAAFKSLLESVNVHSDWTWEQTLKEIVHDKRYGALRTLGERKQAFNEYLGQRKKVEAE 494

Query: 1065 EKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTLKKKWGNDSRFEALDR-KDRENLLNERV 1124
            E+R  QK A E F ++L+   E++  +  +      + ND RF+A+DR +DRE+L +  +
Sbjct: 495  ERRRRQKKAREEFVKMLEEC-EELSSSLKWSKAMSLFENDQRFKAVDRPRDREDLFDNYI 554

Query: 1125 LCLKKAAVEKAQALWAASTTSFKSMLQEREDININSRWFRVKDSLREDPRYRSVKHEERE 1184
            + L++   EKA          ++  L+  + I   ++W +++D L +D R   ++  +R 
Sbjct: 555  VELERKEREKAAEEHRQYMADYRKFLETCDYIKAGTQWRKIQDRLEDDDRCSCLEKIDRL 614

Query: 1185 MLFNEYISELKAAVEEKQRESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKE 1244
            + F EYI +L              ++E+E+LK  E+E  +R ER+ ++  R  L    +E
Sbjct: 615  IGFEEYILDL--------------EKEEEELKRVEKEHVRRAERKNRDAFRTLL----EE 674

Query: 1245 AVASFQALLVESIKDPQASWTESKVKLEKDPQGRASNTDLDSSETEKLFREHVKMLQERC 1304
             VA+        I   +  W +  ++L+  PQ +A  ++   S  + LF +  + L E+ 
Sbjct: 675  HVAA-------GILTAKTYWLDYCIELKDLPQYQAVASNTSGSTPKDLFEDVTEEL-EKQ 734

Query: 1305 ANEFRNLLSEAFTAEVVAQVS----EDGKTVLNSWTMAKRI------LKPDPRYGKVPRK 1330
             +E ++ + +A  +  ++ VS    ED K+ ++     ++I      L  D   G+V  K
Sbjct: 735  YHEDKSYVKDAMKSRKISMVSSWLFEDFKSAISEDLSTQQISDINLKLIYDDLVGRVKEK 741

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
Q9LT25 | 4.1e-198 | 51.38 | Pre-mRNA-processing protein 40C OS=Arabidopsis thaliana OX=3702 GN=PRP40C PE=1 S...
Q8CGF7 | 3.4e-43 | 26.79 | Transcription elongation regulator 1 OS=Mus musculus OX=10090 GN=Tcerg1 PE=1 SV=...
O14776 | 8.4e-42 | 27.28 | Transcription elongation regulator 1 OS=Homo sapiens OX=9606 GN=TCERG1 PE=1 SV=2
B6EUA9 | 1.6e-21 | 24.65 | Pre-mRNA-processing protein 40A OS=Arabidopsis thaliana OX=3702 GN=PRP40A PE=1 S...
Q6NWY9 | 2.2e-18 | 24.09 | Pre-mRNA-processing factor 40 homolog B OS=Homo sapiens OX=9606 GN=PRPF40B PE=1 ...
Match Name | E-value | Identity | Description
A0A0A0K978 | 0.0e+00 | 98.58 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G447710 PE=4 SV=1
A0A5A7V0S2 | 0.0e+00 | 96.93 | Pre-mRNA-processing protein 40C OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_...
A0A1S3CHX0 | 0.0e+00 | 96.93 | pre-mRNA-processing protein 40C OS=Cucumis melo OX=3656 GN=LOC103500614 PE=4 SV=...
A0A5D3BYD0 | 0.0e+00 | 93.26 | Pre-mRNA-processing protein 40C OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_...
A0A6J1GNF1 | 0.0e+00 | 87.74 | pre-mRNA-processing protein 40C OS=Cucurbita moschata OX=3662 GN=LOC111456014 PE...
Match Name | E-value | Identity | Description
XP_011659583.1 | 0.0 | 98.58 | pre-mRNA-processing protein 40C [Cucumis sativus] >KGN45409.1 hypothetical prote...
XP_008462197.1 | 0.0 | 96.93 | PREDICTED: pre-mRNA-processing protein 40C [Cucumis melo] >KAA0059331.1 pre-mRNA...
TYK03994.1 | 0.0 | 93.26 | pre-mRNA-processing protein 40C [Cucumis melo var. makuwa]
XP_038900162.1 | 0.0 | 91.37 | pre-mRNA-processing protein 40C [Benincasa hispida]
XP_023547625.1 | 0.0 | 87.85 | pre-mRNA-processing protein 40C [Cucurbita pepo subsp. pepo] >XP_023547626.1 pre...
Match Name | E-value | Identity | Description
AT3G19840.1 | 2.9e-199 | 51.38 | pre-mRNA-processing protein 40C
AT3G20260.1 | 3.4e-99 | 55.61 | Protein of unknown function (DUF1666)
AT5G39785.1 | 1.1e-25 | 29.02 | Protein of unknown function (DUF1666)
AT5G39785.2 | 1.1e-23 | 28.73 | Protein of unknown function (DUF1666)
AT1G44910.1 | 1.2e-22 | 24.65 | pre-mRNA-processing protein 40A
InterPro
Analysis Name: InterPro Annotations of Cucumber (hystrix) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 398..418
None | No IPR available | COILS | Coil | Coil | coord: 465..487
None | No IPR available | COILS | Coil | Coil | coord: 1163..1218
None | No IPR available | GENE3D | 2.20.70.10 | | coord: 797..856; e-value: 1.7E-12; score: 48.7
None | No IPR available | GENE3D | 2.20.70.10 | | coord: 754..796; e-value: 1.5E-11; score: 45.5
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 941..956
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 540..563
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 911..928
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 843..892
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 57..87
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 540..567
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 466..507
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1236..1259
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 957..975
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 852..882
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1173..1199
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1237..1259
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1343..1376
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 466..489
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 911..975
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1343..1360
None | No IPR available | PANTHER | PTHR15377:SF3 | TRANSCRIPTION ELONGATION REGULATOR HOMOLOG | coord: 532..1371
IPR001202 | WW domain | SMART | SM00456 | ww_5 | coord: 755..787; e-value: 3.1E-6; score: 36.7
IPR001202 | WW domain | SMART | SM00456 | ww_5 | coord: 807..839; e-value: 0.0011; score: 28.2
IPR001202 | WW domain | PFAM | PF00397 | WW | coord: 758..785; e-value: 2.7E-7; score: 30.6
IPR001202 | WW domain | PROSITE | PS01159 | WW_DOMAIN_1 | coord: 812..837
IPR001202 | WW domain | PROSITE | PS01159 | WW_DOMAIN_1 | coord: 760..785
IPR001202 | WW domain | PROSITE | PS50020 | WW_DOMAIN_2 | coord: 812..839; score: 10.895901
IPR001202 | WW domain | PROSITE | PS50020 | WW_DOMAIN_2 | coord: 754..787; score: 9.773001
IPR001202 | WW domain | CDD | cd00201 | WW | coord: 810..839; e-value: 5.48257E-6; score: 42.1298
IPR001202 | WW domain | CDD | cd00201 | WW | coord: 758..785; e-value: 3.55716E-7; score: 45.5966
IPR002713 | FF domain | SMART | SM00441 | FF_2 | coord: 1111..1165; e-value: 0.52; score: 19.4
IPR002713 | FF domain | SMART | SM00441 | FF_2 | coord: 976..1030; e-value: 6.9E-9; score: 45.5
IPR002713 | FF domain | SMART | SM00441 | FF_2 | coord: 1043..1098; e-value: 6.3; score: 11.0
IPR002713 | FF domain | SMART | SM00441 | FF_2 | coord: 1214..1271; e-value: 0.0057; score: 25.9
IPR002713 | FF domain | PFAM | PF01846 | FF | coord: 1046..1094; e-value: 1.3E-7; score: 31.6
IPR002713 | FF domain | PFAM | PF01846 | FF | coord: 1117..1162; e-value: 6.3E-10; score: 39.1
IPR002713 | FF domain | PFAM | PF01846 | FF | coord: 982..1027; e-value: 1.1E-11; score: 44.7
IPR002713 | FF domain | PROSITE | PS51676 | FF | coord: 1214..1271; score: 8.946642
IPR002713 | FF domain | PROSITE | PS51676 | FF | coord: 975..1030; score: 10.244658
IPR002713 | FF domain | PROSITE | PS51676 | FF | coord: 1108..1165; score: 8.613124
IPR002713 | FF domain | PROSITE | PS51676 | FF | coord: 1043..1098; score: 9.519031
IPR036517 | FF domain superfamily | GENE3D | 1.10.10.440 | FF domain | coord: 961..1032; e-value: 2.8E-18; score: 67.6
IPR036517 | FF domain superfamily | GENE3D | 1.10.10.440 | FF domain | coord: 1107..1177; e-value: 9.7E-13; score: 49.9
IPR036517 | FF domain superfamily | GENE3D | 1.10.10.440 | FF domain | coord: 1033..1106; e-value: 9.0E-13; score: 50.2
IPR036517 | FF domain superfamily | GENE3D | 1.10.10.440 | FF domain | coord: 1272..1338; e-value: 2.8E-13; score: 51.7
IPR036517 | FF domain superfamily | GENE3D | 1.10.10.440 | FF domain | coord: 1211..1271; e-value: 1.2E-18; score: 68.9
IPR036517 | FF domain superfamily | SUPERFAMILY | 81698 | FF domain | coord: 1268..1344
IPR036517 | FF domain superfamily | SUPERFAMILY | 81698 | FF domain | coord: 1207..1276
IPR036517 | FF domain superfamily | SUPERFAMILY | 81698 | FF domain | coord: 976..1036
IPR036517 | FF domain superfamily | SUPERFAMILY | 81698 | FF domain | coord: 1107..1169
IPR036517 | FF domain superfamily | SUPERFAMILY | 81698 | FF domain | coord: 1035..1101
IPR012870 | Protein of unknown function DUF1666 | PFAM | PF07891 | DUF1666 | coord: 244..466; e-value: 1.4E-70; score: 237.7
IPR045148 | Transcription elongation regulator 1-like | PANTHER | PTHR15377 | TRANSCRIPTION ELONGATION REGULATOR 1 | coord: 532..1371
IPR036020 | WW domain superfamily | SUPERFAMILY | 51045 | WW domain | coord: 759..791
IPR036020 | WW domain superfamily | SUPERFAMILY | 51045 | WW domain | coord: 811..843
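
Several of the MOBIDB_LITE disorder predictions listed above overlap or nest (for example 540..563 inside 540..567, and 911..928 inside 911..975). As a minimal sketch, assuming the coordinates are 1-based inclusive residue positions, the intervals can be merged into non-overlapping regions as follows. The helper `merge_intervals` is a hypothetical name introduced here and is not part of InterProScan or of this database.

```python
def merge_intervals(intervals):
    """Merge overlapping inclusive (start, end) intervals into sorted regions."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend its end if needed.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(region) for region in merged]


# MOBIDB_LITE coordinates copied from the InterPro table above.
disorder = [
    (941, 956), (540, 563), (911, 928), (843, 892), (57, 87), (540, 567),
    (466, 507), (1236, 1259), (957, 975), (852, 882), (1173, 1199),
    (1237, 1259), (1343, 1376), (466, 489), (911, 975), (1343, 1360),
]

regions = merge_intervals(disorder)
total = sum(end - start + 1 for start, end in regions)
for start, end in regions:
    print(f"{start}..{end}")
print("residues in predicted disorder:", total)
```

Merging before counting avoids counting the same residue twice when several predictions cover one stretch of sequence.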

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Chy1G021000.1 | Chy1G021000.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006355 | regulation of transcription, DNA-templated
cellular_component | GO:0005634 | nucleus
molecular_function | GO:0070063 | RNA polymerase binding
molecular_function | GO:0003712 | transcription coregulator activity
molecular_function | GO:0005515 | protein binding