CaUC10G194330 (gene) Watermelon (USVL246-FR2) v1

Overview
Name: CaUC10G194330
Type: gene
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: regulation of nuclear pre-mRNA domain-containing protein 1B-like
Location: Ciama_Chr10: 29346097 .. 29366241 (-)
RNA-Seq Expression: CaUC10G194330
Synteny: CaUC10G194330
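The Location field above packs the chromosome, start, end, and strand into one string. A minimal Python sketch for splitting it apart follows; the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text):
    """Split 'SEQID: START .. END (STRAND)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("Ciama_Chr10: 29346097 .. 29366241 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 20,145 bp on the minus strand of Ciama_Chr10.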
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAATCTGTTTATCTTTGATATCAAATTAGCTTTATCAAGATGGTGCATTTCCCACCGGAAAAAGGCCAAGCAGATTGTTGAAACATGGGATAAATTGTTCAACTCATCTCAGAAAGAGCAGCGTGTTTCATTTCTGTATTTGGCGAATGACATTTTGCAGAACAGCAGGCGCAAGGGGAGTGAATTTGTGAATGAATTCTGGAAAGTTCTTCCTGGTGCTCTCAAGTATGTCTATGATCATGGTGATGAAGGTGGAAAGAAAGCAGTTGCCAGACTTGTATGTCTTCTTTTTACTTGTGTAGAAAAAAATGATGGCAATAAATCTTCTTATGTTTTTATTTCTGTAAGCATGTCAGTTAACTCTGGGAACATGTTGCTCAAAAGTAAAACATTTTGCATTTCTGGGTATCTGACTTAAAAATTCTTCTCCTCCTTTACCTCTCCAAAAAAGAAAAAAAAAAGGTGAAGAAGAAAAACGAACAAGGGGGGATGGATTCCATTCATCCAGCTTGACAAGTTTGTATCATAAATATTTCTATTAAGGCTTCCCAAAGATTGGCATTGGTAATGGTAATAAAACTCTCATTTCTACTTAAGCAAGTCTTTTGATTATGGTGTTCCATGAGTACTGTGGGTGCTCAATTGACGATAATTCCTAAAGCTTGTATCTTAGTTAGGAATGATCTAGTAAATTCTAGATATGTTGGTTTAAAATTTCAGTTTGAAGCATAATATGTGGCCGCTTTTCCCTCCCTTACTGTTTGATTTGTTATACTAAAAGTGTTTCTATTAATGACAAATTATTATAAAAGAAGACAAAATATTAAACACTATGAAATGATCCTCGAAGTTGGCCACCAAAAGGAGTTGTCGGAGGTGGCCGTCGTCGAAATTGGGTGTCGTAGTTGGTCTTTGCTGGGGTTGGCTGCCAAAGTGATATCGAGTTAGCCGTCAGAGTTGGTCGCATAGAGGGTCGTCAGACGTGATCGCAAAGTTGGCCGCTGGAGGTAGTCGTTGTCGAGTTGGAGGTGGTTATCTCCGAAATTGGGCATCGGAGGTAGCCTTTGCCAGAGTTGTCCATTGCATTAGAGTCGTCAGAGTTGGTCGTAAGTGGGTGGAGTCAATGGAAGCATTGGAGGGAGGGAGAGATTATTTTAACCATTGAGTTGTAAATAGTTTTGTTGTAGGAAAGGGGGTTTAAAATACCCTTACTCTCTATTACTTTAACACGCCCCAAACGCTTGGGTGAGCTAGGCACTCAACCCACTCCAAGTTGGAGCCCCACATACCCTTTCAATGTTCTTCTATTTATTTCTCTATTTTAGACTTTGCTATTTAAATTTAAATGTTGTTTATCAAATTGGAATTTATGATACATGATGATTTTGATATGCGTTCTGTTTCACAAGTTAACAACCTTGTCCAAGACAACCCGAAAATAACAACACCCAAAACCTCTGACATACCAAGACGAACCACGCATTGCAGGACAATAATTATAGACTAGTGAAACACCAACTCATGGTCTTATATTCTTGTCAATGTAAGACCGAAGAAAAATTTGTGTTACTTCATGTCTTCCTTATACTGGTGGATGTTCTAAGTAATTGTTGATCATGTGACATTGGTTGATATTGGAAAATTTAGTTGGACTAGGATATCCTAATAAGCCTTACCCTTAGGGAAGATACTATACACTAGTTGGAAATTGAGTAGTGTTATCGTATAGTTGTCTTATTAGGTCTTTGGGAAGATTTTATATTTACAAGATTCTTAAGAGATCTTATTTTTAATTCTTAAGTTAATTTTAGGAGGTGATCAAGAAGTTTGTGGACCTTGGGGGATTTTTTTTGAAACCATTTGAATCGACATGTATTAGATAATTAAATTTACCTCTTGTCTTGGTCCAATTATTCTACAAAACCACATGGAGAATCGAGAAAGTTAATTAGCTCAAACGAAAAACACTCGAGCCGAAGAAGGAATTATATATTATTG
AGGGGTTTTAAATTTCAATATTAACAAAAACACTCGAGCCAAAGAAGGAATTATATATTATTGAGGGTTTTTAAATTTCAATATCAACAAAAACACGCGAGCCGAAGAAGGAATTATACTGATCATTTTTGTGTTGCCGGTGGGTTCTTTTGCCGAAGAAGGAATTATATATTATTGAGGGTTTTTAAATTTCAATATTAACAAAAATTTCGACCTTGGGGGATTAAGCATTGGATAAGTAATTTAAGGGTGCAAAACAAGGTCTTGTAGGCTAAATGGTTGTTGTGTTTCCATCATGAATCTGACACCTTATGGCACAAAGTCAATGTAAGCAAGTATGGCCCTCACCAGTATAAGTGGACCTTGGGCAGGTTTGAAGGTTCTTAAAGAAACTCTTGGAAATATATTTCTATTGAGCTCTCTTACCTTTGTCACTTTGTTGATGAGAGGGATATCTACTTTTGAGAGGATAGACTCATTTGTTCTTTTCAAACTCCATCATTCAAAAGGTCACCCGAGTTTCCTTCATGTAAATTTTCCTTAATTAACTCTTATCAAATTCCCTAAGTTCTCCTCCCTCCTCCTTGATTGAAGCTTGTGGAAAAGATCTTTTTCTTATGGCCGTTGGTTAAAAAGAATGGTTTGAGTCTTCGTTGTCGTGTTTAGTTTCTAATTTTTTGGTTGCTAATGAAGAGTTTGCTGCTTACAAATCTGGCTTCGGCTTTCTTTTCTTGTTTTGGAATGATACTTGGTTGAGTCTTCCTCGTGTATTTGGGTTTGCTCATTTGACAAGTGTTTTGGTTGCATCTTTTTGGTTTGGAGATTCTATGCCTTGGGGTTTTAGAAGGCTTAAAGATGCATATTTGTGTATCTTCAGCAGCTACTTCATGTTTAATCAAGTTCAAAGAGGGGGGTTCTTCCTATTTTAAGTTGTTGGTAGCTGGAGTCGTCTGGGAAGTTGCCTTCTATCTAAGATATTTGTATTTTCCCTTTTCAAGTTATACTTTTGATTTGGGGGGTATTTTTCTCTTACGTGCAATAACTGGGATGAGCTGCTCACATTTTGTTGTATTCCATCTATTCTTGAATTTTCTTTTTCTATGTATTTTGAGCCTTAGTCTCTTTTCATTATATCAATGAAAAGTTTTGTTTCCTTTAAAAAAAAAAAAACTTTTGAGAGAACAGACTCATCTGTTCTTTGTTTTCCCATTTATATTAGTTATCCTCTACGAGGAACCACTCAGAGGCTATTATCCTTTCTCACTTTGGGATCTGTCTTTCGCCTTTGTTGGGGTTTTGTCATCTGTTATCCAATAGGGTAACAACAATTATTTTTGGCTCTTTTGTCTTCAATTAGAAAGTTTTGCTTTATTCGTGTGAGAAGACATGTTTGCCTTAAAGACTTTTCTTATAGTTCCTTCTTTCACCATATGTTATGTCCATACCCTTCTAGCGGTCATGTCCTTCCTTACTATAGAAAGTGAAAAATCTAACGAACGTTAATTTTTTGTGTGGCAAGTCATGCATGGAAGAGTAAAACCTTGGATTAGATTTTGGGGAAGATGTCCCTATGTAGCCTAAGAGGTTAAAGGAGATAATTTTCCAAAAATCAAATAAAAACTTTTTTATTTATCAGAAGTATTTAATACATGAGGATGCTTATAGTCTTGTTTCCTTCGTTCACTATAGCAATGAATAGTCTTGTTTCCGTTTAAAAAAAAAAAAACAGGAGAATGCTTATAGTGCTATTGTTCTTTCATGGTCAATCTTTGATTTTGTATTGGGTTGTGGATGCTTATAGTGCTATTGTTGTTTTTTTTCTTTTATTTTTGTTTCCTTGTAATTTTGAGCGTTAGTCTTTTTTCATCATATCAATGAAAAGTTTTGTTTCTTTTCAAAGAAAAAAACCCTACAACTATGGTATTTCTAATTCAAATCCCTCTTTGTTACCGTTTTTGTTGCTTACTATTTCCTACTCATCATCTTTTTTCCTCTTATTT
CCTACCTAGATTAAACTGGTCTGCACCCTAAACACAAACTATTATATCCCGGACTAAAATATTATGCTTCTCAAACACAAAGTATTATAACCTACATACTATAATAACCACCTTCACGTCCCATCCCCTTAGGTCTTATTTTACTTGATCAAAGCCTATTTTATCCATTTGACTTCTTTTCTCCGTGAGCTTTATTTTTGTTTGTCCTTGTATTTTTTTCATTTTTTTCCAATGAAAATTTGATTTCATTAAAAAAAAAAAAACCACCTGCATTGAAAAATGTTTTCGGGAAGGATGCCGATGAGATTTTCAAAGAGGATGCTTGAATGGATGATGTTTGTGTTAGTAGTTTTACTTTAGACATTTGATCCAAGCTTGAGAGTTGATGAAGAGTCTTTTGGAGATGGTTTAAATCTTCTTATCCATAATGAGCAATCTGGTTTGGAAGATAGAGCTTGGGTTATCTAAGGGTCGTCCCCATTTAGCAGATTTCTTTGTTTTCCCTCTATTGCATGGGATTAAGATGTTTTTGCATTATTTTGAAGCATGCAGCATCACTCTCTAAGAAATAAAGCCTAAGAGTTTGTGTTAGTTATGGAGGAGCTTTCTTCATGGATTAAAATGTTGCTCTTCTTTTGCGTTGGTGGTGTTTTGAGTTAGGCCTTTTCATTTGCATTTTGTTTTAGGATGATTTTTGCGACTTTTCAGGCTGTTTTTAAAGGAATACATGGTTCATGCCTTAGGTTTGGGGAGCCTTTCTTTACCTGTTTGGTTGGTTGTATTTTTTTTTCTTTTTTAAATCTTCTATAGCCAATTGGGAGTTTGTTTCATTGAACTTTTGTGTTCTTTTCATCTTTAAATGAAAAGTAGTTTCTTGTTCCAGAAACACAATGTCTCGCTACAAATGTGTTGGTTGCATAGTCCACGTGCATTATTTCAATTTGCTTTTTTAAATACTGGTTGCATATAACTGTGACTTTGACGTACAAATTTCGATTTCTTTTTCATGAAAAAGCATGTTATATTTTGATAGACATAGAACTAATGAAAGTTAATACACTTTCTTTTTGGGTTACAGATCACTGTAGATTGACATTTCCTTCTAAATCATCCTCCTTTCGTGCATTTTTGTTTTTGTTTTTGTTTTGTCTTGTTTTGTTTTGCTTGAAATCCACAAGCCCTGTAATGTTGATTTAAATTGTAATAGTAAGATTGACCAGTTCCTCCCTTCCCCTCTTCTTATGCTTTTATCTCGAGGGTTATTTTTTAAGTTGGGATTTCATTTCTCACTCAATTTTTTCTCCCTTGTTTCCCTTCATGGGAGTTGGTTTGTTAAGGTCAACATTTGGGAAGAAAGGAAGGTTTTTGGTTCTCGGGGTCAGAGTCTTAAGGATGAAATGCTTGGTAAAATTCCACCTCCAACTCAATTACCCAGCAGCAATGGGAAGAGTTCAAATCCTATAAAGATAGTGAAGAGGGATGCCCACTCTGTGAGAATTGTAAGAGCTTAGCTGTGGTAGCAGCGAACTATTGCAATTTCCATTTTATTTCTTGTGTTATGACTCCTATATCTTTATTGCTTAACTCTAGAAACTGGCTGTTGGAGGTGTTCCAGAAAAGATTCTTACTGCATTTCAATCTGTACTTGATGAACATCTGAATGAAGATGCTGCTCTGAACAACTGTAGTTCTGCTACTCATCATCTGAATCAAATTGAGGAAGACTTAAATGCTTCCTTAGCTCAAGGTATAGCCTCATTTACTATAGTCATCTCGTGCAATTTAAGTTTAAGATGTTGTTGTGAAGTAAAGCTCTGCATTATAAGGATTGTCTTTTGTTAGAAGTGCTAATCATGTAATATAGACTTTTGATTGAAGTTTCAACATCCATGGAGAATCTAATCTATTAGCTAGATCATGTGGATTCTCATCGAGGATGTTATTATGACAGGGACCTTATATACTTCAGATAATCAAGTATTTATGTTTGTAAACGATGT
GCTATTAAAGAATATTTATGATACATTGAATTTATTCTTTTTCATTAATCAATGAGAATTTTCCATTTTTTTCTTCTTAAAAAAAAGATCTGCTTATTAAAGTGACTTGAAAGTTTCTCCTAGTTCATTGGCTATGAGAAAATTTTGGTGATGGGCCTTTCTTTGTTATTTATTTATTTTTTATTCTTTTCCTTGTACTGCAATCGACAGTGTTGGACTTTGCTGCGTAGAATATCGGGTCAAATGGAATTGAGAATTTGTACCAATACATAGATTTGTGTACTCAATATCAGTGTTTTAAGATGCATCCTGGGCACTAGGTAACTTGAGGGTGCCCAACTTTTGTAATGCCTAGGTAGGTAAACTCTCTAAGGTATTCTTATAAGTTTATATTTTAAACAAAGCTGTTCCTATAACCGTTTCCTTCTTTTTCTTTTTTTTTTTTTTTTGAAAAGGAAACGGTCTCTTTCATTGATGAAATGAAATGAGACAAGGCTTATTTTACAATTGTTCCATTGACCATACAAAGATGATATTAAAGCTTATCACAATAGGAACGGAAAAAAAAAGGATCAGAGGGAGCACCCGGACATCTCAACTAGGTTGACACCCCCTTAGCATCCACATCACTTCCTTAACATTATAGGACTAAAACCGACCACCGAAAGTAAGAAGTACTAAAAGTACAAATTCAGCTTACAATAAGAAACAATTAAAGGGGAAATTCTAGTAGTGCGGGTTAATGAAAGCTTTCCAGTTGAGATGGAGATCTGACAGTGAGTAGTCTTGAAAAGCCTTTGATAAGGAACACCAAGATGATGCATTTAGTCTAGCCAACTCAAAACGATCGATCCAAGGGGAGGCTTTGTCATGGAAAACCCTCTGATTTCTTTCAAACCAAATTTCTGAGAGAATGGCTTTGACTGCATTGCACCGTAGAAGTTTTGACCCCTTTTTCAAAGCTGGACCAGCCAATAATTGCTGCACATTGTTGCTGAAGTGACCATCGAAAACCCAACAAATTTGGAAGATATTAAATAGCCGAAACCAACAATTTACTGTGTAACTACAGCAGAAAAAGAAATGCTGCAGGTCTTCTTGATTAGTTAAACATAAGACGCACATGTGGGGTGATAAAGAATGGGAGGGGAGCCTTTTTTGAAGAATGGCAGCACAATTTAATGATCCAAAAATCATTATCCACACTGTGATATTGATTCTTCGGGGGCTTTTGGTGTGCCAAAGGGAATTTTCCAGCTCTTTCTCGATTGGGGAAGCCTTTGAAAGATGCTTAGTTAGAGATTTAACTGAAAAAATCCCATTAGGTTCTAATGACCAAGATTTTCTATCCGGCCTTTCTACCACTGATTTATTAACTATTGCTGCTTAGAGTTGCTGGAAATCAGAAATTTCTTCATCTTTTAACAACCTTCTGAAAGAGATATCCCATAAGGAGGTGATGGGATCCCAAAAATCTGACACTGAACCATTAGGTTTGGCTGCTATCTTGAAAAGATTTGGGAATTTAGAGCTTAGGGGTAATTTATCAATCCATGGGTCCTGCCAAAATGATACTCTGCTTCCATTTCCAACTTTAAATTCAGCCAAAACCTCCACTTTTAGCCACTGTTTAGAGATGCTTGTCCAAGGACTTCTCAAGCTTCCATTTCCTTTGCCATTTGTATGCCAATCGAAGCAGCCTTTTCCATGAATACTCTTGATGACTAAACACCAAAGAGCTTCTGATTCTCTTGTGAAACGCCATCCCCATTTTGCAAGCAATGCACAACATTTTTCAATCTCACGTTTCCAAGACCGAGACCCCCATCTTTCAAGTCTTTTGTAACAAGCTCCCATTTGGCCAAGTGGTTCAACTTACCTCCTTTGTGACCCTCCCAAAAGAAATTTCTCATTGATCTTTCTATCGAAGAGGCTATCTTTTTCGGCATTAAAAATAATGACATGTAATATGTAGGGAGATTGGAAAGCACTGAATTACA
TAAAGTCAGTCTACCTCCTTTTGGACAGATTGTATCTCTTCCACTTATCCAATTTTCCTTGGATTTTATCAAGGATAGGTTGCCAAAATGATAATTTTTTTGGATATCCCCCTAGTGGAAGACCGAGGTACATGATTGGTAAAAATTCGACCTTGCAATTCAGCCTGGCAGCAGCCATAATCACTTCATTTTCCTCAATATTGACTCCACACAAAGCAGATTTTTCCCAATTAACCTTCTGACCGAAACACCATTCAAAGAGCTCTATAATTTTGTGCAAGTTTTCTAACATTTCTTCATCATATTTGCAAAATATTAGGGTGTCTATAACCGTTTCCTTCTTCAAAAAAAAAAAAAGCTATTCCTATACTAAGTTTTTATATATTTTTTGAAATGGAAACAAGACTTTTCATTGAATTAATAAAATGAGTCTAATGCTCAAAGTACAATGAAACAGACAAAGAAAAGAAAAAATACAACTTAAAAAATGGAGCAACAAGAACAAAATGAAAGATAAGAGACCAATCCTATATTTTAAAAAACCTTTATTATATAATTTTTTATTTTTTTAAAAAAATAAAATTAGTTTCCCGAAAGACTTCATATTTTAAAAATATCTCTAAAAACTCTTATTTCTTTAAGTTTTCATTATCTCATCATGTATAGCTATATTTTCTACATACAAGTCTTTTAATTATATTTCTGTTTTCCTCTTGAAGAAAATCAAATCACTTTTTTTTTTATGCTTAGTATTTAAAAGACAACAGATTTCAAAGGCCCTTCCCAATTCTCTCAAGTTTCTCAAGATCGTAGTAAAAGTCAATAAAGGGATCTGTCTTGTTAGGTCTATGAAAAGTTTATGTTCATAGTTGTTTTTTGTATGTGTGTATTTGTTGTTCTGTACTTTTCTTTTATGTTCATAGTTGTTCTTTATATGGGTGTCTTTTATTTATGTGCAACTCCTTATTCCTTGAAGCCATAAGATCCACTTTTGAGCTAACATCAATAGACGAAGGTAGATTAGACAAGTCAATAGTTAGACTAGATGAGGTAGCTTAGTATAAACAATTTCAAATGGTGACTTCCATGTTTGGCCACCTTTGTTTGTCAGATTCAATCTTGAGGTGACTGTTTTAGTTCATAAAAGGACTTTTTGAGTTTACATACATTTCCTCTAGTATATCTATCTCGAGGTGATGTCTAAGTATTTCTTTTGAGCATATCTACTTGCTGTTTTCGTGTACTTTGAGCATTAATCTCTTTAAATTTTTTCAATGAAAAGTTTCATTTCCATTTTTAAAAAAAAGCATAAAATAAATCAAGTTGAAATAATGGCCAATCTAGATGTGTAGAAAAAGAGAAGAGTACTCCAATTATATTTAGCTTGGCAACTGGGGCAAAGGTTTTGTGATAGTCAATTCCACAAGCTTGGCCTCTTGGGTCAATCCTTTGGTCACAAGACGAGCTTTGAATCCCTCTATATTGCCATCTGGTTCGTGTTTGTTTGAAATCCGTTTGCTTCCAATTGTGATTAGAATCAGTGAAGAAGTGGCAAACGTATTATCATGGAAATCGATGGATGCAGGAAGAGAAAATATTAGAATTGCTTGGCCAAGAAGAATCCAGAGCAGAATCAGTCGATACCCCTTCATTAGAGATGGATATGAGAAGCTCTCTAAAGGCTGATTTGATGAGCATTTATAGAGCAGAAGAAAGGAATCTTATACAAAAAAGTAAACTCAACTGGCTCAAATTAGGGGATGAAAATTCGAGATTTTTTCACCGATTTCTGGTTGCAAAAAAGAGAAGAAACCTGATTTCTGGATTACTAAATAACCAAGGCACTCCAACAAATTCCTAACGAGAGATCGAAGCCTTAATCATTGACTATTACAAAGCCTTATATTCAAAGGTGCCTAGTGCAGGTTCCTTTCCAAGCAATTTGGAATGGCAAGTGGTCTCAGATGCGTAAAATAGTCTGCTGGTTTCAGGCTTCACAG
ATGGGGAAATCAGAAAAGCTCTAAAAGGGTTGGGGAAGAACAAAGCACCGGGGCCAGACGGTTTTACAGCGGAATTTCTTGTAAAATTTTGGGAGAAGCTCAAGGTTCATTTCATTAGCCTCTTCAATGAATTCTTTACAAATGGTAGGCTGAATTCTTGTGTTAGGAGAACATTATATGTTTGATTAAAAAGAAAGAAGATGCAATTATGACTAAAGACTTCCACCCAATAAGTCTCACAACATTAGTGTACAAAGCAATAGCCAAAGTATTTGCCAAAAGATTGAAGAGAGTCATGCCAAGTATTATTGCTCCAACCCAAAGTGCTTTCATTGAGGGTCGCCAAATTTTAGATCCAATTCTCATAGCAAACGAGATAGTGGAAGAATATAGATGCAAAAAGAAGAAAGGATAGCTTCTCAAACTTGATTTGGAAAAGGCCTTTGATAGAGTTGATTGGACCTTCTTGGAGAAGGTTCTAAAAGAAAAGAAATTCGATCCTAGATTGATATCGTGGATCTTGGGTTGCATTAAAAACCCTAAGTACTCAATTTTCATTAATGGGCGGCCAAGAGGGAGGATTATGGCAACTAGAGGAATTCGGCAGGGTGACACTCTTTCTCCATTCATTTTTCTATTAATAAGTGAAGTATCGAGCAGCCTCATATCTAAACTACACAAGAAAAAGAAGTTTGAAGGGTTTATTGTTGGAAGAGATAGAGTTCGCATTCTATTGCTCCAATTCGCGGATGTCACACTAATTTTTTGCAAATATGATGTGGAGATGCTAGAGAACTTGCGCAAAACCATAGAGTTTTTTGAATGGTGCTCCGTTAAATCAGCTTTGTGTGAAATCAATATAGAAGTCAATGAAGTGTGCTCGGTTGCTGCCATGCTGAATTGCAAGGTCGAAAAATTGCCTACAATGTACCTCGGTCTTCCATTAGGCGGGTACCCTAAAAAGGAATCATTCTGGTAGCCTACCCTTGACATAATTCAAGGAAAACTGAATAAGTGGAAGAGATTTAACCTGTCTAGAGGTGGTAGATTGACATTATGCAAATCAGTACTCTCTAATCTTCCCACATATTATATGTCAATATTTTTTATGCTGGAAAAGATAGTCTCTTCACTAGAAAGATCAATGAGAAACTTCTTTTGGGAAGGTCATAAAGGAGGTAAATTAAATCACTTGGTCAAATGGGAGTTAGCTACAAAAGACCAAAAAGATGGTGGCCTCGCCTTGGGAAGTTTGAGAATAAGAAATGTAGCCTTGCTTTCAAAATGGGGGTGGCGGTTCACAAGAGAATCAGAAGCTCTTTGGTGTATGGTCATGAAGAGTATTCATGGTAGTGGTCGCTTCAATTGGCATACAAACGGAAAAGGGAACGGTAGTTTGAGAAGTCCCTGGACAAGTATCTCAAAACAATGGTTAAAAGTGGAAGTTTTGGCTGAGTTTAAAGTTGGAAATGAGTGTAGAATATGTTTTGGCTAGACCCATGGATTGATAAATTGTCCCGAAGCTCAAGGTTCCCAAAGCTCTTCAAGTTAGCAGCCAATTCTAATGGTACAGTGGCAGATTTTTGGGATCCAATTACATCCTCATGGGATATTTCATTCAGAAGATTGCTAAAAGACGAAGAAATCTTAGATTTTCAGCACCTCACAGCAGTAATTGCAAACAAATCAGTGTTAGAGGGGTTGGATAGAAGGTTTTGGTCTTTGGAACCAAATGGAATTTTTTCAGTTAAATCCCTAGCCAAGCATCTATCATTGGCCTCCCCAATCGATCATGAGTTGGAAAAAAGCCTTTGGCATACCAAAAGCCCCCGAAGAGTTAATATCATGGTATGGTTAATGATCTTTAGATCATTAAATTGTGCTGTAGTTCTACAGAAAAAGCTCCCCTCACATTCCTTATCACCCCACATTTGCCCTTTATGCTTAGCTAACCAAGAGGAGTTGTAGCACCTTTTCTTTGGTTGTAACTACACGGTAAA
ATGTTGGCATCGACTATTTAGTATCTTCCATGTTTTCTGGGTCTTCGACAGAAATTTCAGCAGCGATGTGCAGCAGTTAGTGGCTGGTCCAGCTTTGAAAAAGGGGCCTCAACTACTGTGGAGCAATGCGGTTAAAGCCATTCTATCTGAAATTTGGTTTGAAAGAAATCAGAGGGTATTCTATGATAAAGTCTCCCCATGGATTGATCGTTTTGAGTTTGCTAGATTAAATGCTTCCTCATGGTGCAATCTTTCAAGACTACTCAATCTCGGAATTCAACTTGAATTGGAAGGCATTTATTAATCCTCGTTATTAGAGTTTCTCCTTTCTATGGTTCTTATCGAGATCAAAGTTGTAATCTTTTGTACTCATTTTATCTTTCGATTGTTGATTTTTGTTTTTTTGTTTTTCGGAAGTGATGTGGGTGCTAAGGGGGTGTCAACCTAGTTGAGATACCTGGGTGCACCTTCTTATCCCAATTTTGTTCTCATGTGTTGTGATTATTTATCTTGCTCTGTATTGAAAATTGATCTGTTGTAACATGAGCTTTGTCTCATTTCATTTTATCAATGAAAGAGAATGTTTCTTTTTTTTAAAAAAAAAAAAAAAAAAAAATCCAATTGTGCACTTGTTGAGGGCAAATTTGTAAGTACCCATGTCCCATTTTTCTTCAGAACTTCTCTAAAGTAGCTGCTTTCCACTCAGGTTTTTGTAATGCCTTCTTGAAAGGGTTTGGAGTTTGTACTTGATCTAGCGATGTCACAAATGCTCTAAAATGTGGTGACAGATTAGTAAGGATGTTGAGTACATGAGCGAACACCTTTTCTTACAGTAATAGGTTTGCCTTTATCATCTTTTGACGTCGCGTATACTGTTTTAGAGACATTGAAATTTTGATTTAGATTTTGGCTTTGCTGAACTTGTGGAGATTGTATTTCCTTCCACTCAAGTTGTCTTGCTCTTTGCAAGTAAGCTTGCAGTTCAGGCTTAAATTACATATGGGACTTTCCTCAGAAGAAGGTATAGGAGAATTTTTGAAGGATGAGATTAAAGGAAAGTTAGGAATACAGTCCCAATTTTGTGGTTCAAGATCATAATGTTCCCCCTGAATTGCAAAATCGGAAAAATATGACTGTTATTCAAAGAATGTGACATCCATAGTGTGAAAGAACTGGTGACATTACCATCAGTAGAACATTGACATTCATGTGTTTCAAATAAATCCTATTGTTACGAAAAGCTATAACGTGGTGTAGTAAGTTGTACCAAATACACCTTGTTCGAGATCTTGGAGAGTACTCTCGATCTTGAAAAGTTCAACAGTGCTTTCCTTATTTGAGAAAGTATCTTGAGCAGCATCCCAAATTTCTTTGGCAGTTGAATATAGGTAAAAATTCTCCCCGATCTCAATAGACATGGACTTGATCAACCAACTCACAACTTGATTGTTCCCTATCCCTCCATTTGGGAAATTTTGGATCGTTGAGTTCAGGTGATGTTGCTTTACATGTGATGTAGTCTTCTTGTCTACGACTATATGAACATCATCACAAATTAGGACCACTGAAGATAGTTGGGCCTTGAGTTGGTGACATTTTATTTAAGTATTGTTGCTCTCGGTGAAGGGCAAACTGGTAACTTGAGATTCTAATCTTGACATTGGTACAAGAGAGGACAAAAAGCTTTATTAGTTTCTAAACAATGGTGGTAGGGGTGTTTGGGGATGATGGCGACGGTGGTGGCTGTCAACGTCGAAGGAAACCGAAAAAAAAAAAAAAAGGGGAAAATTGCGTGGAGGTTGCAACAATGAAGACTTCGTGCAGACTTGTAGAGACGGCGGCGTCAATGGTGACTCCGAGCGAGTAGAGATGGGATGCAAAGCTACCGTCATGGGGAAGGGCTCGAGATTTGGAGGTCTGCCATTTGCGATTTTTGAATGTCAATGGCTGTGATGCTTGAGACAGTTGCCTATCGAACTGCTAAGATTTTCAATGATGCT
TGACTTTTTCTGAGGAAGAAGGCAGTCTGGCAACATTGGTTAGGGTTCTGGCCACGAGATTTGGGTTTCTTGGAAAATAAATCAGATCTTTGATACGATGAAGCCAGGTATAGGCAAAGATTTTTTTTTTTTCTTATTAGTCCTTGAGGGATGAGAAAAACTCGATATATACAAGAGGATTATCAAATCATATATGCTGGAATTAAGCTATCAATCTAAACCTATATAGAAAAGATAAATTACAATATACAAGCAAGTGGAGTTTAATCCATTAAACCCTATGTTCGAATTTCTTCTACCATCGCGGATTAAGCCTCACTTTTCCAGATGTCCATATGTATCCTCTGAAAAAAAGAATTTGTTTATGATTCAGACCAAATCGGAAGCTCGTCGGAAGTTGTCCTCTGTTTTCGATCCTGGGTTCTCCTCACGAGACAAGGAAACCCAAGATCAGATTAATGTGAATATGATTGAAGTTAAGGAGGATGAGCATTCAATGTCAAAAAGATCAGTTGGTCGTTCGAAGCCCGAGCCACCTAAGAAGGGGCATTTCAAATCCCCGCCAAACGAGTCAGTTTTTAAAATTCTGCCGCTCGACAAGCCGCCCAAAGAAGCCCAACGGTCACCTTCATTTACTTACTTTGTCGACTTTGATTCCAACAACAATGATAGTACACCCGCTCGTAGCCCAAAGGAAAAACACTCCAACTCTTCCATTTATACCTCTATCAGAATTCTGTTGTCTGTTGCAGTTAATTTTAATTGGCCGCTTGATCAGGTTGATGTTAAGAATGCCTTTCTCAATGGAGATCTTGAAGAAGAGGTATTTATGGACTTGTCACCCGACTTTGAAGCGGACCTTTGGGTCAACAAAGTATGCAAGTTAAAGAAATCATTATATGGCCTTAAACAGTCTTCTGAAGCTTGGTTTGAACGTTTTGGAAAGGCAGTCATGAGCTATGGATTCAACCAAAGTCAAGCCGATCACACTATGTTTTATAAGCATACTGGAAATGACAAGGTTGTTGTGTTGATAATATATGTTGATGATATTATACTTACAGGTAATGATGAGACAAGAATGACTATTGTGAAGAAGAAATTGGCAAATGATTTCCAAATCTAAGACCTGGGATCATTAAAATACTTCCTAGGTATGGAGTTTGCTAGGTCCAAAAGTGGCATTCTTGTACAGAGCAAATTGTAGATGTATTAACCAAAGGTCTTCCTAGGTGGCAATTCAACAAGTTGATTGACAAGCTGGCCATGAATGATATCTTCAAATCAGCTTAAGGGGGAGTGTTGATTATTTCCTTTTGTGTTGATATTTTCATTGTATTTATTGTGGAATATATACTGTATCTTTTTTCCGTATTGTATATTTATTACAAATAAAATCATTTTTTTATCAAAGATGAATAAGAGAGAAAAAATGCTTTTAGCAAAACCTAGCTACCAAAAAAGTATTTTCCCCTCTAAAAGATTGGATTAAGAAGAAGAAAAAAAAATCTCATTTCAGAACTGTCAAGTGGCAATTTATTCATCAACGGAAACATCTAGAAAACAAAAAAGTGAGTGTAAGTACTCCACAAGCATTTTTTTAATTAATGTTATTATTACAACAAGAATTTAAAATCAACTTCGTGTACAACAATTTTTTGTTTTTAAAGCTCAATTAAAGTTCTAGCTAAGGGTTTTGTAGATGAGTGGGCTAGCTCTGTTTCTTTTTTCATCGCGTTGACTTGTATCTCAGTAAGCAAGTTGCAAATGTTCTTATAAGTAGGACTAAACTCGAAGTTTGGGTGGTTGAATCAAGCATTTGAGCTAAGATAAGATTTGAGAGTAGTTAAGTATAAAGAGTTGCCTATCATGCTTTATTAGGAAATGTAATGTACTCTTTTTATGACAGGACTATAAAAATAGAAATTGATGGGCACTTCATGAAAGAAATGCTTTATAGTGGAGTCATTACCAGTTATTCCATCAAATAAACAGATGGC
TGAATTTTCATGGAAGGATTGCAAAGATTCTACCTTAGTAAACCATCATGGGTTGGCCTAGTGGTAAAAAAGGAGACATAGTCTTAATAAATGACTTAAGAGATCAAGGGTTCAATCCATGGTGGCCACCTACCTAGGAATTAATTTCCTATGAGTTTCCTTGACACCCAAATGTTGTAGGGTCAGGCGGGTCGTCCCGTGAGATTAGTCGAGGTGCGCGTAAGCTGGTCCGGACACTCACGGATATCAAAAAAAAAAAAAAAGATTCTACCTTAGTAGCCTTAGTGAGCAAGTTAAGAATGAATGATAGAAATCTTCAATCCAACTTGAGTGTTCCACATTGGCCAAGTCAATGGGACATCTTCCCATATTTTTATTGCACCTAAACTCTAATTTTAGTCCATACATATCTTATAAAACGCCTTTTCTCTGGATAGATTGTAATTCAGTGGTCATTCTATTCTCTGGCTTAATTCAGTGGTCATTCTATTCTCTGGCTTATTTTTTTCCTGGTATATGGAATTCTCCCTGCTATATTTTTATGTTTGTGTTGTGAGACCTTATTCCTTGAAACGTACTATAAGAGCAGTTCTCTTTGTACCTTCCTTCCAAACTAGGAACTCAACCAAGGTCTGCTTTGCTGGACGACCTTCAAGAGCAGGAAAATGTAATTCAAGAATGTATTAGGCAACTCGAAAGTGTTGAGGCAACTAGGGCTTCTTTGGTCTCCTTGCTTGTAGAAGCGCTTCAGGACCAAGTAACTTTCCTTCAGCCTTTTTTATTTCCTCCAATAATTTAACCCGATTGTTAATCTCCTCTATATTGTTGAGATACTTTCATTATAAAGTTGATTTCTCTTTGGAAAATTACAGGAATCAAAGCTGGAACTAGTTCGCAATCAGTTGCAGGTAAAATCATCCACTTTGTTTTGTTTTGTTTTGTTATGTTTTGGAAGTAGCAAGGTCTGGGATCACCTGTTGGCTTGAAGTAGTTTTGAAGGTGGATCGATATGTCCTTTATACCTTGAATCATTGAAATTTGTTATAAGATATTTGGATATGTCAAAACATTGAACTTTGGTACACACCTAGAAACACTTAAATCAGAGTAAGAGATTTATATAATTCTTTCATTGTGGACACATGGTGGAGTGGACACAGACACCCTTGAGCAAGTGAACACAATAATACTGTAACCAAAAACGTAAAGGAGAAAAATCAGACTGAATGAACTTGTTTGAAGTGCATAATTATGTTGGCACATTTGATTTTGGCGTTCTTCATAAGTTTGATGTAAGATATTTGATTATTAAGAAAACTGACAAACAATAGTGAAAACTTAAACTGTTAATCTTTTGAATTCGCAAGACAGGATGGCTAGTTACCAATCTTGGGATGAAAGCCCTCGTAGTTTTTGGCCTTAAATGTTCTTTGTATTATCATTTAGATATTAGTGGTTGACCAACATTGTTAATAAGGGGAAAGACATTGGTGAAAATCATTCATATTTTTCACTAATGTAAATTCATGTATGGAACTTTATGCTTGATTGGCTGTATGCTTAATTTCAATGCTATAGTCTGCATCTGGCTGAAGAAAGAAACGAAGCTATTGCTTGCCTCACGTCCACTAGCATCACAACTTATCTACTCGTTCATGTGGTATGAATGAGTTTCAATGTCAATATTACAGTTCAAACAAGTAGCAGATCAATGATTTGGGAAATCTACTTTTAGTTTCTTTTCTCACAGCGATCATGGAAACATTTTATGTCAATTTGCTCAGTCATTGCTAACCTATTTTAGGCTGTAGTAATTAAATTGTACTGTAATTTTCTATTGTCTGAACCTACATCAAGAATTTAGTGTGGGTCATGCCTAATGCAGTTTTTAGACACTATTTATTAGGAAAATTTCTGAACTGTGCATTTTCTAATGAGAGAGGGAGGGAGATAGAGCACTAGGTTGTGTACGCTATTTGCTCATTTCCATGCAGCATATC
AACTAGAGGGAGAGAGCACTTGGTTTTTGAGTATGTATGAACAAAACCCTCTCGGGCGTGTGTGCCTAAGCTGAAGACACAGGTCTAATGTCTCACCTTGAAAAGGTGAGCTCAAAAGATAAGGTGCATCTCAGGTGCACGCCTTTTGCAAAGCCCCAAGGCTCAAAGTCCGAGGCCTTGAGGTTTTTTCATTCTTTTTAAAAAATAGTAATAATTAGGATTTTCCTTTTTTATTAATTAAAAAAATCGAATTTATCAAGCCTAAATGCAAAATTTCTTCTATTCATGGGGTATTTTCTTTTCATATTTTCACTTCAACTATGCTTTTCTTCTTTTCATATATAGTGCTAAAAAACAAGGCCGCACTTTTTGTTTTTGCCTTGTGCTTAAGCTCCAGGAGACTATTGCTCTTTTATTGCACCTTGGGCTTCAAAAACACTGTATATGAATCTATATCAGATACTCACTGTATCCATGTCTATCAATTTGTTGATTGTTCTCTGCCATTCACATGGATGGAGATCTCATATTAAATATATCTCATATACTGGGAACGTAAAACGATTAGTTATAGAGTTTGGTATTTTCTAATTTCAGTTAGCTTATGTAGATTGGAAATTATTGCTTATGAATCTGCAGATTGCTCGATCTCAAATTGAGCTAGCAAGCAACGTTCGGAAGAGATTTACTTCGACTCCGGTTCCTGGTTCTTCAGCTACTACCATGGACCTACTGACAGAGATGACTCATGTGACCGATACTAAGTTATCTTCGGTTCAACAAAACATAATCTCTTCTCAGTCCCCTCTTGTCCAGACTATGGTGTCTTTTCCTGGTCCTAAAACCAGTGAAGAGGAAAACAAGAGAGCTGCTGCTGCTGCCGTTGCTGCAAAGTTGGCTGCTTCTACATCGTCGGCACAGATGCTTACCTCTGTTCTTTCATCCCTTGTTGCAGAAGAAGCTGCTTCCATGAATGGTAGCCTTAAATCATCTGGGTTCTCTTCATTGTCCATATTTTCTCCTGATAAACGACAGAAACTTGAGAAGCCAATGCCTAATTCTGATGTCAGCAGTTCAGATGGCGTCGGTGCATCATTTGTTACACCTATGCAACAACAATTGACAAGTTTACCACTTGCACAGTCAGTAAATGGGCAACCTGTTTCCCAGGGGAACCCGAGTCAGGCTTCATTTGCTCCACCACCTCCGCCAGCACCACCATCACTATCCTCAACTCCACCAGTGAATCAATATGCGCAGCCTGGTGGATTAGTGGGGGTATTACCGTATAATTTTGGAGCATATTCTCTACCACCTCCACCTCCTTTGCCTCCACATATTGCGATGGGTTTGAGTAGGCCGACATCTCAGCCACCTCCGCAACAGCTGCAGCAGCCACAGCAGTCACAGCCAACTTCAGCAGGATTTTATCGGCCACCAGGTATAGGTTTTTACGGGCAAGGCCAGCAGTCGACGCCACCCCCCGTCCCTAGGCAGTAAGTTCCCAGGTAGAATTTACTTGGGCCAAATCTTCACTTAAAAGGGTTAGTAGCTCTAATGGGTTAAACAAGCCCTACGCAATTTCTACTTGCCAACTGAGAGATCATTTCAGAATTTGTATTATCATGTAAATTAGTAGAATTTTTTCCTTTTTTTATTCTGTAGAAGTTTTAGTATATCTTCCCCTGCCAACAATTTAAAAGAGCATGTTGCAGCTGAGGTTCGTCCATGCTGATGGCAAAATTAGGTGGCTACCTTTAGGCTTTTCCAATGTGATGCTATAAAAATTTGGAAGCTTGACGGATTCTCGGCTACCTGAAAATTTTACTGTGGCCTGGTAAGCTTTTCTCCTATACGTTTGGAAGCTGGTTAGAGTACTGCCTAGAAGATTTTTAGTTGACATGTTAGATTCAGCTTTGAAGACCCTGTTGTCTTTAAAATCTGCATGGATAGAGTGGCCAAATAGACTTATTTTGGATAGCAATAACAGGATGATGATA
CTGGTCTGTAATATAACTTAGTGAGGCCATGAAAAATTACTAGTTAAATACAGCAGTGCCACTGTTTTGTAATCTGTTGCCCTGTTTTCTTATTCAGTAGAGAACTTTTATTTACAATAAACTGTTCCCTTCACTCTTCTTGGCT

mRNA sequence

ATGAATCTGTTTATCTTTGATATCAAATTAGCTTTATCAAGATGGTGCATTTCCCACCGGAAAAAGGCCAAGCAGATTGTTGAAACATGGGATAAATTGTTCAACTCATCTCAGAAAGAGCAGCGTGTTTCATTTCTGTATTTGGCGAATGACATTTTGCAGAACAGCAGGCGCAAGGGGAGTGAATTTGTGAATGAATTCTGGAAAGTTCTTCCTGGTGCTCTCAAGTATGTCTATGATCATGGTGATGAAGGTGGAAAGAAAGCAGTTGCCAGACTTAGTTGGTCGCATAGAGGGTCGTCAGACGTGATCGCAAAGTTGGCCGCTGGAGGTAGTCGTTGTCGAGTTGGAGGTGGTTATCTCCGAAATTGGGCATCGGAGGTAGCCTTTGCCAGAGTTGTCCATTGCATTAGAGTCGTCAGAGTTGGTCAAAGGAAGGTTTTTGGTTCTCGGGGTCAGAGTCTTAAGGATGAAATGCTTGGTAAAATTCCACCTCCAACTCAATTACCCAGCAGCAATGGGAAGAGTTCAAATCCTATAAAGATAGTGAAGAGGGATGCCCACTCTGTGAGAATTAAACTGGCTGTTGGAGGTGTTCCAGAAAAGATTCTTACTGCATTTCAATCTGTACTTGATGAACATCTGAATGAAGATGCTGCTCTGAACAACTGTAGTTCTGCTACTCATCATCTGAATCAAATTGAGGAAGACTTAAATGCTTCCTTAGCTCAAGATTTTGGCTTTGCTGAACTTGTGGAGATTGTATTTCCTTCCACTCAAGTTGTCTTGCTCTTTGCAAACTTCGTGCAGACTTGTAGAGACGGCGGCGTCAATGGTGACTCCGAGCGAGTAGAGATGGGATGCAAAGCTACCGTCATGGGGAAGGGCTCGAGATTTGGAGGCAGTCTGGCAACATTGACCAAATCGGAAGCTCGTCGGAAGTTGTCCTCTGTTTTCGATCCTGGGTTCTCCTCACGAGACAAGGAAACCCAAGATCAGATTAATGTGAATATGATTGAAGTTAAGGAGGATGAGCATTCAATGTCAAAAAGATCAGTTGGTCGTTCGAAGCCCGAGCCACCTAAGAAGGGGCATTTCAAATCCCCGCCAAACGAGTCAGTTTTTAAAATTCTGCCGCTCGACAAGCCGCCCAAAGAAGCCCAACGCTCAATTAAAGTTCTAGCTAAGGGTTTTGTAGATGAGTGGGCTAGCTCTGTTTCTTTTTTCATCGCGTTGACTTGTATCTCATTCTCTTTGTACCTTCCTTCCAAACTAGGAACTCAACCAAGGTCTGCTTTGCTGGACGACCTTCAAGAGCAGGAAAATGTAATTCAAGAATGTATTAGGCAACTCGAAAGTGTTGAGGCAACTAGGGCTTCTTTGGTCTCCTTGCTTGTAGAAGCGCTTCAGGACCAAGAATCAAAGCTGGAACTAGTTCGCAATCAGTTGCAGATTGCTCGATCTCAAATTGAGCTAGCAAGCAACGTTCGGAAGAGATTTACTTCGACTCCGGTTCCTGGTTCTTCAGCTACTACCATGGACCTACTGACAGAGATGACTCATGTGACCGATACTAAGTTATCTTCGGTTCAACAAAACATAATCTCTTCTCAGTCCCCTCTTGTCCAGACTATGGTGTCTTTTCCTGGTCCTAAAACCAGTGAAGAGGAAAACAAGAGAGCTGCTGCTGCTGCCGTTGCTGCAAAGTTGGCTGCTTCTACATCGTCGGCACAGATGCTTACCTCTGTTCTTTCATCCCTTGTTGCAGAAGAAGCTGCTTCCATGAATGGTAGCCTTAAATCATCTGGGTTCTCTTCATTGTCCATATTTTCTCCTGATAAACGACAGAAACTTGAGAAGCCAATGCCTAATTCTGATGTCAGCAGTTCAGATGGCGTCGGTGCATCATTTGTTACACCTATGCAACAACAATTGACAAGTTTACCACTTGCACAGTCAGTAAATGGGCAACCTGTTTCCCAGGGGAACCCGAGTCAGGC
TTCATTTGCTCCACCACCTCCGCCAGCACCACCATCACTATCCTCAACTCCACCAGTGAATCAATATGCGCAGCCTGGTGGATTAGTGGGGGTATTACCGTATAATTTTGGAGCATATTCTCTACCACCTCCACCTCCTTTGCCTCCACATATTGCGATGGGTTTGAGTAGGCCGACATCTCAGCCACCTCCGCAACAGCTGCAGCAGCCACAGCAGTCACAGCCAACTTCAGCAGGATTTTATCGGCCACCAGGTATAGGTTTTTACGGGCAAGGCCAGCAGTCGACGCCACCCCCCGTCCCTAGGCAGTAAGTTCCCAGGTAGAATTTACTTGGGCCAAATCTTCACTTAAAAGGAAGTTTTAGTATATCTTCCCCTGCCAACAATTTAAAAGAGCATGTTGCAGCTGAGGTTCGTCCATGCTGATGGCAAAATTAGGTGGCTACCTTTAGGCTTTTCCAATGTGATGCTATAAAAATTTGGAAGCTTGACGGATTCTCGGCTACCTGAAAATTTTACTGTGGCCTGGTAAGCTTTTCTCCTATACGTTTGGAAGCTGGTTAGAGTACTGCCTAGAAGATTTTTAGTTGACATGTTAGATTCAGCTTTGAAGACCCTGTTGTCTTTAAAATCTGCATGGATAGAGTGGCCAAATAGACTTATTTTGGATAGCAATAACAGGATGATGATACTGGTCTGTAATATAACTTAGTGAGGCCATGAAAAATTACTAGTTAAATACAGCAGTGCCACTGTTTTGTAATCTGTTGCCCTGTTTTCTTATTCAGTAGAGAACTTTTATTTACAATAAACTGTTCCCTTCACTCTTCTTGGCT

Coding sequence (CDS)

ATGAATCTGTTTATCTTTGATATCAAATTAGCTTTATCAAGATGGTGCATTTCCCACCGGAAAAAGGCCAAGCAGATTGTTGAAACATGGGATAAATTGTTCAACTCATCTCAGAAAGAGCAGCGTGTTTCATTTCTGTATTTGGCGAATGACATTTTGCAGAACAGCAGGCGCAAGGGGAGTGAATTTGTGAATGAATTCTGGAAAGTTCTTCCTGGTGCTCTCAAGTATGTCTATGATCATGGTGATGAAGGTGGAAAGAAAGCAGTTGCCAGACTTAGTTGGTCGCATAGAGGGTCGTCAGACGTGATCGCAAAGTTGGCCGCTGGAGGTAGTCGTTGTCGAGTTGGAGGTGGTTATCTCCGAAATTGGGCATCGGAGGTAGCCTTTGCCAGAGTTGTCCATTGCATTAGAGTCGTCAGAGTTGGTCAAAGGAAGGTTTTTGGTTCTCGGGGTCAGAGTCTTAAGGATGAAATGCTTGGTAAAATTCCACCTCCAACTCAATTACCCAGCAGCAATGGGAAGAGTTCAAATCCTATAAAGATAGTGAAGAGGGATGCCCACTCTGTGAGAATTAAACTGGCTGTTGGAGGTGTTCCAGAAAAGATTCTTACTGCATTTCAATCTGTACTTGATGAACATCTGAATGAAGATGCTGCTCTGAACAACTGTAGTTCTGCTACTCATCATCTGAATCAAATTGAGGAAGACTTAAATGCTTCCTTAGCTCAAGATTTTGGCTTTGCTGAACTTGTGGAGATTGTATTTCCTTCCACTCAAGTTGTCTTGCTCTTTGCAAACTTCGTGCAGACTTGTAGAGACGGCGGCGTCAATGGTGACTCCGAGCGAGTAGAGATGGGATGCAAAGCTACCGTCATGGGGAAGGGCTCGAGATTTGGAGGCAGTCTGGCAACATTGACCAAATCGGAAGCTCGTCGGAAGTTGTCCTCTGTTTTCGATCCTGGGTTCTCCTCACGAGACAAGGAAACCCAAGATCAGATTAATGTGAATATGATTGAAGTTAAGGAGGATGAGCATTCAATGTCAAAAAGATCAGTTGGTCGTTCGAAGCCCGAGCCACCTAAGAAGGGGCATTTCAAATCCCCGCCAAACGAGTCAGTTTTTAAAATTCTGCCGCTCGACAAGCCGCCCAAAGAAGCCCAACGCTCAATTAAAGTTCTAGCTAAGGGTTTTGTAGATGAGTGGGCTAGCTCTGTTTCTTTTTTCATCGCGTTGACTTGTATCTCATTCTCTTTGTACCTTCCTTCCAAACTAGGAACTCAACCAAGGTCTGCTTTGCTGGACGACCTTCAAGAGCAGGAAAATGTAATTCAAGAATGTATTAGGCAACTCGAAAGTGTTGAGGCAACTAGGGCTTCTTTGGTCTCCTTGCTTGTAGAAGCGCTTCAGGACCAAGAATCAAAGCTGGAACTAGTTCGCAATCAGTTGCAGATTGCTCGATCTCAAATTGAGCTAGCAAGCAACGTTCGGAAGAGATTTACTTCGACTCCGGTTCCTGGTTCTTCAGCTACTACCATGGACCTACTGACAGAGATGACTCATGTGACCGATACTAAGTTATCTTCGGTTCAACAAAACATAATCTCTTCTCAGTCCCCTCTTGTCCAGACTATGGTGTCTTTTCCTGGTCCTAAAACCAGTGAAGAGGAAAACAAGAGAGCTGCTGCTGCTGCCGTTGCTGCAAAGTTGGCTGCTTCTACATCGTCGGCACAGATGCTTACCTCTGTTCTTTCATCCCTTGTTGCAGAAGAAGCTGCTTCCATGAATGGTAGCCTTAAATCATCTGGGTTCTCTTCATTGTCCATATTTTCTCCTGATAAACGACAGAAACTTGAGAAGCCAATGCCTAATTCTGATGTCAGCAGTTCAGATGGCGTCGGTGCATCATTTGTTACACCTATGCAACAACAATTGACAAGTTTACCACTTGCACAGTCAGTAAATGGGCAACCTGTTTCCCAGGGGAACCCGAGTCAGGC
TTCATTTGCTCCACCACCTCCGCCAGCACCACCATCACTATCCTCAACTCCACCAGTGAATCAATATGCGCAGCCTGGTGGATTAGTGGGGGTATTACCGTATAATTTTGGAGCATATTCTCTACCACCTCCACCTCCTTTGCCTCCACATATTGCGATGGGTTTGAGTAGGCCGACATCTCAGCCACCTCCGCAACAGCTGCAGCAGCCACAGCAGTCACAGCCAACTTCAGCAGGATTTTATCGGCCACCAGGTATAGGTTTTTACGGGCAAGGCCAGCAGTCGACGCCACCCCCCGTCCCTAGGCAGTAA

Protein sequence

MNLFIFDIKLALSRWCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLANDILQNSRRKGSEFVNEFWKVLPGALKYVYDHGDEGGKKAVARLSWSHRGSSDVIAKLAAGGSRCRVGGGYLRNWASEVAFARVVHCIRVVRVGQRKVFGSRGQSLKDEMLGKIPPPTQLPSSNGKSSNPIKIVKRDAHSVRIKLAVGGVPEKILTAFQSVLDEHLNEDAALNNCSSATHHLNQIEEDLNASLAQDFGFAELVEIVFPSTQVVLLFANFVQTCRDGGVNGDSERVEMGCKATVMGKGSRFGGSLATLTKSEARRKLSSVFDPGFSSRDKETQDQINVNMIEVKEDEHSMSKRSVGRSKPEPPKKGHFKSPPNESVFKILPLDKPPKEAQRSIKVLAKGFVDEWASSVSFFIALTCISFSLYLPSKLGTQPRSALLDDLQEQENVIQECIRQLESVEATRASLVSLLVEALQDQESKLELVRNQLQIARSQIELASNVRKRFTSTPVPGSSATTMDLLTEMTHVTDTKLSSVQQNIISSQSPLVQTMVSFPGPKTSEEENKRAAAAAVAAKLAASTSSAQMLTSVLSSLVAEEAASMNGSLKSSGFSSLSIFSPDKRQKLEKPMPNSDVSSSDGVGASFVTPMQQQLTSLPLAQSVNGQPVSQGNPSQASFAPPPPPAPPSLSSTPPVNQYAQPGGLVGVLPYNFGAYSLPPPPPLPPHIAMGLSRPTSQPPPQQLQQPQQSQPTSAGFYRPPGIGFYGQGQQSTPPPVPRQ
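The protein sequence above should correspond to the coding sequence read codon by codon. The sketch below shows that relationship in plain Python, assuming the standard genetic code (NCBI translation table 1); the page does not say which table was used, and the names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an ambiguous codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 11 codons and last 4 codons of the CDS listed above.
print(translate("ATGAATCTGTTTATCTTTGATATCAAATTAGCT"))  # MNLFIFDIKLA
print(translate("CCTAGGCAGTAA"))                       # PRQ
```

Because `translate` strips whitespace first, the two CDS lines shown above can be joined and passed in as a single string.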
Homology
BLAST of CaUC10G194330 vs. NCBI nr
Match: XP_038904543.1 (regulation of nuclear pre-mRNA domain-containing protein 1B-like [Benincasa hispida])

HSP 1 Score: 860.1 bits (2221), Expect = 1.4e-245
Identity = 514/761 (67.54%), Positives = 521/761 (68.46%), Query Frame = 0

Query: 11  ALSRWCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLANDILQNSRRKGSEFVNEFWKV 70
           +LSRWCISHRKKAKQIVETWDK FNSSQKEQRVSFLYLANDILQNSRRKGSEFVNEFWKV
Sbjct: 27  SLSRWCISHRKKAKQIVETWDKSFNSSQKEQRVSFLYLANDILQNSRRKGSEFVNEFWKV 86

Query: 71  LPGALKYVYDHGDEGGKKAVARLSWSHRGSSDVIAKLAAGGSRCRVGGGYLRNWASEVAF 130
           LPGALKYVYDHGDEGGKKAVARL                           +  W      
Sbjct: 87  LPGALKYVYDHGDEGGKKAVARL---------------------------VNIWE----- 146

Query: 131 ARVVHCIRVVRVGQRKVFGSRGQSLKDEMLGKIPPPTQLPSSNGKSSNPIKIVKRDAHSV 190
                        +RKVFGSRGQSLKDEMLGK PPPTQLPSSNGKSSNPIKIVKRDAHSV
Sbjct: 147 -------------ERKVFGSRGQSLKDEMLGKTPPPTQLPSSNGKSSNPIKIVKRDAHSV 206

Query: 191 RIKLAVGGVPEKILTAFQSVLDEHLNEDAALNNCSSATHHLNQIEEDLNASLAQDFGFAE 250
           RIKLAVGGVPEKILTAFQSVLDEHLNEDAALNNCSSATH+LNQIEEDLNASLAQ      
Sbjct: 207 RIKLAVGGVPEKILTAFQSVLDEHLNEDAALNNCSSATHNLNQIEEDLNASLAQ------ 266

Query: 251 LVEIVFPSTQVVLLFANFVQTCRDGGVNGDSERVEMGCKATVMGKGSRFGGSLATLTKSE 310
                                                                       
Sbjct: 267 ------------------------------------------------------------ 326

Query: 311 ARRKLSSVFDPGFSSRDKETQDQINVNMIEVKEDEHSMSKRSVGRSKPEPPKKGHFKSPP 370
                                                                       
Sbjct: 327 ------------------------------------------------------------ 386

Query: 371 NESVFKILPLDKPPKEAQRSIKVLAKGFVDEWASSVSFFIALTCISFSLYLPSKLGTQPR 430
                                                                  GTQPR
Sbjct: 387 -------------------------------------------------------GTQPR 446

Query: 431 SALLDDLQEQENVIQECIRQLESVEATRASLVSLLVEALQDQESKLELVRNQLQIARSQI 490
           S LLDDLQEQENVIQECIRQLESVEATRASLVSLLVEALQDQESKLELVR+QLQIARSQI
Sbjct: 447 SGLLDDLQEQENVIQECIRQLESVEATRASLVSLLVEALQDQESKLELVRSQLQIARSQI 506

Query: 491 ELASNVRKRFTSTPVPGSSATTMDLLTEMTHVTDTKLSSVQQNIISSQSPLVQTMVSFPG 550
           ELASNVRKRFTSTPVPGSSATTMDLLTEMTHVTDTKLSSVQQNIISSQSPLVQ M SFPG
Sbjct: 507 ELASNVRKRFTSTPVPGSSATTMDLLTEMTHVTDTKLSSVQQNIISSQSPLVQAMASFPG 561

Query: 551 PKTSEEENKRAAAAAVAAKLAASTSSAQMLTSVLSSLVAEEAASMNGSLKSSGFSSLSIF 610
           PKTSEEENKRAAAAAVAAKLAASTSSAQMLTSVLSSLVAEEAASMNGSLKSSGFSSLSIF
Sbjct: 567 PKTSEEENKRAAAAAVAAKLAASTSSAQMLTSVLSSLVAEEAASMNGSLKSSGFSSLSIF 561

Query: 611 SPDKRQKLEKPMPNSDVSSSDGVGASFVTPMQQ-QLTSLPLAQSVNGQPVSQGNPSQASF 670
           SP+KRQKLEKPMP SDVSSSDGV ASFVTPMQQ QLTSLPLAQSVNGQPVSQ NPSQASF
Sbjct: 627 SPEKRQKLEKPMPISDVSSSDGVSASFVTPMQQPQLTSLPLAQSVNGQPVSQANPSQASF 561

Query: 671 APPPPPAPPSLSSTPPVNQYAQPGGLVGVLPYNFGAYSLPPPPPLPPHIAMGLSRPTSQP 730
           APPPPPAPPSLSSTPPVNQY QPGGL+GVLPYNFGAYSLPPPPPLPPHIAMGLSRPTSQP
Sbjct: 687 APPPPPAPPSLSSTPPVNQYVQPGGLMGVLPYNFGAYSLPPPPPLPPHIAMGLSRPTSQP 561

Query: 731 PPQQLQQPQQSQPTSAGFYRPPGIGFYGQGQQSTPPPVPRQ 771
           P QQLQQPQQSQPTSAGFYRPPGIGFYGQGQQSTPPPVPRQ
Sbjct: 747 PQQQLQQPQQSQPTSAGFYRPPGIGFYGQGQQSTPPPVPRQ 561
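
The Identity and Positives percentages in the HSP summary are the match counts divided by the alignment length. The sketch below parses one such summary line and recomputes the percentages as a consistency check; `parse_hsp_stats` is a hypothetical helper, not part of any BLAST tool.

```python
import re

def parse_hsp_stats(line):
    """Return {label: (matches, length, reported_pct, recomputed_pct)}."""
    stats = {}
    for label, num, den, pct in re.findall(
        r"(\w+)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)", line
    ):
        matches, length = int(num), int(den)
        stats[label] = (matches, length, float(pct),
                        round(100 * matches / length, 2))
    return stats

line = "Identity = 514/761 (67.54%), Positives = 521/761 (68.46%), Query Frame = 0"
for label, (m, n, reported, recomputed) in parse_hsp_stats(line).items():
    print(label, f"{m}/{n}", reported, recomputed)
```

For this hit the recomputed values agree with the reported ones to two decimal places.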

BLAST of CaUC10G194330 vs. NCBI nr
Match: XP_004136608.1 (regulation of nuclear pre-mRNA domain-containing protein 1B [Cucumis sativus] >XP_031739391.1 regulation of nuclear pre-mRNA domain-containing protein 1B [Cucumis sativus] >KAE8651160.1 hypothetical protein Csa_001803 [Cucumis sativus])

HSP 1 Score: 835.5 bits (2157), Expect = 3.7e-238
Identity = 495/760 (65.13%), Positives = 510/760 (67.11%), Query Frame = 0

Query: 11  ALSRWCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLANDILQNSRRKGSEFVNEFWKV 70
           +LS+WCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLANDILQNSRRKGSEFVNEFWKV
Sbjct: 27  SLSKWCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLANDILQNSRRKGSEFVNEFWKV 86

Query: 71  LPGALKYVYDHGDEGGKKAVARLSWSHRGSSDVIAKLAAGGSRCRVGGGYLRNWASEVAF 130
           LPGALKYVYDHGDE GKKAVARL                           +  W      
Sbjct: 87  LPGALKYVYDHGDESGKKAVARL---------------------------VNIWE----- 146

Query: 131 ARVVHCIRVVRVGQRKVFGSRGQSLKDEMLGKIPPPTQLPSSNGKSSNPIKIVKRDAHSV 190
                        +RKVFGSRGQSLKDEMLGK PPPT LPSSNGKSSNPIKIVKRDAHSV
Sbjct: 147 -------------ERKVFGSRGQSLKDEMLGKNPPPTPLPSSNGKSSNPIKIVKRDAHSV 206

Query: 191 RIKLAVGGVPEKILTAFQSVLDEHLNEDAALNNCSSATHHLNQIEEDLNASLAQDFGFAE 250
           RIKLAVGGVPEKILTAFQSVLDEHLNEDAAL NCSSATHHLNQIEEDLN SLAQ      
Sbjct: 207 RIKLAVGGVPEKILTAFQSVLDEHLNEDAALINCSSATHHLNQIEEDLNVSLAQ------ 266

Query: 251 LVEIVFPSTQVVLLFANFVQTCRDGGVNGDSERVEMGCKATVMGKGSRFGGSLATLTKSE 310
                                                                       
Sbjct: 267 ------------------------------------------------------------ 326

Query: 311 ARRKLSSVFDPGFSSRDKETQDQINVNMIEVKEDEHSMSKRSVGRSKPEPPKKGHFKSPP 370
                                                                       
Sbjct: 327 ------------------------------------------------------------ 386

Query: 371 NESVFKILPLDKPPKEAQRSIKVLAKGFVDEWASSVSFFIALTCISFSLYLPSKLGTQPR 430
                                                                  GTQPR
Sbjct: 387 -------------------------------------------------------GTQPR 446

Query: 431 SALLDDLQEQENVIQECIRQLESVEATRASLVSLLVEALQDQESKLELVRNQLQIARSQI 490
           SALLDDLQ+QE VIQECIRQLE VEATRASLVSLLVEALQDQESKLELVRNQLQ+ARSQI
Sbjct: 447 SALLDDLQDQETVIQECIRQLEGVEATRASLVSLLVEALQDQESKLELVRNQLQVARSQI 506

Query: 491 ELASNVRKRFTSTPVPGSSATTMDLLTEMTHVTDTKLSSVQQNIISSQSPLVQTMVSFPG 550
           ELASNVRKRFTST VPG SATT+DLLTEMTH TD+KLSSVQQNIISSQSPL+Q M SFPG
Sbjct: 507 ELASNVRKRFTSTTVPGPSATTVDLLTEMTHATDSKLSSVQQNIISSQSPLIQAMGSFPG 560

Query: 551 PKTSEEENKRAAAAAVAAKLAASTSSAQMLTSVLSSLVAEEAASMNGSLKSSGFSSLSIF 610
           PKTSEEENKRAAAAAVAAKLAASTSSAQMLTSVLSSLVAEEAASMNG LKSSGFSSLS+F
Sbjct: 567 PKTSEEENKRAAAAAVAAKLAASTSSAQMLTSVLSSLVAEEAASMNGGLKSSGFSSLSLF 560

Query: 611 SPDKRQKLEKPMPNSDVSSSDGVGASFVTPMQQQLTSLPLAQSVNGQPVSQGNPSQASFA 670
           SP+KRQKLEKPMP SDVSSSDG GASFV PMQQQ+TS+PLAQS NGQPVSQ NPSQASFA
Sbjct: 627 SPEKRQKLEKPMPISDVSSSDGAGASFVAPMQQQMTSMPLAQSANGQPVSQANPSQASFA 560

Query: 671 PPPPPAPPSLSSTPPVNQYAQPGGLVGVLPYNFGAYSLPPPPPLPPHIAMGLSRPTSQPP 730
           PPPPP PPSLSSTPPVNQYAQ GGL+GVLPYNFGAYSLPPPPPLPPHIAMGLSRPTSQPP
Sbjct: 687 PPPPPVPPSLSSTPPVNQYAQSGGLMGVLPYNFGAYSLPPPPPLPPHIAMGLSRPTSQPP 560

Query: 731 PQQLQQPQQSQPTSAGFYRPPGIGFYGQGQQSTPPPVPRQ 771
           PQQLQQPQQSQP S+GFYRPPGIGFYGQGQQSTPPPVPRQ
Sbjct: 747 PQQLQQPQQSQPASSGFYRPPGIGFYGQGQQSTPPPVPRQ 560
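
The percentages on each HSP summary line are simply the identical (or positive) column count divided by the alignment length. A minimal Python sketch for checking them follows. The helper name `parse_hsp_stats` is hypothetical and not part of any BLAST tool, and the regular expression assumes the summary-line layout shown on this page.

```python
import re

# Matches the HSP summary line as printed on this page.
# The label is matched as "Positives" or "Postives" because some exports misspell it.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts, alignment length, and recomputed percentages for one HSP line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identical, length, identity_pct, positives, pos_length, positives_pct = match.groups()
    identical, length = int(identical), int(length)
    positives, pos_length = int(positives), int(pos_length)
    return {
        "identical": identical,
        "positives": positives,
        "length": length,
        # Percentages as reported in the line itself.
        "reported_identity_pct": float(identity_pct),
        "reported_positives_pct": float(positives_pct),
        # Percentages recomputed from the raw counts.
        "identity_pct": round(100.0 * identical / length, 2),
        "positives_pct": round(100.0 * positives / pos_length, 2),
    }


if __name__ == "__main__":
    line = "Identity = 495/760 (65.13%), Positives = 510/760 (67.11%), Query Frame = 0"
    print(parse_hsp_stats(line))
```

The denominator is the alignment length including gap columns, not the query length. In the Cucumis sativus hit above, the long gapped stretch between query positions 251 and 425 therefore lowers the overall identity even though the aligned flanks are nearly identical.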

BLAST of CaUC10G194330 vs. NCBI nr
Match: XP_008443180.1 (PREDICTED: regulation of nuclear pre-mRNA domain-containing protein 1B-like [Cucumis melo])

HSP 1 Score: 832.0 bits (2148), Expect = 4.1e-237
Identity = 496/760 (65.26%), Positives = 508/760 (66.84%), Query Frame = 0

Query: 11  ALSRWCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLANDILQNSRRKGSEFVNEFWKV 70
           +LS+WCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLANDILQNSRRKGSEFVNEFWKV
Sbjct: 27  SLSKWCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLANDILQNSRRKGSEFVNEFWKV 86

Query: 71  LPGALKYVYDHGDEGGKKAVARLSWSHRGSSDVIAKLAAGGSRCRVGGGYLRNWASEVAF 130
           LPGALKYVYDHGDE GKKAVARL                           +  W      
Sbjct: 87  LPGALKYVYDHGDESGKKAVARL---------------------------VNIWE----- 146

Query: 131 ARVVHCIRVVRVGQRKVFGSRGQSLKDEMLGKIPPPTQLPSSNGKSSNPIKIVKRDAHSV 190
                        +RKVFGSRGQSLKDEMLGK PPPT LPSSNGKSSNPIKIVKRDAHSV
Sbjct: 147 -------------ERKVFGSRGQSLKDEMLGKNPPPTPLPSSNGKSSNPIKIVKRDAHSV 206

Query: 191 RIKLAVGGVPEKILTAFQSVLDEHLNEDAALNNCSSATHHLNQIEEDLNASLAQDFGFAE 250
           RIKLAVGGVPEKILTAFQSVLDEHLNEDAALNNCSSATHHLNQIEEDLN SLAQ      
Sbjct: 207 RIKLAVGGVPEKILTAFQSVLDEHLNEDAALNNCSSATHHLNQIEEDLNVSLAQ------ 266

Query: 251 LVEIVFPSTQVVLLFANFVQTCRDGGVNGDSERVEMGCKATVMGKGSRFGGSLATLTKSE 310
                                                                       
Sbjct: 267 ------------------------------------------------------------ 326

Query: 311 ARRKLSSVFDPGFSSRDKETQDQINVNMIEVKEDEHSMSKRSVGRSKPEPPKKGHFKSPP 370
                                                                       
Sbjct: 327 ------------------------------------------------------------ 386

Query: 371 NESVFKILPLDKPPKEAQRSIKVLAKGFVDEWASSVSFFIALTCISFSLYLPSKLGTQPR 430
                                                                  GTQPR
Sbjct: 387 -------------------------------------------------------GTQPR 446

Query: 431 SALLDDLQEQENVIQECIRQLESVEATRASLVSLLVEALQDQESKLELVRNQLQIARSQI 490
           S LLDDLQ+QE VIQECIRQLE VEATRASLVSLLVEALQDQESKLELVRNQLQ+ARSQI
Sbjct: 447 SGLLDDLQDQETVIQECIRQLEGVEATRASLVSLLVEALQDQESKLELVRNQLQVARSQI 506

Query: 491 ELASNVRKRFTSTPVPGSSATTMDLLTEMTHVTDTKLSSVQQNIISSQSPLVQTMVSFPG 550
           ELASNVRKRFTST  PG SATT+DLLTEMTHVTDTKLSSVQQNIISSQ PLVQ M SFPG
Sbjct: 507 ELASNVRKRFTSTTAPGPSATTIDLLTEMTHVTDTKLSSVQQNIISSQPPLVQAMGSFPG 558

Query: 551 PKTSEEENKRAAAAAVAAKLAASTSSAQMLTSVLSSLVAEEAASMNGSLKSSGFSSLSIF 610
           PKTSEEENKRAAAAAVAAKLAASTSSAQMLTSVLSSLVAEEAASMNG LKSSGFSSL  F
Sbjct: 567 PKTSEEENKRAAAAAVAAKLAASTSSAQMLTSVLSSLVAEEAASMNGGLKSSGFSSL--F 558

Query: 611 SPDKRQKLEKPMPNSDVSSSDGVGASFVTPMQQQLTSLPLAQSVNGQPVSQGNPSQASFA 670
           SP+KRQKLEKPMP SDVSSSDGVGASFV PMQQQ+TS+PLAQS NGQPVSQ NPSQASFA
Sbjct: 627 SPEKRQKLEKPMPISDVSSSDGVGASFVAPMQQQMTSMPLAQSANGQPVSQANPSQASFA 558

Query: 671 PPPPPAPPSLSSTPPVNQYAQPGGLVGVLPYNFGAYSLPPPPPLPPHIAMGLSRPTSQPP 730
           PPPPP PPSLSSTPPVNQYAQ GGL+GVLPYNFGAYSLPPPPPLPPHIAMGLSRPTSQPP
Sbjct: 687 PPPPPVPPSLSSTPPVNQYAQSGGLIGVLPYNFGAYSLPPPPPLPPHIAMGLSRPTSQPP 558

Query: 731 PQQLQQPQQSQPTSAGFYRPPGIGFYGQGQQSTPPPVPRQ 771
           PQQLQ PQQSQPTS+GFYRPPGIGFYGQGQQSTPPPVPRQ
Sbjct: 747 PQQLQPPQQSQPTSSGFYRPPGIGFYGQGQQSTPPPVPRQ 558

BLAST of CaUC10G194330 vs. NCBI nr
Match: KAG7027736.1 (Regulation of nuclear pre-mRNA domain-containing protein 1B, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 778.1 bits (2008), Expect = 7.1e-221
Identity = 481/771 (62.39%), Positives = 500/771 (64.85%), Query Frame = 0

Query: 1   MNLFIFDIKLALSRWCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLANDILQNSRRKG 60
           MNLF FDI+LALSRWCISHRKKAKQIVETWDKLFNSSQK+QRVSFLYLANDILQNSRRKG
Sbjct: 7   MNLFFFDIELALSRWCISHRKKAKQIVETWDKLFNSSQKDQRVSFLYLANDILQNSRRKG 66

Query: 61  SEFVNEFWKVLPGALKYVYDHGDEGGKKAVARLSWSHRGSSDVIAKLAAGGSRCRVGGGY 120
           SEFVNEFWKVLP +LKYVYDHGDE GKKAVARL                           
Sbjct: 67  SEFVNEFWKVLPVSLKYVYDHGDESGKKAVARL--------------------------- 126

Query: 121 LRNWASEVAFARVVHCIRVVRVGQRKVFGSRGQSLKDEMLGKIPPPTQLPSSNGKSSNPI 180
           +  W                   +RKVFGSRGQSLKDEMLGK PPP  LP+SNGKSSNPI
Sbjct: 127 VDIWE------------------ERKVFGSRGQSLKDEMLGKNPPP--LPNSNGKSSNPI 186

Query: 181 KIVKRDAHSVRIKLAVGGVPEKILTAFQSVLDEHLNEDAALNNCSSATHHLNQIEEDLNA 240
           KIVKRDAHSVRIKLAVGGVPEKILTAFQSVLDEHLNEDAALNNCSSATHHLNQIEEDLNA
Sbjct: 187 KIVKRDAHSVRIKLAVGGVPEKILTAFQSVLDEHLNEDAALNNCSSATHHLNQIEEDLNA 246

Query: 241 SLAQDFGFAELVEIVFPSTQVVLLFANFVQTCRDGGVNGDSERVEMGCKATVMGKGSRFG 300
           S+AQ                                                        
Sbjct: 247 SIAQ-------------------------------------------------------- 306

Query: 301 GSLATLTKSEARRKLSSVFDPGFSSRDKETQDQINVNMIEVKEDEHSMSKRSVGRSKPEP 360
                                                                       
Sbjct: 307 ------------------------------------------------------------ 366

Query: 361 PKKGHFKSPPNESVFKILPLDKPPKEAQRSIKVLAKGFVDEWASSVSFFIALTCISFSLY 420
                                                                       
Sbjct: 367 ------------------------------------------------------------ 426

Query: 421 LPSKLGTQPRSALLDDLQEQENVIQECIRQLESVEATRASLVSLLVEALQDQESKLELVR 480
                GTQPRSALLDDLQEQENV+Q+CI QLESVEATRASLVSLLVEALQDQESKLELVR
Sbjct: 427 -----GTQPRSALLDDLQEQENVVQQCIGQLESVEATRASLVSLLVEALQDQESKLELVR 486

Query: 481 NQLQIARSQIELASNVRKRFTSTPVPGSSATTMDLLTEMTHVTDTKLSSVQQNIISSQSP 540
           NQLQIARSQIELASNVRKR  S PVPG  AT +DLLTEMTHVTD KL S Q N ISSQSP
Sbjct: 487 NQLQIARSQIELASNVRKRIAS-PVPGPLATNIDLLTEMTHVTDAKLPSAQSNTISSQSP 539

Query: 541 LVQTMVSFPGPKTSEEENKRAAAAAVAAKLAASTSSAQMLTSVLSSLVAEEAASMNGSLK 600
           LVQ MVS  GPK+SEEENKRAAAAAVAAKLAASTSSAQMLTSVLSSLVAEEAASMNG LK
Sbjct: 547 LVQAMVSSAGPKSSEEENKRAAAAAVAAKLAASTSSAQMLTSVLSSLVAEEAASMNGGLK 539

Query: 601 SSGFSSLSIFSPDKRQKLEKPMPNSDVSSSDGVGASFVTPM-QQQLTSLPLAQSVNGQPV 660
           S+GF+SLSIFSP+KRQKLEKPMP SD  SSDG+GASFVTPM QQQLT++ LAQS N QPV
Sbjct: 607 SAGFASLSIFSPEKRQKLEKPMPISD--SSDGIGASFVTPMQQQQLTNVALAQSANIQPV 539

Query: 661 SQGNPSQASFAPPPPPAPPSLSSTPPVNQYAQPGGLVGVLPYNFGAYSLPPPPPLPPHIA 720
           SQ N SQASFAPPPPPA       PPVNQYAQ GG++GVLPYNFGAYSLPPPPPLPPHI 
Sbjct: 667 SQANQSQASFAPPPPPA-------PPVNQYAQSGGIMGVLPYNFGAYSLPPPPPLPPHIV 539

Query: 721 MGLSRPTSQPPPQQLQQPQQSQPTSAGFYRPPGIGFYGQGQQSTPPPVPRQ 771
           MGLSRPTSQPP QQ QQPQQSQPTSAGFYRPPGIGFYGQGQQSTPPPVPRQ
Sbjct: 727 MGLSRPTSQPPQQQPQQPQQSQPTSAGFYRPPGIGFYGQGQQSTPPPVPRQ 539

BLAST of CaUC10G194330 vs. NCBI nr
Match: XP_022152017.1 (regulation of nuclear pre-mRNA domain-containing protein 1B-like [Momordica charantia] >XP_022152018.1 regulation of nuclear pre-mRNA domain-containing protein 1B-like [Momordica charantia])

HSP 1 Score: 774.2 bits (1998), Expect = 1.0e-219
Identity = 478/761 (62.81%), Positives = 494/761 (64.91%), Query Frame = 0

Query: 11  ALSRWCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLANDILQNSRRKGSEFVNEFWKV 70
           +LSRWCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLANDILQNSRRKGSEFVNEFWKV
Sbjct: 27  SLSRWCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLANDILQNSRRKGSEFVNEFWKV 86

Query: 71  LPGALKYVYDHGDEGGKKAVARLSWSHRGSSDVIAKLAAGGSRCRVGGGYLRNWASEVAF 130
           LPGALKYVYDHGD+ GKKAVARL                           +  W      
Sbjct: 87  LPGALKYVYDHGDDSGKKAVARL---------------------------VDIWE----- 146

Query: 131 ARVVHCIRVVRVGQRKVFGSRGQSLKDEMLGKIPPPTQLPSSNGKSSNPIKIVKRDAHSV 190
                        +RKVFGSRGQSLKDEMLGK PPP  LPSSNGKSSNPIKIVKRDAHSV
Sbjct: 147 -------------ERKVFGSRGQSLKDEMLGKNPPP--LPSSNGKSSNPIKIVKRDAHSV 206

Query: 191 RIKLAVGGVPEKILTAFQSVLDEHLNEDAALNNCSSATHHLNQIEEDLNASLAQDFGFAE 250
           RIKLAVGGVPEKILTAFQSVLDEHLNED ALNNCSSATHHLNQIEEDLNA+LAQ      
Sbjct: 207 RIKLAVGGVPEKILTAFQSVLDEHLNEDTALNNCSSATHHLNQIEEDLNATLAQ------ 266

Query: 251 LVEIVFPSTQVVLLFANFVQTCRDGGVNGDSERVEMGCKATVMGKGSRFGGSLATLTKSE 310
                                                                       
Sbjct: 267 ------------------------------------------------------------ 326

Query: 311 ARRKLSSVFDPGFSSRDKETQDQINVNMIEVKEDEHSMSKRSVGRSKPEPPKKGHFKSPP 370
                                                                       
Sbjct: 327 ------------------------------------------------------------ 386

Query: 371 NESVFKILPLDKPPKEAQRSIKVLAKGFVDEWASSVSFFIALTCISFSLYLPSKLGTQPR 430
                                                                  GTQPR
Sbjct: 387 -------------------------------------------------------GTQPR 446

Query: 431 SALLDDLQEQENVIQECIRQLESVEATRASLVSLLVEALQDQESKLELVRNQLQIARSQI 490
           SALLDDLQEQE+VIQ+CI QLESVEATR SLVSLLVEALQDQESKLELVRNQLQIARSQI
Sbjct: 447 SALLDDLQEQESVIQQCIGQLESVEATRGSLVSLLVEALQDQESKLELVRNQLQIARSQI 506

Query: 491 ELASNVRKRFTSTPVPGSSATTMDLLTEMTHVTDTKLSSVQQNIISSQSPLVQTMVSFPG 550
           ELASNVRKR T TPVPG  A T+DLLTEMTHV DTKL+SVQ N ISSQSPLVQ MVSF G
Sbjct: 507 ELASNVRKRLT-TPVPGPLAATIDLLTEMTHVADTKLTSVQSNTISSQSPLVQAMVSFAG 557

Query: 551 PKTSEEENKRAAAAAVAAKLAASTSSAQMLTSVLSSLVAEEAASMNGSLKSSGFSSLSIF 610
           PKTSEEENKRAAAAAVAAKLAASTSSAQMLTSVLSSLVAEEAASMNGSLKSSGF+SL IF
Sbjct: 567 PKTSEEENKRAAAAAVAAKLAASTSSAQMLTSVLSSLVAEEAASMNGSLKSSGFASLPIF 557

Query: 611 SPDKRQKLEKPMPNSDVSSSDGVGASFVTPM-QQQLTSLPLAQSVNGQPVSQGNPSQASF 670
           SP+KRQKLEKPMP SDV+SSD +G SFVT M QQQLTS+ LAQS N QPVSQ NPSQASF
Sbjct: 627 SPEKRQKLEKPMPISDVNSSDSIG-SFVTSMQQQQLTSVALAQSANVQPVSQANPSQASF 557

Query: 671 APPPPPAPPSLSSTPPVNQYAQPGGLVGVLPYNFGAYSLPPPPPLPPHIAMGLSRPTSQP 730
            PPPPPAPPSLSST PVN Y Q G L+GVLPY FGAYSLPPPPPLPPHIAMGL+RPTSQP
Sbjct: 687 TPPPPPAPPSLSSTTPVNPYPQSGALMGVLPYGFGAYSLPPPPPLPPHIAMGLTRPTSQP 557

Query: 731 PPQQLQQPQQSQPTSAGFYRPPGIGFYGQGQQSTPPPVPRQ 771
           P QQ QQPQQSQPTS GFYRPPGIGFYGQ QQSTPPPVPRQ
Sbjct: 747 PQQQPQQPQQSQPTSGGFYRPPGIGFYGQSQQSTPPPVPRQ 557

BLAST of CaUC10G194330 vs. ExPASy Swiss-Prot
Match: Q9NQG5 (Regulation of nuclear pre-mRNA domain-containing protein 1B OS=Homo sapiens OX=9606 GN=RPRD1B PE=1 SV=1)

HSP 1 Score: 80.9 bits (198), Expect = 7.1e-14
Identity = 41/89 (46.07%), Positives = 53/89 (59.55%), Query Frame = 0

Query: 12  LSRWCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLANDILQNSRRKGSEFVNEFWKVL 71
           LS W I HRK A  IV  W +    ++  ++++FLYLAND++QNS+RKG EF  EF  VL
Sbjct: 26  LSLWLIHHRKHAGPIVSVWHRELRKAKSNRKLTFLYLANDVIQNSKRKGPEFTREFESVL 85

Query: 72  PGALKYVYDHGDEGGKKAVARL--SWSHR 99
             A  +V    DEG KK + RL   W  R
Sbjct: 86  VDAFSHVAREADEGCKKPLERLLNIWQER 114

BLAST of CaUC10G194330 vs. ExPASy Swiss-Prot
Match: Q9CSU0 (Regulation of nuclear pre-mRNA domain-containing protein 1B OS=Mus musculus OX=10090 GN=Rprd1b PE=1 SV=2)

HSP 1 Score: 80.9 bits (198), Expect = 7.1e-14
Identity = 41/89 (46.07%), Positives = 53/89 (59.55%), Query Frame = 0

Query: 12  LSRWCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLANDILQNSRRKGSEFVNEFWKVL 71
           LS W I HRK A  IV  W +    ++  ++++FLYLAND++QNS+RKG EF  EF  VL
Sbjct: 26  LSLWLIHHRKHAGPIVSVWHRELRKAKSNRKLTFLYLANDVIQNSKRKGPEFTREFESVL 85

Query: 72  PGALKYVYDHGDEGGKKAVARL--SWSHR 99
             A  +V    DEG KK + RL   W  R
Sbjct: 86  VDAFSHVAREADEGCKKPLERLLNIWQER 114

BLAST of CaUC10G194330 vs. ExPASy Swiss-Prot
Match: Q0P5J9 (Regulation of nuclear pre-mRNA domain-containing protein 1A OS=Bos taurus OX=9913 GN=RPRD1A PE=2 SV=2)

HSP 1 Score: 79.0 bits (193), Expect = 2.7e-13
Identity = 40/100 (40.00%), Positives = 61/100 (61.00%), Query Frame = 0

Query: 12  LSRWCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLANDILQNSRRKGSEFVNEFWKVL 71
           LS W I HRK ++ IV  W++    ++  ++++FLYLAND++QNS+RKG EF  +F  V+
Sbjct: 26  LSLWLIHHRKHSRPIVTVWERELRKAKPNRKLTFLYLANDVIQNSKRKGPEFTKDFAPVI 85

Query: 72  PGALKYVYDHGDEGGKKAVARL--SWSHRG--SSDVIAKL 108
             A K+V    DE  KK + R+   W  R    +DV+ +L
Sbjct: 86  VEAFKHVSSETDESCKKHLGRVLSIWEERSVYENDVLEQL 125

BLAST of CaUC10G194330 vs. ExPASy Swiss-Prot
Match: Q96P16 (Regulation of nuclear pre-mRNA domain-containing protein 1A OS=Homo sapiens OX=9606 GN=RPRD1A PE=1 SV=1)

HSP 1 Score: 79.0 bits (193), Expect = 2.7e-13
Identity = 40/100 (40.00%), Positives = 61/100 (61.00%), Query Frame = 0

Query: 12  LSRWCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLANDILQNSRRKGSEFVNEFWKVL 71
           LS W I HRK ++ IV  W++    ++  ++++FLYLAND++QNS+RKG EF  +F  V+
Sbjct: 26  LSLWLIHHRKHSRPIVTVWERELRKAKPNRKLTFLYLANDVIQNSKRKGPEFTKDFAPVI 85

Query: 72  PGALKYVYDHGDEGGKKAVARL--SWSHRG--SSDVIAKL 108
             A K+V    DE  KK + R+   W  R    +DV+ +L
Sbjct: 86  VEAFKHVSSETDESCKKHLGRVLSIWEERSVYENDVLEQL 125

BLAST of CaUC10G194330 vs. ExPASy Swiss-Prot
Match: Q8VDS4 (Regulation of nuclear pre-mRNA domain-containing protein 1A OS=Mus musculus OX=10090 GN=Rprd1a PE=1 SV=1)

HSP 1 Score: 79.0 bits (193), Expect = 2.7e-13
Identity = 40/100 (40.00%), Positives = 61/100 (61.00%), Query Frame = 0

Query: 12  LSRWCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLANDILQNSRRKGSEFVNEFWKVL 71
           LS W I HRK ++ IV  W++    ++  ++++FLYLAND++QNS+RKG EF  +F  V+
Sbjct: 26  LSLWLIHHRKHSRPIVTVWERELRKAKPNRKLTFLYLANDVIQNSKRKGPEFTKDFAPVI 85

Query: 72  PGALKYVYDHGDEGGKKAVARL--SWSHRG--SSDVIAKL 108
             A K+V    DE  KK + R+   W  R    +DV+ +L
Sbjct: 86  VEAFKHVSSETDESCKKHLGRVLSIWEERSVYENDVLEQL 125

BLAST of CaUC10G194330 vs. ExPASy TrEMBL
Match: A0A1S3B7H0 (regulation of nuclear pre-mRNA domain-containing protein 1B-like OS=Cucumis melo OX=3656 GN=LOC103486842 PE=4 SV=1)

HSP 1 Score: 832.0 bits (2148), Expect = 2.0e-237
Identity = 496/760 (65.26%), Positives = 508/760 (66.84%), Query Frame = 0

Query: 11  ALSRWCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLANDILQNSRRKGSEFVNEFWKV 70
           +LS+WCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLANDILQNSRRKGSEFVNEFWKV
Sbjct: 27  SLSKWCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLANDILQNSRRKGSEFVNEFWKV 86

Query: 71  LPGALKYVYDHGDEGGKKAVARLSWSHRGSSDVIAKLAAGGSRCRVGGGYLRNWASEVAF 130
           LPGALKYVYDHGDE GKKAVARL                           +  W      
Sbjct: 87  LPGALKYVYDHGDESGKKAVARL---------------------------VNIWE----- 146

Query: 131 ARVVHCIRVVRVGQRKVFGSRGQSLKDEMLGKIPPPTQLPSSNGKSSNPIKIVKRDAHSV 190
                        +RKVFGSRGQSLKDEMLGK PPPT LPSSNGKSSNPIKIVKRDAHSV
Sbjct: 147 -------------ERKVFGSRGQSLKDEMLGKNPPPTPLPSSNGKSSNPIKIVKRDAHSV 206

Query: 191 RIKLAVGGVPEKILTAFQSVLDEHLNEDAALNNCSSATHHLNQIEEDLNASLAQDFGFAE 250
           RIKLAVGGVPEKILTAFQSVLDEHLNEDAALNNCSSATHHLNQIEEDLN SLAQ      
Sbjct: 207 RIKLAVGGVPEKILTAFQSVLDEHLNEDAALNNCSSATHHLNQIEEDLNVSLAQ------ 266

Query: 251 LVEIVFPSTQVVLLFANFVQTCRDGGVNGDSERVEMGCKATVMGKGSRFGGSLATLTKSE 310
                                                                       
Sbjct: 267 ------------------------------------------------------------ 326

Query: 311 ARRKLSSVFDPGFSSRDKETQDQINVNMIEVKEDEHSMSKRSVGRSKPEPPKKGHFKSPP 370
                                                                       
Sbjct: 327 ------------------------------------------------------------ 386

Query: 371 NESVFKILPLDKPPKEAQRSIKVLAKGFVDEWASSVSFFIALTCISFSLYLPSKLGTQPR 430
                                                                  GTQPR
Sbjct: 387 -------------------------------------------------------GTQPR 446

Query: 431 SALLDDLQEQENVIQECIRQLESVEATRASLVSLLVEALQDQESKLELVRNQLQIARSQI 490
           S LLDDLQ+QE VIQECIRQLE VEATRASLVSLLVEALQDQESKLELVRNQLQ+ARSQI
Sbjct: 447 SGLLDDLQDQETVIQECIRQLEGVEATRASLVSLLVEALQDQESKLELVRNQLQVARSQI 506

Query: 491 ELASNVRKRFTSTPVPGSSATTMDLLTEMTHVTDTKLSSVQQNIISSQSPLVQTMVSFPG 550
           ELASNVRKRFTST  PG SATT+DLLTEMTHVTDTKLSSVQQNIISSQ PLVQ M SFPG
Sbjct: 507 ELASNVRKRFTSTTAPGPSATTIDLLTEMTHVTDTKLSSVQQNIISSQPPLVQAMGSFPG 558

Query: 551 PKTSEEENKRAAAAAVAAKLAASTSSAQMLTSVLSSLVAEEAASMNGSLKSSGFSSLSIF 610
           PKTSEEENKRAAAAAVAAKLAASTSSAQMLTSVLSSLVAEEAASMNG LKSSGFSSL  F
Sbjct: 567 PKTSEEENKRAAAAAVAAKLAASTSSAQMLTSVLSSLVAEEAASMNGGLKSSGFSSL--F 558

Query: 611 SPDKRQKLEKPMPNSDVSSSDGVGASFVTPMQQQLTSLPLAQSVNGQPVSQGNPSQASFA 670
           SP+KRQKLEKPMP SDVSSSDGVGASFV PMQQQ+TS+PLAQS NGQPVSQ NPSQASFA
Sbjct: 627 SPEKRQKLEKPMPISDVSSSDGVGASFVAPMQQQMTSMPLAQSANGQPVSQANPSQASFA 558

Query: 671 PPPPPAPPSLSSTPPVNQYAQPGGLVGVLPYNFGAYSLPPPPPLPPHIAMGLSRPTSQPP 730
           PPPPP PPSLSSTPPVNQYAQ GGL+GVLPYNFGAYSLPPPPPLPPHIAMGLSRPTSQPP
Sbjct: 687 PPPPPVPPSLSSTPPVNQYAQSGGLIGVLPYNFGAYSLPPPPPLPPHIAMGLSRPTSQPP 558

Query: 731 PQQLQQPQQSQPTSAGFYRPPGIGFYGQGQQSTPPPVPRQ 771
           PQQLQ PQQSQPTS+GFYRPPGIGFYGQGQQSTPPPVPRQ
Sbjct: 747 PQQLQPPQQSQPTSSGFYRPPGIGFYGQGQQSTPPPVPRQ 558

BLAST of CaUC10G194330 vs. ExPASy TrEMBL
Match: A0A6J1DF20 (regulation of nuclear pre-mRNA domain-containing protein 1B-like OS=Momordica charantia OX=3673 GN=LOC111019832 PE=4 SV=1)

HSP 1 Score: 774.2 bits (1998), Expect = 4.9e-220
Identity = 478/761 (62.81%), Positives = 494/761 (64.91%), Query Frame = 0

Query: 11  ALSRWCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLANDILQNSRRKGSEFVNEFWKV 70
           +LSRWCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLANDILQNSRRKGSEFVNEFWKV
Sbjct: 27  SLSRWCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLANDILQNSRRKGSEFVNEFWKV 86

Query: 71  LPGALKYVYDHGDEGGKKAVARLSWSHRGSSDVIAKLAAGGSRCRVGGGYLRNWASEVAF 130
           LPGALKYVYDHGD+ GKKAVARL                           +  W      
Sbjct: 87  LPGALKYVYDHGDDSGKKAVARL---------------------------VDIWE----- 146

Query: 131 ARVVHCIRVVRVGQRKVFGSRGQSLKDEMLGKIPPPTQLPSSNGKSSNPIKIVKRDAHSV 190
                        +RKVFGSRGQSLKDEMLGK PPP  LPSSNGKSSNPIKIVKRDAHSV
Sbjct: 147 -------------ERKVFGSRGQSLKDEMLGKNPPP--LPSSNGKSSNPIKIVKRDAHSV 206

Query: 191 RIKLAVGGVPEKILTAFQSVLDEHLNEDAALNNCSSATHHLNQIEEDLNASLAQDFGFAE 250
           RIKLAVGGVPEKILTAFQSVLDEHLNED ALNNCSSATHHLNQIEEDLNA+LAQ      
Sbjct: 207 RIKLAVGGVPEKILTAFQSVLDEHLNEDTALNNCSSATHHLNQIEEDLNATLAQ------ 266

Query: 251 LVEIVFPSTQVVLLFANFVQTCRDGGVNGDSERVEMGCKATVMGKGSRFGGSLATLTKSE 310
                                                                       
Sbjct: 267 ------------------------------------------------------------ 326

Query: 311 ARRKLSSVFDPGFSSRDKETQDQINVNMIEVKEDEHSMSKRSVGRSKPEPPKKGHFKSPP 370
                                                                       
Sbjct: 327 ------------------------------------------------------------ 386

Query: 371 NESVFKILPLDKPPKEAQRSIKVLAKGFVDEWASSVSFFIALTCISFSLYLPSKLGTQPR 430
                                                                  GTQPR
Sbjct: 387 -------------------------------------------------------GTQPR 446

Query: 431 SALLDDLQEQENVIQECIRQLESVEATRASLVSLLVEALQDQESKLELVRNQLQIARSQI 490
           SALLDDLQEQE+VIQ+CI QLESVEATR SLVSLLVEALQDQESKLELVRNQLQIARSQI
Sbjct: 447 SALLDDLQEQESVIQQCIGQLESVEATRGSLVSLLVEALQDQESKLELVRNQLQIARSQI 506

Query: 491 ELASNVRKRFTSTPVPGSSATTMDLLTEMTHVTDTKLSSVQQNIISSQSPLVQTMVSFPG 550
           ELASNVRKR T TPVPG  A T+DLLTEMTHV DTKL+SVQ N ISSQSPLVQ MVSF G
Sbjct: 507 ELASNVRKRLT-TPVPGPLAATIDLLTEMTHVADTKLTSVQSNTISSQSPLVQAMVSFAG 557

Query: 551 PKTSEEENKRAAAAAVAAKLAASTSSAQMLTSVLSSLVAEEAASMNGSLKSSGFSSLSIF 610
           PKTSEEENKRAAAAAVAAKLAASTSSAQMLTSVLSSLVAEEAASMNGSLKSSGF+SL IF
Sbjct: 567 PKTSEEENKRAAAAAVAAKLAASTSSAQMLTSVLSSLVAEEAASMNGSLKSSGFASLPIF 557

Query: 611 SPDKRQKLEKPMPNSDVSSSDGVGASFVTPM-QQQLTSLPLAQSVNGQPVSQGNPSQASF 670
           SP+KRQKLEKPMP SDV+SSD +G SFVT M QQQLTS+ LAQS N QPVSQ NPSQASF
Sbjct: 627 SPEKRQKLEKPMPISDVNSSDSIG-SFVTSMQQQQLTSVALAQSANVQPVSQANPSQASF 557

Query: 671 APPPPPAPPSLSSTPPVNQYAQPGGLVGVLPYNFGAYSLPPPPPLPPHIAMGLSRPTSQP 730
            PPPPPAPPSLSST PVN Y Q G L+GVLPY FGAYSLPPPPPLPPHIAMGL+RPTSQP
Sbjct: 687 TPPPPPAPPSLSSTTPVNPYPQSGALMGVLPYGFGAYSLPPPPPLPPHIAMGLTRPTSQP 557

Query: 731 PPQQLQQPQQSQPTSAGFYRPPGIGFYGQGQQSTPPPVPRQ 771
           P QQ QQPQQSQPTS GFYRPPGIGFYGQ QQSTPPPVPRQ
Sbjct: 747 PQQQPQQPQQSQPTSGGFYRPPGIGFYGQSQQSTPPPVPRQ 557

BLAST of CaUC10G194330 vs. ExPASy TrEMBL
Match: A0A6J1F335 (UPF0400 protein C337.03-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111441698 PE=4 SV=1)

HSP 1 Score: 768.8 bits (1984), Expect = 2.1e-218
Identity = 472/763 (61.86%), Positives = 493/763 (64.61%), Query Frame = 0

Query: 11  ALSRWCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLANDILQNSRRKGSEFVNEFWKV 70
           +LSRWCISHRK+AKQIVETWDKLFNSSQ+EQRVSFLYLANDILQNSRRKGSEFVNEFWKV
Sbjct: 27  SLSRWCISHRKRAKQIVETWDKLFNSSQQEQRVSFLYLANDILQNSRRKGSEFVNEFWKV 86

Query: 71  LPGALKYVYDHGDEGGKKAVARLSWSHRGSSDVIAKLAAGGSRCRVGGGYLRNWASEVAF 130
           LPGALKYVYD GDE GKKA ARL                           +  W      
Sbjct: 87  LPGALKYVYDQGDENGKKAAARL---------------------------VNIWE----- 146

Query: 131 ARVVHCIRVVRVGQRKVFGSRGQSLKDEMLGKIPPPTQLPSSNGKSSNPIKIVKRDAHSV 190
                        +RKVFGSRGQ+LKDEMLGK PPPT LPSSNGKSSNP KIVKRDAHSV
Sbjct: 147 -------------ERKVFGSRGQNLKDEMLGKYPPPTPLPSSNGKSSNPKKIVKRDAHSV 206

Query: 191 RIKLAVGGVPEKILTAFQSVLDEHLNEDAALNNCSSATHHLNQIEEDLNASLAQDFGFAE 250
           RIKLAVGGVPE+ILTAFQSVLDEHLNEDAALN+CSSATHHLNQIEEDLNASLAQ      
Sbjct: 207 RIKLAVGGVPERILTAFQSVLDEHLNEDAALNDCSSATHHLNQIEEDLNASLAQ------ 266

Query: 251 LVEIVFPSTQVVLLFANFVQTCRDGGVNGDSERVEMGCKATVMGKGSRFGGSLATLTKSE 310
                                                                       
Sbjct: 267 ------------------------------------------------------------ 326

Query: 311 ARRKLSSVFDPGFSSRDKETQDQINVNMIEVKEDEHSMSKRSVGRSKPEPPKKGHFKSPP 370
                                                                       
Sbjct: 327 ------------------------------------------------------------ 386

Query: 371 NESVFKILPLDKPPKEAQRSIKVLAKGFVDEWASSVSFFIALTCISFSLYLPSKLGTQPR 430
                                                                  GTQPR
Sbjct: 387 -------------------------------------------------------GTQPR 446

Query: 431 SALLDDLQEQENVIQECIRQLESVEATRASLVSLLVEALQDQESKLELVRNQLQIARSQI 490
           SALLDDLQEQENVIQ CIRQLESVEATRASLVSLLVEAL DQESKLELVRNQLQIARSQI
Sbjct: 447 SALLDDLQEQENVIQRCIRQLESVEATRASLVSLLVEALLDQESKLELVRNQLQIARSQI 506

Query: 491 ELASNVRKRFTSTPVPGSSATTMDLLTEMTHVTDTKLSSVQQNIISSQSPLVQTMVSFPG 550
           ELASNVRKR TSTPVPG SATT+++ TE THV DTKL SVQQN I+S    VQ M SF G
Sbjct: 507 ELASNVRKRSTSTPVPGPSATTINVPTETTHVIDTKLPSVQQNAIASP---VQAMASFAG 558

Query: 551 PKTSEEENKRAAAAAVAAKLAASTSSAQMLTSVLSSLVAEEAASMNGSLKSSGFSSLSIF 610
           PKTSE+E KRAAAAAVAAKLAASTSSAQMLTSVLSSLVAEEAASMNGSLKSSGF+SLSIF
Sbjct: 567 PKTSEDETKRAAAAAVAAKLAASTSSAQMLTSVLSSLVAEEAASMNGSLKSSGFASLSIF 558

Query: 611 SPDKRQKLEKPMPNSDVSSSDGVGASFVTPM---QQQLTSLPLAQSVNGQPVSQGNPSQA 670
             +KRQKLEKPMP SDVS SDG+GASF+TPM   QQQLTS+PLAQS NGQPVSQ NP QA
Sbjct: 627 PSEKRQKLEKPMPISDVSGSDGIGASFITPMQQQQQQLTSVPLAQSANGQPVSQANPCQA 558

Query: 671 SFAPPPPPAPPSLSSTPPVNQYAQPGGLVGVLPYNFGAYSLPPPPPLPPHIAMGLSRPTS 730
           SFAPPPPPAPPSLSSTPPVNQYAQ GGL+GV PY+FGAYSLPPPPPLPPHIAMGLSRP S
Sbjct: 687 SFAPPPPPAPPSLSSTPPVNQYAQSGGLMGVSPYSFGAYSLPPPPPLPPHIAMGLSRPMS 558

Query: 731 QPPPQQLQQPQQSQPTSAGFYRPPGIGFYGQGQQSTPPPVPRQ 771
           QPP Q  QQPQ + PTSAGFYRPPGIGFYGQGQQSTPPPVPRQ
Sbjct: 747 QPPQQ--QQPQLTLPTSAGFYRPPGIGFYGQGQQSTPPPVPRQ 558

BLAST of CaUC10G194330 vs. ExPASy TrEMBL
Match: A0A6J1FGC3 (regulation of nuclear pre-mRNA domain-containing protein 1B-like OS=Cucurbita moschata OX=3662 GN=LOC111445436 PE=4 SV=1)

HSP 1 Score: 761.1 bits (1964), Expect = 4.3e-216
Identity = 472/761 (62.02%), Positives = 491/761 (64.52%), Query Frame = 0

Query: 11  ALSRWCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLANDILQNSRRKGSEFVNEFWKV 70
           +LSRWCISHRKKAKQIVETWDKLFNSSQK+QRVSFLYLANDILQNSRRKGSEFVNEFWKV
Sbjct: 27  SLSRWCISHRKKAKQIVETWDKLFNSSQKDQRVSFLYLANDILQNSRRKGSEFVNEFWKV 86

Query: 71  LPGALKYVYDHGDEGGKKAVARLSWSHRGSSDVIAKLAAGGSRCRVGGGYLRNWASEVAF 130
           LP +LKYVYDHGDE GKKAVARL                           +  W      
Sbjct: 87  LPVSLKYVYDHGDESGKKAVARL---------------------------VDIWE----- 146

Query: 131 ARVVHCIRVVRVGQRKVFGSRGQSLKDEMLGKIPPPTQLPSSNGKSSNPIKIVKRDAHSV 190
                        +RKVFGSRGQSLKDEMLGK PPP  LP+SNGKSSNPIKIVKRDAHSV
Sbjct: 147 -------------ERKVFGSRGQSLKDEMLGKNPPP--LPNSNGKSSNPIKIVKRDAHSV 206

Query: 191 RIKLAVGGVPEKILTAFQSVLDEHLNEDAALNNCSSATHHLNQIEEDLNASLAQDFGFAE 250
           RIKLAVGGVPEKILTAFQSVLDEHLNEDAALNNCSSATHHLNQIEEDLNAS+AQ      
Sbjct: 207 RIKLAVGGVPEKILTAFQSVLDEHLNEDAALNNCSSATHHLNQIEEDLNASIAQ------ 266

Query: 251 LVEIVFPSTQVVLLFANFVQTCRDGGVNGDSERVEMGCKATVMGKGSRFGGSLATLTKSE 310
                                                                       
Sbjct: 267 ------------------------------------------------------------ 326

Query: 311 ARRKLSSVFDPGFSSRDKETQDQINVNMIEVKEDEHSMSKRSVGRSKPEPPKKGHFKSPP 370
                                                                       
Sbjct: 327 ------------------------------------------------------------ 386

Query: 371 NESVFKILPLDKPPKEAQRSIKVLAKGFVDEWASSVSFFIALTCISFSLYLPSKLGTQPR 430
                                                                  GTQPR
Sbjct: 387 -------------------------------------------------------GTQPR 446

Query: 431 SALLDDLQEQENVIQECIRQLESVEATRASLVSLLVEALQDQESKLELVRNQLQIARSQI 490
           SALLDDLQEQENV+Q+CI QLESVEATRASLVSLLVEALQDQESKLELVRNQLQIARSQI
Sbjct: 447 SALLDDLQEQENVVQQCIGQLESVEATRASLVSLLVEALQDQESKLELVRNQLQIARSQI 506

Query: 491 ELASNVRKRFTSTPVPGSSATTMDLLTEMTHVTDTKLSSVQQNIISSQSPLVQTMVSFPG 550
           ELASNVRKR  S PVPG  AT +DLLTEMTHVTD KL S Q N ISSQSPLVQ MVS  G
Sbjct: 507 ELASNVRKRIVS-PVPGPLATNIDLLTEMTHVTDVKLPSAQSNTISSQSPLVQAMVSSAG 549

Query: 551 PKTSEEENKRAAAAAVAAKLAASTSSAQMLTSVLSSLVAEEAASMNGSLKSSGFSSLSIF 610
           PK+SEEENKRAAAAAVAAKLAASTSSAQMLTSVLSSLVAEEAASMNG LKS+GF+SLSIF
Sbjct: 567 PKSSEEENKRAAAAAVAAKLAASTSSAQMLTSVLSSLVAEEAASMNGGLKSAGFASLSIF 549

Query: 611 SPDKRQKLEKPMPNSDVSSSDGVGASFVTPM-QQQLTSLPLAQSVNGQPVSQGNPSQASF 670
           SP+KRQKLEKPMP SD  SSDG+GASFVTPM QQQLT++ LAQS N QPVSQ N SQASF
Sbjct: 627 SPEKRQKLEKPMPISD--SSDGIGASFVTPMQQQQLTNVALAQSANIQPVSQANQSQASF 549

Query: 671 APPPPPAPPSLSSTPPVNQYAQPGGLVGVLPYNFGAYSLPPPPPLPPHIAMGLSRPTSQP 730
           APPPPPA       PPVNQYAQ GG++GVLPYNFGAYSLPPPPPLPPHI MGLSRPTSQP
Sbjct: 687 APPPPPA-------PPVNQYAQSGGIMGVLPYNFGAYSLPPPPPLPPHIVMGLSRPTSQP 549

Query: 731 PPQQLQQPQQSQPTSAGFYRPPGIGFYGQGQQSTPPPVPRQ 771
           P QQ QQPQQSQPTSAGFYRPPGIGFYGQGQQSTPPPVPRQ
Sbjct: 747 PQQQPQQPQQSQPTSAGFYRPPGIGFYGQGQQSTPPPVPRQ 549

BLAST of CaUC10G194330 vs. ExPASy TrEMBL
Match: A0A6J1I2G4 (regulation of nuclear pre-mRNA domain-containing protein 1B-like OS=Cucurbita maxima OX=3661 GN=LOC111470291 PE=4 SV=1)

HSP 1 Score: 759.6 bits (1960), Expect = 1.3e-215
Identity = 474/763 (62.12%), Positives = 493/763 (64.61%), Query Frame = 0

Query: 11  ALSRWCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLANDILQNSRRKGSEFVNEFWKV 70
           +LSRWCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLANDILQNSRRKGSEFVNEFWKV
Sbjct: 27  SLSRWCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLANDILQNSRRKGSEFVNEFWKV 86

Query: 71  LPGALKYVYDHGDEGGKKAVARLSWSHRGSSDVIAKLAAGGSRCRVGGGYLRNWASEVAF 130
           LP +LKYVYDHGDE GKKAVARL                           +  W      
Sbjct: 87  LPVSLKYVYDHGDESGKKAVARL---------------------------VDIWE----- 146

Query: 131 ARVVHCIRVVRVGQRKVFGSRGQSLKDEMLGKIPPPTQLPSSNGKSSNPIKIVKRDAHSV 190
                        +RKVFGSRGQSLKDEMLGK PPP  LP+SNGKSSNPIKIVKRDAHSV
Sbjct: 147 -------------ERKVFGSRGQSLKDEMLGKNPPP--LPNSNGKSSNPIKIVKRDAHSV 206

Query: 191 RIKLAVGGVPEKILTAFQSVLDEHLNEDAALNNCSSATHHLNQIEEDLNASLAQDFGFAE 250
           RIKLAVGGVPEKILTAFQSVLDEHLNEDAALNNCSSATHHLNQIEEDLNAS+AQ      
Sbjct: 207 RIKLAVGGVPEKILTAFQSVLDEHLNEDAALNNCSSATHHLNQIEEDLNASIAQ------ 266

Query: 251 LVEIVFPSTQVVLLFANFVQTCRDGGVNGDSERVEMGCKATVMGKGSRFGGSLATLTKSE 310
                                                                       
Sbjct: 267 ------------------------------------------------------------ 326

Query: 311 ARRKLSSVFDPGFSSRDKETQDQINVNMIEVKEDEHSMSKRSVGRSKPEPPKKGHFKSPP 370
                                                                       
Sbjct: 327 ------------------------------------------------------------ 386

Query: 371 NESVFKILPLDKPPKEAQRSIKVLAKGFVDEWASSVSFFIALTCISFSLYLPSKLGTQPR 430
                                                                  GTQPR
Sbjct: 387 -------------------------------------------------------GTQPR 446

Query: 431 SALLDDLQEQENVIQECIRQLESVEATRASLVSLLVEALQDQESKLELVRNQLQIARSQI 490
           SALLDDLQEQENV+Q+CI QLESVEATRASLVSLLVEALQDQESKLELVRNQLQIARSQI
Sbjct: 447 SALLDDLQEQENVVQQCIGQLESVEATRASLVSLLVEALQDQESKLELVRNQLQIARSQI 506

Query: 491 ELASNVRKRFTSTPVPGSSATTMDLLTEMTHVTDTKLSSVQQNIISSQSPLVQTMVSFPG 550
           ELASNVRKR  S PVPG  AT++DLLTEMTHVTD KLSS Q N ISSQSPLVQ MVS  G
Sbjct: 507 ELASNVRKRIAS-PVPGPLATSIDLLTEMTHVTDAKLSSAQSNTISSQSPLVQAMVSSAG 551

Query: 551 PKTSEEENKRAAAAAVAAKLAASTSSAQMLTSVLSSLVAEEAASMNGSLKSSGFSSLSIF 610
           PK+SEEENKRAAAAAVAAKLAASTSSAQMLTSVLSSLVAEEAASMNG LKS+GF+SLSIF
Sbjct: 567 PKSSEEENKRAAAAAVAAKLAASTSSAQMLTSVLSSLVAEEAASMNGGLKSAGFASLSIF 551

Query: 611 SPDKRQKLEKPMPNSDVSSSDGVGASFVTPM-QQQLTSLPLAQSVNGQPVSQGNPSQASF 670
           SP+KRQKLEKPMP SD  SSDG+GASFVTPM QQQLT++ LAQS N QPVSQ N SQASF
Sbjct: 627 SPEKRQKLEKPMPISD--SSDGIGASFVTPMQQQQLTNVALAQSANIQPVSQANQSQASF 551

Query: 671 APPPPPAPPSLSSTPPVNQYAQPGGLVGVLPYNFGAYSLPPPPPLPPHIAMGLSRPTSQP 730
           APPPPPA       PPVNQYAQ GG++GVLPYNFGAYSLPPPPPLPPHI MGLSRPTSQP
Sbjct: 687 APPPPPA-------PPVNQYAQSGGIMGVLPYNFGAYSLPPPPPLPPHIVMGLSRPTSQP 551

Query: 731 PPQ--QLQQPQQSQPTSAGFYRPPGIGFYGQGQQSTPPPVPRQ 771
           P Q  Q QQPQQSQPTSAGFYRPPGIGFYGQGQQSTPPPVPRQ
Sbjct: 747 PQQQPQPQQPQQSQPTSAGFYRPPGIGFYGQGQQSTPPPVPRQ 551

BLAST of CaUC10G194330 vs. TAIR 10
Match: AT5G10060.1 (ENTH/VHS family protein)

HSP 1 Score: 157.1 bits (396), Expect = 5.5e-38
Identity = 200/757 (26.42%), Positives = 283/757 (37.38%), Query Frame = 0

Query: 12  LSRWCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLANDILQNSRRKGSEFVNEFWKVL 71
           LS WCI +R KA+ IV TW+K F+S++ +Q+V  LYLANDILQNS+R+G+EFV EFW VL
Sbjct: 27  LSHWCIFNRSKAELIVTTWEKQFHSTEMDQKVPLLYLANDILQNSKRQGNEFVQEFWNVL 86

Query: 72  PGALKYVYDHGDEGGKKAVARLSWSHRGSSDVIAKLAAGGSRCRVGGGYLRNWASEVAFA 131
           P ALK +   GD+ GK AVAR+                           ++ W       
Sbjct: 87  PKALKDIVSQGDDNGKSAVARV---------------------------IKIWE------ 146

Query: 132 RVVHCIRVVRVGQRKVFGSRGQSLKDEMLGK-IPPPTQLPSSNGKSSNPIKIVKRDAHSV 191
                       +R+VFGSR +SLKD MLG+ +P P  +     + S   K  KR++ S 
Sbjct: 147 ------------ERRVFGSRSKSLKDVMLGEDVPLPLDISKKRPRGS---KSSKRESKSS 206

Query: 192 RIKLA-VGGVPEKILTAFQSVLDEHLNEDAALNNCSSATHHLNQIEEDLNASLAQDFGFA 251
           R KLA  GGV EKI +A+  V+ E+ NE+A +N C SA   + ++E+D            
Sbjct: 207 RTKLASSGGVAEKIASAYHLVVAENSNEEAEMNKCKSAVKRIRKMEKD------------ 266

Query: 252 ELVEIVFPSTQVVLLFANFVQTCRDGGVNGDSERVEMGCKATVMGKGSRFGGSLATLTKS 311
                                             VE  C                     
Sbjct: 267 ----------------------------------VEEAC--------------------- 326

Query: 312 EARRKLSSVFDPGFSSRDKETQDQINVNMIEVKEDEHSMSKRSVGRSKPEPPKKGHFKSP 371
                                                     S  +  P+          
Sbjct: 327 ------------------------------------------STAKDNPK---------- 386

Query: 372 PNESVFKILPLDKPPKEAQRSIKVLAKGFVDEWASSVSFFIALTCISFSLYLPSKLGTQP 431
                                                                       
Sbjct: 387 ------------------------------------------------------------ 446

Query: 432 RSALLDDLQEQENVIQECIRQLESVEATRASLVSLLVEALQDQESKLELVRNQLQIARSQ 491
           R +L  +L+E+E ++++CI +L+SV+ +R+SLV+ L +AL++QES+L+ ++ Q+Q+A+ Q
Sbjct: 447 RKSLAKELEEEEYLLRQCIEKLKSVQGSRSSLVNQLKDALREQESELDNLKAQIQVAKEQ 469

Query: 492 IELASNVRKRFTSTPVPGSSATTMDLLTEMTHVTDTKLSSVQQNIISSQSPLVQTMVSFP 551
            E A N++KR           T    +TE                               
Sbjct: 507 TEEAQNMQKRLNDEDYTSKQTTAATTITE------------------------------T 469

Query: 552 GPKTSEEENKRAAAAAVAAKLAASTSSAQMLTSVLSSLVAEEAASMNGSLKSSGFSSLSI 611
              T   +  +   A++AA L ASTSS  ++ SVLSS  AE       + K+SG S    
Sbjct: 567 NDNTKSGQASKMTPASIAAMLTASTSSHMIMQSVLSSFAAE-------ATKTSGLS---- 469

Query: 612 FSPDKRQKLEKPMPNSDVSSSDGVGASFVTPMQQQLTSLPLAQSVNGQPVSQGNPSQASF 671
                  K E  +P SD +      ASF +    Q          N  P +QG   Q   
Sbjct: 627 -------KSESTVPVSDTN------ASFPSYNNSQ----------NQTPTTQG---QYHV 469

Query: 672 APPPPPAPPSLSSTPPVNQYAQPGGLVGVLPYNFGAYSLP----PPPPLPPHIAMGLSRP 731
            P PP  PP     P +N            PY FG   L     PPPP PPH+ +G  +P
Sbjct: 687 IPNPP--PPQFLKPPVMNN-----------PYAFGNIPLMPPGLPPPPPPPHL-IGNQQP 469

Query: 732 TSQPPPQQLQQPQQSQPTSAGFYRPPGIGFYGQGQQS 763
             Q P     Q  Q  PT    ++PPGI +YG    S
Sbjct: 747 --QIPQSNSAQQSQQGPT----FQPPGIMYYGAPHHS 469

BLAST of CaUC10G194330 vs. TAIR 10
Match: AT3G26990.1 (ENTH/VHS family protein )

HSP 1 Score: 139.8 bits (351), Expect = 9.1e-33
Identity = 203/771 (26.33%), Positives = 285/771 (36.96%), Query Frame = 0

Query: 12  LSRWCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLANDILQNSRRKGSEFVNEFWKVL 71
           LS WCI H  KAK +VETW + F+ + +EQR+++LYLANDILQNSRRKGSEFV EFWKVL
Sbjct: 27  LSHWCIFHMNKAKHVVETWGRQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVL 86

Query: 72  PGALKYVYDHGDEGGKKAVARLSWSHRGSSDVIAKLAAGGSRCRVGGGYLRNWASEVAFA 131
           P AL+ + ++GD+ G+K+  RL                           +  W       
Sbjct: 87  PDALRDMIENGDDFGRKSARRL---------------------------VNIWE------ 146

Query: 132 RVVHCIRVVRVGQRKVFGSRGQSLKDEMLGKIPPPTQLPSSNGKSSNPIKIVKRDAHSVR 191
                       +RKVFGSRGQ LK+E+LG+ P        NG          R+ + V 
Sbjct: 147 ------------ERKVFGSRGQILKEELLGRQP-------ENG---------TRNGNLVP 206

Query: 192 IKLAV------GGVPEKILTAFQSVLDEHLNEDAALNNCSSATHHLNQIEEDLNASLAQD 251
           +KL+V      G   EK+++A + +    ++EDA +   ++A  +L              
Sbjct: 207 LKLSVPQRQVNGSTLEKVVSAVEVLHGVQIDEDALVGKSTNAAGYLE------------- 266

Query: 252 FGFAELVEIVFPSTQVVLLFANFVQTCRDGGVNGDSERVEMGCKATVMGKGSRFGGSLAT 311
                                                      KAT              
Sbjct: 267 -------------------------------------------KAT-------------- 326

Query: 312 LTKSEARRKLSSVFDPGFSSRDKETQDQINVNMIEVKEDEHSMSKRSVGRSKPEPPKKGH 371
               E  R LSS                                                
Sbjct: 327 ---QEVERDLSS------------------------------------------------ 386

Query: 372 FKSPPNESVFKILPLDKPPKEAQRSIKVLAKGFVDEWASSVSFFIALTCISFSLYLPSKL 431
                                                                       
Sbjct: 387 ------------------------------------------------------------ 446

Query: 432 GTQPRSALLDDLQEQENVIQECIRQLESVEATRASLVSLLVEALQDQESKLELVRNQLQI 491
           G  P  A++ +LQ Q  ++++CI QL ++E +R SL+S L EALQ+QE KLE VRN LQI
Sbjct: 447 GHAPGPAVVKELQGQHVILRDCIEQLGAMETSRTSLISHLREALQEQELKLEQVRNHLQI 506

Query: 492 ARSQIELASNVRKRFTSTPVPGSS---ATTMDLLTEMTHVTDTKLSSVQQNIISSQSPLV 551
           AR Q +   ++ ++       GSS   AT  +   E+  V+ T  ++  Q+   S     
Sbjct: 507 ARFQSDRTGDLCRQLLDH--GGSSQPPATEEEESKEVIKVSST--AAAPQSFTHSDVEQS 513

Query: 552 QTMVSFPGPKTSEEENKRAAAAAVAAKLAASTSSAQMLTSVLSSLVAEEAASMNGSLKSS 611
             ++    P  S E+ ++ AAAAV AKL ASTSSA+ML+ VLSSL +E     N     +
Sbjct: 567 APVMFASNPTQSLEDPRKTAAAAVVAKLTASTSSAEMLSYVLSSLASEGIIGNNNPPAVT 513

Query: 612 GFSSLSIFSPDKRQKLEKPMPNSDVSSSDGVGASFVTPMQQQLTSLPLAQSVNGQPVSQG 671
              S   F P+KR KL+    N D         S+++P  Q   +               
Sbjct: 627 ETLSSVDFPPEKRPKLQ----NHD--------QSYLSPHHQNTAT--------------- 513

Query: 672 NPSQASFAPPPPPAPPSLSSTPPVNQYAQPGGLVGVLPYNFGAYSLPPPPPLPPHIAMGL 731
             + +S  P P P PP     P   Q  QP G V   P+N+          +    A   
Sbjct: 687 --TSSSTPPQPLPPPPPFQLQPQFLQPLQPPGPVNHTPFNY---------TIATSTATTQ 513

Query: 732 SRPTSQPP--PQQLQQPQQSQPTSAGFYRPPG-IGFYGQGQQSTPPPVPRQ 771
            +   Q P  P   Q    S P+   + +  G  GFYG        PV RQ
Sbjct: 747 QQQQEQGPWVPGLTQLSTTSAPSENSYQKFQGQDGFYGINSSVPITPVTRQ 513

BLAST of CaUC10G194330 vs. TAIR 10
Match: AT5G65180.1 (ENTH/VHS family protein )

HSP 1 Score: 139.8 bits (351), Expect = 9.1e-33
Identity = 170/670 (25.37%), Positives = 251/670 (37.46%), Query Frame = 0

Query: 12  LSRWCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLANDILQNSRRKGSEFVNEFWKVL 71
           LS+WCI HR +A+ +V TW+K F+S+Q  Q+V  LYLANDILQNS+R+G+EFV EFWKVL
Sbjct: 27  LSQWCIVHRSEAELVVTTWEKQFHSTQIGQKVPLLYLANDILQNSKRQGNEFVQEFWKVL 86

Query: 72  PGALKYVYDHGDEGGKKAVARLSWSHRGSSDVIAKLAAGGSRCRVGGGYLRNWASEVAFA 131
           PGALK +   GD+ GK  V+RL                           +  W       
Sbjct: 87  PGALKDIVSLGDDYGKGVVSRL---------------------------VNIWE------ 146

Query: 132 RVVHCIRVVRVGQRKVFGSRGQSLKDEMLG-KIPPPTQLPSSNGKSSNPIKIVKRDAHSV 191
                       +R+VFGSR +SLKD ML  + PPP  +     + S   K  KRD+ S 
Sbjct: 147 ------------ERRVFGSRSKSLKDVMLSEEAPPPLDVSKKRFRGS---KSAKRDSKST 206

Query: 192 RIKLAVGGVPEKILTAFQSVLDEHLNEDAALNNCSSATHHLNQIEEDLNASLAQDFGFAE 251
           + KL+ GGV EKI++AF  V  E+ NE+  +N C SA   + ++E+D             
Sbjct: 207 KTKLSSGGVSEKIVSAFNLVRAENSNEETEMNKCKSAVRRIRKMEKD------------- 266

Query: 252 LVEIVFPSTQVVLLFANFVQTCRDGGVNGDSERVEMGCKATVMGKGSRFGGSLATLTKSE 311
                                            VE  C                      
Sbjct: 267 ---------------------------------VEDACSTA------------------- 326

Query: 312 ARRKLSSVFDPGFSSRDKETQDQINVNMIEVKEDEHSMSKRSVGRSKPEPPKKGHFKSPP 371
                                                                   K P 
Sbjct: 327 --------------------------------------------------------KDPR 386

Query: 372 NESVFKILPLDKPPKEAQRSIKVLAKGFVDEWASSVSFFIALTCISFSLYLPSKLGTQPR 431
            ES+ K                                                      
Sbjct: 387 KESLAK------------------------------------------------------ 428

Query: 432 SALLDDLQEQENVIQECIRQLESVEATRASLVSLLVEALQDQESKLELVRNQLQIARSQI 491
                +L+E+EN++++ + +L+SVE +R SLV+ L EAL++QES+LE +++Q+Q+A+ Q 
Sbjct: 447 -----ELEEEENILRQSVEKLKSVEESRTSLVNHLREALREQESELENLQSQIQVAQEQT 428

Query: 492 ELASNVRKRFTS-TPVPGSSATTMDLLTEMTHVTDTKLSSVQQNIISSQSPLVQTMVSFP 551
           E A N++KR  + TPV  ++ T                        S QS  +       
Sbjct: 507 EEAQNMQKRLNNETPVNNNNGT------------------------SGQSAKITP----- 428

Query: 552 GPKTSEEENKRAAAAAVAAKLAASTSSAQMLTSVLSSLVAEEAASMNGSLKSSGFSSLSI 611
                      A+ AA+A  L +ST+S+ ++ SVLSS  AE   +   +  ++  ++  +
Sbjct: 567 -----------ASIAAMAEMLTSSTNSSMIMHSVLSSFAAEATQTSGLTKSNTSDTNAFV 428

Query: 612 FSPDKRQKLEKPMPNSDVSSSDGVG-----ASFVTPMQQQLTSLPLAQSVNGQPVSQGNP 671
             P+ +Q    P P +      G G          P     T  P   + N QP +    
Sbjct: 627 VPPNPQQYHIIPNPAASQQFPYGFGNIPLMPPGALPPPPPGTLPPHMMNNNNQPNAAQQQ 428

Query: 672 SQ--ASFAPP 673
           +Q   SF PP
Sbjct: 687 AQQGQSFHPP 428

BLAST of CaUC10G194330 vs. TAIR 10
Match: AT5G65180.2 (ENTH/VHS family protein )

HSP 1 Score: 57.8 bits (138), Expect = 4.6e-08
Identity = 70/251 (27.89%), Positives = 119/251 (47.41%), Query Frame = 0

Query: 430 RSALLDDLQEQENVIQECIRQLESVEATRASLVSLLVEALQDQESKLELVRNQLQIARSQ 489
           + +L  +L+E+EN++++ + +L+SVE +R SLV+ L EAL++QES+LE +++Q+Q+A+ Q
Sbjct: 90  KESLAKELEEEENILRQSVEKLKSVEESRTSLVNHLREALREQESELENLQSQIQVAQEQ 149

Query: 490 IELASNVRKRFTS-TPVPGSSATTMDLLTEMTHVTDTKLSSVQQNIISSQSPLVQTMVSF 549
            E A N++KR  + TPV  ++ T                        S QS  +      
Sbjct: 150 TEEAQNMQKRLNNETPVNNNNGT------------------------SGQSAKITP---- 209

Query: 550 PGPKTSEEENKRAAAAAVAAKLAASTSSAQMLTSVLSSLVAEEAASMNGSLKSSGFSSLS 609
                       A+ AA+A  L +ST+S+ ++ SVLSS  AE   +   +  ++  ++  
Sbjct: 210 ------------ASIAAMAEMLTSSTNSSMIMHSVLSSFAAEATQTSGLTKSNTSDTNAF 269

Query: 610 IFSPDKRQKLEKPMPNSDVSSSDGVG-----ASFVTPMQQQLTSLPLAQSVNGQPVSQGN 669
           +  P+ +Q    P P +      G G          P     T  P   + N QP +   
Sbjct: 270 VVPPNPQQYHIIPNPAASQQFPYGFGNIPLMPPGALPPPPPGTLPPHMMNNNNQPNAAQQ 300

Query: 670 PSQ--ASFAPP 673
            +Q   SF PP
Sbjct: 330 QAQQGQSFHPP 300


HSP 2 Score: 56.6 bits (135), Expect = 1.0e-07
Identity = 32/84 (38.10%), Positives = 48/84 (57.14%), Query Frame = 0

Query: 164 PPPTQLPSSNGKSSNPIKIVKRDAHSVRIKLAVGGVPEKILTAFQSVLDEHLNEDAALNN 223
           PPP  +     + S   K  KRD+ S + KL+ GGV EKI++AF  V  E+ NE+  +N 
Sbjct: 7   PPPLDVSKKRFRGS---KSAKRDSKSTKTKLSSGGVSEKIVSAFNLVRAENSNEETEMNK 66

Query: 224 CSSATHHLNQIEEDLN--ASLAQD 246
           C SA   + ++E+D+    S A+D
Sbjct: 67  CKSAVRRIRKMEKDVEDACSTAKD 87
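
The percentages in each HSP summary line follow directly from the match counts and the alignment length. The sketch below is a minimal illustration, not part of the database's own tooling: the function names `parse_hsp_summary` and `recomputed_percent` are hypothetical, and the regular expression assumes the summary line layout shown in the HSP headers above (it also tolerates the "Postives" spelling that some exports of this page use).

```python
import re

# Matches fields such as "Identity = 203/771 (26.33%)".
# "Posi?tives" accepts both "Positives" and the "Postives" spelling.
FIELD = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")


def parse_hsp_summary(line):
    """Return {field: (matches, alignment_length, reported_percent)}."""
    summary = {}
    for name, matches, length, percent in FIELD.findall(line):
        key = "Identity" if name == "Identity" else "Positives"
        summary[key] = (int(matches), int(length), float(percent))
    return summary


def recomputed_percent(matches, length):
    """Percentage of alignment columns that match, rounded to two decimals."""
    return round(100.0 * matches / length, 2)


if __name__ == "__main__":
    # Summary lines copied from the HSP headers on this page.
    lines = [
        "Identity = 203/771 (26.33%), Positives = 285/771 (36.96%), Query Frame = 0",
        "Identity = 170/670 (25.37%), Positives = 251/670 (37.46%), Query Frame = 0",
        "Identity = 70/251 (27.89%), Positives = 119/251 (47.41%), Query Frame = 0",
        "Identity = 32/84 (38.10%), Positives = 48/84 (57.14%), Query Frame = 0",
    ]
    for line in lines:
        for field, (matches, length, reported) in parse_hsp_summary(line).items():
            consistent = abs(recomputed_percent(matches, length) - reported) < 1e-9
            print(field, matches, length, reported, consistent)
```

Note that the denominator is the alignment length including gap columns, which is why the long gapped HSPs report low identity even though the conserved N-terminal and coiled-coil stretches align closely.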

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_038904543.1 | 1.4e-245 | 67.54 | regulation of nuclear pre-mRNA domain-containing protein 1B-like [Benincasa hisp...
XP_004136608.1 | 3.7e-238 | 65.13 | regulation of nuclear pre-mRNA domain-containing protein 1B [Cucumis sativus] >X...
XP_008443180.1 | 4.1e-237 | 65.26 | PREDICTED: regulation of nuclear pre-mRNA domain-containing protein 1B-like [Cuc...
KAG7027736.1 | 7.1e-221 | 62.39 | Regulation of nuclear pre-mRNA domain-containing protein 1B, partial [Cucurbita ...
XP_022152017.1 | 1.0e-219 | 62.81 | regulation of nuclear pre-mRNA domain-containing protein 1B-like [Momordica char...

Match Name | E-value | Identity | Description
Q9NQG5 | 7.1e-14 | 46.07 | Regulation of nuclear pre-mRNA domain-containing protein 1B OS=Homo sapiens OX=9...
Q9CSU0 | 7.1e-14 | 46.07 | Regulation of nuclear pre-mRNA domain-containing protein 1B OS=Mus musculus OX=1...
Q0P5J9 | 2.7e-13 | 40.00 | Regulation of nuclear pre-mRNA domain-containing protein 1A OS=Bos taurus OX=991...
Q96P16 | 2.7e-13 | 40.00 | Regulation of nuclear pre-mRNA domain-containing protein 1A OS=Homo sapiens OX=9...
Q8VDS4 | 2.7e-13 | 40.00 | Regulation of nuclear pre-mRNA domain-containing protein 1A OS=Mus musculus OX=1...

Match Name | E-value | Identity | Description
A0A1S3B7H0 | 2.0e-237 | 65.26 | regulation of nuclear pre-mRNA domain-containing protein 1B-like OS=Cucumis melo...
A0A6J1DF20 | 4.9e-220 | 62.81 | regulation of nuclear pre-mRNA domain-containing protein 1B-like OS=Momordica ch...
A0A6J1F335 | 2.1e-218 | 61.86 | UPF0400 protein C337.03-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC1114...
A0A6J1FGC3 | 4.3e-216 | 62.02 | regulation of nuclear pre-mRNA domain-containing protein 1B-like OS=Cucurbita mo...
A0A6J1I2G4 | 1.3e-215 | 62.12 | regulation of nuclear pre-mRNA domain-containing protein 1B-like OS=Cucurbita ma...

Match Name | E-value | Identity | Description
AT5G10060.1 | 5.5e-38 | 26.42 | ENTH/VHS family protein
AT3G26990.1 | 9.1e-33 | 26.33 | ENTH/VHS family protein
AT5G65180.1 | 9.1e-33 | 25.37 | ENTH/VHS family protein
AT5G65180.2 | 4.6e-08 | 27.89 | ENTH/VHS family protein

InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL246-FR2) v1
Date Performed: 2022-01-31
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 462..489
None | No IPR available | COILS | Coil | Coil | coord: 430..457
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 159..178
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 344..373
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 705..719
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 344..366
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 667..682
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 654..770
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 614..635
None | No IPR available | PANTHER | PTHR12460:SF21 | RNA POLYMERASE II-BINDING DOMAIN PROTEIN | coord: 11..93; coord: 426..770
None | No IPR available | PANTHER | PTHR12460 | CYCLIN-DEPENDENT KINASE INHIBITOR-RELATED PROTEIN | coord: 426..770
None | No IPR available | PANTHER | PTHR12460:SF21 | RNA POLYMERASE II-BINDING DOMAIN PROTEIN | coord: 139..245
None | No IPR available | PANTHER | PTHR12460 | CYCLIN-DEPENDENT KINASE INHIBITOR-RELATED PROTEIN | coord: 11..93
None | No IPR available | PANTHER | PTHR12460 | CYCLIN-DEPENDENT KINASE INHIBITOR-RELATED PROTEIN | coord: 139..245
None | No IPR available | CDD | cd16981 | CID_RPRD_like | coord: 12..107; e-value: 1.80961E-38; score: 136.942
IPR006569 | CID domain | SMART | SM00582 | 558neu5 | coord: 3..160; e-value: 4.8E-14; score: 62.7
IPR006569 | CID domain | PFAM | PF04818 | CID | coord: 11..93; e-value: 2.0E-21; score: 76.5
IPR006569 | CID domain | PROSITE | PS51391 | CID | coord: 1..164; score: 23.031979
IPR008942 | ENTH/VHS | GENE3D | 1.25.40.90 | | coord: 8..109; e-value: 7.4E-25; score: 89.4
IPR008942 | ENTH/VHS | SUPERFAMILY | 48464 | ENTH/VHS domain | coord: 10..93
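
Several of the MobiDB-lite rows above overlap (for example 344..366 lies inside 344..373), so the number of distinct predicted disordered regions is smaller than the number of rows. The following is a minimal sketch of how the `coord: start..end` values could be collapsed into non-overlapping regions. The helper names `parse_coords` and `merge_intervals` are hypothetical, coordinates are assumed to be 1-based and inclusive, and adjacent intervals are merged as well.

```python
import re

# "coord: 159..178" -> (159, 178)
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text):
    """Extract every (start, end) pair from 'coord: a..b' annotations."""
    return [(int(start), int(end)) for start, end in COORD.findall(text)]


def merge_intervals(intervals):
    """Merge overlapping or adjacent inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Extend the previous region instead of opening a new one.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# MobiDB-lite alignments copied from the InterPro annotation rows.
MOBIDB_ROWS = """
coord: 159..178
coord: 344..373
coord: 705..719
coord: 344..366
coord: 667..682
coord: 654..770
coord: 614..635
"""

regions = merge_intervals(parse_coords(MOBIDB_ROWS))
covered = sum(end - start + 1 for start, end in regions)

if __name__ == "__main__":
    print(regions)
    print(covered, "residues in predicted disordered regions")
```

The same two helpers work unchanged on the COILS or PANTHER rows, since all alignments on this page use the same `coord:` notation.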

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CaUC10G194330.1 | CaUC10G194330.1 | mRNA