Cp4.1LG08g13750 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG08g13750
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Regulation of nuclear pre-mRNA domain-containing protein 1B
Location: Cp4.1LG08 : 9821034 .. 9832894 (-)
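
The location field above can be read programmatically. The following is a minimal sketch, not the database's own parser: the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into its parts.

    Assumption: coordinates are 1-based and inclusive, so the
    feature length is end - start + 1.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG08 : 9821034 .. 9832894 (-)")
print(loc["seqid"], loc["strand"], loc["length"])  # Cp4.1LG08 - 11861
```

Under that assumption the gene spans 11861 bp on the minus strand of Cp4.1LG08.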
The following sequences are available for this feature:

Gene sequence (with intron)

CATTCTTGGTCAACATTTTTTCTCGATATTTTGACGAACTCTCTGCAAACCAGTTTTTTCATCTTCCTCCACTTATCACCCTTCTTCTGGACCATCATTTGCTGGTGAATTAGGGAATCCTCTGTCCACTTCGAAAAACCCACAACTTTTCCCACTCCTTCTCGGTTTCAATTTCTTTATTCTCTCTTCATCCTAATCCCTATTGCCAAGGGAACACAAATACCAGGAAAAATCATCGCCAATCAGCAACCTTTTTTCTTCTTTCCTGTAATTCATCTCCCCGAGGTTCTATCGGGAAAGGCTATCATCAAGAGTGGAGTTCTTGAATCAATATACGCCAGGTATTCGAAATTTCAAATGTTTTTTTTTTTCTTTTGCTTTTATGGAAATCTAGATCCACCCTTTTCGTTAGCGGCGAGGAATTACTGATTGGGTGTGGAACCCTAATGTTGTTTTTCCATAAGTTCTCGAACTGGGCTTCGCGTAAATTGTTTTTCGTGCTGAGAATGATAGTTATGCCTAGTTATATTAGGAAATAATGTTGAAGGTAGCTTTTTTGAAGTTTTGTTCTTGGTTACAATTCGTGTGCTCACTATTTTCAGAGCTTACATAGCTTCGATGAACTGCTAGTTTTTCTTGGAACTTTGCTGCAGGGTAATGTAATAATATTCTCGTCACGGGATGAACAACGAGGGTTTTGAAGCACAGGTGTTGTCTGAGAAGTTATCCAAACTCAACAATTCTCAGCAGAGCATCGAGTGTATGCTTCTCTAATCATTATCTTTTCATTTATTTTTGTGGCCTGAAAATATTGCATTGCCCGTTCTATTACACACCGGCGTTTACAGTTTTTGTTCAGATATTTCTCCCTTTTATATATTGATTTTAATTTTTTTTTTTTGGTGTNCAAAAAAAAAAAAAAAAAAAAAGAAGAAGAAGAAAAACAAAGTAATGCCTTTTGTCATTTACTAATTATGTATGTAAAGAACTGGGTATGGTGGCGATTATCTCTTAAAACTTTGAAGTTAGGATCCTTCTCTATATGTATTTTAAATTGTAAAAGAATTAATTGAATTTAATTTTGTGGATAGCCTCATGATTTTGTGGTTGAGATTTCTTCGATCTTGGCATGACTTCATTATTAACCTTGATTACCTATGCTCCTTATGGTTGAACATTTTCATTACTCTTGTCTGGGAACTTTGGATTGAGTAATTATTGCCTTTGTTCTTTTCATTTTTCTGGCAATAAATTCACTAGATTCGAAGACGTTTGGGATGTGTTAGTTCTTGTGTTTAGATTGAGCCCCTTTTGAAGTCTTTGTAATTTTTTTGTGTTGGTACATACTGATTAAAGAACGGTTCTTATACTGTTGACTCATGGAAGAATATTTTGTGCTCAAATATATCCTCAAAAAATTTCTGATGGTGGGTGCAACTGTAATAAATGTGTTTTTCTTTGATATTGAATTAGCTTTATCAAGATGGTGCATTTCCCACCGGAAAAAGGCCAAGCAGATTGTTGAAACATGGGATAAATTATTCAACTCATCTCAGAAAGAGCAGCGTGTTTCGTTTCTTTATTTGGCAAATGACATTTTGCAAAACAGCAGGCGCAAGGGAAGTGAATTTGTGAACGAATTCTGGAAAGTTCTTCCTGTTTCTCTCAAGAATGTTTATGATCATGGGGATGAAAGTGGAAAGAAGGCGGTAGCCAGACTTGTATGTCCTCTTTATTCTCGTGTATAAATATGATGGCAATAAATGTTCTTATGTTTTTATTTGTGGAAGCATCTTAGTTGACTCTAGAACATGTTGCTCAAATGTAAAACATTTTGCATTTGTGAAAAAAGAAATGAAGAAAAGGTGTAGACGGAAAACAGAAAAAGGGAGGATGGATTCCTTCATTTCGGCCTGACAAATTTGTATCATAAACATTTATATTTAGGCTTCTCAAAGAGTGAACTGCTTTATTAGTCACATTGGTAACAGAACTCTCAA
TTCTACTTATAGCAGTCTTTTGATTGGTTTCATGAGTACTGAGGGTGGTCAATTGGTGCTAATTCCTAAAGCTTGTATATTAGTTAGGAATAATCTAGTAAATTCCTGATATGTGGGTTAAAACTTGAGTTTGAAGCATAATATGTGGTACTTTTTCTTTCCCCTTTTGTCCCCCCATTCCTAGACGTGAATGTAGGTTTCTTTATAAAATAATTTTTGTTTAAAATAGTTATAAGCATAATTATTATTATTTTTTTATATAAATAAAGGAATATAGGACCTGTTTGTGAGGGGTAGATAAATCCTTGATTTGATCTTATTGCAAATAAAGCTATTAATGAGAGAAGAAAGGAAACTGAGATCTAAAATTTGTCACCCAAAAAGCTTACTATTTAATGGGTACACCCATATAGATTGATATAAAAGAAAGTGGTGTTTGAGTCTAACTGAAATAAATTCCTTAGTTTGGATGGATTCACGATGGGTTTTATTTCAAGAGAGTTGGGAGTGTGTAGGAGTATGCTTGGAAGGTCTTTTAAGTTTTACAAGCGGGTGTCATCGATGTTTCTACTTATGAGGCCTATGTTTTCCTTGTACTGAAAAGACAAAGGAGTGTAGGGTTAAAGGACTCTTAGGCCAATAGGATTTGTTATAAGTTTAAATGAGATTATTGTTCCAACATTAGCAAATGGATTGAGAAAGGTCTTCCCTAACACGATTTTAGACTCCCAAGGAGCTTTTGTAGCAGCGAGACAATTTGGATCAGGTGTTGATTGTCAGTGAGGCTATTGAGGATCACAATCAAGAGGGTGTCATTCTTAAGATTGATTTTGGTTACGCTTATGTTCATGTGAATTTGAGATTTCTTAGTTAAAGATCTCTAGAAGGAAGGCTTCATGTTTAAATGGAGGTTGTGGATTTGAAGATATATTAGGATGGCGAATTTCTCTATCCTTATCAATAGTTAACCTGACTGAGGTTCAAGCCCCTACAAATCTTAAGCAAGGTGATCCCCACTCCATTCCTCTTCATTTTGGTCACTGACATCTTGATTTTGTTGGTTTCTCATGAGCAAAATTGGAGTGATCAATGTTTCTAAAATCGTTTAAGACGTAAGGCTTGAGGTGAGGCAATTTATCATTTTTGGAGGCTATCTTGGGTTTGCAGCTTAATAAAGGCAAGTGTCAAATTTTAAGTTTGAATTGTGATCCAATCAAGTTGCGTAGGTGGGTTGCTTTAAATGAGATTATTCCTCCAACATAAGCAAATAGATTGTGAAGGGTCTTCCCTTACACGATTTAAGACTCCCAAGGAGCTTTTGTAGCCGAGAGACAAAATTTGGACCAGGTGTTGATTGTCAATGAGGCTATTGAGCATTACAATCAAGAGGGTGTCATTCTTAAGATTAATTTTGGTTAAGCTTATGTTCATGTTAAATAATTCAATCTGTTTTATTAAAAACCGTGTTATACTTTGATAGTGTATGGAAGTTTATACAATTTGTTTTTTGGTTACAGATTACGATTGATTGACATTTTATTCTAAATTATCCTCACGTACATTTTTGTTTTTTATATGTTGCTTTCCGCTATCAACTGTTTGCATGGCTCTGTAATGTTAATTAAAATTGCAATAATAAGATTGACCAGTTCCCGTCTCCCCCTTTCTTATGACAGTATCTCGAGTGTTATTTCGTAAGTTTGGACTCTCACTTGTTTCCCTTCATGGGAGTTTGTTTGTTAAGGTCGATATTTGGGAAGAAAGAAAGGTTTTTGGTTCTCGGGGTCAGAGTCTTAAGGATGAAATGCTTGGTAAAAATCCACCTCCATTACCCAACAGCAATGGAAAGAGTTCAAATCCTATAAAGATAGTGAAGAGGGATGCCCATTCTGTGAGAATTGTAAGACTATTGCAGTTTTCCATTTGATTTCTTGTGTTCAGACCTCTATAATCTTTATTGCTCAACTCTAGAAACTGGCTGTTGGAGGTGTTCCGGAAAA
GATTCTTACTGCATTTCAATCTGTACTTGATGAACATTTGAATGAAGATGCTGCTCTCAACAACTGTAGTTCTGCAACTCATCATCTGAACCAAATTGAGGAAGACTTGAATGCTTCCATAGCTCAAGGTATAACCTCATCTACTATAGTCATCTCATGCAATTAGTTTAAGGTGACGCTGACATGTACGCTAAGTTAGCGTACATGACAGCGTGGTTTTGAGTTTCTTTCCTAATTAGTGTTGTTTGTGTGAATTCCAACTCGTGTTACTTTTGGAATAGCTGTCATTTACCTTAAAGAGCTGCAACATTTTTCTGTTGACGGAAACTTCTGAAGCTCTTCTGTCGTTGTTCTTATCTCCCCTACTTATTGTCGCTGTATACTGAGAAGCAATACTCTAATGCTAAATGATAGGTCCGGAAGATAGTAATAGATATGGAGTTTTTTTTAAGAAATAGAACTTTCATCATGAACAATGAAAGAATACAAAAGAGAAAACTGAGACATACATAAAGATAGCCCAAAAACAGGAACCAGCACGAAAGGGACATTAAACCTAGCTAGGGAGGACATAAGATCTAGTGGACAGTTCCGGGATTCTTCGGTAATAGAAGCTGAAGGGACATTAAACCTAGCTAGGGACTAAACCCCATCAATAATCTCTCTAATTCCCTAAATATCTTATTATTCTTCTCAAGCTAAATACCCCTAAAAACAACAAGAAACTAACCCTCCAAATGAACCAACATTTATCTCGAAAGGGTGTATACAAAAGAATATCGAGAATCGTAGACTGGCAATCTCTACAGCAAGCTAGAAACCTGAACTGTTCCCAAAAGCAAACCCAAATTCAGAGGCAAAGTCACTCCAACCTTTTCAATCTCTAAGAATTATAAGGAGGCCATACTTGCAAAATAAAACCTTTACATTTTGTTATATATGGATTTTATGATAAACTAAAAATATCAATACGTTTAATACAATGAACAAGTATCTAATTTGATGGATACATTCAATATTTAGAATTTAATTTGAAATGGTTTACAATTTGACATGACATGACTGTAGAAATACAATTTGATAGAAATACATTTACCAGCTATTCTATCAAACATGTAGTTGAATCTTCATGGAAGGAATGCTAAGATTCAACCTTTGTAGCCATAGTGAGCAAGCTAAGAAAGAATGATTGAAATCTTCAATCTTGGAACATTGGCCAAGTCAAAGGGACATCTTACCATAAAATGCCGAACCCCGATTTTTATTTAATACATATTTTAGTTTTAAATGCCTTAATTTCCTTCTTTTGAAAGTTTGTAGATCAATGGTCATTCTATTCTTTGGTTTATTTTGTCCAGTATATATATAGAATTCTCCTTGCTATATTTTTATGTTTGTGTCGTAAAACCTCTTTCTTCGAGAAGTATTGTAAGAGAGTTTTTCTTTGTACTTTCCTTCCAAACTAGGAACTCAACCAAGATCAGCATTGCTGGACGACCTTCAAGAGCAGGAAAATGTGGTTCAACAATGTATTGGGCAACTTGAAAGTGTTGAGGCAACTAGGTCTTCTTTGGTTTCCTTACTTGTAGAAGCGCTTCAGGACCAAGTAATTTATCTTCCTCTTTTTTATTTCCTCAAATAATTTCTCAAGATTCCAAGCCTCCTCTATATAGTTGACATACTTTCATTGTAAGTTGATTTCCTTGTGGAAAAATTACAGGAATCAAAGCTGGAACTAGTTCGCAATCAGTTGCAGGTAAGTTAAAGTGCTTCTTATATTTTCTTTTCTGAAATCTGTAGTAAGAGTTTATGTTTTGGGATCTCTTGTTGGCTTAAAGTAGTTTTGAAGGTGGGTCTGACATGTGATGTATTGTATACCTTGAAACCTCAGAATTTGGTAAGATATTTGGATATGTCAAAACATGAACTTCAGTATACCTTGAACCATTTAGATCAGAGTGCCAGACCTTTATATTTCTTGCACATTTAAGCATGGTGGA
GGTGGACAGACACCTTGAACAAGTGAACACGATAAAACTTTGACCGTGACTATTACATTTTTCGAGAAAAATAAGACTGAATGATGTTGTTTGAAGTGTATAATTTTGTGGGGGATTGATTTTGACGTTGTTTTAAGTTTGATATATTTGATTGTTTAGAACGAAGTTTTTTTGCCTTTATACATCATTAAGATGACACTAGTGGTTGACTAATATTGTTATTCTTTATACATCTTATTATATTCTCTTAATGAGAAACCAAACATTCCATTGAAGAAATGAAAAGAGACTAATGTTCAAAAGATACAAACTCCTAAGAGGGTGGGGAATTGAAAAAAACAGCATCAAGGAAAAAAAAAACAAAAGCTTCCCTATTCAAACATGTCTTGCAAAGGAAAAACCCAAACAAAAACTCACTTTGTTGTACGATAAAGGTGTAGAATAAGCTCTTTTTGTGTAAATCTTCAAAGAAAGGGTGGTTAAGATCACAGAACCGCATTATAGGAAAAAGTTCATGTTGTTAGTTGAAGAAATCGTTTTTGTCTGAATAATAGACTCCATGGAGACTTGCTTCTTGTTCTCAACTCCCAAAATTCTTCCGGAAAAACAATTGTAACAATGGCTTCATTTGGATTCAGAAAATTGTAAACAAAAGAGAGATATTAAGAAACCACGAAAGTAGTTAACTTGAGGGGAAAATTCAATCTGTGGTGCCAGCGGGAGAAGAATACAGTGGATGGAAGACCTTTTAACAAATGATAATAGATTTTCTCAACAGAAAGAAAGCGGAGCAACAAAGGGAAATAAAACGGAAGTGTAGAAAGGAGAAATATTTTGCAGAAGCTGTAAAATCGCCCTCAGAAAGGTCGGGGAGGGAGAAAAAGACAAAGAAGGAGGATTTAAAGGTGATCGAAAGGAAAGGAAGACTTGAAGAGCACCTTATCATCATTCATTTTTCAGACATTTTTCAGACGGATAAAGCCATCCTGAAATGCCATTATGAGGATTTAGAAGGTCGTTTGTGCCTAATGGATGAGCTTTGGACCGTTTGTTTTGAAGCTAGAGAGATGGGATGTTTGTAAACATAGTAGAGTTAAGTTAGTATCCCATTATGGTGGTTGGGTGAGAATTCATAACATTCCTCTTCAATTTCCATTGAACCTGAAGATATTCAGCAATTTGTGGCTGTCATGGTGGCTTGGAAGGTTATGAAGTAACTCCTTGCTCATGGATTGCTTTGATGTGGGCATTAAGATCAAGGATAAGTATTGGGGGTTTATACTAGTCAAATATTAGCCTCATCGACAGTGTGGGGACATACCAGGAGGAGAATATGCTTGTGGATAGGGTCGCCGACCTTCATGGAAGTTTCTCGCCAACCACTGCTCACGTTTTTCATGGAGGACCAAAGGATCCTAATTTTAGTCTGATGGATGAATGAAGGATCGAAGGTACAATTACCCATGCGTTAATGTTCAAAATGCTCTCGCCATAAAGATATCAGATACGGCCCACAATTCCAATTCATAATTATCCTGCAAAATGGGGGATGTGGTAATGGATACAATGGACCACAAGAAAAGGGAAAAGGGTAAAGAGGATGAAACCCAACAAGGAAAAGGAAAAGAGATTCCAAAGGATGTCTCAACAGAAGACATGTATGGCCCATCTAAATCACGAAGAAAAGAACGTGAATCACATTTGGCGAGAAGCCCAAGTTAAAGCGTTTTGAAATAGGATATATCCAATCCACGAGTACTCACAAAGACCCCTCCCAATGGGAAGAAGCTCCAAACGAAGAGCAAGAGAGAGACCTAGGTTCCGAAATTTCTTTATGAAGTCCTAGCATTAGAACAATGAACAAAAATTTTGGAATTCAAGAGAGTGTGATGGAAGATATTCGTTCTGGAGGATTTCAAAGTTATTTCCAAACGGATGAATCCTTAGATATTATGGCCATGACAATTGAAGGTGCTAGTTCAATGGAAGAGAGGAAGGAC
GAGAACACCATGTCGACCATGGCAATGATCTCCCCAAAATCGAAGGCAATTAAAAAAAAAAAAGAGTAGGAATTTCAATATTCAGAGGGCTTCACAATTAGCAAAGAGGTAGTGCAAACTTTGGAAAAGAATAGCTATGTATCAAACCATTAATAAGGAATTCGGCGAAAAAAGGTAGTGTTGCTCAAAAGAAAATAGCAGAAGAGGTGACAAGTCTTCTTAGAACATGGAAAATGAAAGGGATGAGGAGGAGACGAAGATAAAGGTGTGGACACAGAAGTAGTTGTTGAAAGTGATATCAGGGCTCCTTAGGCATCTCACTAAAAATAATTTCGTGGAATGTGAGTGTGTCCCATATCTAGCATCAATAACCTGTCTTCATAAAGCATCCTCTTCATGATAATACGTCCGAATTCACTTGGCTAATAAAGCTATATTATTTTCTGCCAATCATTGCAAACCGAGGCCTCCATGGGAGGTAGATCTAGTGACTTTTTCCCATCTGGCTAAGTGTGATCTGGATTGAGAGGAGCTCCCTTTTCAAAGGAAATGGCCCCTGATNACCGCCTCTCAGGCCCCCGACTGATTCTACCATAGAGGCCAACGATAGACAATAACTCCCCCCCGAACACAGCTTACAACTTTCATTGTCTCTATATGGACTTTCAATATTCAATGGAACCTTCCATGGGATCTTGAAGAGAGACAAAAAGTATGTGGGAAGATTCAAGAGCGTTGCTAACAAAAGAGCAAGGCGACGCCCTTTGGATAAGTGTGTAATTCCATGAGTGGATCCTTTTTTCAACTTCTCTCAACAATTGGGATACAAAATGAAGAAGTTCTAGGTTTATCTCCCAATGGAAGGCAGATAGACATTTGACCAATATCCTTGTCCGCAATCAAATTTCTCGGCTATTAATTCGACCTCCTCAAATTCTGTATTGACCCAAAAATCTCCGATCTCTTATGATTTGTAATCAATCCACATGCCTTTTCAAAAGAGTCAACAAGGCCAAAAAGGCTGTCAAGAGCCAAGATACATGTGGATGAGGAAAGAATGATTTCATCGGAAAAAAAGATGATGTATCTGAAGGAAAGCTCTTCCTATCTCAAAGCTTTATGGAGCTTGCTTCTATCATCATTCTACTAGAACAATCCATGACCGCGATTTCCTTTCCACCTCCAAACAAACGCTTGAAGAAGTAAGAAAAAGGGACATGTAGCATAAAGGCAAGTTAGAGGGAACTAAATTACAAAGTGTTGGCCTACCATCTCCCGAAAGTTTGAAGCATTCCATGTATCCAACTTCTTCAAAATTCTTTCCTTAACGGGATGCCATAAGGATGCATTTTTGGGATGACCTCCATGGGGAGACCAAGATATAAAAGAAGGCAAATATTCTATCTTGCAATGAAGCCTTGAAGTAACTTGATTCAACTTGACATTGTCAATATTTATCCCACTAATTGACGACTTCTTCCTATTGATTCTGAGACCAGAAAATCTCTCATATACTTCCTCTGTTTTGATTAAAATGTCCAGCATGTATCATCATATTTGCAAACTAACAATAGCATCCTAACAGTTAAAATCTTTCTCAATGACCAGCTTCAAACTTCTACCTTTCTGTTTTTTGCCCAAAAAAAAAAAAGCTATGGGTTGCTTCATATCCACTAGCTGTCATATTCACAACTTATTTACTCTATCATGTGATATGAACGAGTTTGAATACCAATATTACAGTTCAAACTACTAGGAGTTCAATTATTTGAGAATTTTTACTTCTTGTTTCTCTTCTCACAATAACCACTGAACCATTTTATATCAATTTTCTCAGTCATTGCTAAGCTATTTTAGGCTGTAGAAACTGAATTGTGCTGTAAATTTTATTTACCGCGAACCTTCATCAAGAATTTGGTCTGGGTCATGTCTAATGCAATTTTTAGATCGTATTTTGATTAGCAAATTTTCTCCTCAACTAGAAAATGCATTAATAGATT
AAACTTCAGTTGGTCCCTTTTGGATGTGAATTGATAAGGTGGTTCTTTTTCAAGGTGTTATGAGAAATTGGCTCAAATATTGTGGATAAAGATATAACATTTGTACTTTTATAAATTGGAACGGACTTCTTTGAGAGTGAGAGAGCAGTAAGTTTGTGTACACTGTTCATTTTCCCAAGTTTATGTACTAGCGACAGAGCACTTGGTTATGTACTATCAACTTGTTTATTGTTATCTGCCATTCACATGGACGGACATCTCATATTAGTTGTATCTCATATGTTGGGGAATGCAAAATATTAGTTAGAGGGTTTGTTATTTTCTAATTTCAGTTTGCTTATGTTGCTTGGAAATTATTGGTTATGAATCTGCAGATAGCTCGATCTCAAATTGAGCTAGCAAGTAACGTTCGGAAGAGAATTGCGAGTCCAGTTCCTGGTCCTCTTGCTACGAACATTGATCTACTGACAGAAATGACCCATGTGACTGATGCTAAGTTACCTTCAGCTCAATCAAACACAATCTCTTCTCAGTCCCCTCTTGTCCAGGCTATGGTTTCTTCTGCTGGTCCTAAAAGCAGTGAAGAGGAAAACAAGAGAGCTGCTGCTGCTGCTGTTGCTGCAAAGTTGGCTGCTTCTACATCTTCAGCACAGATGCTTACCTCTGTTCTTTCATCTCTTGTTGCAGAAGAAGCTGCTTCCATGAATGGTGGCCTAAAATCAGCTGGGTTTGCTTCATTATCCATATTCTCTCCTGAAAAACGACAGAAACTTGAGAAGCCAATGCCGATTTCTGATAGTTCAGATGGAATTGGTGCGTCATTCGTTACACCTATGCAACAACAACAATTGACAAATGTGGCACTTGCACAATCAGCAAATATACAACCTGTATCCCAGGCGAACCAGAGTCAAGCTTCATTTGCTCCACCACCACCGCCAGCTCCACCAGTGAATCAATATGCTCAGTCTGGTGGAATAATGGGGGTATTACCTTATAACTTTGGAGCATATTCTCTACCACCTCCACCTCCTTTACCTCCCCATATTGTGATGGGTTTGAGTAGGCCGACGTCTCAGCCACCTCAGCAACAGCCACAGCAGCCTCAGCAGTCGCAGCCAACTTCAGCAGGATTTTACAGGCCACCAGGTATAGGTTTCTACGGGCAAGGTCAGCAGTCGACTCCACCACCTGTCCCTAGGCAGTAAGTTCCCAGGTAGAGTTTACTTGGGGCAAATGTTCAATTAAAGGATTAATAGCTTTAAACAGGTTAAATTAGGCATTCACATTTCTACTTGCCATCTGAGAGTTCATTTCAGAATATGTATTATCATGTAAATTAGTAGAACTTTTTTTTGGTTCTGTAGAAGTTTTAGTATATGTCCATCTCCTGCCAACGAGTTAAAAGGGCATGTTGCAGCTGAGGCAGCTGAGGTTCGTCCAATGCTGATGGAAGAATTAAGTGGCTAGATTTAGGCTTTTCCAATGTAATGCTGTCGAAAACTGGAAACCTCGGTGAATTCTCGGCTGCCTCGAAATTTTACTATGGCCAGGTAATCTTTTCTCCCATTAGATTGGAAGTTGGGTAGAGTACCTCCTAGCAGAATTTTAGTTGACAACGTTAGATTTAGGTTTGAAGACACTGTTGCCTCTAAATCTGCATGGATGGATAGAGAGAGTGGCTAAATGGACTTGTTTTGGATAGTCATAACAGGATGATACTGGTCTGTAATATAACTTAGTGAGGCCTTGAGAATTTCTAGTAGATACAGCACTGCCATGGTTCTTGTAATCTGTTGCTCTGTTTTCTGATTCAGTAGAGAATTTTTATTTAAAATAAACTGTTCCTTACTCTTCTTAG

mRNA sequence

CATTCTTGGTCAACATTTTTTCTCGATATTTTGACGAACTCTCTGCAAACCAGTTTTTTCATCTTCCTCCACTTATCACCCTTCTTCTGGACCATCATTTGCTGGTGAATTAGGGAATCCTCTGTCCACTTCGAAAAACCCACAACTTTTCCCACTCCTTCTCGGTTTCAATTTCTTTATTCTCTCTTCATCCTAATCCCTATTGCCAAGGGAACACAAATACCAGGAAAAATCATCGCCAATCAGCAACCTTTTTTCTTCTTTCCTGTAATTCATCTCCCCGAGGTTCTATCGGGAAAGGCTATCATCAAGAGTGGAGTTCTTGAATCAATATACGCCAGGGTAATGTAATAATATTCTCGTCACGGGATGAACAACGAGGGTTTTGAAGCACAGGTGTTGTCTGAGAAGTTATCCAAACTCAACAATTCTCAGCAGAGCATCGAGTCTTTATCAAGATGGTGCATTTCCCACCGGAAAAAGGCCAAGCAGATTGTTGAAACATGGGATAAATTATTCAACTCATCTCAGAAAGAGCAGCGTGTTTCGTTTCTTTATTTGGCAAATGACATTTTGCAAAACAGCAGGCGCAAGGGAAGTGAATTTGTGAACGAATTCTGGAAAGTTCTTCCTGTTTCTCTCAAGAATGTTTATGATCATGGGGATGAAAGTGGAAAGAAGGCGGTAGCCAGACTTGTCGATATTTGGGAAGAAAGAAAGGTTTTTGGTTCTCGGGGTCAGAGTCTTAAGGATGAAATGCTTGGTAAAAATCCACCTCCATTACCCAACAGCAATGGAAAGAGTTCAAATCCTATAAAGATAGTGAAGAGGGATGCCCATTCTGTGAGAATTAAACTGGCTGTTGGAGGTGTTCCGGAAAAGATTCTTACTGCATTTCAATCTGTACTTGATGAACATTTGAATGAAGATGCTGCTCTCAACAACTGTAGTTCTGCAACTCATCATCTGAACCAAATTGAGGAAGACTTGAATGCTTCCATAGCTCAAGGAACTCAACCAAGATCAGCATTGCTGGACGACCTTCAAGAGCAGGAAAATGTGGTTCAACAATGTATTGGGCAACTTGAAAGTGTTGAGGCAACTAGGTCTTCTTTGGTTTCCTTACTTGTAGAAGCGCTTCAGGACCAAGAATCAAAGCTGGAACTAGTTCGCAATCAGTTGCAGATAGCTCGATCTCAAATTGAGCTAGCAAGTAACGTTCGGAAGAGAATTGCGAGTCCAGTTCCTGGTCCTCTTGCTACGAACATTGATCTACTGACAGAAATGACCCATGTGACTGATGCTAAGTTACCTTCAGCTCAATCAAACACAATCTCTTCTCAGTCCCCTCTTGTCCAGGCTATGGTTTCTTCTGCTGGTCCTAAAAGCAGTGAAGAGGAAAACAAGAGAGCTGCTGCTGCTGCTGTTGCTGCAAAGTTGGCTGCTTCTACATCTTCAGCACAGATGCTTACCTCTGTTCTTTCATCTCTTGTTGCAGAAGAAGCTGCTTCCATGAATGGTGGCCTAAAATCAGCTGGGTTTGCTTCATTATCCATATTCTCTCCTGAAAAACGACAGAAACTTGAGAAGCCAATGCCGATTTCTGATAGTTCAGATGGAATTGGTGCGTCATTCGTTACACCTATGCAACAACAACAATTGACAAATGTGGCACTTGCACAATCAGCAAATATACAACCTGTATCCCAGGCGAACCAGAGTCAAGCTTCATTTGCTCCACCACCACCGCCAGCTCCACCAGTGAATCAATATGCTCAGTCTGGTGGAATAATGGGGGTATTACCTTATAACTTTGGAGCATATTCTCTACCACCTCCACCTCCTTTACCTCCCCATATTGTGATGGGTTTGAGTAGGCCGACGTCTCAGCCACCTCAGCAACAGCCACAGCAGCCTCAGCAGTCGCAGCCAACTTCAGCAGGATTTTACAGGCCACCAGGTATAGGTTTCTACGGGCAAGGTCAGCAGTCGACTCCACC
ACCTGTCCCTAGGCAAAGTTTTAGTATATGTCCATCTCCTGCCAACGAGTTAAAAGGGCATGTTGCAGCTGAGGCAGCTGAGGTTCGTCCAATGCTGATGGAAGAATTAAGTGGCTAGATTTAGGCTTTTCCAATGTAATGCTGTCGAAAACTGGAAACCTCGGTGAATTCTCGGCTGCCTCGAAATTTTACTATGGCCAGGTAATCTTTTCTCCCATTAGATTGGAAGTTGGGTAGAGTACCTCCTAGCAGAATTTTAGTTGACAACGTTAGATTTAGGTTTGAAGACACTGTTGCCTCTAAATCTGCATGGATGGATAGAGAGAGTGGCTAAATGGACTTGTTTTGGATAGTCATAACAGGATGATACTGGTCTGTAATATAACTTAGTGAGGCCTTGAGAATTTCTAGTAGATACAGCACTGCCATGGTTCTTGTAATCTGTTGCTCTGTTTTCTGATTCAGTAGAGAATTTTTATTTAAAATAAACTGTTCCTTACTCTTCTTAG

Coding sequence (CDS)

ATGAACAACGAGGGTTTTGAAGCACAGGTGTTGTCTGAGAAGTTATCCAAACTCAACAATTCTCAGCAGAGCATCGAGTCTTTATCAAGATGGTGCATTTCCCACCGGAAAAAGGCCAAGCAGATTGTTGAAACATGGGATAAATTATTCAACTCATCTCAGAAAGAGCAGCGTGTTTCGTTTCTTTATTTGGCAAATGACATTTTGCAAAACAGCAGGCGCAAGGGAAGTGAATTTGTGAACGAATTCTGGAAAGTTCTTCCTGTTTCTCTCAAGAATGTTTATGATCATGGGGATGAAAGTGGAAAGAAGGCGGTAGCCAGACTTGTCGATATTTGGGAAGAAAGAAAGGTTTTTGGTTCTCGGGGTCAGAGTCTTAAGGATGAAATGCTTGGTAAAAATCCACCTCCATTACCCAACAGCAATGGAAAGAGTTCAAATCCTATAAAGATAGTGAAGAGGGATGCCCATTCTGTGAGAATTAAACTGGCTGTTGGAGGTGTTCCGGAAAAGATTCTTACTGCATTTCAATCTGTACTTGATGAACATTTGAATGAAGATGCTGCTCTCAACAACTGTAGTTCTGCAACTCATCATCTGAACCAAATTGAGGAAGACTTGAATGCTTCCATAGCTCAAGGAACTCAACCAAGATCAGCATTGCTGGACGACCTTCAAGAGCAGGAAAATGTGGTTCAACAATGTATTGGGCAACTTGAAAGTGTTGAGGCAACTAGGTCTTCTTTGGTTTCCTTACTTGTAGAAGCGCTTCAGGACCAAGAATCAAAGCTGGAACTAGTTCGCAATCAGTTGCAGATAGCTCGATCTCAAATTGAGCTAGCAAGTAACGTTCGGAAGAGAATTGCGAGTCCAGTTCCTGGTCCTCTTGCTACGAACATTGATCTACTGACAGAAATGACCCATGTGACTGATGCTAAGTTACCTTCAGCTCAATCAAACACAATCTCTTCTCAGTCCCCTCTTGTCCAGGCTATGGTTTCTTCTGCTGGTCCTAAAAGCAGTGAAGAGGAAAACAAGAGAGCTGCTGCTGCTGCTGTTGCTGCAAAGTTGGCTGCTTCTACATCTTCAGCACAGATGCTTACCTCTGTTCTTTCATCTCTTGTTGCAGAAGAAGCTGCTTCCATGAATGGTGGCCTAAAATCAGCTGGGTTTGCTTCATTATCCATATTCTCTCCTGAAAAACGACAGAAACTTGAGAAGCCAATGCCGATTTCTGATAGTTCAGATGGAATTGGTGCGTCATTCGTTACACCTATGCAACAACAACAATTGACAAATGTGGCACTTGCACAATCAGCAAATATACAACCTGTATCCCAGGCGAACCAGAGTCAAGCTTCATTTGCTCCACCACCACCGCCAGCTCCACCAGTGAATCAATATGCTCAGTCTGGTGGAATAATGGGGGTATTACCTTATAACTTTGGAGCATATTCTCTACCACCTCCACCTCCTTTACCTCCCCATATTGTGATGGGTTTGAGTAGGCCGACGTCTCAGCCACCTCAGCAACAGCCACAGCAGCCTCAGCAGTCGCAGCCAACTTCAGCAGGATTTTACAGGCCACCAGGTATAGGTTTCTACGGGCAAGGTCAGCAGTCGACTCCACCACCTGTCCCTAGGCAAAGTTTTAGTATATGTCCATCTCCTGCCAACGAGTTAAAAGGGCATGTTGCAGCTGAGGCAGCTGAGGTTCGTCCAATGCTGATGGAAGAATTAAGTGGCTAG

Protein sequence

MNNEGFEAQVLSEKLSKLNNSQQSIESLSRWCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLANDILQNSRRKGSEFVNEFWKVLPVSLKNVYDHGDESGKKAVARLVDIWEERKVFGSRGQSLKDEMLGKNPPPLPNSNGKSSNPIKIVKRDAHSVRIKLAVGGVPEKILTAFQSVLDEHLNEDAALNNCSSATHHLNQIEEDLNASIAQGTQPRSALLDDLQEQENVVQQCIGQLESVEATRSSLVSLLVEALQDQESKLELVRNQLQIARSQIELASNVRKRIASPVPGPLATNIDLLTEMTHVTDAKLPSAQSNTISSQSPLVQAMVSSAGPKSSEEENKRAAAAAVAAKLAASTSSAQMLTSVLSSLVAEEAASMNGGLKSAGFASLSIFSPEKRQKLEKPMPISDSSDGIGASFVTPMQQQQLTNVALAQSANIQPVSQANQSQASFAPPPPPAPPVNQYAQSGGIMGVLPYNFGAYSLPPPPPLPPHIVMGLSRPTSQPPQQQPQQPQQSQPTSAGFYRPPGIGFYGQGQQSTPPPVPRQSFSICPSPANELKGHVAAEAAEVRPMLMEELSG
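
The protein sequence corresponds to a translation of the CDS with the standard genetic code. The sketch below illustrates this on the first and last codons of the CDS only. The helper name `translate` is hypothetical, and using the standard nuclear code is an assumption, since the record does not name the translation table.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"   # T..
         "LLLLPPPPHHQQRRRR"   # C..
         "IIIMTTTTNNKKSSRR"   # A..
         "VVVVAAAADDEEGGGG")  # G..
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for codons containing N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First nine codons of the CDS above.
print(translate("ATGAACAACGAGGGTTTTGAAGCACAG"))  # MNNEGFEAQ
# Last five codons of the CDS above, ending in the TAG stop.
print(translate("GAATTAAGTGGCTAG"))              # ELSG
```

Both fragments agree with the start (MNNEGFEAQ) and end (ELSG) of the protein sequence listed above.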
BLAST of Cp4.1LG08g13750 vs. Swiss-Prot
Match: RPR1B_HUMAN (Regulation of nuclear pre-mRNA domain-containing protein 1B OS=Homo sapiens GN=RPRD1B PE=1 SV=1)

HSP 1 Score: 120.9 bits (302), Expect = 4.5e-26
Identity = 106/330 (32.12%), Positives = 166/330 (50.30%), Query Frame = 1

Query: 6   FEAQVLSEKLSKLNNSQQSIESLSRWCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLA 65
           F    L +KLS+L+NSQQS+++LS W I HRK A  IV  W +    ++  ++++FLYLA
Sbjct: 4   FSESALEKKLSELSNSQQSVQTLSLWLIHHRKHAGPIVSVWHRELRKAKSNRKLTFLYLA 63

Query: 66  NDILQNSRRKGSEFVNEFWKVLPVSLKNVYDHGDESGKKAVARLVDIWEERKVFGSRG-Q 125
           ND++QNS+RKG EF  EF  VL  +  +V    DE  KK + RL++IW+ER V+G    Q
Sbjct: 64  NDVIQNSKRKGPEFTREFESVLVDAFSHVAREADEGCKKPLERLLNIWQERSVYGGEFIQ 123

Query: 126 SLKDEML-GKNPPPLPNSNGKSSNPIKIVKRDAHSVRIKL---AVGGVPEKILTAFQSVL 185
            LK  M   K+PPP      K++   K +KR    ++ +      G    +  +A   + 
Sbjct: 124 QLKLSMEDSKSPPP------KATEEKKSLKRTFQQIQEEEDDDYPGSYSPQDPSAGPLLT 183

Query: 186 DEHLNEDAALNNCSSATHHLNQIEEDLNASIAQGTQPRSALLDDLQEQE-------NVVQ 245
           +E +     L N +S    + Q      AS+ Q  Q  S LL+ + ++E        V +
Sbjct: 184 EELIKALQDLENAASGDATVRQ----KIASLPQEVQDVS-LLEKITDKEAAERLSKTVDE 243

Query: 246 QCI------GQLESVEATRSSLVSLLVEALQDQESKLELVRNQLQIARSQIELASNVRKR 305
            C+      G+L +    R  L  +LVE  Q+Q+  L     +L+  + ++   + VRK 
Sbjct: 244 ACLLLAEYNGRLAAELEDRRQLARMLVEYTQNQKDVLSEKEKKLEEYKQKLARVTQVRKE 303

Query: 306 IASPVPGPLATNIDLLTEMTHVTDAKLPSA 318
           + S +      ++ LL  +T    A LPSA
Sbjct: 304 LKSHIQS--LPDLSLLPNVTGGL-APLPSA 319
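
The percentages reported in the HSP summary line are the counts divided by the alignment length. A small sketch of that arithmetic, using the counts from the hit above (the function name `percent` is hypothetical):

```python
def percent(matches, alignment_length):
    """Return matches as a percentage of the alignment length, two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return f"{100 * matches / alignment_length:.2f}"

# Counts taken from the HSP summary line of the RPR1B_HUMAN hit.
print(percent(106, 330))  # 32.12  (identical residues)
print(percent(166, 330))  # 50.30  (identical or similar residues)
```

The alignment length (330) includes gap columns, so it can exceed the number of query residues covered (positions 6 to 318).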

BLAST of Cp4.1LG08g13750 vs. Swiss-Prot
Match: RPR1B_MOUSE (Regulation of nuclear pre-mRNA domain-containing protein 1B OS=Mus musculus GN=Rprd1b PE=1 SV=2)

HSP 1 Score: 120.9 bits (302), Expect = 4.5e-26
Identity = 106/330 (32.12%), Positives = 166/330 (50.30%), Query Frame = 1

Query: 6   FEAQVLSEKLSKLNNSQQSIESLSRWCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLA 65
           F    L +KLS+L+NSQQS+++LS W I HRK A  IV  W +    ++  ++++FLYLA
Sbjct: 4   FSESALEKKLSELSNSQQSVQTLSLWLIHHRKHAGPIVSVWHRELRKAKSNRKLTFLYLA 63

Query: 66  NDILQNSRRKGSEFVNEFWKVLPVSLKNVYDHGDESGKKAVARLVDIWEERKVFGSRG-Q 125
           ND++QNS+RKG EF  EF  VL  +  +V    DE  KK + RL++IW+ER V+G    Q
Sbjct: 64  NDVIQNSKRKGPEFTREFESVLVDAFSHVAREADEGCKKPLERLLNIWQERSVYGGEFIQ 123

Query: 126 SLKDEML-GKNPPPLPNSNGKSSNPIKIVKRDAHSVRIKL---AVGGVPEKILTAFQSVL 185
            LK  M   K+PPP      K++   K +KR    ++ +      G    +  +A   + 
Sbjct: 124 QLKLSMEDSKSPPP------KAAEEKKSLKRTFQQIQEEEDDDYPGSYSPQDPSAGPLLT 183

Query: 186 DEHLNEDAALNNCSSATHHLNQIEEDLNASIAQGTQPRSALLDDLQEQE-------NVVQ 245
           +E +     L N +S    + Q      AS+ Q  Q  S LL+ + ++E        V +
Sbjct: 184 EELIKALQDLENAASGDATVRQ----KIASLPQEVQDVS-LLEKITDKEAAERLSKTVDE 243

Query: 246 QCI------GQLESVEATRSSLVSLLVEALQDQESKLELVRNQLQIARSQIELASNVRKR 305
            C+      G+L +    R  L  +LVE  Q+Q+  L     +L+  + ++   + VRK 
Sbjct: 244 ACLLLAEYNGRLAAELEDRRQLARMLVEYTQNQKEVLSEKEKKLEEYKQKLARVTQVRKE 303

Query: 306 IASPVPGPLATNIDLLTEMTHVTDAKLPSA 318
           + S +      ++ LL  +T    A LPSA
Sbjct: 304 LKSHIQS--LPDLSLLPNVTGGL-APLPSA 319

BLAST of Cp4.1LG08g13750 vs. Swiss-Prot
Match: RPR1A_PONAB (Regulation of nuclear pre-mRNA domain-containing protein 1A OS=Pongo abelii GN=RPRD1A PE=2 SV=1)

HSP 1 Score: 113.2 bits (282), Expect = 9.4e-24
Identity = 87/301 (28.90%), Positives = 149/301 (49.50%), Query Frame = 1

Query: 6   FEAQVLSEKLSKLNNSQQSIESLSRWCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLA 65
           F    L +KLS+L+NSQQS+++LS W I HRK ++ IV  W++    ++  ++++FLYLA
Sbjct: 4   FSEAALEKKLSELSNSQQSVQTLSLWLIHHRKHSRPIVTVWERELRKAKPNRKLTFLYLA 63

Query: 66  NDILQNSRRKGSEFVNEFWKVLPVSLKNVYDHGDESGKKAVARLVDIWEERKVFGSRG-Q 125
           ND++QNS+RKG EF  +F  V+  + K+V    DES KK + R++ IWEER V+ +   +
Sbjct: 64  NDVIQNSKRKGPEFTKDFAPVIVEAFKHVSSETDESCKKHLGRVLSIWEERSVYENDVLE 123

Query: 126 SLKDEMLGKNPPPLPNSNGKSSNPIKIVKRDAHSVRIKLAVGGVPEKILTAFQSVLDEHL 185
            LK  + G   P             + +K D +     L     P + L   +++ D   
Sbjct: 124 QLKQALYGDKKP--------RKRTYEQIKVDENENCSSLGSPSEPPQTLDLVRALQD--- 183

Query: 186 NEDAALNNCSSATHHLNQIEEDLNASIAQGTQPRSALLDDLQEQEN-------VVQQCI- 245
                L N +S    ++Q    L   + +      +LLD + ++E+       V   C+ 
Sbjct: 184 -----LENAASGDAAVHQRIASLPVEVQE-----VSLLDKITDKESGERLSKMVEDACML 243

Query: 246 -----GQLESVEATRSSLVSLLVEALQDQESKLELVRNQLQIARSQIELASNVRKRIASP 293
                G+L +    R  L  +L + L+ Q+  L    ++L+  + ++   S VRK + S 
Sbjct: 244 LADYNGRLAAEIDDRKQLTRMLADFLRCQKEALAEKEHKLEEYKRKLARVSLVRKELRSR 283

BLAST of Cp4.1LG08g13750 vs. Swiss-Prot
Match: RPR1A_HUMAN (Regulation of nuclear pre-mRNA domain-containing protein 1A OS=Homo sapiens GN=RPRD1A PE=1 SV=1)

HSP 1 Score: 113.2 bits (282), Expect = 9.4e-24
Identity = 87/301 (28.90%), Positives = 149/301 (49.50%), Query Frame = 1

Query: 6   FEAQVLSEKLSKLNNSQQSIESLSRWCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLA 65
           F    L +KLS+L+NSQQS+++LS W I HRK ++ IV  W++    ++  ++++FLYLA
Sbjct: 4   FSEAALEKKLSELSNSQQSVQTLSLWLIHHRKHSRPIVTVWERELRKAKPNRKLTFLYLA 63

Query: 66  NDILQNSRRKGSEFVNEFWKVLPVSLKNVYDHGDESGKKAVARLVDIWEERKVFGSRG-Q 125
           ND++QNS+RKG EF  +F  V+  + K+V    DES KK + R++ IWEER V+ +   +
Sbjct: 64  NDVIQNSKRKGPEFTKDFAPVIVEAFKHVSSETDESCKKHLGRVLSIWEERSVYENDVLE 123

Query: 126 SLKDEMLGKNPPPLPNSNGKSSNPIKIVKRDAHSVRIKLAVGGVPEKILTAFQSVLDEHL 185
            LK  + G   P             + +K D +     L     P + L   +++ D   
Sbjct: 124 QLKQALYGDKKP--------RKRTYEQIKVDENENCSSLGSPSEPPQTLDLVRALQD--- 183

Query: 186 NEDAALNNCSSATHHLNQIEEDLNASIAQGTQPRSALLDDLQEQEN-------VVQQCI- 245
                L N +S    ++Q    L   + +      +LLD + ++E+       V   C+ 
Sbjct: 184 -----LENAASGDAAVHQRIASLPVEVQE-----VSLLDKITDKESGERLSKMVEDACML 243

Query: 246 -----GQLESVEATRSSLVSLLVEALQDQESKLELVRNQLQIARSQIELASNVRKRIASP 293
                G+L +    R  L  +L + L+ Q+  L    ++L+  + ++   S VRK + S 
Sbjct: 244 LADYNGRLAAEIDDRKQLTRMLADFLRCQKEALAEKEHKLEEYKRKLARVSLVRKELRSR 283

BLAST of Cp4.1LG08g13750 vs. Swiss-Prot
Match: RPR1A_BOVIN (Regulation of nuclear pre-mRNA domain-containing protein 1A OS=Bos taurus GN=RPRD1A PE=2 SV=2)

HSP 1 Score: 113.2 bits (282), Expect = 9.4e-24
Identity = 87/301 (28.90%), Positives = 149/301 (49.50%), Query Frame = 1

Query: 6   FEAQVLSEKLSKLNNSQQSIESLSRWCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLA 65
           F    L +KLS+L+NSQQS+++LS W I HRK ++ IV  W++    ++  ++++FLYLA
Sbjct: 4   FSEAALEKKLSELSNSQQSVQTLSLWLIHHRKHSRPIVTVWERELRKAKPNRKLTFLYLA 63

Query: 66  NDILQNSRRKGSEFVNEFWKVLPVSLKNVYDHGDESGKKAVARLVDIWEERKVFGSRG-Q 125
           ND++QNS+RKG EF  +F  V+  + K+V    DES KK + R++ IWEER V+ +   +
Sbjct: 64  NDVIQNSKRKGPEFTKDFAPVIVEAFKHVSSETDESCKKHLGRVLSIWEERSVYENDVLE 123

Query: 126 SLKDEMLGKNPPPLPNSNGKSSNPIKIVKRDAHSVRIKLAVGGVPEKILTAFQSVLDEHL 185
            LK  + G   P             + +K D +     L     P + L   +++ D   
Sbjct: 124 QLKQALYGDKKP--------RKRTYEQIKVDENENCSSLGSPSEPPQTLDLVRALQD--- 183

Query: 186 NEDAALNNCSSATHHLNQIEEDLNASIAQGTQPRSALLDDLQEQEN-------VVQQCI- 245
                L N +S    ++Q    L   + +      +LLD + ++E+       V   C+ 
Sbjct: 184 -----LENAASGDAAVHQRIASLPVEVQE-----VSLLDKITDKESGERLSKMVEDACML 243

Query: 246 -----GQLESVEATRSSLVSLLVEALQDQESKLELVRNQLQIARSQIELASNVRKRIASP 293
                G+L +    R  L  +L + L+ Q+  L    ++L+  + ++   S VRK + S 
Sbjct: 244 LADYNGRLAAEIDDRKQLTRMLADFLRCQKEALAEKEHKLEEYKRKLARVSLVRKELRSR 283

BLAST of Cp4.1LG08g13750 vs. TrEMBL
Match: A0A061F7B0_THECC (ENTH/VHS family protein OS=Theobroma cacao GN=TCM_025762 PE=4 SV=1)

HSP 1 Score: 677.2 bits (1746), Expect = 1.8e-191
Identity = 387/573 (67.54%), Positives = 453/573 (79.06%), Query Frame = 1

Query: 1   MNNEGFEAQVLSEKLSKLNNSQQSIESLSRWCISHRKKAKQIVETWDKLFNSSQKEQRVS 60
           M+++ F+ Q+L+EKLSKLNNSQQSIESLSRWCI+HRKKAKQIVETWDKLFNSSQKEQRVS
Sbjct: 1   MSSDTFDGQILTEKLSKLNNSQQSIESLSRWCITHRKKAKQIVETWDKLFNSSQKEQRVS 60

Query: 61  FLYLANDILQNSRRKGSEFVNEFWKVLPVSLKNVYDHGDESGKKAVARLVDIWEERKVFG 120
           FLYLANDILQNSRRKGSEFVNEFWKVLP +LK+VY++GDE GKKAV RLVDIWEERKVFG
Sbjct: 61  FLYLANDILQNSRRKGSEFVNEFWKVLPGALKHVYENGDEYGKKAVTRLVDIWEERKVFG 120

Query: 121 SRGQSLKDEMLGKNPPPLPN----SNGKSSNPIKIVKRDAHSVRIKLAVGGVPEKILTAF 180
           SRGQ+LKDEMLGKNPPP P     +NGKSSNPIKIVKRDAH+VRIKLAVGG+PEKILTA+
Sbjct: 121 SRGQNLKDEMLGKNPPPPPPPLSVNNGKSSNPIKIVKRDAHAVRIKLAVGGLPEKILTAY 180

Query: 181 QSVLDEHLNEDAALNNCSSATHHLNQIEEDLNASIAQGTQPRSALLDDLQEQENVVQQCI 240
           QSVL+++ NED ALN C++A   L++I ED+ +S+AQG Q  SALLD+LQ+QE  +QQCI
Sbjct: 181 QSVLEDNQNEDIALNKCNAAVQQLHKIGEDVESSLAQGNQNESALLDELQQQEIALQQCI 240

Query: 241 GQLESVEATRSSLVSLLVEALQDQESKLELVRNQLQIARSQIELASNVRKRIASP-VPGP 300
            QLE+VE  R++L+  L EALQ+QESKLEL+ +QLQ+AR QIE ASNVRKR+  P VPG 
Sbjct: 241 EQLENVETIRATLIFQLKEALQEQESKLELIHSQLQVARGQIEQASNVRKRLTLPTVPGH 300

Query: 301 LATNIDLLTEMTHVTDAKLPSAQSNTISSQSPLVQAMVSSAGPKSSEEENKRAAAAAVAA 360
           L+       E   V +  LPSAQ           Q ++S A   ++EE+NK+AAAAAVAA
Sbjct: 301 LSATTISTAEGAIVVEQNLPSAQPTGTPPHPHHAQPVLSFAPSMTTEEDNKKAAAAAVAA 360

Query: 361 KLAASTSSAQMLTSVLSSLVAEEAASMNGGLKSAGFAS-LSIFSPEKRQKLEKPMPISD- 420
           KLAASTSSAQMLTSVLSSLVAEEAASMNG LKS GF S LS+F PEKR KLEKPMP+SD 
Sbjct: 361 KLAASTSSAQMLTSVLSSLVAEEAASMNGSLKSGGFTSGLSMFPPEKRPKLEKPMPVSDA 420

Query: 421 -SSDGIGASFVTPMQQQQLTNVALAQSANIQPVSQANQSQASFA----PPPPPA----PP 480
            +SD    ++ +P+QQQ +TN+ LA S ++QP+SQ NQ QA FA    PPPPP     PP
Sbjct: 421 SNSDVSSTAYFSPLQQQAMTNMPLAPSTSVQPMSQGNQIQAPFASAPPPPPPPLSPANPP 480

Query: 481 VNQYAQSGGIM-GVLPYNFGAYSLPPPPPLPPHIVMGLSRPTSQPPQ-------QQPQQP 540
            +QY QS G+M GV+PY +GA +LPPPPPLPPHI M L+RP+SQP Q       QQPQ  
Sbjct: 481 ASQYVQSTGMMVGVMPYGYGANTLPPPPPLPPHIAMSLARPSSQPLQQLQSQSLQQPQSQ 540

Query: 541 QQSQPTSAGFYRPPGIGFYGQGQQSTPPPVPRQ 550
            Q QP + GFYRPPGIGFYGQ  QST PPVPRQ
Sbjct: 541 PQQQPATGGFYRPPGIGFYGQNPQST-PPVPRQ 572

BLAST of Cp4.1LG08g13750 vs. TrEMBL
Match: V4SRT7_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10031118mg PE=4 SV=1)

HSP 1 Score: 671.4 bits (1731), Expect = 1.0e-189
Identity = 381/562 (67.79%), Positives = 446/562 (79.36%), Query Frame = 1

Query: 1   MNNEGFEAQVLSEKLSKLNNSQQSIESLSRWCISHRKKAKQIVETWDKLFNSSQKEQRVS 60
           M+NE F+ Q+LSEKLSKLNNSQQSIESLSRWCI+HRKKAKQIVETWDK FNSSQKEQRVS
Sbjct: 1   MSNEAFDGQILSEKLSKLNNSQQSIESLSRWCITHRKKAKQIVETWDKSFNSSQKEQRVS 60

Query: 61  FLYLANDILQNSRRKGSEFVNEFWKVLPVSLKNVYDHGDESGKKAVARLVDIWEERKVFG 120
           FLYLANDILQNSRRKGSEFVNEFWKVLP +LK+VYD+GDE GKKAV RLVDIWEERKVFG
Sbjct: 61  FLYLANDILQNSRRKGSEFVNEFWKVLPAALKHVYDNGDEYGKKAVTRLVDIWEERKVFG 120

Query: 121 SRGQSLKDEMLGKNPPPLPNSNGKSSNPIKIVKRDAHSVRIKLAVGGVPEKILTAFQSVL 180
           SRGQ LKDEMLGKNPPP+P SNG+SSNPIKIVKRDA+SVRIKLAVG +PEKILTAFQSVL
Sbjct: 121 SRGQGLKDEMLGKNPPPVPVSNGRSSNPIKIVKRDANSVRIKLAVGALPEKILTAFQSVL 180

Query: 181 DEHLNEDAALNNCSSATHHLNQIEEDLNASIAQGTQPRSALLDDLQEQENVVQQCIGQLE 240
           DEH NE+ ALNNC++A HH+ ++ ED+  + +QG Q  SA +D LQEQENV+Q+C+ Q+E
Sbjct: 181 DEHPNEEVALNNCNAAVHHVGKVSEDVENTPSQGNQHGSASVDQLQEQENVLQECVRQME 240

Query: 241 SVEATRSSLVSLLVEALQDQESKLELVRNQLQIARSQIELASNVRKRIASP-VPGPLATN 300
           S E TR++LV  L EAL DQESKLEL+R +LQ+AR QIE AS  RKR+ +P VPG  +T+
Sbjct: 241 SAETTRAALVFQLKEALHDQESKLELIRTKLQVARGQIERASATRKRLTNPLVPGLPSTS 300

Query: 301 IDLLTEMTHVTDAKLPSAQSNTISSQSPLVQAMVSSAGPKSSEEENKRAAAAAVAAKLAA 360
           I    E T   +  LPS    +I  Q  + Q ++S A  K++EEENK+AAAAAVAAKLAA
Sbjct: 301 IAQGMEATRGAEPILPSVPPTSIPPQPSVNQPVISFAPLKTTEEENKKAAAAAVAAKLAA 360

Query: 361 STSSAQMLTSVLSSLVAEEAASMNGGLKSAGF-ASLSIFSPEKRQKLEKPMPISD--SSD 420
           STSSAQMLTSVLSSLVAEEAA  NG L S GF A LSIF PEKR KLEK    SD  +SD
Sbjct: 361 STSSAQMLTSVLSSLVAEEAAK-NGSLNSGGFTAGLSIFPPEKRPKLEKSTTASDVSTSD 420

Query: 421 GIGASFVTPMQQQQLTNVALAQSANIQPVSQANQSQASFAPPPPPAPPV-------NQYA 480
               ++ +P+QQQQ+TN+ + QS ++QP+SQ +Q Q+ FAP PPP PP+       +QY 
Sbjct: 421 VANTTYFSPLQQQQVTNMPVVQSTSMQPMSQVSQIQSQFAPAPPPPPPLSPATPPSSQYV 480

Query: 481 QSGGIM-GVLPYNFGAYSLPPPPPLPPHIVMGLSRPTSQPPQQQPQQPQQSQP-TSAGFY 540
           QS G+M GV+PY FGA +LPPPPPLPPH+ MGL+RP+    Q QPQQ QQ QP  + G+Y
Sbjct: 481 QSTGMMVGVMPYGFGANTLPPPPPLPPHMAMGLARPSQSLQQPQPQQLQQQQPAATGGYY 540

Query: 541 RPPGIGFYGQGQQSTPPPVPRQ 550
           RPPGIGFYGQ Q ST  PVPRQ
Sbjct: 541 RPPGIGFYGQSQPST-SPVPRQ 560

BLAST of Cp4.1LG08g13750 vs. TrEMBL
Match: A0A0A0LH72_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G815990 PE=4 SV=1)

HSP 1 Score: 664.1 bits (1712), Expect = 1.6e-187
Identity = 372/432 (86.11%), Positives = 391/432 (90.51%), Query Frame = 1

Query: 130 MLGKNPPP--LPNSNGKSSNPIKIVKRDAHSVRIKLAVGGVPEKILTAFQSVLDEHLNED 189
           MLGKNPPP  LP+SNGKSSNPIKIVKRDAHSVRIKLAVGGVPEKILTAFQSVLDEHLNED
Sbjct: 1   MLGKNPPPTPLPSSNGKSSNPIKIVKRDAHSVRIKLAVGGVPEKILTAFQSVLDEHLNED 60

Query: 190 AALNNCSSATHHLNQIEEDLNASIAQGTQPRSALLDDLQEQENVVQQCIGQLESVEATRS 249
           AAL NCSSATHHLNQIEEDLN S+AQGTQPRSALLDDLQ+QE V+Q+CI QLE VEATR+
Sbjct: 61  AALINCSSATHHLNQIEEDLNVSLAQGTQPRSALLDDLQDQETVIQECIRQLEGVEATRA 120

Query: 250 SLVSLLVEALQDQESKLELVRNQLQIARSQIELASNVRKRIAS-PVPGPLATNIDLLTEM 309
           SLVSLLVEALQDQESKLELVRNQLQ+ARSQIELASNVRKR  S  VPGP AT +DLLTEM
Sbjct: 121 SLVSLLVEALQDQESKLELVRNQLQVARSQIELASNVRKRFTSTTVPGPSATTVDLLTEM 180

Query: 310 THVTDAKLPSAQSNTISSQSPLVQAMVSSAGPKSSEEENKRAAAAAVAAKLAASTSSAQM 369
           TH TD+KL S Q N ISSQSPL+QAM S  GPK+SEEENKRAAAAAVAAKLAASTSSAQM
Sbjct: 181 THATDSKLSSVQQNIISSQSPLIQAMGSFPGPKTSEEENKRAAAAAVAAKLAASTSSAQM 240

Query: 370 LTSVLSSLVAEEAASMNGGLKSAGFASLSIFSPEKRQKLEKPMPISD--SSDGIGASFVT 429
           LTSVLSSLVAEEAASMNGGLKS+GF+SLS+FSPEKRQKLEKPMPISD  SSDG GASFV 
Sbjct: 241 LTSVLSSLVAEEAASMNGGLKSSGFSSLSLFSPEKRQKLEKPMPISDVSSSDGAGASFVA 300

Query: 430 PMQQQQLTNVALAQSANIQPVSQANQSQASFAPPPPP-------APPVNQYAQSGGIMGV 489
           PM QQQ+T++ LAQSAN QPVSQAN SQASFAPPPPP        PPVNQYAQSGG+MGV
Sbjct: 301 PM-QQQMTSMPLAQSANGQPVSQANPSQASFAPPPPPVPPSLSSTPPVNQYAQSGGLMGV 360

Query: 490 LPYNFGAYSLPPPPPLPPHIVMGLSRPTSQPPQQQPQQPQQSQPTSAGFYRPPGIGFYGQ 549
           LPYNFGAYSLPPPPPLPPHI MGLSRPTSQPP QQ QQPQQSQP S+GFYRPPGIGFYGQ
Sbjct: 361 LPYNFGAYSLPPPPPLPPHIAMGLSRPTSQPPPQQLQQPQQSQPASSGFYRPPGIGFYGQ 420

BLAST of Cp4.1LG08g13750 vs. TrEMBL
Match: F6HMV0_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_08s0056g00770 PE=4 SV=1)

HSP 1 Score: 663.7 bits (1711), Expect = 2.1e-187
Identity = 377/566 (66.61%), Positives = 449/566 (79.33%), Query Frame = 1

Query: 1   MNNEGFEAQVLSEKLSKLNNSQQSIESLSRWCISHRKKAKQIVETWDKLFNSSQKEQRVS 60
           M+N+ F+ Q+L+EKLSKLNNSQQSIESLSRWCISHRKKAKQIVETWDKLFNSSQKEQ VS
Sbjct: 107 MSNDVFDGQLLAEKLSKLNNSQQSIESLSRWCISHRKKAKQIVETWDKLFNSSQKEQCVS 166

Query: 61  FLYLANDILQNSRRKGSEFVNEFWKVLPVSLKNVYDHGDESGKKAVARLVDIWEERKVFG 120
           FLYLANDILQNSRRKGSEFVNEFWKVLP++LK+VYD+GDE GKKAV+RLVDIWEERKVFG
Sbjct: 167 FLYLANDILQNSRRKGSEFVNEFWKVLPLALKHVYDNGDEYGKKAVSRLVDIWEERKVFG 226

Query: 121 SRGQSLKDEMLGKNPPPLPNSNGKSSNPIKIVKRDAHSVRIKLAVGGVPEKILTAFQSVL 180
           SRGQ LKDEMLGK+PP L  SNGK+SNPIKIVKRD+ SVRIKL++GG+PEKI+TAFQ+V 
Sbjct: 227 SRGQGLKDEMLGKSPPLLV-SNGKNSNPIKIVKRDSQSVRIKLSIGGMPEKIVTAFQTVH 286

Query: 181 DEHLNEDAALNNCSSATHHLNQIEEDLNASIAQGTQPRSALLDDLQEQENVVQQCIGQLE 240
           DE +NE+A LN C +A  H+ ++E D   +  +G Q R+AL+D+L+EQEN++QQC+ QLE
Sbjct: 287 DEQVNEEAVLNKCKTAVQHVGKLEVDAGNTSGEGNQQRAALVDELKEQENILQQCVVQLE 346

Query: 241 SVEATRSSLVSLLVEALQDQESKLELVRNQLQIARSQIELASNVRKRIASP-VPGPLATN 300
           S EATR++LVS L EA+ DQESKL LVR QLQ+AR +IE A N+R+R+ SP V GP    
Sbjct: 347 SSEATRAALVSQLKEAVLDQESKLGLVRAQLQVARGRIEQAINMRQRLTSPTVAGPQTIR 406

Query: 301 IDLLTEMTHVTDAKLPSAQSNTISSQSPLVQAMVSSAGPKSSEEENKRAAAAAVAAKLAA 360
           ++  TE     +  +PS Q+ T   ++PL Q ++S A  K++EE++K+AAAAAVAAKLAA
Sbjct: 407 MNPQTEAPMAVEPNMPSVQATTTPPKAPLTQPVISFAPLKTTEEDSKKAAAAAVAAKLAA 466

Query: 361 STSSAQMLTSVLSSLVAEEAASMNGGLKSAGFASLSIFSPEKRQKLEKPMPISD--SSDG 420
           STSSAQMLTSVLSSLVAEEAAS NGGLKS+GFA  SIFSPEKR +LEKPMPISD  +SD 
Sbjct: 467 STSSAQMLTSVLSSLVAEEAAS-NGGLKSSGFA--SIFSPEKRPRLEKPMPISDGNNSDA 526

Query: 421 IGASFVTPMQQQQLTNVALAQSANIQPVSQANQSQASFAPPPPPAPPV-------NQYAQ 480
             AS+ TP+QQQ + N+ LA   ++ P+SQANQ QA F PPPPP PP+        QY Q
Sbjct: 527 GSASYFTPVQQQSMANMPLAPPTSVPPMSQANQMQAPFPPPPPPPPPLPLVNPPAKQYVQ 586

Query: 481 SGGIM-GVLPYNFGAYSLPPPPPLPPHIVMGLSRPTSQPPQQ----QPQQPQQSQ--PTS 540
           S G+M GV+PY +G  SLPPPPP+  H+ MGLS P  QPPQQ    Q QQ QQSQ   T 
Sbjct: 587 SSGMMVGVMPYGYGTTSLPPPPPMLSHVPMGLSAPAPQPPQQLQSPQLQQQQQSQQPATG 646

Query: 541 AGFYRPPGIGFYGQGQQSTPPPVPRQ 550
            G+YRPPGI FYGQ  Q TPPPVPRQ
Sbjct: 647 GGYYRPPGIAFYGQSHQPTPPPVPRQ 668

BLAST of Cp4.1LG08g13750 vs. TrEMBL
Match: A0A067L6A1_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_02639 PE=4 SV=1)

HSP 1 Score: 643.3 bits (1658), Expect = 2.9e-181
Identity = 369/566 (65.19%), Positives = 441/566 (77.92%), Query Frame = 1

Query: 1   MNNEGFEAQVLSEKLSKLNNSQQSIESLSRWCISHRKKAKQIVETWDKLFNSSQKEQRVS 60
           M+N+ F+ Q+L+EKLSKLNNSQQSIESLSRWCI HRKKA+QIVETWDKLF+SSQ+EQRVS
Sbjct: 1   MSNDVFDVQILNEKLSKLNNSQQSIESLSRWCILHRKKARQIVETWDKLFDSSQREQRVS 60

Query: 61  FLYLANDILQNSRRKGSEFVNEFWKVLPVSLKNVYDHGDESGKKAVARLVDIWEERKVFG 120
           FLYLANDILQNSRRKGSEFVNEFWKVLP SLK VY++GDE GKK V RLVDIWEERKVFG
Sbjct: 61  FLYLANDILQNSRRKGSEFVNEFWKVLPGSLKQVYENGDEHGKKVVTRLVDIWEERKVFG 120

Query: 121 SRGQSLKDEMLGKNPPPLPNSNGKSSNPIKIVKRDAHSVRIKLAVGGVPEKILTAFQSVL 180
           SRGQ LKDEMLGKNPPPL  SNGKSSN IKI+KRDAH+VRIKLAVGG+PEKILTAFQSV 
Sbjct: 121 SRGQGLKDEMLGKNPPPLLASNGKSSNSIKIMKRDAHTVRIKLAVGGLPEKILTAFQSVT 180

Query: 181 DEHLNEDAALNNCSSATHHLNQIEEDLNASIAQGTQPRSALLDDLQEQENVVQQCIGQLE 240
           DEHL+E+AALN  ++A   + +I E++     QG Q  S+ +++LQ QENV+QQC+G+LE
Sbjct: 181 DEHLDEEAALNESTAALSQVGKIREEMENGSTQGNQQGSSFVEELQVQENVLQQCVGKLE 240

Query: 241 SVEATRSSLVSLLVEALQDQESKLELVRNQLQIARSQIELASNVRKRIASP-VPGPLATN 300
           S EATR+ L+S L EALQDQESKL+++R +LQ AR QIE A ++R R+ S  VPGPL T 
Sbjct: 241 SAEATRAMLISQLKEALQDQESKLDVIRARLQAARVQIEQAVSLRNRLTSSIVPGPLTTI 300

Query: 301 IDLLTEMTHVTDAKLPSAQSNTISSQSPLVQAMVSSAGPKSSEEENKRAAAAAVAAKLAA 360
                +   V +  +   Q  +   Q  L Q +VS A  K+++E++K+AAAAAVAAKLAA
Sbjct: 301 TMPSADAAKVVEHSIAPVQPTSTPPQPQLTQPVVSFALMKTTDEDSKKAAAAAVAAKLAA 360

Query: 361 STSSAQMLTSVLSSLVAEEAASMNGGLKSAGFAS-LSIFSPEKRQKLEKPMPISDSSDGI 420
           STSSAQMLTSVLSSLVAEEAAS+NGGLKS GF + L++FSPEKR KL+KPMP   ++  +
Sbjct: 361 STSSAQMLTSVLSSLVAEEAASLNGGLKSTGFTTGLTMFSPEKRPKLDKPMPADVNNADM 420

Query: 421 G-ASFVTPMQQQQLTNVALA-QSANIQPVSQANQSQASFA-----PPPPPA----PPVNQ 480
           G +++ TP+QQQ  T + L   S+++Q +SQ+NQ Q SF      PPPPP     P  NQ
Sbjct: 421 GNSAYFTPLQQQSGTTMPLVPPSSSLQSMSQSNQIQTSFGALPPLPPPPPMSPANPATNQ 480

Query: 481 YAQSGGIM-GVLPYNFGAYSLPPPPPLPPHIVMGLSRPTSQPPQQ-QPQQPQQSQ--PTS 540
           Y QS G+M GVLPY +GA SLPPPP LPPHI MGL+RP +Q PQQ QPQQPQQ Q  P +
Sbjct: 481 YVQSTGMMVGVLPYGYGANSLPPPPSLPPHIAMGLARPAAQQPQQSQPQQPQQQQQTPGT 540

Query: 541 AGFYRPPGIGFYGQGQQSTPPPVPRQ 550
            G+YRPPGIGFYGQ  Q   PPVPRQ
Sbjct: 541 GGYYRPPGIGFYGQNHQPATPPVPRQ 566

BLAST of Cp4.1LG08g13750 vs. TAIR10
Match: AT5G10060.1 (AT5G10060.1 ENTH/VHS family protein)

HSP 1 Score: 287.7 bits (735), Expect = 1.6e-77
Identity = 210/543 (38.67%), Positives = 301/543 (55.43%), Query Frame = 1

Query: 6   FEAQVLSEKLSKLNNSQQSIESLSRWCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLA 65
           F  Q+L +KL+KLN+SQQSIE+LS WCI +R KA+ IV TW+K F+S++ +Q+V  LYLA
Sbjct: 5   FSDQILIDKLAKLNSSQQSIETLSHWCIFNRSKAELIVTTWEKQFHSTEMDQKVPLLYLA 64

Query: 66  NDILQNSRRKGSEFVNEFWKVLPVSLKNVYDHGDESGKKAVARLVDIWEERKVFGSRGQS 125
           NDILQNS+R+G+EFV EFW VLP +LK++   GD++GK AVAR++ IWEER+VFGSR +S
Sbjct: 65  NDILQNSKRQGNEFVQEFWNVLPKALKDIVSQGDDNGKSAVARVIKIWEERRVFGSRSKS 124

Query: 126 LKDEMLGKNPPPLPNSNGKSSNPIKIVKRDAHSVRIKLAV-GGVPEKILTAFQSVLDEHL 185
           LKD MLG++ P   + + K     K  KR++ S R KLA  GGV EKI +A+  V+ E+ 
Sbjct: 125 LKDVMLGEDVPLPLDISKKRPRGSKSSKRESKSSRTKLASSGGVAEKIASAYHLVVAENS 184

Query: 186 NEDAALNNCSSATHHLNQIEEDLN--ASIAQGTQPRSALLDDLQEQENVVQQCIGQLESV 245
           NE+A +N C SA   + ++E+D+    S A+    R +L  +L+E+E +++QCI +L+SV
Sbjct: 185 NEEAEMNKCKSAVKRIRKMEKDVEEACSTAKDNPKRKSLAKELEEEEYLLRQCIEKLKSV 244

Query: 246 EATRSSLVSLLVEALQDQESKLELVRNQLQIARSQIELASNVRKRIASPVPGPLATNIDL 305
           + +RSSLV+ L +AL++QES+L+ ++ Q+Q+A+ Q E A N++KR+          N + 
Sbjct: 245 QGSRSSLVNQLKDALREQESELDNLKAQIQVAKEQTEEAQNMQKRL----------NDED 304

Query: 306 LTEMTHVTDAKLPSAQSNTISSQSPLVQAMVSSAGPKSSEEENKRAAAAAVAAKLAASTS 365
            T         +     NT S Q+                    +   A++AA L ASTS
Sbjct: 305 YTSKQTTAATTITETNDNTKSGQA-------------------SKMTPASIAAMLTASTS 364

Query: 366 SAQMLTSVLSSLVAEEAASMNGGLKSAGFASLSIFSPEKRQKLEKPMPISDSSDGIGASF 425
           S  ++ SVLSS  AE  A+   GL                 K E  +P+SD         
Sbjct: 365 SHMIMQSVLSSFAAE--ATKTSGLS----------------KSESTVPVSD--------- 424

Query: 426 VTPMQQQQLTNVALAQSANIQPVSQANQSQASFAPPPPPA----PPVNQYAQSGGIMGVL 485
                    TN +     N Q  +   Q Q    P PPP     PPV     + G + ++
Sbjct: 425 ---------TNASFPSYNNSQNQTPTTQGQYHVIPNPPPPQFLKPPVMNNPYAFGNIPLM 469

Query: 486 PYNFGAYSLPPPPPLPPHIVMGLSRPTSQPPQQQPQQPQQSQPTSAGFYRPPGIGFYGQG 542
           P         PPPP PPH++ G  +P  Q PQ    Q  Q  PT    ++PPGI +YG  
Sbjct: 485 PPGL------PPPPPPPHLI-GNQQP--QIPQSNSAQQSQQGPT----FQPPGIMYYGAP 469

BLAST of Cp4.1LG08g13750 vs. TAIR10
Match: AT3G26990.1 (AT3G26990.1 ENTH/VHS family protein)

HSP 1 Score: 272.3 bits (695), Expect = 6.8e-73
Identity = 216/565 (38.23%), Positives = 310/565 (54.87%), Query Frame = 1

Query: 6   FEAQVLSEKLSKLNNSQQSIESLSRWCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLA 65
           F AQ+L EKL+KLNNSQ SIE+LS WCI H  KAK +VETW + F+ + +EQR+++LYLA
Sbjct: 5   FNAQILVEKLAKLNNSQASIETLSHWCIFHMNKAKHVVETWGRQFHCAPREQRLAYLYLA 64

Query: 66  NDILQNSRRKGSEFVNEFWKVLPVSLKNVYDHGDESGKKAVARLVDIWEERKVFGSRGQS 125
           NDILQNSRRKGSEFV EFWKVLP +L+++ ++GD+ G+K+  RLV+IWEERKVFGSRGQ 
Sbjct: 65  NDILQNSRRKGSEFVGEFWKVLPDALRDMIENGDDFGRKSARRLVNIWEERKVFGSRGQI 124

Query: 126 LKDEMLGKNPPPLPNSNGKSSNPIKIVKRDAHSVRIKLAV------GGVPEKILTAFQSV 185
           LK+E+LG+ P      NG          R+ + V +KL+V      G   EK+++A + +
Sbjct: 125 LKEELLGRQP-----ENG---------TRNGNLVPLKLSVPQRQVNGSTLEKVVSAVEVL 184

Query: 186 LDEHLNEDAALNNCSSATHHLNQIEEDLNASIAQGTQPRSALLDDLQEQENVVQQCIGQL 245
               ++EDA +   ++A  +L +  +++   ++ G  P  A++ +LQ Q  +++ CI QL
Sbjct: 185 HGVQIDEDALVGKSTNAAGYLEKATQEVERDLSSGHAPGPAVVKELQGQHVILRDCIEQL 244

Query: 246 ESVEATRSSLVSLLVEALQDQESKLELVRNQLQIARSQIELASNVRKRIA--SPVPGPLA 305
            ++E +R+SL+S L EALQ+QE KLE VRN LQIAR Q +   ++ +++        P A
Sbjct: 245 GAMETSRTSLISHLREALQEQELKLEQVRNHLQIARFQSDRTGDLCRQLLDHGGSSQPPA 304

Query: 306 TNIDLLTEMTHVTD-AKLPSAQSNTISSQSPLVQAMVSSAGPKSSEEENKRAAAAAVAAK 365
           T  +   E+  V+  A  P + +++   QS  V   + ++ P  S E+ ++ AAAAV AK
Sbjct: 305 TEEEESKEVIKVSSTAAAPQSFTHSDVEQSAPV---MFASNPTQSLEDPRKTAAAAVVAK 364

Query: 366 LAASTSSAQMLTSVLSSLVAEEAASMNGGLKSAGFASLSIFSPEKRQKLEKPMPISDSSD 425
           L ASTSSA+ML+ VLSSL +E     N         S   F PEKR KL+          
Sbjct: 365 LTASTSSAEMLSYVLSSLASEGIIGNNNPPAVTETLSSVDFPPEKRPKLQNH-------- 424

Query: 426 GIGASFVTPMQQQQLTNVALAQSANIQPVSQANQSQASFAPPPPPAPPVNQYAQSGGIMG 485
               S+++P  Q   T    + S   QP+           PPPPP     Q+ Q      
Sbjct: 425 --DQSYLSPHHQNTATT---SSSTPPQPL-----------PPPPPFQLQPQFLQ------ 484

Query: 486 VLPYNFGAYSLPPPPPL---PPHIVMGLSRPTSQPPQQQ--------PQQPQQSQPTSAG 545
                     L PP P+   P +  +  S  T+Q  QQ+         Q    S P+   
Sbjct: 485 ---------PLQPPGPVNHTPFNYTIATSTATTQQQQQEQGPWVPGLTQLSTTSAPSENS 513

Query: 546 FYRPPG-IGFYGQGQQSTPPPVPRQ 550
           + +  G  GFYG        PV RQ
Sbjct: 545 YQKFQGQDGFYGINSSVPITPVTRQ 513

BLAST of Cp4.1LG08g13750 vs. TAIR10
Match: AT5G65180.1 (AT5G65180.1 ENTH/VHS family protein)

HSP 1 Score: 266.5 bits (680), Expect = 3.7e-71
Identity = 148/333 (44.44%), Positives = 229/333 (68.77%), Query Frame = 1

Query: 6   FEAQVLSEKLSKLNNSQQSIESLSRWCISHRKKAKQIVETWDKLFNSSQKEQRVSFLYLA 65
           F  ++L + L+KLN++QQSI++LS+WCI HR +A+ +V TW+K F+S+Q  Q+V  LYLA
Sbjct: 5   FSEEILIDNLAKLNSTQQSIQTLSQWCIVHRSEAELVVTTWEKQFHSTQIGQKVPLLYLA 64

Query: 66  NDILQNSRRKGSEFVNEFWKVLPVSLKNVYDHGDESGKKAVARLVDIWEERKVFGSRGQS 125
           NDILQNS+R+G+EFV EFWKVLP +LK++   GD+ GK  V+RLV+IWEER+VFGSR +S
Sbjct: 65  NDILQNSKRQGNEFVQEFWKVLPGALKDIVSLGDDYGKGVVSRLVNIWEERRVFGSRSKS 124

Query: 126 LKDEMLGKNPPPLPNSNGKSSNPIKIVKRDAHSVRIKLAVGGVPEKILTAFQSVLDEHLN 185
           LKD ML +  PP  + + K     K  KRD+ S + KL+ GGV EKI++AF  V  E+ N
Sbjct: 125 LKDVMLSEEAPPPLDVSKKRFRGSKSAKRDSKSTKTKLSSGGVSEKIVSAFNLVRAENSN 184

Query: 186 EDAALNNCSSATHHLNQIEEDLNASIAQGTQPR-SALLDDLQEQENVVQQCIGQLESVEA 245
           E+  +N C SA   + ++E+D+  + +    PR  +L  +L+E+EN+++Q + +L+SVE 
Sbjct: 185 EETEMNKCKSAVRRIRKMEKDVEDACSTAKDPRKESLAKELEEEENILRQSVEKLKSVEE 244

Query: 246 TRSSLVSLLVEALQDQESKLELVRNQLQIARSQIELASNVRKRIASPVPGPLATNIDLLT 305
           +R+SLV+ L EAL++QES+LE +++Q+Q+A+ Q E A N++KR+ +    P+  N     
Sbjct: 245 SRTSLVNHLREALREQESELENLQSQIQVAQEQTEEAQNMQKRLNNET--PVNNNNGTSG 304

Query: 306 EMTHVTDAKLPS-AQSNTISSQSPLVQAMVSSA 337
           +   +T A + + A+  T S+ S ++   V S+
Sbjct: 305 QSAKITPASIAAMAEMLTSSTNSSMIMHSVLSS 335

BLAST of Cp4.1LG08g13750 vs. NCBI nr
Match: gi|449437658|ref|XP_004136608.1| (PREDICTED: regulation of nuclear pre-mRNA domain-containing protein 1B-like [Cucumis sativus])

HSP 1 Score: 902.9 bits (2332), Expect = 2.9e-259
Identity = 494/561 (88.06%), Positives = 517/561 (92.16%), Query Frame = 1

Query: 1   MNNEGFEAQVLSEKLSKLNNSQQSIESLSRWCISHRKKAKQIVETWDKLFNSSQKEQRVS 60
           MNNE FEAQVL+EKLSKLNNSQQSIESLS+WCISHRKKAKQIVETWDKLFNSSQKEQRVS
Sbjct: 1   MNNEVFEAQVLAEKLSKLNNSQQSIESLSKWCISHRKKAKQIVETWDKLFNSSQKEQRVS 60

Query: 61  FLYLANDILQNSRRKGSEFVNEFWKVLPVSLKNVYDHGDESGKKAVARLVDIWEERKVFG 120
           FLYLANDILQNSRRKGSEFVNEFWKVLP +LK VYDHGDESGKKAVARLV+IWEERKVFG
Sbjct: 61  FLYLANDILQNSRRKGSEFVNEFWKVLPGALKYVYDHGDESGKKAVARLVNIWEERKVFG 120

Query: 121 SRGQSLKDEMLGKNPPP--LPNSNGKSSNPIKIVKRDAHSVRIKLAVGGVPEKILTAFQS 180
           SRGQSLKDEMLGKNPPP  LP+SNGKSSNPIKIVKRDAHSVRIKLAVGGVPEKILTAFQS
Sbjct: 121 SRGQSLKDEMLGKNPPPTPLPSSNGKSSNPIKIVKRDAHSVRIKLAVGGVPEKILTAFQS 180

Query: 181 VLDEHLNEDAALNNCSSATHHLNQIEEDLNASIAQGTQPRSALLDDLQEQENVVQQCIGQ 240
           VLDEHLNEDAAL NCSSATHHLNQIEEDLN S+AQGTQPRSALLDDLQ+QE V+Q+CI Q
Sbjct: 181 VLDEHLNEDAALINCSSATHHLNQIEEDLNVSLAQGTQPRSALLDDLQDQETVIQECIRQ 240

Query: 241 LESVEATRSSLVSLLVEALQDQESKLELVRNQLQIARSQIELASNVRKRIAS-PVPGPLA 300
           LE VEATR+SLVSLLVEALQDQESKLELVRNQLQ+ARSQIELASNVRKR  S  VPGP A
Sbjct: 241 LEGVEATRASLVSLLVEALQDQESKLELVRNQLQVARSQIELASNVRKRFTSTTVPGPSA 300

Query: 301 TNIDLLTEMTHVTDAKLPSAQSNTISSQSPLVQAMVSSAGPKSSEEENKRAAAAAVAAKL 360
           T +DLLTEMTH TD+KL S Q N ISSQSPL+QAM S  GPK+SEEENKRAAAAAVAAKL
Sbjct: 301 TTVDLLTEMTHATDSKLSSVQQNIISSQSPLIQAMGSFPGPKTSEEENKRAAAAAVAAKL 360

Query: 361 AASTSSAQMLTSVLSSLVAEEAASMNGGLKSAGFASLSIFSPEKRQKLEKPMPISD--SS 420
           AASTSSAQMLTSVLSSLVAEEAASMNGGLKS+GF+SLS+FSPEKRQKLEKPMPISD  SS
Sbjct: 361 AASTSSAQMLTSVLSSLVAEEAASMNGGLKSSGFSSLSLFSPEKRQKLEKPMPISDVSSS 420

Query: 421 DGIGASFVTPMQQQQLTNVALAQSANIQPVSQANQSQASFAPPPPP-------APPVNQY 480
           DG GASFV PM QQQ+T++ LAQSAN QPVSQAN SQASFAPPPPP        PPVNQY
Sbjct: 421 DGAGASFVAPM-QQQMTSMPLAQSANGQPVSQANPSQASFAPPPPPVPPSLSSTPPVNQY 480

Query: 481 AQSGGIMGVLPYNFGAYSLPPPPPLPPHIVMGLSRPTSQPPQQQPQQPQQSQPTSAGFYR 540
           AQSGG+MGVLPYNFGAYSLPPPPPLPPHI MGLSRPTSQPP QQ QQPQQSQP S+GFYR
Sbjct: 481 AQSGGLMGVLPYNFGAYSLPPPPPLPPHIAMGLSRPTSQPPPQQLQQPQQSQPASSGFYR 540

Query: 541 PPGIGFYGQGQQSTPPPVPRQ 550
           PPGIGFYGQGQQSTPPPVPRQ
Sbjct: 541 PPGIGFYGQGQQSTPPPVPRQ 560

BLAST of Cp4.1LG08g13750 vs. NCBI nr
Match: gi|659084984|ref|XP_008443180.1| (PREDICTED: regulation of nuclear pre-mRNA domain-containing protein 1B-like [Cucumis melo])

HSP 1 Score: 895.6 bits (2313), Expect = 4.7e-257
Identity = 493/561 (87.88%), Positives = 514/561 (91.62%), Query Frame = 1

Query: 1   MNNEGFEAQVLSEKLSKLNNSQQSIESLSRWCISHRKKAKQIVETWDKLFNSSQKEQRVS 60
           MNNE FEAQVL+EKLSKLNNSQQSIESLS+WCISHRKKAKQIVETWDKLFNSSQKEQRVS
Sbjct: 1   MNNEVFEAQVLAEKLSKLNNSQQSIESLSKWCISHRKKAKQIVETWDKLFNSSQKEQRVS 60

Query: 61  FLYLANDILQNSRRKGSEFVNEFWKVLPVSLKNVYDHGDESGKKAVARLVDIWEERKVFG 120
           FLYLANDILQNSRRKGSEFVNEFWKVLP +LK VYDHGDESGKKAVARLV+IWEERKVFG
Sbjct: 61  FLYLANDILQNSRRKGSEFVNEFWKVLPGALKYVYDHGDESGKKAVARLVNIWEERKVFG 120

Query: 121 SRGQSLKDEMLGKNPPP--LPNSNGKSSNPIKIVKRDAHSVRIKLAVGGVPEKILTAFQS 180
           SRGQSLKDEMLGKNPPP  LP+SNGKSSNPIKIVKRDAHSVRIKLAVGGVPEKILTAFQS
Sbjct: 121 SRGQSLKDEMLGKNPPPTPLPSSNGKSSNPIKIVKRDAHSVRIKLAVGGVPEKILTAFQS 180

Query: 181 VLDEHLNEDAALNNCSSATHHLNQIEEDLNASIAQGTQPRSALLDDLQEQENVVQQCIGQ 240
           VLDEHLNEDAALNNCSSATHHLNQIEEDLN S+AQGTQPRS LLDDLQ+QE V+Q+CI Q
Sbjct: 181 VLDEHLNEDAALNNCSSATHHLNQIEEDLNVSLAQGTQPRSGLLDDLQDQETVIQECIRQ 240

Query: 241 LESVEATRSSLVSLLVEALQDQESKLELVRNQLQIARSQIELASNVRKRIAS-PVPGPLA 300
           LE VEATR+SLVSLLVEALQDQESKLELVRNQLQ+ARSQIELASNVRKR  S   PGP A
Sbjct: 241 LEGVEATRASLVSLLVEALQDQESKLELVRNQLQVARSQIELASNVRKRFTSTTAPGPSA 300

Query: 301 TNIDLLTEMTHVTDAKLPSAQSNTISSQSPLVQAMVSSAGPKSSEEENKRAAAAAVAAKL 360
           T IDLLTEMTHVTD KL S Q N ISSQ PLVQAM S  GPK+SEEENKRAAAAAVAAKL
Sbjct: 301 TTIDLLTEMTHVTDTKLSSVQQNIISSQPPLVQAMGSFPGPKTSEEENKRAAAAAVAAKL 360

Query: 361 AASTSSAQMLTSVLSSLVAEEAASMNGGLKSAGFASLSIFSPEKRQKLEKPMPISD--SS 420
           AASTSSAQMLTSVLSSLVAEEAASMNGGLKS+GF+SL  FSPEKRQKLEKPMPISD  SS
Sbjct: 361 AASTSSAQMLTSVLSSLVAEEAASMNGGLKSSGFSSL--FSPEKRQKLEKPMPISDVSSS 420

Query: 421 DGIGASFVTPMQQQQLTNVALAQSANIQPVSQANQSQASFAPPPPP-------APPVNQY 480
           DG+GASFV PM QQQ+T++ LAQSAN QPVSQAN SQASFAPPPPP        PPVNQY
Sbjct: 421 DGVGASFVAPM-QQQMTSMPLAQSANGQPVSQANPSQASFAPPPPPVPPSLSSTPPVNQY 480

Query: 481 AQSGGIMGVLPYNFGAYSLPPPPPLPPHIVMGLSRPTSQPPQQQPQQPQQSQPTSAGFYR 540
           AQSGG++GVLPYNFGAYSLPPPPPLPPHI MGLSRPTSQPP QQ Q PQQSQPTS+GFYR
Sbjct: 481 AQSGGLIGVLPYNFGAYSLPPPPPLPPHIAMGLSRPTSQPPPQQLQPPQQSQPTSSGFYR 540

Query: 541 PPGIGFYGQGQQSTPPPVPRQ 550
           PPGIGFYGQGQQSTPPPVPRQ
Sbjct: 541 PPGIGFYGQGQQSTPPPVPRQ 558

BLAST of Cp4.1LG08g13750 vs. NCBI nr
Match: gi|590640232|ref|XP_007029897.1| (ENTH/VHS family protein [Theobroma cacao])

HSP 1 Score: 677.2 bits (1746), Expect = 2.6e-191
Identity = 387/573 (67.54%), Positives = 453/573 (79.06%), Query Frame = 1

Query: 1   MNNEGFEAQVLSEKLSKLNNSQQSIESLSRWCISHRKKAKQIVETWDKLFNSSQKEQRVS 60
           M+++ F+ Q+L+EKLSKLNNSQQSIESLSRWCI+HRKKAKQIVETWDKLFNSSQKEQRVS
Sbjct: 1   MSSDTFDGQILTEKLSKLNNSQQSIESLSRWCITHRKKAKQIVETWDKLFNSSQKEQRVS 60

Query: 61  FLYLANDILQNSRRKGSEFVNEFWKVLPVSLKNVYDHGDESGKKAVARLVDIWEERKVFG 120
           FLYLANDILQNSRRKGSEFVNEFWKVLP +LK+VY++GDE GKKAV RLVDIWEERKVFG
Sbjct: 61  FLYLANDILQNSRRKGSEFVNEFWKVLPGALKHVYENGDEYGKKAVTRLVDIWEERKVFG 120

Query: 121 SRGQSLKDEMLGKNPPPLPN----SNGKSSNPIKIVKRDAHSVRIKLAVGGVPEKILTAF 180
           SRGQ+LKDEMLGKNPPP P     +NGKSSNPIKIVKRDAH+VRIKLAVGG+PEKILTA+
Sbjct: 121 SRGQNLKDEMLGKNPPPPPPPLSVNNGKSSNPIKIVKRDAHAVRIKLAVGGLPEKILTAY 180

Query: 181 QSVLDEHLNEDAALNNCSSATHHLNQIEEDLNASIAQGTQPRSALLDDLQEQENVVQQCI 240
           QSVL+++ NED ALN C++A   L++I ED+ +S+AQG Q  SALLD+LQ+QE  +QQCI
Sbjct: 181 QSVLEDNQNEDIALNKCNAAVQQLHKIGEDVESSLAQGNQNESALLDELQQQEIALQQCI 240

Query: 241 GQLESVEATRSSLVSLLVEALQDQESKLELVRNQLQIARSQIELASNVRKRIASP-VPGP 300
            QLE+VE  R++L+  L EALQ+QESKLEL+ +QLQ+AR QIE ASNVRKR+  P VPG 
Sbjct: 241 EQLENVETIRATLIFQLKEALQEQESKLELIHSQLQVARGQIEQASNVRKRLTLPTVPGH 300

Query: 301 LATNIDLLTEMTHVTDAKLPSAQSNTISSQSPLVQAMVSSAGPKSSEEENKRAAAAAVAA 360
           L+       E   V +  LPSAQ           Q ++S A   ++EE+NK+AAAAAVAA
Sbjct: 301 LSATTISTAEGAIVVEQNLPSAQPTGTPPHPHHAQPVLSFAPSMTTEEDNKKAAAAAVAA 360

Query: 361 KLAASTSSAQMLTSVLSSLVAEEAASMNGGLKSAGFAS-LSIFSPEKRQKLEKPMPISD- 420
           KLAASTSSAQMLTSVLSSLVAEEAASMNG LKS GF S LS+F PEKR KLEKPMP+SD 
Sbjct: 361 KLAASTSSAQMLTSVLSSLVAEEAASMNGSLKSGGFTSGLSMFPPEKRPKLEKPMPVSDA 420

Query: 421 -SSDGIGASFVTPMQQQQLTNVALAQSANIQPVSQANQSQASFA----PPPPPA----PP 480
            +SD    ++ +P+QQQ +TN+ LA S ++QP+SQ NQ QA FA    PPPPP     PP
Sbjct: 421 SNSDVSSTAYFSPLQQQAMTNMPLAPSTSVQPMSQGNQIQAPFASAPPPPPPPLSPANPP 480

Query: 481 VNQYAQSGGIM-GVLPYNFGAYSLPPPPPLPPHIVMGLSRPTSQPPQ-------QQPQQP 540
            +QY QS G+M GV+PY +GA +LPPPPPLPPHI M L+RP+SQP Q       QQPQ  
Sbjct: 481 ASQYVQSTGMMVGVMPYGYGANTLPPPPPLPPHIAMSLARPSSQPLQQLQSQSLQQPQSQ 540

Query: 541 QQSQPTSAGFYRPPGIGFYGQGQQSTPPPVPRQ 550
            Q QP + GFYRPPGIGFYGQ  QST PPVPRQ
Sbjct: 541 PQQQPATGGFYRPPGIGFYGQNPQST-PPVPRQ 572

BLAST of Cp4.1LG08g13750 vs. NCBI nr
Match: gi|567889575|ref|XP_006437308.1| (hypothetical protein CICLE_v10031118mg [Citrus clementina])

HSP 1 Score: 671.4 bits (1731), Expect = 1.4e-189
Identity = 381/562 (67.79%), Positives = 446/562 (79.36%), Query Frame = 1

Query: 1   MNNEGFEAQVLSEKLSKLNNSQQSIESLSRWCISHRKKAKQIVETWDKLFNSSQKEQRVS 60
           M+NE F+ Q+LSEKLSKLNNSQQSIESLSRWCI+HRKKAKQIVETWDK FNSSQKEQRVS
Sbjct: 1   MSNEAFDGQILSEKLSKLNNSQQSIESLSRWCITHRKKAKQIVETWDKSFNSSQKEQRVS 60

Query: 61  FLYLANDILQNSRRKGSEFVNEFWKVLPVSLKNVYDHGDESGKKAVARLVDIWEERKVFG 120
           FLYLANDILQNSRRKGSEFVNEFWKVLP +LK+VYD+GDE GKKAV RLVDIWEERKVFG
Sbjct: 61  FLYLANDILQNSRRKGSEFVNEFWKVLPAALKHVYDNGDEYGKKAVTRLVDIWEERKVFG 120

Query: 121 SRGQSLKDEMLGKNPPPLPNSNGKSSNPIKIVKRDAHSVRIKLAVGGVPEKILTAFQSVL 180
           SRGQ LKDEMLGKNPPP+P SNG+SSNPIKIVKRDA+SVRIKLAVG +PEKILTAFQSVL
Sbjct: 121 SRGQGLKDEMLGKNPPPVPVSNGRSSNPIKIVKRDANSVRIKLAVGALPEKILTAFQSVL 180

Query: 181 DEHLNEDAALNNCSSATHHLNQIEEDLNASIAQGTQPRSALLDDLQEQENVVQQCIGQLE 240
           DEH NE+ ALNNC++A HH+ ++ ED+  + +QG Q  SA +D LQEQENV+Q+C+ Q+E
Sbjct: 181 DEHPNEEVALNNCNAAVHHVGKVSEDVENTPSQGNQHGSASVDQLQEQENVLQECVRQME 240

Query: 241 SVEATRSSLVSLLVEALQDQESKLELVRNQLQIARSQIELASNVRKRIASP-VPGPLATN 300
           S E TR++LV  L EAL DQESKLEL+R +LQ+AR QIE AS  RKR+ +P VPG  +T+
Sbjct: 241 SAETTRAALVFQLKEALHDQESKLELIRTKLQVARGQIERASATRKRLTNPLVPGLPSTS 300

Query: 301 IDLLTEMTHVTDAKLPSAQSNTISSQSPLVQAMVSSAGPKSSEEENKRAAAAAVAAKLAA 360
           I    E T   +  LPS    +I  Q  + Q ++S A  K++EEENK+AAAAAVAAKLAA
Sbjct: 301 IAQGMEATRGAEPILPSVPPTSIPPQPSVNQPVISFAPLKTTEEENKKAAAAAVAAKLAA 360

Query: 361 STSSAQMLTSVLSSLVAEEAASMNGGLKSAGF-ASLSIFSPEKRQKLEKPMPISD--SSD 420
           STSSAQMLTSVLSSLVAEEAA  NG L S GF A LSIF PEKR KLEK    SD  +SD
Sbjct: 361 STSSAQMLTSVLSSLVAEEAAK-NGSLNSGGFTAGLSIFPPEKRPKLEKSTTASDVSTSD 420

Query: 421 GIGASFVTPMQQQQLTNVALAQSANIQPVSQANQSQASFAPPPPPAPPV-------NQYA 480
               ++ +P+QQQQ+TN+ + QS ++QP+SQ +Q Q+ FAP PPP PP+       +QY 
Sbjct: 421 VANTTYFSPLQQQQVTNMPVVQSTSMQPMSQVSQIQSQFAPAPPPPPPLSPATPPSSQYV 480

Query: 481 QSGGIM-GVLPYNFGAYSLPPPPPLPPHIVMGLSRPTSQPPQQQPQQPQQSQP-TSAGFY 540
           QS G+M GV+PY FGA +LPPPPPLPPH+ MGL+RP+    Q QPQQ QQ QP  + G+Y
Sbjct: 481 QSTGMMVGVMPYGFGANTLPPPPPLPPHMAMGLARPSQSLQQPQPQQLQQQQPAATGGYY 540

Query: 541 RPPGIGFYGQGQQSTPPPVPRQ 550
           RPPGIGFYGQ Q ST  PVPRQ
Sbjct: 541 RPPGIGFYGQSQPST-SPVPRQ 560

BLAST of Cp4.1LG08g13750 vs. NCBI nr
Match: gi|645270837|ref|XP_008240637.1| (PREDICTED: regulation of nuclear pre-mRNA domain-containing protein 1B-like [Prunus mume])

HSP 1 Score: 666.8 bits (1719), Expect = 3.5e-188
Identity = 379/579 (65.46%), Positives = 444/579 (76.68%), Query Frame = 1

Query: 2   NNEGFEAQVLSEKLSKLNNSQQSIESLSRWCISHRKKAKQIVETWDKLFNSSQKEQRVSF 61
           NNE F+ Q+L++KL+KLN+SQQSIESLSRWCISHRKKAKQIVETWDK FNSSQK+QRVSF
Sbjct: 3   NNEAFDGQILADKLTKLNSSQQSIESLSRWCISHRKKAKQIVETWDKCFNSSQKDQRVSF 62

Query: 62  LYLANDILQNSRRKGSEFVNEFWKVLPVSLKNVYDHGDESGKKAVARLVDIWEERKVFGS 121
           LYLANDILQNSRRKGSEFVNEFWK LP +LK+VY++GD  GKK   RLVDIWEERKVFGS
Sbjct: 63  LYLANDILQNSRRKGSEFVNEFWKFLPSALKHVYENGDGHGKKVATRLVDIWEERKVFGS 122

Query: 122 RGQSLKDEMLGKNPPPLPNSNGKSSNPIKIVKRDAHSVRIKLAVGGVPEKILTAFQSVLD 181
           RGQSLKDEM+GKNPP LP SNGKSSNPIKIVKRDAHSVRIKLAVGG+PEKILTAFQ VL+
Sbjct: 123 RGQSLKDEMMGKNPPALPVSNGKSSNPIKIVKRDAHSVRIKLAVGGLPEKILTAFQPVLE 182

Query: 182 EHLNEDAALNNCSSATHHLNQIEEDLNASIAQGTQPRSALLDDLQEQENVVQQCIGQLES 241
           EHL+E+AALN CS+A HH+ +I+ED+  ++  GTQ  S LLDDL+EQE+V+ Q +GQLE+
Sbjct: 183 EHLSEEAALNKCSAALHHVGKIDEDVENTLTHGTQQGSTLLDDLKEQEDVLNQSVGQLEN 242

Query: 242 VEATRSSLVSLLVEALQDQESKLELVRNQLQIARSQIELASNVRKRIA-SPVPGPLATNI 301
           VEATRS+LVS L EALQDQES+LELVR QLQ+AR QIE   N+++R+  SPV  P +   
Sbjct: 243 VEATRSALVSQLKEALQDQESELELVRTQLQVARHQIEKLGNIKRRLTLSPVVKPQSNVT 302

Query: 302 DLLTEMTHVTDAKLPSAQSNTISSQSPLVQAMVSSAGPKSSEEENKRAAAAAVAAKLAAS 361
           ++ TE T V +  L   Q + I  Q P  Q ++S A  K+++EENK+AAAAAVAAKLAAS
Sbjct: 303 NMTTESTRVLEPNLSLVQPSGIPPQPP-TQPVISFASVKTTDEENKKAAAAAVAAKLAAS 362

Query: 362 TSSAQMLTSVLSSLVAEEAASMNGGLKSAGFAS-LSIFSPEKRQKLEKPMPISD--SSDG 421
           TSSAQMLTSVLSSLVAEEAASM G L SA F S LS+F PEKR KLEK M  S+  + D 
Sbjct: 363 TSSAQMLTSVLSSLVAEEAASMTGSLTSAAFTSGLSMFPPEKRPKLEKQMSASELNNPDV 422

Query: 422 IGASFVTPMQQQQLTNVALAQSANIQPVSQANQSQASFAPPPPP---------APPVNQY 481
              ++ TP+QQQ +TNV  A  A +QP+SQANQ Q +F PPPPP          PP NQY
Sbjct: 423 GNTAYFTPLQQQTMTNVQHAPPATMQPLSQANQMQNTFGPPPPPPPAPSASPATPPANQY 482

Query: 482 AQSGGIM-GVLPYNFGAYSLPPPPPLPPHIVMGLSR-----------------PTSQPPQ 541
           AQS G+M GV+PY +G+ +LPPPPP+PPHI MGLSR                 P  QP Q
Sbjct: 483 AQSAGLMVGVMPYGYGSNTLPPPPPIPPHISMGLSRPGQPQQQQQQQQQQQHQPQPQPQQ 542

Query: 542 QQPQQPQQSQPTSAGFYRPPGIGFYGQGQQSTPPPVPRQ 550
           QQ QQ QQ QP + G+YRP G+GFYGQ  QS   PVPRQ
Sbjct: 543 QQQQQQQQLQPATGGYYRPLGMGFYGQSNQSNTQPVPRQ 580
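
Each alignment above is preceded by two statistics lines (bit score, raw score, and E-value on the first; identity and positives counts on the second). As a minimal sketch, assuming those lines keep the layout shown in this report, the percentages can be recomputed from the raw counts to check them. The function name `parse_hsp` and the regular expressions are illustrative only and are not part of any BLAST tool.

```python
import re

# First statistics line: "HSP 1 Score: 663.7 bits (1711), Expect = 2.1e-187"
SCORE_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*Expect\s*=\s*(?P<evalue>\S+)"
)
# Second statistics line; "Posi?tives" accepts both "Positives" and the
# "Postives" spelling that some report dumps use.
IDENT_RE = re.compile(
    r"Identity\s*=\s*(?P<ident>\d+)/(?P<length>\d+).*?"
    r"Posi?tives\s*=\s*(?P<pos>\d+)/(?P<plen>\d+)"
)


def parse_hsp(score_line, identity_line):
    """Return the HSP statistics as a dict, with percentages recomputed from counts."""
    score = SCORE_RE.search(score_line)
    ident = IDENT_RE.search(identity_line)
    if score is None or ident is None:
        raise ValueError("not an HSP statistics line pair")
    length = int(ident["length"])
    return {
        "bits": float(score["bits"]),
        "raw_score": int(score["raw"]),
        "evalue": float(score["evalue"]),
        "aligned_length": length,
        "identity_pct": round(100.0 * int(ident["ident"]) / length, 2),
        "positives_pct": round(100.0 * int(ident["pos"]) / int(ident["plen"]), 2),
    }


if __name__ == "__main__":
    # Statistics lines of the Vitis vinifera (F6HMV0_VITVI) hit in this report.
    stats = parse_hsp(
        "HSP 1 Score: 663.7 bits (1711), Expect = 2.1e-187",
        "Identity = 377/566 (66.61%), Positives = 449/566 (79.33%), Query Frame = 1",
    )
    print(stats)
```

Recomputing the percentages from the counts, rather than reading the printed values, gives a simple consistency check on a pasted or truncated report.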

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
RPR1B_HUMAN	4.5e-26	32.12	Regulation of nuclear pre-mRNA domain-containing protein 1B OS=Homo sapiens GN=R...
RPR1B_MOUSE	4.5e-26	32.12	Regulation of nuclear pre-mRNA domain-containing protein 1B OS=Mus musculus GN=R...
RPR1A_PONAB	9.4e-24	28.90	Regulation of nuclear pre-mRNA domain-containing protein 1A OS=Pongo abelii GN=R...
RPR1A_HUMAN	9.4e-24	28.90	Regulation of nuclear pre-mRNA domain-containing protein 1A OS=Homo sapiens GN=R...
RPR1A_BOVIN	9.4e-24	28.90	Regulation of nuclear pre-mRNA domain-containing protein 1A OS=Bos taurus GN=RPR...
Match Name	E-value	Identity	Description
A0A061F7B0_THECC	1.8e-191	67.54	ENTH/VHS family protein OS=Theobroma cacao GN=TCM_025762 PE=4 SV=1
V4SRT7_9ROSI	1.0e-189	67.79	Uncharacterized protein OS=Citrus clementina GN=CICLE_v10031118mg PE=4 SV=1
A0A0A0LH72_CUCSA	1.6e-187	86.11	Uncharacterized protein OS=Cucumis sativus GN=Csa_3G815990 PE=4 SV=1
F6HMV0_VITVI	2.1e-187	66.61	Putative uncharacterized protein OS=Vitis vinifera GN=VIT_08s0056g00770 PE=4 SV=...
A0A067L6A1_JATCU	2.9e-181	65.19	Uncharacterized protein OS=Jatropha curcas GN=JCGZ_02639 PE=4 SV=1
Match Name	E-value	Identity	Description
AT5G10060.1	1.6e-77	38.67	ENTH/VHS family protein
AT3G26990.1	6.8e-73	38.23	ENTH/VHS family protein
AT5G65180.1	3.7e-71	44.44	ENTH/VHS family protein
Match Name	E-value	Identity	Description
gi|449437658|ref|XP_004136608.1|	2.9e-259	88.06	PREDICTED: regulation of nuclear pre-mRNA domain-containing protein 1B-like [Cuc...
gi|659084984|ref|XP_008443180.1|	4.7e-257	87.88	PREDICTED: regulation of nuclear pre-mRNA domain-containing protein 1B-like [Cuc...
gi|590640232|ref|XP_007029897.1|	2.6e-191	67.54	ENTH/VHS family protein [Theobroma cacao]
gi|567889575|ref|XP_006437308.1|	1.4e-189	67.79	hypothetical protein CICLE_v10031118mg [Citrus clementina]
gi|645270837|ref|XP_008240637.1|	3.5e-188	65.46	PREDICTED: regulation of nuclear pre-mRNA domain-containing protein 1B-like [Pru...
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR008942	ENTH_VHS
IPR006903	RNA_pol_II-bd
IPR006569	CID_dom
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0008150	biological_process
cellular_component	GO:0005575	cellular_component
molecular_function	GO:0003674	molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG08g13750.1	Cp4.1LG08g13750.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR006569	CID domain	SMART	SM00582	558neu5	coord: 10..131, score: 9.6
IPR006569	CID domain	PROFILE	PS51391	CID	coord: 3..135, score: 47
IPR006903	RNA polymerase II-binding domain	PFAM	PF04818	CTD_bind	coord: 58..119, score: 3.4
IPR008942	ENTH/VHS	GENE3D	G3DSA:1.25.40.90		coord: 12..131, score: 6.0
IPR008942	ENTH/VHS	unknown	SSF48464	ENTH/VHS domain	coord: 2..120, score: 4.71
None	No IPR available	unknown	Coil	Coil	coord: 225..245, score: -; coord: 250..277, scor
None	No IPR available	PANTHER	PTHR12460	CYCLIN-DEPENDENT KINASE INHIBITOR-RELATED PROTEIN	coord: 1..545, score: 1.3E