CSPI05G16750 (gene) Wild cucumber (PI 183967)

Name: CSPI05G16750
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Nuclear receptor corepressor 1
Location: Chr5 : 18091396 .. 18099936 (+)
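The location field gives the chromosome, inclusive start and end coordinates, and strand. As a minimal sketch (the function names `parse_location` and `span_length` are hypothetical and not part of the source database), the genomic span of the feature can be derived from that field, assuming the coordinates are 1-based and inclusive:

```python
import re

def parse_location(text):
    """Parse a location such as 'Chr5 : 18091396 .. 18099936 (+)'.

    Returns (chromosome, start, end, strand). The field layout is taken
    from this record; other records may be formatted differently.
    """
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end, strand = parse_location("Chr5 : 18091396 .. 18099936 (+)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the gene spans 8541 bp on the plus strand of Chr5, introns and UTRs included.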
The following sequences are available for this feature:

Gene sequence (with intron)

TTTCATCCCTTATCGGCTCAAATTCTCACTTTCTCGCTCCGGAGGCAAAATTTTCCGTCCGCCATCAGAGGAAGAGAGAGAGAGAGAGAGAGTTTAGAGATTGAAGGTCGCTGAGGAGATACGAAGGGCGGAGATTGCGAACTCTAACTTAGATTGAGGTTTTGCATGTGATGTCGTAGAGCCTTTCAATCGCCATTGTCGGTTCCTTCTGGTTCTAAACCCTTTCAATTTCAAACCCGTTTCAGTCTCTATTCATCACGTACGCCTGTATAGCTTTTCCTTCTGTTTTTTCTTCCTCGTTTTGGTGGTTTTTCTTCAAGATTTGGTCTGGAGTTACCCGTGTGGTGTGGCTGCTACATATCGGCGTTTTTTCGTTCTTAGACGTCCAGTGAAAGGAGGCAGAACCGATCGATATGATTGTGTTGAGTTGATTTTGTTTTTGGTTTTCTCTTTGGCAATTCCTTTGGCAGGATATGAAAGTTTTGTTACTGAGGATTCTTGCTTGACATGGTTGCGAAAGTAGTTGAATTGATTTCCGGCGGTGCTATTTGGGGCGCTCCTCGTGCTGTGCGTTGAAAACGGTTGTTCTTTGAGGGTGCCCATGCTATTGGCCTCATTTTCATGCCGCCTGAACCTTTGCCCTGGGATCGGAAGGACTTCTTCAAGGAGAGGAAACACGAGAGGTCAGAGTTCCTCGGACCTTTACCCAGATGGAGAGATTCATCCAGTCACGGATCCCGTGAGTTTAGTAGGTGGGGATCAGGTGATTTTCGGAGACCTCCAGGTGAGCAATTCATTCATTTCTGATTTCTTTTTGAATTATTTAGGTTTTATAGAATCAAAAGATATCTACATCTCTTCCCCTGTTGAGTTCTTTATGGAGACTCGTAGTTCCGAGCATATACTACTATTGTTTTGCTTAATTGAAACCCAAGAAAAGTTTTTCTTCTGTTCATTAATTAAATTCTGGATCCTCTCTTTCTGATAGATCTAGTTGAACCAAGGGTTTGTTATATATCCATGATTTTATAAAGTATTGGTGGTTAGATCTGAGGTTTATCGCCTGGATTCGTCCCAACTAGAAATCTTAGTTTTGCTACAGAAATTGCTCGGATTATTTTGTTTCAAATTTTGCCCCATGTTAGATCCTTTGCTTGTATTGACTGTGTTTTCTTTTCCATTTTCCTGTTCTTTTTTGAAATATTTCTGAAGGTCATGGTAGGCAGGGGGGTTGGCATGTGTTCTCTGAGGAATATGGTCACGGTTATGGGCCTTCCATGTCATTCAATAACAAGATGCTTGAGAACGTCAGTAGCCGGCCATCTGTTTCACATGGCGATGGGAAGTATGCTAGGAACGGTAGGGAAAGCAGATCTTTTAGTCAGAGAGATTGGAAAGGCCATTCTTGGGCAACAAGTAATGGGTCTACAAACAATGGTGGTAGAATGCAGCATGATCTAAATTATGATCAGAGGTCAGTTCATGATATGCTGATATATCCCTCTCATTCTCATTCTGACTTTGTAAACCCGAGAGAAAAGGTAAAAGGCCAGCATGATAAGGTCGATGATGTCAATGGGTTAGGCACAAACCAGAGACGTGATCGGGAGTATTCAGTGAGTTCCTCAGGGTGGAAGCCTCTTAAATGGACACGTTCTGGTGGCTTGTCTTCACGAACATCAACATCGGGCCACTCCAGTAGCAAAAAGAGCATTGATGCTTTAGATTCTAATGATAGAAAGTCTGAGACTGTGTCGAAAAATGCATCACAAAATTTTTCTCCTTCAGCCGACCATGCTGAGTGTGCCATGTCTTCTCTCCCTTATGATGACGCAAGTGCCAGGAAGAAGCCGCGGCTAGGATGGGGTGAGGGACTTGCCAAGTATGAGAAAAAAAAGGTCGAAGTTCCTGATGGCAGCACTGCCTTCACAAATATTACTGCAGAATCTACTCATTCTCTGAATTCTAGCTTGATTGAAAAAGGCCCCCGAGGTTCAGGA
TTTGCTGATTGTACCTCACCTGCCACCCCTTCTTCTGTCATTAGTGGTTCCCCTCCAGGTACACAGCTCTTTTTTGCTTTTCTTATGGTGGGGCAAATTTTCACTTCGTATGGTGTTTATCAAGTGATAATATCTCAAATTTTCATGCATTCAACTTATTCCAGTAGAAGAAGATATTAATGATTATATATAGATTTATATTGAATACCAAATAGTCTCAATCAGAGTTAACAAAATTTACAAAAGAAATTCTATAATTATCAAATGCTTTAAATATTTCCATTGTCATAAGAAGCTACAAATTTTAGAAAAGAAAACACGGTAATCACTAATCAGTAACTACATGCTATAATTTAATGAACTCAGCCTATTATTGTGGCATAGGGGGCAATAATGTGTAAGAAAACACGATTGTCTTTATGTTAACTTCAAGAACCAGAACATGGAATCTAGGGGATCAAAAGCAACAGTGGGGAAGGGTAAGTGAGTCCTAGAGGGGGGCGGGAAAAACATTCATTGTACCAGCAGACTCTAGGAGGAGGAGAGGGTTCTTAAAGTTCTCATCTTTTGTAGACGTGGGAATGCCAAATAAGGGTTATGTCAGCTTAAAAGAAAAAAATTTCCTTGTGGTGGTGAAGGGGATCAATGAGTAAGAAGATGCTAATAAACTAAACTATGAAGGGCAGATCAACAAAAGAGTAAATAGAATTAAGCATGTTTGAAAAAATTAGGTATGTGTTGAAGATAGGTAAAACCAGTAGTACAAGATTTTTTAGATAATAATTTGATAGTTGACAAGATCACAAGAAAGGTAAACAATAATTTGATGGTATAGCATTCTTTTAACAGGATGGTTGAGCATAAATAAGAATTGTGATTGGAGGGTCAGAGAGAATCCTAGTGATATGTGGCCCCTCGTTAGATTTTATTTTATTTTATTTTATTTTATTTCAAAGATCTTTTGTAATTATTCTTTAAGACTATTTTGTATGGTTGGAGGCCCTTTCTCTAGTTGGTTTCCTTTTGTGGGCTTGGTTGTTTTGGTTCTTGTATTCTTTCATTTGTTTCCTCAATGAAAGTTGTTATATTATTTACTACAAAGAGGGATGGATTAGTAGGCATATCAGAAGTAGGAGGTTCATGGAGGATATATATATATCGAAAGGGGATAGGTCAACAGTGATTTGATGGATGAAATTTCTGATATCCATTACAATGGTATACATCTTGCGCCAGTGAGTAGTTTTGGAGGGTATCATAAAGTAAAGGGTTTCATTTAGAAAATTTGTTGCTTCTAAATACTGTCGTGTCTTTTGATCACTCCAGATTGTTGTCTTTCTTGCTATTAGGTGAAATGTGCTGCATTTTACTGAAATCAAACTTCTGCATCTGTGTTTCATGTGTAGGTGGGGATGAAAAATCATTTGGAAAGGCATCGAGTGATAATGATGTCAGTAACTTTCATGGTTCACCTGGCTCTTGTTTTCAGAATCAATATGAGGGAACGTCCACTGTAGAGAAGTTGGATAATTTTTCAATAGCTAATTTGTGTTCTCCACTCATTCAACTGTTGCAATCTAATGATTCGATTTCAGTGGATTCCACTGCCTTGAGTAAGCTGCTTATATATAAAAATCAAATTTCTAAAGTGTTGGAGACAACTGAGTCTGAAATTGATTTACTTGAAAATGAGCTGAAGGGATTGAAATCTGAAAGTAAGGGTTACTTTTCTTTCACGCTGGCATCCAGTTCTTTGCTGGTGGGAGATAAATTCTTTGAAGAGCAAAACAATGTCGCTAATGCAGTTGCTACCCTGCCAGTTGTTACTTCTGCAAATACTATTTCAAAAACAATGGCACATTCTACAAGTGACTTGGAAGAAGTGTATGCTGAAAAGGATAGGTCTGGGAGGTTGGATGTGAAGGAATCTGTCATGAAAGAGAAGCTAACAATTTATGGTTGCAGTGTGAAAGAAAATATTGCAGCCTATATAGACAATA
GCATGCCTATAAAGAGTGAAGGTGTCACAGTACATCCTGTTGCTAATGATATGTATGAATGTGCTGAGGGAGGAGATAGTGTGTCTGATTTAATTCTGGCATCCAATAAAGAGTCTGCATGTAAGGCTTCTGAAGCTTTAATCGGGTTGTTGCCTACCAATGAACGTAAGATTGATATTTGGAGTACAAATGCCTGCTCACAGAATCAATGTTTAGTGAAAGAGAGATTTGCAAAGAGGAAACGGTTACTAAGATTTAAGGAGAGAGTAATTACCCTTAAGTTTAAAGCCTACCAGTCTTTGTGGAAGGAGAATTTGCATGTGCCTCCTGTAAGGAAGCTACGAGCTAAATCTCAGAAAAAACATCAGTTGAGTTTGTGGACAAATTACAGTGGCTATCAGAAGAACCGATCTTCCATTCGATACCGTATGCCTTCACCAGGTAAGGAAACTTCTAATCTATGTCCTTAAGTTCAAGAATGTTTTAAACAGGTTTTGCTCTAAACTTGCCTCTTCTTATATCCACGTTAATAACTTAATTTGCCTCAAGTTTTGTATCTGCTTTCCATTCTATGCTATTTGGTCCTCTAACTTCTCTGAGTCATGGGTATTAGTTTGCATATCTCAAGCTTAACTTATATTTTAATTCTTAAACTAAATGTACGTGTGTTAATTTTGGCTTCTCTACCCAAGTATCACCCTATTTGAAGTGCTTGGGTAGTTTGGTGAAATTTCTTCCACCTGACCAATCTTAGTGTTGACCATCCGCATCGTTCCTCATATTTTAGTTTTTAAATACTGAACTTGATATACCAATTAAGTTTATTGTCTTGAGGTACTGTGAATCGGCACTGTGATGTTAAATAAGTTGACTGCACGTTCTTTTTTGTATGGGAAATGACTACTTATTTGGAAGGGAGAGGAAGGTGTGCCGCCTTATAATAATTTAGGGCCCTGTCTGATTGTTTGGTTTTTGAAAATTATGTTTGTTTCCTTACAACCTCTTTGTTATGATTTTCATCTTAATTGAACACTTGAATTCTTTGCCAAATTCCAGAAGCAGAAACAAGTTTTCAAGACAGTTTTTACAAATGGTCCTAAAAAGTAAGAAATTAATGGGTAGAAGTAATATTCATAAGTTTAGTTTTTTTTAAAAAAACATCAATCAAATCATCCATCGGAGCCAACCCCCTTTCTTTGGTGGGAGATTTTGGTGGGCTTGTTTTTTTCTTGGCCCTTTGTATTGTTTGAAAGCAATTGTTTCTATTTCTTTTTATTTAAATTAAGAGCCTTAGTGATGACAGTTTTTTTCTAGAAATATTACGTCTCATTCGAACTCTCGTTTTAGGTAACAACCTCTTTAGTCATTTAGAATTAAATAGGTTCTGGAGCTAAATGTTAATTTTGAGTAATTGAAGAGGCGAAAGTGGAAGTAGTTTGAGGGAATGGTCTTTTGGCATAATAATTACATTCTAGTAATATGTCAAGTATGACGTTCTTAAATAAAGAGATAATTGCTCACTTCAAATATGATTGTCTTTACTCTTTATTAATATAAACTTATGATGGGTTTTTTGAGTATGATATGAACATGATTAATCAAATTAGTTTATGTGATCGAAAGGCATTTTCTGATAGTATATTTCTTTCCTTCAGCATAGGGTTATTAATACAAATGCATCTATGAGCTTTGTTATCTTTTTTAATATTTTGTTCATCATTATATTATACTAGTTTTTTTTTTTTTTTTTTTTTTTCATTTTTCACGTTTGCTATTGTGATATTGTGATATTCGTTTGAGGATTTTCCCCTTTTCATTTTCATCATCACTTTACCTTCATGTAAATATTTTTCCTCTTTAAATCAGGAAGGTGTACGTCTTTGGAGAAGAATCTCCTAATGTGATACAATTGATATAATCATTTTGAATTTTAAAACTTGACTTGCAGCAGGAAATCTGAACCCCGTTTCTAGCACAGAGATACTTAAGCATGTGAGCAT
GCAGCTTTCTACTCCCCAGATTAAGCAGTACCGGAGGACATTGAAGATGCCCGCATTAGTTCTGGACCAGAAGGATAAGATGGGCTCAAGGTTCATCTCTAACAACGGATTAGTTGAGAACCCTTGTGCAGTTGAGAAGGAAAGGGCAATGATTAATCCATGGACCTCAGAAGAGAAAGATGTTTTTATGGAGAAGTTGGAATGTTTTGGGAAAGATTTTGGGAAAATTGCATCCTTTCTTGATCATAAGACAACAGCAGACTGTGTCGAGTTCTACTACAAAAACCACAAGTCTGATTGCTTTGAGAAAACAAAGAAGCTGGAGTTTGGGAAGAAAGTGAAGTCCTCCACCAGTAACTATTTGATGACAACAGGGAAGAAATGGAATCCAGAAACGAATGCCGCTTCTCTTGACATGTTGGGTGCCGCCTCAACGATGACTGCCCGTGCTCATAAGTATTCCAGCAGCAGGTCTGGTGGAAGAACTTCATATCACATAACTCAATTCGATGATGGTCTATCAGAAAGGGCCAAAGGTCTTAATGGTTTTGGAAATGAAAGAGAAAAGGTGGCTGCTGATGTTTTGGCTGGTATATGTGGTTCTCTTTCTTCAGAAGCCATGGGTTCGTGTGTCACTAGTAATTTCAACCGAGGAGACAGTTCTCAGAATTTGAAGTGCAAAAAGGGTGTTACAACCGTATTAAGACAGCGTATGACAACCAATGTTCCACGGTATGTTGACAATGAGATTTTTTCTGACGAGAGTTGTGGAGAAATGGGTCCTTCCTATTGGACGGATGGGGAGAAGTCTCTTTTCATAGAAGCGGTGTCAGTTTATGGGAAGAATTTCTCTGTGATCTCTACCCATGTAGGATCAAAATCCACGGACCAATGCAAGGTCTTCTTTAGCAAAGCACGGAAGTGCCTCGGGTTGGATTTAATATGTTCTGCAAAGAAAATGCCAGATAATGGAAATGGGCATGATGCTGATAGAAGCAATGGTGAAGGAGGTGTAGATACCAAAGATGCCTTTCCTTGTGAAATGGTTGGCTCGCGGGTGGTTGATGACTTGCCAAAGGCTGTAATGAGTATAAGTGGTGGTGAATCGGAATCCATGAATCTGCAATCTACCCATCAGGAAGTCAATCCATCCTCAAAGACTTGTAGTAATGCTGCTGTGGATGCTATGGTGTCAGATGATGAATGTACTAGGAAGGATGGCTCTCAATCGGGTTTTGATGACGACTGCCAGTCAGTGAATTCTGCCAATGATAAGAATGGTTTGATACATGAGCAGCAACATGTAGTCATATCTGATGAAACTGCAAAAGAACAAGACATTTCTGTTTTGGTTGCAACATCAGTTGGAAATGTTTCAGATACTGAAACCAAGAGAGGAAATGTCGATGCTAGCACAGCTCGAGGTGATAAAGCTGATTCCCATGCGACAGATTGCCCTTCAATACCCTCAAACTCTCACATAACATCATCGGCTAAGGAGGAACAAGGGCGTCATCATGTTAGAGTGCATTCACGTAGTTTGTCTGATTCTGAGCAATCGTCTAGAAATGGCGACATAAAATTATTTGGTCAAATTCTTACACATTCCTCATTTGTGCCGAGTTCAAAATCTGGATCCAGTGTGAACGGAATCAAGACGACCGAGCCTCACCACAAGTTCAAGCGTAGATTGAAAGTAAACAGCCATGGGAATCTAAGTACAGCCAAGTTCAATTGTAAGAACTCTCCAGGCCAAGAGGAGAATACTCCCTCAAGAAGTTATGGAATTTGGGATGGCAACCAAATACGCACTGGGCTTTCGTCATTGCCTGATCCCACCACCCTATTATCCAGATATCCTACATTCAATCATCTCTCTAAGCCAGCCTCCTCCCCGACCGAGCAGTCACCATCTGGTTGCAAGGAAGAGACATCAAACTCAAACAAGGAAACCCAGAAGAGGGAGGTAAATAATAGTAGGAAGGAGGAAGTAGTTGGAG
AAATGAATGTAGAAGAGAGTTGTTGTAATGAGGGCGGTGGTGGTGGTGGGTCATAATAGAATATGAGAGAGAACTTAGGTTTGGTTAGTAGATAAAATTATATAATGCTTCAGAGAGAGCCATTTCAATAGAGAGAGAGGCTTTCTCAGCAGCAAATTCTTTGTTTTTGCTGCAACCCCCCATGTATCATAGTTTCTTTCATGTTGATTAATCAATAGCAGAATTTGACGGCTCTTATTGCCGTCTTCTAGGAAAACAAAACAAAAAAAAAAAAATTATATTCAATCAGTTGCCCATTGCAGAGTTAACACGTTACATTTGAAGGACACATTTCTGTAGAAATGTCCATAGAATTTTTGTTATTATAAATTTAGACAGTACTTAGAACTTAATTAGTTCATGGATGATAAGATGGTTTGATAAAACCCCTCTCTTGCTTTGTTTAAATTGAACCTTTTTCGCCTTTCTAAAGGTCGAAACAAGTTTCAATTACTTTAATTTTTTATAAAAGAGATTTTGTTATTGTGTTTTCAGTCACCTC

mRNA sequence

ATGCCGCCTGAACCTTTGCCCTGGGATCGGAAGGACTTCTTCAAGGAGAGGAAACACGAGAGGTCAGAGTTCCTCGGACCTTTACCCAGATGGAGAGATTCATCCAGTCACGGATCCCGTGAGTTTAGTAGGTGGGGATCAGGTGATTTTCGGAGACCTCCAGGTCATGGTAGGCAGGGGGGTTGGCATGTGTTCTCTGAGGAATATGGTCACGGTTATGGGCCTTCCATGTCATTCAATAACAAGATGCTTGAGAACGTCAGTAGCCGGCCATCTGTTTCACATGGCGATGGGAAGTATGCTAGGAACGGTAGGGAAAGCAGATCTTTTAGTCAGAGAGATTGGAAAGGCCATTCTTGGGCAACAAGTAATGGGTCTACAAACAATGGTGGTAGAATGCAGCATGATCTAAATTATGATCAGAGGTCAGTTCATGATATGCTGATATATCCCTCTCATTCTCATTCTGACTTTGTAAACCCGAGAGAAAAGGTAAAAGGCCAGCATGATAAGGTCGATGATGTCAATGGGTTAGGCACAAACCAGAGACGTGATCGGGAGTATTCAGTGAGTTCCTCAGGGTGGAAGCCTCTTAAATGGACACGTTCTGGTGGCTTGTCTTCACGAACATCAACATCGGGCCACTCCAGTAGCAAAAAGAGCATTGATGCTTTAGATTCTAATGATAGAAAGTCTGAGACTGTGTCGAAAAATGCATCACAAAATTTTTCTCCTTCAGCCGACCATGCTGAGTGTGCCATGTCTTCTCTCCCTTATGATGACGCAAGTGCCAGGAAGAAGCCGCGGCTAGGATGGGGTGAGGGACTTGCCAAGTATGAGAAAAAAAAGGTCGAAGTTCCTGATGGCAGCACTGCCTTCACAAATATTACTGCAGAATCTACTCATTCTCTGAATTCTAGCTTGATTGAAAAAGGCCCCCGAGGTTCAGGATTTGCTGATTGTACCTCACCTGCCACCCCTTCTTCTGTCATTAGTGGTTCCCCTCCAGGTGGGGATGAAAAATCATTTGGAAAGGCATCGAGTGATAATGATGTCAGTAACTTTCATGGTTCACCTGGCTCTTGTTTTCAGAATCAATATGAGGGAACGTCCACTGTAGAGAAGTTGGATAATTTTTCAATAGCTAATTTGTGTTCTCCACTCATTCAACTGTTGCAATCTAATGATTCGATTTCAGTGGATTCCACTGCCTTGAGTAAGCTGCTTATATATAAAAATCAAATTTCTAAAGTGTTGGAGACAACTGAGTCTGAAATTGATTTACTTGAAAATGAGCTGAAGGGATTGAAATCTGAAAGTAAGGGTTACTTTTCTTTCACGCTGGCATCCAGTTCTTTGCTGGTGGGAGATAAATTCTTTGAAGAGCAAAACAATGTCGCTAATGCAGTTGCTACCCTGCCAGTTGTTACTTCTGCAAATACTATTTCAAAAACAATGGCACATTCTACAAGTGACTTGGAAGAAGTGTATGCTGAAAAGGATAGGTCTGGGAGGTTGGATGTGAAGGAATCTGTCATGAAAGAGAAGCTAACAATTTATGGTTGCAGTGTGAAAGAAAATATTGCAGCCTATATAGACAATAGCATGCCTATAAAGAGTGAAGGTGTCACAGTACATCCTGTTGCTAATGATATGTATGAATGTGCTGAGGGAGGAGATAGTGTGTCTGATTTAATTCTGGCATCCAATAAAGAGTCTGCATGTAAGGCTTCTGAAGCTTTAATCGGGTTGTTGCCTACCAATGAACGTAAGATTGATATTTGGAGTACAAATGCCTGCTCACAGAATCAATGTTTAGTGAAAGAGAGATTTGCAAAGAGGAAACGGTTACTAAGATTTAAGGAGAGAGTAATTACCCTTAAGTTTAAAGCCTACCAGTCTTTGTGGAAGGAGAATTTGCATGTGCCTCCTGTAAGGAAGCTACGAGCTAAATCTCAGAAAAAACATCAGTTGAGTTTGTGGACAAATTACAGTGGCTA
TCAGAAGAACCGATCTTCCATTCGATACCGTATGCCTTCACCAGCAGGAAATCTGAACCCCGTTTCTAGCACAGAGATACTTAAGCATGTGAGCATGCAGCTTTCTACTCCCCAGATTAAGCAGTACCGGAGGACATTGAAGATGCCCGCATTAGTTCTGGACCAGAAGGATAAGATGGGCTCAAGGTTCATCTCTAACAACGGATTAGTTGAGAACCCTTGTGCAGTTGAGAAGGAAAGGGCAATGATTAATCCATGGACCTCAGAAGAGAAAGATGTTTTTATGGAGAAGTTGGAATGTTTTGGGAAAGATTTTGGGAAAATTGCATCCTTTCTTGATCATAAGACAACAGCAGACTGTGTCGAGTTCTACTACAAAAACCACAAGTCTGATTGCTTTGAGAAAACAAAGAAGCTGGAGTTTGGGAAGAAAGTGAAGTCCTCCACCAGTAACTATTTGATGACAACAGGGAAGAAATGGAATCCAGAAACGAATGCCGCTTCTCTTGACATGTTGGGTGCCGCCTCAACGATGACTGCCCGTGCTCATAAGTATTCCAGCAGCAGGTCTGGTGGAAGAACTTCATATCACATAACTCAATTCGATGATGGTCTATCAGAAAGGGCCAAAGGTCTTAATGGTTTTGGAAATGAAAGAGAAAAGGTGGCTGCTGATGTTTTGGCTGGTATATGTGGTTCTCTTTCTTCAGAAGCCATGGGTTCGTGTGTCACTAGTAATTTCAACCGAGGAGACAGTTCTCAGAATTTGAAGTGCAAAAAGGGTGTTACAACCGTATTAAGACAGCGTATGACAACCAATGTTCCACGGTATGTTGACAATGAGATTTTTTCTGACGAGAGTTGTGGAGAAATGGGTCCTTCCTATTGGACGGATGGGGAGAAGTCTCTTTTCATAGAAGCGGTGTCAGTTTATGGGAAGAATTTCTCTGTGATCTCTACCCATGTAGGATCAAAATCCACGGACCAATGCAAGGTCTTCTTTAGCAAAGCACGGAAGTGCCTCGGGTTGGATTTAATATGTTCTGCAAAGAAAATGCCAGATAATGGAAATGGGCATGATGCTGATAGAAGCAATGGTGAAGGAGGTGTAGATACCAAAGATGCCTTTCCTTGTGAAATGGTTGGCTCGCGGGTGGTTGATGACTTGCCAAAGGCTGTAATGAGTATAAGTGGTGGTGAATCGGAATCCATGAATCTGCAATCTACCCATCAGGAAGTCAATCCATCCTCAAAGACTTGTAGTAATGCTGCTGTGGATGCTATGGTGTCAGATGATGAATGTACTAGGAAGGATGGCTCTCAATCGGGTTTTGATGACGACTGCCAGTCAGTGAATTCTGCCAATGATAAGAATGGTTTGATACATGAGCAGCAACATGTAGTCATATCTGATGAAACTGCAAAAGAACAAGACATTTCTGTTTTGGTTGCAACATCAGTTGGAAATGTTTCAGATACTGAAACCAAGAGAGGAAATGTCGATGCTAGCACAGCTCGAGGTGATAAAGCTGATTCCCATGCGACAGATTGCCCTTCAATACCCTCAAACTCTCACATAACATCATCGGCTAAGGAGGAACAAGGGCGTCATCATGTTAGAGTGCATTCACGTAGTTTGTCTGATTCTGAGCAATCGTCTAGAAATGGCGACATAAAATTATTTGGTCAAATTCTTACACATTCCTCATTTGTGCCGAGTTCAAAATCTGGATCCAGTGTGAACGGAATCAAGACGACCGAGCCTCACCACAAGTTCAAGCGTAGATTGAAAGTAAACAGCCATGGGAATCTAAGTACAGCCAAGTTCAATTGTAAGAACTCTCCAGGCCAAGAGGAGAATACTCCCTCAAGAAGTTATGGAATTTGGGATGGCAACCAAATACGCACTGGGCTTTCGTCATTGCCTGATCCCACCACCCTATTATCCAGATATCCTACATTCAATCATCTCTCTAAGCCAGCCTCCTCCCCGACCGAGC
AGTCACCATCTGGTTGCAAGGAAGAGACATCAAACTCAAACAAGGAAACCCAGAAGAGGGAGGTAAATAATAGTAGGAAGGAGGAAGTAGTTGGAGAAATGAATGTAGAAGAGAGTTGTTGTAATGAGGGCGGTGGTGGTGGTGGGTCATAA

Coding sequence (CDS)

ATGCCGCCTGAACCTTTGCCCTGGGATCGGAAGGACTTCTTCAAGGAGAGGAAACACGAGAGGTCAGAGTTCCTCGGACCTTTACCCAGATGGAGAGATTCATCCAGTCACGGATCCCGTGAGTTTAGTAGGTGGGGATCAGGTGATTTTCGGAGACCTCCAGGTCATGGTAGGCAGGGGGGTTGGCATGTGTTCTCTGAGGAATATGGTCACGGTTATGGGCCTTCCATGTCATTCAATAACAAGATGCTTGAGAACGTCAGTAGCCGGCCATCTGTTTCACATGGCGATGGGAAGTATGCTAGGAACGGTAGGGAAAGCAGATCTTTTAGTCAGAGAGATTGGAAAGGCCATTCTTGGGCAACAAGTAATGGGTCTACAAACAATGGTGGTAGAATGCAGCATGATCTAAATTATGATCAGAGGTCAGTTCATGATATGCTGATATATCCCTCTCATTCTCATTCTGACTTTGTAAACCCGAGAGAAAAGGTAAAAGGCCAGCATGATAAGGTCGATGATGTCAATGGGTTAGGCACAAACCAGAGACGTGATCGGGAGTATTCAGTGAGTTCCTCAGGGTGGAAGCCTCTTAAATGGACACGTTCTGGTGGCTTGTCTTCACGAACATCAACATCGGGCCACTCCAGTAGCAAAAAGAGCATTGATGCTTTAGATTCTAATGATAGAAAGTCTGAGACTGTGTCGAAAAATGCATCACAAAATTTTTCTCCTTCAGCCGACCATGCTGAGTGTGCCATGTCTTCTCTCCCTTATGATGACGCAAGTGCCAGGAAGAAGCCGCGGCTAGGATGGGGTGAGGGACTTGCCAAGTATGAGAAAAAAAAGGTCGAAGTTCCTGATGGCAGCACTGCCTTCACAAATATTACTGCAGAATCTACTCATTCTCTGAATTCTAGCTTGATTGAAAAAGGCCCCCGAGGTTCAGGATTTGCTGATTGTACCTCACCTGCCACCCCTTCTTCTGTCATTAGTGGTTCCCCTCCAGGTGGGGATGAAAAATCATTTGGAAAGGCATCGAGTGATAATGATGTCAGTAACTTTCATGGTTCACCTGGCTCTTGTTTTCAGAATCAATATGAGGGAACGTCCACTGTAGAGAAGTTGGATAATTTTTCAATAGCTAATTTGTGTTCTCCACTCATTCAACTGTTGCAATCTAATGATTCGATTTCAGTGGATTCCACTGCCTTGAGTAAGCTGCTTATATATAAAAATCAAATTTCTAAAGTGTTGGAGACAACTGAGTCTGAAATTGATTTACTTGAAAATGAGCTGAAGGGATTGAAATCTGAAAGTAAGGGTTACTTTTCTTTCACGCTGGCATCCAGTTCTTTGCTGGTGGGAGATAAATTCTTTGAAGAGCAAAACAATGTCGCTAATGCAGTTGCTACCCTGCCAGTTGTTACTTCTGCAAATACTATTTCAAAAACAATGGCACATTCTACAAGTGACTTGGAAGAAGTGTATGCTGAAAAGGATAGGTCTGGGAGGTTGGATGTGAAGGAATCTGTCATGAAAGAGAAGCTAACAATTTATGGTTGCAGTGTGAAAGAAAATATTGCAGCCTATATAGACAATAGCATGCCTATAAAGAGTGAAGGTGTCACAGTACATCCTGTTGCTAATGATATGTATGAATGTGCTGAGGGAGGAGATAGTGTGTCTGATTTAATTCTGGCATCCAATAAAGAGTCTGCATGTAAGGCTTCTGAAGCTTTAATCGGGTTGTTGCCTACCAATGAACGTAAGATTGATATTTGGAGTACAAATGCCTGCTCACAGAATCAATGTTTAGTGAAAGAGAGATTTGCAAAGAGGAAACGGTTACTAAGATTTAAGGAGAGAGTAATTACCCTTAAGTTTAAAGCCTACCAGTCTTTGTGGAAGGAGAATTTGCATGTGCCTCCTGTAAGGAAGCTACGAGCTAAATCTCAGAAAAAACATCAGTTGAGTTTGTGGACAAATTACAGTGGCTA
TCAGAAGAACCGATCTTCCATTCGATACCGTATGCCTTCACCAGCAGGAAATCTGAACCCCGTTTCTAGCACAGAGATACTTAAGCATGTGAGCATGCAGCTTTCTACTCCCCAGATTAAGCAGTACCGGAGGACATTGAAGATGCCCGCATTAGTTCTGGACCAGAAGGATAAGATGGGCTCAAGGTTCATCTCTAACAACGGATTAGTTGAGAACCCTTGTGCAGTTGAGAAGGAAAGGGCAATGATTAATCCATGGACCTCAGAAGAGAAAGATGTTTTTATGGAGAAGTTGGAATGTTTTGGGAAAGATTTTGGGAAAATTGCATCCTTTCTTGATCATAAGACAACAGCAGACTGTGTCGAGTTCTACTACAAAAACCACAAGTCTGATTGCTTTGAGAAAACAAAGAAGCTGGAGTTTGGGAAGAAAGTGAAGTCCTCCACCAGTAACTATTTGATGACAACAGGGAAGAAATGGAATCCAGAAACGAATGCCGCTTCTCTTGACATGTTGGGTGCCGCCTCAACGATGACTGCCCGTGCTCATAAGTATTCCAGCAGCAGGTCTGGTGGAAGAACTTCATATCACATAACTCAATTCGATGATGGTCTATCAGAAAGGGCCAAAGGTCTTAATGGTTTTGGAAATGAAAGAGAAAAGGTGGCTGCTGATGTTTTGGCTGGTATATGTGGTTCTCTTTCTTCAGAAGCCATGGGTTCGTGTGTCACTAGTAATTTCAACCGAGGAGACAGTTCTCAGAATTTGAAGTGCAAAAAGGGTGTTACAACCGTATTAAGACAGCGTATGACAACCAATGTTCCACGGTATGTTGACAATGAGATTTTTTCTGACGAGAGTTGTGGAGAAATGGGTCCTTCCTATTGGACGGATGGGGAGAAGTCTCTTTTCATAGAAGCGGTGTCAGTTTATGGGAAGAATTTCTCTGTGATCTCTACCCATGTAGGATCAAAATCCACGGACCAATGCAAGGTCTTCTTTAGCAAAGCACGGAAGTGCCTCGGGTTGGATTTAATATGTTCTGCAAAGAAAATGCCAGATAATGGAAATGGGCATGATGCTGATAGAAGCAATGGTGAAGGAGGTGTAGATACCAAAGATGCCTTTCCTTGTGAAATGGTTGGCTCGCGGGTGGTTGATGACTTGCCAAAGGCTGTAATGAGTATAAGTGGTGGTGAATCGGAATCCATGAATCTGCAATCTACCCATCAGGAAGTCAATCCATCCTCAAAGACTTGTAGTAATGCTGCTGTGGATGCTATGGTGTCAGATGATGAATGTACTAGGAAGGATGGCTCTCAATCGGGTTTTGATGACGACTGCCAGTCAGTGAATTCTGCCAATGATAAGAATGGTTTGATACATGAGCAGCAACATGTAGTCATATCTGATGAAACTGCAAAAGAACAAGACATTTCTGTTTTGGTTGCAACATCAGTTGGAAATGTTTCAGATACTGAAACCAAGAGAGGAAATGTCGATGCTAGCACAGCTCGAGGTGATAAAGCTGATTCCCATGCGACAGATTGCCCTTCAATACCCTCAAACTCTCACATAACATCATCGGCTAAGGAGGAACAAGGGCGTCATCATGTTAGAGTGCATTCACGTAGTTTGTCTGATTCTGAGCAATCGTCTAGAAATGGCGACATAAAATTATTTGGTCAAATTCTTACACATTCCTCATTTGTGCCGAGTTCAAAATCTGGATCCAGTGTGAACGGAATCAAGACGACCGAGCCTCACCACAAGTTCAAGCGTAGATTGAAAGTAAACAGCCATGGGAATCTAAGTACAGCCAAGTTCAATTGTAAGAACTCTCCAGGCCAAGAGGAGAATACTCCCTCAAGAAGTTATGGAATTTGGGATGGCAACCAAATACGCACTGGGCTTTCGTCATTGCCTGATCCCACCACCCTATTATCCAGATATCCTACATTCAATCATCTCTCTAAGCCAGCCTCCTCCCCGACCGAGC
AGTCACCATCTGGTTGCAAGGAAGAGACATCAAACTCAAACAAGGAAACCCAGAAGAGGGAGGTAAATAATAGTAGGAAGGAGGAAGTAGTTGGAGAAATGAATGTAGAAGAGAGTTGTTGTAATGAGGGCGGTGGTGGTGGTGGGTCATAA
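The protein used as the BLAST query below is the translation of this CDS. A minimal sketch of that step, assuming the standard genetic code and using a hypothetical helper `translate` (not the database's own pipeline), applied to the first 30 and last 9 nucleotides of the CDS above:

```python
from itertools import product

# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

head = "ATGCCGCCTGAACCTTTGCCCTGGGATCGG"  # first 30 nt of the CDS
tail = "GGGTCATAA"                       # last 9 nt of the CDS
print(translate(head))
print(translate(tail))
```

The first ten residues (`MPPEPLPWDR`) agree with the start of the query protein in the alignments that follow, and the final codon `TAA` is a stop.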
BLAST of CSPI05G16750 vs. Swiss-Prot
Match: NCOR2_MOUSE (Nuclear receptor corepressor 2 OS=Mus musculus GN=Ncor2 PE=1 SV=3)

HSP 1 Score: 88.6 bits (218), Expect = 5.9e-16
Identity = 80/314 (25.48%), Positives = 137/314 (43.63%), Query Frame = 1

Query: 707  KQYRRTLKMPALVLDQKDKMGSRFISNNGLVENPCAVEKERAMINPWTSEEKDVFMEKLE 766
            KQ R+   +P ++ D  D+   +FI+ NGL+++P  V K+R + N W+ +E+D F EK  
Sbjct: 387  KQMRQLAVIPPMLYDA-DQQRIKFINMNGLMDDPMKVYKDRQVTNMWSEQERDTFREKFM 446

Query: 767  CFGKDFGKIASFLDHKTTADCVEFYYKNHKSDCFEKTKKLEFGKKVKSSTSNYLMTTGKK 826
               K+FG IASFL+ KT A+CV +YY   K++ ++   +  + ++ KS            
Sbjct: 447  QHPKNFGLIASFLERKTVAECVLYYYLTKKNENYKSLVRRSYRRRGKSQQQ--------- 506

Query: 827  WNPETNAASLDMLGAASTMTARAHKYSSSRSGGRTSYHITQFDDGLSERAKGLNGFGNER 886
                                 +  +   +RS         +  +   E  K      NE+
Sbjct: 507  ----------------QQQQQQQQQQQMARSSQEEKEEKEKEKEADKEEEK--QDAENEK 566

Query: 887  EKVAADVLAGICGSLSSEAMGSCVTSNFNRGDSSQNLKCKKGVTTVLRQRMTTNVPRYVD 946
            E+++ +      G  + E     V S   +  +SQ  + K  +T  +      N      
Sbjct: 567  EELSKEKTDDTSGEDNDEK--EAVASKGRKTANSQGRR-KGRITRSMANE--ANHEETAT 626

Query: 947  NEIFSDESCGEMG-PSYWTDGEKSLFIEAVSVYGKNFSVISTHVGSKSTDQCKVFFSKAR 1006
             +  S+ +  EM   S WT+ E     + +  +G+N+S I+  VGSK+  QCK F+   +
Sbjct: 627  PQQSSELASMEMNESSRWTEEEMETAKKGLLEHGRNWSAIARMVGSKTVSQCKNFYFNYK 667

Query: 1007 KCLGLDLICSAKKM 1020
            K   LD I    K+
Sbjct: 687  KRQNLDEILQQHKL 667
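The percentages on each HSP summary line are the match counts divided by the alignment length. A minimal sketch that recomputes them from the counts, assuming the `label = n/m (p%)` layout seen in this record (the helper name `hsp_fractions` is hypothetical):

```python
import re

def hsp_fractions(line):
    """Return [(label, matches, aligned_length, percent), ...] for every
    'label = n/m' pair on an HSP summary line."""
    out = []
    for label, n, m in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", line):
        n, m = int(n), int(m)
        out.append((label, n, m, round(100 * n / m, 2)))
    return out

line = "Identity = 80/314 (25.48%), Positives = 137/314 (43.63%), Query Frame = 1"
for label, n, m, pct in hsp_fractions(line):
    print(f"{label}: {n}/{m} = {pct}%")
```

For the NCOR2_MOUSE hit above this reproduces 25.48% identical and 43.63% positive positions over 314 aligned columns.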

BLAST of CSPI05G16750 vs. Swiss-Prot
Match: NCOR2_HUMAN (Nuclear receptor corepressor 2 OS=Homo sapiens GN=NCOR2 PE=1 SV=2)

HSP 1 Score: 87.4 bits (215), Expect = 1.3e-15
Identity = 95/414 (22.95%), Positives = 168/414 (40.58%), Query Frame = 1

Query: 707  KQYRRTLKMPALVLDQKDKMGSRFISNNGLVENPCAVEKERAMINPWTSEEKDVFMEKLE 766
            KQ R+   +P ++ D  D+   +FI+ NGL+ +P  V K+R ++N W+ +EK+ F EK  
Sbjct: 387  KQMRQLAVIPPMLYDA-DQQRIKFINMNGLMADPMKVYKDRQVMNMWSEQEKETFREKFM 446

Query: 767  CFGKDFGKIASFLDHKTTADCVEFYYKNHKSDCFEKTKKLEFGKKVKSSTSNYLMTTGKK 826
               K+FG IASFL+ KT A+CV +YY   K++ ++   +  + ++ KS          ++
Sbjct: 447  QHPKNFGLIASFLERKTVAECVLYYYLTKKNENYKSLVRRSYRRRGKSQQQQQ-----QQ 506

Query: 827  WNPETNAASLDMLGAASTMTARAHKYSSSRSGGRTSYHITQFDDGLSERAKGLNGFGNER 886
               +       M  ++        K   +             +D L E+    +G  N+ 
Sbjct: 507  QQQQQQQQQQPMPRSSQEEKDEKEKEKEAEKEEEKPEVENDKEDLLKEKTDDTSGEDNDE 566

Query: 887  EKVAADVLAGICGSLSSEAMGSCVTSNFNRGDSSQNLKCKKGVTTVLRQRMTTNVPRYVD 946
            ++  A           S+   +  +    +G  ++++  +      +  + +  +     
Sbjct: 567  KEAVA-----------SKGRKTANSQGRRKGRITRSMANEANSEEAITPQQSAELASMEL 626

Query: 947  NEIFSDESCGEMGPSYWTDGEKSLFIEAVSVYGKNFSVISTHVGSKSTDQCKVFFSKARK 1006
            NE            S WT+ E     + +  +G+N+S I+  VGSK+  QCK F+   +K
Sbjct: 627  NE-----------SSRWTEEEMETAKKGLLEHGRNWSAIARMVGSKTVSQCKNFYFNYKK 686

Query: 1007 CLGLDLICSAKKMP-DNGNGHDADRSNGEGGVDTKDAFPCEMVGSRVVDDLPKAVMSISG 1066
               LD I    K+  +        +         + AFP       VV+D       +SG
Sbjct: 687  RQNLDEILQQHKLKMEKERNARRKKKKAPAAASEEAAFP------PVVEDEEMEASGVSG 746

Query: 1067 GESESM-NLQSTHQEVNPSSK-TCSNAAVDAMVSDDEC-------TRKDGSQSG 1111
             E E +   ++ H   N   +  CS  A     SD E          KD  Q+G
Sbjct: 747  NEEEMVEEAEALHASGNEVPRGECSGPATVNNSSDTESIPSPHTEAAKDTGQNG 766

BLAST of CSPI05G16750 vs. Swiss-Prot
Match: NCOR1_XENTR (Nuclear receptor corepressor 1 OS=Xenopus tropicalis GN=ncor1 PE=2 SV=1)

HSP 1 Score: 79.0 bits (193), Expect = 4.7e-13
Identity = 68/269 (25.28%), Positives = 128/269 (47.58%), Query Frame = 1

Query: 561 SVSDLILASNKESACKASEALIGLLPTNERKI-------DIWSTNACSQNQCLVKER--- 620
           S+  +I   N++ A +A + L GL P  E  +        ++  N    NQ + K+    
Sbjct: 226 SIVQIIYDENRKKAEEAHKILEGLGPKVELPLYNQPSDTKVYHENI-KTNQVMRKKLILF 285

Query: 621 FAKRKRLLRFKERVITLKFKAYQSLWK---ENLHVPPVRKLRAKSQKKHQLSLWTNYSGY 680
           F +R    + +E+ I  ++      W+   + +   P RK +    +++    +      
Sbjct: 286 FKRRNHARKLREQNICQRYDQLMEAWEKKVDRIENNPRRKAKESKTREYYEKQFPEIRKQ 345

Query: 681 QKNRSSIRYRMPSPAGNLNPVSSTE-----ILKHVSMQLSTPQIKQYRRTLKMPALVLDQ 740
           ++ +   +      AG    ++ +E     I+  +S Q +    KQ R+   +P ++ D 
Sbjct: 346 REQQERFQRVGQRGAGLSATIARSEHEISEIIDGLSEQENNE--KQMRQLSVIPPMMFDA 405

Query: 741 KDKMGSRFISNNGLVENPCAVEKERAMINPWTSEEKDVFMEKLECFGKDFGKIASFLDHK 800
           + +   +FI+ NGL+E+P  V K+R  +N WT  EK++F EK     K+FG IAS+L+ K
Sbjct: 406 EQRR-VKFINMNGLMEDPMKVYKDRQFMNVWTDHEKEIFKEKFVQHPKNFGLIASYLERK 465

Query: 801 TTADCVEFYYKNHKSDCFEKTKKLEFGKK 812
           T +DCV +YY   K++ F+   +  + K+
Sbjct: 466 TVSDCVLYYYLTKKNENFKALVRRNYPKR 490

BLAST of CSPI05G16750 vs. Swiss-Prot
Match: NCOR1_MOUSE (Nuclear receptor corepressor 1 OS=Mus musculus GN=Ncor1 PE=1 SV=1)

HSP 1 Score: 75.1 bits (183), Expect = 6.7e-12
Identity = 63/268 (23.51%), Positives = 127/268 (47.39%), Query Frame = 1

Query: 561 SVSDLILASNKESACKASEALIGLLPTNERKIDIWSTNACSQNQCLVKERFAKRKRLLRF 620
           S+  +I   N++ A +A +   GL P  E  +    ++    ++ +   +  ++K +L F
Sbjct: 234 SIVQIIYDENRKKAEEAHKIFEGLGPKVELPLYNQPSDTKVYHENIKTNQVMRKKLILFF 293

Query: 621 K---------ERVITLKFKAYQSLWK---ENLHVPPVRKLRAKSQKKHQLSLWTNYSGYQ 680
           K         E+ I  ++      W+   + +   P RK +    +++    +      +
Sbjct: 294 KRRNHARKQREQKICQRYDQLMEAWEKKVDRIENNPRRKAKESKTREYYEKQFPEIRKQR 353

Query: 681 KNRSSIRYRMPSPAGNLNPVSSTE-----ILKHVSMQLSTPQIKQYRRTLKMPALVLDQK 740
           + +   +      AG    ++ +E     I+  +S Q +    KQ R+   +P ++ D +
Sbjct: 354 EQQERFQRVGQRGAGLSATIARSEHEISEIIDGLSEQENNE--KQMRQLSVIPPMMFDAE 413

Query: 741 DKMGSRFISNNGLVENPCAVEKERAMINPWTSEEKDVFMEKLECFGKDFGKIASFLDHKT 800
            +   +FI+ NGL+E+P  V K+R  +N WT  EK++F +K     K+FG IAS+L+ K+
Sbjct: 414 QRR-VKFINMNGLMEDPMKVYKDRQFMNVWTDHEKEIFKDKFIQHPKNFGLIASYLERKS 473

Query: 801 TADCVEFYYKNHKSDCFEKTKKLEFGKK 812
             DCV +YY   K++ ++   +  +GK+
Sbjct: 474 VPDCVLYYYLTKKNENYKALVRRNYGKR 498

BLAST of CSPI05G16750 vs. Swiss-Prot
Match: NCOR1_HUMAN (Nuclear receptor corepressor 1 OS=Homo sapiens GN=NCOR1 PE=1 SV=2)

HSP 1 Score: 75.1 bits (183), Expect = 6.7e-12
Identity = 63/268 (23.51%), Positives = 127/268 (47.39%), Query Frame = 1

Query: 561 SVSDLILASNKESACKASEALIGLLPTNERKIDIWSTNACSQNQCLVKERFAKRKRLLRF 620
           S+  +I   N++ A +A +   GL P  E  +    ++    ++ +   +  ++K +L F
Sbjct: 234 SIVQIIYDENRKKAEEAHKIFEGLGPKVELPLYNQPSDTKVYHENIKTNQVMRKKLILFF 293

Query: 621 K---------ERVITLKFKAYQSLWK---ENLHVPPVRKLRAKSQKKHQLSLWTNYSGYQ 680
           K         E+ I  ++      W+   + +   P RK +    +++    +      +
Sbjct: 294 KRRNHARKQREQKICQRYDQLMEAWEKKVDRIENNPRRKAKESKTREYYEKQFPEIRKQR 353

Query: 681 KNRSSIRYRMPSPAGNLNPVSSTE-----ILKHVSMQLSTPQIKQYRRTLKMPALVLDQK 740
           + +   +      AG    ++ +E     I+  +S Q +    KQ R+   +P ++ D +
Sbjct: 354 EQQERFQRVGQRGAGLSATIARSEHEISEIIDGLSEQENNE--KQMRQLSVIPPMMFDAE 413

Query: 741 DKMGSRFISNNGLVENPCAVEKERAMINPWTSEEKDVFMEKLECFGKDFGKIASFLDHKT 800
            +   +FI+ NGL+E+P  V K+R  +N WT  EK++F +K     K+FG IAS+L+ K+
Sbjct: 414 QRR-VKFINMNGLMEDPMKVYKDRQFMNVWTDHEKEIFKDKFIQHPKNFGLIASYLERKS 473

Query: 801 TADCVEFYYKNHKSDCFEKTKKLEFGKK 812
             DCV +YY   K++ ++   +  +GK+
Sbjct: 474 VPDCVLYYYLTKKNENYKALVRRNYGKR 498

BLAST of CSPI05G16750 vs. TrEMBL
Match: A0A0A0KU04_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G492330 PE=4 SV=1)

HSP 1 Score: 2695.6 bits (6986), Expect = 0.0e+00
Identity = 1376/1383 (99.49%), Positives = 1381/1383 (99.86%), Query Frame = 1

Query: 1    MPPEPLPWDRKDFFKERKHERSEFLGPLPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQG 60
            MPPEPLPWDRKDFFKERKHERSEFLGPLPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQG
Sbjct: 1    MPPEPLPWDRKDFFKERKHERSEFLGPLPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQG 60

Query: 61   GWHVFSEEYGHGYGPSMSFNNKMLENVSSRPSVSHGDGKYARNGRESRSFSQRDWKGHSW 120
            GWHVFSEEYGHGYGPSMSFNNKMLENVSSRPSVSHGDGKYARNGRESRSFSQRDWKGHSW
Sbjct: 61   GWHVFSEEYGHGYGPSMSFNNKMLENVSSRPSVSHGDGKYARNGRESRSFSQRDWKGHSW 120

Query: 121  ATSNGSTNNGGRMQHDLNYDQRSVHDMLIYPSHSHSDFVNPREKVKGQHDKVDDVNGLGT 180
            ATSNGSTNNGGRMQHDLNYDQRSVHDMLIYPSHSHSDFVNPREKVKGQHDKVDDVNGLGT
Sbjct: 121  ATSNGSTNNGGRMQHDLNYDQRSVHDMLIYPSHSHSDFVNPREKVKGQHDKVDDVNGLGT 180

Query: 181  NQRRDREYSVSSSGWKPLKWTRSGGLSSRTSTSGHSSSKKSIDALDSNDRKSETVSKNAS 240
            NQRRDREYSVSSSGWKPLKWTRSGGLSSRTSTSGHSSSKKSI+ALDSNDRKSETVSKNAS
Sbjct: 181  NQRRDREYSVSSSGWKPLKWTRSGGLSSRTSTSGHSSSKKSIEALDSNDRKSETVSKNAS 240

Query: 241  QNFSPSADHAECAMSSLPYDDASARKKPRLGWGEGLAKYEKKKVEVPDGSTAFTNITAES 300
            QNFSPSADHAECAMSSLPYDDASARKKPRLGWGEGLAKYEKKKVEVPDGSTAFTNITAES
Sbjct: 241  QNFSPSADHAECAMSSLPYDDASARKKPRLGWGEGLAKYEKKKVEVPDGSTAFTNITAES 300

Query: 301  THSLNSSLIEKGPRGSGFADCTSPATPSSVISGSPPGGDEKSFGKASSDNDVSNFHGSPG 360
            THSLNSSLIEKGPRGSGFADCTSPATPSSVISGSPPGGDEKSFGKASSDNDVSNFHGSPG
Sbjct: 301  THSLNSSLIEKGPRGSGFADCTSPATPSSVISGSPPGGDEKSFGKASSDNDVSNFHGSPG 360

Query: 361  SCFQNQYEGTSTVEKLDNFSIANLCSPLIQLLQSNDSISVDSTALSKLLIYKNQISKVLE 420
            SCFQNQYEGTSTVEKLDNFSIANLCSPLIQLLQSNDSISVDSTALSKLLIYKNQISKVLE
Sbjct: 361  SCFQNQYEGTSTVEKLDNFSIANLCSPLIQLLQSNDSISVDSTALSKLLIYKNQISKVLE 420

Query: 421  TTESEIDLLENELKGLKSESKGYFSFTLASSSLLVGDKFFEEQNNVANAVATLPVVTSAN 480
            TTESEIDLLENELKGLKSESKGYFSFTLASSSLLVGDKFFEEQNNVANAVATLPVVTSAN
Sbjct: 421  TTESEIDLLENELKGLKSESKGYFSFTLASSSLLVGDKFFEEQNNVANAVATLPVVTSAN 480

Query: 481  TISKTMAHSTSDLEEVYAEKDRSGRLDVKESVMKEKLTIYGCSVKENIAAYIDNSMPIKS 540
            TISKTMAHSTSDLEEVYA+KDRSGRLDVKESVMKEKLTIYGCSVKENIAAYIDNS+PIKS
Sbjct: 481  TISKTMAHSTSDLEEVYADKDRSGRLDVKESVMKEKLTIYGCSVKENIAAYIDNSVPIKS 540

Query: 541  EGVTVHPVANDMYECAEGGDSVSDLILASNKESACKASEALIGLLPTNERKIDIWSTNAC 600
            EGVTVHPVANDMYECAEGGDSVSDLILASNKESACKASEALIG+LPTNERKIDIWSTNAC
Sbjct: 541  EGVTVHPVANDMYECAEGGDSVSDLILASNKESACKASEALIGMLPTNERKIDIWSTNAC 600

Query: 601  SQNQCLVKERFAKRKRLLRFKERVITLKFKAYQSLWKENLHVPPVRKLRAKSQKKHQLSL 660
            SQNQCLVKERFAKRKRLLRFKERVITLKFKAYQSLWKENLHVPPVRKLRAKSQKKHQLSL
Sbjct: 601  SQNQCLVKERFAKRKRLLRFKERVITLKFKAYQSLWKENLHVPPVRKLRAKSQKKHQLSL 660

Query: 661  WTNYSGYQKNRSSIRYRMPSPAGNLNPVSSTEILKHVSMQLSTPQIKQYRRTLKMPALVL 720
            WTNYSGYQKNRSSIRYRMPSPAGNLNPVSSTEILKHVSMQLSTPQIKQYRRTLKMPALVL
Sbjct: 661  WTNYSGYQKNRSSIRYRMPSPAGNLNPVSSTEILKHVSMQLSTPQIKQYRRTLKMPALVL 720

Query: 721  DQKDKMGSRFISNNGLVENPCAVEKERAMINPWTSEEKDVFMEKLECFGKDFGKIASFLD 780
            DQKDKMGSRFISNNGLVENPCAVEKERAMINPWTSEEKDVFMEKLECFGKDFGKIASFLD
Sbjct: 721  DQKDKMGSRFISNNGLVENPCAVEKERAMINPWTSEEKDVFMEKLECFGKDFGKIASFLD 780

Query: 781  HKTTADCVEFYYKNHKSDCFEKTKKLEFGKKVKSSTSNYLMTTGKKWNPETNAASLDMLG 840
            HKTTADCVEFYYKNHKSDCFEKTKKLEFGKKVKSSTSNYLMTTGKKWNPETNAASLDMLG
Sbjct: 781  HKTTADCVEFYYKNHKSDCFEKTKKLEFGKKVKSSTSNYLMTTGKKWNPETNAASLDMLG 840

Query: 841  AASTMTARAHKYSSSRSGGRTSYHITQFDDGLSERAKGLNGFGNEREKVAADVLAGICGS 900
            AASTMTARAHKYSSSRSGGRTSYHITQFDDGLSERAKGLNGFGNEREKVAADVLAGICGS
Sbjct: 841  AASTMTARAHKYSSSRSGGRTSYHITQFDDGLSERAKGLNGFGNEREKVAADVLAGICGS 900

Query: 901  LSSEAMGSCVTSNFNRGDSSQNLKCKKGVTTVLRQRMTTNVPRYVDNEIFSDESCGEMGP 960
            LSSEAMGSCVTSNFNRGDSSQ+LKCKKGVTTVLRQRMTTNVPRYVDNEIFSDESCGEMGP
Sbjct: 901  LSSEAMGSCVTSNFNRGDSSQDLKCKKGVTTVLRQRMTTNVPRYVDNEIFSDESCGEMGP 960

Query: 961  SYWTDGEKSLFIEAVSVYGKNFSVISTHVGSKSTDQCKVFFSKARKCLGLDLICSAKKMP 1020
            SYWTDGEKSLFIEAVSVYGKNFSVISTHVGSKSTDQCKVFFSKARKCLGLDLICSAKKMP
Sbjct: 961  SYWTDGEKSLFIEAVSVYGKNFSVISTHVGSKSTDQCKVFFSKARKCLGLDLICSAKKMP 1020

Query: 1021 DNGNGHDADRSNGEGGVDTKDAFPCEMVGSRVVDDLPKAVMSISGGESESMNLQSTHQEV 1080
            DNGNGHDADRSNGEGGVDTKDAFPCEMVGSRVVDDLPKAVMSISGGESESMNLQSTHQEV
Sbjct: 1021 DNGNGHDADRSNGEGGVDTKDAFPCEMVGSRVVDDLPKAVMSISGGESESMNLQSTHQEV 1080

Query: 1081 NPSSKTCSNAAVDAMVSDDECTRKDGSQSGFDDDCQSVNSANDKNGLIHEQQHVVISDET 1140
            NPSSKTCSNAAVDAMVSDDECTRKDGSQSGFDDDCQSVNSANDKNGLIHEQQHVVISDET
Sbjct: 1081 NPSSKTCSNAAVDAMVSDDECTRKDGSQSGFDDDCQSVNSANDKNGLIHEQQHVVISDET 1140

Query: 1141 AKEQDISVLVATSVGNVSDTETKRGNVDASTARGDKADSHATDCPSIPSNSHITSSAKEE 1200
            AKEQDISVLVATSVGNVSDTETKRGNVDASTARGDKADSHATDCPSIPSNSHITSSAKEE
Sbjct: 1141 AKEQDISVLVATSVGNVSDTETKRGNVDASTARGDKADSHATDCPSIPSNSHITSSAKEE 1200

Query: 1201 QGRHHVRVHSRSLSDSEQSSRNGDIKLFGQILTHSSFVPSSKSGSSVNGIKTTEPHHKFK 1260
            QGRHHVRVHSRSLSDSEQSSRNGDIKLFGQILTHSSFVPSSKSGSS NGIKTTEPHHKFK
Sbjct: 1201 QGRHHVRVHSRSLSDSEQSSRNGDIKLFGQILTHSSFVPSSKSGSSENGIKTTEPHHKFK 1260

Query: 1261 RRLKVNSHGNLSTAKFNCKNSPGQEENTPSRSYGIWDGNQIRTGLSSLPDPTTLLSRYPT 1320
            RRLKVNSHGNLSTAKFNCKNSPGQEENTPSRSYGIWDGNQIRTGL SLPDPTTLLSRYPT
Sbjct: 1261 RRLKVNSHGNLSTAKFNCKNSPGQEENTPSRSYGIWDGNQIRTGLLSLPDPTTLLSRYPT 1320

Query: 1321 FNHLSKPASSPTEQSPSGCKEETSNSNKETQKREVNNSRKEEVVGEMNVEESCCNEGGGG 1380
            FNHLSKPASSPTEQSPSGCKEETSNSNKETQKREVNNSRKEEVVGEMNVEESCCNEGGGG
Sbjct: 1321 FNHLSKPASSPTEQSPSGCKEETSNSNKETQKREVNNSRKEEVVGEMNVEESCCNEGGGG 1380

Query: 1381 GGS 1384
            GGS
Sbjct: 1381 GGS 1383

BLAST of CSPI05G16750 vs. TrEMBL
Match: A0A061FMP2_THECC (Duplicated homeodomain-like superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_043101 PE=4 SV=1)

HSP 1 Score: 963.0 bits (2488), Expect = 3.9e-277
Identity = 604/1190 (50.76%), Positives = 776/1190 (65.21%), Query Frame = 1

Query: 1    MPPEPLPWDRKDFFKERKHERSEFLGPLP---RWRDSSS-----HGS-REFSRWGSGDFR 60
            MPPEPLPWDRKDF+KERKHER+E     P   RWRDSSS     HGS REF+RWGS D R
Sbjct: 1    MPPEPLPWDRKDFYKERKHERTESQPQQPSTARWRDSSSMSSYQHGSFREFTRWGSADLR 60

Query: 61   RPPGHGRQGGWHVFSEEYG-HGYGPSMSFNNKMLENVSSRPSVSHGDGKYARNG-RESR- 120
            RPPGHG+QG WH+F+EE G HGY PS S  +KML++ S R SVS GDGKY+RN  RE+  
Sbjct: 61   RPPGHGKQGSWHLFAEENGGHGYVPSRS-GDKMLDDESCRQSVSRGDGKYSRNSSRENNR 120

Query: 121  -SFSQRDWKGHSWATSNGSTNNGGRMQHDLNYDQRSVHDMLIYPSHSHSDFVNPREKV-K 180
             S+SQRDW+ HSW  SNGS N  GR  HD+N +QRSV DML YPSH+HSDFV+  +++ K
Sbjct: 121  ASYSQRDWRAHSWEMSNGSPNTPGR-PHDVNNEQRSVDDMLTYPSHAHSDFVSTWDQLHK 180

Query: 181  GQHD-KVDDVNGLGTNQRRDREYSVSSSGWKPLKWTRSGGLSSRTSTSGHSSSKKSIDAL 240
             QHD K   VNGLGT QR +RE SV S  WKPLKW+RSG LSSR S   HSSS KS+  +
Sbjct: 181  DQHDNKTSGVNGLGTGQRCERENSVGSMDWKPLKWSRSGSLSSRGSGFSHSSSSKSLGGV 240

Query: 241  DSNDRKSETVSKNASQNFSPSADHAECAMSSLPYDDASARKKPRLGWGEGLAKYEKKKVE 300
            DS + K E   KN +   SPS D A C  S+ P D+  +RKKPRLGWGEGLAKYEKKKVE
Sbjct: 241  DSGEGKLELQQKNLTPVQSPSGDAAACVTSAAPSDETMSRKKPRLGWGEGLAKYEKKKVE 300

Query: 301  VPD-----GSTAFTNITAESTHSLNSSLIEKGPRGSGFADCTSPATPSSVISGSPPGGDE 360
             PD     G    +    E  +SL S+L EK PR  GF+DC SPATPSSV   S PG +E
Sbjct: 301  GPDTSMNRGVATISVGNTEPNNSLGSNLAEKSPRVLGFSDCASPATPSSVACSSSPGVEE 360

Query: 361  KSFGKASS-DNDVSNFHGSPGSCFQNQYEGTS-TVEKLDNFSIANLCSPLIQLLQSNDSI 420
            KSFGKA++ DND+SN  GSP    QN  EG S  +EKLD  SI N+ S L+ LLQS+D  
Sbjct: 361  KSFGKAANIDNDISNLCGSPSLGSQNHLEGPSFNLEKLDMNSIINMGSSLVDLLQSDDPS 420

Query: 421  SVD-----STALSKLLIYKNQISKVLETTESEIDLLENELKGLKSESKGYFSFTLASSSL 480
            +VD     STA++KLL++K  + K LETTESEID LENELK LK+ S   +     SSSL
Sbjct: 421  TVDSSFVRSTAMNKLLLWKGDVLKALETTESEIDSLENELKTLKANSGSRYPCPATSSSL 480

Query: 481  LVGD--KFFEEQNNVANAV---ATLPVVTSANTISKTMAHSTSDLEEVYAEKDRSGRLD- 540
             + +  +  EE   ++N +   A L +    + + + +     DLEEV A+  + G +D 
Sbjct: 481  PMEENGRACEELEAISNMIPRPAPLKIDPCGDALEEKVPLCNGDLEEVNADA-KDGDIDS 540

Query: 541  -------------VKESVMKEKLTIYGCS-----------VKENIAAYIDN---SMPIKS 600
                         ++++V    + ++ CS            + N+A    N   S+P   
Sbjct: 541  PGTATSKFVEPSSLEKAVSPSDVKLHECSGDLGTVQLTTMGEVNLAPGSSNEGTSVPFSG 600

Query: 601  EGVTVHPVANDMYECAEGGDSVSDL-------ILASNKESACKASEALIGLLPTNE-RKI 660
            EG  +  + ND++   E  +SV+D+       I+A+NKE A  AS+    LLP +    I
Sbjct: 601  EGSALEKIDNDVHG-PEPSNSVADIENIMYDVIIATNKELANSASKVFNNLLPKDWCSVI 660

Query: 661  DIWSTNACSQNQCLVKERFAKRKRLLRFKERVITLKFKAYQSLWKENLHVPPVRKLRAKS 720
               +  AC Q   L++E+  KRK+ +RFKERV+ LKFKA+Q  WKE++  P +RK RAKS
Sbjct: 661  SEIANGACWQTDSLIREKIVKRKQCIRFKERVLMLKFKAFQHAWKEDMRSPLIRKYRAKS 720

Query: 721  QKKHQLSLWTNYSGYQKNRSSIRYRMPSPAGNLNPVSSTEILKHVSMQLSTPQIKQYRRT 780
            QKK++LSL +   GYQK+RSSIR R+ SPAGNL+  S+ E++  VS  LS   ++ YR  
Sbjct: 721  QKKYELSLRSTLGGYQKHRSSIRSRLTSPAGNLSLESNVEMINFVSKLLSDSHVRLYRNA 780

Query: 781  LKMPALVLDQKDKMGSRFISNNGLVENPCAVEKERAMINPWTSEEKDVFMEKLECFGKDF 840
            LKMPAL LD+K+K  SRFIS+NGLVE+PCAVEKERA+INPWTSEEK++FM+KL  FGKDF
Sbjct: 781  LKMPALFLDEKEKQVSRFISSNGLVEDPCAVEKERALINPWTSEEKEIFMDKLAAFGKDF 840

Query: 841  GKIASFLDHKTTADCVEFYYKNHKSDCFEKT-KKLEFGKKVKSSTSNYLMTTGKKWNPET 900
             KIASFLDHKTTADCVEFYYKNHKS+CFEKT KKL+  K+ KS+ + YL+T+GKKW+ E 
Sbjct: 841  RKIASFLDHKTTADCVEFYYKNHKSECFEKTKKKLDLSKQGKSTANTYLLTSGKKWSREL 900

Query: 901  NAASLDMLGAASTMTARAHKYSSSRS--------GGRTSYHITQFDDGLSERAKGLNGFG 960
            NAASLD+LG AS + A A     +R         GGR     ++ DD + ER+   +  G
Sbjct: 901  NAASLDVLGEASVIAAHAESGMRNRQTSAGRIFLGGRFDSKTSRVDDSIVERSSSFDVIG 960

Query: 961  NEREKVAADVLAGICGSLSSEAMGSCVTSNFNRGDSSQ-NLKCKKGVTTVLRQRMTTNVP 1020
            N+RE VAADVLAGICGSLSSEAM SC+TS+ + G+S Q   KC+K V +V+++  T++V 
Sbjct: 961  NDRETVAADVLAGICGSLSSEAMSSCITSSADPGESYQREWKCQK-VDSVVKRPSTSDVT 1020

Query: 1021 RYVDNEIFSDESCGEMGPSYWTDGEKSLFIEAVSVYGKNFSVISTHVGSKSTDQCKVFFS 1080
            + +D++  SDESCGEM P+ WTD EKS+FI+AVS+YGK+F++IS  VG++S DQCKVFFS
Sbjct: 1021 QNIDDDTCSDESCGEMDPADWTDEEKSVFIQAVSLYGKDFAMISRCVGTRSRDQCKVFFS 1080

Query: 1081 KARKCLGLDLICSAKKMPDNGNGHDADRSNGEGGVDTKDAFPCE-------MVGSRVVDD 1099
            KARKCLGLDLI    +   N     +D +NG GG D +DA   E        +GS+V +D
Sbjct: 1081 KARKCLGLDLIHPRTR---NLGTPMSDDANG-GGSDIEDACVLESSVVCSDKLGSKVEED 1140

BLAST of CSPI05G16750 vs. TrEMBL
Match: A0A061FNE6_THECC (Duplicated homeodomain-like superfamily protein isoform 2 OS=Theobroma cacao GN=TCM_043101 PE=4 SV=1)

HSP 1 Score: 956.8 bits (2472), Expect = 2.8e-275
Identity = 603/1190 (50.67%), Positives = 775/1190 (65.13%), Query Frame = 1

Query: 1    MPPEPLPWDRKDFFKERKHERSEFLGPLP---RWRDSSS-----HGS-REFSRWGSGDFR 60
            MPPEPLPWDRKDF+KERKHER+E     P   RWRDSSS     HGS REF+RWGS D R
Sbjct: 1    MPPEPLPWDRKDFYKERKHERTESQPQQPSTARWRDSSSMSSYQHGSFREFTRWGSADLR 60

Query: 61   RPPGHGRQGGWHVFSEEYG-HGYGPSMSFNNKMLENVSSRPSVSHGDGKYARNG-RESR- 120
            RPPGHG+QG WH+F+EE G HGY PS S  +KML++ S R SVS GDGKY+RN  RE+  
Sbjct: 61   RPPGHGKQGSWHLFAEENGGHGYVPSRS-GDKMLDDESCRQSVSRGDGKYSRNSSRENNR 120

Query: 121  -SFSQRDWKGHSWATSNGSTNNGGRMQHDLNYDQRSVHDMLIYPSHSHSDFVNPREKV-K 180
             S+SQRDW+ HSW  SNGS N  GR  HD+N +QRSV DML YPSH+HSDFV+  +++ K
Sbjct: 121  ASYSQRDWRAHSWEMSNGSPNTPGR-PHDVNNEQRSVDDMLTYPSHAHSDFVSTWDQLHK 180

Query: 181  GQHD-KVDDVNGLGTNQRRDREYSVSSSGWKPLKWTRSGGLSSRTSTSGHSSSKKSIDAL 240
             QHD K   VNGLGT QR +RE SV S  WKPLKW+RSG LSSR S   HSSS KS+  +
Sbjct: 181  DQHDNKTSGVNGLGTGQRCERENSVGSMDWKPLKWSRSGSLSSRGSGFSHSSSSKSLGGV 240

Query: 241  DSNDRKSETVSKNASQNFSPSADHAECAMSSLPYDDASARKKPRLGWGEGLAKYEKKKVE 300
            DS + K E   KN +   SPS D A C  S+ P D+  +RKKPRLGWGEGLAKYEKKKVE
Sbjct: 241  DSGEGKLELQQKNLTPVQSPSGDAAACVTSAAPSDETMSRKKPRLGWGEGLAKYEKKKVE 300

Query: 301  VPD-----GSTAFTNITAESTHSLNSSLIEKGPRGSGFADCTSPATPSSVISGSPPGGDE 360
             PD     G    +    E  +SL S+L EK PR  GF+DC SPATPSSV   S PG +E
Sbjct: 301  GPDTSMNRGVATISVGNTEPNNSLGSNLAEKSPRVLGFSDCASPATPSSVACSSSPGVEE 360

Query: 361  KSFGKASS-DNDVSNFHGSPGSCFQNQYEGTS-TVEKLDNFSIANLCSPLIQLLQSNDSI 420
            KSFGKA++ DND+SN  GSP    QN  EG S  +EKLD  SI N+ S L+ LLQS+D  
Sbjct: 361  KSFGKAANIDNDISNLCGSPSLGSQNHLEGPSFNLEKLDMNSIINMGSSLVDLLQSDDPS 420

Query: 421  SVD-----STALSKLLIYKNQISKVLETTESEIDLLENELKGLKSESKGYFSFTLASSSL 480
            +VD     STA++KLL++K  + K LETTESEID LENELK LK+ S   +     SSSL
Sbjct: 421  TVDSSFVRSTAMNKLLLWKGDVLKALETTESEIDSLENELKTLKANSGSRYPCPATSSSL 480

Query: 481  LVGD--KFFEEQNNVANAV---ATLPVVTSANTISKTMAHSTSDLEEVYAEKDRSGRLD- 540
             + +  +  EE   ++N +   A L +    + + + +     DLEEV A+  + G +D 
Sbjct: 481  PMEENGRACEELEAISNMIPRPAPLKIDPCGDALEEKVPLCNGDLEEVNADA-KDGDIDS 540

Query: 541  -------------VKESVMKEKLTIYGCS-----------VKENIAAYIDN---SMPIKS 600
                         ++++V    + ++ CS            + N+A    N   S+P   
Sbjct: 541  PGTATSKFVEPSSLEKAVSPSDVKLHECSGDLGTVQLTTMGEVNLAPGSSNEGTSVPFSG 600

Query: 601  EGVTVHPVANDMYECAEGGDSVSDL-------ILASNKESACKASEALIGLLPTNE-RKI 660
            EG  +  + ND++   E  +SV+D+       I+A+NKE A  AS+    LLP +    I
Sbjct: 601  EGSALEKIDNDVHG-PEPSNSVADIENIMYDVIIATNKELANSASKVFNNLLPKDWCSVI 660

Query: 661  DIWSTNACSQNQCLVKERFAKRKRLLRFKERVITLKFKAYQSLWKENLHVPPVRKLRAKS 720
               +  AC Q   L++E+  KRK+ +RFKERV+ LKFKA+Q  WKE++  P +RK RAKS
Sbjct: 661  SEIANGACWQTDSLIREKIVKRKQCIRFKERVLMLKFKAFQHAWKEDMRSPLIRKYRAKS 720

Query: 721  QKKHQLSLWTNYSGYQKNRSSIRYRMPSPAGNLNPVSSTEILKHVSMQLSTPQIKQYRRT 780
            QKK++LSL +   GYQK+RSSIR R+ SP GNL+  S+ E++  VS  LS   ++ YR  
Sbjct: 721  QKKYELSLRSTLGGYQKHRSSIRSRLTSP-GNLSLESNVEMINFVSKLLSDSHVRLYRNA 780

Query: 781  LKMPALVLDQKDKMGSRFISNNGLVENPCAVEKERAMINPWTSEEKDVFMEKLECFGKDF 840
            LKMPAL LD+K+K  SRFIS+NGLVE+PCAVEKERA+INPWTSEEK++FM+KL  FGKDF
Sbjct: 781  LKMPALFLDEKEKQVSRFISSNGLVEDPCAVEKERALINPWTSEEKEIFMDKLAAFGKDF 840

Query: 841  GKIASFLDHKTTADCVEFYYKNHKSDCFEKT-KKLEFGKKVKSSTSNYLMTTGKKWNPET 900
             KIASFLDHKTTADCVEFYYKNHKS+CFEKT KKL+  K+ KS+ + YL+T+GKKW+ E 
Sbjct: 841  RKIASFLDHKTTADCVEFYYKNHKSECFEKTKKKLDLSKQGKSTANTYLLTSGKKWSREL 900

Query: 901  NAASLDMLGAASTMTARAHKYSSSRS--------GGRTSYHITQFDDGLSERAKGLNGFG 960
            NAASLD+LG AS + A A     +R         GGR     ++ DD + ER+   +  G
Sbjct: 901  NAASLDVLGEASVIAAHAESGMRNRQTSAGRIFLGGRFDSKTSRVDDSIVERSSSFDVIG 960

Query: 961  NEREKVAADVLAGICGSLSSEAMGSCVTSNFNRGDSSQ-NLKCKKGVTTVLRQRMTTNVP 1020
            N+RE VAADVLAGICGSLSSEAM SC+TS+ + G+S Q   KC+K V +V+++  T++V 
Sbjct: 961  NDRETVAADVLAGICGSLSSEAMSSCITSSADPGESYQREWKCQK-VDSVVKRPSTSDVT 1020

Query: 1021 RYVDNEIFSDESCGEMGPSYWTDGEKSLFIEAVSVYGKNFSVISTHVGSKSTDQCKVFFS 1080
            + +D++  SDESCGEM P+ WTD EKS+FI+AVS+YGK+F++IS  VG++S DQCKVFFS
Sbjct: 1021 QNIDDDTCSDESCGEMDPADWTDEEKSVFIQAVSLYGKDFAMISRCVGTRSRDQCKVFFS 1080

Query: 1081 KARKCLGLDLICSAKKMPDNGNGHDADRSNGEGGVDTKDAFPCE-------MVGSRVVDD 1099
            KARKCLGLDLI    +   N     +D +NG GG D +DA   E        +GS+V +D
Sbjct: 1081 KARKCLGLDLIHPRTR---NLGTPMSDDANG-GGSDIEDACVLESSVVCSDKLGSKVEED 1140

BLAST of CSPI05G16750 vs. TrEMBL
Match: F6HNI1_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_13s0019g04010 PE=4 SV=1)

HSP 1 Score: 936.4 bits (2419), Expect = 3.9e-269
Identity = 608/1243 (48.91%), Positives = 774/1243 (62.27%), Query Frame = 1

Query: 1    MPPEPLPWDRKDFFKERKHERSEFLGPLPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQG 60
            MPPEPLPWDRKDFFKERKHERSE LG   RWRDS   GSREF+RWGS + RRPPGHG+QG
Sbjct: 1    MPPEPLPWDRKDFFKERKHERSESLGFSARWRDSHQ-GSREFARWGSAEVRRPPGHGKQG 60

Query: 61   GWHVFSEEYGHGYGPSMSFNNKMLENVSSRPSVSHGDG--KYARNGRESR-SFSQRDWKG 120
            GWH+F EE GHG+ PS S ++KM+E+ +SRP  + GDG  KY+RN RE R SFSQ+DWKG
Sbjct: 61   GWHIFPEESGHGFVPSRS-SDKMVEDENSRPFTTRGDGNGKYSRNNREIRGSFSQKDWKG 120

Query: 121  HSWATSNGSTNNGGRMQHDLNYDQRSVHDMLIYPSHSHSDFVNPREKV--KGQHDKVDDV 180
            H   T N S N  GR    +N DQRSV DMLI     HSDFVN  +++  K QHDK+  V
Sbjct: 121  HPLETGNASPNMSGRSLA-IN-DQRSVDDMLI-----HSDFVNGWDQLQLKDQHDKMGSV 180

Query: 181  NGLGTNQRRDREYSVSSSGWKPLKWTRSGGLSSRTSTSGHSSSKKSIDALDSNDRKSETV 240
            NGLGT QR +RE S+SS  WKPLKWTRSG LSSR S   HSSS KS+  +DSN+ + +  
Sbjct: 181  NGLGTGQRAERENSLSSIDWKPLKWTRSGSLSSRGSGFSHSSSSKSM-GVDSNEARGDLQ 240

Query: 241  SKNASQNFSPSADHAECAMSSLPYDDASARKKPRLGWGEGLAKYEKKKVEVPDGST---- 300
             +N +   SPS D   C  S+ P ++ S+RKKPRLGWGEGLAKYE+KKVE PD S     
Sbjct: 241  PRNVTPVQSPSGDAVACVASTAPSEETSSRKKPRLGWGEGLAKYERKKVEGPDESVNKNG 300

Query: 301  -AFTNITAESTHSLNSSLIEKGPRGSGFADCTSPATPSSVISGSPPGGDEKSFGKASS-D 360
              F     ESTHSLNS+L +K PR  GF+DC SPATPSSV   S PG +EKSF KA + D
Sbjct: 301  IVFCTSNGESTHSLNSNLADKSPRVMGFSDCASPATPSSVACSSSPGMEEKSFSKAGNVD 360

Query: 361  NDVSNFHGSPGSCFQNQYEGTSTV-EKLDNFSIANLCSPLIQLLQSNDSISVD-----ST 420
            ND S   GSPG    N  +G S + E L+   IANL    I+LLQS+D  SVD     ST
Sbjct: 361  NDTSTLSGSPGPVSLNHLDGFSFILESLEPNQIANLGFSPIELLQSDDPSSVDSNFMRST 420

Query: 421  ALSKLLIYKNQISKVLETTESEIDLLENELKGLKSESKGYFSFTLASSSLLVGDKF--FE 480
            A+SKLLI+K  ISK LE TESEID LENELK LKS S        ASSS  V  K    E
Sbjct: 421  AMSKLLIWKGDISKSLEMTESEIDTLENELKSLKSGSGSSCPCPAASSSFPVEGKAKPCE 480

Query: 481  EQNNVANAV---ATLPVVTSANTISKTMAHSTSDLEEVYAEK-----DRSGRLDVK---- 540
            EQ   +N +   A L +V   + ++      +  +E+ +AE      D  G    K    
Sbjct: 481  EQGAASNLILRPAPLQIVPPGDMMTDKTLLGSDAMEDAHAEVKDEDIDSPGTATSKFVEP 540

Query: 541  ----ESVMKEKLTIYG-CSVKENIAAYIDNSMPIKSEGVTVHP----------------- 600
                ++     + I G CS    I    +  + +   G  V                   
Sbjct: 541  PCLVKTASPSDMVIQGECSGNLKITRSTNMEVELLVSGPNVEETGISTSGGDSRLLVESK 600

Query: 601  ----VANDMYECAEGGDSVSDLILASNKESACKASEALIGLLPTNERKIDIW--STNACS 660
                V+ DM    +  D + +LILASNK+ A +ASE    LLP N+ + DI   +  AC 
Sbjct: 601  TGARVSGDMGVLDDEEDKIYNLILASNKDCANRASEVFNKLLPQNQCQNDILGAANFACR 660

Query: 661  QNQCLVKERFAKRKRLLRFKERVITLKFKAYQSLWKENLHVPPVRKLRAKSQKKHQLSLW 720
            QN  L+K++FA RKR LRFKE+VITLKF+  Q +WKE++ +  +RK RAKSQKK +LSL 
Sbjct: 661  QNDSLIKQKFAMRKRFLRFKEKVITLKFRVSQHVWKEDMRLLSIRKYRAKSQKKFELSLR 720

Query: 721  TNYSGYQKNRSSIRYRMPSPAGNLNPVSSTEILKHVSMQLSTPQIKQYRRTLKMPALVLD 780
            T++ GYQK+RSSIR R  SPAGNL+PV + E++ + S  LS  Q+K  R  LKMPAL+LD
Sbjct: 721  TSHCGYQKHRSSIRSRFSSPAGNLSPVPTAEMINYTSKMLSESQMKLCRNILKMPALILD 780

Query: 781  QKDKMGSRFISNNGLVENPCAVEKERAMINPWTSEEKDVFMEKLECFGKDFGKIASFLDH 840
            +K+K  SRFIS+NGLVE+PCAVE ER MINPWT+EEK++FM+KL  FGK+F KIASFLDH
Sbjct: 781  KKEKTASRFISSNGLVEDPCAVENERTMINPWTAEEKEIFMDKLAIFGKEFKKIASFLDH 840

Query: 841  KTTADCVEFYYKNHKSDCFEKT-KKLEFGKKVKS-STSNYLMTTGKKWNPETNAASLDML 900
            KTTADCVEFYYKNHKSDCFEKT KKLE  K+ KS S + YL+T+GKKWN E NAASLDML
Sbjct: 841  KTTADCVEFYYKNHKSDCFEKTKKKLELRKQGKSLSATTYLVTSGKKWNREMNAASLDML 900

Query: 901  GAASTMTARAHKYSSSRS--------GGRTSYHITQFDDGLSERAKGLNGFGNEREKVAA 960
            GAAS M ARA     +          G    Y     D+G+ ER+   +   NERE VAA
Sbjct: 901  GAASVMAARAGDSMENLQTCPGKFLLGAHHDYRTPHGDNGVVERSSSYDIIRNERETVAA 960

Query: 961  DVLAGICGSLSSEAMGSCVTSNFNRGDSSQNLKCKKGVTTVLRQRMTTNVPRYVDNEIFS 1020
            DVLAGICGSLSSEAM SC+TS+ + G+  + L+ K G  + +++ +T  V + +D E  S
Sbjct: 961  DVLAGICGSLSSEAMSSCITSSLDPGEGYRELRQKVG--SGVKRPLTPEVTQSIDEETCS 1020

Query: 1021 DESCGEMGPSYWTDGEKSLFIEAVSVYGKNFSVISTHVGSKSTDQCKVFFSKARKCLGLD 1080
            DESCGEM P+ WTD EK +F++AVS YGK+F+ IS  V ++S DQCKVFFSKARKCLGLD
Sbjct: 1021 DESCGEMDPADWTDEEKCIFVQAVSSYGKDFAKISRCVRTRSRDQCKVFFSKARKCLGLD 1080

Query: 1081 LICSAKKMPDNGNGHDADRSNGEGGVDTKDAFPCE--------MVGSRVVDDLPKAVMSI 1140
            LI        N    ++D +NG GG DT+DA   E          GS++ +D   +V++I
Sbjct: 1081 LIHPG----PNVGTPESDDANG-GGSDTEDACVVEAGSVICSNKSGSKMEEDSLLSVLNI 1140

Query: 1141 SGGESESMNLQSTHQEVNPSSKTCSNAAVD-------AMVSDDECTRKDGSQSGFDDDCQ 1160
            +  ES+   +++   ++N S +      VD         +  D+C + + ++  F D   
Sbjct: 1141 NPDESDFSGMKNLQTDLNRSYENNGIGRVDHKDDETVTNLVSDKCHQLEKTEQVFGDS-N 1200

BLAST of CSPI05G16750 vs. TrEMBL
Match: A5AZS6_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_026902 PE=4 SV=1)

HSP 1 Score: 916.8 bits (2368), Expect = 3.2e-263
Identity = 605/1263 (47.90%), Positives = 770/1263 (60.97%), Query Frame = 1

Query: 1    MPPEPLPWDRKDFFKERKHERSEFLGPLPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQG 60
            MPPEPLPWDRKDFFKERKHERSE LG   RWRDS   GSREF+RWGS   RRPPGHG+QG
Sbjct: 1    MPPEPLPWDRKDFFKERKHERSESLGFSARWRDSHQ-GSREFARWGSAXVRRPPGHGKQG 60

Query: 61   GWHVFSEEYGHGYGPSMSFNNKMLENVSSRPSVSHGDG--KYARNGRESR-SFSQRDWKG 120
            GWH+F EE GHG+ PS S ++KM+E+ +SRP    GDG  KY+RN RE R SFSQ+DWKG
Sbjct: 61   GWHIFPEESGHGFVPSRS-SDKMVEDENSRPFTXRGDGNGKYSRNNREIRGSFSQKDWKG 120

Query: 121  HSWATSNGSTNNGGRMQHDLNYDQRSVHDMLIYPSHSHSDFVNPREKV--KGQHDKVDDV 180
            H   T N S N  GR    +N DQRSV DMLI     HSDFVN  +++  K QHDK+  V
Sbjct: 121  HPLETGNASPNMSGRSLA-IN-DQRSVDDMLI-----HSDFVNGWDQLQLKDQHDKMGSV 180

Query: 181  NGLGTNQRRDREYSVSSSGWKPLKWTRSGGLSSRTSTSGHSSSKKSIDALDSNDRKSETV 240
            NGLGT QR +RE S+SS  WKPLKWTRSG LSSR S   HSSS KS+  +DSN+ + +  
Sbjct: 181  NGLGTGQRAERENSLSSIDWKPLKWTRSGSLSSRGSGFSHSSSSKSM-GVDSNEARGDLQ 240

Query: 241  SKNASQNFSPSADHAECAMSSLPYDDASARKKPRLGWGEGLAKYEKKKVEVPDGST---- 300
             +N +   SPS D   C  S+ P ++ S+RKKPRLGWGEGLAKYE+KKVE PD S     
Sbjct: 241  XRNVTPVQSPSGDAVACVASTAPSEETSSRKKPRLGWGEGLAKYERKKVEGPDESVNKNG 300

Query: 301  -AFTNITAESTHSLNSSLIEKGPRGSGFADCTSPATPSSVISGSPPGGDEKSFGKASS-D 360
              F     ESTHSLNS+L +K PR  GF+DC SPATPSSV   S PG ++KSF KA + D
Sbjct: 301  IVFCTSNGESTHSLNSNLADKSPRVMGFSDCASPATPSSVACSSSPGMEDKSFSKAGNVD 360

Query: 361  NDVSNFHGSPGSCFQNQYEGTSTV-EKLDNFSIANLCSPLIQLLQSNDSISVD-----ST 420
            ND S   GSPG    N  +G S + E L+   IANL    I+LLQS+D  SVD     ST
Sbjct: 361  NDTSTLSGSPGPVSLNHLDGFSFILESLEPNQIANLGFSPIELLQSDDPSSVDSNFMRST 420

Query: 421  ALSKLLIYKNQISKVLETTESEIDLLENELKGLKSESKGYFSFTLASSSLLVGDKF--FE 480
            A+SKLLI+K  ISK LE TESEID LENELK LKS S        ASSS  V  K    E
Sbjct: 421  AMSKLLIWKGDISKSLEMTESEIDTLENELKSLKSGSGSSCPCPAASSSFPVEGKAKPCE 480

Query: 481  EQNNVANAV---ATLPVVTSANTISKTMAHSTSDLEEVYAEK-----DRSGRLDVK---- 540
            EQ   +N +   A L +V   + ++      +  +E+ +AE      D  G    K    
Sbjct: 481  EQGAASNLILRPAPLQIVPPGDMMTDKTLLGSDAMEDAHAEVKDEDIDSPGTATSKFVEP 540

Query: 541  ----ESVMKEKLTIYG-CSVKENIAAYIDNSMPIKSEGVTVHP----------------- 600
                ++     + I G CS    I    +  + +   G  V                   
Sbjct: 541  PCLVKTASPSDMVIQGECSGNLKITRSTNMEVELLVSGPNVEETGISTSGGDSRLLVESK 600

Query: 601  ----VANDMYECAEGGDSVSDLILASNKESACKASEALIGLLPTNERKIDIW--STNACS 660
                V+ DM    +  D + +LILASNK+ A +ASE    LLP N+ + DI   +  AC 
Sbjct: 601  TGARVSGDMGVLDDEEDKIYNLILASNKDCANRASEVFNKLLPQNQCQNDILGAANFACR 660

Query: 661  QNQCLVKERFAKRKRLLRFKERVITLKFKAYQSLWKENLHVPPVRKLRAKSQKKHQLSLW 720
            QN  L+K++FA RKR LRFKE+VITLKF+  Q +WKE++ +  +RK RAKSQKK +LSL 
Sbjct: 661  QNDSLIKQKFAMRKRFLRFKEKVITLKFRVSQHVWKEDMRLLSIRKYRAKSQKKFELSLR 720

Query: 721  TNYSGYQKNRSSIRYRMPSPA--------------------GNLNPVSSTEILKHVSMQL 780
            T++ GYQK+RSSIR R  SP                     GNL+PV + E++ + S  L
Sbjct: 721  TSHCGYQKHRSSIRSRFSSPGADFFLNLVLALFFEKLAVQPGNLSPVPTAEMINYTSKML 780

Query: 781  STPQIKQYRRTLKMPALVLDQKDKMGSRFISNNGLVENPCAVEKERAMINPWTSEEKDVF 840
            S  Q+K  R  LKMPAL+LD+K+K  SRFIS+NGLVE+PCAVE ER MINPWT+EEK++F
Sbjct: 781  SESQMKLCRNILKMPALILDKKEKTASRFISSNGLVEDPCAVENERTMINPWTAEEKEIF 840

Query: 841  MEKLECFGKDFGKIASFLDHKTTADCVEFYYKNHKSDCFEKT-KKLEFGKKVKS-STSNY 900
            M+KL  FGK+F KIASFLDHKTTADCVEFYYKNHKSDCFEKT KKLE  K+ KS S + Y
Sbjct: 841  MDKLAIFGKEFKKIASFLDHKTTADCVEFYYKNHKSDCFEKTKKKLELRKQGKSLSATTY 900

Query: 901  LMTTGKKWNPETNAASLDMLGAASTMTARAHKYSSSRS--------GGRTSYHITQFDDG 960
            L+T+GKKWN E NAASLDMLGAAS M ARA     +          G    Y     D+G
Sbjct: 901  LVTSGKKWNREMNAASLDMLGAASVMAARAGDSMENLQTCPGKFLLGAHHDYRTPHGDNG 960

Query: 961  LSERAKGLNGFGNEREKVAADVLAGICGSLSSEAMGSCVTSNFNRGDSSQNLKCKKGVTT 1020
            + ER+   +   NERE VAADVLAGICGSLSSEAM SC+TS+ + G+  + L+ K G  +
Sbjct: 961  VVERSSSYDIIRNERETVAADVLAGICGSLSSEAMSSCITSSLDPGEGYRELRQKVG--S 1020

Query: 1021 VLRQRMTTNVPRYVDNEIFSDESCGEMGPSYWTDGEKSLFIEAVSVYGKNFSVISTHVGS 1080
             +++ +T  V + +  E  SDESCGEM P+ WTD EK +F++AVS YGK+F+ IS  V +
Sbjct: 1021 GVKRPLTPEVTQSIAEETCSDESCGEMDPADWTDEEKCIFVQAVSSYGKDFAKISRCVRT 1080

Query: 1081 KSTDQCKVFFSKARKCLGLDLICSAKKMPDNGNGHDADRSNGEGGVDTKDAFPCE----- 1140
            +S DQCKVFFSKARKCLGLDLI        N    ++D +NG GG DT+DA   E     
Sbjct: 1081 RSRDQCKVFFSKARKCLGLDLIHPG----PNVGTPESDDANG-GGSDTEDACVVEAGSVI 1140

Query: 1141 ---MVGSRVVDDLPKAVMSISGGESESMNLQSTHQEVNPSSKTCSNAAVD-------AMV 1160
                 GS++ +D   +V++I+  ES+   +++   ++N S +      VD         +
Sbjct: 1141 CSNKSGSKMEEDSLLSVLNINPDESDFSGMKNLQTDLNRSYENNGIGRVDHKDDETVTNL 1200

BLAST of CSPI05G16750 vs. TAIR10
Match: AT3G52250.1 (AT3G52250.1 Duplicated homeodomain-like superfamily protein)

HSP 1 Score: 451.8 bits (1161), Expect = 1.5e-126
Identity = 383/1064 (36.00%), Positives = 562/1064 (52.82%), Query Frame = 1

Query: 135  HDLNYDQRSVHDMLIYPSHSHSDFVNPREKVK----GQHDKVDDVNGLGTNQRRDREYSV 194
            +DL Y +R V D  +     +++     E+++      ++ +  +N +  +++  +E S+
Sbjct: 232  NDLMYGRRLVSDNSLDAPIPNAELEGTWEQLRLKDPQDNNSLHGINDIDGDRKCAKESSL 291

Query: 195  SSSGWKPLKWTRSGGLSSRTSTSGHSSSKKSIDALDSNDRKSETVSKNASQNFSPSADHA 254
             ++G  PL W  SG  +S++S   HSSS KS+ A+DS+DRK E + K  +   S S D  
Sbjct: 292  GATGKLPL-WNSSGSFASQSSGFSHSSSLKSLGAVDSSDRKIEVLPKIVTVTQSSSGDAT 351

Query: 255  ECAMSSLPYDDASARKKPRLGWGEGLAKYEKKKVEV---PDGSTAFTNITAESTHSLNSS 314
             CA ++   ++ S+RKK RLGWGEGLAKYEKKKV+V    DG+T   N   E  HSLN +
Sbjct: 352  ACATTTHLSEEMSSRKKQRLGWGEGLAKYEKKKVDVNPNEDGTTLMEN-GLEELHSLNKN 411

Query: 315  LIEKGPRGSGFADCTSPATPSSVISGSPPGGDEKSFGKAS-SDNDVSNFHGSPGSCFQNQ 374
            + +K P  +   D  SP TPSSV   S PG  +KS  KA+ + +DVSN   SP       
Sbjct: 412  IADKSPTAAIVPDYGSPTTPSSVACSSSPGFADKSSPKAAIAASDVSNMCRSPSPVSSIH 471

Query: 375  YEG-TSTVEKLDNFSIANLCSPLIQLLQSN-----DSISVDSTALSKLLIYKNQISKVLE 434
             E     +E+LDN S+      L +LL ++     DS SV  T+++ LL +K +I K +E
Sbjct: 472  LERFPINIEELDNISMERFGCLLNELLGTDDSGTGDSSSVQLTSMNTLLAWKGEILKAVE 531

Query: 435  TTESEIDLLENELKGLKSESKGYFSFTLASSSLLVGDKFFEEQNNVANAVATLPVVTSAN 494
             TESEIDLLEN+ + LK E + +      SS    GD    ++     A  +L    +A+
Sbjct: 532  MTESEIDLLENKHRTLKLEGRRHSRVVGPSSYCCDGDANVPKE----QASCSLDPKATAS 591

Query: 495  TISKTMAHS---TSDLEEVYAEKDRSGRLDVKESVMKEKLTIYGCSVKENIAAYIDNSMP 554
            +++KT+  +    + L +V A+       +VK  + +   T+     +E+I         
Sbjct: 592  SVAKTLVRAPVHQAGLAKVPADVFEDSPGEVK-PLSQSFATV---EREEDILPIPSMKAA 651

Query: 555  IKSEGVTVHPVAN-DMYECAEGGDSVS---DL----ILASNKESACKASEALIGLLPTNE 614
            + S+ +     AN +  E +   DS++   DL    +L++NK+ AC++S     LLP + 
Sbjct: 652  VSSKEINTPAFANQETIEVSSADDSMASKEDLFWAKLLSANKKYACESSGVFNQLLPRDF 711

Query: 615  RKIDIWSTNACSQNQ--CLVKERFAKRKRLLRFKERVITLKFKAYQSLWKENLHVPPVRK 674
               D        Q Q    V+E+ A R  LLR +E+++ L+FKA+Q  WK++L    + K
Sbjct: 712  NSSDNSRFPGICQTQFDSHVQEKIADRVGLLRAREKILLLQFKAFQLSWKKDLDQLALAK 771

Query: 675  LRAKSQKKHQLSLWTNYSGYQKNRSSIRYRMPSPAGNLNP-VSSTEILKHVSMQLSTPQI 734
             ++KS KK +L       GY K   S+R R  S A   +  V +TE++ ++   L    +
Sbjct: 772  YQSKSSKKTELYPNAKNGGYLKLPQSVRLRFSSSAPRRDSVVPTTELVSYMEKLLPGTHL 831

Query: 735  KQYRRTLKMPALVLDQKDKMGSRFISNNGLVENPCAVEKERAMINPWTSEEKDVFMEKLE 794
            K +R  LKMPA++LD+K+++ SRFIS+NGL+E+PC VEKER MINPWTSEEK++F+  L 
Sbjct: 832  KPFRDILKMPAMILDEKERVMSRFISSNGLIEDPCDVEKERTMINPWTSEEKEIFLNLLA 891

Query: 795  CFGKDFGKIASFLDHKTTADCVEFYYKNHKSDCFEKTKKLE-FGKKVKSSTSNYLMTTGK 854
              GKDF KIAS L  KTTADC+++YYKNHKSDCF K KK   +GK+ K +   Y++   K
Sbjct: 892  MHGKDFKKIASSLTQKTTADCIDYYYKNHKSDCFGKIKKQRAYGKEGKHT---YMLAPRK 951

Query: 855  KWNPETNAASLDMLGAASTMTARAHKYSSSRS--------GGRTSYHITQFDDGLSERAK 914
            KW  E  AASLD+LG  S + A A K +S+R          G +S +  Q D   SE   
Sbjct: 952  KWKREMGAASLDILGDVSIIAANAGKVASTRPISSKKITLRGCSSANSLQHDGNNSEGCS 1011

Query: 915  GLNGFGNEREKVAADVLAGICGSLSSEAMGSCV-TSNFNRGDSSQNLKC-----KKGVTT 974
                F  +R    ADVLA   G LS E + SC+ TS  +R     +LK      K  ++ 
Sbjct: 1012 YSFDFPRKR-TAGADVLA--VGPLSPEQINSCLRTSVSSRERCMDHLKFNHVVKKPRISH 1071

Query: 975  VLRQRMTTNVPRYVDNE---IFSDESCGEMGPSYWTDGEKSLFIEAVSVYGKNFSVISTH 1034
             L    +  +     NE     S+ESCGE GP +WTD E+S FI+  S++GKNF+ IS +
Sbjct: 1072 TLHNENSNTLHNENSNEEDDSCSEESCGETGPIHWTDDERSAFIQGFSLFGKNFASISRY 1131

Query: 1035 VGSKSTDQCKVFFSKARKCLGLDLICSAKKMPDNGNGH-----DADRSNGEGGVDTKDAF 1094
            VG++S DQCKVFFSK RKCLGL+ I         G+G+       D  N  GG D +D  
Sbjct: 1132 VGTRSPDQCKVFFSKVRKCLGLESI-------KFGSGNVSTSVSVDNGNEGGGSDLEDPC 1191

Query: 1095 PCEMVGSRVVDDLPKAVMSISGGESESMNLQSTHQEVNPSSKTCSNAAVDAMVSDDECTR 1145
            P E   S +V++    V +  G  S +         VN S    +N   D   S++E  +
Sbjct: 1192 PMES-NSGIVNN---GVCAKMGMNSPTSPFNMNQDGVNQSGS--ANVKADLSRSEEENGQ 1251

BLAST of CSPI05G16750 vs. NCBI nr
Match: gi|449452162|ref|XP_004143829.1| (PREDICTED: uncharacterized protein LOC101219573 isoform X1 [Cucumis sativus])

HSP 1 Score: 2695.6 bits (6986), Expect = 0.0e+00
Identity = 1376/1383 (99.49%), Positives = 1381/1383 (99.86%), Query Frame = 1

Query: 1    MPPEPLPWDRKDFFKERKHERSEFLGPLPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQG 60
            MPPEPLPWDRKDFFKERKHERSEFLGPLPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQG
Sbjct: 1    MPPEPLPWDRKDFFKERKHERSEFLGPLPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQG 60

Query: 61   GWHVFSEEYGHGYGPSMSFNNKMLENVSSRPSVSHGDGKYARNGRESRSFSQRDWKGHSW 120
            GWHVFSEEYGHGYGPSMSFNNKMLENVSSRPSVSHGDGKYARNGRESRSFSQRDWKGHSW
Sbjct: 61   GWHVFSEEYGHGYGPSMSFNNKMLENVSSRPSVSHGDGKYARNGRESRSFSQRDWKGHSW 120

Query: 121  ATSNGSTNNGGRMQHDLNYDQRSVHDMLIYPSHSHSDFVNPREKVKGQHDKVDDVNGLGT 180
            ATSNGSTNNGGRMQHDLNYDQRSVHDMLIYPSHSHSDFVNPREKVKGQHDKVDDVNGLGT
Sbjct: 121  ATSNGSTNNGGRMQHDLNYDQRSVHDMLIYPSHSHSDFVNPREKVKGQHDKVDDVNGLGT 180

Query: 181  NQRRDREYSVSSSGWKPLKWTRSGGLSSRTSTSGHSSSKKSIDALDSNDRKSETVSKNAS 240
            NQRRDREYSVSSSGWKPLKWTRSGGLSSRTSTSGHSSSKKSI+ALDSNDRKSETVSKNAS
Sbjct: 181  NQRRDREYSVSSSGWKPLKWTRSGGLSSRTSTSGHSSSKKSIEALDSNDRKSETVSKNAS 240

Query: 241  QNFSPSADHAECAMSSLPYDDASARKKPRLGWGEGLAKYEKKKVEVPDGSTAFTNITAES 300
            QNFSPSADHAECAMSSLPYDDASARKKPRLGWGEGLAKYEKKKVEVPDGSTAFTNITAES
Sbjct: 241  QNFSPSADHAECAMSSLPYDDASARKKPRLGWGEGLAKYEKKKVEVPDGSTAFTNITAES 300

Query: 301  THSLNSSLIEKGPRGSGFADCTSPATPSSVISGSPPGGDEKSFGKASSDNDVSNFHGSPG 360
            THSLNSSLIEKGPRGSGFADCTSPATPSSVISGSPPGGDEKSFGKASSDNDVSNFHGSPG
Sbjct: 301  THSLNSSLIEKGPRGSGFADCTSPATPSSVISGSPPGGDEKSFGKASSDNDVSNFHGSPG 360

Query: 361  SCFQNQYEGTSTVEKLDNFSIANLCSPLIQLLQSNDSISVDSTALSKLLIYKNQISKVLE 420
            SCFQNQYEGTSTVEKLDNFSIANLCSPLIQLLQSNDSISVDSTALSKLLIYKNQISKVLE
Sbjct: 361  SCFQNQYEGTSTVEKLDNFSIANLCSPLIQLLQSNDSISVDSTALSKLLIYKNQISKVLE 420

Query: 421  TTESEIDLLENELKGLKSESKGYFSFTLASSSLLVGDKFFEEQNNVANAVATLPVVTSAN 480
            TTESEIDLLENELKGLKSESKGYFSFTLASSSLLVGDKFFEEQNNVANAVATLPVVTSAN
Sbjct: 421  TTESEIDLLENELKGLKSESKGYFSFTLASSSLLVGDKFFEEQNNVANAVATLPVVTSAN 480

Query: 481  TISKTMAHSTSDLEEVYAEKDRSGRLDVKESVMKEKLTIYGCSVKENIAAYIDNSMPIKS 540
            TISKTMAHSTSDLEEVYA+KDRSGRLDVKESVMKEKLTIYGCSVKENIAAYIDNS+PIKS
Sbjct: 481  TISKTMAHSTSDLEEVYADKDRSGRLDVKESVMKEKLTIYGCSVKENIAAYIDNSVPIKS 540

Query: 541  EGVTVHPVANDMYECAEGGDSVSDLILASNKESACKASEALIGLLPTNERKIDIWSTNAC 600
            EGVTVHPVANDMYECAEGGDSVSDLILASNKESACKASEALIG+LPTNERKIDIWSTNAC
Sbjct: 541  EGVTVHPVANDMYECAEGGDSVSDLILASNKESACKASEALIGMLPTNERKIDIWSTNAC 600

Query: 601  SQNQCLVKERFAKRKRLLRFKERVITLKFKAYQSLWKENLHVPPVRKLRAKSQKKHQLSL 660
            SQNQCLVKERFAKRKRLLRFKERVITLKFKAYQSLWKENLHVPPVRKLRAKSQKKHQLSL
Sbjct: 601  SQNQCLVKERFAKRKRLLRFKERVITLKFKAYQSLWKENLHVPPVRKLRAKSQKKHQLSL 660

Query: 661  WTNYSGYQKNRSSIRYRMPSPAGNLNPVSSTEILKHVSMQLSTPQIKQYRRTLKMPALVL 720
            WTNYSGYQKNRSSIRYRMPSPAGNLNPVSSTEILKHVSMQLSTPQIKQYRRTLKMPALVL
Sbjct: 661  WTNYSGYQKNRSSIRYRMPSPAGNLNPVSSTEILKHVSMQLSTPQIKQYRRTLKMPALVL 720

Query: 721  DQKDKMGSRFISNNGLVENPCAVEKERAMINPWTSEEKDVFMEKLECFGKDFGKIASFLD 780
            DQKDKMGSRFISNNGLVENPCAVEKERAMINPWTSEEKDVFMEKLECFGKDFGKIASFLD
Sbjct: 721  DQKDKMGSRFISNNGLVENPCAVEKERAMINPWTSEEKDVFMEKLECFGKDFGKIASFLD 780

Query: 781  HKTTADCVEFYYKNHKSDCFEKTKKLEFGKKVKSSTSNYLMTTGKKWNPETNAASLDMLG 840
            HKTTADCVEFYYKNHKSDCFEKTKKLEFGKKVKSSTSNYLMTTGKKWNPETNAASLDMLG
Sbjct: 781  HKTTADCVEFYYKNHKSDCFEKTKKLEFGKKVKSSTSNYLMTTGKKWNPETNAASLDMLG 840

Query: 841  AASTMTARAHKYSSSRSGGRTSYHITQFDDGLSERAKGLNGFGNEREKVAADVLAGICGS 900
            AASTMTARAHKYSSSRSGGRTSYHITQFDDGLSERAKGLNGFGNEREKVAADVLAGICGS
Sbjct: 841  AASTMTARAHKYSSSRSGGRTSYHITQFDDGLSERAKGLNGFGNEREKVAADVLAGICGS 900

Query: 901  LSSEAMGSCVTSNFNRGDSSQNLKCKKGVTTVLRQRMTTNVPRYVDNEIFSDESCGEMGP 960
            LSSEAMGSCVTSNFNRGDSSQ+LKCKKGVTTVLRQRMTTNVPRYVDNEIFSDESCGEMGP
Sbjct: 901  LSSEAMGSCVTSNFNRGDSSQDLKCKKGVTTVLRQRMTTNVPRYVDNEIFSDESCGEMGP 960

Query: 961  SYWTDGEKSLFIEAVSVYGKNFSVISTHVGSKSTDQCKVFFSKARKCLGLDLICSAKKMP 1020
            SYWTDGEKSLFIEAVSVYGKNFSVISTHVGSKSTDQCKVFFSKARKCLGLDLICSAKKMP
Sbjct: 961  SYWTDGEKSLFIEAVSVYGKNFSVISTHVGSKSTDQCKVFFSKARKCLGLDLICSAKKMP 1020

Query: 1021 DNGNGHDADRSNGEGGVDTKDAFPCEMVGSRVVDDLPKAVMSISGGESESMNLQSTHQEV 1080
            DNGNGHDADRSNGEGGVDTKDAFPCEMVGSRVVDDLPKAVMSISGGESESMNLQSTHQEV
Sbjct: 1021 DNGNGHDADRSNGEGGVDTKDAFPCEMVGSRVVDDLPKAVMSISGGESESMNLQSTHQEV 1080

Query: 1081 NPSSKTCSNAAVDAMVSDDECTRKDGSQSGFDDDCQSVNSANDKNGLIHEQQHVVISDET 1140
            NPSSKTCSNAAVDAMVSDDECTRKDGSQSGFDDDCQSVNSANDKNGLIHEQQHVVISDET
Sbjct: 1081 NPSSKTCSNAAVDAMVSDDECTRKDGSQSGFDDDCQSVNSANDKNGLIHEQQHVVISDET 1140

Query: 1141 AKEQDISVLVATSVGNVSDTETKRGNVDASTARGDKADSHATDCPSIPSNSHITSSAKEE 1200
            AKEQDISVLVATSVGNVSDTETKRGNVDASTARGDKADSHATDCPSIPSNSHITSSAKEE
Sbjct: 1141 AKEQDISVLVATSVGNVSDTETKRGNVDASTARGDKADSHATDCPSIPSNSHITSSAKEE 1200

Query: 1201 QGRHHVRVHSRSLSDSEQSSRNGDIKLFGQILTHSSFVPSSKSGSSVNGIKTTEPHHKFK 1260
            QGRHHVRVHSRSLSDSEQSSRNGDIKLFGQILTHSSFVPSSKSGSS NGIKTTEPHHKFK
Sbjct: 1201 QGRHHVRVHSRSLSDSEQSSRNGDIKLFGQILTHSSFVPSSKSGSSENGIKTTEPHHKFK 1260

Query: 1261 RRLKVNSHGNLSTAKFNCKNSPGQEENTPSRSYGIWDGNQIRTGLSSLPDPTTLLSRYPT 1320
            RRLKVNSHGNLSTAKFNCKNSPGQEENTPSRSYGIWDGNQIRTGL SLPDPTTLLSRYPT
Sbjct: 1261 RRLKVNSHGNLSTAKFNCKNSPGQEENTPSRSYGIWDGNQIRTGLLSLPDPTTLLSRYPT 1320

Query: 1321 FNHLSKPASSPTEQSPSGCKEETSNSNKETQKREVNNSRKEEVVGEMNVEESCCNEGGGG 1380
            FNHLSKPASSPTEQSPSGCKEETSNSNKETQKREVNNSRKEEVVGEMNVEESCCNEGGGG
Sbjct: 1321 FNHLSKPASSPTEQSPSGCKEETSNSNKETQKREVNNSRKEEVVGEMNVEESCCNEGGGG 1380

Query: 1381 GGS 1384
            GGS
Sbjct: 1381 GGS 1383

BLAST of CSPI05G16750 vs. NCBI nr
Match: gi|778703085|ref|XP_011655309.1| (PREDICTED: uncharacterized protein LOC101219573 isoform X2 [Cucumis sativus])

HSP 1 Score: 2689.1 bits (6969), Expect = 0.0e+00
Identity = 1375/1383 (99.42%), Positives = 1380/1383 (99.78%), Query Frame = 1

Query: 1    MPPEPLPWDRKDFFKERKHERSEFLGPLPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQG 60
            MPPEPLPWDRKDFFKERKHERSEFLGPLPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQG
Sbjct: 1    MPPEPLPWDRKDFFKERKHERSEFLGPLPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQG 60

Query: 61   GWHVFSEEYGHGYGPSMSFNNKMLENVSSRPSVSHGDGKYARNGRESRSFSQRDWKGHSW 120
            GWHVFSEEYGHGYGPSMSFNNKMLENVSSRPSVSHGDGKYARNGRESRSFSQRDWKGHSW
Sbjct: 61   GWHVFSEEYGHGYGPSMSFNNKMLENVSSRPSVSHGDGKYARNGRESRSFSQRDWKGHSW 120

Query: 121  ATSNGSTNNGGRMQHDLNYDQRSVHDMLIYPSHSHSDFVNPREKVKGQHDKVDDVNGLGT 180
            ATSNGSTNNGGRMQHDLNYDQRSVHDMLIYPSHSHSDFVNPREKVKGQHDKVDDVNGLGT
Sbjct: 121  ATSNGSTNNGGRMQHDLNYDQRSVHDMLIYPSHSHSDFVNPREKVKGQHDKVDDVNGLGT 180

Query: 181  NQRRDREYSVSSSGWKPLKWTRSGGLSSRTSTSGHSSSKKSIDALDSNDRKSETVSKNAS 240
            NQRRDREYSVSSSGWKPLKWTRSGGLSSRTSTSGHSSSKKSI+ALDSNDRKSETVSKNAS
Sbjct: 181  NQRRDREYSVSSSGWKPLKWTRSGGLSSRTSTSGHSSSKKSIEALDSNDRKSETVSKNAS 240

Query: 241  QNFSPSADHAECAMSSLPYDDASARKKPRLGWGEGLAKYEKKKVEVPDGSTAFTNITAES 300
            QNFSPSADHAECAMSSLPYDDASARKKPRLGWGEGLAKYEKKKVEVPDGSTAFTNITAES
Sbjct: 241  QNFSPSADHAECAMSSLPYDDASARKKPRLGWGEGLAKYEKKKVEVPDGSTAFTNITAES 300

Query: 301  THSLNSSLIEKGPRGSGFADCTSPATPSSVISGSPPGGDEKSFGKASSDNDVSNFHGSPG 360
            THSLNSSLIEKGPRGSGFADCTSPATPSSVISGSPPGGDEKSFGKASSDNDVSNFHGSPG
Sbjct: 301  THSLNSSLIEKGPRGSGFADCTSPATPSSVISGSPPGGDEKSFGKASSDNDVSNFHGSPG 360

Query: 361  SCFQNQYEGTSTVEKLDNFSIANLCSPLIQLLQSNDSISVDSTALSKLLIYKNQISKVLE 420
            SCFQNQYEGTSTVEKLDNFSIANLCSPLIQLLQSNDSISVDSTALSKLLIYKNQISKVLE
Sbjct: 361  SCFQNQYEGTSTVEKLDNFSIANLCSPLIQLLQSNDSISVDSTALSKLLIYKNQISKVLE 420

Query: 421  TTESEIDLLENELKGLKSESKGYFSFTLASSSLLVGDKFFEEQNNVANAVATLPVVTSAN 480
            TTESEIDLLENELKGLKSESKGYFSFTLASSSLLVGDKFFEEQNNVANAVATLPVVTSAN
Sbjct: 421  TTESEIDLLENELKGLKSESKGYFSFTLASSSLLVGDKFFEEQNNVANAVATLPVVTSAN 480

Query: 481  TISKTMAHSTSDLEEVYAEKDRSGRLDVKESVMKEKLTIYGCSVKENIAAYIDNSMPIKS 540
            TISKTMAHSTSDLEEVYA+KDRSGRLDVKESVMKEKLTIYGCSVKENIAAYIDNS+PIKS
Sbjct: 481  TISKTMAHSTSDLEEVYADKDRSGRLDVKESVMKEKLTIYGCSVKENIAAYIDNSVPIKS 540

Query: 541  EGVTVHPVANDMYECAEGGDSVSDLILASNKESACKASEALIGLLPTNERKIDIWSTNAC 600
            EGVTVHPVANDMYECAEGGDSVSDLILASNKESACKASEALIG+LPTNERKIDIWSTNAC
Sbjct: 541  EGVTVHPVANDMYECAEGGDSVSDLILASNKESACKASEALIGMLPTNERKIDIWSTNAC 600

Query: 601  SQNQCLVKERFAKRKRLLRFKERVITLKFKAYQSLWKENLHVPPVRKLRAKSQKKHQLSL 660
            SQNQCLVKERFAKRKRLLRFKERVITLKFKAYQSLWKENLHVPPVRKLRAKSQKKHQLSL
Sbjct: 601  SQNQCLVKERFAKRKRLLRFKERVITLKFKAYQSLWKENLHVPPVRKLRAKSQKKHQLSL 660

Query: 661  WTNYSGYQKNRSSIRYRMPSPAGNLNPVSSTEILKHVSMQLSTPQIKQYRRTLKMPALVL 720
            WTNYSGYQKNRSSIRYRMPSP GNLNPVSSTEILKHVSMQLSTPQIKQYRRTLKMPALVL
Sbjct: 661  WTNYSGYQKNRSSIRYRMPSP-GNLNPVSSTEILKHVSMQLSTPQIKQYRRTLKMPALVL 720

Query: 721  DQKDKMGSRFISNNGLVENPCAVEKERAMINPWTSEEKDVFMEKLECFGKDFGKIASFLD 780
            DQKDKMGSRFISNNGLVENPCAVEKERAMINPWTSEEKDVFMEKLECFGKDFGKIASFLD
Sbjct: 721  DQKDKMGSRFISNNGLVENPCAVEKERAMINPWTSEEKDVFMEKLECFGKDFGKIASFLD 780

Query: 781  HKTTADCVEFYYKNHKSDCFEKTKKLEFGKKVKSSTSNYLMTTGKKWNPETNAASLDMLG 840
            HKTTADCVEFYYKNHKSDCFEKTKKLEFGKKVKSSTSNYLMTTGKKWNPETNAASLDMLG
Sbjct: 781  HKTTADCVEFYYKNHKSDCFEKTKKLEFGKKVKSSTSNYLMTTGKKWNPETNAASLDMLG 840

Query: 841  AASTMTARAHKYSSSRSGGRTSYHITQFDDGLSERAKGLNGFGNEREKVAADVLAGICGS 900
            AASTMTARAHKYSSSRSGGRTSYHITQFDDGLSERAKGLNGFGNEREKVAADVLAGICGS
Sbjct: 841  AASTMTARAHKYSSSRSGGRTSYHITQFDDGLSERAKGLNGFGNEREKVAADVLAGICGS 900

Query: 901  LSSEAMGSCVTSNFNRGDSSQNLKCKKGVTTVLRQRMTTNVPRYVDNEIFSDESCGEMGP 960
            LSSEAMGSCVTSNFNRGDSSQ+LKCKKGVTTVLRQRMTTNVPRYVDNEIFSDESCGEMGP
Sbjct: 901  LSSEAMGSCVTSNFNRGDSSQDLKCKKGVTTVLRQRMTTNVPRYVDNEIFSDESCGEMGP 960

Query: 961  SYWTDGEKSLFIEAVSVYGKNFSVISTHVGSKSTDQCKVFFSKARKCLGLDLICSAKKMP 1020
            SYWTDGEKSLFIEAVSVYGKNFSVISTHVGSKSTDQCKVFFSKARKCLGLDLICSAKKMP
Sbjct: 961  SYWTDGEKSLFIEAVSVYGKNFSVISTHVGSKSTDQCKVFFSKARKCLGLDLICSAKKMP 1020

Query: 1021 DNGNGHDADRSNGEGGVDTKDAFPCEMVGSRVVDDLPKAVMSISGGESESMNLQSTHQEV 1080
            DNGNGHDADRSNGEGGVDTKDAFPCEMVGSRVVDDLPKAVMSISGGESESMNLQSTHQEV
Sbjct: 1021 DNGNGHDADRSNGEGGVDTKDAFPCEMVGSRVVDDLPKAVMSISGGESESMNLQSTHQEV 1080

Query: 1081 NPSSKTCSNAAVDAMVSDDECTRKDGSQSGFDDDCQSVNSANDKNGLIHEQQHVVISDET 1140
            NPSSKTCSNAAVDAMVSDDECTRKDGSQSGFDDDCQSVNSANDKNGLIHEQQHVVISDET
Sbjct: 1081 NPSSKTCSNAAVDAMVSDDECTRKDGSQSGFDDDCQSVNSANDKNGLIHEQQHVVISDET 1140

Query: 1141 AKEQDISVLVATSVGNVSDTETKRGNVDASTARGDKADSHATDCPSIPSNSHITSSAKEE 1200
            AKEQDISVLVATSVGNVSDTETKRGNVDASTARGDKADSHATDCPSIPSNSHITSSAKEE
Sbjct: 1141 AKEQDISVLVATSVGNVSDTETKRGNVDASTARGDKADSHATDCPSIPSNSHITSSAKEE 1200

Query: 1201 QGRHHVRVHSRSLSDSEQSSRNGDIKLFGQILTHSSFVPSSKSGSSVNGIKTTEPHHKFK 1260
            QGRHHVRVHSRSLSDSEQSSRNGDIKLFGQILTHSSFVPSSKSGSS NGIKTTEPHHKFK
Sbjct: 1201 QGRHHVRVHSRSLSDSEQSSRNGDIKLFGQILTHSSFVPSSKSGSSENGIKTTEPHHKFK 1260

Query: 1261 RRLKVNSHGNLSTAKFNCKNSPGQEENTPSRSYGIWDGNQIRTGLSSLPDPTTLLSRYPT 1320
            RRLKVNSHGNLSTAKFNCKNSPGQEENTPSRSYGIWDGNQIRTGL SLPDPTTLLSRYPT
Sbjct: 1261 RRLKVNSHGNLSTAKFNCKNSPGQEENTPSRSYGIWDGNQIRTGLLSLPDPTTLLSRYPT 1320

Query: 1321 FNHLSKPASSPTEQSPSGCKEETSNSNKETQKREVNNSRKEEVVGEMNVEESCCNEGGGG 1380
            FNHLSKPASSPTEQSPSGCKEETSNSNKETQKREVNNSRKEEVVGEMNVEESCCNEGGGG
Sbjct: 1321 FNHLSKPASSPTEQSPSGCKEETSNSNKETQKREVNNSRKEEVVGEMNVEESCCNEGGGG 1380

Query: 1381 GGS 1384
            GGS
Sbjct: 1381 GGS 1382

BLAST of CSPI05G16750 vs. NCBI nr
Match: gi|659131413|ref|XP_008465673.1| (PREDICTED: uncharacterized protein LOC103503311 isoform X1 [Cucumis melo])

HSP 1 Score: 2536.9 bits (6574), Expect = 0.0e+00
Identity = 1304/1396 (93.41%), Positives = 1339/1396 (95.92%), Query Frame = 1

Query: 1    MPPEPLPWDRKDFFKERKHERSEFLGPLPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQG 60
            MPPEPLPWDRKDFFKERKHERSEFLGPLPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQG
Sbjct: 1    MPPEPLPWDRKDFFKERKHERSEFLGPLPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQG 60

Query: 61   GWHVFSEEYGHGYGPSMSFNNKMLENVSSRPSVSHGDGKYARNGRESRSFSQRDWKGHSW 120
            GWHVFSEEYGHGYGPSMSFNNKMLENVSSRPSVSHGDGKYARNGRESRSFSQRDWKGHSW
Sbjct: 61   GWHVFSEEYGHGYGPSMSFNNKMLENVSSRPSVSHGDGKYARNGRESRSFSQRDWKGHSW 120

Query: 121  ATSNGSTNNGGRMQHDLNYDQRSVHDMLIYPSHSHSDFVNPREKVKGQHDKVDDVNGLGT 180
            ATSNGSTNNGGR+QHDLNYDQRSVHDMLIYPSHSHSDFVNPR+KVKGQHDKVDDVNGLGT
Sbjct: 121  ATSNGSTNNGGRIQHDLNYDQRSVHDMLIYPSHSHSDFVNPRDKVKGQHDKVDDVNGLGT 180

Query: 181  NQRRDREYSVSSSGWKPLKWTRSGGLSSRTSTSGHSSSKKSIDALDSNDRKSETVSKNAS 240
            NQRRDREYSVSSSGWKPLKWTRSGGLSSRTSTSGHSSSKKS+DALDSNDRKSETVSKNAS
Sbjct: 181  NQRRDREYSVSSSGWKPLKWTRSGGLSSRTSTSGHSSSKKSVDALDSNDRKSETVSKNAS 240

Query: 241  QNFSPSADHAECAMSSLPYDDASARKKPRLGWGEGLAKYEKKKVEVPDGSTAFTNITAES 300
            QNFSPSADHAECAMSSLPYDDASARKKPRLGWGEGLAKYEKKKVEVPDGSTAFTN+ AES
Sbjct: 241  QNFSPSADHAECAMSSLPYDDASARKKPRLGWGEGLAKYEKKKVEVPDGSTAFTNVNAES 300

Query: 301  THSLNSSLIEKGPRGSGFADCTSPATPSSVISGSPPGGDEKSFGKASSDNDVSNFHGSPG 360
            THSLNS LIEKGPRGSGFADCTSPATPSSVISGS PGGDEKSFGKASSDNDVSNFHGSPG
Sbjct: 301  THSLNSCLIEKGPRGSGFADCTSPATPSSVISGSSPGGDEKSFGKASSDNDVSNFHGSPG 360

Query: 361  SCFQNQYEGTSTVEKLDNFSIANLCSPLIQLLQSNDSISVDSTALSKLLIYKNQISKVLE 420
            S FQNQYEGTSTVEKLDNFSIANLCSPLIQLLQSNDS SVDSTALSKLLIYKNQISKVLE
Sbjct: 361  SGFQNQYEGTSTVEKLDNFSIANLCSPLIQLLQSNDSTSVDSTALSKLLIYKNQISKVLE 420

Query: 421  TTESEIDLLENELKGLKSESKGYFSFTLASSSLLVGDKFFEEQNNVANAVATLPVVTSAN 480
            TTESEIDLLENELKGLKSE KGYFSFTLASS  LVGDKFFEEQNNV N VATLPVVTSA+
Sbjct: 421  TTESEIDLLENELKGLKSEGKGYFSFTLASSP-LVGDKFFEEQNNVTNTVATLPVVTSAH 480

Query: 481  TISKTMAHSTSDLEEVYAEKDRSGRLDVKESVMKEKLTIYGCSVKENIAAYIDNSMPIKS 540
            TISKT+AHST+DLEEVYA+KDRSGR DVKESVMKE LT+ GCS K++I AYIDNS+PIKS
Sbjct: 481  TISKTLAHSTNDLEEVYADKDRSGRSDVKESVMKENLTVSGCSAKDHIVAYIDNSLPIKS 540

Query: 541  EGVTVHPVANDMYECAEGGDSVSDLILASNKESACKASEALIGLLPTNERKIDIWSTNAC 600
            EGVTVHPVAND YECAEGGDSVSDLILASNKESACKASEAL+ +LPTNE KIDIWSTNAC
Sbjct: 541  EGVTVHPVANDTYECAEGGDSVSDLILASNKESACKASEALMRMLPTNECKIDIWSTNAC 600

Query: 601  SQNQCLVKERFAKRKRLLRFKERVITLKFKAYQSLWKENLHVPPVRKLRAKSQKKHQLSL 660
            +QNQCLVKERFAKRKRLLRFKERVITLKFKAYQSLWKENLHVPPVRKLRAKSQKKHQLSL
Sbjct: 601  AQNQCLVKERFAKRKRLLRFKERVITLKFKAYQSLWKENLHVPPVRKLRAKSQKKHQLSL 660

Query: 661  WTNYSGYQKNRSSIRYRMPSPAGNLNPVSSTEILKHVSMQLSTPQIKQYRRTLKMPALVL 720
            WTNYSGYQKNRSSIRYRMPSPAGNLNPVSSTEILKHVSMQLS+PQIKQYRRTLKMP LVL
Sbjct: 661  WTNYSGYQKNRSSIRYRMPSPAGNLNPVSSTEILKHVSMQLSSPQIKQYRRTLKMPTLVL 720

Query: 721  DQKDKMGSRFISNNGLVENPCAVEKERAMINPWTSEEKDVFMEKLECFGKDFGKIASFLD 780
            DQKDKMGSRFISNNGLVENPCAVEKERAMINPWTSEEKDVFMEKLECFGKDFGKIASFLD
Sbjct: 721  DQKDKMGSRFISNNGLVENPCAVEKERAMINPWTSEEKDVFMEKLECFGKDFGKIASFLD 780

Query: 781  HKTTADCVEFYYKNHKSDCFEKTKKLEFGKKVKSSTSNYLMTTGKKWNPETNAASLDMLG 840
            HKTTADCVEFYYKNHKSDCFEKTKKLEFGKKVKSSTSNYLMTTGKKWNPETNAASLD+LG
Sbjct: 781  HKTTADCVEFYYKNHKSDCFEKTKKLEFGKKVKSSTSNYLMTTGKKWNPETNAASLDILG 840

Query: 841  AASTMTARAHKYSSSRSGGRTSYHITQFDDGLSERAKGLNGFGNEREKVAADVLAGICGS 900
            AASTMTARAHKYSS RSGGRTSYH TQFDD LSERAKGLN FGNEREKVAADVLAGICGS
Sbjct: 841  AASTMTARAHKYSSGRSGGRTSYHTTQFDDDLSERAKGLNSFGNEREKVAADVLAGICGS 900

Query: 901  LSSEAMGSCVTSNFNRGDSSQNLKCKKGVTTVLRQRMTTNVPRYVDNEIFSDESCGEMGP 960
            LSSEAMGSCVTSNFNRGDSSQ+LKCKKG TTVLR+RMTTNVPRYVDNEIFSDESCGEMGP
Sbjct: 901  LSSEAMGSCVTSNFNRGDSSQDLKCKKGATTVLRRRMTTNVPRYVDNEIFSDESCGEMGP 960

Query: 961  SYWTDGEKSLFIEAVSVYGKNFSVISTHVGSKSTDQCKVFFSKARKCLGLDLICSAKKMP 1020
            SYWTDGEKSLFIEAVSVYGKNFSVISTHVGSKSTDQCKVFFSKARKCLGLDLICSAKKMP
Sbjct: 961  SYWTDGEKSLFIEAVSVYGKNFSVISTHVGSKSTDQCKVFFSKARKCLGLDLICSAKKMP 1020

Query: 1021 DNGNGHDADRSNGEGGVDTKDAFPCEMVGSRVVDDLPKAVMSISGGESESMNLQSTHQEV 1080
            DNGNGHDAD  NGEGGVDTKDAFPCE+VGSRVVDDLPK+VMSISGGESESMNLQSTHQEV
Sbjct: 1021 DNGNGHDADGGNGEGGVDTKDAFPCELVGSRVVDDLPKSVMSISGGESESMNLQSTHQEV 1080

Query: 1081 ---NPSSKTCSNAAVDAMVSDDECTRKDGSQSGFDDDCQSVNSANDKNGLIHEQQHVVIS 1140
               N SSKTCSNAAVDAMVSDDECTRKDGSQSGFD+DCQSVNSANDKNGL++EQQH V+S
Sbjct: 1081 KESNLSSKTCSNAAVDAMVSDDECTRKDGSQSGFDEDCQSVNSANDKNGLVNEQQHAVMS 1140

Query: 1141 DETAKEQDISVLVATSVGNVSDTETKRGNVDASTARGDKADSHATDCPSIPSNSHITSSA 1200
            +ETAKEQDISV VATSV NVSDTETKRGNVDASTARGDKADSHA DCPS+P NSHITSSA
Sbjct: 1141 NETAKEQDISVSVATSVENVSDTETKRGNVDASTARGDKADSHAADCPSMPLNSHITSSA 1200

Query: 1201 KEEQGRHHVRVHSRSLSDSEQSSRNGDIKLFGQILTHSSFVPSSKSGSSVNGIKTTEPHH 1260
            KEEQGRHH+RVHSRSLSDSE+SSRNGDIKLFGQILTHSSFVPSSKSGSS NGI+TTEPHH
Sbjct: 1201 KEEQGRHHIRVHSRSLSDSERSSRNGDIKLFGQILTHSSFVPSSKSGSSENGIRTTEPHH 1260

Query: 1261 KFKRRLKVNSHGNLSTAKFNCKNSPGQEENTPSRSYGIWDGNQIRTGLSSLPDPTTLLSR 1320
            KFKRRLKVNSHGNLSTAKF+CKNSPGQEE+TPSRSYGIWDGNQIRTGLSSLPDPTTLL+R
Sbjct: 1261 KFKRRLKVNSHGNLSTAKFDCKNSPGQEESTPSRSYGIWDGNQIRTGLSSLPDPTTLLTR 1320

Query: 1321 YPTFNHLSKPASSPTE-QSPSGCKEETSNSNKETQKREVNNSRKEEVVGEMNVEESC--- 1380
            YPTFNHLSKPA SP E QS S CKEE SNSN+ETQK EVNNSRKEEVVG MNV ESC   
Sbjct: 1321 YPTFNHLSKPAFSPIEQQSLSSCKEEKSNSNEETQKMEVNNSRKEEVVGGMNVGESCNDG 1380

Query: 1381 ------CNEGGGGGGS 1384
                  C+  GGG GS
Sbjct: 1381 CDIKLDCSNKGGGSGS 1395
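
The identity and positives percentages in an HSP statistics line follow directly from the residue counts divided by the alignment length (for the hit above, 1304 and 1339 out of 1396 aligned columns). The following is a minimal Python sketch of that arithmetic, not part of the original report. The function name `hsp_percentages` and the regular expression are illustrative assumptions, and the pattern is written only for the line layout shown on this page.

```python
import re

# Pattern for the HSP statistics line as laid out in this report.
# "Posi?tives" also tolerates the "Postives" misspelling that appears
# in some renderings of these reports.
HSP_STATS = re.compile(
    r"Identity = (?P<ident>\d+)/(?P<length>\d+) \([\d.]+%\), "
    r"Posi?tives = (?P<pos>\d+)/\d+"
)


def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    length = int(m["length"])  # aligned columns, gaps included
    ident_pct = round(100 * int(m["ident"]) / length, 2)
    pos_pct = round(100 * int(m["pos"]) / length, 2)
    return ident_pct, pos_pct


line = "Identity = 1304/1396 (93.41%), Positives = 1339/1396 (95.92%), Query Frame = 1"
print(hsp_percentages(line))
```

Recomputing the values this way is a quick consistency check when report lines are copied by hand: the denominator is the alignment length including gap columns, not the length of either protein.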

BLAST of CSPI05G16750 vs. NCBI nr
Match: gi|659131415|ref|XP_008465674.1| (PREDICTED: uncharacterized protein LOC103503311 isoform X2 [Cucumis melo])

HSP 1 Score: 2530.4 bits (6557), Expect = 0.0e+00
Identity = 1303/1396 (93.34%), Positives = 1338/1396 (95.85%), Query Frame = 1

Query: 1    MPPEPLPWDRKDFFKERKHERSEFLGPLPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQG 60
            MPPEPLPWDRKDFFKERKHERSEFLGPLPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQG
Sbjct: 1    MPPEPLPWDRKDFFKERKHERSEFLGPLPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQG 60

Query: 61   GWHVFSEEYGHGYGPSMSFNNKMLENVSSRPSVSHGDGKYARNGRESRSFSQRDWKGHSW 120
            GWHVFSEEYGHGYGPSMSFNNKMLENVSSRPSVSHGDGKYARNGRESRSFSQRDWKGHSW
Sbjct: 61   GWHVFSEEYGHGYGPSMSFNNKMLENVSSRPSVSHGDGKYARNGRESRSFSQRDWKGHSW 120

Query: 121  ATSNGSTNNGGRMQHDLNYDQRSVHDMLIYPSHSHSDFVNPREKVKGQHDKVDDVNGLGT 180
            ATSNGSTNNGGR+QHDLNYDQRSVHDMLIYPSHSHSDFVNPR+KVKGQHDKVDDVNGLGT
Sbjct: 121  ATSNGSTNNGGRIQHDLNYDQRSVHDMLIYPSHSHSDFVNPRDKVKGQHDKVDDVNGLGT 180

Query: 181  NQRRDREYSVSSSGWKPLKWTRSGGLSSRTSTSGHSSSKKSIDALDSNDRKSETVSKNAS 240
            NQRRDREYSVSSSGWKPLKWTRSGGLSSRTSTSGHSSSKKS+DALDSNDRKSETVSKNAS
Sbjct: 181  NQRRDREYSVSSSGWKPLKWTRSGGLSSRTSTSGHSSSKKSVDALDSNDRKSETVSKNAS 240

Query: 241  QNFSPSADHAECAMSSLPYDDASARKKPRLGWGEGLAKYEKKKVEVPDGSTAFTNITAES 300
            QNFSPSADHAECAMSSLPYDDASARKKPRLGWGEGLAKYEKKKVEVPDGSTAFTN+ AES
Sbjct: 241  QNFSPSADHAECAMSSLPYDDASARKKPRLGWGEGLAKYEKKKVEVPDGSTAFTNVNAES 300

Query: 301  THSLNSSLIEKGPRGSGFADCTSPATPSSVISGSPPGGDEKSFGKASSDNDVSNFHGSPG 360
            THSLNS LIEKGPRGSGFADCTSPATPSSVISGS PGGDEKSFGKASSDNDVSNFHGSPG
Sbjct: 301  THSLNSCLIEKGPRGSGFADCTSPATPSSVISGSSPGGDEKSFGKASSDNDVSNFHGSPG 360

Query: 361  SCFQNQYEGTSTVEKLDNFSIANLCSPLIQLLQSNDSISVDSTALSKLLIYKNQISKVLE 420
            S FQNQYEGTSTVEKLDNFSIANLCSPLIQLLQSNDS SVDSTALSKLLIYKNQISKVLE
Sbjct: 361  SGFQNQYEGTSTVEKLDNFSIANLCSPLIQLLQSNDSTSVDSTALSKLLIYKNQISKVLE 420

Query: 421  TTESEIDLLENELKGLKSESKGYFSFTLASSSLLVGDKFFEEQNNVANAVATLPVVTSAN 480
            TTESEIDLLENELKGLKSE KGYFSFTLASS  LVGDKFFEEQNNV N VATLPVVTSA+
Sbjct: 421  TTESEIDLLENELKGLKSEGKGYFSFTLASSP-LVGDKFFEEQNNVTNTVATLPVVTSAH 480

Query: 481  TISKTMAHSTSDLEEVYAEKDRSGRLDVKESVMKEKLTIYGCSVKENIAAYIDNSMPIKS 540
            TISKT+AHST+DLEEVYA+KDRSGR DVKESVMKE LT+ GCS K++I AYIDNS+PIKS
Sbjct: 481  TISKTLAHSTNDLEEVYADKDRSGRSDVKESVMKENLTVSGCSAKDHIVAYIDNSLPIKS 540

Query: 541  EGVTVHPVANDMYECAEGGDSVSDLILASNKESACKASEALIGLLPTNERKIDIWSTNAC 600
            EGVTVHPVAND YECAEGGDSVSDLILASNKESACKASEAL+ +LPTNE KIDIWSTNAC
Sbjct: 541  EGVTVHPVANDTYECAEGGDSVSDLILASNKESACKASEALMRMLPTNECKIDIWSTNAC 600

Query: 601  SQNQCLVKERFAKRKRLLRFKERVITLKFKAYQSLWKENLHVPPVRKLRAKSQKKHQLSL 660
            +QNQCLVKERFAKRKRLLRFKERVITLKFKAYQSLWKENLHVPPVRKLRAKSQKKHQLSL
Sbjct: 601  AQNQCLVKERFAKRKRLLRFKERVITLKFKAYQSLWKENLHVPPVRKLRAKSQKKHQLSL 660

Query: 661  WTNYSGYQKNRSSIRYRMPSPAGNLNPVSSTEILKHVSMQLSTPQIKQYRRTLKMPALVL 720
            WTNYSGYQKNRSSIRYRMPSP GNLNPVSSTEILKHVSMQLS+PQIKQYRRTLKMP LVL
Sbjct: 661  WTNYSGYQKNRSSIRYRMPSP-GNLNPVSSTEILKHVSMQLSSPQIKQYRRTLKMPTLVL 720

Query: 721  DQKDKMGSRFISNNGLVENPCAVEKERAMINPWTSEEKDVFMEKLECFGKDFGKIASFLD 780
            DQKDKMGSRFISNNGLVENPCAVEKERAMINPWTSEEKDVFMEKLECFGKDFGKIASFLD
Sbjct: 721  DQKDKMGSRFISNNGLVENPCAVEKERAMINPWTSEEKDVFMEKLECFGKDFGKIASFLD 780

Query: 781  HKTTADCVEFYYKNHKSDCFEKTKKLEFGKKVKSSTSNYLMTTGKKWNPETNAASLDMLG 840
            HKTTADCVEFYYKNHKSDCFEKTKKLEFGKKVKSSTSNYLMTTGKKWNPETNAASLD+LG
Sbjct: 781  HKTTADCVEFYYKNHKSDCFEKTKKLEFGKKVKSSTSNYLMTTGKKWNPETNAASLDILG 840

Query: 841  AASTMTARAHKYSSSRSGGRTSYHITQFDDGLSERAKGLNGFGNEREKVAADVLAGICGS 900
            AASTMTARAHKYSS RSGGRTSYH TQFDD LSERAKGLN FGNEREKVAADVLAGICGS
Sbjct: 841  AASTMTARAHKYSSGRSGGRTSYHTTQFDDDLSERAKGLNSFGNEREKVAADVLAGICGS 900

Query: 901  LSSEAMGSCVTSNFNRGDSSQNLKCKKGVTTVLRQRMTTNVPRYVDNEIFSDESCGEMGP 960
            LSSEAMGSCVTSNFNRGDSSQ+LKCKKG TTVLR+RMTTNVPRYVDNEIFSDESCGEMGP
Sbjct: 901  LSSEAMGSCVTSNFNRGDSSQDLKCKKGATTVLRRRMTTNVPRYVDNEIFSDESCGEMGP 960

Query: 961  SYWTDGEKSLFIEAVSVYGKNFSVISTHVGSKSTDQCKVFFSKARKCLGLDLICSAKKMP 1020
            SYWTDGEKSLFIEAVSVYGKNFSVISTHVGSKSTDQCKVFFSKARKCLGLDLICSAKKMP
Sbjct: 961  SYWTDGEKSLFIEAVSVYGKNFSVISTHVGSKSTDQCKVFFSKARKCLGLDLICSAKKMP 1020

Query: 1021 DNGNGHDADRSNGEGGVDTKDAFPCEMVGSRVVDDLPKAVMSISGGESESMNLQSTHQEV 1080
            DNGNGHDAD  NGEGGVDTKDAFPCE+VGSRVVDDLPK+VMSISGGESESMNLQSTHQEV
Sbjct: 1021 DNGNGHDADGGNGEGGVDTKDAFPCELVGSRVVDDLPKSVMSISGGESESMNLQSTHQEV 1080

Query: 1081 ---NPSSKTCSNAAVDAMVSDDECTRKDGSQSGFDDDCQSVNSANDKNGLIHEQQHVVIS 1140
               N SSKTCSNAAVDAMVSDDECTRKDGSQSGFD+DCQSVNSANDKNGL++EQQH V+S
Sbjct: 1081 KESNLSSKTCSNAAVDAMVSDDECTRKDGSQSGFDEDCQSVNSANDKNGLVNEQQHAVMS 1140

Query: 1141 DETAKEQDISVLVATSVGNVSDTETKRGNVDASTARGDKADSHATDCPSIPSNSHITSSA 1200
            +ETAKEQDISV VATSV NVSDTETKRGNVDASTARGDKADSHA DCPS+P NSHITSSA
Sbjct: 1141 NETAKEQDISVSVATSVENVSDTETKRGNVDASTARGDKADSHAADCPSMPLNSHITSSA 1200

Query: 1201 KEEQGRHHVRVHSRSLSDSEQSSRNGDIKLFGQILTHSSFVPSSKSGSSVNGIKTTEPHH 1260
            KEEQGRHH+RVHSRSLSDSE+SSRNGDIKLFGQILTHSSFVPSSKSGSS NGI+TTEPHH
Sbjct: 1201 KEEQGRHHIRVHSRSLSDSERSSRNGDIKLFGQILTHSSFVPSSKSGSSENGIRTTEPHH 1260

Query: 1261 KFKRRLKVNSHGNLSTAKFNCKNSPGQEENTPSRSYGIWDGNQIRTGLSSLPDPTTLLSR 1320
            KFKRRLKVNSHGNLSTAKF+CKNSPGQEE+TPSRSYGIWDGNQIRTGLSSLPDPTTLL+R
Sbjct: 1261 KFKRRLKVNSHGNLSTAKFDCKNSPGQEESTPSRSYGIWDGNQIRTGLSSLPDPTTLLTR 1320

Query: 1321 YPTFNHLSKPASSPTE-QSPSGCKEETSNSNKETQKREVNNSRKEEVVGEMNVEESC--- 1380
            YPTFNHLSKPA SP E QS S CKEE SNSN+ETQK EVNNSRKEEVVG MNV ESC   
Sbjct: 1321 YPTFNHLSKPAFSPIEQQSLSSCKEEKSNSNEETQKMEVNNSRKEEVVGGMNVGESCNDG 1380

Query: 1381 ------CNEGGGGGGS 1384
                  C+  GGG GS
Sbjct: 1381 CDIKLDCSNKGGGSGS 1394

BLAST of CSPI05G16750 vs. NCBI nr
Match: gi|590564860|ref|XP_007009785.1| (Duplicated homeodomain-like superfamily protein isoform 1 [Theobroma cacao])

HSP 1 Score: 963.0 bits (2488), Expect = 5.6e-277
Identity = 604/1190 (50.76%), Positives = 776/1190 (65.21%), Query Frame = 1

Query: 1    MPPEPLPWDRKDFFKERKHERSEFLGPLP---RWRDSSS-----HGS-REFSRWGSGDFR 60
            MPPEPLPWDRKDF+KERKHER+E     P   RWRDSSS     HGS REF+RWGS D R
Sbjct: 1    MPPEPLPWDRKDFYKERKHERTESQPQQPSTARWRDSSSMSSYQHGSFREFTRWGSADLR 60

Query: 61   RPPGHGRQGGWHVFSEEYG-HGYGPSMSFNNKMLENVSSRPSVSHGDGKYARNG-RESR- 120
            RPPGHG+QG WH+F+EE G HGY PS S  +KML++ S R SVS GDGKY+RN  RE+  
Sbjct: 61   RPPGHGKQGSWHLFAEENGGHGYVPSRS-GDKMLDDESCRQSVSRGDGKYSRNSSRENNR 120

Query: 121  -SFSQRDWKGHSWATSNGSTNNGGRMQHDLNYDQRSVHDMLIYPSHSHSDFVNPREKV-K 180
             S+SQRDW+ HSW  SNGS N  GR  HD+N +QRSV DML YPSH+HSDFV+  +++ K
Sbjct: 121  ASYSQRDWRAHSWEMSNGSPNTPGR-PHDVNNEQRSVDDMLTYPSHAHSDFVSTWDQLHK 180

Query: 181  GQHD-KVDDVNGLGTNQRRDREYSVSSSGWKPLKWTRSGGLSSRTSTSGHSSSKKSIDAL 240
             QHD K   VNGLGT QR +RE SV S  WKPLKW+RSG LSSR S   HSSS KS+  +
Sbjct: 181  DQHDNKTSGVNGLGTGQRCERENSVGSMDWKPLKWSRSGSLSSRGSGFSHSSSSKSLGGV 240

Query: 241  DSNDRKSETVSKNASQNFSPSADHAECAMSSLPYDDASARKKPRLGWGEGLAKYEKKKVE 300
            DS + K E   KN +   SPS D A C  S+ P D+  +RKKPRLGWGEGLAKYEKKKVE
Sbjct: 241  DSGEGKLELQQKNLTPVQSPSGDAAACVTSAAPSDETMSRKKPRLGWGEGLAKYEKKKVE 300

Query: 301  VPD-----GSTAFTNITAESTHSLNSSLIEKGPRGSGFADCTSPATPSSVISGSPPGGDE 360
             PD     G    +    E  +SL S+L EK PR  GF+DC SPATPSSV   S PG +E
Sbjct: 301  GPDTSMNRGVATISVGNTEPNNSLGSNLAEKSPRVLGFSDCASPATPSSVACSSSPGVEE 360

Query: 361  KSFGKASS-DNDVSNFHGSPGSCFQNQYEGTS-TVEKLDNFSIANLCSPLIQLLQSNDSI 420
            KSFGKA++ DND+SN  GSP    QN  EG S  +EKLD  SI N+ S L+ LLQS+D  
Sbjct: 361  KSFGKAANIDNDISNLCGSPSLGSQNHLEGPSFNLEKLDMNSIINMGSSLVDLLQSDDPS 420

Query: 421  SVD-----STALSKLLIYKNQISKVLETTESEIDLLENELKGLKSESKGYFSFTLASSSL 480
            +VD     STA++KLL++K  + K LETTESEID LENELK LK+ S   +     SSSL
Sbjct: 421  TVDSSFVRSTAMNKLLLWKGDVLKALETTESEIDSLENELKTLKANSGSRYPCPATSSSL 480

Query: 481  LVGD--KFFEEQNNVANAV---ATLPVVTSANTISKTMAHSTSDLEEVYAEKDRSGRLD- 540
             + +  +  EE   ++N +   A L +    + + + +     DLEEV A+  + G +D 
Sbjct: 481  PMEENGRACEELEAISNMIPRPAPLKIDPCGDALEEKVPLCNGDLEEVNADA-KDGDIDS 540

Query: 541  -------------VKESVMKEKLTIYGCS-----------VKENIAAYIDN---SMPIKS 600
                         ++++V    + ++ CS            + N+A    N   S+P   
Sbjct: 541  PGTATSKFVEPSSLEKAVSPSDVKLHECSGDLGTVQLTTMGEVNLAPGSSNEGTSVPFSG 600

Query: 601  EGVTVHPVANDMYECAEGGDSVSDL-------ILASNKESACKASEALIGLLPTNE-RKI 660
            EG  +  + ND++   E  +SV+D+       I+A+NKE A  AS+    LLP +    I
Sbjct: 601  EGSALEKIDNDVHG-PEPSNSVADIENIMYDVIIATNKELANSASKVFNNLLPKDWCSVI 660

Query: 661  DIWSTNACSQNQCLVKERFAKRKRLLRFKERVITLKFKAYQSLWKENLHVPPVRKLRAKS 720
               +  AC Q   L++E+  KRK+ +RFKERV+ LKFKA+Q  WKE++  P +RK RAKS
Sbjct: 661  SEIANGACWQTDSLIREKIVKRKQCIRFKERVLMLKFKAFQHAWKEDMRSPLIRKYRAKS 720

Query: 721  QKKHQLSLWTNYSGYQKNRSSIRYRMPSPAGNLNPVSSTEILKHVSMQLSTPQIKQYRRT 780
            QKK++LSL +   GYQK+RSSIR R+ SPAGNL+  S+ E++  VS  LS   ++ YR  
Sbjct: 721  QKKYELSLRSTLGGYQKHRSSIRSRLTSPAGNLSLESNVEMINFVSKLLSDSHVRLYRNA 780

Query: 781  LKMPALVLDQKDKMGSRFISNNGLVENPCAVEKERAMINPWTSEEKDVFMEKLECFGKDF 840
            LKMPAL LD+K+K  SRFIS+NGLVE+PCAVEKERA+INPWTSEEK++FM+KL  FGKDF
Sbjct: 781  LKMPALFLDEKEKQVSRFISSNGLVEDPCAVEKERALINPWTSEEKEIFMDKLAAFGKDF 840

Query: 841  GKIASFLDHKTTADCVEFYYKNHKSDCFEKT-KKLEFGKKVKSSTSNYLMTTGKKWNPET 900
             KIASFLDHKTTADCVEFYYKNHKS+CFEKT KKL+  K+ KS+ + YL+T+GKKW+ E 
Sbjct: 841  RKIASFLDHKTTADCVEFYYKNHKSECFEKTKKKLDLSKQGKSTANTYLLTSGKKWSREL 900

Query: 901  NAASLDMLGAASTMTARAHKYSSSRS--------GGRTSYHITQFDDGLSERAKGLNGFG 960
            NAASLD+LG AS + A A     +R         GGR     ++ DD + ER+   +  G
Sbjct: 901  NAASLDVLGEASVIAAHAESGMRNRQTSAGRIFLGGRFDSKTSRVDDSIVERSSSFDVIG 960

Query: 961  NEREKVAADVLAGICGSLSSEAMGSCVTSNFNRGDSSQ-NLKCKKGVTTVLRQRMTTNVP 1020
            N+RE VAADVLAGICGSLSSEAM SC+TS+ + G+S Q   KC+K V +V+++  T++V 
Sbjct: 961  NDRETVAADVLAGICGSLSSEAMSSCITSSADPGESYQREWKCQK-VDSVVKRPSTSDVT 1020

Query: 1021 RYVDNEIFSDESCGEMGPSYWTDGEKSLFIEAVSVYGKNFSVISTHVGSKSTDQCKVFFS 1080
            + +D++  SDESCGEM P+ WTD EKS+FI+AVS+YGK+F++IS  VG++S DQCKVFFS
Sbjct: 1021 QNIDDDTCSDESCGEMDPADWTDEEKSVFIQAVSLYGKDFAMISRCVGTRSRDQCKVFFS 1080

Query: 1081 KARKCLGLDLICSAKKMPDNGNGHDADRSNGEGGVDTKDAFPCE-------MVGSRVVDD 1099
            KARKCLGLDLI    +   N     +D +NG GG D +DA   E        +GS+V +D
Sbjct: 1081 KARKCLGLDLIHPRTR---NLGTPMSDDANG-GGSDIEDACVLESSVVCSDKLGSKVEED 1140

The following BLAST results are available for this feature:
Match Name    E-value    Identity    Description
NCOR2_MOUSE   5.9e-16    25.48       Nuclear receptor corepressor 2 OS=Mus musculus GN=Ncor2 PE=1 SV=3
NCOR2_HUMAN   1.3e-15    22.95       Nuclear receptor corepressor 2 OS=Homo sapiens GN=NCOR2 PE=1 SV=2
NCOR1_XENTR   4.7e-13    25.28       Nuclear receptor corepressor 1 OS=Xenopus tropicalis GN=ncor1 PE=2 SV=1
NCOR1_MOUSE   6.7e-12    23.51       Nuclear receptor corepressor 1 OS=Mus musculus GN=Ncor1 PE=1 SV=1
NCOR1_HUMAN   6.7e-12    23.51       Nuclear receptor corepressor 1 OS=Homo sapiens GN=NCOR1 PE=1 SV=2
Match Name          E-value     Identity    Description
A0A0A0KU04_CUCSA    0.0e+00     99.49       Uncharacterized protein OS=Cucumis sativus GN=Csa_5G492330 PE=4 SV=1
A0A061FMP2_THECC    3.9e-277    50.76       Duplicated homeodomain-like superfamily protein isoform 1 OS=Theobroma cacao GN=...
A0A061FNE6_THECC    2.8e-275    50.67       Duplicated homeodomain-like superfamily protein isoform 2 OS=Theobroma cacao GN=...
F6HNI1_VITVI        3.9e-269    48.91       Putative uncharacterized protein OS=Vitis vinifera GN=VIT_13s0019g04010 PE=4 SV=...
A5AZS6_VITVI        3.2e-263    47.90       Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_026902 PE=4 SV=1
Match Name     E-value     Identity    Description
AT3G52250.1    1.5e-126    36.00       Duplicated homeodomain-like superfamily protein
Match Name                          E-value     Identity    Description
gi|449452162|ref|XP_004143829.1|    0.0e+00     99.49       PREDICTED: uncharacterized protein LOC101219573 isoform X1 [Cucumis sativus]
gi|778703085|ref|XP_011655309.1|    0.0e+00     99.42       PREDICTED: uncharacterized protein LOC101219573 isoform X2 [Cucumis sativus]
gi|659131413|ref|XP_008465673.1|    0.0e+00     93.41       PREDICTED: uncharacterized protein LOC103503311 isoform X1 [Cucumis melo]
gi|659131415|ref|XP_008465674.1|    0.0e+00     93.34       PREDICTED: uncharacterized protein LOC103503311 isoform X2 [Cucumis melo]
gi|590564860|ref|XP_007009785.1|    5.6e-277    50.76       Duplicated homeodomain-like superfamily protein isoform 1 [Theobroma cacao]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term          Definition
IPR001005     SANT/Myb
IPR009057     Homeobox-like_sf
IPR017884     SANT_dom
Vocabulary: Molecular Function
Term          Definition
GO:0003677    DNA binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005634 nucleus
cellular_component GO:0005575 cellular_component
molecular_function GO:0003677 DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CSPI05G16750.1    CSPI05G16750.1    mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term     IPR Description     Source     Source Term         Source Description                           Alignment
IPR001005    SANT/Myb domain     PFAM       PF00249             Myb_DNA-binding                              coord: 962..1002, score: 1.
IPR001005    SANT/Myb domain     SMART      SM00717             sant                                         coord: 749..797, score: 3.0E-5; coord: 959..1007, score: 1.
IPR009057    Homeodomain-like    GENE3D     G3DSA:1.10.10.60    -                                            coord: 752..793, score: 2.6E-4; coord: 963..1003, score: 7.
IPR009057    Homeodomain-like    unknown    SSF46689            Homeodomain-like                             coord: 736..796, score: 9.72E-14; coord: 960..1009, score: 7.02
IPR017884    SANT domain         PROFILE    PS51293             SANT                                         coord: 958..1009, score: 12.41; coord: 748..799, score: 15
None         No IPR available    unknown    Coil                Coil                                         coord: 412..439, scor
None         No IPR available    PANTHER    PTHR13992           NUCLEAR RECEPTOR CO-REPRESSOR RELATED NCOR   coord: 602..1013, score: 4.2E