CSPI04G02590 (gene) Wild cucumber (PI 183967)

Name: CSPI04G02590
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: ZZ-type zinc finger-containing protein 3, putative isoform 1
Location: Chr4 : 1544883 .. 1558597 (-)
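The span of the feature can be worked out from the coordinates in the location field. The following is a minimal Python sketch, not part of the database record: `parse_location` is a hypothetical helper written for this illustration, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(text):
    """Split a 'Chr : start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Chr4 : 1544883 .. 1558597 (-)")
# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the feature covers 13,715 bases on the minus strand of Chr4.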
The following sequences are available for this feature:

Gene sequence (with intron)

GAGAAAGGAAAAAGGAGGAAGAAAAAGAGAGAAAAAGAGAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAGAAGGAAAAAGGAACTTAAAAAGAAAAAAAAAAGGGGGAAAAAAAGGAAGAAATGTAATGTTGCTTTGCTTACGAATATGTTGACACTTTCCGACGTCAAACCTCCAAACCCTCTCTCCAATTGCTGTATACACAGACATCGGTCTCGTTTCTCTCTCTATTTTTCTGCAACTTTCTTTAGTTTATTTTCTTCTTTACCCGAGCTCCCTTCAGTCTTAATCCGTACAAATCTTCCACACTCATTTTCCCTCCTTTTTCCTTCTTTTTTTTTTCTGTTTCTTCATTGATTGACCCATCTATCGTCAATTTTCATTTTTGAGCTTTTGTTTTAGTCATTTTTCTTATGGGTTCATTGATTTTTGTTGAAATAATCTGAATTGGCGGGAGCCAACGGTCAATTATGTGGTTTTAGGAGGTGGGTTTTGATTGGATTTGGATTTGGGATTGGAAAAACGTTTGAATTGTTGTGAGTTTTGATGGCTAACCCATCTGGGAACCATCAAGAAGCTGGCCAACCGTCGTCTTCCTTCGATGGAGGGAACCCCAGCAACGGTAATTCGACACCTGTGCCTGCAGCGGATAATCCGAGTTCGGCTCTTGCTATGAAGCATAACCCGGGTATCTCTACGGATTGGACATCTGATGAGCAGGTCACACTGGAAGAAGGGCTTAAGAAGTGAGTTTTTTCCCCTTTTTGTTTTTGATGTATGGTTTTTAGCTTCTTGCGTTTATGGAAGTTGCTTGAATTAAGCGATTTGGTTGATTTTTTTTCCTTCTTCCATGGAGGAAATTGATTTTGTACTTCTTGTTCAAGCTCTTTTCCCTTTAATTTCCAGCAATTTTCATTTTTAAATGGTTAGATTACGAGATTGGTTCCTCGGAGTTGTGTTTTACAAGGCCCTGAATGTGTCTAGTAAGTCCACAAGGCTTTTGGGGTTTGTACTTGACGAACAATTTTTACACTTTTTCTCTCCGTCTCTTAAAATTCCTCTTTTGAACATTCCCTAAGCATACCGGCTATATTTATGGACCGAGCGGGGATTTTTATCCAGCTGTGAAAATATATGAGATTATTTATAGCTAACGATTTATGTGTGAATGTGGAATTTGACCTTTTGAAGTCTCAGGTCGGAACTAATAGCATCTCGCTGTTCTTTTCCATTATTTGCCTCTTATCTGAATTTCACCACATCGGCATCTGATAATAGTCATGTCTGTTGCTTTTTTTTAATCAATGTTTTCTTAGTTGATTTTATTTCTTCAATTTCTCTTGGCTCAGTTTTCAACCGCACAAAGATGTTCAAGTTTCTATTGAGGGATGGTCTTTGTTTTTAATGTGAAGGCCAAAGTTGTGTGATTTAAAACTAAGTTTTACGTTACAGGAAAGTAAGGAATCCATATGCTTAGGAAAAAGCTTGGATGCTATCATAAAGAGTTTTATTTGACTTATGATTGTAAACTGGATTCAGCCTTATAGGGATATCTAAGGTGTTTTTGAAAGCTAAAAGCTGCAACCGCTTCAGATTTGATATGTGCAGAAACAATACCATTTACTGTTTTCTCTTAAATGAGCCATAATGTGATTGGTTTGGAAGTTAAAAAATATCTATGCCAACGTTTCAAAGTCATTTTTTTTTTAATCTTTAATTTTACGAAAACACGCCTCTTCATTAAAAATGAAATGAGACTAATGCTCAAAGTACAAAAGAATTATATAAAGGACATAAAAAACTAAGGATTAGAGGCGCACTTAGACATCTCAAATGACACCCCCTAGTGTCCACATCATTTCTGAACAAACGAACAAAGACTAAATAACATCTCAAGCTAAGGAAGAACTACAACAAAATGATACATCATAAGACCAAAATAAAACAAGCAGTACAAAACCCTTGACTATGTGAGGACAAATGCAAAAGAGACTCAAAG
CAAAATAACTTGATGAGCTCTGATTCATAGGTGCTACACTCCTTTTTTGTGTCCTTTACCCAAGTGTTTATGACGTAGATTGTTCTAAAGAAAGATCAGTGACGGTGGTGTAGGATCCTAGCTGCTTAAATCGAAAACTTGAGAAGAAACTTAAAGTATCGGAAATTTGATGAATGGATGCTTTTCATTGAAATATTGGTACAAGTCTACTGATCCTCTTGAATTTATGTTAATTTGGAACTTGGAAAAGATATCTCTACACTAGTTAAAGTTGCAAGTCGGTCCTTTATCATAAACAATAGTGGAAGGTTGGAGAAGGAGCTAGAAAATCAGATTTGGAGATTAGTTCACAAAAAGAAACCAACCAGTAGGTAAAACTAACTAACTGTCAAATAACTAATGACTAACAATTAATTTCAACTCCTACATCCAAAAGTGAAGGTTCTTTTATTCTCTCCCTCCCTCATAGAAAAATATAAACATACAAGACAAGATACTAGGAAGAAATCCCCATTTTTCTCTTCTCCTGCAAATGTGTATGAACTGCTTGCATAAGGAATTTACCCGAGCCTCTTGTTGGAAATGACTTCCTACTTGAGAACTCTGTTTATATCTAGATCTAGGGTTGGCAAGGAAATGTTAGGTAAACTATGTGGGAAATTAGTCCAGACAATTATTCTTGCACCTTAAACCACTATGAGTTGACCTAATGGTCAATCCTATAACTATTTTATAGATGTTATTTTCATAGTTGGAGTCTCTTTTCACAAAGGGCCCCCCCTTTTTGTGGGATTATTTTTCTTTGCATGCCCTTGTATTTTTTCATATTTTCTATCGATTCCATTTAGAAAAGAGCCATAAATATAATAAAGGAGTTAGATGGAATGAGTTCAAGGCATGATGGCCCTCCACTTAGGATTTAATATCTTGAAATGTAGCATGGTTAGGTGGTTGTCTCATGATAATAGGCATGGTTTGTGCAAGTTTCCTCAACTCGCACAAATATAAAGAGAAAGATCTTGCCTTTGTAGATCTAGTGTGGCCAAAACAAGCACAAGAAGATTCTAACTTAAACGTTCTACCATTTAATTGGTAAAATGAGTTCTACTTGAATTTCGTATATCTCTCTTGTACAATGAGCTTTTGTCTCATTATATCTTTATTAATAAAGAGACACATATCCTTTTCAAAAAAAAAAAGGTTCTACTTGAATATTTTGTGCTCTTGGCTCCTGGAATTATGAGACCATGCTTCCTATGAATCCTCGTATGACTTTGAGGATCCCTAAGATGTTTTTCATTTGTTAAAAGCCATTCTCAAAGTTGCTGTCCAAAAGGATTCTGTTGATGTGTATCTGGAGAGTGTGTCGTGAAGATAAGTCCACCTCATTTCTCGACTCTGTAACCTTGCATGAATTATGGCTTTGATCAAATCCTCCGCAATTGAGAGACCTTGGTGCTTTAGTTATTTTGGGAGAGGGAATTCCTGCCTCTCTAGGCTGTTTTATTTTGTGATTAACATTTATTCCTGTTTCTTGTTAGAAAAAAAGAAAAAGGAAAAATGAAGAACAGTTAAAGTTTCTAATAATTTAATGAACTTCTAAGTTCTAACTGCTAGGAAATTGTTTCAGACACCAGACTGTGAAGATGAAGCTGCCTCTTGGCTAAAACTCTATCTTGGAGGCTTCTTTTCTTTTATTTTTCCATCTTCCTTTCTTCCTCTCTTCATTTTCTAATGTTTCTTAAAGGGGACAAAAGGTCATACTTGAACAGTTTTTTTTTTTAAGATAGGTCATACTTGAACAGTTGAACTGCTGCCTTCTGTGCATTGTTTTATTATCATTACTTTCCTATTGTCCTTTTAACCTAACTAACCTTCTTATCATATTTTTGGTGGTCCAATTTTTTAGTGAATGATGTTAGTTTGGTTTATACACAGATATGCCGCAGAGTCTAGTGTTATTCGGTATGCAAAGATTGCAATGCAACTACCAAATAAGA
CTGTACGAGATGTTGCTTTGCGTTGCAGATGGATGAACGTGAGTTATTTTCCTCAATTCTTGCTTCTATTATCGTATAAATTTTTGATAAGTTTCCTGCTACTTAACTTGCAATCATGCATACCATTAGCATGGTAATTGGACAGAAATACTATTAAGGGGTGTCTGGTTAGCCATGGGATGTGTTTTGGTACGATTTGTAATACGAAACTCATTCCATTCTGATAGTTTTTAATATTAACTCTATTTCTTACTGTTTTTGCGCTATACGATCTCATTTCCGTTTCGTCTTCGATTACTTGTGCTTTCCAACACTCCTTTGATTCCAATGATTCTGATTACATTCCAATCTCCAAACACCTCCTTCGTAAAGAAATATCCATATAAGAGTGGCACAATTTTGTGATTGATGCTCCATGAAGTTCTTGTTTCCATTTCATTATATTAACGAAGAGGCTCGTTTCATTTTCAAAACAAAAGAAAAGTAGATTTAAGTGCATTTATACATACCTACATATGTACATATGTATATATATTTGTATTCTAATGCATGCCTTTAAATGGTCTGAATGTCAATGCATTTTGTAATCGTGATTGTTCTGTTTCTATAACCCAAAGGAAAAGAATAATTTTTCTTCCTGATACTAGCAGTTAGCAAGTGCTTACCCATTAAAACATTTTTTTAAATCACTTTTTTCATGTGGTTTATTAGTCTGTATTTGGATATGGCATAAAGGCACTCTATCATCGTTTATGGACATTTTTTTACTTTTATATCATCGTTTTCAGGTTTTAATTTTATTTATGCACTTAATGTTCCTCTTTGGTTTACTTTTTTTTCTTTGGGGGTCAGAAAAAGGAAAATAGCAAGAGAAGGAAGGAAGAACACAATTTAACAAGAAAGAACAAAGATAAAAAGGTATTAAAGGTATTGTGTAAATTATTGGTGCATTTTATGGTCTCCCTGCTTCTCTTTGGTTCACTTCTTAGTTTTCTTTTTATTGTTCAATTGTTTATTTGTTACTTGCAGCATATGCATCAACTTTAGACATTCATTTTGTAAACAACTGACCATTTTGGAAACTTTAATTGAATGGTTTTTCTTCTTCTCCATTATTGTTTTGTTTGCATAAACATTAGGTGAGACTTATATTTTCCTTTTGGTAATGAAAGTTAACTTTAAAAAAAAAGAAAAAGAAAAAGAAACAAACATTGGGTCAGACTTGCAATTTTCCAATTAAAGCTTTAAATTTCATAAGCAAATCAATATGGGCTCTCAAGATGCTTTTTTTGGAAAGTCGTCCCATCCATGCATCAATTTTGATTGCCCATTTTGTTAACTTGACATTTGAAGTCCACACGTGTGTAGGAATTGACGAAATGGCGTTTTTCGAAGGTAATATTGATAAAGGGTTTAAATTGATTCACCTACAACAATTCGGAGTTTAATTGAAATAATTTTAAATCTCACACTGTATTCTCCAAACAAAATTTGAAGTAGAGTGTAAATTGGTACAATAAATAGTAAGTTGTATGGAAAGTTGTCCATTTCTTCCATGGAAAGTCTTGACATAATTTTTGGCATGGTCTTAGGACTGGGGAATGATATTTTTCTTTATTTCAGGAAAGAGTATCTGACTCTTCAATGAAGTCAGCACAGGTTGCAGCAAGGCCTAACGTGCCTCCTTATGGAATGCCTATGATTCCTATGGACAATGATGATGGTGTCTCATATAAAGGTTTTGCTTCTACATTCTTCGTTTACACCATTACGTGTTTTGGGCTGTTACAGCTTTCATTCTTCATTCTCTTGATGTGAAAAATGCGGGAGTAAATTTAATGATTCGCTATTAATATACATTAGGTAACGTTCATAAAAATAAAATATCGGACATGCAGGTAATAAATTGTAGGGTGTATTCAACTGCCATGCTTGTCTCCAGTTTAAACCATCCGAGATGCTTTTTTTTCCTCTAAAATTTGTACCATGTCAATATGTTA
TGAAGGTATAAATGCTTATCATAATGTTTGCTTGTCTTTTGAAGGCCCTGTATTATTCTAGGATTAGGAACCAATCAGTTTAGCACAACCCAGTGGAAGACATAAAATGATCAAAGACCATGAGGTGCCAAGGGGCCTGATGGATTATTCTCAGGCTTTGAGGAGGGGTGGTGGAAAAAGATGGGTTGAGGAAGATAAAAAAGTTGGAAAGGTGAGGAAATGGTTTCGGACTGGGATTGCAAAATTGGTTTTGATTGAGAAATGACGGTTACTGTGGAAGTACTCAGGAGCAGGTTGAGTTGGGATTTCTTTATGAATTTGATAGAATAGAAAGTCAAATTTCAGTTTGTTTTTAGGCCTTTTTGTGCGAATCTTACAGTGTGCATTTATGGAAGAAAGGCTGATTTGCAGGATAAGTAGAGGTTCGGAGCTTGAGCAAGAAGATTATCAAGTGCAGCCATGGAGTTTCCAATTGGTGGGAAGGAAGAAGGATAGAGTTCACTGGGGGCTGGGTTACAGTTTTAAATGTGCCTCCTTTTCTTAGATTAATTGAGTTTTTCTTCTCATTAGCAGATTGGTGTAGAGTGCTGGAGGAGCGTGAAAGGCTGGGATGGGAAGAGTTATGGCTGGATGAAGTTTGCTTTAGGGCTTGGGGAAACTCTGATGGCTTCACTTTATTTTGATAGTCTGCTTTGTTGGAGGATTAAGATTCCGAGAAAATGAAATTCTTTGCCTAACAAATTCTACATGGAAGAGCTAACACTATGGATCCACTTGTAAGGAAGATGCCCTCATAGCTGGCCCGTTTTATTGTATTCTCTGTTCGAAGGCATAGGAAGTGTTGACCACATTCTTTGGACATACAAGTTTGTGACGTGGCTGCAAAATCATTTCTTGGTTTTTGCTTGCTTGGGGGAGGACACTAGTGATATGATTTGGAAGTTCCTCCTCCATCTGCCCTTTCACTAAAAGGGCCTATTTGGGGTGTGTGATTTATGGGATCTTTGGGATGAGTGGAACAACAAAGTTGTTTAGAGGGTTGGAGAGGGATCCTAATAATTTTTTATCTTTCATTAATTTCACGTGTCTTTATTTCTTCCATTTGGAAGATCATTTGCAATTATTCTTTAGGCACCATTTTGTCTTTTCTAAAATTCTTGCATTCTTTCATTTTTTAATGTAAGTGGTTATTAGTGTGTGTGCCTGCACTTCAATTTCAAAAATCTTCTATAATTATTCTATAGGCACCATTTCATGTTTCTTTATGCGCTTCCGTTGAAGATCGTTTGCAATAATTATATGGGCACCATTTTGTTTAGCTAGAGCCCCTTTCTAGTTGAGGCTGGCACACTGCCTTTTGTGGGTTTTGTTTTTTGTATGCCGTGTATTCTTTCATTTTTTCTCAACGATAGTTGTGTGTGTATATATATCTTTTCTTTTGAAACTACTGTATTTTGTGCATAACATTTGCATTTGGTCTTTGTGATGCTTTCAGCTATTGGTGGTACAACTGGAGAGCTTCTTGAACAGAATGCACATGCAATGAATCAAATTTCTTCTAATCTTGCATCTTTTCAGGTGAAGTACTTGGTTGTCTGGATATTGTTTTAGCATTTTTAAGTTAAAGCAAAATTTTTTAAGTTAAAGCAAAAATTTAGTCCCACCATACCTTACGTAGTGTTAATTGTGGATTTGTATCTATTTCTTGATGATACTGAGTAATTTTTCTTTCAGCCTTCTCTGATATATTAACTTCTGTACTGGACAGCAAGATTTGCACTTTCTGTTGGTTATGAATCACTCTATAGACTTCTTTTTAACTTTTACTTCCATTGTGTAATTGCTGTGAAACTCTTTTGTATTTCTCAACCCCCCAAAAGATTTGCTTAGCAGTTAGGAATGAAACTCTTGCTGGTCCCTTTGATTTGTTGGCAGACTCCTCATGTCCCCTGTGTTCCCTTGGTTTCAGTAAAAAAATGTTTTCTTATTCAAAAATGCTC
GATGGTGAAATTATGTGGCCTACTGATGCTTATTATAATCTGTGAGCGAGCACTGATTCTCTTCTTCTTTGCTATTTGCCCCCTCCCTCTTTAAATCACTTAATTGCATTTTACCAAGCCTTTCATGTCTGTGTTTCTATGTATATATATGTAACATGATATACAACTGTCCATTCAAACACATTCGTTCTAAAACTAAAAACATGAAATATTTCATTATTTAACAATGGCAATTGAGGTGTACTGTTTTTTTTATGATTTCTTGCCAGGCAACCTAATGTTCTTTTATAAGAGTGGACAACATAATTTTGAACTAATAACATTTTGTTTTATGTAGATACAAGATAATATCAGTCTCTTCTGCCAAACGCGGGACAACATCCTCAAAATAATGAACGAGTAAGTAACATTATGTCAATCTACTTTTTTCCTCATTTTTCCTCCTTATTGTAAAAGCTGCGCTTCCCTCAGTTTTCTCAAACATTCTTACTTTGTTAACGTCCATTTAATAGGAAAGTTCATTTATAATATATGATATTTGTTTGAGTAGACCATTGTGGGTTGACCTAGTAGTAAATAAAGAGCATGAGTTTAATAAAGGCTTAGAGGGAATGAGTTCAACTGTACCTAGGATTTAATATCTTAGGAGTTTTCATGATACCCAAATATTGTAGTGTTAGGCGGGGTTGTCTCATAAGATTAGTCGAGGTGCGTGAAAGCTGGTTTGGATGCTCACGGATATTGAAAAAGGGTTTGTTCTAGTGAAGTTGGAGTATCCTTTTGGTCCTAACCAATGGATGTTTAAAATAACATTAGCTTAGACTGGTTTTTTGCTTACTTTCCTCCCTTATTTGTAAGGGTCTCCTTAGGTTTTCCAAATATTCTTACTTGTCAATGTACCTTTAGTAACATTGAAAACATTTTAAATGCGTGTCGAAAGACAATTGGATCAAAGAACGCTCAATGTCATACAAACACGTGGACACATTGCACAAATAATCCATGCTGTTAATATACATTTAGTAGGAGACTTCATTTTTAATTCTCCTAGCTGAACTTTAACTTGGTGAACTTTTACTTTCAAGATAGCCATGTTATGGTGTTGAAAAGTTGATTTGATTCCAGCATCTTATATAGGTCTTTACTTAGGAGATAATCCTATGTTCTTCCCTGATGGGAACCTAACTCACAGATAACAAATAAATAGCCAAATGTAGGAATTGTTGTTCCAAGGCAGCACTCACTCGCTTATTTCCTATTAGTTTTTAATGACACTGCTTACTTGCACAAGCATATTTTAAATATGTGAAGAAGCCAATAAAATGGGTGACTTCAGCCTTGAACTAAAATATTGTTCCTCTCCCTATGCTTAAGGGGTCTTCACTTCCACAGGTCATTGGAATGAAATAAGGGCCTTCTAGCCAAATGGTTTTGGTGCTTGATTGTAAACTTGGGGCTCTTAGGAAATAGATGGATTTGATTTGGTTCATGAGATGAGAACATGTGGCTTTCTAGTTTTTGCAAACTTAAGATCTTGAGAAGCTCTTCGTCTAAGATTATTAATGAAGCTTGAAGATTTGTTCCTTAATTTCTCTTCCATGAAGACTAGAGAGGTTAGTTATCGGCCTTTTTTCAGCTTTGATGATCAAAGTGGGGTGGCAGTAGAAAATTCATAGTATACACAGGTACATCTTAATGTTATTTTTGTCTGGAGCAGAACTTTCCCCAAGTCTAGCTGAAACATCCCTGCTTGGGATCAATGTGAGTATGGTGAGGAGGTCAATATTGGGCTGCAGATTTCCTATCTGGCTAACAATCGCCCATAATTTTCTAAGCTTCTCGTCAGGTGGAAACTATAGTAGATGATAATTCTAAGAGCCACTTACCATAAATCGAGGACCAAGTTGGATGTTTGGAAATGCGTTCTTTCTAAAGGAGATAGGGTGTCCCTTGCCCAATCCATCCTTAATGTTTTACTGCAAATCCGTTTGAATGTAGTA
GTTGCCTAAAAAATGCAAGGTTTCTGTCCACTCACTATTTCTGATATATTTCAACAGAAAGTAGAGAATACAAGCTTCCCAACCATTGACAGAGTGCTTAAGCATAGGATTTCTCATTAGGTAAGATCTAATATTAGAAGGTTTTGTGGCAAAAGTTTTTCCTCCTACCTCAGTCTACAATTATGGAGTATACTCCCAATCTGGAAACATTCTGCTCCATTTTCTAAATCGAAGGAGGTTCTGTGGGCCTGCATTCTGGATTTCGTAGGTGGTTGGGGTGGTTTGGTAGCAACCAAAGATCTTGCAGATTTAATTAATAATGGAATTAATTAGTCATGACGTTCTCATATTATTTTCTGCCTACCAGATTTTGTATTTTGGATTGATGTGTACTGATTGAGTCACAGGACCTGTGCTGTCATTAACTTTGCCTTTACCACAATTGAACTATACGTAATTTTTTCAATTTAATCGATAGGACAATCACTCCAAGATTGGTTGAGTGGGCAAAGTAACTTGGACTGATCTTGATAGGAGAGTGATTTCTTCCGCTATGTCTGGGCTATGAGATGCCATGAAGTTTGTATATTGCATATAACCAACCATTACACACCCGTTTAGATTGTTGAGATAATATTAAGTTTACTTGTCATTTCACTCGTTCCCCTTTCAAAGAAGGTAAGTCTCAAAAGAAATTCCAATGAAGCAAATCATGACACTGGAAGAAATTTCAATAAAGCACCAATGGTTATGCTCATTACCAGAAAATTTCAGTGAGACTTGATGTCACATACAATTCCCATGAAGCAACGTAGCAACACAATAATCAAACCTATTATGTTCAAATATGGAGCAGGTGCTGTCTAGAAGTGCTTCAGACCCAGCTGATCAGGGTTATTATTATCAGTTATTGATCAAGATTCAATTTGGTAATACCAGAAAACACTGGCTATATAAGGCTTCAACAAGGGTGATTTTCAACTGTTGAGAGAAGGTTTCAAACTTCAATGGAAGGAAATCCGGATGAAGAAAACAGAAAAAGGAAATTCAGCTAAGAGATTTCAGCCATGCTCTGATACTACGTTGCAGTGCACATCATGTAAGAGAACAATCTACGTCTTCAAAGGGGAGTTCTTTTAGAATCAGTTTCTGTTTGTTTTGGAAAATTTATTCGAGTTACTCTAGGTTTTGGATGACATTAAGAAGACAGTTTTGAGGCAGCCGTTGCTTCTCTTTGGCGCTTCTCTTTCTCCTCTCATCTTGAAGATTCGTACAAAGTTTTTATATTTGAATGGGCTCCTCTTGAGAGAGTTTTCAAGTTTTGCTTTTTATTTAGCTTTCTACGAATGTTTCGTCTTCGATAGGCTTTATTTTGTTTGATACTTCTCCGTTTGATTTCGTTTTTTAGGTTGCCTCACTTTTTGTCGCCTAAGTTTTCACTTTAGACTGTATTCTTCCTTAGTATAATACTCTTGATTTTAAACTTAAGTCTCATTTACTATCATTAATAAAGAGGCTGGTTTTCGTTAAAAGAAAAAAATAGAACAATCTACGTCTTTATCTTGTATAGCATGTTTACAGCCACAGTAATCCAAAAAGTGCAGGTATTGAATACTTGCTTTTCCCCCTTTAATCAAACTCACTGAAAATACTTAAAAAAATATAACCTAAAAAATAGATCCCAAGGAGAAAGCCTGTATGAGTCCCTCAGTGGAGGCATCAAGCCATGTAGAACAACAGTAGAACCAATCCGATTGAATAGCCTGACAAATTACCCTGAGGGAAGTAAAGTAGCACTATGGACATAAAGGTTATGCTCCTGCCAAAGATGATGTATATCAACACGTCAAGCGACTATCCAAATTTTTCGCTGTTGGTCTCCTGCACTAAAACGACTAATTCAATCAAGCTTTATGCACCACTATGCGAACTTGAGGGAGCAACCACCCCAAAAGAGCACATCCACCCCAAATAGCTTAGCTAAAAGAGCAACCAAAGAA
TAAATTGTTTCGAGACTCAACTCCACTGCTACAAGGACACAATCACTAGTGACATTAAAATCTTTTGACGAAAATAATCATGACCACCAAACCAAAGGAGACCAGATCAAGGTATCAAAAGATGCCCCAGAAGTCAGACTCAAGGATGACCTGAAGACTCTATAGCAACATAGAAGCCAAGTCAATCGATCATCTCATTCAACCTCTGCCACTATAGAATGTTCCTGCCGTGAAAGTTCAACCAACTCTTTAGAGCTAGAAGGCCAAGACCAAGACACATTTGGAAAGAGAAACTGTCAGAGTAAATAAAGGGGAAAATAGTGAAAGAAAGCATTTGCTCTACACATAGACAAACAATAGTATTTGCAGAATTTACAAAGAACACTTGCAAGCCCAAGATAGTCTTGGGCTAGAAAAATATCTTCTTTGTACAAAATTGAGTGGAGATTCTTTTGGAAATTTAGAAGTCTACACTTCTTGAAGTTTGACCTTCAAAGTTTTCTGGTTCAGTTAAGCTTGATGACTCAAGATGAGTGTCAGAAATCAATTAGTGGCTTTTTTATTATATAAATATAATTATAAACAATAAATAAATTGTGTGTATGTGTGTGTATATATTTTGGGGGGTGATAATAGTTAGAGTATTTATTGTTCAAAGAGGAAATTAATAGAAGTGTTGATTATACGTTGGTTGAGACCAGGATTGTCCAATGGGGATGAAATTTTCCTTTCTTAAAATATTGTTTGTCTGATAAGGCTACAAGTTAATATCAAGAATCAACTAATTTTACCTTCTTAACTTTTGATTTTGAAGCCAGGATTGAAATTAGGAGTTTCTTCCAAAATCTTTTACAAGTTAACAGTTTGTTTTTGGTTTCATAAATTAGTTTCGAGGTGTAGGTTGCTCAAATTGTACTAGCTCTTGGTAAAATCGACCTAGTTGCAGGAATTCTTCCCCGTAGTAGCAGTCAGGATTTTTAGCATTAAAACCCGAAAGTTTTCATTTACTCATGTCTTGAGAAGGACATCCACTGCAAATTTGTATGATTTATCCTTGTGAATTGATGAGAGTTTTCATTGAAGCAGAATTTTCCAAACAAAATTGCTCTAACTTATCTTCAATCTACATGCTTGTAGCTTAAATGAAATGCCGGAAGTAATGAAGCAGATGCCACCACTTCCGGTGAAGGTGAACGAAGAGTTAGCGAACACGATCCTTCCGCCGACTTCTCATTCCTTGCAATCATGAAAAATCTTCCACAAAAGCTGGTTCAGCAAAAACAGACTCTCCCTTTCAATTCACTTCCACATGCGAGGAAAGGAAAGGAAAGTATATTTTCTCAACTGAACCTTACCTTCTCAATTCAGAATTTTCTGCTTATCATCACAAATTGTTTAGTCTTTCGGTCCACAAAGTTGTTCCCCTTTAATTATTATTATTATTTTTTGTTTCTTTAGCATATATGGATTTATACATTAGCTTCAAGAGGTTAGATTGTAACTTAGATGCTGTGTAATAATTAGGAATTGTTTTCTTTCTAATGGTCTGGGAAATGAAAAGTAATAAACAATGGTTGATTATTTGTATTCCATGTTTTCCCTCTCTCCTTGTCATCGTCGTCTTCGCATCGAAGTTTCTTCATACATGATCACCGTATGAAGGTGTTGCGATTTTCCTGACAACTGCTTGACCTTTCTTCGTACCTCATCACCAC

mRNA sequence

ATGGCTAACCCATCTGGGAACCATCAAGAAGCTGGCCAACCGTCGTCTTCCTTCGATGGAGGGAACCCCAGCAACGGTAATTCGACACCTGTGCCTGCAGCGGATAATCCGAGTTCGGCTCTTGCTATGAAGCATAACCCGGGTATCTCTACGGATTGGACATCTGATGAGCAGGTCACACTGGAAGAAGGGCTTAAGAAATATGCCGCAGAGTCTAGTGTTATTCGGTATGCAAAGATTGCAATGCAACTACCAAATAAGACTGTACGAGATGTTGCTTTGCGTTGCAGATGGATGAACAAAAAGGAAAATAGCAAGAGAAGGAAGGAAGAACACAATTTAACAAGAAAGAACAAAGATAAAAAGGAAAGAGTATCTGACTCTTCAATGAAGTCAGCACAGGTTGCAGCAAGGCCTAACGTGCCTCCTTATGGAATGCCTATGATTCCTATGGACAATGATGATGGTGTCTCATATAAAGCTATTGGTGGTACAACTGGAGAGCTTCTTGAACAGAATGCACATGCAATGAATCAAATTTCTTCTAATCTTGCATCTTTTCAGATACAAGATAATATCAGTCTCTTCTGCCAAACGCGGGACAACATCCTCAAAATAATGAACGACTTAAATGAAATGCCGGAAGTAATGAAGCAGATGCCACCACTTCCGGTGAAGGTGAACGAAGAGTTAGCGAACACGATCCTTCCGCCGACTTCTCATTCCTTGCAATCATGA

Coding sequence (CDS)

ATGGCTAACCCATCTGGGAACCATCAAGAAGCTGGCCAACCGTCGTCTTCCTTCGATGGAGGGAACCCCAGCAACGGTAATTCGACACCTGTGCCTGCAGCGGATAATCCGAGTTCGGCTCTTGCTATGAAGCATAACCCGGGTATCTCTACGGATTGGACATCTGATGAGCAGGTCACACTGGAAGAAGGGCTTAAGAAATATGCCGCAGAGTCTAGTGTTATTCGGTATGCAAAGATTGCAATGCAACTACCAAATAAGACTGTACGAGATGTTGCTTTGCGTTGCAGATGGATGAACAAAAAGGAAAATAGCAAGAGAAGGAAGGAAGAACACAATTTAACAAGAAAGAACAAAGATAAAAAGGAAAGAGTATCTGACTCTTCAATGAAGTCAGCACAGGTTGCAGCAAGGCCTAACGTGCCTCCTTATGGAATGCCTATGATTCCTATGGACAATGATGATGGTGTCTCATATAAAGCTATTGGTGGTACAACTGGAGAGCTTCTTGAACAGAATGCACATGCAATGAATCAAATTTCTTCTAATCTTGCATCTTTTCAGATACAAGATAATATCAGTCTCTTCTGCCAAACGCGGGACAACATCCTCAAAATAATGAACGACTTAAATGAAATGCCGGAAGTAATGAAGCAGATGCCACCACTTCCGGTGAAGGTGAACGAAGAGTTAGCGAACACGATCCTTCCGCCGACTTCTCATTCCTTGCAATCATGA
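
The coding sequence above corresponds to the protein used as the BLAST query. As a hedged illustration, the sketch below translates the first 30 and last 18 bases of the CDS with the standard genetic code. The `translate` function and the compact codon-table construction are written for this example and are assumptions, not the tooling the database used.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = dict(zip(
    (a + b + c for a in BASES for b in BASES for c in BASES), AMINO))

def translate(dna):
    """Translate an in-frame DNA string codon by codon ('*' marks a stop)."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

cds_start = "ATGGCTAACCCATCTGGGAACCATCAAGAA"  # first 30 bases of the CDS
cds_end = "CATTCCTTGCAATCATGA"                # last 18 bases of the CDS
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MANPSGNHQE` and `HSLQS*`, which agree with the start and end of the query protein in the alignments that follow.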
BLAST of CSPI04G02590 vs. TrEMBL
Match: A0A0A0KX84_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G010420 PE=4 SV=1)

HSP 1 Score: 469.2 bits (1206), Expect = 3.2e-129
Identity = 244/245 (99.59%), Positives = 244/245 (99.59%), Query Frame = 1

Query: 1   MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNPSSALAMKHNPGISTDWTSDEQVT 60
           MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADN SSALAMKHNPGISTDWTSDEQVT
Sbjct: 1   MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNSSSALAMKHNPGISTDWTSDEQVT 60

Query: 61  LEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMNKKENSKRRKEEHNLTRKNKD 120
           LEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMNKKENSKRRKEEHNLTRKNKD
Sbjct: 61  LEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMNKKENSKRRKEEHNLTRKNKD 120

Query: 121 KKERVSDSSMKSAQVAARPNVPPYGMPMIPMDNDDGVSYKAIGGTTGELLEQNAHAMNQI 180
           KKERVSDSSMKSAQVAARPNVPPYGMPMIPMDNDDGVSYKAIGGTTGELLEQNAHAMNQI
Sbjct: 121 KKERVSDSSMKSAQVAARPNVPPYGMPMIPMDNDDGVSYKAIGGTTGELLEQNAHAMNQI 180

Query: 181 SSNLASFQIQDNISLFCQTRDNILKIMNDLNEMPEVMKQMPPLPVKVNEELANTILPPTS 240
           SSNLASFQIQDNISLFCQTRDNILKIMNDLNEMPEVMKQMPPLPVKVNEELANTILPPTS
Sbjct: 181 SSNLASFQIQDNISLFCQTRDNILKIMNDLNEMPEVMKQMPPLPVKVNEELANTILPPTS 240

Query: 241 HSLQS 246
           HSLQS
Sbjct: 241 HSLQS 245
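
The percentages on the HSP statistics line are the ratio of matching positions to alignment length. The sketch below recomputes them from the fractions as a sanity check; `hsp_fractions` is a hypothetical helper for this illustration, and the regular expression only assumes the `label = n/d (p%)` layout seen in this report.

```python
import re

def hsp_fractions(stat_line):
    """Return {label: (matches, aligned_length, reported_percent)} from an HSP statistics line."""
    pattern = r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)"
    return {label: (int(n), int(d), float(p))
            for label, n, d, p in re.findall(pattern, stat_line)}

line = "Identity = 244/245 (99.59%), Positives = 244/245 (99.59%), Query Frame = 1"
for label, (n, d, reported) in hsp_fractions(line).items():
    recomputed = round(100 * n / d, 2)
    print(label, recomputed, reported)
```

The `Query Frame = 1` field has no fraction, so the pattern skips it.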

BLAST of CSPI04G02590 vs. TrEMBL
Match: M5XRQ9_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa010592mg PE=4 SV=1)

HSP 1 Score: 352.8 bits (904), Expect = 3.3e-94
Identity = 183/246 (74.39%), Positives = 211/246 (85.77%), Query Frame = 1

Query: 1   MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNPSSALAMKHNPGISTDWTSDEQVT 60
           MANPSGNHQE    SSSF+G NPSNGNS PV A ++  +A+AMKHNPGIS DW+++EQ  
Sbjct: 1   MANPSGNHQEPSHASSSFNGTNPSNGNSAPVSAPESSGAAMAMKHNPGISMDWSAEEQAI 60

Query: 61  LEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMNKKENSKRRKEEHNLTRKNKD 120
           L++GL KY+ ES++IRYAKIAMQL NKTVRDVALRCRWM KKENSKRRKEEHNLTRK+KD
Sbjct: 61  LDDGLAKYSTESNIIRYAKIAMQLQNKTVRDVALRCRWMTKKENSKRRKEEHNLTRKSKD 120

Query: 121 KKERVSDSSMKSAQVAARPNVPPYGMPMIPMDNDDGVSYKAIGGTTGELLEQNAHAMNQI 180
           KKERV D+S K +  A RPNV PY  PM+ MDNDDG+SYKAIGG TGELLEQNA A+NQI
Sbjct: 121 KKERVIDTSAKPSHFAGRPNVAPYAPPMVTMDNDDGISYKAIGGITGELLEQNAQALNQI 180

Query: 181 SSNLASFQIQDNISLFCQTRDNILKIMNDLNEMPEVMKQMPPLPVKVNEELANTI-LPPT 240
           S+NLA+FQIQ+NI+LFCQTRDNILKIMNDLN+MP+VMKQMPPLPVKVNEELA  + +PP 
Sbjct: 181 SANLAAFQIQENINLFCQTRDNILKIMNDLNDMPDVMKQMPPLPVKVNEELATHVGIPP- 240

Query: 241 SHSLQS 246
            H +QS
Sbjct: 241 -HQMQS 244

BLAST of CSPI04G02590 vs. TrEMBL
Match: A0A061EIV7_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_019960 PE=4 SV=1)

HSP 1 Score: 346.7 bits (888), Expect = 2.4e-92
Identity = 182/246 (73.98%), Positives = 205/246 (83.33%), Query Frame = 1

Query: 1   MANPSGNHQ-EAGQPSSSFDGGNPSNGNSTPVPAADNPSSALAMKHNPGISTDWTSDEQV 60
           MANP GNHQ EA   SSSF+GGN SNG++ P       SS   MKHNPGI+ DWT +EQ 
Sbjct: 1   MANPPGNHQQEANHASSSFNGGNLSNGSTIP------DSSGSGMKHNPGIALDWTLEEQA 60

Query: 61  TLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMNKKENSKRRKEEHNLTRKNK 120
            L+EGLKK+A+ESS+IRYAKIAMQL NKTVRDVALRCRWM KKENSKRRKEEHNL RK+K
Sbjct: 61  ILDEGLKKFASESSIIRYAKIAMQLQNKTVRDVALRCRWMTKKENSKRRKEEHNLARKSK 120

Query: 121 DKKERVSDSSMKSAQVAARPNVPPYGMPMIPMDNDDGVSYKAIGGTTGELLEQNAHAMNQ 180
           DKKERV+D S K A  AARPNVPPY  PMIPMD DDG+ YKAIGG TGELLEQNA A NQ
Sbjct: 121 DKKERVADPSTKPAHFAARPNVPPYAPPMIPMDYDDGIPYKAIGGATGELLEQNAQAFNQ 180

Query: 181 ISSNLASFQIQDNISLFCQTRDNILKIMNDLNEMPEVMKQMPPLPVKVNEELANTILPPT 240
           IS+NLA+FQIQ+N+ L CQTRDNI KIMNDLN+MP++MKQMPPLPVKVN+ELA TILPP+
Sbjct: 181 ISANLAAFQIQENVGLLCQTRDNIFKIMNDLNDMPDIMKQMPPLPVKVNDELAGTILPPS 240

Query: 241 SHSLQS 246
           +H +QS
Sbjct: 241 THMMQS 240

BLAST of CSPI04G02590 vs. TrEMBL
Match: F6GX47_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_00s0226g00040 PE=4 SV=1)

HSP 1 Score: 339.0 bits (868), Expect = 5.0e-90
Identity = 176/247 (71.26%), Positives = 208/247 (84.21%), Query Frame = 1

Query: 1   MANPSGNHQEAGQPSSSFDGG-NPSNGNSTPV-----PAADNPSSALAMKHNPGISTDWT 60
           MANPSG HQE G  SSSF+GG NPSNG+  P      P A   ++A AMKHNPGI+ DWT
Sbjct: 1   MANPSGTHQEPGHASSSFNGGGNPSNGSVAPASENSGPPAGAVATATAMKHNPGIAMDWT 60

Query: 61  SDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMNKKENSKRRKEEHNL 120
            +EQ  LEEGL  Y+++S++IRYAKIAMQL NKTVRDVALRCRWM+KKENSKRRKE+HNL
Sbjct: 61  PEEQSVLEEGLNAYSSDSNIIRYAKIAMQLQNKTVRDVALRCRWMSKKENSKRRKEDHNL 120

Query: 121 TRKNKDKKERVSDSSMKSAQVAARPNVPPYGMPMIPMDNDDGVSYKAIGGTTGELLEQNA 180
           +RK+KDKKE+V++ S KS+ +A+R NVPPY MPMIPMDNDDG+SYKAIGG+TG+LLEQNA
Sbjct: 121 SRKSKDKKEKVTEPSAKSSHLASRTNVPPYAMPMIPMDNDDGISYKAIGGSTGQLLEQNA 180

Query: 181 HAMNQISSNLASFQIQDNISLFCQTRDNILKIMNDLNEMPEVMKQMPPLPVKVNEELANT 240
            A NQIS+NLAS QIQDNISLFCQ RDNI  I+NDLN+MPEVM+QMPPLPVK+NE+L N+
Sbjct: 181 QAFNQISANLASLQIQDNISLFCQARDNIQAILNDLNDMPEVMRQMPPLPVKINEDLVNS 240

Query: 241 ILPPTSH 242
           ILP  +H
Sbjct: 241 ILPRAAH 247

BLAST of CSPI04G02590 vs. TrEMBL
Match: A0A0B0PT63_GOSAR (Histone H2A deubiquitinase MYSM1 OS=Gossypium arboreum GN=F383_16156 PE=4 SV=1)

HSP 1 Score: 330.5 bits (846), Expect = 1.8e-87
Identity = 171/245 (69.80%), Positives = 203/245 (82.86%), Query Frame = 1

Query: 1   MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNPSSALAMKHNPGISTDWTSDEQVT 60
           MANP GNHQ+    +SSF+G + +NGN  PVP     +S   MKHNPGIS DWT +EQ  
Sbjct: 1   MANPPGNHQQEANQASSFNGAHLNNGN--PVPE----TSGSGMKHNPGISLDWTLEEQAI 60

Query: 61  LEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMNKKENSKRRKEEHNLTRKNKD 120
           L++GLKKYA+E S+IRYAKIA+QL NKTVRDVALRCRWM KKENSKRRKEEHN+ RK+KD
Sbjct: 61  LDDGLKKYASEPSIIRYAKIALQLQNKTVRDVALRCRWMTKKENSKRRKEEHNIARKSKD 120

Query: 121 KKERVSDSSMKSAQVAARPNVPPYGMPMIPMDNDDGVSYKAIGGTTGELLEQNAHAMNQI 180
           KKERV+D + K  Q AARP++PPY  PMIPMD DDG+ Y+AIGG TGELLEQNAHA NQI
Sbjct: 121 KKERVADPTAKPTQFAARPSLPPYAPPMIPMDYDDGIPYRAIGGVTGELLEQNAHAFNQI 180

Query: 181 SSNLASFQIQDNISLFCQTRDNILKIMNDLNEMPEVMKQMPPLPVKVNEELANTILPPTS 240
           S+NLA+FQIQ+NI L CQTRDNILKIMN+LN++P++MKQM  LPVK+N+ELANTILPP+S
Sbjct: 181 SANLAAFQIQENIGLLCQTRDNILKIMNELNDIPDIMKQMQVLPVKLNDELANTILPPSS 239

Query: 241 HSLQS 246
           H + S
Sbjct: 241 HPMLS 239

BLAST of CSPI04G02590 vs. TAIR10
Match: AT3G07565.4 (AT3G07565.4 Protein of unknown function (DUF3755))

HSP 1 Score: 260.0 bits (663), Expect = 1.5e-69
Identity = 147/265 (55.47%), Positives = 187/265 (70.57%), Query Frame = 1

Query: 2   ANPSGNHQEAGQPSSSFDGGNPSNGNSTPV----------PAADNPSSALAMKHNPGIST 61
           ANPSGN+QE    +      + +  N   V           AADN  +  A++HNPGIST
Sbjct: 5   ANPSGNNQEGSSATQKVSSSSAAAANGAAVNSVDNGGNTGAAADNSQTIGALRHNPGIST 64

Query: 62  DWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMNKKENSKRRKEE 121
           DWT +EQ  LE+ L KYA E SV RYAKIAM++ +KTVRDVALRCRWM KKEN KRRKE+
Sbjct: 65  DWTLEEQSLLEDLLVKYATEPSVFRYAKIAMKMKDKTVRDVALRCRWMTKKENGKRRKED 124

Query: 122 HNLTRKNKDKK-ERVSDSSMK-SAQVAARPNVPPYGMPMIPMDNDDGVSYKAIGGTTGEL 181
           H+ +RK+KDKK E+ +DSS K S+ +   PN P Y  PM+P+D DDG+SYKAIGG +G+L
Sbjct: 125 HS-SRKSKDKKQEKATDSSAKSSSHLNVHPNGPSYAPPMMPIDTDDGISYKAIGGVSGDL 184

Query: 182 LEQNAHAMNQISSNLASFQI---------QDNISLFCQTRDNILKIMNDLNEMPEVMKQM 241
           LEQNA   NQ+S+N ++FQ+          +N+++ C+ RDNIL I+NDLN+MPEVMKQM
Sbjct: 185 LEQNAQMFNQLSTNFSAFQVNSTSTFHLLHENVNILCKARDNILAILNDLNDMPEVMKQM 244

Query: 242 PPLPVKVNEELANTILPPTSHSLQS 246
           PPLPVK+NEELAN+ILP  SH  +S
Sbjct: 245 PPLPVKLNEELANSILPRPSHQRKS 268

BLAST of CSPI04G02590 vs. TAIR10
Match: AT1G10820.2 (AT1G10820.2 Protein of unknown function (DUF3755))

HSP 1 Score: 135.2 bits (339), Expect = 5.5e-32
Identity = 91/227 (40.09%), Positives = 128/227 (56.39%), Query Frame = 1

Query: 20  GGNPSNGNSTPVPAADNPSSALA-MKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYA 79
           G N  N +S   PA D   S  A +K    +  DW+ +EQ  LE GL K   E  + +Y 
Sbjct: 24  GVNAINASSGFHPAVDASGSVAAGVKQEAALVMDWSVEEQYVLENGLAKLKDEPKISKYV 83

Query: 80  KIAMQLPNKTVRDVALRCRWMNKKENSKRRKEEHNLTRKNKDKKERVSDSSMKSAQVAAR 139
           KIA  LP+KTVRDVALRCRWM +K    RRK E N   KN   ++ V  S     ++   
Sbjct: 84  KIAATLPDKTVRDVALRCRWMTRK----RRKREDNNAAKNISTRKVVDTSP----ELNML 143

Query: 140 PNVPPYGMPMI--PMDNDDGVSYKAIGGTTGELLEQNAHAMNQISSNLASFQIQDNISLF 199
            NVP      +   M +     ++ +     +LL+QNA A +QIS NL++ ++QDNISLF
Sbjct: 144 SNVPQQNALYVLNNMCHSTRTPFEGLSDAVMDLLQQNAQAFSQISYNLSACKLQDNISLF 203

Query: 200 CQTRDNILKIMNDLNEMPEVMKQMPPLPVKVNEELANTILPPTSHSL 244
            Q R+NI  I+ D+ EMP +M +MP LPV +N++LA+ +L  T+  +
Sbjct: 204 HQARNNISAILTDMKEMPGIMSRMPALPVSINDDLASNLLSSTTQPI 242

BLAST of CSPI04G02590 vs. TAIR10
Match: AT1G60670.2 (AT1G60670.2 Protein of unknown function (DUF3755))

HSP 1 Score: 131.7 bits (330), Expect = 6.1e-31
Identity = 86/232 (37.07%), Positives = 132/232 (56.90%), Query Frame = 1

Query: 18  FDGGNPSNGNSTPVPAADNPSSALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRY 77
           F   +  N +S      ++P+S   +KH   ++ DW+ +EQ  LE+GL K+  E  V +Y
Sbjct: 19  FSATSSMNASSGFHLTVNSPTSVTGLKHEASLAVDWSVEEQYILEKGLSKFKDEPQVTKY 78

Query: 78  AKIAMQLPNKTVRDVALRCRWMNKKENSKRRKEEHNLTRKNKDKKERVSDSSMKSAQVAA 137
            KIA  LP+K+VRDVA+RC+WM +K   +R+ EEH+   K   +K  V D   K    + 
Sbjct: 79  VKIAATLPDKSVRDVAMRCKWMTQK---RRKGEEHSTGTKVSYRK--VVDLPPKLNMFST 138

Query: 138 RPNVPPYGMPMIPMDNDDGVSYKAIGGTTGELLEQNAHAMNQISSNLASFQIQDNISLFC 197
            P        M  M     + ++ +     E L QNA A +QISSNL+  + QDN+SLF 
Sbjct: 139 EPQQNAT-YAMNHMCQSARMPFEGLSDAVMERLRQNAQAFSQISSNLSVCKPQDNVSLFY 198

Query: 198 QTRDNILKIMNDLNEMPEVMKQMPPLPVKVNEELANTIL----PPTSHSLQS 246
             R+NI  I+ND+ EMP ++ +MPPLPV +N +LA++++     P S+++ S
Sbjct: 199 MARNNISAILNDMKEMPGIISRMPPLPVSINNDLASSLVTSATQPRSYTIPS 244

BLAST of CSPI04G02590 vs. TAIR10
Match: AT1G68160.1 (AT1G68160.1 Protein of unknown function (DUF3755))

HSP 1 Score: 124.4 bits (311), Expect = 9.7e-29
Identity = 78/217 (35.94%), Positives = 124/217 (57.14%), Query Frame = 1

Query: 21  GNPSNGNSTPVPAADNPSSALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKI 80
           GN SNG+           S   +K +  + ++W+++EQ  L+ GL+KY    S+  Y +I
Sbjct: 58  GNSSNGSD----------SGSGLKLDTSMVSEWSNEEQYILDAGLEKYKDMPSIDMYIQI 117

Query: 81  AMQLPNKTVRDVALRCRWMNKKENSKRRKEEHNLTRKNKDKKERVSDSSMKSAQVAARPN 140
              LP+K++RD+ALRCRW+ +K   +R+ EE N  R+    K +  +SS KS+     P+
Sbjct: 118 GNTLPDKSIRDIALRCRWLRRK---RRKSEELNCGRRASSSKGKQVESSSKSSI----PS 177

Query: 141 VPPYGMPMIPMDNDDGVSYKAI-----GGTTGELLEQNAHAMNQISSNLASFQIQDNISL 200
           V P+ M   P       + K I           L+EQN  A +QI +NL+S++  DN+ L
Sbjct: 178 VLPHNMASYPFSGPSTSTSKQITSEDLSSYATNLIEQNVRAFSQIRANLSSYKAGDNLDL 237

Query: 201 FCQTRDNILKIMNDLNEMPEVMKQMPPLPVKVNEELA 233
           F Q R+N++ I N++N MP +M +MPPLPV +N++L+
Sbjct: 238 FRQARNNLITIQNEINNMPGLMNKMPPLPVTINDDLS 257

BLAST of CSPI04G02590 vs. TAIR10
Match: AT2G43470.1 (AT2G43470.1 Protein of unknown function (DUF3755))

HSP 1 Score: 81.3 bits (199), Expect = 9.4e-16
Identity = 77/228 (33.77%), Positives = 115/228 (50.44%), Query Frame = 1

Query: 22  NPSNGNSTPVPAADNPSSALAMKHNPGISTDWTSDEQVTLEEGLKKYAAES--SVIRYAK 81
           N  +GN T       PS +  +    GI+ +WT+ E   L + L  Y+++S  +V RY +
Sbjct: 5   NAYSGNLT------TPSESSLLISRSGIALNWTTAEDDILIQLLDSYSSDSRSAVTRYLQ 64

Query: 82  IAMQLPNKTVRDVALRCRWMNKKENSKRRKEEHN-LTRKNKDKKERVSDSSMKSAQVAAR 141
           I   L +KT+RDVA R RW+  K+ +K++KE+HN L     D +E V   +M  A    +
Sbjct: 65  ILEFLQDKTIRDVAARSRWIYNKKIAKKKKEDHNGLGTTRVDNEEIV---NMVLASQVYQ 124

Query: 142 PNVPPYGMPMIPMDNDDGVSYKAIGGTTGELLEQNAHAMNQISSNLASFQIQDNISLFCQ 201
           P+                V   +  G   ELL  N    NQI +NL    + DN+ LF +
Sbjct: 125 PS---------------QVFQPSQHGVHNELLNHNKQWFNQIYANLTFLNLTDNLDLFRK 184

Query: 202 TRDNILKIMNDLNE-MPEVMKQMP-PLPVKVNEEL---ANTILPPTSH 242
            R+NI  ++ DLNE + E  K MP  LP K+N+EL    +  +P TS+
Sbjct: 185 IRENIKSLLKDLNENVSETWKNMPSSLPEKLNDELFLGLDKAIPSTSN 208

BLAST of CSPI04G02590 vs. NCBI nr
Match: gi|449468874|ref|XP_004152146.1| (PREDICTED: uncharacterized protein LOC101222201 isoform X2 [Cucumis sativus])

HSP 1 Score: 469.2 bits (1206), Expect = 4.6e-129
Identity = 244/245 (99.59%), Positives = 244/245 (99.59%), Query Frame = 1

Query: 1   MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNPSSALAMKHNPGISTDWTSDEQVT 60
           MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADN SSALAMKHNPGISTDWTSDEQVT
Sbjct: 1   MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNSSSALAMKHNPGISTDWTSDEQVT 60

Query: 61  LEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMNKKENSKRRKEEHNLTRKNKD 120
           LEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMNKKENSKRRKEEHNLTRKNKD
Sbjct: 61  LEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMNKKENSKRRKEEHNLTRKNKD 120

Query: 121 KKERVSDSSMKSAQVAARPNVPPYGMPMIPMDNDDGVSYKAIGGTTGELLEQNAHAMNQI 180
           KKERVSDSSMKSAQVAARPNVPPYGMPMIPMDNDDGVSYKAIGGTTGELLEQNAHAMNQI
Sbjct: 121 KKERVSDSSMKSAQVAARPNVPPYGMPMIPMDNDDGVSYKAIGGTTGELLEQNAHAMNQI 180

Query: 181 SSNLASFQIQDNISLFCQTRDNILKIMNDLNEMPEVMKQMPPLPVKVNEELANTILPPTS 240
           SSNLASFQIQDNISLFCQTRDNILKIMNDLNEMPEVMKQMPPLPVKVNEELANTILPPTS
Sbjct: 181 SSNLASFQIQDNISLFCQTRDNILKIMNDLNEMPEVMKQMPPLPVKVNEELANTILPPTS 240

Query: 241 HSLQS 246
           HSLQS
Sbjct: 241 HSLQS 245

BLAST of CSPI04G02590 vs. NCBI nr
Match: gi|778689821|ref|XP_011653023.1| (PREDICTED: uncharacterized protein LOC101222201 isoform X1 [Cucumis sativus])

HSP 1 Score: 463.8 bits (1192), Expect = 1.9e-127
Identity = 244/248 (98.39%), Positives = 244/248 (98.39%), Query Frame = 1

Query: 1   MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNPSSALAMKHNPGISTDWTSDEQVT 60
           MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADN SSALAMKHNPGISTDWTSDEQVT
Sbjct: 1   MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNSSSALAMKHNPGISTDWTSDEQVT 60

Query: 61  LEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMNKKENSKRRKEEHNLTRKNKD 120
           LEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMNKKENSKRRKEEHNLTRKNKD
Sbjct: 61  LEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMNKKENSKRRKEEHNLTRKNKD 120

Query: 121 K---KERVSDSSMKSAQVAARPNVPPYGMPMIPMDNDDGVSYKAIGGTTGELLEQNAHAM 180
           K   KERVSDSSMKSAQVAARPNVPPYGMPMIPMDNDDGVSYKAIGGTTGELLEQNAHAM
Sbjct: 121 KKVLKERVSDSSMKSAQVAARPNVPPYGMPMIPMDNDDGVSYKAIGGTTGELLEQNAHAM 180

Query: 181 NQISSNLASFQIQDNISLFCQTRDNILKIMNDLNEMPEVMKQMPPLPVKVNEELANTILP 240
           NQISSNLASFQIQDNISLFCQTRDNILKIMNDLNEMPEVMKQMPPLPVKVNEELANTILP
Sbjct: 181 NQISSNLASFQIQDNISLFCQTRDNILKIMNDLNEMPEVMKQMPPLPVKVNEELANTILP 240

Query: 241 PTSHSLQS 246
           PTSHSLQS
Sbjct: 241 PTSHSLQS 248

BLAST of CSPI04G02590 vs. NCBI nr
Match: gi|778689827|ref|XP_011653024.1| (PREDICTED: uncharacterized protein LOC101222201 isoform X3 [Cucumis sativus])

HSP 1 Score: 420.2 bits (1079), Expect = 2.4e-114
Identity = 222/245 (90.61%), Positives = 222/245 (90.61%), Query Frame = 1

Query: 1   MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNPSSALAMKHNPGISTDWTSDEQVT 60
           MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADN SSALAMKHNPGISTDWTSDEQVT
Sbjct: 1   MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNSSSALAMKHNPGISTDWTSDEQVT 60

Query: 61  LEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMNKKENSKRRKEEHNLTRKNKD 120
           LEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN                    
Sbjct: 61  LEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN-------------------- 120

Query: 121 KKERVSDSSMKSAQVAARPNVPPYGMPMIPMDNDDGVSYKAIGGTTGELLEQNAHAMNQI 180
             ERVSDSSMKSAQVAARPNVPPYGMPMIPMDNDDGVSYKAIGGTTGELLEQNAHAMNQI
Sbjct: 121 --ERVSDSSMKSAQVAARPNVPPYGMPMIPMDNDDGVSYKAIGGTTGELLEQNAHAMNQI 180

Query: 181 SSNLASFQIQDNISLFCQTRDNILKIMNDLNEMPEVMKQMPPLPVKVNEELANTILPPTS 240
           SSNLASFQIQDNISLFCQTRDNILKIMNDLNEMPEVMKQMPPLPVKVNEELANTILPPTS
Sbjct: 181 SSNLASFQIQDNISLFCQTRDNILKIMNDLNEMPEVMKQMPPLPVKVNEELANTILPPTS 223

Query: 241 HSLQS 246
           HSLQS
Sbjct: 241 HSLQS 223

BLAST of CSPI04G02590 vs. NCBI nr
Match: gi|1009138578|ref|XP_015886660.1| (PREDICTED: uncharacterized protein LOC107421838 [Ziziphus jujuba])

HSP 1 Score: 366.7 bits (940), Expect = 3.2e-98
Identity = 186/245 (75.92%), Positives = 215/245 (87.76%), Query Frame = 1

Query: 1   MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNPSSALAMKHNPGISTDWTSDEQVT 60
           MANPSGNHQE    +SSF+GGNP+NGNS PV   ++  +A+ MKHNPGIS DWT +EQ  
Sbjct: 1   MANPSGNHQEPSHAASSFNGGNPNNGNSAPVAGPESSGAAMTMKHNPGISMDWTPEEQAI 60

Query: 61  LEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMNKKENSKRRKEEHNLTRKNKD 120
           LEEGL K+++ES++IRYAKIAMQL NKTVRDVALRCRWM KKENSKRRKEEHNL+RKNKD
Sbjct: 61  LEEGLAKHSSESNIIRYAKIAMQLQNKTVRDVALRCRWMTKKENSKRRKEEHNLSRKNKD 120

Query: 121 KKERVSDSSMKSAQVAARPNVPPYGMPMIPMDNDDGVSYKAIGGTTGELLEQNAHAMNQI 180
           KKERV+DSS KS+  A R +VPPY  P+IPMD+DDG+SYKAIGG TGELLEQNA A+ QI
Sbjct: 121 KKERVTDSSTKSSHFAPRSSVPPYAPPIIPMDSDDGISYKAIGGATGELLEQNAQALTQI 180

Query: 181 SSNLASFQIQDNISLFCQTRDNILKIMNDLNEMPEVMKQMPPLPVKVNEELANTILPPTS 240
           S+N+A+FQIQDNI+LFCQ RD+ILKIMNDLN+MPEVMKQMPPLPVK+NEELANTILP TS
Sbjct: 181 SANIANFQIQDNINLFCQARDSILKIMNDLNDMPEVMKQMPPLPVKLNEELANTILPRTS 240

Query: 241 HSLQS 246
           HS+QS
Sbjct: 241 HSMQS 245

BLAST of CSPI04G02590 vs. NCBI nr
Match: gi|596298807|ref|XP_007227553.1| (hypothetical protein PRUPE_ppa010592mg [Prunus persica])

HSP 1 Score: 352.8 bits (904), Expect = 4.8e-94
Identity = 183/246 (74.39%), Positives = 211/246 (85.77%), Query Frame = 1

Query: 1   MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADNPSSALAMKHNPGISTDWTSDEQVT 60
           MANPSGNHQE    SSSF+G NPSNGNS PV A ++  +A+AMKHNPGIS DW+++EQ  
Sbjct: 1   MANPSGNHQEPSHASSSFNGTNPSNGNSAPVSAPESSGAAMAMKHNPGISMDWSAEEQAI 60

Query: 61  LEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMNKKENSKRRKEEHNLTRKNKD 120
           L++GL KY+ ES++IRYAKIAMQL NKTVRDVALRCRWM KKENSKRRKEEHNLTRK+KD
Sbjct: 61  LDDGLAKYSTESNIIRYAKIAMQLQNKTVRDVALRCRWMTKKENSKRRKEEHNLTRKSKD 120

Query: 121 KKERVSDSSMKSAQVAARPNVPPYGMPMIPMDNDDGVSYKAIGGTTGELLEQNAHAMNQI 180
           KKERV D+S K +  A RPNV PY  PM+ MDNDDG+SYKAIGG TGELLEQNA A+NQI
Sbjct: 121 KKERVIDTSAKPSHFAGRPNVAPYAPPMVTMDNDDGISYKAIGGITGELLEQNAQALNQI 180

Query: 181 SSNLASFQIQDNISLFCQTRDNILKIMNDLNEMPEVMKQMPPLPVKVNEELANTI-LPPT 240
           S+NLA+FQIQ+NI+LFCQTRDNILKIMNDLN+MP+VMKQMPPLPVKVNEELA  + +PP 
Sbjct: 181 SANLAAFQIQENINLFCQTRDNILKIMNDLNDMPDVMKQMPPLPVKVNEELATHVGIPP- 240

Query: 241 SHSLQS 246
            H +QS
Sbjct: 241 -HQMQS 244
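
The bit score S' and E-value reported for each HSP are linked by the standard BLAST relation E = m * n * 2^(-S'), where m * n is the effective search space. As a rough consistency check, the Python sketch below back-calculates log10(m * n) from the three score and E-value pairs in the HSP headers above. The helper name is hypothetical, and the effective search space is not stated on this page, so the result is only an implied value, under the assumption that all three hits come from the same search.

```python
import math

def implied_log10_search_space(bit_score, e_value):
    """log10(m*n) implied by E = m*n * 2**(-bit_score)."""
    # Work in log space to avoid very large or very small intermediates.
    return math.log10(e_value) + bit_score * math.log10(2)

# (bit score, E-value) pairs copied from the HSP headers above
hsps = [(420.2, 2.4e-114), (366.7, 3.2e-98), (352.8, 4.8e-94)]
implied = [implied_log10_search_space(s, e) for s, e in hsps]
```

If the hits do share one search, the values in `implied` should agree to within the rounding of the reported scores and E-values.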

The following BLAST results are available for this feature:
Match Name        E-value   Identity  Description
A0A0A0KX84_CUCSA  3.2e-129  99.59     Uncharacterized protein OS=Cucumis sativus GN=Csa_4G010420 PE=4 SV=1
M5XRQ9_PRUPE      3.3e-94   74.39     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa010592mg PE=4 SV=1
A0A061EIV7_THECC  2.4e-92   73.98     Uncharacterized protein OS=Theobroma cacao GN=TCM_019960 PE=4 SV=1
F6GX47_VITVI      5.0e-90   71.26     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_00s0226g00040 PE=4 SV=...
A0A0B0PT63_GOSAR  1.8e-87   69.80     Histone H2A deubiquitinase MYSM1 OS=Gossypium arboreum GN=F383_16156 PE=4 SV=1

Match Name   E-value  Identity  Description
AT3G07565.4  1.5e-69  55.47     Protein of unknown function (DUF3755)
AT1G10820.2  5.5e-32  40.09     Protein of unknown function (DUF3755)
AT1G60670.2  6.1e-31  37.07     Protein of unknown function (DUF3755)
AT1G68160.1  9.7e-29  35.94     Protein of unknown function (DUF3755)
AT2G43470.1  9.4e-16  33.77     Protein of unknown function (DUF3755)

Match Name                         E-value   Identity  Description
gi|449468874|ref|XP_004152146.1|   4.6e-129  99.59     PREDICTED: uncharacterized protein LOC101222201 isoform X2 [Cucumis sativus]
gi|778689821|ref|XP_011653023.1|   1.9e-127  98.39     PREDICTED: uncharacterized protein LOC101222201 isoform X1 [Cucumis sativus]
gi|778689827|ref|XP_011653024.1|   2.4e-114  90.61     PREDICTED: uncharacterized protein LOC101222201 isoform X3 [Cucumis sativus]
gi|1009138578|ref|XP_015886660.1|  3.2e-98   75.92     PREDICTED: uncharacterized protein LOC107421838 [Ziziphus jujuba]
gi|596298807|ref|XP_007227553.1|   4.8e-94   74.39     hypothetical protein PRUPE_ppa010592mg [Prunus persica]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR001005  SANT/Myb
IPR009057  Homeobox-like_sf
IPR022228  DUF3755
Vocabulary: Molecular Function
Term        Definition
GO:0003677  DNA binding
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003677      DNA binding
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI04G02590.1  CSPI04G02590.1  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description                      Source   Source Term       Source Description  Alignment
IPR001005  SANT/Myb domain                      SMART    SM00717           sant                coord: 49..102; score: 3.
IPR009057  Homeodomain-like                     GENE3D   G3DSA:1.10.10.60                      coord: 52..110; score: 1.
IPR009057  Homeodomain-like                     unknown  SSF46689          Homeodomain-like    coord: 45..106; score: 2.0
IPR022228  Protein of unknown function DUF3755  PFAM     PF12579           DUF3755             coord: 186..219; score: 3.5
None       No IPR available                     PANTHER  PTHR14000         FAMILY NOT NAMED    coord: 1..240; score: 1.0E
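
The `coord` values in the table above can be compared directly to see how the predicted domains sit relative to one another. A minimal Python sketch, assuming the coordinates are 1-based, inclusive residue positions on the protein (the page does not state this); the helper names `parse_coord`, `span_length` and `overlap` are hypothetical:

```python
import re

def parse_coord(text):
    """Parse 'coord: 49..102' into an inclusive (start, end) tuple."""
    m = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if m is None:
        raise ValueError(f"no coordinate range in {text!r}")
    return int(m.group(1)), int(m.group(2))

def span_length(span):
    """Length of an inclusive range (assumes both ends are counted)."""
    start, end = span
    return end - start + 1

def overlap(a, b):
    """Number of positions shared by two inclusive ranges (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)

sant = parse_coord("coord: 49..102")    # SMART SM00717
homeo = parse_coord("coord: 45..106")   # SSF46689
duf = parse_coord("coord: 186..219")    # PFAM PF12579
```

Under that assumption, `overlap(sant, homeo)` equals `span_length(sant)`, so the SMART SANT hit lies entirely inside the SSF46689 Homeodomain-like region, while `overlap(sant, duf)` is zero.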