CSPI06G00230 (gene) Wild cucumber (PI 183967)

NameCSPI06G00230
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionHAD-superfamily hydrolase, subfamily IG, 5'-nucleotidase
LocationChr6 : 198061 .. 216311 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATAAAAGAAGAAGAGGGGAAATAAAGAAAAGAGAAAAGGGAGTGGTCAGGCAAAAGTCAGACAGAGAGAAAGAAAAAATGATGATTTTGAAAATAAAGAAAAGAGAGAAAGGGGAGAGAAGAAATGGAGAAATGATCATAAAAATCATCTTAGCTTATTAGACAGATTGGAATTTTAATGCTTAAAATGGATTCAAGAAGTTGATACCCAAAACCTTTACACTTTGTAAGTTCTCTTCTTTCTCTCTCTCTCTATATATATATGTATGTATACATATGTAACTTTTCAAAACTTTGATGTGAGTTTTTTTCTGGAATTGGTTGATAAGAGTTCTAATTATATATAAATCGATGAGTTCATTCTTGTTTTAAGCTTCTAATGAACATTATAATTTTATTTTTTTTTCAGTTCGACAATATATGATGTCTAGAATTTGAATTTTCGACCCTTTTGTCAAAGTTTCATGGTTAATAATGTCGATAAAACAAGGATTTGAAATGAATTTATAGATTTTTTTGTTTTTTTTTTCGTGCTCAAAAGCTTTTGTTTCTATTTTCTTGCATCAAATTGTCGATAAAGATTAGAGAAAGGTCGAAGAATTTATTGTGAGAATGTATACATTTCCTTCTTGTTGAGAAGAGTAATGAGTTCATTATTGTTTTTAAGCTTCAATTGAAGATTACAATTTTTTTTTTCTGTTCGACAATATGTGATGTGAGGATCTGAATTTCTGACATTAACAAGAATGAAATTATTTGTAAGTTTTTGATTTTTTTTTCTTGCTCAAAAACTTTTGTTTCTATGTTCTTGCGTCGAAGTGTGTTGAAAAAGATTAAAGAATGGTTGAAGAAATTTATTGTGAAAATGGAAGAGGAAAAGAAACGATAGGAAGATGAAAAGAGTGGAACGGAAAGAAGAAAGTAAATAATGAAAAAGGTGTACATTTTTACGTATAGTATTTTAGTTAAAAGTTAAATTTTAGCAAAAATAACTTAGTGGCATGCATCACGTTAATGTCACATCATGTATCACACGATGAAAGTACTCATAAATCTAAATGATTCCAAAGAAGAAAAGCATTTGTTCCTTCAAAATACATTTGTATCACATGTAATAAATAATTGGAATAACGACTTTTCAAGTGCTTCATTTCAAATCATCTTTGAATTTTTTGTAGTATATCGTTGGTTTTGAAACTTTGCAATTTATCTTTAGACAGGTCAAAAACAATGTGGGAGCCCTCCTTGGTGGCGTGGTCATTTTTCATTCTTACATTGTTGTTCACTACTTTTCTTCCAAACATATTAAAAAATAAAAAGTACAATGATCAAAATGATCAAAGTCGAACAAGATACAAATTTCCTTCTGGGAGAAGAAGTTGGCCTGTTATAGGTGATAGCTTCAACTGGTACTCTGCCATTGCAAGTTCGCATCCTTCCAAGTACGTTGAAGAACAAGCCAAAAGGTAATATAACTCAAAAACAAAGACCTATACATACACACATATACACTTTCTCTATGTTTAGGGTTTTTTTTTTTCAACATTCTATAAAAGTTTTTTCAAAACAACTCTCCAAATAAAAGTTCATTAGAATGTCGAAACGTATTGCAATTCAAAAAAACTTCCTCAGCGTAATTAATTTTATATAATTTATGATTGTTTTAGGTTAAATTTTCTATAAAACATATGATAATATCGTGATATTAACCAATTGAGCTTTCCAAAAAGATACGGGAAGATCTTCTCGTGTCGTGTTTTTGGCAAATGTACAGTGGTGTCGGTCGATCCAGATTTTAATAGGTATGTGATGCAAAACGAAGGGATATTGTTTCAATCGAGCTATCCCAAGTCTTTTCGAGATTTAGTAGGCGTAAACGGTGTTATCACGGCTCAAGGAGAACAACAGAGGAAACTCCATGGAATTGCTTCCAATTTTATGCGTATAAACAAACTCAACTTTAGCTTCATAAAAGAAATTCAAACCATAATCATCCATTCCTTGACCACTTTCCATGACCGTCATCAAATCATCTCTCTCCAAGATGCTTGCAGGAAGGCCAGTATGCATATATATACATATATATATATATATATTATTTGCTACATTCTTACAAGTTTAATTACGTCGTCTTGATTGATAATCATTTGATTTTTTTTACGTAAAAATGGTCTTTGAACTATTCTTAGACAGCTTCTTATTCTAGAGAAAAAAAGTTTTTAACAATTTTTTCTTTTTTCTTTTTCATAATTTGACTTGAATTTTGAAAATAAATAGTGAAAAAACAAAGAAATTTATTGAGTCCAATTTTCATTTGTGTCTTTTCCGTCTTTGAACGAGTTCTTCAACTCTTAATACGTCCACAGCTCAAGAAAAACGTTACCTCTAATAGTTTCTCCACTTCTAAATTTTGTTTTCAGTCTCATACATCCCTCCCATATTCAAAATTCTTAAAAAATTTATTGGTAAACTTTCAATTTTGTATTTAATATATTACGTAATTTTTTTTCTAAATTTTTTTTTAATAGGGTAAGTAAAAGCTTAATAAAAATATTTGTTTAAAATTAGGGATGGAATAGATACAATAACTTTATAAAACTGTAATTTTATGTTATTTTAATTTGTAGCTGTGTATATATATTTAAATAATAGAGAGTGTTTTTAATTGCTTTAATTTTGACTCAACAGATTGCCATTAATCTTATGGTAAGTCAGCTACTGGGTGTGTCAAGTGATTCGGAAGTGAACGAGATCAGTGGTTTATTTTACAACTTTGTAGATGGTTGTCTTTCTTTCCCTATCAATTTACCTCCCTTTGCCTATCACTCTGCCATGAAGGTAATTCAATAACCATTGTTATCATCGCATAGTATCGTATCATTTTTTAATTTTGTGTATAGTATAGGCAAGGAAGCAAATCATAAGAAAAATAAAGAAGAAAATAGAGATGCAGAAATCAATGGAAAAGAGTAGTAGTAGTGGGCATGGAGTTCTTGGAAGACTATTAGAGGAACAAAAGCACTTATGTGATGAGGCAGTTGAAGATTTCATAATCAATCTTCTTTTTGCCGGCAATGAAACCACTTCTAAGACAATGCTTTTCGCAGTCTATTACCTAACCCACTCACCTAAAGCATTTAAGCAACTCATGGTACGTTCCAATATTAAACTAACAAATCATCTATATACCTTTTTTTCTTTTTCTTTTTTAACTACTTGATTCGATTGGACTTATGAAAGGAAAACTAAAAGTTTGGTTAACTGTTGTTTAAAAAGAGTATTATGAAGAGAATTGACATATTTTGTTGATTAATACGTAAAAGTATATATAGAAAAAGTGAGTAAGTTTCTATATTTTGATTATCAATCTATATTAATGTGTTATTTTAGCTAATCTACCTGAAATTACAAATGATAGGAAGAACAAGATGGCTTGAGAAATAAGTGTGGAGACAATACAATCACATGGGAGGATTATAAATCAATGTCATTTACCCAATGTGTAAGTTTAAGTATTTTTTACAATTTCATCATTTACGATCATTTTTCTAAAATAAATTTGTTTTTCTGAAAAAAAATAAGGTTATTGATGAGACACTTCGACTCGGAGGAATTGCAATTTGGTTGATGAGAGAGACAAAAGAAGAGATTAAATACTCAGGTACTATTAACTTTTCGTTTGGAGCATTTTTAAAAATAGCAAAATAAATTAAAATATTTACAAGTTATAGCAAAATTTTGTATGACTATCAGTGATAGAATCTGCTATAGATGGTAAATATTTTGTAAAATCTACCATTTTTAAAAATTGTCATTTTCTTTCCATTTTAGAACCAACCTTTTCTATAAAAAAAAATAGATTTTATTCTTGCCATGTGAATTAAAAGCATTTAGGTCTTTGTTTGAAATAATTTTTTTATATATTTTAAAATAATGTTTGTATTTAAAAGCACTTTTTGAAAAACCACTTTAAAAGTTAAGAAAAGTAAATTATTTCAAAGAGCAAAAACTAATATATGGTTCCAATCTTCTATCCTTTTTTGTATTTTTTTTTCTAAAGTTCCCTCTAAGTTATTGCCTTACAAGTTTAAATAATTAAAATTTTTAAAACAGTAATTGTATTTTATTGGTTTAAATTATATTAGAATATCCTTGAAGGTTGAAATAATTGATTTTAGTTTTCATTTTTACTCTGTAACTCATGAATTTGTTACAGAAAATAACTTTATATGTATATTTTCTTTTATATTTTACGATAAGATAACAATTTAAATATTATTTATAATATATTGATGGAAATTGATGTATAAAAATTTCTAACAAAAATAATTTGAAATGTGGTCACAACTATTTTAATTTCGTACAATCTGTAATAAAATAAGTAAATTATGATATTTTATAAATATATTAATTCTCACTCTATTTAAAATTTTATTTTTCTATGTAATTGTAAGTTATATTATGTACATTTTTTTTATTATTATTACAACGAAAATAATATATTTAACCAACGCTCTAAAGTTCACGATTTTTTTTTTTTTTTGGAACATTTTAAAGTTGGGATTTCCAAAATTGAGTAACAATTTTTTTTCTTTTCTTTTTTGGGTAAAATAATATATATTTATTTCAATTATATACAATAATATATATATTCTATAATTTCTAATAAAGAAAAATCTAACCTTTTAGAGTTGGACTGATATTGGATAGATTATGTGATTCCAAAGGGAACTTTTGTTGTTCCATTTCTTTCTGCTGTACATTTGGATGAGAATATTTATGATGAAGCTTTAACCTTTAATCCTTGGAGATGGATGCAGCTTCAAAATCAGGTTAGTAAAAATTCATGCAAACCAAAAATTATATATTCATAGTAGAATTTCTCATTTATAAACATTATTTTCAACCATTTTTTCATTTAAATAATTTTTTAAAAAAACAATTGTATCTAAAATTAGACATTTTAAAGTATATCGATTAAAATTAAAGGTAAATTTCTATAAAAAAAAAAAAAAAGCACGAATGATTGTTATTTTCATAAAAAGTGAATTGTGGTTGATTATTTAGGAAAAGAGAAATTGGAAAAGTAGCCCATATTTTGCACCATTTGGAGGAGGGGGTAGATTGTGCCCGGGAGCTGAATTGGCTCGTCTTCAAATTGCTCTATTTCTCCATCACTTTCTTACCAACTATAGGTCTCTTCCCAAACTTACCCTTATTCTTTTTTCCTTTTAATTGTCTGTTCAATGCATATAGGTATTTATTTTTCTTTTAAAACTAAATTAAAATTAAATTTTATACTATGAAACTTTTATTTTCTTATTTTTTAAATCTATTTTCCAAATACACATTTAAATTTCATTCATTAACTCGAAATTTAGCATGTATTTTAGTAATGATTTATTTTTATCTTTTCTTTTACGAAAATAAAAGAATTAGAAAGAGAGTCGATAAAACTCTAACTTGATATTTGAAAAGAAAAACATTGTTTTATTAAAGTGAAGACAAACTTAGCTTAATGATAATTGATATGACCTTCTATTTTAGAAGTTAAAGGTTCGATCTCCCATCCTCATAACTATTGTACTTAATATATATTTTCTTAAAATAAGAAATGAAAAATATAAACGATCCTATAATTCTATTCTATGAATTTTGTCAATGTACTTTTAAATATCATTAGCCATCCAAAAATTAGTCATGACTTTTAACGCTCGAGTTTTAAGTTGTGAACAATATGTCCACAAAGTATCTTTCTATTGTAAGAAAATTTTTGGTGTTATTTAATCAATTTCAATAAGAGAGAGAGAAAAAAAGAGATAACTTTGAATATTTAAATTTAAGATTTATTGAGGTTAAAAGCTATTGAAATTATATTTATTAAAATTAAACCGGGAGAATATAAGAACTAAAATAAAGTATAGCTTAAAAGTACAAAAGTCAAAATGCAATTCTAATCTCAAAATAATTACCAAACACTTTTACTTTTAAGAAAATAAAAACAAAAATAGTATCACACGAAAAAAGGAACTAATTATGAATTGCGTTCAACGAATATCTTCCCAGGTGGACACAAATTAAAGCAGATAGAATGTCTTTCTTTCCCTCGGCTCGACTAGTTAATGGCTTTCAAATTCGCTTGGAAAGACGGGAACACTGATCATATCTTTGTGCAACTCACCAATTATTCAATAAGTATTTTGTTGTACTTATGGTAGACTTTTATTTTCAATAAAAAATATATCTTTTCTTCAACTTTGATTCATCATGTAATTTAGATGGGGAAGATGAACAACACTTTTCATTAATAAACCTACTTTCGTGTTCACTCAATTTCCAAAACAATACTTCAAACTAAATCTTAATATTATTTTTTCGAGTTAACGAATTTGAAGTAAAATTATTGGTTTGATACCGACTATAAAATGAATATTTGATTAATCCCTCCAGCACACAATAGGAAAAAGTACAATAATAAAATAATGAATAACAAGAGACTTAGTAAAATATCATAATAATATTGGAGAAGAAAAATATAATCTTTTAGAGAGAAAAGTAGAAAAAATACTCATCATCTCCTACACATATTAGATTAAAATAATATGATAATCACTAAAAATAAAAGAATAAAAAGAATACATACATACATATTGTTCTAAAGAAAAGTACAAAGAATAATATTTATCTCCTAAACACATTACAATCTTTTGATCTGTTTTGTTATATTTTTTTAATGTACCCGTTAAAAACACTTGCAATGTTGCCTTAATACCAATAAAAGAGCTCGAGAAAAAGTAAAAAGAAAGCAAAAGAACTATCTAACCTAAATGCTAAAGTTCTTTTTTTTTTTCCTCGTTGAGGATTGTTGGTTTGAGTTATAATCATTGGTTGGTGATAAAATATGATTGTATATTCATGAAGGAGTGGTTGTAAATATAGTAATTAGATTAAAAATAAATTAATACTTGAATATACAACAACATTTTAAAAATATTATAAATATAACAAAATATGTTCAAGTATATTAATGCTAGGAGTCTATCACTGATAAATCAATATTAATATCAATGATAGACCTTAACATATTTTGCTATATTTGCAAATATTGGTCTATCATCGAAAAATCATAAGCCCATCACTAATAAATATTGATATGATTTTTCTAAAAATGTTGCTATATAGTTAATTATTATTAAAAAAATTGTTATCCAATTGTAATTACCTTTTATAAAATCGACTGATTTAGATTGTCCCTGAAACCAACGAAACTAAACCCGTTACAAAGATAAAACATTGTTTTTATTATTTAAAAAAACAAAAATCTCACTATTTTAAAAACTAAATAAAACTAACATTCTCTTTTTCATTTTTTTCATTTCAACAATAGCGTGGTACACATCACCGTTCATTCAATGTTCAACATTTTGTTTGAATGTTTGTGACGTTTGTAGTTAATATTAAAAATATATATATACTTTGAAAAGTTTAGAAAGCTCTCTTAAAACTTTAAAAATAGTTACAATTAGTTTTTTTTATAATATAGTCTCATTATTATTATTATTATTGAAGCCCCTGTTTTTCGCACGTGCTTTGTCTTCATTTTTTCACACATGCTTTGTCTTCATGTTTCCACACGTGCTTTGTCTTGGTCTTTCTCTCTTTCTTCCCCAATTGAAATATCCACTGTACAAAGAAGAAGAAAAGGAAAGTCTTGTTTTCTGCAGATTTTACCGGAAATCTGCCGGACGGCGAAATCCTAATGGCCGTGGCCGCTTCTCTAGGGAACCCCGCTCTTACCAGAGCCACGTTACTTAGTCCTGTAAACAAACAGCATTTAACTTCACGCAGGGCTCAAAAGATGACGCTGTGTTTGTGCGCTATGGACTCCAAATCAGTTGGTGTTGGAGGTGATGTATTTTCTGTTACGTCATCGGCTAAGTCCGGAGTTGATTACCTTGGACAGAGCACTAAAGGAGATATGAATGTTAAACTCGAGCATCTTGATGCTTTTGGTAACTCCTCTTTGATCAAATGATGCGTTAGTTTCTGGATTTTTTTTTTTTTTTTGTTTCTTCTATCTGATTCAACTGGCTACTTGTTTTTAGCATTTAGCTGTTAATTAGACTGATATTTAACAAGAATTTTTGTCCGTTTTGTCTTACCGGTTTCTTTCCTTTTTGTGTTTTTCAGTATTAATACACATTTTTACTTATGATCATTCGAAGTATCTTCCATTAGTGAGAAATTATGTGAGGTTCAGGATTGTTATTTTGAGTCAGTGCGTTTTCCTTTTTTGTGTAGGTGTAGACGGCGAAGAAACCTTAGAAGGTCCAATCGAGGAAGTTGCGAGGGTGGAGGCACATGAAGCCGAGGATCTTCTTAGAGATTTGGGAATCCCGGTCAGTGGCGAAACAACTTATTTCCAACATGTTTTTCTGCATTTCTTAAGTTTTTTTTTCAGTATCCTGGCATTTTAGCGTTGAATTTGATTTGACTGTCTACTGCTTCATTAGTTAGTTGTTAGTTATCGGCTGGTTGGATTGAATAGAACAACCTGATCGAAGCATGAAGGTCATATGTTTGTAACGGATTATATGTTCCATATTAGGTCCCGTTCTTCTTTCTCTTTTTGGAGATTGTTGACTATTCATGGCTATTTCCTTAAATGCTTTATTTTTTCCTCGTATATTCTAATTCATACAAGGAAAAAGAAAAGTTCATTTAGTTGCTGTTAGTAGTCACAAAATATTGGATTTTTCATATTTAATCGATCCACTGCGTGCAATCGATATTCAATTTTCTGCTTCAGTGGGCTCGGTAATTTGTTGTATATAAGGAAGAAATGACCGATTCCCCAAGAATATTTACATTTAAAGTGGGCGACATTAACTATCATACTGCGTTTTTATTATTTATTTCCAAATGGTCGATTTTGTCCTTCCCTGCATCCGTCTTTATTGTGATTTACAAAAAATTGCTAGACAAACGTCTACACTTCATTTCCTGCACTCACAAATATTAATTTGCAGAGTCCTTCATCAAGAAATTCACTACATGGTATATTTTGTAGCCGAACATTGAATCTTCGATCAATCAGTGCCATTGGATACGACATGGACTACACTCTGATGCATTATAATGTCATGGTAATTGCTTTGATATTTTTGCATCTGTTAAGCACAGGAAAAGGATTAGGGAAGTTTTCTAGCTAGTGATTTCTATCCTGTACGATGAATGGTTAGTGATATTACAGACTCTGAAATATTGCTTTCTAACTTGATTCTTTCTTTAGCTGTCCTTTTCTTTTCACATTAATGTCTGTCCATCCTTTGATTGTTTTCGGAGAACTTACTTGTACTCCTTTAGGCTTGGGAGGGAAGAGCGTATGACTATTGTATGGAAAACTTAAGGAACATGGGTTTCCCAGTCAATGGGCTTGCATTTGACCCAGACTTGGTAAAGTTCTTACTGAGCTAATTTAAGAATTGTCGTGGTCGACTTTGTTTCTGATCCTTATATTGTGTGTTTTCATTGTTTCTGCATTTTTCCTTTCACAATTTAATTAGGTTATAAGAGGCCTTGTCATAGATAAAGAGAGGGGTAACTTAGTAAAGGCTGATCGTTTTGGTTATATTAAAAGAGCAATGCATGGAACAAAAATGCTGTCTACTCGAGATGTGAGGTATATGTTCAAGTATAACTTTGTTCTCAATTTCTTTTGTGCTTGATAAATATTAATTTGATGAAAGGCATGCAACCAGGCCCTTAAATTGTAACTAGAGATGATAGCTCTACCTTTTTGCAGGCTAAAACTACAGCTTTGGGCTCTCAGATCGTATAAAACACTTACACATGTATTCCAGTGCATCCTTCATATTATTACCATTATATAATTTTATGGATCTTGAAATCGGCATCAATAAATACTTTGCATTGTCCAGCTTTTCAACATTTTAAAGATTTCGAACTAGATTAAGATTTTATTTGAAGCCTTTGTTCTCAGATGTTGCATTTCTAGAAATAGGTCATGCGCTCCACCAACCTCTAATTGGTGCATTATATATATTATATATTATTTTCTTTATTTGAAAAAGAACTGGTGTATTGTATTGGATGATTCTCGTCTTCTTTACATACAAAGGCCAGAGTTTAATACGAAAATTTTAGCTTTAAAATGATAAATATGTTACAATATAGAATAATGGTTCTATATATATTGTGGTAAATGCAACTTGCTTATATTAAAAGACATCTAATTCCTTTCCAGAATCTGATGCATTACAATGTGCATTTTTCTTATAAACAAGGTAGATATGTTTCTGTGTCCATATCACATTATATTTGTTTCTTGCAATTTTTGTTTGTGCTTCTCCTATCTACTGCTCAAATTATCCTAGGATAGAGTTTGTTAGACTAGAGTTGCTAGGTTTCTATTAATATTGACTATATTACACATTGCAAAGAAAAGCTATAGCTGAAATTTCTATTGTTACTAATGTTCTGTGCATAGTGAGATTTATGGGAGGGAATTGGTGGATCTGCGGAAGGAGAATCGATGGGAGTTCCTAAATACATTGTTTTCAGTATCAGAGGCTGTTGCTTATATGCAGGTAGTCCTATTGTTATTCACTCTATTATTTAGCATTGCTCTTGTTAGAGATTTGATGCCTTTCTCATTTTTACTAGTCAGTTTCTTTGCTTAGGGTCCTTAAAGTAATATAAATTGCTTTTCCTCCCGTTGTTGAGATGAAATCTAGAAATCATAGTTGTGGTAGACAATAAATTTGTACAGAAAAAGGGTAGCCAGCCTTACGCAACAACAATATTATATAAATGTTCTCCAACCCAGTTACGTCCAAGAAAAGACCCAATCAAAGGAGCACACCACTCACAACATGTACAAAAGAGAACTCCAAACTTTGGTAATGAAAAAAAGATTTTTTATAAAATAAATTTGAATAGATAGCCCAAATCTTAAACTTCTTTCCATGAACTATCTTGATAGGTTATTAAAAACAATTTAGTACTAATCACAAAATGACAACCACCTTCTCTTCTAACTCTTGCTATATCTTATATACTCCTTCTACCGTCATAACTACCTGTACATATCCTCCTTATCCTTCACTCAACTTAGCCATGCATGTTAGCTTTTACTCCTATAAAGTACTACAGTTAAAGAAAAGTTTGGTCCTTTAAAACTCTGATTTCCTACCATTCTCTAAAGTACTAGGCATAATGTTTCCTACTCAAGAAAACTATAGGGTCTAGAAGTAGTTCTTTGACCTAAGCCTTCTCTATCTTTTTTTATTTCTCATTTTTTTTCAAAAGCTATTTTCCTTTTTAATAAATGCTAATTGGAGAGTATTTGTCATTCTCTGATTTCCTTTGGATGCTTGTCTTCTCATACATTGTTTTGTTCAATGAAATACTTTTTTTTATGTACGATGCAGGCTTAATGTCAAGTTGTAGCTCCATTTGTATATTAAATGTATTTTCTTCATTAAGTGGTGTGAGTTCTCTCAGATGGTAGACAGATTGGACGATGGAGCCATAGGAGCAGCAATTGGTCCACTTGATTATAAAGGACTCTACAAGGTATGGTGCTTCTACACCGCATCTTTCATAAAATTGAATGTTATAGTTAACTGTGTACATTTTAAGTTGAAACTTGGGTCATAGGCTGTTGGGAAAGCACTATTCAGGGCACATGTCGAGGGTCAGCTTAAGGTATATACTTTCATTGAACAAAATGTTTTAACTGTTACTGATTTTATTTGATTGCCCTTTACTGATTCATGATATTCACAGAGTGAGATAATGTCTAATCCTGAATTATTTGTCGAGCCTGATCCTGTACTACCATTAACCCTTTTGGATCAAAAAGAGGTATAACCCATCAGTTATAATTGGTTATTATATTGGCTGTCATGATTGTGTATTCATTGCTTGAGAAATGTTATGCATCAGGCTGGAAAAAAGCTTTTGCTTATCACCAACTCGGATTATCACTACACAAATAAAATGATGCAGCATTCCTTCAACAAATTTCTTCCAAATGATATGGGATGGCGAGATCTATTTGATATTGTAAGTTTGAGTTTAGTTCTTTTGTTTTTCTTTTTTTAATGTTCTTGAAAGTTTGTTCTAAACATGAGGGTGAGCCCCCAATATTGCGTCTCTAAAGTATCGTTGGGCATTCATCATTTATTTTTTCTCAAGTTCGTCAGCAACGTCTTCCTCGGTTTAGGGTTGTAATTATGTTAAGGTTGTTGGTTCTCTTGCTAGAATCTTCTTGGTTCTCTCTATGAGAGTTTTGTTCTCTGGATAGTGGTGTAGGTGCTGTTTTTGTTGTTCGATGGATAGCTTCAGTGGCTCTTGGATATTTGCTTCTGGGATAATTTTTTTCTTTTTTGGCATTTGTTCTTCAGTGCTTTATTTGGCTTTTGTAATTCTTTAGGGATTCACTTCATTTGTTGAGTTGTTGGTTCTTTTCACTCCTGAGAAGTAGTATCCTTGAACTGATTTCATTCTTTTTGTATATCAACGAAAGCCGTGTTTTTTTTTTCCAGAAAAAGTACGTCTAGGAAGACAGCTTTAGTAGAGGGGTTTTATAGGATTTGGAAGTGGGAACTGCTTCTTAATTTTGGCTTTTGTTGGACTATTTGTATTGTACCAGTGAAAGTTACTGGTACTTCTGTTGTAAAATTTTAATTTTTGGAATGAGTTTCCATTTTTTTCCTTTGTTAAAGGGAAAATTATATCATACCTTATATATACTTTTAGGTAATAGTCTCAGCAAGAAAGCCAGAATTTTTCCAAATGTCTCATCCATTGTACGAGGTGGTGACTGGTGAGGGTCTCATGCGCCCATGCTTCAAGGCTGTTGCAGGTAGTTAATGTGCTTCGGTTTCTATCCTACAACTATTTTTCAGTTATTCATTATCAAAAGGGGATTTACAAAAGAAGAGTAGATAAGATACTATATCCTCATCCACAACATCACAAGGATTACACACACACAAAAAAAAAGCCAACTGATGAAAATTTGATTTTGGCAACAATTGCAAAATCAATTAGAAGTAGACCGTGTAGAAGCAAATAAAATAACCAGATCCAATACCTCCCTATTGGATTCTTGATTATGATAGACTTCAGAATTTCTCTCAAGCCAAAGGATTCAAAAGAGTCCGAATGTTCTCATTGGGCTTGTTAAAGTACTTATTGATTAAATTTACCATAATCATCGACTTGAACTTTGGATTGATGGATAATTTAGCATGTTATTAGAGTAGCAGATCTAGTGTTCAAATTCCTGTAATGTCTTTTACTCCTTGATTAATATTGATTTTGACTTGTCGGCCTTTTTAAAGATTTTCAAGTAAACAGGTGAGAGGAGGGTTAAATTGTTGCTATTGTTAAAATTAATGTAATCCATCAACTTAAGCTTTTAGGTAGATTGGTGATTTAATAGGACCAACAGATCCTGGTTTGACCTTAAAAGAAGTTCCATTGATGAGTTGGGAGGAGTGCATTCAAAGCTTTGCCAGGCCAACACCACCGAAGGCTTAATAAATTAAACAACCCATCCCAACAATCAGTCACAACAAGAAAAAAAGAAAAGATACATCAATGTCTCACTGCCTTTGTAAAGGGCACACCAACTTGAGGAAAAGCATTGTATTTTTTGAATTGTACAAGTATTCAGATTGTCAAGTGGGCAATGCACAAGAAAACCTTCACTCCTAGAAATTGTACTGAAACTAGAGATAAAGATACTGTGGTTCAATAGGGTTTTATTTTTATGCTTCAATAACATTCTTCTGGGTATGGCCTCGAATTTGGATAAATCAGCTTTTTATATGCAGATTCTTCCTTGATTGCTTTTTGGTTTCGTTGAAAGATGGTGCTCTTGCCAACTCTGACCTCTGGATTACTCCTTGGTTTTAATATTCAAATTGAGAATGGTGTGTGAAAATTTTGCCAGAATGGCGCAATGTTGATGGTAAATTTGAAGCATCGAGTTTGCTGAATCTTAGTTTCTTAATCTTTAGATGAAGGTGTTTGAGTAGAAATGGGGTTATAAAATGAAATAATGGCTCTTTTCAAGCAAAAGGGGGTTGGAGATTTTGCTTCGATTCTTCACCTTCTGTGTTGTAGGGTATTTGGCATTTATTGTTTGTGATCTTTTGATTTCCATTCAAAAGCCAGGGGAACTGATTGTTTAAGGTACCAAAATGGAAATGCTTTACAGGGTCGTTCTAATTATACCTAATTGTTTTGTTTTCATTCCGGAGAAAGGGAAGATAGGTCACCCACTCTTTTTGAGAAAAAAACCAAAGGTGAAGAAAATGGGAAAAGAAAATAGAAGGAATCCTCAACAATACATCAAGAAAGGGAGATCTGCCTATGCTGTTCATTTTTATAACTTTCAAGGCTTATTTCTTGTTTTAGGAATTTTTACCAATCTCTTAAGAAAGGAGTAACTTCTCACTACTTTAATGAACAGTTAGTAATTTAACTGCCAGGTTATGTAAATATCTTTATCATTATGTTCTTGCGTATGCTATTATACCGGTGTTTAATGGATCGGCTTTACATGCAGGGGGCTTGTACTCTGGTGGTAGCGCACAAATGATTGAGAATTCTTTAAACATTCATGGTGATGAGATACTGTATGTTGGTGATCATATCTATACCGATGTAAGCCAATCTAAAGTACACTTGCGCTGGAGAACAGCATTAATTCTACGAGAATTGGAAGAAGAGGTATGAGTGGAAAACTATTAATATAGAGAATATAGAAGGTATATGTAGAAACATATCAATATAAAAACTATCAATGCTTATATAAAATAAAAATAAATAATAATAAAAGAAAAGGAAAAGAAAAACCAATTCTATTTCTAATTGCCAGTTAAGGTTAATAAAGTTTATGTTGCAAATATATATACTTATGTATACGTACTTCTTTTATTAAATAATCTCTGAAGTAGCTTTCTTTTCTTTCATTGTCACCGAAAAATTTGTTTCAAGGAAAAAATAGAAATGGAGATGAGTTATCCCACCAAGCCAAGAGGAGAATTATGAGTTGGGCGCTTTAACCATTCAGCCATGGATGCTTAGCGGGGATCCTCGTACATGGTGAATAACCAAATTCCAATTGAAATTAAATCTTTGGGGTAAACCAATGCAATTTAGGAGGACTCAATTCAGTGAAAGGGCAGCAATTCAAATTATGGATTTTAGAATTCAGAGAGATCAAGAATTCTCATTATTTCTTAGATTCATGGACCCAATTCAATTCATTGGTATCTTTCATTCACATTTTTTTCCATCAAGAACGTTATATAAAAACAAGAACCAGTCCCAACGAAAAAGCAGAATAAATGGAGTTGTTAAATAATAGTGCTCCCATACTTCCTAATATAAGACTTGATCCCAACAAGACTACAAGAAAATCATGGATCGGCCAAGGAAAATCCATTATATGTAGTAAAAAAAGAGATAAATAAATCTAAATGTTTTTCATGACCTTATTGATTTGACCAGGAAAAAAAATGTATAATCAATTCCTATTTAAATCTTAAGGGGTGTGAATTAATGTAGGAGAGTATATTACAAACTCTTTTTTATAGGGAGCACCAAGGCTTAGATTCCATCTAGATCTCAAAAAGCTTTTTGACTAGCCTCCTTCCCTTGAAAAGTTCTCTTCTTTCTTCCAAACCAGTCTACAAGTAACCGCTGTCACCACATTTGTCCAGAGGATTCTCGTTTGACCTTGAAAGTTTGAAGCCTGCACCACTCAAGAGTTGAAGGACCCTAGTTCCTATTTCTTATGTATTTGGATGAGATGTAAGGTGATACAAGCATATGGGAAAACAGTCGCAATCTCAGCACGGTTTAACCATGAATCTTTCCAGAAAGACTAGTGTGTACATTACCATGACAATGCTGGCAGGACTGGAATCTTGATCCAATGTAAGAAAAACAAATCTAGGGGACTTAAAAGAACCACCATGCATGGATAGAGCTTAAATATTGAACTGAGAATACATGAGTCGATCTTCTAAAACTCTGGCATCTTTTCTATCGAGAAAACTGTAGGAAGAAATGGAAGATTTGTACAAGCATCATCATTTGCACAAGCGTCATTTACCACTCATGAAAAATAAGCAGTACTCGGGTTTTTATTCAAGTTGTTTAGGGTAGTGCTTTATATTGGAAGTTCAAAGGTTTGTTTGAGCTCATGTTCACGCTTTTCTAGCACAAATTATTGTTGATTTCTGATATTTCTATTACCTAATTTGATTGTGCAGTATAGTGCTTTGATTCATAGTCGTGGCCATCGAGCATCTCTCATTGAGCTTATAAATCAAAAGGAAGTTGTGGGGGATCTATTTAATCAACTTCGGCTTGCTTTGCAAAGGCGTACTCAAGGCCGCCCTGCTCAAGTAAGCTTTGGTTCTTTGATTTTATGCTGTTCTTACGAAGATATAAGGAGTATTTTCGTATTTTCTTTGGAGATATCTTTGAAAACTTAGGGTAGATTTTATATACATGTGTGCGTGTACTTTTGGCCAAAGCATGGCTTTGAGAAGAGATGATTGGGATATGACCAAGGAGGTGTTCGATTTACCCTGATCATTTTTTAGAAAATACAAAGGTGGGATGGCACCAAAATAGAATTTGAACTATAAATATCTATACTAAAGATTTGTTGCTTAAGAGGTTGGGAAACCTAGTCTTTAATGTTCAGGAGTTCATAATAGCCAGTAATTTTCTCCTACTTGTTCATGACATTTCGCTGTATTTATAGTTGGTTTCTTCACAGCCCGAGAACCAAAAGAAAGATTCTGAGCATGAATCCGAATATTACCATCTCACCTCTAAATCGTGTTTCAATAGTTATTAGCATTGTTATTTTTCAGAGTAGTGGGAAATGAGATAGATGATATTCACTAGTGTTCAAATTTTCAAGTAACTGTGAGACATTTTAAGCTTATAGGGCCTTGATATCAATTTTCTTCAAGCTATGTGATTTTCATCCGTTGCTTCATTTTTTTTTTAATTGCGTGCTTCAGACCCTAGCAGCTACTAACATGAATGATGAGGAACTCACTGAAAGCATGCAAAAGCTGCTCATAGTTATGCAAAGATTAGACCAGAAAATTGCTCCAATGCTAGAAGCAGATGGAGAGCTATTCAACAAAAGGTTTGAGCTCCGTGAAAAAGTACAGCGAAGAAGGATGTCTTATTATATTTTGAAAAAGATCTTCCTTCGTGTTCATTTCCTTGCCTACTGTAGAGTTGAATATATACATGCATACGACATGCACTCTCACATATATATTCATCAGTCGAAGGTGAATAGGATAGCTGTTGAACCTTAGTCCTAGAAGACACTGTGAAGATTATTAAAAGACTTTTTCAGATAAGAACGTGAATTTTCATAATTACAAGAGATGTACTAAGGCTTGATGGTACCATTTTAAAAAACGTGAATATGGATTTTCTTGAAAACATGGAGGGAAAGAGTGAGCTTGTGCAAATTGATATTTGAAATATGAAATATGAAGGTCGAGTAAAATGAAGAGTTTCAATATGTGGCATGTATTCAAGTTTCAACCAAAAATATTGTAGGATCTCCTACTATTGCATTCTTATTCTCGTATATAATTAGCAACGCTCTGGAGTAGCAAATAAGACATAACTTGATTAACATTTTTGGAACTTAGATTTAACCTACTCTTGATATTCTGCGTAGGTGGGGTTTTCTTTCCCGTGCAGGCCTATGGGATAAAAGCCATTTGATGAGACAAATTGAGAAGTAAGCAATTCTTCCTTCTCCTATATCATATTTCAATCGGCATAAAATTGTAGGTTATAATAATATATTAGCAATGGTTTTCTTCCCAAATAATTGCAGGTATGCAGATATATACACTTCAAGGGTTTCAAACTTCTTAAACTACACGCCATTCACGTATTTCCGCTCGCAAGAACAGGTTTAGCAGTGTTTATTATTGAAATTTGCACTTTTATTGAGAAAGTCCATCTTCTGCCTCTTCTTTTCACAACTATGCATCTTATATTAACTCTAGACTTCGAGCACTCACATTAGTGCTCTTAAATTTTGTAGTATTTGCTATTAGCATAGTAATTTAGAATGACTTGCTGTTTTTGCAGACATTGGCTCATGACTCATATTCATTCTACTGTTCGCATGAGGAAACTACTATTGACAAATAA

mRNA sequence

ATGTGGGAGCCCTCCTTGGTGGCGTGGTCATTTTTCATTCTTACATTGTTGTTCACTACTTTTCTTCCAAACATATTAAAAAATAAAAAGTACAATGATCAAAATGATCAAAGTCGAACAAGATACAAATTTCCTTCTGGGAGAAGAAGTTGGCCTGTTATAGGTGATAGCTTCAACTGGTACTCTGCCATTGCAAGTTCGCATCCTTCCAAGTACGTTGAAGAACAAGCCAAAAGATACGGGAAGATCTTCTCGTGTCGTGTTTTTGGCAAATGTACAGTGGTGTCGGTCGATCCAGATTTTAATAGGTATGTGATGCAAAACGAAGGGATATTGTTTCAATCGAGCTATCCCAAGTCTTTTCGAGATTTAGTAGGCGTAAACGGTGTTATCACGGCTCAAGGAGAACAACAGAGGAAACTCCATGGAATTGCTTCCAATTTTATGCGTATAAACAAACTCAACTTTAGCTTCATAAAAGAAATTCAAACCATAATCATCCATTCCTTGACCACTTTCCATGACCGTCATCAAATCATCTCTCTCCAAGATGCTTGCAGGAAGATTGCCATTAATCTTATGGTAAGTCAGCTACTGGGTGTGTCAAGTGATTCGGAAGTGAACGAGATCAGTGGTTTATTTTACAACTTTGTAGATGGTTGTCTTTCTTTCCCTATCAATTTACCTCCCTTTGCCTATCACTCTGCCATGAAGGCAAGGAAGCAAATCATAAGAAAAATAAAGAAGAAAATAGAGATGCAGAAATCAATGGAAAAGAGTAGTAGTAGTGGGCATGGAGTTCTTGGAAGACTATTAGAGGAACAAAAGCACTTATGTGATGAGGCAGTTGAAGATTTCATAATCAATCTTCTTTTTGCCGGCAATGAAACCACTTCTAAGACAATGCTTTTCGCAGTCTATTACCTAACCCACTCACCTAAAGCATTTAAGCAACTCATGGAAGAACAAGATGGCTTGAGAAATAAGTGTGGAGACAATACAATCACATGGGAGGATTATAAATCAATGTCATTTACCCAATGTGTTATTGATGAGACACTTCGACTCGGAGGAATTGCAATTTGGTTGATGAGAGAGACAAAAGAAGAGATTAAATACTCAGATTATGTGATTCCAAAGGGAACTTTTGTTGTTCCATTTCTTTCTGCTGTACATTTGGATGAGAATATTTATGATGAAGCTTTAACCTTTAATCCTTGGAGATGGATGCAGCTTCAAAATCAGGAAAAGAGAAATTGGAAAAGTAGCCCATATTTTGCACCATTTGGAGGAGGGGGTAGATTGTGCCCGGGAGCTGAATTGGCTCATTTTACCGGAAATCTGCCGGACGGCGAAATCCTAATGGCCGTGGCCGCTTCTCTAGGGAACCCCGCTCTTACCAGAGCCACGTTACTTAGTCCTGTAAACAAACAGCATTTAACTTCACGCAGGGCTCAAAAGATGACGCTGTGTTTGTGCGCTATGGACTCCAAATCAGTTGGTGTTGGAGGTGATGTATTTTCTGTTACGTCATCGGCTAAGTCCGGAGTTGATTACCTTGGACAGAGCACTAAAGGAGATATGAATGTTAAACTCGAGCATCTTGATGCTTTTGGTGTAGACGGCGAAGAAACCTTAGAAGGTCCAATCGAGGAAGTTGCGAGGGTGGAGGCACATGAAGCCGAGGATCTTCTTAGAGATTTGGGAATCCCGAGTCCTTCATCAAGAAATTCACTACATGGTATATTTTGTAGCCGAACATTGAATCTTCGATCAATCAGTGCCATTGGATACGACATGGACTACACTCTGATGCATTATAATGTCATGGCTTGGGAGGGAAGAGCGTATGACTATTGTATGGAAAACTTAAGGAACATGGGTTTCCCAGTCAATGGGCTTGCATTTGACCCAGACTTGGTTATAAGAGGCCTTGTCATAGATAAAGAGAGGGGTAACTTAGTAAAGGCTGATCGTTTTGGTTATATTAAAAGAGCAATGCATGGAACAAAAATGCTGTCTACTCGAGATGTGAGTGAGATTTATGGGAGGGAATTGGTGGATCTGCGGAAGGAGAATCGATGGGAGTTCCTAAATACATTGTTTTCAGTATCAGAGGCTGTTGCTTATATGCAGATGGTAGACAGATTGGACGATGGAGCCATAGGAGCAGCAATTGGTCCACTTGATTATAAAGGACTCTACAAGGCTGTTGGGAAAGCACTATTCAGGGCACATGTCGAGGGTCAGCTTAAGAGTGAGATAATGTCTAATCCTGAATTATTTGTCGAGCCTGATCCTGTACTACCATTAACCCTTTTGGATCAAAAAGAGGCTGGAAAAAAGCTTTTGCTTATCACCAACTCGGATTATCACTACACAAATAAAATGATGCAGCATTCCTTCAACAAATTTCTTCCAAATGATATGGGATGGCGAGATCTATTTGATATTGTAATAGTCTCAGCAAGAAAGCCAGAATTTTTCCAAATGTCTCATCCATTGTACGAGGTGGTGACTGGTGAGGGTCTCATGCGCCCATGCTTCAAGGCTGTTGCAGGGGGCTTGTACTCTGGTGGTAGCGCACAAATGATTGAGAATTCTTTAAACATTCATGGTGATGAGATACTGTATGTTGGTGATCATATCTATACCGATGTAAGCCAATCTAAAGTACACTTGCGCTGGAGAACAGCATTAATTCTACGAGAATTGGAAGAAGAGTATAGTGCTTTGATTCATAGTCGTGGCCATCGAGCATCTCTCATTGAGCTTATAAATCAAAAGGAAGTTGTGGGGGATCTATTTAATCAACTTCGGCTTGCTTTGCAAAGGCGTACTCAAGGCCGCCCTGCTCAAACCCTAGCAGCTACTAACATGAATGATGAGGAACTCACTGAAAGCATGCAAAAGCTGCTCATAGTTATGCAAAGATTAGACCAGAAAATTGCTCCAATGCTAGAAGCAGATGGAGAGCTATTCAACAAAAGGTGGGGTTTTCTTTCCCGTGCAGGCCTATGGGATAAAAGCCATTTGATGAGACAAATTGAGAAGTATGCAGATATATACACTTCAAGGGTTTCAAACTTCTTAAACTACACGCCATTCACGTATTTCCGCTCGCAAGAACAGACATTGGCTCATGACTCATATTCATTCTACTGTTCGCATGAGGAAACTACTATTGACAAATAA

Coding sequence (CDS)

ATGTGGGAGCCCTCCTTGGTGGCGTGGTCATTTTTCATTCTTACATTGTTGTTCACTACTTTTCTTCCAAACATATTAAAAAATAAAAAGTACAATGATCAAAATGATCAAAGTCGAACAAGATACAAATTTCCTTCTGGGAGAAGAAGTTGGCCTGTTATAGGTGATAGCTTCAACTGGTACTCTGCCATTGCAAGTTCGCATCCTTCCAAGTACGTTGAAGAACAAGCCAAAAGATACGGGAAGATCTTCTCGTGTCGTGTTTTTGGCAAATGTACAGTGGTGTCGGTCGATCCAGATTTTAATAGGTATGTGATGCAAAACGAAGGGATATTGTTTCAATCGAGCTATCCCAAGTCTTTTCGAGATTTAGTAGGCGTAAACGGTGTTATCACGGCTCAAGGAGAACAACAGAGGAAACTCCATGGAATTGCTTCCAATTTTATGCGTATAAACAAACTCAACTTTAGCTTCATAAAAGAAATTCAAACCATAATCATCCATTCCTTGACCACTTTCCATGACCGTCATCAAATCATCTCTCTCCAAGATGCTTGCAGGAAGATTGCCATTAATCTTATGGTAAGTCAGCTACTGGGTGTGTCAAGTGATTCGGAAGTGAACGAGATCAGTGGTTTATTTTACAACTTTGTAGATGGTTGTCTTTCTTTCCCTATCAATTTACCTCCCTTTGCCTATCACTCTGCCATGAAGGCAAGGAAGCAAATCATAAGAAAAATAAAGAAGAAAATAGAGATGCAGAAATCAATGGAAAAGAGTAGTAGTAGTGGGCATGGAGTTCTTGGAAGACTATTAGAGGAACAAAAGCACTTATGTGATGAGGCAGTTGAAGATTTCATAATCAATCTTCTTTTTGCCGGCAATGAAACCACTTCTAAGACAATGCTTTTCGCAGTCTATTACCTAACCCACTCACCTAAAGCATTTAAGCAACTCATGGAAGAACAAGATGGCTTGAGAAATAAGTGTGGAGACAATACAATCACATGGGAGGATTATAAATCAATGTCATTTACCCAATGTGTTATTGATGAGACACTTCGACTCGGAGGAATTGCAATTTGGTTGATGAGAGAGACAAAAGAAGAGATTAAATACTCAGATTATGTGATTCCAAAGGGAACTTTTGTTGTTCCATTTCTTTCTGCTGTACATTTGGATGAGAATATTTATGATGAAGCTTTAACCTTTAATCCTTGGAGATGGATGCAGCTTCAAAATCAGGAAAAGAGAAATTGGAAAAGTAGCCCATATTTTGCACCATTTGGAGGAGGGGGTAGATTGTGCCCGGGAGCTGAATTGGCTCATTTTACCGGAAATCTGCCGGACGGCGAAATCCTAATGGCCGTGGCCGCTTCTCTAGGGAACCCCGCTCTTACCAGAGCCACGTTACTTAGTCCTGTAAACAAACAGCATTTAACTTCACGCAGGGCTCAAAAGATGACGCTGTGTTTGTGCGCTATGGACTCCAAATCAGTTGGTGTTGGAGGTGATGTATTTTCTGTTACGTCATCGGCTAAGTCCGGAGTTGATTACCTTGGACAGAGCACTAAAGGAGATATGAATGTTAAACTCGAGCATCTTGATGCTTTTGGTGTAGACGGCGAAGAAACCTTAGAAGGTCCAATCGAGGAAGTTGCGAGGGTGGAGGCACATGAAGCCGAGGATCTTCTTAGAGATTTGGGAATCCCGAGTCCTTCATCAAGAAATTCACTACATGGTATATTTTGTAGCCGAACATTGAATCTTCGATCAATCAGTGCCATTGGATACGACATGGACTACACTCTGATGCATTATAATGTCATGGCTTGGGAGGGAAGAGCGTATGACTATTGTATGGAAAACTTAAGGAACATGGGTTTCCCAGTCAATGGGCTTGCATTTGACCCAGACTTGGTTATAAGAGGCCTTGTCATAGATAAAGAGAGGGGTAACTTAGTAAAGGCTGATCGTTTTGGTTATATTAAAAGAGCAATGCATGGAACAAAAATGCTGTCTACTCGAGATGTGAGTGAGATTTATGGGAGGGAATTGGTGGATCTGCGGAAGGAGAATCGATGGGAGTTCCTAAATACATTGTTTTCAGTATCAGAGGCTGTTGCTTATATGCAGATGGTAGACAGATTGGACGATGGAGCCATAGGAGCAGCAATTGGTCCACTTGATTATAAAGGACTCTACAAGGCTGTTGGGAAAGCACTATTCAGGGCACATGTCGAGGGTCAGCTTAAGAGTGAGATAATGTCTAATCCTGAATTATTTGTCGAGCCTGATCCTGTACTACCATTAACCCTTTTGGATCAAAAAGAGGCTGGAAAAAAGCTTTTGCTTATCACCAACTCGGATTATCACTACACAAATAAAATGATGCAGCATTCCTTCAACAAATTTCTTCCAAATGATATGGGATGGCGAGATCTATTTGATATTGTAATAGTCTCAGCAAGAAAGCCAGAATTTTTCCAAATGTCTCATCCATTGTACGAGGTGGTGACTGGTGAGGGTCTCATGCGCCCATGCTTCAAGGCTGTTGCAGGGGGCTTGTACTCTGGTGGTAGCGCACAAATGATTGAGAATTCTTTAAACATTCATGGTGATGAGATACTGTATGTTGGTGATCATATCTATACCGATGTAAGCCAATCTAAAGTACACTTGCGCTGGAGAACAGCATTAATTCTACGAGAATTGGAAGAAGAGTATAGTGCTTTGATTCATAGTCGTGGCCATCGAGCATCTCTCATTGAGCTTATAAATCAAAAGGAAGTTGTGGGGGATCTATTTAATCAACTTCGGCTTGCTTTGCAAAGGCGTACTCAAGGCCGCCCTGCTCAAACCCTAGCAGCTACTAACATGAATGATGAGGAACTCACTGAAAGCATGCAAAAGCTGCTCATAGTTATGCAAAGATTAGACCAGAAAATTGCTCCAATGCTAGAAGCAGATGGAGAGCTATTCAACAAAAGGTGGGGTTTTCTTTCCCGTGCAGGCCTATGGGATAAAAGCCATTTGATGAGACAAATTGAGAAGTATGCAGATATATACACTTCAAGGGTTTCAAACTTCTTAAACTACACGCCATTCACGTATTTCCGCTCGCAAGAACAGACATTGGCTCATGACTCATATTCATTCTACTGTTCGCATGAGGAAACTACTATTGACAAATAA
BLAST of CSPI06G00230 vs. Swiss-Prot
Match: C72B1_PINTA (Abietadienol/abietadienal oxidase OS=Pinus taeda GN=CYP720B1 PE=1 SV=1)

HSP 1 Score: 350.9 bits (899), Expect = 4.9e-95
Identity = 185/439 (42.14%), Postives = 278/439 (63.33%), Query Frame = 1

Query: 13  ILTLLFTTFLPNI-LKNKKYNDQNDQSRTRYK------FPSGRRSWPVIGDSFNWYSAIA 72
           +L ++FT  +  + L  + +N Q  Q RT  +       P G   WP+IG+++++Y ++ 
Sbjct: 7   LLLVVFTAAVALLHLIYRWWNAQRGQKRTSNEKNQELHLPPGSTGWPLIGETYSYYRSMT 66

Query: 73  SSHPSKYVEEQAKRYGK-IFSCRVFGKCTVVSVDPDFNRYVMQNEGILFQSSYPKSFRDL 132
           S+ P ++++++ KRY   +F   +FG   V+S DP FN+YV+QNEG  FQ+ YPK+ + L
Sbjct: 67  SNRPRQFIDDREKRYDSDVFVSHLFGSQAVISSDPQFNKYVLQNEGRFFQAHYPKALKAL 126

Query: 133 VGVNGVITAQGEQQRKLHGIASNFMRINKLNFSFIKEIQTIIIHSLTTFHDRHQIISLQD 192
           +G  G+++  G+ QRKLHGIA N +R  +L F F++EIQ ++  +L  + D+ +I +LQ+
Sbjct: 127 IGDYGLLSVHGDLQRKLHGIAVNLLRFERLKFDFMEEIQNLVHSTLDRWVDKKEI-ALQN 186

Query: 193 ACRKIAINLMVSQLLGVSSDSEVNEISGLFYNFVDGCLSFPINLPPFAYHSAMKARKQII 252
            C ++ +NLM  QLL +S   E NEI  LF ++ +  ++ PI +P   Y   +KAR+ +I
Sbjct: 187 ECHQMVLNLMAKQLLDLSPSKETNEICELFVDYTNAVIAIPIKIPGSTYAKGLKARELLI 246

Query: 253 RKIKKKIEMQKSMEKSSSSGHGVLGRLLEEQKHLCDEAVEDFIINLLFAGNETTSKTMLF 312
           RKI   I+ ++  +        +L +LLEE   + DE + DFI+ LLFAG+ET+S+ M F
Sbjct: 247 RKISNMIKERR--DHPHIVHKDLLTKLLEEDS-ISDEIICDFILFLLFAGHETSSRAMTF 306

Query: 313 AVYYLTHSPKAFKQLMEEQDG-LRNKCGDNTITWEDYKSMSFTQCVIDETLRLGGIAIWL 372
           A+ +LT  PKA  Q+ EE D  L+ K G   + W+DYKSM FTQCVI+ETLRLG     +
Sbjct: 307 AIKFLTTCPKALTQMKEEHDAILKAKGGHKKLEWDDYKSMKFTQCVINETLRLGNFGPGV 366

Query: 373 MRETKEEIKYSDYVIPKGTFVVPFLSAVHLDENIYDEALTFNPWRWMQLQNQEKRNWKSS 432
            RETKE+ K  D +IPKG  V  FL+A HLDE  ++EALTFNPWRW     +  ++  ++
Sbjct: 367 FRETKEDTKVKDCLIPKGWVVFAFLTATHLDEKFHNEALTFNPWRW-----ELDQDVSNN 426

Query: 433 PYFAPFGGGGRLCPGAELA 443
             F+PFGGG RLCPG+ LA
Sbjct: 427 HLFSPFGGGARLCPGSHLA 436

BLAST of CSPI06G00230 vs. Swiss-Prot
Match: C72B2_PINTA (Cytochrome P450 720B2 OS=Pinus taeda GN=CYP720B2 PE=2 SV=1)

HSP 1 Score: 320.1 bits (819), Expect = 9.3e-86
Identity = 170/412 (41.26%), Postives = 260/412 (63.11%), Query Frame = 1

Query: 38  SRTRYKFPSGRRSWPVIGDSFNWYSAIASS-HPSKYVEEQAKRYGKIFSCRVFGKCT-VV 97
           S   YK P G   WP+IG++ +++  I S+  P ++++E+ +RYG+IF   +FG+   VV
Sbjct: 38  SSRAYKLPPGSTGWPLIGETISFFRGINSTAQPRQFIQEREQRYGEIFRSNLFGRSRIVV 97

Query: 98  SVDPDFNRYVMQNEGILFQSSYPKSFRDLVGVNGVITAQGEQQRKLHGIASNFMRINKLN 157
           SVDP+FN++V+Q+EG  FQ++YPK  R+L+G  G+++  G+ QRKLHG A N +R  +L+
Sbjct: 98  SVDPEFNKHVLQHEGRQFQANYPKPLRNLIGKYGLLSVHGDLQRKLHGAAVNLLRFERLS 157

Query: 158 FSFIKEIQTIIIHSLTTFHDRHQIISLQDACRKIAINLMVSQLLGVSSDSEVNEISGLFY 217
             F+++IQ ++  +L  +  +   I LQ+ C ++ +NLM  QLL +S   +  EI   F 
Sbjct: 158 VDFMEDIQNLLHITLAKWEAKRD-IHLQEECHQLVLNLMAKQLLDLSPSKDTEEICEAFG 217

Query: 218 NFVDGCLSFPINLPPFAYHSAMKARKQIIRKIKKKIEMQKSMEKSSSSGHGVLGRLLEEQ 277
           +F +  L+ PI +P   Y    KAR+ +I+KI + IE ++  +   +  + +L +LL+E 
Sbjct: 218 HFSEALLAVPIKIPGTKYARGFKAREFLIKKIYESIEDRR--QHPEAVHNDLLTKLLKED 277

Query: 278 KHLCDEAVEDFIINLLFAGNETTSKTMLFAVYYLTHSPKAFKQLMEEQDGLRNKCG---D 337
               +E + DFI+ LLFAG+ET+S++M FA+ +LT  P+A ++L  E D L  + G   +
Sbjct: 278 S-FSEEIIADFILFLLFAGHETSSRSMSFAIKFLTDCPRALEELKAEHDALLKRKGNLKN 337

Query: 338 NTITWEDYKSMSFTQCVIDETLRLGGIAIWLMRETKEEIK-YSDYVIPKGTFVVPFLSAV 397
             + W+DY+S+ FTQCVI ETLR+G     + RETKE+IK    +VIP+G  V  FL+  
Sbjct: 338 QKLNWDDYQSLKFTQCVIHETLRVGNFGPGVFRETKEDIKTKGGFVIPRGWTVYVFLTGT 397

Query: 398 HLDENIYDEALTFNPWRWM-QLQNQEKRNWKSSPYFAPFGGGGRLCPGAELA 443
           HLDE  +  AL F+PWRW   LQ+QE      +P F PFGGG RLCPG  LA
Sbjct: 398 HLDEKYHSSALKFDPWRWQPHLQDQE---LLKNPSFMPFGGGARLCPGMHLA 442

BLAST of CSPI06G00230 vs. Swiss-Prot
Match: C90B1_ARATH (Cytochrome P450 90B1 OS=Arabidopsis thaliana GN=CYP90B1 PE=1 SV=2)

HSP 1 Score: 276.2 bits (705), Expect = 1.5e-72
Identity = 147/437 (33.64%), Postives = 245/437 (56.06%), Query Frame = 1

Query: 39  RTRYKFPSGRRSWPVIGDSFNWYSAIASSHPSKYVEEQAKRYGKIFSCRVFGKCTVVSVD 98
           +TR+  P G+  WP +G++  +     ++    ++++   +YGKI+   +FG+ T+VS D
Sbjct: 34  KTRFNLPPGKSGWPFLGETIGYLKPYTATTLGDFMQQHVSKYGKIYRSNLFGEPTIVSAD 93

Query: 99  PDFNRYVMQNEGILFQSSYPKSFRDLVGVNGVITAQGEQQRKLHGIASNFMRINKLNFSF 158
              NR+++QNEG LF+ SYP+S   ++G   ++   G+  R +  I+ NF+   +L    
Sbjct: 94  AGLNRFILQNEGRLFECSYPRSIGGILGKWSMLVLVGDMHRDMRSISLNFLSHARLRTIL 153

Query: 159 IKEIQTIIIHSLTTFHDRHQIISLQDACRKIAINLMVSQLLGVS-SDSEVNEISGLFYNF 218
           +K+++   +  L ++  ++ I S QD  +K   NLM   ++ +   + E  ++   +  F
Sbjct: 154 LKDVERHTLFVLDSWQ-QNSIFSAQDEAKKFTFNLMAKHIMSMDPGEEETEQLKKEYVTF 213

Query: 219 VDGCLSFPINLPPFAYHSAMKARKQIIRKIKKKIEMQKSMEKS----------------S 278
           + G +S P+NLP  AYH A+++R  I++ I++K+E +K   K                 S
Sbjct: 214 MKGVVSAPLNLPGTAYHKALQSRATILKFIERKMEERKLDIKEEDQEEEEVKTEDEAEMS 273

Query: 279 SSGH--------GVLGRLLEEQKHLCDEAVEDFIINLLFAGNETTSKTMLFAVYYLTHSP 338
            S H         +LG +L+   +L  E + D I++LLFAG+ET+S  +  A+++L   P
Sbjct: 274 KSDHVRKQRTDDDLLGWVLKHS-NLSTEQILDLILSLLFAGHETSSVAIALAIFFLQACP 333

Query: 339 KAFKQLMEEQDGL---RNKCGDNTITWEDYKSMSFTQCVIDETLRLGGIAIWLMRETKEE 398
           KA ++L EE   +   + + G++ + W+DYK M FTQCVI+ETLRLG +  +L R+  ++
Sbjct: 334 KAVEELREEHLEIARAKKELGESELNWDDYKKMDFTQCVINETLRLGNVVRFLHRKALKD 393

Query: 399 IKYSDYVIPKGTFVVPFLSAVHLDENIYDEALTFNPWRWMQLQNQEKRNWKSS-----PY 443
           ++Y  Y IP G  V+P +SAVHLD + YD+   FNPWRW Q  N    +   S       
Sbjct: 394 VRYKGYDIPSGWKVLPVISAVHLDNSRYDQPNLFNPWRWQQQNNGASSSGSGSFSTWGNN 453

BLAST of CSPI06G00230 vs. Swiss-Prot
Match: C90D1_ARATH (3-epi-6-deoxocathasterone 23-monooxygenase OS=Arabidopsis thaliana GN=CYP90D1 PE=2 SV=1)

HSP 1 Score: 261.9 bits (668), Expect = 3.0e-68
Identity = 142/451 (31.49%), Postives = 251/451 (55.65%), Query Frame = 1

Query: 5   SLVAWSFFILTLLFTTFLPNILKN-----KKYNDQNDQSRTRY-KFPSGRRSWPVIGDSF 64
           SL+ +SFF   ++      N L++     KK ND +  S++   KFP G   WPVIG++ 
Sbjct: 6   SLLFFSFFFFIIIVIFNKINGLRSSPASKKKLNDHHVTSQSHGPKFPHGSLGWPVIGETI 65

Query: 65  NWYSAIASSHPSKYVEEQAKRYGKIFSCRVFGKCTVVSVDPDFNRYVMQNEGILFQSSYP 124
            + S+  S  P  +++++   YG++F   +FG  T+VS D + NR V+Q++   F   YP
Sbjct: 66  EFVSSAYSDRPESFMDKRRLMYGRVFKSHIFGTATIVSTDAEVNRAVLQSDSTAFVPFYP 125

Query: 125 KSFRDLVGVNGVITAQGEQQRKLHGIASNFMRINKLNFSFIKEIQTIIIHSLTTFHDRHQ 184
           K+ R+L+G + ++   G   R+ HG+  +F++   L    ++++   +  S+  + +  Q
Sbjct: 126 KTVRELMGKSSILLINGSLHRRFHGLVGSFLKSPLLKAQIVRDMHKFLSESMDLWSE-DQ 185

Query: 185 IISLQDACRKIAINLMVSQLLGVSSDSEVNEISGLFYNFVDGCLSFPINLPPFAYHSAMK 244
            + LQD  + +A  ++   L+ V    ++ E+   F NF+ G +S PIN P    H +++
Sbjct: 186 PVLLQDVSKTVAFKVLAKALISVEKGEDLEELKREFENFISGLMSLPINFPGTQLHRSLQ 245

Query: 245 ARKQIIRKIKK----KIEMQKSMEKSSSSGHGVLGRLLEE-QKHLCDEAVEDFIINLLFA 304
           A+K +++++++    KI   K+ E+       V+  LL++  +HL    + + +I+++  
Sbjct: 246 AKKNMVKQVERIIEGKIRKTKNKEEDDVIAKDVVDVLLKDSSEHLTHNLIANNMIDMMIP 305

Query: 305 GNETTSKTMLFAVYYLTHSPKAFKQLMEEQDGLRN--KCGDNTITWEDYKSMSFTQCVID 364
           G+++    +  AV +L+ SP A   L EE   L++  +     + W DY S+ FTQ VI 
Sbjct: 306 GHDSVPVLITLAVKFLSDSPAALNLLTEENMKLKSLKELTGEPLYWNDYLSLPFTQKVIT 365

Query: 365 ETLRLGGIAIWLMRETKEEIKYSDYVIPKGTFVVPFLSAVHLDENIYDEALTFNPWRWMQ 424
           ETLR+G + I +MR+  ++++   YVIPKG   + +L +VHLD+  Y+    FNPWRW  
Sbjct: 366 ETLRMGNVIIGVMRKAMKDVEIKGYVIPKGWCFLAYLRSVHLDKLYYESPYKFNPWRW-- 425

Query: 425 LQNQEKRNWKSSPYFAPFGGGGRLCPGAELA 443
               ++R+  +S  F+PFGGG RLCPG +LA
Sbjct: 426 ----QERDMNTSS-FSPFGGGQRLCPGLDLA 448

BLAST of CSPI06G00230 vs. Swiss-Prot
Match: C90A1_ARATH (Cytochrome P450 90A1 OS=Arabidopsis thaliana GN=CYP90A1 PE=2 SV=1)

HSP 1 Score: 254.6 bits (649), Expect = 4.8e-66
Identity = 140/409 (34.23%), Postives = 219/409 (53.55%), Query Frame = 1

Query: 39  RTRYK---FPSGRRSWPVIGDSFNWYSAIASSHPSKYVEEQAKRYGKIFSCRVFGKCTVV 98
           RTRY+    P G    P+IG++F    A  + +P  +++E+  RYG +F   +FG+ T+ 
Sbjct: 23  RTRYRRMGLPPGSLGLPLIGETFQLIGAYKTENPEPFIDERVARYGSVFMTHLFGEPTIF 82

Query: 99  SVDPDFNRYVMQNEGILFQSSYPKSFRDLVGVNGVITAQGEQQRKLHGIASNFMRINKLN 158
           S DP+ NR+V+QNEG LF+ SYP S  +L+G + ++  +G   +++H +  +F   + + 
Sbjct: 83  SADPETNRFVLQNEGKLFECSYPASICNLLGKHSLLLMKGSLHKRMHSLTMSFANSSIIK 142

Query: 159 FSFIKEIQTIIIHSLTTFHDRHQIISLQDACRKIAINLMVSQLLGVSSDSEVNEISGLFY 218
              + +I  ++  +L ++  R   + L +  +KI   L V QL+          +   + 
Sbjct: 143 DHLMLDIDRLVRFNLDSWSSR---VLLMEEAKKITFELTVKQLMSFDPGEWSESLRKEYL 202

Query: 219 NFVDGCLSFPINLPPFAYHSAMKARKQIIRKIKKKI-EMQKSMEKSSSSGHGVLGRLLEE 278
             ++G  S P+ L    Y  A++AR+++   +   + + ++  E+ +     +L  LL  
Sbjct: 203 LVIEGFFSLPLPLFSTTYRKAIQARRKVAEALTVVVMKRREEEEEGAERKKDMLAALLAA 262

Query: 279 QKHLCDEAVEDFIINLLFAGNETTSKTMLFAVYYLTHSPKAFKQLMEEQDGLRNKCGDN- 338
                DE + DF++ LL AG ETTS  M  AV +LT +P A  QL EE + +R    D+ 
Sbjct: 263 DDGFSDEEIVDFLVALLVAGYETTSTIMTLAVKFLTETPLALAQLKEEHEKIRAMKSDSY 322

Query: 339 TITWEDYKSMSFTQCVIDETLRLGGIAIWLMRETKEEIKYSDYVIPKGTFVVPFLSAVHL 398
           ++ W DYKSM FTQCV++ETLR+  I   + R    +++   Y IPKG  V     AVHL
Sbjct: 323 SLEWSDYKSMPFTQCVVNETLRVANIIGGVFRRAMTDVEIKGYKIPKGWKVFSSFRAVHL 382

Query: 399 DENIYDEALTFNPWRWMQLQNQEKRNWKSSPYFAPFGGGGRLCPGAELA 443
           D N + +A TFNPWRW             S  F PFGGG RLCPG ELA
Sbjct: 383 DPNHFKDARTFNPWRW----QSNSVTTGPSNVFTPFGGGPRLCPGYELA 424

BLAST of CSPI06G00230 vs. TrEMBL
Match: A0A0A0K9W0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G001750 PE=4 SV=1)

HSP 1 Score: 1219.5 bits (3154), Expect = 0.0e+00
Identity = 609/610 (99.84%), Postives = 610/610 (100.00%), Query Frame = 1

Query: 455  MAVAASLGNPALTRATLLSPVNKQHLTSRRAQKMTLCLCAMDSKSVGVGGDVFSVTSSAK 514
            MAVAASLGNPALTRATLLSPVNKQHLTSRRAQKMTLCLCAMDSKSVGVGGDVFSVTSSAK
Sbjct: 1    MAVAASLGNPALTRATLLSPVNKQHLTSRRAQKMTLCLCAMDSKSVGVGGDVFSVTSSAK 60

Query: 515  SGVDYLGQSTKGDMNVKLEHLDAFGVDGEETLEGPIEEVARVEAHEAEDLLRDLGIPSPS 574
            SGVDYLGQSTKGDMNVKLEHLDAFGVDGEETLEGPIEEVARVEAHEAEDLLRDLGIPSPS
Sbjct: 61   SGVDYLGQSTKGDMNVKLEHLDAFGVDGEETLEGPIEEVARVEAHEAEDLLRDLGIPSPS 120

Query: 575  SRNSLHGIFCSRTLNLRSISAIGYDMDYTLMHYNVMAWEGRAYDYCMENLRNMGFPVNGL 634
            SRNSLHGIFCSRTLNLRSISAIGYDMDYTLMHYNVMAWEGRAYDYCMENLRNMGFPVNGL
Sbjct: 121  SRNSLHGIFCSRTLNLRSISAIGYDMDYTLMHYNVMAWEGRAYDYCMENLRNMGFPVNGL 180

Query: 635  AFDPDLVIRGLVIDKERGNLVKADRFGYIKRAMHGTKMLSTRDVSEIYGRELVDLRKENR 694
            AFDPDLVIRGLVIDKERGNLVKADRFGYIKRAMHGTKMLSTRDVSEIYGRELVDLRKENR
Sbjct: 181  AFDPDLVIRGLVIDKERGNLVKADRFGYIKRAMHGTKMLSTRDVSEIYGRELVDLRKENR 240

Query: 695  WEFLNTLFSVSEAVAYMQMVDRLDDGAIGAAIGPLDYKGLYKAVGKALFRAHVEGQLKSE 754
            WEFLNTLFSVSEAVAYMQMVDRLDDGAIGAAIGPLDYKGLYKAVGKALFRAHVEGQLKSE
Sbjct: 241  WEFLNTLFSVSEAVAYMQMVDRLDDGAIGAAIGPLDYKGLYKAVGKALFRAHVEGQLKSE 300

Query: 755  IMSNPELFVEPDPVLPLTLLDQKEAGKKLLLITNSDYHYTNKMMQHSFNKFLPNDMGWRD 814
            IMSNPELFVEPDPVLPLTLLDQKEAGKKLLLITNSDYHYTNKMMQHSFNKFLPNDMGWRD
Sbjct: 301  IMSNPELFVEPDPVLPLTLLDQKEAGKKLLLITNSDYHYTNKMMQHSFNKFLPNDMGWRD 360

Query: 815  LFDIVIVSARKPEFFQMSHPLYEVVTGEGLMRPCFKAVAGGLYSGGSAQMIENSLNIHGD 874
            LFDIVIVSARKPEFFQMSHPLYEVVTGEGLMRPCFKAVAGGLYSGGSAQMIENSLNIHGD
Sbjct: 361  LFDIVIVSARKPEFFQMSHPLYEVVTGEGLMRPCFKAVAGGLYSGGSAQMIENSLNIHGD 420

Query: 875  EILYVGDHIYTDVSQSKVHLRWRTALILRELEEEYSALIHSRGHRASLIELINQKEVVGD 934
            EILYVGDHIYTDVSQSKVHLRWRTALILRELEEEYSALIHSRGHRASLIELINQKEVVGD
Sbjct: 421  EILYVGDHIYTDVSQSKVHLRWRTALILRELEEEYSALIHSRGHRASLIELINQKEVVGD 480

Query: 935  LFNQLRLALQRRTQGRPAQTLAATNMNDEELTESMQKLLIVMQRLDQKIAPMLEADGELF 994
            LFNQLRLALQRRTQGRPAQTLAATNMNDEELTESMQKLLIVMQRLDQK+APMLEADGELF
Sbjct: 481  LFNQLRLALQRRTQGRPAQTLAATNMNDEELTESMQKLLIVMQRLDQKVAPMLEADGELF 540

Query: 995  NKRWGFLSRAGLWDKSHLMRQIEKYADIYTSRVSNFLNYTPFTYFRSQEQTLAHDSYSFY 1054
            NKRWGFLSRAGLWDKSHLMRQIEKYADIYTSRVSNFLNYTPFTYFRSQEQTLAHDSYSFY
Sbjct: 541  NKRWGFLSRAGLWDKSHLMRQIEKYADIYTSRVSNFLNYTPFTYFRSQEQTLAHDSYSFY 600

Query: 1055 CSHEETTIDK 1065
            CSHEETTIDK
Sbjct: 601  CSHEETTIDK 610

BLAST of CSPI06G00230 vs. TrEMBL
Match: M5XV85_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002751mg PE=4 SV=1)

HSP 1 Score: 1009.6 bits (2609), Expect = 2.8e-291
Identity = 505/608 (83.06%), Postives = 547/608 (89.97%), Query Frame = 1

Query: 453  ILMAVAASLGNPALTRATLLSPVNKQHLTSRRAQKMTLCLCAMDSKSVGVGGD---VFSV 512
            + MA    L +P   R    S ++ ++L      K   C C+  + S   G D   VFS+
Sbjct: 18   VSMAATNYLRSPVRIRPMSRSILSSKNLLWNYGMK---CQCSSSTSSSSSGVDEKSVFSL 77

Query: 513  TSSAKSGVDYLGQSTKGDMNVKLEHLDAFGVDGEETLEGPIEEVARVEAHEAEDLLRDLG 572
            TSS+K  VDYLG+ TKGD+NVK+EHL+AFG+D + TL+GPIEEVARVEA EAEDLLRDLG
Sbjct: 78   TSSSKYEVDYLGEKTKGDLNVKVEHLEAFGIDSQATLKGPIEEVARVEAEEAEDLLRDLG 137

Query: 573  IPSP-SSRNSLHGIFCSRTLNLRSISAIGYDMDYTLMHYNVMAWEGRAYDYCMENLRNMG 632
            IP+P SSR S  GIFCSRTLNLRSISAIGYDMDYTLMHYNV+AWEGRAYDYCMENL+ +G
Sbjct: 138  IPTPFSSRQSPRGIFCSRTLNLRSISAIGYDMDYTLMHYNVIAWEGRAYDYCMENLKKVG 197

Query: 633  FPVNGLAFDPDLVIRGLVIDKERGNLVKADRFGYIKRAMHGTKMLSTRDVSEIYGRELVD 692
            FPV+GLAFDPDLVIRGLVIDKE+GNLVKADRFGY+KRAMHGTKMLS R VSE+YGRELVD
Sbjct: 198  FPVDGLAFDPDLVIRGLVIDKEKGNLVKADRFGYVKRAMHGTKMLSNRAVSEMYGRELVD 257

Query: 693  LRKENRWEFLNTLFSVSEAVAYMQMVDRLDDGAIGAAIGPLDYKGLYKAVGKALFRAHVE 752
            LRKE+RWEFLNTLFSVSEAVAYMQMVDRLDDG I A +GPLDYKGLYKAVG+ALFRAHVE
Sbjct: 258  LRKESRWEFLNTLFSVSEAVAYMQMVDRLDDGTIAAQLGPLDYKGLYKAVGRALFRAHVE 317

Query: 753  GQLKSEIMSNPELFVEPDPVLPLTLLDQKEAGKKLLLITNSDYHYTNKMMQHSFNKFLPN 812
            GQLKSEIMS PELFV PDP LPL LLDQKEAGKKLLLITNSDYHYT+KMMQHSFN+FLPN
Sbjct: 318  GQLKSEIMSKPELFVTPDPELPLALLDQKEAGKKLLLITNSDYHYTDKMMQHSFNRFLPN 377

Query: 813  DMGWRDLFDIVIVSARKPEFFQMSHPLYEVVTGEGLMRPCFKAVAGGLYSGGSAQMIENS 872
            DMGWRDLFDIVIVSARKPEFFQMSHP+YEVVTGEGLMRPCFKA  GGLYSGGSAQM+ENS
Sbjct: 378  DMGWRDLFDIVIVSARKPEFFQMSHPMYEVVTGEGLMRPCFKAKTGGLYSGGSAQMVENS 437

Query: 873  LNIHGDEILYVGDHIYTDVSQSKVHLRWRTALILRELEEEYSALIHSRGHRASLIELINQ 932
            LNIHGDEILYVGDHIYTDVSQSKVHLRWRTALI RELEEE+SALIHSRGHRASL+ELINQ
Sbjct: 438  LNIHGDEILYVGDHIYTDVSQSKVHLRWRTALICRELEEEFSALIHSRGHRASLVELINQ 497

Query: 933  KEVVGDLFNQLRLALQRRTQGRPAQTLAATNMNDEELTESMQKLLIVMQRLDQKIAPMLE 992
            KEV+GDLFNQLRLA QRRT+GRPAQTLAATN++D+EL+ESMQKLLIVMQRLDQKIAPMLE
Sbjct: 498  KEVIGDLFNQLRLASQRRTKGRPAQTLAATNLDDQELSESMQKLLIVMQRLDQKIAPMLE 557

Query: 993  ADGELFNKRWGFLSRAGLWDKSHLMRQIEKYADIYTSRVSNFLNYTPFTYFRSQEQTLAH 1052
            ADGELFNKRWGFLSRAG WDKSHLMRQIEKYADIYTSRVSNFL+YTPF YFRSQEQTLAH
Sbjct: 558  ADGELFNKRWGFLSRAGFWDKSHLMRQIEKYADIYTSRVSNFLHYTPFMYFRSQEQTLAH 617

Query: 1053 DSYSFYCS 1057
            DSYS+YCS
Sbjct: 618  DSYSYYCS 622

BLAST of CSPI06G00230 vs. TrEMBL
Match: A0A0B0MWT3_GOSAR (Cytosolic purine 5'-nucleotidase OS=Gossypium arboreum GN=F383_27032 PE=4 SV=1)

HSP 1 Score: 1005.4 bits (2598), Expect = 5.3e-290
Identity = 498/582 (85.57%), Postives = 537/582 (92.27%), Query Frame = 1

Query: 476  NKQHLTSRRAQKMTLCLCAMDSKSVGVGGDVFSVTSSAKSGVDYLGQSTKGDMNVKLEHL 535
            + ++L  R  + +  C C   S S  VGGD FS+TSS+K  VDYLG+STKGD+N+ L+HL
Sbjct: 35   SSKYLVMRAPKMVFKCGC---SSSSSVGGDAFSLTSSSKCDVDYLGESTKGDLNINLKHL 94

Query: 536  DAFGVDGEETLEGPIEEVARVEAHEAEDLLRDLGIPSPSS-RNSLHGIFCSRTLNLRSIS 595
            + FG+DG+ TLEGPIE+VAR+EA EA  LLRDLGIPSPS+ R S  GIFC+RTLNLRSIS
Sbjct: 95   ENFGLDGQATLEGPIEQVARLEAEEAGSLLRDLGIPSPSAARLSPRGIFCTRTLNLRSIS 154

Query: 596  AIGYDMDYTLMHYNVMAWEGRAYDYCMENLRNMGFPVNGLAFDPDLVIRGLVIDKERGNL 655
            AIGYDMDYTL+HYNVMAWEGRAYDYCMENL++MGFPV GLAFDPDLVIRGLVIDKE+GNL
Sbjct: 155  AIGYDMDYTLIHYNVMAWEGRAYDYCMENLKSMGFPVEGLAFDPDLVIRGLVIDKEKGNL 214

Query: 656  VKADRFGYIKRAMHGTKMLSTRDVSEIYGRELVDLRKENRWEFLNTLFSVSEAVAYMQMV 715
            VKADRFGY+KRAMHGTKMLSTR VSE+YGRELVDLRKE++WEFLNTLFSVSEAVAYMQMV
Sbjct: 215  VKADRFGYVKRAMHGTKMLSTRAVSEMYGRELVDLRKESQWEFLNTLFSVSEAVAYMQMV 274

Query: 716  DRLDDGAIGAAIGPLDYKGLYKAVGKALFRAHVEGQLKSEIMSNPELFVEPDPVLPLTLL 775
            DRLDDG I A +GPLDYKGLYKAVGKALFRAHVEGQLKSEIMS PELFVEPDP LPL LL
Sbjct: 275  DRLDDGVIPADLGPLDYKGLYKAVGKALFRAHVEGQLKSEIMSKPELFVEPDPELPLALL 334

Query: 776  DQKEAGKKLLLITNSDYHYTNKMMQHSFNKFLPNDMGWRDLFDIVIVSARKPEFFQMSHP 835
            DQKEAGKKLLLITNSDYHYT+KMMQHSFN+FLPNDMGWRDLFD+VIVSARKPEFFQMSHP
Sbjct: 335  DQKEAGKKLLLITNSDYHYTDKMMQHSFNRFLPNDMGWRDLFDMVIVSARKPEFFQMSHP 394

Query: 836  LYEVVTGEGLMRPCFKAVAGGLYSGGSAQMIENSLNIHGDEILYVGDHIYTDVSQSKVHL 895
            LYEVVTGEGLMRPCFK   GGLYSGGSAQM+ENSLNIHGDEILYVGDHIYTDVSQSKVHL
Sbjct: 395  LYEVVTGEGLMRPCFKTRTGGLYSGGSAQMVENSLNIHGDEILYVGDHIYTDVSQSKVHL 454

Query: 896  RWRTALILRELEEEYSALIHSRGHRASLIELINQKEVVGDLFNQLRLALQRRTQGRPAQT 955
            RWRTALI RELEEEY ALIHSRG RA+++ELINQKEVVGDLFNQLRLALQRRT+GRPAQT
Sbjct: 455  RWRTALICRELEEEYRALIHSRGPRATVVELINQKEVVGDLFNQLRLALQRRTEGRPAQT 514

Query: 956  LAATNMNDEELTESMQKLLIVMQRLDQKIAPMLEADGELFNKRWGFLSRAGLWDKSHLMR 1015
            LAATNM+D ELTESMQKLLIVMQRLD+KIAP+LEADGELFNKRWGFLSRAGLWDKSHLMR
Sbjct: 515  LAATNMDDRELTESMQKLLIVMQRLDEKIAPLLEADGELFNKRWGFLSRAGLWDKSHLMR 574

Query: 1016 QIEKYADIYTSRVSNFLNYTPFTYFRSQEQTLAHDSYSFYCS 1057
            QIEKYADIYTSRVSNFLNYTPF YFRSQEQTLAHD+YS YCS
Sbjct: 575  QIEKYADIYTSRVSNFLNYTPFMYFRSQEQTLAHDTYSHYCS 613

BLAST of CSPI06G00230 vs. TrEMBL
Match: A0A0D2SC88_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_005G073300 PE=4 SV=1)

HSP 1 Score: 1004.2 bits (2595), Expect = 1.2e-289
Identity = 497/582 (85.40%), Postives = 537/582 (92.27%), Query Frame = 1

Query: 476  NKQHLTSRRAQKMTLCLCAMDSKSVGVGGDVFSVTSSAKSGVDYLGQSTKGDMNVKLEHL 535
            + ++L  R  + +  C C   S S  VGGD FS+TSS+K  VDYLG+STKGD+N+ L+HL
Sbjct: 19   SSKYLVMRAPKMVFKCGC---SSSSSVGGDAFSLTSSSKCDVDYLGESTKGDLNINLKHL 78

Query: 536  DAFGVDGEETLEGPIEEVARVEAHEAEDLLRDLGIPSPSS-RNSLHGIFCSRTLNLRSIS 595
            + FG+DG+ TLEGPIE+VAR+EA EA  LLRDLGIPSPS+ R S  G+FC+RTLNLRSIS
Sbjct: 79   ENFGLDGQATLEGPIEQVARLEAEEAGSLLRDLGIPSPSAARLSPRGMFCTRTLNLRSIS 138

Query: 596  AIGYDMDYTLMHYNVMAWEGRAYDYCMENLRNMGFPVNGLAFDPDLVIRGLVIDKERGNL 655
            AIGYDMDYTL+HYNVMAWEGRAYDYCMENL++MGFPV GLAFDPDLVIRGLVIDKE+GNL
Sbjct: 139  AIGYDMDYTLIHYNVMAWEGRAYDYCMENLKSMGFPVEGLAFDPDLVIRGLVIDKEKGNL 198

Query: 656  VKADRFGYIKRAMHGTKMLSTRDVSEIYGRELVDLRKENRWEFLNTLFSVSEAVAYMQMV 715
            VKADRFGY+KRAMHGTKMLSTR VSE+YGRELVDLRKE++WEFLNTLFSVSEAVAYMQMV
Sbjct: 199  VKADRFGYVKRAMHGTKMLSTRAVSEMYGRELVDLRKESQWEFLNTLFSVSEAVAYMQMV 258

Query: 716  DRLDDGAIGAAIGPLDYKGLYKAVGKALFRAHVEGQLKSEIMSNPELFVEPDPVLPLTLL 775
            DRLDDG I A +GPLDYKGLYKAVGKALFRAHVEGQLKSEIMS PELFVEPDP LPL LL
Sbjct: 259  DRLDDGVIPADLGPLDYKGLYKAVGKALFRAHVEGQLKSEIMSKPELFVEPDPELPLALL 318

Query: 776  DQKEAGKKLLLITNSDYHYTNKMMQHSFNKFLPNDMGWRDLFDIVIVSARKPEFFQMSHP 835
            DQKEAGKKLLLITNSDYHYT+KMMQHSFN+FLPNDMGWRDLFD+VIVSARKPEFFQMSHP
Sbjct: 319  DQKEAGKKLLLITNSDYHYTDKMMQHSFNRFLPNDMGWRDLFDMVIVSARKPEFFQMSHP 378

Query: 836  LYEVVTGEGLMRPCFKAVAGGLYSGGSAQMIENSLNIHGDEILYVGDHIYTDVSQSKVHL 895
            LYEVVTGEGLMRPCFK   GGLYSGGSAQM+ENSLNIHGDEILYVGDHIYTDVSQSKVHL
Sbjct: 379  LYEVVTGEGLMRPCFKTRTGGLYSGGSAQMVENSLNIHGDEILYVGDHIYTDVSQSKVHL 438

Query: 896  RWRTALILRELEEEYSALIHSRGHRASLIELINQKEVVGDLFNQLRLALQRRTQGRPAQT 955
            RWRTALI RELEEEY ALIHSRG RA+++ELINQKEVVGDLFNQLRLALQRRT+GRPAQT
Sbjct: 439  RWRTALICRELEEEYKALIHSRGPRATVVELINQKEVVGDLFNQLRLALQRRTKGRPAQT 498

Query: 956  LAATNMNDEELTESMQKLLIVMQRLDQKIAPMLEADGELFNKRWGFLSRAGLWDKSHLMR 1015
            LAATNM+D ELTESMQKLLIVMQRLD+KIAP+LEADGELFNKRWGFLSRAGLWDKSHLMR
Sbjct: 499  LAATNMDDRELTESMQKLLIVMQRLDEKIAPLLEADGELFNKRWGFLSRAGLWDKSHLMR 558

Query: 1016 QIEKYADIYTSRVSNFLNYTPFTYFRSQEQTLAHDSYSFYCS 1057
            QIEKYADIYTSRVSNFLNYTPF YFRSQEQTLAHD+YS YCS
Sbjct: 559  QIEKYADIYTSRVSNFLNYTPFMYFRSQEQTLAHDTYSHYCS 597

BLAST of CSPI06G00230 vs. TrEMBL
Match: A0A061EPF8_THECC (HAD-superfamily hydrolase, subfamily IG, 5'-nucleotidase isoform 1 OS=Theobroma cacao GN=TCM_021506 PE=4 SV=1)

HSP 1 Score: 999.2 bits (2582), Expect = 3.8e-288
Identity = 494/564 (87.59%), Postives = 527/564 (93.44%), Query Frame = 1

Query: 493  CAMDSKSVGVGGDVFSVTSSAKSGVDYLGQSTKGDMNVKLEHLDAFGVDGEETLEGPIEE 552
            C   S       DVFS+TSS+K  VDYLGQSTKGD+NV LEHL+AFG+DG+ TLEGPIE+
Sbjct: 34   CGCSSSGSSGQEDVFSLTSSSKYDVDYLGQSTKGDLNVNLEHLEAFGLDGQATLEGPIEQ 93

Query: 553  VARVEAHEAEDLLRDLGIPSPSS-RNSLHGIFCSRTLNLRSISAIGYDMDYTLMHYNVMA 612
            VAR+E  EAE LLRDLGIPSPS+ R S  GIFCSRTLNLRSISAIGYDMDYTL+ YNVMA
Sbjct: 94   VARMETEEAEGLLRDLGIPSPSAVRLSPRGIFCSRTLNLRSISAIGYDMDYTLIQYNVMA 153

Query: 613  WEGRAYDYCMENLRNMGFPVNGLAFDPDLVIRGLVIDKERGNLVKADRFGYIKRAMHGTK 672
            WEGRAYDYCM+NL+NMGFPV GLAFDPDLVIRGLVIDKE+GNLVKADRFGY+KRAMHGTK
Sbjct: 154  WEGRAYDYCMDNLKNMGFPVEGLAFDPDLVIRGLVIDKEKGNLVKADRFGYVKRAMHGTK 213

Query: 673  MLSTRDVSEIYGRELVDLRKENRWEFLNTLFSVSEAVAYMQMVDRLDDGAIGAAIGPLDY 732
            MLSTR VSE+YGRELVDLRKE+RWEFLNTLFSVSEAVAYMQMVDRLD+GAI   +GPLD 
Sbjct: 214  MLSTRAVSEMYGRELVDLRKESRWEFLNTLFSVSEAVAYMQMVDRLDEGAIPVDLGPLDC 273

Query: 733  KGLYKAVGKALFRAHVEGQLKSEIMSNPELFVEPDPVLPLTLLDQKEAGKKLLLITNSDY 792
            KGLYKAVGKALFRAHVEGQLKSEIMS PELFVEPDP LPL LLDQKEAGK+LLLITNSDY
Sbjct: 274  KGLYKAVGKALFRAHVEGQLKSEIMSKPELFVEPDPELPLALLDQKEAGKRLLLITNSDY 333

Query: 793  HYTNKMMQHSFNKFLPNDMGWRDLFDIVIVSARKPEFFQMSHPLYEVVTGEGLMRPCFKA 852
            HYT+KMM+HSFNKFLPNDMGWRDLFD+VIVSARKPEFFQMSHPLYEVVTGEGLMRPCFKA
Sbjct: 334  HYTDKMMRHSFNKFLPNDMGWRDLFDMVIVSARKPEFFQMSHPLYEVVTGEGLMRPCFKA 393

Query: 853  VAGGLYSGGSAQMIENSLNIHGDEILYVGDHIYTDVSQSKVHLRWRTALILRELEEEYSA 912
              GGLYSGGSAQM+ENSLNIHGDEILYVGDHIYTDVSQSKVHLRWRTALI RELEEEY+A
Sbjct: 394  QTGGLYSGGSAQMVENSLNIHGDEILYVGDHIYTDVSQSKVHLRWRTALICRELEEEYNA 453

Query: 913  LIHSRGHRASLIELINQKEVVGDLFNQLRLALQRRTQGRPAQTLAATNMNDEELTESMQK 972
            LIHSRGHRA+L+ELINQKE+VGDLFNQLRLALQRRT+ RPAQTLAATNM+D+ELTESMQK
Sbjct: 454  LIHSRGHRATLVELINQKEIVGDLFNQLRLALQRRTKERPAQTLAATNMDDQELTESMQK 513

Query: 973  LLIVMQRLDQKIAPMLEADGELFNKRWGFLSRAGLWDKSHLMRQIEKYADIYTSRVSNFL 1032
            LLIVMQRLD+KIAPML+ADGELFNKRWGFLSRAGLWDKSHLMRQIEKYADIYTSRVSNFL
Sbjct: 514  LLIVMQRLDEKIAPMLDADGELFNKRWGFLSRAGLWDKSHLMRQIEKYADIYTSRVSNFL 573

Query: 1033 NYTPFTYFRSQEQTLAHDSYSFYC 1056
            NYTPF YFRSQEQTLAHDSYS +C
Sbjct: 574  NYTPFMYFRSQEQTLAHDSYSHHC 597

BLAST of CSPI06G00230 vs. TAIR10
Match: AT5G48960.1 (AT5G48960.1 HAD-superfamily hydrolase, subfamily IG, 5'-nucleotidase)

HSP 1 Score: 948.7 bits (2451), Expect = 3.0e-276
Identity = 476/570 (83.51%), Postives = 516/570 (90.53%), Query Frame = 1

Query: 483  RRAQKMTLCLCA-MDSKSVGVGGDVFSVTSSAKSGVDYLGQSTKGDMNVKLEHLDAFGVD 542
            R + +M  C  A  D   V VG DVFSVT+S+K  VDYLGQSTKGD+N+KL+ L +FG D
Sbjct: 65   RGSLRMIKCRAAGADGGRVAVGDDVFSVTTSSKYEVDYLGQSTKGDLNLKLDPLQSFG-D 124

Query: 543  GEETLEGPIEEVARVEAHEAEDLLRDLGIPSP-SSRNSLHGIFCSRTLNLRSISAIGYDM 602
            G+ TLEGPIEEVAR EA  AE+L+R+LGI  P S+++S  GIFCSRTLNLRSISAIGYDM
Sbjct: 125  GQATLEGPIEEVARTEAQAAENLIRELGIQGPFSAQHSPRGIFCSRTLNLRSISAIGYDM 184

Query: 603  DYTLMHYNVMAWEGRAYDYCMENLRNMGFPVNGLAFDPDLVIRGLVIDKERGNLVKADRF 662
            DYTLMHYNVMAWEG+AYDYCMENL++MGFPV+GLAFDP+LVIRGL+IDKE+GNLVKADRF
Sbjct: 185  DYTLMHYNVMAWEGKAYDYCMENLKSMGFPVDGLAFDPELVIRGLMIDKEKGNLVKADRF 244

Query: 663  GYIKRAMHGTKMLSTRDVSEIYGRELVDLRKENRWEFLNTLFSVSEAVAYMQMVDRLDDG 722
            GY+KRAMHGTKMLS + VSEIYGRELVDLR ++RWEFLNT FSVSEA+AY QMVDRLDDG
Sbjct: 245  GYVKRAMHGTKMLSNKAVSEIYGRELVDLRNQSRWEFLNTFFSVSEALAYAQMVDRLDDG 304

Query: 723  AIGAAIGPLDYKGLYKAVGKALFRAHVEGQLKSEIMSNPELFVEPDPVLPLTLLDQKEAG 782
             I A +G LDYKGLYKAV KALFRAHVEGQLKSEIMS PELFVEPDP LPL LLDQKEAG
Sbjct: 305  FISADLGTLDYKGLYKAVAKALFRAHVEGQLKSEIMSKPELFVEPDPELPLALLDQKEAG 364

Query: 783  KKLLLITNSDYHYTNKMMQHSFNKFLPNDMGWRDLFDIVIVSARKPEFFQMSHPLYEVVT 842
            KKLLLITNSDYHYT+KMM+HSFNKFLPNDM WRDLFD+VIVSARKPEFFQMSHPLYEVVT
Sbjct: 365  KKLLLITNSDYHYTDKMMKHSFNKFLPNDMDWRDLFDMVIVSARKPEFFQMSHPLYEVVT 424

Query: 843  GEGLMRPCFKAVAGGLYSGGSAQMIENSLNIHGDEILYVGDHIYTDVSQSKVHLRWRTAL 902
            GEGLMRPCFKA  GGLYSGGSAQMIE+SLN+HGDEILYVGDHIYTDVS SKVHLRWRTAL
Sbjct: 425  GEGLMRPCFKAETGGLYSGGSAQMIESSLNVHGDEILYVGDHIYTDVSVSKVHLRWRTAL 484

Query: 903  ILRELEEEYSALIHSRGHRASLIELINQKEVVGDLFNQLRLALQRRTQGRPAQTLAATNM 962
            I RELEEEY ALI SRGHR  LIELINQKEVVGDLFNQLRLALQRR++GRPAQTLAATN+
Sbjct: 485  ICRELEEEYMALIGSRGHREELIELINQKEVVGDLFNQLRLALQRRSKGRPAQTLAATNL 544

Query: 963  NDEELTESMQKLLIVMQRLDQKIAPMLEADGELFNKRWGFLSRAGLWDKSHLMRQIEKYA 1022
            +D+ELTE+MQKLLIVMQRLD KI  MLE DGELFNKRWGFLSRAGLWDKSHLMRQIEKYA
Sbjct: 545  DDQELTETMQKLLIVMQRLDDKIGLMLETDGELFNKRWGFLSRAGLWDKSHLMRQIEKYA 604

Query: 1023 DIYTSRVSNFLNYTPFTYFRSQEQTLAHDS 1051
            DIYTSRVSNFLNYTPF YFRSQEQ+LAHDS
Sbjct: 605  DIYTSRVSNFLNYTPFMYFRSQEQSLAHDS 633

BLAST of CSPI06G00230 vs. TAIR10
Match: AT1G73340.1 (AT1G73340.1 Cytochrome P450 superfamily protein)

HSP 1 Score: 524.6 bits (1350), Expect = 1.4e-148
Identity = 265/452 (58.63%), Postives = 333/452 (73.67%), Query Frame = 1

Query: 19  TTFLPNILKNKKYNDQNDQSRTRYKFPSGRRSWPVIGDSFNWYSAIASSHPSKYVEEQAK 78
           TTFL  I+          + R  ++ P G R WP+IGD+F W +A+A SHPS +VE+Q K
Sbjct: 18  TTFLAFIIIFLLAGIARRKRRAPHRLPPGSRGWPLIGDTFAWLNAVAGSHPSSFVEKQIK 77

Query: 79  RYGKIFSCRVFGKCTVVSVDPDFNRYVMQNEGILFQSSYPKSFRDLVGVNGVITAQGEQQ 138
           +YG+IFSC +FGK  VVS DPDFNR++MQNEG LFQSSYPKSFRDLVG +GVIT  G+QQ
Sbjct: 78  KYGRIFSCSLFGKWAVVSADPDFNRFIMQNEGKLFQSSYPKSFRDLVGKDGVITVHGDQQ 137

Query: 139 RKLHGIASNFMRINKLNFSFIKEIQTIIIHSLTTFHDRHQIISLQDACRKIAINLMVSQL 198
           R+LH IAS+ MR ++L   F++ I  +++ +L+ F D  +++ LQD CRK+AI+LMV+QL
Sbjct: 138 RRLHSIASSMMRHDQLKTHFLEVIPVVMLQTLSNFKD-GEVVLLQDICRKVAIHLMVNQL 197

Query: 199 LGVSSDSEVNEISGLFYNFVDGCLSFPINLPPFAYHSAMK-------------------- 258
           LGVSS+SEV+E+S LF +FVDGCLS PI+LP F Y+ AMK                    
Sbjct: 198 LGVSSESEVDEMSQLFSDFVDGCLSVPIDLPGFTYNKAMKVSFKHLSQLLICGFGACLLR 257

Query: 259 -------ARKQIIRKIKKKIEMQ-KSMEKSSSSGHGVLGRLLEEQKHLCDEAVEDFIINL 318
                  ARK+IIRKI K IE + ++   S ++G+GVLGRLLEE+  L +E++ DFIINL
Sbjct: 258 FCLFLIQARKEIIRKINKTIEKRLQNKAASDTAGNGVLGRLLEEES-LPNESMADFIINL 317

Query: 319 LFAGNETTSKTMLFAVYYLTHSPKAFKQLMEEQDGLRNKCGDNTITWEDYKSMSFTQCVI 378
           LFAGNETTSKTMLFAVY+LTH PKA  QL+EE D L        +TW+DYK+M FTQCVI
Sbjct: 318 LFAGNETTSKTMLFAVYFLTHCPKAMTQLLEEHDRL----AGGMLTWQDYKTMDFTQCVI 377

Query: 379 DETLRLGGIAIWLMRETKEEIKYSDYVIPKGTFVVPFLSAVHLDENIYDEALTFNPWRWM 438
           DETLRLGGIAIWLMRE KE++ Y DYVIPKG FVVPFLSAVHLDE+ Y E+L+FNPWRW+
Sbjct: 378 DETLRLGGIAIWLMREAKEDVSYQDYVIPKGCFVVPFLSAVHLDESYYKESLSFNPWRWL 437

Query: 439 QLQNQEKRNWKSSPYFAPFGGGGRLCPGAELA 443
             + Q+KRNW++SP++ PFGGG R CPGAELA
Sbjct: 438 DPETQQKRNWRTSPFYCPFGGGTRFCPGAELA 463

BLAST of CSPI06G00230 vs. TAIR10
Match: AT3G50660.1 (AT3G50660.1 Cytochrome P450 superfamily protein)

HSP 1 Score: 276.2 bits (705), Expect = 8.6e-74
Identity = 147/437 (33.64%), Postives = 245/437 (56.06%), Query Frame = 1

Query: 39  RTRYKFPSGRRSWPVIGDSFNWYSAIASSHPSKYVEEQAKRYGKIFSCRVFGKCTVVSVD 98
           +TR+  P G+  WP +G++  +     ++    ++++   +YGKI+   +FG+ T+VS D
Sbjct: 34  KTRFNLPPGKSGWPFLGETIGYLKPYTATTLGDFMQQHVSKYGKIYRSNLFGEPTIVSAD 93

Query: 99  PDFNRYVMQNEGILFQSSYPKSFRDLVGVNGVITAQGEQQRKLHGIASNFMRINKLNFSF 158
              NR+++QNEG LF+ SYP+S   ++G   ++   G+  R +  I+ NF+   +L    
Sbjct: 94  AGLNRFILQNEGRLFECSYPRSIGGILGKWSMLVLVGDMHRDMRSISLNFLSHARLRTIL 153

Query: 159 IKEIQTIIIHSLTTFHDRHQIISLQDACRKIAINLMVSQLLGVS-SDSEVNEISGLFYNF 218
           +K+++   +  L ++  ++ I S QD  +K   NLM   ++ +   + E  ++   +  F
Sbjct: 154 LKDVERHTLFVLDSWQ-QNSIFSAQDEAKKFTFNLMAKHIMSMDPGEEETEQLKKEYVTF 213

Query: 219 VDGCLSFPINLPPFAYHSAMKARKQIIRKIKKKIEMQKSMEKS----------------S 278
           + G +S P+NLP  AYH A+++R  I++ I++K+E +K   K                 S
Sbjct: 214 MKGVVSAPLNLPGTAYHKALQSRATILKFIERKMEERKLDIKEEDQEEEEVKTEDEAEMS 273

Query: 279 SSGH--------GVLGRLLEEQKHLCDEAVEDFIINLLFAGNETTSKTMLFAVYYLTHSP 338
            S H         +LG +L+   +L  E + D I++LLFAG+ET+S  +  A+++L   P
Sbjct: 274 KSDHVRKQRTDDDLLGWVLKHS-NLSTEQILDLILSLLFAGHETSSVAIALAIFFLQACP 333

Query: 339 KAFKQLMEEQDGL---RNKCGDNTITWEDYKSMSFTQCVIDETLRLGGIAIWLMRETKEE 398
           KA ++L EE   +   + + G++ + W+DYK M FTQCVI+ETLRLG +  +L R+  ++
Sbjct: 334 KAVEELREEHLEIARAKKELGESELNWDDYKKMDFTQCVINETLRLGNVVRFLHRKALKD 393

Query: 399 IKYSDYVIPKGTFVVPFLSAVHLDENIYDEALTFNPWRWMQLQNQEKRNWKSS-----PY 443
           ++Y  Y IP G  V+P +SAVHLD + YD+   FNPWRW Q  N    +   S       
Sbjct: 394 VRYKGYDIPSGWKVLPVISAVHLDNSRYDQPNLFNPWRWQQQNNGASSSGSGSFSTWGNN 453

BLAST of CSPI06G00230 vs. TAIR10
Match: AT3G13730.1 (AT3G13730.1 cytochrome P450, family 90, subfamily D, polypeptide 1)

HSP 1 Score: 261.9 bits (668), Expect = 1.7e-69
Identity = 142/451 (31.49%), Postives = 251/451 (55.65%), Query Frame = 1

Query: 5   SLVAWSFFILTLLFTTFLPNILKN-----KKYNDQNDQSRTRY-KFPSGRRSWPVIGDSF 64
           SL+ +SFF   ++      N L++     KK ND +  S++   KFP G   WPVIG++ 
Sbjct: 6   SLLFFSFFFFIIIVIFNKINGLRSSPASKKKLNDHHVTSQSHGPKFPHGSLGWPVIGETI 65

Query: 65  NWYSAIASSHPSKYVEEQAKRYGKIFSCRVFGKCTVVSVDPDFNRYVMQNEGILFQSSYP 124
            + S+  S  P  +++++   YG++F   +FG  T+VS D + NR V+Q++   F   YP
Sbjct: 66  EFVSSAYSDRPESFMDKRRLMYGRVFKSHIFGTATIVSTDAEVNRAVLQSDSTAFVPFYP 125

Query: 125 KSFRDLVGVNGVITAQGEQQRKLHGIASNFMRINKLNFSFIKEIQTIIIHSLTTFHDRHQ 184
           K+ R+L+G + ++   G   R+ HG+  +F++   L    ++++   +  S+  + +  Q
Sbjct: 126 KTVRELMGKSSILLINGSLHRRFHGLVGSFLKSPLLKAQIVRDMHKFLSESMDLWSE-DQ 185

Query: 185 IISLQDACRKIAINLMVSQLLGVSSDSEVNEISGLFYNFVDGCLSFPINLPPFAYHSAMK 244
            + LQD  + +A  ++   L+ V    ++ E+   F NF+ G +S PIN P    H +++
Sbjct: 186 PVLLQDVSKTVAFKVLAKALISVEKGEDLEELKREFENFISGLMSLPINFPGTQLHRSLQ 245

Query: 245 ARKQIIRKIKK----KIEMQKSMEKSSSSGHGVLGRLLEE-QKHLCDEAVEDFIINLLFA 304
           A+K +++++++    KI   K+ E+       V+  LL++  +HL    + + +I+++  
Sbjct: 246 AKKNMVKQVERIIEGKIRKTKNKEEDDVIAKDVVDVLLKDSSEHLTHNLIANNMIDMMIP 305

Query: 305 GNETTSKTMLFAVYYLTHSPKAFKQLMEEQDGLRN--KCGDNTITWEDYKSMSFTQCVID 364
           G+++    +  AV +L+ SP A   L EE   L++  +     + W DY S+ FTQ VI 
Sbjct: 306 GHDSVPVLITLAVKFLSDSPAALNLLTEENMKLKSLKELTGEPLYWNDYLSLPFTQKVIT 365

Query: 365 ETLRLGGIAIWLMRETKEEIKYSDYVIPKGTFVVPFLSAVHLDENIYDEALTFNPWRWMQ 424
           ETLR+G + I +MR+  ++++   YVIPKG   + +L +VHLD+  Y+    FNPWRW  
Sbjct: 366 ETLRMGNVIIGVMRKAMKDVEIKGYVIPKGWCFLAYLRSVHLDKLYYESPYKFNPWRW-- 425

Query: 425 LQNQEKRNWKSSPYFAPFGGGGRLCPGAELA 443
               ++R+  +S  F+PFGGG RLCPG +LA
Sbjct: 426 ----QERDMNTSS-FSPFGGGQRLCPGLDLA 448

BLAST of CSPI06G00230 vs. TAIR10
Match: AT5G05690.1 (AT5G05690.1 Cytochrome P450 superfamily protein)

HSP 1 Score: 254.6 bits (649), Expect = 2.7e-67
Identity = 140/409 (34.23%), Postives = 219/409 (53.55%), Query Frame = 1

Query: 39  RTRYK---FPSGRRSWPVIGDSFNWYSAIASSHPSKYVEEQAKRYGKIFSCRVFGKCTVV 98
           RTRY+    P G    P+IG++F    A  + +P  +++E+  RYG +F   +FG+ T+ 
Sbjct: 23  RTRYRRMGLPPGSLGLPLIGETFQLIGAYKTENPEPFIDERVARYGSVFMTHLFGEPTIF 82

Query: 99  SVDPDFNRYVMQNEGILFQSSYPKSFRDLVGVNGVITAQGEQQRKLHGIASNFMRINKLN 158
           S DP+ NR+V+QNEG LF+ SYP S  +L+G + ++  +G   +++H +  +F   + + 
Sbjct: 83  SADPETNRFVLQNEGKLFECSYPASICNLLGKHSLLLMKGSLHKRMHSLTMSFANSSIIK 142

Query: 159 FSFIKEIQTIIIHSLTTFHDRHQIISLQDACRKIAINLMVSQLLGVSSDSEVNEISGLFY 218
              + +I  ++  +L ++  R   + L +  +KI   L V QL+          +   + 
Sbjct: 143 DHLMLDIDRLVRFNLDSWSSR---VLLMEEAKKITFELTVKQLMSFDPGEWSESLRKEYL 202

Query: 219 NFVDGCLSFPINLPPFAYHSAMKARKQIIRKIKKKI-EMQKSMEKSSSSGHGVLGRLLEE 278
             ++G  S P+ L    Y  A++AR+++   +   + + ++  E+ +     +L  LL  
Sbjct: 203 LVIEGFFSLPLPLFSTTYRKAIQARRKVAEALTVVVMKRREEEEEGAERKKDMLAALLAA 262

Query: 279 QKHLCDEAVEDFIINLLFAGNETTSKTMLFAVYYLTHSPKAFKQLMEEQDGLRNKCGDN- 338
                DE + DF++ LL AG ETTS  M  AV +LT +P A  QL EE + +R    D+ 
Sbjct: 263 DDGFSDEEIVDFLVALLVAGYETTSTIMTLAVKFLTETPLALAQLKEEHEKIRAMKSDSY 322

Query: 339 TITWEDYKSMSFTQCVIDETLRLGGIAIWLMRETKEEIKYSDYVIPKGTFVVPFLSAVHL 398
           ++ W DYKSM FTQCV++ETLR+  I   + R    +++   Y IPKG  V     AVHL
Sbjct: 323 SLEWSDYKSMPFTQCVVNETLRVANIIGGVFRRAMTDVEIKGYKIPKGWKVFSSFRAVHL 382

Query: 399 DENIYDEALTFNPWRWMQLQNQEKRNWKSSPYFAPFGGGGRLCPGAELA 443
           D N + +A TFNPWRW             S  F PFGGG RLCPG ELA
Sbjct: 383 DPNHFKDARTFNPWRW----QSNSVTTGPSNVFTPFGGGPRLCPGYELA 424

BLAST of CSPI06G00230 vs. NCBI nr
Match: gi|449441492|ref|XP_004138516.1| (PREDICTED: 5'-nucleotidase domain-containing protein 4 [Cucumis sativus])

HSP 1 Score: 1219.5 bits (3154), Expect = 0.0e+00
Identity = 609/610 (99.84%), Postives = 610/610 (100.00%), Query Frame = 1

Query: 455  MAVAASLGNPALTRATLLSPVNKQHLTSRRAQKMTLCLCAMDSKSVGVGGDVFSVTSSAK 514
            MAVAASLGNPALTRATLLSPVNKQHLTSRRAQKMTLCLCAMDSKSVGVGGDVFSVTSSAK
Sbjct: 1    MAVAASLGNPALTRATLLSPVNKQHLTSRRAQKMTLCLCAMDSKSVGVGGDVFSVTSSAK 60

Query: 515  SGVDYLGQSTKGDMNVKLEHLDAFGVDGEETLEGPIEEVARVEAHEAEDLLRDLGIPSPS 574
            SGVDYLGQSTKGDMNVKLEHLDAFGVDGEETLEGPIEEVARVEAHEAEDLLRDLGIPSPS
Sbjct: 61   SGVDYLGQSTKGDMNVKLEHLDAFGVDGEETLEGPIEEVARVEAHEAEDLLRDLGIPSPS 120

Query: 575  SRNSLHGIFCSRTLNLRSISAIGYDMDYTLMHYNVMAWEGRAYDYCMENLRNMGFPVNGL 634
            SRNSLHGIFCSRTLNLRSISAIGYDMDYTLMHYNVMAWEGRAYDYCMENLRNMGFPVNGL
Sbjct: 121  SRNSLHGIFCSRTLNLRSISAIGYDMDYTLMHYNVMAWEGRAYDYCMENLRNMGFPVNGL 180

Query: 635  AFDPDLVIRGLVIDKERGNLVKADRFGYIKRAMHGTKMLSTRDVSEIYGRELVDLRKENR 694
            AFDPDLVIRGLVIDKERGNLVKADRFGYIKRAMHGTKMLSTRDVSEIYGRELVDLRKENR
Sbjct: 181  AFDPDLVIRGLVIDKERGNLVKADRFGYIKRAMHGTKMLSTRDVSEIYGRELVDLRKENR 240

Query: 695  WEFLNTLFSVSEAVAYMQMVDRLDDGAIGAAIGPLDYKGLYKAVGKALFRAHVEGQLKSE 754
            WEFLNTLFSVSEAVAYMQMVDRLDDGAIGAAIGPLDYKGLYKAVGKALFRAHVEGQLKSE
Sbjct: 241  WEFLNTLFSVSEAVAYMQMVDRLDDGAIGAAIGPLDYKGLYKAVGKALFRAHVEGQLKSE 300

Query: 755  IMSNPELFVEPDPVLPLTLLDQKEAGKKLLLITNSDYHYTNKMMQHSFNKFLPNDMGWRD 814
            IMSNPELFVEPDPVLPLTLLDQKEAGKKLLLITNSDYHYTNKMMQHSFNKFLPNDMGWRD
Sbjct: 301  IMSNPELFVEPDPVLPLTLLDQKEAGKKLLLITNSDYHYTNKMMQHSFNKFLPNDMGWRD 360

Query: 815  LFDIVIVSARKPEFFQMSHPLYEVVTGEGLMRPCFKAVAGGLYSGGSAQMIENSLNIHGD 874
            LFDIVIVSARKPEFFQMSHPLYEVVTGEGLMRPCFKAVAGGLYSGGSAQMIENSLNIHGD
Sbjct: 361  LFDIVIVSARKPEFFQMSHPLYEVVTGEGLMRPCFKAVAGGLYSGGSAQMIENSLNIHGD 420

Query: 875  EILYVGDHIYTDVSQSKVHLRWRTALILRELEEEYSALIHSRGHRASLIELINQKEVVGD 934
            EILYVGDHIYTDVSQSKVHLRWRTALILRELEEEYSALIHSRGHRASLIELINQKEVVGD
Sbjct: 421  EILYVGDHIYTDVSQSKVHLRWRTALILRELEEEYSALIHSRGHRASLIELINQKEVVGD 480

Query: 935  LFNQLRLALQRRTQGRPAQTLAATNMNDEELTESMQKLLIVMQRLDQKIAPMLEADGELF 994
            LFNQLRLALQRRTQGRPAQTLAATNMNDEELTESMQKLLIVMQRLDQK+APMLEADGELF
Sbjct: 481  LFNQLRLALQRRTQGRPAQTLAATNMNDEELTESMQKLLIVMQRLDQKVAPMLEADGELF 540

Query: 995  NKRWGFLSRAGLWDKSHLMRQIEKYADIYTSRVSNFLNYTPFTYFRSQEQTLAHDSYSFY 1054
            NKRWGFLSRAGLWDKSHLMRQIEKYADIYTSRVSNFLNYTPFTYFRSQEQTLAHDSYSFY
Sbjct: 541  NKRWGFLSRAGLWDKSHLMRQIEKYADIYTSRVSNFLNYTPFTYFRSQEQTLAHDSYSFY 600

Query: 1055 CSHEETTIDK 1065
            CSHEETTIDK
Sbjct: 601  CSHEETTIDK 610

BLAST of CSPI06G00230 vs. NCBI nr
Match: gi|659116824|ref|XP_008458278.1| (PREDICTED: 5'-nucleotidase domain-containing protein 4 [Cucumis melo])

HSP 1 Score: 1198.3 bits (3099), Expect = 0.0e+00
Identity = 595/610 (97.54%), Postives = 604/610 (99.02%), Query Frame = 1

Query: 455  MAVAASLGNPALTRATLLSPVNKQHLTSRRAQKMTLCLCAMDSKSVGVGGDVFSVTSSAK 514
            MAVA SLGNP LTR TLLSPVNK+HLTSRR+QKMTLCLCAMDSKSVGVGGDVFSVTSSAK
Sbjct: 1    MAVAVSLGNPVLTRTTLLSPVNKRHLTSRRSQKMTLCLCAMDSKSVGVGGDVFSVTSSAK 60

Query: 515  SGVDYLGQSTKGDMNVKLEHLDAFGVDGEETLEGPIEEVARVEAHEAEDLLRDLGIPSPS 574
            SGVDYLGQSTKGDMNVK EHLDAFGVDGEETLEGPIEEVARVEAHEAEDLLRDLGIPSPS
Sbjct: 61   SGVDYLGQSTKGDMNVKFEHLDAFGVDGEETLEGPIEEVARVEAHEAEDLLRDLGIPSPS 120

Query: 575  SRNSLHGIFCSRTLNLRSISAIGYDMDYTLMHYNVMAWEGRAYDYCMENLRNMGFPVNGL 634
            SRNS HGIFCSRTLNLRSIS IGYDMDYTLMHYNVMAWEGRAYDYCMENLRNMGFPVNGL
Sbjct: 121  SRNSPHGIFCSRTLNLRSISVIGYDMDYTLMHYNVMAWEGRAYDYCMENLRNMGFPVNGL 180

Query: 635  AFDPDLVIRGLVIDKERGNLVKADRFGYIKRAMHGTKMLSTRDVSEIYGRELVDLRKENR 694
            AFDPDLVIRGLVIDKERGNLVKADRFGYIKRAMHGTKMLSTRDVSEIYGRELVDLRKENR
Sbjct: 181  AFDPDLVIRGLVIDKERGNLVKADRFGYIKRAMHGTKMLSTRDVSEIYGRELVDLRKENR 240

Query: 695  WEFLNTLFSVSEAVAYMQMVDRLDDGAIGAAIGPLDYKGLYKAVGKALFRAHVEGQLKSE 754
            WEFLNTLFSVSEAVAYMQMVDRLDDGAIGAA+GPLDYKGLYKAVGKALFRAHVEGQLKSE
Sbjct: 241  WEFLNTLFSVSEAVAYMQMVDRLDDGAIGAALGPLDYKGLYKAVGKALFRAHVEGQLKSE 300

Query: 755  IMSNPELFVEPDPVLPLTLLDQKEAGKKLLLITNSDYHYTNKMMQHSFNKFLPNDMGWRD 814
            IMSNPELFVEPDPVLPLTLLDQKEAGKKLLLITNSDYHYT+KMMQHSFN+FLPNDMGWRD
Sbjct: 301  IMSNPELFVEPDPVLPLTLLDQKEAGKKLLLITNSDYHYTDKMMQHSFNRFLPNDMGWRD 360

Query: 815  LFDIVIVSARKPEFFQMSHPLYEVVTGEGLMRPCFKAVAGGLYSGGSAQMIENSLNIHGD 874
            LFDIVIVSARKPEFFQMSHPLYEVVTGEGLMRPCFKAVAGGLYSGGSAQMIENSLNIHGD
Sbjct: 361  LFDIVIVSARKPEFFQMSHPLYEVVTGEGLMRPCFKAVAGGLYSGGSAQMIENSLNIHGD 420

Query: 875  EILYVGDHIYTDVSQSKVHLRWRTALILRELEEEYSALIHSRGHRASLIELINQKEVVGD 934
            EILYVGDHIYTDVSQSKVHLRWRTALILRELEEEYSALIHSRGHRASLIELINQKEVVGD
Sbjct: 421  EILYVGDHIYTDVSQSKVHLRWRTALILRELEEEYSALIHSRGHRASLIELINQKEVVGD 480

Query: 935  LFNQLRLALQRRTQGRPAQTLAATNMNDEELTESMQKLLIVMQRLDQKIAPMLEADGELF 994
            LFNQLRLALQRRTQGRPAQTLAATN++DEELTESMQKLLIVMQRLDQKIAPMLEADGELF
Sbjct: 481  LFNQLRLALQRRTQGRPAQTLAATNLDDEELTESMQKLLIVMQRLDQKIAPMLEADGELF 540

Query: 995  NKRWGFLSRAGLWDKSHLMRQIEKYADIYTSRVSNFLNYTPFTYFRSQEQTLAHDSYSFY 1054
            NKRWGFLSRAGLWDKSHLMRQIEKYADIYTSRVSNFLNYTPFTYFRSQEQTLAHDSYSFY
Sbjct: 541  NKRWGFLSRAGLWDKSHLMRQIEKYADIYTSRVSNFLNYTPFTYFRSQEQTLAHDSYSFY 600

Query: 1055 CSHEETTIDK 1065
            CSH+ETT+DK
Sbjct: 601  CSHDETTVDK 610

BLAST of CSPI06G00230 vs. NCBI nr
Match: gi|596122278|ref|XP_007221821.1| (hypothetical protein PRUPE_ppa002751mg [Prunus persica])

HSP 1 Score: 1009.6 bits (2609), Expect = 4.0e-291
Identity = 505/608 (83.06%), Postives = 547/608 (89.97%), Query Frame = 1

Query: 453  ILMAVAASLGNPALTRATLLSPVNKQHLTSRRAQKMTLCLCAMDSKSVGVGGD---VFSV 512
            + MA    L +P   R    S ++ ++L      K   C C+  + S   G D   VFS+
Sbjct: 18   VSMAATNYLRSPVRIRPMSRSILSSKNLLWNYGMK---CQCSSSTSSSSSGVDEKSVFSL 77

Query: 513  TSSAKSGVDYLGQSTKGDMNVKLEHLDAFGVDGEETLEGPIEEVARVEAHEAEDLLRDLG 572
            TSS+K  VDYLG+ TKGD+NVK+EHL+AFG+D + TL+GPIEEVARVEA EAEDLLRDLG
Sbjct: 78   TSSSKYEVDYLGEKTKGDLNVKVEHLEAFGIDSQATLKGPIEEVARVEAEEAEDLLRDLG 137

Query: 573  IPSP-SSRNSLHGIFCSRTLNLRSISAIGYDMDYTLMHYNVMAWEGRAYDYCMENLRNMG 632
            IP+P SSR S  GIFCSRTLNLRSISAIGYDMDYTLMHYNV+AWEGRAYDYCMENL+ +G
Sbjct: 138  IPTPFSSRQSPRGIFCSRTLNLRSISAIGYDMDYTLMHYNVIAWEGRAYDYCMENLKKVG 197

Query: 633  FPVNGLAFDPDLVIRGLVIDKERGNLVKADRFGYIKRAMHGTKMLSTRDVSEIYGRELVD 692
            FPV+GLAFDPDLVIRGLVIDKE+GNLVKADRFGY+KRAMHGTKMLS R VSE+YGRELVD
Sbjct: 198  FPVDGLAFDPDLVIRGLVIDKEKGNLVKADRFGYVKRAMHGTKMLSNRAVSEMYGRELVD 257

Query: 693  LRKENRWEFLNTLFSVSEAVAYMQMVDRLDDGAIGAAIGPLDYKGLYKAVGKALFRAHVE 752
            LRKE+RWEFLNTLFSVSEAVAYMQMVDRLDDG I A +GPLDYKGLYKAVG+ALFRAHVE
Sbjct: 258  LRKESRWEFLNTLFSVSEAVAYMQMVDRLDDGTIAAQLGPLDYKGLYKAVGRALFRAHVE 317

Query: 753  GQLKSEIMSNPELFVEPDPVLPLTLLDQKEAGKKLLLITNSDYHYTNKMMQHSFNKFLPN 812
            GQLKSEIMS PELFV PDP LPL LLDQKEAGKKLLLITNSDYHYT+KMMQHSFN+FLPN
Sbjct: 318  GQLKSEIMSKPELFVTPDPELPLALLDQKEAGKKLLLITNSDYHYTDKMMQHSFNRFLPN 377

Query: 813  DMGWRDLFDIVIVSARKPEFFQMSHPLYEVVTGEGLMRPCFKAVAGGLYSGGSAQMIENS 872
            DMGWRDLFDIVIVSARKPEFFQMSHP+YEVVTGEGLMRPCFKA  GGLYSGGSAQM+ENS
Sbjct: 378  DMGWRDLFDIVIVSARKPEFFQMSHPMYEVVTGEGLMRPCFKAKTGGLYSGGSAQMVENS 437

Query: 873  LNIHGDEILYVGDHIYTDVSQSKVHLRWRTALILRELEEEYSALIHSRGHRASLIELINQ 932
            LNIHGDEILYVGDHIYTDVSQSKVHLRWRTALI RELEEE+SALIHSRGHRASL+ELINQ
Sbjct: 438  LNIHGDEILYVGDHIYTDVSQSKVHLRWRTALICRELEEEFSALIHSRGHRASLVELINQ 497

Query: 933  KEVVGDLFNQLRLALQRRTQGRPAQTLAATNMNDEELTESMQKLLIVMQRLDQKIAPMLE 992
            KEV+GDLFNQLRLA QRRT+GRPAQTLAATN++D+EL+ESMQKLLIVMQRLDQKIAPMLE
Sbjct: 498  KEVIGDLFNQLRLASQRRTKGRPAQTLAATNLDDQELSESMQKLLIVMQRLDQKIAPMLE 557

Query: 993  ADGELFNKRWGFLSRAGLWDKSHLMRQIEKYADIYTSRVSNFLNYTPFTYFRSQEQTLAH 1052
            ADGELFNKRWGFLSRAG WDKSHLMRQIEKYADIYTSRVSNFL+YTPF YFRSQEQTLAH
Sbjct: 558  ADGELFNKRWGFLSRAGFWDKSHLMRQIEKYADIYTSRVSNFLHYTPFMYFRSQEQTLAH 617

Query: 1053 DSYSFYCS 1057
            DSYS+YCS
Sbjct: 618  DSYSYYCS 622

BLAST of CSPI06G00230 vs. NCBI nr
Match: gi|645233792|ref|XP_008223513.1| (PREDICTED: 5'-nucleotidase domain-containing protein 4-like isoform X1 [Prunus mume])

HSP 1 Score: 1006.5 bits (2601), Expect = 3.4e-290
Identity = 506/622 (81.35%), Postives = 552/622 (88.75%), Query Frame = 1

Query: 455  MAVAASLGNPALTRATLLS--------------PVNKQHLTSRRA--QKMTLCLCAMDSK 514
            M    +LGN ++ R T +S              P+++  L+S+      +  C C+  + 
Sbjct: 1    MYTTEALGNFSVFRETSVSMAATNYIRSPVRIRPMSRSILSSKNLLWNHVIKCQCSSRTS 60

Query: 515  SVGVGGD---VFSVTSSAKSGVDYLGQSTKGDMNVKLEHLDAFGVDGEETLEGPIEEVAR 574
            S   G D   VFS+TSS+K  VDYLG+ TKGD+NVK+EHL+AFG+D + TL+GPIEEVAR
Sbjct: 61   SSSSGVDEKSVFSLTSSSKYEVDYLGEKTKGDLNVKVEHLEAFGIDSQATLKGPIEEVAR 120

Query: 575  VEAHEAEDLLRDLGIPSP-SSRNSLHGIFCSRTLNLRSISAIGYDMDYTLMHYNVMAWEG 634
             EA EAEDLLRDLGIP+P SSR S  GIFCSRTLNLRSISAIGYDMDYTLMHYNV+AWEG
Sbjct: 121  FEAEEAEDLLRDLGIPTPFSSRQSARGIFCSRTLNLRSISAIGYDMDYTLMHYNVIAWEG 180

Query: 635  RAYDYCMENLRNMGFPVNGLAFDPDLVIRGLVIDKERGNLVKADRFGYIKRAMHGTKMLS 694
            RAYDYCMENL+ +GFPV+GLAFDPDLVIRGLVIDKE+GNLVKADRFGY+KRAMHGTKMLS
Sbjct: 181  RAYDYCMENLKKVGFPVDGLAFDPDLVIRGLVIDKEKGNLVKADRFGYVKRAMHGTKMLS 240

Query: 695  TRDVSEIYGRELVDLRKENRWEFLNTLFSVSEAVAYMQMVDRLDDGAIGAAIGPLDYKGL 754
             R VSE+YGRELVDLRKE+RWEFLNTLFSVSEAVAYMQMVDRLDDG I A +GP+DYKGL
Sbjct: 241  NRAVSEMYGRELVDLRKESRWEFLNTLFSVSEAVAYMQMVDRLDDGTIAAQLGPIDYKGL 300

Query: 755  YKAVGKALFRAHVEGQLKSEIMSNPELFVEPDPVLPLTLLDQKEAGKKLLLITNSDYHYT 814
            YKAVG+ALFRAHVEGQLKSEIMS PELFV PDP LPL LLDQKEAGKKLLLITNSDYHYT
Sbjct: 301  YKAVGRALFRAHVEGQLKSEIMSKPELFVTPDPELPLALLDQKEAGKKLLLITNSDYHYT 360

Query: 815  NKMMQHSFNKFLPNDMGWRDLFDIVIVSARKPEFFQMSHPLYEVVTGEGLMRPCFKAVAG 874
            +KMMQHSFN+FLPNDMGWRDLFDIVIVSARKPEFFQMSHP+YEVVTGEGLMRPCFKA  G
Sbjct: 361  DKMMQHSFNRFLPNDMGWRDLFDIVIVSARKPEFFQMSHPMYEVVTGEGLMRPCFKAKTG 420

Query: 875  GLYSGGSAQMIENSLNIHGDEILYVGDHIYTDVSQSKVHLRWRTALILRELEEEYSALIH 934
            GLYSGGSAQM+ENSLNIHGDEILYVGDHIYTDVSQSKVHLRWRTALI RELEEE+SALIH
Sbjct: 421  GLYSGGSAQMVENSLNIHGDEILYVGDHIYTDVSQSKVHLRWRTALICRELEEEFSALIH 480

Query: 935  SRGHRASLIELINQKEVVGDLFNQLRLALQRRTQGRPAQTLAATNMNDEELTESMQKLLI 994
            SRGHRASL+ELINQKEVVGDLFNQLRLA QRRT+GRPAQTLAATN++D+EL+ESMQKLLI
Sbjct: 481  SRGHRASLVELINQKEVVGDLFNQLRLASQRRTKGRPAQTLAATNLDDQELSESMQKLLI 540

Query: 995  VMQRLDQKIAPMLEADGELFNKRWGFLSRAGLWDKSHLMRQIEKYADIYTSRVSNFLNYT 1054
            VMQRLDQKIAPMLEADGELFNKRWGFLSRAG WDKSHLMRQIEKYADIYTSRVSNFL+YT
Sbjct: 541  VMQRLDQKIAPMLEADGELFNKRWGFLSRAGFWDKSHLMRQIEKYADIYTSRVSNFLHYT 600

Query: 1055 PFTYFRSQEQTLAHDSYSFYCS 1057
            PF YFRSQEQTLAHDSYS+YCS
Sbjct: 601  PFMYFRSQEQTLAHDSYSYYCS 622

BLAST of CSPI06G00230 vs. NCBI nr
Match: gi|728819759|gb|KHG03386.1| (Cytosolic purine 5'-nucleotidase [Gossypium arboreum])

HSP 1 Score: 1005.4 bits (2598), Expect = 7.6e-290
Identity = 498/582 (85.57%), Postives = 537/582 (92.27%), Query Frame = 1

Query: 476  NKQHLTSRRAQKMTLCLCAMDSKSVGVGGDVFSVTSSAKSGVDYLGQSTKGDMNVKLEHL 535
            + ++L  R  + +  C C   S S  VGGD FS+TSS+K  VDYLG+STKGD+N+ L+HL
Sbjct: 35   SSKYLVMRAPKMVFKCGC---SSSSSVGGDAFSLTSSSKCDVDYLGESTKGDLNINLKHL 94

Query: 536  DAFGVDGEETLEGPIEEVARVEAHEAEDLLRDLGIPSPSS-RNSLHGIFCSRTLNLRSIS 595
            + FG+DG+ TLEGPIE+VAR+EA EA  LLRDLGIPSPS+ R S  GIFC+RTLNLRSIS
Sbjct: 95   ENFGLDGQATLEGPIEQVARLEAEEAGSLLRDLGIPSPSAARLSPRGIFCTRTLNLRSIS 154

Query: 596  AIGYDMDYTLMHYNVMAWEGRAYDYCMENLRNMGFPVNGLAFDPDLVIRGLVIDKERGNL 655
            AIGYDMDYTL+HYNVMAWEGRAYDYCMENL++MGFPV GLAFDPDLVIRGLVIDKE+GNL
Sbjct: 155  AIGYDMDYTLIHYNVMAWEGRAYDYCMENLKSMGFPVEGLAFDPDLVIRGLVIDKEKGNL 214

Query: 656  VKADRFGYIKRAMHGTKMLSTRDVSEIYGRELVDLRKENRWEFLNTLFSVSEAVAYMQMV 715
            VKADRFGY+KRAMHGTKMLSTR VSE+YGRELVDLRKE++WEFLNTLFSVSEAVAYMQMV
Sbjct: 215  VKADRFGYVKRAMHGTKMLSTRAVSEMYGRELVDLRKESQWEFLNTLFSVSEAVAYMQMV 274

Query: 716  DRLDDGAIGAAIGPLDYKGLYKAVGKALFRAHVEGQLKSEIMSNPELFVEPDPVLPLTLL 775
            DRLDDG I A +GPLDYKGLYKAVGKALFRAHVEGQLKSEIMS PELFVEPDP LPL LL
Sbjct: 275  DRLDDGVIPADLGPLDYKGLYKAVGKALFRAHVEGQLKSEIMSKPELFVEPDPELPLALL 334

Query: 776  DQKEAGKKLLLITNSDYHYTNKMMQHSFNKFLPNDMGWRDLFDIVIVSARKPEFFQMSHP 835
            DQKEAGKKLLLITNSDYHYT+KMMQHSFN+FLPNDMGWRDLFD+VIVSARKPEFFQMSHP
Sbjct: 335  DQKEAGKKLLLITNSDYHYTDKMMQHSFNRFLPNDMGWRDLFDMVIVSARKPEFFQMSHP 394

Query: 836  LYEVVTGEGLMRPCFKAVAGGLYSGGSAQMIENSLNIHGDEILYVGDHIYTDVSQSKVHL 895
            LYEVVTGEGLMRPCFK   GGLYSGGSAQM+ENSLNIHGDEILYVGDHIYTDVSQSKVHL
Sbjct: 395  LYEVVTGEGLMRPCFKTRTGGLYSGGSAQMVENSLNIHGDEILYVGDHIYTDVSQSKVHL 454

Query: 896  RWRTALILRELEEEYSALIHSRGHRASLIELINQKEVVGDLFNQLRLALQRRTQGRPAQT 955
            RWRTALI RELEEEY ALIHSRG RA+++ELINQKEVVGDLFNQLRLALQRRT+GRPAQT
Sbjct: 455  RWRTALICRELEEEYRALIHSRGPRATVVELINQKEVVGDLFNQLRLALQRRTEGRPAQT 514

Query: 956  LAATNMNDEELTESMQKLLIVMQRLDQKIAPMLEADGELFNKRWGFLSRAGLWDKSHLMR 1015
            LAATNM+D ELTESMQKLLIVMQRLD+KIAP+LEADGELFNKRWGFLSRAGLWDKSHLMR
Sbjct: 515  LAATNMDDRELTESMQKLLIVMQRLDEKIAPLLEADGELFNKRWGFLSRAGLWDKSHLMR 574

Query: 1016 QIEKYADIYTSRVSNFLNYTPFTYFRSQEQTLAHDSYSFYCS 1057
            QIEKYADIYTSRVSNFLNYTPF YFRSQEQTLAHD+YS YCS
Sbjct: 575  QIEKYADIYTSRVSNFLNYTPFMYFRSQEQTLAHDTYSHYCS 613

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
C72B1_PINTA4.9e-9542.14Abietadienol/abietadienal oxidase OS=Pinus taeda GN=CYP720B1 PE=1 SV=1[more]
C72B2_PINTA9.3e-8641.26Cytochrome P450 720B2 OS=Pinus taeda GN=CYP720B2 PE=2 SV=1[more]
C90B1_ARATH1.5e-7233.64Cytochrome P450 90B1 OS=Arabidopsis thaliana GN=CYP90B1 PE=1 SV=2[more]
C90D1_ARATH3.0e-6831.493-epi-6-deoxocathasterone 23-monooxygenase OS=Arabidopsis thaliana GN=CYP90D1 PE... [more]
C90A1_ARATH4.8e-6634.23Cytochrome P450 90A1 OS=Arabidopsis thaliana GN=CYP90A1 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0K9W0_CUCSA0.0e+0099.84Uncharacterized protein OS=Cucumis sativus GN=Csa_6G001750 PE=4 SV=1[more]
M5XV85_PRUPE2.8e-29183.06Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002751mg PE=4 SV=1[more]
A0A0B0MWT3_GOSAR5.3e-29085.57Cytosolic purine 5'-nucleotidase OS=Gossypium arboreum GN=F383_27032 PE=4 SV=1[more]
A0A0D2SC88_GOSRA1.2e-28985.40Uncharacterized protein OS=Gossypium raimondii GN=B456_005G073300 PE=4 SV=1[more]
A0A061EPF8_THECC3.8e-28887.59HAD-superfamily hydrolase, subfamily IG, 5'-nucleotidase isoform 1 OS=Theobroma ... [more]
Match NameE-valueIdentityDescription
AT5G48960.13.0e-27683.51 HAD-superfamily hydrolase, subfamily IG, 5'-nucleotidase[more]
AT1G73340.11.4e-14858.63 Cytochrome P450 superfamily protein[more]
AT3G50660.18.6e-7433.64 Cytochrome P450 superfamily protein[more]
AT3G13730.11.7e-6931.49 cytochrome P450, family 90, subfamily D, polypeptide 1[more]
AT5G05690.12.7e-6734.23 Cytochrome P450 superfamily protein[more]
Match NameE-valueIdentityDescription
gi|449441492|ref|XP_004138516.1|0.0e+0099.84PREDICTED: 5'-nucleotidase domain-containing protein 4 [Cucumis sativus][more]
gi|659116824|ref|XP_008458278.1|0.0e+0097.54PREDICTED: 5'-nucleotidase domain-containing protein 4 [Cucumis melo][more]
gi|596122278|ref|XP_007221821.1|4.0e-29183.06hypothetical protein PRUPE_ppa002751mg [Prunus persica][more]
gi|645233792|ref|XP_008223513.1|3.4e-29081.35PREDICTED: 5'-nucleotidase domain-containing protein 4-like isoform X1 [Prunus m... [more]
gi|728819759|gb|KHG03386.1|7.6e-29085.57Cytosolic purine 5'-nucleotidase [Gossypium arboreum][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR001128Cyt_P450
IPR002403Cyt_P450_E_grp-IV
IPR008380HAD-SF_hydro_IG_5-nucl
IPR017972Cyt_P450_CS
IPR023214HAD_sf
IPR001128Cyt_P450
IPR002403Cyt_P450_E_grp-IV
IPR008380HAD-SF_hydro_IG_5-nucl
IPR017972Cyt_P450_CS
IPR023214HAD_sf
Vocabulary: Molecular Function
TermDefinition
GO:0016705oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:0005506iron ion binding
GO:0020037heme binding
GO:0004497monooxygenase activity
GO:0016705oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:0005506iron ion binding
GO:0020037heme binding
GO:0004497monooxygenase activity
Vocabulary: Biological Process
TermDefinition
GO:0055114oxidation-reduction process
GO:0055114oxidation-reduction process
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0055114 oxidation-reduction process
biological_process GO:0006470 protein dephosphorylation
biological_process GO:0008150 biological_process
cellular_component GO:0009570 chloroplast stroma
cellular_component GO:0005575 cellular_component
molecular_function GO:0008253 5'-nucleotidase activity
molecular_function GO:0020037 heme binding
molecular_function GO:0005506 iron ion binding
molecular_function GO:0004497 monooxygenase activity
molecular_function GO:0016705 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
molecular_function GO:0046872 metal ion binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI06G00230.2CSPI06G00230.2mRNA
CSPI06G00230.1CSPI06G00230.1mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001128Cytochrome P450PRINTSPR00385P450coord: 427..436
score: 1.5E-6coord: 293..310
score: 1.5E-6coord: 348..359
score: 1.
IPR001128Cytochrome P450GENE3DG3DSA:1.10.630.10coord: 39..444
score: 6.3
IPR001128Cytochrome P450PFAMPF00067p450coord: 45..443
score: 1.6
IPR001128Cytochrome P450unknownSSF48264Cytochrome P450coord: 39..445
score: 2.23
IPR002403Cytochrome P450, E-class, group IVPRINTSPR00465EP450IVcoord: 376..390
score: 1.4E-15coord: 343..359
score: 1.4E-15coord: 420..436
score: 1.4E-15coord: 284..310
score: 1.4E-15coord: 392..410
score: 1.4E-15coord: 71..94
score: 1.4
IPR008380HAD-superfamily hydrolase, subfamily IG, 5'-nucleotidasePANTHERPTHR12103CYTOSOLIC PURINE 5-NUCLEOTIDASE-RELATEDcoord: 494..942
score: 0.0coord: 959..1063
score:
IPR008380HAD-superfamily hydrolase, subfamily IG, 5'-nucleotidasePFAMPF057615_nucleotidcoord: 582..1050
score: 1.8E
IPR008380HAD-superfamily hydrolase, subfamily IG, 5'-nucleotidaseTIGRFAMsTIGR02244TIGR02244coord: 582..921
score: 1.8E
IPR017972Cytochrome P450, conserved sitePROSITEPS00086CYTOCHROME_P450coord: 429..438
scor
IPR023214HAD-like domainGENE3DG3DSA:3.40.50.1000coord: 870..905
score: 6.7E-7coord: 762..838
score: 6.7E-7coord: 590..606
score: 6.
IPR023214HAD-like domainunknownSSF56784HAD-likecoord: 580..1051
score: 1.34E
NoneNo IPR availablePANTHERPTHR12103:SF15PROTEIN Y71H10B.1, ISOFORM Ccoord: 494..942
score: 0.0coord: 959..1063
score: