Sgr024966 (gene) Monk fruit (Qingpiguo) v1

Overview
NameSgr024966
Typegene
OrganismSiraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Descriptionprolyl 4-hydroxylase 1
Locationtig00002486: 4839976 .. 4864009 (-)
RNA-Seq ExpressionSgr024966
SyntenySgr024966
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
TAAGCTGGTCACCCCGAATCATTGTATTGCATAATTTTTTGAGCACAGAGGTGAGCATAATCCTGTCAACATGGTAATTTCTTTTGCTGGATGCTATTGATTTACTGCCGATTCTCATTCCTTACAGAATCTTCATTAAATTGAGACATACTTCCGACTAAGTGTCATATTATAATTTTGTGAAAAACTGAATATTGCACAGTTGATATTTTTGGGGATTATTCTGGAACTAGACCTGAAGAGAGTGTATAGAGAAACACACACAAAAATTAATAACTACTGGCCCAAATATTTAATAGGTTTTTGGGATAAGTTTTCCCATGGGTATATGAAATTGAAAACCATATTTATTAATTGGGGTTACAAAAATTCCTTCCAATTTGCACAAATATCACTAATAGAGTAGTCACAAAAAGATTTTCCTAATTTGCACCATGTGGCTGCTTATACAAAATAAGATCAAATAAATAGGTGTAATTCTTTTCCTTTCCAATGAAAGTTCTGTTGTTTCTTTCCAGCCATGTAGTCCATAGGAAGCTTCTAATCAAATTTAGCCATAAGGACTTAGCTTGTCGTCTGAAGGGTTGAGCTGCCAAGCAGGTGTGGAGATTGGCGGCAATACTGGTGTGAAGAAGGATGGGCCAGTCGAAGCCTGAAAGAGAGAATTCCAAATGATGGAAGCAAAGTGGCAGCTTACAAAAAGATGATCTTGGGACTCACTGCTTTGCCTGCAGAGGGGGCAAATATTTGGGGAAAGAGCAACCGAGGGACATCTCGTCTGCATACGGTCCATGGTGTTGATCTTTTCGGAGAGCACCTCCCACAAAAAAATTTAATATTCCTAGGATAGTTGTCCTTCCAGATAGCCGTAATCAGGTCCTCGTGAATGTGGGGAGCCTTTGTAGTCAAAAAAAGATGAAGGGACCTGCAAGAGAAGGTGCCCGAGGGCTCAAGAGACCAATGCCATGAATCAGACTGGTGCTTTGGACGAAGCCCATCAATCTTGCTCATCAAAAGGAGCCAACCTTCAAATTCATCCTCCTTTAAATTCCTCCTAAACTGTATATTCCAGGAGTCAGAAGGGCTATCCCAGAGAGTGGCCACTGTCATGTTAGCATGAATGGAGAGTCTGTAGAGATTGGGGGAAAATGGAGAACAAAGGCAAAGAACCATGTGAGTATTTAGACCATTACCAATCTTATAGGAAACATTGGTGGCAATACAAGATTTTACCTTCCAAATGTCTCTCCAAGGACCTCTAAAGCGACTGGAGGCTTAACCATCTGGCCACCAATCAGTGGGAGTGGAGCCATATCCGTCACGAATAAAAGTTCTCCATAAAGCTATTTTTTCATGGGCATATCTCCAATCCATTTGGCCATGAGAGGTTGATTCTTCTCCTGGATATTACCCAGGCCGAGGCCTCCCATCAAAATTGGGCAAGAAATCTTGTTCCAAGAAGCCAAGTGAGGGGACTCCTTCCAGTAGAAATCTTGAATGATTTTTTCGATTGATTTTGCAACCCCTTTTGGCATTTTGAAGACTGATTCCGTTTCCCTTTCAAAAAAAAAAAAAAAAAAAAAAAGGAAAAAAAAGAAATTTTTTTTATTAATTGGGTCTTAATTTTAAAACCTAGGTAATAATTATCTATGGTTAAAGAAGCTCTGCTAACCATAAAATTTAATTATCCTTGTGGGAGATGGGCTTTTCTAATTTTTGAGAGATGAATAATTGATAGGAGAGTGTTATGTTCTCGCAAGGTTCGAAATGTCAATGTTGATGTTGGACTGGATTGTTTGGTCGAAATATTGATGTCAACAAAAATATAGAAAAAAAATAAAGATCATGTAAACTTGTAGAAATTCTTGAATTAATAAATGAAACTTTAACTAAGGCTTAATTCATCATGTTTAATACTCATATTCATACACATAAGCAATATTTGCATAGTCATATATATGATTCAAAGATTTATTAAACATCGATATGCGAACAAAAGTTTCAAAACACAGAAGCAATTTCATAATATAAAAAATGTAATTACATGGTGTGATAATAGAAAACCAAAAGATGACAGAGATTGTTGAACAACATCAAGATAATTTTGATAGTAATGTCAATAATGTTCCTAACTCATCCGTTCTTGGAATCAAGTAAGATAATTGTCCATTTGCTTAGTGTACAACTTAAATGTTGAACAAATGGACCTTATAGGTGAATCTCTCATCAAACCAAAGTGCATATAATTGGAGATGATCATGAGATGTTTTTTTTCCCTTTTGCATGAGATTTTCATGAGACCGTTTGGGTCCTTGCTATTTTTGGCTGGAAAGAAATTAATAACTGTAGGATGTTTGAAGGTAGGTTAAATTTAGGATAGAATATGGGTTGAGAGTGTAATTTTGTTGCATAGAAATAAGACCTAGTGATGTTAATGGTGGTGGAGGGTTTGACTCTTCATGCTGTCAATTATAAAAATTATGTCTGTTAGCAAAGTAGTGAGAACTTGAAAGAGGCTATTGTCTTTAAACCTTTCATTGGATGCAATAGGATTCTCTAGACTCTAGTGGTCCTAGCTATTTTTGGATTATTTGGCTTGAAAGGAACATAAGGACATTTGAGGAACCTCTATATTCTTTCAGAGAAAGGTAAGTAGCATAAAAGAATTGAGTTGAAGCTTTCTAAAATCAGGAAGGAAAGGTAAGGGTTCTTAGTCCTCTTTAGTATTTCGTTAAGTCCTTATAAGTGCTTGGTACCAAGTTGAATTACGGATTTGAGTAGTTGCTGCATTGAGAGTAGATATTTTCATGTTTTGCTTGAAAACGATTGGTTTTTATTGAAGAAATGGAGAGAGATATTATGCTTCCTCTTCATTTGTCCCAAATTAGATGGTTTGAGAATGCTTTGGTTGAGGTGATGTAATTTCCCCTTCAGTTGTGGTACTTCAAGCAAGATAGGGATGTGAATGGTACTTGTCGCATTTAGAAGAAACGAATTCATAATGGTTGGTGTCTGGAGTGTGTGGTTTGGCCTACTACGGGTGGAAGAAAGAGTGTTTTTGTCCCTGCAGGATTCAATAAACGAGGATGGTTAACTTTTTGGGACATGATTAAAGACTTCTTGTTGAAATATAATGAAAGAAATTTGATGAAAGAGCCTATCAAGCAAGGAAATCAGTTTATTGATGCTTTGTCTCGGTCTTCTTATGCTGAAATTGTTATGAAGGGGAATGCAGAAGTTGAGTTTCAAGGAAATCTGAACACCATCCCTGATGTTGCTAGAAAGAAAATAAGGAACTTTGGAAAGAAGCAGGAAAGTTATTGGGTTAAAAAAGAGGCTGATGTAGTTGAGATTGATTGGGACAATCTAATGGTAATCACAAGACTTAATGCTCATGATGATTGGAGGGCAATCTCTGCGGTTTTGAAGAAAGAATTCTCACAAGATTGCTTGATCAATCCATTTATGGATGATAAAGCTCTTATGAAATTTGAACATGGAAGAGTTTTGAGGAATCTTGGAAAAAATGGAAAATGGGGGAAAATTGGTAATTTGCACTTGAAAATAGAAAGATGGAATTTTGAGAAACACAATAGACAAGAAGTGAAGATGGGTTACGGTGGATGGATCAAATTACAAAACTTACCATTGAAATATTGGGATCGACAATGTTTTGAAGCCTTGGGTAATCAATTGGGGGGCTTTGTGAGAAGTTCTCACAAATCCTTGAACCTGTTAAATTGTAAAGAAGCTTTTATTGAAGTAGAAAGGAATTCTTGTGGTTTTCTCCCAGCAGAAATAGAAATCAAAGATCCACGATTGGGTAGCTTCTTTGTTAGTATTTCGGTTGTATATGAGACGTTCAACGAGCATGATGGATCTTCTACAAGAAAAAGAGCCTCCAAGAATCTTGAATTAAGTTTGTTCAACAACAAACTAGATATAGAGAGAAGTATGAGAGTTTTGGAAGATGAAAAGTCACTTGAGAATTTTATGCCTATTTCTTCAGAGGAAGAAGATGAAAAGACAGCTGATTCTTTTAAGGAAGTGGGTGAGGAGGATTTCATGCGTAATTCAAGTATTAATGATAGACTCTTGGTTTTGACTCCAAGACAGTGTATGGAAGATGGGGTAATTAATAACCTTAATAGGACAAATGGTGAGCAGTCGTTCGTGCACAAGGTGGATATTAATGTGACAAATAAAAATGACTTTTCAAAGAAGCAAGGGGCATATGAAGAGGCCTTGGTCTCTATTTCAACAGAAATATTGCCCAACAGATGTTTGTTATTAAAGGACAAAGAAAACGATTTACTTCTCTCTACTGTCGAGGAAGATGAAAGGTCACTTAATGTCATAAAAGATTCTGGTGAGAAAAATGATTTGAATCTAGCCAAAGAAGATGACGAGACTGTCAAAGGAATACCCTGTTTTTCTGACAAAGAAGCTGAAAAGTCTTTTTATATTAATGCAGTAGGTAAGGTGTTGGAAAAACAACCGAGTATTAATTATGAAAATGAATTATTCTTAAGTTCCCCATGTATGAAGAATCCTGTGTATGACTCTCCATTGCCCAACAACATTTTGGAATTGAAAAACGCCTTTAATGTTGCGTATCCCCAAGCTCTTTCCCGAAGAATTGATGAAGTTGCCTCTTTATTGGACTTTGACTCTTTGGTTCCTTCGGTTGGGACTCCACCTCCCATGCCTTTGGTACCCTTTGAAAATGAGGAAACTGTAAACTGTTGTAACTCAATATTGAGAGAATTCTCATCTATTTCGTATACTAAAGTCACGTATTTATAAGTACACAAGAGAGCCCTAGATTAAAAGAATGTAAAATTACAATAAAGGACAAATATACTAATAACATCTAAACTAAGGTATTTATACTACATAATGGTGTATATCATAACACTCCCCCTCAAGCTGGAGCAAATATGTCAATCATGCCCAGCTTGTTGCAAAGGTAGTCTATTCTTGCTCCATTTAAGGCTTTTGTAAAGATATCTCCCAATTGTTCATCGGTCTTCACATATCCTGTAGACACCTCACCTTGTTGTATTTTCTCACGAATAAAATGACAATCAACTTCAATATGTTTAGTCCGTTCATGAAATACTGGATTAGATGCAATGTGAAGGGCAGCTTGATTGTCACACCACAACTTAGTTGGCATAGTGATACTGAAGCCTAACTCAGTTAAAAGTTGATGTATCCATGTTATTTCACAAACAGATTGTGCCATAGCTCTATATTCTGATTCAGCACTCGAACGTGAAACTACATTTTGCTTCTTACTCTTCCATGAAACCAAATTACCTCCTACAAAAATACAATATCCAGAAGTTGATCTTCTATCTTCTCTTGATCCAGCCCAATCAGCATCTGAAAAACACTCGACTTTCGTATGACCATGATCTTTGTATAAGATTCCACGTCCAGGTGCAGCTTTTAAATAACATAATATTCGTTCTACTGCAGCCCAATGATCCACTGTAGGCGAAGACATATACTGACTCACAATGCTTACTGGATAAGCAATGTCTGGTCGTGTCACTGTTAGATAATTCAACTTTCCAACTAATCGTCTATATCTCTCAGGATTTTTAAATAATTCACCATCTTTTGTAAGTTGTAAATTAGGCACCATTGGAGTACTACATGGTTTGGCTCCCAGTTTTCCTGTCTCAGATAACAGATCAAGCACATACTTCCTTTGTGATAAAAAGATACCATTCTTGCTTCTCATCACTTCAATACCCAAAAAGTATTTCAACATTCCTAGATCCTTAGTATGAAACTGACCGTGAAGGAAGTTTTTGAGAGAAGAAATACCTGATATATCATTTCCAGTGATAATAATATCATCAACATATACAACAAGAAGAGTAATGCCACTTTCAGACCGTCGATAGAATACAGAATGATCAGACTTACTCTTCTTCATTCCAAACTGCTCAAGTGCTTGACTAAATCTTCCAAACCACGCTCGTGGACTTTGTTTCAATCCGTACAAAGATTTTCGAAGATGGCAAACTTTATCATTCTCCCCCTGAGCAACAAAACCAGGTGGTTGCTCCATATAAACTTCTTCCTGAAGATCACCATGTAGAAAAGCATTCTTTATGTCAAGTTGATGCAAAGACCAACGTTGGGTAGCAGCCATGGAAATGAATAACCTGACAGAAGTTAGTTTAGCAACGGGAGAAAATGTGTCAGAGTAGTCAATCCCATAAGTCTGAGCATAGCCTTTGGCAACCAAGCGTGCTTTCAATCGGGCAACTGATCCATCTGGGTTGACCTTAATTGCAAATACCCACTTACACCCAATTGCTTTCTTTCCTGCAGGACGAGGCACTAAATCCCACGTACCATTATCATCTAAAGCATTCATCTCTTCAATCATTGCATCACGCCAGCCAGGATGAGATAAGGCTTCATGAATAGTTTTAGGAATAGAAACAGAATCAAGGGACGCAATAAAGGAATATGTGGGTGATGACAAATGGTTATATGAAACAAATGAAGAAATAGGATAAGTGCATTTGCGTTTACCTTTACGAAGTGCAATAGGAAGCTCATCACTTGGTCCTGGATCCAATGTCGAAGAATCTACTAGTATAGGATATGTGCCTGGAGGTTGTTGTTGTTGTCGTCGCCTTGAGTAGACCTGAGTAATGGGTGGACGAGGAGGGACAGATACCACTGGAGGTGGAGCAAGGGAGGAATCTGGAGAAACAATGGTATAAATAAAGAGATCATCCTCCTCTTCCTTACGCGTACTCGAAGGTGCTGGATTAAAAGGTATATCCTCAAAGAAAGTAACGTCAGGAGAAACAAGGTATCTGTTAAGACTAGGACAATAACAACGATATCCTTTTTGAACACGAGAATATCCTAGAAAAATGCACTTTAATGATTTTGAATCTAACTTTGTGAGGTGAGGACGGACGTCTCGAACAAAACATGTACAACCAAATATTTTTGGATCGATAGAAAATAATGGTTGTGTTGGAAATAGAACTTGATACGGAATCTCACCATTAAGAACAGATGAAGACATTCGATTTATTAGGAAGCAGGCTGTGGAGACAGCATCAACCCAAAATGCTTTGGAACATGCATTTGAAATGATAATGCTCTTGCTGTTTCAAGGAGATGTCTATTCTTTCGTTCTGCAACTCCATTTTGGGATGGAGTGTCAGCACATGAAGATTGATGAATAATGCCATTTTGACATAAATAAGATCCAAGCATATTAGAAAAGTACTCGCCCGCATTATCCGTTCGCAAAACTTTAAGAGAAACGTTAAATTGAGTTTGGATTTCAGCATGAAAATTGCAAAAGTGGGAGAGCAACTCAGAACGGCTTTTCATTAAATATAACCAAGTCAAACGAGAATAATCGTCAACAAAAGTAACAAAATATCTAAATCCAGTTTTGGACACTACTGGACAGGGAACCCAAATATCAGAATGAACTAATTCAAAAGGAGCATTTGCTCGTTTATTGACTCTAGGACATGAACTAAGACGATGAAATTTGGCAAACTGACAAGAATCACAATTTAAAGAAGACAAAGAATGAAACTATGGATAAAGTTTCTTCAACACGGACAAAGACGGATGGCCCAAACGACAATGGACTTCAAATGCAGATGCAACTCCAGAACATGCTACAGCCTTTGGTTCTTGGTGAAAAAAAATGTAAAGACCTCCAGATTCATATCCTTTACCAATAATCTTCTTCGTCACAAGATCCTGAAACAAGCAATAGCCAGGAAAGAATGAAACAGAACAGTTAAGATCACGAGTGAGTTTACTAACAGAGATTAAATTAAAGGAAAGTTTAGGCAAATTTAAGACAGAGGACAAATGTAGAGATGGTGTGAGATTAATTGTGCCAGATCCAAGTACAGACGATGTTGATCCGTCTGCTAAAGTAACAATAGGAGAAGGTGCAGGTGACAAAGGAGTAGAAAATAAGCGGGAATTACTTGTCATATAGTCTGTCGCACCAGAATCTATGACCCATTTGGTGGAGGATGAAAGAAGATAATGATGCATATTACCTGACTCAGCGATGGCTGTAATAGGGGTAGATGATGATTATGATGCTTGTAATGATTCCTGGTACTGCTGAAATTTAGCAAAATCATCTGCAGAAATGGTTACTGACTTTTCAGATGTATCATGGGTGGAAGCAACCTGAGCAGATCGAGGTCTTTGGCCCTTATTCAACAATTTCTTGCATTCGCGTTTCATATGACTTGGCTTACGACAATAGTGGCATATAATCTCCTGACCATCTTGTCTGCGATTATCATAACCAGTTCTGGGAGTATTGGTGCCCATCCCTTTATCACCTCTGTGATTATTGTTCATACCAATGAGAGCACTGCTTGATTGAGCAGGAGAGGAACCTGATTGAAATTCTCCGTACGAAGAACTCTACTAAAAGCTTCTTGTAAAGAAGAGATGTCAGGACTAGAAAGAATTTGTGCTTTAGCCATTTCAAATTCAGATGGAAGTCCAGTCAAAAAACTCATGACAGCCATCTGTTCTCGCTGAGCTTGTTGAACCTTCATATCTGTACTAAATGGTAGTAACATATTGAGTTCTGCATACGTTTTCTTAAATTCCATAAAATAACTTGTAAGTGATTGTTCCTTCCTTTCTGGACGATAGAACGCCTTGCATACCTCGTACATTCTATTGACTTGCCCTTTTCCCGAGTACAAAAATTCCAAGTACTCAAGTAGTTCTTTCACAAATTCACAGTGATTAATTAAGCCAACTACCTCACTATCAATGGAATTCTTTATCTGAAGAAACAAACGGGCATCATCTCATAACCATGTCTTCTTTGTGTCATCATCTGGTGGATCATCAGTGATATGGTTATCCATCTCTATGCTTCGTAGATAAAGTCGAATGGTCCTACTCCAGTCATAGTAATTTGAACCATTTAGCTTATGATCTGTGATCTTTGACGTTAGAGGAACCACATCAGATATCACTATGGGTTTTTTCTCAGCCATAGTCCCAAATAAAGGTTGTAGAAACCCAAAATCTGATAATTATTGACCAAAAACAGAACTGAACTATGAGGCAAAACCAACTATTTTCCAAACCGCTTCGCACCAAAGTAGATCTTGCTTAAACCGAGCACACAGACAGGACGAACAGCATGAAACATGACAAGAGCAGACGACGGAAGACAAATTGGTTGTCCCACGCGCCTTCACGCGCCGGCGCGTGGAGTCTGAAACGGTTTTGTTGGCAGCGCGTGTAGCCCACGCGCGGGAGTTTCCAGCGATCGGCGAAGGCGATCCGGTCCTCTGGACGGCGGCGCTCCTTTTGGGGTGGGTGGTGTCAACAAACGGCCACTCCAATGAGATGACCCAAACTTCAAACCCTAACCTCACTGAGAAGAAAAAAACCCTAATAGCCCCAAAAGAGATCTCTAAAACCCTAGACAAGGCTCTGATACCATGTAAACTGTTGTAACTCAATATTGAGAGAATTCTCATCTATTTCGTATACTAAAGTCACGTATTTATAAGTACATAAGAGAGCCCTAGATTAAAAGAATGTAAAATTACAATAAAGGACAAATATACTAATAACATCTAAACTAAGGTATTTATACTACATAATGGTGTATATCATAACAGAAACAATGGCTTTTTCCACGCCTAAGAAAGATGTTGGAAAAGACTTCTCCACGGTTATAGTAGTAGAAGGCAAAGAGTCAAATCAGCAATTAACAAGGATATTTTGTTAATGCATCGTATCCTTGCTCAAGGTACGATAGATAATTCTTTGAATTCTCTATTCAAAGAAGTTCAATCTCCTCCACATGCAAACATGCAATTCAAGGATTTGGTAGATTTAGATATTCTCAAATATTGCAAGGATGTGGGCATCACTCTTATTGAGTTCAAGCATTAAAAAGGGTATCTTCCGTATAAAGGTTGGTCAGTTTCCAAGGGGATTATTTTGGTTTTTACGTTTTTGAGGTTTCCAAGACAAAATTTAAGGCAGATTTTAAGGTTGCATGAAGATATATGATGCTAGTTGGATTTTTGAAGTTCCAATCAAGCATTTACTCGGCTGATTTGTTTTCAAAGCTTCATTAATTTCATATTATCTCATTAATGGCTTCTTCATCCAATCTCTTGGTAAGGATTCTACTATTTATATCTTGGTTGGATGTGGTCAAATTTCATCAAGATTTTCTTCATATTGGCTTCTCAATATCAGAGTCTTTTTGGTCGGTTTCCATGGCTCTACCTTCTGCAGACTATTTTCCGAGTTGCTCCAAGAAAGATATCTTTTCGACTTTCAATGAAATATCTAGCATGCAATCAAGTTGCTTGGTTGAATACATCGCCTTTCTTGAGAATGAGTTGGAAAGGCTTCGTGAAGAAACTTTTCAAATCAAAGTTTCTCTTGTCAAGAAGCGGATTCAGTTGTTGCAGGACTATAAGAAGGAGCTTGAGGAAGTTCTCGTGTTGGAGGCTGTTAAGAATTAATTTTTGGTTAGCTATAGTATTTTAGAGGCCTGGGTGTTTGTGCAGCTGCTTTTGGATCGTTTTGTTTTGATGATTCTTGGTTCACTTCAGTTGATCGGGTTGTAATGCTTCATTTAAAGCATGTTCTTTTCTTATTTTATCTTTTCTTTCTTCTTTTATCACTCCCTCGGGAGTTTGTTTCTTGAACATTTGTTCCTTTTCATTTTATCAATGAAAAGTTGGTTCTTGTTCAAAAAAAGGACATTTGAGGATAAAGAGTGTCAAGCTTTTGATTTTTGCAGTGTGGTGCATTGTAATGCATTGGTTTGGGCTTTAAGCACCAAGTATTTTTGTGATATATCTCTTTTCTTGATTGGTCATATTTTGGGGCTGGGTGCTACTTATTAATTTTTTCCCTATTTGTTTTTTTTCCCTTTGATGAGAAACAAAACTTCCCTTAGAGAAAGTAAGTACAGATGGAGAAGATAAGGAATCATCCTAGCCTTAAGTCGAAGGAAACTGCAAATGCTCCAAGTGGCTTTACTTACCGAGTATAATTACAGAAATACTTAAGACAGAGAACCTAAAACAGAAGCTAAACTACAAATTTGGTCCTACCATTACTCGTTGTCCCTGAGTTGTTTCAGTTTATTCTTCCAAAGAGAAGCCTTGATTGCATTTTTTAGTGTAAGATTTCGGCTTTTTTTCTTGACTGAGTGCACACACAAAAGCTGGTATGGGAGATTTTTGAAAATTCAAGTCAGGCAACCTCAATTACCAAAGCAAGTGGAGGCAAAAGAACTATTAAAAAAATACAGTGGAGGTTCTCAGCAATTTGTTTGCAAAGACAACACCAGTTAGAAAAGATGCTTAGAGTTGGATTCCCCTAAGAACTTTGTCATTTGTGTTGAAGCCAACCAAAGCCACGCATCATGAAATGAATTCAATCTTTTGAGGATATTTTCTATTTAAATCACATTTGATCATTTCTTTTGGGAGAAACTGGTAACCCCCAGATATTTGTCTAAAGAGAGACTTGCTTGTAAATCTTCCCAACTTATCCAGCCACCACAGCTGGGTATCATGTTGAGGGATAGGCTTTTACCCTCAAATTTATTTAGAAAGAGATATTCATTCAGAAATTTCCTCAGGATACTTAATGATTGTATCGTTATTTTTTAGTTTTTGGCTATGTATTATAAATCTCACTTCTAATTAAATTGTATTGAACTATTGATTCTACTTTTGGTTCAAGAATCCTATACGATGGATAAAGGTACAAGATAACTTCTGTTTTGACTAATTGATTCTTCGTACAGGAGTGCGACTACCTTAAGGCAATAGCACTTCCTCGCCTTGAAATTTCCACTGTTGTGGATACAAAAACAGGAAAGGTATGTATTATGACCACTTTTCTATTAGCTGATGATCCAAAGTATTAGACAACTCAGATGTTTTTTTCTTTAATTGCTTTGCTTGGCTAAAGTGCGTATTTATAGAATGTTGCAAATTTATGCTGATAATTATGATAAGAGAAAATATCACAGGTTATGATTAGATGATATATTTGTATAGAAGTGCTAATACTGTTCTTTTTGTGATATAATTTTTGCTGTTTGTTCTTAGCCAACAAAATAGGGTAACCGTTAGCATTTATTTATTTTACCTTGCGTGGATTTATTCTCTTGATAGATTCTACAGTTCTGGATTGTGATTGATAGTTTTGCTTCTTTCTTTTGTAGTTTTTGCATTGGAGGACCCACATATTGATGCTCTTTTGGTTGAATAATTATTTTTTTTCCTATCAGAAAAAAAAAGTTTTCCTTCCAACTGACAATGACCTTTGAAATATATACGCGCTTTCCCTTGTGGATTTATTTTCAGGGAATTAAGAGTGATTTCAGAACGAGCTCTGGTATGTTTTTAAGTTATCAAGAGAAAAATTATCCAATGACCCAGGTATTGATTACAATACAGTTGTAGGTATATTTTGATGTTTTAAGTAATGGATATGCTCAAAAGTCAACCAACTTTATGGAAAAACTTTCAATTCAGTTAAAAGCATTCTTTTGGATTAATTTACAAATCTGTTCCACTTGTTTCAGACAAAAATTTAATTTAAAAAAAACAGAAATTGTCATATATGATGACCCATTTACGGTAAAGGCTATAGACATGCTAAGCAATCCAGTATATAGTTATATAGACTATAGACTATTCCACATGGTTCTCTTGTTATTTGCTTTATGACAGTTGTGTCTTAGGTAAAATGGGCCATTCACTAATCTGAAACTCCATTTCCAACTTCTTCCATTCCACCTTGACCACCCTAAACATTGACAATCATTGTACTTCCACTGTCACATCTGAACTCCCAAAGGGCAAGGACAACAATCCTCAACTACTCTCAAGTCAGGTAAAATAAGGCTAGGACCACACTTTGACAATTCCCTGTTCCATTGACGATAACTTCCTAGTACTCTTGATTGCCAAGCTCATTGTATCACTCATGGGTAACCCGAGTTTCTTTACTAACTCGTCAAAGATAAAATTGTGAGTTGTCCTGCTGTCAATTAACATAATCACCTCCTTTTAGCCAATATGACCTTTAATCTTCAGGGTTTTTATTCCCAATCCCTCAATAGAATTTAATGATAGCTCGGCCTACATGCGACTCCTTGGTTTGGAGTGGTCAACAGTCGCTTCCAATGCCTTTTGCATGTCATTAACAACTATTTATTCAATTTCAGCTGCGTTGCAATTATGCATGACCATTACTTGAAGCTCCTTCTTACCGATGGCCTGGAGAGAACTTCTCATCACACTTGAAACAAAGTCATTTTCCCCATTTCCTTTGGAGCTTACTGTCAATGATTTTCTTATAAGATGAGCTGTCACTTGTTTTAACGGAGGGTTCTCGGGTCTGGGTTGCACAATTTTGGGCAGGTTTAGGGTCATCGATCAGGTCAGAATCTAATCCATAGCTAATGTGTTTCATATTTGGAGTGAAGGGAGCCCTTGGTTTTGTTTGTTAGGCCGAATTTATGTCTTCTATCCCCCGAACCGCCTCCATTTTTTGGATTAACACATTGAGTTGAAAACGCCTGTAATTTGTCTTGATAACTTCTTTGAGGCCCGTTACGAAAGGACTTTCCAGAATATTTTCATAAAGGATAGAAGCGTGTTGATGCAGTGCTAGGAATTTCTCATACATTGTCGTTCCCTCATGGGTAGTTTAGAATCGTTGCAATAGTAGCTGGAAAAATTCATCCTACCTGCCAATGTGCCCCCTCCAGACACAACATAGTTGTTTTGAGCTTTTCCTCTATAACTTATTCACCACAAAATATCGCTCCATTCTATACAACCAACCAGCCATCTAAATTGTCTTTGTTGAGGGGTCGGATCTCGAGCTTCCTCAGCTGCATATCATATAATCACAGCCTCCTGGTAGCTCAACTACTTGCTACCATTTCCTTCACGTTCCAGGTTATGTAATTTCCAATGACGTTGTGACTTCAGCTCTGCATAACATTCTCTCCATCTTTTGAGTCAACACCTCCATCCCTTTCTCAAATGACTTATTCAACTCTTCACCAACCTGCACCAATTTCTTCTCTACCAAACTTTGCACGCTCGTGATGTTCCCCACAATTGCATCTAGTAGTGCTTCCAACCTTTTTTTTTTAATATTTTTTTTATTTGCAACCATTTCCTAGAATTGAACTACTTTGATACCACAATGATTGAACCAACCATATTTACTAGAAATAGAGAGGTACATACATTGCTAAGGGATAATTCAAGAGATCCCTAACTCTCCCAACTCCTAGTCCCACCACTTCCAAAATGTAACCTCCTATTGAACTACTTTCTTAAATGCTTGCTGAGATCACATTCTTACGCAAGCAACTGTTAGTTATAATGGTCCCACACACGTCATCACCCTAACTACTTATTAGACCTATCCTCTACCCTCTTTCTACTATATACTTTAATAATCGGGGGACCTATCAGTTTCCTGACAACAAACCTTAATAATACTGTACAATGCTATACTCCAGCTTTGCCATAGAAGGAGGCCCTATCAATTTTTAGCTAAAAAATCAGGCACTCTGCTTTGCCAATGGCCAGACTTATCAAACTTCAGAAACAAAGAACACCAAAGCAATTTCAAGACGCATTAAATCACCTAAACAAAAGCCATCAAATGCCGTTGGAGAAAGGACAGTAAAAAACAAGTGGTTTAGACTCGTTATGACTGAAGCACAGAATACACTGATTAGGACACACAAAATGCTGATTTGGTTAGCAATCCATCAGCCATATTAATAAAGGGTCCCAAGGAAGAACTTGACTTTCTTGGGATATTTGTGTTGCCAAATAGTGTAGTAGTGGATCTTAGGATAATTTATCTATTCTCAGTAAGGAAGTGAAGAAGGGATTTAATAGTGAAATCCTTAAAAGAGTACAAAATCCATATTCTTGATAGTTTATTAGTGCATCCTGAAATAATTGAGTACGTGAAGGAGGTTGCCGTCCAAATATCTGTCATTAAGATTTTTTCTAATTTCAAGTCCTGACAACTAGAATCTTTCTTCCGAAGCTCTCTAATGGACTGTTGACGACTCTCTAGCATAGCTTACAGTTTTGGGAAGGAGTGACAAAAACCAGAGCTATCTTCCCAACAATCCTCCCAAAATTTGATTCTAGAACCGTAGTTCACTGAAAAGAAAGAGAAGTTGTTGTCCAAGCACTCGCATTGGACAATGAATCCATAGGAGGATTCGCTGTTTGAAACATTGAATAAGGTAAACCAACAAAGAAATCCTTACTGTAAATTCTAGCCACTATCTCATGTCAAGGGGCTTGACTCAAGGGGGAATCTCTAGCACTATTTAGCAGTTAGAGACAGCTTTTTCTTTACAAAGGTTATTGAGGCCTAAGTCCCCTTTGTCTGAAGACTTGGAAATATCATTCTGCATCTAACAAGATGAACAGCAATTTTAGGCTCTTTACAAACAACATTCTTAATTTAGCTTCGATAATTCTGAGTTCCGTAGAAACTCACAAAAGAGATAAATCATATACCCGGATGTTAAACCATAGGCCAAACAGGCTTGAAGGGACGGAATTAATAACTGCAAATGGGTAATTAATACCATTATTGAAGAGTTAGAAAGTCAATCCTCAATCCTGAAGTCAATTTGAAAATTGTGGATCTCGAAGGTCTGGTACACTTTTTGAAAATCCTTGAACGCTAGGTGTCATGTTGAAAGTGGTTAAGGTAGATGGATCTCTTCCTTCCGAAGCTCCTTGGTAATATTAGGAGTCCAACCCCTCTCAAACAATCTTCTAAGCGCACCCTGATCAAATTATGGTAAATAAGATGGAAGTTAGGGGGGTCTCATTGCCTAAGCCTCCTTGAAGTCACAATTTTTCCTCTCAGTTAATAGAACAAATGGCCGAAGAGATATTGTTGTAAACCCATCCCATCCACTTAGAGATGAAGTGTACCATCTCAAGAAACATCTAACAGAGACCAATCCTTCAAATCATGTCTTCTCAAAATCAATCATAGAAAGGAAACCGTTCTTCTTCCTCCTCTTACAACCCTCTTGAACTTCATTAGCAATCAGACCGCTTCCAGCATTTGCCTTTTATGCTTCTAGTAATTATAGATGGCAGAGGCTAGAACAGTTTTCAACTTTTTTATCAACAACCTCCGACGAGACAATAAAGCATGTAAATTTTTATTGAGAATCTAGTAGTTAGGATTTTCTCTTTGAATTTGTATTACCTTTTTGACTCACGAGGAAATGTTATGAATGCTAAACGTTGCCAGCCGTCTTTTATTATTACAGCCTGTTTTCTTATCTTTGGGGCTCTCTCTCTTGGTTGGTGATTGACTTCCTCTGTTTTGAATTCTGTGAAATTAGGCAATCGAAAAAAGAATTTCTGTCTATTCTCAAATACCAATAGAAAATGGAGAGCTCATTCAAGTGTTAAGGTATGGATATAATTTTGTTTCTGAGAGATGTGTGTGAACATGAAGAGTGGTTTGTCCTTCAGTTTTAAGGCTAACATTTCAAGCAGTACAGGTATGAGAAGAATCAATTTTACAAACCTCATCATGACTACTTTTCCGATACTGTAAGATTCTGTTATTTCTTCAGCCATGAGTCTTATTATGCTGAACTTGTGCTCTTGCTGATGGCACGTGTCGTTTCTTCCATGTTTTCTTTTTACAGTTTAACTTGCAGCGTGGTGGTCAGCGAATAGCAACCATGCTTATGTATCTAAGTGACAATGTAGAAGGTGGAGAAACCTACTTTCCGAAGGTATTTACCTTTTTCGTTACTGAATTAAATTTCATTGCTATAATGCATTAGTTTACTTAATCTTTTTATAGAAACAATCAATTAGTATTAATGAACAAGATAATTACGAAGCAAGAGGAGGATAAAGGTATCCTCTTGGACAGAAGCCCAATTCGATGCATTGAAAAGAAAAGTTTACTTTTGCCGACCATTTCTTAGATAATTGTTTAACATCCTCTTTTTTGATTTCCCTAAAAATGTCTTTAATTACCCATCAATTATATATTCGTCCTCTTTGTCATTCCACTATTAGTGTCCTTTATTACTCTTTAATTAAATATTTTAGTACCCAACTCGTTATGTTCCGTTTTAATCATGGAACTGGAGTTGATACTAGAGCAATTAGATAAACTGAACGGTATTCTCATTTTCCATTCGAGTTTCTCATGTATGACATAGGAAGCAAGGCAAGAGATGAATAGGGTGAGAACTACTAGCCTAGAAGCAAGCATGGTTGGACATGACACTGGCATGAGGATACAACTTTTTCTAAAAAATAAGGACATGGGTATGGAAGGGACACATTCAATAAAATATTTTTTTGCAAAGTATTTTTGTATACTTTTCTACATATTAATGCAAATTATATAATAATAAAGAATTAAGATATAAAAGTATATTCTCAACCATTCAATTTTAAGCAAATTTAAATATTGTAAAGTCAATTATATTTCTCTATAGACATAATACTTTTCAACATTGAAGTGAAGTCTAGTGTTCCATTAAAGACACAAGTTTTCTGAAGGGGGAAAAAGATCGCATCTCAAATGTCATTTTTGCATCTTCTTATACACTTCAATAAGGTTTAGTAGGGTGTAAACCTACTTGTTTTCTTTCCTTCTTAGTTTTTTACTTTATTTTCCATAGGCTTTTATGCTTTCATTCATTTATATTTAGTTCCTTGACATAAATATTATAAATGTGTTTCACTAGTTGGACACTCCAAATCATGTATTCTACGTGTTTTCTCCTAATGTTTAAATGTCCTCATCGGACGCATATCAACACTTGTACTTTGCACAAAATGGAGCGTCCATACTTTAGAGGATTAGAAAGAAGAGAAAAAAGAAGAGAGGACAGGAGAAATTCTAGTATATCGAATAGGCATACCCATTACCAGAGGTAACTTATGAATTATGCTGATTGACCTCAAATGAATGTGGTAAAACAGGAGAATTTCTGATTTGGGCACCTGCTTGAGTACTGTGGTGTTCTATATTTAGAAAACAAATGTATTAGCTTTTTATTTATAGTTATGTCATTCTTTATTACTGGACGATGGAGGAATAGAAAGTGGCATGTTGCTACGGCATCATGCAGGTTTTAAAATTAAACTTGTCTGTTTACAAATTTCATTATTTCAAAGTCTATTTTTAAAAAACGTTTTTTTAAAATTTTACTCAATTTTAAAACTAGAAATGTTTTTGTTTTCGAAATTTTGGCAAAGATTGCAAAAGTGTTCTTAAACATGAAAAAATAAAATAATGAAACTAGTAGTAAACGAACTTTAATTTTTTTAAAAACAGAAGTAAAAATAAGATTGTTATCAAATTGGGCCATAGAGTATCATTTGATTGTGTTTAAGTTGGCAATTATGGTTTCAAAAACTAATGGACGTTTGCAGTAATCAAGTGATGACTAGTAGCCTATCATCTGAGAAGGAAGGCCACTCTGAGATACATAATGATTGAACTTTGAAATTCTCTTTCCCTTTCTTTCTGGCATGAATACAGCTATTGAAAGACAAAACAGTGCCGGGTTTTTTTCCTGTGTAATTTTGTTCTTCTGGTCGTGTCCAAGTTTCTTGAAAGTGTTGTGTTCTGAACTTTCCTTTTATGTTTTTCTCAGGCTGGTTCTGGCGAGTGTAGTTGTGGCGGGAAAACCGTCCCAGGGTTGTCTGTTAAACCAGTCCAAGGAGATGCAGTACTCTTCTGGAGCATGGTATGTATCTTTTGCCTCTCAAGACAAAATGGATAATGTTAAAAAAAAAAAGTTGATTACGATTTTGAATATTAAATCATCTAATCAACCCCAAGAGGTACTTTTTGGTGGTAAGGGTTTAAGATGTTTAGAGATGTCCACCAAAGTTCTAGCCCCTAGCCTCGAGTTGAAAACTGAATAACTTAAAACGTTTGTTGTCTTTTGAAGTGGGTGAAGCCTCTCCTGAGGGTGAGCCACGAACATAAGGGCCCTTATAATCCAGTTATAAAAAAAGAAAAGAAAATCATGTAATTGAAAATTTGAGAAAGAAAACTAGCAATCTTGTGAGTTTTTTTAACAACCAAATGTCGTAGGTTAAGGGGTTGTCCAGTGAGATTAGTCGAAGTGCGCACAAGCTAGCTCGAACACTCACGGATATATGAAAAAAAAACCAAAAAATGTATTTGCTTAGTTTTACTTCATATTGATATGAATGTGGATGTAGAGATGCCCTAGTGTTTTTAACCATTTGTTTTACATGTGATCACGTCAGGGATTGGATGGACAGTCGGATCCTAACAGCATTCATGGAGGGTGTGAAGTATTGTCAGGCGAAAAATGGTCTGCCACAAAATGGATGAGGCAAAAGAGTACTCTGGTACCATAGTTGAAACTTAAAGAGTTCCCATTGTACATCGGCATCTAATATTTTGTTACATAAGAAGTAACAAATTTATAGAGAGAGAGAGAGAGAGAGAGAGAGCCTAGTGTAGATAGTGTCTTAAAATATTAATAATACTGGGTAAATCAATTTATTTAATAAATAATATAGCTTCTGTACTCCCTTCTCATTTGCTTTTCATAATCCTAATTGAAAGTTGTTAGCGAGAAAAAACAAAGAATATCTATAGGTTTTTGTGGGGGAAGTGTAGGGATTCTCTATATACAAGGAGAAGTCGTAGAAATGACCTAAAATGTAAGAGGCCCACTCATGCTCTTTTTAATTCTGTTGGTGTAATGCTTACGTTACATATTAAATATATTATTTACAGATTATGTTACGGACTAATTTTGGGTAGGTTGATGTTGAACTGACGAAGCTAATAATAGGAATCTATTAAATATAATGGAAAGTTTAAGAACCAATTAGAAATTTTGTAAAATAGATTGACTAATTAGACATAATCGTGCAAGTTTAGGGACCAAACTGTAATTTAACTTTAAATAAATAAAAAAAAAATTCATATCTCCGATCATGTCCCCAATTAAAATTGGAAAAATCCTCGTCTAGTTCGGGGCGATCCTTGCAAGGTCCCCATCTACACAAGAAATATTGTCATCCGTAAGGTAGGAGTTTGATTAGGGCATGACTAGTACTCTTCGTCCAAAAAGTTTAAAAGGTTCCTAACATTACGGTAGGTATATTCATATTATTGAATTTAAAGTAATCACATTGTAGTAGTTTTTATTAACTTTTTTTAAACTAAAATTTTAAGATTAATTGTAACAAATTAATCATATCTGTTCCTTGGGCTTGCATAATCGGGACCAAATGCGGGTTGGACTTTTGTTAACCTAAACCTCCCATTTGGAACTTGGGAACCAAATGGCCCCTTGTAGTTTTGAGGTAGTTTGTTCCATTAATGAAGTTATAATAATAGGGCTTAACAACAATGCTTTTGGAAGTTTGAAGGGTTAATTGCTTGAGGATGCAATTGTAATAACCTTTATTAAGGGCCGTTTGTTTGGTGGGAATTTGAGATTCTCATGGCTTGGGTATCCAGATTCTTATGTTTGTTTGACAAGAATGTTGGATATTCGGGCGTCTTCAGATGTCTGAACATCCTATGATGCCCATATGGGAGTCATCTATGCAACTTTGGATGCCCTCTGGAATCATGAGTTTTTAAGATTTCTATGTTGAGGACATCTTCGAATGCCATTTTTGTCCTTCGTATCTCTTTTATATATATTTTCTTTCATTAAATTTCTTTATTTCTCATTTTTTATTTTTTATTTAAAAATAAATTAATGTATCATTCGCTAATGAACGACTATATGCAATTAGCTATATTTATATATTACATTTACATATAAAATAAACTTCAGGTTAAATTTATAAAATCTGAAATCTTGAGTGTAATCTTCTATTTATTTTACTTTTCCAATAGTCACTCAACATTTTGAATTTGGTATATTTTCTATTATATTCGTTGAATGATTTTTTTTTAGTTATTTGTCTGTAACTTCTCAAAATAAATATTAATTCACAATTCAAAACAATTTTTTTTTATCACAAACAAATATTATTTTTCTAAAAAAAATCTACTAATTTGATAATGTGATAAATATAATCCACTAAAAACAACATCAATTATAATAATTGTCTTCTTATGAAAAAGTTCAAGTTAAAATTGAAATTAATATTTAATATTAAATATTTTATAAAAATCTCAAATTAATATAAATTAACATGTTCTACAAAATAAGATATAAAATTAATATAAACTAAATTGATGTAAAAATAAAATATTAAATTAATAAAGATATATGTTATATAAATTTAAAATATATTAAATTAAAATATAAATTATCTAAAGATTGTTAAGGGCATTATTGGAATAATATATAATTTTAAGACATTCCTATGAACAAATCCTATAAAACAAACATAGCTATCTAAATATTTCTAGATATCTTTATAAATTTTTTTGCAAAATAAATATGGTTATCTAAATATGTCGGGCATCTAAATTTCCAAGCATCTAAATTTCAGCCATTTGAAGATGCCGGACATCTAGATTCCCTGAAAACAAACTGCCCTTATTATTCAAAAGTAGTTCTTTTTTCCATCTATTTTTTTTTATACCAAATGTATAATTTTTAGATAAATATTATAGTATTCAATGATTTAATTGAATCATATTTTGCAGTGTAAATGGGTAGTTAAATATATTGTTGGAAAATTTATTTGATTAAATATGAAAATTTCATAATTAACTTATAGTTCTAAGTTTTTACTTTATCATTTTTATTCTATAAGTACATTCATTTTATCTTTGACATTAATTATTCAGAGACTTTTTCTATAGAAAAACTAACATGACATGTTGATATATTGTATGATTATATATATGAAAGATTAAAGTGGTACCACAGTACATCATTTTCCATATAGACCAATATGATTTTTTTAAAAATATACCATTAAAAGAAAATAAATTAACATACCAAGATCAACCACATTAAAAATAGCCAAAGGAAAATACTTCTCTACTGACCGGTTAGCAAAAATTTCCTACTGACCAGTAGGGGCGTACAAAAAATCTCGAAAACCGATCCGACAGACCGAAGTTTTACTATATGAAAAACGGATTTTTCGGTCGGTCGGAGTCGCCGGAAAAATGTTTTCCGACCGACCGACCGACCGAACATTAATTAAAAAAAAAAAATACTGTATACTTTTCTGCAGACTGAAAACAGTACACTGCTAGCAGATTGAAAACAGCAGACTGCAATGCAGTTTGCAGCCGTTTTTGAAAAGGAAGTTTCATTATTTATTTTTTAAATATTATTTTCTTTCTCTCTCTTCTCTGTTTCACTTTTTTCTCACTTTCCCTTCCTTTCTTTCACTTCTTCTCAGTTTCTCTTCTCACTTCTCCCTCTCTCTTCTCTCAGTTTCTTCTTCTCCCTTTCTCTCTTTTCTCACCTTCCCTCCCCTCCGGCCTGTAACCTCCCTTGAGCACTCCGTCGTGTATCCTTCCGGCCTCCTTTACTTGCCCTCCGTCTACTTCAAAGTTCAAGCGTATTTCTCGTTTTTTGTTTTCCTTCTCCTCCTCACACAGTGAGTTTTTTTCATCTTTCTTCTTCTCCTTCTCATGCAATGAGTTTTTTTCATCTCCCTCTTAGTACTTCGCCGAAAGTTACAGCAGCTCTCACTCTCCCTCCATCTTCATCGGTTTAGGCCAAAACCGACCGACCGTCAGTGTAGTCGGTCTGTCGGTTTTATATAACCTTCCAAAACTGCCACCTTCCGACCTATCAAAACCAATAGGTCGGTGTTCGGTGTCGGATTGTGTCCAAAACCGACACCGACCGACTGATGTTCACCCCTACTGACCAGCATTAAATTTTCGACAAGTATGCAAACAGAGGATAGATGAGAGATTTTTTTCTTTTTTTTTTTTTAAATTACATAGCTGTCAAATTTTTAATGGTGATCAGTAGAAACATCCTGAACTGGTGAGCAGAAAAATAATATTCATAGCCAAATTCATAGGAATAGAATTATAATTAAATAAAAAACAAATCTACCTTTTTTTTAAAAAAGAATTTGGTGTACCTCTCACTTTTAGAAAAATGAAAAATTCACTCCAACTTTTTTTCCCCCTTTTGTGCAGTCCATCATCTTTAAGAAAGAAGAGATACTCTTGAATTTCTGAATTCATATAATTCTACATTATATTGAGAAAGTAACCCAAGAGAATTGCTTTAAAACAAGAACAAAGATCTAAATCAATAATAAAAGATAATTTATATTTTAATAAAAGTATTTGAGCTACAAAACTAAAATTTCCATCACACAAGAAATTAACCCAACATTTATTCAAACATGAAACATCAAAGCCAGTAACTATAGAAGATAAACAACACCATTTACTAATCCACTAATAGATGGTTCAGACTTTAAAGGAAAAAAACACCCTTTTGCTATCATTGAGCTAGCTTTTCTCTTCTATATAATTGAACTTGAACATATAACATCAAACTCTAGAACTTCTCCTAGCAATTTGGAGAGTTGGGAAGCTTGCATTTCTCAGGCAACTTCAAGGCAAGATTAGGGTCAATGCCAAGAGATGGAAGAGTCTTCGAGTTGCGGTAGGCGCAGAAACAGTGCAAGTCGGCGTGCGAGAGCGCCGTGCAGCATTCGTCGGTCGGGGGCGTCGGCTTCGGAGGCGTCACCGATGGCTTGCATGCATACAAACCAGCAGGGGTCATGTTGCATATGGTCTGAGCTTCACCCCAACCAATTGCAACTATCAACACCACTGCCATAGCCACCACCTTCTGAGCTCTCTCCATTGGCTCTTTCTCTCTCTCTCTACTCTCTACTCTCTACTCTCTCTCTTCTGTGATTCTAAGTTCTTTGATTTTGATGGTTTTTATAGAGAATGGAAGGAAGAAGAGGGTGGAAAATGTAGAGAATGTCCGTGA

mRNA sequence

TAAGCTGGTCACCCCGAATCATTGTATTGCATAATTTTTTGAGCACAGAGGAGTGCGACTACCTTAAGGCAATAGCACTTCCTCGCCTTGAAATTTCCACTGTTGTGGATACAAAAACAGGAAAGGGAATTAAGAGTGATTTCAGAACGAGCTCTGGTATGTTTTTAAGTTATCAAGAGAAAAATTATCCAATGACCCAGGCAATCGAAAAAAGAATTTCTGTCTATTCTCAAATACCAATAGAAAATGGAGAGCTCATTCAAGTGTTAAGGTATGAGAAGAATCAATTTTACAAACCTCATCATGACTACTTTTCCGATACTTTTAACTTGCAGCGTGGTGGTCAGCGAATAGCAACCATGCTTATGTATCTAAGTGACAATGTAGAAGGTGGAGAAACCTACTTTCCGAAGGCTGGTTCTGGCGAGTGTAGTTGTGGCGGGAAAACCGTCCCAGGGTTGTCTGTTAAACCAGTCCAAGGAGATGCAGTACTCTTCTGGAGCATGGGATTGGATGGACAGTCGGATCCTAACAGCATTCATGGAGGGTGTGAAGTATTGTCAGGCGAAAAATGGCAACTTCAAGGCAAGATTAGGGTCAATGCCAAGAGATGGAAGAGTCTTCGAGTTGCGGTAGGCGCAGAAACAGTGCAAGTCGGCGTGCGAGAGCGCCGTGCAGCATTCGTCGGTCGGGGGCGTCGGCTTCGGAGGCGTCACCGATGGCTTGCATGCATACAAACCAGCAGGGGTCATGTTGCATATGGTCTGAGCTTCACCCCAACCAATTGCAACTATCAACACCACTGCCATAGCCACCACCTTCTGAGCTCTCTCCATTGGCTCTTTCTCTCTCTCTCTACTCTCTACTCTCTACTCTCTCTCTTCTGTGATTCTAAGTTCTTTGATTTTGATGGTTTTTATAGAGAATGGAAGGAAGAAGAGGGTGGAAAATGTAGAGAATGTCCGTGA

Coding sequence (CDS)

TAAGCTGGTCACCCCGAATCATTGTATTGCATAATTTTTTGAGCACAGAGGAGTGCGACTACCTTAAGGCAATAGCACTTCCTCGCCTTGAAATTTCCACTGTTGTGGATACAAAAACAGGAAAGGGAATTAAGAGTGATTTCAGAACGAGCTCTGGTATGTTTTTAAGTTATCAAGAGAAAAATTATCCAATGACCCAGGCAATCGAAAAAAGAATTTCTGTCTATTCTCAAATACCAATAGAAAATGGAGAGCTCATTCAAGTGTTAAGGTATGAGAAGAATCAATTTTACAAACCTCATCATGACTACTTTTCCGATACTTTTAACTTGCAGCGTGGTGGTCAGCGAATAGCAACCATGCTTATGTATCTAAGTGACAATGTAGAAGGTGGAGAAACCTACTTTCCGAAGGCTGGTTCTGGCGAGTGTAGTTGTGGCGGGAAAACCGTCCCAGGGTTGTCTGTTAAACCAGTCCAAGGAGATGCAGTACTCTTCTGGAGCATGGGATTGGATGGACAGTCGGATCCTAACAGCATTCATGGAGGGTGTGAAGTATTGTCAGGCGAAAAATGGCAACTTCAAGGCAAGATTAGGGTCAATGCCAAGAGATGGAAGAGTCTTCGAGTTGCGGTAGGCGCAGAAACAGTGCAAGTCGGCGTGCGAGAGCGCCGTGCAGCATTCGTCGGTCGGGGGCGTCGGCTTCGGAGGCGTCACCGATGGCTTGCATGCATACAAACCAGCAGGGGTCATGTTGCATATGGTCTGAGCTTCACCCCAACCAATTGCAACTATCAACACCACTGCCATAGCCACCACCTTCTGAGCTCTCTCCATTGGCTCTTTCTCTCTCTCTCTACTCTCTACTCTCTACTCTCTCTCTTCTGTGATTCTAAGTTCTTTGATTTTGATGGTTTTTATAGAGAATGGAAGGAAGAAGAGGGTGGAAAATGTAGAGAATGTCCGTGA

Protein sequence

SWSPRIIVLHNFLSTEECDYLKAIALPRLEISTVVDTKTGKGIKSDFRTSSGMFLSYQEKNYPMTQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLQRGGQRIATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVQGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWQLQGKIRVNAKRWKSLRVAVGAETVQVGVRERRAAFVGRGRRLRRRHRWLACIQTSRGHVAYGLSFTPTNCNYQHHCHSHHLLSSLHWLFLSLSTLYSLLSLFCDSKFFDFDGFYREWKEEEGGKCRECP
Homology
BLAST of Sgr024966 vs. NCBI nr
Match: XP_022137963.1 (prolyl 4-hydroxylase 1 [Momordica charantia])

HSP 1 Score: 389.0 bits (998), Expect = 3.9e-104
Identity = 182/191 (95.29%), Postives = 190/191 (99.48%), Query Frame = 0

Query: 1   SWSPRIIVLHNFLSTEECDYLKAIALPRLEISTVVDTKTGKGIKSDFRTSSGMFLSYQEK 60
           SWSPRIIVLHNFLSTEECDYL+A+ALPRLE+STVVDTKTGKG+KSDFRTSSGMFLS+QEK
Sbjct: 86  SWSPRIIVLHNFLSTEECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLSHQEK 145

Query: 61  NYPMTQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLQRGGQRIATM 120
           NYPM QAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNL+RGGQR+ATM
Sbjct: 146 NYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRVATM 205

Query: 121 LMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVQGDAVLFWSMGLDGQSDPNSIH 180
           LMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPV+GDAVLFWSMGLDGQSDPNSIH
Sbjct: 206 LMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIH 265

Query: 181 GGCEVLSGEKW 192
           GGCEVLSGEKW
Sbjct: 266 GGCEVLSGEKW 276

BLAST of Sgr024966 vs. NCBI nr
Match: XP_038904320.1 (prolyl 4-hydroxylase 1 isoform X1 [Benincasa hispida])

HSP 1 Score: 387.9 bits (995), Expect = 8.6e-104
Identity = 182/191 (95.29%), Postives = 189/191 (98.95%), Query Frame = 0

Query: 1   SWSPRIIVLHNFLSTEECDYLKAIALPRLEISTVVDTKTGKGIKSDFRTSSGMFLSYQEK 60
           SWSPRIIVLHNFLSTEECDYLKAIALPRLEISTVVDTKTGKG+KSDFRTSSGMFLS+QEK
Sbjct: 86  SWSPRIIVLHNFLSTEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEK 145

Query: 61  NYPMTQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLQRGGQRIATM 120
           NYPM QAIEKRISVYSQIP+ENGELIQVLRYEKNQFYKPHHDYFSDTFNL+RGGQRIATM
Sbjct: 146 NYPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATM 205

Query: 121 LMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVQGDAVLFWSMGLDGQSDPNSIH 180
           LMYLS+N+EGGETYFPKAGSGECSCGGKTVPGLSVKP +GDAVLFWSMGLDGQSDPNSIH
Sbjct: 206 LMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPNSIH 265

Query: 181 GGCEVLSGEKW 192
           GGCEVLSGEKW
Sbjct: 266 GGCEVLSGEKW 276

BLAST of Sgr024966 vs. NCBI nr
Match: XP_008453925.1 (PREDICTED: prolyl 4-hydroxylase 1 isoform X3 [Cucumis melo])

HSP 1 Score: 384.0 bits (985), Expect = 1.2e-102
Identity = 179/191 (93.72%), Postives = 187/191 (97.91%), Query Frame = 0

Query: 1   SWSPRIIVLHNFLSTEECDYLKAIALPRLEISTVVDTKTGKGIKSDFRTSSGMFLSYQEK 60
           SWSPRIIVLHNFLSTEECDYLK IALPRLEISTVVDTKTGKG+KSDFRTSSGMFLS+ EK
Sbjct: 86  SWSPRIIVLHNFLSTEECDYLKGIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEK 145

Query: 61  NYPMTQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLQRGGQRIATM 120
           NYPM QAIEKRISVYSQIP+ENGELIQVLRYEKNQFYKPHHDYFSDTFNL+RGGQRIATM
Sbjct: 146 NYPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATM 205

Query: 121 LMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVQGDAVLFWSMGLDGQSDPNSIH 180
           LMYLS+N+EGGETYFPKAGSGECSCGGKTVPGLSVKP +GDA+LFWSMGLDGQSDPNSIH
Sbjct: 206 LMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAILFWSMGLDGQSDPNSIH 265

Query: 181 GGCEVLSGEKW 192
           GGCEVLSGEKW
Sbjct: 266 GGCEVLSGEKW 276

BLAST of Sgr024966 vs. NCBI nr
Match: XP_016901568.1 (PREDICTED: prolyl 4-hydroxylase 1 isoform X2 [Cucumis melo])

HSP 1 Score: 384.0 bits (985), Expect = 1.2e-102
Identity = 179/191 (93.72%), Postives = 187/191 (97.91%), Query Frame = 0

Query: 1   SWSPRIIVLHNFLSTEECDYLKAIALPRLEISTVVDTKTGKGIKSDFRTSSGMFLSYQEK 60
           SWSPRIIVLHNFLSTEECDYLK IALPRLEISTVVDTKTGKG+KSDFRTSSGMFLS+ EK
Sbjct: 89  SWSPRIIVLHNFLSTEECDYLKGIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEK 148

Query: 61  NYPMTQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLQRGGQRIATM 120
           NYPM QAIEKRISVYSQIP+ENGELIQVLRYEKNQFYKPHHDYFSDTFNL+RGGQRIATM
Sbjct: 149 NYPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATM 208

Query: 121 LMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVQGDAVLFWSMGLDGQSDPNSIH 180
           LMYLS+N+EGGETYFPKAGSGECSCGGKTVPGLSVKP +GDA+LFWSMGLDGQSDPNSIH
Sbjct: 209 LMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAILFWSMGLDGQSDPNSIH 268

Query: 181 GGCEVLSGEKW 192
           GGCEVLSGEKW
Sbjct: 269 GGCEVLSGEKW 279

BLAST of Sgr024966 vs. NCBI nr
Match: XP_016901567.1 (PREDICTED: prolyl 4-hydroxylase 1 isoform X1 [Cucumis melo])

HSP 1 Score: 384.0 bits (985), Expect = 1.2e-102
Identity = 179/191 (93.72%), Postives = 187/191 (97.91%), Query Frame = 0

Query: 1   SWSPRIIVLHNFLSTEECDYLKAIALPRLEISTVVDTKTGKGIKSDFRTSSGMFLSYQEK 60
           SWSPRIIVLHNFLSTEECDYLK IALPRLEISTVVDTKTGKG+KSDFRTSSGMFLS+ EK
Sbjct: 105 SWSPRIIVLHNFLSTEECDYLKGIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEK 164

Query: 61  NYPMTQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLQRGGQRIATM 120
           NYPM QAIEKRISVYSQIP+ENGELIQVLRYEKNQFYKPHHDYFSDTFNL+RGGQRIATM
Sbjct: 165 NYPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATM 224

Query: 121 LMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVQGDAVLFWSMGLDGQSDPNSIH 180
           LMYLS+N+EGGETYFPKAGSGECSCGGKTVPGLSVKP +GDA+LFWSMGLDGQSDPNSIH
Sbjct: 225 LMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAILFWSMGLDGQSDPNSIH 284

Query: 181 GGCEVLSGEKW 192
           GGCEVLSGEKW
Sbjct: 285 GGCEVLSGEKW 295

BLAST of Sgr024966 vs. ExPASy Swiss-Prot
Match: Q9ZW86 (Prolyl 4-hydroxylase 1 OS=Arabidopsis thaliana OX=3702 GN=P4H1 PE=1 SV=1)

HSP 1 Score: 337.0 bits (863), Expect = 2.3e-91
Identity = 155/201 (77.11%), Postives = 179/201 (89.05%), Query Frame = 0

Query: 1   SWSPRIIVLHNFLSTEECDYLKAIALPRLEISTVVDTKTGKGIKSDFRTSSGMFLSYQEK 60
           SWSPRIIVLH+FLS EEC+YLKAIA PRL++STVVD KTGKG+KSD RTSSGMFL++ E+
Sbjct: 81  SWSPRIIVLHDFLSPEECEYLKAIARPRLQVSTVVDVKTGKGVKSDVRTSSGMFLTHVER 140

Query: 61  NYPMTQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLQRGGQRIATM 120
           +YP+ QAIEKRI+V+SQ+P ENGELIQVLRYE  QFYKPHHDYF+DTFNL+RGGQR+ATM
Sbjct: 141 SYPIIQAIEKRIAVFSQVPAENGELIQVLRYEPQQFYKPHHDYFADTFNLKRGGQRVATM 200

Query: 121 LMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVQGDAVLFWSMGLDGQSDPNSIH 180
           LMYL+D+VEGGETYFP AG G+C+CGGK + G+SVKP +GDAVLFWSMGLDGQSDP SIH
Sbjct: 201 LMYLTDDVEGGETYFPLAGDGDCTCGGKIMKGISVKPTKGDAVLFWSMGLDGQSDPRSIH 260

Query: 181 GGCEVLSGEKWQLQGKIRVNA 202
           GGCEVLSGEKW     +R  A
Sbjct: 261 GGCEVLSGEKWSATKWMRQKA 281

BLAST of Sgr024966 vs. ExPASy Swiss-Prot
Match: F4JZ24 (Probable prolyl 4-hydroxylase 10 OS=Arabidopsis thaliana OX=3702 GN=P4H10 PE=3 SV=1)

HSP 1 Score: 205.7 bits (522), Expect = 7.9e-52
Identity = 111/211 (52.61%), Postives = 132/211 (62.56%), Query Frame = 0

Query: 1   SWSPRIIVLHNFLSTEECDYLKAIALPRLEISTVVDTKTGKGIKSDFRTSSGMFLSYQEK 60
           SW PR  V HNFL+ EEC YL  +A P +E STVVD KTGK   S  RTSSG FL+    
Sbjct: 82  SWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSRVRTSSGTFLARGRD 141

Query: 61  NYPMTQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLQRGGQRIATM 120
                + IEKRIS ++ IP+E+GE +QVL YE  Q Y+PH+DYF D +N + GGQRIAT+
Sbjct: 142 K--TIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMDEYNTRNGGQRIATV 201

Query: 121 LMYLSDNVEGGETYFPKAGS-----------GECSCGGKTVPGLSVKPVQGDAVLFWSMG 180
           LMYLSD  EGGET FP A              EC  G     GLSVKP  GDA+LFWSM 
Sbjct: 202 LMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKG-----GLSVKPKMGDALLFWSMT 261

Query: 181 LDGQSDPNSIHGGCEVLSGEKWQLQGKIRVN 201
            D   DP+S+HGGC V+ G KW     +RV+
Sbjct: 262 PDATLDPSSLHGGCAVIKGNKWSSTKWLRVH 285

BLAST of Sgr024966 vs. ExPASy Swiss-Prot
Match: Q9LN20 (Probable prolyl 4-hydroxylase 3 OS=Arabidopsis thaliana OX=3702 GN=P4H3 PE=2 SV=1)

HSP 1 Score: 204.9 bits (520), Expect = 1.4e-51
Identity = 109/200 (54.50%), Postives = 130/200 (65.00%), Query Frame = 0

Query: 1   SWSPRIIVLHNFLSTEECDYLKAIALPRLEISTVVDTKTGKGIKSDFRTSSGMFLSYQEK 60
           SW PR  V HNFLS EEC+YL ++A P +  STVVD++TGK   S  RTSSG FL  +  
Sbjct: 80  SWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETGKSKDSRVRTSSGTFL--RRG 139

Query: 61  NYPMTQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLQRGGQRIATM 120
              + + IEKRI+ Y+ IP ++GE +QVL YE  Q Y+PH+DYF D FN + GGQR+ATM
Sbjct: 140 RDKIIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQRMATM 199

Query: 121 LMYLSDNVEGGETYFPKAGSGECS---------CGGKTVPGLSVKPVQGDAVLFWSMGLD 180
           LMYLSD  EGGET FP A     S         CG K   GLSVKP  GDA+LFWSM  D
Sbjct: 200 LMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKK---GLSVKPRMGDALLFWSMRPD 259

Query: 181 GQSDPNSIHGGCEVLSGEKW 192
              DP S+HGGC V+ G KW
Sbjct: 260 ATLDPTSLHGGCPVIRGNKW 274

BLAST of Sgr024966 vs. ExPASy Swiss-Prot
Match: F4JNU8 (Probable prolyl 4-hydroxylase 8 OS=Arabidopsis thaliana OX=3702 GN=P4H8 PE=3 SV=1)

HSP 1 Score: 201.1 bits (510), Expect = 2.0e-50
Identity = 108/199 (54.27%), Postives = 131/199 (65.83%), Query Frame = 0

Query: 1   SWSPRIIVLHNFLSTEECDYLKAIALPRLEISTVVDTKTGKGIKSDFRTSSGMFLSYQEK 60
           SW PR  V HNFL+ EEC++L ++A P +  S VVD KTGK I S  RTSSG FL+    
Sbjct: 84  SWEPRAFVYHNFLTNEECEHLISLAKPSMMKSKVVDVKTGKSIDSRVRTSSGTFLN--RG 143

Query: 61  NYPMTQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLQRGGQRIATM 120
           +  + + IE RIS ++ IP ENGE +QVL YE  Q Y+PHHDYF D FN+++GGQRIAT+
Sbjct: 144 HDEIVEEIENRISDFTFIPPENGEGLQVLHYEVGQRYEPHHDYFFDEFNVRKGGQRIATV 203

Query: 121 LMYLSDNVEGGETYFPKAGSG--------ECSCGGKTVPGLSVKPVQGDAVLFWSMGLDG 180
           LMYLSD  EGGET FP A           E S  GK   GLSV P + DA+LFWSM  D 
Sbjct: 204 LMYLSDVDEGGETVFPAAKGNVSDVPWWDELSQCGK--EGLSVLPKKRDALLFWSMKPDA 263

Query: 181 QSDPNSIHGGCEVLSGEKW 192
             DP+S+HGGC V+ G KW
Sbjct: 264 SLDPSSLHGGCPVIKGNKW 278

BLAST of Sgr024966 vs. ExPASy Swiss-Prot
Match: Q24JN5 (Prolyl 4-hydroxylase 5 OS=Arabidopsis thaliana OX=3702 GN=P4H5 PE=2 SV=1)

HSP 1 Score: 195.7 bits (496), Expect = 8.2e-49
Identity = 104/199 (52.26%), Postives = 131/199 (65.83%), Query Frame = 0

Query: 1   SWSPRIIVLHNFLSTEECDYLKAIALPRLEISTVVDTKTGKGIKSDFRTSSGMFLSYQEK 60
           SW PR +V HNFL+ EEC++L ++A P +  STVVD KTG    S  RTSSG FL  +  
Sbjct: 84  SWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSRVRTSSGTFL--RRG 143

Query: 61  NYPMTQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLQRGGQRIATM 120
           +  + + IEKRIS ++ IP+ENGE +QVL Y+  Q Y+PH+DYF D FN + GGQRIAT+
Sbjct: 144 HDEVVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLDEFNTKNGGQRIATV 203

Query: 121 LMYLSDNVEGGETYFPKAGS--------GECSCGGKTVPGLSVKPVQGDAVLFWSMGLDG 180
           LMYLSD  +GGET FP A           E S  GK   GLSV P + DA+LFW+M  D 
Sbjct: 204 LMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGK--EGLSVLPKKRDALLFWNMRPDA 263

Query: 181 QSDPNSIHGGCEVLSGEKW 192
             DP+S+HGGC V+ G KW
Sbjct: 264 SLDPSSLHGGCPVVKGNKW 278

BLAST of Sgr024966 vs. ExPASy TrEMBL
Match: A0A6J1CBS4 (prolyl 4-hydroxylase 1 OS=Momordica charantia OX=3673 GN=LOC111009248 PE=4 SV=1)

HSP 1 Score: 389.0 bits (998), Expect = 1.9e-104
Identity = 182/191 (95.29%), Postives = 190/191 (99.48%), Query Frame = 0

Query: 1   SWSPRIIVLHNFLSTEECDYLKAIALPRLEISTVVDTKTGKGIKSDFRTSSGMFLSYQEK 60
           SWSPRIIVLHNFLSTEECDYL+A+ALPRLE+STVVDTKTGKG+KSDFRTSSGMFLS+QEK
Sbjct: 86  SWSPRIIVLHNFLSTEECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLSHQEK 145

Query: 61  NYPMTQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLQRGGQRIATM 120
           NYPM QAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNL+RGGQR+ATM
Sbjct: 146 NYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRVATM 205

Query: 121 LMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVQGDAVLFWSMGLDGQSDPNSIH 180
           LMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPV+GDAVLFWSMGLDGQSDPNSIH
Sbjct: 206 LMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIH 265

Query: 181 GGCEVLSGEKW 192
           GGCEVLSGEKW
Sbjct: 266 GGCEVLSGEKW 276

BLAST of Sgr024966 vs. ExPASy TrEMBL
Match: A0A6J1F2K0 (prolyl 4-hydroxylase 1-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111439093 PE=4 SV=1)

HSP 1 Score: 384.0 bits (985), Expect = 6.0e-103
Identity = 182/191 (95.29%), Postives = 188/191 (98.43%), Query Frame = 0

Query: 1   SWSPRIIVLHNFLSTEECDYLKAIALPRLEISTVVDTKTGKGIKSDFRTSSGMFLSYQEK 60
           SWSPRIIVLHNFLS+EECDYLKAIALPRLEISTVVDTKTGKGIKSDFRTSSGMFLS+QE+
Sbjct: 86  SWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGIKSDFRTSSGMFLSHQER 145

Query: 61  NYPMTQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLQRGGQRIATM 120
           NYPM QAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNL+RGGQRIATM
Sbjct: 146 NYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATM 205

Query: 121 LMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVQGDAVLFWSMGLDGQSDPNSIH 180
           LMYL+DNVEGGETYFPKAGSG CSCGGKTVPGLSVKPV+GDAVLFWSMGLDGQSDPNSIH
Sbjct: 206 LMYLTDNVEGGETYFPKAGSGMCSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIH 265

Query: 181 GGCEVLSGEKW 192
           GGCEVL GEKW
Sbjct: 266 GGCEVLGGEKW 276

BLAST of Sgr024966 vs. ExPASy TrEMBL
Match: A0A1S4E011 (prolyl 4-hydroxylase 1 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103494504 PE=4 SV=1)

HSP 1 Score: 384.0 bits (985), Expect = 6.0e-103
Identity = 179/191 (93.72%), Postives = 187/191 (97.91%), Query Frame = 0

Query: 1   SWSPRIIVLHNFLSTEECDYLKAIALPRLEISTVVDTKTGKGIKSDFRTSSGMFLSYQEK 60
           SWSPRIIVLHNFLSTEECDYLK IALPRLEISTVVDTKTGKG+KSDFRTSSGMFLS+ EK
Sbjct: 89  SWSPRIIVLHNFLSTEECDYLKGIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEK 148

Query: 61  NYPMTQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLQRGGQRIATM 120
           NYPM QAIEKRISVYSQIP+ENGELIQVLRYEKNQFYKPHHDYFSDTFNL+RGGQRIATM
Sbjct: 149 NYPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATM 208

Query: 121 LMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVQGDAVLFWSMGLDGQSDPNSIH 180
           LMYLS+N+EGGETYFPKAGSGECSCGGKTVPGLSVKP +GDA+LFWSMGLDGQSDPNSIH
Sbjct: 209 LMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAILFWSMGLDGQSDPNSIH 268

Query: 181 GGCEVLSGEKW 192
           GGCEVLSGEKW
Sbjct: 269 GGCEVLSGEKW 279

BLAST of Sgr024966 vs. ExPASy TrEMBL
Match: A0A1S3BY76 (prolyl 4-hydroxylase 1 isoform X6 OS=Cucumis melo OX=3656 GN=LOC103494504 PE=4 SV=1)

HSP 1 Score: 384.0 bits (985), Expect = 6.0e-103
Identity = 179/191 (93.72%), Postives = 187/191 (97.91%), Query Frame = 0

Query: 1   SWSPRIIVLHNFLSTEECDYLKAIALPRLEISTVVDTKTGKGIKSDFRTSSGMFLSYQEK 60
           SWSPRIIVLHNFLSTEECDYLK IALPRLEISTVVDTKTGKG+KSDFRTSSGMFLS+ EK
Sbjct: 70  SWSPRIIVLHNFLSTEECDYLKGIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEK 129

Query: 61  NYPMTQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLQRGGQRIATM 120
           NYPM QAIEKRISVYSQIP+ENGELIQVLRYEKNQFYKPHHDYFSDTFNL+RGGQRIATM
Sbjct: 130 NYPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATM 189

Query: 121 LMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVQGDAVLFWSMGLDGQSDPNSIH 180
           LMYLS+N+EGGETYFPKAGSGECSCGGKTVPGLSVKP +GDA+LFWSMGLDGQSDPNSIH
Sbjct: 190 LMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAILFWSMGLDGQSDPNSIH 249

Query: 181 GGCEVLSGEKW 192
           GGCEVLSGEKW
Sbjct: 250 GGCEVLSGEKW 260

BLAST of Sgr024966 vs. ExPASy TrEMBL
Match: A0A1S3BXE6 (prolyl 4-hydroxylase 1 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103494504 PE=4 SV=1)

HSP 1 Score: 384.0 bits (985), Expect = 6.0e-103
Identity = 179/191 (93.72%), Postives = 187/191 (97.91%), Query Frame = 0

Query: 1   SWSPRIIVLHNFLSTEECDYLKAIALPRLEISTVVDTKTGKGIKSDFRTSSGMFLSYQEK 60
           SWSPRIIVLHNFLSTEECDYLK IALPRLEISTVVDTKTGKG+KSDFRTSSGMFLS+ EK
Sbjct: 86  SWSPRIIVLHNFLSTEECDYLKGIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEK 145

Query: 61  NYPMTQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLQRGGQRIATM 120
           NYPM QAIEKRISVYSQIP+ENGELIQVLRYEKNQFYKPHHDYFSDTFNL+RGGQRIATM
Sbjct: 146 NYPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATM 205

Query: 121 LMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVQGDAVLFWSMGLDGQSDPNSIH 180
           LMYLS+N+EGGETYFPKAGSGECSCGGKTVPGLSVKP +GDA+LFWSMGLDGQSDPNSIH
Sbjct: 206 LMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAILFWSMGLDGQSDPNSIH 265

Query: 181 GGCEVLSGEKW 192
           GGCEVLSGEKW
Sbjct: 266 GGCEVLSGEKW 276

BLAST of Sgr024966 vs. TAIR 10
Match: AT2G43080.1 (P4H isoform 1 )

HSP 1 Score: 337.0 bits (863), Expect = 1.6e-92
Identity = 155/201 (77.11%), Postives = 179/201 (89.05%), Query Frame = 0

Query: 1   SWSPRIIVLHNFLSTEECDYLKAIALPRLEISTVVDTKTGKGIKSDFRTSSGMFLSYQEK 60
           SWSPRIIVLH+FLS EEC+YLKAIA PRL++STVVD KTGKG+KSD RTSSGMFL++ E+
Sbjct: 81  SWSPRIIVLHDFLSPEECEYLKAIARPRLQVSTVVDVKTGKGVKSDVRTSSGMFLTHVER 140

Query: 61  NYPMTQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLQRGGQRIATM 120
           +YP+ QAIEKRI+V+SQ+P ENGELIQVLRYE  QFYKPHHDYF+DTFNL+RGGQR+ATM
Sbjct: 141 SYPIIQAIEKRIAVFSQVPAENGELIQVLRYEPQQFYKPHHDYFADTFNLKRGGQRVATM 200

Query: 121 LMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVQGDAVLFWSMGLDGQSDPNSIH 180
           LMYL+D+VEGGETYFP AG G+C+CGGK + G+SVKP +GDAVLFWSMGLDGQSDP SIH
Sbjct: 201 LMYLTDDVEGGETYFPLAGDGDCTCGGKIMKGISVKPTKGDAVLFWSMGLDGQSDPRSIH 260

Query: 181 GGCEVLSGEKWQLQGKIRVNA 202
           GGCEVLSGEKW     +R  A
Sbjct: 261 GGCEVLSGEKWSATKWMRQKA 281

BLAST of Sgr024966 vs. TAIR 10
Match: AT5G66060.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 205.7 bits (522), Expect = 5.6e-53
Identity = 111/211 (52.61%), Postives = 132/211 (62.56%), Query Frame = 0

Query: 1   SWSPRIIVLHNFLSTEECDYLKAIALPRLEISTVVDTKTGKGIKSDFRTSSGMFLSYQEK 60
           SW PR  V HNFL+ EEC YL  +A P +E STVVD KTGK   S  RTSSG FL+    
Sbjct: 82  SWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSRVRTSSGTFLARGRD 141

Query: 61  NYPMTQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLQRGGQRIATM 120
                + IEKRIS ++ IP+E+GE +QVL YE  Q Y+PH+DYF D +N + GGQRIAT+
Sbjct: 142 K--TIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMDEYNTRNGGQRIATV 201

Query: 121 LMYLSDNVEGGETYFPKAGS-----------GECSCGGKTVPGLSVKPVQGDAVLFWSMG 180
           LMYLSD  EGGET FP A              EC  G     GLSVKP  GDA+LFWSM 
Sbjct: 202 LMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKG-----GLSVKPKMGDALLFWSMT 261

Query: 181 LDGQSDPNSIHGGCEVLSGEKWQLQGKIRVN 201
            D   DP+S+HGGC V+ G KW     +RV+
Sbjct: 262 PDATLDPSSLHGGCAVIKGNKWSSTKWLRVH 285

BLAST of Sgr024966 vs. TAIR 10
Match: AT1G20270.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 204.9 bits (520), Expect = 9.6e-53
Identity = 109/200 (54.50%), Postives = 130/200 (65.00%), Query Frame = 0

Query: 1   SWSPRIIVLHNFLSTEECDYLKAIALPRLEISTVVDTKTGKGIKSDFRTSSGMFLSYQEK 60
           SW PR  V HNFLS EEC+YL ++A P +  STVVD++TGK   S  RTSSG FL  +  
Sbjct: 80  SWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETGKSKDSRVRTSSGTFL--RRG 139

Query: 61  NYPMTQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLQRGGQRIATM 120
              + + IEKRI+ Y+ IP ++GE +QVL YE  Q Y+PH+DYF D FN + GGQR+ATM
Sbjct: 140 RDKIIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQRMATM 199

Query: 121 LMYLSDNVEGGETYFPKAGSGECS---------CGGKTVPGLSVKPVQGDAVLFWSMGLD 180
           LMYLSD  EGGET FP A     S         CG K   GLSVKP  GDA+LFWSM  D
Sbjct: 200 LMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKK---GLSVKPRMGDALLFWSMRPD 259

Query: 181 GQSDPNSIHGGCEVLSGEKW 192
              DP S+HGGC V+ G KW
Sbjct: 260 ATLDPTSLHGGCPVIRGNKW 274

BLAST of Sgr024966 vs. TAIR 10
Match: AT4G35810.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 201.1 bits (510), Expect = 1.4e-51
Identity = 108/199 (54.27%), Postives = 131/199 (65.83%), Query Frame = 0

Query: 1   SWSPRIIVLHNFLSTEECDYLKAIALPRLEISTVVDTKTGKGIKSDFRTSSGMFLSYQEK 60
           SW PR  V HNFL+ EEC++L ++A P +  S VVD KTGK I S  RTSSG FL+    
Sbjct: 84  SWEPRAFVYHNFLTNEECEHLISLAKPSMMKSKVVDVKTGKSIDSRVRTSSGTFLN--RG 143

Query: 61  NYPMTQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLQRGGQRIATM 120
           +  + + IE RIS ++ IP ENGE +QVL YE  Q Y+PHHDYF D FN+++GGQRIAT+
Sbjct: 144 HDEIVEEIENRISDFTFIPPENGEGLQVLHYEVGQRYEPHHDYFFDEFNVRKGGQRIATV 203

Query: 121 LMYLSDNVEGGETYFPKAGSG--------ECSCGGKTVPGLSVKPVQGDAVLFWSMGLDG 180
           LMYLSD  EGGET FP A           E S  GK   GLSV P + DA+LFWSM  D 
Sbjct: 204 LMYLSDVDEGGETVFPAAKGNVSDVPWWDELSQCGK--EGLSVLPKKRDALLFWSMKPDA 263

Query: 181 QSDPNSIHGGCEVLSGEKW 192
             DP+S+HGGC V+ G KW
Sbjct: 264 SLDPSSLHGGCPVIKGNKW 278

BLAST of Sgr024966 vs. TAIR 10
Match: AT2G17720.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 195.7 bits (496), Expect = 5.8e-50
Identity = 104/199 (52.26%), Postives = 131/199 (65.83%), Query Frame = 0

Query: 1   SWSPRIIVLHNFLSTEECDYLKAIALPRLEISTVVDTKTGKGIKSDFRTSSGMFLSYQEK 60
           SW PR +V HNFL+ EEC++L ++A P +  STVVD KTG    S  RTSSG FL  +  
Sbjct: 84  SWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSRVRTSSGTFL--RRG 143

Query: 61  NYPMTQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLQRGGQRIATM 120
           +  + + IEKRIS ++ IP+ENGE +QVL Y+  Q Y+PH+DYF D FN + GGQRIAT+
Sbjct: 144 HDEVVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLDEFNTKNGGQRIATV 203

Query: 121 LMYLSDNVEGGETYFPKAGS--------GECSCGGKTVPGLSVKPVQGDAVLFWSMGLDG 180
           LMYLSD  +GGET FP A           E S  GK   GLSV P + DA+LFW+M  D 
Sbjct: 204 LMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGK--EGLSVLPKKRDALLFWNMRPDA 263

Query: 181 QSDPNSIHGGCEVLSGEKW 192
             DP+S+HGGC V+ G KW
Sbjct: 264 SLDPSSLHGGCPVVKGNKW 278

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022137963.13.9e-10495.29prolyl 4-hydroxylase 1 [Momordica charantia][more]
XP_038904320.18.6e-10495.29prolyl 4-hydroxylase 1 isoform X1 [Benincasa hispida][more]
XP_008453925.11.2e-10293.72PREDICTED: prolyl 4-hydroxylase 1 isoform X3 [Cucumis melo][more]
XP_016901568.11.2e-10293.72PREDICTED: prolyl 4-hydroxylase 1 isoform X2 [Cucumis melo][more]
XP_016901567.11.2e-10293.72PREDICTED: prolyl 4-hydroxylase 1 isoform X1 [Cucumis melo][more]
Match NameE-valueIdentityDescription
Q9ZW862.3e-9177.11Prolyl 4-hydroxylase 1 OS=Arabidopsis thaliana OX=3702 GN=P4H1 PE=1 SV=1[more]
F4JZ247.9e-5252.61Probable prolyl 4-hydroxylase 10 OS=Arabidopsis thaliana OX=3702 GN=P4H10 PE=3 S... [more]
Q9LN201.4e-5154.50Probable prolyl 4-hydroxylase 3 OS=Arabidopsis thaliana OX=3702 GN=P4H3 PE=2 SV=... [more]
F4JNU82.0e-5054.27Probable prolyl 4-hydroxylase 8 OS=Arabidopsis thaliana OX=3702 GN=P4H8 PE=3 SV=... [more]
Q24JN58.2e-4952.26Prolyl 4-hydroxylase 5 OS=Arabidopsis thaliana OX=3702 GN=P4H5 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A6J1CBS41.9e-10495.29prolyl 4-hydroxylase 1 OS=Momordica charantia OX=3673 GN=LOC111009248 PE=4 SV=1[more]
A0A6J1F2K06.0e-10395.29prolyl 4-hydroxylase 1-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC11143... [more]
A0A1S4E0116.0e-10393.72prolyl 4-hydroxylase 1 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103494504 PE=4 S... [more]
A0A1S3BY766.0e-10393.72prolyl 4-hydroxylase 1 isoform X6 OS=Cucumis melo OX=3656 GN=LOC103494504 PE=4 S... [more]
A0A1S3BXE66.0e-10393.72prolyl 4-hydroxylase 1 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103494504 PE=4 S... [more]
Match NameE-valueIdentityDescription
AT2G43080.11.6e-9277.11P4H isoform 1 [more]
AT5G66060.15.6e-5352.612-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
AT1G20270.19.6e-5354.502-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
AT4G35810.11.4e-5154.272-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
AT2G17720.15.8e-5052.262-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR006620Prolyl 4-hydroxylase, alpha subunitSMARTSM00702p4hccoord: 4..198
e-value: 4.9E-55
score: 198.8
IPR044862Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domainPFAMPF136402OG-FeII_Oxy_3coord: 86..195
e-value: 4.9E-17
score: 62.6
NoneNo IPR availableGENE3D2.60.120.620q2cbj1_9rhob like domaincoord: 1..198
e-value: 1.6E-63
score: 216.2
NoneNo IPR availablePANTHERPTHR10869:SF179BNAA04G24820D PROTEINcoord: 1..199
IPR045054Prolyl 4-hydroxylasePANTHERPTHR10869PROLYL 4-HYDROXYLASE ALPHA SUBUNITcoord: 1..199
IPR005123Oxoglutarate/iron-dependent dioxygenasePROSITEPS51471FE2OG_OXYcoord: 82..199
score: 10.402064

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Sgr024966.1Sgr024966.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0018401 peptidyl-proline hydroxylation to 4-hydroxy-L-proline
cellular_component GO:0005783 endoplasmic reticulum
cellular_component GO:0000137 Golgi cis cisterna
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0005506 iron ion binding
molecular_function GO:0031418 L-ascorbic acid binding
molecular_function GO:0004656 procollagen-proline 4-dioxygenase activity
molecular_function GO:0016705 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen