Moc03g02390 (gene) Bitter gourd (OHB3-1) v2

Overview
Name: Moc03g02390
Type: gene
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Ribonuclease H
Location: chr3: 1789070 .. 1810031 (-)
RNA-Seq Expression: Moc03g02390
Synteny: Moc03g02390
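
The Location field above can be read programmatically. The following is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(text):
    """Split a string like 'chr3: 1789070 .. 1810031 (-)' into its parts.

    Returns (seqid, start, end, strand). The layout of the string is
    taken from the Location field; the function name is hypothetical.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the genomic span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("chr3: 1789070 .. 1810031 (-)")
print(seqid, strand, span_length(start, end))
```

Under the stated coordinate assumption, the span covers 20962 bases on the minus strand of chr3.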
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCATCCTCCTCGCGTTGGAAAAAAGGCAAAGCGACTACGGATTCTGCTACATCTAGCAACCCCGTCCACAAGCCGTAAGAAACCCCGCCTTATCCACCTGGGTACGGTACACCTCTTAGGACAATGGCAGAGGGTTTCATGCCATAATATACAACGTACAACCCTTTGTATGACGTCCCGGTTGGTCAGTATTCACATCCCTTTGTCAAGGGAGCTCAGCAAATCCGAACCAACATCATTTTTTCGAAAGCCCGAACAAATGATGTCACCCCCTACCGTCCTTAATCTAGATGGTCTCCTAGCCAAGGCAGACCCAGTCAGACAAAGTGCCCCCAGTAACGAAAAGTTCGAAGTTCTGGAGGAAAGATTAAGAGCAGTAGAGGGGACAGACGTTTTTGGCAACATCGATGCCTCATAATTGTGCTTGGTGTCTGGATTAGTCATCCCTCCAAAATTCAAGGTGCCAGAGTTTGAGGAGTACGATGGTTCTTCTTGTCCTAAGAATCATTTTATAATGTACTGCAAGAAGATGGCAGCGTACGTCCAAAATGACAAATTGTTGATACACTACTTTTAAGATAGTTTATCTGGTCCGGCCTCTCGTTGTTACATGCAGTTAGATAGCTCTCATGTTGGCTCGTGGAAGAATCTGATCGACTCCTTCCTAAAACAATATAAGCATAACATAGAGATGGCTCCAGATCGCTTAGATTTACAGAGGATGGAGAAGAAGAGTACAAAGAGCTTTAAAGAGTATGCTCAAAGGTGGAGGGACACGGCAACTCAAGTCCAACCTCCTTTAATGGATAAGGAGCTATCTGCCATGTTCATCAACACCCTAAAACATCCTTTCTATGATCGGATGATAGGAAGCGCTTCCACAAATTTCTCTGATATTATGACAATCGGAGAAAGGATCGAATACGGGGTTAAACACGGGTAGATAACCAGTATTGCCGAGGAGTCATTAGCCGCAAAGAAGGCAAGTCATTCCAAGAAGAAAGAAGGTGAGGTGCAAATGGTGGGAGCAGACCGATACTTTTGGAAACAACAACCGTACGGTCGGACACCGCGATACACTCCGTATTATTACCCAACGCCATACGGGTATAATCAACCATTTGTGAATAATGCAACTTCACATTACTCCCCTTATGCCTCTCAAAATTTTTGACCCCCGGCCAGTCAAAATTTCCAACCTAAGGGTCAGCAGCATAATACATTTTATACTCAAGGGCAACAGACTAATAGAGGGGCACGTAGACAGACCCAGTTTGACCCAATCCCCATGACTTATACTGAGCTTCTGCTTTAGTTATTTCAGAATAATCAGCTGGCACCTGTACCTGTAGATCCGATCCAACCTCCATACCCAAGATGGTATGATGCAAATGCTCATTGTGACTATCACGCGGGAGCCATAGGGCATTCAATAGAGAACTGCACTACATTGTAATACAGAGTTCAAGCTTTGATCAAGACAAGTTGGTTGAACTTTAAAAAGGAAAATGGGCCTGATGTAAGTAATAATCCGCTGCCGAATCATCAGAATGTCCAAATAAATGCAATCGAATTCCAAGGGTTCGAGTCGAAGAGTTAGGTTGCCGATATTTACAACCCCTATTGAGGAACTATTTGAAATTCTCTTGGGCGGTGGATATATATCAGTGGAGTACCTATGCCAAAACCTCTAGTATAAAGGGTATGATGAAAGTCTGACGTGCCCGTTCCACGCTGGGGCGAAAGGACACTCTCTAGAGCAATGCAACCATTTTCAAAAGAGAGTTCAAGAGTTGTTAGACTCAAAAGTCCTTACTGTTACAAAATCTCACCTGAAGAAAAGGATAAATGTCGTGGAGGATATCTTGGTTGCTGAGGGCTCAAGTGATTCTCTTAAGCCAAAACCTCTCACCATCTTTTATAGTGAAAAGCCAGATGCAGTCGGAAACCAATCACTATCACTGGTCCTAGCTCCTTTTGAGTATAAAAGTTCCAGA
GTAGTGCCTTGGAGATATGAGTGCAAGGTAACTGTAGGGCAAGAGGTATCATCTCTTCCACTCCTAGTTGACAATATCATCAGAGTAGGAGGCTTGACATGAACTGGGAGATGTTATACACCGGATAGCTTGCTAAAACGCGTGAATGAGCCTGCTAGTGAAAAGAATAAAGAGAAAGCAAGTGAGAAGAAGAAGGGGAAGGTAGAGGAAGATAAGAAAGGGAATGCCAAACTCAATGAGTATATTTATGATGAACTGGTGGAGGCAATTGTTGTAAAGGATGCAAGTCGTAAACAACCCATGTCCAAGGAAGAGACTCAAGAGTTTCTAAAGCTAGTGAAGCAAAGTGAATACAAAGTTATTGAACAATTAGGTCGGACACCTGCAAAAATCTCTATATTATCTTTACTGTTATCCTCTGAAGCGCATCGGAATACACTGTTGGAGGTCTTGAAGTAGACTTTCGTTTCACAAGACATCACAGTGGATAATATGAGTATCATTGTAGGGAATATAACGACCTCTAGCTCCATCACTTTTACAGATGAGGAGATACCACCAGAGGGTACAGGACATACCAAAGCTCTCCACATCTCAGTAAAGTGTAGAAACTTCATAATAGAAAAAGTCATTATGGACAATTGATCTTCTTTAAACATAATGCCAAGATCCACGCTAGAGAAGTTACCGGTTGATATGTCCCATACGAGACCTAGTACTATGATAGTAAGAGCTTTCGATGGAAGTCGTAGCGTTGTTGTTTGGGATATTGAGATCCCGATTAAGATAGGTCTTTGCACCTTTGACATAACATTTCAGGTTATGGACATTACATCAGTTTATAGTTTTTTGTTGGGGTGACCTTGGATACATTCGGTAAGGGCAGTTCGTCTACTCTACATCAGAAAATTAAATTTGCGGTTGACCAAAAGTTGGTGATCATATCGGGACAAGAAGACATTCTAGTCTCAATGCTTGCTTCGATGCCATACTTTGAAGCAACAGAAGAAGCTTTTGAGTCTTCATTCCAATCATTCAAGATTGCAAATGCTACAACTTTACATAAGAAGTCTGGAAGACCTAAGCCACGACTTTTAGAGGCCGCCTTTAAAGGAGACACTGGGAGTTTAGACAAGCTACCGAGGATGGCTAAAAATACAAGGAGGTTCGGGCTAGGGCATAAGCCAAATAGGTGTGACATCACCAGAGTGCAGAACCGTGAAAAAGCAAAACGTCTAACAAGATTTGAGAATAGGGAGCATGATTACTCAAGGAGGACTGTTTTACCTCTCAGCCACTCTTTCAGAAGTACCGACATAGTCCATCGGGAGTACGATGAGAGTTTTGCAGTGGCAGTAGTGATAGAAGAAAGAAAGCAAGTCGGACCTTTTATCTTCCCGTGCCCAGACAGTTTCGAGCTGAGCAATTGGAGCGTGCTAGAGCTGCTGTATTTTATAAATAATAAGTCAAAGTAATTGTTTTTTTTCCCTCATTTTCTTAATTAATAAATTAATTTTCTATTCCCTTGTGTATTCTATCCAGACGTATCATTTGTCTATAATTCTTCCACATCAATAAAGTTTAAATTCATTCACTTTGATTCTTTCCTTTGTTATTTCTCTGGGCCAATTTATTCATTTGTTTCCTTTTCCTTTCTTTTCTCCCTTATTATATCTTAAGCAGTAATACTGAGATTGAATGTGATAATGATTTGAAATACAAACTCGATACACCTATATACAACGTCGAGTCGGGTGAGGAAATGAACGATGAGCCCTCTGCTGAGTTATTAAGAATGTTAGAAGAAGAAGAAGAAAAGATGTTGGGACTCCATGAGGAATTAACTGAGACAGTTAACTTGAGAGTCGCAAAAAGTTGACAGAGTTACTTCATGAATATGCTGATGTTTTTGCTTGGTCATATCAAGATATGCTTGGTTTAGATACAGACATTGTAGTGCACAAATTGCCAATTAACCCAGAGTTCAAGCCGGTTAGGTA
GAAGTTACGGAAAATGAGGTCAGATATGTTGATCAAGATAAAGGATGAAGTAAGGAAGCAAATTGATGTGGGGTTCCTTACGGTATCTAATTACCCTGAGTGGGTGGCAAACATTGTCCCTGTACCAAAGAAAATGGACAAGTAAGAATGTGTGTGGATTATAGAGACTTGAATCGTACAAGTCTAAAAGACAACTTCCCGCTTCCTCATATTGATGTGTTGATTGATAATACTGTTGGGTTCTCAACCTTCTCCTTTATGAATGGGTTCTCAGGCTACAATCAAATTAAGATGGCACCTGAAGATCGCGAAAAAACCACATTCATTACGCTATGGGGAACTTGTTGCTACAAAGTTATGCTATTTGGGTTAAAAAACGCTGGGGCAACCTACCAGCGTACCATGGTTACTCTCTTTCATGACTTGATGCACAAAGAAATTGAAGTTTACGTGGATGATATGATTGCCAAGTCAAAACAGGGCGAGGAGCACACAACTATTTTAAGGAAGCTGTTTGATCGATTGAGGAAGTTCAAATTGAAACTCAATCCCAACAAATGCATATTTGGGGCAACCACTGGAAAACTCCTAGGTTTTGTAGTAAGCTAAGAAGGCATTAAAGTTGACCCGGCTAAAGTCAAAGCAATCTTAGAGATACCACCTCCACAGACGTAGAAAGAAGTCAGAGGATTTCTAGGACGCCTCAACTACATCGCAAGGTTCATATCTCACTTAACAGCAGGTTGCGAACCCATCTTTAAATTGCTCCGCAAGAACAACGATGGGGTATGGAGTGAAGATTGTCAAGCAGCTTTTGATAAGATTAAGCAGTATTTGCAAGACCCTCCAGTTCTTGTGCCACCAACTCCAGGATGGCCCTTTATTTTATATCTCACAGTGACTGAAAACTCAATGGGATGTGTACTAGGGCAGCATGATGATTTAGGCAGGAAAGAACATGCTATATATTATTTAAGTAAGAAGTTCACCAATTGCGAGACTAGATACTCTCAAGTAGAAAAAACTTGTTCTGCTCTAGCTTGAGCTGGCCGACGTGTAAGATAATACATGTTGTATTATACCACATGACTCATTTCAAAGATGGACCCCATAAAGTACATTTTTGAAAAGCCGTCTCTCTCGAGTTGAATGGCAAGGTGGTAGTATCTCTTGTCCGAATATGATATTGTTTATGTGACTCAAAAGGCCATTAAGGAGAGTGCTTTGGCCGACTACCTAGCTCAACAACCTATGAATGACTACGTACCGGTGAAGTTCGACTTTCCAGATGAGTATATCTCCACCATAACCGCAAGTGAGGAAAGTTTAGACCCACAAACTTCGACCATGAACATGAAATAAATTGCTTTGGCGATAAAATCTCGCTATTGTCGTCGAAAAGGCCCGCTAAAAACATGAAATAAATTGCTTTCGGCGACCACTTTAAAATCGTCGTTGAAAGGTGCCCTAAAAATCAATCTCCTTCCCCATTTCTCCATTTCTCTTCCCTTTTGCACCCCCCCCCCCCCCGATTTTCAGATCTCCGTCCGTCGCTGCCGCTGCTCGACCTCTAGTTGTCCGTCGCCGCCGCCTCGACCGCTCGCCGTCTGCTTCTGATTCTACCGTTCCACCTCCGCCCTCCGCCTCCGATTCGACCTGTTTCGCCTATCGGGCTCCTAATTCGTCTTCAGTTCCGCCACCCGCTGCTGCTGCACCACCGTCCACTGCTCCCAACCGCTGCCCGCCAGTCGCCGCCCGCCCGCCACCGCCTCCGACACCCTAAGGTTTATATTTTTTTTTCACTTTATTTATGTTAATTTTTTTAATATTAGATTATATTTATTTAGGTTATTTTGATTATTTTTAAAATATTATATTTATGTATATTATTTTGCTTTTTTAATCTTGAACACAAGCTTGAGTTATTTGAGAAATTTTTGAATTATATTTGAGATATTGTAAATGCTATGTGAGAAATTGTGAATTATATTTGGGATATT
GTGAATGCTATGGAGAAATTGTGAAATTGATGTTTTGTAATCTTGAACAAATGCTTAAGTTATATTTTGTATAGTAGTTGATGTTTTATATGAGTTACTGGAGAAATTGTGAAATAAGTTATTTGAGAAAATATAAAAGTTGTTATTTGTTTGATTTTGACGTACTTTTTTTGGTTGATGATTTAATTTATTGGCTTTGTAAGTGTGATTGTTTTATTTATACTTAAATTAGTCGATTTTTGTGCGCATGATTGTTTTATTTATATGTATTCTTTGTGATTTTATTGTTGTTGGTTTTGAATTTATAATGAATTATGAATGATTATTATGTTGGTTGTGACCGATGTTGGTCTAGATTTTTAGACGACCTCTCTAAGTATATTTTAGTTCTTTTTTTAGTGGCTATATCTTAGGAATATATCTAAGGCCTCAATAATACTTTCTAAGTCTTCTACTTTTAACTACAATACTCATAATCGACAATAATGACCACACCCTTGCATCCTCGTCTCACTTTGAAAACAAATTTTAGATATTTCTATTTTCAAAATATCCTTAGGTTGATGACAAAATGAACATAATCTTTTGTCTGATTCTTTTAGGATTCAACATTGACCATTTATACTGAATTGGAGGTTTTTTAAAAGAGTATTATCATCGATAATGAATAAGAATTGGATTAAGGTTAGGGATAGATTCTCGCCCGAGTATGTTCAAGGAGTAAAACATTTCATAGAATTAGCAAAACTTCACGTCAACAATACGAGGAAAACAAGATGTCCATGTAGATATTGTATGAATGTAAAGTGGCTTTCAATTGATGGAGTAGAAAGACACTTGTTTATAAATGGAATATCACATTCATATACTCAATGGATATACCATGGAGGAGAACCAATCCCTTCAAGTTGTTATAAAAGACGTGAAACTCCTTCGACAATGAATTAAATAGAGAATACAATGTGATGAATAATGAAGATGGTGATATGTCTGGACTTCTCAATGATTTACAGTACCCCATGATAGAAGAAAATGTAGAAGGTAAACAATTTTGTTTTGAGACGCTCCCAAACGAACATGGAAGAGACACGTTTAATAGTTTCGAGTAGTTGTTCAATGAAGTGCGTAGTCAATTGTACACAGGTTGTACAAAATTTTGATCATTGATTTTTGTAGTAAAGTTGATGCATATCAAGATCATTAACAATTGGAGTAATAAATCGTTTGATATGTTACTTGAATTGTTAAACGAAGCATTTACAATAGCTGCATCTATACCTAGTATACAGCATATGAGGCTAAAAGGAGACTGCGTGATTTAGGACTTTCATATGAGTCCATACATGCCTGCAAGTATGACTGTATATTATTTAGGAAAGAATATTCTGAATGTCAAAGTTGTCCAGTATGTGGTGAACCTCGTTATAAAAACAATGATGAGAAAGCTTTACAAGTCCCCCAAAAAGTGTTACGATATTTTTCGTTGATACCAAGATTGAAACGATTCTTTTCTTCAAAACATTTCTGTAAGGACAGTATTGTAATTTTAAGTAAAAAAAAGAGAAAAGAAAAGGAAAGGAAAGTGGGAAAGTGAAGACGTGTGTGAAGGCATTGAAGGGACACGTGTAGAGGAAAAATGGGAAAAATGGGAAAAATAGAAAAGAAAGAAAAAGGAAGAAAAGGAAAGGAAAAAAAGGGGGGGGGGGGAGTGTTGGGCACGAAATGAAACAAAAGATGGGAGGCTGAAAGGGAAGAATTTCATTGCTGAAGAAGGCGGAGGTGAAGGTGGTTTTAGAGCAAAGGGAAAAAAAAACATTGATCTTCTTGTGTTCAAAGTATCTCTTGAAATAGGTGGAGACTCTGAATATCTGATTCCTTGAACTCTTGGATGGATTTCGGCCTCTGTGCGTGTTTTCATTGTTATAAGTCAGAGGAATCTGTGAAGTAAAAGAAGAAGAAGAGAAGAAATGGAGAGGAAGAAATAGAGGGTTCCGTTGGTT
GTGGTTGAGGGAGGAAGTAGGTGAATAGTAGAGGAGAGAGAATGAGTGGGTTAAAATTAAAAATAAAAGAAAATGAAAGAAAAAAAATGGTTGGGAAATTGAAGCTCACGTGTAGGAAATTGAAATGGGTTGAAAGGTTATGAATGTGGGATATGGGTCCCACTTGTAAGGAAGATGAGAGAGGAAGAGGTGTAATTGGATGGTAGAGAGAGAATGGGGTTGTTGATTAAATTTATGGTTTAATTCGGAGAATTGTAGAGTTTAATTGAAAATTGAGTGTGAATTGAAATTTCATTGGTTATTATTTCATGACGATCGGAAATTGGAGCAGTGTACCGACCTAATTCACCTCGTTCATAGAATGTGGGTACATATGCGGTGTTGAATTTTCTTAACCTGTTTTAGGTTTGAAAGAAGATATTTGATACGTATTCAGAAAGTGTTTTTAGAAAGTATATGTTGTTACAAGTCATGTTCAGAAAATTACTGAATTTTAGTTGTTCAAGTTATTTGAAGTTTTAAATGAGAATATTTATTACTTCAGCTTTATGAAGCTCATCTTAAGAATTGAAATTTCAGTTGTTTTTATGGAAGAAGTTTTACGAAACTATTTTCTTATTAGAATGCCACTCACTGAGCTTCCAAGCTCACTGTTTATCATTGTTTTTCTCCACCAGGCTGCACTTGAGTTTCAAAGAGATAGCCGACCAATTTCTGTCACGCACCGTCTCAAAACTTGCATATCATTTTGTATGTACATCTAGAATTAACTTATACTGGAATAGTTGTACCTTTTGTAGTTGTATAGCATGTCTAGAAGTCGTGGCTGAATAATTATTTTTTTTAAACTGAATTATGCTACATTTTATGTTTTATATATTGAAGAATAAACTAAATTAGTTTTAGAAATTTAGAAATAAGTTGACTATAGACTTGGTAGATGGTTGGGGGTCTGTGGTCGGCGCGACTTGTTCTATAGTTAGTAGGTGCTTAGGTCGGGTTGTGTCAATTTCGCTTTTGAAATGAGGTGGCATAAAGAAAAACAAGTTGAGACAGAGGGTGTCTTACAACATCTAGCTGATGCTTTAGGGTGGAAGCATCTAGATAGCACATTTCCTCGATTTGCTTCAGACCCACGAAATGTTCGTTTGGGACTAGCTTTAGATGGGTTTCATCCATTTGGGAATATGAACAATTCACATAGCATGTGGCCAGTGGTACTTATTCCATACAATTTGCCTCCTTGGAAGTGCATGAAAGAAAATAATTTATTAATCTCATTGCCTATTCCTGGGCCGAAGTCACCTGGAAAAGATATCAACATTTACTTACAACCATTAATTGAAGAGTTGAAAGAGTTATGGGATACTGTGGTTCGTACTTACGATGTGTAAGTGGTGAGTACTTTCAACTACACGCAACTCTTTTATGGACTATTAATGACTTTCTAGCATATGGCGACTTGTCCGGATGGAGTACAAAAGGATACCAAGTATGTCCCATCTGTAAAGAGGACAATTCATCTTTTTTTTAATAAGAAACAAAATTTGTTTCATGGGACATCGACGTTATCTTCCTCAAAATCATAGTTGGCGTGGGAGTAGATTGCATGATGAAAAGTTAGAACGACGCCCCCCACTAGTGTCAATGGATGGAGATGTGATTCTTCAACAAGTAAATACACTTAATTTTCTAGATTTTGGTAAGAATCCAACCAAACAAAAGAGTAAAAGAAAGGGAGATTTAAATTGGACGAAAAAAAGTATATTCTTTGAACTTCCCTATTGGTCGAAACTTGTGTTAAGATATAAGTTGGATGTGATGCATATCGAGAAGAACATATGCGACAATCTAGTAGGAACTTTGTTAAATATCGATGGGAAAACAAAAGATACGATCAATACGCGTTTGGATTTATAAGATTTAAAAATTTGGAAAGATTTACACTTACAAAAAGAGGAGAATAAAGTTGTAAAAAACCACATGCTACGTATACATTG
ACTGCTGCTGAGAGGGTGAAGTTTTGCAAATTCTTGAAATCAGCTAAGTTTCCATAAGGATTTTTGTCTAATATATCGATATGTGTGAGCACAGAGGACGGGAGACTATAGGGGTTGAAAACTCACGACTGTCATGTTTTGCTACGACGACTATTACCGATTGCCATTCAAGCATACGTACCACGTTTGTGGGTACAACTATCGTTGAGCTATGTAACTTCTTTCGTGATATATGTGCAAAGACAGTACGCATCAATGATTTGGATCGATTGCATTCTGACATTATAATCATACTATGTAAGTTGGAGAGAAAATTCCCACCAACATTTTTTGATGTTATGATACATCCTGCAGTTCACTTACCATACGAAACAAAAATTGTGGGGCTAGTTAGTTACAGTTGGATGTACCCTATCGAGCGAAGTCTCCGAACCTTAAAACAATATGCACAAAAGAAGAATAGAGCTCGTCTTGAGGGTTCGATAGCCGAATCATTCGTCATGAATGAATCGTTGACTTTCTGCTCATTGTATCTGAGTAGAATTGAAACGAGATTCAATAGAGATGAACGCAACGATGATACTATTGTCGGTTATAAAAATTATGGACATTTTGACGTGTTCAGGCTAAGTGCACGGCCTCTTGGGACATCAACTTCGAAAACATTATCCGTCATTGAAAGAAGAGTTGCACACTGGTACGTACTGAACAACTGTGCAGAGATATAGTCGTATCGCGAGTAAGTTCGACTTATGCTACGTTCACTTTAATTTCACACTTAGTACGTATAACACTAATGTTGTTTTTCTCTTTTTTTGGAATTGTAGTCAACATTTGAGCTCAATTCGAATTCCTGGAGACAATGTTCCTGCCCTATATGAAGGACGCAGTACACAATTTCCTGATTGGTTAAAAAAAAGGTAATATTGATTTTTAGGGTTTCAATGTAGATGAACACAAAACGAAATCGTTAAGTATTTTTTTATGATCTATTACAGGTTTTGACATTACGGGAACAAGGCGAATTAGATGATGATCTTTATTCACTTACGTTAGGGCCGTCACTAACTGCATGTTCATATAGTGGTTGTATTGTAAACGGACTACATTTTCAAAATATAGAACGCGACAGTCGTCGAACGACACAAAATAGTGGGGTCATAGTGTGTGGAGGAGAAGAGGATGACATGAATTTTTATGGTGTCTTGAGTGATGTTTGGGAGTTAGATTATGTTATGGGGAGACGTGTGATACTTTTTAAGTGCAAATGGTTTGACACAGACTCTAAGAAGAATAGAATTCGAGTTGACTTAGATTTTAAGTCAATTAATACGTCTAAGTATTGGTATGCCGACGACCACTTAATTCTCGTTGGACAAACTCAACAAGTATTTTATTTGGATGACCCCAAACTTGGTAATAATTGGAAAGTTGTGCAACATATTCTAAATAAACGTGCATGGGATATTTCAGAACAAGAAGACGTTGAAAATGATCAAATTTCGTTGTGAGAAGTAGGACCTAGAATTGAAATAGATGAATCCACTCAAGATAGTTCATTGCGTCGAGATGATGTTGATCCTATTGTCGTTGAGGGTTGAGAACAGTCCAATGGAAGCATAGATGCGAGCAATGTGGATGATTTTATAAATGACGAATCGGACGATGTGGAAGCGTCAACTGATAATGATATAGATGATGTTGACGATGAGTATTATAGCGAATAAGTATTCCTCCAACTATGTCGATTTAAATATATGTTGGATAAATTATACATCAAATTCATTTATGTTGAGTATTTGTTTTGTCAATATATAATACTTTTCGTTGGTTGCCTAAGTTTGCAGAAAAATGAAACCAGACCAAGCAGGTCAGTTAGAGGCACACGTACAGGGTCGTGAGCATCCACTAGCAGACCCGCCTATTCTGGTTGTTCTTCCAGGTACGTCAATTTAAATATATATAAAATAAATAAGTATGATTTTACTAATTGGCTTACT
GTTTGACAACATACAGATAGATTAGGCGTGCGAGGATTTACTCGATTGATAGAGCTATAAAATCAAGTTCGTGCAACTAACGAAAGGTACGTATCGTATGTGACCACACAGTTAATAAACCAATATGTCCCATGGCCACCACATTCTCTATGGCGATCGGGGTACATACACGCGAACAAGTCGACCTAAAGGTTAACAAGTGGAGTGAGGTCGAAAAAATTCAGAAGAGGGCAATTATAACTCAGCTCAAGGTTTGAGATTAATTACGAAATATATTTTATCTTTTGTTATGATGCAACAGTAATAACACTTAAATATATGTTCGATGTTTTTTTTGTTATAGGGTCATTTCGATTTCAACGAATCAGATCGAGTTATGAAGAAATTCATCGAGCATGAGATGAGAACGACACATAAGGCATATAGGGAAAAATTGCGCCAACACTACCTGAGTTATCCCACACCTGAGATTGCGCACAACAATCCTCCTGAGCAATTGTTGGCAAAACCAGACGATTGGAACATGTTGTGCAATAGATGGGAGACAGATGAGTGGAAGGTAATTTTACGAATATAAATTATTCTTTTTTTTAGTATTTATACTCAATATATATTCTCATTTAAGTTTTCTTTTGTTAGAAAAAATCTGAGACCAACAAATGCAACAGGGCGAAGATGCCATTCAACCATCGTGCGGGGCCGAAGGCATTTGCTATTATTGCAAAGGAGAAGGTGCAATGCAATAATATTTTTTTAGCCATATACTTATATTACTTAAGATTGATTATCAACAATTATGTGTTATCAATTGCAGAAAGAAGAAGAGGGAGGTGACAATTTTTCTGAAATCGATTTGTTCAGGGAGACGAGATATTCTGACACTAAAGGTTGGGTCGAAGGAGCGGAGACAGCTTACGTAAGTTTACAAATCTTATAACAATTAAGTTCAAATTACGCATTGAATTTTATGTGTAAACATTTTGTTCTTTTTAAAAATATGTATGCAGCTAGACATGATGCGTGTTAGAGAAGCATCTATGTAAGACGGAGGTGAATCGATACCAGACCCAGAAGTATTAGAGACAGTTCTTGGTTATCGTTCAGGATACGTTAAGAGTGCTGGTTGGGGCCCAAAACTAAAGTCTCGTAGAGGTTCTTATTCAAGTGTAGCATCAACACAGCGGGAAGAAGAGCTGTCACAAGAAATATATACTCTAAGAGAAGAATTTCAATACCGTTTCACACAAAAAGACAAGGAGTTAGATGAGCGTCTCGCAGCCAAAGATTATGAGATTACGAGTCTACGCGGTGAAATGTCAGATTTGAGATCAATGGTGCTACAACTTATGAGTCTGTCTAGTGGAGCCAGCTCGTCAAAATAGGTATGTCCACATAGAGTGTTGTTAGAATTCAATAACCATAAATGTGACTATCTTGCAGTTATGGTAGTCATTTAGAACGTGTATTATGAATGATTTAACTATGTTTATGTTTGAGTGCATGTTTGAAGTGTTTTAGATATTAATATCAAGTGAATCTTCTGTGAAGTGAATCTTCTGTGAAGTGAATCTTTGAAGTGAATTGGTACGTCAAGTATAGTGTTGTTAAAATTCAGTAACTGTAAATGTGGCTATCCGGTAGTTATGGTATCTAGAACATATGTCATGTATAGTTTAGCTATGCTTATGTATAATGTGGTTTTGGATGATTTTGTTGAATATGTGAAATGAATTGGTGATTGTGGATGTTTTACATAATAAAAGCAAGTTTTGAGAATTTGAAGTCTGAAGATTAGTTTTTCTTAACTTTGTGGGGTTTTATACGTACCTTTCCTTTAGTTTGTATAATTTGGATTGAAGTTAATGAGAGTTATATGGACATGAGAGATTTATGTGTGTGTTTTGATAATGAGACTAAGTTTTGTGCATTTTAAGTTTGTTTGGGAGAGTTTTATTGGCTTTGAATATATGAGAGTGTTGGGGGGGACGTTATTAAGTGTT
TTTGGATAACGTTGTTGAATTTTAAGTGAATTGTGATAGAGTGTTTGTGAATGTTTTTTAGACTTGGTGAGAGTAAGTTTTGTGAATTTTAAAGTTGTTTTCATGAGTTTTTGTTAACTTTTAATGCTTACTTTGAGTTCATGCATATGCAAAGCTTAGAATGACGTTATTAAGTGTTTTTGGATAACGTTGCTGAATTTTAAGTGAATTGTGATAGAGTGTTTGTGAATGTTTTTTAGACTTGGTGAGAGTAAGTTTTGTGAATTTGAAAGTTGTTTTCATGAGTTTTTGTTAACTTTTAATGCTTACTTTGAGTTCATGCATATGCAAAGCTTAGAATGACGTTATTAAGTGTTTTTCGATAACGTTATTGAATTTTAAGTGAATTGTGATAGAGTGTTTGTGAATGTTTTTTAGATTTGGTGAGAGTAAGTTTTGTGAGTTTGAAAGTTGTTTTCATGAGTTTTTGTTAACTTTTAATGTTTACTTTGAGTTCATGCATATGAAGTGAGAATAACATTTTTGAGTTCATGCATGTGCAAAGCTTGAATGTTTAGTTTGACATGAGGAATGAGTTAGAATTGGAAGTTTGTTTGAATGGAAGTGGTGAGTTAACTTGTGGAAGTGTTTGTTAACATAAGTTTATTTGTTAATTATTTTTTCAGGTTCGTTCTATTTCTGTATCGAAAGCTTCTGAAAAGTTGATTATCTCTCGAGAAGACTTTGATGATATGAGACATTTGAATGTTTAAGTTTTTTATTTTGAGACTTCTTAGAATATGTTGTTTAATTTTAGTTTGTATGGACTTCTTGTACTTTTTGTAAAACTAATTTTTATATGATGGTATAAAATTTTATATGAAATTCAATCTATGGCATTTAGGTTTATTGTTGAAGAGGAGGAAAAAAATAAATAAAAAAATCGGCGACGAAACCAACATTTGATCGCCAAATGTCAGATTTGGCGACCAAAACTGTTTTGTGGCCGAAGGTTACCTTCGGAGACCAAAACAGATTTGGTCGCCGAGGACACCTTCGGCAACCAATATCAGGTCGTCGAAGCTACCTTCGACAATGAAAAACGAAAGTCATCGCCGAAACCCTCTCGCGACTGGGCAACGGCAACGAGTCAGGCGACGACTTATTTTCGTCGCAAAAGGTTTTCGGGGACGAATTTGGACCTTTTCGTCGCCGAAACCCTAAATTCTTGTAGTGAGGAGAATGGGAAACAAGAGACACTAAGTTGTTGCCTTACAAACAACTCATAACAGAATTGTCACAAGAATTTGATGAAATCTTATTTGATTATTTGCCAAGAGAAAATAATCAAGTAACAGATGCATTGGCCATATTACCAGTGATGTTCAATTTAGAACTCAATGAGGATGTCCGTCCGATTAAAGTTGGGAGGAGAGATGTCCCAGCTTCTTGTATGAGCATTGAGGAAAAACCCGACGGTAACCCCTGGTTTCATGAAATTAAGCAATATATCAAGAGTAAAGAATATCCACCAAATGCTTCAAAAAATGATAAGCGCACCTCCGCAAGTTGGCAACGAAGTTTTTCTTAAACGGAGAGATATTGTACAAGAGAAACCATGACATAGTTCTCCTAAGGTGCGTTGAAGGAAGAGAGGCCAATAGGATTATGGAGGAAATTCATGAAGGAATTTGTGGCACTCATACAAATGGGCACATGATAGCTAGACAAATTTTAAGAGCTGGCTGTTACTGACTGACTATAGAGACAAATTGTATTAAATATGCAAGAAAATGTCACAAATGTTAAATTTACTCGGACAAGATTCATTCTCCTGCTTCTCATTTGCATACTTTGACAGCCTTTTGGCCTTTCTCTATGTGGGGCATGGATGTGATAGGACCTATCGAACCTAAAGCATCAAATAGGCACCAATTCATTTTGGTAGCCATAGATTATTTTACTAAATGGGTAGAAGTAACTTCTTACCGAGATGTGACAAAAAGAGTTGTAGTCAAATT
ATCAAGAAATAAATTATTTGTCGTTATGGTCTTCCTGAAAGCTTAATTTCCGATAATGCAAGGAATTTGAACAACAAATTATGAGTGAACTCTATGAGCAGTTTAAGATCAAACATCTCAACTCAACCCCTTATCGACCTAAGATGAATGGTGTAGTGGAAGCTGCCAACAAAAACATAAAAAGAATAATTGAAAAGATGACTGTAACTTACAGAGACTGGCATGAAATGTTTCCATTTGCCTTACATGATTACCGTACCTCAGTTCGTACTTCAACTAGGGCAACGCCATTTTCATTAGTTTATGGTATGGAATCTGTTTTCCCTGTTGAAGTAGAAATTCTATCATTGAGAGTTATCATGGAGGCTAAGTTACAAGAGGCTGAGTGGGTCCAAAGACGTTATGAGCTGCTAAATTTTGTAGAAGAAAAGAGGTTGACGACATTATGCAGAAGACAACTTTATCAAAGAAGAATGATGAAAGCCTATGACAAGAAGGTGCGCCCAAAAAGGTTTAGAGAGGGAGATTTAGTTCTAAAAAGAATTCTCCCATTACAGAAAGATCATCGAGGCAAATGGACCCCAAATTATGAAAGACCATTTGTGGTAAAAAAGACTTTCTCTAGAGGAGCACTAGTTCTGGCTAACATGGATGGCAACGAGTTTTTAAAATCAAGTCAATTCAGATTATGTTCGAAAGTATTATGCATAATTTCTCGTGTATCTCGAAAATTTCACATTGATCTCAGTATTTCACTTGAAAGGCATTTTATGTATTCATTTTTTATATAGCCTACAAATTTGAACAACATTTTTGAGATATGTATCCTTTGTCATTTGAAAATCTCATTTATAAATTATGTTCTCTATTCGTCTCTTTATTTATGAATGACGGTAGTGGCTTACATTCACGTTCTCCAAGGTCACTTGTTGCTTGAAATCAAACGTAAGGGCACAGGAAACCTAAAAATAGGGACATTTGTTCTCTGGAATATTGGGGAAAATATGTCATTACAAAGTTATCAGGGGCAAAGTCGTCGTCCCATTTGTTCTGGCTACTCTCGATCCAGTTAAAAGGATTTTTGAAGGAAAACGAAATGCGGTGATCGGGCTATCTGAACAATCCCAAGTTTTTATAAAAGTAAGTGATTTCACTCTATTTTGGCATCCAAGAAAATTTTTCCTGTTATCATCATGCGTCAAACGTACGTTTCATGCATCGTTTCATGCATCATTTCATGCATTTTCCTTGAAAAAGTCTCAAAAAAGCCCTATGGTAGAGAGTTTCATGACAGTTCAAAGATCGTGCTCGAAGCCTGTATCAAGCCAGGTCCGTTGCATTTTAGTTTTTCGTTGGTTAAATCGAACCCCATCAAAGATCGTGTCGAAGCCTTTGTCAAGCCAGATTAGTTGCCTTTTAGTTTTCTGTTGGTTAAATCGAACCCGATCAAAGATCGTGGTCGAAGCCTGTGTCACGCCAGGTCCGTTGCATTTTAGTAACCGTTGGTTAAATCGAACACGATCAAGGATCGTGGTCGAAGCCTGTGTCAAGCCAGGTCCGTTACCTTTTAGTTTTTTTCGTTGGTTAAATTGAACCCGATCACAGATCGTGGTCGAAGCCTGTGTCAAGCCAGGTCCGTCGCATTTTAGTTTTCTGTTGGTTAAATCAAACCCGATCAAAGATCGTGGTTGAAGCTTGTGTCATGCCAGGTCCATTGCCTTTTAGTTTTATGTTGGTTAAATTGAACCCGATCAAAGATCGTGGTCGAAGCCTGTGTCAAGCCAGACCTGTTGCCTTTTAGTTTTCTGTTGGTTAAATAGAACCCGATAGATCGTGGTCGAAGCCTGTGTCAAGCCAGGTCCATTGCCTTTTAGTTTTTAGTTGGTTAAATCGAACACGATCAAAGATCGTGTTCGAAGCCTGTGTCATGCTAGGTCCGTTGCCTTTTAGTTTTTCGTTGGCTAAATCGAACCCGATCAAAGATCGTGGTCAAAGCCTGT
GTCAAGCTAGGTCCGTTGCCTTTTAGTTTTCTGTTGGTTAAATCGAACCCGATAGATCGTGGTCGAAGCCTATGTAAAGTCAGGTCTGTTACCTTTTTAATTCCCCGTCGGTTAAATGAAACGCGATTAAAGTTCGTGATCGAAGCTGGTCAAGCAAGGTCCGTCATTCTAGCCTCGCCTTCCCAATACTTTGTCTTTTGCCTTTGTCTACCCTTTTTCTGCAAGGATACACAAAGGGGGGCAAGATGTAGACACCGTATTTTGTTCGGCTCGCTTTCGATAAAAAAAAATAATTTCTGATTTATCTATTATTTTATTATATTTATTGATTTAGTTTAAAATAACATAATTTACTTTATTATAAGATTTAATATAAAGTTTTAATTAAATCAAATTACGTTTTAAAATTAGATTTTGTTTGAAAATAATTGGTTTTAAGATTTAAAAAACTATTTTTTGTTCTCTCCTTTGATTTTTTCCTTCGTACGTGTCCCCTTCCCTTCTTCTCCTCCTACTTTTTCTCTCTTCTCTTGCCTCTTTTTCTTTTTTACCATTATCTTTAGGTGTATTTAAAGAGGGCTACAATGGAGAGGAGGAAACACACAGAAAACCCATACAGAGAAGAGGAACAAAAATTCACTTCCTTCTTTCCCTCTACTATCACCTTCTCTTCTTCTTCACCCACACCGGCGACCCCAACGATACTCTGACTGCATACGTCCACCGGTGATCACAAGTTTCGACAACCCGAGCACAGTAGTCACGTGAAAAGCTCTATAATCCTGACGGACAACAATAATATAACGAACAACTTATACAACCGACGCACAGATCTCCGGCAACCTGACTCCTACCTCAAGTGCTCCTTCACGGTTTCAGACCCGCGGGTACACCAAGGATTTCGTCACTGCAGTACGTGATATAGAAAAAAATGCTTTTAGAAATGATTTGATTTTTTTAAATAAATTTGTTTTAAATATGAGTTTTTAAATAAGTTTGTTTTAAATTTGATATATATATTTTTTAAAATAAGTTTGTTTAAATATGATTTGATTATTTTTAAAATAAGTTTCTTTTAAATATGATTTGATTTTTTTTTAAAATGAGTTTGTTTTAAATATGATTCTTTTTTAAATGAATTTGTGTTAAATTTGATTTTTTTAAAAAGGAAGTTTGTTTTAAATATGATTTTTTTTCTTTAAAACAAGTTTGTTTTAAATGTTTTTTTTTTAAGTTTTAAGGAAGTTTGTTATAGGGATTTCAAATCTACTAATTCATATCTTATAACAAAGTGTGAATTAATATAACGTGGGACATTTCAATCAAAATACAAAAATAATTTTCTTTTGGACCAAATAGAATAATAGGATAATTTTTGACAAAATATGGTACCGTGTTCGAACGGGTGTCGTATGATGCTAACACCTTCCCTATGCTCAGCCGACTTCCGAACGTAAAATCTTTTTATCGTAGACCGAGTTTTTAAAATAGGTGACCAATCACACCTCGTAGATGATTGGTGGCGACTCCAAACCCTAATCTCGAAAGAGATCTGTTAGGGGATGACATCGGCCGCTCCGCGTCACGATGACGTTGTGATATTCATTAATAATAACAAAGTTAAATTAAAAAAATAAATATTAAATATAAATTAAATATTTTATAATAATATCACATCAACTAACAAAAGTGATATAAATTATTAAGAGTATAAATGAAAAAGAAATAAAAAAGTATAAAATAATTAAATAAAAAGATATGGGTCTCATTAATTAAGAGAGATAAACGATAAAAGTAGAATAGTGATTGCAATAAATGTGTAGGACCCACGGAATAAGAAATGAGAATGAGATTGATGAGGGTGTAGGAGAGGAATGAGATTGCTCCTTTTTAGGGAATAGGAATGAAAAAAGTGGGTGGGTCACACCATTTGATTCCCATTTCAACAATTTAGTAAACACCATCAAAGGGAATAAAATAATTGATTATCATTCTTTCAT
TCCCATTCCCTTGAAGTAAACATGGCCTTAGGGAGTACGATCAAGGCAATTAAGAGTAAACTAACCCGTAGACAACTTAGTATGTTTAGGAAGACCATTTTTGGTCATTTGCTAGATGTGGATCTTGTGCTTAATGAGTCACTTATACACAATCTATTGCTTAGGGAAATGGAGGATAGTAGACACGAGGTGCTTATTATGGACGTGCTTGGTTATAGGGTATCTTTTGGCAGTTCAGTCAAAATTGTCAGAAAAGCTCGTCTACCATGGTGCAAGATGATTCAGAGGATGTTCGAGTTCTATCCAAGTCATTTAAGGTACGACTGTTTTGCATATCATATATACCGTACGCCTGCATAGAATCTCCTAACTATGTGATTGCTCTTCATAGGTGATCGATTCGTTGGTGATGTTCATGTATAAGAAGCTCGAACTGAGACCAGACTTGTGTCGCCACAAGTTCACCATCGATGTCGTAGTACTTGCGGTAAGTTGCCTTGTTCAGTAATTGTTCAATGAATACATGTCTTACTGTTATTTAATCGTCGTACTCGTTACATAACTTCCTTAGACGAACAGACGGTGTCTGTGCCTGCATGTTATCTCCTAGCATCATTACTTTGAGGGTTGCATCGAATTATGATTGGGAGGGAAGAGCGAAGACTGTGCTCAGCTACATAGATGGTACTCACTCAGACTACGAGACGCGGTGGATGGATGTCGATACTGTATATCTAACCGATAATATCGGTGGAACGCATTGGATAATGTTGTGCATCGACTTTGACGAGGGTGAACTTATCGTGTCAGACTTCTTCATGGCCATGACACCACTACCAAATTTGGAGGAGGAGTTGAAGTTGATGATAACTATCATCCCGGCCCTTATTTGTAGGGTCGGTGTTGCGATAAAGAAGCAGGACATACCATCCACATCATGGCGCATCCATAGAGTTTCATAA

mRNA sequence

ATGTCATCCTCCTCGCGTTGGAAAAAAGGCAAAGCGACTACGGATTCTGCTACATCTAGCAACCCCGTCCACAAGCCTATTCACATCCCTTTGTCAAGGGAGCTCAGCAAATCCGAACCAACATCATTTTTTCGAAAGCCCGAACAAATGATGTCACCCCCTACCGTCCTTAATCTAGATGGTCTCCTAGCCAAGGCAGACCCAGTCAGACAAAGTGCCCCCAGTAACGAAAAGTTCGAAGTTCTGGAGGAAAGATTAAGAGCAGTAGAGGGGACAGACTTAGATAGCTCTCATGTTGGCTCGTGGAAGAATCTGATCGACTCCTTCCTAAAACAATATAAGCATAACATAGAGATGGCTCCAGATCGCTTAGATTTACAGAGGATGGAGAAGAAGAGTACAAAGAGCTTTAAAGAGTATGCTCAAAGGTGGAGGGACACGGCAACTCAAGTCCAACCTCCTTTAATGGATAAGGAGCTATCTGCCATGTTCATCAACACCCTAAAACATCCTTTCTATGATCGGATGATAGGAAGCGCTTCCACAAATTTCTCTGATATTATGACAATCGGAGAAAGGATCGAATACGGGAATAATCAGCTGGCACCTGTACCTGTAGATCCGATCCAACCTCCATACCCAAGATGGTATGATGCAAATGCTCATTGTGACTATCACGCGGGAGCCATAGGGCATTCAATAGAGAACTGCACTACATTTCTGACGTGCCCGTTCCACGCTGGGGCGAAAGGACACTCTCTAGAGCAATGCAACCATTTTCAAAAGAGAGTTCAAGAGTTGTTAGACTCAAAAGTCCTTACTGTTACAAAATCTCACCTGAAGAAAAGGATAAATGTCGTGGAGGATATCTTGGTTGCTGAGGGCTCAAGTGATTCTCTTAAGCCAAAACCTCTCACCATCTTTTATAGTGAAAAGCCAGATGCAGTCGGAAACCAATCACTATCACTGGTCCTAGCTCCTTTTGAGTATAAAAGTTCCAGAGTAGTGCCTTGGAGATATGAGTGCAAGGTAACTGTAGGGCAAGAGAATAAAGAGAAAGCAAGTGAGAAGAAGAAGGGGAAGGTAGAGGAAGATAAGAAAGGGAATGCCAAACTCAATGAGTATATTTATGATGAACTGGTGGAGGCAATTGTTGTAAAGGATGCAAGTCGTAAACAACCCATGTCCAAGGAAGAGACTCAAGAGTTTCTAAAGCTAGTGAAGCAAAGTGAATACAAAGTTATTGAACAATTAGGTCGGACACCTGCAAAAATCTCTATATTATCTTTACTGTTATCCTCTGAAGCGCATCGGAATACACTGTTGGAGTTTTTTGTTGGGGTGACCTTGGATACATTCGGTAAGGGCAGTTCGTCTACTCTACATCAGAAAATTAAATTTGCGGTTGACCAAAAGTTGGTGATCATATCGGGACAAGAAGACATTCTAGTCTCAATGCTTGCTTCGATGCCATACTTTGAAGCAACAGAAGAAGCTTTTGAGTCTTCATTCCAATCATTCAAGATTGCAAATGCTACAACTTTACATAAGAAGTCTGGAAGACCTAAGCCACGACTTTTAGAGGCCGCCTTTAAAGGAGACACTGGGAGTTTAGACAAGCTACCGAGGATGGCTAAAAATACAAGGAGGTTCGGGCTAGGGCATAAGCCAAATAGGTGTGACATCACCAGAGTGCAGAACCGTGAAAAAGCAAAACGTCTAACAAGATTTGAGAATAGGGAGCATGATTACTCAAGGAGGACTGTTTTACCTCTCAGCCACTCTTTCAGAAGTACCGACATAGTCCATCGGGAGTACGATGAGAGTTTTGCAGTGGCAGTAGTGATAGAAGAAAGAAAGCAAGTCGGACCTTTTATCTTCCCGTGCCCAGACAGTTTCGAGCTGAGCAATTGGAGCGTGCTAGAGCTGCTTAATACTGAGATTGAATGTGATAATGATTTGAAATACAAACTCGATACACCTATATACAACGTCGAGTC
GGGTGAGGAAATGAACGATGAGCCCTCTGCTGAGCTAAGTGCACGGCCTCTTGGGACATCAACTTCGAAAACATTATCCGTCATTGAAAGAAGAGTTGCACACTGTCAACATTTGAGCTCAATTCGAATTCCTGGAGACAATGTTCCTGCCCTATATGAAGGACGCAGTACACAATTTCCTGATTGGTTAAAAAAAAGAAAAATGAAACCAGACCAAGCAGGTCAGTTAGAGGCACACGTACAGGGTCGTGAGCATCCACTAGCAGACCCGCCTATTCTGGTTGTTCTTCCAGTTAATAAACCAATATGTCCCATGGCCACCACATTCTCTATGGCGATCGGGGTACATACACGCGAACAAGTCGACCTAAAGGTTAACAAGTGGAGTGAGGTCGAAAAAATTCAGAAGAGGGCAATTATAACTCAGCTCAAGGGTCATTTCGATTTCAACGAATCAGATCGAGTTATGAAGAAATTCATCGAGCATGAGATGAGAACGACACATAAGGCATATAGGGAAAAATTGCGCCAACACTACCTGAGTTATCCCACACCTGAGATTGCGCACAACAATCCTCCTGAGCAATTGTTGGCAAAACCAGACGATTGGAACATGTTGTGCAATAGATGGGAGACAGATGAGTGGAAGAAAAAATCTGAGACCAACAAATGCAACAGGGCGAAGATGCCATTCAACCATCGTGCGGGGCCGAAGGCATTTGCTATTATTGCAAAGGAGAAGAAAGAAGAAGAGGGAGGTGACAATTTTTCTGAAATCGATTTGTTCAGGGAGACGAGATATTCTGACACTAAAGGTTGGGTCGAAGGAGCGGAGACAGCTTACCTAGACATGATGCGTGTTAGAGAAGCATCTATTGCTGGTTGGGGCCCAAAACTAAAGTCTCGTAGAGGTTCTTATTCAAGTGTAGCATCAACACAGCGGGAAGAAGAGCTGTCACAAGAAATATATACTCTAAGAGAAGAATTTCAATACCGTTTCACACAAAAAGACAAGGAGTTAGATGAGCGTCTCGCAGCCAAAGATTATGAGATTACGAGTCTACGCGGTGAAATGTCAGATTTGAGATCAATGGTGCTACAACTTATGAGTCTGTCTAGTGGAGCCAGCTCGTTACCTTCGGAGACCAAAACAGATTTGGTCGCCGAGGACACCTTCGGCAACCAATATCAGGTCGTCGAAGCTACCTTCGACAATGAAAAACGAAAGTCATCGCCGAAACCCTCTCGCGACTGGGCAACGGCAACGAGTCAGGCGACGACTTATTTTCGTCGCAAAAGGTTTTCGGGGACGAATTTGGACCTTTTCGTCGCCGAAACCCTAAATTCTTGTAGTGAGGAGAATGGGAAACAAGAGACACTAAATGCATTGGCCATATTACCAGTGATGTTCAATTTAGAACTCAATGAGGATGTCCGTCCGATTAAAGTTGGGAGGAGAGATGTCCCAGCTTCTTGTATGAGCATTGAGGAAAAACCCGACGGTAACCCCTGGGGCAAAGTCGTCGTCCCATTTGTTCTGGCTACTCTCGATCCAGTTAAAAGGATTTTTGAAGGAAAACGAAATGCGGTGATCGGGCTATCTGAACAATCCCAAGTTTTTATAAAATTCAGTCAAAATTGTCAGAAAAGCTCGTCTACCATGGTGCAAGATGATTCAGAGGATGTTCGAGTTCTATCCAAGTCATTTAAGGTGATCGATTCGTTGGTGATGTTCATGTATAAGAAGCTCGAACTGAGACCAGACTTGTGTCGCCACAAGTTCACCATCGATGTCGTAGTACTTGCGGTAAGTTGCCTTGTTCAACGAACAGACGGTGTCTGTGCCTGCATGTTATCTCCTAGCATCATTACTTTGAGGGTTGCATCGAATTATGATTGGGAGGGAAGAGCGAAGACTGTGCTCAGCTACATAGATGGTACTCACTCAGACTACGAGACGCGGTGGATGGATGTCGATACTGTATATCTAACCGATAATA
TCGGTGGAACGCATTGGATAATGTTGTGCATCGACTTTGACGAGGGTGAACTTATCGTGTCAGACTTCTTCATGGCCATGACACCACTACCAAATTTGGAGGAGGAGTTGAAGTTGATGATAACTATCATCCCGGCCCTTATTTGTAGGGTCGGTGTTGCGATAAAGAAGCAGGACATACCATCCACATCATGGCGCATCCATAGAGTTTCATAA

Coding sequence (CDS)

ATGTCATCCTCCTCGCGTTGGAAAAAAGGCAAAGCGACTACGGATTCTGCTACATCTAGCAACCCCGTCCACAAGCCTATTCACATCCCTTTGTCAAGGGAGCTCAGCAAATCCGAACCAACATCATTTTTTCGAAAGCCCGAACAAATGATGTCACCCCCTACCGTCCTTAATCTAGATGGTCTCCTAGCCAAGGCAGACCCAGTCAGACAAAGTGCCCCCAGTAACGAAAAGTTCGAAGTTCTGGAGGAAAGATTAAGAGCAGTAGAGGGGACAGACTTAGATAGCTCTCATGTTGGCTCGTGGAAGAATCTGATCGACTCCTTCCTAAAACAATATAAGCATAACATAGAGATGGCTCCAGATCGCTTAGATTTACAGAGGATGGAGAAGAAGAGTACAAAGAGCTTTAAAGAGTATGCTCAAAGGTGGAGGGACACGGCAACTCAAGTCCAACCTCCTTTAATGGATAAGGAGCTATCTGCCATGTTCATCAACACCCTAAAACATCCTTTCTATGATCGGATGATAGGAAGCGCTTCCACAAATTTCTCTGATATTATGACAATCGGAGAAAGGATCGAATACGGGAATAATCAGCTGGCACCTGTACCTGTAGATCCGATCCAACCTCCATACCCAAGATGGTATGATGCAAATGCTCATTGTGACTATCACGCGGGAGCCATAGGGCATTCAATAGAGAACTGCACTACATTTCTGACGTGCCCGTTCCACGCTGGGGCGAAAGGACACTCTCTAGAGCAATGCAACCATTTTCAAAAGAGAGTTCAAGAGTTGTTAGACTCAAAAGTCCTTACTGTTACAAAATCTCACCTGAAGAAAAGGATAAATGTCGTGGAGGATATCTTGGTTGCTGAGGGCTCAAGTGATTCTCTTAAGCCAAAACCTCTCACCATCTTTTATAGTGAAAAGCCAGATGCAGTCGGAAACCAATCACTATCACTGGTCCTAGCTCCTTTTGAGTATAAAAGTTCCAGAGTAGTGCCTTGGAGATATGAGTGCAAGGTAACTGTAGGGCAAGAGAATAAAGAGAAAGCAAGTGAGAAGAAGAAGGGGAAGGTAGAGGAAGATAAGAAAGGGAATGCCAAACTCAATGAGTATATTTATGATGAACTGGTGGAGGCAATTGTTGTAAAGGATGCAAGTCGTAAACAACCCATGTCCAAGGAAGAGACTCAAGAGTTTCTAAAGCTAGTGAAGCAAAGTGAATACAAAGTTATTGAACAATTAGGTCGGACACCTGCAAAAATCTCTATATTATCTTTACTGTTATCCTCTGAAGCGCATCGGAATACACTGTTGGAGTTTTTTGTTGGGGTGACCTTGGATACATTCGGTAAGGGCAGTTCGTCTACTCTACATCAGAAAATTAAATTTGCGGTTGACCAAAAGTTGGTGATCATATCGGGACAAGAAGACATTCTAGTCTCAATGCTTGCTTCGATGCCATACTTTGAAGCAACAGAAGAAGCTTTTGAGTCTTCATTCCAATCATTCAAGATTGCAAATGCTACAACTTTACATAAGAAGTCTGGAAGACCTAAGCCACGACTTTTAGAGGCCGCCTTTAAAGGAGACACTGGGAGTTTAGACAAGCTACCGAGGATGGCTAAAAATACAAGGAGGTTCGGGCTAGGGCATAAGCCAAATAGGTGTGACATCACCAGAGTGCAGAACCGTGAAAAAGCAAAACGTCTAACAAGATTTGAGAATAGGGAGCATGATTACTCAAGGAGGACTGTTTTACCTCTCAGCCACTCTTTCAGAAGTACCGACATAGTCCATCGGGAGTACGATGAGAGTTTTGCAGTGGCAGTAGTGATAGAAGAAAGAAAGCAAGTCGGACCTTTTATCTTCCCGTGCCCAGACAGTTTCGAGCTGAGCAATTGGAGCGTGCTAGAGCTGCTTAATACTGAGATTGAATGTGATAATGATTTGAAATACAAACTCGATACACCTATATACAACGTCGAGTC
GGGTGAGGAAATGAACGATGAGCCCTCTGCTGAGCTAAGTGCACGGCCTCTTGGGACATCAACTTCGAAAACATTATCCGTCATTGAAAGAAGAGTTGCACACTGTCAACATTTGAGCTCAATTCGAATTCCTGGAGACAATGTTCCTGCCCTATATGAAGGACGCAGTACACAATTTCCTGATTGGTTAAAAAAAAGAAAAATGAAACCAGACCAAGCAGGTCAGTTAGAGGCACACGTACAGGGTCGTGAGCATCCACTAGCAGACCCGCCTATTCTGGTTGTTCTTCCAGTTAATAAACCAATATGTCCCATGGCCACCACATTCTCTATGGCGATCGGGGTACATACACGCGAACAAGTCGACCTAAAGGTTAACAAGTGGAGTGAGGTCGAAAAAATTCAGAAGAGGGCAATTATAACTCAGCTCAAGGGTCATTTCGATTTCAACGAATCAGATCGAGTTATGAAGAAATTCATCGAGCATGAGATGAGAACGACACATAAGGCATATAGGGAAAAATTGCGCCAACACTACCTGAGTTATCCCACACCTGAGATTGCGCACAACAATCCTCCTGAGCAATTGTTGGCAAAACCAGACGATTGGAACATGTTGTGCAATAGATGGGAGACAGATGAGTGGAAGAAAAAATCTGAGACCAACAAATGCAACAGGGCGAAGATGCCATTCAACCATCGTGCGGGGCCGAAGGCATTTGCTATTATTGCAAAGGAGAAGAAAGAAGAAGAGGGAGGTGACAATTTTTCTGAAATCGATTTGTTCAGGGAGACGAGATATTCTGACACTAAAGGTTGGGTCGAAGGAGCGGAGACAGCTTACCTAGACATGATGCGTGTTAGAGAAGCATCTATTGCTGGTTGGGGCCCAAAACTAAAGTCTCGTAGAGGTTCTTATTCAAGTGTAGCATCAACACAGCGGGAAGAAGAGCTGTCACAAGAAATATATACTCTAAGAGAAGAATTTCAATACCGTTTCACACAAAAAGACAAGGAGTTAGATGAGCGTCTCGCAGCCAAAGATTATGAGATTACGAGTCTACGCGGTGAAATGTCAGATTTGAGATCAATGGTGCTACAACTTATGAGTCTGTCTAGTGGAGCCAGCTCGTTACCTTCGGAGACCAAAACAGATTTGGTCGCCGAGGACACCTTCGGCAACCAATATCAGGTCGTCGAAGCTACCTTCGACAATGAAAAACGAAAGTCATCGCCGAAACCCTCTCGCGACTGGGCAACGGCAACGAGTCAGGCGACGACTTATTTTCGTCGCAAAAGGTTTTCGGGGACGAATTTGGACCTTTTCGTCGCCGAAACCCTAAATTCTTGTAGTGAGGAGAATGGGAAACAAGAGACACTAAATGCATTGGCCATATTACCAGTGATGTTCAATTTAGAACTCAATGAGGATGTCCGTCCGATTAAAGTTGGGAGGAGAGATGTCCCAGCTTCTTGTATGAGCATTGAGGAAAAACCCGACGGTAACCCCTGGGGCAAAGTCGTCGTCCCATTTGTTCTGGCTACTCTCGATCCAGTTAAAAGGATTTTTGAAGGAAAACGAAATGCGGTGATCGGGCTATCTGAACAATCCCAAGTTTTTATAAAATTCAGTCAAAATTGTCAGAAAAGCTCGTCTACCATGGTGCAAGATGATTCAGAGGATGTTCGAGTTCTATCCAAGTCATTTAAGGTGATCGATTCGTTGGTGATGTTCATGTATAAGAAGCTCGAACTGAGACCAGACTTGTGTCGCCACAAGTTCACCATCGATGTCGTAGTACTTGCGGTAAGTTGCCTTGTTCAACGAACAGACGGTGTCTGTGCCTGCATGTTATCTCCTAGCATCATTACTTTGAGGGTTGCATCGAATTATGATTGGGAGGGAAGAGCGAAGACTGTGCTCAGCTACATAGATGGTACTCACTCAGACTACGAGACGCGGTGGATGGATGTCGATACTGTATATCTAACCGATAATA
TCGGTGGAACGCATTGGATAATGTTGTGCATCGACTTTGACGAGGGTGAACTTATCGTGTCAGACTTCTTCATGGCCATGACACCACTACCAAATTTGGAGGAGGAGTTGAAGTTGATGATAACTATCATCCCGGCCCTTATTTGTAGGGTCGGTGTTGCGATAAAGAAGCAGGACATACCATCCACATCATGGCGCATCCATAGAGTTTCATAA

Protein sequence

MSSSSRWKKGKATTDSATSSNPVHKPIHIPLSRELSKSEPTSFFRKPEQMMSPPTVLNLDGLLAKADPVRQSAPSNEKFEVLEERLRAVEGTDLDSSHVGSWKNLIDSFLKQYKHNIEMAPDRLDLQRMEKKSTKSFKEYAQRWRDTATQVQPPLMDKELSAMFINTLKHPFYDRMIGSASTNFSDIMTIGERIEYGNNQLAPVPVDPIQPPYPRWYDANAHCDYHAGAIGHSIENCTTFLTCPFHAGAKGHSLEQCNHFQKRVQELLDSKVLTVTKSHLKKRINVVEDILVAEGSSDSLKPKPLTIFYSEKPDAVGNQSLSLVLAPFEYKSSRVVPWRYECKVTVGQENKEKASEKKKGKVEEDKKGNAKLNEYIYDELVEAIVVKDASRKQPMSKEETQEFLKLVKQSEYKVIEQLGRTPAKISILSLLLSSEAHRNTLLEFFVGVTLDTFGKGSSSTLHQKIKFAVDQKLVIISGQEDILVSMLASMPYFEATEEAFESSFQSFKIANATTLHKKSGRPKPRLLEAAFKGDTGSLDKLPRMAKNTRRFGLGHKPNRCDITRVQNREKAKRLTRFENREHDYSRRTVLPLSHSFRSTDIVHREYDESFAVAVVIEERKQVGPFIFPCPDSFELSNWSVLELLNTEIECDNDLKYKLDTPIYNVESGEEMNDEPSAELSARPLGTSTSKTLSVIERRVAHCQHLSSIRIPGDNVPALYEGRSTQFPDWLKKRKMKPDQAGQLEAHVQGREHPLADPPILVVLPVNKPICPMATTFSMAIGVHTREQVDLKVNKWSEVEKIQKRAIITQLKGHFDFNESDRVMKKFIEHEMRTTHKAYREKLRQHYLSYPTPEIAHNNPPEQLLAKPDDWNMLCNRWETDEWKKKSETNKCNRAKMPFNHRAGPKAFAIIAKEKKEEEGGDNFSEIDLFRETRYSDTKGWVEGAETAYLDMMRVREASIAGWGPKLKSRRGSYSSVASTQREEELSQEIYTLREEFQYRFTQKDKELDERLAAKDYEITSLRGEMSDLRSMVLQLMSLSSGASSLPSETKTDLVAEDTFGNQYQVVEATFDNEKRKSSPKPSRDWATATSQATTYFRRKRFSGTNLDLFVAETLNSCSEENGKQETLNALAILPVMFNLELNEDVRPIKVGRRDVPASCMSIEEKPDGNPWGKVVVPFVLATLDPVKRIFEGKRNAVIGLSEQSQVFIKFSQNCQKSSSTMVQDDSEDVRVLSKSFKVIDSLVMFMYKKLELRPDLCRHKFTIDVVVLAVSCLVQRTDGVCACMLSPSIITLRVASNYDWEGRAKTVLSYIDGTHSDYETRWMDVDTVYLTDNIGGTHWIMLCIDFDEGELIVSDFFMAMTPLPNLEEELKLMITIIPALICRVGVAIKKQDIPSTSWRIHRVS
Homology
BLAST of Moc03g02390 vs. NCBI nr
Match: XP_022143495.1 (LOW QUALITY PROTEIN: uncharacterized protein LOC111013372 [Momordica charantia])

HSP 1 Score: 791.6 bits (2043), Expect = 1.1e-224
Identity = 496/987 (50.25%), Positives = 538/987 (54.51%), Query Frame = 0
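
The percentages in the HSP header follow directly from the match counts and the alignment length. As a minimal sketch (the helper name `hsp_percentages` is hypothetical, and the regular expression only assumes the `label = matches/length` layout seen in the header line):

```python
import re


def hsp_percentages(header):
    """Recompute percentages from 'label = matches/length' pairs in an HSP header.

    Returns a dict mapping each label to a percentage rounded to two decimals.
    """
    result = {}
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", header):
        result[label] = round(100.0 * int(matches) / int(length), 2)
    return result


# Counts copied from the HSP header of this record.
stats = hsp_percentages("Identity = 496/987, Positives = 538/987")
print(stats)
```

Both values are taken over the full alignment length of 987 columns, gaps included, which is why an HSP with long gapped stretches reports only about 50% identity even where the aligned residues agree closely.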

Query: 61   GLLAKADPVRQSAPSNEKFEVLEERLRAVEGTD--------------------------- 120
            GL AK DPV Q+APSNEKFEVL+ERLRA+EGTD                           
Sbjct: 63   GLPAKTDPVGQNAPSNEKFEVLKERLRAIEGTDVFGNIDASQLCLVSRLVIPPKFKVPEF 122

Query: 121  ------------------------------------------------LDSSHVGSWKNL 180
                                                            LDSSHVGSWKNL
Sbjct: 123  EKYDGSSCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLSSPASRWYMQLDSSHVGSWKNL 182

Query: 181  IDSFLKQYKHNIEMAPDRLDLQRMEKKSTKSFKEYAQRWRDTATQVQPPLMDKELSAMFI 240
             DSFLKQYKHNI+MAPDRLDLQRMEKKST+SFKEYAQRWRDTA QVQPPL DKELS MFI
Sbjct: 183  ADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLTDKELSXMFI 242

Query: 241  NTLKHPFYDRMIGSASTNFSDIMTIGERIEYG-------------------------NNQ 300
            NTLKHPFYDRM+GSASTNFSDIM IGERIEYG                           +
Sbjct: 243  NTLKHPFYDRMVGSASTNFSDIMAIGERIEYGVRHGRITSTADEPLAAKKTSHSKKKEGE 302

Query: 301  LAPVPVDPIQPPYPRWYDANAHCDYHAGAIGHSIENCTTF-------------------- 360
            LA VPVDPIQPPYPRW DANA CDYH GAIGHSIENCT                      
Sbjct: 303  LAHVPVDPIQPPYPRWCDANARCDYHTGAIGHSIENCTALKYRVQALIKAGWLNFKKENG 362

Query: 361  ------------------------------------------------------------ 420
                                                                        
Sbjct: 363  PBVSNNPLPNHXNVQINAIECQEIESKSKVADITTPMEELFEILLGSGYVSVEYLCPNLK 422

Query: 421  -------LTCPFHAGAKGHSLEQCNHFQKRVQELLDSKVLTVTKSHLKKRINVVEDILV- 480
                   LTCPFHAGAKGH+LEQCN F+  VQELLDSK+LTV  SH KK INVVED+ V 
Sbjct: 423  YKGYDESLTCPFHAGAKGHALEQCNSFRMIVQELLDSKILTVANSHQKKGINVVEDVSVA 482

Query: 481  ----AEGSSDSLKPKPLTIFYSEKPDAVGNQSLSLVL---APFEYKSSRVVPWRYECKVT 540
                AEGSSD+LKPK LTIFYSEKPDA       + +   APFEYKSS+ VPW+YECKVT
Sbjct: 483  EGSIAEGSSDALKPKRLTIFYSEKPDAPNCSRKPITITVPAPFEYKSSKAVPWKYECKVT 542

Query: 541  VGQE----------------------------------------NKEKASEKKKGKVEED 600
            VGQ+                                        NKEKASEKKK KVEED
Sbjct: 543  VGQDVSSPPLPVDNITGVGGLTXTGRCYTPDSLLKRVSETTSEKNKEKASEKKKEKVEED 602

Query: 601  KKGNAKLNEYIYDELVEAIVVKDASRKQPMSKEETQEFLKLVKQSEYKVIEQLGRTPAKI 660
            KKG AKL+E ++DELVEAIVVKD S KQ + +EE QEFLKLVKQSEYKV EQLGRTPAKI
Sbjct: 603  KKGKAKLHEDVHDELVEAIVVKDVSPKQHVFEEEIQEFLKLVKQSEYKVTEQLGRTPAKI 662

Query: 661  SILSLLLSSEAHRNTLLEF----FVG--VTLDTFG------------------------- 680
            SILSLLLSSEAHRNTLLE     FV   +T+D                            
Sbjct: 663  SILSLLLSSEAHRNTLLEXLKQAFVSQDITVDNLSNVVGNITASSSITFTDEEIPPEGTG 722

BLAST of Moc03g02390 vs. NCBI nr
Match: XP_022143495.1 (LOW QUALITY PROTEIN: uncharacterized protein LOC111013372 [Momordica charantia])

HSP 1 Score: 85.1 bits (209), Expect = 5.2e-12
Identity = 39/47 (82.98%), Positives = 42/47 (89.36%), Query Frame = 0

Query: 1125 ETLNALAILPVMFNLELNEDVRPIKVGRRDVPASCMSIEEKPDGNPW 1172
            +  +ALA L VMFNLELNEDV PIKVGRRDVPASCMSIEE+PDGNPW
Sbjct: 1694 QVXDALATLAVMFNLELNEDVCPIKVGRRDVPASCMSIEEEPDGNPW 1740


HSP 2 Score: 790.4 bits (2040), Expect = 2.5e-224
Identity = 524/1152 (45.49%), Positives = 566/1152 (49.13%), Query Frame = 0

Query: 13   TTDSATSSNPVHKPIHIPLSRELSKSEPTSFFRKPEQMMSPPTVLNLDGLLAKADPVRQS 72
            TT +     PV + +H P  +   +      FR+PEQMMSPPTVLNL  LLAK DPV Q+
Sbjct: 5    TTYNPLYDVPVGQYLH-PFVKGAQQIPTNIIFREPEQMMSPPTVLNLGDLLAKTDPVGQN 64

Query: 73   APSNEKFEVLEERLRAVEGTD--------------------------------------- 132
            APSNEKFEVL+ERLRA+E TD                                       
Sbjct: 65   APSNEKFEVLKERLRAIERTDVFGNIDASQLCSVSGLVIPPKLKVPEFEKYNGSSCPKNH 124

Query: 133  ------------------------------------LDSSHVGSWKNLIDSFLKQYKHNI 192
                                                LDSSHVGSWKNL DSFLKQYKHNI
Sbjct: 125  LXMYCRKMAAYVQNDKLLIHCFQDSLSGPASRWYMQLDSSHVGSWKNLADSFLKQYKHNI 184

Query: 193  EMAPDRLDLQRMEKKSTKSFKEYAQRWRDTATQVQPPLMDKELSAMFINTLKHPFYDRMI 252
            +MAPDRLDLQRMEKKSTKSFKEYAQRWRDTA QVQPPL+DKELSAMFINTLKHPFYDRMI
Sbjct: 185  DMAPDRLDLQRMEKKSTKSFKEYAQRWRDTAAQVQPPLIDKELSAMFINTLKHPFYDRMI 244

Query: 253  GSASTNFSDIMTIGERIEYG---------------------------------------- 312
            GSASTNFSDIMTIGERIEYG                                        
Sbjct: 245  GSASTNFSDIMTIGERIEYGVRHGRITSTTDEPLAAKKASHSKKKEGEVQMVGADRHSWK 304

Query: 313  ------------------------------------------------------------ 372
                                                                        
Sbjct: 305  QQPYRRTPQYSPYYYPTPYGYNQPFVNNATSHYYPYASQNFRPPASQNFQLTPTSQNFQP 364

Query: 373  -----------------------------------------NNQLAPVPVDPIQPPYPRW 432
                                                     NNQLAPVPVDPIQPPYPRW
Sbjct: 365  RGQQHNTFYTQGQQNNRGARKQTQFDPIPMTYTELLPQLFQNNQLAPVPVDPIQPPYPRW 424

Query: 433  YDANAHCDYHAGAIGHSIENCTTF------------------------------------ 492
            YDANA CDYHAGAI HS ENCT                                      
Sbjct: 425  YDANARCDYHAGAIXHSTENCTXLKYRVQALIKAGWXNFKKENGXDVSKXXLXNHQNVQI 484

Query: 493  -------------------------------------------------LTCPFHAGAKG 552
                                                             LTC FH GAKG
Sbjct: 485  NAIECQGIESKSKVABITTPMXELFEILLGSGYISVEYLCPKYKGYDESLTCXFHXGAKG 544

Query: 553  HSLEQCNHFQKRVQELLDSKVLTVTKSHLKKRINVVEDILVAEGSSDSLKPKPLTIFYSE 612
            HSLEQCN F+ +VQELLDSK+LT   SH KK  NVVEDILVAEGSSDSLKPKPLTIFY E
Sbjct: 545  HSLEQCNXFRMKVQELLDSKILTXANSHXKKXTNVVEDILVAEGSSDSLKPKPLTIFYRE 604

Query: 613  KPDAVG---NQSLSLVLAPFEYKSSRVVPWRYECKVTVGQE------------------- 672
            KPDA           V  PFEYKSS+ VPW+YECKVTVGQ+                   
Sbjct: 605  KPDAPSCSRKPXXITVPXPFEYKSSKAVPWKYECKVTVGQDVSSPSLPVDNITGVGGLTR 664

Query: 673  ---------------------NKEKASEKKKGKVEEDKKGNAKLNEYIYDELVEAIVVKD 680
                                 NKEKASEKKK KVEEDKKG AKL+E   DELVEAIVVKD
Sbjct: 665  TGRCYTPDSLLKRVNETTSEKNKEKASEKKKEKVEEDKKGKAKLHEDARDELVEAIVVKD 724

BLAST of Moc03g02390 vs. NCBI nr
Match: XP_022147189.1 (LOW QUALITY PROTEIN: uncharacterized protein LOC111016200 [Momordica charantia])

HSP 1 Score: 85.9 bits (211), Expect = 3.1e-12
Identity = 39/47 (82.98%), Positives = 42/47 (89.36%), Query Frame = 0

Query: 1125 ETLNALAILPVMFNLELNEDVRPIKVGRRDVPASCMSIEEKPDGNPW 1172
            +  +ALA L VMFNLELNEDVRPIKVGRRDVPASCMSIEE+PDG PW
Sbjct: 1804 QVADALATLAVMFNLELNEDVRPIKVGRRDVPASCMSIEEEPDGKPW 1850


HSP 2 Score: 578.6 bits (1490), Expect = 1.5e-160
Identity = 365/694 (52.59%), Positives = 405/694 (58.36%), Query Frame = 0

Query: 119 MAPDRLDLQRMEKKSTKSFKEYAQRWRDTATQVQPPLMDKELSAMFINTLKHPFYDRMIG 178
           MAPDRLDLQRMEKKST+SFKEYAQR RDTATQVQPPL DKELSAMFINTLKHPFYDRMIG
Sbjct: 1   MAPDRLDLQRMEKKSTESFKEYAQRCRDTATQVQPPLTDKELSAMFINTLKHPFYDRMIG 60

Query: 179 SASTNFSDIMTIGERIEYG----------------------NNQLAPVPVDPIQPPYPRW 238
           SASTNFS+IMTIGE IEYG                        +LAPVPVD IQPPYPRW
Sbjct: 61  SASTNFSNIMTIGETIEYGVKHGQITSTTEELLAAKRQVIPRRKLAPVPVDLIQPPYPRW 120

Query: 239 YDANAHCDYHAGAIGHSIENCTTF------------------------------------ 298
           YD NA CDYHAGAIGHS +NCT                                      
Sbjct: 121 YDVNARCDYHAGAIGHSTKNCTALKYRVQALIRVGIESKSKVGDITIPLEELFENLLGSG 180

Query: 299 -------------------LTCPFHAGAKGHSLEQCNHFQKRVQELLDSKVLTVTKSHLK 358
                               TCPFHAGAKGHSLEQCN FQKRVQELLDSKVLTVTKS LK
Sbjct: 181 YVSVEYLCPNLKYKEYDGSQTCPFHAGAKGHSLEQCNRFQKRVQELLDSKVLTVTKSQLK 240

Query: 359 KRINVVEDILVAEGSSDSLKPKPLTIFYSEKPDAVGNQSLSLVL---APFEYKSSRVVPW 418
           KR NVVEDILVAEGSSDSLKPKPLTIFY EK DA       + +   A FE +       
Sbjct: 241 KRTNVVEDILVAEGSSDSLKPKPLTIFYCEKLDAPSCSRKPITITVPAAFEVRGLTRTER 300

Query: 419 RY-------ECKVTVGQENKEKASEKKKGKVEEDKKGNAKLNEYIYDELVEAIVVKDASR 478
            Y              ++NKEKASEKK+ K EEDKKG  KLN+ I DELVEAIVVKDAS 
Sbjct: 301 CYTPDSLLKRVNEPASEKNKEKASEKKE-KAEEDKKGKTKLNKDICDELVEAIVVKDASP 360

Query: 479 KQPMSKEETQEFLKLVKQSEYK-------VIEQLGRTPAKISILSLLLSSEAH------- 538
           KQ +S+EETQ+FLKLVKQSEYK        ++ L      I+  S +  ++         
Sbjct: 361 KQSVSEEETQDFLKLVKQSEYKAFVSQDITVDNLSNVVGNITASSSITFTDEEIPPESTR 420

Query: 539 -------------------------------RNTL------------------------- 598
                                          R+TL                         
Sbjct: 421 HTKPLHISVKCKNFLIAKVLVDNGSSLNIMPRSTLEKLPVDMSHMRPSTVIVRAFDGACS 480

Query: 599 -----LEFFVGVTLDTFG------------------------KGSSSTLHQKIKFAVDQK 627
                +E  + +   TF                             STLHQKIKFAVDQK
Sbjct: 481 TVVRDIEILIQIGPCTFDITFQVMDITSAYSFLLGRPWIHSTGAVPSTLHQKIKFAVDQK 540

BLAST of Moc03g02390 vs. NCBI nr
Match: XP_022158986.1 (LOW QUALITY PROTEIN: uncharacterized protein LOC111025431 [Momordica charantia])

HSP 1 Score: 506.5 bits (1303), Expect = 7.2e-139
Identity = 492/1547 (31.80%), Positives = 619/1547 (40.01%), Query Frame = 0

Query: 50   MMSPPTVLNLDGLLAKADPVRQSAPSNEKFEVLEERLRAVEGT----------------- 109
            MMSPPTVLNL GL AK D V Q+APSNEKFEVLEERLRA+EGT                 
Sbjct: 1    MMSPPTVLNLGGLPAKTDLVGQNAPSNEKFEVLEERLRAIEGTYVFGNIDASQLCLVSGL 60

Query: 110  ----------------------------------------------------------DL 169
                                                                       L
Sbjct: 61   VIPPKFKVPEFEKYDGSSCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLSGPASRWYMQL 120

Query: 170  DSSHVGSWKNLIDSFLKQYKHNIEMAPDRLDLQRMEKKSTKSFKEYAQRWRDTATQVQPP 229
            DSS+VGSWKNL DSFLKQYKHNI+MAPDRLDLQRMEKKST+SFKEYAQRWRDTA QVQPP
Sbjct: 121  DSSNVGSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPP 180

Query: 230  LMDKELSAMFINTLKHPFYDRMIGSASTNFSDIMTIGERIEYG----------------- 289
            L DKELSAMFINTLKHPFYDRMIG+ASTNFSDIMTIGERIEYG                 
Sbjct: 181  LTDKELSAMFINTLKHPFYDRMIGNASTNFSDIMTIGERIEYGVRHGRITSTVDEPLAAK 240

Query: 290  ------------------------------------------------------------ 349
                                                                        
Sbjct: 241  KASHSKKKEGEVQMVGADRHSWKQQPYSRTPRYTPYYYPTPYGYNQPFVNNATSHYSPYT 300

Query: 350  ------------------------------------------------------------ 409
                                                                        
Sbjct: 301  FQNFRPPASQNFQPTPASQNFQPRGQQHNTLYTQEQQTNRGARKQTQFDPIPMTYTELLP 360

Query: 410  ----NNQLAPVPVDPIQPPYPRWYDANAHCDYHAGAIGHSIENCTTF------------- 469
                NNQLAPVPVDPIQPPYPRWYD NA CDYHAGAIGHS ENCT               
Sbjct: 361  QLFQNNQLAPVPVDPIQPPYPRWYDTNARCDYHAGAIGHSTENCTALKYRVQALIKAGWL 420

Query: 470  ------------------------------------------------------------ 529
                                                                        
Sbjct: 421  NFKKENGPDVSKNPLPNHQNVQINAIECQEIESKSKVADIRTPMVELFEILLGSGYVSVE 480

Query: 530  --------------LTCPFHAGAKGHSLEQCNHFQKRVQELLDSKVLTVTKSHLKKRINV 589
                          LTCPFHAGAKGHSLEQCN F+ +VQELLDSK+LTV  SH KK IN+
Sbjct: 481  YLCPNLKYKGYDESLTCPFHAGAKGHSLEQCNSFRMKVQELLDSKILTVANSHQKKGINI 540

Query: 590  VEDILVAEGSSDSLKPKPLTIFYSEKPDAVGNQSLSLVL---APFEYKSSRVVPWRYECK 649
            VED+ VAEGSSD+LKPK LTIFYSEKP+A       + +   APFEYKSS+ VPW+Y+CK
Sbjct: 541  VEDVSVAEGSSDALKPKCLTIFYSEKPNAPNCSRKPITITVPAPFEYKSSKAVPWKYQCK 600

Query: 650  VTVGQE----------------------------------------NKEKASEKKKGKVE 709
            VTVGQ+                                        NKEKASEKKK KVE
Sbjct: 601  VTVGQDVSSPPLPIDNITGVGGLTRTGRCYTPDSLLKCVNETTSEKNKEKASEKKKEKVE 660

Query: 710  EDKKGNAKLNEYIYDELVEAIVVKDASRKQPMSKEETQEFLKLVKQSEYKVIEQLGRTPA 769
            EDKKG AKL+E ++DELVEAIVVKD S KQPMS+EETQE LKLVKQSEYKVIEQLGRTPA
Sbjct: 661  EDKKGKAKLHEDVHDELVEAIVVKDVSPKQPMSEEETQEILKLVKQSEYKVIEQLGRTPA 720

Query: 770  KISILSLLLSSEAHRNTLLEFFVGVTLDTFGKGSSSTLHQKIKFAVDQKLVIISGQEDIL 829
            KISILSLLLSSEAHRN LLE                        A+ Q  V     +DI 
Sbjct: 721  KISILSLLLSSEAHRNALLE------------------------ALKQAFV----SQDIT 780

Query: 830  VSMLASMPYFEATEEAFESSFQSFKIANATTLHKK----SGRPKPRLLEAAFKGDTGSLD 889
            V  L+++        A   +F   +I    T H K    S + K  L+      +  SL+
Sbjct: 781  VDNLSNV--VGNISXASSITFTDEEIPPEGTGHTKALHISIKCKNFLIAKVLVDNGSSLN 840

Query: 890  KLPRMAKNTRRFGLGHKPNRCDITRVQNREKAKRLTRFENREHDYSRRTVLPLSHSFRST 949
             +PR         + H      I R  +  ++  +   E           +P+     + 
Sbjct: 841  IMPRSTLEKLPVDMSHMRPSTVIVRAFDGARSAVVGDIE-----------IPIQIGPCTF 900

Query: 950  DIVHREYDESFAVAVVIE----ERKQVGPFIFPCPDSFELSNWSVLELLNTEIECDNDLK 1009
            DI  +  D + A + ++           P        F +        LN     DN   
Sbjct: 901  DITFQVMDITSAYSFLLGRPWIHSAGAVPSTLHQKIKFAVDQNVDYRDLNRASPKDNFSL 960

Query: 1010 YKLDTPIYNVE--SGEEMNDEPSAELSARPLGTSTSKTLSVIERRVAHCQHLSSIRIPGD 1069
              +D  + N    S     D  S     +       KT ++I      C  +    +   
Sbjct: 961  PHIDVLVDNTTGFSTFSFMDGFSGYNQIKMAPEDREKT-TLITLWGTFCYKVMPFGL--K 1020

Query: 1070 NVPALYEGRSTQFPDWLKKRKMKPDQAGQLEAHVQGREH-----PLADPPILVVLPVNKP 1129
            N  A Y+         L  ++++      +    QG EH      L D      L +N  
Sbjct: 1021 NAGATYQRAMVTLFHDLMHKEIEVYVDDMIAKSKQGEEHTTILRKLFDRLRKFKLKLNPN 1080

Query: 1130 ICPMATTFSMAIGVHTREQ---VDL-KVNKWSEVEKIQKRAIITQLKGHFDFNESDRVMK 1172
             C    T    +G    ++   VDL KV    E+   Q +  + +  G  ++      + 
Sbjct: 1081 KCIFGATTGKLLGFVVSQEDIKVDLDKVKAILEMPPPQTQKEVREFLGRLNY------IA 1140

BLAST of Moc03g02390 vs. NCBI nr
Match: XP_022150030.1 (LOW QUALITY PROTEIN: uncharacterized protein LOC111018303 [Momordica charantia])

HSP 1 Score: 503.1 bits (1294), Expect = 8.0e-138
Identity = 319/593 (53.79%), Positives = 349/593 (58.85%), Query Frame = 0

Query: 263 RVQELLDSKVLTVTKSHLKKRINVVEDILVAEGSSDSLKPKPLTIFYSEKPDAVGNQSLS 322
           +VQELLDSK+LTV  SH KKR NVVEDILVAEGSSDS+KPK LTIFY EKPDA       
Sbjct: 2   KVQELLDSKILTVANSHQKKRTNVVEDILVAEGSSDSIKPKRLTIFYREKPDAPSCSRKP 61

Query: 323 LVL---APFEYKSSRVVPWRYECKVTVGQE------------------------------ 382
           + +   APFEYKSS+ VPW+YECKVTVGQ+                              
Sbjct: 62  ITITVPAPFEYKSSKAVPWKYECKVTVGQDVSSPSLPVDNITGVGGLTRTGRCYTPDSLL 121

Query: 383 ----------NKEKASEKKKGKVEEDKKGNAKLNEYIYDELVEAIVVKDASRKQPMSKEE 442
                     NKEKASEKKK KVEEDKKG AKL+E ++DELVE               EE
Sbjct: 122 KRVNETTSEKNKEKASEKKKEKVEEDKKGKAKLHEDVHDELVE---------------EE 181

Query: 443 TQEFLKLVKQSEYKVIEQLGRTPAKISILSLLLSSEAHRNTLLE----FFVG--VTLD-- 502
           TQEFLKLVKQ+EYKVIEQLGRTPAKISILSLLLSSEAHRN LLE     FV   +T+D  
Sbjct: 182 TQEFLKLVKQNEYKVIEQLGRTPAKISILSLLLSSEAHRNALLEALKQAFVSQDITVDNL 241

Query: 503 -------------TF--------------------------------GKGSS-------- 562
                        TF                                G GSS        
Sbjct: 242 SNVVGNIMASSCITFTDEEIPPEGTGHTKALHISVKCKNFLIAKVLVGNGSSLNIMPRST 301

Query: 563 ------------------------------------------------------------ 622
                                                                       
Sbjct: 302 LEKLPVDMSHMRPSTVIVRAFDGARNAVVGDIEIPIQIGLCTFDITFQVMDITSAYSFLL 361

Query: 623 ------------STLHQKIKFAVDQKLVIISGQEDILVSMLASMPYFEATEEAFESSFQS 680
                       STLHQKIKFAVDQKLVIISGQEDILVS LASMPY EA EEAFESSFQS
Sbjct: 362 GRPWIHSAGAVPSTLHQKIKFAVDQKLVIISGQEDILVSRLASMPYVEAAEEAFESSFQS 421

BLAST of Moc03g02390 vs. ExPASy TrEMBL
Match: A0A6J1CNY7 (Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111013372 PE=4 SV=1)

HSP 1 Score: 791.6 bits (2043), Expect = 5.5e-225
Identity = 496/987 (50.25%), Positives = 538/987 (54.51%), Query Frame = 0

Query: 61   GLLAKADPVRQSAPSNEKFEVLEERLRAVEGTD--------------------------- 120
            GL AK DPV Q+APSNEKFEVL+ERLRA+EGTD                           
Sbjct: 63   GLPAKTDPVGQNAPSNEKFEVLKERLRAIEGTDVFGNIDASQLCLVSRLVIPPKFKVPEF 122

Query: 121  ------------------------------------------------LDSSHVGSWKNL 180
                                                            LDSSHVGSWKNL
Sbjct: 123  EKYDGSSCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLSSPASRWYMQLDSSHVGSWKNL 182

Query: 181  IDSFLKQYKHNIEMAPDRLDLQRMEKKSTKSFKEYAQRWRDTATQVQPPLMDKELSAMFI 240
             DSFLKQYKHNI+MAPDRLDLQRMEKKST+SFKEYAQRWRDTA QVQPPL DKELS MFI
Sbjct: 183  ADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLTDKELSXMFI 242

Query: 241  NTLKHPFYDRMIGSASTNFSDIMTIGERIEYG-------------------------NNQ 300
            NTLKHPFYDRM+GSASTNFSDIM IGERIEYG                           +
Sbjct: 243  NTLKHPFYDRMVGSASTNFSDIMAIGERIEYGVRHGRITSTADEPLAAKKTSHSKKKEGE 302

Query: 301  LAPVPVDPIQPPYPRWYDANAHCDYHAGAIGHSIENCTTF-------------------- 360
            LA VPVDPIQPPYPRW DANA CDYH GAIGHSIENCT                      
Sbjct: 303  LAHVPVDPIQPPYPRWCDANARCDYHTGAIGHSIENCTALKYRVQALIKAGWLNFKKENG 362

Query: 361  ------------------------------------------------------------ 420
                                                                        
Sbjct: 363  PDVSNNPLPNHXNVQINAIECQEIESKSKVADITTPMEELFEILLGSGYVSVEYLCPNLK 422

Query: 421  -------LTCPFHAGAKGHSLEQCNHFQKRVQELLDSKVLTVTKSHLKKRINVVEDILV- 480
                   LTCPFHAGAKGH+LEQCN F+  VQELLDSK+LTV  SH KK INVVED+ V 
Sbjct: 423  YKGYDESLTCPFHAGAKGHALEQCNSFRMIVQELLDSKILTVANSHQKKGINVVEDVSVA 482

Query: 481  ----AEGSSDSLKPKPLTIFYSEKPDAVGNQSLSLVL---APFEYKSSRVVPWRYECKVT 540
                AEGSSD+LKPK LTIFYSEKPDA       + +   APFEYKSS+ VPW+YECKVT
Sbjct: 483  EGSIAEGSSDALKPKRLTIFYSEKPDAPNCSRKPITITVPAPFEYKSSKAVPWKYECKVT 542

Query: 541  VGQE----------------------------------------NKEKASEKKKGKVEED 600
            VGQ+                                        NKEKASEKKK KVEED
Sbjct: 543  VGQDVSSPPLPVDNITGVGGLTXTGRCYTPDSLLKRVSETTSEKNKEKASEKKKEKVEED 602

Query: 601  KKGNAKLNEYIYDELVEAIVVKDASRKQPMSKEETQEFLKLVKQSEYKVIEQLGRTPAKI 660
            KKG AKL+E ++DELVEAIVVKD S KQ + +EE QEFLKLVKQSEYKV EQLGRTPAKI
Sbjct: 603  KKGKAKLHEDVHDELVEAIVVKDVSPKQHVFEEEIQEFLKLVKQSEYKVTEQLGRTPAKI 662

Query: 661  SILSLLLSSEAHRNTLLEF----FVG--VTLDTFG------------------------- 680
            SILSLLLSSEAHRNTLLE     FV   +T+D                            
Sbjct: 663  SILSLLLSSEAHRNTLLEXLKQAFVSQDITVDNLSNVVGNITASSSITFTDEEIPPEGTG 722

BLAST of Moc03g02390 vs. ExPASy TrEMBL
Match: A0A6J1CNY7 (Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111013372 PE=4 SV=1)

HSP 1 Score: 85.1 bits (209), Expect = 2.5e-12
Identity = 39/47 (82.98%), Positives = 42/47 (89.36%), Query Frame = 0

Query: 1125 ETLNALAILPVMFNLELNEDVRPIKVGRRDVPASCMSIEEKPDGNPW 1172
            +  +ALA L VMFNLELNEDV PIKVGRRDVPASCMSIEE+PDGNPW
Sbjct: 1694 QVXDALATLAVMFNLELNEDVCPIKVGRRDVPASCMSIEEEPDGNPW 1740


HSP 2 Score: 790.4 bits (2040), Expect = 1.2e-224
Identity = 524/1152 (45.49%), Positives = 566/1152 (49.13%), Query Frame = 0

Query: 13   TTDSATSSNPVHKPIHIPLSRELSKSEPTSFFRKPEQMMSPPTVLNLDGLLAKADPVRQS 72
            TT +     PV + +H P  +   +      FR+PEQMMSPPTVLNL  LLAK DPV Q+
Sbjct: 5    TTYNPLYDVPVGQYLH-PFVKGAQQIPTNIIFREPEQMMSPPTVLNLGDLLAKTDPVGQN 64

Query: 73   APSNEKFEVLEERLRAVEGTD--------------------------------------- 132
            APSNEKFEVL+ERLRA+E TD                                       
Sbjct: 65   APSNEKFEVLKERLRAIERTDVFGNIDASQLCSVSGLVIPPKLKVPEFEKYNGSSCPKNH 124

Query: 133  ------------------------------------LDSSHVGSWKNLIDSFLKQYKHNI 192
                                                LDSSHVGSWKNL DSFLKQYKHNI
Sbjct: 125  LXMYCRKMAAYVQNDKLLIHCFQDSLSGPASRWYMQLDSSHVGSWKNLADSFLKQYKHNI 184

Query: 193  EMAPDRLDLQRMEKKSTKSFKEYAQRWRDTATQVQPPLMDKELSAMFINTLKHPFYDRMI 252
            +MAPDRLDLQRMEKKSTKSFKEYAQRWRDTA QVQPPL+DKELSAMFINTLKHPFYDRMI
Sbjct: 185  DMAPDRLDLQRMEKKSTKSFKEYAQRWRDTAAQVQPPLIDKELSAMFINTLKHPFYDRMI 244

Query: 253  GSASTNFSDIMTIGERIEYG---------------------------------------- 312
            GSASTNFSDIMTIGERIEYG                                        
Sbjct: 245  GSASTNFSDIMTIGERIEYGVRHGRITSTTDEPLAAKKASHSKKKEGEVQMVGADRHSWK 304

Query: 313  ------------------------------------------------------------ 372
                                                                        
Sbjct: 305  QQPYRRTPQYSPYYYPTPYGYNQPFVNNATSHYYPYASQNFRPPASQNFQLTPTSQNFQP 364

Query: 373  -----------------------------------------NNQLAPVPVDPIQPPYPRW 432
                                                     NNQLAPVPVDPIQPPYPRW
Sbjct: 365  RGQQHNTFYTQGQQNNRGARKQTQFDPIPMTYTELLPQLFQNNQLAPVPVDPIQPPYPRW 424

Query: 433  YDANAHCDYHAGAIGHSIENCTTF------------------------------------ 492
            YDANA CDYHAGAI HS ENCT                                      
Sbjct: 425  YDANARCDYHAGAIXHSTENCTXLKYRVQALIKAGWXNFKKENGXDVSKXXLXNHQNVQI 484

Query: 493  -------------------------------------------------LTCPFHAGAKG 552
                                                             LTC FH GAKG
Sbjct: 485  NAIECQGIESKSKVADITTPMXELFEILLGSGYISVEYLCPKYKGYDESLTCXFHXGAKG 544

Query: 553  HSLEQCNHFQKRVQELLDSKVLTVTKSHLKKRINVVEDILVAEGSSDSLKPKPLTIFYSE 612
            HSLEQCN F+ +VQELLDSK+LT   SH KK  NVVEDILVAEGSSDSLKPKPLTIFY E
Sbjct: 545  HSLEQCNXFRMKVQELLDSKILTXANSHXKKXTNVVEDILVAEGSSDSLKPKPLTIFYRE 604

Query: 613  KPDAVG---NQSLSLVLAPFEYKSSRVVPWRYECKVTVGQE------------------- 672
            KPDA           V  PFEYKSS+ VPW+YECKVTVGQ+                   
Sbjct: 605  KPDAPSCSRKPXXITVPXPFEYKSSKAVPWKYECKVTVGQDVSSPSLPVDNITGVGGLTR 664

Query: 673  ---------------------NKEKASEKKKGKVEEDKKGNAKLNEYIYDELVEAIVVKD 680
                                 NKEKASEKKK KVEEDKKG AKL+E   DELVEAIVVKD
Sbjct: 665  TGRCYTPDSLLKRVNETTSEKNKEKASEKKKEKVEEDKKGKAKLHEDARDELVEAIVVKD 724

BLAST of Moc03g02390 vs. ExPASy TrEMBL
Match: A0A6J1D099 (Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111016200 PE=4 SV=1)

HSP 1 Score: 85.9 bits (211), Expect = 1.5e-12
Identity = 39/47 (82.98%), Positives = 42/47 (89.36%), Query Frame = 0

Query: 1125 ETLNALAILPVMFNLELNEDVRPIKVGRRDVPASCMSIEEKPDGNPW 1172
            +  +ALA L VMFNLELNEDVRPIKVGRRDVPASCMSIEE+PDG PW
Sbjct: 1804 QVADALATLAVMFNLELNEDVRPIKVGRRDVPASCMSIEEEPDGKPW 1850


HSP 2 Score: 578.6 bits (1490), Expect = 7.3e-161
Identity = 365/694 (52.59%), Positives = 405/694 (58.36%), Query Frame = 0

Query: 119 MAPDRLDLQRMEKKSTKSFKEYAQRWRDTATQVQPPLMDKELSAMFINTLKHPFYDRMIG 178
           MAPDRLDLQRMEKKST+SFKEYAQR RDTATQVQPPL DKELSAMFINTLKHPFYDRMIG
Sbjct: 1   MAPDRLDLQRMEKKSTESFKEYAQRCRDTATQVQPPLTDKELSAMFINTLKHPFYDRMIG 60

Query: 179 SASTNFSDIMTIGERIEYG----------------------NNQLAPVPVDPIQPPYPRW 238
           SASTNFS+IMTIGE IEYG                        +LAPVPVD IQPPYPRW
Sbjct: 61  SASTNFSNIMTIGETIEYGVKHGQITSTTEELLAAKRQVIPRRKLAPVPVDLIQPPYPRW 120

Query: 239 YDANAHCDYHAGAIGHSIENCTTF------------------------------------ 298
           YD NA CDYHAGAIGHS +NCT                                      
Sbjct: 121 YDVNARCDYHAGAIGHSTKNCTALKYRVQALIRVGIESKSKVGDITIPLEELFENLLGSG 180

Query: 299 -------------------LTCPFHAGAKGHSLEQCNHFQKRVQELLDSKVLTVTKSHLK 358
                               TCPFHAGAKGHSLEQCN FQKRVQELLDSKVLTVTKS LK
Sbjct: 181 YVSVEYLCPNLKYKEYDGSQTCPFHAGAKGHSLEQCNRFQKRVQELLDSKVLTVTKSQLK 240

Query: 359 KRINVVEDILVAEGSSDSLKPKPLTIFYSEKPDAVGNQSLSLVL---APFEYKSSRVVPW 418
           KR NVVEDILVAEGSSDSLKPKPLTIFY EK DA       + +   A FE +       
Sbjct: 241 KRTNVVEDILVAEGSSDSLKPKPLTIFYCEKLDAPSCSRKPITITVPAAFEVRGLTRTER 300

Query: 419 RY-------ECKVTVGQENKEKASEKKKGKVEEDKKGNAKLNEYIYDELVEAIVVKDASR 478
            Y              ++NKEKASEKK+ K EEDKKG  KLN+ I DELVEAIVVKDAS 
Sbjct: 301 CYTPDSLLKRVNEPASEKNKEKASEKKE-KAEEDKKGKTKLNKDICDELVEAIVVKDASP 360

Query: 479 KQPMSKEETQEFLKLVKQSEYK-------VIEQLGRTPAKISILSLLLSSEAH------- 538
           KQ +S+EETQ+FLKLVKQSEYK        ++ L      I+  S +  ++         
Sbjct: 361 KQSVSEEETQDFLKLVKQSEYKAFVSQDITVDNLSNVVGNITASSSITFTDEEIPPESTR 420

Query: 539 -------------------------------RNTL------------------------- 598
                                          R+TL                         
Sbjct: 421 HTKPLHISVKCKNFLIAKVLVDNGSSLNIMPRSTLEKLPVDMSHMRPSTVIVRAFDGACS 480

Query: 599 -----LEFFVGVTLDTFG------------------------KGSSSTLHQKIKFAVDQK 627
                +E  + +   TF                             STLHQKIKFAVDQK
Sbjct: 481 TVVRDIEILIQIGPCTFDITFQVMDITSAYSFLLGRPWIHSTGAVPSTLHQKIKFAVDQK 540

BLAST of Moc03g02390 vs. ExPASy TrEMBL
Match: A0A6J1E2J7 (Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111025431 PE=4 SV=1)

HSP 1 Score: 506.5 bits (1303), Expect = 3.5e-139
Identity = 492/1547 (31.80%), Positives = 619/1547 (40.01%), Query Frame = 0

Query: 50   MMSPPTVLNLDGLLAKADPVRQSAPSNEKFEVLEERLRAVEGT----------------- 109
            MMSPPTVLNL GL AK D V Q+APSNEKFEVLEERLRA+EGT                 
Sbjct: 1    MMSPPTVLNLGGLPAKTDLVGQNAPSNEKFEVLEERLRAIEGTYVFGNIDASQLCLVSGL 60

Query: 110  ----------------------------------------------------------DL 169
                                                                       L
Sbjct: 61   VIPPKFKVPEFEKYDGSSCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLSGPASRWYMQL 120

Query: 170  DSSHVGSWKNLIDSFLKQYKHNIEMAPDRLDLQRMEKKSTKSFKEYAQRWRDTATQVQPP 229
            DSS+VGSWKNL DSFLKQYKHNI+MAPDRLDLQRMEKKST+SFKEYAQRWRDTA QVQPP
Sbjct: 121  DSSNVGSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPP 180

Query: 230  LMDKELSAMFINTLKHPFYDRMIGSASTNFSDIMTIGERIEYG----------------- 289
            L DKELSAMFINTLKHPFYDRMIG+ASTNFSDIMTIGERIEYG                 
Sbjct: 181  LTDKELSAMFINTLKHPFYDRMIGNASTNFSDIMTIGERIEYGVRHGRITSTVDEPLAAK 240

Query: 290  ------------------------------------------------------------ 349
                                                                        
Sbjct: 241  KASHSKKKEGEVQMVGADRHSWKQQPYSRTPRYTPYYYPTPYGYNQPFVNNATSHYSPYT 300

Query: 350  ------------------------------------------------------------ 409
                                                                        
Sbjct: 301  FQNFRPPASQNFQPTPASQNFQPRGQQHNTLYTQEQQTNRGARKQTQFDPIPMTYTELLP 360

Query: 410  ----NNQLAPVPVDPIQPPYPRWYDANAHCDYHAGAIGHSIENCTTF------------- 469
                NNQLAPVPVDPIQPPYPRWYD NA CDYHAGAIGHS ENCT               
Sbjct: 361  QLFQNNQLAPVPVDPIQPPYPRWYDTNARCDYHAGAIGHSTENCTALKYRVQALIKAGWL 420

Query: 470  ------------------------------------------------------------ 529
                                                                        
Sbjct: 421  NFKKENGPDVSKNPLPNHQNVQINAIECQEIESKSKVADIRTPMVELFEILLGSGYVSVE 480

Query: 530  --------------LTCPFHAGAKGHSLEQCNHFQKRVQELLDSKVLTVTKSHLKKRINV 589
                          LTCPFHAGAKGHSLEQCN F+ +VQELLDSK+LTV  SH KK IN+
Sbjct: 481  YLCPNLKYKGYDESLTCPFHAGAKGHSLEQCNSFRMKVQELLDSKILTVANSHQKKGINI 540

Query: 590  VEDILVAEGSSDSLKPKPLTIFYSEKPDAVGNQSLSLVL---APFEYKSSRVVPWRYECK 649
            VED+ VAEGSSD+LKPK LTIFYSEKP+A       + +   APFEYKSS+ VPW+Y+CK
Sbjct: 541  VEDVSVAEGSSDALKPKCLTIFYSEKPNAPNCSRKPITITVPAPFEYKSSKAVPWKYQCK 600

Query: 650  VTVGQE----------------------------------------NKEKASEKKKGKVE 709
            VTVGQ+                                        NKEKASEKKK KVE
Sbjct: 601  VTVGQDVSSPPLPIDNITGVGGLTRTGRCYTPDSLLKCVNETTSEKNKEKASEKKKEKVE 660

Query: 710  EDKKGNAKLNEYIYDELVEAIVVKDASRKQPMSKEETQEFLKLVKQSEYKVIEQLGRTPA 769
            EDKKG AKL+E ++DELVEAIVVKD S KQPMS+EETQE LKLVKQSEYKVIEQLGRTPA
Sbjct: 661  EDKKGKAKLHEDVHDELVEAIVVKDVSPKQPMSEEETQEILKLVKQSEYKVIEQLGRTPA 720

Query: 770  KISILSLLLSSEAHRNTLLEFFVGVTLDTFGKGSSSTLHQKIKFAVDQKLVIISGQEDIL 829
            KISILSLLLSSEAHRN LLE                        A+ Q  V     +DI 
Sbjct: 721  KISILSLLLSSEAHRNALLE------------------------ALKQAFV----SQDIT 780

Query: 830  VSMLASMPYFEATEEAFESSFQSFKIANATTLHKK----SGRPKPRLLEAAFKGDTGSLD 889
            V  L+++        A   +F   +I    T H K    S + K  L+      +  SL+
Sbjct: 781  VDNLSNV--VGNISXASSITFTDEEIPPEGTGHTKALHISIKCKNFLIAKVLVDNGSSLN 840

Query: 890  KLPRMAKNTRRFGLGHKPNRCDITRVQNREKAKRLTRFENREHDYSRRTVLPLSHSFRST 949
             +PR         + H      I R  +  ++  +   E           +P+     + 
Sbjct: 841  IMPRSTLEKLPVDMSHMRPSTVIVRAFDGARSAVVGDIE-----------IPIQIGPCTF 900

Query: 950  DIVHREYDESFAVAVVIE----ERKQVGPFIFPCPDSFELSNWSVLELLNTEIECDNDLK 1009
            DI  +  D + A + ++           P        F +        LN     DN   
Sbjct: 901  DITFQVMDITSAYSFLLGRPWIHSAGAVPSTLHQKIKFAVDQNVDYRDLNRASPKDNFSL 960

Query: 1010 YKLDTPIYNVE--SGEEMNDEPSAELSARPLGTSTSKTLSVIERRVAHCQHLSSIRIPGD 1069
              +D  + N    S     D  S     +       KT ++I      C  +    +   
Sbjct: 961  PHIDVLVDNTTGFSTFSFMDGFSGYNQIKMAPEDREKT-TLITLWGTFCYKVMPFGL--K 1020

Query: 1070 NVPALYEGRSTQFPDWLKKRKMKPDQAGQLEAHVQGREH-----PLADPPILVVLPVNKP 1129
            N  A Y+         L  ++++      +    QG EH      L D      L +N  
Sbjct: 1021 NAGATYQRAMVTLFHDLMHKEIEVYVDDMIAKSKQGEEHTTILRKLFDRLRKFKLKLNPN 1080

Query: 1130 ICPMATTFSMAIGVHTREQ---VDL-KVNKWSEVEKIQKRAIITQLKGHFDFNESDRVMK 1172
             C    T    +G    ++   VDL KV    E+   Q +  + +  G  ++      + 
Sbjct: 1081 KCIFGATTGKLLGFVVSQEDIKVDLDKVKAILEMPPPQTQKEVREFLGRLNY------IA 1140

BLAST of Moc03g02390 vs. ExPASy TrEMBL
Match: A0A6J1D7C7 (Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111018303 PE=4 SV=1)

HSP 1 Score: 503.1 bits (1294), Expect = 3.9e-138
Identity = 319/593 (53.79%), Positives = 349/593 (58.85%), Query Frame = 0

Query: 263 RVQELLDSKVLTVTKSHLKKRINVVEDILVAEGSSDSLKPKPLTIFYSEKPDAVGNQSLS 322
           +VQELLDSK+LTV  SH KKR NVVEDILVAEGSSDS+KPK LTIFY EKPDA       
Sbjct: 2   KVQELLDSKILTVANSHQKKRTNVVEDILVAEGSSDSIKPKRLTIFYREKPDAPSCSRKP 61

Query: 323 LVL---APFEYKSSRVVPWRYECKVTVGQE------------------------------ 382
           + +   APFEYKSS+ VPW+YECKVTVGQ+                              
Sbjct: 62  ITITVPAPFEYKSSKAVPWKYECKVTVGQDVSSPSLPVDNITGVGGLTRTGRCYTPDSLL 121

Query: 383 ----------NKEKASEKKKGKVEEDKKGNAKLNEYIYDELVEAIVVKDASRKQPMSKEE 442
                     NKEKASEKKK KVEEDKKG AKL+E ++DELVE               EE
Sbjct: 122 KRVNETTSEKNKEKASEKKKEKVEEDKKGKAKLHEDVHDELVE---------------EE 181

Query: 443 TQEFLKLVKQSEYKVIEQLGRTPAKISILSLLLSSEAHRNTLLE----FFVG--VTLD-- 502
           TQEFLKLVKQ+EYKVIEQLGRTPAKISILSLLLSSEAHRN LLE     FV   +T+D  
Sbjct: 182 TQEFLKLVKQNEYKVIEQLGRTPAKISILSLLLSSEAHRNALLEALKQAFVSQDITVDNL 241

Query: 503 -------------TF--------------------------------GKGSS-------- 562
                        TF                                G GSS        
Sbjct: 242 SNVVGNIMASSCITFTDEEIPPEGTGHTKALHISVKCKNFLIAKVLVGNGSSLNIMPRST 301

Query: 563 ------------------------------------------------------------ 622
                                                                       
Sbjct: 302 LEKLPVDMSHMRPSTVIVRAFDGARNAVVGDIEIPIQIGLCTFDITFQVMDITSAYSFLL 361

Query: 623 ------------STLHQKIKFAVDQKLVIISGQEDILVSMLASMPYFEATEEAFESSFQS 680
                       STLHQKIKFAVDQKLVIISGQEDILVS LASMPY EA EEAFESSFQS
Sbjct: 362 GRPWIHSAGAVPSTLHQKIKFAVDQKLVIISGQEDILVSRLASMPYVEAAEEAFESSFQS 421
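
The identity and positives percentages in the HSP headers above are plain ratios over the alignment length (for example, 496/987 gives 50.25%). The following is a minimal Python sketch for recomputing them from a header line, assuming the header text layout shown on this page. The helper name `parse_hsp_stats` and the returned dictionary keys are hypothetical and not part of any BLAST tool.

```python
import re

# Matches "Identity = a/b (p%), Positives = c/d (q%)".
# The optional "i" also accepts the "Postives" spelling some pages emit.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts plus recomputed and reported percentages for one HSP header."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        # Recomputed from the raw counts, rounded to two decimals.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the header, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


if __name__ == "__main__":
    header = "Identity = 496/987 (50.25%), Positives = 538/987 (54.51%), Query Frame = 0"
    print(parse_hsp_stats(header))
```

Comparing the recomputed and reported values is a quick consistency check when HSP headers are copied by hand or scraped from a page.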

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
XP_022143495.1	1.1e-224	50.25	LOW QUALITY PROTEIN: uncharacterized protein LOC111013372 [Momordica charantia]
XP_022143495.1	5.2e-12	82.98	LOW QUALITY PROTEIN: uncharacterized protein LOC111013372 [Momordica charantia]
XP_022147189.1	3.1e-12	82.98	LOW QUALITY PROTEIN: uncharacterized protein LOC111016200 [Momordica charantia]
XP_022158986.1	7.2e-139	31.80	LOW QUALITY PROTEIN: uncharacterized protein LOC111025431 [Momordica charantia]
XP_022150030.1	8.0e-138	53.79	LOW QUALITY PROTEIN: uncharacterized protein LOC111018303 [Momordica charantia]

Match Name	E-value	Identity (%)	Description
A0A6J1CNY7	5.5e-225	50.25	Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111013372 PE=4 SV=1
A0A6J1CNY7	2.5e-12	82.98	Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111013372 PE=4 SV=1
A0A6J1D099	1.5e-12	82.98	Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111016200 PE=4 SV=1
A0A6J1E2J7	3.5e-139	31.80	Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111025431 PE=4 SV=1
A0A6J1D7C7	3.9e-138	53.79	Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111018303 PE=4 SV=1

InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR005162	Retrotransposon gag domain	PFAM	PF03732	Retrotrans_gag	coord: 96..169; e-value: 3.0E-6; score: 27.4
IPR004252	Probable transposase, Ptta/En/Spm, plant	PFAM	PF03004	Transposase_24	coord: 867..951; e-value: 1.4E-15; score: 57.7
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1..51
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1..25
None	No IPR available	PANTHER	PTHR33240:SF7	GAG-PRO-LIKE PROTEIN	coord: 242..442
None	No IPR available	PANTHER	PTHR33240	OS08G0508500 PROTEIN	coord: 242..442

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Moc03g02390.1	Moc03g02390.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0090304	nucleic acid metabolic process