Moc03g21720 (gene) Bitter gourd (OHB3-1) v2

Overview
NameMoc03g21720
Typegene
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionCCHC-type domain-containing protein
Locationchr3: 14928451 .. 14940448 (+)
RNA-Seq ExpressionMoc03g21720
SyntenyMoc03g21720
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCAAGATTTGGCCGAGGCGCGGAAGACGACCTTGGAGGTCGGCTGCGAGCACTTAAAGGGCTACTCAGGGTACCTGCAAAATGAAAGTCACTCCGACGTTCAAGTTAGTGCTCAATCCGAGCTTAATGGGTAAGCATCAAACAGATTTCTCTTACCTGGAGTCATGGATGAAGGGTGTATTTATAGCGGTAGTTGGGTAGATACACGTCCTATCAGTACAATCCGGTGCTCAAGGAGGGGCTTGCTGAATACCCGTCAGTTCCTGGCAATGACCAGTCCGACTCCCACTCCAAAGGATGAGGTGACGCGGCCGCCTTATTCGTACAAGCAACCAAGGTCGAGCTCCCAGGTCGGCTCCCCAAGCCGCACTACGTCCAGGTCGACCGTGTGTAACCTACCCCCAACACCAGCTTATATGAAGACATACATTGTAATATGATACTCACTATCGCATGTCACCCACAAGTAGGATCAAATCAACATTATCCATAACTCTTTTAGGTTATTAGGATAAAACTTATGACCTTGGTATTTTTCTTTTAATGAATAGTAAAATATTTACTAATTCTAAGATTTTAGAGTATAAATCTTTGATTAAATTCACTAATTCTAGGATCAAATCAACATTATCCATAACTCTTTTAGGTTATTAGGATAAAACTTATGACCTTGGTATTTTTTTTAATGAATAGTAAAATATTTACTAATTCTAAGATTTTAGAGTATAAATCTTTGATTAAATTCACTAATTCTAGGATCAAATCAACATTATCCATAACTCTTTTAGGTTATTAGGATAAAACTTATAACCTTGGTATTTTTTTAATGAATAGTAAAATATTTACTAATTATAAGATTGTAGAGTATAAATCCTTGATTAAATTTGAGTCAAATAAACCATTATGATTAAGGTGAGACTCTTAGGAGCCGTTTGTTTCATGGGGATCAAAGATTTCCATGGGATGGGCATCTGGATTTTTGTTTGTTTAAGGAGAATGTAGGATACTCGAGTATCTCTAGATGTTCGGGTATCCTAGGATGTTCATACGGAGGGCATCTAATGAGTTTTAGATGCTTTTCTGACACACGGGTTTCCAAGATTCCCATATCGAGGACATCTACTCTTTATATTTTTGCCCTTCATTTAATTTTAATAATTTAAAGCAATTATATTATTAATAATATTTATTATCATAAATATATAATTTTAAAGCAACATTTTTCTTAAGTCCCACATACACGTGATATTTTCCTGATAATGCGATGATAAAAAATAAATAAATTTCACCTAAGATAACACCAAACATATAATTATCTTTTTGAAAAAATATTAAAGTTGAAAATGAAATTTATATTAAATTAATATGAATTAATTTGTTCTACAAAATACATAATACTAAATTAATTAAAATAAATCTATTATATTAATGAATGTATAAATGGTATGAATATTCGTATGGATATTTTCGGAACATTCTATAAATTTAACACTTGCCAAGAATATATCCTACAAAACAAACGTGACTATCTAAATAGTAGATGTACGGCATCTACATTCTCAACAATTTACGGATATAGGGGATCAAACTTCAAATCAAATTTACATATTAATTACTGTTGAGCTATGTTCATGCATAAATTACCAATGAGCTATGCTCATGCATAATTTTTTTTAAGGATCATATATTTAGTAATTTAAATGTCTATTACTGCTGAGCTATGCTCCGACATAAATTTAACTTTTTCCTTCCTATGTTATAATTTTTTATTCAAATTCAAATTCAAATTATACTTTGGTGTAAAATTTTAAACTTTTTCCTTCTCATGTTATAATTTTTTATTCGAATTCAAATCAAATTATACTTTCGTGTAAAATTCTGAACTTGCTACTATATTTAAAAATAGATATTTTTAGTTTGTCATTTATCTAATTATCTCTTAAATAATATCTTTAGGCAGTAAATATAAAGTTTTAAGTCCTAAATATATGATCCTTCAAAAAAAAAATCCTAAATATACGTGTAATAAAATAAATCAGAACTTTTTATAATTGTTTTAATAAGTACTATTATTCTAATTAAAAAAAAACCTATACACAATTATCATTATAATGAAAATGATAAAGAACATAATCAATTTCAATAAAACTAAATAACAGGGTGGAATTTATCAATCATTGTAAATTAATACACATTACATAATAAAAAAATCTTATACACAATCATCATCATTATTATTATTATTATTATTATTTTGAGAGGAACAATCATAATTATTTTTAATACTATTTATAGTATAAAAACATTTATATATTGGGCAAAATAAATCAACATAAATGCCATTTATTATTAAGGCGTTCGTTTATTCATCACAATTAATAAATTCCCTTTGCAAAAGTAATTATTAGTTCAAATTGGAATGACTAATTTACAATAGCAGAAACATTAAATTTAAGGTTAAATAATAAGTTTAGTCCTGAATAAATTTTGATGGCCATGTTTATCTAATTCTTATTAAACTTTAAAATGTTTCATTACTGTCTCTAATCAATTTTGATTTCATGAAATCTTTCAAGTATAATACTTGATCAAATAATCATGTGATATACTTATTAAAAAAAATATTGGTCAATTTGACACGACACGACACACAAAATAGTGTTTAGAACGACATTATTTAAATAAGAATCATTAAACTTTATTATACTTGTCAAAAACTAAATTTATAATTTAACCAAAAGAGAAGAAAGGAGAAAAAAAATAGATCATACATATTTAATAGGACAAATATAATAGAAAATATAGGAAGAATCTCAAGATAAAAGTAGAAAAAAGGATGCAAAAACCTTTCTAAATAAAAGCAAAAGGAAAAAAGGTGAGTTAGGTGGATGGAGGAAAAAGGGATAAGTTTTTCTTTTTCCTTTTTTTTTTTGTTAAAAGAGATAATAAATGAGGGCAAATGTACAACTCAACTATATTTTTATACTCATATATATTACATACACACACATATGAAGACATAGGACAAGATGAGATGTTCACAATCACCAAAAACCAACATGACAAGACATGGGAATCTTAGCTTGCCCAAAGGACATCTTATATCGTACGGATCAAAAATTGCACAATCCAATCTTAACACAAACCTCTGCCGTGTAGTGCATTTCTTACCCTATTTTCCACCATTCATATTTTTGTCCCTTCCCCTGTTTTTCGAGTCGCTACGTGTTATGCGGGTCGATAAGGATTCGAATATCTATCTTTTAGAATACAATGTATAATACTGTAATTAGATTGACTAGTGATTAAGGACCTTTCTTTCATGAGGTATGAAATTGAACCAATATCGTAATTTGATATACTAATGTGATTTATGCATTATTCAAATTAGAAGTGGTTAAAAACTTCAAACTATGTTATGGTGTTACTTCTTTTTCAAATGTAGGCGAGGAGACTGGTTGTTTTTTTTCTTCATCTCTAAAATTCAAATTTATAAACTACAATTATCTTCTAAATTGTCTAGTCACGTATTAGTTACTATGTCGATTTAATTTCTATATTTTTAAAAATTTTAAACTATCTCTAAATTTGGTTTCGTTGACAATAATATCAACTACACCTGGAATGCTTTTTATTAGTTTGTAAAGTTGACCTTCTATTAGTGGAATTAACATACATCGAAAATAACGGCATAATCCAAAATGAAAATACATTATTAGAAGTTTAATCAATGTTTAGTTAAGTATAGATACATAGATTAAATTTGTGCAGCTCCAAATATTAAAAAGAAGGGAATAATTATCTAATGGTTATTACCCATTCCAAACTATTAGATTGTAATCTCATAGATTAAAATTAAATTATTCTAATTTTTATGTAACCCTCATGTCTACTAACGTATTAACAACTTCTAAACCTACAAACAAATTTTTGAAATATATTCTTATACAACAATCCAAGTACATATCCAAATCAAGGACATAAATATAGAACCCTCAAATCACATGCTCTTAAGGAAAAAGAGAGAAAAAAATGAGCAAAATTTCTAGCATCTCATGATTTTGCCAAGCTAGACAAAATGGCTGCAGATATTGGAATCAAAACTTACCATCAAAATAATATTATATGTTTGAGATTTGAAATAATAACCTAAACAAAACTCTCCTTATATGGTTTTAAGGTTTCAATTTAAGTCTTTAGAATTTTGGCTTAGTTTCATAGTTTGAATTTTTTAACTCCCAAATCTAGGAGTATTCAAAATTTGGTACTTAATTACCCTAGCATTTTCCTACAATTTTGAAATTTTGTCCTCATCGGCCAAGATTGATCCAAACTAAGTATGCTTGAGCCACCGAAAAACAAGGTTTCCTATAATTTGTTAGATCTCACCGATGATTTCAAATAAATCACTTCTCATCCAAATTTGGACATATAAACCTTAACAAACAGTCAACTACCACAATCCTTCAAGTAGGAAACATTTACTCATTTGTCAGGTAAATAATTAACTGTACTAGCATCATAAACTATGAGAACTAAATTTAAGAACTAAATTGAAACATAGAGACAAAATTTCTTAGAAGTATTTAAACTACCCATCTGGGAAAAAGCAATTGAGGTTTGGTTTAAGATATTATTAGTAAAAACAAAAGACATTTTGGAACATTTATTTAGGCCTATGAGGTTATTATTGAAGGCCCCATTTATATTTGTTATATGTGAACAAAAGGGTTAGATAGTGACATTGGAGATGGCAGCTTCACAAATCCACAGACAAATCCAGCTGAGTGACTGCCTTTTTGGCTGTCATCAGAGACAGAAATTTGCTCTGCAAGTTTTGTATGTCATTAATGGCCACCATACATGATGGAGGAGAGAGAGAGAAGAAGAGAGAGAATGAGAGAGAGATTTCAAAGTTGCAATCAGCCAAGTTGATAGGTCTCAATAATTGGCCCACCAAAGAGATATGGACTACTTGTGATGCAAAAAGCGAAGCTCTGTAAGACTCAAAACAATGGCAGTAAGATGGGAAAAAATGACCAATTCCCAACCCCGTGGATTCTCATTTCCCATACCAAAACTTCCTTAAATTACGGTCCACTGATTTATCGGACACAAAATAGAGCCCGTTTTCAATATGTCGAAATATCTATTTTCATTGATAACTACCACTAGTTTACAAGATATTACTAAACTTTGAAGTCGGGTTTAGAAAAGTTAGCTGAGGAATGGTTGAAAGAATAATCGAAATATATTTTTAAATTTTCGTCACTGACTTATTAGACACAAAATAAAATTCATCTTTAATATGTCGAAATATCTGTTTTCATTGATAACTACCACTAATTTACAAAATATTATTAAAGTGGGGTTTAGAAAAATTACCTGCTGTGAGGAGGGGAGATTTTACATGTCAGAATGGAAAAGATAGGTCTGTTTTCCTGCCATATAATACACACATAAAAGCCAAATCCAAGACCCAATCCATTGATGGATTACTGCTTTCTATAAACAAATAAAATGAATATAAATAAATATTCTTCAAGAGAGAGAGGAACCCACCTGAAAGTAACAGAAAATTATAGAGAAGAGAGAGAATCTCATCAGTTTTGGTTTTCTCTCTCTTCTTTTCGGTATAGTAATTAAGTCGTTTATTGCTGATGCAGCATCATCAAGATTCTCACGTTTCTTTCGAAGACCGATGAACAGTCATGATCTTGCATGTGAATCTTGAAGATGCTACACCCGCAAACAACGCCTGTATCCACCGTCCTAGAGCATAATTCAGTCGAATGATAAGAGAGACTATAACATAAAACCCTTCTAAATTCAAGACCGTTTCATCCAAAAAAAAAAAGAATACAGATCGAGAATCCCTCAACCCAAAAAGAGATCTCAGCAAAAACAAACAGAAAATTATAATAATAATAATTAATTACGTGGATCCTTCCCATTTTTTGGAAAAAAAATTTCTTTTTGGATTCTCGAGCAATATTTTCAATTCTATTTGATAGAGAGAATTGAAGGGTAAGGCTCTCAATCAAAACACAAAACCCATAGAACTAAAATTTATTAAAAAGAAAATAACAATAAAGTTACCAGAAGAAACAGAGGTTGAAGAAATTTTTGAGGCAATAGAACAGAGAAATTTAAAAAGAAAAGAAAAAAAAAAAGAATGAAAGAATTTCTCTATCTAAAAACCGAAGAACGTGTAGAGAGAGAGAGGAGAGGGAACAGTGTAAGATGAGATGACCGAAGAAAGAGTCAACATATATACACGTAAAGCATGCCGACGTGGCAGCTGGACTGGTTTTTATTGGCTGTTGAAAAATGTTTACGTTTATTTATGTATTTTTATTTTTTGAATTTGGGGACTGGGTCGTTGCTTAGCTGTTTCTCGCTCATATTCCTTAATAAAAAAAACCTAATTTTTTTTTTCAAAAACAAAAGAGAAAAAGAAAAAAGAAATTATTATTCTCATAGCTAGTGGCGCAATCCTCGTATTGACCATTTATTATTATTTATTATTTTGTTTTGAATTTCCTTTTTCAAAACATTTCAATGTATGCCGCCACTTCTATACTTCTTTGTTTTCTTTTTCTCTTTCCCTTCCGGAAATACCCAAAATACCCTTCTCTAGTTTTGATAGTCTAGATTTCTTTCTTTTTTTTTTTAAAAAAAAAAGAAGTTTATTGTTTAGATCCTAAAAAACTTGGTACCAAACCCATAATAATAAAAAACTAAAAATATGAAATTCAGCTTTATTTTTTTTTTTGTAAAAAATTCCTTGTTTTCAGATTGGTGGCTTTGAATCACCATTCAAAGAAATCTAAAATTGAGTTTATTTTATTTTGGCCTTTTTGTAGCTTTCAATAAAATAATAACTAATTTGGTTGGTAAATTTTGAACTGTCTATTTCCCTTATAATTTACTCTAAGGTCCTATTTGTTAAAAATTAGACAAATTTTAGGTGTAATTATCATGTTAGTGGGTAGTTAATTTCATAGGAAGTGGATGTAAACTTGAGGAGAAATTTTTGGGCACAGTGGCATGGTGATCTTGATGGGAGAGAAATTCAAGGTTAGTGGCATAACGGTGTCAGTGGAGAGAAGTTTAAGGTTTGTGGTATATTGGTTAGTTTTACATTGTTCATATAAATACTTGTAGCAATTTCTCATTTTATGAATCAAATCCTTCCTTCCTAAACATTTATCTTGTTTCCTTTCTATTGTTCATCTTTGTGTCTCGTTACATGGTATCAGAGCGAGGTTCTTTATCTCGTGTTTGTGTTTGAGTTTCGTGCTCATTGTCTAAGTTTAGTGCTCGTTGCTTGAATTTAACACTCGTTGAAGATGAATGGTGGAAGCAATATTTGTGTAGACAAGCTTACCAATGATAACTATAGCTATTGGAGGCTATATGTATTGAAGCCTTTCTACAAGGTCAAGATCTGTGGGATCTTGTTTCTGGTGATGATGCGGAAATTCCAAAAGACACTCAGGAGAATGCTGAGGCGCAAAGGAAGTGGAAGATCAAGTGTGGCAAAACGTTATTTGCTTTGCGAACTTCTATTGGCAATGAGTATATATCGAGCATGTTCGTGATAAAAAGTCTCCAAAACAAGTGTGGGATACACTTGAAAGGTTGTTCACTCAAAAGAACACGACGAGATTGCAGTATTTGGAGAACGAACTTACTGGAAAAACTCAAGGTAATTTGTCGGTTTCAGAGTACTTTCTGAAAATTAAATCTTTGTGTTCTAAAATTTTAGAACTGGATGAGGAGAAGCCCATTAGTGATGCTTGTTTGCGTCGTTATCTCATTCGTGGACTGCGAAAGGAATTTATGCAATTTATTTCTTCGATACAAGGTTGGGTAAATCAACCTTCTATCATTGAGTTGGAAAACTTGCTTTCAAATCAAGAAGTCTTGGTGTAGCAAATGCTTGGGAACAACAAGCAAACTGCACAGGTGGAAGATGTTTTTTATGCAAAAGATAATTCTTATTCCAGGCGTTCTTCAGATGACAACAAGCACTCCAAAACTGAAGGGCAGTCCAAAGGTAACATAAAAAAATGTTTTAGGTGCGGCAAGTCGGGACATATCAAACATGATTGTCGGACCAAGGTTGTGTGTCATCATTGTGGGAGGCCAGGTCATATTAGACCAAATAGTTGGGTGAATCTCAAAGAAGCGGAAGCAAATGTTGTACGGGAGAGTAACAAACCTGAACAATCTACTTGGGATCAATGTTTGTCGATTGAAGCTGTCGACCAACCCATTAACATGCATGTCAATGCTTCTATAGATTATAATGAGGATTGGATTATTGACTCTGGTTATTCTCATCATACTACTGGAAATGTTTTCTTCTCTCTGAGGTACGTACACATCATGGAAGAAGAGTCATTGCGACAGCCGATAATTTCTTACATCCTGTTGTTGAAGAAGGATGTGTTAATGTTGAGAATGGTGCTCCAAATGTTGGTGGTGTTTCTGTTAAAGATGTTTATCATGTTCCAGGCTTGAAGAAGAATCTGGTTTCAGTCTCTCAGATTACCGACTCTGGGAGGTACTTTTTTTTTTTGTCCAGATGATGTGAAAATTGAAGCAATTTTCAGTTGATATTTTGTTAACTGGAAAGAGGAAAGATTCCCTATCTGCAAGTGATGCATATGTTGAAAATACAGGTCAGAATGATAGTGCAACACTTTGACATACTCGGTTGGGTCATATTGGTTATCAACAGTTACAGAGAATTTCGACAAAGAAGCTTTTAGGTGGTATTCCTCTACTTAAAGAAATTCATCTTGATGTGGTTTGTCCTGGTTGTCAATTTGGAAAATCAAATCGTCTCCCTTTAAGACAGTAAAGCTACTGTTCCATTGCAATTAGTTCATTCAGGCTTGATGGGGCCAGCTTGTCTTGGCTGCATGCAAATAACCTTCCAAGAGAGCTTTGGGCAGCAGCTATTCAGACAGCTTGTCATGTCATAAATCGCCTAGCTTCATGGTCGGGTCAGATCAATCTCCATTTGAAGTACTATATCATCAGAGACCCAATGTGAGTTATCTTCGAGTTTTTGGGTCAATTTGTTATGTTCATGTTTCAAAGAGTAAGCGTACTAAACTAGACCCAAAGGCAAAGTGTTGTATTTTTGTTGGATATGATACTCATAGAAAAGGATGGAGATGTATGGATCCAAATACAAGAGAAATTGTTGTTTCTCGAGATGTGGTGCTTGATGAGGTTTCATCACATGAAGTGGACGCGAGTACAAAGAAAAGTGGTGTTACTCTATCACCGTTCTTTAATGATGATGTGTCAAATGAGAAAGATTTTAATGGTAGTTCTGGAGAGAATGTTAGAACAGATGAAGCTACAGAAAGTACAGTTCGAAGGTCTTCTAGAGAAAGAACTCAACCCAGTTATTTATCTGATTACGAGGTACAGTTAAATAGTTGTTCGTTTGTATCTTGTTTTTTAATGGGTGATGTTTGTGAAGAAGAACCTCAATCTTAAAATGAAACCAAAGGTGTTCTTGAATGGGAAGAAGCAATGCATGATGAAATTTCGACTCTAAACAAGAACGACACATGAGAACTTATTCCAAGGCCAAAGAATGTAGAATTAGTTACTTGTAAATGAGTTTACAAGTTGAAGAAAAGGTCTGATGATACTATTGACTGATATAAAGCCCGTCTGGTTGCTCGTGACTTTTTACAACAATATGGTCTAGACTACGATGAGACATTCAGTCCTATTGCTAAAATGGTAACCATCCGAACTATTATTTCATTAGCTGCTTGCAAGAATTGGAATTATCCAATTGGATGTGAAAAATGCTTTCCTTTATGGAGAGTTGGATCGAGATATTTTTATGGAGCAACCTCGTGGTTTTGTTTCTAAAGAGTTTCCAAATTATGTGTGTCGGCTCAGGAAGGTTTGTATGGCCTTAAGCAAGGTCCACGTGCTTGGTATGGTAAGATTGCTCAATATCTTGAGTTTTGTGAGTTCAGGTCTTCAAATGCAGATTCAAGTCTGTTTGTTAAGAAGACAACCTCGGTACGTATAATGCTTCTACTCTATGTTGATGATATGATCATAACCGGTGATGATGATGTTGAACTTAACCGTCTTCGAGATGTTGTATCTATTCGTTTTGAAATGAAGAGTTTGGGCGAAGCTAAGTATTTCCTTAGTTTGGAGGTTGAAAGGTCAGGTGGATATTTCATCTCATAGAAAGGATATGCAGCAAGTTTATTGAATCGCTTTGGTATGAAAGAGTCAAAGCCAATGACTACTCCTATGGAGCCATCCTTGAAGTTGATTAAGGAAGAAGGAAAACTATTGGCAGATGCAACAACTTTTCGGCAACTCATTGGTAGTTTAATTTATTTGACTATCACAAGGTCTGACATCTCTTACTCTGTCTCAATTTATGGAGAAACCATGTGAAGCTCATTTAATTGCAGCAAAGAGGATTTTTCGTGAAGAGCACTTTGAGTTTTGGCTTGTTGTATAAACAAGCTACTTCCTTTGTGCTGAGTGGTTTTGTTGATGCATATTGGGCCGGTAATGTCAATGATAGACGTTCCACAACCGGGTACTGTTTTAATAGGGGCTATATCATGGTGCAGTAAAAAGCAAACTTCTGTTGCTCTTTCGAGTTGTGAAGCAGAATATGTAGCAGCTTCTATGGCTACTCAAGAGTGTATTTGGCTGAAAAGACTAATGGGAGAAATATTCTCTATTTTGGACTACAAGTGTCCATTTTTTGTGACAATGAAAGTGCAATCAAGCTTGCAGGGAATCCAGTGTTTCATGCTCATACAAAGCATATTGAAACACATTTTCACTTTGTTCAAGATAAAGTTTTGACACAAGACATTCAGTTGCAGAAGATACATTTCAAGGAGCAAGTTGCAGACATATTTACTAAGGCGCTTGGCAGACCCAAGTTTGAAGAACTCAGAAGTTCACTTGGTGTTATTGACCGTACATTTGCACTAAGGGGGAGTGTTAAAAATTAGTGCAAACTTTATGTGTAATTGTCATGTTAGTGGGTAGTTAATTTCATAGGATGTGGATGTAAACTTGAGGAGAAATTTTTGGGTATAGTGGCATAGCGGCATGGTGGTGTTGATGAGAGAGATTCAAGATTAGTGGCATAATGATGTTAGTGGGAGAGAAGTTTAAGGTTTGTGGTATAGTGGTTAGTTTTACATTAGGGAGAGAAGTTTAAGAGTTTTACATTGTCCCTATAAATACTTGTAGCAATTTCTCATTCTATGAGTCAAATCCTTCCTTCGTAAACATTTATCTTGCTTCATTTCTATTGTTCATCTCCGTGTTTCGTTACACTATTTTGTATATGGGGTTTCAAAGCTAGAGTTATGGAAGGTTCAATTATTTGCAATCTTATGTTGCTCTGTCGAGTTGTGAAGCAGAATATGTAGCGGCTTCTATGGCTATTCAAGAGTGTATTTTGCTGAAAAGACTAATGGGAGAAATATTCTCTATTTTGGACTACCCAATGTCCATTTTTGTGACAATGAAACTGCAATCAAGCTTGTAGGGAATCCAGTGTTTCATGCTCATACAAAGCATATTGAAACACATTTTCACTTTGTTTTGACACAAGACATTCAACTGCAGAAAATACATTCCAAGAAGCAAGTTGCAGACATATTTACTAAGGCGCTTGGCAGACCCAAGTTTGTAGAACTCGGAAGTTCACTTGGTGTTGTTGACCGTACATTTGCACTAAGGGAAGTGTTAAAAATTAGTGCAAACTTTATGTGTAATTATCATGTTAGTGGGCAGTAAATTTCATAGGATGTGGGTGTAAACTTGGAGAGAAATTTTTGGGTATAGTGGCATAGTGGCATGGTGGTGTTGATTGGAGAGAAATTCAAGGTTAGTGGCATAATGGTGTTAGTGAGAGAGAAGTTCAATGTTCGTGGTACAGTGGTTAGTTTTACATTGTTCCTATAA

mRNA sequence

ATGCAAGATTTGGCCGAGGCGCGGAAGACGACCTTGGAGGTCGGCTGCGAGCACTTAAAGGGCTACTCAGGGTACCTGCAAAATGAAAGTCACTCCGACGTTCAAGTTAGTGCTCAATCCGAGCTTAATGGTACAATCCGGTGCTCAAGGAGGGGCTTGCTGAATACCCGTCAGTTCCTGGCAATGACCAGTCCGACTCCCACTCCAAAGGATGAGGTGACGCGGCCGCCTTATTCGTACAAGCAACCAAGGTCGAGCTCCCAGCTATTGGAGGCTATATGTATTGAAGCCTTTCTACAAGGTCAAGATCTGTGGGATCTTGTTTCTGGTGATGATGCGGAAATTCCAAAAGACACTCAGGAGAATGCTGAGGCGCAAAGGAAGTGGAAGATCAAGTGTGGCAAAACGTTATTTGCTTTGCGAACTTCTATTGGCAATGAGTATATATCGAGCATGTTCGTGATAAAAAGTCTCCAAAACAAGTGTGGGATACACTTGAAAGAACTGGATGAGGAGAAGCCCATTAGTGATGCTTGTTTGCGTCGTTATCTCATTCGTGGACTGCGAAAGGAATTTATGCAATTTATTTCTTCGATACAAGGTTGGGTGGAAGATGTTTTTTATGCAAAAGATAATTCTTATTCCAGGCGTTCTTCAGATGACAACAAGCACTCCAAAACTGAAGGGCAGTCCAAAGGTCATATTAGACCAAATAGTTGGGTGAATCTCAAAGAAGCGGAAGCAAATGTTGTACGGGAGAGTAACAAACCTGAACAATCTACTTGGGATCAATGTTTGTCGATTGAAGCTGTCGACCAACCCATTAACATGCATGTCAATGCTTCTATAGATTATAATGAGGATTGGATTATTGACTCTGCCGATAATTTCTTACATCCTGTTGTTGAAGAAGGATGTGTTAATGTTGAGAATGGTGCTCCAAATGTTGGTGGTGTTTCTGTTAAAGATGTTTATCATGTTCCAGGCTTGAAGAAGAATCTGGTTTCAGTCTCTCAGATTACCGACTCTGGGAGGCTTGATGGGGCCAGCTTGTCTTGGCTGCATGCAAATAACCTTCCAAGAGAGCTTTGGGCAGCAGCTATTCAGACAGCTTGTCATGTCATAAATCGCCTAGCTTCATGGTCGGTGGACGCGAGTACAAAGAAAAGTGGTGTTACTCTATCACCGTTCTTTAATGATGATGTGTCAAATGAGAAAGATTTTAATGGTAGTTCTGGAGAGAATGTTAGAACAGATGAAGCTACAGAAAGTACAGTTCGAAGGTCTTCTAGAGAAAGAACTCAACCCAGTTATTTATCTGATTACGAGTGGCATAGTGGCATGGTGGTGTTGATTGGAGAGAAATTCAAGGTTAGTGGCATAATGGTGTTAGTGAGAGAGAAGTTCAATGTTCGTGGTACAGTGGTTAGTTTTACATTGTTCCTATAA

Coding sequence (CDS)

ATGCAAGATTTGGCCGAGGCGCGGAAGACGACCTTGGAGGTCGGCTGCGAGCACTTAAAGGGCTACTCAGGGTACCTGCAAAATGAAAGTCACTCCGACGTTCAAGTTAGTGCTCAATCCGAGCTTAATGGTACAATCCGGTGCTCAAGGAGGGGCTTGCTGAATACCCGTCAGTTCCTGGCAATGACCAGTCCGACTCCCACTCCAAAGGATGAGGTGACGCGGCCGCCTTATTCGTACAAGCAACCAAGGTCGAGCTCCCAGCTATTGGAGGCTATATGTATTGAAGCCTTTCTACAAGGTCAAGATCTGTGGGATCTTGTTTCTGGTGATGATGCGGAAATTCCAAAAGACACTCAGGAGAATGCTGAGGCGCAAAGGAAGTGGAAGATCAAGTGTGGCAAAACGTTATTTGCTTTGCGAACTTCTATTGGCAATGAGTATATATCGAGCATGTTCGTGATAAAAAGTCTCCAAAACAAGTGTGGGATACACTTGAAAGAACTGGATGAGGAGAAGCCCATTAGTGATGCTTGTTTGCGTCGTTATCTCATTCGTGGACTGCGAAAGGAATTTATGCAATTTATTTCTTCGATACAAGGTTGGGTGGAAGATGTTTTTTATGCAAAAGATAATTCTTATTCCAGGCGTTCTTCAGATGACAACAAGCACTCCAAAACTGAAGGGCAGTCCAAAGGTCATATTAGACCAAATAGTTGGGTGAATCTCAAAGAAGCGGAAGCAAATGTTGTACGGGAGAGTAACAAACCTGAACAATCTACTTGGGATCAATGTTTGTCGATTGAAGCTGTCGACCAACCCATTAACATGCATGTCAATGCTTCTATAGATTATAATGAGGATTGGATTATTGACTCTGCCGATAATTTCTTACATCCTGTTGTTGAAGAAGGATGTGTTAATGTTGAGAATGGTGCTCCAAATGTTGGTGGTGTTTCTGTTAAAGATGTTTATCATGTTCCAGGCTTGAAGAAGAATCTGGTTTCAGTCTCTCAGATTACCGACTCTGGGAGGCTTGATGGGGCCAGCTTGTCTTGGCTGCATGCAAATAACCTTCCAAGAGAGCTTTGGGCAGCAGCTATTCAGACAGCTTGTCATGTCATAAATCGCCTAGCTTCATGGTCGGTGGACGCGAGTACAAAGAAAAGTGGTGTTACTCTATCACCGTTCTTTAATGATGATGTGTCAAATGAGAAAGATTTTAATGGTAGTTCTGGAGAGAATGTTAGAACAGATGAAGCTACAGAAAGTACAGTTCGAAGGTCTTCTAGAGAAAGAACTCAACCCAGTTATTTATCTGATTACGAGTGGCATAGTGGCATGGTGGTGTTGATTGGAGAGAAATTCAAGGTTAGTGGCATAATGGTGTTAGTGAGAGAGAAGTTCAATGTTCGTGGTACAGTGGTTAGTTTTACATTGTTCCTATAA

Protein sequence

MQDLAEARKTTLEVGCEHLKGYSGYLQNESHSDVQVSAQSELNGTIRCSRRGLLNTRQFLAMTSPTPTPKDEVTRPPYSYKQPRSSSQLLEAICIEAFLQGQDLWDLVSGDDAEIPKDTQENAEAQRKWKIKCGKTLFALRTSIGNEYISSMFVIKSLQNKCGIHLKELDEEKPISDACLRRYLIRGLRKEFMQFISSIQGWVEDVFYAKDNSYSRRSSDDNKHSKTEGQSKGHIRPNSWVNLKEAEANVVRESNKPEQSTWDQCLSIEAVDQPINMHVNASIDYNEDWIIDSADNFLHPVVEEGCVNVENGAPNVGGVSVKDVYHVPGLKKNLVSVSQITDSGRLDGASLSWLHANNLPRELWAAAIQTACHVINRLASWSVDASTKKSGVTLSPFFNDDVSNEKDFNGSSGENVRTDEATESTVRRSSRERTQPSYLSDYEWHSGMVVLIGEKFKVSGIMVLVREKFNVRGTVVSFTLFL
Homology
BLAST of Moc03g21720 vs. NCBI nr
Match: KAA8540328.1 (hypothetical protein F0562_024753 [Nyssa sinensis])

HSP 1 Score: 264.2 bits (674), Expect = 2.2e-66
Identity = 173/401 (43.14%), Postives = 203/401 (50.62%), Query Frame = 0

Query: 93  ICIEAFLQGQDLWDLVSGDDAEIPKDTQENAEAQRKWKIKCGKTLFALRTSIGNEYI--- 152
           +C+EA+LQGQDLWDL+SGDD  IP+DT +NAE +RKWKIKCGK LFALRTSI  EYI   
Sbjct: 22  LCMEAYLQGQDLWDLISGDDVVIPEDTPQNAELRRKWKIKCGKALFALRTSISQEYIQHV 81

Query: 153 -------------SSMFVIKS------LQNKCG--------------------IHLKELD 212
                          +F  K+      L+N+                        + ELD
Sbjct: 82  RDGKSPKQVWETLERLFTQKNTMRLQFLENQLAGMTQDNLSISEYFLKIKTLCSEISELD 141

Query: 213 EEKPISDACLRRYLIRGLRKEFMQFISSIQGW---------------------------- 272
            E+P+SDA LRRYLIRGLRKEFM FISSIQGW                            
Sbjct: 142 TEEPVSDARLRRYLIRGLRKEFMPFISSIQGWANQPSIIELENLLSNQEALMKQMASSNK 201

Query: 273 -----VEDVFYAKD----NSYSRRSSDDNKHSKTEGQSK--------------------- 332
                VED  Y KD    NS+S+ SS DNK SKTEGQS+                     
Sbjct: 202 QSPSQVEDALYTKDKAKSNSFSKHSSGDNKQSKTEGQSRGNSRSCYRCGKLGHLKRDCRV 261

Query: 333 ----------GHIRPNSWVNLKEAEANVVRESNKPEQSTWDQCLSIEAVDQP-------- 346
                     GHI+ N  VNL    ANV  E+++ EQ  W+QCLSIEAVDQP        
Sbjct: 262 KVVCNRCGKSGHIKQNCRVNL--TGANVAHETSEFEQLKWEQCLSIEAVDQPVILNSVVQ 321

BLAST of Moc03g21720 vs. NCBI nr
Match: KAA8521602.1 (hypothetical protein F0562_012275 [Nyssa sinensis])

HSP 1 Score: 255.0 bits (650), Expect = 1.3e-63
Identity = 179/443 (40.41%), Postives = 221/443 (49.89%), Query Frame = 0

Query: 93  ICIEAFLQGQDLWDLVSGDDAEIPKDTQENAEAQRKWKIKCGKTLFALRTSIGNEYISSM 152
           +C+EA+LQGQDLWDL+SGDDA I +DT +N E QRKWKIKCGK LFALRT I        
Sbjct: 22  LCMEAYLQGQDLWDLISGDDAVILEDTPQNVELQRKWKIKCGKALFALRTLI-------- 81

Query: 153 FVIKSLQNKCGIHLKELDEEKPISDACLRRYLIRGLRKEFMQFISSIQGW---------- 212
                          +LD E+P+SDA L RYLI GLRKEFM FIS IQGW          
Sbjct: 82  ---------------KLDTEEPVSDARLCRYLICGLRKEFMPFISLIQGWANQPSIIELE 141

Query: 213 -----------------------VEDVFYAKD----NSYSRRSSDDNKHSKTEGQSKG-- 272
                                  VED FY KD    NS+S+ SS DNK SK E Q  G  
Sbjct: 142 NLLSNQEALMKQMASSNKQSPSQVEDAFYTKDKAKSNSFSKHSSVDNKQSKIERQFGGNS 201

Query: 273 -----------------------------HIRPNSWVNLKEAEANVVRESNKPEQSTWDQ 332
                                        HI+ N  VNL    ANV  E+ + EQ  W+Q
Sbjct: 202 KSCYRCGKLGHLKRNCRVKVVCNRCGKSNHIKQNFRVNL--TRANVAHETGEFEQLKWEQ 261

Query: 333 CLSIEAVDQ-----------PINMHVNASIDYNEDWIIDS-------------------- 392
           CLSIEA+DQ            +  + NASIDY++DWI+DS                    
Sbjct: 262 CLSIEAIDQLVILNSVVQQTNVETYANASIDYSKDWIVDSGCSHHATGNASLLSEVRPHY 321

Query: 393 -------ADNFLHPVVEEGCVNVENGAPNVGGVSVKDVYHVPGLKKNLVSVSQITDSGRL 429
                  A+N LHPVV+EG  NV+    NVGGVS+K+VY VP LKKNL SVSQI DS R 
Sbjct: 322 GKRAIVTANNSLHPVVKEGNFNVKKDISNVGGVSLKNVYCVPSLKKNLASVSQIADSER- 381

BLAST of Moc03g21720 vs. NCBI nr
Match: KAA8549858.1 (hypothetical protein F0562_001542 [Nyssa sinensis])

HSP 1 Score: 239.2 bits (609), Expect = 7.4e-59
Identity = 166/423 (39.24%), Postives = 205/423 (48.46%), Query Frame = 0

Query: 71  DEVTRPPYSYKQPRSSSQLLEAICIEAFLQGQDLWDLVSGDDAEIPKDTQENAEAQRKWK 130
           D++    YSY++          +C+EA+LQGQ+LWDL+SGDD  I +DT +N E +RKWK
Sbjct: 10  DKLVGNNYSYRK----------LCMEAYLQGQNLWDLISGDDVVILEDTPQNVELRRKWK 69

Query: 131 IKCGKTLFALRTSIGNEYI----------------SSMFVIKS------LQNKCG----- 190
           IK GK LFALRTSI  EYI                  +F  K+      L+N+       
Sbjct: 70  IKYGKALFALRTSISQEYIQHVRDGKSPKQVWKTLERLFTQKNTMRLQFLKNELAGMTQD 129

Query: 191 ---------------IHLKELDEEKPISDACLRRYLIRGLRKEFMQFISSIQGW------ 250
                            + ELD E+P+SDA L RYLI GLRKEFM FISSIQGW      
Sbjct: 130 NLSILEYFLKIKTLCSEISELDTEEPVSDARLHRYLILGLRKEFMPFISSIQGWANQPFI 189

Query: 251 ---------------------------VEDVFYAKD----NSYSRRSSDDNKHSKTEGQS 310
                                      VED  Y KD    NS+S+ SS D+K SKT+GQS
Sbjct: 190 IELENLLSNQEALMKQIASNNKQSPSQVEDALYTKDKAKSNSFSKHSSADSKQSKTKGQS 249

Query: 311 KG-------------------------------HIRPNSWVNLKEAEANVVRESNKPEQS 346
           +G                               HI+ N  VNL    ANV  +++K EQ 
Sbjct: 250 RGNSKSYYRCGKLGHLKRDCHVKVVCNRCEKSVHIKQNCRVNL--IGANVAHKTSKFEQL 309

BLAST of Moc03g21720 vs. NCBI nr
Match: RWR74934.1 (Integrase, catalytic core [Cinnamomum micranthum f. kanehirae])

HSP 1 Score: 215.7 bits (548), Expect = 8.8e-52
Identity = 150/393 (38.17%), Postives = 186/393 (47.33%), Query Frame = 0

Query: 93  ICIEAFLQGQDLWDLVSGDDAEIPKDTQENAEAQRKWKIKCGKTLFALRTSIGNEYISSM 152
           +C+EA+LQGQDLWDL+SGD+A IP+DT +NA+  RKWKIKCGK LFALRTSI  +YI+ +
Sbjct: 22  LCMEAYLQGQDLWDLISGDNAVIPEDTSQNADLWRKWKIKCGKALFALRTSISQDYIARV 81

Query: 153 FVIKS----------------------LQNKCG--------------------IHLKELD 212
             + S                      L+N+                        + ELD
Sbjct: 82  RDVSSPKQVWEILERLFTQKNTMRLQYLENELAGMTQGTLSIPEYFLKVKTLCAEISELD 141

Query: 213 EEKPISDACLRRYLIRGLRKEFMQFISSIQGW---------------------------- 272
            E+P+SDA L RYLIRGLRKEFM FISSIQGW                            
Sbjct: 142 TEEPVSDARLHRYLIRGLRKEFMPFISSIQGWATQPSIIELENLLSNQEALVKQMTSNDK 201

Query: 273 -----VEDVFYAKD---NSYSRRSSDDNKHSKTEGQ------------------------ 332
                VED  Y KD    ++ ++ SDD + S  EG+                        
Sbjct: 202 KSLSLVEDALYTKDQGNKNFFKQGSDDTEQSNNEGKFRGNSKGCFRCGQLGHIKRDCHAR 261

Query: 333 -------SKGHIRPNSWVNLKEAEANVVRESNKPEQSTWDQCLSIEAVDQPI----NMHV 346
                    GHI+ N  V L EA ANV +E ++ EQSTW+  LSI A    I       V
Sbjct: 262 VVCNRCGKSGHIKANCRVKLMEAGANVAQEKDESEQSTWEHGLSITANQSTIVTSAQTDV 321

BLAST of Moc03g21720 vs. NCBI nr
Match: RWR74934.1 (Integrase, catalytic core [Cinnamomum micranthum f. kanehirae])

HSP 1 Score: 62.4 bits (150), Expect = 1.2e-05
Identity = 26/36 (72.22%), Postives = 29/36 (80.56%), Query Frame = 0

Query: 346 LDGASLSWLHANNLPRELWAAAIQTACHVINRLASW 382
           L    LSWLH  NLPRELWAAA+Q+ACHVINRL +W
Sbjct: 637 LTSMCLSWLHTKNLPRELWAAAVQSACHVINRLPAW 672


HSP 2 Score: 212.2 bits (539), Expect = 9.7e-51
Identity = 139/332 (41.87%), Postives = 178/332 (53.61%), Query Frame = 0

Query: 106 DLVSGDDAEIPKDTQENAEAQRKWKIKCGKTLFALRTSIGNEYISSMFVIKS-------- 165
           DLVSGDDA IPKDT +N E QRKWKIKCGK LFALRTSI  EYI  +  +KS        
Sbjct: 7   DLVSGDDAVIPKDTPQNVELQRKWKIKCGKALFALRTSISQEYIEHVRDVKSPKQVWETL 66

Query: 166 --------------LQNKCG--------------------IHLKELDEEKPISDACLRRY 225
                         L+N+                        +KELD E+P+SDA LR Y
Sbjct: 67  ERLFTQKNMMRLQFLENELAGMTQDNLSISEYFWKIKTSYSEIKELDTEEPVSDARLRGY 126

Query: 226 LIRGLRKEFMQFISSIQGW--------VEDVFYAKDNSYSRRSSDDNKHSKTEGQ----S 285
           LIR L KEFM FISSIQGW        +E++   ++ +  ++ +  NK S ++ +    +
Sbjct: 127 LIRELGKEFMPFISSIQGWANQPSIIELENLLLNRE-ALMKQMASSNKQSPSQVEDAFYT 186

Query: 286 KGHIRPNSWVNLKEAEANVVRESNKPEQSTWDQCLSIEAVDQP-----------INMHVN 345
           K   + NS+  L    ANV  E+++ EQ  W+QCLSIEA+DQP           +  + N
Sbjct: 187 KDKAKSNSFSKLSSG-ANVAHETSEFEQLKWEQCLSIEAIDQPVIVNSIVQQTNVETYAN 246

BLAST of Moc03g21720 vs. ExPASy TrEMBL
Match: A0A5J5BCB3 (Uncharacterized protein OS=Nyssa sinensis OX=561372 GN=F0562_024753 PE=4 SV=1)

HSP 1 Score: 264.2 bits (674), Expect = 1.0e-66
Identity = 173/401 (43.14%), Postives = 203/401 (50.62%), Query Frame = 0

Query: 93  ICIEAFLQGQDLWDLVSGDDAEIPKDTQENAEAQRKWKIKCGKTLFALRTSIGNEYI--- 152
           +C+EA+LQGQDLWDL+SGDD  IP+DT +NAE +RKWKIKCGK LFALRTSI  EYI   
Sbjct: 22  LCMEAYLQGQDLWDLISGDDVVIPEDTPQNAELRRKWKIKCGKALFALRTSISQEYIQHV 81

Query: 153 -------------SSMFVIKS------LQNKCG--------------------IHLKELD 212
                          +F  K+      L+N+                        + ELD
Sbjct: 82  RDGKSPKQVWETLERLFTQKNTMRLQFLENQLAGMTQDNLSISEYFLKIKTLCSEISELD 141

Query: 213 EEKPISDACLRRYLIRGLRKEFMQFISSIQGW---------------------------- 272
            E+P+SDA LRRYLIRGLRKEFM FISSIQGW                            
Sbjct: 142 TEEPVSDARLRRYLIRGLRKEFMPFISSIQGWANQPSIIELENLLSNQEALMKQMASSNK 201

Query: 273 -----VEDVFYAKD----NSYSRRSSDDNKHSKTEGQSK--------------------- 332
                VED  Y KD    NS+S+ SS DNK SKTEGQS+                     
Sbjct: 202 QSPSQVEDALYTKDKAKSNSFSKHSSGDNKQSKTEGQSRGNSRSCYRCGKLGHLKRDCRV 261

Query: 333 ----------GHIRPNSWVNLKEAEANVVRESNKPEQSTWDQCLSIEAVDQP-------- 346
                     GHI+ N  VNL    ANV  E+++ EQ  W+QCLSIEAVDQP        
Sbjct: 262 KVVCNRCGKSGHIKQNCRVNL--TGANVAHETSEFEQLKWEQCLSIEAVDQPVILNSVVQ 321

BLAST of Moc03g21720 vs. ExPASy TrEMBL
Match: A0A5J4ZW51 (CCHC-type domain-containing protein OS=Nyssa sinensis OX=561372 GN=F0562_012275 PE=4 SV=1)

HSP 1 Score: 255.0 bits (650), Expect = 6.3e-64
Identity = 179/443 (40.41%), Postives = 221/443 (49.89%), Query Frame = 0

Query: 93  ICIEAFLQGQDLWDLVSGDDAEIPKDTQENAEAQRKWKIKCGKTLFALRTSIGNEYISSM 152
           +C+EA+LQGQDLWDL+SGDDA I +DT +N E QRKWKIKCGK LFALRT I        
Sbjct: 22  LCMEAYLQGQDLWDLISGDDAVILEDTPQNVELQRKWKIKCGKALFALRTLI-------- 81

Query: 153 FVIKSLQNKCGIHLKELDEEKPISDACLRRYLIRGLRKEFMQFISSIQGW---------- 212
                          +LD E+P+SDA L RYLI GLRKEFM FIS IQGW          
Sbjct: 82  ---------------KLDTEEPVSDARLCRYLICGLRKEFMPFISLIQGWANQPSIIELE 141

Query: 213 -----------------------VEDVFYAKD----NSYSRRSSDDNKHSKTEGQSKG-- 272
                                  VED FY KD    NS+S+ SS DNK SK E Q  G  
Sbjct: 142 NLLSNQEALMKQMASSNKQSPSQVEDAFYTKDKAKSNSFSKHSSVDNKQSKIERQFGGNS 201

Query: 273 -----------------------------HIRPNSWVNLKEAEANVVRESNKPEQSTWDQ 332
                                        HI+ N  VNL    ANV  E+ + EQ  W+Q
Sbjct: 202 KSCYRCGKLGHLKRNCRVKVVCNRCGKSNHIKQNFRVNL--TRANVAHETGEFEQLKWEQ 261

Query: 333 CLSIEAVDQ-----------PINMHVNASIDYNEDWIIDS-------------------- 392
           CLSIEA+DQ            +  + NASIDY++DWI+DS                    
Sbjct: 262 CLSIEAIDQLVILNSVVQQTNVETYANASIDYSKDWIVDSGCSHHATGNASLLSEVRPHY 321

Query: 393 -------ADNFLHPVVEEGCVNVENGAPNVGGVSVKDVYHVPGLKKNLVSVSQITDSGRL 429
                  A+N LHPVV+EG  NV+    NVGGVS+K+VY VP LKKNL SVSQI DS R 
Sbjct: 322 GKRAIVTANNSLHPVVKEGNFNVKKDISNVGGVSLKNVYCVPSLKKNLASVSQIADSER- 381

BLAST of Moc03g21720 vs. ExPASy TrEMBL
Match: A0A5J5C3K7 (Uncharacterized protein OS=Nyssa sinensis OX=561372 GN=F0562_001542 PE=4 SV=1)

HSP 1 Score: 239.2 bits (609), Expect = 3.6e-59
Identity = 166/423 (39.24%), Postives = 205/423 (48.46%), Query Frame = 0

Query: 71  DEVTRPPYSYKQPRSSSQLLEAICIEAFLQGQDLWDLVSGDDAEIPKDTQENAEAQRKWK 130
           D++    YSY++          +C+EA+LQGQ+LWDL+SGDD  I +DT +N E +RKWK
Sbjct: 10  DKLVGNNYSYRK----------LCMEAYLQGQNLWDLISGDDVVILEDTPQNVELRRKWK 69

Query: 131 IKCGKTLFALRTSIGNEYI----------------SSMFVIKS------LQNKCG----- 190
           IK GK LFALRTSI  EYI                  +F  K+      L+N+       
Sbjct: 70  IKYGKALFALRTSISQEYIQHVRDGKSPKQVWKTLERLFTQKNTMRLQFLKNELAGMTQD 129

Query: 191 ---------------IHLKELDEEKPISDACLRRYLIRGLRKEFMQFISSIQGW------ 250
                            + ELD E+P+SDA L RYLI GLRKEFM FISSIQGW      
Sbjct: 130 NLSILEYFLKIKTLCSEISELDTEEPVSDARLHRYLILGLRKEFMPFISSIQGWANQPFI 189

Query: 251 ---------------------------VEDVFYAKD----NSYSRRSSDDNKHSKTEGQS 310
                                      VED  Y KD    NS+S+ SS D+K SKT+GQS
Sbjct: 190 IELENLLSNQEALMKQIASNNKQSPSQVEDALYTKDKAKSNSFSKHSSADSKQSKTKGQS 249

Query: 311 KG-------------------------------HIRPNSWVNLKEAEANVVRESNKPEQS 346
           +G                               HI+ N  VNL    ANV  +++K EQ 
Sbjct: 250 RGNSKSYYRCGKLGHLKRDCHVKVVCNRCEKSVHIKQNCRVNL--IGANVAHKTSKFEQL 309

BLAST of Moc03g21720 vs. ExPASy TrEMBL
Match: A0A443N8T5 (Integrase, catalytic core OS=Cinnamomum micranthum f. kanehirae OX=337451 GN=CKAN_00329300 PE=4 SV=1)

HSP 1 Score: 215.7 bits (548), Expect = 4.2e-52
Identity = 150/393 (38.17%), Postives = 186/393 (47.33%), Query Frame = 0

Query: 93  ICIEAFLQGQDLWDLVSGDDAEIPKDTQENAEAQRKWKIKCGKTLFALRTSIGNEYISSM 152
           +C+EA+LQGQDLWDL+SGD+A IP+DT +NA+  RKWKIKCGK LFALRTSI  +YI+ +
Sbjct: 22  LCMEAYLQGQDLWDLISGDNAVIPEDTSQNADLWRKWKIKCGKALFALRTSISQDYIARV 81

Query: 153 FVIKS----------------------LQNKCG--------------------IHLKELD 212
             + S                      L+N+                        + ELD
Sbjct: 82  RDVSSPKQVWEILERLFTQKNTMRLQYLENELAGMTQGTLSIPEYFLKVKTLCAEISELD 141

Query: 213 EEKPISDACLRRYLIRGLRKEFMQFISSIQGW---------------------------- 272
            E+P+SDA L RYLIRGLRKEFM FISSIQGW                            
Sbjct: 142 TEEPVSDARLHRYLIRGLRKEFMPFISSIQGWATQPSIIELENLLSNQEALVKQMTSNDK 201

Query: 273 -----VEDVFYAKD---NSYSRRSSDDNKHSKTEGQ------------------------ 332
                VED  Y KD    ++ ++ SDD + S  EG+                        
Sbjct: 202 KSLSLVEDALYTKDQGNKNFFKQGSDDTEQSNNEGKFRGNSKGCFRCGQLGHIKRDCHAR 261

Query: 333 -------SKGHIRPNSWVNLKEAEANVVRESNKPEQSTWDQCLSIEAVDQPI----NMHV 346
                    GHI+ N  V L EA ANV +E ++ EQSTW+  LSI A    I       V
Sbjct: 262 VVCNRCGKSGHIKANCRVKLMEAGANVAQEKDESEQSTWEHGLSITANQSTIVTSAQTDV 321

BLAST of Moc03g21720 vs. ExPASy TrEMBL
Match: A0A443N8T5 (Integrase, catalytic core OS=Cinnamomum micranthum f. kanehirae OX=337451 GN=CKAN_00329300 PE=4 SV=1)

HSP 1 Score: 62.4 bits (150), Expect = 6.0e-06
Identity = 26/36 (72.22%), Postives = 29/36 (80.56%), Query Frame = 0

Query: 346 LDGASLSWLHANNLPRELWAAAIQTACHVINRLASW 382
           L    LSWLH  NLPRELWAAA+Q+ACHVINRL +W
Sbjct: 637 LTSMCLSWLHTKNLPRELWAAAVQSACHVINRLPAW 672


HSP 2 Score: 212.2 bits (539), Expect = 4.7e-51
Identity = 139/332 (41.87%), Postives = 178/332 (53.61%), Query Frame = 0

Query: 106 DLVSGDDAEIPKDTQENAEAQRKWKIKCGKTLFALRTSIGNEYISSMFVIKS-------- 165
           DLVSGDDA IPKDT +N E QRKWKIKCGK LFALRTSI  EYI  +  +KS        
Sbjct: 7   DLVSGDDAVIPKDTPQNVELQRKWKIKCGKALFALRTSISQEYIEHVRDVKSPKQVWETL 66

Query: 166 --------------LQNKCG--------------------IHLKELDEEKPISDACLRRY 225
                         L+N+                        +KELD E+P+SDA LR Y
Sbjct: 67  ERLFTQKNMMRLQFLENELAGMTQDNLSISEYFWKIKTSYSEIKELDTEEPVSDARLRGY 126

Query: 226 LIRGLRKEFMQFISSIQGW--------VEDVFYAKDNSYSRRSSDDNKHSKTEGQ----S 285
           LIR L KEFM FISSIQGW        +E++   ++ +  ++ +  NK S ++ +    +
Sbjct: 127 LIRELGKEFMPFISSIQGWANQPSIIELENLLLNRE-ALMKQMASSNKQSPSQVEDAFYT 186

Query: 286 KGHIRPNSWVNLKEAEANVVRESNKPEQSTWDQCLSIEAVDQP-----------INMHVN 345
           K   + NS+  L    ANV  E+++ EQ  W+QCLSIEA+DQP           +  + N
Sbjct: 187 KDKAKSNSFSKLSSG-ANVAHETSEFEQLKWEQCLSIEAIDQPVIVNSIVQQTNVETYAN 246

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAA8540328.12.2e-6643.14hypothetical protein F0562_024753 [Nyssa sinensis][more]
KAA8521602.11.3e-6340.41hypothetical protein F0562_012275 [Nyssa sinensis][more]
KAA8549858.17.4e-5939.24hypothetical protein F0562_001542 [Nyssa sinensis][more]
RWR74934.18.8e-5238.17Integrase, catalytic core [Cinnamomum micranthum f. kanehirae][more]
RWR74934.11.2e-0572.22Integrase, catalytic core [Cinnamomum micranthum f. kanehirae][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A5J5BCB31.0e-6643.14Uncharacterized protein OS=Nyssa sinensis OX=561372 GN=F0562_024753 PE=4 SV=1[more]
A0A5J4ZW516.3e-6440.41CCHC-type domain-containing protein OS=Nyssa sinensis OX=561372 GN=F0562_012275 ... [more]
A0A5J5C3K73.6e-5939.24Uncharacterized protein OS=Nyssa sinensis OX=561372 GN=F0562_001542 PE=4 SV=1[more]
A0A443N8T54.2e-5238.17Integrase, catalytic core OS=Cinnamomum micranthum f. kanehirae OX=337451 GN=CKA... [more]
A0A443N8T56.0e-0672.22Integrase, catalytic core OS=Cinnamomum micranthum f. kanehirae OX=337451 GN=CKA... [more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 214..238
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 62..81
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 214..231
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 407..432

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Moc03g21720.1Moc03g21720.1mRNA