Sgr016933 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr016933
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Ty3-gypsy retrotransposon protein
Location: tig00153016: 720257 .. 732615 (-)
RNA-Seq Expression: Sgr016933
Synteny: Sgr016933
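
The location field above can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page); the function name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Split a 'contig: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

seqid, start, end, strand = parse_location("tig00153016: 720257 .. 732615 (-)")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, strand, span)
```

Under that assumption the gene spans 12,359 bp on the minus strand of contig tig00153016.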
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCAGGTAGTGGAATAACGTGGAAGAATAGGGTATGTGAAACTACGTAGTCCAATCTAAATCAGATCATAGAACATTTACATTCCAACATGATAAATCTACAAGAACTCCTAAAAAATCAAAAAACTTCCTACTGCAGTTAAAATGTTGACTAAAGCAGCCAAGGAACGATTTGCCAAAATTAAGAAGCAACGCATAGCAATCTATCCCTAAAATCTCACCCCCAAAAATTGAGTAACGTCTAACGTGTAATCGGGTATCTGAAATAATAAGAGAGTTCGCATAAAAAAAATAATAATAATAATAATTAGCACCTGCAAGCGTCCGGAGGCCGGTGGTCGGCCTGGGATTCTTCAGCGCAGGGGTAAAGGAAAGTGACGGGGAAAGATATTCTCTCTGCAAATGATGTAAAGTTACGCAAATTCAAATGATTTAAAACGATTTTCAGCGAAGAATCTCGCCAGAGCGGAAAATAAAGCAGAAACAAAAAAGGCAATGCATACGTCGGAGAAATTGATTTGTGCAGCAGAATTCGGCAAAGCAAGGAGCAGATAAACCCTAAACCAAGCTTGTTTCAGAATTTTACGAATTGGCAATAAAGCGAGAGAGAGAGAGAGAGAGAGAGAGAATCTGGAAGGTTCTGGTGCGTCGCGCGTGGTTGAAAAGTGGGAGCTAGGGCTTGGATTGGGGGAGAATCTTGATCGATTGGGTTATCATCGCATCATACGACGCATCGTGAATGAGGGATGAGTAAGAGGTGGTTACGTAAGCTGTCGAAAGGTCGGCCTGAGCTGTCCACCTACTCGGCCCGCTAGGCCCAACTCATTTGGTTTTTTTATGTCAAATTTATCAAACAAATTTTTTTTTTTTTTTTTTTTTTGAGAAATATCAATTTTCCCTCCTGAATTTGGCGAGGTGTATCAATTTTCAGTATAGACTGTAAATTTCATCAAATTGCACCTTAAACTTAGATAAGTGTTATAATTTTTACCTTAAACTTTATTAAGTGTTGCAATTGTTACTATATAATAATTTTTCGTTTAAATGATTGTTAGATAAGTATTCTCATATACGTTCAATGAATTATAAAACATCCATCCAATAAGAAATTAATAGGACTTAGAGAAAAAGGACGTTAGAGCTAGTTTTTCTGTTAAAATTTGTGAGTTTTGGCAAAATTATTCAATAGTCGATTTTTACAATGATCAAATTTAATTTTATGCAAAACTTGATTTTTATTGAAATTAGTGAGTTTATGATACTTTTGCATTGAAATTTGTTAAAATTAGTGATTGAATTAGTAGAATTTGAATCGAGGTAAAAATTGCAACACTTATCTAAATTTAAAGTCCAATTTGATGAAATTAAAAATTTATGATGAAAATTGATATACCTTACCAAGTTTATGGGTAAAAAAATAATATTTTTTCTTTTTGTATCAATAAACTGTTGGAGGGATTATAACCTCTAACTATTAGAAAAATAAGTAATATTTTTACCAGTTGAGCTATTCTCGTGGCCAATTAAGGTAAGTAAGAGGTTGGGGAAAAATTTTGCATATATTGCTAAATAAAAGAGAAAAAATCGAAATATGTTCAAAAGTTTGTGATACTGTAAAAGTGGCCACAAATCTAAATGCACGTTTTTTATAATTTAAAAATATTTGATATGCCATCCTAGCAATGTTTAGATTTTCCTATTCAAACTATAAAAATCACGTTTAATATTTAATTAATTAAAATTAAAACTATAGAGATAATTAAGGAGAATATTATTATCAACTTTTTAAATTTTTAAATAGATAAATATTAGATTAATCAATATCCATAATAGTATATAACACTCGTGTAAAAAAATATAATAATAATATATAACAACTTACCTTATATAAAATTGAGTGACTCAGTAAGTAAGTAAATAAATATATATATATATATATATATATATTCACAAATTGAGAGTGAGAGATTTTAATCTATGAAGAGGAAGTAGAGATGCTT
TAACCGTCGAACTATGCTCGCATTAGCCTCAGTAAATAAAGAGTTAACGGTCGATAATATATTTTTTTTCTCCAAACTCTCCTTGCTTTTTCCACACTCGCCTAATGACCGTTTTTTAGCCCTTTAGCCAACGATTTTATTACTAGTTCAATACAAATAATAGAGTGAGTTTTTTTTAGTTCAATAATATAGTGAGTTTAATATATTATTTTAAAAGGAAATTAAATTATACCAGATAAATTTTCTTCTTTTATTCAGCATTCAATAATAGAGACTTTACTGTGCTAAATTAAAATCATTATTTGATAAAATAAAGTATCATATAATCAGTTTCGAGTTTTGTAATAATGGAAGATAAACATTCTTTAATAATGGTGATGAAAATAATAATTCCATATATTTGAGAAACATTTTCAGACTAATTTATTGTTGTTTTAATTATCTCATTTTTAAATACCAACAGATCAATGTTATTAATCAATATACTAGATTTTAATTTAAATTCATTTATGAGCCTTTAGTAACCTCTCTAGTTTGATATACGAGTTTTGTAAGTGGGATAAATAATGAGTACAAGATTGTCTCGTTTCACACACACAACACAAGTAGATTGATGCAAAAGCTTATTGCAATGCAACAAGTTTTCTAGTTCTTTACACGCATATTAAACATTTTGGCCAATTAATAAGAGAGGCCAACTGCAAAACCAACATTGGATCAATATGTGATCTAACTTCACCAATTGCCGGGTTAACCACAACTATATAAAATCATAATTTTAAAATTAGGAACAAAATTCAAACCTTTTTAAAAAATCAAGGATTAAAATTCAATCACTGACTTTATAATATAACACTTATTTTATTTAATTTTTGCAAAGAACCCCCCCGAATTTTGAGATTGGTGTAAAGTCATAGAGCGCTACCAAAAATTAGAATTCAGCGCTGTCCAGGAGTCAGCTCGTCACGCCAGGGAAGAACCTTGCTCCTCGGTCTGAAATTGGATTGAAGACTCTCCTTCGTGGTTGGAGACACAGAGGTTAGTGTTCTTTTCGCGCTCTGTTTTTTCTTTCTTGTGCCCTCTATTATCTGTCAAAAAAGAGAAAAAAAAGTGGGGCGCTTCGTGAGTCCCTCTATGGTTTTGCTTACTCGTTCTCTTCTCATCCCTCAGTCGTTATCAAACTGTAGAACGTTTTCTTCTGCATTCAAATTTTGGGTCAAATACAAATTCGGAAAACCCTTTATTCTCGAACCCTTCTTGGTATCTTTGCAGCATGTTTTTCATTATCTAAAGCCGAGAATCTATGGCTATTAGCTTGAACATGGAGTAGCACGCACTGGCTCGGACGTCGGAATGAATCATAGTATCAGTTCTGATGAGCTCGACCATCTTTCGTTGGAGGTGAGGCAGAAAATGCTACTGGAAAGTAAGCATCCATTACTCGAAGATGGGAGTAAAAGAATATCAACGTGAGTATTATCTGTTGATGTATATATTTTTTCTGCTTATTTCGTTTTTGATTAGAGACTCAACGTTTTCTTTGGGCCATATTCAGGCTTTGCAACGATTTGCTTTTCAAGAAAGAAGATGAATGCTGTGATTTGCAGGTAATTTAGCATGCGTGGCACTTCAATTTGTCTCGGTTTTTGTGCTCATTGATAGGTCGTGTTACTGTTCTGGCAGCAATGACAATTTACAACTACATAAATTTGTATGTTATACACACACACAAGAATTTTCTTTTGAAAGGAGCCCTCTTTTATTGTGGCTGCCCCAAGTCAATTGGAACATTGTTACTTTTCCCAGAGATAAAGGATGCCTTACCTAGGTATTGGCAATACTAAGGAGAGAAATTTGTACTCCTTATGAGTCTTTGGCGTTTCTTGATAACTCATAGGAGATATGGAAGGAGGTCATCTCAAGTAAATAAGTGAGAAGGCATTCCTAACAAATGTTGCTATCTAAGTGCCAAAATCCCATGGGCATATCACAAAGCTTGC
TCAACATTTTGATCTTCGCACAAGGCCAACTGTTCAAGATAGTGCTAACACCCATTTTGGCATCATGAATGGTGCCATGGCCATTGCCTCGTGATCGTTTTTCTAGGCTTCTCTTTCTAGCCATCCACAAAGATATTTTAGTCAAAGATATTTGTGGGTTGAGAAATGATTGTTGGCACATCTGACCTTAGAAGGGACTTTTGTGGAAGGGGTATTGCTAACTGGTTCTAACTATTGGATCTCATGAATGGGTACTTTAATGTCTAGTAAAGACTCTAGACCTTGGCTTTTGGATAACTGCCTTCACAGTTAAATATATTTTCGCCATCTCAACAAAAGGGATCCATTGACCACTCTAGTCCTCTGTAGCTGATTGTGGAAATCATTCATTCTTAAGAAATGCAAATTCTCTTTCTCTCTCTCTAATTATTCCTCATTGTCCATACATGCTTATTGGAGAGTGTTTTTGTAACTGTTTTGGCCCTCGTTTGTAATTTCACTACCTTCGGTAAAATGGTTTCTCATCCATAAACATTCTTATAGTGAGCTAAGAATTTGATGGCCTAAACTTATTTGTCTTAACCTTGATCAAGCAAGCCTTCTATATCTTTTGAAAGGCAAGCTCCTTTTCCTCAATTAGCTAGTTTTTATTTTTTTGTTTCCTTCTTCAAGTTTGTCTTTGTTTGTGTTTGACCTGATAGTCTTACGAGACTACTCCAATTTACAAGTGTACGACTTCACTAAACTTGATACAATAATGACTCAACTTATTTATCCAAGAAAATTTTGATTAAATGAAATTAACATTAAGAAAACTTTTGCTCATAGAGTATAATTTCACCAATTTTCTGTTTATTTTTCAATTTGTTATATGTTCTTGCTGCAGGGTGTTTCCTCTGTGATATCTGCTTGTGATGCCAGAAATCTGGGAGATCAACAACTGGAAGCTCAAATTAACGATACTAGTGGTATGGGGGAAATACTTCTGAAGGTCTGTTTTATATTGTTCCTGAGATATTCTAATTTCATCAGTGTTTATGATATAGTTCTTGTGACCTTATGTAGGTCATCTTGAGGACTGCTACAGTGAGGGGGCTCGATTAAATATAGAGAAGCAGATATCATGCACTATGGCATTTGAGAATCCAACACCACCTGAGGTTCCTGATTGGGTAAGGGTGGAATCTACAGGCATCTTACCAGGTTTCTTGGCAGATGGTGTAGATAACTTTGCTTCCGCTGGTGTGGCTGTAACTAAGGTTAAAAATGAGACGTTTGATGACTTCGATGAAGATCTTGATCATGTTGTATTGATAGAGCGACTAAGGATGCTGCTATCAAGGTTCCACATTTTCTTGGTGATCATATTCTCTTTCTGTAGTCATCTATTTCCATCCATGCACTGTTTAGGTCTATAATCTTTTTCCCCCATACTTAGACTAGTATATAGTGTGACTCAATTGCAGCCAATTTTTTTGTGGTTAAAATAATGTTTTGGTCTTTCTATTTTAATTCCTAAACTTTCAAGTTTCAATTATTAAAACTTAGTAAGTCATCTGAAATTGATCTCCTTCATTAACTCCTCTAATAGGATTTAATAATTGGGATAAGTAAAATAAGTCATATAACAACAATAACGTAGTTCTCAACAACCAAATTTAATTAAAAAATGTGTTTTTAATGAAGCCCGTGAAATAGTTTTTCCCAATAATTACATGTTTTAACTAGGCTGATAACTATTGATTTTATTTTTTCAGAAAGAATATTTGTAGGACAAGTTCAAGATGTAGTACTTTAGATATTTAATTAAAGTTGAGATTTCAAATGAAATCGAGTTAAATTGGATCACAAAACAATATTATAACCTTTTTTTATTTTCATTTTTATTATTATTTTTTTTGTGAAGGATTAGAGTTATCTAATGTTATTTTCCCGTTCCCATTGTAGGAGAGCGTTGGGTTTCATGAATCACCATGTGGAGGTAATTCGCTGCTTTTCTG
TAATTTGTAGCAAAAAATCTGCAATTTAATACGTCATGGGTGGAATCACATACAATGCCAACGTCCTCGAGAACTATAGTTTCTATTGTATGATAGTGACCCATGAGAGGAGATTCGAGGTGAGGGAGATGAAACTGGAATTTATAGAGGTAGAACTCTGGGATGTGAGATTATGACGTTGCAAGATGCTTCTTTGTCAACCAGATAGTAGTGGGTCAAAGATCATGGCGTGGTGGCAGTAGTGGCTCCAAAGGTAGGAGTGTCAACTGTCATGAAGAAGAGGAACAATGTAGAGGATTTTCTTTTAACCTTCACGTGTTTGAAGAACTTTGGCGGTGGCTTCATAAATAATAATAATAATAATAAAAAAAAAAACTTTTGGGATGACAACCAAAAATTGTGGTTGAAAGAAATCCTTTTAGAACCTTCATGGATCATGTCCTTTTTGTTGCTCTTTCTTGGAGTAAATTTCATACTCCTTACAGAACTCTTATTCTTACTTATCTTTGTGCCAACTGGAAGATCTTTTTTAACTTCTTTGTCTTGGAGTTTGGCTTCTCCCCTTTTTGGTAATTTCATTCATTCAATGAAATTATCTTTCATTAGAAAAAAAACCTATCAACTAGAACTGTGGATTGTGGAGACTGGATCCGCCCAATTTGAGGGTGCCAAGAAGAAATCCCCGTGCTCCTTAATGACATGGTTAGAGGAGCAGTGCCACCACCACCAAAAAATGAATTTTTGGAGTGGTGAGGAGAGGTCCCTTCTTCTAAATGTTGGTTTAGTCGTATTAGGAAAGAAATGCCACATGAGACATCTTTGGCACGCCCATTTTTCTTATAGGGTTCACACTATTTTCCTTTCCTAATTAGTTTTTATACCATAATTATGTGAGGGATTATGGACTTCCAAGAGGTACTTGATGAGGGAGGAATCCTAGCCACCAAGGCAAAATTTTCTGACTCATGTTGAGTTGTGGGTGTGGTATCCTTCATCATCAACCTTTTGTGGCTTTACTTGCTGTACTTCAGCAAAAGCCCCGCAGTATTTGGTAAAGGTTTCATGGATAGGGTTCTCCCCCTCACCTTATCTAATTCCTTGTTAGACCAAGTAAAAGTTGAATACTCTCTTATTTTGAACTATCTTTCGAAATGTGGTACCGTTTATTACAACTTGTAGTTGTGATTTTCAAACATAGTTTTTGCCAATCGGGTGAGTTGGGTGAATATTAAGTTACAGGCACCTCTTTTTGTCGTAAGTCATGAAATGTAGGTCTCCGATTATTTTCTGTCGTAAGTTTTGTTTTAAAAAATGTTAAAAACTTGCCTTGTAATTTTGTGCTGTCCTCCCTTTCTTCATTGGCAGGTCATGTACATATTGACAAACATGTGTTTGTGTTATGGATTGTTTTTCTCAATGAAATGGGTCAAAAGCTTCATAATATGTCCCCCAACTTGTTTTTCCCATTGTTTCCTTCCATGTCTTTTTCATGATGCTCTGCAACATTAAGTTTTGTGATATTTTGTCATGATTTGCTTTTTTGTTGTTTAGAGTTGTTGGTAATTTGATACGCTGCTGGGTTGTGCAGGGTAGTTCTGGTGTGTCATCAGGAAATCTTATACAATGCTTTTTGAAACAGAAGGAAAAGTTTATGTTTGCTAATGGGGAGCTGATGGGAATTGGCAATGCGTTGCATGATAAAATTGGAAGTGATGCTCCTCGTCTTTGCATCCCTTCAGTAATTTGTTCACCTAATACAGCCTTTTCTGGATCCTGTTTCTCAAGCGATCATTCTTTAAATAAATCAACTGAATCAGGCAATGACATGGAACTTAAAGAAGATGATAAGATCTGTTCATCTGAGAAGGTGGCCACAGAATTAGGCCCACGGCTTTTGACTGATCATGTCCCTGAAGTAAATTTATTTAATTCCACAAAAGTGAAGGATGAACCTTATGATCATGTTGACGGCTGCAACTTATATGATAAGGATACGAAGA
ACATCTGCAGCAGAATTTTGTCAATAAAGAGTGAAACAATCATGCCCGATGAACCTTATGAAAACAAGGTAGATGATATGCGACTGCAAGATCGAATGAAGTTTTTCTCATCTCGAAAGGTTTTTGGTTCTACGTCAAGGGATTACGAGCATCCAAAACCTTCTGACCCTGGATGTAGTTCTCTTGTTTCAGAACCTTCTAGTTTAATGAACATTAAACATCGACGCAAGCGGAAAAAAACTGCCACGTATGTACTTTACGTGTTTGAATACTACTGCAATATACTTTTTTTTCCTGTGTTTTCTCAATCTTTCTAAGGTGTTTTGTGTCAATTTATTTGCAGGAATTCAGTTGAAACAGCACTCGAGGAAGATGCCCCTGGCCTTCTGCAGGTATCAGTATCATGTGTCGTTCAATTGTGCTTGTGAATTTTGGATCATATTCATTCTAGTTTCATTGGCAGATACTAGTTGGCAAAGGTGTAGAAATTGATGAAATTAAGCTTTATGGAGAGATGGAAAGTGATGATGATCTGCATGAGTCATTTAGTGAAGACAGCTTTGGTGAGCTTGAAGCTGTGATATCGAGGGTGAGCGATAATTTATGATCTTGTTGTCATGTAATTGCTTTGCTTTTGTAATTTCTCCTAAATTTTCCTTCATGGGCTGTTTGGCGTAAGCTTTTCTAGCCATGAGTTTCAATATCCTACATACGATATTTTCCATACTTGAGAAATGAAGATCCTTGGGTGGTAATGAATGCCTATCCCCTAGGGTTTTTAAACCTCTCAAACATATGGTTTTCTTAACCACCAAACACAGGTTTCCTAATATCATGAGAGAGAGGTCTAGTTCAAAAGTTAATGAAAAGTTTGTTTCTTGTTCAAAAACAAATGAGAGAGAGGTCTATATTACTTAATTTTGATTAATTAACAAATATATATTTCTTAAGATTTAATATTTGTTTCTATGAATATTTTTAATAAATTTTTGCAGCGGTTGTTTGGGAAATTATTCTGCAGATTTAAAAACTATTTTATAGAGTTCTTTAAAAATGTTACAGTGCACTTAACCATTGTTCACTAGTCCATGTTGTCAAAGGAAGCACTACAACAAAGTAAAAGAGAACAACTTTACTTGTTATAGTAGCATAATTGAATTTGTTCTCTCTTGTCAATATTGTCTTGTTTCTTTTTCTTATTGTCCTTCCCCTACCTTCGTCTGCCTCCTTCTGGGAAGTAGTTTATTTGATAATTTTAATTGATTGGTTGGATTTTATATTAGATTGCAATGGAAATAACTTATTTCAGTGGACCAAGTTCTGGTGGTGCTTCTTCAAAATTAGTTGAATTTCAAATTGTTAAAGAAAGAGATTGGATGATGGGTCAATTTGGGTGCTGAGAAAATCGGACTAGTATGGTTGGGTTGTTGAAATTATTTCTCTTTTCCATGGAGTGAAGAGGTGTATCTTAGTCCCTATTAGGAGAAAACAAGGGAGGTTGGAGTGTCTTTTGGGATACGCTCTATGTTTTTATTAGAAAATTTGGTAGTGAAAGGAGCAAGGAAAAGGTGAGCAAACCTGAGAGTGGGAGGAGTAAGGCTCATCTCGGCGGATGAAGATATGGTGTGTATTAAGAATGATTAGGTAAAGGAGTTACTACTTTTAACTTTGGAAGGTGTAATGCTAGGGTTCATGGGAAGATGTCGTGGATTCCTTCTTACGATGACTGGATTAGAGTTGGGATTTTTTTTAGATAGGAAACAGAGAATATTATTATCAAGAGCCCAGATACAAAAAATGGGAGATGAGATATCCCCATGAAACCAACGGGTTACAAAAAGGATTCCCTGGTATAAATATACAAAAGCTCAAAGTACAAAAAAGTTTAGCTAAGGAGCACCATGAAGATGAAAAAGATATAGTGAGATCCCAAAATCATCAAGGCTTGACTCGGCTCCTTTGAAAGTCTACTGGTATCTCTCGTGCCATTCCAAAAGACT
AGCCACGGAATTGGAACAGATTTCACCAGCATAATACCCCCTGGAAGGAGAAGCTAGCAATAACTGCTTGAGTGCCTCCCCACTCTTCTTAGCAAAACCCCAACTGATGCCAAAATCTGAAAAAATTTAGACCAAACTGCGGCGGCAAAGGGGCAAAAGAAGAATAGATGAGAGTGGTTCTCCAAGTCTCTTTTACACATAATGCACCAACTAGGGGAAAGTAAGAAATTGAGCATCCTTCTTTGGAGCTTCTCGCAAGTGTTGAGACGTTCTTGGCCAAAAATCCATAGGAACATTTTAACCTTCTTAGGGTAATTCGCCTTCCAAATGTCATTGGAAACATGTGTCAAGAGAAGAGTTTCGAGCACTGATTTTTTAAACAAGGGAGCTCACTTGATACTCTCCGCTCCTATGAAGGATGAGATTTAGAGTCCTCTTGATTAGAAGGCTGCCGTTGACTGATTGAGCTTGATAAAGTGTTCCATTTTTCAATCTCAAGATCTGTGAGGTTCCTTTTAAATTGGAGATTCCATCTCTTATTATCATTTGACCAACATTGGTAGATAAAGAAACCTTTAGAGTTAGCCACGTGGTAGAATCTCAGAAAACACAGCACAGGCATCACCTATATGCTTAAATGATTAATATTCCAATTATCAAGAAGAAAATTAGGTATACTCTCTTTCCCATGACAAATTTTCTTCTTGATAATTGGAATAGTAATCATTTAAGCATATAGGTGATGTATGTGCTGGTTTTTTAACATTTGCTCAGATAACGGTTAGGAAGATGGATTTTATGGAGGCTGCCATGAAAGTAAGGGACAATTATTATGGTTCCACAACACCAGAGGTGGAGTTATTGTAGTTGGGTGGTACAAATTGCAGGTTTTCTAACACCTAGTGGTTAAATAGTAGAAACCCGAGATTCAAGGTACTTCTTCAAAAGAGGTGGCTAGAGCTTTTGATGAATTCATGTTGTCACCCTGTGGATATTTCGCTGTAGTGCTCCAAGATAATTTGTTTGTCTTGGAGCCTATTTTCAGGGAAGATGAGGTGATTCTCAGAAGAATTTTAGTAGACATTATAGGATCTGATTGCCTCCTCGAGATTCTTTGCAACAATGTTCTTATTGTTGTAACATGGTCGACGTAATCTTCCAATTATCATTTATGGTTTGAAGTTTGGGTGACCATTTCCAGGATAGCAACTGGCTGTCATCACTCATACGAGGAGGAGCACCTCAAGTTTAACCAATCTGCACCTCATGTCATCAGAAGATTTGCATGTGAGGCTCACACATCACTGGTTCGTTCTATTTTCTGCCCTGCCCGCTGAAGCATAAGTGGGCAGAATGGAATTTTGATGGTATTGGAAGAGTAGTGAGGAAATTTGGGATAACATCACCCTTGTACTTCTATTTAGTTTTTTCTTTCTTAAGTTTTCTTTAACTACTCCACTTCTTTTATATAACCAACTGGAAAGTGCTCTTGTTGTTCCCGCTTTTGGACATACATTTCAATTTTTCAATGATATTGGTTTCTTATCCAAAAATCAAACTCAACTGTTTATTTGATAATTTTGATTGGTTGGTAGTAGTTAGTACTCTGATTTGCGGTTGATGGTTGGGATTTTTTATGGCATTTTGTTTAGTACTCTCTCTGTCTATACCCTAATGTTATTGTTTTACCATATTGTAACAGCTGTTTTCTCAACGCCATTCCTTTTTGAAGTTTCCTTCTATAAGATGCACAAAAGCTTCTAGAGCAAGCTATTGCTTAGCTTGTCTAGTTTCACTTATTGAGCAGGTAACAAGTTTTATCTTCCATTTACGGTTATGAACTTATTTTGCGTATCATTTTTTCTATTCACTTCTATTTAGTTCATCATTGCATATGATTAGTTTTCTTCATGTCATGATCTTAAGAAAGATCTTTTATTTTCTTTTAGCCTATGCTTAAACTTTGTTACTGTCCAGTTTATTCCACTCTGATAATCTAT
ATTTGTTGGTCAATCTTCTATTTTAGACAAGATATCTTCATTTCCGGAGTTGGCCTGTCGAATGGGGGTGGTGCCGTGATCTTCAGTCTTTTATATTTGTATTCGAGAGACATAAAAGGTGCATGCACTACAATTTTTCTACGTGAATCAATGTACAGAGTTAAATGTACACAGTTAAATATTGATGGTGCTTCCTTGAGTGCAGAATAGTGCTGGAACGTCCCGAGTATGGCTATGCTACATATTTTTTTGAGCTTGTCGATTCCTTACCTATCGACTGGCAGATAAAGCGGTTGGTGATTGCTTTGAAGCTTACTAGTTGTAGCAGAATTTCACTAATTGAGAACAAACCATTGTTG

mRNA sequence

ATGCAGAGAGTTCGCATAAAAAAAATAATAATAATAATAATTAGCACCTGCAAGCGTCCGGAGGCCGGTGGTCGGCCTGGGATTCTTCAGCGCAGGGCGCTGTCCAGGAGTCAGCTCGTCACGCCAGGGAAGAACCTTGCTCCTCGGTCTGAAATTGGATTGAAGACTCTCCTTCGTGGTTGGAGACACAGAGCACGCACTGGCTCGGACGTCGGAATGAATCATAGTATCAGTTCTGATGAGCTCGACCATCTTTCGTTGGAGGTGAGGCAGAAAATGCTACTGGAAAAGACTCAACGTTTTCTTTGGGCCATATTCAGGCTTTGCAACGATTTGCTTTTCAAGAAAGAAGATGAATGCTGTGATTTGCAGGGTGTTTCCTCTGTGATATCTGCTTGTGATGCCAGAAATCTGGGAGATCAACAACTGGAAGCTCAAATTAACGATACTAGTGGTCATCTTGAGGACTGCTACAGTGAGGGGGCTCGATTAAATATAGAGAAGCAGATATCATGCACTATGGCATTTGAGAATCCAACACCACCTGAGGTTCCTGATTGGGTAAGGGTGGAATCTACAGGCATCTTACCAGGTTTCTTGGCAGATGGTGTAGATAACTTTGCTTCCGCTGGTGTGGCTGTAACTAAGGTTAAAAATGAGACGTTTGATGACTTCGATGAAGATCTTGATCATGTTGTATTGATAGAGCGACTAAGGATGCTGCTATCAAGGAGAGCGTTGGGTTTCATGAATCACCATGTGGAGGGTAGTTCTGGTGTGTCATCAGGAAATCTTATACAATGCTTTTTGAAACAGAAGGAAAAGTTTATGTTTGCTAATGGGGAGCTGATGGGAATTGGCAATGCGTTGCATGATAAAATTGGAAGTGATGCTCCTCGTCTTTGCATCCCTTCAGTAATTTGTTCACCTAATACAGCCTTTTCTGGATCCTGTTTCTCAAGCGATCATTCTTTAAATAAATCAACTGAATCAGGCAATGACATGGAACTTAAAGAAGATGATAAGATCTGTTCATCTGAGAAGGTGGCCACAGAATTAGGCCCACGGCTTTTGACTGATCATGTCCCTGAAGTAAATTTATTTAATTCCACAAAAGTGAAGGATGAACCTTATGATCATGTTGACGGCTGCAACTTATATGATAAGGATACGAAGAACATCTGCAGCAGAATTTTGTCAATAAAGAGTGAAACAATCATGCCCGATGAACCTTATGAAAACAAGGTAGATGATATGCGACTGCAAGATCGAATGAAGTTTTTCTCATCTCGAAAGGTTTTTGGTTCTACGTCAAGGGATTACGAGCATCCAAAACCTTCTGACCCTGGATGTAGTTCTCTTGTTTCAGAACCTTCTAGTTTAATGAACATTAAACATCGACGCAAGCGGAAAAAAACTGCCACGAATTCAGTTGAAACAGCACTCGAGGAAGATGCCCCTGGCCTTCTGCAGATACTAGTTGGCAAAGGTGTAGAAATTGATGAAATTAAGCTTTATGGAGAGATGGAAAGTGATGATGATCTGCATGAGTCATTTAGTGAAGACAGCTTTGGTGAGCTTGAAGCTGTGATATCGAGGCTGTTTTCTCAACGCCATTCCTTTTTGAAGTTTCCTTCTATAAGATGCACAAAAGCTTCTAGAGCAAGCTATTGCTTAGCTTGTCTAGTTTCACTTATTGAGCAGACAAGATATCTTCATTTCCGGAGTTGGCCTGTCGAATGGGGGTGGTGCCGTGATCTTCAGTCTTTTATATTTGTATTCGAGAGACATAAAAGAATAGTGCTGGAACGTCCCGAGTATGGCTATGCTACATATTTTTTTGAGCTTGTCGATTCCTTACCTATCGACTGGCAGATAAAGCGGTTGGTGATTGCTTTGAAGCTTACTAGTTGTAGCAGAATTTCACTAATTGAGAACAAACCATTGTTG

Coding sequence (CDS)

ATGCAGAGAGTTCGCATAAAAAAAATAATAATAATAATAATTAGCACCTGCAAGCGTCCGGAGGCCGGTGGTCGGCCTGGGATTCTTCAGCGCAGGGCGCTGTCCAGGAGTCAGCTCGTCACGCCAGGGAAGAACCTTGCTCCTCGGTCTGAAATTGGATTGAAGACTCTCCTTCGTGGTTGGAGACACAGAGCACGCACTGGCTCGGACGTCGGAATGAATCATAGTATCAGTTCTGATGAGCTCGACCATCTTTCGTTGGAGGTGAGGCAGAAAATGCTACTGGAAAAGACTCAACGTTTTCTTTGGGCCATATTCAGGCTTTGCAACGATTTGCTTTTCAAGAAAGAAGATGAATGCTGTGATTTGCAGGGTGTTTCCTCTGTGATATCTGCTTGTGATGCCAGAAATCTGGGAGATCAACAACTGGAAGCTCAAATTAACGATACTAGTGGTCATCTTGAGGACTGCTACAGTGAGGGGGCTCGATTAAATATAGAGAAGCAGATATCATGCACTATGGCATTTGAGAATCCAACACCACCTGAGGTTCCTGATTGGGTAAGGGTGGAATCTACAGGCATCTTACCAGGTTTCTTGGCAGATGGTGTAGATAACTTTGCTTCCGCTGGTGTGGCTGTAACTAAGGTTAAAAATGAGACGTTTGATGACTTCGATGAAGATCTTGATCATGTTGTATTGATAGAGCGACTAAGGATGCTGCTATCAAGGAGAGCGTTGGGTTTCATGAATCACCATGTGGAGGGTAGTTCTGGTGTGTCATCAGGAAATCTTATACAATGCTTTTTGAAACAGAAGGAAAAGTTTATGTTTGCTAATGGGGAGCTGATGGGAATTGGCAATGCGTTGCATGATAAAATTGGAAGTGATGCTCCTCGTCTTTGCATCCCTTCAGTAATTTGTTCACCTAATACAGCCTTTTCTGGATCCTGTTTCTCAAGCGATCATTCTTTAAATAAATCAACTGAATCAGGCAATGACATGGAACTTAAAGAAGATGATAAGATCTGTTCATCTGAGAAGGTGGCCACAGAATTAGGCCCACGGCTTTTGACTGATCATGTCCCTGAAGTAAATTTATTTAATTCCACAAAAGTGAAGGATGAACCTTATGATCATGTTGACGGCTGCAACTTATATGATAAGGATACGAAGAACATCTGCAGCAGAATTTTGTCAATAAAGAGTGAAACAATCATGCCCGATGAACCTTATGAAAACAAGGTAGATGATATGCGACTGCAAGATCGAATGAAGTTTTTCTCATCTCGAAAGGTTTTTGGTTCTACGTCAAGGGATTACGAGCATCCAAAACCTTCTGACCCTGGATGTAGTTCTCTTGTTTCAGAACCTTCTAGTTTAATGAACATTAAACATCGACGCAAGCGGAAAAAAACTGCCACGAATTCAGTTGAAACAGCACTCGAGGAAGATGCCCCTGGCCTTCTGCAGATACTAGTTGGCAAAGGTGTAGAAATTGATGAAATTAAGCTTTATGGAGAGATGGAAAGTGATGATGATCTGCATGAGTCATTTAGTGAAGACAGCTTTGGTGAGCTTGAAGCTGTGATATCGAGGCTGTTTTCTCAACGCCATTCCTTTTTGAAGTTTCCTTCTATAAGATGCACAAAAGCTTCTAGAGCAAGCTATTGCTTAGCTTGTCTAGTTTCACTTATTGAGCAGACAAGATATCTTCATTTCCGGAGTTGGCCTGTCGAATGGGGGTGGTGCCGTGATCTTCAGTCTTTTATATTTGTATTCGAGAGACATAAAAGAATAGTGCTGGAACGTCCCGAGTATGGCTATGCTACATATTTTTTTGAGCTTGTCGATTCCTTACCTATCGACTGGCAGATAAAGCGGTTGGTGATTGCTTTGAAGCTTACTAGTTGTAGCAGAATTTCACTAATTGAGAACAAACCATTGTTG

Protein sequence

MQRVRIKKIIIIIISTCKRPEAGGRPGILQRRALSRSQLVTPGKNLAPRSEIGLKTLLRGWRHRARTGSDVGMNHSISSDELDHLSLEVRQKMLLEKTQRFLWAIFRLCNDLLFKKEDECCDLQGVSSVISACDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISCTMAFENPTPPEVPDWVRVESTGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALGFMNHHVEGSSGVSSGNLIQCFLKQKEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVICSPNTAFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLFNSTKVKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRRKRKKTATNSVETALEEDAPGLLQILVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKFPSIRCTKASRASYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL
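
The protein sequence above corresponds to a translation of the CDS. The sketch below illustrates that relationship on the first ten codons only, assuming the standard nuclear genetic code; the helper `translate` is hypothetical and is not the pipeline used to produce this record.

```python
# Standard genetic code, built from the conventional TCAG-ordered table.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence codon by codon; '*' marks a stop."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First ten codons of the CDS listed above, written out one codon per literal.
cds_start = "ATG" "CAG" "AGA" "GTT" "CGC" "ATA" "AAA" "AAA" "ATA" "ATA"
peptide = translate(cds_start)
print(peptide)
```

The result matches the first ten residues of the protein sequence (MQRVRIKKII).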
Homology
BLAST of Sgr016933 vs. NCBI nr
Match: XP_022141525.1 (uncharacterized protein LOC111011878 isoform X2 [Momordica charantia])

HSP 1 Score: 933.7 bits (2412), Expect = 8.5e-268
Identity = 472/582 (81.10%), Positives = 502/582 (86.25%), Query Frame = 0

Query: 73  MNHSISSDELDHLSLEVRQKMLLEKTQRFL----WAIFRLCNDLLFKKEDECCDLQGVSS 132
           MNH  S DELDHLSL  RQKMLLE     L      I  LCND + K+EDECCD+QGVSS
Sbjct: 1   MNHGTSFDELDHLSLVQRQKMLLENKHPLLEDGSKIISPLCNDFIVKEEDECCDVQGVSS 60

Query: 133 VISACDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISCTMAFENPTPPEVPDWV 192
           +IS  D  NLG QQLE Q+ DTSGHLED Y+E ARLN EKQISCTM FENPTPPEVPDWV
Sbjct: 61  MISTRDDGNLGGQQLETQMKDTSGHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWV 120

Query: 193 RVESTGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALG 252
           RVESTGIL G L DGVDNF SAGVAVTKVKNE FDDF+EDLDHVV IERLRMLLSR+ALG
Sbjct: 121 RVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALG 180

Query: 253 FMNHHVEGSSGVSSGNLIQCFLKQKEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVIC 312
            MN HVEG SG SSG+ +QCFLKQK K MF+N EL G  N LHD+ G DAP L  PSV+C
Sbjct: 181 SMNQHVEGGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVC 240

Query: 313 SPNTAFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLF 372
           SP    SGS FSS+ SLNK TESGNDMELKEDD+IC SEKV TELG RLLT+H PE NLF
Sbjct: 241 SPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLTNHAPEANLF 300

Query: 373 NSTKVKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFF 432
            STKVKDEPYDHVDGCNL+ KD  N+CSRILS+KSET MPDEPYENKVDDMRLQDRMKFF
Sbjct: 301 YSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFF 360

Query: 433 SSRKVFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRRKRKKTATNSVETALEEDAPG 492
           SSRKVFGSTSRDYEHPKPSDPGCSSLVSEP+SLMN+K RRK K+TATNS+ETALEEDAPG
Sbjct: 361 SSRKVFGSTSRDYEHPKPSDPGCSSLVSEPASLMNVKRRRKWKRTATNSIETALEEDAPG 420

Query: 493 LLQILVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKFPSIR 552
           LLQILV KGV +DEIKLYGEMESDDDL ESFSE+SFGELEAVISRLFSQR SFLKFP IR
Sbjct: 421 LLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPIR 480

Query: 553 CTKASRASYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYG 612
           CTKASR+SYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYG
Sbjct: 481 CTKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYG 540

Query: 613 YATYFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL 651
           YATYFFELV+ LPI WQIKRLVIALKLT+CSRISL+EN+PLL
Sbjct: 541 YATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLL 582
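
The percentages in an HSP summary line are the match counts divided by the alignment length. As a hedged illustration, the sketch below recomputes them from the counts reported for the hit above; the function name `check_hsp_fractions` is hypothetical, and the regular expression assumes the `count/length (percent%)` layout shown in this record.

```python
import re

def check_hsp_fractions(line):
    """Return (matches, length, stated_pct, recomputed_pct) for each fraction."""
    results = []
    for matches, length, stated in re.findall(r"(\d+)/(\d+) \(([\d.]+)%\)", line):
        recomputed = round(100 * int(matches) / int(length), 2)
        results.append((int(matches), int(length), float(stated), recomputed))
    return results

summary = "Identity = 472/582 (81.10%), Positives = 502/582 (86.25%), Query Frame = 0"
fractions = check_hsp_fractions(summary)
for matches, length, stated, recomputed in fractions:
    print(f"{matches}/{length}: stated {stated:.2f}%, recomputed {recomputed:.2f}%")
```

Both recomputed values agree with the stated ones: 81.10% identity and 86.25% positives over a 582-column alignment.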

BLAST of Sgr016933 vs. NCBI nr
Match: XP_022141523.1 (uncharacterized protein LOC111011878 isoform X1 [Momordica charantia])

HSP 1 Score: 927.9 bits (2397), Expect = 4.7e-266
Identity = 472/586 (80.55%), Positives = 502/586 (85.67%), Query Frame = 0

Query: 73  MNHSISSDELDHLSLEVRQKMLLEKTQRFL----WAIFRLCNDLLFKKEDECCDLQGVSS 132
           MNH  S DELDHLSL  RQKMLLE     L      I  LCND + K+EDECCD+QGVSS
Sbjct: 1   MNHGTSFDELDHLSLVQRQKMLLENKHPLLEDGSKIISPLCNDFIVKEEDECCDVQGVSS 60

Query: 133 VISACDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISCTMAFENPTPPEVPDWV 192
           +IS  D  NLG QQLE Q+ DTSGHLED Y+E ARLN EKQISCTM FENPTPPEVPDWV
Sbjct: 61  MISTRDDGNLGGQQLETQMKDTSGHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWV 120

Query: 193 RVESTGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALG 252
           RVESTGIL G L DGVDNF SAGVAVTKVKNE FDDF+EDLDHVV IERLRMLLSR+ALG
Sbjct: 121 RVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALG 180

Query: 253 FMNHHVEGSSGVSSGNLIQCFLKQKEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVIC 312
            MN HVEG SG SSG+ +QCFLKQK K MF+N EL G  N LHD+ G DAP L  PSV+C
Sbjct: 181 SMNQHVEGGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVC 240

Query: 313 SPNTAFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLF 372
           SP    SGS FSS+ SLNK TESGNDMELKEDD+IC SEKV TELG RLLT+H PE NLF
Sbjct: 241 SPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLTNHAPEANLF 300

Query: 373 NSTKVKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFF 432
            STKVKDEPYDHVDGCNL+ KD  N+CSRILS+KSET MPDEPYENKVDDMRLQDRMKFF
Sbjct: 301 YSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFF 360

Query: 433 SSRKVFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRRKRKKTATNSVETALEEDAPG 492
           SSRKVFGSTSRDYEHPKPSDPGCSSLVSEP+SLMN+K RRK K+TATNS+ETALEEDAPG
Sbjct: 361 SSRKVFGSTSRDYEHPKPSDPGCSSLVSEPASLMNVKRRRKWKRTATNSIETALEEDAPG 420

Query: 493 LL----QILVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKF 552
           LL    QILV KGV +DEIKLYGEMESDDDL ESFSE+SFGELEAVISRLFSQR SFLKF
Sbjct: 421 LLQFHWQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKF 480

Query: 553 PSIRCTKASRASYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLER 612
           P IRCTKASR+SYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLER
Sbjct: 481 PPIRCTKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLER 540

Query: 613 PEYGYATYFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL 651
           PEYGYATYFFELV+ LPI WQIKRLVIALKLT+CSRISL+EN+PLL
Sbjct: 541 PEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLL 586

BLAST of Sgr016933 vs. NCBI nr
Match: KAG6579038.1 (hypothetical protein SDJN03_23486, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 873.6 bits (2256), Expect = 1.0e-249
Identity = 453/579 (78.24%), Positives = 489/579 (84.46%), Query Frame = 0

Query: 73  MNHSISSDELDHLSLEVRQKMLLEKTQRFLWAIFRLCNDLLFKKEDECCDLQGVSSVISA 132
           MN  ISSDELDHLSL VR+KML E     L    +  +  + KKE+ECCDLQG S++ISA
Sbjct: 1   MNRGISSDELDHLSLAVRRKMLQENKFTLLEDESKRISTFV-KKENECCDLQGGSTMISA 60

Query: 133 CDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISCTMAFENPTPPEVPDWVRVES 192
           CDARNLGDQQLEAQINDT+GH  D YSEGARLN E        FENPTPPEV D VRVES
Sbjct: 61  CDARNLGDQQLEAQINDTNGHHMDNYSEGARLNRE-----NTTFENPTPPEVLDRVRVES 120

Query: 193 TGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALGFMNH 252
           T IL G LA GVDNFA AGVAVTKVKNE FDDF+EDLDHV+LIERLRMLLSRRALG MN 
Sbjct: 121 TSILSGTLAAGVDNFAPAGVAVTKVKNEMFDDFNEDLDHVLLIERLRMLLSRRALGLMNQ 180

Query: 253 HVEGSSGVSSGNLIQCFLKQKEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVICSPNT 312
           HVEG SGV SG+L+QCFLKQK K MFA+ E M IGN LHDK GS APR C PSV+CSPN 
Sbjct: 181 HVEGGSGVPSGDLLQCFLKQKAKSMFASEERMEIGNVLHDKSGSYAPRHCSPSVVCSPNA 240

Query: 313 AFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLFNSTK 372
             SGS FSS+HSLNKSTESGNDMELKE DKICSSEKVATELG R LT+HVP+ NL +STK
Sbjct: 241 TLSGSYFSSNHSLNKSTESGNDMELKE-DKICSSEKVATELGSRHLTNHVPQANLLSSTK 300

Query: 373 VKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFFSSRK 432
           VKDEPYDH +GC++Y KD  N+ S  LSIKSET MPDEPYENKVDDM LQDRMKFFSSRK
Sbjct: 301 VKDEPYDHGEGCSIYGKDMNNVYSNTLSIKSETTMPDEPYENKVDDMPLQDRMKFFSSRK 360

Query: 433 VFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRRKRKKTATNSVETALEEDAPGLLQI 492
             G TS DYEHPKPSDPGCS LVSEP +  N K RRK+KKTATNS+ETALEEDAPGLLQI
Sbjct: 361 DIGFTSMDYEHPKPSDPGCSVLVSEPVNFPNTKRRRKQKKTATNSIETALEEDAPGLLQI 420

Query: 493 LVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKFPS-IRCTK 552
           LV KG+++DEIKLYGE ESDDDL ES SEDSF ELE VI+RLF QRHSFLKFPS IRCTK
Sbjct: 421 LVEKGIQVDEIKLYGETESDDDLDESSSEDSFRELEDVITRLFPQRHSFLKFPSIIRCTK 480

Query: 553 ASRASYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYAT 612
           ASRASYCLACLVSLIEQTRYLHFR+WPVEWGWCRDLQSFIFVFERHKRIV+ERPEYGYAT
Sbjct: 481 ASRASYCLACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYAT 540

Query: 613 YFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL 651
           YFFELV+SLPI WQIKRLVIA+KLT+CSRISL+EN+PLL
Sbjct: 541 YFFELVESLPISWQIKRLVIAMKLTNCSRISLLENRPLL 572

BLAST of Sgr016933 vs. NCBI nr
Match: XP_022939493.1 (uncharacterized protein LOC111445382 isoform X1 [Cucurbita moschata] >XP_022939494.1 uncharacterized protein LOC111445382 isoform X1 [Cucurbita moschata])

HSP 1 Score: 870.5 bits (2248), Expect = 8.8e-249
Identity = 452/579 (78.07%), Positives = 487/579 (84.11%), Query Frame = 0

Query: 73  MNHSISSDELDHLSLEVRQKMLLEKTQRFLWAIFRLCNDLLFKKEDECCDLQGVSSVISA 132
           MN  ISSDELDHLSL VR+KML E     L    +  +  + KKE+ECCDLQG S++ISA
Sbjct: 1   MNRGISSDELDHLSLAVRRKMLQENKLTLLEDESKGISTFV-KKENECCDLQGGSTMISA 60

Query: 133 CDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISCTMAFENPTPPEVPDWVRVES 192
           CDARNLGDQQLEAQINDT+GH  D YSEGARLN E        FENPTPPEV D VRVES
Sbjct: 61  CDARNLGDQQLEAQINDTNGHHMDNYSEGARLNRE-----NTTFENPTPPEVLDRVRVES 120

Query: 193 TGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALGFMNH 252
           T IL G L  GVDNFA AGVAVTKVKNE FDDFDEDLDHV+LIERLRMLLSRRALG MN 
Sbjct: 121 TSILSGTLVAGVDNFAPAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGLMNQ 180

Query: 253 HVEGSSGVSSGNLIQCFLKQKEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVICSPNT 312
           HVEG SGV SG+L+QCFLKQK K MFA+ E M IGN LHDK GS APR C PSV+CSPN 
Sbjct: 181 HVEGGSGVPSGDLLQCFLKQKAKSMFASEERMEIGNVLHDKSGSYAPRHCSPSVVCSPNA 240

Query: 313 AFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLFNSTK 372
             SGS FSS+HSLNKSTESGNDMELKE DKICSSEKVATELG R LT+HVP+ NL +STK
Sbjct: 241 TLSGSYFSSNHSLNKSTESGNDMELKE-DKICSSEKVATELGSRHLTNHVPQENLLSSTK 300

Query: 373 VKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFFSSRK 432
           VKDEPYDH +GC++Y KD  N+ S  LSIKSET MPDEPYENKVDDM LQDRMKFFSSRK
Sbjct: 301 VKDEPYDHGEGCSIYGKDMNNVYSNTLSIKSETTMPDEPYENKVDDMPLQDRMKFFSSRK 360

Query: 433 VFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRRKRKKTATNSVETALEEDAPGLLQI 492
             G TS DYEHPKPSDPGCS LVSEP +  N K RRK+KKTATNS+ETALEEDAPGLLQI
Sbjct: 361 DIGFTSMDYEHPKPSDPGCSVLVSEPVNFPNTKRRRKQKKTATNSIETALEEDAPGLLQI 420

Query: 493 LVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKFPS-IRCTK 552
           LV KG+++DEIKLYGE ESDDDL ES SEDSF ELE VI+RLF QRHSFLKFPS IRC K
Sbjct: 421 LVEKGIQVDEIKLYGETESDDDLDESSSEDSFRELEDVITRLFPQRHSFLKFPSIIRCMK 480

Query: 553 ASRASYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYAT 612
           ASRASYCLACLVSLIEQTRYLHFR+WPVEWGWCRDLQSFIFVFERHKRIV+ERPEYGYAT
Sbjct: 481 ASRASYCLACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYAT 540

Query: 613 YFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL 651
           YFFELV+SLPI WQIKRLVIA+KLT+CSRISL+EN+PLL
Sbjct: 541 YFFELVESLPISWQIKRLVIAMKLTNCSRISLLENRPLL 572

BLAST of Sgr016933 vs. NCBI nr
Match: XP_022141526.1 (uncharacterized protein LOC111011878 isoform X3 [Momordica charantia])

HSP 1 Score: 864.0 bits (2231), Expect = 8.3e-247
Identity = 435/526 (82.70%), Positives = 462/526 (87.83%), Query Frame = 0

Query: 129 VISACDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISCTMAFENPTPPEVPDWV 188
           +IS  D  NLG QQLE Q+ DTSGHLED Y+E ARLN EKQISCTM FENPTPPEVPDWV
Sbjct: 1   MISTRDDGNLGGQQLETQMKDTSGHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWV 60

Query: 189 RVESTGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALG 248
           RVESTGIL G L DGVDNF SAGVAVTKVKNE FDDF+EDLDHVV IERLRMLLSR+ALG
Sbjct: 61  RVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALG 120

Query: 249 FMNHHVEGSSGVSSGNLIQCFLKQKEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVIC 308
            MN HVEG SG SSG+ +QCFLKQK K MF+N EL G  N LHD+ G DAP L  PSV+C
Sbjct: 121 SMNQHVEGGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVC 180

Query: 309 SPNTAFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLF 368
           SP    SGS FSS+ SLNK TESGNDMELKEDD+IC SEKV TELG RLLT+H PE NLF
Sbjct: 181 SPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLTNHAPEANLF 240

Query: 369 NSTKVKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFF 428
            STKVKDEPYDHVDGCNL+ KD  N+CSRILS+KSET MPDEPYENKVDDMRLQDRMKFF
Sbjct: 241 YSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFF 300

Query: 429 SSRKVFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRRKRKKTATNSVETALEEDAPG 488
           SSRKVFGSTSRDYEHPKPSDPGCSSLVSEP+SLMN+K RRK K+TATNS+ETALEEDAPG
Sbjct: 301 SSRKVFGSTSRDYEHPKPSDPGCSSLVSEPASLMNVKRRRKWKRTATNSIETALEEDAPG 360

Query: 489 LL----QILVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKF 548
           LL    QILV KGV +DEIKLYGEMESDDDL ESFSE+SFGELEAVISRLFSQR SFLKF
Sbjct: 361 LLQFHWQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKF 420

Query: 549 PSIRCTKASRASYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLER 608
           P IRCTKASR+SYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLER
Sbjct: 421 PPIRCTKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLER 480

Query: 609 PEYGYATYFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL 651
           PEYGYATYFFELV+ LPI WQIKRLVIALKLT+CSRISL+EN+PLL
Sbjct: 481 PEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLL 526

BLAST of Sgr016933 vs. ExPASy TrEMBL
Match: A0A6J1CIB2 (uncharacterized protein LOC111011878 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111011878 PE=4 SV=1)

HSP 1 Score: 933.7 bits (2412), Expect = 4.1e-268
Identity = 472/582 (81.10%), Positives = 502/582 (86.25%), Query Frame = 0

Query: 73  MNHSISSDELDHLSLEVRQKMLLEKTQRFL----WAIFRLCNDLLFKKEDECCDLQGVSS 132
           MNH  S DELDHLSL  RQKMLLE     L      I  LCND + K+EDECCD+QGVSS
Sbjct: 1   MNHGTSFDELDHLSLVQRQKMLLENKHPLLEDGSKIISPLCNDFIVKEEDECCDVQGVSS 60

Query: 133 VISACDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISCTMAFENPTPPEVPDWV 192
           +IS  D  NLG QQLE Q+ DTSGHLED Y+E ARLN EKQISCTM FENPTPPEVPDWV
Sbjct: 61  MISTRDDGNLGGQQLETQMKDTSGHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWV 120

Query: 193 RVESTGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALG 252
           RVESTGIL G L DGVDNF SAGVAVTKVKNE FDDF+EDLDHVV IERLRMLLSR+ALG
Sbjct: 121 RVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALG 180

Query: 253 FMNHHVEGSSGVSSGNLIQCFLKQKEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVIC 312
            MN HVEG SG SSG+ +QCFLKQK K MF+N EL G  N LHD+ G DAP L  PSV+C
Sbjct: 181 SMNQHVEGGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVC 240

Query: 313 SPNTAFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLF 372
           SP    SGS FSS+ SLNK TESGNDMELKEDD+IC SEKV TELG RLLT+H PE NLF
Sbjct: 241 SPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLTNHAPEANLF 300

Query: 373 NSTKVKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFF 432
            STKVKDEPYDHVDGCNL+ KD  N+CSRILS+KSET MPDEPYENKVDDMRLQDRMKFF
Sbjct: 301 YSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFF 360

Query: 433 SSRKVFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRRKRKKTATNSVETALEEDAPG 492
           SSRKVFGSTSRDYEHPKPSDPGCSSLVSEP+SLMN+K RRK K+TATNS+ETALEEDAPG
Sbjct: 361 SSRKVFGSTSRDYEHPKPSDPGCSSLVSEPASLMNVKRRRKWKRTATNSIETALEEDAPG 420

Query: 493 LLQILVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKFPSIR 552
           LLQILV KGV +DEIKLYGEMESDDDL ESFSE+SFGELEAVISRLFSQR SFLKFP IR
Sbjct: 421 LLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPIR 480

Query: 553 CTKASRASYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYG 612
           CTKASR+SYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYG
Sbjct: 481 CTKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYG 540

Query: 613 YATYFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL 651
           YATYFFELV+ LPI WQIKRLVIALKLT+CSRISL+EN+PLL
Sbjct: 541 YATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLL 582

BLAST of Sgr016933 vs. ExPASy TrEMBL
Match: A0A6J1CKR0 (uncharacterized protein LOC111011878 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111011878 PE=4 SV=1)

HSP 1 Score: 927.9 bits (2397), Expect = 2.3e-266
Identity = 472/586 (80.55%), Positives = 502/586 (85.67%), Query Frame = 0

Query: 73  MNHSISSDELDHLSLEVRQKMLLEKTQRFL----WAIFRLCNDLLFKKEDECCDLQGVSS 132
           MNH  S DELDHLSL  RQKMLLE     L      I  LCND + K+EDECCD+QGVSS
Sbjct: 1   MNHGTSFDELDHLSLVQRQKMLLENKHPLLEDGSKIISPLCNDFIVKEEDECCDVQGVSS 60

Query: 133 VISACDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISCTMAFENPTPPEVPDWV 192
           +IS  D  NLG QQLE Q+ DTSGHLED Y+E ARLN EKQISCTM FENPTPPEVPDWV
Sbjct: 61  MISTRDDGNLGGQQLETQMKDTSGHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWV 120

Query: 193 RVESTGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALG 252
           RVESTGIL G L DGVDNF SAGVAVTKVKNE FDDF+EDLDHVV IERLRMLLSR+ALG
Sbjct: 121 RVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALG 180

Query: 253 FMNHHVEGSSGVSSGNLIQCFLKQKEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVIC 312
            MN HVEG SG SSG+ +QCFLKQK K MF+N EL G  N LHD+ G DAP L  PSV+C
Sbjct: 181 SMNQHVEGGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVC 240

Query: 313 SPNTAFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLF 372
           SP    SGS FSS+ SLNK TESGNDMELKEDD+IC SEKV TELG RLLT+H PE NLF
Sbjct: 241 SPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLTNHAPEANLF 300

Query: 373 NSTKVKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFF 432
            STKVKDEPYDHVDGCNL+ KD  N+CSRILS+KSET MPDEPYENKVDDMRLQDRMKFF
Sbjct: 301 YSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFF 360

Query: 433 SSRKVFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRRKRKKTATNSVETALEEDAPG 492
           SSRKVFGSTSRDYEHPKPSDPGCSSLVSEP+SLMN+K RRK K+TATNS+ETALEEDAPG
Sbjct: 361 SSRKVFGSTSRDYEHPKPSDPGCSSLVSEPASLMNVKRRRKWKRTATNSIETALEEDAPG 420

Query: 493 LL----QILVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKF 552
           LL    QILV KGV +DEIKLYGEMESDDDL ESFSE+SFGELEAVISRLFSQR SFLKF
Sbjct: 421 LLQFHWQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKF 480

Query: 553 PSIRCTKASRASYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLER 612
           P IRCTKASR+SYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLER
Sbjct: 481 PPIRCTKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLER 540

Query: 613 PEYGYATYFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL 651
           PEYGYATYFFELV+ LPI WQIKRLVIALKLT+CSRISL+EN+PLL
Sbjct: 541 PEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLL 586
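
The percentages on each HSP summary line are the counts divided by the alignment length, which includes gap columns. The sketch below is an illustration only, not part of the database's own tooling: it parses one summary line and recomputes both percentages. The function name `parse_hsp_stats` and the returned dictionary keys are hypothetical, and the pattern assumes the line layout shown in the records on this page.

```python
import re

# Assumed layout: "Identity = 472/586 (80.55%), Positives = 502/586 (85.67%), ..."
# "Posi?tives" also accepts the misspelling "Postives" that some exports emit.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Parse an HSP summary line and recompute its percentages (hypothetical helper)."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, length, ident_pct, pos, pos_length, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(length),
        # Recomputed from the raw counts, rounded to two decimals like the report.
        "identity_pct": round(100 * int(ident) / int(length), 2),
        "positives_pct": round(100 * int(pos) / int(pos_length), 2),
        # Values as printed in the report, kept for cross-checking.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }

stats = parse_hsp_stats(
    "Identity = 472/586 (80.55%), Positives = 502/586 (85.67%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

For the A0A6J1CKR0 hit, 472 identical positions over 586 alignment columns reproduces the reported identity, and 502 over 586 reproduces the reported positives figure.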

BLAST of Sgr016933 vs. ExPASy TrEMBL
Match: A0A6J1FLT1 (uncharacterized protein LOC111445382 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111445382 PE=4 SV=1)

HSP 1 Score: 870.5 bits (2248), Expect = 4.3e-249
Identity = 452/579 (78.07%), Positives = 487/579 (84.11%), Query Frame = 0

Query: 73  MNHSISSDELDHLSLEVRQKMLLEKTQRFLWAIFRLCNDLLFKKEDECCDLQGVSSVISA 132
           MN  ISSDELDHLSL VR+KML E     L    +  +  + KKE+ECCDLQG S++ISA
Sbjct: 1   MNRGISSDELDHLSLAVRRKMLQENKLTLLEDESKGISTFV-KKENECCDLQGGSTMISA 60

Query: 133 CDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISCTMAFENPTPPEVPDWVRVES 192
           CDARNLGDQQLEAQINDT+GH  D YSEGARLN E        FENPTPPEV D VRVES
Sbjct: 61  CDARNLGDQQLEAQINDTNGHHMDNYSEGARLNRE-----NTTFENPTPPEVLDRVRVES 120

Query: 193 TGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALGFMNH 252
           T IL G L  GVDNFA AGVAVTKVKNE FDDFDEDLDHV+LIERLRMLLSRRALG MN 
Sbjct: 121 TSILSGTLVAGVDNFAPAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGLMNQ 180

Query: 253 HVEGSSGVSSGNLIQCFLKQKEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVICSPNT 312
           HVEG SGV SG+L+QCFLKQK K MFA+ E M IGN LHDK GS APR C PSV+CSPN 
Sbjct: 181 HVEGGSGVPSGDLLQCFLKQKAKSMFASEERMEIGNVLHDKSGSYAPRHCSPSVVCSPNA 240

Query: 313 AFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLFNSTK 372
             SGS FSS+HSLNKSTESGNDMELKE DKICSSEKVATELG R LT+HVP+ NL +STK
Sbjct: 241 TLSGSYFSSNHSLNKSTESGNDMELKE-DKICSSEKVATELGSRHLTNHVPQENLLSSTK 300

Query: 373 VKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFFSSRK 432
           VKDEPYDH +GC++Y KD  N+ S  LSIKSET MPDEPYENKVDDM LQDRMKFFSSRK
Sbjct: 301 VKDEPYDHGEGCSIYGKDMNNVYSNTLSIKSETTMPDEPYENKVDDMPLQDRMKFFSSRK 360

Query: 433 VFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRRKRKKTATNSVETALEEDAPGLLQI 492
             G TS DYEHPKPSDPGCS LVSEP +  N K RRK+KKTATNS+ETALEEDAPGLLQI
Sbjct: 361 DIGFTSMDYEHPKPSDPGCSVLVSEPVNFPNTKRRRKQKKTATNSIETALEEDAPGLLQI 420

Query: 493 LVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKFPS-IRCTK 552
           LV KG+++DEIKLYGE ESDDDL ES SEDSF ELE VI+RLF QRHSFLKFPS IRC K
Sbjct: 421 LVEKGIQVDEIKLYGETESDDDLDESSSEDSFRELEDVITRLFPQRHSFLKFPSIIRCMK 480

Query: 553 ASRASYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYAT 612
           ASRASYCLACLVSLIEQTRYLHFR+WPVEWGWCRDLQSFIFVFERHKRIV+ERPEYGYAT
Sbjct: 481 ASRASYCLACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYAT 540

Query: 613 YFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL 651
           YFFELV+SLPI WQIKRLVIA+KLT+CSRISL+EN+PLL
Sbjct: 541 YFFELVESLPISWQIKRLVIAMKLTNCSRISLLENRPLL 572

BLAST of Sgr016933 vs. ExPASy TrEMBL
Match: A0A6J1CJF8 (uncharacterized protein LOC111011878 isoform X3 OS=Momordica charantia OX=3673 GN=LOC111011878 PE=4 SV=1)

HSP 1 Score: 864.0 bits (2231), Expect = 4.0e-247
Identity = 435/526 (82.70%), Positives = 462/526 (87.83%), Query Frame = 0

Query: 129 VISACDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISCTMAFENPTPPEVPDWV 188
           +IS  D  NLG QQLE Q+ DTSGHLED Y+E ARLN EKQISCTM FENPTPPEVPDWV
Sbjct: 1   MISTRDDGNLGGQQLETQMKDTSGHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWV 60

Query: 189 RVESTGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALG 248
           RVESTGIL G L DGVDNF SAGVAVTKVKNE FDDF+EDLDHVV IERLRMLLSR+ALG
Sbjct: 61  RVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALG 120

Query: 249 FMNHHVEGSSGVSSGNLIQCFLKQKEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVIC 308
            MN HVEG SG SSG+ +QCFLKQK K MF+N EL G  N LHD+ G DAP L  PSV+C
Sbjct: 121 SMNQHVEGGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVC 180

Query: 309 SPNTAFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLF 368
           SP    SGS FSS+ SLNK TESGNDMELKEDD+IC SEKV TELG RLLT+H PE NLF
Sbjct: 181 SPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLTNHAPEANLF 240

Query: 369 NSTKVKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFF 428
            STKVKDEPYDHVDGCNL+ KD  N+CSRILS+KSET MPDEPYENKVDDMRLQDRMKFF
Sbjct: 241 YSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFF 300

Query: 429 SSRKVFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRRKRKKTATNSVETALEEDAPG 488
           SSRKVFGSTSRDYEHPKPSDPGCSSLVSEP+SLMN+K RRK K+TATNS+ETALEEDAPG
Sbjct: 301 SSRKVFGSTSRDYEHPKPSDPGCSSLVSEPASLMNVKRRRKWKRTATNSIETALEEDAPG 360

Query: 489 LL----QILVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKF 548
           LL    QILV KGV +DEIKLYGEMESDDDL ESFSE+SFGELEAVISRLFSQR SFLKF
Sbjct: 361 LLQFHWQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKF 420

Query: 549 PSIRCTKASRASYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLER 608
           P IRCTKASR+SYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLER
Sbjct: 421 PPIRCTKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLER 480

Query: 609 PEYGYATYFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL 651
           PEYGYATYFFELV+ LPI WQIKRLVIALKLT+CSRISL+EN+PLL
Sbjct: 481 PEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLL 526

BLAST of Sgr016933 vs. ExPASy TrEMBL
Match: A0A6J1JZL8 (uncharacterized protein LOC111489311 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111489311 PE=4 SV=1)

HSP 1 Score: 845.1 bits (2182), Expect = 1.9e-241
Identity = 441/579 (76.17%), Positives = 481/579 (83.07%), Query Frame = 0

Query: 73  MNHSISSDELDHLSLEVRQKMLLEKTQRFLWAIFRLCNDLLFKKEDECCDLQGVSSVISA 132
           MN  ISSDELDHLSL VR+KML E     L    +  +  + KKE+ECCDLQG S++IS 
Sbjct: 1   MNRGISSDELDHLSLAVRRKMLQENKLTLLEDESKRISTFV-KKENECCDLQGGSTMIS- 60

Query: 133 CDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISCTMAFENPTPPEVPDWVRVES 192
              RNLGDQQLEA+INDT+GHL D YSEGARLN E        FENPTPPEV D VRVES
Sbjct: 61  ---RNLGDQQLEAEINDTNGHLMDNYSEGARLNRENS-----TFENPTPPEVLDRVRVES 120

Query: 193 TGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALGFMNH 252
           T IL G LA  VDNFA AGVAVTKVKNE FDDFDEDLDHV+LIERLRMLLSRR+LG MN 
Sbjct: 121 TSILSGTLAARVDNFAPAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRSLGLMNQ 180

Query: 253 HVEGSSGVSSGNLIQCFLKQKEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVICSPNT 312
           HVEG SGV SG+L+QCFLKQK K MFA+ E M IGN LHDK  S APR C PSV+CSPN 
Sbjct: 181 HVEGGSGVPSGDLLQCFLKQKAKSMFASEERMEIGNVLHDKSVSYAPRHCSPSVVCSPNA 240

Query: 313 AFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLFNSTK 372
             SGS FSS+HSLNKSTESGNDMELKE DKI SS+KVATELG R LT+HVP+ NL +STK
Sbjct: 241 TLSGSYFSSNHSLNKSTESGNDMELKE-DKIYSSDKVATELGSRHLTNHVPQANLLSSTK 300

Query: 373 VKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFFSSRK 432
           VKDEPYDH +GC++Y KD  N+    LS+KSET MPDEP+ENKVDDM LQDRMKFFSSRK
Sbjct: 301 VKDEPYDHGEGCSIYGKDMNNVYGNTLSLKSETTMPDEPFENKVDDMPLQDRMKFFSSRK 360

Query: 433 VFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRRKRKKTATNSVETALEEDAPGLLQI 492
            FG TS DYEHPKPSDPGCS LVSEP +  N K RRK KKTATNS+ETALEEDAPGLLQI
Sbjct: 361 DFGFTSMDYEHPKPSDPGCSILVSEPVNFPNTKRRRKEKKTATNSIETALEEDAPGLLQI 420

Query: 493 LVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKFPS-IRCTK 552
           LV KG+++DEIKLYGE ESDDDL ES SEDSF ELE VI+RLF QRHSFLKFPS IRC K
Sbjct: 421 LVEKGIQVDEIKLYGETESDDDLDESSSEDSFRELEDVITRLFPQRHSFLKFPSIIRCIK 480

Query: 553 ASRASYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYAT 612
           ASRASYCLACLVSLIEQTRYLHFR+WPVEWGWCRDLQSFIFVF+RHKRIV+ERPEYGYAT
Sbjct: 481 ASRASYCLACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFQRHKRIVMERPEYGYAT 540

Query: 613 YFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL 651
           YFFELV+SLPI WQIKRLVIA+KLT+CSRISL+EN+PLL
Sbjct: 541 YFFELVESLPISWQIKRLVIAMKLTNCSRISLLENRPLL 568

BLAST of Sgr016933 vs. TAIR 10
Match: AT5G16610.2 (unknown protein; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 277.3 bits (708), Expect = 3.1e-74
Identity = 220/611 (36.01%), Positives = 305/611 (49.92%), Query Frame = 0

Query: 81  ELDHLSLEVRQKMLL--EKTQRFLWAIFRLCN-DLLFKKEDECCDLQGVSSVISACDARN 140
           E DHL L  R+ +LL  E+  + + A     N D + K+E+E C  +    V+S CDA  
Sbjct: 22  EEDHLPLTSRRSLLLSSERVSQRIAAYVPASNVDSVLKREEEDCFNE--LGVVSNCDATE 81

Query: 141 LGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISCTMAFENPTPPEVPDWVRVESTGILP 200
               ++   +N                     I C+   +        D   +     + 
Sbjct: 82  SVSTEILESMN---------------------IGCSQGLK--------DSGNIRPQNNIL 141

Query: 201 GFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALGFMNHHVEGS 260
           G  ++ V+NF   G        ET  +  +DL+H+ L ER +MLL R A+     +VE +
Sbjct: 142 GCCSNAVENFNRVG--------ET--ERSDDLEHLTLKERRKMLLERVAIRLPESNVEDN 201

Query: 261 SGVSSGNLIQCFLKQKEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVICSPNTAFSGS 320
           +       +    K K +    NG     G      +      LC    ICS + +  G 
Sbjct: 202 TEDCDETEL---YKIKAEISCENGIASSSGVQFSGFLEKIDSVLCRNFSICSESGSQLGG 261

Query: 321 CFSSDHSLN--KSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVP-EVNLFNSTKVK 380
              SD  ++  +S +   +  L E   + SS K   +   R+  + +P   N   ST+VK
Sbjct: 262 IQESDIPISPERSFDLSPEASLPE---VSSSNKNPRKRVKRVQRNPLPLNENEIQSTQVK 321

Query: 381 DEPYDHVDGCNLYDKDTKN-ICSRILSIKSETIMPDEPY-ENKVDDMRLQDRMKF----- 440
            +P   +  C + D D KN + S+ + +K E     E   EN++D ++L  R+       
Sbjct: 322 VDP---LADCVMEDNDEKNPVTSKQIPVKREVETHGEALDENELDSVKLSFRLNRCTSAP 381

Query: 441 --FSSRKVFGSTSRD-----YEHPKPSD---------PGCSSLVSEPSSLMN-------I 500
             F   K    T+ +      +H K  D          G    ++ PSS  +       +
Sbjct: 382 TPFRCMKNEAETASEMDEDIIDHMKLIDRLKLRSFHGSGHHEDLNSPSSGFSFCTSDEYV 441

Query: 501 KHRR-----KRKKTATNSVETALEEDAPGLLQILVGKGVEIDEIKLYGEMESDDDLHESF 560
           K  R     KRKKTAT+S+ETALEEDAPGLLQ+L+ +GV +DE++LYG    D    +S 
Sbjct: 442 KPSRVFRPWKRKKTATDSIETALEEDAPGLLQVLIQQGVTVDELRLYGNEGGDVPSDDSL 501

Query: 561 SEDSFGELEAVISRLFSQRHSFLKFPSIRCTKASRASYCLACLVSLIEQTRYLHFRSWPV 620
             +SF ELE VIS+LF +R +  K  +   +KASR SYCL CL SLIEQ RYL FR WPV
Sbjct: 502 LNESFSELEDVISQLFYKRETGTKLLNSSFSKASRTSYCLTCLFSLIEQARYLQFRKWPV 561

Query: 621 EWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDSLPIDWQIKRLVIALKLTSCS 651
           EWGWCRDLQSFIFVFERH RIV+ERPEYGYATYFFEL ++  I WQ+KRLV+A+KL SC 
Sbjct: 562 EWGWCRDLQSFIFVFERHNRIVMERPEYGYATYFFELSNTASIRWQVKRLVLAMKLASCG 582

BLAST of Sgr016933 vs. TAIR 10
Match: AT5G16610.1 (unknown protein; Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 256.1 bits (653), Expect = 7.4e-68
Identity = 140/251 (55.78%), Positives = 169/251 (67.33%), Query Frame = 0

Query: 401 IKSETIMPDEPYENKVDDMRLQDRMKFFSSRKVFGS-TSRDYEHPKPSDPGCSSLVSEPS 460
           +K+E     E  E+ +D M+L DR+K    R   GS    D   P      C+S   E  
Sbjct: 193 MKNEAETASEMDEDIIDHMKLIDRLKL---RSFHGSGHHEDLNSPSSGFSFCTS--DEYV 252

Query: 461 SLMNIKHRRKRKKTATNSVETALEEDAPGLLQILVGKGVEIDEIKLYGEMESDDDLHESF 520
               +    KRKKTAT+S+ETALEEDAPGLLQ+L+ +GV +DE++LYG    D    +S 
Sbjct: 253 KPSRVFRPWKRKKTATDSIETALEEDAPGLLQVLIQQGVTVDELRLYGNEGGDVPSDDSL 312

Query: 521 SEDSFGELEAVISRLFSQRHSFLKFPSIRCTKASRASYCLACLVSLIEQTRYLHFRSWPV 580
             +SF ELE VIS+LF +R +  K  +   +KASR SYCL CL SLIEQ RYL FR WPV
Sbjct: 313 LNESFSELEDVISQLFYKRETGTKLLNSSFSKASRTSYCLTCLFSLIEQARYLQFRKWPV 372

Query: 581 EWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDSLPIDWQIKRLVIALKLTSCS 640
           EWGWCRDLQSFIFVFERH RIV+ERPEYGYATYFFEL ++  I WQ+KRLV+A+KL SC 
Sbjct: 373 EWGWCRDLQSFIFVFERHNRIVMERPEYGYATYFFELSNTASIRWQVKRLVLAMKLASCG 432

Query: 641 RISLIENKPLL 651
           R  LIENKPLL
Sbjct: 433 RYQLIENKPLL 438

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_022141525.1 | 8.5e-268 | 81.10 | uncharacterized protein LOC111011878 isoform X2 [Momordica charantia]
XP_022141523.1 | 4.7e-266 | 80.55 | uncharacterized protein LOC111011878 isoform X1 [Momordica charantia]
KAG6579038.1 | 1.0e-249 | 78.24 | hypothetical protein SDJN03_23486, partial [Cucurbita argyrosperma subsp. sorori...
XP_022939493.1 | 8.8e-249 | 78.07 | uncharacterized protein LOC111445382 isoform X1 [Cucurbita moschata] >XP_0229394...
XP_022141526.1 | 8.3e-247 | 82.70 | uncharacterized protein LOC111011878 isoform X3 [Momordica charantia]
Match Name | E-value | Identity (%) | Description

Match Name | E-value | Identity (%) | Description
A0A6J1CIB2 | 4.1e-268 | 81.10 | uncharacterized protein LOC111011878 isoform X2 OS=Momordica charantia OX=3673 G...
A0A6J1CKR0 | 2.3e-266 | 80.55 | uncharacterized protein LOC111011878 isoform X1 OS=Momordica charantia OX=3673 G...
A0A6J1FLT1 | 4.3e-249 | 78.07 | uncharacterized protein LOC111445382 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1CJF8 | 4.0e-247 | 82.70 | uncharacterized protein LOC111011878 isoform X3 OS=Momordica charantia OX=3673 G...
A0A6J1JZL8 | 1.9e-241 | 76.17 | uncharacterized protein LOC111489311 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
Match Name | E-value | Identity (%) | Description
AT5G16610.2 | 3.1e-74 | 36.01 | unknown protein; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae ...
AT5G16610.1 | 7.4e-68 | 55.78 | unknown protein; Has 30201 Blast hits to 17322 proteins in 780 species: Archae -...
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 438..457
None | No IPR available | PANTHER | PTHR47871 | NAC DOMAIN-CONTAINING PROTEIN 8 | coord: 117..650

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Sgr016933.1 | Sgr016933.1 | mRNA