Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCAGGTAGTGGAATAACGTGGAAGAATAGGGTATGTGAAACTACGTAGTCCAATCTAAATCAGATCATAGAACATTTACATTCCAACATGATAAATCTACAAGAACTCCTAAAAAATCAAAAAACTTCCTACTGCAGTTAAAATGTTGACTAAAGCAGCCAAGGAACGATTTGCCAAAATTAAGAAGCAACGCATAGCAATCTATCCCTAAAATCTCACCCCCAAAAATTGAGTAACGTCTAACGTGTAATCGGGTATCTGAAATAATAAGAGAGTTCGCATAAAAAAAATAATAATAATAATAATTAGCACCTGCAAGCGTCCGGAGGCCGGTGGTCGGCCTGGGATTCTTCAGCGCAGGGGTAAAGGAAAGTGACGGGGAAAGATATTCTCTCTGCAAATGATGTAAAGTTACGCAAATTCAAATGATTTAAAACGATTTTCAGCGAAGAATCTCGCCAGAGCGGAAAATAAAGCAGAAACAAAAAAGGCAATGCATACGTCGGAGAAATTGATTTGTGCAGCAGAATTCGGCAAAGCAAGGAGCAGATAAACCCTAAACCAAGCTTGTTTCAGAATTTTACGAATTGGCAATAAAGCGAGAGAGAGAGAGAGAGAGAGAGAGAATCTGGAAGGTTCTGGTGCGTCGCGCGTGGTTGAAAAGTGGGAGCTAGGGCTTGGATTGGGGGAGAATCTTGATCGATTGGGTTATCATCGCATCATACGACGCATCGTGAATGAGGGATGAGTAAGAGGTGGTTACGTAAGCTGTCGAAAGGTCGGCCTGAGCTGTCCACCTACTCGGCCCGCTAGGCCCAACTCATTTGGTTTTTTTATGTCAAATTTATCAAACAAATTTTTTTTTTTTTTTTTTTTTTGAGAAATATCAATTTTCCCTCCTGAATTTGGCGAGGTGTATCAATTTTCAGTATAGACTGTAAATTTCATCAAATTGCACCTTAAACTTAGATAAGTGTTATAATTTTTACCTTAAACTTTATTAAGTGTTGCAATTGTTACTATATAATAATTTTTCGTTTAAATGATTGTTAGATAAGTATTCTCATATACGTTCAATGAATTATAAAACATCCATCCAATAAGAAATTAATAGGACTTAGAGAAAAAGGACGTTAGAGCTAGTTTTTCTGTTAAAATTTGTGAGTTTTGGCAAAATTATTCAATAGTCGATTTTTACAATGATCAAATTTAATTTTATGCAAAACTTGATTTTTATTGAAATTAGTGAGTTTATGATACTTTTGCATTGAAATTTGTTAAAATTAGTGATTGAATTAGTAGAATTTGAATCGAGGTAAAAATTGCAACACTTATCTAAATTTAAAGTCCAATTTGATGAAATTAAAAATTTATGATGAAAATTGATATACCTTACCAAGTTTATGGGTAAAAAAATAATATTTTTTCTTTTTGTATCAATAAACTGTTGGAGGGATTATAACCTCTAACTATTAGAAAAATAAGTAATATTTTTACCAGTTGAGCTATTCTCGTGGCCAATTAAGGTAAGTAAGAGGTTGGGGAAAAATTTTGCATATATTGCTAAATAAAAGAGAAAAAATCGAAATATGTTCAAAAGTTTGTGATACTGTAAAAGTGGCCACAAATCTAAATGCACGTTTTTTATAATTTAAAAATATTTGATATGCCATCCTAGCAATGTTTAGATTTTCCTATTCAAACTATAAAAATCACGTTTAATATTTAATTAATTAAAATTAAAACTATAGAGATAATTAAGGAGAATATTATTATCAACTTTTTAAATTTTTAAATAGATAAATATTAGATTAATCAATATCCATAATAGTATATAACACTCGTGTAAAAAAATATAATAATAATATATAACAACTTACCTTATATAAAATTGAGTGACTCAGTAAGTAAGTAAATAAATATATATATATATATATATATATATTCACAAATTGAGAGTGAGAGATTTTAATCTATGAAGAGGAAGTAGAGATGCTTTAACCGTCGAACTATGCTCGCATTAGCCTCAGTAAATAAAGAGTTAACGGTCGATAATATATTTTTTTTCTCCAAACTCTCCTTGCTTTTTCCACACTCGCCTAATGACCGTTTTTTAGCCCTTTAGCCAACGATTTTATTACTAGTTCAATACAAATAATAGAGTGAGTTTTTTTTAGTTCAATAATATAGTGAGTTTAATATATTATTTTAAAAGGAAATTAAATTATACCAGATAAATTTTCTTCTTTTATTCAGCATTCAATAATAGAGACTTTACTGTGCTAAATTAAAATCATTATTTGATAAAATAAAGTATCATATAATCAGTTTCGAGTTTTGTAATAATGGAAGATAAACATTCTTTAATAATGGTGATGAAAATAATAATTCCATATATTTGAGAAACATTTTCAGACTAATTTATTGTTGTTTTAATTATCTCATTTTTAAATACCAACAGATCAATGTTATTAATCAATATACTAGATTTTAATTTAAATTCATTTATGAGCCTTTAGTAACCTCTCTAGTTTGATATACGAGTTTTGTAAGTGGGATAAATAATGAGTACAAGATTGTCTCGTTTCACACACACAACACAAGTAGATTGATGCAAAAGCTTATTGCAATGCAACAAGTTTTCTAGTTCTTTACACGCATATTAAACATTTTGGCCAATTAATAAGAGAGGCCAACTGCAAAACCAACATTGGATCAATATGTGATCTAACTTCACCAATTGCCGGGTTAACCACAACTATATAAAATCATAATTTTAAAATTAGGAACAAAATTCAAACCTTTTTAAAAAATCAAGGATTAAAATTCAATCACTGACTTTATAATATAACACTTATTTTATTTAATTTTTGCAAAGAACCCCCCCGAATTTTGAGATTGGTGTAAAGTCATAGAGCGCTACCAAAAATTAGAATTCAGCGCTGTCCAGGAGTCAGCTCGTCACGCCAGGGAAGAACCTTGCTCCTCGGTCTGAAATTGGATTGAAGACTCTCCTTCGTGGTTGGAGACACAGAGGTTAGTGTTCTTTTCGCGCTCTGTTTTTTCTTTCTTGTGCCCTCTATTATCTGTCAAAAAAGAGAAAAAAAAGTGGGGCGCTTCGTGAGTCCCTCTATGGTTTTGCTTACTCGTTCTCTTCTCATCCCTCAGTCGTTATCAAACTGTAGAACGTTTTCTTCTGCATTCAAATTTTGGGTCAAATACAAATTCGGAAAACCCTTTATTCTCGAACCCTTCTTGGTATCTTTGCAGCATGTTTTTCATTATCTAAAGCCGAGAATCTATGGCTATTAGCTTGAACATGGAGTAGCACGCACTGGCTCGGACGTCGGAATGAATCATAGTATCAGTTCTGATGAGCTCGACCATCTTTCGTTGGAGGTGAGGCAGAAAATGCTACTGGAAAGTAAGCATCCATTACTCGAAGATGGGAGTAAAAGAATATCAACGTGAGTATTATCTGTTGATGTATATATTTTTTCTGCTTATTTCGTTTTTGATTAGAGACTCAACGTTTTCTTTGGGCCATATTCAGGCTTTGCAACGATTTGCTTTTCAAGAAAGAAGATGAATGCTGTGATTTGCAGGTAATTTAGCATGCGTGGCACTTCAATTTGTCTCGGTTTTTGTGCTCATTGATAGGTCGTGTTACTGTTCTGGCAGCAATGACAATTTACAACTACATAAATTTGTATGTTATACACACACACAAGAATTTTCTTTTGAAAGGAGCCCTCTTTTATTGTGGCTGCCCCAAGTCAATTGGAACATTGTTACTTTTCCCAGAGATAAAGGATGCCTTACCTAGGTATTGGCAATACTAAGGAGAGAAATTTGTACTCCTTATGAGTCTTTGGCGTTTCTTGATAACTCATAGGAGATATGGAAGGAGGTCATCTCAAGTAAATAAGTGAGAAGGCATTCCTAACAAATGTTGCTATCTAAGTGCCAAAATCCCATGGGCATATCACAAAGCTTGCTCAACATTTTGATCTTCGCACAAGGCCAACTGTTCAAGATAGTGCTAACACCCATTTTGGCATCATGAATGGTGCCATGGCCATTGCCTCGTGATCGTTTTTCTAGGCTTCTCTTTCTAGCCATCCACAAAGATATTTTAGTCAAAGATATTTGTGGGTTGAGAAATGATTGTTGGCACATCTGACCTTAGAAGGGACTTTTGTGGAAGGGGTATTGCTAACTGGTTCTAACTATTGGATCTCATGAATGGGTACTTTAATGTCTAGTAAAGACTCTAGACCTTGGCTTTTGGATAACTGCCTTCACAGTTAAATATATTTTCGCCATCTCAACAAAAGGGATCCATTGACCACTCTAGTCCTCTGTAGCTGATTGTGGAAATCATTCATTCTTAAGAAATGCAAATTCTCTTTCTCTCTCTCTAATTATTCCTCATTGTCCATACATGCTTATTGGAGAGTGTTTTTGTAACTGTTTTGGCCCTCGTTTGTAATTTCACTACCTTCGGTAAAATGGTTTCTCATCCATAAACATTCTTATAGTGAGCTAAGAATTTGATGGCCTAAACTTATTTGTCTTAACCTTGATCAAGCAAGCCTTCTATATCTTTTGAAAGGCAAGCTCCTTTTCCTCAATTAGCTAGTTTTTATTTTTTTGTTTCCTTCTTCAAGTTTGTCTTTGTTTGTGTTTGACCTGATAGTCTTACGAGACTACTCCAATTTACAAGTGTACGACTTCACTAAACTTGATACAATAATGACTCAACTTATTTATCCAAGAAAATTTTGATTAAATGAAATTAACATTAAGAAAACTTTTGCTCATAGAGTATAATTTCACCAATTTTCTGTTTATTTTTCAATTTGTTATATGTTCTTGCTGCAGGGTGTTTCCTCTGTGATATCTGCTTGTGATGCCAGAAATCTGGGAGATCAACAACTGGAAGCTCAAATTAACGATACTAGTGGTATGGGGGAAATACTTCTGAAGGTCTGTTTTATATTGTTCCTGAGATATTCTAATTTCATCAGTGTTTATGATATAGTTCTTGTGACCTTATGTAGGTCATCTTGAGGACTGCTACAGTGAGGGGGCTCGATTAAATATAGAGAAGCAGATATCATGCACTATGGCATTTGAGAATCCAACACCACCTGAGGTTCCTGATTGGGTAAGGGTGGAATCTACAGGCATCTTACCAGGTTTCTTGGCAGATGGTGTAGATAACTTTGCTTCCGCTGGTGTGGCTGTAACTAAGGTTAAAAATGAGACGTTTGATGACTTCGATGAAGATCTTGATCATGTTGTATTGATAGAGCGACTAAGGATGCTGCTATCAAGGTTCCACATTTTCTTGGTGATCATATTCTCTTTCTGTAGTCATCTATTTCCATCCATGCACTGTTTAGGTCTATAATCTTTTTCCCCCATACTTAGACTAGTATATAGTGTGACTCAATTGCAGCCAATTTTTTTGTGGTTAAAATAATGTTTTGGTCTTTCTATTTTAATTCCTAAACTTTCAAGTTTCAATTATTAAAACTTAGTAAGTCATCTGAAATTGATCTCCTTCATTAACTCCTCTAATAGGATTTAATAATTGGGATAAGTAAAATAAGTCATATAACAACAATAACGTAGTTCTCAACAACCAAATTTAATTAAAAAATGTGTTTTTAATGAAGCCCGTGAAATAGTTTTTCCCAATAATTACATGTTTTAACTAGGCTGATAACTATTGATTTTATTTTTTCAGAAAGAATATTTGTAGGACAAGTTCAAGATGTAGTACTTTAGATATTTAATTAAAGTTGAGATTTCAAATGAAATCGAGTTAAATTGGATCACAAAACAATATTATAACCTTTTTTTATTTTCATTTTTATTATTATTTTTTTTGTGAAGGATTAGAGTTATCTAATGTTATTTTCCCGTTCCCATTGTAGGAGAGCGTTGGGTTTCATGAATCACCATGTGGAGGTAATTCGCTGCTTTTCTGTAATTTGTAGCAAAAAATCTGCAATTTAATACGTCATGGGTGGAATCACATACAATGCCAACGTCCTCGAGAACTATAGTTTCTATTGTATGATAGTGACCCATGAGAGGAGATTCGAGGTGAGGGAGATGAAACTGGAATTTATAGAGGTAGAACTCTGGGATGTGAGATTATGACGTTGCAAGATGCTTCTTTGTCAACCAGATAGTAGTGGGTCAAAGATCATGGCGTGGTGGCAGTAGTGGCTCCAAAGGTAGGAGTGTCAACTGTCATGAAGAAGAGGAACAATGTAGAGGATTTTCTTTTAACCTTCACGTGTTTGAAGAACTTTGGCGGTGGCTTCATAAATAATAATAATAATAATAAAAAAAAAAACTTTTGGGATGACAACCAAAAATTGTGGTTGAAAGAAATCCTTTTAGAACCTTCATGGATCATGTCCTTTTTGTTGCTCTTTCTTGGAGTAAATTTCATACTCCTTACAGAACTCTTATTCTTACTTATCTTTGTGCCAACTGGAAGATCTTTTTTAACTTCTTTGTCTTGGAGTTTGGCTTCTCCCCTTTTTGGTAATTTCATTCATTCAATGAAATTATCTTTCATTAGAAAAAAAACCTATCAACTAGAACTGTGGATTGTGGAGACTGGATCCGCCCAATTTGAGGGTGCCAAGAAGAAATCCCCGTGCTCCTTAATGACATGGTTAGAGGAGCAGTGCCACCACCACCAAAAAATGAATTTTTGGAGTGGTGAGGAGAGGTCCCTTCTTCTAAATGTTGGTTTAGTCGTATTAGGAAAGAAATGCCACATGAGACATCTTTGGCACGCCCATTTTTCTTATAGGGTTCACACTATTTTCCTTTCCTAATTAGTTTTTATACCATAATTATGTGAGGGATTATGGACTTCCAAGAGGTACTTGATGAGGGAGGAATCCTAGCCACCAAGGCAAAATTTTCTGACTCATGTTGAGTTGTGGGTGTGGTATCCTTCATCATCAACCTTTTGTGGCTTTACTTGCTGTACTTCAGCAAAAGCCCCGCAGTATTTGGTAAAGGTTTCATGGATAGGGTTCTCCCCCTCACCTTATCTAATTCCTTGTTAGACCAAGTAAAAGTTGAATACTCTCTTATTTTGAACTATCTTTCGAAATGTGGTACCGTTTATTACAACTTGTAGTTGTGATTTTCAAACATAGTTTTTGCCAATCGGGTGAGTTGGGTGAATATTAAGTTACAGGCACCTCTTTTTGTCGTAAGTCATGAAATGTAGGTCTCCGATTATTTTCTGTCGTAAGTTTTGTTTTAAAAAATGTTAAAAACTTGCCTTGTAATTTTGTGCTGTCCTCCCTTTCTTCATTGGCAGGTCATGTACATATTGACAAACATGTGTTTGTGTTATGGATTGTTTTTCTCAATGAAATGGGTCAAAAGCTTCATAATATGTCCCCCAACTTGTTTTTCCCATTGTTTCCTTCCATGTCTTTTTCATGATGCTCTGCAACATTAAGTTTTGTGATATTTTGTCATGATTTGCTTTTTTGTTGTTTAGAGTTGTTGGTAATTTGATACGCTGCTGGGTTGTGCAGGGTAGTTCTGGTGTGTCATCAGGAAATCTTATACAATGCTTTTTGAAACAGAAGGAAAAGTTTATGTTTGCTAATGGGGAGCTGATGGGAATTGGCAATGCGTTGCATGATAAAATTGGAAGTGATGCTCCTCGTCTTTGCATCCCTTCAGTAATTTGTTCACCTAATACAGCCTTTTCTGGATCCTGTTTCTCAAGCGATCATTCTTTAAATAAATCAACTGAATCAGGCAATGACATGGAACTTAAAGAAGATGATAAGATCTGTTCATCTGAGAAGGTGGCCACAGAATTAGGCCCACGGCTTTTGACTGATCATGTCCCTGAAGTAAATTTATTTAATTCCACAAAAGTGAAGGATGAACCTTATGATCATGTTGACGGCTGCAACTTATATGATAAGGATACGAAGAACATCTGCAGCAGAATTTTGTCAATAAAGAGTGAAACAATCATGCCCGATGAACCTTATGAAAACAAGGTAGATGATATGCGACTGCAAGATCGAATGAAGTTTTTCTCATCTCGAAAGGTTTTTGGTTCTACGTCAAGGGATTACGAGCATCCAAAACCTTCTGACCCTGGATGTAGTTCTCTTGTTTCAGAACCTTCTAGTTTAATGAACATTAAACATCGACGCAAGCGGAAAAAAACTGCCACGTATGTACTTTACGTGTTTGAATACTACTGCAATATACTTTTTTTTCCTGTGTTTTCTCAATCTTTCTAAGGTGTTTTGTGTCAATTTATTTGCAGGAATTCAGTTGAAACAGCACTCGAGGAAGATGCCCCTGGCCTTCTGCAGGTATCAGTATCATGTGTCGTTCAATTGTGCTTGTGAATTTTGGATCATATTCATTCTAGTTTCATTGGCAGATACTAGTTGGCAAAGGTGTAGAAATTGATGAAATTAAGCTTTATGGAGAGATGGAAAGTGATGATGATCTGCATGAGTCATTTAGTGAAGACAGCTTTGGTGAGCTTGAAGCTGTGATATCGAGGGTGAGCGATAATTTATGATCTTGTTGTCATGTAATTGCTTTGCTTTTGTAATTTCTCCTAAATTTTCCTTCATGGGCTGTTTGGCGTAAGCTTTTCTAGCCATGAGTTTCAATATCCTACATACGATATTTTCCATACTTGAGAAATGAAGATCCTTGGGTGGTAATGAATGCCTATCCCCTAGGGTTTTTAAACCTCTCAAACATATGGTTTTCTTAACCACCAAACACAGGTTTCCTAATATCATGAGAGAGAGGTCTAGTTCAAAAGTTAATGAAAAGTTTGTTTCTTGTTCAAAAACAAATGAGAGAGAGGTCTATATTACTTAATTTTGATTAATTAACAAATATATATTTCTTAAGATTTAATATTTGTTTCTATGAATATTTTTAATAAATTTTTGCAGCGGTTGTTTGGGAAATTATTCTGCAGATTTAAAAACTATTTTATAGAGTTCTTTAAAAATGTTACAGTGCACTTAACCATTGTTCACTAGTCCATGTTGTCAAAGGAAGCACTACAACAAAGTAAAAGAGAACAACTTTACTTGTTATAGTAGCATAATTGAATTTGTTCTCTCTTGTCAATATTGTCTTGTTTCTTTTTCTTATTGTCCTTCCCCTACCTTCGTCTGCCTCCTTCTGGGAAGTAGTTTATTTGATAATTTTAATTGATTGGTTGGATTTTATATTAGATTGCAATGGAAATAACTTATTTCAGTGGACCAAGTTCTGGTGGTGCTTCTTCAAAATTAGTTGAATTTCAAATTGTTAAAGAAAGAGATTGGATGATGGGTCAATTTGGGTGCTGAGAAAATCGGACTAGTATGGTTGGGTTGTTGAAATTATTTCTCTTTTCCATGGAGTGAAGAGGTGTATCTTAGTCCCTATTAGGAGAAAACAAGGGAGGTTGGAGTGTCTTTTGGGATACGCTCTATGTTTTTATTAGAAAATTTGGTAGTGAAAGGAGCAAGGAAAAGGTGAGCAAACCTGAGAGTGGGAGGAGTAAGGCTCATCTCGGCGGATGAAGATATGGTGTGTATTAAGAATGATTAGGTAAAGGAGTTACTACTTTTAACTTTGGAAGGTGTAATGCTAGGGTTCATGGGAAGATGTCGTGGATTCCTTCTTACGATGACTGGATTAGAGTTGGGATTTTTTTTAGATAGGAAACAGAGAATATTATTATCAAGAGCCCAGATACAAAAAATGGGAGATGAGATATCCCCATGAAACCAACGGGTTACAAAAAGGATTCCCTGGTATAAATATACAAAAGCTCAAAGTACAAAAAAGTTTAGCTAAGGAGCACCATGAAGATGAAAAAGATATAGTGAGATCCCAAAATCATCAAGGCTTGACTCGGCTCCTTTGAAAGTCTACTGGTATCTCTCGTGCCATTCCAAAAGACTAGCCACGGAATTGGAACAGATTTCACCAGCATAATACCCCCTGGAAGGAGAAGCTAGCAATAACTGCTTGAGTGCCTCCCCACTCTTCTTAGCAAAACCCCAACTGATGCCAAAATCTGAAAAAATTTAGACCAAACTGCGGCGGCAAAGGGGCAAAAGAAGAATAGATGAGAGTGGTTCTCCAAGTCTCTTTTACACATAATGCACCAACTAGGGGAAAGTAAGAAATTGAGCATCCTTCTTTGGAGCTTCTCGCAAGTGTTGAGACGTTCTTGGCCAAAAATCCATAGGAACATTTTAACCTTCTTAGGGTAATTCGCCTTCCAAATGTCATTGGAAACATGTGTCAAGAGAAGAGTTTCGAGCACTGATTTTTTAAACAAGGGAGCTCACTTGATACTCTCCGCTCCTATGAAGGATGAGATTTAGAGTCCTCTTGATTAGAAGGCTGCCGTTGACTGATTGAGCTTGATAAAGTGTTCCATTTTTCAATCTCAAGATCTGTGAGGTTCCTTTTAAATTGGAGATTCCATCTCTTATTATCATTTGACCAACATTGGTAGATAAAGAAACCTTTAGAGTTAGCCACGTGGTAGAATCTCAGAAAACACAGCACAGGCATCACCTATATGCTTAAATGATTAATATTCCAATTATCAAGAAGAAAATTAGGTATACTCTCTTTCCCATGACAAATTTTCTTCTTGATAATTGGAATAGTAATCATTTAAGCATATAGGTGATGTATGTGCTGGTTTTTTAACATTTGCTCAGATAACGGTTAGGAAGATGGATTTTATGGAGGCTGCCATGAAAGTAAGGGACAATTATTATGGTTCCACAACACCAGAGGTGGAGTTATTGTAGTTGGGTGGTACAAATTGCAGGTTTTCTAACACCTAGTGGTTAAATAGTAGAAACCCGAGATTCAAGGTACTTCTTCAAAAGAGGTGGCTAGAGCTTTTGATGAATTCATGTTGTCACCCTGTGGATATTTCGCTGTAGTGCTCCAAGATAATTTGTTTGTCTTGGAGCCTATTTTCAGGGAAGATGAGGTGATTCTCAGAAGAATTTTAGTAGACATTATAGGATCTGATTGCCTCCTCGAGATTCTTTGCAACAATGTTCTTATTGTTGTAACATGGTCGACGTAATCTTCCAATTATCATTTATGGTTTGAAGTTTGGGTGACCATTTCCAGGATAGCAACTGGCTGTCATCACTCATACGAGGAGGAGCACCTCAAGTTTAACCAATCTGCACCTCATGTCATCAGAAGATTTGCATGTGAGGCTCACACATCACTGGTTCGTTCTATTTTCTGCCCTGCCCGCTGAAGCATAAGTGGGCAGAATGGAATTTTGATGGTATTGGAAGAGTAGTGAGGAAATTTGGGATAACATCACCCTTGTACTTCTATTTAGTTTTTTCTTTCTTAAGTTTTCTTTAACTACTCCACTTCTTTTATATAACCAACTGGAAAGTGCTCTTGTTGTTCCCGCTTTTGGACATACATTTCAATTTTTCAATGATATTGGTTTCTTATCCAAAAATCAAACTCAACTGTTTATTTGATAATTTTGATTGGTTGGTAGTAGTTAGTACTCTGATTTGCGGTTGATGGTTGGGATTTTTTATGGCATTTTGTTTAGTACTCTCTCTGTCTATACCCTAATGTTATTGTTTTACCATATTGTAACAGCTGTTTTCTCAACGCCATTCCTTTTTGAAGTTTCCTTCTATAAGATGCACAAAAGCTTCTAGAGCAAGCTATTGCTTAGCTTGTCTAGTTTCACTTATTGAGCAGGTAACAAGTTTTATCTTCCATTTACGGTTATGAACTTATTTTGCGTATCATTTTTTCTATTCACTTCTATTTAGTTCATCATTGCATATGATTAGTTTTCTTCATGTCATGATCTTAAGAAAGATCTTTTATTTTCTTTTAGCCTATGCTTAAACTTTGTTACTGTCCAGTTTATTCCACTCTGATAATCTATATTTGTTGGTCAATCTTCTATTTTAGACAAGATATCTTCATTTCCGGAGTTGGCCTGTCGAATGGGGGTGGTGCCGTGATCTTCAGTCTTTTATATTTGTATTCGAGAGACATAAAAGGTGCATGCACTACAATTTTTCTACGTGAATCAATGTACAGAGTTAAATGTACACAGTTAAATATTGATGGTGCTTCCTTGAGTGCAGAATAGTGCTGGAACGTCCCGAGTATGGCTATGCTACATATTTTTTTGAGCTTGTCGATTCCTTACCTATCGACTGGCAGATAAAGCGGTTGGTGATTGCTTTGAAGCTTACTAGTTGTAGCAGAATTTCACTAATTGAGAACAAACCATTGTTG
mRNA sequence
ATGCAGAGAGTTCGCATAAAAAAAATAATAATAATAATAATTAGCACCTGCAAGCGTCCGGAGGCCGGTGGTCGGCCTGGGATTCTTCAGCGCAGGGCGCTGTCCAGGAGTCAGCTCGTCACGCCAGGGAAGAACCTTGCTCCTCGGTCTGAAATTGGATTGAAGACTCTCCTTCGTGGTTGGAGACACAGAGCACGCACTGGCTCGGACGTCGGAATGAATCATAGTATCAGTTCTGATGAGCTCGACCATCTTTCGTTGGAGGTGAGGCAGAAAATGCTACTGGAAAAGACTCAACGTTTTCTTTGGGCCATATTCAGGCTTTGCAACGATTTGCTTTTCAAGAAAGAAGATGAATGCTGTGATTTGCAGGGTGTTTCCTCTGTGATATCTGCTTGTGATGCCAGAAATCTGGGAGATCAACAACTGGAAGCTCAAATTAACGATACTAGTGGTCATCTTGAGGACTGCTACAGTGAGGGGGCTCGATTAAATATAGAGAAGCAGATATCATGCACTATGGCATTTGAGAATCCAACACCACCTGAGGTTCCTGATTGGGTAAGGGTGGAATCTACAGGCATCTTACCAGGTTTCTTGGCAGATGGTGTAGATAACTTTGCTTCCGCTGGTGTGGCTGTAACTAAGGTTAAAAATGAGACGTTTGATGACTTCGATGAAGATCTTGATCATGTTGTATTGATAGAGCGACTAAGGATGCTGCTATCAAGGAGAGCGTTGGGTTTCATGAATCACCATGTGGAGGGTAGTTCTGGTGTGTCATCAGGAAATCTTATACAATGCTTTTTGAAACAGAAGGAAAAGTTTATGTTTGCTAATGGGGAGCTGATGGGAATTGGCAATGCGTTGCATGATAAAATTGGAAGTGATGCTCCTCGTCTTTGCATCCCTTCAGTAATTTGTTCACCTAATACAGCCTTTTCTGGATCCTGTTTCTCAAGCGATCATTCTTTAAATAAATCAACTGAATCAGGCAATGACATGGAACTTAAAGAAGATGATAAGATCTGTTCATCTGAGAAGGTGGCCACAGAATTAGGCCCACGGCTTTTGACTGATCATGTCCCTGAAGTAAATTTATTTAATTCCACAAAAGTGAAGGATGAACCTTATGATCATGTTGACGGCTGCAACTTATATGATAAGGATACGAAGAACATCTGCAGCAGAATTTTGTCAATAAAGAGTGAAACAATCATGCCCGATGAACCTTATGAAAACAAGGTAGATGATATGCGACTGCAAGATCGAATGAAGTTTTTCTCATCTCGAAAGGTTTTTGGTTCTACGTCAAGGGATTACGAGCATCCAAAACCTTCTGACCCTGGATGTAGTTCTCTTGTTTCAGAACCTTCTAGTTTAATGAACATTAAACATCGACGCAAGCGGAAAAAAACTGCCACGAATTCAGTTGAAACAGCACTCGAGGAAGATGCCCCTGGCCTTCTGCAGATACTAGTTGGCAAAGGTGTAGAAATTGATGAAATTAAGCTTTATGGAGAGATGGAAAGTGATGATGATCTGCATGAGTCATTTAGTGAAGACAGCTTTGGTGAGCTTGAAGCTGTGATATCGAGGCTGTTTTCTCAACGCCATTCCTTTTTGAAGTTTCCTTCTATAAGATGCACAAAAGCTTCTAGAGCAAGCTATTGCTTAGCTTGTCTAGTTTCACTTATTGAGCAGACAAGATATCTTCATTTCCGGAGTTGGCCTGTCGAATGGGGGTGGTGCCGTGATCTTCAGTCTTTTATATTTGTATTCGAGAGACATAAAAGAATAGTGCTGGAACGTCCCGAGTATGGCTATGCTACATATTTTTTTGAGCTTGTCGATTCCTTACCTATCGACTGGCAGATAAAGCGGTTGGTGATTGCTTTGAAGCTTACTAGTTGTAGCAGAATTTCACTAATTGAGAACAAACCATTGTTG
Coding sequence (CDS)
ATGCAGAGAGTTCGCATAAAAAAAATAATAATAATAATAATTAGCACCTGCAAGCGTCCGGAGGCCGGTGGTCGGCCTGGGATTCTTCAGCGCAGGGCGCTGTCCAGGAGTCAGCTCGTCACGCCAGGGAAGAACCTTGCTCCTCGGTCTGAAATTGGATTGAAGACTCTCCTTCGTGGTTGGAGACACAGAGCACGCACTGGCTCGGACGTCGGAATGAATCATAGTATCAGTTCTGATGAGCTCGACCATCTTTCGTTGGAGGTGAGGCAGAAAATGCTACTGGAAAAGACTCAACGTTTTCTTTGGGCCATATTCAGGCTTTGCAACGATTTGCTTTTCAAGAAAGAAGATGAATGCTGTGATTTGCAGGGTGTTTCCTCTGTGATATCTGCTTGTGATGCCAGAAATCTGGGAGATCAACAACTGGAAGCTCAAATTAACGATACTAGTGGTCATCTTGAGGACTGCTACAGTGAGGGGGCTCGATTAAATATAGAGAAGCAGATATCATGCACTATGGCATTTGAGAATCCAACACCACCTGAGGTTCCTGATTGGGTAAGGGTGGAATCTACAGGCATCTTACCAGGTTTCTTGGCAGATGGTGTAGATAACTTTGCTTCCGCTGGTGTGGCTGTAACTAAGGTTAAAAATGAGACGTTTGATGACTTCGATGAAGATCTTGATCATGTTGTATTGATAGAGCGACTAAGGATGCTGCTATCAAGGAGAGCGTTGGGTTTCATGAATCACCATGTGGAGGGTAGTTCTGGTGTGTCATCAGGAAATCTTATACAATGCTTTTTGAAACAGAAGGAAAAGTTTATGTTTGCTAATGGGGAGCTGATGGGAATTGGCAATGCGTTGCATGATAAAATTGGAAGTGATGCTCCTCGTCTTTGCATCCCTTCAGTAATTTGTTCACCTAATACAGCCTTTTCTGGATCCTGTTTCTCAAGCGATCATTCTTTAAATAAATCAACTGAATCAGGCAATGACATGGAACTTAAAGAAGATGATAAGATCTGTTCATCTGAGAAGGTGGCCACAGAATTAGGCCCACGGCTTTTGACTGATCATGTCCCTGAAGTAAATTTATTTAATTCCACAAAAGTGAAGGATGAACCTTATGATCATGTTGACGGCTGCAACTTATATGATAAGGATACGAAGAACATCTGCAGCAGAATTTTGTCAATAAAGAGTGAAACAATCATGCCCGATGAACCTTATGAAAACAAGGTAGATGATATGCGACTGCAAGATCGAATGAAGTTTTTCTCATCTCGAAAGGTTTTTGGTTCTACGTCAAGGGATTACGAGCATCCAAAACCTTCTGACCCTGGATGTAGTTCTCTTGTTTCAGAACCTTCTAGTTTAATGAACATTAAACATCGACGCAAGCGGAAAAAAACTGCCACGAATTCAGTTGAAACAGCACTCGAGGAAGATGCCCCTGGCCTTCTGCAGATACTAGTTGGCAAAGGTGTAGAAATTGATGAAATTAAGCTTTATGGAGAGATGGAAAGTGATGATGATCTGCATGAGTCATTTAGTGAAGACAGCTTTGGTGAGCTTGAAGCTGTGATATCGAGGCTGTTTTCTCAACGCCATTCCTTTTTGAAGTTTCCTTCTATAAGATGCACAAAAGCTTCTAGAGCAAGCTATTGCTTAGCTTGTCTAGTTTCACTTATTGAGCAGACAAGATATCTTCATTTCCGGAGTTGGCCTGTCGAATGGGGGTGGTGCCGTGATCTTCAGTCTTTTATATTTGTATTCGAGAGACATAAAAGAATAGTGCTGGAACGTCCCGAGTATGGCTATGCTACATATTTTTTTGAGCTTGTCGATTCCTTACCTATCGACTGGCAGATAAAGCGGTTGGTGATTGCTTTGAAGCTTACTAGTTGTAGCAGAATTTCACTAATTGAGAACAAACCATTGTTG
Protein sequence
MQRVRIKKIIIIIISTCKRPEAGGRPGILQRRALSRSQLVTPGKNLAPRSEIGLKTLLRGWRHRARTGSDVGMNHSISSDELDHLSLEVRQKMLLEKTQRFLWAIFRLCNDLLFKKEDECCDLQGVSSVISACDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISCTMAFENPTPPEVPDWVRVESTGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALGFMNHHVEGSSGVSSGNLIQCFLKQKEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVICSPNTAFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLFNSTKVKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRRKRKKTATNSVETALEEDAPGLLQILVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKFPSIRCTKASRASYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL
Homology
BLAST of Sgr016933 vs. NCBI nr
Match:
XP_022141525.1 (uncharacterized protein LOC111011878 isoform X2 [Momordica charantia])
HSP 1 Score: 933.7 bits (2412), Expect = 8.5e-268
Identity = 472/582 (81.10%), Postives = 502/582 (86.25%), Query Frame = 0
Query: 73 MNHSISSDELDHLSLEVRQKMLLEKTQRFL----WAIFRLCNDLLFKKEDECCDLQGVSS 132
MNH S DELDHLSL RQKMLLE L I LCND + K+EDECCD+QGVSS
Sbjct: 1 MNHGTSFDELDHLSLVQRQKMLLENKHPLLEDGSKIISPLCNDFIVKEEDECCDVQGVSS 60
Query: 133 VISACDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISCTMAFENPTPPEVPDWV 192
+IS D NLG QQLE Q+ DTSGHLED Y+E ARLN EKQISCTM FENPTPPEVPDWV
Sbjct: 61 MISTRDDGNLGGQQLETQMKDTSGHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWV 120
Query: 193 RVESTGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALG 252
RVESTGIL G L DGVDNF SAGVAVTKVKNE FDDF+EDLDHVV IERLRMLLSR+ALG
Sbjct: 121 RVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALG 180
Query: 253 FMNHHVEGSSGVSSGNLIQCFLKQKEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVIC 312
MN HVEG SG SSG+ +QCFLKQK K MF+N EL G N LHD+ G DAP L PSV+C
Sbjct: 181 SMNQHVEGGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVC 240
Query: 313 SPNTAFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLF 372
SP SGS FSS+ SLNK TESGNDMELKEDD+IC SEKV TELG RLLT+H PE NLF
Sbjct: 241 SPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLTNHAPEANLF 300
Query: 373 NSTKVKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFF 432
STKVKDEPYDHVDGCNL+ KD N+CSRILS+KSET MPDEPYENKVDDMRLQDRMKFF
Sbjct: 301 YSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFF 360
Query: 433 SSRKVFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRRKRKKTATNSVETALEEDAPG 492
SSRKVFGSTSRDYEHPKPSDPGCSSLVSEP+SLMN+K RRK K+TATNS+ETALEEDAPG
Sbjct: 361 SSRKVFGSTSRDYEHPKPSDPGCSSLVSEPASLMNVKRRRKWKRTATNSIETALEEDAPG 420
Query: 493 LLQILVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKFPSIR 552
LLQILV KGV +DEIKLYGEMESDDDL ESFSE+SFGELEAVISRLFSQR SFLKFP IR
Sbjct: 421 LLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPIR 480
Query: 553 CTKASRASYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYG 612
CTKASR+SYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYG
Sbjct: 481 CTKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYG 540
Query: 613 YATYFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL 651
YATYFFELV+ LPI WQIKRLVIALKLT+CSRISL+EN+PLL
Sbjct: 541 YATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLL 582
BLAST of Sgr016933 vs. NCBI nr
Match:
XP_022141523.1 (uncharacterized protein LOC111011878 isoform X1 [Momordica charantia])
HSP 1 Score: 927.9 bits (2397), Expect = 4.7e-266
Identity = 472/586 (80.55%), Postives = 502/586 (85.67%), Query Frame = 0
Query: 73 MNHSISSDELDHLSLEVRQKMLLEKTQRFL----WAIFRLCNDLLFKKEDECCDLQGVSS 132
MNH S DELDHLSL RQKMLLE L I LCND + K+EDECCD+QGVSS
Sbjct: 1 MNHGTSFDELDHLSLVQRQKMLLENKHPLLEDGSKIISPLCNDFIVKEEDECCDVQGVSS 60
Query: 133 VISACDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISCTMAFENPTPPEVPDWV 192
+IS D NLG QQLE Q+ DTSGHLED Y+E ARLN EKQISCTM FENPTPPEVPDWV
Sbjct: 61 MISTRDDGNLGGQQLETQMKDTSGHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWV 120
Query: 193 RVESTGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALG 252
RVESTGIL G L DGVDNF SAGVAVTKVKNE FDDF+EDLDHVV IERLRMLLSR+ALG
Sbjct: 121 RVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALG 180
Query: 253 FMNHHVEGSSGVSSGNLIQCFLKQKEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVIC 312
MN HVEG SG SSG+ +QCFLKQK K MF+N EL G N LHD+ G DAP L PSV+C
Sbjct: 181 SMNQHVEGGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVC 240
Query: 313 SPNTAFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLF 372
SP SGS FSS+ SLNK TESGNDMELKEDD+IC SEKV TELG RLLT+H PE NLF
Sbjct: 241 SPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLTNHAPEANLF 300
Query: 373 NSTKVKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFF 432
STKVKDEPYDHVDGCNL+ KD N+CSRILS+KSET MPDEPYENKVDDMRLQDRMKFF
Sbjct: 301 YSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFF 360
Query: 433 SSRKVFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRRKRKKTATNSVETALEEDAPG 492
SSRKVFGSTSRDYEHPKPSDPGCSSLVSEP+SLMN+K RRK K+TATNS+ETALEEDAPG
Sbjct: 361 SSRKVFGSTSRDYEHPKPSDPGCSSLVSEPASLMNVKRRRKWKRTATNSIETALEEDAPG 420
Query: 493 LL----QILVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKF 552
LL QILV KGV +DEIKLYGEMESDDDL ESFSE+SFGELEAVISRLFSQR SFLKF
Sbjct: 421 LLQFHWQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKF 480
Query: 553 PSIRCTKASRASYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLER 612
P IRCTKASR+SYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLER
Sbjct: 481 PPIRCTKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLER 540
Query: 613 PEYGYATYFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL 651
PEYGYATYFFELV+ LPI WQIKRLVIALKLT+CSRISL+EN+PLL
Sbjct: 541 PEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLL 586
BLAST of Sgr016933 vs. NCBI nr
Match:
KAG6579038.1 (hypothetical protein SDJN03_23486, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 873.6 bits (2256), Expect = 1.0e-249
Identity = 453/579 (78.24%), Postives = 489/579 (84.46%), Query Frame = 0
Query: 73 MNHSISSDELDHLSLEVRQKMLLEKTQRFLWAIFRLCNDLLFKKEDECCDLQGVSSVISA 132
MN ISSDELDHLSL VR+KML E L + + + KKE+ECCDLQG S++ISA
Sbjct: 1 MNRGISSDELDHLSLAVRRKMLQENKFTLLEDESKRISTFV-KKENECCDLQGGSTMISA 60
Query: 133 CDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISCTMAFENPTPPEVPDWVRVES 192
CDARNLGDQQLEAQINDT+GH D YSEGARLN E FENPTPPEV D VRVES
Sbjct: 61 CDARNLGDQQLEAQINDTNGHHMDNYSEGARLNRE-----NTTFENPTPPEVLDRVRVES 120
Query: 193 TGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALGFMNH 252
T IL G LA GVDNFA AGVAVTKVKNE FDDF+EDLDHV+LIERLRMLLSRRALG MN
Sbjct: 121 TSILSGTLAAGVDNFAPAGVAVTKVKNEMFDDFNEDLDHVLLIERLRMLLSRRALGLMNQ 180
Query: 253 HVEGSSGVSSGNLIQCFLKQKEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVICSPNT 312
HVEG SGV SG+L+QCFLKQK K MFA+ E M IGN LHDK GS APR C PSV+CSPN
Sbjct: 181 HVEGGSGVPSGDLLQCFLKQKAKSMFASEERMEIGNVLHDKSGSYAPRHCSPSVVCSPNA 240
Query: 313 AFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLFNSTK 372
SGS FSS+HSLNKSTESGNDMELKE DKICSSEKVATELG R LT+HVP+ NL +STK
Sbjct: 241 TLSGSYFSSNHSLNKSTESGNDMELKE-DKICSSEKVATELGSRHLTNHVPQANLLSSTK 300
Query: 373 VKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFFSSRK 432
VKDEPYDH +GC++Y KD N+ S LSIKSET MPDEPYENKVDDM LQDRMKFFSSRK
Sbjct: 301 VKDEPYDHGEGCSIYGKDMNNVYSNTLSIKSETTMPDEPYENKVDDMPLQDRMKFFSSRK 360
Query: 433 VFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRRKRKKTATNSVETALEEDAPGLLQI 492
G TS DYEHPKPSDPGCS LVSEP + N K RRK+KKTATNS+ETALEEDAPGLLQI
Sbjct: 361 DIGFTSMDYEHPKPSDPGCSVLVSEPVNFPNTKRRRKQKKTATNSIETALEEDAPGLLQI 420
Query: 493 LVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKFPS-IRCTK 552
LV KG+++DEIKLYGE ESDDDL ES SEDSF ELE VI+RLF QRHSFLKFPS IRCTK
Sbjct: 421 LVEKGIQVDEIKLYGETESDDDLDESSSEDSFRELEDVITRLFPQRHSFLKFPSIIRCTK 480
Query: 553 ASRASYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYAT 612
ASRASYCLACLVSLIEQTRYLHFR+WPVEWGWCRDLQSFIFVFERHKRIV+ERPEYGYAT
Sbjct: 481 ASRASYCLACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYAT 540
Query: 613 YFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL 651
YFFELV+SLPI WQIKRLVIA+KLT+CSRISL+EN+PLL
Sbjct: 541 YFFELVESLPISWQIKRLVIAMKLTNCSRISLLENRPLL 572
BLAST of Sgr016933 vs. NCBI nr
Match:
XP_022939493.1 (uncharacterized protein LOC111445382 isoform X1 [Cucurbita moschata] >XP_022939494.1 uncharacterized protein LOC111445382 isoform X1 [Cucurbita moschata])
HSP 1 Score: 870.5 bits (2248), Expect = 8.8e-249
Identity = 452/579 (78.07%), Postives = 487/579 (84.11%), Query Frame = 0
Query: 73 MNHSISSDELDHLSLEVRQKMLLEKTQRFLWAIFRLCNDLLFKKEDECCDLQGVSSVISA 132
MN ISSDELDHLSL VR+KML E L + + + KKE+ECCDLQG S++ISA
Sbjct: 1 MNRGISSDELDHLSLAVRRKMLQENKLTLLEDESKGISTFV-KKENECCDLQGGSTMISA 60
Query: 133 CDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISCTMAFENPTPPEVPDWVRVES 192
CDARNLGDQQLEAQINDT+GH D YSEGARLN E FENPTPPEV D VRVES
Sbjct: 61 CDARNLGDQQLEAQINDTNGHHMDNYSEGARLNRE-----NTTFENPTPPEVLDRVRVES 120
Query: 193 TGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALGFMNH 252
T IL G L GVDNFA AGVAVTKVKNE FDDFDEDLDHV+LIERLRMLLSRRALG MN
Sbjct: 121 TSILSGTLVAGVDNFAPAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGLMNQ 180
Query: 253 HVEGSSGVSSGNLIQCFLKQKEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVICSPNT 312
HVEG SGV SG+L+QCFLKQK K MFA+ E M IGN LHDK GS APR C PSV+CSPN
Sbjct: 181 HVEGGSGVPSGDLLQCFLKQKAKSMFASEERMEIGNVLHDKSGSYAPRHCSPSVVCSPNA 240
Query: 313 AFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLFNSTK 372
SGS FSS+HSLNKSTESGNDMELKE DKICSSEKVATELG R LT+HVP+ NL +STK
Sbjct: 241 TLSGSYFSSNHSLNKSTESGNDMELKE-DKICSSEKVATELGSRHLTNHVPQENLLSSTK 300
Query: 373 VKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFFSSRK 432
VKDEPYDH +GC++Y KD N+ S LSIKSET MPDEPYENKVDDM LQDRMKFFSSRK
Sbjct: 301 VKDEPYDHGEGCSIYGKDMNNVYSNTLSIKSETTMPDEPYENKVDDMPLQDRMKFFSSRK 360
Query: 433 VFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRRKRKKTATNSVETALEEDAPGLLQI 492
G TS DYEHPKPSDPGCS LVSEP + N K RRK+KKTATNS+ETALEEDAPGLLQI
Sbjct: 361 DIGFTSMDYEHPKPSDPGCSVLVSEPVNFPNTKRRRKQKKTATNSIETALEEDAPGLLQI 420
Query: 493 LVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKFPS-IRCTK 552
LV KG+++DEIKLYGE ESDDDL ES SEDSF ELE VI+RLF QRHSFLKFPS IRC K
Sbjct: 421 LVEKGIQVDEIKLYGETESDDDLDESSSEDSFRELEDVITRLFPQRHSFLKFPSIIRCMK 480
Query: 553 ASRASYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYAT 612
ASRASYCLACLVSLIEQTRYLHFR+WPVEWGWCRDLQSFIFVFERHKRIV+ERPEYGYAT
Sbjct: 481 ASRASYCLACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYAT 540
Query: 613 YFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL 651
YFFELV+SLPI WQIKRLVIA+KLT+CSRISL+EN+PLL
Sbjct: 541 YFFELVESLPISWQIKRLVIAMKLTNCSRISLLENRPLL 572
BLAST of Sgr016933 vs. NCBI nr
Match:
XP_022141526.1 (uncharacterized protein LOC111011878 isoform X3 [Momordica charantia])
HSP 1 Score: 864.0 bits (2231), Expect = 8.3e-247
Identity = 435/526 (82.70%), Postives = 462/526 (87.83%), Query Frame = 0
Query: 129 VISACDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISCTMAFENPTPPEVPDWV 188
+IS D NLG QQLE Q+ DTSGHLED Y+E ARLN EKQISCTM FENPTPPEVPDWV
Sbjct: 1 MISTRDDGNLGGQQLETQMKDTSGHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWV 60
Query: 189 RVESTGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALG 248
RVESTGIL G L DGVDNF SAGVAVTKVKNE FDDF+EDLDHVV IERLRMLLSR+ALG
Sbjct: 61 RVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALG 120
Query: 249 FMNHHVEGSSGVSSGNLIQCFLKQKEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVIC 308
MN HVEG SG SSG+ +QCFLKQK K MF+N EL G N LHD+ G DAP L PSV+C
Sbjct: 121 SMNQHVEGGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVC 180
Query: 309 SPNTAFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLF 368
SP SGS FSS+ SLNK TESGNDMELKEDD+IC SEKV TELG RLLT+H PE NLF
Sbjct: 181 SPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLTNHAPEANLF 240
Query: 369 NSTKVKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFF 428
STKVKDEPYDHVDGCNL+ KD N+CSRILS+KSET MPDEPYENKVDDMRLQDRMKFF
Sbjct: 241 YSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFF 300
Query: 429 SSRKVFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRRKRKKTATNSVETALEEDAPG 488
SSRKVFGSTSRDYEHPKPSDPGCSSLVSEP+SLMN+K RRK K+TATNS+ETALEEDAPG
Sbjct: 301 SSRKVFGSTSRDYEHPKPSDPGCSSLVSEPASLMNVKRRRKWKRTATNSIETALEEDAPG 360
Query: 489 LL----QILVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKF 548
LL QILV KGV +DEIKLYGEMESDDDL ESFSE+SFGELEAVISRLFSQR SFLKF
Sbjct: 361 LLQFHWQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKF 420
Query: 549 PSIRCTKASRASYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLER 608
P IRCTKASR+SYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLER
Sbjct: 421 PPIRCTKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLER 480
Query: 609 PEYGYATYFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL 651
PEYGYATYFFELV+ LPI WQIKRLVIALKLT+CSRISL+EN+PLL
Sbjct: 481 PEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLL 526
BLAST of Sgr016933 vs. ExPASy TrEMBL
Match:
A0A6J1CIB2 (uncharacterized protein LOC111011878 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111011878 PE=4 SV=1)
HSP 1 Score: 933.7 bits (2412), Expect = 4.1e-268
Identity = 472/582 (81.10%), Postives = 502/582 (86.25%), Query Frame = 0
Query: 73 MNHSISSDELDHLSLEVRQKMLLEKTQRFL----WAIFRLCNDLLFKKEDECCDLQGVSS 132
MNH S DELDHLSL RQKMLLE L I LCND + K+EDECCD+QGVSS
Sbjct: 1 MNHGTSFDELDHLSLVQRQKMLLENKHPLLEDGSKIISPLCNDFIVKEEDECCDVQGVSS 60
Query: 133 VISACDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISCTMAFENPTPPEVPDWV 192
+IS D NLG QQLE Q+ DTSGHLED Y+E ARLN EKQISCTM FENPTPPEVPDWV
Sbjct: 61 MISTRDDGNLGGQQLETQMKDTSGHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWV 120
Query: 193 RVESTGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALG 252
RVESTGIL G L DGVDNF SAGVAVTKVKNE FDDF+EDLDHVV IERLRMLLSR+ALG
Sbjct: 121 RVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALG 180
Query: 253 FMNHHVEGSSGVSSGNLIQCFLKQKEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVIC 312
MN HVEG SG SSG+ +QCFLKQK K MF+N EL G N LHD+ G DAP L PSV+C
Sbjct: 181 SMNQHVEGGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVC 240
Query: 313 SPNTAFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLF 372
SP SGS FSS+ SLNK TESGNDMELKEDD+IC SEKV TELG RLLT+H PE NLF
Sbjct: 241 SPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLTNHAPEANLF 300
Query: 373 NSTKVKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFF 432
STKVKDEPYDHVDGCNL+ KD N+CSRILS+KSET MPDEPYENKVDDMRLQDRMKFF
Sbjct: 301 YSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFF 360
Query: 433 SSRKVFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRRKRKKTATNSVETALEEDAPG 492
SSRKVFGSTSRDYEHPKPSDPGCSSLVSEP+SLMN+K RRK K+TATNS+ETALEEDAPG
Sbjct: 361 SSRKVFGSTSRDYEHPKPSDPGCSSLVSEPASLMNVKRRRKWKRTATNSIETALEEDAPG 420
Query: 493 LLQILVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKFPSIR 552
LLQILV KGV +DEIKLYGEMESDDDL ESFSE+SFGELEAVISRLFSQR SFLKFP IR
Sbjct: 421 LLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPIR 480
Query: 553 CTKASRASYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYG 612
CTKASR+SYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYG
Sbjct: 481 CTKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYG 540
Query: 613 YATYFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL 651
YATYFFELV+ LPI WQIKRLVIALKLT+CSRISL+EN+PLL
Sbjct: 541 YATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLL 582
BLAST of Sgr016933 vs. ExPASy TrEMBL
Match:
A0A6J1CKR0 (uncharacterized protein LOC111011878 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111011878 PE=4 SV=1)
HSP 1 Score: 927.9 bits (2397), Expect = 2.3e-266
Identity = 472/586 (80.55%), Postives = 502/586 (85.67%), Query Frame = 0
Query: 73 MNHSISSDELDHLSLEVRQKMLLEKTQRFL----WAIFRLCNDLLFKKEDECCDLQGVSS 132
MNH S DELDHLSL RQKMLLE L I LCND + K+EDECCD+QGVSS
Sbjct: 1 MNHGTSFDELDHLSLVQRQKMLLENKHPLLEDGSKIISPLCNDFIVKEEDECCDVQGVSS 60
Query: 133 VISACDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISCTMAFENPTPPEVPDWV 192
+IS D NLG QQLE Q+ DTSGHLED Y+E ARLN EKQISCTM FENPTPPEVPDWV
Sbjct: 61 MISTRDDGNLGGQQLETQMKDTSGHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWV 120
Query: 193 RVESTGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALG 252
RVESTGIL G L DGVDNF SAGVAVTKVKNE FDDF+EDLDHVV IERLRMLLSR+ALG
Sbjct: 121 RVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALG 180
Query: 253 FMNHHVEGSSGVSSGNLIQCFLKQKEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVIC 312
MN HVEG SG SSG+ +QCFLKQK K MF+N EL G N LHD+ G DAP L PSV+C
Sbjct: 181 SMNQHVEGGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVC 240
Query: 313 SPNTAFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLF 372
SP SGS FSS+ SLNK TESGNDMELKEDD+IC SEKV TELG RLLT+H PE NLF
Sbjct: 241 SPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLTNHAPEANLF 300
Query: 373 NSTKVKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFF 432
STKVKDEPYDHVDGCNL+ KD N+CSRILS+KSET MPDEPYENKVDDMRLQDRMKFF
Sbjct: 301 YSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFF 360
Query: 433 SSRKVFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRRKRKKTATNSVETALEEDAPG 492
SSRKVFGSTSRDYEHPKPSDPGCSSLVSEP+SLMN+K RRK K+TATNS+ETALEEDAPG
Sbjct: 361 SSRKVFGSTSRDYEHPKPSDPGCSSLVSEPASLMNVKRRRKWKRTATNSIETALEEDAPG 420
Query: 493 LL----QILVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKF 552
LL QILV KGV +DEIKLYGEMESDDDL ESFSE+SFGELEAVISRLFSQR SFLKF
Sbjct: 421 LLQFHWQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKF 480
Query: 553 PSIRCTKASRASYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLER 612
P IRCTKASR+SYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLER
Sbjct: 481 PPIRCTKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLER 540
Query: 613 PEYGYATYFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL 651
PEYGYATYFFELV+ LPI WQIKRLVIALKLT+CSRISL+EN+PLL
Sbjct: 541 PEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLL 586
BLAST of Sgr016933 vs. ExPASy TrEMBL
Match:
A0A6J1FLT1 (uncharacterized protein LOC111445382 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111445382 PE=4 SV=1)
HSP 1 Score: 870.5 bits (2248), Expect = 4.3e-249
Identity = 452/579 (78.07%), Postives = 487/579 (84.11%), Query Frame = 0
Query: 73 MNHSISSDELDHLSLEVRQKMLLEKTQRFLWAIFRLCNDLLFKKEDECCDLQGVSSVISA 132
MN ISSDELDHLSL VR+KML E L + + + KKE+ECCDLQG S++ISA
Sbjct: 1 MNRGISSDELDHLSLAVRRKMLQENKLTLLEDESKGISTFV-KKENECCDLQGGSTMISA 60
Query: 133 CDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISCTMAFENPTPPEVPDWVRVES 192
CDARNLGDQQLEAQINDT+GH D YSEGARLN E FENPTPPEV D VRVES
Sbjct: 61 CDARNLGDQQLEAQINDTNGHHMDNYSEGARLNRE-----NTTFENPTPPEVLDRVRVES 120
Query: 193 TGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALGFMNH 252
T IL G L GVDNFA AGVAVTKVKNE FDDFDEDLDHV+LIERLRMLLSRRALG MN
Sbjct: 121 TSILSGTLVAGVDNFAPAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGLMNQ 180
Query: 253 HVEGSSGVSSGNLIQCFLKQKEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVICSPNT 312
HVEG SGV SG+L+QCFLKQK K MFA+ E M IGN LHDK GS APR C PSV+CSPN
Sbjct: 181 HVEGGSGVPSGDLLQCFLKQKAKSMFASEERMEIGNVLHDKSGSYAPRHCSPSVVCSPNA 240
Query: 313 AFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLFNSTK 372
SGS FSS+HSLNKSTESGNDMELKE DKICSSEKVATELG R LT+HVP+ NL +STK
Sbjct: 241 TLSGSYFSSNHSLNKSTESGNDMELKE-DKICSSEKVATELGSRHLTNHVPQENLLSSTK 300
Query: 373 VKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFFSSRK 432
VKDEPYDH +GC++Y KD N+ S LSIKSET MPDEPYENKVDDM LQDRMKFFSSRK
Sbjct: 301 VKDEPYDHGEGCSIYGKDMNNVYSNTLSIKSETTMPDEPYENKVDDMPLQDRMKFFSSRK 360
Query: 433 VFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRRKRKKTATNSVETALEEDAPGLLQI 492
G TS DYEHPKPSDPGCS LVSEP + N K RRK+KKTATNS+ETALEEDAPGLLQI
Sbjct: 361 DIGFTSMDYEHPKPSDPGCSVLVSEPVNFPNTKRRRKQKKTATNSIETALEEDAPGLLQI 420
Query: 493 LVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKFPS-IRCTK 552
LV KG+++DEIKLYGE ESDDDL ES SEDSF ELE VI+RLF QRHSFLKFPS IRC K
Sbjct: 421 LVEKGIQVDEIKLYGETESDDDLDESSSEDSFRELEDVITRLFPQRHSFLKFPSIIRCMK 480
Query: 553 ASRASYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYAT 612
ASRASYCLACLVSLIEQTRYLHFR+WPVEWGWCRDLQSFIFVFERHKRIV+ERPEYGYAT
Sbjct: 481 ASRASYCLACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYAT 540
Query: 613 YFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL 651
YFFELV+SLPI WQIKRLVIA+KLT+CSRISL+EN+PLL
Sbjct: 541 YFFELVESLPISWQIKRLVIAMKLTNCSRISLLENRPLL 572
BLAST of Sgr016933 vs. ExPASy TrEMBL
Match:
A0A6J1CJF8 (uncharacterized protein LOC111011878 isoform X3 OS=Momordica charantia OX=3673 GN=LOC111011878 PE=4 SV=1)
HSP 1 Score: 864.0 bits (2231), Expect = 4.0e-247
Identity = 435/526 (82.70%), Postives = 462/526 (87.83%), Query Frame = 0
Query: 129 VISACDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISCTMAFENPTPPEVPDWV 188
+IS D NLG QQLE Q+ DTSGHLED Y+E ARLN EKQISCTM FENPTPPEVPDWV
Sbjct: 1 MISTRDDGNLGGQQLETQMKDTSGHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWV 60
Query: 189 RVESTGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALG 248
RVESTGIL G L DGVDNF SAGVAVTKVKNE FDDF+EDLDHVV IERLRMLLSR+ALG
Sbjct: 61 RVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALG 120
Query: 249 FMNHHVEGSSGVSSGNLIQCFLKQKEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVIC 308
MN HVEG SG SSG+ +QCFLKQK K MF+N EL G N LHD+ G DAP L PSV+C
Sbjct: 121 SMNQHVEGGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVC 180
Query: 309 SPNTAFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLF 368
SP SGS FSS+ SLNK TESGNDMELKEDD+IC SEKV TELG RLLT+H PE NLF
Sbjct: 181 SPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLTNHAPEANLF 240
Query: 369 NSTKVKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFF 428
STKVKDEPYDHVDGCNL+ KD N+CSRILS+KSET MPDEPYENKVDDMRLQDRMKFF
Sbjct: 241 YSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFF 300
Query: 429 SSRKVFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRRKRKKTATNSVETALEEDAPG 488
SSRKVFGSTSRDYEHPKPSDPGCSSLVSEP+SLMN+K RRK K+TATNS+ETALEEDAPG
Sbjct: 301 SSRKVFGSTSRDYEHPKPSDPGCSSLVSEPASLMNVKRRRKWKRTATNSIETALEEDAPG 360
Query: 489 LL----QILVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKF 548
LL QILV KGV +DEIKLYGEMESDDDL ESFSE+SFGELEAVISRLFSQR SFLKF
Sbjct: 361 LLQFHWQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKF 420
Query: 549 PSIRCTKASRASYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLER 608
P IRCTKASR+SYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLER
Sbjct: 421 PPIRCTKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLER 480
Query: 609 PEYGYATYFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL 651
PEYGYATYFFELV+ LPI WQIKRLVIALKLT+CSRISL+EN+PLL
Sbjct: 481 PEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLL 526
BLAST of Sgr016933 vs. ExPASy TrEMBL
Match:
A0A6J1JZL8 (uncharacterized protein LOC111489311 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111489311 PE=4 SV=1)
HSP 1 Score: 845.1 bits (2182), Expect = 1.9e-241
Identity = 441/579 (76.17%), Postives = 481/579 (83.07%), Query Frame = 0
Query: 73 MNHSISSDELDHLSLEVRQKMLLEKTQRFLWAIFRLCNDLLFKKEDECCDLQGVSSVISA 132
MN ISSDELDHLSL VR+KML E L + + + KKE+ECCDLQG S++IS
Sbjct: 1 MNRGISSDELDHLSLAVRRKMLQENKLTLLEDESKRISTFV-KKENECCDLQGGSTMIS- 60
Query: 133 CDARNLGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISCTMAFENPTPPEVPDWVRVES 192
RNLGDQQLEA+INDT+GHL D YSEGARLN E FENPTPPEV D VRVES
Sbjct: 61 ---RNLGDQQLEAEINDTNGHLMDNYSEGARLNRENS-----TFENPTPPEVLDRVRVES 120
Query: 193 TGILPGFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALGFMNH 252
T IL G LA VDNFA AGVAVTKVKNE FDDFDEDLDHV+LIERLRMLLSRR+LG MN
Sbjct: 121 TSILSGTLAARVDNFAPAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRSLGLMNQ 180
Query: 253 HVEGSSGVSSGNLIQCFLKQKEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVICSPNT 312
HVEG SGV SG+L+QCFLKQK K MFA+ E M IGN LHDK S APR C PSV+CSPN
Sbjct: 181 HVEGGSGVPSGDLLQCFLKQKAKSMFASEERMEIGNVLHDKSVSYAPRHCSPSVVCSPNA 240
Query: 313 AFSGSCFSSDHSLNKSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVPEVNLFNSTK 372
SGS FSS+HSLNKSTESGNDMELKE DKI SS+KVATELG R LT+HVP+ NL +STK
Sbjct: 241 TLSGSYFSSNHSLNKSTESGNDMELKE-DKIYSSDKVATELGSRHLTNHVPQANLLSSTK 300
Query: 373 VKDEPYDHVDGCNLYDKDTKNICSRILSIKSETIMPDEPYENKVDDMRLQDRMKFFSSRK 432
VKDEPYDH +GC++Y KD N+ LS+KSET MPDEP+ENKVDDM LQDRMKFFSSRK
Sbjct: 301 VKDEPYDHGEGCSIYGKDMNNVYGNTLSLKSETTMPDEPFENKVDDMPLQDRMKFFSSRK 360
Query: 433 VFGSTSRDYEHPKPSDPGCSSLVSEPSSLMNIKHRRKRKKTATNSVETALEEDAPGLLQI 492
FG TS DYEHPKPSDPGCS LVSEP + N K RRK KKTATNS+ETALEEDAPGLLQI
Sbjct: 361 DFGFTSMDYEHPKPSDPGCSILVSEPVNFPNTKRRRKEKKTATNSIETALEEDAPGLLQI 420
Query: 493 LVGKGVEIDEIKLYGEMESDDDLHESFSEDSFGELEAVISRLFSQRHSFLKFPS-IRCTK 552
LV KG+++DEIKLYGE ESDDDL ES SEDSF ELE VI+RLF QRHSFLKFPS IRC K
Sbjct: 421 LVEKGIQVDEIKLYGETESDDDLDESSSEDSFRELEDVITRLFPQRHSFLKFPSIIRCIK 480
Query: 553 ASRASYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYAT 612
ASRASYCLACLVSLIEQTRYLHFR+WPVEWGWCRDLQSFIFVF+RHKRIV+ERPEYGYAT
Sbjct: 481 ASRASYCLACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFQRHKRIVMERPEYGYAT 540
Query: 613 YFFELVDSLPIDWQIKRLVIALKLTSCSRISLIENKPLL 651
YFFELV+SLPI WQIKRLVIA+KLT+CSRISL+EN+PLL
Sbjct: 541 YFFELVESLPISWQIKRLVIAMKLTNCSRISLLENRPLL 568
BLAST of Sgr016933 vs. TAIR 10
Match:
AT5G16610.2 (unknown protein; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )
HSP 1 Score: 277.3 bits (708), Expect = 3.1e-74
Identity = 220/611 (36.01%), Postives = 305/611 (49.92%), Query Frame = 0
Query: 81 ELDHLSLEVRQKMLL--EKTQRFLWAIFRLCN-DLLFKKEDECCDLQGVSSVISACDARN 140
E DHL L R+ +LL E+ + + A N D + K+E+E C + V+S CDA
Sbjct: 22 EEDHLPLTSRRSLLLSSERVSQRIAAYVPASNVDSVLKREEEDCFNE--LGVVSNCDATE 81
Query: 141 LGDQQLEAQINDTSGHLEDCYSEGARLNIEKQISCTMAFENPTPPEVPDWVRVESTGILP 200
++ +N I C+ + D + +
Sbjct: 82 SVSTEILESMN---------------------IGCSQGLK--------DSGNIRPQNNIL 141
Query: 201 GFLADGVDNFASAGVAVTKVKNETFDDFDEDLDHVVLIERLRMLLSRRALGFMNHHVEGS 260
G ++ V+NF G ET + +DL+H+ L ER +MLL R A+ +VE +
Sbjct: 142 GCCSNAVENFNRVG--------ET--ERSDDLEHLTLKERRKMLLERVAIRLPESNVEDN 201
Query: 261 SGVSSGNLIQCFLKQKEKFMFANGELMGIGNALHDKIGSDAPRLCIPSVICSPNTAFSGS 320
+ + K K + NG G + LC ICS + + G
Sbjct: 202 TEDCDETEL---YKIKAEISCENGIASSSGVQFSGFLEKIDSVLCRNFSICSESGSQLGG 261
Query: 321 CFSSDHSLN--KSTESGNDMELKEDDKICSSEKVATELGPRLLTDHVP-EVNLFNSTKVK 380
SD ++ +S + + L E + SS K + R+ + +P N ST+VK
Sbjct: 262 IQESDIPISPERSFDLSPEASLPE---VSSSNKNPRKRVKRVQRNPLPLNENEIQSTQVK 321
Query: 381 DEPYDHVDGCNLYDKDTKN-ICSRILSIKSETIMPDEPY-ENKVDDMRLQDRMKF----- 440
+P + C + D D KN + S+ + +K E E EN++D ++L R+
Sbjct: 322 VDP---LADCVMEDNDEKNPVTSKQIPVKREVETHGEALDENELDSVKLSFRLNRCTSAP 381
Query: 441 --FSSRKVFGSTSRD-----YEHPKPSD---------PGCSSLVSEPSSLMN-------I 500
F K T+ + +H K D G ++ PSS + +
Sbjct: 382 TPFRCMKNEAETASEMDEDIIDHMKLIDRLKLRSFHGSGHHEDLNSPSSGFSFCTSDEYV 441
Query: 501 KHRR-----KRKKTATNSVETALEEDAPGLLQILVGKGVEIDEIKLYGEMESDDDLHESF 560
K R KRKKTAT+S+ETALEEDAPGLLQ+L+ +GV +DE++LYG D +S
Sbjct: 442 KPSRVFRPWKRKKTATDSIETALEEDAPGLLQVLIQQGVTVDELRLYGNEGGDVPSDDSL 501
Query: 561 SEDSFGELEAVISRLFSQRHSFLKFPSIRCTKASRASYCLACLVSLIEQTRYLHFRSWPV 620
+SF ELE VIS+LF +R + K + +KASR SYCL CL SLIEQ RYL FR WPV
Sbjct: 502 LNESFSELEDVISQLFYKRETGTKLLNSSFSKASRTSYCLTCLFSLIEQARYLQFRKWPV 561
Query: 621 EWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDSLPIDWQIKRLVIALKLTSCS 651
EWGWCRDLQSFIFVFERH RIV+ERPEYGYATYFFEL ++ I WQ+KRLV+A+KL SC
Sbjct: 562 EWGWCRDLQSFIFVFERHNRIVMERPEYGYATYFFELSNTASIRWQVKRLVLAMKLASCG 582
BLAST of Sgr016933 vs. TAIR 10
Match:
AT5G16610.1 (unknown protein; Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )
HSP 1 Score: 256.1 bits (653), Expect = 7.4e-68
Identity = 140/251 (55.78%), Postives = 169/251 (67.33%), Query Frame = 0
Query: 401 IKSETIMPDEPYENKVDDMRLQDRMKFFSSRKVFGS-TSRDYEHPKPSDPGCSSLVSEPS 460
+K+E E E+ +D M+L DR+K R GS D P C+S E
Sbjct: 193 MKNEAETASEMDEDIIDHMKLIDRLKL---RSFHGSGHHEDLNSPSSGFSFCTS--DEYV 252
Query: 461 SLMNIKHRRKRKKTATNSVETALEEDAPGLLQILVGKGVEIDEIKLYGEMESDDDLHESF 520
+ KRKKTAT+S+ETALEEDAPGLLQ+L+ +GV +DE++LYG D +S
Sbjct: 253 KPSRVFRPWKRKKTATDSIETALEEDAPGLLQVLIQQGVTVDELRLYGNEGGDVPSDDSL 312
Query: 521 SEDSFGELEAVISRLFSQRHSFLKFPSIRCTKASRASYCLACLVSLIEQTRYLHFRSWPV 580
+SF ELE VIS+LF +R + K + +KASR SYCL CL SLIEQ RYL FR WPV
Sbjct: 313 LNESFSELEDVISQLFYKRETGTKLLNSSFSKASRTSYCLTCLFSLIEQARYLQFRKWPV 372
Query: 581 EWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDSLPIDWQIKRLVIALKLTSCS 640
EWGWCRDLQSFIFVFERH RIV+ERPEYGYATYFFEL ++ I WQ+KRLV+A+KL SC
Sbjct: 373 EWGWCRDLQSFIFVFERHNRIVMERPEYGYATYFFELSNTASIRWQVKRLVLAMKLASCG 432
Query: 641 RISLIENKPLL 651
R LIENKPLL
Sbjct: 433 RYQLIENKPLL 438
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022141525.1 | 8.5e-268 | 81.10 | uncharacterized protein LOC111011878 isoform X2 [Momordica charantia] | [more] |
XP_022141523.1 | 4.7e-266 | 80.55 | uncharacterized protein LOC111011878 isoform X1 [Momordica charantia] | [more] |
KAG6579038.1 | 1.0e-249 | 78.24 | hypothetical protein SDJN03_23486, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
XP_022939493.1 | 8.8e-249 | 78.07 | uncharacterized protein LOC111445382 isoform X1 [Cucurbita moschata] >XP_0229394... | [more] |
XP_022141526.1 | 8.3e-247 | 82.70 | uncharacterized protein LOC111011878 isoform X3 [Momordica charantia] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1CIB2 | 4.1e-268 | 81.10 | uncharacterized protein LOC111011878 isoform X2 OS=Momordica charantia OX=3673 G... | [more] |
A0A6J1CKR0 | 2.3e-266 | 80.55 | uncharacterized protein LOC111011878 isoform X1 OS=Momordica charantia OX=3673 G... | [more] |
A0A6J1FLT1 | 4.3e-249 | 78.07 | uncharacterized protein LOC111445382 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1CJF8 | 4.0e-247 | 82.70 | uncharacterized protein LOC111011878 isoform X3 OS=Momordica charantia OX=3673 G... | [more] |
A0A6J1JZL8 | 1.9e-241 | 76.17 | uncharacterized protein LOC111489311 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
Match Name | E-value | Identity | Description | |
AT5G16610.2 | 3.1e-74 | 36.01 | unknown protein; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae ... | [more] |
AT5G16610.1 | 7.4e-68 | 55.78 | unknown protein; Has 30201 Blast hits to 17322 proteins in 780 species: Archae -... | [more] |