Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: utr5polypeptideCDSutr3
Hold the cursor over a type above to highlight its positions in the sequence below.
GTAGTGTTTGAGCTTAGTTGTGTCACTCAAACATGGCGGAGTAATTACACGAGGGAGGGGGTTAGAATTCGGGAGCTTGTAAAATTCATCACTCTCTCTTCAACATTCAGTTTCGTTCTTGACGGATCTGCTTCTGCTCATTTTCCGGAAGCTTGCTTCTACCCGATTCTCTTCCCGCTTGACACTTCACCATGTCGCCTACTCCCGAGGAGCCTAATTTGCAGAACGGAAACCAAACCCAACCGCACATTCCGTCAGAATCACAGCAAACTGAAGAATCCGGATCGGACCCAGAATCCAGAGTTGCCACAATTCCCCAACAGCAACGCGAATCAGAATCGGTTGATGAAGAAGCAGATGCGGAGCCTCGATCGGAGCCGGAGTCTCGGAGGGAACAGTCGTCGGAGTCCATCCAGTTGCAGGTGGTGACGGATGCCACAGATCCCAGGTCCGGTGATCCCGAGGAAGCCTCGATCCCGTCCAACGGCGCCGACAACTCGCATCCCGCCCTGCGGAAGGACGAAGGAAGCCGGACGTTCACCATGAGGGAGCTGCTGAATGGATTGAAAGGTGATGATGGTAACGACAGCGTTAATGAATCGGAAGGCGAGAGGCCCGAGGCTAACTCCGGTCACAGGTTCGTTTCTTCGTTCTCTTTTGTTGTTCTTCATGTTAATTTTACTTTAGGATACTGTATATAGTTGATTGGTGACTGAATTGTTTCTAATAATGCAGTATACTCTTCAATTATTTGGAAATGAATGATGAATTATGAATGTTTAACCTGGTTTTTTTTATCTTCTCCTTCCACCGGTGACCGGAGAAAAAGCGATCACTTCTGTTAGAGGGAGAGTGGTTTTACGATCGTTCTGTTCTTTCAAGCCTGAGGGTTAGACATATTCCTCCCTATTTTTGATAAACAAGTTATTGGGTAGTTTTCCTTCAAAGAAGAATATTTTAGTCATAAAAATCAAAGGATTCGTATGTGAGCAAGTCAATTTCCAAAAGTCAAAGTTGATCTAAAGCTATCCCACGAGCTTTCTTGGCGCCAGCAAGGTTTCTTCTCTTTAACGATAATTTTCATCTCCTCTGATAGACCCCCAGCAAATTGTTGGAAGCCGACACTTTAAAAATATCTAAGGTGTTTTTCTTCCTTTCAGGAGTCCACTCTTTATTTGTCTTCCTATCAGGGTAAGAAACACTGATTGCTTTGAACGATCGAGTGTTGTGGTTTTTGTTCGGCCTCAAAACTGTCCCAACATGACCCACTAACACTCTTAAGCATTGTGAACACTATATTCTCATCCAAATCTTTTTACTAATGCATTTTCCTGGATTGCTATCTTTTGCTCTCTTATCCCCTGTCCTTTTCTCACAGTAGTATCTTCTCTTTTTCTATCTCTTTGTTCCATCCTCTACCTGTAGTTTCAATTTTAGGATTTTTGGAATTAATCAATTCGTTTTTGGATATCCCTGTCCATTTCTTTTCGCATACTAATCAATTAGTTTGATTTTCTCCCATTTGATTTCGGTGTCAATTAAGAAAATGCTCATTATGATCAGTGGAGCCTTTTTCTTGCTATACATGACAATAGAGTTTTGCTGTTATCACTGTCCGTCTCAAGAGCTGTATTCTTTTTTTTACTGTATGAATTTGCACAGTTTTCTTATCAAAAAAAGAATTTGCACAGTTTTACTCAAGATAGCCCACATCAGCCTTATTCTGAACAGAGCAGAGCTGCTATGGAGTTGATCAATAGTGTTACAGGTGTTGATGAGGAGGGTCGTTCTCGCCAACGCATTCTCACATTTGCTGCTAGGAGGTATCATCACAATACTTTTTGTAGTTGTCTACCAACTAGACCTCTGCATTTCTCTTGGAGGTTATGGTAATTCAAACTTTGAATTCCAATTGAAATAGTTGCCCAGTTGTTTTTTCCCATTTCTTCCAATTGGAGCTTGTGTTTAAGCTCTTGTTGTGTGATTGGGATGCATTGTAAGAATTGGACTGGGATTTTCACGCAACAAAATAAAAATGAAGAAAAGGGAAACTCGTGCGACGTGCATACACAAGTGTTGGTTAAGAAGACAATATACAATACTATGTGTATGCTTGTTTATATTTATATTGATTACCATGGGTATTTAGATTAGGTTTCATTTCATGCAAAGCGATATGGGATTAGCTGATTTCATAATACTTAGATTTGATACTTATACTGTATTGATTCGTTTCCATAACCATAAAAAAATCATTCTTAATTAAAATTATGTGTAGAATATTTACATTCCAGTTAGGTGGCCCCATTGTTTTCAAAGGGGCATGCCTAAGTGCAAAGCACAGATGGCATAGGTGAGATTCAAGACAGTCACCTCAAGATATCTTGGGCATGTCGCTTGGAGATGACTATAGGAGGGTAGAGCTCATGAAGGCTCACATTGTTTCCCATTGAATTATGATTAGGCTTCAAAGGATTATTACCCCTTAGTGAACTTTGGGCTTTAGACGAACCATGGGTAGCACCAAGAATTGAATTATGATGTCCAAGACATATGAGTTACGTTTGCCACCATGGTACCGTTGAAAATGCCCAACAAGTACATATCAACTTTTATTTTAGTTTTTTGAAATGAAGGCAGGTATCTACAAACCTTTGAAAATATGATAAAATGAACGTTAAAGAAAGGTCAAGAATTAATTTTTGCCTTTTGATCATCTTTATTGAAAATATATATTTTATAGTAATCCATTTCTTTATAAAAATAATTTGCCACTCCTATAGATGTCACTTGTTTCCAATCATCTAATTTCCGTTTAAATATTGCAGGTATGCTAGTGCAATTGAGAGAAATGCTCAAGACTATGATGCTCTATACAATTGGGCATTAGTCCTCCAGGTTTGAGACTTCATATTATTTCTTTTGGGAAGGTGGTTCTTTTCCTTTTCCTTTTTATCTCTTGATCAGTAATTTTGTAGAGTGCAGGGAGATATCATTGAGTCATTGACAGGTCATTAATATAGCTCCAGCTCAATTGAATTAAGTGTTTGGCATGTGTTTGATATAAAAGAAAATATAAAAAAAAAAATATAACTAGTTAGCTGACACCTTCCAGTACAGTACCAATATTTTCTGCAGAAAGACTACAGGGATGTAAAATGCAATGAAACATTTAGTTTTTTCACATAGTACAAGTAAAACTTTGTTAGAATACACTATAAAATGTACAGATCTTTATTGGAAGGCTTTCTTGTAATTAGCCACCATGGCTTTTGGTTATTTCATTCTATCAATGAAATACTTGTTTCTTACTGAAAAAAAATCTACAGATCTCTTTTCATTCTTTTGTAAGCAATATCACAATAAACTCTTTATTAATCTGGTGTTGCAGGAGAGTGCAGATAATGTTGGTGCAGATTCCTCTTCACCTTCTAAAGATGCGTTGCTTGAGGAGGCTTGTAAAAAGTATGATGAGGCTACCCGTCTTTGCCCAACACTTCATGATGTATGTAAAAAAAACTTTTTTAAACTTTTACTACTCAGGATGACGTTGCAAGAAATGTGAGCAGAAGATGGGGGAAAATGTTTTGTTAATGACAAGATGGGGAAAAATTGAAAGGAAAAGATGGACAGATTTTGAAGAGTTTCTAATATTGATTCTGCCAGAAAAGAAGTTATTTAGAATATAGAGTTATTTTTTCGTTCTTTTTTCAAGATGGATCAAAATGGATTTTCCTCCGGCCAATTTTTCACACCAGTAATTATTTATTCCTGCTAAGGTTTGATAACCTATTACGAAAAATGAGCCTTCTTCTGAACTTGGAACTAGCAGATAAAGAAAATTATTGTATGCATTTTTACATGACCTGCAAAATTTATTGTTGGTGCTATTTCAATGATATTATGTATTCGGATTGCATCCCAGGCTTTTTATAATTGGGCTATTGCAATCTCTGATCGGGCCAAAATTCGTGGTCGTACAAAGGAGGCTGAAGAACTATGGAAGCAGGTTTATGCTTCCATCCCTTGTGTATCTATCATTTGGGAATGTCTATGACTGGAAAGGGATGGGCTCTTGCCATTCTATTTTTTTCTGGGATATATTAACGTTTCCTTTGGAAGGGAAAAATATTTTATAATTTAGCTATCAACTGTGTTATTGTGATGGCTGTGATCCAATGGACACTCCATGTCCTCTCCAATTTGGTGTTATTTTCCTCTCTTTTTGGTGAAATGTATCTTTTATCAAATTTCCTTTTGTTCCTTAATATTATATCATCAATTTCATTACTCAGTGTAAAAATTAGTAGCCTCCATTTTAGGCTGAAAATTCAAATTTCCACCCCTATATTAGTAATTTTATATTTTCAGAAAAAAAAAGAGAATACCTGCTTTCTCTCAAAAGTATCATTTGGGTTATTAACTGTATAAAAGGTCTAATTATATTTAATCATCAACTCTTGTGGGATGCTGGTAAAACATATAAAATATAAGCAATTCTTTTATATTAGATAAACCAAGTTGACAATAAGATCAATTTATTGAGGGACTGAATGGAACTTTTTAAAATTCATCCACAAAAATAATTGATATATAATTTGGTCAGTAATTACTGTTTCCTTTTTTTCTGAATTAATCATTTCAATTATTCCATTTTTCAGGCTACCAAAAACTATGAAAAAGCTGTTCAACTCAATTGGAACAGTCCCCAGGTACATGTCAAGAAATTGCTAGAAAACCCATATGTGCTTAAATTTGCCCCTCTGGGATGTTCTTCTTCCCATGGAATGTGTTCTCCGTTGTGAAATGGATTCTATTAAACCTCCATAACTTGAAATAAATCATATAAATTTATTTTGCTAATACCTTAAGGCTAACTCTTTTTGGATCTATATCAGGCGCTAAATAATTGGGGACTCGCTCTACAGGTACTTATGTTATGTTATGTTATGTTATGTTTTTCACATTAGTTTACTCTTTGAATTTAAAGGCTCTCGCCGGCCCACCTTCACGTATGCAACCACAGTATGTGTAAGAAACTTGTTTTTATTGAATATCATGATTAATTTTAGGAACTCAGTGCGATTGTGCCAGCACGAGAAAAGCAGACAATTGTAAGAACAGCTATCAGTAAGGTAACACTACTATTTGGCAGCATCTAAGTACAAAATAATTGCTGCCTTTCACATAAAACTTACTGTTACGTTCCATTCTTGATAACTGAGTTGTCTTCCATTACAGTTTCGTGCAGCAATACAGTTGCAATTTGATTTTCATCGAGCAATCTACAATCTTGGTACTGTTCTGGTGAGTCTGTTGCCTGTCCTTCTGTATATGGAAATTACATGATCTTACATCTGTCTAAAGCGCTGTGTTTAGAATGACTTCATCAACATGTGTTGTAGAGATGTATTTAAGAACTTCTCAAATTCTGTCTATATAAGTGATAGTGACAATGGAATAGGGGATTTAACTCTTGATTGTTTCACAGATCAGATTATATAAATTTGCTCAAATCTTAAACTGATTTCTGGCATCTACAGTATGGACTAGCTGAGGACACATTAAGGACCGGAGGAACAGGAAATGCTAAGGATGTTTCCCCTAATGACTTGTACAGCCAATCTGCTATTTATATCGCAGCTGCTCATGCTCTAAAACCAAATTACTCTGTAAGCCTGGTCCTTTTGTCTCGAATAATTTCTGTGTTGTTGAGCCTGATGTAATAACGGAATATTATGTTCCGTTCTGCTTCTTTTACATCTTTACCAACGACTGGACAAGTATTTGTGCACAAGTATGATAGTGTTTTAAGCATTGATGTTCGTGGAAAGAAAACAAAGACATTTCAAATACTATACAAAAGGCTTTAACTCTTAAATTTTCTCCCATTTCTCTTGTTTTCTGATTAGTCAATTTCTTTTCTTCATTTAAAAGTTTGAATTGAAATTTTGATGTTTTTTTTATTAGGTTTACAGCAGTGCCCTGCGGTTGGTTCGTTCAATGGTTAGTTCTGACTTATCAAATAATGTACATCACAATTGATCATGAACTTGAGCCATATTTCAGATCATCAATGAAATCGTTTCTTATGAATAAAAATGTTATTTTACGTTCCTTCGAGCATTTTATCTGTAATTTTAAAGTTTTCACGAAATCTGTGACTTGTAAATCAGAAGTTATCATGTATTCTGAAGTACGTGTGCTTTTCAGATTACTACTTACTGGTTACATCCCTTGATCAGGAAATATTTTTGGTTGCAATGAACTTAATGAAACTTGACTTTTTGACAGCTGCCGTTGCCGTATCTAAAAGTTGGATACCTGACTGCACCTCCTGTGGGGAGACCACTTGCTCCTCACAGTGATTGGAAACGTTCACAATTTTTTCTAAATCATGATGTATTGCAAAAGGTACCCATTTTTTAAAATCAAGTTAACCACTGCATTTATGTGATTTGGTTGAGACTGATTTTAACCTACTTGTATCCTCTACACCCAATCCTCTTTCAGCTTAACATAGGGGGGGAGCAAATACAAACATCACCTAGCTTGAATGGCGAGAGGACAATCAAAGTAGAAATTCCAGATATTGTCTCAGTATCAGCATGCGCAGATCTAACTCTACCACCCGGTGCTGGACTCTGCATTGACACAATCCATGGACCAGTTTTCTTGGTGAGCATATTCTAGCGGCATTACATCGTACATGTATAGAATTTTGCTCCTAAAGAGTGATTAAAAAAACAAAAAAGCTAAGGCCTCGTTTGATAACAATTTCATTTTTTCTTTTTTCTTTTAAAAAAAGCTTAGGAACATTACCTATATTCATTTTTTTTGTTATATAGTTTCTATGAACATTTTCAAAATTTAGGCCAAATTTTGAGGGGAAAAAAAGTAGTTTTTGTTTTTGAAATTCAGTTAAAAGAACTCAAATATATTTTCAAGAGAGGTGAAAAATTTACTAATGAAATTGTACAAAAACAAGCATAATTTAAAAAAAAAATAAAAAAACAAAAACACAAAAACAAAATGGCTATCAAATGGACCCAAAATGGTACAAAATGCTATGCAACTGGTTTCAGTTTTGCCCTTCATATTTTAAAAAATGTCAGTTTTATCTCATTCTTTTAATTATTACAAAAGAAACCATAGGTCTAGCAATTTTTATTATAATTGAGACTTGTAAAAATATTAAGTGCAAATTGAAACCAAATAAATCATACAAAATGCCCCTATCATATGAGCTTGGTCTTTTATATTTTTGTTACACATATCAACAGAAATTGCCAAGAAAAGGTTGGGGTTTCTTAAAAAAAAAAAAAAAAATTGAAGGTAAAACCGAAACTTCTAAAAGGTTCGGGACAAAATTGAAACCAATTCCTATGATCATATCAAAAATGATCTTTAGAACATCCAATTTATCGAACTTTACTGAATTCCAAAGATCATTATGTCTGGTCTGTACGAGTTACTGCTATCAGAGTGGAGTGCATTGATCACATGTAATTTCCCATGGTTTCCTTTTGTTACAACTTACGTAGTACTGCATGTGCATCCTTTGTTCTAGGGAACATGAAACATGCATTATTTTGCCCAATAACTATTAACTTATATGATCACCTTGTTGTTCTGTCAGGTCGCTGACACGTGGGACGCGCTTGATGGATGGCTTGATGCAATTAGATTAGTTTACACAATCTATGCTCGAGGCAAGAACGACGTTTTGGCTGGCATCATAGCAGGCTGATTATTACCAAGTACGCAAATGTATCATCAATATTATCTTTATGTCTATATTATGCTTACTCACAGTAGATTTGAGTATCCATTCCTCTAAATTGAAACACAAAATTTTGGAGTACTTTCCAGTGCTTAATACATGTTCTTTTAGTCAGATTTCCTTTCAACCTTAATAATCTATGTACATGTCGCTTGTTAACAGCACGCACAAGTGGTATCGAATTAATTTCATTGAAAAATTCAATTGAATTATTTTGGTAATTTTATCAGTAGATCTTGAAAACAATAGTTTATAATAAAATACACTCGCACACAATATTGTTTCTGGATTGATGATTTAGACCCTGGGAGACAACGAACAAAGGAATAGTATTTCTTTCTTTCTTTTGTGTGTGTGTGTGTAATTAATCTTTGCAACCTGATTATTATTTTTTTTTTCGTTTTTTGATTAAATATAAAATAATTCTTTGATTGGTATTGGTATTTTATGGTGAGTGAAAAGAGTAATTACTGTAACACCCATGATCAGGAGAGGTCAGTACACCAACCACAATGGTGTAAAATAATAATGTTTACAGTTGAAAAAGGAAGCAGCTGGAAAGACTTCCGTGTATTGCTGGTACAACAAAGCTCATGGCGGCTGCCAAACTTGGAAATGGGTAATGGACTTTTGATACGCGCTGACGTGATATCATTGATTTGTACGAGTTTTCTTGATAATTAAATGTCCTAAGTTTAAGATATAGTTGTTTCTCAATATTAGTTATTGTGTGCATATGAGGTATTTTCTTCGTTTTCCCTCAAAATGCTTTATGAGCAGCTTGATGACATTTGTTTAATAAAGTCATTCAAGTTGGTCGGATTGACCTTGTATAATTACAGGGCTTAGTTAAGTAGAGCTGGAACTGGGGTTGCCCAATGCATCATTTCTTTCCACCTTATTAACCTTATTGGCATTAGTAGGTAGGATTGATAGATTTCTCCTCGGTTGTTGATGATAACTGCTTTGATTATAATTGAGTGGCTAGAAAGAATATATAATGCGCTTCATAGTTCAGGACTAAATAGTACATCGGGTTTACTAGGTTCGACCATAGCAGGGTCCAATCGTTCCCAAAAGAAGAGCTTGATCGAGTATCTTTGACGATGCAGAAAGCTTAGAACCTTACTAGATGATTGGCAAATGTTAATGTTATTAGTTAAGATTATTGTTTCCAAATCTGTTATTTTAGAAGAAAAACAAACACATGCGAGAACATACAGAACGTTAATCATCTTTGTGAAGCAGTTATCGGAAGAAGTTTTCATAGGTCGAAATCCGATCGTGGAATAGGAGAGATGTTGGTCCAAAATTCTGTTTGTTTGATAGTTTTTAAGGCTGAAAATGGAGGTGGGGATGAAGTTGAAGATAGAGAAGAAGTTAAGGGAAGACAAAGGGAGGTGTGGTTGCTGGTTGGTTAGATAGGCCATTGGGTTTGCATAGTATCTGTACAGTCAATCAGTAGTAATAGAAGAAAGAGGGAGAAGATGAAGAAGAGAGAGAAAGAGAGTCATTTATGTGGATGTCTCATGTGTTCTATTCTTCTACAGAAACAAGTATTGTTTGACAATGAGTATGTCCTTTGTTTCTATTCCCATTACACTTCACTTTAACTACACGTTGTTTTGCGGAGATTCGAAGTCCGATATTACTCTTATTCATTATCAGCTATGTGAACGTTATGAAAAATTTATCAGTTATGTCAACGTTATGAAAAATTTATAGTGTGAATAGATGTAGTAGATATTATAAGAATATTCCATTTCGTTGGGAATTTAAAGTTCAAACTTATCAC
mRNA sequence
GTAGTGTTTGAGCTTAGTTGTGTCACTCAAACATGGCGGAGTAATTACACGAGGGAGGGGGTTAGAATTCGGGAGCTTGTAAAATTCATCACTCTCTCTTCAACATTCAGTTTCGTTCTTGACGGATCTGCTTCTGCTCATTTTCCGGAAGCTTGCTTCTACCCGATTCTCTTCCCGCTTGACACTTCACCATGTCGCCTACTCCCGAGGAGCCTAATTTGCAGAACGGAAACCAAACCCAACCGCACATTCCGTCAGAATCACAGCAAACTGAAGAATCCGGATCGGACCCAGAATCCAGAGTTGCCACAATTCCCCAACAGCAACGCGAATCAGAATCGGTTGATGAAGAAGCAGATGCGGAGCCTCGATCGGAGCCGGAGTCTCGGAGGGAACAGTCGTCGGAGTCCATCCAGTTGCAGGTGGTGACGGATGCCACAGATCCCAGGTCCGGTGATCCCGAGGAAGCCTCGATCCCGTCCAACGGCGCCGACAACTCGCATCCCGCCCTGCGGAAGGACGAAGGAAGCCGGACGTTCACCATGAGGGAGCTGCTGAATGGATTGAAAGGTGATGATGGTAACGACAGCGTTAATGAATCGGAAGGCGAGAGGCCCGAGGCTAACTCCGGTCACAGAGTTTTGCTGTTATCACTGTCCGTCTCAAGAGCTGTATTCTTTTTTTTACTGTATGAATTTGCACAGTTTTCTTATCAAAAAAAGAATTTGCACAGTTTTACTCAAGATAGCCCACATCAGCCTTATTCTGAACAGAGCAGAGCTGCTATGGAGTTGATCAATAGTGTTACAGGTGTTGATGAGGAGGGTCGTTCTCGCCAACGCATTCTCACATTTGCTGCTAGGAGGTATGCTAGTGCAATTGAGAGAAATGCTCAAGACTATGATGCTCTATACAATTGGGCATTAGTCCTCCAGGAGAGTGCAGATAATGTTGGTGCAGATTCCTCTTCACCTTCTAAAGATGCGTTGCTTGAGGAGGCTTGTAAAAAGTATGATGAGGCTACCCGTCTTTGCCCAACACTTCATGATGCTTTTTATAATTGGGCTATTGCAATCTCTGATCGGGCCAAAATTCGTGGTCGTACAAAGGAGGCTGAAGAACTATGGAAGCAGGCTACCAAAAACTATGAAAAAGCTGTTCAACTCAATTGGAACAGTCCCCAGGCGCTAAATAATTGGGGACTCGCTCTACAGGAACTCAGTGCGATTGTGCCAGCACGAGAAAAGCAGACAATTGTAAGAACAGCTATCAGTAAGTTTCGTGCAGCAATACAGTTGCAATTTGATTTTCATCGAGCAATCTACAATCTTGGTACTGTTCTGTATGGACTAGCTGAGGACACATTAAGGACCGGAGGAACAGGAAATGCTAAGGATGTTTCCCCTAATGACTTGTACAGCCAATCTGCTATTTATATCGCAGCTGCTCATGCTCTAAAACCAAATTACTCTGTTTACAGCAGTGCCCTGCGGTTGGTTCGTTCAATGCTGCCGTTGCCGTATCTAAAAGTTGGATACCTGACTGCACCTCCTGTGGGGAGACCACTTGCTCCTCACAGTGATTGGAAACGTTCACAATTTTTTCTAAATCATGATGTATTGCAAAAGCTTAACATAGGGGGGGAGCAAATACAAACATCACCTAGCTTGAATGGCGAGAGGACAATCAAAGTAGAAATTCCAGATATTGTCTCAGTATCAGCATGCGCAGATCTAACTCTACCACCCGGTGCTGGACTCTGCATTGACACAATCCATGGACCAGTTTTCTTGGTCGCTGACACGTGGGACGCGCTTGATGGATGGCTTGATGCAATTAGATTAGTTTACACAATCTATGCTCGAGGCAAGAACGACGTTTTGGCTGGCATCATAGCAGGCTGATTATTACCAAGTACGCAAATGAGAGGTCAGTACACCAACCACAATGGTGTAAAATAATAATGTTTACAGTTGAAAAAGGAAGCAGCTGGAAAGACTTCCGTGTATTGCTGGTACAACAAAGCTCATGGCGGCTGCCAAACTTGGAAATGGGTAATGGACTTTTGATACGCGCTGACGTGATATCATTGATTTGTACGAGTTTTCTTGATAATTAAATGTCCTAAGTTTAAGATATAGTTGTTTCTCAATATTAGTTATTGTGTGCATATGAGGTATTTTCTTCGTTTTCCCTCAAAATGCTTTATGAGCAGCTTGATGACATTTGTTTAATAAAGTCATTCAAGTTGGTCGGATTGACCTTGTATAATTACAGGGCTTAGTTAAGTAGAGCTGGAACTGGGGTTGCCCAATGCATCATTTCTTTCCACCTTATTAACCTTATTGGCATTAGTAGGTAGGATTGATAGATTTCTCCTCGGTTGTTGATGATAACTGCTTTGATTATAATTGAGTGGCTAGAAAGAATATATAATGCGCTTCATAGTTCAGGACTAAATAGTACATCGGGTTTACTAGGTTCGACCATAGCAGGGTCCAATCGTTCCCAAAAGAAGAGCTTGATCGAGTATCTTTGACGATGCAGAAAGCTTAGAACCTTACTAGATGATTGGCAAATGTTAATGTTATTAGTTAAGATTATTGTTTCCAAATCTGTTATTTTAGAAGAAAAACAAACACATGCGAGAACATACAGAACGTTAATCATCTTTGTGAAGCAGTTATCGGAAGAAGTTTTCATAGGTCGAAATCCGATCGTGGAATAGGAGAGATGTTGGTCCAAAATTCTGTTTGTTTGATAGTTTTTAAGGCTGAAAATGGAGGTGGGGATGAAGTTGAAGATAGAGAAGAAGTTAAGGGAAGACAAAGGGAGGTGTGGTTGCTGGTTGGTTAGATAGGCCATTGGGTTTGCATAGTATCTGTACAGTCAATCAGTAGTAATAGAAGAAAGAGGGAGAAGATGAAGAAGAGAGAGAAAGAGAGTCATTTATGTGGATGTCTCATGTGTTCTATTCTTCTACAGAAACAAGTATTGTTTGACAATGAGTATGTCCTTTGTTTCTATTCCCATTACACTTCACTTTAACTACACGTTGTTTTGCGGAGATTCGAAGTCCGATATTACTCTTATTCATTATCAGCTATGTGAACGTTATGAAAAATTTATCAGTTATGTCAACGTTATGAAAAATTTATAGTGTGAATAGATGTAGTAGATATTATAAGAATATTCCATTTCGTTGGGAATTTAAAGTTCAAACTTATCAC
Coding sequence (CDS)
ATGTCGCCTACTCCCGAGGAGCCTAATTTGCAGAACGGAAACCAAACCCAACCGCACATTCCGTCAGAATCACAGCAAACTGAAGAATCCGGATCGGACCCAGAATCCAGAGTTGCCACAATTCCCCAACAGCAACGCGAATCAGAATCGGTTGATGAAGAAGCAGATGCGGAGCCTCGATCGGAGCCGGAGTCTCGGAGGGAACAGTCGTCGGAGTCCATCCAGTTGCAGGTGGTGACGGATGCCACAGATCCCAGGTCCGGTGATCCCGAGGAAGCCTCGATCCCGTCCAACGGCGCCGACAACTCGCATCCCGCCCTGCGGAAGGACGAAGGAAGCCGGACGTTCACCATGAGGGAGCTGCTGAATGGATTGAAAGGTGATGATGGTAACGACAGCGTTAATGAATCGGAAGGCGAGAGGCCCGAGGCTAACTCCGGTCACAGAGTTTTGCTGTTATCACTGTCCGTCTCAAGAGCTGTATTCTTTTTTTTACTGTATGAATTTGCACAGTTTTCTTATCAAAAAAAGAATTTGCACAGTTTTACTCAAGATAGCCCACATCAGCCTTATTCTGAACAGAGCAGAGCTGCTATGGAGTTGATCAATAGTGTTACAGGTGTTGATGAGGAGGGTCGTTCTCGCCAACGCATTCTCACATTTGCTGCTAGGAGGTATGCTAGTGCAATTGAGAGAAATGCTCAAGACTATGATGCTCTATACAATTGGGCATTAGTCCTCCAGGAGAGTGCAGATAATGTTGGTGCAGATTCCTCTTCACCTTCTAAAGATGCGTTGCTTGAGGAGGCTTGTAAAAAGTATGATGAGGCTACCCGTCTTTGCCCAACACTTCATGATGCTTTTTATAATTGGGCTATTGCAATCTCTGATCGGGCCAAAATTCGTGGTCGTACAAAGGAGGCTGAAGAACTATGGAAGCAGGCTACCAAAAACTATGAAAAAGCTGTTCAACTCAATTGGAACAGTCCCCAGGCGCTAAATAATTGGGGACTCGCTCTACAGGAACTCAGTGCGATTGTGCCAGCACGAGAAAAGCAGACAATTGTAAGAACAGCTATCAGTAAGTTTCGTGCAGCAATACAGTTGCAATTTGATTTTCATCGAGCAATCTACAATCTTGGTACTGTTCTGTATGGACTAGCTGAGGACACATTAAGGACCGGAGGAACAGGAAATGCTAAGGATGTTTCCCCTAATGACTTGTACAGCCAATCTGCTATTTATATCGCAGCTGCTCATGCTCTAAAACCAAATTACTCTGTTTACAGCAGTGCCCTGCGGTTGGTTCGTTCAATGCTGCCGTTGCCGTATCTAAAAGTTGGATACCTGACTGCACCTCCTGTGGGGAGACCACTTGCTCCTCACAGTGATTGGAAACGTTCACAATTTTTTCTAAATCATGATGTATTGCAAAAGCTTAACATAGGGGGGGAGCAAATACAAACATCACCTAGCTTGAATGGCGAGAGGACAATCAAAGTAGAAATTCCAGATATTGTCTCAGTATCAGCATGCGCAGATCTAACTCTACCACCCGGTGCTGGACTCTGCATTGACACAATCCATGGACCAGTTTTCTTGGTCGCTGACACGTGGGACGCGCTTGATGGATGGCTTGATGCAATTAGATTAGTTTACACAATCTATGCTCGAGGCAAGAACGACGTTTTGGCTGGCATCATAGCAGGCTGA
Protein sequence
MSPTPEEPNLQNGNQTQPHIPSESQQTEESGSDPESRVATIPQQQRESESVDEEADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEASIPSNGADNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHRVLLLSLSVSRAVFFFLLYEFAQFSYQKKNLHSFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKIRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSLNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG
Homology
BLAST of MC10g1393 vs. ExPASy Swiss-Prot
Match:
Q9FHY8 (Protein HLB1 OS=Arabidopsis thaliana OX=3702 GN=HLB1 PE=1 SV=1)
HSP 1 Score: 612.1 bits (1577), Expect = 6.5e-174
Identity = 360/610 (59.02%), Postives = 423/610 (69.34%), Query Frame = 0
Query: 1 MSPTPEEPNLQNG------NQTQPHIPSESQQTEE--SGSDPESRVATIPQ--QQRESES 60
M+ T EEP LQNG Q IP QTE +G PE P+ Q +++
Sbjct: 1 MADTVEEPQLQNGAAPAESETEQNPIPEPQLQTEPKLTGEIPEIEADLTPEEVQSEVTDA 60
Query: 61 VDEEADAEPRSE-----------PESRREQSSESIQLQVVTDA------TDPRSGDPEEA 120
EE +E + E E++ E E +Q VVTD D G EE
Sbjct: 61 KPEEVQSEVKPEEVKTVVTDAKPEEAQSEVKPEEVQ-SVVTDTKPDLTDVDLSPGGSEEI 120
Query: 121 SIPSNGA--DNSHPALRK-DEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHRV 180
I S +++ L+K D+G++TFTMRELL+ LK ++G+ + + S
Sbjct: 121 PIRSTEVEQESTTSVLKKDDDGNKTFTMRELLSELKSEEGDGTPHSSAS----------- 180
Query: 181 LLLSLSVSRAVFFFLLYEFAQFSYQKKNLHSFTQDSPHQPYSEQSRAAMELINSVTGVDE 240
F+++S QP ++ AM+LIN + DE
Sbjct: 181 ------------------------------PFSRESASQP--AENNPAMDLINRIQVNDE 240
Query: 241 EGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEA 300
EGRSRQR+L FAAR+YASAIERN D+DALYNWAL+LQESADNV DS SPSKD LLEEA
Sbjct: 241 EGRSRQRVLAFAARKYASAIERNPDDHDALYNWALILQESADNVSPDSVSPSKDDLLEEA 300
Query: 301 CKKYDEATRLCPTLHDAFYNWAIAISDRAKIRGRTKEAEELWKQATKNYEKAVQLNWNSP 360
CKKYDEATRLCPTL+DA+YNWAIAISDRAKIRGRTKEAEELW+QA NYEKAVQLNWNS
Sbjct: 301 CKKYDEATRLCPTLYDAYYNWAIAISDRAKIRGRTKEAEELWEQAADNYEKAVQLNWNSS 360
Query: 361 QALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAED 420
QALNNWGL LQELS IVPAREK+ +VRTAISKFRAAI+LQFDFHRAIYNLGTVLYGLAED
Sbjct: 361 QALNNWGLVLQELSQIVPAREKEKVVRTAISKFRAAIRLQFDFHRAIYNLGTVLYGLAED 420
Query: 421 TLRTGGTGNAKDVSPNDLYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYL 480
TLRTGG+GN KD+ P +LYSQSAIYIAAAH+LKP+YSVYSSALRLVRSMLPLP+LKVGYL
Sbjct: 421 TLRTGGSGNGKDMPPGELYSQSAIYIAAAHSLKPSYSVYSSALRLVRSMLPLPHLKVGYL 480
Query: 481 TAPPVGRPLAPHSDWKRSQFFLNHD-VLQKL---------NIGGEQIQTSPSLNGERTIK 540
TAPPVG LAPHSDWKR++F LNH+ +LQ L N+ G+ S ++ +T+K
Sbjct: 481 TAPPVGNSLAPHSDWKRTEFELNHERLLQVLKPEPREMGRNLSGKAETMSTNVE-RKTVK 540
Query: 541 VEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGK 571
V I +IVSV+ CADLTLPPGAGLCIDTIHGPVFLVAD+W++LDGWLDAIRLVYTIYARGK
Sbjct: 541 VNITEIVSVTPCADLTLPPGAGLCIDTIHGPVFLVADSWESLDGWLDAIRLVYTIYARGK 565
BLAST of MC10g1393 vs. NCBI nr
Match:
XP_022145328.1 (protein HLB1 [Momordica charantia] >XP_022145329.1 protein HLB1 [Momordica charantia] >XP_022145330.1 protein HLB1 [Momordica charantia])
HSP 1 Score: 1034 bits (2673), Expect = 0.0
Identity = 538/570 (94.39%), Postives = 538/570 (94.39%), Query Frame = 0
Query: 1 MSPTPEEPNLQNGNQTQPHIPSESQQTEESGSDPESRVATIPQQQRESESVDEEADAEPR 60
MSPTPEEPNLQNGNQTQPHIPSESQQTEESGSDPESRVATIPQQQRESESVDEEADAEPR
Sbjct: 1 MSPTPEEPNLQNGNQTQPHIPSESQQTEESGSDPESRVATIPQQQRESESVDEEADAEPR 60
Query: 61 SEPESRREQSSESIQLQVVTDATDPRSGDPEEASIPSNGADNSHPALRKDEGSRTFTMRE 120
SEPESRREQSSESIQLQVVTDATDPRSGDPEEASIPSNGADNSHPALRKDEGSRTFTMRE
Sbjct: 61 SEPESRREQSSESIQLQVVTDATDPRSGDPEEASIPSNGADNSHPALRKDEGSRTFTMRE 120
Query: 121 LLNGLKGDDGNDSVNESEGERPEANSGHRVLLLSLSVSRAVFFFLLYEFAQFSYQKKNLH 180
LLNGLKGDDGNDSVNESEGERPEANSGH
Sbjct: 121 LLNGLKGDDGNDSVNESEGERPEANSGH-------------------------------- 180
Query: 181 SFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDAL 240
SFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDAL
Sbjct: 181 SFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDAL 240
Query: 241 YNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAK 300
YNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAK
Sbjct: 241 YNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAK 300
Query: 301 IRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAI 360
IRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAI
Sbjct: 301 IRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAI 360
Query: 361 SKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAIYIAAAH 420
SKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAIYIAAAH
Sbjct: 361 SKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAIYIAAAH 420
Query: 421 ALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKL 480
ALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKL
Sbjct: 421 ALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKL 480
Query: 481 NIGGEQIQTSPSLNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWD 540
NIGGEQIQTSPSLNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWD
Sbjct: 481 NIGGEQIQTSPSLNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWD 538
Query: 541 ALDGWLDAIRLVYTIYARGKNDVLAGIIAG 570
ALDGWLDAIRLVYTIYARGKNDVLAGIIAG
Sbjct: 541 ALDGWLDAIRLVYTIYARGKNDVLAGIIAG 538
BLAST of MC10g1393 vs. NCBI nr
Match:
XP_022965252.1 (protein HLB1-like isoform X2 [Cucurbita maxima])
HSP 1 Score: 888 bits (2294), Expect = 0.0
Identity = 472/583 (80.96%), Postives = 501/583 (85.93%), Query Frame = 0
Query: 1 MSPTPEEPN-LQNGNQTQPHIPSESQQTEESGSDPESRVATIP----QQQRESESVDEEA 60
MSP PEEPN LQNG + +PHI ES Q ES S+PES IP QQ+RESESV+ A
Sbjct: 1 MSPIPEEPNNLQNGIEIEPHISVESNQIGESKSEPESTADVIPTAELQQERESESVNGVA 60
Query: 61 DAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEASIPSNGADNSHPALRKDEGSRT 120
D+EP+SE +S R+Q SESI+LQVVTD TDPR +P+ SI SNGA+NS PALRKDEGSRT
Sbjct: 61 DSEPQSELDSPRKQLSESIELQVVTDVTDPRFEEPKGTSISSNGAENSQPALRKDEGSRT 120
Query: 121 FTMRELLNGLKGDDGNDSVNESEGERPEANSGHRVLLLSLSVSRAVFFFLLYEFAQFSYQ 180
FTMRELLNGLK +DGNDS+NESEGE+PEANSG+
Sbjct: 121 FTMRELLNGLKVEDGNDSLNESEGEKPEANSGY--------------------------- 180
Query: 181 KKNLHSFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQ 240
S QDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERN Q
Sbjct: 181 -----SLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQ 240
Query: 241 DYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAI 300
DYDALYNWALVLQESADNV DS+SPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAI
Sbjct: 241 DYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAI 300
Query: 301 SDRAKIRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTI 360
SDRAK+RGRTKEAEELWKQAT+NYEKAVQLNWNSPQALNNWGLALQELSAIVPAREK TI
Sbjct: 301 SDRAKMRGRTKEAEELWKQATRNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKPTI 360
Query: 361 VRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAIY 420
V+TAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTG KDVSPN+LYSQSAIY
Sbjct: 361 VKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGTVKDVSPNELYSQSAIY 420
Query: 421 IAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHD 480
IAAAHALKP+YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRP APH DWKRSQFFLNHD
Sbjct: 421 IAAAHALKPSYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPFAPHGDWKRSQFFLNHD 480
Query: 481 VLQKLNIGGEQIQTSPSL--------NGERTIKVEIPDIVSVSACADLTLPPGAGLCIDT 540
VLQKLNIGGEQIQTSP+L NG+RT+KVEIPDIVSVSACADLTLPPGAGLCIDT
Sbjct: 481 VLQKLNIGGEQIQTSPTLLGRSGSTLNGDRTMKVEIPDIVSVSACADLTLPPGAGLCIDT 540
Query: 541 IHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG 570
IHG +FLVAD+WDALDGWLDAIRLVYTIYARGKN+VLAGIIAG
Sbjct: 541 IHGQIFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIIAG 551
BLAST of MC10g1393 vs. NCBI nr
Match:
XP_022965251.1 (protein HLB1-like isoform X1 [Cucurbita maxima])
HSP 1 Score: 877 bits (2265), Expect = 0.0
Identity = 471/611 (77.09%), Postives = 501/611 (82.00%), Query Frame = 0
Query: 1 MSPTPEEPN-LQNGNQTQPHIPSESQQTEESGSDPESRVATIP----------------- 60
MSP PEEPN LQNG + +PHI ES Q ES S+PES +P
Sbjct: 1 MSPIPEEPNNLQNGIEIEPHISVESNQIGESKSEPESTADVVPTAELQQERESESVNGVA 60
Query: 61 ---------------QQQRESESVDEEADAEPRSEPESRREQSSESIQLQVVTDATDPRS 120
QQ+RESESV+ AD+EP+SE +S R+Q SESI+LQVVTD TDPR
Sbjct: 61 DLEPQLEMVIPTAELQQERESESVNGVADSEPQSELDSPRKQLSESIELQVVTDVTDPRF 120
Query: 121 GDPEEASIPSNGADNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSG 180
+P+ SI SNGA+NS PALRKDEGSRTFTMRELLNGLK +DGNDS+NESEGE+PEANSG
Sbjct: 121 EEPKGTSISSNGAENSQPALRKDEGSRTFTMRELLNGLKVEDGNDSLNESEGEKPEANSG 180
Query: 181 HRVLLLSLSVSRAVFFFLLYEFAQFSYQKKNLHSFTQDSPHQPYSEQSRAAMELINSVTG 240
+ S QDSPHQPYSEQSRAAMELINSVTG
Sbjct: 181 Y--------------------------------SLNQDSPHQPYSEQSRAAMELINSVTG 240
Query: 241 VDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALL 300
VDEEGRSRQRILTFAARRYASAIERN QDYDALYNWALVLQESADNV DS+SPSKDALL
Sbjct: 241 VDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALL 300
Query: 301 EEACKKYDEATRLCPTLHDAFYNWAIAISDRAKIRGRTKEAEELWKQATKNYEKAVQLNW 360
EEACKKYDEATRLCPTLHDAFYNWAIAISDRAK+RGRTKEAEELWKQAT+NYEKAVQLNW
Sbjct: 301 EEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATRNYEKAVQLNW 360
Query: 361 NSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGL 420
NSPQALNNWGLALQELSAIVPAREK TIV+TAISKFRAAIQLQFDFHRAIYNLGTVLYGL
Sbjct: 361 NSPQALNNWGLALQELSAIVPAREKPTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGL 420
Query: 421 AEDTLRTGGTGNAKDVSPNDLYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKV 480
AEDTLRTGGTG KDVSPN+LYSQSAIYIAAAHALKP+YSVYSSALRLVRSMLPLPYLKV
Sbjct: 421 AEDTLRTGGTGTVKDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRLVRSMLPLPYLKV 480
Query: 481 GYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSL--------NGERTI 540
GYLTAPPVGRP APH DWKRSQFFLNHDVLQKLNIGGEQIQTSP+L NG+RT+
Sbjct: 481 GYLTAPPVGRPFAPHGDWKRSQFFLNHDVLQKLNIGGEQIQTSPTLLGRSGSTLNGDRTM 540
Query: 541 KVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARG 570
KVEIPDIVSVSACADLTLPPGAGLCIDTIHG +FLVAD+WDALDGWLDAIRLVYTIYARG
Sbjct: 541 KVEIPDIVSVSACADLTLPPGAGLCIDTIHGQIFLVADSWDALDGWLDAIRLVYTIYARG 579
BLAST of MC10g1393 vs. NCBI nr
Match:
XP_038876586.1 (protein HLB1 [Benincasa hispida])
HSP 1 Score: 875 bits (2262), Expect = 0.0
Identity = 472/584 (80.82%), Postives = 492/584 (84.25%), Query Frame = 0
Query: 1 MSPTPEEPN-LQNGNQTQPHIPSESQQTEESGSDPESRVATIPQ----QQRESESVDEEA 60
MSPTPEEPN LQNG + QPHI ES QT E S+PE I Q+RESESV+
Sbjct: 1 MSPTPEEPNNLQNGIEIQPHISPESDQTSEPRSEPEPTADAILSSELHQERESESVN--- 60
Query: 61 DAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEASIPSNG-ADNSHPALRKDEGSR 120
+ SEP SRR+Q ESI LQV TD DPR + +E SIPSNG +NS PALRKDEGSR
Sbjct: 61 NGVADSEPVSRRKQLPESIHLQVETDVADPRFEEHKETSIPSNGNTENSKPALRKDEGSR 120
Query: 121 TFTMRELLNGLKGDDGNDSVNESEGERPEANSGHRVLLLSLSVSRAVFFFLLYEFAQFSY 180
TFTMRELLNGLKG+DGNDS+NESEGERPE N G+
Sbjct: 121 TFTMRELLNGLKGEDGNDSLNESEGERPEGNPGY-------------------------- 180
Query: 181 QKKNLHSFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNA 240
S QDSPHQPYSEQSRAAMELI+SVTGVDEEGRSRQRILTFAARRYASAIERN
Sbjct: 181 ------SLNQDSPHQPYSEQSRAAMELISSVTGVDEEGRSRQRILTFAARRYASAIERNG 240
Query: 241 QDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIA 300
QDYDALYNWALVLQESADNV DS+SPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIA
Sbjct: 241 QDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIA 300
Query: 301 ISDRAKIRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQT 360
ISDRAK+RGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQT
Sbjct: 301 ISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQT 360
Query: 361 IVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAI 420
IV+TAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGN KDVSPN+LYSQSAI
Sbjct: 361 IVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDVSPNELYSQSAI 420
Query: 421 YIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNH 480
YIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPH DWKRSQFFLNH
Sbjct: 421 YIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHGDWKRSQFFLNH 480
Query: 481 DVLQKLNIGGEQIQTSPS--------LNGERTIKVEIPDIVSVSACADLTLPPGAGLCID 540
DVLQKLNIGGEQIQTSPS LNG+ TIKVEIPDIVSVSACADLTLPPGAGLCID
Sbjct: 481 DVLQKLNIGGEQIQTSPSILGRSGSTLNGDWTIKVEIPDIVSVSACADLTLPPGAGLCID 540
Query: 541 TIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG 570
TIHGPVFLVAD+WDALDGWLDAIRLVYTIYARGKN+VLAGII G
Sbjct: 541 TIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIITG 549
BLAST of MC10g1393 vs. NCBI nr
Match:
XP_004146133.1 (protein HLB1 isoform X1 [Cucumis sativus] >KGN55671.1 hypothetical protein Csa_010331 [Cucumis sativus])
HSP 1 Score: 875 bits (2262), Expect = 0.0
Identity = 472/585 (80.68%), Postives = 495/585 (84.62%), Query Frame = 0
Query: 1 MSPTPEEPN-LQNGNQTQPHIPSESQQTEESGSDPES-RVATIP----QQQRESESVDEE 60
MSPTPEEPN LQNG + QPHI SES Q E S PE V +IP Q++RESESV
Sbjct: 1 MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESV--- 60
Query: 61 ADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEASIPSNG-ADNSHPALRKDEGS 120
++ P SEPES R+Q SESI L VVT TDP + +E S PSNG +N PALRKDEGS
Sbjct: 61 SNGVPDSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPSNGNTENLQPALRKDEGS 120
Query: 121 RTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHRVLLLSLSVSRAVFFFLLYEFAQFS 180
RTFTMRELLNGLKG+DG+DS+NESEGERPE NSG+
Sbjct: 121 RTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGY------------------------- 180
Query: 181 YQKKNLHSFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERN 240
S QDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERN
Sbjct: 181 -------SLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERN 240
Query: 241 AQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAI 300
QDYDALYNWALVLQESADNV DS+SPSKDALLEEACKKYDEAT LCPTLHDAFYNWAI
Sbjct: 241 GQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAI 300
Query: 301 AISDRAKIRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQ 360
AISDRAK+RGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQ
Sbjct: 301 AISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQ 360
Query: 361 TIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSA 420
TIV+TAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG+GN KDVSPN+LYSQSA
Sbjct: 361 TIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSA 420
Query: 421 IYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLN 480
IYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLN
Sbjct: 421 IYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLN 480
Query: 481 HDVLQKLNIGGEQIQTSPS--------LNGERTIKVEIPDIVSVSACADLTLPPGAGLCI 540
HDVLQKLNIGGEQIQTSPS LNG+RTIKVEIPDIVSVSACADLTLPPGAGLCI
Sbjct: 481 HDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVEIPDIVSVSACADLTLPPGAGLCI 540
Query: 541 DTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG 570
DTIHGP+FLVAD+WD LDGWLDAIRLVYTIYARGKN+VLAGII G
Sbjct: 541 DTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGKNEVLAGIITG 550
BLAST of MC10g1393 vs. ExPASy TrEMBL
Match:
A0A6J1CUX3 (protein HLB1 OS=Momordica charantia OX=3673 GN=LOC111014811 PE=4 SV=1)
HSP 1 Score: 1034 bits (2673), Expect = 0.0
Identity = 538/570 (94.39%), Postives = 538/570 (94.39%), Query Frame = 0
Query: 1 MSPTPEEPNLQNGNQTQPHIPSESQQTEESGSDPESRVATIPQQQRESESVDEEADAEPR 60
MSPTPEEPNLQNGNQTQPHIPSESQQTEESGSDPESRVATIPQQQRESESVDEEADAEPR
Sbjct: 1 MSPTPEEPNLQNGNQTQPHIPSESQQTEESGSDPESRVATIPQQQRESESVDEEADAEPR 60
Query: 61 SEPESRREQSSESIQLQVVTDATDPRSGDPEEASIPSNGADNSHPALRKDEGSRTFTMRE 120
SEPESRREQSSESIQLQVVTDATDPRSGDPEEASIPSNGADNSHPALRKDEGSRTFTMRE
Sbjct: 61 SEPESRREQSSESIQLQVVTDATDPRSGDPEEASIPSNGADNSHPALRKDEGSRTFTMRE 120
Query: 121 LLNGLKGDDGNDSVNESEGERPEANSGHRVLLLSLSVSRAVFFFLLYEFAQFSYQKKNLH 180
LLNGLKGDDGNDSVNESEGERPEANSGH
Sbjct: 121 LLNGLKGDDGNDSVNESEGERPEANSGH-------------------------------- 180
Query: 181 SFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDAL 240
SFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDAL
Sbjct: 181 SFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDAL 240
Query: 241 YNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAK 300
YNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAK
Sbjct: 241 YNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAK 300
Query: 301 IRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAI 360
IRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAI
Sbjct: 301 IRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAI 360
Query: 361 SKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAIYIAAAH 420
SKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAIYIAAAH
Sbjct: 361 SKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAIYIAAAH 420
Query: 421 ALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKL 480
ALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKL
Sbjct: 421 ALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKL 480
Query: 481 NIGGEQIQTSPSLNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWD 540
NIGGEQIQTSPSLNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWD
Sbjct: 481 NIGGEQIQTSPSLNGERTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWD 538
Query: 541 ALDGWLDAIRLVYTIYARGKNDVLAGIIAG 570
ALDGWLDAIRLVYTIYARGKNDVLAGIIAG
Sbjct: 541 ALDGWLDAIRLVYTIYARGKNDVLAGIIAG 538
BLAST of MC10g1393 vs. ExPASy TrEMBL
Match:
A0A6J1HJU5 (protein HLB1-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111465172 PE=4 SV=1)
HSP 1 Score: 888 bits (2294), Expect = 0.0
Identity = 472/583 (80.96%), Postives = 501/583 (85.93%), Query Frame = 0
Query: 1 MSPTPEEPN-LQNGNQTQPHIPSESQQTEESGSDPESRVATIP----QQQRESESVDEEA 60
MSP PEEPN LQNG + +PHI ES Q ES S+PES IP QQ+RESESV+ A
Sbjct: 1 MSPIPEEPNNLQNGIEIEPHISVESNQIGESKSEPESTADVIPTAELQQERESESVNGVA 60
Query: 61 DAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEASIPSNGADNSHPALRKDEGSRT 120
D+EP+SE +S R+Q SESI+LQVVTD TDPR +P+ SI SNGA+NS PALRKDEGSRT
Sbjct: 61 DSEPQSELDSPRKQLSESIELQVVTDVTDPRFEEPKGTSISSNGAENSQPALRKDEGSRT 120
Query: 121 FTMRELLNGLKGDDGNDSVNESEGERPEANSGHRVLLLSLSVSRAVFFFLLYEFAQFSYQ 180
FTMRELLNGLK +DGNDS+NESEGE+PEANSG+
Sbjct: 121 FTMRELLNGLKVEDGNDSLNESEGEKPEANSGY--------------------------- 180
Query: 181 KKNLHSFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQ 240
S QDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERN Q
Sbjct: 181 -----SLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQ 240
Query: 241 DYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAI 300
DYDALYNWALVLQESADNV DS+SPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAI
Sbjct: 241 DYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAI 300
Query: 301 SDRAKIRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTI 360
SDRAK+RGRTKEAEELWKQAT+NYEKAVQLNWNSPQALNNWGLALQELSAIVPAREK TI
Sbjct: 301 SDRAKMRGRTKEAEELWKQATRNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKPTI 360
Query: 361 VRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSAIY 420
V+TAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTG KDVSPN+LYSQSAIY
Sbjct: 361 VKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGTVKDVSPNELYSQSAIY 420
Query: 421 IAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHD 480
IAAAHALKP+YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRP APH DWKRSQFFLNHD
Sbjct: 421 IAAAHALKPSYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPFAPHGDWKRSQFFLNHD 480
Query: 481 VLQKLNIGGEQIQTSPSL--------NGERTIKVEIPDIVSVSACADLTLPPGAGLCIDT 540
VLQKLNIGGEQIQTSP+L NG+RT+KVEIPDIVSVSACADLTLPPGAGLCIDT
Sbjct: 481 VLQKLNIGGEQIQTSPTLLGRSGSTLNGDRTMKVEIPDIVSVSACADLTLPPGAGLCIDT 540
Query: 541 IHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG 570
IHG +FLVAD+WDALDGWLDAIRLVYTIYARGKN+VLAGIIAG
Sbjct: 541 IHGQIFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIIAG 551
BLAST of MC10g1393 vs. ExPASy TrEMBL
Match:
A0A6J1HL68 (protein HLB1-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111465172 PE=4 SV=1)
HSP 1 Score: 877 bits (2265), Expect = 0.0
Identity = 471/611 (77.09%), Postives = 501/611 (82.00%), Query Frame = 0
Query: 1 MSPTPEEPN-LQNGNQTQPHIPSESQQTEESGSDPESRVATIP----------------- 60
MSP PEEPN LQNG + +PHI ES Q ES S+PES +P
Sbjct: 1 MSPIPEEPNNLQNGIEIEPHISVESNQIGESKSEPESTADVVPTAELQQERESESVNGVA 60
Query: 61 ---------------QQQRESESVDEEADAEPRSEPESRREQSSESIQLQVVTDATDPRS 120
QQ+RESESV+ AD+EP+SE +S R+Q SESI+LQVVTD TDPR
Sbjct: 61 DLEPQLEMVIPTAELQQERESESVNGVADSEPQSELDSPRKQLSESIELQVVTDVTDPRF 120
Query: 121 GDPEEASIPSNGADNSHPALRKDEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSG 180
+P+ SI SNGA+NS PALRKDEGSRTFTMRELLNGLK +DGNDS+NESEGE+PEANSG
Sbjct: 121 EEPKGTSISSNGAENSQPALRKDEGSRTFTMRELLNGLKVEDGNDSLNESEGEKPEANSG 180
Query: 181 HRVLLLSLSVSRAVFFFLLYEFAQFSYQKKNLHSFTQDSPHQPYSEQSRAAMELINSVTG 240
+ S QDSPHQPYSEQSRAAMELINSVTG
Sbjct: 181 Y--------------------------------SLNQDSPHQPYSEQSRAAMELINSVTG 240
Query: 241 VDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALL 300
VDEEGRSRQRILTFAARRYASAIERN QDYDALYNWALVLQESADNV DS+SPSKDALL
Sbjct: 241 VDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALL 300
Query: 301 EEACKKYDEATRLCPTLHDAFYNWAIAISDRAKIRGRTKEAEELWKQATKNYEKAVQLNW 360
EEACKKYDEATRLCPTLHDAFYNWAIAISDRAK+RGRTKEAEELWKQAT+NYEKAVQLNW
Sbjct: 301 EEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATRNYEKAVQLNW 360
Query: 361 NSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGL 420
NSPQALNNWGLALQELSAIVPAREK TIV+TAISKFRAAIQLQFDFHRAIYNLGTVLYGL
Sbjct: 361 NSPQALNNWGLALQELSAIVPAREKPTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGL 420
Query: 421 AEDTLRTGGTGNAKDVSPNDLYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKV 480
AEDTLRTGGTG KDVSPN+LYSQSAIYIAAAHALKP+YSVYSSALRLVRSMLPLPYLKV
Sbjct: 421 AEDTLRTGGTGTVKDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRLVRSMLPLPYLKV 480
Query: 481 GYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSL--------NGERTI 540
GYLTAPPVGRP APH DWKRSQFFLNHDVLQKLNIGGEQIQTSP+L NG+RT+
Sbjct: 481 GYLTAPPVGRPFAPHGDWKRSQFFLNHDVLQKLNIGGEQIQTSPTLLGRSGSTLNGDRTM 540
Query: 541 KVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARG 570
KVEIPDIVSVSACADLTLPPGAGLCIDTIHG +FLVAD+WDALDGWLDAIRLVYTIYARG
Sbjct: 541 KVEIPDIVSVSACADLTLPPGAGLCIDTIHGQIFLVADSWDALDGWLDAIRLVYTIYARG 579
BLAST of MC10g1393 vs. ExPASy TrEMBL
Match:
A0A0A0L688 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G002900 PE=4 SV=1)
HSP 1 Score: 875 bits (2262), Expect = 0.0
Identity = 472/585 (80.68%), Postives = 495/585 (84.62%), Query Frame = 0
Query: 1 MSPTPEEPN-LQNGNQTQPHIPSESQQTEESGSDPES-RVATIP----QQQRESESVDEE 60
MSPTPEEPN LQNG + QPHI SES Q E S PE V +IP Q++RESESV
Sbjct: 1 MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESV--- 60
Query: 61 ADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEASIPSNG-ADNSHPALRKDEGS 120
++ P SEPES R+Q SESI L VVT TDP + +E S PSNG +N PALRKDEGS
Sbjct: 61 SNGVPDSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPSNGNTENLQPALRKDEGS 120
Query: 121 RTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHRVLLLSLSVSRAVFFFLLYEFAQFS 180
RTFTMRELLNGLKG+DG+DS+NESEGERPE NSG+
Sbjct: 121 RTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGY------------------------- 180
Query: 181 YQKKNLHSFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERN 240
S QDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERN
Sbjct: 181 -------SLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERN 240
Query: 241 AQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAI 300
QDYDALYNWALVLQESADNV DS+SPSKDALLEEACKKYDEAT LCPTLHDAFYNWAI
Sbjct: 241 GQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAI 300
Query: 301 AISDRAKIRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQ 360
AISDRAK+RGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQ
Sbjct: 301 AISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQ 360
Query: 361 TIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSA 420
TIV+TAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG+GN KDVSPN+LYSQSA
Sbjct: 361 TIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSA 420
Query: 421 IYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLN 480
IYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLN
Sbjct: 421 IYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLN 480
Query: 481 HDVLQKLNIGGEQIQTSPS--------LNGERTIKVEIPDIVSVSACADLTLPPGAGLCI 540
HDVLQKLNIGGEQIQTSPS LNG+RTIKVEIPDIVSVSACADLTLPPGAGLCI
Sbjct: 481 HDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVEIPDIVSVSACADLTLPPGAGLCI 540
Query: 541 DTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG 570
DTIHGP+FLVAD+WD LDGWLDAIRLVYTIYARGKN+VLAGII G
Sbjct: 541 DTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGKNEVLAGIITG 550
BLAST of MC10g1393 vs. ExPASy TrEMBL
Match:
A0A1S3BJC9 (uncharacterized protein LOC103490705 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103490705 PE=4 SV=1)
HSP 1 Score: 873 bits (2255), Expect = 6.76e-316
Identity = 471/585 (80.51%), Postives = 493/585 (84.27%), Query Frame = 0
Query: 1 MSPTPEEPN-LQNGNQTQPHIPSESQQTEESGSDPESRVA-TIP----QQQRESESVDEE 60
MSPTPEEPN LQNG + QPHI SES Q E S+ E A +IP QQ+RESESV
Sbjct: 1 MSPTPEEPNNLQNGIEIQPHISSESDQISEPRSELEEPTADSIPSSELQQERESESV--- 60
Query: 61 ADAEPRSEPESRREQSSESIQLQVVTDATDPRSGDPEEASIPSNG-ADNSHPALRKDEGS 120
++ SEPES R+Q SESI L VVT TDP + +E S P NG +N PALRKDEGS
Sbjct: 61 SNGVADSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPFNGNTENLQPALRKDEGS 120
Query: 121 RTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHRVLLLSLSVSRAVFFFLLYEFAQFS 180
RTFTMRELLNGLKG+DG+D +NESEGERPE NSGH
Sbjct: 121 RTFTMRELLNGLKGEDGSDGLNESEGERPEGNSGH------------------------- 180
Query: 181 YQKKNLHSFTQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERN 240
S QDSPHQPYSEQSRAAMELINS+TGVDEEGRSRQRILTFAARRYASAIERN
Sbjct: 181 -------SLNQDSPHQPYSEQSRAAMELINSITGVDEEGRSRQRILTFAARRYASAIERN 240
Query: 241 AQDYDALYNWALVLQESADNVGADSSSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAI 300
QDYDALYNWALVLQESADNV DS+SPSKDALLEEACKKYDEAT LCPTLHDAFYNWAI
Sbjct: 241 GQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAI 300
Query: 301 AISDRAKIRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQ 360
AISDRAK+RGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQ
Sbjct: 301 AISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQ 360
Query: 361 TIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNAKDVSPNDLYSQSA 420
TIV+TAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGN KDVSPN+LYSQSA
Sbjct: 361 TIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNIKDVSPNELYSQSA 420
Query: 421 IYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLN 480
IYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLN
Sbjct: 421 IYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLN 480
Query: 481 HDVLQKLNIGGEQIQTSPS--------LNGERTIKVEIPDIVSVSACADLTLPPGAGLCI 540
HDVLQKLNIGGEQIQTSPS LNG+RTIKVEIPDIVSVSACADLTLPPGAGLCI
Sbjct: 481 HDVLQKLNIGGEQIQTSPSTLGRSGSTLNGDRTIKVEIPDIVSVSACADLTLPPGAGLCI 540
Query: 541 DTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGKNDVLAGIIAG 570
DTIHGP+FLVAD+WDALDGWLDAIRLVYTIYARGKN+VLAGII G
Sbjct: 541 DTIHGPIFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIITG 550
BLAST of MC10g1393 vs. TAIR 10
Match:
AT5G41950.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 612.1 bits (1577), Expect = 4.6e-175
Identity = 360/610 (59.02%), Postives = 423/610 (69.34%), Query Frame = 0
Query: 1 MSPTPEEPNLQNG------NQTQPHIPSESQQTEE--SGSDPESRVATIPQ--QQRESES 60
M+ T EEP LQNG Q IP QTE +G PE P+ Q +++
Sbjct: 1 MADTVEEPQLQNGAAPAESETEQNPIPEPQLQTEPKLTGEIPEIEADLTPEEVQSEVTDA 60
Query: 61 VDEEADAEPRSE-----------PESRREQSSESIQLQVVTDA------TDPRSGDPEEA 120
EE +E + E E++ E E +Q VVTD D G EE
Sbjct: 61 KPEEVQSEVKPEEVKTVVTDAKPEEAQSEVKPEEVQ-SVVTDTKPDLTDVDLSPGGSEEI 120
Query: 121 SIPSNGA--DNSHPALRK-DEGSRTFTMRELLNGLKGDDGNDSVNESEGERPEANSGHRV 180
I S +++ L+K D+G++TFTMRELL+ LK ++G+ + + S
Sbjct: 121 PIRSTEVEQESTTSVLKKDDDGNKTFTMRELLSELKSEEGDGTPHSSAS----------- 180
Query: 181 LLLSLSVSRAVFFFLLYEFAQFSYQKKNLHSFTQDSPHQPYSEQSRAAMELINSVTGVDE 240
F+++S QP ++ AM+LIN + DE
Sbjct: 181 ------------------------------PFSRESASQP--AENNPAMDLINRIQVNDE 240
Query: 241 EGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVGADSSSPSKDALLEEA 300
EGRSRQR+L FAAR+YASAIERN D+DALYNWAL+LQESADNV DS SPSKD LLEEA
Sbjct: 241 EGRSRQRVLAFAARKYASAIERNPDDHDALYNWALILQESADNVSPDSVSPSKDDLLEEA 300
Query: 301 CKKYDEATRLCPTLHDAFYNWAIAISDRAKIRGRTKEAEELWKQATKNYEKAVQLNWNSP 360
CKKYDEATRLCPTL+DA+YNWAIAISDRAKIRGRTKEAEELW+QA NYEKAVQLNWNS
Sbjct: 301 CKKYDEATRLCPTLYDAYYNWAIAISDRAKIRGRTKEAEELWEQAADNYEKAVQLNWNSS 360
Query: 361 QALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAED 420
QALNNWGL LQELS IVPAREK+ +VRTAISKFRAAI+LQFDFHRAIYNLGTVLYGLAED
Sbjct: 361 QALNNWGLVLQELSQIVPAREKEKVVRTAISKFRAAIRLQFDFHRAIYNLGTVLYGLAED 420
Query: 421 TLRTGGTGNAKDVSPNDLYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYL 480
TLRTGG+GN KD+ P +LYSQSAIYIAAAH+LKP+YSVYSSALRLVRSMLPLP+LKVGYL
Sbjct: 421 TLRTGGSGNGKDMPPGELYSQSAIYIAAAHSLKPSYSVYSSALRLVRSMLPLPHLKVGYL 480
Query: 481 TAPPVGRPLAPHSDWKRSQFFLNHD-VLQKL---------NIGGEQIQTSPSLNGERTIK 540
TAPPVG LAPHSDWKR++F LNH+ +LQ L N+ G+ S ++ +T+K
Sbjct: 481 TAPPVGNSLAPHSDWKRTEFELNHERLLQVLKPEPREMGRNLSGKAETMSTNVE-RKTVK 540
Query: 541 VEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADTWDALDGWLDAIRLVYTIYARGK 571
V I +IVSV+ CADLTLPPGAGLCIDTIHGPVFLVAD+W++LDGWLDAIRLVYTIYARGK
Sbjct: 541 VNITEIVSVTPCADLTLPPGAGLCIDTIHGPVFLVADSWESLDGWLDAIRLVYTIYARGK 565
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q9FHY8 | 6.5e-174 | 59.02 | Protein HLB1 OS=Arabidopsis thaliana OX=3702 GN=HLB1 PE=1 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1CUX3 | 0.0 | 94.39 | protein HLB1 OS=Momordica charantia OX=3673 GN=LOC111014811 PE=4 SV=1 | [more] |
A0A6J1HJU5 | 0.0 | 80.96 | protein HLB1-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111465172 PE=4 SV... | [more] |
A0A6J1HL68 | 0.0 | 77.09 | protein HLB1-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111465172 PE=4 SV... | [more] |
A0A0A0L688 | 0.0 | 80.68 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G002900 PE=4 SV=1 | [more] |
A0A1S3BJC9 | 6.76e-316 | 80.51 | uncharacterized protein LOC103490705 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
Match Name | E-value | Identity | Description | |
AT5G41950.1 | 4.6e-175 | 59.02 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |