Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTCCTCTCGATTTCCGTCGTGGTGATCGCCGATGGAGTCGGGTTCTTTGTTGCAGCGGTACCGTCGTGACCGGAAAAAGCTGCTGGCATTCTTGTTGTCCTCGCGTTTTGTCAGAGAGCTGAGGACTCCGGCAGGTCCTGTCACTGATTTCTCCGCCGTCGACCTTGACTCTCTGAGCGCTAGCTATGTTTTGGAGTGCATTAAGTCTGGTAATCTTGCTTCTTTGTGTGTGAGAGATGAGTGTGAGTTGTTGTAGTTTGAATGAAAGATTTGATATCGAATTTGCGGTTTAATCTCGAGTTTTTTTTATTGCCAATGCTCGAGAATGAAGCTGGAAGTCTTTTTCATTTTCTTTAGCTTCTTCTCGGTTCTATTAGTTTTTTTTTTCTTCTGTAAAATGGATTTGGTTTCCTTGTTTGGGATCGGAAAGTTTGCGTCAACGGTTCTGCGGCGGAGTGCTAGCTGCTTTCCATGTTTTTATAGCGCGTTCTCTGTTTTTTAGTTGACCGAGTCACGTAACTGTGAACACTTGCAAGGAATTTAATTTCATTGTTATTTTCTTGCTTAGGTGGAGTTATTGACATTTCTACAGCTGCTAAGAAGAAGCAACATGAATCTTCTTACCCGATAATGGTAAGAATATGTTAGTGAGAAGTTATTTGGTTTTGCCTTTTTCTCCGCGATTTACCTTCTGCTATGCCGTCGCTCTATGGAAAGTAAATAAATCATGCTAATGTTTCCGTGCTTCAGTGCTTCCGTCTGTTTATTCATTTATAATTTTTTAAATTTAAATTCTGAATGGACTGTGAAGTTTATGCACTATCTGAAAATGCTGAATTGAAGCTTGAGCTTTCCCAAAGTTTCTGTTGATTTCTTTTGCATGATATTCAGCGGACATTTTTCTCTCGAACTATCTTCATCGCATTTAACAACGCCTTGATCGTCCGTACACCAGAGGGTATGGCCTGAGCATCTTTTACTTAGTCGGGTAGTCAATCAAATGCCTCGCGAAGGTGCAACTTAGACAAATTAACTTTCGACTTGTACCAAAAGTATTTAAATTTCGAACATTTGTTTTAATGAAGATTTAACAGTAATCCATGCCAATTTTCTCTTAGTATTGTAGTTGTAGTAACTTTGTTTTAGAATTTCTGTTCTTCATTAGAGTTTTTGATTTCGGATGTGCAAGCAATAATTTTTTTTTTTTTTTTCATTAGAATTAAAAGAGGTCTGTGTGCATGTTTGCATTACTTACCAATTTGATTTTTGACTGCAGATACAATCTCGACTACGGACTTCTTATTTTCTACTTTCTCATCCAGATTTGTCTGGATCACCTCCTAGCCGTGCGCCACCGCCACCTAAAATGGTGGAGAGATCCTCTAGTGACATATCGTGTTCATCTAGGTCGCTTGGGTCATCAGTTGATGACAATATTGCAACCTATAGTGATGACTGTGGTCCTCAGTCTAATGGAACAACAGCCACACCTTCGAAACTTGTGAAGGATCTCAAAGTTCCGGCTTTAGGACTACCTAAGTTGTATACAGGTATTTGATGCTTTTATTCTTCTATTGAACCGAGATTTCATTTTCATTTAAATTCTTAACAAACAGACATGTTTATCCTTTCATATTTTTCCCATTAAAAATAAAAAGAGTTTTCCGAAACAATGCTTAAATTTGTTAAGATCCCCACCTCCCGACCATCATTTCTTATATTGTAAATTGATTCATAGTTTTATGATGATCAAAGGACTGACAGATGATGACTTGGATGAAGCAGCATATTTAATCTTGCTTGCATCAATCGCATTTTCTGGGTGCGCTCTTTTCTTATGGAAATGCTAGACTTATTTATTTATTTTGTAGTTTGTCCTGCTTAAGTGCTACGTATTGTGCTTCCGGAAACTTGAATTTGAGTTCATGTCTCTTTTTCATTTTTCTTTTCTTGATAGTGATAGGGAACATAAAATATATCCGTCTTCTTTATCATTTGTTTGTTCTCTCTCTCTCTCTCCATCACCAGGGTTGAAATCTATTCATCTGAGGATAAAAAGAAAGAAAATGGTATTACACATACTGTAGGGATGACAAGTATAAGGAATGAAGTAGATGTTCAATCAGAGAATTTTGAAGGACATTTGAATTTTCTTCACGCTATCCGCACCCAGATGCAGGCAAGGCAGACTCTGTCTCTCAGATGATCTTATCGCTATTTCCTTTTTACATTGATGTCATACATCTGAAGCTTAACGATCTATTGTAGCTATTTTGCTGCATATAGTGAGATTTTCTTTTCTCCTAGAGAGGTTAACAGAAGTCTACTAATTCTTATATATTGTTTCAGGTTTCATTTTATATGTTGTTTCATTTTTATCTGCTCAATGAAGTTTAGAATTGCCGGTTGAAATCTCGATGCATTAATACATGTTCCCAATCTATCTTACTGTTCCTCTCCCTCTCTCCCTATCTCTTCCCCCATTAACTTGGTCAATTGCTGATTTGGACTCAGAATTTATATTATTTGACCAGATTTCAGCAGTTTCTGATGCATGCATGAGGAAAAGATTAATGGAATTGGCTGCAAAGAGAAACTGGGGGCAAATTAATGTTCCTCAAGTTTTGTTGGTCCTCTTGCATGGGGTTTTCAGGTCTGATTTTCCTAGTGAGAAAGCTTATATGCAATGGAAGTTCAGACAGGTAATAAATCAATTGTGGACAATTGGTGAAAATTTTTTTTGCATTTTTTACCGTAGTTAAATTGGCTGAAAGCTCTTTAGTTATTTTTTCTGCGTCACTGTTTATCAATTATGTACAATGCACATCCTGTAGAGTGTAACCAATTGTCTTTATATTGGGCATTTTTCGATCAAGCAGCTTTAGTGGCTCATTTATGGAACTGCTTTTTGAGAAAAAAGAATGTATGTATATCTTACATCTATAGGAAACCATATATTCAAAAAACAGAGAAAAAATGAACAACCTTCCAATCTAGGTTTTCACTTATTCCATATATTGTAGTATAGACGGGCTTTTGAAACTCCGAGAAACATGATACTTTCATAGAGATTGACATCTCTGTTGGTCCTGCAACTATAAGAAGTTGGAATGGTGACATTAAAGTGAGGGGCTTAATGGGTACTGTAGTTTTAAATAAAGTACACAATCCTCCCCTCGAACAAACAAAGTACACCATAGAGCATCCCTTGAATCCTCCTTCTCTGGAGCTCTTGAACAACCTCCCCTAAATTGAAGTTTGACTCCTTCGCTAGAGCCCTCAAACAAAGTACACACCTTTTGTTCGACACTTGAGTCACTTTGACTGTACCTTCGAGGCTCACAATTTCTTTGTTTGACATTTGAGGATTCTATTGACATGCCTAAGTTAAGGACATAATACCATGTGAGGAATCATGGACCTCCACCATGGTATGATATTGTCCACTTTGAGTAGAAGTTCTCATGTCTTTACTTTTGGTATCCCTAAAAGACCTCACATGAATGAAGATAGTATTCCTTATAAATCCATATTTTTTTCCTTAATTAGCCAACGTGGGACTACTTCTCCCAATAATCCTCAACAGGTGATGACGGTTATATTTGTGATTTTCCTTAAAATAGATGAAGTATGCTTTAATTCAATGTTCCGTTCAGAAATTTATTGTTGGTTAATTGCTCGGGAGAAAGATCCTGGCTTCTTTAAAAAATGAACAATTACATTATTGATTATGGAACTCGATAACATACACATCCAGGTCTATTTTTTTCTTTCTGTACTATGAGTTACCTTACATGGACATACAAGTCCTCTCCTTACTTAACCATGAAGTTTGGCGGCTGCATCTCTAGGTCAATCTTTTGGAAGAATTCTGTTACTCTGCTAATCTTGTAGCTTCTGAACGTCAAATCTGTGAAAGTTCTCTTGTGAAGATCAGAAGTACAAAGGTTCAGTTTTATTTTATTTTATTTTTGTTTTTACTTTAATTAGATACAGAGAATTAAATGCTAGTTTAGAGTGGTTCATTCTTCAGATTTCTAAATTCCTTCTGAGATCGATCATATAAAAGAATCTGATTTACTGCATTGTTGAACATGGTAGGAATGGGATATCAATATGGTTCGATCTGAGCGTGCAAAAGTATTGTCAGGTATTGGACAAGTACTGTCGAAGTTGCCAGCTCCCTCTGCTTATCACTTAAACATCAGGTTATATGAGAAATTGATTTTAGGCATGTTTGATGTTCTTGACGACAGCCATCCCGTTATGGTATTTGTTCTTCCTTTAATATTTGTTTGATTTAGTTTTGCGTGTGCTGTAAATTAAGGCTCATAAACGTCTGATGATTAGCCTGTTACCTTTGTGTAGGAAGTGGATGACAGTCTCGTCCTTCTCAAATTGACTTGGTCTGCTTTAGGCATCACTCCGGAAGTACACAGTGTGATCTTCGGGTGGGTCCTTTTCCATCAGGTATACTTCTCTATGAAGTGATTAGTAGGGCTCGTTTTCTTTAACAAACTGCATTATTGAATTCTTCATCGTCAAATTGAAAAATTTGGTACTACTTATTTGCTAAGAATTTTTGAACTACTTCTTGAACCATAAACAATTTTTTTTATTAATGAAATGAAAAGAGACTAATGCTCACGAAATACAAACTCCCACGAGGGAGTGAAAGGAAAAACTAAAAATTAGGAAACCTTTAATCAAGCAGATTCAACACGATCTAGTCTAAGAAGAAGCCTGCTTGCAAAGATTCTTTGATGCTGTTCCAAACATAACTTGATAAACTCCCGAGGAGGCTACCGCTATCTTGAGATAACAAAACCTCAAAAAGGGCTGTTGGCGTGCCGAACATTCCAACAGAAGCGACAAAACAAAGCATCTATTTCCTTTCTCTTTAACTGAATATGAAAAAAATGATCTATGAAAGGCTTGAGATAGCCCAAAGGCCAAACCATAGAAGAAATTGAGCAAAACCTGCAGGTACAATATTTACTATGGTAGGAGGATAGGTTAGGCATGCTATGTAATTGATTCTTTAATTCTTGGGCAATTGTTGACTATTTGTACAGGACAACCTGTTGTTTTTGTTGCAAACCTGCAGGTACAATATTTTACATGTTGGTGTGGTTTATTTTTTAATGTTATATTTAAAATTTAATTACTTTTCATAGAAAAAAAATATGCCTTATCCTCTAGTTATAACTTATAAGAAAAAAAGATAGTTAGCTGTATGCTTAGGAAAATAATATTGGTTGTTTTTTTTTTTATCTAATATATTTTTCTATTGAAGTTTGTCAAGACAGGTGAAGCCTCATTCTTGGACGAAGCCATTCTTGAGTTACAGGAGGTTGCATCATCTAAAGATGACGGTGGGAAAGAAGAACAATATTTGAAAAGCCTATCATGTTCAATATCATGCAATGGCAATGAAATGAAGTTGAGTTTGGTGGAGTCCGCCTTCTTCTTAATTAGCAGTTGGTGTGACATTAAACTACAAGCCTACCATTTGCACTTCCGTGAGGTGAATTTCAATTCTTGGTTGACATTTTTCCTATGGTATCCTGTCAGCCCCATAAATGTGATGACCATTGCTCAAATGTTTTATATATTACACAGCTTTTTTGATGTAAGGTCCCAATATTATTCTACCTCCGTTAATTACTATTTCTAGCGTACCATTGCCCTTATATTTGTTATCACGTTCTTGAATGTTCCAAATTTCCTATCATTTGTCTACACTAATTTAGCTATTTTTCTTAAAATCAGAAAACTTCTTACTTTGGGAAAGTGGTGAGCCTGTTGTCCACTGTTGGAGTTATTACTGATTGCAATACTGTAAAGGTAGGTACTAATGCCCAGCGTACTAAAATGGATATTTGATTTATAGATCTCTTGGGCTATGTTTTCATAAATCTTCCGTTTTGATAGTTCATGTCTAAATAGAAATTTGTTCCCAGAAAAAATCATTTCTCCCAATTTTTTTTTACAATAACTTTCATATTACATTCAGCGATACAATCTCCGTGGGAGTAAAATGAAAAATGAAAATGATGTTAAATCACCACAAGCTAGCTTACAGGTTATGGTAAATTTATTTATTTTATTTTTTTTAATCAATGTTGTAACACTCTTCTCCACTTTTGGGCTTGAAAAGTAGTAGAAGGCCCAATAAGTGAAAACCAATAATACTAGGTGCATAAATTAGTTACAAAGGTTTGAACATAAGACCTCTCTGAACTCAAAAACTTAAGCTTTTGAGTATAACATGCCTGAATTGTTCAGTCATGTGGAGTTACGCGAGTTTATCATGTTATGACGAGGATATTAAGGGATTGGAGGTTATCATTTCCTGATGCTATATGGAATAACTAGCTTTAGTTATAATGTTGCTTACTGAATTAGCAAAATTCCAGGAGCTTGCTTGCATAATTATGGTGGTCACTTTAAATAGCTTGTTTTCTATCCTTTAGCGTGATGCATTCTTTGCTGTTGTTTGTGGGATTTAGCTAGAGATGAATAGTAGAGTTTTTGAGGGGTTTAGGAGATCTCCTGATCTCCCGATAGATTTATTGCACATGTCTGTTACTAAGAGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTAAATTATTTATTCATTTATTTAATTATCCTTTAGACCTCATTCTTTTGGATTGGACTCTTTTTTGTGTAGTTTTTCTTGGGTCTGAGTGCTTCTTTTTGTGTTATTTGTATTCTTTCAATTTATCTCAATGACAGTAGGTTTTTTTATTTAAAAAAAAAATGTAGGGTGCTTCATGAGAGATTATTTCTATGGGAAGGAACCAAAAAGTGTTAGATATTTTCATTTTATTTTTATTTCATTTTATATCTGTTGAAATTAAATTAATATTTTTTATAGAGCAGCAGAGTCTAGTTTTAAGATGATGATAATTTCAAGTCTACATGTTCAAATTGCGGCTAATCGGGCATCTGATTTTTACTTGCTTATATTAAAAGGAAACTACTTTTCCATAAATACTGGAAAGTTGCAACCGAAATTGTAGGAAACAACTCCTTTTAGGAGATTACAAGGAGCCCCTGAAGCAAACTTATTCATTCATATTCTAATCTTGCTGTTTTGAACCTTATGTGAATTGTCAATATTTTGCTTTAGTTAACAAAATTGGATGGTTTGAAAGCAATTGGGGCTAGAAAACTCAGAACCTATGTCGAGAAGTCCATAGAAGCTGCTTACAAAGAGGTTAGTCGAGATTTGTGACACAGTATCATAAGAATATGGTTGAGAAAGCATCGTTTCTTTACAATCCATCATAGAGTATTCAATTATTTTAGTCTACTTATTTGGTTAGGCAGAGAATAATGAAAATTCTGAATCCAAAGAGAGTATACATCCTCTGGCACTCCTTGCCAATAGGCTACGATTGGTTGTGGAGAAAGAAATTACTGTATTCTTCCCGGTTTTACGTCAATTATGTCCTGATTCTGGGATAATTGCAGCCATGCTATTGCACCAATATTATGGAGAAAAACTGGTTTGTTTCGTTAGTTCCTTTTGTGAAATTTTTGTTTTGCCAGTGCTGGTTTTCCTCTATTTCATTTTTTTTATATTCTGGATGTTGTTCTCATCAAAACCCAGAAGCCATTCTTAAAGGAAGTATCGAAGCTTTCAGATGATGTAAGGTCGGTGCTTCCAGCTGCCTATTCCTTAGATCGTGACCTAACTCATTTATTTACTGCTGCTTCCAAAGAAAGTAGATTAAGCCCTCTTCTCAAGGAAGACCTTGAGCACTATCCGGTCAGCCAATCTTTTCCCTTATACAATATCACATATGGATGTATAATGCTTGTATTTGAATTTTATATTCTGTATGCTTGTGGTTGTTTTGAACTCCAAGAGACACTAAAAGTTGAGTAGTATAAACAACTAAAATTTGAGAGAATCAAAATAATCTATTCCATTCACCAAAAAGATAAATATATACAAATGCACAAGGTTAACCTAAGCAAGAGAAGGTAAAGAGACAATAAAGAACCATAGAACTAGTATATTTACAAAATTATAATTATTCCAATATAAAGACTTTATTTAGCACTCCTTCTCAAGTTGGAACTTATATATTCATTATGCCCAACTTGTTAGAGATAATCTATATTCATGCTCCATTGGTTTGGTTTTATCTTTTCACGTACAAAATGACAATTGACCTCAATATGTTTAGTTTGCTCATGAAACATCTTGAGCAAACTAAACATACTAACACACCACAATTTGGTCGACGTTGTGATATCAAATCTCAATCCAATAAGAAGTTGACGTATCCAAACTAAATTCACATACAGAATATGTCATTGCTCTATACTCTGATTCAACACTTGAACGTGATGCTACTGTTGTTTCTTACTCTTCCAAGAAACTAAATTATCTACAAATACAGAATATCTTGAAGTTGGTCTTTTTATCTTCCTTAGACCTTGCCTAATCATCATACGAGAAGCATTCAATGTTAGAATGGCCATAATTTTATATAAAAAATCAGGCCCAGGAACTGCTTGCAAATAACATAAAATCTACTCCAATGCAGGCCAATGATCAACTGTAGAGAATATATATACTGGCTTATGGTACTCACCGAATAAGCTATGTCTGGTTGAATCACTGCAAGATAATTGATTTTTTCCATTAGCCTCTTTCCATATTTTTTTAATAATTCTTCCTATTTTGTGAGCTGGAAATTGAGCACCATTTGAGTACTACTTAGCTTAGCCCTTAGCTTTCCTACTTTAGCAAACAAGTCAAGTACATATTTTCTCTGTGATAACAAAATTCCCTTCTTGCTTCTTATTACCTATAAAGTATTTCAACATTTCCAATATGGTTTAGCCCTAACTTCCTATTTCAGCCAATAACTCAAGTAGAAGTACATATTTTCTCTGTGATAACATATTTCCCTTCTTGCTTCTCATTACCTCAATTCCATTACCTCAATTCTTAGAAAGTATTTCAACATTCCCAAATCTTTTGTATGGAATTGACTATGCAAAAAGGTGTTGAGCGGTTGAATGGCTAAAGTGTCATCACCACTAATAACAATATCATTAAGATACACGATCAACATGATGATCTCACTCTCTGATTATTTATAAAAGACAAAATGATCGGACTTTCTTTTCCTCATTCCAAAGTTTTCAATCATCTGCTTGAATTTTTTCAAACTATGCTTGTGGACTTTTCTTTATCCAAATGAGAATGGAAGGATGCTATGAGAGAAGAAATGCATGCTTTAGAAAAGTGTAACACATGGAAGTTGTTGATTGGCCAAAAAGAAAAAATATAGTTGGGTGTACGTGGCTTTTTACAGTAAAGTATAAGGCTGATGGAACGTTGGAAAGGTATAAGGCAAGATTAGTAGCTAAGGGGATACACTTGACCTTACTCTATAGACTACCATGAGACATTTGCCCTAGTTACAAAGATGAGCATTCCTCTAGGATTTGAAAGATCAAGAGGTAAAGTATACAAACTGAGAAAGAATTTGTATGGATTGAAACAATCACCAAGGATGTGGTTTGGAAGATTGTGGTATTGTTAAAGTCAAGGAGATCGCACTTTCTTTATCAAGGGGAGTTACTGTGCTTCTAGTGTATGTAGATGAAATTATAGTTCCTGAAGATAATGAAGAAGAAAAAATTGAACTTATAAGGAAATTGATGAAAGAATTTGAGATCAAAGAACTTGGAAAGTACTCTCTTGGTATTGAAGTAGCATACTCAGTTGAGGTTTATTTTTATTTTTTTTTATTATTATTATTATCAGAACAGGATTTATTGATGGAAAGATGGAAAATGGATGTGAACATCTACCAACACCAAAGGATCCACCTCAGAAACTGGCAGAGTAAAAAAAGGAACCTGCAGTGAACCGTGAAATTTGCCAGAGGCTAGTGGGGAGGCTTATATTTCTAGCTCACACTCGACTGGACATAGCATACTCGGTGAGTATAATTAGCCAATTTATGCCTGATCCACAAGAGCCTCACCTTCAAGCAGCGTACAGAGTGTTAAGTTACCTGAAAGGAACTTCAGGAAAAGAAATTTTGTTCCAACGACACAATTCCCATGCATTGGATCAATAGTGAAGATAAGATCTACTACGGGAATTGTACTTTTCTTGGAGGTAATTTGGTATTATGAAGGAGTAAAAAACAAAATGTGGTGGCTAGATCATCTGTCGCATCAGAAGTCCAAGCTATTGCACAAGAGCTATGTGAACTCTTATGGTTGAAAACAATCCTTGATGGCTTACACATTAAAGGAAAAGGTCCGATAAAGATGTACTGCGATAACAAATCAGCTATCGATATTGCACATAATCCTATCCAGCATGATAAGATAAAGTATATTGAAATTGATTGACATTTTATCAAAGTGAAACTAGAGAAGGGGATGATATGAATTAGTTATGTGCCCTCACACCTTCTACTAGTAGATATACTATCAAAATATAATTTCACAACTCAATGTTTCATGAGTTAGTATCTATTCCTCAGCTTGAGAGGGAGTGTTGAAGGAATAACCTATATTCAAGGAGTACTTGTGTAATTCCTAGCCTGAACCTAAGAGATATTTGTGTAATCCCATTAAATAAGGGAAGTAGTTGAACTCCCTAATTTATCTATATTGTTACCTAGAGATTTATTTAGATTTATTGTCACATACTATCTAGGGATTTATTTATCTTTTCTACAATAGGGGCATGCACTTTCATAAAGATAATTTAGACTTCCAACACTATCAAAAATAAAAAACTTAACTCAACCAAACCCAGCAAACCAAGGAGCCAAAGAATAACAACAATACTTAGTTCACGAGGATTCTTAAGTAGTTGTCTTTAGTATCTTTAACTTCTGAAATCATATAATTTAACCCGAGGCTTTGAAAATGTTCCAAAAGAATTCTCTCTCTTCTCCTTTGCCTCCCTTCCCCTCTGTTGCACTTGTCTAACCATGTCGTCCATTCTATGCCTCTCTCCACTCGTATGAGTGCGCCCAAAAGCGTACATTCTCTCCCCCAGAGATTTGAATGCAAGGGAAGGTGGTGGGAATCTTTTTCATTCCAATGTGAAAATTGAACAATGTGAGGACAGCAGGGTGGCAAATATAAGGGACAATTGGCTGGAGCCTTTTTTGCTAGCTGACTGTTCCTCTCTCTGAATATTTTACTACAGCTCATTATTTTTTTTCTTTTCGTAATCTTAATTTCTTTTAATTGTACAATATTTAATTTCCTTGTAAAATATTTTATATTGCCTATTTCCTCTCGTAAGTTCTTAATAGGTTTGTATATATTAGGTTGTAATCTTCTCATTTTGAGATTTGAAATTGTTTGGGTTATTCATTTCTTCATTAACGGCAGTGTTAAATCCATATTTAGTGTTGTATGTACTTGTAATATCAAGACATGTAAGCTGTTTGTTTCTTTCCTCCGTCTTTTGGGCAGATTGTACAAATTGCTAAGCCCATAATTCTTGACTGGATGATCGATCAACTGGAACAAACCTCTGAATGGACTGGACGTGCTTTCAAGTTAGAGGTTAGTTATTTGCAGACTGATTTTTCTGGTTCAATATGTAATATGATATTTGTCTGAGTGGAAGCAGTAGCTAGTTTATTTTAAGGAAAGATGTATCAGATAGGAAGTCAAGGATCAAGTGAAGGGTGAAAATATGATCTTGATAAAATTGCTGTTATTTATCTTTCTTGTACAGTAATCATTAAAGGAATGAAACTGTAGAAGCCATGGTAAAAAAGTATGTAGGTTGACATAGCCTGAACAACTGAAACTTAGAGAGATTGAAAAAAACACATTAGATATATCTTTTCTTTCCATTGCATGTGTAGATTGATCAACACCTCTTACTAATTTTGGGTTTTTTTTGGCAAGTGAAGGATTGGGAACCAATATCGTTTCAGCAAAATCTGGCAGCATCAGTTATTGAAGTTTTTAGAATTATTGAGGAGGTATTTATCATTGTAAAGACAATTACTTCAGTTATGTCTGAAAGACGAGAAATTTTGGTTTGCTCCTATGAATTCTAATGATACTTTCCTACCAAAGTTTCTGAGGTTGGGGAGTGAAAGAATATTTATAGAGTCGCTGTTAAATATACTGATTGATAACTTTTGAGCACTTGGGGTTATGTTCTACCGATGTCATAGGTATTAGTGGGTCATTCTTGTTGGTTTCTCCTGCATTTTTTTTTTCCCCAAATAATCAATACCATATGCAACCAGTGGAAGTAGTAGCAGTGTGTAAAATTGTCTTGTAGCTACATTTTTTCTGTTTCCTTTTAAATTTACTATCGAGGGCTAGTTTCATAAATTTTTGTTTACTATTATGGACTTGTAATGAAAAACTTGCAGACTGTTGACCAGTATTTTGACCTGAACCTTCCGATGGACATCACACATTTGCAAGCTCTCCTGTCCATTGTCTATCACAGCTTGGATGGCTATCTATCGGGTCTTCTCAACCAGTTAGGTAATTTATGAATTTTGGAATTTACGAAGAATGAAGATTGATGGCAAGTTGCAAAAAATTCAGACAAATGTACACACACACACATAAAAGGAAAAGCAAAACAACCTAAGGACAAAGGGATTAGGCATCCCTTTTCCACAAGGAATGATGGTGTGTAAGTGGAGTAGTTACAAATCACAAAAGTAATCAAAGGAAATATTCTTATCTGAAAAAACTCTTGAATTCCCTTCCATCCATGTCAACCGTAGAATAGCCTCGAATGGTACATGTTTGGATTTATTAATTTACTCTTGAAATGATCATTCTATTGGAGATTTGATCATATCATTTTTCCATTGCAGTTGAGAAGAATTATTTGTATCCACCTGTTCCTCCATTGACTCGTTTTATGGAGACAGTTGCAACAGGAAAGAAAAAGTTGCCCGAATGTCATTTAGACGAGCATGTGAGCAGCAAATTGAATGGATTGACGATATCTAAGCTTTGTATAAAATTGAATACCCTTGGTGTAAGTGATGCTTCAAGTAAAAGCTTTTTTTTTTTTACTTTTTTTTTTTTCACTAAATATGCATATGCATGTTTGAAGTCATGTGAGATAATTCTTCTGTAGTAAATGCAATACTTATAATATTTTGAGCGTGGTCTTTGTCATGTATGATCAGTTAATGACAATACAAAATCAGGGGGAAGGGCAGTTCCCCCTAAATCTTTTACGTGATCATAGTTAACATGCCATTCTTAGTGACGCCGTCAATTGGTAGTTTTATTTTGTGTTACCTCTCTTTTCTCTTCATGCAGTACATTCAAAAACAAATTGAAACACTGGAGGATAGAATTGGAAAATCTTGGGCGCTTATTGGAGGATCTGCTAAGCATAAACGCGGTACACATATATTTATTTCACTTGATATTACCATTCAATATACATAACATTAATGATAACTAAAAGACAGAAACTAGTGGTCTGTCTTCACTTTGCATAATGACAATAAAAACCATATGCGGAAGAGGATTGAATTAGTTGACATCAGTATTTTTATTGCTAAAGTATTAACAAGTCCACACGGCATATGACAAAATGTTGCATTCTCATTCCTTTCTCGTGTTTGTTTATCTGGCTTGTGGAATTATAATACTTTGTGACCACTTCTTAAAGCTCGGTTCAGGTTTAAGTTTTCTTATTTCAAATTTTCTTTAATATTCTTAATCACTAAAATAAAAAACAGATTGATGTGCTTATTCGTTCTGTATATTAAAAAATTTGTGGAGGAAGTGTGTGGAAGTCCAAGTTCCTCAAATTACGTGGTTATCATGTAGCTCTTTTATCTATCACATCGCTATATATTTTGGTTCTATCAGAGTCTCTATAGAGAGGGATGAATATGTTGTTTATTCCTTCTTGGTGGTTATTAAAACTTTAGACAAAAGCACACTTTTCGTCCTTAATATTAGAGTTTAGTTGACCATGAAAATATCATGTGCTATGTTGAGGTGGCAAATTTTAAATGAGGTGCCTATTTTCTCTAATTTATATTATTTTTTTATTACCTTACTTTTTCATGTTTTTCCCTCTCTCCCCTCCTCAAATCTAGTCTGGTCTAATTTGCTAAGAGAGGGAAAGAGGAAAAGAGGATGCAGTATGCAAATGGAGGAAACCAGTCACCGAACTCGGACAACTTCTAATTATTTTGAACCCTAAATTTCTTAGAAAATTAGACCAGACTAGATTTGAGGAAGGATTAAAGACTTCTCGCAACATCTAGACACGTTTCTCTAATTGAAGTAATTTGATTTGCTATGCCTATGATAAATTTTGAACGGTTGTTTTCAAAGTTTTGGCCATGGTAATTTTCCAACAGATAGCCCGATCTAATTAATTTCATTCAACCCCAAAATGATTGCTTTCATCACTGTATATTTGATGTTAATACCTATAGTATCATCCATTCAGCCCCGGAAGAGGCGACTGTCAATGGTGGTCTTCGTACCTGTAGTGACGAAGTTAATGAGTTGTTTGCCAACACTTTCAACAACATTAAGAGTTTCATTGCAAAGGCTATCAGCAAGTTTTGTGACATTACTGGTGAGTTTCTCAACCAATTTTGTGGTGGCCTTTAAATTTATGTAGTTATTAAGTTAAGCTAGATTGATTTTAGGTTATGCTTTAAAATATATTTGTTGAGCCTAAGTACGGTGCATTTTCTGGTCTCCGTATTCTCAATTAAGTATGGCCTCAAATAAGAGGGAAACTGGGGAGAACATAACCATTTTATTGTCGACCCTAAGTATGGTCTCCATATTCTCAATTATGTCCTCAAATAAGAGGGGAACTGGGAAGGACAACCAATTTTTTAGTCAAGTTCTTTTACACTTCTATTTTTGGTTTTTGTTAAACTGTTTTGCAAGATGACTGTAGAATGCTTACATAAAAAATCATATTTGTAACAACTTTTCCATCTCTGAATAATAGTCTTAATTTTGTTCAAATTCTACTCTCCAGGCATAAGAGTTATTTTTTGGGATCTCAGAGATGAGTTCCTCTCGTATTTGTATCATGGCAATGTTGATGCTGTTCGATTAGAAGACGTTCTTGCGCATCTTGATACGGTAAGATCTTTGACTCGCTAAATGTGATATTGTGCAAATTTTGTCTGTAATGTCTTTTATGTGCGTCATGCTTGCATATGTATAGTATATTTCTTTATACATTTTATCTATGGGTTGCACCCATCTAAGTCATCTATTATATTATAGGTCCTAAACAATGTATGTGGCAGGATCGATGATACTCTTAGGGATCTCGTGGTATTGAGCATCTGTCGAGCATCAATGGTACGATCATTTAAGAATGCTACTTCTATCTCCTACCTTTTGCTCCTTCCATCTGAAATGCATGGTTTTCCAGGAAGCCTTCATATGGGTGATGCTTAATGGAGGTCCTTCACGTGCATTTTCTGATTCAGATATTGTTCTCATTAAGGAAGATCTCGGTATATTGAAGGTATGTGCTCTAGATTGAAATTACTTTTACAGTTGTAGTTTATGATAGGTAGGATTCCTGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACTAATGCCTTAGCCTTGCTTGTAAGTAGAGGAAATGGTTGATTGAACTTGTGGTTTGCGATAAACGATTATATTTATATTGTTCTTGCATCTTCAATTCTAGGACTTCTTTGTAGCTGATGGAGAAGGTCTTCCCCGTACTCTTGTTGAGAAGGAAGCAAAATTTGCTGAAGAAATATTGGGCTTATATTCACTTCCGGTTTGTTTATCTTGTTGATCAAGTTTATATTATAAGTCCTTATTATAGAATTTTTCATATGGACCGCGTTTTCTAATTGGTGCACTTTTCTGCAGACTGAAACTATTATTCAACTGCTGATGAGCTCAGGTGGGGATATTTCTACGGAATTGGATCCGTGTAGTAATAATGGCAGTCTTCATTTTAACGATTCCCAGGCATTGGTACGAGTCTTATGCCACAAAAAAGATACGGAAGCTTCCACGTTTCTAAAGCGAAAATATAATCTCCCTGCTTCTTCAGGTAATTCTCTATCCATTATTGATTTCATAGCCTTTGGCACGCTGTTTCCTCCTCCACAATTGTTGAGAAAGAATGAAAGATAAAAAGGATATTGGAGAACTGATAACCAAAAATCTTGTTGCCCTTCATCTAGAACCAGTTACAACTATTTTCGAGTTGAATAGTTGAACAAGCTATCTTCCACCACCAAGGTTTCAAAACGTTATGCATTTTTTTTTTTATTAAAATATGATCCGATTTAGTCAATTGAGTTTTAGATTATGACCTCTTAAAATTTGAGTTCTGACCTCTTAAAGTTAACTATGCTTAAGCTATTTTTGAAATTGAATGCCTCCCAAAGTTGAAATAATCTATTGTATTTAAACTTTGTGAGGTCGTAACTATTAATTAACTAAATTGAGTGATATTTTCTTTATCCTGTGTAATCAATTTCTACATGTTGGAAGGTAGCTTGTTTGACCCTTTCACCCAAAATTGGTTGTAATTGATTGTAGATGAATGACTGTAGAGGGACATATTGTGATGAGGGAGGATAGAGAATAAAATGAACTTACAGGACAGGAAAAGTAAAATTAGGCAAGCCTTGATAAATTTTTTGGATTTAGGTCATTTCCAGCAATTATCCAGTTCAATAAATTAATTAGAAAATATGCCCTAAAAGTTTCTTGAAATTCTTGCAGATTATGATATGACCCCCTTGCAGGCCTCAACTCAGAGATGACATGACTTGTGACAAATTCAAAGTGTGATTCATAATTGATAGAAAATAGAAGGATTTAGCAATTTCCAACGATGACTTTGCACGGTGGCTAGAAAGAAAGAGTAGGAAGGGAAAATTGAGTTTGGTCCCATGTGGCTAGATTTTACTTGCCGAAGGCGAGTCTTATGGGTGATCCAATCAAGAGTGGAGTTTTGCCTATAAATGTACATAATATATTTCATTTTATAATAAAACGAAAATTGCATTTGTGTTTGATTATACTTAAAATGATCATTTTGAA
mRNA sequence
TTCCTCTCGATTTCCGTCGTGGTGATCGCCGATGGAGTCGGGTTCTTTGTTGCAGCGGTACCGTCGTGACCGGAAAAAGCTGCTGGCATTCTTGTTGTCCTCGCGTTTTGTCAGAGAGCTGAGGACTCCGGCAGGTCCTGTCACTGATTTCTCCGCCGTCGACCTTGACTCTCTGAGCGCTAGCTATGTTTTGGAGTGCATTAAGTCTGGTGGAGTTATTGACATTTCTACAGCTGCTAAGAAGAAGCAACATGAATCTTCTTACCCGATAATGTCAATCAAATGCCTCGCGAAGATACAATCTCGACTACGGACTTCTTATTTTCTACTTTCTCATCCAGATTTGTCTGGATCACCTCCTAGCCGTGCGCCACCGCCACCTAAAATGGTGGAGAGATCCTCTAGTGACATATCGTGTTCATCTAGGTCGCTTGGGTCATCAGTTGATGACAATATTGCAACCTATAGTGATGACTGTGGTCCTCAGTCTAATGGAACAACAGCCACACCTTCGAAACTTGTGAAGGATCTCAAAGTTCCGGCTTTAGGACTACCTAAGTTGTATACAGGACTGACAGATGATGACTTGGATGAAGCAGCATATTTAATCTTGCTTGCATCAATCGCATTTTCTGGGGTTGAAATCTATTCATCTGAGGATAAAAAGAAAGAAAATGGTATTACACATACTGTAGGGATGACAAGTATAAGGAATGAAGTAGATGTTCAATCAGAGAATTTTGAAGGACATTTGAATTTTCTTCACGCTATCCGCACCCAGATGCAGATTTCAGCAGTTTCTGATGCATGCATGAGGAAAAGATTAATGGAATTGGCTGCAAAGAGAAACTGGGGGCAAATTAATGTTCCTCAAGTTTTGTTGGTCCTCTTGCATGGGGTTTTCAGGTCTGATTTTCCTAGTGAGAAAGCTTATATGCAATGGAAGTTCAGACAGGTCAATCTTTTGGAAGAATTCTGTTACTCTGCTAATCTTGTAGCTTCTGAACGTCAAATCTGTGAAAGTTCTCTTGTGAAGATCAGAAGTACAAAGGAATGGGATATCAATATGGTTCGATCTGAGCGTGCAAAAGTATTGTCAGGTATTGGACAAGTACTGTCGAAGTTGCCAGCTCCCTCTGCTTATCACTTAAACATCAGGTTATATGAGAAATTGATTTTAGGCATGTTTGATGTTCTTGACGACAGCCATCCCGTTATGGAAGTGGATGACAGTCTCGTCCTTCTCAAATTGACTTGGTCTGCTTTAGGCATCACTCCGGAAGTACACAGTGTGATCTTCGGGTGGGTCCTTTTCCATCAGTTTGTCAAGACAGGTGAAGCCTCATTCTTGGACGAAGCCATTCTTGAGTTACAGGAGGTTGCATCATCTAAAGATGACGGTGGGAAAGAAGAACAATATTTGAAAAGCCTATCATGTTCAATATCATGCAATGGCAATGAAATGAAGTTGAGTTTGGTGGAGTCCGCCTTCTTCTTAATTAGCAGTTGGTGTGACATTAAACTACAAGCCTACCATTTGCACTTCCGTGAGAAAACTTCTTACTTTGGGAAAGTGGTGAGCCTGTTGTCCACTGTTGGAGTTATTACTGATTGCAATACTGTAAAGTTAACAAAATTGGATGGTTTGAAAGCAATTGGGGCTAGAAAACTCAGAACCTATGTCGAGAAGTCCATAGAAGCTGCTTACAAAGAGGCAGAGAATAATGAAAATTCTGAATCCAAAGAGAGTATACATCCTCTGGCACTCCTTGCCAATAGGCTACGATTGGTTGTGGAGAAAGAAATTACTGTATTCTTCCCGGTTTTACGTCAATTATGTCCTGATTCTGGGATAATTGCAGCCATGCTATTGCACCAATATTATGGAGAAAAACTGAAGCCATTCTTAAAGGAAGTATCGAAGCTTTCAGATGATGTAAGGTCGGTGCTTCCAGCTGCCTATTCCTTAGATCGTGACCTAACTCATTTATTTACTGCTGCTTCCAAAGAAAGTAGATTAAGCCCTCTTCTCAAGGAAGACCTTGAGCACTATCCGATTGTACAAATTGCTAAGCCCATAATTCTTGACTGGATGATCGATCAACTGGAACAAACCTCTGAATGGACTGGACGTGCTTTCAAGTTAGAGGATTGGGAACCAATATCGTTTCAGCAAAATCTGGCAGCATCAGTTATTGAAGTTTTTAGAATTATTGAGGAGACTGTTGACCAGTATTTTGACCTGAACCTTCCGATGGACATCACACATTTGCAAGCTCTCCTGTCCATTGTCTATCACAGCTTGGATGGCTATCTATCGGGTCTTCTCAACCAGTTAGTTGAGAAGAATTATTTGTATCCACCTGTTCCTCCATTGACTCGTTTTATGGAGACAGTTGCAACAGGAAAGAAAAAGTTGCCCGAATGTCATTTAGACGAGCATGTGAGCAGCAAATTGAATGGATTGACGATATCTAAGCTTTGTATAAAATTGAATACCCTTGGTTACATTCAAAAACAAATTGAAACACTGGAGGATAGAATTGGAAAATCTTGGGCGCTTATTGGAGGATCTGCTAAGCATAAACGCGCCCCGGAAGAGGCGACTGTCAATGGTGGTCTTCGTACCTGTAGTGACGAAGTTAATGAGTTGTTTGCCAACACTTTCAACAACATTAAGAGTTTCATTGCAAAGGCTATCAGCAAGTTTTGTGACATTACTGGCATAAGAGTTATTTTTTGGGATCTCAGAGATGAGTTCCTCTCGTATTTGTATCATGGCAATGTTGATGCTGTTCGATTAGAAGACGTTCTTGCGCATCTTGATACGGTCCTAAACAATGTATGTGGCAGGATCGATGATACTCTTAGGGATCTCGTGGTATTGAGCATCTGTCGAGCATCAATGGAAGCCTTCATATGGGTGATGCTTAATGGAGGTCCTTCACGTGCATTTTCTGATTCAGATATTGTTCTCATTAAGGAAGATCTCGGTATATTGAAGGACTTCTTTGTAGCTGATGGAGAAGGTCTTCCCCGTACTCTTGTTGAGAAGGAAGCAAAATTTGCTGAAGAAATATTGGGCTTATATTCACTTCCGACTGAAACTATTATTCAACTGCTGATGAGCTCAGGTGGGGATATTTCTACGGAATTGGATCCGTGTAGTAATAATGGCAGTCTTCATTTTAACGATTCCCAGGCATTGGTACGAGTCTTATGCCACAAAAAAGATACGGAAGCTTCCACGTTTCTAAAGCGAAAATATAATCTCCCTGCTTCTTCAGATTATGATATGACCCCCTTGCAGGCCTCAACTCAGAGATGACATGACTTGTGACAAATTCAAAGTGTGATTCATAATTGATAGAAAATAGAAGGATTTAGCAATTTCCAACGATGACTTTGCACGGTGGCTAGAAAGAAAGAGTAGGAAGGGAAAATTGAGTTTGGTCCCATGTGGCTAGATTTTACTTGCCGAAGGCGAGTCTTATGGGTGATCCAATCAAGAGTGGAGTTTTGCCTATAAATGTACATAATATATTTCATTTTATAATAAAACGAAAATTGCATTTGTGTTTGATTATACTTAAAATGATCATTTTGAA
Coding sequence (CDS)
ATGGAGTCGGGTTCTTTGTTGCAGCGGTACCGTCGTGACCGGAAAAAGCTGCTGGCATTCTTGTTGTCCTCGCGTTTTGTCAGAGAGCTGAGGACTCCGGCAGGTCCTGTCACTGATTTCTCCGCCGTCGACCTTGACTCTCTGAGCGCTAGCTATGTTTTGGAGTGCATTAAGTCTGGTGGAGTTATTGACATTTCTACAGCTGCTAAGAAGAAGCAACATGAATCTTCTTACCCGATAATGTCAATCAAATGCCTCGCGAAGATACAATCTCGACTACGGACTTCTTATTTTCTACTTTCTCATCCAGATTTGTCTGGATCACCTCCTAGCCGTGCGCCACCGCCACCTAAAATGGTGGAGAGATCCTCTAGTGACATATCGTGTTCATCTAGGTCGCTTGGGTCATCAGTTGATGACAATATTGCAACCTATAGTGATGACTGTGGTCCTCAGTCTAATGGAACAACAGCCACACCTTCGAAACTTGTGAAGGATCTCAAAGTTCCGGCTTTAGGACTACCTAAGTTGTATACAGGACTGACAGATGATGACTTGGATGAAGCAGCATATTTAATCTTGCTTGCATCAATCGCATTTTCTGGGGTTGAAATCTATTCATCTGAGGATAAAAAGAAAGAAAATGGTATTACACATACTGTAGGGATGACAAGTATAAGGAATGAAGTAGATGTTCAATCAGAGAATTTTGAAGGACATTTGAATTTTCTTCACGCTATCCGCACCCAGATGCAGATTTCAGCAGTTTCTGATGCATGCATGAGGAAAAGATTAATGGAATTGGCTGCAAAGAGAAACTGGGGGCAAATTAATGTTCCTCAAGTTTTGTTGGTCCTCTTGCATGGGGTTTTCAGGTCTGATTTTCCTAGTGAGAAAGCTTATATGCAATGGAAGTTCAGACAGGTCAATCTTTTGGAAGAATTCTGTTACTCTGCTAATCTTGTAGCTTCTGAACGTCAAATCTGTGAAAGTTCTCTTGTGAAGATCAGAAGTACAAAGGAATGGGATATCAATATGGTTCGATCTGAGCGTGCAAAAGTATTGTCAGGTATTGGACAAGTACTGTCGAAGTTGCCAGCTCCCTCTGCTTATCACTTAAACATCAGGTTATATGAGAAATTGATTTTAGGCATGTTTGATGTTCTTGACGACAGCCATCCCGTTATGGAAGTGGATGACAGTCTCGTCCTTCTCAAATTGACTTGGTCTGCTTTAGGCATCACTCCGGAAGTACACAGTGTGATCTTCGGGTGGGTCCTTTTCCATCAGTTTGTCAAGACAGGTGAAGCCTCATTCTTGGACGAAGCCATTCTTGAGTTACAGGAGGTTGCATCATCTAAAGATGACGGTGGGAAAGAAGAACAATATTTGAAAAGCCTATCATGTTCAATATCATGCAATGGCAATGAAATGAAGTTGAGTTTGGTGGAGTCCGCCTTCTTCTTAATTAGCAGTTGGTGTGACATTAAACTACAAGCCTACCATTTGCACTTCCGTGAGAAAACTTCTTACTTTGGGAAAGTGGTGAGCCTGTTGTCCACTGTTGGAGTTATTACTGATTGCAATACTGTAAAGTTAACAAAATTGGATGGTTTGAAAGCAATTGGGGCTAGAAAACTCAGAACCTATGTCGAGAAGTCCATAGAAGCTGCTTACAAAGAGGCAGAGAATAATGAAAATTCTGAATCCAAAGAGAGTATACATCCTCTGGCACTCCTTGCCAATAGGCTACGATTGGTTGTGGAGAAAGAAATTACTGTATTCTTCCCGGTTTTACGTCAATTATGTCCTGATTCTGGGATAATTGCAGCCATGCTATTGCACCAATATTATGGAGAAAAACTGAAGCCATTCTTAAAGGAAGTATCGAAGCTTTCAGATGATGTAAGGTCGGTGCTTCCAGCTGCCTATTCCTTAGATCGTGACCTAACTCATTTATTTACTGCTGCTTCCAAAGAAAGTAGATTAAGCCCTCTTCTCAAGGAAGACCTTGAGCACTATCCGATTGTACAAATTGCTAAGCCCATAATTCTTGACTGGATGATCGATCAACTGGAACAAACCTCTGAATGGACTGGACGTGCTTTCAAGTTAGAGGATTGGGAACCAATATCGTTTCAGCAAAATCTGGCAGCATCAGTTATTGAAGTTTTTAGAATTATTGAGGAGACTGTTGACCAGTATTTTGACCTGAACCTTCCGATGGACATCACACATTTGCAAGCTCTCCTGTCCATTGTCTATCACAGCTTGGATGGCTATCTATCGGGTCTTCTCAACCAGTTAGTTGAGAAGAATTATTTGTATCCACCTGTTCCTCCATTGACTCGTTTTATGGAGACAGTTGCAACAGGAAAGAAAAAGTTGCCCGAATGTCATTTAGACGAGCATGTGAGCAGCAAATTGAATGGATTGACGATATCTAAGCTTTGTATAAAATTGAATACCCTTGGTTACATTCAAAAACAAATTGAAACACTGGAGGATAGAATTGGAAAATCTTGGGCGCTTATTGGAGGATCTGCTAAGCATAAACGCGCCCCGGAAGAGGCGACTGTCAATGGTGGTCTTCGTACCTGTAGTGACGAAGTTAATGAGTTGTTTGCCAACACTTTCAACAACATTAAGAGTTTCATTGCAAAGGCTATCAGCAAGTTTTGTGACATTACTGGCATAAGAGTTATTTTTTGGGATCTCAGAGATGAGTTCCTCTCGTATTTGTATCATGGCAATGTTGATGCTGTTCGATTAGAAGACGTTCTTGCGCATCTTGATACGGTCCTAAACAATGTATGTGGCAGGATCGATGATACTCTTAGGGATCTCGTGGTATTGAGCATCTGTCGAGCATCAATGGAAGCCTTCATATGGGTGATGCTTAATGGAGGTCCTTCACGTGCATTTTCTGATTCAGATATTGTTCTCATTAAGGAAGATCTCGGTATATTGAAGGACTTCTTTGTAGCTGATGGAGAAGGTCTTCCCCGTACTCTTGTTGAGAAGGAAGCAAAATTTGCTGAAGAAATATTGGGCTTATATTCACTTCCGACTGAAACTATTATTCAACTGCTGATGAGCTCAGGTGGGGATATTTCTACGGAATTGGATCCGTGTAGTAATAATGGCAGTCTTCATTTTAACGATTCCCAGGCATTGGTACGAGTCTTATGCCACAAAAAAGATACGGAAGCTTCCACGTTTCTAAAGCGAAAATATAATCTCCCTGCTTCTTCAGATTATGATATGACCCCCTTGCAGGCCTCAACTCAGAGATGA
Protein sequence
MESGSLLQRYRRDRKKLLAFLLSSRFVRELRTPAGPVTDFSAVDLDSLSASYVLECIKSGGVIDISTAAKKKQHESSYPIMSIKCLAKIQSRLRTSYFLLSHPDLSGSPPSRAPPPPKMVERSSSDISCSSRSLGSSVDDNIATYSDDCGPQSNGTTATPSKLVKDLKVPALGLPKLYTGLTDDDLDEAAYLILLASIAFSGVEIYSSEDKKKENGITHTVGMTSIRNEVDVQSENFEGHLNFLHAIRTQMQISAVSDACMRKRLMELAAKRNWGQINVPQVLLVLLHGVFRSDFPSEKAYMQWKFRQVNLLEEFCYSANLVASERQICESSLVKIRSTKEWDINMVRSERAKVLSGIGQVLSKLPAPSAYHLNIRLYEKLILGMFDVLDDSHPVMEVDDSLVLLKLTWSALGITPEVHSVIFGWVLFHQFVKTGEASFLDEAILELQEVASSKDDGGKEEQYLKSLSCSISCNGNEMKLSLVESAFFLISSWCDIKLQAYHLHFREKTSYFGKVVSLLSTVGVITDCNTVKLTKLDGLKAIGARKLRTYVEKSIEAAYKEAENNENSESKESIHPLALLANRLRLVVEKEITVFFPVLRQLCPDSGIIAAMLLHQYYGEKLKPFLKEVSKLSDDVRSVLPAAYSLDRDLTHLFTAASKESRLSPLLKEDLEHYPIVQIAKPIILDWMIDQLEQTSEWTGRAFKLEDWEPISFQQNLAASVIEVFRIIEETVDQYFDLNLPMDITHLQALLSIVYHSLDGYLSGLLNQLVEKNYLYPPVPPLTRFMETVATGKKKLPECHLDEHVSSKLNGLTISKLCIKLNTLGYIQKQIETLEDRIGKSWALIGGSAKHKRAPEEATVNGGLRTCSDEVNELFANTFNNIKSFIAKAISKFCDITGIRVIFWDLRDEFLSYLYHGNVDAVRLEDVLAHLDTVLNNVCGRIDDTLRDLVVLSICRASMEAFIWVMLNGGPSRAFSDSDIVLIKEDLGILKDFFVADGEGLPRTLVEKEAKFAEEILGLYSLPTETIIQLLMSSGGDISTELDPCSNNGSLHFNDSQALVRVLCHKKDTEASTFLKRKYNLPASSDYDMTPLQASTQR
Homology
BLAST of CmoCh06G007460 vs. ExPASy Swiss-Prot
Match:
Q8RX56 (Protein unc-13 homolog OS=Arabidopsis thaliana OX=3702 GN=PATROL1 PE=2 SV=1)
HSP 1 Score: 573.2 bits (1476), Expect = 6.5e-162
Identity = 389/1126 (34.55%), Postives = 595/1126 (52.84%), Query Frame = 0
Query: 6 LLQRYRRDRKKLLAFLLSSRFVRELRTPAGPVTDFSAVDLDSLSASYVLECIKSGGVIDI 65
+LQRYRRDR+KLL F+L+ ++++ P G VT VDLD +S YV+ C K GG++++
Sbjct: 9 ILQRYRRDRRKLLDFMLAGSLIKKVIMPPGAVT-LDDVDLDQVSVDYVINCAKKGGMLEL 68
Query: 66 STAAKKKQHESSYPIMSIKCLAKIQSRLRTSYFLLSHPDLSGSPPSRAPPP-PKMVERSS 125
+ A + P M+ A +FL + P+ SGSPP RAPPP P ++ SS
Sbjct: 69 AEAIRDYHDHIGLPYMNSVGTA-------DEFFLATIPESSGSPPKRAPPPIPVLISSSS 128
Query: 126 SDIS----CSSRSL-----GSSVD---------DNIATYSDDCGPQSNGTTATPSKLVKD 185
++ C S S S D D+I + DD G + D
Sbjct: 129 PMVTNPEWCESPSAPPLMRSESFDSPKAQELTVDDIDDFEDDDDLDEVGNFRISRRTAND 188
Query: 186 LKVPALGLPKLYTGLTDDDLDEAAYLILLASIAFSGVEIYSSEDKKKENGITHTVGMTSI 245
LP TG+TDDDL E A+ ILLA SG I S++KKKE + +
Sbjct: 189 AADLVPRLPSFATGITDDDLRETAFEILLACAGASGGLIVPSKEKKKEKSRSRLIKKLGR 248
Query: 246 RNEVDVQSENFEGHLNFLHAIRTQMQISAVSDACMRKRLMELAAKRNWGQINVPQVLLVL 305
++E QS++ G ++ L +R QM+IS D R+ L+ A + +++ V L L
Sbjct: 249 KSESVSQSQSSSGLVSLLEMMRGQMEISEAMDIRTRQGLLNALAGKVGKRMDSLLVPLEL 308
Query: 306 LHGVFRSDFPSEKAYMQWKFRQVNLLEEFCYSANLVA---SERQICE--SSLVKIRSTKE 365
L V R++F +KAY++W+ RQ+N+L E + +V S R+ + S L++I ++
Sbjct: 309 LCCVSRTEFSDKKAYLRWQKRQLNMLAEGLINNPVVGFGESGRKATDLKSLLLRIEESES 368
Query: 366 WDINMVRSERAKVLSGIGQV---LSKLPA-----------PSAYHLNIRLYEKLILGMFD 425
+ +RA+ L + +V L++ PA YHLN+RLYEKL+L +FD
Sbjct: 369 LPSSAGEVQRAECLKSLREVAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLLLCVFD 428
Query: 426 VLDDSHPVMEVDDSLVLLKLTWSALGITPEVHSVIFGWVLFHQFVKTGEASFLDEAILEL 485
+L+D EV++ L LLK TW LGIT +H + WVLF Q+V T E L AI +L
Sbjct: 429 ILNDGKLTEEVEEILELLKSTWRVLGITETIHYTCYAWVLFRQYVITSERGLLRHAIQQL 488
Query: 486 QEVASSKDDGGKEEQYLKSLSCSISCNGNEMKLSLVESAFFLISSWCDIKLQAYHLHFRE 545
+++ + G +E +LK+L C + NE ++S +ES I SW D +L YHLHF E
Sbjct: 489 KKIPLKEQRGPQERLHLKTLKCRVD---NE-EISFLESFLSPIRSWADKQLGDYHLHFAE 548
Query: 546 KTSYFGKVVSLLSTVGVITDCNTVKLTKLDGLKAIGA-----RKLRTYVEKSIEAAYKEA 605
+ V TV +I T +L + +A+ + ++ +YV SI+ +
Sbjct: 549 GSLVMEDTV----TVAMI----TWRLLLEESDRAMHSNSSDREQIESYVLSSIKNTFTRM 608
Query: 606 ENNENSESKESIHPLALLANRLRLVVEKEITVFFPVLRQLCPDSGIIAAMLLHQYYGEKL 665
+ + + H LALLA + +++K+ T+F P+L Q P + +A L+H+ YG KL
Sbjct: 609 SLAIDRSDRNNEHHLALLAEETKKLMKKDSTIFMPILSQRHPQAIAFSASLIHKLYGNKL 668
Query: 666 KPFLKEVSKLSDDVRSVLPAAYSLDRDLTHLFTAASKESRLSPLLKEDLEHYPIVQIAKP 725
KPFL L++D SV PAA SL++ L L T+ E P K+ L Y + ++
Sbjct: 669 KPFLDGAEHLTEDAVSVFPAADSLEQYLLELMTSVCGEDTSGPYFKK-LIPYEVESLSGT 728
Query: 726 IILDWMIDQLEQTSEWTGRAFKLEDWEPISFQQNLAASVIEVFRIIEETVDQYFDLNLPM 785
++L W+ QL + W RA+K E W+PIS QQ +S++EVFRI+EETVDQ+F L +PM
Sbjct: 729 LVLRWINSQLGRILSWVERAYKQEHWDPISPQQRYGSSIVEVFRIVEETVDQFFALKVPM 788
Query: 786 DITHLQALLSIVYHSLDGYLSGLLNQLVEKNYLYPPVPPLTRFMETVA----TGKKKLPE 845
L AL + ++ Y + ++ +L K+ L PPVP LTR+ + A K+
Sbjct: 789 RSIELSALFRGIDNAFQVYTNHVMEKLASKDDLVPPVPVLTRYKKETAIKVFVKKELFDS 848
Query: 846 CHLDEHVSSKLNGLTISKLCIKLNTLGYIQKQIETLEDRIGKSWALIGGSAKHKRAPEEA 905
HLDE S ++ + LC++LNTL Y Q+ LED + W K+ E+
Sbjct: 849 KHLDERRSINIDVPATAMLCVQLNTLHYAVSQLSKLEDSMWLRWIA-------KKPREKI 908
Query: 906 TVNGGLRTCSDEVNELFANTFNNIKSFIAKAISKFCDITGIRVIFWDLRDEFLSYLYHGN 965
+ + S N+ +F + I A+ + C+ TG ++IF DLR+ F+ LY N
Sbjct: 909 VIRKSMVEKSKSFNQ--KESFEGSRKDINAALDRICEFTGTKIIFCDLREPFIENLYKPN 968
Query: 966 VDAVRLEDVLAHLDTVLNNVCGRIDDTLRDLVVLSICRASMEAFIWVMLNGGPSRAFSDS 1025
V RLE ++ LDT L +C I + LRD +V S+ +AS++ + V+L+GG SR F S
Sbjct: 969 VSQSRLEGLIEALDTELGQLCSVIMEPLRDRIVTSLLQASLDGLLRVLLDGGASRVFHPS 1028
Query: 1026 DIVLIKEDLGILKDFFVADGEGLPRTLVEKEAKFAEEILGLYSLPTETIIQLLMSSGGDI 1085
+ L++ED+ +LK+FF++ G+GLPR +VE + ++ L+ T +I L S
Sbjct: 1029 ESKLLEEDVEVLKEFFISGGDGLPRGVVENQVARVRLVVKLHGYETRELIDDLRSRS--- 1088
BLAST of CmoCh06G007460 vs. ExPASy TrEMBL
Match:
A0A6J1FPY9 (uncharacterized protein LOC111447738 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111447738 PE=4 SV=1)
HSP 1 Score: 2131.3 bits (5521), Expect = 0.0e+00
Identity = 1091/1098 (99.36%), Postives = 1091/1098 (99.36%), Query Frame = 0
Query: 1 MESGSLLQRYRRDRKKLLAFLLSSRFVRELRTPAGPVTDFSAVDLDSLSASYVLECIKSG 60
MESGSLLQRYRRDRKKLLAFLLSSRFVRELRTPAGPVTDFSAVDLDSLSASYVLECIKSG
Sbjct: 1 MESGSLLQRYRRDRKKLLAFLLSSRFVRELRTPAGPVTDFSAVDLDSLSASYVLECIKSG 60
Query: 61 GVIDISTAAKKKQHESSYPIMSIKCLAKIQSRLRTSYFLLSHPDLSGSPPSRAPPPPKMV 120
GVIDISTAAKKKQHESSYPIM IQSRLRTSYFLLSHPDLSGSPPSRAPPPPKMV
Sbjct: 61 GVIDISTAAKKKQHESSYPIM-------IQSRLRTSYFLLSHPDLSGSPPSRAPPPPKMV 120
Query: 121 ERSSSDISCSSRSLGSSVDDNIATYSDDCGPQSNGTTATPSKLVKDLKVPALGLPKLYTG 180
ERSSSDISCSSRSLGSSVDDNIATYSDDCGPQSNGTTATPSKLVKDLKVPALGLPKLYTG
Sbjct: 121 ERSSSDISCSSRSLGSSVDDNIATYSDDCGPQSNGTTATPSKLVKDLKVPALGLPKLYTG 180
Query: 181 LTDDDLDEAAYLILLASIAFSGVEIYSSEDKKKENGITHTVGMTSIRNEVDVQSENFEGH 240
LTDDDLDEAAYLILLASIAFSGVEIYSSEDKKKENGITHTVGMTSIRNEVDVQSENFEGH
Sbjct: 181 LTDDDLDEAAYLILLASIAFSGVEIYSSEDKKKENGITHTVGMTSIRNEVDVQSENFEGH 240
Query: 241 LNFLHAIRTQMQISAVSDACMRKRLMELAAKRNWGQINVPQVLLVLLHGVFRSDFPSEKA 300
LNFLHAIRTQMQISAVSDACMRKRLMELAAKRNWGQINVPQVLLVLLHGVFRSDFPSEKA
Sbjct: 241 LNFLHAIRTQMQISAVSDACMRKRLMELAAKRNWGQINVPQVLLVLLHGVFRSDFPSEKA 300
Query: 301 YMQWKFRQVNLLEEFCYSANLVASERQICESSLVKIRSTKEWDINMVRSERAKVLSGIGQ 360
YMQWKFRQVNLLEEFCYSANLVASERQICESSLVKIRSTKEWDINMVRSERAKVLSGIGQ
Sbjct: 301 YMQWKFRQVNLLEEFCYSANLVASERQICESSLVKIRSTKEWDINMVRSERAKVLSGIGQ 360
Query: 361 VLSKLPAPSAYHLNIRLYEKLILGMFDVLDDSHPVMEVDDSLVLLKLTWSALGITPEVHS 420
VLSKLPAPSAYHLNIRLYEKLILGMFDVLDDSHPVMEVDDSLVLLKLTWSALGITPEVHS
Sbjct: 361 VLSKLPAPSAYHLNIRLYEKLILGMFDVLDDSHPVMEVDDSLVLLKLTWSALGITPEVHS 420
Query: 421 VIFGWVLFHQFVKTGEASFLDEAILELQEVASSKDDGGKEEQYLKSLSCSISCNGNEMKL 480
VIFGWVLFHQFVKTGEASFLDEAILELQEVASSKDDGGKEEQYLKSLSCSISCNGNEMKL
Sbjct: 421 VIFGWVLFHQFVKTGEASFLDEAILELQEVASSKDDGGKEEQYLKSLSCSISCNGNEMKL 480
Query: 481 SLVESAFFLISSWCDIKLQAYHLHFREKTSYFGKVVSLLSTVGVITDCNTVKLTKLDGLK 540
SLVESAFFLISSWCDIKLQAYHLHFREKTSYFGKVVSLLSTVGVITDCNTVKLTKLDGLK
Sbjct: 481 SLVESAFFLISSWCDIKLQAYHLHFREKTSYFGKVVSLLSTVGVITDCNTVKLTKLDGLK 540
Query: 541 AIGARKLRTYVEKSIEAAYKEAENNENSESKESIHPLALLANRLRLVVEKEITVFFPVLR 600
AIGARKLRTYVEKSIEAAYKEAENNENSESKESIHPLALLANRLRLVVEKEITVFFPVLR
Sbjct: 541 AIGARKLRTYVEKSIEAAYKEAENNENSESKESIHPLALLANRLRLVVEKEITVFFPVLR 600
Query: 601 QLCPDSGIIAAMLLHQYYGEKLKPFLKEVSKLSDDVRSVLPAAYSLDRDLTHLFTAASKE 660
QLCPDSGIIAAMLLHQYYGEKLKPFLKEVSKLSDDVRSVLPAAYSLDRDLTHLFTAASKE
Sbjct: 601 QLCPDSGIIAAMLLHQYYGEKLKPFLKEVSKLSDDVRSVLPAAYSLDRDLTHLFTAASKE 660
Query: 661 SRLSPLLKEDLEHYPIVQIAKPIILDWMIDQLEQTSEWTGRAFKLEDWEPISFQQNLAAS 720
SRLSPLLKEDLEHYPIVQIAKPIILDWMIDQLEQTSEWTGRAFKLEDWEPISFQQNLAAS
Sbjct: 661 SRLSPLLKEDLEHYPIVQIAKPIILDWMIDQLEQTSEWTGRAFKLEDWEPISFQQNLAAS 720
Query: 721 VIEVFRIIEETVDQYFDLNLPMDITHLQALLSIVYHSLDGYLSGLLNQLVEKNYLYPPVP 780
VIEVFRIIEETVDQYFDLNLPMDITHLQALLSIVYHSLDGYLSGLLNQLVEKNYLYPPVP
Sbjct: 721 VIEVFRIIEETVDQYFDLNLPMDITHLQALLSIVYHSLDGYLSGLLNQLVEKNYLYPPVP 780
Query: 781 PLTRFMETVATGKKKLPECHLDEHVSSKLNGLTISKLCIKLNTLGYIQKQIETLEDRIGK 840
PLTRFMETVATGKKKLPECHLDEHVSSKLNGLTISKLCIKLNTLGYIQKQIETLEDRIGK
Sbjct: 781 PLTRFMETVATGKKKLPECHLDEHVSSKLNGLTISKLCIKLNTLGYIQKQIETLEDRIGK 840
Query: 841 SWALIGGSAKHKRAPEEATVNGGLRTCSDEVNELFANTFNNIKSFIAKAISKFCDITGIR 900
SWALIGGSAKHKRAPEEATVNGGLRTCSDEVNELFANTFNNIKSFIAKAISKFCDITGIR
Sbjct: 841 SWALIGGSAKHKRAPEEATVNGGLRTCSDEVNELFANTFNNIKSFIAKAISKFCDITGIR 900
Query: 901 VIFWDLRDEFLSYLYHGNVDAVRLEDVLAHLDTVLNNVCGRIDDTLRDLVVLSICRASME 960
VIFWDLRDEFLSYLYHGNVDAVRLEDVLAHLDTVLNNVCGRIDDTLRDLVVLSICRASME
Sbjct: 901 VIFWDLRDEFLSYLYHGNVDAVRLEDVLAHLDTVLNNVCGRIDDTLRDLVVLSICRASME 960
Query: 961 AFIWVMLNGGPSRAFSDSDIVLIKEDLGILKDFFVADGEGLPRTLVEKEAKFAEEILGLY 1020
AFIWVMLNGGPSRAFSDSDIVLIKEDLGILKDFFVADGEGLPRTLVEKEAKFAEEILGLY
Sbjct: 961 AFIWVMLNGGPSRAFSDSDIVLIKEDLGILKDFFVADGEGLPRTLVEKEAKFAEEILGLY 1020
Query: 1021 SLPTETIIQLLMSSGGDISTELDPCSNNGSLHFNDSQALVRVLCHKKDTEASTFLKRKYN 1080
SLPTETIIQLLMSSGGDISTELDPCSNNGSLHFNDSQALVRVLCHKKDTEASTFLKRKYN
Sbjct: 1021 SLPTETIIQLLMSSGGDISTELDPCSNNGSLHFNDSQALVRVLCHKKDTEASTFLKRKYN 1080
Query: 1081 LPASSDYDMTPLQASTQR 1099
LPASSDYDMTPLQASTQR
Sbjct: 1081 LPASSDYDMTPLQASTQR 1091
BLAST of CmoCh06G007460 vs. ExPASy TrEMBL
Match:
A0A6J1KUH5 (uncharacterized protein LOC111498329 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111498329 PE=4 SV=1)
HSP 1 Score: 2085.8 bits (5403), Expect = 0.0e+00
Identity = 1066/1098 (97.09%), Postives = 1077/1098 (98.09%), Query Frame = 0
Query: 1 MESGSLLQRYRRDRKKLLAFLLSSRFVRELRTPAGPVTDFSAVDLDSLSASYVLECIKSG 60
ME GSLLQRY RDRK LLAFLLSSRFVRELRTPAGPVTD SA+DLDSLSASYVLECIKSG
Sbjct: 1 MEPGSLLQRYLRDRKNLLAFLLSSRFVRELRTPAGPVTDLSAIDLDSLSASYVLECIKSG 60
Query: 61 GVIDISTAAKKKQHESSYPIMSIKCLAKIQSRLRTSYFLLSHPDLSGSPPSRAPPPPKMV 120
GVIDISTA KKK+HESSYPIM IQSRLRTSYFLLSHPDLSGSPPSRAPPPPKMV
Sbjct: 61 GVIDISTATKKKRHESSYPIM-------IQSRLRTSYFLLSHPDLSGSPPSRAPPPPKMV 120
Query: 121 ERSSSDISCSSRSLGSSVDDNIATYSDDCGPQSNGTTATPSKLVKDLKVPALGLPKLYTG 180
ERSSSDISCSSRSLGSSVDDNIATYSDDCGPQSNGTTATPSKLVKDLKVPALGLPKLYTG
Sbjct: 121 ERSSSDISCSSRSLGSSVDDNIATYSDDCGPQSNGTTATPSKLVKDLKVPALGLPKLYTG 180
Query: 181 LTDDDLDEAAYLILLASIAFSGVEIYSSEDKKKENGITHTVGMTSIRNEVDVQSENFEGH 240
LTDDDLDEAAYLILLASIAFSGVEIYSSEDKKKENGI HTVGMTSIRNE+DVQSENFEGH
Sbjct: 181 LTDDDLDEAAYLILLASIAFSGVEIYSSEDKKKENGIKHTVGMTSIRNELDVQSENFEGH 240
Query: 241 LNFLHAIRTQMQISAVSDACMRKRLMELAAKRNWGQINVPQVLLVLLHGVFRSDFPSEKA 300
LNFLHAIRTQMQISA+SDACMRKRLMEL AKRNWGQINVPQVLLVLLHGVFRSDFPSEKA
Sbjct: 241 LNFLHAIRTQMQISAISDACMRKRLMELDAKRNWGQINVPQVLLVLLHGVFRSDFPSEKA 300
Query: 301 YMQWKFRQVNLLEEFCYSANLVASERQICESSLVKIRSTKEWDINMVRSERAKVLSGIGQ 360
YMQWKFRQVNLLEEFCYSANLVASERQICESSLVKIRSTKEWDINMVRSERAKVLSGIGQ
Sbjct: 301 YMQWKFRQVNLLEEFCYSANLVASERQICESSLVKIRSTKEWDINMVRSERAKVLSGIGQ 360
Query: 361 VLSKLPAPSAYHLNIRLYEKLILGMFDVLDDSHPVMEVDDSLVLLKLTWSALGITPEVHS 420
VLSKLPAPSAYHLNIRLYEKLILG+FDVLDDSHPVMEVDDSLVLLKLTWSALGITPEVHS
Sbjct: 361 VLSKLPAPSAYHLNIRLYEKLILGIFDVLDDSHPVMEVDDSLVLLKLTWSALGITPEVHS 420
Query: 421 VIFGWVLFHQFVKTGEASFLDEAILELQEVASSKDDGGKEEQYLKSLSCSISCNGNEMKL 480
VIFGWVLFHQFVKTGEASFLDEAILELQEVASSKDDGGKEEQYLKSLSCSISCNGNEMKL
Sbjct: 421 VIFGWVLFHQFVKTGEASFLDEAILELQEVASSKDDGGKEEQYLKSLSCSISCNGNEMKL 480
Query: 481 SLVESAFFLISSWCDIKLQAYHLHFREKTSYFGKVVSLLSTVGVITDCNTVKLTKLDGLK 540
SLVESAFFLISSWCDIKLQAYHLHFREKTSYFGKVVSLLSTVGV+TDCNTVKLTKLDGLK
Sbjct: 481 SLVESAFFLISSWCDIKLQAYHLHFREKTSYFGKVVSLLSTVGVVTDCNTVKLTKLDGLK 540
Query: 541 AIGARKLRTYVEKSIEAAYKEAENNENSESKESIHPLALLANRLRLVVEKEITVFFPVLR 600
AIGARKLRTYVE+SIEAAYKEAENN NSESK SIHPLALLANRLRLVVEKEITVFFPVLR
Sbjct: 541 AIGARKLRTYVERSIEAAYKEAENNANSESKASIHPLALLANRLRLVVEKEITVFFPVLR 600
Query: 601 QLCPDSGIIAAMLLHQYYGEKLKPFLKEVSKLSDDVRSVLPAAYSLDRDLTHLFTAASKE 660
QLCPDSGIIAAMLLHQYYGEKLKPFLKEVSKLSDDVRSVLPAAYSLDRDLTHLFTAASKE
Sbjct: 601 QLCPDSGIIAAMLLHQYYGEKLKPFLKEVSKLSDDVRSVLPAAYSLDRDLTHLFTAASKE 660
Query: 661 SRLSPLLKEDLEHYPIVQIAKPIILDWMIDQLEQTSEWTGRAFKLEDWEPISFQQNLAAS 720
SRLSPLLKEDLEHYPIVQIAKPIILDWMIDQLEQTSEWTGRAFKLEDWEPISFQQNLAAS
Sbjct: 661 SRLSPLLKEDLEHYPIVQIAKPIILDWMIDQLEQTSEWTGRAFKLEDWEPISFQQNLAAS 720
Query: 721 VIEVFRIIEETVDQYFDLNLPMDITHLQALLSIVYHSLDGYLSGLLNQLVEKNYLYPPVP 780
VIEVFRIIEETVDQYFDLNLPMDITHLQALLSIVYHSLDGYLSGLLNQLVEKNYLYPPVP
Sbjct: 721 VIEVFRIIEETVDQYFDLNLPMDITHLQALLSIVYHSLDGYLSGLLNQLVEKNYLYPPVP 780
Query: 781 PLTRFMETVATGKKKLPECHLDEHVSSKLNGLTISKLCIKLNTLGYIQKQIETLEDRIGK 840
PLTRFMETVATGKKKLPECHLDEHVSSKLNGLTISKLCIKLNTLGYIQKQIETLEDRIGK
Sbjct: 781 PLTRFMETVATGKKKLPECHLDEHVSSKLNGLTISKLCIKLNTLGYIQKQIETLEDRIGK 840
Query: 841 SWALIGGSAKHKRAPEEATVNGGLRTCSDEVNELFANTFNNIKSFIAKAISKFCDITGIR 900
SWALIG SAKHKRA EEA VNGGLRTCSDEVNELFANTFNNIKSFIAKAISKFCDITGIR
Sbjct: 841 SWALIGESAKHKRAQEEAIVNGGLRTCSDEVNELFANTFNNIKSFIAKAISKFCDITGIR 900
Query: 901 VIFWDLRDEFLSYLYHGNVDAVRLEDVLAHLDTVLNNVCGRIDDTLRDLVVLSICRASME 960
VIFWDLRD F+SYLY GNV+AVRLEDVLAHLDTVLNNVCGRIDDTLRDLVVLSICRASME
Sbjct: 901 VIFWDLRDGFISYLYRGNVEAVRLEDVLAHLDTVLNNVCGRIDDTLRDLVVLSICRASME 960
Query: 961 AFIWVMLNGGPSRAFSDSDIVLIKEDLGILKDFFVADGEGLPRTLVEKEAKFAEEILGLY 1020
AFIWVMLNGGPSRAFSDSDIVLIKEDLGILKDFFVADGEGLPRTLVEKEAKFAEEILGLY
Sbjct: 961 AFIWVMLNGGPSRAFSDSDIVLIKEDLGILKDFFVADGEGLPRTLVEKEAKFAEEILGLY 1020
Query: 1021 SLPTETIIQLLMSSGGDISTELDPCSNNGSLHFNDSQALVRVLCHKKDTEASTFLKRKYN 1080
SLPTETIIQLLMSSGGDISTELDPCSNNG+LHFNDSQALVRVLCHKKDTEASTFLKRK+N
Sbjct: 1021 SLPTETIIQLLMSSGGDISTELDPCSNNGNLHFNDSQALVRVLCHKKDTEASTFLKRKFN 1080
Query: 1081 LPASSDYDMTPLQASTQR 1099
LPASSDYDMTPLQASTQR
Sbjct: 1081 LPASSDYDMTPLQASTQR 1091
BLAST of CmoCh06G007460 vs. ExPASy TrEMBL
Match:
A0A6J1FVS9 (uncharacterized protein LOC111447738 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111447738 PE=4 SV=1)
HSP 1 Score: 1988.8 bits (5151), Expect = 0.0e+00
Identity = 1011/1012 (99.90%), Postives = 1011/1012 (99.90%), Query Frame = 0
Query: 87 AKIQSRLRTSYFLLSHPDLSGSPPSRAPPPPKMVERSSSDISCSSRSLGSSVDDNIATYS 146
A IQSRLRTSYFLLSHPDLSGSPPSRAPPPPKMVERSSSDISCSSRSLGSSVDDNIATYS
Sbjct: 9 ASIQSRLRTSYFLLSHPDLSGSPPSRAPPPPKMVERSSSDISCSSRSLGSSVDDNIATYS 68
Query: 147 DDCGPQSNGTTATPSKLVKDLKVPALGLPKLYTGLTDDDLDEAAYLILLASIAFSGVEIY 206
DDCGPQSNGTTATPSKLVKDLKVPALGLPKLYTGLTDDDLDEAAYLILLASIAFSGVEIY
Sbjct: 69 DDCGPQSNGTTATPSKLVKDLKVPALGLPKLYTGLTDDDLDEAAYLILLASIAFSGVEIY 128
Query: 207 SSEDKKKENGITHTVGMTSIRNEVDVQSENFEGHLNFLHAIRTQMQISAVSDACMRKRLM 266
SSEDKKKENGITHTVGMTSIRNEVDVQSENFEGHLNFLHAIRTQMQISAVSDACMRKRLM
Sbjct: 129 SSEDKKKENGITHTVGMTSIRNEVDVQSENFEGHLNFLHAIRTQMQISAVSDACMRKRLM 188
Query: 267 ELAAKRNWGQINVPQVLLVLLHGVFRSDFPSEKAYMQWKFRQVNLLEEFCYSANLVASER 326
ELAAKRNWGQINVPQVLLVLLHGVFRSDFPSEKAYMQWKFRQVNLLEEFCYSANLVASER
Sbjct: 189 ELAAKRNWGQINVPQVLLVLLHGVFRSDFPSEKAYMQWKFRQVNLLEEFCYSANLVASER 248
Query: 327 QICESSLVKIRSTKEWDINMVRSERAKVLSGIGQVLSKLPAPSAYHLNIRLYEKLILGMF 386
QICESSLVKIRSTKEWDINMVRSERAKVLSGIGQVLSKLPAPSAYHLNIRLYEKLILGMF
Sbjct: 249 QICESSLVKIRSTKEWDINMVRSERAKVLSGIGQVLSKLPAPSAYHLNIRLYEKLILGMF 308
Query: 387 DVLDDSHPVMEVDDSLVLLKLTWSALGITPEVHSVIFGWVLFHQFVKTGEASFLDEAILE 446
DVLDDSHPVMEVDDSLVLLKLTWSALGITPEVHSVIFGWVLFHQFVKTGEASFLDEAILE
Sbjct: 309 DVLDDSHPVMEVDDSLVLLKLTWSALGITPEVHSVIFGWVLFHQFVKTGEASFLDEAILE 368
Query: 447 LQEVASSKDDGGKEEQYLKSLSCSISCNGNEMKLSLVESAFFLISSWCDIKLQAYHLHFR 506
LQEVASSKDDGGKEEQYLKSLSCSISCNGNEMKLSLVESAFFLISSWCDIKLQAYHLHFR
Sbjct: 369 LQEVASSKDDGGKEEQYLKSLSCSISCNGNEMKLSLVESAFFLISSWCDIKLQAYHLHFR 428
Query: 507 EKTSYFGKVVSLLSTVGVITDCNTVKLTKLDGLKAIGARKLRTYVEKSIEAAYKEAENNE 566
EKTSYFGKVVSLLSTVGVITDCNTVKLTKLDGLKAIGARKLRTYVEKSIEAAYKEAENNE
Sbjct: 429 EKTSYFGKVVSLLSTVGVITDCNTVKLTKLDGLKAIGARKLRTYVEKSIEAAYKEAENNE 488
Query: 567 NSESKESIHPLALLANRLRLVVEKEITVFFPVLRQLCPDSGIIAAMLLHQYYGEKLKPFL 626
NSESKESIHPLALLANRLRLVVEKEITVFFPVLRQLCPDSGIIAAMLLHQYYGEKLKPFL
Sbjct: 489 NSESKESIHPLALLANRLRLVVEKEITVFFPVLRQLCPDSGIIAAMLLHQYYGEKLKPFL 548
Query: 627 KEVSKLSDDVRSVLPAAYSLDRDLTHLFTAASKESRLSPLLKEDLEHYPIVQIAKPIILD 686
KEVSKLSDDVRSVLPAAYSLDRDLTHLFTAASKESRLSPLLKEDLEHYPIVQIAKPIILD
Sbjct: 549 KEVSKLSDDVRSVLPAAYSLDRDLTHLFTAASKESRLSPLLKEDLEHYPIVQIAKPIILD 608
Query: 687 WMIDQLEQTSEWTGRAFKLEDWEPISFQQNLAASVIEVFRIIEETVDQYFDLNLPMDITH 746
WMIDQLEQTSEWTGRAFKLEDWEPISFQQNLAASVIEVFRIIEETVDQYFDLNLPMDITH
Sbjct: 609 WMIDQLEQTSEWTGRAFKLEDWEPISFQQNLAASVIEVFRIIEETVDQYFDLNLPMDITH 668
Query: 747 LQALLSIVYHSLDGYLSGLLNQLVEKNYLYPPVPPLTRFMETVATGKKKLPECHLDEHVS 806
LQALLSIVYHSLDGYLSGLLNQLVEKNYLYPPVPPLTRFMETVATGKKKLPECHLDEHVS
Sbjct: 669 LQALLSIVYHSLDGYLSGLLNQLVEKNYLYPPVPPLTRFMETVATGKKKLPECHLDEHVS 728
Query: 807 SKLNGLTISKLCIKLNTLGYIQKQIETLEDRIGKSWALIGGSAKHKRAPEEATVNGGLRT 866
SKLNGLTISKLCIKLNTLGYIQKQIETLEDRIGKSWALIGGSAKHKRAPEEATVNGGLRT
Sbjct: 729 SKLNGLTISKLCIKLNTLGYIQKQIETLEDRIGKSWALIGGSAKHKRAPEEATVNGGLRT 788
Query: 867 CSDEVNELFANTFNNIKSFIAKAISKFCDITGIRVIFWDLRDEFLSYLYHGNVDAVRLED 926
CSDEVNELFANTFNNIKSFIAKAISKFCDITGIRVIFWDLRDEFLSYLYHGNVDAVRLED
Sbjct: 789 CSDEVNELFANTFNNIKSFIAKAISKFCDITGIRVIFWDLRDEFLSYLYHGNVDAVRLED 848
Query: 927 VLAHLDTVLNNVCGRIDDTLRDLVVLSICRASMEAFIWVMLNGGPSRAFSDSDIVLIKED 986
VLAHLDTVLNNVCGRIDDTLRDLVVLSICRASMEAFIWVMLNGGPSRAFSDSDIVLIKED
Sbjct: 849 VLAHLDTVLNNVCGRIDDTLRDLVVLSICRASMEAFIWVMLNGGPSRAFSDSDIVLIKED 908
Query: 987 LGILKDFFVADGEGLPRTLVEKEAKFAEEILGLYSLPTETIIQLLMSSGGDISTELDPCS 1046
LGILKDFFVADGEGLPRTLVEKEAKFAEEILGLYSLPTETIIQLLMSSGGDISTELDPCS
Sbjct: 909 LGILKDFFVADGEGLPRTLVEKEAKFAEEILGLYSLPTETIIQLLMSSGGDISTELDPCS 968
Query: 1047 NNGSLHFNDSQALVRVLCHKKDTEASTFLKRKYNLPASSDYDMTPLQASTQR 1099
NNGSLHFNDSQALVRVLCHKKDTEASTFLKRKYNLPASSDYDMTPLQASTQR
Sbjct: 969 NNGSLHFNDSQALVRVLCHKKDTEASTFLKRKYNLPASSDYDMTPLQASTQR 1020
BLAST of CmoCh06G007460 vs. ExPASy TrEMBL
Match:
A0A6J1L1Q4 (uncharacterized protein LOC111498329 isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC111498329 PE=4 SV=1)
HSP 1 Score: 1954.9 bits (5063), Expect = 0.0e+00
Identity = 992/1011 (98.12%), Postives = 1002/1011 (99.11%), Query Frame = 0
Query: 88 KIQSRLRTSYFLLSHPDLSGSPPSRAPPPPKMVERSSSDISCSSRSLGSSVDDNIATYSD 147
+IQSRLRTSYFLLSHPDLSGSPPSRAPPPPKMVERSSSDISCSSRSLGSSVDDNIATYSD
Sbjct: 9 RIQSRLRTSYFLLSHPDLSGSPPSRAPPPPKMVERSSSDISCSSRSLGSSVDDNIATYSD 68
Query: 148 DCGPQSNGTTATPSKLVKDLKVPALGLPKLYTGLTDDDLDEAAYLILLASIAFSGVEIYS 207
DCGPQSNGTTATPSKLVKDLKVPALGLPKLYTGLTDDDLDEAAYLILLASIAFSGVEIYS
Sbjct: 69 DCGPQSNGTTATPSKLVKDLKVPALGLPKLYTGLTDDDLDEAAYLILLASIAFSGVEIYS 128
Query: 208 SEDKKKENGITHTVGMTSIRNEVDVQSENFEGHLNFLHAIRTQMQISAVSDACMRKRLME 267
SEDKKKENGI HTVGMTSIRNE+DVQSENFEGHLNFLHAIRTQMQISA+SDACMRKRLME
Sbjct: 129 SEDKKKENGIKHTVGMTSIRNELDVQSENFEGHLNFLHAIRTQMQISAISDACMRKRLME 188
Query: 268 LAAKRNWGQINVPQVLLVLLHGVFRSDFPSEKAYMQWKFRQVNLLEEFCYSANLVASERQ 327
L AKRNWGQINVPQVLLVLLHGVFRSDFPSEKAYMQWKFRQVNLLEEFCYSANLVASERQ
Sbjct: 189 LDAKRNWGQINVPQVLLVLLHGVFRSDFPSEKAYMQWKFRQVNLLEEFCYSANLVASERQ 248
Query: 328 ICESSLVKIRSTKEWDINMVRSERAKVLSGIGQVLSKLPAPSAYHLNIRLYEKLILGMFD 387
ICESSLVKIRSTKEWDINMVRSERAKVLSGIGQVLSKLPAPSAYHLNIRLYEKLILG+FD
Sbjct: 249 ICESSLVKIRSTKEWDINMVRSERAKVLSGIGQVLSKLPAPSAYHLNIRLYEKLILGIFD 308
Query: 388 VLDDSHPVMEVDDSLVLLKLTWSALGITPEVHSVIFGWVLFHQFVKTGEASFLDEAILEL 447
VLDDSHPVMEVDDSLVLLKLTWSALGITPEVHSVIFGWVLFHQFVKTGEASFLDEAILEL
Sbjct: 309 VLDDSHPVMEVDDSLVLLKLTWSALGITPEVHSVIFGWVLFHQFVKTGEASFLDEAILEL 368
Query: 448 QEVASSKDDGGKEEQYLKSLSCSISCNGNEMKLSLVESAFFLISSWCDIKLQAYHLHFRE 507
QEVASSKDDGGKEEQYLKSLSCSISCNGNEMKLSLVESAFFLISSWCDIKLQAYHLHFRE
Sbjct: 369 QEVASSKDDGGKEEQYLKSLSCSISCNGNEMKLSLVESAFFLISSWCDIKLQAYHLHFRE 428
Query: 508 KTSYFGKVVSLLSTVGVITDCNTVKLTKLDGLKAIGARKLRTYVEKSIEAAYKEAENNEN 567
KTSYFGKVVSLLSTVGV+TDCNTVKLTKLDGLKAIGARKLRTYVE+SIEAAYKEAENN N
Sbjct: 429 KTSYFGKVVSLLSTVGVVTDCNTVKLTKLDGLKAIGARKLRTYVERSIEAAYKEAENNAN 488
Query: 568 SESKESIHPLALLANRLRLVVEKEITVFFPVLRQLCPDSGIIAAMLLHQYYGEKLKPFLK 627
SESK SIHPLALLANRLRLVVEKEITVFFPVLRQLCPDSGIIAAMLLHQYYGEKLKPFLK
Sbjct: 489 SESKASIHPLALLANRLRLVVEKEITVFFPVLRQLCPDSGIIAAMLLHQYYGEKLKPFLK 548
Query: 628 EVSKLSDDVRSVLPAAYSLDRDLTHLFTAASKESRLSPLLKEDLEHYPIVQIAKPIILDW 687
EVSKLSDDVRSVLPAAYSLDRDLTHLFTAASKESRLSPLLKEDLEHYPIVQIAKPIILDW
Sbjct: 549 EVSKLSDDVRSVLPAAYSLDRDLTHLFTAASKESRLSPLLKEDLEHYPIVQIAKPIILDW 608
Query: 688 MIDQLEQTSEWTGRAFKLEDWEPISFQQNLAASVIEVFRIIEETVDQYFDLNLPMDITHL 747
MIDQLEQTSEWTGRAFKLEDWEPISFQQNLAASVIEVFRIIEETVDQYFDLNLPMDITHL
Sbjct: 609 MIDQLEQTSEWTGRAFKLEDWEPISFQQNLAASVIEVFRIIEETVDQYFDLNLPMDITHL 668
Query: 748 QALLSIVYHSLDGYLSGLLNQLVEKNYLYPPVPPLTRFMETVATGKKKLPECHLDEHVSS 807
QALLSIVYHSLDGYLSGLLNQLVEKNYLYPPVPPLTRFMETVATGKKKLPECHLDEHVSS
Sbjct: 669 QALLSIVYHSLDGYLSGLLNQLVEKNYLYPPVPPLTRFMETVATGKKKLPECHLDEHVSS 728
Query: 808 KLNGLTISKLCIKLNTLGYIQKQIETLEDRIGKSWALIGGSAKHKRAPEEATVNGGLRTC 867
KLNGLTISKLCIKLNTLGYIQKQIETLEDRIGKSWALIG SAKHKRA EEA VNGGLRTC
Sbjct: 729 KLNGLTISKLCIKLNTLGYIQKQIETLEDRIGKSWALIGESAKHKRAQEEAIVNGGLRTC 788
Query: 868 SDEVNELFANTFNNIKSFIAKAISKFCDITGIRVIFWDLRDEFLSYLYHGNVDAVRLEDV 927
SDEVNELFANTFNNIKSFIAKAISKFCDITGIRVIFWDLRD F+SYLY GNV+AVRLEDV
Sbjct: 789 SDEVNELFANTFNNIKSFIAKAISKFCDITGIRVIFWDLRDGFISYLYRGNVEAVRLEDV 848
Query: 928 LAHLDTVLNNVCGRIDDTLRDLVVLSICRASMEAFIWVMLNGGPSRAFSDSDIVLIKEDL 987
LAHLDTVLNNVCGRIDDTLRDLVVLSICRASMEAFIWVMLNGGPSRAFSDSDIVLIKEDL
Sbjct: 849 LAHLDTVLNNVCGRIDDTLRDLVVLSICRASMEAFIWVMLNGGPSRAFSDSDIVLIKEDL 908
Query: 988 GILKDFFVADGEGLPRTLVEKEAKFAEEILGLYSLPTETIIQLLMSSGGDISTELDPCSN 1047
GILKDFFVADGEGLPRTLVEKEAKFAEEILGLYSLPTETIIQLLMSSGGDISTELDPCSN
Sbjct: 909 GILKDFFVADGEGLPRTLVEKEAKFAEEILGLYSLPTETIIQLLMSSGGDISTELDPCSN 968
Query: 1048 NGSLHFNDSQALVRVLCHKKDTEASTFLKRKYNLPASSDYDMTPLQASTQR 1099
NG+LHFNDSQALVRVLCHKKDTEASTFLKRK+NLPASSDYDMTPLQASTQR
Sbjct: 969 NGNLHFNDSQALVRVLCHKKDTEASTFLKRKFNLPASSDYDMTPLQASTQR 1019
BLAST of CmoCh06G007460 vs. ExPASy TrEMBL
Match:
A0A6J1KSN8 (uncharacterized protein LOC111498329 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111498329 PE=4 SV=1)
HSP 1 Score: 1953.7 bits (5060), Expect = 0.0e+00
Identity = 993/1020 (97.35%), Postives = 1006/1020 (98.63%), Query Frame = 0
Query: 80 IMSIKCLA-KIQSRLRTSYFLLSHPDLSGSPPSRAPPPPKMVERSSSDISCSSRSLGSSV 139
++ + C + IQSRLRTSYFLLSHPDLSGSPPSRAPPPPKMVERSSSDISCSSRSLGSSV
Sbjct: 1 MLMLPCFSVSIQSRLRTSYFLLSHPDLSGSPPSRAPPPPKMVERSSSDISCSSRSLGSSV 60
Query: 140 DDNIATYSDDCGPQSNGTTATPSKLVKDLKVPALGLPKLYTGLTDDDLDEAAYLILLASI 199
DDNIATYSDDCGPQSNGTTATPSKLVKDLKVPALGLPKLYTGLTDDDLDEAAYLILLASI
Sbjct: 61 DDNIATYSDDCGPQSNGTTATPSKLVKDLKVPALGLPKLYTGLTDDDLDEAAYLILLASI 120
Query: 200 AFSGVEIYSSEDKKKENGITHTVGMTSIRNEVDVQSENFEGHLNFLHAIRTQMQISAVSD 259
AFSGVEIYSSEDKKKENGI HTVGMTSIRNE+DVQSENFEGHLNFLHAIRTQMQISA+SD
Sbjct: 121 AFSGVEIYSSEDKKKENGIKHTVGMTSIRNELDVQSENFEGHLNFLHAIRTQMQISAISD 180
Query: 260 ACMRKRLMELAAKRNWGQINVPQVLLVLLHGVFRSDFPSEKAYMQWKFRQVNLLEEFCYS 319
ACMRKRLMEL AKRNWGQINVPQVLLVLLHGVFRSDFPSEKAYMQWKFRQVNLLEEFCYS
Sbjct: 181 ACMRKRLMELDAKRNWGQINVPQVLLVLLHGVFRSDFPSEKAYMQWKFRQVNLLEEFCYS 240
Query: 320 ANLVASERQICESSLVKIRSTKEWDINMVRSERAKVLSGIGQVLSKLPAPSAYHLNIRLY 379
ANLVASERQICESSLVKIRSTKEWDINMVRSERAKVLSGIGQVLSKLPAPSAYHLNIRLY
Sbjct: 241 ANLVASERQICESSLVKIRSTKEWDINMVRSERAKVLSGIGQVLSKLPAPSAYHLNIRLY 300
Query: 380 EKLILGMFDVLDDSHPVMEVDDSLVLLKLTWSALGITPEVHSVIFGWVLFHQFVKTGEAS 439
EKLILG+FDVLDDSHPVMEVDDSLVLLKLTWSALGITPEVHSVIFGWVLFHQFVKTGEAS
Sbjct: 301 EKLILGIFDVLDDSHPVMEVDDSLVLLKLTWSALGITPEVHSVIFGWVLFHQFVKTGEAS 360
Query: 440 FLDEAILELQEVASSKDDGGKEEQYLKSLSCSISCNGNEMKLSLVESAFFLISSWCDIKL 499
FLDEAILELQEVASSKDDGGKEEQYLKSLSCSISCNGNEMKLSLVESAFFLISSWCDIKL
Sbjct: 361 FLDEAILELQEVASSKDDGGKEEQYLKSLSCSISCNGNEMKLSLVESAFFLISSWCDIKL 420
Query: 500 QAYHLHFREKTSYFGKVVSLLSTVGVITDCNTVKLTKLDGLKAIGARKLRTYVEKSIEAA 559
QAYHLHFREKTSYFGKVVSLLSTVGV+TDCNTVKLTKLDGLKAIGARKLRTYVE+SIEAA
Sbjct: 421 QAYHLHFREKTSYFGKVVSLLSTVGVVTDCNTVKLTKLDGLKAIGARKLRTYVERSIEAA 480
Query: 560 YKEAENNENSESKESIHPLALLANRLRLVVEKEITVFFPVLRQLCPDSGIIAAMLLHQYY 619
YKEAENN NSESK SIHPLALLANRLRLVVEKEITVFFPVLRQLCPDSGIIAAMLLHQYY
Sbjct: 481 YKEAENNANSESKASIHPLALLANRLRLVVEKEITVFFPVLRQLCPDSGIIAAMLLHQYY 540
Query: 620 GEKLKPFLKEVSKLSDDVRSVLPAAYSLDRDLTHLFTAASKESRLSPLLKEDLEHYPIVQ 679
GEKLKPFLKEVSKLSDDVRSVLPAAYSLDRDLTHLFTAASKESRLSPLLKEDLEHYPIVQ
Sbjct: 541 GEKLKPFLKEVSKLSDDVRSVLPAAYSLDRDLTHLFTAASKESRLSPLLKEDLEHYPIVQ 600
Query: 680 IAKPIILDWMIDQLEQTSEWTGRAFKLEDWEPISFQQNLAASVIEVFRIIEETVDQYFDL 739
IAKPIILDWMIDQLEQTSEWTGRAFKLEDWEPISFQQNLAASVIEVFRIIEETVDQYFDL
Sbjct: 601 IAKPIILDWMIDQLEQTSEWTGRAFKLEDWEPISFQQNLAASVIEVFRIIEETVDQYFDL 660
Query: 740 NLPMDITHLQALLSIVYHSLDGYLSGLLNQLVEKNYLYPPVPPLTRFMETVATGKKKLPE 799
NLPMDITHLQALLSIVYHSLDGYLSGLLNQLVEKNYLYPPVPPLTRFMETVATGKKKLPE
Sbjct: 661 NLPMDITHLQALLSIVYHSLDGYLSGLLNQLVEKNYLYPPVPPLTRFMETVATGKKKLPE 720
Query: 800 CHLDEHVSSKLNGLTISKLCIKLNTLGYIQKQIETLEDRIGKSWALIGGSAKHKRAPEEA 859
CHLDEHVSSKLNGLTISKLCIKLNTLGYIQKQIETLEDRIGKSWALIG SAKHKRA EEA
Sbjct: 721 CHLDEHVSSKLNGLTISKLCIKLNTLGYIQKQIETLEDRIGKSWALIGESAKHKRAQEEA 780
Query: 860 TVNGGLRTCSDEVNELFANTFNNIKSFIAKAISKFCDITGIRVIFWDLRDEFLSYLYHGN 919
VNGGLRTCSDEVNELFANTFNNIKSFIAKAISKFCDITGIRVIFWDLRD F+SYLY GN
Sbjct: 781 IVNGGLRTCSDEVNELFANTFNNIKSFIAKAISKFCDITGIRVIFWDLRDGFISYLYRGN 840
Query: 920 VDAVRLEDVLAHLDTVLNNVCGRIDDTLRDLVVLSICRASMEAFIWVMLNGGPSRAFSDS 979
V+AVRLEDVLAHLDTVLNNVCGRIDDTLRDLVVLSICRASMEAFIWVMLNGGPSRAFSDS
Sbjct: 841 VEAVRLEDVLAHLDTVLNNVCGRIDDTLRDLVVLSICRASMEAFIWVMLNGGPSRAFSDS 900
Query: 980 DIVLIKEDLGILKDFFVADGEGLPRTLVEKEAKFAEEILGLYSLPTETIIQLLMSSGGDI 1039
DIVLIKEDLGILKDFFVADGEGLPRTLVEKEAKFAEEILGLYSLPTETIIQLLMSSGGDI
Sbjct: 901 DIVLIKEDLGILKDFFVADGEGLPRTLVEKEAKFAEEILGLYSLPTETIIQLLMSSGGDI 960
Query: 1040 STELDPCSNNGSLHFNDSQALVRVLCHKKDTEASTFLKRKYNLPASSDYDMTPLQASTQR 1099
STELDPCSNNG+LHFNDSQALVRVLCHKKDTEASTFLKRK+NLPASSDYDMTPLQASTQR
Sbjct: 961 STELDPCSNNGNLHFNDSQALVRVLCHKKDTEASTFLKRKFNLPASSDYDMTPLQASTQR 1020
BLAST of CmoCh06G007460 vs. TAIR 10
Match:
AT4G11670.1 (Protein of unknown function (DUF810) )
HSP 1 Score: 858.6 bits (2217), Expect = 5.5e-249
Identity = 515/1111 (46.35%), Postives = 697/1111 (62.74%), Query Frame = 0
Query: 5 SLLQRYRRDRKKLLAFLLSSRFVRELRTPAGPVTDFSAVDLDSLSASYVLECIKSGGVID 64
SLLQRYR DR+KL+ FL+SS V+ELR+P+G T S DLD+LSA YVL+C+KSGGV+D
Sbjct: 3 SLLQRYRNDRRKLMEFLMSSGLVKELRSPSGSPTSLSPADLDALSADYVLDCVKSGGVVD 62
Query: 65 ISTAAKKKQHESSYPIMSIKCLAKIQSRLRTSYFLLSHPDLSGSPPSRAPPPPKMVERSS 124
+S +K +SSYP+ I S SYFL+S PDL+GSPP R PPPP +E+SS
Sbjct: 63 VSKGREKYNFDSSYPV-------TIHSESGDSYFLVSSPDLAGSPPHRMPPPPVNIEKSS 122
Query: 125 SDISCSSRSLGSSVDDNIATYSDDCGPQSNGTTATPS-KLVKDLKVPALGLPKLYTGLTD 184
++ + SR + SS N + D+ + TP K VK +K+ LGLP L TGL+D
Sbjct: 123 NNGADMSRHMDSS---NTPSARDNYVFKEE----TPDIKPVKPIKIIPLGLPPLRTGLSD 182
Query: 185 DDLDEAAYLILLASIAFS-----GVEIYSSEDKKKENGITHTVGMTSIRNEVDVQSENFE 244
DDL EAAY +++AS+ S VE Y + +K E S R + ++ ++ +
Sbjct: 183 DDLREAAYELMIASMLLSSFLTNSVEAYPTHRRKIEK---------SSRLMLSLKRKD-K 242
Query: 245 GHLNFLHAIRTQMQISAVSDACMRKRLMELAAKRNWGQINVPQVLLVLLHGVFRSDFPSE 304
HL T +IS+ D C+R+ L++LA R QI++PQ+ L LL G+F+SDFP+E
Sbjct: 243 PHLQ-PQISNTHSEISSKMDTCIRRNLVQLATLRTGEQIDLPQLALGLLVGIFKSDFPNE 302
Query: 305 KAYMQWKFRQVNLLEE-FCYSANLVASERQICESSLVKIRSTKEWDINMVRSERAKVLSG 364
K YM+WK RQ NLLEE C+S +L +ER L IR +KEWD+ + S R +VLS
Sbjct: 303 KLYMKWKTRQANLLEEVLCFSPSLEKNERATMRKCLATIRDSKEWDVVVSASLRIEVLSS 362
Query: 365 IGQVLSKLPA--------------PSAYHLNIRLYEKLILGMFDVLDDSHPVMEVDDSLV 424
I QV SKL + + YHLNIRLYEKL+ G+FD LD+ + + L
Sbjct: 363 IRQVASKLSSLPGRCGIEEETYYWTAIYHLNIRLYEKLLFGVFDTLDEGQVIEDASSMLF 422
Query: 425 LLKLTWSALGITPEVHSVIFGWVLFHQFVKTGEASFLDEAILELQEVASSKDDGGKEEQY 484
+K WS LGIT +HS I+GWVLF QFV TGE S L I ELQ+V S++ KE+ Y
Sbjct: 423 HMKSIWSTLGITENLHSAIYGWVLFQQFVCTGEPSLLGSTIQELQKVTSAESGNPKEDLY 482
Query: 485 LKSLSCSISCNGNEMKLSLVESAFFLISSWCDIKLQAYHLHFREKTSYFGKVVSLLSTVG 544
L L CS G ++ L LV++ +S+WCD KLQ YHLHF +K FG +V L STVG
Sbjct: 483 LSHLVCSRQTIGTDIHLGLVKAILTSVSAWCDDKLQDYHLHFGKKPRDFGMLVRLASTVG 542
Query: 545 V-ITDCNTVKLTKLDGLKAIGARKLRTYVEKSIEAAYKEAENNENSESK-ESIHPLALLA 604
+ DC +L KLD L + K+++YV+ SI+ A A + +S E H LALLA
Sbjct: 543 LPPADCTRTELIKLDTLSDDVSDKIQSYVQNSIKGACARAAHFAYVKSHGERTHALALLA 602
Query: 605 NRLRLVVEKEITVFFPVLRQLCPDSGIIAAMLLHQYYGEKLKPFLKEVSKLSDDVRSVLP 664
N L ++ + EI F PV + P+ +I+AMLLH++YGE+L PFL+ VS LS DVR V+P
Sbjct: 603 NELTVIAKVEINEFVPVFSKWLPECMMISAMLLHRFYGERLTPFLEGVSSLSGDVRKVVP 662
Query: 665 AAYSLDRDLTHLFTAASKESRLSPLLKEDLEHYPIVQIAKPIILDWMIDQLEQTSEWTGR 724
AAY L +LT L+ SK P + L++Y I + KP++LDW+I Q + +WT R
Sbjct: 663 AAYMLQEELTQLYNCHSKSKLRKPYFHK-LKNYEIEKAVKPVMLDWLISQHDHILQWTRR 722
Query: 725 AFKLEDWEPISFQQNLAASVIEVFRIIEETVDQYFDLNLPMDITHLQALLSIVYHSLDGY 784
AF++E+WEP+S QQ AAS++E+FRIIEETV Q F L+LP+DITHLQALLS++YHSLD Y
Sbjct: 723 AFEIEEWEPLSVQQRHAASIVEIFRIIEETVSQLFGLHLPVDITHLQALLSLIYHSLDTY 782
Query: 785 LSGLLNQLVEKNYLYPPVPPLTRFMETV--ATGKKKLPECHLDEHVSSKLNGLTISKLCI 844
L + +QLV+K +LYP PPLTRF E V +K L D + KL+ LTI KLCI
Sbjct: 783 LQRVFDQLVDKKFLYPSAPPLTRFTENVMPVMKRKSLEFSEPDNKIVKKLDELTIPKLCI 842
Query: 845 KLNTLGYIQKQIETLEDRIGKSWALIGGSAKHKRAPE--EATVNGGLRTCSDEVNELFAN 904
LNTL YIQKQI E I KS L+ S + E EA V L T S+ V+ELFA
Sbjct: 843 ILNTLCYIQKQISATEVGIRKSLTLVEASLNKRSEIETDEAEVENSL-THSEAVDELFAT 902
Query: 905 TFNNIKSFIAKAISKFCDITGIRVIFWDLRDEFLSYLYHGNVDAVRLEDVLAHLDTVLNN 964
T+++++ A I+K D+ ++ W + FL Y + L D + VL+
Sbjct: 903 TYDSLRDTNANCITKTRDL----IVLWQ-KYAFLFYW-------LILMDEKCNAQ-VLDT 962
Query: 965 VCGRIDDTLRDLVVLSICRASMEAFIWVMLNGGPSRAFSDSDIVLIKEDLGILKDFFVAD 1024
VC + RD+VVLSICR+++EA++ V+L+GGP+RAFSDSDI L++EDL ILK+FF+AD
Sbjct: 963 VCSLSYEDSRDMVVLSICRSALEAYVRVLLDGGPTRAFSDSDITLMEEDLSILKEFFIAD 1022
Query: 1025 GEGLPRTLVEKEAKFAEEILGLYSLPTETIIQLLMSSGGDISTELDPCSNNGSLHFNDSQ 1084
GEGLPR+LVE+EAK A+EIL LYSL ++ +IQ+LM++ I+ + ++ D+Q
Sbjct: 1023 GEGLPRSLVEQEAKQAKEILDLYSLESDMLIQMLMTASELINMGV----SSEQRRLEDAQ 1069
Query: 1085 ALVRVLCHKKDTEASTFLKRKYNLPASSDYD 1089
LVRVLCHKKD AS FLKR+Y LP S++Y+
Sbjct: 1083 TLVRVLCHKKDRNASKFLKRQYELPMSTEYE 1069
BLAST of CmoCh06G007460 vs. TAIR 10
Match:
AT5G06970.1 (Protein of unknown function (DUF810) )
HSP 1 Score: 573.2 bits (1476), Expect = 4.6e-163
Identity = 389/1126 (34.55%), Postives = 595/1126 (52.84%), Query Frame = 0
Query: 6 LLQRYRRDRKKLLAFLLSSRFVRELRTPAGPVTDFSAVDLDSLSASYVLECIKSGGVIDI 65
+LQRYRRDR+KLL F+L+ ++++ P G VT VDLD +S YV+ C K GG++++
Sbjct: 9 ILQRYRRDRRKLLDFMLAGSLIKKVIMPPGAVT-LDDVDLDQVSVDYVINCAKKGGMLEL 68
Query: 66 STAAKKKQHESSYPIMSIKCLAKIQSRLRTSYFLLSHPDLSGSPPSRAPPP-PKMVERSS 125
+ A + P M+ A +FL + P+ SGSPP RAPPP P ++ SS
Sbjct: 69 AEAIRDYHDHIGLPYMNSVGTA-------DEFFLATIPESSGSPPKRAPPPIPVLISSSS 128
Query: 126 SDIS----CSSRSL-----GSSVD---------DNIATYSDDCGPQSNGTTATPSKLVKD 185
++ C S S S D D+I + DD G + D
Sbjct: 129 PMVTNPEWCESPSAPPLMRSESFDSPKAQELTVDDIDDFEDDDDLDEVGNFRISRRTAND 188
Query: 186 LKVPALGLPKLYTGLTDDDLDEAAYLILLASIAFSGVEIYSSEDKKKENGITHTVGMTSI 245
LP TG+TDDDL E A+ ILLA SG I S++KKKE + +
Sbjct: 189 AADLVPRLPSFATGITDDDLRETAFEILLACAGASGGLIVPSKEKKKEKSRSRLIKKLGR 248
Query: 246 RNEVDVQSENFEGHLNFLHAIRTQMQISAVSDACMRKRLMELAAKRNWGQINVPQVLLVL 305
++E QS++ G ++ L +R QM+IS D R+ L+ A + +++ V L L
Sbjct: 249 KSESVSQSQSSSGLVSLLEMMRGQMEISEAMDIRTRQGLLNALAGKVGKRMDSLLVPLEL 308
Query: 306 LHGVFRSDFPSEKAYMQWKFRQVNLLEEFCYSANLVA---SERQICE--SSLVKIRSTKE 365
L V R++F +KAY++W+ RQ+N+L E + +V S R+ + S L++I ++
Sbjct: 309 LCCVSRTEFSDKKAYLRWQKRQLNMLAEGLINNPVVGFGESGRKATDLKSLLLRIEESES 368
Query: 366 WDINMVRSERAKVLSGIGQV---LSKLPA-----------PSAYHLNIRLYEKLILGMFD 425
+ +RA+ L + +V L++ PA YHLN+RLYEKL+L +FD
Sbjct: 369 LPSSAGEVQRAECLKSLREVAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLLLCVFD 428
Query: 426 VLDDSHPVMEVDDSLVLLKLTWSALGITPEVHSVIFGWVLFHQFVKTGEASFLDEAILEL 485
+L+D EV++ L LLK TW LGIT +H + WVLF Q+V T E L AI +L
Sbjct: 429 ILNDGKLTEEVEEILELLKSTWRVLGITETIHYTCYAWVLFRQYVITSERGLLRHAIQQL 488
Query: 486 QEVASSKDDGGKEEQYLKSLSCSISCNGNEMKLSLVESAFFLISSWCDIKLQAYHLHFRE 545
+++ + G +E +LK+L C + NE ++S +ES I SW D +L YHLHF E
Sbjct: 489 KKIPLKEQRGPQERLHLKTLKCRVD---NE-EISFLESFLSPIRSWADKQLGDYHLHFAE 548
Query: 546 KTSYFGKVVSLLSTVGVITDCNTVKLTKLDGLKAIGA-----RKLRTYVEKSIEAAYKEA 605
+ V TV +I T +L + +A+ + ++ +YV SI+ +
Sbjct: 549 GSLVMEDTV----TVAMI----TWRLLLEESDRAMHSNSSDREQIESYVLSSIKNTFTRM 608
Query: 606 ENNENSESKESIHPLALLANRLRLVVEKEITVFFPVLRQLCPDSGIIAAMLLHQYYGEKL 665
+ + + H LALLA + +++K+ T+F P+L Q P + +A L+H+ YG KL
Sbjct: 609 SLAIDRSDRNNEHHLALLAEETKKLMKKDSTIFMPILSQRHPQAIAFSASLIHKLYGNKL 668
Query: 666 KPFLKEVSKLSDDVRSVLPAAYSLDRDLTHLFTAASKESRLSPLLKEDLEHYPIVQIAKP 725
KPFL L++D SV PAA SL++ L L T+ E P K+ L Y + ++
Sbjct: 669 KPFLDGAEHLTEDAVSVFPAADSLEQYLLELMTSVCGEDTSGPYFKK-LIPYEVESLSGT 728
Query: 726 IILDWMIDQLEQTSEWTGRAFKLEDWEPISFQQNLAASVIEVFRIIEETVDQYFDLNLPM 785
++L W+ QL + W RA+K E W+PIS QQ +S++EVFRI+EETVDQ+F L +PM
Sbjct: 729 LVLRWINSQLGRILSWVERAYKQEHWDPISPQQRYGSSIVEVFRIVEETVDQFFALKVPM 788
Query: 786 DITHLQALLSIVYHSLDGYLSGLLNQLVEKNYLYPPVPPLTRFMETVA----TGKKKLPE 845
L AL + ++ Y + ++ +L K+ L PPVP LTR+ + A K+
Sbjct: 789 RSIELSALFRGIDNAFQVYTNHVMEKLASKDDLVPPVPVLTRYKKETAIKVFVKKELFDS 848
Query: 846 CHLDEHVSSKLNGLTISKLCIKLNTLGYIQKQIETLEDRIGKSWALIGGSAKHKRAPEEA 905
HLDE S ++ + LC++LNTL Y Q+ LED + W K+ E+
Sbjct: 849 KHLDERRSINIDVPATAMLCVQLNTLHYAVSQLSKLEDSMWLRWIA-------KKPREKI 908
Query: 906 TVNGGLRTCSDEVNELFANTFNNIKSFIAKAISKFCDITGIRVIFWDLRDEFLSYLYHGN 965
+ + S N+ +F + I A+ + C+ TG ++IF DLR+ F+ LY N
Sbjct: 909 VIRKSMVEKSKSFNQ--KESFEGSRKDINAALDRICEFTGTKIIFCDLREPFIENLYKPN 968
Query: 966 VDAVRLEDVLAHLDTVLNNVCGRIDDTLRDLVVLSICRASMEAFIWVMLNGGPSRAFSDS 1025
V RLE ++ LDT L +C I + LRD +V S+ +AS++ + V+L+GG SR F S
Sbjct: 969 VSQSRLEGLIEALDTELGQLCSVIMEPLRDRIVTSLLQASLDGLLRVLLDGGASRVFHPS 1028
Query: 1026 DIVLIKEDLGILKDFFVADGEGLPRTLVEKEAKFAEEILGLYSLPTETIIQLLMSSGGDI 1085
+ L++ED+ +LK+FF++ G+GLPR +VE + ++ L+ T +I L S
Sbjct: 1029 ESKLLEEDVEVLKEFFISGGDGLPRGVVENQVARVRLVVKLHGYETRELIDDLRSRS--- 1088
BLAST of CmoCh06G007460 vs. TAIR 10
Match:
AT2G20010.2 (Protein of unknown function (DUF810) )
HSP 1 Score: 290.0 bits (741), Expect = 7.8e-78
Identity = 243/883 (27.52%), Postives = 413/883 (46.77%), Query Frame = 0
Query: 247 IRTQMQISAVSDACMRKRLMELAAKRNWGQINVPQVLLVLLHGVFRSDFPSEKAYMQWKF 306
+R QM+IS D+ +R+ L+ +A+ + ++ + + L LL + SDFP ++ Y W+
Sbjct: 115 VRVQMRISEQIDSRIRRALLRIASGQLGRRVEMMVLPLELLQQLKASDFPDQEEYESWQR 174
Query: 307 RQVNLLE---------EFCYSANLVASERQICESSLVKIRSTKEWDINMVRSERAKVLS- 366
R + LLE S V +QI S L + T + ++ R+ V+S
Sbjct: 175 RNLKLLEAGLILYPCVPLSKSDKSVQQLKQIIRSGLERPLDTGK-ITGETQNLRSLVMSL 234
Query: 367 -------GIGQVLSKLPAPSAYHLNIRLYEKLILGMFDVLDDSHPVMEVDDSLVLLKLTW 426
GIG + LN+R+Y+ L+ FDV D+ V EVD+ L L+K TW
Sbjct: 235 ASRQNNNGIGS--ETCHWADGFPLNLRIYQMLLESCFDVNDELLIVEEVDEVLELIKKTW 294
Query: 427 SALGITPEVHSVIFGWVLFHQFVKTGEAS-----FLDEAILELQEVASSKDDGGKEEQYL 486
LGI +H+V F WVL +++V TG+ ILE++ A +D +Y
Sbjct: 295 PVLGINQMIHNVCFLWVLVNRYVSTGQVENDLLVAAHNLILEIENDAMETND----PEYS 354
Query: 487 KSLSCSISCNGNEMKLSLVESAFFLISSWCDIKLQAYHLHFR-EKTSYFGKVVSLLSTVG 546
K LS +S L+ W + +L AYH F + VSL V
Sbjct: 355 KILSSVLS----------------LVMDWGEKRLLAYHDTFNIDNVETLETTVSLGILVA 414
Query: 547 VI--TDCNTVKLTKLDGLKAIGARKLRTYVEKSIEAAYKE----AENNENSESKES---I 606
+ D ++ K + + G ++ TY+ S+ A+++ E+++ S+S++S +
Sbjct: 415 KVLGEDISSEYRRKKKHVDS-GRDRVDTYIRSSLRMAFQQTKRMVEHSKKSKSRQSTNNL 474
Query: 607 HPLALLANRLRLVVEKEITVFFPVLRQLCPDSGIIAAMLLHQYYGEKLKPFLKEVSKLSD 666
LA+LA + + E +F P+L+ P + +AA LH YG +LK F+ +++L+
Sbjct: 475 PALAILAEDIGHLAFNEKAIFSPILKNWHPLAAGVAAATLHSCYGTELKKFVSGITELTP 534
Query: 667 DVRSVLPAAYSLDRDLTHLFT--AASKESRLSPLLKEDLEHYPIVQIAKPIILDWMIDQL 726
D VL AA L++DL + A E +++E + + + ++ W+ ++
Sbjct: 535 DAIRVLTAADKLEKDLVQIAVQDAVDSEDGGKSVIRE-MPPFEAEVVIGNLVKSWIKIRV 594
Query: 727 EQTSEWTGRAFKLEDWEPISFQQNLAASVIEVFRIIEETVDQYFDLNLPMDITHLQALLS 786
++ EW R + E W P S + +A S ++V R+++ET++ +F L + + L L S
Sbjct: 595 DRLKEWIDRNLQQEVWNPRSNKLGIAPSAVDVLRMVDETLEAFFLLPILLHPVLLPELTS 654
Query: 787 IVYHSLDGYLSGLLNQLVEKNYLYPPVPPLTRFMETVATG-----KKKLPECHLDEHVSS 846
+ + Y+S + +N P +P LTR TV + KKK S
Sbjct: 655 GLDKCMQHYVSKAKSSCGSRNTFLPVLPALTRC--TVGSRLHGVFKKKEKPMVASHRRKS 714
Query: 847 KL----NGLTISKLCIKLNTLGYIQKQIETLEDRIGKSWALIGGSAKHKRAPEEATVNGG 906
+L + I + C ++NTL YI+ +IE+ G R PE
Sbjct: 715 QLGTGNDSAEILQFCCRINTLQYIRTEIES------------SGRKTLNRLPESEV---- 774
Query: 907 LRTCSDEVNELFANTFNNIKSFIAKAISKFCDITGIRVIFWDLRDEFLSYLYHGNVDAVR 966
D ++F + S+ +K I + + T +++F DL + LY G V + R
Sbjct: 775 --AALDAKGKIFEQSI----SYCSKGIQQLSEATAYKIVFHDLSNVLWDGLYLGEVPSSR 834
Query: 967 LEDVLAHLDTVLNNVCGRIDDTLRDLVVLSICRASMEAFIWVMLNGGPSRAFSDSDIVLI 1026
+E L L+ L + + D +R V+ I RAS + F+ V+L GGPSR F+ D +
Sbjct: 835 IEPFLQELERCLEIISSSVHDRVRTRVISDIMRASFDGFLLVLLAGGPSRGFTIQDSAAV 894
Query: 1027 KEDLGILKDFFVADGEGLPRTLVEKEAKFAEEILGLYSLPTETIIQ----LLMSSGGDIS 1083
+ED L D F ++G+GLP L+EK + + IL L T+++I+ + + + G
Sbjct: 895 EEDFKFLCDLFWSNGDGLPLDLIEKVSTTVKSILPLLRTDTDSLIERFKAVCLENHGSDR 948
BLAST of CmoCh06G007460 vs. TAIR 10
Match:
AT2G20010.1 (Protein of unknown function (DUF810) )
HSP 1 Score: 285.0 bits (728), Expect = 2.5e-76
Identity = 241/879 (27.42%), Postives = 410/879 (46.64%), Query Frame = 0
Query: 251 MQISAVSDACMRKRLMELAAKRNWGQINVPQVLLVLLHGVFRSDFPSEKAYMQWKFRQVN 310
M+IS D+ +R+ L+ +A+ + ++ + + L LL + SDFP ++ Y W+ R +
Sbjct: 1 MRISEQIDSRIRRALLRIASGQLGRRVEMMVLPLELLQQLKASDFPDQEEYESWQRRNLK 60
Query: 311 LLE---------EFCYSANLVASERQICESSLVKIRSTKEWDINMVRSERAKVLS----- 370
LLE S V +QI S L + T + ++ R+ V+S
Sbjct: 61 LLEAGLILYPCVPLSKSDKSVQQLKQIIRSGLERPLDTGK-ITGETQNLRSLVMSLASRQ 120
Query: 371 ---GIGQVLSKLPAPSAYHLNIRLYEKLILGMFDVLDDSHPVMEVDDSLVLLKLTWSALG 430
GIG + LN+R+Y+ L+ FDV D+ V EVD+ L L+K TW LG
Sbjct: 121 NNNGIGS--ETCHWADGFPLNLRIYQMLLESCFDVNDELLIVEEVDEVLELIKKTWPVLG 180
Query: 431 ITPEVHSVIFGWVLFHQFVKTGEAS-----FLDEAILELQEVASSKDDGGKEEQYLKSLS 490
I +H+V F WVL +++V TG+ ILE++ A +D +Y K LS
Sbjct: 181 INQMIHNVCFLWVLVNRYVSTGQVENDLLVAAHNLILEIENDAMETND----PEYSKILS 240
Query: 491 CSISCNGNEMKLSLVESAFFLISSWCDIKLQAYHLHFR-EKTSYFGKVVSLLSTVGVI-- 550
+S L+ W + +L AYH F + VSL V +
Sbjct: 241 SVLS----------------LVMDWGEKRLLAYHDTFNIDNVETLETTVSLGILVAKVLG 300
Query: 551 TDCNTVKLTKLDGLKAIGARKLRTYVEKSIEAAYKE----AENNENSESKES---IHPLA 610
D ++ K + + G ++ TY+ S+ A+++ E+++ S+S++S + LA
Sbjct: 301 EDISSEYRRKKKHVDS-GRDRVDTYIRSSLRMAFQQTKRMVEHSKKSKSRQSTNNLPALA 360
Query: 611 LLANRLRLVVEKEITVFFPVLRQLCPDSGIIAAMLLHQYYGEKLKPFLKEVSKLSDDVRS 670
+LA + + E +F P+L+ P + +AA LH YG +LK F+ +++L+ D
Sbjct: 361 ILAEDIGHLAFNEKAIFSPILKNWHPLAAGVAAATLHSCYGTELKKFVSGITELTPDAIR 420
Query: 671 VLPAAYSLDRDLTHLFT--AASKESRLSPLLKEDLEHYPIVQIAKPIILDWMIDQLEQTS 730
VL AA L++DL + A E +++E + + + ++ W+ ++++
Sbjct: 421 VLTAADKLEKDLVQIAVQDAVDSEDGGKSVIRE-MPPFEAEVVIGNLVKSWIKIRVDRLK 480
Query: 731 EWTGRAFKLEDWEPISFQQNLAASVIEVFRIIEETVDQYFDLNLPMDITHLQALLSIVYH 790
EW R + E W P S + +A S ++V R+++ET++ +F L + + L L S +
Sbjct: 481 EWIDRNLQQEVWNPRSNKLGIAPSAVDVLRMVDETLEAFFLLPILLHPVLLPELTSGLDK 540
Query: 791 SLDGYLSGLLNQLVEKNYLYPPVPPLTRFMETVATG-----KKKLPECHLDEHVSSKL-- 850
+ Y+S + +N P +P LTR TV + KKK S+L
Sbjct: 541 CMQHYVSKAKSSCGSRNTFLPVLPALTRC--TVGSRLHGVFKKKEKPMVASHRRKSQLGT 600
Query: 851 --NGLTISKLCIKLNTLGYIQKQIETLEDRIGKSWALIGGSAKHKRAPEEATVNGGLRTC 910
+ I + C ++NTL YI+ +IE+ G R PE
Sbjct: 601 GNDSAEILQFCCRINTLQYIRTEIES------------SGRKTLNRLPESEV------AA 660
Query: 911 SDEVNELFANTFNNIKSFIAKAISKFCDITGIRVIFWDLRDEFLSYLYHGNVDAVRLEDV 970
D ++F + S+ +K I + + T +++F DL + LY G V + R+E
Sbjct: 661 LDAKGKIFEQSI----SYCSKGIQQLSEATAYKIVFHDLSNVLWDGLYLGEVPSSRIEPF 720
Query: 971 LAHLDTVLNNVCGRIDDTLRDLVVLSICRASMEAFIWVMLNGGPSRAFSDSDIVLIKEDL 1030
L L+ L + + D +R V+ I RAS + F+ V+L GGPSR F+ D ++ED
Sbjct: 721 LQELERCLEIISSSVHDRVRTRVISDIMRASFDGFLLVLLAGGPSRGFTIQDSAAVEEDF 780
Query: 1031 GILKDFFVADGEGLPRTLVEKEAKFAEEILGLYSLPTETIIQ----LLMSSGGDISTELD 1083
L D F ++G+GLP L+EK + + IL L T+++I+ + + + G +L
Sbjct: 781 KFLCDLFWSNGDGLPLDLIEKVSTTVKSILPLLRTDTDSLIERFKAVCLENHGSDRGKLP 830
BLAST of CmoCh06G007460 vs. TAIR 10
Match:
AT2G25800.1 (Protein of unknown function (DUF810) )
HSP 1 Score: 280.4 bits (716), Expect = 6.2e-75
Identity = 227/876 (25.91%), Postives = 402/876 (45.89%), Query Frame = 0
Query: 247 IRTQMQISAVSDACMRKRLMELAAKRNWGQINVPQVLLVLLHGVFRSDFPSEKAYMQWKF 306
+R QM++S D+ +R+ + +AA + +I + L LL + SDF ++ Y W
Sbjct: 149 MRIQMRVSEAVDSRVRRAFLRIAASQVGRKIESVVLPLELLQQLKSSDFTDQQEYDAWLK 208
Query: 307 RQVNLLEEFCY--------SANLVASERQICESSLVKIRSTKEWDINMVRSERAKVLS-- 366
R + +LE N RQI +L + T + ++S R+ V+S
Sbjct: 209 RSLKVLEAGLLLHPRVPLDKTNSSQRLRQIIHGALDRPLETGR-NNEQMQSLRSAVMSLA 268
Query: 367 --GIGQVLSKLPAPSAYHLNIRLYEKLILGMFDVLDDSHPVMEVDDSLVLLKLTWSALGI 426
G N+RLYE L+ FD D + V EVDD + +K TW LGI
Sbjct: 269 TRSDGSFSDSCHWADGSPFNLRLYELLLEACFDSNDATSMVEEVDDLMEHIKKTWVILGI 328
Query: 427 TPEVHSVIFGWVLFHQFVKTG--EASFLDEAILELQEVASSKDDGGKEEQYLKSLSCSIS 486
+H++ F W+LF ++V TG E L +L EVA K+ +Y + LS ++S
Sbjct: 329 NQMLHNLCFTWLLFSRYVVTGQVEMDLLHACDSQLAEVAKDAKT-TKDPEYSQVLSSTLS 388
Query: 487 CNGNEMKLSLVESAFFLISSWCDIKLQAYHLHF-REKTSYFGKVVSLLSTVG--VITDCN 546
I W + +L AYH F R +VSL + ++ D +
Sbjct: 389 A----------------ILGWAEKRLLAYHDTFDRGNIHTMEGIVSLGVSAARILVEDIS 448
Query: 547 TVKLTKLDGLKAIGARKLRTYVEKSIEAAYKEAENNENS------ESKESIHPLALLANR 606
+ G + ++ TY+ S+ ++ + +S K + LA+LA
Sbjct: 449 NEYRRRRKGEVDVARTRIETYIRSSLRTSFAQRMEKADSSRRASRNQKNPLPVLAILAKD 508
Query: 607 LRLVVEKEITVFFPVLRQLCPDSGIIAAMLLHQYYGEKLKPFLKEVSKLSDDVRSVLPAA 666
+ + +E +F P+L++ P + +A LH YG ++K F+ +S+L+ D +L AA
Sbjct: 509 IGELAIQEKRMFSPILKRWHPFAAGVAVATLHVCYGNEIKQFIAGISELTPDAVQILRAA 568
Query: 667 YSLDRDLTHLFTAASKES-RLSPLLKEDLEHYPIVQIAKPIILDWMIDQLEQTSEWTGRA 726
L++DL + S +S + ++ + + ++ DW+ ++++ EW R
Sbjct: 569 DKLEKDLVQIAVEDSVDSDDGGKAIIREMPPFEAETVIANLVKDWIKARIDRLKEWVDRN 628
Query: 727 FKLEDWEPI-SFQQNLAASVIEVFRIIEETVDQYFDLNLPMDITHLQALLSIVYHSLDGY 786
+ E W+P+ + + A S EV RI +ET++ +F L +PM L L+ + L Y
Sbjct: 629 LQQEVWKPLENLEGGYAQSAAEVLRITDETLEAFFQLPIPMHPAVLPDLIIGLDKYLQYY 688
Query: 787 LSGLLNQLVEKNYLYPPVPPLTRFMETVATG-------KKKLPECHLDEHVSSKLNG--- 846
+S + + P +P LTR TG K+K P E S +NG
Sbjct: 689 VSKAKSGCGSRTTYMPTMPALTR----CTTGSKFQWKKKEKTPTTQKRESQVSVMNGENS 748
Query: 847 LTISKLCIKLNTLGYIQKQIETLEDRIGKSWALIGGSAKHKRAPEEATVNGGLRTCSDEV 906
++++C+++N+L I+ +++ +E R+ H R E A +D+
Sbjct: 749 FGVTQICVRINSLHKIRSELDVVEKRV----------ITHLRNCESAH--------TDDF 808
Query: 907 NELFANTFNNIKSFIAKAISKFCDITGIRVIFWDLRDEFLSYLYHGNVDAVRLEDVLAHL 966
+ F + + + + + +V+F DL LY G++ + R++ L L
Sbjct: 809 SNGLEKKFELTPAACIEGVQQLSESLAYKVVFHDLSHTLWDGLYIGDLSSSRIDPFLKEL 868
Query: 967 DTVLNNVCGRIDDTLRDLVVLSICRASMEAFIWVMLNGGPSRAFSDSDIVLIKEDLGILK 1026
+ L + + + +R ++ I RAS++ F+ V+L GGPSRAF+ D +++ED +K
Sbjct: 869 EQNLTVIAETVHERVRTRIITDIMRASLDGFLLVLLAGGPSRAFTRQDSQIMEEDFKSMK 928
Query: 1027 DFFVADGEGLPRTLVEKEAKFAEEILGLYSLPTETIIQ-----LLMSSGGDISTELDPCS 1083
D F A+G+GL L++K + +L L+S T+++I+ L + G + L
Sbjct: 929 DMFWANGDGLAMDLIDKFSTTVRGVLPLFSTDTDSLIERFKGTTLEAYGSSAKSRLPLPP 984
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q8RX56 | 6.5e-162 | 34.55 | Protein unc-13 homolog OS=Arabidopsis thaliana OX=3702 GN=PATROL1 PE=2 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1FPY9 | 0.0e+00 | 99.36 | uncharacterized protein LOC111447738 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1KUH5 | 0.0e+00 | 97.09 | uncharacterized protein LOC111498329 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1FVS9 | 0.0e+00 | 99.90 | uncharacterized protein LOC111447738 isoform X2 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1L1Q4 | 0.0e+00 | 98.12 | uncharacterized protein LOC111498329 isoform X3 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1KSN8 | 0.0e+00 | 97.35 | uncharacterized protein LOC111498329 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... | [more] |