Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, polypeptide, CDS
GGTCTGGCAACAGAAGGAAAAGGCCTTCCTCGTTGCCCTTCTAATGGCGATCCGACCCAATCCGAACAAACCCATCTTCTGTCTCTTCTAATTCTCCTTCCCTTCTCGATCCTTCATTATCTGCAGTTACTGTGTTGATTAAAGAATCTTGAAAATTGTTAAACGTCAATGATTTTGATTGGGGCCTGCGATATCGGATATGAATCTCTCACTTTCATCTTCCTCTTCTTCTTCACTTCCTTCCGCTTCTGCATCTTCTTCTTCTTCCTCCCCCTCTTCGTCTTCCACTACTTCATGGTTCTCCGGCATAGTTCGCGGCCGTCCCGATAGGTCTTCGAGCGTGAAAATGTCTGGCAGCTCTGCCTCTGGCTTTGTTGCTGGGGATCCCCCTGGTCCTGTTGTGAGGAAGAATCATTTTCGTGGGTTGCTCTTCAAGTACGGTCCCAAGCCAATTCAGGTCCGTGGATCTTTAATTGAGAATTTTCTGTTCTTTGGACCTGTTTAATTTGACTTAGGATTGGTTTGTCATAGGGATCTTAATCTTGAATATTCCTGTTGGTGTTTGCTGCTTTAGTAGAATAAACCCATTTGAGAGGAAGTTCAAAGGATTATTCAGAAACTTCCTGTGATTAAGCTTCGTTATTCCTCATGTAATGCAATTGTTTGGAGGAATTAGGGAACATCTTCTTGGACTTTTAAGTAATACCTTATTAGTGACTGTTTCTATGTTCCATTTGCTTCAGAATCTCCCACTGAGTTTAAGGAATGTTAACTACAACAAATCACAAGCTTCGAAGAGTTATAGTTAGATGTGCTTTGCAGATGTATGGGGTCCTTTGCTATGGTTGGAGAGAGGAATGAAACACTCCTTATAATAAGGGTGTGTAAACTCTCTGTAGCATACACGTTTTAAAACCTTGAGGGAAGTCTGGAAGGAAAATCTAGAAGAGGACAATATCTGCTAGTGGTGGGCTTGGACTGTTACAAATGGTATCAGAGCCAGGCTCCGAATAGTGTGCTAGCGAGGATGCTAGGCCTCCAAAGAGGGTGGATTGTGAGATCCCACGTTGGTTGGAGTGAGGAATGAAACATTCCTTATAATAAGGGCGTGGAAACCTCTCTCTAGCAGACGCGTTTTGAGGGGGAAGCCCGAAAGAGAAAGCTCAAGAGGATAGTATCTGCTAGTGGTGGGCTTGGGCTGTTACAGATGGTTTTAGAGCCAGGTTCCGAATGGTGTGCCAGCGAGGACACTGGCCTCCGAGGGGGGTGGATTATGAGATCCCATAGCATTCCTTATAATAAGGGTGCGAAAACCTCTCTTCAACGCGGGAATCCTGAAAGGGAAAGCCCAAAGAGGATAATATCTGCTAGCGGTGGGTTTAGGCTGTTACATATCATGACCAAAATAAACTCTATATTCATGTAATGACATATTTCTTTCTCACGTGATGCCTGGTTAATCGTGTCCTCTATTATTCAGGTTGCATTTAAGACGGGGGATTACAAGCAGCAAGTCATATTCATTGGTGGATTGACTGATGGCTTTATGGCAACAGAGTATGTTTTCATCGAGGGGTTTTTCCCTGATTGGACCTTTCATTTGCTTGATTTGTGGGTCTAAATTAAAATGACAGGGACTGTTCCCCATTTCCTTTCAGTTCTTATCATCAAAATACATACCAAAGATCTTGATGTGTGTTCACTGGTGGATTTTCAAGAAATTAGAGAAATCGATTTTCATTGTTGTATGGTTCGAAACTCTGCAGTTCCTGACATTTTCTTCTTGATTTGTAGATACTTGGAACCTCTTGCAATTGCTTTGGATAAAGAAAAATGGTCACTTGTTCAGATTCTTCTATCATCATCATACAGTGGATATGGTACCTCCAGCTTGCAACAAGTAAGTTGGGTTGGTAACATCACCATGGTTTCTTGAACGAAGAAGTATACCATCTTTTAGTTGGCAAACAGTAATATCCTGTTTTAACAAAAGGATTTTG
ATTTGATTTAGGATGCCAAGGAGCTTGATCAGCTAGTAAGCTATCTGATCAATAAAGAAGACTCCGAGGGTGTCGTGTTACTTGGACATAGTACTGGCTGTCAGGTAATTACTGAATGTTCTTGAACTATGAAGCTTTGATCAACAGTTGGAAATTTCATCTATGTTCTTTGGACATTGTTGCAGGACATAGTCTATTATATGCGTACAAATGCAGCTTGCTCTCGAGCAGTTCGTGGTGCCATTTTGCAGGTAATAACTTATCCTTGAGTGTTTCATTACTGGAAGATTCACATCTCAGTACTGGTGGTGAGTTATGAATCTCGCATTACATCTTTGTTGTAGCGTTACGTAATTGTCTAGTAGAATTAGCTAAGGTTTTTGCAACTTGTAAGCTATCTTGGATGCCATGGTTCTCCAAAATGAAGAAAAAAAGTTATTTTCCTCTTGGCGTTTGAACACTTGTAGTCAATTTCTTTTAGAGCACATGATTTTATGTGTAGTTTTTAACTAATAATCATGATTCTTGTTTGTTTGGTCAGGCTCCGGTTAGTGATCGAGAGTATAGAGCAACTCTTCCTGAAACAGCAGCCATGATCGACTTGGCTTCGACTATGATAAACGAAGGCAGAGGATTAGATCTTATGCCAAGGGAGGCAGACCCGTCTCCGATAACCGCCACTCGGTGAGTATAAGAGCACATCATCTTTATTCGTTGGTAATCTTGAAAGCATGAAAGTTGAACATGTTACTATCTTACCGAACAGAGGATTCACAATTTGAAAACCAGTGGGAAATCATGTAAATATCTTTTGATTTGGCTGTGTCTGTAGTATCCTTCTTTATTTCTTTATTTCTTTAAACGTGTGTGAGATCCCACATCGGTCGGAGAAGAGAACGAAACATTCCTTATAAGGGTGTGGAAACCTCTCCCTAGCGTGTATTTTAAAACCAAGCGGGGAGGCCCGAAAGGGAAAGCTCAAAGAGGACAATATTTGTTAGCAGTGGGCTCGGGCTGTTCCAAATGGTATTAGAGCTAGTCACTAGGCGATGTGCCAGTGAGGACGTTGGGCCTCCAAGGAGGGTGGATTGTAAGGTCCCACATCATTTGGAGAGGGGAACGAAACATTCCTTATAAGGGTGTGGAAACTTCTCCCTAGCAGACACATTTTAAAACCTTGCGGGGAAGCCCAGGAGGTAAAGTCCAAAGAGAACAATATCTACTAGTGGTGGGCTTGGACTGTTACAGTGTGGATGATTTTTGGTACGTTTAATTAATTTTTCTCGAGATTGCAGTTTTTAATGCTACGATTAGCTATTTGCAGGTATTATTCGCTTTGCTCATACATGGGGGATGATGACATGTTCAGCTCCGACCTTAGCGATGACCAGTTGAGAATGAGACTTGGGCACATGGCTAACACACCGTGTCAGGTCTGCATGCACACTTCTCTCTTTCTGGTTTATCTGAATGGAGCAATGCTTCTTAAACCTTACTTAAAAGACTTGAATAAAAATAAGTTTAATTTTTGGATCGATGAAAGCTACTGCTTCTCTTTCGGACCGAAGAATATAAAAGCTCAAGCCATGTGAGATCCCACATTGGTTGTAGAGGAGAACGAAACATTCCTTATAAGAGTGTGAAAACATTTCCTTAACCAACGTGTTTTAAAACCGTGAGGCTGACGCCGATACGTAACGGGCCAAGCCAGACAATATCTACTAGCGGTGGACTTGGACTATTGCAAATGGTCAGAGCCAGACATTAAGCGGTGTGCGAGCGAGGACGCTAATCCCTATGGAGGTGGATTGTGAGATCCCACATCAGTTGGAGAGGAGAATGAAACATTCCTTATTAAGAGTGTGGAAACCTCTCCCCTAACAGACGCGTTTTAAAACTGTGAGGCTGACAGCGATACGTAACAGACCAAAGTAGACAATATCTGTTAGTAATAGGCTTGAGCTGTTACAAATAGTATCAGAGCCAGACACTAAGCGGTG
TGCCAGCGAGGACGCTGGACCCCTGGGGGGAGATTGGAGGTCCCACATCGGTTGGAGAAGAAAACGAAACATTCCTTATAAGGGTGTGGAAACCTCTCACTAACAAACCTCTCACATTGCTTGCAGGTTATCTATTCCATGGGTGACGAATATGTGCCGGAATACGTGGATAAGGAGGCATTGGTTGATAGGTAAACTCAATAACTCTTGTTACTCAATTGCTATCCCATAGTCCACCTGCCTCAAACAATTAACATTCTTGATGGGGCTGGTGTTACTCACCTCTGTTTCACTATACATTCAATGATTTAGAGAGTAGTAGCCACAACCGAGCGGGAGAGAGCGAGCGAGCGATCGATCGATCATGGTGCTTGACACCTAACGATATTGTTTTTGTTATGAGGTAATGCAGATTGTGCAAAGCAATGGGAGGTGCAGAGAAAGTTGAGATTCAACATGGGAATCATTCTCTGTCAAACAGAGTTAATGAAGCAGTTGATGCCATAATTGACTTCGTGAAAAGGGAGGGTCCTAAGGGGTGGGATGATCCATGGCATTAGTACTGTTCATTCTTCTGCTGCGTGCTCCCTTCTTCTCTTCATTCATTTCATTTGATCATATCAGATTGTGAGGAAGGGTTATTTAAATGGGTATCAGATCTCTTGTCTAGACTGTGTTATCTGTGGTAGCTTTCATTGTTATTTGCAGACAAAGAGACAGAAAGGGTATGATTGTTTGTTCATTAGATTTCCTTCAATTGGAATTCTTCTTGTGTTCTCCATAACTTCGTGGTAATTATTTACAAGAGTTAGCATTGATTTTGATTAATCCCCAACATCGAGATGACATAGTTAATGTAACCGTTCAAGTCCATCACTAACACATATCATCTTTTTTGGGCTTTCCTTCCGAGCTTCTCTTCAAAGTTTCCATATTTTTGTAAGGAATGTTTCGTTCCCTCTCTAGCCAATGTGGGATCGTACAATCCACCCTCCTTGGGGGCTAACATCCTCGCTGACACACTACTCAATCTCTTGCTCTGATACCATTTTGTAACCGTCTAAGTCCAGATATAAAACGTGTCCGTTACTTCTAGACTTTACTTCTAGACTTTCCCTCAAGGATTTAAAACGTGTCCGTTAGGGGAGATTTTTGTATCTTTATAAGGAATACTTTCACTTCCCTCTCCAACCAATATGAGATCTCATAGTTAGTTTAATTTCTCATTAATAATTTTATGTTACATTTTAAGAATTTTGAAAAAAAAAGAGTTAAAATTAGTTTTTATAATTTTTAAAATAAAAAATTTATAAGATTTTGTATTTGAAAATTTAGGAAAATTTTAATAAAATTATTGTTAATTACTTTATGTAATTTTACTTACTAAAATTGACCGATTTACATTTTTACTAAATAATCCGCAGCGGGTCGGGTACATATAAAATGGTTCCTCATGGAGTCCGGAACCCGACAAGAAACATGTCGCGTGGGAAGCACGTGCCACAGCTAGAATCCAGGAATCCTCATAATGCAAAACAATGAAATGACGAAAAAGCCCCTACTGAGCTCAACTCCAATCCTGGCTGACAGGAAAGAACGTCGACATTGTCGGTGCTCCGCGTTGCTTGACAAAACGGTAATTTTCATGAATCTCACAAGAAGGGTAAAACCGTCCATTCATAAAGTTGTCTTCCGCTCAAGCTCCGCCCTCGCCTTTCCCTTTGGGACATCGACAATGGTCTCCAGCTCCATAATTATCGCTGTTTTTATCCGACAGAAGAGGAATACAAAAAAAAAAAAAAAAACATTTTTATTTCAATCGATCAATAATTCCCCCTAATTTCTACCCCAAATTTTGTTATTTGTAGAGAGAGAAACAGCAAAACAGAAGACCAACGAAGAAGAAGAAGAAATTGAAGAAGCATCTCCGTTTCTTCAAATCTTCGTTGCTCAAACACAACAATCTTCAAAACCCAGACAGATTTCTGGTTTTTCCGACC
AACCCAGTAAGTAAAATCTGTTCTTTTTTCACTTTTTTCCTGCTTTTTCTCTTCGGTTTTGTTGGGTTTTGTTTAATCGAGAAAGGGGAATCTTTAGCGTTCGGAAAAGATCAGAACTTTATCGCCTGCATTTCTCTCCGGCGAAGGAAACAAAGGGTTTTTTGGGTTTCTTTTTTGTTCTTCTGGATTGTGGATTCGTTGAATCTGCGGCAGTTTTCAGATGCCGGAGAAGGGCCTGCTTCGAGCGACGTCGCAGCCATGGCTTTTCGTGTCGTCTTCGTTGGTACATTCCCGTTTTTGGGTACTCACAGGGTTGGTTCTTCTCAGTATGTTGGCCGTTTGGAGTATCGACGGCTTCAACGTTAATACCTTCATCAAATTTTGGAGCTCCCCTCAAGATTTCGTCTCTGTTTCTTCCAATTTCACCAATACCCATTTTAATTTATCCACTAAGGATTCCAATTTCACCCAATTCGTCTCTTACAAACCCGAAAATTACCCGGAATCTGCGTTGTTTGAACCGGTTTCGCCGCATAATTCCACCGGCGAACCTCGCCGGATTGAGAATGAACCGGCGGTTCCCTACGCGTCTTTTTCTGATTGGTTTTCAGCTGAACTCGAGCCTAATTATACCTCCAATCTTCTGGCTTTATGGAAGACTCCCGGCGGCGAACCCTGCAGAGACTCCCGGACGGCGGATATTGCTATTTCCGGCATGGAAGGTTCGGGAGTGGTGGAGCTTTCAACAGGGGATGTTCATGAGTTTCGGTTTCAAGCTGTTGATGAATCTGGAAACCCTCGTTGTCTAGGTGGAGACTATTTTGAAACTGATCTTTCCGGCGAGTGGTGGAAGTCTCGGCCGTTTGTTAGAGATTTGGGGAATGGGACTTACTTGTTTTGGGTTCAGGTGCACCCTGATTTCGCCGGTGATTATGAACTGACTGTGATTCTTCTGTTCCGAAGTTTCGAGGGGCTGAGGTTTTCCCCGACCCGATTCGCGTATGATCGAGAGCTTCGGAGAATCAAGGTTCGATTTGTTAGGACTTCGGTGGGGTTGCCGGAGATTAAGAGTTGTGGAAGGTGGGATTTTGGTAAAGAGATTTGGACGGGACGGTGGACTCGACACGGTCGAAATGAGGGTTGCGAGATCAGTGATGACGGTCGGTACCGGTGTTTTCCACCGGAGTATCCTTGCCGGAGGCCTTGGTGCAATGGGTCATTGGGGTTGTTGGAGAGCAATGGTTGGGTTTATTCAGCACATTGTTCATTTCAGATGTTTACTAGTGATTCTGCTTGGGATTGCTTGAAGGGTAGATGGATCTTCTTTTGGGGTGATTCGAATCATGTCGATACGATAAGGAACCTTCTGAATTTCGTGTTGGATTTGCCTGAAATCCCTGCAGTTCCGAGGCGATTCGATAGGAATTTTTCGAACCCGAAAAACCCGTCTCAAACGGTTCGCATTACCAGCATCTTCAATGGCCATTGGAATGATACACAAAACTATGAAGGTTTGAATTCATTGAGAAATGAGGGATTCAGAAATCTCCTGTACAAGTACTTCTCCGAACAAACCGTTCCCGACACGATCATCATGAACTCGGGCCTCCACGACGGTGTTCACTGGTTGAATATCCGAGCTTTCTCGGTTGGGGCAGCCTATGCTGCATCATTTTGGAAACAACTTCTGGATTCCATCATGCAGAGGGGATTAACAGTCCCGAAAGTGTTCTACCGAACCACGGTCGCAACCGGTGGCTACGCTCGAACACTCGCATTCAATCCCAACAAAATGGAAATATTCAACTGGGTCGTGCTGGAGAAGTTAAAGGAATCCGGGATCATCCACGGCATCATCGACAACTTCGACATGACCTTCCCCTGGCATTTCGACAACCGTTGCAACGATGGAGTACATTACGGTCGAGCTCCGGCCAAGCTCACATGGAGGGACGGCCAAATCGGCCACCAATATTTCTTAGACCTCATGTTAGCTCATA
TTCTTCTCAATGCACTCTGTTCTTGA
mRNA sequence
GGTCTGGCAACAGAAGGAAAAGGCCTTCCTCGTTGCCCTTCTAATGGCGATCCGACCCAATCCGAACAAACCCATCTTCTGTCTCTTCTAATTCTCCTTCCCTTCTCGATCCTTCATTATCTGCAGTTACTGTGTTGATTAAAGAATCTTGAAAATTGTTAAACGTCAATGATTTTGATTGGGGCCTGCGATATCGGATATGAATCTCTCACTTTCATCTTCCTCTTCTTCTTCACTTCCTTCCGCTTCTGCATCTTCTTCTTCTTCCTCCCCCTCTTCGTCTTCCACTACTTCATGGTTCTCCGGCATAGTTCGCGGCCGTCCCGATAGGTCTTCGAGCGTGAAAATGTCTGGCAGCTCTGCCTCTGGCTTTGTTGCTGGGGATCCCCCTGGTCCTGTTGTGAGGAAGAATCATTTTCGTGGGTTGCTCTTCAAGTACGGTCCCAAGCCAATTCAGGTTGCATTTAAGACGGGGGATTACAAGCAGCAAGTCATATTCATTGGTGGATTGACTGATGGCTTTATGGCAACAGAATACTTGGAACCTCTTGCAATTGCTTTGGATAAAGAAAAATGGTCACTTGTTCAGATTCTTCTATCATCATCATACAGTGGATATGGTACCTCCAGCTTGCAACAAGATGCCAAGGAGCTTGATCAGCTAGTAAGCTATCTGATCAATAAAGAAGACTCCGAGGGTGTCGTGTTACTTGGACATAGTACTGGCTGTCAGGACATAGTCTATTATATGCGTACAAATGCAGCTTGCTCTCGAGCAGTTCGTGGTGCCATTTTGCAGGCTCCGGTTAGTGATCGAGAGTATAGAGCAACTCTTCCTGAAACAGCAGCCATGATCGACTTGGCTTCGACTATGATAAACGAAGGCAGAGGATTAGATCTTATGCCAAGGGAGGCAGACCCGTCTCCGATAACCGCCACTCGGTATTATTCGCTTTGCTCATACATGGGGGATGATGACATGTTCAGCTCCGACCTTAGCGATGACCAGTTGAGAATGAGACTTGGGCACATGGCTAACACACCGTGTCAGGTTATCTATTCCATGGGTGACGAATATGTGCCGGAATACGTGGATAAGGAGGCATTGGTTGATAGATTGTGCAAAGCAATGGGAGGTGCAGAGAAAGTTGAGATTCAACATGGGAATCATTCTCTGTCAAACAGAGTTAATGAAGCAGTTGATGCCATAATTGACTTCATTGTGAGGAAGGGTTATTTAAATGGGTATCAGATCTCTTGTCTAGACTGTGTTATCTGTGGTAGCTTTCATTGTTATTTGCAGACAAAGAGACAGAAAGGGTATGATTGTTTGTTCATTAGATTTCCTTCAATTGGAATTCTTCTTGTGTTCTCCATAACTTCGTGGAAAGAACGTCGACATTGTCGGTGCTCCGCGTTGCTTGACAAAACGAGAGAGAAACAGCAAAACAGAAGACCAACGAAGAAGAAGAAGAAATTGAAGAAGCATCTCCGTTTCTTCAAATCTTCGTTGCTCAAACACAACAATCTTCAAAACCCAGACAGATTTCTGGTTTTTCCGACCAACCCAGTAAGTAAAATCTGTTCTTTTTTCACTTTTTTCCTGCTTTTTCTCTTCGGTTTTGTTGGGTTTTGTTTAATCGAGAAAGGGGAATCTTTAGCGTTCGGAAAAGATCAGAACTTTATCGCCTGCATTTCTCTCCGGCGAAGGAAACAAAGGATGCCGGAGAAGGGCCTGCTTCGAGCGACGTCGCAGCCATGGCTTTTCGTGTCGTCTTCGTTGGTACATTCCCGTTTTTGGGTACTCACAGGGTTGGTTCTTCTCAGTATGTTGGCCGTTTGGAGTATCGACGGCTTCAACGTTAATACCTTCATCAAATTTTGGAGCTCCCCTCAAGATTTCGTCTCTGTTTCTTCCAATTTCACCAATACCCATTTTAATTTATCCACTAAGGATTCCAATTTCACCCAATTCGTCTCTTACAAACCCGAAAATTACC
CGGAATCTGCGTTGTTTGAACCGGTTTCGCCGCATAATTCCACCGGCGAACCTCGCCGGATTGAGAATGAACCGGCGGTTCCCTACGCGTCTTTTTCTGATTGGTTTTCAGCTGAACTCGAGCCTAATTATACCTCCAATCTTCTGGCTTTATGGAAGACTCCCGGCGGCGAACCCTGCAGAGACTCCCGGACGGCGGATATTGCTATTTCCGGCATGGAAGGTTCGGGAGTGGTGGAGCTTTCAACAGGGGATGTTCATGAGTTTCGGTTTCAAGCTGTTGATGAATCTGGAAACCCTCGTTGTCTAGGTGGAGACTATTTTGAAACTGATCTTTCCGGCGAGTGGTGGAAGTCTCGGCCGTTTGTTAGAGATTTGGGGAATGGGACTTACTTGTTTTGGGTTCAGGTGCACCCTGATTTCGCCGGTGATTATGAACTGACTGTGATTCTTCTGTTCCGAAGTTTCGAGGGGCTGAGGTTTTCCCCGACCCGATTCGCGTATGATCGAGAGCTTCGGAGAATCAAGGTTCGATTTGTTAGGACTTCGGTGGGGTTGCCGGAGATTAAGAGTTGTGGAAGGTGGGATTTTGGTAAAGAGATTTGGACGGGACGGTGGACTCGACACGGTCGAAATGAGGGTTGCGAGATCAGTGATGACGGTCGGTACCGGTGTTTTCCACCGGAGTATCCTTGCCGGAGGCCTTGGTGCAATGGGTCATTGGGGTTGTTGGAGAGCAATGGTTGGGTTTATTCAGCACATTGTTCATTTCAGATGTTTACTAGTGATTCTGCTTGGGATTGCTTGAAGGGTAGATGGATCTTCTTTTGGGGTGATTCGAATCATGTCGATACGATAAGGAACCTTCTGAATTTCGTGTTGGATTTGCCTGAAATCCCTGCAGTTCCGAGGCGATTCGATAGGAATTTTTCGAACCCGAAAAACCCGTCTCAAACGGTTCGCATTACCAGCATCTTCAATGGCCATTGGAATGATACACAAAACTATGAAGGTTTGAATTCATTGAGAAATGAGGGATTCAGAAATCTCCTGTACAAGTACTTCTCCGAACAAACCGTTCCCGACACGATCATCATGAACTCGGGCCTCCACGACGGTGTTCACTGGTTGAATATCCGAGCTTTCTCGGTTGGGGCAGCCTATGCTGCATCATTTTGGAAACAACTTCTGGATTCCATCATGCAGAGGGGATTAACAGTCCCGAAAGTGTTCTACCGAACCACGGTCGCAACCGGTGGCTACGCTCGAACACTCGCATTCAATCCCAACAAAATGGAAATATTCAACTGGGTCGTGCTGGAGAAGTTAAAGGAATCCGGGATCATCCACGGCATCATCGACAACTTCGACATGACCTTCCCCTGGCATTTCGACAACCGTTGCAACGATGGAGTACATTACGGTCGAGCTCCGGCCAAGCTCACATGGAGGGACGGCCAAATCGGCCACCAATATTTCTTAGACCTCATGTTAGCTCATATTCTTCTCAATGCACTCTGTTCTTGA
Coding sequence (CDS)
ATGAATCTCTCACTTTCATCTTCCTCTTCTTCTTCACTTCCTTCCGCTTCTGCATCTTCTTCTTCTTCCTCCCCCTCTTCGTCTTCCACTACTTCATGGTTCTCCGGCATAGTTCGCGGCCGTCCCGATAGGTCTTCGAGCGTGAAAATGTCTGGCAGCTCTGCCTCTGGCTTTGTTGCTGGGGATCCCCCTGGTCCTGTTGTGAGGAAGAATCATTTTCGTGGGTTGCTCTTCAAGTACGGTCCCAAGCCAATTCAGGTTGCATTTAAGACGGGGGATTACAAGCAGCAAGTCATATTCATTGGTGGATTGACTGATGGCTTTATGGCAACAGAATACTTGGAACCTCTTGCAATTGCTTTGGATAAAGAAAAATGGTCACTTGTTCAGATTCTTCTATCATCATCATACAGTGGATATGGTACCTCCAGCTTGCAACAAGATGCCAAGGAGCTTGATCAGCTAGTAAGCTATCTGATCAATAAAGAAGACTCCGAGGGTGTCGTGTTACTTGGACATAGTACTGGCTGTCAGGACATAGTCTATTATATGCGTACAAATGCAGCTTGCTCTCGAGCAGTTCGTGGTGCCATTTTGCAGGCTCCGGTTAGTGATCGAGAGTATAGAGCAACTCTTCCTGAAACAGCAGCCATGATCGACTTGGCTTCGACTATGATAAACGAAGGCAGAGGATTAGATCTTATGCCAAGGGAGGCAGACCCGTCTCCGATAACCGCCACTCGGTATTATTCGCTTTGCTCATACATGGGGGATGATGACATGTTCAGCTCCGACCTTAGCGATGACCAGTTGAGAATGAGACTTGGGCACATGGCTAACACACCGTGTCAGGTTATCTATTCCATGGGTGACGAATATGTGCCGGAATACGTGGATAAGGAGGCATTGGTTGATAGATTGTGCAAAGCAATGGGAGGTGCAGAGAAAGTTGAGATTCAACATGGGAATCATTCTCTGTCAAACAGAGTTAATGAAGCAGTTGATGCCATAATTGACTTCATTGTGAGGAAGGGTTATTTAAATGGGTATCAGATCTCTTGTCTAGACTGTGTTATCTGTGGTAGCTTTCATTGTTATTTGCAGACAAAGAGACAGAAAGGGTATGATTGTTTGTTCATTAGATTTCCTTCAATTGGAATTCTTCTTGTGTTCTCCATAACTTCGTGGAAAGAACGTCGACATTGTCGGTGCTCCGCGTTGCTTGACAAAACGAGAGAGAAACAGCAAAACAGAAGACCAACGAAGAAGAAGAAGAAATTGAAGAAGCATCTCCGTTTCTTCAAATCTTCGTTGCTCAAACACAACAATCTTCAAAACCCAGACAGATTTCTGGTTTTTCCGACCAACCCAGTAAGTAAAATCTGTTCTTTTTTCACTTTTTTCCTGCTTTTTCTCTTCGGTTTTGTTGGGTTTTGTTTAATCGAGAAAGGGGAATCTTTAGCGTTCGGAAAAGATCAGAACTTTATCGCCTGCATTTCTCTCCGGCGAAGGAAACAAAGGATGCCGGAGAAGGGCCTGCTTCGAGCGACGTCGCAGCCATGGCTTTTCGTGTCGTCTTCGTTGGTACATTCCCGTTTTTGGGTACTCACAGGGTTGGTTCTTCTCAGTATGTTGGCCGTTTGGAGTATCGACGGCTTCAACGTTAATACCTTCATCAAATTTTGGAGCTCCCCTCAAGATTTCGTCTCTGTTTCTTCCAATTTCACCAATACCCATTTTAATTTATCCACTAAGGATTCCAATTTCACCCAATTCGTCTCTTACAAACCCGAAAATTACCCGGAATCTGCGTTGTTTGAACCGGTTTCGCCGCATAATTCCACCGGCGAACCTCGCCGGATTGAGAATGAACCGGCGGTTCCCTACGCGTCTTTTTCTGATTGGTTTTCAGCTGAACTCGAGCCTAATTATACCTCCAATCTTCTGGCTTTATGGAAGACTCCCGGCGGCGAACCCTGCAGAGACTCCCGGACGGCGGA
TATTGCTATTTCCGGCATGGAAGGTTCGGGAGTGGTGGAGCTTTCAACAGGGGATGTTCATGAGTTTCGGTTTCAAGCTGTTGATGAATCTGGAAACCCTCGTTGTCTAGGTGGAGACTATTTTGAAACTGATCTTTCCGGCGAGTGGTGGAAGTCTCGGCCGTTTGTTAGAGATTTGGGGAATGGGACTTACTTGTTTTGGGTTCAGGTGCACCCTGATTTCGCCGGTGATTATGAACTGACTGTGATTCTTCTGTTCCGAAGTTTCGAGGGGCTGAGGTTTTCCCCGACCCGATTCGCGTATGATCGAGAGCTTCGGAGAATCAAGGTTCGATTTGTTAGGACTTCGGTGGGGTTGCCGGAGATTAAGAGTTGTGGAAGGTGGGATTTTGGTAAAGAGATTTGGACGGGACGGTGGACTCGACACGGTCGAAATGAGGGTTGCGAGATCAGTGATGACGGTCGGTACCGGTGTTTTCCACCGGAGTATCCTTGCCGGAGGCCTTGGTGCAATGGGTCATTGGGGTTGTTGGAGAGCAATGGTTGGGTTTATTCAGCACATTGTTCATTTCAGATGTTTACTAGTGATTCTGCTTGGGATTGCTTGAAGGGTAGATGGATCTTCTTTTGGGGTGATTCGAATCATGTCGATACGATAAGGAACCTTCTGAATTTCGTGTTGGATTTGCCTGAAATCCCTGCAGTTCCGAGGCGATTCGATAGGAATTTTTCGAACCCGAAAAACCCGTCTCAAACGGTTCGCATTACCAGCATCTTCAATGGCCATTGGAATGATACACAAAACTATGAAGGTTTGAATTCATTGAGAAATGAGGGATTCAGAAATCTCCTGTACAAGTACTTCTCCGAACAAACCGTTCCCGACACGATCATCATGAACTCGGGCCTCCACGACGGTGTTCACTGGTTGAATATCCGAGCTTTCTCGGTTGGGGCAGCCTATGCTGCATCATTTTGGAAACAACTTCTGGATTCCATCATGCAGAGGGGATTAACAGTCCCGAAAGTGTTCTACCGAACCACGGTCGCAACCGGTGGCTACGCTCGAACACTCGCATTCAATCCCAACAAAATGGAAATATTCAACTGGGTCGTGCTGGAGAAGTTAAAGGAATCCGGGATCATCCACGGCATCATCGACAACTTCGACATGACCTTCCCCTGGCATTTCGACAACCGTTGCAACGATGGAGTACATTACGGTCGAGCTCCGGCCAAGCTCACATGGAGGGACGGCCAAATCGGCCACCAATATTTCTTAGACCTCATGTTAGCTCATATTCTTCTCAATGCACTCTGTTCTTGA
Protein sequence
MNLSLSSSSSSSLPSASASSSSSSPSSSSTTSWFSGIVRGRPDRSSSVKMSGSSASGFVAGDPPGPVVRKNHFRGLLFKYGPKPIQVAFKTGDYKQQVIFIGGLTDGFMATEYLEPLAIALDKEKWSLVQILLSSSYSGYGTSSLQQDAKELDQLVSYLINKEDSEGVVLLGHSTGCQDIVYYMRTNAACSRAVRGAILQAPVSDREYRATLPETAAMIDLASTMINEGRGLDLMPREADPSPITATRYYSLCSYMGDDDMFSSDLSDDQLRMRLGHMANTPCQVIYSMGDEYVPEYVDKEALVDRLCKAMGGAEKVEIQHGNHSLSNRVNEAVDAIIDFIVRKGYLNGYQISCLDCVICGSFHCYLQTKRQKGYDCLFIRFPSIGILLVFSITSWKERRHCRCSALLDKTREKQQNRRPTKKKKKLKKHLRFFKSSLLKHNNLQNPDRFLVFPTNPVSKICSFFTFFLLFLFGFVGFCLIEKGESLAFGKDQNFIACISLRRRKQRMPEKGLLRATSQPWLFVSSSLVHSRFWVLTGLVLLSMLAVWSIDGFNVNTFIKFWSSPQDFVSVSSNFTNTHFNLSTKDSNFTQFVSYKPENYPESALFEPVSPHNSTGEPRRIENEPAVPYASFSDWFSAELEPNYTSNLLALWKTPGGEPCRDSRTADIAISGMEGSGVVELSTGDVHEFRFQAVDESGNPRCLGGDYFETDLSGEWWKSRPFVRDLGNGTYLFWVQVHPDFAGDYELTVILLFRSFEGLRFSPTRFAYDRELRRIKVRFVRTSVGLPEIKSCGRWDFGKEIWTGRWTRHGRNEGCEISDDGRYRCFPPEYPCRRPWCNGSLGLLESNGWVYSAHCSFQMFTSDSAWDCLKGRWIFFWGDSNHVDTIRNLLNFVLDLPEIPAVPRRFDRNFSNPKNPSQTVRITSIFNGHWNDTQNYEGLNSLRNEGFRNLLYKYFSEQTVPDTIIMNSGLHDGVHWLNIRAFSVGAAYAASFWKQLLDSIMQRGLTVPKVFYRTTVATGGYARTLAFNPNKMEIFNWVVLEKLKESGIIHGIIDNFDMTFPWHFDNRCNDGVHYGRAPAKLTWRDGQIGHQYFLDLMLAHILLNALCS
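The relationship between the coding sequence and the protein sequence listed above can be checked with a short translation sketch. This is a minimal illustration, assuming the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are hypothetical helpers and not part of the database's tooling.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), written in
# the conventional T, C, A, G base order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence codon by codon.

    Trailing bases that do not fill a complete codon are ignored, and
    codons containing anything other than A, C, G or T become 'X'.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 24 bases of the CDS listed above.
print(translate("ATGAATCTCTCACTTTCATCTTCC"))  # MNLSLSSS
```

The eight residues produced agree with the start of the protein sequence (MNLSLSSS...). Translating the full CDS the same way is a quick consistency check between the two listings.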
Homology
BLAST of CmoCh06G014400 vs. ExPASy Swiss-Prot
Match:
Q9C0Y8 (UPF0613 protein PB24D3.06c OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=SPAPB24D3.06c PE=3 SV=1)
HSP 1 Score: 127.5 bits (319), Expect = 9.5e-28
Identity = 86/278 (30.94%), Positives = 144/278 (51.80%), Query Frame = 0
Query: 98 VIFIGGLTDGFMATEYLEPLAIALDKEKWSLVQILLSSSYSGYGTSSLQQDAKELDQLVS 157
++F+GGL DG + Y++ L LD+ WS+VQ+ SSY G+GT SL++D ++L + V
Sbjct: 38 LLFVGGLGDGLLTVPYVQELVNPLDEIGWSIVQVQTQSSYIGWGTGSLKRDDEDLHKAVD 97
Query: 158 YLIN----KEDSEGVVLLGHSTGCQDIVYYMRTNAACSRAVRGAILQAPVSDRE--YRAT 217
Y ++ + +VL+GHSTG Q+++YY+ + + + G I QAPVSDRE Y+
Sbjct: 98 YFLHIGGADFSTRKIVLMGHSTGSQNVLYYLTQSILPNYLIAG-IAQAPVSDREAAYQFN 157
Query: 218 LPE-TAAMID-LASTMINEGRGLDLMPREADPS-----PITATRYYSLCSYMGDDDMFSS 277
E T ++D + + +++G G D++PR + P +A R L G+DD FSS
Sbjct: 158 GKEKTKELVDWVKAEYLDKGLGNDVLPRSKVENFFGEVPTSANRCIDLTDVRGNDDFFSS 217
Query: 278 DLSDDQLRMRLGHM-------ANTPCQVIYSMGDEYVPEYVDKEALVDRLCKAM------ 337
DLS D G++ A++ ++ S DE+V DK L++R +++
Sbjct: 218 DLSADDFAKTFGNLKEISGSTAHSQLILLMSERDEFVSPSTDKAQLLNRFRESIRPTTSN 277
Query: 338 --------GGAEKVEIQHGNHSLSNRVNEAVDAIIDFI 342
G V + +L +N+ + A+ FI
Sbjct: 278 SSLSGIIPGATHNVGPKSSPEALKWLINQLITALRSFI 314
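The percentages in the HSP summary line are plain ratios of matching columns to alignment length. The sketch below reproduces the two values reported for this hit; the function name `percent` is a hypothetical helper, and rounding to two decimals is an assumption based on how the report prints its figures.

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the aligned columns, to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)


# Counts taken from the HSP summary of the Q9C0Y8 hit above.
identity = percent(86, 278)    # identical residues
positives = percent(144, 278)  # identical plus conservatively substituted

print(f"{identity:.2f}% identity, {positives:.2f}% positives")
# 30.94% identity, 51.80% positives
```

The denominator is the alignment length including gap columns (278 here), not the length of either protein, so a short alignment can show a high percentage while covering only a small part of the query.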
BLAST of CmoCh06G014400 vs. ExPASy Swiss-Prot
Match:
Q4WF56 (Fusarinine C esterase sidJ OS=Neosartorya fumigata (strain ATCC MYA-4609 / Af293 / CBS 101355 / FGSC A1100) OX=330879 GN=sidJ PE=1 SV=1)
HSP 1 Score: 112.1 bits (279), Expect = 4.1e-23
Identity = 90/293 (30.72%), Positives = 133/293 (45.39%), Query Frame = 0
Query: 70 KNHFRGLLFKYGPKPIQVAFKTGDYKQ--QVIFIGGLTDGFMATEYLEPLAIALDKEKWS 129
K G+L Y + + T ++ ++F+GGL DG T YL LA AL +WS
Sbjct: 8 KGGLPGILHHYTETLVTFEYTTTTTRKPHSLLFVGGLGDGLATTSYLADLAHALQPTEWS 67
Query: 130 LVQILLSSSYSGYGTSSLQQDAKELDQLVSYLI--------NKEDSEGVVLLGHSTGCQD 189
L + L+SSY +G L +D E+ Q + Y+ S +VL+GHSTG Q
Sbjct: 68 LFTLTLTSSYQSWGLGHLDRDTNEIAQCLKYIKEYKTEKFGGSASSGKIVLMGHSTGSQC 127
Query: 190 IVYYM-RTNAAC-------------SRAVRGAILQAPVSDREY----------RATLPET 249
+++Y+ R N + GAI+QAPVSDRE T E
Sbjct: 128 VLHYLSRPNPHTHTPAFDPYLEHVERMPLDGAIMQAPVSDREAIQWVLAEGLGDRTPAEI 187
Query: 250 AAMIDLASTMINEG-----RGLDLMPREADPS-------PITATRYYSLCS-----YMGD 307
+ + ++M E G D++ A S P++A R+ SL S +
Sbjct: 188 RPVFEKLTSMAREAARDADAGTDVLLPLAMTSLVYPAHTPLSARRFLSLTSPESPESPSE 247
BLAST of CmoCh06G014400 vs. ExPASy TrEMBL
Match:
A0A6J1F9E2 (uncharacterized protein LOC111443291 OS=Cucurbita moschata OX=3662 GN=LOC111443291 PE=4 SV=1)
HSP 1 Score: 1283.9 bits (3321), Expect = 0.0e+00
Identity = 601/601 (100.00%), Positives = 601/601 (100.00%), Query Frame = 0
Query: 508 MPEKGLLRATSQPWLFVSSSLVHSRFWVLTGLVLLSMLAVWSIDGFNVNTFIKFWSSPQD 567
MPEKGLLRATSQPWLFVSSSLVHSRFWVLTGLVLLSMLAVWSIDGFNVNTFIKFWSSPQD
Sbjct: 1 MPEKGLLRATSQPWLFVSSSLVHSRFWVLTGLVLLSMLAVWSIDGFNVNTFIKFWSSPQD 60
Query: 568 FVSVSSNFTNTHFNLSTKDSNFTQFVSYKPENYPESALFEPVSPHNSTGEPRRIENEPAV 627
FVSVSSNFTNTHFNLSTKDSNFTQFVSYKPENYPESALFEPVSPHNSTGEPRRIENEPAV
Sbjct: 61 FVSVSSNFTNTHFNLSTKDSNFTQFVSYKPENYPESALFEPVSPHNSTGEPRRIENEPAV 120
Query: 628 PYASFSDWFSAELEPNYTSNLLALWKTPGGEPCRDSRTADIAISGMEGSGVVELSTGDVH 687
PYASFSDWFSAELEPNYTSNLLALWKTPGGEPCRDSRTADIAISGMEGSGVVELSTGDVH
Sbjct: 121 PYASFSDWFSAELEPNYTSNLLALWKTPGGEPCRDSRTADIAISGMEGSGVVELSTGDVH 180
Query: 688 EFRFQAVDESGNPRCLGGDYFETDLSGEWWKSRPFVRDLGNGTYLFWVQVHPDFAGDYEL 747
EFRFQAVDESGNPRCLGGDYFETDLSGEWWKSRPFVRDLGNGTYLFWVQVHPDFAGDYEL
Sbjct: 181 EFRFQAVDESGNPRCLGGDYFETDLSGEWWKSRPFVRDLGNGTYLFWVQVHPDFAGDYEL 240
Query: 748 TVILLFRSFEGLRFSPTRFAYDRELRRIKVRFVRTSVGLPEIKSCGRWDFGKEIWTGRWT 807
TVILLFRSFEGLRFSPTRFAYDRELRRIKVRFVRTSVGLPEIKSCGRWDFGKEIWTGRWT
Sbjct: 241 TVILLFRSFEGLRFSPTRFAYDRELRRIKVRFVRTSVGLPEIKSCGRWDFGKEIWTGRWT 300
Query: 808 RHGRNEGCEISDDGRYRCFPPEYPCRRPWCNGSLGLLESNGWVYSAHCSFQMFTSDSAWD 867
RHGRNEGCEISDDGRYRCFPPEYPCRRPWCNGSLGLLESNGWVYSAHCSFQMFTSDSAWD
Sbjct: 301 RHGRNEGCEISDDGRYRCFPPEYPCRRPWCNGSLGLLESNGWVYSAHCSFQMFTSDSAWD 360
Query: 868 CLKGRWIFFWGDSNHVDTIRNLLNFVLDLPEIPAVPRRFDRNFSNPKNPSQTVRITSIFN 927
CLKGRWIFFWGDSNHVDTIRNLLNFVLDLPEIPAVPRRFDRNFSNPKNPSQTVRITSIFN
Sbjct: 361 CLKGRWIFFWGDSNHVDTIRNLLNFVLDLPEIPAVPRRFDRNFSNPKNPSQTVRITSIFN 420
Query: 928 GHWNDTQNYEGLNSLRNEGFRNLLYKYFSEQTVPDTIIMNSGLHDGVHWLNIRAFSVGAA 987
GHWNDTQNYEGLNSLRNEGFRNLLYKYFSEQTVPDTIIMNSGLHDGVHWLNIRAFSVGAA
Sbjct: 421 GHWNDTQNYEGLNSLRNEGFRNLLYKYFSEQTVPDTIIMNSGLHDGVHWLNIRAFSVGAA 480
Query: 988 YAASFWKQLLDSIMQRGLTVPKVFYRTTVATGGYARTLAFNPNKMEIFNWVVLEKLKESG 1047
YAASFWKQLLDSIMQRGLTVPKVFYRTTVATGGYARTLAFNPNKMEIFNWVVLEKLKESG
Sbjct: 481 YAASFWKQLLDSIMQRGLTVPKVFYRTTVATGGYARTLAFNPNKMEIFNWVVLEKLKESG 540
Query: 1048 IIHGIIDNFDMTFPWHFDNRCNDGVHYGRAPAKLTWRDGQIGHQYFLDLMLAHILLNALC 1107
IIHGIIDNFDMTFPWHFDNRCNDGVHYGRAPAKLTWRDGQIGHQYFLDLMLAHILLNALC
Sbjct: 541 IIHGIIDNFDMTFPWHFDNRCNDGVHYGRAPAKLTWRDGQIGHQYFLDLMLAHILLNALC 600
Query: 1108 S 1109
S
Sbjct: 601 S 601
BLAST of CmoCh06G014400 vs. ExPASy TrEMBL
Match:
A0A6J1I996 (uncharacterized protein LOC111472251 OS=Cucurbita maxima OX=3661 GN=LOC111472251 PE=4 SV=1)
HSP 1 Score: 1260.0 bits (3259), Expect = 0.0e+00
Identity = 589/601 (98.00%), Positives = 592/601 (98.50%), Query Frame = 0
Query: 508 MPEKGLLRATSQPWLFVSSSLVHSRFWVLTGLVLLSMLAVWSIDGFNVNTFIKFWSSPQD 567
MPEKGLLRATSQPWLF SS LVHSRFWVLTGLVLLSMLA WSIDGFN+NTFIKFWSSPQD
Sbjct: 1 MPEKGLLRATSQPWLFGSSPLVHSRFWVLTGLVLLSMLAFWSIDGFNINTFIKFWSSPQD 60
Query: 568 FVSVSSNFTNTHFNLSTKDSNFTQFVSYKPENYPESALFEPVSPHNSTGEPRRIENEPAV 627
FVSVSSNFTNTHFNLSTKDSNFTQFVSYKPENYPESALFEPVSPHNST EPRRIENEPAV
Sbjct: 61 FVSVSSNFTNTHFNLSTKDSNFTQFVSYKPENYPESALFEPVSPHNSTVEPRRIENEPAV 120
Query: 628 PYASFSDWFSAELEPNYTSNLLALWKTPGGEPCRDSRTADIAISGMEGSGVVELSTGDVH 687
PYASFSDWFSAELEPNYTSNLLALWKTPGGEPCRDSRTADIAISGMEG GVVELSTGDVH
Sbjct: 121 PYASFSDWFSAELEPNYTSNLLALWKTPGGEPCRDSRTADIAISGMEGPGVVELSTGDVH 180
Query: 688 EFRFQAVDESGNPRCLGGDYFETDLSGEWWKSRPFVRDLGNGTYLFWVQVHPDFAGDYEL 747
EFRFQAVDESGNPRCLGGDYFETDLSGEWWKSRPFVRD GNGTYLFWVQVHPDFAGDYEL
Sbjct: 181 EFRFQAVDESGNPRCLGGDYFETDLSGEWWKSRPFVRDFGNGTYLFWVQVHPDFAGDYEL 240
Query: 748 TVILLFRSFEGLRFSPTRFAYDRELRRIKVRFVRTSVGLPEIKSCGRWDFGKEIWTGRWT 807
TVILLFRSFEGLRFSPTRFAYDRELRRIKVRFVRTSV LPEIKSCGRWDFG+EIWTGRWT
Sbjct: 241 TVILLFRSFEGLRFSPTRFAYDRELRRIKVRFVRTSVELPEIKSCGRWDFGREIWTGRWT 300
Query: 808 RHGRNEGCEISDDGRYRCFPPEYPCRRPWCNGSLGLLESNGWVYSAHCSFQMFTSDSAWD 867
RHGRNEGCEISDDGRYRCFPPEYPCRRPWCNGSLGLLESNGWVYSAHCSFQMFTS SAWD
Sbjct: 301 RHGRNEGCEISDDGRYRCFPPEYPCRRPWCNGSLGLLESNGWVYSAHCSFQMFTSGSAWD 360
Query: 868 CLKGRWIFFWGDSNHVDTIRNLLNFVLDLPEIPAVPRRFDRNFSNPKNPSQTVRITSIFN 927
CLKGRWIFFWGDSNHVDTIRNLLNFVLDLPEIPAVPRRFDRNFSNPKNPSQTVRITSIFN
Sbjct: 361 CLKGRWIFFWGDSNHVDTIRNLLNFVLDLPEIPAVPRRFDRNFSNPKNPSQTVRITSIFN 420
Query: 928 GHWNDTQNYEGLNSLRNEGFRNLLYKYFSEQTVPDTIIMNSGLHDGVHWLNIRAFSVGAA 987
GHWNDTQNYEGLNSLRNEGFRNLLYKYFSEQTVPDTIIMNSGLHDGVHWLNIRAFSVGAA
Sbjct: 421 GHWNDTQNYEGLNSLRNEGFRNLLYKYFSEQTVPDTIIMNSGLHDGVHWLNIRAFSVGAA 480
Query: 988 YAASFWKQLLDSIMQRGLTVPKVFYRTTVATGGYARTLAFNPNKMEIFNWVVLEKLKESG 1047
YAASFWKQLLDSIM RGLTVPKVFYRTTVATGGYARTLAFNPNKMEIFNWVVLEKLKESG
Sbjct: 481 YAASFWKQLLDSIMLRGLTVPKVFYRTTVATGGYARTLAFNPNKMEIFNWVVLEKLKESG 540
Query: 1048 IIHGIIDNFDMTFPWHFDNRCNDGVHYGRAPAKLTWRDGQIGHQYFLDLMLAHILLNALC 1107
IIHG+IDNFDMTFPWHFDNRCNDGVHYGRAPAKLTWRDGQIGHQYFLDLMLAHILLNALC
Sbjct: 541 IIHGVIDNFDMTFPWHFDNRCNDGVHYGRAPAKLTWRDGQIGHQYFLDLMLAHILLNALC 600
Query: 1108 S 1109
S
Sbjct: 601 S 601
BLAST of CmoCh06G014400 vs. ExPASy TrEMBL
Match:
A0A5J5AEB9 (Uncharacterized protein OS=Nyssa sinensis OX=561372 GN=F0562_034734 PE=4 SV=1)
HSP 1 Score: 1147.1 bits (2966), Expect = 0.0e+00
Identity = 605/1144 (52.88%), Postives = 743/1144 (64.95%), Query Frame = 0
Query: 1 MNLSLSSSSSSSLPSASASSSSSSPSSSSTTSWFSGIVRGRPDRSSSVKMSGSSASGFVA 60
MNLS SSSSSS S+S+SSSS S SSS+TTSW SGIVRGR D+S SVKM+ +S +G
Sbjct: 1 MNLSSSSSSSS---SSSSSSSSLSSSSSATTSWLSGIVRGRFDKSGSVKMANNSITG--- 60
Query: 61 GDPPGPVVRKNHFRGLLFKYGPKPIQVAFKTGDYKQQVIFIGGLTDGFMATEYLEPLAIA 120
GD GP+ RKN F+G++FKYGPKPIQVAF+TGD KQQVIFIGG+T+GF+ATEYLEPLA+A
Sbjct: 61 GDSVGPINRKNQFQGVMFKYGPKPIQVAFRTGDNKQQVIFIGGMTEGFLATEYLEPLALA 120
Query: 121 LDKEKWSLVQILLSSSYSGYGTSSLQQDAKELDQLVSYLINKEDSEGVVLLGHSTGCQDI 180
L+KEKWSLVQ LLSSSYSGYG SSL++DA ELD+L+SYLINKEDSEGVVLLGHSTGCQDI
Sbjct: 121 LEKEKWSLVQFLLSSSYSGYGISSLKRDAMELDELISYLINKEDSEGVVLLGHSTGCQDI 180
Query: 181 VYYMRTNAACSRAVRGAILQAPVSDREYRATLPETAAMIDLASTMINEGRGLDLMPREAD 240
VYY+RTN ACSRAVR AILQAPVSDRE+RAT PET AMIDLAS +I+EGRG +LMPREAD
Sbjct: 181 VYYIRTNTACSRAVRAAILQAPVSDREHRATRPETTAMIDLASKLISEGRGSELMPREAD 240
Query: 241 P-SPITATRYYSLCSYMGDDDMFSSDLSDDQLRMRLGHMANTPCQVIYSMGDEYVPEYVD 300
P SP+TA RY+S C+Y GDDDMFSSDLSDDQLR RLGH++ TPCQ++ S
Sbjct: 241 PDSPVTAYRYHSFCAYTGDDDMFSSDLSDDQLRKRLGHLSTTPCQIVQS----------- 300
Query: 301 KEALVDRLCKAMGGAEKVEIQHGNHSLSNRVNEAVDAIIDFIVRKGYLNGYQISCLDCVI 360
NG+ +S
Sbjct: 301 ------------------------------------------------NGWILSS----- 360
Query: 361 CGSFHCYLQTKRQKGYDCLFIRFPSIGILLVFSITSWKERRHCRCSALLDKTREKQQNRR 420
Sbjct: 361 ------------------------------------------------------------ 420
Query: 421 PTKKKKKLKKHLRFFKSSLLKHNNLQNPDRFLVFPTNPVSKICSFFTFFLLFLFGFVGFC 480
F SL HN LQ D + + + K F F LL+ F
Sbjct: 421 --------------FIFSLFLHNLLQKSDVVWIREFSSIGKGLPTFRFLLLYAF------ 480
Query: 481 LIEKGESLAFGKDQNFIACISLRRRKQRMPEKGLLRATSQPWLFVSSSLVHSRFWVLTGL 540
F + I MPEK ++ W+F S+ ++H F VLT
Sbjct: 481 ---------------FASTI--------MPEKAANTVSTYSWIFPSNPMIHLWFKVLTAT 540
Query: 541 VLLSMLAVWSIDGFNV------------------------NTFIKFWSSPQDFVSVSSNF 600
V L +L VW IDG+NV +F S ++ N
Sbjct: 541 VFLGILIVWGIDGWNVADLQRDFLNVKTSTPVDQNLKPTHENLSEFMKSRRNLTHTHQNL 600
Query: 601 TNTHFNLSTK-------DSNFTQFVSYKPE----NYPESALFEPVSPHNSTGEPRRIENE 660
T+TH NLS+K +N T F S + N+ L P +P
Sbjct: 601 THTHENLSSKIDDTKQYPANLTNFSSQHNQLNLTNFSSPVLVNPTNP------------- 660
Query: 661 PAVPYASFSDWFSAELEPNYTSNLLALWKTPGGEPCRDSRTADIAISGMEGSGVVELSTG 720
P DW SA+LEPNY+SNLLA W PGGEPC+DS+T +I+I ++G +ELSTG
Sbjct: 661 ---PKPVTLDWISAKLEPNYSSNLLAGWLNPGGEPCKDSKTVEISIPTLDGRNRIELSTG 720
Query: 721 DVHEFRFQAVDESGNPRCLGGDYFETDLSGEWWKSRPFVRDLGNGTYLFWVQVHPDFAGD 780
D+HEF FQA+D+SG P CLGGDYFETDLSGE WKSRP ++DLGNGTY F +QVHPDFAG+
Sbjct: 721 DIHEFVFQALDDSGKPHCLGGDYFETDLSGESWKSRPPIKDLGNGTYSFSLQVHPDFAGN 780
Query: 781 YELTVILLFRSFEGLRFSPTRFAYDRELRRIKVRFVRTSVGLPEIKSCGRWDFGKEIWTG 840
Y LTVILLFR FEGL+FSP RFA+D+ LR I + F ++S LP I+ C + D+ +++W+G
Sbjct: 781 YTLTVILLFRHFEGLKFSPERFAFDKILRIIPINFYKSSAQLPGIRECQKSDYTRDVWSG 840
Query: 841 RWTRHGRNEGCEISDDGRYRCFPPEYPCRRPWCNGSLGLLESNGWVYSAHCSFQMFTSDS 900
RWTRHG+N+ C IS+DGRYRCF P+YPC+RPWC+GSLGLLESNGW+YS HCSF++F+++
Sbjct: 841 RWTRHGKNDTCPISNDGRYRCFEPDYPCQRPWCDGSLGLLESNGWIYSTHCSFRLFSTER 900
Query: 901 AWDCLKGRWIFFWGDSNHVDTIRNLLNFVLDLPEIPAVPRRFDRNFSNPKNPSQTVRITS 960
AW CL RWIFFWGDSNH DT+RN+L+F+L++PE+ AVPRRFD N +NPK+PSQTVRIT+
Sbjct: 901 AWSCLNNRWIFFWGDSNHCDTVRNILHFILEVPEVEAVPRRFDMNITNPKDPSQTVRITN 955
Query: 961 IFNGHWNDTQNYEGLNSLRNEGFRNLLYKYFSEQTVPDTIIMNSGLHDGVHWLNIRAFSV 1020
IFNGH NDT NY+GLNSL++ +R LL YFS++TVPD++IMNSGLHDGV W NIR F+
Sbjct: 961 IFNGHPNDTGNYQGLNSLKDGPYRELLKHYFSQETVPDSVIMNSGLHDGVFWPNIRRFTK 955
Query: 1021 GAAYAASFWKQLLDSIMQRGLTVPKVFYRTTVATGGYARTLAFNPNKMEIFNWVVLEKLK 1080
GA YAA+FW ++++S+ +RGL P+V YRTTVATGGYAR LAFNP+KME FN VVL+KL+
Sbjct: 1021 GAEYAAAFWAEVVESVRRRGLVAPEVIYRTTVATGGYARKLAFNPHKMEAFNGVVLDKLR 955
Query: 1081 ESGIIHGIIDNFDMTFPWHFDNRCNDGVHYGRAPAKLTWRDGQIGHQYFLDLMLAHILLN 1109
+ G ++ +ID+FDMT+PWH+DNRCNDGVHYGRAPAKL WRDG+IGHQYF+DLML H+LLN
Sbjct: 1081 QFGAVNRVIDDFDMTYPWHYDNRCNDGVHYGRAPAKLKWRDGKIGHQYFVDLMLGHVLLN 955
BLAST of CmoCh06G014400 vs. ExPASy TrEMBL
Match:
A0A5J5C3T1 (Uncharacterized protein OS=Nyssa sinensis OX=561372 GN=F0562_000271 PE=4 SV=1)
HSP 1 Score: 1142.1 bits (2953), Expect = 0.0e+00
Identity = 595/1103 (53.94%), Postives = 726/1103 (65.82%), Query Frame = 0
Query: 7 SSSSSSLPSASASSSSSSPSSSSTTSWFSGIVRGRPDRSSSVKMSGSSASGFVAGDPPGP 66
S+S SL S+S+SS+S S SSSSTTSW SGIVRGR D+S SVKM+ +S +G GD GP
Sbjct: 3 SASDMSL-SSSSSSASLSSSSSSTTSWLSGIVRGRSDKSGSVKMANNSTTG---GDSVGP 62
Query: 67 VVRKNHFRGLLFKYGPKPIQVAFKTGDYKQQVIFIGGLTDGFMATEYLEPLAIALDKEKW 126
+ RKN FRG++FKYGPKPIQVAFKTGDYKQQVIFIGGLTDGF+AT YLEPLAIAL+ EKW
Sbjct: 63 INRKNQFRGVMFKYGPKPIQVAFKTGDYKQQVIFIGGLTDGFLATAYLEPLAIALENEKW 122
Query: 127 SLVQILLSSSYSGYGTSSLQQDAKELDQLVSYLINKEDSEGVVLLGHSTGCQDIVYYMRT 186
SLVQ LLSSSYSGYG SSL+QDA ELDQL+SYLINKEDSEGVVLLGHSTGCQDIVYYMRT
Sbjct: 123 SLVQFLLSSSYSGYGISSLKQDAFELDQLISYLINKEDSEGVVLLGHSTGCQDIVYYMRT 182
Query: 187 NAACSRAVRGAILQAPVSDREYRATLPETAAMIDLASTMINEGRGLDLMPREADP-SPIT 246
NAACSRAVR AILQAPVSDREYRATLPETAAMIDLAS M++E RG +LMP+EADP +PIT
Sbjct: 183 NAACSRAVRAAILQAPVSDREYRATLPETAAMIDLASKMMSESRGSELMPKEADPEAPIT 242
Query: 247 ATRYYSLCSYMGDDDMFSSDLSDDQLRMRLGHMANTPCQVIYSMGDEYVPEYVDKEALVD 306
A RY+SLC+YMGDDDMFSSDLSDDQLRMRLGHM+NTPCQVI+SM DEYVP+YVDK+ALVD
Sbjct: 243 AYRYHSLCAYMGDDDMFSSDLSDDQLRMRLGHMSNTPCQVIFSMADEYVPDYVDKKALVD 302
Query: 307 RLCKAMGGAEKVEIQHGNHSLSNRVNEAVDAIIDFIVRKGYLNGYQISCLDCVICGSFHC 366
RLC+AMGGAEKVEI+ GNHSLSNRV EAV AII+F+ R+G C++ S +
Sbjct: 303 RLCRAMGGAEKVEIEWGNHSLSNRVEEAVHAIINFVKREG-----PKGCIET----SLYA 362
Query: 367 YLQTKRQKGYDCLFIRFPSIGILLVFSITSWKERRHCRCSALLDKTREKQQNRRPTKKKK 426
+LQ
Sbjct: 363 FLQ--------------------------------------------------------- 422
Query: 427 KLKKHLRFFKSSLLKHNNLQNPDRFLVFPTNPVSKICSFFTFFLLFLFGFVGFCLIEKGE 486
H+ + +P+
Sbjct: 423 ----HIYW----------------------SPI--------------------------- 482
Query: 487 SLAFGKDQNFIACISLRRRKQRMPEKGLLRATSQPWLFVSSSLVHSRFWVLTGLVLLSML 546
MP+K +S W+F S+ ++H F VLT V L +L
Sbjct: 483 ----------------------MPDKAANSVSSCSWIFRSNPMIHWYFEVLTLTVFLGIL 542
Query: 547 AVWSIDGFNVNTFIKFWSSPQDFVSVSSNFTNTHFNLSTKDSNFTQFVSYKPENYPESAL 606
VW ++ +N DF++V + KP
Sbjct: 543 FVWGVNAWNAGDL------QNDFLTVKTQ---------------------KP-------- 602
Query: 607 FEPVSPHNSTGEPRRIENEPAVPYASFSDWFSAELEPNYTSNLLALWKTPGGEPCRDSRT 666
WK
Sbjct: 603 ----------------------------------------------WK------------ 662
Query: 667 ADIAISGMEGSGVVELSTGDVHEFRFQAVDESGNPRCLGGDYFETDLSGEWWKSRPFVRD 726
I+I ++ S + LSTGD+HEF FQA+D+SG P CLGGDYFETDLSGE WKSRP ++D
Sbjct: 663 --ISIPRLDNSNQIALSTGDIHEFVFQALDDSGKPHCLGGDYFETDLSGELWKSRPPIKD 722
Query: 727 LGNGTYLFWVQVHPDFAGDYELTVILLFRSFEGLRFSPTRFAYDRELRRIKVRFVRTSVG 786
GNGTY F +QVHPDF G+Y LT+ILLFR FEGL+FS RFA+D+ LR+I + F ++S+
Sbjct: 723 FGNGTYSFSLQVHPDFTGNYTLTIILLFRHFEGLKFSTERFAFDQTLRKIPISFYKSSIL 782
Query: 787 LPEIKSCGRWDFGKEIWTGRWTRHGRNEGCEISDDGRYRCFPPEYPCRRPWCNGSLGLLE 846
PE+ C ++D+ +++W+GRWTRHG+NE C IS+DGRYRC P +PC+RPWC G LGLLE
Sbjct: 783 FPELTQCQKFDYARDVWSGRWTRHGKNENCPISNDGRYRCLEPNFPCQRPWCEGYLGLLE 842
Query: 847 SNGWVYSAHCSFQMFTSDSAWDCLKGRWIFFWGDSNHVDTIRNLLNFVLDLPEIPAVPRR 906
SNGW YS HCSF++F+S++AW+CLK RWIFFWGDSNH DT+RN+ +F+L +PEI VPRR
Sbjct: 843 SNGWTYSTHCSFRVFSSETAWNCLKNRWIFFWGDSNHCDTVRNIFHFILGVPEIEIVPRR 865
Query: 907 FDRNFSNPKNPSQTVRITSIFNGHWNDTQNYEGLNSLRNEGFRNLLYKYFSEQTVPDTII 966
FD N +NPK+ SQTVRIT+IFNGH N + NY+GLNSL++E +R LL +YFS ++VPDT+I
Sbjct: 903 FDMNITNPKDASQTVRITNIFNGHPNASGNYQGLNSLKDEAYRELLKQYFSLESVPDTVI 865
Query: 967 MNSGLHDGVHWLNIRAFSVGAAYAASFWKQLLDSIMQRGLTVPKVFYRTTVATGGYARTL 1026
MNSGLHDG+ W IR F+ GA YAA+FW +L+DS+ QRGL P++ YRTTVATGGYAR L
Sbjct: 963 MNSGLHDGIFWPTIRRFNKGAEYAAAFWAELVDSVRQRGLKPPEIIYRTTVATGGYARRL 865
Query: 1027 AFNPNKMEIFNWVVLEKLKESGIIHGIIDNFDMTFPWHFDNRCNDGVHYGRAPAKLTWRD 1086
AFNP KME FN V L+KL++ G+++ +ID+FDMT+PWHFDNRCNDGVHYGRAPAK+ WRD
Sbjct: 1023 AFNPQKMEAFNGVFLDKLRQFGVVNWVIDDFDMTYPWHFDNRCNDGVHYGRAPAKMRWRD 865
Query: 1087 GQIGHQYFLDLMLAHILLNALCS 1109
G++GHQYF+DLML H+LLNA+C+
Sbjct: 1083 GEVGHQYFVDLMLGHVLLNAICA 865
BLAST of CmoCh06G014400 vs. ExPASy TrEMBL
Match:
A0A0A0L5N9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G150080 PE=4 SV=1)
HSP 1 Score: 1067.0 bits (2758), Expect = 5.3e-308
Identity = 500/602 (83.06%), Postives = 535/602 (88.87%), Query Frame = 0
Query: 508 MPEKGLLRATSQPWLFVSSSLVHSRFWVLTGLVLLSMLAVWSIDGFNVNTFIKFWSSPQD 567
M EKGLL TS PWL SS LVHSRF VLT L+L SMLA+WSIDG ++ FIK WSSPQD
Sbjct: 1 MQEKGLLPVTSNPWLLRSSPLVHSRFGVLTALILFSMLAIWSIDGGHIKIFIKAWSSPQD 60
Query: 568 FVSVSSNFTNTHFNLSTKDSNFTQFVSYKPE-NYPESALFEPVSPHNSTGEPRRIENEPA 627
FVSVSSNFT+TH D NFT +SYKPE NY ES L EPV P N+ EPRR +++PA
Sbjct: 61 FVSVSSNFTDTH------DFNFTPVISYKPEINYQESVLNEPVPPQNAPVEPRRKQSKPA 120
Query: 628 VPYASFSDWFSAELEPNYTSNLLALWKTPGGEPCRDSRTADIAISGMEGSGVVELSTGDV 687
V + SFSDWFSAELEPN+TS+LLA W PGGEPCRD +T DIAISGME +V LSTGDV
Sbjct: 121 VRHDSFSDWFSAELEPNFTSHLLAQWLAPGGEPCRDLKTTDIAISGMESPAIVTLSTGDV 180
Query: 688 HEFRFQAVDESGNPRCLGGDYFETDLSGEWWKSRPFVRDLGNGTYLFWVQVHPDFAGDYE 747
HEFRFQA+DESGNPRCLGGDYFETDLSG WKSRPFV+D GNGTY FW+QVHPDFAGDY
Sbjct: 181 HEFRFQALDESGNPRCLGGDYFETDLSGNLWKSRPFVKDFGNGTYSFWLQVHPDFAGDYN 240
Query: 748 LTVILLFRSFEGLRFSPTRFAYDRELRRIKVRFVRTSVGLPEIKSCGRWDFGKEIWTGRW 807
LTVILLFR FEGLRFSPTRFAYDRELRRIKVRFV+ SV LP+IK C DF ++IWTGRW
Sbjct: 241 LTVILLFRHFEGLRFSPTRFAYDRELRRIKVRFVKNSVVLPKIKMCRSSDFSRDIWTGRW 300
Query: 808 TRHGRNEGCEISDDGRYRCFPPEYPCRRPWCNGSLGLLESNGWVYSAHCSFQMFTSDSAW 867
TRHGRN+ C+ISDDGRYRCF P+YPC+ PWCNG LGLLESNGWVYSAHCSF MF+S SAW
Sbjct: 301 TRHGRNDRCKISDDGRYRCFAPDYPCQSPWCNGPLGLLESNGWVYSAHCSFTMFSSSSAW 360
Query: 868 DCLKGRWIFFWGDSNHVDTIRNLLNFVLDLPEIPAVPRRFDRNFSNPKNPSQTVRITSIF 927
DCLKGRWIFFWGDSNHVDTIRNLLNFVLDLPEIPAVPRRFDRNFSNPKNPSQTVRITSIF
Sbjct: 361 DCLKGRWIFFWGDSNHVDTIRNLLNFVLDLPEIPAVPRRFDRNFSNPKNPSQTVRITSIF 420
Query: 928 NGHWNDTQNYEGLNSLRNEGFRNLLYKYFSEQTVPDTIIMNSGLHDGVHWLNIRAFSVGA 987
NGHWNDTQNYEGLNSLRNEGFR+LL KYFSE+TVPDTIIMNSGLHDGVHWLNIR+FSVGA
Sbjct: 421 NGHWNDTQNYEGLNSLRNEGFRSLLQKYFSEETVPDTIIMNSGLHDGVHWLNIRSFSVGA 480
Query: 988 AYAASFWKQLLDSIMQRGLTVPKVFYRTTVATGGYARTLAFNPNKMEIFNWVVLEKLKES 1047
YAASFWKQ+LDSI QRGLTVPKVFYRTTVATGGYARTLAFNPNKMEIFNWVVLEKLKE+
Sbjct: 481 TYAASFWKQVLDSIKQRGLTVPKVFYRTTVATGGYARTLAFNPNKMEIFNWVVLEKLKEA 540
Query: 1048 GIIHGIIDNFDMTFPWHFDNRCNDGVHYGRAPAKLTWRDGQIGHQYFLDLMLAHILLNAL 1107
GI HG+IDNFDMTFPWHFDNRCNDGVHYGRAPAKL WRDG+IGHQYFLDLMLAHILLNAL
Sbjct: 541 GITHGVIDNFDMTFPWHFDNRCNDGVHYGRAPAKLKWRDGEIGHQYFLDLMLAHILLNAL 596
Query: 1108 CS 1109
C+
Sbjct: 601 CT 596
BLAST of CmoCh06G014400 vs. TAIR 10
Match:
AT3G06150.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G19060.1); Has 61 Blast hits to 59 proteins in 10 species: Archae - 0; Bacteria - 0; Metazoa - 3; Fungi - 0; Plants - 58; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 751.9 bits (1940), Expect = 7.3e-217
Identity = 345/612 (56.37%), Positives = 457/612 (74.67%), Query Frame = 0
Query: 508 MPEKGLL--RATSQPWLFVSSSLVHSRFWVLTGLVLLSMLAVWSIDGFNVNTFIKFWSSP 567
MPEKG++ SQ + S L+ R LT LV ML VWSIDG ++ +F++ W
Sbjct: 1 MPEKGMIFPPVPSQLVILRPSPLLQWRLGALTALVCFLMLVVWSIDGCSIQSFVQPWRFN 60
Query: 568 QDFVSVS---SNFTNTHFNLSTKDSNFTQFVSYKPENYPESALFEP----VSPHNSTGEP 627
V +S S F +T NL VS KP + + P N T
Sbjct: 61 AYSVRISPSPSPFMSTKPNL----------VSEKPHRQNLTLMMAPRNLVPKKTNLTSNS 120
Query: 628 RRIENEPAVPYASFSDWFSAELEPNYTSNLLALWKTPGGEPCRDSRTADIAISGMEGSGV 687
R++ E W +A + N+T+NL+ W PGG PCR+++T +I++ G++G
Sbjct: 121 TRVQFE----------WITAGSQKNFTANLMRGWLAPGGAPCREAKTVEISVPGVDGIDS 180
Query: 688 VELSTGDVHEFRFQAVDESGNPRCLGGDYFETDLSGEWWKSRPFVRDLGNGTYLFWVQVH 747
VEL+ G++HEF+FQA+DESG C+GGDYFETD+SGE WKSRP V+D GNGTY F +QVH
Sbjct: 181 VELTAGEIHEFKFQAIDESGKNVCIGGDYFETDISGENWKSRPPVKDFGNGTYSFSLQVH 240
Query: 748 PDFAGDYELTVILLFRSFEGLRFSPTRFAYDRELRRIKVRFVRT-SVGLPEIKSCGRWDF 807
P+FAGD+ LTVILLFR ++GL+FS +R +DR+LR +++RFV+T V LPE++SC + DF
Sbjct: 241 PEFAGDFNLTVILLFRHYQGLKFSTSRLGFDRKLRNVRLRFVKTPDVTLPELRSCKKSDF 300
Query: 808 GKEIWTGRWTRHGRNEGCEISDDGRYRCFPPEYPCRRPWCNGSLGLLESNGWVYSAHCSF 867
++ W+GRWTR G+N+ C+IS+DGRYRC ++PCR+PWC+G++G +ESNGWVYS+HCSF
Sbjct: 301 NRDAWSGRWTRLGKNDECQISNDGRYRCLAADFPCRKPWCDGAVGAIESNGWVYSSHCSF 360
Query: 868 QMFTSDSAWDCLKGRWIFFWGDSNHVDTIRNLLNFVLDLPEIPAVPRRFDRNFSNPKNPS 927
++F+++ AWDCLKG+WIFFWGDSNHVD+IRNLLNFVL PEIPAVPRRFD FSNPKNPS
Sbjct: 361 KLFSAEKAWDCLKGKWIFFWGDSNHVDSIRNLLNFVLGHPEIPAVPRRFDMKFSNPKNPS 420
Query: 928 QTVRITSIFNGHWNDTQNYEGLNSLRNEGFRNLLYKYFSEQT--VPDTIIMNSGLHDGVH 987
+TVRITSIFNGHWN+T+NY+GL+SL++ FR LL KYF+E+T VPD +I+NSGLHDG+H
Sbjct: 421 ETVRITSIFNGHWNETKNYQGLDSLKDRDFRELLKKYFNEETNRVPDVMIVNSGLHDGIH 480
Query: 988 WLNIRAFSVGAAYAASFWKQLLDSIMQRGLTVPKVFYRTTVATGGYARTLAFNPNKMEIF 1047
W ++RAF+ GA AA+FW+++ D + RGL P+V +R T+ATGGYAR LAFNP+KME F
Sbjct: 481 WTSLRAFAKGAETAAAFWREVFDGVKSRGLQPPEVIFRNTIATGGYARMLAFNPSKMEAF 540
Query: 1048 NWVVLEKLKESGIIHGIIDNFDMTFPWHFDNRCNDGVHYGRAPAKLTWRDGQIGHQYFLD 1107
N V LEK++++G++ ++DNFDMT+PWH+DNRCNDGVHYGRAPAK+ WRDG+IGHQYF+D
Sbjct: 541 NGVFLEKMRDAGLVTSVVDNFDMTYPWHYDNRCNDGVHYGRAPAKMRWRDGEIGHQYFVD 592
BLAST of CmoCh06G014400 vs. TAIR 10
Match:
AT5G19060.1 (CONTAINS InterPro DOMAIN/s: Immunoglobulin-like fold (InterPro:IPR013783); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G06150.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )
HSP 1 Score: 720.7 bits (1859), Expect = 1.8e-207
Identity = 334/584 (57.19%), Positives = 421/584 (72.09%), Query Frame = 0
Query: 526 SSLVHSRFWVLTGLVLLSMLAVWSIDGFNVNTFIKFWSSPQDFVSVSSNFTNTHFNLSTK 585
S L R LT LV ++ VWSID ++ +FIK W
Sbjct: 13 SPLHQWRLSALTSLVFFLIVVVWSIDSCSIRSFIKSW----------------------- 72
Query: 586 DSNFTQFVSYKPENYPESALFEPVSPHNSTGEPRRIENEPAVPYASFSDWFSAELEPNYT 645
+F SY SP + +P R++ W S E E N+T
Sbjct: 73 -----RFNSYS---------IRLTSPPSLDLDPTRVK----------LAWISVEQEQNFT 132
Query: 646 SNLLALWKTPGGEPCRDSRTADIAISGMEGSGVVELSTGDVHEFRFQAVDESGNPRCLGG 705
+N+L W PGGE CR++ T +I++ G+EG G+VEL+ G++HEFRF ++D+SG C+GG
Sbjct: 133 ANVLKNWLAPGGEKCREANTVEISVPGIEGKGLVELTAGEIHEFRFHSLDDSGERVCIGG 192
Query: 706 DYFETDLSGEWWKSRPFVRDLGNGTYLFWVQVHPDFAGDYELTVILLFRSFEGLRFSPTR 765
DYFETDLSGE WKSRP V+DLGNGTY +Q+HPDFAGDY+LTV+LLFR F+GL+ SP R
Sbjct: 193 DYFETDLSGENWKSRPPVKDLGNGTYSLSLQIHPDFAGDYDLTVVLLFRRFQGLKLSPAR 252
Query: 766 FAYDRELRRIKVRFV-RTSVGLPEIKSCGRWDFGKEIWTGRWTRHGRNEGCEISDDGRYR 825
FA++R LR K+RF+ + V LPE++ C DF +++W+GRW R G+N+ CEIS+DGRYR
Sbjct: 253 FAFNRTLRNFKLRFIKKPHVVLPELRRCELSDFDRDVWSGRWIRLGKNDECEISNDGRYR 312
Query: 826 CFPPEYPCRRPWCNGSLGLLESNGWVYSAHCSFQMFTSDSAWDCLKGRWIFFWGDSNHVD 885
C P Y CR PWC+G+L LESNGWVYS+HCSF++F+S+SAWDCLK +WIFFWGDSNHVD
Sbjct: 313 CLPDGYRCREPWCDGALSALESNGWVYSSHCSFKLFSSESAWDCLKNKWIFFWGDSNHVD 372
Query: 886 TIRNLLNFVLDLPEIPAVPRRFDRNFSNPKNPSQTVRITSIFNGHWNDTQNYEGLNSLRN 945
+IRNLLNFVL PEI AVPRRFD FSNPKN S+TVRITSIFNGHWN+TQNY GL+SL +
Sbjct: 373 SIRNLLNFVLGHPEIGAVPRRFDLKFSNPKNSSETVRITSIFNGHWNETQNYLGLDSLED 432
Query: 946 EGFRNLLYKYFSEQT-VPDTIIMNSGLHDGVHWLNIRAFSVGAAYAASFWKQLLDSIMQR 1005
+ FR LL YF E+T VPD +I+NSGLHDG+HW N+RAF+ GA AA+FW+ + DS+ R
Sbjct: 433 DSFRELLKSYFVEETGVPDVMIVNSGLHDGIHWSNLRAFTKGAETAAAFWRNVFDSVKAR 492
Query: 1006 GLTVPKVFYRTTVATGGYARTLAFNPNKMEIFNWVVLEKLKESGIIHGIIDNFDMTFPWH 1065
GL PKV +R T+ATGGYAR LAFNP+KME++N V LEK+K G++ +IDNFDMT+PWH
Sbjct: 493 GLRPPKVIFRNTIATGGYARKLAFNPSKMEVYNGVFLEKMKGLGLVSSVIDNFDMTYPWH 549
Query: 1066 FDNRCNDGVHYGRAPAKLTWRDGQIGHQYFLDLMLAHILLNALC 1108
FDNRCNDGVHYGR PAK+ W DG+IGHQYF+DLML H+LLNA+C
Sbjct: 553 FDNRCNDGVHYGRPPAKVRWIDGEIGHQYFVDLMLVHVLLNAVC 549
BLAST of CmoCh06G014400 vs. TAIR 10
Match:
AT5G19050.1 (alpha/beta-Hydrolases superfamily protein )
HSP 1 Score: 513.8 bits (1322), Expect = 3.3e-145
Identity = 272/353 (77.05%), Positives = 314/353 (88.95%), Query Frame = 0
Query: 1 MNLSLSSSSSSSLPSASASSSSSSP----SSSSTTSWFSGIVRGRPDRSSSVKMSGSSA- 60
M+LSL SSS+++ +AS S SSSSP SSS+TTSWFSGIVRGR D+S + K+S SS+
Sbjct: 1 MSLSLPSSSAAA-AAASTSGSSSSPAAASSSSTTTSWFSGIVRGRGDKSGTAKLSKSSSM 60
Query: 61 --SGFVAGDPPGPVVRKNHFRGLLFKYGPKPIQVAFKTGDYKQQVIFIGGLTDGFMATEY 120
G +GD GP+ KN FRG+LFKYGPK IQVAFKTG+YKQQVIFIGGLTDG +AT+Y
Sbjct: 61 AGGGSGSGDYGGPIKGKNQFRGVLFKYGPKSIQVAFKTGEYKQQVIFIGGLTDGLLATDY 120
Query: 121 LEPLAIALDKEKWSLVQILLSSSYSGYGTSSLQQDAKELDQLVSYLINKEDSEGVVLLGH 180
LEPLAIALDKEKWSLVQ+L+SSSYSG+GTSSL+QDA+E+DQL+++LINKE+SEGVVLLGH
Sbjct: 121 LEPLAIALDKEKWSLVQLLMSSSYSGFGTSSLKQDAQEIDQLINHLINKENSEGVVLLGH 180
Query: 181 STGCQDIVYYMRTNAACSRAVRGAILQAPVSDREYRATLPETAAMIDLASTMINEGRGLD 240
STGCQDIVYYM TNAACSRAVR AILQAPVSDREY+ATLPET AMIDLA+ MI EGRG +
Sbjct: 181 STGCQDIVYYMGTNAACSRAVRAAILQAPVSDREYKATLPETPAMIDLAANMIKEGRGEE 240
Query: 241 LMPREADP-SPITATRYYSLCSYMGDDDMFSSDLSDDQLRMRLGHMANTPCQVIYSMGDE 300
LMPREADP +PI+A RY+SLC+YMGDDDMFSSDLSDDQL+ RLGHMANTPCQVI+SMGDE
Sbjct: 241 LMPREADPCAPISAYRYHSLCAYMGDDDMFSSDLSDDQLKTRLGHMANTPCQVIFSMGDE 300
Query: 301 YVPEYVDKEALVDRLCKAMGGAEKVEIQHGNHSLSNRVNEAVDAIIDFIVRKG 346
YVP+YVDK+ALV+RL KAMGGAEKVEI+HGNHSLSNRV+EAV AII F+ R+G
Sbjct: 301 YVPDYVDKKALVNRLSKAMGGAEKVEIEHGNHSLSNRVHEAVQAIIGFVKREG 352
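The Identity and Positives percentages in the HSP header lines above are the quoted fractions expressed as percentages. A minimal sketch for recomputing them, assuming the header layout shown in this report; the function name `hsp_percentages` and the regular expression are illustrative and do not belong to any BLAST toolkit:

```python
import re

# Hypothetical pattern for the HSP statistics line as printed in this report.
# "Posi?tives" also accepts the misspelling "Postives" that some report
# generators emit.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def hsp_percentages(line):
    """Recompute identity/positives percentages from the fractions in a header line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("no HSP statistics found in line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        # Values recomputed from the integer counts.
        "identity": int(ident) / int(ident_len) * 100,
        "positives": int(pos) / int(pos_len) * 100,
        # Values as printed in the report, kept for comparison.
        "reported_identity": float(ident_pct),
        "reported_positives": float(pos_pct),
    }


stats = hsp_percentages(
    "Identity = 272/353 (77.05%), Positives = 314/353 (88.95%), Query Frame = 0"
)
print(round(stats["identity"], 2), round(stats["positives"], 2))
```

Comparing the recomputed values with the reported ones is a quick consistency check when copying statistics out of a report by hand.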
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
Q9C0Y8 | 9.5e-28 | 30.94 | UPF0613 protein PB24D3.06c OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843...
Q4WF56 | 4.1e-23 | 30.72 | Fusarinine C esterase sidJ OS=Neosartorya fumigata (strain ATCC MYA-4609 / Af293...
Match Name | E-value | Identity | Description
A0A6J1F9E2 | 0.0e+00 | 100.00 | uncharacterized protein LOC111443291 OS=Cucurbita moschata OX=3662 GN=LOC1114432...
A0A6J1I996 | 0.0e+00 | 98.00 | uncharacterized protein LOC111472251 OS=Cucurbita maxima OX=3661 GN=LOC111472251...
A0A5J5AEB9 | 0.0e+00 | 52.88 | Uncharacterized protein OS=Nyssa sinensis OX=561372 GN=F0562_034734 PE=4 SV=1
A0A5J5C3T1 | 0.0e+00 | 53.94 | Uncharacterized protein OS=Nyssa sinensis OX=561372 GN=F0562_000271 PE=4 SV=1
A0A0A0L5N9 | 5.3e-308 | 83.06 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G150080 PE=4 SV=1
Match Name | E-value | Identity | Description
AT3G06150.1 | 7.3e-217 | 56.37 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G19060.1 | 1.8e-207 | 57.19 | CONTAINS InterPro DOMAIN/s: Immunoglobulin-like fold (InterPro:IPR013783); BEST ...
AT5G19050.1 | 3.3e-145 | 77.05 | alpha/beta-Hydrolases superfamily protein
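The pipe-delimited summary rows above can be loaded into structured records for sorting or filtering. The following is a minimal sketch under the assumption that each data row carries the match name, E-value, identity, and description in its first four fields; `BlastHit` and `parse_hits` are hypothetical names introduced here for illustration:

```python
from typing import NamedTuple


class BlastHit(NamedTuple):
    name: str
    evalue: float
    identity: float
    description: str


def parse_hits(lines):
    """Parse pipe-delimited summary rows, skipping the repeated header rows."""
    hits = []
    for line in lines:
        fields = [field.strip() for field in line.split("|")]
        # Skip blank lines, rows with too few fields, and "Match Name" headers.
        if len(fields) < 4 or fields[0] == "Match Name":
            continue
        # Only the first four fields are used; trailing fields are ignored.
        name, evalue, identity, description = fields[:4]
        hits.append(BlastHit(name, float(evalue), float(identity), description))
    return hits


rows = [
    "Match Name | E-value | Identity | Description",
    "AT3G06150.1 | 7.3e-217 | 56.37 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...",
    "AT5G19050.1 | 3.3e-145 | 77.05 | alpha/beta-Hydrolases superfamily protein",
]
# Order hits from most to least significant (smallest E-value first).
for hit in sorted(parse_hits(rows), key=lambda h: h.evalue):
    print(hit.name, hit.evalue, hit.identity)
```

Truncated descriptions (those ending in "...") are kept exactly as they appear in the table; the sketch does not attempt to restore them.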