Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRexonpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GTTTAAAGTTGTAGAAAATAAAAGGAAAAGATTTATTGGAAAAAAAAAGGACCCTAGCCTCCATCTTCTTCTTCCTCTTTCCCTTGTAGCCGCCACCCCCGCCCCCAATCCCCTTCAGCTAGTTTTCTCCGGCGAGCATTCGACTTGTAAACGTGTGGATCCACTCACTGTTTGTTGTTGGACCACAAACTACGCTACCTCACGTTGACGACATGACCCACAAAACAAACCCACGCATTTGAGACGTCTCTCTTTGTTGACGGCTTAGCGAACCCACGTCGACGACCCCGTTCCTCCGGCGACTGACAAGTAGACACCCACATGGACGGTGAGTGTTCTTCCTCGACGATCTAGTTACACACCGACGGGTTTTTGACCCACGGATATGACTTGTATCATGAGTATGACTTCTAACGGCTACGCCTGGGTCTTCGCAGCATTGTTTCTCCCTGTTTTGACAAGTTTCGACATTGACCCACTCTTCGTTTGGCTTCAAATCAAGTACCCATAACGTTGAAGGTCCAGAAGATTTGCAAGTTTGGACGTTTTTGGACTCCCACCAGCAAGATTCGAAGGCTTTTTGGTAAGTTTCGGTATGAGTTATTGCTTTAGTTTTCTTTTCGAATTGGATTTAGGTGTTTGTGTTCTGAACTCTTGGAACTTTTGTTTTTACGTTCGATAGATTACCAAGTATTGGAACTTTTGTTTTAAGTGGAAACAATACAAGTATCTATAGGAAAAAATAAGAAACTATCCATCAGTGTATTATCCTTAAATTTCTACTACTAACCTAGAAAAGCTCCTTTAATCAATCCACAACCATTATGTCACTATCCTCCAATTTAGTAGGTACTACCCTAAATAAAAGCCACTAACTTCAATTAAATAAGAAACTAATTTAATTAATAATTTGCTAAAATAAGTTTGCACTAATTTTTAACACTCCCCCTTAGTGTAAACAAAAGATCAACAACACCAATTTGACTTTACCAAGAGTCTCGATAAATATATCATCAATTTGTTCTCTGGAATGTATCTTCTCCAATTGAATATTTTGCGTCAAGACTTTTTCTCGAATAAAGTGAAAATGTGTTTCAATGTGTTTTGTACGAGCATGAAACACTGGATTTCCTGCAAGCCTGATTGCACTTTCATTATCACAAAAAATGGAAACTGGATAGTTTAAACTGTAAGATTTTTCTCCCATAAGCCTTTTTAGCCAAATACACTCTTGGGTAGCCATAGTAACTGCTACATATTCTGCTTTACTACTTGAAAAAGCAACAGAAGATTGCTTCTTACTGTACCATGAAATAGCTTCTGAACTTATATTGAAACAGTATCCTATTGTGGAATGTCTGTCATTTACGATACCATCTCAATCTGCATAAAATAACCACTTAAAACAAAATATGTAACTTGTTTGTAGAACAAGCCAAAACGTAGGGTGCTCTTCACATATCATAGAATCTTCTTTGTTGCGATTAAATGAGTTTCTCATGGTTTATCCATAAAGTGAGCAACAACACAAATAGAGTAAGAAATGTCCAGCCTTGTGATAGTTAAATAAATTAAACTAAAACAAGTTTTCGAAAAGGTGTTGCATCAGCCGATTGTTTTCCTTCTTCCTTAATCAACTTCAAGGCTGGTTCCATGGGAGTAGTCATTGGTTTTAACTAATTCATGTCAAAACGCTCCATTAAACTTGCTGTCTACCATTTCTGTGAGACAATATCCATCTGACCTTTCTACCTCCAAGTCAAGAAAACATCTTGCTTCACCCAAACTTTTCATTTCAAAACAAATGGACAAAGCATCTCGAAGACTATTAATTTGAACTTCGTCATCACCTGTTATGACCATGTCATCAACATAGAGAAGTAGCATTGTACATCCTGAAGTTTTCTTTTTAACAAATAGGCTTGAATCTGCATTTGAAGATCTAAACCCACAAAATTCAAGATATTGAGAGCAATCTTACCATACCAAGCACGAGGAGCTTGCTTAAGACCATACAAAGCCTTCTTGAGCCAATACACAAAAGTTGGAAACTCCTTAGAAACAAAACCATGAAGTTCTTCCATGAAAATGTCTCGATCCAACTCTCCATAAAGGAAACATTCTTCACATAACTACCACAATTTCCAATCTTTGCAAAGCTAGTGAAATAATTGTCTGGATGGTTACCATTTTAGCAACAGGACTGAATGTCTCTTCATAGTCTAGATCATATTGCTATGAAAAACCACGAGCTACAAGACGAACTTTATACCTGTCAATAGTACCATCAAACTTTTTCTTTGACTTGTAAACCCATTTACAAGCAACCAACTCGATATTCTTTGGCTTTGGAACAAGTTCGCACGTATCATTCTTGCTAAGAGCCAAATTTTCCTCCTTCATTGCTTATTCTCATTCAAGAATGCCTTTAGCTTCATTATAAGTCTGGGGTTCATCTTCACAAATATCACCTATTAAAAAACAAGATACAAATGAACAGTTAAATAATTGTACCTCATACTCAGATAAATAACTAGTTCGTGTTCTCAATCTAGAAGACCTACGAACAACAATTTTAGTATCTTCATTTGCCTCAGCATTTTCTCCAAGATTAACATTAATACCATTCTCGTTTGATACATCATCCGTAGAGAAAGGTGACACGTCAACAATGTCTTCTTTCTTTGAATTTGCATCCATTTGGCATGACGAAACTTCTTCAAACACCACATCTCGAGAAACAAAAACTTTCCTTGTCTTTGGATCCATGCATCTCCATCCTCTCCTGTGAGTCTCATATCCAACAAAAATACAACATTTTGCCTTTGGGTCTAATATAGTACGTTTACTCTTTGAAACATGAGCAAAACAAATTGACCAAAAAAACTTTGGGTTTATGATGATATAGTACTTCAAAAGGAGATAGATTTAATCCTGTCCATGGAGGTAAGCGATTTATGACATGACAAGCTGATTGAATAGATGTTGCCCATAGCTCCCTTAGAAGATTCTTTGCACGCAGCCAAGACAAGCACATGGAGGTAAGATATGTTCAGCAACTCCATTCTGTTATGGAGTGTTAGGACACTACATTTGGCACTGAATCCCATGCTCTTTGCAGTAATTTTGAAATTGATCAGACATATATTCTCCCCCCATTATATGTGCGAAGACCCTCACTTGCTCCTTGAATTGGATAAACTTGGAGAAAGTCTCACTTTTAGCTTCCAAAAAATACACCTAAGTGAAACGAGAAAAATTGTCCACAAAAACCATAACATAGCGACAACTAGAATAGCTGGGTGTTTTAGTTGGTCCATCAAATTTGAATGAATTAACTGCAATGCACCAGTAGATCTGCTATGAATTTGGGAAAGGAAGACGAGGTGACTTACCAAATTGACAACCAGGGCAAACCACATCATGGTGTATTTCTTTAAAGAGAGGAACATTAACAAAAAACTTCTTCATAGAAATTCTTTCCAACAACTGGTAATCAACATGGCCAGCCGAGTATGCCAAAGTGTTGCACAATCATTCTGACCTGTCTTTTCAACATATGCATCCCTTGCAAACAAAACATAGAGAGAATCTTTCCTCTTTCCAGTGAACAAAACATCACGTTCAAATTTCTTAACATCAAAAATAATTTTCACATCATTTGGTCCAAAAGGAGCATACCTCCCAAAGTCGGCAATTTGAGAGATCAAAACCAAATTCTTCTTTAAACCTGAAACATGATGAACATCTGTAGGAGAAACACCACGGTTCTCAACATTAACATGCTCTTCTTCAATAACAGGATGTAAGGAATTGTTAGGCGTCACAATTACTCCTCCTCCTTGGTGAGCACATACATCAGAGAGAAGAGAAACATTTTCAATTGCATGATAAGAACAATTGGAATCAACAATCTAATTTGTATTGTTATCTATTGAAACATTAGCATGCAAATTAATGGATTGATCAATAGTTTCAAGTGATAAGCATTGTTCCCATTTAATTTGCTTAGATTTGTTAATTTCCTCTACGATATTTGCTTCTGCTTCTTTGAGATCCACACGACAACTTGGTCTAATATGACCTAATTTTCCACAACGATGACATACTACCTTCACCCGACAATCATGTTTGATGTGTCCTGGCGTGCCGCACCTAAAACATCCCTTCAATTTGCCCGTGGACTGCCCTTCAACCTTGGAATCATTGCTGTCATTTGAAGAACGTTTGGAAGAATAATTGACTCTTCTCTTGTCTTTTGCATAAAGAATAGCTTCCACTTCGGGATGAGATTGCTTGTTGTTGCCAAGCATTTGTTTTATCAATGCTTCTTGATATGAGAGCAAATTCTCCAACTCAACAATAGAAGGTTGATTTACCCAACCTTGTATCGAAGAAATAAATGGCATAAACTCCTTTCGCAGTCCACGAATAAGATAACAACAGGAACGAGCATTGCTAATAGGCTCCTCTTTGTCCAATTCTGAAATTTTAGAACACACTCTTTTTTTTCAGAAAATATTCTGAAACTGAAAAATTACCTTGAGTCGTTCCAGCGAGTTCGTTTTCCAAATACTACAATCTCATTGTGTTCTACTGATTGAACGACCTTTCAAGTGTTTCCCACACTTTCATTGGAGAATTCTCATCACAAACATGCTCTCTATATACTCCTTGCTGATAGAAGTTCACAAAGCAAATAAGGCTTTACTGCACTTGACCTTCCACTTCCTTTGTGCCTCATCATTCTCCAGAGTATCTTCTTGAATTGAATGATCTTCGGATTGTCCTTGAAGAAAAGCTTCCATAATAGCCTCCAATAACTATAATTATCACCAGTAAGCTTGTCTACACAAATGTTGTTTCCACCATTCATCTTCACTGAGTATTTAAACTCAAACACAAAACAAAGAGCTTGAATGCTCTGATACCGTGTAACGAAAACAACAAATCAAAGAGAACGGAAGGTGGAAACAAGAGCGAAAATATTGCATGGAGCGCTAAGTGTCAATTGCGTTCTACGTCCATTTAAGTGGAAACAATACAAGTATTTATAGGAAAAATAAGAAACTACTCACTAATGCATTATCCTCAATTTCTACCACTAACCCAGAAAAGCTCCCTTAATCAAACCACTGAACCATTATGCCACCATCCTCCAATTTAGGAGGCACTACCCTAGATAAAGGCCATTAACTTCAATTAAAGAAGAAACTAATTAAATTAATAATTTGCAAAATAAGTTTGCACTAATTTTTAACAGATTAATCACGTTTCGGACTAGTTTGGACTCAATTAAAGCATGTTAGAGCAGTATTTGACTTGTGAAAATTTGTTGGTTGTAGTTTTGGGTTAATTTGGAGTAACTAATTCAAGTTTATGGTGGATTTTGGTTAACGATATTCTTGTTGAAAAGTCTCTCGATGTTGATTAAGGTTGTCGAACTAAGTTCTAACACCTGAATATTGTGTTTATTGTTTAGGCTTTCTAAAGGTTGAAGTTCGTTCAAGGAAGTGGATTGAGAATTGAGAATTATAGGATTGTTGATTGAGGTAAGTAGTTTCCACACCCCTTCCAAAATATAAATTTTAACTGTTGTGGTTGGATGAAATTTGTAATATGTTTGAAAATATGTTTGTACCATTTCTTATAAAAGAAAGTATGTTTATACCATTTGAAATTGTTAGCATGCTTGAAACTATGAATCTTTCTATTAGTATTCATATTTTTATTGAGATTAATAGGGTTCTTTTTTTTTTTTTTTTCTTCTTTTTTTTTTTAAAAAAAAAGTTTTATGAGTACTTGTAAAAGGAAATTGGAGATATAGATTACTTTTTAATATTGATAGTGCACCCTCCAATTTGTAGGTCTCAAGGCAATGAAGAAGGAAATTCTTAAATCGCCACCTATCACAGCTTTCACTCCAGACGGTGACTATCTCGCCATCTTGTCCTCAAATGGAACTCTAAAGGCAAGCTCCTTCTCTTTCTGTTGTTTCTATGCTTTTGATTTTTCTCATGTTGGTTTTCACTCGGAATATCCAACTAGTTCATCCACTTCCGCTTTTTGGACCTCATTGTAGTTGCAAATATTTGTTTTTTCTCATGCTCTTGGTCATGTGAATGTTAATTACTTACGAACTCGATGGGAACATTTATATGTATCTGTAGATTTGGAGTACCCGTGATGGGAGTTTACTGGCAGAATGGAAGGATCCCGATGGAAAAAATGATGTTGGTTATTCCTGTATGGCATGCTGTTTTCAGGGGAAAAAGGTGAGGAGACCATTTGGTTTATCATCAAGTTTAATTCTTTGACACTGTTTTGTAAGTGCTGGGAATCTTAAACCTGTAATGTCTTTCTGCAATTTCAGCGTAAAAACAGTTACTGTGTAGTTGCTGTTGGTACCAATAGTGGGGATGTGTTGGCTGTGAATGCTTCAAACGGTGAGAGAAAGTGGGTCTCCGCAGGTTGCCATCTTGGGTACGCAATCAATTCTGCCTACTCTTGATTACTTAGTATTTTGAAGAATGTTAAGTGATCAGATGAGTTAGCTTAAGGATGCAAACTTTATGTACTTTAATGATCTTATATTGGCCGTGTTAGTTGCTTCATGTTGATGCATTCATTGTGGCAATACGTTTAGATGGGAGAAAAATGATATGATAATATAGTGTTACATTGATGAATACGTTGGAGGTTCCCCCTTGTTCACACGCCTTAAATATATATATTTCACGATCAATTTCATTCAGTCAATGCAATGGTTTCTTTTTTAAAAAAACACATGTTTTGTGATCAAGTCATCTCACGTGAGTTCATCAACTTATGGTTCTTCTACAGAAAACCATGCTTTGTTTGTCATTCTATTCATTATGGTGAGTGAATCACTTTTTTATCCTGATTTATGAAGCGGAGTTATTGGCCTTTCTTTTGCGAACAAAGGCCGTAGACTGCATGCGGTTGGAAGTAATGGAAAGGTCTCTGAGATGGACACTGAAACAGGAAACATTATCAAGGAGTTCAAAGCTTCGAAAAAATCAATCTCTTCTTCATCCTTTTCACTTGGTTAGTTTCCATATCCATGTTCTGCTACGTGAACAATGAAAGCACCTTTTTTGACGTAACGAGTCACTCTTTGGCCAGATGAGAAGTACTTAGCTGTAGCTGGTAAAAAGTTAAAAATTTTAAGCAAAGATGATGGGGATGAGCTTATGGTGCATCCTGATAAATTGGTGAGCTGCTCAAATTCTCTTACTCTTAAATACGTCCAAATCACTTACTATAGTATACTATACAAAATATAGGGTCCTGTGAAGCTTGTTTCTCTATCTGATGATGCCAAAACAATAATTACGTCAGAACTTGGAGCCAAACATCTTCAAGTGTGGTGGTGTGATATGAGTGCTGGAAAACTTAGTAGGGGTCCTGTTCTCTCCATGAAGCATCCTCCGTTTGTCTCTGAATGCAGAAATGTTAGCAATCAAGAAGACAACGTGGTTGTCTTGTCAGTATCAGTATCAGGTGTAGCTTATTTATGGAAATTAAAGATACTATCTGAAGATGAGGTTAGTCCAACTAAAGTCTCTGTAAAAGCTAATGACAACCAATCAGCTGAGGAAAACCATGGAAGTGCTAAGAAGAATCGAGTTTCTGTCATTGCTTCCAGAATAAATGGTGTAGGAGACAATGAAGTGTCAGTTCTTGTTACTCATGGCTCCATGGACCTACCACAGCATAGTCTTTTTAGTATTGGTTATTCCGTGAAGGAAGATGTGAACACTGCACGTGGGAATAAAACACTCCAACAAAATGATGATTGTTCTGAACAAGGTTTGACAAAATATCATGTTTGCCATTTTCCTTATTTCTTCCCTATCCTTGTTCTCATACAGGTAATCACATTGCTTATACTTTTTACAGGTCCCCATGAGGTTGAGCAAGAAGTGGTTACGCCGAAAAGTAAGAAAGGCAAAAAGAAAAGAGCAGCATCTGATCTTGATAGTCTGACAACTGGAGATATCAGTGATGTTGGTAAGGATTATCTTTGAGCTATTGCTTGTGGCTCTTGAATTGTCATTGAGCATCTACAACTGTCCATCTCTAAGCAAATAGTGTAATATTCTTTTTAGTTTTCACAGCTGACGTGATATCTGGTGATGTAGGCAATGGAGATGCTTCTGATGTAATATTTAATGATGATTTAAATGAGCCATCCATGGGAGAGAAACTTGCGAGTTTGAATTTGGTGGACCAGAATGAGGATGAAGGTCGTGAAAAAGAACCTTCCGTCCCTGCAATACCACCAAGTGCAGACTCTGTTCAGGTTTTGCTTAAGCAAGCGCTACATGCAGAGGATCGCGTCCTTTTGCTAGAATGCTTATATACCAAGGATGATAAGGTTATCACAGTCGTATACCATTAACTTCGTATACGAACCATTTTGGTCTTAATCCATCATATATGTCTGTTTTCAGGTTATTTCAAAATCAATAGCACAATTAAATTCATCTGATGTTCTCAAGCTTTTGCATTCTCTGATATCCTTTATCCAGTCAAGGTAATCTCACTTGTCCTAATTTTGCTTGATGCTTGCTAAGGTCTTGCTTTTCTTATATTTAAATCATATCCATAAAATCAAGAATACTATACCGAAATTTTCATGTCTTATTTACAGAGGTGCAATTCTTGTATGCGTCCTCCCTTGGCTGAGAGGCTTACTTCTCCAACATGCTAGTAAAATTATGTCTCAAGAATCTTCCCTCCTTGCCTTGAACTCTCTATATCAGGTTAGCAATGGTTTGTTTAATACGATTGTTTAAATCATCAGCTTGAAACTTTTTTTGCCATCTCAAATATGCAGTCAATGAGCTGTCTCTTTCACTACCATTAATCATTATTCTCTAATAAATAAATACAAATATACAATTTCTATGTATGTGCAGCTCATTGAGTCTAGAATTTCAACTTTCCAATCCGCTCTACTGCTATCAAGTAGCTTAGACTTCCTCTACTCAGGGGTAAGTGCATTATTTTCCTTCCACACATTACATGGCGAAATTCGAGTTTCGCAGAAACCTTCTTCAAGCTCAATGAGGCGAATAACTCCTGTCTTCTTTCTGGCAGGTCCTTGATGAGGAGGTAGACGAAAATGATGCCATTGTGCCAATTATTTACGAGGAGGACGATAGCGACGATAAGGAATCGGGAGATGAAATGGAAACTGATGAAGACGAGGAAAGGGATGAAGTAGAAGCCTTTGACGATCTTAGTGCTGGTGAAGTGGATGATGACATGAGTGAGTGATTACGAAGCTACAAAATGCTTTGTGGGGTTTTTATTTTTTGACCTTCCTTTTTTCCGAGATGAGCTTGATCTGTAATTTTGACATGATTAATTCTCCACATGAATGAGAATTATATAAACCCATTGGCCCAAGTTGCAAACTTGTTATGTGTTCTTTAGTTCTTTAAAAGGCAGCAAATTTTCATCTCGAGTGTAAAATCTGAAGGCAGAATCTTTATATTGAATAAAACCATCAACCAATTCAGATGCAAACGTTGTCATCTTGAATTTAAGATTTGAAGGCAGTACCCTTGGCGTAAACTCTAATAAATTGTAATATTAAAAGA
mRNA sequence
GTTTAAAGTTGTAGAAAATAAAAGGAAAAGATTTATTGGAAAAAAAAAGGACCCTAGCCTCCATCTTCTTCTTCCTCTTTCCCTTGTAGCCGCCACCCCCGCCCCCAATCCCCTTCAGCTAGTTTTCTCCGGCGAGCATTCGACTTGTAAACGTGTGGATCCACTCACTGTTTGTTGTTGGACCACAAACTACGCTACCTCACGTTGACGACATGACCCACAAAACAAACCCACGCATTTGAGACGTCTCTCTTTGTTGACGGCTTAGCGAACCCACGTCGACGACCCCGTTCCTCCGGCGACTGACAAGTAGACACCCACATGGACGCATTGTTTCTCCCTGTTTTGACAAGTTTCGACATTGACCCACTCTTCGTTTGGCTTCAAATCAAGTACCCATAACGTTGAAGGTCCAGAAGATTTGCAAGTTTGGACGTTTTTGGACTCCCACCAGCAAGATTCGAAGGCTTTTTGGCTTTCTAAAGGTTGAAGTTCGTTCAAGGAAGTGGATTGAGAATTGAGAATTATAGGATTGTTGATTGAGGTCTCAAGGCAATGAAGAAGGAAATTCTTAAATCGCCACCTATCACAGCTTTCACTCCAGACGGTGACTATCTCGCCATCTTGTCCTCAAATGGAACTCTAAAGATTTGGAGTACCCGTGATGGGAGTTTACTGGCAGAATGGAAGGATCCCGATGGAAAAAATGATGTTGGTTATTCCTGTATGGCATGCTGTTTTCAGGGGAAAAAGCGTAAAAACAGTTACTGTGTAGTTGCTGTTGGTACCAATAGTGGGGATGTGTTGGCTGTGAATGCTTCAAACGGTGAGAGAAAGTGGGTCTCCGCAGGTTGCCATCTTGGCGGAGTTATTGGCCTTTCTTTTGCGAACAAAGGCCGTAGACTGCATGCGGTTGGAAGTAATGGAAAGGTCTCTGAGATGGACACTGAAACAGGAAACATTATCAAGGAGTTCAAAGCTTCGAAAAAATCAATCTCTTCTTCATCCTTTTCACTTGATGAGAAGTACTTAGCTGTAGCTGGTAAAAAGTTAAAAATTTTAAGCAAAGATGATGGGGATGAGCTTATGGTGCATCCTGATAAATTGGGTCCTGTGAAGCTTGTTTCTCTATCTGATGATGCCAAAACAATAATTACGTCAGAACTTGGAGCCAAACATCTTCAAGTGTGGTGGTGTGATATGAGTGCTGGAAAACTTAGTAGGGGTCCTGTTCTCTCCATGAAGCATCCTCCGTTTGTCTCTGAATGCAGAAATGTTAGCAATCAAGAAGACAACGTGGTTGTCTTGTCAGTATCAGTATCAGGTGTAGCTTATTTATGGAAATTAAAGATACTATCTGAAGATGAGGTTAGTCCAACTAAAGTCTCTGTAAAAGCTAATGACAACCAATCAGCTGAGGAAAACCATGGAAGTGCTAAGAAGAATCGAGTTTCTGTCATTGCTTCCAGAATAAATGGTGTAGGAGACAATGAAGTGTCAGTTCTTGTTACTCATGGCTCCATGGACCTACCACAGCATAGTCTTTTTAGTATTGGTTATTCCGTGAAGGAAGATGTGAACACTGCACGTGGGAATAAAACACTCCAACAAAATGATGATTGTTCTGAACAAGGTCCCCATGAGGTTGAGCAAGAAGTGGTTACGCCGAAAAGTAAGAAAGGCAAAAAGAAAAGAGCAGCATCTGATCTTGATAGTCTGACAACTGGAGATATCAGTGATGTTGGCAATGGAGATGCTTCTGATGTAATATTTAATGATGATTTAAATGAGCCATCCATGGGAGAGAAACTTGCGAGTTTGAATTTGGTGGACCAGAATGAGGATGAAGGTCGTGAAAAAGAACCTTCCGTCCCTGCAATACCACCAAGTGCAGACTCTGTTCAGGTTTTGCTTAAGCAAGCGCTACATGCAGAGGATCGCGTCCTTTTGCTAGAATGCTTATATACCAAGGATGATAAGGTTATTTCAAAATCAATAGCACAATTAAATTCATCTGATGTTCTCAAGCTTTTGCATTCTCTGATATCCTTTATCCAGTCAAGAGGTGCAATTCTTGTATGCGTCCTCCCTTGGCTGAGAGGCTTACTTCTCCAACATGCTAGTAAAATTATGTCTCAAGAATCTTCCCTCCTTGCCTTGAACTCTCTATATCAGCTCATTGAGTCTAGAATTTCAACTTTCCAATCCGCTCTACTGCTATCAAGTAGCTTAGACTTCCTCTACTCAGGGGTCCTTGATGAGGAGGTAGACGAAAATGATGCCATTGTGCCAATTATTTACGAGGAGGACGATAGCGACGATAAGGAATCGGGAGATGAAATGGAAACTGATGAAGACGAGGAAAGGGATGAAGTAGAAGCCTTTGACGATCTTAGTGCTGGTGAAGTGGATGATGACATGAGTGAGTGATTACGAAGCTACAAAATGCTTTGTGGGGTTTTTATTTTTTGACCTTCCTTTTTTCCGAGATGAGCTTGATCTGTAATTTTGACATGATTAATTCTCCACATGAATGAGAATTATATAAACCCATTGGCCCAAGTTGCAAACTTGTTATGTGTTCTTTAGTTCTTTAAAAGGCAGCAAATTTTCATCTCGAGTGTAAAATCTGAAGGCAGAATCTTTATATTGAATAAAACCATCAACCAATTCAGATGCAAACGTTGTCATCTTGAATTTAAGATTTGAAGGCAGTACCCTTGGCGTAAACTCTAATAAATTGTAATATTAAAAGA
Coding sequence (CDS)
ATGAAGAAGGAAATTCTTAAATCGCCACCTATCACAGCTTTCACTCCAGACGGTGACTATCTCGCCATCTTGTCCTCAAATGGAACTCTAAAGATTTGGAGTACCCGTGATGGGAGTTTACTGGCAGAATGGAAGGATCCCGATGGAAAAAATGATGTTGGTTATTCCTGTATGGCATGCTGTTTTCAGGGGAAAAAGCGTAAAAACAGTTACTGTGTAGTTGCTGTTGGTACCAATAGTGGGGATGTGTTGGCTGTGAATGCTTCAAACGGTGAGAGAAAGTGGGTCTCCGCAGGTTGCCATCTTGGCGGAGTTATTGGCCTTTCTTTTGCGAACAAAGGCCGTAGACTGCATGCGGTTGGAAGTAATGGAAAGGTCTCTGAGATGGACACTGAAACAGGAAACATTATCAAGGAGTTCAAAGCTTCGAAAAAATCAATCTCTTCTTCATCCTTTTCACTTGATGAGAAGTACTTAGCTGTAGCTGGTAAAAAGTTAAAAATTTTAAGCAAAGATGATGGGGATGAGCTTATGGTGCATCCTGATAAATTGGGTCCTGTGAAGCTTGTTTCTCTATCTGATGATGCCAAAACAATAATTACGTCAGAACTTGGAGCCAAACATCTTCAAGTGTGGTGGTGTGATATGAGTGCTGGAAAACTTAGTAGGGGTCCTGTTCTCTCCATGAAGCATCCTCCGTTTGTCTCTGAATGCAGAAATGTTAGCAATCAAGAAGACAACGTGGTTGTCTTGTCAGTATCAGTATCAGGTGTAGCTTATTTATGGAAATTAAAGATACTATCTGAAGATGAGGTTAGTCCAACTAAAGTCTCTGTAAAAGCTAATGACAACCAATCAGCTGAGGAAAACCATGGAAGTGCTAAGAAGAATCGAGTTTCTGTCATTGCTTCCAGAATAAATGGTGTAGGAGACAATGAAGTGTCAGTTCTTGTTACTCATGGCTCCATGGACCTACCACAGCATAGTCTTTTTAGTATTGGTTATTCCGTGAAGGAAGATGTGAACACTGCACGTGGGAATAAAACACTCCAACAAAATGATGATTGTTCTGAACAAGGTCCCCATGAGGTTGAGCAAGAAGTGGTTACGCCGAAAAGTAAGAAAGGCAAAAAGAAAAGAGCAGCATCTGATCTTGATAGTCTGACAACTGGAGATATCAGTGATGTTGGCAATGGAGATGCTTCTGATGTAATATTTAATGATGATTTAAATGAGCCATCCATGGGAGAGAAACTTGCGAGTTTGAATTTGGTGGACCAGAATGAGGATGAAGGTCGTGAAAAAGAACCTTCCGTCCCTGCAATACCACCAAGTGCAGACTCTGTTCAGGTTTTGCTTAAGCAAGCGCTACATGCAGAGGATCGCGTCCTTTTGCTAGAATGCTTATATACCAAGGATGATAAGGTTATTTCAAAATCAATAGCACAATTAAATTCATCTGATGTTCTCAAGCTTTTGCATTCTCTGATATCCTTTATCCAGTCAAGAGGTGCAATTCTTGTATGCGTCCTCCCTTGGCTGAGAGGCTTACTTCTCCAACATGCTAGTAAAATTATGTCTCAAGAATCTTCCCTCCTTGCCTTGAACTCTCTATATCAGCTCATTGAGTCTAGAATTTCAACTTTCCAATCCGCTCTACTGCTATCAAGTAGCTTAGACTTCCTCTACTCAGGGGTCCTTGATGAGGAGGTAGACGAAAATGATGCCATTGTGCCAATTATTTACGAGGAGGACGATAGCGACGATAAGGAATCGGGAGATGAAATGGAAACTGATGAAGACGAGGAAAGGGATGAAGTAGAAGCCTTTGACGATCTTAGTGCTGGTGAAGTGGATGATGACATGAGTGAGTGA
Protein sequence
MKKEILKSPPITAFTPDGDYLAILSSNGTLKIWSTRDGSLLAEWKDPDGKNDVGYSCMACCFQGKKRKNSYCVVAVGTNSGDVLAVNASNGERKWVSAGCHLGGVIGLSFANKGRRLHAVGSNGKVSEMDTETGNIIKEFKASKKSISSSSFSLDEKYLAVAGKKLKILSKDDGDELMVHPDKLGPVKLVSLSDDAKTIITSELGAKHLQVWWCDMSAGKLSRGPVLSMKHPPFVSECRNVSNQEDNVVVLSVSVSGVAYLWKLKILSEDEVSPTKVSVKANDNQSAEENHGSAKKNRVSVIASRINGVGDNEVSVLVTHGSMDLPQHSLFSIGYSVKEDVNTARGNKTLQQNDDCSEQGPHEVEQEVVTPKSKKGKKKRAASDLDSLTTGDISDVGNGDASDVIFNDDLNEPSMGEKLASLNLVDQNEDEGREKEPSVPAIPPSADSVQVLLKQALHAEDRVLLLECLYTKDDKVISKSIAQLNSSDVLKLLHSLISFIQSRGAILVCVLPWLRGLLLQHASKIMSQESSLLALNSLYQLIESRISTFQSALLLSSSLDFLYSGVLDEEVDENDAIVPIIYEEDDSDDKESGDEMETDEDEERDEVEAFDDLSAGEVDDDMSE
Homology
BLAST of Bhi09G000044 vs. TAIR 10
Match:
AT1G15420.1 (CONTAINS InterPro DOMAIN/s: Small-subunit processome, Utp12 (InterPro:IPR007148); Has 764 Blast hits to 656 proteins in 193 species: Archae - 0; Bacteria - 42; Metazoa - 237; Fungi - 154; Plants - 85; Viruses - 23; Other Eukaryotes - 223 (source: NCBI BLink). )
HSP 1 Score: 192.2 bits (487), Expect = 1.3e-48
Identity = 132/260 (50.77%), Postives = 181/260 (69.62%), Query Frame = 0
Query: 372 KSKKGKKKRAASDLDSLTTGDISDVGNGDASDVIFNDDLNEPSMGEKLASLNLVD----Q 431
K KK KKRA + D +T D + D V+ +D LNEP++G+KL SL+L++
Sbjct: 27 KHKKKSKKRAEPEPDIPSTRDSG--LDEDRDGVLVDDTLNEPTIGDKLESLDLLNGEKVN 86
Query: 432 NEDEGREKEPSVPAIPPSADSVQVLLKQALHAEDRVLLLECLYTKDDKVISKSIAQLNSS 491
+E+ R+ P PP+A SV VLL+QALHA+DR LLL+CLY +D++VI+ S+A+LNS+
Sbjct: 87 SEESNRDSAPGDDK-PPTAASVNVLLRQALHADDRSLLLDCLYNRDEQVIANSVAKLNSA 146
Query: 492 DVLKLLHSLISFIQSRGAILVCVLPWLRGLLLQHASKIMSQESSLLALNSLYQLIESRIS 551
+VLKLL++L+ +QSRGAIL C +PW++ LLL H+S IMSQESSLLALN++YQLIESR+S
Sbjct: 147 EVLKLLNALLPILQSRGAILACTIPWIKSLLLTHSSGIMSQESSLLALNTMYQLIESRVS 206
Query: 552 TFQSALLLSSSLDFLYSGVLDEEVDENDAIVPIIYEEDDSD-DKESGDE--METDEDEER 611
T +A+ +SS LD + LDEE DE P+IYE+ DSD D+E G E METDE+ +
Sbjct: 207 TIHTAVEVSSGLDLIVDD-LDEEEDEG----PVIYEDKDSDEDEEEGIEEAMETDEEADD 266
Query: 612 DEVEAFDDLSAGEVDDDMSE 625
EA D ++ E DDMS+
Sbjct: 267 SADEAADGVNDFEGFDDMSD 278
BLAST of Bhi09G000044 vs. TAIR 10
Match:
AT5G11240.1 (transducin family protein / WD-40 repeat family protein )
HSP 1 Score: 158.7 bits (400), Expect = 1.5e-38
Identity = 162/631 (25.67%), Postives = 284/631 (45.01%), Query Frame = 0
Query: 11 ITAFTPDGDYLAILSSNGTLKIWSTRDGSLLAEWKD-------------PDGKNDVGYSC 70
+T+F+P DYLA+ + +G +KIW T G + E+ D G V Y+C
Sbjct: 10 LTSFSPALDYLALSTGDGRIKIWDTVKGQVQTEFADIASTEETNIYTKVGKGHLSVDYTC 69
Query: 71 M--ACCFQGKKRKNSYCVVAVGTNSGDVLAVNASNGERKWVSAGCHLGGVIGLSFANKGR 130
M + KKRK V+ +GT GDVLA++ ++G+ KW + CH GGV +S + K
Sbjct: 70 MKWLSLEKKKKRKLGTSVLVLGTGGGDVLALDVASGQLKWRISDCHPGGVNAVSSSAKAS 129
Query: 131 RLHAVGSNGKVSEMDTETGNIIKEFKASKKSISSSSFSLDEKYLAVAGKKLKILSKDDGD 190
+++ G++G V ++D +GN+I++FKAS K++SS S D K L A +LK + D
Sbjct: 130 CIYSGGADGMVCQIDPHSGNLIRKFKASTKTVSSLCVSPDGKILVTASTQLKTFNCSDLK 189
Query: 191 ELMVHPDKLGPVKLVSLSDDAKTIITSELGAKHLQVWWCDMSAGKLSRGPVLSMKHPPFV 250
++ G V+ V+ ++D K +++S +G +++ VW D A K S VL+++HPP
Sbjct: 190 KIQKFTGHPGVVRCVAFTEDGKYVLSSAVGERYIAVWKTD-GAKKQSASCVLALEHPPVF 249
Query: 251 SECRNVSNQEDNVVVLSVSVSGVAYLWKLKILSE-DEVSPTKVSVKANDNQSAEENHGSA 310
+ +N E + VL++S GV Y W + E +PTKV++ +A+ +
Sbjct: 250 VDSWGETN-EKGLYVLAISEIGVCYFWYGSNVEELCNATPTKVAL-----ATADSSLKPY 309
Query: 311 KKNRVSVIASRINGV-GDNEVSVLVTHGSMDLPQHSLFSIGYSVKEDVNTARGNKTLQQN 370
K + + A+++ G+ + G + P + + +N ++ L
Sbjct: 310 KSSLPLIFAAKLQGILKPGSAHAFIASGLLVKPSFQKMVLQFGNDLVLNASKDGILLPIT 369
Query: 371 DDCSEQGPHEVEQEVVTPKSKKGKK------KRAASDLDSLTTGDISDVGNGDASDVIFN 430
S+ + Q VT + + R A + + S + D
Sbjct: 370 QSVSKSSKRQGVQNKVTTLDRAHAEDALLPIARVADLHEKKSVQLHSSDKDTYMVDQSHA 429
Query: 431 DDLNEPSMGEKLASLNLVDQNEDEGREKEPSVPAIPPSADSVQVLLKQALHAEDRVLLLE 490
D + SM +KL SL ++ ++ K S +I D LK L
Sbjct: 430 DHVETFSMEDKLRSLGILGGTDE---HKNLSYASIIDGTD-----LKAYL---------- 489
Query: 491 CLYTKDDKVISKSIAQLNSSDVLKLLHSLISFIQSRGAILVCVLPWLRGLLLQHASKIMS 550
K + ++ + S K L +L + Q+R +LPW+ +++ H+ IMS
Sbjct: 490 -----PPKKLKSAVLSMEPSTAFKTLEALAAMWQTRACGGRHLLPWIYSIMVNHSHYIMS 549
Query: 551 QE-SSLLALNSLYQLIESRISTFQSALLLSSSLDFLYSGV----------------LDEE 602
QE + LN+L ++ +SR + Q L LS L + + + +DE
Sbjct: 550 QEPKNQQLLNTLVKITKSRGTALQQLLQLSGRLQLVTAQINKAAGSQTQITAHDQEIDES 609
BLAST of Bhi09G000044 vs. ExPASy Swiss-Prot
Match:
Q15061 (WD repeat-containing protein 43 OS=Homo sapiens OX=9606 GN=WDR43 PE=1 SV=3)
HSP 1 Score: 67.0 bits (162), Expect = 8.6e-10
Identity = 148/667 (22.19%), Postives = 263/667 (39.43%), Query Frame = 0
Query: 13 AFTPDGD-YLAILSSNGTLKIWSTRDGSLLAEWKDPDGKNDVGYSCMACC---------F 72
AF+P Y A+ S++G L++W T + L E+ P +C+A
Sbjct: 20 AFSPHSQAYFALASTDGHLRVWETANNRLHQEYV-PSAHLSGTCTCLAWAPARLQAKESP 79
Query: 73 QGKKRK-------NSYCVVAVGTNSGDVLAVNASNGE--RKWVSAGCHLGGVIGLSFANK 132
Q KKRK N ++A+GT G +L + GE K +S G H V + +
Sbjct: 80 QRKKRKSEAVGMSNQTDLLALGTAVGSILLYSTVKGELHSKLISGG-HDNRVNCIQWHQD 139
Query: 133 GRRLHAVGSNGKVSEMDTETGNIIKEFKASKKSISSSSFSLDEKYLAVAGKKLKILSKDD 192
L++ + + E + +T + ++K S+SS S D K L AG+ +K+ +
Sbjct: 140 SGCLYSCSDDKHIVEWNVQTCKVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLET 199
Query: 193 GDELMVHPDKLGPVKLVSLSD----------DAKTIITSELGAKH---LQVWWCDMSAGK 252
+ PV + + D T + GA H L VW +
Sbjct: 200 KEVYRHFTGHATPVSSLMFTTIRPPNESQPFDGITGLYFLSGAVHDRLLNVWQVRSENKE 259
Query: 253 LSRGPVLSMKHPPFVSECRNVSNQEDNVVVLSVSVSGVAYLWKLKILSEDEVSPTKVSVK 312
S ++ P + N+E+ V + V G +L++ IL+ P
Sbjct: 260 KSAVMSFTVTDEPVYIDLTLSENKEEPVKLAVVCRDGQVHLFE-HILNGYCKKPL----- 319
Query: 313 ANDNQSAEENHGSAKKNRVSVIASRINGVGDNEVSVLVTHGSMDLPQHSLFSIG-----Y 372
++ G KK+ I G +++S+L+ +GS P ++
Sbjct: 320 TSNCTIQIATPGKGKKSTPKPIPILAAGFCSDKMSLLLVYGSWFQPTIERVALNSREPHM 379
Query: 373 SVKEDVNTARGNK---------TLQQNDDCSEQGP----HEVEQEVVTPKSKKGKKKRAA 432
+ D++ K T N + P H + P++++ + KR +
Sbjct: 380 CLVRDISNCWAPKVETAITKVRTPVMNSEAKVLVPGIPGHHAAIKPAPPQTEQVESKRKS 439
Query: 433 SDLDSLTTGDISDVGNGDASDVIFNDDLNEPSMGEKLASLNLVDQNEDEGREKEPSVPAI 492
NE S+ E+L ++++ +G+E
Sbjct: 440 GG--------------------------NEVSIEERLGAMDI--DTHKKGKE-------- 499
Query: 493 PPSADSVQVLLKQALHAEDRVLLLECLYTKDDKVISKSIAQLNSSDVLKLLHSLISFIQS 552
+S VLL Q L + D +L + L T++ +I K++ ++ ++ LL L +Q
Sbjct: 500 DLQTNSFPVLLTQGLESNDFEMLNKVLQTRNVNLIKKTVLRMPLHTIIPLLQELTKRLQG 559
Query: 553 RGAILVCVLPWLRGLLLQHASKIMSQESSLLALNSLYQLIESRISTFQSALLLSSSLDFL 612
V ++ WL+ +L HAS + + + L +LYQL+ESR+ TFQ L L L
Sbjct: 560 HPNSAVLMVQWLKCVLTVHASYLSTLPDLVPQLGTLYQLMESRVKTFQKLSHLHGKLILL 619
Query: 613 YSGVLDEEVDENDAI----VPIIYE----EDDSDDKESGDEMETDEDEERDEVEAFDDLS 622
+ V E + ++YE E++SDD+ + + E + DE+ +E E+ D
Sbjct: 620 ITQVTASEKTKGATSPGQKAKLVYEEESSEEESDDEIADKDSEDNWDEDEEESESEKDED 642
BLAST of Bhi09G000044 vs. ExPASy Swiss-Prot
Match:
Q6ZQL4 (WD repeat-containing protein 43 OS=Mus musculus OX=10090 GN=Wdr43 PE=1 SV=2)
HSP 1 Score: 67.0 bits (162), Expect = 8.6e-10
Identity = 149/641 (23.24%), Postives = 246/641 (38.38%), Query Frame = 0
Query: 13 AFTPDGD-YLAILSSNGTLKIWSTRDGSLLAEWKDPDGKNDVGYSCMACC---------F 72
AF+PD Y A+ SS+G L++W T + L E+ P +C+A
Sbjct: 20 AFSPDSQAYFALASSDGQLRVWETANNRLHQEYV-PSAHLSGTCTCLAWAPARLQAKESH 79
Query: 73 QGKKRKNSYC-------VVAVGTNSGDVLAVNASNGE-RKWVSAGCHLGGVIGLSFANKG 132
Q KKRK+ ++A+GT G +L + GE +++G H V + +
Sbjct: 80 QRKKRKSEVTGTKDQADLLALGTAVGSILLYSTVRGELHSKLTSGGHENRVNCIQWHQDN 139
Query: 133 RRLHAVGSNGKVSEMDTETGNIIKEFKASKKSISSSSFSLDEKYLAVAGKKLKILSKDDG 192
L++ + + E T+T + ++K S+SS S D K L AG+ +K+ +
Sbjct: 140 DCLYSCSDDKYIVEWSTQTCKVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETK 199
Query: 193 DELMVHPDKLGPVKLVSLSDDAKTIITSELGAKHLQVWWCDMSAGKLSRGPVLSMKHPPF 252
+ PV + + TI +E S G + H
Sbjct: 200 EVYRHFTGHATPVSSLRFT----TIRPNE----------SQPSDGITGLYFLSGAVHDRL 259
Query: 253 VSECRNVSNQEDNVVVLSVSVSGVAYLWKLKILSEDEVSPTKVSVKANDNQSAEENH--- 312
++ + S ++ V+S +V+ L LSE++ P K++V D Q H
Sbjct: 260 LNVWQVRSENKEKSAVMSFTVTDEPVYVDL-TLSENKEEPVKLAVVCRDGQVHLFEHILN 319
Query: 313 GSAKK----NRVSVIASRINGVGDNEVSVLVTHGSMDLPQHSLFSIGYSVKEDVNTARGN 372
G KK N IA+ G + + S L + SL + Y R
Sbjct: 320 GHCKKPLTSNCTIQIATPGKGKKVTPKPIPILAASFCLDKMSLLLV-YGNWFQPTIERVA 379
Query: 373 KTLQQNDDCSEQGPHEVEQEVVTPKSKKGKKKRAASDLDSLTTGDISDVGNGDASDVIFN 432
+ C E+ V K K S+ L G + A +
Sbjct: 380 LNSKDTHICLERDISNCWAPTVETAITKVKTPVMNSEAKVLVPG----IPGHHAPIKLPP 439
Query: 433 DDLNEPSMGEKLASLNLVDQNEDEGREKEPSVPAIPPSADSVQVLLKQALHAEDRVLLLE 492
E KL S + + + +S VLL Q L + D +L +
Sbjct: 440 AQPKEAENKRKLGSTEATIEERLGAMDLDRKGRKDDLQTNSFAVLLTQGLESNDFEILNK 499
Query: 493 CLYTKDDKVISKSIAQLNSSDVLKLLHSLISFIQSRGAILVCVLPWLRGLLLQHASKIMS 552
L TK+ +I +++ ++ V+ LL L +Q ++ WL+ +L HAS + +
Sbjct: 500 VLQTKNVNLIKRTVLRIPLRVVIPLLQELTKRLQGHPNSAALMIQWLKCVLTIHASYLST 559
Query: 553 QESSLLALNSLYQLIESRISTFQSALLLSSSLDFLYSGVLDEEVDEN----DAIVPIIYE 612
+ L +LYQL+ESR+ TFQ L L L + V E + ++YE
Sbjct: 560 LPDLVEQLGTLYQLMESRVKTFQKLSNLHGKLILLVTQVTASEKSKKMTSPGQKAKLVYE 619
Query: 613 EDDSDDKESGDEMETDEDEERDEVEAFDDLSAGEVDDDMSE 625
E+ S+++ + E D D+ DE E D VD+D E
Sbjct: 620 EESSEEESDDEVPEKDSDDNWDEDEDKDSEKDEGVDEDNEE 639
BLAST of Bhi09G000044 vs. ExPASy Swiss-Prot
Match:
Q8YV57 (Uncharacterized WD repeat-containing protein all2124 OS=Nostoc sp. (strain PCC 7120 / SAG 25.82 / UTEX 2576) OX=103690 GN=all2124 PE=4 SV=1)
HSP 1 Score: 60.5 bits (145), Expect = 8.0e-08
Identity = 58/206 (28.16%), Postives = 90/206 (43.69%), Query Frame = 0
Query: 13 AFTPDGDYLAILSSNGTLKIWSTRDGSLLAEWKDPDGK-NDVGYSCMACCFQGKKRKNSY 72
+FTP GD +A +++ T+KIW RDG L D + N V +S GK
Sbjct: 1411 SFTPQGDLIASANADKTVKIWRVRDGKALKTLIGHDNEVNKVNFSP-----DGK------ 1470
Query: 73 CVVAVGTNSGDVLAVNASNGERKWVSAGCHLGGVIGLSFANKGRRLHAVGSNGKVSEMDT 132
+A + V N S+G+ K G H V +SF+ G+ + + ++ + D+
Sbjct: 1471 -TLASASRDNTVKLWNVSDGKFKKTLKG-HTDEVFWVSFSPDGKIIASASADKTIRLWDS 1530
Query: 133 ETGNIIKEFKASKKSISSSSFSLDEKYLA--VAGKKLKILSKDDGDELMVHPDKLGPVKL 192
+GN+IK A + S +F+ D LA A K +K+ DG L V
Sbjct: 1531 FSGNLIKSLPAHNDLVYSVNFNPDGSMLASTSADKTVKLWRSHDGHLLHTFSGHSNVVYS 1590
Query: 193 VSLSDDAKTIITSELGAKHLQVWWCD 216
S S D + I S K +++W D
Sbjct: 1591 SSFSPDGR-YIASASEDKTVKIWQID 1602
BLAST of Bhi09G000044 vs. ExPASy Swiss-Prot
Match:
Q91V09 (WD repeat-containing protein 13 OS=Mus musculus OX=10090 GN=Wdr13 PE=1 SV=1)
HSP 1 Score: 50.4 bits (119), Expect = 8.3e-05
Identity = 38/128 (29.69%), Postives = 55/128 (42.97%), Query Frame = 0
Query: 13 AFTPDGDYLAILSSNGTLKIWSTRDGSLLAEWKDPDGKNDVGYSCMACCFQGKKRKNSYC 72
A++ D L S + T++IW++ DG + E DPDG + C FQ
Sbjct: 224 AWSLSNDILVSTSLDATMRIWASEDGRCIREIPDPDGA-----ELLCCTFQPVNNN---- 283
Query: 73 VVAVGTNSGDVLAVNASNGERKWVSAGCHLGGVIGLSFANKGRRLHAVGSNGKVSE--MD 132
+ VG +V +N S G++ + G V+ LSF GR L A G V D
Sbjct: 284 LTVVGNAKHNVHVMNISTGKKVKGGSSKLTGRVLALSFDAPGRLLWAGDDRGSVFSFLFD 342
Query: 133 TETGNIIK 139
TG + K
Sbjct: 344 MATGKLTK 342
BLAST of Bhi09G000044 vs. ExPASy Swiss-Prot
Match:
Q5RF24 (WD repeat-containing protein 13 OS=Pongo abelii OX=9601 GN=WDR13 PE=2 SV=1)
HSP 1 Score: 50.4 bits (119), Expect = 8.3e-05
Identity = 38/128 (29.69%), Postives = 55/128 (42.97%), Query Frame = 0
Query: 13 AFTPDGDYLAILSSNGTLKIWSTRDGSLLAEWKDPDGKNDVGYSCMACCFQGKKRKNSYC 72
A++ D L S + T++IW++ DG + E DPDG + C FQ
Sbjct: 224 AWSLSNDILVSTSLDATMRIWASEDGRCIREIPDPDGA-----ELLCCTFQPVNNN---- 283
Query: 73 VVAVGTNSGDVLAVNASNGERKWVSAGCHLGGVIGLSFANKGRRLHAVGSNGKVSE--MD 132
+ VG +V +N S G++ + G V+ LSF GR L A G V D
Sbjct: 284 LTVVGNAKHNVHVMNISTGKKVKGGSSKLTGRVLALSFDAPGRLLWAGDDRGSVFSFLFD 342
Query: 133 TETGNIIK 139
TG + K
Sbjct: 344 MATGKLTK 342
BLAST of Bhi09G000044 vs. ExPASy TrEMBL
Match:
A0A1S3CDX9 (WD repeat-containing protein 43 OS=Cucumis melo OX=3656 GN=LOC103499777 PE=4 SV=1)
HSP 1 Score: 1062.0 bits (2745), Expect = 9.6e-307
Identity = 554/625 (88.64%), Postives = 588/625 (94.08%), Query Frame = 0
Query: 1 MKKEILKSPPITAFTPDGDYLAILSSNGTLKIWSTRDGSLLAEWKDPDGKNDVGYSCMAC 60
MKKEILKSPPITAFTPDGDYLAI SSNGTLKIWSTRDGSLLAEWKD DGK+D GYSCMAC
Sbjct: 1 MKKEILKSPPITAFTPDGDYLAIFSSNGTLKIWSTRDGSLLAEWKDLDGKDDFGYSCMAC 60
Query: 61 CFQGKKRKNSYCVVAVGTNSGDVLAVNASNGERKWVSAGCHLGGVIGLSFANKGRRLHAV 120
C GKKRK+SYC+VA+GTN+GDVLAVNASNGE+KWVS GCH GGVIGLSFAN+GRRLH V
Sbjct: 61 CLLGKKRKSSYCLVAIGTNNGDVLAVNASNGEKKWVSTGCHPGGVIGLSFANEGRRLHTV 120
Query: 121 GSNGKVSEMDTETGNIIKEFKASKKSISSSSFSLDEKYLAVAGKKLKILSKDDGDELMVH 180
GSNG SEMDTETGNIIKEFKASKKSISSS+FSLDEKYLAVAGKKLKILS DDGDEL+VH
Sbjct: 121 GSNGMASEMDTETGNIIKEFKASKKSISSSAFSLDEKYLAVAGKKLKILSADDGDELIVH 180
Query: 181 PDKLGPVKLVSLSDDAKTIITSELGAKHLQVWWCDMSAGKLSRGPVLSMKHPPFVSECRN 240
PDKL PVKLVS+SDDAKTI+TSELGAKHLQVWWCDMSAGK SRGPVLSM HPPFVSECRN
Sbjct: 181 PDKLDPVKLVSISDDAKTILTSELGAKHLQVWWCDMSAGKFSRGPVLSMNHPPFVSECRN 240
Query: 241 VSNQEDNVVVLSVSVSGVAYLWKLKILSEDEVSPTKVSVKANDNQSAEENHGSAKKNRVS 300
VSNQED+VVVLSVSVSG AYLWKLK+LSEDEV PTKVSVKANDNQSAEENHGSAKKNRVS
Sbjct: 241 VSNQEDSVVVLSVSVSGAAYLWKLKVLSEDEVIPTKVSVKANDNQSAEENHGSAKKNRVS 300
Query: 301 VIASRINGVGDNEVSVLVTHGSMDLPQHSLFSIGYSVKEDVNTARGNKTLQQNDDCSEQG 360
V+AS+I+ +GDNEVSVLVTHGS+DLPQHSL IGY+VKED NTA NKTLQQND SEQG
Sbjct: 301 VLASKIHRIGDNEVSVLVTHGSVDLPQHSLLDIGYTVKEDANTAHENKTLQQNDGVSEQG 360
Query: 361 PHEVEQEVVTPKSKKGKKKRAASDLDSLTTGDISDVGNGDASDVIFNDDLNEPSMGEKLA 420
PHE+EQ V+TPKSKK KKKRAASDLDS T GD+SDVGNGDASDV+FNDDLNEPSMGEKLA
Sbjct: 361 PHEIEQ-VITPKSKKSKKKRAASDLDSPTAGDVSDVGNGDASDVVFNDDLNEPSMGEKLA 420
Query: 421 SLNLVDQNEDEGREKE-PSVPAIPPSADSVQVLLKQALHAEDRVLLLECLYTKDDKVISK 480
SLNL DQNED GRE+E PSVP IPPSADSVQVLLKQALHA+D LLLECLYTKDDKVISK
Sbjct: 421 SLNLADQNEDGGREQEDPSVPVIPPSADSVQVLLKQALHADDHALLLECLYTKDDKVISK 480
Query: 481 SIAQLNSSDVLKLLHSLISFIQSRGAILVCVLPWLRGLLLQHASKIMSQESSLLALNSLY 540
SIAQLNSSDVLKLLHS+ISFIQSRGAILVC LPWLRGLLLQHASKIMSQESSLLALNSLY
Sbjct: 481 SIAQLNSSDVLKLLHSVISFIQSRGAILVCALPWLRGLLLQHASKIMSQESSLLALNSLY 540
Query: 541 QLIESRISTFQSALLLSSSLDFLYSGVLDEEVDENDAIVPIIYEEDDSDDKESGDEMETD 600
QLIE+RISTFQSALLLSSSLDFLY+GVLDEE ++NDAIVPIIYEE+DSD+ E+GDEMETD
Sbjct: 541 QLIEARISTFQSALLLSSSLDFLYTGVLDEEENDNDAIVPIIYEEEDSDENETGDEMETD 600
Query: 601 EDEERDEVEAFDDLSAGEVDDDMSE 625
ED+ERDEVEAFDDLSAGEVDDDMSE
Sbjct: 601 EDDERDEVEAFDDLSAGEVDDDMSE 624
BLAST of Bhi09G000044 vs. ExPASy TrEMBL
Match:
A0A0A0KBW9 (Utp12 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G433260 PE=4 SV=1)
HSP 1 Score: 1038.9 bits (2685), Expect = 8.7e-300
Identity = 544/625 (87.04%), Postives = 583/625 (93.28%), Query Frame = 0
Query: 1 MKKEILKSPPITAFTPDGDYLAILSSNGTLKIWSTRDGSLLAEWKDPDGKNDVGYSCMAC 60
MK EILKS PI AFTPDGDYLAI+SSNGTLKIWSTRDGSLLAEWKD DGKND GYSCMAC
Sbjct: 1 MKMEILKSLPIAAFTPDGDYLAIVSSNGTLKIWSTRDGSLLAEWKDLDGKNDFGYSCMAC 60
Query: 61 CFQGKKRKNSYCVVAVGTNSGDVLAVNASNGERKWVSAGCHLGGVIGLSFANKGRRLHAV 120
CF GKKRK+SYCVVA+GTNSGDVLAVNASNGE+KWVSAGCH GGVIGLSFANKG RL V
Sbjct: 61 CFLGKKRKSSYCVVAIGTNSGDVLAVNASNGEKKWVSAGCHPGGVIGLSFANKGCRLRTV 120
Query: 121 GSNGKVSEMDTETGNIIKEFKASKKSISSSSFSLDEKYLAVAGKKLKILSKDDGDELMVH 180
GSNG SEMDTETGNIIKEFKASKKSISSS+FSLDE+YL VAGKKLKILS DDGDEL+VH
Sbjct: 121 GSNGMASEMDTETGNIIKEFKASKKSISSSAFSLDERYLVVAGKKLKILSTDDGDELIVH 180
Query: 181 PDKLGPVKLVSLSDDAKTIITSELGAKHLQVWWCDMSAGKLSRGPVLSMKHPPFVSECRN 240
PDKLGPVKLVS+SDDAKTIITSELGAKHLQVWWC++SAGK SRGP+LSMKHPPFVSECRN
Sbjct: 181 PDKLGPVKLVSVSDDAKTIITSELGAKHLQVWWCNISAGKFSRGPILSMKHPPFVSECRN 240
Query: 241 VSNQEDNVVVLSVSVSGVAYLWKLKILSEDEVSPTKVSVKANDNQSAEENHGSAKKNRVS 300
VSNQED+VVVLSVSVSG AYLWKLK+LSEDEV+PTKVSVKANDNQSAEENHGSAKKNR S
Sbjct: 241 VSNQEDSVVVLSVSVSGAAYLWKLKVLSEDEVTPTKVSVKANDNQSAEENHGSAKKNRAS 300
Query: 301 VIASRINGVGDNEVSVLVTHGSMDLPQHSLFSIGYSVKEDVNTARGNKTLQQNDDCSEQG 360
V+ASRI+G+GDNEVSVLVTHGS+DLPQH+L IGY+VKED NTA NKTLQQND SEQG
Sbjct: 301 VLASRIHGIGDNEVSVLVTHGSVDLPQHTLLDIGYTVKEDANTAHENKTLQQNDCVSEQG 360
Query: 361 PHEVEQEVVTPKSKKGKKKRAASDLDSLTTGDISDVGNGDASDVIFNDDLNEPSMGEKLA 420
PHE+EQ V+TPKSKK KKKRAAS+LDSLT GD+SDVGNGD SDV+FNDDLNEPSMGEKLA
Sbjct: 361 PHEIEQ-VITPKSKKSKKKRAASELDSLTAGDVSDVGNGDTSDVLFNDDLNEPSMGEKLA 420
Query: 421 SLNLVDQNEDEGREKE-PSVPAIPPSADSVQVLLKQALHAEDRVLLLECLYTKDDKVISK 480
SLNL DQN+D GRE+E PSVP IPPSADSVQVLLKQALHA+DR LLLECLYTKD KVISK
Sbjct: 421 SLNLADQNKDGGREQEDPSVPVIPPSADSVQVLLKQALHADDRALLLECLYTKDVKVISK 480
Query: 481 SIAQLNSSDVLKLLHSLISFIQSRGAILVCVLPWLRGLLLQHASKIMSQESSLLALNSLY 540
SIAQLNSSDVL LLH+LISFIQSRGAILVC LPWLR L+LQHASKIMSQESSLLALNSLY
Sbjct: 481 SIAQLNSSDVLTLLHALISFIQSRGAILVCALPWLRCLILQHASKIMSQESSLLALNSLY 540
Query: 541 QLIESRISTFQSALLLSSSLDFLYSGVLDEEVDENDAIVPIIYEEDDSDDKESGDEMETD 600
QLIESR STFQSALLLSSSLDFLY+ VLD+E ++ND IVPIIYEE+DSD+ E+GDEMET+
Sbjct: 541 QLIESRTSTFQSALLLSSSLDFLYTEVLDKEENDNDTIVPIIYEEEDSDENETGDEMETN 600
Query: 601 EDEERDEVEAFDDLSAGEVDDDMSE 625
ED+ERDEVEAFDDLSAGEVDDDMSE
Sbjct: 601 EDDERDEVEAFDDLSAGEVDDDMSE 624
BLAST of Bhi09G000044 vs. ExPASy TrEMBL
Match:
A0A6J1D938 (WD repeat-containing protein 43 OS=Momordica charantia OX=3673 GN=LOC111018389 PE=4 SV=1)
HSP 1 Score: 1009.2 bits (2608), Expect = 7.4e-291
Identity = 534/625 (85.44%), Postives = 577/625 (92.32%), Query Frame = 0
Query: 1 MKKEILKSPPITAFTPDGDYLAILSSNGTLKIWSTRDGSLLAEWKDPDGKNDVGYSCMAC 60
MKKE+LKSPPITAFTPDGDYLAILS N T+KIWS RDGSLLAEWKD +GK DVGYSC+AC
Sbjct: 1 MKKEMLKSPPITAFTPDGDYLAILSPNETVKIWSARDGSLLAEWKDSEGKTDVGYSCLAC 60
Query: 61 CFQGKK--RKNSYCVVAVGTNSGDVLAVNASNGERKWVSAGCHLGGVIGLSFANKGRRLH 120
CF KK +K S CV+A+GT++GDVLAVNAS+GE KWVSAGCH+GGVIGLSFANKGRRL
Sbjct: 61 CFVRKKHQKKKSSCVIAIGTDNGDVLAVNASSGETKWVSAGCHIGGVIGLSFANKGRRLR 120
Query: 121 AVGSNGKVSEMDTETGNIIKEFKASKKSISSSSFSLDEKYLAVAGKKLKILSKDDGDELM 180
VGSNG SEMDTETGNIIKEFKASKKSISSS+FS DEKYLAVAGKKL+ILS D+GDELM
Sbjct: 121 TVGSNGMASEMDTETGNIIKEFKASKKSISSSAFSPDEKYLAVAGKKLRILSTDNGDELM 180
Query: 181 VHPDKLGPVKLVSLSDDAKTIITSELGAKHLQVWWCDMSAGKLSRGPVLSMKHPPFVSEC 240
VH DKLGPVKLVS+SDDAK IITSE GAKHLQVWWCDMSA KLSRGPVLSMKHPPFVSEC
Sbjct: 181 VHSDKLGPVKLVSISDDAKAIITSEFGAKHLQVWWCDMSARKLSRGPVLSMKHPPFVSEC 240
Query: 241 RNVSNQEDNVVVLSVSVSGVAYLWKLKILSEDEVSPTKVSVKANDNQSAEENHGSAKKNR 300
+N+SN+ED++VVLSVSVSGVAY+W+LKILSEDEVSP KV+VKAND QSAEENHGSAKKNR
Sbjct: 241 KNISNEEDSIVVLSVSVSGVAYVWRLKILSEDEVSPNKVTVKANDAQSAEENHGSAKKNR 300
Query: 301 VSVIASRINGVGDNEVSVLVTHGSMDLPQHSLFSIGYSVKEDVNTARGNKTLQQNDDCSE 360
+SVIASRI+G GDNEVSVLVTHGSMD PQ SLF+IGYSVKED+NTA KTLQQNDD S
Sbjct: 301 ISVIASRIDGSGDNEVSVLVTHGSMDQPQLSLFNIGYSVKEDLNTAHEKKTLQQNDDFSG 360
Query: 361 QGPHEVEQEVVTPKSKKGKKKRAASDLDSLTTGDISDVGNGDASDVIFNDDLNEPSMGEK 420
QGPHE+EQ V TPKSKK KKKRAASD+DSLT GD+S VGNGDASDV+FNDD+NEP+MGEK
Sbjct: 361 QGPHEIEQAVTTPKSKKSKKKRAASDIDSLTAGDVSGVGNGDASDVLFNDDINEPTMGEK 420
Query: 421 LASLNLVDQNEDEGRE-KEPSVPAIPPSADSVQVLLKQALHAEDRVLLLECLYTKDDKVI 480
LASLNL+DQ+EDE E +EPSVPAIPPSADSVQVLLKQALHA+DR LLLECLYTKDDKVI
Sbjct: 421 LASLNLLDQDEDESHEQEEPSVPAIPPSADSVQVLLKQALHADDRALLLECLYTKDDKVI 480
Query: 481 SKSIAQLNSSDVLKLLHSLISFIQSRGAILVCVLPWLRGLLLQHASKIMSQESSLLALNS 540
SKSIAQLNSSDVLKLLHSLIS IQSRGAILVC LPWLRGLLLQHAS+IMSQESSLLALNS
Sbjct: 481 SKSIAQLNSSDVLKLLHSLISIIQSRGAILVCALPWLRGLLLQHASRIMSQESSLLALNS 540
Query: 541 LYQLIESRISTFQSALLLSSSLDFLYSGVLDEEVDENDAIVPIIYE-EDDSDDKESGDEM 600
LYQLIESRISTFQSA+LLSSSLDFLY+GVLDEEVD+NDAIVPIIYE EDDSDD+ESGDEM
Sbjct: 541 LYQLIESRISTFQSAVLLSSSLDFLYTGVLDEEVDDNDAIVPIIYEDEDDSDDQESGDEM 600
Query: 601 ETDEDEERDE-VEAFDDLSAGEVDD 621
ETDEDEE +E EAF DLSAGEVDD
Sbjct: 601 ETDEDEEEEEREEAFGDLSAGEVDD 625
BLAST of Bhi09G000044 vs. ExPASy TrEMBL
Match:
A0A5A7UYX1 (WD repeat-containing protein 43 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold459G002230 PE=4 SV=1)
HSP 1 Score: 1003.4 bits (2593), Expect = 4.1e-289
Identity = 526/594 (88.55%), Postives = 557/594 (93.77%), Query Frame = 0
Query: 32 IWSTRDGSLLAEWKDPDGKNDVGYSCMACCFQGKKRKNSYCVVAVGTNSGDVLAVNASNG 91
IWSTRDGSLLAEWKD DGK+D GYSCMACC GKKRK+SYCVVA+GTN+GDVLAVNASNG
Sbjct: 28 IWSTRDGSLLAEWKDLDGKDDFGYSCMACCLLGKKRKSSYCVVAIGTNNGDVLAVNASNG 87
Query: 92 ERKWVSAGCHLGGVIGLSFANKGRRLHAVGSNGKVSEMDTETGNIIKEFKASKKSISSSS 151
E+KWVS GCH GGVIGLSFAN+GRRLH VGSNG SEMDTETGNIIKEFKASKKSISSS+
Sbjct: 88 EKKWVSTGCHPGGVIGLSFANEGRRLHTVGSNGMASEMDTETGNIIKEFKASKKSISSSA 147
Query: 152 FSLDEKYLAVAGKKLKILSKDDGDELMVHPDKLGPVKLVSLSDDAKTIITSELGAKHLQV 211
FSLDEKYLAVAGKKLKILS DDGDEL+VHPDKL PVKLVS+SDDAKTIITSELGAKHLQV
Sbjct: 148 FSLDEKYLAVAGKKLKILSADDGDELIVHPDKLDPVKLVSISDDAKTIITSELGAKHLQV 207
Query: 212 WWCDMSAGKLSRGPVLSMKHPPFVSECRNVSNQEDNVVVLSVSVSGVAYLWKLKILSEDE 271
WWCDMSAGK SRGPVLSM HPPFVSECRNVSNQED+VVVLSVSVSG AYLWKLK+LSEDE
Sbjct: 208 WWCDMSAGKFSRGPVLSMNHPPFVSECRNVSNQEDSVVVLSVSVSGAAYLWKLKVLSEDE 267
Query: 272 VSPTKVSVKANDNQSAEENHGSAKKNRVSVIASRINGVGDNEVSVLVTHGSMDLPQHSLF 331
V PTKVSVKANDNQSAEENHGSAKKNRVSV+ASRI+ +GDNEVSVLVTHGS+DLPQHSL
Sbjct: 268 VIPTKVSVKANDNQSAEENHGSAKKNRVSVLASRIHRIGDNEVSVLVTHGSVDLPQHSLL 327
Query: 332 SIGYSVKEDVNTARGNKTLQQNDDCSEQGPHEVEQEVVTPKSKKGKKKRAASDLDSLTTG 391
IGY+VKED NTA NKTLQQND SEQGPHE+EQ V+ PKSKK KKKRAASDLDS T G
Sbjct: 328 DIGYTVKEDANTAHENKTLQQNDGVSEQGPHEIEQ-VIAPKSKKSKKKRAASDLDSPTAG 387
Query: 392 DISDVGNGDASDVIFNDDLNEPSMGEKLASLNLVDQNEDEGREKE-PSVPAIPPSADSVQ 451
D+SDVGNGDASDV+FNDDLNEPSMGEKLASLNL DQNED GRE+E PSVP IPPSADSVQ
Sbjct: 388 DVSDVGNGDASDVVFNDDLNEPSMGEKLASLNLADQNEDGGREQEDPSVPVIPPSADSVQ 447
Query: 452 VLLKQALHAEDRVLLLECLYTKDDKVISKSIAQLNSSDVLKLLHSLISFIQSRGAILVCV 511
VLLKQALHA+D LLLECLYTKDDKVISKSIAQLNSSDVLKLLHS+ISFIQSRGAILVC
Sbjct: 448 VLLKQALHADDHALLLECLYTKDDKVISKSIAQLNSSDVLKLLHSVISFIQSRGAILVCA 507
Query: 512 LPWLRGLLLQHASKIMSQESSLLALNSLYQLIESRISTFQSALLLSSSLDFLYSGVLDEE 571
LPWLRGLLLQHASKIMSQESSLLALNSLYQLIE+RISTFQSALLLSSSLDFLY+GVLDEE
Sbjct: 508 LPWLRGLLLQHASKIMSQESSLLALNSLYQLIEARISTFQSALLLSSSLDFLYTGVLDEE 567
Query: 572 VDENDAIVPIIYEEDDSDDKESGDEMETDEDEERDEVEAFDDLSAGEVDDDMSE 625
++NDAIVPIIYEE+DSD+ E+GDEMETDED+ERDEVEAFDDLSAGEVDDDMSE
Sbjct: 568 ENDNDAIVPIIYEEEDSDENETGDEMETDEDDERDEVEAFDDLSAGEVDDDMSE 620
BLAST of Bhi09G000044 vs. ExPASy TrEMBL
Match:
A0A6J1HJF3 (WD repeat-containing protein 43 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111464569 PE=4 SV=1)
HSP 1 Score: 964.5 bits (2492), Expect = 2.1e-277
Identity = 509/626 (81.31%), Postives = 559/626 (89.30%), Query Frame = 0
Query: 1 MKKEILKSPPITAFTPDGDYLAILSSNGTLKIWSTRDGSLLAEWKDPDGKNDVGYSCMAC 60
MK E KSPPITAFTP+GDYLAILSSNGT+KIW+T DGSLLAEWKDPDGK D GYSC+AC
Sbjct: 1 MKMEQHKSPPITAFTPNGDYLAILSSNGTVKIWNTSDGSLLAEWKDPDGKTDAGYSCIAC 60
Query: 61 CFQGKKRKNSYCVVAVGTNSGDVLAVNASNGERKWVSAGCHLGGVIGLSFANKGRRLHAV 120
CF GKKRKNS C++A+GTN GDVL VNAS+GE KWVSAGCHLGGVIGLSFA+KGRRLH V
Sbjct: 61 CFVGKKRKNSSCLIAIGTNGGDVLTVNASSGETKWVSAGCHLGGVIGLSFADKGRRLHTV 120
Query: 121 GSNGKVSEMDTETGNIIKEFKASKKSISSSSFSLDEKYLAVAGKKLKILSKDDGDELMVH 180
GSNG +M+ ETG+II EFKASKKSISSS+FSLDEKYLAVAGKKLKILS DDG ELMVH
Sbjct: 121 GSNGIAFKMNAETGSIINEFKASKKSISSSAFSLDEKYLAVAGKKLKILSTDDGVELMVH 180
Query: 181 PDKLGPVKLVSLSDDAKTIITSELGAKHLQVWWCDMSAGKLSRGPVLSMKHPPFVSECRN 240
PDKLGPVKL S+SDDAKTIITSE GAKH+QVWWCDMSAGKLSRGPVLSMKHPPFVSECRN
Sbjct: 181 PDKLGPVKLDSISDDAKTIITSEPGAKHIQVWWCDMSAGKLSRGPVLSMKHPPFVSECRN 240
Query: 241 VSNQEDNVVVLSVSVSGVAYLWKLKILSEDEVSPTKVSVKANDNQSAEENHGSAKKNRVS 300
++N ED++VVLSVSVSGVAYLWKLK LSED+V+PTKV+VK N+ +SAEENHGSAKKNR+S
Sbjct: 241 INNGEDSIVVLSVSVSGVAYLWKLKFLSEDKVNPTKVTVKVNNVESAEENHGSAKKNRIS 300
Query: 301 VIASRINGVGDNEVSVLVTHGSMDLPQHSLFSIGYSVKEDVNTARGNKTLQQNDDCSEQG 360
V++S I G+GDNEVSVLVTHGSMDLPQH++ +IGY KED N A ++G
Sbjct: 301 VLSSTIQGLGDNEVSVLVTHGSMDLPQHTVLNIGYPAKEDANIA-----------LEKEG 360
Query: 361 PHEVEQEVVTPKSKKGKKKRAASDLDSLTTGDISDVGNGDASDVIFNDDLNEPSMGEKLA 420
PHE++Q V +PKSKK KKKRAASDLDS GD+SDVGN D S+V+FNDDLNEP+MG+KLA
Sbjct: 361 PHEIKQAVTSPKSKKSKKKRAASDLDSQKAGDVSDVGNEDMSNVLFNDDLNEPTMGDKLA 420
Query: 421 SLNLVDQNEDEGREK-EPSVPAIPPSADSVQVLLKQALHAEDRVLLLECLYTKDDKVISK 480
SLNL +QNEDE E+ EPSVPAIPPSADSVQVLLKQAL A+DR LLLECLYTKDDKVISK
Sbjct: 421 SLNLEEQNEDENHEQDEPSVPAIPPSADSVQVLLKQALRADDRALLLECLYTKDDKVISK 480
Query: 481 SIAQLNSSDVLKLLHSLISFIQSRGAILVCVLPWLRGLLLQHASKIMSQESSLLALNSLY 540
SIAQLNSSDVLKLLH+LIS IQSRGAILVC +PWLRGLLLQHAS+IMSQESSLLALNSLY
Sbjct: 481 SIAQLNSSDVLKLLHALISIIQSRGAILVCAIPWLRGLLLQHASRIMSQESSLLALNSLY 540
Query: 541 QLIESRISTFQSALLLSSSLDFLYSGVLDEEVDENDAIVPIIYEEDDSDDKE-SGDEMET 600
QLIESRISTFQSALLLSSSLDFLY+GVLDEE +ENDAIVPIIYEEDDSDDKE SGDEMET
Sbjct: 541 QLIESRISTFQSALLLSSSLDFLYTGVLDEEAEENDAIVPIIYEEDDSDDKESSGDEMET 600
Query: 601 DEDEERDEVEAFDDLSAGEVDDDMSE 625
DE+ EVEAFDDLSAGEVDDDMSE
Sbjct: 601 DEEGGGVEVEAFDDLSAGEVDDDMSE 615
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
AT1G15420.1 | 1.3e-48 | 50.77 | CONTAINS InterPro DOMAIN/s: Small-subunit processome, Utp12 (InterPro:IPR007148)... | [more] |
AT5G11240.1 | 1.5e-38 | 25.67 | transducin family protein / WD-40 repeat family protein | [more] |
Match Name | E-value | Identity | Description | |
Q15061 | 8.6e-10 | 22.19 | WD repeat-containing protein 43 OS=Homo sapiens OX=9606 GN=WDR43 PE=1 SV=3 | [more] |
Q6ZQL4 | 8.6e-10 | 23.24 | WD repeat-containing protein 43 OS=Mus musculus OX=10090 GN=Wdr43 PE=1 SV=2 | [more] |
Q8YV57 | 8.0e-08 | 28.16 | Uncharacterized WD repeat-containing protein all2124 OS=Nostoc sp. (strain PCC 7... | [more] |
Q91V09 | 8.3e-05 | 29.69 | WD repeat-containing protein 13 OS=Mus musculus OX=10090 GN=Wdr13 PE=1 SV=1 | [more] |
Q5RF24 | 8.3e-05 | 29.69 | WD repeat-containing protein 13 OS=Pongo abelii OX=9601 GN=WDR13 PE=2 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
A0A1S3CDX9 | 9.6e-307 | 88.64 | WD repeat-containing protein 43 OS=Cucumis melo OX=3656 GN=LOC103499777 PE=4 SV=... | [more] |
A0A0A0KBW9 | 8.7e-300 | 87.04 | Utp12 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G433260 PE=4 ... | [more] |
A0A6J1D938 | 7.4e-291 | 85.44 | WD repeat-containing protein 43 OS=Momordica charantia OX=3673 GN=LOC111018389 P... | [more] |
A0A5A7UYX1 | 4.1e-289 | 88.55 | WD repeat-containing protein 43 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_... | [more] |
A0A6J1HJF3 | 2.1e-277 | 81.31 | WD repeat-containing protein 43 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC1... | [more] |