|
Sequences
The following sequences are available for this feature:
Gene sequence (with intron) Legend: exonCDSpolypeptide Hold the cursor over a type above to highlight its positions in the sequence below. ATGTCGACGACAAAGTCGACACCCAAAGCACAAGTCGACCGGCTTACGCTAATAGAGGAGGAAATGCTGTTCCTCAAAGAAGTCCCTGACACCCTCCGCTTCCTGGAAGCACGGGTGACCGAATTGAGTGAGAAAGTCGTAGGAATCGACGCAATGGGCAACCGCCTGGATGGGTTGCCAATCGCAGAATTGATGTTTCGAGTGACCTCGCTCGAAGAAAGAGTTGCTCCTACGAGCAGCCCAAAACCGTCTGGTAGTCCGGATAGCTCTGTCGCGCACAAGGAGGGACGTGGCGAAGAGTTCGACGTGCTACAAAATACAATGATGAGCTTGTTCAATGGATTAGCTGATGAGTTCAGAACAACAATCGACGACATCCAAGAAAGGATGGCCTCCATGGGCACTCGAATTGAAGTGACCATGAAAGCCGTGGAGAACGTCACGGCTGGGCAAGCTAATACAGGGTCCAACAAACTAAGATTCCTAGATCCTAGAGCCTTTAAAGGGAATCGGGACGCCAAAGAGTTGGAAAACTTCATCTTTGATGTCGAACAGTACTTCAAAGCCACAACGGCTTGTACTGACGACAAGAAGGTGACCGTAGCCTCGATGTATCTCATAGACGATGCCAAACTGTGGTGGCGTACGAAGGTGCAAGACATCGAGGATGGTTTGTGCACCATTGACTCGTGGGAGGACCTTAAGAAAGAGTTGAGGGACCAGTTCCTCCCCGAAAACGCAGGACATTTAGCAATGGAAAAACTAGTAGCCCTGAAACACACTGGAGGCATACGAGACTATGTCAGACAGTTCTCAACCCTGATGCTAGATATCAGGGGCACATCAGAGAAGGACAAGGTGTTCTTCTTTATAAATGGGTTACAGCCGTGGGCCAAAACAAAACTACACGAGAACAAGGTTCAAACCCTAGCTGACGCAATGGCCTGTGCCGAGAGACTCCTAGACTATGGGAATGAAGCGGGATCCCAAAGAAGAATAACACCAGCCCCAAACACTGGGGGCAAGCCATACAAACCACCAAGTCATCGAAATGGAAGCCCCAACAGGCCGAACGGAGGTAACGACAGACCAAGCGAATGGACGGATAGACCTCCTCAGAACAACCAAGCGGGGACATCTCGAGGACCTTACCATCAAAGGAACCACCCGACGACGCCTTTACAATGCATGTTGTGTAAAGGTCCCCACAAGGTATCTTACTGTCCTCATCGGGCCTCTCTCACTGCGCTCCAAGTGTCCATTCAAGAGAGCAATGACGCAAGGATTGAGACTATGCTTGACAAGAAGGAAGATCAAGACAATCCCCGAATGGGCGCACTTAAATTCTTGTCAGCCCTCCAACGGAAGGTCGAATCGAAGGAGATAGTAGAGAAAGGACTCATGTTCGTAAATGCGACAATAAATTCCCAACCGAATAGGAGCACTCTGATAGATTCAGGAGCGACCCACAACTTCATCGCCGATCAAGAAGCCCGAAGATTAGGACTCACTATAGGAAAGGACCCGGGAAAAATGAAAGCTGTCAACTCTGAGGCCTTGCCTATTGTGGGAGTTTCCAAAAGAGTCCCCTTTAAAATAGGGGATTGGACAGGAGAGCTAGATCTTGTCGTAGCTCGCATGGACGACTTTGACGTGGTACTTGGGATGGAGTTCCTCCTAGAACACAAAGTTATCCCAATGCCGTTGGCAAAATGCTTAGTGATCACCGATCGCAACCCCACGGTAATACCTGCAAGCATCAAGCAACCAGGTAATCTTCGAATGATCTCGGCCATACAATTGAAAAGGGGACTCGCGCGAGAGGAACCTACATTTATGGCCATACCATTGATGGAAGTAACAACCAACGAAGAAACTGTCCCAAATGAAATCAATGAGGTACTAAACGACTATGCTGACATAATGCCAGAGAGCTTACCCCAAACATTACCACCTCGTCGAGGCATTGATCACGAAATCGAACTCATCCCCGGAGTTAAACCGCCAGCGAAGAACGCATACCGGATGGCTCCACCCGAGCTAGCCGAATTAAGGAAACAACTGGATGAGTTGCTGAAGGCGGGATTCATCCGCCCGGCAAAGGCACCCTACGGAGCCCCCGTACTGTTCCAGAAGAAGAAGGATGGGACGTTGCGTCTGTGCATAGACTATAGAGCCTTAAACAAGGTGACGGTGCGCAACAAATATCCACTGCTGATAATATCCGACTTGTTCGACCAACTTCACGGGGCCAAGTATTTCACGAAGTTGGACTTACGATCAGGGTATTACCAAGTACGTATTGCCGAAGGGGATGAGCCCAAGACGACGTGCGTAACAAGATATGGGGCCTTCGAATTCCTAGTAATGCCCTTTGGCTTGACAAACGCTCCAGCTACTTTCTGCACGTTGATGAACCAAGTTTTCTACGAATACTTGGATCAGTTTGTCATAGTATACCTCGACGACATAGTGGTTTACAGCACAACCCTAGAAGAACACAAAGTGCACTTGAAGCTGGTGTTCGACAAGCTACGACAAAACCAACTGTATGTCAAGAAAGAGAAATGTGCATTCGCACAAACATGCATCAACTTCCTTGGACATGTCGTCAAATGTGGACATATTAGTATGGATAGCGATAAGATAAAAGCTATCCAAGAATGGAAAGTTCCTACTTCCGTATCCGATTTGCGGTCCTTCTTAGGATTAGCCAACTACTATAGGCGGTTCGTCGAAGGGTTTTCACGACGAGCGGCCCCATTGACAGAGCTGTTGAAGAAAGACCACACTTGGTCGTGGTCAGATGATTGTCAAATGGCCTTTGAAGATCTGAAAACAACCATGATGAGGGGTCCTGTCCTCGGATTGGTAGATGTTACAAAGCCATTTGAAATAGAAACAGACGCTTCCGACTTTGCCCTAGGTGGGGTCCTTATTCAAGAAGGCCACCCCATCGCTTTCGAAAGTCGAAAGCTCAATGACGTCGAACTTAGATACACTGTCTCCGAAAAAGAAATGCTGGCAGTAGTCCATTGCCTTCGAGTCTGGAGACAATACCTCTTGGGATCACAGTTCGTAGTGAAGACGGATAACAGCGCCATTTGCCACTTCTTTGATCAACCAAAATTGACGGCAAAACAAGCCCGGTGGCAGGAGTCGTTAGCTGAATTCGACTTCAAGTTCGAACACAAAGCAGGGAAGAGCAATCAAGCAGCCGACGCACTGAGTCGGAAGGGCGAACATGCGGCCCTGTGCATGTTAGCCCATATTCACTCAAGTAAGATCGATGGATCGATGCGCGACATCATCAAGGAACATTTACATAAAGACCCATCGGCCAAAACCGTCGTCGAACTAGCTAAAGCTGGGAAAACACGACAGTTTTGGGTTGAGGGAGACCTTCTGATGACAAAAGGAAACAGATTGTATGTCCCAAGAACGGGAGAACTGAGGAAGAAGCTCATTCAGGAATGTCATGATACCTTATGGGCCGGACACCCTGGGTGGCAAAGAACATACGCTCTAATAAAGAAAGGGTACTTCTGGCCAAACATGCGAGACGACATCATGCAATACACCAAGACGTGCCTCATCTGTCAACAGGACAAAGTCGAGAAAGCCAAAGTCTCAGGACTCTTGGAACCTCTACCTGTGCCGACAAGACCCTGGGAAAGTGTATCTTTGGACTTCATAACACACCTCCCAAAAGTCGGGGACTATGACGCTATCTTGGTTATCGTAGACCGATTCTCAAAATATGCGACGTTCATCCCCACTCCCAAATTATGCTCGGCCGAACTCACAGCTGAACTATTTTTCAAACACATTGTAAAGTTATGGGGTATTCCGTCGAGCATCATCAGTGATCGGGATGGCAGATTCATTGGGACATTCTGGACCGAGTTATTCGCCTTCTTGGGAACAACCTTAAACATCTCCTCGAGTTACCACCCTCAAACCGATGGTCAGACAGAACGGTTCAATTGCTTGCTCGAAGAATACTTACGTCACTTCGTCGATGCTCGCCAGAAGAACTGGATACAACTGTTAGATGTCGCCCAATTTTGCTTCAATTGTCAAACCAGCTCGTCTACGGGAAAGAGTCCCTTTGAAATTGTAAGTGGACGACAACCGGCCTTACCCCACATTATCGATCATCCTTATGCAGGGAAAAACCCTCAAGCTCACAACTTCACAAGAGAATGGAAGCAGACAACAGATATAGCCCGGGCATATTTAGAGAAGGCTTCCAAGCATATGAAAAAGTGGGCAGACAAGAAGCGTCGCCCCCTTCAATTCCGAGCAGGAGATCAAGTCCTCATCAAGCTGAAACCAGAACAGATCAGATTTCGCAACTGA mRNA sequence ATGTCGACGACAAAGTCGACACCCAAAGCACAAGTCGACCGGCTTACGCTAATAGAGGAGGAAATGCTGTTCCTCAAAGAAGTCCCTGACACCCTCCGCTTCCTGGAAGCACGGGTGACCGAATTGAGTGAGAAAGTCGTAGGAATCGACGCAATGGGCAACCGCCTGGATGGGTTGCCAATCGCAGAATTGATGTTTCGAGTGACCTCGCTCGAAGAAAGAGTTGCTCCTACGAGCAGCCCAAAACCGTCTGGTAGTCCGGATAGCTCTGTCGCGCACAAGGAGGGACGTGGCGAAGAGTTCGACGTGCTACAAAATACAATGATGAGCTTGTTCAATGGATTAGCTGATGAGTTCAGAACAACAATCGACGACATCCAAGAAAGGATGGCCTCCATGGGCACTCGAATTGAAGTGACCATGAAAGCCGTGGAGAACGTCACGGCTGGGCAAGCTAATACAGGGTCCAACAAACTAAGATTCCTAGATCCTAGAGCCTTTAAAGGGAATCGGGACGCCAAAGAGTTGGAAAACTTCATCTTTGATGTCGAACAGTACTTCAAAGCCACAACGGCTTGTACTGACGACAAGAAGGTGACCGTAGCCTCGATGTATCTCATAGACGATGCCAAACTGTGGTGGCGTACGAAGGTGCAAGACATCGAGGATGGTTTGTGCACCATTGACTCGTGGGAGGACCTTAAGAAAGAGTTGAGGGACCAGTTCCTCCCCGAAAACGCAGGACATTTAGCAATGGAAAAACTAGTAGCCCTGAAACACACTGGAGGCATACGAGACTATGTCAGACAGTTCTCAACCCTGATGCTAGATATCAGGGGCACATCAGAGAAGGACAAGGTGTTCTTCTTTATAAATGGGTTACAGCCGTGGGCCAAAACAAAACTACACGAGAACAAGGTTCAAACCCTAGCTGACGCAATGGCCTGTGCCGAGAGACTCCTAGACTATGGGAATGAAGCGGGATCCCAAAGAAGAATAACACCAGCCCCAAACACTGGGGGCAAGCCATACAAACCACCAAGTCATCGAAATGGAAGCCCCAACAGGCCGAACGGAGGTAACGACAGACCAAGCGAATGGACGGATAGACCTCCTCAGAACAACCAAGCGGGGACATCTCGAGGACCTTACCATCAAAGGAACCACCCGACGACGCCTTTACAATGCATGTTGTGTAAAGGTCCCCACAAGGTATCTTACTGTCCTCATCGGGCCTCTCTCACTGCGCTCCAAGTGTCCATTCAAGAGAGCAATGACGCAAGGATTGAGACTATGCTTGACAAGAAGGAAGATCAAGACAATCCCCGAATGGGCGCACTTAAATTCTTGTCAGCCCTCCAACGGAAGGTCGAATCGAAGGAGATAGTAGAGAAAGGACTCATGTTCGTAAATGCGACAATAAATTCCCAACCGAATAGGAGCACTCTGATAGATTCAGGAGCGACCCACAACTTCATCGCCGATCAAGAAGCCCGAAGATTAGGACTCACTATAGGAAAGGACCCGGGAAAAATGAAAGCTGTCAACTCTGAGGCCTTGCCTATTGTGGGAGTTTCCAAAAGAGTCCCCTTTAAAATAGGGGATTGGACAGGAGAGCTAGATCTTGTCGTAGCTCGCATGGACGACTTTGACGTGGTACTTGGGATGGAGTTCCTCCTAGAACACAAAGTTATCCCAATGCCGTTGGCAAAATGCTTAGTGATCACCGATCGCAACCCCACGGTAATACCTGCAAGCATCAAGCAACCAGGTAATCTTCGAATGATCTCGGCCATACAATTGAAAAGGGGACTCGCGCGAGAGGAACCTACATTTATGGCCATACCATTGATGGAAGTAACAACCAACGAAGAAACTGTCCCAAATGAAATCAATGAGGTACTAAACGACTATGCTGACATAATGCCAGAGAGCTTACCCCAAACATTACCACCTCGTCGAGGCATTGATCACGAAATCGAACTCATCCCCGGAGTTAAACCGCCAGCGAAGAACGCATACCGGATGGCTCCACCCGAGCTAGCCGAATTAAGGAAACAACTGGATGAGTTGCTGAAGGCGGGATTCATCCGCCCGGCAAAGGCACCCTACGGAGCCCCCGTACTGTTCCAGAAGAAGAAGGATGGGACGTTGCGTCTGTGCATAGACTATAGAGCCTTAAACAAGGTGACGGTGCGCAACAAATATCCACTGCTGATAATATCCGACTTGTTCGACCAACTTCACGGGGCCAAGTATTTCACGAAGTTGGACTTACGATCAGGGTATTACCAAGTACGTATTGCCGAAGGGGATGAGCCCAAGACGACGTGCGTAACAAGATATGGGGCCTTCGAATTCCTAGTAATGCCCTTTGGCTTGACAAACGCTCCAGCTACTTTCTGCACGTTGATGAACCAAGTTTTCTACGAATACTTGGATCAGTTTGTCATAGTATACCTCGACGACATAGTGGTTTACAGCACAACCCTAGAAGAACACAAAGTGCACTTGAAGCTGGTGTTCGACAAGCTACGACAAAACCAACTGTATGTCAAGAAAGAGAAATGTGCATTCGCACAAACATGCATCAACTTCCTTGGACATGTCGTCAAATGTGGACATATTAGTATGGATAGCGATAAGATAAAAGCTATCCAAGAATGGAAAGTTCCTACTTCCGTATCCGATTTGCGGTCCTTCTTAGGATTAGCCAACTACTATAGGCGGTTCGTCGAAGGGTTTTCACGACGAGCGGCCCCATTGACAGAGCTGTTGAAGAAAGACCACACTTGGTCGTGGTCAGATGATTGTCAAATGGCCTTTGAAGATCTGAAAACAACCATGATGAGGGGTCCTGTCCTCGGATTGGTAGATGTTACAAAGCCATTTGAAATAGAAACAGACGCTTCCGACTTTGCCCTAGGTGGGGTCCTTATTCAAGAAGGCCACCCCATCGCTTTCGAAAGTCGAAAGCTCAATGACGTCGAACTTAGATACACTGTCTCCGAAAAAGAAATGCTGGCAGTAGTCCATTGCCTTCGAGTCTGGAGACAATACCTCTTGGGATCACAGTTCGTAGTGAAGACGGATAACAGCGCCATTTGCCACTTCTTTGATCAACCAAAATTGACGGCAAAACAAGCCCGGTGGCAGGAGTCGTTAGCTGAATTCGACTTCAAGTTCGAACACAAAGCAGGGAAGAGCAATCAAGCAGCCGACGCACTGAGTCGGAAGGGCGAACATGCGGCCCTGTGCATGTTAGCCCATATTCACTCAAGTAAGATCGATGGATCGATGCGCGACATCATCAAGGAACATTTACATAAAGACCCATCGGCCAAAACCGTCGTCGAACTAGCTAAAGCTGGGAAAACACGACAGTTTTGGGTTGAGGGAGACCTTCTGATGACAAAAGGAAACAGATTGTATGTCCCAAGAACGGGAGAACTGAGGAAGAAGCTCATTCAGGAATGTCATGATACCTTATGGGCCGGACACCCTGGGTGGCAAAGAACATACGCTCTAATAAAGAAAGGGTACTTCTGGCCAAACATGCGAGACGACATCATGCAATACACCAAGACGTGCCTCATCTGTCAACAGGACAAAGTCGAGAAAGCCAAAGTCTCAGGACTCTTGGAACCTCTACCTGTGCCGACAAGACCCTGGGAAAGTGTATCTTTGGACTTCATAACACACCTCCCAAAAGTCGGGGACTATGACGCTATCTTGGTTATCGTAGACCGATTCTCAAAATATGCGACGTTCATCCCCACTCCCAAATTATGCTCGGCCGAACTCACAGCTGAACTATTTTTCAAACACATTGTAAAGTTATGGGGTATTCCGTCGAGCATCATCAGTGATCGGGATGGCAGATTCATTGGGACATTCTGGACCGAGTTATTCGCCTTCTTGGGAACAACCTTAAACATCTCCTCGAGTTACCACCCTCAAACCGATGGTCAGACAGAACGGTTCAATTGCTTGCTCGAAGAATACTTACGTCACTTCGTCGATGCTCGCCAGAAGAACTGGATACAACTGTTAGATGTCGCCCAATTTTGCTTCAATTGTCAAACCAGCTCGTCTACGGGAAAGAGTCCCTTTGAAATTGTAAGTGGACGACAACCGGCCTTACCCCACATTATCGATCATCCTTATGCAGGGAAAAACCCTCAAGCTCACAACTTCACAAGAGAATGGAAGCAGACAACAGATATAGCCCGGGCATATTTAGAGAAGGCTTCCAAGCATATGAAAAAGTGGGCAGACAAGAAGCGTCGCCCCCTTCAATTCCGAGCAGGAGATCAAGTCCTCATCAAGCTGAAACCAGAACAGATCAGATTTCGCAACTGA Coding sequence (CDS) ATGTCGACGACAAAGTCGACACCCAAAGCACAAGTCGACCGGCTTACGCTAATAGAGGAGGAAATGCTGTTCCTCAAAGAAGTCCCTGACACCCTCCGCTTCCTGGAAGCACGGGTGACCGAATTGAGTGAGAAAGTCGTAGGAATCGACGCAATGGGCAACCGCCTGGATGGGTTGCCAATCGCAGAATTGATGTTTCGAGTGACCTCGCTCGAAGAAAGAGTTGCTCCTACGAGCAGCCCAAAACCGTCTGGTAGTCCGGATAGCTCTGTCGCGCACAAGGAGGGACGTGGCGAAGAGTTCGACGTGCTACAAAATACAATGATGAGCTTGTTCAATGGATTAGCTGATGAGTTCAGAACAACAATCGACGACATCCAAGAAAGGATGGCCTCCATGGGCACTCGAATTGAAGTGACCATGAAAGCCGTGGAGAACGTCACGGCTGGGCAAGCTAATACAGGGTCCAACAAACTAAGATTCCTAGATCCTAGAGCCTTTAAAGGGAATCGGGACGCCAAAGAGTTGGAAAACTTCATCTTTGATGTCGAACAGTACTTCAAAGCCACAACGGCTTGTACTGACGACAAGAAGGTGACCGTAGCCTCGATGTATCTCATAGACGATGCCAAACTGTGGTGGCGTACGAAGGTGCAAGACATCGAGGATGGTTTGTGCACCATTGACTCGTGGGAGGACCTTAAGAAAGAGTTGAGGGACCAGTTCCTCCCCGAAAACGCAGGACATTTAGCAATGGAAAAACTAGTAGCCCTGAAACACACTGGAGGCATACGAGACTATGTCAGACAGTTCTCAACCCTGATGCTAGATATCAGGGGCACATCAGAGAAGGACAAGGTGTTCTTCTTTATAAATGGGTTACAGCCGTGGGCCAAAACAAAACTACACGAGAACAAGGTTCAAACCCTAGCTGACGCAATGGCCTGTGCCGAGAGACTCCTAGACTATGGGAATGAAGCGGGATCCCAAAGAAGAATAACACCAGCCCCAAACACTGGGGGCAAGCCATACAAACCACCAAGTCATCGAAATGGAAGCCCCAACAGGCCGAACGGAGGTAACGACAGACCAAGCGAATGGACGGATAGACCTCCTCAGAACAACCAAGCGGGGACATCTCGAGGACCTTACCATCAAAGGAACCACCCGACGACGCCTTTACAATGCATGTTGTGTAAAGGTCCCCACAAGGTATCTTACTGTCCTCATCGGGCCTCTCTCACTGCGCTCCAAGTGTCCATTCAAGAGAGCAATGACGCAAGGATTGAGACTATGCTTGACAAGAAGGAAGATCAAGACAATCCCCGAATGGGCGCACTTAAATTCTTGTCAGCCCTCCAACGGAAGGTCGAATCGAAGGAGATAGTAGAGAAAGGACTCATGTTCGTAAATGCGACAATAAATTCCCAACCGAATAGGAGCACTCTGATAGATTCAGGAGCGACCCACAACTTCATCGCCGATCAAGAAGCCCGAAGATTAGGACTCACTATAGGAAAGGACCCGGGAAAAATGAAAGCTGTCAACTCTGAGGCCTTGCCTATTGTGGGAGTTTCCAAAAGAGTCCCCTTTAAAATAGGGGATTGGACAGGAGAGCTAGATCTTGTCGTAGCTCGCATGGACGACTTTGACGTGGTACTTGGGATGGAGTTCCTCCTAGAACACAAAGTTATCCCAATGCCGTTGGCAAAATGCTTAGTGATCACCGATCGCAACCCCACGGTAATACCTGCAAGCATCAAGCAACCAGGTAATCTTCGAATGATCTCGGCCATACAATTGAAAAGGGGACTCGCGCGAGAGGAACCTACATTTATGGCCATACCATTGATGGAAGTAACAACCAACGAAGAAACTGTCCCAAATGAAATCAATGAGGTACTAAACGACTATGCTGACATAATGCCAGAGAGCTTACCCCAAACATTACCACCTCGTCGAGGCATTGATCACGAAATCGAACTCATCCCCGGAGTTAAACCGCCAGCGAAGAACGCATACCGGATGGCTCCACCCGAGCTAGCCGAATTAAGGAAACAACTGGATGAGTTGCTGAAGGCGGGATTCATCCGCCCGGCAAAGGCACCCTACGGAGCCCCCGTACTGTTCCAGAAGAAGAAGGATGGGACGTTGCGTCTGTGCATAGACTATAGAGCCTTAAACAAGGTGACGGTGCGCAACAAATATCCACTGCTGATAATATCCGACTTGTTCGACCAACTTCACGGGGCCAAGTATTTCACGAAGTTGGACTTACGATCAGGGTATTACCAAGTACGTATTGCCGAAGGGGATGAGCCCAAGACGACGTGCGTAACAAGATATGGGGCCTTCGAATTCCTAGTAATGCCCTTTGGCTTGACAAACGCTCCAGCTACTTTCTGCACGTTGATGAACCAAGTTTTCTACGAATACTTGGATCAGTTTGTCATAGTATACCTCGACGACATAGTGGTTTACAGCACAACCCTAGAAGAACACAAAGTGCACTTGAAGCTGGTGTTCGACAAGCTACGACAAAACCAACTGTATGTCAAGAAAGAGAAATGTGCATTCGCACAAACATGCATCAACTTCCTTGGACATGTCGTCAAATGTGGACATATTAGTATGGATAGCGATAAGATAAAAGCTATCCAAGAATGGAAAGTTCCTACTTCCGTATCCGATTTGCGGTCCTTCTTAGGATTAGCCAACTACTATAGGCGGTTCGTCGAAGGGTTTTCACGACGAGCGGCCCCATTGACAGAGCTGTTGAAGAAAGACCACACTTGGTCGTGGTCAGATGATTGTCAAATGGCCTTTGAAGATCTGAAAACAACCATGATGAGGGGTCCTGTCCTCGGATTGGTAGATGTTACAAAGCCATTTGAAATAGAAACAGACGCTTCCGACTTTGCCCTAGGTGGGGTCCTTATTCAAGAAGGCCACCCCATCGCTTTCGAAAGTCGAAAGCTCAATGACGTCGAACTTAGATACACTGTCTCCGAAAAAGAAATGCTGGCAGTAGTCCATTGCCTTCGAGTCTGGAGACAATACCTCTTGGGATCACAGTTCGTAGTGAAGACGGATAACAGCGCCATTTGCCACTTCTTTGATCAACCAAAATTGACGGCAAAACAAGCCCGGTGGCAGGAGTCGTTAGCTGAATTCGACTTCAAGTTCGAACACAAAGCAGGGAAGAGCAATCAAGCAGCCGACGCACTGAGTCGGAAGGGCGAACATGCGGCCCTGTGCATGTTAGCCCATATTCACTCAAGTAAGATCGATGGATCGATGCGCGACATCATCAAGGAACATTTACATAAAGACCCATCGGCCAAAACCGTCGTCGAACTAGCTAAAGCTGGGAAAACACGACAGTTTTGGGTTGAGGGAGACCTTCTGATGACAAAAGGAAACAGATTGTATGTCCCAAGAACGGGAGAACTGAGGAAGAAGCTCATTCAGGAATGTCATGATACCTTATGGGCCGGACACCCTGGGTGGCAAAGAACATACGCTCTAATAAAGAAAGGGTACTTCTGGCCAAACATGCGAGACGACATCATGCAATACACCAAGACGTGCCTCATCTGTCAACAGGACAAAGTCGAGAAAGCCAAAGTCTCAGGACTCTTGGAACCTCTACCTGTGCCGACAAGACCCTGGGAAAGTGTATCTTTGGACTTCATAACACACCTCCCAAAAGTCGGGGACTATGACGCTATCTTGGTTATCGTAGACCGATTCTCAAAATATGCGACGTTCATCCCCACTCCCAAATTATGCTCGGCCGAACTCACAGCTGAACTATTTTTCAAACACATTGTAAAGTTATGGGGTATTCCGTCGAGCATCATCAGTGATCGGGATGGCAGATTCATTGGGACATTCTGGACCGAGTTATTCGCCTTCTTGGGAACAACCTTAAACATCTCCTCGAGTTACCACCCTCAAACCGATGGTCAGACAGAACGGTTCAATTGCTTGCTCGAAGAATACTTACGTCACTTCGTCGATGCTCGCCAGAAGAACTGGATACAACTGTTAGATGTCGCCCAATTTTGCTTCAATTGTCAAACCAGCTCGTCTACGGGAAAGAGTCCCTTTGAAATTGTAAGTGGACGACAACCGGCCTTACCCCACATTATCGATCATCCTTATGCAGGGAAAAACCCTCAAGCTCACAACTTCACAAGAGAATGGAAGCAGACAACAGATATAGCCCGGGCATATTTAGAGAAGGCTTCCAAGCATATGAAAAAGTGGGCAGACAAGAAGCGTCGCCCCCTTCAATTCCGAGCAGGAGATCAAGTCCTCATCAAGCTGAAACCAGAACAGATCAGATTTCGCAACTGA Protein sequence MSTTKSTPKAQVDRLTLIEEEMLFLKEVPDTLRFLEARVTELSEKVVGIDAMGNRLDGLPIAELMFRVTSLEERVAPTSSPKPSGSPDSSVAHKEGRGEEFDVLQNTMMSLFNGLADEFRTTIDDIQERMASMGTRIEVTMKAVENVTAGQANTGSNKLRFLDPRAFKGNRDAKELENFIFDVEQYFKATTACTDDKKVTVASMYLIDDAKLWWRTKVQDIEDGLCTIDSWEDLKKELRDQFLPENAGHLAMEKLVALKHTGGIRDYVRQFSTLMLDIRGTSEKDKVFFFINGLQPWAKTKLHENKVQTLADAMACAERLLDYGNEAGSQRRITPAPNTGGKPYKPPSHRNGSPNRPNGGNDRPSEWTDRPPQNNQAGTSRGPYHQRNHPTTPLQCMLCKGPHKVSYCPHRASLTALQVSIQESNDARIETMLDKKEDQDNPRMGALKFLSALQRKVESKEIVEKGLMFVNATINSQPNRSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKAVNSEALPIVGVSKRVPFKIGDWTGELDLVVARMDDFDVVLGMEFLLEHKVIPMPLAKCLVITDRNPTVIPASIKQPGNLRMISAIQLKRGLAREEPTFMAIPLMEVTTNEETVPNEINEVLNDYADIMPESLPQTLPPRRGIDHEIELIPGVKPPAKNAYRMAPPELAELRKQLDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLLIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTLMNQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVKCGHISMDSDKIKAIQEWKVPTSVSDLRSFLGLANYYRRFVEGFSRRAAPLTELLKKDHTWSWSDDCQMAFEDLKTTMMRGPVLGLVDVTKPFEIETDASDFALGGVLIQEGHPIAFESRKLNDVELRYTVSEKEMLAVVHCLRVWRQYLLGSQFVVKTDNSAICHFFDQPKLTAKQARWQESLAEFDFKFEHKAGKSNQAADALSRKGEHAALCMLAHIHSSKIDGSMRDIIKEHLHKDPSAKTVVELAKAGKTRQFWVEGDLLMTKGNRLYVPRTGELRKKLIQECHDTLWAGHPGWQRTYALIKKGYFWPNMRDDIMQYTKTCLICQQDKVEKAKVSGLLEPLPVPTRPWESVSLDFITHLPKVGDYDAILVIVDRFSKYATFIPTPKLCSAELTAELFFKHIVKLWGIPSSIISDRDGRFIGTFWTELFAFLGTTLNISSSYHPQTDGQTERFNCLLEEYLRHFVDARQKNWIQLLDVAQFCFNCQTSSSTGKSPFEIVSGRQPALPHIIDHPYAGKNPQAHNFTREWKQTTDIARAYLEKASKHMKKWADKKRRPLQFRAGDQVLIKLKPEQIRFRN
Homology
BLAST of CmaCh00G001230 vs. ExPASy Swiss-Prot
Match: P0CT41 (Transposon Tf2-12 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-12 PE=3 SV=1) HSP 1 Score: 482.3 bits (1240), Expect = 2.0e-134 Identity = 278/838 (33.17%), Postives = 449/838 (53.58%), Query Frame = 0 Query: 630 EINEVLNDYADIMPESLPQTLP-PRRGIDHEIELI-PGVKPPAKNAYRMAPPELAELRKQ 689 E+ ++ ++ DI E+ + LP P +G++ E+EL + P +N Y + P ++ + + Sbjct: 373 ELPDIYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRN-YPLPPGKMQAMNDE 432
Query: 690 LDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLLIISDLFDQ 749 +++ LK+G IR +KA PV+F KK+GTLR+ +DY+ LNK N YPL +I L + Sbjct: 433 INQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAK 492
Query: 750 LHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTLMNQ 809 + G+ FTKLDL+S Y+ +R+ +GDE K G FE+LVMP+G++ APA F +N Sbjct: 493 IQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINT 552
Query: 810 VFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFL 869 + E + V+ Y+DDI+++S + EH H+K V KL+ L + + KC F Q+ + F+ Sbjct: 553 ILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFI 612
Query: 870 GHVVKCGHISMDSDKIKAIQEWKVPTSVSDLRSFLGLANYYRRFVEGFSRRAAPLTELLK 929 G+ + + + I + +WK P + +LR FLG NY R+F+ S+ PL LLK Sbjct: 613 GYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLK 672
Query: 930 KDHTWSWSDDCQMAFEDLKTTMMRGPVLGLVDVTKPFEIETDASDFALGGVLIQEG---- 989 KD W W+ A E++K ++ PVL D +K +ETDASD A+G VL Q+ Sbjct: 673 KDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDK 732
Query: 990 -HPIAFESRKLNDVELRYTVSEKEMLAVVHCLRVWRQYLLGS--QFVVKTDN-SAICHFF 1049 +P+ + S K++ +L Y+VS+KEMLA++ L+ WR YL + F + TD+ + I Sbjct: 733 YYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRIT 792
Query: 1050 DQPKLTAKQ-ARWQESLAEFDFKFEHKAGKSNQAADALSRKGEHAALCMLAHIHSSKIDG 1109 ++ + K+ ARWQ L +F+F+ ++ G +N ADALSR + I D Sbjct: 793 NESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDET-----EPIPKDSEDN 852
Query: 1110 SMRDIIKEHLHKDPSAKTVVELAKAGK------------TRQFWVEGDLLMTKGNRLYVP 1169 S+ + + + D + V E K ++ LL+ +++ +P Sbjct: 853 SINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILLP 912
Query: 1170 RTGELRKKLIQECHDTLWAGHPGWQRTYALIKKGYFWPNMRDDIMQYTKTCLICQQDKVE 1229 +L + +I++ H+ HPG + +I + + W +R I +Y + C CQ +K Sbjct: 913 NDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSR 972
Query: 1230 KAKVSGLLEPLPVPTRPWESVSLDFITHLPKVGDYDAILVIVDRFSKYATFIPTPKLCSA 1289 K G L+P+P RPWES+S+DFIT LP+ Y+A+ V+VDRFSK A +P K +A Sbjct: 973 NHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILVPCTKSITA 1032
Query: 1290 ELTAELFFKHIVKLWGIPSSIISDRDGRFIGTFWTELFAFLGTTLNISSSYHPQTDGQTE 1349 E TA +F + ++ +G P II+D D F W + + S Y PQTDGQTE Sbjct: 1033 EQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTE 1092
Query: 1350 RFNCLLEEYLRHFVDARQKNWIQLLDVAQFCFNCQTSSSTGKSPFEIVSGRQPALPHIID 1409 R N +E+ LR W+ + + Q +N S+T +PFEIV PAL + Sbjct: 1093 RTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLEL 1152
Query: 1410 HPYAGKNPQAHNFTREWKQTTDIARAYLEKASKHMKKWADKKRRPL-QFRAGDQVLIK 1444 ++ K + ++E Q + +L + MKK+ D K + + +F+ GD V++K Sbjct: 1153 PSFSDKTDEN---SQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVK 1201
BLAST of CmaCh00G001230 vs. ExPASy Swiss-Prot
Match: P0CT34 (Transposon Tf2-1 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-1 PE=3 SV=1) HSP 1 Score: 482.3 bits (1240), Expect = 2.0e-134 Identity = 278/838 (33.17%), Postives = 449/838 (53.58%), Query Frame = 0 Query: 630 EINEVLNDYADIMPESLPQTLP-PRRGIDHEIELI-PGVKPPAKNAYRMAPPELAELRKQ 689 E+ ++ ++ DI E+ + LP P +G++ E+EL + P +N Y + P ++ + + Sbjct: 373 ELPDIYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRN-YPLPPGKMQAMNDE 432
Query: 690 LDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLLIISDLFDQ 749 +++ LK+G IR +KA PV+F KK+GTLR+ +DY+ LNK N YPL +I L + Sbjct: 433 INQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAK 492
Query: 750 LHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTLMNQ 809 + G+ FTKLDL+S Y+ +R+ +GDE K G FE+LVMP+G++ APA F +N Sbjct: 493 IQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINT 552
Query: 810 VFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFL 869 + E + V+ Y+DDI+++S + EH H+K V KL+ L + + KC F Q+ + F+ Sbjct: 553 ILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFI 612
Query: 870 GHVVKCGHISMDSDKIKAIQEWKVPTSVSDLRSFLGLANYYRRFVEGFSRRAAPLTELLK 929 G+ + + + I + +WK P + +LR FLG NY R+F+ S+ PL LLK Sbjct: 613 GYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLK 672
Query: 930 KDHTWSWSDDCQMAFEDLKTTMMRGPVLGLVDVTKPFEIETDASDFALGGVLIQEG---- 989 KD W W+ A E++K ++ PVL D +K +ETDASD A+G VL Q+ Sbjct: 673 KDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDK 732
Query: 990 -HPIAFESRKLNDVELRYTVSEKEMLAVVHCLRVWRQYLLGS--QFVVKTDN-SAICHFF 1049 +P+ + S K++ +L Y+VS+KEMLA++ L+ WR YL + F + TD+ + I Sbjct: 733 YYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRIT 792
Query: 1050 DQPKLTAKQ-ARWQESLAEFDFKFEHKAGKSNQAADALSRKGEHAALCMLAHIHSSKIDG 1109 ++ + K+ ARWQ L +F+F+ ++ G +N ADALSR + I D Sbjct: 793 NESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDET-----EPIPKDSEDN 852
Query: 1110 SMRDIIKEHLHKDPSAKTVVELAKAGK------------TRQFWVEGDLLMTKGNRLYVP 1169 S+ + + + D + V E K ++ LL+ +++ +P Sbjct: 853 SINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILLP 912
Query: 1170 RTGELRKKLIQECHDTLWAGHPGWQRTYALIKKGYFWPNMRDDIMQYTKTCLICQQDKVE 1229 +L + +I++ H+ HPG + +I + + W +R I +Y + C CQ +K Sbjct: 913 NDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSR 972
Query: 1230 KAKVSGLLEPLPVPTRPWESVSLDFITHLPKVGDYDAILVIVDRFSKYATFIPTPKLCSA 1289 K G L+P+P RPWES+S+DFIT LP+ Y+A+ V+VDRFSK A +P K +A Sbjct: 973 NHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILVPCTKSITA 1032
Query: 1290 ELTAELFFKHIVKLWGIPSSIISDRDGRFIGTFWTELFAFLGTTLNISSSYHPQTDGQTE 1349 E TA +F + ++ +G P II+D D F W + + S Y PQTDGQTE Sbjct: 1033 EQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTE 1092
Query: 1350 RFNCLLEEYLRHFVDARQKNWIQLLDVAQFCFNCQTSSSTGKSPFEIVSGRQPALPHIID 1409 R N +E+ LR W+ + + Q +N S+T +PFEIV PAL + Sbjct: 1093 RTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLEL 1152
Query: 1410 HPYAGKNPQAHNFTREWKQTTDIARAYLEKASKHMKKWADKKRRPL-QFRAGDQVLIK 1444 ++ K + ++E Q + +L + MKK+ D K + + +F+ GD V++K Sbjct: 1153 PSFSDKTDEN---SQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVK 1201
BLAST of CmaCh00G001230 vs. ExPASy Swiss-Prot
Match: P0CT35 (Transposon Tf2-2 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-2 PE=3 SV=1) HSP 1 Score: 482.3 bits (1240), Expect = 2.0e-134 Identity = 278/838 (33.17%), Postives = 449/838 (53.58%), Query Frame = 0 Query: 630 EINEVLNDYADIMPESLPQTLP-PRRGIDHEIELI-PGVKPPAKNAYRMAPPELAELRKQ 689 E+ ++ ++ DI E+ + LP P +G++ E+EL + P +N Y + P ++ + + Sbjct: 373 ELPDIYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRN-YPLPPGKMQAMNDE 432
Query: 690 LDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLLIISDLFDQ 749 +++ LK+G IR +KA PV+F KK+GTLR+ +DY+ LNK N YPL +I L + Sbjct: 433 INQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAK 492
Query: 750 LHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTLMNQ 809 + G+ FTKLDL+S Y+ +R+ +GDE K G FE+LVMP+G++ APA F +N Sbjct: 493 IQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINT 552
Query: 810 VFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFL 869 + E + V+ Y+DDI+++S + EH H+K V KL+ L + + KC F Q+ + F+ Sbjct: 553 ILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFI 612
Query: 870 GHVVKCGHISMDSDKIKAIQEWKVPTSVSDLRSFLGLANYYRRFVEGFSRRAAPLTELLK 929 G+ + + + I + +WK P + +LR FLG NY R+F+ S+ PL LLK Sbjct: 613 GYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLK 672
Query: 930 KDHTWSWSDDCQMAFEDLKTTMMRGPVLGLVDVTKPFEIETDASDFALGGVLIQEG---- 989 KD W W+ A E++K ++ PVL D +K +ETDASD A+G VL Q+ Sbjct: 673 KDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDK 732
Query: 990 -HPIAFESRKLNDVELRYTVSEKEMLAVVHCLRVWRQYLLGS--QFVVKTDN-SAICHFF 1049 +P+ + S K++ +L Y+VS+KEMLA++ L+ WR YL + F + TD+ + I Sbjct: 733 YYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRIT 792
Query: 1050 DQPKLTAKQ-ARWQESLAEFDFKFEHKAGKSNQAADALSRKGEHAALCMLAHIHSSKIDG 1109 ++ + K+ ARWQ L +F+F+ ++ G +N ADALSR + I D Sbjct: 793 NESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDET-----EPIPKDSEDN 852
Query: 1110 SMRDIIKEHLHKDPSAKTVVELAKAGK------------TRQFWVEGDLLMTKGNRLYVP 1169 S+ + + + D + V E K ++ LL+ +++ +P Sbjct: 853 SINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILLP 912
Query: 1170 RTGELRKKLIQECHDTLWAGHPGWQRTYALIKKGYFWPNMRDDIMQYTKTCLICQQDKVE 1229 +L + +I++ H+ HPG + +I + + W +R I +Y + C CQ +K Sbjct: 913 NDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSR 972
Query: 1230 KAKVSGLLEPLPVPTRPWESVSLDFITHLPKVGDYDAILVIVDRFSKYATFIPTPKLCSA 1289 K G L+P+P RPWES+S+DFIT LP+ Y+A+ V+VDRFSK A +P K +A Sbjct: 973 NHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILVPCTKSITA 1032
Query: 1290 ELTAELFFKHIVKLWGIPSSIISDRDGRFIGTFWTELFAFLGTTLNISSSYHPQTDGQTE 1349 E TA +F + ++ +G P II+D D F W + + S Y PQTDGQTE Sbjct: 1033 EQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTE 1092
Query: 1350 RFNCLLEEYLRHFVDARQKNWIQLLDVAQFCFNCQTSSSTGKSPFEIVSGRQPALPHIID 1409 R N +E+ LR W+ + + Q +N S+T +PFEIV PAL + Sbjct: 1093 RTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLEL 1152
Query: 1410 HPYAGKNPQAHNFTREWKQTTDIARAYLEKASKHMKKWADKKRRPL-QFRAGDQVLIK 1444 ++ K + ++E Q + +L + MKK+ D K + + +F+ GD V++K Sbjct: 1153 PSFSDKTDEN---SQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVK 1201
BLAST of CmaCh00G001230 vs. ExPASy Swiss-Prot
Match: P0CT36 (Transposon Tf2-3 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-3 PE=1 SV=1) HSP 1 Score: 482.3 bits (1240), Expect = 2.0e-134 Identity = 278/838 (33.17%), Postives = 449/838 (53.58%), Query Frame = 0 Query: 630 EINEVLNDYADIMPESLPQTLP-PRRGIDHEIELI-PGVKPPAKNAYRMAPPELAELRKQ 689 E+ ++ ++ DI E+ + LP P +G++ E+EL + P +N Y + P ++ + + Sbjct: 373 ELPDIYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRN-YPLPPGKMQAMNDE 432
Query: 690 LDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLLIISDLFDQ 749 +++ LK+G IR +KA PV+F KK+GTLR+ +DY+ LNK N YPL +I L + Sbjct: 433 INQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAK 492
Query: 750 LHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTLMNQ 809 + G+ FTKLDL+S Y+ +R+ +GDE K G FE+LVMP+G++ APA F +N Sbjct: 493 IQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINT 552
Query: 810 VFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFL 869 + E + V+ Y+DDI+++S + EH H+K V KL+ L + + KC F Q+ + F+ Sbjct: 553 ILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFI 612
Query: 870 GHVVKCGHISMDSDKIKAIQEWKVPTSVSDLRSFLGLANYYRRFVEGFSRRAAPLTELLK 929 G+ + + + I + +WK P + +LR FLG NY R+F+ S+ PL LLK Sbjct: 613 GYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLK 672
Query: 930 KDHTWSWSDDCQMAFEDLKTTMMRGPVLGLVDVTKPFEIETDASDFALGGVLIQEG---- 989 KD W W+ A E++K ++ PVL D +K +ETDASD A+G VL Q+ Sbjct: 673 KDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDK 732
Query: 990 -HPIAFESRKLNDVELRYTVSEKEMLAVVHCLRVWRQYLLGS--QFVVKTDN-SAICHFF 1049 +P+ + S K++ +L Y+VS+KEMLA++ L+ WR YL + F + TD+ + I Sbjct: 733 YYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRIT 792
Query: 1050 DQPKLTAKQ-ARWQESLAEFDFKFEHKAGKSNQAADALSRKGEHAALCMLAHIHSSKIDG 1109 ++ + K+ ARWQ L +F+F+ ++ G +N ADALSR + I D Sbjct: 793 NESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDET-----EPIPKDSEDN 852
Query: 1110 SMRDIIKEHLHKDPSAKTVVELAKAGK------------TRQFWVEGDLLMTKGNRLYVP 1169 S+ + + + D + V E K ++ LL+ +++ +P Sbjct: 853 SINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILLP 912
Query: 1170 RTGELRKKLIQECHDTLWAGHPGWQRTYALIKKGYFWPNMRDDIMQYTKTCLICQQDKVE 1229 +L + +I++ H+ HPG + +I + + W +R I +Y + C CQ +K Sbjct: 913 NDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSR 972
Query: 1230 KAKVSGLLEPLPVPTRPWESVSLDFITHLPKVGDYDAILVIVDRFSKYATFIPTPKLCSA 1289 K G L+P+P RPWES+S+DFIT LP+ Y+A+ V+VDRFSK A +P K +A Sbjct: 973 NHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILVPCTKSITA 1032
Query: 1290 ELTAELFFKHIVKLWGIPSSIISDRDGRFIGTFWTELFAFLGTTLNISSSYHPQTDGQTE 1349 E TA +F + ++ +G P II+D D F W + + S Y PQTDGQTE Sbjct: 1033 EQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTE 1092
Query: 1350 RFNCLLEEYLRHFVDARQKNWIQLLDVAQFCFNCQTSSSTGKSPFEIVSGRQPALPHIID 1409 R N +E+ LR W+ + + Q +N S+T +PFEIV PAL + Sbjct: 1093 RTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLEL 1152
Query: 1410 HPYAGKNPQAHNFTREWKQTTDIARAYLEKASKHMKKWADKKRRPL-QFRAGDQVLIK 1444 ++ K + ++E Q + +L + MKK+ D K + + +F+ GD V++K Sbjct: 1153 PSFSDKTDEN---SQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVK 1201
BLAST of CmaCh00G001230 vs. ExPASy Swiss-Prot
Match: P0CT37 (Transposon Tf2-4 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-4 PE=3 SV=1) HSP 1 Score: 482.3 bits (1240), Expect = 2.0e-134 Identity = 278/838 (33.17%), Postives = 449/838 (53.58%), Query Frame = 0 Query: 630 EINEVLNDYADIMPESLPQTLP-PRRGIDHEIELI-PGVKPPAKNAYRMAPPELAELRKQ 689 E+ ++ ++ DI E+ + LP P +G++ E+EL + P +N Y + P ++ + + Sbjct: 373 ELPDIYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRN-YPLPPGKMQAMNDE 432
Query: 690 LDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLLIISDLFDQ 749 +++ LK+G IR +KA PV+F KK+GTLR+ +DY+ LNK N YPL +I L + Sbjct: 433 INQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAK 492
Query: 750 LHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTLMNQ 809 + G+ FTKLDL+S Y+ +R+ +GDE K G FE+LVMP+G++ APA F +N Sbjct: 493 IQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINT 552
Query: 810 VFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFL 869 + E + V+ Y+DDI+++S + EH H+K V KL+ L + + KC F Q+ + F+ Sbjct: 553 ILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFI 612
Query: 870 GHVVKCGHISMDSDKIKAIQEWKVPTSVSDLRSFLGLANYYRRFVEGFSRRAAPLTELLK 929 G+ + + + I + +WK P + +LR FLG NY R+F+ S+ PL LLK Sbjct: 613 GYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLK 672
Query: 930 KDHTWSWSDDCQMAFEDLKTTMMRGPVLGLVDVTKPFEIETDASDFALGGVLIQEG---- 989 KD W W+ A E++K ++ PVL D +K +ETDASD A+G VL Q+ Sbjct: 673 KDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDK 732
Query: 990 -HPIAFESRKLNDVELRYTVSEKEMLAVVHCLRVWRQYLLGS--QFVVKTDN-SAICHFF 1049 +P+ + S K++ +L Y+VS+KEMLA++ L+ WR YL + F + TD+ + I Sbjct: 733 YYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRIT 792
Query: 1050 DQPKLTAKQ-ARWQESLAEFDFKFEHKAGKSNQAADALSRKGEHAALCMLAHIHSSKIDG 1109 ++ + K+ ARWQ L +F+F+ ++ G +N ADALSR + I D Sbjct: 793 NESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDET-----EPIPKDSEDN 852
Query: 1110 SMRDIIKEHLHKDPSAKTVVELAKAGK------------TRQFWVEGDLLMTKGNRLYVP 1169 S+ + + + D + V E K ++ LL+ +++ +P Sbjct: 853 SINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILLP 912
Query: 1170 RTGELRKKLIQECHDTLWAGHPGWQRTYALIKKGYFWPNMRDDIMQYTKTCLICQQDKVE 1229 +L + +I++ H+ HPG + +I + + W +R I +Y + C CQ +K Sbjct: 913 NDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSR 972
Query: 1230 KAKVSGLLEPLPVPTRPWESVSLDFITHLPKVGDYDAILVIVDRFSKYATFIPTPKLCSA 1289 K G L+P+P RPWES+S+DFIT LP+ Y+A+ V+VDRFSK A +P K +A Sbjct: 973 NHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILVPCTKSITA 1032
Query: 1290 ELTAELFFKHIVKLWGIPSSIISDRDGRFIGTFWTELFAFLGTTLNISSSYHPQTDGQTE 1349 E TA +F + ++ +G P II+D D F W + + S Y PQTDGQTE Sbjct: 1033 EQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTE 1092
Query: 1350 RFNCLLEEYLRHFVDARQKNWIQLLDVAQFCFNCQTSSSTGKSPFEIVSGRQPALPHIID 1409 R N +E+ LR W+ + + Q +N S+T +PFEIV PAL + Sbjct: 1093 RTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLEL 1152
Query: 1410 HPYAGKNPQAHNFTREWKQTTDIARAYLEKASKHMKKWADKKRRPL-QFRAGDQVLIK 1444 ++ K + ++E Q + +L + MKK+ D K + + +F+ GD V++K Sbjct: 1153 PSFSDKTDEN---SQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVK 1201
BLAST of CmaCh00G001230 vs. TAIR 10
Match: ATMG00860.1 (DNA/RNA polymerases superfamily protein ) HSP 1 Score: 104.0 bits (258), Expect = 1.0e-21 Identity = 53/130 (40.77%), Postives = 78/130 (60.00%), Query Frame = 0 Query: 837 HLKLVFDKLRQNQLYVKKEKCAFAQTCINFLG--HVVKCGHISMDSDKIKAIQEWKVPTS 896 HL +V Q+Q Y ++KCAF Q I +LG H++ +S D K++A+ W P + Sbjct: 3 HLGMVLQIWEQHQFYANRKKCAFGQPQIAYLGHRHIISGEGVSADPAKLEAMVGWPEPKN 62
Query: 897 VSDLRSFLGLANYYRRFVEGFSRRAAPLTELLKKDHTWSWSDDCQMAFEDLKTTMMRGPV 956 ++LR FLGL YYRRFV+ + + PLTELLKK ++ W++ +AF+ LK + PV Sbjct: 63 TTELRGFLGLTGYYRRFVKNYGKIVRPLTELLKK-NSLKWTEMAALAFKALKGAVTTLPV 122
Query: 957 LGLVDVTKPF 965 L L D+ PF Sbjct: 123 LALPDLKLPF 131
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
P0CT41 | 2.0e-134 | 33.17 | Transposon Tf2-12 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24... | [more] |
P0CT34 | 2.0e-134 | 33.17 | Transposon Tf2-1 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248... | [more] |
P0CT35 | 2.0e-134 | 33.17 | Transposon Tf2-2 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248... | [more] |
P0CT36 | 2.0e-134 | 33.17 | Transposon Tf2-3 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248... | [more] |
P0CT37 | 2.0e-134 | 33.17 | Transposon Tf2-4 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248... | [more] |
Match Name | E-value | Identity | Description | |
ATMG00860.1 | 1.0e-21 | 40.77 | DNA/RNA polymerases superfamily protein | [more] |
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment |
IPR005162 | Retrotransposon gag domain | PFAM | PF03732 | Retrotrans_gag | coord: 201..295 e-value: 1.4E-16 score: 60.5 |
None | No IPR available | PFAM | PF08284 | RVP_2 | coord: 469..564 e-value: 9.5E-11 score: 41.6 |
None | No IPR available | GENE3D | 1.10.340.70 | | coord: 1115..1204 e-value: 1.0E-20 score: 75.8 |
None | No IPR available | GENE3D | 3.10.10.10 | HIV Type 1 Reverse Transcriptase, subunit A, domain 1 | coord: 656..796 e-value: 1.6E-87 score: 294.3 |
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 351..389 |
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 74..95 |
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 326..389 |
None | No IPR available | PANTHER | PTHR47266 | FAMILY NOT NAMED | coord: 666..1448 |
None | No IPR available | CDD | cd09274 | RNase_HI_RT_Ty3 | coord: 965..1079 e-value: 2.80368E-56 score: 188.856 |
None | No IPR available | CDD | cd01647 | RT_LTR | coord: 695..871 e-value: 8.19232E-86 score: 275.243 |
None | No IPR available | CDD | cd00303 | retropepsin_like | coord: 470..559 e-value: 1.07485E-18 score: 80.4583 |
IPR000477 | Reverse transcriptase domain | PFAM | PF00078 | RVT_1 | coord: 712..870 e-value: 6.4E-28 score: 97.8 |
IPR000477 | Reverse transcriptase domain | PROSITE | PS50878 | RT_POL | coord: 692..871 score: 15.209808 |
IPR021109 | Aspartic peptidase domain superfamily | GENE3D | 2.40.70.10 | Acid Proteases | coord: 450..585 e-value: 5.0E-20 score: 73.6 |
IPR021109 | Aspartic peptidase domain superfamily | SUPERFAMILY | 50630 | Acid proteases | coord: 464..564 |
IPR041588 | Integrase zinc-binding domain | PFAM | PF17921 | Integrase_H2C2 | coord: 1150..1204 e-value: 5.4E-21 score: 74.4 |
IPR041577 | Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain | PFAM | PF17919 | RT_RNaseH_2 | coord: 934..1028 e-value: 1.3E-30 score: 105.3 |
IPR043128 | Reverse transcriptase/Diguanylate cyclase domain | GENE3D | 3.30.70.270 | | coord: 736..871 e-value: 1.6E-87 score: 294.3 |
IPR043128 | Reverse transcriptase/Diguanylate cyclase domain | GENE3D | 3.30.70.270 | | coord: 880..970 e-value: 1.1E-28 score: 101.1 |
IPR036397 | Ribonuclease H superfamily | GENE3D | 3.30.420.10 | | coord: 1215..1412 e-value: 4.0E-45 score: 155.6 |
IPR001584 | Integrase, catalytic core | PROSITE | PS50994 | INTEGRASE | coord: 1219..1378 score: 22.256716 |
IPR012337 | Ribonuclease H-like superfamily | SUPERFAMILY | 53098 | Ribonuclease H-like | coord: 1216..1372 |
IPR043502 | DNA/RNA polymerase superfamily | SUPERFAMILY | 56672 | DNA/RNA polymerases | coord: 634..1064 |
Relationships
The following mRNA feature(s) are a part of this gene:
GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category |
Term Accession |
Term Name |
biological_process |
GO:0015074 |
DNA integration |
biological_process |
GO:0090305 |
nucleic acid phosphodiester bond hydrolysis |
biological_process |
GO:0006508 |
proteolysis |
biological_process |
GO:0006278 |
RNA-dependent DNA biosynthetic process |
molecular_function |
GO:0004190 |
aspartic-type endopeptidase activity |
molecular_function |
GO:0004519 |
endonuclease activity |
molecular_function |
GO:0003676 |
nucleic acid binding |
molecular_function |
GO:0003964 |
RNA-directed DNA polymerase activity |
molecular_function |
GO:0008270 |
zinc ion binding |
|