CmaCh00G001230 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh00G001230
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: Reverse transcriptase
Location: Cma_Chr00: 5540263 .. 5544624 (+)
RNA-Seq Expression: CmaCh00G001230
Synteny: CmaCh00G001230
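
The Location field can be turned into a feature length with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the GFF3 convention); this page does not state the convention, so treat that as an assumption. The helper name `span_length` is hypothetical.

```python
def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field: Cma_Chr00: 5540263 .. 5544624 (+)
gene_length = span_length(5540263, 5544624)

# A span that is entirely coding should be a whole number of codons.
whole_codons, leftover = divmod(gene_length, 3)

print(gene_length, whole_codons, leftover)
```

Under that assumption the span is 4362 nt, a multiple of three, which is consistent with the gene, mRNA and CDS sequences on this page being identical (no intron).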
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCGACGACAAAGTCGACACCCAAAGCACAAGTCGACCGGCTTACGCTAATAGAGGAGGAAATGCTGTTCCTCAAAGAAGTCCCTGACACCCTCCGCTTCCTGGAAGCACGGGTGACCGAATTGAGTGAGAAAGTCGTAGGAATCGACGCAATGGGCAACCGCCTGGATGGGTTGCCAATCGCAGAATTGATGTTTCGAGTGACCTCGCTCGAAGAAAGAGTTGCTCCTACGAGCAGCCCAAAACCGTCTGGTAGTCCGGATAGCTCTGTCGCGCACAAGGAGGGACGTGGCGAAGAGTTCGACGTGCTACAAAATACAATGATGAGCTTGTTCAATGGATTAGCTGATGAGTTCAGAACAACAATCGACGACATCCAAGAAAGGATGGCCTCCATGGGCACTCGAATTGAAGTGACCATGAAAGCCGTGGAGAACGTCACGGCTGGGCAAGCTAATACAGGGTCCAACAAACTAAGATTCCTAGATCCTAGAGCCTTTAAAGGGAATCGGGACGCCAAAGAGTTGGAAAACTTCATCTTTGATGTCGAACAGTACTTCAAAGCCACAACGGCTTGTACTGACGACAAGAAGGTGACCGTAGCCTCGATGTATCTCATAGACGATGCCAAACTGTGGTGGCGTACGAAGGTGCAAGACATCGAGGATGGTTTGTGCACCATTGACTCGTGGGAGGACCTTAAGAAAGAGTTGAGGGACCAGTTCCTCCCCGAAAACGCAGGACATTTAGCAATGGAAAAACTAGTAGCCCTGAAACACACTGGAGGCATACGAGACTATGTCAGACAGTTCTCAACCCTGATGCTAGATATCAGGGGCACATCAGAGAAGGACAAGGTGTTCTTCTTTATAAATGGGTTACAGCCGTGGGCCAAAACAAAACTACACGAGAACAAGGTTCAAACCCTAGCTGACGCAATGGCCTGTGCCGAGAGACTCCTAGACTATGGGAATGAAGCGGGATCCCAAAGAAGAATAACACCAGCCCCAAACACTGGGGGCAAGCCATACAAACCACCAAGTCATCGAAATGGAAGCCCCAACAGGCCGAACGGAGGTAACGACAGACCAAGCGAATGGACGGATAGACCTCCTCAGAACAACCAAGCGGGGACATCTCGAGGACCTTACCATCAAAGGAACCACCCGACGACGCCTTTACAATGCATGTTGTGTAAAGGTCCCCACAAGGTATCTTACTGTCCTCATCGGGCCTCTCTCACTGCGCTCCAAGTGTCCATTCAAGAGAGCAATGACGCAAGGATTGAGACTATGCTTGACAAGAAGGAAGATCAAGACAATCCCCGAATGGGCGCACTTAAATTCTTGTCAGCCCTCCAACGGAAGGTCGAATCGAAGGAGATAGTAGAGAAAGGACTCATGTTCGTAAATGCGACAATAAATTCCCAACCGAATAGGAGCACTCTGATAGATTCAGGAGCGACCCACAACTTCATCGCCGATCAAGAAGCCCGAAGATTAGGACTCACTATAGGAAAGGACCCGGGAAAAATGAAAGCTGTCAACTCTGAGGCCTTGCCTATTGTGGGAGTTTCCAAAAGAGTCCCCTTTAAAATAGGGGATTGGACAGGAGAGCTAGATCTTGTCGTAGCTCGCATGGACGACTTTGACGTGGTACTTGGGATGGAGTTCCTCCTAGAACACAAAGTTATCCCAATGCCGTTGGCAAAATGCTTAGTGATCACCGATCGCAACCCCACGGTAATACCTGCAAGCATCAAGCAACCAGGTAATCTTCGAATGATCTCGGCCATACAATTGAAAAGGGGACTCGCGCGAGAGGAACCTACATTTATGGCCATACCATTGATGGAAGTAACAACCAACGAAGAAACTGTCCCAAATGAAATCAATGAGGTACTAAACGACTATGCTGACATAATGCCAGAGAGCTTACCCCAAACATTACCACCTCGTCGAGGCATTGATCACGAAATCGAACTCATCCCCGGAGTTAA
ACCGCCAGCGAAGAACGCATACCGGATGGCTCCACCCGAGCTAGCCGAATTAAGGAAACAACTGGATGAGTTGCTGAAGGCGGGATTCATCCGCCCGGCAAAGGCACCCTACGGAGCCCCCGTACTGTTCCAGAAGAAGAAGGATGGGACGTTGCGTCTGTGCATAGACTATAGAGCCTTAAACAAGGTGACGGTGCGCAACAAATATCCACTGCTGATAATATCCGACTTGTTCGACCAACTTCACGGGGCCAAGTATTTCACGAAGTTGGACTTACGATCAGGGTATTACCAAGTACGTATTGCCGAAGGGGATGAGCCCAAGACGACGTGCGTAACAAGATATGGGGCCTTCGAATTCCTAGTAATGCCCTTTGGCTTGACAAACGCTCCAGCTACTTTCTGCACGTTGATGAACCAAGTTTTCTACGAATACTTGGATCAGTTTGTCATAGTATACCTCGACGACATAGTGGTTTACAGCACAACCCTAGAAGAACACAAAGTGCACTTGAAGCTGGTGTTCGACAAGCTACGACAAAACCAACTGTATGTCAAGAAAGAGAAATGTGCATTCGCACAAACATGCATCAACTTCCTTGGACATGTCGTCAAATGTGGACATATTAGTATGGATAGCGATAAGATAAAAGCTATCCAAGAATGGAAAGTTCCTACTTCCGTATCCGATTTGCGGTCCTTCTTAGGATTAGCCAACTACTATAGGCGGTTCGTCGAAGGGTTTTCACGACGAGCGGCCCCATTGACAGAGCTGTTGAAGAAAGACCACACTTGGTCGTGGTCAGATGATTGTCAAATGGCCTTTGAAGATCTGAAAACAACCATGATGAGGGGTCCTGTCCTCGGATTGGTAGATGTTACAAAGCCATTTGAAATAGAAACAGACGCTTCCGACTTTGCCCTAGGTGGGGTCCTTATTCAAGAAGGCCACCCCATCGCTTTCGAAAGTCGAAAGCTCAATGACGTCGAACTTAGATACACTGTCTCCGAAAAAGAAATGCTGGCAGTAGTCCATTGCCTTCGAGTCTGGAGACAATACCTCTTGGGATCACAGTTCGTAGTGAAGACGGATAACAGCGCCATTTGCCACTTCTTTGATCAACCAAAATTGACGGCAAAACAAGCCCGGTGGCAGGAGTCGTTAGCTGAATTCGACTTCAAGTTCGAACACAAAGCAGGGAAGAGCAATCAAGCAGCCGACGCACTGAGTCGGAAGGGCGAACATGCGGCCCTGTGCATGTTAGCCCATATTCACTCAAGTAAGATCGATGGATCGATGCGCGACATCATCAAGGAACATTTACATAAAGACCCATCGGCCAAAACCGTCGTCGAACTAGCTAAAGCTGGGAAAACACGACAGTTTTGGGTTGAGGGAGACCTTCTGATGACAAAAGGAAACAGATTGTATGTCCCAAGAACGGGAGAACTGAGGAAGAAGCTCATTCAGGAATGTCATGATACCTTATGGGCCGGACACCCTGGGTGGCAAAGAACATACGCTCTAATAAAGAAAGGGTACTTCTGGCCAAACATGCGAGACGACATCATGCAATACACCAAGACGTGCCTCATCTGTCAACAGGACAAAGTCGAGAAAGCCAAAGTCTCAGGACTCTTGGAACCTCTACCTGTGCCGACAAGACCCTGGGAAAGTGTATCTTTGGACTTCATAACACACCTCCCAAAAGTCGGGGACTATGACGCTATCTTGGTTATCGTAGACCGATTCTCAAAATATGCGACGTTCATCCCCACTCCCAAATTATGCTCGGCCGAACTCACAGCTGAACTATTTTTCAAACACATTGTAAAGTTATGGGGTATTCCGTCGAGCATCATCAGTGATCGGGATGGCAGATTCATTGGGACATTCTGGACCGAGTTATTCGCCTTCTTGGGAACAACCTTAAACATCTCCTCGAGTTACCACCCTCAAACCGATGGTCAGACAGAACGGTTCAATTGCTTGCTCGAAG
AATACTTACGTCACTTCGTCGATGCTCGCCAGAAGAACTGGATACAACTGTTAGATGTCGCCCAATTTTGCTTCAATTGTCAAACCAGCTCGTCTACGGGAAAGAGTCCCTTTGAAATTGTAAGTGGACGACAACCGGCCTTACCCCACATTATCGATCATCCTTATGCAGGGAAAAACCCTCAAGCTCACAACTTCACAAGAGAATGGAAGCAGACAACAGATATAGCCCGGGCATATTTAGAGAAGGCTTCCAAGCATATGAAAAAGTGGGCAGACAAGAAGCGTCGCCCCCTTCAATTCCGAGCAGGAGATCAAGTCCTCATCAAGCTGAAACCAGAACAGATCAGATTTCGCAACTGA

mRNA sequence

ATGTCGACGACAAAGTCGACACCCAAAGCACAAGTCGACCGGCTTACGCTAATAGAGGAGGAAATGCTGTTCCTCAAAGAAGTCCCTGACACCCTCCGCTTCCTGGAAGCACGGGTGACCGAATTGAGTGAGAAAGTCGTAGGAATCGACGCAATGGGCAACCGCCTGGATGGGTTGCCAATCGCAGAATTGATGTTTCGAGTGACCTCGCTCGAAGAAAGAGTTGCTCCTACGAGCAGCCCAAAACCGTCTGGTAGTCCGGATAGCTCTGTCGCGCACAAGGAGGGACGTGGCGAAGAGTTCGACGTGCTACAAAATACAATGATGAGCTTGTTCAATGGATTAGCTGATGAGTTCAGAACAACAATCGACGACATCCAAGAAAGGATGGCCTCCATGGGCACTCGAATTGAAGTGACCATGAAAGCCGTGGAGAACGTCACGGCTGGGCAAGCTAATACAGGGTCCAACAAACTAAGATTCCTAGATCCTAGAGCCTTTAAAGGGAATCGGGACGCCAAAGAGTTGGAAAACTTCATCTTTGATGTCGAACAGTACTTCAAAGCCACAACGGCTTGTACTGACGACAAGAAGGTGACCGTAGCCTCGATGTATCTCATAGACGATGCCAAACTGTGGTGGCGTACGAAGGTGCAAGACATCGAGGATGGTTTGTGCACCATTGACTCGTGGGAGGACCTTAAGAAAGAGTTGAGGGACCAGTTCCTCCCCGAAAACGCAGGACATTTAGCAATGGAAAAACTAGTAGCCCTGAAACACACTGGAGGCATACGAGACTATGTCAGACAGTTCTCAACCCTGATGCTAGATATCAGGGGCACATCAGAGAAGGACAAGGTGTTCTTCTTTATAAATGGGTTACAGCCGTGGGCCAAAACAAAACTACACGAGAACAAGGTTCAAACCCTAGCTGACGCAATGGCCTGTGCCGAGAGACTCCTAGACTATGGGAATGAAGCGGGATCCCAAAGAAGAATAACACCAGCCCCAAACACTGGGGGCAAGCCATACAAACCACCAAGTCATCGAAATGGAAGCCCCAACAGGCCGAACGGAGGTAACGACAGACCAAGCGAATGGACGGATAGACCTCCTCAGAACAACCAAGCGGGGACATCTCGAGGACCTTACCATCAAAGGAACCACCCGACGACGCCTTTACAATGCATGTTGTGTAAAGGTCCCCACAAGGTATCTTACTGTCCTCATCGGGCCTCTCTCACTGCGCTCCAAGTGTCCATTCAAGAGAGCAATGACGCAAGGATTGAGACTATGCTTGACAAGAAGGAAGATCAAGACAATCCCCGAATGGGCGCACTTAAATTCTTGTCAGCCCTCCAACGGAAGGTCGAATCGAAGGAGATAGTAGAGAAAGGACTCATGTTCGTAAATGCGACAATAAATTCCCAACCGAATAGGAGCACTCTGATAGATTCAGGAGCGACCCACAACTTCATCGCCGATCAAGAAGCCCGAAGATTAGGACTCACTATAGGAAAGGACCCGGGAAAAATGAAAGCTGTCAACTCTGAGGCCTTGCCTATTGTGGGAGTTTCCAAAAGAGTCCCCTTTAAAATAGGGGATTGGACAGGAGAGCTAGATCTTGTCGTAGCTCGCATGGACGACTTTGACGTGGTACTTGGGATGGAGTTCCTCCTAGAACACAAAGTTATCCCAATGCCGTTGGCAAAATGCTTAGTGATCACCGATCGCAACCCCACGGTAATACCTGCAAGCATCAAGCAACCAGGTAATCTTCGAATGATCTCGGCCATACAATTGAAAAGGGGACTCGCGCGAGAGGAACCTACATTTATGGCCATACCATTGATGGAAGTAACAACCAACGAAGAAACTGTCCCAAATGAAATCAATGAGGTACTAAACGACTATGCTGACATAATGCCAGAGAGCTTACCCCAAACATTACCACCTCGTCGAGGCATTGATCACGAAATCGAACTCATCCCCGGAGTTAA
ACCGCCAGCGAAGAACGCATACCGGATGGCTCCACCCGAGCTAGCCGAATTAAGGAAACAACTGGATGAGTTGCTGAAGGCGGGATTCATCCGCCCGGCAAAGGCACCCTACGGAGCCCCCGTACTGTTCCAGAAGAAGAAGGATGGGACGTTGCGTCTGTGCATAGACTATAGAGCCTTAAACAAGGTGACGGTGCGCAACAAATATCCACTGCTGATAATATCCGACTTGTTCGACCAACTTCACGGGGCCAAGTATTTCACGAAGTTGGACTTACGATCAGGGTATTACCAAGTACGTATTGCCGAAGGGGATGAGCCCAAGACGACGTGCGTAACAAGATATGGGGCCTTCGAATTCCTAGTAATGCCCTTTGGCTTGACAAACGCTCCAGCTACTTTCTGCACGTTGATGAACCAAGTTTTCTACGAATACTTGGATCAGTTTGTCATAGTATACCTCGACGACATAGTGGTTTACAGCACAACCCTAGAAGAACACAAAGTGCACTTGAAGCTGGTGTTCGACAAGCTACGACAAAACCAACTGTATGTCAAGAAAGAGAAATGTGCATTCGCACAAACATGCATCAACTTCCTTGGACATGTCGTCAAATGTGGACATATTAGTATGGATAGCGATAAGATAAAAGCTATCCAAGAATGGAAAGTTCCTACTTCCGTATCCGATTTGCGGTCCTTCTTAGGATTAGCCAACTACTATAGGCGGTTCGTCGAAGGGTTTTCACGACGAGCGGCCCCATTGACAGAGCTGTTGAAGAAAGACCACACTTGGTCGTGGTCAGATGATTGTCAAATGGCCTTTGAAGATCTGAAAACAACCATGATGAGGGGTCCTGTCCTCGGATTGGTAGATGTTACAAAGCCATTTGAAATAGAAACAGACGCTTCCGACTTTGCCCTAGGTGGGGTCCTTATTCAAGAAGGCCACCCCATCGCTTTCGAAAGTCGAAAGCTCAATGACGTCGAACTTAGATACACTGTCTCCGAAAAAGAAATGCTGGCAGTAGTCCATTGCCTTCGAGTCTGGAGACAATACCTCTTGGGATCACAGTTCGTAGTGAAGACGGATAACAGCGCCATTTGCCACTTCTTTGATCAACCAAAATTGACGGCAAAACAAGCCCGGTGGCAGGAGTCGTTAGCTGAATTCGACTTCAAGTTCGAACACAAAGCAGGGAAGAGCAATCAAGCAGCCGACGCACTGAGTCGGAAGGGCGAACATGCGGCCCTGTGCATGTTAGCCCATATTCACTCAAGTAAGATCGATGGATCGATGCGCGACATCATCAAGGAACATTTACATAAAGACCCATCGGCCAAAACCGTCGTCGAACTAGCTAAAGCTGGGAAAACACGACAGTTTTGGGTTGAGGGAGACCTTCTGATGACAAAAGGAAACAGATTGTATGTCCCAAGAACGGGAGAACTGAGGAAGAAGCTCATTCAGGAATGTCATGATACCTTATGGGCCGGACACCCTGGGTGGCAAAGAACATACGCTCTAATAAAGAAAGGGTACTTCTGGCCAAACATGCGAGACGACATCATGCAATACACCAAGACGTGCCTCATCTGTCAACAGGACAAAGTCGAGAAAGCCAAAGTCTCAGGACTCTTGGAACCTCTACCTGTGCCGACAAGACCCTGGGAAAGTGTATCTTTGGACTTCATAACACACCTCCCAAAAGTCGGGGACTATGACGCTATCTTGGTTATCGTAGACCGATTCTCAAAATATGCGACGTTCATCCCCACTCCCAAATTATGCTCGGCCGAACTCACAGCTGAACTATTTTTCAAACACATTGTAAAGTTATGGGGTATTCCGTCGAGCATCATCAGTGATCGGGATGGCAGATTCATTGGGACATTCTGGACCGAGTTATTCGCCTTCTTGGGAACAACCTTAAACATCTCCTCGAGTTACCACCCTCAAACCGATGGTCAGACAGAACGGTTCAATTGCTTGCTCGAAG
AATACTTACGTCACTTCGTCGATGCTCGCCAGAAGAACTGGATACAACTGTTAGATGTCGCCCAATTTTGCTTCAATTGTCAAACCAGCTCGTCTACGGGAAAGAGTCCCTTTGAAATTGTAAGTGGACGACAACCGGCCTTACCCCACATTATCGATCATCCTTATGCAGGGAAAAACCCTCAAGCTCACAACTTCACAAGAGAATGGAAGCAGACAACAGATATAGCCCGGGCATATTTAGAGAAGGCTTCCAAGCATATGAAAAAGTGGGCAGACAAGAAGCGTCGCCCCCTTCAATTCCGAGCAGGAGATCAAGTCCTCATCAAGCTGAAACCAGAACAGATCAGATTTCGCAACTGA

Coding sequence (CDS)

ATGTCGACGACAAAGTCGACACCCAAAGCACAAGTCGACCGGCTTACGCTAATAGAGGAGGAAATGCTGTTCCTCAAAGAAGTCCCTGACACCCTCCGCTTCCTGGAAGCACGGGTGACCGAATTGAGTGAGAAAGTCGTAGGAATCGACGCAATGGGCAACCGCCTGGATGGGTTGCCAATCGCAGAATTGATGTTTCGAGTGACCTCGCTCGAAGAAAGAGTTGCTCCTACGAGCAGCCCAAAACCGTCTGGTAGTCCGGATAGCTCTGTCGCGCACAAGGAGGGACGTGGCGAAGAGTTCGACGTGCTACAAAATACAATGATGAGCTTGTTCAATGGATTAGCTGATGAGTTCAGAACAACAATCGACGACATCCAAGAAAGGATGGCCTCCATGGGCACTCGAATTGAAGTGACCATGAAAGCCGTGGAGAACGTCACGGCTGGGCAAGCTAATACAGGGTCCAACAAACTAAGATTCCTAGATCCTAGAGCCTTTAAAGGGAATCGGGACGCCAAAGAGTTGGAAAACTTCATCTTTGATGTCGAACAGTACTTCAAAGCCACAACGGCTTGTACTGACGACAAGAAGGTGACCGTAGCCTCGATGTATCTCATAGACGATGCCAAACTGTGGTGGCGTACGAAGGTGCAAGACATCGAGGATGGTTTGTGCACCATTGACTCGTGGGAGGACCTTAAGAAAGAGTTGAGGGACCAGTTCCTCCCCGAAAACGCAGGACATTTAGCAATGGAAAAACTAGTAGCCCTGAAACACACTGGAGGCATACGAGACTATGTCAGACAGTTCTCAACCCTGATGCTAGATATCAGGGGCACATCAGAGAAGGACAAGGTGTTCTTCTTTATAAATGGGTTACAGCCGTGGGCCAAAACAAAACTACACGAGAACAAGGTTCAAACCCTAGCTGACGCAATGGCCTGTGCCGAGAGACTCCTAGACTATGGGAATGAAGCGGGATCCCAAAGAAGAATAACACCAGCCCCAAACACTGGGGGCAAGCCATACAAACCACCAAGTCATCGAAATGGAAGCCCCAACAGGCCGAACGGAGGTAACGACAGACCAAGCGAATGGACGGATAGACCTCCTCAGAACAACCAAGCGGGGACATCTCGAGGACCTTACCATCAAAGGAACCACCCGACGACGCCTTTACAATGCATGTTGTGTAAAGGTCCCCACAAGGTATCTTACTGTCCTCATCGGGCCTCTCTCACTGCGCTCCAAGTGTCCATTCAAGAGAGCAATGACGCAAGGATTGAGACTATGCTTGACAAGAAGGAAGATCAAGACAATCCCCGAATGGGCGCACTTAAATTCTTGTCAGCCCTCCAACGGAAGGTCGAATCGAAGGAGATAGTAGAGAAAGGACTCATGTTCGTAAATGCGACAATAAATTCCCAACCGAATAGGAGCACTCTGATAGATTCAGGAGCGACCCACAACTTCATCGCCGATCAAGAAGCCCGAAGATTAGGACTCACTATAGGAAAGGACCCGGGAAAAATGAAAGCTGTCAACTCTGAGGCCTTGCCTATTGTGGGAGTTTCCAAAAGAGTCCCCTTTAAAATAGGGGATTGGACAGGAGAGCTAGATCTTGTCGTAGCTCGCATGGACGACTTTGACGTGGTACTTGGGATGGAGTTCCTCCTAGAACACAAAGTTATCCCAATGCCGTTGGCAAAATGCTTAGTGATCACCGATCGCAACCCCACGGTAATACCTGCAAGCATCAAGCAACCAGGTAATCTTCGAATGATCTCGGCCATACAATTGAAAAGGGGACTCGCGCGAGAGGAACCTACATTTATGGCCATACCATTGATGGAAGTAACAACCAACGAAGAAACTGTCCCAAATGAAATCAATGAGGTACTAAACGACTATGCTGACATAATGCCAGAGAGCTTACCCCAAACATTACCACCTCGTCGAGGCATTGATCACGAAATCGAACTCATCCCCGGAGTTAA
ACCGCCAGCGAAGAACGCATACCGGATGGCTCCACCCGAGCTAGCCGAATTAAGGAAACAACTGGATGAGTTGCTGAAGGCGGGATTCATCCGCCCGGCAAAGGCACCCTACGGAGCCCCCGTACTGTTCCAGAAGAAGAAGGATGGGACGTTGCGTCTGTGCATAGACTATAGAGCCTTAAACAAGGTGACGGTGCGCAACAAATATCCACTGCTGATAATATCCGACTTGTTCGACCAACTTCACGGGGCCAAGTATTTCACGAAGTTGGACTTACGATCAGGGTATTACCAAGTACGTATTGCCGAAGGGGATGAGCCCAAGACGACGTGCGTAACAAGATATGGGGCCTTCGAATTCCTAGTAATGCCCTTTGGCTTGACAAACGCTCCAGCTACTTTCTGCACGTTGATGAACCAAGTTTTCTACGAATACTTGGATCAGTTTGTCATAGTATACCTCGACGACATAGTGGTTTACAGCACAACCCTAGAAGAACACAAAGTGCACTTGAAGCTGGTGTTCGACAAGCTACGACAAAACCAACTGTATGTCAAGAAAGAGAAATGTGCATTCGCACAAACATGCATCAACTTCCTTGGACATGTCGTCAAATGTGGACATATTAGTATGGATAGCGATAAGATAAAAGCTATCCAAGAATGGAAAGTTCCTACTTCCGTATCCGATTTGCGGTCCTTCTTAGGATTAGCCAACTACTATAGGCGGTTCGTCGAAGGGTTTTCACGACGAGCGGCCCCATTGACAGAGCTGTTGAAGAAAGACCACACTTGGTCGTGGTCAGATGATTGTCAAATGGCCTTTGAAGATCTGAAAACAACCATGATGAGGGGTCCTGTCCTCGGATTGGTAGATGTTACAAAGCCATTTGAAATAGAAACAGACGCTTCCGACTTTGCCCTAGGTGGGGTCCTTATTCAAGAAGGCCACCCCATCGCTTTCGAAAGTCGAAAGCTCAATGACGTCGAACTTAGATACACTGTCTCCGAAAAAGAAATGCTGGCAGTAGTCCATTGCCTTCGAGTCTGGAGACAATACCTCTTGGGATCACAGTTCGTAGTGAAGACGGATAACAGCGCCATTTGCCACTTCTTTGATCAACCAAAATTGACGGCAAAACAAGCCCGGTGGCAGGAGTCGTTAGCTGAATTCGACTTCAAGTTCGAACACAAAGCAGGGAAGAGCAATCAAGCAGCCGACGCACTGAGTCGGAAGGGCGAACATGCGGCCCTGTGCATGTTAGCCCATATTCACTCAAGTAAGATCGATGGATCGATGCGCGACATCATCAAGGAACATTTACATAAAGACCCATCGGCCAAAACCGTCGTCGAACTAGCTAAAGCTGGGAAAACACGACAGTTTTGGGTTGAGGGAGACCTTCTGATGACAAAAGGAAACAGATTGTATGTCCCAAGAACGGGAGAACTGAGGAAGAAGCTCATTCAGGAATGTCATGATACCTTATGGGCCGGACACCCTGGGTGGCAAAGAACATACGCTCTAATAAAGAAAGGGTACTTCTGGCCAAACATGCGAGACGACATCATGCAATACACCAAGACGTGCCTCATCTGTCAACAGGACAAAGTCGAGAAAGCCAAAGTCTCAGGACTCTTGGAACCTCTACCTGTGCCGACAAGACCCTGGGAAAGTGTATCTTTGGACTTCATAACACACCTCCCAAAAGTCGGGGACTATGACGCTATCTTGGTTATCGTAGACCGATTCTCAAAATATGCGACGTTCATCCCCACTCCCAAATTATGCTCGGCCGAACTCACAGCTGAACTATTTTTCAAACACATTGTAAAGTTATGGGGTATTCCGTCGAGCATCATCAGTGATCGGGATGGCAGATTCATTGGGACATTCTGGACCGAGTTATTCGCCTTCTTGGGAACAACCTTAAACATCTCCTCGAGTTACCACCCTCAAACCGATGGTCAGACAGAACGGTTCAATTGCTTGCTCGAAG
AATACTTACGTCACTTCGTCGATGCTCGCCAGAAGAACTGGATACAACTGTTAGATGTCGCCCAATTTTGCTTCAATTGTCAAACCAGCTCGTCTACGGGAAAGAGTCCCTTTGAAATTGTAAGTGGACGACAACCGGCCTTACCCCACATTATCGATCATCCTTATGCAGGGAAAAACCCTCAAGCTCACAACTTCACAAGAGAATGGAAGCAGACAACAGATATAGCCCGGGCATATTTAGAGAAGGCTTCCAAGCATATGAAAAAGTGGGCAGACAAGAAGCGTCGCCCCCTTCAATTCCGAGCAGGAGATCAAGTCCTCATCAAGCTGAAACCAGAACAGATCAGATTTCGCAACTGA

Protein sequence

MSTTKSTPKAQVDRLTLIEEEMLFLKEVPDTLRFLEARVTELSEKVVGIDAMGNRLDGLPIAELMFRVTSLEERVAPTSSPKPSGSPDSSVAHKEGRGEEFDVLQNTMMSLFNGLADEFRTTIDDIQERMASMGTRIEVTMKAVENVTAGQANTGSNKLRFLDPRAFKGNRDAKELENFIFDVEQYFKATTACTDDKKVTVASMYLIDDAKLWWRTKVQDIEDGLCTIDSWEDLKKELRDQFLPENAGHLAMEKLVALKHTGGIRDYVRQFSTLMLDIRGTSEKDKVFFFINGLQPWAKTKLHENKVQTLADAMACAERLLDYGNEAGSQRRITPAPNTGGKPYKPPSHRNGSPNRPNGGNDRPSEWTDRPPQNNQAGTSRGPYHQRNHPTTPLQCMLCKGPHKVSYCPHRASLTALQVSIQESNDARIETMLDKKEDQDNPRMGALKFLSALQRKVESKEIVEKGLMFVNATINSQPNRSTLIDSGATHNFIADQEARRLGLTIGKDPGKMKAVNSEALPIVGVSKRVPFKIGDWTGELDLVVARMDDFDVVLGMEFLLEHKVIPMPLAKCLVITDRNPTVIPASIKQPGNLRMISAIQLKRGLAREEPTFMAIPLMEVTTNEETVPNEINEVLNDYADIMPESLPQTLPPRRGIDHEIELIPGVKPPAKNAYRMAPPELAELRKQLDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLLIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTLMNQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVKCGHISMDSDKIKAIQEWKVPTSVSDLRSFLGLANYYRRFVEGFSRRAAPLTELLKKDHTWSWSDDCQMAFEDLKTTMMRGPVLGLVDVTKPFEIETDASDFALGGVLIQEGHPIAFESRKLNDVELRYTVSEKEMLAVVHCLRVWRQYLLGSQFVVKTDNSAICHFFDQPKLTAKQARWQESLAEFDFKFEHKAGKSNQAADALSRKGEHAALCMLAHIHSSKIDGSMRDIIKEHLHKDPSAKTVVELAKAGKTRQFWVEGDLLMTKGNRLYVPRTGELRKKLIQECHDTLWAGHPGWQRTYALIKKGYFWPNMRDDIMQYTKTCLICQQDKVEKAKVSGLLEPLPVPTRPWESVSLDFITHLPKVGDYDAILVIVDRFSKYATFIPTPKLCSAELTAELFFKHIVKLWGIPSSIISDRDGRFIGTFWTELFAFLGTTLNISSSYHPQTDGQTERFNCLLEEYLRHFVDARQKNWIQLLDVAQFCFNCQTSSSTGKSPFEIVSGRQPALPHIIDHPYAGKNPQAHNFTREWKQTTDIARAYLEKASKHMKKWADKKRRPLQFRAGDQVLIKLKPEQIRFRN
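
The protein sequence above is expected to be the conceptual translation of the CDS. A minimal sketch of that check, assuming the standard genetic code and using only the first 30 and last 27 nucleotides of the CDS as copied from this page (the function name `translate` and the variable names are illustrative, not part of the database):

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 27 nt of the CDS shown on this page.
cds_start = "ATGTCGACGACAAAGTCGACACCCAAAGCA"
cds_end = "CCAGAACAGATCAGATTTCGCAACTGA"

print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end of the protein sequence
```

The same function can be applied to the full CDS once its lines are joined into a single string.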
Homology
BLAST of CmaCh00G001230 vs. ExPASy Swiss-Prot
Match: P0CT41 (Transposon Tf2-12 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-12 PE=3 SV=1)

HSP 1 Score: 482.3 bits (1240), Expect = 2.0e-134
Identity = 278/838 (33.17%), Positives = 449/838 (53.58%), Query Frame = 0

Query: 630  EINEVLNDYADIMPESLPQTLP-PRRGIDHEIELI-PGVKPPAKNAYRMAPPELAELRKQ 689
            E+ ++  ++ DI  E+  + LP P +G++ E+EL     + P +N Y + P ++  +  +
Sbjct: 373  ELPDIYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRN-YPLPPGKMQAMNDE 432

Query: 690  LDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLLIISDLFDQ 749
            +++ LK+G IR +KA    PV+F  KK+GTLR+ +DY+ LNK    N YPL +I  L  +
Sbjct: 433  INQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAK 492

Query: 750  LHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTLMNQ 809
            + G+  FTKLDL+S Y+ +R+ +GDE K       G FE+LVMP+G++ APA F   +N 
Sbjct: 493  IQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINT 552

Query: 810  VFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFL 869
            +  E  +  V+ Y+DDI+++S +  EH  H+K V  KL+   L + + KC F Q+ + F+
Sbjct: 553  ILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFI 612

Query: 870  GHVVKCGHISMDSDKIKAIQEWKVPTSVSDLRSFLGLANYYRRFVEGFSRRAAPLTELLK 929
            G+ +     +   + I  + +WK P +  +LR FLG  NY R+F+   S+   PL  LLK
Sbjct: 613  GYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLK 672

Query: 930  KDHTWSWSDDCQMAFEDLKTTMMRGPVLGLVDVTKPFEIETDASDFALGGVLIQEG---- 989
            KD  W W+     A E++K  ++  PVL   D +K   +ETDASD A+G VL Q+     
Sbjct: 673  KDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDK 732

Query: 990  -HPIAFESRKLNDVELRYTVSEKEMLAVVHCLRVWRQYLLGS--QFVVKTDN-SAICHFF 1049
             +P+ + S K++  +L Y+VS+KEMLA++  L+ WR YL  +   F + TD+ + I    
Sbjct: 733  YYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRIT 792

Query: 1050 DQPKLTAKQ-ARWQESLAEFDFKFEHKAGKSNQAADALSRKGEHAALCMLAHIHSSKIDG 1109
            ++ +   K+ ARWQ  L +F+F+  ++ G +N  ADALSR  +         I     D 
Sbjct: 793  NESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDET-----EPIPKDSEDN 852

Query: 1110 SMRDIIKEHLHKDPSAKTVVELAKAGK------------TRQFWVEGDLLMTKGNRLYVP 1169
            S+  + +  +  D   + V E     K                 ++  LL+   +++ +P
Sbjct: 853  SINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILLP 912

Query: 1170 RTGELRKKLIQECHDTLWAGHPGWQRTYALIKKGYFWPNMRDDIMQYTKTCLICQQDKVE 1229
               +L + +I++ H+     HPG +    +I + + W  +R  I +Y + C  CQ +K  
Sbjct: 913  NDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSR 972

Query: 1230 KAKVSGLLEPLPVPTRPWESVSLDFITHLPKVGDYDAILVIVDRFSKYATFIPTPKLCSA 1289
              K  G L+P+P   RPWES+S+DFIT LP+   Y+A+ V+VDRFSK A  +P  K  +A
Sbjct: 973  NHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILVPCTKSITA 1032

Query: 1290 ELTAELFFKHIVKLWGIPSSIISDRDGRFIGTFWTELFAFLGTTLNISSSYHPQTDGQTE 1349
            E TA +F + ++  +G P  II+D D  F    W +        +  S  Y PQTDGQTE
Sbjct: 1033 EQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTE 1092

Query: 1350 RFNCLLEEYLRHFVDARQKNWIQLLDVAQFCFNCQTSSSTGKSPFEIVSGRQPALPHIID 1409
            R N  +E+ LR         W+  + + Q  +N    S+T  +PFEIV    PAL  +  
Sbjct: 1093 RTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLEL 1152

Query: 1410 HPYAGKNPQAHNFTREWKQTTDIARAYLEKASKHMKKWADKKRRPL-QFRAGDQVLIK 1444
              ++ K  +    ++E  Q     + +L   +  MKK+ D K + + +F+ GD V++K
Sbjct: 1153 PSFSDKTDEN---SQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVK 1201

BLAST of CmaCh00G001230 vs. ExPASy Swiss-Prot
Match: P0CT34 (Transposon Tf2-1 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-1 PE=3 SV=1)

HSP 1 Score: 482.3 bits (1240), Expect = 2.0e-134
Identity = 278/838 (33.17%), Positives = 449/838 (53.58%), Query Frame = 0

Query: 630  EINEVLNDYADIMPESLPQTLP-PRRGIDHEIELI-PGVKPPAKNAYRMAPPELAELRKQ 689
            E+ ++  ++ DI  E+  + LP P +G++ E+EL     + P +N Y + P ++  +  +
Sbjct: 373  ELPDIYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRN-YPLPPGKMQAMNDE 432

Query: 690  LDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLLIISDLFDQ 749
            +++ LK+G IR +KA    PV+F  KK+GTLR+ +DY+ LNK    N YPL +I  L  +
Sbjct: 433  INQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAK 492

Query: 750  LHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTLMNQ 809
            + G+  FTKLDL+S Y+ +R+ +GDE K       G FE+LVMP+G++ APA F   +N 
Sbjct: 493  IQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINT 552

Query: 810  VFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFL 869
            +  E  +  V+ Y+DDI+++S +  EH  H+K V  KL+   L + + KC F Q+ + F+
Sbjct: 553  ILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFI 612

Query: 870  GHVVKCGHISMDSDKIKAIQEWKVPTSVSDLRSFLGLANYYRRFVEGFSRRAAPLTELLK 929
            G+ +     +   + I  + +WK P +  +LR FLG  NY R+F+   S+   PL  LLK
Sbjct: 613  GYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLK 672

Query: 930  KDHTWSWSDDCQMAFEDLKTTMMRGPVLGLVDVTKPFEIETDASDFALGGVLIQEG---- 989
            KD  W W+     A E++K  ++  PVL   D +K   +ETDASD A+G VL Q+     
Sbjct: 673  KDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDK 732

Query: 990  -HPIAFESRKLNDVELRYTVSEKEMLAVVHCLRVWRQYLLGS--QFVVKTDN-SAICHFF 1049
             +P+ + S K++  +L Y+VS+KEMLA++  L+ WR YL  +   F + TD+ + I    
Sbjct: 733  YYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRIT 792

Query: 1050 DQPKLTAKQ-ARWQESLAEFDFKFEHKAGKSNQAADALSRKGEHAALCMLAHIHSSKIDG 1109
            ++ +   K+ ARWQ  L +F+F+  ++ G +N  ADALSR  +         I     D 
Sbjct: 793  NESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDET-----EPIPKDSEDN 852

Query: 1110 SMRDIIKEHLHKDPSAKTVVELAKAGK------------TRQFWVEGDLLMTKGNRLYVP 1169
            S+  + +  +  D   + V E     K                 ++  LL+   +++ +P
Sbjct: 853  SINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILLP 912

Query: 1170 RTGELRKKLIQECHDTLWAGHPGWQRTYALIKKGYFWPNMRDDIMQYTKTCLICQQDKVE 1229
               +L + +I++ H+     HPG +    +I + + W  +R  I +Y + C  CQ +K  
Sbjct: 913  NDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSR 972

Query: 1230 KAKVSGLLEPLPVPTRPWESVSLDFITHLPKVGDYDAILVIVDRFSKYATFIPTPKLCSA 1289
              K  G L+P+P   RPWES+S+DFIT LP+   Y+A+ V+VDRFSK A  +P  K  +A
Sbjct: 973  NHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILVPCTKSITA 1032

Query: 1290 ELTAELFFKHIVKLWGIPSSIISDRDGRFIGTFWTELFAFLGTTLNISSSYHPQTDGQTE 1349
            E TA +F + ++  +G P  II+D D  F    W +        +  S  Y PQTDGQTE
Sbjct: 1033 EQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTE 1092

Query: 1350 RFNCLLEEYLRHFVDARQKNWIQLLDVAQFCFNCQTSSSTGKSPFEIVSGRQPALPHIID 1409
            R N  +E+ LR         W+  + + Q  +N    S+T  +PFEIV    PAL  +  
Sbjct: 1093 RTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLEL 1152

Query: 1410 HPYAGKNPQAHNFTREWKQTTDIARAYLEKASKHMKKWADKKRRPL-QFRAGDQVLIK 1444
              ++ K  +    ++E  Q     + +L   +  MKK+ D K + + +F+ GD V++K
Sbjct: 1153 PSFSDKTDEN---SQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVK 1201

BLAST of CmaCh00G001230 vs. ExPASy Swiss-Prot
Match: P0CT35 (Transposon Tf2-2 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-2 PE=3 SV=1)

HSP 1 Score: 482.3 bits (1240), Expect = 2.0e-134
Identity = 278/838 (33.17%), Positives = 449/838 (53.58%), Query Frame = 0

Query: 630  EINEVLNDYADIMPESLPQTLP-PRRGIDHEIELI-PGVKPPAKNAYRMAPPELAELRKQ 689
            E+ ++  ++ DI  E+  + LP P +G++ E+EL     + P +N Y + P ++  +  +
Sbjct: 373  ELPDIYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRN-YPLPPGKMQAMNDE 432

Query: 690  LDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLLIISDLFDQ 749
            +++ LK+G IR +KA    PV+F  KK+GTLR+ +DY+ LNK    N YPL +I  L  +
Sbjct: 433  INQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAK 492

Query: 750  LHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTLMNQ 809
            + G+  FTKLDL+S Y+ +R+ +GDE K       G FE+LVMP+G++ APA F   +N 
Sbjct: 493  IQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINT 552

Query: 810  VFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFL 869
            +  E  +  V+ Y+DDI+++S +  EH  H+K V  KL+   L + + KC F Q+ + F+
Sbjct: 553  ILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFI 612

Query: 870  GHVVKCGHISMDSDKIKAIQEWKVPTSVSDLRSFLGLANYYRRFVEGFSRRAAPLTELLK 929
            G+ +     +   + I  + +WK P +  +LR FLG  NY R+F+   S+   PL  LLK
Sbjct: 613  GYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLK 672

Query: 930  KDHTWSWSDDCQMAFEDLKTTMMRGPVLGLVDVTKPFEIETDASDFALGGVLIQEG---- 989
            KD  W W+     A E++K  ++  PVL   D +K   +ETDASD A+G VL Q+     
Sbjct: 673  KDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDK 732

Query: 990  -HPIAFESRKLNDVELRYTVSEKEMLAVVHCLRVWRQYLLGS--QFVVKTDN-SAICHFF 1049
             +P+ + S K++  +L Y+VS+KEMLA++  L+ WR YL  +   F + TD+ + I    
Sbjct: 733  YYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRIT 792

Query: 1050 DQPKLTAKQ-ARWQESLAEFDFKFEHKAGKSNQAADALSRKGEHAALCMLAHIHSSKIDG 1109
            ++ +   K+ ARWQ  L +F+F+  ++ G +N  ADALSR  +         I     D 
Sbjct: 793  NESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDET-----EPIPKDSEDN 852

Query: 1110 SMRDIIKEHLHKDPSAKTVVELAKAGK------------TRQFWVEGDLLMTKGNRLYVP 1169
            S+  + +  +  D   + V E     K                 ++  LL+   +++ +P
Sbjct: 853  SINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILLP 912

Query: 1170 RTGELRKKLIQECHDTLWAGHPGWQRTYALIKKGYFWPNMRDDIMQYTKTCLICQQDKVE 1229
               +L + +I++ H+     HPG +    +I + + W  +R  I +Y + C  CQ +K  
Sbjct: 913  NDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSR 972

Query: 1230 KAKVSGLLEPLPVPTRPWESVSLDFITHLPKVGDYDAILVIVDRFSKYATFIPTPKLCSA 1289
              K  G L+P+P   RPWES+S+DFIT LP+   Y+A+ V+VDRFSK A  +P  K  +A
Sbjct: 973  NHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILVPCTKSITA 1032

Query: 1290 ELTAELFFKHIVKLWGIPSSIISDRDGRFIGTFWTELFAFLGTTLNISSSYHPQTDGQTE 1349
            E TA +F + ++  +G P  II+D D  F    W +        +  S  Y PQTDGQTE
Sbjct: 1033 EQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTE 1092

Query: 1350 RFNCLLEEYLRHFVDARQKNWIQLLDVAQFCFNCQTSSSTGKSPFEIVSGRQPALPHIID 1409
            R N  +E+ LR         W+  + + Q  +N    S+T  +PFEIV    PAL  +  
Sbjct: 1093 RTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLEL 1152

Query: 1410 HPYAGKNPQAHNFTREWKQTTDIARAYLEKASKHMKKWADKKRRPL-QFRAGDQVLIK 1444
              ++ K  +    ++E  Q     + +L   +  MKK+ D K + + +F+ GD V++K
Sbjct: 1153 PSFSDKTDEN---SQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVK 1201

BLAST of CmaCh00G001230 vs. ExPASy Swiss-Prot
Match: P0CT36 (Transposon Tf2-3 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-3 PE=1 SV=1)

HSP 1 Score: 482.3 bits (1240), Expect = 2.0e-134
Identity = 278/838 (33.17%), Positives = 449/838 (53.58%), Query Frame = 0

Query: 630  EINEVLNDYADIMPESLPQTLP-PRRGIDHEIELI-PGVKPPAKNAYRMAPPELAELRKQ 689
            E+ ++  ++ DI  E+  + LP P +G++ E+EL     + P +N Y + P ++  +  +
Sbjct: 373  ELPDIYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRN-YPLPPGKMQAMNDE 432

Query: 690  LDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLLIISDLFDQ 749
            +++ LK+G IR +KA    PV+F  KK+GTLR+ +DY+ LNK    N YPL +I  L  +
Sbjct: 433  INQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAK 492

Query: 750  LHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTLMNQ 809
            + G+  FTKLDL+S Y+ +R+ +GDE K       G FE+LVMP+G++ APA F   +N 
Sbjct: 493  IQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINT 552

Query: 810  VFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFL 869
            +  E  +  V+ Y+DDI+++S +  EH  H+K V  KL+   L + + KC F Q+ + F+
Sbjct: 553  ILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFI 612

Query: 870  GHVVKCGHISMDSDKIKAIQEWKVPTSVSDLRSFLGLANYYRRFVEGFSRRAAPLTELLK 929
            G+ +     +   + I  + +WK P +  +LR FLG  NY R+F+   S+   PL  LLK
Sbjct: 613  GYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLK 672

Query: 930  KDHTWSWSDDCQMAFEDLKTTMMRGPVLGLVDVTKPFEIETDASDFALGGVLIQEG---- 989
            KD  W W+     A E++K  ++  PVL   D +K   +ETDASD A+G VL Q+     
Sbjct: 673  KDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDK 732

Query: 990  -HPIAFESRKLNDVELRYTVSEKEMLAVVHCLRVWRQYLLGS--QFVVKTDN-SAICHFF 1049
             +P+ + S K++  +L Y+VS+KEMLA++  L+ WR YL  +   F + TD+ + I    
Sbjct: 733  YYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRIT 792

Query: 1050 DQPKLTAKQ-ARWQESLAEFDFKFEHKAGKSNQAADALSRKGEHAALCMLAHIHSSKIDG 1109
            ++ +   K+ ARWQ  L +F+F+  ++ G +N  ADALSR  +         I     D 
Sbjct: 793  NESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDET-----EPIPKDSEDN 852

Query: 1110 SMRDIIKEHLHKDPSAKTVVELAKAGK------------TRQFWVEGDLLMTKGNRLYVP 1169
            S+  + +  +  D   + V E     K                 ++  LL+   +++ +P
Sbjct: 853  SINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILLP 912

Query: 1170 RTGELRKKLIQECHDTLWAGHPGWQRTYALIKKGYFWPNMRDDIMQYTKTCLICQQDKVE 1229
               +L + +I++ H+     HPG +    +I + + W  +R  I +Y + C  CQ +K  
Sbjct: 913  NDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSR 972

Query: 1230 KAKVSGLLEPLPVPTRPWESVSLDFITHLPKVGDYDAILVIVDRFSKYATFIPTPKLCSA 1289
              K  G L+P+P   RPWES+S+DFIT LP+   Y+A+ V+VDRFSK A  +P  K  +A
Sbjct: 973  NHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILVPCTKSITA 1032

Query: 1290 ELTAELFFKHIVKLWGIPSSIISDRDGRFIGTFWTELFAFLGTTLNISSSYHPQTDGQTE 1349
            E TA +F + ++  +G P  II+D D  F    W +        +  S  Y PQTDGQTE
Sbjct: 1033 EQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTE 1092

Query: 1350 RFNCLLEEYLRHFVDARQKNWIQLLDVAQFCFNCQTSSSTGKSPFEIVSGRQPALPHIID 1409
            R N  +E+ LR         W+  + + Q  +N    S+T  +PFEIV    PAL  +  
Sbjct: 1093 RTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLEL 1152

Query: 1410 HPYAGKNPQAHNFTREWKQTTDIARAYLEKASKHMKKWADKKRRPL-QFRAGDQVLIK 1444
              ++ K  +    ++E  Q     + +L   +  MKK+ D K + + +F+ GD V++K
Sbjct: 1153 PSFSDKTDEN---SQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVK 1201

BLAST of CmaCh00G001230 vs. ExPASy Swiss-Prot
Match: P0CT37 (Transposon Tf2-4 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-4 PE=3 SV=1)

HSP 1 Score: 482.3 bits (1240), Expect = 2.0e-134
Identity = 278/838 (33.17%), Positives = 449/838 (53.58%), Query Frame = 0

Query: 630  EINEVLNDYADIMPESLPQTLP-PRRGIDHEIELI-PGVKPPAKNAYRMAPPELAELRKQ 689
            E+ ++  ++ DI  E+  + LP P +G++ E+EL     + P +N Y + P ++  +  +
Sbjct: 373  ELPDIYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRN-YPLPPGKMQAMNDE 432

Query: 690  LDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLLIISDLFDQ 749
            +++ LK+G IR +KA    PV+F  KK+GTLR+ +DY+ LNK    N YPL +I  L  +
Sbjct: 433  INQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAK 492

Query: 750  LHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCTLMNQ 809
            + G+  FTKLDL+S Y+ +R+ +GDE K       G FE+LVMP+G++ APA F   +N 
Sbjct: 493  IQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINT 552

Query: 810  VFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFL 869
            +  E  +  V+ Y+DDI+++S +  EH  H+K V  KL+   L + + KC F Q+ + F+
Sbjct: 553  ILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFI 612

Query: 870  GHVVKCGHISMDSDKIKAIQEWKVPTSVSDLRSFLGLANYYRRFVEGFSRRAAPLTELLK 929
            G+ +     +   + I  + +WK P +  +LR FLG  NY R+F+   S+   PL  LLK
Sbjct: 613  GYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLK 672

Query: 930  KDHTWSWSDDCQMAFEDLKTTMMRGPVLGLVDVTKPFEIETDASDFALGGVLIQEG---- 989
            KD  W W+     A E++K  ++  PVL   D +K   +ETDASD A+G VL Q+     
Sbjct: 673  KDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDK 732

Query: 990  -HPIAFESRKLNDVELRYTVSEKEMLAVVHCLRVWRQYLLGS--QFVVKTDN-SAICHFF 1049
             +P+ + S K++  +L Y+VS+KEMLA++  L+ WR YL  +   F + TD+ + I    
Sbjct: 733  YYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRIT 792

Query: 1050 DQPKLTAKQ-ARWQESLAEFDFKFEHKAGKSNQAADALSRKGEHAALCMLAHIHSSKIDG 1109
            ++ +   K+ ARWQ  L +F+F+  ++ G +N  ADALSR  +         I     D 
Sbjct: 793  NESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDET-----EPIPKDSEDN 852

Query: 1110 SMRDIIKEHLHKDPSAKTVVELAKAGK------------TRQFWVEGDLLMTKGNRLYVP 1169
            S+  + +  +  D   + V E     K                 ++  LL+   +++ +P
Sbjct: 853  SINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILLP 912

Query: 1170 RTGELRKKLIQECHDTLWAGHPGWQRTYALIKKGYFWPNMRDDIMQYTKTCLICQQDKVE 1229
               +L + +I++ H+     HPG +    +I + + W  +R  I +Y + C  CQ +K  
Sbjct: 913  NDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSR 972

Query: 1230 KAKVSGLLEPLPVPTRPWESVSLDFITHLPKVGDYDAILVIVDRFSKYATFIPTPKLCSA 1289
              K  G L+P+P   RPWES+S+DFIT LP+   Y+A+ V+VDRFSK A  +P  K  +A
Sbjct: 973  NHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILVPCTKSITA 1032

Query: 1290 ELTAELFFKHIVKLWGIPSSIISDRDGRFIGTFWTELFAFLGTTLNISSSYHPQTDGQTE 1349
            E TA +F + ++  +G P  II+D D  F    W +        +  S  Y PQTDGQTE
Sbjct: 1033 EQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTE 1092

Query: 1350 RFNCLLEEYLRHFVDARQKNWIQLLDVAQFCFNCQTSSSTGKSPFEIVSGRQPALPHIID 1409
            R N  +E+ LR         W+  + + Q  +N    S+T  +PFEIV    PAL  +  
Sbjct: 1093 RTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLEL 1152

Query: 1410 HPYAGKNPQAHNFTREWKQTTDIARAYLEKASKHMKKWADKKRRPL-QFRAGDQVLIK 1444
              ++ K  +    ++E  Q     + +L   +  MKK+ D K + + +F+ GD V++K
Sbjct: 1153 PSFSDKTDEN---SQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVK 1201

BLAST of CmaCh00G001230 vs. TAIR 10
Match: ATMG00860.1 (DNA/RNA polymerases superfamily protein )

HSP 1 Score: 104.0 bits (258), Expect = 1.0e-21
Identity = 53/130 (40.77%), Positives = 78/130 (60.00%), Query Frame = 0

Query: 837 HLKLVFDKLRQNQLYVKKEKCAFAQTCINFLG--HVVKCGHISMDSDKIKAIQEWKVPTS 896
           HL +V     Q+Q Y  ++KCAF Q  I +LG  H++    +S D  K++A+  W  P +
Sbjct: 3   HLGMVLQIWEQHQFYANRKKCAFGQPQIAYLGHRHIISGEGVSADPAKLEAMVGWPEPKN 62

Query: 897 VSDLRSFLGLANYYRRFVEGFSRRAAPLTELLKKDHTWSWSDDCQMAFEDLKTTMMRGPV 956
            ++LR FLGL  YYRRFV+ + +   PLTELLKK ++  W++   +AF+ LK  +   PV
Sbjct: 63  TTELRGFLGLTGYYRRFVKNYGKIVRPLTELLKK-NSLKWTEMAALAFKALKGAVTTLPV 122

Query: 957 LGLVDVTKPF 965
           L L D+  PF
Sbjct: 123 LALPDLKLPF 131
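
The Identity and Positives figures in the HSP header can be recomputed from the alignment itself. The following is a minimal sketch, assuming the usual BLAST midline convention (a letter marks an identical residue, `+` marks a conservative substitution) and taking the alignment length from the query rows including gap characters. The function name `hsp_stats` is hypothetical and not part of any BLAST tool.

```python
# Query rows and midline rows of the TAIR 10 HSP, copied from the alignment above.
query_rows = [
    "HLKLVFDKLRQNQLYVKKEKCAFAQTCINFLG--HVVKCGHISMDSDKIKAIQEWKVPTS",
    "VSDLRSFLGLANYYRRFVEGFSRRAAPLTELLKKDHTWSWSDDCQMAFEDLKTTMMRGPV",
    "LGLVDVTKPF",
]
midline_rows = [
    "HL +V     Q+Q Y  ++KCAF Q  I +LG  H++    +S D  K++A+  W  P +",
    " ++LR FLGL  YYRRFV+ + +   PLTELLKK ++  W++   +AF+ LK  +   PV",
    "L L D+  PF",
]


def hsp_stats(query_rows, midline_rows):
    """Return (identities, positives, alignment_length) for one HSP."""
    # Alignment length counts every column, gaps included.
    length = sum(len(row) for row in query_rows)
    midline = "".join(midline_rows)
    # A letter in the midline means query and subject residues are identical.
    identities = sum(1 for ch in midline if ch.isalpha())
    # Positives are identities plus conservative substitutions ('+').
    positives = identities + midline.count("+")
    return identities, positives, length


identities, positives, length = hsp_stats(query_rows, midline_rows)
print(f"Identity = {identities}/{length} ({100 * identities / length:.2f}%)")
print(f"Positives = {positives}/{length} ({100 * positives / length:.2f}%)")
# Identity = 53/130 (40.77%)
# Positives = 78/130 (60.00%)
```

Because only letters and `+` signs are counted, the result does not depend on how whitespace in the midline was preserved when the page was copied.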

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
P0CT41 | 2.0e-134 | 33.17 | Transposon Tf2-12 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24...
P0CT34 | 2.0e-134 | 33.17 | Transposon Tf2-1 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
P0CT35 | 2.0e-134 | 33.17 | Transposon Tf2-2 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
P0CT36 | 2.0e-134 | 33.17 | Transposon Tf2-3 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
P0CT37 | 2.0e-134 | 33.17 | Transposon Tf2-4 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...

Match Name | E-value | Identity (%) | Description
ATMG00860.1 | 1.0e-21 | 40.77 | DNA/RNA polymerases superfamily protein

InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR005162 | Retrotransposon gag domain | PFAM | PF03732 | Retrotrans_gag | coord: 201..295; e-value: 1.4E-16; score: 60.5
None | No IPR available | PFAM | PF08284 | RVP_2 | coord: 469..564; e-value: 9.5E-11; score: 41.6
None | No IPR available | GENE3D | 1.10.340.70 | - | coord: 1115..1204; e-value: 1.0E-20; score: 75.8
None | No IPR available | GENE3D | 3.10.10.10 | HIV Type 1 Reverse Transcriptase, subunit A, domain 1 | coord: 656..796; e-value: 1.6E-87; score: 294.3
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 351..389
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 74..95
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 326..389
None | No IPR available | PANTHER | PTHR47266 | FAMILY NOT NAMED | coord: 666..1448
None | No IPR available | CDD | cd09274 | RNase_HI_RT_Ty3 | coord: 965..1079; e-value: 2.80368E-56; score: 188.856
None | No IPR available | CDD | cd01647 | RT_LTR | coord: 695..871; e-value: 8.19232E-86; score: 275.243
None | No IPR available | CDD | cd00303 | retropepsin_like | coord: 470..559; e-value: 1.07485E-18; score: 80.4583
IPR000477 | Reverse transcriptase domain | PFAM | PF00078 | RVT_1 | coord: 712..870; e-value: 6.4E-28; score: 97.8
IPR000477 | Reverse transcriptase domain | PROSITE | PS50878 | RT_POL | coord: 692..871; score: 15.209808
IPR021109 | Aspartic peptidase domain superfamily | GENE3D | 2.40.70.10 | Acid Proteases | coord: 450..585; e-value: 5.0E-20; score: 73.6
IPR021109 | Aspartic peptidase domain superfamily | SUPERFAMILY | 50630 | Acid proteases | coord: 464..564
IPR041588 | Integrase zinc-binding domain | PFAM | PF17921 | Integrase_H2C2 | coord: 1150..1204; e-value: 5.4E-21; score: 74.4
IPR041577 | Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain | PFAM | PF17919 | RT_RNaseH_2 | coord: 934..1028; e-value: 1.3E-30; score: 105.3
IPR043128 | Reverse transcriptase/Diguanylate cyclase domain | GENE3D | 3.30.70.270 | - | coord: 736..871; e-value: 1.6E-87; score: 294.3
IPR043128 | Reverse transcriptase/Diguanylate cyclase domain | GENE3D | 3.30.70.270 | - | coord: 880..970; e-value: 1.1E-28; score: 101.1
IPR036397 | Ribonuclease H superfamily | GENE3D | 3.30.420.10 | - | coord: 1215..1412; e-value: 4.0E-45; score: 155.6
IPR001584 | Integrase, catalytic core | PROSITE | PS50994 | INTEGRASE | coord: 1219..1378; score: 22.256716
IPR012337 | Ribonuclease H-like superfamily | SUPERFAMILY | 53098 | Ribonuclease H-like | coord: 1216..1372
IPR043502 | DNA/RNA polymerase superfamily | SUPERFAMILY | 56672 | DNA/RNA polymerases | coord: 634..1064
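
Read in coordinate order, the InterPro hits above outline the domain layout of the predicted polyprotein. The sketch below sorts a hand-picked subset of those hits (the five Pfam hits plus the PROSITE integrase profile) by start coordinate. The choice of subset and the helper names `domain_order` and `non_overlapping` are assumptions made for illustration, not part of the InterPro analysis.

```python
# (source, accession, name, start, end), copied from the InterPro table above.
hits = [
    ("PFAM", "PF17921", "Integrase_H2C2", 1150, 1204),
    ("PFAM", "PF00078", "RVT_1", 712, 870),
    ("PFAM", "PF03732", "Retrotrans_gag", 201, 295),
    ("PROSITE", "PS50994", "INTEGRASE", 1219, 1378),
    ("PFAM", "PF08284", "RVP_2", 469, 564),
    ("PFAM", "PF17919", "RT_RNaseH_2", 934, 1028),
]


def domain_order(hits):
    """Return domain names sorted by start coordinate on the protein."""
    return [name for _, _, name, _, _ in sorted(hits, key=lambda h: h[3])]


def non_overlapping(hits):
    """True if, in coordinate order, each hit ends before the next one starts."""
    ordered = sorted(hits, key=lambda h: h[3])
    return all(a[4] < b[3] for a, b in zip(ordered, ordered[1:]))


print(" -> ".join(domain_order(hits)))
print("non-overlapping:", non_overlapping(hits))
```

For this subset the order runs from the gag domain through the retropepsin-like protease (RVP_2), reverse transcriptase and RNase H-like domains to the integrase zinc-binding and catalytic regions, which matches the GO terms assigned to the gene (proteolysis, RNA-dependent DNA biosynthesis, DNA integration).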

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CmaCh00G001230.1 | CmaCh00G001230.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0015074 | DNA integration
biological_process | GO:0090305 | nucleic acid phosphodiester bond hydrolysis
biological_process | GO:0006508 | proteolysis
biological_process | GO:0006278 | RNA-dependent DNA biosynthetic process
molecular_function | GO:0004190 | aspartic-type endopeptidase activity
molecular_function | GO:0004519 | endonuclease activity
molecular_function | GO:0003676 | nucleic acid binding
molecular_function | GO:0003964 | RNA-directed DNA polymerase activity
molecular_function | GO:0008270 | zinc ion binding