Cp4.1LG09g08350 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG09g08350
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Telomerase activating protein Est1, putative
Location: Cp4.1LG09 : 7706627 .. 7714652 (-)
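The location field can be turned into a span length with a few lines of Python. This is only a sketch: the function name `parse_location` and the regular expression are illustrative, and it assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), which this record does not state.

```python
import re

# Hypothetical pattern for "<seqid> : <start> .. <end> (<strand>)" as shown in this record.
LOC_RE = re.compile(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")

def parse_location(text):
    """Split a location string into seqid, start, end, strand and span length."""
    m = LOC_RE.search(text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    start, end = int(start), int(end)
    # Assumption: both coordinates are inclusive, so the length is end - start + 1.
    return {"seqid": seqid, "start": start, "end": end,
            "strand": strand, "length": end - start + 1}

loc = parse_location("Cp4.1LG09 : 7706627 .. 7714652 (-)")
print(loc["strand"], loc["length"])  # - 8026
```

Under that assumption the gene spans 8,026 bp on the minus strand of Cp4.1LG09.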
The following sequences are available for this feature:

Gene sequence (with intron)

TACTGCGTAGAGGTGTCGTGGTCTTCTTACATAGCTTTCGATTCTCGTCTCTGTGTACCCTCCAAGTGTGCGCTTTTGGCTGGAGAATCTGTTTTGGCCTCCTCTGCTTCAAATATGTTTGCTCTGATTCTTTAATGGCGATGATCGATTTTTGCCTAATTTCTACGAGGTTTGTTTATTGCTACTATCCATTTTTTTTACTTTGTTTTTTTGCCCTTTTGCTTTGTGATCTTCTTGGGACGGCTGTTCTTTACTTTACTTCATGTGGGGTTTCGTAAAATCTGTTTCTGGTGTTTGATTCAGTGTTCTTCTTCTTCTACAGAATTCCTGTGAGTTTTTGGGAATACCCTTGAAACGAACCACACCCAGTAGTCATAGGGTGTTGATTTTCATCTGGACTTCAATTTTGGCTTCTGGGAATGGATTTTTTATGTGTAGGGTTCTGTAGTTGGAGTTTAGTTCTTTGTTTTCGGTTGAATTTTGGGATGTTTCTTCATTATGTTCACGAATTGTTCTTTGAGATCTTACTTGCTCACAAACTAAATGAAAATGAGAGGATTGTTTGAATGTTGCTGCGCATCAGAATGCTTCTTCCACTACTTCAAGTATGGTTTGAAGTAATTGAAACTAGGAGTGGAGCTAGAAATTTTTGTAATGAGTGTAGTGTTGACCTTCCTTTGTTTTTGGCTTCACCCTTGCTTGATATGATGTCTTTGTATATGTGAGATCCCACGTTGGTTGGAGAGGGGAATGAAGCATTCCTTATGAGGGTGTGGAAACCTCTCCCTAGTAGACGCGTTTTAAAACCGTTAGCGATGCATAACGGGCTAAAACGGACGATATTTGCTTGCAGTGGGCTTGAGTTGTTAGAAATGGTATTAGAGCTAGAAATTGGGTGGTGTGTCAATGAGGACGTTGGGCCCTCAAATGGGTGGATTGTGAGATCCCACGTTCGTTGGAGAGGGGAATGAAGCATTTTTATAAGAGTGTAGAAACTTCTCCCTAGCAGACATGTTTTAAAATCATGAGGCTGACAGTGATACGTAATGGGCTAAAGTGTACAATATCTGCTAGCGGTGAGCTTGGGTTGTTACAAATGATATCAGAGCCAGACACAAGGCGGTGTGTTAGCAAGGACACTAGTCCCCAAGAGGGGTGGATTATGAGATCCCACATCGCTTGGAGAGGGGAACGAAACATTCTTTATAAGGGCTAGTAGACGCGTTTTAAAGTCGTGAGGCCGACGACGATACATAATGGGCCAAAACGAATAATATCTGTTAGCCATGGGCTTGGGCTGTTACAAATGGTATCAAAGCTAGACATTAGGTGGTGTGCCAGTGAGGATGTTAGGCCCCCAAAGGGGTGGATTGTGAGATCCCACATCGGTTGGAGAGGGGAACGAAGCGTTCCTTATAAGGGTGTAGAAACCTCTCCCTAGCAGACGTGTTTTAAAATCGTGAGGTTGATGACGATACATAATGGGCTAAAGCAGTTATATATGCTAGTAGTGGGCTTGGGTGGTTAGATATCTTTTTCCTTCCTTTCTTTCCTCTCATTGTTTCACTTCTCACAGGTTAACTTACATGAGAATATAATACACTTGATGTTCCAAAGGTAGTGTACCTTGAAATCAGCTGCTCAATTTTAACCATGGCTACCTCTGCAAATCAAAGCAAAAAGGAGAGCTTGCTTAATGAGGTAATTTGTTGCATTACATTTCATGTTGTTATGGTTGCTTCATGATGTGAGTCATTGCAGGTTGTGAGCTTGGAAAAGCAGTTAACAGCATCAATCCTTTCCAAAGGGATATTACATTCAGATGTCAAAGATCTATACTACAAAGTTTGTTCAATTTACGAGAGAATTTTCCTTAATGAACATGAACAATTAGAGCTTCAAGATGTTGAATATTCTCTATGGAAGCTTCATTACAAGCTAATCGACGAGTTTCGGAAGCGGATAAAGAGAAGCTCTGCAAATGTAGAGAGCCCAAAGTT
GGGGACAGGACAACATTCTAATGATGCACAGCGAAGTAGTAGCAATCATATTGCAGAATTTAGGTTGTTTCTCATGGAAGCAACAAAGTTCTATCAGAAACTGATATTGAAAATCAGGGAGTATAATGGTGTTCAAAAAGAAGGCTTGTTATATAAGGCTTTTGGTGTCTCTAAAGGCATTGATCCAAAGAGAAAGAAGACATGTCAATTCTTATGTCACCGTCTCTTAGTTTGCCTTGGGGATCTTTCTAGGTACATGGAACAACATGAAAAACCAGATGTCCATTCCCATAAGTGGTTAGCTGCTGCTACTCATTACTTAGAAGCAACAATGGTTTGGCCTGATAGTGGAAACCCCCATAATCAGGTGACCTTTTTTATTACTGCAAAAAGGAAATCTATGTGATCCCATATAAGTTGGAGAAGGGATTGAAGCATTCCTTATAAGGGTGTGGAAACCTTTCACTAGTAGATATGTTTTAAAACTGTGAGGCTGACGGTGATATACAGGGGTGGATTGTGGATCCCACATTGGTTGGAGAGGAGAATGAAGCATTCCTTATAAGGGGTGTGGAAACCTTTCCCTAGTAGATGTGTTTTAAAACCGTGAGGCTGATGGCGATACGTAACGGGCCGAAGTTGACAATATCTACTAGCGGTGGGTTTAGGCTGTTACAAATGGTTTTAGAGCCAGACACCGTGTGGTGTGCCAGAGAGGACGCTAGGTCCCCAAGGGGGGTGGATTGTGAGATCCCACATCGGTTGGAGAGGGGAACAAAGCATTCCTTATAAGGTGTGGAAACCTTTCCCTAGTAGACGCGTTTTAAACCTGTGAGGCTGATGATGATAGTAACGGGCCAAAACGGATAATAACTACTAGTGGTGGGCTTGGGCTGTTACAAATGGTATCAGAGCCAGATACTGGATGGTGTGCCAGCGAGGATGCTGGATCCTCAAGGGGGGTGGATTGTGAGATTCCACGTTGGTTGGAGAGGGGAACGAAGCATTCCTTATAAAGGTGTGGAAACCTTTCCCTAGTAGACATGTTTTAAAACTGTGGGGCTAACGACGTGCATAATGAGCCAAAACGAACAATATCTACTAGCGGTGGGCTTAGGCTGTTACAAATGGTATCAGAGCCAGACACCGGGCGGTGTGCCAACGAAGACGTTAGGCCCCAAGGGGGAGATTCCACGTCGGTTGGAGAGGGGAACGAAGCATTCCTTATAAGGGTGTGAACCTTTCGCATTCCTTATACGGGTGTGAAAACCTTTCCCTAGTAGACATGTTTTAAAACTGTGAGGTTGACGACGATGCATAACGAGCCAAAATGAACAATATCTACTAGTGGTGGGCTTAGGTTGTTACAAATGGTATCAGAGCGAAACATCGGGCGGTGTGTCAGCAAAGACGTTAGGCCCCCAAGGGGGTGGATTGTGAGATCCCACATCAGTTGGAGAGGGGAACAAAACATTCCTTATAAGGGTGTGGAAACCTTTTCCTAGTAGACGCATTTTAGAACCGTGAGGCTGACGACGATACATAACGCGCCAAAGCGGACAATATCTACTAACAGTGAGCTTGGGCTGTTACAGAATCATCTTATGAACTCTTTTTTCCAATCATGATATGAGATAGTATAGTATGTTTTACTCAAACTGTTCATAGTTTCTTTCCAGTTGGCTGTATTGGCAACGTATGTTAATGATCAGTTTCTCGCCATGTACCACTGTGTGAGAAGTTCTGCAGTCAAAGAGCCTTTTCCAGATGCTTGGGACAACCTTTTGTTACTATTTGAAAGAGTAAGACTCATTTAGTTTCTATCTTACCCTCTACACCTTTTCTGGTTGTTTTTACCACATTTTAACTATGTGACCTTACAGAATAGGTCATCTCTTATGCCTTCCCTATCTAGGGACGCCGAGTTCGATTTCTTGAGACCGTCCGAGAAGTGCTGTTTAGAAATCAAATCACAAACCAAAGATGATCGCAAGTCTCTA
GAGACCGACTTGTTTTCTCTGCTCATCAGAACATTGGCTTTCTTTTTCATAAATTCGAGGTATTTTAGTCGCTCAACGTTCGGTATATATATACATTCATATTGTTTTTAGTCTAAGAAACTCCTAATTCATGTATACATGACTTTGTGCAACATTTAGGAATGCTTTGCTTATAAAGTTGTTTAGCTTGTTAGAGTCAACTTGAGACCGATGATACTTGTGAGACTGGGTATAGATATATAGTAGATGACTATATCTGGTAAAGTTATATATACTGAAATCATATATATATATATATATATATATTTAGACAGTGCTTTCTGCATTTTCTGTATTCTCTCTTACTTAGTGTACATACCTGAAATCTCTCTCCTGTTCTTTGTGTTCATCTGGATCGAGTGTGTGCGTTGTTGTACGATTCTTACTGTAAAAAAGTGACTAAAGTGTCGAACGAAGGGTGTACTTTGTTCGAGGGCTCCAGAGAAAGGAGTCGAGTCTCGATCAAGGGGAGGCTATTCGAGAGCTCCATAAGCCTCAGGAGAGGCTTATAGTGTACTTTGTTCGAGGGGAGGATTGTTGAGGATTATTGGGAGCGAGTCCCACATTGGTTAATTTAGTGGAAGATCATGGGTTTATAAGTGAGGAATACTATCTTCATTGGTGTGAGGCCTTTTGGGAATCCCAAAGCAAAGCCATGAGAGCTTATGCTCAAAGTAGACGATATCAAACCATTGTGGAGAGTCATGATTCCCAATAGTGTTTATCTAGATCGAGTGTGTGCGTTGTTGTACGATCCTTACTGTAACAGCCCAAGCCCGCCGCTAATAAATATTGACCAATTTAGGCTTTCCCTTTTCGGTTTTCCCTCAAGGTTTTTAAAACGCGTGTGTTGGGGAGAGGTTTCCACACTCTTATAAAGAATGTTTCGTTATCCTCTCCAAGTGATGTAGGATATCATAATTAAGAGTCACTTAACGTTTTCTTTGATTTGCCAATTCCATGTCCACCATTTTTTTCTGAGTTCTAACATTTCTTGATGGATTTCAATACATTACAGTTTGGAGCAATTCACAAGCACATTTTCATCTATGATGAGATCGCTGGATGAACTCTTGTCTCTAGATGATTCTGAATTAAACGTTTCATTAGAGTCGTACGAACTTTTGGATTCAGTGCGAACCGGCCCTTTCCGAGCCATCCAAATTGCTTCCATATTCATCTTCATGGTACAGAATCTTCTCAGTAAAGCNTTTCTGTATTCTCTCTTACTTAGTGTACATACCTGAAATCTCTCTCCTGTTCTTTGTGTTCATCTGGATCGAGTGTGTGCGTTGTTGTACGATTCTTACTGTAAAAAAGTGACTAAAGTGTCGAACGAAGGGTGTACTTTGTTCGAGGGCTCCAGAGAAAGGAGTCGAGTCTCGATCAAGGGGAGGCTATTCGAGAGCTCCATAAGCCTCAGGAGAGGCTTATAGTGTACTTTGTTCGAGGGGAGGATTGTTGAGGATTATTGGGAGCGAGTCCCACATTGGTTAATTTAGTGGAAGATCATGGGTTTATAAGTGAGGAATACTATCTTCATTGGTGTGAGGCCTTTTGGGAATCCCAAAGCAAAGCCATGAGAGCTTATGCTCAAAGTAGACGATATCAAACCATTGTGGAGAGTCATGATTCCCAATAGTGTTTATCTAGATCGAGTGTGTGCGTTGTTGTACGATCCTTACTGTAACAGCCCAAGCCCGCCGCTAATAAATATTGACCAATTTAGGCTTTCCCTTTTCGGTTTTCCCTCAAGGTTTTTAAAACGCGTGTGTTGGGGAGAGGTTTCCACACTCTTATAAAGAATGTTTCGTTATCCTCTCCAAGTGATGTAGGATATCATAATTAAGAGTCACTTAACGTTTTCTTTGATTTGCCAATTCCATGTCCACCATTTTTTTCTGAGTTCTAACATTTCTTGATGGATTTCAATACATTACAGTTTGGAGCAATTCACAAGC
ACATTTTCATCTATGATGAGATCGCTGGATGAACTCTTGTCTCTAGATGATTCTGAATTAAACGTTTCATTAGAGTCGTACGAACTTTTGGATTCAGTGCGAACCGGCCCTTTCCGAGCCATCCAAATTGCTTCCATATTCATCTTCATGGTACAGAATCTTCTCAGTAAAGCTAACCTGAATGATCTGCAGCAACTTGAGCTAACCCACTTGGCATTGGTTGCTACCTTTATTGTCATGGGACGTCTAGTCGAGAGATCGCTGAAGACGAGCCAATGGGAATCATCCCCTCTTTTACCTGCAGTGCTCGTTTTCGTGGAATGGTTACCAAGCGTTCTTGATGAAGTAGTAAGATATGGTTCTGATGCAAAAACTAGAAGCTCCATGTCATACTTTTTTGGTGCTTTTGTTGAGCTTGTACAGAAGCTGAATGTTCATACAGCTGAGGCACATTGTTCTCTTGCTATCCCTCTGTGGGAAGATTATGAGCTAAGAGGCTTCACTCCTTTAGCTTTTGCACATGAACCATTGGATTTCTCATCGCACTGGGAACACATCGACAACTTCGAATGCGGAGCTGAACACCGTGCTTATCGCATAACGGTTGCTGCTACTAAAATTTCGAATATCGCCAAGGATTATCCCGAGTGGATCATTCATGACAAAGTAGAACAAAATGAACTTCCAGATAAGAAAGAACTGGAAGATGAAGAAGTCATTCTTTTCAAGCCCCTCACGAGATACAATTCGGCACCAATCTCTATTGCAGGGAGCGATGAAGCGTCACCGAAAAGCACAGAGGCTCAGACTATATCTTCTGATGAATGTTTGAGGCGTGCCACGTCACTACTTATACAACAGACGCAGGGCCAGACCGATCCCTTTGCTTTTCATAATCTCAGCAGAAACAAGCCATTTGAGCAGCAGCATGATGTTTCAGAAGGCACCATATCAACTGGCCCTCCTTCACTTAGTGCCTGGGTGATCAATAAAGGTTTTACTTTTAACCCTGTTACAGAGAAAGGGCCTGGTTTGCAGCCCATTGATGAGTTAACTCTGGCATTCGTGAACAGTCTTAAACTCGACGATACTGAGAATTCTGCGTCGATTCCGAGCTCGAAATCTGGAAAATCCGACCTTTTTCCACCTCCTCCCTATTCCACCCCAGTACCTTCAGCTCCTTATTTACCTGATGATGCTGTTTGGACTAATGCTACCAATGCTAACATCTCTAGAAACATTGACCAAAATGATACATTTTCAGGAAGTGCTTATTCAAATTGGACTGCCCCTCAAGCTACATATGAACATGGTCCCATGATTGGTGGTCTTACGAATATGTATCCGTCGTCGCATCGAATGAGTTCTTCGGAATGGCTTCGTCAATATCGGGAGAATCAAGTACGGGCACCTCCCTACAATGCTTCTGGAAACGTTATGAACTTGCAAAGAAATGATACTTCAAGGTATGAACATTTGTATCCAACAATGAATATGGAGAGTCCATTGCGTTATCCAGCTTTCCCTGCAGCTTTCCCTGCAGCTTTCCCTGCAGCTTACAGCACGAATGAGAACCAAAAAAACATGTTTTTCCATGGTTATGAAAGGCCAAACCTGTATGGCTGCGGTGTTATTGATTTGAGAAGTGAGCAGCCACCGGTTCTGATGTATCTAAAAGATAAAGAGTGGCAGCTGCAAAAGGATGCTGCTAATAGAAGTGCTGCCTATATGGGGAATTGAGGATTTTCGTAGTTTTGTCTATATTCATCGTTTACTCGAACCGAAAACTCGAAACTGTCTAATGAAAGCGTTTGCGTTTGTATATACGTGTATAGAAAGTCGAAAAAATTATCTCAAGTAAACGTGTATTAAGGAATTGTCTCGCCTCGACATTGTAGAAGATCGTTTCCTTTTCTTAAAGATGTGTGAAAGCGGACCTTCATGTTTGGTGATATAATGTGTCCTTGAGTTTCATTTACGACTGGCCTAAAGTTAAGTCA
CGTTCAATGAAATCTAGTTTCATTAA

mRNA sequence

TACTGCGTAGAGGTGTCGTGGTCTTCTTACATAGCTTTCGATTCTCGTCTCTGTGTACCCTCCAAGTGTGCGCTTTTGGCTGGAGAATCTGTTTTGGCCTCCTCTGCTTCAAATATGTTTGCTCTGATTCTTTAATGGCGATGATCGATTTTTGCCTAATTTCTACGAGGTTAACTTACATGAGAATATAATACACTTGATGTTCCAAAGGTAGTGTACCTTGAAATCAGCTGCTCAATTTTAACCATGGCTACCTCTGCAAATCAAAGCAAAAAGGAGAGCTTGCTTAATGAGGTTGTGAGCTTGGAAAAGCAGTTAACAGCATCAATCCTTTCCAAAGGGATATTACATTCAGATGTCAAAGATCTATACTACAAAGTTTGTTCAATTTACGAGAGAATTTTCCTTAATGAACATGAACAATTAGAGCTTCAAGATGTTGAATATTCTCTATGGAAGCTTCATTACAAGCTAATCGACGAGTTTCGGAAGCGGATAAAGAGAAGCTCTGCAAATGTAGAGAGCCCAAAGTTGGGGACAGGACAACATTCTAATGATGCACAGCGAAGTAGTAGCAATCATATTGCAGAATTTAGGTTGTTTCTCATGGAAGCAACAAAGTTCTATCAGAAACTGATATTGAAAATCAGGGAGTATAATGGTGTTCAAAAAGAAGGCTTGTTATATAAGGCTTTTGGTGTCTCTAAAGGCATTGATCCAAAGAGAAAGAAGACATGTCAATTCTTATGTCACCGTCTCTTAGTTTGCCTTGGGGATCTTTCTAGTTCTGCAGTCAAAGAGCCTTTTCCAGATGCTTGGGACAACCTTTTGTTACTATTTGAAAGAAATAGGTCATCTCTTATGCCTTCCCTATCTAGGGACGCCGAGTTCGATTTCTTGAGACCGTCCGAGAAGTGCTGTTTAGAAATCAAATCACAAACCAAAGATGATCGCAAGTCTCTAGAGACCGACTTGTTTTCTCTGCTCATCAGAACATTGGCTTTCTTTTTCATAAATTCGAGTTTGGAGCAATTCACAAGCACATTTTCATCTATGATGAGATCGCTGGATGAACTCTTGTCTCTAGATGATTCTGAATTAAACGTTTCATTAGAGTCTTTGGAGCAATTCACAAGCACATTTTCATCTATGATGAGATCGCTGGATGAACTCTTGTCTCTAGATGATTCTGAATTAAACGTTTCATTAGAGTCGTACGAACTTTTGGATTCAGTGCGAACCGGCCCTTTCCGAGCCATCCAAATTGCTTCCATATTCATCTTCATGGTACAGAATCTTCTCAGTAAAGCTAACCTGAATGATCTGCAGCAACTTGAGCTAACCCACTTGGCATTGGTTGCTACCTTTATTGTCATGGGACGTCTAGTCGAGAGATCGCTGAAGACGAGCCAATGGGAATCATCCCCTCTTTTACCTGCAGTGCTCGTTTTCGTGGAATGGTTACCAAGCGTTCTTGATGAAGTAGTAAGATATGGTTCTGATGCAAAAACTAGAAGCTCCATGTCATACTTTTTTGGTGCTTTTGTTGAGCTTGTACAGAAGCTGAATGTTCATACAGCTGAGGCACATTGTTCTCTTGCTATCCCTCTGTGGGAAGATTATGAGCTAAGAGGCTTCACTCCTTTAGCTTTTGCACATGAACCATTGGATTTCTCATCGCACTGGGAACACATCGACAACTTCGAATGCGGAGCTGAACACCGTGCTTATCGCATAACGGTTGCTGCTACTAAAATTTCGAATATCGCCAAGGATTATCCCGAGTGGATCATTCATGACAAAGTAGAACAAAATGAACTTCCAGATAAGAAAGAACTGGAAGATGAAGAAGTCATTCTTTTCAAGCCCCTCACGAGATACAATTCGGCACCAATCTCTATTGCAGGGAGCGATGAAGCGTCACCGAAAAGCACAGAGGCTCAGACTATATCTTCTGATGAATGTTTGAGGCGTGCCACGTCACTACTTATACAACAGAC
GCAGGGCCAGACCGATCCCTTTGCTTTTCATAATCTCAGCAGAAACAAGCCATTTGAGCAGCAGCATGATGTTTCAGAAGGCACCATATCAACTGGCCCTCCTTCACTTAGTGCCTGGGTGATCAATAAAGGTTTTACTTTTAACCCTGTTACAGAGAAAGGGCCTGGTTTGCAGCCCATTGATGAGTTAACTCTGGCATTCGTGAACAGTCTTAAACTCGACGATACTGAGAATTCTGCGTCGATTCCGAGCTCGAAATCTGGAAAATCCGACCTTTTTCCACCTCCTCCCTATTCCACCCCAGTACCTTCAGCTCCTTATTTACCTGATGATGCTGTTTGGACTAATGCTACCAATGCTAACATCTCTAGAAACATTGACCAAAATGATACATTTTCAGGAAGTGCTTATTCAAATTGGACTGCCCCTCAAGCTACATATGAACATGGTCCCATGATTGGTGGTCTTACGAATATGTATCCGTCGTCGCATCGAATGAGTTCTTCGGAATGGCTTCGTCAATATCGGGAGAATCAAGTACGGGCACCTCCCTACAATGCTTCTGGAAACGTTATGAACTTGCAAAGAAATGATACTTCAAGGTATGAACATTTGTATCCAACAATGAATATGGAGAGTCCATTGCGTTATCCAGCTTTCCCTGCAGCTTTCCCTGCAGCTTTCCCTGCAGCTTACAGCACGAATGAGAACCAAAAAAACATGTTTTTCCATGGTTATGAAAGGCCAAACCTGTATGGCTGCGGTGTTATTGATTTGAGAAGTGAGCAGCCACCGGTTCTGATGTATCTAAAAGATAAAGAGTGGCAGCTGCAAAAGGATGCTGCTAATAGAAGTGCTGCCTATATGGGGAATTGAGGATTTTCGTAGTTTTGTCTATATTCATCGTTTACTCGAACCGAAAACTCGAAACTGTCTAATGAAAGCGTTTGCGTTTGTATATACGTGTATAGAAAGTCGAAAAAATTATCTCAAGTAAACGTGTATTAAGGAATTGTCTCGCCTCGACATTGTAGAAGATCGTTTCCTTTTCTTAAAGATGTGTGAAAGCGGACCTTCATGTTTGGTGATATAATGTGTCCTTGAGTTTCATTTACGACTGGCCTAAAGTTAAGTCACGTTCAATGAAATCTAGTTTCATTAA

Coding sequence (CDS)

ATGGCTACCTCTGCAAATCAAAGCAAAAAGGAGAGCTTGCTTAATGAGGTTGTGAGCTTGGAAAAGCAGTTAACAGCATCAATCCTTTCCAAAGGGATATTACATTCAGATGTCAAAGATCTATACTACAAAGTTTGTTCAATTTACGAGAGAATTTTCCTTAATGAACATGAACAATTAGAGCTTCAAGATGTTGAATATTCTCTATGGAAGCTTCATTACAAGCTAATCGACGAGTTTCGGAAGCGGATAAAGAGAAGCTCTGCAAATGTAGAGAGCCCAAAGTTGGGGACAGGACAACATTCTAATGATGCACAGCGAAGTAGTAGCAATCATATTGCAGAATTTAGGTTGTTTCTCATGGAAGCAACAAAGTTCTATCAGAAACTGATATTGAAAATCAGGGAGTATAATGGTGTTCAAAAAGAAGGCTTGTTATATAAGGCTTTTGGTGTCTCTAAAGGCATTGATCCAAAGAGAAAGAAGACATGTCAATTCTTATGTCACCGTCTCTTAGTTTGCCTTGGGGATCTTTCTAGTTCTGCAGTCAAAGAGCCTTTTCCAGATGCTTGGGACAACCTTTTGTTACTATTTGAAAGAAATAGGTCATCTCTTATGCCTTCCCTATCTAGGGACGCCGAGTTCGATTTCTTGAGACCGTCCGAGAAGTGCTGTTTAGAAATCAAATCACAAACCAAAGATGATCGCAAGTCTCTAGAGACCGACTTGTTTTCTCTGCTCATCAGAACATTGGCTTTCTTTTTCATAAATTCGAGTTTGGAGCAATTCACAAGCACATTTTCATCTATGATGAGATCGCTGGATGAACTCTTGTCTCTAGATGATTCTGAATTAAACGTTTCATTAGAGTCTTTGGAGCAATTCACAAGCACATTTTCATCTATGATGAGATCGCTGGATGAACTCTTGTCTCTAGATGATTCTGAATTAAACGTTTCATTAGAGTCGTACGAACTTTTGGATTCAGTGCGAACCGGCCCTTTCCGAGCCATCCAAATTGCTTCCATATTCATCTTCATGGTACAGAATCTTCTCAGTAAAGCTAACCTGAATGATCTGCAGCAACTTGAGCTAACCCACTTGGCATTGGTTGCTACCTTTATTGTCATGGGACGTCTAGTCGAGAGATCGCTGAAGACGAGCCAATGGGAATCATCCCCTCTTTTACCTGCAGTGCTCGTTTTCGTGGAATGGTTACCAAGCGTTCTTGATGAAGTAGTAAGATATGGTTCTGATGCAAAAACTAGAAGCTCCATGTCATACTTTTTTGGTGCTTTTGTTGAGCTTGTACAGAAGCTGAATGTTCATACAGCTGAGGCACATTGTTCTCTTGCTATCCCTCTGTGGGAAGATTATGAGCTAAGAGGCTTCACTCCTTTAGCTTTTGCACATGAACCATTGGATTTCTCATCGCACTGGGAACACATCGACAACTTCGAATGCGGAGCTGAACACCGTGCTTATCGCATAACGGTTGCTGCTACTAAAATTTCGAATATCGCCAAGGATTATCCCGAGTGGATCATTCATGACAAAGTAGAACAAAATGAACTTCCAGATAAGAAAGAACTGGAAGATGAAGAAGTCATTCTTTTCAAGCCCCTCACGAGATACAATTCGGCACCAATCTCTATTGCAGGGAGCGATGAAGCGTCACCGAAAAGCACAGAGGCTCAGACTATATCTTCTGATGAATGTTTGAGGCGTGCCACGTCACTACTTATACAACAGACGCAGGGCCAGACCGATCCCTTTGCTTTTCATAATCTCAGCAGAAACAAGCCATTTGAGCAGCAGCATGATGTTTCAGAAGGCACCATATCAACTGGCCCTCCTTCACTTAGTGCCTGGGTGATCAATAAAGGTTTTACTTTTAACCCTGTTACAGAGAAAGGGCCTGGTTTGCAGCCCATTGATGAGTTAACTCTGGCATTCGTGAACAGTCTTAAACTCGACGATACTGAGAATTCTGCGTCGAT
TCCGAGCTCGAAATCTGGAAAATCCGACCTTTTTCCACCTCCTCCCTATTCCACCCCAGTACCTTCAGCTCCTTATTTACCTGATGATGCTGTTTGGACTAATGCTACCAATGCTAACATCTCTAGAAACATTGACCAAAATGATACATTTTCAGGAAGTGCTTATTCAAATTGGACTGCCCCTCAAGCTACATATGAACATGGTCCCATGATTGGTGGTCTTACGAATATGTATCCGTCGTCGCATCGAATGAGTTCTTCGGAATGGCTTCGTCAATATCGGGAGAATCAAGTACGGGCACCTCCCTACAATGCTTCTGGAAACGTTATGAACTTGCAAAGAAATGATACTTCAAGGTATGAACATTTGTATCCAACAATGAATATGGAGAGTCCATTGCGTTATCCAGCTTTCCCTGCAGCTTTCCCTGCAGCTTTCCCTGCAGCTTACAGCACGAATGAGAACCAAAAAAACATGTTTTTCCATGGTTATGAAAGGCCAAACCTGTATGGCTGCGGTGTTATTGATTTGAGAAGTGAGCAGCCACCGGTTCTGATGTATCTAAAAGATAAAGAGTGGCAGCTGCAAAAGGATGCTGCTAATAGAAGTGCTGCCTATATGGGGAATTGA

Protein sequence

MATSANQSKKESLLNEVVSLEKQLTASILSKGILHSDVKDLYYKVCSIYERIFLNEHEQLELQDVEYSLWKLHYKLIDEFRKRIKRSSANVESPKLGTGQHSNDAQRSSSNHIAEFRLFLMEATKFYQKLILKIREYNGVQKEGLLYKAFGVSKGIDPKRKKTCQFLCHRLLVCLGDLSSSAVKEPFPDAWDNLLLLFERNRSSLMPSLSRDAEFDFLRPSEKCCLEIKSQTKDDRKSLETDLFSLLIRTLAFFFINSSLEQFTSTFSSMMRSLDELLSLDDSELNVSLESLEQFTSTFSSMMRSLDELLSLDDSELNVSLESYELLDSVRTGPFRAIQIASIFIFMVQNLLSKANLNDLQQLELTHLALVATFIVMGRLVERSLKTSQWESSPLLPAVLVFVEWLPSVLDEVVRYGSDAKTRSSMSYFFGAFVELVQKLNVHTAEAHCSLAIPLWEDYELRGFTPLAFAHEPLDFSSHWEHIDNFECGAEHRAYRITVAATKISNIAKDYPEWIIHDKVEQNELPDKKELEDEEVILFKPLTRYNSAPISIAGSDEASPKSTEAQTISSDECLRRATSLLIQQTQGQTDPFAFHNLSRNKPFEQQHDVSEGTISTGPPSLSAWVINKGFTFNPVTEKGPGLQPIDELTLAFVNSLKLDDTENSASIPSSKSGKSDLFPPPPYSTPVPSAPYLPDDAVWTNATNANISRNIDQNDTFSGSAYSNWTAPQATYEHGPMIGGLTNMYPSSHRMSSSEWLRQYRENQVRAPPYNASGNVMNLQRNDTSRYEHLYPTMNMESPLRYPAFPAAFPAAFPAAYSTNENQKNMFFHGYERPNLYGCGVIDLRSEQPPVLMYLKDKEWQLQKDAANRSAAYMGN
BLAST of Cp4.1LG09g08350 vs. Swiss-Prot
Match: SMG7L_ARATH (Protein SMG7L OS=Arabidopsis thaliana GN=SMG7L PE=2 SV=1)

HSP 1 Score: 323.9 bits (829), Expect = 5.3e-87
Identity = 291/979 (29.72%), Positives = 438/979 (44.74%), Query Frame = 1

Query: 2   ATSANQSKKESLLNEVVSLEKQLTASILSKGILHSDVKDLYYKVCSIYERIFLNEHEQLE 61
           A SA+Q +K + L EV ++EKQL   I SK ILH+DV +LY K  S YE+IF +  +  E
Sbjct: 3   ANSADQKQKPNFLVEVNNIEKQLWTLIHSKTILHTDVSELYAKAGSTYEQIFKSNLQHEE 62

Query: 62  LQDVEYSLWKLHYKLIDEFRKRIKRSSANVESPKLGTGQHSNDAQRSSSNHIAEFRLFLM 121
           LQ+VE+ LWKLHYK IDEFRK +K            T  H        + H+  F+LFL 
Sbjct: 63  LQEVEFCLWKLHYKHIDEFRKGLK------------TNDH--------AKHMKAFKLFLS 122

Query: 122 EATKFYQKLILKIREYNGVQKEGLLYKAFGVSKGIDPKRKKTCQFLCHRLLVCLGDL--- 181
           +A +FYQ LI K+R Y         Y       G     ++  +FLCHR  +CLGDL   
Sbjct: 123 KAAEFYQNLISKVRGY---------YHRLSEESG-----EQKSRFLCHRFYICLGDLQRY 182

Query: 182 -------------SSSA-----VKEPFPDAWD---------------------------- 241
                        S++A       + +PD+ +                            
Sbjct: 183 QEQYLKAHEHPNWSTAATYYLEAAKSWPDSGNPHNQLAVLATYVSDELLALYHCVRSLAV 242

Query: 242 ---------NLLLLFERNRSSLMPSLSRDAEFDFLRPSEKCCLEIKSQTKDDRKS---LE 301
                    NLLLLFE+NRSS + SLS DAEF++L PSEK  + +K +     K      
Sbjct: 243 KEPFPGASNNLLLLFEKNRSSPLQSLSTDAEFNYLNPSEK-KVSVKERDLSKAKGELVAG 302

Query: 302 TDLFSLLIRTLAFFFINSSLEQFTSTFSSMMRSLDELLSLDDSELNVSLESLEQFTSTFS 361
            DL+ L++RT +FFF+ S                                S ++F   F+
Sbjct: 303 IDLWPLVVRTTSFFFLKS--------------------------------SFDEFGRAFA 362

Query: 362 SMMRSLDELLSLDDSELNVSLESYELLDSVRTGPFRAIQIASIFIFMVQNLLSKANLNDL 421
           S +R LD   + DD  L   LESY+ +D+ R GP++ +QI ++FI++  N L++AN +D+
Sbjct: 363 STIRELDAAFAADDRNLEAMLESYQFMDTARKGPYKILQIVAVFIYIFHN-LAEANGSDI 422

Query: 422 --QQLELTHLALVATFIVMGRLVERSLKTSQWESSPLLPAVLVFVEWLPSVLDEVVRYGS 481
             ++++LT+LAL   FIVMGR+VER LKT+  +S PLLPA+LVF+++LP +LD+V     
Sbjct: 423 VKEEVKLTNLALTMVFIVMGRVVERCLKTTPLDSCPLLPALLVFLDYLPFLLDKVEEEEE 482

Query: 482 ----DAKTRSSMSYFFGAFVELVQKLNVHTAEAHCSLAIPLWEDYELRGFTPLAFAHEPL 541
               D K++S++SYFFG  V+++ +L V          + LWED+EL+   PLA  H  L
Sbjct: 483 ECRFDEKSKSAISYFFGKLVDILNQLKVKDKNCPAKTLLALWEDHELKSLAPLAPIHALL 542

Query: 542 DFSSHWEHIDNFECGAEHRAYRITVAATKI-SNIAKDYPEWIIHDKVEQNELPDKKELED 601
           DFSS+ +  ++F+ G E R  RI  +A  I +   K   +W+  D    +      EL+ 
Sbjct: 543 DFSSNMDLRESFDRGKELRLQRIISSAIDITTRQKKGSQKWLFFDNQRTHFYTTSGELQS 602

Query: 602 EEVILFKPLTRYNSAPISIAGSDEASPKSTEAQTISSDECLRRATSLLIQQTQGQTDPFA 661
              +        N   ++I G  E  P   E      +E       LL    + Q+ P  
Sbjct: 603 NGELFHGNGEGRNRKCVTI-GPVEIIPLENERSVPVEEE----EVILLKPLVRCQSAPIY 662

Query: 662 FHNLSRNKPFEQQHDVSEGTISTGPPSLS---AWVINKGFTFNP-VTEKGPGLQPIDELT 721
              ++  KP       S    +T   SL    + + ++ F+F   + +  P    ++E T
Sbjct: 663 SSGIAA-KPLSSDCTTSGNQTTTSNDSLRRTLSLIGSESFSFTQGLKDTDPQHLHLEEGT 722

Query: 722 LA-FVNSLK-----------------------LDDTENSASIPSSKSGKSDLFPPPPYST 781
           ++    SL                        +D+T   ++  S     S   P   YS 
Sbjct: 723 VSGRPPSLSAWVVDKNKEKGRLGLSKPNGLGPIDETGPVSAFDSLSINSSTEHPASSYSP 782

Query: 782 PVPSAPYLPDDAVWTNATNANISRNIDQNDTFSGSAYSNWTAPQATYEHGPMIGGLTNMY 841
           P PSAP LP+DA W +    N +        +  + Y         Y + P +G      
Sbjct: 783 PTPSAPLLPEDASWFH----NDASTNKAESFYDQTRYMELPGIMKPYTNPPFVG------ 842

Query: 842 PSSHRMSSSEWLRQYRENQVRAPPYN----ASGNVMNLQRNDTSRYEHL--YPTMNMESP 877
                +SSSEWLR+YRE++   P Y+     + N+ N   + +S++  L  Y T N  S 
Sbjct: 843 -----ISSSEWLRRYRESRNLGPAYSYQAQGTNNLRNFMAHGSSKFSLLARYGTPNDSS- 880
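The percentages on an HSP summary line are simply the stated counts divided by the alignment length. A minimal sketch for re-deriving them is shown below; `check_hsp_line` is a hypothetical helper, and the pattern deliberately tolerates spelling variants of the "Positives" label.

```python
import re

# Matches e.g. "Identity = 291/979 (29.72%)"; accepts "Positives" or "Postives".
HSP_RE = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")

def check_hsp_line(line):
    """Recompute each percentage and report whether it matches the stated value."""
    results = {}
    for label, num, den, pct in HSP_RE.findall(line):
        key = "Identity" if label == "Identity" else "Positives"
        computed = round(100 * int(num) / int(den), 2)
        results[key] = (computed, abs(computed - float(pct)) < 0.01)
    return results

line = "Identity = 291/979 (29.72%), Positives = 438/979 (44.74%), Query Frame = 1"
print(check_hsp_line(line))
```

For the SMG7L_ARATH hit above, 291/979 and 438/979 reproduce the reported 29.72% identity and 44.74% positives.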

BLAST of Cp4.1LG09g08350 vs. Swiss-Prot
Match: SMG7_ARATH (Protein SMG7 OS=Arabidopsis thaliana GN=SMG7 PE=2 SV=1)

HSP 1 Score: 65.5 bits (158), Expect = 3.4e-09
Identity = 46/135 (34.07%), Positives = 65/135 (48.15%), Query Frame = 1

Query: 49  YERIFLNEHEQLELQDVEYSLWKLHYKLIDEFRKRIKRSSANVESPKLGTGQHSNDAQRS 108
           YE I L  H   E  ++E  LW+LHYK I+ FR  I R  A+  S      +  + A++ 
Sbjct: 51  YEAIILESHTFSEQHNIEIPLWQLHYKRIEYFRLHINRVLASSTSTAAQNVKGPSKAEQI 110

Query: 109 SSNHIAEFRLFLMEATKFYQKLILKIREYNGVQ----KEGLLYKAFGVSKGIDPKRKKTC 168
           +   + +FR FL EAT FY  +ILKIR   G+      E    +      G +    +  
Sbjct: 111 AQLKL-QFRTFLSEATGFYHDMILKIRSKYGLPLGSFSEDQQSQNLSDKDGKELAEVQKA 170

Query: 169 QFLCHRLLVCLGDLS 180
              CHR L+ LGDL+
Sbjct: 171 LKSCHRCLIYLGDLA 184

BLAST of Cp4.1LG09g08350 vs. TrEMBL
Match: A0A0A0LSD4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G074930 PE=4 SV=1)

HSP 1 Score: 724.2 bits (1868), Expect = 1.9e-205
Identity = 394/604 (65.23%), Positives = 443/604 (73.34%), Query Frame = 1

Query: 4   SANQSKKESLLNEVVSLEKQLTASILSKGILHSDVKDLYYKVCSIYERIFLNEHEQLELQ 63
           + +Q++KE+LL+EVVSLEKQLT SILSKGILHSDV DLYYKVCSIYE+IF +EHEQ+ELQ
Sbjct: 3   ATSQNRKENLLHEVVSLEKQLTTSILSKGILHSDVNDLYYKVCSIYEKIFTSEHEQVELQ 62

Query: 64  DVEYSLWKLHYKLIDEFRKRIKRSSANVESPKLGTGQHSNDAQRSSSNHIAEFRLFLMEA 123
           DVEYSLWKLHYKLIDEFRKRIKRSS N  SPKLGT Q  N+ QRS+SNHIAEFRLFL+EA
Sbjct: 63  DVEYSLWKLHYKLIDEFRKRIKRSSGNGGSPKLGTTQSPNNVQRSNSNHIAEFRLFLLEA 122

Query: 124 TKFYQKLILKIREYNGVQKEGLLYKAFGVSKGIDPKRKKTCQFLCHRLLVCLGDLS---- 183
           TKFYQ LILKIREY GV  EGLLYKAF V+KGIDPK+KK CQFLCHRLL+CLGDL+    
Sbjct: 123 TKFYQILILKIREYYGVPNEGLLYKAFSVAKGIDPKKKKKCQFLCHRLLICLGDLARYVE 182

Query: 184 ------------SSAVKEPF------PDAW--------------DNLL------------ 243
                       ++A    F      PD+               D  L            
Sbjct: 183 QHEKLDVYSHKWAAAATHYFEATMVWPDSGNPHNQLAVLATYVNDQFLAMYHCVRSSAVK 242

Query: 244 -----------LLFERNRSSLMPSLSRDAEFDFLRPSEKCCLEIKSQTKDDRKSLETDLF 303
                      LLFERNRSSL+PSLS D +F+FLRPSEKCC EIKSQ KDD KSLETDLF
Sbjct: 243 EPFPDAWDNLILLFERNRSSLLPSLSGDGQFNFLRPSEKCCFEIKSQIKDDNKSLETDLF 302

Query: 304 SLLIRTLAFFFINSSLEQFTSTFSSMMRSLDELLSLDDSELNVSLESLEQFTSTFSSMMR 363
           SLLIRTL FFFINS                                SLE+FTS FSSMMR
Sbjct: 303 SLLIRTLGFFFINS--------------------------------SLEEFTSAFSSMMR 362

Query: 364 SLDELLSLDDSELNVSLESYELLDSVRTGPFRAIQIASIFIFMVQNLLSKANLNDLQQLE 423
            LDE LSLDDSELN SLESY+LLDSVRTGPFRAIQIAS+FIFMVQN  SK +LND QQ+E
Sbjct: 363 WLDEFLSLDDSELNASLESYKLLDSVRTGPFRAIQIASVFIFMVQNRFSKVDLNDKQQIE 422

Query: 424 LTHLALVATFIVMGRLVERSLKTSQWESSPLLPAVLVFVEWLPSVLDEVVRYGSDAKTRS 483
           LT LALV TFI MGRLVER L+ S+ +S PLLPAVL+FVEWLP+VLDEVVRYG D K+R+
Sbjct: 423 LTQLALVVTFIAMGRLVERCLEASKLDSFPLLPAVLIFVEWLPNVLDEVVRYGDDEKSRN 482

Query: 484 SMSYFFGAFVELVQKLNVHTAEAHCSLAIPLWEDYELRGFTPLAFAHEPLDFSSHWEHID 542
           SM+YFFG +V L+++LNV+  EA CSLAIPLWEDYELRGFTPLAF+H+PLDFSSHWEH+D
Sbjct: 483 SMTYFFGVYVGLLERLNVNKVEAQCSLAIPLWEDYELRGFTPLAFSHKPLDFSSHWEHMD 542

BLAST of Cp4.1LG09g08350 vs. TrEMBL
Match: W9RYX4_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_014497 PE=4 SV=1)

HSP 1 Score: 461.5 bits (1186), Expect = 2.4e-126
Identity = 318/798 (39.85%), Positives = 421/798 (52.76%), Query Frame = 1

Query: 178 LSSSAVKEPFPDAWDNLLLLFERNRSSLMPSLSRDAEFDFLRPSEKCCLEIKSQTKD--- 237
           + S AVKEPFPDAWDNLLLL ERNRSS + SLS +A+F+F++P E+   +  S++ D   
Sbjct: 177 IRSLAVKEPFPDAWDNLLLLLERNRSSPLQSLSSEAQFNFIKPYERSITKTNSKSIDHSS 236

Query: 238 --DRKSLETDLFSLLIRTLAFFFINSSLEQFTSTFSSMMRSLDELLSLDDSELNVSLESL 297
             +  S  TD +SL IR ++FF +  SL++F S F+S+MR LD LL+LDD+EL  SLES 
Sbjct: 237 CRNNGSAATDFWSLFIRIISFFVVKPSLDEFPSAFTSVMRGLDALLALDDTELKASLES- 296

Query: 298 EQFTSTFSSMMRSLDELLSLDDSELNVSLESYELLDSVRTGPFRAIQIASIFIFMVQNLL 357
                                          Y+ +DS++ GPFRA+Q+ SIF++ +Q+L+
Sbjct: 297 -------------------------------YQHMDSIKAGPFRALQVVSIFLYTLQSLI 356

Query: 358 SKANL------NDLQQLELTHLALVATFIVMGRLVER--SLKTSQWESSPLLPAVLVFVE 417
           +   +      +D Q + L  LAL + FI MGR VER   LK     S PLLPAVLVFVE
Sbjct: 357 NCPQIKHFEEMSDTQLILLRQLALTSLFIFMGRFVERCLKLKAGALSSCPLLPAVLVFVE 416

Query: 418 WLPSVLDEVVRYGSDAKTRSSMSYFFGAFVELVQKLNVHTAEAHCSLAIPLWEDYELRGF 477
           WL ++L+E  +YG D ++ S+MSYFF +FV L+ +L  +  E + S++ PLWEDYELRGF
Sbjct: 417 WLATMLNEAEKYGVDRRSSSAMSYFFESFVALLNRLGANNNEGNTSVSTPLWEDYELRGF 476

Query: 478 TPLAFAHEPLDFSSHWEHIDNF--------------------------------ECGAEH 537
            P+  AHE L FSSHWEHIDNF                                + G   
Sbjct: 477 APVTRAHESLYFSSHWEHIDNFEEGTKSRCRRIRNAGLKIANRSNDSQKWIIYDQSGGNF 536

Query: 538 RAYRITVAATKI--------SNIAKDYPEWIIHDKVEQNELPDKKE----------LEDE 597
           R+  I   A +         S++  D  +    + VE+ E P  +E          +E+E
Sbjct: 537 RSVPINSNAAEFNENVESISSDLKTDASDQNFCEGVEEFEGPILEENPSVNGKSVTVEEE 596

Query: 598 EVILFKPLTRYNSAPISIAGSDEASPKSTEAQTISSDECLRRATSLLIQQTQG------- 657
           EVILFKPLTRYNSAP+    ++  SPK  E Q    D+CLRRATSLLI Q Q        
Sbjct: 597 EVILFKPLTRYNSAPLCTNSNEPTSPKEMEEQAAPPDDCLRRATSLLIAQNQAQGGTTFM 656

Query: 658 QTDPFAFHNLSRNKPFEQQHDV-SEGT---------ISTGPPSLSAWVINKGFTFNPVTE 717
           QTD     N   NKPF+QQ  V  E T         IS+GPPSLSAWV+ +G   N   +
Sbjct: 657 QTD---ISNFRHNKPFKQQELVFKEATMLPPFPDTLISSGPPSLSAWVLERGGLINNKEK 716

Query: 718 KGPG-----LQPIDELTLAFVNSLKLDDTENSASIPSSKSGKSDLFPPPPYSTPVPSAPY 777
              G     L PI+E+    +  L +   ++S+    S    +  +   PYS P PSAP 
Sbjct: 717 AASGIHKHILNPIEEMASESLCGLSITQNQDSS---RSHDFLATHYSSSPYSAPTPSAPL 776

Query: 778 LPDDAVWTNATNANI--SRNIDQNDTFS-----GSAYSNWTAPQA-TYEHG-PMIGGLTN 837
           LPDDA W     + +  S  I+  +T S      S+Y NW A Q  T  +G   I GL  
Sbjct: 777 LPDDAAWFTGLQSRLQPSEGINGTETLSNASQGNSSYPNWNATQGPTDLYGLSSIPGLAV 836

Query: 838 MYPSSHRMSSSEWLRQYRENQVRAPP--YNASGNVMNLQRNDTSRYEHLYPTMNMESPLR 877
            Y    RM+SSEWLRQYREN     P  + A GN+ N      S      PT++MESP  
Sbjct: 837 NYTPQRRMTSSEWLRQYRENHAWPWPSYFYAPGNIGNSDNPLASN-----PTVHMESPPL 896

BLAST of Cp4.1LG09g08350 vs. TrEMBL
Match: A0A067GD48_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g001829mg PE=4 SV=1)

HSP 1 Score: 426.0 bits (1094), Expect = 1.1e-115
Identity = 249/548 (45.44%), Positives = 334/548 (60.95%), Query Frame = 1

Query: 10  KESLLNEVVSLEKQLTASILSKGILHSDVKDLYYKVCSIYERIFLNEHEQLELQDVEYSL 69
           K +LL EV + +KQL   I SKG+L  +V++LY++VCS YE+I LN+++Q ELQDVEYSL
Sbjct: 15  KPNLLVEVANTDKQLVTLIHSKGLLCPEVQELYHRVCSSYEKILLNDYDQAELQDVEYSL 74

Query: 70  WKLHYKLIDEFRKRIKRSSANVESPKLGTGQHSNDAQRSSSNHIAEFRLFLMEATKFYQK 129
           WKLHY+ IDEFRKRIK+SS +  +      Q   + QRSS NHI  F+ FL EA  FY  
Sbjct: 75  WKLHYRHIDEFRKRIKKSSVSDNTMP----QSGANVQRSSDNHIEGFKSFLSEAMAFYHN 134

Query: 130 LILKIREYNGVQKEGLLYKAFGVSKGIDPKRKKTCQFLCHRLLVCLGDLSSSAVKEPFPD 189
           L++KI+ Y G+ +E    K   +S  ++P +K+  QFLCHR LVCLGDL  +  KE + +
Sbjct: 135 LVVKIKRYYGLPEESSFAKEGYMSTTLEPNKKQKYQFLCHRFLVCLGDL--ARYKEQYEN 194

Query: 190 --------------------AWDNLLLLFERNRSSLMPSLSRDAEFDFLRPSEKCCLEIK 249
                                W +      +NRSS + SLS +A FD  +PSE+   +IK
Sbjct: 195 FGAQEHNWSVAVSHYLEATMIWPDSGNPQNQNRSSDLHSLSMEAHFDISKPSERSSNQIK 254

Query: 250 SQTKDDRKSL-----------ETDLFSLLIRTLAFFFINSSLEQFTSTFSSMMRSLDELL 309
           SQ++D   +            ET+L+SL+IRT++FFFI S                    
Sbjct: 255 SQSRDGFSNCNMLKAEHDCFKETNLWSLIIRTISFFFIKS-------------------- 314

Query: 310 SLDDSELNVSLESLEQFTSTFSSMMRSLDELLSLDDSELNVSLESYELLDSVRTGPFRAI 369
                       SLE F  TF+S MR LD  + LDD++L   LESY+L+DS RTGPFRA+
Sbjct: 315 ------------SLEDFPYTFASTMRELDAAMELDDAKLKALLESYQLMDSARTGPFRAL 374

Query: 370 QIASIFIFMVQNLLSKANL------NDLQQLELTHLALVATFIVMGRLVERSLKTSQWES 429
           Q+ SIFIF ++NL++   +      ND+QQLE    AL ATFI MGRLVER LK++  +S
Sbjct: 375 QVVSIFIFTIENLINAPEIKGSKDKNDMQQLEFIRWALSATFIFMGRLVERCLKSNSLDS 434

Query: 430 SPLLPAVLVFVEWLPSVLDEVVRYGSDAKTRSSMSYFFGAFVELVQKLNVHTAEAHCSLA 489
           SPLL +VLVFVEWL  +L++   Y SD K+RS+MSYFFGAFV L+++LN   +E      
Sbjct: 435 SPLLSSVLVFVEWLVGILEQAESYASDGKSRSAMSYFFGAFVGLLKQLNAR-SEVSSPKK 494

Query: 490 IPLWEDYELRGFTPLAFAHEPLDFSSHWEHIDNFECGAEHRAYRITVAATKISNIAKDYP 521
             LWEDYELRGF P+  +H+ LDFS H+ HI +FE G E RA R+  AA KI+N +    
Sbjct: 495 TALWEDYELRGFAPVLCSHQSLDFSVHFGHIKSFEAGIESRADRVINAAMKIANRSNGSQ 523

BLAST of Cp4.1LG09g08350 vs. TrEMBL
Match: A0A0B0P381_GOSAR (Telomerase-binding EST1A OS=Gossypium arboreum GN=F383_23932 PE=4 SV=1)

HSP 1 Score: 399.8 bits (1026), Expect = 8.4e-108
Identity = 251/613 (40.95%), Positives = 339/613 (55.30%), Query Frame = 1

Query: 6   NQSKKESLLNEVVSLEKQLTASILSKGILHSDVKDLYYKVCSIYERIFLNEHEQLELQDV 65
           +Q  + S L E+ + EK L   I +KG+LHSDV+DLY+KVC  YE  FL++HE  ELQDV
Sbjct: 86  DQKARASFLLEIANTEKHLWVLIHTKGLLHSDVRDLYHKVCLNYESFFLDDHELTELQDV 145

Query: 66  EYSLWKLHYKLIDEFRKRIKRSSANVESPKLGTGQHSNDAQRSSSNHIAEFRLFLMEATK 125
           EYSLWKLHYK IDEFRKR KRSSAN ES     G   +D     + +I  F+ FL++AT+
Sbjct: 146 EYSLWKLHYKHIDEFRKRTKRSSANSESTMCAMGSSGSD-----NRYIDGFKSFLLKATE 205

Query: 126 FYQKLILKIREYNGVQKEGLLYKAFGVSKGIDPKRKKTCQFLCHRLLVCLGDLS------ 185
           FY+KLI K+R + G+ +E    +  G++  I+P + + C FLCHR LVCLGDL+      
Sbjct: 206 FYKKLIEKLRSHYGLPEESSSSRRGGINASIEPVKLRKCHFLCHRFLVCLGDLARYMEQV 265

Query: 186 --SSAVKE--------------PFPDA--------------WDNLL-------------- 245
             SS +K                +PD+               D  L              
Sbjct: 266 EQSSVLKHNWSVAAAYYLEAAMVWPDSGNPQNQLAVLATYVGDEFLALYHCIRSLAVKEP 325

Query: 246 ---------LLFERNRSSLMPSLSRDAEFDFLRPSEKCCLEIKSQTK-----------DD 305
                    LLFERNRS  +PSLS + +FDFL+P E+   ++K Q+            ++
Sbjct: 326 FPDAWNNLVLLFERNRSCDLPSLSSEEQFDFLQPFERSDSQVKLQSSEKVSDGVLLKGEN 385

Query: 306 RKSLETDLFSLLIRTLAFFFINSSLEQFTSTFSSMMRSLDELLSLDDSELNVSLESLEQF 365
             S   + + LLIRTL+FFF+ S                                SLE F
Sbjct: 386 DHSAGMNFWLLLIRTLSFFFLKS--------------------------------SLEDF 445

Query: 366 TSTFSSMMRSLDELLSLDDSELNVSLESYELLDSVRTGPFRAIQIASIFIFMVQNLLSKA 425
              F+S MR LD +++LDD +L   LESY+L+DS RTGPFR +Q  S+FIF+  NL +  
Sbjct: 446 PCAFASTMRVLDVMMALDDIKLRAMLESYQLMDSARTGPFRVLQAVSVFIFVFHNLNNNL 505

Query: 426 NL------NDLQQLELTHLALVATFIVMGRLVERSLKTSQWESSPLLPAVLVFVEWLPSV 485
            L       + Q LEL   AL ATFI MGR+V R L+ +   S PLLPA+LVFVEWL S+
Sbjct: 506 ELPGSKDGKNKQHLELIQFALNATFIFMGRVVNRCLRANSLNSCPLLPAILVFVEWLASM 565

Query: 486 LDEVVRYGSDAKTRSSMSYFFGAFVELVQKLNVHTAEAHCSLAIPLWEDYELRGFTPLAF 536
            DEV  YG D KT+SS+SYF  AF++L+++L+V+  E    + I LWEDYELRGF PLA 
Sbjct: 566 FDEVEAYGVDEKTKSSISYFLAAFMDLLKQLDVN-VEIVSDVRIALWEDYELRGFAPLAQ 625

BLAST of Cp4.1LG09g08350 vs. TrEMBL
Match: A0A0D2VAS6_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_010G155000 PE=4 SV=1)

HSP 1 Score: 397.9 bits (1021), Expect = 3.2e-107
Identity = 253/613 (41.27%), Positives = 340/613 (55.46%), Query Frame = 1

Query: 6   NQSKKESLLNEVVSLEKQLTASILSKGILHSDVKDLYYKVCSIYERIFLNEHEQLELQDV 65
           +Q  K + L E+ + EK L   I +KG+LHSDV+DLY+KVC  YE  FL++HE  ELQDV
Sbjct: 11  DQKAKANFLLEIANTEKHLWVLIHTKGLLHSDVRDLYHKVCLNYESFFLDDHELTELQDV 70

Query: 66  EYSLWKLHYKLIDEFRKRIKRSSANVESPKLGTGQHSNDAQRSSSNHIAEFRLFLMEATK 125
           EYSLWKLHYK IDEFRKR KRSSAN ES     G   +D     + +I  F+ FL++AT+
Sbjct: 71  EYSLWKLHYKHIDEFRKRTKRSSANSESTMSAMGSIGSD-----NRYIDGFKSFLLKATE 130

Query: 126 FYQKLILKIREYNGVQKEGLLYKAFGVSKGIDPKRKKTCQFLCHRLLVCLGDLS------ 185
           FY+KLI K+R + G+ +E    K  G++  I+P + + C FLCHR LVCLGDL+      
Sbjct: 131 FYKKLIEKLRSHYGLPEESSSSKRGGINASIEPVKLRKCHFLCHRFLVCLGDLARYMEQV 190

Query: 186 --SSAVKE--------------PFPDA--------------WDNLL-------------- 245
             SS +K                +PD+               D  L              
Sbjct: 191 EQSSVLKHNWSVAAAYYLEAAMVWPDSGNPQNQLAVLATYVGDEFLALYHCIRSLAVKEP 250

Query: 246 ---------LLFERNRSSLMPSLSRDAEFDFLRPSEKCCLEIKSQTK-----------DD 305
                    LLFERNRS  +PSLS + +FDFL+P E+   ++K Q+            ++
Sbjct: 251 FPDAWNNLVLLFERNRSCDLPSLSSEEQFDFLQPFERSGSQVKLQSSEKVSDGVPLKGEN 310

Query: 306 RKSLETDLFSLLIRTLAFFFINSSLEQFTSTFSSMMRSLDELLSLDDSELNVSLESLEQF 365
             S   + + LLIR L+FFF+ S                                SLE F
Sbjct: 311 DHSEGMNFWLLLIRMLSFFFLKS--------------------------------SLEDF 370

Query: 366 TSTFSSMMRSLDELLSLDDSELNVSLESYELLDSVRTGPFRAIQIASIFIFMVQNLLSKA 425
              F+S MR LD +++LDD +L   LESY+L+DS RTGPFR +Q  S+FIF+  NL +  
Sbjct: 371 PCAFASTMRVLDVMMALDDIKLRAMLESYQLMDSARTGPFRVLQAVSVFIFVFHNLNNNP 430

Query: 426 NL------NDLQQLELTHLALVATFIVMGRLVERSLKTSQWESSPLLPAVLVFVEWLPSV 485
            L       + + LEL   AL ATFI MGR+V R L+ +   S PLLPA+LVFVEWL S+
Sbjct: 431 ELPGSKDGKNKKHLELIQFALNATFIFMGRVVYRCLRANSLNSCPLLPAILVFVEWLASM 490

Query: 486 LDEVVRYGSDAKTRSSMSYFFGAFVELVQKLNVHTAEAHCSLAIPLWEDYELRGFTPLAF 536
           LDEV  YG D KT+SS+SYFF AF++L+++L+V+  E    + I LWEDYELRGF PLA 
Sbjct: 491 LDEVEAYGVDEKTKSSISYFFAAFMDLLKQLDVN-VEIVSDVRIALWEDYELRGFAPLAQ 550

BLAST of Cp4.1LG09g08350 vs. TAIR10
Match: AT1G28260.1 (AT1G28260.1 Telomerase activating protein Est1)

HSP 1 Score: 323.9 bits (829), Expect = 3.0e-88
Identity = 291/979 (29.72%), Positives = 438/979 (44.74%), Query Frame = 1

Query: 2   ATSANQSKKESLLNEVVSLEKQLTASILSKGILHSDVKDLYYKVCSIYERIFLNEHEQLE 61
           A SA+Q +K + L EV ++EKQL   I SK ILH+DV +LY K  S YE+IF +  +  E
Sbjct: 3   ANSADQKQKPNFLVEVNNIEKQLWTLIHSKTILHTDVSELYAKAGSTYEQIFKSNLQHEE 62

Query: 62  LQDVEYSLWKLHYKLIDEFRKRIKRSSANVESPKLGTGQHSNDAQRSSSNHIAEFRLFLM 121
           LQ+VE+ LWKLHYK IDEFRK +K            T  H        + H+  F+LFL 
Sbjct: 63  LQEVEFCLWKLHYKHIDEFRKGLK------------TNDH--------AKHMKAFKLFLS 122

Query: 122 EATKFYQKLILKIREYNGVQKEGLLYKAFGVSKGIDPKRKKTCQFLCHRLLVCLGDL--- 181
           +A +FYQ LI K+R Y         Y       G     ++  +FLCHR  +CLGDL   
Sbjct: 123 KAAEFYQNLISKVRGY---------YHRLSEESG-----EQKSRFLCHRFYICLGDLQRY 182

Query: 182 -------------SSSA-----VKEPFPDAWD---------------------------- 241
                        S++A       + +PD+ +                            
Sbjct: 183 QEQYLKAHEHPNWSTAATYYLEAAKSWPDSGNPHNQLAVLATYVSDELLALYHCVRSLAV 242

Query: 242 ---------NLLLLFERNRSSLMPSLSRDAEFDFLRPSEKCCLEIKSQTKDDRKS---LE 301
                    NLLLLFE+NRSS + SLS DAEF++L PSEK  + +K +     K      
Sbjct: 243 KEPFPGASNNLLLLFEKNRSSPLQSLSTDAEFNYLNPSEK-KVSVKERDLSKAKGELVAG 302

Query: 302 TDLFSLLIRTLAFFFINSSLEQFTSTFSSMMRSLDELLSLDDSELNVSLESLEQFTSTFS 361
            DL+ L++RT +FFF+ S                                S ++F   F+
Sbjct: 303 IDLWPLVVRTTSFFFLKS--------------------------------SFDEFGRAFA 362

Query: 362 SMMRSLDELLSLDDSELNVSLESYELLDSVRTGPFRAIQIASIFIFMVQNLLSKANLNDL 421
           S +R LD   + DD  L   LESY+ +D+ R GP++ +QI ++FI++  N L++AN +D+
Sbjct: 363 STIRELDAAFAADDRNLEAMLESYQFMDTARKGPYKILQIVAVFIYIFHN-LAEANGSDI 422

Query: 422 --QQLELTHLALVATFIVMGRLVERSLKTSQWESSPLLPAVLVFVEWLPSVLDEVVRYGS 481
             ++++LT+LAL   FIVMGR+VER LKT+  +S PLLPA+LVF+++LP +LD+V     
Sbjct: 423 VKEEVKLTNLALTMVFIVMGRVVERCLKTTPLDSCPLLPALLVFLDYLPFLLDKVEEEEE 482

Query: 482 ----DAKTRSSMSYFFGAFVELVQKLNVHTAEAHCSLAIPLWEDYELRGFTPLAFAHEPL 541
               D K++S++SYFFG  V+++ +L V          + LWED+EL+   PLA  H  L
Sbjct: 483 ECRFDEKSKSAISYFFGKLVDILNQLKVKDKNCPAKTLLALWEDHELKSLAPLAPIHALL 542

Query: 542 DFSSHWEHIDNFECGAEHRAYRITVAATKI-SNIAKDYPEWIIHDKVEQNELPDKKELED 601
           DFSS+ +  ++F+ G E R  RI  +A  I +   K   +W+  D    +      EL+ 
Sbjct: 543 DFSSNMDLRESFDRGKELRLQRIISSAIDITTRQKKGSQKWLFFDNQRTHFYTTSGELQS 602

Query: 602 EEVILFKPLTRYNSAPISIAGSDEASPKSTEAQTISSDECLRRATSLLIQQTQGQTDPFA 661
              +        N   ++I G  E  P   E      +E       LL    + Q+ P  
Sbjct: 603 NGELFHGNGEGRNRKCVTI-GPVEIIPLENERSVPVEEE----EVILLKPLVRCQSAPIY 662

Query: 662 FHNLSRNKPFEQQHDVSEGTISTGPPSLS---AWVINKGFTFNP-VTEKGPGLQPIDELT 721
              ++  KP       S    +T   SL    + + ++ F+F   + +  P    ++E T
Sbjct: 663 SSGIAA-KPLSSDCTTSGNQTTTSNDSLRRTLSLIGSESFSFTQGLKDTDPQHLHLEEGT 722

Query: 722 LA-FVNSLK-----------------------LDDTENSASIPSSKSGKSDLFPPPPYST 781
           ++    SL                        +D+T   ++  S     S   P   YS 
Sbjct: 723 VSGRPPSLSAWVVDKNKEKGRLGLSKPNGLGPIDETGPVSAFDSLSINSSTEHPASSYSP 782

Query: 782 PVPSAPYLPDDAVWTNATNANISRNIDQNDTFSGSAYSNWTAPQATYEHGPMIGGLTNMY 841
           P PSAP LP+DA W +    N +        +  + Y         Y + P +G      
Sbjct: 783 PTPSAPLLPEDASWFH----NDASTNKAESFYDQTRYMELPGIMKPYTNPPFVG------ 842

Query: 842 PSSHRMSSSEWLRQYRENQVRAPPYN----ASGNVMNLQRNDTSRYEHL--YPTMNMESP 877
                +SSSEWLR+YRE++   P Y+     + N+ N   + +S++  L  Y T N  S 
Sbjct: 843 -----ISSSEWLRRYRESRNLGPAYSYQAQGTNNLRNFMAHGSSKFSLLARYGTPNDSS- 880
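
Each HSP header in these BLAST blocks reports identity and positives as a residue count over the alignment length, followed by a rounded percentage. As a minimal sketch, the percentages can be recomputed from the raw counts in Python. The function name `parse_hsp_stats` and the regular expression are illustrative assumptions and are not part of any BLAST tooling.

```python
import re

# Matches e.g. "Identity = 291/979 (29.72%), Positives = 438/979 (44.74%)".
# "Posi?tives" also accepts the "Postives" spelling that some report pages emit.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+)"
)

def parse_hsp_stats(line):
    """Return (identity_pct, positives_pct) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return (round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2))

# Header line of the AT1G28260.1 hit shown above.
line = "Identity = 291/979 (29.72%), Positives = 438/979 (44.74%), Query Frame = 1"
print(parse_hsp_stats(line))
```

Recomputing from the counts is a quick consistency check on a copied report: the result should agree with the percentage printed in parentheses.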

BLAST of Cp4.1LG09g08350 vs. TAIR10
Match: AT5G19400.1 (AT5G19400.1 Telomerase activating protein Est1)

HSP 1 Score: 65.5 bits (158), Expect = 1.9e-10
Identity = 46/135 (34.07%), Positives = 65/135 (48.15%), Query Frame = 1

Query: 49  YERIFLNEHEQLELQDVEYSLWKLHYKLIDEFRKRIKRSSANVESPKLGTGQHSNDAQRS 108
           YE I L  H   E  ++E  LW+LHYK I+ FR  I R  A+  S      +  + A++ 
Sbjct: 51  YEAIILESHTFSEQHNIEIPLWQLHYKRIEYFRLHINRVLASSTSTAAQNVKGPSKAEQI 110

Query: 109 SSNHIAEFRLFLMEATKFYQKLILKIREYNGVQ----KEGLLYKAFGVSKGIDPKRKKTC 168
           +   + +FR FL EAT FY  +ILKIR   G+      E    +      G +    +  
Sbjct: 111 AQLKL-QFRTFLSEATGFYHDMILKIRSKYGLPLGSFSEDQQSQNLSDKDGKELAEVQKA 170

Query: 169 QFLCHRLLVCLGDLS 180
              CHR L+ LGDL+
Sbjct: 171 LKSCHRCLIYLGDLA 184

BLAST of Cp4.1LG09g08350 vs. NCBI nr
Match: gi|449457837|ref|XP_004146654.1| (PREDICTED: protein SMG7L [Cucumis sativus])

HSP 1 Score: 724.2 bits (1868), Expect = 2.8e-205
Identity = 394/604 (65.23%), Positives = 443/604 (73.34%), Query Frame = 1

Query: 4   SANQSKKESLLNEVVSLEKQLTASILSKGILHSDVKDLYYKVCSIYERIFLNEHEQLELQ 63
           + +Q++KE+LL+EVVSLEKQLT SILSKGILHSDV DLYYKVCSIYE+IF +EHEQ+ELQ
Sbjct: 3   ATSQNRKENLLHEVVSLEKQLTTSILSKGILHSDVNDLYYKVCSIYEKIFTSEHEQVELQ 62

Query: 64  DVEYSLWKLHYKLIDEFRKRIKRSSANVESPKLGTGQHSNDAQRSSSNHIAEFRLFLMEA 123
           DVEYSLWKLHYKLIDEFRKRIKRSS N  SPKLGT Q  N+ QRS+SNHIAEFRLFL+EA
Sbjct: 63  DVEYSLWKLHYKLIDEFRKRIKRSSGNGGSPKLGTTQSPNNVQRSNSNHIAEFRLFLLEA 122

Query: 124 TKFYQKLILKIREYNGVQKEGLLYKAFGVSKGIDPKRKKTCQFLCHRLLVCLGDLS---- 183
           TKFYQ LILKIREY GV  EGLLYKAF V+KGIDPK+KK CQFLCHRLL+CLGDL+    
Sbjct: 123 TKFYQILILKIREYYGVPNEGLLYKAFSVAKGIDPKKKKKCQFLCHRLLICLGDLARYVE 182

Query: 184 ------------SSAVKEPF------PDAW--------------DNLL------------ 243
                       ++A    F      PD+               D  L            
Sbjct: 183 QHEKLDVYSHKWAAAATHYFEATMVWPDSGNPHNQLAVLATYVNDQFLAMYHCVRSSAVK 242

Query: 244 -----------LLFERNRSSLMPSLSRDAEFDFLRPSEKCCLEIKSQTKDDRKSLETDLF 303
                      LLFERNRSSL+PSLS D +F+FLRPSEKCC EIKSQ KDD KSLETDLF
Sbjct: 243 EPFPDAWDNLILLFERNRSSLLPSLSGDGQFNFLRPSEKCCFEIKSQIKDDNKSLETDLF 302

Query: 304 SLLIRTLAFFFINSSLEQFTSTFSSMMRSLDELLSLDDSELNVSLESLEQFTSTFSSMMR 363
           SLLIRTL FFFINS                                SLE+FTS FSSMMR
Sbjct: 303 SLLIRTLGFFFINS--------------------------------SLEEFTSAFSSMMR 362

Query: 364 SLDELLSLDDSELNVSLESYELLDSVRTGPFRAIQIASIFIFMVQNLLSKANLNDLQQLE 423
            LDE LSLDDSELN SLESY+LLDSVRTGPFRAIQIAS+FIFMVQN  SK +LND QQ+E
Sbjct: 363 WLDEFLSLDDSELNASLESYKLLDSVRTGPFRAIQIASVFIFMVQNRFSKVDLNDKQQIE 422

Query: 424 LTHLALVATFIVMGRLVERSLKTSQWESSPLLPAVLVFVEWLPSVLDEVVRYGSDAKTRS 483
           LT LALV TFI MGRLVER L+ S+ +S PLLPAVL+FVEWLP+VLDEVVRYG D K+R+
Sbjct: 423 LTQLALVVTFIAMGRLVERCLEASKLDSFPLLPAVLIFVEWLPNVLDEVVRYGDDEKSRN 482

Query: 484 SMSYFFGAFVELVQKLNVHTAEAHCSLAIPLWEDYELRGFTPLAFAHEPLDFSSHWEHID 542
           SM+YFFG +V L+++LNV+  EA CSLAIPLWEDYELRGFTPLAF+H+PLDFSSHWEH+D
Sbjct: 483 SMTYFFGVYVGLLERLNVNKVEAQCSLAIPLWEDYELRGFTPLAFSHKPLDFSSHWEHMD 542

BLAST of Cp4.1LG09g08350 vs. NCBI nr
Match: gi|659068090|ref|XP_008442690.1| (PREDICTED: LOW QUALITY PROTEIN: protein SMG7L [Cucumis melo])

HSP 1 Score: 500.4 bits (1287), Expect = 6.6e-138
Identity = 261/369 (70.73%), Positives = 292/369 (79.13%), Query Frame = 1

Query: 180 SSAVKEPFPDAWDNLLLLFERNRSSLMPSLSRDAEFDFLRPSEKCCLEIKSQTKDDRKSL 239
           SSAVKEPFPDAWDNL+LLFERNRSSL+PSLSR+ +F+FLRPSEKCC EIKSQTKDD KSL
Sbjct: 238 SSAVKEPFPDAWDNLILLFERNRSSLLPSLSREGQFNFLRPSEKCCFEIKSQTKDDNKSL 297

Query: 240 ETDLFSLLIRTLAFFFINSSLEQFTSTFSSMMRSLDELLSLDDSELNVSLESLEQFTSTF 299
           E DLFSLLIRTL FFFINSSLE                                +FTSTF
Sbjct: 298 EADLFSLLIRTLGFFFINSSLE--------------------------------EFTSTF 357

Query: 300 SSMMRSLDELLSLDDSELNVSLESYELLDSVRTGPFRAIQIASIFIFMVQNLLSKANLND 359
           SSMMR LDELLSLDDSELN SLESY+LLDSVR GPFRAIQIAS+FIFMVQN  SK +LND
Sbjct: 358 SSMMRWLDELLSLDDSELNASLESYKLLDSVRKGPFRAIQIASVFIFMVQNRFSKVDLND 417

Query: 360 LQQLELTHLALVATFIVMGRLVERSLKTSQWESSPLLPAVLVFVEWLPSVLDEVVRYGSD 419
            QQLELT LALVATFIVMGRLVER L+ S+ +S PL+PAVL+F+EWLP+VL+EVVRYG D
Sbjct: 418 KQQLELTQLALVATFIVMGRLVERCLEASKLDSFPLVPAVLIFMEWLPNVLNEVVRYGDD 477

Query: 420 AKTRSSMSYFFGAFVELVQKLNVHTAEAHCSLAIPLWEDYELRGFTPLAFAHEPLDFSSH 479
            K+R+SM+Y FG +V L+++LNV   EA CSLAIPLWEDYELRGFTPLAFAH+ LDFSSH
Sbjct: 478 EKSRNSMTYXFGVYVGLLERLNVDKVEAQCSLAIPLWEDYELRGFTPLAFAHKQLDFSSH 537

Query: 480 WEHIDNFECGAEHRAYRITVAATKISNIAKDYPEWIIHDK-------VEQNELPDKKELE 539
           WEH+D FE GA+HRAYRI VAATKISNIA D P+WIIHDK       +EQNELPDKKELE
Sbjct: 538 WEHMDAFELGAKHRAYRIIVAATKISNIANDSPKWIIHDKTCEVVYTLEQNELPDKKELE 574

Query: 540 DEEVILFKP 542
             +  +  P
Sbjct: 598 SAKCYIVSP 574

BLAST of Cp4.1LG09g08350 vs. NCBI nr
Match: gi|703144030|ref|XP_010108172.1| (hypothetical protein L484_014497 [Morus notabilis])

HSP 1 Score: 461.5 bits (1186), Expect = 3.4e-126
Identity = 318/798 (39.85%), Positives = 421/798 (52.76%), Query Frame = 1

Query: 178 LSSSAVKEPFPDAWDNLLLLFERNRSSLMPSLSRDAEFDFLRPSEKCCLEIKSQTKD--- 237
           + S AVKEPFPDAWDNLLLL ERNRSS + SLS +A+F+F++P E+   +  S++ D   
Sbjct: 177 IRSLAVKEPFPDAWDNLLLLLERNRSSPLQSLSSEAQFNFIKPYERSITKTNSKSIDHSS 236

Query: 238 --DRKSLETDLFSLLIRTLAFFFINSSLEQFTSTFSSMMRSLDELLSLDDSELNVSLESL 297
             +  S  TD +SL IR ++FF +  SL++F S F+S+MR LD LL+LDD+EL  SLES 
Sbjct: 237 CRNNGSAATDFWSLFIRIISFFVVKPSLDEFPSAFTSVMRGLDALLALDDTELKASLES- 296

Query: 298 EQFTSTFSSMMRSLDELLSLDDSELNVSLESYELLDSVRTGPFRAIQIASIFIFMVQNLL 357
                                          Y+ +DS++ GPFRA+Q+ SIF++ +Q+L+
Sbjct: 297 -------------------------------YQHMDSIKAGPFRALQVVSIFLYTLQSLI 356

Query: 358 SKANL------NDLQQLELTHLALVATFIVMGRLVER--SLKTSQWESSPLLPAVLVFVE 417
           +   +      +D Q + L  LAL + FI MGR VER   LK     S PLLPAVLVFVE
Sbjct: 357 NCPQIKHFEEMSDTQLILLRQLALTSLFIFMGRFVERCLKLKAGALSSCPLLPAVLVFVE 416

Query: 418 WLPSVLDEVVRYGSDAKTRSSMSYFFGAFVELVQKLNVHTAEAHCSLAIPLWEDYELRGF 477
           WL ++L+E  +YG D ++ S+MSYFF +FV L+ +L  +  E + S++ PLWEDYELRGF
Sbjct: 417 WLATMLNEAEKYGVDRRSSSAMSYFFESFVALLNRLGANNNEGNTSVSTPLWEDYELRGF 476

Query: 478 TPLAFAHEPLDFSSHWEHIDNF--------------------------------ECGAEH 537
            P+  AHE L FSSHWEHIDNF                                + G   
Sbjct: 477 APVTRAHESLYFSSHWEHIDNFEEGTKSRCRRIRNAGLKIANRSNDSQKWIIYDQSGGNF 536

Query: 538 RAYRITVAATKI--------SNIAKDYPEWIIHDKVEQNELPDKKE----------LEDE 597
           R+  I   A +         S++  D  +    + VE+ E P  +E          +E+E
Sbjct: 537 RSVPINSNAAEFNENVESISSDLKTDASDQNFCEGVEEFEGPILEENPSVNGKSVTVEEE 596

Query: 598 EVILFKPLTRYNSAPISIAGSDEASPKSTEAQTISSDECLRRATSLLIQQTQG------- 657
           EVILFKPLTRYNSAP+    ++  SPK  E Q    D+CLRRATSLLI Q Q        
Sbjct: 597 EVILFKPLTRYNSAPLCTNSNEPTSPKEMEEQAAPPDDCLRRATSLLIAQNQAQGGTTFM 656

Query: 658 QTDPFAFHNLSRNKPFEQQHDV-SEGT---------ISTGPPSLSAWVINKGFTFNPVTE 717
           QTD     N   NKPF+QQ  V  E T         IS+GPPSLSAWV+ +G   N   +
Sbjct: 657 QTD---ISNFRHNKPFKQQELVFKEATMLPPFPDTLISSGPPSLSAWVLERGGLINNKEK 716

Query: 718 KGPG-----LQPIDELTLAFVNSLKLDDTENSASIPSSKSGKSDLFPPPPYSTPVPSAPY 777
              G     L PI+E+    +  L +   ++S+    S    +  +   PYS P PSAP 
Sbjct: 717 AASGIHKHILNPIEEMASESLCGLSITQNQDSS---RSHDFLATHYSSSPYSAPTPSAPL 776

Query: 778 LPDDAVWTNATNANI--SRNIDQNDTFS-----GSAYSNWTAPQA-TYEHG-PMIGGLTN 837
           LPDDA W     + +  S  I+  +T S      S+Y NW A Q  T  +G   I GL  
Sbjct: 777 LPDDAAWFTGLQSRLQPSEGINGTETLSNASQGNSSYPNWNATQGPTDLYGLSSIPGLAV 836

Query: 838 MYPSSHRMSSSEWLRQYRENQVRAPP--YNASGNVMNLQRNDTSRYEHLYPTMNMESPLR 877
            Y    RM+SSEWLRQYREN     P  + A GN+ N      S      PT++MESP  
Sbjct: 837 NYTPQRRMTSSEWLRQYRENHAWPWPSYFYAPGNIGNSDNPLASN-----PTVHMESPPL 896

BLAST of Cp4.1LG09g08350 vs. NCBI nr
Match: gi|641858918|gb|KDO77608.1| (hypothetical protein CISIN_1g001829mg [Citrus sinensis])

HSP 1 Score: 426.0 bits (1094), Expect = 1.6e-115
Identity = 249/548 (45.44%), Positives = 334/548 (60.95%), Query Frame = 1

Query: 10  KESLLNEVVSLEKQLTASILSKGILHSDVKDLYYKVCSIYERIFLNEHEQLELQDVEYSL 69
           K +LL EV + +KQL   I SKG+L  +V++LY++VCS YE+I LN+++Q ELQDVEYSL
Sbjct: 15  KPNLLVEVANTDKQLVTLIHSKGLLCPEVQELYHRVCSSYEKILLNDYDQAELQDVEYSL 74

Query: 70  WKLHYKLIDEFRKRIKRSSANVESPKLGTGQHSNDAQRSSSNHIAEFRLFLMEATKFYQK 129
           WKLHY+ IDEFRKRIK+SS +  +      Q   + QRSS NHI  F+ FL EA  FY  
Sbjct: 75  WKLHYRHIDEFRKRIKKSSVSDNTMP----QSGANVQRSSDNHIEGFKSFLSEAMAFYHN 134

Query: 130 LILKIREYNGVQKEGLLYKAFGVSKGIDPKRKKTCQFLCHRLLVCLGDLSSSAVKEPFPD 189
           L++KI+ Y G+ +E    K   +S  ++P +K+  QFLCHR LVCLGDL  +  KE + +
Sbjct: 135 LVVKIKRYYGLPEESSFAKEGYMSTTLEPNKKQKYQFLCHRFLVCLGDL--ARYKEQYEN 194

Query: 190 --------------------AWDNLLLLFERNRSSLMPSLSRDAEFDFLRPSEKCCLEIK 249
                                W +      +NRSS + SLS +A FD  +PSE+   +IK
Sbjct: 195 FGAQEHNWSVAVSHYLEATMIWPDSGNPQNQNRSSDLHSLSMEAHFDISKPSERSSNQIK 254

Query: 250 SQTKDDRKSL-----------ETDLFSLLIRTLAFFFINSSLEQFTSTFSSMMRSLDELL 309
           SQ++D   +            ET+L+SL+IRT++FFFI S                    
Sbjct: 255 SQSRDGFSNCNMLKAEHDCFKETNLWSLIIRTISFFFIKS-------------------- 314

Query: 310 SLDDSELNVSLESLEQFTSTFSSMMRSLDELLSLDDSELNVSLESYELLDSVRTGPFRAI 369
                       SLE F  TF+S MR LD  + LDD++L   LESY+L+DS RTGPFRA+
Sbjct: 315 ------------SLEDFPYTFASTMRELDAAMELDDAKLKALLESYQLMDSARTGPFRAL 374

Query: 370 QIASIFIFMVQNLLSKANL------NDLQQLELTHLALVATFIVMGRLVERSLKTSQWES 429
           Q+ SIFIF ++NL++   +      ND+QQLE    AL ATFI MGRLVER LK++  +S
Sbjct: 375 QVVSIFIFTIENLINAPEIKGSKDKNDMQQLEFIRWALSATFIFMGRLVERCLKSNSLDS 434

Query: 430 SPLLPAVLVFVEWLPSVLDEVVRYGSDAKTRSSMSYFFGAFVELVQKLNVHTAEAHCSLA 489
           SPLL +VLVFVEWL  +L++   Y SD K+RS+MSYFFGAFV L+++LN   +E      
Sbjct: 435 SPLLSSVLVFVEWLVGILEQAESYASDGKSRSAMSYFFGAFVGLLKQLNAR-SEVSSPKK 494

Query: 490 IPLWEDYELRGFTPLAFAHEPLDFSSHWEHIDNFECGAEHRAYRITVAATKISNIAKDYP 521
             LWEDYELRGF P+  +H+ LDFS H+ HI +FE G E RA R+  AA KI+N +    
Sbjct: 495 TALWEDYELRGFAPVLCSHQSLDFSVHFGHIKSFEAGIESRADRVINAAMKIANRSNGSQ 523

BLAST of Cp4.1LG09g08350 vs. NCBI nr
Match: gi|728839879|gb|KHG19322.1| (Telomerase-binding EST1A [Gossypium arboreum])

HSP 1 Score: 399.8 bits (1026), Expect = 1.2e-107
Identity = 251/613 (40.95%), Positives = 339/613 (55.30%), Query Frame = 1

Query: 6   NQSKKESLLNEVVSLEKQLTASILSKGILHSDVKDLYYKVCSIYERIFLNEHEQLELQDV 65
           +Q  + S L E+ + EK L   I +KG+LHSDV+DLY+KVC  YE  FL++HE  ELQDV
Sbjct: 86  DQKARASFLLEIANTEKHLWVLIHTKGLLHSDVRDLYHKVCLNYESFFLDDHELTELQDV 145

Query: 66  EYSLWKLHYKLIDEFRKRIKRSSANVESPKLGTGQHSNDAQRSSSNHIAEFRLFLMEATK 125
           EYSLWKLHYK IDEFRKR KRSSAN ES     G   +D     + +I  F+ FL++AT+
Sbjct: 146 EYSLWKLHYKHIDEFRKRTKRSSANSESTMCAMGSSGSD-----NRYIDGFKSFLLKATE 205

Query: 126 FYQKLILKIREYNGVQKEGLLYKAFGVSKGIDPKRKKTCQFLCHRLLVCLGDLS------ 185
           FY+KLI K+R + G+ +E    +  G++  I+P + + C FLCHR LVCLGDL+      
Sbjct: 206 FYKKLIEKLRSHYGLPEESSSSRRGGINASIEPVKLRKCHFLCHRFLVCLGDLARYMEQV 265

Query: 186 --SSAVKE--------------PFPDA--------------WDNLL-------------- 245
             SS +K                +PD+               D  L              
Sbjct: 266 EQSSVLKHNWSVAAAYYLEAAMVWPDSGNPQNQLAVLATYVGDEFLALYHCIRSLAVKEP 325

Query: 246 ---------LLFERNRSSLMPSLSRDAEFDFLRPSEKCCLEIKSQTK-----------DD 305
                    LLFERNRS  +PSLS + +FDFL+P E+   ++K Q+            ++
Sbjct: 326 FPDAWNNLVLLFERNRSCDLPSLSSEEQFDFLQPFERSDSQVKLQSSEKVSDGVLLKGEN 385

Query: 306 RKSLETDLFSLLIRTLAFFFINSSLEQFTSTFSSMMRSLDELLSLDDSELNVSLESLEQF 365
             S   + + LLIRTL+FFF+ S                                SLE F
Sbjct: 386 DHSAGMNFWLLLIRTLSFFFLKS--------------------------------SLEDF 445

Query: 366 TSTFSSMMRSLDELLSLDDSELNVSLESYELLDSVRTGPFRAIQIASIFIFMVQNLLSKA 425
              F+S MR LD +++LDD +L   LESY+L+DS RTGPFR +Q  S+FIF+  NL +  
Sbjct: 446 PCAFASTMRVLDVMMALDDIKLRAMLESYQLMDSARTGPFRVLQAVSVFIFVFHNLNNNL 505

Query: 426 NL------NDLQQLELTHLALVATFIVMGRLVERSLKTSQWESSPLLPAVLVFVEWLPSV 485
            L       + Q LEL   AL ATFI MGR+V R L+ +   S PLLPA+LVFVEWL S+
Sbjct: 506 ELPGSKDGKNKQHLELIQFALNATFIFMGRVVNRCLRANSLNSCPLLPAILVFVEWLASM 565

Query: 486 LDEVVRYGSDAKTRSSMSYFFGAFVELVQKLNVHTAEAHCSLAIPLWEDYELRGFTPLAF 536
            DEV  YG D KT+SS+SYF  AF++L+++L+V+  E    + I LWEDYELRGF PLA 
Sbjct: 566 FDEVEAYGVDEKTKSSISYFLAAFMDLLKQLDVN-VEIVSDVRIALWEDYELRGFAPLAQ 625

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
SMG7L_ARATH	5.3e-87	29.72	Protein SMG7L OS=Arabidopsis thaliana GN=SMG7L PE=2 SV=1
SMG7_ARATH	3.4e-09	34.07	Protein SMG7 OS=Arabidopsis thaliana GN=SMG7 PE=2 SV=1

Match Name	E-value	Identity	Description
A0A0A0LSD4_CUCSA	1.9e-205	65.23	Uncharacterized protein OS=Cucumis sativus GN=Csa_1G074930 PE=4 SV=1
W9RYX4_9ROSA	2.4e-126	39.85	Uncharacterized protein OS=Morus notabilis GN=L484_014497 PE=4 SV=1
A0A067GD48_CITSI	1.1e-115	45.44	Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g001829mg PE=4 SV=1
A0A0B0P381_GOSAR	8.4e-108	40.95	Telomerase-binding EST1A OS=Gossypium arboreum GN=F383_23932 PE=4 SV=1
A0A0D2VAS6_GOSRA	3.2e-107	41.27	Uncharacterized protein OS=Gossypium raimondii GN=B456_010G155000 PE=4 SV=1

Match Name	E-value	Identity	Description
AT1G28260.1	3.0e-88	29.72	Telomerase activating protein Est1
AT5G19400.1	1.9e-10	34.07	Telomerase activating protein Est1

Match Name	E-value	Identity	Description
gi|449457837|ref|XP_004146654.1|	2.8e-205	65.23	PREDICTED: protein SMG7L [Cucumis sativus]
gi|659068090|ref|XP_008442690.1|	6.6e-138	70.73	PREDICTED: LOW QUALITY PROTEIN: protein SMG7L [Cucumis melo]
gi|703144030|ref|XP_010108172.1|	3.4e-126	39.85	hypothetical protein L484_014497 [Morus notabilis]
gi|641858918|gb|KDO77608.1|	1.6e-115	45.44	hypothetical protein CISIN_1g001829mg [Citrus sinensis]
gi|728839879|gb|KHG19322.1|	1.2e-107	40.95	Telomerase-binding EST1A [Gossypium arboreum]
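
The E-values in these summary tables are written in scientific notation, so they can be compared numerically once converted to floats. The following is a minimal Python sketch that uses the five NCBI nr rows as hard-coded data. The variable names and the `1e-120` cutoff are illustrative assumptions, not thresholds used by the database.

```python
# (accession, E-value string, percent identity) copied from the NCBI nr rows.
hits = [
    ("XP_004146654.1", "2.8e-205", 65.23),
    ("XP_008442690.1", "6.6e-138", 70.73),
    ("XP_010108172.1", "3.4e-126", 39.85),
    ("KDO77608.1", "1.6e-115", 45.44),
    ("KHG19322.1", "1.2e-107", 40.95),
]

CUTOFF = 1e-120  # hypothetical significance cutoff, chosen only for illustration

# Keep hits whose E-value is at or below the cutoff (smaller means more significant).
strong = [acc for acc, evalue, _ in hits if float(evalue) <= CUTOFF]

# The most significant hit and the hit with the highest identity need not coincide.
best_evalue = min(hits, key=lambda h: float(h[1]))[0]
best_identity = max(hits, key=lambda h: h[2])[0]

print(strong, best_evalue, best_identity)
```

In this table the lowest E-value and the highest identity belong to different hits, because the Cucumis melo alignment covers fewer residues (369) than the Cucumis sativus alignment (604).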
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term	Definition
GO:0005515	protein binding
Vocabulary: INTERPRO
Term	Definition
IPR018834	DNA/RNA-bd_Est1-type
IPR011990	TPR-like_helical_dom_sf
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0008150	biological_process
cellular_component	GO:0005575	cellular_component
molecular_function	GO:0005515	protein binding
molecular_function	GO:0003674	molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG09g08350.1	Cp4.1LG09g08350.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR011990	Tetratricopeptide-like helical domain	unknown	SSF48452	TPR-like	coord: 20..279, score: 1.63E-39; coord: 329..507, score: 1.63
IPR018834	DNA/RNA-binding domain, Est1-type	PFAM	PF10373	EST1_DNA_bind	coord: 179..470, score: 1.6
None	No IPR available	PANTHER	PTHR15696	SMG-7 SUPPRESSOR WITH MORPHOLOGICAL EFFECT ON GENITALIA PROTEIN 7	coord: 313..852, score: 1.3E-225; coord: 6..281, score: 1.3E
None	No IPR available	PANTHER	PTHR15696:SF3	PROTEIN SMG7L	coord: 313..852, score: 1.3E-225; coord: 6..281, score: 1.3E

The following gene(s) are paralogous to this gene:
Gene	Paralogue	Organism	Block
Cp4.1LG09g08350	Cp4.1LG20g04340	Cucurbita pepo (Zucchini)	cpecpeB048