Cp4.1LG03g12160 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG03g12160
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionHAT dimerization domain-containing protein
LocationCp4.1LG03 : 10565046 .. 10572348 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TAGGAGAAAATGAACGAAGACGTTGATGTGCAGATTGAGTTCAAAATTCAAACAAAGAGACCCAAATCGCCAGGCGACTTCTTCTTCTTCCTCGTCTTCACAGCCCGGATTTTGTCGCCGGCTCCGATTCTAGGGTTTCTGTCCCGGTGAATGCTGCTGCTGCTGTTGCTGTTCCTCCGATGAATCGGCCGCACTGCAACCATCTTCAATCCAACTCCATTTCTAACTGCCACGAATGCGGGATTTCTCAGTCCGTATGCTGGATCCTCCACAATGTCCGTTTCAAAGCCTCCTTCCGTCGTCTCTGCACCAATTGCGTCCTTAAGAACAATCTTTCCCGTTTCTGTCCTCTTTGCTTCGATATTTACGACGATTCGACTCCGCCGTCGTCTCATCAGCGAGTTATGTGCTTCAGATGCCCTTCAATATCTCATGTCTCTTGCGCTTCCTTTCGTTTTTCCTCTACGTTCCTCTGCCCTATCTGCTCCGATCCTTGTTTTGCCTTCTTCGACGGTTTCGACTCCGGCGGCCTTCGCCAATCGGAGCCTGCCGTTGCCGTTTTGGCCGGTAGAGATGGTAAATCGGCGAAGGCGATTGTCGCTGCGGCTCGTGTCGCCGCTCAATCCATGCGAAGAGCGGCGGCTGACGCTAGGGCTGTAGCGGAGATGAAGATCAGAAATGCTGTGTTCGCTAAGAAGCAGGCTACTCTCGCCTTGGAACGGCTTGCTTATCTTGTACTTCATGGGAAAGACAGAAATGGATATGCTAAAACTAATGGAAATGCTGCTGCTGGTGCGGTGGAAGAAGAAGAAGAAACCGAGCTACAAGGTGAAGTGGTAACAGCCATTCTTGAGCGTATGAAGGCGAATCAGACTCAGTTTTGACGAAGGTAGCTGATTAAGGAAACAAACTGTAGAGTAATCTTCTTCGACTCGCATTCAATCGAGTACCTCCAAGATTCAGTTGAATCAGGGCGCAGAGGTACTATTTACTCATCCATGGAAGCTTGTGGATCGATGCAGTGACGATTATTCTATAATCATAGATGCAAGAACAAATCAGTAAAAGGAAGTGTGAATGAATCTCCATGTACATTTCAAAATTTCAACATTATTTGATTGTTCTTGAAGTGCTATTGAGAATGAGGCGCTTGTTCTGCTTTATCGATTCCAATTCTTCATTTTATCACTTCTATTTGCTCTATATATAGTTCTCAGCTTCTTCTTCATGCAGCTCTCTCTTCTATCGGCCATTTTTTTTCAGAATCTCATAGCTTTTGAGGTGTGGAAGCCATTGTTGCCGCCTTCAAGCTACAAGAATCGCCACTGCGGAGCCACGGTGAACATTTGATATTCAAATACCAGTGGCGGTGGAACGGTGGCCATATCAAGACGCCGGCGGCGATAGATAGTGCTGATTTCAACTAACAAAGTCTGATTCTGCGGTGAATTTTCTGAGATACCATCATTTCGTTTCGTTTGATTCAGATTATAGACCGAGTCTGCTCGATTCAACTCGCTTCCAGTTAACCGAGTTGACTCACTGAATTTCCCCGTCCATAAATTATTTTTCTGAGTTCGAAAAAAAAAAAAAAAAAACTCTTCTTTTTCCTTACAACTAGTAAAGAACGTTACTTTAGAGAATAATATCTATAATTAATTAATGGGGTTTCGGGTTTCCCCCTTTTATTTCCTTCCCACGAAAAAAAAAAGGGAAATGTTCTTAATTAGCGATCTGGAACCCTAGAGACCAGAACGAGTCGAGTTCAACCCATCTCACCCAGACATTCCACTCGCGACTCACGATTCCACCAGCGATCCCACCAGCCCTTCGTCAAGCTCCTCTCTCTCTCTCCCTCGGCTCCCTCTTTTCTTTGTCTTTAAACCTGGAATCAACCTTTTTGATTCTCAGAAAGTAATCAAAGATTATGGATGCCGTTAAATTCAGAATGCCGGTGAAACCGCCGGAAAGATTCTGAAAGGGGTCGGAATTTCTTCACATTCAAGGTATTGAAGATTGTATGGTTCCTGTTTATCGTTGCAATTTATAAGTTAGTTTCATATGATTGTTTTCAATTTTTGTTCCTGTTTAAATTGTGTGTTTTCTGTGTATCACCTGTTCGATGAAATTCCAAACTGTTATGAAGGATTCTGACTTGTTTTCAGCTTTTCTTTCTTTATTTTGGTGAACTATAGTCTGTTTTCTTATAGCATAACCATTTTATGTGATGCTTGGTATGGTCAAATCGATTAATATTCTATTTATCTTAGGGTTAGCTGTGTTTACCGGACCTCTAAGTTCATACTTTTTGTTTATGAAGATTGCTTGCTTCTCTATTCACAGTTTTGATTTTCACATTTCTTTGTTTAGATAAGAAAATCCCAATATATAGGATGTCATTGTTGTTTTATAATAGGTTTTGATAGTCAACATGAAAAGGAATCAATGTAGAATGTCTAATGGTGTCACCATAAATTTCTCTGACTGTAAATTTTAGGCACCTTTTGATCGATCACATCGTTAGCTGGAGTTTTCAGAATATGTGCTGACCGATCTCTAGCAGATGCGCATTAGACTTCTGCTTGGTTAGATTGGTCGAGGAGATTAGTTGCATTTACCTTAGTGTTTTACCAATGATGGCCCCTATTCGCACGTGTGGATTTGTTGATCCAGGTTGGGAGCATGGAGTTGCTCAAGATGAAAAGAAAAAGAAGGTTAAATGTAATTATTGCGGGAAAATAGTTAGTGGTGGCATATATAGGTTGAAGCAACATTTAGCTCGAGTTTCGGGGGAAGTTACGTATTGTGACAAGGCTCCAGAGGAAGTATATTTGAGAATGAGAGAAAACCTGGAAGGTTGTCGTTCCAATAAGAAACCAAGACAGTCGGAAGATGATGAACAATCATATTTGAACTTCCATTCCAATGATGATGAAGATGGTGGTTTACATGTGGCTTATAGAAATAGAGGAAGGCAATTGATGGTAAACCGAAACGTCGGTGCTAACATGACTCCTCTAAGATCATTAAGATATGTTGACCCTGGATGGGAACACGGTGTGGCTCAGGACGAAAGGAAGAAGAAGGTTAAGTGCAACTACTGTGAAAAGATAGTTAGTGGAGGTATTAATAGGTTTAAGCAACATCTAGCCAGAATTCCTGGAGAGGTAGCCCCGTGTAAACACGCTCCTGAGGAAGTGTATCTTAAGATCAAAGAGAATATGAAATGGCATCGTACTGGCAGGAGAAATGGACAGACCGATGCCAACGAGTTATCGGCTTATTTTATGCAATCAGATAATGAAGAAGAGGAAGACGAGAAAGAGGAATCCTTGCATCATATTAGCAAGGAAAGATTGATCGATGGTGACAAAAGGTCAAGCAAAGATTTGAGAAGTACCTTTAGGGGAATGTCCCCTGGTGGTGGATCTGAACCGTCGGTTAAAAGATCGAGGTTAGATTCTGTTTTTCTGAAAACCACCAAAAGACCAACCGAACAGTTGCACAAACAAGCATTAGTAAAAAGAGGAGCCAATAGGAGGTCACGCAAAGAAGTAATGTCTGCAATTTGCAAATTCTTTTGCTATGCAGGAATTCCTTTCCAGTCTGCAAATTCTGTTTACTTTCATAAGATGTTGGAGACAGTCGGTCAATATGGATCAGGCTTGGTTGGCCCCTCGTGCCAATTGATATCTGGTCGGTTATTACAGGACGAAGTGGCAACCGTTAAGACTTACCTGGTTGAGTTGAAGGCCTCCTGGGCAATTACTGGCTGTTCTATACTGGTGGACAGTCGGAAGGATTCAAATGGTCGAACGTCTATAAACTTTTTGGTTTCTTGTCCCCGTGGTGTTTACTTTGTCTCATCAGTTGATGCCACTGAAGTAGCAGATGACCCTTCAAACTTGTTTAGGGTGCTTGATGCAGTGGTAGATGAAATTGGGGAGGAAAATGTGGTGCAGGTAGTTTCTGCAATCATCTCGTAGTTCTATTCTTTATTGCCAGTGTCCTTGCTGATAATTGTTTGTTTTTCTCCATTATTTCTTTAGGTAATCACTGAGAATACTCCTAATTATAAAGCTGCTGGTAAAATGCTCGAGGAGAAGAGAAGAAATTTATTCTGGACGCCATGTGCGACCTATTGTATTGATCACATGCTTGAAGATTTTTTGAAATTGAGAACCGTGGAAGATTGCATGGAAAAGTGCCAAAAAATTACCAAGTTTATTTACAATCGGAACTGGTTGTTAAATTTCATGAAGAACGAGTTCACCCAAGGGTTGGAACTTCTTAGACCTGCAGTTACTCGGAACGCCTCGAACTTTGCTACTTTGCAGTGCTTCCTGGACCACAGAGCTAGTTTACGGAGAATGTTCGTCTCCAATGAGTGGACTTCTTGCAGGTTTTCTAAATCTGGTGAGGGACAAGAAGTAGAGATGATTGTATTAAATACTTCATTTTGGAAGAAGGTTCAATATGTTTGTAAATCTGTGGAACCAGTATTGCAAGTTCTTCAAAAATCCGATTCGGTTCAAAGCTTGTCGATGTCATCTATATATAATGACATGTACAGAGCCAAGTTCGCTATACAATCCATTCATGGCGACGATGCCAGGAAATATGGACCATTCTGGAATGTGATAGATAGCAACTGGAATTCTTTATTTTGCCACCCTTTACATATGGCTGCTTTTTTCTTAAACCCATCATACAGATATCGTCCTGATTTCGTGGCGGTATGAAAATAACTACTTCTTGGTGTTCCAAAATCATGACAATCTCTTTTCTCAGCTACATTTCCTTTGTTCGTGCAGCATTCGGAGGTGGTTCGTGGACTTAATGAATGCATAGTTCGGCTCGAGTCTGACAGTTCCAGAAGGATTTCTGCATCTATGCAGGTAATGGATGGGTTATTTTAGGAGCCTGTCGATTGTCCTGATTTTGTGTAATCTTCACTACTTCGATATTGGCTGTCCCCAGATTTCGGACTATAATTCAGCAAAATCTGATTTTGGAACTGAGCTGGCTATCAGTACAAGAACAGAGCTTGATCCAGGTGATTGCTGCATTATCCTAATCCAAGTAATAGAAAACTCTGTTTTTGTTGATAGTTGAATCGTAGTTATCAGTTTCATTTTTGCTTGAATAGCTGCATGGTGGCAACAACATGGAATTAGTTGTTTAGAACTGCAACAAATAGCTGTTCGCATACTGAGTCAAACATGTTCATCTTTGTGTTGTGAACACTACTGGTCCCCTTTCAAGAATGAACGCAGTCAAAAGAACAATGCTTTGTCTCAGAGAAAAATGGCTGATTTGTTGTATGTTCACTACAACCTTCGGCTTCGAGAACGCCAACTAAGAAAGCGATCTAGTGACTCTGTTTCTCTTGATGATATTCTTATGGAACATTTGTTGGATGATTGGATTGTGGAACCTCAGAAACAAGGCATGCAAGAAGATGAGGTATGTAGTCCCACCATCTTTACCTGCATAGAGATTCCATTATATGCAAATGAACTGCAATCCATAGACTTCATACAGTGGAAGGCATGCCTTATGTCTTCCAAGTGTCACATTTTTTTGTTTTTTGCCTTTTGGTTGTAGGTGAAAAAACCACACTTTCATGAAGAGAAATGGAAGAATATACTAGGTTATTAGAAAAAAACTAGTCCCAACAAAAAAATTGAGTCCCTAACTACCGATGTGAGATCTCACAATTCACCCCCTTTTGGGGCCAGCGTTCTCGCTGGCACTTGTTCCCTTCTCCAATCGATGTGGGACCCCCAATCCACCACCCCCTTTCGGGTCCTAGCATCCTTGCTAGCACACTATCTCGTGTCCACCCTCCCTCGGGGCTCAGCCTCCTCGTTGACACATTGCCCAGTGTTTGGCTCTGATACCATTTGTAATGGTCCAAGTCCACCGCTAGCAGATATTGTCCTCTTTGGGCTTTCCCTTCATCAAGGTTTTTAAACGCGTTTGTTGGGGAGATGTTTCCACACCCTTATAAAGAATGCTTCGTTCTCCTCCCCTACCGATGTGAGATTTCACAATCCACCCCCTTTCGAGGCCCAGCATCCTCGCTGGCACTTGTTCCCTTCTCCAATCATTGGCACTTGTTCCCTTGCTGGCACACCGCCTCGTGTCCACTCCGTTCGGGGCTCAGCCTCCTCGCTGGCACATCACCTGGTGTTTGGCTCTACTGCCATTTGTAACAGTCTAAGCCCACCGCTAGCAGATATTGTCCTCTTTGGGCTTTTCCCTTTTGGGCTTCCCTTCATCAAGGTTTTTAAAACGTGTCTGGTAGGGAGAGGTTTCCACACCCTTATAAAGAATGTTTCGTTCTCCTCTCCAACCGATGTGGGATCTCACAGCGAACTTTAGGAGACACCATGGATTTTAAGAAATGAAATTTTCTATGCAGGGTTAAACCCAAACGTTATAGCCTTATAGGATATCTCCAAATATCTAATGCCTATACCACCAGATTACCCATCAGGCTCGAGTGTTCAATTCGAAAACGTTAATCTGAAATCCTTGACAGATTAAGTTCTTGGATTTGTGATTCAACACATGAAAAGAGAAATATGTGAAAGATGTTATTGAGTGAGTTTCACTGTTTTAAATAATCGCCCTTTTAATATGTGTCCTCTGGCATCCTTCTGGAGTTGAAAACCCCTATGTATTTCAGGAAATCCTTTGTCCTGGAATGGAGACACTAGATGCATATGAGAATGATTTGATTGACTATGAGGACGGGACTACAGAGGCGGCGAGGAAGGGCTGTCTTCAACTGGTTTGTTTGACTAATGTCGAACCATTGGATGTCAACCCTGCCAATGGAGGCGCTTCCACCGACAATGATGCCGATGTTAAGTTCTACGACGATGAGCTAAGTGACTAAACTTGACATGAGTTGAGCTCACCAGCCATTGTCTGGTTACCTGATTTATGGCTTCATATTTATGTTTTTGATCTCTATAACCCAAGTCAGCTTTAATCCAGGTGCATCTGTAATTACACGAAGTTTTACTCTTATTCAAATCCGAATCCAAACGGTAAATTGTTCGAAGTTAGTTATCAGTTCTTAAGTTCCTCGTGTTTTGAGTGTATATATTTCTTAGCAGCAAGTTGTTTTTGCTTCATGAGGATCGTCTCGATTTTCGTAGTAAATGTGGTCTGTAATCCATAGCCTGTGATATGAAGTTCCAATCCTTCGAATACATGGCCTCGGGTATAACGACCCAGACCCACCGCTAGTAGATATTGTCTTCTTTAGCCTTTCTCTTTCGGCCTTCCC

mRNA sequence

TAGGAGAAAATGAACGAAGACGTTGATGTGCAGATTGAGTTCAAAATTCAAACAAAGAGACCCAAATCGCCAGGCGACTTCTTCTTCTTCCTCGTCTTCACAGCCCGGATTTTGTCGCCGGCTCCGATTCTAGGGTTTCTGTCCCGGTGAATGCTGCTGCTGCTGTTGCTGTTCCTCCGATGAATCGGCCGCACTGCAACCATCTTCAATCCAACTCCATTTCTAACTGCCACGAATGCGGGATTTCTCAGTCCGTATGCTGGATCCTCCACAATGTCCGTTTCAAAGCCTCCTTCCGTCGTCTCTGCACCAATTGCGTCCTTAAGAACAATCTTTCCCGTTTCTGTCCTCTTTGCTTCGATATTTACGACGATTCGACTCCGCCGTCGTCTCATCAGCGAGTTATGTGCTTCAGATGCCCTTCAATATCTCATGTCTCTTGCGCTTCCTTTCGTTTTTCCTCTACGTTCCTCTGCCCTATCTGCTCCGATCCTTGTTTTGCCTTCTTCGACGGTTTCGACTCCGGCGGCCTTCGCCAATCGGAGCCTGCCGTTGCCGTTTTGGCCGGTAGAGATGGTAAATCGGCGAAGGCGATTGTCGCTGCGGCTCGTGTCGCCGCTCAATCCATGCGAAGAGCGGCGGCTGACGCTAGGGCTGTAGCGGAGATGAAGATCAGAAATGCTGTGTTCGCTAAGAAGCAGGCTACTCTCGCCTTGGAACGGCTTGCTTATCTTGTACTTCATGGGAAAGACAGAAATGGATATGCTAAAACTAATGGAAATGCTGCTGCTGGTGCGGTGGAAGAAGAAGAAGAAACCGAGCTACAAGGTGAAGTGGTAACAGCCATTCTTGAGCGTATGAAGGCGAATCAGACTCAGTTTTGACGAAGGTAGCTGATTAAGGAAACAAACTGTAGAGTAATCTTCTTCGACTCGCATTCAATCGAGTACCTCCAAGATTCAGTTGAATCAGGGCGCAGAGAATCTCATAGCTTTTGAGGTGTGGAAGCCATTGTTGCCGCCTTCAAGCTACAAGAATCGCCACTGCGGAGCCACGGTGAACATTTGATATTCAAATACCAGTGGCGGTGGAACGGTGGCCATATCAAGACGCCGGCGGCGATAGATAGTGCTGATTTCAACTAACAAAGTCTGATTCTGCGGTGAATTTTCTGAGATACCATCATTTCGTTTCGTTTGATTCAGATTATAGACCGAGTCTGCTCGATTCAACTCGCTTCCAGTTAACCGAGTTGACTCACTGAATTTCCCCGTCCATAAATTATTTTTCTGAGTTCGAAAAAAAAAAAAAAAAAACTCTTCTTTTTCCTTACAACTAGTAAAGAACGTTACTTTAGAGAATAATATCTATAATTAATTAATGGGGTTTCGGGTTTCCCCCTTTTATTTCCTTCCCACGAAAAAAAAAAGGGAAATGTTCTTAATTAGCGATCTGGAACCCTAGAGACCAGAACGAGTCGAGTTCAACCCATCTCACCCAGACATTCCACTCGCGACTCACGATTCCACCAGCGATCCCACCAGCCCTTCGTCAAGCTCCTCTCTCTCTCTCCCTCGGCTCCCTCTTTTCTTTGTCTTTAAACCTGGAATCAACCTTTTTGATTCTCAGAAAGTAATCAAAGATTATGGATGCCGTTAAATTCAGAATGCCGGTGAAACCGCCGGAAAGATTCTGAAAGGGGTCGGAATTTCTTCACATTCAAGGCACCTTTTGATCGATCACATCGTTAGCTGGAGTTTTCAGAATATGTGCTGACCGATCTCTAGCAGATGCGCATTAGACTTCTGCTTGGTTAGATTGGTCGAGGAGATTAGTTGCATTTACCTTAGTGTTTTACCAATGATGGCCCCTATTCGCACGTGTGGATTTGTTGATCCAGGTTGGGAGCATGGAGTTGCTCAAGATGAAAAGAAAAAGAAGGTTAAATGTAATTATTGCGGGAAAATAGTTAGTGGTGGCATATATAGGTTGAAGCAACATTTAGCTCGAGTTTCGGGGGAAGTTACGTATTGTGACAAGGCTCCAGAGGAAGTATATTTGAGAATGAGAGAAAACCTGGAAGGTTGTCGTTCCAATAAGAAACCAAGACAGTCGGAAGATGATGAACAATCATATTTGAACTTCCATTCCAATGATGATGAAGATGGTGGTTTACATGTGGCTTATAGAAATAGAGGAAGGCAATTGATGGTAAACCGAAACGTCGGTGCTAACATGACTCCTCTAAGATCATTAAGATATGTTGACCCTGGATGGGAACACGGTGTGGCTCAGGACGAAAGGAAGAAGAAGGTTAAGTGCAACTACTGTGAAAAGATAGTTAGTGGAGGTATTAATAGGTTTAAGCAACATCTAGCCAGAATTCCTGGAGAGGTAGCCCCGTGTAAACACGCTCCTGAGGAAGTGTATCTTAAGATCAAAGAGAATATGAAATGGCATCGTACTGGCAGGAGAAATGGACAGACCGATGCCAACGAGTTATCGGCTTATTTTATGCAATCAGATAATGAAGAAGAGGAAGACGAGAAAGAGGAATCCTTGCATCATATTAGCAAGGAAAGATTGATCGATGGTGACAAAAGGTCAAGCAAAGATTTGAGAAGTACCTTTAGGGGAATGTCCCCTGGTGGTGGATCTGAACCGTCGGTTAAAAGATCGAGGTTAGATTCTGTTTTTCTGAAAACCACCAAAAGACCAACCGAACAGTTGCACAAACAAGCATTAGTAAAAAGAGGAGCCAATAGGAGGTCACGCAAAGAAGTAATGTCTGCAATTTGCAAATTCTTTTGCTATGCAGGAATTCCTTTCCAGTCTGCAAATTCTGTTTACTTTCATAAGATGTTGGAGACAGTCGGTCAATATGGATCAGGCTTGGTTGGCCCCTCGTGCCAATTGATATCTGGTCGGTTATTACAGGACGAAGTGGCAACCGTTAAGACTTACCTGGTTGAGTTGAAGGCCTCCTGGGCAATTACTGGCTGTTCTATACTGGTGGACAGTCGGAAGGATTCAAATGGTCGAACGTCTATAAACTTTTTGGTTTCTTGTCCCCGTGGTGTTTACTTTGTCTCATCAGTTGATGCCACTGAAGTAGCAGATGACCCTTCAAACTTGTTTAGGGTGCTTGATGCAGTGGTAGATGAAATTGGGGAGGAAAATGTGGTGCAGTGCTTCCTGGACCACAGAGCTAGTTTACGGAGAATGTTCGTCTCCAATGAGTGGACTTCTTGCAGGTTTTCTAAATCTGGTGAGGGACAAGAAGTAGAGATGATTGTATTAAATACTTCATTTTGGAAGAAGGTTCAATATGTTTGTAAATCTGTGGAACCAGTATTGCAAGTTCTTCAAAAATCCGATTCGGTTCAAAGCTTGTCGATGTCATCTATATATAATGACATGTACAGAGCCAAGTTCGCTATACAATCCATTCATGGCGACGATGCCAGGAAATATGGACCATTCTGGAATGTGATAGATAGCAACTGGAATTCTTTATTTTGCCACCCTTTACATATGGCTGCTTTTTTCTTAAACCCATCATACAGATATCGTCCTGATTTCGTGGCGCATTCGGAGGTGGTTCGTGGACTTAATGAATGCATAGTTCGGCTCGAGTCTGACAGTTCCAGAAGGATTTCTGCATCTATGCAGATTTCGGACTATAATTCAGCAAAATCTGATTTTGGAACTGAGCTGGCTATCAGTACAAGAACAGAGCTTGATCCAGCTGCATGGTGGCAACAACATGGAATTAGTTGTTTAGAACTGCAACAAATAGCTGTTCGCATACTGAGTCAAACATGTTCATCTTTGTGTTGTGAACACTACTGGTCCCCTTTCAAGAATGAACGCAGTCAAAAGAACAATGCTTTGTCTCAGAGAAAAATGGCTGATTTGTTGTATGTTCACTACAACCTTCGGCTTCGAGAACGCCAACTAAGAAAGCGATCTAGTGACTCTGTTTCTCTTGATGATATTCTTATGGAACATTTGTTGGATGATTGGATTGTGGAACCTCAGAAACAAGGCATGCAAGAAGATGAGGAAATCCTTTGTCCTGGAATGGAGACACTAGATGCATATGAGAATGATTTGATTGACTATGAGGACGGGACTACAGAGGCGGCGAGGAAGGGCTGTCTTCAACTGGTTTGTTTGACTAATGTCGAACCATTGGATGTCAACCCTGCCAATGGAGGCGCTTCCACCGACAATGATGCCGATGTTAAGTTCTACGACGATGAGCTAAGTGACTAAACTTGACATGAGTTGAGCTCACCAGCCATTGTCTGGTTACCTGATTTATGGCTTCATATTTATGTTTTTGATCTCTATAACCCAAGTCAGCTTTAATCCAGGTGCATCTGTAATTACACGAAGTTTTACTCTTATTCAAATCCGAATCCAAACGGTAAATTGTTCGAAGTTAGTTATCAGTTCTTAAGTTCCTCGTGTTTTGAGTGTATATATTTCTTAGCAGCAAGTTGTTTTTGCTTCATGAGGATCGTCTCGATTTTCGTAGTAAATGTGGTCTGTAATCCATAGCCTGTGATATGAAGTTCCAATCCTTCGAATACATGGCCTCGGGTATAACGACCCAGACCCACCGCTAGTAGATATTGTCTTCTTTAGCCTTTCTCTTTCGGCCTTCCC

Coding sequence (CDS)

ATGATGGCCCCTATTCGCACGTGTGGATTTGTTGATCCAGGTTGGGAGCATGGAGTTGCTCAAGATGAAAAGAAAAAGAAGGTTAAATGTAATTATTGCGGGAAAATAGTTAGTGGTGGCATATATAGGTTGAAGCAACATTTAGCTCGAGTTTCGGGGGAAGTTACGTATTGTGACAAGGCTCCAGAGGAAGTATATTTGAGAATGAGAGAAAACCTGGAAGGTTGTCGTTCCAATAAGAAACCAAGACAGTCGGAAGATGATGAACAATCATATTTGAACTTCCATTCCAATGATGATGAAGATGGTGGTTTACATGTGGCTTATAGAAATAGAGGAAGGCAATTGATGGTAAACCGAAACGTCGGTGCTAACATGACTCCTCTAAGATCATTAAGATATGTTGACCCTGGATGGGAACACGGTGTGGCTCAGGACGAAAGGAAGAAGAAGGTTAAGTGCAACTACTGTGAAAAGATAGTTAGTGGAGGTATTAATAGGTTTAAGCAACATCTAGCCAGAATTCCTGGAGAGGTAGCCCCGTGTAAACACGCTCCTGAGGAAGTGTATCTTAAGATCAAAGAGAATATGAAATGGCATCGTACTGGCAGGAGAAATGGACAGACCGATGCCAACGAGTTATCGGCTTATTTTATGCAATCAGATAATGAAGAAGAGGAAGACGAGAAAGAGGAATCCTTGCATCATATTAGCAAGGAAAGATTGATCGATGGTGACAAAAGGTCAAGCAAAGATTTGAGAAGTACCTTTAGGGGAATGTCCCCTGGTGGTGGATCTGAACCGTCGGTTAAAAGATCGAGGTTAGATTCTGTTTTTCTGAAAACCACCAAAAGACCAACCGAACAGTTGCACAAACAAGCATTAGTAAAAAGAGGAGCCAATAGGAGGTCACGCAAAGAAGTAATGTCTGCAATTTGCAAATTCTTTTGCTATGCAGGAATTCCTTTCCAGTCTGCAAATTCTGTTTACTTTCATAAGATGTTGGAGACAGTCGGTCAATATGGATCAGGCTTGGTTGGCCCCTCGTGCCAATTGATATCTGGTCGGTTATTACAGGACGAAGTGGCAACCGTTAAGACTTACCTGGTTGAGTTGAAGGCCTCCTGGGCAATTACTGGCTGTTCTATACTGGTGGACAGTCGGAAGGATTCAAATGGTCGAACGTCTATAAACTTTTTGGTTTCTTGTCCCCGTGGTGTTTACTTTGTCTCATCAGTTGATGCCACTGAAGTAGCAGATGACCCTTCAAACTTGTTTAGGGTGCTTGATGCAGTGGTAGATGAAATTGGGGAGGAAAATGTGGTGCAGTGCTTCCTGGACCACAGAGCTAGTTTACGGAGAATGTTCGTCTCCAATGAGTGGACTTCTTGCAGGTTTTCTAAATCTGGTGAGGGACAAGAAGTAGAGATGATTGTATTAAATACTTCATTTTGGAAGAAGGTTCAATATGTTTGTAAATCTGTGGAACCAGTATTGCAAGTTCTTCAAAAATCCGATTCGGTTCAAAGCTTGTCGATGTCATCTATATATAATGACATGTACAGAGCCAAGTTCGCTATACAATCCATTCATGGCGACGATGCCAGGAAATATGGACCATTCTGGAATGTGATAGATAGCAACTGGAATTCTTTATTTTGCCACCCTTTACATATGGCTGCTTTTTTCTTAAACCCATCATACAGATATCGTCCTGATTTCGTGGCGCATTCGGAGGTGGTTCGTGGACTTAATGAATGCATAGTTCGGCTCGAGTCTGACAGTTCCAGAAGGATTTCTGCATCTATGCAGATTTCGGACTATAATTCAGCAAAATCTGATTTTGGAACTGAGCTGGCTATCAGTACAAGAACAGAGCTTGATCCAGCTGCATGGTGGCAACAACATGGAATTAGTTGTTTAGAACTGCAACAAATAGCTGTTCGCATACTGAGTCAAACATGTTCATCTTTGTGTTGTGAACACTACTGGTCCCCTTTCAAGAATGAACGCAGTCAAAAGAACAATGCTTTGTCTCAGAGAAAAATGGCTGATTTGTTGTATGTTCACTACAACCTTCGGCTTCGAGAACGCCAACTAAGAAAGCGATCTAGTGACTCTGTTTCTCTTGATGATATTCTTATGGAACATTTGTTGGATGATTGGATTGTGGAACCTCAGAAACAAGGCATGCAAGAAGATGAGGAAATCCTTTGTCCTGGAATGGAGACACTAGATGCATATGAGAATGATTTGATTGACTATGAGGACGGGACTACAGAGGCGGCGAGGAAGGGCTGTCTTCAACTGGTTTGTTTGACTAATGTCGAACCATTGGATGTCAACCCTGCCAATGGAGGCGCTTCCACCGACAATGATGCCGATGTTAAGTTCTACGACGATGAGCTAAGTGACTAA

Protein sequence

MMAPIRTCGFVDPGWEHGVAQDEKKKKVKCNYCGKIVSGGIYRLKQHLARVSGEVTYCDKAPEEVYLRMRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEDGGLHVAYRNRGRQLMVNRNVGANMTPLRSLRYVDPGWEHGVAQDERKKKVKCNYCEKIVSGGINRFKQHLARIPGEVAPCKHAPEEVYLKIKENMKWHRTGRRNGQTDANELSAYFMQSDNEEEEDEKEESLHHISKERLIDGDKRSSKDLRSTFRGMSPGGGSEPSVKRSRLDSVFLKTTKRPTEQLHKQALVKRGANRRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQLISGRLLQDEVATVKTYLVELKASWAITGCSILVDSRKDSNGRTSINFLVSCPRGVYFVSSVDATEVADDPSNLFRVLDAVVDEIGEENVVQCFLDHRASLRRMFVSNEWTSCRFSKSGEGQEVEMIVLNTSFWKKVQYVCKSVEPVLQVLQKSDSVQSLSMSSIYNDMYRAKFAIQSIHGDDARKYGPFWNVIDSNWNSLFCHPLHMAAFFLNPSYRYRPDFVAHSEVVRGLNECIVRLESDSSRRISASMQISDYNSAKSDFGTELAISTRTELDPAAWWQQHGISCLELQQIAVRILSQTCSSLCCEHYWSPFKNERSQKNNALSQRKMADLLYVHYNLRLRERQLRKRSSDSVSLDDILMEHLLDDWIVEPQKQGMQEDEEILCPGMETLDAYENDLIDYEDGTTEAARKGCLQLVCLTNVEPLDVNPANGGASTDNDADVKFYDDELSD
BLAST of Cp4.1LG03g12160 vs. TrEMBL
Match: A0A0A0L859_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G171120 PE=4 SV=1)

HSP 1 Score: 837.8 bits (2163), Expect = 1.1e-239
Identity = 410/448 (91.52%), Postives = 426/448 (95.09%), Query Frame = 1

Query: 1   MMAPIRTCGFVDPGWEHGVAQDEKKKKVKCNYCGKIVSGGIYRLKQHLARVSGEVTYCDK 60
           MMAPIRT GFVDPGWEHGVAQDEKKKKVKCNYCGKIVSGGIYRLKQHLARVSGEVTYCDK
Sbjct: 1   MMAPIRTSGFVDPGWEHGVAQDEKKKKVKCNYCGKIVSGGIYRLKQHLARVSGEVTYCDK 60

Query: 61  APEEVYLRMRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEDGGLHVAYRNRGRQLMVNR 120
           APEEVYLRMRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDE+ G HV YRNRGRQLM NR
Sbjct: 61  APEEVYLRMRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEEDGSHVTYRNRGRQLMGNR 120

Query: 121 NVGANMTPLRSLRYVDPGWEHGVAQDERKKKVKCNYCEKIVSGGINRFKQHLARIPGEVA 180
           NVG NMTPLRSLRYVDPGWEHGVAQDERKKKVKCNYCEKIVSGGINRFKQHLARIPGEVA
Sbjct: 121 NVGTNMTPLRSLRYVDPGWEHGVAQDERKKKVKCNYCEKIVSGGINRFKQHLARIPGEVA 180

Query: 181 PCKHAPEEVYLKIKENMKWHRTGRRNGQTDANELSAYFMQSDNEEEEDEKEESLHHISKE 240
           PCKHAPEEVYLKIKENMKWHRTGRR+ QTDANE+SAYFMQSDNEEEE+EKEESLHHISKE
Sbjct: 181 PCKHAPEEVYLKIKENMKWHRTGRRHVQTDANEISAYFMQSDNEEEEEEKEESLHHISKE 240

Query: 241 RLIDGDKRSSKDLRSTFRGMSPGGGSEPSVKRSRLDSVFLKTTKRPTEQLHKQALVKRGA 300
           R IDGDKR SKDL+STFRGMSPGGGSEPSVKRSRLDSVFLKTTKR TEQ+ KQALVKRG 
Sbjct: 241 RFIDGDKRLSKDLKSTFRGMSPGGGSEPSVKRSRLDSVFLKTTKRQTEQVQKQALVKRGG 300

Query: 301 NRRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQLISGRLLQD 360
           NRRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQL+SGRLLQ+
Sbjct: 301 NRRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQLMSGRLLQE 360

Query: 361 EVATVKTYLVELKASWAITGCSILVDSRKDSNGRTSINFLVSCPRGVYFVSSVDATEVAD 420
           EVAT+K+YLVELKASWA+TGCSILVD+ KDS+GR  INFLVSCPRGVYFVSSVDA E+ D
Sbjct: 361 EVATIKSYLVELKASWAVTGCSILVDNWKDSDGRAFINFLVSCPRGVYFVSSVDAMEIVD 420

Query: 421 DPSNLFRVLDAVVDEIGEENVVQCFLDH 449
           DPSNLF VLD VVDEIGEENVVQ   ++
Sbjct: 421 DPSNLFSVLDGVVDEIGEENVVQVITEN 448

BLAST of Cp4.1LG03g12160 vs. TrEMBL
Match: E5GC38_CUCME (DNA binding protein OS=Cucumis melo subsp. melo PE=4 SV=1)

HSP 1 Score: 835.1 bits (2156), Expect = 7.2e-239
Identity = 408/448 (91.07%), Postives = 426/448 (95.09%), Query Frame = 1

Query: 1   MMAPIRTCGFVDPGWEHGVAQDEKKKKVKCNYCGKIVSGGIYRLKQHLARVSGEVTYCDK 60
           MMAPIRT GFVDPGWEHGVAQDEKKKKVKCNYCGKIVSGGIYRLKQHLARVSGEVTYCDK
Sbjct: 1   MMAPIRTSGFVDPGWEHGVAQDEKKKKVKCNYCGKIVSGGIYRLKQHLARVSGEVTYCDK 60

Query: 61  APEEVYLRMRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEDGGLHVAYRNRGRQLMVNR 120
           APEEVYLRMRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDE+ G HV YRNRGRQLM NR
Sbjct: 61  APEEVYLRMRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEEDGSHVTYRNRGRQLMGNR 120

Query: 121 NVGANMTPLRSLRYVDPGWEHGVAQDERKKKVKCNYCEKIVSGGINRFKQHLARIPGEVA 180
           NVG NMTPLRSLRYVDPGWEHGVAQDERKKKVKCNYCEKIVSGGINRFKQHLARIPGEVA
Sbjct: 121 NVGTNMTPLRSLRYVDPGWEHGVAQDERKKKVKCNYCEKIVSGGINRFKQHLARIPGEVA 180

Query: 181 PCKHAPEEVYLKIKENMKWHRTGRRNGQTDANELSAYFMQSDNEEEEDEKEESLHHISKE 240
           PCKHAPEEVYLKIKENMKWHRTGRR+ QTDANE+SAYFMQSDNEEEE+EKEESLHHISKE
Sbjct: 181 PCKHAPEEVYLKIKENMKWHRTGRRHVQTDANEISAYFMQSDNEEEEEEKEESLHHISKE 240

Query: 241 RLIDGDKRSSKDLRSTFRGMSPGGGSEPSVKRSRLDSVFLKTTKRPTEQLHKQALVKRGA 300
           R IDGDKR SKDL+STFRGM+PGGGSEPSVKRSRLDSVFLKTTKR TEQ+ KQALVKRG 
Sbjct: 241 RFIDGDKRLSKDLKSTFRGMAPGGGSEPSVKRSRLDSVFLKTTKRQTEQVQKQALVKRGG 300

Query: 301 NRRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQLISGRLLQD 360
           NRRSRKEVM+AICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQL+SGRLLQ+
Sbjct: 301 NRRSRKEVMTAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQLMSGRLLQE 360

Query: 361 EVATVKTYLVELKASWAITGCSILVDSRKDSNGRTSINFLVSCPRGVYFVSSVDATEVAD 420
           EVAT+K+YLVELKASWA+TGCSILVD+ K S+GR  INFLVSCPRGVYFVSSVDA E+ D
Sbjct: 361 EVATIKSYLVELKASWAVTGCSILVDNWKGSDGRAFINFLVSCPRGVYFVSSVDAMEIVD 420

Query: 421 DPSNLFRVLDAVVDEIGEENVVQCFLDH 449
           DPSNLFRVLD VVDEIGEENVVQ   ++
Sbjct: 421 DPSNLFRVLDGVVDEIGEENVVQVITEN 448

BLAST of Cp4.1LG03g12160 vs. TrEMBL
Match: A5BYZ7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_033845 PE=4 SV=1)

HSP 1 Score: 677.2 bits (1746), Expect = 2.5e-191
Identity = 328/454 (72.25%), Postives = 385/454 (84.80%), Query Frame = 1

Query: 2   MAPIRTCGFVDPGWEHGVAQDEKKKKVKCNYCGKIVSGGIYRLKQHLARVSGEVTYCDKA 61
           M  +R+ G+ DPGWEHG+AQDE+KKKVKCNYCGKIVSGGIYRLKQHLARVSGEVTYCDKA
Sbjct: 1   MTSLRSPGYSDPGWEHGIAQDERKKKVKCNYCGKIVSGGIYRLKQHLARVSGEVTYCDKA 60

Query: 62  PEEVYLRMRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEDGGL-HVAYRNRGRQLMVNR 121
           PEEVYL+MRENLEGCRSNKKPRQSEDD  +YLNFH NDDE+    H  YR++G+QLM +R
Sbjct: 61  PEEVYLKMRENLEGCRSNKKPRQSEDDGHTYLNFHQNDDEEEEEEHAGYRSKGKQLMSDR 120

Query: 122 NVGANMTPLRSLRYVDPGWEHGVAQDERKKKVKCNYCEKIVSGGINRFKQHLARIPGEVA 181
           N+  N+ PLRSL YVDPGWEHGVAQDERKKKVKCNYCEKIVSGGINRFKQHLARIPGEVA
Sbjct: 121 NLVINLAPLRSLGYVDPGWEHGVAQDERKKKVKCNYCEKIVSGGINRFKQHLARIPGEVA 180

Query: 182 PCKHAPEEVYLKIKENMKWHRTGRRNGQTDANELSAYFMQSDNEEEEDEK-EESLHHISK 241
           PCK+APEEVYLKIKENMKWHRTGRR+ + DA E+SA++M SDN++EEDE+ E++LH ++K
Sbjct: 181 PCKNAPEEVYLKIKENMKWHRTGRRHRRPDAKEISAFYMNSDNDDEEDEQDEDALHRMNK 240

Query: 242 ERLIDGDKRSSKDLRSTFRGMSPGGGSEPSVKRSRLDSVFLKTTKRPTEQLHKQALVKRG 301
           E LI G+KR SKDLR TFRG+SPG GSEPS++RSRLDSV  KT K      +KQ  VK G
Sbjct: 241 ENLIIGEKRLSKDLRKTFRGISPGSGSEPSLRRSRLDSVVPKTPKSQKALSYKQVKVKTG 300

Query: 302 ANRRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQLISGRLLQ 361
           +++++RKEV+SAICKFF +AG+P  +ANS YFHKMLE VGQYG GLVGP  QLISGR LQ
Sbjct: 301 SSKKTRKEVISAICKFFYHAGVPLHAANSPYFHKMLELVGQYGQGLVGPPTQLISGRFLQ 360

Query: 362 DEVATVKTYLVELKASWAITGCSILVDSRKDSNGRTSINFLVSCPRGVYFVSSVDATEVA 421
           +E+AT+K YL E KASWAITGCSI  DS +D+ GRT IN LVSCP G+YFVSSVDAT++ 
Sbjct: 361 EEIATIKNYLAEYKASWAITGCSIKADSWRDAQGRTLINILVSCPHGIYFVSSVDATDIV 420

Query: 422 DDPSNLFRVLDAVVDEIGEENVVQCFLDHRASLR 454
           DD +NLF++LD VV+E+GEENVVQ   ++  S +
Sbjct: 421 DDATNLFKLLDKVVEEMGEENVVQVITENTPSYK 454

BLAST of Cp4.1LG03g12160 vs. TrEMBL
Match: D7T690_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_05s0020g00790 PE=4 SV=1)

HSP 1 Score: 677.2 bits (1746), Expect = 2.5e-191
Identity = 328/454 (72.25%), Postives = 385/454 (84.80%), Query Frame = 1

Query: 2   MAPIRTCGFVDPGWEHGVAQDEKKKKVKCNYCGKIVSGGIYRLKQHLARVSGEVTYCDKA 61
           M  +R+ G+ DPGWEHG+AQDE+KKKVKCNYCGKIVSGGIYRLKQHLARVSGEVTYCDKA
Sbjct: 5   MTSLRSPGYSDPGWEHGIAQDERKKKVKCNYCGKIVSGGIYRLKQHLARVSGEVTYCDKA 64

Query: 62  PEEVYLRMRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEDGGL-HVAYRNRGRQLMVNR 121
           PEEVYL+MRENLEGCRSNKKPRQSEDD  +YLNFH NDDE+    H  YR++G+QLM +R
Sbjct: 65  PEEVYLKMRENLEGCRSNKKPRQSEDDGHTYLNFHQNDDEEEEEEHAGYRSKGKQLMSDR 124

Query: 122 NVGANMTPLRSLRYVDPGWEHGVAQDERKKKVKCNYCEKIVSGGINRFKQHLARIPGEVA 181
           N+  N+ PLRSL YVDPGWEHGVAQDERKKKVKCNYCEKIVSGGINRFKQHLARIPGEVA
Sbjct: 125 NLVINLAPLRSLGYVDPGWEHGVAQDERKKKVKCNYCEKIVSGGINRFKQHLARIPGEVA 184

Query: 182 PCKHAPEEVYLKIKENMKWHRTGRRNGQTDANELSAYFMQSDNEEEEDEK-EESLHHISK 241
           PCK+APEEVYLKIKENMKWHRTGRR+ + DA E+SA++M SDN++EEDE+ E++LH ++K
Sbjct: 185 PCKNAPEEVYLKIKENMKWHRTGRRHRRPDAKEISAFYMNSDNDDEEDEQDEDALHRMNK 244

Query: 242 ERLIDGDKRSSKDLRSTFRGMSPGGGSEPSVKRSRLDSVFLKTTKRPTEQLHKQALVKRG 301
           E LI G+KR SKDLR TFRG+SPG GSEPS++RSRLDSV  KT K      +KQ  VK G
Sbjct: 245 ENLIIGEKRLSKDLRKTFRGISPGSGSEPSLRRSRLDSVVPKTPKSQKALSYKQVKVKTG 304

Query: 302 ANRRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQLISGRLLQ 361
           +++++RKEV+SAICKFF +AG+P  +ANS YFHKMLE VGQYG GLVGP  QLISGR LQ
Sbjct: 305 SSKKTRKEVISAICKFFYHAGVPLHAANSPYFHKMLELVGQYGQGLVGPPTQLISGRFLQ 364

Query: 362 DEVATVKTYLVELKASWAITGCSILVDSRKDSNGRTSINFLVSCPRGVYFVSSVDATEVA 421
           +E+AT+K YL E KASWAITGCSI  DS +D+ GRT IN LVSCP G+YFVSSVDAT++ 
Sbjct: 365 EEIATIKNYLAEYKASWAITGCSIKADSWRDAQGRTLINILVSCPHGIYFVSSVDATDIV 424

Query: 422 DDPSNLFRVLDAVVDEIGEENVVQCFLDHRASLR 454
           DD +NLF++LD VV+E+GEENVVQ   ++  S +
Sbjct: 425 DDATNLFKLLDKVVEEMGEENVVQVITENTPSYK 458

BLAST of Cp4.1LG03g12160 vs. TrEMBL
Match: K7LCF8_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_09G079100 PE=4 SV=1)

HSP 1 Score: 645.2 bits (1663), Expect = 1.1e-181
Identity = 311/450 (69.11%), Postives = 378/450 (84.00%), Query Frame = 1

Query: 2   MAPIRTCGFVDPGWEHGVAQDEKKKKVKCNYCGKIVSGGIYRLKQHLARVSGEVTYCDKA 61
           MAPIR+ GFVDPGW+HG+AQDE+KKKV+CNYCGKIVSGGIYRLKQHLARVSGEVTYC+KA
Sbjct: 1   MAPIRSTGFVDPGWDHGIAQDERKKKVRCNYCGKIVSGGIYRLKQHLARVSGEVTYCEKA 60

Query: 62  PEEVYLRMRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEDGGLHVAYRNRGRQLMVNRN 121
           P+EVYL+M+ENLEGCRS+KK +Q   D Q+Y+NFHSNDDED    V  R++G+QLM +RN
Sbjct: 61  PDEVYLKMKENLEGCRSHKKQKQV--DAQAYMNFHSNDDEDEEEQVGCRSKGKQLMDDRN 120

Query: 122 VGANMTPLRSLRYVDPGWEHGVAQDERKKKVKCNYCEKIVSGGINRFKQHLARIPGEVAP 181
           V  N+TPLRSL YVDPGWEHGVAQDERKKKVKCNYCEKIVSGGINRFKQHLARIPGEVAP
Sbjct: 121 VSVNLTPLRSLGYVDPGWEHGVAQDERKKKVKCNYCEKIVSGGINRFKQHLARIPGEVAP 180

Query: 182 CKHAPEEVYLKIKENMKWHRTGRRNGQTDANELSAYFMQSDNEEEEDE---KEESLHHIS 241
           CK+APE+VYLKIKENMKWHRTGRR  + +A EL  ++ +SDN++++DE    E++LHH++
Sbjct: 181 CKNAPEDVYLKIKENMKWHRTGRRLRRPEAKELMPFYAKSDNDDDDDEYEQVEDALHHMN 240

Query: 242 KERLIDGDKRSSKDLRSTFRGMSPGGGSEPSVKRSRLDSVFLKTTKRPTEQLHKQALVKR 301
           KE L+D DKR SKD+  T++G+SP  G EP ++RSRLD+V+LK  K  T Q +KQ  VK 
Sbjct: 241 KETLMDVDKRFSKDIMKTYKGISPSTGPEPVLRRSRLDNVYLKLPKNQTPQTYKQVKVKT 300

Query: 302 GANRRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQLISGRLL 361
           G  ++ RKEV+S+ICKFF +AGIP ++A+S+YFHKMLE VGQYG GLV P  QL+SGRLL
Sbjct: 301 GPTKKLRKEVISSICKFFYHAGIPIKAADSLYFHKMLEVVGQYGQGLVCPPSQLMSGRLL 360

Query: 362 QDEVATVKTYLVELKASWAITGCSILVDSRKDSNGRTSINFLVSCPRGVYFVSSVDATEV 421
           Q+E+  +K YL+E KASWAITGCSI+ DS  D+ GRT+INFLVSCP GVYFVSSVDAT V
Sbjct: 361 QEEINCIKNYLLEYKASWAITGCSIMADSWIDTQGRTNINFLVSCPHGVYFVSSVDATNV 420

Query: 422 ADDPSNLFRVLDAVVDEIGEENVVQCFLDH 449
            +D  NLF++LD VV+E+GEENVVQ   ++
Sbjct: 421 VEDAPNLFKLLDKVVEEVGEENVVQVITEN 448

BLAST of Cp4.1LG03g12160 vs. TAIR10
Match: AT3G17450.1 (AT3G17450.1 hAT dimerisation domain-containing protein)

HSP 1 Score: 476.5 bits (1225), Expect = 3.3e-134
Identity = 253/461 (54.88%), Postives = 330/461 (71.58%), Query Frame = 1

Query: 2   MAPIRTCGFVDPGWEHGVAQDEKKKKVKCNYCGKIVSGGIYRLKQHLARVSGEVTYCDKA 61
           MAP  + G VDPGWEHGVAQD++KKKVKCNYCGKIVSGGIYRLKQHLARVSGEVTYCDK+
Sbjct: 1   MAPPGSIGVVDPGWEHGVAQDQRKKKVKCNYCGKIVSGGIYRLKQHLARVSGEVTYCDKS 60

Query: 62  PEEVYLRMRENLEGCRSNKKPRQSEDDE-QSYLNFH-SNDDEDGGLHV----AYRNRGRQ 121
           PEEV +RM+ENL   RS KK RQSED+  QS  +FH SN+D++         + R++G+ 
Sbjct: 61  PEEVCMRMKENL--VRSTKKLRQSEDNSGQSCSSFHQSNNDDEADEEERRCWSIRSKGKL 120

Query: 122 LMVNRNVGANMTPLRSLRYVDPGWEHGVAQDERKKKVKCNYCEKIVSGGINRFKQHLARI 181
            + + ++      LRS  Y+DPGWEHG+AQDERKKKVKCNYC KIVSGGINRFKQHLARI
Sbjct: 121 GLSDGSL------LRSSGYIDPGWEHGIAQDERKKKVKCNYCNKIVSGGINRFKQHLARI 180

Query: 182 PGEVAPCKHAPEEVYLKIKENMKWHRTGRRNGQTD--ANELSAYFMQSDNEEEEDEKEES 241
           PGEVAPCK APEEVY+KIKENMKWHR G+R  + D     L+   +  D ++EED ++  
Sbjct: 181 PGEVAPCKTAPEEVYVKIKENMKWHRAGKRQNRPDDEMGALTFRTVSQDPDQEEDREDHD 240

Query: 242 LHHISKERLIDGDKRSSKDLRSTFRGMSPGGGSEPSVKRSRLDSVFLKTTKRPTEQLHKQ 301
            +  S++RL+ G+ R SKD R +F   +    SE   KR+R+      ++ +      ++
Sbjct: 241 FYPTSQDRLMLGNGRFSKDKRKSFDSTNMRSVSEAKTKRARMIPFQSPSSSK------QR 300

Query: 302 ALVKRGANRR-SRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQL 361
            L    +NR  SRK+V S+I KF  + G+P ++ANS+YF KM+E +G YG G V PS QL
Sbjct: 301 KLYSSCSNRVVSRKDVTSSISKFLHHVGVPTEAANSLYFQKMIELIGMYGEGFVVPSSQL 360

Query: 362 ISGRLLQDEVATVKTYLVELKASWAITGCSILVDSRKDSNGRTSINFLVSCPRGVYFVSS 421
            SGRLLQ+E++T+K+YL E ++SW +TGCSI+ D+  ++ G+  I+FLVSCPRGVYF SS
Sbjct: 361 FSGRLLQEEMSTIKSYLREYRSSWVVTGCSIMADTWTNTEGKKMISFLVSCPRGVYFHSS 420

Query: 422 VDATEVADDPSNLFRVLDAVVDEIGEENVVQCFLDHRASLR 454
           +DAT++ +D  +LF+ LD +VD+IGEENVVQ    + A  R
Sbjct: 421 IDATDIVEDALSLFKCLDKLVDDIGEENVVQVITQNTAIFR 447

BLAST of Cp4.1LG03g12160 vs. TAIR10
Match: AT5G33406.1 (AT5G33406.1 hAT dimerisation domain-containing protein / transposase-related)

HSP 1 Score: 160.6 bits (405), Expect = 4.0e-39
Identity = 86/286 (30.07%), Postives = 146/286 (51.05%), Query Frame = 1

Query: 451 SLRRMFVSNEWTSCRFSKSGEGQEVEMIVLNTSFWKKVQYVCKSVEPVLQVLQKSDSVQS 510
           +LR+M  S+EW + +++K   G +++      SFWK V +  K   P++QVL+  D  + 
Sbjct: 35  NLRKMVHSDEWNASKWTKEAGGMKIKSFFFQESFWKNVLHALKLGGPLIQVLRMVDGERK 94

Query: 511 LSMSSIYNDMYRAKFAIQSIHGDDARKYGPFWNVIDSNWNSLFCHPLHMAAFFLNPSYRY 570
             M  IY  M +AK  I          Y   + +ID  W+     PLH A ++LNP + Y
Sbjct: 95  PPMGYIYGAMDQAKETIMKSFTYKEENYKMAFEIIDRRWDIQLHRPLHAAGYYLNPEFHY 154

Query: 571 -RPDFVAHSEVVRGLNECIVRLESDSSRRISASMQISDYNSAKSDFGTELAISTRTELDP 630
            +PD + + EV+ G   C+ RL      +     ++  +  A   FG  +AI  RT++ P
Sbjct: 155 GQPDDIGYEEVLGGFLGCLGRLVPKIETQDKIITELDAFKKATGLFGIPMAIRLRTKMSP 214

Query: 631 AAWWQQHGISCLELQQIAVRILSQTCSSLCCEHYWSPFKNERSQKNNALSQRKMADLLYV 690
           A WW  +G S   LQ  A+++LS TCS+  CE  W  F+   +++ N L+Q ++ D+++V
Sbjct: 215 AEWWSAYGSSTPNLQNFAIKVLSLTCSATGCERNWGVFQLLHTKRRNRLTQCRLNDMIFV 274

Query: 691 HYNLRLRERQLRKRSSDSVSLDDILMEHLLDDWIVEPQKQGMQEDE 736
            YN  L+ R  R  + D + L++I      ++W+    ++   + E
Sbjct: 275 KYNRALQRRYKRNDTFDPILLNEI---DQCNEWLTGRMEENSSDTE 317

BLAST of Cp4.1LG03g12160 vs. TAIR10
Match: AT4G15020.1 (AT4G15020.1 hAT transposon superfamily)

HSP 1 Score: 149.8 bits (377), Expect = 7.1e-36
Identity = 95/278 (34.17%), Postives = 148/278 (53.24%), Query Frame = 1

Query: 449 RASLRRMFVSNEWTSCRFSKSGEGQEVEMIVLNTSFWKKVQYVCKSVEPVLQVLQKSDSV 508
           +++L+ M  S EW  C +S+   G  +  +  + +FWK V  V     P+L+ L+   S 
Sbjct: 430 KSNLQAMVTSAEWNECSYSEEPSGLVMNALT-DEAFWKAVALVNHLTSPLLRALRIVCSE 489

Query: 509 QSLSMSSIYNDMYRAKFAIQSIHGDDARKYGPFWNVIDSNWNSLFCHPLHMAAFFLNPSY 568
           +  +M  +Y  +YRAK AI++ H  +   Y  +W +ID  W      PL  A FFLNP  
Sbjct: 490 KRPAMGYVYAALYRAKDAIKT-HLVNREDYIIYWKIIDRWWEQQQHIPLLAAGFFLNPKL 549

Query: 569 RYRPDFVAHSEVVRGLNECIVRLESDSSRRISASMQISDYNSAKSDFGTELAISTRTELD 628
            Y  +    SE++  + +CI RL  D   +     +++ Y +A   FG  LAI  R  + 
Sbjct: 550 FYNTNEEIRSELILSVLDCIERLVPDDKIQDKIIKELTSYKTAGGVFGRNLAIRARDTML 609

Query: 629 PAAWWQQHGISCLELQQIAVRILSQTC-SSLCCEHYWSPFKNERSQKNNALSQRKMADLL 688
           PA WW  +G SCL L + A+RILSQTC SS+ C     P ++   Q  N++ Q++++DL+
Sbjct: 610 PAEWWSTYGESCLNLSRFAIRILSQTCSSSVSCRRNQIPVEH-IYQSKNSIEQKRLSDLV 669

Query: 689 YVHYNLRLRERQLRKRSSDSVSLDDILMEHL--LDDWI 724
           +V YN+RL  RQL   S D  +LD +    +  L +W+
Sbjct: 670 FVQYNMRL--RQLGPGSGDD-TLDPLSHNRIDVLKEWV 701

BLAST of Cp4.1LG03g12160 vs. TAIR10
Match: AT3G22220.1 (AT3G22220.1 hAT transposon superfamily)

HSP 1 Score: 146.7 bits (369), Expect = 6.0e-35
Identity = 91/283 (32.16%), Postives = 142/283 (50.18%), Query Frame = 1

Query: 447 DHRASLRRMFVSNEWTSCRFSKSGEGQEVEMIVLNTSFWKKVQYVCKSVEPVLQVLQKSD 506
           D +  L+ M  S+EW  C +SK   G  +   + +  FWK +        P+L+VL+   
Sbjct: 424 DLKPYLQAMVTSSEWNDCSYSKEAGGLAMTETINDEDFWKALTLANHITAPILRVLRIVC 483

Query: 507 SVQSLSMSSIYNDMYRAKFAIQS--IHGDDARKYGPFWNVIDSNWNSLFCHPLHMAAFFL 566
           S +  +M  +Y  MYRAK AI++   H ++   Y  +W +ID  W      PL+ A F+L
Sbjct: 484 SERKPAMGYVYAAMYRAKEAIKTNLAHREE---YIVYWKIIDRWWLQ---QPLYAAGFYL 543

Query: 567 NPSYRYRPDFVAHSEVVRGLNECIVRLESDSSRRISASMQISDYNSAKSDFGTELAISTR 626
           NP + Y  D    SE+   + +CI +L  D + +      I+ Y +A   FG  LAI  R
Sbjct: 544 NPKFFYSIDEEMRSEIHLAVVDCIEKLVPDVNIQDIVIKDINSYKNAVGIFGRNLAIRAR 603

Query: 627 TELDPAAWWQQHGISCLELQQIAVRILSQTCSSLCCEHYWSPFKNERSQKNNALSQRKMA 686
             + PA WW  +G SCL L + A+RILSQTCSS           ++  +  N++ ++++ 
Sbjct: 604 DTMLPAEWWSTYGESCLNLSRFAIRILSQTCSSSIGSVRNLTSISQIYESKNSIERQRLN 663

Query: 687 DLLYVHYNLRLRERQLRKRSSDSVSLDDILMEHLLDDWIVEPQ 728
           DL++V YN+RLR         D+V         +L+DW+   Q
Sbjct: 664 DLVFVQYNMRLRRIGSESSGDDTVDPLSHSNMEVLEDWVSRNQ 700

BLAST of Cp4.1LG03g12160 vs. TAIR10
Match: AT1G79740.1 (AT1G79740.1 hAT transposon superfamily)

HSP 1 Score: 142.1 bits (357), Expect = 1.5e-33
Identity = 87/288 (30.21%), Postives = 151/288 (52.43%), Query Frame = 1

Query: 442 VQCFLDHRASLRRMFVSNEWTSCRFSKSGEGQEVEMIVL--NTSFWKKVQYVCKSVEPVL 501
           +Q  +  +A L+ MF   E+T+     + + Q +  + +  +  FW+ V+      EP+L
Sbjct: 334 LQSMMKQKARLKHMFNCPEYTT----NTNKPQSISCVNILEDNDFWRAVEESVAISEPIL 393

Query: 502 QVLQKSDSVQSLSMSSIYNDMYRAKFAIQSIHGDDARKYGPFWNVIDSNWNSLFCHPLHM 561
           +VL++  S    ++ SIY  M +AK +I++ +  D  K+  F +++D+NW      PLH 
Sbjct: 394 KVLREV-STGKPAVGSIYELMSKAKESIRTYYIMDENKHKVFSDIVDTNWCEHLHSPLHA 453

Query: 562 AAFFLNPSYRYRPDFVAHSEVVRGLNECIVRLESDSSRRISASMQISDYNSAKSDFGTEL 621
           AA FLNPS +Y P+    + +     + + +L   S  R   + QI  +  AK  FG  L
Sbjct: 454 AAAFLNPSIQYNPEIKFLTSLKEDFFKVLEKLLPTSDLRRDITNQIFTFTRAKGMFGCNL 513

Query: 622 AISTRTELDPAAWWQQHGISCLELQQIAVRILSQTCSSLCCEHYWSPFKNERSQKNNALS 681
           A+  R  + P  WW+Q G S   LQ++A+RILSQ CS    E  WS F+    ++ N + 
Sbjct: 514 AMEARDSVSPGLWWEQFGDSAPVLQRVAIRILSQVCSGYNLERQWSTFQQMHWERRNKID 573

Query: 682 QRKMADLLYVHYNLRLRERQLRKRSSDSVSLDDILMEHLLDDWIVEPQ 728
           +  +  L YV+ NL+L   ++    +D ++L+DI    ++ +W+ E +
Sbjct: 574 REILNKLAYVNQNLKL--GRMITLETDPIALEDI---DMMSEWVEEAE 611

BLAST of Cp4.1LG03g12160 vs. NCBI nr
Match: gi|778679159|ref|XP_011651096.1| (PREDICTED: uncharacterized protein LOC101213851 [Cucumis sativus])

HSP 1 Score: 837.8 bits (2163), Expect = 1.6e-239
Identity = 410/448 (91.52%), Postives = 426/448 (95.09%), Query Frame = 1

Query: 1   MMAPIRTCGFVDPGWEHGVAQDEKKKKVKCNYCGKIVSGGIYRLKQHLARVSGEVTYCDK 60
           MMAPIRT GFVDPGWEHGVAQDEKKKKVKCNYCGKIVSGGIYRLKQHLARVSGEVTYCDK
Sbjct: 1   MMAPIRTSGFVDPGWEHGVAQDEKKKKVKCNYCGKIVSGGIYRLKQHLARVSGEVTYCDK 60

Query: 61  APEEVYLRMRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEDGGLHVAYRNRGRQLMVNR 120
           APEEVYLRMRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDE+ G HV YRNRGRQLM NR
Sbjct: 61  APEEVYLRMRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEEDGSHVTYRNRGRQLMGNR 120

Query: 121 NVGANMTPLRSLRYVDPGWEHGVAQDERKKKVKCNYCEKIVSGGINRFKQHLARIPGEVA 180
           NVG NMTPLRSLRYVDPGWEHGVAQDERKKKVKCNYCEKIVSGGINRFKQHLARIPGEVA
Sbjct: 121 NVGTNMTPLRSLRYVDPGWEHGVAQDERKKKVKCNYCEKIVSGGINRFKQHLARIPGEVA 180

Query: 181 PCKHAPEEVYLKIKENMKWHRTGRRNGQTDANELSAYFMQSDNEEEEDEKEESLHHISKE 240
           PCKHAPEEVYLKIKENMKWHRTGRR+ QTDANE+SAYFMQSDNEEEE+EKEESLHHISKE
Sbjct: 181 PCKHAPEEVYLKIKENMKWHRTGRRHVQTDANEISAYFMQSDNEEEEEEKEESLHHISKE 240

Query: 241 RLIDGDKRSSKDLRSTFRGMSPGGGSEPSVKRSRLDSVFLKTTKRPTEQLHKQALVKRGA 300
           R IDGDKR SKDL+STFRGMSPGGGSEPSVKRSRLDSVFLKTTKR TEQ+ KQALVKRG 
Sbjct: 241 RFIDGDKRLSKDLKSTFRGMSPGGGSEPSVKRSRLDSVFLKTTKRQTEQVQKQALVKRGG 300

Query: 301 NRRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQLISGRLLQD 360
           NRRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQL+SGRLLQ+
Sbjct: 301 NRRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQLMSGRLLQE 360

Query: 361 EVATVKTYLVELKASWAITGCSILVDSRKDSNGRTSINFLVSCPRGVYFVSSVDATEVAD 420
           EVAT+K+YLVELKASWA+TGCSILVD+ KDS+GR  INFLVSCPRGVYFVSSVDA E+ D
Sbjct: 361 EVATIKSYLVELKASWAVTGCSILVDNWKDSDGRAFINFLVSCPRGVYFVSSVDAMEIVD 420

Query: 421 DPSNLFRVLDAVVDEIGEENVVQCFLDH 449
           DPSNLF VLD VVDEIGEENVVQ   ++
Sbjct: 421 DPSNLFSVLDGVVDEIGEENVVQVITEN 448

BLAST of Cp4.1LG03g12160 vs. NCBI nr
Match: gi|659077032|ref|XP_008438995.1| (PREDICTED: uncharacterized protein LOC103483923 [Cucumis melo])

HSP 1 Score: 835.1 bits (2156), Expect = 1.0e-238
Identity = 408/448 (91.07%), Postives = 426/448 (95.09%), Query Frame = 1

Query: 1   MMAPIRTCGFVDPGWEHGVAQDEKKKKVKCNYCGKIVSGGIYRLKQHLARVSGEVTYCDK 60
           MMAPIRT GFVDPGWEHGVAQDEKKKKVKCNYCGKIVSGGIYRLKQHLARVSGEVTYCDK
Sbjct: 1   MMAPIRTSGFVDPGWEHGVAQDEKKKKVKCNYCGKIVSGGIYRLKQHLARVSGEVTYCDK 60

Query: 61  APEEVYLRMRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEDGGLHVAYRNRGRQLMVNR 120
           APEEVYLRMRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDE+ G HV YRNRGRQLM NR
Sbjct: 61  APEEVYLRMRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEEDGSHVTYRNRGRQLMGNR 120

Query: 121 NVGANMTPLRSLRYVDPGWEHGVAQDERKKKVKCNYCEKIVSGGINRFKQHLARIPGEVA 180
           NVG NMTPLRSLRYVDPGWEHGVAQDERKKKVKCNYCEKIVSGGINRFKQHLARIPGEVA
Sbjct: 121 NVGTNMTPLRSLRYVDPGWEHGVAQDERKKKVKCNYCEKIVSGGINRFKQHLARIPGEVA 180

Query: 181 PCKHAPEEVYLKIKENMKWHRTGRRNGQTDANELSAYFMQSDNEEEEDEKEESLHHISKE 240
           PCKHAPEEVYLKIKENMKWHRTGRR+ QTDANE+SAYFMQSDNEEEE+EKEESLHHISKE
Sbjct: 181 PCKHAPEEVYLKIKENMKWHRTGRRHVQTDANEISAYFMQSDNEEEEEEKEESLHHISKE 240

Query: 241 RLIDGDKRSSKDLRSTFRGMSPGGGSEPSVKRSRLDSVFLKTTKRPTEQLHKQALVKRGA 300
           R IDGDKR SKDL+STFRGM+PGGGSEPSVKRSRLDSVFLKTTKR TEQ+ KQALVKRG 
Sbjct: 241 RFIDGDKRLSKDLKSTFRGMAPGGGSEPSVKRSRLDSVFLKTTKRQTEQVQKQALVKRGG 300

Query: 301 NRRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQLISGRLLQD 360
           NRRSRKEVM+AICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQL+SGRLLQ+
Sbjct: 301 NRRSRKEVMTAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQLMSGRLLQE 360

Query: 361 EVATVKTYLVELKASWAITGCSILVDSRKDSNGRTSINFLVSCPRGVYFVSSVDATEVAD 420
           EVAT+K+YLVELKASWA+TGCSILVD+ K S+GR  INFLVSCPRGVYFVSSVDA E+ D
Sbjct: 361 EVATIKSYLVELKASWAVTGCSILVDNWKGSDGRAFINFLVSCPRGVYFVSSVDAMEIVD 420

Query: 421 DPSNLFRVLDAVVDEIGEENVVQCFLDH 449
           DPSNLFRVLD VVDEIGEENVVQ   ++
Sbjct: 421 DPSNLFRVLDGVVDEIGEENVVQVITEN 448

BLAST of Cp4.1LG03g12160 vs. NCBI nr
Match: gi|147799625|emb|CAN75144.1| (hypothetical protein VITISV_033845 [Vitis vinifera])

HSP 1 Score: 677.2 bits (1746), Expect = 3.6e-191
Identity = 328/454 (72.25%), Postives = 385/454 (84.80%), Query Frame = 1

Query: 2   MAPIRTCGFVDPGWEHGVAQDEKKKKVKCNYCGKIVSGGIYRLKQHLARVSGEVTYCDKA 61
           M  +R+ G+ DPGWEHG+AQDE+KKKVKCNYCGKIVSGGIYRLKQHLARVSGEVTYCDKA
Sbjct: 1   MTSLRSPGYSDPGWEHGIAQDERKKKVKCNYCGKIVSGGIYRLKQHLARVSGEVTYCDKA 60

Query: 62  PEEVYLRMRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEDGGL-HVAYRNRGRQLMVNR 121
           PEEVYL+MRENLEGCRSNKKPRQSEDD  +YLNFH NDDE+    H  YR++G+QLM +R
Sbjct: 61  PEEVYLKMRENLEGCRSNKKPRQSEDDGHTYLNFHQNDDEEEEEEHAGYRSKGKQLMSDR 120

Query: 122 NVGANMTPLRSLRYVDPGWEHGVAQDERKKKVKCNYCEKIVSGGINRFKQHLARIPGEVA 181
           N+  N+ PLRSL YVDPGWEHGVAQDERKKKVKCNYCEKIVSGGINRFKQHLARIPGEVA
Sbjct: 121 NLVINLAPLRSLGYVDPGWEHGVAQDERKKKVKCNYCEKIVSGGINRFKQHLARIPGEVA 180

Query: 182 PCKHAPEEVYLKIKENMKWHRTGRRNGQTDANELSAYFMQSDNEEEEDEK-EESLHHISK 241
           PCK+APEEVYLKIKENMKWHRTGRR+ + DA E+SA++M SDN++EEDE+ E++LH ++K
Sbjct: 181 PCKNAPEEVYLKIKENMKWHRTGRRHRRPDAKEISAFYMNSDNDDEEDEQDEDALHRMNK 240

Query: 242 ERLIDGDKRSSKDLRSTFRGMSPGGGSEPSVKRSRLDSVFLKTTKRPTEQLHKQALVKRG 301
           E LI G+KR SKDLR TFRG+SPG GSEPS++RSRLDSV  KT K      +KQ  VK G
Sbjct: 241 ENLIIGEKRLSKDLRKTFRGISPGSGSEPSLRRSRLDSVVPKTPKSQKALSYKQVKVKTG 300

Query: 302 ANRRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQLISGRLLQ 361
           +++++RKEV+SAICKFF +AG+P  +ANS YFHKMLE VGQYG GLVGP  QLISGR LQ
Sbjct: 301 SSKKTRKEVISAICKFFYHAGVPLHAANSPYFHKMLELVGQYGQGLVGPPTQLISGRFLQ 360

Query: 362 DEVATVKTYLVELKASWAITGCSILVDSRKDSNGRTSINFLVSCPRGVYFVSSVDATEVA 421
           +E+AT+K YL E KASWAITGCSI  DS +D+ GRT IN LVSCP G+YFVSSVDAT++ 
Sbjct: 361 EEIATIKNYLAEYKASWAITGCSIKADSWRDAQGRTLINILVSCPHGIYFVSSVDATDIV 420

Query: 422 DDPSNLFRVLDAVVDEIGEENVVQCFLDHRASLR 454
           DD +NLF++LD VV+E+GEENVVQ   ++  S +
Sbjct: 421 DDATNLFKLLDKVVEEMGEENVVQVITENTPSYK 454

BLAST of Cp4.1LG03g12160 vs. NCBI nr
Match: gi|731388638|ref|XP_002274968.2| (PREDICTED: uncharacterized protein LOC100247647 [Vitis vinifera])

HSP 1 Score: 677.2 bits (1746), Expect = 3.6e-191
Identity = 328/454 (72.25%), Postives = 385/454 (84.80%), Query Frame = 1

Query: 2   MAPIRTCGFVDPGWEHGVAQDEKKKKVKCNYCGKIVSGGIYRLKQHLARVSGEVTYCDKA 61
           M  +R+ G+ DPGWEHG+AQDE+KKKVKCNYCGKIVSGGIYRLKQHLARVSGEVTYCDKA
Sbjct: 5   MTSLRSPGYSDPGWEHGIAQDERKKKVKCNYCGKIVSGGIYRLKQHLARVSGEVTYCDKA 64

Query: 62  PEEVYLRMRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEDGGL-HVAYRNRGRQLMVNR 121
           PEEVYL+MRENLEGCRSNKKPRQSEDD  +YLNFH NDDE+    H  YR++G+QLM +R
Sbjct: 65  PEEVYLKMRENLEGCRSNKKPRQSEDDGHTYLNFHQNDDEEEEEEHAGYRSKGKQLMSDR 124

Query: 122 NVGANMTPLRSLRYVDPGWEHGVAQDERKKKVKCNYCEKIVSGGINRFKQHLARIPGEVA 181
           N+  N+ PLRSL YVDPGWEHGVAQDERKKKVKCNYCEKIVSGGINRFKQHLARIPGEVA
Sbjct: 125 NLVINLAPLRSLGYVDPGWEHGVAQDERKKKVKCNYCEKIVSGGINRFKQHLARIPGEVA 184

Query: 182 PCKHAPEEVYLKIKENMKWHRTGRRNGQTDANELSAYFMQSDNEEEEDEK-EESLHHISK 241
           PCK+APEEVYLKIKENMKWHRTGRR+ + DA E+SA++M SDN++EEDE+ E++LH ++K
Sbjct: 185 PCKNAPEEVYLKIKENMKWHRTGRRHRRPDAKEISAFYMNSDNDDEEDEQDEDALHRMNK 244

Query: 242 ERLIDGDKRSSKDLRSTFRGMSPGGGSEPSVKRSRLDSVFLKTTKRPTEQLHKQALVKRG 301
           E LI G+KR SKDLR TFRG+SPG GSEPS++RSRLDSV  KT K      +KQ  VK G
Sbjct: 245 ENLIIGEKRLSKDLRKTFRGISPGSGSEPSLRRSRLDSVVPKTPKSQKALSYKQVKVKTG 304

Query: 302 ANRRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQLISGRLLQ 361
           +++++RKEV+SAICKFF +AG+P  +ANS YFHKMLE VGQYG GLVGP  QLISGR LQ
Sbjct: 305 SSKKTRKEVISAICKFFYHAGVPLHAANSPYFHKMLELVGQYGQGLVGPPTQLISGRFLQ 364

Query: 362 DEVATVKTYLVELKASWAITGCSILVDSRKDSNGRTSINFLVSCPRGVYFVSSVDATEVA 421
           +E+AT+K YL E KASWAITGCSI  DS +D+ GRT IN LVSCP G+YFVSSVDAT++ 
Sbjct: 365 EEIATIKNYLAEYKASWAITGCSIKADSWRDAQGRTLINILVSCPHGIYFVSSVDATDIV 424

Query: 422 DDPSNLFRVLDAVVDEIGEENVVQCFLDHRASLR 454
           DD +NLF++LD VV+E+GEENVVQ   ++  S +
Sbjct: 425 DDATNLFKLLDKVVEEMGEENVVQVITENTPSYK 458

BLAST of Cp4.1LG03g12160 vs. NCBI nr
Match: gi|1009163694|ref|XP_015900101.1| (PREDICTED: uncharacterized protein LOC107433329 [Ziziphus jujuba])

HSP 1 Score: 677.2 bits (1746), Expect = 3.6e-191
Identity = 331/453 (73.07%), Postives = 382/453 (84.33%), Query Frame = 1

Query: 2   MAPIRTCGFVDPGWEHGVAQDEKKKKVKCNYCGKIVSGGIYRLKQHLARVSGEVTYCDKA 61
           MAP R+ G VDPGWEHG+AQDEKKKKVKCNYCGKIVSGGIYRLKQHLARVSGEVTYCDKA
Sbjct: 1   MAPTRSSGLVDPGWEHGIAQDEKKKKVKCNYCGKIVSGGIYRLKQHLARVSGEVTYCDKA 60

Query: 62  PEEVYLRMRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEDGGLHVAYRNRGRQLMVNRN 121
           PE+VYLRM+ENLEGCRSNKKPR S DD Q+YLNFH+NDDE+  LHVAYR++G+QLM +RN
Sbjct: 61  PEDVYLRMKENLEGCRSNKKPRHSGDDGQAYLNFHTNDDEEQELHVAYRSKGKQLMGDRN 120

Query: 122 VGANMTPLRSLRYVDPGWEHGVAQDERKKKVKCNYCEKIVSGGINRFKQHLARIPGEVAP 181
           +G  +TPLRSL YVDPGWEH +AQDERKKKVKCNYC+KIVSGGINRFKQHLARIPGEVAP
Sbjct: 121 LGMKLTPLRSLGYVDPGWEHCIAQDERKKKVKCNYCDKIVSGGINRFKQHLARIPGEVAP 180

Query: 182 CKHAPEEVYLKIKENMKWHRTGRRNGQTDANELSAYFMQSDNEEEEDEK-EESLHHISKE 241
           CKHAPEEVYLKIK+NMKWHRTGR+  + DA E+  ++ QSDNE+EEDE+ E  LH I KE
Sbjct: 181 CKHAPEEVYLKIKDNMKWHRTGRKQRRPDAKEILTFYPQSDNEDEEDEQVEADLHLIRKE 240

Query: 242 RLIDGDKRSSKDLRSTFRGMSPGGGSEPSVKRSRLDSVFLKTTKRPTEQLHKQALVKRGA 301
           RLID D R  KDLR TF+G+SP   SEP +KRSRLDS+FL T K  T +  KQ  VK G+
Sbjct: 241 RLIDADGRLGKDLRKTFKGVSPSTVSEPLLKRSRLDSIFLNTFKGQTPESFKQVKVKTGS 300

Query: 302 NRRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQLISGRLLQD 361
           N++SRKEV+SAICKFF +AG+P Q+ANS+YFHKMLE VGQYG GLVGP  QLISGR LQ+
Sbjct: 301 NKKSRKEVISAICKFFYHAGVPLQAANSLYFHKMLELVGQYGYGLVGPPSQLISGRFLQE 360

Query: 362 EVATVKTYLVELKASWAITGCSILVDSRKDSNGRTSINFLVSCPRGVYFVSSVDATEVAD 421
           E+AT+K+YLVE KASWAITGCSIL DS +D+ GRT INFL S P G+YFVSS DATEV +
Sbjct: 361 EIATLKSYLVECKASWAITGCSILADSWRDTRGRTLINFLSSGPNGMYFVSSADATEVVE 420

Query: 422 DPSNLFRVLDAVVDEIGEENVVQCFLDHRASLR 454
           D  +LF++LD VV+EIGE+NVVQ    +  S +
Sbjct: 421 DAFSLFKLLDKVVEEIGEDNVVQVITQNTPSYK 453

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0L859_CUCSA1.1e-23991.52Uncharacterized protein OS=Cucumis sativus GN=Csa_3G171120 PE=4 SV=1[more]
E5GC38_CUCME7.2e-23991.07DNA binding protein OS=Cucumis melo subsp. melo PE=4 SV=1[more]
A5BYZ7_VITVI2.5e-19172.25Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_033845 PE=4 SV=1[more]
D7T690_VITVI2.5e-19172.25Putative uncharacterized protein OS=Vitis vinifera GN=VIT_05s0020g00790 PE=4 SV=... [more]
K7LCF8_SOYBN1.1e-18169.11Uncharacterized protein OS=Glycine max GN=GLYMA_09G079100 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT3G17450.13.3e-13454.88 hAT dimerisation domain-containing protein[more]
AT5G33406.14.0e-3930.07 hAT dimerisation domain-containing protein / transposase-related[more]
AT4G15020.17.1e-3634.17 hAT transposon superfamily[more]
AT3G22220.16.0e-3532.16 hAT transposon superfamily[more]
AT1G79740.11.5e-3330.21 hAT transposon superfamily[more]
Match NameE-valueIdentityDescription
gi|778679159|ref|XP_011651096.1|1.6e-23991.52PREDICTED: uncharacterized protein LOC101213851 [Cucumis sativus][more]
gi|659077032|ref|XP_008438995.1|1.0e-23891.07PREDICTED: uncharacterized protein LOC103483923 [Cucumis melo][more]
gi|147799625|emb|CAN75144.1|3.6e-19172.25hypothetical protein VITISV_033845 [Vitis vinifera][more]
gi|731388638|ref|XP_002274968.2|3.6e-19172.25PREDICTED: uncharacterized protein LOC100247647 [Vitis vinifera][more]
gi|1009163694|ref|XP_015900101.1|3.6e-19173.07PREDICTED: uncharacterized protein LOC107433329 [Ziziphus jujuba][more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0003676nucleic acid binding
GO:0046983protein dimerization activity
GO:0003677DNA binding
Vocabulary: INTERPRO
TermDefinition
IPR012337RNaseH-like_sf
IPR008906HATC_C_dom
IPR007021DUF659
IPR003656Znf_BED
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003677 DNA binding
molecular_function GO:0046983 protein dimerization activity
molecular_function GO:0003676 nucleic acid binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG03g12160.1Cp4.1LG03g12160.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR003656Zinc finger, BED-typePFAMPF02892zf-BEDcoord: 137..174
score: 2.2E-8coord: 13..50
score: 1.
IPR003656Zinc finger, BED-typePROFILEPS50808ZF_BEDcoord: 9..65
score: 10.456coord: 133..189
score: 11
IPR007021Domain of unknown function DUF659PFAMPF04937DUF659coord: 348..451
score: 7.0
IPR008906HAT, C-terminal dimerisation domainPFAMPF05699Dimer_Tnp_hATcoord: 623..693
score: 8.
IPR012337Ribonuclease H-like domainunknownSSF53098Ribonuclease H-likecoord: 378..696
score: 5.99
NoneNo IPR availablePANTHERPTHR32166FAMILY NOT NAMEDcoord: 98..743
score:
NoneNo IPR availablePANTHERPTHR32166:SF18HAT DIMERIZATION DOMAIN-CONTAINING PROTEINcoord: 98..743
score:

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
Cp4.1LG03g12160Wax gourdcpewgoB0766
Cp4.1LG03g12160Cucurbita pepo (Zucchini)cpecpeB184
Cp4.1LG03g12160Cucurbita pepo (Zucchini)cpecpeB482
Cp4.1LG03g12160Cucurbita maxima (Rimu)cmacpeB837
Cp4.1LG03g12160Cucurbita moschata (Rifu)cmocpeB787
Cp4.1LG03g12160Bottle gourd (USVL1VR-Ls)cpelsiB490
Cp4.1LG03g12160Watermelon (Charleston Gray)cpewcgB554
Cp4.1LG03g12160Watermelon (97103) v1cpewmB619
Cp4.1LG03g12160Melon (DHL92) v3.6.1cpemedB659
Cp4.1LG03g12160Melon (DHL92) v3.6.1cpemedB701
Cp4.1LG03g12160Silver-seed gourdcarcpeB0106
Cp4.1LG03g12160Silver-seed gourdcarcpeB1042