Cp4.1LG08g05680 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG08g05680
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionClathrin interactor EPSIN 2
LocationCp4.1LG08 : 590640 .. 598407 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GAAGGTGGGAAGAAAATGTAGAATGATGAGGTGAGTTGACCATGGATGGACGTGAGATGCAAGGTTGATTACTATGTGGAAGTTGAAATTTTAATACAATAAAACAAGTATGAGTTTGATTTTTGAGAACCCATTTGGGTTTTGGCTTTTATGATTCGCTGCGTGTAGCTGTGCTACTTGCGCTGTCTTCTCCTAATTTTTTTCCCATTTCTCCTCTCTCTCTCTCAGATTCTCAACAGCGAGGAACAGAGACGAATCCCTCTCTTTCCTCTTCTATTCTTTCCAGCTTCGAGCGAATTCCGGGATCGTCTTCTTTCAATCCTCAGATCGGGCTTCTGCTTTCAAATCCACTTACCGCAGTCGATTCAATCAGGTGCCCACTTGTTCGATTCGGTTCGATTCGATTCAATCTAGGTTTTAGCTGTTGGGTTTTGATTCAGATCTACTTCACCTCTCTTGTTCTTCGATTTCTGGTTGGCTACTCAAACAGTTCATTCGGCTACCTCTTTTGCGTCCTTAGTCTTGTAAGATTCTTCTTCTCCCACGCTTTTCCGTTGTTTCCGATATCATCTCGTTTTCTTTGTTCTTAGATTGTTGATTTTGCAAGAGCCTTTTCCCAGATTTTCCCCCCGATAATTGCTTTGAACATGTGAAATTTCTGGTAATAATTGTGTCGTATAGGATCTTTTCCCCCCTTTTCTAGCTGGGTTTATCTGATTTGTCGTAATTTTTTTATTTTTGTTTCATTGGTTATAGATCTGTTGGTTCTGTATTCGTATTGAGATTATTACTTTAGCAGCTGGTTGTGAATACCTTGATGGGGAAGATTCATTTAATTTTCCGGTTGTTGTGTAAATCGGTCAAGCATTTAGCCACCTCCCAGTTCAGATCCTTTACATTGCTGATTAAGACATTGTTTATTAATAATAGGGCTTAGAATAAGAACATTAGCTCAAAAGTGGGTGCGTTAGTTGCAGGGGAGTGAGGTTGTGAGATCCCACATTGGTTGGGGAGGAGAACGAAGTATTCTCTATAAGGGTGTGGAAATCTCTCCCTAGCAAACGCGTTTTAAAAACCTTGAGGGGAAGCCTGGAAGGGAAAGCCGAAAGAGGATAATATCTAGCCTAGTGGTGGGTTTGGGCCGTTAGAGAGGCAGGTCCATAACATGTTAAGAGTTGGACTTCTTGAATCATGAGCAACTTTTGTTGTTGAGGAAGGTGGAAGGTCTTATCTTGTTAGGTGAAATGTTGTTTCCAAGTGGTGCAATGTGGACAGGTTAGGTGTAGAGAATTTGACAAGAACCTTACCCTCCTAGCTTAGTGTTCGTTACACTTTCGTTTAGAGCTTGACACTTTGTGGTTGGATTCTTGTGAGCTAATATGGTTGTCACCAGGTTAGATGGGTAACGGCTACATTTCCGAAAGCTGTCGTCCATTGTAATCTTCCATACGACATTCATTTGCAGCACGGAATGTAGCCAATTTCATCCACTTTTTCACCTAATCAGATCACAACCATTGATGTATAGCCCAATTATCATCTTATTCTTATTGCTATATTTCTTCTCAAGTTTGAACTTTTACGTATTATATATTTAATATTGGGATCGTTTTGTATGTGTGTAATTACTGCATAGACTAACAAGAGCTTCCATTTGATTCAACAGCGGAACTAGGGGCGCTTCTGCTTGTTTCTGTAATTGATAATTACCAGCTAGGATGAAGAAGGCCTTCGATCAAACTGTTAGGGACTTGTAAGCTATATTCTTGCCCTGTCAAAATTCATTTTTATTCATTGATCATGTAAATTTAACTTTATGCATTAGGTTTTCCTAATAGGATCCCTCTATCTTTTATTGCAGAAAGCGAGAGGTCAATAAGACCGTGCTCAAAATTCCGAAAATAGAGCAGAAGGTATTCCTTTCAGTTCCTCTGTTTCCATTATTTTCTTCCCGCCCTGAAGTGATCTTGTCTAGATTCAAGTATTTATTTGTTTCTTTTAAGGAAAGAAAGTGTTTAGGTTCGTTTATCACTCATGTTCTCGATGATAGCCCTTAGGTGAAATAGAAATGTGAGATCTCACATTGGTTGAAGAGGGGAACGAAGCATTTCTTATAAGGGTGTGAAAACCTCTCCCTAGCAGATGTGTTTTAAAACCTTGAGGGGAAGCCCAGAAGGGAAAGCCTAAAGAGGACAATATCTGCTAGCGGTGGGCTTGGGCTGTTACAAGAAACGTAATTGATGCACAATTTTTAGGGGAGCGAAGAAATTTCTTAGCTATGGATGCTACATGACATAATTAACCAAAAGAGTTAGACTGTGATATTGTGATTTTAGACTTCTCGAAATTCTATAAGAAAACTTTGTAGTAAGGGTTTTATGCACGCGAACTTATATGGAACGCTGTTTTTACCTTCTTTTTTCAGGTTTTGGATGCCACTAGCAACGAGCCGTGGGGTCCTCATGGATCACTTCTTGCAGATATAGCACAGGCAACTCGAAATTAGTAAGCATAGGCTTGCTTTTGTAGATGCTTTTAGATTGAATATTCTACACCAGGGGAGAACTTATTCCAAATGTCCATGCAGTCATGAATATCAAATGATTATGGGAGTAATCTGGAAGCGGATTAACGATACTGGCAAGAATTGGAGGCATGTTTACAAGGTTAGCCGTTTTCCCTTTTTGTTTCGACTTTTAGGATTATTGTGTTTCGACGATTTAATGAACTTACCTTTCCATTTTGATACGGAAGCTCTTTATGCATTCCTAGTCCTTATATTCCTACCCAAACTTGTATCTTTCTCAGGGTTTGACCGTTTTGGAGTACTTGGTGGGTCATGGGTCGGAGCGTGTCATAGATGACATCAGAGAACATGTCTATCAGATATCGGTACGTTTCTCTTTCTCAATTCACTTTGACAGCTATAACTACTCAGCTCTCTGATCTTCCTGAAATGTGATTACTCTCTCTTGATTCATCGGGCATCTTTTGTACGACAGACCTTATCTAATTTTCAATACATTGACTCGAGCGGGAGGGATCAGGGTAACAATGTTAGAAAGAAATCGCAAAATCTTGTTGCCCTTGTAAATGATAAAGAGAGGATTGTTGAGGTCAGACAGAAAGCTGCTGCTAATAAGGACAAGTGAGTCTGCTCTCGACGTTCGGTTAATTTATATATTTGTCGATTCTTCGTTCTTTTCATTTTTCTTGGTTCGTCTCTATTTATTGCGCTAGTATTCTTATTTATTAGTGCCAATTCATCATTTCAGACCATCAGGTTATTTCGTTTTATTTACTCACAATTTTTCGTTTTATTTACTCACAATTATGTTGCATAGAAATGTTCTTTAAGCTTTCATGGAATAGTTTTAAAAACTTAATTACTACTACTGTTCTTATGACTTTCATGATGCTACATAGGTTTCACAGTGCAGCATCTATGGGCAGTATGTATAGACCAAGTGCAGGAGCATATGACGACCGCTATGAAGGAAGATACGGAAGCCGTGATGGAGACAGAAATGTCGATTCTTATGGGAGGGAAAGAGAATATGGTTTTAGAGATGATCGTTCCGGTGGAAATGAGGATTCATATGGTCGTGATTATGAAGACCGCTATAATAGAGATGGTTATAGGGATGATGATTATCGAGGAAGAAGCCGAAGCGTTGATGACTATCAGTATGGTTCAAGAAGCCGAAGCTCTGACAGAAATGGAGAACGTACATATGACGATGGTCAAGTTTCTTCTCGGTATGCGCATGCTGCAGATATAATATGCTGATGTTATATTAGTAAAGTCCATTATATATTTTTTGACAGTATGAAGTTTGATATATGTATAATTTTCCTGGAAAGCGCTTATTTTCTTAGCTCTCATTTCTTTGCCTCTTGTGTTTCTTGCAGCAACAACGATACTAGAGCGAACGAACCATCTCGGGATGAAAGGTTAGTTTTTAATCATTTAATTTCATTGTTCGTTTTATATTTATCCGATGAGGAAGCGGTTTAGTGGCACTGTTAATTTTGAATTTTTCATAGGCCGCTTGAACGTAAATTTTCGGAACAGAATATTGGTGCTCCACCTAGTTACGAAGAAGTTGTGACTGAATCTGAAAGCAATGTGCAAAGTCAAAGGTGCTAATTTTGTTCCGAAAATATGACCGTTACTTTTTTTCTTTCGTCTTGTTCGCCCGTTCATTCATTTGGGATATTGTATCTATAAGCAGAGATGTTGAAGCTCCACCGACTGCTGCTCCGAGAGCGTTTCCCCCACCCACATCCAGTACTCAAAGCCAGCTAACTAGTCATGGAACTGCTGCATCCCCACCTACCCAGGGGCTTGATGGTTCTGATGAATTCGATCCTCGTGGTTCAGTTCCAGGTATGGGAAGCACCTGATTTTCTATGCCGCACCTCATCGTTACTTATTGTAAAAAGGGTTAGATTGCAATGGGCACTTCCATCCATGAGCTTCTTAGCCCTACTCTGACCAGTGCCTCTCACCGTCCGGTCACTGGCTTTGATACTATTTGTAACAGCCTAAGCCCACTGCTAGCAGGTATTGTCCTCCTTGGGGGCTTGGGCTATTACAAATGGTATCAGAGCTAAACACCGGATGGTATGCCACTAAGGATAGGAGAATGAAGCATTCCTTATAAGGGATGGATGGATTGTGATCCCACATTGGTTGGATAAGGGGAACGAAACATTGGGTGTTGAAACTTCTCCCTAATAGACGCATTTTAAAACCGTGAGGCTGACGGTGATGCGTAACAGACCAAAATAGACAATATCTACTAGCGGTGGGCTTGAGCTGTTATAAACTGTACTTCCGTTAGTACTTTCTTTTGATATTTATTTGTACTTTGCACTTAATATCTCTTTCCATCTCTTATTTCAGCTGCTCCACCTGCCACAAGCAATTTAGAGACAAATTTATTAGACTCCTTGGCCTTAGTGCCTGTTGGGCCAGTAACATCAACTGCAGACTATGAGGGTCATGTTCAGACAAGCTCTGCTGTGGGATACCATGCTCAAAATCAGGTAATATTGAGGCATTGAAATGATTTTGAACTGATCTCTGTTTTGCTTATGTCATGCTTATGTATGCAATTCTCTCGCTCTCATGAACTCATGATTTGCTGTTTCTCATCACTTTTATTCGTGGCATTGCTGAGTGCATGTTATACCTGGTTTTGGTAGATATAGCTTTGTGAGACCCCACGTCGATTGTAGAGAGGAACGGAGTATTCCTTATAAGGGTGTGGAAACCTCTCCCTAGTAGACGCGTTTTAAAACCTTGAGGGGATGCTTGAAAGGAAAAGTCCAAAGAGGGCAATATCTGCTAACGGTGGACTTGGGTTGTTACAAATGGTATTAGAGTCAGATACCGAGCAGTGTGCCAACGAGGACGTCAGGCCTCCAAGGGGGATGGATTGTGAGACCCTACATCAGTTGGAGAGGGGAATAAAACATTCCTTATAAGGGTGTGGAAACCTCTTCTTAGCAGACGTGTTTTAAAACCTTGAGGGGAAACCGGAAGGGAAAACCCAAAGAGGACAATATTTGCTAGTAGTGAGCTTGGGCTATTACAAATGGTATTAGAGCTAGACACAGAGCGGCGTGCCAGCGAGGACGTTAGGCCCCTAAGTGGGGTGGATTGTGAGATCCCACATCGGTTGGAGAGGGAAATGAAACATTCCTTATAAGGGGTGGAAACCTCTCCCTAGCTAACACGTTCTAAAACCTTTAAGAAAAACCCAGATGGAAAACCCCAAAGAGGACATATCTGCTAGCGGTGGACTTGGACTGTTTGTAACAGTGAGGCTACAACATATTTATGGCATAATAAGTAAAGAAATTTCAGATTTCTCGGATCTAAATTTAATTTGGGCCGACTCTTACCTTGTAAAGTTGGATTGTAGATCAATTACTTTTTACTGGCAGAGATTGATATTTTTCAACTTATTATGCTCACACAGACTTTCGAAGACGCATTCGGTGACTCGCCTTTTAAAGCCATTTCTTCCTCTGATGTCCAAGATCAAGCACATGTCCAGCATGGGGAATCATTTTCTGTAGCAACTTATTCCACACCGAACATCCCCGTTCAACCCCAACCGAATTTTCCGCAGCCTCGTGAAGAGTCTCTGCAACATCAGAATATTGGTGTACTAGCTGATCTCCTTCCTCCTCCTGAACCTTTACCTGCTGTTGTTTCACAGCCTGCATTCACAGTATCAAACAGCCAACAAGCTGTATCTGGCTTGCCTGCACAGCCAAATTCTAATAACTTGGGGACTTATCAGCAACATGGAAACGTAGCTCCCGTAAATTTTCAAAATCCAACAGAACCCGGGAGAGAGTTTAACAACGGGATGTTCATGGTTCCGGGAAGTGCACCGACACATGTCAATTCATATATGGCTCCTCCAAATGCTGGACCTAGTACACATCCCAACAATTTTGGATATTCTCAGGATGGTTCGGTAGCTCCCACGAGTTCTCATGTCGCACTTCAAACTACTCAACCGCCTGCTCAGCTTCCTAGTGGGAACTTCAATCCGCCACATGCCTCTGTAGCTCCAGTAGCTTCTCAACTGTCTTACCAAGCCCCTAATTTTCCTGTTAACGTATCTAACCCTGATGTTATGGGTAGTTTTCAAGCAGGAAACTATACATCCATGGCTTCCCAGCAAATTCCTCCATCTGGATCGCTTTCAAATGCATCTCAACCTTCAAAGAACAAGTTCGAGACCAAGTCTACAGTATGGACCGACACATTAAACCGAGGGCTTGTCAATTTAAACATATCTGGACGTGAGTTTCCCCCTCGATTTCATTACATCACTTCGACTTACTGGATTGTTTTGAAAAAACACATCGTAGTTCTTCCTTGTTCATCTTTTTGAATCATAAATTCTCCATGTGCAGCTAAAACAAACCCATTGGCGGATATTGGTGTTGATTTTGAAGCCCTCAATAGGAAAGAGAAGAGGATGGAGAAGCCTAGTACAGCTCCAGCGATATCTACCATTAACATGGGCAAAGCTATGGGATCTGGTTCCGGGATTGGCCGAGCTGGTGCGAGTGCACTAAGGCCACCTCCAAACGCAATGTCGGGTCCGGGAATGGGAATGGGTATGAACCCCAATCCTGGCATGGGCATGGGTATGAGAGGAGGATATGGAGGCATGAATCAACCCATGGGTGGCATGGGGATGAATATGGGTATGGCACAACACGGAATCCAGATGCAGCAGCCTCGAGCGAACATGCCTGGTGCATACAACCCAATGATGGGTGCTGGTGGCTACGCCCCTCAACAGCCACCGTACGGCGGCTACCGATGATATCAGACATGTTATTAGAGTTGTGGAAGAGGCTCTGCTTGTTTATCAGGTTGTAAGATTAGATAGAACACAGCATTAAATATTGGAAGGTCTGCATCCAACCCAAGATTTTCTTATTTGAAGCTATGTTTTCCCCTGAAAGAAGGCCTCGCCTTCATTGTTTTCTACCATCTCGTATTAGTCATTTTTTTTGTGGGTTGAGTTCTGCTTTGAATTTTGTGTGTATTCAATGAGGTCAAAATCTTTTTCTCATCATTTTCCTTGCACGTTCTTGTGATTGTATATTTTGATTGAATACCAAAAGTAAAGAATAAGATACATTGGTCAAAGTTTGTCTCATTCATATTTCAGTTGTTCTAGTATACACATGATATCAGACTCAGTGATAGGCTTGGATCTCTTCTAGG

mRNA sequence

GAAGGTGGGAAGAAAATGTAGAATGATGAGATTCTCAACAGCGAGGAACAGAGACGAATCCCTCTCTTTCCTCTTCTATTCTTTCCAGCTTCGAGCGAATTCCGGGATCGTCTTCTTTCAATCCTCAGATCGGGCTTCTGCTTTCAAATCCACTTACCGCAGTCGATTCAATCAGTTCATTCGGCTACCTCTTTTGCGTCCTTAGTCTTCGGAACTAGGGGCGCTTCTGCTTGTTTCTGTAATTGATAATTACCAGCTAGGATGAAGAAGGCCTTCGATCAAACTGTTAGGGACTTAAAGCGAGAGGTCAATAAGACCGTGCTCAAAATTCCGAAAATAGAGCAGAAGGTTTTGGATGCCACTAGCAACGAGCCGTGGGGTCCTCATGGATCACTTCTTGCAGATATAGCACAGGCAACTCGAAATTATCATGAATATCAAATGATTATGGGAGTAATCTGGAAGCGGATTAACGATACTGGCAAGAATTGGAGGCATGTTTACAAGGGTTTGACCGTTTTGGAGTACTTGGTGGGTCATGGGTCGGAGCGTGTCATAGATGACATCAGAGAACATGTCTATCAGATATCGACCTTATCTAATTTTCAATACATTGACTCGAGCGGGAGGGATCAGGGTAACAATGTTAGAAAGAAATCGCAAAATCTTGTTGCCCTTGTAAATGATAAAGAGAGGATTGTTGAGGTCAGACAGAAAGCTGCTGCTAATAAGGACAAGTTTCACAGTGCAGCATCTATGGGCAGTATGTATAGACCAAGTGCAGGAGCATATGACGACCGCTATGAAGGAAGATACGGAAGCCGTGATGGAGACAGAAATGTCGATTCTTATGGGAGGGAAAGAGAATATGGTTTTAGAGATGATCGTTCCGGTGGAAATGAGGATTCATATGGTCGTGATTATGAAGACCGCTATAATAGAGATGGTTATAGGGATGATGATTATCGAGGAAGAAGCCGAAGCGTTGATGACTATCAGTATGGTTCAAGAAGCCGAAGCTCTGACAGAAATGGAGAACGTACATATGACGATGGTCAAGTTTCTTCTCGCAACAACGATACTAGAGCGAACGAACCATCTCGGGATGAAAGGCCGCTTGAACGTAAATTTTCGGAACAGAATATTGGTGCTCCACCTAGTTACGAAGAAGTTGTGACTGAATCTGAAAGCAATGTGCAAAGTCAAAGAGATGTTGAAGCTCCACCGACTGCTGCTCCGAGAGCGTTTCCCCCACCCACATCCAGTACTCAAAGCCAGCTAACTAGTCATGGAACTGCTGCATCCCCACCTACCCAGGGGCTTGATGGTTCTGATGAATTCGATCCTCGTGGTTCAGTTCCAGCTGCTCCACCTGCCACAAGCAATTTAGAGACAAATTTATTAGACTCCTTGGCCTTAGTGCCTGTTGGGCCAGTAACATCAACTGCAGACTATGAGGGTCATGTTCAGACAAGCTCTGCTGTGGGATACCATGCTCAAAATCAGACTTTCGAAGACGCATTCGGTGACTCGCCTTTTAAAGCCATTTCTTCCTCTGATGTCCAAGATCAAGCACATGTCCAGCATGGGGAATCATTTTCTGTAGCAACTTATTCCACACCGAACATCCCCGTTCAACCCCAACCGAATTTTCCGCAGCCTCGTGAAGAGTCTCTGCAACATCAGAATATTGGTGTACTAGCTGATCTCCTTCCTCCTCCTGAACCTTTACCTGCTGTTGTTTCACAGCCTGCATTCACAGTATCAAACAGCCAACAAGCTGTATCTGGCTTGCCTGCACAGCCAAATTCTAATAACTTGGGGACTTATCAGCAACATGGAAACGTAGCTCCCGTAAATTTTCAAAATCCAACAGAACCCGGGAGAGAGTTTAACAACGGGATGTTCATGGTTCCGGGAAGTGCACCGACACATGTCAATTCATATATGGCTCCTCCAAATGCTGGACCTAGTACACATCCCAACAATTTTGGATATTCTCAGGATGGTTCGGTAGCTCCCACGAGTTCTCATGTCGCACTTCAAACTACTCAACCGCCTGCTCAGCTTCCTAGTGGGAACTTCAATCCGCCACATGCCTCTGTAGCTCCAGTAGCTTCTCAACTGTCTTACCAAGCCCCTAATTTTCCTGTTAACGTATCTAACCCTGATGTTATGGGTAGTTTTCAAGCAGGAAACTATACATCCATGGCTTCCCAGCAAATTCCTCCATCTGGATCGCTTTCAAATGCATCTCAACCTTCAAAGAACAAGTTCGAGACCAAGTCTACAGTATGGACCGACACATTAAACCGAGGGCTTGTCAATTTAAACATATCTGGACCTAAAACAAACCCATTGGCGGATATTGGTGTTGATTTTGAAGCCCTCAATAGGAAAGAGAAGAGGATGGAGAAGCCTAGTACAGCTCCAGCGATATCTACCATTAACATGGGCAAAGCTATGGGATCTGGTTCCGGGATTGGCCGAGCTGGTGCGAGTGCACTAAGGCCACCTCCAAACGCAATGTCGGGTCCGGGAATGGGAATGGGTATGAACCCCAATCCTGGCATGGGCATGGGTATGAGAGGAGGATATGGAGGCATGAATCAACCCATGGGTGGCATGGGGATGAATATGGGTATGGCACAACACGGAATCCAGATGCAGCAGCCTCGAGCGAACATGCCTGGTGCATACAACCCAATGATGGGTGCTGGTGGCTACGCCCCTCAACAGCCACCGTACGGCGGCTACCGATGATATCAGACATGTTATTAGAGTTGTGGAAGAGGCTCTGCTTGTTTATCAGGTTGTAAGATTAGATAGAACACAGCATTAAATATTGGAAGGTCTGCATCCAACCCAAGATTTTCTTATTTGAAGCTATGTTTTCCCCTGAAAGAAGGCCTCGCCTTCATTGTTTTCTACCATCTCGTATTAGTCATTTTTTTTGTGGGTTGAGTTCTGCTTTGAATTTTGTGTGTATTCAATGAGGTCAAAATCTTTTTCTCATCATTTTCCTTGCACGTTCTTGTGATTGTATATTTTGATTGAATACCAAAAGTAAAGAATAAGATACATTGGTCAAAGTTTGTCTCATTCATATTTCAGTTGTTCTAGTATACACATGATATCAGACTCAGTGATAGGCTTGGATCTCTTCTAGG

Coding sequence (CDS)

ATGAAGAAGGCCTTCGATCAAACTGTTAGGGACTTAAAGCGAGAGGTCAATAAGACCGTGCTCAAAATTCCGAAAATAGAGCAGAAGGTTTTGGATGCCACTAGCAACGAGCCGTGGGGTCCTCATGGATCACTTCTTGCAGATATAGCACAGGCAACTCGAAATTATCATGAATATCAAATGATTATGGGAGTAATCTGGAAGCGGATTAACGATACTGGCAAGAATTGGAGGCATGTTTACAAGGGTTTGACCGTTTTGGAGTACTTGGTGGGTCATGGGTCGGAGCGTGTCATAGATGACATCAGAGAACATGTCTATCAGATATCGACCTTATCTAATTTTCAATACATTGACTCGAGCGGGAGGGATCAGGGTAACAATGTTAGAAAGAAATCGCAAAATCTTGTTGCCCTTGTAAATGATAAAGAGAGGATTGTTGAGGTCAGACAGAAAGCTGCTGCTAATAAGGACAAGTTTCACAGTGCAGCATCTATGGGCAGTATGTATAGACCAAGTGCAGGAGCATATGACGACCGCTATGAAGGAAGATACGGAAGCCGTGATGGAGACAGAAATGTCGATTCTTATGGGAGGGAAAGAGAATATGGTTTTAGAGATGATCGTTCCGGTGGAAATGAGGATTCATATGGTCGTGATTATGAAGACCGCTATAATAGAGATGGTTATAGGGATGATGATTATCGAGGAAGAAGCCGAAGCGTTGATGACTATCAGTATGGTTCAAGAAGCCGAAGCTCTGACAGAAATGGAGAACGTACATATGACGATGGTCAAGTTTCTTCTCGCAACAACGATACTAGAGCGAACGAACCATCTCGGGATGAAAGGCCGCTTGAACGTAAATTTTCGGAACAGAATATTGGTGCTCCACCTAGTTACGAAGAAGTTGTGACTGAATCTGAAAGCAATGTGCAAAGTCAAAGAGATGTTGAAGCTCCACCGACTGCTGCTCCGAGAGCGTTTCCCCCACCCACATCCAGTACTCAAAGCCAGCTAACTAGTCATGGAACTGCTGCATCCCCACCTACCCAGGGGCTTGATGGTTCTGATGAATTCGATCCTCGTGGTTCAGTTCCAGCTGCTCCACCTGCCACAAGCAATTTAGAGACAAATTTATTAGACTCCTTGGCCTTAGTGCCTGTTGGGCCAGTAACATCAACTGCAGACTATGAGGGTCATGTTCAGACAAGCTCTGCTGTGGGATACCATGCTCAAAATCAGACTTTCGAAGACGCATTCGGTGACTCGCCTTTTAAAGCCATTTCTTCCTCTGATGTCCAAGATCAAGCACATGTCCAGCATGGGGAATCATTTTCTGTAGCAACTTATTCCACACCGAACATCCCCGTTCAACCCCAACCGAATTTTCCGCAGCCTCGTGAAGAGTCTCTGCAACATCAGAATATTGGTGTACTAGCTGATCTCCTTCCTCCTCCTGAACCTTTACCTGCTGTTGTTTCACAGCCTGCATTCACAGTATCAAACAGCCAACAAGCTGTATCTGGCTTGCCTGCACAGCCAAATTCTAATAACTTGGGGACTTATCAGCAACATGGAAACGTAGCTCCCGTAAATTTTCAAAATCCAACAGAACCCGGGAGAGAGTTTAACAACGGGATGTTCATGGTTCCGGGAAGTGCACCGACACATGTCAATTCATATATGGCTCCTCCAAATGCTGGACCTAGTACACATCCCAACAATTTTGGATATTCTCAGGATGGTTCGGTAGCTCCCACGAGTTCTCATGTCGCACTTCAAACTACTCAACCGCCTGCTCAGCTTCCTAGTGGGAACTTCAATCCGCCACATGCCTCTGTAGCTCCAGTAGCTTCTCAACTGTCTTACCAAGCCCCTAATTTTCCTGTTAACGTATCTAACCCTGATGTTATGGGTAGTTTTCAAGCAGGAAACTATACATCCATGGCTTCCCAGCAAATTCCTCCATCTGGATCGCTTTCAAATGCATCTCAACCTTCAAAGAACAAGTTCGAGACCAAGTCTACAGTATGGACCGACACATTAAACCGAGGGCTTGTCAATTTAAACATATCTGGACCTAAAACAAACCCATTGGCGGATATTGGTGTTGATTTTGAAGCCCTCAATAGGAAAGAGAAGAGGATGGAGAAGCCTAGTACAGCTCCAGCGATATCTACCATTAACATGGGCAAAGCTATGGGATCTGGTTCCGGGATTGGCCGAGCTGGTGCGAGTGCACTAAGGCCACCTCCAAACGCAATGTCGGGTCCGGGAATGGGAATGGGTATGAACCCCAATCCTGGCATGGGCATGGGTATGAGAGGAGGATATGGAGGCATGAATCAACCCATGGGTGGCATGGGGATGAATATGGGTATGGCACAACACGGAATCCAGATGCAGCAGCCTCGAGCGAACATGCCTGGTGCATACAACCCAATGATGGGTGCTGGTGGCTACGCCCCTCAACAGCCACCGTACGGCGGCTACCGATGA

Protein sequence

MKKAFDQTVRDLKREVNKTVLKIPKIEQKVLDATSNEPWGPHGSLLADIAQATRNYHEYQMIMGVIWKRINDTGKNWRHVYKGLTVLEYLVGHGSERVIDDIREHVYQISTLSNFQYIDSSGRDQGNNVRKKSQNLVALVNDKERIVEVRQKAAANKDKFHSAASMGSMYRPSAGAYDDRYEGRYGSRDGDRNVDSYGREREYGFRDDRSGGNEDSYGRDYEDRYNRDGYRDDDYRGRSRSVDDYQYGSRSRSSDRNGERTYDDGQVSSRNNDTRANEPSRDERPLERKFSEQNIGAPPSYEEVVTESESNVQSQRDVEAPPTAAPRAFPPPTSSTQSQLTSHGTAASPPTQGLDGSDEFDPRGSVPAAPPATSNLETNLLDSLALVPVGPVTSTADYEGHVQTSSAVGYHAQNQTFEDAFGDSPFKAISSSDVQDQAHVQHGESFSVATYSTPNIPVQPQPNFPQPREESLQHQNIGVLADLLPPPEPLPAVVSQPAFTVSNSQQAVSGLPAQPNSNNLGTYQQHGNVAPVNFQNPTEPGREFNNGMFMVPGSAPTHVNSYMAPPNAGPSTHPNNFGYSQDGSVAPTSSHVALQTTQPPAQLPSGNFNPPHASVAPVASQLSYQAPNFPVNVSNPDVMGSFQAGNYTSMASQQIPPSGSLSNASQPSKNKFETKSTVWTDTLNRGLVNLNISGPKTNPLADIGVDFEALNRKEKRMEKPSTAPAISTINMGKAMGSGSGIGRAGASALRPPPNAMSGPGMGMGMNPNPGMGMGMRGGYGGMNQPMGGMGMNMGMAQHGIQMQQPRANMPGAYNPMMGAGGYAPQQPPYGGYR
BLAST of Cp4.1LG08g05680 vs. Swiss-Prot
Match: EPN2_ARATH (Clathrin interactor EPSIN 2 OS=Arabidopsis thaliana GN=EPSIN2 PE=1 SV=1)

HSP 1 Score: 480.3 bits (1235), Expect = 4.2e-134
Identity = 407/947 (42.98%), Postives = 494/947 (52.16%), Query Frame = 1

Query: 1   MKKAFDQTVRDLKREVNKTVLKIPKIEQKVLDATSNEPWGPHGSLLADIAQATRNYHEYQ 60
           MKK F QTVRDLKREVNK VLK+P +EQKVLDATSNEPWGPHGSLLAD+AQA+RNYHEYQ
Sbjct: 1   MKKVFGQTVRDLKREVNKKVLKVPGVEQKVLDATSNEPWGPHGSLLADLAQASRNYHEYQ 60

Query: 61  MIMGVIWKRINDTGKNWRHVYKGLTVLEYLVGHGSERVIDDIREHVYQISTLSNFQYIDS 120
           +IM VIWKR++DTGKNWRHVYK LTVLEY+VGHGSERVID+IRE  YQISTLS+FQYIDS
Sbjct: 61  LIMVVIWKRLSDTGKNWRHVYKALTVLEYMVGHGSERVIDEIRERAYQISTLSDFQYIDS 120

Query: 121 SGRDQGNNVRKKSQNLVALVNDKERIVEVRQKAAANKDKFHSAASMGSMYRPSAGAYDDR 180
            GRDQG+NVRKKSQ+LVALVNDKERI EVRQKAAAN+DK+ S+A  G MY+PS G Y D+
Sbjct: 121 GGRDQGSNVRKKSQSLVALVNDKERIAEVRQKAAANRDKYRSSAP-GGMYKPSGG-YGDK 180

Query: 181 Y--------------EGRYGSRDGDRNV-----------DSYGREREYGFRDD------R 240
           Y              E  YG RD DRN            D YGR+   G RDD      R
Sbjct: 181 YDYGSRDEERSSYGREREYGYRDDDRNSRDGDRHSRDSEDRYGRD---GNRDDDYRGRSR 240

Query: 241 SGGNEDSYGRDYEDRYNRDGYRDDDYRGRSRSVDDYQYGSRSRSSDRNG--ERTYDDGQV 300
           S  N  S GR  E     DG+     RG     DD        S D  G  +R + +  +
Sbjct: 241 SVDNYGSRGRSSEREREDDGHSSS--RGSGARADD-------NSQDGRGGLQRKFSEQNI 300

Query: 301 SSRNNDTRANEPSRDERPLERKFSEQNIGAPPSYEEVVTESESNVQ-SQRDVEAPPTAAP 360
            +  +   A   SR        +SE++ G  P        S    Q +  +  +PPT   
Sbjct: 301 GAPPSYEEAVSDSRSP-----VYSERDGGETPQVTAPGAASPPPPQVAAPEAASPPTGTN 360

Query: 361 RAFPPPTSSTQSQLTSHGT----------AASPPTQGLDGSDEFDPRGSVPAAPPATSNL 420
            A    T   +S      T          +A PP           P  +  +AP  ++++
Sbjct: 361 TANTTATFVNESPSQKVETFDEFDPRSAFSAGPPAYASTDGVTAPPTVTSMSAPTTSNSV 420

Query: 421 ETNLLDSLALV------PVGPVTST-ADYEGHVQTSSAVGYHAQN---QTFEDAFGDSPF 480
           E +LL SLA V       + P  S   +  G      A  +       Q+F+D FGDSPF
Sbjct: 421 EMDLLGSLADVFSSNALAIVPADSIYVETNGQANAGPAPSFSTSQPSTQSFDDPFGDSPF 480

Query: 481 KAISSSDVQ-------------------------DQAH-VQHGESFSVATYSTP-NIPVQ 540
           KA +S+D                           D AH    G+SFS      P +  VQ
Sbjct: 481 KAFTSTDTDSTPQQNFGASFQPPPPAFTSEVSHPDTAHNFGFGDSFSAVANPDPASQNVQ 540

Query: 541 PQPNFPQ-PREESLQHQN-IGVLADLLPPPEPLPAVVSQPAFTVSNSQQAVSGLPAQPNS 600
           P  N P  P+E+    Q+ I +LA +LPP  P   V S P+   S            P+ 
Sbjct: 541 PPSNSPGFPQEQFATSQSGIDILAGILPPSGP--PVQSGPSIPTSQFP---------PSG 600

Query: 601 NNLGTYQQHGNVAPVNFQNPTEPGREFNNGMFMVPGSAPTHVNSYMAPP-NAGPSTHPNN 660
           NN+  Y+   +  PV+   P  PG+             P   N   A P N+G   H   
Sbjct: 601 NNM--YEGFHSQPPVSTA-PNLPGQTPFGQAVQPYNMVPHSQNMTGAMPFNSGGFMH--- 660

Query: 661 FGYSQDGSVAPTSSHVALQTTQPPAQLPSGNFNPPHASVAPVASQLSYQAPNFPVNVS-N 720
               Q GS  P S+             P+G F        P  S    +  + PV +  N
Sbjct: 661 ----QPGSQTPYSTPSG----------PAGQFMAHQGHGMP-PSHGPQRTQSGPVTLQGN 720

Query: 721 PDVMGSF--QAGNYTSMASQQIPPSGSLSNASQ---PSKNKFETKSTVWTDTLNRGLVNL 780
            +VMG    QA   +  +S   P    L+ A +   P + KFE KS+VW DTL+RGLVN 
Sbjct: 721 NNVMGDMFSQATPNSLTSSSSHPDLTPLTGAIEIVPPPQKKFEPKSSVWADTLSRGLVNF 780

Query: 781 NISGPKTNPLADIGVDFEALNRKEKRMEKPSTAPAISTINMGKAMGSGSGIGRAGASALR 834
           NISG KTNPLADIGVDFEA+NR+EKR+EK +  PA STINMGKAMGSG+G+GR+GA+A+R
Sbjct: 781 NISGSKTNPLADIGVDFEAINRREKRLEKQTNTPATSTINMGKAMGSGTGLGRSGATAMR 840

BLAST of Cp4.1LG08g05680 vs. Swiss-Prot
Match: EPN3_ARATH (Clathrin interactor EPSIN 3 OS=Arabidopsis thaliana GN=EPSIN3 PE=2 SV=1)

HSP 1 Score: 447.2 bits (1149), Expect = 3.9e-124
Identity = 324/708 (45.76%), Postives = 404/708 (57.06%), Query Frame = 1

Query: 1   MKKAFDQTVRDLKREVNKTVLKIPKIEQKVLDATSNEPWGPHGSLLADIAQATRNYHEYQ 60
           MKKAF QTVRDLKR VNK VLK+P IEQKVLDATSNE WGPHGSLLADIA A+RNYHEYQ
Sbjct: 1   MKKAFGQTVRDLKRGVNKKVLKVPGIEQKVLDATSNESWGPHGSLLADIAHASRNYHEYQ 60

Query: 61  MIMGVIWKRINDTGKNWRHVYKGLTVLEYLVGHGSERVIDDIREHVYQISTLSNFQYIDS 120
           + MGV+WKR++D+GKNWRHVYK LTVLEY+VGHGSERVI++++EH YQI+TLS FQYIDS
Sbjct: 61  ITMGVLWKRLSDSGKNWRHVYKALTVLEYMVGHGSERVIEEVKEHAYQITTLSGFQYIDS 120

Query: 121 SGRDQGNNVRKKSQNLVALVNDKERIVEVRQKAAANKDKFHSAASMGSMYRPSAGAYDDR 180
           SG+DQG+NVRKK+Q+LVALVNDKERI EVR+KAAAN+DK+H+     SM+RPS G Y D+
Sbjct: 121 SGKDQGSNVRKKAQSLVALVNDKERITEVREKAAANRDKYHN-----SMHRPS-GGYGDK 180

Query: 181 --YEGRYGSRDGDRNVDSYGREREYGFR-DDRSGGNEDSYGRDYEDRYNRDGYRDDDYRG 240
             YEGRYG RD  R+  SYG+EREYG+R DDR+  + D Y RD EDRY RDG  DD+YRG
Sbjct: 181 YDYEGRYGDRDEGRS--SYGKEREYGYRDDDRNSRDGDRYSRDSEDRYGRDGNTDDEYRG 240

Query: 241 RSRSVDDYQYGSRSRSSDRNGERTYDDGQVSSRNNDTRANEPSRDER-PLERKFSEQNIG 300
           RSRSVD+Y  GSR RSSDR      DDGQ SSR++   A++ S+D R  LERKFSEQNIG
Sbjct: 241 RSRSVDNYN-GSRGRSSDRE-RPIEDDGQSSSRDSGAPADDHSQDGRGGLERKFSEQNIG 300

Query: 301 -APPSYEEVVTESESNVQSQRD-VEAPPTAAPRAFPPPTS---STQSQLTSHGTAASPPT 360
            APPSYEE V+ES S V S+RD  E P  A P A   P +   S  ++       +SP  
Sbjct: 301 AAPPSYEEAVSESRSPVYSERDGGETPQVAPPGAAASPLAENISVDNKAADFVNESSP-- 360

Query: 361 QGLDGSDEFDPRGSVPA----------------------APPATSNLETNLLDSLALV-- 420
           Q ++  DEFDPRGSV A                      APPA+ N E +LL SL+ V  
Sbjct: 361 QQVEAFDEFDPRGSVSAACAPTAGASVPAPIPPTVVSTPAPPASINAEMDLLGSLSDVFS 420

Query: 421 --PVGPVTS---TADYEGHVQTSSAVGY---HAQNQTFEDAFGDSPFKAISSSDVQDQAH 480
             P+  VTS   + +  G   T  A  +    +  Q F+D FGDSPFKAI+S+D +   H
Sbjct: 421 PNPLAIVTSDSTSVETNGQANTGLAPSFSTSQSSTQPFDDPFGDSPFKAITSADTETSQH 480

Query: 481 VQHGESFSVATYSTPNIPVQPQPNFPQPREESLQHQNIG------VLADLLPPPEPLPA- 540
              G            +P QP P    P  E     N G       + D  P  + + A 
Sbjct: 481 QSFG------------VPFQPTPPTSNPNNE----HNFGFGEAFSAVTDSEPGVQNMQAP 540

Query: 541 ----VVSQPAFTVSNSQQAVSGLPAQPNSNNLGTYQQHGNVAPVNFQNPTEPGREFNNGM 600
               V  Q  F  S S+  +      P+   +    Q  +  P +  +P     E  +  
Sbjct: 541 PNLSVFPQEQFDTSQSEIDILAGILPPSGPPVSLSPQPDSTMPTSQFHPNGNSYESYHHQ 600

Query: 601 FMVPGSAPTHVNSYMAPPNAGPSTHPNNFGYSQ---------DGSVAPTSSHVALQTTQP 648
                +APT +N     P    S   N   +SQ         +G       +    T+QP
Sbjct: 601 -----AAPTDLNMQGQTPFGQASQQFNMVSHSQNHHEGMQFNNGGFTQQPGYAGPATSQP 660

BLAST of Cp4.1LG08g05680 vs. Swiss-Prot
Match: EPN1_ARATH (Clathrin interactor EPSIN 1 OS=Arabidopsis thaliana GN=EPSIN1 PE=1 SV=1)

HSP 1 Score: 207.6 bits (527), Expect = 5.2e-52
Identity = 164/497 (33.00%), Postives = 267/497 (53.72%), Query Frame = 1

Query: 3   KAFDQTVRDLKREVNKTVLKIPKIEQKVLDATSNEPWGPHGSLLADIAQATRNYHEYQMI 62
           K FDQTVR++KREVN  VLK+P++EQKVLDAT NEPWGPHG+ LA+IAQAT+ + E QM+
Sbjct: 5   KVFDQTVREIKREVNLKVLKVPEMEQKVLDATDNEPWGPHGTALAEIAQATKKFSECQMV 64

Query: 63  MGVIWKRINDTGKNWRHVYKGLTVLEYLVGHGSERVIDDIREHVYQISTLSNFQYIDSSG 122
           M V+W R+++TGK+WR+VYK L V++YL+ +GSER +D+I EH YQIS+L++F+Y++ +G
Sbjct: 65  MSVLWTRLSETGKDWRYVYKALAVIDYLISNGSERAVDEIIEHTYQISSLTSFEYVEPNG 124

Query: 123 RDQGNNVRKKSQNLVALVNDKERIVEVRQKAAANKDKFHSAASMGSMYRP-SAGAYDDRY 182
           +D G NVRKK++N+VAL+N+KE+I E+R KA AN++K+   +S G  Y+  S+ ++   +
Sbjct: 125 KDVGINVRKKAENIVALLNNKEKISEIRDKAVANRNKYVGLSSTGITYKSGSSASFGGSF 184

Query: 183 EGRYGSRDGDRNVDSYGREREY-GFRDDRSGGNEDSYGRDYEDRYNRDGYRD-DDYRGRS 242
           +    + D  ++ DS   + +Y  F+  R G   +      +  ++R G  D D+     
Sbjct: 185 QSGSSNFDSYKDRDSREDKNDYESFQKSRRGVKTEEQSYTSKKSFSRYGSTDHDNLSSGK 244

Query: 243 RSVDDYQYGSRSRSSDRNGERTYDDGQVSSRNNDTRANEPSRDERPLERKFSEQNIG--- 302
           +S D  ++ S   ++  N +  +DD         T +N+PS         F    IG   
Sbjct: 245 KSPDSAKHRSYVSAAPSNNDDDFDDFD----PRGTSSNKPSTGSANQVDLFGGDLIGDFL 304

Query: 303 -APPSYEEVVTESESNVQSQRDVEAPPTAAPRAFPPPTSSTQSQL-------TSHGTAAS 362
            + P+       +E+  ++    +A   +A        S TQ Q+        S   +++
Sbjct: 305 DSGPTETSSTNNNENFQEADLFADAAFVSASAQGAEFGSQTQKQVDLFSASEPSVTVSSA 364

Query: 363 PPTQGLDGSDEFDPRGSVPAAPPATSNLETNLLDSLALVPVGPVTSTADYEGHVQTSSAV 422
           PPT  L  S E         + P  S    N++D  A VP+     +  +      S++V
Sbjct: 365 PPTVDLFASSESVVSPEAKISIP-ESMATPNIVDPFAAVPMDNFDGSDPFGAFTSHSASV 424

Query: 423 GYHAQNQTFEDAFGDSPFKAISSSDVQDQAHVQHGESFSVAT--YSTPNIPVQPQPNFPQ 482
               Q  +   +   +    +S +D + Q H+Q  + F V +  ++          N   
Sbjct: 425 STGPQAPSVHGS-ATNTTSPLSFADSKPQ-HLQKKDPFQVKSGIWADSLSRGLIDLNITA 484

Query: 483 PREESLQHQNIGVLADL 484
           P++ SL   ++GV+ DL
Sbjct: 485 PKKASL--ADVGVVGDL 492

BLAST of Cp4.1LG08g05680 vs. Swiss-Prot
Match: EPN1_RAT (Epsin-1 OS=Rattus norvegicus GN=Epn1 PE=1 SV=1)

HSP 1 Score: 158.3 bits (399), Expect = 3.6e-37
Identity = 75/157 (47.77%), Postives = 107/157 (68.15%), Query Frame = 1

Query: 12  LKREVNKTVLKIPKIEQKVLDATSNEPWGPHGSLLADIAQATRNYHEYQMIMGVIWKRIN 71
           L+R++   V    + E KV +ATSN+PWGP  SL+++IA  T N   +  IM +IWKR+N
Sbjct: 6   LRRQMKNIVHNYSEAEIKVREATSNDPWGPSSSLMSEIADLTYNVVAFSEIMSMIWKRLN 65

Query: 72  DTGKNWRHVYKGLTVLEYLVGHGSERVIDDIREHVYQISTLSNFQYIDSSGRDQGNNVRK 131
           D GKNWRHVYK +T++EYL+  GSERV    +E++Y + TL +FQY+D  G+DQG NVR+
Sbjct: 66  DHGKNWRHVYKAMTLMEYLIKTGSERVSQQCKENMYAVQTLKDFQYVDRDGKDQGVNVRE 125

Query: 132 KSQNLVALVNDKERIVEVRQKAAANKDKFHSAASMGS 169
           K++ LVAL+ D++R+ E R  A   K+K    A+  S
Sbjct: 126 KAKQLVALLRDEDRLREERAHALKTKEKLAQTATASS 162

BLAST of Cp4.1LG08g05680 vs. Swiss-Prot
Match: EPN1_HUMAN (Epsin-1 OS=Homo sapiens GN=EPN1 PE=1 SV=2)

HSP 1 Score: 158.3 bits (399), Expect = 3.6e-37
Identity = 75/157 (47.77%), Postives = 107/157 (68.15%), Query Frame = 1

Query: 12  LKREVNKTVLKIPKIEQKVLDATSNEPWGPHGSLLADIAQATRNYHEYQMIMGVIWKRIN 71
           L+R++   V    + E KV +ATSN+PWGP  SL+++IA  T N   +  IM +IWKR+N
Sbjct: 6   LRRQMKNIVHNYSEAEIKVREATSNDPWGPSSSLMSEIADLTYNVVAFSEIMSMIWKRLN 65

Query: 72  DTGKNWRHVYKGLTVLEYLVGHGSERVIDDIREHVYQISTLSNFQYIDSSGRDQGNNVRK 131
           D GKNWRHVYK +T++EYL+  GSERV    +E++Y + TL +FQY+D  G+DQG NVR+
Sbjct: 66  DHGKNWRHVYKAMTLMEYLIKTGSERVSQQCKENMYAVQTLKDFQYVDRDGKDQGVNVRE 125

Query: 132 KSQNLVALVNDKERIVEVRQKAAANKDKFHSAASMGS 169
           K++ LVAL+ D++R+ E R  A   K+K    A+  S
Sbjct: 126 KAKQLVALLRDEDRLREERAHALKTKEKLAQTATASS 162

BLAST of Cp4.1LG08g05680 vs. TrEMBL
Match: A0A0A0KZ33_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G017040 PE=4 SV=1)

HSP 1 Score: 1283.9 bits (3321), Expect = 0.0e+00
Identity = 691/850 (81.29%), Postives = 729/850 (85.76%), Query Frame = 1

Query: 1   MKKAFDQTVRDLKREVNKTVLKIPKIEQKVLDATSNEPWGPHGSLLADIAQATRNYHEYQ 60
           MKKAFDQTVRDLKREVNKTVLKIPK+EQKVLDATSNEPWGPHGSLLADIAQATRNYHEYQ
Sbjct: 1   MKKAFDQTVRDLKREVNKTVLKIPKVEQKVLDATSNEPWGPHGSLLADIAQATRNYHEYQ 60

Query: 61  MIMGVIWKRINDTGKNWRHVYKGLTVLEYLVGHGSERVIDDIREHVYQISTLSNFQYIDS 120
           MIMG++WKRINDTGKNWRHVYKGLTVLEYLVGHGSERVIDDIREH YQISTLS+FQYIDS
Sbjct: 61  MIMGILWKRINDTGKNWRHVYKGLTVLEYLVGHGSERVIDDIREHAYQISTLSDFQYIDS 120

Query: 121 SGRDQGNNVRKKSQNLVALVNDKERIVEVRQKAAANKDKFHSAASMGSMYRPSAGAYDDR 180
           +GRDQGNNVRKKSQNLVALVNDKERI+EVRQKAAAN+DKF SA+SMGSMYRP +G YDDR
Sbjct: 121 NGRDQGNNVRKKSQNLVALVNDKERIIEVRQKAAANRDKFRSASSMGSMYRPGSGGYDDR 180

Query: 181 YEGRYGSRDGDRNVDSYGREREYGFRDDRSGGNEDSYGRDYEDRYNRDGYRDDDYRGRSR 240
           YEGRYG RDGDRNVDSYGRER+YGFRDDRSG NEDSYGRDYE+RYNRDGY+DDDYRGRSR
Sbjct: 181 YEGRYGGRDGDRNVDSYGRERDYGFRDDRSGRNEDSYGRDYEERYNRDGYKDDDYRGRSR 240

Query: 241 SVDDYQYGSRSRSSDRNGERTY-DDGQVSSRNNDTRANEPSRDERPLERKFSEQNIGAPP 300
           S+DDYQYGSRSRSSDR+GER Y DDGQVSSRN+  R +EPS+  R LERKFSEQNI APP
Sbjct: 241 SIDDYQYGSRSRSSDRDGERAYDDDGQVSSRNSGARPDEPSQVGRQLERKFSEQNI-APP 300

Query: 301 SYEEVVTESESNVQSQRDVEAPPTAAPRAFPPPTSSTQSQLTSHGTAASPPTQGLDGSDE 360
           SYEE V ES S V SQR+VEAP T APRAFPPP  ST SQ T+HGT ASP  QG DGSDE
Sbjct: 301 SYEEAVNESGSTVPSQREVEAPATTAPRAFPPPVPSTPSQQTTHGTTASPLPQGFDGSDE 360

Query: 361 FDPRGSVPAAPPATSNLETNLLDSLALVPVGPVTSTADYEGHVQTSSAVGYHAQNQTFED 420
           FDPRGSVP AP A+SNLE NL DSLALVPVGPVTS+AD E HVQTSSAVG   QNQTFED
Sbjct: 361 FDPRGSVPVAPNASSNLEANLFDSLALVPVGPVTSSADSESHVQTSSAVGSFTQNQTFED 420

Query: 421 AFGDSPFKAISSSDVQDQAHVQHGESFSVATYSTPNIPVQPQPNFPQPREESLQHQNIGV 480
            FGDSPFKAISSS VQDQ + Q GESFS ATYSTPN+PVQPQPN   PREE+LQHQNIGV
Sbjct: 421 PFGDSPFKAISSSGVQDQTYFQRGESFSAATYSTPNVPVQPQPNLHHPREETLQHQNIGV 480

Query: 481 LADLLPPPEPLPAVVSQPAFT----VSNSQQAVSGLPAQPNSNNLGTYQQHGNVAPVNFQ 540
           LADLL PPE LPA VSQP FT    V  +  A SGLPAQPNS NLG YQQ GN+APVNFQ
Sbjct: 481 LADLL-PPETLPAAVSQPTFTSNQPVQPNSHAASGLPAQPNS-NLGNYQQDGNIAPVNFQ 540

Query: 541 NPTEPGREFNNGMFMVPGSAPTHVNSYMAPPNAGPSTHPNNFGYSQDGSVAPTSSHVALQ 600
           N TEPGREF NGMF+ PG  P H  SYMAPPNAGP+  PNNFG   +GS  P SSH+ LQ
Sbjct: 541 NQTEPGREFGNGMFVAPGGIPAH-GSYMAPPNAGPNAQPNNFGTYHNGSAVPASSHLTLQ 600

Query: 601 TTQPPAQLPSG-NFNPPHASVAPVASQLSYQAPNFPVNVSNPDVMGSF--QAGNYTSMAS 660
           TT+PPA LPSG NFNPP  S   VASQ+SYQ  NFPV  S  +VMGSF  QAGNYTSMAS
Sbjct: 601 TTRPPAHLPSGNNFNPPQGS---VASQVSYQTSNFPVVKS--EVMGSFNSQAGNYTSMAS 660

Query: 661 QQIPPSGSLSNASQPSKNKFETKSTVWTDTLNRGLVNLNISGPKTNPLADIGVDFEALNR 720
           QQ PP+GSLS ASQ S NKFETKSTVW+DTL+RGLVNLNISGPK NP+ADIGVDFEALNR
Sbjct: 661 QQNPPAGSLSTASQASNNKFETKSTVWSDTLSRGLVNLNISGPKANPMADIGVDFEALNR 720

Query: 721 KEKRMEKPSTAPAISTINMGKAMGSGSGIGRAGASALRPPPNAMSGP--------GMGMG 780
           KEKRMEKPSTAP +STINMGKAMGSGSGIGR GASALRPPPNAMSG         GMGMG
Sbjct: 721 KEKRMEKPSTAPVVSTINMGKAMGSGSGIGRVGASALRPPPNAMSGSGSGMGMGMGMGMG 780

Query: 781 MNPNPGMGMGM-RGGYGGMNQPMGGMGMNMGMAQHGIQMQQPRANMPGAYNPMMGAGGYA 834
           MNPNPGMGMGM   GYGGMNQPMGGMGMNMGM Q G QMQQPRANMPG YNPMMG GGYA
Sbjct: 781 MNPNPGMGMGMGMRGYGGMNQPMGGMGMNMGMGQQGFQMQQPRANMPGVYNPMMGGGGYA 840

BLAST of Cp4.1LG08g05680 vs. TrEMBL
Match: B9IAE1_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0014s14930g PE=4 SV=2)

HSP 1 Score: 750.4 bits (1936), Expect = 2.4e-213
Identity = 504/937 (53.79%), Postives = 590/937 (62.97%), Query Frame = 1

Query: 1   MKKAFDQTVRDLKREVNKTVLKIPKIEQKVLDATSNEPWGPHGSLLADIAQATRNYHEYQ 60
           MKK F QTVRD KREVNK VLK+P IEQKVLDATSNEPWGPHGSLLADIAQA+RNYHEYQ
Sbjct: 1   MKKVFGQTVRDFKREVNKKVLKVPSIEQKVLDATSNEPWGPHGSLLADIAQASRNYHEYQ 60

Query: 61  MIMGVIWKRINDTGKNWRHVYKGLTVLEYLVGHGSERVIDDIREHVYQISTLSNFQYIDS 120
           MIM V+WKRINDTGKNWRHVYK LTVLEYLV HGSER ID+IREH YQI+TLS+FQYIDS
Sbjct: 61  MIMAVLWKRINDTGKNWRHVYKALTVLEYLVAHGSERAIDEIREHSYQITTLSDFQYIDS 120

Query: 121 SGRDQGNNVRKKSQNLVALVNDKERIVEVRQKAAANKDKFHSAASMGSMYRPSAGAYDDR 180
           SG+DQGNNVRKKSQ+LV LVNDKERIVE RQKAAAN+DKF +A S G M RP  G+YDD 
Sbjct: 121 SGKDQGNNVRKKSQSLVVLVNDKERIVEARQKAAANRDKFRNA-SPGGMNRP--GSYDD- 180

Query: 181 YEGRYGSRDGDRNVDSYGREREYGFRDD-RSGGNEDSYGRDY----EDRYNRDGYRDDDY 240
            +GRYG+RD DRN   YG+EREY +RDD R G   DSYGRD     E+RY RDGYRDDDY
Sbjct: 181 -DGRYGNRDEDRNGYGYGKEREYNYRDDERYGKYGDSYGRDGDHNGEERYGRDGYRDDDY 240

Query: 241 RGRSRSVDDYQYGSRSRSSDRNGERTYDD-GQVSSRNNDTRANEPSRD---ERPLERKFS 300
           +GRSRS+DDY  GSRSRSSDR+ +  +DD GQ SSR    RA++ S D    + LERKFS
Sbjct: 241 QGRSRSIDDY--GSRSRSSDRDRDHAFDDDGQSSSRG--ARADDQSHDGSIAKRLERKFS 300

Query: 301 EQNIGAPPSYEEVVTESESNVQSQRDVEA----PPTA----APRAFPPPTSSTQS----- 360
           EQNI  PPSYEE ++ES S   S+R+ EA     P A    APR+F PP  +  S     
Sbjct: 301 EQNISGPPSYEEALSESRSPAHSERNGEALAVPAPVASSPPAPRSFSPPAFNAASPPPSN 360

Query: 361 ---QLTSHGTAASPPTQGLDGSDEFDPRGSVPAAPPATS------------NLETNLLDS 420
              + T   T ASP  Q +  +DEFDPRG + A P ATS            N E +LL S
Sbjct: 361 PGQENTFFATPASPADQEVVVADEFDPRGPISAPPTATSVQTASAFTPTSNNAEMDLLGS 420

Query: 421 L---------ALVPVGPVTSTADYEGHVQTSSAVGYHAQN--------QTFEDAFGDSPF 480
           L         A++PV   T+T++ +     S ++    Q+        Q FED FGDSPF
Sbjct: 421 LSDVFTPNPLAIMPVTSATTTSEADSQTNFSGSMFAATQSPSNDIPLMQAFEDPFGDSPF 480

Query: 481 KAI----------------------SSSDVQDQAHVQHGESFSVATYSTPNI-PVQPQPN 540
           KA                        ++++ +     +G++FS  TYS PN+ P    P+
Sbjct: 481 KATPTDAFSAQQPTASSAPFQPTMNQNTEMPNAVAPPNGDTFSAMTYSAPNVQPPSTNPH 540

Query: 541 FPQPREESLQHQNIGVLADLLPPPEPLPAVVSQPAFTVSNSQQAVSGLPAQPNSNNLGTY 600
           F  P+E S  H    +LAD+LPP  P  AV SQ  F++ + Q        QP ++  G +
Sbjct: 541 F-LPQEMSSSHPETDILADILPPSGP-SAVASQAGFSLPSGQHP------QPGASVYGNF 600

Query: 601 QQHGNVAPVNFQNPTEP-----GREFNNGMFMVPGSAPTHVNSYMA-PPNAGPSTHPNNF 660
               N  P N   P  P     G++ ++  F   G +P  ++S M+  P AGP    NN 
Sbjct: 601 ----NSPPGNMVLPAAPHMAPQGQQLSSANFFTQGGSPAPIHSNMSLQPPAGPVVLFNNG 660

Query: 661 GY-SQDGSVAPTSSHVALQT-TQPPAQLPSGNFNPPHASVAPVASQLSYQAPNFPVNVSN 720
               Q GS AP  S  +  T T    Q  SGNF P H S  PVASQ +YQ P       N
Sbjct: 661 NLVPQQGSTAPVVSQFSHHTPTGSAPQYNSGNFLPQHGSTFPVASQFTYQTPPASSPQHN 720

Query: 721 PDVMGS-FQAGNYTSMASQQIPPS--GSLSNASQPSKNKFETKSTVWTDTLNRGLVNLNI 780
            DV+G+ F  G  TSMASQ   PS  GSL+   QPSK+KFETKSTVW DTL+RGLVNLNI
Sbjct: 721 -DVLGNLFSQGPNTSMASQTALPSSTGSLAIVPQPSKDKFETKSTVWADTLSRGLVNLNI 780

Query: 781 SGPKTNPLADIGVDFEALNRKEKRMEKPSTAPAISTINMGKAMGSGSGIGRAGASALRPP 834
           SGPKTNPLADIGVDF+ALNRKEKRMEK    P +STI MG+AMGSG+G+GRAGA  LRPP
Sbjct: 781 SGPKTNPLADIGVDFDALNRKEKRMEKQPMTPVVSTITMGRAMGSGTGLGRAGAGVLRPP 840

BLAST of Cp4.1LG08g05680 vs. TrEMBL
Match: W9R4X1_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_024722 PE=4 SV=1)

HSP 1 Score: 749.6 bits (1934), Expect = 4.1e-213
Identity = 501/938 (53.41%), Postives = 603/938 (64.29%), Query Frame = 1

Query: 1   MKKAFDQTVRDLKREVNKTVLKIPKIEQKVLDATSNEPWGPHGSLLADIAQATRNYHEYQ 60
           MKKAFDQTVRDLKREVNK VLK+P IEQKVLD+T+NEPWGPHGSLLADIA ATRNYHEYQ
Sbjct: 1   MKKAFDQTVRDLKREVNKKVLKVPGIEQKVLDSTNNEPWGPHGSLLADIALATRNYHEYQ 60

Query: 61  MIMGVIWKRINDTGKNWRHVYKGLTVLEYLVGHGSERVIDDIREHVYQISTLSNFQYIDS 120
           MIM VIWKRINDTGKNWRHVYK LTVL+YLV HGSERVID+IREHVYQISTLS+FQYIDS
Sbjct: 61  MIMAVIWKRINDTGKNWRHVYKALTVLDYLVAHGSERVIDEIREHVYQISTLSDFQYIDS 120

Query: 121 SGRDQGNNVRKKSQNLVALVNDKERIVEVRQKAAANKDKFHSAASMGSMYRPSAGAY--- 180
           SGRDQG+NVRKKSQ+LV LVNDKERI+EVRQKAAAN+DKF +A+S G MYRP  G+Y   
Sbjct: 121 SGRDQGSNVRKKSQSLVVLVNDKERIIEVRQKAAANRDKFRNASSSGGMYRP--GSYSST 180

Query: 181 --------DDRYEGRYGSRDGD---RNVDSYGREREYGFR-DDRSGGNEDS-------YG 240
                   DDRYEGRYGS+D D   RN DSYGRERE+G+R DD+ G N DS       YG
Sbjct: 181 GGTGDRFDDDRYEGRYGSKDDDRYSRNGDSYGREREWGYRDDDKYGRNGDSYSRDGDRYG 240

Query: 241 RDYEDRYNRDGYRDDDYRGRSRSVDDYQYGSRSRSSDRNGERTY-DDGQVSSRNNDTRAN 300
           R+YEDRY RDG RDDDYRGRS+SVD  QYG RSRSSDR+ ER++ DDGQ SSR+      
Sbjct: 241 REYEDRYGRDGDRDDDYRGRSQSVDGNQYGQRSRSSDRDRERSFDDDGQYSSRSG--ARG 300

Query: 301 EPSRDERPLERKFSEQNIGAPPSYEEVVTESESNVQSQRDVEAPPTAAPRAFPP-----P 360
           + S+D R L+RKFSEQNIGAPPSYEE  +ES S V S+RD E P  AAPRA  P     P
Sbjct: 301 DDSQDGRRLDRKFSEQNIGAPPSYEE-ASESRSPVHSERDGETPAAAAPRASSPAANNHP 360

Query: 361 TSSTQSQLTS---------HGTAASPPTQGLDGSDEFDPRGSVPAAPPATSNLETNLLD- 420
            SS Q   T          H T+ SP  Q +  +DEFDPRG+V A P   ++ E +LL  
Sbjct: 361 ISSPQPGSTQNHPSQPPNVHDTSVSPANQEVQATDEFDPRGAVSATPAHVNSAEVDLLGS 420

Query: 421 -SLALVPVGPVTSTADYEGHVQTSSA--VGYHAQNQT-------FEDAFGDSPFKAISSS 480
            SLA+VP    ++T++ E    T +A  V  +  NQ+       FED FGD+PFKAI   
Sbjct: 421 LSLAIVPTVSTSATSEAESQAPTFAAAPVSSNVTNQSNIDLMQHFEDPFGDAPFKAIPLD 480

Query: 481 DVQ------DQAHVQH--------------GESFSVATYSTP---NIPVQPQPNFPQPRE 540
             Q         H  H              G+S S  TYS P   ++   P  +   P+E
Sbjct: 481 TTQALPQTSTSIHTTHAGVPNANAGSDFGFGDSLSGLTYSAPSFSSVQTPPMNSEFLPQE 540

Query: 541 ESLQHQNIGVLADLLPPPEPLPAVVSQPAFTVSNSQQAVSGLPAQPNSNNLGTYQ-QHGN 600
            S  HQN  +LAD+LPP  P PA+ +QP F+      A +  PAQP +N+   YQ Q G+
Sbjct: 541 LSTAHQNTDILADILPPSGPSPAITAQPPFS------APAAPPAQPGANDFRNYQPQPGS 600

Query: 601 V--APVNFQNPTEPGREFNNGMFMVPGSAPT-HVNSYMAPPN-AGPSTHPNNFGYSQDGS 660
           +   P N    T+ G    +G ++    APT  + S++ P    GP+   N+  +   G 
Sbjct: 601 IVAVPSNMVPQTQSGPAGQHGNYL--SHAPTAPLTSHVVPQTPTGPTVQFNSGNFLPQGQ 660

Query: 661 VAPTSSHVALQTTQPPAQLPSGNFNPPHASVAPVASQLSYQAPNFPVNVSNPDVMGSFQA 720
            A   ++ +    Q  + +P  +   PH++V P A QL+    NF    S+   + S+QA
Sbjct: 661 FA-APNNGSFYPQQGASTVPVTSHTVPHSAVGP-AGQLN--TGNFLPQQSSAHQVVSYQA 720

Query: 721 G---------------------NYTSMASQQIPPSGSLS-NASQPSKNKFETKSTVWTDT 780
                                 N  S+A Q   PS +++  +++P+ +KFETKSTVW DT
Sbjct: 721 SSGPALQQGNDLLGGLLPQTGQNPPSVAQQVTLPSSTVTLMSTKPADDKFETKSTVWADT 780

Query: 781 LNRGLVNLNISGPKTNPLADIGVDFEALNRKEKRMEKPSTAPAISTINMGKAMGSGSGIG 834
           L+RGLVNLNISG K NPLADIG+DF+A+NRKEKRMEKP+  P  ST+ MG+AMGSGSG+G
Sbjct: 781 LSRGLVNLNISGSKINPLADIGIDFDAINRKEKRMEKPAATPVTSTVTMGRAMGSGSGMG 840

BLAST of Cp4.1LG08g05680 vs. TrEMBL
Match: A0A059AUM7_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_H00432 PE=4 SV=1)

HSP 1 Score: 741.5 bits (1913), Expect = 1.1e-210
Identity = 508/934 (54.39%), Postives = 594/934 (63.60%), Query Frame = 1

Query: 1   MKKAFDQTVRDLKREVNKTVLKIPKIEQKVLDATSNEPWGPHGSLLADIAQATRNYHEYQ 60
           MKK FDQTVRD+KREVNK VLK+P IEQKVLDATSNEPWGPHG+LLADIAQA+RNYHEYQ
Sbjct: 1   MKKVFDQTVRDIKREVNKKVLKVPGIEQKVLDATSNEPWGPHGTLLADIAQASRNYHEYQ 60

Query: 61  MIMGVIWKRINDTGKNWRHVYKGLTVLEYLVGHGSERVIDDIREHVYQISTLSNFQYIDS 120
           MIMGVIWKRINDTGKNWRHVYK LTVLEYLV HGSERVI++IREH YQISTLS+FQYIDS
Sbjct: 61  MIMGVIWKRINDTGKNWRHVYKALTVLEYLVAHGSERVIEEIREHAYQISTLSDFQYIDS 120

Query: 121 SGRDQGNNVRKKSQNLVALVNDKERIVEVRQKAAANKDKFHSAASMGSMYRPSAG-AYDD 180
           SGRDQGNNVR+KSQ LV LVNDKERIVEVRQKAAAN+DKF S +S G MYRP +G  YDD
Sbjct: 121 SGRDQGNNVRRKSQGLVMLVNDKERIVEVRQKAAANRDKFRSVSSSGGMYRPGSGDRYDD 180

Query: 181 -RYEGRYGSRDGDRNVDSYGREREYGFRD-DRSGGNEDSY-------GRDYEDRYNRDGY 240
            RYEGRYGSRD      SYGRER++G +D DR   N DSY       GRDYE+R+ RDGY
Sbjct: 181 ERYEGRYGSRDEY----SYGRERDWGSKDEDRYSRNGDSYSRDGDRYGRDYEERFGRDGY 240

Query: 241 RDDDYRGRSRSVDDYQYGS-RSRSSDRNGERTYDD-GQVSSRNNDTRANEPSRDERPLER 300
           RDDD+RGRSRS+DDYQYGS RSRS DR+ +R+ DD GQ SSR +  RA+E S+DER L+R
Sbjct: 241 RDDDHRGRSRSIDDYQYGSSRSRSHDRDRDRSVDDDGQYSSRGSAGRADENSQDERRLDR 300

Query: 301 KFSEQNIGAPPSYEEVVTESESNVQSQRDVEAPPTAAPRAFPPPTSSTQSQLTSHGTA-- 360
           K+SEQNIGAPPSYEE V +S S V ++RDVE   T+AP+   PP +      +    A  
Sbjct: 301 KYSEQNIGAPPSYEEAVVDSRSPVHNERDVETSATSAPKPSSPPVAPVNDNTSPPANAPV 360

Query: 361 --ASPPTQGLDGSDEFDPRGSVPAAP-PATS-NLETNLLDSL---------ALVPVGPVT 420
             ASP  Q  +  DEFDPRG + AAP P+TS + ET+LL SL         A++PV    
Sbjct: 361 PSASPAKQEFETFDEFDPRGPLSAAPAPSTSMSAETDLLGSLSDSFSANPLAIMPVTTGN 420

Query: 421 STADYEGHVQTSSAVGYHA--------QNQTFEDAFGDSPFKAISSSDVQDQAHVQHGES 480
           S  + E    T+    + A         NQ+F+D FGDSPFKA  S D    A VQ   +
Sbjct: 421 SAPEAEAAANTNFGPTFAAAPPSASNVMNQSFDDPFGDSPFKATPSGD---GAPVQQPTA 480

Query: 481 FSVATYSTPNI------PVQPQPNF----------PQPREESLQHQ----NIGVLADLLP 540
             VAT   PN+      P     NF          P   + S  +     N  +LAD+LP
Sbjct: 481 TPVATMQ-PNVNQNVEMPQMAGSNFGDSLSGLTYAPASAQNSQFYSEPNPNTDILADILP 540

Query: 541 PPEPLPAVVSQPAFTVSNSQQAVSGLPAQPNSNNLGTY-QQHGNVAPVNFQNP----TEP 600
           P  P      QP F+      A SG PA   ++  G +  Q G VAPV  Q P    T P
Sbjct: 541 PSGPSAVAAPQPPFS------APSGHPAHLQTSVYGNFGSQSGPVAPVASQVPQHMQTGP 600

Query: 601 GREFNNGMFMVPGSAPTHVNSYMAPPNAGPSTHPNNFGYSQDGSVAPTSSHVALQTTQPP 660
             +FN G F      P H+ +          + PN   Y+Q G+  P   H+A Q   P 
Sbjct: 601 VPQFNYGNF------PQHMGT----------SAPNGNVYTQAGAAPPPPPHMAPQNMLP- 660

Query: 661 AQLPSGNFNPPHASVAPVASQLSYQAPNFPVNVS-NPDVMGSF--QAGNYTSMASQQIPP 720
           +   +G+F P     + V S +  Q  +F   V  N DV+G+   Q G  + +ASQ   P
Sbjct: 661 SHTTNGSFLPS-GGTSTVTSHMPPQQASFGGAVQQNNDVLGNLLPQMGQNSQVASQPPQP 720

Query: 721 -----SGSLSNASQPSKNK-FETKSTVWTDTLNRGLVNLNISGPKTNPLADIGVDFEALN 780
                +G+L+   QPSK+K FETKSTVW DTL+RGLVNLNISGPKTNPLADIGVDF+A+N
Sbjct: 721 PLSSSTGALAIVPQPSKDKKFETKSTVWADTLSRGLVNLNISGPKTNPLADIGVDFDAIN 780

Query: 781 RKEKRMEKPSTAPAISTINMGKAMGSGSGIGRAGASALRPPPNAMSGPGMGMGMNPNPGM 834
           RKEKRMEK S AP  STINMGKAMGSGSGIGRAGA ALRPPPN ++G GMGMGM    GM
Sbjct: 781 RKEKRMEKQSAAPVTSTINMGKAMGSGSGIGRAGAGALRPPPNPVAGSGMGMGMGMGMGM 840

BLAST of Cp4.1LG08g05680 vs. TrEMBL
Match: A0A059AUX7_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_H00432 PE=4 SV=1)

HSP 1 Score: 735.7 bits (1898), Expect = 6.2e-209
Identity = 507/935 (54.22%), Postives = 594/935 (63.53%), Query Frame = 1

Query: 1   MKKAFDQTVRDLKREVNKTVLKIPKIEQKVLDATSNEPWGPHGSLLADIAQATRNYHEYQ 60
           MKK FDQTVRD+KREVNK VLK+P IEQKVLDATSNEPWGPHG+LLADIAQA+RNYHEYQ
Sbjct: 1   MKKVFDQTVRDIKREVNKKVLKVPGIEQKVLDATSNEPWGPHGTLLADIAQASRNYHEYQ 60

Query: 61  MIMGVIWKRINDTGKNWRHVYKGLTVLEYLVGHGSERVIDDIREHVYQISTLSNFQYIDS 120
           MIMGVIWKRINDTGKNWRHVYK LTVLEYLV HGSERVI++IREH YQISTLS+FQYIDS
Sbjct: 61  MIMGVIWKRINDTGKNWRHVYKALTVLEYLVAHGSERVIEEIREHAYQISTLSDFQYIDS 120

Query: 121 SGRDQGNNVRKKSQNLVALVNDKERIVEVRQKAAANKD-KFHSAASMGSMYRPSAG-AYD 180
           SGRDQGNNVR+KSQ LV LVNDKERIVEVRQKAAAN+D +F S +S G MYRP +G  YD
Sbjct: 121 SGRDQGNNVRRKSQGLVMLVNDKERIVEVRQKAAANRDNRFRSVSSSGGMYRPGSGDRYD 180

Query: 181 D-RYEGRYGSRDGDRNVDSYGREREYGFRD-DRSGGNEDSY-------GRDYEDRYNRDG 240
           D RYEGRYGSRD      SYGRER++G +D DR   N DSY       GRDYE+R+ RDG
Sbjct: 181 DERYEGRYGSRDEY----SYGRERDWGSKDEDRYSRNGDSYSRDGDRYGRDYEERFGRDG 240

Query: 241 YRDDDYRGRSRSVDDYQYGS-RSRSSDRNGERTYDD-GQVSSRNNDTRANEPSRDERPLE 300
           YRDDD+RGRSRS+DDYQYGS RSRS DR+ +R+ DD GQ SSR +  RA+E S+DER L+
Sbjct: 241 YRDDDHRGRSRSIDDYQYGSSRSRSHDRDRDRSVDDDGQYSSRGSAGRADENSQDERRLD 300

Query: 301 RKFSEQNIGAPPSYEEVVTESESNVQSQRDVEAPPTAAPRAFPPPTSSTQSQLTSHGTA- 360
           RK+SEQNIGAPPSYEE V +S S V ++RDVE   T+AP+   PP +      +    A 
Sbjct: 301 RKYSEQNIGAPPSYEEAVVDSRSPVHNERDVETSATSAPKPSSPPVAPVNDNTSPPANAP 360

Query: 361 ---ASPPTQGLDGSDEFDPRGSVPAAP-PATS-NLETNLLDSL---------ALVPVGPV 420
              ASP  Q  +  DEFDPRG + AAP P+TS + ET+LL SL         A++PV   
Sbjct: 361 VPSASPAKQEFETFDEFDPRGPLSAAPAPSTSMSAETDLLGSLSDSFSANPLAIMPVTTG 420

Query: 421 TSTADYEGHVQTSSAVGYHA--------QNQTFEDAFGDSPFKAISSSDVQDQAHVQHGE 480
            S  + E    T+    + A         NQ+F+D FGDSPFKA  S D    A VQ   
Sbjct: 421 NSAPEAEAAANTNFGPTFAAAPPSASNVMNQSFDDPFGDSPFKATPSGD---GAPVQQPT 480

Query: 481 SFSVATYSTPNI------PVQPQPNF----------PQPREESLQHQ----NIGVLADLL 540
           +  VAT   PN+      P     NF          P   + S  +     N  +LAD+L
Sbjct: 481 ATPVATMQ-PNVNQNVEMPQMAGSNFGDSLSGLTYAPASAQNSQFYSEPNPNTDILADIL 540

Query: 541 PPPEPLPAVVSQPAFTVSNSQQAVSGLPAQPNSNNLGTY-QQHGNVAPVNFQNP----TE 600
           PP  P      QP F+      A SG PA   ++  G +  Q G VAPV  Q P    T 
Sbjct: 541 PPSGPSAVAAPQPPFS------APSGHPAHLQTSVYGNFGSQSGPVAPVASQVPQHMQTG 600

Query: 601 PGREFNNGMFMVPGSAPTHVNSYMAPPNAGPSTHPNNFGYSQDGSVAPTSSHVALQTTQP 660
           P  +FN G F      P H+ +          + PN   Y+Q G+  P   H+A Q   P
Sbjct: 601 PVPQFNYGNF------PQHMGT----------SAPNGNVYTQAGAAPPPPPHMAPQNMLP 660

Query: 661 PAQLPSGNFNPPHASVAPVASQLSYQAPNFPVNVS-NPDVMGSF--QAGNYTSMASQQIP 720
            +   +G+F P     + V S +  Q  +F   V  N DV+G+   Q G  + +ASQ   
Sbjct: 661 -SHTTNGSFLPS-GGTSTVTSHMPPQQASFGGAVQQNNDVLGNLLPQMGQNSQVASQPPQ 720

Query: 721 P-----SGSLSNASQPSKNK-FETKSTVWTDTLNRGLVNLNISGPKTNPLADIGVDFEAL 780
           P     +G+L+   QPSK+K FETKSTVW DTL+RGLVNLNISGPKTNPLADIGVDF+A+
Sbjct: 721 PPLSSSTGALAIVPQPSKDKKFETKSTVWADTLSRGLVNLNISGPKTNPLADIGVDFDAI 780

Query: 781 NRKEKRMEKPSTAPAISTINMGKAMGSGSGIGRAGASALRPPPNAMSGPGMGMGMNPNPG 834
           NRKEKRMEK S AP  STINMGKAMGSGSGIGRAGA ALRPPPN ++G GMGMGM    G
Sbjct: 781 NRKEKRMEKQSAAPVTSTINMGKAMGSGSGIGRAGAGALRPPPNPVAGSGMGMGMGMGMG 840

BLAST of Cp4.1LG08g05680 vs. TAIR10
Match: AT2G43160.3 (AT2G43160.3 ENTH/VHS family protein)

HSP 1 Score: 480.3 bits (1235), Expect = 2.4e-135
Identity = 407/947 (42.98%), Postives = 494/947 (52.16%), Query Frame = 1

Query: 1   MKKAFDQTVRDLKREVNKTVLKIPKIEQKVLDATSNEPWGPHGSLLADIAQATRNYHEYQ 60
           MKK F QTVRDLKREVNK VLK+P +EQKVLDATSNEPWGPHGSLLAD+AQA+RNYHEYQ
Sbjct: 1   MKKVFGQTVRDLKREVNKKVLKVPGVEQKVLDATSNEPWGPHGSLLADLAQASRNYHEYQ 60

Query: 61  MIMGVIWKRINDTGKNWRHVYKGLTVLEYLVGHGSERVIDDIREHVYQISTLSNFQYIDS 120
           +IM VIWKR++DTGKNWRHVYK LTVLEY+VGHGSERVID+IRE  YQISTLS+FQYIDS
Sbjct: 61  LIMVVIWKRLSDTGKNWRHVYKALTVLEYMVGHGSERVIDEIRERAYQISTLSDFQYIDS 120

Query: 121 SGRDQGNNVRKKSQNLVALVNDKERIVEVRQKAAANKDKFHSAASMGSMYRPSAGAYDDR 180
            GRDQG+NVRKKSQ+LVALVNDKERI EVRQKAAAN+DK+ S+A  G MY+PS G Y D+
Sbjct: 121 GGRDQGSNVRKKSQSLVALVNDKERIAEVRQKAAANRDKYRSSAP-GGMYKPSGG-YGDK 180

Query: 181 Y--------------EGRYGSRDGDRNV-----------DSYGREREYGFRDD------R 240
           Y              E  YG RD DRN            D YGR+   G RDD      R
Sbjct: 181 YDYGSRDEERSSYGREREYGYRDDDRNSRDGDRHSRDSEDRYGRD---GNRDDDYRGRSR 240

Query: 241 SGGNEDSYGRDYEDRYNRDGYRDDDYRGRSRSVDDYQYGSRSRSSDRNG--ERTYDDGQV 300
           S  N  S GR  E     DG+     RG     DD        S D  G  +R + +  +
Sbjct: 241 SVDNYGSRGRSSEREREDDGHSSS--RGSGARADD-------NSQDGRGGLQRKFSEQNI 300

Query: 301 SSRNNDTRANEPSRDERPLERKFSEQNIGAPPSYEEVVTESESNVQ-SQRDVEAPPTAAP 360
            +  +   A   SR        +SE++ G  P        S    Q +  +  +PPT   
Sbjct: 301 GAPPSYEEAVSDSRSP-----VYSERDGGETPQVTAPGAASPPPPQVAAPEAASPPTGTN 360

Query: 361 RAFPPPTSSTQSQLTSHGT----------AASPPTQGLDGSDEFDPRGSVPAAPPATSNL 420
            A    T   +S      T          +A PP           P  +  +AP  ++++
Sbjct: 361 TANTTATFVNESPSQKVETFDEFDPRSAFSAGPPAYASTDGVTAPPTVTSMSAPTTSNSV 420

Query: 421 ETNLLDSLALV------PVGPVTST-ADYEGHVQTSSAVGYHAQN---QTFEDAFGDSPF 480
           E +LL SLA V       + P  S   +  G      A  +       Q+F+D FGDSPF
Sbjct: 421 EMDLLGSLADVFSSNALAIVPADSIYVETNGQANAGPAPSFSTSQPSTQSFDDPFGDSPF 480

Query: 481 KAISSSDVQ-------------------------DQAH-VQHGESFSVATYSTP-NIPVQ 540
           KA +S+D                           D AH    G+SFS      P +  VQ
Sbjct: 481 KAFTSTDTDSTPQQNFGASFQPPPPAFTSEVSHPDTAHNFGFGDSFSAVANPDPASQNVQ 540

Query: 541 PQPNFPQ-PREESLQHQN-IGVLADLLPPPEPLPAVVSQPAFTVSNSQQAVSGLPAQPNS 600
           P  N P  P+E+    Q+ I +LA +LPP  P   V S P+   S            P+ 
Sbjct: 541 PPSNSPGFPQEQFATSQSGIDILAGILPPSGP--PVQSGPSIPTSQFP---------PSG 600

Query: 601 NNLGTYQQHGNVAPVNFQNPTEPGREFNNGMFMVPGSAPTHVNSYMAPP-NAGPSTHPNN 660
           NN+  Y+   +  PV+   P  PG+             P   N   A P N+G   H   
Sbjct: 601 NNM--YEGFHSQPPVSTA-PNLPGQTPFGQAVQPYNMVPHSQNMTGAMPFNSGGFMH--- 660

Query: 661 FGYSQDGSVAPTSSHVALQTTQPPAQLPSGNFNPPHASVAPVASQLSYQAPNFPVNVS-N 720
               Q GS  P S+             P+G F        P  S    +  + PV +  N
Sbjct: 661 ----QPGSQTPYSTPSG----------PAGQFMAHQGHGMP-PSHGPQRTQSGPVTLQGN 720

Query: 721 PDVMGSF--QAGNYTSMASQQIPPSGSLSNASQ---PSKNKFETKSTVWTDTLNRGLVNL 780
            +VMG    QA   +  +S   P    L+ A +   P + KFE KS+VW DTL+RGLVN 
Sbjct: 721 NNVMGDMFSQATPNSLTSSSSHPDLTPLTGAIEIVPPPQKKFEPKSSVWADTLSRGLVNF 780

Query: 781 NISGPKTNPLADIGVDFEALNRKEKRMEKPSTAPAISTINMGKAMGSGSGIGRAGASALR 834
           NISG KTNPLADIGVDFEA+NR+EKR+EK +  PA STINMGKAMGSG+G+GR+GA+A+R
Sbjct: 781 NISGSKTNPLADIGVDFEAINRREKRLEKQTNTPATSTINMGKAMGSGTGLGRSGATAMR 840

BLAST of Cp4.1LG08g05680 vs. TAIR10
Match: AT3G59290.1 (AT3G59290.1 ENTH/VHS family protein)

HSP 1 Score: 447.2 bits (1149), Expect = 2.2e-125
Identity = 324/708 (45.76%), Postives = 404/708 (57.06%), Query Frame = 1

Query: 1   MKKAFDQTVRDLKREVNKTVLKIPKIEQKVLDATSNEPWGPHGSLLADIAQATRNYHEYQ 60
           MKKAF QTVRDLKR VNK VLK+P IEQKVLDATSNE WGPHGSLLADIA A+RNYHEYQ
Sbjct: 1   MKKAFGQTVRDLKRGVNKKVLKVPGIEQKVLDATSNESWGPHGSLLADIAHASRNYHEYQ 60

Query: 61  MIMGVIWKRINDTGKNWRHVYKGLTVLEYLVGHGSERVIDDIREHVYQISTLSNFQYIDS 120
           + MGV+WKR++D+GKNWRHVYK LTVLEY+VGHGSERVI++++EH YQI+TLS FQYIDS
Sbjct: 61  ITMGVLWKRLSDSGKNWRHVYKALTVLEYMVGHGSERVIEEVKEHAYQITTLSGFQYIDS 120

Query: 121 SGRDQGNNVRKKSQNLVALVNDKERIVEVRQKAAANKDKFHSAASMGSMYRPSAGAYDDR 180
           SG+DQG+NVRKK+Q+LVALVNDKERI EVR+KAAAN+DK+H+     SM+RPS G Y D+
Sbjct: 121 SGKDQGSNVRKKAQSLVALVNDKERITEVREKAAANRDKYHN-----SMHRPS-GGYGDK 180

Query: 181 --YEGRYGSRDGDRNVDSYGREREYGFR-DDRSGGNEDSYGRDYEDRYNRDGYRDDDYRG 240
             YEGRYG RD  R+  SYG+EREYG+R DDR+  + D Y RD EDRY RDG  DD+YRG
Sbjct: 181 YDYEGRYGDRDEGRS--SYGKEREYGYRDDDRNSRDGDRYSRDSEDRYGRDGNTDDEYRG 240

Query: 241 RSRSVDDYQYGSRSRSSDRNGERTYDDGQVSSRNNDTRANEPSRDER-PLERKFSEQNIG 300
           RSRSVD+Y  GSR RSSDR      DDGQ SSR++   A++ S+D R  LERKFSEQNIG
Sbjct: 241 RSRSVDNYN-GSRGRSSDRE-RPIEDDGQSSSRDSGAPADDHSQDGRGGLERKFSEQNIG 300

Query: 301 -APPSYEEVVTESESNVQSQRD-VEAPPTAAPRAFPPPTS---STQSQLTSHGTAASPPT 360
            APPSYEE V+ES S V S+RD  E P  A P A   P +   S  ++       +SP  
Sbjct: 301 AAPPSYEEAVSESRSPVYSERDGGETPQVAPPGAAASPLAENISVDNKAADFVNESSP-- 360

Query: 361 QGLDGSDEFDPRGSVPA----------------------APPATSNLETNLLDSLALV-- 420
           Q ++  DEFDPRGSV A                      APPA+ N E +LL SL+ V  
Sbjct: 361 QQVEAFDEFDPRGSVSAACAPTAGASVPAPIPPTVVSTPAPPASINAEMDLLGSLSDVFS 420

Query: 421 --PVGPVTS---TADYEGHVQTSSAVGY---HAQNQTFEDAFGDSPFKAISSSDVQDQAH 480
             P+  VTS   + +  G   T  A  +    +  Q F+D FGDSPFKAI+S+D +   H
Sbjct: 421 PNPLAIVTSDSTSVETNGQANTGLAPSFSTSQSSTQPFDDPFGDSPFKAITSADTETSQH 480

Query: 481 VQHGESFSVATYSTPNIPVQPQPNFPQPREESLQHQNIG------VLADLLPPPEPLPA- 540
              G            +P QP P    P  E     N G       + D  P  + + A 
Sbjct: 481 QSFG------------VPFQPTPPTSNPNNE----HNFGFGEAFSAVTDSEPGVQNMQAP 540

Query: 541 ----VVSQPAFTVSNSQQAVSGLPAQPNSNNLGTYQQHGNVAPVNFQNPTEPGREFNNGM 600
               V  Q  F  S S+  +      P+   +    Q  +  P +  +P     E  +  
Sbjct: 541 PNLSVFPQEQFDTSQSEIDILAGILPPSGPPVSLSPQPDSTMPTSQFHPNGNSYESYHHQ 600

Query: 601 FMVPGSAPTHVNSYMAPPNAGPSTHPNNFGYSQ---------DGSVAPTSSHVALQTTQP 648
                +APT +N     P    S   N   +SQ         +G       +    T+QP
Sbjct: 601 -----AAPTDLNMQGQTPFGQASQQFNMVSHSQNHHEGMQFNNGGFTQQPGYAGPATSQP 660

BLAST of Cp4.1LG08g05680 vs. TAIR10
Match: AT5G11710.1 (AT5G11710.1 ENTH/VHS family protein)

HSP 1 Score: 207.6 bits (527), Expect = 3.0e-53
Identity = 164/497 (33.00%), Postives = 267/497 (53.72%), Query Frame = 1

Query: 3   KAFDQTVRDLKREVNKTVLKIPKIEQKVLDATSNEPWGPHGSLLADIAQATRNYHEYQMI 62
           K FDQTVR++KREVN  VLK+P++EQKVLDAT NEPWGPHG+ LA+IAQAT+ + E QM+
Sbjct: 5   KVFDQTVREIKREVNLKVLKVPEMEQKVLDATDNEPWGPHGTALAEIAQATKKFSECQMV 64

Query: 63  MGVIWKRINDTGKNWRHVYKGLTVLEYLVGHGSERVIDDIREHVYQISTLSNFQYIDSSG 122
           M V+W R+++TGK+WR+VYK L V++YL+ +GSER +D+I EH YQIS+L++F+Y++ +G
Sbjct: 65  MSVLWTRLSETGKDWRYVYKALAVIDYLISNGSERAVDEIIEHTYQISSLTSFEYVEPNG 124

Query: 123 RDQGNNVRKKSQNLVALVNDKERIVEVRQKAAANKDKFHSAASMGSMYRP-SAGAYDDRY 182
           +D G NVRKK++N+VAL+N+KE+I E+R KA AN++K+   +S G  Y+  S+ ++   +
Sbjct: 125 KDVGINVRKKAENIVALLNNKEKISEIRDKAVANRNKYVGLSSTGITYKSGSSASFGGSF 184

Query: 183 EGRYGSRDGDRNVDSYGREREY-GFRDDRSGGNEDSYGRDYEDRYNRDGYRD-DDYRGRS 242
           +    + D  ++ DS   + +Y  F+  R G   +      +  ++R G  D D+     
Sbjct: 185 QSGSSNFDSYKDRDSREDKNDYESFQKSRRGVKTEEQSYTSKKSFSRYGSTDHDNLSSGK 244

Query: 243 RSVDDYQYGSRSRSSDRNGERTYDDGQVSSRNNDTRANEPSRDERPLERKFSEQNIG--- 302
           +S D  ++ S   ++  N +  +DD         T +N+PS         F    IG   
Sbjct: 245 KSPDSAKHRSYVSAAPSNNDDDFDDFD----PRGTSSNKPSTGSANQVDLFGGDLIGDFL 304

Query: 303 -APPSYEEVVTESESNVQSQRDVEAPPTAAPRAFPPPTSSTQSQL-------TSHGTAAS 362
            + P+       +E+  ++    +A   +A        S TQ Q+        S   +++
Sbjct: 305 DSGPTETSSTNNNENFQEADLFADAAFVSASAQGAEFGSQTQKQVDLFSASEPSVTVSSA 364

Query: 363 PPTQGLDGSDEFDPRGSVPAAPPATSNLETNLLDSLALVPVGPVTSTADYEGHVQTSSAV 422
           PPT  L  S E         + P  S    N++D  A VP+     +  +      S++V
Sbjct: 365 PPTVDLFASSESVVSPEAKISIP-ESMATPNIVDPFAAVPMDNFDGSDPFGAFTSHSASV 424

Query: 423 GYHAQNQTFEDAFGDSPFKAISSSDVQDQAHVQHGESFSVAT--YSTPNIPVQPQPNFPQ 482
               Q  +   +   +    +S +D + Q H+Q  + F V +  ++          N   
Sbjct: 425 STGPQAPSVHGS-ATNTTSPLSFADSKPQ-HLQKKDPFQVKSGIWADSLSRGLIDLNITA 484

Query: 483 PREESLQHQNIGVLADL 484
           P++ SL   ++GV+ DL
Sbjct: 485 PKKASL--ADVGVVGDL 492

BLAST of Cp4.1LG08g05680 vs. TAIR10
Match: AT3G46540.1 (AT3G46540.1 ENTH/VHS family protein)

HSP 1 Score: 79.0 bits (193), Expect = 1.6e-14
Identity = 44/123 (35.77%), Postives = 69/123 (56.10%), Query Frame = 1

Query: 32  DATSNEPWGPHGSLLADIAQATRNYHEYQMIMGVIWKRINDTGK-NWRHVYKGLTVLEYL 91
           +AT  E  GP+   L  I++A   + +Y  I+ V+ KR+    K NWR  Y  L V+E+L
Sbjct: 52  EATDGESCGPNTQTLGSISKAAFEFEDYLAIVEVLHKRLAKFDKRNWRMAYNSLIVVEHL 111

Query: 92  VGHGSERVIDDIREHVYQISTLSNFQYIDSSGRDQGNNVRKKSQNLVALVNDKERIVEVR 151
           + HG E V D+ +  +  IS +  FQ ID  G + G  VRKK++ ++ L+   E + E R
Sbjct: 112 LTHGPESVSDEFQGDIDVISQMQTFQQIDEKGFNWGLAVRKKAEKVLKLLEKGELLKEER 171

Query: 152 QKA 154
           ++A
Sbjct: 172 KRA 174

BLAST of Cp4.1LG08g05680 vs. TAIR10
Match: AT3G23350.1 (AT3G23350.1 ENTH/VHS family protein)

HSP 1 Score: 78.6 bits (192), Expect = 2.1e-14
Identity = 47/155 (30.32%), Postives = 77/155 (49.68%), Query Frame = 1

Query: 2   KKAFDQTVRDLKREVNKTVLKIPKIEQKVLDATSNEPWGPHGSLLADIAQATRNYHEYQM 61
           KK     ++D        +  + + E  V + T+ +P  P    +  IA+A+ +  EY  
Sbjct: 10  KKQASSFIQDKYNVARLVLTDVTEAELLVEEVTNGDPSSPDAKTMTKIAEASFDTVEYWR 69

Query: 62  IMGVIWKRINDTG---KNWRHVYKGLTVLEYLVGHGSERVIDDIREHVYQISTLSNFQYI 121
           I+ V+ ++I       KNWR  YK + +LE+L+ HG   +  D    +     LS FQY+
Sbjct: 70  IVDVLHRKIGKDEREIKNWREAYKAMVLLEFLLMHGPIHLPHDFLYDLDHFRFLSTFQYV 129

Query: 122 DSSGRDQGNNVRKKSQNLVALVNDKERIVEVRQKA 154
           D++G D G  V+KK+  +  L+  KE + E R KA
Sbjct: 130 DNNGFDWGAQVQKKADQIQTLLLGKEELREARLKA 164

BLAST of Cp4.1LG08g05680 vs. NCBI nr
Match: gi|659107956|ref|XP_008453940.1| (PREDICTED: clathrin interactor EPSIN 2 isoform X2 [Cucumis melo])

HSP 1 Score: 1287.7 bits (3331), Expect = 0.0e+00
Identity = 694/848 (81.84%), Postives = 732/848 (86.32%), Query Frame = 1

Query: 1   MKKAFDQTVRDLKREVNKTVLKIPKIEQKVLDATSNEPWGPHGSLLADIAQATRNYHEYQ 60
           MKKAFDQTVRDLKREVNKTVLKIPKIEQKVLDATSNEPWGPHGSLLADIAQATRNYHEYQ
Sbjct: 1   MKKAFDQTVRDLKREVNKTVLKIPKIEQKVLDATSNEPWGPHGSLLADIAQATRNYHEYQ 60

Query: 61  MIMGVIWKRINDTGKNWRHVYKGLTVLEYLVGHGSERVIDDIREHVYQISTLSNFQYIDS 120
           MIMG++WKRINDTGKNWRHVYKGLTVLEYLVGHGSERVIDDIREH YQISTLS+FQYIDS
Sbjct: 61  MIMGILWKRINDTGKNWRHVYKGLTVLEYLVGHGSERVIDDIREHAYQISTLSDFQYIDS 120

Query: 121 SGRDQGNNVRKKSQNLVALVNDKERIVEVRQKAAANKDKFHSAASMGSMYRPSAGAYDDR 180
           +GRDQGNNVRKKSQNLVALVNDKERI+EVRQKAAAN+DKF SA+SMGSMYRPS+G YDDR
Sbjct: 121 NGRDQGNNVRKKSQNLVALVNDKERIIEVRQKAAANRDKFRSASSMGSMYRPSSGGYDDR 180

Query: 181 YEGRYGSRDGDRNVDSYGREREYGFRDDRSGGNEDSYGRDYEDRYNRDGYRDDDYRGRSR 240
           YEGRYGSRDGDRNVDSYGRER+YGFRDDRSG NEDSYGRDYE+RYNRDGY+DDDYRGRSR
Sbjct: 181 YEGRYGSRDGDRNVDSYGRERDYGFRDDRSGRNEDSYGRDYEERYNRDGYKDDDYRGRSR 240

Query: 241 SVDDYQYGSRSRSSDRNGERTY-DDGQVSSRNNDTRANEPSRDERPLERKFSEQNIGAPP 300
           ++DDYQYGSRSRSSDR+GER Y DDGQVSSRNN  R +EPS   R LERKFSEQNI APP
Sbjct: 241 NIDDYQYGSRSRSSDRDGERAYDDDGQVSSRNNGARPDEPSPVGRQLERKFSEQNI-APP 300

Query: 301 SYEEVVTESESNVQSQRDVEAPPTAAPRAFPPPTSSTQSQLTSHGTAASPPTQGLDGSDE 360
           SYEE V ES S V SQR+VEAP T APRAFPPP SST SQ T+HGT ASP  QG DGSDE
Sbjct: 301 SYEEAVNESGSTVPSQREVEAPATTAPRAFPPPVSSTPSQQTTHGTTASPHPQGPDGSDE 360

Query: 361 FDPRGSVPAAPPATSNLETNLLDSLALVPVGPVTSTADYEGHVQTSSAVGYHAQNQTFED 420
           FDPRGSVP AP A+SNLETNL DSLALVPVGPVTS+AD E HVQTS+ VG   QNQTFED
Sbjct: 361 FDPRGSVPVAPNASSNLETNLFDSLALVPVGPVTSSADSECHVQTSATVGSFTQNQTFED 420

Query: 421 AFGDSPFKAISSSDVQDQAHVQHGESFSVATYSTPNIPVQPQPNFPQPREESLQHQNIGV 480
            FGDSPFKAISSS VQDQ H Q GESFS ATYS P++PVQPQPN   PREE+LQHQNIGV
Sbjct: 421 PFGDSPFKAISSSGVQDQTHFQRGESFSAATYSKPDVPVQPQPNLHHPREETLQHQNIGV 480

Query: 481 LADLLPPPEPLPAVVSQPAFTVSN-----SQQAVSGLPAQPNSNNLGTYQQHGNVAPVNF 540
           LADLL PPE LPA VSQP +T SN     + QA SGLPAQ N  NLG YQ+ GN+A VNF
Sbjct: 481 LADLL-PPETLPAAVSQPTYTTSNQPVQPNSQAASGLPAQSNP-NLGNYQRDGNIAAVNF 540

Query: 541 QNPTEPGREFNNGMFMVPGSAPTHVNSYMAPPNAGPSTHPNNFGYSQDGSVAPTSSHVAL 600
           QN TEPGREF NGMF+ PG  P HV SYMAPPNAGP+  PNNFG S +GS  P SSH  L
Sbjct: 541 QNQTEPGREFGNGMFVAPGGIPAHV-SYMAPPNAGPNAQPNNFGTSHNGSAVPASSHHTL 600

Query: 601 QTTQPPAQLPSGN-FNPPHASVAPVASQLSYQAPNFPVNVSNPDVMGSF--QAGNYTSMA 660
           QTT+PPA LPSGN FN P  SVAPVASQ+SYQ  NFPV  S  +VMGSF  QAGNYTSMA
Sbjct: 601 QTTRPPAHLPSGNYFNAPQGSVAPVASQVSYQTSNFPVVKS--EVMGSFNSQAGNYTSMA 660

Query: 661 SQQIPPSGSLSNASQPSKNKFETKSTVWTDTLNRGLVNLNISGPKTNPLADIGVDFEALN 720
           SQQ PP+G LS ASQ SKNKFETKSTVW+DTL+RGLVNLNISGPK NP ADIGVDFEALN
Sbjct: 661 SQQNPPAGPLSTASQASKNKFETKSTVWSDTLSRGLVNLNISGPKANPTADIGVDFEALN 720

Query: 721 RKEKRMEKPSTAPAISTINMGKAMGSGSGIGRAGASALRPPPNAMSGP------GMGMGM 780
           RKEKRMEKPSTAP +STINMGKAMGSGSGIGRAGASALRP PNAMSG       GMGMGM
Sbjct: 721 RKEKRMEKPSTAPVVSTINMGKAMGSGSGIGRAGASALRPLPNAMSGSGSGMGMGMGMGM 780

Query: 781 NPNPGMGMGMRGGYGGMNQPMGGMGMNMGMAQHGIQMQQPRANMPGAYNPMMGAGGYAPQ 834
           NPNPGMGMGMR GYGGMNQPMGGM MNMGM Q G QMQQPRANMPG YNPMMG+GGYAPQ
Sbjct: 781 NPNPGMGMGMR-GYGGMNQPMGGMSMNMGMGQQGFQMQQPRANMPGVYNPMMGSGGYAPQ 840

BLAST of Cp4.1LG08g05680 vs. NCBI nr
Match: gi|449468762|ref|XP_004152090.1| (PREDICTED: clathrin interactor EPSIN 2 [Cucumis sativus])

HSP 1 Score: 1283.9 bits (3321), Expect = 0.0e+00
Identity = 691/850 (81.29%), Postives = 729/850 (85.76%), Query Frame = 1

Query: 1   MKKAFDQTVRDLKREVNKTVLKIPKIEQKVLDATSNEPWGPHGSLLADIAQATRNYHEYQ 60
           MKKAFDQTVRDLKREVNKTVLKIPK+EQKVLDATSNEPWGPHGSLLADIAQATRNYHEYQ
Sbjct: 1   MKKAFDQTVRDLKREVNKTVLKIPKVEQKVLDATSNEPWGPHGSLLADIAQATRNYHEYQ 60

Query: 61  MIMGVIWKRINDTGKNWRHVYKGLTVLEYLVGHGSERVIDDIREHVYQISTLSNFQYIDS 120
           MIMG++WKRINDTGKNWRHVYKGLTVLEYLVGHGSERVIDDIREH YQISTLS+FQYIDS
Sbjct: 61  MIMGILWKRINDTGKNWRHVYKGLTVLEYLVGHGSERVIDDIREHAYQISTLSDFQYIDS 120

Query: 121 SGRDQGNNVRKKSQNLVALVNDKERIVEVRQKAAANKDKFHSAASMGSMYRPSAGAYDDR 180
           +GRDQGNNVRKKSQNLVALVNDKERI+EVRQKAAAN+DKF SA+SMGSMYRP +G YDDR
Sbjct: 121 NGRDQGNNVRKKSQNLVALVNDKERIIEVRQKAAANRDKFRSASSMGSMYRPGSGGYDDR 180

Query: 181 YEGRYGSRDGDRNVDSYGREREYGFRDDRSGGNEDSYGRDYEDRYNRDGYRDDDYRGRSR 240
           YEGRYG RDGDRNVDSYGRER+YGFRDDRSG NEDSYGRDYE+RYNRDGY+DDDYRGRSR
Sbjct: 181 YEGRYGGRDGDRNVDSYGRERDYGFRDDRSGRNEDSYGRDYEERYNRDGYKDDDYRGRSR 240

Query: 241 SVDDYQYGSRSRSSDRNGERTY-DDGQVSSRNNDTRANEPSRDERPLERKFSEQNIGAPP 300
           S+DDYQYGSRSRSSDR+GER Y DDGQVSSRN+  R +EPS+  R LERKFSEQNI APP
Sbjct: 241 SIDDYQYGSRSRSSDRDGERAYDDDGQVSSRNSGARPDEPSQVGRQLERKFSEQNI-APP 300

Query: 301 SYEEVVTESESNVQSQRDVEAPPTAAPRAFPPPTSSTQSQLTSHGTAASPPTQGLDGSDE 360
           SYEE V ES S V SQR+VEAP T APRAFPPP  ST SQ T+HGT ASP  QG DGSDE
Sbjct: 301 SYEEAVNESGSTVPSQREVEAPATTAPRAFPPPVPSTPSQQTTHGTTASPLPQGFDGSDE 360

Query: 361 FDPRGSVPAAPPATSNLETNLLDSLALVPVGPVTSTADYEGHVQTSSAVGYHAQNQTFED 420
           FDPRGSVP AP A+SNLE NL DSLALVPVGPVTS+AD E HVQTSSAVG   QNQTFED
Sbjct: 361 FDPRGSVPVAPNASSNLEANLFDSLALVPVGPVTSSADSESHVQTSSAVGSFTQNQTFED 420

Query: 421 AFGDSPFKAISSSDVQDQAHVQHGESFSVATYSTPNIPVQPQPNFPQPREESLQHQNIGV 480
            FGDSPFKAISSS VQDQ + Q GESFS ATYSTPN+PVQPQPN   PREE+LQHQNIGV
Sbjct: 421 PFGDSPFKAISSSGVQDQTYFQRGESFSAATYSTPNVPVQPQPNLHHPREETLQHQNIGV 480

Query: 481 LADLLPPPEPLPAVVSQPAFT----VSNSQQAVSGLPAQPNSNNLGTYQQHGNVAPVNFQ 540
           LADLL PPE LPA VSQP FT    V  +  A SGLPAQPNS NLG YQQ GN+APVNFQ
Sbjct: 481 LADLL-PPETLPAAVSQPTFTSNQPVQPNSHAASGLPAQPNS-NLGNYQQDGNIAPVNFQ 540

Query: 541 NPTEPGREFNNGMFMVPGSAPTHVNSYMAPPNAGPSTHPNNFGYSQDGSVAPTSSHVALQ 600
           N TEPGREF NGMF+ PG  P H  SYMAPPNAGP+  PNNFG   +GS  P SSH+ LQ
Sbjct: 541 NQTEPGREFGNGMFVAPGGIPAH-GSYMAPPNAGPNAQPNNFGTYHNGSAVPASSHLTLQ 600

Query: 601 TTQPPAQLPSG-NFNPPHASVAPVASQLSYQAPNFPVNVSNPDVMGSF--QAGNYTSMAS 660
           TT+PPA LPSG NFNPP  S   VASQ+SYQ  NFPV  S  +VMGSF  QAGNYTSMAS
Sbjct: 601 TTRPPAHLPSGNNFNPPQGS---VASQVSYQTSNFPVVKS--EVMGSFNSQAGNYTSMAS 660

Query: 661 QQIPPSGSLSNASQPSKNKFETKSTVWTDTLNRGLVNLNISGPKTNPLADIGVDFEALNR 720
           QQ PP+GSLS ASQ S NKFETKSTVW+DTL+RGLVNLNISGPK NP+ADIGVDFEALNR
Sbjct: 661 QQNPPAGSLSTASQASNNKFETKSTVWSDTLSRGLVNLNISGPKANPMADIGVDFEALNR 720

Query: 721 KEKRMEKPSTAPAISTINMGKAMGSGSGIGRAGASALRPPPNAMSGP--------GMGMG 780
           KEKRMEKPSTAP +STINMGKAMGSGSGIGR GASALRPPPNAMSG         GMGMG
Sbjct: 721 KEKRMEKPSTAPVVSTINMGKAMGSGSGIGRVGASALRPPPNAMSGSGSGMGMGMGMGMG 780

Query: 781 MNPNPGMGMGM-RGGYGGMNQPMGGMGMNMGMAQHGIQMQQPRANMPGAYNPMMGAGGYA 834
           MNPNPGMGMGM   GYGGMNQPMGGMGMNMGM Q G QMQQPRANMPG YNPMMG GGYA
Sbjct: 781 MNPNPGMGMGMGMRGYGGMNQPMGGMGMNMGMGQQGFQMQQPRANMPGVYNPMMGGGGYA 840

BLAST of Cp4.1LG08g05680 vs. NCBI nr
Match: gi|659107950|ref|XP_008453936.1| (PREDICTED: clathrin interactor EPSIN 2 isoform X1 [Cucumis melo])

HSP 1 Score: 1281.5 bits (3315), Expect = 0.0e+00
Identity = 694/853 (81.36%), Postives = 732/853 (85.81%), Query Frame = 1

Query: 1   MKKAFDQTVRDLKREVNKTVLKIPKIEQKVLDATSNEPWGPHGSLLADIAQATRNYHEYQ 60
           MKKAFDQTVRDLKREVNKTVLKIPKIEQKVLDATSNEPWGPHGSLLADIAQATRNYHEYQ
Sbjct: 1   MKKAFDQTVRDLKREVNKTVLKIPKIEQKVLDATSNEPWGPHGSLLADIAQATRNYHEYQ 60

Query: 61  MIMGVIWKRINDTGKNWRHVYKGLTVLEYLVGHGSERVIDDIREHVYQISTLSNFQYIDS 120
           MIMG++WKRINDTGKNWRHVYKGLTVLEYLVGHGSERVIDDIREH YQISTLS+FQYIDS
Sbjct: 61  MIMGILWKRINDTGKNWRHVYKGLTVLEYLVGHGSERVIDDIREHAYQISTLSDFQYIDS 120

Query: 121 SGRDQGNNVRKKSQNLVALVNDKERIVEVRQKAAANKDKFHSAASMGSMYRPSAGAYDDR 180
           +GRDQGNNVRKKSQNLVALVNDKERI+EVRQKAAAN+DKF SA+SMGSMYRPS+G YDDR
Sbjct: 121 NGRDQGNNVRKKSQNLVALVNDKERIIEVRQKAAANRDKFRSASSMGSMYRPSSGGYDDR 180

Query: 181 YEGRYGSRDGDRNVDSYGREREYGFRDDRSGGNEDSYGRDYEDRYNRDGYRDDDYRGRSR 240
           YEGRYGSRDGDRNVDSYGRER+YGFRDDRSG NEDSYGRDYE+RYNRDGY+DDDYRGRSR
Sbjct: 181 YEGRYGSRDGDRNVDSYGRERDYGFRDDRSGRNEDSYGRDYEERYNRDGYKDDDYRGRSR 240

Query: 241 SVDDYQYGSRSRSSDRNGERTY-DDGQVSSRNNDTRANEPSRDERPLERKFSEQNIGAPP 300
           ++DDYQYGSRSRSSDR+GER Y DDGQVSSRNN  R +EPS   R LERKFSEQNI APP
Sbjct: 241 NIDDYQYGSRSRSSDRDGERAYDDDGQVSSRNNGARPDEPSPVGRQLERKFSEQNI-APP 300

Query: 301 SYEEVVTESESNVQSQ-----RDVEAPPTAAPRAFPPPTSSTQSQLTSHGTAASPPTQGL 360
           SYEE V ES S V SQ     R+VEAP T APRAFPPP SST SQ T+HGT ASP  QG 
Sbjct: 301 SYEEAVNESGSTVPSQSVSVNREVEAPATTAPRAFPPPVSSTPSQQTTHGTTASPHPQGP 360

Query: 361 DGSDEFDPRGSVPAAPPATSNLETNLLDSLALVPVGPVTSTADYEGHVQTSSAVGYHAQN 420
           DGSDEFDPRGSVP AP A+SNLETNL DSLALVPVGPVTS+AD E HVQTS+ VG   QN
Sbjct: 361 DGSDEFDPRGSVPVAPNASSNLETNLFDSLALVPVGPVTSSADSECHVQTSATVGSFTQN 420

Query: 421 QTFEDAFGDSPFKAISSSDVQDQAHVQHGESFSVATYSTPNIPVQPQPNFPQPREESLQH 480
           QTFED FGDSPFKAISSS VQDQ H Q GESFS ATYS P++PVQPQPN   PREE+LQH
Sbjct: 421 QTFEDPFGDSPFKAISSSGVQDQTHFQRGESFSAATYSKPDVPVQPQPNLHHPREETLQH 480

Query: 481 QNIGVLADLLPPPEPLPAVVSQPAFTVSN-----SQQAVSGLPAQPNSNNLGTYQQHGNV 540
           QNIGVLADLL PPE LPA VSQP +T SN     + QA SGLPAQ N  NLG YQ+ GN+
Sbjct: 481 QNIGVLADLL-PPETLPAAVSQPTYTTSNQPVQPNSQAASGLPAQSNP-NLGNYQRDGNI 540

Query: 541 APVNFQNPTEPGREFNNGMFMVPGSAPTHVNSYMAPPNAGPSTHPNNFGYSQDGSVAPTS 600
           A VNFQN TEPGREF NGMF+ PG  P HV SYMAPPNAGP+  PNNFG S +GS  P S
Sbjct: 541 AAVNFQNQTEPGREFGNGMFVAPGGIPAHV-SYMAPPNAGPNAQPNNFGTSHNGSAVPAS 600

Query: 601 SHVALQTTQPPAQLPSGN-FNPPHASVAPVASQLSYQAPNFPVNVSNPDVMGSF--QAGN 660
           SH  LQTT+PPA LPSGN FN P  SVAPVASQ+SYQ  NFPV  S  +VMGSF  QAGN
Sbjct: 601 SHHTLQTTRPPAHLPSGNYFNAPQGSVAPVASQVSYQTSNFPVVKS--EVMGSFNSQAGN 660

Query: 661 YTSMASQQIPPSGSLSNASQPSKNKFETKSTVWTDTLNRGLVNLNISGPKTNPLADIGVD 720
           YTSMASQQ PP+G LS ASQ SKNKFETKSTVW+DTL+RGLVNLNISGPK NP ADIGVD
Sbjct: 661 YTSMASQQNPPAGPLSTASQASKNKFETKSTVWSDTLSRGLVNLNISGPKANPTADIGVD 720

Query: 721 FEALNRKEKRMEKPSTAPAISTINMGKAMGSGSGIGRAGASALRPPPNAMSGP------G 780
           FEALNRKEKRMEKPSTAP +STINMGKAMGSGSGIGRAGASALRP PNAMSG       G
Sbjct: 721 FEALNRKEKRMEKPSTAPVVSTINMGKAMGSGSGIGRAGASALRPLPNAMSGSGSGMGMG 780

Query: 781 MGMGMNPNPGMGMGMRGGYGGMNQPMGGMGMNMGMAQHGIQMQQPRANMPGAYNPMMGAG 834
           MGMGMNPNPGMGMGMR GYGGMNQPMGGM MNMGM Q G QMQQPRANMPG YNPMMG+G
Sbjct: 781 MGMGMNPNPGMGMGMR-GYGGMNQPMGGMSMNMGMGQQGFQMQQPRANMPGVYNPMMGSG 840

BLAST of Cp4.1LG08g05680 vs. NCBI nr
Match: gi|645231986|ref|XP_008222654.1| (PREDICTED: LOW QUALITY PROTEIN: clathrin interactor EPSIN 2-like [Prunus mume])

HSP 1 Score: 792.0 bits (2044), Expect = 1.0e-225
Identity = 523/963 (54.31%), Postives = 607/963 (63.03%), Query Frame = 1

Query: 1   MKKAFDQTVRDLKREVNKTVLKIPKIEQKVLDATSNEPWGPHGSLLADIAQATRNYHEYQ 60
           MKK FDQTVRD+KREVNK VLK+P IEQKVLDATSNEPWGPHGSLLADIAQATRNYHEYQ
Sbjct: 1   MKKVFDQTVRDIKREVNKKVLKVPGIEQKVLDATSNEPWGPHGSLLADIAQATRNYHEYQ 60

Query: 61  MIMGVIWKRINDTGKNWRHVYKGLTVLEYLVGHGSERVIDDIREHVYQISTLSNFQYIDS 120
           MIM VIWKR++DTGKNWRHVYK L VLEY+V HGSERVIDDI+EH YQISTLS+FQYIDS
Sbjct: 61  MIMSVIWKRLSDTGKNWRHVYKALIVLEYMVAHGSERVIDDIKEHAYQISTLSDFQYIDS 120

Query: 121 SGRDQGNNVRKKSQNLVALVNDKERIVEVRQKAAANKDKFHSAASMGSMYRP----SAGA 180
           SGRDQG+NVRKKSQ+LVALVNDKERI+EVRQKAAAN+DKF + +S G MYRP    S G 
Sbjct: 121 SGRDQGSNVRKKSQSLVALVNDKERIIEVRQKAAANRDKFRNTSSPGGMYRPGSYSSTGG 180

Query: 181 Y-----DDRYEGRYGSRD--GDRNVDSYGREREYGFRDDRSGGNEDSYGRDYEDRYNRDG 240
           Y     DDRYEGRYG +   G R+ D YG++ +   RD       D YGR+Y++R  R+G
Sbjct: 181 YGDKYDDDRYEGRYGXQGELGYRDDDRYGKQGDSYSRDG------DRYGREYDERNGREG 240

Query: 241 YRDDDYRGRSRSVDDYQYGSRSRSSDRNGERTYDD-GQVSSRNNDTRANEPSRDERPLER 300
           +RDDDYRGRSRSVDDY + SRSRSSDR  ER+ DD GQ SSR +  RA++ S+D R L R
Sbjct: 241 FRDDDYRGRSRSVDDY-HDSRSRSSDRERERSLDDDGQYSSRGSGARADDQSQDGR-LSR 300

Query: 301 KFSEQNIGAPPSYEEVVTESESNVQSQRDVEAPPTAAPRAFPPPTSSTQSQLTSHGTA-A 360
           KFSEQNIGAPPSYEEVV+ES S V S+R  E P  +APRA  PPTS+   Q TS   A A
Sbjct: 301 KFSEQNIGAPPSYEEVVSESRSPVHSERGGETPTASAPRASSPPTSTNPGQATSAQVASA 360

Query: 361 SPPTQGLDGSDEF-DPRGSVPAAP--------------------------PATSN-LETN 420
           SP  Q ++ SDEF DPRGSV   P                          PATSN +E +
Sbjct: 361 SPVKQEVEPSDEFFDPRGSVSGVPFGSVSAAAQTAQAAPAAQTAQAASAAPATSNNVEID 420

Query: 421 LLDSL---------ALVPVGPVTSTADYEGHVQTSSAVGYHA-------QNQTFEDAFGD 480
           LL SL         A+VP    T+T + + H  + SA  + A        +Q+F+D FGD
Sbjct: 421 LLGSLSDSFSSNALAIVPTTSPTTTFEPDAHANSGSATTFVATPSASNVMSQSFDDPFGD 480

Query: 481 SPFKAISSS-------------DVQDQAHVQHGESFSVATYSTPNIP-VQPQPNFPQ--P 540
           SPF+A+ SS             D Q  A+   G+SFS  TYS P +  VQ  P  PQ  P
Sbjct: 481 SPFRALPSSETVQPQPQTSTPTDNQSAANFPFGDSFSAVTYSAPGVSSVQTPPTNPQFLP 540

Query: 541 REESLQHQNIGVLADLLPPPEPLPAVVSQPAFTVSNSQQAVSGLPAQPNSNNLGTYQ--- 600
           +E+S +H N  +LAD+LPPP P P + SQP F+ S  Q      P+QPN+N  G +    
Sbjct: 541 QEQSAEH-NTDILADILPPPGPSPVMTSQPPFSGSTGQ------PSQPNANMYGNFHAQP 600

Query: 601 ----------------------QHGNVAPVNF----QNPTEPGREFNNGMFMVPGSAPTH 660
                                 Q G  AP+      Q PT P  +FN+G F+        
Sbjct: 601 GAIVPHNQTGFAGQNSSGGFSPQGGPTAPITSHVAPQTPTGPIAQFNSGNFISQQGG--- 660

Query: 661 VNSYMAPPNAGPSTHPNNFGYSQDGSVAPTSSHVALQT-TQPPAQLPSGNFNPPHASVAP 720
                + PN+G      NF   Q GS AP +S++A QT T P AQL  GNF+P   SV P
Sbjct: 661 ----FSAPNSG------NFFPQQGGSTAPITSYMAPQTHTGPAAQLNGGNFHPQQGSVGP 720

Query: 721 VASQLSYQAPNFPVNVSNPDVMGSF--QAGNYTSMASQQIPPS--GSLSNASQPSKNKFE 780
           VASQ  +QAP  P    N DV+G+   Q G  TSM S Q  PS  G+LS  +QP K+KFE
Sbjct: 721 VASQAVHQAPTGPGLQHNSDVLGNLFPQTGPNTSMGSHQALPSSTGALSIVAQPPKDKFE 780

Query: 781 TKSTVWTDTLNRGLVNLNISGPKTNPLADIGVDFEALNRKEKRMEKPSTAPAISTINMGK 834
            KS VW DTL+RGLVN NISG K NPL DIG+DF+++NRKEKRMEK    PA ST+ MGK
Sbjct: 781 PKSAVWADTLSRGLVNFNISGAKINPLNDIGIDFDSINRKEKRMEKQPATPAASTVTMGK 840

BLAST of Cp4.1LG08g05680 vs. NCBI nr
Match: gi|743922088|ref|XP_011005122.1| (PREDICTED: clathrin interactor EPSIN 2-like isoform X3 [Populus euphratica])

HSP 1 Score: 752.3 bits (1941), Expect = 9.1e-214
Identity = 503/930 (54.09%), Postives = 592/930 (63.66%), Query Frame = 1

Query: 1   MKKAFDQTVRDLKREVNKTVLKIPKIEQKVLDATSNEPWGPHGSLLADIAQATRNYHEYQ 60
           MKK F QTVRD KREVNK VLK+P IEQKVLDATSNEPWGPHGSLLADIAQA+RNYHEYQ
Sbjct: 1   MKKVFGQTVRDFKREVNKKVLKVPSIEQKVLDATSNEPWGPHGSLLADIAQASRNYHEYQ 60

Query: 61  MIMGVIWKRINDTGKNWRHVYKGLTVLEYLVGHGSERVIDDIREHVYQISTLSNFQYIDS 120
           MIM V+WKRINDTGKNWRHVYK LTVLEYLV HGSER ID+IREH YQI+TLS+FQYIDS
Sbjct: 61  MIMAVLWKRINDTGKNWRHVYKALTVLEYLVAHGSERAIDEIREHSYQITTLSDFQYIDS 120

Query: 121 SGRDQGNNVRKKSQNLVALVNDKERIVEVRQKAAANKDKFHSAASMGSMYRPSAGAYDDR 180
           SG+DQGNNVRKKSQ+LV LVNDKERIVE RQKAAAN+DKF +A S G MYRP  G+YDD 
Sbjct: 121 SGKDQGNNVRKKSQSLVVLVNDKERIVEARQKAAANRDKFRNA-SPGGMYRP--GSYDD- 180

Query: 181 YEGRYGSRDGDRNVDSYGREREYGFRDD-RSGGNEDSYGRDY----EDRYNRDGYRDDDY 240
            +GRYG+RD DRN   YG+EREY +RDD R G   DSYGRD     E+RY RDGYRDDDY
Sbjct: 181 -DGRYGNRDEDRNGYGYGKEREYNYRDDERYGKYGDSYGRDGDHNGEERYGRDGYRDDDY 240

Query: 241 RGRSRSVDDYQYGSRSRSSDRNGERTYDD-GQVSSRNNDTRANEPSRD---ERPLERKFS 300
           +GRSRS+DDY  GSRSRSSDR+ +  +DD GQ SSR    RA++ S D    + LERKFS
Sbjct: 241 QGRSRSIDDY--GSRSRSSDRDRDHAFDDDGQSSSRG--ARADDQSHDGSIAKRLERKFS 300

Query: 301 EQNIGAPPSYEEVVTESESNVQSQRDVEA-------------PPTAAPRAF---PPPTSS 360
           EQNI  PPSYEE ++ES S   S+R+ EA             P +++P AF    PP S+
Sbjct: 301 EQNISGPPSYEEALSESRSPAHSERNGEALSASAPVASSPPVPRSSSPPAFNAASPPPSN 360

Query: 361 TQSQLTSHGTAASPPTQGLDGSDEFDPRGSVPAAPPATS------------NLETNLLDS 420
              + T   T ASP  Q ++ +DEFDPRG + A P ATS            N E +LL S
Sbjct: 361 PGQENTFFVTPASPADQEVEVADEFDPRGPISAPPTATSVQTASAFTPTLNNAEMDLLGS 420

Query: 421 L---------ALVPVGPVTSTADYEGHVQTSSAVGYHAQ------NQTFEDAFGDSPFKA 480
           L         A++PV   T+T++ +     S +     Q      NQ FED FGDSPFKA
Sbjct: 421 LSDVFTPNPLAIMPVTSATTTSEADSQTNFSGSTFAATQSPSNVMNQAFEDPFGDSPFKA 480

Query: 481 I----------------------SSSDVQDQAHVQHGESFSVATYSTPNI-PVQPQPNFP 540
                                   ++++ +     +G++FS  TYS PN+ P    PNF 
Sbjct: 481 TPTDAFSAQQPSASAAPFQPTMNQNTEMPNAVAPPNGDTFSAMTYSAPNVQPPSTNPNF- 540

Query: 541 QPREESLQHQNIGVLADLLPPPEPLPAVVSQPAFTVSNSQQAVSGLPAQPNSNNLGTYQQ 600
            P+E S  H    +LAD+LPP  P  AV SQ  F++       SG   QP ++  G +  
Sbjct: 541 LPQEMSSSHAETDILADILPPSGP-SAVASQAGFSLP------SGHHPQPGASVYGNFNS 600

Query: 601 H-GNVAPVNFQNPTEPGREFNNGMFMVPGSAPTHVNSYMA-PPNAGPSTHPNNFGY-SQD 660
             GN+      +    G++ ++G F   G +P  ++S M+  P AGP    NN     Q 
Sbjct: 601 PAGNMVLPAAAHMAPQGQQLSSGDFFTQGGSPAPIHSNMSLQPPAGPVVQFNNGNLVPQQ 660

Query: 661 GSVAPTSSHVALQT-TQPPAQLPSGNFNPPHASVAPVASQLSYQAPNFPVNVSNPDVMGS 720
           GS AP  S  +  T T    Q   G+F P H S  PVASQ +YQ P+      N DV+G+
Sbjct: 661 GSTAPVVSQFSHHTPTGSAPQYNGGSFLPQHGSAFPVASQFTYQTPSASSPQHN-DVLGN 720

Query: 721 -FQAGNYTSMASQQ-IPPS-GSLSNASQPSKNKFETKSTVWTDTLNRGLVNLNISGPKTN 780
            F  G   S ASQ  +PPS GSL+   QPSK+KFETKSTVW DTL+RGLVNLNISGPKTN
Sbjct: 721 LFSQGPNISTASQTALPPSTGSLAIVPQPSKDKFETKSTVWADTLSRGLVNLNISGPKTN 780

Query: 781 PLADIGVDFEALNRKEKRMEKPSTAPAISTINMGKAMGSGSGIGRAGASALRPPPNAMSG 834
           PLADIGVDF+ALNRKEKR EK    PA+STI MGKAMGSG+G+GRAGA  LRPPPN   G
Sbjct: 781 PLADIGVDFDALNRKEKRTEKQPVTPAVSTITMGKAMGSGTGLGRAGAGVLRPPPNPTIG 840

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
EPN2_ARATH4.2e-13442.98Clathrin interactor EPSIN 2 OS=Arabidopsis thaliana GN=EPSIN2 PE=1 SV=1[more]
EPN3_ARATH3.9e-12445.76Clathrin interactor EPSIN 3 OS=Arabidopsis thaliana GN=EPSIN3 PE=2 SV=1[more]
EPN1_ARATH5.2e-5233.00Clathrin interactor EPSIN 1 OS=Arabidopsis thaliana GN=EPSIN1 PE=1 SV=1[more]
EPN1_RAT3.6e-3747.77Epsin-1 OS=Rattus norvegicus GN=Epn1 PE=1 SV=1[more]
EPN1_HUMAN3.6e-3747.77Epsin-1 OS=Homo sapiens GN=EPN1 PE=1 SV=2[more]
Match NameE-valueIdentityDescription
A0A0A0KZ33_CUCSA0.0e+0081.29Uncharacterized protein OS=Cucumis sativus GN=Csa_4G017040 PE=4 SV=1[more]
B9IAE1_POPTR2.4e-21353.79Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0014s14930g PE=4 SV=2[more]
W9R4X1_9ROSA4.1e-21353.41Uncharacterized protein OS=Morus notabilis GN=L484_024722 PE=4 SV=1[more]
A0A059AUM7_EUCGR1.1e-21054.39Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_H00432 PE=4 SV=1[more]
A0A059AUX7_EUCGR6.2e-20954.22Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_H00432 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT2G43160.32.4e-13542.98 ENTH/VHS family protein[more]
AT3G59290.12.2e-12545.76 ENTH/VHS family protein[more]
AT5G11710.13.0e-5333.00 ENTH/VHS family protein[more]
AT3G46540.11.6e-1435.77 ENTH/VHS family protein[more]
AT3G23350.12.1e-1430.32 ENTH/VHS family protein[more]
Match NameE-valueIdentityDescription
gi|659107956|ref|XP_008453940.1|0.0e+0081.84PREDICTED: clathrin interactor EPSIN 2 isoform X2 [Cucumis melo][more]
gi|449468762|ref|XP_004152090.1|0.0e+0081.29PREDICTED: clathrin interactor EPSIN 2 [Cucumis sativus][more]
gi|659107950|ref|XP_008453936.1|0.0e+0081.36PREDICTED: clathrin interactor EPSIN 2 isoform X1 [Cucumis melo][more]
gi|645231986|ref|XP_008222654.1|1.0e-22554.31PREDICTED: LOW QUALITY PROTEIN: clathrin interactor EPSIN 2-like [Prunus mume][more]
gi|743922088|ref|XP_011005122.1|9.1e-21454.09PREDICTED: clathrin interactor EPSIN 2-like isoform X3 [Populus euphratica][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR013809ENTH
IPR008942ENTH_VHS
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG08g05680.1Cp4.1LG08g05680.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR008942ENTH/VHSGENE3DG3DSA:1.25.40.90coord: 24..162
score: 2.8
IPR008942ENTH/VHSunknownSSF48464ENTH/VHS domaincoord: 6..149
score: 1.4
IPR013809ENTH domainPFAMPF01417ENTHcoord: 25..145
score: 7.0
IPR013809ENTH domainSMARTSM00273enth_2coord: 24..150
score: 4.7
IPR013809ENTH domainPROFILEPS50942ENTHcoord: 18..150
score: 40
NoneNo IPR availablePANTHERPTHR12276EPSIN/ENT-RELATEDcoord: 298..832
score: 1.8E-261coord: 1..250
score: 1.8E
NoneNo IPR availablePANTHERPTHR12276:SF45CLATHRIN INTERACTOR 1coord: 298..832
score: 1.8E-261coord: 1..250
score: 1.8E