Cp4.1LG01g06100 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g06100
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: ENTH/ANTH/VHS superfamily protein
Location: Cp4.1LG01 : 309502 .. 316336 (+)
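
The location string above packs the scaffold name, start, end, and strand into one field. The sketch below shows one way to split it and derive the genomic span. The function names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re

def parse_location(text):
    """Split 'seqid : start .. end (strand)' into (seqid, start, end, strand)."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates (not confirmed by the page).
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG01 : 309502 .. 316336 (+)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span covers the whole gene region, introns and UTRs included, so it is expected to be much longer than the CDS.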
The following sequences are available for this feature:

Gene sequence (with intron)

AAAGTAGGAAGAGTAAGGGCCATTATTCGATCCTTCCTCTCCGCACAATAAATCCTCTCGTTCTTCGTTGGATCATAGAGACGGAATCACAGAGTAATCAGAGAATTCAGGCCATCTCAACACGTAACACTGCTAATTTTTAGAGACAGTGCTTCAAGAATATGGCTACGCTTCAGACGTGGAGGAAAGCCTATGGCGCTCTCAAGGATTCTACCAAAGTCGGCCTTGCCCATGTCAACAGCGAGTACGCGGTATTCTCTCATTTTTCTTTTTTAGATTCTCTCCTCTCCTTTTCTTCTTTATTTTCGATTTTGTCGATCGGTGTTTGATCCGATTTCTGCGTTTTTGCCTTGTCTTATTCTGCGGGATTCTTGTCTGAGTTTAGGATTTGGATGTGGCAATAGTCAAATCTACTAACCACGTCGAGTGCCCGCCGAAAGAGAGACATCTCAGGAGTGAGTGATTGCTGTTGCTAGATCCGTTTTGTTTCTCTGATTCCAAATCTACTGTTTTAGTCTGATTTTCTGTTTGTGTTTTCAGAAATCTTGATTGCTACATCTGCAATTAGACCTCGCGCTGATGTTGCTTATTGCATTCATGCCCTCGCTCGACGCTTGTCCAAGACGCGCAATTGGACGGTACCAATTTTATTCACCTTGGTTGAATTTCTCTGTGGAATTTGTTATTCTTTCTGGCATTAGCCTCTTTTTATCTGGATGATAACTGTCGTATAAGAAACATGTGCGGTTTCGTTAGCTATCCTCTCTTTTGAATGGTTATGTTCCTGACGTAACTGGTTTTTCTTTAGTGCTCCTAATTATCTGCATTGGATTTGGTACTTGCATTATCTGCGTTGTTTTTCAAGTTTGATACTCCACGATTGAAAACTGCGATATGTAATTTCTTTTTTCTGATTCTTAATATTACCATTTGAAAGGAAATGCGTATTTCTGTAATTTCTGGTGAATTTATATTTCTCCTGCTATGATAAATACAAAACCATTTGGGTCTAACCCTTTATTAGTTGAAGTTTTTGTTTCTAATTTCTAGGGTCTCATCTATTTATGATAATGAAGTATGATCTGTGTGGCCTGAAGATTGAAATATTTCCTGATTCCCAAAACGAAATGTCTGTATCTCATAGTCATTTTGACAAATATGATATTGAATAATCCATGATTGTCAGTGGAATGAATGTCACAATATAATGCTCATCAGCTGACTCATTGTTTACTGTTTCTTTTGACTCCATTTCATTTTGCATCCACTATATAAGTTTCTTCATGTTTTACAGGTAGCTTTGAAAACATTAATCGTCATACATAGGACCTTGAGGGAGGGCGATCCAACATTTAGGGAAGAGCTTTTGAATTTTACACAAAGAGCCCGAATTATTCAACTGTCTAATTTTAAGGATGATTCAAGTCCTATCGGTAAAATTCTTTAGATCCTCTTCTCTTATTTTTCTGTGATTCATACAAATGATACACTAATAGTGTAATCAATAATAATAGATGACTGATACGTATATCCTTTTCATAGCTTGGGATTGCTCTGCATGGGTACGCACATATGCATTGTTTTTAGAGGAGCGACTCGAATGTTTCAGGATACTAAAGTATGACATTGAATCCGAACGCCTGCCAAGACCTGCCCAAGGTCAGGAGAAGGTATTGTATTGAAGAATTTTATTAATTCACCTGGCCTCCTTGCATGGTACATGTATGAGTGAGTTATAACATCTTTTTCAGGTTTATAATATGGAAGATGCATGCCCTTTTTGACTATCGGTAATAATAATGTGATTATAGTTTTCTTATTATAAACATCCCAATTAACTGTAGAGATTTTTTTTTTTCTTTTTGTAAACTATGGATATGGTTGCCTCTACTGAGGTGGAAGCTTATTCAGTTGACTGGGGGTGAAGTTTATTAGCCTAATGACAATTTTTCATGATCTCTGGATGGTCAAGGGGACCTAGTAGAGTACTGAAATGCCTA
TTTTCACCCAAGTTAAATCCAATGGTTGAATCCTAATTTCTAGGTTATTATTGGTTAAATGCTGGCTCAACTCTTCCTGAAATGACACTCAGGTTTTACAACCTGCATTTTATACGATCCTTTGGATGGACTCTTTAAGAGAAATGAAGGCTTTTCTATGAACTTTCTATTTAGTATCAATACAAATTAAATTTGGTAAAGAATTCTTCCTTTTCCCTTAATCCTCAATCTCAATGGCATTGGTTTTGTCTCTATAATATGGTTTGGACGAGAACAATAATATAGTTTCATATGTATTAAAACAAACCAAAGAAATACATGGAAGAGTCAGCACTTTGGACTGTGTTTCGAGATATTCTTCTTTGGTGTTGTCACTGCAATGATTCATTCGGTGTAAATGTCAGGCTGATGACCTTCTTCACGTATTATTGGGTTGCCGGTTTGTTCAGTCCCTTTGGGCTAGGTGGTTGTCCACCTTTGGTCTAACTTTGCCTTTTCGTTAAGAGTTGCATTATTTTATGGCACGCCAATTTCTTTGCTATTTTGTGGGGGATTGGATTGAAAGAAATAATCAAACTTTTAGAGATTTGGAGCAATCTTGTGATGAGGTCTGAACGGTGGTTAGGTTTAATGTCTCTTCGTGGATATCAACCACTAGACCTTTTATTTTAGTCCATTTATATAGTCTTTAGGAAGTTCGTTTTGCATTTTTTTTTTGTTGCCCAATTATACCGTCATTNTAAAAATGCCCTATACTAACCTATGCCTAGGAAAAAAAAAATAAAAAGAAGAAAAGAGATGTAGTAGCAAAGCCACTAGTTGAACATATTGGATATGAGTTTGATAGTTATAGGGCACATTTGGAACATTTATTAAACATAATAAACATGTGTCATTATACATACAAAGATATGCATTTTGGAGTGGCTAAGGAGTGTCCTTAGTTCATAGATTCTTGAAATTCATTAGCCATGGTTCCTTTTGTTGGCAGGGCTACAGCAGAACCAGGGAACTGGACAGTGAAGAACTGTTGGAACATTTGCCTGCCTTGCAACAGCTGTTGTATCGTCTTATTGGCTGCAGGGTATCTTTTTGACACCTGTGAACCTCCAACTTTCATTTGATAATTAAATAAACTTTTTCAGAATATGGATTTTTTTTTAGAACTCTTTACTTTGACCTTCTAAGTACGAAAATTACCTGATGTTGAAATTACAGCCTGAAGGAGCAGCTATTGGGAATTATGTTATACAGTATGCCCTGGCACTGGTATGCTGAATTTCTTCTACTTATATCTCAGTTCCAGGATAATTTTTATTGTAGCAGAGAATATTATTAATGTTAATGTTTTCTCTATTATTATTATAGTTGTTATCATTATTTTTCATCTTTTGTGGGCCATATCCTCTTTTTCTTAGCTTCTTAGGTAGGCTTAGTATAGGTAATTAGACTTCCTTTCCTTTACCTGTCCACTTAACGTTCAATGGATAAGAAAAAATGCAAGGTCGTAATGGATTGAAATTTAAGAGTTATAAGAAAGATACATGCCTCCCCTCATCCTACTTAACGTTCAATGAACAGCTTTACTACTTTGTTGGGGTTGGGGGTGTCTTATTCACATGGTGAAATTTAAAAAAAAAAAAACTTGGACATCTTTGATTTTGTTATTTGACCAGCCGCCCTCTCTTTCTCCATGTTTCATATGTAAGGACCAGGATAATGCTCTTTATGTGGGCAAAAGCCATTGTTCTTGTATGCCTGTATCCTCTGTTATGTGGGAAAGAGTTCTAAAATTCTCTCAGCCTTTTGGCTGAGATCAAGTGTAGTATCTGTTCTCAAAGTATACTTTTCATTAACATTGCTTTATTCCCAGTCTTGATAATGCCGTGACATGCTATTTTCAGGTATTGAAAGAGAGCTTTAAAATCTATTGTGCTATTAATGATGGAATTATAAATCTCGTTGACAAGGTAAGTATCTTAAGGAATCAAAACTTTAGG
ACTCAGATACATAGAGTGCATTGGTGACAGAAAAGTTATTCAATTTCCTACAGTTTTTTGAGATGCCAAGACATGAGGCTATCAAAGCCCTTGATATCTACAAAAGAGCTGGCCAACAGGTATTTCTGAACTCATTTTTGTTTCTAAATATTATAGTTATCTTTTGCAATGGTTTCTTGTTTTTGGTTGTTTTCCCTTCAAATGTCTCTCAACCTTATTTATATCAGACACCAACAGGTATTACGGGTTTATTTGTCTTGCCTTTTCCATGCTGCAGGCCGGAAGCCTATCAGATTTCTATGATATTTGCAAAGGGTTGGAACTTGCTAGGAATTTCCAGTTTCCTGTTTTAAGAGAAGTAATGATGATATTATAGCGATTTATGTTTCTTATCAAGTTTAAGCTTATAGTATGAACTTATACTTCCCTACTACATGCTTTGTTATCCACTTCCAGCCTCCGCAGTCATTTCTTAATACGATGGAAGAGTACATTCGGGAGGCACCACGAATGGTTACGGTACCAAATGAACCACTGGTTAGTATATTCTTGACCACTTTTCATACCAATAATTGGATTAATGTGTGAGAATTTGCTTTTAGGATCTGTTTTCGTGTTTCTAATCTACTTTACAGGGTACCGTGACTTTCTTTTCTCTTTTTATTGTCTGTTTGTGTCCTAGAGTATATTATTTATGTATATTAGAAATTTTCTGTTTCACAATGTTTAAATTTATATTTTCTCATCCTGGACAGTTGCAACTTACTTACAAGCCGGAAGATTCTCCTTCTGAAGATCCGAACTTACCCACTGATGAAACGGAGGCATCTCCTTCACATGATCTTTCTATTACTCCTGTTGATTCGGCTCCATTGCCACCTCCAGTACCAGCTCCGGCTCCTGAAAGGCATTTAGATACTGGAGATCTATTGGTATAGATATATATATATATATATATTTTTTTTGTAATAGATGATAAATGCTTTAATCCACCATCCTTTCCACCTTTTAAAATTACAACAAATAAATAAAGCTTTTTGCTTGGTGGTTTCTTCAGGGATTGAGTCTTGATACCACCGAAGTATCTGCGATTGAGGACAGAAATGCTTTGGCTTTAGCCATAGTTCCTTCTGGTGGTAAATTCTCAGTCTCTATTTCTAGTTGGGTTACTTGAGAAATTTCACTAACCACTTTTGTTAATGGTCGCGAACCTAATATCAGATGCAGCAGCACCCACATTTCATTCCAATGGTGCACAGCCAAAGGATTTCGATCCTACTGGATGGGAGCTTGCCTTGGTCACCACTCCAAGTACTAACCTTTCATCAACTAATGAGAGACAACTGGTAGGCTTTAATGCGATTGTGAATTTCAAACCATTTCAGTCTCCAGTGTAAAACTTCTAGCCATTATTACTTGAGAACAATTCTGGAATGGAACGAACATAGTTAGGGCCTTGATAAGAATCTTTTAGGGGTTTCCTTGAATTTTTGTCAGATGAGCAAGTCTTGGATTTAGCTAGTGGACATTTCTGAATTAGATTCTTTGAAGTTTAAGGGGATTATTAGGATCACGGGAATCCAGAGTTCTAAGGCATTAGGAATATAACTATTGTGTAGGTAAACGCTGGTGATATATTCTTCCATCAAATTGATAGATCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGTTGTTGTCATTCTATGATGTACCAGGCTGGTGGGCTGGACACACTCACTCTCGACAGTCTATACGATGAGGGTGCATATAGAGCTTCTCTACAGCCCGTGTATGGTAAGCCCGCACCAAATCCATTTGAGGTGCAAGATCCATTTGCATACTCAAATGCCATCGCTCCAGCTCCATCAGTTCAAATGGCGCCAATACCTCAGCAGCAGGCTAATCCCTTTGGTCCATACCAACCTACCTTTTCACAGCAACAATCATTTACGATGGATCCTACAAATCCTTTTGGTGATGCAGGTTTT
GGAGCATTCTCTGCTCCTAACCACCATACTGTTCCTCCAAATGCTAACAATCCATTTGGAAGCACAGGCCTTCTGTAGAGAGCTCGTGGGGCTGTTTCCAACGTGTAATTTAGTCTACAGGTTTGGTTTTCCACCATATTTGGCGAGATGTAAAGGGTCGATTTTCTTGAAGCTGCAATGAACGGTTGTTTCTGGGGCGTGCGACATTTGTTTGGGTTGAAACTTCCAGTGAGTGTATAGTGAAGGGTGTAATACGTAGCAAATGAAATTCATGACAATTTCGAGTTAGCTGGAACTCTGTCGTTCTATTGTTTTGCTGATTTTGAGTGCATTGCGCATAGAATCCTACTAAAAATTTTATGCATACCTGTATAATTCATCAAGTCATCATATTTCAGATCCTTTCATTGTTCTGCATTCTCTTTTTACATTTCACTGATCAACAGGAATTTAGGTTTACCTCAAGATGCCATGCAGGCAGAATCCAGTGGAGGTTTACCTCAAGATGCCATGCAGGCAGAATCCAGTGGAGGGAGTCTATGTTCTCATCTTTATATTTAGAATGTAAATTTTTTTCTTCCGTATTGGCCGAGATAACCTTTAGTTTCAGTCCCTTTTAGTACACATTTAGTCAGATTTGAATGGTGTCAATCGGAGCCTAAAAGTGTAACCATGAATTACTTGCCGTATTGGATTAGCCTAATATTATCATTAATGGCAGCTCCAGCTCTCCTTTTCTTTTAATTATTTATCACATGTGAACGAATTTTTCCAAATTCAGATTGCCTTGTTTGTTTGATTTGATAACTGTTAACTTTTGGTAAGAAGTAATTGT

mRNA sequence

AAAGTAGGAAGAGTAAGGGCCATTATTCGATCCTTCCTCTCCGCACAATAAATCCTCTCGTTCTTCGTTGGATCATAGAGACGGAATCACAGAGTAATCAGAGAATTCAGGCCATCTCAACACGTAACACTGCTAATTTTTAGAGACAGTGCTTCAAGAATATGGCTACGCTTCAGACGTGGAGGAAAGCCTATGGCGCTCTCAAGGATTCTACCAAAGTCGGCCTTGCCCATGTCAACAGCGAGTACGCGGATTTGGATGTGGCAATAGTCAAATCTACTAACCACGTCGAGTGCCCGCCGAAAGAGAGACATCTCAGGAAAATCTTGATTGCTACATCTGCAATTAGACCTCGCGCTGATGTTGCTTATTGCATTCATGCCCTCGCTCGACGCTTGTCCAAGACGCGCAATTGGACGGTAGCTTTGAAAACATTAATCGTCATACATAGGACCTTGAGGGAGGGCGATCCAACATTTAGGGAAGAGCTTTTGAATTTTACACAAAGAGCCCGAATTATTCAACTGTCTAATTTTAAGGATGATTCAAGTCCTATCGCTTGGGATTGCTCTGCATGGGTACGCACATATGCATTGTTTTTAGAGGAGCGACTCGAATGTTTCAGGATACTAAAGTATGACATTGAATCCGAACGCCTGCCAAGACCTGCCCAAGGTCAGGAGAAGGGCTACAGCAGAACCAGGGAACTGGACAGTGAAGAACTGTTGGAACATTTGCCTGCCTTGCAACAGCTGTTGTATCGTCTTATTGGCTGCAGGCCTGAAGGAGCAGCTATTGGGAATTATGTTATACAGTATGCCCTGGCACTGGTATTGAAAGAGAGCTTTAAAATCTATTGTGCTATTAATGATGGAATTATAAATCTCGTTGACAAGTTTTTTGAGATGCCAAGACATGAGGCTATCAAAGCCCTTGATATCTACAAAAGAGCTGGCCAACAGGCCGGAAGCCTATCAGATTTCTATGATATTTGCAAAGGGTTGGAACTTGCTAGGAATTTCCAGTTTCCTGTTTTAAGAGAACCTCCGCAGTCATTTCTTAATACGATGGAAGAGTACATTCGGGAGGCACCACGAATGGTTACGGTACCAAATGAACCACTGTTGCAACTTACTTACAAGCCGGAAGATTCTCCTTCTGAAGATCCGAACTTACCCACTGATGAAACGGAGGCATCTCCTTCACATGATCTTTCTATTACTCCTGTTGATTCGGCTCCATTGCCACCTCCAGTACCAGCTCCGGCTCCTGAAAGGCATTTAGATACTGGAGATCTATTGGGATTGAGTCTTGATACCACCGAAGTATCTGCGATTGAGGACAGAAATGCTTTGGCTTTAGCCATAGTTCCTTCTGGTGATGCAGCAGCACCCACATTTCATTCCAATGGTGCACAGCCAAAGGATTTCGATCCTACTGGATGGGAGCTTGCCTTGGTCACCACTCCAAGTACTAACCTTTCATCAACTAATGAGAGACAACTGGCTGGTGGGCTGGACACACTCACTCTCGACAGTCTATACGATGAGGGTGCATATAGAGCTTCTCTACAGCCCGTGTATGGTAAGCCCGCACCAAATCCATTTGAGGTGCAAGATCCATTTGCATACTCAAATGCCATCGCTCCAGCTCCATCAGTTCAAATGGCGCCAATACCTCAGCAGCAGGCTAATCCCTTTGGTCCATACCAACCTACCTTTTCACAGCAACAATCATTTACGATGGATCCTACAAATCCTTTTGGTGATGCAGGTTTTGGAGCATTCTCTGCTCCTAACCACCATACTGTTCCTCCAAATGCTAACAATCCATTTGGAAGCACAGGCCTTCTGTAGAGAGCTCGTGGGGCTGTTTCCAACGTGTAATTTAGTCTACAGGTTTGGTTTTCCACCATATTTGGCGAGATGTAAAGGGTCGATTTTCTTGAAGCTGCAATGAACGGTTGTTTCTGGGGCGTGCGACATTTGTTTGGGTTGAAA
CTTCCAGTGAGTGTATAGTGAAGGGTGTAATACGTAGCAAATGAAATTCATGACAATTTCGAGTTAGCTGGAACTCTGTCGTTCTATTGTTTTGCTGATTTTGAGTGCATTGCGCATAGAATCCTACTAAAAATTTTATGCATACCTGTATAATTCATCAAGTCATCATATTTCAGATCCTTTCATTGTTCTGCATTCTCTTTTTACATTTCACTGATCAACAGGAATTTAGGTTTACCTCAAGATGCCATGCAGGCAGAATCCAGTGGAGGTTTACCTCAAGATGCCATGCAGGCAGAATCCAGTGGAGGGAGTCTATGTTCTCATCTTTATATTTAGAATGTAAATTTTTTTCTTCCGTATTGGCCGAGATAACCTTTAGTTTCAGTCCCTTTTAGTACACATTTAGTCAGATTTGAATGGTGTCAATCGGAGCCTAAAAGTGTAACCATGAATTACTTGCCGTATTGGATTAGCCTAATATTATCATTAATGGCAGCTCCAGCTCTCCTTTTCTTTTAATTATTTATCACATGTGAACGAATTTTTCCAAATTCAGATTGCCTTGTTTGTTTGATTTGATAACTGTTAACTTTTGGTAAGAAGTAATTGT

Coding sequence (CDS)

ATGGCTACGCTTCAGACGTGGAGGAAAGCCTATGGCGCTCTCAAGGATTCTACCAAAGTCGGCCTTGCCCATGTCAACAGCGAGTACGCGGATTTGGATGTGGCAATAGTCAAATCTACTAACCACGTCGAGTGCCCGCCGAAAGAGAGACATCTCAGGAAAATCTTGATTGCTACATCTGCAATTAGACCTCGCGCTGATGTTGCTTATTGCATTCATGCCCTCGCTCGACGCTTGTCCAAGACGCGCAATTGGACGGTAGCTTTGAAAACATTAATCGTCATACATAGGACCTTGAGGGAGGGCGATCCAACATTTAGGGAAGAGCTTTTGAATTTTACACAAAGAGCCCGAATTATTCAACTGTCTAATTTTAAGGATGATTCAAGTCCTATCGCTTGGGATTGCTCTGCATGGGTACGCACATATGCATTGTTTTTAGAGGAGCGACTCGAATGTTTCAGGATACTAAAGTATGACATTGAATCCGAACGCCTGCCAAGACCTGCCCAAGGTCAGGAGAAGGGCTACAGCAGAACCAGGGAACTGGACAGTGAAGAACTGTTGGAACATTTGCCTGCCTTGCAACAGCTGTTGTATCGTCTTATTGGCTGCAGGCCTGAAGGAGCAGCTATTGGGAATTATGTTATACAGTATGCCCTGGCACTGGTATTGAAAGAGAGCTTTAAAATCTATTGTGCTATTAATGATGGAATTATAAATCTCGTTGACAAGTTTTTTGAGATGCCAAGACATGAGGCTATCAAAGCCCTTGATATCTACAAAAGAGCTGGCCAACAGGCCGGAAGCCTATCAGATTTCTATGATATTTGCAAAGGGTTGGAACTTGCTAGGAATTTCCAGTTTCCTGTTTTAAGAGAACCTCCGCAGTCATTTCTTAATACGATGGAAGAGTACATTCGGGAGGCACCACGAATGGTTACGGTACCAAATGAACCACTGTTGCAACTTACTTACAAGCCGGAAGATTCTCCTTCTGAAGATCCGAACTTACCCACTGATGAAACGGAGGCATCTCCTTCACATGATCTTTCTATTACTCCTGTTGATTCGGCTCCATTGCCACCTCCAGTACCAGCTCCGGCTCCTGAAAGGCATTTAGATACTGGAGATCTATTGGGATTGAGTCTTGATACCACCGAAGTATCTGCGATTGAGGACAGAAATGCTTTGGCTTTAGCCATAGTTCCTTCTGGTGATGCAGCAGCACCCACATTTCATTCCAATGGTGCACAGCCAAAGGATTTCGATCCTACTGGATGGGAGCTTGCCTTGGTCACCACTCCAAGTACTAACCTTTCATCAACTAATGAGAGACAACTGGCTGGTGGGCTGGACACACTCACTCTCGACAGTCTATACGATGAGGGTGCATATAGAGCTTCTCTACAGCCCGTGTATGGTAAGCCCGCACCAAATCCATTTGAGGTGCAAGATCCATTTGCATACTCAAATGCCATCGCTCCAGCTCCATCAGTTCAAATGGCGCCAATACCTCAGCAGCAGGCTAATCCCTTTGGTCCATACCAACCTACCTTTTCACAGCAACAATCATTTACGATGGATCCTACAAATCCTTTTGGTGATGCAGGTTTTGGAGCATTCTCTGCTCCTAACCACCATACTGTTCCTCCAAATGCTAACAATCCATTTGGAAGCACAGGCCTTCTGTAG

Protein sequence

MATLQTWRKAYGALKDSTKVGLAHVNSEYADLDVAIVKSTNHVECPPKERHLRKILIATSAIRPRADVAYCIHALARRLSKTRNWTVALKTLIVIHRTLREGDPTFREELLNFTQRARIIQLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRILKYDIESERLPRPAQGQEKGYSRTRELDSEELLEHLPALQQLLYRLIGCRPEGAAIGNYVIQYALALVLKESFKIYCAINDGIINLVDKFFEMPRHEAIKALDIYKRAGQQAGSLSDFYDICKGLELARNFQFPVLREPPQSFLNTMEEYIREAPRMVTVPNEPLLQLTYKPEDSPSEDPNLPTDETEASPSHDLSITPVDSAPLPPPVPAPAPERHLDTGDLLGLSLDTTEVSAIEDRNALALAIVPSGDAAAPTFHSNGAQPKDFDPTGWELALVTTPSTNLSSTNERQLAGGLDTLTLDSLYDEGAYRASLQPVYGKPAPNPFEVQDPFAYSNAIAPAPSVQMAPIPQQQANPFGPYQPTFSQQQSFTMDPTNPFGDAGFGAFSAPNHHTVPPNANNPFGSTGLL
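
The protein sequence above corresponds to a codon-by-codon translation of the CDS. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1), the snippet below translates the first 30 nucleotides of the CDS. The helper name `translate` is hypothetical and is not part of any tool referenced on this page.

```python
# Standard genetic code (NCBI translation table 1), with codons ordered
# by base T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        # Codons containing ambiguous bases such as N become 'X'.
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First 30 nucleotides of the CDS listed above.
print(translate("ATGGCTACGCTTCAGACGTGGAGGAAAGCC"))
```

Passing the full CDS string to the same function should give the complete protein, ending at the terminal TAG stop codon.
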
BLAST of Cp4.1LG01g06100 vs. Swiss-Prot
Match: CAP8_ARATH (Putative clathrin assembly protein At2g01600 OS=Arabidopsis thaliana GN=At2g01600 PE=2 SV=2)

HSP 1 Score: 760.0 bits (1961), Expect = 1.9e-218
Identity = 404/590 (68.47%), Positives = 455/590 (77.12%), Query Frame = 1

Query: 1   MATLQTWRKAYGALKDSTKVGLAHVNSEYADLDVAIVKSTNHVECPPKERHLRKILIATS 60
           M TLQ+WRKAYGALKDSTKVGL  VNSEYADLDVAIVK+TNHVECPPK+RHLRKI  ATS
Sbjct: 1   MGTLQSWRKAYGALKDSTKVGLVRVNSEYADLDVAIVKATNHVECPPKDRHLRKIFAATS 60

Query: 61  AIRPRADVAYCIHALARRLSKTRNWTVALKTLIVIHRTLREGDPTFREELLNFTQRARII 120
             R RADVAYCIHAL+RRL KTRNWTVALKTLIVIHR LREGDPTFREELLNF+QR RI+
Sbjct: 61  VTRARADVAYCIHALSRRLHKTRNWTVALKTLIVIHRLLREGDPTFREELLNFSQRGRIL 120

Query: 121 QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRILKYDIESERLPRPAQGQEKGYSRT 180
           QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFR+LKYD E+ERLP+   GQ+KGYSRT
Sbjct: 121 QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRVLKYDTEAERLPKSNPGQDKGYSRT 180

Query: 181 RELDSEELLEHLPALQQLLYRLIGCRPEGAAIGNYVIQYALALVLKESFKIYCAINDGII 240
           R+LD EELLE LPALQQLLYRLIGCRPEGAA  N+VIQYALALVLKESFK+YCAINDGII
Sbjct: 181 RDLDGEELLEQLPALQQLLYRLIGCRPEGAANHNHVIQYALALVLKESFKVYCAINDGII 240

Query: 241 NLVDKFFEMPRHEAIKALDIYKRAGQQAGSLSDFYDICKGLELARNFQFPVLREPPQSFL 300
           NL+DKFFEM +HEAI +L+IYKRAGQQA SLSDFY+ CKGLELARNFQFPVLREPPQSFL
Sbjct: 241 NLIDKFFEMAKHEAITSLEIYKRAGQQARSLSDFYEACKGLELARNFQFPVLREPPQSFL 300

Query: 301 NTMEEYIREAPRMVTVPNEPLLQLTYKPEDS-PSEDPNLPTDETEASPSHDLSITPVDSA 360
            TMEEYI+EAPR+V VP EPLL LTY+P+D   +ED     +E E  PS D+ +   ++ 
Sbjct: 301 TTMEEYIKEAPRVVDVPAEPLL-LTYRPDDGLTTEDTEPSHEEREMLPSDDVVVVSEETE 360

Query: 361 PLPPPVPAPAPERHLDTGDLLGLSLDTTEVSAIEDRNALALAIVPSGDAAAPTFHSNGAQ 420
           P PPP P+   +  +DT DL GL+    + S IED+NALALAIV S DA  PT H    Q
Sbjct: 361 PSPPPPPSANAQNFIDTDDLWGLNTGAPDTSVIEDQNALALAIV-STDADPPTPHFG--Q 420

Query: 421 PKDFDPTGWELALVTTPSTNLSSTNERQLAGGLDTLTLDSLYDEGAYRASLQPVYGKPAP 480
           P ++DPTGWELALVT PS+++S++ ER+LAGGLDTLTL SLYD+GAY AS +PVYG PAP
Sbjct: 421 PNNYDPTGWELALVTAPSSDISASTERKLAGGLDTLTLSSLYDDGAYIASQRPVYGAPAP 480

Query: 481 NPFEVQDPFAYSNAIAPAPSVQMAPIPQQQA--NPFGPY--------QPTFSQQQSFTMD 540
           NPF   DPFA SN  AP         PQQQA  NPFG Y        QPT+  Q +   +
Sbjct: 481 NPFASHDPFASSNGTAPP--------PQQQAVNNPFGAYQQTYQHQPQPTYQHQSNPPTN 540

Query: 541 PTNPFGD---------------AGFGAFSAPNHHTVPPNANNPFGSTGLL 565
            +NPFGD               +G+G FS   H       NNPF STGL+
Sbjct: 541 NSNPFGDFGEFPVNPVSQQPNTSGYGDFSVNQH-------NNPFRSTGLI 571
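
The identity and positives percentages in an HSP header are the match counts divided by the alignment length, gaps included. The sketch below reproduces the figures reported for the CAP8_ARATH hit above. The function name `percent` is hypothetical, and rounding to two decimals is an assumption based on how the values are displayed.

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)

# Counts taken from the CAP8_ARATH HSP header above.
identity = percent(404, 590)
positives = percent(455, 590)
print(identity, positives)
```

The denominator is the alignment length (590 columns here) rather than the query length (565 residues), which is why it can exceed the length of either sequence.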

BLAST of Cp4.1LG01g06100 vs. Swiss-Prot
Match: CAP9_ARATH (Putative clathrin assembly protein At1g14910 OS=Arabidopsis thaliana GN=At1g14910 PE=2 SV=2)

HSP 1 Score: 721.8 bits (1862), Expect = 5.6e-207
Identity = 377/568 (66.37%), Positives = 447/568 (78.70%), Query Frame = 1

Query: 1   MATLQTWRKAYGALKDSTKVGLAHVNSEYADLDVAIVKSTNHVECPPKERHLRKILIATS 60
           M TLQ+WR+AYGALKD+TKVGL  VNS+YA+LDVAIVK+TNHVECPPK+RHLRKI +ATS
Sbjct: 1   MGTLQSWRRAYGALKDTTKVGLVRVNSDYAELDVAIVKATNHVECPPKDRHLRKIFLATS 60

Query: 61  AIRPRADVAYCIHALARRLSKTRNWTVALKTLIVIHRTLREGDPTFREELLNFTQRARII 120
           AIRPRADVAYCIHAL+RRL KTRNWTVALK L+VIHR LR+GDPTFREELLNF+Q+ RI+
Sbjct: 61  AIRPRADVAYCIHALSRRLHKTRNWTVALKALLVIHRLLRDGDPTFREELLNFSQKGRIM 120

Query: 121 QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRILKYDIESERLPRPAQGQEKGYSRT 180
           Q+SNFKDDSSP+AWDCS WVRTYALFLEERLECFR+LKYDIE+ERLP+ + GQEKGYS+T
Sbjct: 121 QISNFKDDSSPVAWDCSGWVRTYALFLEERLECFRVLKYDIEAERLPKVSPGQEKGYSKT 180

Query: 181 RELDSEELLEHLPALQQLLYRLIGCRPEGAAIGNYVIQYALALVLKESFKIYCAINDGII 240
           R+LD E+LLE LPALQQLL+RLIGC+PEGAA  N++IQYAL+LVLKESFK+YCAIN+GII
Sbjct: 181 RDLDGEKLLEQLPALQQLLHRLIGCKPEGAAKHNHIIQYALSLVLKESFKVYCAINEGII 240

Query: 241 NLVDKFFEMPRHEAIKALDIYKRAGQQAGSLSDFYDICKGLELARNFQFPVLREPPQSFL 300
           NLV+KFFEMPRHEAIKAL+IYKRAG QAG+LS FY++CKGLELARNFQFPVLREPPQSFL
Sbjct: 241 NLVEKFFEMPRHEAIKALEIYKRAGLQAGNLSAFYEVCKGLELARNFQFPVLREPPQSFL 300

Query: 301 NTMEEYIREAPRMVTVPNEPLLQLTYKPEDS-PSEDPNLPTDETEASPSHDLSITPVDSA 360
            TMEEY+R+AP+MV V + PLL LTY P+D   SED     +E E S   D ++ P +  
Sbjct: 301 TTMEEYMRDAPQMVDVTSGPLL-LTYTPDDGLTSEDVGPSHEEHETSSPSDSAVVPSEET 360

Query: 361 PL--PPPVPAPAPERHLDTGDLLGLSLDTTEVSAIEDRNALALAIVPSGDAAAPTFHSNG 420
            L    P     P+  +DT DLLGL  DT +  AI D+NALALA+V S D  +  F  + 
Sbjct: 361 QLSSQSPPSVETPQNFIDTDDLLGLHDDTPDPLAILDQNALALALV-SNDVDSSPF--SF 420

Query: 421 AQPKDFDPTGWELALVTTPSTNLSSTNERQLAGGLDTLTLDSLYDEGAYRASLQPVYGKP 480
            Q +D DP+GWELALVTTPS ++S+  ERQLAGGLDTLTL+SLYD+GA RA+ QP YG P
Sbjct: 421 GQARDLDPSGWELALVTTPSNDISAATERQLAGGLDTLTLNSLYDDGALRAAQQPAYGVP 480

Query: 481 APNPFEVQDPFAYSNAIAPAPSVQMAPIPQQQANPFGPYQPTFSQQQ-----SFTMDPTN 540
           A NPFEVQD FA+S++++P  +V          NPFG Y+PT+ QQ+          P N
Sbjct: 481 ASNPFEVQDLFAFSDSVSPPSAVN---------NPFGLYEPTYHQQEQQPQLQVAPSPAN 540

Query: 541 PFGDAGFGAFSAPNHHTVPPNANNPFGS 561
           PFGD  FG F  P      P +   FG+
Sbjct: 541 PFGD--FGEF--PIVPVSEPQSTTSFGA 551

BLAST of Cp4.1LG01g06100 vs. Swiss-Prot
Match: CAP7_ARATH (Putative clathrin assembly protein At5g57200 OS=Arabidopsis thaliana GN=At5g57200 PE=3 SV=1)

HSP 1 Score: 578.9 bits (1491), Expect = 5.9e-164
Identity = 334/584 (57.19%), Positives = 399/584 (68.32%), Query Frame = 1

Query: 1   MATLQTWRKAYGALKDSTKVGLAHVNSEYADLDVAIVKSTNHVECPPKERHLRKILIATS 60
           M T  ++RKAYGALKD+T VGLA VNSE+ DLD+AIVK+TNHVE PPKERH+RKI  ATS
Sbjct: 1   MGTFTSFRKAYGALKDTTTVGLAKVNSEFKDLDIAIVKATNHVESPPKERHVRKIFSATS 60

Query: 61  AIRPRADVAYCIHALARRLSKTRNWTVALKTLIVIHRTLREGDPTFREELLNFTQRARII 120
            I+PRADVAYCIHAL++RLSKTRNW VA+K LIVIHRTLREGDPTFREELLN++ R  I+
Sbjct: 61  VIQPRADVAYCIHALSKRLSKTRNWVVAMKVLIVIHRTLREGDPTFREELLNYSHRRHIL 120

Query: 121 QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRILKYDIESERLPRPAQGQEKGYSRT 180
           ++SNFKDD+SP+AWDCSAWVRTYALFLEERLEC+R+LKYDIE+ERLP+ A G      RT
Sbjct: 121 RISNFKDDTSPLAWDCSAWVRTYALFLEERLECYRVLKYDIEAERLPK-ASGAASKTHRT 180

Query: 181 RELDSEELLEHLPALQQLLYRLIGCRPEGAAIGNYVIQYALALVLKESFKIYCAINDGII 240
           R L  E+LLE LPALQQLLYRLIGC+PEGAA  NY+IQYALALVLKESFKIYCAINDGII
Sbjct: 181 RMLSGEDLLEQLPALQQLLYRLIGCQPEGAAYSNYLIQYALALVLKESFKIYCAINDGII 240

Query: 241 NLVDKFFEMPRHEAIKALDIYKRAGQQAGSLSDFYDICKGLELARNFQFPVLREPPQSFL 300
           NLVD FFEM RH+A+KAL+IYKRAGQQA +L++FYD CKGLELARNFQFP LR+PP SFL
Sbjct: 241 NLVDMFFEMSRHDAVKALNIYKRAGQQAENLAEFYDYCKGLELARNFQFPTLRQPPPSFL 300

Query: 301 NTMEEYIREAPRMVTVPNEPLLQLTYKPE-DSPSEDPNLPTDETEASPSHDLSITPVDSA 360
            TMEEYI+EAP+  +V  +   Q   + E +   E P  P +E   + + +     ++  
Sbjct: 301 ATMEEYIKEAPQSGSVQKKLEYQEKEEEEQEQEEEQPEEPAEEENQNENTENDQPLIEEE 360

Query: 361 PLPP----PVPAPAPERHLDTGDLLGLSLDTTEVSAIEDRNALALAIVPSG-DAAAPTFH 420
              P     V    P   +DT DLLGL     + + IE  NA +LAI P G + +AP   
Sbjct: 361 EEEPKEEIEVEEAKPSPLIDTDDLLGLHEINPKAAEIEQNNAFSLAIYPPGHETSAP--- 420

Query: 421 SNGAQPKDFDPTGWELALVTTPSTNLSSTNER-----QLAGGLDTLTLDSLYDEGAYRAS 480
           SN     +   +GWELALVT  + N ++ N R     +L GG D L LDSLY++   R  
Sbjct: 421 SNSLSLIEAGGSGWELALVTPQNNNNNNNNPRPVIATKLGGGFDNLLLDSLYEDDTARRQ 480

Query: 481 LQPV---YGKPA-----------PNPFEV-QDPFAYSNAIAPAPSVQMAPIPQQ--QANP 540
           +Q     YG  A           PNPF V QDPFA SN +AP  +VQMA   QQ    N 
Sbjct: 481 IQLTNAGYGFGATAIPGALASSNPNPFGVQQDPFAMSNNMAPPTNVQMAMQQQQMMMMNN 540

Query: 541 FGPYQPTFS--QQQSFTMDPT-----NPFGDAGFGAFSAPNHHT 550
             PY   +S      F+ +P+     NPFGD  F A  AP   T
Sbjct: 541 QSPYNNNYSPYHHHQFSPNPSTSSSPNPFGDP-FLALPAPPSST 579

BLAST of Cp4.1LG01g06100 vs. Swiss-Prot
Match: CAP6_ARATH (Putative clathrin assembly protein At4g25940 OS=Arabidopsis thaliana GN=At4g25940 PE=2 SV=1)

HSP 1 Score: 571.2 bits (1471), Expect = 1.2e-161
Identity = 335/603 (55.56%), Positives = 402/603 (66.67%), Query Frame = 1

Query: 1   MATLQTWRKAYGALKDSTKVGLAHVNSEYADLDVAIVKSTNHVECPPKERHLRKILIATS 60
           MAT  ++RKA GA+KDST V +A VNSE+ DLDVAIVK+TNHVE  PKERH+R+I  ATS
Sbjct: 1   MATFNSFRKAVGAIKDSTTVSIAKVNSEFKDLDVAIVKATNHVESAPKERHIRRIFSATS 60

Query: 61  AIRPRADVAYCIHALARRLSKTRNWTVALKTLIVIHRTLREGDPTFREELLNFTQRARII 120
            ++PRADVAYCIHALA+RLSKTRNW VA+K LIVIHRTLREGDPTFREELLN++ R  I+
Sbjct: 61  VVQPRADVAYCIHALAKRLSKTRNWVVAIKVLIVIHRTLREGDPTFREELLNYSHRGHIL 120

Query: 121 QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRILKYDIESERLPRPAQGQEKGYS-- 180
           ++SNFKDD+SP+AWDCSAW+RTYALFLEERLEC+R+LKYDIE+ERLP+ +    K     
Sbjct: 121 RISNFKDDTSPLAWDCSAWIRTYALFLEERLECYRVLKYDIEAERLPKGSGASSKNVDFN 180

Query: 181 -----RTRELDSEELLEHLPALQQLLYRLIGCRPEGAAIGNYVIQYALALVLKESFKIYC 240
                RTR L  EELLE LPALQQLLYRLIGC+PEG+A  NY+IQYALALVLKESFKIYC
Sbjct: 181 ASQTYRTRMLSDEELLEQLPALQQLLYRLIGCQPEGSAYSNYLIQYALALVLKESFKIYC 240

Query: 241 AINDGIINLVDKFFEMPRHEAIKALDIYKRAGQQAGSLSDFYDICKGLELARNFQFPVLR 300
           AINDGIINLVD FFEM RH+A+KAL+IYKRAGQQA +L+DFY+ CKGLELARNFQFP LR
Sbjct: 241 AINDGIINLVDMFFEMSRHDAVKALNIYKRAGQQAENLADFYEYCKGLELARNFQFPTLR 300

Query: 301 EPPQSFLNTMEEYIREAPRMVTVPNEPLLQLTYKPEDSPSED-------PNLPTD---ET 360
           +PP SFL TME+YI+EAP+  +V  +  L+   K E+   E+       P  P +   + 
Sbjct: 301 QPPPSFLATMEDYIKEAPQSGSVQKK--LEYQEKEEEEQEEEEAEHSVQPEEPAEADNQK 360

Query: 361 EASPSHDLSITPVDSAPLPPPVPAPAPERHLDTGDLLGLSLDTTEVSAIEDRNALALAIV 420
           E S      I   +            P   +DT DLLGL+    + + IEDRNALALAI 
Sbjct: 361 ENSEGDQPLIEEEEEDQEKIEEEDAKPSFLIDTDDLLGLNEINPKAAEIEDRNALALAIY 420

Query: 421 PSG-DAAAPTFHSNGAQPKDFDPTGWELALVTTPSTNLSSTNER-----QLAGGLDTLTL 480
           P G +A  P   SN     +   +GWELALV TP  N ++ N R     +LAGG D L L
Sbjct: 421 PPGHEAPGP---SNILSLIETGGSGWELALV-TPQNNNNNNNPRPAPNTKLAGGFDNLLL 480

Query: 481 DSLYDEGAYRASLQPV---YG-------KPAPNPFEV-QDPFAYSNAIAPAPSVQMAPIP 540
           DSLY++ + R  +Q     YG          PNPF++ QDPFA SN IAP  +VQMA   
Sbjct: 481 DSLYEDDSARRQIQLTNAGYGHGGIDTTAAPPNPFQMQQDPFAMSNNIAPPTNVQMAMQQ 540

Query: 541 QQQ-------------ANPFGPYQPTFSQQQSFTMDPTNPFGDAGFGAFSAPNHHTVPPN 557
           QQQ              +P   +Q     Q S    P+NPFGDA F A   P     P  
Sbjct: 541 QQQQQMTMMHQSPYNYTHPHDYHQNHHHHQFSAGPSPSNPFGDA-FLALPPPPGSAGPQQ 596

BLAST of Cp4.1LG01g06100 vs. Swiss-Prot
Match: CAP10_ARATH (Putative clathrin assembly protein At5g35200 OS=Arabidopsis thaliana GN=At5g35200 PE=1 SV=1)

HSP 1 Score: 451.4 bits (1160), Expect = 1.4e-125
Identity = 268/568 (47.18%), Positives = 354/568 (62.32%), Query Frame = 1

Query: 8   RKAYGALKDSTKVGLAHVNSEYADLDVAIVKSTNHVECPPKERHLRKILIATSAIRPRAD 67
           R+  GA+KD+T V LA VNS+Y +LD+AIVK+TNHVE P KER++R I +A SA RPRAD
Sbjct: 12  RRYLGAIKDTTTVSLAKVNSDYKELDIAIVKATNHVERPSKERYIRAIFMAISATRPRAD 71

Query: 68  VAYCIHALARRLSKTRNWTVALKTLIVIHRTLREGDPTFREELLNFTQ-RARIIQLSNFK 127
           VAYCIHALARRLS+T NW VALKTLIVIHR LRE D TF EE++N+++ R+ ++ +S+FK
Sbjct: 72  VAYCIHALARRLSRTHNWAVALKTLIVIHRALREVDQTFHEEVINYSRSRSHMLNMSHFK 131

Query: 128 DDSSPIAWDCSAWVRTYALFLEERLECFRILKYDIESERLPRPAQGQEKGYSRTRELDSE 187
           DDS P AW  SAWVR YALFLEERLECFR+LKYD+E +              RT++LD+ 
Sbjct: 132 DDSGPNAWAYSAWVRFYALFLEERLECFRVLKYDVEVDP------------PRTKDLDTP 191

Query: 188 ELLEHLPALQQLLYRLIGCRPEGAAIGNYVIQYALALVLKESFKIYCAINDGIINLVDKF 247
           +LLE LPALQ+LL+R++ C+PEGAA+ N++IQ AL++V+ ES KIY A+ DGI NLVDKF
Sbjct: 192 DLLEQLPALQELLFRVLDCQPEGAAVQNHIIQLALSMVISESTKIYQALTDGIDNLVDKF 251

Query: 248 FEMPRHEAIKALDIYKRAGQQAGSLSDFYDICKGLELARNFQFPVLREPPQSFLNTMEEY 307
           F+M R++A+KALD+Y+RA +QAG LS+F+++CK + + R  +F  + +PP SFL  MEEY
Sbjct: 252 FDMQRNDAVKALDMYRRAVKQAGRLSEFFEVCKSVNVGRGERFIKIEQPPTSFLQAMEEY 311

Query: 308 IREAPRMVTVPNEPLLQLTYKPEDSPSEDPNLPTDETEASPSHDLSITPVDSAPLPPPVP 367
           ++EAP    V  E +++    P++  + +  +P    E  P+             P PV 
Sbjct: 312 VKEAPLAAGVKKEQVVEKLTAPKEILAIEYEIPPKVVEEKPAS------------PEPVK 371

Query: 368 APAPERHLDTGDLLGLSLDTTEVSAIEDRNALALAIVPSG--DAAAPTFHSNGAQPKDFD 427
           A A +      DLL +      VS +E++NALALAIVP       + T  +NG      +
Sbjct: 372 AEAEKPVEKQPDLLSMDDPAPMVSELEEKNALALAIVPVSVEQPHSTTDFTNG------N 431

Query: 428 PTGWELALVTTPSTNLSSTNERQLAGGLDTLTLDSLYDEGAYRASLQ------PVYGKPA 487
            TGWELALVT PS+N  +  + +LAGGLD LTLDSLY E A R S Q      P    P 
Sbjct: 432 STGWELALVTAPSSNEGAAADSKLAGGLDKLTLDSLY-EDAIRVSQQQNRSYNPWEQNPV 491

Query: 488 PNPFEVQDPFAYSNAIAPAPSVQMA-----PIPQQQANP---FGPYQPTFSQQQSFTMDP 547
            N   +  PF  SN +A     QMA         Q  N     GP Q  + QQQ    + 
Sbjct: 492 HNGHMMHQPFYASNGVAAPQPFQMANQNHQTFGYQHQNAGMMMGPVQQPYQQQQ---QNM 540

Query: 548 TNPFGDAGFGAFSAPNHHTVPPNANNPF 559
            NPFG+         N +   P   NP+
Sbjct: 552 NNPFGNP-----FVSNGNPQQPQGYNPY 540

BLAST of Cp4.1LG01g06100 vs. TrEMBL
Match: A0A0A0KIT4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G033480 PE=4 SV=1)

HSP 1 Score: 1057.0 bits (2732), Expect = 8.2e-306
Identity = 530/566 (93.64%), Positives = 544/566 (96.11%), Query Frame = 1

Query: 1   MATLQTWRKAYGALKDSTKVGLAHVNSEYADLDVAIVKSTNHVECPPKERHLRKILIATS 60
           MATLQTWRKAYGALKDSTKVGLAHVNS+YADLDVAIVK+TNHVECPPKERHLRKILIATS
Sbjct: 1   MATLQTWRKAYGALKDSTKVGLAHVNSDYADLDVAIVKATNHVECPPKERHLRKILIATS 60

Query: 61  AIRPRADVAYCIHALARRLSKTRNWTVALKTLIVIHRTLREGDPTFREELLNFTQRARII 120
           AIRPRADVAYCIHALARRLSKTRNWTVALK LIVIHRTLREGDPTFREELLNFTQRARI+
Sbjct: 61  AIRPRADVAYCIHALARRLSKTRNWTVALKALIVIHRTLREGDPTFREELLNFTQRARIL 120

Query: 121 QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRILKYDIESERLPRPAQGQEKGYSRT 180
           QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRILKYDIESERLPRPAQGQEKGYSRT
Sbjct: 121 QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRILKYDIESERLPRPAQGQEKGYSRT 180

Query: 181 RELDSEELLEHLPALQQLLYRLIGCRPEGAAIGNYVIQYALALVLKESFKIYCAINDGII 240
           RELDSEELLEHLPALQQLLYRLIGC+PEGAAIGNYVIQYALALVLKESFKIYCAINDGII
Sbjct: 181 RELDSEELLEHLPALQQLLYRLIGCKPEGAAIGNYVIQYALALVLKESFKIYCAINDGII 240

Query: 241 NLVDKFFEMPRHEAIKALDIYKRAGQQAGSLSDFYDICKGLELARNFQFPVLREPPQSFL 300
           NLVDKFFEMPRHEAIKALDIYKRAGQQAGSLSDFYDICKGLELARNFQFPVLREPPQSFL
Sbjct: 241 NLVDKFFEMPRHEAIKALDIYKRAGQQAGSLSDFYDICKGLELARNFQFPVLREPPQSFL 300

Query: 301 NTMEEYIREAPRMVTVPNEPLLQLTYKPEDSPSEDPNLPTDETEASPSHDLSITPVDSAP 360
           NTMEEYIREAPRMVTVPNEPLLQLTYKPE+S SED NLPTDE EASPS+DLSITPV++AP
Sbjct: 301 NTMEEYIREAPRMVTVPNEPLLQLTYKPEESLSEDQNLPTDELEASPSNDLSITPVETAP 360

Query: 361 L-PPPVPAPAPERHLDTGDLLGLSLDTTEVSAIEDRNALALAIVPSGDAAAPTFHSNGAQ 420
             PPP PAPAPE HL+TGDLLGLSL TTEVSAIE+RNALALAIVPSGD  APTFHSNGAQ
Sbjct: 361 TPPPPAPAPAPESHLETGDLLGLSLATTEVSAIEERNALALAIVPSGDTEAPTFHSNGAQ 420

Query: 421 PKDFDPTGWELALVTTPSTNLSSTNERQLAGGLDTLTLDSLYDEGAYRASLQPVYGKPAP 480
             DFDPTGWELALVTTPSTNLSS NERQLAGGLDTL LDSLYDEGAYRASLQPVYGKPAP
Sbjct: 421 ANDFDPTGWELALVTTPSTNLSSANERQLAGGLDTLILDSLYDEGAYRASLQPVYGKPAP 480

Query: 481 NPFEVQDPFAYSNAIAPAPSVQMAPIPQQQANPFGPYQPTF-SQQQSFTMDPTNPFGDAG 540
           NPFEVQDPFAYSNAIAP PSVQMAP+ QQQANPFGP+QPTF  QQQ FTMDPTNPFGD+G
Sbjct: 481 NPFEVQDPFAYSNAIAPPPSVQMAPLAQQQANPFGPFQPTFPQQQQPFTMDPTNPFGDSG 540

Query: 541 FGAFSAPNHHTVPPNANNPFGSTGLL 565
           FGAF APNHHTVPP A+NPFGSTGLL
Sbjct: 541 FGAFPAPNHHTVPPPASNPFGSTGLL 566

BLAST of Cp4.1LG01g06100 vs. TrEMBL
Match: M5XDV8_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003610mg PE=4 SV=1)

HSP 1 Score: 908.3 bits (2346), Expect = 4.7e-261
Identity = 457/566 (80.74%), Positives = 501/566 (88.52%), Query Frame = 1

Query: 1   MATLQTWRKAYGALKDSTKVGLAHVNSEYADLDVAIVKSTNHVECPPKERHLRKILIATS 60
           M TLQTWRKAYGALKD+TKVGLAHVNS+YADLDVAIVK+TNHVECPPKERHLRKILIATS
Sbjct: 1   MGTLQTWRKAYGALKDTTKVGLAHVNSDYADLDVAIVKATNHVECPPKERHLRKILIATS 60

Query: 61  AIRPRADVAYCIHALARRLSKTRNWTVALKTLIVIHRTLREGDPTFREELLNFTQRARII 120
           AIRPRADVAYCIHAL+RRL+KT NWTVALKTLIVIHRTLREGDPTFREELLNF+QR RI+
Sbjct: 61  AIRPRADVAYCIHALSRRLNKTHNWTVALKTLIVIHRTLREGDPTFREELLNFSQRGRIL 120

Query: 121 QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRILKYDIESERLPRPAQGQEKGYSRT 180
           QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFR+LKYDIE+ERLPRPAQGQEKGYSRT
Sbjct: 121 QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRVLKYDIEAERLPRPAQGQEKGYSRT 180

Query: 181 RELDSEELLEHLPALQQLLYRLIGCRPEGAAIGNYVIQYALALVLKESFKIYCAINDGII 240
           RELDSEELLE LPALQQLLYRLIGCRPEGAA+ NYVIQYALALVLKESFKIYCA+NDGII
Sbjct: 181 RELDSEELLEQLPALQQLLYRLIGCRPEGAAVVNYVIQYALALVLKESFKIYCAVNDGII 240

Query: 241 NLVDKFFEMPRHEAIKALDIYKRAGQQAGSLSDFYDICKGLELARNFQFPVLREPPQSFL 300
           NLVDKFFEMPRHEA+KALD+YKRAGQQA  LSDFY++CKGLELARNFQFPVLREPPQSFL
Sbjct: 241 NLVDKFFEMPRHEAVKALDVYKRAGQQAAGLSDFYEVCKGLELARNFQFPVLREPPQSFL 300

Query: 301 NTMEEYIREAPRMVTVPNEPLLQLTYKPEDSPSEDPNLPTDETEASPSHDLSITPVDSAP 360
            TMEEYIREAPR VTVP+EPLLQLTY+PE+ PSED  L +DE+E +P   + ++ V++A 
Sbjct: 301 TTMEEYIREAPRAVTVPHEPLLQLTYRPEE-PSEDTKLSSDESEPAPLDIVPVSNVETA- 360

Query: 361 LPPPVPAPAPERHLDTGDLLGLSLDTTEVSAIEDRNALALAIVPSGDAAAPTFHSNGAQP 420
            P P P P P+   DTGDLLGL    ++VS +E+RNALALAIV S   AAPTF+S+  QP
Sbjct: 361 TPSPPPPPPPQSSQDTGDLLGLDYTASDVSVMEERNALALAIVSSETDAAPTFNSSAVQP 420

Query: 421 KDFDPTGWELALVTTPSTNLSSTNERQLAGGLDTLTLDSLYDEGAYRASLQPVYGKPAPN 480
           KDFDPTGWELALVTTPS N+SS NERQLAGGLD+LTL+SLYDEGAYRA+ QPVYG PAPN
Sbjct: 421 KDFDPTGWELALVTTPSNNISSVNERQLAGGLDSLTLNSLYDEGAYRAAQQPVYGAPAPN 480

Query: 481 PFEVQDPFAYSNAIAPAPSVQMAPIPQQQANPFGPYQPTFS-QQQSFTMDPTNPFGDAGF 540
           PFEVQDPFA SN +AP P VQMA + QQQ+NPFG +QPT+  QQQ+  M PTNPFGD GF
Sbjct: 481 PFEVQDPFALSNNVAPPPGVQMAAMAQQQSNPFGSFQPTYQPQQQNVMMGPTNPFGDTGF 540

Query: 541 GAFSAPNHHTVP-PNANNPFGSTGLL 565
           GAF  P H   P P  +NPFGSTGLL
Sbjct: 541 GAF--PAHPPAPHPQTSNPFGSTGLL 562

BLAST of Cp4.1LG01g06100 vs. TrEMBL
Match: B9HJD6_POPTR (Clathrin assembly family protein OS=Populus trichocarpa GN=POPTR_0008s13140g PE=4 SV=1)

HSP 1 Score: 906.0 bits (2340), Expect = 2.3e-260
Identity = 456/567 (80.42%), Positives = 496/567 (87.48%), Query Frame = 1

Query: 1   MATLQTWRKAYGALKDSTKVGLAHVNSEYADLDVAIVKSTNHVECPPKERHLRKILIATS 60
           M TLQTWRKAYGALKDSTKVGLAHVNS+YA+LDVAIVK+TNHVECPPKERHLRKIL ATS
Sbjct: 1   MGTLQTWRKAYGALKDSTKVGLAHVNSDYAELDVAIVKATNHVECPPKERHLRKILAATS 60

Query: 61  AIRPRADVAYCIHALARRLSKTRNWTVALKTLIVIHRTLREGDPTFREELLNFTQRARII 120
           AIRPRADVAYCIHAL+RRL+KT NWTVALK LIVIHR LREGDPTFREELLNF+QR RI+
Sbjct: 61  AIRPRADVAYCIHALSRRLAKTHNWTVALKILIVIHRLLREGDPTFREELLNFSQRGRIL 120

Query: 121 QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRILKYDIESERLPRPAQGQEKGYSRT 180
           QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRILKYDIE+ERLPRPAQGQ+KGYSRT
Sbjct: 121 QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRILKYDIEAERLPRPAQGQDKGYSRT 180

Query: 181 RELDSEELLEHLPALQQLLYRLIGCRPEGAAIGNYVIQYALALVLKESFKIYCAINDGII 240
           R+LDSE+LLE LPALQQLLYRL+GCRPEGAA+GNYVIQYALALVLKESFKIYCAINDGII
Sbjct: 181 RDLDSEDLLEQLPALQQLLYRLVGCRPEGAAVGNYVIQYALALVLKESFKIYCAINDGII 240

Query: 241 NLVDKFFEMPRHEAIKALDIYKRAGQQAGSLSDFYDICKGLELARNFQFPVLREPPQSFL 300
           NLVDKFFEMPRHEAIKALDIYKRAGQQAG+LSDFYDICKGLELARNFQFPVLREPPQSFL
Sbjct: 241 NLVDKFFEMPRHEAIKALDIYKRAGQQAGNLSDFYDICKGLELARNFQFPVLREPPQSFL 300

Query: 301 NTMEEYIREAPRMVTVPNEPLLQLTYKPEDSPSEDPNLPTDETEASPSHDLSITPVDSAP 360
            TMEEYIREAPR+V+VP+E LLQLTY+PE+ PSED     DE E  PS D++++ V+ A 
Sbjct: 301 TTMEEYIREAPRVVSVPSEALLQLTYRPEEGPSEDAKSSGDELEPPPSDDVAVSNVEIA- 360

Query: 361 LPPPVPAPAPERHLDTGDLLGLSLDTTEVSAIEDRNALALAIVPSGDAAAPTFHSNGAQP 420
             PPVP  AP+  +DTGDLLGL   T   S IE+ NALALAIVPS    APTF+S   Q 
Sbjct: 361 --PPVPTTAPQNSIDTGDLLGLDYGTPNASTIEESNALALAIVPSESDVAPTFNSVAGQA 420

Query: 421 KDFDPTGWELALVTTPSTNLSSTNERQLAGGLDTLTLDSLYDEGAYRASLQPVYGKPAPN 480
           KDFDPTGWELALVTTPS+N+S+TNERQLAGGLD+LTL+SLYDEGAYRA+ +PVYG PAPN
Sbjct: 421 KDFDPTGWELALVTTPSSNISATNERQLAGGLDSLTLNSLYDEGAYRAARRPVYGAPAPN 480

Query: 481 PFEVQDPFAYSNAIAPAPSVQMAPIPQQQANPFGPYQPTFSQ---QQSFTMDPTNPFGDA 540
           PFE+QDPFA SN+IA  PSVQMA + QQ  NPFGPYQPT+ Q   QQ+  M   NPFGDA
Sbjct: 481 PFEIQDPFALSNSIAAPPSVQMAAMTQQPHNPFGPYQPTYPQPQHQQNMMMSHANPFGDA 540

Query: 541 GFGAFSAPNHHTVPPNANNPFGSTGLL 565
           GFGAF A  H    P  NNPFGSTGLL
Sbjct: 541 GFGAFHA--HPMAHPQTNNPFGSTGLL 562

BLAST of Cp4.1LG01g06100 vs. TrEMBL
Match: B9SP68_RICCO (Clathrin assembly protein, putative OS=Ricinus communis GN=RCOM_0629240 PE=4 SV=1)

HSP 1 Score: 906.0 bits (2340), Expect = 2.3e-260
Identity = 457/572 (79.90%), Positives = 500/572 (87.41%), Query Frame = 1

Query: 1   MATLQTWRKAYGALKDSTKVGLAHVNSEYADLDVAIVKSTNHVECPPKERHLRKILIATS 60
           M TLQTWRKAYGALKDSTKVGLAHVNS++A+LDVAIVK+TNHVECPPKERHLRKIL+ATS
Sbjct: 1   MGTLQTWRKAYGALKDSTKVGLAHVNSDFAELDVAIVKATNHVECPPKERHLRKILVATS 60

Query: 61  AIRPRADVAYCIHALARRLSKTRNWTVALKTLIVIHRTLREGDPTFREELLNFTQRARII 120
           AIRPRADV YCIHAL+RRL+KT NWTVALKTLIVIHR LREGDPTF+EEL+NF+QR RI+
Sbjct: 61  AIRPRADVQYCIHALSRRLAKTHNWTVALKTLIVIHRLLREGDPTFKEELVNFSQRGRIL 120

Query: 121 QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRILKYDIESERLPRPAQGQEKGYSRT 180
           QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRILKYDIE+ERLPRP QGQ+KGYSRT
Sbjct: 121 QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRILKYDIEAERLPRPVQGQDKGYSRT 180

Query: 181 RELDSEELLEHLPALQQLLYRLIGCRPEGAAIGNYVIQYALALVLKESFKIYCAINDGII 240
           RELDSEELLE LPALQQLLYRL+GCRPEGAA+GNYVIQYALALVLKESFKIYCAINDGII
Sbjct: 181 RELDSEELLEQLPALQQLLYRLVGCRPEGAAVGNYVIQYALALVLKESFKIYCAINDGII 240

Query: 241 NLVDKFFEMPRHEAIKALDIYKRAGQQAGSLSDFYDICKGLELARNFQFPVLREPPQSFL 300
           NLVDKFFEMPRHEAIKALD+YKRAGQQAGSLSDFYD+CKGLELARNFQFPVLREPPQSFL
Sbjct: 241 NLVDKFFEMPRHEAIKALDVYKRAGQQAGSLSDFYDVCKGLELARNFQFPVLREPPQSFL 300

Query: 301 NTMEEYIREAPRMVTVPNEPLLQLTYKPEDSPS--EDPNLPTDETEASPSHDLSITPVDS 360
            TMEEYIREAPR+VTVP+EPLLQLTY+PE+ PS  ED  LP DE E+ PS D++I   + 
Sbjct: 301 TTMEEYIREAPRVVTVPSEPLLQLTYRPEEGPSEPEDTKLPIDEPESVPSEDVAIANAEV 360

Query: 361 APLPPPVPAPAPERHLDTGDLLGL---SLDTTEVSAIEDRNALALAIVPSGDAAAPTFHS 420
           AP  PP P   P+ ++DTGDLLGL   S D +  SAIE+RNALALAIVP    AAPTF+S
Sbjct: 361 APPTPPTP---PQNNMDTGDLLGLNYASPDVSAASAIEERNALALAIVPLEQDAAPTFNS 420

Query: 421 NGAQPKDFDPTGWELALVTTPSTNLSSTNERQLAGGLDTLTLDSLYDEGAYRASLQPVYG 480
              QPKDFDPTGWELALVTTPS N+SS N+RQLAGGLDTLTL+SLYD+ AYRA+ QPVYG
Sbjct: 421 GAGQPKDFDPTGWELALVTTPSANISSVNDRQLAGGLDTLTLNSLYDDVAYRAAQQPVYG 480

Query: 481 KPAPNPFEVQDPFAYSNAIAPAPSVQMAPIPQQQANPFGPYQPTF---SQQQSFTMDPTN 540
            PAPNPFEV DPFA SN+IAP  +VQMA + QQ  NPFGPYQPT+    QQQ   M P N
Sbjct: 481 APAPNPFEVHDPFAMSNSIAPPSAVQMAAMTQQPPNPFGPYQPTYPQPQQQQHLMMSPAN 540

Query: 541 PFGDAGFGAFSAPNHHTVPPNANNPFGSTGLL 565
           PFGDAGFG F  P +    P++NNPFGSTGLL
Sbjct: 541 PFGDAGFGTF--PVNTVTHPHSNNPFGSTGLL 567

BLAST of Cp4.1LG01g06100 vs. TrEMBL
Match: A0A061E7Q5_THECC (ENTH/ANTH/VHS superfamily protein OS=Theobroma cacao GN=TCM_010974 PE=4 SV=1)

HSP 1 Score: 900.6 bits (2326), Expect = 9.8e-259
Identity = 454/569 (79.79%), Positives = 497/569 (87.35%), Query Frame = 1

Query: 1   MATLQTWRKAYGALKDSTKVGLAHVNSEYADLDVAIVKSTNHVECPPKERHLRKILIATS 60
           M TLQTWRKAYGALKD+TKVGLAHVNS+YADLDVAIVK+TNHVECPPKERHLRKI +ATS
Sbjct: 1   MGTLQTWRKAYGALKDTTKVGLAHVNSDYADLDVAIVKATNHVECPPKERHLRKIFMATS 60

Query: 61  AIRPRADVAYCIHALARRLSKTRNWTVALKTLIVIHRTLREGDPTFREELLNFTQRARII 120
           AIRPRADVAYCIHALARRL+KT NWTVALKTLIVIHR LREGDPTFREELLNF+QRARI+
Sbjct: 61  AIRPRADVAYCIHALARRLAKTHNWTVALKTLIVIHRALREGDPTFREELLNFSQRARIL 120

Query: 121 QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRILKYDIESERLPRPAQGQEKGYSRT 180
           QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRILKYDIE+ERLPRPAQGQ+KGYSRT
Sbjct: 121 QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRILKYDIEAERLPRPAQGQDKGYSRT 180

Query: 181 RELDSEELLEHLPALQQLLYRLIGCRPEGAAIGNYVIQYALALVLKESFKIYCAINDGII 240
           RELDSEELLE LPALQQLLYRLIGCRPEGAAIGNYVIQYALALVLKESFKIYCAINDGII
Sbjct: 181 RELDSEELLEQLPALQQLLYRLIGCRPEGAAIGNYVIQYALALVLKESFKIYCAINDGII 240

Query: 241 NLVDKFFEMPRHEAIKALDIYKRAGQQAGSLSDFYDICKGLELARNFQFPVLREPPQSFL 300
           NLVDKFFEMPRHEA+ ALD+YKRAGQQA SLSDFYD+CKGLELARNFQFPVLREPPQSFL
Sbjct: 241 NLVDKFFEMPRHEAVTALDVYKRAGQQANSLSDFYDVCKGLELARNFQFPVLREPPQSFL 300

Query: 301 NTMEEYIREAPRMVTVPNEPLLQLTYKPEDSPSEDPNLPTDETEASP-SHDLSITPVDSA 360
            TMEEYIREAPR+V+VP EPLLQLTY+PE+ PSED  L  DE E S  + D++++ V++ 
Sbjct: 301 TTMEEYIREAPRVVSVPTEPLLQLTYRPEEGPSEDTKLSNDEPEPSALADDIAVSGVETV 360

Query: 361 PLPPPVPAPAPERHLDTGDLLGLSLDTTEVSAIEDRNALALAIVPSGDAAAPTFHSNGAQ 420
           P+PP    P P+ + D GDLL LS    +  AIE+ NALALAIVP+     PTF+S   Q
Sbjct: 361 PVPP----PPPQNNADGGDLLDLSYSAPDALAIEESNALALAIVPTEPGTGPTFNSTTGQ 420

Query: 421 PKDFDPTGWELALVTTPSTNLSSTNERQLAGGLDTLTLDSLYDEGAYRASLQPVYGKPAP 480
           PKDFDPTGWELALVTTPS+++S+ N+RQLAGGLD+LTL+SLYDE AYRAS QPVYG PAP
Sbjct: 421 PKDFDPTGWELALVTTPSSDISAVNDRQLAGGLDSLTLNSLYDEAAYRASQQPVYGAPAP 480

Query: 481 NPFEVQDPFAYSNAIAPAPSVQMAPIPQQQANPFGPYQPTFS---QQQSFTMDPTNPFGD 540
           NPFEVQDPFA SN IAPA +VQMA + Q Q+NPFGPYQPT+    QQQ   M P+NPFGD
Sbjct: 481 NPFEVQDPFAMSNNIAPARAVQMAAMAQPQSNPFGPYQPTYQQPLQQQHMMMSPSNPFGD 540

Query: 541 AGFGAFSAPNHHTV-PPNANNPFGSTGLL 565
           AGFGAF       V  P+ANNPFGSTGLL
Sbjct: 541 AGFGAFPVNQMPPVAQPHANNPFGSTGLL 565

BLAST of Cp4.1LG01g06100 vs. TAIR10
Match: AT2G01600.1 (AT2G01600.1 ENTH/ANTH/VHS superfamily protein)

HSP 1 Score: 760.0 bits (1961), Expect = 1.0e-219
Identity = 404/590 (68.47%), Positives = 455/590 (77.12%), Query Frame = 1

Query: 1   MATLQTWRKAYGALKDSTKVGLAHVNSEYADLDVAIVKSTNHVECPPKERHLRKILIATS 60
           M TLQ+WRKAYGALKDSTKVGL  VNSEYADLDVAIVK+TNHVECPPK+RHLRKI  ATS
Sbjct: 1   MGTLQSWRKAYGALKDSTKVGLVRVNSEYADLDVAIVKATNHVECPPKDRHLRKIFAATS 60

Query: 61  AIRPRADVAYCIHALARRLSKTRNWTVALKTLIVIHRTLREGDPTFREELLNFTQRARII 120
             R RADVAYCIHAL+RRL KTRNWTVALKTLIVIHR LREGDPTFREELLNF+QR RI+
Sbjct: 61  VTRARADVAYCIHALSRRLHKTRNWTVALKTLIVIHRLLREGDPTFREELLNFSQRGRIL 120

Query: 121 QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRILKYDIESERLPRPAQGQEKGYSRT 180
           QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFR+LKYD E+ERLP+   GQ+KGYSRT
Sbjct: 121 QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRVLKYDTEAERLPKSNPGQDKGYSRT 180

Query: 181 RELDSEELLEHLPALQQLLYRLIGCRPEGAAIGNYVIQYALALVLKESFKIYCAINDGII 240
           R+LD EELLE LPALQQLLYRLIGCRPEGAA  N+VIQYALALVLKESFK+YCAINDGII
Sbjct: 181 RDLDGEELLEQLPALQQLLYRLIGCRPEGAANHNHVIQYALALVLKESFKVYCAINDGII 240

Query: 241 NLVDKFFEMPRHEAIKALDIYKRAGQQAGSLSDFYDICKGLELARNFQFPVLREPPQSFL 300
           NL+DKFFEM +HEAI +L+IYKRAGQQA SLSDFY+ CKGLELARNFQFPVLREPPQSFL
Sbjct: 241 NLIDKFFEMAKHEAITSLEIYKRAGQQARSLSDFYEACKGLELARNFQFPVLREPPQSFL 300

Query: 301 NTMEEYIREAPRMVTVPNEPLLQLTYKPEDS-PSEDPNLPTDETEASPSHDLSITPVDSA 360
            TMEEYI+EAPR+V VP EPLL LTY+P+D   +ED     +E E  PS D+ +   ++ 
Sbjct: 301 TTMEEYIKEAPRVVDVPAEPLL-LTYRPDDGLTTEDTEPSHEEREMLPSDDVVVVSEETE 360

Query: 361 PLPPPVPAPAPERHLDTGDLLGLSLDTTEVSAIEDRNALALAIVPSGDAAAPTFHSNGAQ 420
           P PPP P+   +  +DT DL GL+    + S IED+NALALAIV S DA  PT H    Q
Sbjct: 361 PSPPPPPSANAQNFIDTDDLWGLNTGAPDTSVIEDQNALALAIV-STDADPPTPHFG--Q 420

Query: 421 PKDFDPTGWELALVTTPSTNLSSTNERQLAGGLDTLTLDSLYDEGAYRASLQPVYGKPAP 480
           P ++DPTGWELALVT PS+++S++ ER+LAGGLDTLTL SLYD+GAY AS +PVYG PAP
Sbjct: 421 PNNYDPTGWELALVTAPSSDISASTERKLAGGLDTLTLSSLYDDGAYIASQRPVYGAPAP 480

Query: 481 NPFEVQDPFAYSNAIAPAPSVQMAPIPQQQA--NPFGPY--------QPTFSQQQSFTMD 540
           NPF   DPFA SN  AP         PQQQA  NPFG Y        QPT+  Q +   +
Sbjct: 481 NPFASHDPFASSNGTAPP--------PQQQAVNNPFGAYQQTYQHQPQPTYQHQSNPPTN 540

Query: 541 PTNPFGD---------------AGFGAFSAPNHHTVPPNANNPFGSTGLL 565
            +NPFGD               +G+G FS   H       NNPF STGL+
Sbjct: 541 NSNPFGDFGEFPVNPVSQQPNTSGYGDFSVNQH-------NNPFRSTGLI 571

BLAST of Cp4.1LG01g06100 vs. TAIR10
Match: AT1G14910.1 (AT1G14910.1 ENTH/ANTH/VHS superfamily protein)

HSP 1 Score: 721.8 bits (1862), Expect = 3.2e-208
Identity = 377/568 (66.37%), Positives = 447/568 (78.70%), Query Frame = 1

Query: 1   MATLQTWRKAYGALKDSTKVGLAHVNSEYADLDVAIVKSTNHVECPPKERHLRKILIATS 60
           M TLQ+WR+AYGALKD+TKVGL  VNS+YA+LDVAIVK+TNHVECPPK+RHLRKI +ATS
Sbjct: 1   MGTLQSWRRAYGALKDTTKVGLVRVNSDYAELDVAIVKATNHVECPPKDRHLRKIFLATS 60

Query: 61  AIRPRADVAYCIHALARRLSKTRNWTVALKTLIVIHRTLREGDPTFREELLNFTQRARII 120
           AIRPRADVAYCIHAL+RRL KTRNWTVALK L+VIHR LR+GDPTFREELLNF+Q+ RI+
Sbjct: 61  AIRPRADVAYCIHALSRRLHKTRNWTVALKALLVIHRLLRDGDPTFREELLNFSQKGRIM 120

Query: 121 QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRILKYDIESERLPRPAQGQEKGYSRT 180
           Q+SNFKDDSSP+AWDCS WVRTYALFLEERLECFR+LKYDIE+ERLP+ + GQEKGYS+T
Sbjct: 121 QISNFKDDSSPVAWDCSGWVRTYALFLEERLECFRVLKYDIEAERLPKVSPGQEKGYSKT 180

Query: 181 RELDSEELLEHLPALQQLLYRLIGCRPEGAAIGNYVIQYALALVLKESFKIYCAINDGII 240
           R+LD E+LLE LPALQQLL+RLIGC+PEGAA  N++IQYAL+LVLKESFK+YCAIN+GII
Sbjct: 181 RDLDGEKLLEQLPALQQLLHRLIGCKPEGAAKHNHIIQYALSLVLKESFKVYCAINEGII 240

Query: 241 NLVDKFFEMPRHEAIKALDIYKRAGQQAGSLSDFYDICKGLELARNFQFPVLREPPQSFL 300
           NLV+KFFEMPRHEAIKAL+IYKRAG QAG+LS FY++CKGLELARNFQFPVLREPPQSFL
Sbjct: 241 NLVEKFFEMPRHEAIKALEIYKRAGLQAGNLSAFYEVCKGLELARNFQFPVLREPPQSFL 300

Query: 301 NTMEEYIREAPRMVTVPNEPLLQLTYKPEDS-PSEDPNLPTDETEASPSHDLSITPVDSA 360
            TMEEY+R+AP+MV V + PLL LTY P+D   SED     +E E S   D ++ P +  
Sbjct: 301 TTMEEYMRDAPQMVDVTSGPLL-LTYTPDDGLTSEDVGPSHEEHETSSPSDSAVVPSEET 360

Query: 361 PL--PPPVPAPAPERHLDTGDLLGLSLDTTEVSAIEDRNALALAIVPSGDAAAPTFHSNG 420
            L    P     P+  +DT DLLGL  DT +  AI D+NALALA+V S D  +  F  + 
Sbjct: 361 QLSSQSPPSVETPQNFIDTDDLLGLHDDTPDPLAILDQNALALALV-SNDVDSSPF--SF 420

Query: 421 AQPKDFDPTGWELALVTTPSTNLSSTNERQLAGGLDTLTLDSLYDEGAYRASLQPVYGKP 480
            Q +D DP+GWELALVTTPS ++S+  ERQLAGGLDTLTL+SLYD+GA RA+ QP YG P
Sbjct: 421 GQARDLDPSGWELALVTTPSNDISAATERQLAGGLDTLTLNSLYDDGALRAAQQPAYGVP 480

Query: 481 APNPFEVQDPFAYSNAIAPAPSVQMAPIPQQQANPFGPYQPTFSQQQ-----SFTMDPTN 540
           A NPFEVQD FA+S++++P  +V          NPFG Y+PT+ QQ+          P N
Sbjct: 481 ASNPFEVQDLFAFSDSVSPPSAVN---------NPFGLYEPTYHQQEQQPQLQVAPSPAN 540

Query: 541 PFGDAGFGAFSAPNHHTVPPNANNPFGS 561
           PFGD  FG F  P      P +   FG+
Sbjct: 541 PFGD--FGEF--PIVPVSEPQSTTSFGA 551

BLAST of Cp4.1LG01g06100 vs. TAIR10
Match: AT5G57200.1 (AT5G57200.1 ENTH/ANTH/VHS superfamily protein)

HSP 1 Score: 578.9 bits (1491), Expect = 3.3e-165
Identity = 334/584 (57.19%), Positives = 399/584 (68.32%), Query Frame = 1

Query: 1   MATLQTWRKAYGALKDSTKVGLAHVNSEYADLDVAIVKSTNHVECPPKERHLRKILIATS 60
           M T  ++RKAYGALKD+T VGLA VNSE+ DLD+AIVK+TNHVE PPKERH+RKI  ATS
Sbjct: 1   MGTFTSFRKAYGALKDTTTVGLAKVNSEFKDLDIAIVKATNHVESPPKERHVRKIFSATS 60

Query: 61  AIRPRADVAYCIHALARRLSKTRNWTVALKTLIVIHRTLREGDPTFREELLNFTQRARII 120
            I+PRADVAYCIHAL++RLSKTRNW VA+K LIVIHRTLREGDPTFREELLN++ R  I+
Sbjct: 61  VIQPRADVAYCIHALSKRLSKTRNWVVAMKVLIVIHRTLREGDPTFREELLNYSHRRHIL 120

Query: 121 QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRILKYDIESERLPRPAQGQEKGYSRT 180
           ++SNFKDD+SP+AWDCSAWVRTYALFLEERLEC+R+LKYDIE+ERLP+ A G      RT
Sbjct: 121 RISNFKDDTSPLAWDCSAWVRTYALFLEERLECYRVLKYDIEAERLPK-ASGAASKTHRT 180

Query: 181 RELDSEELLEHLPALQQLLYRLIGCRPEGAAIGNYVIQYALALVLKESFKIYCAINDGII 240
           R L  E+LLE LPALQQLLYRLIGC+PEGAA  NY+IQYALALVLKESFKIYCAINDGII
Sbjct: 181 RMLSGEDLLEQLPALQQLLYRLIGCQPEGAAYSNYLIQYALALVLKESFKIYCAINDGII 240

Query: 241 NLVDKFFEMPRHEAIKALDIYKRAGQQAGSLSDFYDICKGLELARNFQFPVLREPPQSFL 300
           NLVD FFEM RH+A+KAL+IYKRAGQQA +L++FYD CKGLELARNFQFP LR+PP SFL
Sbjct: 241 NLVDMFFEMSRHDAVKALNIYKRAGQQAENLAEFYDYCKGLELARNFQFPTLRQPPPSFL 300

Query: 301 NTMEEYIREAPRMVTVPNEPLLQLTYKPE-DSPSEDPNLPTDETEASPSHDLSITPVDSA 360
            TMEEYI+EAP+  +V  +   Q   + E +   E P  P +E   + + +     ++  
Sbjct: 301 ATMEEYIKEAPQSGSVQKKLEYQEKEEEEQEQEEEQPEEPAEEENQNENTENDQPLIEEE 360

Query: 361 PLPP----PVPAPAPERHLDTGDLLGLSLDTTEVSAIEDRNALALAIVPSG-DAAAPTFH 420
              P     V    P   +DT DLLGL     + + IE  NA +LAI P G + +AP   
Sbjct: 361 EEEPKEEIEVEEAKPSPLIDTDDLLGLHEINPKAAEIEQNNAFSLAIYPPGHETSAP--- 420

Query: 421 SNGAQPKDFDPTGWELALVTTPSTNLSSTNER-----QLAGGLDTLTLDSLYDEGAYRAS 480
           SN     +   +GWELALVT  + N ++ N R     +L GG D L LDSLY++   R  
Sbjct: 421 SNSLSLIEAGGSGWELALVTPQNNNNNNNNPRPVIATKLGGGFDNLLLDSLYEDDTARRQ 480

Query: 481 LQPV---YGKPA-----------PNPFEV-QDPFAYSNAIAPAPSVQMAPIPQQ--QANP 540
           +Q     YG  A           PNPF V QDPFA SN +AP  +VQMA   QQ    N 
Sbjct: 481 IQLTNAGYGFGATAIPGALASSNPNPFGVQQDPFAMSNNMAPPTNVQMAMQQQQMMMMNN 540

Query: 541 FGPYQPTFS--QQQSFTMDPT-----NPFGDAGFGAFSAPNHHT 550
             PY   +S      F+ +P+     NPFGD  F A  AP   T
Sbjct: 541 QSPYNNNYSPYHHHQFSPNPSTSSSPNPFGDP-FLALPAPPSST 579

BLAST of Cp4.1LG01g06100 vs. TAIR10
Match: AT4G25940.1 (AT4G25940.1 ENTH/ANTH/VHS superfamily protein)

HSP 1 Score: 571.2 bits (1471), Expect = 6.9e-163
Identity = 335/603 (55.56%), Positives = 402/603 (66.67%), Query Frame = 1

Query: 1   MATLQTWRKAYGALKDSTKVGLAHVNSEYADLDVAIVKSTNHVECPPKERHLRKILIATS 60
           MAT  ++RKA GA+KDST V +A VNSE+ DLDVAIVK+TNHVE  PKERH+R+I  ATS
Sbjct: 1   MATFNSFRKAVGAIKDSTTVSIAKVNSEFKDLDVAIVKATNHVESAPKERHIRRIFSATS 60

Query: 61  AIRPRADVAYCIHALARRLSKTRNWTVALKTLIVIHRTLREGDPTFREELLNFTQRARII 120
            ++PRADVAYCIHALA+RLSKTRNW VA+K LIVIHRTLREGDPTFREELLN++ R  I+
Sbjct: 61  VVQPRADVAYCIHALAKRLSKTRNWVVAIKVLIVIHRTLREGDPTFREELLNYSHRGHIL 120

Query: 121 QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRILKYDIESERLPRPAQGQEKGYS-- 180
           ++SNFKDD+SP+AWDCSAW+RTYALFLEERLEC+R+LKYDIE+ERLP+ +    K     
Sbjct: 121 RISNFKDDTSPLAWDCSAWIRTYALFLEERLECYRVLKYDIEAERLPKGSGASSKNVDFN 180

Query: 181 -----RTRELDSEELLEHLPALQQLLYRLIGCRPEGAAIGNYVIQYALALVLKESFKIYC 240
                RTR L  EELLE LPALQQLLYRLIGC+PEG+A  NY+IQYALALVLKESFKIYC
Sbjct: 181 ASQTYRTRMLSDEELLEQLPALQQLLYRLIGCQPEGSAYSNYLIQYALALVLKESFKIYC 240

Query: 241 AINDGIINLVDKFFEMPRHEAIKALDIYKRAGQQAGSLSDFYDICKGLELARNFQFPVLR 300
           AINDGIINLVD FFEM RH+A+KAL+IYKRAGQQA +L+DFY+ CKGLELARNFQFP LR
Sbjct: 241 AINDGIINLVDMFFEMSRHDAVKALNIYKRAGQQAENLADFYEYCKGLELARNFQFPTLR 300

Query: 301 EPPQSFLNTMEEYIREAPRMVTVPNEPLLQLTYKPEDSPSED-------PNLPTD---ET 360
           +PP SFL TME+YI+EAP+  +V  +  L+   K E+   E+       P  P +   + 
Sbjct: 301 QPPPSFLATMEDYIKEAPQSGSVQKK--LEYQEKEEEEQEEEEAEHSVQPEEPAEADNQK 360

Query: 361 EASPSHDLSITPVDSAPLPPPVPAPAPERHLDTGDLLGLSLDTTEVSAIEDRNALALAIV 420
           E S      I   +            P   +DT DLLGL+    + + IEDRNALALAI 
Sbjct: 361 ENSEGDQPLIEEEEEDQEKIEEEDAKPSFLIDTDDLLGLNEINPKAAEIEDRNALALAIY 420

Query: 421 PSG-DAAAPTFHSNGAQPKDFDPTGWELALVTTPSTNLSSTNER-----QLAGGLDTLTL 480
           P G +A  P   SN     +   +GWELALV TP  N ++ N R     +LAGG D L L
Sbjct: 421 PPGHEAPGP---SNILSLIETGGSGWELALV-TPQNNNNNNNPRPAPNTKLAGGFDNLLL 480

Query: 481 DSLYDEGAYRASLQPV---YG-------KPAPNPFEV-QDPFAYSNAIAPAPSVQMAPIP 540
           DSLY++ + R  +Q     YG          PNPF++ QDPFA SN IAP  +VQMA   
Sbjct: 481 DSLYEDDSARRQIQLTNAGYGHGGIDTTAAPPNPFQMQQDPFAMSNNIAPPTNVQMAMQQ 540

Query: 541 QQQ-------------ANPFGPYQPTFSQQQSFTMDPTNPFGDAGFGAFSAPNHHTVPPN 557
           QQQ              +P   +Q     Q S    P+NPFGDA F A   P     P  
Sbjct: 541 QQQQQMTMMHQSPYNYTHPHDYHQNHHHHQFSAGPSPSNPFGDA-FLALPPPPGSAGPQQ 596

BLAST of Cp4.1LG01g06100 vs. TAIR10
Match: AT5G35200.1 (AT5G35200.1 ENTH/ANTH/VHS superfamily protein)

HSP 1 Score: 451.4 bits (1160), Expect = 7.9e-127
Identity = 268/568 (47.18%), Positives = 354/568 (62.32%), Query Frame = 1

Query: 8   RKAYGALKDSTKVGLAHVNSEYADLDVAIVKSTNHVECPPKERHLRKILIATSAIRPRAD 67
           R+  GA+KD+T V LA VNS+Y +LD+AIVK+TNHVE P KER++R I +A SA RPRAD
Sbjct: 12  RRYLGAIKDTTTVSLAKVNSDYKELDIAIVKATNHVERPSKERYIRAIFMAISATRPRAD 71

Query: 68  VAYCIHALARRLSKTRNWTVALKTLIVIHRTLREGDPTFREELLNFTQ-RARIIQLSNFK 127
           VAYCIHALARRLS+T NW VALKTLIVIHR LRE D TF EE++N+++ R+ ++ +S+FK
Sbjct: 72  VAYCIHALARRLSRTHNWAVALKTLIVIHRALREVDQTFHEEVINYSRSRSHMLNMSHFK 131

Query: 128 DDSSPIAWDCSAWVRTYALFLEERLECFRILKYDIESERLPRPAQGQEKGYSRTRELDSE 187
           DDS P AW  SAWVR YALFLEERLECFR+LKYD+E +              RT++LD+ 
Sbjct: 132 DDSGPNAWAYSAWVRFYALFLEERLECFRVLKYDVEVDP------------PRTKDLDTP 191

Query: 188 ELLEHLPALQQLLYRLIGCRPEGAAIGNYVIQYALALVLKESFKIYCAINDGIINLVDKF 247
           +LLE LPALQ+LL+R++ C+PEGAA+ N++IQ AL++V+ ES KIY A+ DGI NLVDKF
Sbjct: 192 DLLEQLPALQELLFRVLDCQPEGAAVQNHIIQLALSMVISESTKIYQALTDGIDNLVDKF 251

Query: 248 FEMPRHEAIKALDIYKRAGQQAGSLSDFYDICKGLELARNFQFPVLREPPQSFLNTMEEY 307
           F+M R++A+KALD+Y+RA +QAG LS+F+++CK + + R  +F  + +PP SFL  MEEY
Sbjct: 252 FDMQRNDAVKALDMYRRAVKQAGRLSEFFEVCKSVNVGRGERFIKIEQPPTSFLQAMEEY 311

Query: 308 IREAPRMVTVPNEPLLQLTYKPEDSPSEDPNLPTDETEASPSHDLSITPVDSAPLPPPVP 367
           ++EAP    V  E +++    P++  + +  +P    E  P+             P PV 
Sbjct: 312 VKEAPLAAGVKKEQVVEKLTAPKEILAIEYEIPPKVVEEKPAS------------PEPVK 371

Query: 368 APAPERHLDTGDLLGLSLDTTEVSAIEDRNALALAIVPSG--DAAAPTFHSNGAQPKDFD 427
           A A +      DLL +      VS +E++NALALAIVP       + T  +NG      +
Sbjct: 372 AEAEKPVEKQPDLLSMDDPAPMVSELEEKNALALAIVPVSVEQPHSTTDFTNG------N 431

Query: 428 PTGWELALVTTPSTNLSSTNERQLAGGLDTLTLDSLYDEGAYRASLQ------PVYGKPA 487
            TGWELALVT PS+N  +  + +LAGGLD LTLDSLY E A R S Q      P    P 
Sbjct: 432 STGWELALVTAPSSNEGAAADSKLAGGLDKLTLDSLY-EDAIRVSQQQNRSYNPWEQNPV 491

Query: 488 PNPFEVQDPFAYSNAIAPAPSVQMA-----PIPQQQANP---FGPYQPTFSQQQSFTMDP 547
            N   +  PF  SN +A     QMA         Q  N     GP Q  + QQQ    + 
Sbjct: 492 HNGHMMHQPFYASNGVAAPQPFQMANQNHQTFGYQHQNAGMMMGPVQQPYQQQQ---QNM 540

Query: 548 TNPFGDAGFGAFSAPNHHTVPPNANNPF 559
            NPFG+         N +   P   NP+
Sbjct: 552 NNPFGNP-----FVSNGNPQQPQGYNPY 540

BLAST of Cp4.1LG01g06100 vs. NCBI nr
Match: gi|449449048|ref|XP_004142277.1| (PREDICTED: putative clathrin assembly protein At2g01600 [Cucumis sativus])

HSP 1 Score: 1057.0 bits (2732), Expect = 1.2e-305
Identity = 530/566 (93.64%), Positives = 544/566 (96.11%), Query Frame = 1

Query: 1   MATLQTWRKAYGALKDSTKVGLAHVNSEYADLDVAIVKSTNHVECPPKERHLRKILIATS 60
           MATLQTWRKAYGALKDSTKVGLAHVNS+YADLDVAIVK+TNHVECPPKERHLRKILIATS
Sbjct: 1   MATLQTWRKAYGALKDSTKVGLAHVNSDYADLDVAIVKATNHVECPPKERHLRKILIATS 60

Query: 61  AIRPRADVAYCIHALARRLSKTRNWTVALKTLIVIHRTLREGDPTFREELLNFTQRARII 120
           AIRPRADVAYCIHALARRLSKTRNWTVALK LIVIHRTLREGDPTFREELLNFTQRARI+
Sbjct: 61  AIRPRADVAYCIHALARRLSKTRNWTVALKALIVIHRTLREGDPTFREELLNFTQRARIL 120

Query: 121 QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRILKYDIESERLPRPAQGQEKGYSRT 180
           QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRILKYDIESERLPRPAQGQEKGYSRT
Sbjct: 121 QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRILKYDIESERLPRPAQGQEKGYSRT 180

Query: 181 RELDSEELLEHLPALQQLLYRLIGCRPEGAAIGNYVIQYALALVLKESFKIYCAINDGII 240
           RELDSEELLEHLPALQQLLYRLIGC+PEGAAIGNYVIQYALALVLKESFKIYCAINDGII
Sbjct: 181 RELDSEELLEHLPALQQLLYRLIGCKPEGAAIGNYVIQYALALVLKESFKIYCAINDGII 240

Query: 241 NLVDKFFEMPRHEAIKALDIYKRAGQQAGSLSDFYDICKGLELARNFQFPVLREPPQSFL 300
           NLVDKFFEMPRHEAIKALDIYKRAGQQAGSLSDFYDICKGLELARNFQFPVLREPPQSFL
Sbjct: 241 NLVDKFFEMPRHEAIKALDIYKRAGQQAGSLSDFYDICKGLELARNFQFPVLREPPQSFL 300

Query: 301 NTMEEYIREAPRMVTVPNEPLLQLTYKPEDSPSEDPNLPTDETEASPSHDLSITPVDSAP 360
           NTMEEYIREAPRMVTVPNEPLLQLTYKPE+S SED NLPTDE EASPS+DLSITPV++AP
Sbjct: 301 NTMEEYIREAPRMVTVPNEPLLQLTYKPEESLSEDQNLPTDELEASPSNDLSITPVETAP 360

Query: 361 L-PPPVPAPAPERHLDTGDLLGLSLDTTEVSAIEDRNALALAIVPSGDAAAPTFHSNGAQ 420
             PPP PAPAPE HL+TGDLLGLSL TTEVSAIE+RNALALAIVPSGD  APTFHSNGAQ
Sbjct: 361 TPPPPAPAPAPESHLETGDLLGLSLATTEVSAIEERNALALAIVPSGDTEAPTFHSNGAQ 420

Query: 421 PKDFDPTGWELALVTTPSTNLSSTNERQLAGGLDTLTLDSLYDEGAYRASLQPVYGKPAP 480
             DFDPTGWELALVTTPSTNLSS NERQLAGGLDTL LDSLYDEGAYRASLQPVYGKPAP
Sbjct: 421 ANDFDPTGWELALVTTPSTNLSSANERQLAGGLDTLILDSLYDEGAYRASLQPVYGKPAP 480

Query: 481 NPFEVQDPFAYSNAIAPAPSVQMAPIPQQQANPFGPYQPTF-SQQQSFTMDPTNPFGDAG 540
           NPFEVQDPFAYSNAIAP PSVQMAP+ QQQANPFGP+QPTF  QQQ FTMDPTNPFGD+G
Sbjct: 481 NPFEVQDPFAYSNAIAPPPSVQMAPLAQQQANPFGPFQPTFPQQQQPFTMDPTNPFGDSG 540

Query: 541 FGAFSAPNHHTVPPNANNPFGSTGLL 565
           FGAF APNHHTVPP A+NPFGSTGLL
Sbjct: 541 FGAFPAPNHHTVPPPASNPFGSTGLL 566

BLAST of Cp4.1LG01g06100 vs. NCBI nr
Match: gi|659129479|ref|XP_008464707.1| (PREDICTED: putative clathrin assembly protein At2g01600 [Cucumis melo])

HSP 1 Score: 1047.7 bits (2708), Expect = 7.1e-303
Identity = 523/565 (92.57%), Positives = 540/565 (95.58%), Query Frame = 1

Query: 1   MATLQTWRKAYGALKDSTKVGLAHVNSEYADLDVAIVKSTNHVECPPKERHLRKILIATS 60
           MATLQTWRKAYGALKDSTKVGLAHVNS+YADLDVAIVK+TNHVECPPKERHLRKILIATS
Sbjct: 93  MATLQTWRKAYGALKDSTKVGLAHVNSDYADLDVAIVKATNHVECPPKERHLRKILIATS 152

Query: 61  AIRPRADVAYCIHALARRLSKTRNWTVALKTLIVIHRTLREGDPTFREELLNFTQRARII 120
           AIRPRADVAYCIHALARRLSKTRNWTVALK LIVIHRTLREGDPTFREELLNFTQR+RI+
Sbjct: 153 AIRPRADVAYCIHALARRLSKTRNWTVALKALIVIHRTLREGDPTFREELLNFTQRSRIL 212

Query: 121 QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRILKYDIESERLPRPAQGQEKGYSRT 180
           QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRILKYDIESERLPRPAQGQEKGYSRT
Sbjct: 213 QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRILKYDIESERLPRPAQGQEKGYSRT 272

Query: 181 RELDSEELLEHLPALQQLLYRLIGCRPEGAAIGNYVIQYALALVLKESFKIYCAINDGII 240
           RELDSEELLEHLPALQQLLYRLIGC+PEGAAIGNYVIQYALALVLKESFKIYCAINDGII
Sbjct: 273 RELDSEELLEHLPALQQLLYRLIGCKPEGAAIGNYVIQYALALVLKESFKIYCAINDGII 332

Query: 241 NLVDKFFEMPRHEAIKALDIYKRAGQQAGSLSDFYDICKGLELARNFQFPVLREPPQSFL 300
           NLV+KFFEMPRHEAIKALDIYKRAGQQ+GSLSDFYDICKGLELARNFQFPVLREPPQSFL
Sbjct: 333 NLVEKFFEMPRHEAIKALDIYKRAGQQSGSLSDFYDICKGLELARNFQFPVLREPPQSFL 392

Query: 301 NTMEEYIREAPRMVTVPNEPLLQLTYKPEDSPSEDPNLPTDETEASPSHDLSITPVDSAP 360
           NTMEEYIREAPRMVTVPNEPLLQLTYKPE+S SEDPNLP DE EASPS+DLSITPV++AP
Sbjct: 393 NTMEEYIREAPRMVTVPNEPLLQLTYKPEESLSEDPNLPPDEPEASPSNDLSITPVETAP 452

Query: 361 LPPPVPAPAPERHLDTGDLLGLSLDTTEVSAIEDRNALALAIVPSGDAAAPTFHSNGAQP 420
            PPP    APE HL+TGDLLGLSL TTEVSAIE+RNALALAIVPSGD  APTFHSNGA  
Sbjct: 453 APPPPAPAAPESHLETGDLLGLSLATTEVSAIEERNALALAIVPSGDTEAPTFHSNGAHA 512

Query: 421 KDFDPTGWELALVTTPSTNLSSTNERQLAGGLDTLTLDSLYDEGAYRASLQPVYGKPAPN 480
            DFDPTGWELALVTTPSTNLS+ NERQLAGGLDTLTLDSLYDEGAYRASLQPVYGKPAPN
Sbjct: 513 NDFDPTGWELALVTTPSTNLSTANERQLAGGLDTLTLDSLYDEGAYRASLQPVYGKPAPN 572

Query: 481 PFEVQDPFAYSNAIAPAPSVQMAPIPQQQANPFGPYQPTF-SQQQSFTMDPTNPFGDAGF 540
           PFEVQDPFAYSNAIAP PSVQMAP+ Q QANPFGP+QPTF  QQQ FTMDPTNPFGDAGF
Sbjct: 573 PFEVQDPFAYSNAIAPPPSVQMAPLAQPQANPFGPFQPTFPQQQQPFTMDPTNPFGDAGF 632

Query: 541 GAFSAPNHHTVPPNANNPFGSTGLL 565
           GAF APNHHTVPP A+NPFGSTGLL
Sbjct: 633 GAFPAPNHHTVPPPASNPFGSTGLL 657

BLAST of Cp4.1LG01g06100 vs. NCBI nr
Match: gi|645229365|ref|XP_008221433.1| (PREDICTED: putative clathrin assembly protein At2g01600 [Prunus mume])

HSP 1 Score: 908.7 bits (2347), Expect = 5.2e-261
Identity = 456/566 (80.57%), Positives = 500/566 (88.34%), Query Frame = 1

Query: 1   MATLQTWRKAYGALKDSTKVGLAHVNSEYADLDVAIVKSTNHVECPPKERHLRKILIATS 60
           M TLQTWRKAYGALKD+TKVGLAHVNS+YADLDVAIVK+TNHVECPPKERHLRKILIATS
Sbjct: 1   MGTLQTWRKAYGALKDTTKVGLAHVNSDYADLDVAIVKATNHVECPPKERHLRKILIATS 60

Query: 61  AIRPRADVAYCIHALARRLSKTRNWTVALKTLIVIHRTLREGDPTFREELLNFTQRARII 120
           AIRPRADVAYCIHAL+RRL+KT NWTVALKTLIVIHRTLREGDPTFREELLNF+QR RI+
Sbjct: 61  AIRPRADVAYCIHALSRRLNKTHNWTVALKTLIVIHRTLREGDPTFREELLNFSQRGRIL 120

Query: 121 QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRILKYDIESERLPRPAQGQEKGYSRT 180
           QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFR+LKYDIE+ERLPRPAQGQEKGYSRT
Sbjct: 121 QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRVLKYDIEAERLPRPAQGQEKGYSRT 180

Query: 181 RELDSEELLEHLPALQQLLYRLIGCRPEGAAIGNYVIQYALALVLKESFKIYCAINDGII 240
           RELDSEELLE LPALQQLLY LIGCRPEGAA+ NYVIQYALALVLKESFKIYCA+NDGII
Sbjct: 181 RELDSEELLEQLPALQQLLYHLIGCRPEGAAVANYVIQYALALVLKESFKIYCAVNDGII 240

Query: 241 NLVDKFFEMPRHEAIKALDIYKRAGQQAGSLSDFYDICKGLELARNFQFPVLREPPQSFL 300
           NLVDKFFEMPRHEA+KALD+YKRAGQQA  LSDFY++CKGLELARNFQFPVLREPPQSFL
Sbjct: 241 NLVDKFFEMPRHEAVKALDVYKRAGQQAAGLSDFYEVCKGLELARNFQFPVLREPPQSFL 300

Query: 301 NTMEEYIREAPRMVTVPNEPLLQLTYKPEDSPSEDPNLPTDETEASPSHDLSITPVDSAP 360
            TMEEYIREAPR V+VP+EPLLQLTY+PE+ PSED  L +DE+E +P   + ++ V++A 
Sbjct: 301 TTMEEYIREAPRAVSVPHEPLLQLTYRPEE-PSEDTKLSSDESEPAPLDIVPVSNVETA- 360

Query: 361 LPPPVPAPAPERHLDTGDLLGLSLDTTEVSAIEDRNALALAIVPSGDAAAPTFHSNGAQP 420
            P P P P P+   DTGDLLGL     +VS +E+RNALALAIV S   AAPTF+S+  QP
Sbjct: 361 TPSPPPPPPPQSSQDTGDLLGLDYTAADVSVMEERNALALAIVSSETDAAPTFNSSAVQP 420

Query: 421 KDFDPTGWELALVTTPSTNLSSTNERQLAGGLDTLTLDSLYDEGAYRASLQPVYGKPAPN 480
           KDFDPTGWELALVTTPS N+SS NERQLAGGLD+LTL+SLYDEGAYRA+ QPVYG PAPN
Sbjct: 421 KDFDPTGWELALVTTPSNNISSVNERQLAGGLDSLTLNSLYDEGAYRAAQQPVYGAPAPN 480

Query: 481 PFEVQDPFAYSNAIAPAPSVQMAPIPQQQANPFGPYQPTFS-QQQSFTMDPTNPFGDAGF 540
           PFEVQDPFA SN +AP P VQMA + QQQ+NPFGP+QPT+  QQQ+  M PTNPFGD GF
Sbjct: 481 PFEVQDPFALSNNVAPPPGVQMAAMAQQQSNPFGPFQPTYQPQQQNVMMGPTNPFGDTGF 540

Query: 541 GAFSAPNHHTVP-PNANNPFGSTGLL 565
           GAF  P H   P P  +NPFGSTGLL
Sbjct: 541 GAF--PAHPPAPHPQTSNPFGSTGLL 562

BLAST of Cp4.1LG01g06100 vs. NCBI nr
Match: gi|596159388|ref|XP_007222905.1| (hypothetical protein PRUPE_ppa003610mg [Prunus persica])

HSP 1 Score: 908.3 bits (2346), Expect = 6.7e-261
Identity = 457/566 (80.74%), Positives = 501/566 (88.52%), Query Frame = 1

Query: 1   MATLQTWRKAYGALKDSTKVGLAHVNSEYADLDVAIVKSTNHVECPPKERHLRKILIATS 60
           M TLQTWRKAYGALKD+TKVGLAHVNS+YADLDVAIVK+TNHVECPPKERHLRKILIATS
Sbjct: 1   MGTLQTWRKAYGALKDTTKVGLAHVNSDYADLDVAIVKATNHVECPPKERHLRKILIATS 60

Query: 61  AIRPRADVAYCIHALARRLSKTRNWTVALKTLIVIHRTLREGDPTFREELLNFTQRARII 120
           AIRPRADVAYCIHAL+RRL+KT NWTVALKTLIVIHRTLREGDPTFREELLNF+QR RI+
Sbjct: 61  AIRPRADVAYCIHALSRRLNKTHNWTVALKTLIVIHRTLREGDPTFREELLNFSQRGRIL 120

Query: 121 QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRILKYDIESERLPRPAQGQEKGYSRT 180
           QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFR+LKYDIE+ERLPRPAQGQEKGYSRT
Sbjct: 121 QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRVLKYDIEAERLPRPAQGQEKGYSRT 180

Query: 181 RELDSEELLEHLPALQQLLYRLIGCRPEGAAIGNYVIQYALALVLKESFKIYCAINDGII 240
           RELDSEELLE LPALQQLLYRLIGCRPEGAA+ NYVIQYALALVLKESFKIYCA+NDGII
Sbjct: 181 RELDSEELLEQLPALQQLLYRLIGCRPEGAAVVNYVIQYALALVLKESFKIYCAVNDGII 240

Query: 241 NLVDKFFEMPRHEAIKALDIYKRAGQQAGSLSDFYDICKGLELARNFQFPVLREPPQSFL 300
           NLVDKFFEMPRHEA+KALD+YKRAGQQA  LSDFY++CKGLELARNFQFPVLREPPQSFL
Sbjct: 241 NLVDKFFEMPRHEAVKALDVYKRAGQQAAGLSDFYEVCKGLELARNFQFPVLREPPQSFL 300

Query: 301 NTMEEYIREAPRMVTVPNEPLLQLTYKPEDSPSEDPNLPTDETEASPSHDLSITPVDSAP 360
            TMEEYIREAPR VTVP+EPLLQLTY+PE+ PSED  L +DE+E +P   + ++ V++A 
Sbjct: 301 TTMEEYIREAPRAVTVPHEPLLQLTYRPEE-PSEDTKLSSDESEPAPLDIVPVSNVETA- 360

Query: 361 LPPPVPAPAPERHLDTGDLLGLSLDTTEVSAIEDRNALALAIVPSGDAAAPTFHSNGAQP 420
            P P P P P+   DTGDLLGL    ++VS +E+RNALALAIV S   AAPTF+S+  QP
Sbjct: 361 TPSPPPPPPPQSSQDTGDLLGLDYTASDVSVMEERNALALAIVSSETDAAPTFNSSAVQP 420

Query: 421 KDFDPTGWELALVTTPSTNLSSTNERQLAGGLDTLTLDSLYDEGAYRASLQPVYGKPAPN 480
           KDFDPTGWELALVTTPS N+SS NERQLAGGLD+LTL+SLYDEGAYRA+ QPVYG PAPN
Sbjct: 421 KDFDPTGWELALVTTPSNNISSVNERQLAGGLDSLTLNSLYDEGAYRAAQQPVYGAPAPN 480

Query: 481 PFEVQDPFAYSNAIAPAPSVQMAPIPQQQANPFGPYQPTFS-QQQSFTMDPTNPFGDAGF 540
           PFEVQDPFA SN +AP P VQMA + QQQ+NPFG +QPT+  QQQ+  M PTNPFGD GF
Sbjct: 481 PFEVQDPFALSNNVAPPPGVQMAAMAQQQSNPFGSFQPTYQPQQQNVMMGPTNPFGDTGF 540

Query: 541 GAFSAPNHHTVP-PNANNPFGSTGLL 565
           GAF  P H   P P  +NPFGSTGLL
Sbjct: 541 GAF--PAHPPAPHPQTSNPFGSTGLL 562

BLAST of Cp4.1LG01g06100 vs. NCBI nr
Match: gi|255573732|ref|XP_002527787.1| (PREDICTED: putative clathrin assembly protein At2g01600 isoform X2 [Ricinus communis])

HSP 1 Score: 906.0 bits (2340), Expect = 3.3e-260
Identity = 457/572 (79.90%), Positives = 500/572 (87.41%), Query Frame = 1

Query: 1   MATLQTWRKAYGALKDSTKVGLAHVNSEYADLDVAIVKSTNHVECPPKERHLRKILIATS 60
           M TLQTWRKAYGALKDSTKVGLAHVNS++A+LDVAIVK+TNHVECPPKERHLRKIL+ATS
Sbjct: 1   MGTLQTWRKAYGALKDSTKVGLAHVNSDFAELDVAIVKATNHVECPPKERHLRKILVATS 60

Query: 61  AIRPRADVAYCIHALARRLSKTRNWTVALKTLIVIHRTLREGDPTFREELLNFTQRARII 120
           AIRPRADV YCIHAL+RRL+KT NWTVALKTLIVIHR LREGDPTF+EEL+NF+QR RI+
Sbjct: 61  AIRPRADVQYCIHALSRRLAKTHNWTVALKTLIVIHRLLREGDPTFKEELVNFSQRGRIL 120

Query: 121 QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRILKYDIESERLPRPAQGQEKGYSRT 180
           QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRILKYDIE+ERLPRP QGQ+KGYSRT
Sbjct: 121 QLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRILKYDIEAERLPRPVQGQDKGYSRT 180

Query: 181 RELDSEELLEHLPALQQLLYRLIGCRPEGAAIGNYVIQYALALVLKESFKIYCAINDGII 240
           RELDSEELLE LPALQQLLYRL+GCRPEGAA+GNYVIQYALALVLKESFKIYCAINDGII
Sbjct: 181 RELDSEELLEQLPALQQLLYRLVGCRPEGAAVGNYVIQYALALVLKESFKIYCAINDGII 240

Query: 241 NLVDKFFEMPRHEAIKALDIYKRAGQQAGSLSDFYDICKGLELARNFQFPVLREPPQSFL 300
           NLVDKFFEMPRHEAIKALD+YKRAGQQAGSLSDFYD+CKGLELARNFQFPVLREPPQSFL
Sbjct: 241 NLVDKFFEMPRHEAIKALDVYKRAGQQAGSLSDFYDVCKGLELARNFQFPVLREPPQSFL 300

Query: 301 NTMEEYIREAPRMVTVPNEPLLQLTYKPEDSPS--EDPNLPTDETEASPSHDLSITPVDS 360
            TMEEYIREAPR+VTVP+EPLLQLTY+PE+ PS  ED  LP DE E+ PS D++I   + 
Sbjct: 301 TTMEEYIREAPRVVTVPSEPLLQLTYRPEEGPSEPEDTKLPIDEPESVPSEDVAIANAEV 360

Query: 361 APLPPPVPAPAPERHLDTGDLLGL---SLDTTEVSAIEDRNALALAIVPSGDAAAPTFHS 420
           AP  PP P   P+ ++DTGDLLGL   S D +  SAIE+RNALALAIVP    AAPTF+S
Sbjct: 361 APPTPPTP---PQNNMDTGDLLGLNYASPDVSAASAIEERNALALAIVPLEQDAAPTFNS 420

Query: 421 NGAQPKDFDPTGWELALVTTPSTNLSSTNERQLAGGLDTLTLDSLYDEGAYRASLQPVYG 480
              QPKDFDPTGWELALVTTPS N+SS N+RQLAGGLDTLTL+SLYD+ AYRA+ QPVYG
Sbjct: 421 GAGQPKDFDPTGWELALVTTPSANISSVNDRQLAGGLDTLTLNSLYDDVAYRAAQQPVYG 480

Query: 481 KPAPNPFEVQDPFAYSNAIAPAPSVQMAPIPQQQANPFGPYQPTF---SQQQSFTMDPTN 540
            PAPNPFEV DPFA SN+IAP  +VQMA + QQ  NPFGPYQPT+    QQQ   M P N
Sbjct: 481 APAPNPFEVHDPFAMSNSIAPPSAVQMAAMTQQPPNPFGPYQPTYPQPQQQQHLMMSPAN 540

Query: 541 PFGDAGFGAFSAPNHHTVPPNANNPFGSTGLL 565
           PFGDAGFG F  P +    P++NNPFGSTGLL
Sbjct: 541 PFGDAGFGTF--PVNTVTHPHSNNPFGSTGLL 567
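
The percentages in each HSP header above are the identical and positive residue counts divided by the alignment length (for example, 457/572 gives 79.90%). The following is a minimal Python sketch for pulling those numbers out of a header line and rechecking the arithmetic. The function names `parse_hsp_stats` and `percent` are hypothetical and not part of any BLAST tool, and the regular expression assumes the exact header layout shown in this report, accepting both "Positives" and the "Postives" spelling that some report renderers emit.

```python
import re

# Assumed header layout (taken from the HSP lines in this report):
# "Identity = 457/572 (79.90%), Positives = 500/572 (87.41%), Query Frame = 1"
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/\d+ \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return the counts and reported percentages from an HSP header line,
    or None if the line is not an Identity/Positives header."""
    m = HSP_STATS.search(line)
    if m is None:
        return None
    identical, length, identity_pct, positives, positives_pct = m.groups()
    return {
        "identical": int(identical),
        "positives": int(positives),
        "length": int(length),
        "identity_pct": float(identity_pct),
        "positives_pct": float(positives_pct),
    }


def percent(count, length):
    """Recompute a percentage from a residue count and the alignment length."""
    return round(100.0 * count / length, 2)


if __name__ == "__main__":
    header = "Identity = 457/572 (79.90%), Positives = 500/572 (87.41%), Query Frame = 1"
    stats = parse_hsp_stats(header)
    print(stats)
    print(percent(stats["identical"], stats["length"]))
```

Recomputing the percentage from the counts is a quick way to confirm that a header line was parsed correctly before the values are used to rank or filter hits.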

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
CAP8_ARATH	1.9e-218	68.47	Putative clathrin assembly protein At2g01600 OS=Arabidopsis thaliana GN=At2g0160...
CAP9_ARATH	5.6e-207	66.37	Putative clathrin assembly protein At1g14910 OS=Arabidopsis thaliana GN=At1g1491...
CAP7_ARATH	5.9e-164	57.19	Putative clathrin assembly protein At5g57200 OS=Arabidopsis thaliana GN=At5g5720...
CAP6_ARATH	1.2e-161	55.56	Putative clathrin assembly protein At4g25940 OS=Arabidopsis thaliana GN=At4g2594...
CAP10_ARATH	1.4e-125	47.18	Putative clathrin assembly protein At5g35200 OS=Arabidopsis thaliana GN=At5g3520...

Match Name	E-value	Identity	Description
A0A0A0KIT4_CUCSA	8.2e-306	93.64	Uncharacterized protein OS=Cucumis sativus GN=Csa_5G033480 PE=4 SV=1
M5XDV8_PRUPE	4.7e-261	80.74	Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003610mg PE=4 SV=1
B9HJD6_POPTR	2.3e-260	80.42	Clathrin assembly family protein OS=Populus trichocarpa GN=POPTR_0008s13140g PE=...
B9SP68_RICCO	2.3e-260	79.90	Clathrin assembly protein, putative OS=Ricinus communis GN=RCOM_0629240 PE=4 SV=...
A0A061E7Q5_THECC	9.8e-259	79.79	ENTH/ANTH/VHS superfamily protein OS=Theobroma cacao GN=TCM_010974 PE=4 SV=1

Match Name	E-value	Identity	Description
AT2G01600.1	1.0e-219	68.47	ENTH/ANTH/VHS superfamily protein
AT1G14910.1	3.2e-208	66.37	ENTH/ANTH/VHS superfamily protein
AT5G57200.1	3.3e-165	57.19	ENTH/ANTH/VHS superfamily protein
AT4G25940.1	6.9e-163	55.56	ENTH/ANTH/VHS superfamily protein
AT5G35200.1	7.9e-127	47.18	ENTH/ANTH/VHS superfamily protein

Match Name	E-value	Identity	Description
gi|449449048|ref|XP_004142277.1|	1.2e-305	93.64	PREDICTED: putative clathrin assembly protein At2g01600 [Cucumis sativus]
gi|659129479|ref|XP_008464707.1|	7.1e-303	92.57	PREDICTED: putative clathrin assembly protein At2g01600 [Cucumis melo]
gi|645229365|ref|XP_008221433.1|	5.2e-261	80.57	PREDICTED: putative clathrin assembly protein At2g01600 [Prunus mume]
gi|596159388|ref|XP_007222905.1|	6.7e-261	80.74	hypothetical protein PRUPE_ppa003610mg [Prunus persica]
gi|255573732|ref|XP_002527787.1|	3.3e-260	79.90	PREDICTED: putative clathrin assembly protein At2g01600 isoform X2 [Ricinus comm...

The following terms have been associated with this gene:
Vocabulary: Biological Process
Term	Definition
GO:0048268	clathrin coat assembly
Vocabulary: Molecular Function
Term	Definition
GO:0030276	clathrin binding
GO:0005545	1-phosphatidylinositol binding
GO:0005543	phospholipid binding
Vocabulary: Cellular Component
Term	Definition
GO:0030136	clathrin-coated vesicle
Vocabulary: INTERPRO
Term	Definition
IPR014712	Clathrin_AP_dom2
IPR013809	ENTH
IPR011417	ANTH_dom
IPR008942	ENTH_VHS
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0048268 clathrin coat assembly
biological_process GO:0055114 oxidation-reduction process
cellular_component GO:0030136 clathrin-coated vesicle
molecular_function GO:0005545 1-phosphatidylinositol binding
molecular_function GO:0032440 2-alkenal reductase [NAD(P)] activity
molecular_function GO:0030276 clathrin binding
molecular_function GO:0005543 phospholipid binding

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG01g06100.1	Cp4.1LG01g06100.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR008942	ENTH/VHS	GENE3D	G3DSA:1.25.40.90	-	coord: 30..157; score: 2.9
IPR008942	ENTH/VHS	unknown	SSF48464	ENTH/VHS domain	coord: 31..162; score: 2.24
IPR011417	AP180 N-terminal homology (ANTH) domain	PFAM	PF07651	ANTH	coord: 31..309; score: 1.0
IPR013809	ENTH domain	SMART	SM00273	enth_2	coord: 30..161; score: 5.8
IPR013809	ENTH domain	PROFILE	PS50942	ENTH	coord: 24..161; score: 35
IPR014712	Phosphoinositide-binding clathrin adaptor, domain 2	GENE3D	G3DSA:1.20.58.150	-	coord: 185..312; score: 4.8
None	No IPR available	PANTHER	PTHR22951	CLATHRIN ASSEMBLY PROTEIN	coord: 2..564; score: -
None	No IPR available	PANTHER	PTHR22951:SF25	SUBFAMILY NOT NAMED	coord: 2..564; score: -
None	No IPR available	unknown	SSF89009	GAT-like domain	coord: 178..309; score: 2.16

The following gene(s) are orthologous to this gene:
Gene	Orthologue	Organism	Block
Cp4.1LG01g06100	CmaCh04G000650	Cucurbita maxima (Rimu)	cmacpeB720
Cp4.1LG01g06100	CmoCh04G000580	Cucurbita moschata (Rifu)	cmocpeB673
Cp4.1LG01g06100	Carg22010	Silver-seed gourd	carcpeB0258
The following gene(s) are paralogous to this gene:
Gene	Paralogue	Organism	Block
Cp4.1LG01g06100	Cp4.1LG01g07470	Cucurbita pepo (Zucchini)	cpecpeB374