HG10011785 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10011785
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: ENTH domain-containing protein
Location: Chr01: 11770556 .. 11771644 (+)
RNA-Seq Expression: HG10011785
Synteny: HG10011785
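The location above can be turned into a feature length with a short calculation. The sketch below assumes the coordinates are 1-based and inclusive (the usual GFF3 convention; the page itself does not state this), and `feature_length` is a hypothetical helper name used only for illustration.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive start/end coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must be >= start")
    return end - start + 1


# Coordinates taken from the Location field: Chr01: 11770556 .. 11771644 (+)
gene_length = feature_length(11770556, 11771644)
print(gene_length)           # 1089
print(gene_length % 3 == 0)  # True
```

Under that assumption the span is 1089 bp, a multiple of three. That would correspond to 363 codons if the whole span is coding, as the identical gene, mRNA and CDS sequences listed on this page suggest.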
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTGCGCACAAAAAAGTTGAGTTCCCTAATTGGACTCATCAAAGACAAAGCCTCTCAAACCAAAGCCGCGCTTCTCACTAAGCCCAACATTCTCTCCTTTCAACTCGCCCTCCTCCGAGCCACCACTCACGATCCCCACGCGCCGCCCAGCGAGAAGCACCTCTCTGTTCTTCTCTCTCTTGGCAAAACCTCTCGCGCCACCGCCGCTGCTGCCGTTGAAGTCTTAATGGACCGCCTTCAAACCACCCAAAACTCCGCCGTCGCCCTCAAGTGTCTAATCGCTGTCCACCACATCATCAAGAACGGCGGCTTCATTCTACAAGACCAGCTCTCTGTTTTTCCCTTCACCGGCGGCAGAAACTACCTTAAACTCTCCGATTTCCGCGACAATTCCAATCCCATTTCTTGGGAGCTTTCCTCTTGGGTTCGATGGTACGCTCAGTACATCGAAACTGTCTTGTCTATTTCCCGAATTTTGGGGGCTTTTGTTGGTTCTTCTAGCTCGAATGAAGAGAAGGAGAAAAAAGCAGAGCAGATTTCGGGGATTTTGAACTCCGATTTGCTTAAAGAGACCGAATCTTTGGTGGGTTTAATCGAAGAAACTTCGAAAATGCCTCACTGTTTGCATCTGAATGGAAACAGATTGGTGGATAAGATCTACGCCTTTGTCGGTGACGATTACTTGTCGGTTATTAAGGAAATTTCAATCCGAGTTGCAGAGTTTCACCAGCGGCTCGGTTGCCTGAGTTTCGGCGAATCGGTCGAGTTGGTTTGCGCGTTGAAACGGCTCGAGGATTGCAAAGAAAAGCAATCCATGGGAATTTCTGCAAAGTACGAAGTTTTGTTGGATGAATTTTGGGGTTCCATTAGAGAGACCAAGAATTTGATTGGGGAATCCAAGGAAAATCGAGAGGGCGGTAAATTGGCCAGGACGAAGAGCAAAATGAGCGACTCGGGCCGGTTTATGGAGCGGGCTAATGCTGGTTCTTATCGCGACTCAATTCGGTTCGGTTCTGAGCGATTCGATTTAACCTACAAAGGGTTTCCAGTCCTAGGTATAACGGAATCGTACTTTCTGCTAAAATGA

mRNA sequence

ATGGTGCGCACAAAAAAGTTGAGTTCCCTAATTGGACTCATCAAAGACAAAGCCTCTCAAACCAAAGCCGCGCTTCTCACTAAGCCCAACATTCTCTCCTTTCAACTCGCCCTCCTCCGAGCCACCACTCACGATCCCCACGCGCCGCCCAGCGAGAAGCACCTCTCTGTTCTTCTCTCTCTTGGCAAAACCTCTCGCGCCACCGCCGCTGCTGCCGTTGAAGTCTTAATGGACCGCCTTCAAACCACCCAAAACTCCGCCGTCGCCCTCAAGTGTCTAATCGCTGTCCACCACATCATCAAGAACGGCGGCTTCATTCTACAAGACCAGCTCTCTGTTTTTCCCTTCACCGGCGGCAGAAACTACCTTAAACTCTCCGATTTCCGCGACAATTCCAATCCCATTTCTTGGGAGCTTTCCTCTTGGGTTCGATGGTACGCTCAGTACATCGAAACTGTCTTGTCTATTTCCCGAATTTTGGGGGCTTTTGTTGGTTCTTCTAGCTCGAATGAAGAGAAGGAGAAAAAAGCAGAGCAGATTTCGGGGATTTTGAACTCCGATTTGCTTAAAGAGACCGAATCTTTGGTGGGTTTAATCGAAGAAACTTCGAAAATGCCTCACTGTTTGCATCTGAATGGAAACAGATTGGTGGATAAGATCTACGCCTTTGTCGGTGACGATTACTTGTCGGTTATTAAGGAAATTTCAATCCGAGTTGCAGAGTTTCACCAGCGGCTCGGTTGCCTGAGTTTCGGCGAATCGGTCGAGTTGGTTTGCGCGTTGAAACGGCTCGAGGATTGCAAAGAAAAGCAATCCATGGGAATTTCTGCAAAGTACGAAGTTTTGTTGGATGAATTTTGGGGTTCCATTAGAGAGACCAAGAATTTGATTGGGGAATCCAAGGAAAATCGAGAGGGCGGTAAATTGGCCAGGACGAAGAGCAAAATGAGCGACTCGGGCCGGTTTATGGAGCGGGCTAATGCTGGTTCTTATCGCGACTCAATTCGGTTCGGTTCTGAGCGATTCGATTTAACCTACAAAGGGTTTCCAGTCCTAGGTATAACGGAATCGTACTTTCTGCTAAAATGA

Coding sequence (CDS)

ATGGTGCGCACAAAAAAGTTGAGTTCCCTAATTGGACTCATCAAAGACAAAGCCTCTCAAACCAAAGCCGCGCTTCTCACTAAGCCCAACATTCTCTCCTTTCAACTCGCCCTCCTCCGAGCCACCACTCACGATCCCCACGCGCCGCCCAGCGAGAAGCACCTCTCTGTTCTTCTCTCTCTTGGCAAAACCTCTCGCGCCACCGCCGCTGCTGCCGTTGAAGTCTTAATGGACCGCCTTCAAACCACCCAAAACTCCGCCGTCGCCCTCAAGTGTCTAATCGCTGTCCACCACATCATCAAGAACGGCGGCTTCATTCTACAAGACCAGCTCTCTGTTTTTCCCTTCACCGGCGGCAGAAACTACCTTAAACTCTCCGATTTCCGCGACAATTCCAATCCCATTTCTTGGGAGCTTTCCTCTTGGGTTCGATGGTACGCTCAGTACATCGAAACTGTCTTGTCTATTTCCCGAATTTTGGGGGCTTTTGTTGGTTCTTCTAGCTCGAATGAAGAGAAGGAGAAAAAAGCAGAGCAGATTTCGGGGATTTTGAACTCCGATTTGCTTAAAGAGACCGAATCTTTGGTGGGTTTAATCGAAGAAACTTCGAAAATGCCTCACTGTTTGCATCTGAATGGAAACAGATTGGTGGATAAGATCTACGCCTTTGTCGGTGACGATTACTTGTCGGTTATTAAGGAAATTTCAATCCGAGTTGCAGAGTTTCACCAGCGGCTCGGTTGCCTGAGTTTCGGCGAATCGGTCGAGTTGGTTTGCGCGTTGAAACGGCTCGAGGATTGCAAAGAAAAGCAATCCATGGGAATTTCTGCAAAGTACGAAGTTTTGTTGGATGAATTTTGGGGTTCCATTAGAGAGACCAAGAATTTGATTGGGGAATCCAAGGAAAATCGAGAGGGCGGTAAATTGGCCAGGACGAAGAGCAAAATGAGCGACTCGGGCCGGTTTATGGAGCGGGCTAATGCTGGTTCTTATCGCGACTCAATTCGGTTCGGTTCTGAGCGATTCGATTTAACCTACAAAGGGTTTCCAGTCCTAGGTATAACGGAATCGTACTTTCTGCTAAAATGA

Protein sequence

MVRTKKLSSLIGLIKDKASQTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKNGGFILQDQLSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSSSSNEEKEKKAEQISGILNSDLLKETESLVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVAEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKNLIGESKENREGGKLARTKSKMSDSGRFMERANAGSYRDSIRFGSERFDLTYKGFPVLGITESYFLLK
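The CDS and protein sequence above should be related by translation. As a minimal sketch, assuming the standard genetic code applies (a reasonable but unverified assumption for this nuclear gene), the following translates a CDS string until the first stop codon. The names `CODON_TABLE` and `translate` are illustrative, not part of any database API.

```python
from itertools import product

# Standard genetic code, written in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks any unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 27 nt and last 12 nt of the CDS listed above.
print(translate("ATGGTGCGCACAAAAAAGTTGAGTTCC"))  # MVRTKKLSS
print(translate("CTGCTAAAATGA"))                 # LLK
```

Both fragments agree with the start (MVRTKKLSS) and end (LLK) of the protein sequence shown on this page. Passing the full CDS string to `translate` would allow the same check over the whole sequence.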
Homology
BLAST of HG10011785 vs. NCBI nr
Match: XP_038903242.1 (putative clathrin assembly protein At4g40080 [Benincasa hispida])

HSP 1 Score: 653.3 bits (1684), Expect = 1.2e-183
Identity = 335/362 (92.54%), Positives = 345/362 (95.30%), Query Frame = 0

Query: 1   MVRTKKLSSLIGLIKDKASQTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLS 60
           MV TK LSSLIGLIKDKASQ+KAALL KPNILSFQLALLRATTHDPHAPP EKHL VLLS
Sbjct: 1   MVSTKNLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPREKHLCVLLS 60

Query: 61  LGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKNGGFILQDQLSVFPFTGGR 120
           LGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHI+KNGGFILQDQLSVFPFTGGR
Sbjct: 61  LGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHIVKNGGFILQDQLSVFPFTGGR 120

Query: 121 NYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSSSSNEEKEKKAEQI 180
           NYLKLSDFRD+SNPISWELSSWVRWYAQYIETVLSISRILG FVGSS+SNEEKEKK EQI
Sbjct: 121 NYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSTSNEEKEKKTEQI 180

Query: 181 SGILNSDLLKETESLVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVA 240
           SGILNSDLLKETESLVGLIEETSKMPHCLHLNGNRL DKIYAFVGDDYLS +KEISIRV 
Sbjct: 181 SGILNSDLLKETESLVGLIEETSKMPHCLHLNGNRLADKIYAFVGDDYLSAMKEISIRVT 240

Query: 241 EFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKNLIGES 300
           EFHQRL CLSFGESVELVCALKRLEDCKEKQS GIS+KYEVL+DEFWGSIRETKNLIGES
Sbjct: 241 EFHQRLSCLSFGESVELVCALKRLEDCKEKQSKGISSKYEVLMDEFWGSIRETKNLIGES 300

Query: 301 KENREGGKLARTKSKMSDSGRFMERANAGSYRDSIRFGSERFDLTYKGFPVLGITESYFL 360
           KEN+EGGKLARTKS+MSDSGRFMERA AGSYRDS+RFGSERFDLT KGFPV G  ESYFL
Sbjct: 301 KENQEGGKLARTKSRMSDSGRFMERAIAGSYRDSLRFGSERFDLTCKGFPVPGTRESYFL 360

Query: 361 LK 363
           LK
Sbjct: 361 LK 362

BLAST of HG10011785 vs. NCBI nr
Match: KAG6577320.1 (putative clathrin assembly protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 601.3 bits (1549), Expect = 5.6e-168
Identity = 312/363 (85.95%), Positives = 329/363 (90.63%), Query Frame = 0

Query: 1   MVRTKKLSSLIGLIKDKASQTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLS 60
           MVRTKKLSSLIGLIKDKASQ+KAALL KPNILSFQLALLRATTHDPHAPP+ K LSVLLS
Sbjct: 1   MVRTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPTHKDLSVLLS 60

Query: 61  LGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKNGGFILQDQLSVFPFTGGR 120
           LGKTSRATAAAA+EVLMDRLQ+TQNSAVALKCLIA+HHIIKNG FILQDQLSVFPFTGGR
Sbjct: 61  LGKTSRATAAAALEVLMDRLQSTQNSAVALKCLIAIHHIIKNGDFILQDQLSVFPFTGGR 120

Query: 121 NYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILG-AFVGSSSSNEEKEKKAEQ 180
           NYLKLSDFRD+SNPISWELSSWVRWYAQYIETVL ISRILG  FVGSSSSN E+EKK EQ
Sbjct: 121 NYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLCISRILGFFFVGSSSSNAEREKKTEQ 180

Query: 181 ISGILNSDLLKETESLVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRV 240
           ISG LNSDLLKETESL+GLIEE SKMPHCLHLNGN LVDKIYAFVG+DYLS  KEIS RV
Sbjct: 181 ISGFLNSDLLKETESLMGLIEEVSKMPHCLHLNGNGLVDKIYAFVGEDYLSATKEISTRV 240

Query: 241 AEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKNLIGE 300
            EF QRLGCLSFGESVELVCALKRLEDCKEKQS GIS  +E+LL  FWGSIRE +NLIGE
Sbjct: 241 TEFRQRLGCLSFGESVELVCALKRLEDCKEKQSRGISGNHEILLKGFWGSIREIRNLIGE 300

Query: 301 SKENREGGKLARTKSKMSDSGRFMERANAGSYRDSIRFGSERFDLTYKGFPVLGITESYF 360
           SK+ RE GKL RTKS+MSDSGRFM++ NA  YR S+RFGSERFD T KG PVLGITESY 
Sbjct: 301 SKDPRETGKLGRTKSRMSDSGRFMDQDNAKLYRHSVRFGSERFDFTCKGIPVLGITESYL 360

Query: 361 LLK 363
           LLK
Sbjct: 361 LLK 363

BLAST of HG10011785 vs. NCBI nr
Match: KAG7015410.1 (putative clathrin assembly protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 599.7 bits (1545), Expect = 1.6e-167
Identity = 311/363 (85.67%), Positives = 328/363 (90.36%), Query Frame = 0

Query: 1   MVRTKKLSSLIGLIKDKASQTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLS 60
           MVRTKKLSSLIGLIKDKASQ+KAALL KPNILSFQLALLRATTHDPHAPP+ K LSVLLS
Sbjct: 1   MVRTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPTHKDLSVLLS 60

Query: 61  LGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKNGGFILQDQLSVFPFTGGR 120
           LGKTSRATAAAA+EVLMDRLQ+TQNSAVALKCLIA+HHIIKNG FILQDQLSVFPFTGGR
Sbjct: 61  LGKTSRATAAAALEVLMDRLQSTQNSAVALKCLIAIHHIIKNGDFILQDQLSVFPFTGGR 120

Query: 121 NYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILG-AFVGSSSSNEEKEKKAEQ 180
           NYLKLSDFRD+SNPISWELSSWVRWYAQYIETVL ISRILG  FVGSSSSN E+EKK EQ
Sbjct: 121 NYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLCISRILGFFFVGSSSSNAEREKKTEQ 180

Query: 181 ISGILNSDLLKETESLVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRV 240
           ISG  NSDLLKETESL+GLIEE SKMPHCLHLNGN LVDKIYAFVG+DYLS  KEIS RV
Sbjct: 181 ISGFFNSDLLKETESLMGLIEEVSKMPHCLHLNGNGLVDKIYAFVGEDYLSATKEISTRV 240

Query: 241 AEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKNLIGE 300
            EF QRLGCLSFGESVELVCALKRLEDCKEKQS GIS  +E+LL  FWGSIRE +NLIGE
Sbjct: 241 TEFRQRLGCLSFGESVELVCALKRLEDCKEKQSRGISGNHEILLKGFWGSIREIRNLIGE 300

Query: 301 SKENREGGKLARTKSKMSDSGRFMERANAGSYRDSIRFGSERFDLTYKGFPVLGITESYF 360
           SK+ RE GKL RTKS+MSDSGRFM++ NA  YR S+RFGSERFD T KG PVLGITESY 
Sbjct: 301 SKDPRETGKLGRTKSRMSDSGRFMDQDNAKLYRHSVRFGSERFDFTCKGIPVLGITESYL 360

Query: 361 LLK 363
           LLK
Sbjct: 361 LLK 363

BLAST of HG10011785 vs. NCBI nr
Match: XP_022929539.1 (putative clathrin assembly protein At4g40080 [Cucurbita moschata])

HSP 1 Score: 599.0 bits (1543), Expect = 2.8e-167
Identity = 311/363 (85.67%), Positives = 328/363 (90.36%), Query Frame = 0

Query: 1   MVRTKKLSSLIGLIKDKASQTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLS 60
           MVRTKKLSSLIGLIKDKASQ+KAALL KPNILSFQLALLRATTHDPHAPP+ K LSVLLS
Sbjct: 32  MVRTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPTHKDLSVLLS 91

Query: 61  LGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKNGGFILQDQLSVFPFTGGR 120
           LGKTSRATAAAA+EVLMDRLQ+TQNSAVALKCLIA+HHIIKNG FILQDQLSVFPFTGGR
Sbjct: 92  LGKTSRATAAAALEVLMDRLQSTQNSAVALKCLIAIHHIIKNGDFILQDQLSVFPFTGGR 151

Query: 121 NYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILG-AFVGSSSSNEEKEKKAEQ 180
           NYLKLSDFRD+SNPISWELSSWVRWYAQYIETVL ISRILG  FVGSSSSN E+EKK EQ
Sbjct: 152 NYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLCISRILGFFFVGSSSSNAEREKKTEQ 211

Query: 181 ISGILNSDLLKETESLVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRV 240
           ISG  NSDLLKETESL+GLIEE SKMPHCLHLNGN LVDKIYAFVG+DYLS  KEIS RV
Sbjct: 212 ISGFFNSDLLKETESLMGLIEEVSKMPHCLHLNGNGLVDKIYAFVGEDYLSATKEISNRV 271

Query: 241 AEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKNLIGE 300
            EF QRLGCLSFGESVELVCALKRLEDCKEKQS GIS  +E+LL  FWGSIRE +NLIGE
Sbjct: 272 TEFRQRLGCLSFGESVELVCALKRLEDCKEKQSRGISGNHEILLKGFWGSIREIRNLIGE 331

Query: 301 SKENREGGKLARTKSKMSDSGRFMERANAGSYRDSIRFGSERFDLTYKGFPVLGITESYF 360
           SK+ RE GKL RTKS+MSDSGRFM++ NA  YR S+RFGSERFD T KG PVLGITESY 
Sbjct: 332 SKDPRETGKLGRTKSRMSDSGRFMDQDNAKLYRHSVRFGSERFDFTCKGIPVLGITESYL 391

Query: 361 LLK 363
           LLK
Sbjct: 392 LLK 394

BLAST of HG10011785 vs. NCBI nr
Match: XP_023552000.1 (putative clathrin assembly protein At4g40080 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 597.4 bits (1539), Expect = 8.1e-167
Identity = 309/363 (85.12%), Positives = 329/363 (90.63%), Query Frame = 0

Query: 1   MVRTKKLSSLIGLIKDKASQTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLS 60
           MVRTKKLSSLIGLIKDKASQ+KAALL KPNI+SFQLALLRATTHDPHAPP+ K LSVLLS
Sbjct: 32  MVRTKKLSSLIGLIKDKASQSKAALLAKPNIVSFQLALLRATTHDPHAPPTHKDLSVLLS 91

Query: 61  LGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKNGGFILQDQLSVFPFTGGR 120
           LGKTSRATAAAA+EVLMDRLQ+TQNSAVALKCLIA+HHIIKNG FILQDQLSVFPFTGGR
Sbjct: 92  LGKTSRATAAAALEVLMDRLQSTQNSAVALKCLIAIHHIIKNGDFILQDQLSVFPFTGGR 151

Query: 121 NYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILG-AFVGSSSSNEEKEKKAEQ 180
           NYLKLSDFRD+SNPISWELSSWVRWYAQYIETVL ISRILG  FVGSSSSN E+EKK EQ
Sbjct: 152 NYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLCISRILGFFFVGSSSSNAEREKKTEQ 211

Query: 181 ISGILNSDLLKETESLVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRV 240
           ISG  NSDLLKETESL+GLIEE SK+PHCLHLNGN LVDKIYAFVG+DYLS  KEIS RV
Sbjct: 212 ISGFFNSDLLKETESLMGLIEEVSKIPHCLHLNGNGLVDKIYAFVGEDYLSATKEISTRV 271

Query: 241 AEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKNLIGE 300
            EF QRLGCLSFGESVELVCALKRLEDCKEKQS GIS  +E+LL  FWGSIRE +NLIGE
Sbjct: 272 TEFRQRLGCLSFGESVELVCALKRLEDCKEKQSRGISGNHEILLKGFWGSIREIRNLIGE 331

Query: 301 SKENREGGKLARTKSKMSDSGRFMERANAGSYRDSIRFGSERFDLTYKGFPVLGITESYF 360
           SK++RE GKL RTKS+MSDSGRFM++ NA  YR S+RFGSERFD T KG PVLGITESY 
Sbjct: 332 SKDHRETGKLDRTKSRMSDSGRFMDQDNAKLYRHSVRFGSERFDFTCKGIPVLGITESYL 391

Query: 361 LLK 363
           LLK
Sbjct: 392 LLK 394

BLAST of HG10011785 vs. ExPASy Swiss-Prot
Match: Q8L936 (Putative clathrin assembly protein At4g40080 OS=Arabidopsis thaliana OX=3702 GN=At4g40080 PE=2 SV=2)

HSP 1 Score: 333.6 bits (854), Expect = 2.8e-90
Identity = 187/364 (51.37%), Positives = 246/364 (67.58%), Query Frame = 0

Query: 1   MVRTKKLSSLIGLIKDKASQTKAALL---TKPNILSFQLALLRATTHDPHAPPSEKHLSV 60
           M R    + LIG IKDKASQ+KAAL+   TK   LSF L++LRATTHDP  PP  +HL+V
Sbjct: 1   MGRITSFADLIGRIKDKASQSKAALVSSNTKSKTLSFHLSVLRATTHDPSTPPGNRHLAV 60

Query: 61  LLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKNGGFILQDQLSVFPFT 120
           +LS G  SRATA++AVE +M+RL TT ++ VALK LI +HHI+K+G FILQDQLSVFP +
Sbjct: 61  ILSAGTGSRATASSAVESIMERLHTTGDACVALKSLIIIHHIVKHGRFILQDQLSVFPAS 120

Query: 121 GGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSSSSNEEKEKKA 180
           GGRNYLKLS FRD  +P+ WELSSWVRWYA Y+E +LS SRI+G F+ S+SS   KE+  
Sbjct: 121 GGRNYLKLSAFRDEKSPLMWELSSWVRWYALYLEHLLSTSRIMGFFISSTSSTIHKEEYE 180

Query: 181 EQISGILNSDLLKETESLVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISI 240
           E +S + NSDLL+E ++LVGL+EE  K+P      G  L DKI   VG+DY+S I E+  
Sbjct: 181 EMVSSLTNSDLLREIDALVGLLEEACKIPDLPFSGGKSLADKITQLVGEDYVSSINELYT 240

Query: 241 RVAEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYE-VLLDEFWGSIRETKNL 300
           R  EF +R   LSFG+++ELVCALKRLE CKE+ S      ++   +D FWG + E K +
Sbjct: 241 RFNEFKERSNTLSFGDTIELVCALKRLESCKERLSEICHGNWKRGWIDGFWGLVLEVKGI 300

Query: 301 IGESKENREGGKLART------KSKMSDSGRFMERANAGSYRDSIRFGSERF-DLTYKGF 354
           IG  ++N   G++ ++      + K  +S RF +R   G Y + +RF S RF ++    F
Sbjct: 301 IGNLEDNY--GQIEKSIVGFGKRDKGYESARFTDRLIIG-YSNPVRFSSGRFSNVDRFNF 360
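The Swiss-Prot and TrEMBL match descriptions pack several fields into one string using two-letter keys (`OS`, `OX`, `GN`, `PE`, `SV`). As a rough sketch, assuming the keys always take the `XX=value` form seen on this page, they can be separated with a regular expression. `parse_uniprot_description` is a hypothetical helper written for illustration.

```python
import re


def parse_uniprot_description(desc: str):
    """Split a UniProt-style description into the protein name and its XX=value fields."""
    first_key = re.search(r"\s[A-Z]{2}=", desc)
    name = desc[:first_key.start()] if first_key else desc
    fields = {
        key: value.strip()
        for key, value in re.findall(r"([A-Z]{2})=(.*?)(?=\s[A-Z]{2}=|$)", desc)
    }
    return name.strip(), fields


name, fields = parse_uniprot_description(
    "Putative clathrin assembly protein At4g40080 "
    "OS=Arabidopsis thaliana OX=3702 GN=At4g40080 PE=2 SV=2"
)
print(name)
print(fields["OS"], fields["OX"], fields["GN"])
```

For the Q8L936 match above this separates the protein name from the organism (`OS`), taxonomy identifier (`OX`) and gene name (`GN`), which makes hits easier to group by species.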

BLAST of HG10011785 vs. ExPASy Swiss-Prot
Match: Q9FKQ2 (Putative clathrin assembly protein At5g65370 OS=Arabidopsis thaliana OX=3702 GN=At5g65370 PE=3 SV=1)

HSP 1 Score: 146.4 bits (368), Expect = 6.4e-34
Identity = 99/301 (32.89%), Positives = 164/301 (54.49%), Query Frame = 0

Query: 6   KLSSLIGLIKDKASQTK---AALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLG 65
           KL++L G++KD+ASQ K     L +  N  +  LALL+AT+H  + PPS+K+++ L S  
Sbjct: 3   KLATLNGILKDEASQMKLNVVHLCSSVNAKTIDLALLKATSHTSNNPPSDKYVTFLQSTI 62

Query: 66  KTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKN-GGFILQDQL------SVFP 125
            T        V+ ++ RL+ T +  VA KCLI +H ++K+  G+  +D L          
Sbjct: 63  DT--CYGPDTVDAILHRLRVTTDVCVAAKCLILLHKMVKSESGYNGEDSLRNNINHRTLI 122

Query: 126 FTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSSSSNEEKEK 185
           +T G + LKL+D   NS+  + EL+ WV+WY QY++  LSI+ +LG        NE+K  
Sbjct: 123 YTQGGSNLKLNDLNVNSSRFTRELTPWVQWYKQYLDCYLSIAEVLGITPNIKEKNEDKRL 182

Query: 186 KAEQISGILNSDLLKETESLVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEI 245
           + +++S      +LK+ + LV L E  S  P       N++V ++   +  DY S I+ +
Sbjct: 183 ETQRVSSYPMDCILKQIDFLVELFEHISDRPKAPQSKLNKIVIEMTELMVQDYFSAIRLM 242

Query: 246 SIRVAEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKN 297
            IR  E + R+      +  ELV  L++LE+CKE  S   S + + L+ +FW  + + K+
Sbjct: 243 RIRFEELNVRV-----AKPNELVPVLEKLENCKEGLS-EFSWRSKYLIADFWYLVSKLKD 295

BLAST of HG10011785 vs. ExPASy Swiss-Prot
Match: Q8H0W9 (Putative clathrin assembly protein At5g10410 OS=Arabidopsis thaliana OX=3702 GN=At5g10410 PE=2 SV=2)

HSP 1 Score: 140.2 bits (352), Expect = 4.6e-32
Identity = 96/309 (31.07%), Positives = 166/309 (53.72%), Query Frame = 0

Query: 10  LIGLIKDKASQTKAALL---TKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSR 69
           +IG  KDKAS  KA L+       +    LALL++TT  P+ PP+  ++S ++S   +  
Sbjct: 8   IIGKFKDKASIGKARLVHSFGSTAVKYIHLALLKSTTRTPNKPPNSDYVSAVISYSNSRY 67

Query: 70  ATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKNGGFILQDQLSVFPFTGGRNYLKLS 129
           A AA +  +   RL+ T+N+ VA K LI +H +IK+     +D+        GRN LKL+
Sbjct: 68  APAAFSAALW--RLRVTKNAIVATKSLIVIHKLIKSS----RDKFE--GLGHGRNNLKLN 127

Query: 130 DFRDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSSSSNEEKEKKAEQISGILNS 189
           +F D S+ ++ ELS W+RWY QY++ +  + ++LG+F     + ++K ++ +++S     
Sbjct: 128 EFSDKSSNLTLELSQWIRWYGQYLDRLSWVPKVLGSFPNLLVNPKDKVEEKDRVSSYQTG 187

Query: 190 DLLKETESLVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVAEFHQRL 249
            ++++T+SLV   E     P    +  N++VD+I   V +DY  +++ + +R+    +RL
Sbjct: 188 YIIRQTDSLVSFFEHICTRPEIPPMFQNKIVDEIRELVIEDYFKIVRLVMVRLQVLFERL 247

Query: 250 ---GCLSFGE--SVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKNLIGESK 309
              G    G+    +    L RL +CKE  S G+  +   L D+FW  + E      E K
Sbjct: 248 IKPGVKPIGDLGLNDFSLLLVRLVECKESLS-GLFWRCRRLADDFW-CLVEMLKAETEKK 306

Query: 310 ENREGGKLA 311
            N++  +LA
Sbjct: 308 NNKQMIELA 306

BLAST of HG10011785 vs. ExPASy Swiss-Prot
Match: Q8LBH2 (Putative clathrin assembly protein At2g01600 OS=Arabidopsis thaliana OX=3702 GN=At2g01600 PE=2 SV=2)

HSP 1 Score: 83.6 bits (205), Expect = 5.1e-15
Identity = 60/194 (30.93%), Positives = 101/194 (52.06%), Query Frame = 0

Query: 12  GLIKDKASQTKAALL-TKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATA- 71
           G +KD    TK  L+          +A+++AT H    PP ++HL  + +    +RA A 
Sbjct: 12  GALKD---STKVGLVRVNSEYADLDVAIVKATNH-VECPPKDRHLRKIFAATSVTRARAD 71

Query: 72  -AAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKNGGFILQDQLSVFPFTGGRNYLKLSDF 131
            A  +  L  RL  T+N  VALK LI +H +++ G    +++L  F   G    L+LS+F
Sbjct: 72  VAYCIHALSRRLHKTRNWTVALKTLIVIHRLLREGDPTFREELLNFSQRG--RILQLSNF 131

Query: 132 RDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSS---SSNEEKEKKAEQISGILN 191
           +D+S+PI+W+ S+WVR YA ++E  L   R+L     +     SN  ++K   +   +  
Sbjct: 132 KDDSSPIAWDCSAWVRTYALFLEERLECFRVLKYDTEAERLPKSNPGQDKGYSRTRDLDG 191

Query: 192 SDLLKETESLVGLI 200
            +LL++  +L  L+
Sbjct: 192 EELLEQLPALQQLL 199

BLAST of HG10011785 vs. ExPASy Swiss-Prot
Match: Q8LF20 (Putative clathrin assembly protein At2g25430 OS=Arabidopsis thaliana OX=3702 GN=At2g25430 PE=1 SV=2)

HSP 1 Score: 81.3 bits (199), Expect = 2.5e-14
Identity = 52/149 (34.90%), Positives = 87/149 (58.39%), Query Frame = 0

Query: 11  IGLIKDKAS--QTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRAT 70
           IG +KD+ S    K A    P++   ++A+++AT+HD   P SEK++  +L+L   SR  
Sbjct: 9   IGAVKDQTSIGIAKVASNMAPDL---EVAIVKATSHDDD-PASEKYIREILNLTSLSRGY 68

Query: 71  AAAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKNGGFILQDQLSVFPFTGGRNYLKLSDF 130
             A V  +  RL  T++  VALK L+ VH ++  G  I Q+++ ++    G   L +SDF
Sbjct: 69  ILACVTSVSRRLSKTRDWVVALKALMLVHRLLNEGDPIFQEEI-LYSTRRGTRMLNMSDF 128

Query: 131 RDNSNPISWELSSWVRWYAQYIETVLSIS 158
           RD ++  SW+ S++VR YA Y++  L ++
Sbjct: 129 RDEAHSSSWDHSAFVRTYAGYLDQRLELA 152

BLAST of HG10011785 vs. ExPASy TrEMBL
Match: A0A6J1EP16 (putative clathrin assembly protein At4g40080 OS=Cucurbita moschata OX=3662 GN=LOC111436077 PE=4 SV=1)

HSP 1 Score: 599.0 bits (1543), Expect = 1.3e-167
Identity = 311/363 (85.67%), Positives = 328/363 (90.36%), Query Frame = 0

Query: 1   MVRTKKLSSLIGLIKDKASQTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLS 60
           MVRTKKLSSLIGLIKDKASQ+KAALL KPNILSFQLALLRATTHDPHAPP+ K LSVLLS
Sbjct: 32  MVRTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPTHKDLSVLLS 91

Query: 61  LGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKNGGFILQDQLSVFPFTGGR 120
           LGKTSRATAAAA+EVLMDRLQ+TQNSAVALKCLIA+HHIIKNG FILQDQLSVFPFTGGR
Sbjct: 92  LGKTSRATAAAALEVLMDRLQSTQNSAVALKCLIAIHHIIKNGDFILQDQLSVFPFTGGR 151

Query: 121 NYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILG-AFVGSSSSNEEKEKKAEQ 180
           NYLKLSDFRD+SNPISWELSSWVRWYAQYIETVL ISRILG  FVGSSSSN E+EKK EQ
Sbjct: 152 NYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLCISRILGFFFVGSSSSNAEREKKTEQ 211

Query: 181 ISGILNSDLLKETESLVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRV 240
           ISG  NSDLLKETESL+GLIEE SKMPHCLHLNGN LVDKIYAFVG+DYLS  KEIS RV
Sbjct: 212 ISGFFNSDLLKETESLMGLIEEVSKMPHCLHLNGNGLVDKIYAFVGEDYLSATKEISNRV 271

Query: 241 AEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKNLIGE 300
            EF QRLGCLSFGESVELVCALKRLEDCKEKQS GIS  +E+LL  FWGSIRE +NLIGE
Sbjct: 272 TEFRQRLGCLSFGESVELVCALKRLEDCKEKQSRGISGNHEILLKGFWGSIREIRNLIGE 331

Query: 301 SKENREGGKLARTKSKMSDSGRFMERANAGSYRDSIRFGSERFDLTYKGFPVLGITESYF 360
           SK+ RE GKL RTKS+MSDSGRFM++ NA  YR S+RFGSERFD T KG PVLGITESY 
Sbjct: 332 SKDPRETGKLGRTKSRMSDSGRFMDQDNAKLYRHSVRFGSERFDFTCKGIPVLGITESYL 391

Query: 361 LLK 363
           LLK
Sbjct: 392 LLK 394

BLAST of HG10011785 vs. ExPASy TrEMBL
Match: A0A6J1JCT4 (putative clathrin assembly protein At4g40080 OS=Cucurbita maxima OX=3661 GN=LOC111483252 PE=4 SV=1)

HSP 1 Score: 588.2 bits (1515), Expect = 2.4e-164
Identity = 306/363 (84.30%), Positives = 323/363 (88.98%), Query Frame = 0

Query: 1   MVRTKKLSSLIGLIKDKASQTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLS 60
           MV TKKLSSLIGLIKDKASQ+KAALL KPNILSFQLALLRATTHDPHAPP  K LSVLLS
Sbjct: 37  MVSTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPIHKDLSVLLS 96

Query: 61  LGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKNGGFILQDQLSVFPFTGGR 120
            GKTSRATAAAA+EVLMDRLQ+TQNSAVALKCLIA+HHI+KNG FILQDQLSVFPFTGGR
Sbjct: 97  FGKTSRATAAAALEVLMDRLQSTQNSAVALKCLIAIHHIVKNGDFILQDQLSVFPFTGGR 156

Query: 121 NYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILG-AFVGSSSSNEEKEKKAEQ 180
           NYLKLSDFRD+SNPISWELSSWVRWYAQYIETVL ISRILG  FVGSSSSN E+EKK EQ
Sbjct: 157 NYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLCISRILGFFFVGSSSSNAEREKKTEQ 216

Query: 181 ISGILNSDLLKETESLVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRV 240
           ISG  NSDLLKETESL+GLIEE SKMPHCLHLNGN LVDKIYAFVG+DYLS  KEIS RV
Sbjct: 217 ISGFFNSDLLKETESLMGLIEEVSKMPHCLHLNGNGLVDKIYAFVGEDYLSATKEISTRV 276

Query: 241 AEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKNLIGE 300
            EF  RLGCLSFGESVELVCALKRLEDCKEKQS GIS  +E+LL  FWGSIRE +NLIGE
Sbjct: 277 TEFRHRLGCLSFGESVELVCALKRLEDCKEKQSRGISGNHEILLKGFWGSIREIRNLIGE 336

Query: 301 SKENREGGKLARTKSKMSDSGRFMERANAGSYRDSIRFGSERFDLTYKGFPVLGITESYF 360
           SK+ RE GKL RTKS+MSDSGRFM++ NA   R S+RFGSERFD T KG PVLGITESY 
Sbjct: 337 SKDLRETGKLGRTKSRMSDSGRFMDQDNAKLDRHSVRFGSERFDFTCKGIPVLGITESYL 396

Query: 361 LLK 363
           LLK
Sbjct: 397 LLK 399

BLAST of HG10011785 vs. ExPASy TrEMBL
Match: A0A0A0KXU4 (ENTH domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G099230 PE=4 SV=1)

HSP 1 Score: 587.0 bits (1512), Expect = 5.3e-164
Identity = 306/352 (86.93%), Positives = 320/352 (90.91%), Query Frame = 0

Query: 1   MVRTKKLSSLIGLIKDKASQTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLS 60
           MV TKKLSSLIGLIKDKASQ+KAALL KPNILSFQLALLRATTHD HAPPS+KHLS LLS
Sbjct: 9   MVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLS 68

Query: 61  LGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKNGGFILQDQLSVFPFTGGR 120
           LGKTSRATAA AVEVLMDRLQTT NSAVALKCLIAVHHI K+G FILQDQLSVFPFTGGR
Sbjct: 69  LGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIFKDGDFILQDQLSVFPFTGGR 128

Query: 121 NYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSSSSNEEKEKKAEQI 180
           NYLKLSDFRD+SNPISW+LSSWVRWYAQYIETVLSISRILG FVGSS SNEEKE+K EQI
Sbjct: 129 NYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQI 188

Query: 181 SGILNSDLLKETESLVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVA 240
           SGILNSDLLKETESLVGLIEE SKMPHCLHLN NRLVDKIY+FVGDDYLS +KEISIRV 
Sbjct: 189 SGILNSDLLKETESLVGLIEEISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVT 248

Query: 241 EFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIR---ETKNLI 300
           EFH RLG LSF ESVELVCALKRLEDCKEKQSMGI AKYEVL+D  WGSIR   ETKNL 
Sbjct: 249 EFHHRLGWLSFAESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRSIQETKNLT 308

Query: 301 GESKENREGGKLARTKSKMSDSGRFMERANAGSYRDSIRFGSERFDLTYKGF 350
           GESKE+REGGKL +TK ++SDSGRFMER NA SYRD +RFGSERF LTY GF
Sbjct: 309 GESKEHREGGKLCKTKRRVSDSGRFMERPNASSYRDLLRFGSERFVLTYDGF 360

BLAST of HG10011785 vs. ExPASy TrEMBL
Match: A0A5A7TT50 (Putative clathrin assembly protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold316G00710 PE=4 SV=1)

HSP 1 Score: 575.5 bits (1482), Expect = 1.6e-160
Identity = 295/349 (84.53%), Positives = 318/349 (91.12%), Query Frame = 0

Query: 1   MVRTKKLSSLIGLIKDKASQTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLS 60
           M+ TK+LSSLIGLIKDKASQ+KAALL KPNILSFQLALLRATTHDPHAPPS+KHLS LLS
Sbjct: 1   MMNTKRLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSDKHLSALLS 60

Query: 61  LGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKNGGFILQDQLSVFPFTGGR 120
           LGKTSRATAAAAVEVLMDRLQTT NSAVALKCLIAVHHI KNGGFILQDQLSVFPFTGGR
Sbjct: 61  LGKTSRATAAAAVEVLMDRLQTTHNSAVALKCLIAVHHIFKNGGFILQDQLSVFPFTGGR 120

Query: 121 NYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSSSSNEEKEKKAEQI 180
           NYLKLSDFRD+SNPISWELSSWVRWYAQYIETVLSISR LG  VGSSSSNEE E+K EQI
Sbjct: 121 NYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRNLGFIVGSSSSNEEMERKTEQI 180

Query: 181 SGILNSDLLKETESLVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVA 240
           SGI NS+LLK+TESLVGLIEE SKMP CLHLN NRLVDKIY FVGDDYL+ +K+ISIRV 
Sbjct: 181 SGIWNSELLKDTESLVGLIEEISKMPPCLHLNRNRLVDKIYGFVGDDYLAAMKDISIRVT 240

Query: 241 EFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKNLIGES 300
           EFH RLGCLSFGESVELVCALKRL+DCKEKQSMGI A+YEVL+D FW SIRETKNLIG S
Sbjct: 241 EFHHRLGCLSFGESVELVCALKRLDDCKEKQSMGIFARYEVLMDGFWSSIRETKNLIGAS 300

Query: 301 KENREGGKLARTKSKMSDSGRFMERANAGSYRDSIRFGSERFDLTYKGF 350
           KENR+G KL++ + ++SDSGRF+ER+NA SY D + F SERF LTYKGF
Sbjct: 301 KENRDGCKLSQMERRISDSGRFIERSNASSYCDVLPFRSERFGLTYKGF 349

BLAST of HG10011785 vs. ExPASy TrEMBL
Match: A0A1S3C1C0 (putative clathrin assembly protein At4g40080 OS=Cucumis melo OX=3656 GN=LOC103495758 PE=4 SV=1)

HSP 1 Score: 573.9 bits (1478), Expect = 4.6e-160
Identity = 295/349 (84.53%), Positives = 318/349 (91.12%), Query Frame = 0

Query: 1   MVRTKKLSSLIGLIKDKASQTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLS 60
           M+ TK+LSSLIGLIKDKASQ+KAALL KPNILSFQLALLRATTHDPHAPPS+KHLS LLS
Sbjct: 1   MMNTKRLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSDKHLSALLS 60

Query: 61  LGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKNGGFILQDQLSVFPFTGGR 120
           LGKTSRATAAAAVEVLMDRLQTT NSAVALKCLIAVHHI KNGGFILQDQLSVFPFTGGR
Sbjct: 61  LGKTSRATAAAAVEVLMDRLQTTHNSAVALKCLIAVHHIFKNGGFILQDQLSVFPFTGGR 120

Query: 121 NYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSSSSNEEKEKKAEQI 180
           NYLKLSDFRD+SNPISWELSSWVRWYAQYIETVLSISRILG  VGSSSSNEE E+K EQI
Sbjct: 121 NYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFIVGSSSSNEEMERKTEQI 180

Query: 181 SGILNSDLLKETESLVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVA 240
           SGI NS+LLK+TESLVGLIEE SKMP CLHLN NRLVDKIY FVGDDYL+ +KEISIRV 
Sbjct: 181 SGIWNSELLKDTESLVGLIEEISKMPPCLHLNRNRLVDKIYGFVGDDYLAAMKEISIRVT 240

Query: 241 EFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKNLIGES 300
           EFH RLGCLSFGESVELVCALKRL+D KEKQS+GI A+YEVL+D FW SIRETKNLIG S
Sbjct: 241 EFHHRLGCLSFGESVELVCALKRLDDFKEKQSLGIFARYEVLMDGFWSSIRETKNLIGAS 300

Query: 301 KENREGGKLARTKSKMSDSGRFMERANAGSYRDSIRFGSERFDLTYKGF 350
           KENR+G KL++ + ++SDSGRF+ER+NA SY D + F SERF LTYKGF
Sbjct: 301 KENRDGCKLSQMERRISDSGRFIERSNASSYCDVLPFRSERFGLTYKGF 349

BLAST of HG10011785 vs. TAIR 10
Match: AT4G40080.1 (ENTH/ANTH/VHS superfamily protein )

HSP 1 Score: 333.6 bits (854), Expect = 2.0e-91
Identity = 187/364 (51.37%), Positives = 246/364 (67.58%), Query Frame = 0

Query: 1   MVRTKKLSSLIGLIKDKASQTKAALL---TKPNILSFQLALLRATTHDPHAPPSEKHLSV 60
           M R    + LIG IKDKASQ+KAAL+   TK   LSF L++LRATTHDP  PP  +HL+V
Sbjct: 1   MGRITSFADLIGRIKDKASQSKAALVSSNTKSKTLSFHLSVLRATTHDPSTPPGNRHLAV 60

Query: 61  LLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKNGGFILQDQLSVFPFT 120
           +LS G  SRATA++AVE +M+RL TT ++ VALK LI +HHI+K+G FILQDQLSVFP +
Sbjct: 61  ILSAGTGSRATASSAVESIMERLHTTGDACVALKSLIIIHHIVKHGRFILQDQLSVFPAS 120

Query: 121 GGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSSSSNEEKEKKA 180
           GGRNYLKLS FRD  +P+ WELSSWVRWYA Y+E +LS SRI+G F+ S+SS   KE+  
Sbjct: 121 GGRNYLKLSAFRDEKSPLMWELSSWVRWYALYLEHLLSTSRIMGFFISSTSSTIHKEEYE 180

Query: 181 EQISGILNSDLLKETESLVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISI 240
           E +S + NSDLL+E ++LVGL+EE  K+P      G  L DKI   VG+DY+S I E+  
Sbjct: 181 EMVSSLTNSDLLREIDALVGLLEEACKIPDLPFSGGKSLADKITQLVGEDYVSSINELYT 240

Query: 241 RVAEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYE-VLLDEFWGSIRETKNL 300
           R  EF +R   LSFG+++ELVCALKRLE CKE+ S      ++   +D FWG + E K +
Sbjct: 241 RFNEFKERSNTLSFGDTIELVCALKRLESCKERLSEICHGNWKRGWIDGFWGLVLEVKGI 300

Query: 301 IGESKENREGGKLART------KSKMSDSGRFMERANAGSYRDSIRFGSERF-DLTYKGF 354
           IG  ++N   G++ ++      + K  +S RF +R   G Y + +RF S RF ++    F
Sbjct: 301 IGNLEDNY--GQIEKSIVGFGKRDKGYESARFTDRLIIG-YSNPVRFSSGRFSNVDRFNF 360

BLAST of HG10011785 vs. TAIR 10
Match: AT5G65370.1 (ENTH/ANTH/VHS superfamily protein )

HSP 1 Score: 146.4 bits (368), Expect = 4.6e-35
Identity = 99/301 (32.89%), Positives = 164/301 (54.49%), Query Frame = 0

Query: 6   KLSSLIGLIKDKASQTK---AALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLG 65
           KL++L G++KD+ASQ K     L +  N  +  LALL+AT+H  + PPS+K+++ L S  
Sbjct: 3   KLATLNGILKDEASQMKLNVVHLCSSVNAKTIDLALLKATSHTSNNPPSDKYVTFLQSTI 62

Query: 66  KTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKN-GGFILQDQL------SVFP 125
            T        V+ ++ RL+ T +  VA KCLI +H ++K+  G+  +D L          
Sbjct: 63  DT--CYGPDTVDAILHRLRVTTDVCVAAKCLILLHKMVKSESGYNGEDSLRNNINHRTLI 122

Query: 126 FTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSSSSNEEKEK 185
           +T G + LKL+D   NS+  + EL+ WV+WY QY++  LSI+ +LG        NE+K  
Sbjct: 123 YTQGGSNLKLNDLNVNSSRFTRELTPWVQWYKQYLDCYLSIAEVLGITPNIKEKNEDKRL 182

Query: 186 KAEQISGILNSDLLKETESLVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEI 245
           + +++S      +LK+ + LV L E  S  P       N++V ++   +  DY S I+ +
Sbjct: 183 ETQRVSSYPMDCILKQIDFLVELFEHISDRPKAPQSKLNKIVIEMTELMVQDYFSAIRLM 242

Query: 246 SIRVAEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKN 297
            IR  E + R+      +  ELV  L++LE+CKE  S   S + + L+ +FW  + + K+
Sbjct: 243 RIRFEELNVRV-----AKPNELVPVLEKLENCKEGLS-EFSWRSKYLIADFWYLVSKLKD 295

BLAST of HG10011785 vs. TAIR 10
Match: AT5G10410.1 (ENTH/ANTH/VHS superfamily protein )

HSP 1 Score: 140.2 bits (352), Expect = 3.3e-33
Identity = 96/309 (31.07%), Positives = 166/309 (53.72%), Query Frame = 0

Query: 10  LIGLIKDKASQTKAALL---TKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSR 69
           +IG  KDKAS  KA L+       +    LALL++TT  P+ PP+  ++S ++S   +  
Sbjct: 8   IIGKFKDKASIGKARLVHSFGSTAVKYIHLALLKSTTRTPNKPPNSDYVSAVISYSNSRY 67

Query: 70  ATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKNGGFILQDQLSVFPFTGGRNYLKLS 129
           A AA +  +   RL+ T+N+ VA K LI +H +IK+     +D+        GRN LKL+
Sbjct: 68  APAAFSAALW--RLRVTKNAIVATKSLIVIHKLIKSS----RDKFE--GLGHGRNNLKLN 127

Query: 130 DFRDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSSSSNEEKEKKAEQISGILNS 189
           +F D S+ ++ ELS W+RWY QY++ +  + ++LG+F     + ++K ++ +++S     
Sbjct: 128 EFSDKSSNLTLELSQWIRWYGQYLDRLSWVPKVLGSFPNLLVNPKDKVEEKDRVSSYQTG 187

Query: 190 DLLKETESLVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVAEFHQRL 249
            ++++T+SLV   E     P    +  N++VD+I   V +DY  +++ + +R+    +RL
Sbjct: 188 YIIRQTDSLVSFFEHICTRPEIPPMFQNKIVDEIRELVIEDYFKIVRLVMVRLQVLFERL 247

Query: 250 ---GCLSFGE--SVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKNLIGESK 309
              G    G+    +    L RL +CKE  S G+  +   L D+FW  + E      E K
Sbjct: 248 IKPGVKPIGDLGLNDFSLLLVRLVECKESLS-GLFWRCRRLADDFW-CLVEMLKAETEKK 306

Query: 310 ENREGGKLA 311
            N++  +LA
Sbjct: 308 NNKQMIELA 306

BLAST of HG10011785 vs. TAIR 10
Match: AT2G01600.1 (ENTH/ANTH/VHS superfamily protein )

HSP 1 Score: 83.6 bits (205), Expect = 3.6e-16
Identity = 60/194 (30.93%), Positives = 101/194 (52.06%), Query Frame = 0

Query: 12  GLIKDKASQTKAALL-TKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATA- 71
           G +KD    TK  L+          +A+++AT H    PP ++HL  + +    +RA A 
Sbjct: 12  GALKD---STKVGLVRVNSEYADLDVAIVKATNH-VECPPKDRHLRKIFAATSVTRARAD 71

Query: 72  -AAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKNGGFILQDQLSVFPFTGGRNYLKLSDF 131
            A  +  L  RL  T+N  VALK LI +H +++ G    +++L  F   G    L+LS+F
Sbjct: 72  VAYCIHALSRRLHKTRNWTVALKTLIVIHRLLREGDPTFREELLNFSQRG--RILQLSNF 131

Query: 132 RDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSS---SSNEEKEKKAEQISGILN 191
           +D+S+PI+W+ S+WVR YA ++E  L   R+L     +     SN  ++K   +   +  
Sbjct: 132 KDDSSPIAWDCSAWVRTYALFLEERLECFRVLKYDTEAERLPKSNPGQDKGYSRTRDLDG 191

Query: 192 SDLLKETESLVGLI 200
            +LL++  +L  L+
Sbjct: 192 EELLEQLPALQQLL 199

BLAST of HG10011785 vs. TAIR 10
Match: AT2G25430.1 (epsin N-terminal homology (ENTH) domain-containing protein / clathrin assembly protein-related)

HSP 1 Score: 81.3 bits (199), Expect = 1.8e-15
Identity = 52/149 (34.90%), Positives = 87/149 (58.39%), Query Frame = 0

Query: 11  IGLIKDKAS--QTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRAT 70
           IG +KD+ S    K A    P++   ++A+++AT+HD   P SEK++  +L+L   SR  
Sbjct: 9   IGAVKDQTSIGIAKVASNMAPDL---EVAIVKATSHDDD-PASEKYIREILNLTSLSRGY 68

Query: 71  AAAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKNGGFILQDQLSVFPFTGGRNYLKLSDF 130
             A V  +  RL  T++  VALK L+ VH ++  G  I Q+++ ++    G   L +SDF
Sbjct: 69  ILACVTSVSRRLSKTRDWVVALKALMLVHRLLNEGDPIFQEEI-LYSTRRGTRMLNMSDF 128

Query: 131 RDNSNPISWELSSWVRWYAQYIETVLSIS 158
           RD ++  SW+ S++VR YA Y++  L ++
Sbjct: 129 RDEAHSSSWDHSAFVRTYAGYLDQRLELA 152

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_038903242.1  1.2e-183  92.54         putative clathrin assembly protein At4g40080 [Benincasa hispida]
KAG6577320.1    5.6e-168  85.95         putative clathrin assembly protein, partial [Cucurbita argyrosperma subsp. soror...
KAG7015410.1    1.6e-167  85.67         putative clathrin assembly protein, partial [Cucurbita argyrosperma subsp. argyr...
XP_022929539.1  2.8e-167  85.67         putative clathrin assembly protein At4g40080 [Cucurbita moschata]
XP_023552000.1  8.1e-167  85.12         putative clathrin assembly protein At4g40080 [Cucurbita pepo subsp. pepo]

Match Name      E-value   Identity (%)  Description
Q8L936          2.8e-90   51.37         Putative clathrin assembly protein At4g40080 OS=Arabidopsis thaliana OX=3702 GN=...
Q9FKQ2          6.4e-34   32.89         Putative clathrin assembly protein At5g65370 OS=Arabidopsis thaliana OX=3702 GN=...
Q8H0W9          4.6e-32   31.07         Putative clathrin assembly protein At5g10410 OS=Arabidopsis thaliana OX=3702 GN=...
Q8LBH2          5.1e-15   30.93         Putative clathrin assembly protein At2g01600 OS=Arabidopsis thaliana OX=3702 GN=...
Q8LF20          2.5e-14   34.90         Putative clathrin assembly protein At2g25430 OS=Arabidopsis thaliana OX=3702 GN=...

Match Name      E-value   Identity (%)  Description
A0A6J1EP16      1.3e-167  85.67         putative clathrin assembly protein At4g40080 OS=Cucurbita moschata OX=3662 GN=LO...
A0A6J1JCT4      2.4e-164  84.30         putative clathrin assembly protein At4g40080 OS=Cucurbita maxima OX=3661 GN=LOC1...
A0A0A0KXU4      5.3e-164  86.93         ENTH domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G099230 PE=4 S...
A0A5A7TT50      1.6e-160  84.53         Putative clathrin assembly protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C...
A0A1S3C1C0      4.6e-160  84.53         putative clathrin assembly protein At4g40080 OS=Cucumis melo OX=3656 GN=LOC10349...

Match Name      E-value   Identity (%)  Description
AT4G40080.1     2.0e-91   51.37         ENTH/ANTH/VHS superfamily protein
AT5G65370.1     4.6e-35   32.89         ENTH/ANTH/VHS superfamily protein
AT5G10410.1     3.3e-33   31.07         ENTH/ANTH/VHS superfamily protein
AT2G01600.1     3.6e-16   30.93         ENTH/ANTH/VHS superfamily protein
AT2G25430.1     1.8e-15   34.90         epsin N-terminal homology (ENTH) domain-containing protein / clathrin assembly p...

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                          Source       Source Term     Source Description         Alignment
IPR013809  ENTH domain                              SMART        SM00273         enth_2                     coord: 32..164; e-value: 3.8E-9; score: 46.4
IPR013809  ENTH domain                              PROSITE      PS50942         ENTH                       coord: 26..164; score: 17.544388
IPR008942  ENTH/VHS                                 GENE3D       1.25.40.90      -                          coord: 6..162; e-value: 5.3E-31; score: 109.3
IPR008942  ENTH/VHS                                 SUPERFAMILY  48464           ENTH/VHS domain            coord: 34..161
IPR011417  AP180 N-terminal homology (ANTH) domain  PFAM         PF07651         ANTH                       coord: 36..204; e-value: 9.1E-18; score: 64.2
None       No IPR available                         PANTHER      PTHR22951       CLATHRIN ASSEMBLY PROTEIN  coord: 5..345
None       No IPR available                         PANTHER      PTHR22951:SF76  OS09G0468150 PROTEIN       coord: 5..345
None       No IPR available                         CDD          cd16987         ANTH_N_AP180_plant         coord: 34..157; e-value: 1.1381E-52; score: 168.571

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10011785.1  HG10011785.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0072583      clathrin-dependent endocytosis
biological_process  GO:0006900      vesicle budding from membrane
cellular_component  GO:0005905      clathrin-coated pit
cellular_component  GO:0030136      clathrin-coated vesicle
cellular_component  GO:0005794      Golgi apparatus
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0005545      1-phosphatidylinositol binding
molecular_function  GO:0032050      clathrin heavy chain binding
molecular_function  GO:0005546      phosphatidylinositol-4,5-bisphosphate binding
molecular_function  GO:0000149      SNARE binding
molecular_function  GO:0005543      phospholipid binding