Cp4.1LG20g07160 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG20g07160
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Heavy metal-associated domain protein
Location: Cp4.1LG20 : 6451426 .. 6453524 (+)
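
As a rough illustration of how the location string can be used, the sketch below parses it and computes the length of the genomic span. It assumes the coordinates are 1-based and inclusive (the usual convention for GFF-style features, but not stated on this page); `parse_location` and `span_length` are hypothetical helper names.

```python
import re

# Pattern for strings such as "Cp4.1LG20 : 6451426 .. 6453524 (+)".
LOCATION_RE = re.compile(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Split a location string into (seqid, start, end, strand)."""
    match = LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Cp4.1LG20 : 6451426 .. 6453524 (+)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 2099 bp on the plus strand of Cp4.1LG20.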
The following sequences are available for this feature:

Gene sequence (with introns)

CATCCATGGAGTATGGCACGTCTTCATCGTCACGAACGAAACCGCTTCCTCTGAAATTTCTCTCCCTCTCTATTCTCTCGCTATAAATACTCTCCCTCTCTAAACGGTTTCCCATCTCTCTAATTTCGATTTCGTTTTGAATTTCAGAAGACGAAACTACCTGTGTTACAATGGGCGAGGTGAGTCGACTCGGATTTTTGCTCTGTTTCTGTTCTTCTCTGTGTTTTTTAGGGTTAATTCTTTAGGGTTTTGTTGTACAGAAGAAGACGAAGAACGAGAATGAGAAAAACGGCGATGGAGGCGGAGGGAAGAAGAAGGAAGAGAATCCATTCACTGTGGTTCTTAAAGTCGATATGCATTGTGAAGGCTGCGCCAACAAAATCACCAAATGTGTTAAGGGATTTGAAGGTATGATCACGACGGCCATGGCGGATTCCTGTTTTTTCTTTTCCTCTGTTTGTCCCCTGTTATGTTATTTTACTGTCAGTTTCAGATATAAATATGCTCTGTTTTCGATTTTCTGCTTTCCGCCATTACTGAGCAGTGAATCGATAATGCTCTGTTTTTTGAACTGTTAACAGGAAGAACAAAAGAAGACCTAAATCGGGTTCCATGTGTATATTTTTCCTCTGTTCTGTCATCGTTATGATTTTACTGTCAGTTTCAGATATAAACATGCTCTGTTTTCGATTTTCCGCTTTCCGCCATTATTGAGCGCAGTGAATCGATAATGCTCTGTTTTTTGAACTGTTAACAGGAAGAACAAAAAAGTACCCAAATCGGGTTTCCATTTGTGTATTTTACCTTTCTGTTATTCGGCAATGAAGATATCTGACACCATTAATGGAGGCGCTCTGTTTTGTTTGTTTGTTTGTTCTTCAGGTGTGCAGACTGTGAAAGCTGAGATTGAAGGAAACAAACTGACAGTAACGGGGGAAAAAATAGACGCATCGAAACTCCTTGAGAAGCTTTCAAACAAGACGAAGAAGAAAGTGGATTTGATTTCACCACAGAGCAAGAAAGAAAAGGACTCGAAGCCCAAAGTTAAGGGCGATGAAGACCAGACCTCTTCTAATAACAACAAATCCGACAAGAAAACAGAGGAGAACAAGAAGAAACCCAAAGAGGTAAAGAAAAAAACAAGAAACCCTTGTTGGTCCCCTCTGAGATTCCGTTGTTTGATTCTGTTTTCTTCTCTGTTTTTGAAACAGCCGCCGGTGACGACGGCGGTGCTGAAGGTGGCGCTACATTGCCAGGGGTGTATAGAGAAGATTCAAAGGATTACAACCAAATTCAAAGGTAAGAGATGATGAACAAGAAATGGGTTCTTAATTGATTTGATATTTGATTTTGATTTTGATTTGAAATAATGTCAGGCGTTCAAGAGATGTCATTGGATAAGCAGAAGGAGTTGGTGACCGTGAAAGGCACAATGGATGTTAAGGCCTTGAGTGGTTGCTTAACCGAGAGACTAAAACGAGCGGTGGAGATTGTCCCAACGAAGAAGGAGAAAGACAAGGACAATAACAACAACAACAACAAAAACGAAGGTGGTGGCGGCACCAAGAAAGAACCTCCTGCAGCCGGCGACGGCGACAGCAATAGCAATGAGAATGGAGGAGGAAAGAAGAAGAAGAAAGGCGGTAACGGTGGAGGAGACGGTGGAGCAACGGGCATGGAAGAGGGTGGTGGAGGTGGAAAAATGGAAGGGTACAAAATGGAGTACATGGGAGTGGGAGGAGGAATAGGATATGGATACGGGTTGAATGGGTACGGGATGAGTACTGGTCTCGGGTATGGGTACGGGTATGGAGCGGGAGGGGTCGTGGGGGAGAATTTACACGCACCGCAGTTGTTTAGCGATGAGAATCCAAATGCTTGCTCGATCATGTAAGGATTAGAAAATGTAAAAATTTAGAAGAAGAATCCAATGTCTAGTTGGTAATTTGGTTAGTTTTTGGTAGGGGAGGGGGAATGAAGGAAGGAAGGAAGGTTTTTGT
TGTTATTTTGTTGGTTAATATTGGATGAAGAGGGGAATTGATTCCCTTTGGGCTATCATTATGTGTTTGTTTTTTAATATCTTGGCTCTTAATTTGTGC

mRNA sequence

CATCCATGGAGTATGGCACGTCTTCATCGTCACGAACGAAACCGCTTCCTCTGAAATTTCTCTCCCTCTCTATTCTCTCGCTATAAATACTCTCCCTCTCTAAACGGTTTCCCATCTCTCTAATTTCGATTTCGTTTTGAATTTCAGAAGACGAAACTACCTGTGTTACAATGGGCGAGAAGAAGACGAAGAACGAGAATGAGAAAAACGGCGATGGAGGCGGAGGGAAGAAGAAGGAAGAGAATCCATTCACTGTGGTTCTTAAAGTCGATATGCATTGTGAAGGCTGCGCCAACAAAATCACCAAATGTGTTAAGGGATTTGAAGGTGTGCAGACTGTGAAAGCTGAGATTGAAGGAAACAAACTGACAGTAACGGGGGAAAAAATAGACGCATCGAAACTCCTTGAGAAGCTTTCAAACAAGACGAAGAAGAAAGTGGATTTGATTTCACCACAGAGCAAGAAAGAAAAGGACTCGAAGCCCAAAGTTAAGGGCGATGAAGACCAGACCTCTTCTAATAACAACAAATCCGACAAGAAAACAGAGGAGAACAAGAAGAAACCCAAAGAGCCGCCGGTGACGACGGCGGTGCTGAAGGTGGCGCTACATTGCCAGGGGTGTATAGAGAAGATTCAAAGGATTACAACCAAATTCAAAGGCGTTCAAGAGATGTCATTGGATAAGCAGAAGGAGTTGGTGACCGTGAAAGGCACAATGGATGTTAAGGCCTTGAGTGGTTGCTTAACCGAGAGACTAAAACGAGCGGTGGAGATTGTCCCAACGAAGAAGGAGAAAGACAAGGACAATAACAACAACAACAACAAAAACGAAGGTGGTGGCGGCACCAAGAAAGAACCTCCTGCAGCCGGCGACGGCGACAGCAATAGCAATGAGAATGGAGGAGGAAAGAAGAAGAAGAAAGGCGGTAACGGTGGAGGAGACGGTGGAGCAACGGGCATGGAAGAGGGTGGTGGAGGTGGAAAAATGGAAGGGTACAAAATGGAGTACATGGGAGTGGGAGGAGGAATAGGATATGGATACGGGTTGAATGGGTACGGGATGAGTACTGGTCTCGGGTATGGGTACGGGTATGGAGCGGGAGGGGTCGTGGGGGAGAATTTACACGCACCGCAGTTGTTTAGCGATGAGAATCCAAATGCTTGCTCGATCATGTAAGGATTAGAAAATGTAAAAATTTAGAAGAAGAATCCAATGTCTAGTTGGTAATTTGGTTAGTTTTTGGTAGGGGAGGGGGAATGAAGGAAGGAAGGAAGGTTTTTGTTGTTATTTTGTTGGTTAATATTGGATGAAGAGGGGAATTGATTCCCTTTGGGCTATCATTATGTGTTTGTTTTTTAATATCTTGGCTCTTAATTTGTGC

Coding sequence (CDS)

ATGGGCGAGAAGAAGACGAAGAACGAGAATGAGAAAAACGGCGATGGAGGCGGAGGGAAGAAGAAGGAAGAGAATCCATTCACTGTGGTTCTTAAAGTCGATATGCATTGTGAAGGCTGCGCCAACAAAATCACCAAATGTGTTAAGGGATTTGAAGGTGTGCAGACTGTGAAAGCTGAGATTGAAGGAAACAAACTGACAGTAACGGGGGAAAAAATAGACGCATCGAAACTCCTTGAGAAGCTTTCAAACAAGACGAAGAAGAAAGTGGATTTGATTTCACCACAGAGCAAGAAAGAAAAGGACTCGAAGCCCAAAGTTAAGGGCGATGAAGACCAGACCTCTTCTAATAACAACAAATCCGACAAGAAAACAGAGGAGAACAAGAAGAAACCCAAAGAGCCGCCGGTGACGACGGCGGTGCTGAAGGTGGCGCTACATTGCCAGGGGTGTATAGAGAAGATTCAAAGGATTACAACCAAATTCAAAGGCGTTCAAGAGATGTCATTGGATAAGCAGAAGGAGTTGGTGACCGTGAAAGGCACAATGGATGTTAAGGCCTTGAGTGGTTGCTTAACCGAGAGACTAAAACGAGCGGTGGAGATTGTCCCAACGAAGAAGGAGAAAGACAAGGACAATAACAACAACAACAACAAAAACGAAGGTGGTGGCGGCACCAAGAAAGAACCTCCTGCAGCCGGCGACGGCGACAGCAATAGCAATGAGAATGGAGGAGGAAAGAAGAAGAAGAAAGGCGGTAACGGTGGAGGAGACGGTGGAGCAACGGGCATGGAAGAGGGTGGTGGAGGTGGAAAAATGGAAGGGTACAAAATGGAGTACATGGGAGTGGGAGGAGGAATAGGATATGGATACGGGTTGAATGGGTACGGGATGAGTACTGGTCTCGGGTATGGGTACGGGTATGGAGCGGGAGGGGTCGTGGGGGAGAATTTACACGCACCGCAGTTGTTTAGCGATGAGAATCCAAATGCTTGCTCGATCATGTAA

Protein sequence

MGEKKTKNENEKNGDGGGGKKKEENPFTVVLKVDMHCEGCANKITKCVKGFEGVQTVKAEIEGNKLTVTGEKIDASKLLEKLSNKTKKKVDLISPQSKKEKDSKPKVKGDEDQTSSNNNKSDKKTEENKKKPKEPPVTTAVLKVALHCQGCIEKIQRITTKFKGVQEMSLDKQKELVTVKGTMDVKALSGCLTERLKRAVEIVPTKKEKDKDNNNNNNKNEGGGGTKKEPPAAGDGDSNSNENGGGKKKKKGGNGGGDGGATGMEEGGGGGKMEGYKMEYMGVGGGIGYGYGLNGYGMSTGLGYGYGYGAGGVVGENLHAPQLFSDENPNACSIM
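
The relationship between the coding sequence and the protein sequence above can be checked with a short translation sketch. It assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `translate` and `expected_cds_length` are hypothetical helpers, and only the first 30 and last 33 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def expected_cds_length(protein_length):
    """Nucleotides needed for the residues plus one stop codon."""
    return 3 * (protein_length + 1)


cds_start = "ATGGGCGAGAAGAAGACGAAGAACGAGAAT"    # first 10 codons of the CDS
cds_end = "GATGAGAATCCAAATGCTTGCTCGATCATGTAA"  # last 10 codons plus stop
print(translate(cds_start))      # MGEKKTKNEN
print(translate(cds_end))        # DENPNACSIM
print(expected_cds_length(336))  # 1011
```

Both fragments agree with the start and end of the protein sequence listed above, and a 336-residue protein implies a 1011-nt CDS including the stop codon.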
BLAST of Cp4.1LG20g07160 vs. Swiss-Prot
Match: HIP3_ARATH (Heavy metal-associated isoprenylated plant protein 3 OS=Arabidopsis thaliana GN=HIPP3 PE=1 SV=1)

HSP 1 Score: 251.9 bits (642), Expect = 9.7e-66
Identity = 177/336 (52.68%), Positives = 209/336 (62.20%), Query Frame = 1

Query: 1   MGEKKTKNENEKNGDGGGGKKKEENP-FTVVLKVDMHCEGCANKITKCVKGFEGVQTVKA 60
           MGEKK + +N+K G  G  KKK E P  TVVLKVDMHCEGCA++I KCV+ F+GV+TVK+
Sbjct: 1   MGEKKNEGDNKKKG--GDNKKKNETPSITVVLKVDMHCEGCASRIVKCVRSFQGVETVKS 60

Query: 61  EIEGNKLTVTGEKIDASKLLEKLSNKTKKKVDLISPQSKKEKDSKPKVKGDEDQTSSNNN 120
           E    KLTVTG  +D  KL EKL  KTKKKVDL+SPQ KKEK+ + K K DED+  S   
Sbjct: 61  ESATGKLTVTGA-LDPVKLREKLEEKTKKKVDLVSPQPKKEKEKENKNKNDEDKKKS--- 120

Query: 121 KSDKKTEENKKKPKEPPVTTAVLKVALHCQGCIEKIQRITTKFKGVQEMSLDKQKELVTV 180
           +  KK + N KKPKE PVTTAVLK+  HCQGCI KIQ+  TK KGV  +++DK+K L+TV
Sbjct: 121 EEKKKPDNNDKKPKETPVTTAVLKLNFHCQGCIGKIQKTVTKTKGVNGLTMDKEKNLLTV 180

Query: 181 KGTMDVKALSGCLTERLKRAVEIVPTKKEKDKDNNNNNNKNEGGGGTKKEPPAAGDGDSN 240
           KGTMDVK L   L+E+LKRAVEIVP KKEKDK+N N N + + GGG          GD  
Sbjct: 181 KGTMDVKKLVEILSEKLKRAVEIVPPKKEKDKENGNENGEKKKGGG----------GD-- 240

Query: 241 SNENGGGKKKKKGGNGGGDGGATGMEEGGGGGKMEGYKMEYMGVGGGIGYGYGLNGYGMS 300
                GG K+K G  GGG+G                  MEYM      GYGY   G    
Sbjct: 241 -----GGGKEKTGNKGGGEG---------------VNMMEYMAAQPAYGYGYYPGG---- 283

Query: 301 TGLGYGYGYGAGGVVGENLHAPQLFSDENPNACSIM 336
               YGY   A        HAPQ+FSDENPNAC +M
Sbjct: 301 ---PYGYPIQA--------HAPQIFSDENPNACVVM 283
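
The percentages on an HSP summary line are simply the match counts divided by the alignment length. The sketch below parses such a line and recomputes them; `parse_hsp_summary` is a hypothetical helper, and the pattern accepts both "Positives" and the "Postives" spelling that appears in some renderings of these reports.

```python
import re

# Matches e.g. "Identity = 177/336 (52.68%), Positives = 209/336 (62.20%)".
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Return match counts plus recomputed and reported percentages."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, length = int(match.group(1)), int(match.group(2))
    positives = int(match.group(4))
    return {
        "alignment_length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / length, 2),
        "reported_identity_pct": float(match.group(3)),
        "reported_positives_pct": float(match.group(6)),
    }


summary = parse_hsp_summary(
    "Identity = 177/336 (52.68%), Positives = 209/336 (62.20%), Query Frame = 1"
)
print(summary["identity_pct"], summary["positives_pct"])
```

For the HIP3_ARATH hit, 177/336 and 209/336 reproduce the reported 52.68% identity and 62.20% positives.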

BLAST of Cp4.1LG20g07160 vs. Swiss-Prot
Match: HIP26_ARATH (Heavy metal-associated isoprenylated plant protein 26 OS=Arabidopsis thaliana GN=HIPP26 PE=1 SV=1)

HSP 1 Score: 57.8 bits (138), Expect = 2.7e-07
Identity = 30/72 (41.67%), Positives = 49/72 (68.06%), Query Frame = 1

Query: 21 KKEENPFTVVLKVDMHCEGCANKITKCVKGFEGVQTVKAEIEGNKLTVTGEKIDASKLLE 80
          KK +   TV +KV M CEGC  K+ + V+G +GV +V  E + +K+TV G  +D +K++ 
Sbjct: 20 KKRKQLQTVEIKVKMDCEGCERKVRRSVEGMKGVSSVTLEPKAHKVTVVG-YVDPNKVVA 79

Query: 81 KLSNKTKKKVDL 93
          ++S++T KKV+L
Sbjct: 80 RMSHRTGKKVEL 90

BLAST of Cp4.1LG20g07160 vs. TrEMBL
Match: A0A0A0LSA4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G103270 PE=4 SV=1)

HSP 1 Score: 452.6 bits (1163), Expect = 4.2e-124
Identity = 265/336 (78.87%), Positives = 287/336 (85.42%), Query Frame = 1

Query: 3   EKKTKNENEKNGDGGGG--KKKEENPFTVVLKVDMHCEGCANKITKCVKGFEGVQTVKAE 62
           +KK KN+NEKNGDGGGG  KKKEE PFT+VLK+DMHCEGCANKITKCVKGFEGVQ+VKAE
Sbjct: 7   QKKKKNDNEKNGDGGGGEGKKKEEIPFTIVLKIDMHCEGCANKITKCVKGFEGVQSVKAE 66

Query: 63  IEGNKLTVTGEKIDASKLLEKLSNKTKKKVDLISPQSKKEKDSKPKVKGDEDQTSSNNNK 122
           I+GNKLTV G+KIDA+KL EKLSNKTKKKVDLISPQ KKEKDSKPK K D+DQTSSNNNK
Sbjct: 67  IDGNKLTVMGKKIDATKLREKLSNKTKKKVDLISPQPKKEKDSKPKDKIDDDQTSSNNNK 126

Query: 123 SDKKTEENKKKPKEPPVTTAVLKVALHCQGCIEKIQRITTKFKGVQEMSLDKQKELVTVK 182
           SDKKT+ENKKKPKEPPVTTAVLKV LHCQGCIEKIQR+TTKFKGVQEMS+DKQK+ V VK
Sbjct: 127 SDKKTDENKKKPKEPPVTTAVLKVPLHCQGCIEKIQRVTTKFKGVQEMSVDKQKDSVMVK 186

Query: 183 GTMDVKALSGCLTERLKRAVEIVPTKKEKDKDNNNNNNKNEGGGGTKKEPPAAGDGDSNS 242
           GTMDVKAL G L+ERLKR VEIVP KKEK+K+  +NN K  GGGG +K+    GDGD N 
Sbjct: 187 GTMDVKALIGSLSERLKRTVEIVPAKKEKEKE-KDNNKKEGGGGGDEKKDSTTGDGDGNG 246

Query: 243 NENGGGKKKKKGGNGGGDGGATGMEEGGGGGKMEGYKMEYMGVGG-GIGYGYGLNGYGMS 302
             NGGGKKKKKGGNGGG  G  G   GGGGGKMEG KMEYMG+GG G GYGYG  GYGM+
Sbjct: 247 --NGGGKKKKKGGNGGGGDGEEG---GGGGGKMEGNKMEYMGMGGIGYGYGYGY-GYGMN 306

Query: 303 TGLGYGYGYGAGGVVGENLHAPQLFSDENPNACSIM 336
           T    GYGYG  G+VGENLHAPQLFSDENPNAC IM
Sbjct: 307 T----GYGYGPSGIVGENLHAPQLFSDENPNACFIM 331

BLAST of Cp4.1LG20g07160 vs. TrEMBL
Match: W9S929_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_020605 PE=4 SV=1)

HSP 1 Score: 271.9 bits (694), Expect = 1.0e-69
Identity = 191/344 (55.52%), Positives = 220/344 (63.95%), Query Frame = 1

Query: 4   KKTKNENEKNGDGGGGKKKEENPFTVVLKVDMHCEGCANKITKCVKGFEGVQTVKAEIEG 63
           KK   E +KNGDGG  KKKE+N  TVVLKVDMHCEGCA KI K VK F+GV   KAE   
Sbjct: 33  KKNGGEKKKNGDGGD-KKKEDNSLTVVLKVDMHCEGCATKIVKTVKSFDGVDDAKAEFAA 92

Query: 64  NKLTVTGEKIDASKLLEKLSNKTKKKVDLISPQSKKEKDSKPKVKGDEDQTSSNNNKSDK 123
           NKLTV G K+D SKL E L+ KTKKKVDLISPQ  K+ D+K     ++ +T  N+NK  K
Sbjct: 93  NKLTVVG-KVDPSKLREMLAVKTKKKVDLISPQPSKKDDNK---NDNKKKTDENDNK--K 152

Query: 124 KTEENKKKPKEPPVTTAVLKVALHCQGCIEKIQRITTKFKGVQEMSLDKQKELVTVKGTM 183
           K +E K K KEPPVTTAVLK+ LHCQGCI KI++  TK KG  +MS+D+QKELVTV G+M
Sbjct: 153 KPDEKKPKDKEPPVTTAVLKLRLHCQGCIGKIRKTVTKTKGFNDMSIDEQKELVTVIGSM 212

Query: 184 DVKALSGCLTERLKRAVEIVPTKKEKDKDNNNNNNKNEGGGGTKKEPPAAGDGDSNSNEN 243
           D+KAL+  L E+LKR VEIVP KKEKD                       G+ D    +N
Sbjct: 213 DMKALAESLQEKLKRPVEIVPPKKEKD----------------------GGEKDGGKADN 272

Query: 244 GGGKKKKKGGNGGGDGGATGMEEGGGGGKMEGYKMEYMGVGGGIGYGYGLN---GYGMST 303
           GGG  KKKGG GGG GG  G +  G GGKME  +MEYMG     GYGYG     GY    
Sbjct: 273 GGG-GKKKGGGGGGGGGDGGQKAEGDGGKMEESRMEYMG-QPLFGYGYGPGPGFGYAYPP 332

Query: 304 GLGYGYGYGAG---------GVVGENLHAPQLFSDENPNACSIM 336
           G G+G  Y  G         G VGE LHAPQ+FSDENPNACS+M
Sbjct: 333 GPGFGSAYPPGPGFGSAYPPGYVGEQLHAPQVFSDENPNACSVM 345

BLAST of Cp4.1LG20g07160 vs. TrEMBL
Match: V4SL19_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10028824mg PE=4 SV=1)

HSP 1 Score: 264.2 bits (674), Expect = 2.1e-67
Identity = 185/340 (54.41%), Positives = 224/340 (65.88%), Query Frame = 1

Query: 3   EKKTKNENEKNGDGGGGKKKEE--NPFTVVLKVDMHCEGCANKITKCVKGFEGVQTVKAE 62
           EKK K E E  GD    KKK++  +  TV+LKVDMHCEGCANKI +  + FEGV+ VKAE
Sbjct: 24  EKKKKEEEE--GDAVAEKKKDDKKSSVTVILKVDMHCEGCANKIVRYARSFEGVEAVKAE 83

Query: 63  IEGNKLTVTGEKIDASKLLEKLSNKTKKKVDLISPQSKKE-KDSKPKVKGDEDQTSSNNN 122
           +  NK+T+ G  +D SK+ EKL  KTKKK+DLISPQ KK+ KD +PK          +N 
Sbjct: 84  VAANKITIVGA-VDPSKIREKLDKKTKKKIDLISPQPKKDNKDKEPK---------QDNK 143

Query: 123 KSDKKTEENKKKPKEPPVTTAVLKVALHCQGCIEKIQRITTKFKGVQEMSLDKQKELVTV 182
             D K+ ++KK PKEPPVTTAVLK+ LHCQGCIEKI +I +K KGV + S+DKQK+ VTV
Sbjct: 144 PKDNKSPDDKK-PKEPPVTTAVLKLGLHCQGCIEKILKIVSKTKGVMDKSIDKQKDTVTV 203

Query: 183 KGTMDVKALSGCLTERLKRAVEIVPTKKEKDKDNNNN-NNKNEGGGGTKKEPPAAGDGDS 242
           KGTMD KAL+  L ERLKR VEIVP KKEK+K+     N++ E  GG             
Sbjct: 204 KGTMDAKALAEVLKERLKRPVEIVPPKKEKEKEKEKEKNDEKESNGG------------D 263

Query: 243 NSNENGGGKKKKKGGNGGGDGGATGMEEGGGGGKMEGYKMEY--MGV-GGGIGYGYGLNG 302
           N+N   GG KKKKGG GGG G   G + GGGGGKME  +MEY  MGV G G G+GY ++G
Sbjct: 264 NNNSGNGGSKKKKGGGGGGGGQEVG-DRGGGGGKMEESRMEYFPMGVPGSGYGHGYQIHG 323

Query: 303 YGMSTGLGYGYGYGAGGVVGENLHAPQLFSDENPNACSIM 336
                  GY YGY  GG   +   APQ+FSDENPNAC +M
Sbjct: 324 -------GYEYGYPVGGYYHQPA-APQMFSDENPNACVVM 329

BLAST of Cp4.1LG20g07160 vs. TrEMBL
Match: A0A067EYM6_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g020014mg PE=4 SV=1)

HSP 1 Score: 261.2 bits (666), Expect = 1.8e-66
Identity = 180/336 (53.57%), Positives = 220/336 (65.48%), Query Frame = 1

Query: 7   KNENEKNGDGGGGKKKEE--NPFTVVLKVDMHCEGCANKITKCVKGFEGVQTVKAEIEGN 66
           K E E+ GD    KKK++  +  TV+LKVDMHCEGCANKI +  + FEGV+ VKAE+  N
Sbjct: 30  KKEEEEEGDAVAEKKKDDKKSSVTVILKVDMHCEGCANKIVRYARSFEGVEAVKAEVAAN 89

Query: 67  KLTVTGEKIDASKLLEKLSNKTKKKVDLISPQSKKE-KDSKPKVKGDEDQTSSNNNKSDK 126
           K+T+ G  +D SK+ EKL  KTKKK+DLISPQ KK+ KD +PK          +N   D 
Sbjct: 90  KITIVG-AVDPSKIREKLDKKTKKKIDLISPQPKKDNKDKEPK---------QDNKPKDN 149

Query: 127 KTEENKKKPKEPPVTTAVLKVALHCQGCIEKIQRITTKFKGVQEMSLDKQKELVTVKGTM 186
           K+ ++ KKPKEPPV TAVLK+ LHCQGCIEKI +I +K KGV + S+DKQK+ VTVKGTM
Sbjct: 150 KSPDD-KKPKEPPVMTAVLKLGLHCQGCIEKILKIVSKTKGVMDKSIDKQKDTVTVKGTM 209

Query: 187 DVKALSGCLTERLKRAVEIVPTKKEKDKDNNNNNNKNEGGGGTKKEPPAAGDGDSNSNEN 246
           D KAL+  L ERLKR VEIVP KKEK+K+ N+    N               GD+N++  
Sbjct: 210 DAKALAEVLKERLKRPVEIVPPKKEKEKEKNDEKESN--------------GGDNNNSGG 269

Query: 247 GGGKKKKKGGNGGGDGGATG-MEEGGGGGKMEGYKMEY--MGV-GGGIGYGYGLNGYGMS 306
            GG KKKKGG GGG G   G    GGGGGKME  +MEY  MGV G G G+GY ++G    
Sbjct: 270 NGGSKKKKGGGGGGGGQEVGDGGGGGGGGKMEESRMEYFPMGVPGSGYGHGYQIHG---- 329

Query: 307 TGLGYGYGYGAGGVVGENLHAPQLFSDENPNACSIM 336
              GY YGY  GG   +   APQ+FSDENPNAC +M
Sbjct: 330 ---GYEYGYPVGGYYHQPA-APQMFSDENPNACVVM 332

BLAST of Cp4.1LG20g07160 vs. TrEMBL
Match: A0A067L9R4_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_05455 PE=4 SV=1)

HSP 1 Score: 260.8 bits (665), Expect = 2.3e-66
Identity = 188/365 (51.51%), Positives = 225/365 (61.64%), Query Frame = 1

Query: 2   GEKKTKNENEKNGDGGGGKKKEENPFTVVLKVDMHCEGCANKITKCVKGFEGVQTVKAEI 61
           GEK  + + ++ G GGG KK  +NP  VVLK++MHCEGCA+K+ K  +  EGV+TVKA+ 
Sbjct: 21  GEKHVEVQKKEGGGGGGDKKDGKNPMPVVLKIEMHCEGCASKVIKSARKLEGVETVKADT 80

Query: 62  EGNKLTVTGEKIDASKLLEKLSNKTKKKVDLISPQSKKE--KDSKPKVKGDEDQTSSNNN 121
           E NKLTVTG K++ S++ + L  KTKKKV+LISPQ KKE   +S    K ++++   N  
Sbjct: 81  ESNKLTVTG-KVNPSQIRDILHKKTKKKVELISPQPKKEDANNSNNNSKKEDNKKEDNKK 140

Query: 122 KSDKKTEENKKKPKEPPVTTAVLKVALHCQGCIEKIQRITTKFKGVQEMSLDKQKELVTV 181
            +DKK + + KKPKEPPVTTAV+KVA HC GCIEKIQRI  K KGVQEM+LD+QKE VTV
Sbjct: 141 SNDKKPDADNKKPKEPPVTTAVIKVAFHCLGCIEKIQRIVCKTKGVQEMTLDRQKETVTV 200

Query: 182 KGTMDVKALSGCLTERLKRAVEIVPTKKEKDKDNNNNNNKNEGGGGTKKEPPAAGDGDSN 241
           KGTMDVK L+  L ERLKR VEIVP KKEK+K+ N  +   E           AGDG   
Sbjct: 201 KGTMDVKGLTEALKERLKRPVEIVPPKKEKEKEANGGDKGGEN----------AGDGSG- 260

Query: 242 SNENGGGKKKKKGGNGGGDGGATGMEEGGGGGKMEGYKMEYM-GVGGGIGYGYGL----- 301
                   KKKKGG GGGDG   G   G    KMEG +MEY+ G G G GYGYG      
Sbjct: 261 --------KKKKGGGGGGDGNGGGGGGGDAAAKMEGNRMEYVPGYGFGPGYGYGYMSQPM 320

Query: 302 ----NGY-GMST-------GLG-----------YGYGYGAGGVVGENLHAPQLFSDENPN 336
               NGY G          G G           YGYGYG G V G  +H    F+DENPN
Sbjct: 321 PVYGNGYMGQPVPVPVPVYGNGYMGPAPQPVYEYGYGYGHGQVPGYPVH--MKFNDENPN 363

BLAST of Cp4.1LG20g07160 vs. TAIR10
Match: AT5G60800.2 (AT5G60800.2 Heavy metal transport/detoxification superfamily protein )

HSP 1 Score: 250.4 bits (638), Expect = 1.6e-66
Identity = 176/336 (52.38%), Positives = 209/336 (62.20%), Query Frame = 1

Query: 1   MGEKKTKNENEKNGDGGGGKKKEENP-FTVVLKVDMHCEGCANKITKCVKGFEGVQTVKA 60
           MGEKK + +N+K G  G  KKK E P  TVVLKVDMHCEGCA++I KCV+ F+GV+TVK+
Sbjct: 1   MGEKKNEGDNKKKG--GDNKKKNETPSITVVLKVDMHCEGCASRIVKCVRSFQGVETVKS 60

Query: 61  EIEGNKLTVTGEKIDASKLLEKLSNKTKKKVDLISPQSKKEKDSKPKVKGDEDQTSSNNN 120
           E    KLTVTG  +D  KL EKL  KTKKKVDL+SPQ KKEK+ + K K DED+  S   
Sbjct: 61  ESATGKLTVTGA-LDPVKLREKLEEKTKKKVDLVSPQPKKEKEKENKNKNDEDKKKS--- 120

Query: 121 KSDKKTEENKKKPKEPPVTTAVLKVALHCQGCIEKIQRITTKFKGVQEMSLDKQKELVTV 180
           +  KK + N KKPKE PVTTAVLK+  HCQGCI KIQ+  TK KGV  +++DK+K L+TV
Sbjct: 121 EEKKKPDNNDKKPKETPVTTAVLKLNFHCQGCIGKIQKTVTKTKGVNGLTMDKEKNLLTV 180

Query: 181 KGTMDVKALSGCLTERLKRAVEIVPTKKEKDKDNNNNNNKNEGGGGTKKEPPAAGDGDSN 240
           KGTMDVK L   L+E+LKRAVEIVP KKEKDK+N N N + + GGG          GD  
Sbjct: 181 KGTMDVKKLVEILSEKLKRAVEIVPPKKEKDKENGNENGEKKKGGG----------GD-- 240

Query: 241 SNENGGGKKKKKGGNGGGDGGATGMEEGGGGGKMEGYKMEYMGVGGGIGYGYGLNGYGMS 300
                GG K+K G  GGG+G                  MEYM      GYGY   G    
Sbjct: 241 -----GGGKEKTGNKGGGEG---------------VNMMEYMAAQPAYGYGYYPGG---- 283

Query: 301 TGLGYGYGYGAGGVVGENLHAPQLFSDENPNACSIM 336
               YGY   A        HAPQ+FSDENPNAC ++
Sbjct: 301 ---PYGYPIQA--------HAPQIFSDENPNACVVI 283

BLAST of Cp4.1LG20g07160 vs. TAIR10
Match: AT2G36950.1 (AT2G36950.1 Heavy metal transport/detoxification superfamily protein )

HSP 1 Score: 151.8 bits (382), Expect = 7.7e-37
Identity = 143/354 (40.40%), Positives = 183/354 (51.69%), Query Frame = 1

Query: 30  VLKVDMHCEGCANKITKCVKGFEGVQTVKAEIEGNKLTVTGEKIDASKLLEKLSNKTKKK 89
           V KVDMHCEGCA KI + VK F+GV+ V A+  GNKL V G KID  KL EKL  KTK+K
Sbjct: 53  VYKVDMHCEGCAKKIKRMVKHFDGVKDVTADTGGNKLLVVG-KIDPVKLQEKLEEKTKRK 112

Query: 90  VDLISPQSKKEKDSKPKVKGDEDQTSSNNNKS--DKKTEENKKKPKEPPVTTAVLKVALH 149
           V L +P         PKV+G              DK+       P  P  +   LK+ LH
Sbjct: 113 VVLANPP--------PKVEGPVAAAVGEKKADGGDKEAAPPAPAPAAPKESVVPLKIRLH 172

Query: 150 CQGCIEKIQRITTKFKGVQEMSLDKQKELVTVKGTMDVKALSGCLTERLKRAVE-IVPTK 209
           C+GCI+KI++I  K KGV+ +++D  K++VTVKGT+DVK L   LT++LKR VE +VP K
Sbjct: 173 CEGCIQKIKKIILKIKGVETVAIDGAKDVVTVKGTIDVKELVPLLTKKLKRTVEPLVPAK 232

Query: 210 KEKDKDNNNNNNKNEGGG-GTKKEPPAAGDGDSNSNENGGGKKKKKGGNG------GGDG 269
           K+   D    N K E      KKE P+AG  ++    + GG+KKK+ G+G      GGDG
Sbjct: 233 KD---DGAAENKKTEAAAPDAKKEAPSAGVNEAKKEGSDGGEKKKEVGDGGEKKKEGGDG 292

Query: 270 GATGMEEGGGGGKMEGYKMEYMGVGGGIG-----------YGYGL-----------NGYG 329
           G    E G GG K +         GGG+            YGY             + YG
Sbjct: 293 GEKKKEAGDGGEKKKD--------GGGVPAPVAMVNKMDYYGYSAYPTAPMHWQEGHVYG 352

Query: 330 MS---TGLGY----------GYGYGAGGVV---GENLHAPQLFSDENPNACSIM 336
            S   TG  Y          GY Y +   V     N++AP +FSDENPN CS+M
Sbjct: 353 QSYSMTGQNYPVGGQSYPGSGYNYASESYVPYAQPNVNAPGMFSDENPNGCSVM 386

BLAST of Cp4.1LG20g07160 vs. TAIR10
Match: AT2G28090.1 (AT2G28090.1 Heavy metal transport/detoxification superfamily protein )

HSP 1 Score: 146.7 bits (369), Expect = 2.5e-35
Identity = 90/207 (43.48%), Positives = 128/207 (61.84%), Query Frame = 1

Query: 13  NGDGGGGKKKEEN----PFTVVLKVDMHCEGCANKITKCVKGFEGVQTVKAEIEGNKLTV 72
           +GD    KKK++N    P  VVLK+D HC+GC  +I +  +  EGV+TV+A+ + NKLT+
Sbjct: 11  HGDVEEEKKKKQNNTTSPVHVVLKIDFHCDGCIARIVRLSRRLEGVETVRADPDSNKLTL 70

Query: 73  TGEKIDASKLLEKLSNKTKKKVDLISPQSKKEKDSKPKVKGDEDQTSSNNNKSDKKTEEN 132
            G  +D  K+ EKL  K+KKKV+LISP+ KK+             T  NN K     + N
Sbjct: 71  IGFIMDPVKIAEKLQKKSKKKVELISPKPKKD-------------TKENNEK-----KAN 130

Query: 133 KKKPKEPPVTTAVLKVALHCQGCIEKIQRITTKFKGVQEMSLDKQKELVTVKGTMDVKAL 192
            K      VTT VLKV   C GCI++IQ+  +  KGV ++ +DK+KE VTV GTMD+K++
Sbjct: 131 DKTQTVVAVTTVVLKVNCSCDGCIKRIQKAVSTTKGVYQVKMDKEKETVTVMGTMDIKSV 190

Query: 193 SGCLTERLKRAVEIVPTKKEKDKDNNN 216
           +  L  +LK+ V++VP KK+K KD +N
Sbjct: 191 TDNLKRKLKKTVQVVPEKKKKKKDKDN 199

BLAST of Cp4.1LG20g07160 vs. TAIR10
Match: AT5G03380.1 (AT5G03380.1 Heavy metal transport/detoxification superfamily protein )

HSP 1 Score: 146.4 bits (368), Expect = 3.2e-35
Identity = 150/407 (36.86%), Positives = 198/407 (48.65%), Query Frame = 1

Query: 1   MGEKK----TKNENEKNGDGGGGKKKEENPFTVVLKVDMHCEGCANKITKCVKGFEGVQT 60
           MGEKK    TK + EK    GG         TVV+K+DMHCEGC  KI +  K F+GV+ 
Sbjct: 1   MGEKKEETATKPQGEKKPTDGGIT-------TVVMKLDMHCEGCGKKIKRIFKHFKGVED 60

Query: 61  VKAEIEGNKLTVTGEKIDASKLLEKLSNKTKKKVDLISPQSKKEKDSKPKVKGDEDQTSS 120
           VK + + NKLTV G  +D  ++ +K+++K K+ V+L+S  +  +K++ P   G E + S 
Sbjct: 61  VKIDYKSNKLTVIGN-VDPVEVRDKVADKIKRPVELVSTVAPPKKETPPSSGGAEKKPSP 120

Query: 121 -----------------NNNKSDKKTEENKKKPKEPPV---TTAVLKVALHCQGCIEKIQ 180
                               K +KK EE +KK   PP    +T VLK  LHC+GC  KI+
Sbjct: 121 AAEEKPAEKKPAAVEKPGEKKEEKKKEEGEKKASPPPPPKESTVVLKTKLHCEGCEHKIK 180

Query: 181 RITTKFKGVQEMSLDKQKELVTVKGTMDVKALSGCLTERLKRAVEIVPTKKEKDKD---- 240
           RI  K KGV  +++D  K+LV VKG +DVK L+  L E+LKR VE+VP KK+        
Sbjct: 181 RIVNKIKGVNSVAIDSAKDLVIVKGIIDVKQLTPYLNEKLKRTVEVVPAKKDDGAPVAAA 240

Query: 241 --NNNNNNKNEGGGGTKKEPPAAGDGDSNSNENGGGKKKKKGGNGGGDGGATGMEEGGGG 300
                   K +   G KKE    G+       +GGG+KKK+   GGG GG  G   GG G
Sbjct: 241 AAAPAGGEKKDKVAGEKKEIKDVGE----KKVDGGGEKKKEVAVGGGGGGGGG---GGDG 300

Query: 301 GKMEGYKMEYMGVG----------GGIGYG---YGLNGYG-------------------M 336
           G M+  K EY G G           G  YG   Y + G                     M
Sbjct: 301 GAMDVKKSEYNGYGYPPQPMYYYPEGQVYGQQHYMMQGQSSQSYVQEPYSNQGYVQESYM 360

BLAST of Cp4.1LG20g07160 vs. TAIR10
Match: AT3G02960.1 (AT3G02960.1 Heavy metal transport/detoxification superfamily protein )

HSP 1 Score: 127.9 bits (320), Expect = 1.2e-29
Identity = 79/227 (34.80%), Positives = 129/227 (56.83%), Query Frame = 1

Query: 3   EKKTKNENEKNGDGGGGKKKEENPFT-VVLKVDMHCEGCANKITKCVKGFEGVQTVKAEI 62
           + K++ +N+KNGD    K  ++N    +VLKV MHCEGCA++++ C++G++GV+ +K EI
Sbjct: 11  DNKSEKKNQKNGDSSVDKSDKKNQCKEIVLKVYMHCEGCASQVSHCLRGYDGVEHIKTEI 70

Query: 63  EGNKLTVTGEKIDASKLLEKLSNKTKKKVDLISPQSKKEKDSKPKVKGDEDQTSSNNNKS 122
             NK+ V+G+  D  K+L ++  K  +  ++ISP+   ++D K                 
Sbjct: 71  GDNKVVVSGKFDDPLKILRRVQKKFSRNAEMISPKHNPKQDQK----------------- 130

Query: 123 DKKTEENKKKPKEPPVTTAVLKVALHCQGCIEKIQRITTKFKGVQEMSLDKQKELVTVKG 182
               E  +KK   P + TA+L++ +HC+GC+ +I+R   K KG+Q +  D+ K  V V+G
Sbjct: 131 ----EPQQKKESAPEIKTAILRMNMHCEGCVHEIKRGIEKIKGIQSVEPDRSKSTVVVRG 190

Query: 183 TMDVKALSGCLTERLKRAVEIVPTKKEKDKDNN-NNNNKNEGGGGTK 228
            MD   L   + ++L +  E++    EK KDNN  NNNK E   G K
Sbjct: 191 VMDPPKLVEKIKKKLGKHAELLSQITEKGKDNNKKNNNKKEESDGNK 216

BLAST of Cp4.1LG20g07160 vs. NCBI nr
Match: gi|449459106|ref|XP_004147287.1| (PREDICTED: protein FAM98B-like [Cucumis sativus])

HSP 1 Score: 452.6 bits (1163), Expect = 6.0e-124
Identity = 265/336 (78.87%), Positives = 287/336 (85.42%), Query Frame = 1

Query: 3   EKKTKNENEKNGDGGGG--KKKEENPFTVVLKVDMHCEGCANKITKCVKGFEGVQTVKAE 62
           +KK KN+NEKNGDGGGG  KKKEE PFT+VLK+DMHCEGCANKITKCVKGFEGVQ+VKAE
Sbjct: 7   QKKKKNDNEKNGDGGGGEGKKKEEIPFTIVLKIDMHCEGCANKITKCVKGFEGVQSVKAE 66

Query: 63  IEGNKLTVTGEKIDASKLLEKLSNKTKKKVDLISPQSKKEKDSKPKVKGDEDQTSSNNNK 122
           I+GNKLTV G+KIDA+KL EKLSNKTKKKVDLISPQ KKEKDSKPK K D+DQTSSNNNK
Sbjct: 67  IDGNKLTVMGKKIDATKLREKLSNKTKKKVDLISPQPKKEKDSKPKDKIDDDQTSSNNNK 126

Query: 123 SDKKTEENKKKPKEPPVTTAVLKVALHCQGCIEKIQRITTKFKGVQEMSLDKQKELVTVK 182
           SDKKT+ENKKKPKEPPVTTAVLKV LHCQGCIEKIQR+TTKFKGVQEMS+DKQK+ V VK
Sbjct: 127 SDKKTDENKKKPKEPPVTTAVLKVPLHCQGCIEKIQRVTTKFKGVQEMSVDKQKDSVMVK 186

Query: 183 GTMDVKALSGCLTERLKRAVEIVPTKKEKDKDNNNNNNKNEGGGGTKKEPPAAGDGDSNS 242
           GTMDVKAL G L+ERLKR VEIVP KKEK+K+  +NN K  GGGG +K+    GDGD N 
Sbjct: 187 GTMDVKALIGSLSERLKRTVEIVPAKKEKEKE-KDNNKKEGGGGGDEKKDSTTGDGDGNG 246

Query: 243 NENGGGKKKKKGGNGGGDGGATGMEEGGGGGKMEGYKMEYMGVGG-GIGYGYGLNGYGMS 302
             NGGGKKKKKGGNGGG  G  G   GGGGGKMEG KMEYMG+GG G GYGYG  GYGM+
Sbjct: 247 --NGGGKKKKKGGNGGGGDGEEG---GGGGGKMEGNKMEYMGMGGIGYGYGYGY-GYGMN 306

Query: 303 TGLGYGYGYGAGGVVGENLHAPQLFSDENPNACSIM 336
           T    GYGYG  G+VGENLHAPQLFSDENPNAC IM
Sbjct: 307 T----GYGYGPSGIVGENLHAPQLFSDENPNACFIM 331

BLAST of Cp4.1LG20g07160 vs. NCBI nr
Match: gi|659072031|ref|XP_008463112.1| (PREDICTED: keratin, type I cytoskeletal 9-like isoform X1 [Cucumis melo])

HSP 1 Score: 452.6 bits (1163), Expect = 6.0e-124
Identity = 261/334 (78.14%), Positives = 285/334 (85.33%), Query Frame = 1

Query: 3   EKKTKNENEKNGDG-GGGKKKEENPFTVVLKVDMHCEGCANKITKCVKGFEGVQTVKAEI 62
           +KK KN+NEKNGDG GGGKKKE+ PFT+VLK+DMHCEGCANKITKCVKGFEGVQ+VKAEI
Sbjct: 7   QKKKKNDNEKNGDGEGGGKKKEDIPFTIVLKIDMHCEGCANKITKCVKGFEGVQSVKAEI 66

Query: 63  EGNKLTVTGEKIDASKLLEKLSNKTKKKVDLISPQSKKEKDSKPKVKGDEDQTSSNNNKS 122
           +GNKLTV G+KIDA+KL EKLSNKTKKKVDLISPQ KKEKDSKPK K D+DQTSSNNNK 
Sbjct: 67  DGNKLTVMGKKIDATKLREKLSNKTKKKVDLISPQPKKEKDSKPKDKIDDDQTSSNNNKF 126

Query: 123 DKKTEENKKKPKEPPVTTAVLKVALHCQGCIEKIQRITTKFKGVQEMSLDKQKELVTVKG 182
           DKKT+ENKKKPKEPPVTTAVLKV LHCQGCIEKIQR+TTKFKGVQEMS+D+QK+ V VKG
Sbjct: 127 DKKTDENKKKPKEPPVTTAVLKVPLHCQGCIEKIQRVTTKFKGVQEMSVDRQKDSVMVKG 186

Query: 183 TMDVKALSGCLTERLKRAVEIVPTKKEKDKDNNNNNNKNEGGGGTKKEPPAAGDGDSNSN 242
           TMDVKAL G L+ERLKR VEIVP KKEK+K+  NN N+  GG   K  P   GDG+ N N
Sbjct: 187 TMDVKALIGSLSERLKRPVEIVPAKKEKEKEKENNKNEGGGGDDKKDSPTGDGDGNGNGN 246

Query: 243 ENGGGKKKKKGGNGGGDGGATGMEEGGGGGKMEGYKMEYMGVGGGIGYGYGLNGYGMSTG 302
            NGGGKKKKKGGNGGG  G  G   GGGGGKMEG KMEYMG+ GGIGYGYG  GYG++T 
Sbjct: 247 GNGGGKKKKKGGNGGGGEGEEG--GGGGGGKMEGNKMEYMGM-GGIGYGYGY-GYGLNT- 306

Query: 303 LGYGYGYGAGGVVGENLHAPQLFSDENPNACSIM 336
              GYGYG GG+VGENLHAPQLFSDENPNAC IM
Sbjct: 307 ---GYGYGPGGLVGENLHAPQLFSDENPNACFIM 332

BLAST of Cp4.1LG20g07160 vs. NCBI nr
Match: gi|659072033|ref|XP_008463119.1| (PREDICTED: aspartate, glycine, lysine and serine-rich protein-like isoform X2 [Cucumis melo])

HSP 1 Score: 452.6 bits (1163), Expect = 6.0e-124
Identity = 261/334 (78.14%), Positives = 285/334 (85.33%), Query Frame = 1

Query: 3   EKKTKNENEKNGDG-GGGKKKEENPFTVVLKVDMHCEGCANKITKCVKGFEGVQTVKAEI 62
           +KK KN+NEKNGDG GGGKKKE+ PFT+VLK+DMHCEGCANKITKCVKGFEGVQ+VKAEI
Sbjct: 6   QKKKKNDNEKNGDGEGGGKKKEDIPFTIVLKIDMHCEGCANKITKCVKGFEGVQSVKAEI 65

Query: 63  EGNKLTVTGEKIDASKLLEKLSNKTKKKVDLISPQSKKEKDSKPKVKGDEDQTSSNNNKS 122
           +GNKLTV G+KIDA+KL EKLSNKTKKKVDLISPQ KKEKDSKPK K D+DQTSSNNNK 
Sbjct: 66  DGNKLTVMGKKIDATKLREKLSNKTKKKVDLISPQPKKEKDSKPKDKIDDDQTSSNNNKF 125

Query: 123 DKKTEENKKKPKEPPVTTAVLKVALHCQGCIEKIQRITTKFKGVQEMSLDKQKELVTVKG 182
           DKKT+ENKKKPKEPPVTTAVLKV LHCQGCIEKIQR+TTKFKGVQEMS+D+QK+ V VKG
Sbjct: 126 DKKTDENKKKPKEPPVTTAVLKVPLHCQGCIEKIQRVTTKFKGVQEMSVDRQKDSVMVKG 185

Query: 183 TMDVKALSGCLTERLKRAVEIVPTKKEKDKDNNNNNNKNEGGGGTKKEPPAAGDGDSNSN 242
           TMDVKAL G L+ERLKR VEIVP KKEK+K+  NN N+  GG   K  P   GDG+ N N
Sbjct: 186 TMDVKALIGSLSERLKRPVEIVPAKKEKEKEKENNKNEGGGGDDKKDSPTGDGDGNGNGN 245

Query: 243 ENGGGKKKKKGGNGGGDGGATGMEEGGGGGKMEGYKMEYMGVGGGIGYGYGLNGYGMSTG 302
            NGGGKKKKKGGNGGG  G  G   GGGGGKMEG KMEYMG+ GGIGYGYG  GYG++T 
Sbjct: 246 GNGGGKKKKKGGNGGGGEGEEG--GGGGGGKMEGNKMEYMGM-GGIGYGYGY-GYGLNT- 305

Query: 303 LGYGYGYGAGGVVGENLHAPQLFSDENPNACSIM 336
              GYGYG GG+VGENLHAPQLFSDENPNAC IM
Sbjct: 306 ---GYGYGPGGLVGENLHAPQLFSDENPNACFIM 331

BLAST of Cp4.1LG20g07160 vs. NCBI nr
Match: gi|703157783|ref|XP_010111816.1| (hypothetical protein L484_020605 [Morus notabilis])

HSP 1 Score: 271.9 bits (694), Expect = 1.5e-69
Identity = 191/344 (55.52%), Positives = 220/344 (63.95%), Query Frame = 1

Query: 4   KKTKNENEKNGDGGGGKKKEENPFTVVLKVDMHCEGCANKITKCVKGFEGVQTVKAEIEG 63
           KK   E +KNGDGG  KKKE+N  TVVLKVDMHCEGCA KI K VK F+GV   KAE   
Sbjct: 33  KKNGGEKKKNGDGGD-KKKEDNSLTVVLKVDMHCEGCATKIVKTVKSFDGVDDAKAEFAA 92

Query: 64  NKLTVTGEKIDASKLLEKLSNKTKKKVDLISPQSKKEKDSKPKVKGDEDQTSSNNNKSDK 123
           NKLTV G K+D SKL E L+ KTKKKVDLISPQ  K+ D+K     ++ +T  N+NK  K
Sbjct: 93  NKLTVVG-KVDPSKLREMLAVKTKKKVDLISPQPSKKDDNK---NDNKKKTDENDNK--K 152

Query: 124 KTEENKKKPKEPPVTTAVLKVALHCQGCIEKIQRITTKFKGVQEMSLDKQKELVTVKGTM 183
           K +E K K KEPPVTTAVLK+ LHCQGCI KI++  TK KG  +MS+D+QKELVTV G+M
Sbjct: 153 KPDEKKPKDKEPPVTTAVLKLRLHCQGCIGKIRKTVTKTKGFNDMSIDEQKELVTVIGSM 212

Query: 184 DVKALSGCLTERLKRAVEIVPTKKEKDKDNNNNNNKNEGGGGTKKEPPAAGDGDSNSNEN 243
           D+KAL+  L E+LKR VEIVP KKEKD                       G+ D    +N
Sbjct: 213 DMKALAESLQEKLKRPVEIVPPKKEKD----------------------GGEKDGGKADN 272

Query: 244 GGGKKKKKGGNGGGDGGATGMEEGGGGGKMEGYKMEYMGVGGGIGYGYGLN---GYGMST 303
           GGG  KKKGG GGG GG  G +  G GGKME  +MEYMG     GYGYG     GY    
Sbjct: 273 GGG-GKKKGGGGGGGGGDGGQKAEGDGGKMEESRMEYMG-QPLFGYGYGPGPGFGYAYPP 332

Query: 304 GLGYGYGYGAG---------GVVGENLHAPQLFSDENPNACSIM 336
           G G+G  Y  G         G VGE LHAPQ+FSDENPNACS+M
Sbjct: 333 GPGFGSAYPPGPGFGSAYPPGYVGEQLHAPQVFSDENPNACSVM 345

BLAST of Cp4.1LG20g07160 vs. NCBI nr
Match: gi|567863674|ref|XP_006424491.1| (hypothetical protein CICLE_v10028824mg [Citrus clementina])

HSP 1 Score: 264.2 bits (674), Expect = 3.0e-67
Identity = 185/340 (54.41%), Positives = 224/340 (65.88%), Query Frame = 1

Query: 3   EKKTKNENEKNGDGGGGKKKEE--NPFTVVLKVDMHCEGCANKITKCVKGFEGVQTVKAE 62
           EKK K E E  GD    KKK++  +  TV+LKVDMHCEGCANKI +  + FEGV+ VKAE
Sbjct: 24  EKKKKEEEE--GDAVAEKKKDDKKSSVTVILKVDMHCEGCANKIVRYARSFEGVEAVKAE 83

Query: 63  IEGNKLTVTGEKIDASKLLEKLSNKTKKKVDLISPQSKKE-KDSKPKVKGDEDQTSSNNN 122
           +  NK+T+ G  +D SK+ EKL  KTKKK+DLISPQ KK+ KD +PK          +N 
Sbjct: 84  VAANKITIVGA-VDPSKIREKLDKKTKKKIDLISPQPKKDNKDKEPK---------QDNK 143

Query: 123 KSDKKTEENKKKPKEPPVTTAVLKVALHCQGCIEKIQRITTKFKGVQEMSLDKQKELVTV 182
             D K+ ++KK PKEPPVTTAVLK+ LHCQGCIEKI +I +K KGV + S+DKQK+ VTV
Sbjct: 144 PKDNKSPDDKK-PKEPPVTTAVLKLGLHCQGCIEKILKIVSKTKGVMDKSIDKQKDTVTV 203

Query: 183 KGTMDVKALSGCLTERLKRAVEIVPTKKEKDKDNNNN-NNKNEGGGGTKKEPPAAGDGDS 242
           KGTMD KAL+  L ERLKR VEIVP KKEK+K+     N++ E  GG             
Sbjct: 204 KGTMDAKALAEVLKERLKRPVEIVPPKKEKEKEKEKEKNDEKESNGG------------D 263

Query: 243 NSNENGGGKKKKKGGNGGGDGGATGMEEGGGGGKMEGYKMEY--MGV-GGGIGYGYGLNG 302
           N+N   GG KKKKGG GGG G   G + GGGGGKME  +MEY  MGV G G G+GY ++G
Sbjct: 264 NNNSGNGGSKKKKGGGGGGGGQEVG-DRGGGGGKMEESRMEYFPMGVPGSGYGHGYQIHG 323

Query: 303 YGMSTGLGYGYGYGAGGVVGENLHAPQLFSDENPNACSIM 336
                  GY YGY  GG   +   APQ+FSDENPNAC +M
Sbjct: 324 -------GYEYGYPVGGYYHQPA-APQMFSDENPNACVVM 329

The following BLAST results are available for this feature:
Database: Swiss-Prot
Match Name	E-value	Identity	Description
HIP3_ARATH	9.7e-66	52.68	Heavy metal-associated isoprenylated plant protein 3 OS=Arabidopsis thaliana GN=...
HIP26_ARATH	2.7e-07	41.67	Heavy metal-associated isoprenylated plant protein 26 OS=Arabidopsis thaliana GN...
Database: TrEMBL
Match Name	E-value	Identity	Description
A0A0A0LSA4_CUCSA	4.2e-124	78.87	Uncharacterized protein OS=Cucumis sativus GN=Csa_1G103270 PE=4 SV=1
W9S929_9ROSA	1.0e-69	55.52	Uncharacterized protein OS=Morus notabilis GN=L484_020605 PE=4 SV=1
V4SL19_9ROSI	2.1e-67	54.41	Uncharacterized protein OS=Citrus clementina GN=CICLE_v10028824mg PE=4 SV=1
A0A067EYM6_CITSI	1.8e-66	53.57	Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g020014mg PE=4 SV=1
A0A067L9R4_JATCU	2.3e-66	51.51	Uncharacterized protein OS=Jatropha curcas GN=JCGZ_05455 PE=4 SV=1
Database: TAIR10
Match Name	E-value	Identity	Description
AT5G60800.2	1.6e-66	52.38	Heavy metal transport/detoxification superfamily protein
AT2G36950.1	7.7e-37	40.40	Heavy metal transport/detoxification superfamily protein
AT2G28090.1	2.5e-35	43.48	Heavy metal transport/detoxification superfamily protein
AT5G03380.1	3.2e-35	36.86	Heavy metal transport/detoxification superfamily protein
AT3G02960.1	1.2e-29	34.80	Heavy metal transport/detoxification superfamily protein
Database: NCBI nr
Match Name	E-value	Identity	Description
gi|449459106|ref|XP_004147287.1|	6.0e-124	78.87	PREDICTED: protein FAM98B-like [Cucumis sativus]
gi|659072031|ref|XP_008463112.1|	6.0e-124	78.14	PREDICTED: keratin, type I cytoskeletal 9-like isoform X1 [Cucumis melo]
gi|659072033|ref|XP_008463119.1|	6.0e-124	78.14	PREDICTED: aspartate, glycine, lysine and serine-rich protein-like isoform X2 [C...
gi|703157783|ref|XP_010111816.1|	1.5e-69	55.52	hypothetical protein L484_020605 [Morus notabilis]
gi|567863674|ref|XP_006424491.1|	3.0e-67	54.41	hypothetical protein CICLE_v10028824mg [Citrus clementina]
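
A summary like the one above is often filtered by E-value before further analysis. The sketch below does this for the five TAIR10 hits, with values copied from this page; the cutoff of 1e-30 is an arbitrary illustrative threshold, and `significant_hits` is a hypothetical helper.

```python
# (match name, E-value as printed, percent identity) for the TAIR10 hits.
TAIR10_HITS = [
    ("AT5G60800.2", "1.6e-66", 52.38),
    ("AT2G36950.1", "7.7e-37", 40.40),
    ("AT2G28090.1", "2.5e-35", 43.48),
    ("AT5G03380.1", "3.2e-35", 36.86),
    ("AT3G02960.1", "1.2e-29", 34.80),
]


def significant_hits(hits, max_evalue=1e-30):
    """Keep hits at or below the E-value cutoff, best (smallest) first."""
    kept = [
        (name, float(evalue), identity)
        for name, evalue, identity in hits
        if float(evalue) <= max_evalue
    ]
    return sorted(kept, key=lambda hit: hit[1])


for name, evalue, identity in significant_hits(TAIR10_HITS):
    print(f"{name}\t{evalue:.1e}\t{identity:.2f}")
```

With this cutoff, AT3G02960.1 (E-value 1.2e-29) is dropped and AT5G60800.2 remains the best-scoring Arabidopsis hit.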
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term	Definition
GO:0046872	metal ion binding
Vocabulary: Biological Process
Term	Definition
GO:0030001	metal ion transport
Vocabulary: INTERPRO
Term	Definition
IPR006121	HMA_dom
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0030001	metal ion transport
biological_process	GO:0016226	iron-sulfur cluster assembly
cellular_component	GO:0005575	cellular_component
molecular_function	GO:0046872	metal ion binding
molecular_function	GO:0051536	iron-sulfur cluster binding
molecular_function	GO:0005198	structural molecule activity
molecular_function	GO:0016787	hydrolase activity
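
For downstream use, the GO assignments can be grouped by category. The following is a minimal sketch that assumes each row has three whitespace-separated fields (category, accession, term name, where only the term name may contain spaces); `group_go_terms` is a hypothetical helper and the rows are copied from the table above.

```python
from collections import defaultdict

GO_ROWS = """\
biological_process GO:0030001 metal ion transport
biological_process GO:0016226 iron-sulfur cluster assembly
cellular_component GO:0005575 cellular_component
molecular_function GO:0046872 metal ion binding
molecular_function GO:0051536 iron-sulfur cluster binding
molecular_function GO:0005198 structural molecule activity
molecular_function GO:0016787 hydrolase activity
"""


def group_go_terms(text):
    """Map each GO category to a list of (accession, term name) pairs."""
    grouped = defaultdict(list)
    for line in text.splitlines():
        if not line.strip():
            continue
        # Split on any whitespace, at most twice, so term names stay intact.
        category, accession, name = line.split(None, 2)
        grouped[category].append((accession, name.strip()))
    return dict(grouped)


for category, terms in group_go_terms(GO_ROWS).items():
    print(category, len(terms), [accession for accession, _ in terms])
```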

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG20g07160.1	Cp4.1LG20g07160.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR006121	Heavy metal-associated domain, HMA	PFAM	PF00403	HMA	coord: 142..201, score: 8.6E-7; coord: 31..86, score: 6.7
IPR006121	Heavy metal-associated domain, HMA	PROFILE	PS50846	HMA_2	coord: 35..84, score: 9
IPR006121	Heavy metal-associated domain, HMA	unknown	SSF55008	HMA, heavy metal-associated domain	coord: 28..85, score: 2.09E-12; coord: 137..194, score: 7.4
None	No IPR available	GENE3D	G3DSA:3.30.70.100		coord: 26..91, score: 1.7E-15; coord: 137..192, score: 2.9
None	No IPR available	PANTHER	PTHR22814	COPPER TRANSPORT PROTEIN ATOX1-RELATED	coord: 242..334, score: 3.6E-99; coord: 14..201, score: 3.6
None	No IPR available	PANTHER	PTHR22814:SF100	HEAVY METAL TRANSPORT/DETOXIFICATION DOMAIN-CONTAINING PROTEIN-RELATED	coord: 242..334, score: 3.6E-99; coord: 14..201, score: 3.6
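
The `coord` entries in the alignment column give the residue ranges of each domain match on the protein. The sketch below extracts them and computes domain lengths, assuming the ranges are 1-based and inclusive (not stated on this page); `domain_spans` and `span_lengths` are hypothetical helpers, and the input is the Pfam PF00403 entry from the table above.

```python
import re

# Captures the two integers in fragments such as "coord: 142..201".
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def domain_spans(alignment_text):
    """Return (start, end) pairs sorted by start position."""
    return sorted((int(a), int(b)) for a, b in COORD_RE.findall(alignment_text))


def span_lengths(spans):
    """Length of each span, assuming 1-based inclusive residue positions."""
    return [end - start + 1 for start, end in spans]


pfam_hma = "coord: 142..201 score: 8.6E-7 coord: 31..86 score: 6.7"
spans = domain_spans(pfam_hma)
print(spans)                # [(31, 86), (142, 201)]
print(span_lengths(spans))  # [56, 60]
```

Under that assumption, the two Pfam HMA matches cover 56 and 60 residues of the 336-residue protein.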

The following gene(s) are paralogous to this gene:
Gene	Paralogue	Organism	Block
Cp4.1LG20g07160	Cp4.1LG16g05000	Cucurbita pepo (Zucchini)	cpecpeB297
Cp4.1LG20g07160	Cp4.1LG19g03860	Cucurbita pepo (Zucchini)	cpecpeB413
Cp4.1LG20g07160	Cp4.1LG02g12670	Cucurbita pepo (Zucchini)	cpecpeB432
The following block(s) are covering this gene:
Gene	Organism	Block
Cp4.1LG20g07160	Cucurbita pepo (Zucchini)	cpecpeB296
Cp4.1LG20g07160	Silver-seed gourd	carcpeB0469