Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, CDS, polypeptide
ATGAAGACCGTCACCGGAAGTATCGTTTCTTCGAAGCCAATCTCTATCTCAAAAGCGGCGTCCACTCTCTCCTCCTTTCTCTCCGTCGACAATGGCGCTTCGAAAGCAATCTGCGCCTATCTGAGACGCGCCTCCGCCTCTTTCAACGAGTTAAAGCAGCTCCACAAGGAGCTGAAATCTTCGCGGTCCGATCGGAAGCACCGGCATCACGGATCCGAGGCTTCAAACGATCCAGAGGCTGCCAGAGATAATCCACAATGGATCGAAGACGGCGAGAAGAAAAATCCCCTGTATCTAAGAGCAAAAGATGGAAAAAGTGGTAAGACGAGTCTCAATGTCCAATCTGAGGATGGGAAGGCGGAAAAGGAAAGCGGTGGGAATGGTGATTTTGAGGATGCATCGGGTGAATATCGAAAGAGAAAGGTCGAGGACTTGAAGACTGAAATTGAAGATAAACCTAACCGAAAAGTAGAGATGGATGTAGAATCAAGTGATAAAGATAAGAGCGTTGTAGCAGTTGAGAAAAAAGGAAAAAAGCACCAGAAAAAGAGCGAGGATAGATATGCTAAGATTGAAGATGATGAACATAAGGCTGGAGCCAGGCGAAGTTCTAGTAAATCGCGAAATAGTGATAACAATGGCGAAATTGAAGCTTCTGCGAAGTTCGTCGAGAACAATATCGCAAGCGGAAAAGATAGAAAGAAGCACGTGGACAAGAAGAGTTTGGGTGACGATAAGGATCAAGTAAAGAGTGAAGGTCACAGAAGAAGAGACGCCGAGGAGGAAAAGAGCACAAATAAGGATAATGATGATGGAACAGAGTCGAGCAAGAAGAAAAAGAAGAAGAAGAAGAAGAAGAAGAACAGGGAAGAAGAAGATGATGATTTTCAGAATAACAGTGGAGGAGCTCTGGTGAAAGAGGAAATTCCAGTTTTGGATGACAAAGAGTTGAAAAGGAAAGAGAAGAAAAAAAGGAAGAATCGAGACTTGGAAGAAGGGGGTGATGATGGGTCTGAGGAACAACAGCGTACGAAGAGAAGAAAAGGAAATTTATGA
mRNA sequence
ATGAAGACCGTCACCGGAAGTATCGTTTCTTCGAAGCCAATCTCTATCTCAAAAGCGGCGTCCACTCTCTCCTCCTTTCTCTCCGTCGACAATGGCGCTTCGAAAGCAATCTGCGCCTATCTGAGACGCGCCTCCGCCTCTTTCAACGAGTTAAAGCAGCTCCACAAGGAGCTGAAATCTTCGCGGTCCGATCGGAAGCACCGGCATCACGGATCCGAGGCTTCAAACGATCCAGAGGCTGCCAGAGATAATCCACAATGGATCGAAGACGGCGAGAAGAAAAATCCCCTGTATCTAAGAGCAAAAGATGGAAAAAGTGGTAAGACGAGTCTCAATGTCCAATCTGAGGATGGGAAGGCGGAAAAGGAAAGCGGTGGGAATGGTGATTTTGAGGATGCATCGGGTGAATATCGAAAGAGAAAGGTCGAGGACTTGAAGACTGAAATTGAAGATAAACCTAACCGAAAAGTAGAGATGGATGTAGAATCAAGTGATAAAGATAAGAGCGTTGTAGCAGTTGAGAAAAAAGGAAAAAAGCACCAGAAAAAGAGCGAGGATAGATATGCTAAGATTGAAGATGATGAACATAAGGCTGGAGCCAGGCGAAGTTCTAGTAAATCGCGAAATAGTGATAACAATGGCGAAATTGAAGCTTCTGCGAAGTTCGTCGAGAACAATATCGCAAGCGGAAAAGATAGAAAGAAGCACGTGGACAAGAAGAGTTTGGGTGACGATAAGGATCAAGTAAAGAGTGAAGGTCACAGAAGAAGAGACGCCGAGGAGGAAAAGAGCACAAATAAGGATAATGATGATGGAACAGAGTCGAGCAAGAAGAAAAAGAAGAAGAAGAAGAAGAAGAAGAACAGGGAAGAAGAAGATGATGATTTTCAGAATAACAGTGGAGGAGCTCTGGTGAAAGAGGAAATTCCAGTTTTGGATGACAAAGAGTTGAAAAGGAAAGAGAAGAAAAAAAGGAAGAATCGAGACTTGGAAGAAGGGGGTGATGATGGGTCTGAGGAACAACAGCGTACGAAGAGAAGAAAAGGAAATTTATGA
Coding sequence (CDS)
ATGAAGACCGTCACCGGAAGTATCGTTTCTTCGAAGCCAATCTCTATCTCAAAAGCGGCGTCCACTCTCTCCTCCTTTCTCTCCGTCGACAATGGCGCTTCGAAAGCAATCTGCGCCTATCTGAGACGCGCCTCCGCCTCTTTCAACGAGTTAAAGCAGCTCCACAAGGAGCTGAAATCTTCGCGGTCCGATCGGAAGCACCGGCATCACGGATCCGAGGCTTCAAACGATCCAGAGGCTGCCAGAGATAATCCACAATGGATCGAAGACGGCGAGAAGAAAAATCCCCTGTATCTAAGAGCAAAAGATGGAAAAAGTGGTAAGACGAGTCTCAATGTCCAATCTGAGGATGGGAAGGCGGAAAAGGAAAGCGGTGGGAATGGTGATTTTGAGGATGCATCGGGTGAATATCGAAAGAGAAAGGTCGAGGACTTGAAGACTGAAATTGAAGATAAACCTAACCGAAAAGTAGAGATGGATGTAGAATCAAGTGATAAAGATAAGAGCGTTGTAGCAGTTGAGAAAAAAGGAAAAAAGCACCAGAAAAAGAGCGAGGATAGATATGCTAAGATTGAAGATGATGAACATAAGGCTGGAGCCAGGCGAAGTTCTAGTAAATCGCGAAATAGTGATAACAATGGCGAAATTGAAGCTTCTGCGAAGTTCGTCGAGAACAATATCGCAAGCGGAAAAGATAGAAAGAAGCACGTGGACAAGAAGAGTTTGGGTGACGATAAGGATCAAGTAAAGAGTGAAGGTCACAGAAGAAGAGACGCCGAGGAGGAAAAGAGCACAAATAAGGATAATGATGATGGAACAGAGTCGAGCAAGAAGAAAAAGAAGAAGAAGAAGAAGAAGAAGAACAGGGAAGAAGAAGATGATGATTTTCAGAATAACAGTGGAGGAGCTCTGGTGAAAGAGGAAATTCCAGTTTTGGATGACAAAGAGTTGAAAAGGAAAGAGAAGAAAAAAAGGAAGAATCGAGACTTGGAAGAAGGGGGTGATGATGGGTCTGAGGAACAACAGCGTACGAAGAGAAGAAAAGGAAATTTATGA
Protein sequence
MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKSSRSDRKHRHHGSEASNDPEAARDNPQWIEDGEKKNPLYLRAKDGKSGKTSLNVQSEDGKAEKESGGNGDFEDASGEYRKRKVEDLKTEIEDKPNRKVEMDVESSDKDKSVVAVEKKGKKHQKKSEDRYAKIEDDEHKAGARRSSSKSRNSDNNGEIEASAKFVENNIASGKDRKKHVDKKSLGDDKDQVKSEGHRRRDAEEEKSTNKDNDDGTESSKKKKKKKKKKKNREEEDDDFQNNSGGALVKEEIPVLDDKELKRKEKKKRKNRDLEEGGDDGSEEQQRTKRRKGNL
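The protein sequence is the conceptual translation of the CDS. The following is a minimal sketch of that step, assuming the standard genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate` are illustrative and are not part of the database. Only the first 30 and last 15 nucleotides of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
# Assumption: this nuclear gene uses the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    residues = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 and last 15 nucleotides of the CDS listed above.
print(translate("ATGAAGACCGTCACCGGAAGTATCGTTTCT"))  # MKTVTGSIVS
print(translate("AAAGGAAATTTATGA"))  # KGNL
```

The two outputs match the first ten and last four residues of the protein sequence. The final codon, TGA, is a stop codon and is not reported as a residue.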
Homology
BLAST of CmaCh20G004500 vs. ExPASy TrEMBL
Match:
A0A6J1JCJ7 (cylicin-1-like OS=Cucurbita maxima OX=3661 GN=LOC111484496 PE=4 SV=1)
HSP 1 Score: 619.8 bits (1597), Expect = 7.1e-174
Identity = 351/351 (100.00%), Positives = 351/351 (100.00%), Query Frame = 0
Query: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS
Sbjct: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
Query: 61 SRSDRKHRHHGSEASNDPEAARDNPQWIEDGEKKNPLYLRAKDGKSGKTSLNVQSEDGKA 120
SRSDRKHRHHGSEASNDPEAARDNPQWIEDGEKKNPLYLRAKDGKSGKTSLNVQSEDGKA
Sbjct: 61 SRSDRKHRHHGSEASNDPEAARDNPQWIEDGEKKNPLYLRAKDGKSGKTSLNVQSEDGKA 120
Query: 121 EKESGGNGDFEDASGEYRKRKVEDLKTEIEDKPNRKVEMDVESSDKDKSVVAVEKKGKKH 180
EKESGGNGDFEDASGEYRKRKVEDLKTEIEDKPNRKVEMDVESSDKDKSVVAVEKKGKKH
Sbjct: 121 EKESGGNGDFEDASGEYRKRKVEDLKTEIEDKPNRKVEMDVESSDKDKSVVAVEKKGKKH 180
Query: 181 QKKSEDRYAKIEDDEHKAGARRSSSKSRNSDNNGEIEASAKFVENNIASGKDRKKHVDKK 240
QKKSEDRYAKIEDDEHKAGARRSSSKSRNSDNNGEIEASAKFVENNIASGKDRKKHVDKK
Sbjct: 181 QKKSEDRYAKIEDDEHKAGARRSSSKSRNSDNNGEIEASAKFVENNIASGKDRKKHVDKK 240
Query: 241 SLGDDKDQVKSEGHRRRDAEEEKSTNKDNDDGTESSKKKKKKKKKKKNREEEDDDFQNNS 300
SLGDDKDQVKSEGHRRRDAEEEKSTNKDNDDGTESSKKKKKKKKKKKNREEEDDDFQNNS
Sbjct: 241 SLGDDKDQVKSEGHRRRDAEEEKSTNKDNDDGTESSKKKKKKKKKKKNREEEDDDFQNNS 300
Query: 301 GGALVKEEIPVLDDKELKRKEKKKRKNRDLEEGGDDGSEEQQRTKRRKGNL 352
GGALVKEEIPVLDDKELKRKEKKKRKNRDLEEGGDDGSEEQQRTKRRKGNL
Sbjct: 301 GGALVKEEIPVLDDKELKRKEKKKRKNRDLEEGGDDGSEEQQRTKRRKGNL 351
BLAST of CmaCh20G004500 vs. ExPASy TrEMBL
Match:
A0A6J1FSX8 (glutamic acid-rich protein-like OS=Cucurbita moschata OX=3662 GN=LOC111448174 PE=4 SV=1)
HSP 1 Score: 562.8 bits (1449), Expect = 1.0e-156
Identity = 327/354 (92.37%), Positives = 335/354 (94.63%), Query Frame = 0
Query: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS
Sbjct: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
Query: 61 SRSDRKHRHHGSEASNDPEAARDNPQWIEDGEKKNPLYLRAKDGKSGKTSLNVQSE---D 120
SRSDRKHRHHGSEASNDPEA+R NP WIED EKKNPLYLRAKDG+SGK S NVQSE D
Sbjct: 61 SRSDRKHRHHGSEASNDPEASRGNPHWIEDDEKKNPLYLRAKDGRSGKPSFNVQSEDGKD 120
Query: 121 GKAEKESGGNGDFEDASGEYRKRKVEDLKTEIEDKPNRKVEMDVESSDKDKSVVAVEKKG 180
GK EKESGG+GDFEDASGEYRKRKV DLKTEIEDKPNRKVEMDVESSDKDKSVVAVEKKG
Sbjct: 121 GKTEKESGGSGDFEDASGEYRKRKVGDLKTEIEDKPNRKVEMDVESSDKDKSVVAVEKKG 180
Query: 181 KKHQKKSEDRYAKIEDDEHKAGARRSSSKSRNSDNNGEIEASAKFVENNIASGKDRKKHV 240
KKH+KKSEDR+AKIEDDE + GARRS SKSRNSDNNGEIEAS KFVENNIASGKDRKKH
Sbjct: 181 KKHKKKSEDRHAKIEDDEREDGARRSYSKSRNSDNNGEIEASGKFVENNIASGKDRKKHE 240
Query: 241 DKKSLGDDKDQVKSEGHRRRDAEEEKSTNKDNDDGTESSKKKKKKKKKKKNREEEDDDFQ 300
DKKSLGDDKDQVKSEG RRRDAEEEKSTNKDNDDGTES+ KKKKKKKKKKNREEEDDDFQ
Sbjct: 241 DKKSLGDDKDQVKSEGQRRRDAEEEKSTNKDNDDGTEST-KKKKKKKKKKNREEEDDDFQ 300
Query: 301 NNSGGALVKEEIPVLDDKELKRKEKKKRKNRDLEEGGDDGSEEQQRTKRRKGNL 352
NNSGGA+VKEEIPV DDKELKRKEKKKRKNR LEEGGDDGSEEQQRTKRRKGNL
Sbjct: 301 NNSGGAMVKEEIPVSDDKELKRKEKKKRKNRGLEEGGDDGSEEQQRTKRRKGNL 353
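The percentages on an HSP summary line follow directly from the counts: matching positions divided by alignment length, including gap columns. The following is a minimal sketch that parses such a line and recomputes both values. The helper name `parse_hsp_stats` and the regular expression are illustrative assumptions, not part of any BLAST tool; the pattern accepts the label with or without its second "i" because some exports misspell it.

```python
import re

# Matches e.g. "Identity = 327/354 (92.37%), Positives = 335/354 (94.63%)".
# "Pos(?:i)?tives" also tolerates the misspelling "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos(?:i)?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract identity/positive counts and recompute their percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, ident_len, _, positive, pos_len, _ = m.groups()
    return {
        "identical": int(identical),
        "positive": int(positive),
        "alignment_length": int(ident_len),
        "identity_pct": round(100 * int(identical) / int(ident_len), 2),
        "positives_pct": round(100 * int(positive) / int(pos_len), 2),
    }


stats = parse_hsp_stats(
    "Identity = 327/354 (92.37%), Positives = 335/354 (94.63%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])  # 92.37 94.63
```

The alignment length (354) is greater than the query length because the alignment contains gap columns.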
BLAST of CmaCh20G004500 vs. ExPASy TrEMBL
Match:
A0A0A0KCS1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G301030 PE=4 SV=1)
HSP 1 Score: 361.3 bits (926), Expect = 4.6e-96
Identity = 240/359 (66.85%), Positives = 273/359 (76.04%), Query Frame = 0
Query: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
MKTV GS+VSSKPISISKAASTLSSFLS DNGASKA+CAYLRRAS SFNELKQLHKELKS
Sbjct: 1 MKTVNGSVVSSKPISISKAASTLSSFLSADNGASKALCAYLRRASDSFNELKQLHKELKS 60
Query: 61 SRSDRKHRHHGSEASNDPEAARDNPQWIEDGEKKNPLYLR------AKDGKSGKTSLNVQ 120
S S RKH HHGSE SN+ EAA + +EDG+K N KD + KTSL VQ
Sbjct: 61 SCSVRKHLHHGSEVSNEFEAAIHDQYRVEDGDKNNSSVSEKKKRPDRKDRTTDKTSLRVQ 120
Query: 121 S---EDGKAEKESGGNGDFEDASGEYRKRKVEDLKTEIEDKPNRKVEMDVESSDKDKSVV 180
S + GK E+GGNG+ ED +G K+K +LK EIEDKP+ KVEMDVESSD+DKSVV
Sbjct: 121 SYNEQIGKTPMENGGNGNLEDVTG---KKKGSELKIEIEDKPSGKVEMDVESSDRDKSVV 180
Query: 181 AVEKKGKKHQKKSEDRYAKIEDDEHKAGARRSSSKSRNSDNNGEIEASAKFVENNIASGK 240
AVEKK K+H+KKSEDR+ IEDDE ++GAR KS+N+DNN + EAS +FVENN+A+GK
Sbjct: 181 AVEKKRKRHKKKSEDRHDDIEDDERESGARLKHGKSQNTDNNCDAEASGEFVENNVANGK 240
Query: 241 DRKKHVDKKSLGDDKDQVKSEGHRRRDAEEEKSTNKDNDDGTESSKKKKKKKKKKKNREE 300
RKK DKK L D KDQVKSE RR D +E KSTN DND+GT+ KKKKK+ R E
Sbjct: 241 SRKKLEDKKRLDDVKDQVKSEDQRRGDVKEGKSTNNDNDNGTDHVDLSPKKKKKR--RRE 300
Query: 301 EDDDFQNNSGGALVKEEIPVLDDKELKRKEKKKRKNRDL-EEGGDDGSEEQQRTKRRKG 350
EDDDFQ NSG A+VKEE+PVLD KELKRKEKKK KNR+L EEG DDGSEEQ TKRRKG
Sbjct: 301 EDDDFQKNSGEAMVKEEVPVLDSKELKRKEKKKSKNRELGEEGRDDGSEEQHSTKRRKG 354
BLAST of CmaCh20G004500 vs. ExPASy TrEMBL
Match:
A0A5A7SW64 (Glutamic acid-rich protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold228G00720 PE=4 SV=1)
HSP 1 Score: 358.2 bits (918), Expect = 3.9e-95
Identity = 244/360 (67.78%), Positives = 273/360 (75.83%), Query Frame = 0
Query: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
MKTVTGS+VSSKPISISKAASTLSSFLS DNGASKA+CAYLRRAS SFNELK LHKELKS
Sbjct: 1 MKTVTGSVVSSKPISISKAASTLSSFLSADNGASKALCAYLRRASDSFNELKHLHKELKS 60
Query: 61 SRSDRKHRHHGSEASNDPEAARDNPQWIEDGEKKNPLYLRAK---DGK---SGKTSLNVQ 120
S S RKH HHGS+ SN+ EAA DN +EDG+KKN K D K + KTSL VQ
Sbjct: 61 SPSVRKHLHHGSKVSNEFEAAMDNEYRVEDGDKKNSSVSEKKKRPDSKYRTTDKTSLRVQ 120
Query: 121 SED---GKAEKESGGNGDFEDASGEYRKRKVEDLKTEIEDKPNRKVEMDVESSDKDKSVV 180
S+D GK E+GGNG+ ED SG KRK LK EIEDKP+ KVEMDVESSD VV
Sbjct: 121 SDDEQSGKTAMENGGNGNLEDVSG---KRKGGGLKIEIEDKPSGKVEMDVESSD----VV 180
Query: 181 AVEKKGKKHQKKSEDRYAKIEDDEHKAGARRSSSKSRNSDNN-GEIEASAKFVENNIASG 240
AVEKK KKH+KKSEDR+ IEDDE ++GAR KS+N+DNN EAS +FVENN+A G
Sbjct: 181 AVEKKRKKHKKKSEDRHGDIEDDERESGARLKHGKSQNTDNNCDNAEASGEFVENNVAKG 240
Query: 241 KDRKKHVDKKSLGDDKDQVKSEGHRRRDAEEEKSTNKDNDDGTESSKKKKKKKKKKKNRE 300
K RKK DK+SLGD KDQVKSE RR D +EE+ST+ DN +GT+ KKKKK+K R
Sbjct: 241 KSRKKLEDKRSLGDVKDQVKSEDQRRGDIKEERSTDNDNGNGTDLVDLSTKKKKKRKQR- 300
Query: 301 EEDDDFQNNSGGALVKEEIPVLDDKELKRKEKKKRKNRDL-EEGGDDGSEEQQRTKRRKG 350
EEDDDFQ NSGGA+VKEE+PVLD KELKRKEKKK KNR+L EEG DDGSEEQ KRRKG
Sbjct: 301 EEDDDFQKNSGGAMVKEEVPVLDSKELKRKEKKKSKNRELGEEGHDDGSEEQHSRKRRKG 352
BLAST of CmaCh20G004500 vs. ExPASy TrEMBL
Match:
A0A5D3CVE3 (Glutamic acid-rich protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold84G00960 PE=4 SV=1)
HSP 1 Score: 355.1 bits (910), Expect = 3.3e-94
Identity = 243/360 (67.50%), Positives = 272/360 (75.56%), Query Frame = 0
Query: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
MKTVTGS+VSSKPISISKAASTLSSFLS DNGASKA+CAYLRRAS SFNELK LHKELKS
Sbjct: 1 MKTVTGSVVSSKPISISKAASTLSSFLSADNGASKALCAYLRRASDSFNELKHLHKELKS 60
Query: 61 SRSDRKHRHHGSEASNDPEAARDNPQWIEDGEKKNPLYLRAK---DGK---SGKTSLNVQ 120
S S RKH HHGS+ SN+ EAA DN +EDG+KKN K D K + KTSL VQ
Sbjct: 61 SPSVRKHLHHGSKVSNEFEAAMDNEYRVEDGDKKNSSVSEKKKRPDSKYRTTDKTSLRVQ 120
Query: 121 SED---GKAEKESGGNGDFEDASGEYRKRKVEDLKTEIEDKPNRKVEMDVESSDKDKSVV 180
S+D GK E+GGNG+ ED SG KRK LK EIEDKP+ KVEMDVESSD VV
Sbjct: 121 SDDEQSGKTAMENGGNGNLEDVSG---KRKGGGLKIEIEDKPSGKVEMDVESSD----VV 180
Query: 181 AVEKKGKKHQKKSEDRYAKIEDDEHKAGARRSSSKSRNSDNN-GEIEASAKFVENNIASG 240
AVEKK KKH+KKSEDR+ IEDDE ++GAR KS+N+DNN EAS +FVENN+A G
Sbjct: 181 AVEKKRKKHKKKSEDRHGDIEDDERESGARLKHGKSQNTDNNCDNAEASGEFVENNVAKG 240
Query: 241 KDRKKHVDKKSLGDDKDQVKSEGHRRRDAEEEKSTNKDNDDGTESSKKKKKKKKKKKNRE 300
K RKK DK+SLGD KDQVKSE RR D +EE+ST+ DN +GT+ KKKKK+K R
Sbjct: 241 KSRKKLEDKRSLGDVKDQVKSEDQRRGDIKEERSTDNDNGNGTDLVDLSTKKKKKRKQR- 300
Query: 301 EEDDDFQNNSGGALVKEEIPVLDDKELKRKEKKKRKNRDL-EEGGDDGSEEQQRTKRRKG 350
EEDDDFQ NSG A+VKEE+PVLD KELKRKEKKK KNR+L EEG DDGSEEQ KRRKG
Sbjct: 301 EEDDDFQKNSGEAMVKEEVPVLDSKELKRKEKKKSKNRELGEEGHDDGSEEQHSRKRRKG 352
BLAST of CmaCh20G004500 vs. NCBI nr
Match:
XP_022986894.1 (cylicin-1-like [Cucurbita maxima] >XP_022986895.1 cylicin-1-like [Cucurbita maxima] >XP_022986896.1 cylicin-1-like [Cucurbita maxima])
HSP 1 Score: 619.8 bits (1597), Expect = 1.5e-173
Identity = 351/351 (100.00%), Positives = 351/351 (100.00%), Query Frame = 0
Query: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS
Sbjct: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
Query: 61 SRSDRKHRHHGSEASNDPEAARDNPQWIEDGEKKNPLYLRAKDGKSGKTSLNVQSEDGKA 120
SRSDRKHRHHGSEASNDPEAARDNPQWIEDGEKKNPLYLRAKDGKSGKTSLNVQSEDGKA
Sbjct: 61 SRSDRKHRHHGSEASNDPEAARDNPQWIEDGEKKNPLYLRAKDGKSGKTSLNVQSEDGKA 120
Query: 121 EKESGGNGDFEDASGEYRKRKVEDLKTEIEDKPNRKVEMDVESSDKDKSVVAVEKKGKKH 180
EKESGGNGDFEDASGEYRKRKVEDLKTEIEDKPNRKVEMDVESSDKDKSVVAVEKKGKKH
Sbjct: 121 EKESGGNGDFEDASGEYRKRKVEDLKTEIEDKPNRKVEMDVESSDKDKSVVAVEKKGKKH 180
Query: 181 QKKSEDRYAKIEDDEHKAGARRSSSKSRNSDNNGEIEASAKFVENNIASGKDRKKHVDKK 240
QKKSEDRYAKIEDDEHKAGARRSSSKSRNSDNNGEIEASAKFVENNIASGKDRKKHVDKK
Sbjct: 181 QKKSEDRYAKIEDDEHKAGARRSSSKSRNSDNNGEIEASAKFVENNIASGKDRKKHVDKK 240
Query: 241 SLGDDKDQVKSEGHRRRDAEEEKSTNKDNDDGTESSKKKKKKKKKKKNREEEDDDFQNNS 300
SLGDDKDQVKSEGHRRRDAEEEKSTNKDNDDGTESSKKKKKKKKKKKNREEEDDDFQNNS
Sbjct: 241 SLGDDKDQVKSEGHRRRDAEEEKSTNKDNDDGTESSKKKKKKKKKKKNREEEDDDFQNNS 300
Query: 301 GGALVKEEIPVLDDKELKRKEKKKRKNRDLEEGGDDGSEEQQRTKRRKGNL 352
GGALVKEEIPVLDDKELKRKEKKKRKNRDLEEGGDDGSEEQQRTKRRKGNL
Sbjct: 301 GGALVKEEIPVLDDKELKRKEKKKRKNRDLEEGGDDGSEEQQRTKRRKGNL 351
BLAST of CmaCh20G004500 vs. NCBI nr
Match:
XP_022943393.1 (glutamic acid-rich protein-like [Cucurbita moschata] >XP_022943394.1 glutamic acid-rich protein-like [Cucurbita moschata] >XP_022943395.1 glutamic acid-rich protein-like [Cucurbita moschata] >XP_022943397.1 glutamic acid-rich protein-like [Cucurbita moschata] >XP_022943398.1 glutamic acid-rich protein-like [Cucurbita moschata])
HSP 1 Score: 562.8 bits (1449), Expect = 2.1e-156
Identity = 327/354 (92.37%), Positives = 335/354 (94.63%), Query Frame = 0
Query: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS
Sbjct: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
Query: 61 SRSDRKHRHHGSEASNDPEAARDNPQWIEDGEKKNPLYLRAKDGKSGKTSLNVQSE---D 120
SRSDRKHRHHGSEASNDPEA+R NP WIED EKKNPLYLRAKDG+SGK S NVQSE D
Sbjct: 61 SRSDRKHRHHGSEASNDPEASRGNPHWIEDDEKKNPLYLRAKDGRSGKPSFNVQSEDGKD 120
Query: 121 GKAEKESGGNGDFEDASGEYRKRKVEDLKTEIEDKPNRKVEMDVESSDKDKSVVAVEKKG 180
GK EKESGG+GDFEDASGEYRKRKV DLKTEIEDKPNRKVEMDVESSDKDKSVVAVEKKG
Sbjct: 121 GKTEKESGGSGDFEDASGEYRKRKVGDLKTEIEDKPNRKVEMDVESSDKDKSVVAVEKKG 180
Query: 181 KKHQKKSEDRYAKIEDDEHKAGARRSSSKSRNSDNNGEIEASAKFVENNIASGKDRKKHV 240
KKH+KKSEDR+AKIEDDE + GARRS SKSRNSDNNGEIEAS KFVENNIASGKDRKKH
Sbjct: 181 KKHKKKSEDRHAKIEDDEREDGARRSYSKSRNSDNNGEIEASGKFVENNIASGKDRKKHE 240
Query: 241 DKKSLGDDKDQVKSEGHRRRDAEEEKSTNKDNDDGTESSKKKKKKKKKKKNREEEDDDFQ 300
DKKSLGDDKDQVKSEG RRRDAEEEKSTNKDNDDGTES+ KKKKKKKKKKNREEEDDDFQ
Sbjct: 241 DKKSLGDDKDQVKSEGQRRRDAEEEKSTNKDNDDGTEST-KKKKKKKKKKNREEEDDDFQ 300
Query: 301 NNSGGALVKEEIPVLDDKELKRKEKKKRKNRDLEEGGDDGSEEQQRTKRRKGNL 352
NNSGGA+VKEEIPV DDKELKRKEKKKRKNR LEEGGDDGSEEQQRTKRRKGNL
Sbjct: 301 NNSGGAMVKEEIPVSDDKELKRKEKKKRKNRGLEEGGDDGSEEQQRTKRRKGNL 353
BLAST of CmaCh20G004500 vs. NCBI nr
Match:
XP_023511985.1 (DNA topoisomerase 1-like [Cucurbita pepo subsp. pepo] >XP_023511986.1 DNA topoisomerase 1-like [Cucurbita pepo subsp. pepo] >XP_023511987.1 DNA topoisomerase 1-like [Cucurbita pepo subsp. pepo] >XP_023511988.1 DNA topoisomerase 1-like [Cucurbita pepo subsp. pepo])
HSP 1 Score: 560.1 bits (1442), Expect = 1.4e-155
Identity = 326/354 (92.09%), Positives = 335/354 (94.63%), Query Frame = 0
Query: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS
Sbjct: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
Query: 61 SRSDRKHRHHGSEASNDPEAARDNPQWIEDGEKKNPLYLRAKDGKSGKTSLNVQSE---D 120
SRSDRKHRHHGSEASNDPEA R NP WIEDGEKKN YLR KDG+SGK SLNVQSE D
Sbjct: 61 SRSDRKHRHHGSEASNDPEAPRVNPHWIEDGEKKNLEYLREKDGRSGKPSLNVQSEDGQD 120
Query: 121 GKAEKESGGNGDFEDASGEYRKRKVEDLKTEIEDKPNRKVEMDVESSDKDKSVVAVEKKG 180
GK E +SGGNGDFEDASGEYRKRKVEDLKTEIEDKPNRKVEMDVESSDKDKSVVAVEKKG
Sbjct: 121 GKTETKSGGNGDFEDASGEYRKRKVEDLKTEIEDKPNRKVEMDVESSDKDKSVVAVEKKG 180
Query: 181 KKHQKKSEDRYAKIEDDEHKAGARRSSSKSRNSDNNGEIEASAKFVENNIASGKDRKKHV 240
KKH+KKSEDR+AKIEDDEH+AGARRS SKSRNSDNNGEIEAS KFVEN+IASGKDRKKH
Sbjct: 181 KKHKKKSEDRHAKIEDDEHEAGARRSYSKSRNSDNNGEIEASGKFVENSIASGKDRKKHE 240
Query: 241 DKKSLGDDKDQVKSEGHRRRDAEEEKSTNKDNDDGTESSKKKKKKKKKKKNREEEDDDFQ 300
DKKSLGDDKDQVKSEG RRRDAEEEKSTNKDNDDGTES+ KKK+KKKKKKNREEEDDDFQ
Sbjct: 241 DKKSLGDDKDQVKSEGQRRRDAEEEKSTNKDNDDGTEST-KKKRKKKKKKNREEEDDDFQ 300
Query: 301 NNSGGALVKEEIPVLDDKELKRKEKKKRKNRDLEEGGDDGSEEQQRTKRRKGNL 352
NNSGGA+VKEEIPV DDKELKRKEKKKRKNR LEEGGDDGSEEQQRTKRRKGNL
Sbjct: 301 NNSGGAMVKEEIPVSDDKELKRKEKKKRKNRGLEEGGDDGSEEQQRTKRRKGNL 353
BLAST of CmaCh20G004500 vs. NCBI nr
Match:
KAG7010591.1 (hypothetical protein SDJN02_27385, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 555.1 bits (1429), Expect = 4.4e-154
Identity = 325/354 (91.81%), Positives = 332/354 (93.79%), Query Frame = 0
Query: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS
Sbjct: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
Query: 61 SRSDRKHRHHGSEASNDPEAARDNPQWIEDGEKKNPLYLRAKDGKSGKTSLNVQSE---D 120
SRSDRKHRHHGSEASNDPEA+RDNP WIEDGEKKNPLYLRAK G+SGK S NVQSE D
Sbjct: 61 SRSDRKHRHHGSEASNDPEASRDNPHWIEDGEKKNPLYLRAKVGRSGKPSFNVQSEDGKD 120
Query: 121 GKAEKESGGNGDFEDASGEYRKRKVEDLKTEIEDKPNRKVEMDVESSDKDKSVVAVEKKG 180
GK EKESGGNGDFEDASGEYRKRKVEDLKTEIEDKPNRKVEMDVESSDKDKSVVAVE K
Sbjct: 121 GKTEKESGGNGDFEDASGEYRKRKVEDLKTEIEDKPNRKVEMDVESSDKDKSVVAVETKR 180
Query: 181 KKHQKKSEDRYAKIEDDEHKAGARRSSSKSRNSDNNGEIEASAKFVENNIASGKDRKKHV 240
KKH+KKSEDR+AKIEDDE + GARRS SKSR SDNNGEIEAS KFVENNIASGKDRKKH
Sbjct: 181 KKHKKKSEDRHAKIEDDERENGARRSYSKSRISDNNGEIEASGKFVENNIASGKDRKKHE 240
Query: 241 DKKSLGDDKDQVKSEGHRRRDAEEEKSTNKDNDDGTESSKKKKKKKKKKKNREEEDDDFQ 300
DKKSL DDKDQVKSEG RRRDAEEEKSTNKDNDDG ES+ KKKKKKKKKKNREEEDDDFQ
Sbjct: 241 DKKSLVDDKDQVKSEGQRRRDAEEEKSTNKDNDDGAEST-KKKKKKKKKKNREEEDDDFQ 300
Query: 301 NNSGGALVKEEIPVLDDKELKRKEKKKRKNRDLEEGGDDGSEEQQRTKRRKGNL 352
NNSGGA+VKEEIPV DDKELKRKEKKKRKNR LEEGGDDGSEEQQRTKRRKGNL
Sbjct: 301 NNSGGAMVKEEIPVSDDKELKRKEKKKRKNRGLEEGGDDGSEEQQRTKRRKGNL 353
BLAST of CmaCh20G004500 vs. NCBI nr
Match:
KAG6570745.1 (hypothetical protein SDJN03_29660, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 538.5 bits (1386), Expect = 4.3e-149
Identity = 313/340 (92.06%), Positives = 321/340 (94.41%), Query Frame = 0
Query: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS
Sbjct: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
Query: 61 SRSDRKHRHHGSEASNDPEAARDNPQWIEDGEKKNPLYLRAKDGKSGKTSLNVQSE---D 120
SRSDRKHRHHGSEASNDPEA+RDNP WIEDGEKKNPLYLRAKDG+SGK S NVQSE D
Sbjct: 61 SRSDRKHRHHGSEASNDPEASRDNPHWIEDGEKKNPLYLRAKDGRSGKPSFNVQSEDGKD 120
Query: 121 GKAEKESGGNGDFEDASGEYRKRKVEDLKTEIEDKPNRKVEMDVESSDKDKSVVAVEKKG 180
GK EKESGGNGDFEDASGEYRKRKVEDLKTEIE+KPNRKVEMDVESSDKDKSVVAVE K
Sbjct: 121 GKTEKESGGNGDFEDASGEYRKRKVEDLKTEIENKPNRKVEMDVESSDKDKSVVAVETKR 180
Query: 181 KKHQKKSEDRYAKIEDDEHKAGARRSSSKSRNSDNNGEIEASAKFVENNIASGKDRKKHV 240
KKH+KKSEDR+AKIEDDE + GARRS SKSR SDNNGEIEAS KFVENNIASGKDRKKH
Sbjct: 181 KKHKKKSEDRHAKIEDDERENGARRSYSKSRISDNNGEIEASGKFVENNIASGKDRKKHE 240
Query: 241 DKKSLGDDKDQVKSEGHRRRDAEEEKSTNKDNDDGTESSKKKKKKKKKKKNREEEDDDFQ 300
DKKSLGDDKDQVKSEG RRRDAEEEKSTNKDNDDGTES+ KKKKKKKKKKNREEEDDDFQ
Sbjct: 241 DKKSLGDDKDQVKSEGQRRRDAEEEKSTNKDNDDGTEST-KKKKKKKKKKNREEEDDDFQ 300
Query: 301 NNSGGALVKEEIPVLDDKELKRKEKKKRKNRDLEEGGDDG 338
NNSGGA+VKEEIPV DDKELKRKEKKKRKNR LEEGGDDG
Sbjct: 301 NNSGGAMVKEEIPVSDDKELKRKEKKKRKNRGLEEGGDDG 339
BLAST of CmaCh20G004500 vs. TAIR 10
Match:
AT5G60030.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G75335.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )
HSP 1 Score: 80.5 bits (197), Expect = 3.0e-15
Identity = 108/307 (35.18%), Positives = 161/307 (52.44%), Query Frame = 0
Query: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
MKTVTG +VS++PIS+SKAA LS F S DNGAS+ + AYLRRASA+F ELK H+E+KS
Sbjct: 1 MKTVTGRVVSAEPISLSKAAKLLSGFASSDNGASQDVSAYLRRASAAFTELKSFHREIKS 60
Query: 61 SR----SDRKHRHHGSEASNDPEAARDNPQWIEDGEK----------KNPLYLRAKDGKS 120
SDR+ + ++ S+D ++ R+ DG K +Y R +D K
Sbjct: 61 KETKPSSDRETKSTETKQSSDAKSERNVIDEF-DGRKIRYRNSEAVSVESVYGRERDEKK 120
Query: 121 GKTSLNVQSEDGKAEKESGGNGDFEDASGEYRKRKVEDLK---TEIEDKPNRKVEMDVES 180
K S + D K ++ + E S E R+RK E K + ED + KV+ +E
Sbjct: 121 MKKSKDADVVDEKVNEKL----EAEQRSEERRERKKEKKKKKNNKDEDVVDEKVKEKLE- 180
Query: 181 SDKDKSVVAVEKKGKKHQKKSE----DRYAKIEDDEHKAGARRSSSKSRNSDNNGEIEAS 240
D+ KS E+K KK +K ++ D K+ED++ A + K +N D + E
Sbjct: 181 -DEQKSADRKERKKKKSKKNNDEDVVDEKEKLEDEQKSAEIK---EKKKNKDEDVVDEKE 240
Query: 241 AKFVENNIASGKDRKKHVDKKSLGDDKDQVKSEGHRRRDAEEEKSTNKDNDDGTESSKKK 287
+ +E+ SG +RKK KK D +++ SE + + +K D + G+E K K
Sbjct: 241 KEKLEDEQRSG-ERKKEKKKKRKSD--EEIVSE-----ERKSKKKRKSDEEMGSEERKSK 289
BLAST of CmaCh20G004500 vs. TAIR 10
Match:
AT1G75335.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G60030.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )
HSP 1 Score: 70.9 bits (172), Expect = 2.4e-12
Identity = 47/93 (50.54%), Positives = 63/93 (67.74%), Query Frame = 0
Query: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELK- 60
MKTVTG + S+KPIS+SKAA+ LS F+S +NGAS+ + AYLRRAS +F ELK +H+E+K
Sbjct: 1 MKTVTGRVNSAKPISLSKAATLLSGFVSSENGASQDVSAYLRRASGAFIELKSIHREIKS 60
Query: 61 -----SSRSDRK-HRHHGSEASNDPEAARDNPQ 87
SS+ RK HR GSE + R + +
Sbjct: 61 KETKLSSKKKRKSHREMGSEERKKSKKKRKSSE 93
The following BLAST results are available for this feature:
ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1JCJ7 | 7.1e-174 | 100.00 | cylicin-1-like OS=Cucurbita maxima OX=3661 GN=LOC111484496 PE=4 SV=1
A0A6J1FSX8 | 1.0e-156 | 92.37 | glutamic acid-rich protein-like OS=Cucurbita moschata OX=3662 GN=LOC111448174 PE...
A0A0A0KCS1 | 4.6e-96 | 66.85 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G301030 PE=4 SV=1
A0A5A7SW64 | 3.9e-95 | 67.78 | Glutamic acid-rich protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaff...
A0A5D3CVE3 | 3.3e-94 | 67.50 | Glutamic acid-rich protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaff...
NCBI nr
Match Name | E-value | Identity (%) | Description
XP_022986894.1 | 1.5e-173 | 100.00 | cylicin-1-like [Cucurbita maxima] >XP_022986895.1 cylicin-1-like [Cucurbita maxi...
XP_022943393.1 | 2.1e-156 | 92.37 | glutamic acid-rich protein-like [Cucurbita moschata] >XP_022943394.1 glutamic ac...
XP_023511985.1 | 1.4e-155 | 92.09 | DNA topoisomerase 1-like [Cucurbita pepo subsp. pepo] >XP_023511986.1 DNA topois...
KAG7010591.1 | 4.4e-154 | 91.81 | hypothetical protein SDJN02_27385, partial [Cucurbita argyrosperma subsp. argyro...
KAG6570745.1 | 4.3e-149 | 92.06 | hypothetical protein SDJN03_29660, partial [Cucurbita argyrosperma subsp. sorori...
TAIR 10
Match Name | E-value | Identity (%) | Description
AT5G60030.1 | 3.0e-15 | 35.18 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G75335.1 | 2.4e-12 | 50.54 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
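The summary rows are pipe-delimited, so they can be loaded into records for filtering or sorting. The following is a minimal sketch; the function name `parse_summary_row` and the dictionary keys are illustrative assumptions. Empty cells and a trailing "[more]" link cell, if present, are dropped. Truncated descriptions are kept exactly as listed.

```python
def parse_summary_row(row: str) -> dict:
    """Split one 'Match Name | E-value | Identity | Description' row into fields."""
    cells = [cell.strip() for cell in row.split("|")]
    # Drop empty cells and the "[more]" link cell left over from the web page.
    cells = [cell for cell in cells if cell and cell != "[more]"]
    if len(cells) < 4:
        raise ValueError(f"expected at least 4 fields, got {len(cells)}: {row!r}")
    return {
        "match": cells[0],
        "evalue": float(cells[1]),
        "identity_pct": float(cells[2]),
        # Re-join in case a description itself contained a pipe character.
        "description": " | ".join(cells[3:]),
    }


rows = [
    "AT5G60030.1 | 3.0e-15 | 35.18 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...",
    "A0A0A0KCS1 | 4.6e-96 | 66.85 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G301030 PE=4 SV=1",
]
hits = sorted((parse_summary_row(r) for r in rows), key=lambda h: h["evalue"])
print([h["match"] for h in hits])  # ['A0A0A0KCS1', 'AT5G60030.1']
```

E-values from different databases are not directly comparable because they depend on database size, so sorting is most meaningful within a single table.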