Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGAAGACCGTCACCGGAAGTATCGTTTCTTCGAAGCCAATCTCTATCTCAAAAGCGGCGTCCACTCTCTCCTCCTTTCTCTCCGTCGACAATGGCGCTTCGAAAGCAATCTGCGCCTATCTGAGACGCGCCTCCGCCTCTTTCAACGAGTTAAAGCAGCTCCACAAGGAGTTGAAATCTTCGCGGTCTGATCGGAAGCACCGGCATCACGGATCCGAGGCTTCAAACGATCCAGAGGCTTCCAGAGGTAATCCACATTGGATCGAAGACGACGAGAAGAAAAATCCTCTGTATCTAAGAGCAAAAGATGGAAGAAGCGGTAAGCCGAGTTTTAATGTTCAATCTGAGGATGGGAAGGATGGGAAGACGGAAAAGGAAAGCGGTGGGAGTGGTGATTTTGAGGATGCATCGGGTGAATACCGAAAGAGAAAGGTCGGGGACTTGAAAACTGAAATTGAAGATAAACCTAACCGAAAAGTAGAGATGGATGTGGAATCAAGTGATAAAGATAAGAGCGTTGTAGCAGTTGAGAAAAAAGGAAAAAAGCACAAGAAAAAGAGCGAGGATAGACATGCTAAGATTGAAGATGACGAACGCGAGGATGGAGCCAGGCGAAGTTACAGTAAATCTCGAAATAGTGATAACAATGGCGAAATTGAAGCTTCTGGGAAGTTCGTCGAGAACAATATCGCAAGCGGAAAAGATAGAAAGAAGCACGAGGACAAGAAGAGTTTGGGTGACGATAAGGATCAAGTAAAGAGTGAAGGTCAGAGAAGAAGAGACGCCGAGGAGGAAAAGAGCACAAACAAGGATAATGATGATGGAACAGAGTCGACCAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAACAGGGAAGAAGAAGATGACGATTTTCAGAACAACAGTGGAGGAGCTATGGTGAAAGAGGAAATCCCAGTTTCGGATGACAAAGAGTTGAAAAGGAAAGAGAAGAAAAAAAGGAAGAATCGAGGTTTAGAAGAAGGGGGTGATGATGGGTCCGAGGAACAACAGCGTACGAAGAGAAGAAAAGGAAATTTATGAGACAAGGAAGGATGAACTCACAGTTCTTGCTTCAGTTTACTAATTGCTATCAATGGTAATTGTTTGATGCTTTATTTCATCCTTCTATTTTGATTCTTTATACCAAAAGTATTATCAATCGCTTAGGAAGCTAATTTTGTGTTCATTCTTGTGAAATTTTGACATTAGAACCCTTCCCCTGACCCCTTCTGGTCAATGATTTATTTGTTTGAATTTGTAGCCTGTTGATCTCTGATGTTTATAGGGAAGAAGAACTGGGTGAAAGGCAACTTTTTGCAAAGAACATATAGAAGGTACATTATTTGATGCTTATTTATACAATTTGATCAAGAGATTGATCAGTCTGCATTGGCCAAAAGGTTTTGTGCTTGTATCTCATTGTGAAATTTGTTCTTCTTTGAGTCATCTGCTGACCAAACAAATATGCCATGAAGTTTGCCCCTACTTTGAAGGGTTCTGCATGCCTTGAAGAACCCTTTTTCTGGTGTCAAACCTCCACTTCCATCTGTTGCAAAGCTTGCTAGAACTTTACCACCTTGATAATTTGAGGTTTGGGTCTCAAAATGATTCAAAAACTGAGAAACTGTGGTCCCTTTTCCATAGGCATAGAACTGGAAGTTCACATAGTCTATTAGATCTCCATACTTCCTCCACAAAGCTAAGTAATGTGTTTGGACAGCGTCGTCGTCGAACGGTGCAATGGATGCAAATGAAATAACATTTTGACGTTTGAGACGGAGCAGGAGTTTCCCTATGCACTCGGTGAATACATCGGGGTGTGCTTTGAAGTGTTCGTAATCGATATCGACCCCATCTATATCATACTCCTTCACTATACGAGATATGGACTCGAAAGCATTTTTAACCCAAGATGTAATGGATTTAGGTTTGAAGTATGCAAATTGTTTGCCTGCCGTATCGCCCCCGAGGCTGAGGGC
GAACTTGACGTTTGAATGCTGGGCCTTGATGGAAGAAACAGAAGAAGGCGATAGGATTTGTTCATCCCAAAAGACTCTGAATTTGCCATTTGTTGCAGAAGGAGAAGATGAAGCCGTGTAATCGATGGCGAAGGAGAGGATGAAGTGGAACTCGACATCGGGATGAATAGGTACGTCTGAAAATCTTACATTATTGCCATCAGCTCCTATGTATTCTCTGAATAGTCTTGTGTTATCTAGGGGGGTTGCCTCTAGTGATGGAAGAGTGGGGAATAGCAACGGTAGAAATGAAAATAAGAAGAATATCTGAGCTGAACCCATAATAATTGGTTGAAGCTTGATGACACTTCTTCTCTTTATAAAGCCCTGTGAGATCACACGTTAGTCAGGTAGGGAAACGAAACATTCTTTATAAGGGTGTGGAAACCTCTACCTAGCAACGCATGTTAAAAACCTTGAGAGGAAGCCCAAAGAAGACAATATCTGTTAGTGATGCTAGCGATGGGCTTGAGCTGTTACATTTAAACTCTGATGTAGAAATGGAGAAACATGAGATCCTCGTTGGCAATGGACGGCTACTAGGCAACTAATTTGTCAGTTATGAATTATTTGAGATATATTAGAACCTTGTTTTATCTCAAGAACTTTTGTTATTAGTTTGTAGAAAACAAAGGTTTCTAGGTTTTAACCATCATTCGATAACTTGAAGAAAACATTTCTTGACCGTTGGATGATGATATGTAGTTCTTTGGAATTCATGTATTTTGTTTGTTCTTTTGGTATTAGTCTGCAAATGTGTCATAG
mRNA sequence
ATGAAGACCGTCACCGGAAGTATCGTTTCTTCGAAGCCAATCTCTATCTCAAAAGCGGCGTCCACTCTCTCCTCCTTTCTCTCCGTCGACAATGGCGCTTCGAAAGCAATCTGCGCCTATCTGAGACGCGCCTCCGCCTCTTTCAACGAGTTAAAGCAGCTCCACAAGGAGTTGAAATCTTCGCGGTCTGATCGGAAGCACCGGCATCACGGATCCGAGGCTTCAAACGATCCAGAGGCTTCCAGAGGTAATCCACATTGGATCGAAGACGACGAGAAGAAAAATCCTCTGTATCTAAGAGCAAAAGATGGAAGAAGCGGTAAGCCGAGTTTTAATGTTCAATCTGAGGATGGGAAGGATGGGAAGACGGAAAAGGAAAGCGGTGGGAGTGGTGATTTTGAGGATGCATCGGGTGAATACCGAAAGAGAAAGGTCGGGGACTTGAAAACTGAAATTGAAGATAAACCTAACCGAAAAGTAGAGATGGATGTGGAATCAAGTGATAAAGATAAGAGCGTTGTAGCAGTTGAGAAAAAAGGAAAAAAGCACAAGAAAAAGAGCGAGGATAGACATGCTAAGATTGAAGATGACGAACGCGAGGATGGAGCCAGGCGAAGTTACAGTAAATCTCGAAATAGTGATAACAATGGCGAAATTGAAGCTTCTGGGAAGTTCGTCGAGAACAATATCGCAAGCGGAAAAGATAGAAAGAAGCACGAGGACAAGAAGAGTTTGGGTGACGATAAGGATCAAGTAAAGAGTGAAGGTCAGAGAAGAAGAGACGCCGAGGAGGAAAAGAGCACAAACAAGGATAATGATGATGGAACAGAGTCGACCAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAACAGGGAAGAAGAAGATGACGATTTTCAGAACAACAGTGGAGGAGCTATGGTGAAAGAGGAAATCCCAGTTTCGGATGACAAAGAGTTGAAAAGGAAAGAGAAGAAAAAAAGGAAGAATCGAGGTTTAGAAGAAGGGGGTGATGATGGGTCCGAGGAACAACAGCGGGGGTTGCCTCTAGTGATGGAAGAGTGGGGAATAGCAACGTCTGCAAATGTGTCATAG
Coding sequence (CDS)
ATGAAGACCGTCACCGGAAGTATCGTTTCTTCGAAGCCAATCTCTATCTCAAAAGCGGCGTCCACTCTCTCCTCCTTTCTCTCCGTCGACAATGGCGCTTCGAAAGCAATCTGCGCCTATCTGAGACGCGCCTCCGCCTCTTTCAACGAGTTAAAGCAGCTCCACAAGGAGTTGAAATCTTCGCGGTCTGATCGGAAGCACCGGCATCACGGATCCGAGGCTTCAAACGATCCAGAGGCTTCCAGAGGTAATCCACATTGGATCGAAGACGACGAGAAGAAAAATCCTCTGTATCTAAGAGCAAAAGATGGAAGAAGCGGTAAGCCGAGTTTTAATGTTCAATCTGAGGATGGGAAGGATGGGAAGACGGAAAAGGAAAGCGGTGGGAGTGGTGATTTTGAGGATGCATCGGGTGAATACCGAAAGAGAAAGGTCGGGGACTTGAAAACTGAAATTGAAGATAAACCTAACCGAAAAGTAGAGATGGATGTGGAATCAAGTGATAAAGATAAGAGCGTTGTAGCAGTTGAGAAAAAAGGAAAAAAGCACAAGAAAAAGAGCGAGGATAGACATGCTAAGATTGAAGATGACGAACGCGAGGATGGAGCCAGGCGAAGTTACAGTAAATCTCGAAATAGTGATAACAATGGCGAAATTGAAGCTTCTGGGAAGTTCGTCGAGAACAATATCGCAAGCGGAAAAGATAGAAAGAAGCACGAGGACAAGAAGAGTTTGGGTGACGATAAGGATCAAGTAAAGAGTGAAGGTCAGAGAAGAAGAGACGCCGAGGAGGAAAAGAGCACAAACAAGGATAATGATGATGGAACAGAGTCGACCAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAACAGGGAAGAAGAAGATGACGATTTTCAGAACAACAGTGGAGGAGCTATGGTGAAAGAGGAAATCCCAGTTTCGGATGACAAAGAGTTGAAAAGGAAAGAGAAGAAAAAAAGGAAGAATCGAGGTTTAGAAGAAGGGGGTGATGATGGGTCCGAGGAACAACAGCGGGGGTTGCCTCTAGTGATGGAAGAGTGGGGAATAGCAACGTCTGCAAATGTGTCATAG
Protein sequence
MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKSSRSDRKHRHHGSEASNDPEASRGNPHWIEDDEKKNPLYLRAKDGRSGKPSFNVQSEDGKDGKTEKESGGSGDFEDASGEYRKRKVGDLKTEIEDKPNRKVEMDVESSDKDKSVVAVEKKGKKHKKKSEDRHAKIEDDEREDGARRSYSKSRNSDNNGEIEASGKFVENNIASGKDRKKHEDKKSLGDDKDQVKSEGQRRRDAEEEKSTNKDNDDGTESTKKKKKKKKKKNREEEDDDFQNNSGGAMVKEEIPVSDDKELKRKEKKKRKNRGLEEGGDDGSEEQQRGLPLVMEEWGIATSANVS
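The protein sequence above corresponds to a translation of the coding sequence. The following is a minimal sketch of that translation, assuming the standard nuclear genetic code (NCBI translation table 1) and reading frame 0. The names `CODON_TABLE` and `translate` are illustrative only and are not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
print(translate("ATGAAGACCGTCACCGGAAGTATCGTTTCT"))  # MKTVTGSIVS
```

The printed prefix matches the first ten residues of the protein sequence listed above.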
Homology
BLAST of CmoCh20G004850 vs. ExPASy TrEMBL
Match:
A0A6J1FSX8 (glutamic acid-rich protein-like OS=Cucurbita moschata OX=3662 GN=LOC111448174 PE=4 SV=1)
HSP 1 Score: 614.0 bits (1582), Expect = 4.0e-172
Identity = 345/345 (100.00%), Positives = 345/345 (100.00%), Query Frame = 0
Query: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS
Sbjct: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
Query: 61 SRSDRKHRHHGSEASNDPEASRGNPHWIEDDEKKNPLYLRAKDGRSGKPSFNVQSEDGKD 120
SRSDRKHRHHGSEASNDPEASRGNPHWIEDDEKKNPLYLRAKDGRSGKPSFNVQSEDGKD
Sbjct: 61 SRSDRKHRHHGSEASNDPEASRGNPHWIEDDEKKNPLYLRAKDGRSGKPSFNVQSEDGKD 120
Query: 121 GKTEKESGGSGDFEDASGEYRKRKVGDLKTEIEDKPNRKVEMDVESSDKDKSVVAVEKKG 180
GKTEKESGGSGDFEDASGEYRKRKVGDLKTEIEDKPNRKVEMDVESSDKDKSVVAVEKKG
Sbjct: 121 GKTEKESGGSGDFEDASGEYRKRKVGDLKTEIEDKPNRKVEMDVESSDKDKSVVAVEKKG 180
Query: 181 KKHKKKSEDRHAKIEDDEREDGARRSYSKSRNSDNNGEIEASGKFVENNIASGKDRKKHE 240
KKHKKKSEDRHAKIEDDEREDGARRSYSKSRNSDNNGEIEASGKFVENNIASGKDRKKHE
Sbjct: 181 KKHKKKSEDRHAKIEDDEREDGARRSYSKSRNSDNNGEIEASGKFVENNIASGKDRKKHE 240
Query: 241 DKKSLGDDKDQVKSEGQRRRDAEEEKSTNKDNDDGTESTKKKKKKKKKKNREEEDDDFQN 300
DKKSLGDDKDQVKSEGQRRRDAEEEKSTNKDNDDGTESTKKKKKKKKKKNREEEDDDFQN
Sbjct: 241 DKKSLGDDKDQVKSEGQRRRDAEEEKSTNKDNDDGTESTKKKKKKKKKKNREEEDDDFQN 300
Query: 301 NSGGAMVKEEIPVSDDKELKRKEKKKRKNRGLEEGGDDGSEEQQR 346
NSGGAMVKEEIPVSDDKELKRKEKKKRKNRGLEEGGDDGSEEQQR
Sbjct: 301 NSGGAMVKEEIPVSDDKELKRKEKKKRKNRGLEEGGDDGSEEQQR 345
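The "Identity" figure in each HSP is the number of alignment columns with identical residues divided by the alignment length. A minimal sketch of that calculation follows, assuming the query and subject rows are supplied as equal-length strings with `-` for gaps. The function name `identity_stats` is hypothetical. "Positives" are not computed here because they depend on the substitution matrix used by BLAST.

```python
def identity_stats(query: str, sbjct: str, gap: str = "-"):
    """Return (identical columns, alignment length, percent identity).

    Both strings must be rows of the same alignment, so they must have
    equal length. Gap columns count toward the length but never as matches.
    """
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have the same length")
    identical = sum(
        1 for q, s in zip(query, sbjct) if q == s and q != gap
    )
    length = len(query)
    percent = round(100.0 * identical / length, 2) if length else 0.0
    return identical, length, percent


# Toy example with one mismatch and one gap column.
print(identity_stats("GTEST-KKN", "GTESSKKKN"))  # (7, 9, 77.78)
```

Applied to the counts reported for a full-length identical hit (345 identical columns out of 345), the same ratio gives 100.00%.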
BLAST of CmoCh20G004850 vs. ExPASy TrEMBL
Match:
A0A6J1JCJ7 (cylicin-1-like OS=Cucurbita maxima OX=3661 GN=LOC111484496 PE=4 SV=1)
HSP 1 Score: 547.7 bits (1410), Expect = 3.6e-152
Identity = 319/346 (92.20%), Positives = 327/346 (94.51%), Query Frame = 0
Query: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS
Sbjct: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
Query: 61 SRSDRKHRHHGSEASNDPEASRGNPHWIEDDEKKNPLYLRAKDGRSGKPSFNVQSEDGKD 120
SRSDRKHRHHGSEASNDPEA+R NP WIED EKKNPLYLRAKDG+SGK S NVQSE D
Sbjct: 61 SRSDRKHRHHGSEASNDPEAARDNPQWIEDGEKKNPLYLRAKDGKSGKTSLNVQSE---D 120
Query: 121 GKTEKESGGSGDFEDASGEYRKRKVGDLKTEIEDKPNRKVEMDVESSDKDKSVVAVEKKG 180
GK EKESGG+GDFEDASGEYRKRKV DLKTEIEDKPNRKVEMDVESSDKDKSVVAVEKKG
Sbjct: 121 GKAEKESGGNGDFEDASGEYRKRKVEDLKTEIEDKPNRKVEMDVESSDKDKSVVAVEKKG 180
Query: 181 KKHKKKSEDRHAKIEDDEREDGARRSYSKSRNSDNNGEIEASGKFVENNIASGKDRKKHE 240
KKH+KKSEDR+AKIEDDE + GARRS SKSRNSDNNGEIEAS KFVENNIASGKDRKKH
Sbjct: 181 KKHQKKSEDRYAKIEDDEHKAGARRSSSKSRNSDNNGEIEASAKFVENNIASGKDRKKHV 240
Query: 241 DKKSLGDDKDQVKSEGQRRRDAEEEKSTNKDNDDGTEST-KKKKKKKKKKNREEEDDDFQ 300
DKKSLGDDKDQVKSEG RRRDAEEEKSTNKDNDDGTES+ KKKKKKKKKKNREEEDDDFQ
Sbjct: 241 DKKSLGDDKDQVKSEGHRRRDAEEEKSTNKDNDDGTESSKKKKKKKKKKKNREEEDDDFQ 300
Query: 301 NNSGGAMVKEEIPVSDDKELKRKEKKKRKNRGLEEGGDDGSEEQQR 346
NNSGGA+VKEEIPV DDKELKRKEKKKRKNR LEEGGDDGSEEQQR
Sbjct: 301 NNSGGALVKEEIPVLDDKELKRKEKKKRKNRDLEEGGDDGSEEQQR 343
BLAST of CmoCh20G004850 vs. ExPASy TrEMBL
Match:
A0A5A7SW64 (Glutamic acid-rich protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold228G00720 PE=4 SV=1)
HSP 1 Score: 361.7 bits (927), Expect = 3.6e-96
Identity = 239/351 (68.09%), Positives = 269/351 (76.64%), Query Frame = 0
Query: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
MKTVTGS+VSSKPISISKAASTLSSFLS DNGASKA+CAYLRRAS SFNELK LHKELKS
Sbjct: 1 MKTVTGSVVSSKPISISKAASTLSSFLSADNGASKALCAYLRRASDSFNELKHLHKELKS 60
Query: 61 SRSDRKHRHHGSEASNDPEASRGNPHWIEDDEKKNPLYLRAK---DGR---SGKPSFNVQ 120
S S RKH HHGS+ SN+ EA+ N + +ED +KKN K D + + K S VQ
Sbjct: 61 SPSVRKHLHHGSKVSNEFEAAMDNEYRVEDGDKKNSSVSEKKKRPDSKYRTTDKTSLRVQ 120
Query: 121 SEDGKDGKTEKESGGSGDFEDASGEYRKRKVGDLKTEIEDKPNRKVEMDVESSDKDKSVV 180
S+D + GKT E+GG+G+ ED SG KRK G LK EIEDKP+ KVEMDVESSD VV
Sbjct: 121 SDDEQSGKTAMENGGNGNLEDVSG---KRKGGGLKIEIEDKPSGKVEMDVESSD----VV 180
Query: 181 AVEKKGKKHKKKSEDRHAKIEDDEREDGARRSYSKSRNSDNN-GEIEASGKFVENNIASG 240
AVEKK KKHKKKSEDRH IEDDERE GAR + KS+N+DNN EASG+FVENN+A G
Sbjct: 181 AVEKKRKKHKKKSEDRHGDIEDDERESGARLKHGKSQNTDNNCDNAEASGEFVENNVAKG 240
Query: 241 KDRKKHEDKKSLGDDKDQVKSEGQRRRDAEEEKSTNKDNDDGTESTKKKKKKKKKKNREE 300
K RKK EDK+SLGD KDQVKSE QRR D +EE+ST+ DN +GT+ KKKKK+ + E
Sbjct: 241 KSRKKLEDKRSLGDVKDQVKSEDQRRGDIKEERSTDNDNGNGTDLVDLSTKKKKKRKQRE 300
Query: 301 EDDDFQNNSGGAMVKEEIPVSDDKELKRKEKKKRKNRGL-EEGGDDGSEEQ 344
EDDDFQ NSGGAMVKEE+PV D KELKRKEKKK KNR L EEG DDGSEEQ
Sbjct: 301 EDDDFQKNSGGAMVKEEVPVLDSKELKRKEKKKSKNRELGEEGHDDGSEEQ 344
BLAST of CmoCh20G004850 vs. ExPASy TrEMBL
Match:
A0A0A0KCS1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G301030 PE=4 SV=1)
HSP 1 Score: 360.5 bits (924), Expect = 8.0e-96
Identity = 237/350 (67.71%), Positives = 269/350 (76.86%), Query Frame = 0
Query: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
MKTV GS+VSSKPISISKAASTLSSFLS DNGASKA+CAYLRRAS SFNELKQLHKELKS
Sbjct: 1 MKTVNGSVVSSKPISISKAASTLSSFLSADNGASKALCAYLRRASDSFNELKQLHKELKS 60
Query: 61 SRSDRKHRHHGSEASNDPEASRGNPHWIEDDEKKNPLYLR------AKDGRSGKPSFNVQ 120
S S RKH HHGSE SN+ EA+ + + +ED +K N KD + K S VQ
Sbjct: 61 SCSVRKHLHHGSEVSNEFEAAIHDQYRVEDGDKNNSSVSEKKKRPDRKDRTTDKTSLRVQ 120
Query: 121 SEDGKDGKTEKESGGSGDFEDASGEYRKRKVGDLKTEIEDKPNRKVEMDVESSDKDKSVV 180
S + + GKT E+GG+G+ ED +G K+K +LK EIEDKP+ KVEMDVESSD+DKSVV
Sbjct: 121 SYNEQIGKTPMENGGNGNLEDVTG---KKKGSELKIEIEDKPSGKVEMDVESSDRDKSVV 180
Query: 181 AVEKKGKKHKKKSEDRHAKIEDDEREDGARRSYSKSRNSDNNGEIEASGKFVENNIASGK 240
AVEKK K+HKKKSEDRH IEDDERE GAR + KS+N+DNN + EASG+FVENN+A+GK
Sbjct: 181 AVEKKRKRHKKKSEDRHDDIEDDERESGARLKHGKSQNTDNNCDAEASGEFVENNVANGK 240
Query: 241 DRKKHEDKKSLGDDKDQVKSEGQRRRDAEEEKSTNKDNDDGTESTKKKKKKKKKKNREEE 300
RKK EDKK L D KDQVKSE QRR D +E KSTN DND+GT+ KKKKK+ R EE
Sbjct: 241 SRKKLEDKKRLDDVKDQVKSEDQRRGDVKEGKSTNNDNDNGTDHVDLSPKKKKKR-RREE 300
Query: 301 DDDFQNNSGGAMVKEEIPVSDDKELKRKEKKKRKNRGL-EEGGDDGSEEQ 344
DDDFQ NSG AMVKEE+PV D KELKRKEKKK KNR L EEG DDGSEEQ
Sbjct: 301 DDDFQKNSGEAMVKEEVPVLDSKELKRKEKKKSKNRELGEEGRDDGSEEQ 346
BLAST of CmoCh20G004850 vs. ExPASy TrEMBL
Match:
A0A5D3CVE3 (Glutamic acid-rich protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold84G00960 PE=4 SV=1)
HSP 1 Score: 358.6 bits (919), Expect = 3.1e-95
Identity = 238/351 (67.81%), Positives = 268/351 (76.35%), Query Frame = 0
Query: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
MKTVTGS+VSSKPISISKAASTLSSFLS DNGASKA+CAYLRRAS SFNELK LHKELKS
Sbjct: 1 MKTVTGSVVSSKPISISKAASTLSSFLSADNGASKALCAYLRRASDSFNELKHLHKELKS 60
Query: 61 SRSDRKHRHHGSEASNDPEASRGNPHWIEDDEKKNPLYLRAK---DGR---SGKPSFNVQ 120
S S RKH HHGS+ SN+ EA+ N + +ED +KKN K D + + K S VQ
Sbjct: 61 SPSVRKHLHHGSKVSNEFEAAMDNEYRVEDGDKKNSSVSEKKKRPDSKYRTTDKTSLRVQ 120
Query: 121 SEDGKDGKTEKESGGSGDFEDASGEYRKRKVGDLKTEIEDKPNRKVEMDVESSDKDKSVV 180
S+D + GKT E+GG+G+ ED SG KRK G LK EIEDKP+ KVEMDVESSD VV
Sbjct: 121 SDDEQSGKTAMENGGNGNLEDVSG---KRKGGGLKIEIEDKPSGKVEMDVESSD----VV 180
Query: 181 AVEKKGKKHKKKSEDRHAKIEDDEREDGARRSYSKSRNSDNN-GEIEASGKFVENNIASG 240
AVEKK KKHKKKSEDRH IEDDERE GAR + KS+N+DNN EASG+FVENN+A G
Sbjct: 181 AVEKKRKKHKKKSEDRHGDIEDDERESGARLKHGKSQNTDNNCDNAEASGEFVENNVAKG 240
Query: 241 KDRKKHEDKKSLGDDKDQVKSEGQRRRDAEEEKSTNKDNDDGTESTKKKKKKKKKKNREE 300
K RKK EDK+SLGD KDQVKSE QRR D +EE+ST+ DN +GT+ KKKKK+ + E
Sbjct: 241 KSRKKLEDKRSLGDVKDQVKSEDQRRGDIKEERSTDNDNGNGTDLVDLSTKKKKKRKQRE 300
Query: 301 EDDDFQNNSGGAMVKEEIPVSDDKELKRKEKKKRKNRGL-EEGGDDGSEEQ 344
EDDDFQ NSG AMVKEE+PV D KELKRKEKKK KNR L EEG DDGSEEQ
Sbjct: 301 EDDDFQKNSGEAMVKEEVPVLDSKELKRKEKKKSKNRELGEEGHDDGSEEQ 344
BLAST of CmoCh20G004850 vs. NCBI nr
Match:
XP_022943393.1 (glutamic acid-rich protein-like [Cucurbita moschata] >XP_022943394.1 glutamic acid-rich protein-like [Cucurbita moschata] >XP_022943395.1 glutamic acid-rich protein-like [Cucurbita moschata] >XP_022943397.1 glutamic acid-rich protein-like [Cucurbita moschata] >XP_022943398.1 glutamic acid-rich protein-like [Cucurbita moschata])
HSP 1 Score: 614.0 bits (1582), Expect = 8.3e-172
Identity = 345/345 (100.00%), Positives = 345/345 (100.00%), Query Frame = 0
Query: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS
Sbjct: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
Query: 61 SRSDRKHRHHGSEASNDPEASRGNPHWIEDDEKKNPLYLRAKDGRSGKPSFNVQSEDGKD 120
SRSDRKHRHHGSEASNDPEASRGNPHWIEDDEKKNPLYLRAKDGRSGKPSFNVQSEDGKD
Sbjct: 61 SRSDRKHRHHGSEASNDPEASRGNPHWIEDDEKKNPLYLRAKDGRSGKPSFNVQSEDGKD 120
Query: 121 GKTEKESGGSGDFEDASGEYRKRKVGDLKTEIEDKPNRKVEMDVESSDKDKSVVAVEKKG 180
GKTEKESGGSGDFEDASGEYRKRKVGDLKTEIEDKPNRKVEMDVESSDKDKSVVAVEKKG
Sbjct: 121 GKTEKESGGSGDFEDASGEYRKRKVGDLKTEIEDKPNRKVEMDVESSDKDKSVVAVEKKG 180
Query: 181 KKHKKKSEDRHAKIEDDEREDGARRSYSKSRNSDNNGEIEASGKFVENNIASGKDRKKHE 240
KKHKKKSEDRHAKIEDDEREDGARRSYSKSRNSDNNGEIEASGKFVENNIASGKDRKKHE
Sbjct: 181 KKHKKKSEDRHAKIEDDEREDGARRSYSKSRNSDNNGEIEASGKFVENNIASGKDRKKHE 240
Query: 241 DKKSLGDDKDQVKSEGQRRRDAEEEKSTNKDNDDGTESTKKKKKKKKKKNREEEDDDFQN 300
DKKSLGDDKDQVKSEGQRRRDAEEEKSTNKDNDDGTESTKKKKKKKKKKNREEEDDDFQN
Sbjct: 241 DKKSLGDDKDQVKSEGQRRRDAEEEKSTNKDNDDGTESTKKKKKKKKKKNREEEDDDFQN 300
Query: 301 NSGGAMVKEEIPVSDDKELKRKEKKKRKNRGLEEGGDDGSEEQQR 346
NSGGAMVKEEIPVSDDKELKRKEKKKRKNRGLEEGGDDGSEEQQR
Sbjct: 301 NSGGAMVKEEIPVSDDKELKRKEKKKRKNRGLEEGGDDGSEEQQR 345
BLAST of CmoCh20G004850 vs. NCBI nr
Match:
KAG7010591.1 (hypothetical protein SDJN02_27385, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 584.7 bits (1506), Expect = 5.4e-163
Identity = 334/345 (96.81%), Positives = 336/345 (97.39%), Query Frame = 0
Query: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS
Sbjct: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
Query: 61 SRSDRKHRHHGSEASNDPEASRGNPHWIEDDEKKNPLYLRAKDGRSGKPSFNVQSEDGKD 120
SRSDRKHRHHGSEASNDPEASR NPHWIED EKKNPLYLRAK GRSGKPSFNVQSEDGKD
Sbjct: 61 SRSDRKHRHHGSEASNDPEASRDNPHWIEDGEKKNPLYLRAKVGRSGKPSFNVQSEDGKD 120
Query: 121 GKTEKESGGSGDFEDASGEYRKRKVGDLKTEIEDKPNRKVEMDVESSDKDKSVVAVEKKG 180
GKTEKESGG+GDFEDASGEYRKRKV DLKTEIEDKPNRKVEMDVESSDKDKSVVAVE K
Sbjct: 121 GKTEKESGGNGDFEDASGEYRKRKVEDLKTEIEDKPNRKVEMDVESSDKDKSVVAVETKR 180
Query: 181 KKHKKKSEDRHAKIEDDEREDGARRSYSKSRNSDNNGEIEASGKFVENNIASGKDRKKHE 240
KKHKKKSEDRHAKIEDDERE+GARRSYSKSR SDNNGEIEASGKFVENNIASGKDRKKHE
Sbjct: 181 KKHKKKSEDRHAKIEDDERENGARRSYSKSRISDNNGEIEASGKFVENNIASGKDRKKHE 240
Query: 241 DKKSLGDDKDQVKSEGQRRRDAEEEKSTNKDNDDGTESTKKKKKKKKKKNREEEDDDFQN 300
DKKSL DDKDQVKSEGQRRRDAEEEKSTNKDNDDG ESTKKKKKKKKKKNREEEDDDFQN
Sbjct: 241 DKKSLVDDKDQVKSEGQRRRDAEEEKSTNKDNDDGAESTKKKKKKKKKKNREEEDDDFQN 300
Query: 301 NSGGAMVKEEIPVSDDKELKRKEKKKRKNRGLEEGGDDGSEEQQR 346
NSGGAMVKEEIPVSDDKELKRKEKKKRKNRGLEEGGDDGSEEQQR
Sbjct: 301 NSGGAMVKEEIPVSDDKELKRKEKKKRKNRGLEEGGDDGSEEQQR 345
BLAST of CmoCh20G004850 vs. NCBI nr
Match:
KAG6570745.1 (hypothetical protein SDJN03_29660, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 581.6 bits (1498), Expect = 4.6e-162
Identity = 330/339 (97.35%), Positives = 333/339 (98.23%), Query Frame = 0
Query: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS
Sbjct: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
Query: 61 SRSDRKHRHHGSEASNDPEASRGNPHWIEDDEKKNPLYLRAKDGRSGKPSFNVQSEDGKD 120
SRSDRKHRHHGSEASNDPEASR NPHWIED EKKNPLYLRAKDGRSGKPSFNVQSEDGKD
Sbjct: 61 SRSDRKHRHHGSEASNDPEASRDNPHWIEDGEKKNPLYLRAKDGRSGKPSFNVQSEDGKD 120
Query: 121 GKTEKESGGSGDFEDASGEYRKRKVGDLKTEIEDKPNRKVEMDVESSDKDKSVVAVEKKG 180
GKTEKESGG+GDFEDASGEYRKRKV DLKTEIE+KPNRKVEMDVESSDKDKSVVAVE K
Sbjct: 121 GKTEKESGGNGDFEDASGEYRKRKVEDLKTEIENKPNRKVEMDVESSDKDKSVVAVETKR 180
Query: 181 KKHKKKSEDRHAKIEDDEREDGARRSYSKSRNSDNNGEIEASGKFVENNIASGKDRKKHE 240
KKHKKKSEDRHAKIEDDERE+GARRSYSKSR SDNNGEIEASGKFVENNIASGKDRKKHE
Sbjct: 181 KKHKKKSEDRHAKIEDDERENGARRSYSKSRISDNNGEIEASGKFVENNIASGKDRKKHE 240
Query: 241 DKKSLGDDKDQVKSEGQRRRDAEEEKSTNKDNDDGTESTKKKKKKKKKKNREEEDDDFQN 300
DKKSLGDDKDQVKSEGQRRRDAEEEKSTNKDNDDGTESTKKKKKKKKKKNREEEDDDFQN
Sbjct: 241 DKKSLGDDKDQVKSEGQRRRDAEEEKSTNKDNDDGTESTKKKKKKKKKKNREEEDDDFQN 300
Query: 301 NSGGAMVKEEIPVSDDKELKRKEKKKRKNRGLEEGGDDG 340
NSGGAMVKEEIPVSDDKELKRKEKKKRKNRGLEEGGDDG
Sbjct: 301 NSGGAMVKEEIPVSDDKELKRKEKKKRKNRGLEEGGDDG 339
BLAST of CmoCh20G004850 vs. NCBI nr
Match:
XP_023511985.1 (DNA topoisomerase 1-like [Cucurbita pepo subsp. pepo] >XP_023511986.1 DNA topoisomerase 1-like [Cucurbita pepo subsp. pepo] >XP_023511987.1 DNA topoisomerase 1-like [Cucurbita pepo subsp. pepo] >XP_023511988.1 DNA topoisomerase 1-like [Cucurbita pepo subsp. pepo])
HSP 1 Score: 577.4 bits (1487), Expect = 8.6e-161
Identity = 329/345 (95.36%), Positives = 334/345 (96.81%), Query Frame = 0
Query: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS
Sbjct: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
Query: 61 SRSDRKHRHHGSEASNDPEASRGNPHWIEDDEKKNPLYLRAKDGRSGKPSFNVQSEDGKD 120
SRSDRKHRHHGSEASNDPEA R NPHWIED EKKN YLR KDGRSGKPS NVQSEDG+D
Sbjct: 61 SRSDRKHRHHGSEASNDPEAPRVNPHWIEDGEKKNLEYLREKDGRSGKPSLNVQSEDGQD 120
Query: 121 GKTEKESGGSGDFEDASGEYRKRKVGDLKTEIEDKPNRKVEMDVESSDKDKSVVAVEKKG 180
GKTE +SGG+GDFEDASGEYRKRKV DLKTEIEDKPNRKVEMDVESSDKDKSVVAVEKKG
Sbjct: 121 GKTETKSGGNGDFEDASGEYRKRKVEDLKTEIEDKPNRKVEMDVESSDKDKSVVAVEKKG 180
Query: 181 KKHKKKSEDRHAKIEDDEREDGARRSYSKSRNSDNNGEIEASGKFVENNIASGKDRKKHE 240
KKHKKKSEDRHAKIEDDE E GARRSYSKSRNSDNNGEIEASGKFVEN+IASGKDRKKHE
Sbjct: 181 KKHKKKSEDRHAKIEDDEHEAGARRSYSKSRNSDNNGEIEASGKFVENSIASGKDRKKHE 240
Query: 241 DKKSLGDDKDQVKSEGQRRRDAEEEKSTNKDNDDGTESTKKKKKKKKKKNREEEDDDFQN 300
DKKSLGDDKDQVKSEGQRRRDAEEEKSTNKDNDDGTESTKKK+KKKKKKNREEEDDDFQN
Sbjct: 241 DKKSLGDDKDQVKSEGQRRRDAEEEKSTNKDNDDGTESTKKKRKKKKKKNREEEDDDFQN 300
Query: 301 NSGGAMVKEEIPVSDDKELKRKEKKKRKNRGLEEGGDDGSEEQQR 346
NSGGAMVKEEIPVSDDKELKRKEKKKRKNRGLEEGGDDGSEEQQR
Sbjct: 301 NSGGAMVKEEIPVSDDKELKRKEKKKRKNRGLEEGGDDGSEEQQR 345
BLAST of CmoCh20G004850 vs. NCBI nr
Match:
XP_022986894.1 (cylicin-1-like [Cucurbita maxima] >XP_022986895.1 cylicin-1-like [Cucurbita maxima] >XP_022986896.1 cylicin-1-like [Cucurbita maxima])
HSP 1 Score: 547.7 bits (1410), Expect = 7.3e-152
Identity = 319/346 (92.20%), Positives = 327/346 (94.51%), Query Frame = 0
Query: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS
Sbjct: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
Query: 61 SRSDRKHRHHGSEASNDPEASRGNPHWIEDDEKKNPLYLRAKDGRSGKPSFNVQSEDGKD 120
SRSDRKHRHHGSEASNDPEA+R NP WIED EKKNPLYLRAKDG+SGK S NVQSE D
Sbjct: 61 SRSDRKHRHHGSEASNDPEAARDNPQWIEDGEKKNPLYLRAKDGKSGKTSLNVQSE---D 120
Query: 121 GKTEKESGGSGDFEDASGEYRKRKVGDLKTEIEDKPNRKVEMDVESSDKDKSVVAVEKKG 180
GK EKESGG+GDFEDASGEYRKRKV DLKTEIEDKPNRKVEMDVESSDKDKSVVAVEKKG
Sbjct: 121 GKAEKESGGNGDFEDASGEYRKRKVEDLKTEIEDKPNRKVEMDVESSDKDKSVVAVEKKG 180
Query: 181 KKHKKKSEDRHAKIEDDEREDGARRSYSKSRNSDNNGEIEASGKFVENNIASGKDRKKHE 240
KKH+KKSEDR+AKIEDDE + GARRS SKSRNSDNNGEIEAS KFVENNIASGKDRKKH
Sbjct: 181 KKHQKKSEDRYAKIEDDEHKAGARRSSSKSRNSDNNGEIEASAKFVENNIASGKDRKKHV 240
Query: 241 DKKSLGDDKDQVKSEGQRRRDAEEEKSTNKDNDDGTEST-KKKKKKKKKKNREEEDDDFQ 300
DKKSLGDDKDQVKSEG RRRDAEEEKSTNKDNDDGTES+ KKKKKKKKKKNREEEDDDFQ
Sbjct: 241 DKKSLGDDKDQVKSEGHRRRDAEEEKSTNKDNDDGTESSKKKKKKKKKKKNREEEDDDFQ 300
Query: 301 NNSGGAMVKEEIPVSDDKELKRKEKKKRKNRGLEEGGDDGSEEQQR 346
NNSGGA+VKEEIPV DDKELKRKEKKKRKNR LEEGGDDGSEEQQR
Sbjct: 301 NNSGGALVKEEIPVLDDKELKRKEKKKRKNRDLEEGGDDGSEEQQR 343
BLAST of CmoCh20G004850 vs. TAIR 10
Match:
AT5G60030.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G75335.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )
HSP 1 Score: 75.1 bits (183), Expect = 1.3e-13
Identity = 98/314 (31.21%), Positives = 161/314 (51.27%), Query Frame = 0
Query: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
MKTVTG +VS++PIS+SKAA LS F S DNGAS+ + AYLRRASA+F ELK H+E+KS
Sbjct: 1 MKTVTGRVVSAEPISLSKAAKLLSGFASSDNGASQDVSAYLRRASAAFTELKSFHREIKS 60
Query: 61 SR----SDRKHRHHGSEASNDPEASR-------GNPHWIEDDE--KKNPLYLRAKDGRSG 120
SDR+ + ++ S+D ++ R G + E +Y R +D +
Sbjct: 61 KETKPSSDRETKSTETKQSSDAKSERNVIDEFDGRKIRYRNSEAVSVESVYGRERDEKKM 120
Query: 121 KPSFNVQSEDGKDGKTEKESGGSGDFEDASGEYRKRKVGDLKTEIEDKPNRKVEMDVESS 180
K S + D K + + S + + E +K+K + +++K K+E
Sbjct: 121 KKSKDADVVDEKVNEKLEAEQRSEERRERKKEKKKKKNNKDEDVVDEKVKEKLE------ 180
Query: 181 DKDKSVVAVEKKGKKHKKKSE----DRHAKIEDDEREDGARRSYSKSRNSDNNGEIEASG 240
D+ KS E+K KK KK ++ D K+ED+++ + EI+
Sbjct: 181 DEQKSADRKERKKKKSKKNNDEDVVDEKEKLEDEQK----------------SAEIKEKK 240
Query: 241 KFVENNIASGKDRKKHEDKKSLGDDKDQVKSEGQRRRD--AEEEKSTNKDNDDGTESTKK 296
K + ++ K+++K ED++ G+ K + K + + + +EE KS K D +++
Sbjct: 241 KNKDEDVVDEKEKEKLEDEQRSGERKKEKKKKRKSDEEIVSEERKSKKKRKSDEEMGSEE 292
BLAST of CmoCh20G004850 vs. TAIR 10
Match:
AT1G75335.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G60030.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )
HSP 1 Score: 71.6 bits (174), Expect = 1.4e-12
Identity = 47/89 (52.81%), Positives = 61/89 (68.54%), Query Frame = 0
Query: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELK- 60
MKTVTG + S+KPIS+SKAA+ LS F+S +NGAS+ + AYLRRAS +F ELK +H+E+K
Sbjct: 1 MKTVTGRVNSAKPISLSKAATLLSGFVSSENGASQDVSAYLRRASGAFIELKSIHREIKS 60
Query: 61 -----SSRSDRK-HRHHGSEASNDPEASR 83
SS+ RK HR GSE + R
Sbjct: 61 KETKLSSKKKRKSHREMGSEERKKSKKKR 89
The following BLAST results are available for this feature:
ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1FSX8 | 4.0e-172 | 100.00 | glutamic acid-rich protein-like OS=Cucurbita moschata OX=3662 GN=LOC111448174 PE...
A0A6J1JCJ7 | 3.6e-152 | 92.20 | cylicin-1-like OS=Cucurbita maxima OX=3661 GN=LOC111484496 PE=4 SV=1
A0A5A7SW64 | 3.6e-96 | 68.09 | Glutamic acid-rich protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaff...
A0A0A0KCS1 | 8.0e-96 | 67.71 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G301030 PE=4 SV=1
A0A5D3CVE3 | 3.1e-95 | 67.81 | Glutamic acid-rich protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaff...
NCBI nr
Match Name | E-value | Identity (%) | Description
XP_022943393.1 | 8.3e-172 | 100.00 | glutamic acid-rich protein-like [Cucurbita moschata] >XP_022943394.1 glutamic ac...
KAG7010591.1 | 5.4e-163 | 96.81 | hypothetical protein SDJN02_27385, partial [Cucurbita argyrosperma subsp. argyro...
KAG6570745.1 | 4.6e-162 | 97.35 | hypothetical protein SDJN03_29660, partial [Cucurbita argyrosperma subsp. sorori...
XP_023511985.1 | 8.6e-161 | 95.36 | DNA topoisomerase 1-like [Cucurbita pepo subsp. pepo] >XP_023511986.1 DNA topois...
XP_022986894.1 | 7.3e-152 | 92.20 | cylicin-1-like [Cucurbita maxima] >XP_022986895.1 cylicin-1-like [Cucurbita maxi...
TAIR 10
Match Name | E-value | Identity (%) | Description
AT5G60030.1 | 1.3e-13 | 31.21 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G75335.1 | 1.4e-12 | 52.81 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...