Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TCCAGATCTTTCAACGCGCTGTGGCCGACGGTTATCTTTCAAGAAACCGGCGGTGTGGAAATGAGTCACCCGCCACTCTTCTTCCATATGCGATTCTTCACTTCCCCTTGCTGTTTAAACATTCTTATCGATGTAATACTTTCTGGAGTGTTTGCATTTTTAAATTTTCATGGCGTGTATGATTGATTCTACGAGTTCGATTTACCAAACAAGCTTCTCTGATTGTTCTGGAGTGCAAATCTTATCTGCGAGTGAAAATACCAATGATCGATGATCGATGATCGATTGTTCTAGTTATGATTATTATTGTGTTCAATTTTTCCCGTTTAGGTCTTGATGTTTCACGTTTTCATCTTATTTCGCAGGAGTGTTTTGTTTTGTTTTGTTAGTGAATGCTGAATCTATCCGACGCAATGGGTAATGTAGCCAGTTCATTGGCCTCTGGTTTGTTTTTGGCTCTTAACAAAGTATTTGGATCTCCACTTGATTTTCTCTCTGGAAAATCCTGCAGGTTGGTACCATTTTTCTTTTGCTTTTTCTCATCATGTGGCTTCTGCTCATAAACCTAATAATTTACCTGATTTTGTAATGTTCTATGCGCTTTTTGATTGAGTGCTGTTTACCGGATTTTGTCTTATTTTGTGCCTTTTGATTGAGTATTGTTTACCTGATTTTGTCACTGGGCGCACATGTTGGCAGTTCAGTGTGTGGATCAACATGGGATTTCATATGCTACATAGAAAATTTCTGCGTTGCCAATTTGCTAAAGATGGGCATGGTCGTGATCCTTGCATACTTTGGTGAGTGAAACAAGAAACTCCATGTCTTTGAGAATTTTGGTACTTCTAATATGTAGGGATTTCTTTGTTTTCTGTGGTTGGGAGAAGGCAGCCTTATAGTTTAAAACTTGCATATCAATGTCCAAAGTTATCTGATGATATAGACATGTTTTGAAGTGATTTTGGAACGGCCAAAGTAACTCTTGTCGTGTCCAAAATAACTAAAAGTTGCACGATAAAAGTAATTTTGAAACGACCAAAATCACTCGTGAACGTCCTTAGTTTTGTTAGCATTCTAGACCCACCCGCTTCGAGTATTTATAGAACCTTTCTAGTTCACGATTAGTAGAACTTACTCGATACGGTAGTATGGAGAAATCTGACTGTTTCTGTAATATTTCAAAATATAATTCGTCTCTTGACTTTGTAGTTTACGAATTAGAGTTGCCTATATCTATGGCTTACTGATGTAATGGTAAAACGTCAACTAAGTCGATATAGTATGGAAAGTGTATAATCCATATTTGATAACTTCTAGTTATGGGAATGAAGTTTTAGTTCATCAGCATATAGCGTATGGTGTACTGATGTAATTGTGTGAGATCTAACACTAGTTGGAGAGGGAAACAAAGCATTTCTTATAAGGGTGTGGAAACCTCTTCCAAGTAGACGCGTGTTAAAACCATGAGGCTGACGGCGATTCATAATGGGCCAAAGCGAACAATATTTACTAGCGGTGGGTTTGGGTTGTTACAAATGGTATCAGAGCCAGATACTGAGCGGTGTGCCAGTGATGCCATTGGGCCCCCAAGGGGGGTGGATTGTGAGATCTCACATCTGTTGGAGATGGGAACAAACCATTCCTTATAAGGGTGTGGAAACCTCTCCTTAGTAGATGTGTCTTAAAACCGTGAGACTGATAGCGATACGTAACAGATCAAAATGGACAATATTTGCTAGCGGTGGGCTTGGACTTGAGCTGTTATAAATTGTAAGGCTGCTTATCCCCAAAAAAATTTTATATGCCTGGTGTGACTAAATCATAGGCTCCCAAGATCAAACGAATTCCTTAGGAAAGGGATCTAAACGAGCCCTCTCCCGTTGCTGTTCCCGGATCACCCATTTTAGTTTCAAGTGATTCTAAGAATCCTTAGACGCTTTAGAACTCGAGTACCAAATAATATATGTGCATTATTACTTGTTGTTTATTACATGGAACATTAGTTTCAGTATATATGTGAATAATATTTTTTGGGAATTGAATGCAGTTCTTTTACTCCTCTATTTATTCCATAAAATTGGCATATTTGGATGCATCGGTCGGGGTCTCTGCAGAATGATATGGACATGTTTAGCTTCCTATTGCCATGCATGGGAGTACTGCTGCACTTTCATGTGTATCAAGCTTGCCAGTGTCAAAAGAACAAGACGACGCCATAGAAGAAGAGACCTAGAAGAAGAACTCGAAAGTGAAGAAGACGCAAAATATCGATATGGGTCATCAAGTGATTCGAGCAACGACTCCAAAAAGATTGAGTCGAGAAGGAGCAAACGGGTGTCTCGCAAACGGAGGAGGAGCCACAGAGGTTCTCAAACAAGTAAGACATTGAGGCCACCGAGCCATGGAATTCGAGTGAGGAGTGGTAGAGTGTTGGTCTATAGTAAGCATGGATCATCCAAGATCGTGCAGAAAGAAAGAAACTATAGAAGAGGAAGACAAATACGAAGGTAAGAGATTTGAACTTCTAACTTTTTACGATACATAAAATTTAAATCAGTCATATTAATTTTTGT
mRNA sequence
TCCAGATCTTTCAACGCGCTGTGGCCGACGGTTATCTTTCAAGAAACCGGCGGTGTGGAAATGAGTCACCCGCCACTCTTCTTCCATATGCGATTCTTCACTTCCCCTTGCTGTTTAAACATTCTTATCGATGAGTGTTTTGTTTTGTTTTGTTAGTGAATGCTGAATCTATCCGACGCAATGGGTAATGTAGCCAGTTCATTGGCCTCTGGTTTGTTTTTGGCTCTTAACAAAGTATTTGGATCTCCACTTGATTTTCTCTCTGGAAAATCCTGCAGTTCAGTGTGTGGATCAACATGGGATTTCATATGCTACATAGAAAATTTCTGCGTTGCCAATTTGCTAAAGATGGGCATGGTCGTGATCCTTGCATACTTTGTTCTTTTACTCCTCTATTTATTCCATAAAATTGGCATATTTGGATGCATCGGTCGGGGTCTCTGCAGAATGATATGGACATGTTTAGCTTCCTATTGCCATGCATGGGAGTACTGCTGCACTTTCATGTGTATCAAGCTTGCCAGTGTCAAAAGAACAAGACGACGCCATAGAAGAAGAGACCTAGAAGAAGAACTCGAAAGTGAAGAAGACGCAAAATATCGATATGGGTCATCAAGTGATTCGAGCAACGACTCCAAAAAGATTGAGTCGAGAAGGAGCAAACGGGTGTCTCGCAAACGGAGGAGGAGCCACAGAGGTTCTCAAACAAGTAAGACATTGAGGCCACCGAGCCATGGAATTCGAGTGAGGAGTGGTAGAGTGTTGGTCTATAGTAAGCATGGATCATCCAAGATCGTGCAGAAAGAAAGAAACTATAGAAGAGGAAGACAAATACGAAGGTAAGAGATTTGAACTTCTAACTTTTTACGATACATAAAATTTAAATCAGTCATATTAATTTTTGT
Coding sequence (CDS)
ATGCTGAATCTATCCGACGCAATGGGTAATGTAGCCAGTTCATTGGCCTCTGGTTTGTTTTTGGCTCTTAACAAAGTATTTGGATCTCCACTTGATTTTCTCTCTGGAAAATCCTGCAGTTCAGTGTGTGGATCAACATGGGATTTCATATGCTACATAGAAAATTTCTGCGTTGCCAATTTGCTAAAGATGGGCATGGTCGTGATCCTTGCATACTTTGTTCTTTTACTCCTCTATTTATTCCATAAAATTGGCATATTTGGATGCATCGGTCGGGGTCTCTGCAGAATGATATGGACATGTTTAGCTTCCTATTGCCATGCATGGGAGTACTGCTGCACTTTCATGTGTATCAAGCTTGCCAGTGTCAAAAGAACAAGACGACGCCATAGAAGAAGAGACCTAGAAGAAGAACTCGAAAGTGAAGAAGACGCAAAATATCGATATGGGTCATCAAGTGATTCGAGCAACGACTCCAAAAAGATTGAGTCGAGAAGGAGCAAACGGGTGTCTCGCAAACGGAGGAGGAGCCACAGAGGTTCTCAAACAAGTAAGACATTGAGGCCACCGAGCCATGGAATTCGAGTGAGGAGTGGTAGAGTGTTGGTCTATAGTAAGCATGGATCATCCAAGATCGTGCAGAAAGAAAGAAACTATAGAAGAGGAAGACAAATACGAAGGTAA
Protein sequence
MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWTCLASYCHAWEYCCTFMCIKLASVKRTRRRHRRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRGSQTSKTLRPPSHGIRVRSGRVLVYSKHGSSKIVQKERNYRRGRQIRR
Homology
BLAST of CmoCh01G014290 vs. ExPASy TrEMBL
Match:
A0A6J1FIX0 (uncharacterized protein LOC111445893 OS=Cucurbita moschata OX=3662 GN=LOC111445893 PE=4 SV=1)
HSP 1 Score: 431.4 bits (1108), Expect = 2.3e-117
Identity = 227/227 (100.00%), Postives = 227/227 (100.00%), Query Frame = 0
Query: 1 MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVAN 60
MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVAN
Sbjct: 1 MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVAN 60
Query: 61 LLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWTCLASYCHAWEYCCTFMCIKL 120
LLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWTCLASYCHAWEYCCTFMCIKL
Sbjct: 61 LLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWTCLASYCHAWEYCCTFMCIKL 120
Query: 121 ASVKRTRRRHRRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRG 180
ASVKRTRRRHRRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRG
Sbjct: 121 ASVKRTRRRHRRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRG 180
Query: 181 SQTSKTLRPPSHGIRVRSGRVLVYSKHGSSKIVQKERNYRRGRQIRR 228
SQTSKTLRPPSHGIRVRSGRVLVYSKHGSSKIVQKERNYRRGRQIRR
Sbjct: 181 SQTSKTLRPPSHGIRVRSGRVLVYSKHGSSKIVQKERNYRRGRQIRR 227
BLAST of CmoCh01G014290 vs. ExPASy TrEMBL
Match:
A0A6J1IVE5 (uncharacterized protein LOC111480312 OS=Cucurbita maxima OX=3661 GN=LOC111480312 PE=4 SV=1)
HSP 1 Score: 405.2 bits (1040), Expect = 1.8e-109
Identity = 213/227 (93.83%), Postives = 220/227 (96.92%), Query Frame = 0
Query: 1 MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVAN 60
MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVAN
Sbjct: 1 MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVAN 60
Query: 61 LLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWTCLASYCHAWEYCCTFMCIKL 120
LLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRG CRMIWTCLASYCHAWEYCCTFMC+KL
Sbjct: 61 LLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGFCRMIWTCLASYCHAWEYCCTFMCVKL 120
Query: 121 ASVKRTRRRHRRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRG 180
ASVKRTRR HRRRDLEEELESEE+AK+RYGSS DSSNDS+KIESRRS+RVSRKRRRSHRG
Sbjct: 121 ASVKRTRRCHRRRDLEEELESEEEAKHRYGSSCDSSNDSEKIESRRSERVSRKRRRSHRG 180
Query: 181 SQTSKTLRPPSHGIRVRSGRVLVYSKHGSSKIVQKERNYRRGRQIRR 228
SQTSKTLRP SHGIRVRSGRVLVYSKHGSSK VQK+R YRRGRQ +R
Sbjct: 181 SQTSKTLRPTSHGIRVRSGRVLVYSKHGSSKFVQKKRKYRRGRQRQR 227
BLAST of CmoCh01G014290 vs. ExPASy TrEMBL
Match:
A0A1S4DVV8 (protein HAPLESS 2 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103487604 PE=4 SV=1)
HSP 1 Score: 305.8 bits (782), Expect = 1.5e-79
Identity = 171/238 (71.85%), Postives = 190/238 (79.83%), Query Frame = 0
Query: 8 MGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMV 67
MGNVASSLAS +F A+ K+FGSPLDFLSG+SCSSVCGSTWDFICYIENFCVANL+KMGMV
Sbjct: 1 MGNVASSLASDVFSAIGKIFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLVKMGMV 60
Query: 68 VILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWTCLASYCHAWEYCCTFMCIKLASVKRTR 127
IL+YFVLLLLYL HKIGIFGCIGRGLCRMIWTCLASY +AWEYCC+FMCIKLASVKRTR
Sbjct: 61 FILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLASYFYAWEYCCSFMCIKLASVKRTR 120
Query: 128 RRH-RRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRGSQTSKT 187
RRH RRRDLEEE ESEE K ++ S+SDSSN + +ESR S+ SR+ RR+H+ SQ K+
Sbjct: 121 RRHVRRRDLEEEFESEE-GKCQHESTSDSSNVLEHVESRSSRWSSRRWRRNHKDSQKRKS 180
Query: 188 LRPPSHGIRVRSGRVLVYSKH--------------------GSSKIVQKERNYRRGRQ 225
LRP HG+RVRSGRVLVY KH GSSK V KER YRRGRQ
Sbjct: 181 LRPKGHGVRVRSGRVLVYGKHRRKSVEVGNHSNEIDSFGMYGSSKFVHKERKYRRGRQ 237
BLAST of CmoCh01G014290 vs. ExPASy TrEMBL
Match:
A0A0A0L1V0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G361810 PE=4 SV=1)
HSP 1 Score: 302.0 bits (772), Expect = 2.1e-78
Identity = 169/237 (71.31%), Postives = 188/237 (79.32%), Query Frame = 0
Query: 8 MGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMV 67
MGNVASSLAS +F A+ K+FGSPLDFLSG+SCSSVCGSTWDFICYIENFCVANLLKMGMV
Sbjct: 1 MGNVASSLASDVFSAIGKIFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGMV 60
Query: 68 VILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWTCLASYCHAWEYCCTFMCIKLASVKRTR 127
IL+YFVLLLLYL HKIGIF CIGRGLCRMIWTCLASY +AWEYCC FMCIKLASVKRTR
Sbjct: 61 FILSYFVLLLLYLLHKIGIFRCIGRGLCRMIWTCLASYFYAWEYCCGFMCIKLASVKRTR 120
Query: 128 RRH-RRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRGSQTSKT 187
RRH RRRD+EEE E EE K R+ S+SDS+N + +ES+ S+RVS++ RR+HR SQ K+
Sbjct: 121 RRHVRRRDMEEEFEIEE-GKCRHESTSDSTNVLEHVESKSSRRVSQRWRRNHRDSQRRKS 180
Query: 188 LRPPSHGIRVRSGRVLVYSKH--------------------GSSKIVQKERNYRRGR 224
LRP HG+RVRSGRVLVY KH GSSK V KER YRRGR
Sbjct: 181 LRPKGHGVRVRSGRVLVYGKHRRKSVEVGNHLNEIDSFGMYGSSKYVHKERKYRRGR 236
BLAST of CmoCh01G014290 vs. ExPASy TrEMBL
Match:
A0A6J1DMF3 (uncharacterized protein LOC111021352 OS=Momordica charantia OX=3673 GN=LOC111021352 PE=4 SV=1)
HSP 1 Score: 300.1 bits (767), Expect = 8.1e-78
Identity = 170/245 (69.39%), Postives = 189/245 (77.14%), Query Frame = 0
Query: 1 MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVAN 60
M NL D MGNVASS+ASG F A+ K+F SPLDFLSGKSCSSVCGSTWDFICYIENFCVAN
Sbjct: 1 MPNLPDIMGNVASSVASGFFSAVGKLFRSPLDFLSGKSCSSVCGSTWDFICYIENFCVAN 60
Query: 61 LLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWTCLASYCHAWEYCCTFMCIKL 120
LLK+GMV+IL+ FV+LLLYL HKIGIFGCI RGLCRM WTC+ASY +AW+YCCTFMCIKL
Sbjct: 61 LLKLGMVLILSLFVILLLYLLHKIGIFGCICRGLCRMTWTCIASYFYAWDYCCTFMCIKL 120
Query: 121 ASVKRT-RRRHRRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHR 180
SVKRT RRRHRRRDLEEE ES E K+RYGSSSDSS+ ++IE R S+R SR+ R +HR
Sbjct: 121 GSVKRTRRRRHRRRDLEEEFES-EGGKHRYGSSSDSSSVPERIELRSSQRASRRWRMNHR 180
Query: 181 GSQTSKTLRPPSHGIRVRSGRVLVYSK--------------------HGSSKIVQKERNY 225
GSQ K LRP S GIRVRSGR LVY K HGSSK V +E Y
Sbjct: 181 GSQMRKALRPKSRGIRVRSGRTLVYGKHRRKSSEVVNRLGEIHSLGRHGSSKFVHEEIRY 240
BLAST of CmoCh01G014290 vs. NCBI nr
Match:
XP_022940192.1 (uncharacterized protein LOC111445893 [Cucurbita moschata] >XP_022940193.1 uncharacterized protein LOC111445893 [Cucurbita moschata])
HSP 1 Score: 431.4 bits (1108), Expect = 4.8e-117
Identity = 227/227 (100.00%), Postives = 227/227 (100.00%), Query Frame = 0
Query: 1 MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVAN 60
MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVAN
Sbjct: 1 MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVAN 60
Query: 61 LLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWTCLASYCHAWEYCCTFMCIKL 120
LLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWTCLASYCHAWEYCCTFMCIKL
Sbjct: 61 LLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWTCLASYCHAWEYCCTFMCIKL 120
Query: 121 ASVKRTRRRHRRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRG 180
ASVKRTRRRHRRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRG
Sbjct: 121 ASVKRTRRRHRRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRG 180
Query: 181 SQTSKTLRPPSHGIRVRSGRVLVYSKHGSSKIVQKERNYRRGRQIRR 228
SQTSKTLRPPSHGIRVRSGRVLVYSKHGSSKIVQKERNYRRGRQIRR
Sbjct: 181 SQTSKTLRPPSHGIRVRSGRVLVYSKHGSSKIVQKERNYRRGRQIRR 227
BLAST of CmoCh01G014290 vs. NCBI nr
Match:
KAG7031669.1 (Protein HAPLESS 2 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 416.4 bits (1069), Expect = 1.6e-112
Identity = 222/228 (97.37%), Postives = 225/228 (98.68%), Query Frame = 0
Query: 1 MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVAN 60
MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVAN
Sbjct: 1 MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVAN 60
Query: 61 LLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWTCLASYCHAWEYCCTFMCIKL 120
LLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWTCLASYCHAWEYCCTFMCIKL
Sbjct: 61 LLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWTCLASYCHAWEYCCTFMCIKL 120
Query: 121 ASVKRT-RRRHRRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHR 180
ASVKRT RRRHRRRDLEEELESEE+AK+RYGSSSDSSNDS+KIESRRSKRVSRKRRRSHR
Sbjct: 121 ASVKRTRRRRHRRRDLEEELESEEEAKHRYGSSSDSSNDSEKIESRRSKRVSRKRRRSHR 180
Query: 181 GSQTSKTLRPPSHGIRVRSGRVLVYSKHGSSKIVQKERNYRRGRQIRR 228
GSQTSKTLRP SHGIRVRSGRVLVYSKHGSSKIVQKER YRRGRQIRR
Sbjct: 181 GSQTSKTLRPTSHGIRVRSGRVLVYSKHGSSKIVQKERKYRRGRQIRR 228
BLAST of CmoCh01G014290 vs. NCBI nr
Match:
KAG6608041.1 (Protein HAPLESS 2, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 416.0 bits (1068), Expect = 2.1e-112
Identity = 222/228 (97.37%), Postives = 225/228 (98.68%), Query Frame = 0
Query: 1 MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVAN 60
MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVAN
Sbjct: 1 MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVAN 60
Query: 61 LLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWTCLASYCHAWEYCCTFMCIKL 120
LLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWTCLASYCHAWEYCCTFMCIKL
Sbjct: 61 LLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWTCLASYCHAWEYCCTFMCIKL 120
Query: 121 ASVKRT-RRRHRRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHR 180
ASVKRT RRRHRRRDLEEELESEE+AK+RYGSSSDSSNDS+KIESRRSKRVSRKRRRSHR
Sbjct: 121 ASVKRTRRRRHRRRDLEEELESEEEAKHRYGSSSDSSNDSEKIESRRSKRVSRKRRRSHR 180
Query: 181 GSQTSKTLRPPSHGIRVRSGRVLVYSKHGSSKIVQKERNYRRGRQIRR 228
GSQTSKTLRP SHGIRVRSGRVLVYSKHGSSKIVQKER YRRGRQIRR
Sbjct: 181 GSQTSKTLRPRSHGIRVRSGRVLVYSKHGSSKIVQKERKYRRGRQIRR 228
BLAST of CmoCh01G014290 vs. NCBI nr
Match:
XP_023523489.1 (protein HAPLESS 2 [Cucurbita pepo subsp. pepo] >XP_023523490.1 protein HAPLESS 2 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 410.6 bits (1054), Expect = 8.8e-111
Identity = 216/226 (95.58%), Postives = 221/226 (97.79%), Query Frame = 0
Query: 1 MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVAN 60
MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVAN
Sbjct: 1 MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVAN 60
Query: 61 LLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWTCLASYCHAWEYCCTFMCIKL 120
LLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWTCLASYCHAWEYCCTFMCIKL
Sbjct: 61 LLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWTCLASYCHAWEYCCTFMCIKL 120
Query: 121 ASVKRTRRRHRRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRG 180
ASVKRTRRRHRRRDLEEELESEE+AK+RYGSSSDSSN+S+KIESRR KRVSRK+RRSHRG
Sbjct: 121 ASVKRTRRRHRRRDLEEELESEEEAKHRYGSSSDSSNNSEKIESRRRKRVSRKQRRSHRG 180
Query: 181 SQTSKTLRPPSHGIRVRSGRVLVYSKHGSSKIVQKERNYRRGRQIR 227
SQTSKTLRP HGIRVRSGRVLVYSKHGSSK VQKER YRRGRQIR
Sbjct: 181 SQTSKTLRPTFHGIRVRSGRVLVYSKHGSSKFVQKERKYRRGRQIR 226
BLAST of CmoCh01G014290 vs. NCBI nr
Match:
XP_022981050.1 (uncharacterized protein LOC111480312 [Cucurbita maxima] >XP_022981051.1 uncharacterized protein LOC111480312 [Cucurbita maxima])
HSP 1 Score: 405.2 bits (1040), Expect = 3.7e-109
Identity = 213/227 (93.83%), Postives = 220/227 (96.92%), Query Frame = 0
Query: 1 MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVAN 60
MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVAN
Sbjct: 1 MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVAN 60
Query: 61 LLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWTCLASYCHAWEYCCTFMCIKL 120
LLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRG CRMIWTCLASYCHAWEYCCTFMC+KL
Sbjct: 61 LLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGFCRMIWTCLASYCHAWEYCCTFMCVKL 120
Query: 121 ASVKRTRRRHRRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRG 180
ASVKRTRR HRRRDLEEELESEE+AK+RYGSS DSSNDS+KIESRRS+RVSRKRRRSHRG
Sbjct: 121 ASVKRTRRCHRRRDLEEELESEEEAKHRYGSSCDSSNDSEKIESRRSERVSRKRRRSHRG 180
Query: 181 SQTSKTLRPPSHGIRVRSGRVLVYSKHGSSKIVQKERNYRRGRQIRR 228
SQTSKTLRP SHGIRVRSGRVLVYSKHGSSK VQK+R YRRGRQ +R
Sbjct: 181 SQTSKTLRPTSHGIRVRSGRVLVYSKHGSSKFVQKKRKYRRGRQRQR 227
BLAST of CmoCh01G014290 vs. TAIR 10
Match:
AT1G21722.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G78922.1); Has 47 Blast hits to 47 proteins in 13 species: Archae - 0; Bacteria - 0; Metazoa - 1; Fungi - 1; Plants - 42; Viruses - 0; Other Eukaryotes - 3 (source: NCBI BLink). )
HSP 1 Score: 138.7 bits (348), Expect = 6.0e-33
Identity = 93/230 (40.43%), Postives = 123/230 (53.48%), Query Frame = 0
Query: 8 MGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMV 67
MGNV S +G ++ FGSPLDFLSGKSCSSVC S WDFICY+ENFCVANL K ++
Sbjct: 1 MGNVMDSFFTGFSHSIGNFFGSPLDFLSGKSCSSVCPSPWDFICYVENFCVANLAKTALI 60
Query: 68 VILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWTCLASYCHAWEYCCTFMCIKLASVKRTR 127
+IL+YF L +Y+ +K+G + CI G +++W ++ + + YCC+F C L KR R
Sbjct: 61 LILSYFFLFFIYMLYKVGFWHCIIHGFFKLLWVLVSCWFYMLGYCCSFFCYDLLHSKRRR 120
Query: 128 RRHRRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRGSQTSKTL 187
RR R +EE+ + D D +D RSKR RK R K+L
Sbjct: 121 RRRHNRYIEEDYDDNSD-------DDDDVDDDGSFTYHRSKRECRKEER------LRKSL 180
Query: 188 RPPSHGIRV------RSGRVLVYSKHGSSKI----VQKERNYRRGRQIRR 228
RP +H +RV RS L G S I V +E + R RR
Sbjct: 181 RPRNHRVRVGVRKDHRSDSGLSQHADGGSPIHGVRVSRESKFARKVSKRR 217
BLAST of CmoCh01G014290 vs. TAIR 10
Match:
AT1G78922.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G21722.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )
HSP 1 Score: 72.0 bits (175), Expect = 6.9e-13
Identity = 62/227 (27.31%), Postives = 110/227 (48.46%), Query Frame = 0
Query: 8 MGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMV 67
MGN ++ + + +F +PL G+SC VC WD C+IE+FC+ ++ K+ ++
Sbjct: 1 MGNAIGKASNDIGGFIGNIFTAPLKATLGRSCLDVCSGPWDLECFIEHFCLPDIAKLVLM 60
Query: 68 VILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWTCLASYCHAWEYCCTFMCIKLASVKRTR 127
L + +L+ + L K+GI C+ + +C+M A+Y + +C L ++ R
Sbjct: 61 SSLCFIILMFITLLFKLGICQCVVKSICKMSCAACAAYWFDIGEMISCLCHSLTNINRVN 120
Query: 128 RRHRRRDLEEELESEEDAKYRYGS--SSDSSNDSKKIES----RRSKRVSRKRRRSHRGS 187
RR +R L+ E Y Y S S SS+ +I++ +R +R+ K H GS
Sbjct: 121 RRRKR------LDDIEATTYDYPSDDESSSSDSPSRIDNIRPKQRRRRLGSKHSHHHHGS 180
Query: 188 QTS--KTLRPPSHGIRVRSGRVLVYSKHGSSKIVQKERNYRRGRQIR 227
+ + +R PS + VR G G S+ V++ + R+I+
Sbjct: 181 NRNNRRLIRLPSRQLSVRVG--------GKSRRVRRSTRKIKSRKIK 213
BLAST of CmoCh01G014290 vs. TAIR 10
Match:
AT4G11720.1 (hapless 2 )
HSP 1 Score: 45.1 bits (105), Expect = 9.0e-05
Identity = 21/63 (33.33%), Postives = 40/63 (63.49%), Query Frame = 0
Query: 25 KVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKI 84
K+ +DF++G +C + C S +DF C+I+ C++ ++ G+++ L LLL+L H+
Sbjct: 526 KIINGLVDFITGDTCRNKCSSFFDFSCHIQYVCLSWMVMFGLLLALFPITCLLLWLLHQK 585
Query: 85 GIF 88
G+F
Sbjct: 586 GLF 588
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1FIX0 | 2.3e-117 | 100.00 | uncharacterized protein LOC111445893 OS=Cucurbita moschata OX=3662 GN=LOC1114458... | [more] |
A0A6J1IVE5 | 1.8e-109 | 93.83 | uncharacterized protein LOC111480312 OS=Cucurbita maxima OX=3661 GN=LOC111480312... | [more] |
A0A1S4DVV8 | 1.5e-79 | 71.85 | protein HAPLESS 2 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103487604 PE=4 SV=1 | [more] |
A0A0A0L1V0 | 2.1e-78 | 71.31 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G361810 PE=4 SV=1 | [more] |
A0A6J1DMF3 | 8.1e-78 | 69.39 | uncharacterized protein LOC111021352 OS=Momordica charantia OX=3673 GN=LOC111021... | [more] |
Match Name | E-value | Identity | Description | |
XP_022940192.1 | 4.8e-117 | 100.00 | uncharacterized protein LOC111445893 [Cucurbita moschata] >XP_022940193.1 unchar... | [more] |
KAG7031669.1 | 1.6e-112 | 97.37 | Protein HAPLESS 2 [Cucurbita argyrosperma subsp. argyrosperma] | [more] |
KAG6608041.1 | 2.1e-112 | 97.37 | Protein HAPLESS 2, partial [Cucurbita argyrosperma subsp. sororia] | [more] |
XP_023523489.1 | 8.8e-111 | 95.58 | protein HAPLESS 2 [Cucurbita pepo subsp. pepo] >XP_023523490.1 protein HAPLESS 2... | [more] |
XP_022981050.1 | 3.7e-109 | 93.83 | uncharacterized protein LOC111480312 [Cucurbita maxima] >XP_022981051.1 uncharac... | [more] |
Match Name | E-value | Identity | Description | |
AT1G21722.1 | 6.0e-33 | 40.43 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |
AT1G78922.1 | 6.9e-13 | 27.31 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |
AT4G11720.1 | 9.0e-05 | 33.33 | hapless 2 | [more] |