Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, polypeptide, CDS, three_prime_UTR
GAGCCACAAAAATCCCTGCTAATTTCTCTATCTTCATCATCTTCTTCTTCATCTTCACCGGCGACGACCATCGGGCCGGTGAAATGAATATTCCGGTAACCGACTCTTCCGAATCAGCACACACTCTTCGAATCTCCGATCATCGCCGTCATGTGTTTCACGAAAAGCTCTGATCCGAAATCCAAATTTCTCGCTCGCTTAGTCACGAAGTTATCCGCTCCAGCTCCATGGAAATTCGGGGAAAGGTATCCATTTCTAGCCTCCTTAGGTGAAATTTTTGGTATATTCTGACTTCTCGGGAGGTTTTTTGTCTGTAATTTTCAATATGAGGAGAGTGGTTGGTGGAGAGGAATGAATTTTGAAGAACTGCAGGCATTGATTTCTGGATTTTAATTTCTCTGATTGTTCATTAATTTTGTGTAATCTTTCCCTTTCCCTTTCCCTTCTTCATTTCTGAATGGAGGATGAATGTATTAAGAGGAAAACTAGACGCATCTACTTCGATTTTGTTCACTCAGAGCCTGTGAGTTAATTATGAATCGGTGTTTATTAATTGGTTCTGAGAATGGAGCAGAGATGACGAGGGTGCGTAATTAATTACTGATTACTGCATCAAACAGAGAGCTTGAGGGTTTTAGAGTTCAGATTAAGCTTAAAAACCCCCTCATTTAGACTATTCATGTTCTAAGTACCTGTTTGAATCTGTTCTTGTTGTCTTCAGATGAAGATTCAACCCATTGATATCGATCCTCCTACTGGAAGAGTTGCCATTCGTGCTGATCCGGGCAAACCGGTGCTGAAATCGCGTCTCAGAAAGCTATTTGACCGTCCCCCTAATGTTCTGAAGAATTCTTCTGCGGAGAAGCCGATTGCCGCTGGGGAAGCTGCTCAGTTCATCATCAACAAAGATGGAGGCTCTGAGTTCGAGCCAAGTTCCATTTGTTTGGCTAAAATGGTGCAGAGTTTCATAGAGGAGAGTAATGAAAAGCAGTTGTCGGTGGCTACTGCTGTGAAAAATGGTCGCAACCGCTGCAATTGCTTCAACGGGAACAATAATGACAGCTCCGATGATGAGTCCGATGACTTCGGCGGTGGTTTTGGTGAATCTGTACCAATCGGATCCTCTGGTGCCGATGTTTCCGACTTGCTCAAGGTAAATGCCGCTTCCATTTTTCTCCGGAAATTTAGTTCTCCGCTTCTGAAACTGAAATTGATCGTACTCCATTTTTTGGGAATCAGAGTCTGATTCTTTCCGCTAGCGTAGCCGAGAGAAACCTCTTAGCCGACACCGCGAAGATCTTTGAGAAGAACAACAAGATTCACAAGAAGAAAGACGATTTGAGAAAAGTTGTCACAGATGGCCTTTCTTCTCTAGGTTACGATTCTTCAATCTGTAAATCAAAATGGGACAAATTCCCCTCACATCCTGCAGGTAATCCATTGCGCTTCTCTAAGAACAAAAACTAAAAACAAAATCAAAACAAAAAAACGTAGAAACACCTGCATTAGATTCGAATTAATCTGAAAATTCTGCCTGTAGGGGAATATGAATACATCGATGTGATAGTGGAGGGGGAGAGATTGCTGATCGACATAGATTTCAGATCGGAGTTTGAAATCGCTCGTTCAACCGGAATGTACAAGGCGATCCTCCAATTACTTCCGAATATCTTCATCGGCAAACCAGATCGTCTAGGTCAAATCGTCTCGATCGTATCGGAGGCTGCGAGACAAAGCTTGAAAAAGAAGGGGATGCACTTTCCGCCATGGAGGAAAGCCGAATACATGAGAGCCAAATGGCTTTCCCCTCACATCAGATCCAAACCTCCGATTCCATCGCAAAAAGAAATCGAAAACTCTGACAACGAGCAATCGCCGACAGAAACGGATTGCGGAGACTTCGAAATGATATTCGGCGACGAAATGACACCATCACCTCCTGAAAGTGAATCGATCGCTTCATCGTCTCCTCCGCAGAAAGGTTTCAATGACGGCGAGAA
GGCTGCGGTGGCAGTGACGGCCTGGCAACCTCCGGCGATCAAACCGAAGAGTCTCGATAGAGGAGCTAAGATCGTTACGGGATTGGCATCAATCCTGAAAGAGAATCCGTAAAAATTTTGGGTTTTTTTTTTTTCTTTTTTGAAATATCATAAAAAAAAATAATTATGTGGGGGTAAATAGTAAATATCCGGAAATGTATGGGCTTTTGTTTAATTGAGAAAAAAAAAAAGAAAAAAAAAAGAAAATGAAAGGAAAATCCTTCCGGTTTTTTGGTTGTAGGGATATTCTAGTCTGTAATGGGCAGCAAATTTGGCTTACCCAATCGTTTTGTAAAAATAAATTAGAAAAAAATGATATATTACCGAATTTAACC
mRNA sequence
GAGCCACAAAAATCCCTGCTAATTTCTCTATCTTCATCATCTTCTTCTTCATCTTCACCGGCGACGACCATCGGGCCGGTGAAATGAATATTCCGGTAACCGACTCTTCCGAATCAGCACACACTCTTCGAATCTCCGATCATCGCCGTCATGTGTTTCACGAAAAGCTCTGATCCGAAATCCAAATTTCTCGCTCGCTTAGTCACGAAGTTATCCGCTCCAGCTCCATGGAAATTCGGGGAAAGAGCCTGTGAGTTAATTATGAATCGGTGTTTATTAATTGGTTCTGAGAATGGAGCAGAGATGACGAGGATGAAGATTCAACCCATTGATATCGATCCTCCTACTGGAAGAGTTGCCATTCGTGCTGATCCGGGCAAACCGGTGCTGAAATCGCGTCTCAGAAAGCTATTTGACCGTCCCCCTAATGTTCTGAAGAATTCTTCTGCGGAGAAGCCGATTGCCGCTGGGGAAGCTGCTCAGTTCATCATCAACAAAGATGGAGGCTCTGAGTTCGAGCCAAGTTCCATTTGTTTGGCTAAAATGGTGCAGAGTTTCATAGAGGAGAGTAATGAAAAGCAGTTGTCGGTGGCTACTGCTGTGAAAAATGGTCGCAACCGCTGCAATTGCTTCAACGGGAACAATAATGACAGCTCCGATGATGAGTCCGATGACTTCGGCGGTGGTTTTGGTGAATCTGTACCAATCGGATCCTCTGGTGCCGATGTTTCCGACTTGCTCAAGAGTCTGATTCTTTCCGCTAGCGTAGCCGAGAGAAACCTCTTAGCCGACACCGCGAAGATCTTTGAGAAGAACAACAAGATTCACAAGAAGAAAGACGATTTGAGAAAAGTTGTCACAGATGGCCTTTCTTCTCTAGGTTACGATTCTTCAATCTGTAAATCAAAATGGGACAAATTCCCCTCACATCCTGCAGGGGAATATGAATACATCGATGTGATAGTGGAGGGGGAGAGATTGCTGATCGACATAGATTTCAGATCGGAGTTTGAAATCGCTCGTTCAACCGGAATGTACAAGGCGATCCTCCAATTACTTCCGAATATCTTCATCGGCAAACCAGATCGTCTAGGTCAAATCGTCTCGATCGTATCGGAGGCTGCGAGACAAAGCTTGAAAAAGAAGGGGATGCACTTTCCGCCATGGAGGAAAGCCGAATACATGAGAGCCAAATGGCTTTCCCCTCACATCAGATCCAAACCTCCGATTCCATCGCAAAAAGAAATCGAAAACTCTGACAACGAGCAATCGCCGACAGAAACGGATTGCGGAGACTTCGAAATGATATTCGGCGACGAAATGACACCATCACCTCCTGAAAGTGAATCGATCGCTTCATCGTCTCCTCCGCAGAAAGGTTTCAATGACGGCGAGAAGGCTGCGGTGGCAGTGACGGCCTGGCAACCTCCGGCGATCAAACCGAAGAGTCTCGATAGAGGAGCTAAGATCGTTACGGGATTGGCATCAATCCTGAAAGAGAATCCGTAAAAATTTTGGGTTTTTTTTTTTTCTTTTTTGAAATATCATAAAAAAAAATAATTATGTGGGGGTAAATAGTAAATATCCGGAAATGTATGGGCTTTTGTTTAATTGAGAAAAAAAAAAAGAAAAAAAAAAGAAAATGAAAGGAAAATCCTTCCGGTTTTTTGGTTGTAGGGATATTCTAGTCTGTAATGGGCAGCAAATTTGGCTTACCCAATCGTTTTGTAAAAATAAATTAGAAAAAAATGATATATTACCGAATTTAACC
Coding sequence (CDS)
ATGTGTTTCACGAAAAGCTCTGATCCGAAATCCAAATTTCTCGCTCGCTTAGTCACGAAGTTATCCGCTCCAGCTCCATGGAAATTCGGGGAAAGAGCCTGTGAGTTAATTATGAATCGGTGTTTATTAATTGGTTCTGAGAATGGAGCAGAGATGACGAGGATGAAGATTCAACCCATTGATATCGATCCTCCTACTGGAAGAGTTGCCATTCGTGCTGATCCGGGCAAACCGGTGCTGAAATCGCGTCTCAGAAAGCTATTTGACCGTCCCCCTAATGTTCTGAAGAATTCTTCTGCGGAGAAGCCGATTGCCGCTGGGGAAGCTGCTCAGTTCATCATCAACAAAGATGGAGGCTCTGAGTTCGAGCCAAGTTCCATTTGTTTGGCTAAAATGGTGCAGAGTTTCATAGAGGAGAGTAATGAAAAGCAGTTGTCGGTGGCTACTGCTGTGAAAAATGGTCGCAACCGCTGCAATTGCTTCAACGGGAACAATAATGACAGCTCCGATGATGAGTCCGATGACTTCGGCGGTGGTTTTGGTGAATCTGTACCAATCGGATCCTCTGGTGCCGATGTTTCCGACTTGCTCAAGAGTCTGATTCTTTCCGCTAGCGTAGCCGAGAGAAACCTCTTAGCCGACACCGCGAAGATCTTTGAGAAGAACAACAAGATTCACAAGAAGAAAGACGATTTGAGAAAAGTTGTCACAGATGGCCTTTCTTCTCTAGGTTACGATTCTTCAATCTGTAAATCAAAATGGGACAAATTCCCCTCACATCCTGCAGGGGAATATGAATACATCGATGTGATAGTGGAGGGGGAGAGATTGCTGATCGACATAGATTTCAGATCGGAGTTTGAAATCGCTCGTTCAACCGGAATGTACAAGGCGATCCTCCAATTACTTCCGAATATCTTCATCGGCAAACCAGATCGTCTAGGTCAAATCGTCTCGATCGTATCGGAGGCTGCGAGACAAAGCTTGAAAAAGAAGGGGATGCACTTTCCGCCATGGAGGAAAGCCGAATACATGAGAGCCAAATGGCTTTCCCCTCACATCAGATCCAAACCTCCGATTCCATCGCAAAAAGAAATCGAAAACTCTGACAACGAGCAATCGCCGACAGAAACGGATTGCGGAGACTTCGAAATGATATTCGGCGACGAAATGACACCATCACCTCCTGAAAGTGAATCGATCGCTTCATCGTCTCCTCCGCAGAAAGGTTTCAATGACGGCGAGAAGGCTGCGGTGGCAGTGACGGCCTGGCAACCTCCGGCGATCAAACCGAAGAGTCTCGATAGAGGAGCTAAGATCGTTACGGGATTGGCATCAATCCTGAAAGAGAATCCGTAA
Protein sequence
MCFTKSSDPKSKFLARLVTKLSAPAPWKFGERACELIMNRCLLIGSENGAEMTRMKIQPIDIDPPTGRVAIRADPGKPVLKSRLRKLFDRPPNVLKNSSAEKPIAAGEAAQFIINKDGGSEFEPSSICLAKMVQSFIEESNEKQLSVATAVKNGRNRCNCFNGNNNDSSDDESDDFGGGFGESVPIGSSGADVSDLLKSLILSASVAERNLLADTAKIFEKNNKIHKKKDDLRKVVTDGLSSLGYDSSICKSKWDKFPSHPAGEYEYIDVIVEGERLLIDIDFRSEFEIARSTGMYKAILQLLPNIFIGKPDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAKWLSPHIRSKPPIPSQKEIENSDNEQSPTETDCGDFEMIFGDEMTPSPPESESIASSSPPQKGFNDGEKAAVAVTAWQPPAIKPKSLDRGAKIVTGLASILKENP
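The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and reading frame 0; the names `CODON_TABLE` and `translate` are illustrative and not part of the database.

```python
# Standard genetic code (NCBI translation table 1), built in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO_ACIDS
    )
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the CDS listed above.
print(translate("ATGTGTTTCACGAAAAGC"))  # MCFTKS
```

Applied to the full CDS, the same function should reproduce the protein sequence shown above, ending at the terminal TAA stop codon.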
Homology
BLAST of CmoCh17G001920 vs. ExPASy TrEMBL
Match:
A0A6J1H4C5 (uncharacterized protein LOC111460039 OS=Cucurbita moschata OX=3662 GN=LOC111460039 PE=4 SV=1)
HSP 1 Score: 781.9 bits (2018), Expect = 1.4e-222
Identity = 398/399 (99.75%), Positives = 399/399 (100.00%), Query Frame = 0
Query: 54 RMKIQPIDIDPPTGRVAIRADPGKPVLKSRLRKLFDRPPNVLKNSSAEKPIAAGEAAQFI 113
+MKIQPIDIDPPTGRVAIRADPGKPVLKSRLRKLFDRPPNVLKNSSAEKPIAAGEAAQFI
Sbjct: 6 KMKIQPIDIDPPTGRVAIRADPGKPVLKSRLRKLFDRPPNVLKNSSAEKPIAAGEAAQFI 65
Query: 114 INKDGGSEFEPSSICLAKMVQSFIEESNEKQLSVATAVKNGRNRCNCFNGNNNDSSDDES 173
INKDGGSEFEPSSICLAKMVQSFIEESNEKQLSVATAVKNGRNRCNCFNGNNNDSSDDES
Sbjct: 66 INKDGGSEFEPSSICLAKMVQSFIEESNEKQLSVATAVKNGRNRCNCFNGNNNDSSDDES 125
Query: 174 DDFGGGFGESVPIGSSGADVSDLLKSLILSASVAERNLLADTAKIFEKNNKIHKKKDDLR 233
DDFGGGFGESVPIGSSGADVSDLLKSLILSASVAERNLLADTAKIFEKNNKIHKKKDDLR
Sbjct: 126 DDFGGGFGESVPIGSSGADVSDLLKSLILSASVAERNLLADTAKIFEKNNKIHKKKDDLR 185
Query: 234 KVVTDGLSSLGYDSSICKSKWDKFPSHPAGEYEYIDVIVEGERLLIDIDFRSEFEIARST 293
KVVTDGLSSLGYDSSICKSKWDKFPSHPAGEYEYIDVIVEGERLLIDIDFRSEFEIARST
Sbjct: 186 KVVTDGLSSLGYDSSICKSKWDKFPSHPAGEYEYIDVIVEGERLLIDIDFRSEFEIARST 245
Query: 294 GMYKAILQLLPNIFIGKPDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAKWLSPH 353
GMYKAILQLLPNIFIGKPDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAKWLSPH
Sbjct: 246 GMYKAILQLLPNIFIGKPDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAKWLSPH 305
Query: 354 IRSKPPIPSQKEIENSDNEQSPTETDCGDFEMIFGDEMTPSPPESESIASSSPPQKGFND 413
IRSKPPIPSQKEIENSDNEQSPTETDCGDFEMIFGDEMTPSPPESESIASSSPPQKGFND
Sbjct: 306 IRSKPPIPSQKEIENSDNEQSPTETDCGDFEMIFGDEMTPSPPESESIASSSPPQKGFND 365
Query: 414 GEKAAVAVTAWQPPAIKPKSLDRGAKIVTGLASILKENP 453
GEKAAVAVTAWQPPAIKPKSLDRGAKIVTGLASILKENP
Sbjct: 366 GEKAAVAVTAWQPPAIKPKSLDRGAKIVTGLASILKENP 404
BLAST of CmoCh17G001920 vs. ExPASy TrEMBL
Match:
A0A6J1KYK9 (uncharacterized protein LOC111499376 OS=Cucurbita maxima OX=3661 GN=LOC111499376 PE=4 SV=1)
HSP 1 Score: 760.8 bits (1963), Expect = 3.3e-216
Identity = 388/398 (97.49%), Positives = 391/398 (98.24%), Query Frame = 0
Query: 55 MKIQPIDIDPPTGRVAIRADPGKPVLKSRLRKLFDRPPNVLKNSSAEKPIAAGEAAQFII 114
MKIQPIDIDPPTGRVAIRADPGKPVLKSRLRKLFDRPPNVLKNSSAEKPIAAGEAAQFII
Sbjct: 1 MKIQPIDIDPPTGRVAIRADPGKPVLKSRLRKLFDRPPNVLKNSSAEKPIAAGEAAQFII 60
Query: 115 NKDGGSEFEPSSICLAKMVQSFIEESNEKQLSVATAVKNGRNRCNCFNGNNNDSSDDESD 174
NKDGGSEFEPSSICLAKMVQSFIEESNEKQLSVATAVKNGRNRCNCFNGNNNDSSDDESD
Sbjct: 61 NKDGGSEFEPSSICLAKMVQSFIEESNEKQLSVATAVKNGRNRCNCFNGNNNDSSDDESD 120
Query: 175 DFGGGFGESVPIGSSGADVSDLLKSLILSASVAERNLLADTAKIFEKNNKIHKKKDDLRK 234
DFGGG GESVPIGSSGADVSDLLKSLILSASVAERNLLADTAKI EKNNKIHKKKDDLRK
Sbjct: 121 DFGGGLGESVPIGSSGADVSDLLKSLILSASVAERNLLADTAKIVEKNNKIHKKKDDLRK 180
Query: 235 VVTDGLSSLGYDSSICKSKWDKFPSHPAGEYEYIDVIVEGERLLIDIDFRSEFEIARSTG 294
VVTDGLSSLGYDSSICKSKWDKFPSHPAGEY+YIDVIVEGER LIDIDFRSEFEIARSTG
Sbjct: 181 VVTDGLSSLGYDSSICKSKWDKFPSHPAGEYDYIDVIVEGERFLIDIDFRSEFEIARSTG 240
Query: 295 MYKAILQLLPNIFIGKPDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAKWLSPHI 354
MYKAILQLLPNIF+GKPDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAKWLSPHI
Sbjct: 241 MYKAILQLLPNIFVGKPDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAKWLSPHI 300
Query: 355 RSKPPIPSQKEIENSDNEQSPTETDCGDFEMIFGDEMTPSPPESESIASSSPPQKGFNDG 414
RSKPPIPSQKEIENS+NEQSPTETDCGD EMIFGDE T SPPESESIASSSPPQKGFNDG
Sbjct: 301 RSKPPIPSQKEIENSNNEQSPTETDCGDLEMIFGDETTTSPPESESIASSSPPQKGFNDG 360
Query: 415 EKAAVAVTAWQPPAIKPKSLDRGAKIVTGLASILKENP 453
EKAAVA TAWQPPAIKPKSLDRGAKIVTGLASILKENP
Sbjct: 361 EKAAVAGTAWQPPAIKPKSLDRGAKIVTGLASILKENP 398
BLAST of CmoCh17G001920 vs. ExPASy TrEMBL
Match:
A0A1S3BMC1 (uncharacterized protein LOC103491218 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103491218 PE=4 SV=1)
HSP 1 Score: 685.3 bits (1767), Expect = 1.8e-193
Identity = 357/407 (87.71%), Positives = 375/407 (92.14%), Query Frame = 0
Query: 54 RMKIQPIDIDPPTGRVAIRADPGKPVLKSRLRKLFDRP-PNVLKNSSAEKPIAAGEAAQF 113
+MK+QPIDIDPPTGRVAIRADPGKPVLKSRLRKLFDRP PNVLKNS+AEKPIAAGEAAQF
Sbjct: 6 KMKVQPIDIDPPTGRVAIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIAAGEAAQF 65
Query: 114 IINKDGGSEFEPSSICLAKMVQSFIEESNEKQLSVATAVKNGRNRCNCFNGNNNDSSDDE 173
IINKDG SEFEPSSICLAKMVQSFIEESNEKQLSVATAVKNGRNRCNCFNGNNNDSSDDE
Sbjct: 66 IINKDGFSEFEPSSICLAKMVQSFIEESNEKQLSVATAVKNGRNRCNCFNGNNNDSSDDE 125
Query: 174 SDDFGGGFGESVPIGSSGADVSDLLKSLILSASVAERNLLADTAKIFEKNNKIHKKKDDL 233
SDDFGGGFGE+V IGSSGADV DLLKSLIL ASVAERNLLADTAKI EKNNKIHK+KDDL
Sbjct: 126 SDDFGGGFGETVTIGSSGADVYDLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDL 185
Query: 234 RKVVTDGLSSLGYDSSICKSKWDKFPSHPAGEYEYIDVIVEGERLLIDIDFRSEFEIARS 293
RKVVTDGLSSLGYDSSICKSKW+K PSHPAGEYEYIDV+VE ERL+IDIDFRSEFEIARS
Sbjct: 186 RKVVTDGLSSLGYDSSICKSKWEKSPSHPAGEYEYIDVMVENERLVIDIDFRSEFEIARS 245
Query: 294 TGMYKAILQLLPNIFIGKPDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAKWLSP 353
TGMYK ILQL+PNIF+GK DRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAKWLSP
Sbjct: 246 TGMYKTILQLIPNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAKWLSP 305
Query: 354 HIRSKPPIPSQKEI------ENSDNEQSPTETDCGDFEMIFGDEMTP-SPPESESIASSS 413
HIRSKPP PS KEI EN +NE+SPTETDCG+ E+IFGDE T + ES SIASS
Sbjct: 306 HIRSKPPNPSVKEIEITNMEENENNEESPTETDCGELELIFGDEETMITSGESNSIASSP 365
Query: 414 PPQKGFNDGEKAAVAVTAWQPPAIKPKSLDRGAKIVTGLASILKENP 453
PPQ+G G+KAA+ VT WQPPAIKPKSLDRGAKIVTGLASILKENP
Sbjct: 366 PPQEGLYGGKKAALTVTPWQPPAIKPKSLDRGAKIVTGLASILKENP 412
BLAST of CmoCh17G001920 vs. ExPASy TrEMBL
Match:
A0A5D3E4G1 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold216G001530 PE=4 SV=1)
HSP 1 Score: 684.5 bits (1765), Expect = 3.0e-193
Identity = 357/406 (87.93%), Positives = 374/406 (92.12%), Query Frame = 0
Query: 55 MKIQPIDIDPPTGRVAIRADPGKPVLKSRLRKLFDRP-PNVLKNSSAEKPIAAGEAAQFI 114
MK+QPIDIDPPTGRVAIRADPGKPVLKSRLRKLFDRP PNVLKNS+AEKPIAAGEAAQFI
Sbjct: 1 MKVQPIDIDPPTGRVAIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIAAGEAAQFI 60
Query: 115 INKDGGSEFEPSSICLAKMVQSFIEESNEKQLSVATAVKNGRNRCNCFNGNNNDSSDDES 174
INKDG SEFEPSSICLAKMVQSFIEESNEKQLSVATAVKNGRNRCNCFNGNNNDSSDDES
Sbjct: 61 INKDGFSEFEPSSICLAKMVQSFIEESNEKQLSVATAVKNGRNRCNCFNGNNNDSSDDES 120
Query: 175 DDFGGGFGESVPIGSSGADVSDLLKSLILSASVAERNLLADTAKIFEKNNKIHKKKDDLR 234
DDFGGGFGE+V IGSSGADV DLLKSLIL ASVAERNLLADTAKI EKNNKIHK+KDDLR
Sbjct: 121 DDFGGGFGETVTIGSSGADVYDLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLR 180
Query: 235 KVVTDGLSSLGYDSSICKSKWDKFPSHPAGEYEYIDVIVEGERLLIDIDFRSEFEIARST 294
KVVTDGLSSLGYDSSICKSKW+K PSHPAGEYEYIDV+VE ERL+IDIDFRSEFEIARST
Sbjct: 181 KVVTDGLSSLGYDSSICKSKWEKSPSHPAGEYEYIDVMVENERLVIDIDFRSEFEIARST 240
Query: 295 GMYKAILQLLPNIFIGKPDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAKWLSPH 354
GMYK ILQL+PNIF+GK DRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAKWLSPH
Sbjct: 241 GMYKTILQLIPNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAKWLSPH 300
Query: 355 IRSKPPIPSQKEI------ENSDNEQSPTETDCGDFEMIFGDEMTP-SPPESESIASSSP 414
IRSKPP PS KEI EN +NE+SPTETDCG+ E+IFGDE T + ES SIASS P
Sbjct: 301 IRSKPPNPSVKEIEITNMEENENNEESPTETDCGELELIFGDEETMITSGESNSIASSPP 360
Query: 415 PQKGFNDGEKAAVAVTAWQPPAIKPKSLDRGAKIVTGLASILKENP 453
PQ+G G+KAA+ VT WQPPAIKPKSLDRGAKIVTGLASILKENP
Sbjct: 361 PQEGLYGGKKAALTVTPWQPPAIKPKSLDRGAKIVTGLASILKENP 406
BLAST of CmoCh17G001920 vs. ExPASy TrEMBL
Match:
A0A1S3BMM4 (uncharacterized protein LOC103491218 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103491218 PE=4 SV=1)
HSP 1 Score: 684.5 bits (1765), Expect = 3.0e-193
Identity = 357/406 (87.93%), Positives = 374/406 (92.12%), Query Frame = 0
Query: 55 MKIQPIDIDPPTGRVAIRADPGKPVLKSRLRKLFDRP-PNVLKNSSAEKPIAAGEAAQFI 114
MK+QPIDIDPPTGRVAIRADPGKPVLKSRLRKLFDRP PNVLKNS+AEKPIAAGEAAQFI
Sbjct: 1 MKVQPIDIDPPTGRVAIRADPGKPVLKSRLRKLFDRPFPNVLKNSTAEKPIAAGEAAQFI 60
Query: 115 INKDGGSEFEPSSICLAKMVQSFIEESNEKQLSVATAVKNGRNRCNCFNGNNNDSSDDES 174
INKDG SEFEPSSICLAKMVQSFIEESNEKQLSVATAVKNGRNRCNCFNGNNNDSSDDES
Sbjct: 61 INKDGFSEFEPSSICLAKMVQSFIEESNEKQLSVATAVKNGRNRCNCFNGNNNDSSDDES 120
Query: 175 DDFGGGFGESVPIGSSGADVSDLLKSLILSASVAERNLLADTAKIFEKNNKIHKKKDDLR 234
DDFGGGFGE+V IGSSGADV DLLKSLIL ASVAERNLLADTAKI EKNNKIHK+KDDLR
Sbjct: 121 DDFGGGFGETVTIGSSGADVYDLLKSLILCASVAERNLLADTAKIVEKNNKIHKRKDDLR 180
Query: 235 KVVTDGLSSLGYDSSICKSKWDKFPSHPAGEYEYIDVIVEGERLLIDIDFRSEFEIARST 294
KVVTDGLSSLGYDSSICKSKW+K PSHPAGEYEYIDV+VE ERL+IDIDFRSEFEIARST
Sbjct: 181 KVVTDGLSSLGYDSSICKSKWEKSPSHPAGEYEYIDVMVENERLVIDIDFRSEFEIARST 240
Query: 295 GMYKAILQLLPNIFIGKPDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAKWLSPH 354
GMYK ILQL+PNIF+GK DRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAKWLSPH
Sbjct: 241 GMYKTILQLIPNIFVGKTDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAKWLSPH 300
Query: 355 IRSKPPIPSQKEI------ENSDNEQSPTETDCGDFEMIFGDEMTP-SPPESESIASSSP 414
IRSKPP PS KEI EN +NE+SPTETDCG+ E+IFGDE T + ES SIASS P
Sbjct: 301 IRSKPPNPSVKEIEITNMEENENNEESPTETDCGELELIFGDEETMITSGESNSIASSPP 360
Query: 415 PQKGFNDGEKAAVAVTAWQPPAIKPKSLDRGAKIVTGLASILKENP 453
PQ+G G+KAA+ VT WQPPAIKPKSLDRGAKIVTGLASILKENP
Sbjct: 361 PQEGLYGGKKAALTVTPWQPPAIKPKSLDRGAKIVTGLASILKENP 406
BLAST of CmoCh17G001920 vs. TAIR 10
Match:
AT3G22970.1 (Protein of unknown function (DUF506))
HSP 1 Score: 352.8 bits (904), Expect = 4.0e-97
Identity = 224/411 (54.50%), Positives = 271/411 (65.94%), Query Frame = 0
Query: 55 MKIQPIDIDPPTGRVAIRADPG-KPVLKSRLRKLFDRP-PNVLKNS---SAEKP--IAAG 114
MKIQPIDID + RA+ G KPVLKSRL++LFDRP NVL+NS + EKP + G
Sbjct: 5 MKIQPIDID--SSPTVARAESGNKPVLKSRLKRLFDRPFTNVLRNSTTTTTEKPFVVTGG 64
Query: 115 EAAQFIINKDGG--SEFEPSSICLAKMVQSFIEESNEKQLSVATAVKNGRNRCNCFNGNN 174
E + GG +EFEPSS+CLAKMVQ+FIEE+NEKQ K GRNRCNCFNGNN
Sbjct: 65 EV------QCGGVVTEFEPSSVCLAKMVQNFIEENNEKQ------AKCGRNRCNCFNGNN 124
Query: 175 NDSSDDESDDFGGGFGESVPIGSSGADVSDLLKSLILSASVAERNLLADTAKIFEKNNKI 234
+ SSDDESD FGG G D SD LKSLI +V ERNLLAD AKI +KN +
Sbjct: 125 DGSSDDESDLFGGSI--------DGCDASDHLKSLIPCTTVGERNLLADAAKIVDKNKSV 184
Query: 235 HKKKDDLRKVVTDGLSSLGYDSSICKSKWDKFPSHPAGEYEYIDVIVEGERLLIDIDFRS 294
K+KDD++K+V +GL SL Y+SSICKSKWDK PS PAGEYEYIDVI+ ERL+ID+DFRS
Sbjct: 185 -KRKDDMKKIVNEGLLSLNYNSSICKSKWDKSPSFPAGEYEYIDVIIGEERLIIDVDFRS 244
Query: 295 EFEIARSTGMYKAILQLLPNIFIGKPDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYM 354
EF+IAR T YK +LQ LP IF+GK DRL QIV ++SEAA+QSLKKKGM FPPWRKAEYM
Sbjct: 245 EFDIARQTSGYKVLLQSLPFIFVGKSDRLSQIVFLISEAAKQSLKKKGMPFPPWRKAEYM 304
Query: 355 RAKWLSPHIRSKPPIPSQ----KEIENSDNEQSPTETDCGDFEMIFGDEMTPSPPESESI 414
R+KWLS + R+ + + ++ +D E D + E++F +E SP +
Sbjct: 305 RSKWLSSYTRASVVVVDETVTVTDVTAAD-AAVEKEVDSVEIELVF-EEKCLSP--RVIV 364
Query: 415 ASSSPPQKGFNDGEKAAVAVTAWQPPAIKPKSLDRGAKIVTGLASILKENP 453
SSS P G +D VAV +R K VTGLAS+ KE P
Sbjct: 365 NSSSSPTDGDDD-----VAV-------------EREVKAVTGLASLFKEKP 370
BLAST of CmoCh17G001920 vs. TAIR 10
Match:
AT4G14620.1 (Protein of unknown function (DUF506))
HSP 1 Score: 318.2 bits (814), Expect = 1.1e-86
Identity = 208/403 (51.61%), Positives = 251/403 (62.28%), Query Frame = 0
Query: 55 MKIQPIDIDPPTGRVAIRADPGKPVLKSRLRKLFDRPPNVLKNSSAEKPIAAGEAAQFII 114
MKIQPI+ D P RV KPVLKSRL++L DRP + NS EK + +G+
Sbjct: 2 MKIQPINNDLPANRV---ESSTKPVLKSRLKRLLDRPFTRISNS--EKLLISGDGVV--- 61
Query: 115 NKDGGSEFEPSSICLAKMVQSFIEESNEKQLSVATAVKNGRN--RCNCFNGNNNDSSDDE 174
G+EFEPS LAKMVQ+++EE+N+KQ KNGRN RCNCFNG NND SDDE
Sbjct: 62 ---AGTEFEPS---LAKMVQNYMEENNDKQ------TKNGRNTHRCNCFNG-NNDISDDE 121
Query: 175 SDDFGGGFGESVPIGSSGADVSDLLKSLILSASVAERNLLADTAKIFEKNNKIHKKKDDL 234
D F D KSLI S E++LL + KI EKN + K+KD+L
Sbjct: 122 LDFFD----------------YDNFKSLIQCGSFVEKSLLVEATKIIEKNKSV-KRKDEL 181
Query: 235 RKVVTDGLSSLGYDSSICKSKWDKFPSHPAGEYEYIDVIVEGERLLIDIDFRSEFEIARS 294
RK+V D LSSLGYDSSICKSKWDK S PAGEYEYIDVIV GERL+IDIDFRSEFEIAR
Sbjct: 182 RKIVVDELSSLGYDSSICKSKWDKTRSIPAGEYEYIDVIVNGERLIIDIDFRSEFEIARQ 241
Query: 295 TGMYKAILQLLPNIFIGKPDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAKWLSP 354
T YK +LQ LP IF+GK DR+ QIVSIVSEA++QSLKKKGMHFPPWRKA+YMRAKWLS
Sbjct: 242 TSGYKELLQSLPLIFVGKSDRIRQIVSIVSEASKQSLKKKGMHFPPWRKADYMRAKWLSS 301
Query: 355 HIRS----KPPIPSQKEIENSDNEQSPTETDCGDFEMIFGDEMTPSPPESESIASSSPPQ 414
+ R+ KP + S ++ + E D + E+IF +E PP I S
Sbjct: 302 YTRNSGEKKPTVTSAAKV------VAEPELDSSEIELIF-EEKVLLPPLKSPITS----- 340
Query: 415 KGFNDGEKAAVAVTAWQPPAIKPKSLDRGAKIVTGLASILKEN 452
G +D + A +S+ + AK+VTGLA + KEN
Sbjct: 362 VGRDDDDVA--------------ESVKKEAKVVTGLALLFKEN 340
BLAST of CmoCh17G001920 vs. TAIR 10
Match:
AT2G38820.2 (Protein of unknown function (DUF506))
HSP 1 Score: 262.7 bits (670), Expect = 5.5e-70
Identity = 144/303 (47.52%), Positives = 203/303 (67.00%), Query Frame = 0
Query: 55 MKIQPIDIDPPTGRVAIRADPGKPVLKSRLRKLFDRPPNVLKNSSAEKPIAAGEAAQFII 114
MKIQPID + V + + + KSRL++LF+R + +EK G + +
Sbjct: 5 MKIQPIDESDVSEEVPY-PETMRQMPKSRLKRLFER--QFTNKNVSEK--FTGSDVEAPL 64
Query: 115 NKDGGSEFEPSSICLAKMVQSFIEESN--EKQLSVATAVKNGRNRCNCFNGNNNDSSDDE 174
++ +FEPSS+CLAKMV +F+E++N EKQ + GR+RCNCF+G+ +SSDDE
Sbjct: 65 SRGNSGDFEPSSVCLAKMVLNFMEDNNGGEKQ-------RCGRSRCNCFSGSGTESSDDE 124
Query: 175 SDDFGGGFGESVPIGSSGADVSDLLKSLILSASVAERNLLADTAKIFEKNNKIHKKKDDL 234
++ S + ++LKSL+L S+ RNLL D KI E + K
Sbjct: 125 TE-------------CSSGEACEILKSLVLCKSIRVRNLLTDVTKIAETSKNCKLKDGSC 184
Query: 235 RKVVTDGLSSLGYDSSICKSKWDKFPSHPAGEYEYIDVIVEGERLLIDIDFRSEFEIARS 294
K V +GL SLGYD+++CKS+W+K PS PAGEYEY+DVI++GERLLIDIDF+S+FEIAR+
Sbjct: 185 LKSVANGLVSLGYDAALCKSRWEKSPSCPAGEYEYVDVIMKGERLLIDIDFKSKFEIARA 244
Query: 295 TGMYKAILQLLPNIFIGKPDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAKWLSP 354
T YK++LQ LP IF+GK DRL +I+ ++ +AA+QSLKKKG+H PPWR+AEY+++KWLS
Sbjct: 245 TKTYKSMLQTLPYIFVGKADRLQKIIVLICKAAKQSLKKKGLHVPPWRRAEYVKSKWLSS 282
Query: 355 HIR 356
H+R
Sbjct: 305 HVR 282
BLAST of CmoCh17G001920 vs. TAIR 10
Match:
AT2G38820.1 (Protein of unknown function (DUF506))
HSP 1 Score: 240.4 bits (612), Expect = 2.9e-63
Identity = 136/303 (44.88%), Positives = 194/303 (64.03%), Query Frame = 0
Query: 55 MKIQPIDIDPPTGRVAIRADPGKPVLKSRLRKLFDRPPNVLKNSSAEKPIAAGEAAQFII 114
MKIQPID + V + + + KSRL++LF+R + +EK G + +
Sbjct: 5 MKIQPIDESDVSEEVPY-PETMRQMPKSRLKRLFER--QFTNKNVSEK--FTGSDVEAPL 64
Query: 115 NKDGGSEFEPSSICLAKMVQSFIEESN--EKQLSVATAVKNGRNRCNCFNGNNNDSSDDE 174
++ +FEPSS+CLAKMV +F+E++N EKQ + GR+RCNCF+G+ +SSDDE
Sbjct: 65 SRGNSGDFEPSSVCLAKMVLNFMEDNNGGEKQ-------RCGRSRCNCFSGSGTESSDDE 124
Query: 175 SDDFGGGFGESVPIGSSGADVSDLLKSLILSASVAERNLLADTAKIFEKNNKIHKKKDDL 234
++ S + ++LKSL+L S+ RNLL D KI E +
Sbjct: 125 TE-------------CSSGEACEILKSLVLCKSIRVRNLLTDVTKIAETS---------- 184
Query: 235 RKVVTDGLSSLGYDSSICKSKWDKFPSHPAGEYEYIDVIVEGERLLIDIDFRSEFEIARS 294
YD+++CKS+W+K PS PAGEYEY+DVI++GERLLIDIDF+S+FEIAR+
Sbjct: 185 ------------YDAALCKSRWEKSPSCPAGEYEYVDVIMKGERLLIDIDFKSKFEIARA 244
Query: 295 TGMYKAILQLLPNIFIGKPDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAKWLSP 354
T YK++LQ LP IF+GK DRL +I+ ++ +AA+QSLKKKG+H PPWR+AEY+++KWLS
Sbjct: 245 TKTYKSMLQTLPYIFVGKADRLQKIIVLICKAAKQSLKKKGLHVPPWRRAEYVKSKWLSS 260
Query: 355 HIR 356
H+R
Sbjct: 305 HVR 260
BLAST of CmoCh17G001920 vs. TAIR 10
Match:
AT3G22970.2 (Protein of unknown function (DUF506))
HSP 1 Score: 234.2 bits (596), Expect = 2.1e-61
Identity = 137/261 (52.49%), Positives = 173/261 (66.28%), Query Frame = 0
Query: 196 LLKSLILSASVAERNLLADTAKIFEKNNKIHKKKDDLRKVVTDGLSSLGYDSSICKSKWD 255
++ SLI +V ERNLLAD AKI +KN + K+KDD++K+V +GL SL Y+SSICKSKWD
Sbjct: 22 IISSLIPCTTVGERNLLADAAKIVDKNKSV-KRKDDMKKIVNEGLLSLNYNSSICKSKWD 81
Query: 256 KFPSHPAGEYEYIDVIVEGERLLIDIDFRSEFEIARSTGMYKAILQLLPNIFIGKPDRLG 315
K PS PAGEYEYIDVI+ ERL+ID+DFRSEF+IAR T YK +LQ LP IF+GK DRL
Sbjct: 82 KSPSFPAGEYEYIDVIIGEERLIIDVDFRSEFDIARQTSGYKVLLQSLPFIFVGKSDRLS 141
Query: 316 QIVSIVSEAARQSLKKKGMHFPPWRKAEYMRAKWLSPHIRSKPPIPSQ----KEIENSDN 375
QIV ++SEAA+QSLKKKGM FPPWRKAEYMR+KWLS + R+ + + ++ +D
Sbjct: 142 QIVFLISEAAKQSLKKKGMPFPPWRKAEYMRSKWLSSYTRASVVVVDETVTVTDVTAAD- 201
Query: 376 EQSPTETDCGDFEMIFGDEMTPSPPESESIASSSPPQKGFNDGEKAAVAVTAWQPPAIKP 435
E D + E++F +E SP + SSS P G +D VAV
Sbjct: 202 AAVEKEVDSVEIELVF-EEKCLSP--RVIVNSSSSPTDGDDD-----VAV---------- 259
Query: 436 KSLDRGAKIVTGLASILKENP 453
+R K VTGLAS+ KE P
Sbjct: 262 ---EREVKAVTGLASLFKEKP 259
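The percentages on each HSP statistics line follow directly from the match counts: identity is the number of identical positions divided by the alignment length, and positives additionally counts conservative substitutions. As a rough sketch (the regular expression and the function names `percent` and `parse_hsp_stats` are assumptions made for illustration, not part of any BLAST tool), the figures can be extracted and recomputed like this:

```python
import re

# Matches the statistics line of an HSP; "Posi?tives" tolerates both
# spellings of the label that occur in exported pages.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(matches: int, length: int) -> float:
    """Percentage of aligned positions, rounded to two decimals."""
    return round(100.0 * matches / length, 2)


def parse_hsp_stats(line: str) -> dict:
    """Extract the counts from an HSP line and recompute the percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    return {
        "identity_pct": percent(int(ident), int(ident_len)),
        "positives_pct": percent(int(pos), int(pos_len)),
        "alignment_length": int(ident_len),
    }


stats = parse_hsp_stats(
    "Identity = 398/399 (99.75%), Positives = 399/399 (100.00%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])  # 99.75 100.0
```

Recomputing from the raw counts rather than trusting the printed percentage is a cheap consistency check when alignments are copied between systems.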
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
A0A6J1H4C5 | 1.4e-222 | 99.75 | uncharacterized protein LOC111460039 OS=Cucurbita moschata OX=3662 GN=LOC1114600...
A0A6J1KYK9 | 3.3e-216 | 97.49 | uncharacterized protein LOC111499376 OS=Cucurbita maxima OX=3661 GN=LOC111499376...
A0A1S3BMC1 | 1.8e-193 | 87.71 | uncharacterized protein LOC103491218 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A5D3E4G1 | 3.0e-193 | 87.93 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3BMM4 | 3.0e-193 | 87.93 | uncharacterized protein LOC103491218 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...
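For downstream filtering or sorting, the pipe-delimited summary rows can be loaded into typed records. This is a minimal sketch under the assumption that the first four fields of each data row are match name, E-value, percent identity, and description; `BlastHit` and `parse_hit_row` are hypothetical names, and the header row must be skipped by the caller.

```python
from typing import NamedTuple


class BlastHit(NamedTuple):
    name: str
    evalue: float
    identity_pct: float
    description: str


def parse_hit_row(row: str) -> BlastHit:
    """Parse one data row of the summary table; extra trailing fields are ignored."""
    fields = [field.strip() for field in row.split("|")]
    name, evalue, identity, description = fields[:4]
    return BlastHit(name, float(evalue), float(identity), description)


rows = [
    "A0A6J1H4C5 | 1.4e-222 | 99.75 | uncharacterized protein LOC111460039",
    "A0A1S3BMC1 | 1.8e-193 | 87.71 | uncharacterized protein LOC103491218 isoform X1",
]
hits = [parse_hit_row(r) for r in rows]

# Keep only hits with at least 90% identity.
close_hits = [h.name for h in hits if h.identity_pct >= 90.0]
print(close_hits)  # ['A0A6J1H4C5']
```

E-values as small as those listed here still fit in a double-precision float, so `float()` is sufficient; values reported as exactly 0.0 would need no special handling either.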