Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: utr5, CDS, polypeptide, utr3
CCTCCACGAACGTTGAACCATAGTTCAGCTCCACGTGTCTTTCTGCAAGGTTCTAAGCGTTGCATCCTTCATCTCGTTAAAATCTGCTCTGAATTTGTAGCACGTTGAACATTTTGCTTCCAATTCTGATTCTGATGAGCGAATATGAAACCATTTTTCAAGGTAATTTGAGGGACTGTGTTGTGAAGAAACTGTGTGACCTTCAGAAGTGAAGCTTCTTTTCTTCCTTCCGATTCCACATGGCACTCCCTTCTCCATCGGAGCGGTTCCAATTCCAATGAGCTAAGGTAAGAATAGAATGCTTCTTGATGAAATTTGCTTCAGATTTGGTTTTACCAATTTGCCTTTTCTTCACGAAGAGGTTCGTGTTTTAGTGTAAATGTGAAATGGAGACGATGATCAGCTCCTGCTTGTCTTCTGTCGTAGGAATTCTTGTGGATTTGGTTCTGCGTCGTACTGATTTTAGGTTTTCTCTGAGCTAATTCGTATCTTAAGAACTAATGGAAAATTATAACTTCTTATAATTCGGTGGCCTAATAATAAGCTGCACTTTCATCCTGCTCCTTCGCATTTTAATCGCAAGAGAAGCGAGGAGAAACTGGAAACGAATAGAAGAAGCAAGATACCTATAATTTATGCTCTTCAAGTCAATGAGTTTGTTAATGTATTGGGTTTCGGTCATTGCGATTACTTCACCTAAACTCTATCTCTGAGGTTCAGCTCCATCGTGTCTAAAAGAGGCTTCGTTATCAATTCGCTTGGCACTCTGTTTCAATCTACTTATGTCAAAATGTAATCTAGGGCTGTGGTTGATGAAAGATTCTTCCATCTAAATGCATCCAGAAGGAAAAAATTAATCATATGCACTGATAAATACCTTGATCTATCTATTCTAACCACACACACACGCACACAAAAGAAAAAACTTGCATAGCGTGGAGTTATTGTTATTATCTAGAATTTTGCTAATGTGGCGTCAATAGTTAACATTCTATCTTGATGATATTATACTTGAAAAGTTGTACTACAAATTAAAAGGTATATTCTTGTCACAATGTTGGAAATGTGGAACTGCTTCAATACTCTCTCTGGACAATATGTGTTCAACTGTTTAATGTCAAATATTTCTTGTGTTCTTTGATTTTATTATCTTACAATTTACTGTTATATTGTTTTATGTCTAACTAATTACAGTAAACCTCAAGCTTAATTGCTATAGGTTTCTTCAATAAATTCATAATCTAGTGTTGTAACAACTATAAATAGTAGAAATATGAAGAAGCGAGAGGGGTTTTGGAATTCTAAGAAGTTTGTCCCTTAGGAAAGGATTTCTGGACGAAGAAAAATAGTTATGAGAAAGAATGATTAGAGAGGAGATTCCATGGTTTCTGGAGAAACTGGAAACTCTGACATTTGGTTTGGTGATAGGCTGATAGCCTTCTTATTTTATTTATGTTTGAAATCTGCAGTAAATTTATTTTTTAACTTCAAAACATGTAAAATAGTTTGTGACTCACCAAATAAAACATGGAAAATAGTTTTTTCCTCTTAAAACTTGTTTGGTTGGATGAAAATAATTGTGCTATGCCAACAAAATGGAAATATTGTTTCTCAAATTGAAAGCATTACTATTAATTATCTTTCCTTTTGAAAGCTTCCATCAAAATGGAAATATATCCATGTCCTGACATTTATTGTTCATAAGCTAATATTTATTTTTCATCTATCCATGCGTATCTGGATCTAAACACTATCAGAATTTTCTTAATGCTGTTACACGATCCCTCTAATGCTATAGATTTTTTGTATGCTAACACGTGTAAATTAAATGGAAATAGAAGACCTAGCAATATCTCTCCAAATAAGGTATCTGTGACTGTAACTCTGCTTTAAAATTTAATATGAAAAACAGTATCTAAAACAACTTTTAGCATATATACCTGAATACTCTTTCACTTTCCTTTCAAGTGTTAAACTATGAATTATACTAATTTCTAAAT
GGTTATATATCCACGTCCTGACATTTATTGTTCATATATCCATGTGTATGTTGATATAAAAACTATCAAAATTGTCCTACTGCGGTAACTTGATCTCTCTAATATTTTTATTATGTATTTACTTAAAATTTTAAACTTTTCAGATGGTTCAAAAACCCATAGACTCCAAATTCAGTGAATATGGGCATGGAAATTCTGGGAAGGACGTGCCTCATGAAAAGCAACTGCAGATTTCTGCAAAGAAGACAGCATTAAGGGACTTGCAAAATGAGAATAGGGTCACAGCTTCCAATTGTACTGGAAGCTTCCCTCTTTTGAAGGAAGGAGGTCCTGGTAGTGACTTCATTAAAGTTTCTGCTAACAAGAGACCCTCAACTGTCTGCCCAACGAGTCCGCCTCATCTCCATTCTTCAACCTCTAATGCTGCAAATGGGCATCTTGTTTACGTCCGTAGAAAATCCGATGCTGATATAGGGAAGAATAGTCCTGGTGATAGTACGAGCATAAAAGCTGATTATCCAAATCTAAGTAAACTTGGTCAACTAGATGAAACCGTGCATCTCAAATCTCAGGTTAAGGAGCTAAAGAATCATTGCTTTCCAGCATTTGCTCCTTTTCCAGTGGTTCCTCCCATGAATGCATCTGGAACACCTTCAGTTCCTCATCACATTGGGAAGTATGGCATTAATTTAGCTACAGCAGAGTCAAACTTCCATTCTGCACTTTCTACTGTCCCTTCAGTAGGCATCCCACCAGGATGGAAAAACTTGCAGTGGGAAGACAGATATCATCAGTTGCAGTTGTTATTGAATAAATTGGATCAATCAGACCAACAAGATTATCTTCAGGGTATGCTTTACATCTAAGAAAGTGCTTCATCTTATTGCTGAAATAGTTTGAACGAACCTTATTAACTCATTGAATAATTTCCCTTTTACAAGTGCTTCGATCACTGTCATCAGTTGAACTTAGCAGGCATGCAGTTGCGTTGGAAAAGAGATCCATTCAGCTCTCGCTTGAGGAAGGTATTTTGATATTTGGCTTTACATAACTTCACGTTCCTTTTCTGTGATCTTGGAAGAACTAGCGGTCTACAAAATAAGTTATTAACTAAACATTACGTTTGATTCCATTATTTTTGTTCATTGTGGCCTTTCGTCTTTCCATATTACTTCTAAATTTTGACTTGCTACGGTGATTTTCTTTCTTTAGACTGAAGAAGTCTTGGATTTGGATATCTGATATTCTCAGAGGTTGGATAATATAGTGATTCAATAGCATAACATCAAACTTTCGTGAAATTATTTTTTGTTAATATGTTAGTTGAGACTTCATCCCTACTTGTCTAAAGCTTACCCTCTTTTGAGTTGGAAAAATTTTCAAATTTGTTTTCAATCTTGGGCACAGAACTGTTTTGCTTTTGCAAAAGATTGAAATATCTAAAACATACAAGGAGAAATACAACTGTCAGTTGGTTTCTCATGAGCATTTAACATTACTTTTATGAAAGGCTTGCCATTTGATGAACCTTAAGATATCCATCTTGCTAGGAAGTCTCTCACTGCCTTTCCTTTCGATTCATAGCGAAATGCTGTAAATAGTTATTCCTTTATTATCTCCCCAAAGTAAATTTGAAGCTTCATCTTTTGATTTCTGATAAACCTCGTATTTTTTAGTTCACACAGATAGGCATCTATCAGCTATGAAACATCAGACAGTTTCTTTTTGCTCTATAATATGATCAAAGTAACACGATTCCTCTTTTGTTTCCGACAGCGAAAGAGTTGCGGCGAGTTGGGGTTCTGAATGTGCTGGGAAATCCTGGGAAGAATATCAAAGTGCCATTGGCTCATCAAGACGGATCAGAGACATAAGGGCATGGACATAACATATTTTTATTTTTCACCCTGTTGTTCTTGTTCTTCTTGCTGCGCGAAGATAGCCTACCGACTACCGTACTGCGGTCGCACCGAGTTTCGCCGCCTGACCAGGTGCTTGCT
GTGTTTGTTGTAATATTCTGCCTACTGTTTCTCCCTCTAATAATATTTTCATGTGTTTGGCGAAGGTTGTAAGAAAAGATTGCAACTGTCTTGACTGTTACCAGGAAGAAAAAGGAAACAAAATCCCTCTGGTTTCGCTCTCCTCAGTACTCAGCTCCCTTTTTATTTATTCTCCCAGCCAAATCACTTGCATATTTTTTCTTCTTCTATTCCCTAGTTTAATCCTATTCAAATTTATGTTCCCATTTTATAAGATAGAAAGTATTAAGGGCCAAAATAAAGGGTAAACGTTAACCGCCTTGGCTTGTTGGTCATCCATTTTTATCGGGTTTGATCATTAGTAACCACCTACCTCGTGTTTATATTTTGCGAGCTTCATTTGACAACTAAACAAAATAGAGTCA
mRNA sequence
CCTCCACGAACGTTGAACCATAGTTCAGCTCCACGTGTCTTTCTGCAAGGTTCTAAGCGTTGCATCCTTCATCTCGTTAAAATCTGCTCTGAATTTGTAGCACGTTGAACATTTTGCTTCCAATTCTGATTCTGATGAGCGAATATGAAACCATTTTTCAAGGTAATTTGAGGGACTGTGTTGTGAAGAAACTGTGTGACCTTCAGAAGTGAAGCTTCTTTTCTTCCTTCCGATTCCACATGGCACTCCCTTCTCCATCGGAGCGGTTCCAATTCCAATGAGCTAAGATGGTTCAAAAACCCATAGACTCCAAATTCAGTGAATATGGGCATGGAAATTCTGGGAAGGACGTGCCTCATGAAAAGCAACTGCAGATTTCTGCAAAGAAGACAGCATTAAGGGACTTGCAAAATGAGAATAGGGTCACAGCTTCCAATTGTACTGGAAGCTTCCCTCTTTTGAAGGAAGGAGGTCCTGGTAGTGACTTCATTAAAGTTTCTGCTAACAAGAGACCCTCAACTGTCTGCCCAACGAGTCCGCCTCATCTCCATTCTTCAACCTCTAATGCTGCAAATGGGCATCTTGTTTACGTCCGTAGAAAATCCGATGCTGATATAGGGAAGAATAGTCCTGGTGATAGTACGAGCATAAAAGCTGATTATCCAAATCTAAGTAAACTTGGTCAACTAGATGAAACCGTGCATCTCAAATCTCAGGTTAAGGAGCTAAAGAATCATTGCTTTCCAGCATTTGCTCCTTTTCCAGTGGTTCCTCCCATGAATGCATCTGGAACACCTTCAGTTCCTCATCACATTGGGAAGTATGGCATTAATTTAGCTACAGCAGAGTCAAACTTCCATTCTGCACTTTCTACTGTCCCTTCAGTAGGCATCCCACCAGGATGGAAAAACTTGCAGTGGGAAGACAGATATCATCAGTTGCAGTTGTTATTGAATAAATTGGATCAATCAGACCAACAAGATTATCTTCAGGTGCTTCGATCACTGTCATCAGTTGAACTTAGCAGGCATGCAGTTGCGTTGGAAAAGAGATCCATTCAGCTCTCGCTTGAGGAAGCGAAAGAGTTGCGGCGAGTTGGGGTTCTGAATGTGCTGGGAAATCCTGGGAAGAATATCAAAGTGCCATTGGCTCATCAAGACGGATCAGAGACATAAGGGCATGGACATAACATATTTTTATTTTTCACCCTGTTGTTCTTGTTCTTCTTGCTGCGCGAAGATAGCCTACCGACTACCGTACTGCGGTCGCACCGAGTTTCGCCGCCTGACCAGGTTGTAAGAAAAGATTGCAACTGTCTTGACTGTTACCAGGAAGAAAAAGGAAACAAAATCCCTCTGGTTTCGCTCTCCTCAGTACTCAGCTCCCTTTTTATTTATTCTCCCAGCCAAATCACTTGCATATTTTTTCTTCTTCTATTCCCTAGTTTAATCCTATTCAAATTTATGTTCCCATTTTATAAGATAGAAAGTATTAAGGGCCAAAATAAAGGGTAAACGTTAACCGCCTTGGCTTGTTGGTCATCCATTTTTATCGGGTTTGATCATTAGTAACCACCTACCTCGTGTTTATATTTTGCGAGCTTCATTTGACAACTAAACAAAATAGAGTCA
Coding sequence (CDS)
ATGGTTCAAAAACCCATAGACTCCAAATTCAGTGAATATGGGCATGGAAATTCTGGGAAGGACGTGCCTCATGAAAAGCAACTGCAGATTTCTGCAAAGAAGACAGCATTAAGGGACTTGCAAAATGAGAATAGGGTCACAGCTTCCAATTGTACTGGAAGCTTCCCTCTTTTGAAGGAAGGAGGTCCTGGTAGTGACTTCATTAAAGTTTCTGCTAACAAGAGACCCTCAACTGTCTGCCCAACGAGTCCGCCTCATCTCCATTCTTCAACCTCTAATGCTGCAAATGGGCATCTTGTTTACGTCCGTAGAAAATCCGATGCTGATATAGGGAAGAATAGTCCTGGTGATAGTACGAGCATAAAAGCTGATTATCCAAATCTAAGTAAACTTGGTCAACTAGATGAAACCGTGCATCTCAAATCTCAGGTTAAGGAGCTAAAGAATCATTGCTTTCCAGCATTTGCTCCTTTTCCAGTGGTTCCTCCCATGAATGCATCTGGAACACCTTCAGTTCCTCATCACATTGGGAAGTATGGCATTAATTTAGCTACAGCAGAGTCAAACTTCCATTCTGCACTTTCTACTGTCCCTTCAGTAGGCATCCCACCAGGATGGAAAAACTTGCAGTGGGAAGACAGATATCATCAGTTGCAGTTGTTATTGAATAAATTGGATCAATCAGACCAACAAGATTATCTTCAGGTGCTTCGATCACTGTCATCAGTTGAACTTAGCAGGCATGCAGTTGCGTTGGAAAAGAGATCCATTCAGCTCTCGCTTGAGGAAGCGAAAGAGTTGCGGCGAGTTGGGGTTCTGAATGTGCTGGGAAATCCTGGGAAGAATATCAAAGTGCCATTGGCTCATCAAGACGGATCAGAGACATAA
Protein sequence
MVQKPIDSKFSEYGHGNSGKDVPHEKQLQISAKKTALRDLQNENRVTASNCTGSFPLLKEGGPGSDFIKVSANKRPSTVCPTSPPHLHSSTSNAANGHLVYVRRKSDADIGKNSPGDSTSIKADYPNLSKLGQLDETVHLKSQVKELKNHCFPAFAPFPVVPPMNASGTPSVPHHIGKYGINLATAESNFHSALSTVPSVGIPPGWKNLQWEDRYHQLQLLLNKLDQSDQQDYLQVLRSLSSVELSRHAVALEKRSIQLSLEEAKELRRVGVLNVLGNPGKNIKVPLAHQDGSET
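The protein sequence above should follow from the CDS by codon-by-codon translation. A minimal sketch of that check, assuming the standard nuclear genetic code (the page does not state which translation table was used) and a hypothetical helper named `translate`:

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (third base varies fastest). Assumed here; the source page does not
# name the translation table.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Codons containing characters outside T/C/A/G are rendered as 'X'.
    A trailing partial codon is ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First five codons and last four codons of the CDS listed above.
print(translate("ATGGTTCAAAAACCC"))
print(translate("TCAGAGACATAA"))
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence is a quick way to confirm that the two records agree.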
Homology
BLAST of MC02g0354 vs. NCBI nr
Match:
XP_022140529.1 (uncharacterized protein LOC111011167 [Momordica charantia])
HSP 1 Score: 595 bits (1533), Expect = 2.00e-214
Identity = 294/295 (99.66%), Positives = 295/295 (100.00%), Query Frame = 0
Query: 1 MVQKPIDSKFSEYGHGNSGKDVPHEKQLQISAKKTALRDLQNENRVTASNCTGSFPLLKE 60
MVQKPIDSKFSEYGHGNSGKDVPHEKQLQISAKKTALRDLQNENRVTASNCTGSFPLLKE
Sbjct: 1 MVQKPIDSKFSEYGHGNSGKDVPHEKQLQISAKKTALRDLQNENRVTASNCTGSFPLLKE 60
Query: 61 GGPGSDFIKVSANKRPSTVCPTSPPHLHSSTSNAANGHLVYVRRKSDADIGKNSPGDSTS 120
GGPGSDFIKVSANKRPSTVCPTSPPHLHSSTSNAANGHLVYVRRKSDADIGKNSPGDSTS
Sbjct: 61 GGPGSDFIKVSANKRPSTVCPTSPPHLHSSTSNAANGHLVYVRRKSDADIGKNSPGDSTS 120
Query: 121 IKADYPNLSKLGQLDETVHLKSQVKELKNHCFPAFAPFPVVPPMNASGTPSVPHHIGKYG 180
IKADYPNLSKLGQLDETVHLKSQVKELKNHCFPAFAPFPVVPPMNASGTPSVPHHIGKYG
Sbjct: 121 IKADYPNLSKLGQLDETVHLKSQVKELKNHCFPAFAPFPVVPPMNASGTPSVPHHIGKYG 180
Query: 181 INLATAESNFHSALSTVPSVGIPPGWKNLQWEDRYHQLQLLLNKLDQSDQQDYLQVLRSL 240
INLATAESNFHSALSTVPSVGIPPGWKNLQWEDRYHQLQLLLNKLDQSDQQDYLQVLRSL
Sbjct: 181 INLATAESNFHSALSTVPSVGIPPGWKNLQWEDRYHQLQLLLNKLDQSDQQDYLQVLRSL 240
Query: 241 SSVELSRHAVALEKRSIQLSLEEAKELRRVGVLNVLGNPGKNIKVPLAHQDGSET 295
SSVELSRHAVALEKRSIQLSLEEAKEL+RVGVLNVLGNPGKNIKVPLAHQDGSET
Sbjct: 241 SSVELSRHAVALEKRSIQLSLEEAKELQRVGVLNVLGNPGKNIKVPLAHQDGSET 295
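The Identity figure in each HSP header is the number of alignment columns where query and subject carry the same residue, divided by the alignment length. A small sketch that recomputes it from aligned rows; the function names are illustrative, not part of any BLAST tool:

```python
def count_identities(query: str, sbjct: str) -> tuple[int, int]:
    """Return (identical columns, alignment length) for two aligned rows.

    Both rows must have the same length; gap characters ('-') never
    count as identities.
    """
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have equal length")
    matches = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
    return matches, len(query)


def percent(matches: int, length: int) -> float:
    """Percentage rounded to two decimals, as printed in the HSP header."""
    return round(100.0 * matches / length, 2)


# Last segment (query positions 241-295) of the alignment above.
query = "SSVELSRHAVALEKRSIQLSLEEAKELRRVGVLNVLGNPGKNIKVPLAHQDGSET"
sbjct = "SSVELSRHAVALEKRSIQLSLEEAKELQRVGVLNVLGNPGKNIKVPLAHQDGSET"
matches, length = count_identities(query, sbjct)
print(matches, length)

# Whole-HSP figure from the header line: 294 identities over 295 columns.
print(percent(294, 295))
```

The single R/Q difference in this segment is the only mismatch in the HSP, which is why the header reports 294/295.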
BLAST of MC02g0354 vs. NCBI nr
Match:
XP_038878250.1 (uncharacterized protein LOC120070536 [Benincasa hispida])
HSP 1 Score: 495 bits (1275), Expect = 4.33e-175
Identity = 253/296 (85.47%), Positives = 260/296 (87.84%), Query Frame = 0
Query: 1 MVQKPIDSKFSEYGHGNSGKDVP-HEKQLQISAKKTALRDLQNENRVTASNCTGSFPLLK 60
MVQK IDSKFSEYGHGNSGKDV EKQLQISAKKTALRDLQN+NR+TASNC GS PLLK
Sbjct: 1 MVQKSIDSKFSEYGHGNSGKDVSSQEKQLQISAKKTALRDLQNDNRITASNCPGSSPLLK 60
Query: 61 EGGPGSDFIKVSANKRPSTVCPTSPPHLHSSTSNAANGHLVYVRRKSDADIGKNSPGDST 120
E G SD IKVS NKR S VCP SP HLHSS SNAANGHLVYVRRKSDADIGKNSP +T
Sbjct: 61 ERGTSSDIIKVSGNKRASPVCPASPSHLHSSPSNAANGHLVYVRRKSDADIGKNSPCGNT 120
Query: 121 SIKADYPNLSKLGQLDETVHLKSQVKELKNHCFPAFAPFPVVPPMNASGTPSVPHHIGKY 180
S KADYPNL KLGQL ET HLKSQVKEL+NHCF AFAPFP+V PMNA G PSVPHH+GK
Sbjct: 121 STKADYPNLHKLGQLAETAHLKSQVKELQNHCFQAFAPFPMVSPMNAPGKPSVPHHVGKC 180
Query: 181 GINLATAESNFHSALSTVPSVGIPPGWKNLQWEDRYHQLQLLLNKLDQSDQQDYLQVLRS 240
G NLATAESNF SA ST PSVGIP GWKNLQWEDRYHQLQLLLNKLDQSDQQDYLQVLRS
Sbjct: 181 GTNLATAESNFRSAPSTAPSVGIPTGWKNLQWEDRYHQLQLLLNKLDQSDQQDYLQVLRS 240
Query: 241 LSSVELSRHAVALEKRSIQLSLEEAKELRRVGVLNVLGNPGKNIKVPLAHQDGSET 295
LSSVELSRHAV LEKRSIQLSLEEAKEL+RVGVLNVLGNP KNIK PL HQDGSET
Sbjct: 241 LSSVELSRHAVELEKRSIQLSLEEAKELQRVGVLNVLGNPVKNIKAPLTHQDGSET 296
BLAST of MC02g0354 vs. NCBI nr
Match:
XP_022986425.1 (uncharacterized protein LOC111484175 [Cucurbita maxima])
HSP 1 Score: 487 bits (1253), Expect = 6.52e-172
Identity = 250/296 (84.46%), Positives = 261/296 (88.18%), Query Frame = 0
Query: 1 MVQKPIDSKFSEYGHGNSGKDVP-HEKQLQISAKKTALRDLQNENRVTASNCTGSFPLLK 60
MVQK IDSKFSEYGHGNSGKDVP EKQLQISAKKTALRDLQN+NRVTASNCTGS PLLK
Sbjct: 1 MVQKSIDSKFSEYGHGNSGKDVPSQEKQLQISAKKTALRDLQNDNRVTASNCTGSSPLLK 60
Query: 61 EGGPGSDFIKVSANKRPSTVCPTSPPHLHSSTSNAANGHLVYVRRKSDADIGKNSPGDST 120
E GP SDFIKVS N P +P HLHSSTSNA+NGHLVYVRRKSDADIGKNSP DST
Sbjct: 61 ERGPSSDFIKVSGNN------PATPSHLHSSTSNASNGHLVYVRRKSDADIGKNSPCDST 120
Query: 121 SIKADYPNLSKLGQLDETVHLKSQVKELKNHCFPAFAPFPVVPPMNASGTPSVPHHIGKY 180
+IK DYPNLSKLGQL ET HLKSQVKEL+NHCFPAFAPFP+V PMNASG PSVPHH+GKY
Sbjct: 121 NIKGDYPNLSKLGQLAETAHLKSQVKELQNHCFPAFAPFPMVSPMNASGKPSVPHHVGKY 180
Query: 181 GINLATAESNFHSALSTVPSVGIPPGWKNLQWEDRYHQLQLLLNKLDQSDQQDYLQVLRS 240
GIN TAESNFH A STVPS GWKNLQWEDRYHQLQLLLNKLDQSDQQDYLQVLRS
Sbjct: 181 GINFTTAESNFHPAPSTVPS-----GWKNLQWEDRYHQLQLLLNKLDQSDQQDYLQVLRS 240
Query: 241 LSSVELSRHAVALEKRSIQLSLEEAKELRRVGVLNVLGNPGKNIKVPLAHQDGSET 295
LSSVELSRHAV LE+RSIQLSLEEAKEL+RVGVLNVLGNP K+IK PL HQ+GSET
Sbjct: 241 LSSVELSRHAVELERRSIQLSLEEAKELQRVGVLNVLGNPVKSIKTPLTHQNGSET 285
BLAST of MC02g0354 vs. NCBI nr
Match:
XP_022943750.1 (uncharacterized protein LOC111448407 [Cucurbita moschata] >KAG7010816.1 hypothetical protein SDJN02_27612, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 481 bits (1239), Expect = 8.85e-170
Identity = 248/296 (83.78%), Positives = 259/296 (87.50%), Query Frame = 0
Query: 1 MVQKPIDSKFSEYGHGNSGKDVP-HEKQLQISAKKTALRDLQNENRVTASNCTGSFPLLK 60
MVQK IDSKFSEYGHGNSGKDVP EKQLQISAKKTALRDLQN+NRVTASNCTGS PLLK
Sbjct: 1 MVQKSIDSKFSEYGHGNSGKDVPSQEKQLQISAKKTALRDLQNDNRVTASNCTGSSPLLK 60
Query: 61 EGGPGSDFIKVSANKRPSTVCPTSPPHLHSSTSNAANGHLVYVRRKSDADIGKNSPGDST 120
E GP SDFIKVS N P +P HLHSSTSNA+NGHLVYVRRKS+ADIGKNSP DST
Sbjct: 61 ERGPSSDFIKVSGNN------PATPSHLHSSTSNASNGHLVYVRRKSEADIGKNSPCDST 120
Query: 121 SIKADYPNLSKLGQLDETVHLKSQVKELKNHCFPAFAPFPVVPPMNASGTPSVPHHIGKY 180
+IK DYPNLSKLGQL ET HLKSQVKEL+ CFPAFAPFP+V PMNASG PSVPHH+GKY
Sbjct: 121 NIKGDYPNLSKLGQLAETAHLKSQVKELQTRCFPAFAPFPMVSPMNASGKPSVPHHVGKY 180
Query: 181 GINLATAESNFHSALSTVPSVGIPPGWKNLQWEDRYHQLQLLLNKLDQSDQQDYLQVLRS 240
GIN ATAESNFH A STVPS GWKNLQWEDRYHQLQLLLNKLDQSDQQDYLQVLRS
Sbjct: 181 GINFATAESNFHPAPSTVPS-----GWKNLQWEDRYHQLQLLLNKLDQSDQQDYLQVLRS 240
Query: 241 LSSVELSRHAVALEKRSIQLSLEEAKELRRVGVLNVLGNPGKNIKVPLAHQDGSET 295
LSSVELSRHAV LE+RSIQLSLEEAKEL+RVGVLNVLGNP K+IK PL H DGSET
Sbjct: 241 LSSVELSRHAVELERRSIQLSLEEAKELQRVGVLNVLGNPVKSIKTPLTHHDGSET 285
BLAST of MC02g0354 vs. NCBI nr
Match:
KAG6570979.1 (hypothetical protein SDJN03_29894, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 481 bits (1239), Expect = 6.45e-169
Identity = 248/296 (83.78%), Positives = 259/296 (87.50%), Query Frame = 0
Query: 1 MVQKPIDSKFSEYGHGNSGKDVP-HEKQLQISAKKTALRDLQNENRVTASNCTGSFPLLK 60
MVQK IDSKFSEYGHGNSGKDVP EKQLQISAKKTALRDLQN+NRVTASNCTGS PLLK
Sbjct: 16 MVQKSIDSKFSEYGHGNSGKDVPSQEKQLQISAKKTALRDLQNDNRVTASNCTGSSPLLK 75
Query: 61 EGGPGSDFIKVSANKRPSTVCPTSPPHLHSSTSNAANGHLVYVRRKSDADIGKNSPGDST 120
E GP SDFIKVS N P +P HLHSSTSNA+NGHLVYVRRKS+ADIGKNSP DST
Sbjct: 76 ERGPSSDFIKVSGNN------PATPSHLHSSTSNASNGHLVYVRRKSEADIGKNSPCDST 135
Query: 121 SIKADYPNLSKLGQLDETVHLKSQVKELKNHCFPAFAPFPVVPPMNASGTPSVPHHIGKY 180
+IK DYPNLSKLGQL ET HLKSQVKEL+ CFPAFAPFP+V PMNASG PSVPHH+GKY
Sbjct: 136 NIKGDYPNLSKLGQLAETAHLKSQVKELQTRCFPAFAPFPMVSPMNASGKPSVPHHVGKY 195
Query: 181 GINLATAESNFHSALSTVPSVGIPPGWKNLQWEDRYHQLQLLLNKLDQSDQQDYLQVLRS 240
GIN ATAESNFH A STVPS GWKNLQWEDRYHQLQLLLNKLDQSDQQDYLQVLRS
Sbjct: 196 GINFATAESNFHPAPSTVPS-----GWKNLQWEDRYHQLQLLLNKLDQSDQQDYLQVLRS 255
Query: 241 LSSVELSRHAVALEKRSIQLSLEEAKELRRVGVLNVLGNPGKNIKVPLAHQDGSET 295
LSSVELSRHAV LE+RSIQLSLEEAKEL+RVGVLNVLGNP K+IK PL H DGSET
Sbjct: 256 LSSVELSRHAVELERRSIQLSLEEAKELQRVGVLNVLGNPVKSIKTPLTHHDGSET 300
BLAST of MC02g0354 vs. ExPASy TrEMBL
Match:
A0A6J1CFY6 (uncharacterized protein LOC111011167 OS=Momordica charantia OX=3673 GN=LOC111011167 PE=4 SV=1)
HSP 1 Score: 595 bits (1533), Expect = 9.68e-215
Identity = 294/295 (99.66%), Positives = 295/295 (100.00%), Query Frame = 0
Query: 1 MVQKPIDSKFSEYGHGNSGKDVPHEKQLQISAKKTALRDLQNENRVTASNCTGSFPLLKE 60
MVQKPIDSKFSEYGHGNSGKDVPHEKQLQISAKKTALRDLQNENRVTASNCTGSFPLLKE
Sbjct: 1 MVQKPIDSKFSEYGHGNSGKDVPHEKQLQISAKKTALRDLQNENRVTASNCTGSFPLLKE 60
Query: 61 GGPGSDFIKVSANKRPSTVCPTSPPHLHSSTSNAANGHLVYVRRKSDADIGKNSPGDSTS 120
GGPGSDFIKVSANKRPSTVCPTSPPHLHSSTSNAANGHLVYVRRKSDADIGKNSPGDSTS
Sbjct: 61 GGPGSDFIKVSANKRPSTVCPTSPPHLHSSTSNAANGHLVYVRRKSDADIGKNSPGDSTS 120
Query: 121 IKADYPNLSKLGQLDETVHLKSQVKELKNHCFPAFAPFPVVPPMNASGTPSVPHHIGKYG 180
IKADYPNLSKLGQLDETVHLKSQVKELKNHCFPAFAPFPVVPPMNASGTPSVPHHIGKYG
Sbjct: 121 IKADYPNLSKLGQLDETVHLKSQVKELKNHCFPAFAPFPVVPPMNASGTPSVPHHIGKYG 180
Query: 181 INLATAESNFHSALSTVPSVGIPPGWKNLQWEDRYHQLQLLLNKLDQSDQQDYLQVLRSL 240
INLATAESNFHSALSTVPSVGIPPGWKNLQWEDRYHQLQLLLNKLDQSDQQDYLQVLRSL
Sbjct: 181 INLATAESNFHSALSTVPSVGIPPGWKNLQWEDRYHQLQLLLNKLDQSDQQDYLQVLRSL 240
Query: 241 SSVELSRHAVALEKRSIQLSLEEAKELRRVGVLNVLGNPGKNIKVPLAHQDGSET 295
SSVELSRHAVALEKRSIQLSLEEAKEL+RVGVLNVLGNPGKNIKVPLAHQDGSET
Sbjct: 241 SSVELSRHAVALEKRSIQLSLEEAKELQRVGVLNVLGNPGKNIKVPLAHQDGSET 295
BLAST of MC02g0354 vs. ExPASy TrEMBL
Match:
A0A6J1JE12 (uncharacterized protein LOC111484175 OS=Cucurbita maxima OX=3661 GN=LOC111484175 PE=4 SV=1)
HSP 1 Score: 487 bits (1253), Expect = 3.16e-172
Identity = 250/296 (84.46%), Positives = 261/296 (88.18%), Query Frame = 0
Query: 1 MVQKPIDSKFSEYGHGNSGKDVP-HEKQLQISAKKTALRDLQNENRVTASNCTGSFPLLK 60
MVQK IDSKFSEYGHGNSGKDVP EKQLQISAKKTALRDLQN+NRVTASNCTGS PLLK
Sbjct: 1 MVQKSIDSKFSEYGHGNSGKDVPSQEKQLQISAKKTALRDLQNDNRVTASNCTGSSPLLK 60
Query: 61 EGGPGSDFIKVSANKRPSTVCPTSPPHLHSSTSNAANGHLVYVRRKSDADIGKNSPGDST 120
E GP SDFIKVS N P +P HLHSSTSNA+NGHLVYVRRKSDADIGKNSP DST
Sbjct: 61 ERGPSSDFIKVSGNN------PATPSHLHSSTSNASNGHLVYVRRKSDADIGKNSPCDST 120
Query: 121 SIKADYPNLSKLGQLDETVHLKSQVKELKNHCFPAFAPFPVVPPMNASGTPSVPHHIGKY 180
+IK DYPNLSKLGQL ET HLKSQVKEL+NHCFPAFAPFP+V PMNASG PSVPHH+GKY
Sbjct: 121 NIKGDYPNLSKLGQLAETAHLKSQVKELQNHCFPAFAPFPMVSPMNASGKPSVPHHVGKY 180
Query: 181 GINLATAESNFHSALSTVPSVGIPPGWKNLQWEDRYHQLQLLLNKLDQSDQQDYLQVLRS 240
GIN TAESNFH A STVPS GWKNLQWEDRYHQLQLLLNKLDQSDQQDYLQVLRS
Sbjct: 181 GINFTTAESNFHPAPSTVPS-----GWKNLQWEDRYHQLQLLLNKLDQSDQQDYLQVLRS 240
Query: 241 LSSVELSRHAVALEKRSIQLSLEEAKELRRVGVLNVLGNPGKNIKVPLAHQDGSET 295
LSSVELSRHAV LE+RSIQLSLEEAKEL+RVGVLNVLGNP K+IK PL HQ+GSET
Sbjct: 241 LSSVELSRHAVELERRSIQLSLEEAKELQRVGVLNVLGNPVKSIKTPLTHQNGSET 285
BLAST of MC02g0354 vs. ExPASy TrEMBL
Match:
A0A6J1FY79 (uncharacterized protein LOC111448407 OS=Cucurbita moschata OX=3662 GN=LOC111448407 PE=4 SV=1)
HSP 1 Score: 481 bits (1239), Expect = 4.28e-170
Identity = 248/296 (83.78%), Positives = 259/296 (87.50%), Query Frame = 0
Query: 1 MVQKPIDSKFSEYGHGNSGKDVP-HEKQLQISAKKTALRDLQNENRVTASNCTGSFPLLK 60
MVQK IDSKFSEYGHGNSGKDVP EKQLQISAKKTALRDLQN+NRVTASNCTGS PLLK
Sbjct: 1 MVQKSIDSKFSEYGHGNSGKDVPSQEKQLQISAKKTALRDLQNDNRVTASNCTGSSPLLK 60
Query: 61 EGGPGSDFIKVSANKRPSTVCPTSPPHLHSSTSNAANGHLVYVRRKSDADIGKNSPGDST 120
E GP SDFIKVS N P +P HLHSSTSNA+NGHLVYVRRKS+ADIGKNSP DST
Sbjct: 61 ERGPSSDFIKVSGNN------PATPSHLHSSTSNASNGHLVYVRRKSEADIGKNSPCDST 120
Query: 121 SIKADYPNLSKLGQLDETVHLKSQVKELKNHCFPAFAPFPVVPPMNASGTPSVPHHIGKY 180
+IK DYPNLSKLGQL ET HLKSQVKEL+ CFPAFAPFP+V PMNASG PSVPHH+GKY
Sbjct: 121 NIKGDYPNLSKLGQLAETAHLKSQVKELQTRCFPAFAPFPMVSPMNASGKPSVPHHVGKY 180
Query: 181 GINLATAESNFHSALSTVPSVGIPPGWKNLQWEDRYHQLQLLLNKLDQSDQQDYLQVLRS 240
GIN ATAESNFH A STVPS GWKNLQWEDRYHQLQLLLNKLDQSDQQDYLQVLRS
Sbjct: 181 GINFATAESNFHPAPSTVPS-----GWKNLQWEDRYHQLQLLLNKLDQSDQQDYLQVLRS 240
Query: 241 LSSVELSRHAVALEKRSIQLSLEEAKELRRVGVLNVLGNPGKNIKVPLAHQDGSET 295
LSSVELSRHAV LE+RSIQLSLEEAKEL+RVGVLNVLGNP K+IK PL H DGSET
Sbjct: 241 LSSVELSRHAVELERRSIQLSLEEAKELQRVGVLNVLGNPVKSIKTPLTHHDGSET 285
BLAST of MC02g0354 vs. ExPASy TrEMBL
Match:
A0A6J1G8C0 (uncharacterized protein LOC111451757 OS=Cucurbita moschata OX=3662 GN=LOC111451757 PE=4 SV=1)
HSP 1 Score: 416 bits (1068), Expect = 3.25e-144
Identity = 227/295 (76.95%), Positives = 241/295 (81.69%), Query Frame = 0
Query: 1 MVQKPIDSKFSEYGHGNSGKDVP-HEKQLQISAKKTALRDLQNENRVTASNCTGSFPLLK 60
MVQK IDSK S NSGK+ P HEKQLQISAKKTALRDLQN+NRV ASNCTGS PLLK
Sbjct: 1 MVQKSIDSKLS-----NSGKESPAHEKQLQISAKKTALRDLQNDNRVVASNCTGSSPLLK 60
Query: 61 EGGPGSDFIKVSANKRPSTVCPTSPPHLHSSTSNAANGHLVYVRRKSDADIGKNSPGDST 120
E GP SDFIKVS N +PS V TSPP L SSTSN GHLVY+RRKSDADI K+SP DS+
Sbjct: 61 ERGPSSDFIKVSGNNKPSPVFTTSPPRLVSSTSNTTTGHLVYIRRKSDADIAKSSPCDSS 120
Query: 121 SIKADYPNLSKLGQLDETVHLKSQVKELKNHCFPAFAPFPVVPPMNASGTPSVPHHIGKY 180
SIKADY SKLGQL ETVHLKSQVKEL++HCFPAFAPF +V PMNASG PSVPH KY
Sbjct: 121 SIKADYQ--SKLGQLAETVHLKSQVKELQDHCFPAFAPFTMVSPMNASGKPSVPH---KY 180
Query: 181 GINLATAESNFHSALSTVPSVGIPPGWKNLQWEDRYHQLQLLLNKLDQSDQQDYLQVLRS 240
GINLATAES+F SA WKNLQWE RYHQL+LLLNKL+QSDQQDYLQVLRS
Sbjct: 181 GINLATAESDFDSA-----------EWKNLQWEHRYHQLELLLNKLNQSDQQDYLQVLRS 240
Query: 241 LSSVELSRHAVALEKRSIQLSLEEAKELRRVGVLNVLGNPGKNIKVPLAHQDGSE 294
LSSVELSRHAV LEKRSI LS EEAKEL+RVGVLNVLGNP NIKVPLAHQDGS+
Sbjct: 241 LSSVELSRHAVELEKRSIHLSFEEAKELQRVGVLNVLGNPVNNIKVPLAHQDGSD 274
BLAST of MC02g0354 vs. ExPASy TrEMBL
Match:
A0A0A0KAB4 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G118870 PE=4 SV=1)
HSP 1 Score: 414 bits (1065), Expect = 8.62e-143
Identity = 215/264 (81.44%), Positives = 225/264 (85.23%), Query Frame = 0
Query: 1 MVQKPIDSKFSEYGHGNSGKDVP-HEKQLQISAKKTALRDLQNENRVTASNCTGSFPLLK 60
MVQK IDSKFSEYGHGN GKDVP EKQLQISAKKTA RDLQN+N ASNCTGS PLLK
Sbjct: 1 MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLK 60
Query: 61 EGGPGSDFIKVSANKRPSTVCPTSPPHLHSSTSNAANGHLVYVRRKSDADIGKNSPGDST 120
E G GSD IKVS NKR V P SP HLHSSTSN+ANGHLVYVRRKSDADIGKNS D+T
Sbjct: 61 EIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHLVYVRRKSDADIGKNSSCDNT 120
Query: 121 SIKADYPNLSKLGQLDETVHLKSQVKELKNHCFPAFAPFPVVPPMNASGTPSVPHHIGKY 180
SIKA+YPNL+KLG L TVHLKSQ KEL+NHC AFAPFP+V +NA PSVPHH+GK
Sbjct: 121 SIKANYPNLNKLGSLAVTVHLKSQAKELQNHCVQAFAPFPMVSSVNAPRKPSVPHHMGKC 180
Query: 181 GINLATAESNFHSALSTVPSVGIPPGWKNLQWEDRYHQLQLLLNKLDQSDQQDYLQVLRS 240
GINLA AESNFHSA ST PSVGIP GWKNLQWEDRYHQLQLLLNKLDQSDQ+DYLQVL S
Sbjct: 181 GINLAVAESNFHSAPSTFPSVGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGS 240
Query: 241 LSSVELSRHAVALEKRSIQLSLEE 263
LSSVELSRHAV LEKRSIQLSLEE
Sbjct: 241 LSSVELSRHAVELEKRSIQLSLEE 264
BLAST of MC02g0354 vs. TAIR 10
Match:
AT2G45250.1 (Integral membrane protein hemolysin-III homolog )
HSP 1 Score: 99.8 bits (247), Expect = 4.0e-21
Identity = 81/227 (35.68%), Positives = 107/227 (47.14%), Query Frame = 0
Query: 58 LKEGGPGSDFIKVSANKRPSTVCPTSPPHLHSSTSNAANGHLVYVRRKSDADIGKNSPGD 117
+ EG P D K S++ PP +T+NAA+G LVYVRR+ + D K +
Sbjct: 33 IPEGTP-KDSEKAIEQDTVSSIGVKKPPVDSPATTNAASGRLVYVRRRVEVDTSKAAAST 92
Query: 118 STSIKADYPNLSKLGQLDETVHLKSQVKELKNHCFPAFAPFPVVPPMNASGTPSVPHHIG 177
+ P PP A P +P
Sbjct: 93 TN---------------------------------------PNPPPTKA--PPQIP---- 152
Query: 178 KYGINLATAESNFHSALSTVPSVGIPPGWKNLQWEDRYHQLQLLLNKLDQSDQQDYLQVL 237
S+ A + P+ P L WE+RY LQ+LLNKL+QSD+ D++Q+L
Sbjct: 153 ----------SSPAQAQAQEPT----PTSHKLDWEERYLHLQMLLNKLNQSDRTDHVQML 199
Query: 238 RSLSSVELSRHAVALEKRSIQLSLEEAKELRRVGVLNVLGNPGKNIK 285
SLSS ELS+HAV LEKRSIQ SLEEA+E++RV LNVLG +IK
Sbjct: 213 WSLSSAELSKHAVDLEKRSIQFSLEEAREMQRVAALNVLGRSVNSIK 199
BLAST of MC02g0354 vs. TAIR 10
Match:
AT4G38280.1 (BEST Arabidopsis thaliana protein match is: Integral membrane protein hemolysin-III homolog (TAIR:AT2G45250.1); Has 65 Blast hits to 65 proteins in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 2; Plants - 63; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 96.3 bits (238), Expect = 4.4e-20
Identity = 74/223 (33.18%), Positives = 101/223 (45.29%), Query Frame = 0
Query: 62 GPGSDFIKVSANKRPSTVCPTSPPHLHSSTSNAANGHLVYVRRKSDADIGKNSPGDSTSI 121
G D K + S++ PP +T+NAA+G LVYVRR+ + D K + +
Sbjct: 6 GTSKDSEKANEQDSVSSIGAKKPPLESPATTNAASGRLVYVRRRVEVDTSKAAASTTN-- 65
Query: 122 KADYPNLSKLGQLDETVHLKSQVKELKNHCFPAFAPFPVVPPMNASGTPSVPHHIGKYGI 181
PN P P P+ +P+
Sbjct: 66 ----PN-----------------------------PPPTKAPLQIPSSPAQEP------- 125
Query: 182 NLATAESNFHSALSTVPSVGIPPGWKNLQWEDRYHQLQLLLNKLDQSDQQDYLQVLRSLS 241
P L WE+RY LQ+LLNKL+QSD+ D++Q+L SLS
Sbjct: 126 ---------------------TPTSHKLDWEERYLHLQMLLNKLNQSDRTDHVQMLWSLS 165
Query: 242 SVELSRHAVALEKRSIQLSLEEAKELRRVGVLNVLGNPGKNIK 285
S ELS+HAV LEKRSIQ SLEEA+E++RV LN+LG ++K
Sbjct: 186 SAELSKHAVDLEKRSIQFSLEEAREMQRVAALNMLGRSVNSLK 165
BLAST of MC02g0354 vs. TAIR 10
Match:
AT2G45250.2 (Integral membrane protein hemolysin-III homolog )
HSP 1 Score: 79.3 bits (194), Expect = 5.6e-15
Identity = 70/206 (33.98%), Positives = 92/206 (44.66%), Query Frame = 0
Query: 58 LKEGGPGSDFIKVSANKRPSTVCPTSPPHLHSSTSNAANGHLVYVRRKSDADIGKNSPGD 117
+ EG P D K S++ PP +T+NAA+G LVYVRR+ + D K +
Sbjct: 33 IPEGTP-KDSEKAIEQDTVSSIGVKKPPVDSPATTNAASGRLVYVRRRVEVDTSKAAAST 92
Query: 118 STSIKADYPNLSKLGQLDETVHLKSQVKELKNHCFPAFAPFPVVPPMNASGTPSVPHHIG 177
+ P PP A P +P
Sbjct: 93 TN---------------------------------------PNPPPTKA--PPQIP---- 152
Query: 178 KYGINLATAESNFHSALSTVPSVGIPPGWKNLQWEDRYHQLQLLLNKLDQSDQQDYLQVL 237
S+ A + P+ P L WE+RY LQ+LLNKL+QSD+ D++Q+L
Sbjct: 153 ----------SSPAQAQAQEPT----PTSHKLDWEERYLHLQMLLNKLNQSDRTDHVQML 178
Query: 238 RSLSSVELSRHAVALEKRSIQLSLEE 264
SLSS ELS+HAV LEKRSIQ SLEE
Sbjct: 213 WSLSSAELSKHAVDLEKRSIQFSLEE 178
The following BLAST results are available for this feature:
BLAST of MC02g0354 vs. NCBI nr
Match Name | E-value | Identity (%) | Description
XP_022140529.1 | 2.00e-214 | 99.66 | uncharacterized protein LOC111011167 [Momordica charantia]
XP_038878250.1 | 4.33e-175 | 85.47 | uncharacterized protein LOC120070536 [Benincasa hispida]
XP_022986425.1 | 6.52e-172 | 84.46 | uncharacterized protein LOC111484175 [Cucurbita maxima]
XP_022943750.1 | 8.85e-170 | 83.78 | uncharacterized protein LOC111448407 [Cucurbita moschata] >KAG7010816.1 hypothet...
KAG6570979.1 | 6.45e-169 | 83.78 | hypothetical protein SDJN03_29894, partial [Cucurbita argyrosperma subsp. sorori...
BLAST of MC02g0354 vs. ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1CFY6 | 9.68e-215 | 99.66 | uncharacterized protein LOC111011167 OS=Momordica charantia OX=3673 GN=LOC111011...
A0A6J1JE12 | 3.16e-172 | 84.46 | uncharacterized protein LOC111484175 OS=Cucurbita maxima OX=3661 GN=LOC111484175...
A0A6J1FY79 | 4.28e-170 | 83.78 | uncharacterized protein LOC111448407 OS=Cucurbita moschata OX=3662 GN=LOC1114484...
A0A6J1G8C0 | 3.25e-144 | 76.95 | uncharacterized protein LOC111451757 OS=Cucurbita moschata OX=3662 GN=LOC1114517...
A0A0A0KAB4 | 8.62e-143 | 81.44 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G118870 PE=4 SV=1
BLAST of MC02g0354 vs. TAIR 10
Match Name | E-value | Identity (%) | Description
AT2G45250.1 | 4.0e-21 | 35.68 | Integral membrane protein hemolysin-III homolog
AT4G38280.1 | 4.4e-20 | 33.18 | BEST Arabidopsis thaliana protein match is: Integral membrane protein hemolysin-...
AT2G45250.2 | 5.6e-15 | 33.98 | Integral membrane protein hemolysin-III homolog
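The summary rows above are pipe-separated, so they can be loaded into records for sorting or filtering by E-value. A hedged sketch, assuming the column order shown (name, E-value, identity, description); `BlastHit` and `parse_row` are hypothetical names introduced for illustration:

```python
from typing import NamedTuple


class BlastHit(NamedTuple):
    name: str
    evalue: float
    identity: float  # percent identity
    description: str


def parse_row(line: str) -> BlastHit:
    """Parse one pipe-separated summary row into a BlastHit.

    Empty cells and a trailing "[more]" cell are ignored. Header rows
    raise ValueError because their E-value cell is not numeric.
    """
    cells = [cell.strip() for cell in line.split("|")]
    cells = [cell for cell in cells if cell and cell != "[more]"]
    if len(cells) < 4:
        raise ValueError(f"expected at least 4 cells, got {len(cells)}")
    name, evalue, identity = cells[0], cells[1], cells[2]
    # Re-join in case the description itself contained a pipe.
    description = " | ".join(cells[3:])
    return BlastHit(name, float(evalue), float(identity), description)


rows = [
    "AT2G45250.1 | 4.0e-21 | 35.68 | Integral membrane protein hemolysin-III homolog | [more] |",
    "AT2G45250.2 | 5.6e-15 | 33.98 | Integral membrane protein hemolysin-III homolog | [more] |",
]
hits = sorted((parse_row(r) for r in rows), key=lambda h: h.evalue)
print(hits[0].name)
```

Truncated descriptions ending in "..." are kept verbatim; the full text has to come from the source database.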