MC04g1050 (gene) Bitter gourd (Dali-11) v1

Overview
Name: MC04g1050
Type: gene
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: charged multivesicular body protein 7
Location: MC04: 18605033 .. 18609304 (+)
RNA-Seq Expression: MC04g1050
Synteny: MC04g1050
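
The location field can be split into chromosome, start, end, and strand to work out how much of the chromosome the gene spans. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF convention, which this page does not state explicitly); `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str   # chromosome / scaffold name, e.g. "MC04"
    start: int   # first base of the feature (assumed 1-based)
    end: int     # last base of the feature (assumed inclusive)
    strand: str  # "+" or "-"


def parse_location(text: str) -> Location:
    """Parse a string such as 'MC04: 18605033 .. 18609304 (+)'."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(m.group(1), int(m.group(2)), int(m.group(3)), m.group(4))


def span_length(loc: Location) -> int:
    """Number of bases covered, assuming inclusive 1-based coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("MC04: 18605033 .. 18609304 (+)")
print(loc.seqid, span_length(loc), loc.strand)
```

Under the inclusive-coordinate assumption the span is 18609304 - 18605033 + 1 = 4272 bases on the forward strand.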
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AGAAAACTCAATTTTTCTAATTTAGGTTATTTGAGAAGCGAAAACATTTTTAGATCATCAATTTCCGATGTTATATTTTAGAGGTGGAAAAAGGAAAAGGGAAAAAAAGAGGGTAAAAGACGAAAGAGAAGGAGCATCCCCCCTTTTGGTTTGGGTTCTTAAAAAAATTGAATGTAACAAGCTTTGGGGAATCCCTCTTTTGATTGATTACTACTAAATCCTCAATATTACTGCAATTTCGCTTCATTTGTGTTCGACGAGGTTTTAGGAACGAAGTTTTTGGCATTTGGTTGATTGCTATTTCATTCCGTCAACGCTATCATCTTACCTTCTGTAATCGGCCCATCGATTCCTTTAAAAATACATATCTGTTCATGCTTTTAGTTCATTTCCATTTTCCAGTCTTACAAGTGCAGCGTGTTTGATTTCGAGCAGTTTGTAGCGAGAAGAAGAGGGCTATAGTTCCTCCAATTGAAAAAGATTAGGTAATTTTGCTTTATATCGGCATAATAGTTCGTTCCTTCAATTTATCTATTTTTTGATTGTGCTATCACATTTTTACTTGATGAAATTTGAACTTTGATCGAGCATGCCAGTTTTTCTAGTGGTTTTCTCTCAGCCCCTGGTTAAAAATAGACTAATAGAAGATCAAATTTGTGCTCACTTATTATGAGTTTGAATTGAATTCTTGTATTGTCACTTCTTTACTTCATTACTGTTTTTATGCAATGGTCTTCATGTCAGATGCGTGCTAATATCTATGTTAAAAAATGTACATTTTGTGTTCCATGAACCATTGGCATGGCTTAGTAGTTTCTTTTTTCCGTGGGGGAAAGCTAGGCTTTTTAAAGTTGATTTGAATCTTCTATATGCCTTTTCTGTTAGGAATAAACTTAGACCATTAAGTGGAGGATAAATTTTCTTAATGTTAAGGACTTTTTGCTCTCTCTCTCGTTTACTGGTTTTATGATTTTCCAGAGAGTTCTCTGTACAGTCCTCAACGACTCTCTTAAAACTCTCTCTGCTGAAATTCATAATTTAAGATCTTTTGTTAGAAGGTCCCTGGATGTTTTATGGACTAATAGCTTGGTTATGATTTTTCTAGCATGAAACTACAACTTGAAATCACTTGAAAACATGGAAAGGAAGTGTCCAGTATTAAAATACATGTTTTTGCTAACTCAAGATTTACACAGTTGGATTGGACTGAAGGTAAGGAGCACCTCAAAACATAGGTGTAAAAAGCAAACAAAATGGAAAAGGAATTGAAGGGGTCACACGTAAGAGAGTTCATCAAGGACAAAGTCACTGACTGGGATGATGAAGTGGTGACTACAGCTCGATTCAAGGCATTCAGTGGGCAAAAATCTGATTGGGAACCCAGATACCTATTTTGGAGGGATTTGATCCTCACAATTGCCCATAAATTCAACTTCATATTCATCAAACCTATTGAGATAAGGAATCAGTGGTTTTCTCAAGGAGGGTTGGCTCCACTGTGTCTTGACCATGTCCTGGTATTGCTTCCAAATAAGATTATTATTCTTTTATTTATAATTTATTGAACTGTGTTGTTGTTCTCTCCTATAATAATTCACCATTCTATTCAGCATCAAATGTATATCGATGGTGACATTATAATACAAAGTGACATGCTAGACCCGAGGAGTGGCCAACTTTCTCACTTATTTAAAAAACTAAGCAATTTGATGGGTACATCCAAAAAGAACCCCAACGATTTGCTTCGTGATGAATATGTAGTTCTTGCCTCTGTATTAAAGGTATATGCTTTTAATCTCCTCTCACTCTTGCTCGAATCCTCTTCAGGTTCTATACTTTTTCCTTCTCTTTAGAAACTTGCCTTTTTGAGATCTCCTTTTGAATTTACTTCCACGTTCATTTTAATTTTTCACGACTCTTGGTTTTTGCTTCTATAACTTTAATATGAGATGTTTCCCTGCCAAGGATAGAGCAGCTGAGGTTGTCAAGTGTTTGTCTCA
TAGTAACTGGACCTCTTCCTGCATTATTACAATGATGAAGTTCCAGAACATCTGTGGAGGACCTGATGAAGCGACTGCCATCTTAAGTTACTTGTTTGGATGTGGTAAAGCAAGGTATCTCTCTAGGGAAAAAGAGGAATTTTTAGAGGTATGCCATTCTTTCCTTTTTCCGCATTTCATTCTTCATATAAGTACATTCTTGTTTTATTTTTAAAAATATGAAAGGGGTAGTGAAGCTGCTTTGATTCTTAAGAAAGGCAGTGCAGCCTGCCGAAGGCTGAACTTTACCGGAGAATTGGTTAGAAAGTAAGAAATGAGTAGTGTAAATTAAACTCCATCATATAGTAAAGAATGTAACATGATAGAATCCTTACCTACTAAGTTTTTTAAATTCCATCAATGTCTTTAGTCTGTCAGTGAGTTATCACCTCCATAACTCTCTTTTGATTTTTCCATTCCCCCTTATAATAATCAGGGTGTGAAGGTTTCTCTCTCAGCAGCTACAGTTCCTAGTATCACTACTCTCGACTGTGACATTTTGCACTTAATTTGGACAACAGAAAAGCTTCAGCAACAACTTGATGTGATTGACCAGCGCTATAATGTGTAATTATTTATGGCTCTTAACACAGTTTATAAATTGTACTTGTTCGAGGCATAGCTGCAAACAACTTGATAAGAAGTCTCTAGTAGCTTCATGATAAGATAGGGGGTGAAAACACTCCAGCTATTCTGTGTCAATAGTGTTTGGGGTTCAGTTTCATCATCTACTACAATAATGTTTGTTTGGAATGGCCAGGACAACAACTTCTGGTTGTTAAAATGGAAGGCTTAATTGAAGTTTTTAATGTTTTAGACTAGGACGGGTGGAACAACTATTCATATAATGCTCTCCTTGTAATATGCTTGTGAAGTCTGTCTGCCAATTTTTCATTGCAGGTCGAGGCAATCTGCATTAGCTTCTTTAAAGTCTGGAAACAAAAAAACTGCATTGAAACATGCAAGAGAGTTAAAGATCAGTACAGAAAGTCGAGAAAAAGTTGCATCTCTCTTAAACAGAGTGGAGGAAGTCCTAAATGCTATTGCAGATGCCGAGTCAACAAAAACGGTAGTTTACTATTAGTTTTCTCCTGTAATTAGTGGAAGGTGCCTGTATCTTTGTCTTGCTAACAATGACAGAGTTGGCTAACTTTTAGCATAAAATCATTTTTTAAACAAATACAGATTGTGAGCGACCAACAAATTATTCTGGTTTTGGTATTGACTTAGAACTTGATGCCTTAATGTTTACTAGTATCACACCCTGCTGGCAGGTTTCTGAAGCTATTCAAATTGGTGCTCGAGTAATGAAAGAACACGAGGTTAGTTGGGAGGAACTCCAGCATAGTATGCAGGAACTAGAAGACAGCATTGATTTACATAAGCAAGTTGCAAGTGCTATAGGTACTTCTATTATGGATTTTTTCCCAGTTTCTTGCCTTGAACTTTTCTTTCATTATAAGTGAATGCGGCTCTGGGTGTCATAGATTCAGCTCCATCTGGCTCGATTCTGGAAGACGAAGATATTGAGGAGGAGTTTCGAAAGTTTGAGTTGGAAGTTACAGGCCAAAACATCGACGTGCCAACACCCAATTCTGGGGCTTCAGTTTCTGATGATTTGTTGAGCATTGCCTTATCAAATCTAAAACTTGTGGAGGATACAGGTAAGGAGACAACAGTGAACCAGAATTCAAACTCTAACAGCAAGTCGAAAATAATGGAGCTTGGCATTTCTTAATTCTTAGGTGTAGTATATTGTGTATTGATAATGTCAGCCAAATATGTACACCTGCTTTTTCCGATTGCCCTCTTTTGTACTGTGTCGTCAAATGCCATTGAAAATTTGTATAAATTATCACTACAAAATAGAGTTAGCTGCATGATATACCATTAAATTTGAATCCTCAGAATCAATTAAAGCCTGTGTGGTAATGTTTCGTAATATTTTTGGGTATGTTTTA
TATTGTTTCTTGCTTCTCGTTTTTTATTTTATTAAGAAATTTTTAGTACTTTACCATTATTTATGTTTTTTGTTTTCCATTTTTAAAAAGCATTCTCAAAATTGTATAAGAAATGAGAAATGTGTGCATCATACTATTCGTAAGGATACATCTTAGGGGTGTTCAAAAAATCAGACAATCCGACGAACCCGATGAACCCAACCCAAACGTAAGGGTTGGGTTGGGTTGGGTTAGATGAAGGTTTGGGTTGGGCTGGATCTCGGGTTCAACCT

mRNA sequence

AGAAAACTCAATTTTTCTAATTTAGGTTATTTGAGAAGCGAAAACATTTTTAGATCATCAATTTCCGATGTTATATTTTAGAGGTGGAAAAAGGAAAAGGGAAAAAAAGAGGGTAAAAGACGAAAGAGAAGGAGCATCCCCCCTTTTGGTTTGGGTTCTTAAAAAAATTGAATGTAACAAGCTTTGGGGAATCCCTCTTTTGATTGATTACTACTAAATCCTCAATATTACTGCAATTTCGCTTCATTTGTGTTCGACGAGGTTTTAGGAACGAAGTTTTTGGCATTTGGTTGATTGCTATTTCATTCCGTCAACGCTATCATCTTACCTTCTGTAATCGGCCCATCGATTCCTTTAAAAATACATATCTGTTCATGCTTTTAGTTCATTTCCATTTTCCAGTCTTACAAGTGCAGCGTGTTTGATTTCGAGCAGTTTGTAGCGAGAAGAAGAGGGCTATAGTTCCTCCAATTGAAAAAGATTAGTTGGATTGGACTGAAGGTAAGGAGCACCTCAAAACATAGGTGTAAAAAGCAAACAAAATGGAAAAGGAATTGAAGGGGTCACACGTAAGAGAGTTCATCAAGGACAAAGTCACTGACTGGGATGATGAAGTGGTGACTACAGCTCGATTCAAGGCATTCAGTGGGCAAAAATCTGATTGGGAACCCAGATACCTATTTTGGAGGGATTTGATCCTCACAATTGCCCATAAATTCAACTTCATATTCATCAAACCTATTGAGATAAGGAATCAGTGGTTTTCTCAAGGAGGGTTGGCTCCACTGTGTCTTGACCATGTCCTGCATCAAATGTATATCGATGGTGACATTATAATACAAAGTGACATGCTAGACCCGAGGAGTGGCCAACTTTCTCACTTATTTAAAAAACTAAGCAATTTGATGGGTACATCCAAAAAGAACCCCAACGATTTGCTTCGTGATGAATATGTAGTTCTTGCCTCTGTATTAAAGGATAGAGCAGCTGAGGTTGTCAAGTGTTTGTCTCATAGTAACTGGACCTCTTCCTGCATTATTACAATGATGAAGTTCCAGAACATCTGTGGAGGACCTGATGAAGCGACTGCCATCTTAAGTTACTTGTTTGGATGTGGTAAAGCAAGGTATCTCTCTAGGGAAAAAGAGGAATTTTTAGAGGGTGTGAAGGTTTCTCTCTCAGCAGCTACAGTTCCTAGTATCACTACTCTCGACTGTGACATTTTGCACTTAATTTGGACAACAGAAAAGCTTCAGCAACAACTTGATGTGATTGACCAGCGCTATAATGTGTCGAGGCAATCTGCATTAGCTTCTTTAAAGTCTGGAAACAAAAAAACTGCATTGAAACATGCAAGAGAGTTAAAGATCAGTACAGAAAGTCGAGAAAAAGTTGCATCTCTCTTAAACAGAGTGGAGGAAGTCCTAAATGCTATTGCAGATGCCGAGTCAACAAAAACGGTTTCTGAAGCTATTCAAATTGGTGCTCGAGTAATGAAAGAACACGAGGTTAGTTGGGAGGAACTCCAGCATAGTATGCAGGAACTAGAAGACAGCATTGATTTACATAAGCAAGTTGCAAGTGCTATAGATTCAGCTCCATCTGGCTCGATTCTGGAAGACGAAGATATTGAGGAGGAGTTTCGAAAGTTTGAGTTGGAAGTTACAGGCCAAAACATCGACGTGCCAACACCCAATTCTGGGGCTTCAGTTTCTGATGATTTGTTGAGCATTGCCTTATCAAATCTAAAACTTGTGGAGGATACAGGTAAGGAGACAACAGTGAACCAGAATTCAAACTCTAACAGCAAGTCGAAAATAATGGAGCTTGGCATTTCTTAATTCTTAGGTGTAGTATATTGTGTATTGATAATGTCAGCCAAATATGTACACCTGCTTTTTCCGATTGCCCTCTTTTGTACTGTGTCGTCAAATGCCATTGAAAATTTGTATAAATTATCACTACAAAATAGAGTTAGCTGCATGATATACCATTAAATT
TGAATCCTCAGAATCAATTAAAGCCTGTGTGGTAATGTTTCGTAATATTTTTGGGTATGTTTTATATTGTTTCTTGCTTCTCGTTTTTTATTTTATTAAGAAATTTTTAGTACTTTACCATTATTTATGTTTTTTGTTTTCCATTTTTAAAAAGCATTCTCAAAATTGTATAAGAAATGAGAAATGTGTGCATCATACTATTCGTAAGGATACATCTTAGGGGTGTTCAAAAAATCAGACAATCCGACGAACCCGATGAACCCAACCCAAACGTAAGGGTTGGGTTGGGTTGGGTTAGATGAAGGTTTGGGTTGGGCTGGATCTCGGGTTCAACCT

Coding sequence (CDS)

ATGGAAAAGGAATTGAAGGGGTCACACGTAAGAGAGTTCATCAAGGACAAAGTCACTGACTGGGATGATGAAGTGGTGACTACAGCTCGATTCAAGGCATTCAGTGGGCAAAAATCTGATTGGGAACCCAGATACCTATTTTGGAGGGATTTGATCCTCACAATTGCCCATAAATTCAACTTCATATTCATCAAACCTATTGAGATAAGGAATCAGTGGTTTTCTCAAGGAGGGTTGGCTCCACTGTGTCTTGACCATGTCCTGCATCAAATGTATATCGATGGTGACATTATAATACAAAGTGACATGCTAGACCCGAGGAGTGGCCAACTTTCTCACTTATTTAAAAAACTAAGCAATTTGATGGGTACATCCAAAAAGAACCCCAACGATTTGCTTCGTGATGAATATGTAGTTCTTGCCTCTGTATTAAAGGATAGAGCAGCTGAGGTTGTCAAGTGTTTGTCTCATAGTAACTGGACCTCTTCCTGCATTATTACAATGATGAAGTTCCAGAACATCTGTGGAGGACCTGATGAAGCGACTGCCATCTTAAGTTACTTGTTTGGATGTGGTAAAGCAAGGTATCTCTCTAGGGAAAAAGAGGAATTTTTAGAGGGTGTGAAGGTTTCTCTCTCAGCAGCTACAGTTCCTAGTATCACTACTCTCGACTGTGACATTTTGCACTTAATTTGGACAACAGAAAAGCTTCAGCAACAACTTGATGTGATTGACCAGCGCTATAATGTGTCGAGGCAATCTGCATTAGCTTCTTTAAAGTCTGGAAACAAAAAAACTGCATTGAAACATGCAAGAGAGTTAAAGATCAGTACAGAAAGTCGAGAAAAAGTTGCATCTCTCTTAAACAGAGTGGAGGAAGTCCTAAATGCTATTGCAGATGCCGAGTCAACAAAAACGGTTTCTGAAGCTATTCAAATTGGTGCTCGAGTAATGAAAGAACACGAGGTTAGTTGGGAGGAACTCCAGCATAGTATGCAGGAACTAGAAGACAGCATTGATTTACATAAGCAAGTTGCAAGTGCTATAGATTCAGCTCCATCTGGCTCGATTCTGGAAGACGAAGATATTGAGGAGGAGTTTCGAAAGTTTGAGTTGGAAGTTACAGGCCAAAACATCGACGTGCCAACACCCAATTCTGGGGCTTCAGTTTCTGATGATTTGTTGAGCATTGCCTTATCAAATCTAAAACTTGTGGAGGATACAGGTAAGGAGACAACAGTGAACCAGAATTCAAACTCTAACAGCAAGTCGAAAATAATGGAGCTTGGCATTTCTTAA

Protein sequence

MEKELKGSHVREFIKDKVTDWDDEVVTTARFKAFSGQKSDWEPRYLFWRDLILTIAHKFNFIFIKPIEIRNQWFSQGGLAPLCLDHVLHQMYIDGDIIIQSDMLDPRSGQLSHLFKKLSNLMGTSKKNPNDLLRDEYVVLASVLKDRAAEVVKCLSHSNWTSSCIITMMKFQNICGGPDEATAILSYLFGCGKARYLSREKEEFLEGVKVSLSAATVPSITTLDCDILHLIWTTEKLQQQLDVIDQRYNVSRQSALASLKSGNKKTALKHARELKISTESREKVASLLNRVEEVLNAIADAESTKTVSEAIQIGARVMKEHEVSWEELQHSMQELEDSIDLHKQVASAIDSAPSGSILEDEDIEEEFRKFELEVTGQNIDVPTPNSGASVSDDLLSIALSNLKLVEDTGKETTVNQNSNSNSKSKIMELGIS
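
The protein sequence follows from the coding sequence by reading it three bases at a time. The sketch below illustrates this on the first 30 and last 12 bases of the CDS shown above, assuming the standard nuclear genetic code; `translate` is a hypothetical helper written for this illustration, not a function from any particular library.

```python
# Standard genetic code, with codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


cds_start = "ATGGAAAAGGAATTGAAGGGGTCACACGTA"  # first 10 codons of the CDS
cds_end = "GGCATTTCTTAA"                      # last 4 codons, including the stop
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first ten residues (MEKELKGSHV) and the last three residues (GIS) of the protein sequence listed above.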
Homology
BLAST of MC04g1050 vs. ExPASy Swiss-Prot
Match: Q6PBQ2 (Charged multivesicular body protein 7 OS=Danio rerio OX=7955 GN=chmp7 PE=2 SV=1)

HSP 1 Score: 84.7 bits (208), Expect = 2.7e-15
Identity = 89/374 (23.80%), Positives = 175/374 (46.79%), Query Frame = 0

Query: 20  DWDDEVVTTARFKAFSGQK----SDWEPRYLFWRDLILTIAHKFNFIFIKPIEIRNQWFS 79
           DWDD+   +  F AF   +    +DW+ +  FW  LI+    +   + +  ++  N+ F 
Sbjct: 15  DWDDDERMSFLFSAFKENRDVDCTDWDGKIDFWSPLIIEHCRRCGSVCVN-LQDLNENFR 74

Query: 80  QGGLAPLCLDHVLHQMYIDGDIIIQSDM-LDPRSGQLS---------HLFKKLSNLMGTS 139
           + G  PL L  V+  M   G +  +SD   +  SG LS          L   LS L+G+ 
Sbjct: 75  RKGSVPLGLSTVIQSMIRSGKVQKESDFAANVDSGWLSWGVGLLLVRPLKWTLSALLGSG 134

Query: 140 KKNPNDLLRDEYVVLASVLKDRAAEVVKCLSHSNWTSSCIITMMKFQNICGG--PDEATA 199
           +     +  +E  V+  ++K++AAE++     S  ++  +++  + +++     PDE+T 
Sbjct: 135 R-----VPLEESFVVIELVKEKAAELLAAYRGSALSARSLLSFQELRSLSSHICPDESTL 194

Query: 200 ILSYLFGCGKARYLSREKE---EFLEGVK-VSLSAA---TVPSITTLDCDILHLIWTTEK 259
            ++ L        L REK       EG K V  S A    V  ++ +D  I  L  + + 
Sbjct: 195 CMALL-------QLQREKHVTVSLHEGEKLVKFSQAGQGRVSPVSEVDLGIYQLQCSEKL 254

Query: 260 LQQQLDVIDQRYNVSRQSALASLKSGNKKTALKHARELKISTESREKVASLLNRVEEVLN 319
           L+++++ +       +Q A + LK G K  AL+  R  K   +  +++ + L  V+ +L+
Sbjct: 255 LEERVEALGHEAEKCKQQAKSLLKEGKKSQALRCLRGSKRVEKKADRLFAQLETVKGILD 314

Query: 320 AIADAESTKTVSEAIQIGARVMK--EHEVSWEELQHSMQELEDSIDLHKQVASAIDSAPS 369
            IA++++ + V +A Q G   ++     V+ E  ++ + ++++  D   +V   + S   
Sbjct: 315 RIANSQTDRLVMQAYQAGVAALRISLKGVTVERAENLVDQIQELCDTQDEVNQTLASGAP 374
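
The percentages on each HSP summary line are simply the identical (or positively scoring) positions divided by the alignment length. The sketch below checks this for the Danio rerio hit; `parse_hsp_stats` is a hypothetical helper, and the regular expression is an assumption about the line layout shown on this page. It tolerates both "Positives" and the "Postives" spelling that appears in some renderings of these reports.

```python
import re

HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract identity/positive counts and recompute the percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, length, reported_id, positive, _, reported_pos = m.groups()
    identical, positive, length = int(identical), int(positive), int(length)
    return {
        "length": length,
        "identity_pct": 100.0 * identical / length,
        "positive_pct": 100.0 * positive / length,
        "reported_identity_pct": float(reported_id),
        "reported_positive_pct": float(reported_pos),
    }


stats = parse_hsp_stats(
    "Identity = 89/374 (23.80%), Positives = 175/374 (46.79%), Query Frame = 0"
)
print(f"{stats['identity_pct']:.2f} {stats['positive_pct']:.2f}")
```

Recomputing 89/374 and 175/374 reproduces the reported 23.80% identity and 46.79% positives over the 374-column alignment.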

BLAST of MC04g1050 vs. ExPASy Swiss-Prot
Match: Q871Y8 (Vacuolar-sorting protein snf7 OS=Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) OX=367110 GN=vsp-3 PE=3 SV=1)

HSP 1 Score: 47.0 bits (110), Expect = 6.3e-04
Identity = 49/199 (24.62%), Positives = 94/199 (47.24%), Query Frame = 0

Query: 237 LQQQLDVIDQR-----YNVSRQSALASLKSGNKKTALKHA-RELKISTESREKVASLLNR 296
           L+ QLD++ +R       +  Q A+A       KTA K A R  K++  + E     +  
Sbjct: 28  LRTQLDMLQKRERHLQNQIDEQDAIARKNVSTNKTAAKQALRRKKVAESTLETTLGQITT 87

Query: 297 VEEVLNAIADAESTKTVSEAIQIGARVM-KEH-EVSWEELQHSMQELEDSIDLHKQVASA 356
           +E+ +NAI  A   +    A+Q     M K H +++ E++   M +L+++ DL  ++A+A
Sbjct: 88  LEQQINAIESANINRETLAAMQAAREAMGKIHGKLTPEKVDEEMAKLQEANDLSNEIATA 147

Query: 357 IDSAPSGSILEDEDIEEEFRKFELEVTGQNIDVPTPNSGASVS-DDLLSIALSNLKLVED 416
           I SA  G  +++ ++E+E  K + E     +D     +G S+   D +S+  +    ++ 
Sbjct: 148 ITSANIGQPIDEGELEDELEKLQQE----EVDSKLHETGGSIPVHDKISLPAAGTGALKG 207

Query: 417 TGKETTVNQNSNSNSKSKI 427
             K   V ++       K+
Sbjct: 208 KEKAKAVVEDDEEEELRKL 222

BLAST of MC04g1050 vs. NCBI nr
Match: XP_022146425.1 (charged multivesicular body protein 7 [Momordica charantia])

HSP 1 Score: 838 bits (2165), Expect = 4.05e-306
Identity = 432/432 (100.00%), Positives = 432/432 (100.00%), Query Frame = 0

Query: 1   MEKELKGSHVREFIKDKVTDWDDEVVTTARFKAFSGQKSDWEPRYLFWRDLILTIAHKFN 60
           MEKELKGSHVREFIKDKVTDWDDEVVTTARFKAFSGQKSDWEPRYLFWRDLILTIAHKFN
Sbjct: 1   MEKELKGSHVREFIKDKVTDWDDEVVTTARFKAFSGQKSDWEPRYLFWRDLILTIAHKFN 60

Query: 61  FIFIKPIEIRNQWFSQGGLAPLCLDHVLHQMYIDGDIIIQSDMLDPRSGQLSHLFKKLSN 120
           FIFIKPIEIRNQWFSQGGLAPLCLDHVLHQMYIDGDIIIQSDMLDPRSGQLSHLFKKLSN
Sbjct: 61  FIFIKPIEIRNQWFSQGGLAPLCLDHVLHQMYIDGDIIIQSDMLDPRSGQLSHLFKKLSN 120

Query: 121 LMGTSKKNPNDLLRDEYVVLASVLKDRAAEVVKCLSHSNWTSSCIITMMKFQNICGGPDE 180
           LMGTSKKNPNDLLRDEYVVLASVLKDRAAEVVKCLSHSNWTSSCIITMMKFQNICGGPDE
Sbjct: 121 LMGTSKKNPNDLLRDEYVVLASVLKDRAAEVVKCLSHSNWTSSCIITMMKFQNICGGPDE 180

Query: 181 ATAILSYLFGCGKARYLSREKEEFLEGVKVSLSAATVPSITTLDCDILHLIWTTEKLQQQ 240
           ATAILSYLFGCGKARYLSREKEEFLEGVKVSLSAATVPSITTLDCDILHLIWTTEKLQQQ
Sbjct: 181 ATAILSYLFGCGKARYLSREKEEFLEGVKVSLSAATVPSITTLDCDILHLIWTTEKLQQQ 240

Query: 241 LDVIDQRYNVSRQSALASLKSGNKKTALKHARELKISTESREKVASLLNRVEEVLNAIAD 300
           LDVIDQRYNVSRQSALASLKSGNKKTALKHARELKISTESREKVASLLNRVEEVLNAIAD
Sbjct: 241 LDVIDQRYNVSRQSALASLKSGNKKTALKHARELKISTESREKVASLLNRVEEVLNAIAD 300

Query: 301 AESTKTVSEAIQIGARVMKEHEVSWEELQHSMQELEDSIDLHKQVASAIDSAPSGSILED 360
           AESTKTVSEAIQIGARVMKEHEVSWEELQHSMQELEDSIDLHKQVASAIDSAPSGSILED
Sbjct: 301 AESTKTVSEAIQIGARVMKEHEVSWEELQHSMQELEDSIDLHKQVASAIDSAPSGSILED 360

Query: 361 EDIEEEFRKFELEVTGQNIDVPTPNSGASVSDDLLSIALSNLKLVEDTGKETTVNQNSNS 420
           EDIEEEFRKFELEVTGQNIDVPTPNSGASVSDDLLSIALSNLKLVEDTGKETTVNQNSNS
Sbjct: 361 EDIEEEFRKFELEVTGQNIDVPTPNSGASVSDDLLSIALSNLKLVEDTGKETTVNQNSNS 420

Query: 421 NSKSKIMELGIS 432
           NSKSKIMELGIS
Sbjct: 421 NSKSKIMELGIS 432

BLAST of MC04g1050 vs. NCBI nr
Match: XP_038875996.1 (uncharacterized protein LOC120068336 isoform X2 [Benincasa hispida])

HSP 1 Score: 682 bits (1760), Expect = 3.01e-244
Identity = 363/444 (81.76%), Positives = 396/444 (89.19%), Query Frame = 0

Query: 1   MEKELKGSHVREFIKDKVTDWDDEVVTTARFKAFSGQKSDWEPRYLFWRDLILTIAHKFN 60
           MEKE KGS VREFI++KV DWDDEVV TARFKAFSGQKSDWEPRY  WRDLI+TIA KFN
Sbjct: 1   MEKESKGSRVREFIREKVPDWDDEVVATARFKAFSGQKSDWEPRYQVWRDLIITIARKFN 60

Query: 61  FIFIKPIEIRNQWFSQGGLAPLCLDHVLHQMYIDGDIIIQSDMLDPRSGQLSHLFKKLSN 120
           FIFIKP EI+NQWFS+GGL+PLCLDHVLH MYI+GDII + DMLDPRSGQLS+LFKKLSN
Sbjct: 61  FIFIKPSEIKNQWFSRGGLSPLCLDHVLHVMYIEGDIIRRGDMLDPRSGQLSYLFKKLSN 120

Query: 121 LMGTSKKNPNDLLRDEYVVLASVLKDRAAEVVKCLSHSNWTSSCIITMMKFQNICGGPDE 180
           LMGTSKKNP+ LLRD+YVVLA VL+DRAAEV+KCLS SNWTSSCIITM+KFQNICGGPDE
Sbjct: 121 LMGTSKKNPDSLLRDDYVVLACVLQDRAAEVIKCLSLSNWTSSCIITMVKFQNICGGPDE 180

Query: 181 ATAILSYLFGCGKARYLSREKEEFLEGVKVSLSAATVPSITTLDCDILHLIWTTEKLQQQ 240
           AT ILSYL G GKARYLS+EK+E LEGVK+SL+A TVP ITTLD DILHLIWTTEKLQQQ
Sbjct: 181 ATVILSYLIGYGKARYLSKEKKELLEGVKISLTAMTVPGITTLDYDILHLIWTTEKLQQQ 240

Query: 241 LDVIDQRYNVSRQSALASLKSGNKKTALKHARELKISTESREKVASLLNRVEEVLNAIAD 300
           LDVIDQRY+VSRQSALASLKSGNKKTALKHARELKI+TESREKVASLLNRVEEVLNAIAD
Sbjct: 241 LDVIDQRYDVSRQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD 300

Query: 301 AESTKTVSEAIQIGARVMKEHEVSWEELQHSMQELEDSIDLHKQVASAI--DSAPSGSIL 360
           AESTKTVSEAIQIGARVMKEHEVSW++LQ+S+QE+E SIDL KQVASAI  DSAPSGSI 
Sbjct: 301 AESTKTVSEAIQIGARVMKEHEVSWDQLQYSLQEIEVSIDLQKQVASAISTDSAPSGSIP 360

Query: 361 EDEDIEEEFRKFELEVT-GQNID---------VPTPNSGASVSDDLLSIALSNLKLVEDT 420
           EDEDIEEEF+K ELEVT GQN+D         + T  + A+VSDDLLS ALSNLKLVE+T
Sbjct: 361 EDEDIEEEFKKLELEVTAGQNLDPSTSESVVNIATGETVATVSDDLLSDALSNLKLVEET 420

Query: 421 GKETTVNQNSNSNSKSKIMELGIS 432
           G  T + Q SNS SKSK+MELGIS
Sbjct: 421 GNVTAI-QKSNSKSKSKMMELGIS 443

BLAST of MC04g1050 vs. NCBI nr
Match: XP_023551290.1 (charged multivesicular body protein 7 [Cucurbita pepo subsp. pepo] >XP_023551291.1 charged multivesicular body protein 7 [Cucurbita pepo subsp. pepo] >XP_023551292.1 charged multivesicular body protein 7 [Cucurbita pepo subsp. pepo] >XP_023551293.1 charged multivesicular body protein 7 [Cucurbita pepo subsp. pepo] >XP_023551294.1 charged multivesicular body protein 7 [Cucurbita pepo subsp. pepo] >XP_023551295.1 charged multivesicular body protein 7 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 682 bits (1759), Expect = 3.56e-244
Identity = 354/438 (80.82%), Positives = 393/438 (89.73%), Query Frame = 0

Query: 1   MEKELKGSHVREFIKDKVTDWDDEVVTTARFKAFSGQKSDWEPRYLFWRDLILTIAHKFN 60
           MEKE K   VREFI++KV DWD+EVV TARFKAFSGQKSDWEPRYLFWRDLILTIAH+FN
Sbjct: 1   MEKESKELLVREFIREKVPDWDNEVVATARFKAFSGQKSDWEPRYLFWRDLILTIAHQFN 60

Query: 61  FIFIKPIEIRNQWFSQGGLAPLCLDHVLHQMYIDGDIIIQSDMLDPRSGQLSHLFKKLSN 120
           FIF+KP EI+NQWFS+GGLAPLCLDHVLH M I+GDII +SDMLDPR GQLS+LFKKLSN
Sbjct: 61  FIFLKPSEIKNQWFSRGGLAPLCLDHVLHLMQIEGDIIRRSDMLDPRGGQLSYLFKKLSN 120

Query: 121 LMGTSKKNPNDLLRDEYVVLASVLKDRAAEVVKCLSHSNWTSSCIITMMKFQNICGGPDE 180
           LMGTSKKNP+ LL D+Y+ LA VL+DRAAEVVKCLSHSNWTSSC+ITM+KFQNICGGPDE
Sbjct: 121 LMGTSKKNPDSLLSDDYIALARVLQDRAAEVVKCLSHSNWTSSCVITMVKFQNICGGPDE 180

Query: 181 ATAILSYLFGCGKARYLSREKEEFLEGVKVSLSAATVPSITTLDCDILHLIWTTEKLQQQ 240
           ATAILSYL  CGKARYLS+E++E +EGVK+SLSAA VP ITTLD DILHLIWTTE+LQ+Q
Sbjct: 181 ATAILSYLTECGKARYLSKEQKELVEGVKLSLSAAKVPGITTLDYDILHLIWTTERLQRQ 240

Query: 241 LDVIDQRYNVSRQSALASLKSGNKKTALKHARELKISTESREKVASLLNRVEEVLNAIAD 300
           LDVIDQRY+VSRQSALASLKSGNKKTALKHARELKI+TESR+KVASLLNRVEEVLNAIAD
Sbjct: 241 LDVIDQRYDVSRQSALASLKSGNKKTALKHARELKITTESRKKVASLLNRVEEVLNAIAD 300

Query: 301 AESTKTVSEAIQIGARVMKEHEVSWEELQHSMQELEDSIDLHKQVASAIDSAPSGSILED 360
           AESTKTVSEAIQIGARVMKEHEVSW++LQHS+QELE SID+ KQVAS IDSAPSGSILE+
Sbjct: 301 AESTKTVSEAIQIGARVMKEHEVSWDQLQHSLQELEASIDIQKQVASVIDSAPSGSILEE 360

Query: 361 EDIEEEFRKFELEVTGQNIDVPTPNSG---------ASVSDDLLSIALSNLKLVEDTGKE 420
           EDI+EEF+K ELEV GQN+D  T ++G         A+VSDD LS ALSNLKLVE+TGKE
Sbjct: 361 EDIKEEFKKLELEVAGQNLDASTSDTGVNIATGNQVATVSDDSLSAALSNLKLVEETGKE 420

Query: 421 TTVNQNSNSNSKSKIMEL 429
           T + Q SNS SK KIMEL
Sbjct: 421 TVI-QKSNSKSKLKIMEL 437

BLAST of MC04g1050 vs. NCBI nr
Match: KAG6579154.1 (Charged multivesicular body protein 7, partial [Cucurbita argyrosperma subsp. sororia] >KAG7016673.1 Charged multivesicular body protein 7 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 680 bits (1754), Expect = 2.05e-243
Identity = 355/438 (81.05%), Positives = 392/438 (89.50%), Query Frame = 0

Query: 1   MEKELKGSHVREFIKDKVTDWDDEVVTTARFKAFSGQKSDWEPRYLFWRDLILTIAHKFN 60
           MEKE K   VREFI++KV DWD+E+V TARFKAFSGQKSDWEPRYLFWRDLILTIAH+FN
Sbjct: 1   MEKESKELLVREFIREKVPDWDNELVATARFKAFSGQKSDWEPRYLFWRDLILTIAHQFN 60

Query: 61  FIFIKPIEIRNQWFSQGGLAPLCLDHVLHQMYIDGDIIIQSDMLDPRSGQLSHLFKKLSN 120
           FIF+KP EI+NQWFS+GGLAPLCLDHVLH M I+GDII +SDMLDPR GQLS+LFKKLSN
Sbjct: 61  FIFLKPSEIKNQWFSRGGLAPLCLDHVLHLMQIEGDIIRRSDMLDPRGGQLSYLFKKLSN 120

Query: 121 LMGTSKKNPNDLLRDEYVVLASVLKDRAAEVVKCLSHSNWTSSCIITMMKFQNICGGPDE 180
           LMGTSKKN +  L D+Y+VLA VL+DRAAEVVKCLSHSNWTSSC+ITM+KFQNICGGPDE
Sbjct: 121 LMGTSKKNLDGFLSDDYIVLACVLQDRAAEVVKCLSHSNWTSSCVITMVKFQNICGGPDE 180

Query: 181 ATAILSYLFGCGKARYLSREKEEFLEGVKVSLSAATVPSITTLDCDILHLIWTTEKLQQQ 240
           ATAILSYL  CGKARYLS+E++E +EGVK+SLSAA VP ITTLD DILHLIWTTE+LQ+Q
Sbjct: 181 ATAILSYLTECGKARYLSKEQKELVEGVKLSLSAAKVPGITTLDYDILHLIWTTERLQRQ 240

Query: 241 LDVIDQRYNVSRQSALASLKSGNKKTALKHARELKISTESREKVASLLNRVEEVLNAIAD 300
           LDVIDQRY+VSRQSALASLKSGNKKTALKHARELKI+TESREKVASLLNRVEEVLNAIAD
Sbjct: 241 LDVIDQRYDVSRQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD 300

Query: 301 AESTKTVSEAIQIGARVMKEHEVSWEELQHSMQELEDSIDLHKQVASAIDSAPSGSILED 360
           AESTKTVSEAIQIGARVMKEHEVSW+ LQHS+QELE SID+ KQVAS IDSAPSGSILE+
Sbjct: 301 AESTKTVSEAIQIGARVMKEHEVSWDILQHSLQELEASIDIQKQVASVIDSAPSGSILEE 360

Query: 361 EDIEEEFRKFELEVTGQNIDVPTPNSG---------ASVSDDLLSIALSNLKLVEDTGKE 420
           EDIEEEF+K ELEV GQN+D  T ++G         A+VSDD LS ALSNLKLVE+TGKE
Sbjct: 361 EDIEEEFKKLELEVAGQNLDASTSDTGVNIATGNQVATVSDDSLSAALSNLKLVEETGKE 420

Query: 421 TTVNQNSNSNSKSKIMEL 429
           T + Q SNS SKSKIMEL
Sbjct: 421 TVI-QKSNSKSKSKIMEL 437

BLAST of MC04g1050 vs. NCBI nr
Match: XP_022938857.1 (charged multivesicular body protein 7 [Cucurbita moschata] >XP_022938858.1 charged multivesicular body protein 7 [Cucurbita moschata] >XP_022938859.1 charged multivesicular body protein 7 [Cucurbita moschata] >XP_022938860.1 charged multivesicular body protein 7 [Cucurbita moschata] >XP_022938861.1 charged multivesicular body protein 7 [Cucurbita moschata])

HSP 1 Score: 676 bits (1743), Expect = 9.70e-242
Identity = 353/438 (80.59%), Positives = 390/438 (89.04%), Query Frame = 0

Query: 1   MEKELKGSHVREFIKDKVTDWDDEVVTTARFKAFSGQKSDWEPRYLFWRDLILTIAHKFN 60
           MEKE K   VREFI++KV DWD+E+V TARFKAFSGQKSDWEPRYLFWRDLILTIAH+FN
Sbjct: 1   MEKESKELLVREFIREKVPDWDNELVATARFKAFSGQKSDWEPRYLFWRDLILTIAHQFN 60

Query: 61  FIFIKPIEIRNQWFSQGGLAPLCLDHVLHQMYIDGDIIIQSDMLDPRSGQLSHLFKKLSN 120
           FIF+KP EI+NQWFS+GGLAPLCLDHVLH M I+GDII +SDMLDPR GQLS+LFKKLSN
Sbjct: 61  FIFLKPSEIKNQWFSRGGLAPLCLDHVLHLMQIEGDIIRRSDMLDPRGGQLSYLFKKLSN 120

Query: 121 LMGTSKKNPNDLLRDEYVVLASVLKDRAAEVVKCLSHSNWTSSCIITMMKFQNICGGPDE 180
           LMGTSKKN + LL D+Y+VLA VL+DRAAEVVKCLSHSNWTSSC+ITM+KFQNICGGPDE
Sbjct: 121 LMGTSKKNLDGLLSDDYIVLACVLQDRAAEVVKCLSHSNWTSSCVITMVKFQNICGGPDE 180

Query: 181 ATAILSYLFGCGKARYLSREKEEFLEGVKVSLSAATVPSITTLDCDILHLIWTTEKLQQQ 240
           ATA LSYL  CGKARYLS+E++E +EGVK+SLSAA VP ITTLD DILHLIWTTE+LQ+Q
Sbjct: 181 ATATLSYLTECGKARYLSKEQKELVEGVKLSLSAAKVPGITTLDYDILHLIWTTERLQRQ 240

Query: 241 LDVIDQRYNVSRQSALASLKSGNKKTALKHARELKISTESREKVASLLNRVEEVLNAIAD 300
           LDVIDQRY+VSRQSALASLKSGNKKTALKHARELKI+TESREKVASLLNRVEEVLNAIAD
Sbjct: 241 LDVIDQRYDVSRQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD 300

Query: 301 AESTKTVSEAIQIGARVMKEHEVSWEELQHSMQELEDSIDLHKQVASAIDSAPSGSILED 360
           AESTKTVSEAIQIGAR MKEHEVSW+ LQHS+QELE SID+ KQVAS IDSAPSG ILE+
Sbjct: 301 AESTKTVSEAIQIGARAMKEHEVSWDILQHSLQELEASIDIQKQVASVIDSAPSGLILEE 360

Query: 361 EDIEEEFRKFELEVTGQNIDVPTPNSGAS---------VSDDLLSIALSNLKLVEDTGKE 420
           EDIEEEF+K ELEV GQN+D  T ++GA+         VSDD LS ALSNLKLVE+TGKE
Sbjct: 361 EDIEEEFKKLELEVAGQNLDASTSDTGANIATGNQVATVSDDSLSAALSNLKLVEETGKE 420

Query: 421 TTVNQNSNSNSKSKIMEL 429
           T + Q SNS SKSKIMEL
Sbjct: 421 TVI-QKSNSKSKSKIMEL 437

BLAST of MC04g1050 vs. ExPASy TrEMBL
Match: A0A6J1CX81 (charged multivesicular body protein 7 OS=Momordica charantia OX=3673 GN=LOC111015645 PE=4 SV=1)

HSP 1 Score: 838 bits (2165), Expect = 1.96e-306
Identity = 432/432 (100.00%), Positives = 432/432 (100.00%), Query Frame = 0

Query: 1   MEKELKGSHVREFIKDKVTDWDDEVVTTARFKAFSGQKSDWEPRYLFWRDLILTIAHKFN 60
           MEKELKGSHVREFIKDKVTDWDDEVVTTARFKAFSGQKSDWEPRYLFWRDLILTIAHKFN
Sbjct: 1   MEKELKGSHVREFIKDKVTDWDDEVVTTARFKAFSGQKSDWEPRYLFWRDLILTIAHKFN 60

Query: 61  FIFIKPIEIRNQWFSQGGLAPLCLDHVLHQMYIDGDIIIQSDMLDPRSGQLSHLFKKLSN 120
           FIFIKPIEIRNQWFSQGGLAPLCLDHVLHQMYIDGDIIIQSDMLDPRSGQLSHLFKKLSN
Sbjct: 61  FIFIKPIEIRNQWFSQGGLAPLCLDHVLHQMYIDGDIIIQSDMLDPRSGQLSHLFKKLSN 120

Query: 121 LMGTSKKNPNDLLRDEYVVLASVLKDRAAEVVKCLSHSNWTSSCIITMMKFQNICGGPDE 180
           LMGTSKKNPNDLLRDEYVVLASVLKDRAAEVVKCLSHSNWTSSCIITMMKFQNICGGPDE
Sbjct: 121 LMGTSKKNPNDLLRDEYVVLASVLKDRAAEVVKCLSHSNWTSSCIITMMKFQNICGGPDE 180

Query: 181 ATAILSYLFGCGKARYLSREKEEFLEGVKVSLSAATVPSITTLDCDILHLIWTTEKLQQQ 240
           ATAILSYLFGCGKARYLSREKEEFLEGVKVSLSAATVPSITTLDCDILHLIWTTEKLQQQ
Sbjct: 181 ATAILSYLFGCGKARYLSREKEEFLEGVKVSLSAATVPSITTLDCDILHLIWTTEKLQQQ 240

Query: 241 LDVIDQRYNVSRQSALASLKSGNKKTALKHARELKISTESREKVASLLNRVEEVLNAIAD 300
           LDVIDQRYNVSRQSALASLKSGNKKTALKHARELKISTESREKVASLLNRVEEVLNAIAD
Sbjct: 241 LDVIDQRYNVSRQSALASLKSGNKKTALKHARELKISTESREKVASLLNRVEEVLNAIAD 300

Query: 301 AESTKTVSEAIQIGARVMKEHEVSWEELQHSMQELEDSIDLHKQVASAIDSAPSGSILED 360
           AESTKTVSEAIQIGARVMKEHEVSWEELQHSMQELEDSIDLHKQVASAIDSAPSGSILED
Sbjct: 301 AESTKTVSEAIQIGARVMKEHEVSWEELQHSMQELEDSIDLHKQVASAIDSAPSGSILED 360

Query: 361 EDIEEEFRKFELEVTGQNIDVPTPNSGASVSDDLLSIALSNLKLVEDTGKETTVNQNSNS 420
           EDIEEEFRKFELEVTGQNIDVPTPNSGASVSDDLLSIALSNLKLVEDTGKETTVNQNSNS
Sbjct: 361 EDIEEEFRKFELEVTGQNIDVPTPNSGASVSDDLLSIALSNLKLVEDTGKETTVNQNSNS 420

Query: 421 NSKSKIMELGIS 432
           NSKSKIMELGIS
Sbjct: 421 NSKSKIMELGIS 432

BLAST of MC04g1050 vs. ExPASy TrEMBL
Match: A0A6J1FF90 (charged multivesicular body protein 7 OS=Cucurbita moschata OX=3662 GN=LOC111444939 PE=4 SV=1)

HSP 1 Score: 676 bits (1743), Expect = 4.69e-242
Identity = 353/438 (80.59%), Positives = 390/438 (89.04%), Query Frame = 0

Query: 1   MEKELKGSHVREFIKDKVTDWDDEVVTTARFKAFSGQKSDWEPRYLFWRDLILTIAHKFN 60
           MEKE K   VREFI++KV DWD+E+V TARFKAFSGQKSDWEPRYLFWRDLILTIAH+FN
Sbjct: 1   MEKESKELLVREFIREKVPDWDNELVATARFKAFSGQKSDWEPRYLFWRDLILTIAHQFN 60

Query: 61  FIFIKPIEIRNQWFSQGGLAPLCLDHVLHQMYIDGDIIIQSDMLDPRSGQLSHLFKKLSN 120
           FIF+KP EI+NQWFS+GGLAPLCLDHVLH M I+GDII +SDMLDPR GQLS+LFKKLSN
Sbjct: 61  FIFLKPSEIKNQWFSRGGLAPLCLDHVLHLMQIEGDIIRRSDMLDPRGGQLSYLFKKLSN 120

Query: 121 LMGTSKKNPNDLLRDEYVVLASVLKDRAAEVVKCLSHSNWTSSCIITMMKFQNICGGPDE 180
           LMGTSKKN + LL D+Y+VLA VL+DRAAEVVKCLSHSNWTSSC+ITM+KFQNICGGPDE
Sbjct: 121 LMGTSKKNLDGLLSDDYIVLACVLQDRAAEVVKCLSHSNWTSSCVITMVKFQNICGGPDE 180

Query: 181 ATAILSYLFGCGKARYLSREKEEFLEGVKVSLSAATVPSITTLDCDILHLIWTTEKLQQQ 240
           ATA LSYL  CGKARYLS+E++E +EGVK+SLSAA VP ITTLD DILHLIWTTE+LQ+Q
Sbjct: 181 ATATLSYLTECGKARYLSKEQKELVEGVKLSLSAAKVPGITTLDYDILHLIWTTERLQRQ 240

Query: 241 LDVIDQRYNVSRQSALASLKSGNKKTALKHARELKISTESREKVASLLNRVEEVLNAIAD 300
           LDVIDQRY+VSRQSALASLKSGNKKTALKHARELKI+TESREKVASLLNRVEEVLNAIAD
Sbjct: 241 LDVIDQRYDVSRQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD 300

Query: 301 AESTKTVSEAIQIGARVMKEHEVSWEELQHSMQELEDSIDLHKQVASAIDSAPSGSILED 360
           AESTKTVSEAIQIGAR MKEHEVSW+ LQHS+QELE SID+ KQVAS IDSAPSG ILE+
Sbjct: 301 AESTKTVSEAIQIGARAMKEHEVSWDILQHSLQELEASIDIQKQVASVIDSAPSGLILEE 360

Query: 361 EDIEEEFRKFELEVTGQNIDVPTPNSGAS---------VSDDLLSIALSNLKLVEDTGKE 420
           EDIEEEF+K ELEV GQN+D  T ++GA+         VSDD LS ALSNLKLVE+TGKE
Sbjct: 361 EDIEEEFKKLELEVAGQNLDASTSDTGANIATGNQVATVSDDSLSAALSNLKLVEETGKE 420

Query: 421 TTVNQNSNSNSKSKIMEL 429
           T + Q SNS SKSKIMEL
Sbjct: 421 TVI-QKSNSKSKSKIMEL 437

BLAST of MC04g1050 vs. ExPASy TrEMBL
Match: A0A6J1K252 (charged multivesicular body protein 7 OS=Cucurbita maxima OX=3661 GN=LOC111489435 PE=4 SV=1)

HSP 1 Score: 672 bits (1733), Expect = 1.56e-240
Identity = 351/438 (80.14%), Positives = 389/438 (88.81%), Query Frame = 0

Query: 1   MEKELKGSHVREFIKDKVTDWDDEVVTTARFKAFSGQKSDWEPRYLFWRDLILTIAHKFN 60
           MEKE K   VREFI++KV DWD+EVV TA FKAFSGQKSDWEPRYLFWRDLIL I+H+FN
Sbjct: 1   MEKESKELLVREFIREKVPDWDNEVVATAWFKAFSGQKSDWEPRYLFWRDLILKISHQFN 60

Query: 61  FIFIKPIEIRNQWFSQGGLAPLCLDHVLHQMYIDGDIIIQSDMLDPRSGQLSHLFKKLSN 120
           FIFIKP EI+NQWFS+GGLAPLCLDHVLH M I+GDII +SDMLDPR GQLS+LFKKLSN
Sbjct: 61  FIFIKPSEIKNQWFSRGGLAPLCLDHVLHLMQIEGDIIRRSDMLDPRGGQLSYLFKKLSN 120

Query: 121 LMGTSKKNPNDLLRDEYVVLASVLKDRAAEVVKCLSHSNWTSSCIITMMKFQNICGGPDE 180
           +MGTSKKNP+ LL D+Y+VLA VL+DRAAEVVKCLSHSNWTSSC+ITM+KFQNICGGPDE
Sbjct: 121 MMGTSKKNPDGLLSDDYIVLACVLQDRAAEVVKCLSHSNWTSSCVITMVKFQNICGGPDE 180

Query: 181 ATAILSYLFGCGKARYLSREKEEFLEGVKVSLSAATVPSITTLDCDILHLIWTTEKLQQQ 240
           ATAILSYL  CGKARYLS+E++E +EGVK+SLSAA VP ITTLD DILHLIWTTE+LQ+Q
Sbjct: 181 ATAILSYLTECGKARYLSKEQKELVEGVKLSLSAAKVPGITTLDYDILHLIWTTERLQRQ 240

Query: 241 LDVIDQRYNVSRQSALASLKSGNKKTALKHARELKISTESREKVASLLNRVEEVLNAIAD 300
           LDVIDQRY+VSRQSALASLKSGNKKTALKHARELKI+TESREKVASLLNRVEEVLNAIAD
Sbjct: 241 LDVIDQRYDVSRQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD 300

Query: 301 AESTKTVSEAIQIGARVMKEHEVSWEELQHSMQELEDSIDLHKQVASAIDSAPSGSILED 360
           AESTKTVSEAIQIGARVMKEHEVSW++LQHS+ ELE SID+ KQV S IDSAPSGSILE+
Sbjct: 301 AESTKTVSEAIQIGARVMKEHEVSWDQLQHSLHELEASIDIQKQVESVIDSAPSGSILEE 360

Query: 361 EDIEEEFRKFELEVTGQNIDVPTPNSG---------ASVSDDLLSIALSNLKLVEDTGKE 420
           EDIEEEF+K ELEV GQN+D  T ++G         A+VSDD LS ALSNLKLV +T KE
Sbjct: 361 EDIEEEFKKLELEVAGQNLDATTSDTGVNIATGHQVATVSDDSLSAALSNLKLVGETVKE 420

Query: 421 TTVNQNSNSNSKSKIMEL 429
           T + Q SNS SKSKIMEL
Sbjct: 421 TVI-QKSNSKSKSKIMEL 437

BLAST of MC04g1050 vs. ExPASy TrEMBL
Match: A0A0A0KMY2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G118690 PE=4 SV=1)

HSP 1 Score: 648 bits (1671), Expect = 4.59e-231
Identity = 339/442 (76.70%), Positives = 382/442 (86.43%), Query Frame = 0

Query: 1   MEKELKGSHVREFIKDKVTDWDDEVVTTARFKAFSGQKSDWEPRYLFWRDLILTIAHKFN 60
           MEKE KGS VREFI++KV DWDDEVV TARFKAFSGQKSDWEPRYLFWRDLILT+A +FN
Sbjct: 1   MEKESKGSCVREFIREKVPDWDDEVVATARFKAFSGQKSDWEPRYLFWRDLILTVARQFN 60

Query: 61  FIFIKPIEIRNQWFSQGGLAPLCLDHVLHQMYIDGDIIIQSDMLDPRSGQLSHLFKKLSN 120
           F+ IKP EI+NQWF +GGL PLCLDHVLH MY  GDII +SDMLDPRSGQLS++FKKLSN
Sbjct: 61  FLIIKPSEIKNQWFYRGGLTPLCLDHVLHLMYTGGDIIRRSDMLDPRSGQLSYMFKKLSN 120

Query: 121 LMGTSKKNPNDLLRDEYVVLASVLKDRAAEVVKCLSHSNWTSSCIITMMKFQNICGGPDE 180
           LMGTSKKNP+ LLRD+Y+VLA VL+DRAAEV+KCLS S+WTSSCIITM+KFQNICGGPDE
Sbjct: 121 LMGTSKKNPDSLLRDDYIVLACVLQDRAAEVIKCLSLSSWTSSCIITMVKFQNICGGPDE 180

Query: 181 ATAILSYLFGCGKARYLSREKEEFLEGVKVSLSAATVPSITTLDCDILHLIWTTEKLQQQ 240
           AT ILSYL  CGKA++LS+EK+E LEGVKVSLSA TVP IT+LD DILHL+WT EKLQQQ
Sbjct: 181 ATVILSYLIECGKAKFLSKEKKELLEGVKVSLSATTVPGITSLDYDILHLVWTAEKLQQQ 240

Query: 241 LDVIDQRYNVSRQSALASLKSGNKKTALKHARELKISTESREKVASLLNRVEEVLNAIAD 300
           LDVIDQRY+VS+QSAL SLKSGN+KTALKHARELKI+TESREKVASL NRVEEVLNAIAD
Sbjct: 241 LDVIDQRYDVSKQSALVSLKSGNRKTALKHARELKITTESREKVASLFNRVEEVLNAIAD 300

Query: 301 AESTKTVSEAIQIGARVMKEHEVSWEELQHSMQELEDSIDLHKQVASAIDSAPSGSILED 360
           AE TKTVSEAIQIGARVMKEHEV+W++LQ S+QELE S+D+ KQVA+AIDS PS SI +D
Sbjct: 301 AELTKTVSEAIQIGARVMKEHEVNWDQLQDSLQELEASVDIQKQVANAIDSVPSSSIPDD 360

Query: 361 EDIEEEFRKFELEVT-GQNIDVPTPNSG---------ASVSDDLLSIALSNLKLVEDTGK 420
           EDIEEEF+K ELE+T GQ +D  T  SG         A+V DD LS ALSNLKLVE+T K
Sbjct: 361 EDIEEEFKKLELELTAGQILDASTSESGVNIATGETVAAVCDDSLSTALSNLKLVEETEK 420

Query: 421 ETTVNQNSNSNSKSKIMELGIS 432
           E     +S+S  KSKIME+GIS
Sbjct: 421 EN--GNSSHSKRKSKIMEVGIS 440

BLAST of MC04g1050 vs. ExPASy TrEMBL
Match: A0A1S3CRA4 (charged multivesicular body protein 7 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103503835 PE=4 SV=1)

HSP 1 Score: 620 bits (1598), Expect = 6.30e-220
Identity = 327/443 (73.81%), Positives = 367/443 (82.84%), Query Frame = 0

Query: 1   MEKELKGSHVREFIKDKVTDWDDEVVTTARFKAFSGQKSDWEPRYLFWRDLILTIAHKFN 60
           MEKE KGS VREFI++KV DWDDEVV TARFKAFSGQKSDWEPRYLFWRDLILT+A + N
Sbjct: 1   MEKESKGSCVREFIREKVLDWDDEVVATARFKAFSGQKSDWEPRYLFWRDLILTVARQLN 60

Query: 61  FIFIKPIEIRNQWFSQGGLAPLCLDHVLHQMYIDGDIIIQSDMLDPRSGQLSHLFKKLSN 120
           F+ IKP EI+NQWFS+GGL PLCLDHVLH MY  GDII +SDMLDPRSGQLS++FK+LSN
Sbjct: 61  FLIIKPSEIKNQWFSRGGLTPLCLDHVLHLMYTGGDIIRRSDMLDPRSGQLSYMFKRLSN 120

Query: 121 LMGTSKKNPNDLLRDEYVVLASVLKDRAAEVVKCLSHSNWTSSCIITMMKFQNICGGPDE 180
           LMGTSKKNP  LLRD+Y++LA VL+DRA EV+KCLS SNWTSS IITM+KFQNICGGPDE
Sbjct: 121 LMGTSKKNPESLLRDDYIILACVLQDRATEVIKCLSLSNWTSSYIITMVKFQNICGGPDE 180

Query: 181 ATAILSYLFGCGKARYLSREKEEFLEGVKVSLSAATVPSITTLDCDILHLIWTTEKLQQQ 240
           AT ILSYL  CGKA++LS+ K + LEGVKVS SA TVP ITTLD DILHL+WT EKLQQQ
Sbjct: 181 ATVILSYLIECGKAKFLSKGKTKLLEGVKVSFSATTVPGITTLDYDILHLVWTAEKLQQQ 240

Query: 241 LDVIDQRYNVSRQSALASLKSGNKKTALKHARELKISTESREKVASLLNRVEEVLNAIAD 300
           LD I+QRY+VS+QSAL SLKSGNKK ALKHARELKI+TESREKVASL NRVEEVLNAI D
Sbjct: 241 LDAINQRYDVSKQSALVSLKSGNKKAALKHARELKITTESREKVASLFNRVEEVLNAIGD 300

Query: 301 AESTKTVSEAIQIGARVMKEHEVSWEELQHSMQELEDSIDLHKQVASAIDSAPSGSILED 360
           AE TK+VSEAIQIGARVMKEHEV+W++LQHS+QELE SID+ KQVA+ IDS PS SI  D
Sbjct: 301 AELTKSVSEAIQIGARVMKEHEVNWDQLQHSLQELETSIDIQKQVANTIDSVPSASIPND 360

Query: 361 E-DIEEEFRKFELEVTG-QNIDVPTPNSGASVS---------DDLLSIALSNLKLVEDTG 420
           E DIEE F+K ELE+T  Q +D  T  S  +++         DD LS  LSNLKLVE+  
Sbjct: 361 EEDIEEVFKKLELELTAAQILDASTSESAVNIATGETVVVVCDDSLSSTLSNLKLVEEVE 420

Query: 421 KETTVNQNSNSNSKSKIMELGIS 432
           KE   NQ SNS   SKIMELGIS
Sbjct: 421 KEDA-NQKSNSKRNSKIMELGIS 442

BLAST of MC04g1050 vs. TAIR 10
Match: AT3G62080.2 (SNF7 family protein)

HSP 1 Score: 399.4 bits (1025), Expect = 3.6e-111
Identity = 211/427 (49.41%), Positives = 301/427 (70.49%), Query Frame = 0

Query: 4   ELKGSHVREFIKDKVTDWDDEVVTTARFKAFSGQKSDWEPRYLFWRDLILTIAHKFNFIF 63
           E+    V+EFI+ +V DWDDEVV  ARFKAFSGQ+SDWE ++ FWRDLI+ ++ +F    
Sbjct: 39  EMDPEAVKEFIRREVPDWDDEVVAMARFKAFSGQRSDWELKFQFWRDLIIKVSRQFGLFI 98

Query: 64  IKPIEIRNQWFSQGGLAPLCLDHVLHQMYIDGDIIIQSDMLDPRSGQLSHLFKKLSNLMG 123
           I P++++  WF +GG+ PLC+D V+  M+ +GD++  SD+ DP SG+++ L + + NLM 
Sbjct: 99  IDPVQVKKAWFDRGGMTPLCIDDVVLLMHSEGDVVRISDLDDPGSGRIARLLRTVKNLMV 158

Query: 124 TSKKNPNDLLRDEYVVLASVLKDRAAEVVKCLSHSNWTSSCIITMMKFQNICGGPDEATA 183
                  ++L +  +VL  +LK++AA+VVK LS  +WTS+C++T+ KF+N+C G +EA+A
Sbjct: 159 QQPVKQEEIL-ENTLVLVPLLKEKAADVVKILSEGHWTSTCVVTLKKFRNLCNGSNEASA 218

Query: 184 ILSYLFGCGKARYLSREKEEFLEGVKVSLSAATVPSITTLDCDILHLIWTTEKLQQQLDV 243
           +LS+L GCGKA  +S  + E +EGVKVS S   +P I+TLDCDILHL+ TTEKLQ QL+V
Sbjct: 219 VLSHLSGCGKAHKISINRGELIEGVKVSFSQTALPGISTLDCDILHLLRTTEKLQDQLEV 278

Query: 244 IDQRYNVSRQSALASLKSGNKKTALKHARELKISTESREKVASLLNRVEEVLNAIADAES 303
           +DQR   S++SALASLKSG++K AL+HARELK+ TESREK  SLLNRVEEVLN IAD+ES
Sbjct: 279 MDQRCEKSKKSALASLKSGHRKVALRHARELKVVTESREKCTSLLNRVEEVLNTIADSES 338

Query: 304 TKTVSEAIQIGARVMKEHEVSWEELQHSMQELEDSIDLHKQVASAIDSAPSGSILEDEDI 363
           TK VSEAI+ GARVMK+ ++S +++   ++ELE++I+  KQV  A++SAP   I +DEDI
Sbjct: 339 TKMVSEAIKTGARVMKDIKISADDVHDYLEELEETIESQKQVEKALESAPYPDI-DDEDI 398

Query: 364 EEEFRKFELEVTGQNIDVPTPNSGASVSDDLLSIALSNLKL--VEDTGKETTVNQNSNSN 423
           EEE  + E+++  ++  V    S  +   D L+   S LKL   + T +E         +
Sbjct: 399 EEELLELEMDLESESSQVLPATSDTA---DSLTEMFSELKLGKTKQTLEEQATEPAQMKD 458

Query: 424 SKSKIME 429
           S  KI+E
Sbjct: 459 SGKKILE 460

BLAST of MC04g1050 vs. TAIR 10
Match: AT3G62080.1 (SNF7 family protein )

HSP 1 Score: 398.3 bits (1022), Expect = 8.0e-111
Identity = 210/421 (49.88%), Positives = 299/421 (71.02%), Query Frame = 0

Query: 10  VREFIKDKVTDWDDEVVTTARFKAFSGQKSDWEPRYLFWRDLILTIAHKFNFIFIKPIEI 69
           V+EFI+ +V DWDDEVV  ARFKAFSGQ+SDWE ++ FWRDLI+ ++ +F    I P+++
Sbjct: 6   VKEFIRREVPDWDDEVVAMARFKAFSGQRSDWELKFQFWRDLIIKVSRQFGLFIIDPVQV 65

Query: 70  RNQWFSQGGLAPLCLDHVLHQMYIDGDIIIQSDMLDPRSGQLSHLFKKLSNLMGTSKKNP 129
           +  WF +GG+ PLC+D V+  M+ +GD++  SD+ DP SG+++ L + + NLM       
Sbjct: 66  KKAWFDRGGMTPLCIDDVVLLMHSEGDVVRISDLDDPGSGRIARLLRTVKNLMVQQPVKQ 125

Query: 130 NDLLRDEYVVLASVLKDRAAEVVKCLSHSNWTSSCIITMMKFQNICGGPDEATAILSYLF 189
            ++L +  +VL  +LK++AA+VVK LS  +WTS+C++T+ KF+N+C G +EA+A+LS+L 
Sbjct: 126 EEIL-ENTLVLVPLLKEKAADVVKILSEGHWTSTCVVTLKKFRNLCNGSNEASAVLSHLS 185

Query: 190 GCGKARYLSREKEEFLEGVKVSLSAATVPSITTLDCDILHLIWTTEKLQQQLDVIDQRYN 249
           GCGKA  +S  + E +EGVKVS S   +P I+TLDCDILHL+ TTEKLQ QL+V+DQR  
Sbjct: 186 GCGKAHKISINRGELIEGVKVSFSQTALPGISTLDCDILHLLRTTEKLQDQLEVMDQRCE 245

Query: 250 VSRQSALASLKSGNKKTALKHARELKISTESREKVASLLNRVEEVLNAIADAESTKTVSE 309
            S++SALASLKSG++K AL+HARELK+ TESREK  SLLNRVEEVLN IAD+ESTK VSE
Sbjct: 246 KSKKSALASLKSGHRKVALRHARELKVVTESREKCTSLLNRVEEVLNTIADSESTKMVSE 305

Query: 310 AIQIGARVMKEHEVSWEELQHSMQELEDSIDLHKQVASAIDSAPSGSILEDEDIEEEFRK 369
           AI+ GARVMK+ ++S +++   ++ELE++I+  KQV  A++SAP   I +DEDIEEE  +
Sbjct: 306 AIKTGARVMKDIKISADDVHDYLEELEETIESQKQVEKALESAPYPDI-DDEDIEEELLE 365

Query: 370 FELEVTGQNIDVPTPNSGASVSDDLLSIALSNLKL--VEDTGKETTVNQNSNSNSKSKIM 429
            E+++  ++  V    S  +   D L+   S LKL   + T +E         +S  KI+
Sbjct: 366 LEMDLESESSQVLPATSDTA---DSLTEMFSELKLGKTKQTLEEQATEPAQMKDSGKKIL 421

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
Q6PBQ2 | 2.7e-15 | 23.80 | Charged multivesicular body protein 7 OS=Danio rerio OX=7955 GN=chmp7 PE=2 SV=1
Q871Y8 | 6.3e-04 | 24.62 | Vacuolar-sorting protein snf7 OS=Neurospora crassa (strain ATCC 24698 / 74-OR23-...

Match Name | E-value | Identity | Description
XP_022146425.1 | 4.05e-306 | 100.00 | charged multivesicular body protein 7 [Momordica charantia]
XP_038875996.1 | 3.01e-244 | 81.76 | uncharacterized protein LOC120068336 isoform X2 [Benincasa hispida]
XP_023551290.1 | 3.56e-244 | 80.82 | charged multivesicular body protein 7 [Cucurbita pepo subsp. pepo] >XP_023551291...
KAG6579154.1 | 2.05e-243 | 81.05 | Charged multivesicular body protein 7, partial [Cucurbita argyrosperma subsp. so...
XP_022938857.1 | 9.70e-242 | 80.59 | charged multivesicular body protein 7 [Cucurbita moschata] >XP_022938858.1 charg...

Match Name | E-value | Identity | Description
A0A6J1CX81 | 1.96e-306 | 100.00 | charged multivesicular body protein 7 OS=Momordica charantia OX=3673 GN=LOC11101...
A0A6J1FF90 | 4.69e-242 | 80.59 | charged multivesicular body protein 7 OS=Cucurbita moschata OX=3662 GN=LOC111444...
A0A6J1K252 | 1.56e-240 | 80.14 | charged multivesicular body protein 7 OS=Cucurbita maxima OX=3661 GN=LOC11148943...
A0A0A0KMY2 | 4.59e-231 | 76.70 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G118690 PE=4 SV=1
A0A1S3CRA4 | 6.30e-220 | 73.81 | charged multivesicular body protein 7 isoform X1 OS=Cucumis melo OX=3656 GN=LOC1...

Match Name | E-value | Identity | Description
AT3G62080.2 | 3.6e-111 | 49.41 | SNF7 family protein
AT3G62080.1 | 8.0e-111 | 49.88 | SNF7 family protein
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (Dali-11) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 284..304
None | No IPR available | COILS | Coil | Coil | coord: 318..338
None | No IPR available | GENE3D | 6.10.140.1230 | | coord: 230..378; e-value: 1.0E-5; score: 27.4
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 410..432
None | No IPR available | PANTHER | PTHR22761:SF7 | SNF7 FAMILY PROTEIN | coord: 104..405
None | No IPR available | PANTHER | PTHR22761 | CHARGED MULTIVESICULAR BODY PROTEIN | coord: 104..405
IPR005024 | Snf7 family | PFAM | PF03357 | Snf7 | coord: 233..379; e-value: 2.9E-14; score: 53.1
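
The `coord:` values above are residue ranges on the 432-residue protein. As a rough sketch, assuming the ranges are 1-based and inclusive (an assumption about the coordinate convention, not something stated on this page), the following turns a range into a length and checks whether one annotated region lies inside another. The helper names `parse_coord`, `span_length` and `contains` are hypothetical.

```python
import re


def parse_coord(text):
    """Parse a string such as 'coord: 233..379' into a (start, end) tuple."""
    match = re.search(r"(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"no coordinate range in {text!r}")
    return int(match.group(1)), int(match.group(2))


def span_length(span):
    """Length of a range, assuming 1-based inclusive coordinates."""
    start, end = span
    return end - start + 1


def contains(outer, inner):
    """True if `inner` lies entirely within `outer`."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


# Ranges copied from the InterPro annotations above.
snf7 = parse_coord("coord: 233..379")        # PFAM PF03357
coils = [parse_coord("coord: 284..304"),     # COILS
         parse_coord("coord: 318..338")]     # COILS

print(span_length(snf7))                     # 147
print([contains(snf7, c) for c in coils])    # [True, True]
```

Under that assumption, both predicted coiled-coil segments fall within the Pfam Snf7 domain.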

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
MC04g1050.1 | MC04g1050.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0032511 | late endosome to vacuole transport via multivesicular body sorting pathway
biological_process | GO:0006900 | vesicle budding from membrane
biological_process | GO:0007034 | vacuolar transport
cellular_component | GO:0009898 | cytoplasmic side of plasma membrane
cellular_component | GO:0000815 | ESCRT III complex
cellular_component | GO:0005771 | multivesicular body