ClCG04G003650 (gene) Watermelon (Charleston Gray) v2.5

Overview
Name: ClCG04G003650
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Mitochondrial glycoprotein
Location: CG_Chr04: 14359646 .. 14364125 (+)
RNA-Seq Expression: ClCG04G003650
Synteny: ClCG04G003650
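
The Location field packs the chromosome, start, end and strand into one string. The following is a minimal sketch of how it could be split into parts, assuming the coordinates are 1-based and inclusive (the record does not state this explicitly). The function name `parse_location` and the regular expression are illustrative and are not part of the database.

```python
import re

# Pattern for strings shaped like "SEQID: START .. END (STRAND)".
LOCATION_RE = re.compile(r"\s*(\S+?):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Split a location string into seqid, start, end, strand and span length.

    Assumes 1-based inclusive coordinates, so length = end - start + 1.
    """
    match = LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }


loc = parse_location("CG_Chr04: 14359646 .. 14364125 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the gene spans 4480 bp on the forward strand of CG_Chr04.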
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CTGAGAGGTCGCACTGGGAGAAGAAGGGACAGTTGAAGAATGGGGAGGGCTAATCAAATATTTCGCAAGGCCCGTAAAGCGCTCCATGATCTTGACCTCCTCAAGATCTTGCAATCCGAGATAGCCCACGAGCTTTCTTCAACCCGATTTCTGGTCCTCTTTAGTTTCCTCCATTGAAACCCTTTTCAATCAATCTCTTCATCATCTCTGATTTTCCATTTCTCTTTTTCCCATTCTCAATCGCAGAACCATGAACATAATAATGGCAGTTCCAGCGATTTCGCTGTGGAATATGACTCGCCCATGTCCCGAGACGTGGTGTTGCGGCGAAAATTGGAATCGGGCGAGGAGGTCGCGATTTCTGCTCTACCGGGTCGTTTCAGATTTGGACACGAAGGGGCTTTTCCGAGGGAGATTTTGATGAAGATTTGTGTGAGTAAGCCTGGAGTTAGCTCTCTTTTGCAGTTTGATTGTGGGGTTTCAGAGGATGGTCATAGTGGGTCTCCTTTCAAAATCTACAATGCCTATTATCTTCAATCTTCTGCTTGTTTGGGACCTTCTGTTTATAGAGGCCCTTTGTTCAGGTACTGCTTTTTGGCAACTCTAATCGTACCAAGAATTAACATTGAGCATAGCCATTTGTGTTTGATTACATTTTTGAGAAGCTAGAAACCCATGAGTTCTCTGTGTTTGTTTACTGATTTCAAACTGGAACTGTGACCTGGCTATATAATTTTCTTTCCTTGAGATTGGTCTCTTGGTTATATTTTGAAACTAGTGGTATGACATGAAATGATGAATAGGTATCACTTTTCACTTAATTTCTTTGACACTTTGGAGACCTCAGTCGGTGAAAGTTGTAGGCTGAGTTTTATCTAGGACAAGTTGTTGATCTTCGTCTCTAGGGTAGAATCAGGGGGCCTCAGACGTGGTAGTTACCAGGCCTGTTTTTACTTATTTCTTGTTATTTGGGACCCTATTCATCATCCATTCCTCGTCTACCCTTTGCGCCACCTTCCTCATAGTGCTAGGCTGTCACTGCACAACTTATGGGAATCTTATCGCCAAAGTAACCTAATGTGTGTACCAGTAACCATTTTACTTATCCTTTGAGAAGAACTTATGAAGTTCAAGTTGCAAAGTATTTGTGATGAGAATACGATTGGTGTGGTTTTTTGTTCTAAATGGCAGATTCAGGCTTTAGGATCCTGGCCCCCAATTCAATAATTTATGAAGTTCTGTAAAGTTCATCCTTCCTTTAGGACAATTTTTTAGTATTAACTAAATAATATTTATGATTTACAAAAGCTTTTTCCCCCTTGTTTTTGAAAGGCAATCTATAATTACCCATAGCCTCGTAATGGGAGGATATATTAACTTCTCATCCACTTCCTCTTGTACATTCCTTTTTTGGATAGAAAGATATTTCTTATCCAAAAAAAGCATTTACTTTTCAAAGCGCCTGTTTCGGTCTGCAAGAGTATTGAGAGGACTATGAGAGACTTCCTCTAGGAAGGGGTAGAGGAAGGGAATGGTTCTCATTCGGTTAGCTAGGATGTTACGGGGAAACCTGTGAATTAGGGGGTTGGAACTTGAAAATTTAAGGTTACACAACAAAGCTTTGTTGGCTAAGTGGCTTTGGTGGTTTGCCCTTGATTTTGAATCCTTGTGGTGGAAGATTATTGGGAGCAAGCCAAGCATGGTCCCCATCCTTATGAGTGGGTGGTGAAAGGGGTTAAAGGTATACACCAAAACCTGTGGAGAGATATTTCTTCTCATCTCCCTTCTTTTTCCCATCTTGTTTTCTGTGTGGTGGGAGGCGGTAAGGAAACATATTTCTGGGAGGATCTTTAGGTAAGGGATAGACCCTTCTCCTCTGTTTCCTTGGTTATATCATCTATCCTCCTTTAAAAATTGTTGTGTGTTCGATTTTTTGGTTTGGTCAGGGAACTCGGTCTCTTTCTCCTATGGTTTTCGTTTGGTCTCTTCTTTCCTTGATT
GAGGGTTTTGATTTTAGGTTTGGGAGAAAGGATGTTTGGGTGTGGAGTCCTACCCCTATGGAGGGCTTCTCTTGTAATTCTTTCTTTAGGTTTTTACTTGATCCCTCTCCCACTGTTGAGTCGGTCTTTGATGTTTTATGGAGGATTAAGATTCCAAAGAAAGTTACATTCCTTTCTTGGCAAGTTTTGCTTGGGTCTGTGAACACCATAGATAGGATTGTGAGAAAGATGCCTTTGCTTGTGGGCCCTTGGTGTTGTATCCTTTGTCGGAAGGTGGAGGAAAACCTAGATCACCTTCTTTGGGTGTGAAATTACTTCTTTCAGGGGTTTGATTTTGCGCTTGCTCATCAAAGGGATGTTTGTTTGATGATCGGGAGTTCCATCCAGATCCGTCTTTCAAAGAGAAAGGATGTTTTCTTTGGTTTGTGGGAGTGTGTGCGATTTTGTGGGATATTTGGGGGGAGAGGAACAACAAAGTTTTTCTTGGAGTGGAGAGAGACCCTAGCGAGGTTTGGTCCCTTGTGAGCTTTCATGTTCTCCATGGGTTTCGGTTTTGAAGACTTATAGTTATTATTCCCTAGGCATATCTTATTTAATTGGATTCCCTTCTATGAGGGGTTTTGTGAGCTTGGTTTTTTGTATGCCTTTTATTCTTTCATTTTTTCTCAATGAAAGTAGTTGTTCCTATTAAAAGAAAAAAAAGTATTGGTTAAATTATCAATTGGGTCTGTATATTTTATAAATTTCAATTTGATCTCTATGGTTTGATTCAACTTTATAAATTGTCCCTAAACCATTGTTTACCCACCCTTTTTCTCCTATTTTTTACTCTGCCTAAACTATAGTATTAATCTAACATATTAAGCCAAATAGAAACCAACAACTCATTACTACATGAAATAGATAATACGCCACATTGGTAGAACTTAAAGGACGGATGGTGTTTGTGGGGTTTAACCAAACCATAAAGACCAAATTTTAAGCCTTAAAACTATAGGTACCTAATTGAAATTAAGCCCAAACCATAGGGACCAGATTGAAACTTTTAAAATTATTGAATCCAAATTTGTAATTTAACCAAAATTATATTACGCAATGCATCATAGTCCAGTGGGGTTGGTGACCTCCTGCTCACACGGAAAGGCAAATGTATATATTTTGCTTCGCATATGATGATTGTATCAATCAAAAAGATGGTTCATCCCTTTTCTGCAACTTGGAAGCTTAGGAGCAAAGATGGGTGGCCACTTACCTAAATGAAACAATAATCTTAATGTATCTTGACAACATCACCAGTCATGGTGGACGTAAATTGGCTTGGATAGTTACAATATCAAGAGAAGAAATAGCTGCTTCATTCTTTTCCTCTGTTTAATGCAATTTATCTGCTTGAGTTATTAAGAAGGTAAAAAAACTTAACCATCATAGGTTGGCCTACAATAAGGTCCATGAGTTTAATAAAAGGATTTAGAGGGAATGAGTTCTTGTGGCTGGACTGCACTATTTTGTAGATATTGATTCTTCTGATCATGAACACTTTTAAGTCAGATTCATCATTTTATTGTTGAGTGTGTTTAAGTTCGTATTCATGGTTGGATCTGGCTATTTCTGTGGCTAGATTTCGAGTGTTTTAGTTGTTCATTAGGTCACGAGCAGTTTTCCAATTTAGCTTTGAGAATGTATTGATTTCACCGCTTGTTTGGAGTTCTATTTAAGGTTAAAACTCACTAAGCCAAGTACGGAGGTAAAGTTGATTCTTTCCAAATTTGAATGGTAAAAACATAGTGTATAGACTAAACATAGACTTGTATGAATTATCACATCTTCCTTAATCTTAATTGACCAATTTGACTTTAGTTTCCTAGTTGATAACTGAAAATTTGTCTGTGAATGGCTTATTTTCGCTTCTATTACTTTGCTAAGCATCACGATTCATGATATTTAGCCAGTCATTCTCCGATCTTTTCAGTTTGGTTTTTGTCATTCATCATAAGTTTTAC
CTGTTTTTTCGAGCGTTCTTCTCTAAGATACTTAAAACTTCTGTATCCATGGCCAGCACGTTAGATCCTCAGTTACAAGACGCGCTCAAGGAATACCTAATCAGTAGAGGTGTTGAAGAAAGCCTGACCAATTTCCTTCTCATTCACCTGCATAAAAATGAGCAAGGTCAGTATTTGAATTGGTTGCAAAAGGTCGAATCTTCGATAGCAAAAAGACAACCAAAACAACTTTAACGACTTCATATCAAACACTGTATGCCTGAATTTCATACGGGAAGAGAAGGCCCAACTTTAAGTGCAGGTTTTGATCATTTAGATTGATTGCTAAATAAGAGCTTTCTTTCATTCTGCTTGCATTTTTTTGTTTCTGCAAGTGTATCCTGTTTCTTGCACATGGAACATTCAGCAAGATCTGGTTGTGAACAAAACAAAACTGTGGTACTCATTTAGTCTGAGCCAAGGGGAGAAATTCAATGGAAA

mRNA sequence

CTGAGAGGTCGCACTGGGAGAAGAAGGGACAGTTGAAGAATGGGGAGGGCTAATCAAATATTTCGCAAGGCCCGTAAAGCGCTCCATGATCTTGACCTCCTCAAGATCTTGCAATCCGAGATAGCCCACGAGCTTTCTTCAACCCGATTTCTGAACCATGAACATAATAATGGCAGTTCCAGCGATTTCGCTGTGGAATATGACTCGCCCATGTCCCGAGACGTGGTGTTGCGGCGAAAATTGGAATCGGGCGAGGAGGTCGCGATTTCTGCTCTACCGGGTCGTTTCAGATTTGGACACGAAGGGGCTTTTCCGAGGGAGATTTTGATGAAGATTTGTGTGAGTAAGCCTGGAGTTAGCTCTCTTTTGCAGTTTGATTGTGGGGTTTCAGAGGATGGTCATAGTGGGTCTCCTTTCAAAATCTACAATGCCTATTATCTTCAATCTTCTGCTTGTTTGGGACCTTCTGTTTATAGAGGCCCTTTGTTCAGCACGTTAGATCCTCAGTTACAAGACGCGCTCAAGGAATACCTAATCAGTAGAGGTGTTGAAGAAAGCCTGACCAATTTCCTTCTCATTCACCTGCATAAAAATGAGCAAGGTCAGTATTTGAATTGGTTGCAAAAGGTCGAATCTTCGATAGCAAAAAGACAACCAAAACAACTTTAACGACTTCATATCAAACACTGTATGCCTGAATTTCATACGGGAAGAGAAGGCCCAACTTTAAGTGCAGGTTTTGATCATTTAGATTGATTGCTAAATAAGAGCTTTCTTTCATTCTGCTTGCATTTTTTTGTTTCTGCAAGTGTATCCTGTTTCTTGCACATGGAACATTCAGCAAGATCTGGTTGTGAACAAAACAAAACTGTGGTACTCATTTAGTCTGAGCCAAGGGGAGAAATTCAATGGAAA

Coding sequence (CDS)

ATGGGGAGGGCTAATCAAATATTTCGCAAGGCCCGTAAAGCGCTCCATGATCTTGACCTCCTCAAGATCTTGCAATCCGAGATAGCCCACGAGCTTTCTTCAACCCGATTTCTGAACCATGAACATAATAATGGCAGTTCCAGCGATTTCGCTGTGGAATATGACTCGCCCATGTCCCGAGACGTGGTGTTGCGGCGAAAATTGGAATCGGGCGAGGAGGTCGCGATTTCTGCTCTACCGGGTCGTTTCAGATTTGGACACGAAGGGGCTTTTCCGAGGGAGATTTTGATGAAGATTTGTGTGAGTAAGCCTGGAGTTAGCTCTCTTTTGCAGTTTGATTGTGGGGTTTCAGAGGATGGTCATAGTGGGTCTCCTTTCAAAATCTACAATGCCTATTATCTTCAATCTTCTGCTTGTTTGGGACCTTCTGTTTATAGAGGCCCTTTGTTCAGCACGTTAGATCCTCAGTTACAAGACGCGCTCAAGGAATACCTAATCAGTAGAGGTGTTGAAGAAAGCCTGACCAATTTCCTTCTCATTCACCTGCATAAAAATGAGCAAGGTCAGTATTTGAATTGGTTGCAAAAGGTCGAATCTTCGATAGCAAAAAGACAACCAAAACAACTTTAA

Protein sequence

MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESSIAKRQPKQL
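
The protein sequence is the conceptual translation of the CDS above. As a sketch only, the relationship can be checked with a small translator built on the standard genetic code (NCBI translation table 1); that this table applies to this nuclear gene is an assumption, and the helper name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G at each position; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGGGGAGGGCTAATCAAATATTTCGCAAG"))
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence is a quick consistency check on the annotation.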
Homology
BLAST of ClCG04G003650 vs. NCBI nr
Match: XP_023544221.1 (uncharacterized protein LOC111803860 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 349.7 bits (896), Expect = 1.7e-92
Identity = 179/209 (85.65%), Positives = 187/209 (89.47%), Query Frame = 0

Query: 1   MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSR 60
           M RANQIFRKARKALHDLDLLKILQSEI HELSST F NHE   GSSSDFAVE+DSP SR
Sbjct: 1   MPRANQIFRKARKALHDLDLLKILQSEINHELSSTPFQNHE-QKGSSSDFAVEHDSPKSR 60

Query: 61  DVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKICVSKPGVSSLLQFDCGVSEDG 120
           DVVLRRKLESGEE+AISAL G   FG EGAF REILMKICVSKPGVSSLLQFDCGVSEDG
Sbjct: 61  DVVLRRKLESGEEIAISALSGPLIFGREGAFSREILMKICVSKPGVSSLLQFDCGVSEDG 120

Query: 121 HSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLI 180
           H GSPFKIYNAYYLQSSACL PSVYRGPLFS+LDP+LQ ALKE+LISRGVEESLT+FL+I
Sbjct: 121 HGGSPFKIYNAYYLQSSACLSPSVYRGPLFSSLDPELQKALKEFLISRGVEESLTDFLII 180

Query: 181 HLHKNEQGQYLNWLQKVESSIAKRQPKQL 210
           HLHK EQGQYLNWLQ VES IAKRQ  +L
Sbjct: 181 HLHKKEQGQYLNWLQNVESLIAKRQQNEL 208

BLAST of ClCG04G003650 vs. NCBI nr
Match: KAG6603196.1 (hypothetical protein SDJN03_03805, partial [Cucurbita argyrosperma subsp. sororia] >KAG7033506.1 hypothetical protein SDJN02_03228 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 345.9 bits (886), Expect = 2.4e-91
Identity = 177/209 (84.69%), Positives = 185/209 (88.52%), Query Frame = 0

Query: 1   MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSR 60
           M RANQIFRKARKALHDLDLLKILQSEI HELSST F NHE   GSSSDFAVE+DSP SR
Sbjct: 1   MPRANQIFRKARKALHDLDLLKILQSEINHELSSTPFQNHE-QKGSSSDFAVEHDSPKSR 60

Query: 61  DVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKICVSKPGVSSLLQFDCGVSEDG 120
           DVVLRRKLESGEE+AISAL G   FG EGAF REILMKICVSKPGVSSLLQFDCGVS+DG
Sbjct: 61  DVVLRRKLESGEEIAISALSGPLMFGREGAFSREILMKICVSKPGVSSLLQFDCGVSDDG 120

Query: 121 HSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLI 180
           H GSPFKIYNAYYLQSS CL PSVYRGPLFS+LDP+LQ ALK +LISRGVEESLT+FLLI
Sbjct: 121 HGGSPFKIYNAYYLQSSGCLSPSVYRGPLFSSLDPELQKALKGFLISRGVEESLTDFLLI 180

Query: 181 HLHKNEQGQYLNWLQKVESSIAKRQPKQL 210
           HLHK EQGQYLNWLQ VES IAKRQ  +L
Sbjct: 181 HLHKKEQGQYLNWLQNVESLIAKRQQNEL 208

BLAST of ClCG04G003650 vs. NCBI nr
Match: XP_022967898.1 (uncharacterized protein LOC111467274 [Cucurbita maxima])

HSP 1 Score: 345.5 bits (885), Expect = 3.2e-91
Identity = 178/209 (85.17%), Positives = 186/209 (89.00%), Query Frame = 0

Query: 1   MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSR 60
           M RANQIFRKARKALHDLDLLKILQSEI HELSST F NHE   GSSSDFAVE+DS  SR
Sbjct: 17  MPRANQIFRKARKALHDLDLLKILQSEINHELSSTSFQNHE-QKGSSSDFAVEHDSLKSR 76

Query: 61  DVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKICVSKPGVSSLLQFDCGVSEDG 120
           DVVLRRKLESGEE+AISAL G   FG EGAF REILMKICVSKPGV+SLLQFDCGVSEDG
Sbjct: 77  DVVLRRKLESGEEIAISALSGPLIFGREGAFSREILMKICVSKPGVNSLLQFDCGVSEDG 136

Query: 121 HSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLI 180
           H GSPFKIYNAYYLQSSACLGPSVYRGPLFS+LDP+LQ ALK +LISRGVEESLT+FLLI
Sbjct: 137 HGGSPFKIYNAYYLQSSACLGPSVYRGPLFSSLDPELQKALKGFLISRGVEESLTDFLLI 196

Query: 181 HLHKNEQGQYLNWLQKVESSIAKRQPKQL 210
           HLHK EQGQYLNWLQ VES IAKRQ  +L
Sbjct: 197 HLHKKEQGQYLNWLQNVESLIAKRQQNEL 224

BLAST of ClCG04G003650 vs. NCBI nr
Match: XP_022928687.1 (uncharacterized protein LOC111435528 [Cucurbita moschata])

HSP 1 Score: 344.7 bits (883), Expect = 5.4e-91
Identity = 178/209 (85.17%), Positives = 185/209 (88.52%), Query Frame = 0

Query: 1   MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSR 60
           M RANQIFRKARKALHDLDLLKILQSEI HELSST F NHE   GSSSDFAVE+DS  SR
Sbjct: 1   MPRANQIFRKARKALHDLDLLKILQSEINHELSSTPFQNHE-QKGSSSDFAVEHDSLKSR 60

Query: 61  DVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKICVSKPGVSSLLQFDCGVSEDG 120
           DVVLRRKLESGEE+AISAL G   FG EGAF REILMKICVSKPGVSSLLQFDCGVSEDG
Sbjct: 61  DVVLRRKLESGEEIAISALSGPLMFGREGAFSREILMKICVSKPGVSSLLQFDCGVSEDG 120

Query: 121 HSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLI 180
           H GSPFKIYNAYYLQSSACL PSVYRGPLFS+LDP+LQ ALK +LISRGVEESLT+FLLI
Sbjct: 121 HGGSPFKIYNAYYLQSSACLSPSVYRGPLFSSLDPELQKALKGFLISRGVEESLTDFLLI 180

Query: 181 HLHKNEQGQYLNWLQKVESSIAKRQPKQL 210
           HLHK EQGQYLNWLQ VES IAKRQ  +L
Sbjct: 181 HLHKKEQGQYLNWLQNVESLIAKRQQNEL 208

BLAST of ClCG04G003650 vs. NCBI nr
Match: XP_022153881.1 (mitochondrial acidic protein mam33 [Momordica charantia])

HSP 1 Score: 337.4 bits (864), Expect = 8.7e-89
Identity = 170/209 (81.34%), Positives = 184/209 (88.04%), Query Frame = 0

Query: 1   MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSR 60
           M RANQIFRKARKALHDLDLLKILQSEI HELSSTRF + E  +G S DF VE+DSP S+
Sbjct: 1   MPRANQIFRKARKALHDLDLLKILQSEITHELSSTRFQSDE--SGCSRDFVVEHDSPKSQ 60

Query: 61  DVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKICVSKPGVSSLLQFDCGVSEDG 120
           DVVLRRKLESGEEVA+SAL G  RFG EGAFPREILMKICVSKPGV S+LQFDCGVSED 
Sbjct: 61  DVVLRRKLESGEEVAVSALSGPLRFGREGAFPREILMKICVSKPGVGSILQFDCGVSEDH 120

Query: 121 HSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLI 180
           H GSPFKIYNAYYLQSSA LG SVYRGP FS+LDP+LQDALK+YLISRGVEESLTNFLL+
Sbjct: 121 HGGSPFKIYNAYYLQSSANLGSSVYRGPFFSSLDPRLQDALKDYLISRGVEESLTNFLLL 180

Query: 181 HLHKNEQGQYLNWLQKVESSIAKRQPKQL 210
           H+HK EQGQYLNWLQ +ES +AK QP +L
Sbjct: 181 HVHKKEQGQYLNWLQNLESLVAKGQPNEL 207

BLAST of ClCG04G003650 vs. ExPASy Swiss-Prot
Match: P40513 (Mitochondrial acidic protein MAM33 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=MAM33 PE=1 SV=1)

HSP 1 Score: 60.5 bits (145), Expect = 2.7e-08
Identity = 37/110 (33.64%), Positives = 60/110 (54.55%), Query Frame = 0

Query: 95  ILMKICVSKPGVSSLLQFDCGVSEDG----HSGSPFKIYNAYYLQSSAC--LGPSVYRGP 154
           ++ K   S+P VS  L  +    ++G     S +P+   +A   QS+        VY GP
Sbjct: 156 VISKESASEPAVSFELLMNL---QEGSFYVDSATPYPSVDAALNQSAEAEITRELVYHGP 215

Query: 155 LFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVE 199
            FS LD +LQ++L+ YL SRGV E L +F+  +    E  +Y++WL+K++
Sbjct: 216 PFSNLDEELQESLEAYLESRGVNEELASFISAYSEFKENNEYISWLEKMK 262

BLAST of ClCG04G003650 vs. ExPASy Swiss-Prot
Match: O94675 (Mitochondrial acidic protein mam33 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=SPBC776.07 PE=3 SV=1)

HSP 1 Score: 60.1 bits (144), Expect = 3.5e-08
Identity = 42/125 (33.60%), Positives = 60/125 (48.00%), Query Frame = 0

Query: 88  EGAFPREILMK-----ICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGP 147
           E  FP E L +     I +SKPG  +L+ F+    +DG     F I N Y+ +    L  
Sbjct: 146 EDEFPNEDLGRFQPCTIEISKPGNGALV-FEATALDDG-----FDIENIYFSKDIDMLTS 205

Query: 148 ----------SVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLN 198
                       Y GP F  LDP+LQD    YL  R ++ESL++F++      E  +Y+N
Sbjct: 206 DSLEAEWKRRKQYLGPSFKELDPELQDLFHSYLEERKIDESLSSFIVSFGLTKELKEYIN 264

BLAST of ClCG04G003650 vs. ExPASy TrEMBL
Match: A0A6J1HWH3 (uncharacterized protein LOC111467274 OS=Cucurbita maxima OX=3661 GN=LOC111467274 PE=4 SV=1)

HSP 1 Score: 345.5 bits (885), Expect = 1.5e-91
Identity = 178/209 (85.17%), Positives = 186/209 (89.00%), Query Frame = 0

Query: 1   MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSR 60
           M RANQIFRKARKALHDLDLLKILQSEI HELSST F NHE   GSSSDFAVE+DS  SR
Sbjct: 17  MPRANQIFRKARKALHDLDLLKILQSEINHELSSTSFQNHE-QKGSSSDFAVEHDSLKSR 76

Query: 61  DVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKICVSKPGVSSLLQFDCGVSEDG 120
           DVVLRRKLESGEE+AISAL G   FG EGAF REILMKICVSKPGV+SLLQFDCGVSEDG
Sbjct: 77  DVVLRRKLESGEEIAISALSGPLIFGREGAFSREILMKICVSKPGVNSLLQFDCGVSEDG 136

Query: 121 HSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLI 180
           H GSPFKIYNAYYLQSSACLGPSVYRGPLFS+LDP+LQ ALK +LISRGVEESLT+FLLI
Sbjct: 137 HGGSPFKIYNAYYLQSSACLGPSVYRGPLFSSLDPELQKALKGFLISRGVEESLTDFLLI 196

Query: 181 HLHKNEQGQYLNWLQKVESSIAKRQPKQL 210
           HLHK EQGQYLNWLQ VES IAKRQ  +L
Sbjct: 197 HLHKKEQGQYLNWLQNVESLIAKRQQNEL 224

BLAST of ClCG04G003650 vs. ExPASy TrEMBL
Match: A0A6J1EKM2 (uncharacterized protein LOC111435528 OS=Cucurbita moschata OX=3662 GN=LOC111435528 PE=4 SV=1)

HSP 1 Score: 344.7 bits (883), Expect = 2.6e-91
Identity = 178/209 (85.17%), Positives = 185/209 (88.52%), Query Frame = 0

Query: 1   MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSR 60
           M RANQIFRKARKALHDLDLLKILQSEI HELSST F NHE   GSSSDFAVE+DS  SR
Sbjct: 1   MPRANQIFRKARKALHDLDLLKILQSEINHELSSTPFQNHE-QKGSSSDFAVEHDSLKSR 60

Query: 61  DVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKICVSKPGVSSLLQFDCGVSEDG 120
           DVVLRRKLESGEE+AISAL G   FG EGAF REILMKICVSKPGVSSLLQFDCGVSEDG
Sbjct: 61  DVVLRRKLESGEEIAISALSGPLMFGREGAFSREILMKICVSKPGVSSLLQFDCGVSEDG 120

Query: 121 HSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLI 180
           H GSPFKIYNAYYLQSSACL PSVYRGPLFS+LDP+LQ ALK +LISRGVEESLT+FLLI
Sbjct: 121 HGGSPFKIYNAYYLQSSACLSPSVYRGPLFSSLDPELQKALKGFLISRGVEESLTDFLLI 180

Query: 181 HLHKNEQGQYLNWLQKVESSIAKRQPKQL 210
           HLHK EQGQYLNWLQ VES IAKRQ  +L
Sbjct: 181 HLHKKEQGQYLNWLQNVESLIAKRQQNEL 208

BLAST of ClCG04G003650 vs. ExPASy TrEMBL
Match: A0A6J1DM07 (mitochondrial acidic protein mam33 OS=Momordica charantia OX=3673 GN=LOC111021294 PE=4 SV=1)

HSP 1 Score: 337.4 bits (864), Expect = 4.2e-89
Identity = 170/209 (81.34%), Positives = 184/209 (88.04%), Query Frame = 0

Query: 1   MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSR 60
           M RANQIFRKARKALHDLDLLKILQSEI HELSSTRF + E  +G S DF VE+DSP S+
Sbjct: 1   MPRANQIFRKARKALHDLDLLKILQSEITHELSSTRFQSDE--SGCSRDFVVEHDSPKSQ 60

Query: 61  DVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKICVSKPGVSSLLQFDCGVSEDG 120
           DVVLRRKLESGEEVA+SAL G  RFG EGAFPREILMKICVSKPGV S+LQFDCGVSED 
Sbjct: 61  DVVLRRKLESGEEVAVSALSGPLRFGREGAFPREILMKICVSKPGVGSILQFDCGVSEDH 120

Query: 121 HSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLI 180
           H GSPFKIYNAYYLQSSA LG SVYRGP FS+LDP+LQDALK+YLISRGVEESLTNFLL+
Sbjct: 121 HGGSPFKIYNAYYLQSSANLGSSVYRGPFFSSLDPRLQDALKDYLISRGVEESLTNFLLL 180

Query: 181 HLHKNEQGQYLNWLQKVESSIAKRQPKQL 210
           H+HK EQGQYLNWLQ +ES +AK QP +L
Sbjct: 181 HVHKKEQGQYLNWLQNLESLVAKGQPNEL 207

BLAST of ClCG04G003650 vs. ExPASy TrEMBL
Match: A0A1S3BK83 (uncharacterized protein LOC103490527 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103490527 PE=4 SV=1)

HSP 1 Score: 307.4 bits (786), Expect = 4.7e-80
Identity = 157/196 (80.10%), Positives = 171/196 (87.24%), Query Frame = 0

Query: 1   MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSR 60
           M R  Q+FRKARK   DL LL+ILQSEIAHELSST   N+E NN SSS F VE+DS  S+
Sbjct: 1   MVRTTQLFRKARKTFQDLRLLQILQSEIAHELSSTPCQNYE-NNASSSHFTVEHDSLKSQ 60

Query: 61  DVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKICVSKPGVSSLLQFDCGVSEDG 120
           DVVLRRK++SGEEV ISAL G  RFG++GAFPREILMKICVSKPGVSSLLQFDCGVSEDG
Sbjct: 61  DVVLRRKMDSGEEVVISALLGPLRFGYDGAFPREILMKICVSKPGVSSLLQFDCGVSEDG 120

Query: 121 HSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLI 180
           H GSPFK+YNAYYL+SS CLGP VYRGP FS+LDP+LQDALKEYLISRGVEESLTNFLLI
Sbjct: 121 HGGSPFKLYNAYYLRSSDCLGP-VYRGPSFSSLDPRLQDALKEYLISRGVEESLTNFLLI 180

Query: 181 HLHKNEQGQYLNWLQK 197
           HLHK EQGQYLNWL+K
Sbjct: 181 HLHKKEQGQYLNWLKK 194

BLAST of ClCG04G003650 vs. ExPASy TrEMBL
Match: A0A5N6RKE9 (Uncharacterized protein OS=Carpinus fangiana OX=176857 GN=FH972_016789 PE=4 SV=1)

HSP 1 Score: 274.6 bits (701), Expect = 3.3e-70
Identity = 131/202 (64.85%), Positives = 164/202 (81.19%), Query Frame = 0

Query: 4   ANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVV 63
           A+ + R+ RK L D DLLK+LQSEI HELSS RFLN++  +GS  DF VE+DS  S+DVV
Sbjct: 4   ASPVLRQCRKVLQDSDLLKVLQSEITHELSSNRFLNNQ--SGSLGDFLVEWDSSQSQDVV 63

Query: 64  LRRKLESGEEVAISALPGRFRFGHEGAFPREILMKICVSKPGVSSLLQFDCGVSEDGHSG 123
           LRRK E GEEV +SA+ G F +G E  FPR +LMK+C+ KPG+ S+LQFDCGVS+ G++G
Sbjct: 64  LRRKCELGEEVVVSAVLGPFTYGRESVFPRGVLMKVCLKKPGLGSILQFDCGVSDRGNNG 123

Query: 124 SPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLH 183
           S F I+NA Y+QSSA LGPS YRGP+FS+LDPQLQDALKEYL+S+G+ E+LTNFLL+HLH
Sbjct: 124 SEFNIHNACYIQSSARLGPSAYRGPVFSSLDPQLQDALKEYLVSKGIGENLTNFLLLHLH 183

Query: 184 KNEQGQYLNWLQKVESSIAKRQ 206
           K EQGQY+NWL K+ES +AK +
Sbjct: 184 KKEQGQYVNWLHKLESLVAKSE 203

BLAST of ClCG04G003650 vs. TAIR 10
Match: AT2G41600.5 (Mitochondrial glycoprotein family protein )

HSP 1 Score: 193.0 bits (489), Expect = 2.5e-49
Identity = 101/211 (47.87%), Positives = 140/211 (66.35%), Query Frame = 0

Query: 1   MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSR 60
           M + N + ++  KA+ + DLLKILQSEI HE+S  RF   E   GS  DF +++DSP S+
Sbjct: 1   MRKLNPLLKRGLKAIENGDLLKILQSEIRHEISHPRFQGVE--TGSLGDFKLDWDSPESQ 60

Query: 61  DVVLRRKLESGEEVAISAL--PGRFRFGHEGAFPREILMKICVSKPGVSSLLQFDCGVSE 120
           D+VL+R+ +SGE+V +SAL  P       +  FPRE   K+C+ KPG+SS+LQF C V E
Sbjct: 61  DIVLKRQFDSGEKVVVSALLQPEPIELEDDLVFPREAHAKVCIKKPGLSSILQFHCRVYE 120

Query: 121 DGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFL 180
            G   S F I +AY+++S      S Y    FS +DP+L  AL++YLIS+GV E LTNFL
Sbjct: 121 SGSGSSHFDIESAYFIRSFVSAPSSTYGDHFFSQVDPKLHSALEQYLISKGVSEGLTNFL 180

Query: 181 LIHLHKNEQGQYLNWLQKVESSIAKRQPKQL 210
           L HL+K EQ QY+NWL+++ES+++   PK L
Sbjct: 181 LCHLNKKEQDQYVNWLRRLESTMS-HSPKPL 208

BLAST of ClCG04G003650 vs. TAIR 10
Match: AT2G41600.3 (Mitochondrial glycoprotein family protein )

HSP 1 Score: 192.6 bits (488), Expect = 3.2e-49
Identity = 98/204 (48.04%), Positives = 137/204 (67.16%), Query Frame = 0

Query: 1   MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSR 60
           M + N + ++  KA+ + DLLKILQSEI HE+S  RF   E   GS  DF +++DSP S+
Sbjct: 1   MRKLNPLLKRGLKAIENGDLLKILQSEIRHEISHPRFQGVE--TGSLGDFKLDWDSPESQ 60

Query: 61  DVVLRRKLESGEEVAISAL--PGRFRFGHEGAFPREILMKICVSKPGVSSLLQFDCGVSE 120
           D+VL+R+ +SGE+V +SAL  P       +  FPRE   K+C+ KPG+SS+LQF C V E
Sbjct: 61  DIVLKRQFDSGEKVVVSALLQPEPIELEDDLVFPREAHAKVCIKKPGLSSILQFHCRVYE 120

Query: 121 DGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFL 180
            G   S F I +AY+++S      S Y    FS +DP+L  AL++YLIS+GV E LTNFL
Sbjct: 121 SGSGSSHFDIESAYFIRSFVSAPSSTYGDHFFSQVDPKLHSALEQYLISKGVSEGLTNFL 180

Query: 181 LIHLHKNEQGQYLNWLQKVESSIA 203
           L HL+K EQ QY+NWL+++ES+++
Sbjct: 181 LCHLNKKEQDQYVNWLRRLESTMS 202

BLAST of ClCG04G003650 vs. TAIR 10
Match: AT2G41600.2 (Mitochondrial glycoprotein family protein )

HSP 1 Score: 127.1 bits (318), Expect = 1.7e-29
Identity = 69/163 (42.33%), Positives = 97/163 (59.51%), Query Frame = 0

Query: 1   MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSR 60
           M + N + ++  KA+ + DLLKILQSEI HE+S  RF   E   GS  DF +++DSP S+
Sbjct: 1   MRKLNPLLKRGLKAIENGDLLKILQSEIRHEISHPRFQGVE--TGSLGDFKLDWDSPESQ 60

Query: 61  DVVLRRKLESGEEVAISAL--PGRFRFGHEGAFPREILMKICVSKPGVSSLLQFDCGVSE 120
           D+VL+R+ +SGE+V +SAL  P       +  FPRE   K+C+ KPG+SS+LQF C V E
Sbjct: 61  DIVLKRQFDSGEKVVVSALLQPEPIELEDDLVFPREAHAKVCIKKPGLSSILQFHCRVYE 120

Query: 121 DGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDAL 162
            G   S F I +AY+++S      S Y    F +   Q   A+
Sbjct: 121 SGSGSSHFDIESAYFIRSFVSAPSSTYGDHFFRSQTTQCTGAI 161

BLAST of ClCG04G003650 vs. TAIR 10
Match: AT2G41600.1 (Mitochondrial glycoprotein family protein )

HSP 1 Score: 124.8 bits (312), Expect = 8.2e-29
Identity = 67/152 (44.08%), Positives = 93/152 (61.18%), Query Frame = 0

Query: 1   MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSR 60
           M + N + ++  KA+ + DLLKILQSEI HE+S  RF   E   GS  DF +++DSP S+
Sbjct: 1   MRKLNPLLKRGLKAIENGDLLKILQSEIRHEISHPRFQGVE--TGSLGDFKLDWDSPESQ 60

Query: 61  DVVLRRKLESGEEVAISAL--PGRFRFGHEGAFPREILMKICVSKPGVSSLLQFDCGVSE 120
           D+VL+R+ +SGE+V +SAL  P       +  FPRE   K+C+ KPG+SS+LQF C V E
Sbjct: 61  DIVLKRQFDSGEKVVVSALLQPEPIELEDDLVFPREAHAKVCIKKPGLSSILQFHCRVYE 120

Query: 121 DGHSGSPFKIYNAYYLQSSACLGPSVYRGPLF 151
            G   S F I +AY+++S      S Y    F
Sbjct: 121 SGSGSSHFDIESAYFIRSFVSAPSSTYGDHFF 150

BLAST of ClCG04G003650 vs. TAIR 10
Match: AT2G41600.4 (Mitochondrial glycoprotein family protein )

HSP 1 Score: 124.8 bits (312), Expect = 8.2e-29
Identity = 67/152 (44.08%), Positives = 93/152 (61.18%), Query Frame = 0

Query: 1   MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSR 60
           M + N + ++  KA+ + DLLKILQSEI HE+S  RF   E   GS  DF +++DSP S+
Sbjct: 1   MRKLNPLLKRGLKAIENGDLLKILQSEIRHEISHPRFQGVE--TGSLGDFKLDWDSPESQ 60

Query: 61  DVVLRRKLESGEEVAISAL--PGRFRFGHEGAFPREILMKICVSKPGVSSLLQFDCGVSE 120
           D+VL+R+ +SGE+V +SAL  P       +  FPRE   K+C+ KPG+SS+LQF C V E
Sbjct: 61  DIVLKRQFDSGEKVVVSALLQPEPIELEDDLVFPREAHAKVCIKKPGLSSILQFHCRVYE 120

Query: 121 DGHSGSPFKIYNAYYLQSSACLGPSVYRGPLF 151
            G   S F I +AY+++S      S Y    F
Sbjct: 121 SGSGSSHFDIESAYFIRSFVSAPSSTYGDHFF 150
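
Each HSP header above reports identity as a ratio of matching residues to alignment length, followed by a percentage; the second ratio on the same line counts similar residues in the same way. The sketch below shows how those percentages follow from the counts. The helper name `percent_of_alignment` is illustrative and is not part of any BLAST tool.

```python
def percent_of_alignment(count, alignment_length):
    """Return count / alignment_length as a percentage string with two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return f"{100 * count / alignment_length:.2f}"


# Counts taken from the first NCBI nr hit (XP_023544221.1).
print(percent_of_alignment(179, 209))  # identical residues
print(percent_of_alignment(187, 209))  # similar residues
```

The two calls reproduce the 85.65% and 89.47% reported for that hit.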

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value  Identity (%)  Description
XP_023544221.1  1.7e-92  85.65         uncharacterized protein LOC111803860 [Cucurbita pepo subsp. pepo]
KAG6603196.1    2.4e-91  84.69         hypothetical protein SDJN03_03805, partial [Cucurbita argyrosperma subsp. sorori...
XP_022967898.1  3.2e-91  85.17         uncharacterized protein LOC111467274 [Cucurbita maxima]
XP_022928687.1  5.4e-91  85.17         uncharacterized protein LOC111435528 [Cucurbita moschata]
XP_022153881.1  8.7e-89  81.34         mitochondrial acidic protein mam33 [Momordica charantia]

ExPASy Swiss-Prot
Match Name      E-value  Identity (%)  Description
P40513          2.7e-08  33.64         Mitochondrial acidic protein MAM33 OS=Saccharomyces cerevisiae (strain ATCC 2045...
O94675          3.5e-08  33.60         Mitochondrial acidic protein mam33 OS=Schizosaccharomyces pombe (strain 972 / AT...

ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A6J1HWH3      1.5e-91  85.17         uncharacterized protein LOC111467274 OS=Cucurbita maxima OX=3661 GN=LOC111467274...
A0A6J1EKM2      2.6e-91  85.17         uncharacterized protein LOC111435528 OS=Cucurbita moschata OX=3662 GN=LOC1114355...
A0A6J1DM07      4.2e-89  81.34         mitochondrial acidic protein mam33 OS=Momordica charantia OX=3673 GN=LOC11102129...
A0A1S3BK83      4.7e-80  80.10         uncharacterized protein LOC103490527 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A5N6RKE9      3.3e-70  64.85         Uncharacterized protein OS=Carpinus fangiana OX=176857 GN=FH972_016789 PE=4 SV=1

TAIR 10
Match Name      E-value  Identity (%)  Description
AT2G41600.5     2.5e-49  47.87         Mitochondrial glycoprotein family protein
AT2G41600.3     3.2e-49  48.04         Mitochondrial glycoprotein family protein
AT2G41600.2     1.7e-29  42.33         Mitochondrial glycoprotein family protein
AT2G41600.1     8.2e-29  44.08         Mitochondrial glycoprotein family protein
AT2G41600.4     8.2e-29  44.08         Mitochondrial glycoprotein family protein

InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR Term   IPR Description                         Source       Source Term    Source Description                         Alignment
IPR036561  Mitochondrial glycoprotein superfamily  GENE3D       3.10.280.10    Mitochondrial glycoprotein                 coord: 15..201; e-value: 4.8E-26; score: 93.8
IPR036561  Mitochondrial glycoprotein superfamily  SUPERFAMILY  54529          Mitochondrial glycoprotein MAM33-like      coord: 17..199
IPR003428  Mitochondrial glycoprotein              PFAM         PF02330        MAM33                                      coord: 97..199; e-value: 1.8E-16; score: 60.7
IPR003428  Mitochondrial glycoprotein              PANTHER      PTHR10826      COMPLEMENT COMPONENT 1                     coord: 2..203
None       No IPR available                        PANTHER      PTHR10826:SF7  MITOCHONDRIAL GLYCOPROTEIN FAMILY PROTEIN  coord: 2..203

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
ClCG04G003650.1  ClCG04G003650.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0005759      mitochondrial matrix