Clc04G04020 (gene) Watermelon (cordophanus) v2

Overview
NameClc04G04020
Typegene
OrganismCitrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
DescriptionMitochondrial glycoprotein
LocationClcChr04: 13931777 .. 13936372 (+)
RNA-Seq ExpressionClc04G04020
SyntenyClc04G04020
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AATAACCTATCAAACCTTATTTGGCGGTCATTTGTCATTTTAAACCTTTAAAATAGACTTTTTATACCATTATTTCCGGACGTTATAGACTGAGAGGTCGCACTGGGAGAAGAAGGGACAGTTGAAGAATGGGGAGGGCTAATCAAATATTTCGCAAGGCCCGTAAAGCGCTCCATGATCTTGACCTCCTCAAGATCTTGCAATCCGAGATAGCCCACGAGCTTTCTTCAACCCGATTTCTGGTCCTCTTTAGTTTCCTCCATTGAAACCCTTTTCAATCAATCTCTTCATCATCTCTGATTTTCCATTTCTCTTTTTCCCATTCTCAATCGCAGAACCATGAACATAATAATGGCAGTTCCAGCGATTTCGCTGTGGAATATGACTCGCCCATGTCCCGAGACGTGGTGTTGCGGCGAAAATTGGAATCGGGCGAGGAGGTCGCGATTTCTGCTCTACCGGGTCGTTTCAGATTTGGACACGAAGGGGCTTTTCCGAGGGAGATTTTGATGAAGATTTGTGTGAGTAAGCCTGGAGTTAGCTCTCTTTTGCAGTTTGATTGTGGGGTTTCAGAGGATGGTCATAGTGGGTATCCTTTCAAAATCTACAATGCCTATTATCTTCAATCTTCTGCTTGTTTGGGACCTTCTGTTTATAGAGGCCCTTTGTTCAGGTACTGCTTTTTGGCAACTCTAATCGTACCAAGAATTAACATTGAGCATAGCCATTTTTGTTTACTGATTTCAAACTGGAACTGTGACCTGGCTATATAATTTTCTTTCCTTGAGATTGGTCTCTTGGTTATATTTTGAAACTAGTGGTATGACATGAAATGATGAATAGGTATCACTTTTCACTTAATTTCTTTGACACTTTGGAGACCTCAGTCGGTGAAAGTTGTAGGCTGAGTTTTATCTAGGACAAGTTGTTGATCTTCGTCTCTAGGGTAGAATCAGGGGGCCTCAGACGTGGTAGTTACCAGGCCTGTTTTTACTTATTTCTTGTTATTTGGGACCCTATTCATCATCCATTCCTCGTCTACCCTTTGCGCCACCTTCCTCATAGTGCTAGGCTGTCACTGCACAACTTATGGGAATCTTATCGCCAAAGTAACCTAATGTGTGTACCAGTAACCATTTTACTTATCCTTTGAGAAGAACTTATGAAGTTCAAGTTGCAAAGTATTTGTGATGAGAATACGATTGGTGTGGTTTTTTGTTCTAAATGGCAGATTCAGGCTTTAGGATCCTGGCCCCCAATTCAATAATTTATGAAGTTCTGTAAAGTTCATCCTTCCTTTAGGACAATTTTTTAGTATTAACTAAATAATATTTATGATTTACAAAAGCTTTTTCCCCCTTGTTTTTGAAAGGCAATCTATAATTACCCATAGCCTCGTAATGGGAGGATATATTAACTTCTCATCCACTTCCTCTTGTACATTCCTTTTTTGGATAGAAAGATATTTCTTATCCAAAAAAAGCATTTACTTTTCAAAGCGCCTGTTTCGGTCTGCAAGAGTATTGAGAGGACTATGAGAGACTTCCTCTAGGAAGGGGTAGAGGAAGGGAATGGTTCTCATTCGGTTAGCTAGGATGTTACGGGGAAACCTATGAATTAGGGGGTTGGAACTTGAAAATTTAAGGTTACACAACAAAGCTTTGTTGGCTAAGTGGCTTTGGTGGTTTGCCCTTGATTTTGAATCCTTGTGGTGGAAGATTATTGGGAGCAAGCCAAGCATGGTCCCCATCCTTATGAGTGGGTGGTGAAAGGGGTTAAAGGTATACACCAAAACCTGTGGAGAGATATTTCTCTCATCTCCCTTCTTTTTCCCATCTTGTTTTCTGTGTGGTGGGAGGCGGTAAGGAAACATATTTCTGGGAGGATCTTTAGGTAAGGGATAGACCCTTCTCCTCTGTTTCCTTGGTTATATCATCTATCCTCCTTTAAAAATTGTTGTGTGTTCGATTTTTTGGTTTGGTCAGGGAACTCGGTCTCTTTCTCCTATGGTTTTCGTTTGGTCTCTTCTTTCCTTGATTGAGGGTTTTGATTTTAGGTTTGGGAGAAAGGATGTTTGGGTGTGGAGTCCTACCCCTATGGAGGGCTTCTCTTGTAATTCTTTCTTTAGGTTTTTACTTGATCCCTCTCCCACTGTTGAGTCGGTCTTTGATGTTTTATGGAGGATTAAGATTCCAAAGAAAGTTACATTCCTTTCTTGGCAAGTTTTGCTTGGGTCTGTGAACACCATAGATAGGATTGTGAGAAAGATGCCTTTGCTTGTGGGCCCTTGGTGTTGTATCCTTTGTCGGAAGGTGGAGGAAAACCTAGATCACCTTCTTTGGGTGTGAAATTACTTCTTTCAGGGGTTTGATTTTGCGCTTGCTCATCAAAGGGATGTTTGTTTGATGATCGGGAGTTCCATCCAGATCCGTCTTTCAAAGAGAAAGGATGTTTTCTTTGGTTTGTGGGAGTGTGTGCGATTTTGTGGGATATTTGGGGGGAGAGGAACAACAAAGTTTTTCTTGGAGTGGAGAGAGACCCTAGCGAGGTTTGGTCCCTTGTGAGCTTTCATGTTCTCCATGGGTTTCGGTTTTGAAGACTTATTGTTATTATTCCCTAGGCATATCTTATTTAATTGGATTCCCTTCCTATGAGGGGTTTTGTGAGCTTGGTTTTTTGTATGCCTTTTATTCTTTCATTTTTTCTCAATGAAAGTAGTTGTTCCTATTAAAAGAAAAAAAAGTATTGGTTAAATTATCAATTGGGTCTGTATATTTTATAAATTTCAATTTGATCTCTATGGTTTGATTCAACTTTATAAATTGTCCCTAAACCATTGTTTACCCACCCTTTTTCTCCTATTTTTTACTCTGCCTAAACTATAGTATTAATCTAACATATTAAGCCAAATAGAAACCAACAACTCATTACTACATGAAATAGATAATACGCCACATTGGTAGAACTTAAAGGACGGATGGTGTTTGTGGGGTTTAACCAAACCATAAAGACCAAATTTTAAGCCTTAAAACTATAGGTACCTAATTGAAATTAAGCCCAAACCATAGGGACCAGATTGAAACTTTTAAAATTATTGAATCCAAATTTGTAATTTAACCAAAATTATATTACGCAATGCATCATAGTCCAGTGGGGTTGGTGACCTCCTGCTCACACGGAAAGGCAAATGTATATATTTTGCTTCGCATATGATGATTGTATCAATCAAAAAGATGGTTCATCCCTTTTCTGCAACTTGGAAGCTTAGGAGCAAAGATGGGTGGCCACTTACCTAAATGAAACAATAATCTTAATGTATCTTGACAACATCACCAGTCATGGTGGACGTAAATTGGCTTGGATAGTTACAATATCAAGAGAAGAAATAGCTGCTTCATTCTTTTCCTCTGTTTAATGCAATTTATCTGCTTGAGTTATTAAGAAGGTAAAAAAACTTAACCATCATAGGTTGGCCTACAATAAGGTCCATGAGTTTAATAAAAGGATTTAGAGGGAATGAGTTCTTGTGGCTGGACTGCACTATTTTGTAGATATTGATTCTTCTGATCATGAACACTTTTAAGTCAGATTCATCATTTTATTGTTGAGTGTGTTTAAGTTCGTATTCATGGTTGGATCTGGCTATTTCTGTGGCTAGATTTCGAGTGTTTTAGTTGTTCATTAGGTCACGAGCAGTTTTCCAATTTAGCTTTGAGAATGTATTGATTTCACCGCTTGTTTGGAGTTCTATTTAAGGTTAAAACTCACTAAGCCAAGTACGGAGGTAAAGTTGATTCTTTCCAAATTTGAATGGTAAAAACATAGTGTATAGACTAAACATAGACTTGTATGAATTATCACATCTTCCTTAATCTTAATTGACCAATTTGACTTTAGTTTCCTAGTTGATAACTGAAAATTTGTCTGTGAATGGCTTATTTTCGCTTCTATTACTTTGCTAAGCATCACGATTCATGATATTTAGCCAGTCATTCTCTGATCTTTTCAGTTTGGTTTTTGTCATTCATCATAAGTTTTACCTGTTTTTTCGAGCGTTCTTCTCTAAGATACTTAAAACTTCTGTATCCATGGCCAGCACGTTAGATCCTCAGTTACAAGACGCGCTCAAGGAATACCTAATCAGTAGAGGTGTTGAAGAAAGCCTGACCAATTTCCTTCTCATTCACCTGCATAAAAATGAGCAAGGTCAGTATTTGAATTGGTTGCAAAAGGTCGAATCTTCGATAGCAAAAAGACAACCAAAACAACTTTAACGACTTCATATCAAACACTGTATGCCTGAATTTCATACGGGAAGAGAAGGCCCAACTTTAAGTGCAGGTTTTGATCATTTAGATTGATTGCTAAATAAGAGCTTTCTTTCATTCTGCTTGCATTTTTTTGTTTCTGCAAGTGTATCCTGTTTCTTGCACATGGAACATTCAGCAAGATCTGGTTGTGAACAAAACAAAACTGTGGTACTCATTTAGTCTGAGCCAAGGGGAGAAATTCAATGGAAATGTTGTAACAACTCTTTGTGAATTTCTTGAATCATATTCTTGGTTTTATTATCGTTTTTATTGACTGTTTTTATTT

mRNA sequence

AATAACCTATCAAACCTTATTTGGCGGTCATTTGTCATTTTAAACCTTTAAAATAGACTTTTTATACCATTATTTCCGGACGTTATAGACTGAGAGGTCGCACTGGGAGAAGAAGGGACAGTTGAAGAATGGGGAGGGCTAATCAAATATTTCGCAAGGCCCGTAAAGCGCTCCATGATCTTGACCTCCTCAAGATCTTGCAATCCGAGATAGCCCACGAGCTTTCTTCAACCCGATTTCTGAACCATGAACATAATAATGGCAGTTCCAGCGATTTCGCTGTGGAATATGACTCGCCCATGTCCCGAGACGTGGTGTTGCGGCGAAAATTGGAATCGGGCGAGGAGGTCGCGATTTCTGCTCTACCGGGTCGTTTCAGATTTGGACACGAAGGGGCTTTTCCGAGGGAGATTTTGATGAAGATTTGTGTGAGTAAGCCTGGAGTTAGCTCTCTTTTGCAGTTTGATTGTGGGGTTTCAGAGGATGGTCATAGTGGGTATCCTTTCAAAATCTACAATGCCTATTATCTTCAATCTTCTGCTTGTTTGGGACCTTCTGTTTATAGAGGCCCTTTGTTCAGCACGTTAGATCCTCAGTTACAAGACGCGCTCAAGGAATACCTAATCAGTAGAGGTGTTGAAGAAAGCCTGACCAATTTCCTTCTCATTCACCTGCATAAAAATGAGCAAGGTCAGTATTTGAATTGGTTGCAAAAGGTCGAATCTTCGATAGCAAAAAGACAACCAAAACAACTTTAACGACTTCATATCAAACACTGTATGCCTGAATTTCATACGGGAAGAGAAGGCCCAACTTTAAGTGCAGGTTTTGATCATTTAGATTGATTGCTAAATAAGAGCTTTCTTTCATTCTGCTTGCATTTTTTTGTTTCTGCAAGTGTATCCTGTTTCTTGCACATGGAACATTCAGCAAGATCTGGTTGTGAACAAAACAAAACTGTGGTACTCATTTAGTCTGAGCCAAGGGGAGAAATTCAATGGAAATGTTGTAACAACTCTTTGTGAATTTCTTGAATCATATTCTTGGTTTTATTATCGTTTTTATTGACTGTTTTTATTT

Coding sequence (CDS)

ATGGGGAGGGCTAATCAAATATTTCGCAAGGCCCGTAAAGCGCTCCATGATCTTGACCTCCTCAAGATCTTGCAATCCGAGATAGCCCACGAGCTTTCTTCAACCCGATTTCTGAACCATGAACATAATAATGGCAGTTCCAGCGATTTCGCTGTGGAATATGACTCGCCCATGTCCCGAGACGTGGTGTTGCGGCGAAAATTGGAATCGGGCGAGGAGGTCGCGATTTCTGCTCTACCGGGTCGTTTCAGATTTGGACACGAAGGGGCTTTTCCGAGGGAGATTTTGATGAAGATTTGTGTGAGTAAGCCTGGAGTTAGCTCTCTTTTGCAGTTTGATTGTGGGGTTTCAGAGGATGGTCATAGTGGGTATCCTTTCAAAATCTACAATGCCTATTATCTTCAATCTTCTGCTTGTTTGGGACCTTCTGTTTATAGAGGCCCTTTGTTCAGCACGTTAGATCCTCAGTTACAAGACGCGCTCAAGGAATACCTAATCAGTAGAGGTGTTGAAGAAAGCCTGACCAATTTCCTTCTCATTCACCTGCATAAAAATGAGCAAGGTCAGTATTTGAATTGGTTGCAAAAGGTCGAATCTTCGATAGCAAAAAGACAACCAAAACAACTTTAA

Protein sequence

MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKICVSKPGVSSLLQFDCGVSEDGHSGYPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESSIAKRQPKQL
Homology
BLAST of Clc04G04020 vs. NCBI nr
Match: XP_023544221.1 (uncharacterized protein LOC111803860 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 347.8 bits (891), Expect = 6.4e-92
Identity = 178/209 (85.17%), Postives = 186/209 (89.00%), Query Frame = 0

Query: 1   MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSR 60
           M RANQIFRKARKALHDLDLLKILQSEI HELSST F NHE   GSSSDFAVE+DSP SR
Sbjct: 1   MPRANQIFRKARKALHDLDLLKILQSEINHELSSTPFQNHE-QKGSSSDFAVEHDSPKSR 60

Query: 61  DVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKICVSKPGVSSLLQFDCGVSEDG 120
           DVVLRRKLESGEE+AISAL G   FG EGAF REILMKICVSKPGVSSLLQFDCGVSEDG
Sbjct: 61  DVVLRRKLESGEEIAISALSGPLIFGREGAFSREILMKICVSKPGVSSLLQFDCGVSEDG 120

Query: 121 HSGYPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLI 180
           H G PFKIYNAYYLQSSACL PSVYRGPLFS+LDP+LQ ALKE+LISRGVEESLT+FL+I
Sbjct: 121 HGGSPFKIYNAYYLQSSACLSPSVYRGPLFSSLDPELQKALKEFLISRGVEESLTDFLII 180

Query: 181 HLHKNEQGQYLNWLQKVESSIAKRQPKQL 210
           HLHK EQGQYLNWLQ VES IAKRQ  +L
Sbjct: 181 HLHKKEQGQYLNWLQNVESLIAKRQQNEL 208

BLAST of Clc04G04020 vs. NCBI nr
Match: KAG6603196.1 (hypothetical protein SDJN03_03805, partial [Cucurbita argyrosperma subsp. sororia] >KAG7033506.1 hypothetical protein SDJN02_03228 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 344.0 bits (881), Expect = 9.3e-91
Identity = 176/209 (84.21%), Postives = 184/209 (88.04%), Query Frame = 0

Query: 1   MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSR 60
           M RANQIFRKARKALHDLDLLKILQSEI HELSST F NHE   GSSSDFAVE+DSP SR
Sbjct: 1   MPRANQIFRKARKALHDLDLLKILQSEINHELSSTPFQNHE-QKGSSSDFAVEHDSPKSR 60

Query: 61  DVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKICVSKPGVSSLLQFDCGVSEDG 120
           DVVLRRKLESGEE+AISAL G   FG EGAF REILMKICVSKPGVSSLLQFDCGVS+DG
Sbjct: 61  DVVLRRKLESGEEIAISALSGPLMFGREGAFSREILMKICVSKPGVSSLLQFDCGVSDDG 120

Query: 121 HSGYPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLI 180
           H G PFKIYNAYYLQSS CL PSVYRGPLFS+LDP+LQ ALK +LISRGVEESLT+FLLI
Sbjct: 121 HGGSPFKIYNAYYLQSSGCLSPSVYRGPLFSSLDPELQKALKGFLISRGVEESLTDFLLI 180

Query: 181 HLHKNEQGQYLNWLQKVESSIAKRQPKQL 210
           HLHK EQGQYLNWLQ VES IAKRQ  +L
Sbjct: 181 HLHKKEQGQYLNWLQNVESLIAKRQQNEL 208

BLAST of Clc04G04020 vs. NCBI nr
Match: XP_022967898.1 (uncharacterized protein LOC111467274 [Cucurbita maxima])

HSP 1 Score: 343.6 bits (880), Expect = 1.2e-90
Identity = 177/209 (84.69%), Postives = 185/209 (88.52%), Query Frame = 0

Query: 1   MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSR 60
           M RANQIFRKARKALHDLDLLKILQSEI HELSST F NHE   GSSSDFAVE+DS  SR
Sbjct: 17  MPRANQIFRKARKALHDLDLLKILQSEINHELSSTSFQNHE-QKGSSSDFAVEHDSLKSR 76

Query: 61  DVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKICVSKPGVSSLLQFDCGVSEDG 120
           DVVLRRKLESGEE+AISAL G   FG EGAF REILMKICVSKPGV+SLLQFDCGVSEDG
Sbjct: 77  DVVLRRKLESGEEIAISALSGPLIFGREGAFSREILMKICVSKPGVNSLLQFDCGVSEDG 136

Query: 121 HSGYPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLI 180
           H G PFKIYNAYYLQSSACLGPSVYRGPLFS+LDP+LQ ALK +LISRGVEESLT+FLLI
Sbjct: 137 HGGSPFKIYNAYYLQSSACLGPSVYRGPLFSSLDPELQKALKGFLISRGVEESLTDFLLI 196

Query: 181 HLHKNEQGQYLNWLQKVESSIAKRQPKQL 210
           HLHK EQGQYLNWLQ VES IAKRQ  +L
Sbjct: 197 HLHKKEQGQYLNWLQNVESLIAKRQQNEL 224

BLAST of Clc04G04020 vs. NCBI nr
Match: XP_022928687.1 (uncharacterized protein LOC111435528 [Cucurbita moschata])

HSP 1 Score: 342.8 bits (878), Expect = 2.1e-90
Identity = 177/209 (84.69%), Postives = 184/209 (88.04%), Query Frame = 0

Query: 1   MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSR 60
           M RANQIFRKARKALHDLDLLKILQSEI HELSST F NHE   GSSSDFAVE+DS  SR
Sbjct: 1   MPRANQIFRKARKALHDLDLLKILQSEINHELSSTPFQNHE-QKGSSSDFAVEHDSLKSR 60

Query: 61  DVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKICVSKPGVSSLLQFDCGVSEDG 120
           DVVLRRKLESGEE+AISAL G   FG EGAF REILMKICVSKPGVSSLLQFDCGVSEDG
Sbjct: 61  DVVLRRKLESGEEIAISALSGPLMFGREGAFSREILMKICVSKPGVSSLLQFDCGVSEDG 120

Query: 121 HSGYPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLI 180
           H G PFKIYNAYYLQSSACL PSVYRGPLFS+LDP+LQ ALK +LISRGVEESLT+FLLI
Sbjct: 121 HGGSPFKIYNAYYLQSSACLSPSVYRGPLFSSLDPELQKALKGFLISRGVEESLTDFLLI 180

Query: 181 HLHKNEQGQYLNWLQKVESSIAKRQPKQL 210
           HLHK EQGQYLNWLQ VES IAKRQ  +L
Sbjct: 181 HLHKKEQGQYLNWLQNVESLIAKRQQNEL 208

BLAST of Clc04G04020 vs. NCBI nr
Match: XP_022153881.1 (mitochondrial acidic protein mam33 [Momordica charantia])

HSP 1 Score: 335.5 bits (859), Expect = 3.3e-88
Identity = 169/209 (80.86%), Postives = 183/209 (87.56%), Query Frame = 0

Query: 1   MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSR 60
           M RANQIFRKARKALHDLDLLKILQSEI HELSSTRF + E  +G S DF VE+DSP S+
Sbjct: 1   MPRANQIFRKARKALHDLDLLKILQSEITHELSSTRFQSDE--SGCSRDFVVEHDSPKSQ 60

Query: 61  DVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKICVSKPGVSSLLQFDCGVSEDG 120
           DVVLRRKLESGEEVA+SAL G  RFG EGAFPREILMKICVSKPGV S+LQFDCGVSED 
Sbjct: 61  DVVLRRKLESGEEVAVSALSGPLRFGREGAFPREILMKICVSKPGVGSILQFDCGVSEDH 120

Query: 121 HSGYPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLI 180
           H G PFKIYNAYYLQSSA LG SVYRGP FS+LDP+LQDALK+YLISRGVEESLTNFLL+
Sbjct: 121 HGGSPFKIYNAYYLQSSANLGSSVYRGPFFSSLDPRLQDALKDYLISRGVEESLTNFLLL 180

Query: 181 HLHKNEQGQYLNWLQKVESSIAKRQPKQL 210
           H+HK EQGQYLNWLQ +ES +AK QP +L
Sbjct: 181 HVHKKEQGQYLNWLQNLESLVAKGQPNEL 207

BLAST of Clc04G04020 vs. ExPASy Swiss-Prot
Match: O94675 (Mitochondrial acidic protein mam33 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=SPBC776.07 PE=3 SV=1)

HSP 1 Score: 60.5 bits (145), Expect = 2.7e-08
Identity = 42/125 (33.60%), Postives = 60/125 (48.00%), Query Frame = 0

Query: 88  EGAFPREILMK-----ICVSKPGVSSLLQFDCGVSEDGHSGYPFKIYNAYYLQSSACLGP 147
           E  FP E L +     I +SKPG  +L+ F+    +DG     F I N Y+ +    L  
Sbjct: 146 EDEFPNEDLGRFQPCTIEISKPGNGALV-FEATALDDG-----FDIENIYFSKDIDMLTS 205

Query: 148 ----------SVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLN 198
                       Y GP F  LDP+LQD    YL  R ++ESL++F++      E  +Y+N
Sbjct: 206 DSLEAEWKRRKQYLGPSFKELDPELQDLFHSYLEERKIDESLSSFIVSFGLTKELKEYIN 264

BLAST of Clc04G04020 vs. ExPASy Swiss-Prot
Match: P40513 (Mitochondrial acidic protein MAM33 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=MAM33 PE=1 SV=1)

HSP 1 Score: 57.8 bits (138), Expect = 1.7e-07
Identity = 25/55 (45.45%), Postives = 38/55 (69.09%), Query Frame = 0

Query: 144 VYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVE 199
           VY GP FS LD +LQ++L+ YL SRGV E L +F+  +    E  +Y++WL+K++
Sbjct: 208 VYHGPPFSNLDEELQESLEAYLESRGVNEELASFISAYSEFKENNEYISWLEKMK 262

BLAST of Clc04G04020 vs. ExPASy TrEMBL
Match: A0A6J1HWH3 (uncharacterized protein LOC111467274 OS=Cucurbita maxima OX=3661 GN=LOC111467274 PE=4 SV=1)

HSP 1 Score: 343.6 bits (880), Expect = 5.9e-91
Identity = 177/209 (84.69%), Postives = 185/209 (88.52%), Query Frame = 0

Query: 1   MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSR 60
           M RANQIFRKARKALHDLDLLKILQSEI HELSST F NHE   GSSSDFAVE+DS  SR
Sbjct: 17  MPRANQIFRKARKALHDLDLLKILQSEINHELSSTSFQNHE-QKGSSSDFAVEHDSLKSR 76

Query: 61  DVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKICVSKPGVSSLLQFDCGVSEDG 120
           DVVLRRKLESGEE+AISAL G   FG EGAF REILMKICVSKPGV+SLLQFDCGVSEDG
Sbjct: 77  DVVLRRKLESGEEIAISALSGPLIFGREGAFSREILMKICVSKPGVNSLLQFDCGVSEDG 136

Query: 121 HSGYPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLI 180
           H G PFKIYNAYYLQSSACLGPSVYRGPLFS+LDP+LQ ALK +LISRGVEESLT+FLLI
Sbjct: 137 HGGSPFKIYNAYYLQSSACLGPSVYRGPLFSSLDPELQKALKGFLISRGVEESLTDFLLI 196

Query: 181 HLHKNEQGQYLNWLQKVESSIAKRQPKQL 210
           HLHK EQGQYLNWLQ VES IAKRQ  +L
Sbjct: 197 HLHKKEQGQYLNWLQNVESLIAKRQQNEL 224

BLAST of Clc04G04020 vs. ExPASy TrEMBL
Match: A0A6J1EKM2 (uncharacterized protein LOC111435528 OS=Cucurbita moschata OX=3662 GN=LOC111435528 PE=4 SV=1)

HSP 1 Score: 342.8 bits (878), Expect = 1.0e-90
Identity = 177/209 (84.69%), Postives = 184/209 (88.04%), Query Frame = 0

Query: 1   MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSR 60
           M RANQIFRKARKALHDLDLLKILQSEI HELSST F NHE   GSSSDFAVE+DS  SR
Sbjct: 1   MPRANQIFRKARKALHDLDLLKILQSEINHELSSTPFQNHE-QKGSSSDFAVEHDSLKSR 60

Query: 61  DVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKICVSKPGVSSLLQFDCGVSEDG 120
           DVVLRRKLESGEE+AISAL G   FG EGAF REILMKICVSKPGVSSLLQFDCGVSEDG
Sbjct: 61  DVVLRRKLESGEEIAISALSGPLMFGREGAFSREILMKICVSKPGVSSLLQFDCGVSEDG 120

Query: 121 HSGYPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLI 180
           H G PFKIYNAYYLQSSACL PSVYRGPLFS+LDP+LQ ALK +LISRGVEESLT+FLLI
Sbjct: 121 HGGSPFKIYNAYYLQSSACLSPSVYRGPLFSSLDPELQKALKGFLISRGVEESLTDFLLI 180

Query: 181 HLHKNEQGQYLNWLQKVESSIAKRQPKQL 210
           HLHK EQGQYLNWLQ VES IAKRQ  +L
Sbjct: 181 HLHKKEQGQYLNWLQNVESLIAKRQQNEL 208

BLAST of Clc04G04020 vs. ExPASy TrEMBL
Match: A0A6J1DM07 (mitochondrial acidic protein mam33 OS=Momordica charantia OX=3673 GN=LOC111021294 PE=4 SV=1)

HSP 1 Score: 335.5 bits (859), Expect = 1.6e-88
Identity = 169/209 (80.86%), Postives = 183/209 (87.56%), Query Frame = 0

Query: 1   MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSR 60
           M RANQIFRKARKALHDLDLLKILQSEI HELSSTRF + E  +G S DF VE+DSP S+
Sbjct: 1   MPRANQIFRKARKALHDLDLLKILQSEITHELSSTRFQSDE--SGCSRDFVVEHDSPKSQ 60

Query: 61  DVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKICVSKPGVSSLLQFDCGVSEDG 120
           DVVLRRKLESGEEVA+SAL G  RFG EGAFPREILMKICVSKPGV S+LQFDCGVSED 
Sbjct: 61  DVVLRRKLESGEEVAVSALSGPLRFGREGAFPREILMKICVSKPGVGSILQFDCGVSEDH 120

Query: 121 HSGYPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLI 180
           H G PFKIYNAYYLQSSA LG SVYRGP FS+LDP+LQDALK+YLISRGVEESLTNFLL+
Sbjct: 121 HGGSPFKIYNAYYLQSSANLGSSVYRGPFFSSLDPRLQDALKDYLISRGVEESLTNFLLL 180

Query: 181 HLHKNEQGQYLNWLQKVESSIAKRQPKQL 210
           H+HK EQGQYLNWLQ +ES +AK QP +L
Sbjct: 181 HVHKKEQGQYLNWLQNLESLVAKGQPNEL 207

BLAST of Clc04G04020 vs. ExPASy TrEMBL
Match: A0A1S3BK83 (uncharacterized protein LOC103490527 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103490527 PE=4 SV=1)

HSP 1 Score: 305.1 bits (780), Expect = 2.3e-79
Identity = 156/196 (79.59%), Postives = 170/196 (86.73%), Query Frame = 0

Query: 1   MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSR 60
           M R  Q+FRKARK   DL LL+ILQSEIAHELSST   N+E NN SSS F VE+DS  S+
Sbjct: 1   MVRTTQLFRKARKTFQDLRLLQILQSEIAHELSSTPCQNYE-NNASSSHFTVEHDSLKSQ 60

Query: 61  DVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKICVSKPGVSSLLQFDCGVSEDG 120
           DVVLRRK++SGEEV ISAL G  RFG++GAFPREILMKICVSKPGVSSLLQFDCGVSEDG
Sbjct: 61  DVVLRRKMDSGEEVVISALLGPLRFGYDGAFPREILMKICVSKPGVSSLLQFDCGVSEDG 120

Query: 121 HSGYPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLI 180
           H G PFK+YNAYYL+SS CLGP VYRGP FS+LDP+LQDALKEYLISRGVEESLTNFLLI
Sbjct: 121 HGGSPFKLYNAYYLRSSDCLGP-VYRGPSFSSLDPRLQDALKEYLISRGVEESLTNFLLI 180

Query: 181 HLHKNEQGQYLNWLQK 197
           HLHK EQGQYLNWL+K
Sbjct: 181 HLHKKEQGQYLNWLKK 194

BLAST of Clc04G04020 vs. ExPASy TrEMBL
Match: A0A5N6RKE9 (Uncharacterized protein OS=Carpinus fangiana OX=176857 GN=FH972_016789 PE=4 SV=1)

HSP 1 Score: 272.7 bits (696), Expect = 1.3e-69
Identity = 130/202 (64.36%), Postives = 163/202 (80.69%), Query Frame = 0

Query: 4   ANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVV 63
           A+ + R+ RK L D DLLK+LQSEI HELSS RFLN++  +GS  DF VE+DS  S+DVV
Sbjct: 4   ASPVLRQCRKVLQDSDLLKVLQSEITHELSSNRFLNNQ--SGSLGDFLVEWDSSQSQDVV 63

Query: 64  LRRKLESGEEVAISALPGRFRFGHEGAFPREILMKICVSKPGVSSLLQFDCGVSEDGHSG 123
           LRRK E GEEV +SA+ G F +G E  FPR +LMK+C+ KPG+ S+LQFDCGVS+ G++G
Sbjct: 64  LRRKCELGEEVVVSAVLGPFTYGRESVFPRGVLMKVCLKKPGLGSILQFDCGVSDRGNNG 123

Query: 124 YPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLH 183
             F I+NA Y+QSSA LGPS YRGP+FS+LDPQLQDALKEYL+S+G+ E+LTNFLL+HLH
Sbjct: 124 SEFNIHNACYIQSSARLGPSAYRGPVFSSLDPQLQDALKEYLVSKGIGENLTNFLLLHLH 183

Query: 184 KNEQGQYLNWLQKVESSIAKRQ 206
           K EQGQY+NWL K+ES +AK +
Sbjct: 184 KKEQGQYVNWLHKLESLVAKSE 203

BLAST of Clc04G04020 vs. TAIR 10
Match: AT2G41600.3 (Mitochondrial glycoprotein family protein )

HSP 1 Score: 190.7 bits (483), Expect = 1.2e-48
Identity = 97/204 (47.55%), Postives = 136/204 (66.67%), Query Frame = 0

Query: 1   MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSR 60
           M + N + ++  KA+ + DLLKILQSEI HE+S  RF   E   GS  DF +++DSP S+
Sbjct: 1   MRKLNPLLKRGLKAIENGDLLKILQSEIRHEISHPRFQGVE--TGSLGDFKLDWDSPESQ 60

Query: 61  DVVLRRKLESGEEVAISAL--PGRFRFGHEGAFPREILMKICVSKPGVSSLLQFDCGVSE 120
           D+VL+R+ +SGE+V +SAL  P       +  FPRE   K+C+ KPG+SS+LQF C V E
Sbjct: 61  DIVLKRQFDSGEKVVVSALLQPEPIELEDDLVFPREAHAKVCIKKPGLSSILQFHCRVYE 120

Query: 121 DGHSGYPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFL 180
            G     F I +AY+++S      S Y    FS +DP+L  AL++YLIS+GV E LTNFL
Sbjct: 121 SGSGSSHFDIESAYFIRSFVSAPSSTYGDHFFSQVDPKLHSALEQYLISKGVSEGLTNFL 180

Query: 181 LIHLHKNEQGQYLNWLQKVESSIA 203
           L HL+K EQ QY+NWL+++ES+++
Sbjct: 181 LCHLNKKEQDQYVNWLRRLESTMS 202

BLAST of Clc04G04020 vs. TAIR 10
Match: AT2G41600.5 (Mitochondrial glycoprotein family protein )

HSP 1 Score: 190.7 bits (483), Expect = 1.2e-48
Identity = 100/211 (47.39%), Postives = 139/211 (65.88%), Query Frame = 0

Query: 1   MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSR 60
           M + N + ++  KA+ + DLLKILQSEI HE+S  RF   E   GS  DF +++DSP S+
Sbjct: 1   MRKLNPLLKRGLKAIENGDLLKILQSEIRHEISHPRFQGVE--TGSLGDFKLDWDSPESQ 60

Query: 61  DVVLRRKLESGEEVAISAL--PGRFRFGHEGAFPREILMKICVSKPGVSSLLQFDCGVSE 120
           D+VL+R+ +SGE+V +SAL  P       +  FPRE   K+C+ KPG+SS+LQF C V E
Sbjct: 61  DIVLKRQFDSGEKVVVSALLQPEPIELEDDLVFPREAHAKVCIKKPGLSSILQFHCRVYE 120

Query: 121 DGHSGYPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFL 180
            G     F I +AY+++S      S Y    FS +DP+L  AL++YLIS+GV E LTNFL
Sbjct: 121 SGSGSSHFDIESAYFIRSFVSAPSSTYGDHFFSQVDPKLHSALEQYLISKGVSEGLTNFL 180

Query: 181 LIHLHKNEQGQYLNWLQKVESSIAKRQPKQL 210
           L HL+K EQ QY+NWL+++ES+++   PK L
Sbjct: 181 LCHLNKKEQDQYVNWLRRLESTMS-HSPKPL 208

BLAST of Clc04G04020 vs. TAIR 10
Match: AT2G41600.2 (Mitochondrial glycoprotein family protein )

HSP 1 Score: 125.2 bits (313), Expect = 6.3e-29
Identity = 68/163 (41.72%), Postives = 96/163 (58.90%), Query Frame = 0

Query: 1   MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSR 60
           M + N + ++  KA+ + DLLKILQSEI HE+S  RF   E   GS  DF +++DSP S+
Sbjct: 1   MRKLNPLLKRGLKAIENGDLLKILQSEIRHEISHPRFQGVE--TGSLGDFKLDWDSPESQ 60

Query: 61  DVVLRRKLESGEEVAISAL--PGRFRFGHEGAFPREILMKICVSKPGVSSLLQFDCGVSE 120
           D+VL+R+ +SGE+V +SAL  P       +  FPRE   K+C+ KPG+SS+LQF C V E
Sbjct: 61  DIVLKRQFDSGEKVVVSALLQPEPIELEDDLVFPREAHAKVCIKKPGLSSILQFHCRVYE 120

Query: 121 DGHSGYPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDAL 162
            G     F I +AY+++S      S Y    F +   Q   A+
Sbjct: 121 SGSGSSHFDIESAYFIRSFVSAPSSTYGDHFFRSQTTQCTGAI 161

BLAST of Clc04G04020 vs. TAIR 10
Match: AT2G41600.1 (Mitochondrial glycoprotein family protein )

HSP 1 Score: 122.9 bits (307), Expect = 3.1e-28
Identity = 66/152 (43.42%), Postives = 92/152 (60.53%), Query Frame = 0

Query: 1   MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSR 60
           M + N + ++  KA+ + DLLKILQSEI HE+S  RF   E   GS  DF +++DSP S+
Sbjct: 1   MRKLNPLLKRGLKAIENGDLLKILQSEIRHEISHPRFQGVE--TGSLGDFKLDWDSPESQ 60

Query: 61  DVVLRRKLESGEEVAISAL--PGRFRFGHEGAFPREILMKICVSKPGVSSLLQFDCGVSE 120
           D+VL+R+ +SGE+V +SAL  P       +  FPRE   K+C+ KPG+SS+LQF C V E
Sbjct: 61  DIVLKRQFDSGEKVVVSALLQPEPIELEDDLVFPREAHAKVCIKKPGLSSILQFHCRVYE 120

Query: 121 DGHSGYPFKIYNAYYLQSSACLGPSVYRGPLF 151
            G     F I +AY+++S      S Y    F
Sbjct: 121 SGSGSSHFDIESAYFIRSFVSAPSSTYGDHFF 150

BLAST of Clc04G04020 vs. TAIR 10
Match: AT2G41600.4 (Mitochondrial glycoprotein family protein )

HSP 1 Score: 122.9 bits (307), Expect = 3.1e-28
Identity = 66/152 (43.42%), Postives = 92/152 (60.53%), Query Frame = 0

Query: 1   MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSR 60
           M + N + ++  KA+ + DLLKILQSEI HE+S  RF   E   GS  DF +++DSP S+
Sbjct: 1   MRKLNPLLKRGLKAIENGDLLKILQSEIRHEISHPRFQGVE--TGSLGDFKLDWDSPESQ 60

Query: 61  DVVLRRKLESGEEVAISAL--PGRFRFGHEGAFPREILMKICVSKPGVSSLLQFDCGVSE 120
           D+VL+R+ +SGE+V +SAL  P       +  FPRE   K+C+ KPG+SS+LQF C V E
Sbjct: 61  DIVLKRQFDSGEKVVVSALLQPEPIELEDDLVFPREAHAKVCIKKPGLSSILQFHCRVYE 120

Query: 121 DGHSGYPFKIYNAYYLQSSACLGPSVYRGPLF 151
            G     F I +AY+++S      S Y    F
Sbjct: 121 SGSGSSHFDIESAYFIRSFVSAPSSTYGDHFF 150

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_023544221.16.4e-9285.17uncharacterized protein LOC111803860 [Cucurbita pepo subsp. pepo][more]
KAG6603196.19.3e-9184.21hypothetical protein SDJN03_03805, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_022967898.11.2e-9084.69uncharacterized protein LOC111467274 [Cucurbita maxima][more]
XP_022928687.12.1e-9084.69uncharacterized protein LOC111435528 [Cucurbita moschata][more]
XP_022153881.13.3e-8880.86mitochondrial acidic protein mam33 [Momordica charantia][more]
Match NameE-valueIdentityDescription
O946752.7e-0833.60Mitochondrial acidic protein mam33 OS=Schizosaccharomyces pombe (strain 972 / AT... [more]
P405131.7e-0745.45Mitochondrial acidic protein MAM33 OS=Saccharomyces cerevisiae (strain ATCC 2045... [more]
Match NameE-valueIdentityDescription
A0A6J1HWH35.9e-9184.69uncharacterized protein LOC111467274 OS=Cucurbita maxima OX=3661 GN=LOC111467274... [more]
A0A6J1EKM21.0e-9084.69uncharacterized protein LOC111435528 OS=Cucurbita moschata OX=3662 GN=LOC1114355... [more]
A0A6J1DM071.6e-8880.86mitochondrial acidic protein mam33 OS=Momordica charantia OX=3673 GN=LOC11102129... [more]
A0A1S3BK832.3e-7979.59uncharacterized protein LOC103490527 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... [more]
A0A5N6RKE91.3e-6964.36Uncharacterized protein OS=Carpinus fangiana OX=176857 GN=FH972_016789 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT2G41600.31.2e-4847.55Mitochondrial glycoprotein family protein [more]
AT2G41600.51.2e-4847.39Mitochondrial glycoprotein family protein [more]
AT2G41600.26.3e-2941.72Mitochondrial glycoprotein family protein [more]
AT2G41600.13.1e-2843.42Mitochondrial glycoprotein family protein [more]
AT2G41600.43.1e-2843.42Mitochondrial glycoprotein family protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR003428Mitochondrial glycoproteinPFAMPF02330MAM33coord: 97..199
e-value: 3.5E-16
score: 59.8
IPR003428Mitochondrial glycoproteinPANTHERPTHR10826COMPLEMENT COMPONENT 1coord: 2..203
IPR036561Mitochondrial glycoprotein superfamilyGENE3D3.10.280.10Mitochondrial glycoproteincoord: 15..201
e-value: 8.7E-26
score: 93.0
IPR036561Mitochondrial glycoprotein superfamilySUPERFAMILY54529Mitochondrial glycoprotein MAM33-likecoord: 17..199
NoneNo IPR availablePANTHERPTHR10826:SF7MITOCHONDRIAL GLYCOPROTEIN FAMILY PROTEINcoord: 2..203

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Clc04G04020.1Clc04G04020.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0005759 mitochondrial matrix