CmaCh05G003940 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh05G003940
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: UPF0481 protein At3g47200-like
Location: Cma_Chr05: 1727621 .. 1728771 (+)
RNA-Seq Expression: CmaCh05G003940
Synteny: CmaCh05G003940
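The Location field can be read programmatically. The sketch below parses a location string of the form shown in this record and computes the span length. It assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), which this page does not state explicitly; the function names `parse_location` and `span_length` are hypothetical helpers, not part of any database API.

```python
import re

def parse_location(text):
    """Split 'Chrom: start .. end (strand)' into its four parts."""
    pattern = r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)"
    match = re.fullmatch(pattern, text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end, strand = parse_location("Cma_Chr05: 1727621 .. 1728771 (+)")
print(chrom, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span for this gene is 1151 bases.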
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGACAAACCATACAATATGCCAGGAATTAGTGAAGGTGAAGAAGGTGAACAACCTCGTCGTAATGTTATTGTAGAATGTAGCATCTTTTGAGTTCTTAAACTGTTACGTAGTATGAATCATAGAGTCTATACCCCTCAAGTCATTTCTATAGGTCCATTTCACCATCATCGAAAGGATTTGTTAGCCACTGAGCCATATAAGCTTCGACGTTGTTCGAACTTTCTAAGCCGTCTAGGTAACGAGAGAGATTCATTGGAGTTGCTTAAGAAAAATACTCAAACTTGGATGAAAGAAGTTCGAAATTGGTATGCATAGCACATAAACATGAATGATAAGGAATTTGGTAACATGATGATTGTGGATGGTCGTTTCCTAGTAGAGTTTTTGATACAACATCATAACCGACACTGCCCAAATACATGCTTCCAAACTCCAAACAATTTAGATCTTACCTTCCACCAAAGATTTATCGAGCTATTTACTGATTTGATTATGTTGGAAAATCGAGTCCCTTTCTTCCTTCTTGAACGTTTGTTCTACCTTATACTAAATACCATCTCCATCTCCTTTGTATACTTAACCTATATGTTTTTTAAACTCGAACTTGTTGACAATTATTCTCTTTCTGATCTATCGTCGATAAAACCAAAGTACTTCTTTGATTTGTTAAGCTTCTACTTCGTTGTCTCTAAGACATCTCTAGAGAACAATAATAATAATAGCCCGATAACTCCTCCATCGATAACTGAGCTTCACGAGGCTGGTGTCACCATTAAGAAAGCAGAAAATGCTGAATGTGCAATGAACCTAAGCTTCCAAAATGGGATTTTCACAATCCCACAATCCCACCTTTTAACCGTTGATGATTTCTTCGAACACACCATGCAAAATCTAATAGCATTTGAGCATTTTCCCTTGGAAAATGAAAGCAAGCGTATCCAATATATCGCATTCATGGATGATTTGATAAGAAAGGAGAAATATTTTAATTTACTTGTGAAAGCTGGAATCATAATCAACAAGATTGGAGATAGTGATAAAGAAGTTTCAAACTTGTTTAACAATCTCTGCAAATTTGTTGAACAACCATGTGATGGCGAGTTCAACAATATCAGCAAAGCTTTGCATAAGCATTGTGATAGACGATGA

mRNA sequence

ATGACAAACCATACAATATGCCAGGAATTAGTGAAGGTGAAGAAGGTGAACAACCTCGTCGTAATCTTCTACTTCGTTGTCTCTAAGACATCTCTAGAGAACAATAATAATAATAGCCCGATAACTCCTCCATCGATAACTGAGCTTCACGAGGCTGGTGTCACCATTAAGAAAGCAGAAAATGCTGAATGTGCAATGAACCTAAGCTTCCAAAATGGGATTTTCACAATCCCACAATCCCACCTTTTAACCGTTGATGATTTCTTCGAACACACCATGCAAAATCTAATAGCATTTGAGCATTTTCCCTTGGAAAATGAAAGCAAGCGTATCCAATATATCGCATTCATGGATGATTTGATAAGAAAGGAGAAATATTTTAATTTACTTGTGAAAGCTGGAATCATAATCAACAAGATTGGAGATAGTGATAAAGAAGTTTCAAACTTGTTTAACAATCTCTGCAAATTTGTTGAACAACCATGTGATGGCGAGTTCAACAATATCAGCAAAGCTTTGCATAAGCATTGTGATAGACGATGA

Coding sequence (CDS)

ATGACAAACCATACAATATGCCAGGAATTAGTGAAGGTGAAGAAGGTGAACAACCTCGTCGTAATCTTCTACTTCGTTGTCTCTAAGACATCTCTAGAGAACAATAATAATAATAGCCCGATAACTCCTCCATCGATAACTGAGCTTCACGAGGCTGGTGTCACCATTAAGAAAGCAGAAAATGCTGAATGTGCAATGAACCTAAGCTTCCAAAATGGGATTTTCACAATCCCACAATCCCACCTTTTAACCGTTGATGATTTCTTCGAACACACCATGCAAAATCTAATAGCATTTGAGCATTTTCCCTTGGAAAATGAAAGCAAGCGTATCCAATATATCGCATTCATGGATGATTTGATAAGAAAGGAGAAATATTTTAATTTACTTGTGAAAGCTGGAATCATAATCAACAAGATTGGAGATAGTGATAAAGAAGTTTCAAACTTGTTTAACAATCTCTGCAAATTTGTTGAACAACCATGTGATGGCGAGTTCAACAATATCAGCAAAGCTTTGCATAAGCATTGTGATAGACGATGA

Protein sequence

MTNHTICQELVKVKKVNNLVVIFYFVVSKTSLENNNNNSPITPPSITELHEAGVTIKKAENAECAMNLSFQNGIFTIPQSHLLTVDDFFEHTMQNLIAFEHFPLENESKRIQYIAFMDDLIRKEKYFNLLVKAGIIINKIGDSDKEVSNLFNNLCKFVEQPCDGEFNNISKALHKHCDRR
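The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the function name `translate` is a hypothetical helper, and the example input is only the first 24 bases and the last 12 bases of the CDS listed above.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

print(translate("ATGACAAACCATACAATATGCCAG"))  # start of the CDS
print(translate("GATAGACGATGA"))              # end of the CDS, including the stop codon
```

The two fragments translate to `MTNHTICQ` and `DRR`, which agree with the first eight and last three residues of the protein sequence listed above.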
Homology
BLAST of CmaCh05G003940 vs. TAIR 10
Match: AT4G31980.1 (unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF247, plant (InterPro:IPR004158), Protein of unknown function DUF862, eukaryotic (InterPro:IPR008580); BEST Arabidopsis thaliana protein match is: Plant protein of unknown function (DUF247) (TAIR:AT5G11290.1); Has 1967 Blast hits to 1844 proteins in 183 species: Archae - 0; Bacteria - 6; Metazoa - 223; Fungi - 83; Plants - 1477; Viruses - 0; Other Eukaryotes - 178 (source: NCBI BLink). )

HSP 1 Score: 82.4 bits (202), Expect = 4.0e-16
Identity = 49/135 (36.30%), Positives = 77/135 (57.04%), Query Frame = 0

Query: 44  PSITELHEAGVTIKKAENAECAMNLSFQNGIFTIPQSHLLTVDDFFEHTMQNLIAFEHFP 103
           P  TELH AGV  K AE + C +++SF +G+  IP    + VDD  E   +N+I FE   
Sbjct: 506 PEATELHTAGVRFKPAETSSCLLDISFADGVLKIP---TIVVDDLTESLYKNIIGFEQCR 565

Query: 104 LENESKRIQYIAFMDDLIRKEKYFNLLVKAGIIINKIGDSDKEVSNLFNNLCKFVEQPCD 163
             N++  + YI  +   I+     +LL+ +GII+N +G+S  +VSNLFN++ K V     
Sbjct: 566 CSNKN-FLDYIMLLGCFIKSPTDADLLIHSGIIVNYLGNS-VDVSNLFNSISKEVIYDRR 625

Query: 164 GEFNNISKALHKHCD 179
             F+ +S+ L  +C+
Sbjct: 626 FYFSMLSENLQAYCN 635

BLAST of CmaCh05G003940 vs. TAIR 10
Match: AT3G50150.1 (Plant protein of unknown function (DUF247) )

HSP 1 Score: 73.9 bits (180), Expect = 1.4e-13
Identity = 47/136 (34.56%), Positives = 75/136 (55.15%), Query Frame = 0

Query: 46  ITELHEAGVTIKKAENAECAMNLSFQNGIFTIPQSHLLTVDDFFEHTMQNLIAFEHFPLE 105
           +TEL  AGV   + E  +   ++ F+NG   IP+   L + D  +    NLIAFE    +
Sbjct: 331 VTELRGAGVNFMRKETGQ-LWDIEFKNGYLKIPK---LLIHDGTKSLFSNLIAFEQCHTQ 390

Query: 106 NESKRIQYIAFMDDLIRKEKYFNLLVKAGIIINKIGDSDKEVSNLFNNLCK-FVEQPCDG 165
           + +    YI FMD+LI   +  + L   GII + +G SD EV++LFN LCK  +  P DG
Sbjct: 391 SSNNITSYIIFMDNLINSSQDVSYLHHDGIIEHWLG-SDSEVADLFNRLCKEVIFDPKDG 450

Query: 166 EFNNISKALHKHCDRR 181
             + +S+ ++++  R+
Sbjct: 451 YLSQLSREVNRYYSRK 461

BLAST of CmaCh05G003940 vs. TAIR 10
Match: AT5G11290.1 (Plant protein of unknown function (DUF247) )

HSP 1 Score: 70.1 bits (170), Expect = 2.1e-12
Identity = 44/135 (32.59%), Positives = 74/135 (54.81%), Query Frame = 0

Query: 45  SITELHEAGVTIKKAENAECAMNLSFQNGIFTIPQSHLLTVDDFFEHTMQNLIAFEHFPL 104
           S  E+  AGV ++ A+N  CA+++SF NG+ TIP+   + ++D  E   +N+I FE    
Sbjct: 181 SAKEIQNAGVKLQPADNNTCALDISFANGVLTIPK---IKINDITESLYRNIILFEQCH- 240

Query: 105 ENESKRIQYIAFMDDLIRKEKYFNLLVKAGIIINKIGDSDKEVSNLFNNLCKFVEQPCDG 164
             ++  I Y+ F+   IR      L +  GII+N+ G+++ +VS LFN++ K  E    G
Sbjct: 241 RLDAYFIHYMRFLSCFIRSPMDAELFIDHGIIVNRFGNAE-DVSRLFNSILK--ETSYSG 300

Query: 165 -EFNNISKALHKHCD 179
             +  +   L  HC+
Sbjct: 301 FYYKTVYGNLQAHCN 308

BLAST of CmaCh05G003940 vs. TAIR 10
Match: AT2G44930.1 (Plant protein of unknown function (DUF247) )

HSP 1 Score: 56.6 bits (135), Expect = 2.4e-08
Identity = 45/136 (33.09%), Positives = 70/136 (51.47%), Query Frame = 0

Query: 48  ELHEAGVTIKKAENA-ECAMNLSFQNGIFTIPQSHLLTVDDFFEHTMQNLIAFE--HFPL 107
           +L  AGV   + E   + ++ ++F+ GI  IP       DD  E  M+NL+A E  H+PL
Sbjct: 327 KLDSAGVDFVRLERKNDLSLVITFERGILEIP---CFLADDNTERIMRNLMALEQCHYPL 386

Query: 108 ENESKRIQYIAFMDDLIRKEKYFNLLVKAGIIINKIGDSDKEVSNLFNNLCKFVEQPCDG 167
              +    YIAF+D LI  ++  +LLVK G+I N +G     V+ + N LC  +      
Sbjct: 387 --TAYVCNYIAFLDFLIDTDQDVDLLVKKGVIKNWLG-HQASVAEMVNKLCLGLVD-FGS 446

Query: 168 EFNNISKALHKHCDRR 181
            +  I+  L+KH + R
Sbjct: 447 HYYGIADRLNKHYESR 455

BLAST of CmaCh05G003940 vs. TAIR 10
Match: AT5G22560.1 (Plant protein of unknown function (DUF247) )

HSP 1 Score: 56.6 bits (135), Expect = 2.4e-08
Identity = 37/126 (29.37%), Positives = 61/126 (48.41%), Query Frame = 0

Query: 40  PITPP---------SITELHEAGVTIKKAENAECAMNLSFQNGIFTIPQSHLLTVDDFFE 99
           P+TPP         S  +L   G+  ++ +  E  ++++ +NG+  IP    L  DDFF 
Sbjct: 323 PLTPPPRRFLKLVVSARKLRLRGIKFQQKKKFETPLDITLKNGVLKIPP---LLFDDFFS 382

Query: 100 HTMQNLIAFEHFPLENESKRIQYIAFMDDLIRKEKYFNLLVKAGIIINKIGDSDKEVSNL 157
             + N +AFE F ++  ++   Y+ FM  LI        L + GII N  G + +++S  
Sbjct: 383 SLLINCVAFEQFNVQGTTEMTSYVTFMGCLINTADDATFLSEKGIIENYFG-TGEQLSVF 442

The following BLAST results are available for this feature:
Match Name     E-value    Identity (%)   Description
AT4G31980.1    4.0e-16    36.30          unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF247,...
AT3G50150.1    1.4e-13    34.56          Plant protein of unknown function (DUF247)
AT5G11290.1    2.1e-12    32.59          Plant protein of unknown function (DUF247)
AT2G44930.1    2.4e-08    33.09          Plant protein of unknown function (DUF247)
AT5G22560.1    2.4e-08    29.37          Plant protein of unknown function (DUF247)
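The identity percentages reported for each hit are the number of identical aligned positions divided by the alignment length, as given in the "Identity = matches/length" lines of the HSPs. The sketch below reproduces that arithmetic; the function name `percent_of_alignment` is a hypothetical helper, and rounding to two decimals is an assumption based on how the values are displayed here.

```python
def percent_of_alignment(count, aligned_length):
    """Return count / aligned_length as a percentage rounded to two decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * count / aligned_length, 2)

# (identical positions, positive-scoring positions, alignment length) per HSP,
# copied from the alignments in this record.
hsps = {
    "AT4G31980.1": (49, 77, 135),
    "AT3G50150.1": (47, 75, 136),
    "AT5G11290.1": (44, 74, 135),
    "AT2G44930.1": (45, 70, 136),
    "AT5G22560.1": (37, 61, 126),
}

for name, (identical, positives, length) in hsps.items():
    print(name,
          percent_of_alignment(identical, length),
          percent_of_alignment(positives, length))
```

The same function covers the Positives values, since they use the same alignment length as the denominator.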
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term     IPR Description                             Source    Source Term   Source Description       Alignment
IPR004158    Protein of unknown function DUF247, plant   PFAM      PF03140       DUF247                   coord: 22..179; e-value: 4.8E-32; score: 111.7
None         No IPR available                            PANTHER   PTHR31170     BNAC04G53230D PROTEIN    coord: 24..180

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
CmaCh05G003940.1    CmaCh05G003940.1    mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession   Term Name
cellular_component    GO:0016021       integral component of membrane