CmaCh05G004960 (gene) Cucurbita maxima (Rimu) v1.1

Overview
NameCmaCh05G004960
Typegene
OrganismCucurbita maxima (Cucurbita maxima (Rimu) v1.1)
DescriptionDUF4057 domain-containing protein
LocationCma_Chr05: 2350430 .. 2353653 (-)
RNA-Seq ExpressionCmaCh05G004960
SyntenyCmaCh05G004960
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAAAAAAAAAAGAAGCTTCCAGATCATGAGGGCATTTCGGTCATTTGAGTTGAGAAAATCTCTGTGCTTCCACGGCACATTAGCACTGTGTCTCTCTCGCAAAGCACTTCCTTCTTCTTCTTCTTCGTTTTCGCCATTAACAAACACCAATCGAAGCTTCTCTACTCAAACTTCCATTTCTTCAGTTCTTCATTTCTTCATCAATCTCTCTTTGGCCTCTGCAAATCCTAATTCCTTTCTTCTCCAATGGACAGAGCCACTCCTGTTCGGAAGCCTCACACCTCCACTGCAGATCTCCTCACATGGCCTGAACTTCCTCCCGCCGATTCCCCGGCCTTTCCTTCGTCTGCTTCTCGCTCCGCTCCCAGGTCTCATCAGGTATACTTTCTGCATTTCTATCTTTTTTCTTGTTTCTTCTGCTTATTCTCTTTCTTTTGCGTCTCTCTAGCCCTCCGATGGAATCAGCAAGGTCGTCTTTGGAGGCCAGGTTACCGACGAGGAGGTTGAGAGCTTGAACAAAAGGTGAGATCTAAGTTTGCCTCTTTTCTTTCCAATCTTCATTATGTGAAGCTTATTCTGTTTCGATTACACTGTTTCCTTGTGTTCGATCTGCTTGTTTTGTTTCAAATTACTCATGATTGGATCTCAAATTTGTAATCCTTTCTGTGTTTCCAAGGCAATTTAGAAAAGTATATGAGCATTTTCTCTTCTTCTCCCACCACTTGATTTTAGTATGTAATATGGATTAAAGAGAAAAAGAAATATCAAGTTCCGTTTGCCGGTCCGCTATGCATCAGATCTTCAAATTTGTGGAGTTTCTTAGTTTGACTTCTATGTCAATCGCTTCAATTGTTCGATTTTCAAACTGTAGGAAACCCTGCTCTGGATATAAAATGAAGGAGATGACTGGCAGTGGCATTTTTGTTGGTAACGAAGGAGATGATGAGCAAGAATCTGGAAGCGCCAACCCTTCACAGAGTAAAACAGGAATACGCATGTACCAGGTATCCATCGCAGAAGATCACTTAATGCAGTTTCAACTGACTGTAAGCGAGTTTCTGTTTGGTTTGAATGCAATTAAACTCCTAATCTTGACGGGTCGATCAAATTGTAGCAAACACTGGCTGGAATTAGTCATATTTCATTCGGTGAAGAAGGTGGTGTTTCTCCTAAAAAGCCTACTACTCTACCTGAGGTTGCGAAGCAGCGTGAGTTGAGCGGGAACTTAGTAAGCGATGCCGATGAGAAGCTGAAGAAGCAGCTCTCTGATGCTAAGTACCAGGAGCTTAGTGGACATGGCATATTCGCTCCTCCTCCTGAGATTTTGCCTCGACCTACAACTGCTCGCACTTTGGATTTAAAAGGAAGCATCGAGATTGGGGAGCCTGATGATGTAAGTAGTTCTCCACTCAATTCTACAACCTCTTAATTCAAACACATCCATTACGAAGGCACACCCACAAACGCCATTCACTCATTTGGCCTAGAAATTAAGTGCATTTCTTTGTTTCTTTACCTACCTTATACAAATGTTTTAAAATCTAAATCGAGGATCTCCTGTTTGATAACCATTTTGTTCTTGATCTTCCATTTTTTTGAAAATTAAACTTATATGCACTACTTTCTTGCCTACTTTTTAAAGAATGTTTTAAAAAATCAGGCCAATGTTGGAAAACTAAAAGAAATAGTTCTTACAAACTTGATTTTGTTTTTGAAATTTGACTTAGAATTCAAATGTATTTTCAAGGGAGGTGATAAGTCTTCCCAAAAAGGATAAAGCAACAAGCATAATATTCAAAAATAGATAACAAAGACCTTATCAAATGGTGCTTTAATGAAAGTAGGTTCTAAAAACTTGTTTTTTGGAATTTGGAATCTGGCTAAGAATTCAAATGTTTTCAAGGATGAAAATTATGGTAAAGAAGTGGTGGGAAAACTAGCACAATTTTCAACAACCGAAAATCATAAACCAAATAGTTATCAAAAAAGGGCCTTACTATGATGTTCTCTTGTTATGGTTCGAGAAGCCGTGTATAGAAGAATGGTGCATCATGTTAAAGTATTGACATAATTATTATCGTAACTTATTAGAGAAGGATGATATTAGAGAAGGAGAAGGAGGTTATATGTTTGAATCCCATATTGTGATTTTTGGGTTGAGTGGTGACCTAAATCTTTCGTTTGCATCAGAAATGATGGTGTCAGAGGCAAACAATGATGAAATCCTAGGATTTTAGTTGGTTAGTATACGTGCATCCACTAATTTCTCTTCGTTCGTTTTCATCATATTCTTTTTAGTGAGGCCACAAGCTCACAATGATGAAATCCTAAGGTTCCAAGAAGTTAAATATACATGCAATCATCACTAATTAATTGCTCAACGTGAGGTTGCTTTTCTGACTGATTAATCATATGCCCCCATTTGATAGCCTGATATGGCTTTGGTTAGTAGTTTTTTAGTTTACTTTTTCTTCCAAGTCTATTCTTCTTTGCATATGAGTCTGTTGGTCATCACCTTGAGCTTTATTTCGATCTAACTCTAACATCGGTGTGTTCATGTTCCAACTAATCTGATCAAACACAGAGATGTGTGATCCCCGGAGAAGAACCTTCTGTAAAAACAGCAAAGAAGATTTATGATAAGAAATTTTCGGAGTTATCAGGAAACGACATCTTCAAAGGTGACGTTCCTCCATCGTCAACAGAGAAACCGTTGAGCATGGCAAAGTTGCGAGAGATGAGTGGGAGTGACATATTTGCAGACGGGAAGGTAGAGGCCCGAGACTACTTAGGCGGGGTACGCAAACCCCCGGGTGGCGAGAGCAGCATTGCCTTGGTCTAAATCATCGAGGTTTCTAACAAAACTCAATACTTTTAGATTATGGAAATTGTGATCTGGTTTGGGTGAAATATGTTGTTAGGTTTAACTTTGTGAGTCAACAACAACTATGGAGTCTGCTAATTGGGTTGTGCGTCCATAATGTTTTTGTTCTTGGTTGTGTGTATGTCTGTATCTGTATCTGTGTCTAGTAGTAGCTCAGTTTTATCTTGGTTGGTTGTCTTCGCCTATTTTACTTCCAGCTTTTGAGGTTCATTTATTTATCATTATATTCATTAATTGATTTTGCCTATGGTGCTGAAATGTATTTTTATCATTTTTCTTCTTCTTGTCTTTGGAAAATCTTCATATGCCAAACCTTTTATCTGCCATTTGCCAATGAAAT

mRNA sequence

AAAAAAAAAAAGAAGCTTCCAGATCATGAGGGCATTTCGGTCATTTGAGTTGAGAAAATCTCTGTGCTTCCACGGCACATTAGCACTGTGTCTCTCTCGCAAAGCACTTCCTTCTTCTTCTTCTTCGTTTTCGCCATTAACAAACACCAATCGAAGCTTCTCTACTCAAACTTCCATTTCTTCAGTTCTTCATTTCTTCATCAATCTCTCTTTGGCCTCTGCAAATCCTAATTCCTTTCTTCTCCAATGGACAGAGCCACTCCTGTTCGGAAGCCTCACACCTCCACTGCAGATCTCCTCACATGGCCTGAACTTCCTCCCGCCGATTCCCCGGCCTTTCCTTCGTCTGCTTCTCGCTCCGCTCCCAGGTCTCATCAGCCCTCCGATGGAATCAGCAAGGTCGTCTTTGGAGGCCAGGTTACCGACGAGGAGGTTGAGAGCTTGAACAAAAGGAAACCCTGCTCTGGATATAAAATGAAGGAGATGACTGGCAGTGGCATTTTTGTTGGTAACGAAGGAGATGATGAGCAAGAATCTGGAAGCGCCAACCCTTCACAGAGTAAAACAGGAATACGCATGTACCAGCAAACACTGGCTGGAATTAGTCATATTTCATTCGGTGAAGAAGGTGGTGTTTCTCCTAAAAAGCCTACTACTCTACCTGAGGTTGCGAAGCAGCGTGAGTTGAGCGGGAACTTAGTAAGCGATGCCGATGAGAAGCTGAAGAAGCAGCTCTCTGATGCTAAGTACCAGGAGCTTAGTGGACATGGCATATTCGCTCCTCCTCCTGAGATTTTGCCTCGACCTACAACTGCTCGCACTTTGGATTTAAAAGGAAGCATCGAGATTGGGGAGCCTGATGATAGATGTGTGATCCCCGGAGAAGAACCTTCTGTAAAAACAGCAAAGAAGATTTATGATAAGAAATTTTCGGAGTTATCAGGAAACGACATCTTCAAAGGTGACGTTCCTCCATCGTCAACAGAGAAACCGTTGAGCATGGCAAAGTTGCGAGAGATGAGTGGGAGTGACATATTTGCAGACGGGAAGGTAGAGGCCCGAGACTACTTAGGCGGGGTACGCAAACCCCCGGGTGGCGAGAGCAGCATTGCCTTGGTCTAAATCATCGAGGTTTCTAACAAAACTCAATACTTTTAGATTATGGAAATTGTGATCTGGTTTGGGTGAAATATGTTGTTAGGTTTAACTTTGTGAGTCAACAACAACTATGGAGTCTGCTAATTGGGTTGTGCGTCCATAATGTTTTTGTTCTTGGTTGTGTGTATGTCTGTATCTGTATCTGTGTCTAGTAGTAGCTCAGTTTTATCTTGGTTGGTTGTCTTCGCCTATTTTACTTCCAGCTTTTGAGGTTCATTTATTTATCATTATATTCATTAATTGATTTTGCCTATGGTGCTGAAATGTATTTTTATCATTTTTCTTCTTCTTGTCTTTGGAAAATCTTCATATGCCAAACCTTTTATCTGCCATTTGCCAATGAAAT

Coding sequence (CDS)

ATGGACAGAGCCACTCCTGTTCGGAAGCCTCACACCTCCACTGCAGATCTCCTCACATGGCCTGAACTTCCTCCCGCCGATTCCCCGGCCTTTCCTTCGTCTGCTTCTCGCTCCGCTCCCAGGTCTCATCAGCCCTCCGATGGAATCAGCAAGGTCGTCTTTGGAGGCCAGGTTACCGACGAGGAGGTTGAGAGCTTGAACAAAAGGAAACCCTGCTCTGGATATAAAATGAAGGAGATGACTGGCAGTGGCATTTTTGTTGGTAACGAAGGAGATGATGAGCAAGAATCTGGAAGCGCCAACCCTTCACAGAGTAAAACAGGAATACGCATGTACCAGCAAACACTGGCTGGAATTAGTCATATTTCATTCGGTGAAGAAGGTGGTGTTTCTCCTAAAAAGCCTACTACTCTACCTGAGGTTGCGAAGCAGCGTGAGTTGAGCGGGAACTTAGTAAGCGATGCCGATGAGAAGCTGAAGAAGCAGCTCTCTGATGCTAAGTACCAGGAGCTTAGTGGACATGGCATATTCGCTCCTCCTCCTGAGATTTTGCCTCGACCTACAACTGCTCGCACTTTGGATTTAAAAGGAAGCATCGAGATTGGGGAGCCTGATGATAGATGTGTGATCCCCGGAGAAGAACCTTCTGTAAAAACAGCAAAGAAGATTTATGATAAGAAATTTTCGGAGTTATCAGGAAACGACATCTTCAAAGGTGACGTTCCTCCATCGTCAACAGAGAAACCGTTGAGCATGGCAAAGTTGCGAGAGATGAGTGGGAGTGACATATTTGCAGACGGGAAGGTAGAGGCCCGAGACTACTTAGGCGGGGTACGCAAACCCCCGGGTGGCGAGAGCAGCATTGCCTTGGTCTAA

Protein sequence

MDRATPVRKPHTSTADLLTWPELPPADSPAFPSSASRSAPRSHQPSDGISKVVFGGQVTDEEVESLNKRKPCSGYKMKEMTGSGIFVGNEGDDEQESGSANPSQSKTGIRMYQQTLAGISHISFGEEGGVSPKKPTTLPEVAKQRELSGNLVSDADEKLKKQLSDAKYQELSGHGIFAPPPEILPRPTTARTLDLKGSIEIGEPDDRCVIPGEEPSVKTAKKIYDKKFSELSGNDIFKGDVPPSSTEKPLSMAKLREMSGSDIFADGKVEARDYLGGVRKPPGGESSIALV
Homology
BLAST of CmaCh05G004960 vs. ExPASy Swiss-Prot
Match: Q9SIE0 (DNA oxidative demethylase ALKBH2 OS=Arabidopsis thaliana OX=3702 GN=ALKBH2 PE=2 SV=2)

HSP 1 Score: 52.8 bits (125), Expect = 7.8e-06
Identity = 32/59 (54.24%), Postives = 40/59 (67.80%), Query Frame = 0

Query: 36 SRSAPRSHQP-SDGISKVVFGGQVTDEEVESLNKRKPCSGYKMKEMTGSGIFVGNEGDD 94
          S +A RS+QP SDGIS     GQ+T+EE ESL  +K CSG+K+KE+T S  F  N  DD
Sbjct: 7  STAANRSNQPSSDGIS----DGQITNEEAESLINKKNCSGHKLKEVTDSDTFSDNGKDD 61

BLAST of CmaCh05G004960 vs. TAIR 10
Match: AT1G78150.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G35780.1); Has 152 Blast hits to 146 proteins in 18 species: Archae - 0; Bacteria - 0; Metazoa - 1; Fungi - 2; Plants - 149; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 374.4 bits (960), Expect = 8.3e-104
Identity = 199/291 (68.38%), Postives = 227/291 (78.01%), Query Frame = 0

Query: 1   MDRATPVRKPHTSTADLLTWPELPPADSPAFPSSASRSAPRSHQPSDGISKVVFGGQVTD 60
           M+R+TPVRKPHTSTADLLTW E+PP DS   PSSASRSA RSHQPSDGISKVVFGGQVTD
Sbjct: 1   MERSTPVRKPHTSTADLLTWSEVPPPDS---PSSASRSAVRSHQPSDGISKVVFGGQVTD 60

Query: 61  EEVESLNKRKPCSGYKMKEMTGSGIFVGNEGDDEQESGSANPSQSKTGIRMYQQTLAGIS 120
           EEVESLN+RKPCS +KMKE+TGSGIF  NE DD  E            + +YQQ + GIS
Sbjct: 61  EEVESLNRRKPCSEHKMKEITGSGIFSRNEKDDASEP-----------LPVYQQAVNGIS 120

Query: 121 HISFGEEGGVSPKKPTTLPEVAKQRELSGNLVSDADEKLKKQLSDAKYQELSGHGIFAPP 180
            ISFGEE  +SPKKP T+PEVAKQRELSG + +++  KL+KQLSDAKY+E+SG  IFAPP
Sbjct: 121 QISFGEEENLSPKKPATVPEVAKQRELSGTMENESANKLQKQLSDAKYKEISGQNIFAPP 180

Query: 181 PEILPRPTTARTLDLKGSIEIGEPDDRCVIPGEEPSVKTAKKIYDKKFSELSGNDIFKGD 240
           PEI PR  T R L LK +  +G          E+ SVKTAKKIYDKKF+ELSGNDIFKGD
Sbjct: 181 PEIKPRSGTNRALALKDNFNLGAESQTA---EEDSSVKTAKKIYDKKFAELSGNDIFKGD 240

Query: 241 VPPSSTEKPLSMAKLREMSGSDIFADGKVEARDYLGGVRKPPGGESSIALV 292
              S+ EK LS AKL+E+ G++IFADGKVEARDYLGGVRKPPGGE+SIALV
Sbjct: 241 AASSNVEKHLSQAKLKEIGGNNIFADGKVEARDYLGGVRKPPGGETSIALV 274

BLAST of CmaCh05G004960 vs. TAIR 10
Match: AT1G78150.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G35780.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 374.4 bits (960), Expect = 8.3e-104
Identity = 199/291 (68.38%), Postives = 227/291 (78.01%), Query Frame = 0

Query: 1   MDRATPVRKPHTSTADLLTWPELPPADSPAFPSSASRSAPRSHQPSDGISKVVFGGQVTD 60
           M+R+TPVRKPHTSTADLLTW E+PP DS   PSSASRSA RSHQPSDGISKVVFGGQVTD
Sbjct: 1   MERSTPVRKPHTSTADLLTWSEVPPPDS---PSSASRSAVRSHQPSDGISKVVFGGQVTD 60

Query: 61  EEVESLNKRKPCSGYKMKEMTGSGIFVGNEGDDEQESGSANPSQSKTGIRMYQQTLAGIS 120
           EEVESLN+RKPCS +KMKE+TGSGIF  NE DD  E            + +YQQ + GIS
Sbjct: 61  EEVESLNRRKPCSEHKMKEITGSGIFSRNEKDDASEP-----------LPVYQQAVNGIS 120

Query: 121 HISFGEEGGVSPKKPTTLPEVAKQRELSGNLVSDADEKLKKQLSDAKYQELSGHGIFAPP 180
            ISFGEE  +SPKKP T+PEVAKQRELSG + +++  KL+KQLSDAKY+E+SG  IFAPP
Sbjct: 121 QISFGEEENLSPKKPATVPEVAKQRELSGTMENESANKLQKQLSDAKYKEISGQNIFAPP 180

Query: 181 PEILPRPTTARTLDLKGSIEIGEPDDRCVIPGEEPSVKTAKKIYDKKFSELSGNDIFKGD 240
           PEI PR  T R L LK +  +G          E+ SVKTAKKIYDKKF+ELSGNDIFKGD
Sbjct: 181 PEIKPRSGTNRALALKDNFNLGAESQTA---EEDSSVKTAKKIYDKKFAELSGNDIFKGD 240

Query: 241 VPPSSTEKPLSMAKLREMSGSDIFADGKVEARDYLGGVRKPPGGESSIALV 292
              S+ EK LS AKL+E+ G++IFADGKVEARDYLGGVRKPPGGE+SIALV
Sbjct: 241 AASSNVEKHLSQAKLKEIGGNNIFADGKVEARDYLGGVRKPPGGETSIALV 274

BLAST of CmaCh05G004960 vs. TAIR 10
Match: AT1G78150.3 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G35780.1). )

HSP 1 Score: 359.0 bits (920), Expect = 3.6e-99
Identity = 199/320 (62.19%), Postives = 227/320 (70.94%), Query Frame = 0

Query: 1   MDRATPVRKPHTSTADLLTWPELPPADSPAFPSSASRSAPRSHQPSDGISKVVFGGQVTD 60
           M+R+TPVRKPHTSTADLLTW E+PP DS   PSSASRSA RSHQPSDGISKVVFGGQVTD
Sbjct: 1   MERSTPVRKPHTSTADLLTWSEVPPPDS---PSSASRSAVRSHQPSDGISKVVFGGQVTD 60

Query: 61  EEVESLNK-----------------------------RKPCSGYKMKEMTGSGIFVGNEG 120
           EEVESLN+                             RKPCS +KMKE+TGSGIF  NE 
Sbjct: 61  EEVESLNRRILDDAFDSFMRLVIYTNVKTCENVYDVIRKPCSEHKMKEITGSGIFSRNEK 120

Query: 121 DDEQESGSANPSQSKTGIRMYQQTLAGISHISFGEEGGVSPKKPTTLPEVAKQRELSGNL 180
           DD  E            + +YQQ + GIS ISFGEE  +SPKKP T+PEVAKQRELSG +
Sbjct: 121 DDASEP-----------LPVYQQAVNGISQISFGEEENLSPKKPATVPEVAKQRELSGTM 180

Query: 181 VSDADEKLKKQLSDAKYQELSGHGIFAPPPEILPRPTTARTLDLKGSIEIGEPDDRCVIP 240
            +++  KL+KQLSDAKY+E+SG  IFAPPPEI PR  T R L LK +  +G         
Sbjct: 181 ENESANKLQKQLSDAKYKEISGQNIFAPPPEIKPRSGTNRALALKDNFNLGAESQTA--- 240

Query: 241 GEEPSVKTAKKIYDKKFSELSGNDIFKGDVPPSSTEKPLSMAKLREMSGSDIFADGKVEA 292
            E+ SVKTAKKIYDKKF+ELSGNDIFKGD   S+ EK LS AKL+E+ G++IFADGKVEA
Sbjct: 241 EEDSSVKTAKKIYDKKFAELSGNDIFKGDAASSNVEKHLSQAKLKEIGGNNIFADGKVEA 300

BLAST of CmaCh05G004960 vs. TAIR 10
Match: AT1G35780.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G78150.2); Has 145 Blast hits to 144 proteins in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 145; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 340.9 bits (873), Expect = 1.0e-93
Identity = 189/294 (64.29%), Postives = 222/294 (75.51%), Query Frame = 0

Query: 1   MDRATPVRKPHTSTADLLTWPELPPADSPAFPSSASRSAPRSHQPSDGISKVVFGGQVTD 60
           M++ TPVRKPH STADLLTWPE  P +SPA  + +SRSA RSHQPSDGISKVVFGGQVTD
Sbjct: 1   MEKNTPVRKPHMSTADLLTWPENQPFESPA--AVSSRSAARSHQPSDGISKVVFGGQVTD 60

Query: 61  EEVESLNKRKPCSGYKMKEMTGSGIFVGNEGDDEQESGSANPSQSKTGIRMYQQTLAGIS 120
           EEVESLNKRKPCS YKMKE+TGSGIF   E +D+ E  SAN + +       Q   A +S
Sbjct: 61  EEVESLNKRKPCSNYKMKEITGSGIFSVYEENDDSELASANSATNGKSRTFQQPPAAIMS 120

Query: 121 HISFGEEGGVSPKKPTTLPEVAKQRELSGNLVSDADEKLKKQLSDAKYQELSGHGIFAPP 180
           HISFGEE  V+PKKP T+PEVAKQRELSG L   +D KL KQ SDAK +ELSGH IFAPP
Sbjct: 121 HISFGEEEIVTPKKPATVPEVAKQRELSGTLEYQSDAKLNKQFSDAKCKELSGHNIFAPP 180

Query: 181 PEILPRPTTARTLDLKGSIEIGEPDDRCVIPGEEPSVKTAKKIYDKKFSELSGNDIFKGD 240
           PEI  RP T R L  K + ++GE D +      +  +KTAKKI D+KF++LSGN++FK D
Sbjct: 181 PEIKLRP-TVRALAYKDNFDLGESDTK-----PDGELKTAKKIADRKFTDLSGNNVFKSD 240

Query: 241 V--PPSST-EKPLSMAKLREMSGSDIFADGKVEARDYLGGVRKPPGGESSIALV 292
           V  P S+T E+ LS AKL+E+SG+DIFAD K ++RDY GGVRKPPGGESSIALV
Sbjct: 241 VSSPSSATAERLLSTAKLKEISGNDIFADAKAQSRDYFGGVRKPPGGESSIALV 286

BLAST of CmaCh05G004960 vs. TAIR 10
Match: AT4G39860.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G22270.1); Has 152 Blast hits to 146 proteins in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 2; Plants - 146; Viruses - 0; Other Eukaryotes - 4 (source: NCBI BLink). )

HSP 1 Score: 339.7 bits (870), Expect = 2.3e-93
Identity = 181/307 (58.96%), Postives = 219/307 (71.34%), Query Frame = 0

Query: 1   MDRATPVRKPHTSTADLLTWPELPPADSPAFPSSASRSAPRSHQPSDGISKVVFGGQVTD 60
           M+R TPVR PHTSTADLL+W E PP      P  ++ SA RSHQPSDGISK++ GGQ+TD
Sbjct: 1   MERNTPVRNPHTSTADLLSWSETPPP-----PHHSTPSAARSHQPSDGISKILGGGQITD 60

Query: 61  EEVESLNKRKPCSGYKMKEMTGSGIFVGNEGDDEQESGSANPSQSKTGIRMYQQTLAGIS 120
           EE +SLNK K CSGYK+KEMTGSGIF        +   + +P   KTG+R YQQTL G+S
Sbjct: 61  EEAQSLNKLKNCSGYKLKEMTGSGIFTDKGKVGSESDATTDP---KTGLRYYQQTLNGMS 120

Query: 121 HISFGEEGGVSPKKPTTLPEVAKQRELSGNLVSDADEKLKKQLSDAKYQELSGHGIFAPP 180
            ISF  +G VSPKKPTTL EVAKQRELSGNL+++AD K  KQ+S AK +E+SGH IFAPP
Sbjct: 121 QISFSADGNVSPKKPTTLTEVAKQRELSGNLLTEADLKSNKQISSAKIEEISGHDIFAPP 180

Query: 181 PEILPRPTTARTLDLKGSIEIGEPDDR----------------CVIPGEEPSVKTAKKIY 240
            EI PR   A   + +G+ ++GEP  R                 ++  EEP VKT+KKI+
Sbjct: 181 SEIQPRSLVAAQQEARGNRDMGEPAPRNLRTSVKVSNPAGGQSNILFSEEPVVKTSKKIH 240

Query: 241 DKKFSELSGNDIFKGDVPPSSTEKPLSMAKLREMSGSDIFADGKVEARDYLGGVRKPPGG 292
           ++KF EL+GN IFKGD  P S +K LS AKLREMSG++IFADGK E+RDY GGVRKPPGG
Sbjct: 241 NQKFQELTGNGIFKGDESPGSADKQLSSAKLREMSGNNIFADGKSESRDYFGGVRKPPGG 299

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9SIE07.8e-0654.24DNA oxidative demethylase ALKBH2 OS=Arabidopsis thaliana OX=3702 GN=ALKBH2 PE=2 ... [more]
Match NameE-valueIdentityDescription
AT1G78150.18.3e-10468.38unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT1G78150.28.3e-10468.38unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT1G78150.33.6e-9962.19unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT1G35780.11.0e-9364.29unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT4G39860.12.3e-9358.96unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR025131Domain of unknown function DUF4057PFAMPF13266DUF4057coord: 3..289
e-value: 3.5E-142
score: 473.5
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..109
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 31..45
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 94..109
NoneNo IPR availablePANTHERPTHR31132N-LYSINE METHYLTRANSFERASEcoord: 1..291
NoneNo IPR availablePANTHERPTHR31132:SF2HEMATOLOGICAL/NEUROLOGICAL-LIKE PROTEINcoord: 1..291

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmaCh05G004960.1CmaCh05G004960.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006417 regulation of translation
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0003723 RNA binding