CmaCh11G005440 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh11G005440
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: DUF4050 family protein
Location: Cma_Chr11: 2614564 .. 2617084 (-)
RNA-Seq Expression: CmaCh11G005440
Synteny: CmaCh11G005440
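
The location field can be read as a sequence ID, a start, an end, and a strand. The sketch below parses that string and computes the span length. It assumes the coordinates are 1-based and inclusive (the usual GFF3 convention, which this page does not state explicitly). The function names are illustrative and not part of any database API. Because the gene lies on the minus strand, the displayed sequence would be the reverse complement of the reference strand, which the last helper illustrates.

```python
import re

def parse_location(text):
    """Parse a string like 'Cma_Chr11: 2614564 .. 2617084 (-)'.

    Returns (seqid, start, end, strand). Coordinates are assumed to be
    1-based and inclusive; that is an assumption, not stated on the page.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    """Length of an inclusive 1-based interval."""
    return end - start + 1

_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")

def reverse_complement(seq):
    """Reverse-complement a DNA string (used for minus-strand features)."""
    return seq.translate(_COMPLEMENT)[::-1]

seqid, start, end, strand = parse_location("Cma_Chr11: 2614564 .. 2617084 (-)")
print(seqid, span_length(start, end), strand)
```

Under the inclusive-coordinate assumption, the gene spans 2617084 - 2614564 + 1 = 2521 bp.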
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCCATCTCTTCTCTGAATTTCTCGGAAACAACCCACCAATTTTCTATTTTGGTATTTAGTTTCCACTCCCCACTCGACTTTCCCCTTTTCCCCTCATCCAACTTCCTGCTTTCTTGCCCCAACTCTACTCCATCTCTCGCCAGTTACAGGTTCTTAGCTTCACTCTCGTTTTCCCTGAATCGATGCCCTTCTCTGCGCTTTTGGCTTAATATCAACATGGTTATGCTTAATAGTTCCTTCGCCGCCTGGATCAGCCGCTTGTTCGCTTGCATGGGGTAAGTTCTCGATTTTCCTCATCTATCGTTCATTGCGGTCTTGTAAAGGAGTTGTTATATTTGCTTCTAATGGAGTTTTGTGATGACCCTTTTGTGTAATCCATGGAATTGCTTCTGTTTTCTGGCTTCCTGAATAGTGGGCCATCTCCTTTTTCCTTCTTTTGCCAAGAATTTCACTATTTCAACTGTAATGTTTAATCCAAACGAAGCAAGGGGCATGGGGGAAGTTGTGGGATTTCATGTTCTTTTGGATTGCTGGTATTAATGTGAATTCCTGAAGTAACTATGAACGAAACTAAGTTTATTAAGATTCAAAGTTCGAAACCATTCCACATTCTTGGTAATGTTTTTGCTGCAATTTTATAGCATATATAGTGCTTTATTAGGTTTATAACAGTGCAGGGGTTGTTTTGGATGCTGCACTAAACCAACACCTATTATTGCTGTGGATGAGCCATCTAAGGGATTAAGAATTCAAGGACGAGTCGTTAAGAAACGTAGCATATCTGACGGTTTTTGGAGCACAAGCACATGTGATTTGGATAATAGCACCATTCAATCTCAACCAAGCATCTCCTCTATCAGTACATCAAACCTCACGCTCACTCATAGCAATGTTGGTGCCAGTGTGAGCAACCCTTCTGAATTTGTAAACCATGGCAAGTTTCCCTTCTGAATTCGTAAACCACGGCGGTTAGATCGCAATTGCTCATTCAAACCTGCTCAACATAGCAACAAACCATTTGCTCATTTTCTTTGCTTTCAGGTCTTCTTCTCTGGAATCAGAACAGGCTGCAGTGGATTGGTAATAGTAGTAGCAGCAAGACAACAGATCAAACTCAACTAAAACGGAAGGCAAAAATCAGGTCAGTTCACCAAATGTCTCTGCATTCCAAAATTGCTGCCACCGTTTGTAAGCTCTCGTTAAGTGCTATCTTGTGAGATCCCACATCGGTTAGAGAGTGGAACGAAGCATTCCTTATAAGGGAGTGGAAACTCCTCCCAAATATACGCATTTTAAAACTGTGAGGTTGACAACAATACATAACGGGCCAAAGTGGATAATATCTACTAGCGGTGGCCTTGGGTTGTTACAAATGATATCAGAGCCAGACACCGGACGGTGTGCTAGCAAGGACGCTGGGCTCCCAAGGAAGGTGGATTGTGAGATCCCACATCAGTTGGAAAGGGGAACGAAACATTCCTTACAAGGGTGTGGAAACCTCTCCCAAATAGACGCGTTTTAAAACCGTGAGGTTGACAGCGATACGTACCGGGCCAATGCAGACAATATCTACTAGCAATAGGCTTAGACCGTTACTGTAGTGATACAAAAATCTAGAACTAAAATTTTTGGTGGAGATCCTATTAAATCTTCATTCGATGCGTGTATATATGCATTTGCTTGCAGTTGGCGTGCAACATATGACAGTTTACTGAGTACAAGACAATGTTTTCCCCATCCAATTCCTCTGGCTGTAAGTTTGGTGTTGCAAGTATTGTCATTTCTTTCTCCATTGGTTTCATTGTAAGACTAAGAACTTTAAGAACTTCATTTTAACAGGAAATGGTGAAGTTTCTTGTGGAAGTATGGGAACAGGAGGGCCTATATGATTGAGAATGCTTCGTTTATTTTGGATATATTCCTTGAATCCTTGGGAGGAAGCTTCTTAGTACAGATTTCCAAAAAGAGAATAAAGGCTGTTTTTCTTCTTCTTCTTA
CCTTTATCAATCCTCCATTCATCCTGCATTTTTCAGCTGTACAAATTTATGAACACCAGAAAAATCCATTTTCTGTGTATTCTTCTTCCTTTTTCTTCACTGTTCATATTATGCATAAAAACTCGAAAATCTCCATCACTGCTGCTCAAAGTAAACTTTGTAATAATGTGAATTTACATGTATAGTTTGGTGACTTTGATGGGATTTTTTGTGGGTACTATTCTATGTTTCTGTATGGTATGTTATTGGTTGGTATGAGATAGAGATGGATTCTTATATTATAACGGGATTTAAGTAGCCATCTTCTTTTCTCCCATGCCCCTAACAAGACGAAGAATCTCCATTTAAGTGGGGAACATGGAGGGAGCGGAGGAAAAGAAATAGAAAAGCATGTGATCTATATCAAGTTCTTCTGTATCAACTCGACCCAAATCACTCACTAAAAAGATTCATTTTTTAGTTTTTGATACACAAAACATCTAGATAGCTTACATTATTGAATATATGCTATTCACATCGGT

mRNA sequence

ATGCCCATCTCTTCTCTGAATTTCTCGGAAACAACCCACCAATTTTCTATTTTGGTATTTAGTTTCCACTCCCCACTCGACTTTCCCCTTTTCCCCTCATCCAACTTCCTGCTTTCTTGCCCCAACTCTACTCCATCTCTCGCCAGTTACAGGTTCTTAGCTTCACTCTCGTTTTCCCTGAATCGATGCCCTTCTCTGCGCTTTTGGCTTAATATCAACATGGTTATGCTTAATAGTTCCTTCGCCGCCTGGATCAGCCGCTTGTTCGCTTGCATGGGGGGTTGTTTTGGATGCTGCACTAAACCAACACCTATTATTGCTGTGGATGAGCCATCTAAGGGATTAAGAATTCAAGGACGAGTCGTTAAGAAACGTAGCATATCTGACGGTTTTTGGAGCACAAGCACATGTGATTTGGATAATAGCACCATTCAATCTCAACCAAGCATCTCCTCTATCAGTACATCAAACCTCACGCTCACTCATAGCAATGTTGGTGCCAGTGTGAGCAACCCTTCTGAATTTGTAAACCATGGTCTTCTTCTCTGGAATCAGAACAGGCTGCAGTGGATTGGTAATAGTAGTAGCAGCAAGACAACAGATCAAACTCAACTAAAACGGAAGGCAAAAATCAGTTGGCGTGCAACATATGACAGTTTACTGAGTACAAGACAATGTTTTCCCCATCCAATTCCTCTGGCTGAAATGGTGAAGTTTCTTGTGGAAGTATGGGAACAGGAGGGCCTATATGATTGAGAATGCTTCGTTTATTTTGGATATATTCCTTGAATCCTTGGGAGGAAGCTTCTTAGTACAGATTTCCAAAAAGAGAATAAAGGCTGTTTTTCTTCTTCTTCTTACCTTTATCAATCCTCCATTCATCCTGCATTTTTCAGCTGTACAAATTTATGAACACCAGAAAAATCCATTTTCTGTGTATTCTTCTTCCTTTTTCTTCACTGTTCATATTATGCATAAAAACTCGAAAATCTCCATCACTGCTGCTCAAAGTAAACTTTGTAATAATGTGAATTTACATGTATAGTTTGGTGACTTTGATGGGATTTTTTGTGGGTACTATTCTATGTTTCTGTATGGTATGTTATTGGTTGGTATGAGATAGAGATGGATTCTTATATTATAACGGGATTTAAGTAGCCATCTTCTTTTCTCCCATGCCCCTAACAAGACGAAGAATCTCCATTTAAGTGGGGAACATGGAGGGAGCGGAGGAAAAGAAATAGAAAAGCATGTGATCTATATCAAGTTCTTCTGTATCAACTCGACCCAAATCACTCACTAAAAAGATTCATTTTTTAGTTTTTGATACACAAAACATCTAGATAGCTTACATTATTGAATATATGCTATTCACATCGGT

Coding sequence (CDS)

ATGCCCATCTCTTCTCTGAATTTCTCGGAAACAACCCACCAATTTTCTATTTTGGTATTTAGTTTCCACTCCCCACTCGACTTTCCCCTTTTCCCCTCATCCAACTTCCTGCTTTCTTGCCCCAACTCTACTCCATCTCTCGCCAGTTACAGGTTCTTAGCTTCACTCTCGTTTTCCCTGAATCGATGCCCTTCTCTGCGCTTTTGGCTTAATATCAACATGGTTATGCTTAATAGTTCCTTCGCCGCCTGGATCAGCCGCTTGTTCGCTTGCATGGGGGGTTGTTTTGGATGCTGCACTAAACCAACACCTATTATTGCTGTGGATGAGCCATCTAAGGGATTAAGAATTCAAGGACGAGTCGTTAAGAAACGTAGCATATCTGACGGTTTTTGGAGCACAAGCACATGTGATTTGGATAATAGCACCATTCAATCTCAACCAAGCATCTCCTCTATCAGTACATCAAACCTCACGCTCACTCATAGCAATGTTGGTGCCAGTGTGAGCAACCCTTCTGAATTTGTAAACCATGGTCTTCTTCTCTGGAATCAGAACAGGCTGCAGTGGATTGGTAATAGTAGTAGCAGCAAGACAACAGATCAAACTCAACTAAAACGGAAGGCAAAAATCAGTTGGCGTGCAACATATGACAGTTTACTGAGTACAAGACAATGTTTTCCCCATCCAATTCCTCTGGCTGAAATGGTGAAGTTTCTTGTGGAAGTATGGGAACAGGAGGGCCTATATGATTGA

Protein sequence

MPISSLNFSETTHQFSILVFSFHSPLDFPLFPSSNFLLSCPNSTPSLASYRFLASLSFSLNRCPSLRFWLNINMVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKRSISDGFWSTSTCDLDNSTIQSQPSISSISTSNLTLTHSNVGASVSNPSEFVNHGLLLWNQNRLQWIGNSSSSKTTDQTQLKRKAKISWRATYDSLLSTRQCFPHPIPLAEMVKFLVEVWEQEGLYD
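
The relationship between the CDS and the protein sequence can be checked by translating codons. The sketch below assumes the standard genetic code (the page does not name a translation table) and uses only the first ten codons and the last three codons of the CDS listed above. `translate` is an illustrative helper, not a function of any particular library.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons of the CDS shown above.
print(translate("ATGCCCATCTCTTCTCTGAATTTCTCGGAA"))  # MPISSLNFSE
# Last three codons of the CDS (TAT GAT TGA): two residues, then a stop.
print(translate("TATGATTGA"))  # YD
```

Both fragments agree with the start (MPISSLNFSE) and end (YD) of the protein sequence above.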
Homology
BLAST of CmaCh11G005440 vs. TAIR 10
Match: AT5G25360.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G32342.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 218.8 bits (556), Expect = 5.0e-57
Identity = 110/175 (62.86%), Positives = 128/175 (73.14%), Query Frame = 0

Query: 77  LNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKRSISDGFWSTST 136
           L     +WI +LF CMGGCFGCC KP  I+AVDEPSKGLRIQGR+VKK S+S+ FWSTST
Sbjct: 3   LREIIPSWIYQLFGCMGGCFGCCNKPPLIVAVDEPSKGLRIQGRLVKKPSVSEDFWSTST 62

Query: 137 CDLDNSTIQSQPSISSISTSNLTLTHSNVGASVSNPSEFVNHGLLLWNQNRLQWIGNSSS 196
           C++DNST+QSQ S+SSIS +N T T     AS SNP+EFVNHGL LWNQ R QW+ N +S
Sbjct: 63  CEMDNSTLQSQRSMSSISFTNNTST----SASTSNPTEFVNHGLNLWNQTRQQWLANGTS 122

Query: 197 SKTTDQTQLKRKAKISWRATYDSLLSTRQCFPHPIPLAEMVKFLVEVWEQEGLYD 252
            K        R+  ISW ATY+SLL   + F  PIPL EMV FLV+VWEQEGLYD
Sbjct: 123 QKKAK----VREPTISWNATYESLLGMNKRFSRPIPLPEMVDFLVDVWEQEGLYD 169
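
The percentages in each HSP header are the match counts divided by the alignment length (alignment columns, gaps included). The sketch below recomputes them from the counts reported on this page. The dictionary layout and the `percent` helper are illustrative only.

```python
def percent(matches, aligned_length):
    """Percentage of alignment columns that match, rounded to two decimals."""
    return round(100.0 * matches / aligned_length, 2)

# (identities, positives, alignment length) as reported in the HSP headers.
hsps = {
    "AT5G25360.1": (110, 128, 175),
    "AT4G32342.1": (87, 101, 159),
    "AT1G15350.2": (79, 98, 163),
}

for name, (identities, positives, length) in hsps.items():
    print(name, percent(identities, length), percent(positives, length))
```

For the first hit this gives 62.86 and 73.14, matching the header values.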

BLAST of CmaCh11G005440 vs. TAIR 10
Match: AT5G25360.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G32342.1). )

HSP 1 Score: 218.8 bits (556), Expect = 5.0e-57
Identity = 110/175 (62.86%), Positives = 128/175 (73.14%), Query Frame = 0

Query: 77  LNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKRSISDGFWSTST 136
           L     +WI +LF CMGGCFGCC KP  I+AVDEPSKGLRIQGR+VKK S+S+ FWSTST
Sbjct: 3   LREIIPSWIYQLFGCMGGCFGCCNKPPLIVAVDEPSKGLRIQGRLVKKPSVSEDFWSTST 62

Query: 137 CDLDNSTIQSQPSISSISTSNLTLTHSNVGASVSNPSEFVNHGLLLWNQNRLQWIGNSSS 196
           C++DNST+QSQ S+SSIS +N T T     AS SNP+EFVNHGL LWNQ R QW+ N +S
Sbjct: 63  CEMDNSTLQSQRSMSSISFTNNTST----SASTSNPTEFVNHGLNLWNQTRQQWLANGTS 122

Query: 197 SKTTDQTQLKRKAKISWRATYDSLLSTRQCFPHPIPLAEMVKFLVEVWEQEGLYD 252
            K        R+  ISW ATY+SLL   + F  PIPL EMV FLV+VWEQEGLYD
Sbjct: 123 QKKAK----VREPTISWNATYESLLGMNKRFSRPIPLPEMVDFLVDVWEQEGLYD 169

BLAST of CmaCh11G005440 vs. TAIR 10
Match: AT4G32342.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G25360.2); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 154.8 bits (390), Expect = 8.9e-38
Identity = 87/159 (54.72%), Positives = 101/159 (63.52%), Query Frame = 0

Query: 95  CFGCCTKPTP-IIAVDEPSKGLRIQGRVVKKRSI-SDGFWSTSTCDLD-NSTIQSQPSIS 154
           CFGCC +    ++ VDEPSKGL+IQG++VKK S  SD FWSTSTCD+D N TIQSQ S  
Sbjct: 17  CFGCCNRERRLVVEVDEPSKGLKIQGKIVKKDSASSDDFWSTSTCDMDHNITIQSQSSNP 76

Query: 155 SISTSNLTLTHSNVGASVSNPSEFVNHGLLLWNQNRLQWIGNSSSSKTTDQTQLKRKAKI 214
                           S SN +EFVNHGL+LWN  R QW         T Q  L  +  I
Sbjct: 77  PFDPQ----------CSTSNSTEFVNHGLILWNHTRQQW-----RECLTRQQCLVPEPAI 136

Query: 215 SWRATYDSLLSTRQCFPHPIPLAEMVKFLVEVWEQEGLY 251
           SW +TYDSLLST + FP PIPL EMV FLV+VWE+EGLY
Sbjct: 137 SWNSTYDSLLSTNKLFPQPIPLKEMVHFLVDVWEEEGLY 160

BLAST of CmaCh11G005440 vs. TAIR 10
Match: AT1G15350.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G15770.2); Has 148 Blast hits to 148 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 141; Viruses - 0; Other Eukaryotes - 7 (source: NCBI BLink). )

HSP 1 Score: 142.1 bits (357), Expect = 6.0e-34
Identity = 79/163 (48.47%), Positives = 98/163 (60.12%), Query Frame = 0

Query: 92  MGGCFGCCT--KPTPIIAVDEPSKGLRIQGRVVKKRSISDGFWSTSTCDLDNSTIQSQPS 151
           MGGC GC    + T     D PS  +    R  KK S+S+ FWSTST D+DN T  SQ S
Sbjct: 1   MGGCVGCYREHRSTAASLKDPPSNSI---ARPCKKPSVSEDFWSTSTVDMDNITFPSQGS 60

Query: 152 ISSISTSNLTLTHSNVGASVSNPSEFVNHGLLLWNQNRLQWIGNSSSSKTTDQTQLKRKA 211
           +SS   SN T    +   + + P E+VN GLLLWNQ R +W+G    +   D  Q    A
Sbjct: 61  LSS---SNQTFDSQSAARNSNAPPEYVNQGLLLWNQTRERWVGKDKPNNPVDHNQ---GA 120

Query: 212 KISWR-ATYDSLLSTRQCFPHPIPLAEMVKFLVEVWEQEGLYD 252
           K++W  ATYDSLL + + FP PIPL EMV FLV++WEQEGLYD
Sbjct: 121 KLNWNTATYDSLLGSNKLFPQPIPLTEMVDFLVDIWEQEGLYD 154

BLAST of CmaCh11G005440 vs. TAIR 10
Match: AT1G15350.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G15770.2); Has 148 Blast hits to 148 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 141; Viruses - 0; Other Eukaryotes - 7 (source: NCBI BLink). )

HSP 1 Score: 142.1 bits (357), Expect = 6.0e-34
Identity = 79/163 (48.47%), Positives = 98/163 (60.12%), Query Frame = 0

Query: 92  MGGCFGCCT--KPTPIIAVDEPSKGLRIQGRVVKKRSISDGFWSTSTCDLDNSTIQSQPS 151
           MGGC GC    + T     D PS  +    R  KK S+S+ FWSTST D+DN T  SQ S
Sbjct: 1   MGGCVGCYREHRSTAASLKDPPSNSI---ARPCKKPSVSEDFWSTSTVDMDNITFPSQGS 60

Query: 152 ISSISTSNLTLTHSNVGASVSNPSEFVNHGLLLWNQNRLQWIGNSSSSKTTDQTQLKRKA 211
           +SS   SN T    +   + + P E+VN GLLLWNQ R +W+G    +   D  Q    A
Sbjct: 61  LSS---SNQTFDSQSAARNSNAPPEYVNQGLLLWNQTRERWVGKDKPNNPVDHNQ---GA 120

Query: 212 KISWR-ATYDSLLSTRQCFPHPIPLAEMVKFLVEVWEQEGLYD 252
           K++W  ATYDSLL + + FP PIPL EMV FLV++WEQEGLYD
Sbjct: 121 KLNWNTATYDSLLGSNKLFPQPIPLTEMVDFLVDIWEQEGLYD 154

The following BLAST results are available for this feature:
Match Name    E-value   Identity (%)  Description
AT5G25360.1   5.0e-57   62.86         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G25360.2   5.0e-57   62.86         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT4G32342.1   8.9e-38   54.72         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G15350.2   6.0e-34   48.47         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G15350.1   6.0e-34   48.47         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term   IPR Description                     Source   Source Term     Source Description      Alignment
IPR025124  Domain of unknown function DUF4050  PFAM     PF13259         DUF4050                 coord: 210..251; e-value: 3.9E-12; score: 46.9
IPR025124  Domain of unknown function DUF4050  PFAM     PF13259         DUF4050                 coord: 136..204; e-value: 3.6E-9; score: 37.2
None       No IPR available                    PANTHER  PTHR33373       OS07G0479600 PROTEIN    coord: 74..251
None       No IPR available                    PANTHER  PTHR33373:SF13  DUF4050 FAMILY PROTEIN  coord: 74..251
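
The `coord` values give the extent of each match on the protein. The sketch below converts such coordinates into match lengths and Python slices. It assumes the coordinates are 1-based and inclusive protein positions (the page does not state the convention). The helper names are hypothetical, and the short test string stands in for a real protein sequence.

```python
def domain_length(start, end):
    """Number of residues covered by an inclusive 1-based interval."""
    return end - start + 1

def extract_domain(protein, start, end):
    """Return the residues from start to end (1-based, inclusive)."""
    if not 1 <= start <= end <= len(protein):
        raise ValueError("coordinates fall outside the protein")
    return protein[start - 1:end]  # shift to Python's 0-based, half-open slice

# Coordinates of the two PF13259 matches and the PANTHER match listed above.
hits = [("PF13259", 210, 251), ("PF13259", 136, 204), ("PTHR33373", 74, 251)]
for term, start, end in hits:
    print(f"{term} {start}..{end}: {domain_length(start, end)} residues")

# Toy sequence to show the slicing convention.
print(extract_domain("ABCDEFGHIJ", 3, 5))  # CDE
```

Under this convention the two Pfam matches cover 42 and 69 residues, and the PANTHER match covers 178.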

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh11G005440.1  CmaCh11G005440.1  mRNA