CmoCh20G010070 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh20G010070
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: Plant protein of unknown function (DUF863)
Location: Cmo_Chr20: 6156488 .. 6157341 (-)
RNA-Seq Expression: CmoCh20G010070
Synteny: CmoCh20G010070
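
The Location field above can be turned into a structured record. The following is a minimal sketch, assuming the `chromosome: start .. end (strand)` layout shown in the overview and assuming the coordinates are 1-based and inclusive; the function name `parse_location` is hypothetical and not part of the database.

```python
import re

def parse_location(text):
    """Parse a 'Chrom: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    # Assumption: coordinates are 1-based and inclusive, so the span
    # covers end - start + 1 bases.
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Cmo_Chr20: 6156488 .. 6157341 (-)")
print(loc["strand"], loc["length"])  # - 854
```

Under that assumption the gene spans 854 bp on the minus strand of Cmo_Chr20.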
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATTCAATCCAGTGTCGTTTTGAGTTGGTGTTCTGTTTTTAGGCAAAAATCTGAAGATCTACTTGTTCCATGATCCATTTGTAAGGCCCCCAACATTCTCGGAACCAGAAAAACAGCAAGAAATTTCTGAGCATTCAAAATTCTCCTCTCTCTCTCTCTCTCGTAGATCTCCCATGGCGGACGAATCTTACTGTAAACACAATTTGAAGATGGCAATGTTAAACCATGAACACACATTTCGTCACCAAGTACCTTTTCTTCTTCTTCTTCCTCTTTCTCCACTTCCCTTTGTCTCGTCTTTCCTTTTGTTTAACAGCGGCCAGAGGCTTTTTTTGTTGCAGGTTTATGAGCTCCACCGATTGTACAGAATTCAGAAGGTTTTGATGAAGAACATAGCTGAAAACAGGGGAAAAACAGAGGAGGAAGAAGAAGAAGAAGGTGAAGAGATTGAGTTTATTCAGGAAAGTGAGATTGAATTGACATTGGGGCCTTCCAGTTATAGTAATCAAGTTGGAGGGAGGAGAAGGAGGAGAAAGAAATTGGGTGAATTAATGAAAGGAGGTTCCGATTCTTCCTCCTCCTCGACCACTGGATCTGCGCCTAAAGGTGGAATTCTTTATCGAGATGGCGAGTTTCTGGGATTTCTCGAGCTTCAAGATGAAACCCGGCTTGCTCAACACCATCCAAATCCATGGCTTTGTCAAGTTGTAAGCCTTAATCTCACTTGATTTTCATTTTTTATGTTTATTGATTGATTTTGTTTTTCTAGTTTTTATAATTCCCTTGTCATTGGCTGATTGAGAGCCTTATCTGTGTACAAGTTTATAACCGAGTTAATATCATTATCACCCGCCC

mRNA sequence

ATTCAATCCAGTGTCGTTTTGAGTTGGTGTTCTGTTTTTAGGCAAAAATCTGAAGATCTACTTGTTCCATGATCCATTTGTAAGGCCCCCAACATTCTCGGAACCAGAAAAACAGCAAGAAATTTCTGAGCATTCAAAATTCTCCTCTCTCTCTCTCTCTCGTAGATCTCCCATGGCGGACGAATCTTACTGTAAACACAATTTGAAGATGGCAATGTTAAACCATGAACACACATTTCGTCACCAAGTTTATGAGCTCCACCGATTGTACAGAATTCAGAAGGTTTTGATGAAGAACATAGCTGAAAACAGGGGAAAAACAGAGGAGGAAGAAGAAGAAGAAGGTGAAGAGATTGAGTTTATTCAGGAAAGTGAGATTGAATTGACATTGGGGCCTTCCAGTTATAGTAATCAAGTTGGAGGGAGGAGAAGGAGGAGAAAGAAATTGGGTGAATTAATGAAAGGAGGTTCCGATTCTTCCTCCTCCTCGACCACTGGATCTGCGCCTAAAGGTGGAATTCTTTATCGAGATGGCGAGTTTCTGGGATTTCTCGAGCTTCAAGATGAAACCCGGCTTGCTCAACACCATCCAAATCCATGGCTTTGTCAAGTTGTAAGCCTTAATCTCACTTGATTTTCATTTTTTATGTTTATTGATTGATTTTGTTTTTCTAGTTTTTATAATTCCCTTGTCATTGGCTGATTGAGAGCCTTATCTGTGTACAAGTTTATAACCGAGTTAATATCATTATCACCCGCCC

Coding sequence (CDS)

ATGGCGGACGAATCTTACTGTAAACACAATTTGAAGATGGCAATGTTAAACCATGAACACACATTTCGTCACCAAGTTTATGAGCTCCACCGATTGTACAGAATTCAGAAGGTTTTGATGAAGAACATAGCTGAAAACAGGGGAAAAACAGAGGAGGAAGAAGAAGAAGAAGGTGAAGAGATTGAGTTTATTCAGGAAAGTGAGATTGAATTGACATTGGGGCCTTCCAGTTATAGTAATCAAGTTGGAGGGAGGAGAAGGAGGAGAAAGAAATTGGGTGAATTAATGAAAGGAGGTTCCGATTCTTCCTCCTCCTCGACCACTGGATCTGCGCCTAAAGGTGGAATTCTTTATCGAGATGGCGAGTTTCTGGGATTTCTCGAGCTTCAAGATGAAACCCGGCTTGCTCAACACCATCCAAATCCATGGCTTTGTCAAGTTGTAAGCCTTAATCTCACTTGA

Protein sequence

MADESYCKHNLKMAMLNHEHTFRHQVYELHRLYRIQKVLMKNIAENRGKTEEEEEEEGEEIEFIQESEIELTLGPSSYSNQVGGRRRRRKKLGELMKGGSDSSSSSTTGSAPKGGILYRDGEFLGFLELQDETRLAQHHPNPWLCQVVSLNLT
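
The protein sequence is the translation of the coding sequence listed above it. As a minimal sketch, assuming the standard genetic code (the helper name `translate` is illustrative only), the relationship can be checked on the first 36 bases and the last 9 bases of the CDS:

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)

# First 36 bases of the CDS, then its last 9 bases (ending in the stop codon).
print(translate("ATGGCGGACGAATCTTACTGTAAACACAATTTGAAG"))  # MADESYCKHNLK
print(translate("CTCACTTGA"))                             # LT
```

Passing the full CDS string to `translate` in the same way should reproduce the protein sequence shown above.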
Homology
BLAST of CmoCh20G010070 vs. ExPASy TrEMBL
Match: A0A6J1EXP9 (uncharacterized protein LOC111439473 OS=Cucurbita moschata OX=3662 GN=LOC111439473 PE=4 SV=1)

HSP 1 Score: 303.5 bits (776), Expect = 4.9e-79
Identity = 153/153 (100.00%), Positives = 153/153 (100.00%), Query Frame = 0

Query: 1   MADESYCKHNLKMAMLNHEHTFRHQVYELHRLYRIQKVLMKNIAENRGKTEEEEEEEGEE 60
           MADESYCKHNLKMAMLNHEHTFRHQVYELHRLYRIQKVLMKNIAENRGKTEEEEEEEGEE
Sbjct: 1   MADESYCKHNLKMAMLNHEHTFRHQVYELHRLYRIQKVLMKNIAENRGKTEEEEEEEGEE 60

Query: 61  IEFIQESEIELTLGPSSYSNQVGGRRRRRKKLGELMKGGSDSSSSSTTGSAPKGGILYRD 120
           IEFIQESEIELTLGPSSYSNQVGGRRRRRKKLGELMKGGSDSSSSSTTGSAPKGGILYRD
Sbjct: 61  IEFIQESEIELTLGPSSYSNQVGGRRRRRKKLGELMKGGSDSSSSSTTGSAPKGGILYRD 120

Query: 121 GEFLGFLELQDETRLAQHHPNPWLCQVVSLNLT 154
           GEFLGFLELQDETRLAQHHPNPWLCQVVSLNLT
Sbjct: 121 GEFLGFLELQDETRLAQHHPNPWLCQVVSLNLT 153
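
Each hit in this section starts with two summary lines (score and E-value, then identity and positives). A minimal sketch of how those lines could be parsed is shown below; it assumes the exact layout used on this page, and the function name `parse_hsp_summary` is hypothetical. The pattern accepts both "Positives" and the "Postives" spelling that some exports of this page use.

```python
import re

def parse_hsp_summary(score_line, identity_line):
    """Extract score, E-value, identity and positives from HSP summary lines."""
    score = re.search(
        r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)", score_line
    )
    ident = re.search(r"Identity\s*=\s*(\d+)/(\d+)", identity_line)
    posit = re.search(r"Posi?tives\s*=\s*(\d+)/(\d+)", identity_line)
    if not (score and ident and posit):
        raise ValueError("unrecognized HSP summary lines")
    id_num, id_den = int(ident.group(1)), int(ident.group(2))
    pos_num, pos_den = int(posit.group(1)), int(posit.group(2))
    return {
        "bit_score": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "evalue": float(score.group(3)),
        "identity_pct": round(100 * id_num / id_den, 2),
        "positives_pct": round(100 * pos_num / pos_den, 2),
    }

hsp = parse_hsp_summary(
    "HSP 1 Score: 303.5 bits (776), Expect = 4.9e-79",
    "Identity = 153/153 (100.00%), Positives = 153/153 (100.00%), Query Frame = 0",
)
print(hsp["bit_score"], hsp["identity_pct"])  # 303.5 100.0
```

The percentages are recomputed from the fractions rather than read from the parentheses, which gives a quick consistency check on the reported values.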

BLAST of CmoCh20G010070 vs. ExPASy TrEMBL
Match: A0A6J1J5Z3 (uncharacterized protein LOC111483758 OS=Cucurbita maxima OX=3661 GN=LOC111483758 PE=4 SV=1)

HSP 1 Score: 292.7 bits (748), Expect = 8.7e-76
Identity = 147/153 (96.08%), Positives = 150/153 (98.04%), Query Frame = 0

Query: 1   MADESYCKHNLKMAMLNHEHTFRHQVYELHRLYRIQKVLMKNIAENRGKTEEEEEEEGEE 60
           MADES+CKHNLKMAMLNHEHTFRHQVYELHRLYRIQKVLMKNIAENRGKTEEEE EEGEE
Sbjct: 1   MADESFCKHNLKMAMLNHEHTFRHQVYELHRLYRIQKVLMKNIAENRGKTEEEEGEEGEE 60

Query: 61  IEFIQESEIELTLGPSSYSNQVGGRRRRRKKLGELMKGGSDSSSSSTTGSAPKGGILYRD 120
           IEFIQESEI+LTLGPSSYSNQVGGRRRRRKKLGELMKGGSDSSS STTGSAPK GILYRD
Sbjct: 61  IEFIQESEIQLTLGPSSYSNQVGGRRRRRKKLGELMKGGSDSSSCSTTGSAPKSGILYRD 120

Query: 121 GEFLGFLELQDETRLAQHHPNPWLCQVVSLNLT 154
           GEFLGFL+LQDETRLAQHHPNPWLCQVVSLNLT
Sbjct: 121 GEFLGFLDLQDETRLAQHHPNPWLCQVVSLNLT 153
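
The Identity figure is the number of alignment columns in which the query and subject residues are identical, divided by the alignment length. As a minimal sketch (the function name `count_identities` is illustrative, and the strings are the Query and Sbjct rows of the Cucurbita maxima hit above, joined across the three blocks), the reported 147/153 can be recomputed directly:

```python
def count_identities(query, subject, gap="-"):
    """Return (identical columns, alignment length) for two aligned rows."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    # A column counts as an identity only if both residues match and
    # neither is a gap character.
    identical = sum(q == s and q != gap for q, s in zip(query, subject))
    return identical, len(query)

query = (
    "MADESYCKHNLKMAMLNHEHTFRHQVYELHRLYRIQKVLMKNIAENRGKTEEEEEEEGEE"
    "IEFIQESEIELTLGPSSYSNQVGGRRRRRKKLGELMKGGSDSSSSSTTGSAPKGGILYRD"
    "GEFLGFLELQDETRLAQHHPNPWLCQVVSLNLT"
)
subject = (
    "MADESFCKHNLKMAMLNHEHTFRHQVYELHRLYRIQKVLMKNIAENRGKTEEEEGEEGEE"
    "IEFIQESEIQLTLGPSSYSNQVGGRRRRRKKLGELMKGGSDSSSCSTTGSAPKSGILYRD"
    "GEFLGFLDLQDETRLAQHHPNPWLCQVVSLNLT"
)

identical, length = count_identities(query, subject)
print(identical, length, round(100 * identical / length, 2))  # 147 153 96.08
```

The Positives figure additionally counts conservative substitutions (the `+` marks in the middle row), which requires a substitution matrix and is not reproduced here.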

BLAST of CmoCh20G010070 vs. ExPASy TrEMBL
Match: A0A1S3C5U9 (uncharacterized protein LOC103497380 OS=Cucumis melo OX=3656 GN=LOC103497380 PE=4 SV=1)

HSP 1 Score: 192.6 bits (488), Expect = 1.2e-45
Identity = 115/174 (66.09%), Positives = 130/174 (74.71%), Query Frame = 0

Query: 1   MADESYCKHNLKMAMLNHEHTFRHQVYELHRLYRIQKVLMKNIAE-NRGKTEEEEEEEGE 60
           + ++SY KHN+KMAMLNHE TFRHQVYELHRLYRIQKVLMKNI E NRGKTEEEEEEEGE
Sbjct: 4   LLEQSYSKHNMKMAMLNHEQTFRHQVYELHRLYRIQKVLMKNIREKNRGKTEEEEEEEGE 63

Query: 61  EI-EFIQESEIELTLGPSSYSNQVGG----RRRRRKKLGELMKGGSDS------SSSSTT 120
           E  +FI+ESEIELTLGPS+Y+NQ+GG    R RR KK GE   G SDS      SSSST 
Sbjct: 64  EEGDFIEESEIELTLGPSNYNNQIGGRTRTRTRRMKKFGE---GNSDSGMSFSASSSSTN 123

Query: 121 GSAPKGGILYRD-GE--------FLGFLELQDETRLAQHHPNPWLCQVVSLNLT 154
           GS  K    YRD GE        FLGFL++QD+ +++ HHPNPWL Q VSLNLT
Sbjct: 124 GSVQKIKQFYRDNGEFVNGSQMGFLGFLDVQDDIKVSHHHPNPWLYQTVSLNLT 174

BLAST of CmoCh20G010070 vs. ExPASy TrEMBL
Match: A0A0A0LJL7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G277640 PE=4 SV=1)

HSP 1 Score: 190.3 bits (482), Expect = 6.1e-45
Identity = 114/174 (65.52%), Positives = 129/174 (74.14%), Query Frame = 0

Query: 1   MADESYCKHNLKMAMLNHEHTFRHQVYELHRLYRIQKVLMKNIAE-NRGKTEEEEEEEGE 60
           + ++SY KHN+KMAMLNHE TFRHQVYELHRLYRIQKVLMKNI E NRGKTEEEEEEEGE
Sbjct: 4   LLEQSYSKHNMKMAMLNHEQTFRHQVYELHRLYRIQKVLMKNIREKNRGKTEEEEEEEGE 63

Query: 61  EI-EFIQESEIELTLGPSSYSNQVGG----RRRRRKKLGELMKGGSDS------SSSSTT 120
           E  +FI+ES+IELTLGPS+Y+NQ GG    R RR KK GE   G SDS      SSSST 
Sbjct: 64  EEGDFIEESDIELTLGPSNYNNQTGGRTRTRTRRMKKFGE---GNSDSGMSFSASSSSTN 123

Query: 121 GSAPKGGILYRD-GE--------FLGFLELQDETRLAQHHPNPWLCQVVSLNLT 154
           GS  K    YRD GE        FLGFL++QD+ +++ HHPNPWL Q VSLNLT
Sbjct: 124 GSVQKIKQFYRDNGEFVNGSQMGFLGFLDVQDDIKVSHHHPNPWLYQTVSLNLT 174

BLAST of CmoCh20G010070 vs. ExPASy TrEMBL
Match: A0A6J1G6V7 (uncharacterized protein LOC111451410 OS=Cucurbita moschata OX=3662 GN=LOC111451410 PE=4 SV=1)

HSP 1 Score: 183.7 bits (465), Expect = 5.7e-43
Identity = 104/163 (63.80%), Positives = 120/163 (73.62%), Query Frame = 0

Query: 4   ESYCKHNLKMAMLNHEHTFRHQVYELHRLYRIQKVLMKNIAENRGKT-------EEEEEE 63
           ES  K N+K AML HE TFRHQVYELHRLYRIQK+LMKNI ENRGKT       E+E+EE
Sbjct: 8   ESESKENMKNAMLKHEQTFRHQVYELHRLYRIQKLLMKNITENRGKTERWEVKNEDEDEE 67

Query: 64  EGEEIEFIQESEIELTLGPSSYSNQVGGRRRRRKKLGELMKGGSDSSSSSTTGSAPKGGI 123
           + +E +FI+ESE+ELTLGPS+YSNQVGG RRRR+K        S SSSSSTTGSA K   
Sbjct: 68  DEKESDFIEESEVELTLGPSNYSNQVGGGRRRRRK-----SCFSSSSSSSTTGSAQKSRS 127

Query: 124 LYRDGEFLGFLE------LQDETRLAQHHPNPWLCQVVSLNLT 154
            YR GE +G  E      L+DE RL+QHHPNPWLCQ ++LNLT
Sbjct: 128 FYRHGELVGSSEMGFVGFLEDEMRLSQHHPNPWLCQPLTLNLT 165

BLAST of CmoCh20G010070 vs. TAIR 10
Match: AT5G67390.1 (unknown protein; BEST Arabidopsis thaliana protein match is: Plant protein of unknown function (DUF863) (TAIR:AT1G69360.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archaea - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink).)

HSP 1 Score: 84.3 bits (207), Expect = 9.0e-17
Identity = 65/176 (36.93%), Positives = 87/176 (49.43%), Query Frame = 0

Query: 6   YCKHNLKMAMLNHEHTFRHQVYELHRLYRIQKVLMKNIAENRGKTEEEEEEEG------- 65
           Y K  +KMAML HE TF+ QVYELHRLY++QK+LMKN+  N+  T+      G       
Sbjct: 8   YDKQCMKMAMLKHEETFKQQVYELHRLYQVQKILMKNMEINKFTTKNNHVNSGLGTFIRR 67

Query: 66  ---------------EEIEFIQESEIELTLGPSSYSNQVGGRRRRRKK---LGELMKG-- 125
                            IE + ESEIELTLGPS Y      R  ++KK   L E+M G  
Sbjct: 68  VDNEIDRPANFSGGNNNIEIMDESEIELTLGPSCYGGDEMMRMNKKKKKNSLPEMMDGSL 127

Query: 126 --GSDSSSSSTTGSAPKGGILYRDGEFLGFLELQDETRLAQHHPNPWLCQVVSLNL 153
             G  S SSS+TGS+        +       +++ E  +      PWL Q ++LN+
Sbjct: 128 NSGRRSFSSSSTGSSNNNNNNLEE-------QVRQERMMKHQKQQPWL-QALTLNV 175

BLAST of CmoCh20G010070 vs. TAIR 10
Match: AT5G67390.2 (unknown protein; BEST Arabidopsis thaliana protein match is: Plant protein of unknown function (DUF863) (TAIR:AT1G69360.1); Has 186 Blast hits to 186 proteins in 18 species: Archaea - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 170; Viruses - 0; Other Eukaryotes - 16 (source: NCBI BLink).)

HSP 1 Score: 84.3 bits (207), Expect = 9.0e-17
Identity = 65/176 (36.93%), Positives = 87/176 (49.43%), Query Frame = 0

Query: 6   YCKHNLKMAMLNHEHTFRHQVYELHRLYRIQKVLMKNIAENRGKTEEEEEEEG------- 65
           Y K  +KMAML HE TF+ QVYELHRLY++QK+LMKN+  N+  T+      G       
Sbjct: 8   YDKQCMKMAMLKHEETFKQQVYELHRLYQVQKILMKNMEINKFTTKNNHVNSGLGTFIRR 67

Query: 66  ---------------EEIEFIQESEIELTLGPSSYSNQVGGRRRRRKK---LGELMKG-- 125
                            IE + ESEIELTLGPS Y      R  ++KK   L E+M G  
Sbjct: 68  VDNEIDRPANFSGGNNNIEIMDESEIELTLGPSCYGGDEMMRMNKKKKKNSLPEMMDGSL 127

Query: 126 --GSDSSSSSTTGSAPKGGILYRDGEFLGFLELQDETRLAQHHPNPWLCQVVSLNL 153
             G  S SSS+TGS+        +       +++ E  +      PWL Q ++LN+
Sbjct: 128 NSGRRSFSSSSTGSSNNNNNNLEE-------QVRQERMMKHQKQQPWL-QALTLNV 175

BLAST of CmoCh20G010070 vs. TAIR 10
Match: AT1G69360.1 (Plant protein of unknown function (DUF863))

HSP 1 Score: 47.8 bits (112), Expect = 9.4e-06
Identity = 27/46 (58.70%), Positives = 32/46 (69.57%), Query Frame = 0

Query: 4  ESYCKHNLKMAMLNHEHTFRHQVYELHRLYRIQKVLMKNIAENRGK 50
          +SY +  LK  ML HE  F++QVYELHRLYR QK LM   AE +GK
Sbjct: 53 DSYERDFLKQTMLEHEAVFKNQVYELHRLYRTQKSLM---AEVKGK 95

The following BLAST results are available for this feature:
Match Name    E-value   Identity  Description
A0A6J1EXP9    4.9e-79   100.00    uncharacterized protein LOC111439473 OS=Cucurbita moschata OX=3662 GN=LOC1114394...
A0A6J1J5Z3    8.7e-76   96.08     uncharacterized protein LOC111483758 OS=Cucurbita maxima OX=3661 GN=LOC111483758...
A0A1S3C5U9    1.2e-45   66.09     uncharacterized protein LOC103497380 OS=Cucumis melo OX=3656 GN=LOC103497380 PE=...
A0A0A0LJL7    6.1e-45   65.52     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G277640 PE=4 SV=1
A0A6J1G6V7    5.7e-43   63.80     uncharacterized protein LOC111451410 OS=Cucurbita moschata OX=3662 GN=LOC1114514...

Match Name    E-value   Identity  Description
AT5G67390.1   9.0e-17   36.93     unknown protein; BEST Arabidopsis thaliana protein match is: Plant protein of un...
AT5G67390.2   9.0e-17   36.93     unknown protein; BEST Arabidopsis thaliana protein match is: Plant protein of un...
AT1G69360.1   9.4e-06   58.70     Plant protein of unknown function (DUF863)
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source       Source Term     Source Description     Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction    coord: 50..66
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction    coord: 46..114
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction    coord: 98..112
None      No IPR available  PANTHER      PTHR33167       FAMILY NOT NAMED       coord: 5..56
None      No IPR available  PANTHER      PTHR33167       FAMILY NOT NAMED       coord: 53..153
None      No IPR available  PANTHER      PTHR33167:SF45  BNAC02G16680D PROTEIN  coord: 5..56
None      No IPR available  PANTHER      PTHR33167:SF45  BNAC02G16680D PROTEIN  coord: 53..153
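
The `coord:` values in the InterPro annotations refer to positions on the protein sequence. The following minimal sketch extracts the residues covered by one annotation, assuming the coordinates are 1-based and inclusive; the helper name `region` is hypothetical.

```python
# Protein sequence of CmoCh20G010070, split into three pieces for readability.
PROTEIN = (
    "MADESYCKHNLKMAMLNHEHTFRHQVYELHRLYRIQKVLMKNIAENRGKTEEEEEEEGEE"
    "IEFIQESEIELTLGPSSYSNQVGGRRRRRKKLGELMKGGSDSSSSSTTGSAPKGGILYRD"
    "GEFLGFLELQDETRLAQHHPNPWLCQVVSLNLT"
)

def region(seq, coord):
    """Return the residues covered by a 'start..end' coordinate string."""
    start, end = (int(part) for part in coord.split(".."))
    # Assumption: coordinates are 1-based and inclusive.
    if not 1 <= start <= end <= len(seq):
        raise ValueError(f"coordinates {coord!r} fall outside the sequence")
    return seq[start - 1:end]

print(region(PROTEIN, "50..66"))  # TEEEEEEEGEEIEFIQE
```

Under that assumption, the first mobidb-lite disorder prediction (50..66) covers the glutamate-rich stretch of the protein.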

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh20G010070.1  CmoCh20G010070.1  mRNA