CmoCh08G000920.1 (mRNA) Cucurbita moschata (Rifu) v1

Overview
NameCmoCh08G000920.1
TypemRNA
OrganismCucurbita moschata (Cucurbita moschata (Rifu) v1)
DescriptionUnknown protein
LocationCmo_Chr08: 478785 .. 482522 (-)
Sequence length903
RNA-Seq ExpressionCmoCh08G000920.1
SyntenyCmoCh08G000920.1
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAGATCGGTTTCTTACACAAGATCAGACCAAAAGAATTTCAAGAAGAAACTCAAGAAGCTTCCGTGGTTATCATCACCAACACCACAGGATCAGATCCCGCGGAATCCACCACTCCTTCCGTTTCTTACCTCAAGGTCCCATCGTTCTTTCTCAAGCCCTTCCGAAAAAGATTCGATTACTGCACCGATAACATCAAGGAAAAAGAAAACTCCATCGAGATCGAGAAGAAGAAGTGGCTCGCCGAAGTTGCCTCCATGTTCTACAAAATCTTCGGCGGAGGAAAATTCTCCGCCTGCGGAATCGGAAGTAAAATCGATGGAGATGATGATGATGATGATTTTTTGAGCGAATCGGAAGCGGGAGAAGATTATCGCGAAGATCGAGAGGAAATTTTCGCGGCAGAATCGAATGGCGAATCTGAAACAGAGAATATAATTCGAAGCGACGAGAAGAGTTCAAGCCAATCGAGACTTCCTACGTCGATCTTAGACCTATTTAGAAAGAGGAATTTCAATTTCAATTTCATAAAGAGCAGAATCTGCAAGGTGAAAAGTGGCTACCGAAACAGAAATCTCAATTACGAGAAAAATCTCAGAAACAATGAATCGGCTATGCTATTTGCAAACCGAGAAAATCATAAGCGATTGTTTCAAATTCCATCGAAAGATCGGAAACAGACGAAGGAAAAAAAATCGACGGCAATCTCATATCATGAAGGCGATGAAAACGGACAACGGCTATGGCAGAAAAGGATTCTAATGGGAGGAAGATGCAAACCTCTGAGGAATACTTCTGTAAGTCTTCGATATGATCCAAATGGCTTCATCTAGCCGCAGATCCTTCCTCCATTGATGAACATTGCAAATGCCAAATTCTTTGAGCATTTTGAGATTCTTCATTTTTAAATCCCCCCATCTTTTTCCCGCTTCGTTTTTTTTTTCTTTCTTGTTTTTTTCTTTGGATAAAATAAAATAAAACTTTTGTATAATTTCTTTATTTTATTATTCACTGAATTTTTTTAAAAACTAATTTTCCTCACTTTTTTCGTAATTTTAAAAAATATGTTGTGCAACAAAAAAGGGTTATTAACAAAATGCTAAACAAGGTAAATTTCAAAATTATAAATTTAAACTTTAAAATGTTTTAATAAATACACTAAATATACAATTAAAATGTTAAATTTTTATTATACATAAAATAATATTTATATTTAATAGATCTTTTTTTTTTTTAAGAATCTATTAAATAAAAAGTTGCAAATTAAAAATTTATTTTTTTTATTATGATGTTCGAGCCACTTGCCACACAACAATTAAAAGGAAACCGTCCAAAGAATACATAGATTTAAACTATTGAGGCTGCCAGTTTGAAAGCTTATGGTCGTTTTGGGACCTGGGTCAAAATCAAATGTTGATATCTGAATTTAGGAGTTGATTATGTAAATCTAAACGATGTTGTTGGTCAAAGTGCTTGTGTGTACGTTGCATCGCCCATACAAGAAGGTAGGTGGCCATCAAAGATATCATGTCATTATGAGGCATCTGAATGTCGTAAACGTAGTGTTGATTATTGGTTAGAGGCACAAACCAGACCATGAAGGTCTTATTAACCAAATGTCAATTTCTAGTCAGAGTTAGCTAAGCGTCGCACATCACGTGCTCGTCACCGTTATAGGTCGGGAGGTGGAGTTATTCTCAAAAAAAGAAAAAAAAAAAACAATAATAATATAAATTAAATACATGAGTTGGGGCTGTCGCAAACAATACAGCTGGGGCCAAGTGTCATTGAATGGTTCTTGTTGAAATATCATGGTTTTCTTTTAAACCAATATTCTCCCTCTTTTTAAATTTATTAAATACTTTTCTTTTTTTAAAATATTTCAATTTCAAATTTCAATCTTCAACGTGTCGGCCAATGTTCTGATGCGACCATTCTTCATATATTTCCCCAATATTTTCCAATTAAAAATAATTATACTTTTTAGTATAATTACATTTTTACCCCACGCTCTTCATTTAACTTCAATTTTAGTCCATAAATTTTAAAATATTATATTTTTATTCTTAGTCCTTCCATTTTCATTAATTAATTAAATAATATGATATAAATTTTTAAATAATTTTATTATTAGCGAAAATAAGTATCCATAAATTTTTAAATAATTTTATTATTAGCGAAAATAAGTATCCAGTAAAAATGAATTAAACTAATCTAAATTAAGTATTACTTAAAACTCGAGAAAATTTAAAATTAAAGGGTAAAAAATGTAATATTAAAAATGATGAATTAAATAGAAATTATAATTAAGACTCCTAAAAATAATATATTTTTTTTACCTCTTGCCTTTCTTAGTATAATTACATTTTTATCCCTTTTAATTTAATTTCAATTTAGTCCATAAATTTCAAAATATTATATTTTTAGTCTCTTCCATCTTTCGATTTTCATTAATTAATTAAATAATAAGATGTAAATTTTTAAATAATTTTATTATTAGCGAAAATAAGTGTCTAGTAAAAATGAATTAAACTAATCTAAACGAATATTACTTAACCCTCGAGAAAATTCAAAATTAAATGGTAAAAAATGTGATATTAAAAATTATGAATTAAATAGAAATTATAATTAAGACGCTTAAAAATATATTTTTTTCATTCTTAAATTTTATTATGAAAAAATTAATTTTAAAATATTGAAAATAATGAGCTTAGGAATTTAAGGGTATTTAATAATAGAAGTTCCATTAGATTATGGTTGATAGATATAAACAATGATGAGTGAAATGAGACAAAATAAAAAAGGAGAAAAAAATAAAATAAAAATGCATTCCCTTAATTTACTGCGTGTAAGTTGCGTTGTCTCTTTTATCTCTCTCGCGGTACGTCCATTCTCCTTCGCTGCTATTTCCTCGACAGAGCATTTGTTCTTCCAAGACCATGTTCAAAAATCTCGCCTCTTCTTCTACTTTCCTCTCCAACTAGCTTCAGATCCGCCATCCTCATTCCACTCTCCGGCGTAACCATTTTCTTCCATTTCTGATCCATCCGCCTTCCTCCTTCTGCGATTGGTATGCTCTTCTATCATTTTACTGCTTTCCTTTCAGATCTTGCGCTTTATTTTTCCTTTTCATTTACATCATCCACAGCTTTGCTATTGATTCAGCTTATTGATCGCGAATGTTTAGTTGGAATGAAGTTTTGAATTGAATTCGGAATACTGTTTTTTTTTTCTTTTTTCTTTTTTTTTTCTTTCTTTGAATTGGCCTCTTGTTGTTTTTGATGATTGATGATCGAAGTTTCCATTGATGTTAATGATCTGGATCCTTGATTCTTGTTCATTTAGGTGAATTTTGGTTTCGTTTGGTTTTGTCGGTGCTTTTGGTTGGAAACTTCCAATTCTCTTCCACCGACTTGAATTTGTTTATACTTTTCACAATGCCAGATGTGGATCATGATTTGTGGTTCGGTATTTCTCGTCCATTCGACTGCTGATGCGTGTATGCATATCTATCCAAGTTTGTAACGTACATTTCGTTGTATACTTACGTGGCTAGCAATGTAGATGAAAATTAAGTTTCCTTCCATGATGTGTTTGATGTCAGATAACCACAATTTGGAATTGTAGTCGTTCTACATGTACTTTCATGTTCATTACTTAGATTTCTAATCTATATTCCAAGCTTTTAAACACTCTTACTTGTTTTCCTCTGAAACAGCCCAGAATATTTGAGGTGA

mRNA sequence

ATGGAGATCGGTTTCTTACACAAGATCAGACCAAAAGAATTTCAAGAAGAAACTCAAGAAGCTTCCGTGGTTATCATCACCAACACCACAGGATCAGATCCCGCGGAATCCACCACTCCTTCCGTTTCTTACCTCAAGGTCCCATCGTTCTTTCTCAAGCCCTTCCGAAAAAGATTCGATTACTGCACCGATAACATCAAGGAAAAAGAAAACTCCATCGAGATCGAGAAGAAGAAGTGGCTCGCCGAAGTTGCCTCCATGTTCTACAAAATCTTCGGCGGAGGAAAATTCTCCGCCTGCGGAATCGGAAGTAAAATCGATGGAGATGATGATGATGATGATTTTTTGAGCGAATCGGAAGCGGGAGAAGATTATCGCGAAGATCGAGAGGAAATTTTCGCGGCAGAATCGAATGGCGAATCTGAAACAGAGAATATAATTCGAAGCGACGAGAAGAGTTCAAGCCAATCGAGACTTCCTACGTCGATCTTAGACCTATTTAGAAAGAGGAATTTCAATTTCAATTTCATAAAGAGCAGAATCTGCAAGGTGAAAAGTGGCTACCGAAACAGAAATCTCAATTACGAGAAAAATCTCAGAAACAATGAATCGGCTATGCTATTTGCAAACCGAGAAAATCATAAGCGATTGTTTCAAATTCCATCGAAAGATCGGAAACAGACGAAGGAAAAAAAATCGACGGCAATCTCATATCATGAAGGCGATGAAAACGGACAACGGCTATGGCAGAAAAGGATTCTAATGGGAGGAAGATGCAAACCTCTGAGGAATACTTCTCTTCAGATCCGCCATCCTCATTCCACTCTCCGGCGTAACCATTTTCTTCCATTTCTGATCCATCCGCCTTCCTCCTTCTGCGATTGCCCAGAATATTTGAGGTGA

Coding sequence (CDS)

ATGGAGATCGGTTTCTTACACAAGATCAGACCAAAAGAATTTCAAGAAGAAACTCAAGAAGCTTCCGTGGTTATCATCACCAACACCACAGGATCAGATCCCGCGGAATCCACCACTCCTTCCGTTTCTTACCTCAAGGTCCCATCGTTCTTTCTCAAGCCCTTCCGAAAAAGATTCGATTACTGCACCGATAACATCAAGGAAAAAGAAAACTCCATCGAGATCGAGAAGAAGAAGTGGCTCGCCGAAGTTGCCTCCATGTTCTACAAAATCTTCGGCGGAGGAAAATTCTCCGCCTGCGGAATCGGAAGTAAAATCGATGGAGATGATGATGATGATGATTTTTTGAGCGAATCGGAAGCGGGAGAAGATTATCGCGAAGATCGAGAGGAAATTTTCGCGGCAGAATCGAATGGCGAATCTGAAACAGAGAATATAATTCGAAGCGACGAGAAGAGTTCAAGCCAATCGAGACTTCCTACGTCGATCTTAGACCTATTTAGAAAGAGGAATTTCAATTTCAATTTCATAAAGAGCAGAATCTGCAAGGTGAAAAGTGGCTACCGAAACAGAAATCTCAATTACGAGAAAAATCTCAGAAACAATGAATCGGCTATGCTATTTGCAAACCGAGAAAATCATAAGCGATTGTTTCAAATTCCATCGAAAGATCGGAAACAGACGAAGGAAAAAAAATCGACGGCAATCTCATATCATGAAGGCGATGAAAACGGACAACGGCTATGGCAGAAAAGGATTCTAATGGGAGGAAGATGCAAACCTCTGAGGAATACTTCTCTTCAGATCCGCCATCCTCATTCCACTCTCCGGCGTAACCATTTTCTTCCATTTCTGATCCATCCGCCTTCCTCCTTCTGCGATTGCCCAGAATATTTGAGGTGA

Protein sequence

MEIGFLHKIRPKEFQEETQEASVVIITNTTGSDPAESTTPSVSYLKVPSFFLKPFRKRFDYCTDNIKEKENSIEIEKKKWLAEVASMFYKIFGGGKFSACGIGSKIDGDDDDDDFLSESEAGEDYREDREEIFAAESNGESETENIIRSDEKSSSQSRLPTSILDLFRKRNFNFNFIKSRICKVKSGYRNRNLNYEKNLRNNESAMLFANRENHKRLFQIPSKDRKQTKEKKSTAISYHEGDENGQRLWQKRILMGGRCKPLRNTSLQIRHPHSTLRRNHFLPFLIHPPSSFCDCPEYLR
Homology
BLAST of CmoCh08G000920.1 vs. NCBI nr
Match: KAG6592883.1 (hypothetical protein SDJN03_12359, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 565.8 bits (1457), Expect = 2.2e-157
Identity = 291/300 (97.00%), Postives = 293/300 (97.67%), Query Frame = 0

Query: 1   MEIGFLHKIRPKEFQEETQEASVVIITNTTGSDPAESTTPSVSYLKVPSFFLKPFRKRFD 60
           MEIGFLHKIRPKEFQEETQEASVVIITNTTGSDPAESTTPSVSYLKVP+FFLKPFRKRFD
Sbjct: 1   MEIGFLHKIRPKEFQEETQEASVVIITNTTGSDPAESTTPSVSYLKVPTFFLKPFRKRFD 60

Query: 61  YCTDNIKEKENSIEIEKKKWLAEVASMFYKIFGGGKFSACGIGSKIDGDDDDDDFLSESE 120
           YCTDNIKEKENSIEIEKKKWLAEVASMF KIFGGGK SACGIG KIDG  DDDDFLSESE
Sbjct: 61  YCTDNIKEKENSIEIEKKKWLAEVASMFCKIFGGGKLSACGIGGKIDG--DDDDFLSESE 120

Query: 121 AGEDYREDREEIFAAESNGESETENIIRSDEKSSSQSRLPTSILDLFRKRNFNFNFIKSR 180
           AG+DYREDREEIFAAESNGESETENIIRSDEKSSSQSRLPTSILDLFRKRNFNFNFIKSR
Sbjct: 121 AGDDYREDREEIFAAESNGESETENIIRSDEKSSSQSRLPTSILDLFRKRNFNFNFIKSR 180

Query: 181 ICKVKSGYRNRNLNYEKNLRNNESAMLFANRENHKRLFQIPSKDRKQTKEKKSTAISYHE 240
           ICKVKSGYRNRNLNYEKNLRNNESAMLFANRENHKRLFQIPSKDRKQTKEKKSTAISYHE
Sbjct: 181 ICKVKSGYRNRNLNYEKNLRNNESAMLFANRENHKRLFQIPSKDRKQTKEKKSTAISYHE 240

Query: 241 GDENGQRLWQKRILMGGRCKPLRNTSLQIRHPHSTLRRNHFLPFLIHPPSSFCDCPEYLR 300
           GDENGQRLWQKRILMGGRCKPLRNTS  IRHPHSTLRRNHFLPFLIHPPSSFCDCPEYLR
Sbjct: 241 GDENGQRLWQKRILMGGRCKPLRNTS--IRHPHSTLRRNHFLPFLIHPPSSFCDCPEYLR 296

BLAST of CmoCh08G000920.1 vs. NCBI nr
Match: KAG6607704.1 (hypothetical protein SDJN03_01046, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 152.9 bits (385), Expect = 4.3e-33
Identity = 84/103 (81.55%), Postives = 89/103 (86.41%), Query Frame = 0

Query: 1   MEIGFLHKIRPKEFQEETQEASVVIITNTTGSDPAESTTPSVSYLKVPSFFLKPFRKRF- 60
           MEIGFLHKIRPKEFQEETQEASVVIITNTTGSDPAESTTPSVSYLKVP+FFLK  +K F 
Sbjct: 1   MEIGFLHKIRPKEFQEETQEASVVIITNTTGSDPAESTTPSVSYLKVPTFFLKKKKKTFR 60

Query: 61  --DYCTDNI--KEKENSIEIEKKKWLAEVASMFYKIFGGGKFS 99
             D  T  I  ++K+NSIEIEKKKWLAEVASMF KIFGGGK S
Sbjct: 61  KKDSITAPITSRKKKNSIEIEKKKWLAEVASMFCKIFGGGKNS 103

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
KAG6592883.12.2e-15797.00hypothetical protein SDJN03_12359, partial [Cucurbita argyrosperma subsp. sorori... [more]
KAG6607704.14.3e-3381.55hypothetical protein SDJN03_01046, partial [Cucurbita argyrosperma subsp. sorori... [more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 134..154

Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
CmoCh08G000920CmoCh08G000920gene


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmoCh08G000920.1:exon:5181CmoCh08G000920.1:exon:5181exon
CmoCh08G000920.1:exon:5180CmoCh08G000920.1:exon:5180exon
CmoCh08G000920.1:exon:5179CmoCh08G000920.1:exon:5179exon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmoCh08G000920.1:cdsCmoCh08G000920.1:cds_3CDS
CmoCh08G000920.1:cdsCmoCh08G000920.1:cds_2CDS
CmoCh08G000920.1:cdsCmoCh08G000920.1:cdsCDS


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
CmoCh08G000920.1CmoCh08G000920.1-proteinpolypeptide