CmoCh02G007800 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh02G007800
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: Major viral transcription factor
Location: Cmo_Chr02: 4852225 .. 4852641 (-)
RNA-Seq Expression: CmoCh02G007800
Synteny: CmoCh02G007800
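
The location above implies a feature length that can be checked against the sequences listed for this gene. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (as in GFF-style annotation); the function name `feature_length` is hypothetical and not part of any database tool.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must be >= start")
    return end - start + 1


# Coordinates taken from the Location field of this record.
length_bp = feature_length(4852225, 4852641)
codons, remainder = divmod(length_bp, 3)
print(length_bp, codons, remainder)
```

Under that assumption the span is 417 bp, which divides evenly into 139 codons: 138 residues plus a stop codon, consistent with a single-exon gene whose gene, mRNA and coding sequences are identical.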
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAATGCAAATCAAGCTTTCGTGGTTGATTTCCCCAGAGCAGCAATGGAGAAAGAACATAAATCCAGATTGCCTAAACCTCCGACCAGGTTACAGAAGCAGGCGCCGGCGAGTCTTCATCTCGATCAACTAACGAGTGTCTCAATGTCTTCTGGTAGTGCTGATATTTCTTCGAAGGCGGTTCTTCCTCTGCTATCGCCGCTCCCTTCATCGCCACAGCCTTTTCCTGAAACTGAGGGAAATAGAAGAGCAGCAAACGGAAATACCGTAGACGGCGGCTATGGCGATCAAAGAGGTATGTGTTTTGCACCTCCTGGTGGCTGGCAACATCCAGCGGTGGCGACGACATTCGCCGATCCATCTACTCTGTTTACGTTCTTTCAATCGCAGTGCATCGTAAACAGTAAGACGCCGTGA

mRNA sequence

ATGAATGCAAATCAAGCTTTCGTGGTTGATTTCCCCAGAGCAGCAATGGAGAAAGAACATAAATCCAGATTGCCTAAACCTCCGACCAGGTTACAGAAGCAGGCGCCGGCGAGTCTTCATCTCGATCAACTAACGAGTGTCTCAATGTCTTCTGGTAGTGCTGATATTTCTTCGAAGGCGGTTCTTCCTCTGCTATCGCCGCTCCCTTCATCGCCACAGCCTTTTCCTGAAACTGAGGGAAATAGAAGAGCAGCAAACGGAAATACCGTAGACGGCGGCTATGGCGATCAAAGAGGTATGTGTTTTGCACCTCCTGGTGGCTGGCAACATCCAGCGGTGGCGACGACATTCGCCGATCCATCTACTCTGTTTACGTTCTTTCAATCGCAGTGCATCGTAAACAGTAAGACGCCGTGA

Coding sequence (CDS)

ATGAATGCAAATCAAGCTTTCGTGGTTGATTTCCCCAGAGCAGCAATGGAGAAAGAACATAAATCCAGATTGCCTAAACCTCCGACCAGGTTACAGAAGCAGGCGCCGGCGAGTCTTCATCTCGATCAACTAACGAGTGTCTCAATGTCTTCTGGTAGTGCTGATATTTCTTCGAAGGCGGTTCTTCCTCTGCTATCGCCGCTCCCTTCATCGCCACAGCCTTTTCCTGAAACTGAGGGAAATAGAAGAGCAGCAAACGGAAATACCGTAGACGGCGGCTATGGCGATCAAAGAGGTATGTGTTTTGCACCTCCTGGTGGCTGGCAACATCCAGCGGTGGCGACGACATTCGCCGATCCATCTACTCTGTTTACGTTCTTTCAATCGCAGTGCATCGTAAACAGTAAGACGCCGTGA

Protein sequence

MNANQAFVVDFPRAAMEKEHKSRLPKPPTRLQKQAPASLHLDQLTSVSMSSGSADISSKAVLPLLSPLPSSPQPFPETEGNRRAANGNTVDGGYGDQRGMCFAPPGGWQHPAVATTFADPSTLFTFFQSQCIVNSKTP
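
The relationship between the coding sequence and the protein sequence can be sketched as a plain codon-by-codon translation. This is an illustrative example, assuming the standard nuclear genetic code; the `translate` helper is hypothetical, and only the first and last codons of the CDS above are used as input.

```python
# Standard genetic code, with codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO_ACIDS
    )
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3].upper()]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First ten codons and last four codons of the CDS listed above.
print(translate("ATGAATGCAAATCAAGCTTTCGTGGTTGAT"))  # MNANQAFVVD
print(translate("AAGACGCCGTGA"))                    # KTP
```

The two fragments reproduce the start (MNANQAFVVD) and the end (KTP) of the protein sequence shown above, with the final TGA codon acting as the stop.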
Homology
BLAST of CmoCh02G007800 vs. ExPASy TrEMBL
Match: A0A0A0KBQ7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G188670 PE=4 SV=1)

HSP 1 Score: 208.4 bits (529), Expect = 1.9e-50
Identity = 107/140 (76.43%), Positives = 117/140 (83.57%), Query Frame = 0

Query: 1   MNANQAFVVDFPRAAMEKEHKSRLPKPPTRLQKQAPASLHLDQLTSVSMSSGSADISSKA 60
           M+ANQ  VVDF R +ME E+K RLPKPPTRLQKQAPASLHLDQL+SVSMSS S DI SKA
Sbjct: 1   MSANQTVVVDFLRGSMENEYKPRLPKPPTRLQKQAPASLHLDQLSSVSMSSASNDICSKA 60

Query: 61  VLPLLSPLPSSPQPFPETEGNRRAANGNTVD--GGYGDQRGMCFAPPGGWQHPAVATTFA 120
           +LPLLSPLP SPQP PE +GNR +ANGN VD  GG GDQRG+ F  PGGWQHPAVA TF 
Sbjct: 61  ILPLLSPLPLSPQPLPEIDGNRISANGNAVDGGGGNGDQRGIGFVAPGGWQHPAVAATFP 120

Query: 121 DPSTLFTFFQSQCIVNSKTP 139
           DPSTLFTFFQSQC+V+S TP
Sbjct: 121 DPSTLFTFFQSQCMVSSNTP 140
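
The percentages on an HSP statistics line are the ratio of matched columns to alignment length. The sketch below recomputes them from the counts reported for this hit; the regular expression and the `parse_hsp_stats` helper are hypothetical conveniences written for this record's line layout, not part of BLAST itself.

```python
import re

# Matches fields of the form "Name = matched/aligned (pct%)".
HSP_STAT = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp_stats(line: str) -> dict:
    """Return {field name: (matched, aligned, reported percent)} for one statistics line."""
    return {
        name: (int(matched), int(aligned), float(pct))
        for name, matched, aligned, pct in HSP_STAT.findall(line)
    }


line = "Identity = 107/140 (76.43%), Positives = 117/140 (83.57%), Query Frame = 0"
for name, (matched, aligned, reported) in parse_hsp_stats(line).items():
    recomputed = round(100 * matched / aligned, 2)
    print(name, reported, recomputed)
```

For this hit the recomputed values agree with the reported ones: 107 of 140 aligned columns are identical (76.43%) and 117 of 140 are conservative or identical (83.57%).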

BLAST of CmoCh02G007800 vs. ExPASy TrEMBL
Match: A0A5A7V856 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold571G00270 PE=4 SV=1)

HSP 1 Score: 184.5 bits (467), Expect = 3.0e-43
Identity = 96/125 (76.80%), Positives = 103/125 (82.40%), Query Frame = 0

Query: 16  MEKEHKSRLPKPPTRLQKQAPASLHLDQLTSVSMSSGSADISSKAVLPLLSPLPSSPQPF 75
           ME E K RLPKPPTRLQKQAPASLHLDQL+SVSMSS S D  SKA+LPLLSPLP SPQP 
Sbjct: 1   MENELKPRLPKPPTRLQKQAPASLHLDQLSSVSMSSASNDTYSKAILPLLSPLPLSPQPL 60

Query: 76  PETEGNRRAANGNTVD--GGYGDQRGMCFAPPGGWQHPAVATTFADPSTLFTFFQSQCIV 135
           PE +GNR +A GN V+  GG GDQRG+ F  PGGWQHPAVA TF DPSTLFTFFQSQCIV
Sbjct: 61  PEIDGNRISATGNAVEGGGGNGDQRGIGFVAPGGWQHPAVAATFPDPSTLFTFFQSQCIV 120

Query: 136 NSKTP 139
           +S TP
Sbjct: 121 SSNTP 125

BLAST of CmoCh02G007800 vs. ExPASy TrEMBL
Match: A0A6J1D5B7 (uncharacterized protein LOC111017071 OS=Momordica charantia OX=3673 GN=LOC111017071 PE=4 SV=1)

HSP 1 Score: 135.2 bits (339), Expect = 2.1e-28
Identity = 76/105 (72.38%), Positives = 86/105 (81.90%), Query Frame = 0

Query: 1   MNANQAFVVDFPRAAMEKEH--KSRLPKPPTRLQKQAPASLHLDQLTSVSMSSGSADISS 60
           MNANQ+ V+DF RA+MEKEH  KSR+PKPPTRLQKQAPASLHLDQ++S SM+ G A+ SS
Sbjct: 1   MNANQSAVIDFLRASMEKEHKAKSRIPKPPTRLQKQAPASLHLDQVSSTSMAPGGAETSS 60

Query: 61  KAVLPLLSPLPSSPQPFPETEGNR----RAANGNTVDGGYGDQRG 100
           KA+LPLLSPLP SPQP+PETE N      AAN N VDGG GDQRG
Sbjct: 61  KAILPLLSPLPLSPQPWPETEENNTNRVSAANENAVDGG-GDQRG 104

BLAST of CmoCh02G007800 vs. ExPASy TrEMBL
Match: A0A6P3ZH11 (uncharacterized protein LOC107411018 OS=Ziziphus jujuba OX=326968 GN=LOC107411018 PE=4 SV=1)

HSP 1 Score: 98.6 bits (244), Expect = 2.2e-17
Identity = 62/112 (55.36%), Positives = 77/112 (68.75%), Query Frame = 0

Query: 26  KPPTRLQKQAPASLHLDQLTSVSMSSGSADISSKAVLPLLSPLPSSPQPFPETEGNRR-- 85
           KPPTRLQK+APASL LDQ+ + S ++ S + +SKA+ PLLSPL  SPQP PE    RR  
Sbjct: 14  KPPTRLQKKAPASLKLDQVPT-SAANDSFNETSKAI-PLLSPLVLSPQPLPEMLEKRRFG 73

Query: 86  -AANGNTVDGGYGDQRGMCF-APPGGWQHPAVATTFADPSTLFTFFQSQCIV 134
            AA+ + V+    D+R      P  GWQHPAV TTF DPS+LF FFQSQC++
Sbjct: 74  CAADQHDVE----DKRSEAVPLPADGWQHPAVPTTFTDPSSLFAFFQSQCVI 119

BLAST of CmoCh02G007800 vs. ExPASy TrEMBL
Match: A0A2I4HTA9 (uncharacterized protein LOC108990495 OS=Juglans regia OX=51240 GN=LOC108990495 PE=4 SV=1)

HSP 1 Score: 94.7 bits (234), Expect = 3.1e-16
Identity = 55/117 (47.01%), Positives = 70/117 (59.83%), Query Frame = 0

Query: 17  EKEHKSRLPKPPTRLQKQAPASLHLDQLTSVSMSSGSADISSKAVLPLLSPLPSSPQPFP 76
           +KE K+++ KPPTRLQKQAPASL LD++     +S          +P LSPL  SPQP P
Sbjct: 10  QKEGKAKVGKPPTRLQKQAPASLQLDKVAVYGETS--------KAIPFLSPLILSPQPLP 69

Query: 77  ETEGNRRAANGNTVDGGYGDQRGMCFAPPGGWQHPAVATTFADPSTLFTFFQSQCIV 134
           ET   R   +G+  +   G +R     P GGWQHPAV   F +P  LFT FQSQC++
Sbjct: 70  ETLEIRSGKSGSNEEEDNGAKRSKALPPGGGWQHPAV-PVFTEPEALFTCFQSQCLL 117

BLAST of CmoCh02G007800 vs. NCBI nr
Match: KAG6605440.1 (hypothetical protein SDJN03_02757, partial [Cucurbita argyrosperma subsp. sororia] >KAG7035388.1 hypothetical protein SDJN02_02184, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 273.5 bits (698), Expect = 1.0e-69
Identity = 135/138 (97.83%), Positives = 137/138 (99.28%), Query Frame = 0

Query: 1   MNANQAFVVDFPRAAMEKEHKSRLPKPPTRLQKQAPASLHLDQLTSVSMSSGSADISSKA 60
           MNANQAFVVDFP AAMEKEHKSRLPKPPTRLQKQAPASLHLDQLTSVSMSSGSADISSKA
Sbjct: 1   MNANQAFVVDFPGAAMEKEHKSRLPKPPTRLQKQAPASLHLDQLTSVSMSSGSADISSKA 60

Query: 61  VLPLLSPLPSSPQPFPETEGNRRAANGNTVDGGYGDQRGMCFAPPGGWQHPAVATTFADP 120
           +LPLLSPLPSSPQPFPETEGNRRAANGNTVDGGYGDQRG+CFAPPGGWQHPAVATTFADP
Sbjct: 61  ILPLLSPLPSSPQPFPETEGNRRAANGNTVDGGYGDQRGICFAPPGGWQHPAVATTFADP 120

Query: 121 STLFTFFQSQCIVNSKTP 139
           STLFTFFQSQCIVNSKTP
Sbjct: 121 STLFTFFQSQCIVNSKTP 138

BLAST of CmoCh02G007800 vs. NCBI nr
Match: XP_038902307.1 (uncharacterized protein LOC120088941 [Benincasa hispida])

HSP 1 Score: 215.7 bits (548), Expect = 2.5e-52
Identity = 110/140 (78.57%), Positives = 118/140 (84.29%), Query Frame = 0

Query: 1   MNANQAFVVDFPRAAMEKEHKSRLPKPPTRLQKQAPASLHLDQLTSVSMSSGSADISSKA 60
           MNANQ  V+DF R +ME EHK+RLPKPPTRLQKQAPASLHLDQL+SVSMSSGSAD  SKA
Sbjct: 1   MNANQTVVLDFLRGSMENEHKARLPKPPTRLQKQAPASLHLDQLSSVSMSSGSADTCSKA 60

Query: 61  VLPLLSPLPSSPQPFPETEGNRRAANGNTVD--GGYGDQRGMCFAPPGGWQHPAVATTFA 120
           +LPLLSPLP SPQP PE +GNR +ANGN  D  GG GDQRG+ FA PGGWQHPAVA TF 
Sbjct: 61  ILPLLSPLPLSPQPLPEIDGNRISANGNAADGGGGNGDQRGIGFAAPGGWQHPAVAATFP 120

Query: 121 DPSTLFTFFQSQCIVNSKTP 139
           DPSTLFTFFQSQCIV S TP
Sbjct: 121 DPSTLFTFFQSQCIVTSNTP 140

BLAST of CmoCh02G007800 vs. NCBI nr
Match: XP_011657141.1 (uncharacterized protein LOC105435808 [Cucumis sativus] >XP_031742438.1 uncharacterized protein LOC116404357 [Cucumis sativus] >KAE8645882.1 hypothetical protein Csa_017707 [Cucumis sativus])

HSP 1 Score: 208.4 bits (529), Expect = 4.0e-50
Identity = 107/140 (76.43%), Positives = 117/140 (83.57%), Query Frame = 0

Query: 1   MNANQAFVVDFPRAAMEKEHKSRLPKPPTRLQKQAPASLHLDQLTSVSMSSGSADISSKA 60
           M+ANQ  VVDF R +ME E+K RLPKPPTRLQKQAPASLHLDQL+SVSMSS S DI SKA
Sbjct: 1   MSANQTVVVDFLRGSMENEYKPRLPKPPTRLQKQAPASLHLDQLSSVSMSSASNDICSKA 60

Query: 61  VLPLLSPLPSSPQPFPETEGNRRAANGNTVD--GGYGDQRGMCFAPPGGWQHPAVATTFA 120
           +LPLLSPLP SPQP PE +GNR +ANGN VD  GG GDQRG+ F  PGGWQHPAVA TF 
Sbjct: 61  ILPLLSPLPLSPQPLPEIDGNRISANGNAVDGGGGNGDQRGIGFVAPGGWQHPAVAATFP 120

Query: 121 DPSTLFTFFQSQCIVNSKTP 139
           DPSTLFTFFQSQC+V+S TP
Sbjct: 121 DPSTLFTFFQSQCMVSSNTP 140

BLAST of CmoCh02G007800 vs. NCBI nr
Match: XP_023512816.1 (uncharacterized protein LOC111777440 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 204.9 bits (520), Expect = 4.4e-49
Identity = 105/139 (75.54%), Positives = 115/139 (82.73%), Query Frame = 0

Query: 1   MNANQAFVVDFPRAAMEKEHKSRLPKPPTRLQKQAPASLHLDQLTSVSMSSGSADISSKA 60
           MNANQ+ VVDF RAAM+ E   R+PKPPTRLQKQAPA L+LDQL+SVSMSS   +ISSK 
Sbjct: 1   MNANQSAVVDFLRAAMDNERGGRIPKPPTRLQKQAPAGLYLDQLSSVSMSSVGTEISSKT 60

Query: 61  VLPLLSPLPSSPQPFPETEGNRRAANGNTVD-GGYGDQRGMCFAPPGGWQHPAVATTFAD 120
           VLPLLSPLPSSPQPF ETEGNR   NGN  D GG GDQRG+ FA PGGWQHPAVA T+AD
Sbjct: 61  VLPLLSPLPSSPQPFSETEGNRMLTNGNGADSGGNGDQRGIVFASPGGWQHPAVAATYAD 120

Query: 121 PSTLFTFFQSQCIVNSKTP 139
           PSTLFTFFQS+CI+ S TP
Sbjct: 121 PSTLFTFFQSKCIITSNTP 139

BLAST of CmoCh02G007800 vs. NCBI nr
Match: KAG6570684.1 (hypothetical protein SDJN03_29599, partial [Cucurbita argyrosperma subsp. sororia] >KAG7010531.1 hypothetical protein SDJN02_27325, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 201.8 bits (512), Expect = 3.8e-48
Identity = 103/139 (74.10%), Positives = 115/139 (82.73%), Query Frame = 0

Query: 1   MNANQAFVVDFPRAAMEKEHKSRLPKPPTRLQKQAPASLHLDQLTSVSMSSGSADISSKA 60
           MNANQ+ VVDF RAAM+ +   R+PKPPTRLQKQAPA L+LDQL+SVSM S   +  SK 
Sbjct: 1   MNANQSAVVDFLRAAMDNDRGGRIPKPPTRLQKQAPAGLYLDQLSSVSMPSVGTETYSKT 60

Query: 61  VLPLLSPLPSSPQPFPETEGNRRAANGNTVD-GGYGDQRGMCFAPPGGWQHPAVATTFAD 120
           VLPLLSPLPSSPQPFPETEGNR  ANGN V+ GG GDQRG+ FA PGGWQHPAVA T+AD
Sbjct: 61  VLPLLSPLPSSPQPFPETEGNRMLANGNGVNSGGNGDQRGIVFASPGGWQHPAVAATYAD 120

Query: 121 PSTLFTFFQSQCIVNSKTP 139
           PSTLFTFFQS+CI+ S TP
Sbjct: 121 PSTLFTFFQSKCIITSSTP 139

BLAST of CmoCh02G007800 vs. TAIR 10
Match: AT1G07473.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G07476.1); Has 22 Blast hits to 22 proteins in 6 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 22; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 46.6 bits (109), Expect = 1.9e-05
Identity = 26/54 (48.15%), Positives = 32/54 (59.26%), Query Frame = 0

Query: 21 KSRLPKPPTRLQKQAPASLHLDQLTSVSMSSGSADISSKAVLPLLSPLPSSPQP 75
          KS  P+ PTRLQKQAP +LHL  +        S+D+     +PLLSPL  SP P
Sbjct: 10 KSESPRSPTRLQKQAPTALHLGLVPENPFLQQSSDVVGTTAIPLLSPLFVSPSP 63

The following BLAST results are available for this feature:

vs. ExPASy TrEMBL
Match Name       E-value   Identity   Description
A0A0A0KBQ7       1.9e-50   76.43      Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G188670 PE=4 SV=1
A0A5A7V856       3.0e-43   76.80      Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A6J1D5B7       2.1e-28   72.38      uncharacterized protein LOC111017071 OS=Momordica charantia OX=3673 GN=LOC111017...
A0A6P3ZH11       2.2e-17   55.36      uncharacterized protein LOC107411018 OS=Ziziphus jujuba OX=326968 GN=LOC10741101...
A0A2I4HTA9       3.1e-16   47.01      uncharacterized protein LOC108990495 OS=Juglans regia OX=51240 GN=LOC108990495 P...

vs. NCBI nr
Match Name       E-value   Identity   Description
KAG6605440.1     1.0e-69   97.83      hypothetical protein SDJN03_02757, partial [Cucurbita argyrosperma subsp. sorori...
XP_038902307.1   2.5e-52   78.57      uncharacterized protein LOC120088941 [Benincasa hispida]
XP_011657141.1   4.0e-50   76.43      uncharacterized protein LOC105435808 [Cucumis sativus] >XP_031742438.1 uncharact...
XP_023512816.1   4.4e-49   75.54      uncharacterized protein LOC111777440 [Cucurbita pepo subsp. pepo]
KAG6570684.1     3.8e-48   74.10      hypothetical protein SDJN03_29599, partial [Cucurbita argyrosperma subsp. sorori...

vs. TAIR 10
Match Name       E-value   Identity   Description
AT1G07473.1      1.9e-05   48.15      unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...

InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term    IPR Description                          Source        Source Term     Source Description     Alignment
None        No IPR available                         MOBIDB_LITE   mobidb-lite     disorder_prediction    coord: 1..39
None        No IPR available                         MOBIDB_LITE   mobidb-lite     disorder_prediction    coord: 65..102
None        No IPR available                         PANTHER       PTHR33912:SF5   F22G5.17               coord: 10..134
IPR040381   Uncharacterized protein At4g14450-like   PANTHER       PTHR33912       OS01G0939400 PROTEIN   coord: 10..134

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
CmoCh02G007800.1   CmoCh02G007800.1   mRNA