CmoCh18G007850 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh18G007850
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: heme-binding protein 2-like
Location: Cmo_Chr18: 9377831 .. 9379366 (+)
RNA-Seq Expression: CmoCh18G007850
Synteny: CmoCh18G007850
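The Location field can be turned into a feature length with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF convention; the page itself does not state this). The function name `parse_location` is hypothetical and only serves the illustration.

```python
import re

def parse_location(text):
    """Split a location string such as 'Cmo_Chr18: 9377831 .. 9379366 (+)'.

    Assumes 1-based, inclusive coordinates (an assumption, not stated on the page).
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        # Inclusive coordinates: both end points count towards the length.
        "length": end - start + 1,
    }

loc = parse_location("Cmo_Chr18: 9377831 .. 9379366 (+)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the gene spans 1536 bases on the plus strand.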
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TTAGGACCGAGATCGATTCGTTACAAATATGAACTAAATCTCCCACGATATACTATACTCAAGTATATCATCGATCAAAAGATCGAGGGTATAAATTCAAAGCCATTAAACGTATAATTACAAATCGGGTTCGAATCAGAAGCAAGTTTGTAGCACAAACGAAGAAAACTCGAGGATGAACGGTAAGATAGTGATCAACTTTGCTCTAACAATATGCTTCTTTTGTTGTAGCTCAGGCAGAGTGATTGAATCTCCACATTATAATGTGATTCATGTGGAGGCAGAATTTGAGATTAGACAGTATAAACAAGTCTCATGGGTATCAGCTCTTGTCCAAGGAACAGCCTCCTTTGAGAAGTCCACCCAGCAAGGCTTCCACAGGTATAAGTTGCTATTTTCCGTTTGGTTGCCAAGAAAATTCCACCAAGAAAATTCCACCAAGAAAATTCCACCAAGAAAATGTGGTGGGATTTTCCTTCTGATTTGAACTAATAATCTCTTGTATTGGTTGGCATCAGATTGTATCAATACATTCATGGTGCTAATAGCAACTCTTCTCACTTTCTTATCACTTCTCCTGTCACGACGACCATTATGGCATCGACGCGTGGCCCCGAGCGCTTGGTTAGGTATTATCTTCCTTCTATGTATACCGAAAACCCGCCGCTGCCCAATTCTGAACTCAATGTTCAGTTTGAGAAGTGGAGAAGCAATTGCTTAGCAGTCAGGAGGTTTTCTGGGTTTGCTAAAGATGATAACATCAACAAAGAAGTTGAAGCTCTAAAGAGCAGCTTGACCAAGTACCTACCTAAGAGTTCAGCCATCTCAGAATACACCGTTGCTCAGTATAATTCGTCACGTCACCTGTCGGGTCGTTTGAACGAAGTCTGGATCGACGTTTCAGCTGTTACATCAGAGGGCTGTCAACGCCGGTAACGAGTTACTGGAGTTGGCTTGAGGACTGTTGCAGGCTGTGTGAATGTTTATGGCACAATAAAATGTACAATAAAACTAAATAAGAAGCCATACTGTGATGTATGTAACACTGTTCTTGAGGTATTTCTGTTGTTTGTGTAATATCAATTGAATAATGGATTCCTTTCCGAATCCAAAGCATCCTTTGAGAAAGTTTTGGTCAAGTATCAGTTGAACAATGCACCATATGAAGCAGAATAAGCCAATTAATCTTTGTCTAATTCCTTTAAGTTTCTTTTGATCTTCACTTCATCACTTACTAAAAAGTAAGTTTGGCTTCATAAAGTTTCTTTTGATCTTCCCTTCATCACTTATTTTTACTAAGGTTGGCTTCATTCACTTCATCACTTACTAAAAAGTAAGTTTGGCTTCATAAAGTTTCTTTTGATCTTCCCTTCATCACTTATTTTTACTAAGGTTGGCTTCATAGTAGCAGGTAACAGATATTGTCCGCATTAGCCTGTTACATATCATCACAGTTTAAAATGTGTCTACTATGGAGAGGGTTCTAACCCTACTTCCACTAGTGTCTCGCACCATCCAGTGACTGACTCTGATACC

mRNA sequence

TTAGGACCGAGATCGATTCGTTACAAATATGAACTAAATCTCCCACGATATACTATACTCAAGTATATCATCGATCAAAAGATCGAGGGTATAAATTCAAAGCCATTAAACGTATAATTACAAATCGGGTTCGAATCAGAAGCAAGTTTGTAGCACAAACGAAGAAAACTCGAGGATGAACGGTAAGATAGTGATCAACTTTGCTCTAACAATATGCTTCTTTTGTTGTAGCTCAGGCAGAGTGATTGAATCTCCACATTATAATGTGATTCATGTGGAGGCAGAATTTGAGATTAGACAGTATAAACAAGTCTCATGGGTATCAGCTCTTGTCCAAGGAACAGCCTCCTTTGAGAAGTCCACCCAGCAAGGCTTCCACAGATTGTATCAATACATTCATGGTGCTAATAGCAACTCTTCTCACTTTCTTATCACTTCTCCTGTCACGACGACCATTATGGCATCGACGCGTGGCCCCGAGCGCTTGGTTAGGTATTATCTTCCTTCTATGTATACCGAAAACCCGCCGCTGCCCAATTCTGAACTCAATGTTCAGTTTGAGAAGTGGAGAAGCAATTGCTTAGCAGTCAGGAGGTTTTCTGGGTTTGCTAAAGATGATAACATCAACAAAGAAGTTGAAGCTCTAAAGAGCAGCTTGACCAAGTACCTACCTAAGAGTTCAGCCATCTCAGAATACACCGTTGCTCAGTATAATTCGTCACGTCACCTGTCGGGTCGTTTGAACGAAGTCTGGATCGACGTTTCAGCTGTTACATCAGAGGGCTGTCAACGCCGGTAACGAGTTACTGGAGTTGGCTTGAGGACTGTTGCAGGCTGTGTGAATGTTTATGGCACAATAAAATGTACAATAAAACTAAATAAGAAGCCATACTGTGATGTATGTAACACTGTTCTTGAGGTATTTCTGTTGTTTGTGTAATATCAATTGAATAATGGATTCCTTTCCGAATCCAAAGCATCCTTTGAGAAAGTTTTGGTCAAGTATCAGTTGAACAATGCACCATATGAAGCAGAATAAGCCAATTAATCTTTGTCTAATTCCTTTAAGTTTCTTTTGATCTTCACTTCATCACTTACTAAAAAGTAAGTTTGGCTTCATAAAGTTTCTTTTGATCTTCCCTTCATCACTTATTTTTACTAAGGTTGGCTTCATAGTAGCAGGTAACAGATATTGTCCGCATTAGCCTGTTACATATCATCACAGTTTAAAATGTGTCTACTATGGAGAGGGTTCTAACCCTACTTCCACTAGTGTCTCGCACCATCCAGTGACTGACTCTGATACC

Coding sequence (CDS)

ATGAACGGTAAGATAGTGATCAACTTTGCTCTAACAATATGCTTCTTTTGTTGTAGCTCAGGCAGAGTGATTGAATCTCCACATTATAATGTGATTCATGTGGAGGCAGAATTTGAGATTAGACAGTATAAACAAGTCTCATGGGTATCAGCTCTTGTCCAAGGAACAGCCTCCTTTGAGAAGTCCACCCAGCAAGGCTTCCACAGATTGTATCAATACATTCATGGTGCTAATAGCAACTCTTCTCACTTTCTTATCACTTCTCCTGTCACGACGACCATTATGGCATCGACGCGTGGCCCCGAGCGCTTGGTTAGGTATTATCTTCCTTCTATGTATACCGAAAACCCGCCGCTGCCCAATTCTGAACTCAATGTTCAGTTTGAGAAGTGGAGAAGCAATTGCTTAGCAGTCAGGAGGTTTTCTGGGTTTGCTAAAGATGATAACATCAACAAAGAAGTTGAAGCTCTAAAGAGCAGCTTGACCAAGTACCTACCTAAGAGTTCAGCCATCTCAGAATACACCGTTGCTCAGTATAATTCGTCACGTCACCTGTCGGGTCGTTTGAACGAAGTCTGGATCGACGTTTCAGCTGTTACATCAGAGGGCTGTCAACGCCGGTAA

Protein sequence

MNGKIVINFALTICFFCCSSGRVIESPHYNVIHVEAEFEIRQYKQVSWVSALVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLITSPVTTTIMASTRGPERLVRYYLPSMYTENPPLPNSELNVQFEKWRSNCLAVRRFSGFAKDDNINKEVEALKSSLTKYLPKSSAISEYTVAQYNSSRHLSGRLNEVWIDVSAVTSEGCQRR
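The protein sequence is the translation of the CDS listed above it. The relationship can be checked with a short sketch that uses the standard genetic code; the function name `translate` is hypothetical, and only the first 21 bases of the CDS are used here to keep the example small.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First seven codons of the CDS shown above.
print(translate("ATGAACGGTAAGATAGTGATC"))  # MNGKIVI
```

Passing the full CDS to `translate` should give the protein sequence followed by `*` for the terminal TAA stop codon.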
Homology
BLAST of CmoCh18G007850 vs. ExPASy TrEMBL
Match: A0A6J1FZD2 (heme-binding protein 2-like OS=Cucurbita moschata OX=3662 GN=LOC111449291 PE=3 SV=1)

HSP 1 Score: 420.2 bits (1079), Expect = 4.9e-114
Identity = 207/207 (100.00%), Positives = 207/207 (100.00%), Query Frame = 0

Query: 1   MNGKIVINFALTICFFCCSSGRVIESPHYNVIHVEAEFEIRQYKQVSWVSALVQGTASFE 60
           MNGKIVINFALTICFFCCSSGRVIESPHYNVIHVEAEFEIRQYKQVSWVSALVQGTASFE
Sbjct: 1   MNGKIVINFALTICFFCCSSGRVIESPHYNVIHVEAEFEIRQYKQVSWVSALVQGTASFE 60

Query: 61  KSTQQGFHRLYQYIHGANSNSSHFLITSPVTTTIMASTRGPERLVRYYLPSMYTENPPLP 120
           KSTQQGFHRLYQYIHGANSNSSHFLITSPVTTTIMASTRGPERLVRYYLPSMYTENPPLP
Sbjct: 61  KSTQQGFHRLYQYIHGANSNSSHFLITSPVTTTIMASTRGPERLVRYYLPSMYTENPPLP 120

Query: 121 NSELNVQFEKWRSNCLAVRRFSGFAKDDNINKEVEALKSSLTKYLPKSSAISEYTVAQYN 180
           NSELNVQFEKWRSNCLAVRRFSGFAKDDNINKEVEALKSSLTKYLPKSSAISEYTVAQYN
Sbjct: 121 NSELNVQFEKWRSNCLAVRRFSGFAKDDNINKEVEALKSSLTKYLPKSSAISEYTVAQYN 180

Query: 181 SSRHLSGRLNEVWIDVSAVTSEGCQRR 208
           SSRHLSGRLNEVWIDVSAVTSEGCQRR
Sbjct: 181 SSRHLSGRLNEVWIDVSAVTSEGCQRR 207

BLAST of CmoCh18G007850 vs. ExPASy TrEMBL
Match: A0A6J1HVL2 (heme-binding protein 2-like OS=Cucurbita maxima OX=3661 GN=LOC111466562 PE=3 SV=1)

HSP 1 Score: 406.0 bits (1042), Expect = 9.5e-110
Identity = 200/208 (96.15%), Positives = 204/208 (98.08%), Query Frame = 0

Query: 1   MNGKIVINFALTICFF-CCSSGRVIESPHYNVIHVEAEFEIRQYKQVSWVSALVQGTASF 60
           MNGKI++NFALTICFF CCSSGRVIESPHYNVIHVEAEFEIRQYKQVSW+SALVQGTASF
Sbjct: 1   MNGKILMNFALTICFFCCCSSGRVIESPHYNVIHVEAEFEIRQYKQVSWISALVQGTASF 60

Query: 61  EKSTQQGFHRLYQYIHGANSNSSHFLITSPVTTTIMASTRGPERLVRYYLPSMYTENPPL 120
           EKSTQQGFHRLYQYIHGAN NSSHFLITSPVTTTIMAST GPERLVRYYLPSMYTENPPL
Sbjct: 61  EKSTQQGFHRLYQYIHGANINSSHFLITSPVTTTIMASTHGPERLVRYYLPSMYTENPPL 120

Query: 121 PNSELNVQFEKWRSNCLAVRRFSGFAKDDNINKEVEALKSSLTKYLPKSSAISEYTVAQY 180
           PNSELNVQFEKWRSNCLAVRRFSGFAKDDNINKEVEALKSSL KYLPKSSAISEYTVAQY
Sbjct: 121 PNSELNVQFEKWRSNCLAVRRFSGFAKDDNINKEVEALKSSLNKYLPKSSAISEYTVAQY 180

Query: 181 NSSRHLSGRLNEVWIDVSAVTSEGCQRR 208
           NSSRHLSGRLNEVW+DVSAVTSEGCQRR
Sbjct: 181 NSSRHLSGRLNEVWLDVSAVTSEGCQRR 208

BLAST of CmoCh18G007850 vs. ExPASy TrEMBL
Match: A0A0A0KVJ5 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G599820 PE=3 SV=1)

HSP 1 Score: 358.6 bits (919), Expect = 1.7e-95
Identity = 171/206 (83.01%), Positives = 189/206 (91.75%), Query Frame = 0

Query: 1   MNGKIVINFALTIC-FFCCSSGRVIESPHYNVIHVEAEFEIRQYKQVSWVSALVQGTASF 60
           M GK++INFALTIC FFCCSSGRVIESPHY VIHVE++FEIRQYKQ+SW+SALVQGTASF
Sbjct: 1   MKGKVLINFALTICFFFCCSSGRVIESPHYKVIHVESDFEIRQYKQISWMSALVQGTASF 60

Query: 61  EKSTQQGFHRLYQYIHGANSNSSHFLITSPVTTTIMASTRGPERLVRYYLPSMYTENPPL 120
           EKST+QGFHRLYQY+HGANSNS HFL TSPVTTTIM  TR PERLVRYYLP M  ENPPL
Sbjct: 61  EKSTEQGFHRLYQYMHGANSNSYHFLFTSPVTTTIMTLTREPERLVRYYLPIMNAENPPL 120

Query: 121 PNSELNVQFEKWRSNCLAVRRFSGFAKDDNINKEVEALKSSLTKYLPKSSAISEYTVAQY 180
           PNSELNV FEKWR+NCLAVRRF GFAKDDNINKE++ALKSSL+KYLP+S+A+SEYT+AQY
Sbjct: 121 PNSELNVHFEKWRNNCLAVRRFPGFAKDDNINKEIDALKSSLSKYLPESAAVSEYTIAQY 180

Query: 181 NSSRHLSGRLNEVWIDVSAVTSEGCQ 206
           NSSR L GRLNEVW+DVS  T+EGCQ
Sbjct: 181 NSSRRLLGRLNEVWLDVSGFTTEGCQ 206

BLAST of CmoCh18G007850 vs. ExPASy TrEMBL
Match: A0A5A7ST56 (Heme-binding protein 2-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold35G001240 PE=3 SV=1)

HSP 1 Score: 353.2 bits (905), Expect = 7.3e-94
Identity = 167/206 (81.07%), Positives = 190/206 (92.23%), Query Frame = 0

Query: 1   MNGKIVINFALTIC-FFCCSSGRVIESPHYNVIHVEAEFEIRQYKQVSWVSALVQGTASF 60
           M GK++INFALTIC FFCCSSGRVIESPHY VIHVE++FEIRQYKQ+SW+SALVQGT+SF
Sbjct: 1   MKGKVLINFALTICFFFCCSSGRVIESPHYKVIHVESDFEIRQYKQISWMSALVQGTSSF 60

Query: 61  EKSTQQGFHRLYQYIHGANSNSSHFLITSPVTTTIMASTRGPERLVRYYLPSMYTENPPL 120
           EKSTQQGFHRLYQY+HGANSNS  FL TSPVTTTIM STR PE LVRYYLP+M  ENPPL
Sbjct: 61  EKSTQQGFHRLYQYMHGANSNSYRFLFTSPVTTTIMTSTREPEHLVRYYLPTMNAENPPL 120

Query: 121 PNSELNVQFEKWRSNCLAVRRFSGFAKDDNINKEVEALKSSLTKYLPKSSAISEYTVAQY 180
           PNSELN+ FEKW++NCLAVRRF GFAKDDNINKE++ALKS+L+K+LP+S+AISEYT+AQY
Sbjct: 121 PNSELNIHFEKWKNNCLAVRRFPGFAKDDNINKEIDALKSTLSKHLPESAAISEYTIAQY 180

Query: 181 NSSRHLSGRLNEVWIDVSAVTSEGCQ 206
           NSSR L GRLNEVW+DVS+ T+EGCQ
Sbjct: 181 NSSRRLLGRLNEVWLDVSSFTTEGCQ 206

BLAST of CmoCh18G007850 vs. ExPASy TrEMBL
Match: A0A1S3BEF7 (uncharacterized protein LOC103488984 OS=Cucumis melo OX=3656 GN=LOC103488984 PE=3 SV=1)

HSP 1 Score: 353.2 bits (905), Expect = 7.3e-94
Identity = 167/206 (81.07%), Positives = 190/206 (92.23%), Query Frame = 0

Query: 1   MNGKIVINFALTIC-FFCCSSGRVIESPHYNVIHVEAEFEIRQYKQVSWVSALVQGTASF 60
           M GK++INFALTIC FFCCSSGRVIESPHY VIHVE++FEIRQYKQ+SW+SALVQGT+SF
Sbjct: 1   MKGKVLINFALTICFFFCCSSGRVIESPHYKVIHVESDFEIRQYKQISWMSALVQGTSSF 60

Query: 61  EKSTQQGFHRLYQYIHGANSNSSHFLITSPVTTTIMASTRGPERLVRYYLPSMYTENPPL 120
           EKSTQQGFHRLYQY+HGANSNS  FL TSPVTTTIM STR PE LVRYYLP+M  ENPPL
Sbjct: 61  EKSTQQGFHRLYQYMHGANSNSYRFLFTSPVTTTIMTSTREPEHLVRYYLPTMNAENPPL 120

Query: 121 PNSELNVQFEKWRSNCLAVRRFSGFAKDDNINKEVEALKSSLTKYLPKSSAISEYTVAQY 180
           PNSELN+ FEKW++NCLAVRRF GFAKDDNINKE++ALKS+L+K+LP+S+AISEYT+AQY
Sbjct: 121 PNSELNIHFEKWKNNCLAVRRFPGFAKDDNINKEIDALKSTLSKHLPESAAISEYTIAQY 180

Query: 181 NSSRHLSGRLNEVWIDVSAVTSEGCQ 206
           NSSR L GRLNEVW+DVS+ T+EGCQ
Sbjct: 181 NSSRRLLGRLNEVWLDVSSFTTEGCQ 206

BLAST of CmoCh18G007850 vs. TAIR 10
Match: AT1G17100.1 (SOUL heme-binding family protein )

HSP 1 Score: 105.9 bits (263), Expect = 3.9e-23
Identity = 68/186 (36.56%), Positives = 95/186 (51.08%), Query Frame = 0

Query: 24  IESPHYNVIHVEAEFEIRQYKQVSWVSALVQGTASFEKSTQQGFHRLYQYIHGANSNSSH 83
           IE P Y ++H    +EIR+Y    WVS       S   +T+  F +L+ YI G N     
Sbjct: 45  IECPSYELVHSGNGYEIRRYNNTVWVSTEPIPDISLVDATRTAFFQLFAYIQGKNEYHQK 104

Query: 84  FLITSPVTTTIMASTRGP----ERLVRYYLPSMYTENPPLPNSELNVQFEKWRSNCLAVR 143
             +T+PV + +  S  GP       V +Y+P    +N P P    N+  +KW S  +AVR
Sbjct: 105 IEMTAPVISQVSPSD-GPFCESSFTVSFYVPK---KNQPDPAPSENLHIQKWNSRYVAVR 164

Query: 144 RFSGFAKDDNINKEVEALKSSL-----TKYLPKS------SAISEYTVAQYNSSRHLSGR 195
           +FSGF  DD+I ++  AL SSL        + KS       + S YTVAQYNS    SGR
Sbjct: 165 QFSGFVSDDSIGEQAAALDSSLKGTAWANAIAKSKEDGGVGSDSAYTVAQYNSPFEFSGR 224

The following BLAST results are available for this feature:
ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1FZD2 | 4.9e-114 | 100.00 | heme-binding protein 2-like OS=Cucurbita moschata OX=3662 GN=LOC111449291 PE=3 S...
A0A6J1HVL2 | 9.5e-110 | 96.15 | heme-binding protein 2-like OS=Cucurbita maxima OX=3661 GN=LOC111466562 PE=3 SV=...
A0A0A0KVJ5 | 1.7e-95 | 83.01 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G599820 PE=3 SV=1
A0A5A7ST56 | 7.3e-94 | 81.07 | Heme-binding protein 2-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaf...
A0A1S3BEF7 | 7.3e-94 | 81.07 | uncharacterized protein LOC103488984 OS=Cucumis melo OX=3656 GN=LOC103488984 PE=...

TAIR 10
Match Name | E-value | Identity (%) | Description
AT1G17100.1 | 3.9e-23 | 36.56 | SOUL heme-binding family protein
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR006917 | SOUL haem-binding protein | PFAM | PF04832 | SOUL | coord: 24..194; e-value: 3.6E-41; score: 141.1
IPR006917 | SOUL haem-binding protein | PANTHER | PTHR11220 | HEME-BINDING PROTEIN-RELATED | coord: 10..205
IPR011256 | Regulatory factor, effector binding domain superfamily | GENE3D | 3.20.80.10 | Regulatory factor, effector binding domain | coord: 23..203; e-value: 7.7E-43; score: 148.3
IPR011256 | Regulatory factor, effector binding domain superfamily | SUPERFAMILY | 55136 | Probable bacterial effector-binding domain | coord: 16..196
None | No IPR available | PANTHER | PTHR11220:SF36 | SOUL HEME-BINDING PROTEIN-RELATED | coord: 10..205

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CmoCh18G007850.1 | CmoCh18G007850.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
cellular_component | GO:0110165 | cellular anatomical entity