CmoCh18G005700.1 (mRNA) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh18G005700.1
Type: mRNA
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: Nuclear transcription factor Y subunit gamma
Location: Cmo_Chr18: 4736528 .. 4738493 (+)
Sequence length: 668
RNA-Seq Expression: CmoCh18G005700.1
Synteny: CmoCh18G005700.1
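
The Location field can be parsed to recover the genomic span of the feature. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (as in GFF3); the function names `parse_location` and `span_length` are hypothetical and are not part of any database API.

```python
import re

# Pattern for strings such as "Cmo_Chr18: 4736528 .. 4738493 (+)".
LOCATION_RE = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text):
    """Split a location string into (seqid, start, end, strand)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Cmo_Chr18: 4736528 .. 4738493 (+)")
print(seqid, strand, span_length(start, end))  # Cmo_Chr18 + 1966
```

Under that assumption the feature spans 1966 bp on the plus strand of Cmo_Chr18, introns included.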
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TTAGCGACCCTATGAAGGATCTTAAAAAGGGAAAAGTGGGTTGGAAAGTGACAGGACTGCCTGATGAATACAAAATTTTTGAAAGTACGTTGGCTGTTTATATGCTTTATGTTGCTTTAATGAATTCTGTTAATGAACAGATACTTTGTTGACACTTGCAATGATCGTAGTTTTTTAATACTGTTAAAACGTATGTCCTATCAAAAAAGGGTGCTTGTTTTAGAGGGATATTGCTATAGTGAACTCGGTTAACAAACAGAAACTTTGTTAACACTTACGTTGATTGTAGCTTTGTATTGCTGTTATTAACTCTGCTGAAACAAGCACAGTTCTTGTTTTAATAGGATACCATTTTAATGGCACTGTTTACATGTCATATATGTTGCAATGTTATACAATACTCCTTTTTTCTATCCCAATATTCAACATTTCTCCTTGTCACAGTTAACGTTGTGTGTGGCCTATTGATCCATTTGGTATCACCACATGTAAGACTTGTTTGGTTGGTTTCCTTCCCTCTTTCCGCCATTCTTCTTTTGTTCTCCAAACTTTAGAAGTTGAAGTCTCAAAGAAGATGCAGGTCTTTTCTTGGTGTGTTGTCCTTGGGAGGATCTATTCTTTAAACTGTGCTGTCCTACATTGGTTGGAGAGGAGAACAAAACACCCTTTATAAGGGTGTAGAAACCTTCCCCTAGCATACGTGTTTTAAAGCTTTGAGGAGAAGCCCAAAAGAGAAAGCCCAAAGAGGACAATATTTGCTAGCGATGGGTCTGGGCCGTTACATGAACTTTGTTAAAAGAGCCTTGCTCATGCTAATTAGTGTATCTTGTGTAAATGTGTTGATGAAGATTTGAATGGCTTCGTTCTTTAGACTTTGTGGTTGACATTATGGTGAATATGGTGAATCTTCTGAAAGTGGTATGAACCAGACATCAATCATGCAATATAAACTTGAAGGAAAATGTGAAGAAAATGTTGTCAAAATTATTAAGAGATCTTTCCATTCATGTCAGGTTGAAGAAGAATGAAGTTTCATAATTTGGAGTTATGATGTTTCATTAATGCATTTTAGTACTTTTTATTTACTTTAGGTTACGTTTACACTTGTGATCATAATTTATGAATGTTACGTTTACACTTGTGACCATAATTTATGAATGTTACAACTCTTAAAATGCCTAAGGGAAGGTCAAGGGTTTCATTTTGAGACTATTTCATTTGCAGATTGCTGTATTGGAGGAACAGAAAAACGAGGCTGCAGAAAATGAAACTGCTGACAAAGTTGAATATGAAGCTAACAAATCAAATGATGTTGAACCAGGAAGTGCAGTGGCGAGTGCCAAGGGCGAGCTTCAAGACGAGAAACCAGATATTAACGATGTACCAATGGACGAAAGTCAGGTAATTTATTTTTCGTCATAAACATGTTAATCTGCAACCTTCGAGATGAAACCGGTTCAATGGTTCACCTCTTGATTTCATGTCTCAGGACAACGACCATGTAGTGCGCCAAGATTTGAACGAAAGTACTTTGGATTTAAGTCTGAACTTGAATGCTCTCAATGACGATGGCGAAGCTGGTTCGAAAGCTGATCACATTAGAGATGGCAAGAGGAAGGGCTAACTTTTAGAAGCAGCTGCAGTTTCGTGCTTACAAATTCAATTCTCTTGTTGGAACTGAGTTACTTGACATCCCACATCTCTGCTAATCCTGGAAGTATATGGTTCTTCAATACTAGTATGATATGATTGCTTATTTGAGCTAATTTTGATTTGTTCACAGAAAAAGATAAACAGTACCATATTTTATCTTTGAAATTCGATATTGATGTTAGAAGCTGTATCGAGAGTCGAGGGAATTGTGTACGATTATCCACTGAGGATTTAGCTTTATTTGGTTTCTGTCAAATGATTATCTAATGGACCCTAAGAGTTAAGAAAGCCTTACTATTTCAATCAAACATTT

mRNA sequence

TTAGCGACCCTATGAAGGATCTTAAAAAGGGAAAAGTGGGTTGGAAAGTGACAGGACTGCCTGATGAATACAAAATTTTTGAAACTAACAAATCAAATGATGTTGAACCAGGAAGTGCAGTGGCGAGTGCCAAGGGCGAGCTTCAAGACGAGAAACCAGATATTAACGATGTACCAATGGACGAAAGTCAGGACAACGACCATGTAGTGCGCCAAGATTTGAACGAAAGTACTTTGGATTTAAGTCTGAACTTGAATGCTCTCAATGACGATGGCGAAGCTGGTTCGAAAGCTGATCACATTAGAGATGGCAAGAGGAAGGGCTAACTTTTAGAAGCAGCTGCAGTTTCGTGCTTACAAATTCAATTCTCTTGTTGGAACTGAGTTACTTGACATCCCACATCTCTGCTAATCCTGGAAGTATATGGTTCTTCAATACTAGTATGATATGATTGCTTATTTGAGCTAATTTTGATTTGTTCACAGAAAAAGATAAACAGTACCATATTTTATCTTTGAAATTCGATATTGATGTTAGAAGCTGTATCGAGAGTCGAGGGAATTGTGTACGATTATCCACTGAGGATTTAGCTTTATTTGGTTTCTGTCAAATGATTATCTAATGGACCCTAAGAGTTAAGAAAGCCTTACTATTTCAATCAAACATTT

Coding sequence (CDS)

ATGAAGGATCTTAAAAAGGGAAAAGTGGGTTGGAAAGTGACAGGACTGCCTGATGAATACAAAATTTTTGAAACTAACAAATCAAATGATGTTGAACCAGGAAGTGCAGTGGCGAGTGCCAAGGGCGAGCTTCAAGACGAGAAACCAGATATTAACGATGTACCAATGGACGAAAGTCAGGACAACGACCATGTAGTGCGCCAAGATTTGAACGAAAGTACTTTGGATTTAAGTCTGAACTTGAATGCTCTCAATGACGATGGCGAAGCTGGTTCGAAAGCTGATCACATTAGAGATGGCAAGAGGAAGGGCTAA

Protein sequence

MKDLKKGKVGWKVTGLPDEYKIFETNKSNDVEPGSAVASAKGELQDEKPDINDVPMDESQDNDHVVRQDLNESTLDLSLNLNALNDDGEAGSKADHIRDGKRKG
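
The protein sequence follows from the CDS by reading it codon by codon until the stop codon. The sketch below assumes the standard genetic code (NCBI translation table 1); the helper name `translate` is hypothetical, and unknown codons are reported as `X` as a simplification.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks an ambiguous codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First ten codons and last seven codons of the CDS listed above.
print(translate("ATGAAGGATCTTAAAAAGGGAAAAGTGGGT"))  # MKDLKKGKVG
print(translate("GATGGCAAGAGGAAGGGCTAA"))           # DGKRKG
```

Passing the full CDS to the same function should give the 104-residue protein listed above, with the terminal TAA read as the stop codon.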
Homology
BLAST of CmoCh18G005700.1 vs. ExPASy TrEMBL
Match: A0A6J1EHR2 (uncharacterized protein LOC111432647 OS=Cucurbita moschata OX=3662 GN=LOC111432647 PE=4 SV=1)

HSP 1 Score: 156.4 bits (394), Expect = 6.6e-35
Identity = 80/82 (97.56%), Positives = 81/82 (98.78%), Query Frame = 0

Query: 23  FETNKSNDVEPGSAVASAKGELQDEKPDINDVPMDESQDNDHVVRQDLNESTLDLSLNLN 82
           +E NKSNDVEPGSAVASAKGELQDEKPDINDVPMDESQDNDHVVRQDLNESTLDLSLNLN
Sbjct: 137 YEANKSNDVEPGSAVASAKGELQDEKPDINDVPMDESQDNDHVVRQDLNESTLDLSLNLN 196

Query: 83  ALNDDGEAGSKADHIRDGKRKG 105
           ALNDDGEAGSKADHIRDGKRKG
Sbjct: 197 ALNDDGEAGSKADHIRDGKRKG 218
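
The Identity and Positives figures of an HSP can be recomputed from the middle line of the alignment. In BLAST protein output a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The following is a minimal sketch under that convention; the function name `hsp_stats` is hypothetical.

```python
def hsp_stats(midline):
    """Count identities and positives from a BLAST protein alignment midline.

    The midline must cover every alignment column, including any spaces.
    """
    length = len(midline)
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "identity_pct": round(100 * identities / length, 2),
        "positive_pct": round(100 * positives / length, 2),
    }


# Midline of the HSP above, with its two display rows joined end to end.
midline = (
    "+E NKSNDVEPGSAVASAKGELQDEKPDINDVPMDESQDNDHVVRQDLNESTLDLSLNLN"
    "ALNDDGEAGSKADHIRDGKRKG"
)
print(hsp_stats(midline))
```

For this HSP the counts come out as 80 identities and 81 positives over 82 columns, which matches the reported 97.56% and 98.78%.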

BLAST of CmoCh18G005700.1 vs. ExPASy TrEMBL
Match: A0A6J1JX59 (uncharacterized protein LOC111490478 OS=Cucurbita maxima OX=3661 GN=LOC111490478 PE=4 SV=1)

HSP 1 Score: 151.4 bits (381), Expect = 2.1e-33
Identity = 77/81 (95.06%), Positives = 80/81 (98.77%), Query Frame = 0

Query: 23  FETNKSNDVEPGSAVASAKGELQDEKPDINDVPMDESQDNDHVVRQDLNESTLDLSLNLN 82
           +E NKSNDVEPGSAVASAKG+LQDEKP+INDVPMDESQDNDHVVRQDLNESTLDLSLNLN
Sbjct: 137 YEANKSNDVEPGSAVASAKGDLQDEKPNINDVPMDESQDNDHVVRQDLNESTLDLSLNLN 196

Query: 83  ALNDDGEAGSKADHIRDGKRK 104
           ALNDDGEAGSKADHIRDGKRK
Sbjct: 197 ALNDDGEAGSKADHIRDGKRK 217

BLAST of CmoCh18G005700.1 vs. ExPASy TrEMBL
Match: A0A6J1CLM1 (uncharacterized protein LOC111012736 OS=Momordica charantia OX=3673 GN=LOC111012736 PE=4 SV=1)

HSP 1 Score: 130.6 bits (327), Expect = 3.9e-27
Identity = 69/81 (85.19%), Positives = 73/81 (90.12%), Query Frame = 0

Query: 24  ETNKSNDVEPGSAVASAKGELQDEKPDINDVPMDESQDNDHVVRQDLNESTLDLSLNLNA 83
           E NKS DVEP   +A+AKGEL DEKPDINDVPM+ESQDNDH VRQDLNESTLDLSLNLNA
Sbjct: 133 EANKSYDVEP---MANAKGELLDEKPDINDVPMEESQDNDHAVRQDLNESTLDLSLNLNA 192

Query: 84  LNDDGEAGSKADHIRDGKRKG 105
           LNDDGE GSKADHIRDGKR+G
Sbjct: 193 LNDDGETGSKADHIRDGKRRG 210

BLAST of CmoCh18G005700.1 vs. ExPASy TrEMBL
Match: A0A5D3CMI7 (Putative serine/threonine-protein kinase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold302G001230 PE=4 SV=1)

HSP 1 Score: 127.9 bits (320), Expect = 2.5e-26
Identity = 69/81 (85.19%), Positives = 72/81 (88.89%), Query Frame = 0

Query: 24  ETNKSNDVEPGSAVASAKGELQDEKPDINDVPMDESQDNDHVVRQDLNESTLDLSLNLNA 83
           E  KSNDVEP S VA+ KGELQDEKPDINDVPM+ESQDNDH VRQDLNESTLDLSLNLNA
Sbjct: 138 EAIKSNDVEPSSTVAT-KGELQDEKPDINDVPMEESQDNDHPVRQDLNESTLDLSLNLNA 197

Query: 84  LNDDGEAGSKADHIRDGKRKG 105
           L+D GE  SKADHIRDGKRKG
Sbjct: 198 LDDGGETSSKADHIRDGKRKG 217

BLAST of CmoCh18G005700.1 vs. ExPASy TrEMBL
Match: A0A5A7UEU0 (Nuclear transcription factor Y subunit gamma OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold24G00310 PE=4 SV=1)

HSP 1 Score: 127.9 bits (320), Expect = 2.5e-26
Identity = 69/81 (85.19%), Positives = 72/81 (88.89%), Query Frame = 0

Query: 24  ETNKSNDVEPGSAVASAKGELQDEKPDINDVPMDESQDNDHVVRQDLNESTLDLSLNLNA 83
           E  KSNDVEP S VA+ KGELQDEKPDINDVPM+ESQDNDH VRQDLNESTLDLSLNLNA
Sbjct: 26  EAIKSNDVEPSSTVAT-KGELQDEKPDINDVPMEESQDNDHPVRQDLNESTLDLSLNLNA 85

Query: 84  LNDDGEAGSKADHIRDGKRKG 105
           L+D GE  SKADHIRDGKRKG
Sbjct: 86  LDDGGETSSKADHIRDGKRKG 105

BLAST of CmoCh18G005700.1 vs. TAIR 10
Match: AT4G22320.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 14 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G55210.1). )

HSP 1 Score: 58.2 bits (139), Expect = 4.7e-09
Identity = 37/89 (41.57%), Positives = 55/89 (61.80%), Query Frame = 0

Query: 16  LPDEYKIFETNKSNDVEPGSAVASAKGELQ-DEKPDINDVPMDESQ--------DNDHVV 75
           + ++ K+ + +K ++ +     +  K E++ +EKPDINDVPM++ Q        D + VV
Sbjct: 144 IDEDNKVEQEDKVDEDKTVEESSEKKAEVEVEEKPDINDVPMEDIQVEEKIVQDDEEKVV 203

Query: 76  RQDLNESTLDLSLNLNALNDDGEAGSKAD 96
           RQDLNEST+DL LNLNA + D E   K D
Sbjct: 204 RQDLNESTVDLGLNLNANDADAENDPKED 232

BLAST of CmoCh18G005700.1 vs. TAIR 10
Match: AT4G22320.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G55210.1); Has 8953 Blast hits to 5363 proteins in 542 species: Archae - 33; Bacteria - 806; Metazoa - 2454; Fungi - 831; Plants - 279; Viruses - 151; Other Eukaryotes - 4399 (source: NCBI BLink). )

HSP 1 Score: 57.8 bits (138), Expect = 6.1e-09
Identity = 37/90 (41.11%), Positives = 55/90 (61.11%), Query Frame = 0

Query: 16  LPDEYKIFETNKSNDVEPGSAVASAKGELQ-DEKPDINDVPMDESQ---------DNDHV 75
           + ++ K+ + +K ++ +     +  K E++ +EKPDINDVPM++ Q         D + V
Sbjct: 144 IDEDNKVEQEDKVDEDKTVEESSEKKAEVEVEEKPDINDVPMEDIQQVEEKIVQDDEEKV 203

Query: 76  VRQDLNESTLDLSLNLNALNDDGEAGSKAD 96
           VRQDLNEST+DL LNLNA + D E   K D
Sbjct: 204 VRQDLNESTVDLGLNLNANDADAENDPKED 233

The following BLAST results are available for this feature:

ExPASy TrEMBL:
Match Name | E-value | Identity (%) | Description
A0A6J1EHR2 | 6.6e-35 | 97.56 | uncharacterized protein LOC111432647 OS=Cucurbita moschata OX=3662 GN=LOC1114326...
A0A6J1JX59 | 2.1e-33 | 95.06 | uncharacterized protein LOC111490478 OS=Cucurbita maxima OX=3661 GN=LOC111490478...
A0A6J1CLM1 | 3.9e-27 | 85.19 | uncharacterized protein LOC111012736 OS=Momordica charantia OX=3673 GN=LOC111012...
A0A5D3CMI7 | 2.5e-26 | 85.19 | Putative serine/threonine-protein kinase OS=Cucumis melo var. makuwa OX=1194695 ...
A0A5A7UEU0 | 2.5e-26 | 85.19 | Nuclear transcription factor Y subunit gamma OS=Cucumis melo var. makuwa OX=1194...

TAIR 10:
Match Name | E-value | Identity (%) | Description
AT4G22320.2 | 4.7e-09 | 41.57 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT4G22320.1 | 6.1e-09 | 41.11 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...

InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 27..61
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 43..61
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 83..104
None | No IPR available | PANTHER | PTHR34572 | GOLGIN FAMILY A PROTEIN | coord: 23..96
None | No IPR available | PANTHER | PTHR34572:SF1 | GOLGIN FAMILY A PROTEIN | coord: 23..96

Relationships

This mRNA is a part of the following gene feature(s):

Feature Name | Unique Name | Type
CmoCh18G005700 | CmoCh18G005700 | gene


The following exon feature(s) are a part of this mRNA:

Feature Name | Unique Name | Type
CmoCh18G005700.1:exon:7758 | CmoCh18G005700.1:exon:7758 | exon
CmoCh18G005700.1:exon:7759 | CmoCh18G005700.1:exon:7759 | exon
CmoCh18G005700.1:exon:7760 | CmoCh18G005700.1:exon:7760 | exon


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name | Unique Name | Type
CmoCh18G005700.1:five_prime_utr | CmoCh18G005700.1:five_prime_utr | five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature Name | Unique Name | Type
CmoCh18G005700.1:cds | CmoCh18G005700.1:cds | CDS
CmoCh18G005700.1:cds | CmoCh18G005700.1:cds_2 | CDS
CmoCh18G005700.1:cds | CmoCh18G005700.1:cds_3 | CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name | Unique Name | Type
CmoCh18G005700.1:three_prime_utr | CmoCh18G005700.1:three_prime_utr | three_prime_UTR


The following polypeptide feature(s) derive from this mRNA:

Feature Name | Unique Name | Type
CmoCh18G005700.1 | CmoCh18G005700.1-protein | polypeptide