CmaCh05G005950 (gene) Cucurbita maxima (Rimu) v1.1

Overview
NameCmaCh05G005950
Typegene
OrganismCucurbita maxima (Cucurbita maxima (Rimu) v1.1)
DescriptionTranslation initiation factor IF-2
LocationCma_Chr05: 2965971 .. 2970405 (-)
RNA-Seq ExpressionCmaCh05G005950
SyntenyCmaCh05G005950
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TGATGCTTTTCGTTTTCGCCTAAATTTATTTTTGTTATTCCTATTCTGAAGCAGTTCGAACTGTAGAGATAATTCTTGAGCTTGGGAAGAAGGAATGCTAAGAACAAGGAAGACGGGAGGAAGAAGGTTGTTTCTTCTGATATAGAGCCTAATCAATAGGCAAAAAAACCTAAAACCTAAAACCAATTATGGTTTCAGAGCCCAAACCCACTGTTTCGAAGCAAAAACCTGGTAATCAAGATGGCGGCTCCAGGCTAAAGATTGATAGCTCCATCAAGAACTCAAATCCTTCTTCCAAACACAAATCCGTCTCTGTTGTCACTAAATCCGAGGTACTAATTTTGGAAATCTGTCGCGAATTGTTTTTTCTTTTTTTTTTTTTTTTTTTGTCTGAATTGTTTTGATGTTTTGAAACGCATCTGAGATCGGTAGGAGTAACGGATTTGGGGTAATTGAAATTTGGAGTTTCTGGAGTTTCTTTCAGTTACTTCCGATGTTCTAATTTCGTATTTCATTGGGGGATTCATCTCATTGGGCGGGCTGCTGTTCTTGGTGGCTTTGGAATTTTTTTGAATCCTTCCTTTTCTCTGATCGTTGATTTTGAATTTGGGTTTCAATTGATTTGTTTATGGAAGGTGAAGTCAAAGACAAGTTCGAGTTCTTCAAGGGCGACAACAAAAACTACTACAACAGTTACTACTGCTAAAGTGAGAGAAAAGAAAGTGTATAATTTGCCTGGTCAGAAATGTGATCCACCTGAAGAGGTTTTTACTTTAATCCTTGATGGTTTTTGTTGTTGATATGATTGATGAACTCATCATTCTTGCTCTTTTGTATGCTATGAACAGAGAGAGCCCCTTCGGATATTTTACGAGTCCTTGTCGAAACAGATACCAGCAAGTGAAATGGCAGAGTTTTGGTGCGGTTTGTTTTCTGATATTCTTTTGAGCATTCTTTGAGGTCTTTATAAAACTAAGAATATAGTATTGATTTGAACCAATCCATATGCTTCCATTTTTAATAAAATTTCGACTAGAAACATACAAGTTCTGTAATGTGCATCTTCGTAGATATCTGGTAATCATGTGTGTAAGGGTATAATCGGTGATTTGAGAGAAAAAGCAATAAGAACTCTGACATAGAATATTATTTCTAGTGCTATAGGATACGATTAGTTAGCTGACATTTTGAAATTTGAAGTTGGTTCATTAGAACCATATAAACCGATACCAAGAGCATGAGCTTTAGGCCTATGATAATCCATGATCGTCTCAATCAGTGAAGTTGGATGATGATTATCATCCACATAATCTACTACTACTTTAGGTTTTCTATTCGAATTGTAGGGACTAAGTGCAGTCGAAATTGTACACGAGAGGCTGCCTTTTAGGGACAGATAAGCTCTGTCAAACAACTTTACCACTAAGTGTTTCCTTTGTGTTTTGGTTATTCCAGTTTTCATGTGACACTTGCTCAGTATTTTGGTTACTCAGACAAGATGAACCATTGGGTTATGACTGAAACTTTTCATGTCTGATCCAATATTGATGATAGAAACGATAGATTGGATAAGTTGCTAGCTCTTCATATGAAAACAAATCCAACTTTATGCAAATGTCTTCTGCTGGAAGCTTATGCTTATATTCATGATGGAGTATATCAGAATTGAGTATAATAGCCTGCATTTGATCCTTCTATGTTCTTTCTACTTTCTGAATTTGTTCATTTTGTAACTTGAGTTTATGGATAAGATATAAAGTGTGCAATGTGAGTTATTTACATGTGACTGACGTGTAGAGAGTGTGAGATCCCACATCGGTTGGAGAGGGGAACGAAACATTCCTAATAAGGGTGTGGAAGGCTGTTATAGTTGGTTTCAGAGCCAAACACCAGGCGATGTGCCAGTAAGGAGACCAAGCCCCAAAGGGGGTTGGACATGAGGTGGTATACCAACAAGGACATTGGGCCCCAAAGGGGGGTGGATTATGAGATCCCTCATCGGTTGGGGAGGAGAATGAAGCATTCTTTATAAAGGTGTGAAAACCTCTCCCTAGTAGACGCATTTTAAAAACTTGAGGGGAAGCCCAAAAGGGAAAGTCCAAAGAGGACAATATCCGCTGACGGTGGGTTTAGGTTGTTATAGAGAGCATGTTATTCTTATAGAACATGTTTGTTCTTCAAGTTAAAGTTATATATTTAGTGTTGTAATCATTACCTCAAGGGGTGGTATTGCCAGTAAGAGCTTAGGAAATTTGAGGAAATTTCTCAACAAAGGTCTTGGATTCAAACCACCCACCCAATAGGAACTTAATTAATGAAAACCTTTGAACTCTCAATGTGCTATCGTTTGGGCGCGAGAAGTCTCTCCATGGAGTCGTAGGGTAGGAATGAATTCCACATATATCTCGAAATAAACAATTAAACAAAAGTCATCTGTTACTGGTTTTAATGGATATATTGCGGGGCAGCTGTAACGAGCTCAAGGAGCTATGTTTAGGTGGAAGAGTATATTTTTCCGCCCCCAACAAGATTGCAGTTCAAAGACAATTATCTTGTTTTAGGATTGTTAAGTTACCTGAGAAATGGCTGGATCATGGACTGAGTCATTTGTTTGCCAGAATGAGAATTCTTTCCCTGAGTGAAGGTCTGGTGGTGATTTGTTGGCTTTTAGGAACAATGCACCAAAAATTGGTGTATCCCACCGATGGATTACTGGTTGATATTAAGGAGCGTTAACATTAAAATAAAGGTTGTATTAAGTTGATCTTCTAGGAGAGTTGAGAACAAGTTTTTTCTTTTTGTTTGAAATTCATTTGTTACTTTGGCCTTTCTTCATGCGAAAGTACTTCTAACTTACAATGGATTAAAAAAGTTGTGAGATCCCACGTCGGTTCGGGAGAAGAATGAAGCATTCTTTAAAGGTGTGGATATCTCTCCCTAGATGATGCGTTTTAAAAATCTTGAGGGAAAACCTAAAAATGACAATATTTGGTAGTAGTGGGCTTGGGCCATTACCAATGGTATCAGAGTCAGACACCGGGCAATGTGCCAGCGAGGAGGCTGAACCCTAAAGGGGGTGGACAAGAGATTGTGTGCCAACAAGGACACTGAGTGGGGGGTCCCACATCGATTGGAGAAAGGAACGAGTGCTAACGAGGACACTGAGTCCTGAAGGGGGGTGGATTGTGAGATCCTACATCGGTTGGGGAGGAGTATGAAGCATTCTTTGAAGGATGTGGAAACCTATCCCTAGCAGACGTGTTTTAAAAACCTTGAGAGGAAGCTCGAAAGGGGAACCCCAAAGAGGACAATATCTGCTAGCAGTGGATTTGGACTTAGACTTGGAGTGTTACAAAAGTTCTAAAAGCTGAATGTGCACTTGAAGAAGATTCCAAAATTATAATGTATAAGACAGGACATAAGACCTTCTGTAGCAATTAGCTCTTGAATAGTTAAGATCAAAGCAAGGTGCTGTGGGTTTACGAATTATAAACACAAAGGTTTCATGTTTTTTGGTACGAACAGGATTCAATCAGTCGTCTTATGTACTTAAACACCGATTTAGGATGCTCAAATTTAGAGCTAAAAAGAAAGCTAGTGACAATCATTTTCATGTCTTGTGAAAAATATCCGAATATTTCATGTCTTATGATTTTTGAAGCATGCCATTATATTTATTGATATATTCGTTATTAACATGAGATTGGTAGCCCTTTGTTATCTGGTTTCAAATATTTGAATCGTTTATCCTTTTCGTTGTTTTGTTCTGTTAAAACGAGATTGCTAGCACCCTTTTATTGACTTGGGTTGAGAATGCGTCTCTTTTCAACATTTTTATGGTTTCTGTGCATTACTCTATTGAGAAGCCAAAAGGTATAACCAGTCTAAATTTCTAGGATGATGGAGCATGGCATGTTATCACCCGAAAAGGCGAAAAAGGCATACGAGAAGAAACTGAGAAGGCAAAAGGAACAGAGATCTGGGACTCCGATTAAATCACCTAAACTGCTGAGCAAACCAGAGAGTTCGCAGAGGCCACAGCAACCCTCTAAGAATGGCGATCTAAAAGCAAAGCAAAAGGTCACTACGAATGATAGCGACGACGATGACGACAACTTCATTCTAAGTCCCAAGAGAAGGAAAATGTAGGGAAAACACACCCGCCCTGCCTTCGTTCTTTATTATCATACTTTTCGAGGAATTACTTGATTAAAATTTCCTTAGAGTTGATGATTATAACCCAACAGTATTTACCTGGACATTCTTTTATACTCCACAGCCTGTGTATGAACAATAATTTTAGAAGAGACTAGTCTTATGGTTCAAATCCTTTTCATATAATGTGTCATTTATCTCCTTTTCCATAAGAGGACCTAGTCTGAAATCCAAGCCTAATTTGCCAAGAAATCAAGGATGGAGACATTTTGAGATCCAG

mRNA sequence

TGATGCTTTTCGTTTTCGCCTAAATTTATTTTTGTTATTCCTATTCTGAAGCAGTTCGAACTGTAGAGATAATTCTTGAGCTTGGGAAGAAGGAATGCTAAGAACAAGGAAGACGGGAGGAAGAAGGTTGTTTCTTCTGATATAGAGCCTAATCAATAGGCAAAAAAACCTAAAACCTAAAACCAATTATGGTTTCAGAGCCCAAACCCACTGTTTCGAAGCAAAAACCTGGTAATCAAGATGGCGGCTCCAGGCTAAAGATTGATAGCTCCATCAAGAACTCAAATCCTTCTTCCAAACACAAATCCGTCTCTGTTGTCACTAAATCCGAGGTGAAGTCAAAGACAAGTTCGAGTTCTTCAAGGGCGACAACAAAAACTACTACAACAGTTACTACTGCTAAAGTGAGAGAAAAGAAAGTGTATAATTTGCCTGGTCAGAAATGTGATCCACCTGAAGAGAGAGAGCCCCTTCGGATATTTTACGAGTCCTTGTCGAAACAGATACCAGCAAGTGAAATGGCAGAGTTTTGGATGATGGAGCATGGCATGTTATCACCCGAAAAGGCGAAAAAGGCATACGAGAAGAAACTGAGAAGGCAAAAGGAACAGAGATCTGGGACTCCGATTAAATCACCTAAACTGCTGAGCAAACCAGAGAGTTCGCAGAGGCCACAGCAACCCTCTAAGAATGGCGATCTAAAAGCAAAGCAAAAGGTCACTACGAATGATAGCGACGACGATGACGACAACTTCATTCTAAGTCCCAAGAGAAGGAAAATGTAGGGAAAACACACCCGCCCTGCCTTCGTTCTTTATTATCATACTTTTCGAGGAATTACTTGATTAAAATTTCCTTAGAGTTGATGATTATAACCCAACAGTATTTACCTGGACATTCTTTTATACTCCACAGCCTGTGTATGAACAATAATTTTAGAAGAGACTAGTCTTATGGTTCAAATCCTTTTCATATAATGTGTCATTTATCTCCTTTTCCATAAGAGGACCTAGTCTGAAATCCAAGCCTAATTTGCCAAGAAATCAAGGATGGAGACATTTTGAGATCCAG

Coding sequence (CDS)

ATGGTTTCAGAGCCCAAACCCACTGTTTCGAAGCAAAAACCTGGTAATCAAGATGGCGGCTCCAGGCTAAAGATTGATAGCTCCATCAAGAACTCAAATCCTTCTTCCAAACACAAATCCGTCTCTGTTGTCACTAAATCCGAGGTGAAGTCAAAGACAAGTTCGAGTTCTTCAAGGGCGACAACAAAAACTACTACAACAGTTACTACTGCTAAAGTGAGAGAAAAGAAAGTGTATAATTTGCCTGGTCAGAAATGTGATCCACCTGAAGAGAGAGAGCCCCTTCGGATATTTTACGAGTCCTTGTCGAAACAGATACCAGCAAGTGAAATGGCAGAGTTTTGGATGATGGAGCATGGCATGTTATCACCCGAAAAGGCGAAAAAGGCATACGAGAAGAAACTGAGAAGGCAAAAGGAACAGAGATCTGGGACTCCGATTAAATCACCTAAACTGCTGAGCAAACCAGAGAGTTCGCAGAGGCCACAGCAACCCTCTAAGAATGGCGATCTAAAAGCAAAGCAAAAGGTCACTACGAATGATAGCGACGACGATGACGACAACTTCATTCTAAGTCCCAAGAGAAGGAAAATGTAG

Protein sequence

MVSEPKPTVSKQKPGNQDGGSRLKIDSSIKNSNPSSKHKSVSVVTKSEVKSKTSSSSSRATTKTTTTVTTAKVREKKVYNLPGQKCDPPEEREPLRIFYESLSKQIPASEMAEFWMMEHGMLSPEKAKKAYEKKLRRQKEQRSGTPIKSPKLLSKPESSQRPQQPSKNGDLKAKQKVTTNDSDDDDDNFILSPKRRKM
Homology
BLAST of CmaCh05G005950 vs. TAIR 10
Match: AT5G11600.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 11 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G19990.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 156.8 bits (395), Expect = 1.9e-38
Identity = 121/242 (50.00%), Postives = 148/242 (61.16%), Query Frame = 0

Query: 1   MVSEPKPTVSKQKPGNQDG--GSRLKIDSSIKNS-----------NPSSKHKSVSVVT-K 60
           M ++ +P+ +K +P +      SR+KID SIK+            + S    SVS VT K
Sbjct: 1   MATQERPSSAKSEPRDASSSLSSRVKIDPSIKDKKKIATSSRPIMSDSKPRSSVSTVTAK 60

Query: 61  SEVKSK-----------TSSSSS--RATTKTTTTV--------TTAKV--------REKK 120
           SE K K           TS+++S  +   +TT+TV        T+A V        REKK
Sbjct: 61  SEAKPKVPINSVKTIATTSAAASLVKGKAQTTSTVVSLVKAKTTSATVSLVKGKAKREKK 120

Query: 121 VYNLPGQKCDPPEEREPLRIFYESLSKQIPASEMAEFWMMEHGMLSPEKAKKAYEKKLRR 180
           VY+L GQK DPPEEREPLRIFYESLSKQIP SEMAEFW+MEHGMLSPEKAK+A+EKK R+
Sbjct: 121 VYSLAGQKFDPPEEREPLRIFYESLSKQIPGSEMAEFWLMEHGMLSPEKAKRAFEKKQRK 180

Query: 181 QKEQRSGTPIKS-PKLLSKPESSQRPQQPSKNGDLKAKQKVTTNDSDDDDDNFILSPKRR 199
            K+ R GTP KS P   SK ESSQR      NG    K+K   +D DDDDD+FILS KRR
Sbjct: 181 MKQIRMGTPSKSAPTFSSKAESSQRTSASKNNGLDARKKKKVVDDDDDDDDDFILSHKRR 240

BLAST of CmaCh05G005950 vs. TAIR 10
Match: AT1G19990.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: cultured cell; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G11600.1); Has 11256 Blast hits to 7192 proteins in 541 species: Archae - 6; Bacteria - 629; Metazoa - 4714; Fungi - 936; Plants - 545; Viruses - 34; Other Eukaryotes - 4392 (source: NCBI BLink). )

HSP 1 Score: 88.2 bits (217), Expect = 8.1e-18
Identity = 85/223 (38.12%), Postives = 117/223 (52.47%), Query Frame = 0

Query: 10  SKQKP--GNQDGGSRLK-------------IDSSIKNS--NPSSKHKSVSVVTKSEVKSK 69
           +K+KP  GN  G  +LK             I SS+  S   P  K + +    + +  SK
Sbjct: 27  AKKKPTNGNNAGSKKLKKEENDDDDDDNKPIKSSVSGSRAKPVKKKEEIDKDDEKKPVSK 86

Query: 70  TSSS--SSRATTKTTTTVTTAKVREKKVYNLPGQKCDPPEEREPLRIFYESLSKQIPASE 129
            +SS   S+   K        K RE+KVY+LPGQK + P+ER+PLRIFYESL KQIP S+
Sbjct: 87  RNSSVGVSKENKKPEKEEEVKKKRERKVYDLPGQKREQPDERDPLRIFYESLYKQIPTSD 146

Query: 130 MAEFWMMEHGMLSPEKAKKAYEKKLRRQKEQRSGTPIKS----PK------LLSKPESSQ 189
           MA+ W+ME G+L  EKAKK  EKKL  QK  +  +P+KS    P+       + K E  +
Sbjct: 147 MAQIWLMESGLLPAEKAKKVLEKKL--QKGGKLSSPVKSAASTPRSNSKSVTVKKKEVQK 206

Query: 190 RPQQ----PSKNGDLK--AKQKVTTNDSDDDDDNFILSPKRRK 198
            P +      K  D K   K++   +D DD DD+F+ S   +K
Sbjct: 207 SPSEALSNKKKGNDSKPTTKKRKKNSDDDDSDDDFLASRVSKK 247

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
AT5G11600.11.9e-3850.00unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT1G19990.18.1e-1838.12unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..92
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 127..144
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 127..198
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 145..173
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 174..198
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..73
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 78..92
NoneNo IPR availablePANTHERPTHR33828:SF1OS05G0596200 PROTEINcoord: 8..197
NoneNo IPR availablePANTHERPTHR33828OS05G0596200 PROTEINcoord: 8..197

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmaCh05G005950.1CmaCh05G005950.1mRNA