CmaCh16G012060 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh16G012060
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: protein LOW PSII ACCUMULATION 3, chloroplastic
Location: Cma_Chr16: 9149878 .. 9153967 (+)
RNA-Seq Expression: CmaCh16G012060
Synteny: CmaCh16G012060
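
The Location field packs chromosome, start, end, and strand into one string. The following is a minimal Python sketch of how it might be parsed and how the genomic span could be computed. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based coordinates with both ends included, which this page does not state explicitly.

```python
import re

def parse_location(text):
    """Split a 'Chrom: start .. end (strand)' string into its four parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1

chrom, start, end, strand = parse_location("Cma_Chr16: 9149878 .. 9153967 (+)")
print(chrom, start, end, strand, span_length(start, end))
```

Under that assumption the gene spans 4090 bp on the plus strand of Cma_Chr16.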
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAGATTTGTTTGATTACATTCGGTCCTCTCGTTTGGAGGAAAGAACATCAATCATCAATATTCTCTCACTCTCTCTCTTGTTTTTCTTTTTCACTTCGTAGTGTGCTTCATTCCTCAGGCAAGGCTTCCGATCGAGGAAAAATGTATTTCTCCAACGCGAAATTTGTCTCTAGCTCTACCATTCCGGCTGCGGTTCTTCCACTTCCTTCTCGAAATGTAATTTCTCGTGAAGTTTCTTTCTCGTTTTCTTTGGCGTTCTGCTCTATTTTACTTCGAAATTCTTTTTCCTCAAGTATTTTTTCTCTGTTTCTATCGGTAGTATGCGTGTTTCAGGATGTGTCACCGGAAAATTCATCGGAGTTTGCTTTCTAGTTCAGTGTTGGAGTTTATGGCGGGAAGAAATCTCAGAGTTCTGTCGAATTCGAGTAATCGCGAGAGTAGCGATACTTCGACTGATTTTGACGTTCCGTTTCCACGAGATTATTTTGATCTTCTTGATCAGGTTTGGATTCAATCACTTTAGAAATCTCGTTTATTTGTTTGTTTTTACTTCTTGGAGCCATTTTTTCTACGATCACTTCAATGCAGTGTCCGTTTTATTGTGTCATTTTGGTAGTTAGCTACTTCAATTAGCTTCCACTTAGCTTTTCATTTCATGTTTTTTTCAAATTACGTTTAACTTTGAGATGTAACTGTCATTTTCGGATCTCGTCTATAAAAAACTGCAGGCGAAAAAAGCAACTGAAGCGGCTTTAAGAGACAACAAACAGTTGATGGTTAGTTTTGAGCACATTTTGATTTCTGTGAATGGAAAGAAGCATTTATAGATTCATAAAATAGTCTGAATTTTCTGATCAAATCTATTGTACTTCAGGAGATTGAATTTCCAACTGCTGGACTCGAATCAGTTCCAGGTTCTTCCTGTTCTCGATTTATTGCATTTTAATTGTGTTACCATCTCAAGTCGTTCCCAGGCCTGTCTAGTAACTTTCTTTGCCTGTTACAGACCATATAACCTTATGTATCAGGTGATGGCGAAGGAGGAATCGAAATGACTGAGAGTATGCAATTGATTCGCCAATTTTGTGACTGCTTCATAGATCCACTGAAAGCCACCAGGACCAGAGTAGTAATGTCCTACTGCATTATCCTTCTATTCTCTTATAAAGTAAAATGTTTGCGTTCACATATATATGCTACATAATTTATCACTGGAAGTAACTAAGAAGCAATCGATCTTTTATGCGTTTCTCTCTAAAATTTCATCTGCATACAGATCAAAACTATGAAGATTTTTCCCTTTTTCTCGAGAAACTACTCAGTGAGGCCGCCACAATGTGGCTAACACATTTTCATCATCAGATGTAAGAAGGAGAAGTTTTCAAATTCATATCTAGTGATTATTAGTAGGCATGCAGATTTTTAATCCACATCTAATGGCTATTCCAGGGCAGCCATATGACTATGGCCTCATTGAATAACTTATCAATCTTTTCCTTATGATATCAGTACTAAAAATCTCTCTGGTGCTATAGACTATAAGTTAGAAAAATCTTCTTGTTTTCTTAACGATATATCAACTTCTCCTTTTATCTTTACGATTTATCAACTTTTGTCCTCCGAAATACATTCATATTATGAAATTTGAACGAGGAAATGTAATGCAGCTGGCATCCTGTTTATGATCATGGTTTTCTTTGTAATATGCTTGCTGTCTGATGTTCTCTCTCTTTTCTGACCATGAAAAAGCTCTTTCACATTCTAGACTGTTGGTTATAGTAAAGAGAAACATTTGTAGTTCTTTCCAGAGGCCAACGAAGTAAAATTTGCAAGAAATACAGCATTTGAGGGAGCTTCGTTCAAGTTGGACTACCTCACGAAACCATCATTTTTTGAGGATTTTGGTTTTGTTGAGAAAGTGAAAATGGCAGACCGTGTGAAACCAGAAGATGAACTTTTTCTGGTTGCATATCCATATTTCAATGTCAATGGTTAG
TGAAGCCTTCCTTTGTTTCTCTTCCCTCTTATTTTAAAGCAATTTGCCCTTTGTCGGTTAAATTACATCGCGTCTTATTCCCCATTGCAGAAATGCTTGTGGTTGAAGAGCTTTACAAGGAAGCTGTCGCAAACACAACACGAAAGCTCATCATATTTAATGGAGAGCTCGACCGCATTAGATCCGGCTGTATCCTTTTATCATCTTTTCTATTATCCAACGAAGAGTGGGATTGGGATCAACTTTTAAAACGGTAAAAATGACATTAAGTGGTTTCTGAATTTCAAGAAAAATGTCAAAATCTTAATTTTTTTTCATTTAATTGGTTCCTGAACTTTAAAAAATGTCCTCTAGTGCATATTTTTTTTTAAATTTATGAGACTTAGATTAATAGTTTATTAGTCTATTGAATTTAGCTTGTAATGATTTAGTCCCATAACTCGATTTTGGTACTTATAAATGTGTAACAATATAACTCTCAACTTTAATATGCAACCATTGATTACTATGTATTTTATAATTTGTAACGATTTAGTTCTTAGTCTAAACAAATAGTAATTAAACTGTTACGAATTGAAATTGGAGAAACTAAATTGTTAGTTGTTCATGTTACCGAGAAAGTTCAAAGACTTGTGAGTCCTTTTTAATCCTCTTTTACTTCATAAAATGAGATGAATTTTCCCATGACAGGTTGCTGGGTTGTAATTCCACAGTTCTCTTTTATTATTATTATTATTTTGTGTTTGTTATGTGACTGGACCTTAACAGGGGAAAGATTATCCGCCATTTTTCTACCCAAAGCTGGCGGCACTTACAAAGACTCTCTTCCCCAAGATGGAGACTGTATATTACATCCACAACTTCAAGGGACAGAAAGGAGGAGTTCTTTTCAGGTTGAACATGACACTTTTAAATCTTTTCCTGTTGATTATTGTTGGGAGTGAGTCCCACATTAATCTGTTAATTTAGTGGAAGATCATGAGTTTATAATTTGAGAATATCATCTCCATTGATATGAGGCTTTTTGGAGAAGCCCAAAGCAAAACTATGAGAGCCCTCCTTAACAAAGTCATACTATTGTGGAAATCCGTGATTTTTAACATGTTATCAAAGTCATGCCTTGAACTTAGTTATGTCAATAGAATTTTCAAATATCGAACAAAGGGGGTGTACTTTGTTCGAAGGCTCCAGAGAAAGGAGTCGAGCCTCGATTAAGAGGAGACTATTCTAGAGCTTCATAAACCTCAAACTTTGTTCGAGGGGAGGATTATTGGGAGTGAGTCCTCTTGGGGAGGAAGCCCAAAGCAAAACTATGAGAGCTTATGCTCAAAGTGGACAATATTTAGATGGTTTTTTTCCATAATAATAGAATTAAAACCTTCAATTATTGTGTTTCCCGTTTGTTATGATGGTAATGTAGGTGCTACCCAGGTCCTTGGAAGGTCCTTAGAAAAGTGAGGAATAAATTAGTCTGTGTTCATCAGCAACAGGACATGCCTTCTCTCAAGGAAGTCGCTTTAAACATTCTTCCATCGTCTTGAGGTTACGTAACTTCTACACTAATGAACTCCTACTTCACAATCATCTCACCCACATTCCATAAACTTTTCTTTTTCTGGTTCTATAGTTTTAGAAATCAAGCTTATAAAAGTTACTGATGCATATTTGGTTTATTTATTACTATAATAATTTCAAGATGTTAAAGATAAATTGCAAAAACAAAAATAATCTTTTCTATGATTTCGTCAATTCGTAGACAGATTAAATGTATGGTATTTAAAAATAGAGAAAATAAGAAATATTCAAACAAGAACCAAAAGTTTCAATTTACAATCACACAACCGCAGTATTGAATGAGGATTATTTGGTTTGACTGCATATGTATTGAGCGAACTACATTTTTTTTTTTTTTTTTTGGCAGGTCTTATTAGTCGATATTACCCTTCATTTTCATGCAAAGAGGATCATTTGTTGGTTAGCATCGAAGTCTTTTCTGGTAA
ATGCTACATCATCATAATCTGTATATTCCTATTCAATATTTGACGTCCATGAAATTTACGGAATGCCTAATAATACACATAATTTAAAAC

mRNA sequence

ATGGAAGATTTGTTTGATTACATTCGTGTGCTTCATTCCTCAGGCAAGGCTTCCGATCGAGGAAAAATGTATTTCTCCAACGCGAAATTTGTCTCTAGCTCTACCATTCCGGCTGCGGTTCTTCCACTTCCTTCTCGAAATTATGCGTGTTTCAGGATGTGTCACCGGAAAATTCATCGGAGTTTGCTTTCTAGTTCAGTGTTGGAGTTTATGGCGGGAAGAAATCTCAGAGTTCTGTCGAATTCGAGTAATCGCGAGAGTAGCGATACTTCGACTGATTTTGACGTTCCGTTTCCACGAGATTATTTTGATCTTCTTGATCAGGCGAAAAAAGCAACTGAAGCGGCTTTAAGAGACAACAAACAGTTGATGGAGATTGAATTTCCAACTGCTGGACTCGAATCAGTTCCAGGTGATGGCGAAGGAGGAATCGAAATGACTGAGAGTATGCAATTGATTCGCCAATTTTGTGACTGCTTCATAGATCCACTGAAAGCCACCAGGACCAGAGTATTCTTTCCAGAGGCCAACGAAGTAAAATTTGCAAGAAATACAGCATTTGAGGGAGCTTCGTTCAAGTTGGACTACCTCACGAAACCATCATTTTTTGAGGATTTTGGTTTTGTTGAGAAAGTGAAAATGGCAGACCGTGTGAAACCAGAAGATGAACTTTTTCTGGTTGCATATCCATATTTCAATGTCAATGAAATGCTTGTGGTTGAAGAGCTTTACAAGGAAGCTGTCGCAAACACAACACGAAAGCTCATCATATTTAATGGAGAGCTCGACCGCATTAGATCCGGCTGTATCCTTTTATCATCTTTTCTATTATCCAACGAAGAGTGGGATTGGGATCAACTTTTAAAACGGGGAAAGATTATCCGCCATTTTTCTACCCAAAGCTGGCGGCACTTACAAAGACTCTCTTCCCCAAGATGGAGACTGTGCTACCCAGGTCCTTGGAAGGTCCTTAGAAAAGTGAGGAATAAATTAGTCTGTGTTCATCAGCAACAGGACATGCCTTCTCTCAAGGAAGTCGCTTTAAACATTCTTCCATCGTCTTGAGGTCTTATTAGTCGATATTACCCTTCATTTTCATGCAAAGAGGATCATTTGTTGGTTAGCATCGAAGTCTTTTCTGGTAAATGCTACATCATCATAATCTGTATATTCCTATTCAATATTTGACGTCCATGAAATTTACGGAATGCCTAATAATACACATAATTTAAAAC

Coding sequence (CDS)

ATGGAAGATTTGTTTGATTACATTCGTGTGCTTCATTCCTCAGGCAAGGCTTCCGATCGAGGAAAAATGTATTTCTCCAACGCGAAATTTGTCTCTAGCTCTACCATTCCGGCTGCGGTTCTTCCACTTCCTTCTCGAAATTATGCGTGTTTCAGGATGTGTCACCGGAAAATTCATCGGAGTTTGCTTTCTAGTTCAGTGTTGGAGTTTATGGCGGGAAGAAATCTCAGAGTTCTGTCGAATTCGAGTAATCGCGAGAGTAGCGATACTTCGACTGATTTTGACGTTCCGTTTCCACGAGATTATTTTGATCTTCTTGATCAGGCGAAAAAAGCAACTGAAGCGGCTTTAAGAGACAACAAACAGTTGATGGAGATTGAATTTCCAACTGCTGGACTCGAATCAGTTCCAGGTGATGGCGAAGGAGGAATCGAAATGACTGAGAGTATGCAATTGATTCGCCAATTTTGTGACTGCTTCATAGATCCACTGAAAGCCACCAGGACCAGAGTATTCTTTCCAGAGGCCAACGAAGTAAAATTTGCAAGAAATACAGCATTTGAGGGAGCTTCGTTCAAGTTGGACTACCTCACGAAACCATCATTTTTTGAGGATTTTGGTTTTGTTGAGAAAGTGAAAATGGCAGACCGTGTGAAACCAGAAGATGAACTTTTTCTGGTTGCATATCCATATTTCAATGTCAATGAAATGCTTGTGGTTGAAGAGCTTTACAAGGAAGCTGTCGCAAACACAACACGAAAGCTCATCATATTTAATGGAGAGCTCGACCGCATTAGATCCGGCTGTATCCTTTTATCATCTTTTCTATTATCCAACGAAGAGTGGGATTGGGATCAACTTTTAAAACGGGGAAAGATTATCCGCCATTTTTCTACCCAAAGCTGGCGGCACTTACAAAGACTCTCTTCCCCAAGATGGAGACTGTGCTACCCAGGTCCTTGGAAGGTCCTTAGAAAAGTGAGGAATAAATTAGTCTGTGTTCATCAGCAACAGGACATGCCTTCTCTCAAGGAAGTCGCTTTAAACATTCTTCCATCGTCTTGA

Protein sequence

MEDLFDYIRVLHSSGKASDRGKMYFSNAKFVSSSTIPAAVLPLPSRNYACFRMCHRKIHRSLLSSSVLEFMAGRNLRVLSNSSNRESSDTSTDFDVPFPRDYFDLLDQAKKATEAALRDNKQLMEIEFPTAGLESVPGDGEGGIEMTESMQLIRQFCDCFIDPLKATRTRVFFPEANEVKFARNTAFEGASFKLDYLTKPSFFEDFGFVEKVKMADRVKPEDELFLVAYPYFNVNEMLVVEELYKEAVANTTRKLIIFNGELDRIRSGCILLSSFLLSNEEWDWDQLLKRGKIIRHFSTQSWRHLQRLSSPRWRLCYPGPWKVLRKVRNKLVCVHQQQDMPSLKEVALNILPSS
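
The protein sequence should correspond to a translation of the CDS. As a quick consistency check, the sketch below translates the first and last codons of the CDS with the standard genetic code, written in plain Python with no external libraries. The names `CODON_TABLE` and `translate` are illustrative, and the use of the standard nuclear code is an assumption; the page does not say which translation table was used.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGAAGATTTGTTTGATTACATTCGT"  # first 27 nt of the CDS above
cds_end = "ATTCTTCCATCGTCTTGA"            # last 18 nt of the CDS above

print(translate(cds_start))  # MEDLFDYIR
print(translate(cds_end))    # ILPSS
```

Both fragments agree with the start (MEDLFDYIR) and end (ILPSS) of the listed protein sequence. Passing the full CDS to `translate` would give the complete comparison.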
Homology
BLAST of CmaCh16G012060 vs. TAIR 10
Match: AT5G48790.1 (Domain of unknown function (DUF1995))

HSP 1 Score: 334.0 bits (855), Expect = 1.5e-91
Identity = 194/339 (57.23%), Positives = 229/339 (67.55%), Query Frame = 0

Query: 25  FSNAKFVSSSTIPAAVLPLPSRNYACFRMCHRKIHRSLLSSSVLEFMAGRNLRVLSNSSN 84
           FS A  VS +++ A  L   S+N  C    H K +    ++  L+F A        + S 
Sbjct: 5   FSIATTVSPASL-AGTLASNSKNVLC--SLHSKNNDITKTNRNLKFRA-------CSVSG 64

Query: 85  RESSDTSTDFDVPFPRDYFDLLDQAKKATEAALRDNKQLMEIEFPTAGLESVPGDGEGGI 144
             +++TS D +VPFPRDY +L++QAK+A E AL+D KQLMEIEFPT+GL SVPGDGEG  
Sbjct: 65  GYNNNTSVD-NVPFPRDYVELINQAKEAVEMALKDEKQLMEIEFPTSGLASVPGDGEGAT 124

Query: 145 EMTESMQLIRQFCDCFIDPLKATRTRVFFPEANEVKFARNTAFEGASFKLDYLTKPSFFE 204
           EMTES+ +IR+FCD  + P KA  TR+FFPEANEVKFA+ T F G  FKLDYLTKPS FE
Sbjct: 125 EMTESINMIREFCDRLLAPEKARSTRIFFPEANEVKFAQKTVFGGTYFKLDYLTKPSLFE 184

Query: 205 DFGFVEKVKMADRVKPEDELFLVAYPYFNVNEMLVVEELYKEAVANTTRKLIIFNGELDR 264
           DFGF E+VKMADRVKPEDELFLVAYPYFNVNEMLVVEELYKEAV NT RKLIIFNGELDR
Sbjct: 185 DFGFFERVKMADRVKPEDELFLVAYPYFNVNEMLVVEELYKEAVVNTDRKLIIFNGELDR 244

Query: 265 IRSG---------CILLSSFLLSNEEWDWDQLLKRGKIIRHFSTQSWRHLQRLSSPRWRL 324
           IRSG            L+  LL   E  +         I +F  Q    L R        
Sbjct: 245 IRSGYYPKFFYPKLAALTKTLLPKMETVY--------YIHNFKGQKGGVLFR-------- 304

Query: 325 CYPGPWKVLRKVRNKLVCVHQQQDMPSLKEVALNILPSS 355
           CYPGPW+VLR+ RNK +CVHQQ+ MPSLKEVAL+IL S+
Sbjct: 305 CYPGPWQVLRRTRNKYICVHQQESMPSLKEVALDILASA 316
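
The percentages in the HSP summary follow directly from the reported counts: matches divided by the alignment length. A small Python sketch of that arithmetic is shown here; the helper name `percent` is hypothetical, and rounding to two decimals is assumed from the way the page displays the values.

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, rounded to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)

identity = percent(194, 339)   # identical residues / alignment length
positives = percent(229, 339)  # identical or conservatively substituted residues

print(identity, positives)  # 57.23 67.55
```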

The following BLAST results are available for this feature:
Match Name     E-value    Identity (%)    Description
AT5G48790.1    1.5e-91    57.23           Domain of unknown function (DUF1995)
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
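
IPR Term: IPR018962
IPR Description: Domain of unknown function DUF1995
Source: PFAM
Source Term: PF09353
Source Description: DUF1995
Alignment: coord: 99..347; e-value: 3.3E-34; score: 118.7

IPR Term: None
IPR Description: No IPR available
Source: PANTHER
Source Term: PTHR34051:SF1
Source Description: PROTEIN LOW PSII ACCUMULATION 3, CHLOROPLASTIC ISOFORM X1
Alignment: coord: 77..268

IPR Term: None
IPR Description: No IPR available
Source: PANTHER
Source Term: PTHR34051:SF1
Source Description: PROTEIN LOW PSII ACCUMULATION 3, CHLOROPLASTIC ISOFORM X1
Alignment: coord: 316..353

IPR Term: IPR044687
IPR Description: Protein LOW PSII ACCUMULATION 3
Source: PANTHER
Source Term: PTHR34051
Source Description: PROTEIN LOW PSII ACCUMULATION 3, CHLOROPLASTIC
Alignment: coord: 316..353; coord: 77..268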

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
CmaCh16G012060.1    CmaCh16G012060.1    mRNA