CmaCh04G002210 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh04G002210
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: Pentatricopeptide repeat-containing protein
Location: Cma_Chr04: 1065243 .. 1067518 (+)
RNA-Seq Expression: CmaCh04G002210
Synteny: CmaCh04G002210
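
The Location field packs the chromosome, start, end, and strand into one string. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state this), the span can be parsed and its length computed as follows. `parse_location` is a hypothetical helper written for this illustration, not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'chrom: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Cma_Chr04: 1065243 .. 1067518 (+)")
# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span_length = end - start + 1
print(chrom, strand, span_length)
```

Under that assumption the feature spans 2276 positions on the plus strand of Cma_Chr04.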
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAGCGTTTTTCAGCTCCTCCCCCTCTTTCACCTTAGCTTTGTTTATTCCTTCTGTCATTCAAATTTTGGAGCCCAACGTTAAGAAGGCCTAAATTTTGATTGCTAAAATGGTTCAAATTCTTTCATATGTTTTATAATTAATTAATTTATTCTCCCACAAGATTTATCTCTAAATCTGTGAAGGGATAGGGGTTTTAGATTTATGTGGGTTGTTTTCATTTTTATGTTGTTGTTCTTCTGAACCAATTTATGTATGTAATATAATTTTATCAGGTAGTACCGTACTTGATAAAATATTATATTTATGGATACTCCATTCTTTGTTTTTTCTTTTCAAGAACCTCATATTGACCTTTGAAGGATTGTGAAACAAAGGGCTGTCCATTATAAATACAGAAGAAATTAGAGATTGGGAAACGTAAGATTAGGAGCTTTGATTCTCTCCTGTATTAGAAAATCTTCCTCTTCCAGAAGTGCGAGATGTGGAAGCCCTCCATCTCCCTCAAAACAGCCCATTCATTTGCAAATTCCTTTGCAAAGCTGTATGGGCCATGTGTGTTAGTGCATGCTCTACCAATATCACTGCATTGTACGATGCTCTCCATTCTGTTGCCTCAAATCTCTCCAGTCTGAGAATTTGGCCCCCACGAATGAAATCATGTTCACAAATATAATAACATTATCACATTTACTACAAACTAGACCTAATAAAACAAACCTCTTGTGAAGAATCTCCACAATTCTGTACAACTCATCAACTTCGAAGGTGCCCATGGATTCCCACTTGTCGCTTCTTCTGTTAACCTAAAACATTCGACAATGACATAGTTTAACTAGCTAAGATATATATTTTCATTTTCGAGCGGCAAATGTCGAAAGAAAACTTACCATTGTACGTGAGTAACCGATTTTGACATGTTCTTTTGAGGAAGAAACAAACTTTCTTCTTCAATTCATGGAAGTAAGAGTGTTCCATTTTGGTTGAAGAACAATCGAAGAGAAACCCATTTTATTTAAGGCCTCATATTCAAATGACATAAGAAGAAACATAACTGTTCCATTTATGTCTCAACTGCTCCCTGGAAGTTGATGTGCAAATCAAATACACACAAAGCCAACTCCAAGAATTCATTCAAACCTTATCTATCTTAAGCTTTATGATTTAGGTATCTACCTAATATGCTCCCTAAAAGTCACGGGGTTCTTTATTTACTTATGGTAATATAATGTTACCTTCTTCAATTTTTTTTAATTTATTTCTTCGTTTTCTAAAATATTTTTTTGCAAACGTGACTCGAGGTCCAATACGTCGATGTTGTCCATGACATCCGAATACGTCGATGTTGTCCATGATATCCGAATTTTTCACAGCTAACCTGTTCATTGAGTTAGAATAAGTGTGCACTATCATTGTCTGCTTAATATTTTATTTGATGGCTACTCATTCAATTATATGGTTACATTAGAAGAAAAAAAACACCGAAAATGTTTGTTATAATCGGAACTCTCCCCTTTATTTGACGATTTAATTTACATTATTTTCAGGGAAAAATGATGGAAAATTACAATTCCAAACAGATGAACAGAGGACGAAATTCAGGGATCAGTTTTTAGAGAGAAAAAGAAGCAGAGGAGAAGCGGGATTCAATGAACAACGGTAACCGGCGGCGGCTTAAATCTGCTACCGGCGCCACCGTAACTGATATTTCTCTGATGCGGCAACGCGAGCATGGCGGTGTAGTCCAAAAAATCATTCAGCGCAATAACGGCCTGCAAAATCCGTTTCAAATCATCGGCGGCGGAGAATCCGAAGCATCCTCTTTCGCGGACAGCATACACGTAAAAAGTTAAACAACCTTTTCTAAATTTAAACCTAGCCATCTCACAGCCGTTCAGTCCCCGAATCAGCCGTCGATTCACATCGCCGAGAGTCATTTCCGCCGGCCACCTGTGGCCGACGGCAGCCTTCAGAGCCTCCGTCTCGAACACAAACAGA
ATCACCTTCGATTTCTTCGTGCCGAGGCCGAACGTCAGATTGTGCCAAGCGTGGTGCGCCAGAATCGTGAAATTCGTCGGCAGGAACACGGCGGTGGAATTGAACGGAAGGCGATTTTGGTTGTTATTGTCGCTGTTGCCGTCGTGGTGCGGCGGCAGAGAAAGTAGCTGGAGGAATTCAAGAGGCCAGGCTCGGCGAATTGTGAAGGATTTGACAACGAATTGGGCGCCGAATCGGAGACCTTTGACAGCGGTAGATGGCTTGAAGGAGAAATCG

mRNA sequence

ATGGAAGCGTTTTTCAGCTCCTCCCCCTCTTTCACCTTAGCTTTGTTTATTCCTTCTGTCATTCAAATTTTGGAGCCCAACAGAGAAAAAGAAGCAGAGGAGAAGCGGGATTCAATGAACAACGGTAACCGGCGGCGGCTTAAATCTGCTACCGGCGCCACCGTAACTGATATTTCTCTGATGCGGCAACGCGAGCATGGCGGTGTAGTCCAAAAAATCATTCAGCGCAATAACGGCCTGCAAAATCCGTTTCAAATCATCGGCGGCGGAGAATCCGAAGCATCCTCTTTCGCGGACAGCATACACCCGTTCAGTCCCCGAATCAGCCGTCGATTCACATCGCCGAGAGTCATTTCCGCCGGCCACCTGTGGCCGACGGCAGCCTTCAGAGCCTCCGTCTCGAACACAAACAGAATCACCTTCGATTTCTTCGTGCCGAGGCCGAACGTCAGATTGTGCCAAGCGTGGTGCGCCAGAATCGTGAAATTCGTCGGCAGGAACACGGCGGTGGAATTGAACGGAAGGCGATTTTGGTTGTTATTGTCGCTGTTGCCGTCGTGGTGCGGCGGCAGAGAAAGTAGCTGGAGGAATTCAAGAGGCCAGGCTCGGCGAATTGTGAAGGATTTGACAACGAATTGGGCGCCGAATCGGAGACCTTTGACAGCGGTAGATGGCTTGAAGGAGAAATCG

Coding sequence (CDS)

ATGGAAGCGTTTTTCAGCTCCTCCCCCTCTTTCACCTTAGCTTTGTTTATTCCTTCTGTCATTCAAATTTTGGAGCCCAACAGAGAAAAAGAAGCAGAGGAGAAGCGGGATTCAATGAACAACGGTAACCGGCGGCGGCTTAAATCTGCTACCGGCGCCACCGTAACTGATATTTCTCTGATGCGGCAACGCGAGCATGGCGGTGTAGTCCAAAAAATCATTCAGCGCAATAACGGCCTGCAAAATCCGTTTCAAATCATCGGCGGCGGAGAATCCGAAGCATCCTCTTTCGCGGACAGCATACACCCGTTCAGTCCCCGAATCAGCCGTCGATTCACATCGCCGAGAGTCATTTCCGCCGGCCACCTGTGGCCGACGGCAGCCTTCAGAGCCTCCGTCTCGAACACAAACAGAATCACCTTCGATTTCTTCGTGCCGAGGCCGAACGTCAGATTGTGCCAAGCGTGGTGCGCCAGAATCGTGAAATTCGTCGGCAGGAACACGGCGGTGGAATTGAACGGAAGGCGATTTTGGTTGTTATTGTCGCTGTTGCCGTCGTGGTGCGGCGGCAGAGAAAGTAGCTGGAGGAATTCAAGAGGCCAGGCTCGGCGAATTGTGAAGGATTTGACAACGAATTGGGCGCCGAATCGGAGACCTTTGACAGCGGTAGATGGCTTGAAGGAGAAATCG

Protein sequence

MEAFFSSSPSFTLALFIPSVIQILEPNREKEAEEKRDSMNNGNRRRLKSATGATVTDISLMRQREHGGVVQKIIQRNNGLQNPFQIIGGGESEASSFADSIHPFSPRISRRFTSPRVISAGHLWPTAAFRASVSNTNRITFDFFVPRPNVRLCQAWCARIVKFVGRNTAVELNGRRFWLLLSLLPSWCGGRESSWRNSRGQARRIVKDLTTNWAPNRRPLTAVDGLKEKS
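
As a quick consistency check between the coding sequence and the protein sequence above, the start of the CDS can be translated in frame 0. This is only a sketch: it assumes the standard nuclear genetic code applies, and `translate` and `CODON_TABLE` are names introduced here for illustration.

```python
BASES = "TCAG"
# Amino acids of the standard genetic code, ordered by codon with bases in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Build the codon -> amino acid map (assumption: standard genetic code).
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate complete codons in frame 0; a trailing partial codon is ignored."""
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[p:p + 3]] for p in range(0, usable, 3))

# First 60 nt of the CDS listed above.
cds_start = "ATGGAAGCGTTTTTCAGCTCCTCCCCCTCTTTCACCTTAGCTTTGTTTATTCCTTCTGTC"
print(translate(cds_start))
```

The translated 20 residues, MEAFFSSSPSFTLALFIPSV, agree with the first 20 residues of the protein sequence.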
Homology
BLAST of CmaCh04G002210 vs. ExPASy TrEMBL
Match: A0A0A9MTX3 (Uncharacterized protein OS=Arundo donax OX=35708 PE=4 SV=1)

HSP 1 Score: 80.9 bits (198), Expect = 7.8e-12
Identity = 38/63 (60.32%), Positives = 42/63 (66.67%), Query Frame = 0

Query: 103 PFSPRISRRFTSPRVISAGHLWPTAAFRASVSNTNRITFDFFVPRPNVRLCQAWCARIVK 162
           P SPR+S   TSP  I AG  W TAAF  S SNT   T DF VPRP+V +C AWCAR+VK
Sbjct: 16  PVSPRMSSLLTSPSGIMAGQSWSTAAFMLSDSNTKSTTLDFLVPRPSVSVCHAWCARMVK 75

Query: 163 FVG 166
            VG
Sbjct: 76  LVG 78

BLAST of CmaCh04G002210 vs. ExPASy TrEMBL
Match: A0A0A9VLF4 (Uncharacterized protein OS=Arundo donax OX=35708 PE=4 SV=1)

HSP 1 Score: 73.9 bits (180), Expect = 9.5e-10
Identity = 46/116 (39.66%), Positives = 55/116 (47.41%), Query Frame = 0

Query: 110 RRFTSPRVISAGHLWPTAAFRASVSNTNRITFDFFVPRPNVRLCQAWCARIVKFVGRNTA 169
           RR TSP   + G     AAF A+ S T     +F VPRP+VR+C AWCAR+ K VG   A
Sbjct: 3   RRPTSPSGTTCGQAASAAAFMAADSKTKTAALEFLVPRPSVRVCHAWCARMAKLVGM-YA 62

Query: 170 VELNGRRFWLLLSLLPSWCGGRESSWRNSRGQARRIVKDLTTNWAPNRRPLTAVDG 226
           V +                G       +S G ARR V    T  AP RRPLT+  G
Sbjct: 63  VVVGANDSGPPSPPAAVGAGAARRRCGSSSGAARRTVSARVTICAPKRRPLTSAHG 117

BLAST of CmaCh04G002210 vs. ExPASy TrEMBL
Match: A0A6A2YAD6 (Pentatricopeptide repeat-containing protein OS=Hibiscus syriacus OX=106335 GN=F3Y22_tig00111402pilonHSYRG01509 PE=3 SV=1)

HSP 1 Score: 61.6 bits (148), Expect = 4.9e-06
Identity = 37/71 (52.11%), Positives = 44/71 (61.97%), Query Frame = 0

Query: 160 IVKFVGRNTAVELNGRRFWLLLSLLPSWCGGRESSWRNSRGQARRIVKDLTTNWAPNRRP 219
           +VKFVG+   V   G    LLL LL +  GG  SS  +S G AR +V DLT NW PN RP
Sbjct: 1   MVKFVGKKAVVVGKGS---LLLDLLNN--GGDSSSLSSSSGLARLMVNDLTMNWPPNLRP 60

Query: 220 LTAVDGLKEKS 231
            T++ GLKEKS
Sbjct: 61  FTSLLGLKEKS 66

BLAST of CmaCh04G002210 vs. ExPASy TrEMBL
Match: A0A0A9VGE2 (Uncharacterized protein OS=Arundo donax OX=35708 PE=4 SV=1)

HSP 1 Score: 60.1 bits (144), Expect = 1.4e-05
Identity = 28/56 (50.00%), Positives = 34/56 (60.71%), Query Frame = 0

Query: 110 RRFTSPRVISAGHLWPTAAFRASVSNTNRITFDFFVPRPNVRLCQAWCARIVKFVG 166
           RR TSP   + G     AAF A+ S T     +F VPRP+VR+C AWCAR+ K VG
Sbjct: 3   RRPTSPSGTTCGQAASAAAFMAADSKTKTAALEFLVPRPSVRVCHAWCARMAKLVG 58

BLAST of CmaCh04G002210 vs. ExPASy TrEMBL
Match: A0A0A0KQX9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G517125 PE=4 SV=1)

HSP 1 Score: 58.9 bits (141), Expect = 3.2e-05
Identity = 28/41 (68.29%), Positives = 32/41 (78.05%), Query Frame = 0

Query: 190 GRESSWRNSRGQARRIVKDLTTNWAPNRRPLTAVDGLKEKS 231
           G+E S R+S+G ARRIVKDLT NW PN RP T+  GLKEKS
Sbjct: 20  GKERSRRSSKGLARRIVKDLTMNWPPNLRPFTSEHGLKEKS 60

BLAST of CmaCh04G002210 vs. NCBI nr
Match: CAE6076342.1 (unnamed protein product [Arabidopsis arenosa])

HSP 1 Score: 104.8 bits (260), Expect = 1.0e-18
Identity = 59/105 (56.19%), Positives = 65/105 (61.90%), Query Frame = 0

Query: 126 TAAFRASVSNTNRITFDFFVPRPNVRLCQAWCARIVKFVGRNTAVELNGRRFWLLLSLLP 185
           T  F ASVSNTN  T DF VPRP+V +C AWCARIVKFVGR  AVE +G           
Sbjct: 15  TDDFIASVSNTNTTTLDFLVPRPSVSVCHAWCARIVKFVGRYAAVERSG----------G 74

Query: 186 SWCGGRESSWRNSRGQARRIVKDLTTNWAPNRRPLTAVDGLKEKS 231
               GRE S+R+S   ARRIV DLT  W PN RP T+  GLKE S
Sbjct: 75  DDEEGRERSFRSSSVLARRIVNDLTIIWPPNLRPFTSEVGLKEMS 109

BLAST of CmaCh04G002210 vs. NCBI nr
Match: KAF9668561.1 (hypothetical protein SADUNF_Sadunf14G0016300 [Salix dunnii])

HSP 1 Score: 77.8 bits (190), Expect = 1.4e-10
Identity = 44/91 (48.35%), Positives = 54/91 (59.34%), Query Frame = 0

Query: 129 FRASVSNTNRITFDFFVPRPNVRLCQAWCARIVKFVGRNTAVELNGRRFWLLLSLLPSWC 188
           F  S   TN+ T D  VP P+V++CQAW A+IV+FVGR   V+  G R  LL   L    
Sbjct: 839 FMDSDLKTNKSTLDLLVPGPSVKVCQAWYAKIVRFVGRKAVVK--GNRILLLFFFLS--L 898

Query: 189 GGRESSWRNSRGQARRIVKDLTTNWAPNRRP 220
           GG ES++R+S G AR IVKD T N   N  P
Sbjct: 899 GGYESTFRSSSGLARWIVKDFTMNCPQNEDP 925

BLAST of CmaCh04G002210 vs. NCBI nr
Match: KAE8679465.1 (Pentatricopeptide repeat-containing protein [Hibiscus syriacus])

HSP 1 Score: 61.6 bits (148), Expect = 1.0e-05
Identity = 37/71 (52.11%), Positives = 44/71 (61.97%), Query Frame = 0

Query: 160 IVKFVGRNTAVELNGRRFWLLLSLLPSWCGGRESSWRNSRGQARRIVKDLTTNWAPNRRP 219
           +VKFVG+   V   G    LLL LL +  GG  SS  +S G AR +V DLT NW PN RP
Sbjct: 1   MVKFVGKKAVVVGKGS---LLLDLLNN--GGDSSSLSSSSGLARLMVNDLTMNWPPNLRP 60

Query: 220 LTAVDGLKEKS 231
            T++ GLKEKS
Sbjct: 61  FTSLLGLKEKS 66
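
Each HSP above reports Identity and Positives as a match count over the alignment length, followed by a percentage. The sketch below recomputes those percentages from the counts for the first TrEMBL hit; `hsp_percentages` is a hypothetical helper, and the two-decimal rounding is an assumption based on how the values are displayed in this record.

```python
import re

def hsp_percentages(line):
    """Return {label: (matches, length, recomputed percent)} for an HSP summary line."""
    result = {}
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        percent = round(100 * int(matches) / int(length), 2)
        result[label] = (int(matches), int(length), percent)
    return result

line = "Identity = 38/63 (60.32%), Positives = 42/63 (66.67%), Query Frame = 0"
print(hsp_percentages(line))
```

The recomputed values, 60.32 and 66.67, match the percentages printed in the report.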

The following BLAST results are available for this feature:

ExPASy TrEMBL
Match Name        E-value    Identity (%)    Description
A0A0A9MTX3        7.8e-12    60.32           Uncharacterized protein OS=Arundo donax OX=35708 PE=4 SV=1
A0A0A9VLF4        9.5e-10    39.66           Uncharacterized protein OS=Arundo donax OX=35708 PE=4 SV=1
A0A6A2YAD6        4.9e-06    52.11           Pentatricopeptide repeat-containing protein OS=Hibiscus syriacus OX=106335 GN=F3...
A0A0A9VGE2        1.4e-05    50.00           Uncharacterized protein OS=Arundo donax OX=35708 PE=4 SV=1
A0A0A0KQX9        3.2e-05    68.29           Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G517125 PE=4 SV=1

NCBI nr
Match Name        E-value    Identity (%)    Description
CAE6076342.1      1.0e-18    56.19           unnamed protein product [Arabidopsis arenosa]
KAF9668561.1      1.4e-10    48.35           hypothetical protein SADUNF_Sadunf14G0016300 [Salix dunnii]
KAE8679465.1      1.0e-05    52.11           Pentatricopeptide repeat-containing protein [Hibiscus syriacus]

InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term    IPR Description     Source         Source Term    Source Description     Alignment
None        No IPR available    MOBIDB_LITE    mobidb-lite    disorder_prediction    coord: 28..50
None        No IPR available    MOBIDB_LITE    mobidb-lite    disorder_prediction    coord: 28..43

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
CmaCh04G002210.1    CmaCh04G002210.1    mRNA