CmaCh20G002180 (gene) Cucurbita maxima (Rimu)

Name: CmaCh20G002180
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Coffea canephora DH200=94 genomic scaffold, scaffold_112
Location: Cma_Chr20 : 1046003 .. 1047076 (-)
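
As a quick sanity check on the location above, the span of the feature can be computed from its start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive (a common convention in genome annotation, but an assumption for this record); `feature_length` is a hypothetical helper name used only for illustration.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, ASSUMING 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must be >= start")
    return end - start + 1


# Coordinates taken from the Location line: Cma_Chr20 : 1046003 .. 1047076 (-)
length = feature_length(1046003, 1047076)
print(length)  # 1074
```

The strand (-) does not affect the length; it only means the gene sequence is reported as the reverse complement of the reference strand.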
The following sequences are available for this feature:

Gene sequence (with intron)

CAACGGAGGAGCCATGGAGGATCAACGAAAACAGAAGGCAGAACGGCTCCGGAAGAAGAGTAATAAGATCTCTCTCGAGGACTACCTCGATTTCTTCTCCTCCGACAAGCAACTATTCCTCCCCGTCGCTTATCTCAATCAGGTCCTGATCTACATTGTTCATTTTTTCGTCCTCACTAATTTCTCTGTGACGATCTGTTTGGATTTCGTTTCGAACTACTTGCGTGTTGATTTTTTAACCGATGCTTCTGTTTAACTTGATGTGCAGATCATTCGCATGCATGGCTACATGAATATCAAAGGTCTGAAGGTGAGTTTTGCCGAACCGATACGTATGTTTAATTTGCTTGGTTGAAAATTAATCATAATAACCTTTGTTTGCATCTCTCACGCAATGTTCACCAATTGGTTTCTAGTTTGAACTTCTTGCTTTGAAAAATTAGAGAACGTTTACTCAAATTGATTGAAATTCCTTATCAATTTTTTTCTATGTTTGGGACTTAATATGGATCCTCTGTTGTGCGTTCCAAATTAGAGAACGTTTACGTAATGTTCTAAATTAATTTAGCATAAAATAGTGGGAAGAGTCCGGAAAATGAGATTAGAAAACGATGCTGCTCAGCAGCTGAAGTAATTTGATGTATCCTCACTCCTAGTCTAATAGGAACCGATTAATTACATTCACATCCTCTCTCTCTTGCTTAGAATGTAGTGAAGGAAGCCGTAGGTACAATTAATCTGGTCAATCTCTCTCGTTCCACACTCAAAGAGAGCATCTCATCATCCGCGTCCATTACGCTTGAGGACGTAATCTCGGACCTCAAAAACCTCGAATGGCAAGAATGCAGTGTGACATCTGTTCTGAATTTCAGTTCCTGGAAGCAGAATAACTCCGATCCGAGTCCGGACCGCCAGGAGCCGACTAGTGCCTCAAAGAAGTCAGGAAAGAAATTACGAGTATTGAGTGAATGTAATAGTCAAGAAGTCGAGGCAATTGATGGAGTTTCCTCGTCTTGTGCCTCGAAGAAGCCGGGAAGTGAATCTCAGAGTAAGAGAAAGAAAACAGCTGCTTAA

mRNA sequence

CAACGGAGGAGCCATGGAGGATCAACGAAAACAGAAGGCAGAACGGCTCCGGAAGAAGAGTAATAAGATCTCTCTCGAGGACTACCTCGATTTCTTCTCCTCCGACAAGCAACTATTCCTCCCCGTCGCTTATCTCAATCAGATCATTCGCATGCATGGCTACATGAATATCAAAGGTCTGAAGAATGTAGTGAAGGAAGCCGTAGGTACAATTAATCTGGTCAATCTCTCTCGTTCCACACTCAAAGAGAGCATCTCATCATCCGCGTCCATTACGCTTGAGGACGTAATCTCGGACCTCAAAAACCTCGAATGGCAAGAATGCAGTGTGACATCTGTTCTGAATTTCAGTTCCTGGAAGCAGAATAACTCCGATCCGAGTCCGGACCGCCAGGAGCCGACTAGTGCCTCAAAGAAGTCAGGAAAGAAATTACGAGTATTGAGTGAATGTAATAGTCAAGAAGTCGAGGCAATTGATGGAGTTTCCTCGTCTTGTGCCTCGAAGAAGCCGGGAAGTGAATCTCAGAGTAAGAGAAAGAAAACAGCTGCTTAA

Coding sequence (CDS)

ATGGAGGATCAACGAAAACAGAAGGCAGAACGGCTCCGGAAGAAGAGTAATAAGATCTCTCTCGAGGACTACCTCGATTTCTTCTCCTCCGACAAGCAACTATTCCTCCCCGTCGCTTATCTCAATCAGATCATTCGCATGCATGGCTACATGAATATCAAAGGTCTGAAGAATGTAGTGAAGGAAGCCGTAGGTACAATTAATCTGGTCAATCTCTCTCGTTCCACACTCAAAGAGAGCATCTCATCATCCGCGTCCATTACGCTTGAGGACGTAATCTCGGACCTCAAAAACCTCGAATGGCAAGAATGCAGTGTGACATCTGTTCTGAATTTCAGTTCCTGGAAGCAGAATAACTCCGATCCGAGTCCGGACCGCCAGGAGCCGACTAGTGCCTCAAAGAAGTCAGGAAAGAAATTACGAGTATTGAGTGAATGTAATAGTCAAGAAGTCGAGGCAATTGATGGAGTTTCCTCGTCTTGTGCCTCGAAGAAGCCGGGAAGTGAATCTCAGAGTAAGAGAAAGAAAACAGCTGCTTAA

Protein sequence

MEDQRKQKAERLRKKSNKISLEDYLDFFSSDKQLFLPVAYLNQIIRMHGYMNIKGLKNVVKEAVGTINLVNLSRSTLKESISSSASITLEDVISDLKNLEWQECSVTSVLNFSSWKQNNSDPSPDRQEPTSASKKSGKKLRVLSECNSQEVEAIDGVSSSCASKKPGSESQSKRKKTAA
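
The protein sequence is expected to be the translation of the coding sequence listed above it. A minimal sketch of that check follows, assuming the standard nuclear genetic code (NCBI translation table 1); the `translate` function and `CODON_TABLE` name are illustrative, not part of any database tooling. Only short fragments of the CDS are used here to keep the example compact.

```python
from itertools import product

# Standard genetic code (NCBI table 1) in TCAG order; ASSUMED to apply to this gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # raises KeyError on non-ACGT input
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the CDS above; the protein above starts with MEDQRKQKAE.
print(translate("ATGGAGGATCAACGAAAACAGAAGGCAGAA"))  # MEDQRKQKAE
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence would extend the same check to the whole record.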
BLAST of CmaCh20G002180 vs. TrEMBL
Match: V4L939_EUTSA (Uncharacterized protein OS=Eutrema salsugineum GN=EUTSA_v10009758mg PE=4 SV=1)

HSP 1 Score: 101.3 bits (251), Expect = 1.3e-18
Identity = 57/138 (41.30%), Positives = 92/138 (66.67%), Query Frame = 1

Query: 18  KISLEDYLDFFSSDKQLFLPVAYLNQIIRMHGYMNI-KGLKNVVKEAVGTINLVNLSRST 77
           KISLE+Y+DFF+S K +   +AYLNQI+ MHG+  + K  K +V EAV  ++L++LSRST
Sbjct: 8   KISLEEYVDFFNSGKSIDFTIAYLNQIVHMHGFRKLHKSAKKIVGEAVDALDLLDLSRST 67

Query: 78  LKES-ISSSASITLEDVISDLKNLEWQECSVTS--VLNFSSWKQNNSDPSPDRQEPTSAS 137
           LK++ +SSSA+ TL++VI+D++ L+WQEC +TS  ++NF    +  +     +       
Sbjct: 68  LKQTGVSSSATQTLDEVITDIEALKWQECCLTSLQIINFDEVTRAGAAKPKQKSNKRKIG 127

Query: 138 KKSGKKLRVLSECNSQEV 152
           K+  KK++   + NS+ +
Sbjct: 128 KEKAKKIKKKKKKNSKSI 145
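
The identity and positives percentages in an HSP header are the match counts divided by the alignment length. The sketch below reproduces the figures for the HSP above (57/138 and 92/138); `percent` is a hypothetical helper, and rounding to two decimals is assumed to mirror how the page displays the values.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Percentage of aligned columns counted as matches, rounded to 2 decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


# Counts taken from the HSP header: Identity = 57/138, Positives = 92/138
print(f"{percent(57, 138):.2f}")  # 41.30
print(f"{percent(92, 138):.2f}")  # 66.67
```

Positives counts identical residues plus conservative substitutions (the `+` marks in the middle line of the alignment), so it is always at least as large as the identity count.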

BLAST of CmaCh20G002180 vs. TrEMBL
Match: K4DCH2_SOLLC (Uncharacterized protein OS=Solanum lycopersicum PE=4 SV=1)

HSP 1 Score: 97.8 bits (242), Expect = 1.4e-17
Identity = 56/124 (45.16%), Positives = 79/124 (63.71%), Query Frame = 1

Query: 11  RLRKKSNKISLEDYLDFFSSDKQLFLPVAYLNQIIRMHGYMNIKGLKNVVKEAVGTINLV 70
           R++KKS K+++E Y+DF  S KQ  L +  LN+II +HG+   K  K V+ +AV T+ L+
Sbjct: 9   RMKKKSQKMTVEKYVDFIDSKKQFDLTIPNLNEIISIHGFKKSKRQKKVLADAVNTMELI 68

Query: 71  NLSRSTLKESISSSASITLEDVISDLKNLEWQECSVTSVLNFS-SWKQNNSDPSPDRQEP 130
           +L RSTL+E ISS A +TL++ I DL NL WQEC VTS+     S   N SD    +   
Sbjct: 69  DLRRSTLQEEISSEAFVTLDEAIKDLTNLNWQECCVTSLQTICFSTGVNGSDHCQAKTNA 128

Query: 131 TSAS 134
           T++S
Sbjct: 129 TASS 132

BLAST of CmaCh20G002180 vs. TrEMBL
Match: A0A0B0PK16_GOSAR (Uncharacterized protein OS=Gossypium arboreum GN=F383_00109 PE=4 SV=1)

HSP 1 Score: 96.3 bits (238), Expect = 4.1e-17
Identity = 59/125 (47.20%), Positives = 82/125 (65.60%), Query Frame = 1

Query: 17  NKISLEDYLDFFSSDKQLFLPVAYLNQIIRMHGYMNIKGLKNVVKEAVGTINLVNLSRST 76
           +K+S+EDY+ F SS KQ  L + +LNQII +HG+  +   K  + +AV T++L++LSRST
Sbjct: 3   SKLSVEDYIQFLSSHKQRPLTINFLNQIISIHGFKKLTKHKKELSDAVETLDLMDLSRST 62

Query: 77  LKESISSSASITLEDVISDLKNLEWQECSVTSVLNFSSWKQNNSDPSPDRQEPTSASKKS 136
           LK SISS+A +T ++VI DL  LEWQEC VTS+      +  NS P PD+Q   S  K  
Sbjct: 63  LKSSISSNAWLTEKEVIGDLNCLEWQECCVTSI------QALNSSPLPDQQ---SNPKPQ 118

Query: 137 GKKLR 142
            K+ R
Sbjct: 123 AKRKR 118

BLAST of CmaCh20G002180 vs. TrEMBL
Match: A0A061DS97_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_001700 PE=4 SV=1)

HSP 1 Score: 95.5 bits (236), Expect = 6.9e-17
Identity = 69/165 (41.82%), Positives = 95/165 (57.58%), Query Frame = 1

Query: 1   MEDQRKQKAERLRKKSNKISLEDYLDFFSSDKQLFLPVAYLNQIIRMHGYMNIKGL-KNV 60
           ME   K  + R++ K  KISLE+Y+ F SS KQL L ++ LNQII +HG      + K  
Sbjct: 1   MERSSKANSWRMKAKKQKISLEEYIAFLSSHKQLPLTLSSLNQIIFIHGLKKSTNMPKKA 60

Query: 61  VKEAVGTINLVNLSRSTLKESISSSASITLEDVISDLKNLEWQECSVTSVLNFSSWKQNN 120
           + EAV  +NL++ SRSTLK ++SSSA +T E++I DL  LEWQEC VTS+   +S     
Sbjct: 61  LSEAVEKLNLIDPSRSTLKSTMSSSAWLTEEEIIGDLNRLEWQECCVTSIQTLNS----- 120

Query: 121 SDPSPDRQE-PTSASKKSGKKLRVLSECNSQEVEAIDGVSSSCAS 164
              SP++Q  P + +K   K+ R  S       E  D  SS+  S
Sbjct: 121 ---SPEQQSIPKAKAKAKAKRKRSASVA---VAEGADSFSSAVVS 154

BLAST of CmaCh20G002180 vs. TrEMBL
Match: D7KG22_ARALL (Predicted protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_678249 PE=4 SV=1)

HSP 1 Score: 95.1 bits (235), Expect = 9.1e-17
Identity = 55/126 (43.65%), Positives = 86/126 (68.25%), Query Frame = 1

Query: 18  KISLEDYLDFFSSDKQLFLPVAYLNQIIRMHGYMNIKGL-KNVVKEAVGTINLVNLSRST 77
           K+++E+Y+DFF+S     L ++YLNQI+ MHG+  +  L K +V EAV T++L++LSRST
Sbjct: 10  KMTVEEYVDFFTSGNSRNLTISYLNQILHMHGFRKLHKLQKKIVGEAVDTLDLLDLSRST 69

Query: 78  LKE---SISSSASITLEDVISDLKNLEWQECSVTSVLNFSSWKQNNSDPSPDRQEPTSAS 137
           LKE   S  SS+ +TL++VISD++ L+WQEC +TS+   +S +   S P P +++     
Sbjct: 70  LKEAPVSSPSSSPLTLDEVISDIEALKWQECCLTSLQIINSQEITGSVPKPKQKKSNKRK 129

Query: 138 KKSGKK 140
           K + KK
Sbjct: 130 KATMKK 135

BLAST of CmaCh20G002180 vs. TAIR10
Match: AT1G06320.1 (AT1G06320.1 unknown protein)

HSP 1 Score: 90.5 bits (223), Expect = 1.1e-18
Identity = 52/126 (41.27%), Positives = 86/126 (68.25%), Query Frame = 1

Query: 18  KISLEDYLDFFSSDKQLFLPVAYLNQIIRMHGYMNIKGL-KNVVKEAVGTINLVNLSRST 77
           KI++E+Y++F +S   +   +AYLNQI+ +HG+  +  L K +V+EAV +++L++LSRST
Sbjct: 8   KITVEEYVEFCNSGNSIHFTIAYLNQILHLHGFRKLHKLQKKIVEEAVDSLDLLDLSRST 67

Query: 78  LKE---SISSSASITLEDVISDLKNLEWQECSVTSVLNFSSWKQNNSDPSPDRQEPTSAS 137
           LK+   S  SS+S+TL++VISD++ L+WQEC  TS+   +S +   S+ S  +Q+     
Sbjct: 68  LKQVTDSSPSSSSLTLDEVISDIEALKWQECCFTSLQIINSQETTPSEISKPKQKSNKRK 127

Query: 138 KKSGKK 140
           K + KK
Sbjct: 128 KATMKK 133

BLAST of CmaCh20G002180 vs. NCBI nr
Match: gi|567154972|ref|XP_006417935.1| (hypothetical protein EUTSA_v10009758mg [Eutrema salsugineum])

HSP 1 Score: 101.3 bits (251), Expect = 1.8e-18
Identity = 57/138 (41.30%), Positives = 92/138 (66.67%), Query Frame = 1

Query: 18  KISLEDYLDFFSSDKQLFLPVAYLNQIIRMHGYMNI-KGLKNVVKEAVGTINLVNLSRST 77
           KISLE+Y+DFF+S K +   +AYLNQI+ MHG+  + K  K +V EAV  ++L++LSRST
Sbjct: 8   KISLEEYVDFFNSGKSIDFTIAYLNQIVHMHGFRKLHKSAKKIVGEAVDALDLLDLSRST 67

Query: 78  LKES-ISSSASITLEDVISDLKNLEWQECSVTS--VLNFSSWKQNNSDPSPDRQEPTSAS 137
           LK++ +SSSA+ TL++VI+D++ L+WQEC +TS  ++NF    +  +     +       
Sbjct: 68  LKQTGVSSSATQTLDEVITDIEALKWQECCLTSLQIINFDEVTRAGAAKPKQKSNKRKIG 127

Query: 138 KKSGKKLRVLSECNSQEV 152
           K+  KK++   + NS+ +
Sbjct: 128 KEKAKKIKKKKKKNSKSI 145

BLAST of CmaCh20G002180 vs. NCBI nr
Match: gi|985470270|ref|XP_015380948.1| (PREDICTED: uncharacterized protein LOC102610754 [Citrus sinensis])

HSP 1 Score: 100.9 bits (250), Expect = 2.4e-18
Identity = 51/98 (52.04%), Positives = 73/98 (74.49%), Query Frame = 1

Query: 12  LRKKSNKISLEDYLDFFSSDKQLFLPVAYLNQIIRMHGYMNIKGLKNVVKEAVGTINLVN 71
           +++K+ K+ +E YL+FF+S  Q  L V +LNQII MHG+  ++  KN + EAV TI+L+N
Sbjct: 1   MKEKNEKLCMEKYLEFFASRNQSDLKVEFLNQIISMHGFKRLRLPKNALSEAVSTIDLMN 60

Query: 72  LSRSTLKESISSSASITLEDVISDLKNLEWQECSVTSV 110
            SRSTLKE+IS + S+TL+ V+ DL +L WQEC VTS+
Sbjct: 61  PSRSTLKENISPAMSLTLKQVMEDLNDLTWQECCVTSI 98

BLAST of CmaCh20G002180 vs. NCBI nr
Match: gi|694326720|ref|XP_009354264.1| (PREDICTED: uncharacterized protein LOC103945420 [Pyrus x bretschneideri])

HSP 1 Score: 98.2 bits (243), Expect = 1.5e-17
Identity = 57/110 (51.82%), Positives = 73/110 (66.36%), Query Frame = 1

Query: 16  SNKISLEDYLDFFSSDKQLFLPVAYLNQIIRMHGYMNI-KGLKNVVKEAVGTINLVNLSR 75
           S K++LEDYL    S+  L L VA+LNQII MHGY  I K  K  + +AV ++ LV+ +R
Sbjct: 26  SEKMTLEDYLLLIQSNSHLHLTVAHLNQIISMHGYKKIYKVPKARLSDAVSSLPLVDPAR 85

Query: 76  STLKESISSSASITLEDVISDLKNLEWQECSVTSVLNFSSWKQNNSDPSP 125
           STLK+ IS     TLEDV++DL +L W+EC VTSV   SSWK   S P+P
Sbjct: 86  STLKDYISPFVITTLEDVVADLADLNWKECCVTSVETLSSWKHTTSAPAP 135

BLAST of CmaCh20G002180 vs. NCBI nr
Match: gi|460412831|ref|XP_004251801.1| (PREDICTED: uncharacterized protein LOC101257540 isoform X1 [Solanum lycopersicum])

HSP 1 Score: 97.8 bits (242), Expect = 2.0e-17
Identity = 56/124 (45.16%), Positives = 79/124 (63.71%), Query Frame = 1

Query: 11  RLRKKSNKISLEDYLDFFSSDKQLFLPVAYLNQIIRMHGYMNIKGLKNVVKEAVGTINLV 70
           R++KKS K+++E Y+DF  S KQ  L +  LN+II +HG+   K  K V+ +AV T+ L+
Sbjct: 9   RMKKKSQKMTVEKYVDFIDSKKQFDLTIPNLNEIISIHGFKKSKRQKKVLADAVNTMELI 68

Query: 71  NLSRSTLKESISSSASITLEDVISDLKNLEWQECSVTSVLNFS-SWKQNNSDPSPDRQEP 130
           +L RSTL+E ISS A +TL++ I DL NL WQEC VTS+     S   N SD    +   
Sbjct: 69  DLRRSTLQEEISSEAFVTLDEAIKDLTNLNWQECCVTSLQTICFSTGVNGSDHCQAKTNA 128

Query: 131 TSAS 134
           T++S
Sbjct: 129 TASS 132

BLAST of CmaCh20G002180 vs. NCBI nr
Match: gi|723748304|ref|XP_010313869.1| (PREDICTED: uncharacterized protein LOC101257540 isoform X2 [Solanum lycopersicum])

HSP 1 Score: 97.8 bits (242), Expect = 2.0e-17
Identity = 56/124 (45.16%), Positives = 79/124 (63.71%), Query Frame = 1

Query: 11  RLRKKSNKISLEDYLDFFSSDKQLFLPVAYLNQIIRMHGYMNIKGLKNVVKEAVGTINLV 70
           R++KKS K+++E Y+DF  S KQ  L +  LN+II +HG+   K  K V+ +AV T+ L+
Sbjct: 9   RMKKKSQKMTVEKYVDFIDSKKQFDLTIPNLNEIISIHGFKKSKRQKKVLADAVNTMELI 68

Query: 71  NLSRSTLKESISSSASITLEDVISDLKNLEWQECSVTSVLNFS-SWKQNNSDPSPDRQEP 130
           +L RSTL+E ISS A +TL++ I DL NL WQEC VTS+     S   N SD    +   
Sbjct: 69  DLRRSTLQEEISSEAFVTLDEAIKDLTNLNWQECCVTSLQTICFSTGVNGSDHCQAKTNA 128

Query: 131 TSAS 134
           T++S
Sbjct: 129 TASS 132

The following BLAST results are available for this feature:

TrEMBL
Match Name        E-value  Identity (%)  Description
V4L939_EUTSA      1.3e-18  41.30         Uncharacterized protein OS=Eutrema salsugineum GN=EUTSA_v10009758mg PE=4 SV=1
K4DCH2_SOLLC      1.4e-17  45.16         Uncharacterized protein OS=Solanum lycopersicum PE=4 SV=1
A0A0B0PK16_GOSAR  4.1e-17  47.20         Uncharacterized protein OS=Gossypium arboreum GN=F383_00109 PE=4 SV=1
A0A061DS97_THECC  6.9e-17  41.82         Uncharacterized protein OS=Theobroma cacao GN=TCM_001700 PE=4 SV=1
D7KG22_ARALL      9.1e-17  43.65         Predicted protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_678249 PE=4 ...

TAIR10
Match Name   E-value  Identity (%)  Description
AT1G06320.1  1.1e-18  41.27         unknown protein

NCBI nr
Match Name                        E-value  Identity (%)  Description
gi|567154972|ref|XP_006417935.1|  1.8e-18  41.30         hypothetical protein EUTSA_v10009758mg [Eutrema salsugineum]
gi|985470270|ref|XP_015380948.1|  2.4e-18  52.04         PREDICTED: uncharacterized protein LOC102610754 [Citrus sinensis]
gi|694326720|ref|XP_009354264.1|  1.5e-17  51.82         PREDICTED: uncharacterized protein LOC103945420 [Pyrus x bretschneideri]
gi|460412831|ref|XP_004251801.1|  2.0e-17  45.16         PREDICTED: uncharacterized protein LOC101257540 isoform X1 [Solanum lycopersicum...
gi|723748304|ref|XP_010313869.1|  2.0e-17  45.16         PREDICTED: uncharacterized protein LOC101257540 isoform X2 [Solanum lycopersicum...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh20G002180.1  CmaCh20G002180.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term  IPR Description   Source   Source Term    Source Description   Alignment
None      No IPR available  PANTHER  PTHR35096      FAMILY NOT NAMED     coord: 1..114, score: 5.4
None      No IPR available  PANTHER  PTHR35096:SF2  SUBFAMILY NOT NAMED  coord: 1..114, score: 5.4

The following gene(s) are paralogous to this gene:

None