CmoCh04G000200 (gene) Cucurbita moschata (Rifu)

NameCmoCh04G000200
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu))
DescriptionCystic fibrosis transmembrane conductance regulator
LocationCmo_Chr04 : 108529 .. 109587 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TCTCTCTTTGGGACTGCTCCTTTTAATGCACTTTCTTGTTTTTCCCCTCTCCTAACCCAAAAGATTTTCTGTGTTTTCCTGTCATTTCATTTCTTTACCCCTTCCATACCATTCATTCATCTTCCATCTCTTTGAGAACACTGTGTTTGAGGCTCCCTTTTTCTGAACTTGTTGTTCAGTTGAGAGAAATGCAAAGCAGCAGCGTTTCAGAAACAGAAGAGGGAAGCATTAAGGCTGAAAGAAGATCAGGAATGGAAGCTCAGGTGGTTGAAGTGGACATGGAGTTTTGGCCATTGCAGCATCCCACGGAACCTGATGATGAAGACCGCCCTGTCGTTTGCCCAATGCCCAACTCTTCTTCCATTCTTGACGTAACACTTCTTTTTCATTTACTTCCACCACGCCATGTGTTATCATCACACCTACCTCATACCTTCATCTCACAACAACTATAACTGTTTGTTAAACTGATGTTTGAAATGGAAAACAGGAAGGAACCATGCATAACGGCAAGAGGGCACCAGAAAGCTGGAGGAAAAGAACAGAAGTGTCTAGAGAAGTTAAAGCATCAGCCGAGACGGCGGTGAGGGTGGTGCGGAAGCGGCACCACCATACGCTCAGCCGACCCGACCACCTGATGGGAGGAATGTCGCCGGTTCCTCTGCTGCCGACCACCCAGAACTTCACCATCTTCCAAATGCTTCAGCAGTTGGACAAATTTGAGTCTTGACAATGAAACAAAAGGGCACGGCAGCTATAAGGTCAAATTACCAAAGTGACAGATAATGCAAATATATTTTGATATGTTTAGTGATTATTATTTTATATATATAGATATGGATATGGATATGGATATTTTGTGTTGAGAGATATGATATGGGTGTGAATTAGAAGACATATTTTGTGTTTGGATTTTGTGTATTTGGTAAAGTTGAAACTGTGCCGTTGTGGACTCCAATTGTTAAAGTGAAGTATGGAGGGGTTTGGTATAAAGGAAAAAGTAAATTTACTGAGAAGGAAATGGGAGACATGGACTCAATTATATAATGCAAACCCACA

mRNA sequence

TCTCTCTTTGGGACTGCTCCTTTTAATGCACTTTCTTGTTTTTCCCCTCTCCTAACCCAAAAGATTTTCTGTGTTTTCCTGTCATTTCATTTCTTTACCCCTTCCATACCATTCATTCATCTTCCATCTCTTTGAGAACACTGTGTTTGAGGCTCCCTTTTTCTGAACTTGTTGTTCAGTTGAGAGAAATGCAAAGCAGCAGCGTTTCAGAAACAGAAGAGGGAAGCATTAAGGCTGAAAGAAGATCAGGAATGGAAGCTCAGGTGGTTGAAGTGGACATGGAGTTTTGGCCATTGCAGCATCCCACGGAACCTGATGATGAAGACCGCCCTGTCGTTTGCCCAATGCCCAACTCTTCTTCCATTCTTGACGAAGGAACCATGCATAACGGCAAGAGGGCACCAGAAAGCTGGAGGAAAAGAACAGAAGTGTCTAGAGAAGTTAAAGCATCAGCCGAGACGGCGGTGAGGGTGGTGCGGAAGCGGCACCACCATACGCTCAGCCGACCCGACCACCTGATGGGAGGAATGTCGCCGGTTCCTCTGCTGCCGACCACCCAGAACTTCACCATCTTCCAAATGCTTCAGCAGTTGGACAAATTTGAGTCTTGACAATGAAACAAAAGGGCACGGCAGCTATAAGGTCAAATTACCAAAGTGACAGATAATGCAAATATATTTTGATATGTTTAGTGATTATTATTTTATATATATAGATATGGATATGGATATGGATATTTTGTGTTGAGAGATATGATATGGGTGTGAATTAGAAGACATATTTTGTGTTTGGATTTTGTGTATTTGGTAAAGTTGAAACTGTGCCGTTGTGGACTCCAATTGTTAAAGTGAAGTATGGAGGGGTTTGGTATAAAGGAAAAAGTAAATTTACTGAGAAGGAAATGGGAGACATGGACTCAATTATATAATGCAAACCCACA

Coding sequence (CDS)

ATGCAAAGCAGCAGCGTTTCAGAAACAGAAGAGGGAAGCATTAAGGCTGAAAGAAGATCAGGAATGGAAGCTCAGGTGGTTGAAGTGGACATGGAGTTTTGGCCATTGCAGCATCCCACGGAACCTGATGATGAAGACCGCCCTGTCGTTTGCCCAATGCCCAACTCTTCTTCCATTCTTGACGAAGGAACCATGCATAACGGCAAGAGGGCACCAGAAAGCTGGAGGAAAAGAACAGAAGTGTCTAGAGAAGTTAAAGCATCAGCCGAGACGGCGGTGAGGGTGGTGCGGAAGCGGCACCACCATACGCTCAGCCGACCCGACCACCTGATGGGAGGAATGTCGCCGGTTCCTCTGCTGCCGACCACCCAGAACTTCACCATCTTCCAAATGCTTCAGCAGTTGGACAAATTTGAGTCTTGA
BLAST of CmoCh04G000200 vs. TrEMBL
Match: A0A0A0KPE8_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G258200 PE=4 SV=1)

HSP 1 Score: 183.0 bits (463), Expect = 2.6e-43
Identity = 93/120 (77.50%), Postives = 100/120 (83.33%), Query Frame = 1

Query: 22  MEAQVVEVD-MEFWPLQHPTEPDDEDRPVVCPMPNSSSILDEGTMHNGKRAPESWRKRTE 81
           ME +V E+D MEFWPLQHP EPDDED PV+CPMPNS+S+LDEGT+HNGKR PESWRKRTE
Sbjct: 1   MEVKVFEMDKMEFWPLQHPLEPDDEDHPVICPMPNSTSLLDEGTLHNGKRTPESWRKRTE 60

Query: 82  VSREVKASAETAVRVVRKRHHHTLSRPDHLMGGMSPVPLLPTTQNFTIFQMLQQLDKFES 141
           VSREVK  AE   R VRKRHH TLSRPD LM GMSP P+ P   NFTIFQMLQQLDKFES
Sbjct: 61  VSREVKVQAE--ARPVRKRHHRTLSRPDQLMVGMSPRPITP---NFTIFQMLQQLDKFES 115

BLAST of CmoCh04G000200 vs. TrEMBL
Match: M5XJ48_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa024210mg PE=4 SV=1)

HSP 1 Score: 120.2 bits (300), Expect = 2.1e-24
Identity = 78/157 (49.68%), Postives = 93/157 (59.24%), Query Frame = 1

Query: 6   VSETEEGSIKAERRSGMEAQ----VVEVDMEFWPLQHPTEPDDEDRPVVCPMPNSSSILD 65
           + ET E    +    GMEA      +EVD +FWP++HP EP DEDRPV CPMP+SS I D
Sbjct: 19  LDETREKQQHSSWGCGMEAPHNHVQLEVDFQFWPVEHPLEPPDEDRPVKCPMPDSSVIND 78

Query: 66  EGTMHNGKRAPESWRKRTEVS-------------REVKASAE--TAVRVVRKRHHHTLSR 125
            G          + RKRTEVS             + V A AE   AVR VRKRHH+TL+R
Sbjct: 79  GGRQEKRSSESSAMRKRTEVSSAAAATYSKPRTDQMVVAVAEPPPAVRAVRKRHHNTLTR 138

Query: 126 PDHLMG---GMSPVPLLPTTQNFTIFQMLQQLDKFES 141
            DH++     M P+P LP TQ+ TIFQMLQQLDKFES
Sbjct: 139 GDHMISPLRRMPPIPSLP-TQSITIFQMLQQLDKFES 174

BLAST of CmoCh04G000200 vs. TrEMBL
Match: A0A061EG67_THECC (Uncharacterized protein isoform 2 OS=Theobroma cacao GN=TCM_011218 PE=4 SV=1)

HSP 1 Score: 117.9 bits (294), Expect = 1.0e-23
Identity = 76/148 (51.35%), Postives = 96/148 (64.86%), Query Frame = 1

Query: 2   QSSSVSETEEGSIKAERRSGMEAQ--VVEVDMEFWPLQHPTEPDDEDRPVVCPMPNSSSI 61
           + S VS  +E S +   + GM+     +E D+EFWP++HP EP DEDRPV CPMP SSSI
Sbjct: 20  EESEVSVIDE-SREPRHKGGMDLHHHAIEFDIEFWPVEHPMEPQDEDRPVKCPMPASSSI 79

Query: 62  LDEGTMHNGKRAPESWRKRTEVSREVK----ASAETAVRVVRKRHHHTLSRPDHLMGG-- 121
            D G  +  + A ES RKR+EV+ ++K       E AVR VRKRHH TL+R DH++    
Sbjct: 80  ND-GIGNEERLAAESSRKRSEVTEKLKKGTPVGTEPAVRAVRKRHH-TLTRDDHVIKPLV 139

Query: 122 -MSPVPLLPTTQNFTIFQMLQQLDKFES 141
            M P+P LPT QN TIFQMLQ+ DKF S
Sbjct: 140 RMPPLPPLPT-QNLTIFQMLQEFDKFNS 163

BLAST of CmoCh04G000200 vs. TrEMBL
Match: A0A061E9H9_THECC (Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_011218 PE=4 SV=1)

HSP 1 Score: 117.9 bits (294), Expect = 1.0e-23
Identity = 76/148 (51.35%), Postives = 96/148 (64.86%), Query Frame = 1

Query: 2   QSSSVSETEEGSIKAERRSGMEAQ--VVEVDMEFWPLQHPTEPDDEDRPVVCPMPNSSSI 61
           + S VS  +E S +   + GM+     +E D+EFWP++HP EP DEDRPV CPMP SSSI
Sbjct: 53  EESEVSVIDE-SREPRHKGGMDLHHHAIEFDIEFWPVEHPMEPQDEDRPVKCPMPASSSI 112

Query: 62  LDEGTMHNGKRAPESWRKRTEVSREVK----ASAETAVRVVRKRHHHTLSRPDHLMGG-- 121
            D G  +  + A ES RKR+EV+ ++K       E AVR VRKRHH TL+R DH++    
Sbjct: 113 ND-GIGNEERLAAESSRKRSEVTEKLKKGTPVGTEPAVRAVRKRHH-TLTRDDHVIKPLV 172

Query: 122 -MSPVPLLPTTQNFTIFQMLQQLDKFES 141
            M P+P LPT QN TIFQMLQ+ DKF S
Sbjct: 173 RMPPLPPLPT-QNLTIFQMLQEFDKFNS 196

BLAST of CmoCh04G000200 vs. TrEMBL
Match: A0A067L122_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_04823 PE=4 SV=1)

HSP 1 Score: 108.6 bits (270), Expect = 6.2e-21
Identity = 64/121 (52.89%), Postives = 81/121 (66.94%), Query Frame = 1

Query: 27  VEVDMEFWPLQHPTEPDDEDRPVVCPMPNSSSILDEGTMHNGKRAPESWRKRTEVSREVK 86
           VE D++FWP++HP EP DEDRPV CP+P +SS+L++G +H  +R  ES RKR EVS  V 
Sbjct: 42  VEFDVDFWPVEHPMEPQDEDRPVKCPIP-TSSVLNDGRVHE-ERNGESLRKRAEVSTMVN 101

Query: 87  ------ASAETAVRVVRKRHHHTLSRPDHLMGGMSPVPLLP--TTQNFTIFQMLQQLDKF 140
                   AE  V  VRKR HHTL+  DH++  +  +P LP    QN TIFQMLQQLD+F
Sbjct: 102 KEGIVIVGAEPPVGAVRKR-HHTLTNGDHVITPLRRMPSLPPLPAQNVTIFQMLQQLDEF 159

BLAST of CmoCh04G000200 vs. TAIR10
Match: AT2G01913.1 (AT2G01913.1 unknown protein)

HSP 1 Score: 57.8 bits (138), Expect = 6.3e-09
Identity = 43/137 (31.39%), Postives = 67/137 (48.91%), Query Frame = 1

Query: 16  AERRSGMEAQVV--------EVDMEFWPLQHPTEPDDEDRPVVCPMPNSSSILDEGTMHN 75
           A++R  +E  V+        + D+EF P++HP EP++EDRPV CP+P SSS++ +     
Sbjct: 6   ADKRESVEETVMMEYDEETNQFDVEFCPVEHPIEPEEEDRPVKCPVPISSSLIHKPK--- 65

Query: 76  GKRAPESWRKRTEVSRE--VKASAETAVRVVRKRHH--HTLSRPDHLMGGMSPVPLLPTT 135
            +++   W K    S E  V       VR VRKRH+        +               
Sbjct: 66  -EKSKPGWVKHRASSYETPVYPPPRHHVRNVRKRHNSFDVEGNNNFFTRSHDDDETTSRR 125

Query: 136 QNFTIFQMLQQLDKFES 141
            N T +++LQQ+ +FES
Sbjct: 126 SNVTFYRVLQQVQEFES 138

BLAST of CmoCh04G000200 vs. TAIR10
Match: AT2G35585.1 (AT2G35585.1 unknown protein)

HSP 1 Score: 47.8 bits (112), Expect = 6.6e-06
Identity = 23/60 (38.33%), Postives = 38/60 (63.33%), Query Frame = 1

Query: 23  EAQVVEVDMEFWPLQHPTEPDDEDRPVVCPMPNSSSILDEGTMHNGKRAPESWRKRTEVS 82
           E   +EV  EF P+ HP EP D D+P+ CP+P   SIL++G +   + +  S R+R++++
Sbjct: 44  ETHGIEVATEFKPVDHPMEPLDNDQPIQCPLP-EPSILNDGRLWKERVSASSMRRRSDLA 102

BLAST of CmoCh04G000200 vs. NCBI nr
Match: gi|449445308|ref|XP_004140415.1| (PREDICTED: uncharacterized protein LOC101204044 [Cucumis sativus])

HSP 1 Score: 183.0 bits (463), Expect = 3.7e-43
Identity = 93/120 (77.50%), Postives = 100/120 (83.33%), Query Frame = 1

Query: 22  MEAQVVEVD-MEFWPLQHPTEPDDEDRPVVCPMPNSSSILDEGTMHNGKRAPESWRKRTE 81
           ME +V E+D MEFWPLQHP EPDDED PV+CPMPNS+S+LDEGT+HNGKR PESWRKRTE
Sbjct: 1   MEVKVFEMDKMEFWPLQHPLEPDDEDHPVICPMPNSTSLLDEGTLHNGKRTPESWRKRTE 60

Query: 82  VSREVKASAETAVRVVRKRHHHTLSRPDHLMGGMSPVPLLPTTQNFTIFQMLQQLDKFES 141
           VSREVK  AE   R VRKRHH TLSRPD LM GMSP P+ P   NFTIFQMLQQLDKFES
Sbjct: 61  VSREVKVQAE--ARPVRKRHHRTLSRPDQLMVGMSPRPITP---NFTIFQMLQQLDKFES 115

BLAST of CmoCh04G000200 vs. NCBI nr
Match: gi|659114032|ref|XP_008456876.1| (PREDICTED: uncharacterized protein LOC103496689 [Cucumis melo])

HSP 1 Score: 180.3 bits (456), Expect = 2.4e-42
Identity = 91/120 (75.83%), Postives = 99/120 (82.50%), Query Frame = 1

Query: 22  MEAQVVEVD-MEFWPLQHPTEPDDEDRPVVCPMPNSSSILDEGTMHNGKRAPESWRKRTE 81
           ME +  E+D MEFWPLQHP EPDDED PV+CPMPNS+S+LDEGT+HNGKR PESWRKRTE
Sbjct: 1   MEVKAFEMDKMEFWPLQHPLEPDDEDHPVICPMPNSTSLLDEGTIHNGKRTPESWRKRTE 60

Query: 82  VSREVKASAETAVRVVRKRHHHTLSRPDHLMGGMSPVPLLPTTQNFTIFQMLQQLDKFES 141
           VSREVK  AE   R VRKRHH T++RPD LM GMSP    PTT NFTIFQMLQQLDKFES
Sbjct: 61  VSREVKLQAE--ARPVRKRHHRTVTRPDQLMAGMSP---RPTTPNFTIFQMLQQLDKFES 115

BLAST of CmoCh04G000200 vs. NCBI nr
Match: gi|645228351|ref|XP_008220954.1| (PREDICTED: uncharacterized protein LOC103320989 isoform X2 [Prunus mume])

HSP 1 Score: 120.2 bits (300), Expect = 3.0e-24
Identity = 78/157 (49.68%), Postives = 93/157 (59.24%), Query Frame = 1

Query: 6   VSETEEGSIKAERRSGMEAQ----VVEVDMEFWPLQHPTEPDDEDRPVVCPMPNSSSILD 65
           + ET E    +    GMEA      +EVD +FWP++HP EP DEDRPV CPMP+SS I D
Sbjct: 24  LDETREKQQHSSWGCGMEAPHNHVQLEVDFQFWPVEHPLEPPDEDRPVKCPMPDSSVIND 83

Query: 66  EGTMHNGKRAPESWRKRTEVS-------------REVKASAE--TAVRVVRKRHHHTLSR 125
            G          + RKRTEVS             + V A AE   AVR VRKRHH+TL+R
Sbjct: 84  GGRQEKRSSESSAMRKRTEVSSAAAATYSKPRMDQMVVAVAEPPPAVRAVRKRHHNTLTR 143

Query: 126 PDHLMG---GMSPVPLLPTTQNFTIFQMLQQLDKFES 141
            DH++     M P+P LP TQ+ TIFQMLQQLDKFES
Sbjct: 144 GDHMISPLRRMPPIPSLP-TQSITIFQMLQQLDKFES 179

BLAST of CmoCh04G000200 vs. NCBI nr
Match: gi|596253420|ref|XP_007224755.1| (hypothetical protein PRUPE_ppa024210mg [Prunus persica])

HSP 1 Score: 120.2 bits (300), Expect = 3.0e-24
Identity = 78/157 (49.68%), Postives = 93/157 (59.24%), Query Frame = 1

Query: 6   VSETEEGSIKAERRSGMEAQ----VVEVDMEFWPLQHPTEPDDEDRPVVCPMPNSSSILD 65
           + ET E    +    GMEA      +EVD +FWP++HP EP DEDRPV CPMP+SS I D
Sbjct: 19  LDETREKQQHSSWGCGMEAPHNHVQLEVDFQFWPVEHPLEPPDEDRPVKCPMPDSSVIND 78

Query: 66  EGTMHNGKRAPESWRKRTEVS-------------REVKASAE--TAVRVVRKRHHHTLSR 125
            G          + RKRTEVS             + V A AE   AVR VRKRHH+TL+R
Sbjct: 79  GGRQEKRSSESSAMRKRTEVSSAAAATYSKPRTDQMVVAVAEPPPAVRAVRKRHHNTLTR 138

Query: 126 PDHLMG---GMSPVPLLPTTQNFTIFQMLQQLDKFES 141
            DH++     M P+P LP TQ+ TIFQMLQQLDKFES
Sbjct: 139 GDHMISPLRRMPPIPSLP-TQSITIFQMLQQLDKFES 174

BLAST of CmoCh04G000200 vs. NCBI nr
Match: gi|590697445|ref|XP_007045441.1| (Uncharacterized protein isoform 1 [Theobroma cacao])

HSP 1 Score: 117.9 bits (294), Expect = 1.5e-23
Identity = 76/148 (51.35%), Postives = 96/148 (64.86%), Query Frame = 1

Query: 2   QSSSVSETEEGSIKAERRSGMEAQ--VVEVDMEFWPLQHPTEPDDEDRPVVCPMPNSSSI 61
           + S VS  +E S +   + GM+     +E D+EFWP++HP EP DEDRPV CPMP SSSI
Sbjct: 53  EESEVSVIDE-SREPRHKGGMDLHHHAIEFDIEFWPVEHPMEPQDEDRPVKCPMPASSSI 112

Query: 62  LDEGTMHNGKRAPESWRKRTEVSREVK----ASAETAVRVVRKRHHHTLSRPDHLMGG-- 121
            D G  +  + A ES RKR+EV+ ++K       E AVR VRKRHH TL+R DH++    
Sbjct: 113 ND-GIGNEERLAAESSRKRSEVTEKLKKGTPVGTEPAVRAVRKRHH-TLTRDDHVIKPLV 172

Query: 122 -MSPVPLLPTTQNFTIFQMLQQLDKFES 141
            M P+P LPT QN TIFQMLQ+ DKF S
Sbjct: 173 RMPPLPPLPT-QNLTIFQMLQEFDKFNS 196

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0KPE8_CUCSA2.6e-4377.50Uncharacterized protein OS=Cucumis sativus GN=Csa_5G258200 PE=4 SV=1[more]
M5XJ48_PRUPE2.1e-2449.68Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa024210mg PE=4 SV=1[more]
A0A061EG67_THECC1.0e-2351.35Uncharacterized protein isoform 2 OS=Theobroma cacao GN=TCM_011218 PE=4 SV=1[more]
A0A061E9H9_THECC1.0e-2351.35Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_011218 PE=4 SV=1[more]
A0A067L122_JATCU6.2e-2152.89Uncharacterized protein OS=Jatropha curcas GN=JCGZ_04823 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT2G01913.16.3e-0931.39 unknown protein[more]
AT2G35585.16.6e-0638.33 unknown protein[more]
Match NameE-valueIdentityDescription
gi|449445308|ref|XP_004140415.1|3.7e-4377.50PREDICTED: uncharacterized protein LOC101204044 [Cucumis sativus][more]
gi|659114032|ref|XP_008456876.1|2.4e-4275.83PREDICTED: uncharacterized protein LOC103496689 [Cucumis melo][more]
gi|645228351|ref|XP_008220954.1|3.0e-2449.68PREDICTED: uncharacterized protein LOC103320989 isoform X2 [Prunus mume][more]
gi|596253420|ref|XP_007224755.1|3.0e-2449.68hypothetical protein PRUPE_ppa024210mg [Prunus persica][more]
gi|590697445|ref|XP_007045441.1|1.5e-2351.35Uncharacterized protein isoform 1 [Theobroma cacao][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh04G000200.1CmoCh04G000200.1mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR34196FAMILY NOT NAMEDcoord: 1..140
score: 4.3
NoneNo IPR availablePANTHERPTHR34196:SF1SUBFAMILY NOT NAMEDcoord: 1..140
score: 4.3