CmoCh02G005200.1 (mRNA) Cucurbita moschata (Rifu)

NameCmoCh02G005200.1
TypemRNA
OrganismCucurbita moschata (Cucurbita moschata (Rifu))
DescriptionUnknown protein
LocationCmo_Chr02 : 2914491 .. 2916262 (+)
Sequence length1012
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
CTTTGCTTTGTCACTTTATATAATATGAATCTCAATCTCTTCCTCTTCTTCTTCTTCTTTTCCATCAATACATAGCAATAAAAATTTCAACTCAGATTGGGAAGAGCTGATGGCAACAACCATCCATGGCGAAGAACACGATAAACAAGACGAAGACGAAGAAGAAGCGTTATCTTTGTGCGACCTTCCCGTCAAAGAAAAGCAGCAGCCTTCGCTTCTAAATGAAAACCCCATCACGGAGGATTTCGATTTCAACCACTGGCCACCGCCGCCGTCGCCGCCGCCCATGTGCGCTGCCGACGACATCTTCTTTCAAGGCCATTTGCTCCCTCTTCGCCTCTCTGTTAGCTCTGATAATACTCACAATCACTTCTTTTCTAAACCTCTGTCCGCCAGGTTCGAACCCAGCCACAATTGTTCATACATTTAATTTTTTTTTTTTAATTTCTTAATTTATTTAATTTTTCCAGGTCGGAGTCTATGGATCATAATATGTTGAGGTTTAGAAATGGAAGCAGTAGTAGCAGTAGCAATAGCAGTAGAAGTCATTATTCCAGGTATGCTTTTTTAATTTCTCCCCATTTCAATTAATTATTTTATTTCAAACTTGGATATTTTTAAAATTATAATCTCCAAAATCCAACTCAAATTCAAACATCAAAATATATTACCGTTACACAAAATAAAAGGTTAAATTAATAATAATAAAATAAAATAAACTTTCCCAAAAAAAAATATATTGTTTTAACTTTAAAAGTTTTAATAATCTTTTAATTTTATAAAAATTGAATTTAGCAAAGTTTTATTAATATTTTTAATTTTTTTAAAAATATTTTACCCATTAACACTAATACCGTTAAAGATATTGTCCTTAGGTTTTCCCTATTTTAAAATGGCCCGATCTTAAAAGTTTGTTTTGTTCTCTTCTCCAACTAATAGATAATATTATTTTTATAAATTTATAGATATTTTTTTTTTAAATGTTAAAAATATTAATAAAAATTATATTTGAAATAAAATAATCACGGTTGGAGGATGGAAGTCCACACTAATTTAGGAAATGATCGATAAGTTAGGAATATTACCTCTATTGGTAGAGGCCAACTCTTGGGTAAGCCCAAACCAAAGCCACGAGAGCTTATGTTCAAAGAGGACATGATTGTGGAGAGTTGTGTTCATCTAATAATCACGATCTAGTTATATATTTTTAAAAAAATAATTGATGAAATATTTTGAATTGCAGGTGTTCAAGTATTAGTAACAATTCCATTTCAATTCCGACGAACTCAAAGCCAAGAACTCAAAACAACGTCTTCCACTCTCACCCAAGTCCCACGCCCCAAATCAGATCCTTTTCGACTTTCGGCCGCCGGAGCCGGAGCCGGAGCTCCTCCCGCTGGGACTTTTTCCGGGTGGGTCTTCTCCGAACGCCAGGGATGGAACTTCACGACCTCAAAACTCGCACCACCGTCAGCAACGCGGTACCCACGGTGGGGCAGAAAACAACCGCCTCGTTTTTGGGTGTGGTGAGCTGCAAAAAATCAGTGGAGACAATACCGGCGGCGAAGAAGATAAAGAATTGGAGTGGGAATATTGGGAAGAAAAGGAATGAAAAGGGAAAGGGAATTGGGATTAGGGAGAAGGAATTGAATGATAATGTTGAAATTAGGGAAAAGGAAAAGGAAAAGGCGACGAGGCTGTCACATCGTCGAACATTTGAATGGCTAAAGCAGCTCTCGCATGCAACCTTTGCTGACCAACAATCATCTTGA

mRNA sequence

CTTTGCTTTGTCACTTTATATAATATGAATCTCAATCTCTTCCTCTTCTTCTTCTTCTTTTCCATCAATACATAGCAATAAAAATTTCAACTCAGATTGGGAAGAGCTGATGGCAACAACCATCCATGGCGAAGAACACGATAAACAAGACGAAGACGAAGAAGAAGCGTTATCTTTGTGCGACCTTCCCGTCAAAGAAAAGCAGCAGCCTTCGCTTCTAAATGAAAACCCCATCACGGAGGATTTCGATTTCAACCACTGGCCACCGCCGCCGTCGCCGCCGCCCATGTGCGCTGCCGACGACATCTTCTTTCAAGGCCATTTGCTCCCTCTTCGCCTCTCTGTTAGCTCTGATAATACTCACAATCACTTCTTTTCTAAACCTCTGTCCGCCAGGTCGGAGTCTATGGATCATAATATGTTGAGGTTTAGAAATGGAAGCAGTAGTAGCAGTAGCAATAGCAGTAGAAGTCATTATTCCAGGTGTTCAAGTATTAGTAACAATTCCATTTCAATTCCGACGAACTCAAAGCCAAGAACTCAAAACAACGTCTTCCACTCTCACCCAAGTCCCACGCCCCAAATCAGATCCTTTTCGACTTTCGGCCGCCGGAGCCGGAGCCGGAGCTCCTCCCGCTGGGACTTTTTCCGGGTGGGTCTTCTCCGAACGCCAGGGATGGAACTTCACGACCTCAAAACTCGCACCACCGTCAGCAACGCGGTACCCACGGTGGGGCAGAAAACAACCGCCTCGTTTTTGGGTGTGGTGAGCTGCAAAAAATCAGTGGAGACAATACCGGCGGCGAAGAAGATAAAGAATTGGAGTGGGAATATTGGGAAGAAAAGGAATGAAAAGGGAAAGGGAATTGGGATTAGGGAGAAGGAATTGAATGATAATGTTGAAATTAGGGAAAAGGAAAAGGAAAAGGCGACGAGGCTGTCACATCGTCGAACATTTGAATGGCTAAAGCAGCTCTCGCATGCAACCTTTGCTGACCAACAATCATCTTGA

Coding sequence (CDS)

ATGGCAACAACCATCCATGGCGAAGAACACGATAAACAAGACGAAGACGAAGAAGAAGCGTTATCTTTGTGCGACCTTCCCGTCAAAGAAAAGCAGCAGCCTTCGCTTCTAAATGAAAACCCCATCACGGAGGATTTCGATTTCAACCACTGGCCACCGCCGCCGTCGCCGCCGCCCATGTGCGCTGCCGACGACATCTTCTTTCAAGGCCATTTGCTCCCTCTTCGCCTCTCTGTTAGCTCTGATAATACTCACAATCACTTCTTTTCTAAACCTCTGTCCGCCAGGTCGGAGTCTATGGATCATAATATGTTGAGGTTTAGAAATGGAAGCAGTAGTAGCAGTAGCAATAGCAGTAGAAGTCATTATTCCAGGTGTTCAAGTATTAGTAACAATTCCATTTCAATTCCGACGAACTCAAAGCCAAGAACTCAAAACAACGTCTTCCACTCTCACCCAAGTCCCACGCCCCAAATCAGATCCTTTTCGACTTTCGGCCGCCGGAGCCGGAGCCGGAGCTCCTCCCGCTGGGACTTTTTCCGGGTGGGTCTTCTCCGAACGCCAGGGATGGAACTTCACGACCTCAAAACTCGCACCACCGTCAGCAACGCGGTACCCACGGTGGGGCAGAAAACAACCGCCTCGTTTTTGGGTGTGGTGAGCTGCAAAAAATCAGTGGAGACAATACCGGCGGCGAAGAAGATAAAGAATTGGAGTGGGAATATTGGGAAGAAAAGGAATGAAAAGGGAAAGGGAATTGGGATTAGGGAGAAGGAATTGAATGATAATGTTGAAATTAGGGAAAAGGAAAAGGAAAAGGCGACGAGGCTGTCACATCGTCGAACATTTGAATGGCTAAAGCAGCTCTCGCATGCAACCTTTGCTGACCAACAATCATCTTGA
BLAST of CmoCh02G005200.1 vs. TrEMBL
Match: A0A0A0LPT6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G277620 PE=4 SV=1)

HSP 1 Score: 319.7 bits (818), Expect = 3.8e-84
Identity = 195/300 (65.00%), Postives = 225/300 (75.00%), Query Frame = 1

Query: 8   EEHDKQDEDEEEALSLCDLPVKEKQQPSLLNENPITE---DFDFNHWPPPPSPPPMCAAD 67
           ++ D+++E+EEEALSLCDLPVKEKQQP+      + E   DFDFNHW PPPSP  M  AD
Sbjct: 27  DDDDEEEEEEEEALSLCDLPVKEKQQPTRSVSTTVVETDQDFDFNHWRPPPSP--MLTAD 86

Query: 68  DIFFQGHLLPLRLSVSSDNTHNHFFSKPLSARSESMD-HNMLRFRNGSSSSSSNSSRSHY 127
           D+FFQGH+LPLRLS SS+N+ N+  +  L  RSESMD +NMLRFRN S+SSSS  SRSHY
Sbjct: 87  DLFFQGHMLPLRLSFSSENSQNN--NGNLWCRSESMDGNNMLRFRNESTSSSS--SRSHY 146

Query: 128 SRCSSISNNSISIPTNSKPR-TQNNVFHSHPSPTPQIRSFSTFGRRSRSRSSSRWDFFRV 187
           SR SS+SNNSISIPTNSKPR + NNVFHSHPSPTPQIRSFST     RSRSSSRW+FFR+
Sbjct: 147 SRSSSLSNNSISIPTNSKPRPSNNNVFHSHPSPTPQIRSFSTSSH--RSRSSSRWEFFRL 206

Query: 188 GLLRTPGMELHDLKTRTTVSNAVPT---VGQKTTASFLGVVSCKKSVETIPAAKKIKNWS 247
           GLLRTPGMELHDLKTRTT +    T      KTTAS LGVVSCK+SVET+P     KN  
Sbjct: 207 GLLRTPGMELHDLKTRTTTTTTTTTTTSTAHKTTASILGVVSCKRSVETVPTTTGSKN-- 266

Query: 248 GNIGKKRNEKGKGIGIREKELNDN-VEIREKEKEKATRLSHRRTFEWLKQLSHATFADQQ 299
                 R  +   +   +K  +DN VEIREKEKEK  R+SHRRTFEWLKQLSHATF ++Q
Sbjct: 267 ------RIRRENVLENNKKNNDDNKVEIREKEKEKERRVSHRRTFEWLKQLSHATFGEEQ 310

BLAST of CmoCh02G005200.1 vs. TrEMBL
Match: A0A0D2RFS0_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_003G028600 PE=4 SV=1)

HSP 1 Score: 149.8 bits (377), Expect = 5.2e-33
Identity = 128/331 (38.67%), Postives = 165/331 (49.85%), Query Frame = 1

Query: 8   EEHDKQDEDEEEALSLCDLPVKEKQQPSLLNEN----PITEDFDFNHWPPPP----SPPP 67
           E+ + +DE++EEALSLCDLPV   ++  L NE        EDF+F  WP       S P 
Sbjct: 6   EKREVEDEEQEEALSLCDLPVNLIEEKELKNEENGEPSQEEDFNFGSWPGHGGSFRSEPE 65

Query: 68  MCAADDIFFQGHLLPLRLSVSSDNTH----NHFFSKPLSARSESMDHNMLRFRNGSSSSS 127
           MC AD++FFQG +LPLR SVSSD       +H  S+ LS RSESMDH  L  R  S SSS
Sbjct: 66  MCVADEVFFQGQILPLRHSVSSDTGFRRHDSHNMSRSLS-RSESMDHGSLS-RFTSVSSS 125

Query: 128 SNSSRSHYSRCSSISNNSISIPTNSKPRTQNNVFHSHPSPTPQIRSFSTFGRRSRSRSSS 187
           S  S SHYS  S+ S + + I  N         F++HPSP PQIRS  T       RSSS
Sbjct: 126 STRSSSHYSTSSTNSKSGMKIRNN---------FNTHPSPKPQIRSTRTAVNSRIQRSSS 185

Query: 188 RWDFFRVGLLRTPGMELHDLKTRTTVSNAVPTVGQKTTASFLGVVSCKKSVETIPAAKKI 247
            WDFF++GL+R P + L DLK +    N+V               SC  S  +  + K +
Sbjct: 186 MWDFFKIGLVRAPELGLQDLKMKPRNKNSVSRNS-----------SCNSS-NSSSSTKLV 245

Query: 248 KNWSGNIGKKRNEKGKGIGIREKE------------------LNDNVEIREKEKEKAT-- 294
            N S      +N++    G+ EK                   LN+N  I+  +K+K T  
Sbjct: 246 NNGSSTAEVSKNQQESNKGLVEKRMGLFSGCTCSVNAVETVPLNNNNNIKNSDKDKTTSH 305

BLAST of CmoCh02G005200.1 vs. TrEMBL
Match: A0A061DNT4_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_000746 PE=4 SV=1)

HSP 1 Score: 143.3 bits (360), Expect = 4.9e-31
Identity = 142/387 (36.69%), Postives = 190/387 (49.10%), Query Frame = 1

Query: 1   MATTIHGEEHDK--QDEDEEEALSLCDLPV---KEKQQPSLLNENP-------ITEDFDF 60
           M T ++ +E +K   +E+EEEALSLCDLPV   KE+ Q    N            EDF+F
Sbjct: 1   METAVNCDEEEKWEAEEEEEEALSLCDLPVNLIKEENQVQPGNYEDGESQAIKTEEDFNF 60

Query: 61  NHWPPPPSPPP-MCAADDIFFQGHLLPLRLSVSSDNTHNHF------FSKPLSARSESMD 120
             W    S  P MCAAD++FF+G +LPLRLSVSSD+    F       S+ LS RSESMD
Sbjct: 61  GSWGGSLSTEPQMCAADEVFFKGQILPLRLSVSSDSGLTGFRQDSQNTSRCLS-RSESMD 120

Query: 121 HNML-RFRNGSSSSSSNSSRS-HYS----------------RCSSISNNSISIPTNSKPR 180
           H  L RF + SSSS S+S+RS HYS                   S SN+S +  + SKP 
Sbjct: 121 HGSLSRFTSISSSSRSSSTRSSHYSIGSSNSITVTARNFNSNSKSNSNSSSTSNSKSKPI 180

Query: 181 TQNNVFHSHPSPTPQIRSFST--FGRRSRSRSSSRWDFFRVGLLRTPGMELHDLKTRTTV 240
              N F++HPSP PQIR   T      SR++ +S WDFFR+GL+R P +EL DLK R+  
Sbjct: 181 KIRNNFNTHPSPKPQIRLSKTRPVNVSSRNQKTSMWDFFRLGLVRAPELELQDLKVRSNN 240

Query: 241 SNAVPTVGQKTTASFLGVVSCKKSVETIPAAKKIKNWSGNIGKKRNEKGKG-----IGIR 298
           +NA     + + +      S   S  T  +  KI N SG + + + +  KG     IG+ 
Sbjct: 241 NNA----NRNSVSRNSSCNSSNSSSSTKNSTSKIVNNSGEVARNQQDLNKGFLEKRIGLF 300

BLAST of CmoCh02G005200.1 vs. TrEMBL
Match: A0A0D2PQT2_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_008G080000 PE=4 SV=1)

HSP 1 Score: 143.3 bits (360), Expect = 4.9e-31
Identity = 142/346 (41.04%), Postives = 189/346 (54.62%), Query Frame = 1

Query: 2   ATTIHGEEHDKQDEDEEEALSLCDLPV---KEKQQPSLLNENP-------ITEDFDFNHW 61
           A+    E+  + +E+EEEALSLCDLPV   KE+ Q    NE           EDF+F  W
Sbjct: 13  ASNCDEEQKCEAEEEEEEALSLCDLPVNLIKEENQIQPRNEEDGESQAIKTEEDFNFGSW 72

Query: 62  PPPPSPPP-MCAADDIFFQGHLLPLRLSVSSDNT--HNHFFSKPLSARSESMDHNML-RF 121
               S  P MCAAD++FF+G +LPL +S+SSD++    H        RSES+DH  L RF
Sbjct: 73  DGCLSTKPEMCAADEVFFKGQILPLCVSISSDSSLIWCHRQDSQNKPRSESIDHGSLSRF 132

Query: 122 RNGSSSS-SSNSSRSHYSRCSSISN--------NSISIPTNSKPRTQNNVFHSHPSPTPQ 181
            + +SSS SS++  SHYS  SS S         NSI+  + S P    N F++HPSP PQ
Sbjct: 133 TSVTSSSRSSSTGSSHYSTNSSNSTTVTAATSFNSIT-SSKSNPNKSINNFNTHPSPKPQ 192

Query: 182 IRSFST----FGRRSRSRSSSRWDFFRVGLLRTPGMELHDLKTRTTVSN--AVPTVGQKT 241
           IR   T            SSS WDFFR+GL+R P +E H+LK RT  +N  +  +V + +
Sbjct: 193 IRLSKTRPMNISSSRNQTSSSVWDFFRLGLVRAPELEFHELKIRTNNNNNPSRNSVSRNS 252

Query: 242 TAS-----------FL--GVVS-CKKS---VETIPAAKKIKNWSGNIGKKRNEKGKGI-- 298
           + S           FL  G+ S CK S   VET+P   KI      + KK++EK K +  
Sbjct: 253 SCSSSNSQLDSIKGFLRKGLFSGCKCSVNVVETVP-LNKIAVIKSMMIKKKSEKEKAMFQ 312

BLAST of CmoCh02G005200.1 vs. TrEMBL
Match: U7DVN2_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0318s00200g PE=4 SV=1)

HSP 1 Score: 134.4 bits (337), Expect = 2.3e-28
Identity = 132/376 (35.11%), Postives = 183/376 (48.67%), Query Frame = 1

Query: 8   EEHDKQDEDEEEALSLCDLPVK---------EKQQPSLLNENPITEDFDFNHWPPPP--- 67
           ++ + QD++EEE+LSLCDLPV           + Q +        EDFDF  +       
Sbjct: 18  QDEEDQDQEEEESLSLCDLPVNMVKGENNQSTRDQEAHKETETNQEDFDFGPFRGGDGSL 77

Query: 68  -SPPPMCAADDIFFQGHLLPLRLSVSSDNTHNHFFSKP------LSARSESMDHNMLRFR 127
            +   MCAADDIFFQG +LPLRLSVSS++  N F +          +RSESMDHN L   
Sbjct: 78  SNKSDMCAADDIFFQGQILPLRLSVSSESGVNKFKNDTSLNPCHCLSRSESMDHNSLGGL 137

Query: 128 NGSSSSSSNSSRSHYSRCSSISNNSISIPTNSKPRTQNNVFHSHPSPTPQIR-SFSTFGR 187
             S SS S+SSRSHYS  S+ ++++I+     KP  QN  F +HPSP PQIR S ++ G 
Sbjct: 138 T-SFSSRSSSSRSHYSSSSTSTSSAIASTRMIKPIIQNQ-FLTHPSPKPQIRLSSASLGN 197

Query: 188 --RSRSRSSSRWDFFRVGLLRTPGMELHDLKTRTTVSNAVPTVGQKTTASFLGVVSCKK- 247
              S+ R+SS WDFFR+GL+RTP +E  DLK R  VS         +++S   +  C K 
Sbjct: 198 AASSKPRNSSVWDFFRLGLVRTPEIEFQDLKVRNYVSR-----NSSSSSSNSSINKCSKI 257

Query: 248 --SVETIPAAKKIKNWSGNIGKKRNEKGKGIG---------------------IREKELN 299
             S     +++KIKN S +     N+ GK +G                     ++   LN
Sbjct: 258 NVSNGNSKSSRKIKNESRH---NSNDSGKKMGKRSLLEKRGGLLSGCSCSVSTVKPVPLN 317

BLAST of CmoCh02G005200.1 vs. TAIR10
Match: AT5G67350.1 (AT5G67350.1 unknown protein)

HSP 1 Score: 82.4 bits (202), Expect = 5.2e-16
Identity = 93/307 (30.29%), Postives = 141/307 (45.93%), Query Frame = 1

Query: 8   EEHDKQDEDEEEALSLCDLPVKEKQQPSLLNENPITEDFD----------FNHWPPPPSP 67
           ++ D ++E+EEEALSLCDLP ++ +  S++ E    E+FD          F        P
Sbjct: 22  DDDDVEEEEEEEALSLCDLPNEKGELRSIVKEED--EEFDSGFEFGIGSSFRAGSDSCEP 81

Query: 68  PP-MCAADDIFFQGHLLPLRLSVSSDNTHNHFFSKPLSARSESMDHNMLRFRNGSSSSSS 127
            P M  AD++FF+G +LPLR SVS D   N      L  RSES++      R G      
Sbjct: 82  APEMSTADELFFKGRILPLRHSVSLDAGLNE--PTRLITRSESVEFR----RTG------ 141

Query: 128 NSSRSHYSRCSSISNNSISIPTNSKPRTQNNVFHSHPSPTPQIRSFSTFGRRSRS----R 187
                              I  + +    N + +S PSP PQIR  S+   R  S    +
Sbjct: 142 -------------------IIRSDRKIKNNFIDYSQPSPQPQIRRSSSMTARVNSIRNPK 201

Query: 188 SSSRWDFFRVGLLRTPGMELHDLKTRTTVSNAVPTVGQKTTASFLGVVSCKKSVETIPAA 247
           SSS WDF R+GL+RTP +EL     RTT  NA  +V + ++ S     S  K + +  + 
Sbjct: 202 SSSIWDFLRLGLVRTPEIEL-----RTTAGNAKLSVSRNSSCSSTSTSSNSKKIGSGESR 261

Query: 248 KKIKNWS-------GNIGKKRNEKGKGIGIREKELNDNVEIREK---EKEKATRLSHRRT 290
            + +  S        ++  +       I +   E  +   + EK   +KE+ + ++ +RT
Sbjct: 262 SRNRRRSFMFSDCKCSVSTETKMAPVKIKVSSGETEEKQRMMEKKTAKKEEKSAMARKRT 290

BLAST of CmoCh02G005200.1 vs. NCBI nr
Match: gi|778669884|ref|XP_011649316.1| (PREDICTED: homeobox protein 6-like [Cucumis sativus])

HSP 1 Score: 319.7 bits (818), Expect = 5.4e-84
Identity = 195/300 (65.00%), Postives = 225/300 (75.00%), Query Frame = 1

Query: 8   EEHDKQDEDEEEALSLCDLPVKEKQQPSLLNENPITE---DFDFNHWPPPPSPPPMCAAD 67
           ++ D+++E+EEEALSLCDLPVKEKQQP+      + E   DFDFNHW PPPSP  M  AD
Sbjct: 27  DDDDEEEEEEEEALSLCDLPVKEKQQPTRSVSTTVVETDQDFDFNHWRPPPSP--MLTAD 86

Query: 68  DIFFQGHLLPLRLSVSSDNTHNHFFSKPLSARSESMD-HNMLRFRNGSSSSSSNSSRSHY 127
           D+FFQGH+LPLRLS SS+N+ N+  +  L  RSESMD +NMLRFRN S+SSSS  SRSHY
Sbjct: 87  DLFFQGHMLPLRLSFSSENSQNN--NGNLWCRSESMDGNNMLRFRNESTSSSS--SRSHY 146

Query: 128 SRCSSISNNSISIPTNSKPR-TQNNVFHSHPSPTPQIRSFSTFGRRSRSRSSSRWDFFRV 187
           SR SS+SNNSISIPTNSKPR + NNVFHSHPSPTPQIRSFST     RSRSSSRW+FFR+
Sbjct: 147 SRSSSLSNNSISIPTNSKPRPSNNNVFHSHPSPTPQIRSFSTSSH--RSRSSSRWEFFRL 206

Query: 188 GLLRTPGMELHDLKTRTTVSNAVPT---VGQKTTASFLGVVSCKKSVETIPAAKKIKNWS 247
           GLLRTPGMELHDLKTRTT +    T      KTTAS LGVVSCK+SVET+P     KN  
Sbjct: 207 GLLRTPGMELHDLKTRTTTTTTTTTTTSTAHKTTASILGVVSCKRSVETVPTTTGSKN-- 266

Query: 248 GNIGKKRNEKGKGIGIREKELNDN-VEIREKEKEKATRLSHRRTFEWLKQLSHATFADQQ 299
                 R  +   +   +K  +DN VEIREKEKEK  R+SHRRTFEWLKQLSHATF ++Q
Sbjct: 267 ------RIRRENVLENNKKNNDDNKVEIREKEKEKERRVSHRRTFEWLKQLSHATFGEEQ 310

BLAST of CmoCh02G005200.1 vs. NCBI nr
Match: gi|659115855|ref|XP_008457772.1| (PREDICTED: histone-lysine N-methyltransferase, H3 lysine-79 specific [Cucumis melo])

HSP 1 Score: 312.4 bits (799), Expect = 8.7e-82
Identity = 193/295 (65.42%), Postives = 223/295 (75.59%), Query Frame = 1

Query: 11  DKQDEDEEEALSLCDLPVKEKQQPSLLNENPI--TEDFDFNHWPPPPSPPPMCAADDIFF 70
           D  D++EEEALSLCDLPVKEKQQP+      I  TEDFDFN+W PPPSP  M  ADD+FF
Sbjct: 27  DDDDDEEEEALSLCDLPVKEKQQPTRSVSATIVETEDFDFNNWRPPPSP--MLTADDLFF 86

Query: 71  QGHLLPLRLSVSSDNTHNHFFSKPLSARSESMD-HNMLRFRNGSSSSSSNSSRSHYSRCS 130
           QGH+LPLRLS SS+N+ N+  +  L +RSESMD +NMLRFRNGS+SSSS  SRSHYSR S
Sbjct: 87  QGHMLPLRLSFSSENSQNN--NGNLWSRSESMDRNNMLRFRNGSTSSSS--SRSHYSRSS 146

Query: 131 SISNNSISIPT-NSKPR-TQNNVFHSHPSPTPQIRSFSTFGRRSRSRSSSRWDFFRVGLL 190
           S+SNNSISIPT N+KPR + NNVFHSHPSPTPQIRSFST   RSRS  SSRW+FFR+GLL
Sbjct: 147 SLSNNSISIPTTNTKPRPSNNNVFHSHPSPTPQIRSFSTSSHRSRS--SSRWEFFRLGLL 206

Query: 191 RTPGMELHDLKTRTTVSNAVPTVGQKTTASFLGVVSCKKSVETIPAAKKIKNWSGNIGKK 250
           RTPGMELHDLKTRTT +        KTTAS LGVVSCK+SV+T+P      N   N  ++
Sbjct: 207 RTPGMELHDLKTRTTTTTTTTMTTHKTTASILGVVSCKRSVDTVPTTTGSSN---NRIRR 266

Query: 251 RNEKGKGIGIREKELNDNVEIREKEKEKAT--RLSHRRTFEWLKQLSHATFADQQ 299
            N       + +K  N+ VEIREKEKEK    R+SHRRTFEWLKQLSHATF ++Q
Sbjct: 267 EN-------VLKKNNNNKVEIREKEKEKEKERRVSHRRTFEWLKQLSHATFGEEQ 303

BLAST of CmoCh02G005200.1 vs. NCBI nr
Match: gi|823139478|ref|XP_012469599.1| (PREDICTED: probable membrane-associated kinase regulator 1 [Gossypium raimondii])

HSP 1 Score: 149.8 bits (377), Expect = 7.5e-33
Identity = 128/331 (38.67%), Postives = 165/331 (49.85%), Query Frame = 1

Query: 8   EEHDKQDEDEEEALSLCDLPVKEKQQPSLLNEN----PITEDFDFNHWPPPP----SPPP 67
           E+ + +DE++EEALSLCDLPV   ++  L NE        EDF+F  WP       S P 
Sbjct: 6   EKREVEDEEQEEALSLCDLPVNLIEEKELKNEENGEPSQEEDFNFGSWPGHGGSFRSEPE 65

Query: 68  MCAADDIFFQGHLLPLRLSVSSDNTH----NHFFSKPLSARSESMDHNMLRFRNGSSSSS 127
           MC AD++FFQG +LPLR SVSSD       +H  S+ LS RSESMDH  L  R  S SSS
Sbjct: 66  MCVADEVFFQGQILPLRHSVSSDTGFRRHDSHNMSRSLS-RSESMDHGSLS-RFTSVSSS 125

Query: 128 SNSSRSHYSRCSSISNNSISIPTNSKPRTQNNVFHSHPSPTPQIRSFSTFGRRSRSRSSS 187
           S  S SHYS  S+ S + + I  N         F++HPSP PQIRS  T       RSSS
Sbjct: 126 STRSSSHYSTSSTNSKSGMKIRNN---------FNTHPSPKPQIRSTRTAVNSRIQRSSS 185

Query: 188 RWDFFRVGLLRTPGMELHDLKTRTTVSNAVPTVGQKTTASFLGVVSCKKSVETIPAAKKI 247
            WDFF++GL+R P + L DLK +    N+V               SC  S  +  + K +
Sbjct: 186 MWDFFKIGLVRAPELGLQDLKMKPRNKNSVSRNS-----------SCNSS-NSSSSTKLV 245

Query: 248 KNWSGNIGKKRNEKGKGIGIREKE------------------LNDNVEIREKEKEKAT-- 294
            N S      +N++    G+ EK                   LN+N  I+  +K+K T  
Sbjct: 246 NNGSSTAEVSKNQQESNKGLVEKRMGLFSGCTCSVNAVETVPLNNNNNIKNSDKDKTTSH 305

BLAST of CmoCh02G005200.1 vs. NCBI nr
Match: gi|590705501|ref|XP_007047456.1| (Uncharacterized protein TCM_000746 [Theobroma cacao])

HSP 1 Score: 143.3 bits (360), Expect = 7.0e-31
Identity = 142/387 (36.69%), Postives = 190/387 (49.10%), Query Frame = 1

Query: 1   MATTIHGEEHDK--QDEDEEEALSLCDLPV---KEKQQPSLLNENP-------ITEDFDF 60
           M T ++ +E +K   +E+EEEALSLCDLPV   KE+ Q    N            EDF+F
Sbjct: 1   METAVNCDEEEKWEAEEEEEEALSLCDLPVNLIKEENQVQPGNYEDGESQAIKTEEDFNF 60

Query: 61  NHWPPPPSPPP-MCAADDIFFQGHLLPLRLSVSSDNTHNHF------FSKPLSARSESMD 120
             W    S  P MCAAD++FF+G +LPLRLSVSSD+    F       S+ LS RSESMD
Sbjct: 61  GSWGGSLSTEPQMCAADEVFFKGQILPLRLSVSSDSGLTGFRQDSQNTSRCLS-RSESMD 120

Query: 121 HNML-RFRNGSSSSSSNSSRS-HYS----------------RCSSISNNSISIPTNSKPR 180
           H  L RF + SSSS S+S+RS HYS                   S SN+S +  + SKP 
Sbjct: 121 HGSLSRFTSISSSSRSSSTRSSHYSIGSSNSITVTARNFNSNSKSNSNSSSTSNSKSKPI 180

Query: 181 TQNNVFHSHPSPTPQIRSFST--FGRRSRSRSSSRWDFFRVGLLRTPGMELHDLKTRTTV 240
              N F++HPSP PQIR   T      SR++ +S WDFFR+GL+R P +EL DLK R+  
Sbjct: 181 KIRNNFNTHPSPKPQIRLSKTRPVNVSSRNQKTSMWDFFRLGLVRAPELELQDLKVRSNN 240

Query: 241 SNAVPTVGQKTTASFLGVVSCKKSVETIPAAKKIKNWSGNIGKKRNEKGKG-----IGIR 298
           +NA     + + +      S   S  T  +  KI N SG + + + +  KG     IG+ 
Sbjct: 241 NNA----NRNSVSRNSSCNSSNSSSSTKNSTSKIVNNSGEVARNQQDLNKGFLEKRIGLF 300

BLAST of CmoCh02G005200.1 vs. NCBI nr
Match: gi|823206427|ref|XP_012437092.1| (PREDICTED: uncharacterized protein LOC105763405 [Gossypium raimondii])

HSP 1 Score: 143.3 bits (360), Expect = 7.0e-31
Identity = 142/346 (41.04%), Postives = 189/346 (54.62%), Query Frame = 1

Query: 2   ATTIHGEEHDKQDEDEEEALSLCDLPV---KEKQQPSLLNENP-------ITEDFDFNHW 61
           A+    E+  + +E+EEEALSLCDLPV   KE+ Q    NE           EDF+F  W
Sbjct: 13  ASNCDEEQKCEAEEEEEEALSLCDLPVNLIKEENQIQPRNEEDGESQAIKTEEDFNFGSW 72

Query: 62  PPPPSPPP-MCAADDIFFQGHLLPLRLSVSSDNT--HNHFFSKPLSARSESMDHNML-RF 121
               S  P MCAAD++FF+G +LPL +S+SSD++    H        RSES+DH  L RF
Sbjct: 73  DGCLSTKPEMCAADEVFFKGQILPLCVSISSDSSLIWCHRQDSQNKPRSESIDHGSLSRF 132

Query: 122 RNGSSSS-SSNSSRSHYSRCSSISN--------NSISIPTNSKPRTQNNVFHSHPSPTPQ 181
            + +SSS SS++  SHYS  SS S         NSI+  + S P    N F++HPSP PQ
Sbjct: 133 TSVTSSSRSSSTGSSHYSTNSSNSTTVTAATSFNSIT-SSKSNPNKSINNFNTHPSPKPQ 192

Query: 182 IRSFST----FGRRSRSRSSSRWDFFRVGLLRTPGMELHDLKTRTTVSN--AVPTVGQKT 241
           IR   T            SSS WDFFR+GL+R P +E H+LK RT  +N  +  +V + +
Sbjct: 193 IRLSKTRPMNISSSRNQTSSSVWDFFRLGLVRAPELEFHELKIRTNNNNNPSRNSVSRNS 252

Query: 242 TAS-----------FL--GVVS-CKKS---VETIPAAKKIKNWSGNIGKKRNEKGKGI-- 298
           + S           FL  G+ S CK S   VET+P   KI      + KK++EK K +  
Sbjct: 253 SCSSSNSQLDSIKGFLRKGLFSGCKCSVNVVETVP-LNKIAVIKSMMIKKKSEKEKAMFQ 312

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0LPT6_CUCSA3.8e-8465.00Uncharacterized protein OS=Cucumis sativus GN=Csa_2G277620 PE=4 SV=1[more]
A0A0D2RFS0_GOSRA5.2e-3338.67Uncharacterized protein OS=Gossypium raimondii GN=B456_003G028600 PE=4 SV=1[more]
A0A061DNT4_THECC4.9e-3136.69Uncharacterized protein OS=Theobroma cacao GN=TCM_000746 PE=4 SV=1[more]
A0A0D2PQT2_GOSRA4.9e-3141.04Uncharacterized protein OS=Gossypium raimondii GN=B456_008G080000 PE=4 SV=1[more]
U7DVN2_POPTR2.3e-2835.11Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0318s00200g PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G67350.15.2e-1630.29 unknown protein[more]
Match NameE-valueIdentityDescription
gi|778669884|ref|XP_011649316.1|5.4e-8465.00PREDICTED: homeobox protein 6-like [Cucumis sativus][more]
gi|659115855|ref|XP_008457772.1|8.7e-8265.42PREDICTED: histone-lysine N-methyltransferase, H3 lysine-79 specific [Cucumis me... [more]
gi|823139478|ref|XP_012469599.1|7.5e-3338.67PREDICTED: probable membrane-associated kinase regulator 1 [Gossypium raimondii][more]
gi|590705501|ref|XP_007047456.1|7.0e-3136.69Uncharacterized protein TCM_000746 [Theobroma cacao][more]
gi|823206427|ref|XP_012437092.1|7.0e-3141.04PREDICTED: uncharacterized protein LOC105763405 [Gossypium raimondii][more]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
CmoCh02G005200CmoCh02G005200gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
CmoCh02G005200.1CmoCh02G005200.1-proteinpolypeptide


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmoCh02G005200.1.exon.3CmoCh02G005200.1.exon.3exon
CmoCh02G005200.1.exon.2CmoCh02G005200.1.exon.2exon
CmoCh02G005200.1.exon.1CmoCh02G005200.1.exon.1exon


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmoCh02G005200.1.five_prime_UTR.1CmoCh02G005200.1.five_prime_UTR.1five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmoCh02G005200.1.CDS.1CmoCh02G005200.1.CDS.1CDS
CmoCh02G005200.1.CDS.2CmoCh02G005200.1.CDS.2CDS
CmoCh02G005200.1.CDS.3CmoCh02G005200.1.CDS.3CDS


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableunknownCoilCoilcoord: 257..277
scor
NoneNo IPR availablePANTHERPTHR33922FAMILY NOT NAMEDcoord: 8..300
score: 1.5
NoneNo IPR availablePANTHERPTHR33922:SF2SUBFAMILY NOT NAMEDcoord: 8..300
score: 1.5