CmoCh14G011080 (gene) Cucurbita moschata (Rifu)

Name: CmoCh14G011080
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: ATECP63, putative
Location: Cmo_Chr14 : 7433591 .. 7436068 (-)
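
The location entry gives a chromosome, a start and end coordinate, and a strand. As a minimal sketch of how such an entry can be read programmatically, the hypothetical helper `parse_location` below splits it into its parts and derives the span length. It assumes the coordinates are 1-based and inclusive, which the page does not state; under that assumption the span is 2478 bp.

```python
import re


def parse_location(text):
    """Parse a location such as 'Cmo_Chr14 : 7433591 .. 7436068 (-)'.

    Assumption (not stated on the page): coordinates are 1-based and
    inclusive, so length = end - start + 1.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start = int(match.group(2))
    end = int(match.group(3))
    return {
        "chrom": match.group(1),
        "start": start,
        "end": end,
        "strand": match.group(4),
        "length": end - start + 1,
    }


loc = parse_location("Cmo_Chr14 : 7433591 .. 7436068 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

The `(-)` strand marker indicates that the gene is annotated on the reverse strand of the chromosome.
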
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exon, five_prime_UTR, CDS
TTCTCTTTCTCGTCCATTCCGTGTTCGGTGGTGACCAAAAATTCTAGAGAATGAATGGGAACGCTTCCATTTGTTTCCATTCCGATAAATACTCCTCACCGTTTCTTACGATCTTCCATTCTCAATCACCGCTTCACCCTCCCATCTTCCAATTCCATCCACACTCTAACATAAGGTAACCACGCAACACCCACTCTTGTTTTCAGATTCTTTCTCCTTTTGTTCCTCTTCGATTTGCTTATTTATGTGCTTCATTTCAATCCGTTCAATGTTCTGTGCTCTTTGTTGGATGAATGAATCGCTATTTCGAATCCGGTTCTTGAATCCGCGAGGGAAGTGTTTTCGATTTTTGCCTTTTCTTTCGTCATTGGAATCCGGTTGTTAATTTTTGGTTTACGGAATTAATGCGGCTTTAGGATCCTGATTATTTGCGTTTTTCTTCTTAATCCCTTACTTAGGGAGGAACCTCTCCTCTTTATTTGGTTCTGTTGTAGTTTGTATCGAATTTTCATTTCTCCTTCTGTTGAATCTAGATCGCATTCCTATTTTGAGGCATGGAATCTTAGAAAATATATTTGAAGAAAATGAAGGCTCGTGTAAGGACTCTTTTCCATTCGAATTTAAAATGTACATCTCGAAAACAAGTTTGAAACTGGTAAAAGTGGAAGTGATAAAAAAAAAAAATATGGTTTATCCTTGTGCAGTATTGCCGATAGTTTACTTTATGATTCTTTTGGAAAAAGAAAAGGCACGGTTAGACAATTTTTTATTACACATAATGCTGAATACAGCAATAAATCAGTTCCACTTTTCTTCGTTAACATGCTAGTTTAATTATTACAGATCAAGAACTTTTCAAGTTACTTCAATTTCTTTTCTCTTTTTATTTATTTTGCTCTCGAGGAATATTTTCAGTTGGTGATTCTCTGCTTTCTAATTGTTCAGGTGCGTTTAGTATTGAAAATCAAGTGCAGTTGGGAAGAAGATTTCATAACTGAAGTTTTAGCGAAATTCATCAAGATGGCTGATGAATCTGTAGCTATCCCAATTATGGGAAAGGAAGAGAAGACAAAGGTTGAAGTTGAGTCTGAGGTTGTTAAGGTGGATGATGAGGTGGAGAAAGAAAAGCATGAGGTTAAGATCAAGAGCAAGGAGGCAAAATATGAAGATGGAAAGAAAGAGAAGACAGAGGTGGAGGTTCAACTCAAGAGCTCATCAGGGAGGAAAGAGAAGATGAAAGATGTTGAGAATGAAAAGGAGAAGAAGAAAGAAGAGAAGCAGAAGAAGAAGGATGAAGGCAAAGATGAAAAAGAAGCAAAAAAGAAGCACAAAGATGAGGATGGAGCAGAGAAGGAAACAGAAGTGAAGAAGAAGAAGAAGGAAAAGGAAAAGGAAGAGGAGAAGAAAGAGAAAAAAGATGAAAAGAAACATAAGAAGAAGAAGGAAGTTGAGGATGATGAAGAGTGTGAAAAAGAAGAGAAGAAACAGGAAAAGGGTAAGAAGGATAAAGAGAAGGAGAAGGGTAAAGGTGGTGATGCAGTGGAAGATAAGAAGGTGAAGAAAGGAATTGGGAATGAAGAAGAAGAAGACGATGATGATGATGGTAAAGAAGAGAAGAAGAAGAAAAAGAACGAAGAGAAGGAAAAGAAGAAGAAGGAGAAAGGAGGAGAAGAAGAAGATGATAGTAAAGAAGAGAAGAAGAAGAAGAAGAAGAAAGGAGACGAAGGAGTAGAAGAAGATGATAGTAAAGAAGAGAAGAAGAAGAAGAAGAAGAAAGGAGAGAAGGAAAAAGAGAAAAAGGAGGAAGAAGTAGAAGAAGATGATAGCAAAGAAGAGAAGAAGAAGAAGAAGAAGAAAGGAGGAGAGAAGGAACAGGAGAAAAAGGAGAAAGGAGAAGATGATATTAAAGAAGAAAAGAATAAAAAGAAGAAGAAGGGGGAGGATGAAGATGATGATGGCGAAGAAGAAAAGAAAGAGAAGAAAGAAAAGAAGGATGAGAA
GAAAAAGGAAAAAGGTGGGAAAGAGAAAGAGAAAAGAGAGAAGAACAAAGTAGAAGATGAAGAAACGGAGGAGAAGGATGAAGTAGACGAAGATGAAACGAAGGGAAAGAAGGAGAAGAAGAAGAAAGAGAAGGAAGATGAAACGAAGAGCAGAAGTAGGGAAATAGAGATAGAAATAGAAATAGAAGTAGAAGAAGAAGCAGGTGAGAAGGGAGGTTGTAGAGGAGGAGGAGGAGAAGAGGAGAAGGATAACGAGAAGAAAAACAAGAAAGATAAAGAAGAAAAGAAGAAGAAAGTAGAAGAGAAAAACAGAAGTAGAGATGTGGGGAAATTGAAACAGAAACTAGAGAAGATGGATGTTAAAATCAATGCCTTGCTCGACAAGAAGGCAGACATTCTGAGGCAAATAAAAGAAATTGAAGACGCAAATTCCAACATTGTTGCTGCTGCTGCAAAACCTACGGAAGAAGTGGCATAA

mRNA sequence

TTCTCTTTCTCGTCCATTCCGTGTTCGGTGGTGACCAAAAATTCTAGAGAATGAATGGGAACGCTTCCATTTGTTTCCATTCCGATAAATACTCCTCACCGTTTCTTACGATCTTCCATTCTCAATCACCGCTTCACCCTCCCATCTTCCAATTCCATCCACACTCTAACATAAGTATTGCCGATAGTTTACTTTATGATTCTTTTGGAAAAAGAAAAGGCACGGTGCGTTTAGTATTGAAAATCAAGTGCAGTTGGGAAGAAGATTTCATAACTGAAGTTTTAGCGAAATTCATCAAGATGGCTGATGAATCTGTAGCTATCCCAATTATGGGAAAGGAAGAGAAGACAAAGGTTGAAGTTGAGTCTGAGGTTGTTAAGGTGGATGATGAGGTGGAGAAAGAAAAGCATGAGGTTAAGATCAAGAGCAAGGAGGCAAAATATGAAGATGGAAAGAAAGAGAAGACAGAGGTGGAGGTTCAACTCAAGAGCTCATCAGGGAGGAAAGAGAAGATGAAAGATGTTGAGAATGAAAAGGAGAAGAAGAAAGAAGAGAAGCAGAAGAAGAAGGATGAAGGCAAAGATGAAAAAGAAGCAAAAAAGAAGCACAAAGATGAGGATGGAGCAGAGAAGGAAACAGAAGTGAAGAAGAAGAAGAAGGAAAAGGAAAAGGAAGAGGAGAAGAAAGAGAAAAAAGATGAAAAGAAACATAAGAAGAAGAAGGAAGTTGAGGATGATGAAGAGTGTGAAAAAGAAGAGAAGAAACAGGAAAAGGGTAAGAAGGATAAAGAGAAGGAGAAGGGTAAAGGTGGTGATGCAGTGGAAGATAAGAAGGTGAAGAAAGGAATTGGGAATGAAGAAGAAGAAGACGATGATGATGATGGTAAAGAAGAGAAGAAGAAGAAAAAGAACGAAGAGAAGGAAAAGAAGAAGAAGGAGAAAGGAGGAGAAGAAGAAGATGATAGTAAAGAAGAGAAGAAGAAGAAGAAGAAGAAAGGAGACGAAGGAGTAGAAGAAGATGATAGTAAAGAAGAGAAGAAGAAGAAGAAGAAGAAAGGAGAGAAGGAAAAAGAGAAAAAGGAGGAAGAAGTAGAAGAAGATGATAGCAAAGAAGAGAAGAAGAAGAAGAAGAAGAAAGGAGGAGAGAAGGAACAGGAGAAAAAGGAGAAAGGAGAAGATGATATTAAAGAAGAAAAGAATAAAAAGAAGAAGAAGGGGGAGGATGAAGATGATGATGGCGAAGAAGAAAAGAAAGAGAAGAAAGAAAAGAAGGATGAGAAGAAAAAGGAAAAAGGTGGGAAAGAGAAAGAGAAAAGAGAGAAGAACAAAGTAGAAGATGAAGAAACGGAGGAGAAGGATGAAGTAGACGAAGATGAAACGAAGGGAAAGAAGGAGAAGAAGAAGAAAGAGAAGGAAGATGAAACGAAGAGCAGAAGTAGGGAAATAGAGATAGAAATAGAAATAGAAGTAGAAGAAGAAGCAGGTGAGAAGGGAGGTTGTAGAGGAGGAGGAGGAGAAGAGGAGAAGGATAACGAGAAGAAAAACAAGAAAGATAAAGAAGAAAAGAAGAAGAAAGTAGAAGAGAAAAACAGAAGTAGAGATGTGGGGAAATTGAAACAGAAACTAGAGAAGATGGATGTTAAAATCAATGCCTTGCTCGACAAGAAGGCAGACATTCTGAGGCAAATAAAAGAAATTGAAGACGCAAATTCCAACATTGTTGCTGCTGCTGCAAAACCTACGGAAGAAGTGGCATAA

Coding sequence (CDS)

ATGAATGGGAACGCTTCCATTTGTTTCCATTCCGATAAATACTCCTCACCGTTTCTTACGATCTTCCATTCTCAATCACCGCTTCACCCTCCCATCTTCCAATTCCATCCACACTCTAACATAAGTATTGCCGATAGTTTACTTTATGATTCTTTTGGAAAAAGAAAAGGCACGGTGCGTTTAGTATTGAAAATCAAGTGCAGTTGGGAAGAAGATTTCATAACTGAAGTTTTAGCGAAATTCATCAAGATGGCTGATGAATCTGTAGCTATCCCAATTATGGGAAAGGAAGAGAAGACAAAGGTTGAAGTTGAGTCTGAGGTTGTTAAGGTGGATGATGAGGTGGAGAAAGAAAAGCATGAGGTTAAGATCAAGAGCAAGGAGGCAAAATATGAAGATGGAAAGAAAGAGAAGACAGAGGTGGAGGTTCAACTCAAGAGCTCATCAGGGAGGAAAGAGAAGATGAAAGATGTTGAGAATGAAAAGGAGAAGAAGAAAGAAGAGAAGCAGAAGAAGAAGGATGAAGGCAAAGATGAAAAAGAAGCAAAAAAGAAGCACAAAGATGAGGATGGAGCAGAGAAGGAAACAGAAGTGAAGAAGAAGAAGAAGGAAAAGGAAAAGGAAGAGGAGAAGAAAGAGAAAAAAGATGAAAAGAAACATAAGAAGAAGAAGGAAGTTGAGGATGATGAAGAGTGTGAAAAAGAAGAGAAGAAACAGGAAAAGGGTAAGAAGGATAAAGAGAAGGAGAAGGGTAAAGGTGGTGATGCAGTGGAAGATAAGAAGGTGAAGAAAGGAATTGGGAATGAAGAAGAAGAAGACGATGATGATGATGGTAAAGAAGAGAAGAAGAAGAAAAAGAACGAAGAGAAGGAAAAGAAGAAGAAGGAGAAAGGAGGAGAAGAAGAAGATGATAGTAAAGAAGAGAAGAAGAAGAAGAAGAAGAAAGGAGACGAAGGAGTAGAAGAAGATGATAGTAAAGAAGAGAAGAAGAAGAAGAAGAAGAAAGGAGAGAAGGAAAAAGAGAAAAAGGAGGAAGAAGTAGAAGAAGATGATAGCAAAGAAGAGAAGAAGAAGAAGAAGAAGAAAGGAGGAGAGAAGGAACAGGAGAAAAAGGAGAAAGGAGAAGATGATATTAAAGAAGAAAAGAATAAAAAGAAGAAGAAGGGGGAGGATGAAGATGATGATGGCGAAGAAGAAAAGAAAGAGAAGAAAGAAAAGAAGGATGAGAAGAAAAAGGAAAAAGGTGGGAAAGAGAAAGAGAAAAGAGAGAAGAACAAAGTAGAAGATGAAGAAACGGAGGAGAAGGATGAAGTAGACGAAGATGAAACGAAGGGAAAGAAGGAGAAGAAGAAGAAAGAGAAGGAAGATGAAACGAAGAGCAGAAGTAGGGAAATAGAGATAGAAATAGAAATAGAAGTAGAAGAAGAAGCAGGTGAGAAGGGAGGTTGTAGAGGAGGAGGAGGAGAAGAGGAGAAGGATAACGAGAAGAAAAACAAGAAAGATAAAGAAGAAAAGAAGAAGAAAGTAGAAGAGAAAAACAGAAGTAGAGATGTGGGGAAATTGAAACAGAAACTAGAGAAGATGGATGTTAAAATCAATGCCTTGCTCGACAAGAAGGCAGACATTCTGAGGCAAATAAAAGAAATTGAAGACGCAAATTCCAACATTGTTGCTGCTGCTGCAAAACCTACGGAAGAAGTGGCATAA
BLAST of CmoCh14G011080 vs. TrEMBL
Match: A0A0A0LG90_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G842100 PE=4 SV=1)

HSP 1 Score: 138.3 bits (347), Expect = 3.0e-29
Identity = 261/522 (50.00%), Positives = 340/522 (65.13%), Query Frame = 1

Query: 84  MADESVAIPIMGKEEKTKVEVESEVVKVDDEVEKEKHEVKIKSKEAKYEDGKKEKTEVEV 143
           MAD SV+IP++GKEEKTKVE++ EVVKVD EVEKEK EVK+K+KE K+ED KKEKT  ++
Sbjct: 1   MADGSVSIPVIGKEEKTKVELDWEVVKVDKEVEKEKLEVKMKNKEVKHEDDKKEKTAAKL 60

Query: 144 QLKSSSGRKEKMKDVENEKEK------KKEEKQKKKDEG-------KDE----------- 203
           Q KSSS +KEK KD+EN+KEK      +K++K K K++G         +           
Sbjct: 61  QRKSSSVQKEKAKDIENKKEKSLKSDDEKDKKVKVKEDGDSKLEGKNKKEEKEEKHKNKD 120

Query: 204 -----KEAKKKHKDEDGAEKETEVKKKKKEKEKEEEKKEKKDEKKHK------------- 263
                KE+KKKHKDEDGAEKETEV KK   KEK +EKKEKKDEKK K             
Sbjct: 121 EAKEEKESKKKHKDEDGAEKETEVNKK---KEKNDEKKEKKDEKKPKKKDEKSGENDGVK 180

Query: 264 ----KKKEVEDDEECEKEEKKQEKGKKDKEKEKGKGGDAVEDKKVKKGIGNEEEEDDDDD 323
               KKKE E+DE+ EK+EKKQEKGKKD  KEKGKG D VEDKKVKK +  +EEE +D++
Sbjct: 181 EKKGKKKEAEEDEDFEKKEKKQEKGKKD--KEKGKGSDVVEDKKVKKEV-EKEEEKEDEN 240

Query: 324 GKEEKKKKKNEEKEKKKKEKGGEEEDDSKEEKK----KKKKKGDEGVEEDDSKEEKKKKK 383
            +E+KKKKK +EKE KKK+K GEEEDD  EEKK    KKK+K D+G EED  KEEKKKK 
Sbjct: 241 KEEKKKKKKKDEKENKKKDK-GEEEDDGNEEKKKKGEKKKEKKDKGGEEDGGKEEKKKKT 300

Query: 384 KKGEKEKEKKEEEVEEDDSKEEKKKKKKKGGEKEQEKKEKGE--DDIKEEKNKKKKKGED 443
           +  EKEK+KKE+   EDDSKEEKKKK    GEKE++KK+K E  D  KEE+ KKK + E 
Sbjct: 301 E--EKEKKKKEKG-GEDDSKEEKKKKT---GEKEKKKKDKEEEGDKSKEEEKKKKVEKEK 360

Query: 444 EDDDGEEEKKEKKEKKDEKKKEKGGKEKEKREKNKVEDEETEEKDEVDEDETKGKKEKKK 503
           E  D     K K E+ DE K+ KG K+K K      ++E+T  +++  + E K +K+K K
Sbjct: 361 EKKDKGVTMKGKDEENDEVKENKGEKKKGK------DEEDTANEEKKLKQEKKDEKKKDK 420

Query: 504 KEKEDETKSRSREIEIEIEIEVEEEAGEKGGCRGGGGEEEKDNEKKNKKDKEEKKKKVEE 548
            EKE E K + ++I +E E + +E+  EK       GE+EK+ +KK+KK  E+++K+ ++
Sbjct: 421 GEKEKEEKKKDKKI-VEDENKKDEKKQEK-------GEKEKEEKKKDKKIVEDEEKEDKD 480
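
The percentages in the HSP header above are the identity and positive counts divided by the alignment length. A minimal sketch of that arithmetic follows; the function name `hsp_percentages` is illustrative and not part of any BLAST tool.

```python
def hsp_percentages(identities, positives, alignment_length):
    """Return (percent identity, percent positives), rounded to 2 decimals.

    Illustrative helper: each count is divided by the alignment length,
    matching the 'n/length (pct%)' form used in the HSP header.
    """
    if alignment_length <= 0:
        raise ValueError("alignment_length must be positive")
    pct_identity = round(100 * identities / alignment_length, 2)
    pct_positives = round(100 * positives / alignment_length, 2)
    return pct_identity, pct_positives


# Values from the A0A0A0LG90_CUCSA hit: 261 identities and 340 positives
# over 522 aligned positions.
print(hsp_percentages(261, 340, 522))  # (50.0, 65.13)
```
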

BLAST of CmoCh14G011080 vs. TrEMBL
Match: A0A061F0X0_THECC (Uncharacterized protein isoform 2 OS=Theobroma cacao GN=TCM_025897 PE=4 SV=1)

HSP 1 Score: 74.3 bits (181), Expect = 5.3e-10
Identity = 213/487 (43.74%), Positives = 319/487 (65.50%), Query Frame = 1

Query: 97  EEKTKVEVESEVVKVDDEVEKEKHEVKIKSKEAKYEDGKKEKTEVEVQLKSSSGRKEKMK 156
           +EK K   E E      + EKEK E   K KE   ED   E  E+E   +  + +K K K
Sbjct: 91  DEKQKKREEGEEKPKKKDKEKEKKEKNHKDKEVVEED---EDEEIE---EKKNEKKHKDK 150

Query: 157 DVENEKEKKKEEKQKKKDEGKDEKEAKKKHKDED---GAEKETEVKKKKKEKEKEEEKKE 216
            VE E++++K+EK+KKKD+ + EK+ +KKHKD++   G ++ETE KKKKK   K+EE KE
Sbjct: 151 GVEVEEDEEKDEKKKKKDKEEKEKKKEKKHKDKEHEVGEDEETEDKKKKK---KDEEYKE 210

Query: 217 KKDEKKHKKKKEVEDDEECE-KEEKKQEKGKKDKEKEKGKGGDAVEDKKVKKGIGNEEEE 276
           KK EKK  K+ EVEDDEE E K+EKK+ K +K+++KEK    +  E +K       EE++
Sbjct: 211 KKKEKKQDKEDEVEDDEEKEEKKEKKKNKEEKERKKEKKLKDEEEEGEK-------EEKK 270

Query: 277 DDDDDGKEEKKKKKNEEKEKKKKEKGGEEEDDSKEEKKKKKKKGDEGVEEDDSKEEKKKK 336
             D  GKE+KK+KK +EKE++ +E   EE+++ K+EKKK+KK  ++ VEED++ E+++KK
Sbjct: 271 KKDKGGKEKKKEKKQKEKEEEVEEDEDEEKEEKKKEKKKEKKVKEDKVEEDENGEKEEKK 330

Query: 337 KKKGEKEKEKKEEEVEED--DSKEEKKKKKKKGGEKEQEKKEK------GEDDIKEEKNK 396
           K+K +++K+ KE+EVEE+  + KEEKKK KKK  E++++KKE+      GE    EEK +
Sbjct: 331 KEKKKEKKKDKEDEVEEEKVEEKEEKKKDKKKDKEEKEKKKERKLKDEEGESADVEEKEE 390

Query: 397 KKKKGEDEDDDGEEEKKEKKEKKDEKKKEKGGKEKEKREK--NKVEDEETEEKDEVDEDE 456
           KKKK   +D D +E+KKEKK K    + E+ G++K+K+EK   K + ++ EEK+   +DE
Sbjct: 391 KKKKETKKDKDDKEKKKEKKHKD---EDEEVGEKKKKKEKEEKKEKKKDKEEKETKHKDE 450

Query: 457 TKGKK----EKKKKEKED-------ETKSRSREIEIEIEIEVEEEAGEKGGCRGGGGEEE 516
            K KK    EK+KK+ ED       ET   SREIEI   ++ E+E+  +G      G+ +
Sbjct: 451 EKSKKKETDEKEKKKHEDGANDMKCETDITSREIEI---VDFEKESEGEGEEEKQKGKAK 510

Query: 517 KDNEKKNKKDKEEKKKKVEEKNRSRDVGKLKQKLEKMDVKINALLDKKADILRQIKEIED 559
           +  EK+ +KDK+ +K+K++ K++S+D+ KLKQ+LEK++ KI ALL+KKA+IL QIKE E 
Sbjct: 511 EGKEKEKEKDKKGEKRKLKGKDKSKDLSKLKQRLEKINSKIEALLEKKAEILSQIKEAEG 555

BLAST of CmoCh14G011080 vs. NCBI nr
Match: gi|778688964|ref|XP_011652875.1| (PREDICTED: DNA ligase 1-like [Cucumis sativus])

HSP 1 Score: 149.4 bits (376), Expect = 1.8e-32
Identity = 251/531 (47.27%), Positives = 313/531 (58.95%), Query Frame = 1

Query: 84  MADESVAIPIMGKEEKTKVEVESEVVKVDDEVEKEKHEVKIKS-------KEAKYEDGKK 143
           MAD SV+IP++GKEEKTKVE++ EVVKVD EVEKEK EVK+K+               KK
Sbjct: 1   MADGSVSIPVIGKEEKTKVELDWEVVKVDKEVEKEKLEVKMKNKEVKHEDD-------KK 60

Query: 144 EKTEVEVQLKSSS----------GRKEKMKDVENEKEKK-------------------KE 203
           EKT  ++Q KSSS           +KEK    ++EK+KK                   KE
Sbjct: 61  EKTAAKLQRKSSSVQKEKAKDIENKKEKSLKSDDEKDKKVKVKEDGDSKLEGKNKKEEKE 120

Query: 204 EKQKKKDEGKDEKEAKKKHKDEDGAEKETEVKKKKKEKEKEEEKKEKKDEKKHKKK---- 263
           EK K KDE K+EKE+KKKHKDEDGAEKETEV KKK   EK +EKKEKKDEKK KKK    
Sbjct: 121 EKHKNKDEAKEEKESKKKHKDEDGAEKETEVNKKK---EKNDEKKEKKDEKKPKKKDEKS 180

Query: 264 -------------KEVEDDEECEKEEKKQEKGKKDKEKEKGKGGDAVEDKKVKKGIGNEE 323
                        KE E+DE+ EK+EKKQEKGKKDKEK  GKG D VEDKKVKK +    
Sbjct: 181 GENDGVKEKKGKKKEAEEDEDFEKKEKKQEKGKKDKEK--GKGSDVVEDKKVKKEV---- 240

Query: 324 EEDDDDDGKEEKKKKKNEEKEKKKKEKGGEEEDDSKEEKKKKKKKGDEGVEEDDSKEEKK 383
                                    EK  E+ED++KEEKKKK+KK D+G +E   KEEKK
Sbjct: 241 -------------------------EKEEEKEDENKEEKKKKEKKKDKGEKE---KEEKK 300

Query: 384 KKKKKGEKEKEKKEEEVEEDDSKEEKKKKKKKGGEKEQEKKEKGEDDIKEEKNKKKKKGE 443
           K KK            + ED++K+++KK++K  GEKE           KEEK K KK  E
Sbjct: 301 KDKK------------IVEDENKKDEKKQEK--GEKE-----------KEEKKKDKKIVE 360

Query: 444 DEDDDGEEEKKEKKEKKDEKKKEKGGKEKEKREKNKVEDEETEEKDEVDEDETKGKKEKK 503
           DE    E+E K+K++ KDE + +KG KEK+K + N     +T+ +  V +   + K E+ 
Sbjct: 361 DE----EKEDKDKRKDKDEVEDKKGRKEKKKEKGN-----DTKTEASVTDTSREIKIEES 420

Query: 504 KKEKEDETKSRSREIEIEIEIEVEEEAGEKGGCRGGGGEEEKDNEKKNKKDKEEKKKKVE 562
           KK     T + SREI I+     E + G KG           ++EKKNKKDKEEK+ K E
Sbjct: 421 KKTDTSVTNT-SREIVIQ-----ESDKGPKG-----------EDEKKNKKDKEEKRMKGE 436

BLAST of CmoCh14G011080 vs. NCBI nr
Match: gi|700204604|gb|KGN59737.1| (hypothetical protein Csa_3G842100 [Cucumis sativus])

HSP 1 Score: 124.4 bits (311), Expect = 6.4e-25
Identity = 250/535 (46.73%), Positives = 329/535 (61.50%), Query Frame = 1

Query: 84  MADESVAIPIMGKEEKTKVEVESEVVKVDDEVEKEKHEVKIKS-------KEAKYEDGKK 143
           MAD SV+IP++GKEEKTKVE++ EVVKVD EVEKEK EVK+K+               KK
Sbjct: 1   MADGSVSIPVIGKEEKTKVELDWEVVKVDKEVEKEKLEVKMKNKEVKHEDD-------KK 60

Query: 144 EKTEVEVQLKSSS----------GRKEKMKDVENEKEKK-------------------KE 203
           EKT  ++Q KSSS           +KEK    ++EK+KK                   KE
Sbjct: 61  EKTAAKLQRKSSSVQKEKAKDIENKKEKSLKSDDEKDKKVKVKEDGDSKLEGKNKKEEKE 120

Query: 204 EKQKKKDEGKDEKEAKKKHKDEDGAEKETEVKKKKKEKEKEEEKKEKKDEKKHKKK---- 263
           EK K KDE K+EKE+KKKHKDEDGAEKETEV KKK   EK +EKKEKKDEKK KKK    
Sbjct: 121 EKHKNKDEAKEEKESKKKHKDEDGAEKETEVNKKK---EKNDEKKEKKDEKKPKKKDEKS 180

Query: 264 -------------KEVEDDEECEKEEKKQEKGKKDKEKEKGKGGDAVEDKKVKKGIGNEE 323
                        KE E+DE+ EK+EKKQEKGKKDKEK  GKG D VEDKKVKK +  EE
Sbjct: 181 GENDGVKEKKGKKKEAEEDEDFEKKEKKQEKGKKDKEK--GKGSDVVEDKKVKKEVEKEE 240

Query: 324 EEDDDDDGKEEKKKKKNEEKEKKKKEKGGEEEDDSKEEKKKKKKKGDEGVEEDDSKEEKK 383
           E++D++  +E+KKKKK +EKE KKK+KG EEEDD  EEKKKK                  
Sbjct: 241 EKEDENK-EEKKKKKKKDEKENKKKDKG-EEEDDGNEEKKKK------------------ 300

Query: 384 KKKKKGEKEKEKKEEEVEEDDSKEEKKKKKKKGGEKEQEKKEK-GEDDIKE-------EK 443
                GEK+KEKK++  EED  KEEKKKK +   EKE++KKEK GEDD KE       EK
Sbjct: 301 -----GEKKKEKKDKGGEEDGGKEEKKKKTE---EKEKKKKEKGGEDDSKEEKKKKTGEK 360

Query: 444 NKKKKKGEDEDDDGEEEKKEKKEKKDEKKKEKGGKEKEKREKN-KVEDEETEEKDEVDED 503
            KKKK  E+E D  +EE+K+KK +K+++KK+KG   K K E+N +V++ + E+K   DE+
Sbjct: 361 EKKKKDKEEEGDKSKEEEKKKKVEKEKEKKDKGVTMKGKDEENDEVKENKGEKKKGKDEE 420

Query: 504 ETKGKKEKKKKEKEDETKSRSREIEIE---IEIEVEEEAGEKGGCRGGGGEEEKDNEKKN 548
           +T  +++K K+EK+DE K    E E E    + ++ E+  +K   +   GE+EK+ +KK+
Sbjct: 421 DTANEEKKLKQEKKDEKKKDKGEKEKEEKKKDKKIVEDENKKDEKKQEKGEKEKEEKKKD 480

BLAST of CmoCh14G011080 vs. NCBI nr
Match: gi|590640828|ref|XP_007030061.1| (Uncharacterized protein isoform 2 [Theobroma cacao])

HSP 1 Score: 74.3 bits (181), Expect = 7.5e-10
Identity = 213/487 (43.74%), Positives = 319/487 (65.50%), Query Frame = 1

Query: 97  EEKTKVEVESEVVKVDDEVEKEKHEVKIKSKEAKYEDGKKEKTEVEVQLKSSSGRKEKMK 156
           +EK K   E E      + EKEK E   K KE   ED   E  E+E   +  + +K K K
Sbjct: 91  DEKQKKREEGEEKPKKKDKEKEKKEKNHKDKEVVEED---EDEEIE---EKKNEKKHKDK 150

Query: 157 DVENEKEKKKEEKQKKKDEGKDEKEAKKKHKDED---GAEKETEVKKKKKEKEKEEEKKE 216
            VE E++++K+EK+KKKD+ + EK+ +KKHKD++   G ++ETE KKKKK   K+EE KE
Sbjct: 151 GVEVEEDEEKDEKKKKKDKEEKEKKKEKKHKDKEHEVGEDEETEDKKKKK---KDEEYKE 210

Query: 217 KKDEKKHKKKKEVEDDEECE-KEEKKQEKGKKDKEKEKGKGGDAVEDKKVKKGIGNEEEE 276
           KK EKK  K+ EVEDDEE E K+EKK+ K +K+++KEK    +  E +K       EE++
Sbjct: 211 KKKEKKQDKEDEVEDDEEKEEKKEKKKNKEEKERKKEKKLKDEEEEGEK-------EEKK 270

Query: 277 DDDDDGKEEKKKKKNEEKEKKKKEKGGEEEDDSKEEKKKKKKKGDEGVEEDDSKEEKKKK 336
             D  GKE+KK+KK +EKE++ +E   EE+++ K+EKKK+KK  ++ VEED++ E+++KK
Sbjct: 271 KKDKGGKEKKKEKKQKEKEEEVEEDEDEEKEEKKKEKKKEKKVKEDKVEEDENGEKEEKK 330

Query: 337 KKKGEKEKEKKEEEVEED--DSKEEKKKKKKKGGEKEQEKKEK------GEDDIKEEKNK 396
           K+K +++K+ KE+EVEE+  + KEEKKK KKK  E++++KKE+      GE    EEK +
Sbjct: 331 KEKKKEKKKDKEDEVEEEKVEEKEEKKKDKKKDKEEKEKKKERKLKDEEGESADVEEKEE 390

Query: 397 KKKKGEDEDDDGEEEKKEKKEKKDEKKKEKGGKEKEKREK--NKVEDEETEEKDEVDEDE 456
           KKKK   +D D +E+KKEKK K    + E+ G++K+K+EK   K + ++ EEK+   +DE
Sbjct: 391 KKKKETKKDKDDKEKKKEKKHKD---EDEEVGEKKKKKEKEEKKEKKKDKEEKETKHKDE 450

Query: 457 TKGKK----EKKKKEKED-------ETKSRSREIEIEIEIEVEEEAGEKGGCRGGGGEEE 516
            K KK    EK+KK+ ED       ET   SREIEI   ++ E+E+  +G      G+ +
Sbjct: 451 EKSKKKETDEKEKKKHEDGANDMKCETDITSREIEI---VDFEKESEGEGEEEKQKGKAK 510

Query: 517 KDNEKKNKKDKEEKKKKVEEKNRSRDVGKLKQKLEKMDVKINALLDKKADILRQIKEIED 559
           +  EK+ +KDK+ +K+K++ K++S+D+ KLKQ+LEK++ KI ALL+KKA+IL QIKE E 
Sbjct: 511 EGKEKEKEKDKKGEKRKLKGKDKSKDLSKLKQRLEKINSKIEALLEKKAEILSQIKEAEG 555

The following BLAST results are available for this feature:

vs. TrEMBL
Match Name        E-value  Identity (%)  Description
A0A0A0LG90_CUCSA  3.0e-29  50.00         Uncharacterized protein OS=Cucumis sativus GN=Csa_3G842100 PE=4 SV=1
A0A061F0X0_THECC  5.3e-10  43.74         Uncharacterized protein isoform 2 OS=Theobroma cacao GN=TCM_025897 PE=4 SV=1

vs. NCBI nr
Match Name                        E-value  Identity (%)  Description
gi|778688964|ref|XP_011652875.1|  1.8e-32  47.27         PREDICTED: DNA ligase 1-like [Cucumis sativus]
gi|700204604|gb|KGN59737.1|       6.4e-25  46.73         hypothetical protein Csa_3G842100 [Cucumis sativus]
gi|590640828|ref|XP_007030061.1|  7.5e-10  43.74         Uncharacterized protein isoform 2 [Theobroma cacao]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh14G011080.1  CmoCh14G011080.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term  IPR Description   Source   Source Term  Source Description  Alignment
None      No IPR available  unknown  Coil         Coil                coord: 492..512, score: -
                                                                      coord: 101..144, score: -
                                                                      coord: 148..186, score: -
                                                                      coord: 326..362, score: -
                                                                      coord: 364..384, score: -
                                                                      coord: 191..243, score: -
                                                                      coord: 518..552, score: -
                                                                      coord: 268..303, score: -
                                                                      coord: 393..444, scor