CmoCh01G014140 (gene) Cucurbita moschata (Rifu)

Name: CmoCh01G014140
Type: gene
Organism: Cucurbita moschata (Rifu)
Description: Zinc knuckle family protein
Location: Cmo_Chr01 : 11113412 .. 11117467 (-)
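
The location field packs the chromosome, start, end, and strand into one string. As a minimal sketch, assuming the coordinates are 1-based and inclusive (as in GFF3), the span can be computed as follows. The function name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Parse a string like 'Cmo_Chr01 : 11113412 .. 11117467 (-)'."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Cmo_Chr01 : 11113412 .. 11117467 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that assumption the feature spans 4056 bp on the minus strand of Cmo_Chr01.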
The following sequences are available for this feature:

Gene sequence (with intron)

TGGTTATTTAAATTCGTTTACAATACAAATTATTGCTCATTGGTGAATTGAATAAGTAATTGCTCAATCTGCATTTCGGCCCAATTGGTATGACTGGGCCTTTGGCCTATGGGCTTTGAATAAGAAGTTTAATATTTGAGGCCCAATTAGGTTGTAAATGGGCCGTAGGCCCATGGGTTAGAGAACTGGAAAAGCAAGGCCGAGCAATAGCTAAGGGCAGAGATTACCGAACTGTCGAGGGTCGTCGGACGAGAAAGCGGGGAATTTGCAGCGGCGACGACGACGACGACGGAGAATGGATGGTGAAGAAGGAGGGATACGGCTAAGCAAGAGGTTCTCCGACAAATCTGGATCCGGTGAAGTTGATTACAAAACCAAGGCTGGCACCGCTTGGAGCCATTCTTATCTCAACCAGAAGCCCTGGCATCCTCTCTCGTACCCTAATCAACGCCGGAAATGGATCGCCGAGCAGACTCACTCTCAGCGAGAAAAGCGCGCCGAGGAAGTTGCTCGCGAGGTCAATTATACGCTTCAGTTCTGTTGTTTCGCACTCGCTAGATCTTTCTTCTCAATGTTTTTTTTTTTTTTTTTCCTGTTTGCAGTATGCTCAAGAGCAGGAGTTCTTTCGCCAGACTGCTCTTGTCTCCAAGAAAGAGAAGGAAAAGGTCTTACCTAGTTTTTTTTTTCGCTTACTGATGTTTACACCTTGATTGAAGCAAATTTCTTATTTAGCACTTTCGTGAAGCTGTCTAGGTTAACGATACATAGCAATCTTATAGTTTTAGTGATTGGTTAATGTTTGTGCCTTATCTCATCATCATCTTTTCATTGAGTTCTGTTTCATTCTTTTTATTTCTGGTTGCTTATGTTGCAAATTGGAGAAGGTGTTCTATGGCTATTGGCTAGGATTTTGTGAGAGTGGCTCTCTCATCTTGTAAATTTTTTACGTCTATGTTTTCTGAAAAAAAGTCCTGCTGCTGACAGTCGCCATGCCGAACTCAATCTTAATGATGATGACATAGCTGAAATTGTTTCTCCAACGTTACTCTTCCATCCGGACGAAAGATCCTGTAGTTTCTTTACGAGAATAGATAGTGGACTGTAGTTTTCCTTCCATTTGTTATTTAACGAGTTGCTATTAATTGTTTTTCTGCTTGTTGTTTATGGAGAATTTTGGCCATTGTTGCTGAAATCTGTGTTTGGTGGTTAGTTGGAGATGATGAAAGCGGTTAGTTTTATGTACGTACGACCACCTGGTTACAATGCTGAAAGTGCAAAAGCTGCAGAGATCGCTGATGATAGGAAGAATCAAGAGGGTGATAGCTCCTCTCAGAATCTGCCCAAGGATGGCTCTGTGAATGAAAGGTACTTCATCTTTATCTTGCACAATTATGTTCTTATTGTTCAGATGGAAATGCGAAGGAATGTTCAATGATGAGGATGATTTGAAGTTTTTTACCCTTTAAAGGCAGTAAAATTTCTCTTCTTCTTGTAGGCCACCAGAATCGTCAGGCACAGTCGGTAGGGACCATGGCGACAAGAAGAAACCAAGGCCAAAAGATGTTTTCGGGCGTGCTTTGCCCACCGAAGAAGAATTTGAAGTCTTAAAAAATGCCCCTCGGTAAGTTGCTAGCATGCTGAGATTGAATGAATATGAGTACCACTTTGATTTTGTATGTTGCAAACTCTAGCAGTTAGCATATAGGGAATAAGTGACTATGGCTCATTAATTTTGCTCAAGGTGGAGAAAATGCTATTAATTATCTATCACCCTTTGACTTAATAGACTTCAAGAACATCTCTAGTTAAAACTTCAGTCGATTTTATGCACCACCAATTTCTTGATGTTATGATTACATGGATTCGTTGTCCCTCTGGATCGATGTCTGCTTGTCCTTTTGTCCACTTATACTTCAAAACATATTGGAGGGTCCTCATCCATAAAGAATCTGATAATGCATACCATATGCAAAGAGTTTAAAAGGTAGTATACGAAATGA
GCTGCCAAATAGCGAATAAACAATCTCGTACTGTGCGGTATTATTTGGATAATTTCACAGTTAATTTTGGGTTATGTAATCTAAATGTTACTCATATTTTGTTTGAGATTTATGGTTTCTTTCCGTCTGTTTCCAGGATGGACACAGGTGTTTTTGCGAGAGTAAAACCATTTGGGGTAGAAGTACGCAATGTGAAATGTGTTAGATGTGGGATCTTTGGCCATCAAAGTGGTGATCGTGAATGTCCGTTGAAGGATGCTATAATGCCAAATGAAGAAAATCGATTGAAAAGAGATGACCCTTTATCTACAATACTTGCGCATGCAGAGAGCAGTGAGGCTAGTATTTTGATTTATTCTTCGTTTGAAACTTACATTGCAGTATTACCTTGTAACAGCCTAAGTCCACCGCTAGTAGATATCGTTTTTTTTGGACTTTCCCTTTCAAGCTTCCCCTCAAGGTTTTGTTTAAACGTGTCTGCTATGGAGAGGTTTCTGCACCCTTATAAAGAATGCCTCGTTCTCGTCTCCAATTGAACAGCCCAAATCCACCGCTAGTAGATATTGTCCTTCGGGCTTCCCCCAAAGGTTTTTAAAACACGTCTGCTAGGGAGATGTTTCCATACCTTTATAAAGAATGTTTAGTTCTCCTTTCCAATCGACGTGGGATCTCACATACCTAGTCTGATAAGTGCTTTAGTGTGTATACTGTTTTCAATATTCTGACATTAATGTGCGTCATTCTTTACTCCATGAACTGTTGGTGACTTAGCTGTTGCTCTACCATTGATTGGTAACCCGAATCGAGCGTGTTAAATATTTGTTCTCATGTGTTTCCTTTTCCATGTTATCTACAGCCTCTGAAGTGGGAGTTGAAGCAGAAACCAGGAATCAGTCCACCCCGTGGAGGTTTTAATCCCGACGACCCGAACCAGCAAATAGTTGCCGAGGACATATTTGATGAGTACGGAGGTCAATGCTTTTTCCACCACGACAATCTTCCGATTCTGATTTCTCTCTTTTAGCTGGTAACTGATTATTTCCCATCTATAATCTGTAGGCTTTCTCAGCAGTGGCGGTATTGTCCCTGAATTGCTGTCCAATTTTTCAAGCAAACCCAAGAAAAAGAAGTCTTCAAAAGAAAGTTCACGGAAAAAAGCAAAGAATCAAGAAGACGATGAGAGAATATCAAAGAAGAAAAGTAAATCTAAAAGAAAGAAACAAATCAGTAGTGAATCAAGTCCAGAAACCTCAGAGTCTGATAGGCGAAATAGAAGGAACAAGCACAAGACTTCCTATTCTTCTGATGATTTCGACTCGAAAACGCAACTTGGGACTAAAAAGCATAGAAGGAAACATCTAAATACTTCTGATGTTTCTAACTTTGAGCGGCCTCATATTACTGATCGATATTGCCCCGAACGAAGCCCTTCTGATATTTCAGACTCTGGTAGGCAGGATGAACGGAGGAAAGGCAAGGGTTCTGAATCAGAACACAGATCAGGCAGAAAAAACCAATCATACTCTTCTGAAGAGTCCGAATTCGAGACGCATCGACGAAGTGATAAAAGCAGATCCAAACGCTCCTTGTCGAAAGATCATCGCCCAGATAGTCATCATTCAAGTAGCAAGATGAGACGTAGGAGGTCTTATTCGTCTGATGATTCAGAAACGAACAGGCGTCGTAAAAGTAAAGGCCAAAAGCACAATTACTCATCAGATGATTCTGAGCAAGAGAAACATGCTAGAGATAAGAAGAGAAGATGTTAGTGTATTCGTCATCAAAATTTGACGTTAAAACGATCGTTGGAGCATAGTATTTCGAAGTAAAACGACATTGACGATAGCTGGTAATGCCTGATATCTATCACTTAAGATGAGATGTTCTATTTGATCATTACACTTGGGAATATGATTATAAAAAAAAATACCATTGGTGTTTTGAATGCCTCCTAATCGAAAATTATACACAAGTAATTATTTGATTGAGATATGATAAAT
ATTGAGTTACTTTAAATGTCAAATTGATTCATTGATTGAGGAACACCATATTGCTG

mRNA sequence

TGGTTATTTAAATTCGTTTACAATACAAATTATTGCTCATTGGTGAATTGAATAAGTAATTGCTCAATCTGCATTTCGGCCCAATTGGTATGACTGGGCCTTTGGCCTATGGGCTTTGAATAAGAAGTTTAATATTTGAGGCCCAATTAGGTTGTAAATGGGCCGTAGGCCCATGGGTTAGAGAACTGGAAAAGCAAGGCCGAGCAATAGCTAAGGGCAGAGATTACCGAACTGTCGAGGGTCGTCGGACGAGAAAGCGGGGAATTTGCAGCGGCGACGACGACGACGACGGAGAATGGATGGTGAAGAAGGAGGGATACGGCTAAGCAAGAGGTTCTCCGACAAATCTGGATCCGGTGAAGTTGATTACAAAACCAAGGCTGGCACCGCTTGGAGCCATTCTTATCTCAACCAGAAGCCCTGGCATCCTCTCTCGTACCCTAATCAACGCCGGAAATGGATCGCCGAGCAGACTCACTCTCAGCGAGAAAAGCGCGCCGAGGAAGTTGCTCGCGAGTATGCTCAAGAGCAGGAGTTCTTTCGCCAGACTGCTCTTGTCTCCAAGAAAGAGAAGGAAAAGTTGGAGATGATGAAAGCGGTTAGTTTTATGTACGTACGACCACCTGGTTACAATGCTGAAAGTGCAAAAGCTGCAGAGATCGCTGATGATAGGAAGAATCAAGAGGGTGATAGCTCCTCTCAGAATCTGCCCAAGGATGGCTCTGTGAATGAAAGGCCACCAGAATCGTCAGGCACAGTCGGTAGGGACCATGGCGACAAGAAGAAACCAAGGCCAAAAGATGTTTTCGGGCGTGCTTTGCCCACCGAAGAAGAATTTGAAGTCTTAAAAAATGCCCCTCGGATGGACACAGGTGTTTTTGCGAGAGTAAAACCATTTGGGGTAGAAGTACGCAATGTGAAATGTGTTAGATGTGGGATCTTTGGCCATCAAAGTGGTGATCGTGAATGTCCGTTGAAGGATGCTATAATGCCAAATGAAGAAAATCGATTGAAAAGAGATGACCCTTTATCTACAATACTTGCGCATGCAGAGAGCAGTGAGCCTCTGAAGTGGGAGTTGAAGCAGAAACCAGGAATCAGTCCACCCCGTGGAGGTTTTAATCCCGACGACCCGAACCAGCAAATAGTTGCCGAGGACATATTTGATGAGTACGGAGGCTTTCTCAGCAGTGGCGGTATTGTCCCTGAATTGCTGTCCAATTTTTCAAGCAAACCCAAGAAAAAGAAGTCTTCAAAAGAAAGTTCACGGAAAAAAGCAAAGAATCAAGAAGACGATGAGAGAATATCAAAGAAGAAAAGTAAATCTAAAAGAAAGAAACAAATCAGTAGTGAATCAAGTCCAGAAACCTCAGAGTCTGATAGGCGAAATAGAAGGAACAAGCACAAGACTTCCTATTCTTCTGATGATTTCGACTCGAAAACGCAACTTGGGACTAAAAAGCATAGAAGGAAACATCTAAATACTTCTGATGTTTCTAACTTTGAGCGGCCTCATATTACTGATCGATATTGCCCCGAACGAAGCCCTTCTGATATTTCAGACTCTGGTAGGCAGGATGAACGGAGGAAAGGCAAGGGTTCTGAATCAGAACACAGATCAGGCAGAAAAAACCAATCATACTCTTCTGAAGAGTCCGAATTCGAGACGCATCGACGAAGTGATAAAAGCAGATCCAAACGCTCCTTGTCGAAAGATCATCGCCCAGATAGTCATCATTCAAGTAGCAAGATGAGACGTAGGAGGTCTTATTCGTCTGATGATTCAGAAACGAACAGGCGTCGTAAAAGTAAAGGCCAAAAGCACAATTACTCATCAGATGATTCTGAGCAAGAGAAACATGCTAGAGATAAGAAGAGAAGATGTTAGTGTATTCGTCATCAAAATTTGACGTTAAAACGATCGTTGGAGCATAGTATTTCGAAGTAAAACGACATTGACGATAGCTGGTAATGCCTGATATCTATCACTTAAGATGAGA
TGTTCTATTTGATCATTACACTTGGGAATATGATTATAAAAAAAAATACCATTGGTGTTTTGAATGCCTCCTAATCGAAAATTATACACAAGTAATTATTTGATTGAGATATGATAAATATTGAGTTACTTTAAATGTCAAATTGATTCATTGATTGAGGAACACCATATTGCTG

Coding sequence (CDS)

ATGGATGGTGAAGAAGGAGGGATACGGCTAAGCAAGAGGTTCTCCGACAAATCTGGATCCGGTGAAGTTGATTACAAAACCAAGGCTGGCACCGCTTGGAGCCATTCTTATCTCAACCAGAAGCCCTGGCATCCTCTCTCGTACCCTAATCAACGCCGGAAATGGATCGCCGAGCAGACTCACTCTCAGCGAGAAAAGCGCGCCGAGGAAGTTGCTCGCGAGTATGCTCAAGAGCAGGAGTTCTTTCGCCAGACTGCTCTTGTCTCCAAGAAAGAGAAGGAAAAGTTGGAGATGATGAAAGCGGTTAGTTTTATGTACGTACGACCACCTGGTTACAATGCTGAAAGTGCAAAAGCTGCAGAGATCGCTGATGATAGGAAGAATCAAGAGGGTGATAGCTCCTCTCAGAATCTGCCCAAGGATGGCTCTGTGAATGAAAGGCCACCAGAATCGTCAGGCACAGTCGGTAGGGACCATGGCGACAAGAAGAAACCAAGGCCAAAAGATGTTTTCGGGCGTGCTTTGCCCACCGAAGAAGAATTTGAAGTCTTAAAAAATGCCCCTCGGATGGACACAGGTGTTTTTGCGAGAGTAAAACCATTTGGGGTAGAAGTACGCAATGTGAAATGTGTTAGATGTGGGATCTTTGGCCATCAAAGTGGTGATCGTGAATGTCCGTTGAAGGATGCTATAATGCCAAATGAAGAAAATCGATTGAAAAGAGATGACCCTTTATCTACAATACTTGCGCATGCAGAGAGCAGTGAGCCTCTGAAGTGGGAGTTGAAGCAGAAACCAGGAATCAGTCCACCCCGTGGAGGTTTTAATCCCGACGACCCGAACCAGCAAATAGTTGCCGAGGACATATTTGATGAGTACGGAGGCTTTCTCAGCAGTGGCGGTATTGTCCCTGAATTGCTGTCCAATTTTTCAAGCAAACCCAAGAAAAAGAAGTCTTCAAAAGAAAGTTCACGGAAAAAAGCAAAGAATCAAGAAGACGATGAGAGAATATCAAAGAAGAAAAGTAAATCTAAAAGAAAGAAACAAATCAGTAGTGAATCAAGTCCAGAAACCTCAGAGTCTGATAGGCGAAATAGAAGGAACAAGCACAAGACTTCCTATTCTTCTGATGATTTCGACTCGAAAACGCAACTTGGGACTAAAAAGCATAGAAGGAAACATCTAAATACTTCTGATGTTTCTAACTTTGAGCGGCCTCATATTACTGATCGATATTGCCCCGAACGAAGCCCTTCTGATATTTCAGACTCTGGTAGGCAGGATGAACGGAGGAAAGGCAAGGGTTCTGAATCAGAACACAGATCAGGCAGAAAAAACCAATCATACTCTTCTGAAGAGTCCGAATTCGAGACGCATCGACGAAGTGATAAAAGCAGATCCAAACGCTCCTTGTCGAAAGATCATCGCCCAGATAGTCATCATTCAAGTAGCAAGATGAGACGTAGGAGGTCTTATTCGTCTGATGATTCAGAAACGAACAGGCGTCGTAAAAGTAAAGGCCAAAAGCACAATTACTCATCAGATGATTCTGAGCAAGAGAAACATGCTAGAGATAAGAAGAGAAGATGTTAG
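
The coding sequence can be checked against the BLAST query by translating it. The sketch below assumes the standard genetic code (NCBI translation table 1) and translates only the first 75 nucleotides of the CDS above; the helper `translate` is a hypothetical name introduced for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 75 nt of the CDS listed above.
cds_start = ("ATGGATGGTGAAGAAGGAGGGATACGGCTAAGCAAGAGGTTCTCC"
             "GACAAATCTGGATCCGGTGAAGTTGATTAC")
print(translate(cds_start))
```

The result, MDGEEGGIRLSKRFSDKSGSGEVDY, matches the start of the query sequence shown in the BLAST alignments.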
BLAST of CmoCh01G014140 vs. Swiss-Prot
Match: Y4919_ARATH (Uncharacterized zinc finger CCHC domain-containing protein At4g19190 OS=Arabidopsis thaliana GN=At4g19190 PE=2 SV=1)

HSP 1 Score: 449.9 bits (1156), Expect = 3.9e-125
Identity = 302/548 (55.11%), Positives = 361/548 (65.88%), Query Frame = 1

Query: 2   DGEEGGIRLSKRFSD---KSGSGEVDYKTKAGTAWSHSYLNQKPWHPLSYPNQRRKWIAE 61
           +GE  GIRLSKRF+      GS EVDYKTK+GTAWSHS+LNQKPWHPLSYPNQRRKWIAE
Sbjct: 4   EGEGSGIRLSKRFAGGKVTGGSLEVDYKTKSGTAWSHSFLNQKPWHPLSYPNQRRKWIAE 63

Query: 62  QTHSQREKRAEEVAREYAQEQEFFRQTALVSKKEKEKLEMMKAVSFMYVRPPGYNAESAK 121
           QTH+Q ++RAEEVARE+AQEQEFF+Q AL+SKKE+EK+E MKAVSFMYVRPPGY+ ESAK
Sbjct: 64  QTHAQHDRRAEEVAREFAQEQEFFKQAALISKKEREKIETMKAVSFMYVRPPGYDPESAK 123

Query: 122 AAEIADDRKNQEGDSSSQNLPKDGSVNERPPESSGTVGRDHGDKKKPRPKDVFGRALPTE 181
           AAE  D++   +G SS+Q+   D +V  RP ES G  G    ++KKPRPKDVFGRALPTE
Sbjct: 124 AAEYKDEKHKGQG-SSTQDPVADDNVGSRPEESQGG-GERTQERKKPRPKDVFGRALPTE 183

Query: 182 EEFEVLKNAPRMDTGVFARVKPFGVEVRNVKCVRCGIFGHQSGDRECPLKDAIMPNEENR 241
           EEFEVLKNAPRM+TG+  RVKPF VEVRNVKC+RCG FGHQSGDR+CPLKDA+MPNEE R
Sbjct: 184 EEFEVLKNAPRMETGIPGRVKPFAVEVRNVKCLRCGNFGHQSGDRDCPLKDAVMPNEELR 243

Query: 242 LKRDDPLSTILAHAESSEPLKWELKQKPGISPPRGGFNPDDPNQQIVAEDIFDEYGGFLS 301
           LKRDDPL+ I+AH + SEPLKWELKQKPG+SPPRGGF+PDDPNQQIVAEDIFDEYGGFL 
Sbjct: 244 LKRDDPLTAIIAHTDPSEPLKWELKQKPGLSPPRGGFDPDDPNQQIVAEDIFDEYGGFL- 303

Query: 302 SGGIVPELLSNFSSKPKKKKSSKESSRKKAKN---QEDDERIS-------------KKKS 361
            G I  E+L + SS  KK+KS K    KK  +   +E DE  +             +KK 
Sbjct: 304 EGSIPIEILKSMSS-DKKRKSKKNKRHKKHSSRTVEETDESSTGSEDSREKRGSKKRKKL 363

Query: 362 KSKRKKQISSES-SPETSESD--RRNRRNKHKTSYSSDDFDSKTQLGTKKHRRKHLNTSD 421
           K K KKQ  S+S S E S SD  R +RR   K    S    S+       HR KH    D
Sbjct: 364 KKKSKKQYDSDSLSFEGSGSDSYRLSRRRHTKHVDPSASLKSEVYHQGNSHREKHY--YD 423

Query: 422 VSNFERPHITDRYCPERSPSDISDSGRQDERRKGKGSESEHRSGRKNQSYSSEESEFETH 481
             + +R  I DR       SD   S    ++R     +S HR  ++  S      + +  
Sbjct: 424 EKHQKRKEIVDRPSASSDDSDYYRSNSSRKKRSEDDYKSHHRERKQVHSNDPVSEKSQKQ 483

Query: 482 RRSDKSRSKRSLSKDHRPDSHHSSSKMRRRRSYSSDDSETNRRRKSKGQKHNYSSDDSEQ 528
             S+  + +R + K+HR D         RR  Y   +SE NR R  K  ++    DD + 
Sbjct: 484 HYSESGKIQR-VEKEHRYD--------ERRHRYVDMESE-NRNRSEKKPRY----DDRDS 531
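
The percentages in each HSP summary line are the match counts divided by the alignment length. A small sketch, assuming the summary line keeps the `label = n/m (p%)` layout seen in this record, recomputes them; `check_hsp_stats` is a hypothetical helper, not part of BLAST.

```python
import re

def check_hsp_stats(line):
    """Recompute each 'label = n/m (p%)' percentage found in an HSP summary line."""
    results = {}
    for label, num, den, pct in re.findall(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)", line):
        computed = round(100 * int(num) / int(den), 2)
        # Store the recomputed value and whether it agrees with the reported one.
        results[label] = (computed, abs(computed - float(pct)) < 0.005)
    return results

stats = check_hsp_stats(
    "Identity = 302/548 (55.11%), Positives = 361/548 (65.88%), Query Frame = 1"
)
for label, (value, ok) in stats.items():
    print(label, value, ok)
```

For the Swiss-Prot hit above, 302/548 gives 55.11% identity and 361/548 gives 65.88% positives, in agreement with the reported values.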

BLAST of CmoCh01G014140 vs. TrEMBL
Match: A0A0A0KYH9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G358660 PE=4 SV=1)

HSP 1 Score: 666.8 bits (1719), Expect = 2.2e-188
Identity = 398/538 (73.98%), Positives = 427/538 (79.37%), Query Frame = 1

Query: 1   MDGEEGG-IRLSKRFSDKSGSGEVDYKTKAGTAWSHSYLNQKPWHPLSYPNQRRKWIAEQ 60
           MDGEEGG IRLSKRFSDK+GSGEVDYKTKAGTAWSHSYLNQKPWHPLSYPNQRRKWIAEQ
Sbjct: 1   MDGEEGGGIRLSKRFSDKAGSGEVDYKTKAGTAWSHSYLNQKPWHPLSYPNQRRKWIAEQ 60

Query: 61  THSQREKRAEEVAREYAQEQEFFRQTALVSKKEKEKLEMMKAVSFMYVRPPGYNAESAKA 120
           THSQREKR EEVAREYAQEQEFFRQTALVSKKEKEKLEMMKAVSFMYVRPPGYNAESAKA
Sbjct: 61  THSQREKRTEEVAREYAQEQEFFRQTALVSKKEKEKLEMMKAVSFMYVRPPGYNAESAKA 120

Query: 121 AEIADDRKNQEGDSSSQNLPKDGSVNERPPESSGTVGRDHGDKKKPRPKDVFGRALPTEE 180
           AEIADDRK QEGD+ SQ+LPKD S N RPPESS TVGR+ G  KKPRPKDVFGRALPTEE
Sbjct: 121 AEIADDRKKQEGDNPSQDLPKDSSGNARPPESSSTVGREPG--KKPRPKDVFGRALPTEE 180

Query: 181 EFEVLKNAPRMDTGVFARVKPFGVEVRNVKCVRCGIFGHQSGDRECPLKDAIMPNEENRL 240
           EFEVLKNAPRMDTGVFARVKPFGVEVRNVKCVRCGIFGHQSGDRECPLKDAIMPNEE+RL
Sbjct: 181 EFEVLKNAPRMDTGVFARVKPFGVEVRNVKCVRCGIFGHQSGDRECPLKDAIMPNEESRL 240

Query: 241 KRDDPLSTILAHAESSEPLKWELKQKPGISPPRGGFNPDDPNQQIVAEDIFDEYGGFLSS 300
           KRDDPL+TILA+AE+SEPLKWELKQKPGISPPRGGFNPDDPNQQIVAEDIFDEYGGFLS 
Sbjct: 241 KRDDPLTTILANAETSEPLKWELKQKPGISPPRGGFNPDDPNQQIVAEDIFDEYGGFLSC 300

Query: 301 GGIVPELLSNFSSKPKKKKSSKESSRK--------KAKNQEDDERISKKKSKSKRKKQIS 360
           GGIVPELLSNFSSKPKK K S++SSRK        K K+ EDDERISKKK KSKRKKQ++
Sbjct: 301 GGIVPELLSNFSSKPKKNKFSRQSSRKKLQSSSSRKEKDLEDDERISKKKHKSKRKKQVN 360

Query: 361 SESSPETSESDRRNRRNKHKTSYSSDDFDSKTQLGTKKHRRKHLNTSDVSNFERPHITDR 420
            ESS ETSESDRR+RRNKHK S+ SDD DSK    TKKHRRKHLNTSDVS     + +DR
Sbjct: 361 GESSSETSESDRRDRRNKHKISHLSDDSDSKMHHKTKKHRRKHLNTSDVSETSESN-SDR 420

Query: 421 YCPERSPSDISDSGRQDERRKGKGSESEHRSGRKNQSYSSEESEFETHRRSDKSRSKRSL 480
              +   S +SD       R+ K    +HR  R N   +S+ S FE    +D+   + S 
Sbjct: 421 RRNKHKISYLSDDSASKTHRRSK----KHRRKRLN---TSDVSNFERSHITDRYCPEPSP 480

Query: 481 SKDHRPDSHHSSSKMRRRRSYSSDDSETNRRRKSKGQKHNYSSDDSEQEKHARDKKRR 530
           S     + H    K R    Y   DSE   R   K Q H  SS+DS  E+H    K R
Sbjct: 481 SDISDSERHDKGRKHRDFYPYKKLDSELEHRSGRKDQSH--SSEDSGFERHPISDKTR 526

BLAST of CmoCh01G014140 vs. TrEMBL
Match: M5XSJ2_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003979mg PE=4 SV=1)

HSP 1 Score: 510.0 bits (1312), Expect = 3.5e-141
Identity = 325/558 (58.24%), Positives = 382/558 (68.46%), Query Frame = 1

Query: 1   MDGEEGGIRLSKRFSDKSGSGEVDYKTKAGTAWSHSYLNQKPWHPLSYPNQRRKWIAEQT 60
           M+    GIRLSKRFSDK G GEVDYKTK+GTAWSHSYLNQKPWHPLSYPNQRRKWIAEQ 
Sbjct: 1   MENGPRGIRLSKRFSDKGG-GEVDYKTKSGTAWSHSYLNQKPWHPLSYPNQRRKWIAEQI 60

Query: 61  HSQREKRAEEVAREYAQEQEFFRQTALVSKKEKEKLEMMKAVSFMYVRPPGYNAESAKAA 120
            +Q ++R EEV+REYAQEQEFFRQ ALVSKK+KEK+EMMKAVSFMYVRPPGYNAESAKAA
Sbjct: 61  QAQHQRRTEEVSREYAQEQEFFRQAALVSKKDKEKIEMMKAVSFMYVRPPGYNAESAKAA 120

Query: 121 EIADDRKNQEGDSSSQNLPKDGSVNERPPESSGTVGRDHGDKKKPRPKDVFGRALPTEEE 180
           EIAD                     E PPE     G +    KKPR KDVFGR LPTE+E
Sbjct: 121 EIAD---------------------ESPPECMPPSGEE---AKKPRLKDVFGRPLPTEQE 180

Query: 181 FEVLKNAPRMDTGVFARVKPFGVEVRNVKCVRCGIFGHQSGDRECPLKDAIMPNEENRLK 240
           FE+LKNAPRM+TGV  R KPFGVEVRNVKCVRCG FGHQSGDRECPLKDAIMPNEE RLK
Sbjct: 181 FEILKNAPRMETGVPTRAKPFGVEVRNVKCVRCGAFGHQSGDRECPLKDAIMPNEEGRLK 240

Query: 241 RDDPLSTILAHAESSEPLKWELKQKPGISPPRGGFNPDDPNQQIVAEDIFDEYGGFLSSG 300
           RDDPL+ ILAH + SEPLKWELKQKPGISPPRGGF PDDPNQQIVAEDIFDEYGGFL SG
Sbjct: 241 RDDPLTAILAHTDPSEPLKWELKQKPGISPPRGGFKPDDPNQQIVAEDIFDEYGGFL-SG 300

Query: 301 GIVPELLSNFSSKPKKKKSSKESSRKK-----------AKNQEDDERISKKKSKSKRKKQ 360
            ++PELL+NFSS+P+ K   K   +KK           + + EDD+R  KK  K  +KK 
Sbjct: 301 DVIPELLTNFSSQPRDKSKKKTKHKKKQSSPMSGEDGLSSSYEDDKRSKKKICKVNKKKH 360

Query: 361 ISSESSP-ETSESDRRNRRNKHKTSYSSDDFDSKTQLGTKKHRRKHLNTSDVS----NFE 420
             SESSP E  E DR   +++ K SYS ++ +++    + K R+KH ++ + S    +++
Sbjct: 361 GHSESSPSEILEFDRHKVKSRDKHSYSFENSNNEKHQRSTKKRQKHSHSYEDSEICRHYK 420

Query: 421 RPHITDRYCPERSPSDISDSGRQDERRKGKGS--------ESEHRSG--RKNQSYSSEES 480
           R + +DR C         D   +D +R+ K S        E  HRS   R+  S+S E+S
Sbjct: 421 RDNSSDR-CTNSFEDSEPDKHHRDVQRRRKHSFTVGDSYHEKHHRSRKVRQKHSHSYEDS 480

Query: 481 EFETHRRSDKSR-SKRSLSKDHRPDSHHSSSKMRRRRSYSSDDSETNR--RRKSKGQKHN 530
           + + H R DKSR S  + S+D  PD H  S K R R SYSS+DS TNR  R K    +H+
Sbjct: 481 KIDRHCRRDKSRESLNNSSEDSEPDRHLRSIKSRHRHSYSSEDSGTNRQNRNKKSRDRHS 531

BLAST of CmoCh01G014140 vs. TrEMBL
Match: A0A061GE82_THECC (Zinc knuckle family protein isoform 1 OS=Theobroma cacao GN=TCM_016691 PE=4 SV=1)

HSP 1 Score: 509.2 bits (1310), Expect = 6.0e-141
Identity = 326/560 (58.21%), Positives = 384/560 (68.57%), Query Frame = 1

Query: 1   MDG---EEGGIRLSKRFSDKS--GSGEVDYKTKAGTAWSHSYLNQKPWHPLSYPNQRRKW 60
           MDG   E GGIRLSKRFSD     SGEVDYKTKAGTAWSHSYLNQKPWHPLSYPNQRRKW
Sbjct: 1   MDGIGEEGGGIRLSKRFSDNKPGSSGEVDYKTKAGTAWSHSYLNQKPWHPLSYPNQRRKW 60

Query: 61  IAEQTHSQREKRAEEVAREYAQEQEFFRQTALVSKKEKEKLEMMKAVSFMYVRPPGYNAE 120
           IAEQTHS R +RAEEVAREYAQEQEFFRQTAL+SKKEKEK+EMMKAVSFMYVRPPGYNAE
Sbjct: 61  IAEQTHSHRMRRAEEVAREYAQEQEFFRQTALISKKEKEKVEMMKAVSFMYVRPPGYNAE 120

Query: 121 SAKAAEIADDRKNQEGDSSSQNLPKDGSVNERPPESSGTVGRDHGDKKKPRPKDVFGRAL 180
           SAKAAEIAD+RK  E ++ S +   D      P ES      +  +KKKPRPKDVFGR L
Sbjct: 121 SAKAAEIADERKRIEPNNVSDDQSTDVVSTAMPTESLPGKDPNGAEKKKPRPKDVFGRPL 180

Query: 181 PTEEEFEVLKNAPRMDTGVFARVKPFGVEVRNVKCVRCGIFGHQSGDRECPLKDAIMPNE 240
           PTEEEFE+LKNAPR++TGV  RVKPFGVEVRNVKC+RCG +GHQSGDRECPLKDAIMPNE
Sbjct: 181 PTEEEFEILKNAPRLETGVLGRVKPFGVEVRNVKCLRCGNYGHQSGDRECPLKDAIMPNE 240

Query: 241 ENRLKRDDPLSTILAHAESSEPLKWELKQKPGISPPRGGFNPDDPNQQIVAEDIFDEYGG 300
           E+RLKRDDPL+ I+A  + +EPLKWELKQKPG+SPPRGGF PDDPNQQIVAEDIFDEYGG
Sbjct: 241 ESRLKRDDPLTAIMAQMDPTEPLKWELKQKPGMSPPRGGFQPDDPNQQIVAEDIFDEYGG 300

Query: 301 FLSSGGIVPELLSNFSSKPKKKKSSKESSRKKAKNQE-------------------DDER 360
           FL SGG +P+LL+N S KPKK+KSS +S  K+  +                     DDER
Sbjct: 301 FL-SGGNIPDLLTNISCKPKKRKSSSKSKHKRQSSPSSRELEVPDQDGLPSPAHSDDDER 360

Query: 361 ISKKKSKSKRKKQ------ISSESSPETSESDRRNRRNKHKTSYSSDDFDSKTQLGTKKH 420
            SK+K K+K+KK+       S  SS +  + DR  R+ ++K SYSS+D DS  Q  TK  
Sbjct: 361 RSKRKKKTKKKKKKKKRQNYSESSSSDGLDFDRHQRKRRNKYSYSSEDSDSDRQYKTK-- 420

Query: 421 RRKHLNTSDVSNFERPHITDRYCPERSPSDISDSGRQDERRKGKGSESEHRSGRKNQSYS 480
             K   +S+ S+ +R + T         S  S+    D+   GK S S+H       SYS
Sbjct: 421 -HKCSYSSEDSDSDRQYKTKE--SREKLSYTSEDLDSDQECWGKRSRSKH-------SYS 480

Query: 481 SEESEFETHRRSDKSRSKRSLSKDHRPDSHHSSSKMRRRRSYSSDDSETNRR-RKSKGQK 530
           SE  +F+ H R  K + K S S +      H   K   ++ Y+SD  + +R   K  GQK
Sbjct: 481 SE--DFDRHHR--KIKHKCSYSSEDSDSGRHDLKKKSTQKPYTSDRMDVDRHWSKRSGQK 540

BLAST of CmoCh01G014140 vs. TrEMBL
Match: W9RFP8_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_025969 PE=4 SV=1)

HSP 1 Score: 507.7 bits (1306), Expect = 1.7e-140
Identity = 322/539 (59.74%), Positives = 375/539 (69.57%), Query Frame = 1

Query: 1   MDGEEG-GIRLSKRFSDKSGSGEVDYKTKAGTAWSHSYLNQKPWHPLSYPNQRRKWIAEQ 60
           M+GEEG G+RLSKRFS+ S SGEVDYK KAGTAWSHSYLNQKPWHPLSYPNQRRKWIAEQ
Sbjct: 1   MEGEEGEGLRLSKRFSETSKSGEVDYKMKAGTAWSHSYLNQKPWHPLSYPNQRRKWIAEQ 60

Query: 61  THSQREKRAEEVAREYAQEQEFFRQTALVSKKEKEKLEMMKAVSFMYVRPPGYNAESAKA 120
           TH+ R +R+E    EYAQEQEFFRQTALVSKKEKEKLE+MKAVSFMYVRPPGYNAESAKA
Sbjct: 61  THANRHRRSE----EYAQEQEFFRQTALVSKKEKEKLEIMKAVSFMYVRPPGYNAESAKA 120

Query: 121 AEIADDRKNQEGDSSSQNLPKDGSVNERPPESSGTVGRDH---GDKKKPRPKDVFGRALP 180
           AEIAD+ K  E ++ SQ+    G+    PPES    GRDH    +KKKPRPKDVFGR+LP
Sbjct: 121 AEIADEIKTHEQENPSQDPTTSGTSTSMPPES--MPGRDHLVREEKKKPRPKDVFGRSLP 180

Query: 181 TEEEFEVLKNAPRMDTGVFARVKPFGVEVRNVKCVRCGIFGHQSGDRECPLKDAIMPNEE 240
           TEEEFEVLKNAPR++TGV ARVKPFGVEVRNVKCVRCG +GHQSGDRECPL+DAIMPNEE
Sbjct: 181 TEEEFEVLKNAPRLETGVPARVKPFGVEVRNVKCVRCGTYGHQSGDRECPLRDAIMPNEE 240

Query: 241 NRLKRDDPLSTILAHAESSEPLKWELKQKPGISPPRGGFNPDDPNQQIVAEDIFDEYGGF 300
           +RLKRDDP++ ILAH + SEPLKWELKQKPGISPPRGGF PDDPNQQIVAEDIF+EYGGF
Sbjct: 241 SRLKRDDPMTAILAHTDPSEPLKWELKQKPGISPPRGGFKPDDPNQQIVAEDIFNEYGGF 300

Query: 301 LSSGGIVPELLSNFSSKPKKKKSSKESSRKKAKNQEDDERISKKKSKSKRKKQISSESSP 360
           L SGG +PELL+NFSSK KK K  K   + +     D E + +             ESS 
Sbjct: 301 L-SGGNIPELLTNFSSKSKKSKKKKNHKKHQFSCDSDSEELEE------------DESSS 360

Query: 361 ETSESDRRNRRNKHKTSYSSDDFDSKTQLGTKKHRRKHLNTSDVSNFERPHITDRYCPER 420
            + + +RR ++ KHK                KKH     ++SDV  +     ++R C   
Sbjct: 361 ASEDGERRQKKRKHKRK-------------NKKHNHSESSSSDVELYRHKEKSNRKC--- 420

Query: 421 SPSDISDSGRQDERRKGKGSESEHRSGRKNQSYSSEESEFETHRRSDKSRSK--RSL-SK 480
             S  SD+GR      GK         R+  S SSE+S  + H RSDK R +  +SL SK
Sbjct: 421 YSSGESDAGRH-HSSTGK---------RRKHSCSSEDSGKDRHHRSDKRRCRPMKSLPSK 480

Query: 481 DHRPDSHHSSSKMRRRRSYSSDDSETNRRRKSKG-QKHNYSSDDSE--QEKHARDKKRR 530
           D   D H+ S        YSS+DS+ N +  SKG QKH+YS   SE  +E H+  +K R
Sbjct: 481 DSNFDRHYKSRD--ECSHYSSEDSDRNGQSTSKGKQKHSYSRQHSETKRENHSSKQKSR 492

BLAST of CmoCh01G014140 vs. TrEMBL
Match: A0A0B0PYX8_GOSAR (Uncharacterized protein OS=Gossypium arboreum GN=F383_14281 PE=4 SV=1)

HSP 1 Score: 506.5 bits (1303), Expect = 3.9e-140
Identity = 320/551 (58.08%), Positives = 382/551 (69.33%), Query Frame = 1

Query: 1   MDG---EEGGIRLSKRFSDKS---GSGEVDYKTKAGTAWSHSYLNQKPWHPLSYPNQRRK 60
           MDG   E GGIRL K+F D +    S EVDYKTK GTAWSHSYLNQKPWHPLSYPNQRRK
Sbjct: 1   MDGIGEEGGGIRLRKKFPDDNKPGSSAEVDYKTKPGTAWSHSYLNQKPWHPLSYPNQRRK 60

Query: 61  WIAEQTHSQREKRAEEVAREYAQEQEFFRQTALVSKKEKEKLEMMKAVSFMYVRPPGYNA 120
           WIAEQTH+QR +RA+EVAREYAQEQEFFRQTAL+SKKEKEK+EMMKAVSFMYVRPPGYNA
Sbjct: 61  WIAEQTHAQRVRRADEVAREYAQEQEFFRQTALISKKEKEKVEMMKAVSFMYVRPPGYNA 120

Query: 121 ESAKAAEIADDRKNQEGDSSSQNLPKDGSVNERPPESSGTVGRDHGDKKKPRPKDVFGRA 180
           ESAKAAEIAD+RK  + ++ S +   D       PES    G    +K+K RPKDVFGR+
Sbjct: 121 ESAKAAEIADERKKTDHNNVSDDHSTDVPSTAMQPESLPGGGATTQEKRKSRPKDVFGRS 180

Query: 181 LPTEEEFEVLKNAPRMDTGVFARVKPFGVEVRNVKCVRCGIFGHQSGDRECPLKDAIMPN 240
           LPTEEEFEVLKNAPR++TGV  RVKPF VEVRNVKC+RCG +GHQSGDR+CPLKDAIMPN
Sbjct: 181 LPTEEEFEVLKNAPRLETGVPGRVKPFAVEVRNVKCLRCGNYGHQSGDRDCPLKDAIMPN 240

Query: 241 EENRLKRDDPLSTILAHAESSEPLKWELKQKPGISPPRGGFNPDDPNQQIVAEDIFDEYG 300
           EE+RLKRDDPL+ I+A  + +EPLKWELKQKPG+SPPRGGF PDDPNQQIVAEDIFDEYG
Sbjct: 241 EESRLKRDDPLTAIMAQMDPTEPLKWELKQKPGMSPPRGGFQPDDPNQQIVAEDIFDEYG 300

Query: 301 GFLSSGGIVPELLSNFSSKPKKKKSSKESSRKK-----------------AKNQEDDERI 360
           GFLS GG +P+LL+N S KPKK+KSSK+S  K+                 + + +DDE+ 
Sbjct: 301 GFLS-GGNIPDLLTNISCKPKKRKSSKKSKHKRNSSPSTWDADVPNQDGLSSSSDDDEKK 360

Query: 361 SKKKSKSKRK-KQISSESSPETSESDRRNRRNKHKTSYSSDDFDSKTQLGTKKHRRKHLN 420
           SKKK   K+K +  S  SS + +E D+  R+  +K SYSS+D DS  Q  TK+ R K   
Sbjct: 361 SKKKKTKKKKWRNYSGSSSDDGAEFDKHKRQRINKHSYSSEDSDSGRQYRTKERREKRSY 420

Query: 421 TSDVSNFERPHITDRYCPERSPSDISDSGRQDERRKGKGSESEHRSGRKNQSYSSEESEF 480
           TS+ S           C ++S S  S S  +D  R        +R G+  +SY+ E+S+ 
Sbjct: 421 TSEDS-----ESNPECCGKKSRSKHSYSSAEDFDR-------HYRKGKHKRSYTPEDSDS 480

Query: 481 ETHRRSDKSRSKRSLSKDHRPDSHHSSSKMRRRRSYSSDDSETNRRRKSKGQKHNYSSDD 528
             H R  KS  K     +   D    S K R+R S SSD +     RKSK  KH+YSS+D
Sbjct: 481 GRHDRKKKSTRKPYTLDETDADRRRMSQKSRQRHSNSSDKNH-RHDRKSK-NKHSYSSED 536

BLAST of CmoCh01G014140 vs. TAIR10
Match: AT4G19190.1 (AT4G19190.1 zinc knuckle (CCHC-type) family protein)

HSP 1 Score: 449.9 bits (1156), Expect = 2.2e-126
Identity = 302/548 (55.11%), Positives = 361/548 (65.88%), Query Frame = 1

Query: 2   DGEEGGIRLSKRFSD---KSGSGEVDYKTKAGTAWSHSYLNQKPWHPLSYPNQRRKWIAE 61
           +GE  GIRLSKRF+      GS EVDYKTK+GTAWSHS+LNQKPWHPLSYPNQRRKWIAE
Sbjct: 4   EGEGSGIRLSKRFAGGKVTGGSLEVDYKTKSGTAWSHSFLNQKPWHPLSYPNQRRKWIAE 63

Query: 62  QTHSQREKRAEEVAREYAQEQEFFRQTALVSKKEKEKLEMMKAVSFMYVRPPGYNAESAK 121
           QTH+Q ++RAEEVARE+AQEQEFF+Q AL+SKKE+EK+E MKAVSFMYVRPPGY+ ESAK
Sbjct: 64  QTHAQHDRRAEEVAREFAQEQEFFKQAALISKKEREKIETMKAVSFMYVRPPGYDPESAK 123

Query: 122 AAEIADDRKNQEGDSSSQNLPKDGSVNERPPESSGTVGRDHGDKKKPRPKDVFGRALPTE 181
           AAE  D++   +G SS+Q+   D +V  RP ES G  G    ++KKPRPKDVFGRALPTE
Sbjct: 124 AAEYKDEKHKGQG-SSTQDPVADDNVGSRPEESQGG-GERTQERKKPRPKDVFGRALPTE 183

Query: 182 EEFEVLKNAPRMDTGVFARVKPFGVEVRNVKCVRCGIFGHQSGDRECPLKDAIMPNEENR 241
           EEFEVLKNAPRM+TG+  RVKPF VEVRNVKC+RCG FGHQSGDR+CPLKDA+MPNEE R
Sbjct: 184 EEFEVLKNAPRMETGIPGRVKPFAVEVRNVKCLRCGNFGHQSGDRDCPLKDAVMPNEELR 243

Query: 242 LKRDDPLSTILAHAESSEPLKWELKQKPGISPPRGGFNPDDPNQQIVAEDIFDEYGGFLS 301
           LKRDDPL+ I+AH + SEPLKWELKQKPG+SPPRGGF+PDDPNQQIVAEDIFDEYGGFL 
Sbjct: 244 LKRDDPLTAIIAHTDPSEPLKWELKQKPGLSPPRGGFDPDDPNQQIVAEDIFDEYGGFL- 303

Query: 302 SGGIVPELLSNFSSKPKKKKSSKESSRKKAKN---QEDDERIS-------------KKKS 361
            G I  E+L + SS  KK+KS K    KK  +   +E DE  +             +KK 
Sbjct: 304 EGSIPIEILKSMSS-DKKRKSKKNKRHKKHSSRTVEETDESSTGSEDSREKRGSKKRKKL 363

Query: 362 KSKRKKQISSES-SPETSESD--RRNRRNKHKTSYSSDDFDSKTQLGTKKHRRKHLNTSD 421
           K K KKQ  S+S S E S SD  R +RR   K    S    S+       HR KH    D
Sbjct: 364 KKKSKKQYDSDSLSFEGSGSDSYRLSRRRHTKHVDPSASLKSEVYHQGNSHREKHY--YD 423

Query: 422 VSNFERPHITDRYCPERSPSDISDSGRQDERRKGKGSESEHRSGRKNQSYSSEESEFETH 481
             + +R  I DR       SD   S    ++R     +S HR  ++  S      + +  
Sbjct: 424 EKHQKRKEIVDRPSASSDDSDYYRSNSSRKKRSEDDYKSHHRERKQVHSNDPVSEKSQKQ 483

Query: 482 RRSDKSRSKRSLSKDHRPDSHHSSSKMRRRRSYSSDDSETNRRRKSKGQKHNYSSDDSEQ 528
             S+  + +R + K+HR D         RR  Y   +SE NR R  K  ++    DD + 
Sbjct: 484 HYSESGKIQR-VEKEHRYD--------ERRHRYVDMESE-NRNRSEKKPRY----DDRDS 531

BLAST of CmoCh01G014140 vs. NCBI nr
Match: gi|659086998|ref|XP_008444222.1| (PREDICTED: uncharacterized zinc finger CCHC domain-containing protein At4g19190 [Cucumis melo])

HSP 1 Score: 669.8 bits (1727), Expect = 3.8e-189
Identity = 411/596 (68.96%), Positives = 442/596 (74.16%), Query Frame = 1

Query: 1   MDGEEGG-IRLSKRFSDKSGSGEVDYKTKAGTAWSHSYLNQKPWHPLSYPNQRRKWIAEQ 60
           MDGEEGG IRLSKRFSDK+GSGEVDYKTKAGTAWSHSYLNQKPWHPLSYPNQRRKWIAEQ
Sbjct: 1   MDGEEGGGIRLSKRFSDKAGSGEVDYKTKAGTAWSHSYLNQKPWHPLSYPNQRRKWIAEQ 60

Query: 61  THSQREKRAEEVAREYAQEQEFFRQTALVSKKEKEKLEMMKAVSFMYVRPPGYNAESAKA 120
           THSQREKRAEEVAREYAQEQEFFRQTALVSKKEKEKLEMMKAVSFMYVRPPGYNAESAKA
Sbjct: 61  THSQREKRAEEVAREYAQEQEFFRQTALVSKKEKEKLEMMKAVSFMYVRPPGYNAESAKA 120

Query: 121 AEIADDRKNQEGDSSSQNLPKDGSVNERPPESSGTVGRDHGDKKKPRPKDVFGRALPTEE 180
           AEIADDRK QEGD+ SQ+LP D S NERPPESS TV R+ G  KKPRPKDVFGRALPTEE
Sbjct: 121 AEIADDRKKQEGDNPSQDLPIDSSANERPPESSSTVSREPG--KKPRPKDVFGRALPTEE 180

Query: 181 EFEVLKNAPRMDTGVFARVKPFGVEVRNVKCVRCGIFGHQSGDRECPLKDAIMPNEENRL 240
           EFEVLKNAPRMDTGVFARVKPFGVEVRNVKCVRCGIFGHQSGDRECPLKDAIMPNEE+RL
Sbjct: 181 EFEVLKNAPRMDTGVFARVKPFGVEVRNVKCVRCGIFGHQSGDRECPLKDAIMPNEESRL 240

Query: 241 KRDDPLSTILAHAESSEPLKWELKQKPGISPPRGGFNPDDPNQQIVAEDIFDEYGGFLSS 300
           KRDDPL+TILA+AE+SEPLKWELKQKPGISPPRGGFNPDDPNQQIVAEDIFDEYGGFLSS
Sbjct: 241 KRDDPLTTILANAETSEPLKWELKQKPGISPPRGGFNPDDPNQQIVAEDIFDEYGGFLSS 300

Query: 301 GGIVPELLSNFSSKPKKKKSSKESSRKK--------AKNQEDDERISKKKSKSKRKKQIS 360
           GGIVPELLSNFSSKPKKKK S++SSRKK         K+ EDDERISKKK KSKRKKQ++
Sbjct: 301 GGIVPELLSNFSSKPKKKKVSRQSSRKKFQSSSSRKEKDLEDDERISKKKRKSKRKKQVN 360

Query: 361 SESSPETSESDRRNRRNKHKTSYSSDDFDSKTQLGTKKHRRKHLNTSDVSNFERPHITDR 420
           SESS ETSESDRR+RR KHK S+ SDD DSKT   TKKHRRKHLNTSDVS     +  DR
Sbjct: 361 SESSSETSESDRRDRRIKHKISHLSDDSDSKTHHKTKKHRRKHLNTSDVSETSESN-NDR 420

Query: 421 YCPERSPSDISDSGRQDERRKGKGSESEHRSGRKNQS-------------YSSEES--EF 480
               +      DS  +  RR  K     HR  + N S             Y  E S  + 
Sbjct: 421 RNKHKISYLSDDSDSKTHRRTKK-----HRREQLNTSDVSNFEWPHIADRYCPEPSPPDI 480

Query: 481 ETHRRSDKSRSKRSLSKDHRPDSHHSSSKMRRRRSYSSDDSET----------------- 530
               R DK R ++      R DS       R+ +S+SS+DS                   
Sbjct: 481 SGSERHDKGRKRKDFYSYKRLDSELEHKSGRKNQSHSSEDSGFERHPLSDKTRSKHSSSK 540

BLAST of CmoCh01G014140 vs. NCBI nr
Match: gi|449449912|ref|XP_004142708.1| (PREDICTED: uncharacterized zinc finger CCHC domain-containing protein At4g19190 [Cucumis sativus])

HSP 1 Score: 666.8 bits (1719), Expect = 3.2e-188
Identity = 398/538 (73.98%), Positives = 427/538 (79.37%), Query Frame = 1

Query: 1   MDGEEGG-IRLSKRFSDKSGSGEVDYKTKAGTAWSHSYLNQKPWHPLSYPNQRRKWIAEQ 60
           MDGEEGG IRLSKRFSDK+GSGEVDYKTKAGTAWSHSYLNQKPWHPLSYPNQRRKWIAEQ
Sbjct: 1   MDGEEGGGIRLSKRFSDKAGSGEVDYKTKAGTAWSHSYLNQKPWHPLSYPNQRRKWIAEQ 60

Query: 61  THSQREKRAEEVAREYAQEQEFFRQTALVSKKEKEKLEMMKAVSFMYVRPPGYNAESAKA 120
           THSQREKR EEVAREYAQEQEFFRQTALVSKKEKEKLEMMKAVSFMYVRPPGYNAESAKA
Sbjct: 61  THSQREKRTEEVAREYAQEQEFFRQTALVSKKEKEKLEMMKAVSFMYVRPPGYNAESAKA 120

Query: 121 AEIADDRKNQEGDSSSQNLPKDGSVNERPPESSGTVGRDHGDKKKPRPKDVFGRALPTEE 180
           AEIADDRK QEGD+ SQ+LPKD S N RPPESS TVGR+ G  KKPRPKDVFGRALPTEE
Sbjct: 121 AEIADDRKKQEGDNPSQDLPKDSSGNARPPESSSTVGREPG--KKPRPKDVFGRALPTEE 180

Query: 181 EFEVLKNAPRMDTGVFARVKPFGVEVRNVKCVRCGIFGHQSGDRECPLKDAIMPNEENRL 240
           EFEVLKNAPRMDTGVFARVKPFGVEVRNVKCVRCGIFGHQSGDRECPLKDAIMPNEE+RL
Sbjct: 181 EFEVLKNAPRMDTGVFARVKPFGVEVRNVKCVRCGIFGHQSGDRECPLKDAIMPNEESRL 240

Query: 241 KRDDPLSTILAHAESSEPLKWELKQKPGISPPRGGFNPDDPNQQIVAEDIFDEYGGFLSS 300
           KRDDPL+TILA+AE+SEPLKWELKQKPGISPPRGGFNPDDPNQQIVAEDIFDEYGGFLS 
Sbjct: 241 KRDDPLTTILANAETSEPLKWELKQKPGISPPRGGFNPDDPNQQIVAEDIFDEYGGFLSC 300

Query: 301 GGIVPELLSNFSSKPKKKKSSKESSRK--------KAKNQEDDERISKKKSKSKRKKQIS 360
           GGIVPELLSNFSSKPKK K S++SSRK        K K+ EDDERISKKK KSKRKKQ++
Sbjct: 301 GGIVPELLSNFSSKPKKNKFSRQSSRKKLQSSSSRKEKDLEDDERISKKKHKSKRKKQVN 360

Query: 361 SESSPETSESDRRNRRNKHKTSYSSDDFDSKTQLGTKKHRRKHLNTSDVSNFERPHITDR 420
            ESS ETSESDRR+RRNKHK S+ SDD DSK    TKKHRRKHLNTSDVS     + +DR
Sbjct: 361 GESSSETSESDRRDRRNKHKISHLSDDSDSKMHHKTKKHRRKHLNTSDVSETSESN-SDR 420

Query: 421 YCPERSPSDISDSGRQDERRKGKGSESEHRSGRKNQSYSSEESEFETHRRSDKSRSKRSL 480
              +   S +SD       R+ K    +HR  R N   +S+ S FE    +D+   + S 
Sbjct: 421 RRNKHKISYLSDDSASKTHRRSK----KHRRKRLN---TSDVSNFERSHITDRYCPEPSP 480

Query: 481 SKDHRPDSHHSSSKMRRRRSYSSDDSETNRRRKSKGQKHNYSSDDSEQEKHARDKKRR 530
           S     + H    K R    Y   DSE   R   K Q H  SS+DS  E+H    K R
Sbjct: 481 SDISDSERHDKGRKHRDFYPYKKLDSELEHRSGRKDQSH--SSEDSGFERHPISDKTR 526

BLAST of CmoCh01G014140 vs. NCBI nr
Match: gi|645228720|ref|XP_008221127.1| (PREDICTED: uncharacterized zinc finger CCHC domain-containing protein At4g19190 [Prunus mume])

HSP 1 Score: 522.7 bits (1345), Expect = 7.5e-145
Identity = 333/567 (58.73%), Positives = 398/567 (70.19%), Query Frame = 1

Query: 1   MDGEE--GGIRLSKRFSDKSGSGEVDYKTKAGTAWSHSYLNQKPWHPLSYPNQRRKWIAE 60
           M+GEE  GGIRLSKRFSDK G GEVDYKTK+GTAWSHSYLNQKPWHPLSYPNQRRKWIAE
Sbjct: 1   MEGEEEGGGIRLSKRFSDKGG-GEVDYKTKSGTAWSHSYLNQKPWHPLSYPNQRRKWIAE 60

Query: 61  QTHSQREKRAEEVAREYAQEQEFFRQTALVSKKEKEKLEMMKAVSFMYVRPPGYNAESAK 120
           Q  +Q ++R EEV+REYAQEQEFFRQ ALVSKK+KEK+EMMKAVSFMYVRPPGYNAESAK
Sbjct: 61  QIQAQHQRRTEEVSREYAQEQEFFRQAALVSKKDKEKIEMMKAVSFMYVRPPGYNAESAK 120

Query: 121 AAEIADDRKNQEGDSSSQNLPKDGSVNE-------RPPESSGTVGRDHGDKKKPRPKDVF 180
           AAEIAD+   +  ++ S   P+  +  +       RPPE     G +    KKPRPKDVF
Sbjct: 121 AAEIADESVREHHNNPSFEDPQTQTQTDAPSTSRPRPPECMPPSGEE---AKKPRPKDVF 180

Query: 181 GRALPTEEEFEVLKNAPRMDTGVFARVKPFGVEVRNVKCVRCGIFGHQSGDRECPLKDAI 240
           GR LPTE+EFE+LKNAPRM+TGV  R KPFGVEVRNVKCVRCG FGHQSGDRECPLKDAI
Sbjct: 181 GRPLPTEQEFEILKNAPRMETGVPTRAKPFGVEVRNVKCVRCGAFGHQSGDRECPLKDAI 240

Query: 241 MPNEENRLKRDDPLSTILAHAESSEPLKWELKQKPGISPPRGGFNPDDPNQQIVAEDIFD 300
           MPNEE+RLKRDDPL+ ILAH + SEPLKWELKQKPGISPPRGGF PDDPNQQIVAEDIFD
Sbjct: 241 MPNEESRLKRDDPLTAILAHTDPSEPLKWELKQKPGISPPRGGFKPDDPNQQIVAEDIFD 300

Query: 301 EYGGFLSSGGIVPELLSNFSSKPKKKKSSKESSRKK-----------AKNQEDDERISKK 360
           EYGGFL SG ++PELL+NF S+P+ K   K   +KK           + + EDD++  KK
Sbjct: 301 EYGGFL-SGDVIPELLTNFPSQPRDKSKKKTKHKKKQSSPMSGEDRLSSSNEDDKKSKKK 360

Query: 361 KSKSKRKKQISSESSP-ETSESDRRNRRNKHKTSYSSDDFDS-KTQLGTK---KHRRKHL 420
             K  +KK   SESSP E  E DR   +++ K SYS ++ ++ K Q  TK   KH R + 
Sbjct: 361 ICKVNKKKHGHSESSPSEILEFDRHKVKSRDKHSYSFENSNNEKHQRSTKKRHKHSRSYE 420

Query: 421 NTSDVSNFERPHITDRYCPERSPSDISDSGRQDERRKGKGS--------ESEHRSG--RK 480
           ++    +++R + +DR C   S     D+  +D +R+ K S        E  HRS   R+
Sbjct: 421 DSEICRHYKRDNSSDR-CTNSSEDSEPDNHHRDVQRRRKHSFTVEDLYHEKHHRSRKVRQ 480

Query: 481 NQSYSSEESEFETHRRSDKSR-SKRSLSKDHRPDSHHSSSKMRRRRSYSSDDSETNR--R 530
             S+S E+S+ + H R D+SR S  + S+D  PD H  S K RRR SYSS+DS TNR  R
Sbjct: 481 KHSHSYEDSKIDRHWRRDRSRESLNNSSEDSEPDRHLRSIKSRRRHSYSSEDSGTNRQNR 540

BLAST of CmoCh01G014140 vs. NCBI nr
Match: gi|596299867|ref|XP_007227705.1| (hypothetical protein PRUPE_ppa003979mg [Prunus persica])

HSP 1 Score: 510.0 bits (1312), Expect = 5.0e-141
Identity = 325/558 (58.24%), Positives = 382/558 (68.46%), Query Frame = 1

Query: 1   MDGEEGGIRLSKRFSDKSGSGEVDYKTKAGTAWSHSYLNQKPWHPLSYPNQRRKWIAEQT 60
           M+    GIRLSKRFSDK G GEVDYKTK+GTAWSHSYLNQKPWHPLSYPNQRRKWIAEQ 
Sbjct: 1   MENGPRGIRLSKRFSDKGG-GEVDYKTKSGTAWSHSYLNQKPWHPLSYPNQRRKWIAEQI 60

Query: 61  HSQREKRAEEVAREYAQEQEFFRQTALVSKKEKEKLEMMKAVSFMYVRPPGYNAESAKAA 120
            +Q ++R EEV+REYAQEQEFFRQ ALVSKK+KEK+EMMKAVSFMYVRPPGYNAESAKAA
Sbjct: 61  QAQHQRRTEEVSREYAQEQEFFRQAALVSKKDKEKIEMMKAVSFMYVRPPGYNAESAKAA 120

Query: 121 EIADDRKNQEGDSSSQNLPKDGSVNERPPESSGTVGRDHGDKKKPRPKDVFGRALPTEEE 180
           EIAD                     E PPE     G +    KKPR KDVFGR LPTE+E
Sbjct: 121 EIAD---------------------ESPPECMPPSGEE---AKKPRLKDVFGRPLPTEQE 180

Query: 181 FEVLKNAPRMDTGVFARVKPFGVEVRNVKCVRCGIFGHQSGDRECPLKDAIMPNEENRLK 240
           FE+LKNAPRM+TGV  R KPFGVEVRNVKCVRCG FGHQSGDRECPLKDAIMPNEE RLK
Sbjct: 181 FEILKNAPRMETGVPTRAKPFGVEVRNVKCVRCGAFGHQSGDRECPLKDAIMPNEEGRLK 240

Query: 241 RDDPLSTILAHAESSEPLKWELKQKPGISPPRGGFNPDDPNQQIVAEDIFDEYGGFLSSG 300
           RDDPL+ ILAH + SEPLKWELKQKPGISPPRGGF PDDPNQQIVAEDIFDEYGGFL SG
Sbjct: 241 RDDPLTAILAHTDPSEPLKWELKQKPGISPPRGGFKPDDPNQQIVAEDIFDEYGGFL-SG 300

Query: 301 GIVPELLSNFSSKPKKKKSSKESSRKK-----------AKNQEDDERISKKKSKSKRKKQ 360
            ++PELL+NFSS+P+ K   K   +KK           + + EDD+R  KK  K  +KK 
Sbjct: 301 DVIPELLTNFSSQPRDKSKKKTKHKKKQSSPMSGEDGLSSSYEDDKRSKKKICKVNKKKH 360

Query: 361 ISSESSP-ETSESDRRNRRNKHKTSYSSDDFDSKTQLGTKKHRRKHLNTSDVS----NFE 420
             SESSP E  E DR   +++ K SYS ++ +++    + K R+KH ++ + S    +++
Sbjct: 361 GHSESSPSEILEFDRHKVKSRDKHSYSFENSNNEKHQRSTKKRQKHSHSYEDSEICRHYK 420

Query: 421 RPHITDRYCPERSPSDISDSGRQDERRKGKGS--------ESEHRSG--RKNQSYSSEES 480
           R + +DR C         D   +D +R+ K S        E  HRS   R+  S+S E+S
Sbjct: 421 RDNSSDR-CTNSFEDSEPDKHHRDVQRRRKHSFTVGDSYHEKHHRSRKVRQKHSHSYEDS 480

Query: 481 EFETHRRSDKSR-SKRSLSKDHRPDSHHSSSKMRRRRSYSSDDSETNR--RRKSKGQKHN 530
           + + H R DKSR S  + S+D  PD H  S K R R SYSS+DS TNR  R K    +H+
Sbjct: 481 KIDRHCRRDKSRESLNNSSEDSEPDRHLRSIKSRHRHSYSSEDSGTNRQNRNKKSRDRHS 531
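
Each HSP summary line reports identity and positives as a count over the alignment length, followed by a rounded percentage. The following is a minimal Python sketch, not part of the database output, that parses such a line and recomputes the percentages from the counts. The names `parse_hsp_stats` and `recomputed_pct` are hypothetical helpers introduced here for illustration, and the regular expression assumes the exact line layout shown on this page.

```python
import re

# Assumed layout: "Identity = a/b (p%), Positives = c/d (q%), ..."
# "Posi?tives" also accepts the spelling variant "Postives" found in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract identity/positives counts and reported percentages from one line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, length, ident_pct, pos, _pos_length, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "aligned_length": int(length),
        "identity_pct": float(ident_pct),
        "positives": int(pos),
        "positives_pct": float(pos_pct),
    }


def recomputed_pct(count, length):
    """Recompute a percentage from raw counts, rounded to two decimals."""
    return round(100.0 * count / length, 2)


# Summary line of the Prunus persica hit (XP_007227705.1) from this page.
stats = parse_hsp_stats(
    "Identity = 325/558 (58.24%), Positives = 382/558 (68.46%), Query Frame = 1"
)
identity_ok = recomputed_pct(stats["identical"], stats["aligned_length"]) == stats["identity_pct"]
positives_ok = recomputed_pct(stats["positives"], stats["aligned_length"]) == stats["positives_pct"]
```

Comparing the recomputed value with the reported one is a cheap consistency check when alignment statistics have been copied out of a web page.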

BLAST of CmoCh01G014140 vs. NCBI nr
Match: gi|590680404|ref|XP_007040853.1| (Zinc knuckle family protein isoform 1 [Theobroma cacao])

HSP 1 Score: 509.2 bits (1310), Expect = 8.6e-141
Identity = 326/560 (58.21%), Positives = 384/560 (68.57%), Query Frame = 1

Query: 1   MDG---EEGGIRLSKRFSDKS--GSGEVDYKTKAGTAWSHSYLNQKPWHPLSYPNQRRKW 60
           MDG   E GGIRLSKRFSD     SGEVDYKTKAGTAWSHSYLNQKPWHPLSYPNQRRKW
Sbjct: 1   MDGIGEEGGGIRLSKRFSDNKPGSSGEVDYKTKAGTAWSHSYLNQKPWHPLSYPNQRRKW 60

Query: 61  IAEQTHSQREKRAEEVAREYAQEQEFFRQTALVSKKEKEKLEMMKAVSFMYVRPPGYNAE 120
           IAEQTHS R +RAEEVAREYAQEQEFFRQTAL+SKKEKEK+EMMKAVSFMYVRPPGYNAE
Sbjct: 61  IAEQTHSHRMRRAEEVAREYAQEQEFFRQTALISKKEKEKVEMMKAVSFMYVRPPGYNAE 120

Query: 121 SAKAAEIADDRKNQEGDSSSQNLPKDGSVNERPPESSGTVGRDHGDKKKPRPKDVFGRAL 180
           SAKAAEIAD+RK  E ++ S +   D      P ES      +  +KKKPRPKDVFGR L
Sbjct: 121 SAKAAEIADERKRIEPNNVSDDQSTDVVSTAMPTESLPGKDPNGAEKKKPRPKDVFGRPL 180

Query: 181 PTEEEFEVLKNAPRMDTGVFARVKPFGVEVRNVKCVRCGIFGHQSGDRECPLKDAIMPNE 240
           PTEEEFE+LKNAPR++TGV  RVKPFGVEVRNVKC+RCG +GHQSGDRECPLKDAIMPNE
Sbjct: 181 PTEEEFEILKNAPRLETGVLGRVKPFGVEVRNVKCLRCGNYGHQSGDRECPLKDAIMPNE 240

Query: 241 ENRLKRDDPLSTILAHAESSEPLKWELKQKPGISPPRGGFNPDDPNQQIVAEDIFDEYGG 300
           E+RLKRDDPL+ I+A  + +EPLKWELKQKPG+SPPRGGF PDDPNQQIVAEDIFDEYGG
Sbjct: 241 ESRLKRDDPLTAIMAQMDPTEPLKWELKQKPGMSPPRGGFQPDDPNQQIVAEDIFDEYGG 300

Query: 301 FLSSGGIVPELLSNFSSKPKKKKSSKESSRKKAKNQE-------------------DDER 360
           FL SGG +P+LL+N S KPKK+KSS +S  K+  +                     DDER
Sbjct: 301 FL-SGGNIPDLLTNISCKPKKRKSSSKSKHKRQSSPSSRELEVPDQDGLPSPAHSDDDER 360

Query: 361 ISKKKSKSKRKKQ------ISSESSPETSESDRRNRRNKHKTSYSSDDFDSKTQLGTKKH 420
            SK+K K+K+KK+       S  SS +  + DR  R+ ++K SYSS+D DS  Q  TK  
Sbjct: 361 RSKRKKKTKKKKKKKKRQNYSESSSSDGLDFDRHQRKRRNKYSYSSEDSDSDRQYKTK-- 420

Query: 421 RRKHLNTSDVSNFERPHITDRYCPERSPSDISDSGRQDERRKGKGSESEHRSGRKNQSYS 480
             K   +S+ S+ +R + T         S  S+    D+   GK S S+H       SYS
Sbjct: 421 -HKCSYSSEDSDSDRQYKTKE--SREKLSYTSEDLDSDQECWGKRSRSKH-------SYS 480

Query: 481 SEESEFETHRRSDKSRSKRSLSKDHRPDSHHSSSKMRRRRSYSSDDSETNRR-RKSKGQK 530
           SE  +F+ H R  K + K S S +      H   K   ++ Y+SD  + +R   K  GQK
Sbjct: 481 SE--DFDRHHR--KIKHKCSYSSEDSDSGRHDLKKKSTQKPYTSDRMDVDRHWSKRSGQK 540

The following BLAST results are available for this feature:
Match Name   E-value    Identity  Description
Y4919_ARATH  3.9e-125   55.11     Uncharacterized zinc finger CCHC domain-containing protein At4g19190 OS=Arabidop...

Match Name        E-value    Identity  Description
A0A0A0KYH9_CUCSA  2.2e-188   73.98     Uncharacterized protein OS=Cucumis sativus GN=Csa_4G358660 PE=4 SV=1
M5XSJ2_PRUPE      3.5e-141   58.24     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003979mg PE=4 SV=1
A0A061GE82_THECC  6.0e-141   58.21     Zinc knuckle family protein isoform 1 OS=Theobroma cacao GN=TCM_016691 PE=4 SV=1
W9RFP8_9ROSA      1.7e-140   59.74     Uncharacterized protein OS=Morus notabilis GN=L484_025969 PE=4 SV=1
A0A0B0PYX8_GOSAR  3.9e-140   58.08     Uncharacterized protein OS=Gossypium arboreum GN=F383_14281 PE=4 SV=1

Match Name   E-value    Identity  Description
AT4G19190.1  2.2e-126   55.11     zinc knuckle (CCHC-type) family protein

Match Name                        E-value    Identity  Description
gi|659086998|ref|XP_008444222.1|  3.8e-189   68.96     PREDICTED: uncharacterized zinc finger CCHC domain-containing protein At4g19190 ...
gi|449449912|ref|XP_004142708.1|  3.2e-188   73.98     PREDICTED: uncharacterized zinc finger CCHC domain-containing protein At4g19190 ...
gi|645228720|ref|XP_008221127.1|  7.5e-145   58.73     PREDICTED: uncharacterized zinc finger CCHC domain-containing protein At4g19190 ...
gi|596299867|ref|XP_007227705.1|  5.0e-141   58.24     hypothetical protein PRUPE_ppa003979mg [Prunus persica]
gi|590680404|ref|XP_007040853.1|  8.6e-141   58.21     Zinc knuckle family protein isoform 1 [Theobroma cacao]

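
The E-value and identity columns of the hit tables can be used together to shortlist close homologs. The sketch below is illustrative only: it copies the five NCBI nr hits listed on this page into a Python list and filters them with cut-offs that are assumptions chosen for the example (`max_evalue`, `min_identity`), not thresholds used by the database. The helper name `filter_hits` is hypothetical.

```python
# (accession, E-value, percent identity) for the NCBI nr hits listed on this page.
NR_HITS = [
    ("XP_008444222.1", 3.8e-189, 68.96),
    ("XP_004142708.1", 3.2e-188, 73.98),
    ("XP_008221127.1", 7.5e-145, 58.73),
    ("XP_007227705.1", 5.0e-141, 58.24),
    ("XP_007040853.1", 8.6e-141, 58.21),
]


def filter_hits(hits, max_evalue, min_identity):
    """Keep hits at or below the E-value cut-off and at or above the identity cut-off."""
    return [
        hit for hit in hits
        if hit[1] <= max_evalue and hit[2] >= min_identity
    ]


# Example cut-offs (assumed values, chosen only to illustrate the filter).
close_homologs = filter_hits(NR_HITS, max_evalue=1e-150, min_identity=60.0)

# Rank all hits by identity instead of E-value, highest first.
by_identity = sorted(NR_HITS, key=lambda hit: hit[2], reverse=True)
```

Sorting by identity rather than E-value can reorder hits: here the second row of the table has the highest identity even though its E-value is slightly larger than that of the first row.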
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR019339  CIR_N_dom
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0008270      zinc ion binding
molecular_function  GO:0003674      molecular_function
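
Each GO assignment row has three whitespace-separated fields: category, accession, and a term name that may itself contain spaces. A minimal Python sketch for grouping the rows by category follows; the helper name `group_go_terms` is hypothetical, and the parsing assumes the row layout shown in the table above.

```python
from collections import defaultdict

# GO assignment rows as listed for this gene.
GO_ROWS = """\
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0003674 molecular_function
"""


def group_go_terms(text):
    """Map each GO category to a list of (accession, term name) pairs."""
    grouped = defaultdict(list)
    for line in text.splitlines():
        if not line.strip():
            continue
        # Split on the first two whitespace runs only, so that
        # multi-word term names stay intact.
        category, accession, name = line.split(None, 2)
        grouped[category].append((accession, name.strip()))
    return dict(grouped)


go_terms = group_go_terms(GO_ROWS)
```

Limiting the split to two separators is the design choice that matters here, since names such as "nucleic acid binding" would otherwise be broken apart.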

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh01G014140.1  CmoCh01G014140.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description                                       Source   Source Term    Source Description                   Alignment
IPR019339  CBF1-interacting co-repressor CIR, N-terminal domain  PFAM     PF10197        Cir_N                                coord: 43..78, score: 3.7
IPR019339  CBF1-interacting co-repressor CIR, N-terminal domain  SMART    SM01083        Cir_N_3                              coord: 42..78, score: 7.1
None       No IPR available                                      PANTHER  PTHR13151      CBF1 INTERACTING COREPRESSOR CIR     coord: 177..529, score: 2.1E-55; coord: 34..126, score: 2.1
None       No IPR available                                      PANTHER  PTHR13151:SF2  COREPRESSOR INTERACTING WITH RBPJ 1  coord: 177..529, score: 2.1E-55; coord: 34..126, score: 2.1
None       No IPR available                                      PFAM     PF15288        zf-CCHC_6                            coord: 208..229, score: 1.