CmaCh08G009030 (gene) Cucurbita maxima (Rimu)

Name: CmaCh08G009030
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Arid/bright DNA-binding domain-containing family protein
Location: Cma_Chr08 : 5592616 .. 5596487 (+)
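
The location field can be turned into a feature length with simple arithmetic. The sketch below is illustrative only: the function names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive (the usual GFF convention), which this record does not state explicitly.

```python
import re

def parse_location(text):
    """Split a 'seqid : start .. end (strand)' string into its parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates, so both endpoints count.
    return end - start + 1

seqid, start, end, strand = parse_location("Cma_Chr08 : 5592616 .. 5596487 (+)")
print(seqid, strand, span_length(start, end))  # Cma_Chr08 + 3872
```

Under that assumption the feature spans 3872 bases on the forward strand of Cma_Chr08.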
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTGTCAGATGAGCTCATGAATTGTGTTGGCTTTGACGAAAATTCAATTTCAGATTTTTCCAAGAGAATATTTGATATGAAGAGTGATAGGATGAAGAGTGGGGTTGTATTTCCTCAAAGAAGCAAGAAACTCCAGGATATTGATCATTGCACCATCAAAAATGAGGATATAAAAGTTATCGATCCACCTTCGGTCAAGGAAGCAAATTTTGTCCTGGAGAGAACGCAGAAACCGATGTTGGGATTGCTAAATTGGCTGAAAGATGTTGCAAGAAACCCTTGTGATCCATCAATATGTTCACTACCAGAAAAGTCAAAATGGAAATCACATGGAAATGAAGAGGTTTGGAAACAAGTTTTGCTAGTTCGAGAGGAATTTTTTGTAAAAAGACGAGTCGATGCAAGTAGCGAGCAATCTTTCTTGCAGGTACTTTTCATAAATTCTTAAGTTGTTTATAGTATATCTTTATCCCTATAAAGAATCAAATGCTATTTATATCTGTGCCTGTATTAAGTTCTGTGTGTTCTCGTTGTCCCATGATATATAAGTTGATTGTAGTATTTTTCGAACTGATTTCAGTATTTTATTGTTTTTCATATCATTGTTCTCTCCGTCTTGATATGATTTTCATACTATGCATTTTGATTGGTATCCCTTTACATCTTAAATAGTGTTCTGTTCCTTGTAAACTGCAAAGATTCTCATTGTGAAGGCCAATATGGAAGTAAGACATAATCTTAAAATGTATCATCGAGTTACTACAATAATTTGCTTTCATCTGATCTATAGTATCTATCTAGTTTAGTTGGTTATTTAGTATTATGAACCTTTGTATCTTTTAGGAATTTCTGTAGTGTCCTTTCTATATGTTTAATGGTTTAATGGTCTTGTGTGAATATAAAACTATAGAATCCTTCATATCAACTCAGAATATTATTTATCACCGTATGAAGGTACTATTGAGTGTTAGCTGTTATTTTGTAAAATTGATCTTGAGCTTAAATAAGTGATGAAATTGGTTCCTCCAAGTTTAATTAGAAATAATTTCACGAACCAACTAGTAAATTGTAGAAAACACCTGTAAAATACTATAAAAACTATTTATATCCAGCCATTAAAATATCAAAATCTAGCCTTAAACCTGAGCGTGGATATTAAGTCTAATATTCAACTAACATAATCAAGAAAACTACAATTCACCAAAATAGTCTCATTCAACCCTAAGGACTACGAAAGGCCAATTTGATAATTATTTAGGCCCTAGCGACTTGATTTTTTGTTTATGAAAAATAAGCGTATAAATTATATTTGTTTCTCTTTATAATCATTTGGTTCTTTGTTTTTCAAAATTGTATATTTTTTTTTCTCATCATTTCTCTACAATTGTTTTCATCTTTCTTAAAAAATGTTAGAATTTTTCAACCAAATTTTCAAACACAAACTTGTTTTTGAAAGTTAGTCTTCATGGTACTTATAGGTGTTGTAATTTTTTTAAAAAAATCTTTTAAAATTTTTTGCAATAGTATTTTGTTCTCTCCTTTTTTTTCTTTTTAATATGTTGAATGCTACTATTACAAGATTGGAAGTTTGAGGTCCTATTTGACATAAAATTTAATTTTATAACTAATGAATCAATTTATTTTCAAAAACTTTGAATATGTCATAAATATGTTAAACACAAAATTCAAAAACTTCTTTGACACAAAATTGTAGTTTCAAAGATGACATTTCTTAGACACTTTTTAAATTTTAAGAATCTTATAGGCATAAACTTGAAAATTCAAAAATTATGCTTCTAATCTTATAGACATGTACTTGTAATTTCGTAGACCTTTCGACATTGGGCAGGCGTTAGCCCCCATACCGACTTTGCGGAGACTTGTGTTTTTGGTAAACAGTCCCCGAGCTTGGTCACTGCAACCCCTTTTGTTAGGAGGCACCCTTTCTCCCAAAGTTAACGGGGCTATTTTGCCAATTTCCTTAGAGAGAATTGTCTT
GCGCCCCTAGGTATTCTCTACCTTCCCACCTGTGACGGTTTCGGATACAGGTACCCTTTTGTTGAAAGTTGTTTGAGCTTTTCTTGGGAATATGACATGGGTTACTTCAGCTCCATAGCGCCTAGTAACAGTGGATTAGAGTAATGTGTTATAGTGATGTCTCGGTCGAGAATTGAGAAGAATTGAATAAGTATGGTATCTAATGATAATGATAAAGTGAGTGTGAGAACGAGTGATAATGGTTTTTTTAAAGGATAAGTTATACAAAGAAGAGGGAGGAGAGGCCTCCCTGTCTCTCAAGGACTTGTTAACCTCACAACAACATGATAAATAAACATATTACAGAAGAAAAAAGAAAGGATATATGATGAGAATTGAGAAGGGGGATGAAATTAGAGTGTAAGGGAAGCCAACTATTGGCAGGATTCGGAGTTGTAAGGCTGGGTAATTTACTATCACCGAACTTAACGTCTTTGACATAATTTATTATCAGGTTCATTTGATCTTCCCTATTTTGCAGAGAAATCAGAGGATGCATCCTTGTATGTATGATAACGATACGGTTCCAATTTACAATCTTAGGAAGAGATTAAGCATTGACAAGAAGGTTCAATCTCAAGAACCTGTCTCTAAATCAAGTGATTCTTCACCTTCAGATTCATCAGATGAGTACAAGCCCGTTCTCCTAGGGCCAGATTATCAAGCTCAGATACCAGAATGGAATGGTGTGATATCCAACAGCAATTTGAAGTGGTTGGGAACTCAAGAATGGCCTTTAAACTTAAAAAAAGGAAGTAACAGAAATCCCGTTGAAAGGGATCCCATTGGAAAAGGAAGGCGAGATCCTTGTGGATGCTTGGACGCTGGTTCAGTTGGTTGTGTCAAATTTCACGTTGCTGAGAAAAGGCACAGACTAAAGCTAGAGTTGGGCGCTGCGTTTCTCCAATGGAGATTTGATCAGATGGGCGAAGATGTTACATTTGCTTGGAAAGTAGATGATGAGAAAAAGTTCGAGGATATAGTGACGTCGAACCCTCCATCTCTCGGCATATGTTTTTGGGATGAGATTGTTGAGTCATTCCCTTCTAGGAGCAGGGAAGATCTTGTTAGCTACTACTATAATGTCTTTCTTTTACGTCGTAGAGGACACCAAAATCGGGTTACACCGAACAAAATCGATAGTGATGATGAATCAGAATCCGGGATTACAACCAATGGATTCGGACACGAAGTGCATAACTCACCCGGCTCCATTTTCTACTCTCCCAAGAAGCCACGGTAAGCTTGAGGTATCGCTTCAAAATGGATTTTGACAAGTACGTACCTATGTTGATAATGTGATTCTGATTTGGAAACAGGCATTTTTTATTTTTGATTTTTGTAAGGAAAATGTGAAAAGGAAAATTCTTGTGTGTTACATGGAAAGTTGTAATCTGTGAAAGATGGAAGGAAGTCAATGGGACGAATACGTCGTGATTTACCAACAAATTGTTATGAAAATGGGCTGTTTTCTGAGGCTCAGATGTTCGGTTTCAATTATTTGAGGGGTTGATTAATTGTTTTGGCTAGATGTTATCTCCAAGAAATGAGGAGGATCTTGGTCGTAAGGGCCTCTTAAACAAGGCAAATAGGTGTAGTTGGGAAGAGGATGCCATTCTGTTCATTACAAAGGTTTGATTCTTGTATATTTGCTCTCTCATGTTGGCTTATGAAAAACGTACAAGCATGATGCTTTAAATGATTAAAATCACCGTTTAAATTCTTTAGTGGTGGAATTTGTTTTTTTGTTTTTTTTTTGGTTATTAAGTTTGGCATAATCAACTCATTATAAACACTATTGTAATCCCTAGCTATTGTATTACGGTTTCCCTG

mRNA sequence

ATGTTGTCAGATGAGCTCATGAATTGTGTTGGCTTTGACGAAAATTCAATTTCAGATTTTTCCAAGAGAATATTTGATATGAAGAGTGATAGGATGAAGAGTGGGGTTGTATTTCCTCAAAGAAGCAAGAAACTCCAGGATATTGATCATTGCACCATCAAAAATGAGGATATAAAAGTTATCGATCCACCTTCGGTCAAGGAAGCAAATTTTGTCCTGGAGAGAACGCAGAAACCGATGTTGGGATTGCTAAATTGGCTGAAAGATGTTGCAAGAAACCCTTGTGATCCATCAATATGTTCACTACCAGAAAAGTCAAAATGGAAATCACATGGAAATGAAGAGGTTTGGAAACAAGTTTTGCTAGTTCGAGAGGAATTTTTTGTAAAAAGACGAGTCGATGCAAGTAGCGAGCAATCTTTCTTGCAGAGAAATCAGAGGATGCATCCTTGTATGTATGATAACGATACGGTTCCAATTTACAATCTTAGGAAGAGATTAAGCATTGACAAGAAGGTTCAATCTCAAGAACCTGTCTCTAAATCAAGTGATTCTTCACCTTCAGATTCATCAGATGAGTACAAGCCCGTTCTCCTAGGGCCAGATTATCAAGCTCAGATACCAGAATGGAATGGTGTGATATCCAACAGCAATTTGAAGTGGTTGGGAACTCAAGAATGGCCTTTAAACTTAAAAAAAGGAAGTAACAGAAATCCCGTTGAAAGGGATCCCATTGGAAAAGGAAGGCGAGATCCTTGTGGATGCTTGGACGCTGGTTCAGTTGGTTGTGTCAAATTTCACGTTGCTGAGAAAAGGCACAGACTAAAGCTAGAGTTGGGCGCTGCGTTTCTCCAATGGAGATTTGATCAGATGGGCGAAGATGTTACATTTGCTTGGAAAGTAGATGATGAGAAAAAGTTCGAGGATATAGTGACGTCGAACCCTCCATCTCTCGGCATATGTTTTTGGGATGAGATTGTTGAGTCATTCCCTTCTAGGAGCAGGGAAGATCTTGTTAGCTACTACTATAATGTCTTTCTTTTACGTCGTAGAGGACACCAAAATCGGGTTACACCGAACAAAATCGATAGTGATGATGAATCAGAATCCGGGATTACAACCAATGGATTCGGACACGAAGTGCATAACTCACCCGGCTCCATTTTCTACTCTCCCAAGAAGCCACGGTAAGCTTGAGGTATCGCTTCAAAATGGATTTTGACAAGTACGTACCTATGTTGATAATGTGATTCTGATTTGGAAACAGGCATTTTTTATTTTTGATTTTTGTAAGGAAAATGTGAAAAGGAAAATTCTTGTGTGTTACATGGAAAGTTGTAATCTGTGAAAGATGGAAGGAAGTCAATGGGACGAATACGTCGTGATTTACCAACAAATTGTTATGAAAATGGGCTGTTTTCTGAGGCTCAGATGTTCGGTTTCAATTATTTGAGGGGTTGATTAATTGTTTTGGCTAGATGTTATCTCCAAGAAATGAGGAGGATCTTGGTCGTAAGGGCCTCTTAAACAAGGCAAATAGGTGTAGTTGGGAAGAGGATGCCATTCTGTTCATTACAAAGGTTTGATTCTTGTATATTTGCTCTCTCATGTTGGCTTATGAAAAACGTACAAGCATGATGCTTTAAATGATTAAAATCACCGTTTAAATTCTTTAGTGGTGGAATTTGTTTTTTTGTTTTTTTTTTGGTTATTAAGTTTGGCATAATCAACTCATTATAAACACTATTGTAATCCCTAGCTATTGTATTACGGTTTCCCTG

Coding sequence (CDS)

ATGTTGTCAGATGAGCTCATGAATTGTGTTGGCTTTGACGAAAATTCAATTTCAGATTTTTCCAAGAGAATATTTGATATGAAGAGTGATAGGATGAAGAGTGGGGTTGTATTTCCTCAAAGAAGCAAGAAACTCCAGGATATTGATCATTGCACCATCAAAAATGAGGATATAAAAGTTATCGATCCACCTTCGGTCAAGGAAGCAAATTTTGTCCTGGAGAGAACGCAGAAACCGATGTTGGGATTGCTAAATTGGCTGAAAGATGTTGCAAGAAACCCTTGTGATCCATCAATATGTTCACTACCAGAAAAGTCAAAATGGAAATCACATGGAAATGAAGAGGTTTGGAAACAAGTTTTGCTAGTTCGAGAGGAATTTTTTGTAAAAAGACGAGTCGATGCAAGTAGCGAGCAATCTTTCTTGCAGAGAAATCAGAGGATGCATCCTTGTATGTATGATAACGATACGGTTCCAATTTACAATCTTAGGAAGAGATTAAGCATTGACAAGAAGGTTCAATCTCAAGAACCTGTCTCTAAATCAAGTGATTCTTCACCTTCAGATTCATCAGATGAGTACAAGCCCGTTCTCCTAGGGCCAGATTATCAAGCTCAGATACCAGAATGGAATGGTGTGATATCCAACAGCAATTTGAAGTGGTTGGGAACTCAAGAATGGCCTTTAAACTTAAAAAAAGGAAGTAACAGAAATCCCGTTGAAAGGGATCCCATTGGAAAAGGAAGGCGAGATCCTTGTGGATGCTTGGACGCTGGTTCAGTTGGTTGTGTCAAATTTCACGTTGCTGAGAAAAGGCACAGACTAAAGCTAGAGTTGGGCGCTGCGTTTCTCCAATGGAGATTTGATCAGATGGGCGAAGATGTTACATTTGCTTGGAAAGTAGATGATGAGAAAAAGTTCGAGGATATAGTGACGTCGAACCCTCCATCTCTCGGCATATGTTTTTGGGATGAGATTGTTGAGTCATTCCCTTCTAGGAGCAGGGAAGATCTTGTTAGCTACTACTATAATGTCTTTCTTTTACGTCGTAGAGGACACCAAAATCGGGTTACACCGAACAAAATCGATAGTGATGATGAATCAGAATCCGGGATTACAACCAATGGATTCGGACACGAAGTGCATAACTCACCCGGCTCCATTTTCTACTCTCCCAAGAAGCCACGGTAA

Protein sequence

MLSDELMNCVGFDENSISDFSKRIFDMKSDRMKSGVVFPQRSKKLQDIDHCTIKNEDIKVIDPPSVKEANFVLERTQKPMLGLLNWLKDVARNPCDPSICSLPEKSKWKSHGNEEVWKQVLLVREEFFVKRRVDASSEQSFLQRNQRMHPCMYDNDTVPIYNLRKRLSIDKKVQSQEPVSKSSDSSPSDSSDEYKPVLLGPDYQAQIPEWNGVISNSNLKWLGTQEWPLNLKKGSNRNPVERDPIGKGRRDPCGCLDAGSVGCVKFHVAEKRHRLKLELGAAFLQWRFDQMGEDVTFAWKVDDEKKFEDIVTSNPPSLGICFWDEIVESFPSRSREDLVSYYYNVFLLRRRGHQNRVTPNKIDSDDESESGITTNGFGHEVHNSPGSIFYSPKKPR
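
The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS codon by codon. This is a minimal sketch, assuming the standard nuclear genetic code applies to this gene; the helper name `translate` is hypothetical, and the two inputs are just the first and last few codons of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGTTGTCAGATGAGCTCATGAATTGTGTTGGC"))  # MLSDELMNCVG
print(translate("AAGAAGCCACGGTAA"))                    # KKPR
```

The two fragments reproduce the start (MLSDELMNCVG) and end (KKPR) of the protein sequence listed above.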
BLAST of CmaCh08G009030 vs. Swiss-Prot
Match: ARID1_ARATH (AT-rich interactive domain-containing protein 1 OS=Arabidopsis thaliana GN=ARID1 PE=2 SV=1)

HSP 1 Score: 253.1 bits (645), Expect = 5.2e-66
Identity = 131/294 (44.56%), Positives = 187/294 (63.61%), Query Frame = 1

Query: 74  ERTQKPMLGLLNWLKDVARNPCDPSICSLPEKSKWKSHGNEEVWKQVLLVREEFFVKRRV 133
           +R ++  L  L WL DVA++PCDPS+  +P++S+W S+G+EE WKQ+LL R     +   
Sbjct: 244 KRKRECPLETLKWLSDVAKDPCDPSLGIVPDRSEWVSYGSEEPWKQLLLFRAS---RTNN 303

Query: 134 DASSEQSFLQRNQRMHPCMYDNDTVPIYNLRKRLSIDKKVQSQEPVSKSSDSSPSDSSDE 193
           D++ E+++ Q+ Q+MHPC+YD+     YNLR+RLS +   + +      SD   SD  D 
Sbjct: 304 DSACEKTW-QKVQKMHPCLYDDSAGASYNLRERLSYEDYKRGK--TGNGSDIGSSDEEDR 363

Query: 194 YKPVLLGPDYQAQIPEWNGVISNSNLKWLGTQEWPLNLKKGSNRNPVERDPIGKGRRDPC 253
               L+G  +QA++PEW G+   S+ KWLGT+ WPL  ++      +ERD IGKGR+DPC
Sbjct: 364 -PCALVGSKFQAKVPEWTGITPESDSKWLGTRIWPLTKEQTKANLLIERDRIGKGRQDPC 423

Query: 254 GCLDAGSVGCVKFHVAEKRHRLKLELGAAFLQWRFDQMGEDVTFAWKVDDEKKFEDIVTS 313
           GC + GS+ CVKFH+  KR +LKLELG AF  W FD MGE     W   + KK + ++ S
Sbjct: 424 GCHNPGSIECVKFHITAKRDKLKLELGPAFYMWCFDVMGECTLQYWTDLELKKIKSLM-S 483

Query: 314 NPPSLGICFWDEIVESFPSRSREDLVSYYYNVFLLRRRGHQNRVTPNKIDSDDE 368
           +PPSL   F  +     PS+SR  +VSY+YNV LL+ R  Q+R+TP+ IDSD +
Sbjct: 484 SPPSLSPAFIHQAKMILPSKSRGKIVSYFYNVTLLQYRASQSRITPHDIDSDTD 529
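
The percentages in an HSP summary line are the matched-column counts divided by the alignment length. As a sanity check, they can be recomputed from the fractions; the sketch below is only an illustration, and the names `STAT` and `check_hsp_stats` are hypothetical rather than part of any BLAST tool.

```python
import re

# Matches fields such as "Identity = 131/294 (44.56%)".
STAT = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")

def check_hsp_stats(line):
    """Return {label: (recomputed_percent, reported_percent)} for one summary line."""
    results = {}
    for label, num, den, reported in STAT.findall(line):
        recomputed = round(100 * int(num) / int(den), 2)
        results[label] = (recomputed, float(reported))
    return results

line = "Identity = 131/294 (44.56%), Positives = 187/294 (63.61%), Query Frame = 1"
for label, (recomputed, reported) in check_hsp_stats(line).items():
    print(label, recomputed, reported)
```

For the ARID1_ARATH hit above, 131/294 and 187/294 recompute to 44.56% and 63.61%, matching the reported values.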

BLAST of CmaCh08G009030 vs. Swiss-Prot
Match: ARID2_ARATH (AT-rich interactive domain-containing protein 2 OS=Arabidopsis thaliana GN=ARID2 PE=2 SV=1)

HSP 1 Score: 206.8 bits (525), Expect = 4.2e-52
Identity = 121/354 (34.18%), Positives = 181/354 (51.13%), Query Frame = 1

Query: 69  ANFVLERTQKPMLGLLNWLKDVARNPCDPSICSLPEKSKWKSHGNEEVWKQVLLVREEFF 128
           ++F LE+ +  + G+L WL  VA +P DP+I  +P  SKWK +   + W QV   +    
Sbjct: 212 SDFSLEK-RDDLPGMLKWLALVATSPHDPAIGVIPHSSKWKQYNGNKCWLQVARAKNSLL 271

Query: 129 VKR-RVDASSEQSFLQRNQRMH-PCMYDNDTVPIYNLR---------------------- 188
           V+R   +        + +Q +H P MY++D   I  LR                      
Sbjct: 272 VQRDNAELRYRYHPFRGHQNIHHPSMYEDDRKSIGRLRYSIRPPNLSKHCSSSCCNGSSL 331

Query: 189 -----------KRLSIDKKVQSQEPVSKSSDSSPSDSSDEYKPVLLGPDYQAQIPEWNGV 248
                      ++L+I    ++      S     + +    + + +G  +QAQ+ EW   
Sbjct: 332 VSLSKSRSTKCRKLTIIASERAGLTAGTSRARKRNKAEIPRRCIKVGHQHQAQVDEWTES 391

Query: 249 ISNSNLKWLGTQEWPLNLKKGSNRNPVERDPIGKGRRDPCGCLDAGSVGCVKFHVAEKRH 308
             +S+ KWLGT+ WP    +  ++  +  D +GKGR D C C  +G V C + H+AEKR 
Sbjct: 392 GVDSDSKWLGTRIWPPENSEALDQT-LGNDLVGKGRPDSCSCELSGFVECTRLHIAEKRM 451

Query: 309 RLKLELGAAFLQWRFDQMGEDVTFAWKVDDEKKFEDIVTSNPPSLGICFWDEIVESFPSR 368
            LK ELG  F  WRF+QMGE+V   W  ++EK+F+D++ ++P S    FW    ++FP +
Sbjct: 452 ELKRELGDDFFHWRFNQMGEEVCLRWTEEEEKRFKDMIIADPQS----FWTNAAKNFPKK 511

Query: 369 SREDLVSYYYNVFLLRRRGHQNRVTPNKIDSDDESESGITTNGFGHEVHNSPGS 388
            RE+LVSYY+NVFL+ RR +QNRVTP  IDSDDE   G     FG +   S GS
Sbjct: 512 KREELVSYYFNVFLINRRRYQNRVTPKSIDSDDEGAFGSVGGSFGRDAVTSSGS 559

BLAST of CmaCh08G009030 vs. TrEMBL
Match: A0A0A0KD11_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G004620 PE=4 SV=1)

HSP 1 Score: 665.2 bits (1715), Expect = 4.9e-188
Identity = 311/397 (78.34%), Positives = 360/397 (90.68%), Query Frame = 1

Query: 1   MLSDELMNCVGFDENSISDFSKRIFDMKSDRMKSGVVFPQRSKKLQDIDHCTIKNEDIKV 60
           MLSDE MNC+ FD+NSISDFSKRIFDMKSDRMKSG VFP+RSKK +DI+HC  +N D+++
Sbjct: 1   MLSDEFMNCIAFDDNSISDFSKRIFDMKSDRMKSGDVFPRRSKKFKDIEHCNTENGDVQI 60

Query: 61  IDPPSVKEANFVLERT-QKPMLGLLNWLKDVARNPCDPSICSLPEKSKWKSHGNEEVWKQ 120
           IDPP VKE N V E++ Q+PMLGLL+WLK +ARNPCDPSI SLPEKSKWKS+GNEE+WKQ
Sbjct: 61  IDPPFVKETNVVQEKSKQEPMLGLLDWLKGIARNPCDPSISSLPEKSKWKSYGNEEIWKQ 120

Query: 121 VLLVREEFFVKRRVDASSEQSFLQRNQRMHPCMYDNDTVPIYNLRKRLSIDKKVQSQEPV 180
           VL+VREE F+KR+VD+SSEQSF+Q+NQ+MHPCMYD+DT PIYNLRKRLS+DKK  SQEPV
Sbjct: 121 VLVVREEMFLKRQVDSSSEQSFMQKNQKMHPCMYDDDTAPIYNLRKRLSLDKKDLSQEPV 180

Query: 181 SKSSDSSPSDSSDEYKPVLLGPDYQAQIPEWNGVISNSNLKWLGTQEWPLNLKKGSNRNP 240
           SK+SDSSP+DS D+YKPV LG DYQA++PEWNGVIS S+LKWLGTQ+WPL  KKG NR  
Sbjct: 181 SKASDSSPTDSLDDYKPVPLGSDYQARVPEWNGVISKSDLKWLGTQDWPL--KKGRNRYL 240

Query: 241 VERDPIGKGRRDPCGCLDAGSVGCVKFHVAEKRHRLKLELGAAFLQWRFDQMGEDVTFAW 300
           VERDPIG+GRRDPCGC+D  SVGCV+FHV+EKRH+LKLELG AFLQWRFD+MGE+VTFAW
Sbjct: 241 VERDPIGRGRRDPCGCMDPNSVGCVQFHVSEKRHKLKLELGDAFLQWRFDKMGEEVTFAW 300

Query: 301 KVDDEKKFEDIVTSNPPSLGICFWDEIVESFPSRSREDLVSYYYNVFLLRRRGHQNRVTP 360
            VDDEKKFEDIV+SNPPSLGI +W++I+ESFPSRS+ DLV YYYNVFLLRRRGHQNRVTP
Sbjct: 301 TVDDEKKFEDIVSSNPPSLGISYWEDIIESFPSRSKADLVGYYYNVFLLRRRGHQNRVTP 360

Query: 361 NKIDSDDESESGITTNGFGHEVHNSPGSIFYSPKKPR 397
           N+I+SD+ESESG  TNGFG+EVHNS GSIFYSPKKPR
Sbjct: 361 NEINSDEESESGTATNGFGNEVHNSSGSIFYSPKKPR 395

BLAST of CmaCh08G009030 vs. TrEMBL
Match: M5X794_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002218mg PE=4 SV=1)

HSP 1 Score: 381.7 bits (979), Expect = 1.1e-102
Identity = 205/418 (49.04%), Positives = 266/418 (63.64%), Query Frame = 1

Query: 4   DELMNCVGFDENSISDFSKRIFDMKSDRMKSGVVFPQRSKKLQDIDHCTIKNEDIKVIDP 63
           DE +  +      +S F KR    +     S     +R KK  DI      NED  ++DP
Sbjct: 287 DEAVMILDPSPGEVSSFRKR---KRGPSCGSEAGESRRGKKYNDIS-----NEDAAILDP 346

Query: 64  PSVKEANFVLERTQKPMLGLLNWLKDVARNPCDPSICSLPEKSKWKSHGNEEVWKQVLLV 123
            + +EA    +R Q+ + G+LNW++ +A++PCDP++ SLPE+SKWKS GNEE W QVL  
Sbjct: 347 STDEEAISFWKRKQESLCGMLNWVRMIAKDPCDPAVGSLPERSKWKSFGNEENWMQVLWA 406

Query: 124 REEFFVKRRVDASSEQSFLQRNQRMHPCMYDNDTVPIYNLRKRLSIDKKVQS--QEPVSK 183
           RE  FVK+  D+ +EQS  Q+NQRMHP +YD+     YNLR+RL ++KK+ S    P S+
Sbjct: 407 REAIFVKKHADSGAEQSNWQKNQRMHPSLYDDHFSSSYNLRERLRLEKKLLSGGTMPQSR 466

Query: 184 --SSDSSPSDSSD-------------------EYKP--VLLGPDYQAQIPEWNGVISNSN 243
             S  SSPS S D                    Y P  + LG +YQA +PEW G  S S 
Sbjct: 467 TGSESSSPSYSPDMAGMEDQLLGTSDFTSVLDRYPPSHIPLGSNYQAHLPEWTGEASESE 526

Query: 244 LKWLGTQEWPLNLKKGSNRNPVERDPIGKGRRDPCGCLDAGSVGCVKFHVAEKRHRLKLE 303
           LKWLG++ WPL  +K  +R  +ERDPIGKGR++ CGC  +GS+ CV+FH++EKR R+K E
Sbjct: 527 LKWLGSKFWPL--EKPEHRYLIERDPIGKGRQESCGCQVSGSIECVRFHISEKRLRVKRE 586

Query: 304 LGAAFLQWRFDQMGEDVTFAWKVDDEKKFEDIVTSNPPSLGICFWDEIVESFPSRSREDL 363
           LG AF  W F+QMG++V  +W  ++EKKF+DIV SNPPSLGI FWD+I +SFP +SR +L
Sbjct: 587 LGPAFYHWEFNQMGDEVGLSWTAEEEKKFKDIVKSNPPSLGITFWDQIFKSFPKKSRREL 646

Query: 364 VSYYYNVFLLRRRGHQNRVTPNKIDSDDES-ESGITTNGFGHEVHNSPGSIFYSPKKP 396
           VSYY+NVFLL RRG+QNR TPN IDSDDE  ESG  TNGFG E      SI  SP KP
Sbjct: 647 VSYYFNVFLLHRRGYQNRFTPNNIDSDDEGLESGSVTNGFGDEERKPSKSILKSPHKP 694

BLAST of CmaCh08G009030 vs. TrEMBL
Match: A0A061DU38_THECC (ARID/BRIGHT DNA-binding domain,ELM2 domain protein, putative isoform 1 OS=Theobroma cacao GN=TCM_005304 PE=4 SV=1)

HSP 1 Score: 361.3 bits (926), Expect = 1.5e-96
Identity = 202/437 (46.22%), Positives = 268/437 (61.33%), Query Frame = 1

Query: 3   SDELMNCVGFDE--NSISDFSKRIFDMKS-------DRMKSGVVFPQRSKKLQDIDHCTI 62
           SD    C+  DE   S SD +K   +          D +KS ++     +   D   CT 
Sbjct: 226 SDGDKKCMDGDECEESPSDLAKSAVNSSDVEKICNEDEVKSAIM-----EDFVDCKKCTD 285

Query: 63  KNED--IKVIDPPSVKEANFVLERTQKPMLGLLNWLKDVARNPCDPSICSLPEKSKWKSH 122
            ++D  + ++D    KE     +R ++ M G+LNW+ ++A++PCDP I SLPE+SKWKS+
Sbjct: 286 SDDDDNVVILDSNDTKEKFSSHKRKRESMWGMLNWITEIAKDPCDPVIGSLPERSKWKSY 345

Query: 123 GNEEVWKQVLLVREEFFVKRRVDASSEQSFLQRNQRMHPCMYDNDTVPIYNLRKRLSIDK 182
           GNEE+WKQVLL RE  F K+   +  +QS  Q+NQ+MHPC+YD+ T   YNLR+RLS  K
Sbjct: 346 GNEELWKQVLLFREAAFHKKDDHSGVDQSSWQKNQKMHPCLYDDPTRFGYNLRERLSCPK 405

Query: 183 KVQSQEPVSKSSDSSPSDSS--------------------------------DEYKPVLL 242
           K+   + VSK  + S S SS                                D    V +
Sbjct: 406 KLLLGKMVSKGKNYSQSSSSGNHSDLDNSMVGIDKQSHGTYDSATPGSVFDYDNDMQVPI 465

Query: 243 GPDYQAQIPEWNGVISNSNLKWLGTQEWPLNLKKGSNRNPVERDPIGKGRRDPCGCLDAG 302
           GP +Q ++P+W G+ S S+ KWLGT+ WP  L+K   R  +ERD IGKGR+D CGC   G
Sbjct: 466 GPYFQVEVPDWTGLASESDPKWLGTRVWP--LEKKEKRFLIERDHIGKGRQDSCGCHIQG 525

Query: 303 SVGCVKFHVAEKRHRLKLELGAAFLQWRFDQMGEDVTFAWKVDDEKKFEDIVTSNPPSLG 362
           S+ CVKFHVAEKR ++KLELG+AF QW+FD+MGE+V F+WK ++++KF  IV SNPP L 
Sbjct: 526 SIQCVKFHVAEKRLKVKLELGSAFNQWKFDKMGEEVAFSWKEEEQRKFSSIVKSNPPLLD 585

Query: 363 ICFWDEIVESFPS-RSREDLVSYYYNVFLLRRRGHQNRVTPNKIDSDD-ESESGITTNGF 395
            CFWDEI + F S +SRE+LV YYYNVFLL+RR +QNR+TPN I+SDD ESE+    NGF
Sbjct: 586 KCFWDEIYKYFRSKKSREELVCYYYNVFLLQRRAYQNRITPNNINSDDEESEAESGANGF 645

BLAST of CmaCh08G009030 vs. TrEMBL
Match: A0A061DSZ0_THECC (ARID/BRIGHT DNA-binding domain,ELM2 domain protein, putative isoform 3 OS=Theobroma cacao GN=TCM_005304 PE=4 SV=1)

HSP 1 Score: 361.3 bits (926), Expect = 1.5e-96
Identity = 202/437 (46.22%), Positives = 268/437 (61.33%), Query Frame = 1

Query: 3   SDELMNCVGFDE--NSISDFSKRIFDMKS-------DRMKSGVVFPQRSKKLQDIDHCTI 62
           SD    C+  DE   S SD +K   +          D +KS ++     +   D   CT 
Sbjct: 231 SDGDKKCMDGDECEESPSDLAKSAVNSSDVEKICNEDEVKSAIM-----EDFVDCKKCTD 290

Query: 63  KNED--IKVIDPPSVKEANFVLERTQKPMLGLLNWLKDVARNPCDPSICSLPEKSKWKSH 122
            ++D  + ++D    KE     +R ++ M G+LNW+ ++A++PCDP I SLPE+SKWKS+
Sbjct: 291 SDDDDNVVILDSNDTKEKFSSHKRKRESMWGMLNWITEIAKDPCDPVIGSLPERSKWKSY 350

Query: 123 GNEEVWKQVLLVREEFFVKRRVDASSEQSFLQRNQRMHPCMYDNDTVPIYNLRKRLSIDK 182
           GNEE+WKQVLL RE  F K+   +  +QS  Q+NQ+MHPC+YD+ T   YNLR+RLS  K
Sbjct: 351 GNEELWKQVLLFREAAFHKKDDHSGVDQSSWQKNQKMHPCLYDDPTRFGYNLRERLSCPK 410

Query: 183 KVQSQEPVSKSSDSSPSDSS--------------------------------DEYKPVLL 242
           K+   + VSK  + S S SS                                D    V +
Sbjct: 411 KLLLGKMVSKGKNYSQSSSSGNHSDLDNSMVGIDKQSHGTYDSATPGSVFDYDNDMQVPI 470

Query: 243 GPDYQAQIPEWNGVISNSNLKWLGTQEWPLNLKKGSNRNPVERDPIGKGRRDPCGCLDAG 302
           GP +Q ++P+W G+ S S+ KWLGT+ WP  L+K   R  +ERD IGKGR+D CGC   G
Sbjct: 471 GPYFQVEVPDWTGLASESDPKWLGTRVWP--LEKKEKRFLIERDHIGKGRQDSCGCHIQG 530

Query: 303 SVGCVKFHVAEKRHRLKLELGAAFLQWRFDQMGEDVTFAWKVDDEKKFEDIVTSNPPSLG 362
           S+ CVKFHVAEKR ++KLELG+AF QW+FD+MGE+V F+WK ++++KF  IV SNPP L 
Sbjct: 531 SIQCVKFHVAEKRLKVKLELGSAFNQWKFDKMGEEVAFSWKEEEQRKFSSIVKSNPPLLD 590

Query: 363 ICFWDEIVESFPS-RSREDLVSYYYNVFLLRRRGHQNRVTPNKIDSDD-ESESGITTNGF 395
            CFWDEI + F S +SRE+LV YYYNVFLL+RR +QNR+TPN I+SDD ESE+    NGF
Sbjct: 591 KCFWDEIYKYFRSKKSREELVCYYYNVFLLQRRAYQNRITPNNINSDDEESEAESGANGF 650

BLAST of CmaCh08G009030 vs. TrEMBL
Match: A0A067K8K6_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_14376 PE=4 SV=1)

HSP 1 Score: 352.8 bits (904), Expect = 5.3e-94
Identity = 198/423 (46.81%), Positives = 257/423 (60.76%), Query Frame = 1

Query: 3   SDELMNCVGFDENSISDF-SKRIFDMKSDRMKSGVVFPQRSKKLQDIDHCTIKNEDIKVI 62
           SD    C    EN  S+  S   FD   D +KS VV  +  KK ++ D       D+ V+
Sbjct: 236 SDGDKKCKDDQENLRSNLTSLNAFD--EDEVKSMVVEIEDDKKCENGDE-----NDVIVL 295

Query: 63  DPPSVKEANFVLERTQKPMLGLLNWLKDVARNPCDPSICSLPEKSKWKSHGNEEVWKQVL 122
           D  +VK++   L+R +K +  +LNW+  VARNPCDP + SLPE SKW S+GNEE+WKQVL
Sbjct: 296 DSDAVKDSFSCLKRKRKSVCRVLNWITGVARNPCDPLVDSLPESSKWGSYGNEELWKQVL 355

Query: 123 LVREEFFVKRRVDASSEQSFLQRNQRMHPCMYDNDTVPIYNLRKRLSIDKKV-------- 182
           L RE  F+K  VD+ +E    Q+N++MHPCMYD+     YN R+RL   KK+        
Sbjct: 356 LAREALFLKGNVDSGAE----QKNRKMHPCMYDDPVGSAYNFRERLKCTKKLLHGKTASQ 415

Query: 183 -QSQEPVSKSSDSSPSD----------SSDEY--------KPVLLGPDYQAQIPEWNGVI 242
            ++   +S S+  + SD          SS +Y        KP+ LGPD+QA++PEW G++
Sbjct: 416 GEACSQLSSSTAETESDSCTKKICDGHSSTKYSVIDIPVEKPIPLGPDFQAEVPEWTGMV 475

Query: 243 SNSNLKWLGTQEWPLNLKKGSNRNPVERDPIGKGRRDPCGCLDAGSVGCVKFHVAEKRHR 302
           S S+ KW+GT+ WP   +K  NR  +ER+PIGKGR+D CGC    S  CV+FH  EKR R
Sbjct: 476 SQSDSKWVGTRVWP--PEKIDNRLVIEREPIGKGRQDSCGCEVPKSTECVRFHSTEKRMR 535

Query: 303 LKLELGAAFLQWRFDQMGEDVTFAWKVDDEKKFEDIVTSNPPSLGICFWDEIVESFPSRS 362
            + ELG AFL W+FD+MGEDV  +W  ++EKKF+ IV  NPPSL  CFWDE+ + FP+R 
Sbjct: 536 TRRELGIAFLHWKFDKMGEDVKLSWTEEEEKKFKAIVRLNPPSLDKCFWDEMFKFFPTRR 595

Query: 363 REDLVSYYYNVFLLRRRGHQNRVTPNKIDS-DDESESGITTNGFGHEVHNSPGSIFYSPK 397
           REDL           RR HQNR TPN IDS DDESE G+  N  GHE   SPGS+ YS K
Sbjct: 596 REDL-----------RRAHQNRFTPNNIDSDDDESECGLIANSSGHEAPKSPGSLLYSAK 634

BLAST of CmaCh08G009030 vs. TAIR10
Match: AT2G46040.1 (AT2G46040.1 ARID/BRIGHT DNA-binding domain;ELM2 domain protein)

HSP 1 Score: 253.1 bits (645), Expect = 2.9e-67
Identity = 131/294 (44.56%), Positives = 187/294 (63.61%), Query Frame = 1

Query: 74  ERTQKPMLGLLNWLKDVARNPCDPSICSLPEKSKWKSHGNEEVWKQVLLVREEFFVKRRV 133
           +R ++  L  L WL DVA++PCDPS+  +P++S+W S+G+EE WKQ+LL R     +   
Sbjct: 244 KRKRECPLETLKWLSDVAKDPCDPSLGIVPDRSEWVSYGSEEPWKQLLLFRAS---RTNN 303

Query: 134 DASSEQSFLQRNQRMHPCMYDNDTVPIYNLRKRLSIDKKVQSQEPVSKSSDSSPSDSSDE 193
           D++ E+++ Q+ Q+MHPC+YD+     YNLR+RLS +   + +      SD   SD  D 
Sbjct: 304 DSACEKTW-QKVQKMHPCLYDDSAGASYNLRERLSYEDYKRGK--TGNGSDIGSSDEEDR 363

Query: 194 YKPVLLGPDYQAQIPEWNGVISNSNLKWLGTQEWPLNLKKGSNRNPVERDPIGKGRRDPC 253
               L+G  +QA++PEW G+   S+ KWLGT+ WPL  ++      +ERD IGKGR+DPC
Sbjct: 364 -PCALVGSKFQAKVPEWTGITPESDSKWLGTRIWPLTKEQTKANLLIERDRIGKGRQDPC 423

Query: 254 GCLDAGSVGCVKFHVAEKRHRLKLELGAAFLQWRFDQMGEDVTFAWKVDDEKKFEDIVTS 313
           GC + GS+ CVKFH+  KR +LKLELG AF  W FD MGE     W   + KK + ++ S
Sbjct: 424 GCHNPGSIECVKFHITAKRDKLKLELGPAFYMWCFDVMGECTLQYWTDLELKKIKSLM-S 483

Query: 314 NPPSLGICFWDEIVESFPSRSREDLVSYYYNVFLLRRRGHQNRVTPNKIDSDDE 368
           +PPSL   F  +     PS+SR  +VSY+YNV LL+ R  Q+R+TP+ IDSD +
Sbjct: 484 SPPSLSPAFIHQAKMILPSKSRGKIVSYFYNVTLLQYRASQSRITPHDIDSDTD 529

BLAST of CmaCh08G009030 vs. TAIR10
Match: AT4G11400.1 (AT4G11400.1 ARID/BRIGHT DNA-binding domain;ELM2 domain protein)

HSP 1 Score: 206.8 bits (525), Expect = 2.4e-53
Identity = 121/354 (34.18%), Positives = 181/354 (51.13%), Query Frame = 1

Query: 69  ANFVLERTQKPMLGLLNWLKDVARNPCDPSICSLPEKSKWKSHGNEEVWKQVLLVREEFF 128
           ++F LE+ +  + G+L WL  VA +P DP+I  +P  SKWK +   + W QV   +    
Sbjct: 212 SDFSLEK-RDDLPGMLKWLALVATSPHDPAIGVIPHSSKWKQYNGNKCWLQVARAKNSLL 271

Query: 129 VKR-RVDASSEQSFLQRNQRMH-PCMYDNDTVPIYNLR---------------------- 188
           V+R   +        + +Q +H P MY++D   I  LR                      
Sbjct: 272 VQRDNAELRYRYHPFRGHQNIHHPSMYEDDRKSIGRLRYSIRPPNLSKHCSSSCCNGSSL 331

Query: 189 -----------KRLSIDKKVQSQEPVSKSSDSSPSDSSDEYKPVLLGPDYQAQIPEWNGV 248
                      ++L+I    ++      S     + +    + + +G  +QAQ+ EW   
Sbjct: 332 VSLSKSRSTKCRKLTIIASERAGLTAGTSRARKRNKAEIPRRCIKVGHQHQAQVDEWTES 391

Query: 249 ISNSNLKWLGTQEWPLNLKKGSNRNPVERDPIGKGRRDPCGCLDAGSVGCVKFHVAEKRH 308
             +S+ KWLGT+ WP    +  ++  +  D +GKGR D C C  +G V C + H+AEKR 
Sbjct: 392 GVDSDSKWLGTRIWPPENSEALDQT-LGNDLVGKGRPDSCSCELSGFVECTRLHIAEKRM 451

Query: 309 RLKLELGAAFLQWRFDQMGEDVTFAWKVDDEKKFEDIVTSNPPSLGICFWDEIVESFPSR 368
            LK ELG  F  WRF+QMGE+V   W  ++EK+F+D++ ++P S    FW    ++FP +
Sbjct: 452 ELKRELGDDFFHWRFNQMGEEVCLRWTEEEEKRFKDMIIADPQS----FWTNAAKNFPKK 511

Query: 369 SREDLVSYYYNVFLLRRRGHQNRVTPNKIDSDDESESGITTNGFGHEVHNSPGS 388
            RE+LVSYY+NVFL+ RR +QNRVTP  IDSDDE   G     FG +   S GS
Sbjct: 512 KREELVSYYFNVFLINRRRYQNRVTPKSIDSDDEGAFGSVGGSFGRDAVTSSGS 559

BLAST of CmaCh08G009030 vs. TAIR10
Match: AT5G04110.1 (AT5G04110.1 DNA GYRASE B3)

HSP 1 Score: 139.0 bits (349), Expect = 6.1e-33
Identity = 77/201 (38.31%), Positives = 112/201 (55.72%), Query Frame = 1

Query: 180 SKSSDSSPSDSSDEYKPVL-LGPDYQAQIPEW---------NGVISNSN-LKWLGTQEWP 239
           +K+S    +  S++ +P + +GP +QA+IP W          G   +SN L+WLGT  WP
Sbjct: 343 NKTSKDVITHGSNKTRPAIPIGPRFQAEIPVWIAPTKKGKFYGSPGDSNTLRWLGTGVWP 402

Query: 240 L-NLKKGSNRNPVERDPIGKGRRDPCGCLDAGSVGCVKFHVAEKRHRLKLELGAAFLQWR 299
             +LKK      V    +G+GR D C C    S  C+K H  E +  L+ E+  AF  W 
Sbjct: 403 TYSLKK-----TVHSKKVGEGRSDSCSCASPRSTNCIKRHKKEAQELLEKEINRAFSTWE 462

Query: 300 FDQMGEDVTF-AWKVDDEKKFEDIVTSNPPSLGICFWDEIVESFPSRSREDLVSYYYNVF 359
           FDQMGE++   +W   +E++FE +V  NP S    FW+    +FP +S++DL+SYYYNVF
Sbjct: 463 FDQMGEEIVLKSWTAKEERRFEALVKKNPLSSSDGFWEFASNAFPQKSKKDLLSYYYNVF 522

Query: 360 LLRRRGHQNRVTPNKIDSDDE 368
           L++R         N IDSDD+
Sbjct: 523 LIKRMRLLKSSAANNIDSDDD 538

BLAST of CmaCh08G009030 vs. TAIR10
Match: AT2G03470.1 (AT2G03470.1 ELM2 domain-containing protein)

HSP 1 Score: 119.8 bits (299), Expect = 3.8e-27
Identity = 76/214 (35.51%), Positives = 112/214 (52.34%), Query Frame = 1

Query: 175 SQEPVSKSSDSSPSDSSDEY------------------KPVLLGPDYQAQIPEW--NGVI 234
           SQ  V+  SD S   S  ++                  K VL+G ++QA IPE+    ++
Sbjct: 83  SQSGVTTQSDLSHQSSGSDFTWKPVEDVYTCLMNQPPRKQVLVGSNHQADIPEFVKEEIL 142

Query: 235 SNSNLKWLGTQEWPLNLKKGSNRNPVERDPIGKGRRDPCGCLDAGSVGCVKFHVAEKRHR 294
             S  +     E  L  K     +  +    G+GR++ C CLD GS+ CV+ H+ E R  
Sbjct: 143 DQSEARTKEDLEGKLMRKCVIPMSDSDLCGTGQGRKE-CLCLDKGSIRCVRRHIIEARES 202

Query: 295 LKLELG-AAFLQWRFDQMGEDVTFAWKVDDEKKFEDIVTSNPPSLGICFWDEIVESFPSR 354
           L   +G   F++    +MGE+V   W  ++E  F  +V SNP S G  FW ++  +FPSR
Sbjct: 203 LVETIGYERFMELGLCEMGEEVASLWTEEEEDLFHKVVYSNPFSAGRDFWKQLKGTFPSR 262

Query: 355 SREDLVSYYYNVFLLRRRGHQNRVTPNKIDSDDE 368
           + ++LVSYY+NVF+LRRRG QNR     +DSDD+
Sbjct: 263 TMKELVSYYFNVFILRRRGIQNRFKALDVDSDDD 295

BLAST of CmaCh08G009030 vs. TAIR10
Match: AT1G13880.1 (AT1G13880.1 ELM2 domain-containing protein)

HSP 1 Score: 113.6 bits (283), Expect = 2.8e-25
Identity = 71/177 (40.11%), Positives = 98/177 (55.37%), Query Frame = 1

Query: 195 KPVLLGPDYQAQIPEWNGVISNSNL-KWLGTQEWPLNLKKGSNRNPVERD--PIGKGRRD 254
           K V +G DYQA IPE     +N    + +G  E  +  K        E +   IGKGR++
Sbjct: 126 KTVPIGSDYQADIPECVKEEANDQSGQGVGYDEEQVTGKCVIPMPDCETEVCKIGKGRKE 185

Query: 255 PCGCLDAGSVGCVKFHVAEKRHRLKLELGA-AFLQWRFDQMGEDVTFAWKVDDEKKFEDI 314
            C CLD GS+ CV+ H+ E R  L   +G    L     +MGE+V      D+E  F +I
Sbjct: 186 -CICLDKGSIRCVQQHIMENREDLFATIGYDRCLDIGLCEMGEEVAARLTEDEEDLFHEI 245

Query: 315 VTSNPPSLGICFWDEIVESFPSRSREDLVSYYYNVFLLRRRGHQNRVTPNKIDSDDE 368
           V SNP S+   FW  +  +FPSR+ +++VSYY+NVF+LRRR  QNR     +DSDD+
Sbjct: 246 VYSNPVSMDRDFWKHLKSAFPSRTMKEIVSYYFNVFILRRRAIQNRSKSLDVDSDDD 301

BLAST of CmaCh08G009030 vs. NCBI nr
Match: gi|449441632|ref|XP_004138586.1| (PREDICTED: AT-rich interactive domain-containing protein 1-like [Cucumis sativus])

HSP 1 Score: 665.2 bits (1715), Expect = 7.0e-188
Identity = 311/397 (78.34%), Positives = 360/397 (90.68%), Query Frame = 1

Query: 1   MLSDELMNCVGFDENSISDFSKRIFDMKSDRMKSGVVFPQRSKKLQDIDHCTIKNEDIKV 60
           MLSDE MNC+ FD+NSISDFSKRIFDMKSDRMKSG VFP+RSKK +DI+HC  +N D+++
Sbjct: 1   MLSDEFMNCIAFDDNSISDFSKRIFDMKSDRMKSGDVFPRRSKKFKDIEHCNTENGDVQI 60

Query: 61  IDPPSVKEANFVLERT-QKPMLGLLNWLKDVARNPCDPSICSLPEKSKWKSHGNEEVWKQ 120
           IDPP VKE N V E++ Q+PMLGLL+WLK +ARNPCDPSI SLPEKSKWKS+GNEE+WKQ
Sbjct: 61  IDPPFVKETNVVQEKSKQEPMLGLLDWLKGIARNPCDPSISSLPEKSKWKSYGNEEIWKQ 120

Query: 121 VLLVREEFFVKRRVDASSEQSFLQRNQRMHPCMYDNDTVPIYNLRKRLSIDKKVQSQEPV 180
           VL+VREE F+KR+VD+SSEQSF+Q+NQ+MHPCMYD+DT PIYNLRKRLS+DKK  SQEPV
Sbjct: 121 VLVVREEMFLKRQVDSSSEQSFMQKNQKMHPCMYDDDTAPIYNLRKRLSLDKKDLSQEPV 180

Query: 181 SKSSDSSPSDSSDEYKPVLLGPDYQAQIPEWNGVISNSNLKWLGTQEWPLNLKKGSNRNP 240
           SK+SDSSP+DS D+YKPV LG DYQA++PEWNGVIS S+LKWLGTQ+WPL  KKG NR  
Sbjct: 181 SKASDSSPTDSLDDYKPVPLGSDYQARVPEWNGVISKSDLKWLGTQDWPL--KKGRNRYL 240

Query: 241 VERDPIGKGRRDPCGCLDAGSVGCVKFHVAEKRHRLKLELGAAFLQWRFDQMGEDVTFAW 300
           VERDPIG+GRRDPCGC+D  SVGCV+FHV+EKRH+LKLELG AFLQWRFD+MGE+VTFAW
Sbjct: 241 VERDPIGRGRRDPCGCMDPNSVGCVQFHVSEKRHKLKLELGDAFLQWRFDKMGEEVTFAW 300

Query: 301 KVDDEKKFEDIVTSNPPSLGICFWDEIVESFPSRSREDLVSYYYNVFLLRRRGHQNRVTP 360
            VDDEKKFEDIV+SNPPSLGI +W++I+ESFPSRS+ DLV YYYNVFLLRRRGHQNRVTP
Sbjct: 301 TVDDEKKFEDIVSSNPPSLGISYWEDIIESFPSRSKADLVGYYYNVFLLRRRGHQNRVTP 360

Query: 361 NKIDSDDESESGITTNGFGHEVHNSPGSIFYSPKKPR 397
           N+I+SD+ESESG  TNGFG+EVHNS GSIFYSPKKPR
Sbjct: 361 NEINSDEESESGTATNGFGNEVHNSSGSIFYSPKKPR 395

BLAST of CmaCh08G009030 vs. NCBI nr
Match: gi|659116683|ref|XP_008458203.1| (PREDICTED: AT-rich interactive domain-containing protein 1-like isoform X1 [Cucumis melo])

HSP 1 Score: 656.4 bits (1692), Expect = 3.2e-185
Identity = 313/397 (78.84%), Positives = 358/397 (90.18%), Query Frame = 1

Query: 1   MLSDELMNCVGFDENSISDFSKRIFDMKSDRMKSGVVFPQRSKKLQDIDHCTIKNEDIKV 60
           MLSDE MNCV FDENSISDFSKRI DMKSDRMK+G  FP+RSKKL+DI H  I+NED+++
Sbjct: 1   MLSDEFMNCVAFDENSISDFSKRILDMKSDRMKTGDAFPRRSKKLKDIAHRNIENEDVQI 60

Query: 61  IDPPSVKEANFVLERT-QKPMLGLLNWLKDVARNPCDPSICSLPEKSKWKSHGNEEVWKQ 120
           IDPP +KE N V E++ Q+PMLGLL+WLK +ARNPCD SI SLPEKSKWKS+GNEE+WKQ
Sbjct: 61  IDPPLIKETNVVQEKSKQEPMLGLLDWLKVIARNPCDSSISSLPEKSKWKSYGNEEIWKQ 120

Query: 121 VLLVREEFFVKRRVDASSEQSFLQRNQRMHPCMYDNDTVPIYNLRKRLSIDKKVQSQEPV 180
           VL+VREE FVKR+VD+SSEQS +QRNQRMHPCMYD+DTVPIYNLRKRLS+DKK  SQEPV
Sbjct: 121 VLVVREEMFVKRQVDSSSEQSSVQRNQRMHPCMYDDDTVPIYNLRKRLSLDKKDVSQEPV 180

Query: 181 SKSSDSSPSDSSDEYKPVLLGPDYQAQIPEWNGVISNSNLKWLGTQEWPLNLKKGSNRNP 240
           SK++DSSP+DS D+YKPVLLG DYQA++PEWNGVIS S+LKWLGTQ+WPL  KKG NR  
Sbjct: 181 SKANDSSPTDSLDDYKPVLLGSDYQARVPEWNGVISESDLKWLGTQDWPL--KKGRNRYL 240

Query: 241 VERDPIGKGRRDPCGCLDAGSVGCVKFHVAEKRHRLKLELGAAFLQWRFDQMGEDVTFAW 300
           VERDPIGKGRRDPCGC+DA SVGCV+FHV+EKR RLKLELG AFL+WRFD+MGEDVT AW
Sbjct: 241 VERDPIGKGRRDPCGCMDANSVGCVQFHVSEKRQRLKLELGDAFLRWRFDEMGEDVTLAW 300

Query: 301 KVDDEKKFEDIVTSNPPSLGICFWDEIVESFPSRSREDLVSYYYNVFLLRRRGHQNRVTP 360
            V+DEKKFEDIV+SNPPSLGI +W++I+ESFPSRS+ DLVSYYYNVFLLRRRGHQNRVTP
Sbjct: 301 TVEDEKKFEDIVSSNPPSLGISYWEDIIESFPSRSKADLVSYYYNVFLLRRRGHQNRVTP 360

Query: 361 NKIDSDDESESGITTNGFGHEVHNSPGSIFYSPKKPR 397
           ++IDSD+ESESG  T  FG+EVHNSPGSIFYSPKKPR
Sbjct: 361 DEIDSDEESESGTATIRFGNEVHNSPGSIFYSPKKPR 395

BLAST of CmaCh08G009030 vs. NCBI nr
Match: gi|659116689|ref|XP_008458206.1| (PREDICTED: AT-rich interactive domain-containing protein 1-like isoform X2 [Cucumis melo])

HSP 1 Score: 610.9 bits (1574), Expect = 1.6e-171
Identity = 290/371 (78.17%), Positives = 335/371 (90.30%), Query Frame = 1

Query: 27  MKSDRMKSGVVFPQRSKKLQDIDHCTIKNEDIKVIDPPSVKEANFVLERT-QKPMLGLLN 86
           MKSDRMK+G  FP+RSKKL+DI H  I+NED+++IDPP +KE N V E++ Q+PMLGLL+
Sbjct: 1   MKSDRMKTGDAFPRRSKKLKDIAHRNIENEDVQIIDPPLIKETNVVQEKSKQEPMLGLLD 60

Query: 87  WLKDVARNPCDPSICSLPEKSKWKSHGNEEVWKQVLLVREEFFVKRRVDASSEQSFLQRN 146
           WLK +ARNPCD SI SLPEKSKWKS+GNEE+WKQVL+VREE FVKR+VD+SSEQS +QRN
Sbjct: 61  WLKVIARNPCDSSISSLPEKSKWKSYGNEEIWKQVLVVREEMFVKRQVDSSSEQSSVQRN 120

Query: 147 QRMHPCMYDNDTVPIYNLRKRLSIDKKVQSQEPVSKSSDSSPSDSSDEYKPVLLGPDYQA 206
           QRMHPCMYD+DTVPIYNLRKRLS+DKK  SQEPVSK++DSSP+DS D+YKPVLLG DYQA
Sbjct: 121 QRMHPCMYDDDTVPIYNLRKRLSLDKKDVSQEPVSKANDSSPTDSLDDYKPVLLGSDYQA 180

Query: 207 QIPEWNGVISNSNLKWLGTQEWPLNLKKGSNRNPVERDPIGKGRRDPCGCLDAGSVGCVK 266
           ++PEWNGVIS S+LKWLGTQ+WPL  KKG NR  VERDPIGKGRRDPCGC+DA SVGCV+
Sbjct: 181 RVPEWNGVISESDLKWLGTQDWPL--KKGRNRYLVERDPIGKGRRDPCGCMDANSVGCVQ 240

Query: 267 FHVAEKRHRLKLELGAAFLQWRFDQMGEDVTFAWKVDDEKKFEDIVTSNPPSLGICFWDE 326
           FHV+EKR RLKLELG AFL+WRFD+MGEDVT AW V+DEKKFEDIV+SNPPSLGI +W++
Sbjct: 241 FHVSEKRQRLKLELGDAFLRWRFDEMGEDVTLAWTVEDEKKFEDIVSSNPPSLGISYWED 300

Query: 327 IVESFPSRSREDLVSYYYNVFLLRRRGHQNRVTPNKIDSDDESESGITTNGFGHEVHNSP 386
           I+ESFPSRS+ DLVSYYYNVFLLRRRGHQNRVTP++IDSD+ESESG  T  FG+EVHNSP
Sbjct: 301 IIESFPSRSKADLVSYYYNVFLLRRRGHQNRVTPDEIDSDEESESGTATIRFGNEVHNSP 360

Query: 387 GSIFYSPKKPR 397
           GSIFYSPKKPR
Sbjct: 361 GSIFYSPKKPR 369

BLAST of CmaCh08G009030 vs. NCBI nr
Match: gi|645254404|ref|XP_008233024.1| (PREDICTED: AT-rich interactive domain-containing protein 1 isoform X1 [Prunus mume])

HSP 1 Score: 389.4 bits (999), Expect = 7.4e-105
Identity = 209/418 (50.00%), Positives = 269/418 (64.35%), Query Frame = 1

Query: 4   DELMNCVGFDENSISDFSKRIFDMKSDRMKSGVVFPQRSKKLQDIDHCTIKNEDIKVIDP 63
           DE +  +      +S F KR    +     S V   +R KK  DI      NED  ++DP
Sbjct: 306 DEAVIFLDPSPGEVSSFRKR---KRGPSCGSEVGESRRGKKYNDIS-----NEDAVILDP 365

Query: 64  PSVKEANFVLERTQKPMLGLLNWLKDVARNPCDPSICSLPEKSKWKSHGNEEVWKQVLLV 123
            + +EAN   +R Q+ + G+LNW++ +A++PCDP++ SLPE+SKWKS GNEE W QVL  
Sbjct: 366 STDEEANSFWKRKQESLCGMLNWVRVIAKDPCDPAVGSLPERSKWKSFGNEENWMQVLCA 425

Query: 124 REEFFVKRRVDASSEQSFLQRNQRMHPCMYDNDTVPIYNLRKRLSIDKKVQS--QEPVSK 183
           RE  FVK+  D+ +EQS  Q+NQRMHP +YD+     YNLR+RL ++KK+ S    P S+
Sbjct: 426 REAIFVKKHADSGAEQSNWQKNQRMHPSLYDDHFSSSYNLRERLRLEKKLLSGGTMPQSR 485

Query: 184 --SSDSSPSDSSD-------------------EYKP--VLLGPDYQAQIPEWNGVISNSN 243
             S  SSPS S D                    Y P  + LG +YQA +PEW G  S S 
Sbjct: 486 TGSESSSPSYSPDMAGMEDQLLGTSDFTGVLDRYPPSHIPLGSNYQAHLPEWTGEASESE 545

Query: 244 LKWLGTQEWPLNLKKGSNRNPVERDPIGKGRRDPCGCLDAGSVGCVKFHVAEKRHRLKLE 303
           LKWLG++ WPL  +K  +R  +ERDPIGKGR++ CGC  +GS+ CV+FH++EKR R+KLE
Sbjct: 546 LKWLGSRFWPL--EKPEHRYLIERDPIGKGRQESCGCQVSGSIECVRFHISEKRLRVKLE 605

Query: 304 LGAAFLQWRFDQMGEDVTFAWKVDDEKKFEDIVTSNPPSLGICFWDEIVESFPSRSREDL 363
           LG AF  W F+QMGE+V  +W  ++EKKF+DIV SNPPSLGI FWD+I +SFP +SR +L
Sbjct: 606 LGPAFYHWEFNQMGEEVGLSWTAEEEKKFKDIVKSNPPSLGITFWDQIFKSFPKKSRREL 665

Query: 364 VSYYYNVFLLRRRGHQNRVTPNKIDSDDES-ESGITTNGFGHEVHNSPGSIFYSPKKP 396
           VSYY+NVFLL RRG+QNR TPN IDSDDE  ESG  TNGFG E      SI  SP KP
Sbjct: 666 VSYYFNVFLLHRRGYQNRFTPNNIDSDDEGLESGSVTNGFGDEERKPSKSILKSPHKP 713
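
The identity and positives percentages reported for the HSP above follow directly from the match counts over the alignment length (418 columns, gaps included). A minimal Python sketch for recomputing them is given here; the function name `parse_fractions` and the hard-coded statistics string are illustrative assumptions, not part of the database output.

```python
import re

# Statistics line of the HSP, copied from the alignment header above.
HSP_STATS = "Identity = 209/418 (50.00%), Positives = 269/418 (64.35%), Query Frame = 1"

def parse_fractions(line):
    """Return {label: (count, length, percent)} for each 'Label = a/b' pair."""
    result = {}
    for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        count, length = int(num), int(den)
        # Percentage rounded to two decimals, as in the BLAST report.
        result[label] = (count, length, round(100 * count / length, 2))
    return result

if __name__ == "__main__":
    for label, (count, length, pct) in parse_fractions(HSP_STATS).items():
        print(f"{label}: {count}/{length} = {pct:.2f}%")
```

The regular expression only picks up fields written as a fraction, so the `Query Frame = 1` field is ignored.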

BLAST of CmaCh08G009030 vs. NCBI nr
Match: gi|645254407|ref|XP_008233025.1| (PREDICTED: AT-rich interactive domain-containing protein 1 isoform X2 [Prunus mume])

HSP 1 Score: 389.4 bits (999), Expect = 7.4e-105
Identity = 209/418 (50.00%), Positives = 269/418 (64.35%), Query Frame = 1

Query: 4   DELMNCVGFDENSISDFSKRIFDMKSDRMKSGVVFPQRSKKLQDIDHCTIKNEDIKVIDP 63
           DE +  +      +S F KR    +     S V   +R KK  DI      NED  ++DP
Sbjct: 282 DEAVIFLDPSPGEVSSFRKR---KRGPSCGSEVGESRRGKKYNDIS-----NEDAVILDP 341

Query: 64  PSVKEANFVLERTQKPMLGLLNWLKDVARNPCDPSICSLPEKSKWKSHGNEEVWKQVLLV 123
            + +EAN   +R Q+ + G+LNW++ +A++PCDP++ SLPE+SKWKS GNEE W QVL  
Sbjct: 342 STDEEANSFWKRKQESLCGMLNWVRVIAKDPCDPAVGSLPERSKWKSFGNEENWMQVLCA 401

Query: 124 REEFFVKRRVDASSEQSFLQRNQRMHPCMYDNDTVPIYNLRKRLSIDKKVQS--QEPVSK 183
           RE  FVK+  D+ +EQS  Q+NQRMHP +YD+     YNLR+RL ++KK+ S    P S+
Sbjct: 402 REAIFVKKHADSGAEQSNWQKNQRMHPSLYDDHFSSSYNLRERLRLEKKLLSGGTMPQSR 461

Query: 184 --SSDSSPSDSSD-------------------EYKP--VLLGPDYQAQIPEWNGVISNSN 243
             S  SSPS S D                    Y P  + LG +YQA +PEW G  S S 
Sbjct: 462 TGSESSSPSYSPDMAGMEDQLLGTSDFTGVLDRYPPSHIPLGSNYQAHLPEWTGEASESE 521

Query: 244 LKWLGTQEWPLNLKKGSNRNPVERDPIGKGRRDPCGCLDAGSVGCVKFHVAEKRHRLKLE 303
           LKWLG++ WPL  +K  +R  +ERDPIGKGR++ CGC  +GS+ CV+FH++EKR R+KLE
Sbjct: 522 LKWLGSRFWPL--EKPEHRYLIERDPIGKGRQESCGCQVSGSIECVRFHISEKRLRVKLE 581

Query: 304 LGAAFLQWRFDQMGEDVTFAWKVDDEKKFEDIVTSNPPSLGICFWDEIVESFPSRSREDL 363
           LG AF  W F+QMGE+V  +W  ++EKKF+DIV SNPPSLGI FWD+I +SFP +SR +L
Sbjct: 582 LGPAFYHWEFNQMGEEVGLSWTAEEEKKFKDIVKSNPPSLGITFWDQIFKSFPKKSRREL 641

Query: 364 VSYYYNVFLLRRRGHQNRVTPNKIDSDDES-ESGITTNGFGHEVHNSPGSIFYSPKKP 396
           VSYY+NVFLL RRG+QNR TPN IDSDDE  ESG  TNGFG E      SI  SP KP
Sbjct: 642 VSYYFNVFLLHRRGYQNRFTPNNIDSDDEGLESGSVTNGFGDEERKPSKSILKSPHKP 689

The following BLAST results are available for this feature:
Match Name   E-value  Identity (%)  Description
ARID1_ARATH  5.2e-66  44.56         AT-rich interactive domain-containing protein 1 OS=Arabidopsis thaliana GN=ARID1...
ARID2_ARATH  4.2e-52  34.18         AT-rich interactive domain-containing protein 2 OS=Arabidopsis thaliana GN=ARID2...
Match Name        E-value   Identity (%)  Description
A0A0A0KD11_CUCSA  4.9e-188  78.34         Uncharacterized protein OS=Cucumis sativus GN=Csa_6G004620 PE=4 SV=1
M5X794_PRUPE      1.1e-102  49.04         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002218mg PE=4 SV=1
A0A061DU38_THECC  1.5e-96   46.22         ARID/BRIGHT DNA-binding domain,ELM2 domain protein, putative isoform 1 OS=Theobr...
A0A061DSZ0_THECC  1.5e-96   46.22         ARID/BRIGHT DNA-binding domain,ELM2 domain protein, putative isoform 3 OS=Theobr...
A0A067K8K6_JATCU  5.3e-94   46.81         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_14376 PE=4 SV=1
Match Name   E-value  Identity (%)  Description
AT2G46040.1  2.9e-67  44.56         ARID/BRIGHT DNA-binding domain;ELM2 domain protein
AT4G11400.1  2.4e-53  34.18         ARID/BRIGHT DNA-binding domain;ELM2 domain protein
AT5G04110.1  6.1e-33  38.31         DNA GYRASE B3
AT2G03470.1  3.8e-27  35.51         ELM2 domain-containing protein
AT1G13880.1  2.8e-25  40.11         ELM2 domain-containing protein
Match Name                        E-value   Identity (%)  Description
gi|449441632|ref|XP_004138586.1|  7.0e-188  78.34         PREDICTED: AT-rich interactive domain-containing protein 1-like [Cucumis sativus...
gi|659116683|ref|XP_008458203.1|  3.2e-185  78.84         PREDICTED: AT-rich interactive domain-containing protein 1-like isoform X1 [Cucu...
gi|659116689|ref|XP_008458206.1|  1.6e-171  78.17         PREDICTED: AT-rich interactive domain-containing protein 1-like isoform X2 [Cucu...
gi|645254404|ref|XP_008233024.1|  7.4e-105  50.00         PREDICTED: AT-rich interactive domain-containing protein 1 isoform X1 [Prunus mu...
gi|645254407|ref|XP_008233025.1|  7.4e-105  50.00         PREDICTED: AT-rich interactive domain-containing protein 1 isoform X2 [Prunus mu...
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005634      nucleus
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0003677      DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh08G009030.1  CmaCh08G009030.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term  IPR Description   Source   Source Term     Source Description                               Alignment
None      No IPR available  PANTHER  PTHR22970       FAMILY NOT NAMED                                 coord: 55..396; score: 9.3E
None      No IPR available  PANTHER  PTHR22970:SF23  AT-RICH INTERACTIVE DOMAIN-CONTAINING PROTEIN 1  coord: 55..396; score: 9.3E

The following gene(s) are paralogous to this gene:
Gene            Paralogue       Organism                 Block
CmaCh08G009030  CmaCh17G004410  Cucurbita maxima (Rimu)  cmacmaB385
The following block(s) are covering this gene:
Gene            Organism                    Block
CmaCh08G009030  Cucumber (Chinese Long) v3  cmacucB1064
CmaCh08G009030  Cucurbita maxima (Rimu)     cmacmaB359
CmaCh08G009030  Cucurbita moschata (Rifu)   cmacmoB886
CmaCh08G009030  Bottle gourd (USVL1VR-Ls)   cmalsiB836
CmaCh08G009030  Cucumber (Gy14) v2          cgybcmaB463