CmoCh02G000530.1 (mRNA) Cucurbita moschata (Rifu)

NameCmoCh02G000530.1
TypemRNA
OrganismCucurbita moschata (Cucurbita moschata (Rifu))
DescriptionUbiquitin and WLM domain-containing protein
LocationCmo_Chr02 : 287837 .. 291457 (-)
Sequence length2500
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSexonthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAGCAGCAACACAGCATGTATAATCTGCCCGTCTTATGGAGAGGAACGAAGTATGTGGTGGAAATTAGTTCAGATTCTACTCTTCGGGACCTCGGTGAAAAGCTGCTAAAATTAACAGAAGTTCAAGCAGATACTATGCGGCTCATAGTCCCACAATTTTCCAGCAAAAGCTCTAAAATGTTATATCCTTTTTCTGATGAAGATGGATTCTTGGCTTTGCATAAGATTTCCATTTTTAAGGTGCCATTCATTTACTTATATCTATGACGTAGAGTTTTTTAGGTTGTGTGCCACAGCTACTAGTTGGTTTTTGATTTGACGGGTTATTTGAAGTAGTATGTTCGTGACGCGTATGAAGTTCTGATATTAAGTTTGCTGTTTCGATCTGTCTGGAACAATATTTTCTCTAATGATCCACCAAAAAACCGTCATTATGCTGCTAGTTCTAGGCAACTGACTATCAGCTCTGACTGCTTCAGGACAACAAGCCTATCAGAATGATGGGAGTATCTAAGAATGAGGTAGATGAAGTTTTGAAGAACGAAAAGAAAAATGAACGAATTGCTGGGTTCGACAAAGAAGAACAGAGACTGAAACAACGAATGTCAAGTAAGCGACAGGGCTTACTGAAACTACCAGAAGGACCCTATGTATTTTGTGAATTTCGGACGCTTCAAATTCCAGGAATTGAGGTGCTCTTTTCCATGCTCTATAGTGTTTTTTCAGTAATTTAGTATTGCATGCATAAATGTCAGCAGGATGGCATTTTGCAGGGCTTATGCCCTTTAAGATTTCCTTTGTTGAGAATTATATTAATCTACATGTCTTAACTGGTGTTGAATGGCATCCTTTCATTGGCGGTAGTTGAACCCTCCAGCTTCAGAAGCTTTGAAAAGAATGCATATGCTTGCAGCTGATCCTGGCATTGTTGCAATCATGAACAAGGTAAAGCAATGATTAAATCTGGATTTAATATGTGATTGGCGATAAATAACCTGGTGATTGGCCTTCTTTTAGCATCGTTGGCGTGTGGGAATTATGACTGAGATGGCCCCTGTTGGCTATGTTGGTGTGAGCCCGAAATGTATTCTTGGCTTTAATAAGGTATGAACGACTACTTCAGTGTGTGGAAAATCTTAGAATATTATTCTCCAAATTCGAATTTTGACTGTGCTTTCTGATCAGAACCATGGAGAGGAGATATCGCTGCGACTTCGGACAGACGACCTGAAGGGCTTCAGAAAATATGAAAGCATTAAGAAAACATTACTCCATGAACTTGTGAGCTCTCTTTGTTTTTCTTTTTTTATGCATGCAGAGGATAATAGTACATGGTAATTGATTCTCGAGTGAATTTCGTAACATATTGTGATTTACAACTAGGCACACATGATTTATTCCGAGCACGATGCCAACTTCTATGCTTTGGATAAGCAGGTAGTATGTTTTATGATTAAGATTGTTTTCTCATTTATGGCATATTTACAATTGTCGCTTTGCATTATTAGTCTGTAATTTGGTTGTTCTTTCAATCTTATATTTCTTAGCTTAATGAGGAGGCTGCCACTTTAGATTGGACAAGATCAAAAGGCCACACGTTGAGCGGAGTTAGTTATTCCCAATATCACGATGAAAGCGATGATGTCCAAGACGGCTTTGGTGTCTCACAGAAGCTTGGTGGTAGCATGTCGTATCGGCTGGTTAATCCCCGTGCTTCTTCAGTTTCCGCTGCTTATCACCGTTTGTCACACTCTTCAGATTTCAGTTCAAGAGTGTCTCCAGTAAATGGAGAGTCCAATCCGGATGAAAATTCTAATTGCCAAAACAAACTGGAGCCTGATCCTGATGTTGATGCTTATCAAAAGAAGATCGAGCCTGACCCAGATGACAGTTCCAATTATCAAAAGAAGCTCAAGCCTGACCCAGATGACAATTCCAATTATCAAAAGAAGTTCGAGCCTGACCCGTATGACAGTTCCAATTATCAAAAGAAGTTCGAGCCTGACCCAAATGACAGTTCCAATTATCAAAAGAAGTTCGAGCCTGACCCGGATGACAGTTCCAATTATCAAAAGAAGTTCGAGCCTGACCCGGATGACAGTTCCAATTATCAAAAGAAGTCCGAGCCTGACCCGGATGACAGTTCCAATTATAAAAAGAAGCTCGAGCGTGACCCAGATGACAGTTCCAATTATAAAAAGAAGCTCGAGCGTGACCCAGATGACAGTTGCAATTATAAAAAGAAGTTCGAGCCTGACCCAGATGACATTTCTGATTATCAAAAGAAGTTCGAGCCTGACCCAGATGACATTTCTGATTATCAAAAGAAGTTCGAGCCTGACCCAGATGACATTTCTGATTATCAAAAGAAGTTCGAGCCTGACCCAGACGACATTTCCGATTATCAAAAGAAGTTGGAGTCTGACCCAGACGACAGTTCCGATTATCAAAAGAAGTTCGAGCCTGACCCAGACGACAGTTCCGATTATCAAAAGAAGTTCGAGCCTGATCCAGACGACAGTTCCGATTATCAAAAGAAGTACGAGCCTGACCCAGATGACAGTTCTAATTATGAGGCGAATTGTCTAGAAGCTGGCTTAGTCACAGAACCTATGCAGACAGAGCCTGACCCTGATGAAAGTTCGGTACATCAGGCGGATTTATCTAAAATGGTTGTTGACGAACCCAATCCTGATGATCAAGAAATTCAAAGAATTCAAGACTCTGTTTCTGTTGTTTGCATTCGATTGCGTGAGGCTATCGCAAGGCTGCTGGCTGAAGTTCGACCTTCTGAATCGGCTGCAGTTTTTCAAACTCTGTTCAAGATTGTTAGGTAACTTCTACTGTTCTTTTCCTTTTGTGCTCTCAGTTCTTGATGGAAATGTTCGACTGCTTGTCCTTGGCTACTGACACTATGGCTTTTGGCATCTGGTTATTATGATTTAATCTTCAATGGTAAATTGTTTGCTTAAATTTCCTCTTATCTGCAGGAATGTAATTGAACACCCAGGTGATATGAAATACAGAAAGTTTCGCAAGGCAAGACTACGAAATACAGAAAGCTTTGAATAAGTTTTAGTAATTTTTATTGTATCCTTGAGCTAATAAATGAAGCATTGCAGGCTAATCCCACTATCCAGAAGAATGTTGCCAACTACAAAGGTTAGTTTTGTATGCGTTCATGTGCAACATGCTCGAAGAAACGAAAGAACTGTTCTAAATCACTATGATCTTCTGTATCAATTCTTTTAACATTTTTTAATGCAGCTGCACTGGAGATCCTCTTCTTGATAGGTTTCATTGAAGATGTACTGCTAAACGAAATGGGCAACGCCGAAACATTTCTCGTACTGAAGCGTAACGATCCTGGCTTATTGTGGCTTGCCAAATCCACCCTTGAAACGTGCAATGCCTTGTAGATAGATTGAAGAACGTAGTTAGTATGTTTAATAGAACTGTAAATGATAGTGTATGATGATACTATATACTAGTTCGATTTATCGTTTTTCAACACGAAACTCAGGACTGTAGGTCCATATCGTATGTCGGTTTATTTGTTTATGCAAATGTTCAATTATAAACTTGGTTCTTAAGATTTTTAGGTTTCAACTAAAAATG

mRNA sequence

ATGGAGCAGCAACACAGCATGTATAATCTGCCCGTCTTATGGAGAGGAACGAAGTATGTGGTGGAAATTAGTTCAGATTCTACTCTTCGGGACCTCGGTGAAAAGCTGCTAAAATTAACAGAAGTTCAAGCAGATACTATGCGGCTCATAGTCCCACAATTTTCCAGCAAAAGCTCTAAAATGTTATATCCTTTTTCTGATGAAGATGGATTCTTGGCTTTGCATAAGATTTCCATTTTTAAGGACAACAAGCCTATCAGAATGATGGGAGTATCTAAGAATGAGGTAGATGAAGTTTTGAAGAACGAAAAGAAAAATGAACGAATTGCTGGGTTCGACAAAGAAGAACAGAGACTGAAACAACGAATGTCAAGTAAGCGACAGGGCTTACTGAAACTACCAGAAGGACCCTATGTATTTTGTGAATTTCGGACGCTTCAAATTCCAGGAATTGAGTTGAACCCTCCAGCTTCAGAAGCTTTGAAAAGAATGCATATGCTTGCAGCTGATCCTGGCATTGTTGCAATCATGAACAAGCATCGTTGGCGTGTGGGAATTATGACTGAGATGGCCCCTGTTGGCTATGTTGGTGTGAGCCCGAAATGTATTCTTGGCTTTAATAAGAACCATGGAGAGGAGATATCGCTGCGACTTCGGACAGACGACCTGAAGGGCTTCAGAAAATATGAAAGCATTAAGAAAACATTACTCCATGAACTTGCACACATGATTTATTCCGAGCACGATGCCAACTTCTATGCTTTGGATAAGCAGCTTAATGAGGAGGCTGCCACTTTAGATTGGACAAGATCAAAAGGCCACACGTTGAGCGGAGTTAGTTATTCCCAATATCACGATGAAAGCGATGATGTCCAAGACGGCTTTGGTGTCTCACAGAAGCTTGGTGGTAGCATGTCGTATCGGCTGGTTAATCCCCGTGCTTCTTCAGTTTCCGCTGCTTATCACCGTTTGTCACACTCTTCAGATTTCAGTTCAAGAGTGTCTCCAGTAAATGGAGAGTCCAATCCGGATGAAAATTCTAATTGCCAAAACAAACTGGAGCCTGATCCTGATGTTGATGCTTATCAAAAGAAGATCGAGCCTGACCCAGATGACAGTTCCAATTATCAAAAGAAGCTCAAGCCTGACCCAGATGACAATTCCAATTATCAAAAGAAGTTCGAGCCTGACCCGTATGACAGTTCCAATTATCAAAAGAAGTTCGAGCCTGACCCAAATGACAGTTCCAATTATCAAAAGAAGTTCGAGCCTGACCCGGATGACAGTTCCAATTATCAAAAGAAGTTCGAGCCTGACCCGGATGACAGTTCCAATTATCAAAAGAAGTCCGAGCCTGACCCGGATGACAGTTCCAATTATAAAAAGAAGCTCGAGCGTGACCCAGATGACAGTTCCAATTATAAAAAGAAGCTCGAGCGTGACCCAGATGACAGTTGCAATTATAAAAAGAAGTTCGAGCCTGACCCAGATGACATTTCTGATTATCAAAAGAAGTTCGAGCCTGACCCAGATGACATTTCTGATTATCAAAAGAAGTTCGAGCCTGACCCAGATGACATTTCTGATTATCAAAAGAAGTTCGAGCCTGACCCAGACGACATTTCCGATTATCAAAAGAAGTTGGAGTCTGACCCAGACGACAGTTCCGATTATCAAAAGAAGTTCGAGCCTGACCCAGACGACAGTTCCGATTATCAAAAGAAGTTCGAGCCTGATCCAGACGACAGTTCCGATTATCAAAAGAAGTACGAGCCTGACCCAGATGACAGTTCTAATTATGAGGCGAATTGTCTAGAAGCTGGCTTAGTCACAGAACCTATGCAGACAGAGCCTGACCCTGATGAAAGTTCGGTACATCAGGCGGATTTATCTAAAATGGTTGTTGACGAACCCAATCCTGATGATCAAGAAATTCAAAGAATTCAAGACTCTGTTTCTGTTGTTTGCATTCGATTGCGTGAGGCTATCGCAAGGCTGCTGGCTGAAGTTCGACCTTCTGAATCGGCTGCAGTTTTTCAAACTCTGTTCAAGATTGTTAGGAATGTAATTGAACACCCAGGTGATATGAAATACAGAAAGTTTCGCAAGGCTAATCCCACTATCCAGAAGAATGTTGCCAACTACAAAGCTGCACTGGAGATCCTCTTCTTGATAGGTTTCATTGAAGATGTACTGCTAAACGAAATGGGCAACGCCGAAACATTTCTCGTACTGAAGCGTAACGATCCTGGCTTATTGTGGCTTGCCAAATCCACCCTTGAAACGTGCAATGCCTTGTAGATAGATTGAAGAACGTAGTTAGTATGTTTAATAGAACTGTAAATGATAGTGTATGATGATACTATATACTAGTTCGATTTATCGTTTTTCAACACGAAACTCAGGACTGTAGGTCCATATCGTATGTCGGTTTATTTGTTTATGCAAATGTTCAATTATAAACTTGGTTCTTAAGATTTTTAGGTTTCAACTAAAAATG

Coding sequence (CDS)

ATGGAGCAGCAACACAGCATGTATAATCTGCCCGTCTTATGGAGAGGAACGAAGTATGTGGTGGAAATTAGTTCAGATTCTACTCTTCGGGACCTCGGTGAAAAGCTGCTAAAATTAACAGAAGTTCAAGCAGATACTATGCGGCTCATAGTCCCACAATTTTCCAGCAAAAGCTCTAAAATGTTATATCCTTTTTCTGATGAAGATGGATTCTTGGCTTTGCATAAGATTTCCATTTTTAAGGACAACAAGCCTATCAGAATGATGGGAGTATCTAAGAATGAGGTAGATGAAGTTTTGAAGAACGAAAAGAAAAATGAACGAATTGCTGGGTTCGACAAAGAAGAACAGAGACTGAAACAACGAATGTCAAGTAAGCGACAGGGCTTACTGAAACTACCAGAAGGACCCTATGTATTTTGTGAATTTCGGACGCTTCAAATTCCAGGAATTGAGTTGAACCCTCCAGCTTCAGAAGCTTTGAAAAGAATGCATATGCTTGCAGCTGATCCTGGCATTGTTGCAATCATGAACAAGCATCGTTGGCGTGTGGGAATTATGACTGAGATGGCCCCTGTTGGCTATGTTGGTGTGAGCCCGAAATGTATTCTTGGCTTTAATAAGAACCATGGAGAGGAGATATCGCTGCGACTTCGGACAGACGACCTGAAGGGCTTCAGAAAATATGAAAGCATTAAGAAAACATTACTCCATGAACTTGCACACATGATTTATTCCGAGCACGATGCCAACTTCTATGCTTTGGATAAGCAGCTTAATGAGGAGGCTGCCACTTTAGATTGGACAAGATCAAAAGGCCACACGTTGAGCGGAGTTAGTTATTCCCAATATCACGATGAAAGCGATGATGTCCAAGACGGCTTTGGTGTCTCACAGAAGCTTGGTGGTAGCATGTCGTATCGGCTGGTTAATCCCCGTGCTTCTTCAGTTTCCGCTGCTTATCACCGTTTGTCACACTCTTCAGATTTCAGTTCAAGAGTGTCTCCAGTAAATGGAGAGTCCAATCCGGATGAAAATTCTAATTGCCAAAACAAACTGGAGCCTGATCCTGATGTTGATGCTTATCAAAAGAAGATCGAGCCTGACCCAGATGACAGTTCCAATTATCAAAAGAAGCTCAAGCCTGACCCAGATGACAATTCCAATTATCAAAAGAAGTTCGAGCCTGACCCGTATGACAGTTCCAATTATCAAAAGAAGTTCGAGCCTGACCCAAATGACAGTTCCAATTATCAAAAGAAGTTCGAGCCTGACCCGGATGACAGTTCCAATTATCAAAAGAAGTTCGAGCCTGACCCGGATGACAGTTCCAATTATCAAAAGAAGTCCGAGCCTGACCCGGATGACAGTTCCAATTATAAAAAGAAGCTCGAGCGTGACCCAGATGACAGTTCCAATTATAAAAAGAAGCTCGAGCGTGACCCAGATGACAGTTGCAATTATAAAAAGAAGTTCGAGCCTGACCCAGATGACATTTCTGATTATCAAAAGAAGTTCGAGCCTGACCCAGATGACATTTCTGATTATCAAAAGAAGTTCGAGCCTGACCCAGATGACATTTCTGATTATCAAAAGAAGTTCGAGCCTGACCCAGACGACATTTCCGATTATCAAAAGAAGTTGGAGTCTGACCCAGACGACAGTTCCGATTATCAAAAGAAGTTCGAGCCTGACCCAGACGACAGTTCCGATTATCAAAAGAAGTTCGAGCCTGATCCAGACGACAGTTCCGATTATCAAAAGAAGTACGAGCCTGACCCAGATGACAGTTCTAATTATGAGGCGAATTGTCTAGAAGCTGGCTTAGTCACAGAACCTATGCAGACAGAGCCTGACCCTGATGAAAGTTCGGTACATCAGGCGGATTTATCTAAAATGGTTGTTGACGAACCCAATCCTGATGATCAAGAAATTCAAAGAATTCAAGACTCTGTTTCTGTTGTTTGCATTCGATTGCGTGAGGCTATCGCAAGGCTGCTGGCTGAAGTTCGACCTTCTGAATCGGCTGCAGTTTTTCAAACTCTGTTCAAGATTGTTAGGAATGTAATTGAACACCCAGGTGATATGAAATACAGAAAGTTTCGCAAGGCTAATCCCACTATCCAGAAGAATGTTGCCAACTACAAAGCTGCACTGGAGATCCTCTTCTTGATAGGTTTCATTGAAGATGTACTGCTAAACGAAATGGGCAACGCCGAAACATTTCTCGTACTGAAGCGTAACGATCCTGGCTTATTGTGGCTTGCCAAATCCACCCTTGAAACGTGCAATGCCTTGTAG
BLAST of CmoCh02G000530.1 vs. Swiss-Prot
Match: YQ77_SCHPO (Ubiquitin and WLM domain-containing metalloprotease SPCC1442.07c OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=SPCC1442.07c PE=3 SV=1)

HSP 1 Score: 110.2 bits (274), Expect = 1.0e-22
Identity = 81/279 (29.03%), Postives = 123/279 (44.09%), Query Frame = 1

Query: 15  RGTKYVVEISSDSTLRDLGEKLLKLTEVQADTMRLIVPQFSSKSSKMLYPFSDEDGFLAL 74
           RG    +  + + T+ D  EKL +  +V    ++L+     S  S +     +E   + L
Sbjct: 8   RGNVIALSFNENDTVLDAKEKLGQEIDVSPSLIKLLYKGNLSDDSHLQDVVKNESKIMCL 67

Query: 75  HKISIFKDNKPIRMMGVSKNEVDEVLKNEKKNERIAGFDKEEQRLKQRMSSKRQGLLKLP 134
                 + +K I    +S+ +V +   N                    +  K+      P
Sbjct: 68  -----IRQDKDIVNQAISQLKVPDYSTNTYS-----------------LKPKKPHTTPKP 127

Query: 135 EGPYVFCEFRTLQIPGIELNPPASEALKRMHMLAADPGIVAIMNKHRWRVGIMTEMAPVG 194
              Y F E   L  P  +       AL+ +  L  D GI  IM+ HRW V +++EM P  
Sbjct: 128 ASIYTFNELVVLDYPHKD------RALRYLERLRDDTGIKKIMDSHRWTVPLLSEMDPAE 187

Query: 195 YVGVSPKCILGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMIYSEHDANFYA 254
           +     K  LG N N G  I LRLRTD   GFR Y+++K TL+HEL H ++ EHD++F+ 
Sbjct: 188 HTRHDSKT-LGLNHNQGAHIELRLRTDRYDGFRDYKTVKSTLIHELTHNVHGEHDSSFWE 247

Query: 255 LDKQLNEEAATLDWTRSKGHTLSG-VSYSQYHDESDDVQ 293
           L +QL +EA   D     G  +S   SY+   D  D+ Q
Sbjct: 248 LFRQLTKEADAADLLGKPGSYVSDRASYTPQQDNDDEDQ 257

BLAST of CmoCh02G000530.1 vs. TrEMBL
Match: A0A0A0KF59_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G169280 PE=4 SV=1)

HSP 1 Score: 823.2 bits (2125), Expect = 2.7e-235
Identity = 492/771 (63.81%), Postives = 557/771 (72.24%), Query Frame = 1

Query: 1   MEQQHSMYNLPVLWRGTKYVVEISSDSTLRDLGEKLLKLTEVQADTMRLIVPQFSSKSSK 60
           MEQQH +YN+PVLWRGTKY+VEISSDSTLRDLG++LLK+TEV+ADTMR IVPQFSSKSSK
Sbjct: 70  MEQQHIIYNIPVLWRGTKYMVEISSDSTLRDLGQELLKITEVKADTMRFIVPQFSSKSSK 129

Query: 61  MLYPFSDEDGFLALHKISIFKDN-KPIRMMGVSKNEVDEVLKNEKKNERIAGFDKEEQRL 120
           MLYPFSDEDG LAL K SIFKDN KPIRMMGVSKNEVDE+L N KKNERI GFD+EE+RL
Sbjct: 130 MLYPFSDEDGCLALQKFSIFKDNNKPIRMMGVSKNEVDEILNNAKKNERIVGFDEEEKRL 189

Query: 121 KQRMSSKRQGLLKLPEGPYVFCEFRTLQIPGIELNPPASEALKRMHMLAADPGIVAIMNK 180
           KQRMSSK +G+LKLPEGPYVFCEFRTLQIPGIELNPPASEALKRMHMLAADPGIVAIMNK
Sbjct: 190 KQRMSSKPRGVLKLPEGPYVFCEFRTLQIPGIELNPPASEALKRMHMLAADPGIVAIMNK 249

Query: 181 HRWRVGIMTEMAPVGYVGVSPKCILGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHE 240
           H WRVGIMTEMAP+GYVGV+PKCILGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHE
Sbjct: 250 HHWRVGIMTEMAPIGYVGVNPKCILGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHE 309

Query: 241 LAHMIYSEHDANFYALDKQLNEEAATLDWTRSKGHTLSGVSYSQYHDESDDVQDGFGVSQ 300
           LAHMI+SEHDANFYALDKQLNEEAA LDWTRSKGHTL+G++YSQYH+E+D V+D FGVSQ
Sbjct: 310 LAHMIFSEHDANFYALDKQLNEEAAALDWTRSKGHTLTGMNYSQYHEEND-VEDDFGVSQ 369

Query: 301 KLGGSMSYRLVNPRASSVSAAYHRLSHSSDFSSRVSPVNGESNPDENSNCQNKLEPDPDV 360
           KLGGSMS++LVN RA+SV+AAYHR++++SD SS V  V+ ESNP  NS+ QNKLEPDPD 
Sbjct: 370 KLGGSMSHQLVNARAASVAAAYHRMTNNSDCSSGVPQVSAESNP--NSSHQNKLEPDPD- 429

Query: 361 DAYQKKIEPDPDDSSNYQKKLKPDPDDNSNYQKKFEPDPYDSSNYQKKFEPDPNDSSNYQ 420
           D+   K+EPDPD SSN Q  L  D                             N+S N++
Sbjct: 430 DSVYPKLEPDPDGSSNDQNMLGLDS----------------------------NNSYNHK 489

Query: 421 KKFEPDPDDSSNYQKKFEPDPDDSSNYQKKSEPDPDDSSNYKKKLERDPDDSSNYKKKLE 480
            K EP PDDS             S N + +SEP         K L    D SS     + 
Sbjct: 490 GKLEPAPDDSIG-----------SENLESESEP------RIIKSLVVQTDLSSTEVHPVP 549

Query: 481 RDPDDSCNYKKKF-EPDPDDISDYQKKFEPDPDDISDYQKKFEPDPDDISDYQKKF-EPD 540
                     K + EPD DD          D D +S   +       D + +Q+   EPD
Sbjct: 550 ATNSRLLEATKSYGEPDLDDRGSSSNSKVIDTDHLSQGMQNL-----DCNIFQRMIVEPD 609

Query: 541 PDDISDYQKKLESDPDDSSDYQKKFEPDPDDSSDYQKKFEPDPDDSSDYQKKYEPDPDDS 600
           PD + +    L S      +                   E D  ++   +        + 
Sbjct: 610 PDALGEKVNTLASGRAIGHN-------------------ETDCLEAGLVK--------NQ 669

Query: 601 SNYEANCLEAGLV--TEPMQTEPDPDESSVHQADLSKMVVDEPNPDDQEIQRIQDSVSVV 660
           S+   NC +   +   EPMQ EPDPDES VHQ D SKM VD+ +PDDQEIQRIQDSVSVV
Sbjct: 670 SHLSINCKKHDTIQGEEPMQIEPDPDESLVHQVDSSKMAVDQLDPDDQEIQRIQDSVSVV 729

Query: 661 CIRLREAIARLLAEVRPSESAAVFQTLFKIVRNVIEHPGDMKYRKFRKANPTIQKNVANY 720
           C RLREAI +LLAEV+PSES+AV QTLFKIV+NVIEHP +MKYRK RKANP IQKNVANY
Sbjct: 730 CNRLREAITKLLAEVKPSESSAVVQTLFKIVKNVIEHPDEMKYRKLRKANPIIQKNVANY 759

Query: 721 KAALEILFLIGFIEDVLLNEMGNAETFLVLKRNDPGLLWLAKSTLETCNAL 767
           +AALEILFLIGFIED LL+E+G AETFLVLKRNDPGLLWLAKSTLETCNAL
Sbjct: 790 EAALEILFLIGFIEDALLDEIGKAETFLVLKRNDPGLLWLAKSTLETCNAL 759

BLAST of CmoCh02G000530.1 vs. TrEMBL
Match: M5X3T8_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002194mg PE=4 SV=1)

HSP 1 Score: 630.9 bits (1626), Expect = 2.0e-177
Identity = 406/770 (52.73%), Postives = 519/770 (67.40%), Query Frame = 1

Query: 1   MEQQHSMYNLPVLWRGTKYVVEISSDSTLRDLGEKLLKLTEVQADTMRLIVPQFSSKSSK 60
           M+     +N+ V+WRG K+ VEI++ +TL+DLG +L KLT V+ADT++LIVPQFS KSSK
Sbjct: 1   MQDPQVEHNIWVIWRGKKFNVEINAGATLKDLGHELQKLTNVKADTLKLIVPQFSDKSSK 60

Query: 61  MLYPFSDEDGFLALHKISIFKDNKPIRMMGVSKNEVDEVLKNEKKNERIAGFDKEEQRLK 120
           +L PFSDE   L+L + SI  + K IRMMGVS++EVDEVL++ K N RIAGFD+EE RL+
Sbjct: 61  LLSPFSDEHEKLSLEETSII-EGKSIRMMGVSEHEVDEVLQHAKTNLRIAGFDEEEMRLR 120

Query: 121 QRMSSKRQGLLKLPEGPYVFCEFRTLQIPGIELNPPASEALKRMHMLAADPGIVAIMNKH 180
           QRMS  R   LKLP+GPY+FC+FRTLQ+PGIELNPP SEALKRMHMLAADPGI+++MNKH
Sbjct: 121 QRMSY-RPHTLKLPQGPYIFCDFRTLQLPGIELNPPVSEALKRMHMLAADPGIISVMNKH 180

Query: 181 RWRVGIMTEMAPVGYVGVSPKCILGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHEL 240
           RWRVGIMTEMAPVGYVG+SPKCILGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHEL
Sbjct: 181 RWRVGIMTEMAPVGYVGISPKCILGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHEL 240

Query: 241 AHMIYSEHDANFYALDKQLNEEAATLDWTRSKGHTLSGVSYSQYHDESDDVQDGFGVSQK 300
           AHM+YSEHDANFYALDKQLN+EA +LDWTRS+ HTLSGV YS++++E+  V      SQK
Sbjct: 241 AHMVYSEHDANFYALDKQLNQEAESLDWTRSRSHTLSGVQYSEHYEENFYVGGRSNSSQK 300

Query: 301 LGGSMSYRLVNPRASSVSAAYHRLSHSSDFSSRVSPVNGESNPDENSNCQNKLEPDPDVD 360
           LGG+MS RL + R SSV+AAY RL+ +S  S  VS V+ ES+PD++     K        
Sbjct: 301 LGGNMSDRLPSARTSSVAAAYQRLATASHDS--VSEVHEESHPDDSIPHMQK-------- 360

Query: 361 AYQKKIEPDPDDSSNYQKKLKPDPDDNSNYQKKFEPDPYDSSNYQKKFEPDPNDSSNYQK 420
                      +SS+     K + +  S+++ +++PD           EPDP+D S  + 
Sbjct: 361 -----------ESSHMDFVGKGNLEIGSSHKIQWKPD----------MEPDPDDQSGNKN 420

Query: 421 KFEPDPDDSSNYQK----KFEPDPDDSSNYQKKSEPDPDDSSNYKKKLERDPDDSSNYKK 480
            FEP PD+SS+        F  D  +S   Q  S       S   +KLE      +  ++
Sbjct: 421 NFEPSPDESSSQSSGSGTLFGQDFSESMMSQLVSH------SVSNRKLE-----GTECRE 480

Query: 481 KLERDPDDSC-NYKKKFEPDPDDISDYQKKFEPDPDDISDYQKKFEPDPDDISDYQKKFE 540
           + + D  ++C  +    EP+P     +  + E     I       EPDPDD+       +
Sbjct: 481 EPDADYMEACLKHDVVAEPEP----FHSHEMEILESRIQPRNNVDEPDPDDLD-----AK 540

Query: 541 PDPDDISDYQKKLESDPDDSSDYQK-KFEPDPDDSSDYQKKFEPDPDDS-SDYQKKYEPD 600
           PD      Y   +  + DDS   +  K E  P    +     EPDPDDS S+   + EPD
Sbjct: 541 PDNLGCGSYGNIIRPNHDDSLVSETIKCEAHPRKVHN-----EPDPDDSQSNGVIQAEPD 600

Query: 601 PDDSSNYEANCLEAGLVTEPMQTEPDPDESSVHQADLSKMVVDEPNPDDQEIQRIQDSVS 660
           PDDS +        G++    Q EPDPD++ VH  ++S+M +DEP+PDD+E QRIQD V+
Sbjct: 601 PDDSQS-------IGII----QAEPDPDDNLVHPREISRMQIDEPDPDDEEFQRIQDPVT 660

Query: 661 VVCIRLREAIARLLAEVRPSESAAVFQTLFKIVRNVIEHPGDMKYRKFRKANPTIQKNVA 720
           V   RL+E I  L AEV P+++ AV QTLFKI RNV+EHPG++KYR+ RKANP IQ+NVA
Sbjct: 661 VFRKRLQENIELLQAEVNPTQATAVLQTLFKITRNVLEHPGEIKYRRLRKANPAIQRNVA 700

Query: 721 NYKAALEILFLIGFIEDVLLNEMGNAETFLVLKRNDPGLLWLAKSTLETC 764
           NYKAA+ ILFLIGF E+V ++E+G  ET+LVLKR+DPGLLWLAKS+LETC
Sbjct: 721 NYKAAMAILFLIGFNENV-VDEIGRPETYLVLKRDDPGLLWLAKSSLETC 700

BLAST of CmoCh02G000530.1 vs. TrEMBL
Match: W1NQ89_AMBTC (Uncharacterized protein OS=Amborella trichopoda GN=AMTR_s00074p00152100 PE=4 SV=1)

HSP 1 Score: 607.4 bits (1565), Expect = 2.3e-170
Identity = 394/795 (49.56%), Postives = 524/795 (65.91%), Query Frame = 1

Query: 3   QQHSMYNLPVLWRGTKYVVEISSDSTLRDLGEKLLKLTEVQADTMRLIVPQFSSKSSKML 62
           ++  + ++ V+WRG +  VEI+S ST+ +LG+KL  +T V+ +TMRL+VP+ ++ SSK+L
Sbjct: 2   EEEDIMSVTVIWRGKQITVEINSGSTVEELGQKLQIVTNVKPETMRLLVPRSTNNSSKLL 61

Query: 63  YPFSDEDGFLALHKISIFKDNKPIRMMGVSKNEVDEVLKNEKKNE-RIAGFDKEEQRLKQ 122
            PFS E   L+L + ++ K  K IRMMGV  +E++EV  +  K + RIAGFD+EE+RL+Q
Sbjct: 62  LPFSHEHSKLSLQETAVLK-GKSIRMMGVFSDEIEEVSHDSSKPDLRIAGFDEEEKRLRQ 121

Query: 123 RMSSKRQGLLKLPEGPYVFCEFRTLQIPGIELNPPASEALKRMHMLAADPGIVAIMNKHR 182
           R   + Q  L+LP+GPY+FC+FRTL IPGI+LNP  +EAL+RMHMLA+DPGIVAIMNKH+
Sbjct: 122 RTFGRPQSSLRLPQGPYIFCDFRTLSIPGIQLNPLPTEALQRMHMLASDPGIVAIMNKHK 181

Query: 183 WRVGIMTEMAPVGYVGVSPKCILGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHELA 242
           WRVGIMTE+APVGYVGVSPKCILG+NKNHGEEISLRLRTDDLKGFRKYESIKKTLLHELA
Sbjct: 182 WRVGIMTELAPVGYVGVSPKCILGYNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHELA 241

Query: 243 HMIYSEHDANFYALDKQLNEEAATLDWTRSKGHTLSGVSYSQYHDESDDVQDGFGVSQKL 302
           HM++SEHD NFYALDKQLN+EA TLDWT+SK HTLSG S    ++    ++     + KL
Sbjct: 242 HMVHSEHDTNFYALDKQLNQEAVTLDWTKSKRHTLSGTSSKDDYEWETQIETVHH-APKL 301

Query: 303 GGSMSYRLVNPRASSVSAAYHRL-SHSSDFSSRVSPVNGESNPDENSNCQ-----NKLEP 362
           GG      ++ R SSV+AAY RL  +SSD  S       ++   ++SN       N  EP
Sbjct: 302 GGRNLNPGIHVRESSVAAAYIRLLKNSSDNVSEEPKKQADAMEIDSSNMNSSGFVNMGEP 361

Query: 363 DPDVD--AYQKKI-----EPDPDDSS-------NYQKKLKPDPDDNSNYQKKFEPDPYDS 422
           DPD       KK+     EPDPDD          Y+    PDP    +++   EPDP D 
Sbjct: 362 DPDDGDLVKSKKLTLGHEEPDPDDYCFPNSSIMKYKNTSLPDPGGFCDHKISEEPDPDDL 421

Query: 423 SNYQKKFEPDPNDSSNYQKKFEPDPDDSSNYQKKFEPDPDDSSNYQKKSEPDPDDSSNYK 482
            +++++ +PDP+D  +++++ E +PDD  +++K+ EPDPDD  +++ + EPDPD+ S  K
Sbjct: 422 CDHRQREDPDPDDLCDHRRREESNPDDLCDHRKREEPDPDDLCDHRTREEPDPDEFSTKK 481

Query: 483 KKL----ERDPDDSSNYKKKLERDPDDSC-----NYKKKFEPDPDDISDYQKK-FEPDPD 542
             L    E DPDD   ++   E DPD++      N  K  EPDPDD    +    EPD D
Sbjct: 482 GILKHGSEPDPDDDEIHE---EPDPDETNSKVLGNELKTSEPDPDDTGVSEAMAIEPDQD 541

Query: 543 DISDYQKKFEPDPDDISDYQKKFEPDPDDISDYQKKLESDPD-DSSDYQKKFEPDPDDSS 602
           D  D  + F+PDP +  + Q   EPDPDD   ++K + ++PD D S+     EPDPDD  
Sbjct: 542 DAQDISEGFKPDPKEHLETQIN-EPDPDD---FEKAVINEPDPDDSENAMASEPDPDDF- 601

Query: 603 DYQKKFEPDPDDSSDYQKKYEPDPDDSSNYEANCLEAGLVTEPMQTEPDPDESSVHQADL 662
           D     EPDPDD+ +     EPDPDDS     N        EP++   D + +   QA+ 
Sbjct: 602 DKAGTNEPDPDDT-EMAGANEPDPDDSDKVGTN--------EPVRD--DSENAETFQAE- 661

Query: 663 SKMVVDEPNPDDQEIQRIQDSVSVVCIRLREAIARLLAEVRPSESAAVFQTLFKIVRNVI 722
                     D  E++RIQDSV++V  RL+ +I RL AE  P E+A+V  TLFKIVRNVI
Sbjct: 662 --------GMDMDELRRIQDSVTIVTTRLQSSIERLKAEASPFEAASVILTLFKIVRNVI 721

Query: 723 EHPGDMKYRKFRKANPTIQKNVANYKAALEILFLIGFIEDVLLNEMGNAETFLVLKRNDP 766
           EHP ++K+++ RKANP  QK VA YKAA+E+L  IGF EDV+L+E+G AE +LVL+RNDP
Sbjct: 722 EHPNEIKFKRLRKANPHFQKTVARYKAAMEVLVAIGFCEDVVLDEIGKAEPYLVLRRNDP 766

BLAST of CmoCh02G000530.1 vs. TrEMBL
Match: A0A067K749_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_13776 PE=4 SV=1)

HSP 1 Score: 602.1 bits (1551), Expect = 9.8e-169
Identity = 390/749 (52.07%), Postives = 500/749 (66.76%), Query Frame = 1

Query: 23  ISSDSTLRDLGEKLLKLTEVQADTMRLIVPQFSSKSSKMLYPFSDEDGFLALHKISIFKD 82
           ++S+++L+DLG++L KLT+V+ DTM+LIVPQ SSK SK+L PFS+E   L+LHK SI  +
Sbjct: 1   MNSNASLKDLGDELQKLTDVKPDTMKLIVPQVSSKGSKLLSPFSNEHSQLSLHKASIL-E 60

Query: 83  NKPIRMMGVSKNEVDEVLKNEKKNERIAGFDKEEQRLKQRMSSKRQGLLKLPEGPYVFCE 142
            K IRMMGV ++EVD+VL+N K N RIAGFD+EE+R+KQR +     LLKLP+GPY FC+
Sbjct: 61  GKSIRMMGVPEDEVDKVLQNAKDNLRIAGFDEEERRMKQRSAYSPHALLKLPQGPYTFCD 120

Query: 143 FRTLQIPGIELNPPASEALKRMHMLAADPGIVAIMNKHRWRVGIMTEMAPVGYVGVSPKC 202
           FRTLQ+PGI+LNPPASEALKRMHMLAADPGIVAIMNKHRWRVGIMTEMAPVGYVGVSPKC
Sbjct: 121 FRTLQLPGIQLNPPASEALKRMHMLAADPGIVAIMNKHRWRVGIMTEMAPVGYVGVSPKC 180

Query: 203 ILGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMIYSEHDANFYALDKQLNEE 262
           ILGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHM+YSEHDANFY LDKQLN+E
Sbjct: 181 ILGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMVYSEHDANFYNLDKQLNQE 240

Query: 263 AATLDWTRSKGHTLSGVSYSQYHDESDDVQ-DGFGVSQKLGGSMSYRLVNPRASSVSAAY 322
           AA+LDWT+S+GHTL+ V +  +++E +    D    S KLGG++  ++ + RASSV+AAY
Sbjct: 241 AASLDWTKSRGHTLNRVRHLDHYEEEESYDSDNRSFSYKLGGNVLDQMASARASSVAAAY 300

Query: 323 HRLSHSSDFSSRVSPVNGESNPDENSNCQNKLEPDPDVDAYQKKIEPDPDDSSNYQKKLK 382
            RL++ S   S  S V  E +PD++                  +I       +NY  +  
Sbjct: 301 LRLANESANGSGASRVYEEPDPDDS------------------RISMHHRPEANYVGE-- 360

Query: 383 PDPDDNSNYQKKFEPDPYDSSNYQKKFEPDPNDSSNYQKKFEPDPDDSSNYQKKFEPDPD 442
            + D    ++ +F+PD          +EPDP++ S  Q K EPDPDDS N         +
Sbjct: 361 DNTDIEFAHKVQFKPD----------YEPDPDEYSYVQSKHEPDPDDSQNNNLGLMETLN 420

Query: 443 DSSNYQKK-SEPDPDDSSNYKKKLERDPDDSSNYKKKLE-RDPDDSCNYKKKF-EPDPDD 502
           +     K   EPDPDDS   + K+        N    L+ R      +  K + EPDPD+
Sbjct: 421 NLIKLGKTIDEPDPDDS---EVKVGDGNIQGPNQDNSLKIRSRKGQAHLDKVYGEPDPDE 480

Query: 503 I-SDYQKKFEPDPDDISDYQKKFEPDPDDISDYQKKFEPDPDDISDYQKKLESDPDDSSD 562
             +D   + EPDPDD  D     E     I +     EPDPDD   Y K+       S+ 
Sbjct: 481 SQADRTMQVEPDPDD--DLAASHEISSMKIDESMIIDEPDPDD--SYAKQ-------SNS 540

Query: 563 YQKKFEPDPDDSSDYQKKFEPDPDDSSDYQKKY-EPDPDDSSNYEANCLEAGLVTEPMQT 622
             +  +     ++   +  E    D++  +K Y EPDPD+S   +AN +        +  
Sbjct: 541 RHRNIKGADQSNTPLTESIE----DAACPKKAYREPDPDES---QANSV--------VGI 600

Query: 623 EPDPDESSVHQADLSKMVVDEPNPDDQEIQRIQDSVSVVCIRLREAIARLLAEVRPSESA 682
           EPDPD+  +   ++S M +DEP+PDD+EI+RIQD V+VVC RL++A+  L  EV  +E+ 
Sbjct: 601 EPDPDDGLLASQEISNMKIDEPDPDDEEIRRIQDPVAVVCGRLQKAVETLRTEVDTAEAT 660

Query: 683 AVFQTLFKIVRNVIEHPGDMKYRKFRKANPTIQKNVANYKAALEILFLIGFIEDVLLNEM 742
           A  QTLFKI+RNVIEHP +MK+++ RKANP IQKNVAN++AALEIL ++GFIEDVLL+E 
Sbjct: 661 ATLQTLFKIIRNVIEHPYEMKFKRIRKANPIIQKNVANHRAALEILQMVGFIEDVLLDET 689

Query: 743 GNAETFLVLKRNDPGLLWLAKSTLETCNA 766
           G AET LVLKRNDPGLLWLAKS+LE C A
Sbjct: 721 GKAETCLVLKRNDPGLLWLAKSSLEACLA 689

BLAST of CmoCh02G000530.1 vs. TrEMBL
Match: A0A0K9P5J0_ZOSMR (Uncharacterized protein OS=Zostera marina GN=ZOSMA_379G00080 PE=4 SV=1)

HSP 1 Score: 510.4 bits (1313), Expect = 3.9e-141
Identity = 376/814 (46.19%), Postives = 492/814 (60.44%), Query Frame = 1

Query: 6   SMYNLPVLWRGTKYVVEISSDS-TLRDLGEKLLKLTEVQADTMRLIVPQFSSKSSKMLYP 65
           S+ ++ V WR +   V+++ D+  ++DLG KLLKLT+V+A+TM+L++P  + K SK++ P
Sbjct: 210 SVVDVLVKWRKSHLRVKVNFDTDVVKDLGLKLLKLTKVKAETMKLLIPLTACKGSKLMNP 269

Query: 66  FSDEDGFLALHKISIFKDNKPIRMMGVSKNEVDEVLK-NEKKNERIAGFDKEEQRLKQRM 125
           FS E   L L +I+I  + KP+ MMG   +E+ E+ + N K++ RI GFD+EE+RL+QR 
Sbjct: 270 FSVEHSSLKLQEIAIL-EGKPVIMMGAFDDEIQELSQSNSKRDSRIIGFDEEEKRLRQRS 329

Query: 126 SSKRQGLLKLPEGPYVFCEFRTLQIPGIELNPPASEALKRMHMLAADPGIVAIMNKHRWR 185
               +   KLP+G Y+FC+FRTL+IPGIELNPPASEALKRMH LA DPGI+AIMNKH WR
Sbjct: 330 IGISEYSPKLPKGSYIFCDFRTLRIPGIELNPPASEALKRMHTLACDPGIIAIMNKHHWR 389

Query: 186 VGIMTEMAPVGYVGVSPKCILGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHM 245
           VGIM EMAP+GYVGVSPKC+LGFNKN GEEI+LRLRTDDLKGFRKYESIKKTLLHELAHM
Sbjct: 390 VGIMKEMAPLGYVGVSPKCVLGFNKNFGEEIALRLRTDDLKGFRKYESIKKTLLHELAHM 449

Query: 246 IYSEHDANFYALDKQLNEEAATLDWTRSKGHTLSGV-SYSQYHDESDDVQDGF--GVSQK 305
           +YS+HDA+FYALDKQLN EA TLDWT+S   TL+G  +Y+ Y+DES  VQ+ +     QK
Sbjct: 450 VYSDHDASFYALDKQLNNEAITLDWTKSGSRTLNGTHTYTHYNDES-YVQETYRNKSGQK 509

Query: 306 LGGSMSYRLVNPRASSVSAAYHR-LSHSSDFSSRVSPVNGESNPDENSNCQNKLEPDPDV 365
           LGG+    L + RAS+V AA++R LS S++    +       +  +N            +
Sbjct: 510 LGGANLTPLSSARASAVVAAHNRYLSTSTNIKQELQIATQYIHEKDN------------L 569

Query: 366 DAYQKKIEPDPDDSSNYQKKLKPDPDDNSNYQKKFEPDPYDSSNYQKKFEPDPNDSSNYQ 425
                 +EPDPDDS+    K + DP    +   K+EPDP DS     K EPDP+DS    
Sbjct: 570 HESSISVEPDPDDSN---LKYESDP---GSIMVKYEPDPDDSIT---KDEPDPDDSI--- 629

Query: 426 KKFEPDPDDSSNYQKKFEPDPDDSSNYQKKSEPDPDDSSNYKKKLERDPDDSSNYKKKLE 485
            K EPDP DS     K EP+PDDS     K EPDPDD      ++ +D  +S +    ++
Sbjct: 630 MKVEPDPYDSI---MKDEPEPDDSI---VKDEPDPDD------RITKDKPESDD--SIMK 689

Query: 486 RDPDDSCNYKKKFEPDPDDISDYQKKFEPDPDDISDYQKKFEPDPDDISDYQKKFEPDPD 545
            DPDDS     K  PDPD I     K EPD D       K EPDPD   D   K EPDP 
Sbjct: 690 DDPDDSIT---KDGPDPDTII----KDEPDLDKSI---AKDEPDPD---DNILKDEPDP- 749

Query: 546 DISDYQKKLESDPDDSSDYQKKFEPDPDDSSDYQKKFEPDPDDS----SDYQK--KYEPD 605
                      DPD+S     K EPDP +S     K EPDP +S     DY K   Y  +
Sbjct: 750 -----------DPDESI---TKDEPDPRESI---TKDEPDPRESLGKMVDYLKHGSYNAE 809

Query: 606 P---------DD--SSNYEANCLEAGLVTEPMQTEPD--PDESSVHQADL--------SK 665
                     DD    +Y A+        E      D   DE+     +L         K
Sbjct: 810 TIRGKSLLTVDDVMHVDYYAHSESLSYPKEVQSKTNDFVSDETGFSSKELDFDEYDSSKK 869

Query: 666 MVVDE------PNPDD-----------------QEIQRIQDSVSVVCIRLREAIARLLAE 725
           M +DE      P+ DD                  E+QRI++SV+++C RL+++I  L  +
Sbjct: 870 MNIDELQTYEIPDSDDVKPGAIEHNCSMNLSENDELQRIEESVAIICQRLQKSIELLRFD 929

Query: 726 VRPSESAAVFQTLFKIVRNVIEHPGDMKYRKFRKANPTIQKNVANYKAALEILFLIGFIE 764
               ++++   TLFKI+ NVIEHPGD K+RK RK NP  QK VAN+KAA+++L ++GF E
Sbjct: 930 STQVDTSSTLYTLFKIIMNVIEHPGDEKFRKLRKRNPLFQKTVANHKAAMDVLRIVGFCE 952

BLAST of CmoCh02G000530.1 vs. TAIR10
Match: AT5G35690.1 (AT5G35690.1 WLM (InterPro:IPR013536), PUB domain (InterPro:IPR018997), PUG domain (InterPro:IPR006567))

HSP 1 Score: 401.4 bits (1030), Expect = 1.3e-111
Identity = 230/433 (53.12%), Postives = 295/433 (68.13%), Query Frame = 1

Query: 1   MEQQHSMYNLPVLWRGTKYVVEISSDSTLRDLGEKLLKLTEVQADTMRLIVPQFSSKSSK 60
           ME       + +LW+G KY VEI S ++L+DLG +L KLT V ++T+RLIVP+ + K S 
Sbjct: 6   MEDSGKKIRVSLLWKGNKYSVEIDSGASLKDLGYELRKLTGVTSETLRLIVPRLNEKGSS 65

Query: 61  MLYPFSDEDGFLALHKISIFKDNKPIRMMGVSKNEVDEVLKNEKKNERIAGFDKEEQRLK 120
           ++ PFSDE   L+L + +I +D K IRMMGVS+ EV+ VLK    + RI GF++EE+RLK
Sbjct: 66  LMLPFSDEHSSLSLQESNIIED-KTIRMMGVSEEEVEGVLKEAVSDMRILGFEEEERRLK 125

Query: 121 QRMSSKRQGLLKLPEGPYVFCEFRTLQIPGIELNPPASEALKRMHMLAADPGIVAIMNKH 180
           Q+ S      +KLP+G Y+F +FRTLQ+PGIELNPP S ALKRMHMLAADPGI+A+MNKH
Sbjct: 126 QKKSYVSSASIKLPQGTYIFGDFRTLQLPGIELNPPPSAALKRMHMLAADPGIIAVMNKH 185

Query: 181 RWRVGIMTEMAPVGYVGVSPKCILGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHEL 240
           RWRVGIMTE+APVGYVGVSP+C+LGFNKN GEEISLRLRTDDLKGFRKY+SIKKTLLHEL
Sbjct: 186 RWRVGIMTELAPVGYVGVSPRCLLGFNKNQGEEISLRLRTDDLKGFRKYQSIKKTLLHEL 245

Query: 241 AHMIYSEHDANFYALDKQLNEEAATLDWTRSKGHTLSGVSYSQYHDESDDVQD-GFGVSQ 300
           AHM+Y+EHD  FYALD QLN+EA +LDWT+S+GHTL+G  +    DE D   D    VSQ
Sbjct: 246 AHMVYTEHDEKFYALDSQLNKEAESLDWTKSRGHTLNGTKFINDDDEEDYFFDENETVSQ 305

Query: 301 KLGGSMSYRLVNPRASSVSAAYHRLSHSSDFSSRVSPVNGESNPDENSNCQNKLEPDPDV 360
           +LGG+ S  L N R SSV+AAY RLSH+S     VS ++ E +PD+  + ++        
Sbjct: 306 RLGGNQSDNLGNARESSVAAAYRRLSHTS-----VSKLSEEPDPDDLVDVRD-------- 365

Query: 361 DAYQKKIEPDPDDSSNYQKKLKPDPDDNSNYQKKFEPDPYDSSNYQKKFEPDPNDSSNYQ 420
              + K    P   S+   K +PDPDD +        D    +     +E   + +   +
Sbjct: 366 ---ENKQLVLPKAQSDSMTKFEPDPDDTT-------ADDATKTESCHSYEMASDLAHPTK 414

Query: 421 KKFEPDPDDSSNY 433
              EPDPDDS  +
Sbjct: 426 DDDEPDPDDSETH 414

BLAST of CmoCh02G000530.1 vs. TAIR10
Match: AT1G55915.1 (AT1G55915.1 zinc ion binding)

HSP 1 Score: 67.0 bits (162), Expect = 5.7e-11
Identity = 60/199 (30.15%), Postives = 94/199 (47.24%), Query Frame = 1

Query: 147 QIPGIELNPPASEALKRMHMLAADPGIVAIMNKHRWRVGIMTEMAPVGYVGVSPKCILGF 206
           +I  ++  P   EA K +  +A    +  IM + +WRV +++E  P      +P+ +LG 
Sbjct: 14  EIKALKRKPREDEARKILEKVANQ--VQPIMTRRKWRVKLLSEFCPT-----NPR-LLGV 73

Query: 207 NKNHGEEISLRLR--TDDLKGFRKYESIKKTLLHELAHMIYSEHDANFYALDKQLNEEAA 266
           N N G ++ LRLR    DL  F  Y  I  T+LHEL H  +  H+A+FY L  +L +E  
Sbjct: 74  NVNRGVQVKLRLRRVNHDL-DFLSYHEILDTMLHELCHNAHGPHNASFYKLWDELRKECE 133

Query: 267 TLDWTRSKGHTLSGVSYSQYHDESDDVQDGFGVSQKLGG-SMSYRLVNPRASSVSAAYHR 326
            L    SKG T +G  +                 ++LGG S    L   RA++ +AA  R
Sbjct: 134 EL---MSKGITGTGQGFDM-------------PGKRLGGLSRQPSLSFLRATAATAAEKR 187

Query: 327 LSHSSDFSSRVSPVNGESN 343
           +   +   S    + G+S+
Sbjct: 194 VRAGTLLPSGPQRLGGDSS 187

BLAST of CmoCh02G000530.1 vs. NCBI nr
Match: gi|778713704|ref|XP_004143191.2| (PREDICTED: uncharacterized protein LOC101220832 [Cucumis sativus])

HSP 1 Score: 823.2 bits (2125), Expect = 3.9e-235
Identity = 492/771 (63.81%), Postives = 557/771 (72.24%), Query Frame = 1

Query: 1   MEQQHSMYNLPVLWRGTKYVVEISSDSTLRDLGEKLLKLTEVQADTMRLIVPQFSSKSSK 60
           MEQQH +YN+PVLWRGTKY+VEISSDSTLRDLG++LLK+TEV+ADTMR IVPQFSSKSSK
Sbjct: 70  MEQQHIIYNIPVLWRGTKYMVEISSDSTLRDLGQELLKITEVKADTMRFIVPQFSSKSSK 129

Query: 61  MLYPFSDEDGFLALHKISIFKDN-KPIRMMGVSKNEVDEVLKNEKKNERIAGFDKEEQRL 120
           MLYPFSDEDG LAL K SIFKDN KPIRMMGVSKNEVDE+L N KKNERI GFD+EE+RL
Sbjct: 130 MLYPFSDEDGCLALQKFSIFKDNNKPIRMMGVSKNEVDEILNNAKKNERIVGFDEEEKRL 189

Query: 121 KQRMSSKRQGLLKLPEGPYVFCEFRTLQIPGIELNPPASEALKRMHMLAADPGIVAIMNK 180
           KQRMSSK +G+LKLPEGPYVFCEFRTLQIPGIELNPPASEALKRMHMLAADPGIVAIMNK
Sbjct: 190 KQRMSSKPRGVLKLPEGPYVFCEFRTLQIPGIELNPPASEALKRMHMLAADPGIVAIMNK 249

Query: 181 HRWRVGIMTEMAPVGYVGVSPKCILGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHE 240
           H WRVGIMTEMAP+GYVGV+PKCILGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHE
Sbjct: 250 HHWRVGIMTEMAPIGYVGVNPKCILGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHE 309

Query: 241 LAHMIYSEHDANFYALDKQLNEEAATLDWTRSKGHTLSGVSYSQYHDESDDVQDGFGVSQ 300
           LAHMI+SEHDANFYALDKQLNEEAA LDWTRSKGHTL+G++YSQYH+E+D V+D FGVSQ
Sbjct: 310 LAHMIFSEHDANFYALDKQLNEEAAALDWTRSKGHTLTGMNYSQYHEEND-VEDDFGVSQ 369

Query: 301 KLGGSMSYRLVNPRASSVSAAYHRLSHSSDFSSRVSPVNGESNPDENSNCQNKLEPDPDV 360
           KLGGSMS++LVN RA+SV+AAYHR++++SD SS V  V+ ESNP  NS+ QNKLEPDPD 
Sbjct: 370 KLGGSMSHQLVNARAASVAAAYHRMTNNSDCSSGVPQVSAESNP--NSSHQNKLEPDPD- 429

Query: 361 DAYQKKIEPDPDDSSNYQKKLKPDPDDNSNYQKKFEPDPYDSSNYQKKFEPDPNDSSNYQ 420
           D+   K+EPDPD SSN Q  L  D                             N+S N++
Sbjct: 430 DSVYPKLEPDPDGSSNDQNMLGLDS----------------------------NNSYNHK 489

Query: 421 KKFEPDPDDSSNYQKKFEPDPDDSSNYQKKSEPDPDDSSNYKKKLERDPDDSSNYKKKLE 480
            K EP PDDS             S N + +SEP         K L    D SS     + 
Sbjct: 490 GKLEPAPDDSIG-----------SENLESESEP------RIIKSLVVQTDLSSTEVHPVP 549

Query: 481 RDPDDSCNYKKKF-EPDPDDISDYQKKFEPDPDDISDYQKKFEPDPDDISDYQKKF-EPD 540
                     K + EPD DD          D D +S   +       D + +Q+   EPD
Sbjct: 550 ATNSRLLEATKSYGEPDLDDRGSSSNSKVIDTDHLSQGMQNL-----DCNIFQRMIVEPD 609

Query: 541 PDDISDYQKKLESDPDDSSDYQKKFEPDPDDSSDYQKKFEPDPDDSSDYQKKYEPDPDDS 600
           PD + +    L S      +                   E D  ++   +        + 
Sbjct: 610 PDALGEKVNTLASGRAIGHN-------------------ETDCLEAGLVK--------NQ 669

Query: 601 SNYEANCLEAGLV--TEPMQTEPDPDESSVHQADLSKMVVDEPNPDDQEIQRIQDSVSVV 660
           S+   NC +   +   EPMQ EPDPDES VHQ D SKM VD+ +PDDQEIQRIQDSVSVV
Sbjct: 670 SHLSINCKKHDTIQGEEPMQIEPDPDESLVHQVDSSKMAVDQLDPDDQEIQRIQDSVSVV 729

Query: 661 CIRLREAIARLLAEVRPSESAAVFQTLFKIVRNVIEHPGDMKYRKFRKANPTIQKNVANY 720
           C RLREAI +LLAEV+PSES+AV QTLFKIV+NVIEHP +MKYRK RKANP IQKNVANY
Sbjct: 730 CNRLREAITKLLAEVKPSESSAVVQTLFKIVKNVIEHPDEMKYRKLRKANPIIQKNVANY 759

Query: 721 KAALEILFLIGFIEDVLLNEMGNAETFLVLKRNDPGLLWLAKSTLETCNAL 767
           +AALEILFLIGFIED LL+E+G AETFLVLKRNDPGLLWLAKSTLETCNAL
Sbjct: 790 EAALEILFLIGFIEDALLDEIGKAETFLVLKRNDPGLLWLAKSTLETCNAL 759

BLAST of CmoCh02G000530.1 vs. NCBI nr
Match: gi|659112962|ref|XP_008456391.1| (PREDICTED: uncharacterized protein LOC103496343 [Cucumis melo])

HSP 1 Score: 820.5 bits (2118), Expect = 2.5e-234
Identity = 487/768 (63.41%), Postives = 547/768 (71.22%), Query Frame = 1

Query: 1   MEQQHSMYNLPVLWRGTKYVVEISSDSTLRDLGEKLLKLTEVQADTMRLIVPQFSSKSSK 60
           MEQQH +YN+PVLWRGTKY+VEISSDSTLRDLG++LLK+TEV+ADTMRLIVPQFSSKSSK
Sbjct: 1   MEQQHIIYNIPVLWRGTKYMVEISSDSTLRDLGQELLKITEVKADTMRLIVPQFSSKSSK 60

Query: 61  MLYPFSDEDGFLALHKISIFKDNKPIRMMGVSKNEVDEVLKNEKKNERIAGFDKEEQRLK 120
           MLYPFSDEDG LAL K SIFKDNKPIRMMGVSKNEVDEVL N KKNERI GFD+EE+RLK
Sbjct: 61  MLYPFSDEDGCLALQKFSIFKDNKPIRMMGVSKNEVDEVLNNAKKNERIVGFDEEEKRLK 120

Query: 121 QRMSSKRQGLLKLPEGPYVFCEFRTLQIPGIELNPPASEALKRMHMLAADPGIVAIMNKH 180
           QRMSSK +G+LKLPEGPYVFCEFRTLQIPGIELNP ASEALKRMHMLAADPGIVAIMNKH
Sbjct: 121 QRMSSKPRGVLKLPEGPYVFCEFRTLQIPGIELNPSASEALKRMHMLAADPGIVAIMNKH 180

Query: 181 RWRVGIMTEMAPVGYVGVSPKCILGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHEL 240
            WRVGIMTEMAP+GYVGVSPKCILGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHEL
Sbjct: 181 HWRVGIMTEMAPIGYVGVSPKCILGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHEL 240

Query: 241 AHMIYSEHDANFYALDKQLNEEAATLDWTRSKGHTLSGVSYSQYHDESDDVQDGFGVSQK 300
           AHMI+SEHDANFYALDKQLNEEAA LDWTRSK HTL+G+ YSQYH+E DDV+DGFGVSQK
Sbjct: 241 AHMIFSEHDANFYALDKQLNEEAAALDWTRSKSHTLTGMKYSQYHEE-DDVEDGFGVSQK 300

Query: 301 LGGSMSYRLVNPRASSVSAAYHRLSHSSDFSSRVSPVNGESNPDENSNCQNKLEPDPDVD 360
           LGGSMS++LVN RA+SV+AAYHR++++SD+SS V  V+ ESNP+ +SN QNKLEPDPD  
Sbjct: 301 LGGSMSHQLVNARAASVAAAYHRMTNTSDYSSGVPTVSAESNPN-SSNHQNKLEPDPDDS 360

Query: 361 AYQKKIEPDPDDSSNYQKKLKPDPDDNSNYQKKFEPDPYDSSNYQKKFEPDPNDSSNYQK 420
           AY K ++PD D +SN Q  L  D                             N+SSN++ 
Sbjct: 361 AYPK-LDPDSDGNSNDQNMLGLDS----------------------------NNSSNHKS 420

Query: 421 KFEPDPDDSSNYQKKFEPDPDDSSNYQKKSEPDPDDSSNYKKKLERDPDDSSNYKKKLER 480
           K EP  DDS             S N + + EP       + K L    D SS     +  
Sbjct: 421 KLEPASDDSIG-----------SKNLESECEP------RFIKSLVVQTDLSSTEVHPVLA 480

Query: 481 DPDDSCNYKKKF-EPDPDDISDYQKKFEPDPDDISDYQKKFEPDPDDISDYQKKFEPDPD 540
                    K + EPD DD+         D D  S   +      D  +  +   E DPD
Sbjct: 481 TNSRLLEATKLYGEPDIDDMGSSSNSKVIDTDHFSQGMQNL----DCNTSQRMVVETDPD 540

Query: 541 DISDYQKKLESDPDDSSDYQKKFEPDPDDSSDYQKKFEPDPDDSSDYQKKYEPDPDDSSN 600
            + +    L S      +     E                                + S+
Sbjct: 541 ALGEKVNTLGSGRATGHNEADCLEAGL---------------------------VTNQSH 600

Query: 601 YEANCLEAGLV--TEPMQTEPDPDESSVHQADLSKMVVDEPNPDDQEIQRIQDSVSVVCI 660
              NC +   +   EPM  EPDPDE  VHQ D SKM VD+ +PDDQEIQRIQDSVSVVC 
Sbjct: 601 LSINCKKHDTIQGEEPMLIEPDPDEGLVHQVDSSKMAVDQLDPDDQEIQRIQDSVSVVCN 660

Query: 661 RLREAIARLLAEVRPSESAAVFQTLFKIVRNVIEHPGDMKYRKFRKANPTIQKNVANYKA 720
           RLREAI +LLAEV+PSES+AV QTLFKIV+NVIEHP +MKYRK RKANP IQKNVANYKA
Sbjct: 661 RLREAITKLLAEVKPSESSAVVQTLFKIVKNVIEHPDEMKYRKLRKANPIIQKNVANYKA 689

Query: 721 ALEILFLIGFIEDVLLNEMGNAETFLVLKRNDPGLLWLAKSTLETCNA 766
           ALEILFLIGFIED LL+E+G AETFLVLKRNDPGLLWLAKSTLETCNA
Sbjct: 721 ALEILFLIGFIEDALLDEIGKAETFLVLKRNDPGLLWLAKSTLETCNA 689

BLAST of CmoCh02G000530.1 vs. NCBI nr
Match: gi|743918948|ref|XP_011003486.1| (PREDICTED: acidic repeat-containing protein isoform X2 [Populus euphratica])

HSP 1 Score: 631.3 bits (1627), Expect = 2.2e-177
Identity = 401/770 (52.08%), Postives = 516/770 (67.01%), Query Frame = 1

Query: 7   MYNLPVLWRGTKYVVEISSDSTLRDLGEKLLKLTEVQADTMRLIVPQFSSKSSKMLYPFS 66
           M+ + V+WRG K++V +++D++++DLG++L KLT+++ADTMRLIVP+FS+KSSK+L+PFS
Sbjct: 17  MFKVTVIWRGNKFIVGMNTDASVKDLGDELQKLTDIRADTMRLIVPRFSNKSSKLLFPFS 76

Query: 67  DEDGFLALHKISIFKDNKPIRMMGVSKNEVDEVLKNEKKNERIAGFDKEEQRLKQRMSSK 126
           DE   L+L + SI  + K IRM+GVS++EVD+VL+N K + RIAGFD+EE+R++QRMS K
Sbjct: 77  DEHSQLSLQEASIM-EGKFIRMLGVSEDEVDKVLQNAKVDLRIAGFDEEEKRMRQRMSEK 136

Query: 127 RQGLLKLPEGPYVFCEFRTLQIPGIELNPPASEALKRMHMLAADPGIVAIMNKHRWRVGI 186
             GLLKLP+GPY+FC+FRTLQIPG+ELNPPASEALKRMHMLAADPGIVAIMNKHRWRVGI
Sbjct: 137 PFGLLKLPQGPYIFCDFRTLQIPGVELNPPASEALKRMHMLAADPGIVAIMNKHRWRVGI 196

Query: 187 MTEMAPVGYVGVSPKCILGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMIYS 246
           MTEMAPVGYVGVSPKCILGFNK+ GEEISLRLRTDDLKGFRKYESIKKTLLHELAHM+YS
Sbjct: 197 MTEMAPVGYVGVSPKCILGFNKDLGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMLYS 256

Query: 247 EHDANFYALDKQLNEEAATLDWTRSKGHTLSGVSYSQYHDESDDVQDGFGVSQKLGGSMS 306
           EHDANFYALDKQLN+EAA+LDWT+S+GHTLSGV++   + E   V D    S KLGG++S
Sbjct: 257 EHDANFYALDKQLNQEAASLDWTKSRGHTLSGVNHQDQYSEDFYVSDSRSSSVKLGGNVS 316

Query: 307 YRLVNPRASSVSAAYHRLSHSSDFSSRVSPVNGESNPDENSNCQNKLEPDPDVDAYQKKI 366
            +L   RASSV+AAYHRL+ +S  S   S V+ E +PD++    +K EPD      + K+
Sbjct: 317 NQLAGARASSVAAAYHRLADASSNSLGASEVHEEPDPDDSIFNMHK-EPDVKGQVEKGKL 376

Query: 367 EPDPDDSSNYQKKLKPDPDDNSNYQKKFEPDPYDSSNYQKKFEPDPNDSSNYQKKF-EPD 426
           + +  D S ++   +P PD++   Q K EPDP DS     +     N      K   EPD
Sbjct: 377 DIENLDKSLWKPHHQPVPDEHPFNQNKNEPDPDDSQGNDHEVMDLLNGGIRPDKNIDEPD 436

Query: 427 PDDSSNYQKKFEPDPDDSSNYQKKS--EPDPDDSSNYKKKLERDPDDSSNYKKKLERDPD 486
           PDDS     +   D  D   Y  K+  EPDPDDS             + +    +     
Sbjct: 437 PDDSQGNHHE-AMDILDIGIYPDKTVDEPDPDDSQG-----------NHHVVMDILNGGI 496

Query: 487 DSCNYKKKFEPDPDDISDYQKKFEPD------PDDISDYQKKFEPDPDDIS-DYQKKFEP 546
             C  K   EPDPDD    Q +          PD   D     EPDPDD   +Y +  + 
Sbjct: 497 CLCPDKTIDEPDPDDSQGNQHEAMDILNGGTCPDKTID-----EPDPDDSQGNYHEVMDI 556

Query: 547 DPDDISDYQKKLESDPDDSSDYQKKFEPDPDDSSDYQKKFEPDPDDSSDYQKKY-EPDPD 606
              DI          PD + D     EPDPDDS           +D    +K Y EPDPD
Sbjct: 557 LNGDIR---------PDKTID-----EPDPDDSL-----VTGSIEDQFHLKKAYKEPDPD 616

Query: 607 DSSNYEANCLEAGLVTEPMQTEPDPDESSVHQADLSKMVVDEPNPDDQEIQRIQDSVSVV 666
           +S   +            +Q EP+PD+      ++S+M +DEP+PDD+E++RIQD VSV+
Sbjct: 617 ESETNKV-----------VQAEPNPDDDLAVSHEVSRMQIDEPDPDDEELRRIQDPVSVI 676

Query: 667 CIRLREAIARLLAEVRPSESAAVFQTLFKIVRNVIEHPGDMKYRKFRKANPTIQKNVANY 726
           C RL++A   L AE++ +E+ A  QTL KI+RNVIEHP   K+++ RKANP IQKNVA++
Sbjct: 677 CSRLQKATETLRAELKSTEATAALQTLLKIIRNVIEHPDQSKFKRLRKANPIIQKNVASH 736

Query: 727 KAALEILFLIGFIEDVLLNEMGNAETFLVLKRNDPGLLWLAKSTLETCNA 766
           +AA+EI+ ++GF E+V  +E G A+T+LVLKRNDPGLLWLAKSTLE C A
Sbjct: 737 QAAVEIVHVVGFSEEVSYDETGKADTYLVLKRNDPGLLWLAKSTLEACMA 737

BLAST of CmoCh02G000530.1 vs. NCBI nr
Match: gi|743918950|ref|XP_011003487.1| (PREDICTED: acidic repeat-containing protein isoform X3 [Populus euphratica])

HSP 1 Score: 631.3 bits (1627), Expect = 2.2e-177
Identity = 401/770 (52.08%), Postives = 516/770 (67.01%), Query Frame = 1

Query: 7   MYNLPVLWRGTKYVVEISSDSTLRDLGEKLLKLTEVQADTMRLIVPQFSSKSSKMLYPFS 66
           M+ + V+WRG K++V +++D++++DLG++L KLT+++ADTMRLIVP+FS+KSSK+L+PFS
Sbjct: 5   MFKVTVIWRGNKFIVGMNTDASVKDLGDELQKLTDIRADTMRLIVPRFSNKSSKLLFPFS 64

Query: 67  DEDGFLALHKISIFKDNKPIRMMGVSKNEVDEVLKNEKKNERIAGFDKEEQRLKQRMSSK 126
           DE   L+L + SI  + K IRM+GVS++EVD+VL+N K + RIAGFD+EE+R++QRMS K
Sbjct: 65  DEHSQLSLQEASIM-EGKFIRMLGVSEDEVDKVLQNAKVDLRIAGFDEEEKRMRQRMSEK 124

Query: 127 RQGLLKLPEGPYVFCEFRTLQIPGIELNPPASEALKRMHMLAADPGIVAIMNKHRWRVGI 186
             GLLKLP+GPY+FC+FRTLQIPG+ELNPPASEALKRMHMLAADPGIVAIMNKHRWRVGI
Sbjct: 125 PFGLLKLPQGPYIFCDFRTLQIPGVELNPPASEALKRMHMLAADPGIVAIMNKHRWRVGI 184

Query: 187 MTEMAPVGYVGVSPKCILGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMIYS 246
           MTEMAPVGYVGVSPKCILGFNK+ GEEISLRLRTDDLKGFRKYESIKKTLLHELAHM+YS
Sbjct: 185 MTEMAPVGYVGVSPKCILGFNKDLGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMLYS 244

Query: 247 EHDANFYALDKQLNEEAATLDWTRSKGHTLSGVSYSQYHDESDDVQDGFGVSQKLGGSMS 306
           EHDANFYALDKQLN+EAA+LDWT+S+GHTLSGV++   + E   V D    S KLGG++S
Sbjct: 245 EHDANFYALDKQLNQEAASLDWTKSRGHTLSGVNHQDQYSEDFYVSDSRSSSVKLGGNVS 304

Query: 307 YRLVNPRASSVSAAYHRLSHSSDFSSRVSPVNGESNPDENSNCQNKLEPDPDVDAYQKKI 366
            +L   RASSV+AAYHRL+ +S  S   S V+ E +PD++    +K EPD      + K+
Sbjct: 305 NQLAGARASSVAAAYHRLADASSNSLGASEVHEEPDPDDSIFNMHK-EPDVKGQVEKGKL 364

Query: 367 EPDPDDSSNYQKKLKPDPDDNSNYQKKFEPDPYDSSNYQKKFEPDPNDSSNYQKKF-EPD 426
           + +  D S ++   +P PD++   Q K EPDP DS     +     N      K   EPD
Sbjct: 365 DIENLDKSLWKPHHQPVPDEHPFNQNKNEPDPDDSQGNDHEVMDLLNGGIRPDKNIDEPD 424

Query: 427 PDDSSNYQKKFEPDPDDSSNYQKKS--EPDPDDSSNYKKKLERDPDDSSNYKKKLERDPD 486
           PDDS     +   D  D   Y  K+  EPDPDDS             + +    +     
Sbjct: 425 PDDSQGNHHE-AMDILDIGIYPDKTVDEPDPDDSQG-----------NHHVVMDILNGGI 484

Query: 487 DSCNYKKKFEPDPDDISDYQKKFEPD------PDDISDYQKKFEPDPDDIS-DYQKKFEP 546
             C  K   EPDPDD    Q +          PD   D     EPDPDD   +Y +  + 
Sbjct: 485 CLCPDKTIDEPDPDDSQGNQHEAMDILNGGTCPDKTID-----EPDPDDSQGNYHEVMDI 544

Query: 547 DPDDISDYQKKLESDPDDSSDYQKKFEPDPDDSSDYQKKFEPDPDDSSDYQKKY-EPDPD 606
              DI          PD + D     EPDPDDS           +D    +K Y EPDPD
Sbjct: 545 LNGDIR---------PDKTID-----EPDPDDSL-----VTGSIEDQFHLKKAYKEPDPD 604

Query: 607 DSSNYEANCLEAGLVTEPMQTEPDPDESSVHQADLSKMVVDEPNPDDQEIQRIQDSVSVV 666
           +S   +            +Q EP+PD+      ++S+M +DEP+PDD+E++RIQD VSV+
Sbjct: 605 ESETNKV-----------VQAEPNPDDDLAVSHEVSRMQIDEPDPDDEELRRIQDPVSVI 664

Query: 667 CIRLREAIARLLAEVRPSESAAVFQTLFKIVRNVIEHPGDMKYRKFRKANPTIQKNVANY 726
           C RL++A   L AE++ +E+ A  QTL KI+RNVIEHP   K+++ RKANP IQKNVA++
Sbjct: 665 CSRLQKATETLRAELKSTEATAALQTLLKIIRNVIEHPDQSKFKRLRKANPIIQKNVASH 724

Query: 727 KAALEILFLIGFIEDVLLNEMGNAETFLVLKRNDPGLLWLAKSTLETCNA 766
           +AA+EI+ ++GF E+V  +E G A+T+LVLKRNDPGLLWLAKSTLE C A
Sbjct: 725 QAAVEIVHVVGFSEEVSYDETGKADTYLVLKRNDPGLLWLAKSTLEACMA 725

BLAST of CmoCh02G000530.1 vs. NCBI nr
Match: gi|743918946|ref|XP_011003485.1| (PREDICTED: acidic repeat-containing protein isoform X1 [Populus euphratica])

HSP 1 Score: 631.3 bits (1627), Expect = 2.2e-177
Identity = 401/770 (52.08%), Postives = 516/770 (67.01%), Query Frame = 1

Query: 7   MYNLPVLWRGTKYVVEISSDSTLRDLGEKLLKLTEVQADTMRLIVPQFSSKSSKMLYPFS 66
           M+ + V+WRG K++V +++D++++DLG++L KLT+++ADTMRLIVP+FS+KSSK+L+PFS
Sbjct: 28  MFKVTVIWRGNKFIVGMNTDASVKDLGDELQKLTDIRADTMRLIVPRFSNKSSKLLFPFS 87

Query: 67  DEDGFLALHKISIFKDNKPIRMMGVSKNEVDEVLKNEKKNERIAGFDKEEQRLKQRMSSK 126
           DE   L+L + SI  + K IRM+GVS++EVD+VL+N K + RIAGFD+EE+R++QRMS K
Sbjct: 88  DEHSQLSLQEASIM-EGKFIRMLGVSEDEVDKVLQNAKVDLRIAGFDEEEKRMRQRMSEK 147

Query: 127 RQGLLKLPEGPYVFCEFRTLQIPGIELNPPASEALKRMHMLAADPGIVAIMNKHRWRVGI 186
             GLLKLP+GPY+FC+FRTLQIPG+ELNPPASEALKRMHMLAADPGIVAIMNKHRWRVGI
Sbjct: 148 PFGLLKLPQGPYIFCDFRTLQIPGVELNPPASEALKRMHMLAADPGIVAIMNKHRWRVGI 207

Query: 187 MTEMAPVGYVGVSPKCILGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMIYS 246
           MTEMAPVGYVGVSPKCILGFNK+ GEEISLRLRTDDLKGFRKYESIKKTLLHELAHM+YS
Sbjct: 208 MTEMAPVGYVGVSPKCILGFNKDLGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMLYS 267

Query: 247 EHDANFYALDKQLNEEAATLDWTRSKGHTLSGVSYSQYHDESDDVQDGFGVSQKLGGSMS 306
           EHDANFYALDKQLN+EAA+LDWT+S+GHTLSGV++   + E   V D    S KLGG++S
Sbjct: 268 EHDANFYALDKQLNQEAASLDWTKSRGHTLSGVNHQDQYSEDFYVSDSRSSSVKLGGNVS 327

Query: 307 YRLVNPRASSVSAAYHRLSHSSDFSSRVSPVNGESNPDENSNCQNKLEPDPDVDAYQKKI 366
            +L   RASSV+AAYHRL+ +S  S   S V+ E +PD++    +K EPD      + K+
Sbjct: 328 NQLAGARASSVAAAYHRLADASSNSLGASEVHEEPDPDDSIFNMHK-EPDVKGQVEKGKL 387

Query: 367 EPDPDDSSNYQKKLKPDPDDNSNYQKKFEPDPYDSSNYQKKFEPDPNDSSNYQKKF-EPD 426
           + +  D S ++   +P PD++   Q K EPDP DS     +     N      K   EPD
Sbjct: 388 DIENLDKSLWKPHHQPVPDEHPFNQNKNEPDPDDSQGNDHEVMDLLNGGIRPDKNIDEPD 447

Query: 427 PDDSSNYQKKFEPDPDDSSNYQKKS--EPDPDDSSNYKKKLERDPDDSSNYKKKLERDPD 486
           PDDS     +   D  D   Y  K+  EPDPDDS             + +    +     
Sbjct: 448 PDDSQGNHHE-AMDILDIGIYPDKTVDEPDPDDSQG-----------NHHVVMDILNGGI 507

Query: 487 DSCNYKKKFEPDPDDISDYQKKFEPD------PDDISDYQKKFEPDPDDIS-DYQKKFEP 546
             C  K   EPDPDD    Q +          PD   D     EPDPDD   +Y +  + 
Sbjct: 508 CLCPDKTIDEPDPDDSQGNQHEAMDILNGGTCPDKTID-----EPDPDDSQGNYHEVMDI 567

Query: 547 DPDDISDYQKKLESDPDDSSDYQKKFEPDPDDSSDYQKKFEPDPDDSSDYQKKY-EPDPD 606
              DI          PD + D     EPDPDDS           +D    +K Y EPDPD
Sbjct: 568 LNGDIR---------PDKTID-----EPDPDDSL-----VTGSIEDQFHLKKAYKEPDPD 627

Query: 607 DSSNYEANCLEAGLVTEPMQTEPDPDESSVHQADLSKMVVDEPNPDDQEIQRIQDSVSVV 666
           +S   +            +Q EP+PD+      ++S+M +DEP+PDD+E++RIQD VSV+
Sbjct: 628 ESETNKV-----------VQAEPNPDDDLAVSHEVSRMQIDEPDPDDEELRRIQDPVSVI 687

Query: 667 CIRLREAIARLLAEVRPSESAAVFQTLFKIVRNVIEHPGDMKYRKFRKANPTIQKNVANY 726
           C RL++A   L AE++ +E+ A  QTL KI+RNVIEHP   K+++ RKANP IQKNVA++
Sbjct: 688 CSRLQKATETLRAELKSTEATAALQTLLKIIRNVIEHPDQSKFKRLRKANPIIQKNVASH 747

Query: 727 KAALEILFLIGFIEDVLLNEMGNAETFLVLKRNDPGLLWLAKSTLETCNA 766
           +AA+EI+ ++GF E+V  +E G A+T+LVLKRNDPGLLWLAKSTLE C A
Sbjct: 748 QAAVEIVHVVGFSEEVSYDETGKADTYLVLKRNDPGLLWLAKSTLEACMA 748

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
YQ77_SCHPO1.0e-2229.03Ubiquitin and WLM domain-containing metalloprotease SPCC1442.07c OS=Schizosaccha... [more]
Match NameE-valueIdentityDescription
A0A0A0KF59_CUCSA2.7e-23563.81Uncharacterized protein OS=Cucumis sativus GN=Csa_6G169280 PE=4 SV=1[more]
M5X3T8_PRUPE2.0e-17752.73Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002194mg PE=4 SV=1[more]
W1NQ89_AMBTC2.3e-17049.56Uncharacterized protein OS=Amborella trichopoda GN=AMTR_s00074p00152100 PE=4 SV=... [more]
A0A067K749_JATCU9.8e-16952.07Uncharacterized protein OS=Jatropha curcas GN=JCGZ_13776 PE=4 SV=1[more]
A0A0K9P5J0_ZOSMR3.9e-14146.19Uncharacterized protein OS=Zostera marina GN=ZOSMA_379G00080 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G35690.11.3e-11153.12 WLM (InterPro:IPR013536), PUB domain (InterPro:IPR018997), PUG domai... [more]
AT1G55915.15.7e-1130.15 zinc ion binding[more]
Match NameE-valueIdentityDescription
gi|778713704|ref|XP_004143191.2|3.9e-23563.81PREDICTED: uncharacterized protein LOC101220832 [Cucumis sativus][more]
gi|659112962|ref|XP_008456391.1|2.5e-23463.41PREDICTED: uncharacterized protein LOC103496343 [Cucumis melo][more]
gi|743918948|ref|XP_011003486.1|2.2e-17752.08PREDICTED: acidic repeat-containing protein isoform X2 [Populus euphratica][more]
gi|743918950|ref|XP_011003487.1|2.2e-17752.08PREDICTED: acidic repeat-containing protein isoform X3 [Populus euphratica][more]
gi|743918946|ref|XP_011003485.1|2.2e-17752.08PREDICTED: acidic repeat-containing protein isoform X1 [Populus euphratica][more]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
IPR013536WLM_dom
IPR018997PUB_domain
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
CmoCh02G000530CmoCh02G000530gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
CmoCh02G000530.1CmoCh02G000530.1-proteinpolypeptide


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmoCh02G000530.1.three_prime_UTR.1CmoCh02G000530.1.three_prime_UTR.1three_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmoCh02G000530.1.CDS.10CmoCh02G000530.1.CDS.10CDS
CmoCh02G000530.1.CDS.9CmoCh02G000530.1.CDS.9CDS
CmoCh02G000530.1.CDS.8CmoCh02G000530.1.CDS.8CDS
CmoCh02G000530.1.CDS.7CmoCh02G000530.1.CDS.7CDS
CmoCh02G000530.1.CDS.6CmoCh02G000530.1.CDS.6CDS
CmoCh02G000530.1.CDS.5CmoCh02G000530.1.CDS.5CDS
CmoCh02G000530.1.CDS.4CmoCh02G000530.1.CDS.4CDS
CmoCh02G000530.1.CDS.3CmoCh02G000530.1.CDS.3CDS
CmoCh02G000530.1.CDS.2CmoCh02G000530.1.CDS.2CDS
CmoCh02G000530.1.CDS.1CmoCh02G000530.1.CDS.1CDS


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmoCh02G000530.1.exon.10CmoCh02G000530.1.exon.10exon
CmoCh02G000530.1.exon.9CmoCh02G000530.1.exon.9exon
CmoCh02G000530.1.exon.8CmoCh02G000530.1.exon.8exon
CmoCh02G000530.1.exon.7CmoCh02G000530.1.exon.7exon
CmoCh02G000530.1.exon.6CmoCh02G000530.1.exon.6exon
CmoCh02G000530.1.exon.5CmoCh02G000530.1.exon.5exon
CmoCh02G000530.1.exon.4CmoCh02G000530.1.exon.4exon
CmoCh02G000530.1.exon.3CmoCh02G000530.1.exon.3exon
CmoCh02G000530.1.exon.2CmoCh02G000530.1.exon.2exon
CmoCh02G000530.1.exon.1CmoCh02G000530.1.exon.1exon


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR013536WLM domainPFAMPF08325WLMcoord: 150..324
score: 3.5
IPR013536WLM domainPROFILEPS51397WLMcoord: 131..325
score: 36
IPR018997PUB domainPFAMPF09409PUBcoord: 677..749
score: 1.2
IPR018997PUB domainSMARTSM00580PGNneucoord: 677..747
score: 5.2
IPR018997PUB domainunknownSSF143503PUG domain-likecoord: 654..763
score: 1.06
NoneNo IPR availableunknownCoilCoilcoord: 102..122
scor
NoneNo IPR availableGENE3DG3DSA:3.10.20.90coord: 3..96
score: 6.5
NoneNo IPR availablePANTHERPTHR23153UBX-RELATEDcoord: 638..761
score: 3.5E-122coord: 115..293
score: 3.5E
NoneNo IPR availablePANTHERPTHR23153:SF36SUBFAMILY NOT NAMEDcoord: 115..293
score: 3.5E-122coord: 638..761
score: 3.5E