CSPI07G15020 (gene) Wild cucumber (PI 183967)

NameCSPI07G15020
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionUPF0361 C3orf37-like protein
LocationChr7 : 13698487 .. 13706513 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GTGGGAATGTAAGAAAATGGATGTGTTCATCGGTTTGCTTCTTCCCGCCTTTTGCGTTTCTTCACGTCCAGAACTTTCCGTTCAGGCTGAGCTGAGCACCTTCTCCAAGGCAAGGCTTCCGCCATAGCCACCCCAACAGTCAAAGCTTCTCAACTCCACTCTTCCTTTCCCGTAGAATACTACAAAGACTGTAGCAAAGCATATTAGAGACGGATGTGCGGAAGAGCCCGTTGTACTCTTCGAGCTGATGACATCACCAGGGCCTGCCACTGCACCGGCGGCCCCGTACGCTCCCTCAACATGGACCGGTAACATCATTCACTTTTTTTTTTTTTCTCATTTCGTTCTGCTTAACTGTATTTCAGTGTTCTGTCTCTATTTCATGTCTCGTGCTAGTGGGAATGTTCTGAACTTACTGGCTTCGATTCTTTGTTAATTTATTTAGTTTTCGTCCGCTGTTCAATGCCTCCCCGGGCTCCGATTTACCGGTTGTTCGTCGAGACGATGAATCTAGTGATGGAGGAGTCGTCCTTCAGTGCATGAAATGGGGGCTTATTCCTAGTTTTACTGAGAAATTCGAGAAACCTAATTACTTCAAGATGGTGATTTTTCCACTTTTGTCTTTTGGCCTTTACCTGATTGTGTTGTGTAGAGTCTTATTTTGAAATTACAGTTACTATTTTTTGTTACCTATGATTAGTAAGCATTGTAGGCCGTGTGATTTTCGACTAGAGAATGGGTACTATTGTTCAAAATATCAATGACCTTACTCGAAAGAGATCTATTCTATAGGTGGTTTTACAATTGTTTTTTGTTCTAAAATCTTGTACTCTGTGTATATATATCTTTAGGCAAGCAATGATTTAATATGTATAAAAAATTCACAAGACCATGGATTAGAAAGGGCCAATGGAGTTACATACACTGCGAGCTTGCCTTGTCTTGATGTGTTTTGTTACAATTCTAGTACTATTTGTTCTTGCAAGTCCTATTCACACGTTGCTTTGTATATTTGATGTAGTTGATTTAGTAGAACTTTTGTATTTTTTCTTGTTAAAATAAATCTTGGTTTTTGCTTTAGAAAAAAGTAGGTAATGGATATTCTTGGTTTTTTGCACAACACAGGGGAGCCTTTTTGTATCAAAAGTATCGAAAGGGCTCTCCTGTGTTGCGGAAATATTTTTTAGGCCAACTGGAAATCCTTATAATCCTTTGGCTTGGACCTCTTTCCCCTTTTGTTGTTTTATTCATTAATGAAAATCGTTTCTTCTTTTTTGAGAAAAAAAATGTCTTGTTCTTTGATGTCACTGTAGCTTTGTCATGTCTGTTTTACTATCCTACTCTTGTCCCTTGAGAAACAAACTTTTCATTAATATAATAAAATGAGACTATTACTGAAGATACAAAGTTCAAAATGCATCCGTACTAATTGAACATATAAATGTAGAGATGTCGGGATCAACAAGTGGACCTGGACATTTCAGCTAGGTTGATACACCAATGCTCCATTTTTTCAAAATTCTTTTGAAAAATTTAAAATTTCAGTCAAGGCTCGGAAAATCTTGAATAGGCTTGAAATCATAAAATGGGATATCTTAAAAGGCATTTTAAAAGTTTTATGAAAATATTAGCCTTGCTTCTTAGACTCCTACGCAAATGATCAGATCTTGACTTGATTGTAAAGAAAAGTAAAGCTTTTTTTGTGATTACAACCTTACTTTTCTTATTGCACAAACGAATAACTTCATGCAATCTCATTGATTTTGTGCCCTTTTGTAATTTCACAATATCAATGAAATTATGATTGTCTGTTTCTCATATAAAAAAGTAGATCTCCAAAGGTTTTTGTTTAGGTGAAAGTTTGGATTCAAGGGTCTCAAAGGTTTACCCTCCTCCAAGCCACCTTGTCTAGCCTTCCCACATATTACCTATCACTTTACAAAATGCCATTGAAGGTGATTTCTAAAATTGAAAAATTATACAGGTCATTTTTGTGGAAAGGTGGCACTTAGAACATTAAATGGAATAAGGTTATCAAACCTAATTCCTAAGGGGGTTTGGGAAGACAAAGCATTGGACACAAAAACAAGTCTCTCCTTGCAAAATGGATTTGGTGATACCACCATGAAGAAAATGCACTTTGGAAACAAATCATTACAGCAGAATACGGTGAGACTTCTCCCTCCCATTGGCCCAACACATCTTTCACTTATACAATCAAAGCACCATGGAGATCCATCCTTCAACACCTTGATCTCATAAAGAATAGAATCACACCTCCTTTTGGAGTGATTGTTGGATTGGTAATGCCCTTTATCCTCTTTCGCCGCCAGTACAACTTGGCCACAAAGAAATTGTCTATTATGAGCAACTTTTCGAAAGATAATCATGGATGGAGATCTCTTATAGGAGAAACTTGGAAGAGAAAGAACCTGAAGATTGTTTGGCTTTAATGAATCTCGTATTCATTGTCAATCCTACTACATCTATAGACAAATGGACTTGGAAACTTGACAAAAATGGGAACCTCTCCTTCAAATGTACAATGGATTTGGACACAAAGGATGTGGAAACAAACTCTTCCCTTTTCAATGCAATTTGGACTGACTTCTACCCAAAAAAAAAAAAAATCAAATTCTTCTCGTGGGAAATTGGACACAATGCTATCAACACAAATGATCAAATGTAGAGGTGCCACCGTTGTCTTGAGCATGCTTGTCCAGGGCAAAGTTTGCCTATGATCAAATGCCCAAACCATCCTCAACCACCTTATGGTTATGTTTTGACCTTAGTCTAACATGTTGATAACATGTGAAGAGAGGCTACATCATATAGCGGGATATTCGCAACTTTCGAATTTCATATGTCTGACTTGCTCACCGTGTTAGAGCAAGTGCCACACAATCGCCTGCCTCGTATTCTTGTGATTGTGACAACCGGCACCCACACATAGCCCTTTTGCCCCAGTGCTGCTACCTTTGCTTATCGGATGGTGAGACCTCAGCCATCTCTTCTTTCATTGCTATTTTGCTAAACAATTTTGGAAGGAAATCACCATCTCAACCGGCTGGTCAACCCTTCTCCCATTGGCATTCAAAGCATTCTCACATAGCCCATGGAGTACCTTTTTATCAAACACAAAAAAATCATATGGTCTCATCTCATCCGTGCTTTACTATGGACTATTTGGAAAGAAAGAAATAACCATACTTTCAAAGAAAAAGAAAGCTCCTACGACACAATATGGGAACTTTCTATCTACGTAGCCATGACTTGGTGTAAATGTACTTTTTTTAATGCATTTAATCTCACTTCTCTTATAATCAGTCGGAGCAATCTGCTGTAACTTTTTTGGTTTGGGCTCTTGCCCCCCTTTATACTTTCATACATCAATGAAAGATATTTCTTATAAAAAACAAAATCTAGCTTTTGTTCTTTTGTGAATTTAAAGTAATGATTTAATGCTTTTGCCTTTTGCCTACACTACAAGGAATTTTGGAACTTAGAGAAATTTAATTATTTGATAGTTGATACTTAATGATTTTGGTAAGATACTCTAATTTAGTAGGGTGTATGGTTATAGTAGTATTTTGTCTTTAGTTTATGTCCAAGAACATACACCTTAGTTTGGCTAGTGTATTAGTGTCCAACATGTGTGGGGCAGTTGGACTTTCACCTAAACCTTATTGAGCATGTATTCGATACTTGTTAGCATGATAGACATGTGTTAGACACTAGTTGTACAGAACCCATATAGATTCAACATCTGTTAGGCACGTATGAACTCTTGGTAAGTATGCTAGATAGATGCATAGTAATATAAGACAAAGAAAGTATGAAATCAAAATTCATCAAACTCAGTTTTAGCCTCATGAATGCATACGCCTGTTGACTTCCAATTGTCTTGTATGAAAATGATATATATTTTAATAAAAGTGTCGTTTATGAATCTTAGATTTTAAAAAATTGATGTGTGATTGGGTTCGTATCGTATTACTTTTGTGTCTCACATCCATAACCATGGTTATTAGGTTTGTGATAGAAATTTGTATATTTAAGATGAATGTTTTATGATAAATAAACTATGAACACCAGCCAATATATGTTTGCCTTATTATCTGGTATCTGCAATGTGTAGTTCAATGCTCGCTCAGAGTCCATACATGAAAAGGCCTCTTTTCACCGTCTAGTTCCTAAAAGAAGGTGCCTTGTGGCAGTGGAAGGGTATGCTTTTCGAGAACCTATTTATATGAACGTGTGTTTGTGTGCATCTGTCTTTACTCTTGCACAAACACCCTTATAACCTCAGTTCTATGGTAGGTTCTACGAGTGGAAAAAGGACGGATCAAAAAAGCAGCCGTATTATATTCATTTTAAGGATGGGCAGCCACTTGCTCTTGCTGCTTTATATGATTGTTGGGAAAACCTTGAAGGCATGTTTCTTTTTTATCGAGCAAAAGATAACTTTGCCCTAATACTTTGTTTTCACTGTATCAATGTTTTGATCTGTTTCCTAAAAGAAAAAGGTTTAAATTTCTTGTTCTAGATTCAGTTGGCCCATCCATTTGTTTTAAACTTCGTTACTGGATACTGACACTGATTTATTTTAGTTTTCTTCAATATCTGGTAACCAAGATATCTGGATTTGATATGTGATATATTGACCAGATATTGAATGTACTTCTTAGGAAAACCTATAATGTTTTTTATTAATCAAAGGATTGATTTATTGCTGTTTTTGCAGGTGAATTACTTTACACTTTCACCATTCTAACAACTTCATCATCTCCAGCTTTGAAGTGGTTGCACGGTCAGTCCTATCTTTTCTTAGTGTTGTCTTAGATCACGTTTACAACTGAATTTGAATATTGATAGGGACATGATGGTAGTTATTAGTGGTTGCAATGCAATTATTTCTAACTTGAAATTGTTGAGATGTTGAGAATCTCACAATGGAAAAAAGAAAATACCATTTTCTTGTTTGAGTTCTTTCCAGGACTCACATTCTTTATAAGATAGATGAACTACTCCTCTCATTGCCAATTGATTTTGAGATGGAACTTCATACTATCAAATATGGTATCAGAGTCCATTAAGTCTAAACGGGTAGTTGGTCCAAAATTGGTGAACTCAAAGAGGCATCATCTTGTGAAGGCATGTTGAGATGTTTAGAATTCTATCTTGGAAAAACCAAGGGGGACTCACACTGTTTATAAGATAGATGAGCTACTCATCTCATTGCCAATTGATTTTGAGATAGAGCCCCATGCTATCAAATAAACTGTGGGGTGGTTTGAGGATTATAAGGAATTCTTACCAGCCTTGATAAAAAAACGTAAGAGTGTTAGGACATAGGTGATAGATTTAAGTATCCTCTGGGAAACAATTAGTAGAAGCACATTTATTTCTAGACTTCCAAACATCCCATAAAATAGTAGTCATTTTTTTTCTCTTGCAAACTTTTCACTCAAATCATATGCAAGGAAGAAGAATTTGCAAAGGAAAGACTAGGAGCTGCCTACTTGAAGTTACTTTCAAATGATATACCAAAAACTCTCCCAAATTCTTGGTTTAAGGGGCAGTAATGAATAAATATTCTACTTTCTCTCCTTTTTGAAAATGTAACATATATAGGTTAGGCTAACCCACACGTCTTTGGAGCATATAATTTGTTTAGGCAAGAAATGGATGTCCATTAATATTAAAAGAAGTGAATGTGATATCCACCATATCTAGTCACCAAGCCTTGTAAAAATTATTGCCGATTGGCACTAAATTGATTGAATAATAAATTTTAGATGTGGGAGAATTTGATTGTTCAGTGGCTGATGGGTGATCTGTTTAGATAGGATGCCTGTAATATTGGGTGACAAAGAACGTATGGATATGTGGTTGAATGATTCTTCATCGTCCAAGTATGATTCCGTCCTTAAACCATACGAGGCTCCTGATTTGGTAAGAACTTTTTGAAAAGAGAACAATGACTTATGTTAATAATTCATTAATGTTTAAAATGTAATTGTTAGGTATGGTACCCTGTAACTCCTTCCATGGGAAAGCCTTCATTTGATGGGCCAGACTGCATCAAGGAGGTATCTCATCTCACTCGTTCTTTTTGAATTGGTTGATAGCTTTTTCATCCTCGTTGAAGTTTAAAATTTATAGCATGTTACTGGAGAAAATTTGAGTTACGTTACTCTTGTAGATTAAGTGATGATTTAGTTTCTTTATGAACAATGTATCTGAGGTCACATATTTGTTTATTCACGTTTCTCGCCTCCAGTGGAGCACTTCAGCTAGTTTGTTTGTTTGTTCCTTCTAAGCTTTATTTTGTTTTCTGGTTTATCATGAGTAATGAAGGTATATATTTATCTGTACAGAAATGTACTGCTGTGGTTATTGAAGTAGTGGATATAATAAGTAGATAAATCTCATTTTAATGCCTCTCTTTGGGCTTTTGCCAGAAATTACTCGTGCATTGATGTGATTTCTTCTTGTTCTCGATATTTAGATAAAATGCTTGCGCGCCAAAATGACATTAAGTTTGTTGGTAGCAATAAAAGATTACTTAAATATCTCTCACTATTCTTCTTAAATATTAACTTCTTACAATTTTAAGTTCTGAATTAAAATAAAGAAATTAAAGCTTGAACGTGGCACAATATGCTTGTGCCATGTGGAGGTCTGTAGATACAAGTCTGGAAAGTTTTGCTCGAATGGAAATAACTAGGATTTATGTTCACATTGAATTTCTCTGTGTAGGTTTAACACATTCACAATGATGATGTGGGATGGGTATTAAAATTTGACCTTGAGGAAGGTTATAAATGTCGTTTGCTATTATGAGACGAAAATAGGGAAGAATCATACTCATTCTTTCCATATATATATAGGCATGTCTGCATATATGTGTCTTGAATTAAAAATAATAGCAAACGTGAACATAACTTAATTGACGTATTGAAACGTTAGCATAAGAGAGTTTGTAGTCCGATTCTCCCACCCTTTTTGTACTGAAAAGAAAAGATTAAAATTACAGTGGACAAATGATTAATAAGCATATTTCTCATACCGTAACATCTTTATCTACTAATTTATTCCCAAATTGGCCCTTCTGTCATTTATTAGAAAGGAGCTGACATTTTTTCTAATGGCCTTTTGTATCAGTCAAATGAACACGAAACTCTCTAACGTCATCTGTTTTTCATCTTTTGTTGAATACTGACCTTATTTTATAAAATATTTGACATGTATTCTGCAGAAGTTACTTTGATTTTAGTGTTCTTTGGAAATGATCGTGAATGTAGCATTAAGCAATATGCTTTTATTTCAAATGATAGTCACAGTCCTTGTTGATTGATTTACATTAACAATTAACCTTATATAGATCCAAAAAGTAACATTATTATGTCTTCTTTGTAGATACAGCTAAAGAATGATGGAAGCAATCTCATCTCCAAATTTTTCTCTGCAAAAGAAACAAAAAAGGAATATTCGGTCTCACAAGAGAAAACTTGCTCTAACACATCTGTGAAGCCCGAGGCATCTCCAAGTCTAGAAGAGCACAAAAGAGAAGTAAATCGTGGAGCTTCATCTGAAGAATCAAAGGATTGTCTTGCAAAGTGTTCATCTGATACTTCACTAACATATCAAATAAAACGAGATCGTGAAGACATCTCATCCGACTTGAAAAGTGGCATGGACGACTACAGCAAGGTAGGCAGCAGTCCAAAGATACGGAAGAAGGGAAACCTGAAAACTGGTAATGACAACCAATTAACCCTCTTTTCATACTTTGGAAAGAAATAGATAGGCCTGCTTTGTTTCAAAACAGACAGGTGTGCGCTGCATCTCATATGTTTATATATGCCATTTATTTTGTTTATCTTGGTGTGTTAGTTGCTGCTGACTGCTGAGGTACGTGGAAGGTTTTTATTTTTTTTTAATGAACATTTCGATTGGGTTAAATCTTAAATGCAGTGCATTTTTTGTTGTATAAAGGGCAGTCTTCTGTAGCTTAGAAGGGC

mRNA sequence

ATGTGCGGAAGAGCCCGTTGTACTCTTCGAGCTGATGACATCACCAGGGCCTGCCACTGCACCGGCGGCCCCGTACGCTCCCTCAACATGGACCGTTTTCGTCCGCTGTTCAATGCCTCCCCGGGCTCCGATTTACCGGTTGTTCGTCGAGACGATGAATCTAGTGATGGAGGAGTCGTCCTTCAGTGCATGAAATGGGGGCTTATTCCTAGTTTTACTGAGAAATTCGAGAAACCTAATTACTTCAAGATGTTCAATGCTCGCTCAGAGTCCATACATGAAAAGGCCTCTTTTCACCGTCTAGTTCCTAAAAGAAGGTGCCTTGTGGCAGTGGAAGGGTTCTACGAGTGGAAAAAGGACGGATCAAAAAAGCAGCCGTATTATATTCATTTTAAGGATGGGCAGCCACTTGCTCTTGCTGCTTTATATGATTGTTGGGAAAACCTTGAAGGTGAATTACTTTACACTTTCACCATTCTAACAACTTCATCATCTCCAGCTTTGAAGTGGTTGCACGATAGGATGCCTGTAATATTGGGTGACAAAGAACGTATGGATATGTGGTTGAATGATTCTTCATCGTCCAAGTATGATTCCGTCCTTAAACCATACGAGGCTCCTGATTTGGTATGGTACCCTGTAACTCCTTCCATGGGAAAGCCTTCATTTGATGGGCCAGACTGCATCAAGGAGATACAGCTAAAGAATGATGGAAGCAATCTCATCTCCAAATTTTTCTCTGCAAAAGAAACAAAAAAGGAATATTCGGTCTCACAAGAGAAAACTTGCTCTAACACATCTGTGAAGCCCGAGGCATCTCCAAGTCTAGAAGAGCACAAAAGAGAAGTAAATCGTGGAGCTTCATCTGAAGAATCAAAGGATTGTCTTGCAAAGTGTTCATCTGATACTTCACTAACATATCAAATAAAACGAGATCGTGAAGACATCTCATCCGACTTGAAAAGTGGCATGGACGACTACAGCAAGGTAGGCAGCAGTCCAAAGATACGGAAGAAGGGAAACCTGAAAACTGGTAATGACAACCAATTAACCCTCTTTTCATACTTTGGAAAGAAATAG

Coding sequence (CDS)

ATGTGCGGAAGAGCCCGTTGTACTCTTCGAGCTGATGACATCACCAGGGCCTGCCACTGCACCGGCGGCCCCGTACGCTCCCTCAACATGGACCGTTTTCGTCCGCTGTTCAATGCCTCCCCGGGCTCCGATTTACCGGTTGTTCGTCGAGACGATGAATCTAGTGATGGAGGAGTCGTCCTTCAGTGCATGAAATGGGGGCTTATTCCTAGTTTTACTGAGAAATTCGAGAAACCTAATTACTTCAAGATGTTCAATGCTCGCTCAGAGTCCATACATGAAAAGGCCTCTTTTCACCGTCTAGTTCCTAAAAGAAGGTGCCTTGTGGCAGTGGAAGGGTTCTACGAGTGGAAAAAGGACGGATCAAAAAAGCAGCCGTATTATATTCATTTTAAGGATGGGCAGCCACTTGCTCTTGCTGCTTTATATGATTGTTGGGAAAACCTTGAAGGTGAATTACTTTACACTTTCACCATTCTAACAACTTCATCATCTCCAGCTTTGAAGTGGTTGCACGATAGGATGCCTGTAATATTGGGTGACAAAGAACGTATGGATATGTGGTTGAATGATTCTTCATCGTCCAAGTATGATTCCGTCCTTAAACCATACGAGGCTCCTGATTTGGTATGGTACCCTGTAACTCCTTCCATGGGAAAGCCTTCATTTGATGGGCCAGACTGCATCAAGGAGATACAGCTAAAGAATGATGGAAGCAATCTCATCTCCAAATTTTTCTCTGCAAAAGAAACAAAAAAGGAATATTCGGTCTCACAAGAGAAAACTTGCTCTAACACATCTGTGAAGCCCGAGGCATCTCCAAGTCTAGAAGAGCACAAAAGAGAAGTAAATCGTGGAGCTTCATCTGAAGAATCAAAGGATTGTCTTGCAAAGTGTTCATCTGATACTTCACTAACATATCAAATAAAACGAGATCGTGAAGACATCTCATCCGACTTGAAAAGTGGCATGGACGACTACAGCAAGGTAGGCAGCAGTCCAAAGATACGGAAGAAGGGAAACCTGAAAACTGGTAATGACAACCAATTAACCCTCTTTTCATACTTTGGAAAGAAATAG
BLAST of CSPI07G15020 vs. Swiss-Prot
Match: HMCES_XENTR (Embryonic stem cell-specific 5-hydroxymethylcytosine-binding protein OS=Xenopus tropicalis GN=hmces PE=2 SV=1)

HSP 1 Score: 159.1 bits (401), Expect = 9.2e-38
Identity = 104/298 (34.90%), Postives = 156/298 (52.35%), Query Frame = 1

Query: 1   MCGRARCTLRADDITRACHCT---GGPV----RSLNMDRFRPLFNASPGSDLPVV----- 60
           MCGR  CTL  DD+ +AC      GG      R  + D+++P +N SP S+ PV+     
Sbjct: 1   MCGRTACTLAPDDVRKACTYRDKQGGRKWPNWRDGDSDKYQPSYNKSPQSNSPVLLSLKH 60

Query: 61  -RRDDESSDGGVVLQCMKWGLIPS-FTEKFEKPNYFKMFNARSESIHEKASFHRLVPK-R 120
            ++D +SS+   VL  M+WGLIPS F E       +K  N RS+++ EKA +   + K +
Sbjct: 61  FQKDADSSER--VLAAMRWGLIPSWFNEPDPSKMQYKTNNCRSDTMTEKALYKASLFKGK 120

Query: 121 RCLVAVEGFYEWKKDGSKKQPYYIHFK----------------DGQPL-ALAALYDCWEN 180
           RC+V  +GFYEW++  S+KQPYYI+F                 +GQ L  +A L+DCWE 
Sbjct: 121 RCVVLADGFYEWQRQNSEKQPYYIYFPQIKAEKSPAEQDITDWNGQRLLTMAGLFDCWEP 180

Query: 181 LEG-ELLYTFTILTTSSSPALKWLHDRMPVILGDKERMDMWLNDSSSSKYDSVLKPYEAP 240
             G E LY++T++T  SS  + W+HDRMP IL   E +  WL+       D++   +   
Sbjct: 181 PNGGETLYSYTVITVDSSKTMNWIHDRMPAILDGDEAVRKWLDFGEVPTKDALKLIHPIE 240

Query: 241 DLVWYPVTPSMGKPSFDGPDCIKEI---QLKNDGSNLISK----FFSAKETKKEYSVS 259
           ++ ++PV+  +     + P+C+  I   Q K    +  SK    +   K  KKE S S
Sbjct: 241 NITYHPVSTVVNNSRNNTPECMAAIILTQKKGPALSASSKKMLDWLQNKSPKKEESHS 296

BLAST of CSPI07G15020 vs. Swiss-Prot
Match: HMCES_XENLA (Embryonic stem cell-specific 5-hydroxymethylcytosine-binding protein OS=Xenopus laevis GN=hmces PE=2 SV=1)

HSP 1 Score: 155.6 bits (392), Expect = 1.0e-36
Identity = 101/298 (33.89%), Postives = 152/298 (51.01%), Query Frame = 1

Query: 1   MCGRARCTLRADDITRACHCTGGPVRSL-------NMDRFRPLFNASPGSDLPVV----- 60
           MCGR  CTL  DD+++AC       R         + D+++P +N SP S+ PV+     
Sbjct: 1   MCGRTACTLAPDDVSKACSYQDKQGRQKCPKWRDGDTDKYQPSYNKSPQSNNPVLLSLKH 60

Query: 61  -RRDDESSDGGVVLQCMKWGLIPS-FTEKFEKPNYFKMFNARSESIHEKASFHR-LVPKR 120
            ++D +SS+   VL  M+WGLIPS F E       +K  N RS++I EKA +   L   R
Sbjct: 61  FQKDADSSER--VLAAMRWGLIPSWFNELDPSKMQYKTNNCRSDTITEKALYKAPLFKGR 120

Query: 121 RCLVAVEGFYEWKKDGSKKQPYYIHFK----------------DGQPL-ALAALYDCWEN 180
           RC+V  +GFYEWK+   +KQPYYI+F                 +GQ L  +A L+DCWE 
Sbjct: 121 RCVVLADGFYEWKRQDGEKQPYYIYFPQIKSEKFPEEQDMMDWNGQRLLTMAGLFDCWEP 180

Query: 181 LEG-ELLYTFTILTTSSSPALKWLHDRMPVILGDKERMDMWLNDSSSSKYDSVLKPYEAP 240
             G E LY++T++T  SS  +  +HDRMP IL   E +  WL+    S  D++   +   
Sbjct: 181 PSGGEPLYSYTVITVDSSKTMNCIHDRMPAILDGDEAIRKWLDFGEVSTQDALKLIHPIE 240

Query: 241 DLVWYPVTPSMGKPSFDGPDCIKEIQLK-------NDGSNLISKFFSAKETKKEYSVS 259
           ++ ++PV+  +     +  +CI  + L        +  S  + ++   K  KKE S S
Sbjct: 241 NITYHPVSTVVNNSRNNSTECIAAVILTQKKGPALSASSKKMLEWLQNKSPKKEESRS 296

BLAST of CSPI07G15020 vs. Swiss-Prot
Match: HMCES_CHICK (Embryonic stem cell-specific 5-hydroxymethylcytosine-binding protein OS=Gallus gallus GN=HMCES PE=2 SV=1)

HSP 1 Score: 146.0 bits (367), Expect = 8.1e-34
Identity = 89/266 (33.46%), Postives = 139/266 (52.26%), Query Frame = 1

Query: 1   MCGRARCTLRADDITRACHCTGGPVRS-----LNMDRFRPLFNASPGSDLPV------VR 60
           MCGR  C+L A  + RAC       R      L   R+RP +N  P S  PV      V+
Sbjct: 1   MCGRTACSLGAARLRRACAYRDRQGRRQQPEWLREGRYRPSYNKGPQSSGPVLLSRKHVQ 60

Query: 61  RDDESSDGGVVLQCMKWGLIPS-FTEKFEKPNYFKMFNARSESIHEKASFHR-LVPKRRC 120
           +D +SS+   VL  M+WGL+PS F E       FK  N RS+++  K+S+   L+  +RC
Sbjct: 61  QDADSSER--VLMDMRWGLVPSWFKEDDPSKMQFKTSNCRSDTMLSKSSYKGPLLKGKRC 120

Query: 121 LVAVEGFYEWKKDGSKKQPYYIHFKDGQP------------------LALAALYDCWENL 180
           +V  +GFYEW++ G  KQPY+I+F   +                   L +A ++DCWE  
Sbjct: 121 VVLADGFYEWQQRGGGKQPYFIYFPQNKKHPAEEEEDSDEEWRGWRLLTMAGIFDCWEPP 180

Query: 181 EG-ELLYTFTILTTSSSPALKWLHDRMPVILGDKERMDMWLNDSSSSKYDSVLKPYEAPD 235
           +G E LYT+TI+T  +S  + ++H RMP IL   E ++ WL+ +     +++     A +
Sbjct: 181 KGGEPLYTYTIITVDASEDVSFIHHRMPAILDGDEAIEKWLDFAEVPTREAMKLIRPAEN 240

BLAST of CSPI07G15020 vs. Swiss-Prot
Match: HMCES_RAT (Embryonic stem cell-specific 5-hydroxymethylcytosine-binding protein OS=Rattus norvegicus GN=Hmces PE=2 SV=1)

HSP 1 Score: 143.7 bits (361), Expect = 4.0e-33
Identity = 103/335 (30.75%), Postives = 164/335 (48.96%), Query Frame = 1

Query: 1   MCGRARCTLRADDITRACHCTGGPVRSL-----NMDRFRPLFNASPGSDLPVV------R 60
           MCGR  C L  D +TRAC       R       + D++ P +N SP S  PV+       
Sbjct: 1   MCGRTSCHLPRDALTRACAYLDRQGRRQLPQWRDPDKYCPSYNKSPQSSSPVLLSRLHFE 60

Query: 61  RDDESSDGGVVLQCMKWGLIPS-FTEKFEKPNYFKMFNARSESIHEKASFHRLVPK-RRC 120
           +D +SSD   ++  M+WGL+PS F E       F   N RS++I EK SF   + K RRC
Sbjct: 61  KDADSSDR--IIFPMRWGLVPSWFKESDPSKLQFNTSNCRSDTIMEKQSFKAPLGKGRRC 120

Query: 121 LVAVEGFYEWKK--DGSKKQPYYIHF------KDGQP------------------LALAA 180
           +V  +GFYEW++    +++QPY+I+F      K G+                   L +A 
Sbjct: 121 VVLADGFYEWQRCQGTNQRQPYFIYFPQSKTEKSGENSGSDSLNNKEEVWDNWRLLTMAG 180

Query: 181 LYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILGDKERMDMWLNDSSSSKYDSVL 240
           ++DCWE  +GE LY+++I+T  S   L  +H RMP IL  +E +  WL+    S  +++ 
Sbjct: 181 IFDCWEPPKGERLYSYSIITVDSCRGLSDIHSRMPAILDGEEAVSKWLDFGEVSTQEALK 240

Query: 241 KPYEAPDLVWYPVTPSMGKPSFDGPDC-------IKEIQLKNDGSNLISKFFSAKETKKE 290
             +   ++ ++PV+P +     + P+C       +K+    +  S  + ++ + K  KKE
Sbjct: 241 LIHPIDNITFHPVSPVVNNSRNNTPECLAPADLLVKKEPKASGSSQRMMQWLATKSPKKE 300

BLAST of CSPI07G15020 vs. Swiss-Prot
Match: YOQW_BACSU (Putative SOS response-associated peptidase YoqW OS=Bacillus subtilis (strain 168) GN=yoqW PE=3 SV=1)

HSP 1 Score: 139.0 bits (349), Expect = 9.9e-32
Identity = 83/239 (34.73%), Postives = 125/239 (52.30%), Query Frame = 1

Query: 1   MCGRARCTLRADDITRACHCTGGPVRSLNMDRF------RPLFNASPGSDLPVVRRDDES 60
           MCGR       DDI          +   N+D+F       P +N +P  ++  +  D  +
Sbjct: 1   MCGRFTLFSEFDDI----------IEQFNIDQFLPEGEYHPSYNVAPSQNILTIINDGSN 60

Query: 61  SDGGVVLQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHRLVPKRRCLVAVEGF 120
           +  G     ++WGLIP +  K EK  Y KM NAR+E++ EK SF + +  +RC++  + F
Sbjct: 61  NRLGK----LRWGLIPPWA-KDEKIGY-KMINARAETLSEKPSFRKPLVSKRCIIPADSF 120

Query: 121 YEWKK-DGSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHD 180
           YEWK+ D   K P  I  K     A A LY+ W   EG  LYT TI+TT  +  ++ +HD
Sbjct: 121 YEWKRLDPKTKIPMRIKLKSSNLFAFAGLYEKWNTPEGNPLYTCTIITTKPNELMEDIHD 180

Query: 181 RMPVILGDKERMDMWLNDSSSSK--YDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIK 231
           RMPVIL D+   + WLN  ++      S+L+PY+A D+  Y V+  +  P  + P+ I+
Sbjct: 181 RMPVILTDENEKE-WLNPKNTDPDYLQSLLQPYDADDMEAYQVSSLVNSPKNNSPELIE 222

BLAST of CSPI07G15020 vs. TrEMBL
Match: A0A0A0K6X8_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G371760 PE=4 SV=1)

HSP 1 Score: 734.2 bits (1894), Expect = 7.7e-209
Identity = 358/359 (99.72%), Postives = 358/359 (99.72%), Query Frame = 1

Query: 1   MCGRARCTLRADDITRACHCTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVV 60
           MCGRARCTLRADDITRACH TGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVV
Sbjct: 1   MCGRARCTLRADDITRACHRTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVV 60

Query: 61  LQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHRLVPKRRCLVAVEGFYEWKKD 120
           LQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHRLVPKRRCLVAVEGFYEWKKD
Sbjct: 61  LQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHRLVPKRRCLVAVEGFYEWKKD 120

Query: 121 GSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILG 180
           GSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILG
Sbjct: 121 GSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILG 180

Query: 181 DKERMDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSN 240
           DKERMDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSN
Sbjct: 181 DKERMDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSN 240

Query: 241 LISKFFSAKETKKEYSVSQEKTCSNTSVKPEASPSLEEHKREVNRGASSEESKDCLAKCS 300
           LISKFFSAKETKKEYSVSQEKTCSNTSVKPEASPSLEEHKREVNRGASSEESKDCLAKCS
Sbjct: 241 LISKFFSAKETKKEYSVSQEKTCSNTSVKPEASPSLEEHKREVNRGASSEESKDCLAKCS 300

Query: 301 SDTSLTYQIKRDREDISSDLKSGMDDYSKVGSSPKIRKKGNLKTGNDNQLTLFSYFGKK 360
           SDTSLTYQIKRDREDISSDLKSGMDDYSKVGSSPKIRKKGNLKTGNDNQLTLFSYFGKK
Sbjct: 301 SDTSLTYQIKRDREDISSDLKSGMDDYSKVGSSPKIRKKGNLKTGNDNQLTLFSYFGKK 359

BLAST of CSPI07G15020 vs. TrEMBL
Match: M5VHM0_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa018685mg PE=4 SV=1)

HSP 1 Score: 449.9 bits (1156), Expect = 2.9e-123
Identity = 230/363 (63.36%), Postives = 278/363 (76.58%), Query Frame = 1

Query: 1   MCGRARCTLRADDITRACHCTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVV 60
           MCGRARCTLRADDI RACH + GPVR++NMDRFRPLFNASPGS+LPVVRR+D     GVV
Sbjct: 1   MCGRARCTLRADDIPRACHRSHGPVRTVNMDRFRPLFNASPGSNLPVVRREDGGDGDGVV 60

Query: 61  LQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHRLVPKRRCLVAVEGFYEWKKD 120
           + CMKWGLIPSFT+K EKP+++KMFNARSESI EKASF RL+PK RCL+AVEGFYEWKKD
Sbjct: 61  VHCMKWGLIPSFTKKTEKPDHYKMFNARSESICEKASFRRLIPKNRCLIAVEGFYEWKKD 120

Query: 121 GSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILG 180
           GSKKQPYY+HF DG+PL  AALYD WEN EGE LYTFTI+TTSSS AL WLHDRMPVILG
Sbjct: 121 GSKKQPYYVHFNDGRPLLFAALYDFWENSEGEKLYTFTIITTSSSSALGWLHDRMPVILG 180

Query: 181 DKERMDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSN 240
           DK   D WL+ SS+S +DS+LKPYE PDLVWYPVT +MGK SFDGP+CI EIQLK +G+N
Sbjct: 181 DKGSTDSWLSGSSTSNFDSLLKPYEGPDLVWYPVTQAMGKVSFDGPECINEIQLKTEGNN 240

Query: 241 LISKFFSAKETKKEYSVSQEKTCSNTSVKPEASPSLEEH---KREVNRGASSEE-SKDCL 300
            I+KFF +K TKKE    ++ +  ++SVK +   S++E    K +  + AS+E+   D  
Sbjct: 241 SITKFFMSKGTKKEELNPKDTSFYDSSVKNDLPKSVKEEPEGKEKTEQPASTEKCENDSK 300

Query: 301 AKCSSDTSLTY-QIKRDREDISSDLKSGMDDYSKVGSSPKIRKKGNLKTGNDNQLTLFSY 359
            +  S   ++  Q KRD E+ S+D K    + S++ +SP  +KK N K+  D Q TLFSY
Sbjct: 301 GQTISQEGVSKGQTKRDYEEFSADSKPVAYETSEMSASP-AKKKVNPKSSVDKQPTLFSY 360

BLAST of CSPI07G15020 vs. TrEMBL
Match: A0A067LKP4_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_15120 PE=4 SV=1)

HSP 1 Score: 442.6 bits (1137), Expect = 4.7e-121
Identity = 224/367 (61.04%), Postives = 264/367 (71.93%), Query Frame = 1

Query: 1   MCGRARCTLRADDITRACHCTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVV 60
           MCGRARCTLRADDI+RACHC G PVRS+NMDR+RP +N SPGS+LPVV R D S   G  
Sbjct: 1   MCGRARCTLRADDISRACHCNGAPVRSVNMDRYRPYYNVSPGSNLPVVYRGDVSGGEGYS 60

Query: 61  LQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHRLVPKRRCLVAVEGFYEWKKD 120
           L CM WGL+PSFT+K EKP++++MFNARSES+ EKASF RL+PK RCLVAVEGFYEWKKD
Sbjct: 61  LHCMTWGLVPSFTKKTEKPDFYRMFNARSESVREKASFRRLLPKNRCLVAVEGFYEWKKD 120

Query: 121 GSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILG 180
           GSKKQPYYIHFKD +PL  AALYD W+N EGE+L TFTILTTSSS AL+WLHDRMPVILG
Sbjct: 121 GSKKQPYYIHFKDDRPLVFAALYDSWQNSEGEILDTFTILTTSSSSALQWLHDRMPVILG 180

Query: 181 DKERMDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSN 240
           DK  +D WLN SSSSK+D +LKPYE PDLVWYPVTP+MGK SFDGP+CIKEI LK +   
Sbjct: 181 DKGAIDTWLNGSSSSKFDIMLKPYENPDLVWYPVTPAMGKISFDGPECIKEIHLKTEDKG 240

Query: 241 LISKFFSAKETKKEYSVSQEKTCSNTSVKPEASPSLEEHKREVNRGAS--------SEES 300
            ISKFFS KE K E    QE     ++ +  A  +  +  +E +  A         + +S
Sbjct: 241 TISKFFSRKEIKSE----QESNLQGSTCEKSADVNTPKRVKEEDVIADKLDIPSLVNNDS 300

Query: 301 KDCLAKCSSDTSLTYQIKRDREDISSDLKSGMDDYSKVGSSPKIRKKGNLKTGNDNQLTL 360
           +  +   + +    Y+ KRD E+  +D K G+D   K   SP  RKK NLK   D Q TL
Sbjct: 301 RSSVCTITKEDGTKYKTKRDYEETLNDSKLGLDKDEKPPQSP-ARKKVNLKIDGDKQPTL 360

BLAST of CSPI07G15020 vs. TrEMBL
Match: A0A151SS58_CAJCA (UPF0361 protein DC12 isogeny OS=Cajanus cajan GN=KK1_003941 PE=4 SV=1)

HSP 1 Score: 439.5 bits (1129), Expect = 3.9e-120
Identity = 223/362 (61.60%), Postives = 272/362 (75.14%), Query Frame = 1

Query: 1   MCGRARCTLRADDITRACHCTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVV 60
           MCGRARC+LRADD+ RACH T  P R+L++DR+RP +N SPGSD+PVVRR+D S   G V
Sbjct: 1   MCGRARCSLRADDVPRACHRTVAPTRTLDIDRYRPSYNVSPGSDVPVVRREDVSDGEGYV 60

Query: 61  LQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHRLVPKRRCLVAVEGFYEWKKD 120
           L CMKWGLIPSFT+K EKP++++MFNARSESI EKASF RL+PK RCLVAVEGFYEWKKD
Sbjct: 61  LHCMKWGLIPSFTKKTEKPDHYRMFNARSESIDEKASFRRLLPKSRCLVAVEGFYEWKKD 120

Query: 121 GSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILG 180
           GS+KQPYYIHFKDG+PL  AALYD W+N EGE LYTFTI+TTSSS AL+WLHDRMPVILG
Sbjct: 121 GSRKQPYYIHFKDGRPLVFAALYDSWQNSEGETLYTFTIVTTSSSSALQWLHDRMPVILG 180

Query: 181 DKERMDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSN 240
            KE  D WL+ SS+S + SVLKPYE  DLVWYPVTP+MGKPSFDGP+CIKEIQ+K++G+ 
Sbjct: 181 SKESTDTWLS-SSASSFKSVLKPYEESDLVWYPVTPAMGKPSFDGPECIKEIQVKSEGNT 240

Query: 241 LISKFFSAKETKKEYSVSQEKTCSNTSVKPEASPSLEEHKREVNRGASSEE-SKDCLAKC 300
            ISKFFS K  + E +  ++K      +K        EH  +++ GA SEE  KD     
Sbjct: 241 SISKFFSKKGAESEDTKPKQKISCPELIK-------TEHTEDLSEGAKSEEGDKDLKFSG 300

Query: 301 SSDTSLT--YQIKRDREDISSDLKSGMDDYSKVGSSPKIRKKGNLKTGNDNQLTLFSYFG 360
           SS +  T    IKR+ E  S+D K  + ++ ++ S+P  +K+   KT +D Q TLFSYFG
Sbjct: 301 SSHSQNTSMLPIKREYETFSADSKPALANHDQISSNPG-KKREKAKTADDKQPTLFSYFG 353

BLAST of CSPI07G15020 vs. TrEMBL
Match: B9GQQ8_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0002s25190g PE=4 SV=1)

HSP 1 Score: 429.5 bits (1103), Expect = 4.1e-117
Identity = 221/369 (59.89%), Postives = 272/369 (73.71%), Query Frame = 1

Query: 1   MCGRARCTLRADDITRACHCTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESS-DG-- 60
           MCGRARCTLRADDI RACH     VRS+NMDR+RP +NASPGS+L VVRRDD +S DG  
Sbjct: 1   MCGRARCTLRADDIPRACHRNTATVRSVNMDRYRPSYNASPGSNLAVVRRDDAASGDGAS 60

Query: 61  ---GVVLQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHRLVPKRRCLVAVEGF 120
              G  + CMKWGLIP FT+K EKP+++KMFNARSES+ EKASF RL+PK RCLVAVEGF
Sbjct: 61  GGDGYAIHCMKWGLIPGFTKKSEKPDFYKMFNARSESLSEKASFRRLIPKSRCLVAVEGF 120

Query: 121 YEWKKDGSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDR 180
           YEWKKDGSKKQPYYIHFKDG+PL  AALYD W+N EGE+LYTFTI+TT++S A++WLH+R
Sbjct: 121 YEWKKDGSKKQPYYIHFKDGRPLVFAALYDSWQNSEGEILYTFTIVTTAASSAIQWLHER 180

Query: 181 MPVILGDKERMDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQL 240
           MPVILGDKE  D WL+ SS+SK+D+VLKPYE  DLVWYPVTP+MGKPSFDGP+CIKEI L
Sbjct: 181 MPVILGDKEATDTWLSVSSNSKFDTVLKPYEHSDLVWYPVTPAMGKPSFDGPECIKEIHL 240

Query: 241 KNDGSNLISKFFSAKETKKEYSVSQEKTCSNTSVKPEASPSLEEHKREVNRGASSE---- 300
           K +    ISKFFS KE K+E +  +     +  ++P++     E + ++    S++    
Sbjct: 241 KMEEKGTISKFFSRKEFKEESNPEESTHGKSLKLEPKSVKEENESEEKLETPCSAKTVDY 300

Query: 301 ESKDCLAKCSSDTSLTYQIKRDREDISSDLKSGMDDYSKVGSSPKIRKKGNLKTGNDNQL 360
           + K  L   S +     + KRDRE++  D K   D+  K  +SP  +KK NLK+ +D Q 
Sbjct: 301 DLKSELETFSHEGETKCKTKRDREEL-VDSKLKTDEIVKPRASP-AKKKANLKSVDDKQP 360

BLAST of CSPI07G15020 vs. TAIR10
Match: AT2G26470.1 (AT2G26470.1 unknown protein)

HSP 1 Score: 369.0 bits (946), Expect = 3.3e-102
Identity = 174/280 (62.14%), Postives = 220/280 (78.57%), Query Frame = 1

Query: 1   MCGRARCTLRADDITRACHCTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDG-GV 60
           MCGR RCTLR DD+ RA H    P R L++DR+RP +N +PGS +PV+RRD+E   G GV
Sbjct: 1   MCGRTRCTLRPDDVPRASHRHTVPTRFLHLDRYRPSYNVAPGSYIPVLRRDNEEVVGDGV 60

Query: 61  VLQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHRLVPKRRCLVAVEGFYEWKK 120
           V+ CMKWGL+PSFT+K +KP++FKMFNARSES+ EKASF RL+PK RCLVAV+GFYEWKK
Sbjct: 61  VVHCMKWGLVPSFTKKTDKPDFFKMFNARSESVAEKASFRRLLPKNRCLVAVDGFYEWKK 120

Query: 121 DGSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVIL 180
           +GSKKQPYYIHF+DG+PL  AAL+D W+N  GE LYTFTILTT+SS AL+WLHDRMPVIL
Sbjct: 121 EGSKKQPYYIHFEDGRPLVFAALFDTWQNSGGETLYTFTILTTASSSALQWLHDRMPVIL 180

Query: 181 GDKERMDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGS 240
           GDK+ +D WL+D S++K   +L PYE  DLVWYPVT ++GKP+FDGP+CI++I LK   +
Sbjct: 181 GDKDSIDTWLDDPSTTKLQPLLSPYEKSDLVWYPVTSAIGKPTFDGPECIQQIPLKTSQN 240

Query: 241 NLISKFFSAKETKKEYSVSQEK-TCSNTSVKPEASPSLEE 279
           +LISKFFS K+ K +    + K T +N  V  +  P+ E+
Sbjct: 241 SLISKFFSTKQPKTDEGDKETKSTDANIIVDLKKEPTAEK 280

BLAST of CSPI07G15020 vs. NCBI nr
Match: gi|778727168|ref|XP_011659220.1| (PREDICTED: embryonic stem cell-specific 5-hydroxymethylcytosine-binding protein isoform X1 [Cucumis sativus])

HSP 1 Score: 734.2 bits (1894), Expect = 1.1e-208
Identity = 358/359 (99.72%), Postives = 358/359 (99.72%), Query Frame = 1

Query: 1   MCGRARCTLRADDITRACHCTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVV 60
           MCGRARCTLRADDITRACH TGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVV
Sbjct: 1   MCGRARCTLRADDITRACHRTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVV 60

Query: 61  LQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHRLVPKRRCLVAVEGFYEWKKD 120
           LQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHRLVPKRRCLVAVEGFYEWKKD
Sbjct: 61  LQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHRLVPKRRCLVAVEGFYEWKKD 120

Query: 121 GSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILG 180
           GSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILG
Sbjct: 121 GSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILG 180

Query: 181 DKERMDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSN 240
           DKERMDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSN
Sbjct: 181 DKERMDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSN 240

Query: 241 LISKFFSAKETKKEYSVSQEKTCSNTSVKPEASPSLEEHKREVNRGASSEESKDCLAKCS 300
           LISKFFSAKETKKEYSVSQEKTCSNTSVKPEASPSLEEHKREVNRGASSEESKDCLAKCS
Sbjct: 241 LISKFFSAKETKKEYSVSQEKTCSNTSVKPEASPSLEEHKREVNRGASSEESKDCLAKCS 300

Query: 301 SDTSLTYQIKRDREDISSDLKSGMDDYSKVGSSPKIRKKGNLKTGNDNQLTLFSYFGKK 360
           SDTSLTYQIKRDREDISSDLKSGMDDYSKVGSSPKIRKKGNLKTGNDNQLTLFSYFGKK
Sbjct: 301 SDTSLTYQIKRDREDISSDLKSGMDDYSKVGSSPKIRKKGNLKTGNDNQLTLFSYFGKK 359

BLAST of CSPI07G15020 vs. NCBI nr
Match: gi|778727171|ref|XP_011659221.1| (PREDICTED: embryonic stem cell-specific 5-hydroxymethylcytosine-binding protein isoform X2 [Cucumis sativus])

HSP 1 Score: 725.7 bits (1872), Expect = 3.9e-206
Identity = 356/359 (99.16%), Postives = 356/359 (99.16%), Query Frame = 1

Query: 1   MCGRARCTLRADDITRACHCTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVV 60
           MCGRARCTLRADDITRACH TGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVV
Sbjct: 1   MCGRARCTLRADDITRACHRTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVV 60

Query: 61  LQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHRLVPKRRCLVAVEGFYEWKKD 120
           LQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHRLVPKRRCLVAVEGFYEWKKD
Sbjct: 61  LQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHRLVPKRRCLVAVEGFYEWKKD 120

Query: 121 GSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILG 180
           GSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILG
Sbjct: 121 GSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILG 180

Query: 181 DKERMDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSN 240
           DKERMDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKE  LKNDGSN
Sbjct: 181 DKERMDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKE--LKNDGSN 240

Query: 241 LISKFFSAKETKKEYSVSQEKTCSNTSVKPEASPSLEEHKREVNRGASSEESKDCLAKCS 300
           LISKFFSAKETKKEYSVSQEKTCSNTSVKPEASPSLEEHKREVNRGASSEESKDCLAKCS
Sbjct: 241 LISKFFSAKETKKEYSVSQEKTCSNTSVKPEASPSLEEHKREVNRGASSEESKDCLAKCS 300

Query: 301 SDTSLTYQIKRDREDISSDLKSGMDDYSKVGSSPKIRKKGNLKTGNDNQLTLFSYFGKK 360
           SDTSLTYQIKRDREDISSDLKSGMDDYSKVGSSPKIRKKGNLKTGNDNQLTLFSYFGKK
Sbjct: 301 SDTSLTYQIKRDREDISSDLKSGMDDYSKVGSSPKIRKKGNLKTGNDNQLTLFSYFGKK 357

BLAST of CSPI07G15020 vs. NCBI nr
Match: gi|659116512|ref|XP_008458105.1| (PREDICTED: embryonic stem cell-specific 5-hydroxymethylcytosine-binding protein isoform X1 [Cucumis melo])

HSP 1 Score: 523.1 bits (1346), Expect = 3.9e-145
Identity = 246/253 (97.23%), Postives = 249/253 (98.42%), Query Frame = 1

Query: 1   MCGRARCTLRADDITRACHCTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVV 60
           MCGRARCTLRADDITRACH TGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVV
Sbjct: 1   MCGRARCTLRADDITRACHRTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVV 60

Query: 61  LQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHRLVPKRRCLVAVEGFYEWKKD 120
           LQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEK SFHRLVPKRRCLVAVEGFYEWKKD
Sbjct: 61  LQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKPSFHRLVPKRRCLVAVEGFYEWKKD 120

Query: 121 GSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILG 180
           GSKKQPYYIHFKDG+PLALAALYDCWENLEGELLYTFTILTTS SPALKWLHDRMPVILG
Sbjct: 121 GSKKQPYYIHFKDGKPLALAALYDCWENLEGELLYTFTILTTSPSPALKWLHDRMPVILG 180

Query: 181 DKERMDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSN 240
           DKERMDMWL+DSSSSKYD+V KPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSN
Sbjct: 181 DKERMDMWLDDSSSSKYDTVFKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSN 240

Query: 241 LISKFFSAKETKK 254
           LISKFFSAKETKK
Sbjct: 241 LISKFFSAKETKK 253

BLAST of CSPI07G15020 vs. NCBI nr
Match: gi|659116514|ref|XP_008458106.1| (PREDICTED: embryonic stem cell-specific 5-hydroxymethylcytosine-binding protein isoform X2 [Cucumis melo])

HSP 1 Score: 514.2 bits (1323), Expect = 1.8e-142
Identity = 244/253 (96.44%), Postives = 247/253 (97.63%), Query Frame = 1

Query: 1   MCGRARCTLRADDITRACHCTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVV 60
           MCGRARCTLRADDITRACH TGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVV
Sbjct: 1   MCGRARCTLRADDITRACHRTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVV 60

Query: 61  LQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHRLVPKRRCLVAVEGFYEWKKD 120
           LQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEK SFHRLVPKRRCLVAVEGFYEWKKD
Sbjct: 61  LQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKPSFHRLVPKRRCLVAVEGFYEWKKD 120

Query: 121 GSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILG 180
           GSKKQPYYIHFKDG+PLALAALYDCWENLEGELLYTFTILTTS SPALKWLHDRMPVILG
Sbjct: 121 GSKKQPYYIHFKDGKPLALAALYDCWENLEGELLYTFTILTTSPSPALKWLHDRMPVILG 180

Query: 181 DKERMDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSN 240
           DKERMDMWL+DSSSSKYD+V KPYEAPDLVWYPVTPSMGKPSFDGPDCIKE  LKNDGSN
Sbjct: 181 DKERMDMWLDDSSSSKYDTVFKPYEAPDLVWYPVTPSMGKPSFDGPDCIKE--LKNDGSN 240

Query: 241 LISKFFSAKETKK 254
           LISKFFSAKETKK
Sbjct: 241 LISKFFSAKETKK 251

BLAST of CSPI07G15020 vs. NCBI nr
Match: gi|1009149608|ref|XP_015892568.1| (PREDICTED: putative SOS response-associated peptidase YoqW [Ziziphus jujuba])

HSP 1 Score: 471.5 bits (1212), Expect = 1.3e-129
Identity = 246/374 (65.78%), Postives = 287/374 (76.74%), Query Frame = 1

Query: 1   MCGRARCTLRADDITRACHCTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGG-- 60
           MCGRARCTLRADDI RACH TGG VR++N+DR+RP +N SPGS+LPVVRR D SSDGG  
Sbjct: 1   MCGRARCTLRADDIPRACHRTGGSVRTVNIDRYRPSYNVSPGSNLPVVRRADGSSDGGED 60

Query: 61  VVLQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHRLVPKRRCLVAVEGFYEWK 120
           VVL+CMKWGLIPSFT+K EKP+++KMFNARSESI EKASF RLVP+ RCLVAVEGFYEWK
Sbjct: 61  VVLECMKWGLIPSFTKKTEKPDHYKMFNARSESIGEKASFRRLVPRSRCLVAVEGFYEWK 120

Query: 121 KDGSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVI 180
           KDGSKKQPYYIHFKDG+PL  AALYD WEN EGE+ YTFTILTTSSS ALKWLHDRMPVI
Sbjct: 121 KDGSKKQPYYIHFKDGRPLVFAALYDSWENSEGEMFYTFTILTTSSSSALKWLHDRMPVI 180

Query: 181 LGDKERMDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDG 240
           LGDKE  D WL  SS++K+D++LKPYE  DLVWYPVTP+MGKPSFDGP+CIKEI+LK +G
Sbjct: 181 LGDKESSDKWLTGSSATKFDTLLKPYENSDLVWYPVTPAMGKPSFDGPECIKEIKLKTEG 240

Query: 241 SNLISKFFSAKETKKEYSVSQEK-TCSNTSVKPEASPSLEE------HKREVNRGASS-- 300
           SNL+SKFFS K  KKE  +  EK + S+ SVK +   SL+E        RE N G SS  
Sbjct: 241 SNLLSKFFSPKGIKKESELKSEKESTSDISVKSDLPKSLKEEPKEEPEPRESNEGQSSLT 300

Query: 301 ----EESKDCLAKCSSDTSLTYQIKRDREDISSDLKSGMDDYSKVGSSPKIRKKGNLKTG 359
               ++ K        D +   Q KR  E++S+D +   D+  K+ +SP  +KKGNLK+ 
Sbjct: 301 ENEEQDFKSSEPTLPKDDAGKCQTKRAYEELSADSELATDETEKLITSP-AKKKGNLKSA 360

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
HMCES_XENTR9.2e-3834.90Embryonic stem cell-specific 5-hydroxymethylcytosine-binding protein OS=Xenopus ... [more]
HMCES_XENLA1.0e-3633.89Embryonic stem cell-specific 5-hydroxymethylcytosine-binding protein OS=Xenopus ... [more]
HMCES_CHICK8.1e-3433.46Embryonic stem cell-specific 5-hydroxymethylcytosine-binding protein OS=Gallus g... [more]
HMCES_RAT4.0e-3330.75Embryonic stem cell-specific 5-hydroxymethylcytosine-binding protein OS=Rattus n... [more]
YOQW_BACSU9.9e-3234.73Putative SOS response-associated peptidase YoqW OS=Bacillus subtilis (strain 168... [more]
Match NameE-valueIdentityDescription
A0A0A0K6X8_CUCSA7.7e-20999.72Uncharacterized protein OS=Cucumis sativus GN=Csa_7G371760 PE=4 SV=1[more]
M5VHM0_PRUPE2.9e-12363.36Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa018685mg PE=4 SV=1[more]
A0A067LKP4_JATCU4.7e-12161.04Uncharacterized protein OS=Jatropha curcas GN=JCGZ_15120 PE=4 SV=1[more]
A0A151SS58_CAJCA3.9e-12061.60UPF0361 protein DC12 isogeny OS=Cajanus cajan GN=KK1_003941 PE=4 SV=1[more]
B9GQQ8_POPTR4.1e-11759.89Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0002s25190g PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT2G26470.13.3e-10262.14 unknown protein[more]
Match NameE-valueIdentityDescription
gi|778727168|ref|XP_011659220.1|1.1e-20899.72PREDICTED: embryonic stem cell-specific 5-hydroxymethylcytosine-binding protein ... [more]
gi|778727171|ref|XP_011659221.1|3.9e-20699.16PREDICTED: embryonic stem cell-specific 5-hydroxymethylcytosine-binding protein ... [more]
gi|659116512|ref|XP_008458105.1|3.9e-14597.23PREDICTED: embryonic stem cell-specific 5-hydroxymethylcytosine-binding protein ... [more]
gi|659116514|ref|XP_008458106.1|1.8e-14296.44PREDICTED: embryonic stem cell-specific 5-hydroxymethylcytosine-binding protein ... [more]
gi|1009149608|ref|XP_015892568.1|1.3e-12965.78PREDICTED: putative SOS response-associated peptidase YoqW [Ziziphus jujuba][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR003738SRAP
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI07G15020.1CSPI07G15020.1mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR003738SOS response associated peptidase (SRAP)GENE3DG3DSA:3.90.1680.10coord: 1..231
score: 1.7
IPR003738SOS response associated peptidase (SRAP)PANTHERPTHR13604DC12-RELATEDcoord: 2..326
score: 1.3E
IPR003738SOS response associated peptidase (SRAP)PFAMPF02586SRAPcoord: 1..220
score: 1.4
IPR003738SOS response associated peptidase (SRAP)unknownSSF143081BB1717-likecoord: 2..232
score: 5.49
NoneNo IPR availablePANTHERPTHR13604:SF0EMBRYONIC STEM CELL-SPECIFIC 5-HYDROXYMETHYLCYTOSINE-BINDING PROTEINcoord: 2..326
score: 1.3E