ClCG01G007140 (gene) Watermelon (Charleston Gray)

NameClCG01G007140
Typegene
OrganismCitrullus lanatus (Watermelon (Charleston Gray))
DescriptionPutative endonuclease or glycosyl hydrolase LENGTH=673
LocationCG_Chr01 : 8209394 .. 8213340 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GACGAGTGGTCCGTTTTGCGGCTGACTCTTTCCCATTTCCACTTCCTCTCCCTGCACAGTTTCTCAGTCGCCATGAATGGAGATGTAGCTCCGGCCGCCGCTCCGGCGGGCTCAGCCGAGCCCCAGTACGTCAGGGCTAAGACTTCTGTATGGTGGGACATCGAGAACTGTCAGGTCCCTAAGGGCTGCGATCCTCACGCCATCGCACAAAACATCAGTTCCGCCCTCGTTAAGATCAACTATTGCGGCCCTGTTTCCATTTCCGCTTATGGAGACACTAATCGCATTCCCAATTCTATTCAGCAAGCTCTTTCCAGTACTGGCATCGCCTTGAATCATGTCCCTGCCGGTAATTCTCTTTCCTCTCATGCTGTTTCTCGTTGTAACTGGTCATGCAAACCCTAGGGTTTCTGTTTTGTTTTGTTTTTTGGTCTAACTTTTCAATTGTACGGTTACATTATGTGATTAGTGTTTGATGAATTTGGTTTTCTCCACCGAATTGTGGATCGGATGTGATTTTTCTGGTGTGATGCTGGAATCTATGTAAGAATTATTATCAATTCCCCTTGTGGTTAGTACGATTACATGAATGGAACTGGAACAGTTTCCTTACTTATTCCAATTCACGAATTGAAAAGAAAAATGAGGGACTCGTCTCCGCGGGGGAGGCTTGTTGATTTCCTCCTTCGTTGTTTTCATCTGTGATTTTGTGAGTGTTTTCTTGCACTGCTTTTGAAGTTCAATTTTCAAATAGCTTTCCATTTCCATTTAGAATTTATTTTTGGCGGGTTAAAATATTTTGGTTGTATTAGGTATTTAGTGAGTGATTGAGCTTGCTCTTTTGTAATCCCATATTGGATAATAATGAAAAATATTGTCTTCACTCATGGATGTAGGCTTATATAAGTAGAATGGCACTAGTTGTGTGTGCTCTCTTTTCTCTCTCTTTATTTTTTTGCGATTGTTTTACTTATAGTTTAATCTAGCTTATTTTTCTGCTAATTTTCTGCTATGCATTGTAAAAGTAGTATAACTGGTTCTTTTATCAAATCATTACATACTGTTCTGGAGTGATTTTCTAAATTCTTTGAAACAATAATTTTGCCTTTTGATTTTTTTCAACAGTCTTGTACTGTTTTGTGGTGCTCTTTCTTATAAGTTGGTTTCACTCTTTGATCTAGTAGATGTGTGATCATGATGTATCCATCCTTCATTGATTTTAAAACAATTCATGAAAACATTACTGGTTGAAGGACATGTTGGACTCTAATTTGATAAGCTTTCTTCTAACAGGTGTTAAAGATGCAAGTGACAAGAAGATTCTAGTCGATATGTTGTTTTGGGCAGTTGATAACCCTGCTCCTGCAAATTATTTGCTAATTTCCGGTGATAGGGATTTTTCTAACGCACTTCATCAATTAAGAATGAGAAGATATAATATTCTTCTTGCACAGCCACAAAAGGCTTCTGCACCACTAGTTGCAGCAGCAAAGAGTATTTGGCTTTGGATGAGTCTTGTAGCTGGAGGACCCCCAATGTCCAGTGCTGAATCATCTCAACTTGTGAATGGTATCCTTACTTCTGACCCGCAAATATCACAGGGCTCTGAATTTGATCACAATCAGCAGACAGGACAAGCTATAGTATATAAATCTGAAAATGTCTCTTTGGGAAACCAAAGATCGTATTCTACTGAGAGGATGGGCGACAACAAGCATAAAGGTAAATATATACAGAAAAGCTCCAACCAACCAGTCCTATCCAGAGCTTTAAGCTCACCTGTTTCTATGCAAGAGAAAAACCCTAATTTTTTAAACCAACCGAATCATATGCAAGCAAAGCAGTTTAAGAAAGCACCCCATGAATTCTTTGGTAATAGCAATCCTATAGCCTCTTCTAGTCAGTCTACTCCAACCCCGTTTATTGAAAATTCCAGTCATGCTAGGACTGATGGCAATGTTCCAATGGGTAGTTCCTCGAGTTACCAACCTCCTCACATGGGTCTTGCTAGGACTGATGGCAATATGTCAATGGGTAGTTCTTCAAGTTACCAACCTCCCCACATGAGGCAAAACAACATGCAGCTCCATCCTCCTTTTCGTCCAGATAATGTTTTTCCTCCTAACTCCCTTAATCATAATTCTTTTCCAGTCCCTGGTCAACCTGATCTCTCTGCACCCAATATTAGTAAGCTACATATCTCTGATTATCCCAATTATGCTATAAATCCACAAAATTTCCATCATCAAGCTGGTGAATTTAGACCACATACCAAGTCTCAAAATCCAAACTTTAATTCACCAGACAAAGGCCGTAGTCAGCATGGTGGCCATTCATTCCATCATGATGCATTGAATAAAAGACATGCTCGTGATGTAGTAGAGTATACACCTCATTCATCTTCTACCACTGTTACTAGAAGTTTATCTTATAATGATGCCTGGGGATCTCAGGGGCAACCCCCACCTTCGGAGTACATTCAAGGCCTTATTGGAGTTATTCTTCTTGCATTAAACACCCTGAAAACAGAAAAAATTATGCCAACTGAGGCAAATATAGCTGACTGCATCCGATATGGAGATTTGAGAAACAGCAATACTGATGTAAAAATGGCACTAGATAGTGCGATAGAGCATAATATGGTAGTGAAGCAGAATTTAGGAGCAGTGCAATTGTATGTTGGTAAAACAGAAAAACTGTGGAAGTGTGTGAACCCTCTAGGTGGGCATCCTAATCAATACCAAAAAGCTATATGGGATAAGATTCATAATTGTTTAGCTTCTCCAGCTGGTCGATCTGCAATAATGGCCTCTCGTTGCAGGTGATGTGATGTCTGGTCATCTCAAGCTGATCTGGCTTTCTAGATTGTTGGGTTCTAGTTTCAAAAATAACTTCTGTTTGAATTGATTTGCAGATATGAAGCAGCATTGATTCTGAAGAAAGAATGCTTAACAGATTTTGCCTTGGGCGATGTGCTTCAGATTTTGAATATGATAACCTCAATGAAGAAATGGATTACTCATCACATTTCAGGATGGCAGCCAATTAATATTATGCTAACAGAAGTTAACACAGATGCAAGTTCAAGAACTGAACTTGATTGATCAAACACAAGTACCAATCAAGTAAAAAATTTACCCTCAAACAGCCGCAATGAAGGCTGATTAAGGCGAAATATTAATCACATTGAAGGGGGGAGTATTATTACAAAAGATACCAAAGGTGCCTGGGAACAGACGTTTAAAACCTCAAGGTTTGCGTATTCTGTTCTTTCACTTTTACGTTATTGGAGGAAATTTTCACGTTCTGAAGTGCATGAAAAAACGGGGATCTAGTAAGCTTACTACTCTATGAAGATCATCCCAGTTGCGGGGCTGCCATTCTAAGTTTCTTTCTCAAAGGAATCGCTTTGTTCACATGTGTTTTTCTCGTCTTCTCAGCGAGTTGTACATTTCATGTCTGTTTTGAGATGCTATAAGTTTGTCATCTATGGCGCTGGAAAGAAACACGATGTTCTTTTGTTGTGACTACACAAAGTTAGCTAGCTAGATAATATGGCTGTTGACCAGTTTCGGTATCCCACCTTGTATGTCACCATGCCATGCTTACTTACGTAAAGATATTAGCATAAAGGATTGCTGCCGGTGGATCAGCTTGAAGAAGTTAGCCAGCAAATTAACATGGGGCATTCTGAAGCTTTTCTTTTTCTCTTCTTTTTTCCCCATAGATACTTTTGCCTAGTTGTATTGCAGTAGGGTTTTATGCTCTGTTCTTGGTTATGGGATTTGATTGTATGGGCACAGTGGTATCCCTTGAATGTTTGTACTTCTTTCTACTGTCTAGAACAGATGAGACGTTTCATATGTTTTGAATGTTACGTTATAGACAACGTGGGCAAGGGAATTGGTTCACTTGCTTTTAATGTGTTATAAATAAATGAACATAGT

mRNA sequence

GACGAGTGGTCCGTTTTGCGGCTGACTCTTTCCCATTTCCACTTCCTCTCCCTGCACAGTTTCTCAGTCGCCATGAATGGAGATGTAGCTCCGGCCGCCGCTCCGGCGGGCTCAGCCGAGCCCCAGTACGTCAGGGCTAAGACTTCTGTATGGTGGGACATCGAGAACTGTCAGGTCCCTAAGGGCTGCGATCCTCACGCCATCGCACAAAACATCAGTTCCGCCCTCGTTAAGATCAACTATTGCGGCCCTGTTTCCATTTCCGCTTATGGAGACACTAATCGCATTCCCAATTCTATTCAGCAAGCTCTTTCCAGTACTGGCATCGCCTTGAATCATGTCCCTGCCGGTGTTAAAGATGCAAGTGACAAGAAGATTCTAGTCGATATGTTGTTTTGGGCAGTTGATAACCCTGCTCCTGCAAATTATTTGCTAATTTCCGGTGATAGGGATTTTTCTAACGCACTTCATCAATTAAGAATGAGAAGATATAATATTCTTCTTGCACAGCCACAAAAGGCTTCTGCACCACTAGTTGCAGCAGCAAAGAGTATTTGGCTTTGGATGAGTCTTGTAGCTGGAGGACCCCCAATGTCCAGTGCTGAATCATCTCAACTTGTGAATGGTATCCTTACTTCTGACCCGCAAATATCACAGGGCTCTGAATTTGATCACAATCAGCAGACAGGACAAGCTATAGTATATAAATCTGAAAATGTCTCTTTGGGAAACCAAAGATCGTATTCTACTGAGAGGATGGGCGACAACAAGCATAAAGGTAAATATATACAGAAAAGCTCCAACCAACCAGTCCTATCCAGAGCTTTAAGCTCACCTGTTTCTATGCAAGAGAAAAACCCTAATTTTTTAAACCAACCGAATCATATGCAAGCAAAGCAGTTTAAGAAAGCACCCCATGAATTCTTTGGTAATAGCAATCCTATAGCCTCTTCTAGTCAGTCTACTCCAACCCCGTTTATTGAAAATTCCAGTCATGCTAGGACTGATGGCAATGTTCCAATGGGTAGTTCCTCGAGTTACCAACCTCCTCACATGGGTCTTGCTAGGACTGATGGCAATATGTCAATGGGTAGTTCTTCAAGTTACCAACCTCCCCACATGAGGCAAAACAACATGCAGCTCCATCCTCCTTTTCGTCCAGATAATGTTTTTCCTCCTAACTCCCTTAATCATAATTCTTTTCCAGTCCCTGGTCAACCTGATCTCTCTGCACCCAATATTAGTAAGCTACATATCTCTGATTATCCCAATTATGCTATAAATCCACAAAATTTCCATCATCAAGCTGGTGAATTTAGACCACATACCAAGTCTCAAAATCCAAACTTTAATTCACCAGACAAAGGCCGTAGTCAGCATGGTGGCCATTCATTCCATCATGATGCATTGAATAAAAGACATGCTCGTGATGTAGTAGAGTATACACCTCATTCATCTTCTACCACTGTTACTAGAAGTTTATCTTATAATGATGCCTGGGGATCTCAGGGGCAACCCCCACCTTCGGAGTACATTCAAGGCCTTATTGGAGTTATTCTTCTTGCATTAAACACCCTGAAAACAGAAAAAATTATGCCAACTGAGGCAAATATAGCTGACTGCATCCGATATGGAGATTTGAGAAACAGCAATACTGATGTAAAAATGGCACTAGATAGTGCGATAGAGCATAATATGGTAGTGAAGCAGAATTTAGGAGCAGTGCAATTGTATGTTGGTAAAACAGAAAAACTGTGGAAGTGTGTGAACCCTCTAGGTGGGCATCCTAATCAATACCAAAAAGCTATATGGGATAAGATTCATAATTGTTTAGCTTCTCCAGCTGGTCGATCTGCAATAATGGCCTCTCGTTGCAGATATGAAGCAGCATTGATTCTGAAGAAAGAATGCTTAACAGATTTTGCCTTGGGCGATGTGCTTCAGATTTTGAATATGATAACCTCAATGAAGAAATGGATTACTCATCACATTTCAGGATGGCAGCCAATTAATATTATGCTAACAGAAGTTAACACAGATGCAAGTTCAAGAACTGAACTTGATTGATCAAACACAAGTACCAATCAAGTAAAAAATTTACCCTCAAACAGCCGCAATGAAGGCTGATTAAGGCGAAATATTAATCACATTGAAGGGGGGAGTATTATTACAAAAGATACCAAAGGTGCCTGGGAACAGACGTTTAAAACCTCAAGGTTTGCGTATTCTGTTCTTTCACTTTTACGTTATTGGAGGAAATTTTCACGTTCTGAAGTGCATGAAAAAACGGGGATCTAGTAAGCTTACTACTCTATGAAGATCATCCCAGTTGCGGGGCTGCCATTCTAAGTTTCTTTCTCAAAGGAATCGCTTTGTTCACATGTGTTTTTCTCGTCTTCTCAGCGAGTTGTACATTTCATGTCTGTTTTGAGATGCTATAAGTTTGTCATCTATGGCGCTGGAAAGAAACACGATGTTCTTTTGTTGTGACTACACAAAGTTAGCTAGCTAGATAATATGGCTGTTGACCAGTTTCGGTATCCCACCTTGTATGTCACCATGCCATGCTTACTTACGTAAAGATATTAGCATAAAGGATTGCTGCCGGTGGATCAGCTTGAAGAAGTTAGCCAGCAAATTAACATGGGGCATTCTGAAGCTTTTCTTTTTCTCTTCTTTTTTCCCCATAGATACTTTTGCCTAGTTGTATTGCAGTAGGGTTTTATGCTCTGTTCTTGGTTATGGGATTTGATTGTATGGGCACAGTGGTATCCCTTGAATGTTTGTACTTCTTTCTACTGTCTAGAACAGATGAGACGTTTCATATGTTTTGAATGTTACGTTATAGACAACGTGGGCAAGGGAATTGGTTCACTTGCTTTTAATGTGTTATAAATAAATGAACATAGT

Coding sequence (CDS)

ATGAATGGAGATGTAGCTCCGGCCGCCGCTCCGGCGGGCTCAGCCGAGCCCCAGTACGTCAGGGCTAAGACTTCTGTATGGTGGGACATCGAGAACTGTCAGGTCCCTAAGGGCTGCGATCCTCACGCCATCGCACAAAACATCAGTTCCGCCCTCGTTAAGATCAACTATTGCGGCCCTGTTTCCATTTCCGCTTATGGAGACACTAATCGCATTCCCAATTCTATTCAGCAAGCTCTTTCCAGTACTGGCATCGCCTTGAATCATGTCCCTGCCGGTGTTAAAGATGCAAGTGACAAGAAGATTCTAGTCGATATGTTGTTTTGGGCAGTTGATAACCCTGCTCCTGCAAATTATTTGCTAATTTCCGGTGATAGGGATTTTTCTAACGCACTTCATCAATTAAGAATGAGAAGATATAATATTCTTCTTGCACAGCCACAAAAGGCTTCTGCACCACTAGTTGCAGCAGCAAAGAGTATTTGGCTTTGGATGAGTCTTGTAGCTGGAGGACCCCCAATGTCCAGTGCTGAATCATCTCAACTTGTGAATGGTATCCTTACTTCTGACCCGCAAATATCACAGGGCTCTGAATTTGATCACAATCAGCAGACAGGACAAGCTATAGTATATAAATCTGAAAATGTCTCTTTGGGAAACCAAAGATCGTATTCTACTGAGAGGATGGGCGACAACAAGCATAAAGGTAAATATATACAGAAAAGCTCCAACCAACCAGTCCTATCCAGAGCTTTAAGCTCACCTGTTTCTATGCAAGAGAAAAACCCTAATTTTTTAAACCAACCGAATCATATGCAAGCAAAGCAGTTTAAGAAAGCACCCCATGAATTCTTTGGTAATAGCAATCCTATAGCCTCTTCTAGTCAGTCTACTCCAACCCCGTTTATTGAAAATTCCAGTCATGCTAGGACTGATGGCAATGTTCCAATGGGTAGTTCCTCGAGTTACCAACCTCCTCACATGGGTCTTGCTAGGACTGATGGCAATATGTCAATGGGTAGTTCTTCAAGTTACCAACCTCCCCACATGAGGCAAAACAACATGCAGCTCCATCCTCCTTTTCGTCCAGATAATGTTTTTCCTCCTAACTCCCTTAATCATAATTCTTTTCCAGTCCCTGGTCAACCTGATCTCTCTGCACCCAATATTAGTAAGCTACATATCTCTGATTATCCCAATTATGCTATAAATCCACAAAATTTCCATCATCAAGCTGGTGAATTTAGACCACATACCAAGTCTCAAAATCCAAACTTTAATTCACCAGACAAAGGCCGTAGTCAGCATGGTGGCCATTCATTCCATCATGATGCATTGAATAAAAGACATGCTCGTGATGTAGTAGAGTATACACCTCATTCATCTTCTACCACTGTTACTAGAAGTTTATCTTATAATGATGCCTGGGGATCTCAGGGGCAACCCCCACCTTCGGAGTACATTCAAGGCCTTATTGGAGTTATTCTTCTTGCATTAAACACCCTGAAAACAGAAAAAATTATGCCAACTGAGGCAAATATAGCTGACTGCATCCGATATGGAGATTTGAGAAACAGCAATACTGATGTAAAAATGGCACTAGATAGTGCGATAGAGCATAATATGGTAGTGAAGCAGAATTTAGGAGCAGTGCAATTGTATGTTGGTAAAACAGAAAAACTGTGGAAGTGTGTGAACCCTCTAGGTGGGCATCCTAATCAATACCAAAAAGCTATATGGGATAAGATTCATAATTGTTTAGCTTCTCCAGCTGGTCGATCTGCAATAATGGCCTCTCGTTGCAGATATGAAGCAGCATTGATTCTGAAGAAAGAATGCTTAACAGATTTTGCCTTGGGCGATGTGCTTCAGATTTTGAATATGATAACCTCAATGAAGAAATGGATTACTCATCACATTTCAGGATGGCAGCCAATTAATATTATGCTAACAGAAGTTAACACAGATGCAAGTTCAAGAACTGAACTTGATTGA

Protein sequence

MNGDVAPAAAPAGSAEPQYVRAKTSVWWDIENCQVPKGCDPHAIAQNISSALVKINYCGPVSISAYGDTNRIPNSIQQALSSTGIALNHVPAGVKDASDKKILVDMLFWAVDNPAPANYLLISGDRDFSNALHQLRMRRYNILLAQPQKASAPLVAAAKSIWLWMSLVAGGPPMSSAESSQLVNGILTSDPQISQGSEFDHNQQTGQAIVYKSENVSLGNQRSYSTERMGDNKHKGKYIQKSSNQPVLSRALSSPVSMQEKNPNFLNQPNHMQAKQFKKAPHEFFGNSNPIASSSQSTPTPFIENSSHARTDGNVPMGSSSSYQPPHMGLARTDGNMSMGSSSSYQPPHMRQNNMQLHPPFRPDNVFPPNSLNHNSFPVPGQPDLSAPNISKLHISDYPNYAINPQNFHHQAGEFRPHTKSQNPNFNSPDKGRSQHGGHSFHHDALNKRHARDVVEYTPHSSSTTVTRSLSYNDAWGSQGQPPPSEYIQGLIGVILLALNTLKTEKIMPTEANIADCIRYGDLRNSNTDVKMALDSAIEHNMVVKQNLGAVQLYVGKTEKLWKCVNPLGGHPNQYQKAIWDKIHNCLASPAGRSAIMASRCRYEAALILKKECLTDFALGDVLQILNMITSMKKWITHHISGWQPINIMLTEVNTDASSRTELD
BLAST of ClCG01G007140 vs. Swiss-Prot
Match: MARF1_BOVIN (Meiosis arrest female protein 1 OS=Bos taurus GN=MARF1 PE=3 SV=2)

HSP 1 Score: 65.5 bits (158), Expect = 2.6e-09
Identity = 43/149 (28.86%), Postives = 71/149 (47.65%), Query Frame = 1

Query: 26  VWWDIENCQVPKGCDPHAIAQNISSALVKINYCGPVSISAYGDTNRIPNSIQQALSSTGI 85
           V+WDIENC VP G    A+ Q I     K +           D ++    + Q L++  +
Sbjct: 353 VFWDIENCSVPSGRSATAVVQRIREKFFKGH--REAEFICVCDISKENKEVIQELNNCQV 412

Query: 86  ALNHVPAGVKDASDKKILVDMLFWAVDNPAPANYLLISGDRDFSNALHQLRMRR-YNILL 145
            + H+ A  K+A+D K+   +  +A  + APA  +L+S D +F+  L  LR R  ++I+L
Sbjct: 413 TVAHINATAKNAADDKLRQSLRRFANTHTAPATVVLVSTDVNFALELSDLRHRHGFHIIL 472

Query: 146 AQPQKASAPLVAAAKSIWLWMSLVAGGPP 174
               +AS  L+  A  +  +   ++  PP
Sbjct: 473 VHKNQASEALLHHANELIRFEEFISDLPP 499

BLAST of ClCG01G007140 vs. Swiss-Prot
Match: MARF1_HUMAN (Meiosis arrest female protein 1 OS=Homo sapiens GN=KIAA0430 PE=1 SV=6)

HSP 1 Score: 65.5 bits (158), Expect = 2.6e-09
Identity = 43/149 (28.86%), Postives = 71/149 (47.65%), Query Frame = 1

Query: 26  VWWDIENCQVPKGCDPHAIAQNISSALVKINYCGPVSISAYGDTNRIPNSIQQALSSTGI 85
           V+WDIENC VP G    A+ Q I     K +           D ++    + Q L++  +
Sbjct: 355 VFWDIENCSVPSGRSATAVVQRIREKFFKGH--REAEFICVCDISKENKEVIQELNNCQV 414

Query: 86  ALNHVPAGVKDASDKKILVDMLFWAVDNPAPANYLLISGDRDFSNALHQLRMRR-YNILL 145
            + H+ A  K+A+D K+   +  +A  + APA  +L+S D +F+  L  LR R  ++I+L
Sbjct: 415 TVAHINATAKNAADDKLRQSLRRFANTHTAPATVVLVSTDVNFALELSDLRHRHGFHIIL 474

Query: 146 AQPQKASAPLVAAAKSIWLWMSLVAGGPP 174
               +AS  L+  A  +  +   ++  PP
Sbjct: 475 VHKNQASEALLHHANELIRFEEFISDLPP 501

BLAST of ClCG01G007140 vs. Swiss-Prot
Match: MARF1_CHICK (Meiosis arrest female protein 1 homolog OS=Gallus gallus GN=MARF1 PE=3 SV=1)

HSP 1 Score: 64.3 bits (155), Expect = 5.7e-09
Identity = 43/149 (28.86%), Postives = 70/149 (46.98%), Query Frame = 1

Query: 26  VWWDIENCQVPKGCDPHAIAQNISSALVKINYCGPVSISAYGDTNRIPNSIQQALSSTGI 85
           V+WDIENC VP G    A+ Q I     K +           D ++    + Q L++  +
Sbjct: 347 VFWDIENCSVPTGRSAVAVVQRIREKFFKGH--REAEFICVCDISKENKEVIQELNNCQV 406

Query: 86  ALNHVPAGVKDASDKKILVDMLFWAVDNPAPANYLLISGDRDFSNALHQLRMRR-YNILL 145
            + H+ A  K+A+D K+   +  +A  + APA  +L+S D +F+  L  LR R  + I+L
Sbjct: 407 TVAHINATAKNAADDKLRQSLRRFADTHTAPATVVLVSTDVNFALELSDLRHRHGFRIIL 466

Query: 146 AQPQKASAPLVAAAKSIWLWMSLVAGGPP 174
               +AS  L+  A  +  +   ++  PP
Sbjct: 467 VHKNQASEALLHHAHELVCFEEFISDLPP 493

BLAST of ClCG01G007140 vs. Swiss-Prot
Match: MARF1_MOUSE (Meiosis arrest female protein 1 OS=Mus musculus GN=Marf1 PE=1 SV=3)

HSP 1 Score: 62.8 bits (151), Expect = 1.7e-08
Identity = 41/149 (27.52%), Postives = 70/149 (46.98%), Query Frame = 1

Query: 26  VWWDIENCQVPKGCDPHAIAQNISSALVKINYCGPVSISAYGDTNRIPNSIQQALSSTGI 85
           V+WDIENC VP G     + Q I     + +           D ++    + Q L++  +
Sbjct: 354 VFWDIENCSVPSGRSATTVVQRIREKFFRGH--REAEFICVCDISKENKEVIQELNNCQV 413

Query: 86  ALNHVPAGVKDASDKKILVDMLFWAVDNPAPANYLLISGDRDFSNALHQLRMRR-YNILL 145
            + H+ A  K+A+D K+   +  +A  + APA  +L+S D +F+  L  LR R  ++I+L
Sbjct: 414 TVAHINATAKNAADDKLRQSLRRFANTHTAPATVVLVSTDVNFALELSDLRHRHGFHIIL 473

Query: 146 AQPQKASAPLVAAAKSIWLWMSLVAGGPP 174
               +AS  L+  A  +  +   ++  PP
Sbjct: 474 VHKNQASEALLHHANQLIRFEEFISDLPP 500

BLAST of ClCG01G007140 vs. Swiss-Prot
Match: MARF1_RAT (Meiosis arrest female protein 1 OS=Rattus norvegicus GN=Marf1 PE=1 SV=2)

HSP 1 Score: 62.8 bits (151), Expect = 1.7e-08
Identity = 41/149 (27.52%), Postives = 70/149 (46.98%), Query Frame = 1

Query: 26  VWWDIENCQVPKGCDPHAIAQNISSALVKINYCGPVSISAYGDTNRIPNSIQQALSSTGI 85
           V+WDIENC VP G     + Q I     + +           D ++    + Q L++  +
Sbjct: 353 VFWDIENCSVPSGRSATTVVQRIREKFFRGH--REAEFICVCDISKENKEVIQELNNCQV 412

Query: 86  ALNHVPAGVKDASDKKILVDMLFWAVDNPAPANYLLISGDRDFSNALHQLRMRR-YNILL 145
            + H+ A  K+A+D K+   +  +A  + APA  +L+S D +F+  L  LR R  ++I+L
Sbjct: 413 TVAHINATAKNAADDKLRQSLRRFANTHTAPATVVLVSTDVNFALELSDLRHRHGFHIIL 472

Query: 146 AQPQKASAPLVAAAKSIWLWMSLVAGGPP 174
               +AS  L+  A  +  +   ++  PP
Sbjct: 473 VHKNQASEALLHHANQLIRFEEFISDLPP 499

BLAST of ClCG01G007140 vs. TrEMBL
Match: A0A0A0KJL9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G119690 PE=4 SV=1)

HSP 1 Score: 1217.2 bits (3148), Expect = 0.0e+00
Identity = 593/665 (89.17%), Postives = 617/665 (92.78%), Query Frame = 1

Query: 1   MNGDVAPAAAPAGSAEPQYVRAKTSVWWDIENCQVPKGCDPHAIAQNISSALVKINYCGP 60
           MNGDVAPAA PA SAEPQY+RAKTSVWWDIENCQVPKGCDPHAIAQNISSALVKINYCGP
Sbjct: 1   MNGDVAPAATPATSAEPQYIRAKTSVWWDIENCQVPKGCDPHAIAQNISSALVKINYCGP 60

Query: 61  VSISAYGDTNRIPNSIQQALSSTGIALNHVPAGVKDASDKKILVDMLFWAVDNPAPANYL 120
           VSISAYGDTNRIPNSIQQALSSTGIALNHVPAGVKDASDKKILVDMLFWAVDNPAPANYL
Sbjct: 61  VSISAYGDTNRIPNSIQQALSSTGIALNHVPAGVKDASDKKILVDMLFWAVDNPAPANYL 120

Query: 121 LISGDRDFSNALHQLRMRRYNILLAQPQKASAPLVAAAKSIWLWMSLVAGGPPMSSAESS 180
           LISGDRDFSNALHQLRMRRYNILLAQPQKASAPLVAAAKS+WLWMSLVAGG P+SS ESS
Sbjct: 121 LISGDRDFSNALHQLRMRRYNILLAQPQKASAPLVAAAKSVWLWMSLVAGGLPISSTESS 180

Query: 181 QLVNGILTSDPQISQGSEFDHNQQTGQAIVYKSENVSLGNQRSYSTERMGDNKHKGKYIQ 240
           QLVNGI TS+PQISQ S FDHNQ TGQAIVYK ENV+LGNQRSYSTERMGDNKHKGKY+Q
Sbjct: 181 QLVNGIPTSEPQISQTSGFDHNQHTGQAIVYKPENVNLGNQRSYSTERMGDNKHKGKYVQ 240

Query: 241 KSSNQPVLSRALSSPVSMQEKNPNFLNQPNHMQAKQFKKAPHEFFGNSNPIASSSQSTPT 300
           K+SNQPV+SRALSSP SMQEKNPNFLNQPNHMQAKQFKKAPHEFFGN NP+ SSSQS P 
Sbjct: 241 KNSNQPVISRALSSPASMQEKNPNFLNQPNHMQAKQFKKAPHEFFGNGNPVGSSSQSIPN 300

Query: 301 PFIENSSHARTDGNVPMGSSSSYQPPHMGLARTDGNMSMGSSSSYQPPHMRQNNMQLHPP 360
            FIENSSHAR DGN  MGSSS YQP H+  AR+DGN+SM +SSSYQPPHMRQNNMQLHPP
Sbjct: 301 LFIENSSHARIDGNGSMGSSSCYQPSHLAHARSDGNISMSNSSSYQPPHMRQNNMQLHPP 360

Query: 361 FRPDNVFPPNSLNHNSFPVPGQPDLSAPNISKLHISDYPNYAINPQNFHHQAGEFRPHTK 420
           FRPDNVFPPNSLNHN FPV GQPDL APNIS+LHISDYPNY INPQNFH Q GEFRPH+K
Sbjct: 361 FRPDNVFPPNSLNHNPFPVLGQPDLPAPNISQLHISDYPNYPINPQNFHQQTGEFRPHSK 420

Query: 421 SQNP-NFNSPDKGRSQHGGHSFHHDALNKRHARDVVEYTPHSSSTTVTRSLSYNDAWGSQ 480
           SQNP NFN+PDK RS HGG SFHHDALNKRHARD VEYTPHSS TTVTRSLS+ND WGSQ
Sbjct: 421 SQNPANFNAPDKSRSHHGGQSFHHDALNKRHARDAVEYTPHSSFTTVTRSLSHNDGWGSQ 480

Query: 481 GQPPPSEYIQGLIGVILLALNTLKTEKIMPTEANIADCIRYGDLRNSNTDVKMALDSAIE 540
           GQPPPSEYIQGLIGVILLALNTLK EKIMP E NIA+CIRYGDLRN NTDVKMALDSAIE
Sbjct: 481 GQPPPSEYIQGLIGVILLALNTLKVEKIMPKEENIAECIRYGDLRNCNTDVKMALDSAIE 540

Query: 541 HNMVVKQNLGAVQLYVGKTEKLWKCVNPLGGHPNQYQKAIWDKIHNCLASPAGRSAIMAS 600
           HNMVVKQ +G +QLYVGKTEKLWKCVNPLGG+PNQY KAIWDKIH  LASPAGRSA+MAS
Sbjct: 541 HNMVVKQEIGELQLYVGKTEKLWKCVNPLGGYPNQYPKAIWDKIHYFLASPAGRSAMMAS 600

Query: 601 RCRYEAALILKKECLTDFALGDVLQILNMITSMKKWITHHISGWQPINIMLTEVNTDASS 660
           RCRYEAALILKKECLTDFALGDVLQIL+MITSMKKWITHH SGWQPINI+L E NTDASS
Sbjct: 601 RCRYEAALILKKECLTDFALGDVLQILHMITSMKKWITHHNSGWQPINIILAEGNTDASS 660

Query: 661 RTELD 665
           RTELD
Sbjct: 661 RTELD 665

BLAST of ClCG01G007140 vs. TrEMBL
Match: A0A061DZG7_THECC (Endonuclease or glycosyl hydrolase with C2H2-type zinc finger domain, putative isoform 1 OS=Theobroma cacao GN=TCM_004925 PE=4 SV=1)

HSP 1 Score: 767.3 bits (1980), Expect = 1.5e-218
Identity = 406/663 (61.24%), Postives = 481/663 (72.55%), Query Frame = 1

Query: 13  GSAEPQYVRAKTSVWWDIENCQVPKGCDPHAIAQNISSALVKINYCGPVSISAYGDTNRI 72
           G+ E QYV AKTSVWWDIENCQVPK CDPHAIAQNISSALVK+NYCGPVSISAYGDTNRI
Sbjct: 27  GTPEAQYVAAKTSVWWDIENCQVPKSCDPHAIAQNISSALVKMNYCGPVSISAYGDTNRI 86

Query: 73  PNSIQQALSSTGIALNHVPAGVKDASDKKILVDMLFWAVDNPAPANYLLISGDRDFSNAL 132
           P+S+QQALSSTGIALNHVPAGVKDASDKKILVDMLFWAVDNPAPANYLLISGDRDFSNAL
Sbjct: 87  PSSVQQALSSTGIALNHVPAGVKDASDKKILVDMLFWAVDNPAPANYLLISGDRDFSNAL 146

Query: 133 HQLRMRRYNILLAQPQKASAPLVAAAKSIWLWMSLVAGGPPMSSAESSQLVNGILTSDPQ 192
           HQLRMRRYNILLAQPQKASAPLVAAAKS+WLW SL AGGPP+SS ESS+L NG  + + +
Sbjct: 147 HQLRMRRYNILLAQPQKASAPLVAAAKSVWLWTSLSAGGPPLSSGESSKLANGHSSFNSE 206

Query: 193 ISQGSEFDHNQQTGQAIVYKSENVSLGNQRSYSTERMGDNKHKGKYIQKSSNQPVLSRAL 252
           +   +         Q +V+ SENV+LGNQ   +  R GD+K+KGKYI+K+ NQP +SRA 
Sbjct: 207 MLY-NPIPETVLYSQPMVFSSENVALGNQNVSNAGRNGDSKYKGKYIRKTPNQPSISRAS 266

Query: 253 SSPVSMQEKNPN--FLNQPNHMQAKQFKKAPHEFFGNSNPIASSSQSTPTPFIENSSHAR 312
           S P S  ++N N  +  QP + QAK FKKAPHEFFG S    S+S+STP  F  N +   
Sbjct: 267 SVPTSSIQENMNNGYSYQPEYAQAKSFKKAPHEFFGGSEAAVSASKSTPNFFPSNPN--- 326

Query: 313 TDGNVPMGSSSSYQPPHMGLARTDGNMSMGSSSSYQPPHMRQNNMQLHPPFRPDNVFPPN 372
                P GS+             +GN  MG   ++ P  +R NN+ L P F  +N+ PPN
Sbjct: 327 -----PPGSN-------------NGNF-MGIHQNH-PHSLRPNNLPLQPAFAQENLLPPN 386

Query: 373 SLNHNSFPVPGQ--------PDLSAPNISKLHISDYPNYAINPQNFHHQAG-EFRPHTKS 432
           S NH   P+P +        P  + P+I KL+IS++  YA NP NFHH+ G EF+  +  
Sbjct: 387 SQNHGFRPMPPRVEGPRFPAPPSNMPDIGKLNISEHSTYAQNPSNFHHRIGEEFKTSSIE 446

Query: 433 QNPN---FNSPDKGRSQHGGHSFHHDALNKRHARDVVEYTPHSSSTTVTRSLSYNDAWGS 492
             PN    N+P K    HGG +  HD  N R+ R   E+ P SSS  ++ S S N  WG+
Sbjct: 447 SLPNQASLNAPQKSLVLHGGQASQHDTFNNRYPRS-PEFPPPSSS-AISNSPS-NGTWGT 506

Query: 493 QGQPPPSEYIQGLIGVILLALNTLKTEKIMPTEANIADCIRYGDLRNSNTDVKMALDSAI 552
           QG+ PPSEY+QGLIGVILLALNTLK EKIMPTEANI DCIRYGD ++ NTDV+ ALDSAI
Sbjct: 507 QGRSPPSEYVQGLIGVILLALNTLKIEKIMPTEANITDCIRYGDPKHRNTDVRKALDSAI 566

Query: 553 EHNMVVKQNLGAVQLYVGKTEKLWKCVNPLGGHPNQYQKAIWDKIHNCLASPAGRSAIMA 612
           E +MV+KQ+LGA+QLYVG+ EKLWKCVNP+GG+PNQ+ K  WD I   L+SPAG+SA+MA
Sbjct: 567 EQHMVLKQSLGALQLYVGRNEKLWKCVNPIGGNPNQFSKTTWDGIQKFLSSPAGQSAMMA 626

Query: 613 SRCRYEAALILKKECLTDFALGDVLQILNMITSMKKWITHHISGWQPINIMLTEVNTDAS 662
           S+CRYEAAL LK  CL +FALGDVLQILNMI +MKKWI HH SGWQPI + L E   +  
Sbjct: 627 SQCRYEAALALKDACLEEFALGDVLQILNMIIAMKKWIIHHQSGWQPITVTLPETKMEMG 662

BLAST of ClCG01G007140 vs. TrEMBL
Match: F6HZE4_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_07s0005g02300 PE=4 SV=1)

HSP 1 Score: 760.0 bits (1961), Expect = 2.4e-216
Identity = 397/672 (59.08%), Postives = 478/672 (71.13%), Query Frame = 1

Query: 2   NGDVAPAAAPAGSAEPQYVRAKTSVWWDIENCQVPKGCDPHAIAQNISSALVKINYCGPV 61
           +G+   AA     AEPQYV  KTSVWWDIENCQVPKGCDPHAIAQNISSAL K+ Y GPV
Sbjct: 4   DGNGGTAARATLPAEPQYVSVKTSVWWDIENCQVPKGCDPHAIAQNISSALAKLYYSGPV 63

Query: 62  SISAYGDTNRIPNSIQQALSSTGIALNHVPAGVKDASDKKILVDMLFWAVDNPAPANYLL 121
           SISAYGDTNRIP S+QQALSSTGIALNHVPAGVKDASDKKILVDMLFWAVDNPAPANYLL
Sbjct: 64  SISAYGDTNRIPASVQQALSSTGIALNHVPAGVKDASDKKILVDMLFWAVDNPAPANYLL 123

Query: 122 ISGDRDFSNALHQLRMRRYNILLAQPQKASAPLVAAAKSIWLWMSLVAGGPPMSSAESSQ 181
           ISGDRDFSNALHQLRMRRYNILLAQPQKASAPL+AAAKS+WLW SLVAGG P++S ESSQ
Sbjct: 124 ISGDRDFSNALHQLRMRRYNILLAQPQKASAPLIAAAKSVWLWTSLVAGGFPLTSGESSQ 183

Query: 182 LVNGILTSDPQISQGSEFDHNQQTGQAIVYKSENVSLGNQRSYSTERMGDNKHKGKYIQK 241
           L +     +P++SQ        QT Q +   S+ +S G Q+ +S  R+GD K KGK+I+K
Sbjct: 184 LADCNNVFNPEMSQ-YPVPETMQTSQPVDSNSDGLSAGTQKFFSAGRVGDTKSKGKFIRK 243

Query: 242 SSNQPVLSRALSSPVSMQEKNPNFLNQPNHMQAKQFKKAPHEFFGNSNPIASSSQSTPTP 301
            +NQP ++RA S  V +QE N +F +QP + Q KQFKKAPHEFFG S  + S++ STP  
Sbjct: 244 IANQPNITRASSVLVGIQESN-SFSHQPEYTQGKQFKKAPHEFFGASESVVSANGSTPNY 303

Query: 302 FIENSSHARTDGNVPMGSSSSYQPPHMGLARTDGNMSMGSSSSYQPPHMRQNNMQLHPPF 361
           F          GN          P   G+   +GN  +G+   + P  +R NN+     F
Sbjct: 304 F---------QGN----------PDSSGI---NGNNFIGNPQDHYPHPLRPNNIPTQASF 363

Query: 362 RPDNVFPPNSLNHNSFPV---------PGQPDLSAPNISKLHISDYPNYAINPQNFHHQ- 421
             +N++PPNS +H   P+         P  P  + P+IS+L +S+YPNYA NP NFH + 
Sbjct: 364 ASNNLYPPNSYSHGFRPMPPRSEGPRFPSAPPANVPDISRLSMSEYPNYAQNPPNFHQRI 423

Query: 422 AGEFRPHTKS--QNPNFNSPDKGRSQHGGHSFHHDALNKRHARDVVEYTPHSSSTTVTRS 481
            GE++P++      P  N P KG   H     + D  + R+     +   HSSS     S
Sbjct: 424 GGEYKPYSSESPHPPGLNVPQKGYLPHTSQLLYQDTSSNRYPGG-PDLPAHSSSPVGANS 483

Query: 482 LSYNDAWGSQGQPPPSEYIQGLIGVILLALNTLKTEKIMPTEANIADCIRYGDLRNSNTD 541
           +S N  WGSQG P PSEY+QGLIGVILL LNTLKTEKIMPTE NI+DCIR+GD ++ NTD
Sbjct: 484 VSSNGVWGSQGCPQPSEYVQGLIGVILLTLNTLKTEKIMPTEVNISDCIRHGDPKHQNTD 543

Query: 542 VKMALDSAIEHNMVVKQNLGAVQLYVGKTEKLWKCVNPLGGHPNQYQKAIWDKIHNCLAS 601
           V+ AL+SA+E  MVVKQNLGAVQLYVGK E+LWKCVNP+GG+PNQY KA WD+I   LA+
Sbjct: 544 VRKALESAVEQQMVVKQNLGAVQLYVGKKERLWKCVNPIGGNPNQYPKATWDRIQMFLAT 603

Query: 602 PAGRSAIMASRCRYEAALILKKECLTDFALGDVLQILNMITSMKKWITHHISGWQPINIM 661
             GRSAIMAS+C+YEAALIL+ +CL +FALGDVLQILNM+++MKKWI +H SGWQPI I 
Sbjct: 604 SIGRSAIMASQCKYEAALILRNKCLEEFALGDVLQILNMLSTMKKWIVNHQSGWQPIKIT 650

BLAST of ClCG01G007140 vs. TrEMBL
Match: A0A0D2U2K8_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_008G135500 PE=4 SV=1)

HSP 1 Score: 753.4 bits (1944), Expect = 2.3e-214
Identity = 401/676 (59.32%), Postives = 478/676 (70.71%), Query Frame = 1

Query: 5   VAPAAAPAGSAEPQYVRAKTSVWWDIENCQVPKGCDPHAIAQNISSALVKINYCGPVSIS 64
           +A A+   G+AEPQYV AKTSVWWDIENC VPK CDPHAIAQNISSAL K+NYCGPVSIS
Sbjct: 14  MAAASYGGGAAEPQYVSAKTSVWWDIENCHVPKNCDPHAIAQNISSALAKMNYCGPVSIS 73

Query: 65  AYGDTNRIPNSIQQALSSTGIALNHVPAGVKDASDKKILVDMLFWAVDNPAPANYLLISG 124
           AYGDTNRIP+S+QQALSSTGIALNHVPAGVKDASDKKILVDMLFWAVDNPAPANYLLISG
Sbjct: 74  AYGDTNRIPSSVQQALSSTGIALNHVPAGVKDASDKKILVDMLFWAVDNPAPANYLLISG 133

Query: 125 DRDFSNALHQLRMRRYNILLAQPQKASAPLVAAAKSIWLWMSLVAGGPPMSSAESSQLVN 184
           DRDFSNALHQLRMRRYNILLAQP KASAPLVAAAKS+WLWMSL AGGPP+SS ES++L N
Sbjct: 134 DRDFSNALHQLRMRRYNILLAQPMKASAPLVAAAKSVWLWMSLSAGGPPLSSGESTKLAN 193

Query: 185 GILTSDPQISQGSEFDHNQ-----QTGQAIVYKSENVSLGNQRSYSTERMGDNKHKGKYI 244
           G      Q S  SE  +N      Q  Q ++  SENV+LG Q   +  R GDNK+KGKYI
Sbjct: 194 G------QNSFNSEMSYNPIPEMVQYSQPMISSSENVTLG-QNVSNAGRNGDNKYKGKYI 253

Query: 245 QKSSNQPVLSRALSSPVSMQEKNPN--FLNQPNHMQAKQFKKAPHEFFGNSNPIASSSQS 304
           +K++NQP +SRA S+P +  ++N N  +  QP + Q K FKKAPHEFFG++ P  S+S+ 
Sbjct: 254 RKTTNQPSISRASSAPTTAIQENMNNGYSYQPEYAQTKTFKKAPHEFFGSNEPAVSASKF 313

Query: 305 TPTPFIENSSHARTDGNVPMGSSSSYQPPHMGLARTDGNMSMGSSSSYQPPHMRQNNMQL 364
           TP  F  N          P GS++S                MG   +  PP MR  N+ L
Sbjct: 314 TPNLFPSNPD--------PSGSNNS--------------NFMGVPQNPPPPSMRPINLPL 373

Query: 365 HPPFRPDNVFPPNSLNHNSFPVPGQPD--------LSAPNISKLHISDYPNYAINPQNFH 424
            P F  D + PPNS NH   P+P + +         + P++ KL+IS++  Y  N  NF 
Sbjct: 374 RPAFAQDKLLPPNSQNHGFRPIPPRVEGPRFPALFSNMPDVGKLNISEHSTYPQNSNNFP 433

Query: 425 HQAGE-FRPHTKSQNPN---FNSPDKGRSQHGGHSFHHDALNKRHARDVVEYTPHSSSTT 484
           HQ GE F+  +    PN    N+P +    H G +  HD  + R+ R   E+ P SSS  
Sbjct: 434 HQIGEKFKTSSVESMPNQTGLNAPQRSHF-HTGQASQHDTYSNRYPRG-PEFPPPSSSAI 493

Query: 485 VTRSLSYNDAWGSQGQPPPSEYIQGLIGVILLALNTLKTEKIMPTEANIADCIRYGDLRN 544
              S S N  WG++G+ PPSEY+QGLIGVILLALNTLK EKIMPTEANI DCIR+GD ++
Sbjct: 494 ---SSSSNGVWGAEGRSPPSEYVQGLIGVILLALNTLKNEKIMPTEANITDCIRFGDPKH 553

Query: 545 SNTDVKMALDSAIEHNMVVKQNLGAVQLYVGKTEKLWKCVNPLGGHPNQYQKAIWDKIHN 604
            NT+V+ ALDSAIE +MV+KQ+LGAVQLYVG+ EKLWKC+NP+GG+PNQY K  WD I  
Sbjct: 554 RNTNVRKALDSAIEQHMVLKQSLGAVQLYVGRNEKLWKCINPIGGNPNQYPKTTWDGIQK 613

Query: 605 CLASPAGRSAIMASRCRYEAALILKKECLTDFALGDVLQILNMITSMKKWITHHISGWQP 662
            L+SPAGRSA+ AS+CRYEAAL L+K CL +FALGDVLQILNMI +MKKWI HH SGWQP
Sbjct: 614 FLSSPAGRSAMTASQCRYEAALALRKGCLEEFALGDVLQILNMIIAMKKWIIHHQSGWQP 655

BLAST of ClCG01G007140 vs. TrEMBL
Match: A0A0B0PUB5_GOSAR (Limkain-b1 OS=Gossypium arboreum GN=F383_08550 PE=4 SV=1)

HSP 1 Score: 752.3 bits (1941), Expect = 5.1e-214
Identity = 400/676 (59.17%), Postives = 475/676 (70.27%), Query Frame = 1

Query: 5   VAPAAAPAGSAEPQYVRAKTSVWWDIENCQVPKGCDPHAIAQNISSALVKINYCGPVSIS 64
           +A A+   G+AEPQYV AKTSVWWDIENC VPK CDPHAIAQNISSAL K+NYCGPVSIS
Sbjct: 14  MAAASYGGGAAEPQYVSAKTSVWWDIENCHVPKNCDPHAIAQNISSALAKMNYCGPVSIS 73

Query: 65  AYGDTNRIPNSIQQALSSTGIALNHVPAGVKDASDKKILVDMLFWAVDNPAPANYLLISG 124
           AYGDTNRIP+S+QQALSSTGIALNHVPAGVKDASDKKILVDMLFWAVDNPAPANYLLISG
Sbjct: 74  AYGDTNRIPSSVQQALSSTGIALNHVPAGVKDASDKKILVDMLFWAVDNPAPANYLLISG 133

Query: 125 DRDFSNALHQLRMRRYNILLAQPQKASAPLVAAAKSIWLWMSLVAGGPPMSSAESSQLVN 184
           DRDFSNALHQLRMRRYNILLAQP KASAPLVAAAKS+WLWMSL AGGPP+SS ES++L N
Sbjct: 134 DRDFSNALHQLRMRRYNILLAQPMKASAPLVAAAKSVWLWMSLSAGGPPLSSGESTKLAN 193

Query: 185 GILTSDPQISQGSEFDHNQ-----QTGQAIVYKSENVSLGNQRSYSTERMGDNKHKGKYI 244
           G      Q S  SE  +N      Q  Q ++  SENV+LG Q   +  R GDNK+KGKYI
Sbjct: 194 G------QNSFNSEMSYNPIPEMVQYSQPLISSSENVTLG-QNVSNAGRNGDNKYKGKYI 253

Query: 245 QKSSNQPVLSRALSSPVSMQEKNPN--FLNQPNHMQAKQFKKAPHEFFGNSNPIASSSQS 304
           +K++NQP +SRA S+P +  ++N N  +  QP + Q K FKKAPHEFFG + P  S+S+ 
Sbjct: 254 RKTTNQPSISRASSAPTTAIQENMNNGYSYQPEYAQTKTFKKAPHEFFGGNEPAVSASKF 313

Query: 305 TPTPFIENSSHARTDGNVPMGSSSSYQPPHMGLARTDGNMSMGSSSSYQPPHMRQNNMQL 364
           TP  F  N          P GS++S                MG   +  PP MR  N+ L
Sbjct: 314 TPNLFPSNPD--------PSGSNNS--------------NFMGVPQNPPPPSMRPINLPL 373

Query: 365 HPPFRPDNVFPPNSLNHNSFPVPGQPD--------LSAPNISKLHISDYPNYAINPQNFH 424
            P F  D + PPNS NH   P+P + +         + P++ KL+IS++  Y  NP NF 
Sbjct: 374 RPAFAQDKLLPPNSQNHGFRPIPPRVEGPRFPALFSNMPDVGKLNISEHSTYPQNPNNFP 433

Query: 425 HQAGE-FRPHTKSQNPN---FNSPDKGRSQHGGHSFHHDALNKRHARDVVEYTPHSSSTT 484
           H+ GE F+  +    PN    N+P +    H G +  HD  + R+ R      P SS+  
Sbjct: 434 HRIGEKFKTSSVESMPNQTGLNAPQRSHF-HTGQASQHDTYSNRYPRGPEFPLPSSSAI- 493

Query: 485 VTRSLSYNDAWGSQGQPPPSEYIQGLIGVILLALNTLKTEKIMPTEANIADCIRYGDLRN 544
              S S N  WG++G+ PPSEY+QGLIGVILLALNTLK EKIMPTEANI DCIR+GD ++
Sbjct: 494 ---SSSSNGVWGAEGRSPPSEYVQGLIGVILLALNTLKNEKIMPTEANITDCIRFGDPKH 553

Query: 545 SNTDVKMALDSAIEHNMVVKQNLGAVQLYVGKTEKLWKCVNPLGGHPNQYQKAIWDKIHN 604
            NT+V+ ALD AIE +MV+KQ+LGAVQLYVG+ EKLWKCVNP+GG+PNQY K  WD I  
Sbjct: 554 RNTNVRKALDGAIEQHMVLKQSLGAVQLYVGRNEKLWKCVNPIGGNPNQYPKTTWDGIQK 613

Query: 605 CLASPAGRSAIMASRCRYEAALILKKECLTDFALGDVLQILNMITSMKKWITHHISGWQP 662
            L+SPAGRSAI AS+CRYEAAL L+K CL +FALGDVLQILNMI +MKKWI HH SGWQP
Sbjct: 614 FLSSPAGRSAITASQCRYEAALALRKGCLEEFALGDVLQILNMIIAMKKWIIHHQSGWQP 655

BLAST of ClCG01G007140 vs. TAIR10
Match: AT3G62200.1 (AT3G62200.1 Putative endonuclease or glycosyl hydrolase)

HSP 1 Score: 608.6 bits (1568), Expect = 4.6e-174
Identity = 347/695 (49.93%), Postives = 422/695 (60.72%), Query Frame = 1

Query: 2   NGDVAPAAAPAGSAEPQYVRAKTSVWWDIENCQVPKGCDPHAIAQNISSALVKINYCGPV 61
           +GD    + PA  AE QYVRAKTSVWWDIENCQVP G D H IAQNI+SAL K+NYCGPV
Sbjct: 8   DGDFGTNSVPAEMAEAQYVRAKTSVWWDIENCQVPNGLDAHGIAQNITSALQKMNYCGPV 67

Query: 62  SISAYGDTNRIPNSIQQALSSTGIALNHVPAGVKDASDKKILVDMLFWAVDNPAPANYLL 121
           SISAYGDTNRIP +IQ AL+STGIALNHVPAGVKDASDKKILVDMLFWA+DNPAPAN++L
Sbjct: 68  SISAYGDTNRIPLTIQHALNSTGIALNHVPAGVKDASDKKILVDMLFWALDNPAPANFML 127

Query: 122 ISGDRDFSNALHQLRMRRYNILLAQPQKASAPLVAAAKSIWLWMSLVAGGPPMSSAESSQ 181
           ISGDRDFSNALH LRMRRYN+LLAQP KAS PLV AAK++WLW SL AGG P++ AES Q
Sbjct: 128 ISGDRDFSNALHGLRMRRYNVLLAQPLKASVPLVHAAKTVWLWTSLSAGGIPLTRAESLQ 187

Query: 182 LVNGILTSDPQISQGSEFDHNQQTGQAIVYKSENVSLGNQRSYSTERMGDNKHKGKYIQK 241
           LV    T  P    GSE   +Q                   +  + R+ DNK K KY+ K
Sbjct: 188 LVANQTTPKP----GSEIPSSQPL---------------DSNSDSRRVFDNKSKVKYVPK 247

Query: 242 SSNQPVLSRALSSPVSMQEKNPNFLNQPNHMQAKQFKKAPHEFFGNSNPIASSSQSTPTP 301
            SN               + N N+  Q  + Q KQFKKAPHEFFG S P  S+S+  P P
Sbjct: 248 PSN--------------HQPNNNYRQQQQNTQGKQFKKAPHEFFGTSEPSVSTSR-PPPP 307

Query: 302 FIENSSHARTDGNVPMGSSSSYQ----------PPHMGLARTDGNMSMGSS-----SSYQ 361
            + +S+     GNV     +  Q          PP      TD + + G+S      +Y 
Sbjct: 308 NLPSSNVNTFPGNVMTNPQNQNQYTYPPRPGPFPPRQPYPNTDPSWNNGNSIPNHAQNYY 367

Query: 362 PPHMRQNNMQLHP-------PFRPDNVFPP-----NSLNH--NSFP-VPGQPDLSAPNIS 421
           P   R     + P       P+RP+N+ PP       + H  N  P  P  P L+  +IS
Sbjct: 368 PNAARPGAATMRPPYGNVFRPYRPENLNPPVGNGFRPMQHPRNDGPRFPSPPLLTPLDIS 427

Query: 422 KLHISDYPNYAINPQNFHHQA-GEFRPHTKSQNPNFNSPDKGR-SQHGGHSFHHDALNKR 481
            L +S YP+   N  NF+ Q   EFRP  +S   + N P+K    +              
Sbjct: 428 NLSVSQYPSQTQNRPNFNPQVRQEFRPKMESSYTH-NGPNKSYIPRCSSAPVTQSTTTTA 487

Query: 482 HARDVVEYTPHSSSTTVTRSLSYNDAWGSQGQPPPSEYIQGLIGVILLALNTLKTEKIMP 541
           H        P S    VT S S ND WG+Q  PPPSEY+QGLIGVIL AL+ LKTEK+MP
Sbjct: 488 HTYPSSPGVPPSQPPMVTGSGSSNDRWGTQECPPPSEYVQGLIGVILHALHILKTEKVMP 547

Query: 542 TEANIADCIRYGDLRNSNTDVKMALDSAIEHNMVVKQNLGAVQLYVGKTEKLWKCVNPLG 601
           TE NI+DCI+YGD ++  TDVK AL+SA+EH+M++  N+G ++LY+GK E LW CVNPLG
Sbjct: 548 TEPNISDCIQYGDPKHHGTDVKKALESALEHHMIMMTNVGKLKLYIGKNEALWNCVNPLG 607

Query: 602 GHPNQYQKAIWDKIHNCLASPAGRSAIMASRCRYEAALILKKECLTDFALGDVLQILNMI 661
            +  QY K  WD+I   L S +GR    A+ CRYEAA +LKKECL +F LGD+LQILN+ 
Sbjct: 608 ANAKQYPKETWDRIQQFLTSSSGRVEFTATTCRYEAAQVLKKECLKEFTLGDILQILNIT 666

Query: 662 TSMKKWITHHISGWQPINIMLTEVNTDASSRTELD 665
            + KKWITHH +GW+PI I L    T+ ++ TE D
Sbjct: 668 ATTKKWITHHQTGWKPITISLAAETTNETA-TEAD 666

BLAST of ClCG01G007140 vs. TAIR10
Match: AT5G61190.1 (AT5G61190.1 putative endonuclease or glycosyl hydrolase with C2H2-type zinc finger domain)

HSP 1 Score: 281.6 bits (719), Expect = 1.3e-75
Identity = 136/184 (73.91%), Postives = 155/184 (84.24%), Query Frame = 1

Query: 14  SAEPQYVRAKTSVWWDIENCQVPKGCDPHAIAQNISSALVKINYCGPVSISAYGDTNRIP 73
           +AE  YV+AKTSVWWDIENC+VP+G D H IA N+SS+L+K+NYCGPVSISAYGDTN IP
Sbjct: 3   TAEADYVKAKTSVWWDIENCEVPRGWDAHVIALNVSSSLLKMNYCGPVSISAYGDTNLIP 62

Query: 74  NSIQQALSSTGIALNHVPAGVKDASDKKILVDMLFWAVDNPAPANYLLISGDRDFSNALH 133
              QQALSSTG+ALNH+PAGVKDASDKKILVDML WA+DNPAPAN LLISGDRDFSNALH
Sbjct: 63  LHHQQALSSTGVALNHIPAGVKDASDKKILVDMLLWAIDNPAPANLLLISGDRDFSNALH 122

Query: 134 QLRMRRYNILLAQPQKASAPLVAAAKSIWLWMSLVAGGPPMSSAESSQLVN--GILTSDP 193
           QLRMRRYNILLAQP +AS PLVAAA+ +WLW  L +GGPP++S ESS L N  G   S+ 
Sbjct: 123 QLRMRRYNILLAQPPRASVPLVAAARDVWLWTVLASGGPPLTSVESSLLFNNGGFRVSNK 182

Query: 194 QISQ 196
            +S+
Sbjct: 183 GVSK 186

BLAST of ClCG01G007140 vs. TAIR10
Match: AT3G62210.1 (AT3G62210.1 Putative endonuclease or glycosyl hydrolase)

HSP 1 Score: 266.5 bits (680), Expect = 4.3e-71
Identity = 137/194 (70.62%), Postives = 152/194 (78.35%), Query Frame = 1

Query: 12  AGSAEPQYVRAKTSVWWDIENCQVPKGCDPHAIAQNISSALVKINYCGPVSISAYGDTNR 71
           A +AE QYV AKTSVWWDIENCQVPKG D H IAQNISSAL K+NYCG VSISAYGDT+ 
Sbjct: 12  ADTAEAQYVMAKTSVWWDIENCQVPKGLDAHGIAQNISSALKKMNYCGRVSISAYGDTSG 71

Query: 72  IPNSIQQALSSTGIALNHVPAGVKDASDKKILVDMLFWAVDNPAPANYLLISGDRDFSNA 131
           IP+ IQ AL+STGI L+HVPAGVKDASDKKILVDMLFWA DNPAP+N +LISGDRDFSNA
Sbjct: 72  IPHVIQHALNSTGIELHHVPAGVKDASDKKILVDMLFWAFDNPAPSNIMLISGDRDFSNA 131

Query: 132 LHQLRMRRYNILLAQPQKASAPLVAAAKSIWLWMSLVAGGPPM--SSAESSQLVNGILTS 191
           LH+L +RRYNILLA P KASAPL  AA ++WLW SL+AGG P+     ++SQLV    TS
Sbjct: 132 LHKLSLRRYNILLAHPPKASAPLSQAATTVWLWTSLLAGGNPLIRGKVKTSQLVANASTS 191

Query: 192 DPQISQGSEFDHNQ 204
              +S      HNQ
Sbjct: 192 SNVMSSP---PHNQ 202

BLAST of ClCG01G007140 vs. TAIR10
Match: AT5G61180.1 (AT5G61180.1 Putative endonuclease or glycosyl hydrolase)

HSP 1 Score: 233.8 bits (595), Expect = 3.1e-61
Identity = 110/178 (61.80%), Postives = 135/178 (75.84%), Query Frame = 1

Query: 14  SAEPQYVRAKTSVWWDIENCQVPKGCDPHAIAQNISSALVKINYCGPVSISAYGDTNRIP 73
           SA+  +  AKTSVWWDIENC+VPKGCDPH +AQ+I S L K N+CGP++I AYGDTN+IP
Sbjct: 72  SAKADFAGAKTSVWWDIENCEVPKGCDPHGVAQSIRSVLSKSNFCGPLTIYAYGDTNQIP 131

Query: 74  NSIQQALSSTGIALNHVPA------------------GVKDASDKKILVDMLFWAVDNPA 133
           +S+QQALSSTG++LNHVPA                  GVKD SDKK+LVD++ WA+DN A
Sbjct: 132 SSVQQALSSTGVSLNHVPAVSNGLIILYVLDDGEHLTGVKDGSDKKLLVDIMLWAMDNQA 191

Query: 134 PANYLLISGDRDFSNALHQLRMRRYNILLAQPQKASAPLVAAAKSIWLWMSLVAGGPP 174
           PAN +LISGD+DFS  LH+L M+RYNILLA+P+KAS PL+AAAK++WLW S+  G  P
Sbjct: 192 PANIMLISGDKDFSYLLHKLGMKRYNILLARPEKASTPLIAAAKTVWLWTSIFNGDCP 249

BLAST of ClCG01G007140 vs. TAIR10
Match: AT3G60940.1 (AT3G60940.1 Putative endonuclease or glycosyl hydrolase)

HSP 1 Score: 157.1 bits (396), Expect = 3.6e-38
Identity = 86/186 (46.24%), Postives = 113/186 (60.75%), Query Frame = 1

Query: 7   PAAAPAGSAEPQYVRAKTSVWWDIENCQVPKGCDPHAIAQNISSALVKINYCGPVSISAY 66
           P   P       +  AKT VWWD EN  VP+G D + I  NI +AL +  Y GP+SI A+
Sbjct: 66  PVRLPGYPQLVDFSTAKTQVWWDTENSPVPRGFDAYRIGGNIRNALNENGYRGPISIRAF 125

Query: 67  GDTNRIPNSIQQALSSTGIALNHVPAG-------VKDASDKKILVDMLFW-AVDNPAPAN 126
           G+   IP  IQ AL+STGI L HVP         +KDASD KI+  +L W A+++P P+N
Sbjct: 126 GNMRLIPTPIQLALTSTGIDLYHVPGNKVGSRKTIKDASDFKIIGHVLTWIALNHPQPSN 185

Query: 127 YLLISGDRDFSNALHQLRMRRYNILLAQPQKA-SAPLVAAAKSIWLWMSLVAGGPPMSSA 184
            ++I+GDRD+S ALHQLR R +NILLA P+ + S  L+ AA S+W W SL+ G  P++  
Sbjct: 186 LMVITGDRDYSVALHQLRCRSFNILLACPESSTSTALLRAATSVWKWNSLILGQKPLAEN 245

BLAST of ClCG01G007140 vs. NCBI nr
Match: gi|659072609|ref|XP_008466414.1| (PREDICTED: uncharacterized protein LOC103503825 [Cucumis melo])

HSP 1 Score: 1242.3 bits (3213), Expect = 0.0e+00
Identity = 603/665 (90.68%), Postives = 627/665 (94.29%), Query Frame = 1

Query: 1   MNGDVAPAAAPAGSAEPQYVRAKTSVWWDIENCQVPKGCDPHAIAQNISSALVKINYCGP 60
           MNGDVAPAA PA SAEPQY+RAKTSVWWDIENCQVPKGCDPHAIAQNISSALVKINYCGP
Sbjct: 1   MNGDVAPAATPAASAEPQYIRAKTSVWWDIENCQVPKGCDPHAIAQNISSALVKINYCGP 60

Query: 61  VSISAYGDTNRIPNSIQQALSSTGIALNHVPAGVKDASDKKILVDMLFWAVDNPAPANYL 120
           VSISAYGDTNRIPNSIQQALSSTGIALNHVPAGVKDASDKKILVDMLFWAVDNPAPANYL
Sbjct: 61  VSISAYGDTNRIPNSIQQALSSTGIALNHVPAGVKDASDKKILVDMLFWAVDNPAPANYL 120

Query: 121 LISGDRDFSNALHQLRMRRYNILLAQPQKASAPLVAAAKSIWLWMSLVAGGPPMSSAESS 180
           LISGDRDFSNALHQLRMRRYNILLAQPQKASAPLVAAAKS+WLWMSLVAGGPPMSS ESS
Sbjct: 121 LISGDRDFSNALHQLRMRRYNILLAQPQKASAPLVAAAKSVWLWMSLVAGGPPMSSTESS 180

Query: 181 QLVNGILTSDPQISQGSEFDHNQQTGQAIVYKSENVSLGNQRSYSTERMGDNKHKGKYIQ 240
           QLVNGI TS+PQISQ S FD N  TGQAIV+K ENV+LGNQRSYSTER GDNKHKGKY+Q
Sbjct: 181 QLVNGIPTSEPQISQTSGFDQNMHTGQAIVHKPENVNLGNQRSYSTERTGDNKHKGKYVQ 240

Query: 241 KSSNQPVLSRALSSPVSMQEKNPNFLNQPNHMQAKQFKKAPHEFFGNSNPIASSSQSTPT 300
           KSSNQPV+SRALSSP SMQEKNPNFLNQPNHMQAKQFKKAPHEFFGNSNP+ SSSQSTP 
Sbjct: 241 KSSNQPVISRALSSPASMQEKNPNFLNQPNHMQAKQFKKAPHEFFGNSNPVGSSSQSTPN 300

Query: 301 PFIENSSHARTDGNVPMGSSSSYQPPHMGLARTDGNMSMGSSSSYQPPHMRQNNMQLHPP 360
            FIENSSHARTD N  +GSSS +QPPH+  AR+DGN+SMG+SSSYQPPHMRQNNMQLHPP
Sbjct: 301 LFIENSSHARTDANGSIGSSSFHQPPHLNHARSDGNISMGNSSSYQPPHMRQNNMQLHPP 360

Query: 361 FRPDNVFPPNSLNHNSFPVPGQPDLSAPNISKLHISDYPNYAINPQNFHHQAGEFRPHTK 420
           FRPDNVFPPNSLNHNSFPVPGQP+LSAPNIS+LHISDYPNY IN QNFH Q GEFRPH+K
Sbjct: 361 FRPDNVFPPNSLNHNSFPVPGQPELSAPNISQLHISDYPNYPINSQNFHQQTGEFRPHSK 420

Query: 421 SQNP-NFNSPDKGRSQHGGHSFHHDALNKRHARDVVEYTPHSSSTTVTRSLSYNDAWGSQ 480
           SQNP NFN+PDKGRSQHGG SFHHDALNKRHARD VEY PHSSST VTRSLS+ND WGSQ
Sbjct: 421 SQNPANFNAPDKGRSQHGGQSFHHDALNKRHARDAVEYAPHSSSTIVTRSLSHNDGWGSQ 480

Query: 481 GQPPPSEYIQGLIGVILLALNTLKTEKIMPTEANIADCIRYGDLRNSNTDVKMALDSAIE 540
           GQPPPSEYIQGLIGVILLALNTLK EKIMP EANIADCIRYGDLRN NTDVKMALDSA+E
Sbjct: 481 GQPPPSEYIQGLIGVILLALNTLKVEKIMPIEANIADCIRYGDLRNCNTDVKMALDSAVE 540

Query: 541 HNMVVKQNLGAVQLYVGKTEKLWKCVNPLGGHPNQYQKAIWDKIHNCLASPAGRSAIMAS 600
           HNMVVKQNLGAVQLYVGKTEKLWKCVNPLGG+PNQY KAIWDKI NCLASPAGRSA+MAS
Sbjct: 541 HNMVVKQNLGAVQLYVGKTEKLWKCVNPLGGYPNQYPKAIWDKIRNCLASPAGRSAMMAS 600

Query: 601 RCRYEAALILKKECLTDFALGDVLQILNMITSMKKWITHHISGWQPINIMLTEVNTDASS 660
           RCRYEAALILKKECLTDFALGDVLQIL+MITSMKKWITHHISGWQPINI+L E NTDASS
Sbjct: 601 RCRYEAALILKKECLTDFALGDVLQILHMITSMKKWITHHISGWQPINIILAEGNTDASS 660

Query: 661 RTELD 665
           RTELD
Sbjct: 661 RTELD 665

BLAST of ClCG01G007140 vs. NCBI nr
Match: gi|778698526|ref|XP_011654555.1| (PREDICTED: uncharacterized protein LOC101219837 [Cucumis sativus])

HSP 1 Score: 1217.2 bits (3148), Expect = 0.0e+00
Identity = 593/665 (89.17%), Postives = 617/665 (92.78%), Query Frame = 1

Query: 1   MNGDVAPAAAPAGSAEPQYVRAKTSVWWDIENCQVPKGCDPHAIAQNISSALVKINYCGP 60
           MNGDVAPAA PA SAEPQY+RAKTSVWWDIENCQVPKGCDPHAIAQNISSALVKINYCGP
Sbjct: 1   MNGDVAPAATPATSAEPQYIRAKTSVWWDIENCQVPKGCDPHAIAQNISSALVKINYCGP 60

Query: 61  VSISAYGDTNRIPNSIQQALSSTGIALNHVPAGVKDASDKKILVDMLFWAVDNPAPANYL 120
           VSISAYGDTNRIPNSIQQALSSTGIALNHVPAGVKDASDKKILVDMLFWAVDNPAPANYL
Sbjct: 61  VSISAYGDTNRIPNSIQQALSSTGIALNHVPAGVKDASDKKILVDMLFWAVDNPAPANYL 120

Query: 121 LISGDRDFSNALHQLRMRRYNILLAQPQKASAPLVAAAKSIWLWMSLVAGGPPMSSAESS 180
           LISGDRDFSNALHQLRMRRYNILLAQPQKASAPLVAAAKS+WLWMSLVAGG P+SS ESS
Sbjct: 121 LISGDRDFSNALHQLRMRRYNILLAQPQKASAPLVAAAKSVWLWMSLVAGGLPISSTESS 180

Query: 181 QLVNGILTSDPQISQGSEFDHNQQTGQAIVYKSENVSLGNQRSYSTERMGDNKHKGKYIQ 240
           QLVNGI TS+PQISQ S FDHNQ TGQAIVYK ENV+LGNQRSYSTERMGDNKHKGKY+Q
Sbjct: 181 QLVNGIPTSEPQISQTSGFDHNQHTGQAIVYKPENVNLGNQRSYSTERMGDNKHKGKYVQ 240

Query: 241 KSSNQPVLSRALSSPVSMQEKNPNFLNQPNHMQAKQFKKAPHEFFGNSNPIASSSQSTPT 300
           K+SNQPV+SRALSSP SMQEKNPNFLNQPNHMQAKQFKKAPHEFFGN NP+ SSSQS P 
Sbjct: 241 KNSNQPVISRALSSPASMQEKNPNFLNQPNHMQAKQFKKAPHEFFGNGNPVGSSSQSIPN 300

Query: 301 PFIENSSHARTDGNVPMGSSSSYQPPHMGLARTDGNMSMGSSSSYQPPHMRQNNMQLHPP 360
            FIENSSHAR DGN  MGSSS YQP H+  AR+DGN+SM +SSSYQPPHMRQNNMQLHPP
Sbjct: 301 LFIENSSHARIDGNGSMGSSSCYQPSHLAHARSDGNISMSNSSSYQPPHMRQNNMQLHPP 360

Query: 361 FRPDNVFPPNSLNHNSFPVPGQPDLSAPNISKLHISDYPNYAINPQNFHHQAGEFRPHTK 420
           FRPDNVFPPNSLNHN FPV GQPDL APNIS+LHISDYPNY INPQNFH Q GEFRPH+K
Sbjct: 361 FRPDNVFPPNSLNHNPFPVLGQPDLPAPNISQLHISDYPNYPINPQNFHQQTGEFRPHSK 420

Query: 421 SQNP-NFNSPDKGRSQHGGHSFHHDALNKRHARDVVEYTPHSSSTTVTRSLSYNDAWGSQ 480
           SQNP NFN+PDK RS HGG SFHHDALNKRHARD VEYTPHSS TTVTRSLS+ND WGSQ
Sbjct: 421 SQNPANFNAPDKSRSHHGGQSFHHDALNKRHARDAVEYTPHSSFTTVTRSLSHNDGWGSQ 480

Query: 481 GQPPPSEYIQGLIGVILLALNTLKTEKIMPTEANIADCIRYGDLRNSNTDVKMALDSAIE 540
           GQPPPSEYIQGLIGVILLALNTLK EKIMP E NIA+CIRYGDLRN NTDVKMALDSAIE
Sbjct: 481 GQPPPSEYIQGLIGVILLALNTLKVEKIMPKEENIAECIRYGDLRNCNTDVKMALDSAIE 540

Query: 541 HNMVVKQNLGAVQLYVGKTEKLWKCVNPLGGHPNQYQKAIWDKIHNCLASPAGRSAIMAS 600
           HNMVVKQ +G +QLYVGKTEKLWKCVNPLGG+PNQY KAIWDKIH  LASPAGRSA+MAS
Sbjct: 541 HNMVVKQEIGELQLYVGKTEKLWKCVNPLGGYPNQYPKAIWDKIHYFLASPAGRSAMMAS 600

Query: 601 RCRYEAALILKKECLTDFALGDVLQILNMITSMKKWITHHISGWQPINIMLTEVNTDASS 660
           RCRYEAALILKKECLTDFALGDVLQIL+MITSMKKWITHH SGWQPINI+L E NTDASS
Sbjct: 601 RCRYEAALILKKECLTDFALGDVLQILHMITSMKKWITHHNSGWQPINIILAEGNTDASS 660

Query: 661 RTELD 665
           RTELD
Sbjct: 661 RTELD 665

BLAST of ClCG01G007140 vs. NCBI nr
Match: gi|1009148180|ref|XP_015891799.1| (PREDICTED: uncharacterized protein LOC107426200 [Ziziphus jujuba])

HSP 1 Score: 776.9 bits (2005), Expect = 2.8e-221
Identity = 413/693 (59.60%), Postives = 485/693 (69.99%), Query Frame = 1

Query: 8   AAAPAGSAEPQYVRAKTSVWWDIENCQVPKGCDPHAIAQNISSALVKINYCGPVSISAYG 67
           ++A AG AEPQYV+AKTSVWWDIENCQVPKG DPHAIAQNISSALVKINYCGPVSISAYG
Sbjct: 20  SSARAGLAEPQYVKAKTSVWWDIENCQVPKGSDPHAIAQNISSALVKINYCGPVSISAYG 79

Query: 68  DTNRIPNSIQQALSSTGIALNHVPAGVKDASDKKILVDMLFWAVDNPAPANYLLISGDRD 127
           DTNRIP S+Q ALSSTGIALNHVPAGVKDASDKKILVDMLFWAVDNPAPANYLLISGDRD
Sbjct: 80  DTNRIPASVQHALSSTGIALNHVPAGVKDASDKKILVDMLFWAVDNPAPANYLLISGDRD 139

Query: 128 FSNALHQLRMRRYNILLAQPQKASAPLVAAAKSIWLWMSLVAGGPPMSSAESSQLVNGIL 187
           FSNALHQLRMRRYNILLAQPQKASAPL+AAAKS+WLW SL AGG P+S+ ESSQL NG  
Sbjct: 140 FSNALHQLRMRRYNILLAQPQKASAPLIAAAKSVWLWTSLSAGGSPLSNGESSQLANGNH 199

Query: 188 TSDPQISQ--GSEFDHNQQTGQAIVYKSENVSLGNQRSYSTERMGDNKHKGKYIQKSSNQ 247
           + +P+  Q  GSE     QT Q ++Y  E +SLGNQ+  +  R GD K KGK ++K+SNQ
Sbjct: 200 SFNPETLQHPGSE---PFQTNQPMIY-HETLSLGNQKPNTIGRTGDPKLKGKLVRKTSNQ 259

Query: 248 PVLSRALSSPVSMQE-KNPNFLNQPNHMQAKQFKKAPHEFFGNSNPIASSSQSTPTPFIE 307
           P++SRA + PV  QE KN +   Q  + QAKQFKKAPHE+FG + P+ S+S+ST T F  
Sbjct: 260 PIISRAPNVPVVTQESKNTDHPYQQEYAQAKQFKKAPHEYFGPNEPVVSASRSTTTNFFP 319

Query: 308 NSSHARTDGNVPMGSSSSYQPPHMGLARTDGNMSMGSSSSYQPPHMRQNNMQLHPPFRPD 367
            +S   + GNV                       +G+  S+ PP +R NN  + P F  D
Sbjct: 320 GNSDP-SGGNV--------------------YNLLGNPQSHYPPPLRPNNFHMQPTFGQD 379

Query: 368 NVFPPNSLNHNSFPVPGQPD---------LSAPNISKLHISDYPNYAINPQNFHHQAGEF 427
           N+ PPN  NH   PVP +PD          + P++ KL I +Y NY  NPQNFHH+ GE 
Sbjct: 380 NLHPPNFHNHGFRPVPTRPDGPRFSSAPPTNIPDVGKLGILEYSNYVQNPQNFHHRNGEE 439

Query: 428 ---------------------------RPHTKSQNPNFNSPDKGRSQHGGHSFHHDALNK 487
                                      RP   + + + N   KG + +GG +FHHDA+N 
Sbjct: 440 CKPRPADKPRPADKPRPADKPRPANKPRPAESANSASLNISQKGHNFNGGQAFHHDAINN 499

Query: 488 RHARDVVEYTPHSSSTTVTRSLSYNDAWGSQGQPPPSEYIQGLIGVILLALNTLKTEKIM 547
           R+     +Y P SSS  V  S+S N  WG+QG  PP EY+QGLIGVILLALNTLK EKIM
Sbjct: 500 RYPPG-SDYVPVSSSPVVANSVSSNGIWGTQGCAPPPEYVQGLIGVILLALNTLKVEKIM 559

Query: 548 PTEANIADCIRYGDLRNSNTDVKMALDSAIEHNMVVKQNLGAVQLYVGKTEKLWKCVNPL 607
           PTE NI DCIRYGD ++ NTDVK ALD AIE +MVVKQNLGAVQLYVGK EKLWKCVN +
Sbjct: 560 PTEVNITDCIRYGDPKHCNTDVKKALDCAIEQHMVVKQNLGAVQLYVGKNEKLWKCVNLI 619

Query: 608 GGHPNQYQKAIWDKIHNCLASPAGRSAIMASRCRYEAALILKKECLTDFALGDVLQILNM 662
           GG+ N Y K +WD++   LAS AGRSA++AS+CRYEAALILKK CL + +LG+VLQ+LNM
Sbjct: 620 GGNVNHYPKPMWDRVEKFLASSAGRSALLASQCRYEAALILKKSCLEELSLGNVLQVLNM 679

BLAST of ClCG01G007140 vs. NCBI nr
Match: gi|590720180|ref|XP_007051260.1| (Endonuclease or glycosyl hydrolase with C2H2-type zinc finger domain, putative isoform 1 [Theobroma cacao])

HSP 1 Score: 767.3 bits (1980), Expect = 2.2e-218
Identity = 406/663 (61.24%), Postives = 481/663 (72.55%), Query Frame = 1

Query: 13  GSAEPQYVRAKTSVWWDIENCQVPKGCDPHAIAQNISSALVKINYCGPVSISAYGDTNRI 72
           G+ E QYV AKTSVWWDIENCQVPK CDPHAIAQNISSALVK+NYCGPVSISAYGDTNRI
Sbjct: 27  GTPEAQYVAAKTSVWWDIENCQVPKSCDPHAIAQNISSALVKMNYCGPVSISAYGDTNRI 86

Query: 73  PNSIQQALSSTGIALNHVPAGVKDASDKKILVDMLFWAVDNPAPANYLLISGDRDFSNAL 132
           P+S+QQALSSTGIALNHVPAGVKDASDKKILVDMLFWAVDNPAPANYLLISGDRDFSNAL
Sbjct: 87  PSSVQQALSSTGIALNHVPAGVKDASDKKILVDMLFWAVDNPAPANYLLISGDRDFSNAL 146

Query: 133 HQLRMRRYNILLAQPQKASAPLVAAAKSIWLWMSLVAGGPPMSSAESSQLVNGILTSDPQ 192
           HQLRMRRYNILLAQPQKASAPLVAAAKS+WLW SL AGGPP+SS ESS+L NG  + + +
Sbjct: 147 HQLRMRRYNILLAQPQKASAPLVAAAKSVWLWTSLSAGGPPLSSGESSKLANGHSSFNSE 206

Query: 193 ISQGSEFDHNQQTGQAIVYKSENVSLGNQRSYSTERMGDNKHKGKYIQKSSNQPVLSRAL 252
           +   +         Q +V+ SENV+LGNQ   +  R GD+K+KGKYI+K+ NQP +SRA 
Sbjct: 207 MLY-NPIPETVLYSQPMVFSSENVALGNQNVSNAGRNGDSKYKGKYIRKTPNQPSISRAS 266

Query: 253 SSPVSMQEKNPN--FLNQPNHMQAKQFKKAPHEFFGNSNPIASSSQSTPTPFIENSSHAR 312
           S P S  ++N N  +  QP + QAK FKKAPHEFFG S    S+S+STP  F  N +   
Sbjct: 267 SVPTSSIQENMNNGYSYQPEYAQAKSFKKAPHEFFGGSEAAVSASKSTPNFFPSNPN--- 326

Query: 313 TDGNVPMGSSSSYQPPHMGLARTDGNMSMGSSSSYQPPHMRQNNMQLHPPFRPDNVFPPN 372
                P GS+             +GN  MG   ++ P  +R NN+ L P F  +N+ PPN
Sbjct: 327 -----PPGSN-------------NGNF-MGIHQNH-PHSLRPNNLPLQPAFAQENLLPPN 386

Query: 373 SLNHNSFPVPGQ--------PDLSAPNISKLHISDYPNYAINPQNFHHQAG-EFRPHTKS 432
           S NH   P+P +        P  + P+I KL+IS++  YA NP NFHH+ G EF+  +  
Sbjct: 387 SQNHGFRPMPPRVEGPRFPAPPSNMPDIGKLNISEHSTYAQNPSNFHHRIGEEFKTSSIE 446

Query: 433 QNPN---FNSPDKGRSQHGGHSFHHDALNKRHARDVVEYTPHSSSTTVTRSLSYNDAWGS 492
             PN    N+P K    HGG +  HD  N R+ R   E+ P SSS  ++ S S N  WG+
Sbjct: 447 SLPNQASLNAPQKSLVLHGGQASQHDTFNNRYPRS-PEFPPPSSS-AISNSPS-NGTWGT 506

Query: 493 QGQPPPSEYIQGLIGVILLALNTLKTEKIMPTEANIADCIRYGDLRNSNTDVKMALDSAI 552
           QG+ PPSEY+QGLIGVILLALNTLK EKIMPTEANI DCIRYGD ++ NTDV+ ALDSAI
Sbjct: 507 QGRSPPSEYVQGLIGVILLALNTLKIEKIMPTEANITDCIRYGDPKHRNTDVRKALDSAI 566

Query: 553 EHNMVVKQNLGAVQLYVGKTEKLWKCVNPLGGHPNQYQKAIWDKIHNCLASPAGRSAIMA 612
           E +MV+KQ+LGA+QLYVG+ EKLWKCVNP+GG+PNQ+ K  WD I   L+SPAG+SA+MA
Sbjct: 567 EQHMVLKQSLGALQLYVGRNEKLWKCVNPIGGNPNQFSKTTWDGIQKFLSSPAGQSAMMA 626

Query: 613 SRCRYEAALILKKECLTDFALGDVLQILNMITSMKKWITHHISGWQPINIMLTEVNTDAS 662
           S+CRYEAAL LK  CL +FALGDVLQILNMI +MKKWI HH SGWQPI + L E   +  
Sbjct: 627 SQCRYEAALALKDACLEEFALGDVLQILNMIIAMKKWIIHHQSGWQPITVTLPETKMEMG 662

BLAST of ClCG01G007140 vs. NCBI nr
Match: gi|694370679|ref|XP_009363078.1| (PREDICTED: uncharacterized protein LOC103953079 [Pyrus x bretschneideri])

HSP 1 Score: 765.8 bits (1976), Expect = 6.4e-218
Identity = 402/676 (59.47%), Postives = 478/676 (70.71%), Query Frame = 1

Query: 1   MNGDVAPAAAPA-GSAEPQYVRAKTSVWWDIENCQVPKGCDPHAIAQNISSALVKINYCG 60
           +NG    A AP+ G AE QYV AKTSVWWDIENCQVPK CD HAIAQNISSALVK+NYCG
Sbjct: 5   VNGSTTGAGAPSMGMAEAQYVNAKTSVWWDIENCQVPKVCDVHAIAQNISSALVKMNYCG 64

Query: 61  PVSISAYGDTNRIPNSIQQALSSTGIALNHVPAGVKDASDKKILVDMLFWAVDNPAPANY 120
           PVSISAYGDTN IP S+Q ALSSTGIALNHVPAGVKDASDKKILVDMLFWAVDN APANY
Sbjct: 65  PVSISAYGDTNGIPASVQHALSSTGIALNHVPAGVKDASDKKILVDMLFWAVDNSAPANY 124

Query: 121 LLISGDRDFSNALHQLRMRRYNILLAQPQKASAPLVAAAKSIWLWMSLVAGGPPMSSAES 180
           LLISGDRDFSNALHQLRMRRYNILLAQPQKASAPLVAAAKS+WLW SL AGGPP+SS ES
Sbjct: 125 LLISGDRDFSNALHQLRMRRYNILLAQPQKASAPLVAAAKSVWLWTSLSAGGPPLSSGES 184

Query: 181 SQLVNGILTSDPQISQGSEFDHNQQTGQAIVYKSENVSLGNQRSYSTERMGDNKHKGKYI 240
           SQL NG  + +P+++Q S  +        + Y+     LGNQ+  ++ R+GD K+KGK  
Sbjct: 185 SQLANGNNSYNPEMAQHSMPETFNINPPPVYYEH---PLGNQKPSTSGRVGDTKNKGKNN 244

Query: 241 QKSSNQPVLSRALSSPVSMQ-EKNPNFLNQPNHMQAKQFKKAPHEFFGNSNPIASSSQST 300
           +K+ NQP +SR  S PV  Q +KN ++  Q  H  AKQFKKAPHEFFG+ +   S+S+S 
Sbjct: 245 RKNPNQPNISRVSSMPVGNQDDKNTDYFYQSEHTHAKQFKKAPHEFFGSGDTPVSNSRSP 304

Query: 301 PTPFIENSSHARTDGNVPMGSSSSYQPPHMGLARTDGNMSMGSSSSYQPPHMRQNNMQLH 360
           P  F  NS  + +DGN  +G  + Y PP                        R NN  + 
Sbjct: 305 PNFFHGNSDPSGSDGNSFLGQPNQYPPP-----------------------QRPNNFHMQ 364

Query: 361 PPFRPDNVFPPNSLNHNSFPVP---------GQPDLSAPNISKLHISDYPNYAINPQNFH 420
           P F PD++ PPNS ++   P+P           P  + P++SKL+IS+Y NYA NPQ F 
Sbjct: 365 PNFGPDSMLPPNSHSYGLRPIPPRPGGPRFTSAPPTNVPDMSKLNISEYNNYAQNPQRFP 424

Query: 421 HQAGE--FRPHTKS--QNPNFNSPDKGRSQHGGHSFHHDALNKRHARDVVEYTPHSSSTT 480
           H+ GE   RP +     + + N P KG +   G +FHHD++N R+ R   EY P  SS  
Sbjct: 425 HRNGEESSRPRSSDSLNSASLNVPYKGHNMQSGQAFHHDSMNNRYPRG-SEYRPPQSSPA 484

Query: 481 VTRSLSYNDAWGSQGQPPPSEYIQGLIGVILLALNTLKTEKIMPTEANIADCIRYGDLRN 540
              ++  N  WG+QG  PPSEY+QGLIGVILLALNTLK EKIMPTEANI DCIRYGDL++
Sbjct: 485 AGNNIPSNGTWGAQGCTPPSEYVQGLIGVILLALNTLKVEKIMPTEANITDCIRYGDLKH 544

Query: 541 SNTDVKMALDSAIEHNMVVKQNLGAVQLYVGKTEKLWKCVNPLGGHPNQYQKAIWDKIHN 600
            NTDV+ ALD AIE +MVVKQ+LGA+QLYVGK EKLWKCVNP+GG+ NQY KA W++I N
Sbjct: 545 RNTDVRKALDYAIEQHMVVKQSLGALQLYVGKNEKLWKCVNPIGGNLNQYSKATWERIQN 604

Query: 601 CLASPAGRSAIMASRCRYEAALILKKECLTDFALGDVLQILNMITSMKKWITHHISGWQP 660
            L+S  GRSAIMAS+CRYEAA+IL+K C  + ALG+VLQILNMI SMKKWI HH SGWQP
Sbjct: 605 FLSSSYGRSAIMASQCRYEAAIILRKACSEELALGNVLQILNMIVSMKKWIIHHQSGWQP 653

Query: 661 INIMLTEVNTDASSRT 662
           I   L E N + ++ T
Sbjct: 665 ITFTLEETNAETAAET 653

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
MARF1_BOVIN2.6e-0928.86Meiosis arrest female protein 1 OS=Bos taurus GN=MARF1 PE=3 SV=2[more]
MARF1_HUMAN2.6e-0928.86Meiosis arrest female protein 1 OS=Homo sapiens GN=KIAA0430 PE=1 SV=6[more]
MARF1_CHICK5.7e-0928.86Meiosis arrest female protein 1 homolog OS=Gallus gallus GN=MARF1 PE=3 SV=1[more]
MARF1_MOUSE1.7e-0827.52Meiosis arrest female protein 1 OS=Mus musculus GN=Marf1 PE=1 SV=3[more]
MARF1_RAT1.7e-0827.52Meiosis arrest female protein 1 OS=Rattus norvegicus GN=Marf1 PE=1 SV=2[more]
Match NameE-valueIdentityDescription
A0A0A0KJL9_CUCSA0.0e+0089.17Uncharacterized protein OS=Cucumis sativus GN=Csa_5G119690 PE=4 SV=1[more]
A0A061DZG7_THECC1.5e-21861.24Endonuclease or glycosyl hydrolase with C2H2-type zinc finger domain, putative i... [more]
F6HZE4_VITVI2.4e-21659.08Putative uncharacterized protein OS=Vitis vinifera GN=VIT_07s0005g02300 PE=4 SV=... [more]
A0A0D2U2K8_GOSRA2.3e-21459.32Uncharacterized protein OS=Gossypium raimondii GN=B456_008G135500 PE=4 SV=1[more]
A0A0B0PUB5_GOSAR5.1e-21459.17Limkain-b1 OS=Gossypium arboreum GN=F383_08550 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT3G62200.14.6e-17449.93 Putative endonuclease or glycosyl hydrolase[more]
AT5G61190.11.3e-7573.91 putative endonuclease or glycosyl hydrolase with C2H2-type zinc fing... [more]
AT3G62210.14.3e-7170.62 Putative endonuclease or glycosyl hydrolase[more]
AT5G61180.13.1e-6161.80 Putative endonuclease or glycosyl hydrolase[more]
AT3G60940.13.6e-3846.24 Putative endonuclease or glycosyl hydrolase[more]
Match NameE-valueIdentityDescription
gi|659072609|ref|XP_008466414.1|0.0e+0090.68PREDICTED: uncharacterized protein LOC103503825 [Cucumis melo][more]
gi|778698526|ref|XP_011654555.1|0.0e+0089.17PREDICTED: uncharacterized protein LOC101219837 [Cucumis sativus][more]
gi|1009148180|ref|XP_015891799.1|2.8e-22159.60PREDICTED: uncharacterized protein LOC107426200 [Ziziphus jujuba][more]
gi|590720180|ref|XP_007051260.1|2.2e-21861.24Endonuclease or glycosyl hydrolase with C2H2-type zinc finger domain, putative i... [more]
gi|694370679|ref|XP_009363078.1|6.4e-21859.47PREDICTED: uncharacterized protein LOC103953079 [Pyrus x bretschneideri][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR021139NYN_limkain-b1
IPR024768Marf1
IPR025677OST-HTH-assoc_dom
Vocabulary: Cellular Component
TermDefinition
GO:0005777peroxisome
Vocabulary: Biological Process
TermDefinition
GO:0010468regulation of gene expression
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0010468 regulation of gene expression
cellular_component GO:0005777 peroxisome
molecular_function GO:0003674 molecular_function
molecular_function GO:0016787 hydrolase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
ClCG01G007140.1ClCG01G007140.1mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR021139NYN domain, limkain-b1-typePFAMPF01936NYNcoord: 23..159
score: 1.8
IPR024768Meiosis arrest female protein 1PANTHERPTHR14379LIMKAIN B LKAPcoord: 1..661
score: 8.9E
IPR025677OST-HTH associated domainPFAMPF14418OHAcoord: 599..654
score: 1.