Clc01G20790 (gene) Watermelon (cordophanus) v2

Overview
Name: Clc01G20790
Type: gene
Organism: Citrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
Description: Protein MID1-COMPLEMENTING ACTIVITY 1 isoform X1
Location: ClcChr01: 32653506 .. 32658096 (+)
RNA-Seq Expression: Clc01G20790
Synteny: Clc01G20790
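The Location field above can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for GFF-style annotations, but not stated on this page); the function names `parse_location` and `span_length` are hypothetical.

```python
import re

# Pattern for strings shaped like "ClcChr01: 32653506 .. 32658096 (+)".
LOCATION_RE = re.compile(r"^\s*(\S+?):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text):
    """Split a location string into (chromosome, start, end, strand)."""
    match = LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end, strand = parse_location("ClcChr01: 32653506 .. 32658096 (+)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the gene spans 4,591 bp on the plus strand of ClcChr01.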
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCTTCGTGGGATAGTCTTGGGGACGTTGCCGGTGTGGCCCAGCTAACGGGTATCAATGCAGTTCAACTGATTTCAATGATTGTAAAAGCAGCAAACACAGCAAGGATGCACAAGAAGAACTGCAAGCAATTTGCACAACATCTCAAGTTGATCGGGAACTTATTGGATCAACTCAAGATCTCAGAGCTGAAGAAATATCCTGAGACTCGAGAGCCTCTAGAGCAGCTGGAGGATGCCTTAAGAAAATCATACATTTTGATCAATAGTTGCCAGGATCGTAGCTATCTCTATTTGTTGGCTATGGGATGGAATGTTGTTTATCAATTCAGGAAGGCTCAAAGTGAAATCGATAGATACTTAAGGCTTGTCCCCCTGATTAATTTGGTGGACAATGCTCGAGTCAGAGTAATAATTCACTTTCTTTTGCTGTAATTTTGCTTTTTCTCTTTTGTTAGTTTCACAAATTTATCGCTGTTTTAATCCGTTTTGTTTACTGACAAATGGATATTCATCAATTATTGCCTATTAAAACAAAAACAAGTCCATTGTAGTGTCCTTAGAACCTGTCCTTTTAATACGAATCATTCTATCTTCATTTTGGAAGTAAAAATGAAAAATGGTTCCTAATAGTTTGATGCTTGTGCTTAAGATGGCCTAACTCATCTGAAGTCATTTTCTCTGTTATTTTTTACTCAGGAGAGGCTTGATGATATTGAAAAGCATCAGTGTGAGTATACATTTGAGGAGGATGACAGAAGGATCCAGGATGTGATCCTCAAACCAGAATCTGTCAAGAACGATGCTTCGATATTGAAAAAAACTCTTTCTCGTTCCTACCCAAACTTGGGCCTCCATGATGCACTTCAAAAGGAAAATGAAAAACTTCAGCTTGAGCTGCAAATATCTCAATCTAATATGGATGTGGGGCAATGTCAAATAATTGAACGATTATTTGATATCACAGAAGCCTTGTCTGCAAATTATTTTATAGAAAAGGATTTACAGAGAGGCATTCCGACACAACATGAATATAAATACCCTGAGGCCAATGGTGAAACTGCTCATGCGTATGATGGAAGTTTTAACAAGAATAGAGATGCTATTACGGCAAGGTAGGTTTGTCATTGTTTTAGTATTTTCTTTGTGTTTTTGTGGGAATATCTATGGATATGATTATTCAGATGCCAGCAGTTTTAGCGGTCTTCCATATTGTTAGGATTAACAAACTCCCGATATTGTCCACTCTAAACATGACCACATGAGCACTCATGCTTTGTTTTTGTTTCACCAAAAACGCTTTATAACGATGGAGATGTTGTTTCTCCCTTATAACTTACGATTTTCTCCTTTTCCTAGATAATGTTGCACATTGGTTGCACTCGCAGCACGAATGATGAAAGTATTACTAGAATAAAGATGACAATATAGCAGAGTTGGTATTTCTCCCTTCTGTATGCCAGACGTTTATTCAATTTATTACTCGAAAGCACGCACCCTTACTGAAATTCAGTTGAGGAAATCAAAGATATATATTCTAACTTCTAACTTGGATAAAGTTACATGATTTGGGCGGTTATTTCATTTAAAATTCATTAGGTTATATCATTTAAAATTCAAGAGCCAATTCCCTGTCCGCTGGTTATTTCTTATGTCTGTTTTGAACGTTCAAACTCTTCTATTTCTATCAGGTTTAGAATGCTTTGATAAGCTAACTGATAAACATAATGTCTTCTAGTGAAACTTTACTTCGTGATGTCTTGGCATATTGAATATCTTGAACGTTCCCCCCCCCCCCCCCCCCTTCAGAAAGGGATCATCAGTTTCATCAAGGCATGATCTGCTATCCAGCAATTGTCAACATGAAGAATGGCATGCTGATTTGTTTGGTTGTTGTTCACGACCCTACCTTTGTACGTTCATCTCACTTTTGAAGCTGTACTTGTTAGCTTGTGCTCGAGATAGGACTGAAGAGTGTTCTTTCTTTAGTGTAATTGGTT
GCGGTATGTGCATATGTTTTGTTTCTCATTCCAAGTATCTTTTTACATTTTCCAGGTATAAAGACATTTTTCTGTCCTTGTTGGACATTGTCAAAGGTTGCTTCTGTCGCTACCAACAGGCATGTGTGTAAGTAATTACTGAATGTCTATTTTCAACTCGGGCAGTGCTTCATAATGATGGCTAAGATAACATGATTCGATAGCGATGTAATCGGAGATAATCTAATGATTTATAGATTTATGTGGAATCATCTCACAAAATGTAGAATATCTCAAAGAGTGTCCCTCTCGCTTTCATCTTTTCAACACCATATATGGCTCCAGTGATGAAGATCTTGAATAAATTACATGTCTTGTAACGTATCAAAAGCTCTAAAAAAAATCTTGCATTTCCTTTGATTATCAGCTTCAGCAGATGCATGTAACGAGTTGATGGCATATTCTTTGGTGTTCTCATGCTGTTGTTACACTTGCTGTTTCCGACGAAAACTCCGGAATACGTTAAATATCAAGGTAACTTATGTTCCAACTTCTAAATTGAAATTTTCCTTCTTTATTTCAACTGTTTTTAAAGTCCACTCCACCCACCATCCACCCTTGCATGAAAGAATAGGTCTCAATTAACAAGTTGGATTGGTGGATGCAGCAATATGCTTCTTTTGCAAGTTCAGATGTTCTTATCCCATAGAAACGCCAATGAAAAATGATTGTTTCTTGAACTCATGGTCTAGCAGCCCATAGTTGCTAGCCACGAGAATTAAATTACTTGGATGCTTTATGACGCGTCTTGCTTCAATTCCTCTTGCTTTACTCTATATCTATTTGGTGTTTGTGTCTACTTTCTAGGATAGTCATTCAACCTCTACATTTCTATAAGAAAAAAATCTAGGTACAGCTGTACCCAATTAAATGTTGACATATGTCTACATATTTTTTAAAAAGTATTTCTTTTTTTTTTTGGTTAAAAATGGGGAGTATTTGACTTAAGTGCAGCCTAGGCAACAACTAAGTATTCCTCTTTCTATAATGTAGTTTGAAACTTGTTTTTGTTGGAGTAGAAGCAACAATAACTATTTTAGAAGCTTCTTGAAATATCGCATAAGAAATGTTCTAGAAATATCCTAGCATGTCTTCAGGCTCCAAGGAATTCGAATTCCTTTAGATAAGGATAATCTCATAGAAAAATCTAGAATAGTTTAGATCTAAAAAAGCCTAAAAAATCCTCTTGCTTGGGCTCCATGAATCCCACCCCTATAAATAGGAATTGTATCCACATTTGTAAGTGAAGTGAGAAAAGAAAACAAAGCAAGAGAGCTAGATAAACTTAGAGGAAAGAAAATGAATAAGTACTAAGATAGTGAAAAAGTATTCTCTTGAAAGTGTTCGTATTTATACTTTCTTAATAAAAGAAATTTTCTTCCAAGTGTTTGTCTCTATTCCTTTGGGGCCAATTTCTTACCATTTTATTTTGACACATGGAGCCTTTCGTTACCAATCCGAACGGGTCATTTGTATGTCTTTCGCTTCATATTCACCTTTTTTTGTAGGGTGGCCTTATTGATGATTTTCTTTCTCACTTCCTGTGTTGTTGCTGTGCACTCGTCCAAGAATGGCGAGAAGTAGAAATGCGTTGTGGTATGTACTTACATTTCTTCATTTCTATTTTTTGGTCTAAAATTTTATTTTGATCTCCAATCTTTACACAAAAGTTTGTACAAGTTATGCCTGCTAATCTTTTATAGTACATCTACCGAATTTACGTCCTACCACATCGATTTATATGAAAATAAGAGCCTATCTGAAAATATGTTAGATGCTATACAAAATTTTAACAATGGCCATAACAAATTTCTATGAAAATATAACTTAGTGCTGCACCATACATCCTTTGGCATGTGGGTTTCCGTATAATATGGAAAGACTTTCATGTCGGTACAATTACAGGTAGAAAGACCTTCCCATATATTTGATTCAAAATATATCTTGCAGGTACAGAGAAC
ACAAAAACGACCCCTCCACCATTGCAATACATGGAATCCTAAGAGGTAACCAAAAACATATTCCAGTCGCTTTGGAAAGTTAAAAATACTTGATTGTCCTTGGCGCTTCCATGTAAGTTGTAAAATAAACATTGCTTAGGTCTCATTGTTAATGATTAACCCAGTGTAAGGTTGGTATACTCTGTACATAGTGTTATAAATACTCTCCAATAGTGAACATTTCTTTGTGTATAAAACAGAAAAAGCACGCCATAGACTGTATACAGATGTTCATGGTTCAAGGAAGGATTTGAACTCAATACTGTGACTCTGAATTGTGCAAGAAAAATTGGTGTATAAATAGCAAATTACTAAGACAATTGCTTTCCTTTTTTGGTTATCCAAGTCAGGGAAGTTGTTCCGTTGATTCATGGGAATTGAATTAGTAAAAACAAGCATTCCCTTGATTCATCTTGCTCTAGTTCTCTCTCTTTAGACGGCGGTAGAGTTTTGCGCGAAGTAGCTCTACTAGCTCTGCAACTAGGCTGCTCTCTAAGGTATAGTTTCAGGAAACTTGCAATTTCTTCAAATCAAACTGATCCAGATGAATGA

mRNA sequence

ATGTCTTCGTGGGATAGTCTTGGGGACGTTGCCGGTGTGGCCCAGCTAACGGGTATCAATGCAGTTCAACTGATTTCAATGATTGTAAAAGCAGCAAACACAGCAAGGATGCACAAGAAGAACTGCAAGCAATTTGCACAACATCTCAAGTTGATCGGGAACTTATTGGATCAACTCAAGATCTCAGAGCTGAAGAAATATCCTGAGACTCGAGAGCCTCTAGAGCAGCTGGAGGATGCCTTAAGAAAATCATACATTTTGATCAATAGTTGCCAGGATCGTAGCTATCTCTATTTGTTGGCTATGGGATGGAATGTTGTTTATCAATTCAGGAAGGCTCAAAGTGAAATCGATAGATACTTAAGGCTTGTCCCCCTGATTAATTTGGTGGACAATGCTCGAGTCAGAGAGAGGCTTGATGATATTGAAAAGCATCAGTGTGAGTATACATTTGAGGAGGATGACAGAAGGATCCAGGATGTGATCCTCAAACCAGAATCTGTCAAGAACGATGCTTCGATATTGAAAAAAACTCTTTCTCGTTCCTACCCAAACTTGGGCCTCCATGATGCACTTCAAAAGGAAAATGAAAAACTTCAGCTTGAGCTGCAAATATCTCAATCTAATATGGATGTGGGGCAATGTCAAATAATTGAACGATTATTTGATATCACAGAAGCCTTGTCTGCAAATTATTTTATAGAAAAGGATTTACAGAGAGGCATTCCGACACAACATGAATATAAATACCCTGAGGCCAATGGTGAAACTGCTCATGCGTATGATGGAAGTTTTAACAAGAATAGAGATGCTATTACGGCAAGAAAGGGATCATCAGTTTCATCAAGGCATGATCTGCTATCCAGCAATTGTCAACATGAAGAATGGCATGCTGATTTGTTTGGTTGTTGTTCACGACCCTACCTTTGTACGTTCATCTCACTTTTGAAGCTGTACTTGTTAGCTTGTGCTCGAGATAGGACTGAAGAGTGTTCTTTCTTTAGTGTAATTGGTTGCGGTATAAAGACATTTTTCTGTCCTTGTTGGACATTGTCAAAGGTTGCTTCTGTCGCTACCAACAGGCATGTGTCTTCAGCAGATGCATGTAACGAGTTGATGGCATATTCTTTGGTGTTCTCATGCTGTTGTTACACTTGCTGTTTCCGACGAAAACTCCGGAATACGTTAAATATCAAGGGTGGCCTTATTGATGATTTTCTTTCTCACTTCCTGTGTTGTTGCTGTGCACTCGTCCAAGAATGGCGAGAAGTAGAAATGCGTTGTGGTATGTACTTACATTTCTTCATTTCTATTTTTTGGTACAGAGAACACAAAAACGACCCCTCCACCATTGCAATACATGGAATCCTAAGAGACGGCGGTAGAGTTTTGCGCGAAGTAGCTCTACTAGCTCTGCAACTAGGCTGCTCTCTAAGGTATAGTTTCAGGAAACTTGCAATTTCTTCAAATCAAACTGATCCAGATGAATGA

Coding sequence (CDS)

ATGTCTTCGTGGGATAGTCTTGGGGACGTTGCCGGTGTGGCCCAGCTAACGGGTATCAATGCAGTTCAACTGATTTCAATGATTGTAAAAGCAGCAAACACAGCAAGGATGCACAAGAAGAACTGCAAGCAATTTGCACAACATCTCAAGTTGATCGGGAACTTATTGGATCAACTCAAGATCTCAGAGCTGAAGAAATATCCTGAGACTCGAGAGCCTCTAGAGCAGCTGGAGGATGCCTTAAGAAAATCATACATTTTGATCAATAGTTGCCAGGATCGTAGCTATCTCTATTTGTTGGCTATGGGATGGAATGTTGTTTATCAATTCAGGAAGGCTCAAAGTGAAATCGATAGATACTTAAGGCTTGTCCCCCTGATTAATTTGGTGGACAATGCTCGAGTCAGAGAGAGGCTTGATGATATTGAAAAGCATCAGTGTGAGTATACATTTGAGGAGGATGACAGAAGGATCCAGGATGTGATCCTCAAACCAGAATCTGTCAAGAACGATGCTTCGATATTGAAAAAAACTCTTTCTCGTTCCTACCCAAACTTGGGCCTCCATGATGCACTTCAAAAGGAAAATGAAAAACTTCAGCTTGAGCTGCAAATATCTCAATCTAATATGGATGTGGGGCAATGTCAAATAATTGAACGATTATTTGATATCACAGAAGCCTTGTCTGCAAATTATTTTATAGAAAAGGATTTACAGAGAGGCATTCCGACACAACATGAATATAAATACCCTGAGGCCAATGGTGAAACTGCTCATGCGTATGATGGAAGTTTTAACAAGAATAGAGATGCTATTACGGCAAGAAAGGGATCATCAGTTTCATCAAGGCATGATCTGCTATCCAGCAATTGTCAACATGAAGAATGGCATGCTGATTTGTTTGGTTGTTGTTCACGACCCTACCTTTGTACGTTCATCTCACTTTTGAAGCTGTACTTGTTAGCTTGTGCTCGAGATAGGACTGAAGAGTGTTCTTTCTTTAGTGTAATTGGTTGCGGTATAAAGACATTTTTCTGTCCTTGTTGGACATTGTCAAAGGTTGCTTCTGTCGCTACCAACAGGCATGTGTCTTCAGCAGATGCATGTAACGAGTTGATGGCATATTCTTTGGTGTTCTCATGCTGTTGTTACACTTGCTGTTTCCGACGAAAACTCCGGAATACGTTAAATATCAAGGGTGGCCTTATTGATGATTTTCTTTCTCACTTCCTGTGTTGTTGCTGTGCACTCGTCCAAGAATGGCGAGAAGTAGAAATGCGTTGTGGTATGTACTTACATTTCTTCATTTCTATTTTTTGGTACAGAGAACACAAAAACGACCCCTCCACCATTGCAATACATGGAATCCTAAGAGACGGCGGTAGAGTTTTGCGCGAAGTAGCTCTACTAGCTCTGCAACTAGGCTGCTCTCTAAGGTATAGTTTCAGGAAACTTGCAATTTCTTCAAATCAAACTGATCCAGATGAATGA

Protein sequence

MSSWDSLGDVAGVAQLTGINAVQLISMIVKAANTARMHKKNCKQFAQHLKLIGNLLDQLKISELKKYPETREPLEQLEDALRKSYILINSCQDRSYLYLLAMGWNVVYQFRKAQSEIDRYLRLVPLINLVDNARVRERLDDIEKHQCEYTFEEDDRRIQDVILKPESVKNDASILKKTLSRSYPNLGLHDALQKENEKLQLELQISQSNMDVGQCQIIERLFDITEALSANYFIEKDLQRGIPTQHEYKYPEANGETAHAYDGSFNKNRDAITARKGSSVSSRHDLLSSNCQHEEWHADLFGCCSRPYLCTFISLLKLYLLACARDRTEECSFFSVIGCGIKTFFCPCWTLSKVASVATNRHVSSADACNELMAYSLVFSCCCYTCCFRRKLRNTLNIKGGLIDDFLSHFLCCCCALVQEWREVEMRCGMYLHFFISIFWYREHKNDPSTIAIHGILRDGGRVLREVALLALQLGCSLRYSFRKLAISSNQTDPDE
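The protein sequence above is the conceptual translation of the coding sequence. As a rough illustration, the sketch below translates a CDS with the standard genetic code (NCBI translation table 1), which is assumed here because the page does not name the table it used. The helper name `translate` is hypothetical, and only the first and last few codons of the CDS are shown.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons containing characters other than A/C/G/T become 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons and last five codons of the CDS listed above.
print(translate("ATGTCTTCGTGGGATAGTCTTGGGGAC"))
print(translate("GATCCAGATGAATGA"))
```

The two fragments correspond to the start (MSSWDSLGD) and end (DPDE) of the protein sequence shown on this page.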
Homology
BLAST of Clc01G20790 vs. NCBI nr
Match: KAA0025651.1 (protein MID1-COMPLEMENTING ACTIVITY 1 isoform X1 [Cucumis melo var. makuwa] >TYK12524.1 protein MID1-COMPLEMENTING ACTIVITY 1 isoform X1 [Cucumis melo var. makuwa])

HSP 1 Score: 769.6 bits (1986), Expect = 1.6e-218
Identity = 384/429 (89.51%), Positives = 391/429 (91.14%), Query Frame = 0

Query: 1   MSSWDSLGDVAGVAQLTGINAVQLISMIVKAANTARMHKKNCKQFAQHLKLIGNLLDQLK 60
           MSSWDSLGDVA VAQLTGINAVQLISMIVKAANTARMHKKNCKQFAQHLKLIGNLLDQLK
Sbjct: 1   MSSWDSLGDVASVAQLTGINAVQLISMIVKAANTARMHKKNCKQFAQHLKLIGNLLDQLK 60

Query: 61  ISELKKYPETREPLEQLEDALRKSYILINSCQDRSYLYLLAMGWNVVYQFRKAQSEIDRY 120
           ISELKKYPETREPLEQLEDALRKSYILINSCQDRSYLYLLAMGWNVVYQFRKAQSEIDRY
Sbjct: 61  ISELKKYPETREPLEQLEDALRKSYILINSCQDRSYLYLLAMGWNVVYQFRKAQSEIDRY 120

Query: 121 LRLVPLINLVDNARVRERLDDIEKHQCEYTFEEDDRRIQDVILKPESVKNDASILKKTLS 180
           LRLVPLINLVDNARVRERLDDIEKHQCEYTFEEDDRRIQDVILKPES+KNDASILKKTLS
Sbjct: 121 LRLVPLINLVDNARVRERLDDIEKHQCEYTFEEDDRRIQDVILKPESIKNDASILKKTLS 180

Query: 181 RSYPNLGLHDALQKENEKLQLELQISQSNMDVGQCQIIERLFDITEALSANYFIEKDLQR 240
           RSYPNLGLHDALQKENEKLQLELQISQSNMDVGQCQIIERLFDITEALSANYFIEKDLQR
Sbjct: 181 RSYPNLGLHDALQKENEKLQLELQISQSNMDVGQCQIIERLFDITEALSANYFIEKDLQR 240

Query: 241 GIPTQHEYKYPEANGETAHAYDGSFNKNRDAITARKGSSVSSRHDLLSSNCQHEEWHADL 300
           GIPTQHEY Y +ANGET HAYDG+F+KNRD I  RKGSSVSSRHDLLSSNCQHEEWHADL
Sbjct: 241 GIPTQHEYNYSDANGETTHAYDGNFHKNRDGIMTRKGSSVSSRHDLLSSNCQHEEWHADL 300

Query: 301 FGCCSRPYLCTFISLLKLYLLACARDRTEECSFFSVIGCGIKTFFCPCWTLSKVASVATN 360
           FGCCS+PYLC                              +KTFFCPCWTLSKVASVATN
Sbjct: 301 FGCCSQPYLC------------------------------MKTFFCPCWTLSKVASVATN 360

Query: 361 RHVSSADACNELMAYSLVFSCCCYTCCFRRKLRNTLNIKGGLIDDFLSHFLCCCCALVQE 420
           RHVSSADACNELMAYSLVFSCCCYTCCFRRKLR+ LNIKGGLIDDFLSHFLCCCCALVQE
Sbjct: 361 RHVSSADACNELMAYSLVFSCCCYTCCFRRKLRSMLNIKGGLIDDFLSHFLCCCCALVQE 399

Query: 421 WREVEMRCG 430
           WREVEMRCG
Sbjct: 421 WREVEMRCG 399
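The Identity and positive-match percentages in the HSP header are simple ratios over the alignment length. The sketch below reproduces the two figures reported for this hit and counts identical residues in a pair of aligned fragments; the function names `percent` and `count_identities` are hypothetical, and gap characters are assumed to be written as `-`.

```python
def percent(matches, aligned_length):
    """Percentage of aligned columns that match, rounded to two decimals."""
    return round(100.0 * matches / aligned_length, 2)


def count_identities(query, subject):
    """Count columns where two equal-length aligned strings carry the same residue.

    Columns containing a gap ('-') never count as identical.
    """
    if len(query) != len(subject):
        raise ValueError("aligned strings must have equal length")
    return sum(1 for q, s in zip(query, subject) if q == s and q != "-")


# Figures taken from the HSP header above: 384 identities and 391 positives over 429 columns.
print(percent(384, 429), percent(391, 429))

# A short fragment from the alignment: query V is aligned with subject I.
print(count_identities("KPESVKND", "KPESIKND"))
```

The V/I column in the fragment is not an identity, but BLAST marks it with `+` on the midline, which is why the positive-match count exceeds the identity count.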

BLAST of Clc01G20790 vs. NCBI nr
Match: XP_008440856.1 (PREDICTED: protein MID1-COMPLEMENTING ACTIVITY 1 isoform X1 [Cucumis melo] >XP_016899365.1 PREDICTED: protein MID1-COMPLEMENTING ACTIVITY 1 isoform X1 [Cucumis melo])

HSP 1 Score: 767.3 bits (1980), Expect = 8.0e-218
Identity = 383/429 (89.28%), Positives = 390/429 (90.91%), Query Frame = 0

Query: 1   MSSWDSLGDVAGVAQLTGINAVQLISMIVKAANTARMHKKNCKQFAQHLKLIGNLLDQLK 60
           MSSWDSLGDVA VAQLTGINAVQLISMIVKAANTARMHKKNCKQFAQHLKLIGNLLDQLK
Sbjct: 1   MSSWDSLGDVASVAQLTGINAVQLISMIVKAANTARMHKKNCKQFAQHLKLIGNLLDQLK 60

Query: 61  ISELKKYPETREPLEQLEDALRKSYILINSCQDRSYLYLLAMGWNVVYQFRKAQSEIDRY 120
           ISEL KYPETREPLEQLEDALRKSYILINSCQDRSYLYLLAMGWNVVYQFRKAQSEIDRY
Sbjct: 61  ISELTKYPETREPLEQLEDALRKSYILINSCQDRSYLYLLAMGWNVVYQFRKAQSEIDRY 120

Query: 121 LRLVPLINLVDNARVRERLDDIEKHQCEYTFEEDDRRIQDVILKPESVKNDASILKKTLS 180
           LRLVPLINLVDNARVRERLDDIEKHQCEYTFEEDDRRIQDVILKPES+KNDASILKKTLS
Sbjct: 121 LRLVPLINLVDNARVRERLDDIEKHQCEYTFEEDDRRIQDVILKPESIKNDASILKKTLS 180

Query: 181 RSYPNLGLHDALQKENEKLQLELQISQSNMDVGQCQIIERLFDITEALSANYFIEKDLQR 240
           RSYPNLGLHDALQKENEKLQLELQISQSNMDVGQCQIIERLFDITEALSANYFIEKDLQR
Sbjct: 181 RSYPNLGLHDALQKENEKLQLELQISQSNMDVGQCQIIERLFDITEALSANYFIEKDLQR 240

Query: 241 GIPTQHEYKYPEANGETAHAYDGSFNKNRDAITARKGSSVSSRHDLLSSNCQHEEWHADL 300
           GIPTQHEY Y +ANGET HAYDG+F+KNRD I  RKGSSVSSRHDLLSSNCQHEEWHADL
Sbjct: 241 GIPTQHEYNYSDANGETTHAYDGNFHKNRDGIMTRKGSSVSSRHDLLSSNCQHEEWHADL 300

Query: 301 FGCCSRPYLCTFISLLKLYLLACARDRTEECSFFSVIGCGIKTFFCPCWTLSKVASVATN 360
           FGCCS+PYLC                              +KTFFCPCWTLSKVASVATN
Sbjct: 301 FGCCSQPYLC------------------------------MKTFFCPCWTLSKVASVATN 360

Query: 361 RHVSSADACNELMAYSLVFSCCCYTCCFRRKLRNTLNIKGGLIDDFLSHFLCCCCALVQE 420
           RHVSSADACNELMAYSLVFSCCCYTCCFRRKLR+ LNIKGGLIDDFLSHFLCCCCALVQE
Sbjct: 361 RHVSSADACNELMAYSLVFSCCCYTCCFRRKLRSMLNIKGGLIDDFLSHFLCCCCALVQE 399

Query: 421 WREVEMRCG 430
           WREVEMRCG
Sbjct: 421 WREVEMRCG 399

BLAST of Clc01G20790 vs. NCBI nr
Match: XP_038882775.1 (protein MID1-COMPLEMENTING ACTIVITY 1 [Benincasa hispida])

HSP 1 Score: 762.7 bits (1968), Expect = 2.0e-216
Identity = 380/429 (88.58%), Positives = 389/429 (90.68%), Query Frame = 0

Query: 1   MSSWDSLGDVAGVAQLTGINAVQLISMIVKAANTARMHKKNCKQFAQHLKLIGNLLDQLK 60
           MSSWDSLGDVAGVAQLTGINAVQLISMIVKAANTARMHKKNCKQFAQHLKLIGNLLDQLK
Sbjct: 1   MSSWDSLGDVAGVAQLTGINAVQLISMIVKAANTARMHKKNCKQFAQHLKLIGNLLDQLK 60

Query: 61  ISELKKYPETREPLEQLEDALRKSYILINSCQDRSYLYLLAMGWNVVYQFRKAQSEIDRY 120
           ISELKKYPETREPLEQLEDALRKSYILINSCQDRSYLYLLAMGWNVVYQFRKAQSEIDRY
Sbjct: 61  ISELKKYPETREPLEQLEDALRKSYILINSCQDRSYLYLLAMGWNVVYQFRKAQSEIDRY 120

Query: 121 LRLVPLINLVDNARVRERLDDIEKHQCEYTFEEDDRRIQDVILKPESVKNDASILKKTLS 180
           LRLVPLINLVDNARVRERLDDIEKHQCEYTFEEDDR+IQDVILKPE +K DASILKKTLS
Sbjct: 121 LRLVPLINLVDNARVRERLDDIEKHQCEYTFEEDDRKIQDVILKPECIKYDASILKKTLS 180

Query: 181 RSYPNLGLHDALQKENEKLQLELQISQSNMDVGQCQIIERLFDITEALSANYFIEKDLQR 240
           RSYPNLGLHDALQKENEKLQLELQISQSNMDVGQCQIIERLFDITEALSANYF+EKD QR
Sbjct: 181 RSYPNLGLHDALQKENEKLQLELQISQSNMDVGQCQIIERLFDITEALSANYFMEKDFQR 240

Query: 241 GIPTQHEYKYPEANGETAHAYDGSFNKNRDAITARKGSSVSSRHDLLSSNCQHEEWHADL 300
           GIPTQHEY Y +ANG T HAYDGSFNKNRDAI ARKGSS+SSR DLL+SNCQHEEWHADL
Sbjct: 241 GIPTQHEYNYSDANGGTTHAYDGSFNKNRDAIMARKGSSISSRRDLLASNCQHEEWHADL 300

Query: 301 FGCCSRPYLCTFISLLKLYLLACARDRTEECSFFSVIGCGIKTFFCPCWTLSKVASVATN 360
           FGCCS+PYLC                              IKTFFCPCWTLSKVASVAT+
Sbjct: 301 FGCCSQPYLC------------------------------IKTFFCPCWTLSKVASVATD 360

Query: 361 RHVSSADACNELMAYSLVFSCCCYTCCFRRKLRNTLNIKGGLIDDFLSHFLCCCCALVQE 420
           RHVSSADACNELMAYSLVFSCCCYTCCFRRKLRN LNIKGGL+DDFLSHFLCCCCALVQE
Sbjct: 361 RHVSSADACNELMAYSLVFSCCCYTCCFRRKLRNMLNIKGGLVDDFLSHFLCCCCALVQE 399

Query: 421 WREVEMRCG 430
           WREVEMRCG
Sbjct: 421 WREVEMRCG 399

BLAST of Clc01G20790 vs. NCBI nr
Match: XP_004135021.1 (protein MID1-COMPLEMENTING ACTIVITY 1 [Cucumis sativus])

HSP 1 Score: 758.1 bits (1956), Expect = 4.9e-215
Identity = 380/430 (88.37%), Positives = 389/430 (90.47%), Query Frame = 0

Query: 1   MSSWDSLGDVAGVAQLTGINAVQLISMIVKAANTARMHKKNCKQFAQHLKLIGNLLDQLK 60
           MSSWDSLGDVAGVAQLTGINAVQLISMIVKAANTARMHKKNCKQFAQHLKLIGNLLDQLK
Sbjct: 1   MSSWDSLGDVAGVAQLTGINAVQLISMIVKAANTARMHKKNCKQFAQHLKLIGNLLDQLK 60

Query: 61  ISELKKYPETREPLEQLEDALRKSYILINSCQDRSYLYLLAMGWNVVYQFRKAQSEIDRY 120
           ISE+KKYPETREPLEQLEDALRKSYILINSCQDRSYLYLLAMGWNVVYQFRKAQSEIDRY
Sbjct: 61  ISEMKKYPETREPLEQLEDALRKSYILINSCQDRSYLYLLAMGWNVVYQFRKAQSEIDRY 120

Query: 121 LRLVPLINLVDNARVRERLDDIEKHQCEYTFEEDDRRIQDVILKPESVKNDAS-ILKKTL 180
           LRLVPLINLVDNARVRERLDDIEKHQCEYTFEEDDRRIQDVILKPES+KNDAS ILKKTL
Sbjct: 121 LRLVPLINLVDNARVRERLDDIEKHQCEYTFEEDDRRIQDVILKPESIKNDASTILKKTL 180

Query: 181 SRSYPNLGLHDALQKENEKLQLELQISQSNMDVGQCQIIERLFDITEALSANYFIEKDLQ 240
           SRSYP LGLHDALQKENEKLQLELQISQSNMDVGQCQIIERLFDITEALSANYFIEKDLQ
Sbjct: 181 SRSYPKLGLHDALQKENEKLQLELQISQSNMDVGQCQIIERLFDITEALSANYFIEKDLQ 240

Query: 241 RGIPTQHEYKYPEANGETAHAYDGSFNKNRDAITARKGSSVSSRHDLLSSNCQHEEWHAD 300
           RGIPTQH+Y Y + NGET HAY G+F+KNRD I  RKGSSVSSRHDLLSSNCQHEEWHAD
Sbjct: 241 RGIPTQHDYNYSDTNGETTHAYVGNFHKNRDGIMTRKGSSVSSRHDLLSSNCQHEEWHAD 300

Query: 301 LFGCCSRPYLCTFISLLKLYLLACARDRTEECSFFSVIGCGIKTFFCPCWTLSKVASVAT 360
           LFGCCS+PYLC                              +KTFFCPCWTLSKVASVAT
Sbjct: 301 LFGCCSQPYLC------------------------------MKTFFCPCWTLSKVASVAT 360

Query: 361 NRHVSSADACNELMAYSLVFSCCCYTCCFRRKLRNTLNIKGGLIDDFLSHFLCCCCALVQ 420
           NRHVSSADACNELMAYSLVFSCCCYTCCFRRKLR+ LNIKGGLIDDFLSHFLCCCCALVQ
Sbjct: 361 NRHVSSADACNELMAYSLVFSCCCYTCCFRRKLRSKLNIKGGLIDDFLSHFLCCCCALVQ 400

Query: 421 EWREVEMRCG 430
           EWREVEMRCG
Sbjct: 421 EWREVEMRCG 400

BLAST of Clc01G20790 vs. NCBI nr
Match: KAG6603896.1 (Cell number regulator 13, partial [Cucurbita argyrosperma subsp. sororia] >KAG7034076.1 Cell number regulator 13, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 743.4 bits (1918), Expect = 1.2e-210
Identity = 367/429 (85.55%), Positives = 381/429 (88.81%), Query Frame = 0

Query: 1   MSSWDSLGDVAGVAQLTGINAVQLISMIVKAANTARMHKKNCKQFAQHLKLIGNLLDQLK 60
           MSSWDSLGDVAGVAQL G NAVQLISMIV+AANTARMHKKNCKQFAQH+KLIGNLLDQLK
Sbjct: 1   MSSWDSLGDVAGVAQLVGFNAVQLISMIVRAANTARMHKKNCKQFAQHIKLIGNLLDQLK 60

Query: 61  ISELKKYPETREPLEQLEDALRKSYILINSCQDRSYLYLLAMGWNVVYQFRKAQSEIDRY 120
           ISELKKYPETREPLEQLEDALRKSYILINSCQDRSYLYL AMGWNVVYQFRKAQSEIDRY
Sbjct: 61  ISELKKYPETREPLEQLEDALRKSYILINSCQDRSYLYLFAMGWNVVYQFRKAQSEIDRY 120

Query: 121 LRLVPLINLVDNARVRERLDDIEKHQCEYTFEEDDRRIQDVILKPESVKNDASILKKTLS 180
           LRLVPLINLVDNARVRERLDDIEKHQCEYTFEEDDRRIQD ILKPES+KNDASILKKTLS
Sbjct: 121 LRLVPLINLVDNARVRERLDDIEKHQCEYTFEEDDRRIQDAILKPESIKNDASILKKTLS 180

Query: 181 RSYPNLGLHDALQKENEKLQLELQISQSNMDVGQCQIIERLFDITEALSANYFIEKDLQR 240
           RSYPNLGLHDALQKENEKLQLELQ+SQSNMDVGQCQIIERLFDITEALSANYFIEKDLQ+
Sbjct: 181 RSYPNLGLHDALQKENEKLQLELQVSQSNMDVGQCQIIERLFDITEALSANYFIEKDLQK 240

Query: 241 GIPTQHEYKYPEANGETAHAYDGSFNKNRDAITARKGSSVSSRHDLLSSNCQHEEWHADL 300
           GIP QH Y Y +  GETAHAY G+F+KNRDA T RKGSSVSSRHD LSSNCQHEEWHADL
Sbjct: 241 GIPIQHGYSYSDVTGETAHAYGGNFHKNRDASTTRKGSSVSSRHDPLSSNCQHEEWHADL 300

Query: 301 FGCCSRPYLCTFISLLKLYLLACARDRTEECSFFSVIGCGIKTFFCPCWTLSKVASVATN 360
            GCCS+PYLC                              IKTFFCPCWTLSKVASVATN
Sbjct: 301 LGCCSQPYLC------------------------------IKTFFCPCWTLSKVASVATN 360

Query: 361 RHVSSADACNELMAYSLVFSCCCYTCCFRRKLRNTLNIKGGLIDDFLSHFLCCCCALVQE 420
           +HVS ADACNELMAY+LVFSCCCYTCCFRRKLRN LNIKGG++DDFLSH LCCCCALVQE
Sbjct: 361 KHVSPADACNELMAYALVFSCCCYTCCFRRKLRNMLNIKGGIVDDFLSHLLCCCCALVQE 399

Query: 421 WREVEMRCG 430
           WRE+EMRCG
Sbjct: 421 WREIEMRCG 399

BLAST of Clc01G20790 vs. ExPASy Swiss-Prot
Match: Q8L7E9 (Protein MID1-COMPLEMENTING ACTIVITY 1 OS=Arabidopsis thaliana OX=3702 GN=MCA1 PE=1 SV=1)

HSP 1 Score: 503.8 bits (1296), Expect = 2.2e-141
Identity = 259/431 (60.09%), Positives = 318/431 (73.78%), Query Frame = 0

Query: 3   SWDSLGDVAGVAQLTGINAVQLISMIVKAANTARMHKKNCKQFAQHLKLIGNLLDQLKIS 62
           SWD LG++A VAQLTG++AV+LI +IVKAANTA MHKKNC+QFAQHLKLIGNLL+QLKIS
Sbjct: 4   SWDGLGEIASVAQLTGLDAVKLIGLIVKAANTAWMHKKNCRQFAQHLKLIGNLLEQLKIS 63

Query: 63  ELKKYPETREPLEQLEDALRKSYILINSCQDRSYLYLLAMGWNVVYQFRKAQSEIDRYLR 122
           E+KKYPETREPLE LEDALR+SY+L+NSC+DRSYLYLLAMGWN+VYQFRK Q EIDR+L+
Sbjct: 64  EMKKYPETREPLEGLEDALRRSYLLVNSCRDRSYLYLLAMGWNIVYQFRKHQDEIDRFLK 123

Query: 123 LVPLINLVDNARVRERLDDIEKHQCEYTFEEDDRRIQDVILKPESVKNDASILKKTLSRS 182
           ++PLI LVDNAR+RER + I++ Q EYT +E+DR +QDVILK ES +  AS+LKKTLS S
Sbjct: 124 IIPLITLVDNARIRERFEYIDRDQREYTLDEEDRHVQDVILKQESTREAASVLKKTLSCS 183

Query: 183 YPNLGLHDALQKENEKLQLELQISQSNMDVGQCQIIERLFDITEALSANYFIEKDLQRGI 242
           YPNL   +AL+ ENEKLQ+ELQ SQ + DV QC++I+RL  +T+A +A   +E D ++ +
Sbjct: 184 YPNLRFCEALKTENEKLQIELQRSQEHYDVAQCEVIQRLIGVTQAAAA---VEPDSEKEL 243

Query: 243 PTQHEYKYPEANG-ETAHAYD-GSFNKNRDAITARKGSSVSSRHDLLSSNC----QHEEW 302
             +   K   ++  +T ++YD  S  K+     +R  S+VSS HDLLS        HEEW
Sbjct: 244 TKKASKKSERSSSMKTEYSYDEDSPKKSSTRAASRSTSNVSSGHDLLSRRASQAQHHEEW 303

Query: 303 HADLFGCCSRPYLCTFISLLKLYLLACARDRTEECSFFSVIGCGIKTFFCPCWTLSKVAS 362
           H DL  CCS P LC                               KTFF PC TL+K+A+
Sbjct: 304 HTDLLACCSEPSLC------------------------------FKTFFFPCGTLAKIAT 363

Query: 363 VATNRHVSSADACNELMAYSLVFSCCCYTCCFRRKLRNTLNIKGGLIDDFLSHFLCCCCA 422
            A+NRH+SSA+ACNELMAYSL+ SCCCYTCC RRKLR TLNI GG IDDFLSH +CCCCA
Sbjct: 364 AASNRHISSAEACNELMAYSLILSCCCYTCCVRRKLRKTLNITGGFIDDFLSHVMCCCCA 401

Query: 423 LVQEWREVEMR 428
           LVQE REVE+R
Sbjct: 424 LVQELREVEIR 401

BLAST of Clc01G20790 vs. ExPASy Swiss-Prot
Match: B6SJQ0 (Cell number regulator 13 OS=Zea mays OX=4577 GN=CNR13 PE=2 SV=1)

HSP 1 Score: 498.0 bits (1281), Expect = 1.2e-139
Identity = 249/440 (56.59%), Positives = 322/440 (73.18%), Query Frame = 0

Query: 1   MSSWDSLGDVAGVAQLTGINAVQLISMIVKAANTARMHKKNCKQFAQHLKLIGNLLDQLK 60
           M+SWD+LG+++ +AQLTG++AV+LIS+IV+AA+TAR+HK+NC++FAQHLKLIG LL+QL+
Sbjct: 1   MASWDNLGELSNIAQLTGLDAVKLISLIVRAASTARLHKRNCRRFAQHLKLIGGLLEQLR 60

Query: 61  ISELKKYPETREPLEQLEDALRKSYILINSCQDRSYLYLLAMGWNVVYQFRKAQSEIDRY 120
           +SEL+KYPETREPLEQLEDALR+ Y+L+NSCQDRSYLYLLAMGWN+VYQFRKAQSEID Y
Sbjct: 61  VSELRKYPETREPLEQLEDALRRGYLLVNSCQDRSYLYLLAMGWNIVYQFRKAQSEIDNY 120

Query: 121 LRLVPLINLVDNARVRERLDDIEKHQCEYTFEEDDRRIQDVILKPESVKNDASILKKTLS 180
           LRLVPLI LVDNAR+R+RL+ IE+ QCEY+F+E+D+++QD +L P+   N   +LKKTLS
Sbjct: 121 LRLVPLITLVDNARIRDRLEYIERDQCEYSFDEEDKKVQDALLNPDPCTNPTIVLKKTLS 180

Query: 181 RSYPNLGLHDALQKENEKLQLELQISQSNMDVGQCQIIERLFDITEALSANYFIEKDLQR 240
            SYPNL  ++AL+KE+EKLQ+ELQ SQSNMD+G C++I+ L  +T+ + +    EK+   
Sbjct: 181 CSYPNLPFNEALRKESEKLQVELQRSQSNMDLGSCEVIQHLLGVTKTVEST-IPEKETNV 240

Query: 241 GIPTQHEYKYPEANGETAHAYD----------GSFNKNR--DAITARKGSSVSSRHDLLS 300
             P +    Y E+ GETA ++D          G + K +     T R  S V   HDL+S
Sbjct: 241 KAPEKKGSNYSESKGETAKSFDDDDDYPKKQNGDYPKKQKDTCSTQRCSSQVPYGHDLVS 300

Query: 301 SNCQH-EEWHADLFGCCSRPYLCTFISLLKLYLLACARDRTEECSFFSVIGCGIKTFFCP 360
           S   + +EWHADL GCCS+P LC                              +KT F P
Sbjct: 301 SRGSYSDEWHADLLGCCSKPALC------------------------------LKTLFFP 360

Query: 361 CWTLSKVASVATNRHVSSADACNELMAYSLVFSCCCYTCCFRRKLRNTLNIKGGLIDDFL 420
           C T S++AS+A +R +SS +ACN++MAYSL+ SCCCYTCC RRKLR  L+I GG  DDFL
Sbjct: 361 CGTFSRIASIAKDRPMSSGEACNDIMAYSLILSCCCYTCCVRRKLRQKLDIAGGCCDDFL 409

Query: 421 SHFLCCCCALVQEWREVEMR 428
           SH LCCCCALVQEWREVE+R
Sbjct: 421 SHLLCCCCALVQEWREVEIR 409

BLAST of Clc01G20790 vs. ExPASy Swiss-Prot
Match: Q3EBY6 (Protein MID1-COMPLEMENTING ACTIVITY 2 OS=Arabidopsis thaliana OX=3702 GN=MCA2 PE=2 SV=1)

HSP 1 Score: 443.0 bits (1138), Expect = 4.6e-123
Identity = 233/433 (53.81%), Positives = 301/433 (69.52%), Query Frame = 0

Query: 2   SSWDSLGDVAGVAQLTGINAVQLISMIVKAANTARMHKKNCKQFAQHLKLIGNLLDQLKI 61
           +SWD LG++A VAQLTGI+A++LI MIV AANTARMHKKNC+QFA HLKLI NLL+Q+K 
Sbjct: 3   NSWDQLGEIASVAQLTGIDALKLIGMIVNAANTARMHKKNCRQFAHHLKLIRNLLEQIKN 62

Query: 62  SELKKYPETREPLEQLEDALRKSYILINSCQDRSYLYLLAMGWNVVYQFRKAQSEIDRYL 121
           SE+ +  E  EPL+ L+DALR+SYIL+ SCQ++SYLYLLAMGWN+V QF KAQ+EID +L
Sbjct: 63  SEMNQRSEILEPLQGLDDALRRSYILVKSCQEKSYLYLLAMGWNIVNQFEKAQNEIDLFL 122

Query: 122 RLVPLINLVDNARVRERLDDIEKHQCEYTFEEDDRRIQDVILKPESVKNDA-SILKKTLS 181
           ++VPLIN+ DNAR+RERL+ IE+ Q EYT +E+DR++QDVILK ES +  A S+LKKTLS
Sbjct: 123 KIVPLINMADNARIRERLEAIERDQREYTLDEEDRKVQDVILKQESTREAATSVLKKTLS 182

Query: 182 RSYPNLGLHDALQKENEKLQLELQISQSNMDVGQCQIIERLFDITEALSANYFIEKDLQR 241
           RSYPN+G  +AL+ E EKLQLELQ S++  D  QC++I+RL D+T+  +    +E +L++
Sbjct: 183 RSYPNMGFCEALKTEEEKLQLELQRSRARYDADQCEVIQRLIDVTQTAAT---VEPNLEK 242

Query: 242 GIPTQHEYKYPEANGETAHAYDGSFNKNRDAITARKGSSVSSRHDLLSS-NCQHE-EWHA 301
            +  + E    +   +    YD   +  R    +R  S VSS H+LLS  + QH   WHA
Sbjct: 243 VLTKKEELTSSKKRDD---LYDTDSSSIR--ADSRSTSYVSSGHELLSGRSLQHRGNWHA 302

Query: 302 DLFGCCSRPYLCTFISLLKLYLLACARDRTEECSFFSVIGCGIKTFFCPCWTLSKVASVA 361
           DL  CCS P LC                              +KT F PC TL+K+++VA
Sbjct: 303 DLLDCCSEPCLC------------------------------LKTLFFPCGTLAKISTVA 362

Query: 362 TNRHVSSADACNELMAYSLVFSCCCYTCCFRRKLRNTLNIKGGLIDDFLSHFLCCCCALV 421
           T+R +SS + C  L+ YSL+ SCCCYTCC R+KLR TLNI GG IDDFLSH +CCCCALV
Sbjct: 363 TSRQISSTEVCKNLIVYSLILSCCCYTCCIRKKLRKTLNITGGCIDDFLSHLMCCCCALV 397

Query: 422 QEWREVEMRCGMY 432
           QE REVE+    Y
Sbjct: 423 QELREVEIHRASY 397

BLAST of Clc01G20790 vs. ExPASy Swiss-Prot
Match: Q9LQU4 (Protein PLANT CADMIUM RESISTANCE 2 OS=Arabidopsis thaliana OX=3702 GN=PCR2 PE=1 SV=1)

HSP 1 Score: 63.9 bits (154), Expect = 5.8e-09
Identity = 36/98 (36.73%), Positives = 49/98 (50.00%), Query Frame = 0

Query: 331 CSFFSVIGCGIKTFFCPCWTLSKVASVATNRHVSSADACNELMAYSLVFSCCC-YTCCFR 390
           C  FS       TF+CPC T  +VA +      S   A       ++V  C C Y+C +R
Sbjct: 21  CDCFSDCKNCCITFWCPCITFGQVAEIVDRGSTSCGTAGALYALIAVVTGCACIYSCFYR 80

Query: 391 RKLRNTLNIKGGLIDDFLSHFLCCCCALVQEWREVEMR 428
            K+R   NIKG    D L HF C  C+L Q++RE++ R
Sbjct: 81  GKMRAQYNIKGDDCTDCLKHFCCELCSLTQQYRELKHR 118

BLAST of Clc01G20790 vs. ExPASy Swiss-Prot
Match: P0CW97 (Protein PLANT CADMIUM RESISTANCE 3 OS=Arabidopsis thaliana OX=3702 GN=PCR3 PE=3 SV=1)

HSP 1 Score: 63.2 bits (152), Expect = 9.8e-09
Identity = 35/98 (35.71%), Positives = 49/98 (50.00%), Query Frame = 0

Query: 331 CSFFSVIGCGIKTFFCPCWTLSKVASVATNRHVSSADACNELMAYSLVFSC-CCYTCCFR 390
           C  FS       T+ CPC T  +VA +    + S   A    +  + +  C C Y+C +R
Sbjct: 21  CDCFSDCQNCCITWLCPCITFGQVADIVDRGNTSCGTAGALYVLLAAITGCGCLYSCIYR 80

Query: 391 RKLRNTLNIKGGLIDDFLSHFLCCCCALVQEWREVEMR 428
            K+R   NI+G    D L HF C  CAL QE+RE++ R
Sbjct: 81  GKIRAQYNIRGDGCTDCLKHFCCELCALTQEYRELKHR 118

BLAST of Clc01G20790 vs. ExPASy TrEMBL
Match: A0A5A7SN29 (Protein MID1-COMPLEMENTING ACTIVITY 1 isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G001050 PE=4 SV=1)

HSP 1 Score: 769.6 bits (1986), Expect = 7.9e-219
Identity = 384/429 (89.51%), Positives = 391/429 (91.14%), Query Frame = 0

Query: 1   MSSWDSLGDVAGVAQLTGINAVQLISMIVKAANTARMHKKNCKQFAQHLKLIGNLLDQLK 60
           MSSWDSLGDVA VAQLTGINAVQLISMIVKAANTARMHKKNCKQFAQHLKLIGNLLDQLK
Sbjct: 1   MSSWDSLGDVASVAQLTGINAVQLISMIVKAANTARMHKKNCKQFAQHLKLIGNLLDQLK 60

Query: 61  ISELKKYPETREPLEQLEDALRKSYILINSCQDRSYLYLLAMGWNVVYQFRKAQSEIDRY 120
           ISELKKYPETREPLEQLEDALRKSYILINSCQDRSYLYLLAMGWNVVYQFRKAQSEIDRY
Sbjct: 61  ISELKKYPETREPLEQLEDALRKSYILINSCQDRSYLYLLAMGWNVVYQFRKAQSEIDRY 120

Query: 121 LRLVPLINLVDNARVRERLDDIEKHQCEYTFEEDDRRIQDVILKPESVKNDASILKKTLS 180
           LRLVPLINLVDNARVRERLDDIEKHQCEYTFEEDDRRIQDVILKPES+KNDASILKKTLS
Sbjct: 121 LRLVPLINLVDNARVRERLDDIEKHQCEYTFEEDDRRIQDVILKPESIKNDASILKKTLS 180

Query: 181 RSYPNLGLHDALQKENEKLQLELQISQSNMDVGQCQIIERLFDITEALSANYFIEKDLQR 240
           RSYPNLGLHDALQKENEKLQLELQISQSNMDVGQCQIIERLFDITEALSANYFIEKDLQR
Sbjct: 181 RSYPNLGLHDALQKENEKLQLELQISQSNMDVGQCQIIERLFDITEALSANYFIEKDLQR 240

Query: 241 GIPTQHEYKYPEANGETAHAYDGSFNKNRDAITARKGSSVSSRHDLLSSNCQHEEWHADL 300
           GIPTQHEY Y +ANGET HAYDG+F+KNRD I  RKGSSVSSRHDLLSSNCQHEEWHADL
Sbjct: 241 GIPTQHEYNYSDANGETTHAYDGNFHKNRDGIMTRKGSSVSSRHDLLSSNCQHEEWHADL 300

Query: 301 FGCCSRPYLCTFISLLKLYLLACARDRTEECSFFSVIGCGIKTFFCPCWTLSKVASVATN 360
           FGCCS+PYLC                              +KTFFCPCWTLSKVASVATN
Sbjct: 301 FGCCSQPYLC------------------------------MKTFFCPCWTLSKVASVATN 360

Query: 361 RHVSSADACNELMAYSLVFSCCCYTCCFRRKLRNTLNIKGGLIDDFLSHFLCCCCALVQE 420
           RHVSSADACNELMAYSLVFSCCCYTCCFRRKLR+ LNIKGGLIDDFLSHFLCCCCALVQE
Sbjct: 361 RHVSSADACNELMAYSLVFSCCCYTCCFRRKLRSMLNIKGGLIDDFLSHFLCCCCALVQE 399

Query: 421 WREVEMRCG 430
           WREVEMRCG
Sbjct: 421 WREVEMRCG 399

BLAST of Clc01G20790 vs. ExPASy TrEMBL
Match: A0A1S3B1M6 (protein MID1-COMPLEMENTING ACTIVITY 1 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103485153 PE=4 SV=1)

HSP 1 Score: 767.3 bits (1980), Expect = 3.9e-218
Identity = 383/429 (89.28%), Positives = 390/429 (90.91%), Query Frame = 0

Query: 1   MSSWDSLGDVAGVAQLTGINAVQLISMIVKAANTARMHKKNCKQFAQHLKLIGNLLDQLK 60
           MSSWDSLGDVA VAQLTGINAVQLISMIVKAANTARMHKKNCKQFAQHLKLIGNLLDQLK
Sbjct: 1   MSSWDSLGDVASVAQLTGINAVQLISMIVKAANTARMHKKNCKQFAQHLKLIGNLLDQLK 60

Query: 61  ISELKKYPETREPLEQLEDALRKSYILINSCQDRSYLYLLAMGWNVVYQFRKAQSEIDRY 120
           ISEL KYPETREPLEQLEDALRKSYILINSCQDRSYLYLLAMGWNVVYQFRKAQSEIDRY
Sbjct: 61  ISELTKYPETREPLEQLEDALRKSYILINSCQDRSYLYLLAMGWNVVYQFRKAQSEIDRY 120

Query: 121 LRLVPLINLVDNARVRERLDDIEKHQCEYTFEEDDRRIQDVILKPESVKNDASILKKTLS 180
           LRLVPLINLVDNARVRERLDDIEKHQCEYTFEEDDRRIQDVILKPES+KNDASILKKTLS
Sbjct: 121 LRLVPLINLVDNARVRERLDDIEKHQCEYTFEEDDRRIQDVILKPESIKNDASILKKTLS 180

Query: 181 RSYPNLGLHDALQKENEKLQLELQISQSNMDVGQCQIIERLFDITEALSANYFIEKDLQR 240
           RSYPNLGLHDALQKENEKLQLELQISQSNMDVGQCQIIERLFDITEALSANYFIEKDLQR
Sbjct: 181 RSYPNLGLHDALQKENEKLQLELQISQSNMDVGQCQIIERLFDITEALSANYFIEKDLQR 240

Query: 241 GIPTQHEYKYPEANGETAHAYDGSFNKNRDAITARKGSSVSSRHDLLSSNCQHEEWHADL 300
           GIPTQHEY Y +ANGET HAYDG+F+KNRD I  RKGSSVSSRHDLLSSNCQHEEWHADL
Sbjct: 241 GIPTQHEYNYSDANGETTHAYDGNFHKNRDGIMTRKGSSVSSRHDLLSSNCQHEEWHADL 300

Query: 301 FGCCSRPYLCTFISLLKLYLLACARDRTEECSFFSVIGCGIKTFFCPCWTLSKVASVATN 360
           FGCCS+PYLC                              +KTFFCPCWTLSKVASVATN
Sbjct: 301 FGCCSQPYLC------------------------------MKTFFCPCWTLSKVASVATN 360

Query: 361 RHVSSADACNELMAYSLVFSCCCYTCCFRRKLRNTLNIKGGLIDDFLSHFLCCCCALVQE 420
           RHVSSADACNELMAYSLVFSCCCYTCCFRRKLR+ LNIKGGLIDDFLSHFLCCCCALVQE
Sbjct: 361 RHVSSADACNELMAYSLVFSCCCYTCCFRRKLRSMLNIKGGLIDDFLSHFLCCCCALVQE 399

Query: 421 WREVEMRCG 430
           WREVEMRCG
Sbjct: 421 WREVEMRCG 399

BLAST of Clc01G20790 vs. ExPASy TrEMBL
Match: A0A6J1ISW3 (cell number regulator 13-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111478376 PE=4 SV=1)

HSP 1 Score: 742.3 bits (1915), Expect = 1.3e-210
Identity = 368/429 (85.78%), Positives = 381/429 (88.81%), Query Frame = 0

Query: 1   MSSWDSLGDVAGVAQLTGINAVQLISMIVKAANTARMHKKNCKQFAQHLKLIGNLLDQLK 60
           MSSWDSLGDVAGVAQL G NAVQLISMIV+AANTARMHKKNCKQFAQHLKLIGNLLDQLK
Sbjct: 1   MSSWDSLGDVAGVAQLVGFNAVQLISMIVRAANTARMHKKNCKQFAQHLKLIGNLLDQLK 60

Query: 61  ISELKKYPETREPLEQLEDALRKSYILINSCQDRSYLYLLAMGWNVVYQFRKAQSEIDRY 120
           ISELKKYPETREPLEQLEDALRKSYILINSCQDRSYLYLLAMGWNVVYQFRKAQSEIDRY
Sbjct: 61  ISELKKYPETREPLEQLEDALRKSYILINSCQDRSYLYLLAMGWNVVYQFRKAQSEIDRY 120

Query: 121 LRLVPLINLVDNARVRERLDDIEKHQCEYTFEEDDRRIQDVILKPESVKNDASILKKTLS 180
           LRLVPLINLV NARVRERLDDIEKHQCEYTFEEDDRRIQD ILKPES+KNDASILKKTLS
Sbjct: 121 LRLVPLINLVGNARVRERLDDIEKHQCEYTFEEDDRRIQDAILKPESIKNDASILKKTLS 180

Query: 181 RSYPNLGLHDALQKENEKLQLELQISQSNMDVGQCQIIERLFDITEALSANYFIEKDLQR 240
           RSYPNLGLHDALQKENEKLQLELQ+SQSNMDVGQCQIIERLFDITEALSANYFIEKDLQ+
Sbjct: 181 RSYPNLGLHDALQKENEKLQLELQVSQSNMDVGQCQIIERLFDITEALSANYFIEKDLQK 240

Query: 241 GIPTQHEYKYPEANGETAHAYDGSFNKNRDAITARKGSSVSSRHDLLSSNCQHEEWHADL 300
           GIP QH Y Y +  GETAHAY G+F+KNRDA T RKGSSVSSRHD LSSNCQHEEWHADL
Sbjct: 241 GIPIQHGYSYSDVAGETAHAYGGNFHKNRDASTTRKGSSVSSRHDPLSSNCQHEEWHADL 300

Query: 301 FGCCSRPYLCTFISLLKLYLLACARDRTEECSFFSVIGCGIKTFFCPCWTLSKVASVATN 360
            GCCS+PYLC                              IKTFFCPCWTLSKVASVATN
Sbjct: 301 LGCCSQPYLC------------------------------IKTFFCPCWTLSKVASVATN 360

Query: 361 RHVSSADACNELMAYSLVFSCCCYTCCFRRKLRNTLNIKGGLIDDFLSHFLCCCCALVQE 420
           +HVS ADACNELMAY+LVFSCCCYTCCFRRKLRN LNIKGG++DDFLSH LCCCCALVQE
Sbjct: 361 KHVSPADACNELMAYALVFSCCCYTCCFRRKLRNMLNIKGGIVDDFLSHLLCCCCALVQE 399

Query: 421 WREVEMRCG 430
           WRE+EMRCG
Sbjct: 421 WREIEMRCG 399

BLAST of Clc01G20790 vs. ExPASy TrEMBL
Match: A0A6J1GFL1 (cell number regulator 13 OS=Cucurbita moschata OX=3662 GN=LOC111453713 PE=4 SV=1)

HSP 1 Score: 740.0 bits (1909), Expect = 6.7e-210
Identity = 365/429 (85.08%), Positives = 380/429 (88.58%), Query Frame = 0

Query: 1   MSSWDSLGDVAGVAQLTGINAVQLISMIVKAANTARMHKKNCKQFAQHLKLIGNLLDQLK 60
           MSSWDSLGDVAGVAQL G NAVQLISMIV+AANTARMHKKNCKQFAQH+KLIGNLLDQLK
Sbjct: 1   MSSWDSLGDVAGVAQLVGFNAVQLISMIVRAANTARMHKKNCKQFAQHIKLIGNLLDQLK 60

Query: 61  ISELKKYPETREPLEQLEDALRKSYILINSCQDRSYLYLLAMGWNVVYQFRKAQSEIDRY 120
           ISELKKYPETREPLEQLEDALRKSYILINSCQDRSYLYL AMGWNVVYQFRKAQSEIDRY
Sbjct: 61  ISELKKYPETREPLEQLEDALRKSYILINSCQDRSYLYLFAMGWNVVYQFRKAQSEIDRY 120

Query: 121 LRLVPLINLVDNARVRERLDDIEKHQCEYTFEEDDRRIQDVILKPESVKNDASILKKTLS 180
           LRLVPLINLVDNARVRERLDDIEKHQCEYTFEEDDRRIQD ILKPES+KNDASILKKTLS
Sbjct: 121 LRLVPLINLVDNARVRERLDDIEKHQCEYTFEEDDRRIQDAILKPESIKNDASILKKTLS 180

Query: 181 RSYPNLGLHDALQKENEKLQLELQISQSNMDVGQCQIIERLFDITEALSANYFIEKDLQR 240
           RSYPNLGLHDALQKENEKLQLELQ+SQSNMDVGQCQIIERLFDITEALSANYFIEKDLQ+
Sbjct: 181 RSYPNLGLHDALQKENEKLQLELQVSQSNMDVGQCQIIERLFDITEALSANYFIEKDLQK 240

Query: 241 GIPTQHEYKYPEANGETAHAYDGSFNKNRDAITARKGSSVSSRHDLLSSNCQHEEWHADL 300
           GIP QH Y Y +  GETAHAY G+ +KNRDA T RKGSSVSSRHD LSSNCQHEEWHADL
Sbjct: 241 GIPIQHGYSYSDVTGETAHAYGGNLHKNRDASTTRKGSSVSSRHDPLSSNCQHEEWHADL 300

Query: 301 FGCCSRPYLCTFISLLKLYLLACARDRTEECSFFSVIGCGIKTFFCPCWTLSKVASVATN 360
            GCCS+PYLC                              IKTFFCPCWTLSKVA+VATN
Sbjct: 301 LGCCSQPYLC------------------------------IKTFFCPCWTLSKVATVATN 360

Query: 361 RHVSSADACNELMAYSLVFSCCCYTCCFRRKLRNTLNIKGGLIDDFLSHFLCCCCALVQE 420
           +HVS ADACNELMAY+LVFSCCCYTCCFRRKLRN LNIKGG++DDFLSH LCCCCALVQE
Sbjct: 361 KHVSPADACNELMAYALVFSCCCYTCCFRRKLRNMLNIKGGIVDDFLSHLLCCCCALVQE 399

Query: 421 WREVEMRCG 430
           WRE+EMRCG
Sbjct: 421 WREIEMRCG 399
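
The Identity and Positives percentages in each HSP header are the match counts divided by the alignment length (for example, 365/429 for the hit above). A minimal Python sketch for re-deriving them from a header line follows; the helper name `hsp_percentages` and the regular expression are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Hypothetical pattern for the HSP statistics line shown on this page.
# "Posi?tives" also accepts the common misspelling "Postives".
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    identical, ident_len, positive, pos_len = map(int, match.groups())
    # Both percentages use the alignment length (gaps included) as denominator.
    identity_pct = round(100 * identical / ident_len, 2)
    positives_pct = round(100 * positive / pos_len, 2)
    return identity_pct, positives_pct


if __name__ == "__main__":
    header = "Identity = 365/429 (85.08%), Positives = 380/429 (88.58%), Query Frame = 0"
    print(hsp_percentages(header))
```

Recomputing from the counts is a quick consistency check when the percentages are copied into a table by hand.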

BLAST of Clc01G20790 vs. ExPASy TrEMBL
Match: A0A6J1IKW7 (cell number regulator 13-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111478376 PE=4 SV=1)

HSP 1 Score: 737.3 bits (1902), Expect = 4.3e-209
Identity = 368/431 (85.38%), Positives = 381/431 (88.40%), Query Frame = 0

Query: 1   MSSWDSLGDVAGVAQLTGINAVQLISMIVKAANTARMHKKNCKQFAQHLKLIGNLLDQLK 60
           MSSWDSLGDVAGVAQL G NAVQLISMIV+AANTARMHKKNCKQFAQHLKLIGNLLDQLK
Sbjct: 1   MSSWDSLGDVAGVAQLVGFNAVQLISMIVRAANTARMHKKNCKQFAQHLKLIGNLLDQLK 60

Query: 61  ISELKKYPETREPLEQLEDALRKSYILINSCQDRSYLYLLAMGWNVVYQFRKAQSEIDRY 120
           ISELKKYPETREPLEQLEDALRKSYILINSCQDRSYLYLLAMGWNVVYQFRKAQSEIDRY
Sbjct: 61  ISELKKYPETREPLEQLEDALRKSYILINSCQDRSYLYLLAMGWNVVYQFRKAQSEIDRY 120

Query: 121 LRLVPLINLVDNARVR--ERLDDIEKHQCEYTFEEDDRRIQDVILKPESVKNDASILKKT 180
           LRLVPLINLV NARVR  ERLDDIEKHQCEYTFEEDDRRIQD ILKPES+KNDASILKKT
Sbjct: 121 LRLVPLINLVGNARVRVIERLDDIEKHQCEYTFEEDDRRIQDAILKPESIKNDASILKKT 180

Query: 181 LSRSYPNLGLHDALQKENEKLQLELQISQSNMDVGQCQIIERLFDITEALSANYFIEKDL 240
           LSRSYPNLGLHDALQKENEKLQLELQ+SQSNMDVGQCQIIERLFDITEALSANYFIEKDL
Sbjct: 181 LSRSYPNLGLHDALQKENEKLQLELQVSQSNMDVGQCQIIERLFDITEALSANYFIEKDL 240

Query: 241 QRGIPTQHEYKYPEANGETAHAYDGSFNKNRDAITARKGSSVSSRHDLLSSNCQHEEWHA 300
           Q+GIP QH Y Y +  GETAHAY G+F+KNRDA T RKGSSVSSRHD LSSNCQHEEWHA
Sbjct: 241 QKGIPIQHGYSYSDVAGETAHAYGGNFHKNRDASTTRKGSSVSSRHDPLSSNCQHEEWHA 300

Query: 301 DLFGCCSRPYLCTFISLLKLYLLACARDRTEECSFFSVIGCGIKTFFCPCWTLSKVASVA 360
           DL GCCS+PYLC                              IKTFFCPCWTLSKVASVA
Sbjct: 301 DLLGCCSQPYLC------------------------------IKTFFCPCWTLSKVASVA 360

Query: 361 TNRHVSSADACNELMAYSLVFSCCCYTCCFRRKLRNTLNIKGGLIDDFLSHFLCCCCALV 420
           TN+HVS ADACNELMAY+LVFSCCCYTCCFRRKLRN LNIKGG++DDFLSH LCCCCALV
Sbjct: 361 TNKHVSPADACNELMAYALVFSCCCYTCCFRRKLRNMLNIKGGIVDDFLSHLLCCCCALV 401

Query: 421 QEWREVEMRCG 430
           QEWRE+EMRCG
Sbjct: 421 QEWREIEMRCG 401

BLAST of Clc01G20790 vs. TAIR 10
Match: AT4G35920.1 (PLAC8 family protein )

HSP 1 Score: 503.8 bits (1296), Expect = 1.5e-142
Identity = 259/431 (60.09%), Positives = 318/431 (73.78%), Query Frame = 0

Query: 3   SWDSLGDVAGVAQLTGINAVQLISMIVKAANTARMHKKNCKQFAQHLKLIGNLLDQLKIS 62
           SWD LG++A VAQLTG++AV+LI +IVKAANTA MHKKNC+QFAQHLKLIGNLL+QLKIS
Sbjct: 4   SWDGLGEIASVAQLTGLDAVKLIGLIVKAANTAWMHKKNCRQFAQHLKLIGNLLEQLKIS 63

Query: 63  ELKKYPETREPLEQLEDALRKSYILINSCQDRSYLYLLAMGWNVVYQFRKAQSEIDRYLR 122
           E+KKYPETREPLE LEDALR+SY+L+NSC+DRSYLYLLAMGWN+VYQFRK Q EIDR+L+
Sbjct: 64  EMKKYPETREPLEGLEDALRRSYLLVNSCRDRSYLYLLAMGWNIVYQFRKHQDEIDRFLK 123

Query: 123 LVPLINLVDNARVRERLDDIEKHQCEYTFEEDDRRIQDVILKPESVKNDASILKKTLSRS 182
           ++PLI LVDNAR+RER + I++ Q EYT +E+DR +QDVILK ES +  AS+LKKTLS S
Sbjct: 124 IIPLITLVDNARIRERFEYIDRDQREYTLDEEDRHVQDVILKQESTREAASVLKKTLSCS 183

Query: 183 YPNLGLHDALQKENEKLQLELQISQSNMDVGQCQIIERLFDITEALSANYFIEKDLQRGI 242
           YPNL   +AL+ ENEKLQ+ELQ SQ + DV QC++I+RL  +T+A +A   +E D ++ +
Sbjct: 184 YPNLRFCEALKTENEKLQIELQRSQEHYDVAQCEVIQRLIGVTQAAAA---VEPDSEKEL 243

Query: 243 PTQHEYKYPEANG-ETAHAYD-GSFNKNRDAITARKGSSVSSRHDLLSSNC----QHEEW 302
             +   K   ++  +T ++YD  S  K+     +R  S+VSS HDLLS        HEEW
Sbjct: 244 TKKASKKSERSSSMKTEYSYDEDSPKKSSTRAASRSTSNVSSGHDLLSRRASQAQHHEEW 303

Query: 303 HADLFGCCSRPYLCTFISLLKLYLLACARDRTEECSFFSVIGCGIKTFFCPCWTLSKVAS 362
           H DL  CCS P LC                               KTFF PC TL+K+A+
Sbjct: 304 HTDLLACCSEPSLC------------------------------FKTFFFPCGTLAKIAT 363

Query: 363 VATNRHVSSADACNELMAYSLVFSCCCYTCCFRRKLRNTLNIKGGLIDDFLSHFLCCCCA 422
            A+NRH+SSA+ACNELMAYSL+ SCCCYTCC RRKLR TLNI GG IDDFLSH +CCCCA
Sbjct: 364 AASNRHISSAEACNELMAYSLILSCCCYTCCVRRKLRKTLNITGGFIDDFLSHVMCCCCA 401

Query: 423 LVQEWREVEMR 428
           LVQE REVE+R
Sbjct: 424 LVQELREVEIR 401

BLAST of Clc01G20790 vs. TAIR 10
Match: AT4G35920.3 (PLAC8 family protein )

HSP 1 Score: 503.8 bits (1296), Expect = 1.5e-142
Identity = 259/431 (60.09%), Positives = 318/431 (73.78%), Query Frame = 0

Query: 3   SWDSLGDVAGVAQLTGINAVQLISMIVKAANTARMHKKNCKQFAQHLKLIGNLLDQLKIS 62
           SWD LG++A VAQLTG++AV+LI +IVKAANTA MHKKNC+QFAQHLKLIGNLL+QLKIS
Sbjct: 4   SWDGLGEIASVAQLTGLDAVKLIGLIVKAANTAWMHKKNCRQFAQHLKLIGNLLEQLKIS 63

Query: 63  ELKKYPETREPLEQLEDALRKSYILINSCQDRSYLYLLAMGWNVVYQFRKAQSEIDRYLR 122
           E+KKYPETREPLE LEDALR+SY+L+NSC+DRSYLYLLAMGWN+VYQFRK Q EIDR+L+
Sbjct: 64  EMKKYPETREPLEGLEDALRRSYLLVNSCRDRSYLYLLAMGWNIVYQFRKHQDEIDRFLK 123

Query: 123 LVPLINLVDNARVRERLDDIEKHQCEYTFEEDDRRIQDVILKPESVKNDASILKKTLSRS 182
           ++PLI LVDNAR+RER + I++ Q EYT +E+DR +QDVILK ES +  AS+LKKTLS S
Sbjct: 124 IIPLITLVDNARIRERFEYIDRDQREYTLDEEDRHVQDVILKQESTREAASVLKKTLSCS 183

Query: 183 YPNLGLHDALQKENEKLQLELQISQSNMDVGQCQIIERLFDITEALSANYFIEKDLQRGI 242
           YPNL   +AL+ ENEKLQ+ELQ SQ + DV QC++I+RL  +T+A +A   +E D ++ +
Sbjct: 184 YPNLRFCEALKTENEKLQIELQRSQEHYDVAQCEVIQRLIGVTQAAAA---VEPDSEKEL 243

Query: 243 PTQHEYKYPEANG-ETAHAYD-GSFNKNRDAITARKGSSVSSRHDLLSSNC----QHEEW 302
             +   K   ++  +T ++YD  S  K+     +R  S+VSS HDLLS        HEEW
Sbjct: 244 TKKASKKSERSSSMKTEYSYDEDSPKKSSTRAASRSTSNVSSGHDLLSRRASQAQHHEEW 303

Query: 303 HADLFGCCSRPYLCTFISLLKLYLLACARDRTEECSFFSVIGCGIKTFFCPCWTLSKVAS 362
           H DL  CCS P LC                               KTFF PC TL+K+A+
Sbjct: 304 HTDLLACCSEPSLC------------------------------FKTFFFPCGTLAKIAT 363

Query: 363 VATNRHVSSADACNELMAYSLVFSCCCYTCCFRRKLRNTLNIKGGLIDDFLSHFLCCCCA 422
            A+NRH+SSA+ACNELMAYSL+ SCCCYTCC RRKLR TLNI GG IDDFLSH +CCCCA
Sbjct: 364 AASNRHISSAEACNELMAYSLILSCCCYTCCVRRKLRKTLNITGGFIDDFLSHVMCCCCA 401

Query: 423 LVQEWREVEMR 428
           LVQE REVE+R
Sbjct: 424 LVQELREVEIR 401

BLAST of Clc01G20790 vs. TAIR 10
Match: AT4G35920.2 (PLAC8 family protein )

HSP 1 Score: 503.8 bits (1296), Expect = 1.5e-142
Identity = 259/431 (60.09%), Positives = 318/431 (73.78%), Query Frame = 0

Query: 3   SWDSLGDVAGVAQLTGINAVQLISMIVKAANTARMHKKNCKQFAQHLKLIGNLLDQLKIS 62
           SWD LG++A VAQLTG++AV+LI +IVKAANTA MHKKNC+QFAQHLKLIGNLL+QLKIS
Sbjct: 4   SWDGLGEIASVAQLTGLDAVKLIGLIVKAANTAWMHKKNCRQFAQHLKLIGNLLEQLKIS 63

Query: 63  ELKKYPETREPLEQLEDALRKSYILINSCQDRSYLYLLAMGWNVVYQFRKAQSEIDRYLR 122
           E+KKYPETREPLE LEDALR+SY+L+NSC+DRSYLYLLAMGWN+VYQFRK Q EIDR+L+
Sbjct: 64  EMKKYPETREPLEGLEDALRRSYLLVNSCRDRSYLYLLAMGWNIVYQFRKHQDEIDRFLK 123

Query: 123 LVPLINLVDNARVRERLDDIEKHQCEYTFEEDDRRIQDVILKPESVKNDASILKKTLSRS 182
           ++PLI LVDNAR+RER + I++ Q EYT +E+DR +QDVILK ES +  AS+LKKTLS S
Sbjct: 124 IIPLITLVDNARIRERFEYIDRDQREYTLDEEDRHVQDVILKQESTREAASVLKKTLSCS 183

Query: 183 YPNLGLHDALQKENEKLQLELQISQSNMDVGQCQIIERLFDITEALSANYFIEKDLQRGI 242
           YPNL   +AL+ ENEKLQ+ELQ SQ + DV QC++I+RL  +T+A +A   +E D ++ +
Sbjct: 184 YPNLRFCEALKTENEKLQIELQRSQEHYDVAQCEVIQRLIGVTQAAAA---VEPDSEKEL 243

Query: 243 PTQHEYKYPEANG-ETAHAYD-GSFNKNRDAITARKGSSVSSRHDLLSSNC----QHEEW 302
             +   K   ++  +T ++YD  S  K+     +R  S+VSS HDLLS        HEEW
Sbjct: 244 TKKASKKSERSSSMKTEYSYDEDSPKKSSTRAASRSTSNVSSGHDLLSRRASQAQHHEEW 303

Query: 303 HADLFGCCSRPYLCTFISLLKLYLLACARDRTEECSFFSVIGCGIKTFFCPCWTLSKVAS 362
           H DL  CCS P LC                               KTFF PC TL+K+A+
Sbjct: 304 HTDLLACCSEPSLC------------------------------FKTFFFPCGTLAKIAT 363

Query: 363 VATNRHVSSADACNELMAYSLVFSCCCYTCCFRRKLRNTLNIKGGLIDDFLSHFLCCCCA 422
            A+NRH+SSA+ACNELMAYSL+ SCCCYTCC RRKLR TLNI GG IDDFLSH +CCCCA
Sbjct: 364 AASNRHISSAEACNELMAYSLILSCCCYTCCVRRKLRKTLNITGGFIDDFLSHVMCCCCA 401

Query: 423 LVQEWREVEMR 428
           LVQE REVE+R
Sbjct: 424 LVQELREVEIR 401

BLAST of Clc01G20790 vs. TAIR 10
Match: AT2G17780.3 (PLAC8 family protein )

HSP 1 Score: 443.4 bits (1139), Expect = 2.5e-124
Identity = 236/445 (53.03%), Positives = 306/445 (68.76%), Query Frame = 0

Query: 2   SSWDSLGDVAGVAQLTGINAVQLISMIVKAANTARMHKKNCKQFAQHLKLIGNLLDQLKI 61
           +SWD LG++A VAQLTGI+A++LI MIV AANTARMHKKNC+QFA HLKLI NLL+Q+K 
Sbjct: 3   NSWDQLGEIASVAQLTGIDALKLIGMIVNAANTARMHKKNCRQFAHHLKLIRNLLEQIKN 62

Query: 62  SELKKYPETREPLEQLEDALRKSYILINSCQDRSYLYLLAMGWNVVYQFRKAQSEIDRYL 121
           SE+ +  E  EPL+ L+DALR+SYIL+ SCQ++SYLYLLAMGWN+V QF KAQ+EID +L
Sbjct: 63  SEMNQRSEILEPLQGLDDALRRSYILVKSCQEKSYLYLLAMGWNIVNQFEKAQNEIDLFL 122

Query: 122 RLVPLINLVDNARVRERLDDIEKHQCEYTFEEDDRRIQDVILKPESVKNDA-SILKKTLS 181
           ++VPLIN+ DNAR+RERL+ IE+ Q EYT +E+DR++QDVILK ES +  A S+LKKTLS
Sbjct: 123 KIVPLINMADNARIRERLEAIERDQREYTLDEEDRKVQDVILKQESTREAATSVLKKTLS 182

Query: 182 RSYPNLGLHDALQKENEKLQLELQISQSNMDVGQCQIIERLFDITEALSANYFIEKDLQR 241
           RSYPN+G  +AL+ E EKLQLELQ S++  D  QC++I+RL D+T+  +    +E +L++
Sbjct: 183 RSYPNMGFCEALKTEEEKLQLELQRSRARYDADQCEVIQRLIDVTQTAAT---VEPNLEK 242

Query: 242 GIPTQHEYKYPEANGETAHAYDGSFNKNRDAITARKGSSVSSRHDLLSS-NCQHE-EWHA 301
            +  + E    +   +    YD   +  R    +R  S VSS H+LLS  + QH   WHA
Sbjct: 243 VLTKKEELTSSKKRDD---LYDTDSSSIR--ADSRSTSYVSSGHELLSGRSLQHRGNWHA 302

Query: 302 DLFGCCSRPYLCTFISLLKLYLLACARDRTEECSFFSVIGCGIKTFFCPCWTLSKVASVA 361
           DL  CCS P LC                              +KT F PC TL+K+++VA
Sbjct: 303 DLLDCCSEPCLC------------------------------LKTLFFPCGTLAKISTVA 362

Query: 362 TNRHVSSADACNELMAYSLVFSCCCYTCCFRRKLRNTLNIKGGLIDDFLSHFLCCCCALV 421
           T+R +SS + C  L+ YSL+ SCCCYTCC R+KLR TLNI GG IDDFLSH +CCCCALV
Sbjct: 363 TSRQISSTEVCKNLIVYSLILSCCCYTCCIRKKLRKTLNITGGCIDDFLSHLMCCCCALV 409

Query: 422 QEWREVEMRCGMY--LHFFISIFWY 442
           QE REVE+    Y  ++ F  I  Y
Sbjct: 423 QELREVEIHRASYGKIYTFTDILTY 409

BLAST of Clc01G20790 vs. TAIR 10
Match: AT2G17780.1 (PLAC8 family protein )

HSP 1 Score: 443.0 bits (1138), Expect = 3.2e-124
Identity = 233/433 (53.81%), Positives = 301/433 (69.52%), Query Frame = 0

Query: 2   SSWDSLGDVAGVAQLTGINAVQLISMIVKAANTARMHKKNCKQFAQHLKLIGNLLDQLKI 61
           +SWD LG++A VAQLTGI+A++LI MIV AANTARMHKKNC+QFA HLKLI NLL+Q+K 
Sbjct: 3   NSWDQLGEIASVAQLTGIDALKLIGMIVNAANTARMHKKNCRQFAHHLKLIRNLLEQIKN 62

Query: 62  SELKKYPETREPLEQLEDALRKSYILINSCQDRSYLYLLAMGWNVVYQFRKAQSEIDRYL 121
           SE+ +  E  EPL+ L+DALR+SYIL+ SCQ++SYLYLLAMGWN+V QF KAQ+EID +L
Sbjct: 63  SEMNQRSEILEPLQGLDDALRRSYILVKSCQEKSYLYLLAMGWNIVNQFEKAQNEIDLFL 122

Query: 122 RLVPLINLVDNARVRERLDDIEKHQCEYTFEEDDRRIQDVILKPESVKNDA-SILKKTLS 181
           ++VPLIN+ DNAR+RERL+ IE+ Q EYT +E+DR++QDVILK ES +  A S+LKKTLS
Sbjct: 123 KIVPLINMADNARIRERLEAIERDQREYTLDEEDRKVQDVILKQESTREAATSVLKKTLS 182

Query: 182 RSYPNLGLHDALQKENEKLQLELQISQSNMDVGQCQIIERLFDITEALSANYFIEKDLQR 241
           RSYPN+G  +AL+ E EKLQLELQ S++  D  QC++I+RL D+T+  +    +E +L++
Sbjct: 183 RSYPNMGFCEALKTEEEKLQLELQRSRARYDADQCEVIQRLIDVTQTAAT---VEPNLEK 242

Query: 242 GIPTQHEYKYPEANGETAHAYDGSFNKNRDAITARKGSSVSSRHDLLSS-NCQHE-EWHA 301
            +  + E    +   +    YD   +  R    +R  S VSS H+LLS  + QH   WHA
Sbjct: 243 VLTKKEELTSSKKRDD---LYDTDSSSIR--ADSRSTSYVSSGHELLSGRSLQHRGNWHA 302

Query: 302 DLFGCCSRPYLCTFISLLKLYLLACARDRTEECSFFSVIGCGIKTFFCPCWTLSKVASVA 361
           DL  CCS P LC                              +KT F PC TL+K+++VA
Sbjct: 303 DLLDCCSEPCLC------------------------------LKTLFFPCGTLAKISTVA 362

Query: 362 TNRHVSSADACNELMAYSLVFSCCCYTCCFRRKLRNTLNIKGGLIDDFLSHFLCCCCALV 421
           T+R +SS + C  L+ YSL+ SCCCYTCC R+KLR TLNI GG IDDFLSH +CCCCALV
Sbjct: 363 TSRQISSTEVCKNLIVYSLILSCCCYTCCIRKKLRKTLNITGGCIDDFLSHLMCCCCALV 397

Query: 422 QEWREVEMRCGMY 432
           QE REVE+    Y
Sbjct: 423 QELREVEIHRASY 397

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
KAA0025651.1    1.6e-218  89.51         protein MID1-COMPLEMENTING ACTIVITY 1 isoform X1 [Cucumis melo var. makuwa] >TYK...
XP_008440856.1  8.0e-218  89.28         PREDICTED: protein MID1-COMPLEMENTING ACTIVITY 1 isoform X1 [Cucumis melo] >XP_0...
XP_038882775.1  2.0e-216  88.58         protein MID1-COMPLEMENTING ACTIVITY 1 [Benincasa hispida]
XP_004135021.1  4.9e-215  88.37         protein MID1-COMPLEMENTING ACTIVITY 1 [Cucumis sativus]
KAG6603896.1    1.2e-210  85.55         Cell number regulator 13, partial [Cucurbita argyrosperma subsp. sororia] >KAG70...

Match Name      E-value   Identity (%)  Description
Q8L7E9          2.2e-141  60.09         Protein MID1-COMPLEMENTING ACTIVITY 1 OS=Arabidopsis thaliana OX=3702 GN=MCA1 PE...
B6SJQ0          1.2e-139  56.59         Cell number regulator 13 OS=Zea mays OX=4577 GN=CNR13 PE=2 SV=1
Q3EBY6          4.6e-123  53.81         Protein MID1-COMPLEMENTING ACTIVITY 2 OS=Arabidopsis thaliana OX=3702 GN=MCA2 PE...
Q9LQU4          5.8e-09   36.73         Protein PLANT CADMIUM RESISTANCE 2 OS=Arabidopsis thaliana OX=3702 GN=PCR2 PE=1 ...
P0CW97          9.8e-09   35.71         Protein PLANT CADMIUM RESISTANCE 3 OS=Arabidopsis thaliana OX=3702 GN=PCR3 PE=3 ...

Match Name      E-value   Identity (%)  Description
A0A5A7SN29      7.9e-219  89.51         Protein MID1-COMPLEMENTING ACTIVITY 1 isoform X1 OS=Cucumis melo var. makuwa OX=...
A0A1S3B1M6      3.9e-218  89.28         protein MID1-COMPLEMENTING ACTIVITY 1 isoform X1 OS=Cucumis melo OX=3656 GN=LOC1...
A0A6J1ISW3      1.3e-210  85.78         cell number regulator 13-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC11147...
A0A6J1GFL1      6.7e-210  85.08         cell number regulator 13 OS=Cucurbita moschata OX=3662 GN=LOC111453713 PE=4 SV=1
A0A6J1IKW7      4.3e-209  85.38         cell number regulator 13-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC11147...

Match Name      E-value   Identity (%)  Description
AT4G35920.1     1.5e-142  60.09         PLAC8 family protein
AT4G35920.3     1.5e-142  60.09         PLAC8 family protein
AT4G35920.2     1.5e-142  60.09         PLAC8 family protein
AT2G17780.3     2.5e-124  53.03         PLAC8 family protein
AT2G17780.1     3.2e-124  53.81         PLAC8 family protein

InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR Term   IPR Description                                     Source   Source Term    Source Description                     Alignment
None       No IPR available                                    COILS    Coil           Coil                                   coord: 189..209
None       No IPR available                                    PANTHER  PTHR46604      PROTEIN MID1-COMPLEMENTING ACTIVITY 1  coord: 2..311; coord: 341..428
None       No IPR available                                    PANTHER  PTHR46604:SF3  PROTEIN MID1-COMPLEMENTING ACTIVITY 1  coord: 2..311; coord: 341..428
IPR006461  PLAC8 motif-containing protein                      TIGRFAM  TIGR01571      TIGR01571                              coord: 337..426; e-value: 1.6E-16; score: 58.9
IPR006461  PLAC8 motif-containing protein                      PFAM     PF04749        PLAC8                                  coord: 296..423; e-value: 1.6E-10; score: 41.7
IPR036537  Adaptor protein Cbl, N-terminal domain superfamily  GENE3D   1.20.930.20    -                                      coord: 13..127; e-value: 8.9E-18; score: 66.3
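
The `coord:` values above give the span of each domain hit on the protein. The sketch below shows one way to compare two hits, for instance to see how far the PFAM PLAC8 match (296..423) and the TIGRFAM match (337..426) overlap. It assumes the coordinates are 1-based, inclusive residue positions, which this page does not state explicitly; the function names `parse_coord` and `overlap_length` are hypothetical.

```python
import re


def parse_coord(text):
    """Parse a string such as 'coord: 296..423' into (start, end) integers."""
    match = re.fullmatch(r"coord:\s*(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised coordinate string: {text!r}")
    start, end = int(match.group(1)), int(match.group(2))
    if start > end:
        raise ValueError("start must not exceed end")
    return start, end


def overlap_length(a, b):
    """Number of residues shared by two spans (assumed 1-based, inclusive)."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return max(0, end - start + 1)


if __name__ == "__main__":
    pfam = parse_coord("coord: 296..423")
    tigrfam = parse_coord("coord: 337..426")
    coil = parse_coord("coord: 189..209")
    print(overlap_length(pfam, tigrfam))  # residues covered by both PLAC8 hits
    print(overlap_length(pfam, coil))     # coiled coil versus PLAC8 hit
```

Under the inclusive-coordinate assumption, the two PLAC8 hits share residues 337 to 423, while the predicted coiled coil lies entirely upstream of both.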

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Clc01G20790.2  Clc01G20790.2  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0007166      cell surface receptor signaling pathway