ClCG01G000050 (gene) Watermelon (Charleston Gray)

Name: ClCG01G000050
Type: gene
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: Nuclear protein localization, putative
Location: CG_Chr01 : 46335 .. 54954 (-)
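
The location field can be read as a sequence ID, a start, an end, and a strand. The following is a minimal sketch of parsing it and deriving the span length. It assumes the coordinates are 1-based and inclusive (as in GFF3), which this page does not state. The helper names `parse_location` and `span_length` are hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Parse a location string such as 'CG_Chr01 : 46335 .. 54954 (-)'."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates, so both endpoints count.
    return end - start + 1

seqid, start, end, strand = parse_location("CG_Chr01 : 46335 .. 54954 (-)")
print(seqid, strand, span_length(start, end))  # CG_Chr01 - 8620
```

Under that assumption the gene spans 8620 bp on the minus strand of CG_Chr01.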
The following sequences are available for this feature:

Gene sequence (with intron)

CCGCCGTCTCAACGGAGATTGTCTATTTCCTGATTCCCAAGCCACCGCGGTATACTTCTTATTCGATATAAATACAATCCCTTGCGTCTACTCCCTGAACAAGAAATTCGAAGAAGCATCATACCCTTCTAACTTCTGTTCTCTAAATTTCCGGCAGCATTTTCTGAATCCCACCGTCGACTTGCAACAGCTGTTTATCCCTCAAATCCGTCTTTAAATAACGATCAATCTCCTCATATAGAAGAGAATCTCATTACAACTTCCGCCGACACCAATATGATGCTCAGAATTCGCAGTAGAGATGGCCTGGAGCGAGTCGCTGTAGACAACCCACACATCACAATCGCTCAACTCAAAGCCATAATTCAATCCCAGCTCAATATCCCAATCCACAACCAAACCCTCTCAACTAACCAAAATATTCTATTGGCGAAGACTCATGACGATCTTTCCAAATTCACTGACATGTCTAATCCTAATACCCATCTCTCGTCGCTTAATTTGTCTCATGGGTCTATTGTCTTTCTCGCCTACGAGGGCGAGCGCACTGTTGCTGGCCCTACTTTCCATCCCGCCGGATCTTTTGGCCGTAAAATGACTATGGATGATCTAATTGCTAAGCAGATGCGGATCACTCGTCAAGAAAACCCCCATTGTGAATTGGTTTCTTTCGACCGGGATTGCGCCAATGCTTTCCAGCATTATGTTAATGAAACGCTAGCCTTCGCTGTCAAACGTGGGGGGATGATGTACGGAACCGTGTCACCAGAAGGCAAGGTCGAGGTAGATTTCATATATGAGCCGCCACAGCAAGGGACTGAGGACAATTTACTGTTTTTCCGAGATCCCGATGAGGAAAGATTGGTAGAAGCAATTGCAGTTGGGTTGGGGATGAGGAAAGTTGGGTTTATATTCACGCAGACGATTAGTCAGGACAAAAAGGACTACACCTTATCCAACAGGGAAGTACTCCAGGCGGCTCAGTTTCACTCCGAGAGCGAGTTGAAGGAGTGGGTGACAGCAGTTGTGAAGTTGGAGGTGAACGAGGATGGGGGTGCTGATGTTCATTTTGAGGCTTTTCAAATGAGTGACATGTGCATTAGATTGTTCAAGGAAGGTTGGTTTGAGACGGATATTGGAGAGGATTTTGATCCCAAGCTCTCGAAGATGAGGAGGGACGTTGTTGTTGGTGTCAAAGACACTAGAGATGTTGACAATGACTTTTTCCTGGTCGTAGTTAAGATTTTCGACCATCAGGTTCGTGGACCCTTTAATTTTGCTAATCATATTACCTAATTCCTCTGCTGCGTCGAATTGTTTGCGTTTTCTCCTCAATCAATGACTCATAATTGCAATCATTGAACTTGTTTGCACAGAATTCTCGGATTTTTGTTAATTTTCCTTGTATTTTGCTTCCTTGACTGAAAGTGGATAGTCTATTAGCTTTGTAAATTGAATGTGTTAGCCCAGTATTGGCCGGAACGTAGGAGTTGAAGGTTTGGGTGTGGTGTATTTTCCATTTTCTTAGAGCCAGAATATTAGCGCTACTTGGTAATAGTTTTTGGAAATATCAATCTGGTATTTCGTTCATTAAATATTTGTTATTTATTCTACCATGACAGGTTTTAACAGATTACTAGTGTCAAGTAAGAAGACTGTGATGGCTGTATAAAATTTTACTTCTGTTCTCTTTCTCAGGCAGATGCTTACAAATAATTTATCCTTCTTGCAGAAAGTTTTCATGATACAATTCTTTGAAACAGATGAGTGCAAAATATCTTATTCTTTTTGTGAACCAAAGCCTTAGACTTCATTCTGTTTTTCTTTTTTTCCCACATCTACATTGGTCAAGAATGAGAATTACCTTTTTCAAATTCACTAATTGCTGCCTAGCTCTCCTTTTTCTTTTCCATTCTTGCTCACTCTTAAAATTGAATCCCTAAATTCCTTAATGATGACATTTACAATGGGAAAAAGCTTTCTTGATTTCTTCAAGTATA
AAATGTTATCTCTTGTGATCTATTATGTTTTTCTGGCTTATAGTTTTTTTTTCCATTCAAGAATTAAGTGTACTTTTGTGCTTTATATTTCTTCACCTTTTTGTCTTCTTAGGAGCTGCTTGATTGGGCATGTTCAAAATTTTCTTATAGCATTCTTATCAGATCTAAGAAGCAATTACTTACTCTGAAAGGGTTGTATACAACATTTCAGGAAATTAGGTTTAAGAAAGTGGTCACCCTACAGTCACCTGCATTGTATTAAAGTCCCCCAAGTTTCTAGAATCTCAAATATTGTATGTTCCATGATAATCTTAATGTTTAGCTTAGGTTGAGGCCTTGGAAGTTTTTTTATGCTTAAAAAATGAAATGAGGACCAACGGTTTCCTTATTGATTTACTGATATAGTTTCTGGGGTTTACTAAGTAAATTTTGTTGCAATATTTAGTTCCGGGACAATGTTCTTATTGCTAATAAGACTGAGGTGTTGTTTACAAGTTATTATATCAGTCTTGTCACTCAAGGTCCCTTTTCCCTTTGTAGGAAGACAAATTTTGGAGTTTGGACCATGTTGTTTTTTTAATTATTAATGAGTATATCAAGGACTATGGGTTTCATAGATTGGAAGGAGTTAATTTTCTATTGATCTTGAAGAAGTCCATGTCCATGTTGATTGGAGCTTTTGTGATAAGGCTTATAAAAGAAGGGTTTTGGGTTCAAGTGGAGAATGTGGGTTTGTTGCTATCCTAAGTTGGTTAATCAATTTTGGTTAGGAGGAATTTAGTGGAAGAACTTTAACTTATCAGGAGAGTTTGGGGGGGGGGGGGGGGGGGGGGTTTTCCCTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGGGGGGGGGGGGGGGGGGTGGTTATCCCTATTCTTTTCCCAGTTGTTGTTGATGTCCAAAGCAAATTAGTGTCAACAAGAACAGATAAGGATCTTATTTCAGGGTTCTATGTGGGGACGTAATTAATTTTTCTATCTCATTTTTAGTTTGTTGATAATACAAACTATTTTTCTCGGTTAGGGAGGGCTCTCTTGTTAGTCTTAACATAGTTCTATCGGTTTGTGAAATCATTTTGGTCATAATTAGAGACAAGAGTTTTGTGATTAGTATTAGTCATGATTCCTTCTAAGCTGCCTGGATGAATGTCTAATTGGGGTTTGCATGAAAGTGGCTTCCTACCATTTATATACAGTGCTTCCCCTTGGCCATACTTCCAATCATTTGTTGTTTTGGAATCTTTAATCTCACAGGACAATCTGCCTGACTCTACAACATTTGGGTGTCTAAGGAAACTTGTAGGATATTAAATCCTAGGTAGGTGGCCACCATGGATTGGACCCATGACCTCTTAGCCTTTTATTGAGATTGTCTCCTTTTTTACCACTAAGCCAACCCATGGTGGTTTATTTCGGAATCTAATTGTGGAATATGTCGAAAGATATCTCTTTTATTTGAAAATTATGTTACTTTCTTAGGGTGATAGGACAATACTTTTATAGTCTGGAGATGTTTTGAGTGATTTTCTAAAAGTCACATCTTGTACTGAGGATGTCAGTATTTCTTGAGGTCCAAGTTCATATCAATGCAGAGGGAATTCCAATAGATATTCTACTTAGACGTCAATATTTCTTGAGGTCCAAGTTCATATCAACGCAGAGGGAATTCCAATAGATATTCTACTTAGACTCTTCCTCTCTTTTATAGCCAAAGCACATATAAATTGGCTTCATGGTTGTGGGTTTATGAAATCAGTGGGTCAAAATTGAA
AGAATCTGAATGAAGCATTTATAAGGAAGTAGGTGAGAAGCCTATGATAAATCAATCTACTGATTTGAAGCTTGTTCTAAACCACGGAAAAATTGGAGTTTTAAGTCTCTATCATTTGGTTGTTCAATCAACAGATGATGCCATTATTACAAGACCTTAGGAGCATGAATTTTCTGAGGAGGATATTGACGTCAGCATTAGTAGTTTGGATTCTCCCAGTGGTTTTCTGGTTTTAGAATTGTTAAATAATGTAATGGGTAACCAATCATTTCATGATGGGCTCCTTAATCTCTTAAGGGAAGATAAAATGTCACATGATGGTGAAAATATAAAAACTTTTGCTGTCCCTTCAGTAAAGGGGAAATCTATGTTAGCCTCGTGGATGGGGGTCTTTCTAAATACATCAAATAGGCTGGTATATCTCTTCTCCCATTCATCCCATCTCAATATTGATAATTATAGAGGGCTTATTTCTCTGACTCTCAATAGGTCTTTTTTGTTATGCACTCATTATGAAAATCATTCTTGGTTTCCTATTAAGCCCAAAGACCATTTTGGTGGACTCAAGCTCCAAAAGGACTTGAAACTCTAAGAAGACAAAGTTTCATCTTTCTAGATTTCTTTTTCTTTTTTCCTTTTTCTTTTTCTTTTTCTTTTTTTTTCTTTTTTCCCCCTCCTTCTGGTATGTTGTTCTGTTTGGATAGTTTCAGCTTTCTGAGAGAGAGTGAATGGGTTAATCAGGATTTTGCTTAATGGTAATTTAAATTCAGCTGATATCAAAAAAGCCTCCACGTTCTATTCTGAAGCCATCAGTATGTGTTTTATGTTATGCAGACAGTGAATTCCAAAGTCTTTTGTTCTCCCACTGCCCATACGCTTTGGCTTCTTGGTATTCAATGTTTCACCTTTCTAATATGCATTGGATTTTTTCTCAAGATTTAAGAAGTAATTTATTCCAGTTTCTCATCGGTCCAGTTCTCCACTCTCAAGCTCATCTTTTATGGGTTAATGCCGTTAAATCCATCTGGTCAGAATTATGGCTCTAACAGAATCAAAGAACTTAAGTTTGGTCTGAATGTTTTGAGAGTTGAGTTCACAGATTCTCAATGGTGCTCACCCAGCTTATTTTATCATGGAATGCTTTTACTCATTCTTCCTTTTTTGCTTTGCCATACTTTTTATTTTCCCTCTTTGTCATGAGGAAGTGGGCCTCAACCCAATTATTTAAAATGAGGCCACACCTACGAAGCCCCCATGGTTTGACGTTCCTCTAGATCCATATGATTTCACGCCATTAGGGTTAGAGGTTATCACCAAACCAACTTGTTGTCTTTTCTCCAGCACACGGTGCCTTCCTCGAGGCTTCATCTCCATTTACACATGCTCATGTCGCTGGTAAGAGCCTCCATTAGCTTTTCTCTTCTTTTCTTTCGAACTTTATAGCTACTAAAGTGTTTTCCTCTTAATGACCGATGATCCCACTCTAAGAATCCAACCTTCGACATGCTAGATATCTTTTCAGCCCATTTACCCCAGATTACGAGTCATTCTCCCCTCTCTTGCAACTGAATCTTCTGTGCTGATATTTTTTGTTGGCCTTCCCATTTTTCTGTTTTTCTACTCGTATTGTGAACAAACACAACCTTGGGCTCTGTGTTCTTTCCATCTTGTTTCTTATATGGGAATTCTTCCTTTCTTGCAGTTGTTGCGCCATCCTGATCATCGACCTTCCTAGTATCTTCTTTCTTGCCATATGAGATACTTGATTCTGTTATGGAACTCTGCCTCTTTGTCTTTGCCAACCTAATCTCTTTTTTTATTGTGTAATCTCCACTTTTATTTCAACTCTATGCTGTTGTGTCTCACATAGATTGAAAATCGAGTTCATTGACTCCAATGGTTTCTTCAGTTGATCCATTTATTTTGATAAAATTTTCAATATCTCTTCATTTAAATTAACATCTAACTCCGTGACCAGAATTGAACTGCACTATACTAAT
TGATTGAACCAAAATATACTCAATTATAAAAGGTAACGAAACCATAAGATAACCAAGTTTCCAAGTAATTCAAGGTATTTGGACCCTCCTTTTAAGCACACTCACTCAAATCTTAGCCAACTTAATTAGACCATGCCTTCATCCTTTCTAACCCCCTCTATTTATAACAAACTCTCCTAACTAACTTTCTTATCTAATTACCAATGTACTCTTGTGGCCTCTCGTTCCCTTGGTGGGATTGGGATTGGGGGAAAAACTCTCTTTTTCTCAAATGTTAGTACTTCGAGAAAGAAAATAAAAAAAGCTCTTTGGAGAAGGGTGGTAAAGAGCACATATGGGGTGGATTCTTTAAAAGACCTGAAAAAACAAAAAACAAAAAACAAAATAGAGAGGAAGGTTATAACAAAAAACAAAATAGCACATATGTTTAGTCCGTGGAAGGTTATAACAAAAAAATACGATTTCTTTGGAAAAACGTTAGTTTTAAACTAGGTGATGACAATGACCTTTTTTGGGAAGATAGTTGGCATGTTAATCAGGCCCCAGAAGATGTTGTTACTTCCTTTTATAGCTTTTTATTTATTTATTTATTTTCAGAAACAATTTTATCGATGAATAAAATAATCCAAAAGGATATACAAAACTGTCCTATCAAAGGGAATACAAAAGTCTGTTCCAATTTGCGACAAGATAATTTAGGCTATGATCTTTAAGGGGATGTAGGTGTTTATACCAAAAAAGAGCATTAAAAATAGAAGAATCAAGGATGTCTATTTGGCCTTGGAAGAAAATAGAAGGCTTTTGGAAAAAGAGATTTGCTCTTAGATTTGGGAAGGTTTAGTACTTAAAAAGGTTAAATTCTTCCTTTCAACGGTTGCTCATGGTAGCATTAATACTTGTGATAAGGTGCAAAGAAGATATCCCCACCTTAACATTTTTCCTAATACGTGCACCATCTGCAGGAGTGAGGGATCACTTCCTCACTTTTTTCTTCACTTGCAGCTTCAGCAAAAAAGTGTGGAACTATTTTGGTAACTTATTCGGTCTGAAATAGTGTTACCCTCTGGTGTTAGATCAGGGGCTGGTGGAGTTTTAATGTTGTGGACGGCAAAATTATGTAATATGTATTAGTTCCTTTACTTATTGTGCCCTGTTTCACGGTTATACTCTGTTTTGGCAGTTTATTTTTCGTTCCAGTTTTTGTTCTTTTATTTAGGAGGCTCATTTGTTTTTAACGTGAGGCCCACCTCCTCCGTATAACTAGTAGCCTTTTTCTATGAAGGATCGAGTAAATTTATTTTCTTGGTTTACACCACAAAGGCTATTTTGTGGCTTATATGGGAGGAAAGAAACGTTAGAATTTTCTTCCATTATTTTTGTGGCTTTGTACGATTCACGGCTCCCAATTGAATTTTGATGTTTCTCGTTGGATGTGGGTGGTGAGTCCTTTTGTAATTATTCACTAGGTCACATTTTATTAGATTGGGGTCCCTTTCTCTAGTGTGCTCCCTTTTTGTGGGCTGTCTTTTTTCATTTTTTTAAATTCTTTTGGTTATTGTTTTTTAAAATTTTTTATTTTTATGGCCTTGTATTCTTTCCTTTTTTTCTCTATGAAAGTTGGTCTTCGAATAGAAAAACAAAAACGAAGAACACAGAGTTCTTTTGTATTAACTACACCTTGCTCATTAATTGTCTCTTTTTCAGAAGACTGATATACCCTTAATATCCCAATAGCAATCCTAATTCTATTCCAGTAGCACTCCTCGCATGTATCTAGTGTATTGGTTGGATGCTTCATTTTTTGTGTTGTATGTGCCAAATATCTGCAATGTGTTTGAGGGTTTAATGCAAGAGTTTGATGTGATTATGAAGTTGGATGTGAATCGGCCAACAATTTTAACATTTTCCAGTAGATGTGGAGAGTAATGTGTTTCCTAGTGAGAATTTTTTTTTGGGACTTGGTCAAAATTTAAATATATGGTTGTGCTTGATGAATAGTA
TGAATGGAGGGGGAAAAACTACACAAAACTGAAGTATTTTTCACTTTGCTAACTATATGAATACGGTAGAGATTTTAAATTTGTTTCTGGTGGCCGCTTAACTGATGTTAATGAGAACTTTGTAATGCAGGGCCCACTTTCAACAACATTTCCGATTGAAAACCGAAATGCTCCAGTGACCATGAAGGCATTGAAGAATCACCTGGACCGCTCAAAAGGGCTTCCGTTTGTGAAGCGAATTTCTGACTTTCATTTGCTGCTGTTACTAGCCAGGGTGTTGGATGTGAGCTCTGACGTTCCTGCACTGGCCGAGTGCGTTCAGACCCAGACAGCCATACCTGAAGGTTACAAAATATTAATCGAGTCTATGGCAAGTGCTGCTTGATATAAATAGAGTGTCAACTGGGACTGGTTACAGGAAGTTGTTCTAGTTTGTCAATTACTTTTCATCGTCTCTCTTGACAATTTAGAGACTGTAGTTATATTTTAATATATTATCATCTGAAAATGGTGTTATTTTAACTTATGGCATAACAAAATGGCTTCTTTGTGTCTTGGGGAAAAAGGCCACCTGAGAACAAATTACAGGGCATGACACAATTAAACGAAAACTAATTGAG

mRNA sequence

CCGCCGTCTCAACGGAGATTGTCTATTTCCTGATTCCCAAGCCACCGCGGTATACTTCTTATTCGATATAAATACAATCCCTTGCGTCTACTCCCTGAACAAGAAATTCGAAGAAGCATCATACCCTTCTAACTTCTGTTCTCTAAATTTCCGGCAGCATTTTCTGAATCCCACCGTCGACTTGCAACAGCTGTTTATCCCTCAAATCCGTCTTTAAATAACGATCAATCTCCTCATATAGAAGAGAATCTCATTACAACTTCCGCCGACACCAATATGATGCTCAGAATTCGCAGTAGAGATGGCCTGGAGCGAGTCGCTGTAGACAACCCACACATCACAATCGCTCAACTCAAAGCCATAATTCAATCCCAGCTCAATATCCCAATCCACAACCAAACCCTCTCAACTAACCAAAATATTCTATTGGCGAAGACTCATGACGATCTTTCCAAATTCACTGACATGTCTAATCCTAATACCCATCTCTCGTCGCTTAATTTGTCTCATGGGTCTATTGTCTTTCTCGCCTACGAGGGCGAGCGCACTGTTGCTGGCCCTACTTTCCATCCCGCCGGATCTTTTGGCCGTAAAATGACTATGGATGATCTAATTGCTAAGCAGATGCGGATCACTCGTCAAGAAAACCCCCATTGTGAATTGGTTTCTTTCGACCGGGATTGCGCCAATGCTTTCCAGCATTATGTTAATGAAACGCTAGCCTTCGCTGTCAAACGTGGGGGGATGATGTACGGAACCGTGTCACCAGAAGGCAAGGTCGAGGTAGATTTCATATATGAGCCGCCACAGCAAGGGACTGAGGACAATTTACTGTTTTTCCGAGATCCCGATGAGGAAAGATTGGTAGAAGCAATTGCAGTTGGGTTGGGGATGAGGAAAGTTGGGTTTATATTCACGCAGACGATTAGTCAGGACAAAAAGGACTACACCTTATCCAACAGGGAAGTACTCCAGGCGGCTCAGTTTCACTCCGAGAGCGAGTTGAAGGAGTGGGTGACAGCAGTTGTGAAGTTGGAGGTGAACGAGGATGGGGGTGCTGATGTTCATTTTGAGGCTTTTCAAATGAGTGACATGTGCATTAGATTGTTCAAGGAAGGTTGGTTTGAGACGGATATTGGAGAGGATTTTGATCCCAAGCTCTCGAAGATGAGGAGGGACGTTGTTGTTGGTGTCAAAGACACTAGAGATGTTGACAATGACTTTTTCCTGGTCGTAGTTAAGATTTTCGACCATCAGGGCCCACTTTCAACAACATTTCCGATTGAAAACCGAAATGCTCCAGTGACCATGAAGGCATTGAAGAATCACCTGGACCGCTCAAAAGGGCTTCCGTTTGTGAAGCGAATTTCTGACTTTCATTTGCTGCTGTTACTAGCCAGGGTGTTGGATGTGAGCTCTGACGTTCCTGCACTGGCCGAGTGCGTTCAGACCCAGACAGCCATACCTGAAGGTTACAAAATATTAATCGAGTCTATGGCAAGTGCTGCTTGATATAAATAGAGTGTCAACTGGGACTGGTTACAGGAAGTTGTTCTAGTTTGTCAATTACTTTTCATCGTCTCTCTTGACAATTTAGAGACTGTAGTTATATTTTAATATATTATCATCTGAAAATGGTGTTATTTTAACTTATGGCATAACAAAATGGCTTCTTTGTGTCTTGGGGAAAAAGGCCACCTGAGAACAAATTACAGGGCATGACACAATTAAACGAAAACTAATTGAG

Coding sequence (CDS)

ATGATGCTCAGAATTCGCAGTAGAGATGGCCTGGAGCGAGTCGCTGTAGACAACCCACACATCACAATCGCTCAACTCAAAGCCATAATTCAATCCCAGCTCAATATCCCAATCCACAACCAAACCCTCTCAACTAACCAAAATATTCTATTGGCGAAGACTCATGACGATCTTTCCAAATTCACTGACATGTCTAATCCTAATACCCATCTCTCGTCGCTTAATTTGTCTCATGGGTCTATTGTCTTTCTCGCCTACGAGGGCGAGCGCACTGTTGCTGGCCCTACTTTCCATCCCGCCGGATCTTTTGGCCGTAAAATGACTATGGATGATCTAATTGCTAAGCAGATGCGGATCACTCGTCAAGAAAACCCCCATTGTGAATTGGTTTCTTTCGACCGGGATTGCGCCAATGCTTTCCAGCATTATGTTAATGAAACGCTAGCCTTCGCTGTCAAACGTGGGGGGATGATGTACGGAACCGTGTCACCAGAAGGCAAGGTCGAGGTAGATTTCATATATGAGCCGCCACAGCAAGGGACTGAGGACAATTTACTGTTTTTCCGAGATCCCGATGAGGAAAGATTGGTAGAAGCAATTGCAGTTGGGTTGGGGATGAGGAAAGTTGGGTTTATATTCACGCAGACGATTAGTCAGGACAAAAAGGACTACACCTTATCCAACAGGGAAGTACTCCAGGCGGCTCAGTTTCACTCCGAGAGCGAGTTGAAGGAGTGGGTGACAGCAGTTGTGAAGTTGGAGGTGAACGAGGATGGGGGTGCTGATGTTCATTTTGAGGCTTTTCAAATGAGTGACATGTGCATTAGATTGTTCAAGGAAGGTTGGTTTGAGACGGATATTGGAGAGGATTTTGATCCCAAGCTCTCGAAGATGAGGAGGGACGTTGTTGTTGGTGTCAAAGACACTAGAGATGTTGACAATGACTTTTTCCTGGTCGTAGTTAAGATTTTCGACCATCAGGGCCCACTTTCAACAACATTTCCGATTGAAAACCGAAATGCTCCAGTGACCATGAAGGCATTGAAGAATCACCTGGACCGCTCAAAAGGGCTTCCGTTTGTGAAGCGAATTTCTGACTTTCATTTGCTGCTGTTACTAGCCAGGGTGTTGGATGTGAGCTCTGACGTTCCTGCACTGGCCGAGTGCGTTCAGACCCAGACAGCCATACCTGAAGGTTACAAAATATTAATCGAGTCTATGGCAAGTGCTGCTTGA

Protein sequence

MMLRIRSRDGLERVAVDNPHITIAQLKAIIQSQLNIPIHNQTLSTNQNILLAKTHDDLSKFTDMSNPNTHLSSLNLSHGSIVFLAYEGERTVAGPTFHPAGSFGRKMTMDDLIAKQMRITRQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQGTEDNLLFFRDPDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHSESELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPKLSKMRRDVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRNAPVTMKALKNHLDRSKGLPFVKRISDFHLLLLLARVLDVSSDVPALAECVQTQTAIPEGYKILIESMASAA
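
The protein sequence above is expected to be the translation of the coding sequence. The following is a minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1), which is the usual choice for a nuclear gene. The function name `translate` is hypothetical, and the two inputs are only the first and last few codons of the CDS shown on this page.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        # Codons containing N or other ambiguity codes become 'X'.
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

print(translate("ATGATGCTCAGAATTCGCAGTAGAGATGGC"))  # MMLRIRSRDG
print(translate("ATGGCAAGTGCTGCTTGA"))              # MASAA
```

The two outputs match the start (`MMLRIRSRDG`) and the end (`MASAA`) of the protein sequence listed above.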
BLAST of ClCG01G000050 vs. Swiss-Prot
Match: NPL41_ARATH (NPL4-like protein 1 OS=Arabidopsis thaliana GN=At3g63000 PE=1 SV=1)

HSP 1 Score: 632.9 bits (1631), Expect = 2.5e-180
Identity = 302/411 (73.48%), Positives = 361/411 (87.83%), Query Frame = 1

Query: 2   MLRIRSRDGLERVAVDNPHITIAQLKAIIQSQLNIPIHNQTLSTNQNILLAKTHDDLSKF 61
           MLR+RSRDGLERV+VD PHIT++QLK +IQ QL IPIHNQTLSTN+N+LLAK+  D   F
Sbjct: 3   MLRVRSRDGLERVSVDGPHITVSQLKTLIQDQLQIPIHNQTLSTNRNLLLAKSPSDFLAF 62

Query: 62  TDMSNPNTHLSSLNLSHGSIVFLAYEGERTV-AGPTFHPAGSFGRKMTMDDLIAKQMRIT 121
           TDM++PN  +SSLNL+HGS+V+LAYEGERT+  GP   PAGSFGRKMT++DLIA+QMR+ 
Sbjct: 63  TDMADPNLRISSLNLAHGSMVYLAYEGERTIRGGPAVTPAGSFGRKMTVEDLIARQMRVG 122

Query: 122 RQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQG 181
           RQE  HC+ VSFDRDCANAFQH+VNE+LAFAVKRGG MYG VS +G+VEV+FIYEPPQQG
Sbjct: 123 RQEKAHCDSVSFDRDCANAFQHFVNESLAFAVKRGGFMYGNVSEDGQVEVNFIYEPPQQG 182

Query: 182 TEDNLLFFRDPDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHSE 241
            EDNL+  RD +EE+ V+AIA+GLGMR+VGFIF QT++QDKK+YTLSN EVL AAQ H+E
Sbjct: 183 MEDNLILMRDSEEEKRVDAIALGLGMRRVGFIFNQTVTQDKKEYTLSNVEVLLAAQLHAE 242

Query: 242 SELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPKLSKMRR 301
           SELKEWVTAVVKLE+NEDGGADVHFE FQMSDMC+RLFKEGWFET+IG + DPKLSK+++
Sbjct: 243 SELKEWVTAVVKLEINEDGGADVHFEPFQMSDMCVRLFKEGWFETEIGPEDDPKLSKLKK 302

Query: 302 DVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRNAPVTMKALKNHLDRSKGLPF 361
           +VVVGVKD ++VDNDFFLV+VKI DHQGPLS TFPIENRN   TM+ALK H++R++ LPF
Sbjct: 303 EVVVGVKDVKEVDNDFFLVLVKILDHQGPLSCTFPIENRNTQTTMRALKTHMERARSLPF 362

Query: 362 VKRISDFHLLLLLARVLDVSSDVPALAECVQTQTAIPEGYKILIESMASAA 412
           VKRISDFHLLL +A+ LDVSSDVPALAECV+ Q+ +PEGY++LI+SMA+ +
Sbjct: 363 VKRISDFHLLLFVAQFLDVSSDVPALAECVRLQSHVPEGYELLIDSMANTS 413
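
The identity and positives figures in an HSP header are ratios over the alignment length. They can be recomputed from the middle line of each alignment row, where a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The following is a minimal sketch with hypothetical helper names (`midline_counts`, `percent`); it is not how the BLAST program itself is implemented.

```python
def midline_counts(midline):
    """Count identities and positives in a BLAST alignment midline."""
    identities = sum(ch.isalpha() for ch in midline)
    # Positives include identities plus conservative substitutions ('+').
    positives = identities + midline.count("+")
    return identities, positives

def percent(matches, aligned_length):
    """Express a match count as a percentage of the alignment length."""
    return round(100.0 * matches / aligned_length, 2)

# Figures from the first Swiss-Prot hit (NPL41_ARATH).
print(percent(302, 411))  # 73.48
print(percent(361, 411))  # 87.83

# First eight midline columns of the same alignment.
print(midline_counts("MLR+RSRD"))  # (7, 8)
```

Summing the per-row counts over all rows of an HSP should reproduce the totals reported in its header.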

BLAST of ClCG01G000050 vs. Swiss-Prot
Match: NPL42_ARATH (NPL4-like protein 2 OS=Arabidopsis thaliana GN=At2g47970 PE=1 SV=1)

HSP 1 Score: 618.6 bits (1594), Expect = 4.9e-176
Identity = 302/410 (73.66%), Positives = 355/410 (86.59%), Query Frame = 1

Query: 1   MMLRIRSRDGLERVAVDNPHITIAQLKAIIQSQLNIPIHNQTLSTNQNILLAKTHDDLSK 60
           MMLRIRSRDGLERV  +  HIT++QLK +I  QL IP+H QTLSTN+++LLAKT  DL  
Sbjct: 2   MMLRIRSRDGLERVTAEGAHITVSQLKTLIADQLQIPLHKQTLSTNRDLLLAKTPADLLA 61

Query: 61  FTDMSNPNTHLSSLNLSHGSIVFLAYEGERTVAG-PTFHPAGSFGRKMTMDDLIAKQMRI 120
           FTD+++PN  LSSLNL HGS+++LAY+GER++ G P   PAGSFGRKMT+DDLIA+QMR+
Sbjct: 62  FTDLTDPNLPLSSLNLGHGSMLYLAYDGERSIPGAPPVTPAGSFGRKMTVDDLIARQMRV 121

Query: 121 TRQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQ 180
           TRQE  HC+ VSFDRD ANAFQHYVNE+LAFAVKRGG MYGTV+ EG+VEVDFIYEPPQQ
Sbjct: 122 TRQETSHCDSVSFDRDAANAFQHYVNESLAFAVKRGGFMYGTVTEEGQVEVDFIYEPPQQ 181

Query: 181 GTEDNLLFFRDPDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHS 240
           GTE NL+  RD DEE+ V+AIA+GLGMR+VGFIF QT+ QDK +YTLSN EVLQAA+ H+
Sbjct: 182 GTEANLILMRDADEEKRVDAIAMGLGMRRVGFIFNQTVVQDKTEYTLSNAEVLQAAELHA 241

Query: 241 ESELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPKLSKMR 300
           ESELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKE WFET+I  D DPKLSKM+
Sbjct: 242 ESELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEEWFETEIMPDDDPKLSKMK 301

Query: 301 RDVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRNAPVTMKALKNHLDRSKGLP 360
           ++VVVGVKD ++VDNDFFLV+V+I DHQGPLS+TFPIENR++  TM+ALK HLDR+K LP
Sbjct: 302 KEVVVGVKDLKEVDNDFFLVLVRILDHQGPLSSTFPIENRSSRATMRALKTHLDRAKSLP 361

Query: 361 FVKRISDFHLLLLLARVLDVSSDVPALAECVQTQTAIPEGYKILIESMAS 410
            VK++SDFHLLL +A+ LDVSSDVPALAECV+ Q+ +PEGY +LIESMA+
Sbjct: 362 LVKKMSDFHLLLFVAQFLDVSSDVPALAECVRLQSPVPEGYALLIESMAN 411

BLAST of ClCG01G000050 vs. Swiss-Prot
Match: NPL4_ORYSJ (NPL4-like protein OS=Oryza sativa subsp. japonica GN=Os01g0377700 PE=2 SV=1)

HSP 1 Score: 457.2 bits (1175), Expect = 1.9e-127
Identity = 239/419 (57.04%), Positives = 304/419 (72.55%), Query Frame = 1

Query: 1   MMLRIRSRDGLERVAVDNPHI-TIAQLKAIIQSQLNIPIHNQTLSTNQNILLAKTHDDLS 60
           M+LRIRSRDG +R+ V +P   T+  L+ +I +++ +P+  Q LS +  +LL  +    +
Sbjct: 1   MILRIRSRDGTDRITVPDPAAATVGDLQRLIAARVTVPVPLQRLSLDPALLLPSS----A 60

Query: 61  KFTDMSNPNTHLSSLNLSHGSIVFLAYEGERTVAGP----TFHPAGSFGRKMTMDDLIAK 120
               +++P   LSSL LS+GS V+L+Y  +   + P        AGSFG+KMTMDDLIA+
Sbjct: 61  SAALLADPAAPLSSLRLSNGSFVYLSYPPDARSSQPPPPKALSAAGSFGKKMTMDDLIAR 120

Query: 121 QMRITRQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGK-VEVDFIY 180
           Q+R+TRQE P C   SFDRD ANAFQ +V E+LAFA KR G +YG V  + K V VDFIY
Sbjct: 121 QIRVTRQEAPLCAAASFDRDSANAFQLHVAESLAFATKRAGFLYGRVDADTKEVFVDFIY 180

Query: 181 EPPQQGTEDNLLFFRDPDEERLVEAIAVGLGMRKVGFIFTQTISQ---DKKDYTLSNREV 240
           EPPQ GTED +   RD  EE  V+AIA GLGMR+VG +FTQ + +   D  +YT+SNREV
Sbjct: 181 EPPQVGTEDVVQLMRDAQEEARVDAIAHGLGMRRVGLVFTQAVGRKTSDTGEYTMSNREV 240

Query: 241 LQAAQFHSESELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDF 300
           LQA +  +E  + EWVTA+VKLEV +DG  DVHFEAFQMS++C++LFK+G  ET+IG+  
Sbjct: 241 LQATELQAEGGIPEWVTAIVKLEVGDDGSGDVHFEAFQMSEICVKLFKDGVLETEIGDKD 300

Query: 301 DPKLSKMRRDVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRNAPVTMKALKNH 360
           DP+LSKMR++VV G KDT +VDNDFFLV VKI DHQGPLST FPIENR  PV M ALK+H
Sbjct: 301 DPRLSKMRKEVVAGGKDTMEVDNDFFLVPVKISDHQGPLSTGFPIENRGNPVAMSALKSH 360

Query: 361 LDRSKGLPFVKRISDFHLLLLLARVLDVSSDVPALAECVQTQTAIPEGYKILIESMASA 411
           LDR+K LPFVKRISDFHLLLL+A  LD+ +DVPAL  CV+ Q+ +PEGY++LIES+A A
Sbjct: 361 LDRAKHLPFVKRISDFHLLLLVAAFLDIKADVPALTACVKNQSVVPEGYQLLIESLAGA 415

BLAST of ClCG01G000050 vs. Swiss-Prot
Match: NPL4_DICDI (Nuclear protein localization protein 4 homolog OS=Dictyostelium discoideum GN=nploc4 PE=3 SV=1)

HSP 1 Score: 128.3 bits (321), Expect = 2.0e-28
Identity = 92/285 (32.28%), Positives = 151/285 (52.98%), Query Frame = 1

Query: 116 QMRITRQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYE 175
           +MR+  Q+NPH      D   AN FQ Y+  +  +  +R G ++G    +G V VD IYE
Sbjct: 282 KMRLKSQDNPHAPGALVDFQSANIFQQYIANS-KYEQQRIGFLFGNFLSDGSVVVDSIYE 341

Query: 176 PPQQGTEDNL-LFFRDPDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQA 235
           PPQ+  +        DP  ++ +E++A  LG+ +VG+IF    S   + YT+S+ E++QA
Sbjct: 342 PPQECKDKQTPTLLPDPLADK-IESMASMLGLTRVGWIF----SHPSRKYTMSSTEIIQA 401

Query: 236 AQFHSESELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPK 295
           A + ++     +VT +  L VN DG +++  EAFQ+SD  ++L K G F   +    DP 
Sbjct: 402 ASYQNKYG-PSFVTLI--LSVNSDGQSNM--EAFQVSDQALKLEKTGEF---LPTQPDPT 461

Query: 296 LSKMRRDVVVGVKDTRDVDNDFFLVVV--KIFDHQGPLSTTFPIENRNAPVTMKALKNHL 355
             K++  V     +T + D  FF+V V  K  + +   + +FP+ENR    T+  L ++ 
Sbjct: 462 KCKLKSPVFEEGTETINADTHFFIVTVPLKAREDKSIFNISFPVENRIPVNTLSDLASYK 521

Query: 356 DRSKGLPFVKRISDFHLLLLLA--RVLDVSSDVPALAECVQTQTA 396
              K +  +K  SDFH L+ L   + LD  SD P + E ++++++
Sbjct: 522 LEHKDVSPLKFFSDFHFLIFLLENQFLDFQSDFPIICENIRSRSS 552

BLAST of ClCG01G000050 vs. Swiss-Prot
Match: NPL4_MOUSE (Nuclear protein localization protein 4 homolog OS=Mus musculus GN=Nploc4 PE=1 SV=3)

HSP 1 Score: 93.2 bits (230), Expect = 7.1e-18
Identity = 92/334 (27.54%), Positives = 149/334 (44.61%), Query Frame = 1

Query: 119 ITRQENPHCELVSFDRDC-ANAFQHYVNETLAFAVKRGGMMYGTVS-----PEG-KVEVD 178
           + RQ+  H + + F+    A+ F  +  +T     +  G +YG  +     P G + EV 
Sbjct: 215 LNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAEVA 274

Query: 179 FIYEPPQQGTEDNLLFFRDPDEERLVEAIAVGLGMRKVGFIFTQTISQD----------- 238
            IYEPPQ GT+++L    DP  E +V+ IA  LG+RKVG+IFT  +S+D           
Sbjct: 275 AIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYSRN 334

Query: 239 KKDYTLSNREVLQAAQFHSESEL-----------KEWVTAVVKLEVNEDGGAD--VHFEA 298
           K  Y LS+ E + A  F ++               ++VTAV        GG D  VHFE 
Sbjct: 335 KDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVA------TGGPDNQVHFEG 394

Query: 299 FQMSDMCIRLFKE------------GWFETDIGEDFDP-----KLSKMRRDVVVGVKDTR 358
           +Q+S+ C+ L ++            G+ +    E + P      + K   ++    +  R
Sbjct: 395 YQVSNQCMALVRDECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEI---TQLAR 454

Query: 359 DVDNDFFLVVVKIFDHQGPLST------TFPIENRNA---PVTMKALKNHLDRSKGLPFV 394
            +  ++ ++ +     + P+ T       FPIENR+         +L  +L ++    F+
Sbjct: 455 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQDFHSLATYLSQNTSSVFL 514

BLAST of ClCG01G000050 vs. TrEMBL
Match: M5VVX8_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa006427mg PE=4 SV=1)

HSP 1 Score: 717.2 bits (1850), Expect = 1.1e-203
Identity = 347/412 (84.22%), Positives = 390/412 (94.66%), Query Frame = 1

Query: 1   MMLRIRSRDGLERVAVDNP-HITIAQLKAIIQSQLNIPIHNQTLSTNQNILLAKTHDDLS 60
           MMLR+RSRDGLERV VDNP H T++QLKA+IQ+QL IP  NQT+STNQN+LLAKTHDD+S
Sbjct: 1   MMLRVRSRDGLERVTVDNPQHTTVSQLKALIQTQLRIPFQNQTISTNQNLLLAKTHDDIS 60

Query: 61  KFTDMSNPNTHLSSLNLSHGSIVFLAYEGERTVAGPTFHPAGSFGRKMTMDDLIAKQMRI 120
           +FTDM+NPNT LS+LNLSHGSIV+LAY+GERTVAGPTFHPAGSFGRKMTMDDLIAKQM++
Sbjct: 61  RFTDMANPNTPLSALNLSHGSIVYLAYDGERTVAGPTFHPAGSFGRKMTMDDLIAKQMKV 120

Query: 121 TRQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQ 180
           TRQENPH ELVSFDRDCANAFQHYVN+TLAFAVKRGG MYGTVS EGKVEVDFIYEPPQQ
Sbjct: 121 TRQENPHSELVSFDRDCANAFQHYVNDTLAFAVKRGGFMYGTVSEEGKVEVDFIYEPPQQ 180

Query: 181 GTEDNLLFFRDPDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHS 240
           GTE NL+FFRDPDEE+ VEAIA+GLGMR+VGFIFTQT+SQDKKDYTLSNREVLQA++FH+
Sbjct: 181 GTEANLVFFRDPDEEKSVEAIAMGLGMRRVGFIFTQTVSQDKKDYTLSNREVLQASEFHA 240

Query: 241 ESELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPKLSKMR 300
           ES LKEWVTA+VKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFET+I E  DPKLSKM+
Sbjct: 241 ESGLKEWVTAMVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETEIEEGHDPKLSKMK 300

Query: 301 RDVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRNAPVTMKALKNHLDRSKGLP 360
           +DVVVGVKDTR+VDNDFFLVVVKIFDHQGPLS++FPIENRN PVT+KALKNHLDR+K LP
Sbjct: 301 KDVVVGVKDTREVDNDFFLVVVKIFDHQGPLSSSFPIENRNTPVTLKALKNHLDRAKSLP 360

Query: 361 FVKRISDFHLLLLLARVLDVSSDVPALAECVQTQTAIPEGYKILIESMASAA 412
           FVKRISDFHL+LLLAR LDV++D+PALA CV T++ IPEGY++LIESMA+A+
Sbjct: 361 FVKRISDFHLMLLLARFLDVAADIPALAVCVHTESPIPEGYQLLIESMANAS 412

BLAST of ClCG01G000050 vs. TrEMBL
Match: W9SJC4_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_020438 PE=4 SV=1)

HSP 1 Score: 700.3 bits (1806), Expect = 1.4e-198
Identity = 347/411 (84.43%), Positives = 375/411 (91.24%), Query Frame = 1

Query: 1   MMLRIRSRDGLERVAVDNPHITIAQLKAIIQSQLNIPIHNQTLSTNQNILLAKTHDDLSK 60
           MMLRIRSRDGLERV VDNPH T+AQLKA+IQSQL IP HNQTLSTNQN+LLAKT  D S+
Sbjct: 1   MMLRIRSRDGLERVTVDNPHATVAQLKALIQSQLQIPFHNQTLSTNQNLLLAKTAGDFSR 60

Query: 61  FTDMSNPNTHLSSLNLSHGSIVFLAYEGERTVAG-PTFHPAGSFGRKMTMDDLIAKQMRI 120
           F DM++PN  LSSLNLSHGSIVFLAY+GERTVAG P FHPAGSFGRKMTMDDLIAKQ R+
Sbjct: 61  FADMADPNATLSSLNLSHGSIVFLAYDGERTVAGCPAFHPAGSFGRKMTMDDLIAKQTRV 120

Query: 121 TRQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQ 180
           +RQE  HCE VSFDRDCANAFQHYVNE LAFAVKRGG MYGTVS EGKVEVDFIYEPPQQ
Sbjct: 121 SRQETSHCESVSFDRDCANAFQHYVNEALAFAVKRGGFMYGTVSEEGKVEVDFIYEPPQQ 180

Query: 181 GTEDNLLFFRDPDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHS 240
           GTE+NLL FRDPDEE+LVEAIA+GLGMRKVGFIFTQTISQDKKDYTLSNREVLQAA+ H+
Sbjct: 181 GTEENLLLFRDPDEEKLVEAIALGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAELHA 240

Query: 241 ESELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPKLSKMR 300
           ES LKEWVTAVVKLEVN+DG ADVHFEAFQMSDMCIRLFKEGWF T+IGED DPKLSKM+
Sbjct: 241 ESGLKEWVTAVVKLEVNDDGSADVHFEAFQMSDMCIRLFKEGWFVTEIGEDDDPKLSKMK 300

Query: 301 RDVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRNAPVTMKALKNHLDRSKGLP 360
            DVVVGVKDT++VDNDFFLVVVKI DHQGPLS+TFPIENRN  VTMKALK+HLDR+K LP
Sbjct: 301 NDVVVGVKDTKEVDNDFFLVVVKILDHQGPLSSTFPIENRNTGVTMKALKSHLDRAKNLP 360

Query: 361 FVKRISDFHLLLLLARVLDVSSDVPALAECVQTQTAIPEGYKILIESMASA 411
           FVKRISDFHLLLLLAR LD+ +DVPAL +CV  QT +PEGYKILIES+A+A
Sbjct: 361 FVKRISDFHLLLLLARFLDLGADVPALTDCVHKQTEVPEGYKILIESLANA 411

BLAST of ClCG01G000050 vs. TrEMBL
Match: I1JQF4_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_03G207900 PE=4 SV=1)

HSP 1 Score: 694.5 bits (1791), Expect = 7.8e-197
Identity = 342/413 (82.81%), Positives = 383/413 (92.74%), Query Frame = 1

Query: 1   MMLRIRSRDGLERVAVDNPH-ITIAQLKAIIQSQLNIPIHNQTLSTNQNILLAKTHDDLS 60
           MMLRIRSRDGLERV+V+NPH  T++ LK II++QL IP+HNQTLSTNQN+LLAK+ +DL 
Sbjct: 1   MMLRIRSRDGLERVSVENPHSTTVSYLKRIIETQLGIPVHNQTLSTNQNLLLAKSLEDLH 60

Query: 61  KFTDMSNPNTHLSSLNLSHGSIVFLAYEGERTVAGPTFHPAGSFGRKMTMDDLIAKQMRI 120
           +FTDMSNP+  LSSLNL HGS+VFLAYEGER VAGP F+PAGSFGRKMTMDDLIAKQMR+
Sbjct: 61  RFTDMSNPDAPLSSLNLGHGSMVFLAYEGERRVAGPAFNPAGSFGRKMTMDDLIAKQMRV 120

Query: 121 TRQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQ 180
           TRQENPHCELVSFDRDCANAFQHYVN+TLAFAVKRGG MYGTVS EGKVEVDFIYEPPQQ
Sbjct: 121 TRQENPHCELVSFDRDCANAFQHYVNDTLAFAVKRGGFMYGTVSEEGKVEVDFIYEPPQQ 180

Query: 181 GTEDNLLFFRDPDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHS 240
           G+EDNL+FFRD +EE+ VEAIAVGLGM KVGFIFTQTI+QDKKDYTLSNREVLQA ++H+
Sbjct: 181 GSEDNLVFFRDTEEEKFVEAIAVGLGMTKVGFIFTQTITQDKKDYTLSNREVLQAVEYHA 240

Query: 241 ESELKEWVTAVVKLEVNED-GGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPKLSKM 300
           ES LKEWVTAVVKLE NE+ GGADVHFEAFQMSD+C+RLFKEGWFET+I ED DPKLSKM
Sbjct: 241 ESGLKEWVTAVVKLEANEEMGGADVHFEAFQMSDVCVRLFKEGWFETEIKEDDDPKLSKM 300

Query: 301 RRDVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRNAPVTMKALKNHLDRSKGL 360
           ++DVVVGVKDTR+VDNDFFLVVVKI DHQGPLS++FPIENRN  V MKALKNHLDR+K L
Sbjct: 301 KKDVVVGVKDTREVDNDFFLVVVKIADHQGPLSSSFPIENRNTQVPMKALKNHLDRTKNL 360

Query: 361 PFVKRISDFHLLLLLARVLDVSSDVPALAECVQTQTAIPEGYKILIESMASAA 412
           PFVKRISDFHLLL+LARVLD+++DVPAL ECVQTQT++PEGY+ILIESMAS A
Sbjct: 361 PFVKRISDFHLLLVLARVLDLNADVPALTECVQTQTSVPEGYQILIESMASTA 413

BLAST of ClCG01G000050 vs. TrEMBL
Match: I1NB01_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_19G205200 PE=4 SV=1)

HSP 1 Score: 694.1 bits (1790), Expect = 1.0e-196
Identity = 344/413 (83.29%), Positives = 383/413 (92.74%), Query Frame = 1

Query: 1   MMLRIRSRDGLERVAVDNP-HITIAQLKAIIQSQLNIPIHNQTLSTNQNILLAKTHDDLS 60
           MMLRIRSRDGLERV+V+NP   T++ LK IIQ+QL IP+HNQTLSTNQN+LLAK+ +DL 
Sbjct: 1   MMLRIRSRDGLERVSVENPLATTVSDLKRIIQTQLGIPVHNQTLSTNQNLLLAKSLEDLL 60

Query: 61  KFTDMSNPNTHLSSLNLSHGSIVFLAYEGERTVAGPTFHPAGSFGRKMTMDDLIAKQMRI 120
           +FTDMSN +  LSSLNL HGSIVFLAYEGER VAGP F+PAGSFGRKMTMDDLIAKQMR+
Sbjct: 61  RFTDMSNLDASLSSLNLGHGSIVFLAYEGERRVAGPAFNPAGSFGRKMTMDDLIAKQMRV 120

Query: 121 TRQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQ 180
           TRQENPHCELVSFDRDCANAFQHYVN+TLAFAVKRGG MYGTVS  GKVEVDFIYEPPQQ
Sbjct: 121 TRQENPHCELVSFDRDCANAFQHYVNDTLAFAVKRGGFMYGTVSEVGKVEVDFIYEPPQQ 180

Query: 181 GTEDNLLFFRDPDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHS 240
           G+EDNL+FFRDP+EE+ VEAIAVGLGMRKVGFIFTQTI+QDKKDYTLSNREVLQA ++H+
Sbjct: 181 GSEDNLVFFRDPEEEKFVEAIAVGLGMRKVGFIFTQTITQDKKDYTLSNREVLQAVEYHA 240

Query: 241 ESELKEWVTAVVKLEVNED-GGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPKLSKM 300
           ES LKEW TAVVKLEVNE+ GGADVHFEAFQMSD+C+RLFKEGWFET+I ED DPKLSKM
Sbjct: 241 ESGLKEWATAVVKLEVNEEMGGADVHFEAFQMSDVCVRLFKEGWFETEIKEDDDPKLSKM 300

Query: 301 RRDVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRNAPVTMKALKNHLDRSKGL 360
           ++DVVVGVKDTR+VDNDFFLVVVKI DHQGPLS++FPIENRN  VTMKALKNHL+RSK L
Sbjct: 301 KKDVVVGVKDTREVDNDFFLVVVKIADHQGPLSSSFPIENRNTQVTMKALKNHLERSKNL 360

Query: 361 PFVKRISDFHLLLLLARVLDVSSDVPALAECVQTQTAIPEGYKILIESMASAA 412
           PFVKRISDFHLLL+LARVLD+++DVPAL ECVQTQT++PEGY+ILIESMAS A
Sbjct: 361 PFVKRISDFHLLLVLARVLDLNADVPALTECVQTQTSVPEGYQILIESMASTA 413

BLAST of ClCG01G000050 vs. TrEMBL
Match: V7D1G5_PHAVU (Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_001G201100g PE=4 SV=1)

HSP 1 Score: 693.3 bits (1788), Expect = 1.7e-196
Identity = 338/412 (82.04%), Positives = 382/412 (92.72%), Query Frame = 1

Query: 1   MMLRIRSRDGLERVAVDNPHIT-IAQLKAIIQSQLNIPIHNQTLSTNQNILLAKTHDDLS 60
           MMLR+RSRDGLERV V+NPH T +  LK +I++QL IP+ NQTLSTNQN+LLAK+ +DL 
Sbjct: 1   MMLRLRSRDGLERVMVENPHATTVLGLKRLIEAQLRIPVQNQTLSTNQNLLLAKSREDLH 60

Query: 61  KFTDMSNPNTHLSSLNLSHGSIVFLAYEGERTVAGPTFHPAGSFGRKMTMDDLIAKQMRI 120
           +FTDM+NP+  LSSLNL+HGSIVFL YEGER VAGP F+PAGSFGRKMTMDDLIAKQMR+
Sbjct: 61  RFTDMANPDVTLSSLNLAHGSIVFLTYEGERHVAGPAFNPAGSFGRKMTMDDLIAKQMRV 120

Query: 121 TRQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQ 180
           TRQENPHCELVSFDRDCANAFQHYVN+TLAFAVKRGG MYG VS EGKVEVDFIYEPPQQ
Sbjct: 121 TRQENPHCELVSFDRDCANAFQHYVNDTLAFAVKRGGFMYGNVSEEGKVEVDFIYEPPQQ 180

Query: 181 GTEDNLLFFRDPDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHS 240
           G+E+NLLFFRDP+EE+LVEAIA GLGMR+VGFIFTQTISQDKKDYTLSNREVLQAA++H+
Sbjct: 181 GSEENLLFFRDPEEEKLVEAIAAGLGMRRVGFIFTQTISQDKKDYTLSNREVLQAAEYHA 240

Query: 241 ESELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPKLSKMR 300
           ES LKEWVTAVVKLEVNED  ADVHFEAFQ+SD+C+RLFKEGWFET+I ED DPKLSKM+
Sbjct: 241 ESGLKEWVTAVVKLEVNEDMSADVHFEAFQISDVCVRLFKEGWFETEIKEDDDPKLSKMK 300

Query: 301 RDVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRNAPVTMKALKNHLDRSKGLP 360
           +DVVVGVKDT++VDNDFFLVVVKI DHQGPLS+TFP+ENRN  +T+KALKNHLDR+K LP
Sbjct: 301 KDVVVGVKDTKEVDNDFFLVVVKISDHQGPLSSTFPVENRNTQMTVKALKNHLDRTKSLP 360

Query: 361 FVKRISDFHLLLLLARVLDVSSDVPALAECVQTQTAIPEGYKILIESMASAA 412
           FVKRISDFHLLL+LARVLDV++DVPAL ECVQTQT++PEGY+ILIESMAS A
Sbjct: 361 FVKRISDFHLLLVLARVLDVNADVPALTECVQTQTSVPEGYQILIESMASTA 412

BLAST of ClCG01G000050 vs. TAIR10
Match: AT3G63000.1 (AT3G63000.1 NPL4-like protein 1)

HSP 1 Score: 632.9 bits (1631), Expect = 1.4e-181
Identity = 302/411 (73.48%), Positives = 361/411 (87.83%), Query Frame = 1

Query: 2   MLRIRSRDGLERVAVDNPHITIAQLKAIIQSQLNIPIHNQTLSTNQNILLAKTHDDLSKF 61
           MLR+RSRDGLERV+VD PHIT++QLK +IQ QL IPIHNQTLSTN+N+LLAK+  D   F
Sbjct: 3   MLRVRSRDGLERVSVDGPHITVSQLKTLIQDQLQIPIHNQTLSTNRNLLLAKSPSDFLAF 62

Query: 62  TDMSNPNTHLSSLNLSHGSIVFLAYEGERTV-AGPTFHPAGSFGRKMTMDDLIAKQMRIT 121
           TDM++PN  +SSLNL+HGS+V+LAYEGERT+  GP   PAGSFGRKMT++DLIA+QMR+ 
Sbjct: 63  TDMADPNLRISSLNLAHGSMVYLAYEGERTIRGGPAVTPAGSFGRKMTVEDLIARQMRVG 122

Query: 122 RQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQG 181
           RQE  HC+ VSFDRDCANAFQH+VNE+LAFAVKRGG MYG VS +G+VEV+FIYEPPQQG
Sbjct: 123 RQEKAHCDSVSFDRDCANAFQHFVNESLAFAVKRGGFMYGNVSEDGQVEVNFIYEPPQQG 182

Query: 182 TEDNLLFFRDPDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHSE 241
            EDNL+  RD +EE+ V+AIA+GLGMR+VGFIF QT++QDKK+YTLSN EVL AAQ H+E
Sbjct: 183 MEDNLILMRDSEEEKRVDAIALGLGMRRVGFIFNQTVTQDKKEYTLSNVEVLLAAQLHAE 242

Query: 242 SELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPKLSKMRR 301
           SELKEWVTAVVKLE+NEDGGADVHFE FQMSDMC+RLFKEGWFET+IG + DPKLSK+++
Sbjct: 243 SELKEWVTAVVKLEINEDGGADVHFEPFQMSDMCVRLFKEGWFETEIGPEDDPKLSKLKK 302

Query: 302 DVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRNAPVTMKALKNHLDRSKGLPF 361
           +VVVGVKD ++VDNDFFLV+VKI DHQGPLS TFPIENRN   TM+ALK H++R++ LPF
Sbjct: 303 EVVVGVKDVKEVDNDFFLVLVKILDHQGPLSCTFPIENRNTQTTMRALKTHMERARSLPF 362

Query: 362 VKRISDFHLLLLLARVLDVSSDVPALAECVQTQTAIPEGYKILIESMASAA 412
           VKRISDFHLLL +A+ LDVSSDVPALAECV+ Q+ +PEGY++LI+SMA+ +
Sbjct: 363 VKRISDFHLLLFVAQFLDVSSDVPALAECVRLQSHVPEGYELLIDSMANTS 413

BLAST of ClCG01G000050 vs. TAIR10
Match: AT2G47970.1 (AT2G47970.1 Nuclear pore localisation protein NPL4)

HSP 1 Score: 618.6 bits (1594), Expect = 2.7e-177
Identity = 302/410 (73.66%), Positives = 355/410 (86.59%), Query Frame = 1

Query: 1   MMLRIRSRDGLERVAVDNPHITIAQLKAIIQSQLNIPIHNQTLSTNQNILLAKTHDDLSK 60
           MMLRIRSRDGLERV  +  HIT++QLK +I  QL IP+H QTLSTN+++LLAKT  DL  
Sbjct: 2   MMLRIRSRDGLERVTAEGAHITVSQLKTLIADQLQIPLHKQTLSTNRDLLLAKTPADLLA 61

Query: 61  FTDMSNPNTHLSSLNLSHGSIVFLAYEGERTVAG-PTFHPAGSFGRKMTMDDLIAKQMRI 120
           FTD+++PN  LSSLNL HGS+++LAY+GER++ G P   PAGSFGRKMT+DDLIA+QMR+
Sbjct: 62  FTDLTDPNLPLSSLNLGHGSMLYLAYDGERSIPGAPPVTPAGSFGRKMTVDDLIARQMRV 121

Query: 121 TRQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQ 180
           TRQE  HC+ VSFDRD ANAFQHYVNE+LAFAVKRGG MYGTV+ EG+VEVDFIYEPPQQ
Sbjct: 122 TRQETSHCDSVSFDRDAANAFQHYVNESLAFAVKRGGFMYGTVTEEGQVEVDFIYEPPQQ 181

Query: 181 GTEDNLLFFRDPDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHS 240
           GTE NL+  RD DEE+ V+AIA+GLGMR+VGFIF QT+ QDK +YTLSN EVLQAA+ H+
Sbjct: 182 GTEANLILMRDADEEKRVDAIAMGLGMRRVGFIFNQTVVQDKTEYTLSNAEVLQAAELHA 241

Query: 241 ESELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPKLSKMR 300
           ESELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKE WFET+I  D DPKLSKM+
Sbjct: 242 ESELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEEWFETEIMPDDDPKLSKMK 301

Query: 301 RDVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRNAPVTMKALKNHLDRSKGLP 360
           ++VVVGVKD ++VDNDFFLV+V+I DHQGPLS+TFPIENR++  TM+ALK HLDR+K LP
Sbjct: 302 KEVVVGVKDLKEVDNDFFLVLVRILDHQGPLSSTFPIENRSSRATMRALKTHLDRAKSLP 361

Query: 361 FVKRISDFHLLLLLARVLDVSSDVPALAECVQTQTAIPEGYKILIESMAS 410
            VK++SDFHLLL +A+ LDVSSDVPALAECV+ Q+ +PEGY +LIESMA+
Sbjct: 362 LVKKMSDFHLLLFVAQFLDVSSDVPALAECVRLQSPVPEGYALLIESMAN 411
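
The Identity and Positives figures in an HSP header can be recomputed from the alignment midline: a letter marks an identical residue, a "+" marks a conservative substitution, and a space marks a mismatch or gap column. The Python sketch below illustrates this on the final block (query residues 361..410) of the AT2G47970.1 alignment. The helper names `pct` and `hsp_stats` are hypothetical and not part of any BLAST tool, and the midline is assumed to have its label padding already stripped.

```python
def pct(part: int, whole: int) -> float:
    """Percentage rounded to two decimals, as printed in the HSP headers."""
    return round(100.0 * part / whole, 2)


def hsp_stats(midline: str) -> dict:
    """Count identities and positives from a BLAST midline string.

    Assumption: the string covers exactly the aligned columns, with the
    leading label padding removed.
    """
    length = len(midline)
    # Letters in the midline are identical residues.
    identities = sum(1 for ch in midline if ch.isalpha())
    # Positives are identities plus conservative substitutions ("+").
    positives = identities + midline.count("+")
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "identity_pct": pct(identities, length),
        "positives_pct": pct(positives, length),
    }


# Midline for query residues 361..410 of the AT2G47970.1 alignment.
MIDLINE = " VK++SDFHLLL +A+ LDVSSDVPALAECV+ Q+ +PEGY +LIESMA+"

block = hsp_stats(MIDLINE)
print(block)

# Header arithmetic for the whole HSP: 302 identities and 355 positives
# over 410 aligned columns.
print(pct(302, 410), pct(355, 410))  # 73.66 86.59
```

Summing the per-block counts over all blocks of an HSP should reproduce the header totals, which is a quick way to check that an alignment was copied without losing columns.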

BLAST of ClCG01G000050 vs. NCBI nr
Match: gi|659074543|ref|XP_008437660.1| (PREDICTED: NPL4-like protein 1 [Cucumis melo])

HSP 1 Score: 797.0 bits (2057), Expect = 1.6e-227
Identity = 399/411 (97.08%), Positives = 405/411 (98.54%), Query Frame = 1

Query: 1   MMLRIRSRDGLERVAVDNPHITIAQLKAIIQSQLNIPIHNQTLSTNQNILLAKTHDDLSK 60
           MMLRIRSRDGLERVAV+NPHITIAQLKAIIQSQL IPIHNQTLS NQNILLAKT DDLSK
Sbjct: 1   MMLRIRSRDGLERVAVENPHITIAQLKAIIQSQLKIPIHNQTLSANQNILLAKTQDDLSK 60

Query: 61  FTDMSNPNTHLSSLNLSHGSIVFLAYEGERTVAGPTFHPAGSFGRKMTMDDLIAKQMRIT 120
           FTDMSNPNT+LSSLNLSHGSIVFLAYEGERTVAGPT HPAGSFGRKMTMDDLIAKQMRIT
Sbjct: 61  FTDMSNPNTYLSSLNLSHGSIVFLAYEGERTVAGPTVHPAGSFGRKMTMDDLIAKQMRIT 120

Query: 121 RQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQG 180
           RQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQG
Sbjct: 121 RQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQG 180

Query: 181 TEDNLLFFRDPDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHSE 240
           TEDNLLFFRD DEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHSE
Sbjct: 181 TEDNLLFFRDHDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHSE 240

Query: 241 SELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPKLSKMRR 300
           SELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPKLSKM++
Sbjct: 241 SELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPKLSKMKK 300

Query: 301 DVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRNAPVTMKALKNHLDRSKGLPF 360
           DVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRN PVTMKALKNHLDRSKGLPF
Sbjct: 301 DVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRNVPVTMKALKNHLDRSKGLPF 360

Query: 361 VKRISDFHLLLLLARVLDVSSDVPALAECVQTQTAIPEGYKILIESMASAA 412
           VKRISDFHLLLLL+RVLDVSSDVPALAECVQTQTA+PEGYKILIESMASAA
Sbjct: 361 VKRISDFHLLLLLSRVLDVSSDVPALAECVQTQTAVPEGYKILIESMASAA 411

BLAST of ClCG01G000050 vs. NCBI nr
Match: gi|449456429|ref|XP_004145952.1| (PREDICTED: NPL4-like protein 1 isoform X1 [Cucumis sativus])

HSP 1 Score: 792.7 bits (2046), Expect = 3.0e-226
Identity = 398/411 (96.84%), Positives = 403/411 (98.05%), Query Frame = 1

Query: 1   MMLRIRSRDGLERVAVDNPHITIAQLKAIIQSQLNIPIHNQTLSTNQNILLAKTHDDLSK 60
           MMLRIRSRDGLERVAV+NPHITIAQLKAIIQSQL IPIHNQTLSTNQNILLAKT DDLSK
Sbjct: 1   MMLRIRSRDGLERVAVENPHITIAQLKAIIQSQLKIPIHNQTLSTNQNILLAKTQDDLSK 60

Query: 61  FTDMSNPNTHLSSLNLSHGSIVFLAYEGERTVAGPTFHPAGSFGRKMTMDDLIAKQMRIT 120
           FTDMSNPNT+LSSLNLSHGSIVFLAYEGERTVAGPT HPAGSFGRKMTMDDLIAKQMRIT
Sbjct: 61  FTDMSNPNTYLSSLNLSHGSIVFLAYEGERTVAGPTVHPAGSFGRKMTMDDLIAKQMRIT 120

Query: 121 RQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQG 180
           RQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQG
Sbjct: 121 RQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQG 180

Query: 181 TEDNLLFFRDPDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHSE 240
           TEDNLLFFRD DEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHSE
Sbjct: 181 TEDNLLFFRDHDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHSE 240

Query: 241 SELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPKLSKMRR 300
           SELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKE WFETDIGEDFDPKLSKM++
Sbjct: 241 SELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKECWFETDIGEDFDPKLSKMKK 300

Query: 301 DVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRNAPVTMKALKNHLDRSKGLPF 360
           DVVVGVKDTRDVDNDFFLVVVKI DHQGPLSTTFPIENRN PVTMKALKNHLDRSKGLPF
Sbjct: 301 DVVVGVKDTRDVDNDFFLVVVKILDHQGPLSTTFPIENRNVPVTMKALKNHLDRSKGLPF 360

Query: 361 VKRISDFHLLLLLARVLDVSSDVPALAECVQTQTAIPEGYKILIESMASAA 412
           VKRISDFHLLLLLARVLDVSSDVPALAECVQTQT +PEGYKILIESMASAA
Sbjct: 361 VKRISDFHLLLLLARVLDVSSDVPALAECVQTQTGVPEGYKILIESMASAA 411

BLAST of ClCG01G000050 vs. NCBI nr
Match: gi|1009147803|ref|XP_015891604.1| (PREDICTED: NPL4-like protein 2 [Ziziphus jujuba])

HSP 1 Score: 724.9 bits (1870), Expect = 7.7e-206
Identity = 357/411 (86.86%), Positives = 384/411 (93.43%), Query Frame = 1

Query: 1   MMLRIRSRDGLERVAVDNPHITIAQLKAIIQSQLNIPIHNQTLSTNQNILLAKTHDDLSK 60
           MMLRIRSRDGLERV VDNPHIT+AQLK++IQSQL IP HNQTLSTNQN+LLAKTHD++S+
Sbjct: 1   MMLRIRSRDGLERVTVDNPHITVAQLKSLIQSQLQIPFHNQTLSTNQNLLLAKTHDEISR 60

Query: 61  FTDMSNPNTHLSSLNLSHGSIVFLAYEGERTVAGPTFHPAGSFGRKMTMDDLIAKQMRIT 120
           FTDM++PNT LSSLNLSHGSIVFL Y+GERTVAGP  +P+GSFGRKMTMDDLIAKQMRIT
Sbjct: 61  FTDMADPNTLLSSLNLSHGSIVFLYYDGERTVAGPPVNPSGSFGRKMTMDDLIAKQMRIT 120

Query: 121 RQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQG 180
           RQENPHCELVSFDRDCAN FQHYVNETL FAVKRGG MYGTVS  GKVEVDFIYEPPQQG
Sbjct: 121 RQENPHCELVSFDRDCANVFQHYVNETLVFAVKRGGFMYGTVSEVGKVEVDFIYEPPQQG 180

Query: 181 TEDNLLFFRDPDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHSE 240
           TE+NLL  RDPDEE+LVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAA+ H+E
Sbjct: 181 TEENLLLLRDPDEEKLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAELHAE 240

Query: 241 SELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPKLSKMRR 300
           S LKEWVTAVVKLEVNEDG ADVHFEAFQMSDMCIRLFKEGWFET+IGED DPKLSKM++
Sbjct: 241 SGLKEWVTAVVKLEVNEDGNADVHFEAFQMSDMCIRLFKEGWFETEIGEDVDPKLSKMKK 300

Query: 301 DVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRNAPVTMKALKNHLDRSKGLPF 360
           DVVVGVKDTR+VDNDFFLV+VKIFDHQGPLS TFPIENR  PVTMKALKNHLDR+K LPF
Sbjct: 301 DVVVGVKDTREVDNDFFLVLVKIFDHQGPLSATFPIENRITPVTMKALKNHLDRAKSLPF 360

Query: 361 VKRISDFHLLLLLARVLDVSSDVPALAECVQTQTAIPEGYKILIESMASAA 412
           VKRISDFHLLLLLAR LD+ SDV ALAECV TQT +PEGY++LIESMA+AA
Sbjct: 361 VKRISDFHLLLLLARFLDLGSDVHALAECVHTQTPVPEGYQLLIESMANAA 411

BLAST of ClCG01G000050 vs. NCBI nr
Match: gi|645258621|ref|XP_008234969.1| (PREDICTED: NPL4-like protein 2 [Prunus mume])

HSP 1 Score: 721.8 bits (1862), Expect = 6.5e-205
Identity = 349/412 (84.71%), Positives = 392/412 (95.15%), Query Frame = 1

Query: 1   MMLRIRSRDGLERVAVDNP-HITIAQLKAIIQSQLNIPIHNQTLSTNQNILLAKTHDDLS 60
           MMLR+RSRDGLERV VDNP H T++QLKA+IQ+QL IP  NQT+STNQN+LLAKTHDD+S
Sbjct: 1   MMLRVRSRDGLERVTVDNPQHTTVSQLKALIQTQLRIPFQNQTISTNQNLLLAKTHDDIS 60

Query: 61  KFTDMSNPNTHLSSLNLSHGSIVFLAYEGERTVAGPTFHPAGSFGRKMTMDDLIAKQMRI 120
           +FTDM+NPNT LS+LNLSHGSIV+LAY+GERTVAGPTFHPAGSFGRKMTMDDLIAKQM++
Sbjct: 61  RFTDMANPNTSLSALNLSHGSIVYLAYDGERTVAGPTFHPAGSFGRKMTMDDLIAKQMKV 120

Query: 121 TRQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQ 180
           TRQENPH ELVSFDRDCANAFQHYVN+TLAFAVKRGG MYGTVS EGKVEVDFIYEPPQQ
Sbjct: 121 TRQENPHSELVSFDRDCANAFQHYVNDTLAFAVKRGGFMYGTVSEEGKVEVDFIYEPPQQ 180

Query: 181 GTEDNLLFFRDPDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHS 240
           GTE NL+FFRDPDEE+ VEAIA+GLGMR+VGFIFTQT+SQDKKDYTLSNREVLQA++FH+
Sbjct: 181 GTEANLVFFRDPDEEKSVEAIAMGLGMRRVGFIFTQTVSQDKKDYTLSNREVLQASEFHA 240

Query: 241 ESELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPKLSKMR 300
           ES LKEWVTA+VKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFET+I E  DPKLSKM+
Sbjct: 241 ESGLKEWVTAMVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETEIEEGHDPKLSKMK 300

Query: 301 RDVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRNAPVTMKALKNHLDRSKGLP 360
           +DVVVGVKDTR+VDNDFFLVVVKIFDHQGPLS++FPIENRNAPVT+KALKNHLDR+K LP
Sbjct: 301 KDVVVGVKDTREVDNDFFLVVVKIFDHQGPLSSSFPIENRNAPVTLKALKNHLDRAKSLP 360

Query: 361 FVKRISDFHLLLLLARVLDVSSDVPALAECVQTQTAIPEGYKILIESMASAA 412
           FVKRISDFHL+LLLAR LDV++D+PALAECV T++ IPEGY++LIESMA+A+
Sbjct: 361 FVKRISDFHLMLLLARFLDVAADIPALAECVHTESPIPEGYQLLIESMANAS 412

BLAST of ClCG01G000050 vs. NCBI nr
Match: gi|595792271|ref|XP_007199884.1| (hypothetical protein PRUPE_ppa006427mg [Prunus persica])

HSP 1 Score: 717.2 bits (1850), Expect = 1.6e-203
Identity = 347/412 (84.22%), Positives = 390/412 (94.66%), Query Frame = 1

Query: 1   MMLRIRSRDGLERVAVDNP-HITIAQLKAIIQSQLNIPIHNQTLSTNQNILLAKTHDDLS 60
           MMLR+RSRDGLERV VDNP H T++QLKA+IQ+QL IP  NQT+STNQN+LLAKTHDD+S
Sbjct: 1   MMLRVRSRDGLERVTVDNPQHTTVSQLKALIQTQLRIPFQNQTISTNQNLLLAKTHDDIS 60

Query: 61  KFTDMSNPNTHLSSLNLSHGSIVFLAYEGERTVAGPTFHPAGSFGRKMTMDDLIAKQMRI 120
           +FTDM+NPNT LS+LNLSHGSIV+LAY+GERTVAGPTFHPAGSFGRKMTMDDLIAKQM++
Sbjct: 61  RFTDMANPNTPLSALNLSHGSIVYLAYDGERTVAGPTFHPAGSFGRKMTMDDLIAKQMKV 120

Query: 121 TRQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQ 180
           TRQENPH ELVSFDRDCANAFQHYVN+TLAFAVKRGG MYGTVS EGKVEVDFIYEPPQQ
Sbjct: 121 TRQENPHSELVSFDRDCANAFQHYVNDTLAFAVKRGGFMYGTVSEEGKVEVDFIYEPPQQ 180

Query: 181 GTEDNLLFFRDPDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHS 240
           GTE NL+FFRDPDEE+ VEAIA+GLGMR+VGFIFTQT+SQDKKDYTLSNREVLQA++FH+
Sbjct: 181 GTEANLVFFRDPDEEKSVEAIAMGLGMRRVGFIFTQTVSQDKKDYTLSNREVLQASEFHA 240

Query: 241 ESELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPKLSKMR 300
           ES LKEWVTA+VKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFET+I E  DPKLSKM+
Sbjct: 241 ESGLKEWVTAMVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETEIEEGHDPKLSKMK 300

Query: 301 RDVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRNAPVTMKALKNHLDRSKGLP 360
           +DVVVGVKDTR+VDNDFFLVVVKIFDHQGPLS++FPIENRN PVT+KALKNHLDR+K LP
Sbjct: 301 KDVVVGVKDTREVDNDFFLVVVKIFDHQGPLSSSFPIENRNTPVTLKALKNHLDRAKSLP 360

Query: 361 FVKRISDFHLLLLLARVLDVSSDVPALAECVQTQTAIPEGYKILIESMASAA 412
           FVKRISDFHL+LLLAR LDV++D+PALA CV T++ IPEGY++LIESMA+A+
Sbjct: 361 FVKRISDFHLMLLLARFLDVAADIPALAVCVHTESPIPEGYQLLIESMANAS 412

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
NPL41_ARATH	2.5e-180	73.48	NPL4-like protein 1 OS=Arabidopsis thaliana GN=At3g63000 PE=1 SV=1
NPL42_ARATH	4.9e-176	73.66	NPL4-like protein 2 OS=Arabidopsis thaliana GN=At2g47970 PE=1 SV=1
NPL4_ORYSJ	1.9e-127	57.04	NPL4-like protein OS=Oryza sativa subsp. japonica GN=Os01g0377700 PE=2 SV=1
NPL4_DICDI	2.0e-28	32.28	Nuclear protein localization protein 4 homolog OS=Dictyostelium discoideum GN=np...
NPL4_MOUSE	7.1e-18	27.54	Nuclear protein localization protein 4 homolog OS=Mus musculus GN=Nploc4 PE=1 SV...

Match Name	E-value	Identity (%)	Description
M5VVX8_PRUPE	1.1e-203	84.22	Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa006427mg PE=4 SV=1
W9SJC4_9ROSA	1.4e-198	84.43	Uncharacterized protein OS=Morus notabilis GN=L484_020438 PE=4 SV=1
I1JQF4_SOYBN	7.8e-197	82.81	Uncharacterized protein OS=Glycine max GN=GLYMA_03G207900 PE=4 SV=1
I1NB01_SOYBN	1.0e-196	83.29	Uncharacterized protein OS=Glycine max GN=GLYMA_19G205200 PE=4 SV=1
V7D1G5_PHAVU	1.7e-196	82.04	Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_001G201100g PE=4 SV=1

Match Name	E-value	Identity (%)	Description
AT3G63000.1	1.4e-181	73.48	NPL4-like protein 1
AT2G47970.1	2.7e-177	73.66	Nuclear pore localisation protein NPL4

Match Name	E-value	Identity (%)	Description
gi|659074543|ref|XP_008437660.1|	1.6e-227	97.08	PREDICTED: NPL4-like protein 1 [Cucumis melo]
gi|449456429|ref|XP_004145952.1|	3.0e-226	96.84	PREDICTED: NPL4-like protein 1 isoform X1 [Cucumis sativus]
gi|1009147803|ref|XP_015891604.1|	7.7e-206	86.86	PREDICTED: NPL4-like protein 2 [Ziziphus jujuba]
gi|645258621|ref|XP_008234969.1|	6.5e-205	84.71	PREDICTED: NPL4-like protein 2 [Prunus mume]
gi|595792271|ref|XP_007199884.1|	1.6e-203	84.22	hypothetical protein PRUPE_ppa006427mg [Prunus persica]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR007717	NPL4_C
IPR024682	Npl4_Ub-like_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0006511 ubiquitin-dependent protein catabolic process
cellular_component GO:0005575 cellular_component
cellular_component GO:0005634 nucleus
molecular_function GO:0003674 molecular_function
molecular_function GO:0043130 ubiquitin binding
molecular_function GO:0031625 ubiquitin protein ligase binding

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
ClCG01G000050.1	ClCG01G000050.1	mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR007717	Nuclear pore localisation protein NPL4, C-terminal	PFAM	PF05021	NPL4	coord: 167..280; score: 2.9
IPR024682	Nuclear pore localisation protein Npl4, ubiquitin-like domain	PFAM	PF11543	UN_NPL4	coord: 1..85; score: 1.
None	No IPR available	GENE3D	G3DSA:3.10.20.90		coord: 1..98; score: 7.1
None	No IPR available	PANTHER	PTHR12710	NUCLEAR PROTEIN LOCALIZATION 4	coord: 26..410; score: 3.9E
None	No IPR available	PANTHER	PTHR12710:SF0	NUCLEAR PROTEIN LOCALIZATION PROTEIN 4 HOMOLOG	coord: 26..410; score: 3.9E
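
The coord fields in the Alignment column give residue ranges on the protein, so the length of each annotated region is end - start + 1. The Python sketch below works under two assumptions: that the ranges are 1-based and inclusive, and that they always follow the "coord: start..end" pattern seen in this table. The helper names `parse_coord` and `span_length` are hypothetical.

```python
import re

# Matches the "coord: start..end" fragment used in the Alignment column.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text: str) -> tuple:
    """Extract (start, end) residue positions from an Alignment cell."""
    match = COORD_RE.search(text)
    if match is None:
        raise ValueError(f"no coord field in {text!r}")
    return int(match.group(1)), int(match.group(2))


def span_length(start: int, end: int) -> int:
    """Length of a range, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Alignment cells copied from the InterPro annotation table.
regions = {
    "PF05021": "coord: 167..280",
    "PF11543": "coord: 1..85",
    "G3DSA:3.10.20.90": "coord: 1..98",
    "PTHR12710": "coord: 26..410",
}

lengths = {name: span_length(*parse_coord(cell)) for name, cell in regions.items()}
for name, length in lengths.items():
    print(name, length)
```

Under those assumptions the PF05021 (NPL4) match covers 114 residues and the PANTHER family match covers 385.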