'\" te .\" Copyright (c) 1992, Sun Microsystems, Inc. .\" The contents of this file are subject to the terms of the Common Development and Distribution License (the "License"). You may not use this file except in compliance with the License. .\" You can obtain a copy of the license at usr/src/OPENSOLARIS.LICENSE or http://www.opensolaris.org/os/licensing. See the License for the specific language governing permissions and limitations under the License. .\" When distributing Covered Code, include this CDDL HEADER in each file and include the License file at usr/src/OPENSOLARIS.LICENSE. If applicable, add the following below this CDDL HEADER, with the fields enclosed by brackets "[]" replaced with your own identifying information: Portions Copyright [yyyy] [name of copyright owner] .TH XSTR 1 "Sep 14, 1992" .SH NAME xstr \- extract strings from C programs to implement shared strings .SH SYNOPSIS .LP .nf \fBxstr\fR \fB-c\fR \fIfilename\fR [\fB-v\fR] [\fB-l\fR \fIarray\fR] .fi .LP .nf \fBxstr\fR [\fB-l\fR \fIarray\fR] .fi .LP .nf \fBxstr\fR \fIfilename\fR [\fB-v\fR] [\fB-l\fR \fIarray\fR] .fi .SH DESCRIPTION .sp .LP \fBxstr\fR maintains a file called \fBstrings\fR into which strings in component parts of a large program are hashed. These strings are replaced with references to this common area. This serves to implement shared constant strings, which are most useful if they are also read-only. .sp .LP The command: .sp .in +2 .nf \fBexample% xstr \(mic \fR\fIfilename\fR .fi .in -2 .sp .sp .LP extracts the strings from the C source in name, replacing string references by expressions of the form \fB&xstr[\fR\fInumber\fR\fB]\fR for some number. An appropriate declaration of \fBxstr\fR is prepended to the file. The resulting C text is placed in the file \fBx.c\fR, to then be compiled. The strings from this file are placed in the \fBstrings\fR data base if they are not there already. Repeated strings and strings which are suffixes of existing strings do not cause changes to the data base. .sp .LP After all components of a large program have been compiled, a file declaring the common \fBxstr\fR space called \fBxs.c\fR can be created by a command of the form: .sp .in +2 .nf \fBexample%\fR \fBxstr\fR .fi .in -2 .sp .sp .LP This \fBxs.c\fR file should then be compiled and loaded with the rest of the program. If possible, the array can be made read-only (shared) saving space and swap overhead. .sp .LP \fBxstr\fR can also be used on a single file. A command: .sp .LP \fBexample% xstr \fR\fIfilename\fR .sp .LP creates files \fBx.c\fR and \fBxs.c\fR as before, without using or affecting any \fBstrings\fR file in the same directory. .sp .LP It may be useful to run \fBxstr\fR after the C preprocessor if any macro definitions yield strings or if there is conditional code which contains strings which may not, in fact, be needed. \fBxstr\fR reads from the standard input when the argument `\fB\(mi\fR\&' is given. An appropriate command sequence for running \fBxstr\fR after the C preprocessor is: .sp .in +2 .nf \fBexample% cc \(miE name.c | xstr \(mic \(mi example% cc \(mic x.c example% mv x.o name.o\fR .fi .in -2 .sp .sp .LP \fBxstr\fR does not touch the file \fBstrings\fR unless new items are added; thus \fBmake\fR(1S) can avoid remaking \fBxs.o\fR unless truly necessary. .SH OPTIONS .sp .ne 2 .na \fB\fB\(mic \fR\fIfilename\fR\fR .ad .RS 18n Take C source text from \fIfilename\fR. .RE .sp .ne 2 .na \fB\fB-v\fR\fR .ad .RS 18n Verbose: display a progress report indicating where new or duplicate strings were found. .RE .sp .ne 2 .na \fB\fB\(mil \fR\fIarray\fR\fR .ad .RS 18n Specify the named \fIarray\fR in program references to abstracted strings. The default array name is \fBxstr\fR. .RE .SH FILES .sp .ne 2 .na \fB\fBstrings\fR\fR .ad .RS 12n data base of strings .RE .sp .ne 2 .na \fB\fBx.c\fR\fR .ad .RS 12n massaged C source .RE .sp .ne 2 .na \fB\fBxs.c\fR\fR .ad .RS 12n C source for definition of array "xstr*(rq .RE .sp .ne 2 .na \fB\fB/tmp/xs*\fR\fR .ad .RS 12n temp file when \fBxstr \fR\fIfilename\fR doesn't touch \fBstrings\fR .RE .SH SEE ALSO .sp .LP .BR make (1S), .BR attributes (7) .SH BUGS .sp .LP If a string is a suffix of another string in the data base, but the shorter string is seen first by \fBxstr\fR both strings will be placed in the data base, when just placing the longer one there would do. .SH NOTES .sp .LP Be aware that \fBxstr\fR indiscriminately replaces all strings with expressions of the form \fB&xstr[\fR\fInumber\fR\fB]\fR regardless of the way the original C code might have used the string. For example, you will encounter a problem with code that uses \fBsizeof()\fR to determine the length of a literal string because \fBxstr\fR will replace the literal string with a pointer that most likely will have a different size than the string's. To circumvent this problem: .RS +4 .TP .ie t \(bu .el o use \fBstrlen()\fR instead of \fBsizeof()\fR; note that \fBsizeof()\fR returns the size of the array (including the null byte at the end), whereas \fBstrlen()\fR doesn't count the null byte. The equivalent of \fBsizeof("\fR\fIxxx\fR\fB")\fR really is \fB(strlen("\fR\fIxxx\fR\fB"))+1\fR. .RE .RS +4 .TP .ie t \(bu .el o use \fB#define\fR for operands of \fBsizeof()\fR and use the \fBdefine\fR'd version. \fBxstr\fR ignores \fB#define\fR statements. Make sure you run \fBxstr\fR on \fIfilename\fR before you run it on the preprocessor. .RE .sp .LP You will also encounter a problem when declaring an initialized character array of the form .sp .in +2 .nf \fBchar x[] = "\fR\fIxxx\fR\fB";\fR .fi .in -2 .sp .sp .LP \fBxstr\fR will replace \fIxxx\fR with an expression of the form \fB&xstr[\fR\fInumber\fR\fB]\fR which will not compile. To circumvent this problem, use \fBstatic char *x = "xxx"\fR instead of \fBstatic char x[] = "xxx"\fR.