xref: /freebsd/contrib/one-true-awk/FIXES (revision 282a3889ebf826db9839be296ff1dd903f6d6d6e)
1/****************************************************************
2Copyright (C) Lucent Technologies 1997
3All Rights Reserved
4
5Permission to use, copy, modify, and distribute this software and
6its documentation for any purpose and without fee is hereby
7granted, provided that the above copyright notice appear in all
8copies and that both that the copyright notice and this
9permission notice and warranty disclaimer appear in supporting
10documentation, and that the name Lucent Technologies or any of
11its entities not be used in advertising or publicity pertaining
12to distribution of the software without specific, written prior
13permission.
14
15LUCENT DISCLAIMS ALL WARRANTIES WITH REGARD TO THIS SOFTWARE,
16INCLUDING ALL IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS.
17IN NO EVENT SHALL LUCENT OR ANY OF ITS ENTITIES BE LIABLE FOR ANY
18SPECIAL, INDIRECT OR CONSEQUENTIAL DAMAGES OR ANY DAMAGES
19WHATSOEVER RESULTING FROM LOSS OF USE, DATA OR PROFITS, WHETHER
20IN AN ACTION OF CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION,
21ARISING OUT OF OR IN CONNECTION WITH THE USE OR PERFORMANCE OF
22THIS SOFTWARE.
23****************************************************************/
24
25This file lists all bug fixes, changes, etc., made since the AWK book
26was sent to the printers in August, 1987.
27
28May 1, 2007:
29	fiddle in makefile to fix for BSD make; thanks to igor sobrado.
30
31Mar 31, 2007:
32	fixed some null pointer refs calling adjbuf.
33
34Feb 21, 2007:
35	fixed a bug in matching the null RE in sub and gsub.  thanks to al aho
36	who actually did the fix (in b.c), and to wolfgang seeberg for finding
37	it and providing a very compact test case.
38
39	fixed quotation in b.c; thanks to Hal Pratt and the Princeton Dante
40	Project.
41
42	removed some no-effect asserts in run.c.
43
44	fiddled maketab.c to not complain about bison-generated values.
45
46	removed the obsolete -V argument; fixed --version to print the
47	version and exit.
48
49	fixed wording and an outright error in the usage message; thanks to igor
50	sobrado and jason mcintyre.
51
52	fixed a bug in -d that caused core dump if no program followed.
53
54Jan 1, 2007:
55	dropped mac.code from makefile; there are few non-MacOSX
56	mac's these days.
57
58Jan 17, 2006:
59	system() not flagged as unsafe in the unadvertised -safe option.
60	found it while enhancing tests before shipping the ;login: article.
61	practice what you preach.
62
63	removed the 9-years-obsolete -mr and -mf flags.
64
65	added -version and --version options.
66
67	core dump on linux with BEGIN {nextfile}, now fixed.
68
69	removed some #ifdef's in run.c and lex.c that appear to no
70	longer be necessary.
71
72Apr 24, 2005:
73	modified lib.c so that values of $0 et al are preserved in the END
74	block, apparently as required by posix.  thanks to havard eidnes
75	for the report and code.
76
77Jan 14, 2005:
78	fixed infinite loop in parsing, originally found by brian tsang.
79	thanks to arnold robbins for a suggestion that started me
80	rethinking it.
81
82Dec 31, 2004:
83	prevent overflow of -f array in main, head off potential error in
84	call of SYNTAX(), test malloc return in lib.c, all with thanks to
85	todd miller.
86
87Dec 22, 2004:
88	cranked up size of NCHARS; coverity thinks it can be overrun with
89	smaller size, and i think that's right.  added some assertions to b.c
90	to catch places where it might overrun.  the RE code is still fragile.
91
92Dec 5, 2004:
93	fixed a couple of overflow problems with ridiculous field numbers:
94	e.g., print $(2^32-1).  thanks to ruslan ermilov, giorgos keramidas
95	and david o'brien at freebsd.org for patches.  this really should
96	be re-done from scratch.
97
98Nov 21, 2004:
99	fixed another 25-year-old RE bug, in split.  it's another failure
100	to (re-)initialize.  thanks to steve fisher for spotting this and
101	providing a good test case.
102
103Nov 22, 2003:
104	fixed a bug in regular expressions that dates (so help me) from 1977;
105	it's been there from the beginning.  an anchored longest match that
106	was longer than the number of states triggered a failure to initialize
107	the machine properly.  many thanks to moinak ghosh for not only finding
108	this one but for providing a fix, in some of the most mysterious
109	code known to man.
110
111	fixed a storage leak in call() that appears to have been there since
112	1983 or so -- a function without an explicit return that assigns a
113	string to a parameter leaked a Cell.  thanks to moinak ghosh for
114	spotting this very subtle one.
115
116Jul 31, 2003:
117	fixed, thanks to andrey chernov and ruslan ermilov, a bug in lex.c
118	that mis-handled the character 255 in input.  (it was being compared
119	to EOF with a signed comparison.)
120
121Jul 29, 2003:
122	fixed (i think) the long-standing botch that included the beginning of
123	line state ^ for RE's in the set of valid characters; this led to a
124	variety of odd problems, including failure to properly match certain
125	regular expressions in non-US locales.  thanks to ruslan for keeping
126	at this one.
127
128Jul 28, 2003:
129	n-th try at getting internationalization right, with thanks to volker
130	kiefel, arnold robbins and ruslan ermilov for advice, though they
131	should not be blamed for the outcome.  according to posix, "."  is the
132	radix character in programs and command line arguments regardless of
133	the locale; otherwise, the locale should prevail for input and output
134	of numbers.  so it's intended to work that way.
135
136	i have rescinded the attempt to use strcoll in expanding shorthands in
137	regular expressions (cclenter).  its properties are much too
138	surprising; for example [a-c] matches aAbBc in locale en_US but abBcC
139	in locale fr_CA.  i can see how this might arise by implementation
140	but i cannot explain it to a human user.  (this behavior can be seen
141	in gawk as well; we're leaning on the same library.)
142
143	the issue appears to be that strcoll is meant for sorting, where
144	merging upper and lower case may make sense (though note that unix
145	sort does not do this by default either).  it is not appropriate
146	for regular expressions, where the goal is to match specific
147	patterns of characters.  in any case, the notations [:lower:], etc.,
148	are available in awk, and they are more likely to work correctly in
149	most locales.
150
151	a moratorium is hereby declared on internationalization changes.
152	i apologize to friends and colleagues in other parts of the world.
153	i would truly like to get this "right", but i don't know what
154	that is, and i do not want to keep making changes until it's clear.
155
156Jul 4, 2003:
157	fixed bug that permitted non-terminated RE, as in "awk /x".
158
159Jun 1, 2003:
160	subtle change to split: if source is empty, number of elems
161	is always 0 and the array is not set.
162
163Mar 21, 2003:
164	added some parens to isblank, in another attempt to make things
165	internationally portable.
166
167Mar 14, 2003:
168	the internationalization changes, somewhat modified, are now
169	reinstated.  in theory awk will now do character comparisons
170	and case conversions in national language, but "." will always
171	be the decimal point separator on input and output regardless
172	of national language.  isblank(){} has an #ifndef.
173
174	this no longer compiles on windows: LC_MESSAGES isn't defined
175	in vc6++.
176
177	fixed subtle behavior in field and record splitting: if FS is
178	a single character and RS is not empty, \n is NOT a separator.
179	this tortuous reading is found in the awk book; behavior now
180	matches gawk and mawk.
181
182Dec 13, 2002:
183	for the moment, the internationalization changes of nov 29 are
184	rolled back -- programs like x = 1.2 don't work in some locales,
185	because the parser is expecting x = 1,2.  until i understand this
186	better, this will have to wait.
187
188Nov 29, 2002:
189	modified b.c (with tiny changes in main and run) to support
190	locales, using strcoll and iswhatever tests for posix character
191	classes.  thanks to ruslan ermilov (ru@freebsd.org) for code.
192	the function isblank doesn't seem to have propagated to any
193	header file near me, so it's there explicitly.  not properly
194	tested on non-ascii character sets by me.
195
196Jun 28, 2002:
197	modified run/format() and tran/getsval() to do a slightly better
198	job on using OFMT for output from print and CONVFMT for other
199	number->string conversions, as promised by posix and done by
200	gawk and mawk.  there are still places where it doesn't work
201	right if CONVFMT is changed; by then the STR attribute of the
202	variable has been irrevocably set.  thanks to arnold robbins for
203	code and examples.
204
205	fixed subtle bug in format that could get core dump.  thanks to
206	Jaromir Dolecek <jdolecek@NetBSD.org> for finding and fixing.
207	minor cleanup in run.c / format() at the same time.
208
209	added some tests for null pointers to debugging printf's, which
210	were never intended for external consumption.  thanks to dave
211	kerns (dkerns@lucent.com) for pointing this out.
212
213	GNU compatibility: an empty regexp matches anything (thanks to
214	dag-erling smorgrav, des@ofug.org).  subject to reversion if
215	this does more harm than good.
216
217	pervasive small changes to make things more const-correct, as
218	reported by gcc's -Wwrite-strings.  as it says in the gcc manual,
219	this may be more nuisance than useful.  provoked by a suggestion
220	and code from arnaud desitter, arnaud@nimbus.geog.ox.ac.uk
221
222	minor documentation changes to note that this now compiles out
223	of the box on Mac OS X.
224
225Feb 10, 2002:
226	changed types in posix chars structure to quiet solaris cc.
227
228Jan 1, 2002:
229	fflush() or fflush("") flushes all files and pipes.
230
231	length(arrayname) returns number of elements; thanks to
232	arnold robbins for suggestion.
233
234	added a makefile.win to make it easier to build on windows.
235	based on dan allen's buildwin.bat.
236
237Nov 16, 2001:
238	added support for posix character class names like [:digit:],
239	which are not exactly shorter than [0-9] and perhaps no more
240	portable.  thanks to dag-erling smorgrav for code.
241
242Feb 16, 2001:
243	removed -m option; no longer needed, and it was actually
244	broken (noted thanks to volker kiefel).
245
246Feb 10, 2001:
247	fixed an appalling bug in gettok: any sequence of digits, +,-, E, e,
248	and period was accepted as a valid number if it started with a period.
249	this would never have happened with the lex version.
250
251	other 1-character botches, now fixed, include a bare $ and a
252	bare " at the end of the input.
253
254Feb 7, 2001:
255	more (const char *) casts in b.c and tran.c to silence warnings.
256
257Nov 15, 2000:
258	fixed a bug introduced in august 1997 that caused expressions
259	like $f[1] to be syntax errors.  thanks to arnold robbins for
260	noticing this and providing a fix.
261
262Oct 30, 2000:
263	fixed some nextfile bugs: not handling all cases.  thanks to
264	arnold robbins for pointing this out.  new regressions added.
265
266	close() is now a function.  it returns whatever the library
267	fclose returns, and -1 for closing a file or pipe that wasn't
268	opened.
269
270Sep 24, 2000:
271	permit \n explicitly in character classes; won't work right
272	if comes in as "[\n]" but ok as /[\n]/, because of multiple
273	processing of \'s.  thanks to arnold robbins.
274
275July 5, 2000:
276	minor fiddles in tran.c to keep compilers happy about uschar.
277	thanks to norman wilson.
278
279May 25, 2000:
280	yet another attempt at making 8-bit input work, with another
281	band-aid in b.c (member()), and some (uschar) casts to head
282	off potential errors in subscripts (like isdigit).  also
283	changed HAT to NCHARS-2.  thanks again to santiago vila.
284
285	changed maketab.c to ignore apparently out of range definitions
286	instead of halting; new freeBSD generates one.  thanks to
287	jon snader <jsnader@ix.netcom.com> for pointing out the problem.
288
289May 2, 2000:
290	fixed an 8-bit problem in b.c by making several char*'s into
291	unsigned char*'s.  not clear i have them all yet.  thanks to
292	Santiago Vila <sanvila@unex.es> for the bug report.
293
294Apr 21, 2000:
295	finally found and fixed a memory leak in function call; it's
296	been there since functions were added ~1983.  thanks to
297	jon bentley for the test case that found it.
298
299	added test in envinit to catch environment "variables" with
300	names beginning with '='; thanks to Berend Hasselman.
301
302Jul 28, 1999:
303	added test in defn() to catch function foo(foo), which
304	otherwise recurses until core dump.  thanks to arnold
305	robbins for noticing this.
306
307Jun 20, 1999:
308	added *bp in gettok in lex.c; appears possible to exit function
309	without terminating the string.  thanks to russ cox.
310
311Jun 2, 1999:
312	added function stdinit() to run to initialize files[] array,
313	in case stdin, etc., are not constants; some compilers care.
314
315May 10, 1999:
316	replaced the ERROR ... FATAL, etc., macros with functions
317	based on vprintf, to avoid problems caused by overrunning
318	fixed-size errbuf array.  thanks to ralph corderoy for the
319	impetus, and for pointing out a string termination bug in
320	qstring as well.
321
322Apr 21, 1999:
323	fixed bug that caused occasional core dumps with commandline
324	variable with value ending in \.  (thanks to nelson beebe for
325	the test case.)
326
327Apr 16, 1999:
328	with code kindly provided by Bruce Lilly, awk now parses
329	/=/ and similar constructs more sensibly in more places.
330	Bruce also provided some helpful test cases.
331
332Apr 5, 1999:
333	changed true/false to True/False in run.c to make it
334	easier to compile with C++.  Added some casts on malloc
335	and realloc to be honest about casts; ditto.  changed
336	ltype int to long in struct rrow to reduce some 64-bit
337	complaints; other changes scattered throughout for the
338	same purpose.  thanks to Nelson Beebe for these portability
339	improvements.
340
341	removed some horrible pointer-int casting in b.c and elsewhere
342	by adding ptoi and itonp to localize the casts, which are
343	all benign.  fixed one incipient bug that showed up on sgi
344	in 64-bit mode.
345
346	reset lineno for new source file; include filename in error
347	message.  also fixed line number error in continuation lines.
348	(thanks to Nelson Beebe for both of these.)
349
350Mar 24, 1999:
351	Nelson Beebe notes that irix 5.3 yacc dies with a bogus
352	error; use a newer version or switch to bison, since sgi
353	is unlikely to fix it.
354
355Mar 5, 1999:
356	changed isnumber to is_number to avoid the problem caused by
357	versions of ctype.h that include the name isnumber.
358
359	distribution now includes a script for building on a Mac,
360	thanks to Dan Allen.
361
362Feb 20, 1999:
363	fixed memory leaks in run.c (call) and tran.c (setfval).
364	thanks to Stephen Nutt for finding these and providing the fixes.
365
366Jan 13, 1999:
367	replaced srand argument by (unsigned int) in run.c;
368	avoids problem on Mac and potentially on Unix & Windows.
369	thanks to Dan Allen.
370
371	added a few (int) casts to silence useless compiler warnings.
372	e.g., errorflag= in run.c jump().
373
374	added proctab.c to the bundle outout; one less thing
375	to have to compile out of the box.
376
377	added calls to _popen and _pclose to the win95 stub for
378	pipes (thanks to Steve Adams for this helpful suggestion).
379	seems to work, though properties are not well understood
380	by me, and it appears that under some circumstances the
381	pipe output is truncated.  Be careful.
382
383Oct 19, 1998:
384	fixed a couple of bugs in getrec: could fail to update $0
385	after a getline var; because inputFS wasn't initialized,
386	could split $0 on every character, a misleading diversion.
387
388	fixed caching bug in makedfa: LRU was actually removing
389	least often used.
390
391	thanks to ross ridge for finding these, and for providing
392	great bug reports.
393
394May 12, 1998:
395	fixed potential bug in readrec: might fail to update record
396	pointer after growing.  thanks to dan levy for spotting this
397	and suggesting the fix.
398
399Mar 12, 1998:
400	added -V to print version number and die.
401
402Feb 11, 1998:
403	subtle silent bug in lex.c: if the program ended with a number
404	longer than 1 digit, part of the input would be pushed back and
405	parsed again because token buffer wasn't terminated right.
406	example:  awk 'length($0) > 10'.  blush.  at least i found it
407	myself.
408
409Aug 31, 1997:
410	s/adelete/awkdelete/: SGI uses this in malloc.h.
411	thanks to nelson beebe for pointing this one out.
412
413Aug 21, 1997:
414	fixed some bugs in sub and gsub when replacement includes \\.
415	this is a dark, horrible corner, but at least now i believe that
416	the behavior is the same as gawk and the intended posix standard.
417	thanks to arnold robbins for advice here.
418
419Aug 9, 1997:
420	somewhat regretfully, replaced the ancient lex-based lexical
421	analyzer with one written in C.  it's longer, generates less code,
422	and more portable; the old one depended too much on mysterious
423	properties of lex that were not preserved in other environments.
424	in theory these recognize the same language.
425
426	now using strtod to test whether a string is a number, instead of
427	the convoluted original function.  should be more portable and
428	reliable if strtod is implemented right.
429
430	removed now-pointless optimization in makefile that tries to avoid
431	recompilation when awkgram.y is changed but symbols are not.
432
433	removed most fixed-size arrays, though a handful remain, some
434	of which are unchecked.  you have been warned.
435
436Aug 4, 1997:
437	with some trepidation, replaced the ancient code that managed
438	fields and $0 in fixed-size arrays with arrays that grow on
439	demand.  there is still some tension between trying to make this
440	run fast and making it clean; not sure it's right yet.
441
442	the ill-conceived -mr and -mf arguments are now useful only
443	for debugging.  previous dynamic string code removed.
444
445	numerous other minor cleanups along the way.
446
447Jul 30, 1997:
448	using code provided by dan levy (to whom profuse thanks), replaced
449	fixed-size arrays and awkward kludges by a fairly uniform mechanism
450	to grow arrays as needed for printf, sub, gsub, etc.
451
452Jul 23, 1997:
453	falling off the end of a function returns "" and 0, not 0.
454	thanks to arnold robbins.
455
456Jun 17, 1997:
457	replaced several fixed-size arrays by dynamically-created ones
458	in run.c; added overflow tests to some previously unchecked cases.
459	getline, toupper, tolower.
460
461	getline code is still broken in that recursive calls may wind
462	up using the same space.  [fixed later]
463
464	increased RECSIZE to 8192 to push problems further over the horizon.
465
466	added \r to \n as input line separator for programs, not data.
467	damn CRLFs.
468
469	modified format() to permit explicit printf("%c", 0) to include
470	a null byte in output.  thanks to ken stailey for the fix.
471
472	added a "-safe" argument that disables file output (print >,
473	print >>), process creation (cmd|getline, print |, system), and
474	access to the environment (ENVIRON).  this is a first approximation
475	to a "safe" version of awk, but don't rely on it too much.  thanks
476	to joan feigenbaum and matt blaze for the inspiration long ago.
477
478Jul 8, 1996:
479	fixed long-standing bug in sub, gsub(/a/, "\\\\&"); thanks to
480	ralph corderoy.
481
482Jun 29, 1996:
483	fixed awful bug in new field splitting; didn't get all the places
484	where input was done.
485
486Jun 28, 1996:
487	changed field-splitting to conform to posix definition: fields are
488	split using the value of FS at the time of input; it used to be
489	the value when the field or NF was first referred to, a much less
490	predictable definition.  thanks to arnold robbins for encouragement
491	to do the right thing.
492
493May 28, 1996:
494	fixed appalling but apparently unimportant bug in parsing octal
495	numbers in reg exprs.
496
497	explicit hex in reg exprs now limited to 2 chars: \xa, \xaa.
498
499May 27, 1996:
500	cleaned up some declarations so gcc -Wall is now almost silent.
501
502	makefile now includes backup copies of ytab.c and lexyy.c in case
503	one makes before looking; it also avoids recreating lexyy.c unless
504	really needed.
505
506	s/aprintf/awkprint, s/asprintf/awksprintf/ to avoid some name clashes
507	with unwisely-written header files.
508
509	thanks to jeffrey friedl for several of these.
510
511May 26, 1996:
512	an attempt to rationalize the (unsigned) char issue.  almost all
513	instances of unsigned char have been removed; the handful of places
514	in b.c where chars are used as table indices have been hand-crafted.
515	added some latin-1 tests to the regression, but i'm not confident;
516	none of my compilers seem to care much.  thanks to nelson beebe for
517	pointing out some others that do care.
518
519May 2, 1996:
520	removed all register declarations.
521
522	enhanced split(), as in gawk, etc:  split(s, a, "") splits s into
523	a[1]...a[length(s)] with each character a single element.
524
525	made the same changes for field-splitting if FS is "".
526
527	added nextfile, as in gawk: causes immediate advance to next
528	input file. (thanks to arnold robbins for inspiration and code).
529
530	small fixes to regexpr code:  can now handle []], [[], and
531	variants;  [] is now a syntax error, rather than matching
532	everything;  [z-a] is now empty, not z.  far from complete
533	or correct, however.  (thanks to jeffrey friedl for pointing out
534	some awful behaviors.)
535
536Apr 29, 1996:
537	replaced uchar by uschar everywhere; apparently some compilers
538	usurp this name and this causes conflicts.
539
540	fixed call to time in run.c (bltin); arg is time_t *.
541
542	replaced horrible pointer/long punning in b.c by a legitimate
543	union.  should be safer on 64-bit machines and cleaner everywhere.
544	(thanks to nelson beebe for pointing out some of these problems.)
545
546	replaced nested comments by #if 0...#endif in run.c, lib.c.
547
548	removed getsval, setsval, execute macros from run.c and lib.c.
549	machines are 100x faster than they were when these macros were
550	first used.
551
552	revised filenames: awk.g.y => awkgram.y, awk.lx.l => awklex.l,
553	y.tab.[ch] => ytab.[ch], lex.yy.c => lexyy.c, all in the aid of
554	portability to nameless systems.
555
556	"make bundle" now includes yacc and lex output files for recipients
557	who don't have yacc or lex.
558
559Aug 15, 1995:
560	initialized Cells in setsymtab more carefully; some fields
561	were not set.  (thanks to purify, all of whose complaints i
562	think i now understand.)
563
564	fixed at least one error in gsub that looked at -1-th element
565	of an array when substituting for a null match (e.g., $).
566
567	delete arrayname is now legal; it clears the elements but leaves
568	the array, which may not be the right behavior.
569
570	modified makefile: my current make can't cope with the test used
571	to avoid unnecessary yacc invocations.
572
573Jul 17, 1995:
574	added dynamically growing strings to awk.lx.l and b.c
575	to permit regular expressions to be much bigger.
576	the state arrays can still overflow.
577
578Aug 24, 1994:
579	detect duplicate arguments in function definitions (mdm).
580
581May 11, 1994:
582	trivial fix to printf to limit string size in sub().
583
584Apr 22, 1994:
585	fixed yet another subtle self-assignment problem:
586	$1 = $2; $1 = $1 clobbered $1.
587
588	Regression tests now use private echo, to avoid quoting problems.
589
590Feb 2, 1994:
591	changed error() to print line number as %d, not %g.
592
593Jul 23, 1993:
594	cosmetic changes: increased sizes of some arrays,
595	reworded some error messages.
596
597	added CONVFMT as in posix (just replaced OFMT in getsval)
598
599	FILENAME is now "" until the first thing that causes a file
600	to be opened.
601
602Nov 28, 1992:
603	deleted yyunput and yyoutput from proto.h;
604	different versions of lex give these different declarations.
605
606May 31, 1992:
607	added -mr N and -mf N options: more record and fields.
608	these really ought to adjust automatically.
609
610	cleaned up some error messages; "out of space" now means
611	malloc returned NULL in all cases.
612
613	changed rehash so that if it runs out, it just returns;
614	things will continue to run slow, but maybe a bit longer.
615
616Apr 24, 1992:
617	remove redundant close of stdin when using -f -.
618
619	got rid of core dump with -d; awk -d just prints date.
620
621Apr 12, 1992:
622	added explicit check for /dev/std(in,out,err) in redirection.
623	unlike gawk, no /dev/fd/n yet.
624
625	added (file/pipe) builtin.  hard to test satisfactorily.
626	not posix.
627
628Feb 20, 1992:
629	recompile after abortive changes;  should be unchanged.
630
631Dec 2, 1991:
632	die-casting time:  converted to ansi C, installed that.
633
634Nov 30, 1991:
635	fixed storage leak in freefa, failing to recover [N]CCL.
636	thanks to Bill Jones (jones@cs.usask.ca)
637
638Nov 19, 1991:
639	use RAND_MAX instead of literal in builtin().
640
641Nov 12, 1991:
642	cranked up some fixed-size arrays in b.c, and added a test for
643	overflow in penter.  thanks to mark larsen.
644
645Sep 24, 1991:
646	increased buffer in gsub.  a very crude fix to a general problem.
647	and again on Sep 26.
648
649Aug 18, 1991:
650	enforce variable name syntax for commandline variables: has to
651	start with letter or _.
652
653Jul 27, 1991:
654	allow newline after ; in for statements.
655
656Jul 21, 1991:
657	fixed so that in self-assignment like $1=$1, side effects
658	like recomputing $0 take place.  (this is getting subtle.)
659
660Jun 30, 1991:
661	better test for detecting too-long output record.
662
663Jun 2, 1991:
664	better defense against very long printf strings.
665	made break and continue illegal outside of loops.
666
667May 13, 1991:
668	removed extra arg on gettemp, tempfree.  minor error message rewording.
669
670May 6, 1991:
671	fixed silly bug in hex parsing in hexstr().
672	removed an apparently unnecessary test in isnumber().
673	warn about weird printf conversions.
674	fixed unchecked array overwrite in relex().
675
676	changed for (i in array) to access elements in sorted order.
677	then unchanged it -- it really does run slower in too many cases.
678	left the code in place, commented out.
679
680Feb 10, 1991:
681	check error status on all writes, to avoid banging on full disks.
682
683Jan 28, 1991:
684	awk -f - reads the program from stdin.
685
686Jan 11, 1991:
687	failed to set numeric state on $0 in cmd|getline context in run.c.
688
689Nov 2, 1990:
690	fixed sleazy test for integrality in getsval;  use modf.
691
692Oct 29, 1990:
693	fixed sleazy buggy code in lib.c that looked (incorrectly) for
694	too long input lines.
695
696Oct 14, 1990:
697	fixed the bug on p. 198 in which it couldn't deduce that an
698	argument was an array in some contexts.  replaced the error
699	message in intest() by code that damn well makes it an array.
700
701Oct 8, 1990:
702	fixed horrible bug:  types and values were not preserved in
703	some kinds of self-assignment. (in assign().)
704
705Aug 24, 1990:
706	changed NCHARS to 256 to handle 8-bit characters in strings
707	presented to match(), etc.
708
709Jun 26, 1990:
710	changed struct rrow (awk.h) to use long instead of int for lval,
711	since cfoll() stores a pointer in it.  now works better when int's
712	are smaller than pointers!
713
714May 6, 1990:
715	AVA fixed the grammar so that ! is uniformly of the same precedence as
716	unary + and -.  This renders illegal some constructs like !x=y, which
717	now has to be parenthesized as !(x=y), and makes others work properly:
718	!x+y is (!x)+y, and x!y is x !y, not two pattern-action statements.
719	(These problems were pointed out by Bob Lenk of Posix.)
720
721	Added \x to regular expressions (already in strings).
722	Limited octal to octal digits; \8 and \9 are not octal.
723	Centralized the code for parsing escapes in regular expressions.
724	Added a bunch of tests to T.re and T.sub to verify some of this.
725
726Feb 9, 1990:
727	fixed null pointer dereference bug in main.c:  -F[nothing].  sigh.
728
729	restored srand behavior:  it returns the current seed.
730
731Jan 18, 1990:
732	srand now returns previous seed value (0 to start).
733
734Jan 5, 1990:
735	fix potential problem in tran.c -- something was freed,
736	then used in freesymtab.
737
738Oct 18, 1989:
739	another try to get the max number of open files set with
740	relatively machine-independent code.
741
742	small fix to input() in case of multiple reads after EOF.
743
744Oct 11, 1989:
745	FILENAME is now defined in the BEGIN block -- too many old
746	programs broke.
747
748	"-" means stdin in getline as well as on the commandline.
749
750	added a bunch of casts to the code to tell the truth about
751	char * vs. unsigned char *, a right royal pain.  added a
752	setlocale call to the front of main, though probably no one
753	has it usefully implemented yet.
754
755Aug 24, 1989:
756	removed redundant relational tests against nullnode if parse
757	tree already had a relational at that point.
758
759Aug 11, 1989:
760	fixed bug:  commandline variable assignment has to look like
761	var=something.  (consider the man page for =, in file =.1)
762
763	changed number of arguments to functions to static arrays
764	to avoid repeated malloc calls.
765
766Aug 2, 1989:
767	restored -F (space) separator
768
769Jul 30, 1989:
770	added -v x=1 y=2 ... for immediate commandline variable assignment;
771	done before the BEGIN block for sure.  they have to precede the
772	program if the program is on the commandline.
773	Modified Aug 2 to require a separate -v for each assignment.
774
775Jul 10, 1989:
776	fixed ref-thru-zero bug in environment code in tran.c
777
778Jun 23, 1989:
779	add newline to usage message.
780
781Jun 14, 1989:
782	added some missing ansi printf conversion letters: %i %X %E %G.
783	no sensible meaning for h or L, so they may not do what one expects.
784
785	made %* conversions work.
786
787	changed x^y so that if n is a positive integer, it's done
788	by explicit multiplication, thus achieving maximum accuracy.
789	(this should be done by pow() but it seems not to be locally.)
790	done to x ^= y as well.
791
792Jun 4, 1989:
793	ENVIRON array contains environment: if shell variable V=thing,
794		ENVIRON["V"] is "thing"
795
796	multiple -f arguments permitted.  error reporting is naive.
797	(they were permitted before, but only the last was used.)
798
799	fixed a really stupid botch in the debugging macro dprintf
800
801	fixed order of evaluation of commandline assignments to match
802	what the book claims:  an argument of the form x=e is evaluated
803	at the time it would have been opened if it were a filename (p 63).
804	this invalidates the suggested answer to ex 4-1 (p 195).
805
806	removed some code that permitted -F (space) fieldseparator,
807	since it didn't quite work right anyway.  (restored aug 2)
808
809Apr 27, 1989:
810	Line number now accumulated correctly for comment lines.
811
812Apr 26, 1989:
813	Debugging output now includes a version date,
814	if one compiles it into the source each time.
815
816Apr 9, 1989:
817	Changed grammar to prohibit constants as 3rd arg of sub and gsub;
818	prevents class of overwriting-a-constant errors.  (Last one?)
819	This invalidates the "banana" example on page 43 of the book.
820
821	Added \a ("alert"), \v (vertical tab), \xhhh (hexadecimal),
822	as in ANSI, for strings.  Rescinded the sloppiness that permitted
823	non-octal digits in \ooo.  Warning:  not all compilers and libraries
824	will be able to deal with \x correctly.
825
826Jan 9, 1989:
827	Fixed bug that caused tempcell list to contain a duplicate.
828	The fix is kludgy.
829
830Dec 17, 1988:
831	Catches some more commandline errors in main.
832	Removed redundant decl of modf in run.c (confuses some compilers).
833	Warning:  there's no single declaration of malloc, etc., in awk.h
834	that seems to satisfy all compilers.
835
836Dec 7, 1988:
837	Added a bit of code to error printing to avoid printing nulls.
838	(Not clear that it actually would.)
839
840Nov 27, 1988:
841	With fear and trembling, modified the grammar to permit
842	multiple pattern-action statements on one line without
843	an explicit separator.  By definition, this capitulation
844	to the ghost of ancient implementations remains undefined
845	and thus subject to change without notice or apology.
846	DO NOT COUNT ON IT.
847
848Oct 30, 1988:
849	Fixed bug in call() that failed to recover storage.
850
851	A warning is now generated if there are more arguments
852	in the call than in the definition (in lieu of fixing
853	another storage leak).
854
855Oct 20, 1988:
856	Fixed %c:  if expr is numeric, use numeric value;
857	otherwise print 1st char of string value.  still
858	doesn't work if the value is 0 -- won't print \0.
859
860	Added a few more checks for running out of malloc.
861
862Oct 12, 1988:
863	Fixed bug in call() that freed local arrays twice.
864
865	Fixed to handle deletion of non-existent array right;
866	complains about attempt to delete non-array element.
867
868Sep 30, 1988:
869	Now guarantees to evaluate all arguments of built-in
870	functions, as in C;  the appearance is that arguments
871	are evaluated before the function is called.  Places
872	affected are sub (gsub was ok), substr, printf, and
873	all the built-in arithmetic functions in bltin().
874	A warning is generated if a bltin() is called with
875	the wrong number of arguments.
876
877	This requires changing makeprof on p167 of the book.
878
879Aug 23, 1988:
880	setting FILENAME in BEGIN caused core dump, apparently
881	because it was freeing space not allocated by malloc.
882
883July 24, 1988:
884	fixed egregious error in toupper/tolower functions.
885	still subject to rescinding, however.
886
887July 2, 1988:
888	flush stdout before opening file or pipe
889
890July 2, 1988:
891	performance bug in b.c/cgoto(): not freeing some sets of states.
892	partial fix only right now, and the number of states increased
893	to make it less obvious.
894
895June 1, 1988:
896	check error status on close
897
898May 28, 1988:
899	srand returns seed value it's using.
900	see 1/18/90
901
902May 22, 1988:
903	Removed limit on depth of function calls.
904
905May 10, 1988:
906	Fixed lib.c to permit _ in commandline variable names.
907
908Mar 25, 1988:
909	main.c fixed to recognize -- as terminator of command-
910	line options.  Illegal options flagged.
911	Error reporting slightly cleaned up.
912
913Dec 2, 1987:
914	Newer C compilers apply a strict scope rule to extern
915	declarations within functions.  Two extern declarations in
916	lib.c and tran.c have been moved to obviate this problem.
917
918Oct xx, 1987:
919	Reluctantly added toupper and tolower functions.
920	Subject to rescinding without notice.
921
922Sep 17, 1987:
923	Error-message printer had printf(s) instead of
924	printf("%s",s);  got core dumps when the message
925	included a %.
926
927Sep 12, 1987:
928	Very long printf strings caused core dump;
929	fixed aprintf, asprintf, format to catch them.
930	Can still get a core dump in printf itself.
931
932
933