.\" Copyright (c) 1990, 1991, 1993
.\"	The Regents of the University of California.  All rights reserved.
.\"
.\" This code is derived from software contributed to Berkeley by
.\" Chris Torek and the American National Standards Committee X3,
.\" on Information Processing Systems.
.\"
.\" Redistribution and use in source and binary forms, with or without
.\" modification, are permitted provided that the following conditions
.\" are met:
.\" 1. Redistributions of source code must retain the above copyright
.\"    notice, this list of conditions and the following disclaimer.
.\" 2. Redistributions in binary form must reproduce the above copyright
.\"    notice, this list of conditions and the following disclaimer in the
.\"    documentation and/or other materials provided with the distribution.
.\" 3. All advertising materials mentioning features or use of this software
.\"    must display the following acknowledgement:
.\"	This product includes software developed by the University of
.\"	California, Berkeley and its contributors.
.\" 4. Neither the name of the University nor the names of its contributors
.\"    may be used to endorse or promote products derived from this software
.\"    without specific prior written permission.
.\"
.\" THIS SOFTWARE IS PROVIDED BY THE REGENTS AND CONTRIBUTORS ``AS IS'' AND
.\" ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE
.\" IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE
.\" ARE DISCLAIMED.  IN NO EVENT SHALL THE REGENTS OR CONTRIBUTORS BE LIABLE
.\" FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL
.\" DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS
.\" OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION)
.\" HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT
.\" LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY
.\" OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF
.\" SUCH DAMAGE.
.\"
.\"     @(#)scanf.3	8.2 (Berkeley) 12/11/93
.\" $FreeBSD$
.\"
.Dd December 11, 1993
.Dt SCANF 3
.Os
.Sh NAME
.Nm scanf ,
.Nm fscanf ,
.Nm sscanf ,
.Nm vscanf ,
.Nm vsscanf ,
.Nm vfscanf
.Nd input format conversion
.Sh LIBRARY
.Lb libc
.Sh SYNOPSIS
.In stdio.h
.Ft int
.Fn scanf "const char *format" ...
.Ft int
.Fn fscanf "FILE *stream" "const char *format" ...
.Ft int
.Fn sscanf "const char *str" "const char *format" ...
.In stdarg.h
.Ft int
.Fn vscanf "const char *format" "va_list ap"
.Ft int
.Fn vsscanf "const char *str" "const char *format" "va_list ap"
.Ft int
.Fn vfscanf "FILE *stream" "const char *format" "va_list ap"
.Sh DESCRIPTION
The
.Fn scanf
family of functions scans input according to a
.Fa format
as described below.
This format may contain
.Em conversion specifiers ;
the results from such conversions, if any,
are stored through the
.Em pointer
arguments.
The
.Fn scanf
function
reads input from the standard input stream
.Em stdin ,
.Fn fscanf
reads input from the stream pointer
.Fa stream ,
and
.Fn sscanf
reads its input from the character string pointed to by
.Fa str .
The
.Fn vfscanf
function
is analogous to
.Xr vfprintf 3
and reads input from the stream pointer
.Fa stream
using a variable argument list of pointers (see
.Xr stdarg 3 ) .
The
.Fn vscanf
function scans a variable argument list from the standard input and
the
.Fn vsscanf
function scans it from a string;
these are analogous to
the
.Fn vprintf
and
.Fn vsprintf
functions respectively.
Each successive
.Em pointer
argument must correspond properly with
each successive conversion specifier
(but see the
.Cm *
conversion below).
All conversions are introduced by the
.Cm %
(percent sign) character.
The
.Fa format
string
may also contain other characters.
White space (such as blanks, tabs, or newlines) in the
.Fa format
string matches any amount of white space, including none, in the input.
Everything else
matches only itself.
Scanning stops
when an input character does not match such a format character.
Scanning also stops
when an input conversion cannot be made (see below).
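.Pp
As a minimal sketch of the matching rules above, the following fragment
parses two integers separated by a comma.
The helper name
.Fn parse_pair
is hypothetical and is not part of the library.
.Bd -literal -offset indent
```c
#include <stdio.h>

/*
 * Hypothetical helper: parse "x,y" with optional blanks around the comma.
 * The blank before the ',' in the format matches any amount of white
 * space, including none; the ',' matches only itself.
 * Returns the number of items assigned, as reported by sscanf().
 */
int
parse_pair(const char *str, int *x, int *y)
{
	return (sscanf(str, "%d ,%d", x, y));
}
```
.Ed
.Pp
With this format, the inputs
.Dq Li 1,2
and
.Dq Li "1 , 2"
should both assign two items, whereas
.Dq Li 1;2
stops at the
.Ql \&;
after assigning only the first.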
.Sh CONVERSIONS
Following the
.Cm %
character introducing a conversion
there may be a number of
.Em flag
characters, as follows:
.Bl -tag -width ".Cm l No (ell)"
.It Cm *
Suppresses assignment.
The conversion that follows occurs as usual, but no pointer is used;
the result of the conversion is simply discarded.
.It Cm hh
Indicates that the conversion will be one of
.Cm dioux
or
.Cm n
and the next pointer is a pointer to a
.Vt char
(rather than
.Vt int ) .
.It Cm h
Indicates that the conversion will be one of
.Cm dioux
or
.Cm n
and the next pointer is a pointer to a
.Vt "short int"
(rather than
.Vt int ) .
.It Cm l No (ell)
Indicates that the conversion will be one of
.Cm dioux
or
.Cm n
and the next pointer is a pointer to a
.Vt "long int"
(rather than
.Vt int ) ,
that the conversion will be one of
.Cm aefg
and the next pointer is a pointer to
.Vt double
(rather than
.Vt float ) ,
or that the conversion will be one of
.Cm c
or
.Cm s
and the next pointer is a pointer to an array of
.Vt wchar_t
(rather than
.Vt char ) .
.It Cm ll No (ell ell)
Indicates that the conversion will be one of
.Cm dioux
or
.Cm n
and the next pointer is a pointer to a
.Vt "long long int"
(rather than
.Vt int ) .
.It Cm L
Indicates that the conversion will be one of
.Cm aef
or
.Cm g
and the next pointer is a pointer to
.Vt "long double" .
(This type is not implemented; although the argument is
required to be a pointer to
.Vt "long double" ,
no additional precision is used in the conversion.)
.It Cm j
Indicates that the conversion will be one of
.Cm dioux
or
.Cm n
and the next pointer is a pointer to a
.Vt intmax_t
(rather than
.Vt int ) .
.It Cm t
Indicates that the conversion will be one of
.Cm dioux
or
.Cm n
and the next pointer is a pointer to a
.Vt ptrdiff_t
(rather than
.Vt int ) .
.It Cm z
Indicates that the conversion will be one of
.Cm dioux
or
.Cm n
and the next pointer is a pointer to a
.Vt size_t
(rather than
.Vt int ) .
.It Cm q
(Deprecated.)
Indicates that the conversion will be one of
.Cm dioux
or
.Cm n
and the next pointer is a pointer to a
.Vt "long long int"
(rather than
.Vt int ) .
.El
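.Pp
The following sketch shows how the flags above pair with the type of
each pointer argument.
The structure
.Vt "struct sizes"
and the helpers
.Fn scan_sizes
and
.Fn second_field
are hypothetical names chosen for illustration.
.Bd -literal -offset indent
```c
#include <stddef.h>
#include <stdint.h>
#include <stdio.h>

/* Hypothetical container with one member per length modifier shown. */
struct sizes {
	short		s;	/* %hd  */
	long		l;	/* %ld  */
	long long	ll;	/* %lld */
	size_t		z;	/* %zu  */
	intmax_t	j;	/* %jd  */
	double		d;	/* %lf  */
};

/* Returns the number of items assigned (6 when every field matches). */
int
scan_sizes(const char *str, struct sizes *out)
{
	return (sscanf(str, "%hd %ld %lld %zu %jd %lf",
	    &out->s, &out->l, &out->ll, &out->z, &out->j, &out->d));
}

/*
 * The '*' flag converts and discards the first integer, so only one
 * pointer is supplied and at most one item is counted as assigned.
 */
int
second_field(const char *str, int *out)
{
	return (sscanf(str, "%*d %d", out));
}
```
.Ed
.Pp
A mismatch between a flag and the pointed-to type, such as passing a
pointer to
.Vt double
for a plain
.Cm %f ,
is not diagnosed by the function itself.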
.Pp
In addition to these flags,
there may be an optional maximum field width,
expressed as a decimal integer,
between the
.Cm %
and the conversion.
If no width is given,
a default of
.Dq infinity
is used (with one exception, below);
otherwise at most this many characters are scanned
in processing the conversion.
Before conversion begins,
most conversions skip white space;
this white space is not counted against the field width.
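.Pp
A short sketch of the maximum field width in use follows.
The helpers
.Fn split_yyyymmdd
and
.Fn read_word
are hypothetical, as is the 16-byte buffer size.
.Bd -literal -offset indent
```c
#include <stdio.h>

/*
 * Split an unpunctuated date such as "20240517".  Each width caps how
 * many characters the corresponding conversion may consume.
 */
int
split_yyyymmdd(const char *str, int *year, int *month, int *day)
{
	return (sscanf(str, "%4d%2d%2d", year, month, day));
}

/*
 * Read one word into a 16-byte buffer.  A width of 15 leaves room for
 * the terminating NUL; leading white space is skipped and is not
 * counted against the width.
 */
int
read_word(const char *str, char buf[16])
{
	return (sscanf(str, "%15s", buf));
}
```
.Ed
.Pp
Giving
.Cm %s
an explicit width one less than the buffer size is a common way to keep
an overlong input word from overrunning the buffer.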
.Pp
The following conversions are available:
.Bl -tag -width XXXX
.It Cm %
Matches a literal
.Ql % .
That is,
.Dq Li %%
in the format string
matches a single input
.Ql %
character.
No conversion is done, and assignment does not occur.
.It Cm d
Matches an optionally signed decimal integer;
the next pointer must be a pointer to
.Vt int .
.It Cm i
Matches an optionally signed integer;
the next pointer must be a pointer to
.Vt int .
The integer is read in base 16 if it begins
with
.Ql 0x
or
.Ql 0X ,
in base 8 if it begins with
.Ql 0 ,
and in base 10 otherwise.
Only characters that correspond to the base are used.
.It Cm o
Matches an octal integer;
the next pointer must be a pointer to
.Vt "unsigned int" .
.It Cm u
Matches an optionally signed decimal integer;
the next pointer must be a pointer to
.Vt "unsigned int" .
.It Cm x , X
Matches an optionally signed hexadecimal integer;
the next pointer must be a pointer to
.Vt "unsigned int" .
.It Cm e , E , f , F , g , G
Matches an optionally signed floating-point number;
the next pointer must be a pointer to
.Vt float .
.It Cm a , A
Matches a hexadecimal floating-point number represented in the style
.Sm off
.Oo \- Oc Li 0x Ar h Li \&. Ar hhh Cm p Oo \\*[Pm] Oc Ar d .
.Sm on
The next pointer must be a pointer to
.Vt float .
This is an exact conversion of the sign, exponent, and mantissa of the
internal floating-point representation; the
.Sm off
.Oo \- Oc Li 0x Ar h Li \&. Ar hhh
.Sm on
portion represents the mantissa exactly; only denormalized
mantissas have a zero value to the left of the hexadecimal
point.
The
.Cm p
is a literal character
.Ql p ;
the exponent is preceded by a positive or negative sign
and is represented in decimal.
.It Cm s
Matches a sequence of non-white-space characters;
the next pointer must be a pointer to
.Vt char ,
and the array must be large enough to accept the entire sequence and the
terminating
.Dv NUL
character.
The input string stops at white space
or at the maximum field width, whichever occurs first.
.It Cm S
The same as
.Cm ls .
.It Cm c
Matches a sequence of
.Em width
count
characters (default 1);
the next pointer must be a pointer to
.Vt char ,
and there must be enough room for all the characters
(no terminating
.Dv NUL
is added).
The usual skip of leading white space is suppressed.
To skip white space first, use an explicit space in the format.
.It Cm C
The same as
.Cm lc .
.It Cm \&[
Matches a nonempty sequence of characters from the specified set
of accepted characters;
the next pointer must be a pointer to
.Vt char ,
and there must be enough room for all the characters in the string,
plus a terminating
.Dv NUL
character.
The usual skip of leading white space is suppressed.
The string is to be made up of characters in
(or not in)
a particular set;
the set is defined by the characters between the open bracket
.Cm [
character
and a close bracket
.Cm ]
character.
The set
.Em excludes
those characters
if the first character after the open bracket is a circumflex
.Cm ^ .
To include a close bracket in the set,
make it the first character after the open bracket
or the circumflex;
any other position will end the set.
The hyphen character
.Cm -
is also special;
when placed between two other characters,
it adds all intervening characters to the set.
To include a hyphen,
make it the last character before the final close bracket.
For instance,
.Ql [^]0-9-]
means the set
.Dq "everything except close bracket, zero through nine, and hyphen" .
The string ends with the appearance of a character not in the
(or, with a circumflex, in) set
or when the field width runs out.
.It Cm p
Matches a pointer value (as printed by
.Ql %p
in
.Xr printf 3 ) ;
the next pointer must be a pointer to
.Vt "void *" .
.It Cm n
Nothing is expected;
instead, the number of characters consumed thus far from the input
is stored through the next pointer,
which must be a pointer to
.Vt int .
This is
.Em not
a conversion, although it can be suppressed with the
.Cm *
flag.
.El
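.Pp
The bracket and
.Cm c
conversions described above can be sketched as follows.
The helpers
.Fn split_key_value
and
.Fn first_nonblank
and the 32-byte buffer size are hypothetical.
.Bd -literal -offset indent
```c
#include <stdio.h>

/*
 * Split "key=value" into two 32-byte buffers.
 * "%31[^=]" accepts up to 31 characters other than '='.
 * "%31[0-9A-Za-z_-]" accepts digits, letters, underscore, and hyphen;
 * the hyphen is last in the set so that it stands for itself.
 * Returns the number of items assigned.
 */
int
split_key_value(const char *str, char key[32], char value[32])
{
	return (sscanf(str, "%31[^=]=%31[0-9A-Za-z_-]", key, value));
}

/*
 * The c conversion does not skip white space by itself; the explicit
 * blank in the format does.  No NUL is stored, so a single char is
 * enough room.
 */
int
first_nonblank(const char *str, char *c)
{
	return (sscanf(str, " %c", c));
}
```
.Ed
.Pp
Because a bracket conversion must match a nonempty sequence, an input
that begins with
.Ql =
assigns nothing in this sketch.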
.Pp
The decimal point
character is defined in the program's locale (category
.Dv LC_NUMERIC ) .
.Pp
For backwards compatibility, a
.Dq conversion
of
.Ql %\e0
causes an immediate return of
.Dv EOF .
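.Pp
As a sketch of the
.Cm i
and
.Cm n
conversions listed above, the following hypothetical helper,
.Fn parse_int_prefix ,
reads an integer in any of the three bases and reports how much of the
string it consumed.
.Bd -literal -offset indent
```c
#include <stdio.h>

/*
 * Parse a leading integer with %i (base chosen from the "0x" or "0"
 * prefix) and record, through %n, how many characters of str were
 * consumed.  %n is not counted in the return value of sscanf(), so a
 * result of 1 means the integer itself was assigned.
 * Returns 1 on success and 0 otherwise.
 */
int
parse_int_prefix(const char *str, int *value, int *consumed)
{
	*consumed = 0;
	return (sscanf(str, "%i%n", value, consumed) == 1);
}
```
.Ed
.Pp
The stored count can then be used as an offset into
.Fa str
to continue parsing after the integer.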
.Sh RETURN VALUES
These functions return
the number of input items assigned, which can be fewer than provided
for, or even zero, in the event of a matching failure.
Zero
indicates that, while there was input available,
no conversions were assigned;
typically this is due to an invalid input character,
such as an alphabetic character for a
.Ql %d
conversion.
The value
.Dv EOF
is returned if an input failure, such as an end-of-file,
occurs before any conversion.
If an error or end-of-file occurs after conversion
has begun,
the number of conversions which were successfully completed is returned.
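.Pp
One way to tell these outcomes apart is sketched below.
The enumeration
.Vt "enum scan_status"
and the helper
.Fn scan_two_ints
are hypothetical names used only for illustration.
.Bd -literal -offset indent
```c
#include <stdio.h>

/* Hypothetical classification of the sscanf() return value. */
enum scan_status {
	SCAN_OK,	/* both items assigned */
	SCAN_PARTIAL,	/* only the first item assigned */
	SCAN_NOMATCH,	/* input present, nothing assigned */
	SCAN_EOF	/* input ran out before any conversion */
};

enum scan_status
scan_two_ints(const char *str, int *a, int *b)
{
	int n;

	n = sscanf(str, "%d %d", a, b);
	if (n == EOF)
		return (SCAN_EOF);
	if (n == 0)
		return (SCAN_NOMATCH);
	if (n == 1)
		return (SCAN_PARTIAL);
	return (SCAN_OK);
}
```
.Ed
.Pp
Comparing the result against the number of expected items, rather than
testing only for
.Dv EOF ,
catches partial matches as well.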
.Sh SEE ALSO
.Xr getc 3 ,
.Xr printf 3 ,
.Xr strtod 3 ,
.Xr strtol 3 ,
.Xr strtoul 3
.Sh STANDARDS
The functions
.Fn fscanf ,
.Fn scanf ,
and
.Fn sscanf
conform to
.St -isoC-99 .
.Sh BUGS
Earlier implementations of
.Nm
treated
.Cm \&%D , \&%E , \&%F , \&%O
and
.Cm \&%X
as their lowercase equivalents with an
.Cm l
modifier.
In addition,
.Nm
treated an unknown conversion character as
.Cm \&%d
or
.Cm \&%D ,
depending on its case.
This functionality has been removed.
.Pp
Numerical strings are truncated to 512 characters; for example,
.Cm %f
and
.Cm %d
are implicitly
.Cm %512f
and
.Cm %512d .
.Pp
The
.Cm %n$
modifiers for positional arguments are not implemented.
.Pp
The
.Cm l
modifier for
.Cm %c
and
.Cm %s
(and
.Cm %C
and
.Cm %S )
to specify wide characters and strings is not implemented.
.Pp
The
.Cm \&%a
and
.Cm \&%A
floating-point formats are not implemented.
511