xref: /freebsd/lib/libc/stdio/scanf.3 (revision 4b2eaea43fec8e8792be611dea204071a10b655a)
1.\" Copyright (c) 1990, 1991, 1993
2.\"	The Regents of the University of California.  All rights reserved.
3.\"
4.\" This code is derived from software contributed to Berkeley by
5.\" Chris Torek and the American National Standards Committee X3,
6.\" on Information Processing Systems.
7.\"
8.\" Redistribution and use in source and binary forms, with or without
9.\" modification, are permitted provided that the following conditions
10.\" are met:
11.\" 1. Redistributions of source code must retain the above copyright
12.\"    notice, this list of conditions and the following disclaimer.
13.\" 2. Redistributions in binary form must reproduce the above copyright
14.\"    notice, this list of conditions and the following disclaimer in the
15.\"    documentation and/or other materials provided with the distribution.
16.\" 3. All advertising materials mentioning features or use of this software
17.\"    must display the following acknowledgement:
18.\"	This product includes software developed by the University of
19.\"	California, Berkeley and its contributors.
20.\" 4. Neither the name of the University nor the names of its contributors
21.\"    may be used to endorse or promote products derived from this software
22.\"    without specific prior written permission.
23.\"
24.\" THIS SOFTWARE IS PROVIDED BY THE REGENTS AND CONTRIBUTORS ``AS IS'' AND
25.\" ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE
26.\" IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE
27.\" ARE DISCLAIMED.  IN NO EVENT SHALL THE REGENTS OR CONTRIBUTORS BE LIABLE
28.\" FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL
29.\" DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS
30.\" OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION)
31.\" HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT
32.\" LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY
33.\" OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF
34.\" SUCH DAMAGE.
35.\"
36.\"     @(#)scanf.3	8.2 (Berkeley) 12/11/93
37.\" $FreeBSD$
38.\"
39.Dd January 4, 2003
40.Dt SCANF 3
41.Os
42.Sh NAME
43.Nm scanf ,
44.Nm fscanf ,
45.Nm sscanf ,
46.Nm vscanf ,
47.Nm vsscanf ,
48.Nm vfscanf
49.Nd input format conversion
50.Sh LIBRARY
51.Lb libc
52.Sh SYNOPSIS
53.In stdio.h
54.Ft int
55.Fn scanf "const char * restrict format" ...
56.Ft int
57.Fn fscanf "FILE * restrict stream" "const char * restrict format" ...
58.Ft int
59.Fn sscanf "const char * restrict str" "const char * restrict format" ...
60.In stdarg.h
61.Ft int
62.Fn vscanf "const char * restrict format" "va_list ap"
63.Ft int
64.Fn vsscanf "const char * restrict str" "const char * restrict format" "va_list ap"
65.Ft int
66.Fn vfscanf "FILE * restrict stream" "const char * restrict format" "va_list ap"
67.Sh DESCRIPTION
68The
69.Fn scanf
70family of functions scans input according to a
71.Fa format
72as described below.
73This format may contain
74.Em conversion specifiers ;
75the results from such conversions, if any,
76are stored through the
77.Em pointer
78arguments.
79The
80.Fn scanf
81function
82reads input from the standard input stream
83.Dv stdin ,
84.Fn fscanf
85reads input from the stream pointer
86.Fa stream ,
87and
88.Fn sscanf
89reads its input from the character string pointed to by
90.Fa str .
91The
92.Fn vfscanf
93function
94is analogous to
95.Xr vfprintf 3
96and reads input from the stream pointer
97.Fa stream
98using a variable argument list of pointers (see
99.Xr stdarg 3 ) .
100The
101.Fn vscanf
102function scans a variable argument list from the standard input and
103the
104.Fn vsscanf
105function scans it from a string;
106these are analogous to
107the
108.Fn vprintf
109and
110.Fn vsprintf
111functions respectively.
112Each successive
113.Em pointer
114argument must correspond properly with
115each successive conversion specifier
116(but see the
117.Cm *
118conversion below).
119All conversions are introduced by the
120.Cm %
121(percent sign) character.
122The
123.Fa format
124string
125may also contain other characters.
126White space (such as blanks, tabs, or newlines) in the
127.Fa format
128string match any amount of white space, including none, in the input.
129Everything else
130matches only itself.
131Scanning stops
132when an input character does not match such a format character.
133Scanning also stops
134when an input conversion cannot be made (see below).
135.Sh CONVERSIONS
136Following the
137.Cm %
138character introducing a conversion
139there may be a number of
140.Em flag
141characters, as follows:
142.Bl -tag -width ".Cm l No (ell)"
143.It Cm *
144Suppresses assignment.
145The conversion that follows occurs as usual, but no pointer is used;
146the result of the conversion is simply discarded.
147.It Cm hh
148Indicates that the conversion will be one of
149.Cm dioux
150or
151.Cm n
152and the next pointer is a pointer to a
153.Vt char
154(rather than
155.Vt int ) .
156.It Cm h
157Indicates that the conversion will be one of
158.Cm dioux
159or
160.Cm n
161and the next pointer is a pointer to a
162.Vt "short int"
163(rather than
164.Vt int ) .
165.It Cm l No (ell)
166Indicates that the conversion will be one of
167.Cm dioux
168or
169.Cm n
170and the next pointer is a pointer to a
171.Vt "long int"
172(rather than
173.Vt int ) ,
174that the conversion will be one of
175.Cm aefg
176and the next pointer is a pointer to
177.Vt double
178(rather than
179.Vt float ) ,
180or that the conversion will be one of
181.Cm c ,
182.Cm s
183or
184.Cm \&[
185and the next pointer is a pointer to an array of
186.Vt wchar_t
187(rather than
188.Vt char ) .
189.It Cm ll No (ell ell)
190Indicates that the conversion will be one of
191.Cm dioux
192or
193.Cm n
194and the next pointer is a pointer to a
195.Vt "long long int"
196(rather than
197.Vt int ) .
198.It Cm L
199Indicates that the conversion will be one of
200.Cm aef
201or
202.Cm g
203and the next pointer is a pointer to
204.Vt "long double" .
205(This type is not implemented; although the argument is
206required to be a pointer to
207.Vt "long double" ,
208no additional precision is used in the conversion.)
209.It Cm j
210Indicates that the conversion will be one of
211.Cm dioux
212or
213.Cm n
214and the next pointer is a pointer to a
215.Vt intmax_t
216(rather than
217.Vt int ) .
218.It Cm t
219Indicates that the conversion will be one of
220.Cm dioux
221or
222.Cm n
223and the next pointer is a pointer to a
224.Vt ptrdiff_t
225(rather than
226.Vt int ) .
227.It Cm z
228Indicates that the conversion will be one of
229.Cm dioux
230or
231.Cm n
232and the next pointer is a pointer to a
233.Vt size_t
234(rather than
235.Vt int ) .
236.It Cm q
237(deprecated.)
238Indicates that the conversion will be one of
239.Cm dioux
240or
241.Cm n
242and the next pointer is a pointer to a
243.Vt "long long int"
244(rather than
245.Vt int ) .
246.El
247.Pp
248In addition to these flags,
249there may be an optional maximum field width,
250expressed as a decimal integer,
251between the
252.Cm %
253and the conversion.
254If no width is given,
255a default of
256.Dq infinity
257is used (with one exception, below);
258otherwise at most this many bytes are scanned
259in processing the conversion.
260In the case of the
261.Cm lc ,
262.Cm ls
263and
264.Cm l[
265conversions, the field width specifies the maximum number
266of multibyte characters that will be scanned.
267Before conversion begins,
268most conversions skip white space;
269this white space is not counted against the field width.
270.Pp
271The following conversions are available:
272.Bl -tag -width XXXX
273.It Cm %
274Matches a literal
275.Ql % .
276That is,
277.Dq Li %%
278in the format string
279matches a single input
280.Ql %
281character.
282No conversion is done, and assignment does not occur.
283.It Cm d
284Matches an optionally signed decimal integer;
285the next pointer must be a pointer to
286.Vt int .
287.It Cm i
288Matches an optionally signed integer;
289the next pointer must be a pointer to
290.Vt int .
291The integer is read in base 16 if it begins
292with
293.Ql 0x
294or
295.Ql 0X ,
296in base 8 if it begins with
297.Ql 0 ,
298and in base 10 otherwise.
299Only characters that correspond to the base are used.
300.It Cm o
301Matches an octal integer;
302the next pointer must be a pointer to
303.Vt "unsigned int" .
304.It Cm u
305Matches an optionally signed decimal integer;
306the next pointer must be a pointer to
307.Vt "unsigned int" .
308.It Cm x , X
309Matches an optionally signed hexadecimal integer;
310the next pointer must be a pointer to
311.Vt "unsigned int" .
312.It Cm e , E , f , F , g , G
313Matches an optionally signed floating-point number;
314the next pointer must be a pointer to
315.Vt float .
316.It Cm a , A
317Matches a hexadecimal number represented in the style
318.Sm off
319.Oo \- Oc Li 0x Ar h Li \&. Ar hhh Cm p Oo \\*[Pm] Oc Ar d .
320.Sm on
321This is an exact conversion of the sign, exponent, mantissa internal
322floating point representation; the
323.Sm off
324.Oo \- Oc Li 0x Ar h Li \&. Ar hhh
325.Sm on
326portion represents exactly the mantissa; only denormalized
327mantissas have a zero value to the left of the hexadecimal
328point.
329The
330.Cm p
331is a literal character
332.Ql p ;
333the exponent is preceded by a positive or negative sign
334and is represented in decimal.
335.It Cm s
336Matches a sequence of non-white-space characters;
337the next pointer must be a pointer to
338.Vt char ,
339and the array must be large enough to accept all the sequence and the
340terminating
341.Dv NUL
342character.
343The input string stops at white space
344or at the maximum field width, whichever occurs first.
345.Pp
346If an
347.Cm l
348qualifier is present, the next pointer must be a pointer to
349.Vt wchar_t ,
350into which the input will be placed after conversion by
351.Xr mbrtowc 3 .
352.It Cm S
353The same as
354.Cm ls .
355.It Cm c
356Matches a sequence of
357.Em width
358count
359characters (default 1);
360the next pointer must be a pointer to
361.Vt char ,
362and there must be enough room for all the characters
363(no terminating
364.Dv NUL
365is added).
366The usual skip of leading white space is suppressed.
367To skip white space first, use an explicit space in the format.
368.Pp
369If an
370.Cm l
371qualifier is present, the next pointer must be a pointer to
372.Vt wchar_t ,
373into which the input will be placed after conversion by
374.Xr mbrtowc 3 .
375.It Cm C
376The same as
377.Cm lc .
378.It Cm \&[
379Matches a nonempty sequence of characters from the specified set
380of accepted characters;
381the next pointer must be a pointer to
382.Vt char ,
383and there must be enough room for all the characters in the string,
384plus a terminating
385.Dv NUL
386character.
387The usual skip of leading white space is suppressed.
388The string is to be made up of characters in
389(or not in)
390a particular set;
391the set is defined by the characters between the open bracket
392.Cm [
393character
394and a close bracket
395.Cm ]
396character.
397The set
398.Em excludes
399those characters
400if the first character after the open bracket is a circumflex
401.Cm ^ .
402To include a close bracket in the set,
403make it the first character after the open bracket
404or the circumflex;
405any other position will end the set.
406The hyphen character
407.Cm -
408is also special;
409when placed between two other characters,
410it adds all intervening characters to the set.
411To include a hyphen,
412make it the last character before the final close bracket.
413For instance,
414.Ql [^]0-9-]
415means the set
416.Dq "everything except close bracket, zero through nine, and hyphen" .
417The string ends with the appearance of a character not in the
418(or, with a circumflex, in) set
419or when the field width runs out.
420.Pp
421If an
422.Cm l
423qualifier is present, the next pointer must be a pointer to
424.Vt wchar_t ,
425into which the input will be placed after conversion by
426.Xr mbrtowc 3 .
427.It Cm p
428Matches a pointer value (as printed by
429.Ql %p
430in
431.Xr printf 3 ) ;
432the next pointer must be a pointer to
433.Vt void .
434.It Cm n
435Nothing is expected;
436instead, the number of characters consumed thus far from the input
437is stored through the next pointer,
438which must be a pointer to
439.Vt int .
440This is
441.Em not
442a conversion, although it can be suppressed with the
443.Cm *
444flag.
445.El
446.Pp
447The decimal point
448character is defined in the program's locale (category
449.Dv LC_NUMERIC ) .
450.Pp
451For backwards compatibility, a
452.Dq conversion
453of
454.Ql %\e0
455causes an immediate return of
456.Dv EOF .
457.Sh RETURN VALUES
458These
459functions
460return
461the number of input items assigned, which can be fewer than provided
462for, or even zero, in the event of a matching failure.
463Zero
464indicates that, while there was input available,
465no conversions were assigned;
466typically this is due to an invalid input character,
467such as an alphabetic character for a
468.Ql %d
469conversion.
470The value
471.Dv EOF
472is returned if an input failure occurs before any conversion such as an
473end-of-file occurs.
474If an error or end-of-file occurs after conversion
475has begun,
476the number of conversions which were successfully completed is returned.
477.Sh SEE ALSO
478.Xr getc 3 ,
479.Xr mbrtowc 3 ,
480.Xr printf 3 ,
481.Xr strtod 3 ,
482.Xr strtol 3 ,
483.Xr strtoul 3 ,
484.Xr wscanf 3
485.Sh STANDARDS
486The functions
487.Fn fscanf ,
488.Fn scanf ,
489.Fn sscanf ,
490.Fn vfscanf ,
491.Fn vscanf
492and
493.Fn vsscanf
494conform to
495.St -isoC-99 .
496.Sh BUGS
497Earlier implementations of
498.Nm
499treated
500.Cm \&%D , \&%E , \&%F , \&%O
501and
502.Cm \&%X
503as their lowercase equivalents with an
504.Cm l
505modifier.
506In addition,
507.Nm
508treated an unknown conversion character as
509.Cm \&%d
510or
511.Cm \&%D ,
512depending on its case.
513This functionality has been removed.
514.Pp
515Numerical strings are truncated to 512 characters; for example,
516.Cm %f
517and
518.Cm %d
519are implicitly
520.Cm %512f
521and
522.Cm %512d .
523.Pp
524The
525.Cm %n$
526modifiers for positional arguments are not implemented.
527.Pp
528The
529.Cm \&%a
530and
531.Cm \&%A
532floating-point formats are not implemented.
533.Pp
534The
535.Nm
536family of functions do not correctly handle multibyte characters in the
537.Fa format
538argument.
539