xref: /freebsd/lib/libc/stdio/scanf.3 (revision 51015e6d0f570239b0c2088dc6cf2b018928375d)
1.\" Copyright (c) 1990, 1991, 1993
2.\"	The Regents of the University of California.  All rights reserved.
3.\"
4.\" This code is derived from software contributed to Berkeley by
5.\" Chris Torek and the American National Standards Committee X3,
6.\" on Information Processing Systems.
7.\"
8.\" Redistribution and use in source and binary forms, with or without
9.\" modification, are permitted provided that the following conditions
10.\" are met:
11.\" 1. Redistributions of source code must retain the above copyright
12.\"    notice, this list of conditions and the following disclaimer.
13.\" 2. Redistributions in binary form must reproduce the above copyright
14.\"    notice, this list of conditions and the following disclaimer in the
15.\"    documentation and/or other materials provided with the distribution.
16.\" 3. Neither the name of the University nor the names of its contributors
17.\"    may be used to endorse or promote products derived from this software
18.\"    without specific prior written permission.
19.\"
20.\" THIS SOFTWARE IS PROVIDED BY THE REGENTS AND CONTRIBUTORS ``AS IS'' AND
21.\" ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE
22.\" IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE
23.\" ARE DISCLAIMED.  IN NO EVENT SHALL THE REGENTS OR CONTRIBUTORS BE LIABLE
24.\" FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL
25.\" DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS
26.\" OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION)
27.\" HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT
28.\" LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY
29.\" OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF
30.\" SUCH DAMAGE.
31.\"
32.\"     @(#)scanf.3	8.2 (Berkeley) 12/11/93
33.\" $FreeBSD$
34.\"
35.Dd April 2, 2022
36.Dt SCANF 3
37.Os
38.Sh NAME
39.Nm scanf ,
40.Nm fscanf ,
41.Nm sscanf ,
42.Nm vscanf ,
43.Nm vsscanf ,
44.Nm vfscanf
45.Nd input format conversion
46.Sh LIBRARY
47.Lb libc
48.Sh SYNOPSIS
49.In stdio.h
50.Ft int
51.Fn scanf "const char * restrict format" ...
52.Ft int
53.Fn fscanf "FILE * restrict stream" "const char * restrict format" ...
54.Ft int
55.Fn sscanf "const char * restrict str" "const char * restrict format" ...
56.In stdarg.h
57.Ft int
58.Fn vscanf "const char * restrict format" "va_list ap"
59.Ft int
60.Fn vsscanf "const char * restrict str" "const char * restrict format" "va_list ap"
61.Ft int
62.Fn vfscanf "FILE * restrict stream" "const char * restrict format" "va_list ap"
63.Sh DESCRIPTION
64The
65.Fn scanf
66family of functions scans input according to a
67.Fa format
68as described below.
69This format may contain
70.Em conversion specifiers ;
71the results from such conversions, if any,
72are stored through the
73.Em pointer
74arguments.
75The
76.Fn scanf
77function
78reads input from the standard input stream
79.Dv stdin ,
80.Fn fscanf
81reads input from the stream pointer
82.Fa stream ,
83and
84.Fn sscanf
85reads its input from the character string pointed to by
86.Fa str .
87The
88.Fn vfscanf
89function
90is analogous to
91.Xr vfprintf 3
92and reads input from the stream pointer
93.Fa stream
94using a variable argument list of pointers (see
95.Xr stdarg 3 ) .
96The
97.Fn vscanf
98function scans a variable argument list from the standard input and
99the
100.Fn vsscanf
101function scans it from a string;
102these are analogous to
103the
104.Fn vprintf
105and
106.Fn vsprintf
107functions respectively.
108Each successive
109.Em pointer
110argument must correspond properly with
111each successive conversion specifier
112(but see the
113.Cm *
114conversion below).
115All conversions are introduced by the
116.Cm %
117(percent sign) character.
118The
119.Fa format
120string
121may also contain other characters.
122White space (such as blanks, tabs, or newlines) in the
123.Fa format
124string match any amount of white space, including none, in the input.
125Everything else
126matches only itself.
127Scanning stops
128when an input character does not match such a format character.
129Scanning also stops
130when an input conversion cannot be made (see below).
131.Sh CONVERSIONS
132Following the
133.Cm %
134character introducing a conversion
135there may be a number of
136.Em flag
137characters, as follows:
138.Bl -tag -width ".Cm l No (ell)"
139.It Cm *
140Suppresses assignment.
141The conversion that follows occurs as usual, but no pointer is used;
142the result of the conversion is simply discarded.
143.It Cm hh
144Indicates that the conversion will be one of
145.Cm dioux
146or
147.Cm n
148and the next pointer is a pointer to a
149.Vt char
150(rather than
151.Vt int ) .
152.It Cm h
153Indicates that the conversion will be one of
154.Cm dioux
155or
156.Cm n
157and the next pointer is a pointer to a
158.Vt "short int"
159(rather than
160.Vt int ) .
161.It Cm l No (ell)
162Indicates that the conversion will be one of
163.Cm dioux
164or
165.Cm n
166and the next pointer is a pointer to a
167.Vt "long int"
168(rather than
169.Vt int ) ,
170that the conversion will be one of
171.Cm a , e , f ,
172or
173.Cm g
174and the next pointer is a pointer to
175.Vt double
176(rather than
177.Vt float ) ,
178or that the conversion will be one of
179.Cm c ,
180.Cm s
181or
182.Cm \&[
183and the next pointer is a pointer to an array of
184.Vt wchar_t
185(rather than
186.Vt char ) .
187.It Cm ll No (ell ell)
188Indicates that the conversion will be one of
189.Cm dioux
190or
191.Cm n
192and the next pointer is a pointer to a
193.Vt "long long int"
194(rather than
195.Vt int ) .
196.It Cm L
197Indicates that the conversion will be one of
198.Cm a , e , f ,
199or
200.Cm g
201and the next pointer is a pointer to
202.Vt "long double" .
203.It Cm j
204Indicates that the conversion will be one of
205.Cm dioux
206or
207.Cm n
208and the next pointer is a pointer to a
209.Vt intmax_t
210(rather than
211.Vt int ) .
212.It Cm t
213Indicates that the conversion will be one of
214.Cm dioux
215or
216.Cm n
217and the next pointer is a pointer to a
218.Vt ptrdiff_t
219(rather than
220.Vt int ) .
221.It Cm z
222Indicates that the conversion will be one of
223.Cm dioux
224or
225.Cm n
226and the next pointer is a pointer to a
227.Vt size_t
228(rather than
229.Vt int ) .
230.It Cm q
231(deprecated.)
232Indicates that the conversion will be one of
233.Cm dioux
234or
235.Cm n
236and the next pointer is a pointer to a
237.Vt "long long int"
238(rather than
239.Vt int ) .
240.El
241.Pp
242In addition to these flags,
243there may be an optional maximum field width,
244expressed as a decimal integer,
245between the
246.Cm %
247and the conversion.
248If no width is given,
249a default of
250.Dq infinity
251is used (with one exception, below);
252otherwise at most this many bytes are scanned
253in processing the conversion.
254In the case of the
255.Cm lc ,
256.Cm ls
257and
258.Cm l[
259conversions, the field width specifies the maximum number
260of multibyte characters that will be scanned.
261Before conversion begins,
262most conversions skip white space;
263this white space is not counted against the field width.
264.Pp
265The following conversions are available:
266.Bl -tag -width XXXX
267.It Cm %
268Matches a literal
269.Ql % .
270That is,
271.Dq Li %%
272in the format string
273matches a single input
274.Ql %
275character.
276No conversion is done, and assignment does not occur.
277.It Cm d
278Matches an optionally signed decimal integer;
279the next pointer must be a pointer to
280.Vt int .
281.It Cm i
282Matches an optionally signed integer;
283the next pointer must be a pointer to
284.Vt int .
285The integer is read in base 16 if it begins
286with
287.Ql 0x
288or
289.Ql 0X ,
290in base 8 if it begins with
291.Ql 0 ,
292and in base 10 otherwise.
293Only characters that correspond to the base are used.
294.It Cm o
295Matches an octal integer;
296the next pointer must be a pointer to
297.Vt "unsigned int" .
298.It Cm u
299Matches an optionally signed decimal integer;
300the next pointer must be a pointer to
301.Vt "unsigned int" .
302.It Cm x , X
303Matches an optionally signed hexadecimal integer;
304the next pointer must be a pointer to
305.Vt "unsigned int" .
306.It Cm a , A , e , E , f , F , g , G
307Matches a floating-point number in the style of
308.Xr strtod 3 .
309The next pointer must be a pointer to
310.Vt float
311(unless
312.Cm l
313or
314.Cm L
315is specified.)
316.It Cm s
317Matches a sequence of non-white-space characters;
318the next pointer must be a pointer to
319.Vt char ,
320and the array must be large enough to accept all the sequence and the
321terminating
322.Dv NUL
323character.
324The input string stops at white space
325or at the maximum field width, whichever occurs first.
326.Pp
327If an
328.Cm l
329qualifier is present, the next pointer must be a pointer to
330.Vt wchar_t ,
331into which the input will be placed after conversion by
332.Xr mbrtowc 3 .
333.It Cm S
334The same as
335.Cm ls .
336.It Cm c
337Matches a sequence of
338.Em width
339count
340characters (default 1);
341the next pointer must be a pointer to
342.Vt char ,
343and there must be enough room for all the characters
344(no terminating
345.Dv NUL
346is added).
347The usual skip of leading white space is suppressed.
348To skip white space first, use an explicit space in the format.
349.Pp
350If an
351.Cm l
352qualifier is present, the next pointer must be a pointer to
353.Vt wchar_t ,
354into which the input will be placed after conversion by
355.Xr mbrtowc 3 .
356.It Cm C
357The same as
358.Cm lc .
359.It Cm \&[
360Matches a nonempty sequence of characters from the specified set
361of accepted characters;
362the next pointer must be a pointer to
363.Vt char ,
364and there must be enough room for all the characters in the string,
365plus a terminating
366.Dv NUL
367character.
368The usual skip of leading white space is suppressed.
369The string is to be made up of characters in
370(or not in)
371a particular set;
372the set is defined by the characters between the open bracket
373.Cm \&[
374character
375and a close bracket
376.Cm \&]
377character.
378The set
379.Em excludes
380those characters
381if the first character after the open bracket is a circumflex
382.Cm ^ .
383To include a close bracket in the set,
384make it the first character after the open bracket
385or the circumflex;
386any other position will end the set.
387The hyphen character
388.Cm -
389is also special;
390when placed between two other characters,
391it adds all intervening characters to the set.
392To include a hyphen,
393make it the last character before the final close bracket.
394For instance,
395.Ql [^]0-9-]
396means the set
397.Dq "everything except close bracket, zero through nine, and hyphen" .
398The string ends with the appearance of a character not in the
399(or, with a circumflex, in) set
400or when the field width runs out.
401.Pp
402If an
403.Cm l
404qualifier is present, the next pointer must be a pointer to
405.Vt wchar_t ,
406into which the input will be placed after conversion by
407.Xr mbrtowc 3 .
408.It Cm p
409Matches a pointer value (as printed by
410.Ql %p
411in
412.Xr printf 3 ) ;
413the next pointer must be a pointer to
414.Vt void .
415.It Cm n
416Nothing is expected;
417instead, the number of characters consumed thus far from the input
418is stored through the next pointer,
419which must be a pointer to
420.Vt int .
421This is
422.Em not
423a conversion, although it can be suppressed with the
424.Cm *
425flag.
426.El
427.Pp
428The decimal point
429character is defined in the program's locale (category
430.Dv LC_NUMERIC ) .
431.Pp
432For backwards compatibility, a
433.Dq conversion
434of
435.Ql %\e0
436causes an immediate return of
437.Dv EOF .
438.Sh RETURN VALUES
439These
440functions
441return
442the number of input items assigned, which can be fewer than provided
443for, or even zero, in the event of a matching failure.
444Zero
445indicates that, while there was input available,
446no conversions were assigned;
447typically this is due to an invalid input character,
448such as an alphabetic character for a
449.Ql %d
450conversion.
451The value
452.Dv EOF
453is returned if an input failure occurs before any conversion such as an
454end-of-file occurs.
455If an error or end-of-file occurs after conversion
456has begun,
457the number of conversions which were successfully completed is returned.
458.Sh SEE ALSO
459.Xr getc 3 ,
460.Xr mbrtowc 3 ,
461.Xr printf 3 ,
462.Xr strtod 3 ,
463.Xr strtol 3 ,
464.Xr strtoul 3 ,
465.Xr wscanf 3
466.Sh STANDARDS
467The functions
468.Fn fscanf ,
469.Fn scanf ,
470.Fn sscanf ,
471.Fn vfscanf ,
472.Fn vscanf
473and
474.Fn vsscanf
475conform to
476.St -isoC-99 .
477.Sh HISTORY
478The functions
479.Fn scanf ,
480.Fn fscanf ,
481and
482.Fn sscanf
483first appeared in
484.At v7 ,
485and
486.Fn vscanf ,
487.Fn vsscanf ,
488and
489.Fn vfscanf
490in
491.Bx 4.3 Reno .
492.Sh BUGS
493Earlier implementations of
494.Nm
495treated
496.Cm \&%D , \&%E , \&%F , \&%O
497and
498.Cm \&%X
499as their lowercase equivalents with an
500.Cm l
501modifier.
502In addition,
503.Nm
504treated an unknown conversion character as
505.Cm \&%d
506or
507.Cm \&%D ,
508depending on its case.
509This functionality has been removed.
510.Pp
511Numerical strings are truncated to 512 characters; for example,
512.Cm %f
513and
514.Cm %d
515are implicitly
516.Cm %512f
517and
518.Cm %512d .
519.Pp
520The
521.Cm %n$
522modifiers for positional arguments are not implemented.
523.Pp
524The
525.Nm
526family of functions do not correctly handle multibyte characters in the
527.Fa format
528argument.
529