1.\" Copyright (c) 1990, 1991, 1993 2.\" The Regents of the University of California. All rights reserved. 3.\" 4.\" This code is derived from software contributed to Berkeley by 5.\" Chris Torek and the American National Standards Committee X3, 6.\" on Information Processing Systems. 7.\" 8.\" Redistribution and use in source and binary forms, with or without 9.\" modification, are permitted provided that the following conditions 10.\" are met: 11.\" 1. Redistributions of source code must retain the above copyright 12.\" notice, this list of conditions and the following disclaimer. 13.\" 2. Redistributions in binary form must reproduce the above copyright 14.\" notice, this list of conditions and the following disclaimer in the 15.\" documentation and/or other materials provided with the distribution. 16.\" 3. All advertising materials mentioning features or use of this software 17.\" must display the following acknowledgement: 18.\" This product includes software developed by the University of 19.\" California, Berkeley and its contributors. 20.\" 4. Neither the name of the University nor the names of its contributors 21.\" may be used to endorse or promote products derived from this software 22.\" without specific prior written permission. 23.\" 24.\" THIS SOFTWARE IS PROVIDED BY THE REGENTS AND CONTRIBUTORS ``AS IS'' AND 25.\" ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE 26.\" IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE 27.\" ARE DISCLAIMED. IN NO EVENT SHALL THE REGENTS OR CONTRIBUTORS BE LIABLE 28.\" FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL 29.\" DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS 30.\" OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) 31.\" HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT 32.\" LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY 33.\" OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF 34.\" SUCH DAMAGE. 35.\" 36.\" @(#)scanf.3 8.2 (Berkeley) 12/11/93 37.\" $FreeBSD$ 38.\" 39.Dd January 4, 2003 40.Dt SCANF 3 41.Os 42.Sh NAME 43.Nm scanf , 44.Nm fscanf , 45.Nm sscanf , 46.Nm vscanf , 47.Nm vsscanf , 48.Nm vfscanf 49.Nd input format conversion 50.Sh LIBRARY 51.Lb libc 52.Sh SYNOPSIS 53.In stdio.h 54.Ft int 55.Fn scanf "const char * restrict format" ... 56.Ft int 57.Fn fscanf "FILE * restrict stream" "const char * restrict format" ... 58.Ft int 59.Fn sscanf "const char * restrict str" "const char * restrict format" ... 60.In stdarg.h 61.Ft int 62.Fn vscanf "const char * restrict format" "va_list ap" 63.Ft int 64.Fn vsscanf "const char * restrict str" "const char * restrict format" "va_list ap" 65.Ft int 66.Fn vfscanf "FILE * restrict stream" "const char * restrict format" "va_list ap" 67.Sh DESCRIPTION 68The 69.Fn scanf 70family of functions scans input according to a 71.Fa format 72as described below. 73This format may contain 74.Em conversion specifiers ; 75the results from such conversions, if any, 76are stored through the 77.Em pointer 78arguments. 79The 80.Fn scanf 81function 82reads input from the standard input stream 83.Dv stdin , 84.Fn fscanf 85reads input from the stream pointer 86.Fa stream , 87and 88.Fn sscanf 89reads its input from the character string pointed to by 90.Fa str . 91The 92.Fn vfscanf 93function 94is analogous to 95.Xr vfprintf 3 96and reads input from the stream pointer 97.Fa stream 98using a variable argument list of pointers (see 99.Xr stdarg 3 ) . 100The 101.Fn vscanf 102function scans a variable argument list from the standard input and 103the 104.Fn vsscanf 105function scans it from a string; 106these are analogous to 107the 108.Fn vprintf 109and 110.Fn vsprintf 111functions respectively. 112Each successive 113.Em pointer 114argument must correspond properly with 115each successive conversion specifier 116(but see the 117.Cm * 118conversion below). 119All conversions are introduced by the 120.Cm % 121(percent sign) character. 122The 123.Fa format 124string 125may also contain other characters. 126White space (such as blanks, tabs, or newlines) in the 127.Fa format 128string match any amount of white space, including none, in the input. 129Everything else 130matches only itself. 131Scanning stops 132when an input character does not match such a format character. 133Scanning also stops 134when an input conversion cannot be made (see below). 135.Sh CONVERSIONS 136Following the 137.Cm % 138character introducing a conversion 139there may be a number of 140.Em flag 141characters, as follows: 142.Bl -tag -width ".Cm l No (ell)" 143.It Cm * 144Suppresses assignment. 145The conversion that follows occurs as usual, but no pointer is used; 146the result of the conversion is simply discarded. 147.It Cm hh 148Indicates that the conversion will be one of 149.Cm dioux 150or 151.Cm n 152and the next pointer is a pointer to a 153.Vt char 154(rather than 155.Vt int ) . 156.It Cm h 157Indicates that the conversion will be one of 158.Cm dioux 159or 160.Cm n 161and the next pointer is a pointer to a 162.Vt "short int" 163(rather than 164.Vt int ) . 165.It Cm l No (ell) 166Indicates that the conversion will be one of 167.Cm dioux 168or 169.Cm n 170and the next pointer is a pointer to a 171.Vt "long int" 172(rather than 173.Vt int ) , 174that the conversion will be one of 175.Cm aefg 176and the next pointer is a pointer to 177.Vt double 178(rather than 179.Vt float ) , 180or that the conversion will be one of 181.Cm c , 182.Cm s 183or 184.Cm \&[ 185and the next pointer is a pointer to an array of 186.Vt wchar_t 187(rather than 188.Vt char ) . 189.It Cm ll No (ell ell) 190Indicates that the conversion will be one of 191.Cm dioux 192or 193.Cm n 194and the next pointer is a pointer to a 195.Vt "long long int" 196(rather than 197.Vt int ) . 198.It Cm L 199Indicates that the conversion will be one of 200.Cm aef 201or 202.Cm g 203and the next pointer is a pointer to 204.Vt "long double" . 205(This type is not implemented; although the argument is 206required to be a pointer to 207.Vt "long double" , 208no additional precision is used in the conversion.) 209.It Cm j 210Indicates that the conversion will be one of 211.Cm dioux 212or 213.Cm n 214and the next pointer is a pointer to a 215.Vt intmax_t 216(rather than 217.Vt int ) . 218.It Cm t 219Indicates that the conversion will be one of 220.Cm dioux 221or 222.Cm n 223and the next pointer is a pointer to a 224.Vt ptrdiff_t 225(rather than 226.Vt int ) . 227.It Cm z 228Indicates that the conversion will be one of 229.Cm dioux 230or 231.Cm n 232and the next pointer is a pointer to a 233.Vt size_t 234(rather than 235.Vt int ) . 236.It Cm q 237(deprecated.) 238Indicates that the conversion will be one of 239.Cm dioux 240or 241.Cm n 242and the next pointer is a pointer to a 243.Vt "long long int" 244(rather than 245.Vt int ) . 246.El 247.Pp 248In addition to these flags, 249there may be an optional maximum field width, 250expressed as a decimal integer, 251between the 252.Cm % 253and the conversion. 254If no width is given, 255a default of 256.Dq infinity 257is used (with one exception, below); 258otherwise at most this many bytes are scanned 259in processing the conversion. 260In the case of the 261.Cm lc , 262.Cm ls 263and 264.Cm l[ 265conversions, the field width specifies the maximum number 266of multibyte characters that will be scanned. 267Before conversion begins, 268most conversions skip white space; 269this white space is not counted against the field width. 270.Pp 271The following conversions are available: 272.Bl -tag -width XXXX 273.It Cm % 274Matches a literal 275.Ql % . 276That is, 277.Dq Li %% 278in the format string 279matches a single input 280.Ql % 281character. 282No conversion is done, and assignment does not occur. 283.It Cm d 284Matches an optionally signed decimal integer; 285the next pointer must be a pointer to 286.Vt int . 287.It Cm i 288Matches an optionally signed integer; 289the next pointer must be a pointer to 290.Vt int . 291The integer is read in base 16 if it begins 292with 293.Ql 0x 294or 295.Ql 0X , 296in base 8 if it begins with 297.Ql 0 , 298and in base 10 otherwise. 299Only characters that correspond to the base are used. 300.It Cm o 301Matches an octal integer; 302the next pointer must be a pointer to 303.Vt "unsigned int" . 304.It Cm u 305Matches an optionally signed decimal integer; 306the next pointer must be a pointer to 307.Vt "unsigned int" . 308.It Cm x , X 309Matches an optionally signed hexadecimal integer; 310the next pointer must be a pointer to 311.Vt "unsigned int" . 312.It Cm e , E , f , F , g , G 313Matches an optionally signed floating-point number; 314the next pointer must be a pointer to 315.Vt float . 316.It Cm a , A 317Matches a hexadecimal number represented in the style 318.Sm off 319.Oo \- Oc Li 0x Ar h Li \&. Ar hhh Cm p Oo \\*[Pm] Oc Ar d . 320.Sm on 321This is an exact conversion of the sign, exponent, mantissa internal 322floating point representation; the 323.Sm off 324.Oo \- Oc Li 0x Ar h Li \&. Ar hhh 325.Sm on 326portion represents exactly the mantissa; only denormalized 327mantissas have a zero value to the left of the hexadecimal 328point. 329The 330.Cm p 331is a literal character 332.Ql p ; 333the exponent is preceded by a positive or negative sign 334and is represented in decimal. 335.It Cm s 336Matches a sequence of non-white-space characters; 337the next pointer must be a pointer to 338.Vt char , 339and the array must be large enough to accept all the sequence and the 340terminating 341.Dv NUL 342character. 343The input string stops at white space 344or at the maximum field width, whichever occurs first. 345.Pp 346If an 347.Cm l 348qualifier is present, the next pointer must be a pointer to 349.Vt wchar_t , 350into which the input will be placed after conversion by 351.Xr mbrtowc 3 . 352.It Cm S 353The same as 354.Cm ls . 355.It Cm c 356Matches a sequence of 357.Em width 358count 359characters (default 1); 360the next pointer must be a pointer to 361.Vt char , 362and there must be enough room for all the characters 363(no terminating 364.Dv NUL 365is added). 366The usual skip of leading white space is suppressed. 367To skip white space first, use an explicit space in the format. 368.Pp 369If an 370.Cm l 371qualifier is present, the next pointer must be a pointer to 372.Vt wchar_t , 373into which the input will be placed after conversion by 374.Xr mbrtowc 3 . 375.It Cm C 376The same as 377.Cm lc . 378.It Cm \&[ 379Matches a nonempty sequence of characters from the specified set 380of accepted characters; 381the next pointer must be a pointer to 382.Vt char , 383and there must be enough room for all the characters in the string, 384plus a terminating 385.Dv NUL 386character. 387The usual skip of leading white space is suppressed. 388The string is to be made up of characters in 389(or not in) 390a particular set; 391the set is defined by the characters between the open bracket 392.Cm [ 393character 394and a close bracket 395.Cm ] 396character. 397The set 398.Em excludes 399those characters 400if the first character after the open bracket is a circumflex 401.Cm ^ . 402To include a close bracket in the set, 403make it the first character after the open bracket 404or the circumflex; 405any other position will end the set. 406The hyphen character 407.Cm - 408is also special; 409when placed between two other characters, 410it adds all intervening characters to the set. 411To include a hyphen, 412make it the last character before the final close bracket. 413For instance, 414.Ql [^]0-9-] 415means the set 416.Dq "everything except close bracket, zero through nine, and hyphen" . 417The string ends with the appearance of a character not in the 418(or, with a circumflex, in) set 419or when the field width runs out. 420.Pp 421If an 422.Cm l 423qualifier is present, the next pointer must be a pointer to 424.Vt wchar_t , 425into which the input will be placed after conversion by 426.Xr mbrtowc 3 . 427.It Cm p 428Matches a pointer value (as printed by 429.Ql %p 430in 431.Xr printf 3 ) ; 432the next pointer must be a pointer to 433.Vt void . 434.It Cm n 435Nothing is expected; 436instead, the number of characters consumed thus far from the input 437is stored through the next pointer, 438which must be a pointer to 439.Vt int . 440This is 441.Em not 442a conversion, although it can be suppressed with the 443.Cm * 444flag. 445.El 446.Pp 447The decimal point 448character is defined in the program's locale (category 449.Dv LC_NUMERIC ) . 450.Pp 451For backwards compatibility, a 452.Dq conversion 453of 454.Ql %\e0 455causes an immediate return of 456.Dv EOF . 457.Sh RETURN VALUES 458These 459functions 460return 461the number of input items assigned, which can be fewer than provided 462for, or even zero, in the event of a matching failure. 463Zero 464indicates that, while there was input available, 465no conversions were assigned; 466typically this is due to an invalid input character, 467such as an alphabetic character for a 468.Ql %d 469conversion. 470The value 471.Dv EOF 472is returned if an input failure occurs before any conversion such as an 473end-of-file occurs. 474If an error or end-of-file occurs after conversion 475has begun, 476the number of conversions which were successfully completed is returned. 477.Sh SEE ALSO 478.Xr getc 3 , 479.Xr mbrtowc 3 , 480.Xr printf 3 , 481.Xr strtod 3 , 482.Xr strtol 3 , 483.Xr strtoul 3 , 484.Xr wscanf 3 485.Sh STANDARDS 486The functions 487.Fn fscanf , 488.Fn scanf , 489.Fn sscanf , 490.Fn vfscanf , 491.Fn vscanf 492and 493.Fn vsscanf 494conform to 495.St -isoC-99 . 496.Sh BUGS 497Earlier implementations of 498.Nm 499treated 500.Cm \&%D , \&%E , \&%F , \&%O 501and 502.Cm \&%X 503as their lowercase equivalents with an 504.Cm l 505modifier. 506In addition, 507.Nm 508treated an unknown conversion character as 509.Cm \&%d 510or 511.Cm \&%D , 512depending on its case. 513This functionality has been removed. 514.Pp 515Numerical strings are truncated to 512 characters; for example, 516.Cm %f 517and 518.Cm %d 519are implicitly 520.Cm %512f 521and 522.Cm %512d . 523.Pp 524The 525.Cm %n$ 526modifiers for positional arguments are not implemented. 527.Pp 528The 529.Cm \&%a 530and 531.Cm \&%A 532floating-point formats are not implemented. 533.Pp 534The 535.Nm 536family of functions do not correctly handle multibyte characters in the 537.Fa format 538argument. 539