'\" te .\" Copyright (c) 2009 Sun Microsystems, Inc. All rights reserved. .\" The contents of this file are subject to the terms of the Common Development and Distribution License (the "License"). You may not use this file except in compliance with the License. You can obtain a copy of the license at usr/src/OPENSOLARIS.LICENSE or http://www.opensolaris.org/os/licensing. .\" See the License for the specific language governing permissions and limitations under the License. When distributing Covered Code, include this CDDL HEADER in each file and include the License file at usr/src/OPENSOLARIS.LICENSE. If applicable, add the following below this CDDL HEADER, with the .\" fields enclosed by brackets "[]" replaced with your own identifying information: Portions Copyright [yyyy] [name of copyright owner] .TH DSP 7I "May 11, 2009" .SH NAME dsp \- generic audio device interface .SH SYNOPSIS .LP .nf \fB#include\fR \fB\fR .fi .SH DESCRIPTION .sp .LP To record audio input, applications \fBopen()\fR the appropriate device and read data from it using the \fBread()\fR system call. Similarly, sound data is queued to the audio output port by using the \fBwrite\fR(2) system call. Device configuration is performed using the \fBioctl\fR(2) interface. .sp .LP Because some systems can contain more than one audio device, application writers are encouraged to open the \fB/dev/mixer\fR device and determine the physical devices present on the system using the \fBSNDCTL_SYSINFO\fR and \fBSNDCTL_AUDIOINFO\fR ioctls. See \fBmixer\fR(7I). The user should be provided a the ability to select a different audio device, or alternatively, an environment variable such as \fBAUDIODSP\fR can be used. In the absence of any specific configuration from the user, the generic device file, \fB/dev/dsp\fR, can be used. This normally points to a reasonably appropriate default audio device for the system. .SS "Opening the Audio Device" .sp .LP The audio device is not treated as an exclusive resource. .sp .LP Each \fBopen()\fR completes as long as there are channels available to be allocated. If no channels are available to be allocated, the call returns \fB-1\fR with the \fBerrno\fR set to \fBEBUSY\fR. .sp .LP Audio applications should explicitly set the encoding characteristics to match the audio data requirements after opening the device, and not depend on any default configuration. .SS "Recording Audio Data" .sp .LP The \fBread()\fR system call copies data from the system's buffers to the application. Ordinarily, \fBread()\fR blocks until the user buffer is filled. The \fBpoll\fR(2) system call can be used to determine the presence of data that can be read without blocking. The device can alternatively be set to a non-blocking mode, in which case \fBread()\fR completes immediately, but can return fewer bytes than requested. Refer to the \fBread\fR(2) manual page for a complete description of this behavior. .sp .LP When the audio device is opened with read access, the device driver allocates resources for recording. Since this consumes system resources, processes that do not record audio data should open the device write-only (\fBO_WRONLY\fR). .sp .LP The recording process can be stopped by using the \fBSNDCTL_DSP_HALT_INPUT\fR ioctl, which also discards all pending record data in underlying device FIFOs. .sp .LP Before changing record parameters, the input should be stopped using the \fBSNDCTL_DSP_HALT_INPUT\fR ioctl, which also flushes the any underlying device input FIFOs. (This is not necessary if the process never started recording by calling \fBread\fR(2). Otherwise, subsequent reads can return samples in the old format followed by samples in the new format. This is particularly important when new parameters result in a changed sample size. .sp .LP Input data can accumulate in device buffers very quickly. At a minimum, it accumulates at 8000 bytes per second for 8-bit, 8 KHz, mono, \fBu-Law\fR data. If the device is configured for more channels, higher sample resolution, or higher sample rates, it accumulates even faster. If the application that consumes the data cannot keep up with this data rate, the underlying FIFOs can become full. When this occurs, any new incoming data is lost until the application makes room available by consuming data. Additionally, a record overrun is noted, which can be retrieved using the \fBSNDCTL_DSP_GETERROR\fR ioctl. .sp .LP Record volume for a stream can be adjusted by issuing the \fBSNDCTL_DSP_SETRECVOL\fR ioctl. The volume can also be retrieved using the \fBSNDCTL_DSP_GETRECVOL\fR. .SS "Playing Audio Data" .sp .LP The \fBwrite()\fR system call copies data from an application's buffer to the device output FIFO. Ordinarily, \fBwrite()\fR blocks until the entire user buffer is transferred. The device can alternatively be set to a non-blocking mode, in which case \fBwrite()\fRcompletes immediately, but might have transferred fewer bytes than requested. See \fBwrite\fR(2). .sp .LP Although \fBwrite()\fR returns when the data is successfully queued, the actual completion of audio output might take considerably longer. The \fBSNDCTL_DSP_SYNC\fR ioctl can be issued to allow an application to block until all of the queued output data has been played. .sp .LP The final \fBclose\fR(2) of the file descriptor waits until all of the audio output has drained. If a signal interrupts the \fBclose()\fR, or if the process exits without closing the device, any remaining data queued for audio output is flushed and the device is closed immediately. .sp .LP The output of playback data can be halted entirely, by calling the \fBSNDCTL_DSP_HALT_OUTPUT\fR ioctl. This also discards any data that is queued for playback in device FIFOs. .sp .LP Before changing playback parameters, the output should be drained using the \fBSNDCTL_DSP_SYNC\fR ioctl, and then stopped using the \fBSNDCTL_DSP_HALT_OUTPUT\fR ioctl, which also flushes the any underlying device output FIFOs. This is not necessary if the process never started playback, such as by calling \fBwrite\fR(2). This is particularly important when new parameters result in a changed sample size. .sp .LP Output data is played from the playback buffers at a default rate of at least 8000 bytes per second for u-Law, A-Law or 8-bit PCM data (faster for 16-bit linear data or higher sampling rates). If the output FIFO becomes empty, the framework plays silence, resulting in audible stall or click in the output, until more data is supplied by the application. The condition is also noted as a play underrun, which can be determined using the \fBSNDCTL_DSP_GETERROR\fR ioctl. .sp .LP Playback volume for a stream can be adjusted by issuing the \fBSNDCTL_DSP_SETPLAYVOL\fR ioctl. The volume can also be retrieved using the \fBSNDCTL_DSP_GETPLAYVOL\fR. .SS "Asynchronous I/O" .sp .LP The \fBO_NONBLOCK\fR flag can be set using the \fBF_SETFL\fR \fBfcntl\fR(2) to enable non-blocking \fBread()\fR and \fBwrite()\fR requests. This is normally sufficient for applications to maintain an audio stream in the background. .sp .LP It is also possible to determine the amount of data that can be transferred for playback or recording without blocking using the \fBSNDCTL_DSP_GETOSPACE\fR or \fBSNDCTL_DSP_GETISPACE\fR ioctls, respectively. .SS "Mixer Pseudo-Device" .sp .LP The \fB/dev/mixer\fR provides access to global hardware settings such as master volume settings, etc. It is also the interface used for determining the hardware configuration on the system. .sp .LP Applications should \fBopen\fR(2) \fB/dev/mixer\fR, and use the \fBSNDCTL_SYSINFO\fR and \fBSNDCTL_AUDIOINFO\fR ioctls to determine the device node names of audio devices on the system. See \fBmixer\fR(7I) for additional details. .SH IOCTLS .SS "Information IOCTLs" .sp .LP The following ioctls are supported on the audio device, as well as the mixer device. See \fBmixer\fR(7I) for details. .sp .in +2 .nf OSS_GETVERSION SNDCTL_SYSINFO SNDCTL_AUDIOINFO SNDCTL_MIXERINFO SNDCTL_CARDINFO .fi .in -2 .SS "Audio IOCTLs" .sp .LP The \fBdsp\fR device supports the following ioctl commands: .sp .ne 2 .na \fB\fBSNDCTL_DSP_SYNC\fR\fR .ad .RS 27n The argument is ignored. This command suspends the calling process until the output FIFOs are empty and all queued samples have been played, or until a signal is delivered to the calling process. An implicit \fBSNDCTL_DSP_SYNC\fR is performed on the final \fBclose()\fR of the \fBdsp\fR device. .sp This ioctl should not be used unnecessarily, as if it is used in the middle of playback it causes a small click or pause, as the FIFOs are drained. The correct use of this ioctl is just before changing sample formats. .RE .sp .ne 2 .na \fB\fBSNDCTL_DSP_HALT\fR\fR .ad .br .na \fB\fBSNDCTL_DSP_HALT_INPUT\fR\fR .ad .br .na \fB\fBSNDCTL_DSP_HALT_OUTPUT\fR\fR .ad .RS 27n The argument is ignored. All input or output (or both) associated with the file is halted, and any pending data is discarded. .RE .sp .ne 2 .na \fB\fBSNDCTL_DSP_SPEED\fR\fR .ad .RS 27n The argument is a pointer to an integer, indicating the sample rate (in Hz) to be used. The rate applies to both input and output for the file descriptor. On return the actual rate, which can differ from that requested, is stored in the integer pointed to by the argument. To query the configured speed without changing it the value 0 can be used by the application .RE .sp .ne 2 .na \fB\fBSNDCTL_DSP_GETFMTS\fR\fR .ad .RS 27n The argument is a pointer to an integer, which receives a bit mask of encodings supported by the device. Possible values are .sp .in +2 .nf AFMT_MU_LAW 8-bit unsigned u-Law AFMT_A_LAW 8-bit unsigned a-Law AFMT_U8 8-bit unsigned linear PCM AFMT_S16_LE 16-bit signed little-endian linear PCM AFMT_S16_BE 16-bit signed big-endian linear PCM AFMT_S16_NE 16-bit signed native-endian linear PCM AFMT_U16_LE 16-bit unsigned little-endian linear PCM AFMT_U16_BE 16-bit unsigned big-endian linear PCM AFMT_U16_NE 16-bit unsigned big-endian linear PCM AFMT_S24_LE 24-bit signed little-endian linear PCM, 32-bit aligned AFMT_S24_BE 24-bit signed big-endian linear PCM, 32-bit aligned AFMT_S24_NE 24-bit signed native-endian linear PCM, 32-bit aligned AFMT_S32_LE 32-bit signed little-endian linear PCM AFMT_S32_BE 32-bit signed big-endian linear PCM AFMT_S32_NE 32-bit signed native-endian linear PCM AFMT_S24_PACKED 24-bit signed little-endian packed linear PCM .fi .in -2 .sp Not all devices support all of these encodings. This implementation uses \fBAFMT_S24_LE\fR or \fBAFMT_S24_BE\fR, whichever is native, internally. .RE .sp .ne 2 .na \fB\fBSNDCTL_DSP_SETFMT\fR\fR .ad .RS 27n The argument is a pointer to an integer, which indicates the encoding to be used. The same values as for \fBSNDCTL_DSP_GETFMT\fR can be used, but the caller can only specify a single option. The encoding is used for both input and output performed on the file descriptor. .RE .sp .ne 2 .na \fB\fBSNDCTL_DSP_CHANNELS\fR\fR .ad .RS 27n The argument is a pointer to an integer, indicating the number of channels to be used (1 for mono, 2 for stereo, etc.) The value applies to both input and output for the file descriptor. On return the actual channel configuration (which can differ from that requested) is stored in the integer pointed to by the argument. To query the configured channels without changing it the value 0 can be used by the application. .RE .sp .ne 2 .na \fB\fBSNDDCTL_DSP_GETCAPS\fR\fR .ad .RS 27n The argument is a pointer to an integer bit mask, which indicates the capabilities of the device. The bits returned can include .sp .in +2 .nf PCM_CAP_OUTPUT Device supports playback PCM_CAP_INPUT Device supports recording PCM_CAP_DUPLEX Device supports simultaneous playback and recording .fi .in -2 .sp .RE .sp .ne 2 .na \fB\fBSNDCTL_DSP_GETPLAYVOL\fR\fR .ad .br .na \fB\fBSNDCTL_DSP_GETRECVOL\fR\fR .ad .RS 27n The argument is a pointer to an integer to receive the volume level for either playback or record. The value is encoded as a stereo value with the values for two channels in the least significant two bytes. The value for each channel thus has a range of 0-100. In this implementation, only the low order byte is used, as the value is treated as a monophonic value, but a stereo value (with both channel levels being identical) is returned for compatibility. .RE .sp .ne 2 .na \fB\fBSNDCTL_DSP_SETPLAYVOL\fR\fR .ad .br .na \fB\fBSNDCTL_DSP_SETRECVOL\fR\fR .ad .RS 27n The argument is a pointer to an integer indicating volume level for either playback or record. The value is encoded as a stereo value with the values for two channels in the least significant two bytes. The value for each channel has a range of 0-100. Note that in this implementation, only the low order byte is used, as the value is treated as a monophonic value. Portable applications should assign the same value to both bytes .RE .sp .ne 2 .na \fB\fBSNDCTL_DSP_GETISPACE\fR\fR .ad .br .na \fB\fBSNDCTL_DSP_GETOSPACE\fR\fR .ad .RS 27n The argument is a pointer to a \fBstruct audio_buf_info\fR, which has the following structure: .sp .in +2 .nf typedef struct audio_buf_info { int fragments;* /# of available fragments */ int fragstotal; /* Total # of fragments allocated */ int fragsize; /* Size of a fragment in bytes */ int bytes; /* Available space in bytes */ /* Note! 'bytes' could be more than fragments*fragsize */ } audio_buf_info; .fi .in -2 .sp The fields fragments, \fBfragstotal\fR, and \fBfragsize\fR are intended for use with compatible applications (and in the future with \fBmmap\fR(2)) only, and need not be used by typical applications. On successful return the bytes member contains the number of bytes that can be transferred without blocking. .RE .sp .ne 2 .na \fB\fBSNDCTL_DSP_CURRENT_IPTR\fR\fR .ad .br .na \fB\fBSNDCTL_DSP_CURRENT_OPTR\fR\fR .ad .RS 27n The argument is a pointer to an \fBoss_count_t\fR, which has the following definition: .sp .in +2 .nf typedef struct { long long samples; /* Total # of samples */ int fifo_samples; /* Samples in device FIFO */ int filler[32];/* For future use */ } oss_count_t; .fi .in -2 .sp The \fBsamples\fR field contains the total number of samples transferred by the device so far. The \fBfifo_samples\fR is the depth of any hardware FIFO. This structure can be useful for accurate stream positioning and latency calculations. .RE .sp .ne 2 .na \fB\fBSNDCTL_DSP_GETIPTR\fR\fR .ad .br .na \fB\fBSNDCTL_DSP_GETOPTR\fR\fR .ad .RS 27n The argument is a pointer to a \fBstruct count_info\fR, which has the following definition: .sp .in +2 .nf typedef struct count_info { unsigned int bytes; /* Total # of bytes processed */ int blocks; /* # of fragment transitions since last time */ int ptr;/* Current DMA pointer value */ } count_info; .fi .in -2 .sp These ioctls are primarily supplied for compatibility, and should not be used by most applications. .RE .sp .ne 2 .na \fB\fBSNDCTL_DSP_GETODELAY\fR\fR .ad .RS 27n The argument is a pointer to an integer. On return, the integer contains the number of bytes still to be played before the next byte written are played. This can be used for accurate determination of device latency. The result can differ from actual value by up the depth of the internal device FIFO, which is typically 64 bytes. .RE .sp .ne 2 .na \fB\fBSNDCTL_DSP_GETERROR\fR\fR .ad .RS 27n The argument is a pointer to a \fBstruct audio_errinfo\fR, defined as follows: .sp .in +2 .nf typedef struct audio_errinfo { int play_underruns; int rec_overruns; unsigned int play_ptradjust; unsigned int rec_ptradjust; int play_errorcount; int rec_errorcount; int play_lasterror; int rec_lasterror; int play_errorparm; int rec_errorparm; int filler[16]; } audio_errinfo; .fi .in -2 .sp For this implementation, only the \fBplay_underruns\fR and \fBrec_overruns\fR values are significant. No other fields are used in this implementation. .sp These fields are reset to zero each time their value is retrieved using this ioctl. .RE .SS "Compatibility IOCTLS" .sp .LP These ioctls are supplied exclusively for compatibility with existing applications. Their use is not recommended, and they are not documented here. Many of these are implemented as simple no-ops. .sp .in +2 .nf SNDCTL_DSP_POST SNDCTL_DSP_STEREO SNDCTL_DSP_SETDUPLEX SNDCTL_DSP_LOW_WATER SNDCTL_DSP_PROFILE SNDCTL_DSP_GETBLKSIZE SNDCTL_DSP_SUBDIVIDE SNDCTL_DSP_SETFRAGMENT SNDCTL_DSP_COOKEDMODE SNDCTL_DSP_READCTL SNDCTL_DSP_WRITECTL SNDCTL_DSP_SILENCE SNDCTL_DSP_SKIP SNDCTL_DSP_POST SNDCTL_DSP_GET_RECSRC SNDCTL_DSP_SET_RECSRC SNDCTL_DSP_SET_RECSRC_NAMES SNDCTL_DSP_GET_PLAYTGT SNDCTL_DSP_SET_PLAYTGT SNDCTL_DSP_SET_PLAYTGT_NAMES SNDCTL_DSP_GETTRIGGER SNDCTL_DSP_SETTRIGGER SNDCTL_AUDIOINFO_EX SNDCTL_ENGINEINFO .fi .in -2 .sp .SH ERRORS .sp .LP An \fBopen()\fR fails if: .sp .ne 2 .na \fB\fBEBUSY\fR\fR .ad .RS 10n The requested play or record access isbusy and either the \fBO_NDELAY\fR or \fBO_NONBLOCK\fR flag was set in the \fBopen()\fR request. .RE .sp .ne 2 .na \fB\fBEINTR\fR\fR .ad .RS 10n The requested play or record access is busy and a signal interrupted the \fBopen()\fR request. .RE .sp .ne 2 .na \fB\fBEINVAL\fR\fR .ad .RS 10n The device cannot support the requested play or record access. .RE .sp .LP An \fBioctl()\fR fails if: .sp .ne 2 .na \fB\fBEINVAL\fR\fR .ad .RS 10n The parameter changes requested in the ioctl are invalid or are not supported by the device. .RE .SH FILES .sp .LP The physical audio device names are system dependent and are rarely used by programmers. Programmers should use the generic device names listed below. .sp .ne 2 .na \fB\fB/dev/dsp\fR\fR .ad .RS 28n Symbolic link to the system's primary audio device .RE .sp .ne 2 .na \fB\fB/dev/mixer\fR\fR .ad .RS 28n Symbolic link to the pseudo mixer device for the system .RE .sp .ne 2 .na \fB\fB/dev/sndstat\fR\fR .ad .RS 28n Symbolic link to the pseudo mixer device for the system .RE .sp .ne 2 .na \fB\fB/usr/share/audio/samples\fR\fR .ad .RS 28n Audio files .RE .SH ATTRIBUTES .sp .LP See \fBattributes\fR(5) for a description of the following attributes: .sp .sp .TS box; c | c l | l . ATTRIBUTE TYPE ATTRIBUTE VALUE _ Architecture SPARC, x86 _ Stability Level Uncommitted .TE .SH SEE ALSO .sp .LP \fBclose\fR(2), \fBfcntl\fR(2), \fBioctl\fR(2), \fBmmap\fR(2), \fBopen\fR(2), \fBpoll\fR(2), \fBread\fR(2), \fBwrite\fR(2), \fBattributes\fR(5), \fBaudio\fR(7D), \fBmixer\fR(7I)