1.. SPDX-License-Identifier: GPL-2.0 2 3===================================================== 4sysfs - _The_ filesystem for exporting kernel objects 5===================================================== 6 7Patrick Mochel <mochel@osdl.org> 8 9Mike Murphy <mamurph@cs.clemson.edu> 10 11:Revised: 16 August 2011 12:Original: 10 January 2003 13 14 15What it is 16~~~~~~~~~~ 17 18sysfs is a RAM-based filesystem initially based on ramfs. It provides 19a means to export kernel data structures, their attributes, and the 20linkages between them to userspace. 21 22sysfs is tied inherently to the kobject infrastructure. Please read 23Documentation/core-api/kobject.rst for more information concerning the kobject 24interface. 25 26 27Using sysfs 28~~~~~~~~~~~ 29 30sysfs is always compiled in if CONFIG_SYSFS is defined. You can access 31it by doing:: 32 33 mount -t sysfs sysfs /sys 34 35 36Directory Creation 37~~~~~~~~~~~~~~~~~~ 38 39For every kobject that is registered with the system, a directory is 40created for it in sysfs. That directory is created as a subdirectory 41of the kobject's parent, expressing internal object hierarchies to 42userspace. Top-level directories in sysfs represent the common 43ancestors of object hierarchies; i.e. the subsystems the objects 44belong to. 45 46sysfs internally stores a pointer to the kobject that implements a 47directory in the kernfs_node object associated with the directory. In 48the past this kobject pointer has been used by sysfs to do reference 49counting directly on the kobject whenever the file is opened or closed. 50With the current sysfs implementation the kobject reference count is 51only modified directly by the function sysfs_schedule_callback(). 52 53 54Attributes 55~~~~~~~~~~ 56 57Attributes can be exported for kobjects in the form of regular files in 58the filesystem. sysfs forwards file I/O operations to methods defined 59for the attributes, providing a means to read and write kernel 60attributes. 61 62Attributes should be ASCII text files, preferably with only one value 63per file. It is noted that it may not be efficient to contain only one 64value per file, so it is socially acceptable to express an array of 65values of the same type. 66 67Mixing types, expressing multiple lines of data, and doing fancy 68formatting of data is heavily frowned upon. Doing these things may get 69you publicly humiliated and your code rewritten without notice. 70 71 72An attribute definition is simply:: 73 74 struct attribute { 75 char *name; 76 struct module *owner; 77 umode_t mode; 78 }; 79 80 81 int sysfs_create_file(struct kobject * kobj, const struct attribute * attr); 82 void sysfs_remove_file(struct kobject * kobj, const struct attribute * attr); 83 84 85A bare attribute contains no means to read or write the value of the 86attribute. Subsystems are encouraged to define their own attribute 87structure and wrapper functions for adding and removing attributes for 88a specific object type. 89 90For example, the driver model defines struct device_attribute like:: 91 92 struct device_attribute { 93 struct attribute attr; 94 ssize_t (*show)(struct device *dev, struct device_attribute *attr, 95 char *buf); 96 ssize_t (*store)(struct device *dev, struct device_attribute *attr, 97 const char *buf, size_t count); 98 }; 99 100 int device_create_file(struct device *, const struct device_attribute *); 101 void device_remove_file(struct device *, const struct device_attribute *); 102 103It also defines this helper for defining device attributes:: 104 105 #define DEVICE_ATTR(_name, _mode, _show, _store) \ 106 struct device_attribute dev_attr_##_name = __ATTR(_name, _mode, _show, _store) 107 108For example, declaring:: 109 110 static DEVICE_ATTR(foo, S_IWUSR | S_IRUGO, show_foo, store_foo); 111 112is equivalent to doing:: 113 114 static struct device_attribute dev_attr_foo = { 115 .attr = { 116 .name = "foo", 117 .mode = S_IWUSR | S_IRUGO, 118 }, 119 .show = show_foo, 120 .store = store_foo, 121 }; 122 123Note as stated in include/linux/kernel.h "OTHER_WRITABLE? Generally 124considered a bad idea." so trying to set a sysfs file writable for 125everyone will fail reverting to RO mode for "Others". 126 127For the common cases sysfs.h provides convenience macros to make 128defining attributes easier as well as making code more concise and 129readable. The above case could be shortened to: 130 131static struct device_attribute dev_attr_foo = __ATTR_RW(foo); 132 133the list of helpers available to define your wrapper function is: 134 135__ATTR_RO(name): 136 assumes default name_show and mode 0444 137__ATTR_WO(name): 138 assumes a name_store only and is restricted to mode 139 0200 that is root write access only. 140__ATTR_RO_MODE(name, mode): 141 for more restrictive RO access; currently 142 only use case is the EFI System Resource Table 143 (see drivers/firmware/efi/esrt.c) 144__ATTR_RW(name): 145 assumes default name_show, name_store and setting 146 mode to 0644. 147__ATTR_NULL: 148 which sets the name to NULL and is used as end of list 149 indicator (see: kernel/workqueue.c) 150 151Subsystem-Specific Callbacks 152~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 153 154When a subsystem defines a new attribute type, it must implement a 155set of sysfs operations for forwarding read and write calls to the 156show and store methods of the attribute owners:: 157 158 struct sysfs_ops { 159 ssize_t (*show)(struct kobject *, struct attribute *, char *); 160 ssize_t (*store)(struct kobject *, struct attribute *, const char *, size_t); 161 }; 162 163[ Subsystems should have already defined a struct kobj_type as a 164descriptor for this type, which is where the sysfs_ops pointer is 165stored. See the kobject documentation for more information. ] 166 167When a file is read or written, sysfs calls the appropriate method 168for the type. The method then translates the generic struct kobject 169and struct attribute pointers to the appropriate pointer types, and 170calls the associated methods. 171 172 173To illustrate:: 174 175 #define to_dev_attr(_attr) container_of(_attr, struct device_attribute, attr) 176 177 static ssize_t dev_attr_show(struct kobject *kobj, struct attribute *attr, 178 char *buf) 179 { 180 struct device_attribute *dev_attr = to_dev_attr(attr); 181 struct device *dev = kobj_to_dev(kobj); 182 ssize_t ret = -EIO; 183 184 if (dev_attr->show) 185 ret = dev_attr->show(dev, dev_attr, buf); 186 if (ret >= (ssize_t)PAGE_SIZE) { 187 printk("dev_attr_show: %pS returned bad count\n", 188 dev_attr->show); 189 } 190 return ret; 191 } 192 193 194 195Reading/Writing Attribute Data 196~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 197 198To read or write attributes, show() or store() methods must be 199specified when declaring the attribute. The method types should be as 200simple as those defined for device attributes:: 201 202 ssize_t (*show)(struct device *dev, struct device_attribute *attr, char *buf); 203 ssize_t (*store)(struct device *dev, struct device_attribute *attr, 204 const char *buf, size_t count); 205 206IOW, they should take only an object, an attribute, and a buffer as parameters. 207 208 209sysfs allocates a buffer of size (PAGE_SIZE) and passes it to the 210method. sysfs will call the method exactly once for each read or 211write. This forces the following behavior on the method 212implementations: 213 214- On read(2), the show() method should fill the entire buffer. 215 Recall that an attribute should only be exporting one value, or an 216 array of similar values, so this shouldn't be that expensive. 217 218 This allows userspace to do partial reads and forward seeks 219 arbitrarily over the entire file at will. If userspace seeks back to 220 zero or does a pread(2) with an offset of '0' the show() method will 221 be called again, rearmed, to fill the buffer. 222 223- On write(2), sysfs expects the entire buffer to be passed during the 224 first write. sysfs then passes the entire buffer to the store() method. 225 A terminating null is added after the data on stores. This makes 226 functions like sysfs_streq() safe to use. 227 228 When writing sysfs files, userspace processes should first read the 229 entire file, modify the values it wishes to change, then write the 230 entire buffer back. 231 232 Attribute method implementations should operate on an identical 233 buffer when reading and writing values. 234 235Other notes: 236 237- Writing causes the show() method to be rearmed regardless of current 238 file position. 239 240- The buffer will always be PAGE_SIZE bytes in length. On x86, this 241 is 4096. 242 243- show() methods should return the number of bytes printed into the 244 buffer. 245 246- show() should only use sysfs_emit() or sysfs_emit_at() when formatting 247 the value to be returned to user space. 248 249- store() should return the number of bytes used from the buffer. If the 250 entire buffer has been used, just return the count argument. 251 252- show() or store() can always return errors. If a bad value comes 253 through, be sure to return an error. 254 255- The object passed to the methods will be pinned in memory via sysfs 256 reference counting its embedded object. However, the physical 257 entity (e.g. device) the object represents may not be present. Be 258 sure to have a way to check this, if necessary. 259 260 261A very simple (and naive) implementation of a device attribute is:: 262 263 static ssize_t show_name(struct device *dev, struct device_attribute *attr, 264 char *buf) 265 { 266 return sysfs_emit(buf, "%s\n", dev->name); 267 } 268 269 static ssize_t store_name(struct device *dev, struct device_attribute *attr, 270 const char *buf, size_t count) 271 { 272 snprintf(dev->name, sizeof(dev->name), "%.*s", 273 (int)min(count, sizeof(dev->name) - 1), buf); 274 return count; 275 } 276 277 static DEVICE_ATTR(name, S_IRUGO, show_name, store_name); 278 279 280(Note that the real implementation doesn't allow userspace to set the 281name for a device.) 282 283 284Top Level Directory Layout 285~~~~~~~~~~~~~~~~~~~~~~~~~~ 286 287The sysfs directory arrangement exposes the relationship of kernel 288data structures. 289 290The top level sysfs directory looks like:: 291 292 block/ 293 bus/ 294 class/ 295 dev/ 296 devices/ 297 firmware/ 298 fs/ 299 hypervisor/ 300 kernel/ 301 module/ 302 net/ 303 power/ 304 305devices/ contains a filesystem representation of the device tree. It maps 306directly to the internal kernel device tree, which is a hierarchy of 307struct device. 308 309bus/ contains flat directory layout of the various bus types in the 310kernel. Each bus's directory contains two subdirectories:: 311 312 devices/ 313 drivers/ 314 315devices/ contains symlinks for each device discovered in the system 316that point to the device's directory under root/. 317 318drivers/ contains a directory for each device driver that is loaded 319for devices on that particular bus (this assumes that drivers do not 320span multiple bus types). 321 322fs/ contains a directory for some filesystems. Currently each 323filesystem wanting to export attributes must create its own hierarchy 324below fs/ (see ./fuse.rst for an example). 325 326module/ contains parameter values and state information for all 327loaded system modules, for both builtin and loadable modules. 328 329dev/ contains two directories: char/ and block/. Inside these two 330directories there are symlinks named <major>:<minor>. These symlinks 331point to the sysfs directory for the given device. /sys/dev provides a 332quick way to lookup the sysfs interface for a device from the result of 333a stat(2) operation. 334 335More information on driver-model specific features can be found in 336Documentation/driver-api/driver-model/. 337 338 339TODO: Finish this section. 340 341 342Current Interfaces 343~~~~~~~~~~~~~~~~~~ 344 345The following interface layers currently exist in sysfs. 346 347 348devices (include/linux/device.h) 349-------------------------------- 350Structure:: 351 352 struct device_attribute { 353 struct attribute attr; 354 ssize_t (*show)(struct device *dev, struct device_attribute *attr, 355 char *buf); 356 ssize_t (*store)(struct device *dev, struct device_attribute *attr, 357 const char *buf, size_t count); 358 }; 359 360Declaring:: 361 362 DEVICE_ATTR(_name, _mode, _show, _store); 363 364Creation/Removal:: 365 366 int device_create_file(struct device *dev, const struct device_attribute * attr); 367 void device_remove_file(struct device *dev, const struct device_attribute * attr); 368 369 370bus drivers (include/linux/device.h) 371------------------------------------ 372Structure:: 373 374 struct bus_attribute { 375 struct attribute attr; 376 ssize_t (*show)(const struct bus_type *, char * buf); 377 ssize_t (*store)(const struct bus_type *, const char * buf, size_t count); 378 }; 379 380Declaring:: 381 382 static BUS_ATTR_RW(name); 383 static BUS_ATTR_RO(name); 384 static BUS_ATTR_WO(name); 385 386Creation/Removal:: 387 388 int bus_create_file(struct bus_type *, struct bus_attribute *); 389 void bus_remove_file(struct bus_type *, struct bus_attribute *); 390 391 392device drivers (include/linux/device.h) 393--------------------------------------- 394 395Structure:: 396 397 struct driver_attribute { 398 struct attribute attr; 399 ssize_t (*show)(struct device_driver *, char * buf); 400 ssize_t (*store)(struct device_driver *, const char * buf, 401 size_t count); 402 }; 403 404Declaring:: 405 406 DRIVER_ATTR_RO(_name) 407 DRIVER_ATTR_RW(_name) 408 409Creation/Removal:: 410 411 int driver_create_file(struct device_driver *, const struct driver_attribute *); 412 void driver_remove_file(struct device_driver *, const struct driver_attribute *); 413 414 415Documentation 416~~~~~~~~~~~~~ 417 418The sysfs directory structure and the attributes in each directory define an 419ABI between the kernel and user space. As for any ABI, it is important that 420this ABI is stable and properly documented. All new sysfs attributes must be 421documented in Documentation/ABI. See also Documentation/ABI/README for more 422information. 423