# SPDX-License-Identifier: GPL-2.0-only

menu "Memory Management options"

#
# For some reason microblaze and nios2 hard code SWAP=n.  Hopefully we can
# add proper SWAP support to them, in which case this can be removed.
#
config ARCH_NO_SWAP
	bool

menuconfig SWAP
	bool "Support for paging of anonymous memory (swap)"
	depends on MMU && BLOCK && !ARCH_NO_SWAP
	default y
	help
	  This option allows you to choose whether you want to have support
	  for so-called swap devices or swap files in your kernel that are
	  used to provide more virtual memory than the actual RAM present
	  in your computer.  If unsure, say Y.

config ZSWAP
	bool "Compressed cache for swap pages"
	depends on SWAP
	select CRYPTO
	select ZSMALLOC
	help
	  A lightweight compressed cache for swap pages.  It takes
	  pages that are in the process of being swapped out and attempts to
	  compress them into a dynamically allocated RAM-based memory pool.
	  This can result in a significant I/O reduction on the swap device
	  and, in the case where decompressing from RAM is faster than swap
	  device reads, can also improve workload performance.

config ZSWAP_DEFAULT_ON
	bool "Enable the compressed cache for swap pages by default"
	depends on ZSWAP
	help
	  If selected, the compressed cache for swap pages will be enabled
	  at boot, otherwise it will be disabled.

	  The selection made here can be overridden by using the kernel
	  command line 'zswap.enabled=' option.
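#
# Illustration for ZSWAP_DEFAULT_ON above (a sketch, not upstream text):
# with ZSWAP_DEFAULT_ON=n, zswap can still be turned on for a single boot
# by giving the option named in the help text a boolean value on the
# kernel command line, e.g.
#
#   zswap.enabled=1
#
# The value "1" is an assumed example; the help text only names the option.
#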

config ZSWAP_SHRINKER_DEFAULT_ON
	bool "Shrink the zswap pool on memory pressure"
	depends on ZSWAP
	default n
	help
	  If selected, the zswap shrinker will be enabled, and the pages
	  stored in the zswap pool will become available for reclaim (i.e.
	  written back to the backing swap device) on memory pressure.

	  This means that zswap writeback could happen even if the pool is
	  not yet full, or the cgroup zswap limit has not been reached,
	  reducing the chance that cold pages will reside in the zswap pool
	  and consume memory indefinitely.

choice
	prompt "Default compressor"
	depends on ZSWAP
	default ZSWAP_COMPRESSOR_DEFAULT_LZO
	help
	  Selects the default compression algorithm for the compressed cache
	  for swap pages.

	  For an overview of what kind of performance can be expected from
	  a particular compression algorithm, please refer to the benchmarks
	  available at the following LWN page:
	  https://lwn.net/Articles/751795/

	  If in doubt, select 'LZO'.

	  The selection made here can be overridden by using the kernel
	  command line 'zswap.compressor=' option.
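#
# Illustration for the "Default compressor" choice above (a sketch, not
# upstream text): the override takes an algorithm name such as the strings
# listed under ZSWAP_COMPRESSOR_DEFAULT further down in this file, e.g.
#
#   zswap.compressor=zstd
#
# This assumes the named algorithm is available in the running kernel.
#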

config ZSWAP_COMPRESSOR_DEFAULT_DEFLATE
	bool "Deflate"
	select CRYPTO_DEFLATE
	help
	  Use the Deflate algorithm as the default compression algorithm.

config ZSWAP_COMPRESSOR_DEFAULT_LZO
	bool "LZO"
	select CRYPTO_LZO
	help
	  Use the LZO algorithm as the default compression algorithm.

config ZSWAP_COMPRESSOR_DEFAULT_842
	bool "842"
	select CRYPTO_842
	help
	  Use the 842 algorithm as the default compression algorithm.

config ZSWAP_COMPRESSOR_DEFAULT_LZ4
	bool "LZ4"
	select CRYPTO_LZ4
	help
	  Use the LZ4 algorithm as the default compression algorithm.

config ZSWAP_COMPRESSOR_DEFAULT_LZ4HC
	bool "LZ4HC"
	select CRYPTO_LZ4HC
	help
	  Use the LZ4HC algorithm as the default compression algorithm.

config ZSWAP_COMPRESSOR_DEFAULT_ZSTD
	bool "zstd"
	select CRYPTO_ZSTD
	help
	  Use the zstd algorithm as the default compression algorithm.
endchoice

config ZSWAP_COMPRESSOR_DEFAULT
       string
       depends on ZSWAP
       default "deflate" if ZSWAP_COMPRESSOR_DEFAULT_DEFLATE
       default "lzo" if ZSWAP_COMPRESSOR_DEFAULT_LZO
       default "842" if ZSWAP_COMPRESSOR_DEFAULT_842
       default "lz4" if ZSWAP_COMPRESSOR_DEFAULT_LZ4
       default "lz4hc" if ZSWAP_COMPRESSOR_DEFAULT_LZ4HC
       default "zstd" if ZSWAP_COMPRESSOR_DEFAULT_ZSTD
       default ""
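#
# Illustration for ZSWAP_COMPRESSOR_DEFAULT above (a sketch, not upstream
# text): picking "zstd" in the choice is expected to leave a .config
# fragment along these lines, with the string symbol following whichever
# choice member is selected:
#
#   CONFIG_ZSWAP=y
#   CONFIG_ZSWAP_COMPRESSOR_DEFAULT_ZSTD=y
#   CONFIG_ZSWAP_COMPRESSOR_DEFAULT="zstd"
#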

config ZSMALLOC
	tristate

if ZSMALLOC

menu "Zsmalloc allocator options"
	depends on ZSMALLOC

comment "Zsmalloc is a common backend allocator for zswap & zram"

config ZSMALLOC_STAT
	bool "Export zsmalloc statistics"
	select DEBUG_FS
	help
	  This option enables code in zsmalloc to collect various
	  statistics about what's happening in zsmalloc and exports that
	  information to userspace via debugfs.
	  If unsure, say N.

config ZSMALLOC_CHAIN_SIZE
	int "Maximum number of physical pages per-zspage"
	default 8
	range 4 16
	help
	  This option sets the upper limit on the number of physical pages
	  that a zsmalloc page (zspage) can consist of. The optimal zspage
	  chain size is calculated for each size class during the
	  initialization of the pool.

	  Changing this option can alter the characteristics of size classes,
	  such as the number of pages per zspage and the number of objects
	  per zspage. This can also result in different configurations of
	  the pool, as zsmalloc merges size classes with similar
	  characteristics.

	  For more information, see the zsmalloc documentation.

endmenu

endif

menu "Slab allocator options"

config SLUB
	def_bool y
	select IRQ_WORK

config KVFREE_RCU_BATCHED
	def_bool y
	depends on !SLUB_TINY && !TINY_RCU

config SLUB_TINY
	bool "Configure for minimal memory footprint"
	depends on EXPERT && !COMPILE_TEST
	select SLAB_MERGE_DEFAULT
	help
	   Configures the slab allocator in a way to achieve minimal memory
	   footprint, sacrificing scalability, debugging and other features.
	   This is intended only for the smallest systems that had used the
	   SLOB allocator and is not recommended for systems with more than
	   16MB RAM.

	   If unsure, say N.

config SLAB_MERGE_DEFAULT
	bool "Allow slab caches to be merged"
	default y
	help
	  For reduced kernel memory fragmentation, slab caches can be
	  merged when they share the same size and other characteristics.
	  This carries a risk of kernel heap overflows being able to
	  overwrite objects from merged caches (and more easily control
	  cache layout), which makes such heap attacks easier for
	  attackers to exploit. By keeping caches unmerged, these kinds of
	  exploits can usually only damage objects in the same cache. To
	  disable merging at runtime, "slab_nomerge" can be passed on the
	  kernel command line.
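#
# Illustration for SLAB_MERGE_DEFAULT above (a sketch, not upstream text):
# a kernel built with SLAB_MERGE_DEFAULT=y can keep caches unmerged for one
# boot by adding the bare flag to the kernel command line, e.g.
#
#   root=/dev/sda1 ro slab_nomerge
#
# The root= and ro arguments are placeholders for whatever the system
# already passes.
#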

config SLAB_FREELIST_RANDOM
	bool "Randomize slab freelist"
	depends on !SLUB_TINY
	help
	  Randomizes the freelist order used on creating new pages. This
	  security feature reduces the predictability of the kernel slab
	  allocator against heap overflows.

config SLAB_FREELIST_HARDENED
	bool "Harden slab freelist metadata"
	depends on !SLUB_TINY
	help
	  Many kernel heap attacks try to target slab cache metadata and
	  other infrastructure. This option makes minor performance
	  sacrifices to harden the kernel slab allocator against common
	  freelist exploit methods.

config SLAB_BUCKETS
	bool "Support allocation from separate kmalloc buckets"
	depends on !SLUB_TINY
	default SLAB_FREELIST_HARDENED
	help
	  Kernel heap attacks frequently depend on being able to create
	  specifically-sized allocations with user-controlled contents
	  that will be allocated into the same kmalloc bucket as a
	  target object. To avoid sharing these allocation buckets,
	  provide an explicitly separated set of buckets to be used for
	  user-controlled allocations. This may very slightly increase
	  memory fragmentation, though in practice it's only a handful
	  of extra pages since the bulk of user-controlled allocations
	  are relatively long-lived.

	  If unsure, say Y.

config SLUB_STATS
	default n
	bool "Enable performance statistics"
	depends on SYSFS && !SLUB_TINY
	help
	  The statistics are useful to debug slab allocation behavior in
	  order to find ways to optimize the allocator. This should never be
	  enabled for production use since keeping statistics slows down
	  the allocator by a few percentage points. The slabinfo command
	  supports the determination of the most active slabs to figure
	  out which slabs are relevant to a particular load.
	  Try running: slabinfo -DA

config SLUB_CPU_PARTIAL
	default y
	depends on SMP && !SLUB_TINY
	bool "Enable per cpu partial caches"
	help
	  Per cpu partial caches accelerate object allocation and freeing
	  that is local to a processor at the price of more indeterminism
	  in the latency of the free. On overflow these caches will be cleared
	  which requires the taking of locks that may cause latency spikes.
	  Typically one would choose no for a realtime system.

config RANDOM_KMALLOC_CACHES
	default n
	depends on !SLUB_TINY
	bool "Randomize slab caches for normal kmalloc"
	help
	  A hardening feature that creates multiple copies of slab caches for
	  normal kmalloc allocation and makes kmalloc randomly pick one based
	  on code address, which makes it more difficult for attackers to
	  spray vulnerable memory objects on the heap for the purpose of
	  exploiting memory vulnerabilities.

	  Currently the number of copies is set to 16, a reasonably large value
	  that effectively diverges the memory objects allocated for different
	  subsystems or modules into different caches, at the expense of a
	  limited degree of memory and CPU overhead that relates to hardware and
	  system workload.

endmenu # Slab allocator options

config SHUFFLE_PAGE_ALLOCATOR
	bool "Page allocator randomization"
	default SLAB_FREELIST_RANDOM && ACPI_NUMA
	help
	  Randomization of the page allocator improves the average
	  utilization of a direct-mapped memory-side-cache. See section
	  5.2.27 Heterogeneous Memory Attribute Table (HMAT) in the ACPI
	  6.2a specification for an example of how a platform advertises
	  the presence of a memory-side-cache. There are also incidental
	  security benefits as it reduces the predictability of page
	  allocations to complement SLAB_FREELIST_RANDOM, but the
	  default granularity of shuffling (MAX_PAGE_ORDER, i.e. order-10
	  pages) is selected based on cache utilization benefits on x86.

	  While the randomization improves cache utilization it may
	  negatively impact workloads on platforms without a cache. For
	  this reason, by default, the randomization is not enabled even
	  if SHUFFLE_PAGE_ALLOCATOR=y. The randomization may be force-enabled
	  with the 'page_alloc.shuffle' kernel command line parameter.

	  Say Y if unsure.
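#
# Illustration for SHUFFLE_PAGE_ALLOCATOR above (a sketch, not upstream
# text): a kernel built with SHUFFLE_PAGE_ALLOCATOR=y can opt in to
# shuffling at boot through the parameter named in the help text, e.g.
#
#   page_alloc.shuffle=1
#
# The value "1" is an assumed boolean spelling; the help text only names
# the parameter.
#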

config COMPAT_BRK
	bool "Disable heap randomization"
	default y
	help
	  Randomizing heap placement makes heap exploits harder, but it
	  also breaks ancient binaries (including anything libc5 based).
	  This option changes the bootup default to heap randomization
	  disabled, and can be overridden at runtime by setting
	  /proc/sys/kernel/randomize_va_space to 2.

	  On non-ancient distros (post-2000 ones) N is usually a safe choice.
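#
# Illustration for COMPAT_BRK above (a sketch, not upstream text): on a
# kernel built with COMPAT_BRK=y, heap randomization can be switched back
# on at runtime from a root shell, e.g.
#
#   echo 2 > /proc/sys/kernel/randomize_va_space
#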

config MMAP_ALLOW_UNINITIALIZED
	bool "Allow mmapped anonymous memory to be uninitialized"
	depends on EXPERT && !MMU
	default n
	help
	  Normally, and according to the Linux spec, anonymous memory obtained
	  from mmap() has its contents cleared before it is passed to
	  userspace.  Enabling this config option allows you to request that
	  mmap() skip that if it is given a MAP_UNINITIALIZED flag, thus
	  providing a huge performance boost.  If this option is not enabled,
	  then the flag will be ignored.

	  This is taken advantage of by uClibc's malloc(), and also by
	  ELF-FDPIC binfmt's brk and stack allocator.

	  Because of the obvious security issues, this option should only be
	  enabled on embedded devices where you control what is run in
	  userspace.  Since that isn't generally a problem on no-MMU systems,
	  it is normally safe to say Y here.

	  See Documentation/admin-guide/mm/nommu-mmap.rst for more information.

config SELECT_MEMORY_MODEL
	def_bool y
	depends on ARCH_SELECT_MEMORY_MODEL

choice
	prompt "Memory model"
	depends on SELECT_MEMORY_MODEL
	default SPARSEMEM_MANUAL if ARCH_SPARSEMEM_DEFAULT
	default FLATMEM_MANUAL
	help
	  This option allows you to change some of the ways that
	  Linux manages its memory internally. Most users will
	  only have one option here selected by the architecture
	  configuration. This is normal.

config FLATMEM_MANUAL
	bool "Flat Memory"
	depends on !ARCH_SPARSEMEM_ENABLE || ARCH_FLATMEM_ENABLE
	help
	  This option is best suited for non-NUMA systems with
	  flat address space. FLATMEM is the most efficient
	  system in terms of performance and resource consumption
	  and it is the best option for smaller systems.

	  For systems that have holes in their physical address
	  spaces and for features like NUMA and memory hotplug,
	  choose "Sparse Memory".

	  If unsure, choose this option (Flat Memory) over any other.

config SPARSEMEM_MANUAL
	bool "Sparse Memory"
	depends on ARCH_SPARSEMEM_ENABLE
	help
	  This will be the only option for some systems, including
	  memory hot-plug systems.  This is normal.

	  This option provides efficient support for systems with
	  holes in their physical address space and allows memory
	  hot-plug and hot-remove.

	  If unsure, choose "Flat Memory" over this option.

endchoice

config SPARSEMEM
	def_bool y
	depends on (!SELECT_MEMORY_MODEL && ARCH_SPARSEMEM_ENABLE) || SPARSEMEM_MANUAL

config FLATMEM
	def_bool y
	depends on !SPARSEMEM || FLATMEM_MANUAL
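#
# Illustration for SPARSEMEM and FLATMEM above (a sketch, not upstream
# text): consider a hypothetical architecture whose own Kconfig contains
#
#   config ARCH_SPARSEMEM_ENABLE
#           def_bool y
#
# and which does not set ARCH_SELECT_MEMORY_MODEL. Following the
# dependencies above, SELECT_MEMORY_MODEL stays n, the "Memory model"
# choice is hidden, SPARSEMEM resolves to y through its first condition,
# and FLATMEM resolves to n because SPARSEMEM is set and FLATMEM_MANUAL
# cannot be chosen.
#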

#
# SPARSEMEM_EXTREME (which is the default) does some bootmem
# allocations when sparse_init() is called.  If this cannot
# be done on your architecture, select this option.  However,
# statically allocating the mem_section[] array can potentially
# consume vast quantities of .bss, so be careful.
#
# This option will also potentially produce smaller runtime code
# with gcc 3.4 and later.
#
config SPARSEMEM_STATIC
	bool

#
# Architecture platforms which require a two level mem_section in SPARSEMEM
# must select this option. This is usually for architecture platforms with
# an extremely sparse physical address space.
#
config SPARSEMEM_EXTREME
	def_bool y
	depends on SPARSEMEM && !SPARSEMEM_STATIC

config SPARSEMEM_VMEMMAP_ENABLE
	bool

config SPARSEMEM_VMEMMAP
	def_bool y
	depends on SPARSEMEM && SPARSEMEM_VMEMMAP_ENABLE
	help
	  SPARSEMEM_VMEMMAP uses a virtually mapped memmap to optimise
	  pfn_to_page and page_to_pfn operations.  This is the most
	  efficient option when sufficient kernel resources are available.

config SPARSEMEM_VMEMMAP_PREINIT
	bool
#
# Select this config option from the architecture Kconfig, if it is preferred
# to enable the feature of HugeTLB/dev_dax vmemmap optimization.
#
config ARCH_WANT_OPTIMIZE_DAX_VMEMMAP
	bool

config ARCH_WANT_OPTIMIZE_HUGETLB_VMEMMAP
	bool

config ARCH_WANT_HUGETLB_VMEMMAP_PREINIT
	bool

config HAVE_MEMBLOCK_PHYS_MAP
	bool

config HAVE_GUP_FAST
	depends on MMU
	bool

# Enable memblock support for scratch memory which is needed for kexec handover
config MEMBLOCK_KHO_SCRATCH
	bool

# Don't discard allocated memory used to track "memory" and "reserved" memblocks
# after early boot, so it can still be used to test for validity of memory.
# Also, memblocks are updated with memory hot(un)plug.
config ARCH_KEEP_MEMBLOCK
	bool

# Keep arch NUMA mapping infrastructure post-init.
config NUMA_KEEP_MEMINFO
	bool

config MEMORY_ISOLATION
	bool

# IORESOURCE_SYSTEM_RAM regions in the kernel resource tree that are marked
# IORESOURCE_EXCLUSIVE cannot be mapped to user space, for example, via
# /dev/mem.
config EXCLUSIVE_SYSTEM_RAM
	def_bool y
	depends on !DEVMEM || STRICT_DEVMEM

#
# Should only be set on architectures that have completely implemented the
# memory hotplug feature. If you are not sure, don't touch it.
#
config HAVE_BOOTMEM_INFO_NODE
	def_bool n

config ARCH_ENABLE_MEMORY_HOTPLUG
	bool

config ARCH_ENABLE_MEMORY_HOTREMOVE
	bool

# eventually, we can have this option just 'select SPARSEMEM'
menuconfig MEMORY_HOTPLUG
	bool "Memory hotplug"
	select MEMORY_ISOLATION
	depends on SPARSEMEM
	depends on ARCH_ENABLE_MEMORY_HOTPLUG
	depends on 64BIT
	select NUMA_KEEP_MEMINFO if NUMA

if MEMORY_HOTPLUG

choice
	prompt "Memory Hotplug Default Online Type"
	default MHP_DEFAULT_ONLINE_TYPE_OFFLINE
	help
	  Default memory type for hotplugged memory.

	  This option sets the default for the memory hotplug onlining
	  policy (/sys/devices/system/memory/auto_online_blocks), which
	  determines what happens to newly added memory regions. The policy
	  setting can always be changed at runtime.

	  The default is 'offline'.

	  Select offline to defer onlining to drivers and user policy.
	  Select auto to let the kernel choose what zones to utilize.
	  Select online_kernel to generally allow kernel usage of this memory.
	  Select online_movable to generally disallow kernel usage of this memory.

	  Example kernel usage would be page structs and page tables.

	  See Documentation/admin-guide/mm/memory-hotplug.rst for more information.
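#
# Illustration for the "Memory Hotplug Default Online Type" choice above
# (a sketch, not upstream text): the built-in default can be inspected and
# changed at runtime from a root shell through the sysfs file named in the
# help text, e.g.
#
#   cat /sys/devices/system/memory/auto_online_blocks
#   echo online_movable > /sys/devices/system/memory/auto_online_blocks
#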
5148604d9e5SVitaly Kuznetsov
51544d46b76SGregory Priceconfig MHP_DEFAULT_ONLINE_TYPE_OFFLINE
51644d46b76SGregory Price	bool "offline"
51744d46b76SGregory Price	help
51844d46b76SGregory Price	  Hotplugged memory will not be onlined by default.
51944d46b76SGregory Price	  Choose this for systems with drivers and user policy that
52044d46b76SGregory Price	  handle onlining of hotplug memory policy.
52144d46b76SGregory Price
52244d46b76SGregory Priceconfig MHP_DEFAULT_ONLINE_TYPE_ONLINE_AUTO
52344d46b76SGregory Price	bool "auto"
52444d46b76SGregory Price	help
52544d46b76SGregory Price	  Select this if you want the kernel to automatically online
52644d46b76SGregory Price	  hotplugged memory into the zone it thinks is reasonable.
52744d46b76SGregory Price	  This memory may be utilized for kernel data.
52844d46b76SGregory Price
52944d46b76SGregory Priceconfig MHP_DEFAULT_ONLINE_TYPE_ONLINE_KERNEL
53044d46b76SGregory Price	bool "kernel"
53144d46b76SGregory Price	help
53244d46b76SGregory Price	  Select this if you want the kernel to automatically online
53344d46b76SGregory Price	  hotplugged memory into a zone capable of being used for kernel
53444d46b76SGregory Price	  data. This typically means ZONE_NORMAL.
53544d46b76SGregory Price
53644d46b76SGregory Priceconfig MHP_DEFAULT_ONLINE_TYPE_ONLINE_MOVABLE
53744d46b76SGregory Price	bool "movable"
53844d46b76SGregory Price	help
53944d46b76SGregory Price	  Select this if you want the kernel to automatically online
54044d46b76SGregory Price	  hotplug memory into ZONE_MOVABLE. This memory will generally
54144d46b76SGregory Price	  not be utilized for kernel data.
54244d46b76SGregory Price
54344d46b76SGregory Price	  This should only be used when the admin knows sufficient
54444d46b76SGregory Price	  ZONE_NORMAL memory is available to describe hotplug memory,
54544d46b76SGregory Price	  otherwise hotplug memory may fail to online. For example,
54644d46b76SGregory Price	  sufficient kernel-capable memory (ZONE_NORMAL) must be
54744d46b76SGregory Price	  available to allocate page structs to describe ZONE_MOVABLE.
54844d46b76SGregory Price
54944d46b76SGregory Priceendchoice
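#
# Illustration (not part of the upstream file): the choice above only sets
# the build-time default. As a sketch, the policy can still be overridden
# with the memhp_default_state= boot parameter or at runtime through sysfs:
#
#   cat /sys/devices/system/memory/auto_online_blocks
#   echo online_movable > /sys/devices/system/memory/auto_online_blocks
#
# The accepted values are assumed to mirror the choice entries: offline,
# online, online_kernel and online_movable.
#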
5508604d9e5SVitaly Kuznetsov
5510c0e6195SKAMEZAWA Hiroyukiconfig MEMORY_HOTREMOVE
5520c0e6195SKAMEZAWA Hiroyuki	bool "Allow for memory hot remove"
553f7e3334aSNathan Fontenot	select HAVE_BOOTMEM_INFO_NODE if (X86_64 || PPC64)
5540c0e6195SKAMEZAWA Hiroyuki	depends on MEMORY_HOTPLUG && ARCH_ENABLE_MEMORY_HOTREMOVE
5550c0e6195SKAMEZAWA Hiroyuki	depends on MIGRATION
5560c0e6195SKAMEZAWA Hiroyuki
557a08a2ae3SOscar Salvadorconfig MHP_MEMMAP_ON_MEMORY
558a08a2ae3SOscar Salvador	def_bool y
559a08a2ae3SOscar Salvador	depends on MEMORY_HOTPLUG && SPARSEMEM_VMEMMAP
560a08a2ae3SOscar Salvador	depends on ARCH_MHP_MEMMAP_ON_MEMORY_ENABLE
561a08a2ae3SOscar Salvador
562519bcb79SJohannes Weinerendif # MEMORY_HOTPLUG
563519bcb79SJohannes Weiner
56404d5ea46SAneesh Kumar K.Vconfig ARCH_MHP_MEMMAP_ON_MEMORY_ENABLE
56504d5ea46SAneesh Kumar K.V       bool
56604d5ea46SAneesh Kumar K.V
5674c21e2f2SHugh Dickins# Heavily threaded applications may benefit from splitting the mm-wide
5684c21e2f2SHugh Dickins# page_table_lock, so that faults on different parts of the user address
5694c21e2f2SHugh Dickins# space can be handled with less contention: split it at this NR_CPUS.
5704c21e2f2SHugh Dickins# Default to 4 for wider testing, though 8 might be more appropriate.
5714c21e2f2SHugh Dickins# ARM's adjust_pte (unused if VIPT) depends on mm-wide page_table_lock.
5727b6ac9dfSHugh Dickins# PA-RISC 7xxx's spinlock_t would enlarge struct page from 32 to 44 bytes.
57360bccaa6SWill Deacon# SPARC32 allocates multiple pte tables within a single page, and therefore
57460bccaa6SWill Deacon# a per-page lock leads to problems when multiple tables need to be locked
57560bccaa6SWill Deacon# at the same time (e.g. copy_page_range()).
576a70caa8bSHugh Dickins# DEBUG_SPINLOCK and DEBUG_LOCK_ALLOC spinlock_t also enlarge struct page.
5774c21e2f2SHugh Dickins#
578394290cbSDavid Hildenbrandconfig SPLIT_PTE_PTLOCKS
579394290cbSDavid Hildenbrand	def_bool y
580394290cbSDavid Hildenbrand	depends on MMU
581a3344078SGuenter Roeck	depends on SMP
582394290cbSDavid Hildenbrand	depends on NR_CPUS >= 4
583394290cbSDavid Hildenbrand	depends on !ARM || CPU_CACHE_VIPT
584394290cbSDavid Hildenbrand	depends on !PARISC || PA20
585394290cbSDavid Hildenbrand	depends on !SPARC32
5867cbe34cfSChristoph Lameter
587e009bb30SKirill A. Shutemovconfig ARCH_ENABLE_SPLIT_PMD_PTLOCK
5886341e62bSChristoph Jaeger	bool
589e009bb30SKirill A. Shutemov
590394290cbSDavid Hildenbrandconfig SPLIT_PMD_PTLOCKS
591394290cbSDavid Hildenbrand	def_bool y
592394290cbSDavid Hildenbrand	depends on SPLIT_PTE_PTLOCKS && ARCH_ENABLE_SPLIT_PMD_PTLOCK
593394290cbSDavid Hildenbrand
5947cbe34cfSChristoph Lameter#
59509316c09SKonstantin Khlebnikov# support for memory balloon
59609316c09SKonstantin Khlebnikovconfig MEMORY_BALLOON
5976341e62bSChristoph Jaeger	bool
59809316c09SKonstantin Khlebnikov
59909316c09SKonstantin Khlebnikov#
60018468d93SRafael Aquini# support for memory balloon compaction
60118468d93SRafael Aquiniconfig BALLOON_COMPACTION
60218468d93SRafael Aquini	bool "Allow for balloon memory compaction/migration"
603cd14b018SMasahiro Yamada	default y
60409316c09SKonstantin Khlebnikov	depends on COMPACTION && MEMORY_BALLOON
60518468d93SRafael Aquini	help
60618468d93SRafael Aquini	  Memory fragmentation introduced by ballooning might significantly
60718468d93SRafael Aquini	  reduce the number of 2MB contiguous memory blocks that can be
60818468d93SRafael Aquini	  used within a guest, thus imposing performance penalties associated
60918468d93SRafael Aquini	  with the reduced number of transparent huge pages that could be used
61018468d93SRafael Aquini	  by the guest workload. Allowing the compaction & migration for memory
61118468d93SRafael Aquini	  pages enlisted as being part of memory balloon devices avoids the
61218468d93SRafael Aquini	  aforementioned scenario and helps improve memory defragmentation.
61318468d93SRafael Aquini
61418468d93SRafael Aquini#
615e9e96b39SMel Gorman# support for memory compaction
616e9e96b39SMel Gormanconfig COMPACTION
617e9e96b39SMel Gorman	bool "Allow for memory compaction"
618cd14b018SMasahiro Yamada	default y
619e9e96b39SMel Gorman	select MIGRATION
62033a93877SAndrea Arcangeli	depends on MMU
621e9e96b39SMel Gorman	help
622b32eaf71SMichal Hocko	  Compaction is the only memory management component to form
623b32eaf71SMichal Hocko	  high order (larger physically contiguous) memory blocks
624b32eaf71SMichal Hocko	  reliably. The page allocator relies on compaction heavily and
625b32eaf71SMichal Hocko	  the lack of the feature can lead to unexpected OOM killer
626b32eaf71SMichal Hocko	  invocations for high order memory requests. You shouldn't
627b32eaf71SMichal Hocko	  disable this option unless there really is a strong reason for
628b32eaf71SMichal Hocko	  it and then we would be really interested to hear about that at
629b32eaf71SMichal Hocko	  linux-mm@kvack.org.
630e9e96b39SMel Gorman
631c7e0b3d0SThomas Gleixnerconfig COMPACT_UNEVICTABLE_DEFAULT
632c7e0b3d0SThomas Gleixner	int
633c7e0b3d0SThomas Gleixner	depends on COMPACTION
634c7e0b3d0SThomas Gleixner	default 0 if PREEMPT_RT
635c7e0b3d0SThomas Gleixner	default 1
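#
# Illustration (not part of the upstream file): this value is assumed to be
# the boot-time default of the vm.compact_unevictable_allowed sysctl, so it
# can be inspected or changed at runtime:
#
#   cat /proc/sys/vm/compact_unevictable_allowed
#   sysctl -w vm.compact_unevictable_allowed=0
#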
636c7e0b3d0SThomas Gleixner
637e9e96b39SMel Gorman#
63836e66c55SAlexander Duyck# support for free page reporting
63936e66c55SAlexander Duyckconfig PAGE_REPORTING
64036e66c55SAlexander Duyck	bool "Free page reporting"
64136e66c55SAlexander Duyck	help
64236e66c55SAlexander Duyck	  Free page reporting allows for the incremental acquisition of
64336e66c55SAlexander Duyck	  free pages from the buddy allocator for the purpose of reporting
64436e66c55SAlexander Duyck	  those pages to another entity, such as a hypervisor, so that the
64536e66c55SAlexander Duyck	  memory can be freed within the host for other uses.
64636e66c55SAlexander Duyck
64736e66c55SAlexander Duyck#
6487cbe34cfSChristoph Lameter# support for page migration
6497cbe34cfSChristoph Lameter#
6507cbe34cfSChristoph Lameterconfig MIGRATION
651b20a3503SChristoph Lameter	bool "Page migration"
652cd14b018SMasahiro Yamada	default y
653de32a817SChen Gang	depends on (NUMA || ARCH_ENABLE_MEMORY_HOTREMOVE || COMPACTION || CMA) && MMU
654b20a3503SChristoph Lameter	help
655b20a3503SChristoph Lameter	  Allows the migration of the physical location of pages of processes
656e9e96b39SMel Gorman	  while the virtual addresses are not changed. This is useful in
657e9e96b39SMel Gorman	  two situations. The first is on NUMA systems to put pages nearer
658e9e96b39SMel Gorman	  to the processors accessing them. The second is when allocating huge
659e9e96b39SMel Gorman	  pages as migration can relocate pages to satisfy a huge page
660e9e96b39SMel Gorman	  allocation instead of reclaiming.
6616550e07fSGreg Kroah-Hartman
66276cbbeadSChristoph Hellwigconfig DEVICE_MIGRATION
663d90a25f8SChristoph Hellwig	def_bool MIGRATION && ZONE_DEVICE
66476cbbeadSChristoph Hellwig
665c177c81eSNaoya Horiguchiconfig ARCH_ENABLE_HUGEPAGE_MIGRATION
6666341e62bSChristoph Jaeger	bool
667c177c81eSNaoya Horiguchi
6689c670ea3SNaoya Horiguchiconfig ARCH_ENABLE_THP_MIGRATION
6699c670ea3SNaoya Horiguchi	bool
6709c670ea3SNaoya Horiguchi
6714bfb68a0SAnshuman Khandualconfig HUGETLB_PAGE_SIZE_VARIABLE
6724bfb68a0SAnshuman Khandual	def_bool n
6734bfb68a0SAnshuman Khandual	help
6744bfb68a0SAnshuman Khandual	  Allows the pageblock_order value to be dynamic instead of just standard
6754bfb68a0SAnshuman Khandual	  HUGETLB_PAGE_ORDER when there are multiple HugeTLB page sizes available
6764bfb68a0SAnshuman Khandual	  on a platform.
6774bfb68a0SAnshuman Khandual
6785e0a760bSKirill A. Shutemov	  Note that the pageblock_order cannot exceed MAX_PAGE_ORDER and will be
6795e0a760bSKirill A. Shutemov	  clamped down to MAX_PAGE_ORDER.
680b3d40a2bSDavid Hildenbrand
6818df995f6SAlexandre Ghiticonfig CONTIG_ALLOC
6828df995f6SAlexandre Ghiti	def_bool (MEMORY_ISOLATION && COMPACTION) || CMA
6838df995f6SAlexandre Ghiti
68452166607SHuang Yingconfig PCP_BATCH_SCALE_MAX
68552166607SHuang Ying	int "Maximum scale factor of PCP (Per-CPU pageset) batch allocate/free"
68652166607SHuang Ying	default 5
68752166607SHuang Ying	range 0 6
68852166607SHuang Ying	help
68952166607SHuang Ying	  In page allocator, PCP (Per-CPU pageset) is refilled and drained in
69052166607SHuang Ying	  batches.  The batch number is scaled automatically to improve page
69152166607SHuang Ying	  allocation/free throughput.  But too large a scale factor may hurt
69252166607SHuang Ying	  latency.  This option sets the upper limit of the scale factor to limit
69352166607SHuang Ying	  the maximum latency.
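#
# Worked example (a sketch, assuming the scaled batch is computed as
# "base batch << scale factor"):
#
#   CONFIG_PCP_BATCH_SCALE_MAX=0  ->  at most  1 x the base batch
#   CONFIG_PCP_BATCH_SCALE_MAX=5  ->  at most 32 x the base batch (default)
#   CONFIG_PCP_BATCH_SCALE_MAX=6  ->  at most 64 x the base batch
#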
69452166607SHuang Ying
695600715dcSJeremy Fitzhardingeconfig PHYS_ADDR_T_64BIT
696d4a451d5SChristoph Hellwig	def_bool 64BIT
697600715dcSJeremy Fitzhardinge
6982a7326b5SChristoph Lameterconfig BOUNCE
6999ca24e2eSVinayak Menon	bool "Enable bounce buffers"
7009ca24e2eSVinayak Menon	default y
701ce288e05SChristoph Hellwig	depends on BLOCK && MMU && HIGHMEM
7029ca24e2eSVinayak Menon	help
703ce288e05SChristoph Hellwig	  Enable bounce buffers for devices that cannot access the full range of
704ce288e05SChristoph Hellwig	  memory available to the CPU. Enabled by default when HIGHMEM is
705ce288e05SChristoph Hellwig	  selected, but you may say n to override this.
7062a7326b5SChristoph Lameter
707cddb8a5cSAndrea Arcangeliconfig MMU_NOTIFIER
708cddb8a5cSAndrea Arcangeli	bool
70999cb252fSJason Gunthorpe	select INTERVAL_TREE
710fc4d5c29SDavid Howells
711f8af4da3SHugh Dickinsconfig KSM
712f8af4da3SHugh Dickins	bool "Enable KSM for page merging"
713f8af4da3SHugh Dickins	depends on MMU
71459e1a2f4STimofey Titovets	select XXHASH
715f8af4da3SHugh Dickins	help
716f8af4da3SHugh Dickins	  Enable Kernel Samepage Merging: KSM periodically scans those areas
717f8af4da3SHugh Dickins	  of an application's address space that an app has advised may be
718f8af4da3SHugh Dickins	  mergeable.  When it finds pages of identical content, it replaces
719d0f209f6SHugh Dickins	  the many instances by a single page with that content, so
720f8af4da3SHugh Dickins	  saving memory until one or another app needs to modify the content.
721f8af4da3SHugh Dickins	  Recommended for use with KVM, or with other duplicative applications.
722ee65728eSMike Rapoport	  See Documentation/mm/ksm.rst for more information: KSM is inactive
723c73602adSHugh Dickins	  until a program has madvised that an area is MADV_MERGEABLE, and
724c73602adSHugh Dickins	  root has set /sys/kernel/mm/ksm/run to 1 (if CONFIG_SYSFS is set).
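#
# Illustration (not part of the upstream file) of the two steps named in the
# KSM help text above. "buf" and "len" are hypothetical names for a
# page-aligned region owned by the application:
#
#   /* in the application */
#   madvise(buf, len, MADV_MERGEABLE);
#
#   # as root
#   echo 1 > /sys/kernel/mm/ksm/run
#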
725f8af4da3SHugh Dickins
726e0a94c2aSChristoph Lameterconfig DEFAULT_MMAP_MIN_ADDR
727e0a94c2aSChristoph Lameter	int "Low address space to protect from user allocation"
7286e141546SDavid Howells	depends on MMU
729e0a94c2aSChristoph Lameter	default 4096
730e0a94c2aSChristoph Lameter	help
731e0a94c2aSChristoph Lameter	  This is the portion of low virtual memory which should be protected
732e0a94c2aSChristoph Lameter	  from userspace allocation.  Keeping a user from writing to low pages
733e0a94c2aSChristoph Lameter	  can help reduce the impact of kernel NULL pointer bugs.
734e0a94c2aSChristoph Lameter
73534f7c528SJavier Martinez Canillas	  For most arm64, ppc64 and x86 users with lots of address space
736e0a94c2aSChristoph Lameter	  a value of 65536 is reasonable and should cause no problems.
737e0a94c2aSChristoph Lameter	  On arm and other archs it should not be higher than 32768.
738788084abSEric Paris	  Programs which use vm86 functionality or have some need to map
739788084abSEric Paris	  this low address space will need CAP_SYS_RAWIO or disable this
740788084abSEric Paris	  protection by setting the value to 0.
741e0a94c2aSChristoph Lameter
742e0a94c2aSChristoph Lameter	  This value can be changed after boot using the
743e0a94c2aSChristoph Lameter	  /proc/sys/vm/mmap_min_addr tunable.
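#
# Illustration (not part of the upstream file): a .config fragment using the
# value the help text suggests for arm64, ppc64 and x86, followed by a
# runtime check and override of the tunable:
#
#   CONFIG_DEFAULT_MMAP_MIN_ADDR=65536
#
#   cat /proc/sys/vm/mmap_min_addr
#   sysctl -w vm.mmap_min_addr=65536
#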
744e0a94c2aSChristoph Lameter
745d949f36fSLinus Torvaldsconfig ARCH_SUPPORTS_MEMORY_FAILURE
746d949f36fSLinus Torvalds	bool
747e0a94c2aSChristoph Lameter
7486a46079cSAndi Kleenconfig MEMORY_FAILURE
7496a46079cSAndi Kleen	depends on MMU
750d949f36fSLinus Torvalds	depends on ARCH_SUPPORTS_MEMORY_FAILURE
7516a46079cSAndi Kleen	bool "Enable recovery from hardware memory errors"
75297f0b134SXie XiuQi	select RAS
7536a46079cSAndi Kleen	help
7546a46079cSAndi Kleen	  Enables code to recover from some memory failures on systems
7556a46079cSAndi Kleen	  with MCA recovery. This allows a system to continue running
7566a46079cSAndi Kleen	  even when some of its memory has uncorrected errors. This requires
7576a46079cSAndi Kleen	  special hardware support and typically ECC memory.
7586a46079cSAndi Kleen
759cae681fcSAndi Kleenconfig HWPOISON_INJECT
760413f9efbSAndi Kleen	tristate "HWPoison pages injector"
76127df5068SAndi Kleen	depends on MEMORY_FAILURE && DEBUG_KERNEL && PROC_FS
762478c5ffcSWu Fengguang	select PROC_PAGE_MONITOR
763cae681fcSAndi Kleen
764fc4d5c29SDavid Howellsconfig NOMMU_INITIAL_TRIM_EXCESS
765fc4d5c29SDavid Howells	int "Turn on mmap() excess space trimming before booting"
766fc4d5c29SDavid Howells	depends on !MMU
767fc4d5c29SDavid Howells	default 1
768fc4d5c29SDavid Howells	help
769fc4d5c29SDavid Howells	  The NOMMU mmap() frequently needs to allocate large contiguous chunks
770fc4d5c29SDavid Howells	  of memory on which to store mappings, but it can only ask the system
771fc4d5c29SDavid Howells	  allocator for chunks in 2^N*PAGE_SIZE amounts - which is frequently
772fc4d5c29SDavid Howells	  more than it requires.  To deal with this, mmap() is able to trim off
773fc4d5c29SDavid Howells	  the excess and return it to the allocator.
774fc4d5c29SDavid Howells
775fc4d5c29SDavid Howells	  If trimming is enabled, the excess is trimmed off and returned to the
776fc4d5c29SDavid Howells	  system allocator, which can cause extra fragmentation, particularly
777fc4d5c29SDavid Howells	  if there are a lot of transient processes.
778fc4d5c29SDavid Howells
779fc4d5c29SDavid Howells	  If trimming is disabled, the excess is kept, but not used, which for
780fc4d5c29SDavid Howells	  long-term mappings means that the space is wasted.
781fc4d5c29SDavid Howells
782fc4d5c29SDavid Howells	  Trimming can be dynamically controlled through a sysctl option
783fc4d5c29SDavid Howells	  (/proc/sys/vm/nr_trim_pages) which specifies the minimum number of
784fc4d5c29SDavid Howells	  excess pages there must be before trimming should occur, or zero if
785fc4d5c29SDavid Howells	  no trimming is to occur.
786fc4d5c29SDavid Howells
787fc4d5c29SDavid Howells	  This option specifies the initial value of that sysctl.  The default
788fc4d5c29SDavid Howells	  of 1 says that all excess pages should be trimmed.
789fc4d5c29SDavid Howells
790dd19d293SStephen Kitt	  See Documentation/admin-guide/mm/nommu-mmap.rst for more information.
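#
# Worked example (illustrative numbers, not part of the upstream file): a
# 5-page mapping is served from a 2^3 = 8-page allocation, leaving 3 excess
# pages. With nr_trim_pages = 1 the 3 pages are returned to the allocator.
# With nr_trim_pages = 4 the excess is below the threshold and is kept, and
# with nr_trim_pages = 0 trimming never happens.
#
#   echo 0 > /proc/sys/vm/nr_trim_pages
#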
791bbddff05STejun Heo
792519bcb79SJohannes Weinerconfig ARCH_WANT_GENERAL_HUGETLB
793519bcb79SJohannes Weiner	bool
794519bcb79SJohannes Weiner
795519bcb79SJohannes Weinerconfig ARCH_WANTS_THP_SWAP
796519bcb79SJohannes Weiner	def_bool n
797519bcb79SJohannes Weiner
7982d8bd804SPankaj Raghavconfig PERSISTENT_HUGE_ZERO_FOLIO
7992d8bd804SPankaj Raghav	bool "Allocate a PMD sized folio for zeroing"
8002d8bd804SPankaj Raghav	depends on TRANSPARENT_HUGEPAGE
8012d8bd804SPankaj Raghav	help
8022d8bd804SPankaj Raghav	  Enable this option to reduce the runtime refcounting overhead
8032d8bd804SPankaj Raghav	  of the huge zero folio and expand the places in the kernel
8042d8bd804SPankaj Raghav	  that can use huge zero folios. For instance, block I/O benefits
8052d8bd804SPankaj Raghav	  from access to large folios for zeroing memory.
8062d8bd804SPankaj Raghav
8072d8bd804SPankaj Raghav	  With this option enabled, the huge zero folio is allocated
8082d8bd804SPankaj Raghav	  once and never freed. One full huge page's worth of memory shall
8092d8bd804SPankaj Raghav	  be used.
8102d8bd804SPankaj Raghav
8112d8bd804SPankaj Raghav	  Say Y if your system has lots of memory. Say N if you are
8122d8bd804SPankaj Raghav	  memory constrained.
8132d8bd804SPankaj Raghav
8146af8cb80SDavid Hildenbrandconfig MM_ID
8156af8cb80SDavid Hildenbrand	def_bool n
8166af8cb80SDavid Hildenbrand
817519bcb79SJohannes Weinermenuconfig TRANSPARENT_HUGEPAGE
81813ece886SAndrea Arcangeli	bool "Transparent Hugepage Support"
819554b0f3cSSebastian Andrzej Siewior	depends on HAVE_ARCH_TRANSPARENT_HUGEPAGE && !PREEMPT_RT
8205d689240SAndrea Arcangeli	select COMPACTION
8213a08cd52SMatthew Wilcox	select XARRAY_MULTI
8226af8cb80SDavid Hildenbrand	select MM_ID
8234c76d9d1SAndrea Arcangeli	help
8244c76d9d1SAndrea Arcangeli	  Transparent Hugepages allows the kernel to use huge pages and
8254c76d9d1SAndrea Arcangeli	  huge tlb transparently to the applications whenever possible.
8264c76d9d1SAndrea Arcangeli	  This feature can improve computing performance for certain
8274c76d9d1SAndrea Arcangeli	  applications by speeding up page faults during memory
8284c76d9d1SAndrea Arcangeli	  allocation, by reducing the number of tlb misses and by speeding
8294c76d9d1SAndrea Arcangeli	  up the pagetable walking.
8304c76d9d1SAndrea Arcangeli
8314c76d9d1SAndrea Arcangeli	  On memory-constrained embedded systems, you may want to say N.
8324c76d9d1SAndrea Arcangeli
833519bcb79SJohannes Weinerif TRANSPARENT_HUGEPAGE
834519bcb79SJohannes Weiner
83513ece886SAndrea Arcangelichoice
83613ece886SAndrea Arcangeli	prompt "Transparent Hugepage Support sysfs defaults"
83713ece886SAndrea Arcangeli	depends on TRANSPARENT_HUGEPAGE
83813ece886SAndrea Arcangeli	default TRANSPARENT_HUGEPAGE_ALWAYS
83913ece886SAndrea Arcangeli	help
84013ece886SAndrea Arcangeli	  Selects the sysfs defaults for Transparent Hugepage Support.
84113ece886SAndrea Arcangeli
84213ece886SAndrea Arcangeli	config TRANSPARENT_HUGEPAGE_ALWAYS
84313ece886SAndrea Arcangeli		bool "always"
84413ece886SAndrea Arcangeli	help
84513ece886SAndrea Arcangeli	  Enabling Transparent Hugepage always can increase the
84613ece886SAndrea Arcangeli	  memory footprint of applications without a guaranteed
84713ece886SAndrea Arcangeli	  benefit but it will work automatically for all applications.
84813ece886SAndrea Arcangeli
84913ece886SAndrea Arcangeli	config TRANSPARENT_HUGEPAGE_MADVISE
85013ece886SAndrea Arcangeli		bool "madvise"
85113ece886SAndrea Arcangeli	help
85213ece886SAndrea Arcangeli	  Enabling Transparent Hugepage madvise will only provide a
85313ece886SAndrea Arcangeli	  performance benefit to the applications using
85413ece886SAndrea Arcangeli	  madvise(MADV_HUGEPAGE), but it won't risk increasing the
85513ece886SAndrea Arcangeli	  memory footprint of applications without a guaranteed
85613ece886SAndrea Arcangeli	  benefit.
857683ec99fSDmytro Maluka
858683ec99fSDmytro Maluka	config TRANSPARENT_HUGEPAGE_NEVER
859683ec99fSDmytro Maluka		bool "never"
860683ec99fSDmytro Maluka	help
861683ec99fSDmytro Maluka	  Disable Transparent Hugepage by default. It can still be
862683ec99fSDmytro Maluka	  enabled at runtime via sysfs.
86313ece886SAndrea Arcangeliendchoice
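#
# Illustration (not part of the upstream file): the choice above only picks
# the default of the sysfs knob, which can be read and changed at runtime.
# The file lists always, madvise and never, with the active one in brackets:
#
#   cat /sys/kernel/mm/transparent_hugepage/enabled
#   echo madvise > /sys/kernel/mm/transparent_hugepage/enabled
#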
86413ece886SAndrea Arcangeli
86538d8b4e6SHuang Yingconfig THP_SWAP
86638d8b4e6SHuang Ying	def_bool y
867dad6a5ebSHugh Dickins	depends on TRANSPARENT_HUGEPAGE && ARCH_WANTS_THP_SWAP && SWAP && 64BIT
86838d8b4e6SHuang Ying	help
86938d8b4e6SHuang Ying	  Swap transparent huge pages in one piece, without splitting.
87014fef284SHuang Ying	  XXX: For now, swap cluster backing transparent huge page
87114fef284SHuang Ying	  will be split after swapout.
87238d8b4e6SHuang Ying
87338d8b4e6SHuang Ying	  For selection by architectures with reasonable THP sizes.
87438d8b4e6SHuang Ying
875519bcb79SJohannes Weinerconfig READ_ONLY_THP_FOR_FS
876519bcb79SJohannes Weiner	bool "Read-only THP for filesystems (EXPERIMENTAL)"
877cc79061bSBaolin Wang	depends on TRANSPARENT_HUGEPAGE
878519bcb79SJohannes Weiner
879519bcb79SJohannes Weiner	help
880519bcb79SJohannes Weiner	  Allow khugepaged to put read-only file-backed pages in THP.
881519bcb79SJohannes Weiner
882519bcb79SJohannes Weiner	  This is marked experimental because it is a new feature. Write
883519bcb79SJohannes Weiner	  support of file THPs will be developed in the next few release
884519bcb79SJohannes Weiner	  cycles.
885519bcb79SJohannes Weiner
886e63ee43eSDavid Hildenbrandconfig NO_PAGE_MAPCOUNT
887e63ee43eSDavid Hildenbrand	bool "No per-page mapcount (EXPERIMENTAL)"
888e63ee43eSDavid Hildenbrand	help
889e63ee43eSDavid Hildenbrand	  Do not maintain per-page mapcounts for pages that are part of larger
890e63ee43eSDavid Hildenbrand	  allocations, such as transparent huge pages.
891e63ee43eSDavid Hildenbrand
892e63ee43eSDavid Hildenbrand	  When this config option is enabled, some interfaces that relied on
893e63ee43eSDavid Hildenbrand	  this information will rely on less-precise per-allocation information
894e63ee43eSDavid Hildenbrand	  instead: for example, using the average per-page mapcount in such
895e63ee43eSDavid Hildenbrand	  a large allocation instead of the per-page mapcount.
896e63ee43eSDavid Hildenbrand
897e63ee43eSDavid Hildenbrand	  EXPERIMENTAL because the impact of some changes is still unclear.
898e63ee43eSDavid Hildenbrand
899519bcb79SJohannes Weinerendif # TRANSPARENT_HUGEPAGE
900519bcb79SJohannes Weiner
901e63ee43eSDavid Hildenbrand# simple helper to make the code a bit easier to read
902e63ee43eSDavid Hildenbrandconfig PAGE_MAPCOUNT
903e63ee43eSDavid Hildenbrand	def_bool !NO_PAGE_MAPCOUNT
904e63ee43eSDavid Hildenbrand
905e496cf3dSKirill A. Shutemov#
906ac3830c3SPeter Xu# The architecture supports pgtable leaves that are larger than PAGE_SIZE
907ac3830c3SPeter Xu#
908ac3830c3SPeter Xuconfig PGTABLE_HAS_HUGE_LEAVES
909ac3830c3SPeter Xu	def_bool TRANSPARENT_HUGEPAGE || HUGETLB_PAGE
910ac3830c3SPeter Xu
9116857be5fSPeter Xu# TODO: Allow to be enabled without THP
9126857be5fSPeter Xuconfig ARCH_SUPPORTS_HUGE_PFNMAP
9136857be5fSPeter Xu	def_bool n
9146857be5fSPeter Xu	depends on TRANSPARENT_HUGEPAGE
9156857be5fSPeter Xu
9166857be5fSPeter Xuconfig ARCH_SUPPORTS_PMD_PFNMAP
9176857be5fSPeter Xu	def_bool y
9186857be5fSPeter Xu	depends on ARCH_SUPPORTS_HUGE_PFNMAP && HAVE_ARCH_TRANSPARENT_HUGEPAGE
9196857be5fSPeter Xu
9206857be5fSPeter Xuconfig ARCH_SUPPORTS_PUD_PFNMAP
9216857be5fSPeter Xu	def_bool y
9226857be5fSPeter Xu	depends on ARCH_SUPPORTS_HUGE_PFNMAP && HAVE_ARCH_TRANSPARENT_HUGEPAGE_PUD
9236857be5fSPeter Xu
924ac3830c3SPeter Xu#
92559b5ed40SHao Ge# Architectures that always use weak definitions for percpu
92659b5ed40SHao Ge# variables in modules should set this.
92759b5ed40SHao Ge#
92859b5ed40SHao Geconfig ARCH_MODULE_NEEDS_WEAK_PER_CPU
92959b5ed40SHao Ge       bool
93059b5ed40SHao Ge
93159b5ed40SHao Ge#
932bbddff05STejun Heo# UP and nommu archs use km based percpu allocator
933bbddff05STejun Heo#
934bbddff05STejun Heoconfig NEED_PER_CPU_KM
9353583521aSVladimir Murzin	depends on !SMP || !MMU
936bbddff05STejun Heo	bool
937bbddff05STejun Heo	default y
938077b1f83SDan Magenheimer
9397ecd19cfSKefeng Wangconfig NEED_PER_CPU_EMBED_FIRST_CHUNK
9407ecd19cfSKefeng Wang	bool
9417ecd19cfSKefeng Wang
9427ecd19cfSKefeng Wangconfig NEED_PER_CPU_PAGE_FIRST_CHUNK
9437ecd19cfSKefeng Wang	bool
9447ecd19cfSKefeng Wang
9457ecd19cfSKefeng Wangconfig USE_PERCPU_NUMA_NODE_ID
9467ecd19cfSKefeng Wang	bool
9477ecd19cfSKefeng Wang
9487ecd19cfSKefeng Wangconfig HAVE_SETUP_PER_CPU_AREA
9497ecd19cfSKefeng Wang	bool
9507ecd19cfSKefeng Wang
951f825c736SAneesh Kumar K.Vconfig CMA
952f825c736SAneesh Kumar K.V	bool "Contiguous Memory Allocator"
953aca52c39SMike Rapoport	depends on MMU
954f825c736SAneesh Kumar K.V	select MIGRATION
955f825c736SAneesh Kumar K.V	select MEMORY_ISOLATION
956f825c736SAneesh Kumar K.V	help
957f825c736SAneesh Kumar K.V	  This enables the Contiguous Memory Allocator which allows other
958f825c736SAneesh Kumar K.V	  subsystems to allocate big physically-contiguous blocks of memory.
959f825c736SAneesh Kumar K.V	  CMA reserves a region of memory and allows only movable pages to
960f825c736SAneesh Kumar K.V	  be allocated from it. This way, the kernel can use the memory for
961f825c736SAneesh Kumar K.V	  pagecache and when a subsystem requests a contiguous area, the
962f825c736SAneesh Kumar K.V	  allocated pages are migrated away to serve the contiguous request.
963f825c736SAneesh Kumar K.V
964f825c736SAneesh Kumar K.V	  If unsure, say "n".
965f825c736SAneesh Kumar K.V
96628b24c1fSSasha Levinconfig CMA_DEBUGFS
96728b24c1fSSasha Levin	bool "CMA debugfs interface"
96828b24c1fSSasha Levin	depends on CMA && DEBUG_FS
96928b24c1fSSasha Levin	help
97028b24c1fSSasha Levin	  Turns on the DebugFS interface for CMA.
97128b24c1fSSasha Levin
97243ca106fSMinchan Kimconfig CMA_SYSFS
97343ca106fSMinchan Kim	bool "CMA information through sysfs interface"
97443ca106fSMinchan Kim	depends on CMA && SYSFS
97543ca106fSMinchan Kim	help
97643ca106fSMinchan Kim	  This option exposes some sysfs attributes to get information
97743ca106fSMinchan Kim	  from CMA.
97843ca106fSMinchan Kim
979a254129eSJoonsoo Kimconfig CMA_AREAS
980a254129eSJoonsoo Kim	int "Maximum count of the CMA areas"
981a254129eSJoonsoo Kim	depends on CMA
98273307523SAnshuman Khandual	default 20 if NUMA
98373307523SAnshuman Khandual	default 8
984a254129eSJoonsoo Kim	help
985a254129eSJoonsoo Kim	  CMA allows creating CMA areas for a particular purpose, mainly
986a254129eSJoonsoo Kim	  for use as device-private areas. This parameter sets the maximum
987a254129eSJoonsoo Kim	  number of CMA areas in the system.
988a254129eSJoonsoo Kim
98973307523SAnshuman Khandual	  If unsure, leave the default value "8" in UMA and "20" in NUMA.
990a254129eSJoonsoo Kim
991e13e7922SJuan Yescas#
992e13e7922SJuan Yescas# Select this config option from the architecture Kconfig, if available, to set
993e13e7922SJuan Yescas# the max page order for physically contiguous allocations.
994e13e7922SJuan Yescas#
995e13e7922SJuan Yescasconfig ARCH_FORCE_MAX_ORDER
996e13e7922SJuan Yescas	int
997e13e7922SJuan Yescas
998e13e7922SJuan Yescas#
999e13e7922SJuan Yescas# When ARCH_FORCE_MAX_ORDER is not defined,
1000e13e7922SJuan Yescas# the default page block order is MAX_PAGE_ORDER (10) as per
1001e13e7922SJuan Yescas# include/linux/mmzone.h.
1002e13e7922SJuan Yescas#
10033800d552SZi Yanconfig PAGE_BLOCK_MAX_ORDER
10043800d552SZi Yan	int "Page Block Order Upper Limit"
1005e13e7922SJuan Yescas	range 1 10 if ARCH_FORCE_MAX_ORDER = 0
1006e13e7922SJuan Yescas	default 10 if ARCH_FORCE_MAX_ORDER = 0
1007e13e7922SJuan Yescas	range 1 ARCH_FORCE_MAX_ORDER if ARCH_FORCE_MAX_ORDER != 0
1008e13e7922SJuan Yescas	default ARCH_FORCE_MAX_ORDER if ARCH_FORCE_MAX_ORDER != 0
1009e13e7922SJuan Yescas	help
1010e13e7922SJuan Yescas	  The page block order refers to the power of two number of pages that
1011e13e7922SJuan Yescas	  are physically contiguous and can have a migrate type associated to
10123800d552SZi Yan	  them. The maximum size of the page block order is at least limited by
10133800d552SZi Yan	  ARCH_FORCE_MAX_ORDER/MAX_PAGE_ORDER.
1014e13e7922SJuan Yescas
10153800d552SZi Yan	  This config adds a new upper limit of default page block
10163800d552SZi Yan	  order when the page block order is required to be smaller than
10173800d552SZi Yan	  ARCH_FORCE_MAX_ORDER/MAX_PAGE_ORDER or other limits
10183800d552SZi Yan	  (see include/linux/pageblock-flags.h for details).
1019e13e7922SJuan Yescas
1020e13e7922SJuan Yescas	  Reducing pageblock order can negatively impact THP generation
1021bafa31a1SPaul Menzel	  success rate. If your workloads use THP heavily, please use this
1022e13e7922SJuan Yescas	  option with caution.
1023e13e7922SJuan Yescas
1024e13e7922SJuan Yescas	  Don't change if unsure.
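#
# Worked example (not part of the upstream file), assuming 4 KiB base pages.
# PAGE_SIZE is architecture dependent, so the sizes scale accordingly:
#
#   order 10: 2^10 pages * 4 KiB = 4 MiB per page block
#   order  9: 2^9  pages * 4 KiB = 2 MiB per page block
#
#   CONFIG_PAGE_BLOCK_MAX_ORDER=9
#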
1025e13e7922SJuan Yescas
1026af8d417aSDan Streetmanconfig MEM_SOFT_DIRTY
1027af8d417aSDan Streetman	bool "Track memory changes"
1028af8d417aSDan Streetman	depends on CHECKPOINT_RESTORE && HAVE_ARCH_SOFT_DIRTY && PROC_FS
1029af8d417aSDan Streetman	select PROC_PAGE_MONITOR
10304e2e2770SSeth Jennings	help
1031af8d417aSDan Streetman	  This option enables memory changes tracking by introducing a
1032af8d417aSDan Streetman	  soft-dirty bit on pte-s. This bit is set when someone writes
1033af8d417aSDan Streetman	  into a page, just like the regular dirty bit, but unlike the latter
1034af8d417aSDan Streetman	  it can be cleared by hand.
1035af8d417aSDan Streetman
10361ad1335dSMike Rapoport	  See Documentation/admin-guide/mm/soft-dirty.rst for more details.
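#
# Illustration (not part of the upstream file) of clearing the bit by hand,
# as a sketch of the procedure in soft-dirty.rst. PID is a placeholder for
# the target process:
#
#   echo 4 > /proc/PID/clear_refs     # clear soft-dirty bits of the task
#   # ... let the task run ...
#   # then read /proc/PID/pagemap; bit 55 of each entry is the soft-dirty bit
#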
10374e2e2770SSeth Jennings
10389e5c33d7SMark Salterconfig GENERIC_EARLY_IOREMAP
10399e5c33d7SMark Salter	bool
1040042d27acSHelge Deller
104122ee3ea5SHelge Dellerconfig STACK_MAX_DEFAULT_SIZE_MB
104222ee3ea5SHelge Deller	int "Default maximum user stack size for 32-bit processes (MB)"
104322ee3ea5SHelge Deller	default 100
1044042d27acSHelge Deller	range 8 2048
1045042d27acSHelge Deller	depends on STACK_GROWSUP && (!64BIT || COMPAT)
1046042d27acSHelge Deller	help
1047042d27acSHelge Deller	  This is the maximum stack size in Megabytes in the VM layout of 32-bit
1048042d27acSHelge Deller	  user processes when the stack grows upwards (currently only on parisc
104922ee3ea5SHelge Deller	  arch) when the RLIMIT_STACK hard limit is unlimited.
1050042d27acSHelge Deller
105122ee3ea5SHelge Deller	  A sane initial value is 100 MB.
10523a80a7faSMel Gorman
10533a80a7faSMel Gormanconfig DEFERRED_STRUCT_PAGE_INIT
10541ce22103SVlastimil Babka	bool "Defer initialisation of struct pages to kthreads"
1055d39f8fb4SMike Rapoport	depends on SPARSEMEM
1056ab1e8d89SPavel Tatashin	depends on !NEED_PER_CPU_KM
1057889c695dSPasha Tatashin	depends on 64BIT
1058854fa98dSIlya Leoshkevich	depends on !KMSAN
1059e4443149SDaniel Jordan	select PADATA
10603a80a7faSMel Gorman	help
10613a80a7faSMel Gorman	  Ordinarily all struct pages are initialised during early boot in a
10623a80a7faSMel Gorman	  single thread. On very large machines this can take a considerable
10633a80a7faSMel Gorman	  amount of time. If this option is set, large machines will bring up
1064e4443149SDaniel Jordan	  a subset of memmap at boot and then initialise the rest in parallel.
1065e4443149SDaniel Jordan	  This has a potential performance impact on tasks running early in the
10661ce22103SVlastimil Babka	  lifetime of the system until these kthreads finish the
10671ce22103SVlastimil Babka	  initialisation.
1068033fbae9SDan Williams
10691c676e0dSSeongJae Parkconfig PAGE_IDLE_FLAG
10701c676e0dSSeongJae Park	bool
10711c676e0dSSeongJae Park	select PAGE_EXTENSION if !64BIT
10721c676e0dSSeongJae Park	help
10731c676e0dSSeongJae Park	  This adds PG_idle and PG_young flags to 'struct page'.  PTE Accessed
10741c676e0dSSeongJae Park	  bit writers can set the state of the bit in the flags so that PTE
10751c676e0dSSeongJae Park	  Accessed bit readers may avoid disturbance.
10761c676e0dSSeongJae Park
107733c3fc71SVladimir Davydovconfig IDLE_PAGE_TRACKING
107833c3fc71SVladimir Davydov	bool "Enable idle page tracking"
107933c3fc71SVladimir Davydov	depends on SYSFS && MMU
10801c676e0dSSeongJae Park	select PAGE_IDLE_FLAG
108133c3fc71SVladimir Davydov	help
108233c3fc71SVladimir Davydov	  This feature allows estimating the number of user pages that have
108333c3fc71SVladimir Davydov	  not been touched during a given period of time. This information can
108433c3fc71SVladimir Davydov	  be useful to tune memory cgroup limits and/or for job placement
108533c3fc71SVladimir Davydov	  within a compute cluster.
108633c3fc71SVladimir Davydov
10871ad1335dSMike Rapoport	  See Documentation/admin-guide/mm/idle_page_tracking.rst for
10881ad1335dSMike Rapoport	  more details.
108933c3fc71SVladimir Davydov
# Architectures which implement cpu_dcache_is_aliasing() to query
# whether the data caches are aliased (VIVT or VIPT with dcache
# aliasing) need to select this.
config ARCH_HAS_CPU_CACHE_ALIASING
	bool

config ARCH_HAS_CACHE_LINE_SIZE
	bool

config ARCH_HAS_CURRENT_STACK_POINTER
	bool
	help
	  In support of HARDENED_USERCOPY performing stack variable lifetime
	  checking, an architecture-agnostic way to find the stack pointer
	  is needed. Once an architecture defines an unsigned long global
	  register alias named "current_stack_pointer", this config can be
	  selected.

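# Illustrative sketch, not part of the upstream file: an architecture opts in
# with a "select" in its own Kconfig entry once it provides
# current_stack_pointer. "MYARCH" is a hypothetical symbol used only for this
# example.
#
#   config MYARCH
#           def_bool y
#           select ARCH_HAS_CURRENT_STACK_POINTER
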
config ARCH_HAS_ZONE_DMA_SET
	bool

config ZONE_DMA
	bool "Support DMA zone" if ARCH_HAS_ZONE_DMA_SET
	default y if ARM64 || X86

config ZONE_DMA32
	bool "Support DMA32 zone" if ARCH_HAS_ZONE_DMA_SET
	depends on !X86_32
	default y if ARM64

config ZONE_DEVICE
	bool "Device memory (pmem, HMM, etc...) hotplug support"
	depends on MEMORY_HOTPLUG
	depends on MEMORY_HOTREMOVE
	depends on SPARSEMEM_VMEMMAP
	select XARRAY_MULTI

	help
	  Device memory hotplug support allows for establishing pmem,
	  or other device driver discovered memory regions, in the
	  memmap. This allows pfn_to_page() lookups of otherwise
	  "device-physical" addresses which is needed for using a DAX
	  mapping in an O_DIRECT operation, among other things.

	  If FS_DAX is enabled, then say Y.

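# Illustrative sketch, not part of the upstream file: a .config fragment that
# satisfies the three "depends on" lines above, assuming the architecture
# supports memory hotplug and SPARSEMEM_VMEMMAP.
#
#   CONFIG_MEMORY_HOTPLUG=y
#   CONFIG_MEMORY_HOTREMOVE=y
#   CONFIG_SPARSEMEM_VMEMMAP=y
#   CONFIG_ZONE_DEVICE=y
#   CONFIG_XARRAY_MULTI=y        (selected by ZONE_DEVICE)
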
#
# Helpers to mirror a range of the CPU page tables of a process into device
# page tables.
#
config HMM_MIRROR
	bool
	depends on MMU

config GET_FREE_REGION
	bool

config DEVICE_PRIVATE
	bool "Unaddressable device memory (GPU memory, ...)"
	depends on ZONE_DEVICE
	select GET_FREE_REGION

	help
	  Allows creation of struct pages to represent unaddressable device
	  memory; i.e., memory that is only accessible from the device (or
	  group of devices). You likely also want to select HMM_MIRROR.

config VMAP_PFN
	bool

config ARCH_USES_HIGH_VMA_FLAGS
	bool
config ARCH_HAS_PKEYS
	bool

config ARCH_USES_PG_ARCH_2
	bool
config ARCH_USES_PG_ARCH_3
	bool

config VM_EVENT_COUNTERS
	default y
	bool "Enable VM event counters for /proc/vmstat" if EXPERT
	help
	  VM event counters are needed for event counts to be shown.
	  This option allows the disabling of the VM event counters
	  on EXPERT systems.  /proc/vmstat will only show page counts
	  if VM event counters are disabled.

config PERCPU_STATS
	bool "Collect percpu memory statistics"
	help
	  This feature collects and exposes statistics via debugfs. The
	  information includes global and per chunk statistics, which can
	  be used to help understand percpu memory usage.

config GUP_TEST
	bool "Enable infrastructure for get_user_pages()-related unit tests"
	depends on DEBUG_FS
	help
	  Provides /sys/kernel/debug/gup_test, which in turn provides a way
	  to make ioctl calls that can launch kernel-based unit tests for
	  the get_user_pages*() and pin_user_pages*() family of API calls.

	  These tests include benchmark testing of the _fast variants of
	  get_user_pages*() and pin_user_pages*(), as well as smoke tests of
	  the non-_fast variants.

	  There is also a sub-test that allows running dump_page() on any
	  of up to eight pages (selected by command line args) within the
	  range of user-space addresses. These pages are either pinned via
	  pin_user_pages*(), or pinned via get_user_pages*(), as specified
	  by other command line arguments.

	  See tools/testing/selftests/mm/gup_test.c

comment "GUP_TEST needs to have DEBUG_FS enabled"
	depends on !GUP_TEST && !DEBUG_FS

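# Illustrative sketch, not part of the upstream file: GUP_TEST only becomes
# selectable once DEBUG_FS is set, so a test configuration would carry both
# lines below. The /sys/kernel/debug/gup_test path from the help text assumes
# debugfs is mounted at /sys/kernel/debug.
#
#   CONFIG_DEBUG_FS=y
#   CONFIG_GUP_TEST=y
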
config GUP_GET_PXX_LOW_HIGH
	bool

config DMAPOOL_TEST
	tristate "Enable a module to run time tests on dma_pool"
	depends on HAS_DMA
	help
	  Provides a test module that will allocate and free many blocks of
	  various sizes and report how long it takes. This is intended to
	  provide a consistent way to measure how changes to the
	  dma_pool_alloc/free routines affect performance.

config ARCH_HAS_PTE_SPECIAL
	bool

config MAPPING_DIRTY_HELPERS
	bool

config KMAP_LOCAL
	bool

config KMAP_LOCAL_NON_LINEAR_PTE_ARRAY
	bool

config MEMFD_CREATE
	bool "Enable memfd_create() system call" if EXPERT

config SECRETMEM
	default y
	bool "Enable memfd_secret() system call" if EXPERT
	depends on ARCH_HAS_SET_DIRECT_MAP
	help
	  Enable the memfd_secret() system call with the ability to create
	  memory areas visible only in the context of the owning process and
	  not mapped to other processes and other kernel page tables.

config ANON_VMA_NAME
	bool "Anonymous VMA name support"
	depends on PROC_FS && ADVISE_SYSCALLS && MMU

	help
	  Allow naming anonymous virtual memory areas.

	  This feature allows assigning names to virtual memory areas. Assigned
	  names can later be retrieved from /proc/pid/maps and /proc/pid/smaps
	  and help identify individual anonymous memory areas.
	  Assigning a name to an anonymous virtual memory area might prevent
	  that area from being merged with adjacent virtual memory areas due to
	  the difference in their names.

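# Illustrative sketch, not part of the upstream file: a .config fragment
# meeting the dependencies above, assuming an MMU target. With this set,
# userspace can name a region through prctl(PR_SET_VMA, PR_SET_VMA_ANON_NAME,
# ...); the exact call is outside the scope of this file.
#
#   CONFIG_PROC_FS=y
#   CONFIG_ADVISE_SYSCALLS=y
#   CONFIG_ANON_VMA_NAME=y
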
config HAVE_ARCH_USERFAULTFD_WP
	bool
	help
	  Arch has userfaultfd write protection support

config HAVE_ARCH_USERFAULTFD_MINOR
	bool
	help
	  Arch has userfaultfd minor fault support

menuconfig USERFAULTFD
	bool "Enable userfaultfd() system call"
	depends on MMU
	help
	  Enable the userfaultfd() system call, which allows intercepting and
	  handling page faults in userland.

if USERFAULTFD
config PTE_MARKER_UFFD_WP
	bool "Userfaultfd write protection support for shmem/hugetlbfs"
	default y
	depends on HAVE_ARCH_USERFAULTFD_WP

	help
	  Allows creating marker PTEs for userfaultfd write protection
	  purposes.  It is required to enable userfaultfd write protection on
	  file-backed memory types like shmem and hugetlbfs.
endif # USERFAULTFD

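# Illustrative sketch, not part of the upstream file: the .config lines one
# would expect on an MMU architecture that selects HAVE_ARCH_USERFAULTFD_WP.
# On architectures without that symbol, PTE_MARKER_UFFD_WP is not offered.
#
#   CONFIG_USERFAULTFD=y
#   CONFIG_PTE_MARKER_UFFD_WP=y  (default y inside the USERFAULTFD menu)
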
# multi-gen LRU {
config LRU_GEN
	bool "Multi-Gen LRU"
	depends on MMU
	# make sure folio->flags has enough spare bits
	depends on 64BIT || !SPARSEMEM || SPARSEMEM_VMEMMAP
	help
	  A high performance LRU implementation to overcommit memory. See
	  Documentation/admin-guide/mm/multigen_lru.rst for details.

config LRU_GEN_ENABLED
	bool "Enable by default"
	depends on LRU_GEN
	help
	  This option enables the multi-gen LRU by default.

config LRU_GEN_STATS
	bool "Full stats for debugging"
	depends on LRU_GEN
	help
	  Do not enable this option unless you plan to look at historical stats
	  from evicted generations for debugging purposes.

	  This option has a per-memcg and per-node memory overhead.

config LRU_GEN_WALKS_MMU
	def_bool y
	depends on LRU_GEN && ARCH_HAS_HW_PTE_YOUNG
# }

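# Illustrative sketch, not part of the upstream file: a .config fragment that
# builds the multi-gen LRU, turns it on by default and leaves the debugging
# stats off. LRU_GEN_WALKS_MMU has no prompt; it follows automatically when
# the architecture selects ARCH_HAS_HW_PTE_YOUNG.
#
#   CONFIG_LRU_GEN=y
#   CONFIG_LRU_GEN_ENABLED=y
#   # CONFIG_LRU_GEN_STATS is not set
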
config ARCH_SUPPORTS_PER_VMA_LOCK
	def_bool n

config PER_VMA_LOCK
	def_bool y
	depends on ARCH_SUPPORTS_PER_VMA_LOCK && MMU && SMP
	help
	  Allow per-vma locking during page fault handling.

	  This feature allows locking each virtual memory area separately when
	  handling page faults instead of taking mmap_lock.

config LOCK_MM_AND_FIND_VMA
	bool
	depends on !STACK_GROWSUP

config IOMMU_MM_DATA
	bool

config EXECMEM
	bool

config NUMA_MEMBLKS
	bool

config NUMA_EMU
	bool "NUMA emulation"
	depends on NUMA_MEMBLKS
	depends on X86 || GENERIC_ARCH_NUMA
	help
	  Enable NUMA emulation. A flat machine will be split
	  into virtual nodes when booted with "numa=fake=N", where N is the
	  number of nodes. This is only useful for debugging.

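# Illustrative sketch, not part of the upstream file: enabling the option and
# then asking for four emulated nodes at boot. The value 4 is an arbitrary
# choice for the example; NUMA_MEMBLKS has no prompt and must be selected by
# the architecture.
#
#   CONFIG_NUMA_EMU=y
#
#   kernel command line:  numa=fake=4
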
config ARCH_HAS_USER_SHADOW_STACK
	bool
	help
	  The architecture has hardware support for userspace shadow call
	  stacks (e.g., x86 CET, arm64 GCS or RISC-V Zicfiss).

config ARCH_SUPPORTS_PT_RECLAIM
	def_bool n

config PT_RECLAIM
	bool "Reclaim empty user page table pages"
	default y
	depends on ARCH_SUPPORTS_PT_RECLAIM && MMU && SMP
	select MMU_GATHER_RCU_TABLE_FREE
	help
	  Try to reclaim empty user page table pages in paths other than the
	  munmap and exit_mmap paths.

	  Note: currently, only empty user PTE page table pages are reclaimed.

config FIND_NORMAL_PAGE
	def_bool n

source "mm/damon/Kconfig"

endmenu