xref: /linux/mm/Kconfig (revision af92793e52c3a99b828ed4bdd277fd3e11c18d08)
# SPDX-License-Identifier: GPL-2.0-only

menu "Memory Management options"

#
# For some reason microblaze and nios2 hard code SWAP=n.  Hopefully we can
# add proper SWAP support to them, in which case this can be removed.
#
config ARCH_NO_SWAP
	bool

config ZPOOL
	bool

menuconfig SWAP
	bool "Support for paging of anonymous memory (swap)"
	depends on MMU && BLOCK && !ARCH_NO_SWAP
	default y
	help
	  This option allows you to choose whether you want to have support
	  for so-called swap devices or swap files in your kernel that are
	  used to provide more virtual memory than the actual RAM present
	  in your computer.  If unsure, say Y.

config ZSWAP
	bool "Compressed cache for swap pages"
	depends on SWAP
	select CRYPTO
	select ZPOOL
	help
	  A lightweight compressed cache for swap pages.  It takes
	  pages that are in the process of being swapped out and attempts to
	  compress them into a dynamically allocated RAM-based memory pool.
	  This can result in a significant I/O reduction on the swap device
	  and, in the case where decompressing from RAM is faster than swap
	  device reads, can also improve workload performance.

config ZSWAP_DEFAULT_ON
	bool "Enable the compressed cache for swap pages by default"
	depends on ZSWAP
	help
	  If selected, the compressed cache for swap pages will be enabled
	  at boot, otherwise it will be disabled.

	  The selection made here can be overridden by using the kernel
	  command line 'zswap.enabled=' option.

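# Illustrative example, not part of the upstream help text: assuming a kernel
# built with ZSWAP=y, the boot-time default chosen by ZSWAP_DEFAULT_ON can be
# overridden on the kernel command line, for instance:
#
#   zswap.enabled=1	(turn zswap on when ZSWAP_DEFAULT_ON=n)
#   zswap.enabled=0	(turn zswap off when ZSWAP_DEFAULT_ON=y)
#
# The current state is usually also visible in
# /sys/module/zswap/parameters/enabled.
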
config ZSWAP_SHRINKER_DEFAULT_ON
	bool "Shrink the zswap pool on memory pressure"
	depends on ZSWAP
	default n
	help
	  If selected, the zswap shrinker will be enabled, and the pages
	  stored in the zswap pool will become available for reclaim (i.e.,
	  written back to the backing swap device) on memory pressure.

	  This means that zswap writeback could happen even if the pool is
	  not yet full, or the cgroup zswap limit has not been reached,
	  reducing the chance that cold pages will reside in the zswap pool
	  and consume memory indefinitely.

choice
	prompt "Default compressor"
	depends on ZSWAP
	default ZSWAP_COMPRESSOR_DEFAULT_LZO
	help
	  Selects the default compression algorithm for the compressed cache
	  for swap pages.

	  For an overview of what kind of performance can be expected from
	  a particular compression algorithm, please refer to the benchmarks
	  available at the following LWN page:
	  https://lwn.net/Articles/751795/

	  If in doubt, select 'LZO'.

	  The selection made here can be overridden by using the kernel
	  command line 'zswap.compressor=' option.

config ZSWAP_COMPRESSOR_DEFAULT_DEFLATE
	bool "Deflate"
	select CRYPTO_DEFLATE
	help
	  Use the Deflate algorithm as the default compression algorithm.

config ZSWAP_COMPRESSOR_DEFAULT_LZO
	bool "LZO"
	select CRYPTO_LZO
	help
	  Use the LZO algorithm as the default compression algorithm.

config ZSWAP_COMPRESSOR_DEFAULT_842
	bool "842"
	select CRYPTO_842
	help
	  Use the 842 algorithm as the default compression algorithm.

config ZSWAP_COMPRESSOR_DEFAULT_LZ4
	bool "LZ4"
	select CRYPTO_LZ4
	help
	  Use the LZ4 algorithm as the default compression algorithm.

config ZSWAP_COMPRESSOR_DEFAULT_LZ4HC
	bool "LZ4HC"
	select CRYPTO_LZ4HC
	help
	  Use the LZ4HC algorithm as the default compression algorithm.

config ZSWAP_COMPRESSOR_DEFAULT_ZSTD
	bool "zstd"
	select CRYPTO_ZSTD
	help
	  Use the zstd algorithm as the default compression algorithm.
endchoice

config ZSWAP_COMPRESSOR_DEFAULT
       string
       depends on ZSWAP
       default "deflate" if ZSWAP_COMPRESSOR_DEFAULT_DEFLATE
       default "lzo" if ZSWAP_COMPRESSOR_DEFAULT_LZO
       default "842" if ZSWAP_COMPRESSOR_DEFAULT_842
       default "lz4" if ZSWAP_COMPRESSOR_DEFAULT_LZ4
       default "lz4hc" if ZSWAP_COMPRESSOR_DEFAULT_LZ4HC
       default "zstd" if ZSWAP_COMPRESSOR_DEFAULT_ZSTD
       default ""

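# Sketch of how the compressor choice is expected to resolve, using zstd as
# an arbitrary example (not part of the upstream file). Selecting "zstd" in
# the "Default compressor" choice should produce a .config fragment along
# these lines:
#
#   CONFIG_ZSWAP_COMPRESSOR_DEFAULT_ZSTD=y
#   CONFIG_ZSWAP_COMPRESSOR_DEFAULT="zstd"
#   CONFIG_CRYPTO_ZSTD=y
#
# A different compressor can still be requested at boot with, for example,
# zswap.compressor=lz4, provided that algorithm is available in the kernel.
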
choice
	prompt "Default allocator"
	depends on ZSWAP
	default ZSWAP_ZPOOL_DEFAULT_ZSMALLOC if MMU
	help
	  Selects the default allocator for the compressed cache for
	  swap pages.
	  The default is 'zsmalloc', the only allocator offered in this
	  choice; please read its description below before making a
	  selection.

	  The selection made here can be overridden by using the kernel
	  command line 'zswap.zpool=' option.

config ZSWAP_ZPOOL_DEFAULT_ZSMALLOC
	bool "zsmalloc"
	select ZSMALLOC
	help
	  Use the zsmalloc allocator as the default allocator.
endchoice

config ZSWAP_ZPOOL_DEFAULT
       string
       depends on ZSWAP
       default "zsmalloc" if ZSWAP_ZPOOL_DEFAULT_ZSMALLOC
       default ""

config ZSMALLOC
	tristate
	prompt "N:1 compression allocator (zsmalloc)" if (ZSWAP || ZRAM)
	depends on MMU
	help
	  zsmalloc is a slab-based memory allocator designed to store
	  pages of various compression levels efficiently. It achieves
	  the highest storage density with the least amount of fragmentation.

config ZSMALLOC_STAT
	bool "Export zsmalloc statistics"
	depends on ZSMALLOC
	select DEBUG_FS
	help
	  This option enables code in zsmalloc to collect various
	  statistics about what's happening in zsmalloc and exports that
	  information to userspace via debugfs.
	  If unsure, say N.

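# Illustrative only, not part of the upstream file: with ZSMALLOC_STAT=y and
# debugfs mounted at the usual /sys/kernel/debug, the per-pool statistics are
# expected to appear under a path such as the following, where <pool> is a
# placeholder for the pool name:
#
#   cat /sys/kernel/debug/zsmalloc/<pool>/classes
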
config ZSMALLOC_CHAIN_SIZE
	int "Maximum number of physical pages per zspage"
	default 8
	range 4 16
	depends on ZSMALLOC
	help
	  This option sets the upper limit on the number of physical pages
	  that a zsmalloc page (zspage) can consist of. The optimal zspage
	  chain size is calculated for each size class during the
	  initialization of the pool.

	  Changing this option can alter the characteristics of size classes,
	  such as the number of pages per zspage and the number of objects
	  per zspage. This can also result in different configurations of
	  the pool, as zsmalloc merges size classes with similar
	  characteristics.

	  For more information, see zsmalloc documentation.

menu "Slab allocator options"

config SLUB
	def_bool y
	select IRQ_WORK

config KVFREE_RCU_BATCHED
	def_bool y
	depends on !SLUB_TINY && !TINY_RCU

config SLUB_TINY
	bool "Configure for minimal memory footprint"
	depends on EXPERT && !COMPILE_TEST
	select SLAB_MERGE_DEFAULT
	help
	   Configures the slab allocator in a way to achieve minimal memory
	   footprint, sacrificing scalability, debugging and other features.
	   This is intended only for the smallest systems that had used the
	   SLOB allocator and is not recommended for systems with more than
	   16MB RAM.

	   If unsure, say N.

config SLAB_MERGE_DEFAULT
	bool "Allow slab caches to be merged"
	default y
	help
	  For reduced kernel memory fragmentation, slab caches can be
	  merged when they share the same size and other characteristics.
	  This carries a risk of kernel heap overflows being able to
	  overwrite objects from merged caches (and more easily control
	  cache layout), which makes such heap attacks easier to exploit
	  by attackers. By keeping caches unmerged, these kinds of exploits
	  can usually only damage objects in the same cache. To disable
	  merging at runtime, "slab_nomerge" can be passed on the kernel
	  command line.

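# Example, for illustration only: a kernel built with SLAB_MERGE_DEFAULT=y
# can still keep caches unmerged by appending the following to the kernel
# command line:
#
#   slab_nomerge
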
config SLAB_FREELIST_RANDOM
	bool "Randomize slab freelist"
	depends on !SLUB_TINY
	help
	  Randomizes the freelist order used on creating new pages. This
	  security feature reduces the predictability of the kernel slab
	  allocator against heap overflows.

config SLAB_FREELIST_HARDENED
	bool "Harden slab freelist metadata"
	depends on !SLUB_TINY
	help
	  Many kernel heap attacks try to target slab cache metadata and
	  other infrastructure. This option makes minor performance
	  sacrifices to harden the kernel slab allocator against common
	  freelist exploit methods.

config SLAB_BUCKETS
	bool "Support allocation from separate kmalloc buckets"
	depends on !SLUB_TINY
	default SLAB_FREELIST_HARDENED
	help
	  Kernel heap attacks frequently depend on being able to create
	  specifically-sized allocations with user-controlled contents
	  that will be allocated into the same kmalloc bucket as a
	  target object. To avoid sharing these allocation buckets,
	  provide an explicitly separated set of buckets to be used for
	  user-controlled allocations. This may very slightly increase
	  memory fragmentation, though in practice it's only a handful
	  of extra pages since the bulk of user-controlled allocations
	  are relatively long-lived.

	  If unsure, say Y.

config SLUB_STATS
	default n
	bool "Enable performance statistics"
	depends on SYSFS && !SLUB_TINY
	help
	  The statistics are useful to debug slab allocation behavior in
	  order to find ways to optimize the allocator. This should never be
	  enabled for production use since keeping statistics slows down
	  the allocator by a few percentage points. The slabinfo command
	  supports the determination of the most active slabs to figure
	  out which slabs are relevant to a particular load.
	  Try running: slabinfo -DA

config SLUB_CPU_PARTIAL
	default y
	depends on SMP && !SLUB_TINY
	bool "Enable per cpu partial caches"
	help
	  Per cpu partial caches accelerate object allocation and freeing
	  that is local to a processor at the price of more indeterminism
	  in the latency of the free. On overflow these caches will be
	  cleared, which requires the taking of locks that may cause latency
	  spikes. Typically one would choose no for a realtime system.

config RANDOM_KMALLOC_CACHES
	default n
	depends on !SLUB_TINY
	bool "Randomize slab caches for normal kmalloc"
	help
	  A hardening feature that creates multiple copies of slab caches for
	  normal kmalloc allocation and makes kmalloc randomly pick one based
	  on code address, which makes it more difficult for attackers to
	  spray vulnerable memory objects on the heap for the purpose of
	  exploiting memory vulnerabilities.

	  Currently the number of copies is set to 16, a reasonably large value
	  that effectively diverges the memory objects allocated for different
	  subsystems or modules into different caches, at the expense of a
	  limited degree of memory and CPU overhead that relates to hardware and
	  system workload.

endmenu # Slab allocator options

config SHUFFLE_PAGE_ALLOCATOR
	bool "Page allocator randomization"
	default SLAB_FREELIST_RANDOM && ACPI_NUMA
	help
	  Randomization of the page allocator improves the average
	  utilization of a direct-mapped memory-side-cache. See section
	  5.2.27 Heterogeneous Memory Attribute Table (HMAT) in the ACPI
	  6.2a specification for an example of how a platform advertises
	  the presence of a memory-side-cache. There are also incidental
	  security benefits as it reduces the predictability of page
	  allocations to complement SLAB_FREELIST_RANDOM, but the
	  default granularity of shuffling on the MAX_PAGE_ORDER, i.e., 10th
	  order of pages, is selected based on cache utilization benefits
	  on x86.

	  While the randomization improves cache utilization it may
	  negatively impact workloads on platforms without a cache. For
	  this reason, by default, the randomization is not enabled even
	  if SHUFFLE_PAGE_ALLOCATOR=y. The randomization may be force-enabled
	  with the 'page_alloc.shuffle' kernel command line parameter.

	  Say Y if unsure.

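# Illustrative boot parameter, not part of the upstream file: assuming
# SHUFFLE_PAGE_ALLOCATOR=y on a platform where the randomization is not
# enabled automatically, it can be forced on with:
#
#   page_alloc.shuffle=1
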
config COMPAT_BRK
	bool "Disable heap randomization"
	default y
	help
	  Randomizing heap placement makes heap exploits harder, but it
	  also breaks ancient binaries (including anything libc5 based).
	  This option changes the bootup default to heap randomization
	  disabled, and can be overridden at runtime by setting
	  /proc/sys/kernel/randomize_va_space to 2.

	  On non-ancient distros (post-2000 ones) N is usually a safe choice.

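# Illustrative example, not part of the upstream file: on a kernel built with
# COMPAT_BRK=y, heap randomization can be turned back on at runtime with
# either of the following (run as root):
#
#   echo 2 > /proc/sys/kernel/randomize_va_space
#   sysctl -w kernel.randomize_va_space=2
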
config MMAP_ALLOW_UNINITIALIZED
	bool "Allow mmapped anonymous memory to be uninitialized"
	depends on EXPERT && !MMU
	default n
	help
	  Normally, and according to the Linux spec, anonymous memory obtained
	  from mmap() has its contents cleared before it is passed to
	  userspace.  Enabling this config option allows you to request that
	  mmap() skip that if it is given a MAP_UNINITIALIZED flag, thus
	  providing a huge performance boost.  If this option is not enabled,
	  then the flag will be ignored.

	  This is taken advantage of by uClibc's malloc(), and also by
	  ELF-FDPIC binfmt's brk and stack allocator.

	  Because of the obvious security issues, this option should only be
	  enabled on embedded devices where you control what is run in
	  userspace.  Since that isn't generally a problem on no-MMU systems,
	  it is normally safe to say Y here.

	  See Documentation/admin-guide/mm/nommu-mmap.rst for more information.

config SELECT_MEMORY_MODEL
	def_bool y
	depends on ARCH_SELECT_MEMORY_MODEL

choice
	prompt "Memory model"
	depends on SELECT_MEMORY_MODEL
	default SPARSEMEM_MANUAL if ARCH_SPARSEMEM_DEFAULT
	default FLATMEM_MANUAL
	help
	  This option allows you to change some of the ways that
	  Linux manages its memory internally. Most users will
	  only have one option here selected by the architecture
	  configuration. This is normal.

config FLATMEM_MANUAL
	bool "Flat Memory"
	depends on !ARCH_SPARSEMEM_ENABLE || ARCH_FLATMEM_ENABLE
	help
	  This option is best suited for non-NUMA systems with
	  a flat address space. FLATMEM is the most efficient
	  system in terms of performance and resource consumption
	  and it is the best option for smaller systems.

	  For systems that have holes in their physical address
	  spaces and for features like NUMA and memory hotplug,
	  choose "Sparse Memory".

	  If unsure, choose this option (Flat Memory) over any other.

config SPARSEMEM_MANUAL
	bool "Sparse Memory"
	depends on ARCH_SPARSEMEM_ENABLE
	help
	  This will be the only option for some systems, including
	  memory hot-plug systems.  This is normal.

	  This option provides efficient support for systems with
	  holes in their physical address space and allows memory
	  hot-plug and hot-remove.

	  If unsure, choose "Flat Memory" over this option.

endchoice

config SPARSEMEM
	def_bool y
	depends on (!SELECT_MEMORY_MODEL && ARCH_SPARSEMEM_ENABLE) || SPARSEMEM_MANUAL

config FLATMEM
	def_bool y
	depends on !SPARSEMEM || FLATMEM_MANUAL

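# Reading aid, not part of the upstream file: a summary of how SPARSEMEM and
# FLATMEM are expected to resolve, derived only from the "depends on" lines
# of the memory model symbols in this file.
#
#   ARCH_SELECT_MEMORY_MODEL=y, "Sparse Memory" chosen:
#       SPARSEMEM_MANUAL=y  ->  SPARSEMEM=y, FLATMEM not set
#   ARCH_SELECT_MEMORY_MODEL=y, "Flat Memory" chosen:
#       FLATMEM_MANUAL=y    ->  SPARSEMEM not set, FLATMEM=y
#   ARCH_SELECT_MEMORY_MODEL=n, ARCH_SPARSEMEM_ENABLE=y:
#       SPARSEMEM=y, FLATMEM not set
#   ARCH_SELECT_MEMORY_MODEL=n, ARCH_SPARSEMEM_ENABLE=n:
#       SPARSEMEM not set, FLATMEM=y
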
#
# SPARSEMEM_EXTREME (which is the default) does some bootmem
# allocations when sparse_init() is called.  If this cannot
# be done on your architecture, select this option.  However,
# statically allocating the mem_section[] array can potentially
# consume vast quantities of .bss, so be careful.
#
# This option will also potentially produce smaller runtime code
# with gcc 3.4 and later.
#
config SPARSEMEM_STATIC
	bool

#
# Architecture platforms which require a two level mem_section in SPARSEMEM
# must select this option. This is usually for architecture platforms with
# an extremely sparse physical address space.
#
config SPARSEMEM_EXTREME
	def_bool y
	depends on SPARSEMEM && !SPARSEMEM_STATIC

config SPARSEMEM_VMEMMAP_ENABLE
	bool

config SPARSEMEM_VMEMMAP
	bool "Sparse Memory virtual memmap"
	depends on SPARSEMEM && SPARSEMEM_VMEMMAP_ENABLE
	default y
	help
	  SPARSEMEM_VMEMMAP uses a virtually mapped memmap to optimise
	  pfn_to_page and page_to_pfn operations.  This is the most
	  efficient option when sufficient kernel resources are available.

config SPARSEMEM_VMEMMAP_PREINIT
	bool
#
# Select this config option from the architecture Kconfig, if it is preferred
# to enable the feature of HugeTLB/dev_dax vmemmap optimization.
#
config ARCH_WANT_OPTIMIZE_DAX_VMEMMAP
	bool

config ARCH_WANT_OPTIMIZE_HUGETLB_VMEMMAP
	bool

config ARCH_WANT_HUGETLB_VMEMMAP_PREINIT
	bool

config HAVE_MEMBLOCK_PHYS_MAP
	bool

config HAVE_GUP_FAST
	depends on MMU
	bool

# Enable memblock support for scratch memory which is needed for kexec handover
config MEMBLOCK_KHO_SCRATCH
	bool

# Don't discard allocated memory used to track "memory" and "reserved" memblocks
# after early boot, so it can still be used to test for validity of memory.
# Also, memblocks are updated with memory hot(un)plug.
config ARCH_KEEP_MEMBLOCK
	bool

# Keep arch NUMA mapping infrastructure post-init.
config NUMA_KEEP_MEMINFO
	bool

config MEMORY_ISOLATION
	bool

# IORESOURCE_SYSTEM_RAM regions in the kernel resource tree that are marked
# IORESOURCE_EXCLUSIVE cannot be mapped to user space, for example, via
# /dev/mem.
config EXCLUSIVE_SYSTEM_RAM
	def_bool y
	depends on !DEVMEM || STRICT_DEVMEM

#
# Only set this on architectures that have completely implemented the memory
# hotplug feature. If you are not sure, don't touch it.
#
config HAVE_BOOTMEM_INFO_NODE
	def_bool n

config ARCH_ENABLE_MEMORY_HOTPLUG
	bool

config ARCH_ENABLE_MEMORY_HOTREMOVE
	bool

# eventually, we can have this option just 'select SPARSEMEM'
menuconfig MEMORY_HOTPLUG
	bool "Memory hotplug"
	select MEMORY_ISOLATION
	depends on SPARSEMEM
	depends on ARCH_ENABLE_MEMORY_HOTPLUG
	depends on 64BIT
	select NUMA_KEEP_MEMINFO if NUMA

if MEMORY_HOTPLUG

choice
	prompt "Memory Hotplug Default Online Type"
	default MHP_DEFAULT_ONLINE_TYPE_OFFLINE
	help
	  Default memory type for hotplugged memory.

	  This option sets the default policy setting for memory hotplug
	  onlining policy (/sys/devices/system/memory/auto_online_blocks) which
	  determines what happens to newly added memory regions. Policy setting
	  can always be changed at runtime.

	  The default is 'offline'.

	  Select offline to defer onlining to drivers and user policy.
	  Select auto to let the kernel choose what zones to utilize.
	  Select online_kernel to generally allow kernel usage of this memory.
	  Select online_movable to generally disallow kernel usage of this memory.

	  Example kernel usage would be page structs and page tables.

	  See Documentation/admin-guide/mm/memory-hotplug.rst for more information.

config MHP_DEFAULT_ONLINE_TYPE_OFFLINE
	bool "offline"
	help
	  Hotplugged memory will not be onlined by default.
	  Choose this for systems whose drivers and user policy
	  handle the onlining of hotplugged memory.

config MHP_DEFAULT_ONLINE_TYPE_ONLINE_AUTO
	bool "auto"
	help
	  Select this if you want the kernel to automatically online
	  hotplugged memory into the zone it thinks is reasonable.
	  This memory may be utilized for kernel data.

config MHP_DEFAULT_ONLINE_TYPE_ONLINE_KERNEL
	bool "kernel"
	help
	  Select this if you want the kernel to automatically online
	  hotplugged memory into a zone capable of being used for kernel
	  data. This typically means ZONE_NORMAL.

config MHP_DEFAULT_ONLINE_TYPE_ONLINE_MOVABLE
	bool "movable"
	help
	  Select this if you want the kernel to automatically online
	  hotplug memory into ZONE_MOVABLE. This memory will generally
	  not be utilized for kernel data.

	  This should only be used when the admin knows sufficient
	  ZONE_NORMAL memory is available to describe hotplug memory,
	  otherwise hotplug memory may fail to online. For example,
	  sufficient kernel-capable memory (ZONE_NORMAL) must be
	  available to allocate page structs to describe ZONE_MOVABLE.

endchoice

config MEMORY_HOTREMOVE
	bool "Allow for memory hot remove"
	select HAVE_BOOTMEM_INFO_NODE if (X86_64 || PPC64)
	depends on MEMORY_HOTPLUG && ARCH_ENABLE_MEMORY_HOTREMOVE
	depends on MIGRATION

config MHP_MEMMAP_ON_MEMORY
	def_bool y
	depends on MEMORY_HOTPLUG && SPARSEMEM_VMEMMAP
	depends on ARCH_MHP_MEMMAP_ON_MEMORY_ENABLE

endif # MEMORY_HOTPLUG

config ARCH_MHP_MEMMAP_ON_MEMORY_ENABLE
	bool

# Heavily threaded applications may benefit from splitting the mm-wide
# page_table_lock, so that faults on different parts of the user address
# space can be handled with less contention: split it at this NR_CPUS.
# Default to 4 for wider testing, though 8 might be more appropriate.
# ARM's adjust_pte (unused if VIPT) depends on mm-wide page_table_lock.
# PA-RISC 7xxx's spinlock_t would enlarge struct page from 32 to 44 bytes.
# SPARC32 allocates multiple pte tables within a single page, and therefore
# a per-page lock leads to problems when multiple tables need to be locked
# at the same time (e.g. copy_page_range()).
# DEBUG_SPINLOCK and DEBUG_LOCK_ALLOC spinlock_t also enlarge struct page.
#
config SPLIT_PTE_PTLOCKS
	def_bool y
	depends on MMU
	depends on SMP
	depends on NR_CPUS >= 4
	depends on !ARM || CPU_CACHE_VIPT
	depends on !PARISC || PA20
	depends on !SPARC32

config ARCH_ENABLE_SPLIT_PMD_PTLOCK
	bool

config SPLIT_PMD_PTLOCKS
	def_bool y
	depends on SPLIT_PTE_PTLOCKS && ARCH_ENABLE_SPLIT_PMD_PTLOCK

#
# support for memory balloon
config MEMORY_BALLOON
	bool

#
# support for memory balloon compaction
config BALLOON_COMPACTION
	bool "Allow for balloon memory compaction/migration"
	default y
	depends on COMPACTION && MEMORY_BALLOON
	help
	  Memory fragmentation introduced by ballooning might significantly
	  reduce the number of 2MB contiguous memory blocks that can be
	  used within a guest, imposing performance penalties associated
	  with the reduced number of transparent huge pages available to
	  the guest workload. Allowing compaction and migration of memory
	  pages enlisted in memory balloon devices avoids this scenario
	  and helps improve memory defragmentation.

#
# support for memory compaction
config COMPACTION
	bool "Allow for memory compaction"
	default y
	select MIGRATION
	depends on MMU
	help
	  Compaction is the only memory management component to form
	  high order (larger physically contiguous) memory blocks
	  reliably. The page allocator relies on compaction heavily and
	  the lack of the feature can lead to unexpected OOM killer
	  invocations for high order memory requests. You shouldn't
	  disable this option unless there really is a strong reason for
	  it and then we would be really interested to hear about that at
	  linux-mm@kvack.org.

config COMPACT_UNEVICTABLE_DEFAULT
	int
	depends on COMPACTION
	default 0 if PREEMPT_RT
	default 1

#
# support for free page reporting
config PAGE_REPORTING
	bool "Free page reporting"
	help
	  Free page reporting allows for the incremental acquisition of
	  free pages from the buddy allocator for the purpose of reporting
	  those pages to another entity, such as a hypervisor, so that the
	  memory can be freed within the host for other uses.

#
# support for page migration
#
config MIGRATION
	bool "Page migration"
	default y
	depends on (NUMA || ARCH_ENABLE_MEMORY_HOTREMOVE || COMPACTION || CMA) && MMU
	help
	  Allows the migration of the physical location of pages of processes
	  while the virtual addresses are not changed. This is useful in
	  two situations. The first is on NUMA systems, to put pages nearer
	  to the processors accessing them. The second is when allocating
	  huge pages, as migration can relocate pages to satisfy a huge page
	  allocation instead of reclaiming.

config DEVICE_MIGRATION
	def_bool MIGRATION && ZONE_DEVICE

config ARCH_ENABLE_HUGEPAGE_MIGRATION
	bool

config ARCH_ENABLE_THP_MIGRATION
	bool

config HUGETLB_PAGE_SIZE_VARIABLE
	def_bool n
	help
	  Allows the pageblock_order value to be dynamic instead of just standard
	  HUGETLB_PAGE_ORDER when there are multiple HugeTLB page sizes available
	  on a platform.

	  Note that the pageblock_order cannot exceed MAX_PAGE_ORDER and will be
	  clamped down to MAX_PAGE_ORDER.

config CONTIG_ALLOC
	def_bool (MEMORY_ISOLATION && COMPACTION) || CMA

config PCP_BATCH_SCALE_MAX
	int "Maximum scale factor of PCP (Per-CPU pageset) batch allocate/free"
	default 5
	range 0 6
	help
	  In page allocator, PCP (Per-CPU pageset) is refilled and drained in
	  batches.  The batch number is scaled automatically to improve page
	  allocation/free throughput.  But too large a scale factor may hurt
	  latency.  This option sets the upper limit of the scale factor to
	  limit the maximum latency.

config PHYS_ADDR_T_64BIT
	def_bool 64BIT

config BOUNCE
	bool "Enable bounce buffers"
	default y
	depends on BLOCK && MMU && HIGHMEM
	help
	  Enable bounce buffers for devices that cannot access the full range of
	  memory available to the CPU. Enabled by default when HIGHMEM is
	  selected, but you may say n to override this.

config MMU_NOTIFIER
	bool
	select INTERVAL_TREE

config KSM
	bool "Enable KSM for page merging"
	depends on MMU
	select XXHASH
	help
	  Enable Kernel Samepage Merging: KSM periodically scans those areas
	  of an application's address space that an app has advised may be
	  mergeable.  When it finds pages of identical content, it replaces
	  the many instances by a single page with that content, so
	  saving memory until one or another app needs to modify the content.
	  Recommended for use with KVM, or with other duplicative applications.
	  See Documentation/mm/ksm.rst for more information: KSM is inactive
	  until a program has madvised that an area is MADV_MERGEABLE, and
	  root has set /sys/kernel/mm/ksm/run to 1 (if CONFIG_SYSFS is set).

config DEFAULT_MMAP_MIN_ADDR
	int "Low address space to protect from user allocation"
	depends on MMU
	default 4096
	help
	  This is the portion of low virtual memory which should be protected
	  from userspace allocation.  Keeping a user from writing to low pages
	  can help reduce the impact of kernel NULL pointer bugs.

	  For most arm64, ppc64 and x86 users with lots of address space
	  a value of 65536 is reasonable and should cause no problems.
	  On arm and other archs it should not be higher than 32768.
	  Programs which use vm86 functionality or have some need to map
	  this low address space will need CAP_SYS_RAWIO, or this
	  protection must be disabled by setting the value to 0.

	  This value can be changed after boot using the
	  /proc/sys/vm/mmap_min_addr tunable.

config ARCH_SUPPORTS_MEMORY_FAILURE
	bool

config MEMORY_FAILURE
	depends on MMU
	depends on ARCH_SUPPORTS_MEMORY_FAILURE
	bool "Enable recovery from hardware memory errors"
	select MEMORY_ISOLATION
	select RAS
	help
	  Enables code to recover from some memory failures on systems
	  with MCA recovery. This allows a system to continue running
	  even when some of its memory has uncorrected errors. This requires
	  special hardware support and typically ECC memory.

config HWPOISON_INJECT
	tristate "HWPoison pages injector"
	depends on MEMORY_FAILURE && DEBUG_KERNEL && PROC_FS
	select PROC_PAGE_MONITOR

config NOMMU_INITIAL_TRIM_EXCESS
	int "Turn on mmap() excess space trimming before booting"
	depends on !MMU
	default 1
	help
	  The NOMMU mmap() frequently needs to allocate large contiguous chunks
	  of memory on which to store mappings, but it can only ask the system
	  allocator for chunks in 2^N*PAGE_SIZE amounts - which is frequently
	  more than it requires.  To deal with this, mmap() is able to trim off
	  the excess and return it to the allocator.

	  If trimming is enabled, the excess is trimmed off and returned to the
	  system allocator, which can cause extra fragmentation, particularly
	  if there are a lot of transient processes.

	  If trimming is disabled, the excess is kept, but not used, which for
	  long-term mappings means that the space is wasted.

	  Trimming can be dynamically controlled through a sysctl option
	  (/proc/sys/vm/nr_trim_pages) which specifies the minimum number of
	  excess pages there must be before trimming should occur, or zero if
	  no trimming is to occur.

	  This option specifies the initial value of that sysctl.  The default
	  of 1 says that all excess pages should be trimmed.

	  See Documentation/admin-guide/mm/nommu-mmap.rst for more information.

config ARCH_WANT_GENERAL_HUGETLB
	bool

config ARCH_WANTS_THP_SWAP
	def_bool n

config MM_ID
	def_bool n

menuconfig TRANSPARENT_HUGEPAGE
	bool "Transparent Hugepage Support"
	depends on HAVE_ARCH_TRANSPARENT_HUGEPAGE && !PREEMPT_RT
	select COMPACTION
	select XARRAY_MULTI
	select MM_ID
	help
	  Transparent Hugepages allows the kernel to use huge pages and
	  huge tlb transparently to the applications whenever possible.
	  This feature can improve computing performance for certain
	  applications by speeding up page faults during memory
	  allocation, by reducing the number of tlb misses and by speeding
	  up the pagetable walking.

	  On memory-constrained embedded systems, you may want to say N.

if TRANSPARENT_HUGEPAGE

choice
	prompt "Transparent Hugepage Support sysfs defaults"
	depends on TRANSPARENT_HUGEPAGE
	default TRANSPARENT_HUGEPAGE_ALWAYS
	help
	  Selects the sysfs defaults for Transparent Hugepage Support.

	config TRANSPARENT_HUGEPAGE_ALWAYS
		bool "always"
	help
	  Enabling Transparent Hugepage always can increase the
	  memory footprint of applications without a guaranteed
	  benefit, but it will work automatically for all applications.

	config TRANSPARENT_HUGEPAGE_MADVISE
		bool "madvise"
	help
	  Enabling Transparent Hugepage madvise will only provide a
	  performance improvement to the applications using
	  madvise(MADV_HUGEPAGE), but it won't risk increasing the
	  memory footprint of applications without a guaranteed
	  benefit.

	config TRANSPARENT_HUGEPAGE_NEVER
		bool "never"
	help
	  Disable Transparent Hugepage by default. It can still be
	  enabled at runtime via sysfs.
endchoice

config THP_SWAP
	def_bool y
	depends on TRANSPARENT_HUGEPAGE && ARCH_WANTS_THP_SWAP && SWAP && 64BIT
	help
	  Swap transparent huge pages in one piece, without splitting.
	  XXX: For now, swap cluster backing transparent huge page
	  will be split after swapout.

	  For selection by architectures with reasonable THP sizes.

config READ_ONLY_THP_FOR_FS
	bool "Read-only THP for filesystems (EXPERIMENTAL)"
	depends on TRANSPARENT_HUGEPAGE

	help
	  Allow khugepaged to put read-only file-backed pages in THP.

	  This is marked experimental because it is a new feature. Write
	  support of file THPs will be developed in the next few release
	  cycles.

config NO_PAGE_MAPCOUNT
	bool "No per-page mapcount (EXPERIMENTAL)"
	help
	  Do not maintain per-page mapcounts for pages that are part of
	  larger allocations, such as transparent huge pages.

	  When this config option is enabled, some interfaces that relied on
	  this information will rely on less-precise per-allocation information
	  instead: for example, using the average per-page mapcount in such
	  a large allocation instead of the per-page mapcount.

	  EXPERIMENTAL because the impact of some changes is still unclear.

endif # TRANSPARENT_HUGEPAGE

# simple helper to make the code a bit easier to read
config PAGE_MAPCOUNT
	def_bool !NO_PAGE_MAPCOUNT

#
# The architecture supports pgtable leaves that are larger than PAGE_SIZE
#
config PGTABLE_HAS_HUGE_LEAVES
	def_bool TRANSPARENT_HUGEPAGE || HUGETLB_PAGE

# TODO: Allow to be enabled without THP
config ARCH_SUPPORTS_HUGE_PFNMAP
	def_bool n
	depends on TRANSPARENT_HUGEPAGE

config ARCH_SUPPORTS_PMD_PFNMAP
	def_bool y
	depends on ARCH_SUPPORTS_HUGE_PFNMAP && HAVE_ARCH_TRANSPARENT_HUGEPAGE

config ARCH_SUPPORTS_PUD_PFNMAP
	def_bool y
	depends on ARCH_SUPPORTS_HUGE_PFNMAP && HAVE_ARCH_TRANSPARENT_HUGEPAGE_PUD

#
# Architectures that always use weak definitions for percpu
# variables in modules should set this.
#
config ARCH_MODULE_NEEDS_WEAK_PER_CPU
	bool

#
# UP and nommu archs use km based percpu allocator
#
config NEED_PER_CPU_KM
	depends on !SMP || !MMU
	bool
	default y

config NEED_PER_CPU_EMBED_FIRST_CHUNK
	bool

config NEED_PER_CPU_PAGE_FIRST_CHUNK
	bool

config USE_PERCPU_NUMA_NODE_ID
	bool

config HAVE_SETUP_PER_CPU_AREA
	bool

config CMA
	bool "Contiguous Memory Allocator"
	depends on MMU
	select MIGRATION
	select MEMORY_ISOLATION
	help
	  This enables the Contiguous Memory Allocator which allows other
	  subsystems to allocate big physically-contiguous blocks of memory.
	  CMA reserves a region of memory and allows only movable pages to
	  be allocated from it. This way, the kernel can use the memory for
	  pagecache and when a subsystem requests a contiguous area, the
	  allocated pages are migrated away to serve the contiguous request.

	  If unsure, say "n".

config CMA_DEBUGFS
	bool "CMA debugfs interface"
	depends on CMA && DEBUG_FS
	help
	  Turns on the DebugFS interface for CMA.

config CMA_SYSFS
	bool "CMA information through sysfs interface"
	depends on CMA && SYSFS
	help
	  This option exposes some sysfs attributes to get information
	  from CMA.

config CMA_AREAS
	int "Maximum count of the CMA areas"
	depends on CMA
	default 20 if NUMA
	default 8
	help
	  CMA allows the creation of CMA areas for particular purposes,
	  mainly for use as device-private areas. This parameter sets the
	  maximum number of CMA areas in the system.

	  If unsure, leave the default value "8" in UMA and "20" in NUMA.

#
# Select this config option from the architecture Kconfig, if available, to set
# the max page order for physically contiguous allocations.
#
config ARCH_FORCE_MAX_ORDER
	int

#
# When ARCH_FORCE_MAX_ORDER is not defined,
# the default page block order is MAX_PAGE_ORDER (10) as per
# include/linux/mmzone.h.
#
config PAGE_BLOCK_MAX_ORDER
	int "Page Block Order Upper Limit"
	range 1 10 if ARCH_FORCE_MAX_ORDER = 0
	default 10 if ARCH_FORCE_MAX_ORDER = 0
	range 1 ARCH_FORCE_MAX_ORDER if ARCH_FORCE_MAX_ORDER != 0
	default ARCH_FORCE_MAX_ORDER if ARCH_FORCE_MAX_ORDER != 0
	help
	  The page block order refers to the power of two number of pages that
	  are physically contiguous and can have a migrate type associated with
	  them. The maximum size of the page block order is at least limited by
	  ARCH_FORCE_MAX_ORDER/MAX_PAGE_ORDER.

	  This config adds a new upper limit of default page block
	  order when the page block order is required to be smaller than
	  ARCH_FORCE_MAX_ORDER/MAX_PAGE_ORDER or other limits
	  (see include/linux/pageblock-flags.h for details).

	  Reducing pageblock order can negatively impact THP generation
	  success rate. If your workloads use THP heavily, please use this
	  option with caution.

	  Don't change if unsure.

config MEM_SOFT_DIRTY
	bool "Track memory changes"
	depends on CHECKPOINT_RESTORE && HAVE_ARCH_SOFT_DIRTY && PROC_FS
	select PROC_PAGE_MONITOR
	help
	  This option enables memory change tracking by introducing a
	  soft-dirty bit on PTEs. This bit is set when someone writes
	  into a page, just like the regular dirty bit, but unlike the
	  latter it can be cleared by hand.

	  See Documentation/admin-guide/mm/soft-dirty.rst for more details.

config GENERIC_EARLY_IOREMAP
	bool

config STACK_MAX_DEFAULT_SIZE_MB
	int "Default maximum user stack size for 32-bit processes (MB)"
	default 100
	range 8 2048
	depends on STACK_GROWSUP && (!64BIT || COMPAT)
	help
	  This is the maximum stack size in Megabytes in the VM layout of 32-bit
	  user processes when the stack grows upwards (currently only on parisc
	  arch) when the RLIMIT_STACK hard limit is unlimited.

	  A sane initial value is 100 MB.

config DEFERRED_STRUCT_PAGE_INIT
	bool "Defer initialisation of struct pages to kthreads"
	depends on SPARSEMEM
	depends on !NEED_PER_CPU_KM
	depends on 64BIT
	depends on !KMSAN
	select PADATA
	help
	  Ordinarily all struct pages are initialised during early boot in a
	  single thread. On very large machines this can take a considerable
	  amount of time. If this option is set, large machines will bring up
	  a subset of memmap at boot and then initialise the rest in parallel.
	  This has a potential performance impact on tasks running early in the
	  lifetime of the system until these kthreads finish the
	  initialisation.

10821c676e0dSSeongJae Parkconfig PAGE_IDLE_FLAG
10831c676e0dSSeongJae Park	bool
10841c676e0dSSeongJae Park	select PAGE_EXTENSION if !64BIT
10851c676e0dSSeongJae Park	help
10861c676e0dSSeongJae Park	  This adds PG_idle and PG_young flags to 'struct page'.  PTE Accessed
10871c676e0dSSeongJae Park	  bit writers can set the state of the bit in the flags so that PTE
10881c676e0dSSeongJae Park	  Accessed bit readers may avoid disturbance.
10891c676e0dSSeongJae Park
109033c3fc71SVladimir Davydovconfig IDLE_PAGE_TRACKING
109133c3fc71SVladimir Davydov	bool "Enable idle page tracking"
109233c3fc71SVladimir Davydov	depends on SYSFS && MMU
10931c676e0dSSeongJae Park	select PAGE_IDLE_FLAG
109433c3fc71SVladimir Davydov	help
109533c3fc71SVladimir Davydov	  This feature allows to estimate the amount of user pages that have
109633c3fc71SVladimir Davydov	  not been touched during a given period of time. This information can
109733c3fc71SVladimir Davydov	  be useful to tune memory cgroup limits and/or for job placement
109833c3fc71SVladimir Davydov	  within a compute cluster.
109933c3fc71SVladimir Davydov
11001ad1335dSMike Rapoport	  See Documentation/admin-guide/mm/idle_page_tracking.rst for
11011ad1335dSMike Rapoport	  more details.
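
#
# Illustration only, not part of the upstream file: a minimal .config
# fragment that enables idle page tracking. As described in the document
# referenced above, the resulting bitmap is then exposed through sysfs at
# /sys/kernel/mm/page_idle/bitmap.
#
#   CONFIG_SYSFS=y
#   CONFIG_MMU=y
#   CONFIG_IDLE_PAGE_TRACKING=y
#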

# Architectures which implement cpu_dcache_is_aliasing() to query
# whether the data caches are aliased (VIVT or VIPT with dcache
# aliasing) need to select this.
config ARCH_HAS_CPU_CACHE_ALIASING
	bool

config ARCH_HAS_CACHE_LINE_SIZE
	bool

config ARCH_HAS_CURRENT_STACK_POINTER
	bool
	help
	  In support of HARDENED_USERCOPY performing stack variable lifetime
	  checking, an architecture-agnostic way to find the stack pointer
	  is needed. Once an architecture defines an unsigned long global
	  register alias named "current_stack_pointer", this config can be
	  selected.
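
#
# A minimal sketch of the pattern described above, for illustration only.
# The architecture name (FOO) and the register name ("sp") are hypothetical;
# each architecture names its own stack pointer register.
#
# In the architecture's C headers:
#
#   register unsigned long current_stack_pointer asm("sp");
#
# In arch/foo/Kconfig:
#
#   config FOO
#   	select ARCH_HAS_CURRENT_STACK_POINTER
#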

config ARCH_HAS_ZONE_DMA_SET
	bool

config ZONE_DMA
	bool "Support DMA zone" if ARCH_HAS_ZONE_DMA_SET
	default y if ARM64 || X86

config ZONE_DMA32
	bool "Support DMA32 zone" if ARCH_HAS_ZONE_DMA_SET
	depends on !X86_32
	default y if ARM64

config ZONE_DEVICE
	bool "Device memory (pmem, HMM, etc...) hotplug support"
	depends on MEMORY_HOTPLUG
	depends on MEMORY_HOTREMOVE
	depends on SPARSEMEM_VMEMMAP
	select XARRAY_MULTI

	help
	  Device memory hotplug support allows for establishing pmem,
	  or other device driver discovered memory regions, in the
	  memmap. This allows pfn_to_page() lookups of otherwise
	  "device-physical" addresses which is needed for using a DAX
	  mapping in an O_DIRECT operation, among other things.

	  If FS_DAX is enabled, then say Y.

#
# Helpers to mirror a range of the CPU page tables of a process into device
# page tables.
#
config HMM_MIRROR
	bool
	depends on MMU

config GET_FREE_REGION
	bool

config DEVICE_PRIVATE
	bool "Unaddressable device memory (GPU memory, ...)"
	depends on ZONE_DEVICE
	select GET_FREE_REGION

	help
	  Allows creation of struct pages to represent unaddressable device
	  memory; i.e., memory that is only accessible from the device (or
	  group of devices). You likely also want to select HMM_MIRROR.

config VMAP_PFN
	bool

config ARCH_USES_HIGH_VMA_FLAGS
	bool
config ARCH_HAS_PKEYS
	bool

config ARCH_USES_PG_ARCH_2
	bool
config ARCH_USES_PG_ARCH_3
	bool

config VM_EVENT_COUNTERS
	default y
	bool "Enable VM event counters for /proc/vmstat" if EXPERT
	help
	  VM event counters are needed for event counts to be shown.
	  This option allows the disabling of the VM event counters
	  on EXPERT systems.  /proc/vmstat will only show page counts
	  if VM event counters are disabled.

config PERCPU_STATS
	bool "Collect percpu memory statistics"
	help
	  This feature collects and exposes statistics via debugfs. The
	  information includes global and per chunk statistics, which can
	  be used to help understand percpu memory usage.

config GUP_TEST
	bool "Enable infrastructure for get_user_pages()-related unit tests"
	depends on DEBUG_FS
	help
	  Provides /sys/kernel/debug/gup_test, which in turn provides a way
	  to make ioctl calls that can launch kernel-based unit tests for
	  the get_user_pages*() and pin_user_pages*() family of API calls.

	  These tests include benchmark testing of the _fast variants of
	  get_user_pages*() and pin_user_pages*(), as well as smoke tests of
	  the non-_fast variants.

	  There is also a sub-test that allows running dump_page() on any
	  of up to eight pages (selected by command line args) within the
	  range of user-space addresses. These pages are either pinned via
	  pin_user_pages*(), or pinned via get_user_pages*(), as specified
	  by other command line arguments.

	  See tools/testing/selftests/mm/gup_test.c

comment "GUP_TEST needs to have DEBUG_FS enabled"
	depends on !GUP_TEST && !DEBUG_FS

config GUP_GET_PXX_LOW_HIGH
	bool

config DMAPOOL_TEST
	tristate "Enable a module to run time tests on dma_pool"
	depends on HAS_DMA
	help
	  Provides a test module that will allocate and free many blocks of
	  various sizes and report how long it takes. This is intended to
	  provide a consistent way to measure how changes to the
	  dma_pool_alloc/free routines affect performance.

config ARCH_HAS_PTE_SPECIAL
	bool

config MAPPING_DIRTY_HELPERS
	bool

config KMAP_LOCAL
	bool

config KMAP_LOCAL_NON_LINEAR_PTE_ARRAY
	bool

config MEMFD_CREATE
	bool "Enable memfd_create() system call" if EXPERT

config SECRETMEM
	default y
	bool "Enable memfd_secret() system call" if EXPERT
	depends on ARCH_HAS_SET_DIRECT_MAP
	help
	  Enable the memfd_secret() system call with the ability to create
	  memory areas visible only in the context of the owning process and
	  not mapped to other processes and other kernel page tables.

config ANON_VMA_NAME
	bool "Anonymous VMA name support"
	depends on PROC_FS && ADVISE_SYSCALLS && MMU

	help
	  Allow naming anonymous virtual memory areas.

	  This feature allows assigning names to virtual memory areas. Assigned
	  names can be later retrieved from /proc/pid/maps and /proc/pid/smaps
	  and help identify individual anonymous memory areas.
	  Assigning a name to an anonymous virtual memory area might prevent
	  that area from being merged with adjacent virtual memory areas due
	  to the difference in their names.
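
#
# Illustration only (a sketch, not part of the upstream file): with
# ANON_VMA_NAME enabled, userspace can name an anonymous mapping through
# prctl(). Here addr, len and the name string are placeholders chosen for
# this example:
#
#   prctl(PR_SET_VMA, PR_SET_VMA_ANON_NAME, addr, len, "my_buffer");
#
# A private anonymous mapping named this way should then be listed as
# "[anon:my_buffer]" in /proc/pid/maps.
#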

config HAVE_ARCH_USERFAULTFD_WP
	bool
	help
	  Arch has userfaultfd write protection support.

config HAVE_ARCH_USERFAULTFD_MINOR
	bool
	help
	  Arch has userfaultfd minor fault support.

menuconfig USERFAULTFD
	bool "Enable userfaultfd() system call"
	depends on MMU
	help
	  Enable the userfaultfd() system call that allows intercepting and
	  handling page faults in userland.

if USERFAULTFD
config PTE_MARKER_UFFD_WP
	bool "Userfaultfd write protection support for shmem/hugetlbfs"
	default y
	depends on HAVE_ARCH_USERFAULTFD_WP

	help
	  Allows creating marker PTEs for userfaultfd write protection
	  purposes.  It is required to enable userfaultfd write protection on
	  file-backed memory types like shmem and hugetlbfs.
endif # USERFAULTFD

# multi-gen LRU {
config LRU_GEN
	bool "Multi-Gen LRU"
	depends on MMU
	# make sure folio->flags has enough spare bits
	depends on 64BIT || !SPARSEMEM || SPARSEMEM_VMEMMAP
	help
	  A high performance LRU implementation to overcommit memory. See
	  Documentation/admin-guide/mm/multigen_lru.rst for details.

config LRU_GEN_ENABLED
	bool "Enable by default"
	depends on LRU_GEN
	help
	  This option enables the multi-gen LRU by default.
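
#
# Illustration only: LRU_GEN_ENABLED sets the boot-time default. According to
# Documentation/admin-guide/mm/multigen_lru.rst, the multi-gen LRU can also
# be switched at run time through sysfs, for example:
#
#   echo y >/sys/kernel/mm/lru_gen/enabled
#   cat /sys/kernel/mm/lru_gen/enabled
#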

config LRU_GEN_STATS
	bool "Full stats for debugging"
	depends on LRU_GEN
	help
	  Do not enable this option unless you plan to look at historical stats
	  from evicted generations for debugging purposes.

	  This option has a per-memcg and per-node memory overhead.

config LRU_GEN_WALKS_MMU
	def_bool y
	depends on LRU_GEN && ARCH_HAS_HW_PTE_YOUNG
# }

config ARCH_SUPPORTS_PER_VMA_LOCK
	def_bool n

config PER_VMA_LOCK
	def_bool y
	depends on ARCH_SUPPORTS_PER_VMA_LOCK && MMU && SMP
	help
	  Allow per-vma locking during page fault handling.

	  This feature allows locking each virtual memory area separately when
	  handling page faults instead of taking mmap_lock.
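
#
# A sketch of how ARCH_SUPPORTS_PER_VMA_LOCK and PER_VMA_LOCK interact, for
# illustration only; the architecture name FOO is hypothetical. PER_VMA_LOCK
# has no prompt, so it turns on automatically once its dependencies are met:
#
#   config FOO
#   	select ARCH_SUPPORTS_PER_VMA_LOCK
#
# With MMU=y and SMP=y this yields CONFIG_PER_VMA_LOCK=y in the final .config.
#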

config LOCK_MM_AND_FIND_VMA
	bool
	depends on !STACK_GROWSUP

config IOMMU_MM_DATA
	bool

config EXECMEM
	bool

config NUMA_MEMBLKS
	bool

config NUMA_EMU
	bool "NUMA emulation"
	depends on NUMA_MEMBLKS
	depends on X86 || GENERIC_ARCH_NUMA
	help
	  Enable NUMA emulation. A flat machine will be split
	  into virtual nodes when booted with "numa=fake=N", where N is the
	  number of nodes. This is only useful for debugging.
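
#
# Illustration only: with CONFIG_NUMA_EMU=y, appending the following to the
# kernel command line asks for a flat machine to be split into four virtual
# nodes (the value 4 is an arbitrary choice for this example):
#
#   numa=fake=4
#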

config ARCH_HAS_USER_SHADOW_STACK
	bool
	help
	  The architecture has hardware support for userspace shadow call
	  stacks (e.g., x86 CET, arm64 GCS or RISC-V Zicfiss).

config ARCH_SUPPORTS_PT_RECLAIM
	def_bool n

config PT_RECLAIM
	bool "Reclaim empty user page table pages"
	default y
	depends on ARCH_SUPPORTS_PT_RECLAIM && MMU && SMP
	select MMU_GATHER_RCU_TABLE_FREE
	help
	  Try to reclaim empty user page table pages in paths other than the
	  munmap and exit_mmap paths.

	  Note: currently only empty user PTE page table pages are reclaimed.

source "mm/damon/Kconfig"

endmenu