xref: /linux/Documentation/admin-guide/verify-bugs-and-bisect-regressions.rst (revision 7d32e779eb9add47bfdb4731c4d4ff443a7f7fa6)
1.. SPDX-License-Identifier: (GPL-2.0+ OR CC-BY-4.0)
2.. [see the bottom of this file for redistribution information]
3
4=========================================
5How to verify bugs and bisect regressions
6=========================================
7
8This document describes how to check if some Linux kernel problem occurs in code
9currently supported by developers -- to then explain how to locate the change
10causing the issue, if it is a regression (e.g. did not happen with earlier
11versions).
12
13The text aims at people running kernels from mainstream Linux distributions on
14commodity hardware who want to report a kernel bug to the upstream Linux
15developers. Despite this intent, the instructions work just as well for users
16who are already familiar with building their own kernels: they help avoid
17mistakes occasionally made even by experienced developers.
18
19..
20   Note: if you see this note, you are reading the text's source file. You
21   might want to switch to a rendered version: it makes it a lot easier to
22   read and navigate this document -- especially when you want to look something
23   up in the reference section, then jump back to where you left off.
24..
25   Find the latest rendered version of this text here:
26   https://docs.kernel.org/admin-guide/verify-bugs-and-bisect-regressions.html
27
28The essence of the process (aka 'TL;DR')
29========================================
30
31*[If you are new to building or bisecting Linux, ignore this section and head
32over to the* ':ref:`step-by-step guide <introguide_bissbs>`' *below. It utilizes
33the same commands as this section while describing them in brief fashion. The
34steps are nevertheless easy to follow and together with accompanying entries
35in a reference section mention many alternatives, pitfalls, and additional
36aspects, all of which might be essential in your present case.]*
37
38**In case you want to check if a bug is present in code currently supported by
39developers**, execute just the *preparations* and *segment 1*; while doing so,
40consider the newest Linux kernel you regularly use to be the 'working' kernel.
41In the following example that's assumed to be 6.0, which is why its sources
42will be used to prepare the .config file.
43
44**In case you face a regression**, follow the steps at least till the end of
45*segment 2*. Then you can submit a preliminary report -- or continue with
46*segment 3*, which describes how to perform a bisection needed for a
47full-fledged regression report. In the following example 6.0.13 is assumed to be
48the 'working' kernel and 6.1.5 to be the first 'broken', which is why 6.0
49will be considered the 'good' release and used to prepare the .config file.
50
51* **Preparations**: set up everything to build your own kernels::
52
53    # * Remove any software that depends on externally maintained kernel modules
54    #   or builds any automatically during bootup.
55    # * Ensure Secure Boot permits booting self-compiled Linux kernels.
56    # * If you are not already running the 'working' kernel, reboot into it.
57    # * Install compilers and everything else needed for building Linux.
58    # * Ensure to have 15 Gigabyte free space in your home directory.
59    git clone -o mainline --no-checkout \
60      https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git ~/linux/
61    cd ~/linux/
62    git remote add -t master stable \
63      https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux.git
64    git switch --detach v6.0
65    # * Hint: if you used an existing clone, ensure no stale .config is around.
66    make olddefconfig
67    # * Ensure the former command picked the .config of the 'working' kernel.
68    # * Connect external hardware (USB keys, tokens, ...), start a VM, bring up
69    #   VPNs, mount network shares, and briefly try the feature that is broken.
70    yes '' | make localmodconfig
71    ./scripts/config --set-str CONFIG_LOCALVERSION '-local'
72    ./scripts/config -e CONFIG_LOCALVERSION_AUTO
73    # * Note, when short on storage space, check the guide for an alternative:
74    ./scripts/config -d DEBUG_INFO_NONE -e KALLSYMS_ALL -e DEBUG_KERNEL \
75      -e DEBUG_INFO -e DEBUG_INFO_DWARF_TOOLCHAIN_DEFAULT -e KALLSYMS
76    # * Hint: at this point you might want to adjust the build configuration;
77    #   you'll have to, if you are running Debian.
78    make olddefconfig
79    cp .config ~/kernel-config-working
80
81* **Segment 1**: build a kernel from the latest mainline codebase.
82
83  This among others checks if the problem was fixed already and which developers
84  later need to be told about the problem; in case of a regression, this rules
85  out a .config change as root of the problem.
86
87  a) Checking out latest mainline code::
88
89       cd ~/linux/
90       git switch --discard-changes --detach mainline/master
91
92  b) Build, install, and boot a kernel::
93
94       cp ~/kernel-config-working .config
95       make olddefconfig
96       make -j $(nproc --all)
97       # * Make sure there is enough disk space to hold another kernel:
98       df -h /boot/ /lib/modules/
99       # * Note: on Arch Linux, its derivatives and a few other distributions
100       #   the following commands will do nothing at all or only part of the
101       #   job. See the step-by-step guide for further details.
102       sudo make modules_install
103       command -v installkernel && sudo make install
104       # * Check how much space your self-built kernel actually needs, which
105       #   enables you to make better estimates later:
106       du -ch /boot/*$(make -s kernelrelease)* | tail -n 1
107       du -sh /lib/modules/$(make -s kernelrelease)/
108       # * Hint: the output of the following command will help you pick the
109       #   right kernel from the boot menu:
110       make -s kernelrelease | tee -a ~/kernels-built
111       reboot
112       # * Once booted, ensure you are running the kernel you just built by
113       #   checking if the output of the next two commands matches:
114       tail -n 1 ~/kernels-built
115       uname -r
116       cat /proc/sys/kernel/tainted
117
118  c) Check if the problem occurs with this kernel as well.
119
120* **Segment 2**: ensure the 'good' kernel is also a 'working' kernel.
121
122  This among others verifies the trimmed .config file actually works well, as
123  bisecting with it otherwise would be a waste of time:
124
125  a) Start by checking out the sources of the 'good' version::
126
127       cd ~/linux/
128       git switch --discard-changes --detach v6.0
129
130  b) Build, install, and boot a kernel as described earlier in *segment 1,
131     section b* -- just feel free to skip the 'du' commands, as you have a rough
132     estimate already.
133
134  c) Ensure the feature that regressed with the 'broken' kernel actually works
135     with this one.
136
137* **Segment 3**: perform and validate the bisection.
138
139  a) Retrieve the sources for your 'bad' version::
140
141       git remote set-branches --add stable linux-6.1.y
142       git fetch stable
143
144  b) Initialize the bisection::
145
146       cd ~/linux/
147       git bisect start
148       git bisect good v6.0
149       git bisect bad v6.1.5
150
151  c) Build, install, and boot a kernel as described earlier in *segment 1,
152     section b*.
153
154     In case building or booting the kernel fails for unrelated reasons, run
155     ``git bisect skip``. In all other outcomes, check if the regressed feature
156     works with the newly built kernel. If it does, tell Git by executing
157     ``git bisect good``; if it does not, run ``git bisect bad`` instead.
158
159     All three commands will make Git check out another commit; then re-execute
160     this step (e.g. build, install, boot, and test a kernel to then tell Git
161     the outcome). Do so again and again until Git shows which commit broke
162     things. If you run short of disk space during this process, check the
163     section 'Complementary tasks: cleanup during and after the process'
164     below.
165
166  d) Once your finished the bisection, put a few things away::
167
168       cd ~/linux/
169       git bisect log > ~/bisect-log
170       cp .config ~/bisection-config-culprit
171       git bisect reset
172
173  e) Try to verify the bisection result::
174
175       git switch --discard-changes --detach mainline/master
176       git revert --no-edit cafec0cacaca0
177       cp ~/kernel-config-working .config
178       ./scripts/config --set-str CONFIG_LOCALVERSION '-local-cafec0cacaca0-reverted'
179
180    This is optional, as some commits are impossible to revert. But if the
181    second command worked flawlessly, build, install, and boot one more kernel
182    kernel; just this time skip the first command copying the base .config file
183    over, as that already has been taken care off.
184
185* **Complementary tasks**: cleanup during and after the process.
186
187  a) To avoid running out of disk space during a bisection, you might need to
188     remove some kernels you built earlier. You most likely want to keep those
189     you built during segment 1 and 2 around for a while, but you will most
190     likely no longer need kernels tested during the actual bisection
191     (Segment 3 c). You can list them in build order using::
192
193       ls -ltr /lib/modules/*-local*
194
195    To then for example erase a kernel that identifies itself as
196    '6.0-rc1-local-gcafec0cacaca0', use this::
197
198       sudo rm -rf /lib/modules/6.0-rc1-local-gcafec0cacaca0
199       sudo kernel-install -v remove 6.0-rc1-local-gcafec0cacaca0
200       # * Note, on some distributions kernel-install is missing
201       #   or does only part of the job.
202
203  b) If you performed a bisection and successfully validated the result, feel
204     free to remove all kernels built during the actual bisection (Segment 3 c);
205     the kernels you built earlier and later you might want to keep around for
206     a week or two.
207
208* **Optional task**: test a debug patch or a proposed fix later::
209
210    git fetch mainline
211    git switch --discard-changes --detach mainline/master
212    git apply /tmp/foobars-proposed-fix-v1.patch
213    cp ~/kernel-config-working .config
214    ./scripts/config --set-str CONFIG_LOCALVERSION '-local-foobars-fix-v1'
215
216  Build, install, and boot a kernel as described in *segment 1, section b* --
217  but this time omit the first command copying the build configuration over,
218  as that has been taken care of already.
219
220.. _introguide_bissbs:
221
222Step-by-step guide on how to verify bugs and bisect regressions
223===============================================================
224
225This guide describes how to set up your own Linux kernels for investigating bugs
226or regressions you intend to report. How far you want to follow the instructions
227depends on your issue:
228
229Execute all steps till the end of *segment 1* to **verify if your kernel problem
230is present in code supported by Linux kernel developers**. If it is, you are all
231set to report the bug -- unless it did not happen with earlier kernel versions,
232as then your want to at least continue with *segment 2* to **check if the issue
233qualifies as regression** which receive priority treatment. Depending on the
234outcome you then are ready to report a bug or submit a preliminary regression
235report; instead of the latter your could also head straight on and follow
236*segment 3* to **perform a bisection** for a full-fledged regression report
237developers are obliged to act upon.
238
239 :ref:`Preparations: set up everything to build your own kernels <introprep_bissbs>`.
240
241 :ref:`Segment 1: try to reproduce the problem with the latest codebase <introlatestcheck_bissbs>`.
242
243 :ref:`Segment 2: check if the kernels you build work fine <introworkingcheck_bissbs>`.
244
245 :ref:`Segment 3: perform a bisection and validate the result <introbisect_bissbs>`.
246
247 :ref:`Complementary tasks: cleanup during and after following this guide <introclosure_bissbs>`.
248
249 :ref:`Optional tasks: test reverts, patches, or later versions <introoptional_bissbs>`.
250
251The steps in each segment illustrate the important aspects of the process, while
252a comprehensive reference section holds additional details for almost all of the
253steps. The reference section sometimes also outlines alternative approaches,
254pitfalls, as well as problems that might occur at the particular step -- and how
255to get things rolling again.
256
257For further details on how to report Linux kernel issues or regressions check
258out Documentation/admin-guide/reporting-issues.rst, which works in conjunction
259with this document. It among others explains why you need to verify bugs with
260the latest 'mainline' kernel (e.g. versions like 6.0, 6.1-rc1, or 6.1-rc6),
261even if you face a problem with a kernel from a 'stable/longterm' series
262(say 6.0.13).
263
264For users facing a regression that document also explains why sending a
265preliminary report after segment 2 might be wise, as the regression and its
266culprit might be known already. For further details on what actually qualifies
267as a regression check out Documentation/admin-guide/reporting-regressions.rst.
268
269If you run into any problems while following this guide or have ideas how to
270improve it, :ref:`please let the kernel developers know <submit_improvements>`.
271
272.. _introprep_bissbs:
273
274Preparations: set up everything to build your own kernels
275---------------------------------------------------------
276
277The following steps lay the groundwork for all further tasks.
278
279Note: the instructions assume you are building and testing on the same
280machine; if you want to compile the kernel on another system, check
281:ref:`Build kernels on a different machine <buildhost_bis>` below.
282
283.. _backup_bissbs:
284
285* Create a fresh backup and put system repair and restore tools at hand, just
286  to be prepared for the unlikely case of something going sideways.
287
288  [:ref:`details <backup_bisref>`]
289
290.. _vanilla_bissbs:
291
292* Remove all software that depends on externally developed kernel drivers or
293  builds them automatically. That includes but is not limited to DKMS, openZFS,
294  VirtualBox, and Nvidia's graphics drivers (including the GPLed kernel module).
295
296  [:ref:`details <vanilla_bisref>`]
297
298.. _secureboot_bissbs:
299
300* On platforms with 'Secure Boot' or similar solutions, prepare everything to
301  ensure the system will permit your self-compiled kernel to boot. The
302  quickest and easiest way to achieve this on commodity x86 systems is to
303  disable such techniques in the BIOS setup utility; alternatively, remove
304  their restrictions through a process initiated by
305  ``mokutil --disable-validation``.
306
307  [:ref:`details <secureboot_bisref>`]
308
309.. _rangecheck_bissbs:
310
311* Determine the kernel versions considered 'good' and 'bad' throughout this
312  guide:
313
314  * Do you follow this guide to verify if a bug is present in the code the
315    primary developers care for? Then consider the version of the newest kernel
316    you regularly use currently as 'good' (e.g. 6.0, 6.0.13, or 6.1-rc2).
317
318  * Do you face a regression, e.g. something broke or works worse after
319    switching to a newer kernel version? In that case it depends on the version
320    range during which the problem appeared:
321
322    * Something regressed when updating from a stable/longterm release
323      (say 6.0.13) to a newer mainline series (like 6.1-rc7 or 6.1) or a
324      stable/longterm version based on one (say 6.1.5)? Then consider the
325      mainline release your working kernel is based on to be the 'good'
326      version (e.g. 6.0) and the first version to be broken as the 'bad' one
327      (e.g. 6.1-rc7, 6.1, or 6.1.5). Note, at this point it is merely assumed
328      that 6.0 is fine; this hypothesis will be checked in segment 2.
329
330    * Something regressed when switching from one mainline version (say 6.0) to
331      a later one (like 6.1-rc1) or a stable/longterm release based on it
332      (say 6.1.5)? Then regard the last working version (e.g. 6.0) as 'good' and
333      the first broken (e.g. 6.1-rc1 or 6.1.5) as 'bad'.
334
335    * Something regressed when updating within a stable/longterm series (say
336      from 6.0.13 to 6.0.15)? Then consider those versions as 'good' and 'bad'
337      (e.g. 6.0.13 and 6.0.15), as you need to bisect within that series.
338
339  *Note, do not confuse 'good' version with 'working' kernel; the latter term
340  throughout this guide will refer to the last kernel that has been working
341  fine.*
342
343  [:ref:`details <rangecheck_bisref>`]
344
345.. _bootworking_bissbs:
346
347* Boot into the 'working' kernel and briefly use the apparently broken feature.
348
349  [:ref:`details <bootworking_bisref>`]
350
351.. _diskspace_bissbs:
352
353* Ensure to have enough free space for building Linux. 15 Gigabyte in your home
354  directory should typically suffice. If you have less available, be sure to pay
355  attention to later steps about retrieving the Linux sources and handling of
356  debug symbols: both explain approaches reducing the amount of space, which
357  should allow you to master these tasks with about 4 Gigabytes free space.
358
359  [:ref:`details <diskspace_bisref>`]
360
361.. _buildrequires_bissbs:
362
363* Install all software required to build a Linux kernel. Often you will need:
364  'bc', 'binutils' ('ld' et al.), 'bison', 'flex', 'gcc', 'git', 'openssl',
365  'pahole', 'perl', and the development headers for 'libelf' and 'openssl'. The
366  reference section shows how to quickly install those on various popular Linux
367  distributions.
368
369  [:ref:`details <buildrequires_bisref>`]
370
371.. _sources_bissbs:
372
373* Retrieve the mainline Linux sources; then change into the directory holding
374  them, as all further commands in this guide are meant to be executed from
375  there.
376
377  *Note, the following describe how to retrieve the sources using a full
378  mainline clone, which downloads about 2,75 GByte as of early 2024. The*
379  :ref:`reference section describes two alternatives <sources_bisref>` *:
380  one downloads less than 500 MByte, the other works better with unreliable
381  internet connections.*
382
383  Execute the following command to retrieve a fresh mainline codebase while
384  preparing things to add branches for stable/longterm series later::
385
386    git clone -o mainline --no-checkout \
387      https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git ~/linux/
388    cd ~/linux/
389    git remote add -t master stable \
390      https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux.git
391
392  [:ref:`details <sources_bisref>`]
393
394.. _stablesources_bissbs:
395
396* Is one of the versions you earlier established as 'good' or 'bad' a stable or
397  longterm release (say 6.1.5)? Then download the code for the series it belongs
398  to ('linux-6.1.y' in this example)::
399
400    git remote set-branches --add stable linux-6.1.y
401    git fetch stable
402
403.. _oldconfig_bissbs:
404
405* Start preparing a kernel build configuration (the '.config' file).
406
407  Before doing so, ensure you are still running the 'working' kernel an earlier
408  step told you to boot; if you are unsure, check the current kernelrelease
409  identifier using ``uname -r``.
410
411  Afterwards check out the source code for the version earlier established as
412  'good'. In the following example command this is assumed to be 6.0; note that
413  the version number in this and all later Git commands needs to be prefixed
414  with a 'v'::
415
416    git switch --discard-changes --detach v6.0
417
418  Now create a build configuration file::
419
420    make olddefconfig
421
422  The kernel build scripts then will try to locate the build configuration file
423  for the running kernel and then adjust it for the needs of the kernel sources
424  you checked out. While doing so, it will print a few lines you need to check.
425
426  Look out for a line starting with '# using defaults found in'. It should be
427  followed by a path to a file in '/boot/' that contains the release identifier
428  of your currently working kernel. If the line instead continues with something
429  like 'arch/x86/configs/x86_64_defconfig', then the build infra failed to find
430  the .config file for your running kernel -- in which case you have to put one
431  there manually, as explained in the reference section.
432
433  In case you can not find such a line, look for one containing '# configuration
434  written to .config'. If that's the case you have a stale build configuration
435  lying around. Unless you intend to use it, delete it; afterwards run
436  'make olddefconfig' again and check if it now picked up the right config file
437  as base.
438
439  [:ref:`details <oldconfig_bisref>`]
440
441.. _localmodconfig_bissbs:
442
443* Disable any kernel modules apparently superfluous for your setup. This is
444  optional, but especially wise for bisections, as it speeds up the build
445  process enormously -- at least unless the .config file picked up in the
446  previous step was already tailored to your and your hardware needs, in which
447  case you should skip this step.
448
449  To prepare the trimming, connect external hardware you occasionally use (USB
450  keys, tokens, ...), quickly start a VM, and bring up VPNs. And if you rebooted
451  since you started that guide, ensure that you tried using the feature causing
452  trouble since you started the system. Only then trim your .config::
453
454     yes '' | make localmodconfig
455
456  There is a catch to this, as the 'apparently' in initial sentence of this step
457  and the preparation instructions already hinted at:
458
459  The 'localmodconfig' target easily disables kernel modules for features only
460  used occasionally -- like modules for external peripherals not yet connected
461  since booting, virtualization software not yet utilized, VPN tunnels, and a
462  few other things. That's because some tasks rely on kernel modules Linux only
463  loads when you execute tasks like the aforementioned ones for the first time.
464
465  This drawback of localmodconfig is nothing you should lose sleep over, but
466  something to keep in mind: if something is misbehaving with the kernels built
467  during this guide, this is most likely the reason. You can reduce or nearly
468  eliminate the risk with tricks outlined in the reference section; but when
469  building a kernel just for quick testing purposes this is usually not worth
470  spending much effort on, as long as it boots and allows to properly test the
471  feature that causes trouble.
472
473  [:ref:`details <localmodconfig_bisref>`]
474
475.. _tagging_bissbs:
476
477* Ensure all the kernels you will build are clearly identifiable using a special
478  tag and a unique version number::
479
480    ./scripts/config --set-str CONFIG_LOCALVERSION '-local'
481    ./scripts/config -e CONFIG_LOCALVERSION_AUTO
482
483  [:ref:`details <tagging_bisref>`]
484
485.. _debugsymbols_bissbs:
486
487* Decide how to handle debug symbols.
488
489  In the context of this document it is often wise to enable them, as there is a
490  decent chance you will need to decode a stack trace from a 'panic', 'Oops',
491  'warning', or 'BUG'::
492
493    ./scripts/config -d DEBUG_INFO_NONE -e KALLSYMS_ALL -e DEBUG_KERNEL \
494      -e DEBUG_INFO -e DEBUG_INFO_DWARF_TOOLCHAIN_DEFAULT -e KALLSYMS
495
496  But if you are extremely short on storage space, you might want to disable
497  debug symbols instead::
498
499    ./scripts/config -d DEBUG_INFO -d DEBUG_INFO_DWARF_TOOLCHAIN_DEFAULT \
500      -d DEBUG_INFO_DWARF4 -d DEBUG_INFO_DWARF5 -e CONFIG_DEBUG_INFO_NONE
501
502  [:ref:`details <debugsymbols_bisref>`]
503
504.. _configmods_bissbs:
505
506* Check if you may want or need to adjust some other kernel configuration
507  options:
508
509  * Are you running Debian? Then you want to avoid known problems by performing
510    additional adjustments explained in the reference section.
511
512    [:ref:`details <configmods_distros_bisref>`].
513
514  * If you want to influence other aspects of the configuration, do so now using
515    your preferred tool. Note, to use make targets like 'menuconfig' or
516    'nconfig', you will need to install the development files of ncurses; for
517    'xconfig' you likewise need the Qt5 or Qt6 headers.
518
519    [:ref:`details <configmods_individual_bisref>`].
520
521.. _saveconfig_bissbs:
522
523* Reprocess the .config after the latest adjustments and store it in a safe
524  place::
525
526     make olddefconfig
527     cp .config ~/kernel-config-working
528
529  [:ref:`details <saveconfig_bisref>`]
530
531.. _introlatestcheck_bissbs:
532
533Segment 1: try to reproduce the problem with the latest codebase
534----------------------------------------------------------------
535
536The following steps verify if the problem occurs with the code currently
537supported by developers. In case you face a regression, it also checks that the
538problem is not caused by some .config change, as reporting the issue then would
539be a waste of time. [:ref:`details <introlatestcheck_bisref>`]
540
541.. _checkoutmaster_bissbs:
542
543* Check out the latest Linux codebase.
544
545  * Are your 'good' and 'bad' versions from the same stable or longterm series?
546    Then check the `front page of kernel.org <https://kernel.org/>`_: if it
547    lists a release from that series without an '[EOL]' tag, checkout the series
548    latest version ('linux-6.1.y' in the following example)::
549
550      cd ~/linux/
551      git switch --discard-changes --detach stable/linux-6.1.y
552
553    Your series is unsupported, if is not listed or carrying a 'end of life'
554    tag. In that case you might want to check if a successor series (say
555    linux-6.2.y) or mainline (see next point) fix the bug.
556
557  * In all other cases, run::
558
559      cd ~/linux/
560      git switch --discard-changes --detach mainline/master
561
562  [:ref:`details <checkoutmaster_bisref>`]
563
564.. _build_bissbs:
565
566* Build the image and the modules of your first kernel using the config file you
567  prepared::
568
569    cp ~/kernel-config-working .config
570    make olddefconfig
571    make -j $(nproc --all)
572
573  If you want your kernel packaged up as deb, rpm, or tar file, see the
574  reference section for alternatives, which obviously will require other
575  steps to install as well.
576
577  [:ref:`details <build_bisref>`]
578
579.. _install_bissbs:
580
581* Install your newly built kernel.
582
583  Before doing so, consider checking if there is still enough space for it::
584
585    df -h /boot/ /lib/modules/
586
587  For now assume 150 MByte in /boot/ and 200 in /lib/modules/ will suffice; how
588  much your kernels actually require will be determined later during this guide.
589
590  Now install the kernel's modules and its image, which will be stored in
591  parallel to the your Linux distribution's kernels::
592
593    sudo make modules_install
594    command -v installkernel && sudo make install
595
596  The second command ideally will take care of three steps required at this
597  point: copying the kernel's image to /boot/, generating an initramfs, and
598  adding an entry for both to the boot loader's configuration.
599
600  Sadly some distributions (among them Arch Linux, its derivatives, and many
601  immutable Linux distributions) will perform none or only some of those tasks.
602  You therefore want to check if all of them were taken care of and manually
603  perform those that were not. The reference section provides further details on
604  that; your distribution's documentation might help, too.
605
606  Once you figured out the steps needed at this point, consider writing them
607  down: if you will build more kernels as described in segment 2 and 3, you will
608  have to perform those again after executing ``command -v installkernel [...]``.
609
610  [:ref:`details <install_bisref>`]
611
612.. _storagespace_bissbs:
613
614* In case you plan to follow this guide further, check how much storage space
615  the kernel, its modules, and other related files like the initramfs consume::
616
617    du -ch /boot/*$(make -s kernelrelease)* | tail -n 1
618    du -sh /lib/modules/$(make -s kernelrelease)/
619
620  Write down or remember those two values for later: they enable you to prevent
621  running out of disk space accidentally during a bisection.
622
623  [:ref:`details <storagespace_bisref>`]
624
625.. _kernelrelease_bissbs:
626
627* Show and store the kernelrelease identifier of the kernel you just built::
628
629    make -s kernelrelease | tee -a ~/kernels-built
630
631  Remember the identifier momentarily, as it will help you pick the right kernel
632  from the boot menu upon restarting.
633
634* Reboot into your newly built kernel. To ensure your actually started the one
635  you just built, you might want to verify if the output of these commands
636  matches::
637
638    tail -n 1 ~/kernels-built
639    uname -r
640
641.. _tainted_bissbs:
642
643* Check if the kernel marked itself as 'tainted'::
644
645    cat /proc/sys/kernel/tainted
646
647  If that command does not return '0', check the reference section, as the cause
648  for this might interfere with your testing.
649
650  [:ref:`details <tainted_bisref>`]
651
652.. _recheckbroken_bissbs:
653
654* Verify if your bug occurs with the newly built kernel. If it does not, check
655  out the instructions in the reference section to ensure nothing went sideways
656  during your tests.
657
658  [:ref:`details <recheckbroken_bisref>`]
659
660.. _recheckstablebroken_bissbs:
661
662* Did you just built a stable or longterm kernel? And were you able to reproduce
663  the regression with it? Then you should test the latest mainline codebase as
664  well, because the result determines which developers the bug must be submitted
665  to.
666
667  To prepare that test, check out current mainline::
668
669    cd ~/linux/
670    git switch --discard-changes --detach mainline/master
671
672  Now use the checked out code to build and install another kernel using the
673  commands the earlier steps already described in more detail::
674
675    cp ~/kernel-config-working .config
676    make olddefconfig
677    make -j $(nproc --all)
678    # * Check if the free space suffices holding another kernel:
679    df -h /boot/ /lib/modules/
680    sudo make modules_install
681    command -v installkernel && sudo make install
682    make -s kernelrelease | tee -a ~/kernels-built
683    reboot
684
685  Confirm you booted the kernel you intended to start and check its tainted
686  status::
687
688    tail -n 1 ~/kernels-built
689    uname -r
690    cat /proc/sys/kernel/tainted
691
692  Now verify if this kernel is showing the problem. If it does, then you need
693  to report the bug to the primary developers; if it does not, report it to the
694  stable team. See Documentation/admin-guide/reporting-issues.rst for details.
695
696  [:ref:`details <recheckstablebroken_bisref>`]
697
698Do you follow this guide to verify if a problem is present in the code
699currently supported by Linux kernel developers? Then you are done at this
700point. If you later want to remove the kernel you just built, check out
701:ref:`Complementary tasks: cleanup during and after following this guide <introclosure_bissbs>`.
702
703In case you face a regression, move on and execute at least the next segment
704as well.
705
706.. _introworkingcheck_bissbs:
707
708Segment 2: check if the kernels you build work fine
709---------------------------------------------------
710
711In case of a regression, you now want to ensure the trimmed configuration file
712you created earlier works as expected; a bisection with the .config file
713otherwise would be a waste of time. [:ref:`details <introworkingcheck_bisref>`]
714
715.. _recheckworking_bissbs:
716
717* Build your own variant of the 'working' kernel and check if the feature that
718  regressed works as expected with it.
719
720  Start by checking out the sources for the version earlier established as
721  'good' (once again assumed to be 6.0 here)::
722
723    cd ~/linux/
724    git switch --discard-changes --detach v6.0
725
726  Now use the checked out code to configure, build, and install another kernel
727  using the commands the previous subsection explained in more detail::
728
729    cp ~/kernel-config-working .config
730    make olddefconfig
731    make -j $(nproc --all)
732    # * Check if the free space suffices holding another kernel:
733    df -h /boot/ /lib/modules/
734    sudo make modules_install
735    command -v installkernel && sudo make install
736    make -s kernelrelease | tee -a ~/kernels-built
737    reboot
738
739  When the system booted, you may want to verify once again that the
740  kernel you started is the one you just built::
741
742    tail -n 1 ~/kernels-built
743    uname -r
744
745  Now check if this kernel works as expected; if not, consult the reference
746  section for further instructions.
747
748  [:ref:`details <recheckworking_bisref>`]
749
750.. _introbisect_bissbs:
751
752Segment 3: perform the bisection and validate the result
753--------------------------------------------------------
754
755With all the preparations and precaution builds taken care of, you are now ready
756to begin the bisection. This will make you build quite a few kernels -- usually
757about 15 in case you encountered a regression when updating to a newer series
758(say from 6.0.13 to 6.1.5). But do not worry, due to the trimmed build
759configuration created earlier this works a lot faster than many people assume:
760overall on average it will often just take about 10 to 15 minutes to compile
761each kernel on commodity x86 machines.
762
763.. _bisectstart_bissbs:
764
765* Start the bisection and tell Git about the versions earlier established as
766  'good' (6.0 in the following example command) and 'bad' (6.1.5)::
767
768    cd ~/linux/
769    git bisect start
770    git bisect good v6.0
771    git bisect bad v6.1.5
772
773  [:ref:`details <bisectstart_bisref>`]
774
775.. _bisectbuild_bissbs:
776
777* Now use the code Git checked out to build, install, and boot a kernel using
778  the commands introduced earlier::
779
780    cp ~/kernel-config-working .config
781    make olddefconfig
782    make -j $(nproc --all)
783    # * Check if the free space suffices holding another kernel:
784    df -h /boot/ /lib/modules/
785    sudo make modules_install
786    command -v installkernel && sudo make install
787    make -s kernelrelease | tee -a ~/kernels-built
788    reboot
789
790  If compilation fails for some reason, run ``git bisect skip`` and restart
791  executing the stack of commands from the beginning.
792
793  In case you skipped the 'test latest codebase' step in the guide, check its
794  description as for why the 'df [...]' and 'make -s kernelrelease [...]'
795  commands are here.
796
797  Important note: the latter command from this point on will print release
798  identifiers that might look odd or wrong to you -- which they are not, as it's
799  totally normal to see release identifiers like '6.0-rc1-local-gcafec0cacaca0'
800  if you bisect between versions 6.1 and 6.2 for example.
801
802  [:ref:`details <bisectbuild_bisref>`]
803
804.. _bisecttest_bissbs:
805
806* Now check if the feature that regressed works in the kernel you just built.
807
808  You again might want to start by making sure the kernel you booted is the one
809  you just built::
810
811    cd ~/linux/
812    tail -n 1 ~/kernels-built
813    uname -r
814
815  Now verify if the feature that regressed works at this kernel bisection point.
816  If it does, run this::
817
818    git bisect good
819
820  If it does not, run this::
821
822    git bisect bad
823
824  Be sure about what you tell Git, as getting this wrong just once will send the
825  rest of the bisection totally off course.
826
827  While the bisection is ongoing, Git will use the information you provided to
828  find and check out another bisection point for you to test. While doing so, it
829  will print something like 'Bisecting: 675 revisions left to test after this
830  (roughly 10 steps)' to indicate how many further changes it expects to be
831  tested. Now build and install another kernel using the instructions from the
832  previous step; afterwards follow the instructions in this step again.
833
834  Repeat this again and again until you finish the bisection -- that's the case
835  when Git after tagging a change as 'good' or 'bad' prints something like
836  'cafecaca0c0dacafecaca0c0dacafecaca0c0da is the first bad commit'; right
837  afterwards it will show some details about the culprit including the patch
838  description of the change. The latter might fill your terminal screen, so you
839  might need to scroll up to see the message mentioning the culprit;
840  alternatively, run ``git bisect log > ~/bisection-log``.
841
842  [:ref:`details <bisecttest_bisref>`]
843
844.. _bisectlog_bissbs:
845
846* Store Git's bisection log and the current .config file in a safe place before
847  telling Git to reset the sources to the state before the bisection::
848
849    cd ~/linux/
850    git bisect log > ~/bisection-log
851    cp .config ~/bisection-config-culprit
852    git bisect reset
853
854  [:ref:`details <bisectlog_bisref>`]
855
856.. _revert_bissbs:
857
858* Try reverting the culprit on top of latest mainline to see if this fixes your
859  regression.
860
861  This is optional, as it might be impossible or hard to realize. The former is
862  the case, if the bisection determined a merge commit as the culprit; the
863  latter happens if other changes depend on the culprit. But if the revert
864  succeeds, it is worth building another kernel, as it validates the result of
865  a bisection, which can easily deroute; it furthermore will let kernel
866  developers know, if they can resolve the regression with a quick revert.
867
868  Begin by checking out the latest codebase depending on the range you bisected:
869
870  * Did you face a regression within a stable/longterm series (say between
871    6.0.13 and 6.0.15) that does not happen in mainline? Then check out the
872    latest codebase for the affected series like this::
873
874      git fetch stable
875      git switch --discard-changes --detach linux-6.0.y
876
877  * In all other cases check out latest mainline::
878
879      git fetch mainline
880      git switch --discard-changes --detach mainline/master
881
882    If you bisected a regression within a stable/longterm series that also
883    happens in mainline, there is one more thing to do: look up the mainline
884    commit-id. To do so, use a command like ``git show abcdcafecabcd`` to
885    view the patch description of the culprit. There will be a line near
886    the top which looks like 'commit cafec0cacaca0 upstream.' or
887    'Upstream commit cafec0cacaca0'; use that commit-id in the next command
888    and not the one the bisection blamed.
889
890  Now try reverting the culprit by specifying its commit id::
891
892    git revert --no-edit cafec0cacaca0
893
894  If that fails, give up trying and move on to the next step; if it works,
895  adjust the tag to facilitate the identification and prevent accidentally
896  overwriting another kernel::
897
898    cp ~/kernel-config-working .config
899    ./scripts/config --set-str CONFIG_LOCALVERSION '-local-cafec0cacaca0-reverted'
900
901  Build a kernel using the familiar command sequence, just without copying the
902  the base .config over::
903
904    make olddefconfig &&
905    make -j $(nproc --all)
906    # * Check if the free space suffices holding another kernel:
907    df -h /boot/ /lib/modules/
908    sudo make modules_install
909    command -v installkernel && sudo make install
910    make -s kernelrelease | tee -a ~/kernels-built
911    reboot
912
913  Now check one last time if the feature that made you perform a bisection works
914  with that kernel: if everything went well, it should not show the regression.
915
916  [:ref:`details <revert_bisref>`]
917
918.. _introclosure_bissbs:
919
920Complementary tasks: cleanup during and after the bisection
921-----------------------------------------------------------
922
923During and after following this guide you might want or need to remove some of
924the kernels you installed: the boot menu otherwise will become confusing or
925space might run out.
926
927.. _makeroom_bissbs:
928
929* To remove one of the kernels you installed, look up its 'kernelrelease'
930  identifier. This guide stores them in '~/kernels-built', but the following
931  command will print them as well::
932
933    ls -ltr /lib/modules/*-local*
934
935  You in most situations want to remove the oldest kernels built during the
936  actual bisection (e.g. segment 3 of this guide). The two ones you created
937  beforehand (e.g. to test the latest codebase and the version considered
938  'good') might become handy to verify something later -- thus better keep them
939  around, unless you are really short on storage space.
940
941  To remove the modules of a kernel with the kernelrelease identifier
942  '*6.0-rc1-local-gcafec0cacaca0*', start by removing the directory holding its
943  modules::
944
945    sudo rm -rf /lib/modules/6.0-rc1-local-gcafec0cacaca0
946
947  Afterwards try the following command::
948
949    sudo kernel-install -v remove 6.0-rc1-local-gcafec0cacaca0
950
951  On quite a few distributions this will delete all other kernel files installed
952  while also removing the kernel's entry from the boot menu. But on some
953  distributions kernel-install does not exist or leaves boot-loader entries or
954  kernel image and related files behind; in that case remove them as described
955  in the reference section.
956
957  [:ref:`details <makeroom_bisref>`]
958
959.. _finishingtouch_bissbs:
960
961* Once you have finished the bisection, do not immediately remove anything you
962  set up, as you might need a few things again. What is safe to remove depends
963  on the outcome of the bisection:
964
965  * Could you initially reproduce the regression with the latest codebase and
966    after the bisection were able to fix the problem by reverting the culprit on
967    top of the latest codebase? Then you want to keep those two kernels around
968    for a while, but safely remove all others with a '-local' in the release
969    identifier.
970
971  * Did the bisection end on a merge-commit or seems questionable for other
972    reasons? Then you want to keep as many kernels as possible around for a few
973    days: it's pretty likely that you will be asked to recheck something.
974
975  * In other cases it likely is a good idea to keep the following kernels around
976    for some time: the one built from the latest codebase, the one created from
977    the version considered 'good', and the last three or four you compiled
978    during the actual bisection process.
979
980  [:ref:`details <finishingtouch_bisref>`]
981
982.. _introoptional_bissbs:
983
984Optional: test reverts, patches, or later versions
985--------------------------------------------------
986
987While or after reporting a bug, you might want or potentially will be asked to
988test reverts, debug patches, proposed fixes, or other versions. In that case
989follow these instructions.
990
991* Update your Git clone and check out the latest code.
992
993  * In case you want to test mainline, fetch its latest changes before checking
994    its code out::
995
996      git fetch mainline
997      git switch --discard-changes --detach mainline/master
998
999  * In case you want to test a stable or longterm kernel, first add the branch
1000    holding the series you are interested in (6.2 in the example), unless you
1001    already did so earlier::
1002
1003      git remote set-branches --add stable linux-6.2.y
1004
1005    Then fetch the latest changes and check out the latest version from the
1006    series::
1007
1008      git fetch stable
1009      git switch --discard-changes --detach stable/linux-6.2.y
1010
1011* Copy your kernel build configuration over::
1012
1013    cp ~/kernel-config-working .config
1014
1015* Your next step depends on what you want to do:
1016
1017  * In case you just want to test the latest codebase, head to the next step,
1018    you are already all set.
1019
1020  * In case you want to test if a revert fixes an issue, revert one or multiple
1021    changes by specifying their commit ids::
1022
1023      git revert --no-edit cafec0cacaca0
1024
1025    Now give that kernel a special tag to facilitates its identification and
1026    prevent accidentally overwriting another kernel::
1027
1028      ./scripts/config --set-str CONFIG_LOCALVERSION '-local-cafec0cacaca0-reverted'
1029
1030  * In case you want to test a patch, store the patch in a file like
1031    '/tmp/foobars-proposed-fix-v1.patch' and apply it like this::
1032
1033      git apply /tmp/foobars-proposed-fix-v1.patch
1034
1035    In case of multiple patches, repeat this step with the others.
1036
1037    Now give that kernel a special tag to facilitates its identification and
1038    prevent accidentally overwriting another kernel::
1039
1040    ./scripts/config --set-str CONFIG_LOCALVERSION '-local-foobars-fix-v1'
1041
1042* Build a kernel using the familiar commands, just without copying the kernel
1043  build configuration over, as that has been taken care of already::
1044
1045    make olddefconfig &&
1046    make -j $(nproc --all)
1047    # * Check if the free space suffices holding another kernel:
1048    df -h /boot/ /lib/modules/
1049    sudo make modules_install
1050    command -v installkernel && sudo make install
1051    make -s kernelrelease | tee -a ~/kernels-built
1052    reboot
1053
1054* Now verify you booted the newly built kernel and check it.
1055
1056[:ref:`details <introoptional_bisref>`]
1057
1058.. _submit_improvements:
1059
1060Conclusion
1061----------
1062
1063You have reached the end of the step-by-step guide.
1064
1065Did you run into trouble following any of the above steps not cleared up by the
1066reference section below? Did you spot errors? Or do you have ideas how to
1067improve the guide?
1068
1069If any of that applies, please take a moment and let the maintainer of this
1070document know by email (Thorsten Leemhuis <linux@leemhuis.info>), ideally while
1071CCing the Linux docs mailing list (linux-doc@vger.kernel.org). Such feedback is
1072vital to improve this text further, which is in everybody's interest, as it
1073will enable more people to master the task described here -- and hopefully also
1074improve similar guides inspired by this one.
1075
1076
1077Reference section for the step-by-step guide
1078============================================
1079
1080This section holds additional information for almost all the items in the above
1081step-by-step guide.
1082
1083Preparations for building your own kernels
1084------------------------------------------
1085
1086  *The steps in this section lay the groundwork for all further tests.*
1087  [:ref:`... <introprep_bissbs>`]
1088
1089The steps in all later sections of this guide depend on those described here.
1090
1091[:ref:`back to step-by-step guide <introprep_bissbs>`].
1092
1093.. _backup_bisref:
1094
1095Prepare for emergencies
1096~~~~~~~~~~~~~~~~~~~~~~~
1097
1098  *Create a fresh backup and put system repair and restore tools at hand.*
1099  [:ref:`... <backup_bissbs>`]
1100
1101Remember, you are dealing with computers, which sometimes do unexpected things
1102-- especially if you fiddle with crucial parts like the kernel of an operating
1103system. That's what you are about to do in this process. Hence, better prepare
1104for something going sideways, even if that should not happen.
1105
1106[:ref:`back to step-by-step guide <backup_bissbs>`]
1107
1108.. _vanilla_bisref:
1109
1110Remove anything related to externally maintained kernel modules
1111~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
1112
1113  *Remove all software that depends on externally developed kernel drivers or
1114  builds them automatically.* [:ref:`...<vanilla_bissbs>`]
1115
1116Externally developed kernel modules can easily cause trouble during a bisection.
1117
1118But there is a more important reason why this guide contains this step: most
1119kernel developers will not care about reports about regressions occurring with
1120kernels that utilize such modules. That's because such kernels are not
1121considered 'vanilla' anymore, as Documentation/admin-guide/reporting-issues.rst
1122explains in more detail.
1123
1124[:ref:`back to step-by-step guide <vanilla_bissbs>`]
1125
1126.. _secureboot_bisref:
1127
1128Deal with techniques like Secure Boot
1129~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
1130
1131  *On platforms with 'Secure Boot' or similar techniques, prepare everything to
1132  ensure the system will permit your self-compiled kernel to boot later.*
1133  [:ref:`... <secureboot_bissbs>`]
1134
1135Many modern systems allow only certain operating systems to start; that's why
1136they reject booting self-compiled kernels by default.
1137
1138You ideally deal with this by making your platform trust your self-built kernels
1139with the help of a certificate. How to do that is not described
1140here, as it requires various steps that would take the text too far away from
1141its purpose; 'Documentation/admin-guide/module-signing.rst' and various web
1142sides already explain everything needed in more detail.
1143
1144Temporarily disabling solutions like Secure Boot is another way to make your own
1145Linux boot. On commodity x86 systems it is possible to do this in the BIOS Setup
1146utility; the required steps vary a lot between machines and therefore cannot be
1147described here.
1148
1149On mainstream x86 Linux distributions there is a third and universal option:
1150disable all Secure Boot restrictions for your Linux environment. You can
1151initiate this process by running ``mokutil --disable-validation``; this will
1152tell you to create a one-time password, which is safe to write down. Now
1153restart; right after your BIOS performed all self-tests the bootloader Shim will
1154show a blue box with a message 'Press any key to perform MOK management'. Hit
1155some key before the countdown exposes, which will open a menu. Choose 'Change
1156Secure Boot state'. Shim's 'MokManager' will now ask you to enter three
1157randomly chosen characters from the one-time password specified earlier. Once
1158you provided them, confirm you really want to disable the validation.
1159Afterwards, permit MokManager to reboot the machine.
1160
1161[:ref:`back to step-by-step guide <secureboot_bissbs>`]
1162
1163.. _bootworking_bisref:
1164
1165Boot the last kernel that was working
1166~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
1167
1168  *Boot into the last working kernel and briefly recheck if the feature that
1169  regressed really works.* [:ref:`...<bootworking_bissbs>`]
1170
1171This will make later steps that cover creating and trimming the configuration do
1172the right thing.
1173
1174[:ref:`back to step-by-step guide <bootworking_bissbs>`]
1175
1176.. _diskspace_bisref:
1177
1178Space requirements
1179~~~~~~~~~~~~~~~~~~
1180
1181  *Ensure to have enough free space for building Linux.*
1182  [:ref:`... <diskspace_bissbs>`]
1183
1184The numbers mentioned are rough estimates with a big extra charge to be on the
1185safe side, so often you will need less.
1186
1187If you have space constraints, be sure to hay attention to the :ref:`step about
1188debug symbols' <debugsymbols_bissbs>` and its :ref:`accompanying reference
1189section' <debugsymbols_bisref>`, as disabling then will reduce the consumed disk
1190space by quite a few gigabytes.
1191
1192[:ref:`back to step-by-step guide <diskspace_bissbs>`]
1193
1194.. _rangecheck_bisref:
1195
1196Bisection range
1197~~~~~~~~~~~~~~~
1198
1199  *Determine the kernel versions considered 'good' and 'bad' throughout this
1200  guide.* [:ref:`...<rangecheck_bissbs>`]
1201
1202Establishing the range of commits to be checked is mostly straightforward,
1203except when a regression occurred when switching from a release of one stable
1204series to a release of a later series (e.g. from 6.0.13 to 6.1.5). In that case
1205Git will need some hand holding, as there is no straight line of descent.
1206
1207That's because with the release of 6.0 mainline carried on to 6.1 while the
1208stable series 6.0.y branched to the side. It's therefore theoretically possible
1209that the issue you face with 6.1.5 only worked in 6.0.13, as it was fixed by a
1210commit that went into one of the 6.0.y releases, but never hit mainline or the
12116.1.y series. Thankfully that normally should not happen due to the way the
1212stable/longterm maintainers maintain the code. It's thus pretty safe to assume
12136.0 as a 'good' kernel. That assumption will be tested anyway, as that kernel
1214will be built and tested in the segment '2' of this guide; Git would force you
1215to do this as well, if you tried bisecting between 6.0.13 and 6.1.15.
1216
1217[:ref:`back to step-by-step guide <rangecheck_bissbs>`]
1218
1219.. _buildrequires_bisref:
1220
1221Install build requirements
1222~~~~~~~~~~~~~~~~~~~~~~~~~~
1223
1224  *Install all software required to build a Linux kernel.*
1225  [:ref:`...<buildrequires_bissbs>`]
1226
1227The kernel is pretty stand-alone, but besides tools like the compiler you will
1228sometimes need a few libraries to build one. How to install everything needed
1229depends on your Linux distribution and the configuration of the kernel you are
1230about to build.
1231
1232Here are a few examples what you typically need on some mainstream
1233distributions:
1234
1235* Arch Linux and derivatives::
1236
1237    sudo pacman --needed -S bc binutils bison flex gcc git kmod libelf openssl \
1238      pahole perl zlib ncurses qt6-base
1239
1240* Debian, Ubuntu, and derivatives::
1241
1242    sudo apt install bc binutils bison dwarves flex gcc git kmod libelf-dev \
1243      libssl-dev make openssl pahole perl-base pkg-config zlib1g-dev \
1244      libncurses-dev qt6-base-dev g++
1245
1246* Fedora and derivatives::
1247
1248    sudo dnf install binutils \
1249      /usr/bin/{bc,bison,flex,gcc,git,openssl,make,perl,pahole,rpmbuild} \
1250      /usr/include/{libelf.h,openssl/pkcs7.h,zlib.h,ncurses.h,qt6/QtGui/QAction}
1251
1252* openSUSE and derivatives::
1253
1254    sudo zypper install bc binutils bison dwarves flex gcc git \
1255      kernel-install-tools libelf-devel make modutils openssl openssl-devel \
1256      perl-base zlib-devel rpm-build ncurses-devel qt6-base-devel
1257
1258These commands install a few packages that are often, but not always needed. You
1259for example might want to skip installing the development headers for ncurses,
1260which you will only need in case you later might want to adjust the kernel build
1261configuration using make the targets 'menuconfig' or 'nconfig'; likewise omit
1262the headers of Qt6 if you do not plan to adjust the .config using 'xconfig'.
1263
1264You furthermore might need additional libraries and their development headers
1265for tasks not covered in this guide -- for example when building utilities from
1266the kernel's tools/ directory.
1267
1268[:ref:`back to step-by-step guide <buildrequires_bissbs>`]
1269
1270.. _sources_bisref:
1271
1272Download the sources using Git
1273~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
1274
1275  *Retrieve the Linux mainline sources.*
1276  [:ref:`...<sources_bissbs>`]
1277
1278The step-by-step guide outlines how to download the Linux sources using a full
1279Git clone of Linus' mainline repository. There is nothing more to say about
1280that -- but there are two alternatives ways to retrieve the sources that might
1281work better for you:
1282
1283* If you have an unreliable internet connection, consider
1284  :ref:`using a 'Git bundle'<sources_bundle_bisref>`.
1285
1286* If downloading the complete repository would take too long or requires too
1287  much storage space, consider :ref:`using a 'shallow
1288  clone'<sources_shallow_bisref>`.
1289
1290.. _sources_bundle_bisref:
1291
1292Downloading Linux mainline sources using a bundle
1293"""""""""""""""""""""""""""""""""""""""""""""""""
1294
1295Use the following commands to retrieve the Linux mainline sources using a
1296bundle::
1297
1298    wget -c \
1299      https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/clone.bundle
1300    git clone --no-checkout clone.bundle ~/linux/
1301    cd ~/linux/
1302    git remote remove origin
1303    git remote add mainline \
1304      https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git
1305    git fetch mainline
1306    git remote add -t master stable \
1307      https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux.git
1308
1309In case the 'wget' command fails, just re-execute it, it will pick up where
1310it left off.
1311
1312[:ref:`back to step-by-step guide <sources_bissbs>`]
1313[:ref:`back to section intro <sources_bisref>`]
1314
1315.. _sources_shallow_bisref:
1316
1317Downloading Linux mainline sources using a shallow clone
1318~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
1319
1320First, execute the following command to retrieve the latest mainline codebase::
1321
1322    git clone -o mainline --no-checkout --depth 1 -b master \
1323      https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git ~/linux/
1324    cd ~/linux/
1325    git remote add -t master stable \
1326      https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux.git
1327
1328Now deepen your clone's history to the second predecessor of the mainline
1329release of your 'good' version. In case the latter are 6.0 or 6.0.13, 5.19 would
1330be the first predecessor and 5.18 the second -- hence deepen the history up to
1331that version::
1332
1333    git fetch --shallow-exclude=v5.18 mainline
1334
1335Afterwards add the stable Git repository as remote and all required stable
1336branches as explained in the step-by-step guide.
1337
1338Note, shallow clones have a few peculiar characteristics:
1339
1340* For bisections the history needs to be deepened a few mainline versions
1341  farther than it seems necessary, as explained above already. That's because
1342  Git otherwise will be unable to revert or describe most of the commits within
1343  a range (say 6.1..6.2), as they are internally based on earlier kernels
1344  releases (like 6.0-rc2 or 5.19-rc3).
1345
1346* This document in most places uses ``git fetch`` with ``--shallow-exclude=``
1347  to specify the earliest version you care about (or to be precise: its git
1348  tag). You alternatively can use the parameter ``--shallow-since=`` to specify
1349  an absolute (say ``'2023-07-15'``) or relative (``'12 months'``) date to
1350  define the depth of the history you want to download. When using them while
1351  bisecting mainline, ensure to deepen the history to at least 7 months before
1352  the release of the mainline release your 'good' kernel is based on.
1353
1354* Be warned, when deepening your clone you might encounter an error like
1355  'fatal: error in object: unshallow cafecaca0c0dacafecaca0c0dacafecaca0c0da'.
1356  In that case run ``git repack -d`` and try again.
1357
1358[:ref:`back to step-by-step guide <sources_bissbs>`]
1359[:ref:`back to section intro <sources_bisref>`]
1360
1361.. _oldconfig_bisref:
1362
1363Start defining the build configuration for your kernel
1364~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
1365
1366  *Start preparing a kernel build configuration (the '.config' file).*
1367  [:ref:`... <oldconfig_bissbs>`]
1368
1369*Note, this is the first of multiple steps in this guide that create or modify
1370build artifacts. The commands used in this guide store them right in the source
1371tree to keep things simple. In case you prefer storing the build artifacts
1372separately, create a directory like '~/linux-builddir/' and add the parameter
1373``O=~/linux-builddir/`` to all make calls used throughout this guide. You will
1374have to point other commands there as well -- among them the ``./scripts/config
1375[...]`` commands, which will require ``--file ~/linux-builddir/.config`` to
1376locate the right build configuration.*
1377
1378Two things can easily go wrong when creating a .config file as advised:
1379
1380* The oldconfig target will use a .config file from your build directory, if
1381  one is already present there (e.g. '~/linux/.config'). That's totally fine if
1382  that's what you intend (see next step), but in all other cases you want to
1383  delete it. This for example is important in case you followed this guide
1384  further, but due to problems come back here to redo the configuration from
1385  scratch.
1386
1387* Sometimes olddefconfig is unable to locate the .config file for your running
1388  kernel and will use defaults, as briefly outlined in the guide. In that case
1389  check if your distribution ships the configuration somewhere and manually put
1390  it in the right place (e.g. '~/linux/.config') if it does. On distributions
1391  where /proc/config.gz exists this can be achieved using this command::
1392
1393    zcat /proc/config.gz > .config
1394
1395  Once you put it there, run ``make olddefconfig`` again to adjust it to the
1396  needs of the kernel about to be built.
1397
1398Note, the olddefconfig target will set any undefined build options to their
1399default value. If you prefer to set such configuration options manually, use
1400``make oldconfig`` instead. Then for each undefined configuration option you
1401will be asked how to proceed; in case you are unsure what to answer, simply hit
1402'enter' to apply the default value. Note though that for bisections you normally
1403want to go with the defaults, as you otherwise might enable a new feature that
1404causes a problem looking like regressions (for example due to security
1405restrictions).
1406
1407Occasionally odd things happen when trying to use a config file prepared for one
1408kernel (say 6.1) on an older mainline release -- especially if it is much older
1409(say 5.15). That's one of the reasons why the previous step in the guide told
1410you to boot the kernel where everything works. If you manually add a .config
1411file you thus want to ensure it's from the working kernel and not from a one
1412that shows the regression.
1413
1414In case you want to build kernels for another machine, locate its kernel build
1415configuration; usually ``ls /boot/config-$(uname -r)`` will print its name. Copy
1416that file to the build machine and store it as ~/linux/.config; afterwards run
1417``make olddefconfig`` to adjust it.
1418
1419[:ref:`back to step-by-step guide <oldconfig_bissbs>`]
1420
1421.. _localmodconfig_bisref:
1422
1423Trim the build configuration for your kernel
1424~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
1425
1426  *Disable any kernel modules apparently superfluous for your setup.*
1427  [:ref:`... <localmodconfig_bissbs>`]
1428
1429As explained briefly in the step-by-step guide already: with localmodconfig it
1430can easily happen that your self-built kernels will lack modules for tasks you
1431did not perform at least once before utilizing this make target. That happens
1432when a task requires kernel modules which are only autoloaded when you execute
1433it for the first time. So when you never performed that task since starting your
1434kernel the modules will not have been loaded -- and from localmodonfig's point
1435of view look superfluous, which thus disables them to reduce the amount of code
1436to be compiled.
1437
1438You can try to avoid this by performing typical tasks that often will autoload
1439additional kernel modules: start a VM, establish VPN connections, loop-mount a
1440CD/DVD ISO, mount network shares (CIFS, NFS, ...), and connect all external
1441devices (2FA keys, headsets, webcams, ...) as well as storage devices with file
1442systems you otherwise do not utilize (btrfs, ext4, FAT, NTFS, XFS, ...). But it
1443is hard to think of everything that might be needed -- even kernel developers
1444often forget one thing or another at this point.
1445
1446Do not let that risk bother you, especially when compiling a kernel only for
1447testing purposes: everything typically crucial will be there. And if you forget
1448something important you can turn on a missing feature manually later and quickly
1449run the commands again to compile and install a kernel that has everything you
1450need.
1451
1452But if you plan to build and use self-built kernels regularly, you might want to
1453reduce the risk by recording which modules your system loads over the course of
1454a few weeks. You can automate this with `modprobed-db
1455<https://github.com/graysky2/modprobed-db>`_. Afterwards use ``LSMOD=<path>`` to
1456point localmodconfig to the list of modules modprobed-db noticed being used::
1457
1458  yes '' | make LSMOD='${HOME}'/.config/modprobed.db localmodconfig
1459
1460That parameter also allows you to build trimmed kernels for another machine in
1461case you copied a suitable .config over to use as base (see previous step). Just
1462run ``lsmod > lsmod_foo-machine`` on that system and copy the generated file to
1463your build's host home directory. Then run these commands instead of the one the
1464step-by-step guide mentions::
1465
1466  yes '' | make LSMOD=~/lsmod_foo-machine localmodconfig
1467
1468[:ref:`back to step-by-step guide <localmodconfig_bissbs>`]
1469
1470.. _tagging_bisref:
1471
1472Tag the kernels about to be build
1473~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
1474
1475  *Ensure all the kernels you will build are clearly identifiable using a
1476  special tag and a unique version identifier.* [:ref:`... <tagging_bissbs>`]
1477
1478This allows you to differentiate your distribution's kernels from those created
1479during this process, as the file or directories for the latter will contain
1480'-local' in the name; it also helps picking the right entry in the boot menu and
1481not lose track of you kernels, as their version numbers will look slightly
1482confusing during the bisection.
1483
1484[:ref:`back to step-by-step guide <tagging_bissbs>`]
1485
1486.. _debugsymbols_bisref:
1487
1488Decide to enable or disable debug symbols
1489~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
1490
1491  *Decide how to handle debug symbols.* [:ref:`... <debugsymbols_bissbs>`]
1492
1493Having debug symbols available can be important when your kernel throws a
1494'panic', 'Oops', 'warning', or 'BUG' later when running, as then you will be
1495able to find the exact place where the problem occurred in the code. But
1496collecting and embedding the needed debug information takes time and consumes
1497quite a bit of space: in late 2022 the build artifacts for a typical x86 kernel
1498trimmed with localmodconfig consumed around 5 Gigabyte of space with debug
1499symbols, but less than 1 when they were disabled. The resulting kernel image and
1500modules are bigger as well, which increases storage requirements for /boot/ and
1501load times.
1502
1503In case you want a small kernel and are unlikely to decode a stack trace later,
1504you thus might want to disable debug symbols to avoid those downsides. If it
1505later turns out that you need them, just enable them as shown and rebuild the
1506kernel.
1507
1508You on the other hand definitely want to enable them for this process, if there
1509is a decent chance that you need to decode a stack trace later. The section
1510'Decode failure messages' in Documentation/admin-guide/reporting-issues.rst
1511explains this process in more detail.
1512
1513[:ref:`back to step-by-step guide <debugsymbols_bissbs>`]
1514
1515.. _configmods_bisref:
1516
1517Adjust build configuration
1518~~~~~~~~~~~~~~~~~~~~~~~~~~
1519
1520  *Check if you may want or need to adjust some other kernel configuration
1521  options:*
1522
1523Depending on your needs you at this point might want or have to adjust some
1524kernel configuration options.
1525
1526.. _configmods_distros_bisref:
1527
1528Distro specific adjustments
1529"""""""""""""""""""""""""""
1530
1531  *Are you running* [:ref:`... <configmods_bissbs>`]
1532
1533The following sections help you to avoid build problems that are known to occur
1534when following this guide on a few commodity distributions.
1535
1536**Debian:**
1537
1538* Remove a stale reference to a certificate file that would cause your build to
1539  fail::
1540
1541   ./scripts/config --set-str SYSTEM_TRUSTED_KEYS ''
1542
1543  Alternatively, download the needed certificate and make that configuration
1544  option point to it, as `the Debian handbook explains in more detail
1545  <https://debian-handbook.info/browse/stable/sect.kernel-compilation.html>`_
1546  -- or generate your own, as explained in
1547  Documentation/admin-guide/module-signing.rst.
1548
1549[:ref:`back to step-by-step guide <configmods_bissbs>`]
1550
1551.. _configmods_individual_bisref:
1552
1553Individual adjustments
1554""""""""""""""""""""""
1555
1556  *If you want to influence the other aspects of the configuration, do so
1557  now.* [:ref:`... <configmods_bissbs>`]
1558
1559At this point you can use a command like ``make menuconfig`` or ``make nconfig``
1560to enable or disable certain features using a text-based user interface; to use
1561a graphical configuration utility, run ``make xconfig`` instead. Both of them
1562require development libraries from toolkits they are rely on (ncurses
1563respectively Qt5 or Qt6); an error message will tell you if something required
1564is missing.
1565
1566[:ref:`back to step-by-step guide <configmods_bissbs>`]
1567
1568.. _saveconfig_bisref:
1569
1570Put the .config file aside
1571~~~~~~~~~~~~~~~~~~~~~~~~~~
1572
1573  *Reprocess the .config after the latest changes and store it in a safe place.*
1574  [:ref:`... <saveconfig_bissbs>`]
1575
1576Put the .config you prepared aside, as you want to copy it back to the build
1577directory every time during this guide before you start building another
1578kernel. That's because going back and forth between different versions can alter
1579.config files in odd ways; those occasionally cause side effects that could
1580confuse testing or in some cases render the result of your bisection
1581meaningless.
1582
1583[:ref:`back to step-by-step guide <saveconfig_bissbs>`]
1584
1585.. _introlatestcheck_bisref:
1586
1587Try to reproduce the problem with the latest codebase
1588-----------------------------------------------------
1589
1590  *Verify the regression is not caused by some .config change and check if it
1591  still occurs with the latest codebase.* [:ref:`... <introlatestcheck_bissbs>`]
1592
1593For some readers it might seem unnecessary to check the latest codebase at this
1594point, especially if you did that already with a kernel prepared by your
1595distributor or face a regression within a stable/longterm series. But it's
1596highly recommended for these reasons:
1597
1598* You will run into any problems caused by your setup before you actually begin
1599  a bisection. That will make it a lot easier to differentiate between 'this
1600  most likely is some problem in my setup' and 'this change needs to be skipped
1601  during the bisection, as the kernel sources at that stage contain an unrelated
1602  problem that causes building or booting to fail'.
1603
1604* These steps will rule out if your problem is caused by some change in the
1605  build configuration between the 'working' and the 'broken' kernel. This for
1606  example can happen when your distributor enabled an additional security
1607  feature in the newer kernel which was disabled or not yet supported by the
1608  older kernel. That security feature might get into the way of something you
1609  do -- in which case your problem from the perspective of the Linux kernel
1610  upstream developers is not a regression, as
1611  Documentation/admin-guide/reporting-regressions.rst explains in more detail.
1612  You thus would waste your time if you'd try to bisect this.
1613
1614* If the cause for your regression was already fixed in the latest mainline
1615  codebase, you'd perform the bisection for nothing. This holds true for a
1616  regression you encountered with a stable/longterm release as well, as they are
1617  often caused by problems in mainline changes that were backported -- in which
1618  case the problem will have to be fixed in mainline first. Maybe it already was
1619  fixed there and the fix is already in the process of being backported.
1620
1621* For regressions within a stable/longterm series it's furthermore crucial to
1622  know if the issue is specific to that series or also happens in the mainline
1623  kernel, as the report needs to be sent to different people:
1624
1625  * Regressions specific to a stable/longterm series are the stable team's
1626    responsibility; mainline Linux developers might or might not care.
1627
1628  * Regressions also happening in mainline are something the regular Linux
1629    developers and maintainers have to handle; the stable team does not care
1630    and does not need to be involved in the report, they just should be told
1631    to backport the fix once it's ready.
1632
1633  Your report might be ignored if you send it to the wrong party -- and even
1634  when you get a reply there is a decent chance that developers tell you to
1635  evaluate which of the two cases it is before they take a closer look.
1636
1637[:ref:`back to step-by-step guide <introlatestcheck_bissbs>`]
1638
1639.. _checkoutmaster_bisref:
1640
1641Check out the latest Linux codebase
1642~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
1643
1644  *Check out the latest Linux codebase.*
1645  [:ref:`... <checkoutmaster_bissbs>`]
1646
1647In case you later want to recheck if an ever newer codebase might fix the
1648problem, remember to run that ``git fetch --shallow-exclude [...]`` command
1649again mentioned earlier to update your local Git repository.
1650
1651[:ref:`back to step-by-step guide <checkoutmaster_bissbs>`]
1652
1653.. _build_bisref:
1654
1655Build your kernel
1656~~~~~~~~~~~~~~~~~
1657
1658  *Build the image and the modules of your first kernel using the config file
1659  you prepared.* [:ref:`... <build_bissbs>`]
1660
1661A lot can go wrong at this stage, but the instructions below will help you help
1662yourself. Another subsection explains how to directly package your kernel up as
1663deb, rpm or tar file.
1664
1665Dealing with build errors
1666"""""""""""""""""""""""""
1667
1668When a build error occurs, it might be caused by some aspect of your machine's
1669setup that often can be fixed quickly; other times though the problem lies in
1670the code and can only be fixed by a developer. A close examination of the
1671failure messages coupled with some research on the internet will often tell you
1672which of the two it is. To perform such investigation, restart the build
1673process like this::
1674
1675  make V=1
1676
1677The ``V=1`` activates verbose output, which might be needed to see the actual
1678error. To make it easier to spot, this command also omits the ``-j $(nproc
1679--all)`` used earlier to utilize every CPU core in the system for the job -- but
1680this parallelism also results in some clutter when failures occur.
1681
1682After a few seconds the build process should run into the error again. Now try
1683to find the most crucial line describing the problem. Then search the internet
1684for the most important and non-generic section of that line (say 4 to 8 words);
1685avoid or remove anything that looks remotely system-specific, like your username
1686or local path names like ``/home/username/linux/``. First try your regular
1687internet search engine with that string, afterwards search Linux kernel mailing
1688lists via `lore.kernel.org/all/ <https://lore.kernel.org/all/>`_.
1689
1690This most of the time will find something that will explain what is wrong; quite
1691often one of the hits will provide a solution for your problem, too. If you
1692do not find anything that matches your problem, try again from a different angle
1693by modifying your search terms or using another line from the error messages.
1694
1695In the end, most issues you run into have likely been encountered and
1696reported by others already. That includes issues where the cause is not your
1697system, but lies in the code. If you run into one of those, you might thus find
1698a solution (e.g. a patch) or workaround for your issue, too.
1699
1700Package your kernel up
1701""""""""""""""""""""""
1702
1703The step-by-step guide uses the default make targets (e.g. 'bzImage' and
1704'modules' on x86) to build the image and the modules of your kernel, which later
1705steps of the guide then install. You instead can also directly build everything
1706and directly package it up by using one of the following targets:
1707
1708* ``make -j $(nproc --all) bindeb-pkg`` to generate a deb package
1709
1710* ``make -j $(nproc --all) binrpm-pkg`` to generate a rpm package
1711
1712* ``make -j $(nproc --all) tarbz2-pkg`` to generate a bz2 compressed tarball
1713
1714This is just a selection of available make targets for this purpose, see
1715``make help`` for others. You can also use these targets after running
1716``make -j $(nproc --all)``, as they will pick up everything already built.
1717
1718If you employ the targets to generate deb or rpm packages, ignore the
1719step-by-step guide's instructions on installing and removing your kernel;
1720instead install and remove the packages using the package utility for the format
1721(e.g. dpkg and rpm) or a package management utility build on top of them (apt,
1722aptitude, dnf/yum, zypper, ...). Be aware that the packages generated using
1723these two make targets are designed to work on various distributions utilizing
1724those formats, they thus will sometimes behave differently than your
1725distribution's kernel packages.
1726
1727[:ref:`back to step-by-step guide <build_bissbs>`]
1728
1729.. _install_bisref:
1730
1731Put the kernel in place
1732~~~~~~~~~~~~~~~~~~~~~~~
1733
1734  *Install the kernel you just built.* [:ref:`... <install_bissbs>`]
1735
1736What you need to do after executing the command in the step-by-step guide
1737depends on the existence and the implementation of ``/sbin/installkernel``
1738executable on your distribution.
1739
1740If installkernel is found, the kernel's build system will delegate the actual
1741installation of your kernel image to this executable, which then performs some
1742or all of these tasks:
1743
1744* On almost all Linux distributions installkernel will store your kernel's
1745  image in /boot/, usually as '/boot/vmlinuz-<kernelrelease_id>'; often it will
1746  put a 'System.map-<kernelrelease_id>' alongside it.
1747
1748* On most distributions installkernel will then generate an 'initramfs'
1749  (sometimes also called 'initrd'), which usually are stored as
1750  '/boot/initramfs-<kernelrelease_id>.img' or
1751  '/boot/initrd-<kernelrelease_id>'. Commodity distributions rely on this file
1752  for booting, hence ensure to execute the make target 'modules_install' first,
1753  as your distribution's initramfs generator otherwise will be unable to find
1754  the modules that go into the image.
1755
1756* On some distributions installkernel will then add an entry for your kernel
1757  to your bootloader's configuration.
1758
1759You have to take care of some or all of the tasks yourself, if your
1760distribution lacks a installkernel script or does only handle part of them.
1761Consult the distribution's documentation for details. If in doubt, install the
1762kernel manually::
1763
1764   sudo install -m 0600 $(make -s image_name) /boot/vmlinuz-$(make -s kernelrelease)
1765   sudo install -m 0600 System.map /boot/System.map-$(make -s kernelrelease)
1766
1767Now generate your initramfs using the tools your distribution provides for this
1768process. Afterwards add your kernel to your bootloader configuration and reboot.
1769
1770[:ref:`back to step-by-step guide <install_bissbs>`]
1771
1772.. _storagespace_bisref:
1773
1774Storage requirements per kernel
1775~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
1776
1777  *Check how much storage space the kernel, its modules, and other related files
1778  like the initramfs consume.* [:ref:`... <storagespace_bissbs>`]
1779
1780The kernels built during a bisection consume quite a bit of space in /boot/ and
1781/lib/modules/, especially if you enabled debug symbols. That makes it easy to
1782fill up volumes during a bisection -- and due to that even kernels which used to
1783work earlier might fail to boot. To prevent that you will need to know how much
1784space each installed kernel typically requires.
1785
1786Note, most of the time the pattern '/boot/*$(make -s kernelrelease)*' used in
1787the guide will match all files needed to boot your kernel -- but neither the
1788path nor the naming scheme are mandatory. On some distributions you thus will
1789need to look in different places.
1790
1791[:ref:`back to step-by-step guide <storagespace_bissbs>`]
1792
1793.. _tainted_bisref:
1794
1795Check if your newly built kernel considers itself 'tainted'
1796~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
1797
1798  *Check if the kernel marked itself as 'tainted'.*
1799  [:ref:`... <tainted_bissbs>`]
1800
1801Linux marks itself as tainted when something happens that potentially leads to
1802follow-up errors that look totally unrelated. That is why developers might
1803ignore or react scantly to reports from tainted kernels -- unless of course the
1804kernel set the flag right when the reported bug occurred.
1805
1806That's why you want check why a kernel is tainted as explained in
1807Documentation/admin-guide/tainted-kernels.rst; doing so is also in your own
1808interest, as your testing might be flawed otherwise.
1809
1810[:ref:`back to step-by-step guide <tainted_bissbs>`]
1811
1812.. _recheckbroken_bisref:
1813
1814Check the kernel built from a recent mainline codebase
1815~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
1816
1817  *Verify if your bug occurs with the newly built kernel.*
1818  [:ref:`... <recheckbroken_bissbs>`]
1819
1820There are a couple of reasons why your bug or regression might not show up with
1821the kernel you built from the latest codebase. These are the most frequent:
1822
1823* The bug was fixed meanwhile.
1824
1825* What you suspected to be a regression was caused by a change in the build
1826  configuration the provider of your kernel carried out.
1827
1828* Your problem might be a race condition that does not show up with your kernel;
1829  the trimmed build configuration, a different setting for debug symbols, the
1830  compiler used, and various other things can cause this.
1831
1832* In case you encountered the regression with a stable/longterm kernel it might
1833  be a problem that is specific to that series; the next step in this guide will
1834  check this.
1835
1836[:ref:`back to step-by-step guide <recheckbroken_bissbs>`]
1837
1838.. _recheckstablebroken_bisref:
1839
1840Check the kernel built from the latest stable/longterm codebase
1841~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
1842
1843  *Are you facing a regression within a stable/longterm release, but failed to
1844  reproduce it with the kernel you just built using the latest mainline sources?
1845  Then check if the latest codebase for the particular series might already fix
1846  the problem.* [:ref:`... <recheckstablebroken_bissbs>`]
1847
1848If this kernel does not show the regression either, there most likely is no need
1849for a bisection.
1850
1851[:ref:`back to step-by-step guide <recheckstablebroken_bissbs>`]
1852
1853.. _introworkingcheck_bisref:
1854
1855Ensure the 'good' version is really working well
1856------------------------------------------------
1857
1858  *Check if the kernels you build work fine.*
1859  [:ref:`... <introworkingcheck_bissbs>`]
1860
1861This section will reestablish a known working base. Skipping it might be
1862appealing, but is usually a bad idea, as it does something important:
1863
1864It will ensure the .config file you prepared earlier actually works as expected.
1865That is in your own interest, as trimming the configuration is not foolproof --
1866and you might be building and testing ten or more kernels for nothing before
1867starting to suspect something might be wrong with the build configuration.
1868
1869That alone is reason enough to spend the time on this, but not the only reason.
1870
1871Many readers of this guide normally run kernels that are patched, use add-on
1872modules, or both. Those kernels thus are not considered 'vanilla' -- therefore
1873it's possible that the thing that regressed might never have worked in vanilla
1874builds of the 'good' version in the first place.
1875
1876There is a third reason for those that noticed a regression between
1877stable/longterm kernels of different series (e.g. 6.0.13..6.1.5): it will
1878ensure the kernel version you assumed to be 'good' earlier in the process (e.g.
18796.0) actually is working.
1880
1881[:ref:`back to step-by-step guide <introworkingcheck_bissbs>`]
1882
1883.. _recheckworking_bisref:
1884
1885Build your own version of the 'good' kernel
1886~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
1887
1888  *Build your own variant of the working kernel and check if the feature that
1889  regressed works as expected with it.* [:ref:`... <recheckworking_bissbs>`]
1890
1891In case the feature that broke with newer kernels does not work with your first
1892self-built kernel, find and resolve the cause before moving on. There are a
1893multitude of reasons why this might happen. Some ideas where to look:
1894
1895* Check the taint status and the output of ``dmesg``, maybe something unrelated
1896  went wrong.
1897
1898* Maybe localmodconfig did something odd and disabled the module required to
1899  test the feature? Then you might want to recreate a .config file based on the
1900  one from the last working kernel and skip trimming it down; manually disabling
1901  some features in the .config might work as well to reduce the build time.
1902
1903* Maybe it's not a kernel regression and something that is caused by some fluke,
1904  a broken initramfs (also known as initrd), new firmware files, or an updated
1905  userland software?
1906
1907* Maybe it was a feature added to your distributor's kernel which vanilla Linux
1908  at that point never supported?
1909
1910Note, if you found and fixed problems with the .config file, you want to use it
1911to build another kernel from the latest codebase, as your earlier tests with
1912mainline and the latest version from an affected stable/longterm series were
1913most likely flawed.
1914
1915[:ref:`back to step-by-step guide <recheckworking_bissbs>`]
1916
1917Perform a bisection and validate the result
1918-------------------------------------------
1919
1920  *With all the preparations and precaution builds taken care of, you are now
1921  ready to begin the bisection.* [:ref:`... <introbisect_bissbs>`]
1922
1923The steps in this segment perform and validate the bisection.
1924
1925[:ref:`back to step-by-step guide <introbisect_bissbs>`].
1926
1927.. _bisectstart_bisref:
1928
1929Start the bisection
1930~~~~~~~~~~~~~~~~~~~
1931
1932  *Start the bisection and tell Git about the versions earlier established as
1933  'good' and 'bad'.* [:ref:`... <bisectstart_bissbs>`]
1934
1935This will start the bisection process; the last of the commands will make Git
1936check out a commit round about half-way between the 'good' and the 'bad' changes
1937for you to test.
1938
1939[:ref:`back to step-by-step guide <bisectstart_bissbs>`]
1940
1941.. _bisectbuild_bisref:
1942
1943Build a kernel from the bisection point
1944~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
1945
1946  *Build, install, and boot a kernel from the code Git checked out using the
1947  same commands you used earlier.* [:ref:`... <bisectbuild_bissbs>`]
1948
1949There are two things worth of note here:
1950
1951* Occasionally building the kernel will fail or it might not boot due some
1952  problem in the code at the bisection point. In that case run this command::
1953
1954    git bisect skip
1955
1956  Git will then check out another commit nearby which with a bit of luck should
1957  work better. Afterwards restart executing this step.
1958
1959* Those slightly odd looking version identifiers can happen during bisections,
1960  because the Linux kernel subsystems prepare their changes for a new mainline
1961  release (say 6.2) before its predecessor (e.g. 6.1) is finished. They thus
1962  base them on a somewhat earlier point like 6.1-rc1 or even 6.0 -- and then
1963  get merged for 6.2 without rebasing nor squashing them once 6.1 is out. This
1964  leads to those slightly odd looking version identifiers coming up during
1965  bisections.
1966
1967[:ref:`back to step-by-step guide <bisectbuild_bissbs>`]
1968
1969.. _bisecttest_bisref:
1970
1971Bisection checkpoint
1972~~~~~~~~~~~~~~~~~~~~
1973
1974  *Check if the feature that regressed works in the kernel you just built.*
1975  [:ref:`... <bisecttest_bissbs>`]
1976
1977Ensure what you tell Git is accurate: getting it wrong just one time will bring
1978the rest of the bisection totally off course, hence all testing after that point
1979will be for nothing.
1980
1981[:ref:`back to step-by-step guide <bisecttest_bissbs>`]
1982
1983.. _bisectlog_bisref:
1984
1985Put the bisection log away
1986~~~~~~~~~~~~~~~~~~~~~~~~~~
1987
1988  *Store Git's bisection log and the current .config file in a safe place.*
1989  [:ref:`... <bisectlog_bissbs>`]
1990
1991As indicated above: declaring just one kernel wrongly as 'good' or 'bad' will
1992render the end result of a bisection useless. In that case you'd normally have
1993to restart the bisection from scratch. The log can prevent that, as it might
1994allow someone to point out where a bisection likely went sideways -- and then
1995instead of testing ten or more kernels you might only have to build a few to
1996resolve things.
1997
1998The .config file is put aside, as there is a decent chance that developers might
1999ask for it after you report the regression.
2000
2001[:ref:`back to step-by-step guide <bisectlog_bissbs>`]
2002
2003.. _revert_bisref:
2004
2005Try reverting the culprit
2006~~~~~~~~~~~~~~~~~~~~~~~~~
2007
2008  *Try reverting the culprit on top of the latest codebase to see if this fixes
2009  your regression.* [:ref:`... <revert_bissbs>`]
2010
2011This is an optional step, but whenever possible one you should try: there is a
2012decent chance that developers will ask you to perform this step when you bring
2013the bisection result up. So give it a try, you are in the flow already, building
2014one more kernel shouldn't be a big deal at this point.
2015
2016The step-by-step guide covers everything relevant already except one slightly
2017rare thing: did you bisected a regression that also happened with mainline using
2018a stable/longterm series, but Git failed to revert the commit in mainline? Then
2019try to revert the culprit in the affected stable/longterm series -- and if that
2020succeeds, test that kernel version instead.
2021
2022[:ref:`back to step-by-step guide <revert_bissbs>`]
2023
2024Cleanup steps during and after following this guide
2025---------------------------------------------------
2026
2027  *During and after following this guide you might want or need to remove some
2028  of the kernels you installed.* [:ref:`... <introclosure_bissbs>`]
2029
2030The steps in this section describe clean-up procedures.
2031
2032[:ref:`back to step-by-step guide <introclosure_bissbs>`].
2033
2034.. _makeroom_bisref:
2035
2036Cleaning up during the bisection
2037~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
2038
2039  *To remove one of the kernels you installed, look up its 'kernelrelease'
2040  identifier.* [:ref:`... <makeroom_bissbs>`]
2041
2042The kernels you install during this process are easy to remove later, as its
2043parts are only stored in two places and clearly identifiable. You thus do not
2044need to worry to mess up your machine when you install a kernel manually (and
2045thus bypass your distribution's packaging system): all parts of your kernels are
2046relatively easy to remove later.
2047
2048One of the two places is a directory in /lib/modules/, which holds the modules
2049for each installed kernel. This directory is named after the kernel's release
2050identifier; hence, to remove all modules for one of the kernels you built,
2051simply remove its modules directory in /lib/modules/.
2052
2053The other place is /boot/, where typically two up to five files will be placed
2054during installation of a kernel. All of them usually contain the release name in
2055their file name, but how many files and their exact names depend somewhat on
2056your distribution's installkernel executable and its initramfs generator. On
2057some distributions the ``kernel-install remove...`` command mentioned in the
2058step-by-step guide will delete all of these files for you while also removing
2059the menu entry for the kernel from your bootloader configuration. On others you
2060have to take care of these two tasks yourself. The following command should
2061interactively remove the three main files of a kernel with the release name
2062'6.0-rc1-local-gcafec0cacaca0'::
2063
2064  rm -i /boot/{System.map,vmlinuz,initr}-6.0-rc1-local-gcafec0cacaca0
2065
2066Afterwards check for other files in /boot/ that have
2067'6.0-rc1-local-gcafec0cacaca0' in their name and consider deleting them as well.
2068Now remove the boot entry for the kernel from your bootloader's configuration;
2069the steps to do that vary quite a bit between Linux distributions.
2070
2071Note, be careful with wildcards like '*' when deleting files or directories
2072for kernels manually: you might accidentally remove files of a 6.0.13 kernel
2073when all you want is to remove 6.0 or 6.0.1.
2074
2075[:ref:`back to step-by-step guide <makeroom_bissbs>`]
2076
2077Cleaning up after the bisection
2078~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
2079
2080.. _finishingtouch_bisref:
2081
2082  *Once you have finished the bisection, do not immediately remove anything
2083  you set up, as you might need a few things again.*
2084  [:ref:`... <finishingtouch_bissbs>`]
2085
2086When you are really short of storage space removing the kernels as described in
2087the step-by-step guide might not free as much space as you would like. In that
2088case consider running ``rm -rf ~/linux/*`` as well now. This will remove the
2089build artifacts and the Linux sources, but will leave the Git repository
2090(~/linux/.git/) behind -- a simple ``git reset --hard`` thus will bring the
2091sources back.
2092
2093Removing the repository as well would likely be unwise at this point: there
2094is a decent chance developers will ask you to build another kernel to
2095perform additional tests -- like testing a debug patch or a proposed fix.
2096Details on how to perform those can be found in the section :ref:`Optional
2097tasks: test reverts, patches, or later versions <introoptional_bissbs>`.
2098
2099Additional tests are also the reason why you want to keep the
2100~/kernel-config-working file around for a few weeks.
2101
2102[:ref:`back to step-by-step guide <finishingtouch_bissbs>`]
2103
2104.. _introoptional_bisref:
2105
2106Test reverts, patches, or later versions
2107----------------------------------------
2108
2109  *While or after reporting a bug, you might want or potentially will be asked
2110  to test reverts, patches, proposed fixes, or other versions.*
2111  [:ref:`... <introoptional_bissbs>`]
2112
2113All the commands used in this section should be pretty straight forward, so
2114there is not much to add except one thing: when setting a kernel tag as
2115instructed, ensure it is not much longer than the one used in the example, as
2116problems will arise if the kernelrelease identifier exceeds 63 characters.
2117
2118[:ref:`back to step-by-step guide <introoptional_bissbs>`].
2119
2120
2121Additional information
2122======================
2123
2124.. _buildhost_bis:
2125
2126Build kernels on a different machine
2127------------------------------------
2128
2129To compile kernels on another system, slightly alter the step-by-step guide's
2130instructions:
2131
2132* Start following the guide on the machine where you want to install and test
2133  the kernels later.
2134
2135* After executing ':ref:`Boot into the working kernel and briefly use the
2136  apparently broken feature <bootworking_bissbs>`', save the list of loaded
2137  modules to a file using ``lsmod > ~/test-machine-lsmod``. Then locate the
2138  build configuration for the running kernel (see ':ref:`Start defining the
2139  build configuration for your kernel <oldconfig_bisref>`' for hints on where
2140  to find it) and store it as '~/test-machine-config-working'. Transfer both
2141  files to the home directory of your build host.
2142
2143* Continue the guide on the build host (e.g. with ':ref:`Ensure to have enough
2144  free space for building [...] <diskspace_bissbs>`').
2145
2146* When you reach ':ref:`Start preparing a kernel build configuration[...]
2147  <oldconfig_bissbs>`': before running ``make olddefconfig`` for the first time,
2148  execute the following command to base your configuration on the one from the
2149  test machine's 'working' kernel::
2150
2151    cp ~/test-machine-config-working ~/linux/.config
2152
2153* During the next step to ':ref:`disable any apparently superfluous kernel
2154  modules <localmodconfig_bissbs>`' use the following command instead::
2155
2156    yes '' | make localmodconfig LSMOD=~/lsmod_foo-machine localmodconfig
2157
2158* Continue the guide, but ignore the instructions outlining how to compile,
2159  install, and reboot into a kernel every time they come up. Instead build
2160  like this::
2161
2162    cp ~/kernel-config-working .config
2163    make olddefconfig &&
2164    make -j $(nproc --all) targz-pkg
2165
2166  This will generate a gzipped tar file whose name is printed in the last
2167  line shown; for example, a kernel with the kernelrelease identifier
2168  '6.0.0-rc1-local-g928a87efa423' built for x86 machines usually will
2169  be stored as '~/linux/linux-6.0.0-rc1-local-g928a87efa423-x86.tar.gz'.
2170
2171  Copy that file to your test machine's home directory.
2172
2173* Switch to the test machine to check if you have enough space to hold another
2174  kernel. Then extract the file you transferred::
2175
2176    sudo tar -xvzf ~/linux-6.0.0-rc1-local-g928a87efa423-x86.tar.gz -C /
2177
2178  Afterwards :ref:`generate the initramfs and add the kernel to your boot
2179  loader's configuration <install_bisref>`; on some distributions the following
2180  command will take care of both these tasks::
2181
2182    sudo /sbin/installkernel 6.0.0-rc1-local-g928a87efa423 /boot/vmlinuz-6.0.0-rc1-local-g928a87efa423
2183
2184  Now reboot and ensure you started the intended kernel.
2185
2186This approach even works when building for another architecture: just install
2187cross-compilers and add the appropriate parameters to every invocation of make
2188(e.g. ``make ARCH=arm64 CROSS_COMPILE=aarch64-linux-gnu- [...]``).
2189
2190Additional reading material
2191---------------------------
2192
2193* The `man page for 'git bisect' <https://git-scm.com/docs/git-bisect>`_ and
2194  `fighting regressions with 'git bisect' <https://git-scm.com/docs/git-bisect-lk2009.html>`_
2195  in the Git documentation.
2196* `Working with git bisect <https://nathanchance.dev/posts/working-with-git-bisect/>`_
2197  from kernel developer Nathan Chancellor.
2198* `Using Git bisect to figure out when brokenness was introduced <http://webchick.net/node/99>`_.
2199* `Fully automated bisecting with 'git bisect run' <https://lwn.net/Articles/317154>`_.
2200
2201..
2202   end-of-content
2203..
2204   This document is maintained by Thorsten Leemhuis <linux@leemhuis.info>. If
2205   you spot a typo or small mistake, feel free to let him know directly and
2206   he'll fix it. You are free to do the same in a mostly informal way if you
2207   want to contribute changes to the text -- but for copyright reasons please CC
2208   linux-doc@vger.kernel.org and 'sign-off' your contribution as
2209   Documentation/process/submitting-patches.rst explains in the section 'Sign
2210   your work - the Developer's Certificate of Origin'.
2211..
2212   This text is available under GPL-2.0+ or CC-BY-4.0, as stated at the top
2213   of the file. If you want to distribute this text under CC-BY-4.0 only,
2214   please use 'The Linux kernel development community' for author attribution
2215   and link this as source:
2216   https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/plain/Documentation/admin-guide/verify-bugs-and-bisect-regressions.rst
2217
2218..
2219   Note: Only the content of this RST file as found in the Linux kernel sources
2220   is available under CC-BY-4.0, as versions of this text that were processed
2221   (for example by the kernel's build system) might contain content taken from
2222   files which use a more restrictive license.
2223