156 lines
7.9 KiB
ReStructuredText
156 lines
7.9 KiB
ReStructuredText
.. SPDX-License-Identifier: GPL-2.0
|
|
|
|
=================
|
|
x86 Feature Flags
|
|
=================
|
|
|
|
Introduction
|
|
============
|
|
|
|
On x86, flags appearing in /proc/cpuinfo have an X86_FEATURE definition
|
|
in arch/x86/include/asm/cpufeatures.h. If the kernel cares about a feature
|
|
or KVM want to expose the feature to a KVM guest, it can and should have
|
|
an X86_FEATURE_* defined. These flags represent hardware features as
|
|
well as software features.
|
|
|
|
If users want to know if a feature is available on a given system, they
|
|
try to find the flag in /proc/cpuinfo. If a given flag is present, it
|
|
means that the kernel supports it and is currently making it available.
|
|
If such flag represents a hardware feature, it also means that the
|
|
hardware supports it.
|
|
|
|
If the expected flag does not appear in /proc/cpuinfo, things are murkier.
|
|
Users need to find out the reason why the flag is missing and find the way
|
|
how to enable it, which is not always easy. There are several factors that
|
|
can explain missing flags: the expected feature failed to enable, the feature
|
|
is missing in hardware, platform firmware did not enable it, the feature is
|
|
disabled at build or run time, an old kernel is in use, or the kernel does
|
|
not support the feature and thus has not enabled it. In general, /proc/cpuinfo
|
|
shows features which the kernel supports. For a full list of CPUID flags
|
|
which the CPU supports, use tools/arch/x86/kcpuid.
|
|
|
|
How are feature flags created?
|
|
==============================
|
|
|
|
a: Feature flags can be derived from the contents of CPUID leaves.
|
|
------------------------------------------------------------------
|
|
These feature definitions are organized mirroring the layout of CPUID
|
|
leaves and grouped in words with offsets as mapped in enum cpuid_leafs
|
|
in cpufeatures.h (see arch/x86/include/asm/cpufeatures.h for details).
|
|
If a feature is defined with a X86_FEATURE_<name> definition in
|
|
cpufeatures.h, and if it is detected at run time, the flags will be
|
|
displayed accordingly in /proc/cpuinfo. For example, the flag "avx2"
|
|
comes from X86_FEATURE_AVX2 in cpufeatures.h.
|
|
|
|
b: Flags can be from scattered CPUID-based features.
|
|
----------------------------------------------------
|
|
Hardware features enumerated in sparsely populated CPUID leaves get
|
|
software-defined values. Still, CPUID needs to be queried to determine
|
|
if a given feature is present. This is done in init_scattered_cpuid_features().
|
|
For instance, X86_FEATURE_CQM_LLC is defined as 11*32 + 0 and its presence is
|
|
checked at runtime in the respective CPUID leaf [EAX=f, ECX=0] bit EDX[1].
|
|
|
|
The intent of scattering CPUID leaves is to not bloat struct
|
|
cpuinfo_x86.x86_capability[] unnecessarily. For instance, the CPUID leaf
|
|
[EAX=7, ECX=0] has 30 features and is dense, but the CPUID leaf [EAX=7, EAX=1]
|
|
has only one feature and would waste 31 bits of space in the x86_capability[]
|
|
array. Since there is a struct cpuinfo_x86 for each possible CPU, the wasted
|
|
memory is not trivial.
|
|
|
|
c: Flags can be created synthetically under certain conditions for hardware features.
|
|
-------------------------------------------------------------------------------------
|
|
Examples of conditions include whether certain features are present in
|
|
MSR_IA32_CORE_CAPS or specific CPU models are identified. If the needed
|
|
conditions are met, the features are enabled by the set_cpu_cap or
|
|
setup_force_cpu_cap macros. For example, if bit 5 is set in MSR_IA32_CORE_CAPS,
|
|
the feature X86_FEATURE_SPLIT_LOCK_DETECT will be enabled and
|
|
"split_lock_detect" will be displayed. The flag "ring3mwait" will be
|
|
displayed only when running on INTEL_FAM6_XEON_PHI_[KNL|KNM] processors.
|
|
|
|
d: Flags can represent purely software features.
|
|
------------------------------------------------
|
|
These flags do not represent hardware features. Instead, they represent a
|
|
software feature implemented in the kernel. For example, Kernel Page Table
|
|
Isolation is purely software feature and its feature flag X86_FEATURE_PTI is
|
|
also defined in cpufeatures.h.
|
|
|
|
Naming of Flags
|
|
===============
|
|
|
|
The script arch/x86/kernel/cpu/mkcapflags.sh processes the
|
|
#define X86_FEATURE_<name> from cpufeatures.h and generates the
|
|
x86_cap/bug_flags[] arrays in kernel/cpu/capflags.c. The names in the
|
|
resulting x86_cap/bug_flags[] are used to populate /proc/cpuinfo. The naming
|
|
of flags in the x86_cap/bug_flags[] are as follows:
|
|
|
|
a: The name of the flag is from the string in X86_FEATURE_<name> by default.
|
|
----------------------------------------------------------------------------
|
|
By default, the flag <name> in /proc/cpuinfo is extracted from the respective
|
|
X86_FEATURE_<name> in cpufeatures.h. For example, the flag "avx2" is from
|
|
X86_FEATURE_AVX2.
|
|
|
|
b: The naming can be overridden.
|
|
--------------------------------
|
|
If the comment on the line for the #define X86_FEATURE_* starts with a
|
|
double-quote character (""), the string inside the double-quote characters
|
|
will be the name of the flags. For example, the flag "sse4_1" comes from
|
|
the comment "sse4_1" following the X86_FEATURE_XMM4_1 definition.
|
|
|
|
There are situations in which overriding the displayed name of the flag is
|
|
needed. For instance, /proc/cpuinfo is a userspace interface and must remain
|
|
constant. If, for some reason, the naming of X86_FEATURE_<name> changes, one
|
|
shall override the new naming with the name already used in /proc/cpuinfo.
|
|
|
|
c: The naming override can be "", which means it will not appear in /proc/cpuinfo.
|
|
----------------------------------------------------------------------------------
|
|
The feature shall be omitted from /proc/cpuinfo if it does not make sense for
|
|
the feature to be exposed to userspace. For example, X86_FEATURE_ALWAYS is
|
|
defined in cpufeatures.h but that flag is an internal kernel feature used
|
|
in the alternative runtime patching functionality. So, its name is overridden
|
|
with "". Its flag will not appear in /proc/cpuinfo.
|
|
|
|
Flags are missing when one or more of these happen
|
|
==================================================
|
|
|
|
a: The hardware does not enumerate support for it.
|
|
--------------------------------------------------
|
|
For example, when a new kernel is running on old hardware or the feature is
|
|
not enabled by boot firmware. Even if the hardware is new, there might be a
|
|
problem enabling the feature at run time, the flag will not be displayed.
|
|
|
|
b: The kernel does not know about the flag.
|
|
-------------------------------------------
|
|
For example, when an old kernel is running on new hardware.
|
|
|
|
c: The kernel disabled support for it at compile-time.
|
|
------------------------------------------------------
|
|
For example, if 5-level-paging is not enabled when building (i.e.,
|
|
CONFIG_X86_5LEVEL is not selected) the flag "la57" will not show up [#f1]_.
|
|
Even though the feature will still be detected via CPUID, the kernel disables
|
|
it by clearing via setup_clear_cpu_cap(X86_FEATURE_LA57).
|
|
|
|
d: The feature is disabled at boot-time.
|
|
----------------------------------------
|
|
A feature can be disabled either using a command-line parameter or because
|
|
it failed to be enabled. The command-line parameter clearcpuid= can be used
|
|
to disable features using the feature number as defined in
|
|
/arch/x86/include/asm/cpufeatures.h. For instance, User Mode Instruction
|
|
Protection can be disabled using clearcpuid=514. The number 514 is calculated
|
|
from #define X86_FEATURE_UMIP (16*32 + 2).
|
|
|
|
In addition, there exists a variety of custom command-line parameters that
|
|
disable specific features. The list of parameters includes, but is not limited
|
|
to, nofsgsbase, nosmap, and nosmep. 5-level paging can also be disabled using
|
|
"no5lvl". SMAP and SMEP are disabled with the aforementioned parameters,
|
|
respectively.
|
|
|
|
e: The feature was known to be non-functional.
|
|
----------------------------------------------
|
|
The feature was known to be non-functional because a dependency was
|
|
missing at runtime. For example, AVX flags will not show up if XSAVE feature
|
|
is disabled since they depend on XSAVE feature. Another example would be broken
|
|
CPUs and them missing microcode patches. Due to that, the kernel decides not to
|
|
enable a feature.
|
|
|
|
.. [#f1] 5-level paging uses linear address of 57 bits.
|