System Architecture Overview

Modes of operation

IA-32共有以下四种操作模式：

实模式
保护模式
虚拟8086模式

该模式允许在保护模式下执行8086程序
系统管理模式（system management mode, SMM)

为操作系统或执行环境提供了电源管理的透明机制，通过激活外部系统中断引脚（SMI#）触发系统管理中断（system managemtn interrupt, SMI）来进入SMM模式。在SMM模式中，处理器切换到分离的地址空间，透明执行指定的代码，然后执行RSM指令返回到之前的模式。

其中实模式和保护模式是我们关注的重点。所有的Intel 64和32位处理器在加电后都会进入实模式，之后软件切换到保护模式下运行。

系统总览图如下：

EFLAGS

EFLAGS寄存器控制着IO、可屏蔽硬件中断、任务切换和虚拟8086模式等。只有特权代码才允许修改这些位。

TF：Trap(bit 8)，为1时表示单步中断模式，处理器在每条指令执行后触发调试异常（debug exception），这允许检查程序每条指令执行的状态。当程序通过POPF, POPFD, IRET指令设置了TF位，则后面跟着的指令开始触发调试一场。
IF：Interrupt enable(bit 9)，控制处理器对可屏蔽中断的响应，如果为1，则表示处理器响应可屏蔽中断，为0表示禁止可屏蔽中断。**IF位不影响异常和不可屏蔽中断。**CPL、IOPL和CR4中的VME位决定IF位是否能被CLI，STI，POPF，POPFD和IRET指令修改。
IOPL：I/O privilege level field(bits 12 and 13)表示当前运行程序的IO特权级，当前运行程序的CPL必须数值上小于或等于IOPL值才可以访问IO地址空间，只有当CPL为0时，才可以通过POPF和IRET指令修改IOPL位。
NT：Nested task(bit 14)，控制被中断和被调用任务，当通过call指令、异常或中断进入任务时，处理器置该位为1。当IRET从任务中返回时会检查和修改该位。
RF：Resume(bit 16)，控制处理器对程序断点的响应，为1时，屏蔽指令断点产生debug异常；为0时，允许指令断点产生debug异常。RF的主要功能是防止处理器进入debug异常死循环。
VM：Virtual-8086 mode(bit 17)，置为表示进入虚拟8086模式，清除表示返回保护模式。
ID：Identification(bit21)，表示是否支持CPUID指令。

GDTR、LDTR、IDTR和TR

GDT和IDT都不能称之为段，因为他们既没有段选择符，也没有段描述符。

GDTR

当处理器上电或reset时，GDTR的基地址为0，界限为0FFFFH，在进入保护模式之前，新的基地址和界限必须被加载进GDTR中。

LDTR

LDTR寄存器包含16位的段选择符、基地址、短界限和描述符属性，当LLDT指令向LDTR中加载段选择符时，基地址、段界限和描述符属性会被自动加载如LDTR中。当发生任务切换时，LDTR自动加载新任务的段选择符和描述符。

当处理器上电或reset时，GDTR的基地址为0，界限为0FFFFH。

IDTR

当处理器上电或reset时，GDTR的基地址为0，界限为0FFFFH。

TR

TR寄存器包含当前任务TSS的16位的段选择符、基地址、短界限和描述符属性。段选择符指向GDT表中的TSS描述符。

当LTR指令向TR中加载段选择符时，基地址、段界限和描述符属性会被自动加载如TR中。当发生任务切换时，TR自动加载新任务TSS的段选择符和描述符。

Control Registers

CR0：包含控制操作模式和处理器状态的系统控制位
- PG：Paging(bit 31)：为1时表示启动分页，为0时表示禁止分页。当禁止分页时，所有的线性地址可以直接看作物理地址。当PE位为0时，设置PG位会触发保护异常，即只有操作系统是保护模式时，才可以启动分页。
- CD：Cache Disable
- NW：Not Write-through
- WP：Write Protect(bit 16)，为1时，阻止高特权级程序向只读页面进行写操作；为0时，允许高特权级程序向只读页面写入（忽略U/S位）
- EM：Emulation(bit 2)，为1时表示处理器没有内部或外部的x87 FPU；为0时表示存在x87 FPU
- PE：Protection Enable(bit 0)，为1时表示启动保护模式；为0时表示进入是模式。
CR1：Reserved
CR2：包含触发缺页异常的线性地址
CR3：包含页目录的起始物理地址和两个控制位（PCD和PWT）
CR4：包含一些架构扩展控制位
- PVI：Protected-Mode Virtual Interrupts(bit 1)，为1时启动对虚拟中断的硬件支持；为0时禁止虚拟中断。
- PSE：Page Size Extensions(bit 4)，为1时表示页的大小为4MB，为0时表示页的大小为4KB。
- PAE：Physical Address Extension(bit 5)，为1时，启动物理地址扩展，允许分页产生超过32位的物理地址；为0时，禁止物理地址扩展。
- VMXE：VMX-Enable Bit(bit 13)，为1时，启动虚拟机操作扩展；为0时，禁止虚拟机操作扩展。

System Instruction Summary

系统指令只允许在特权级0上执行，其他特权级不允许执行。

Protected-Mode Memory Management

Overview

IA-32中的内存管理机制被分为两部分：分段和分页。**在保护模式中，分段模式自动开启，没有控制位来禁止分段。**而分页机制则是可选的。

Segment

分段机制可用来实现一系列系统设计，这些设计包含简单使用分段的平坦模型到使用分段建立了健壮的操作环境的多分段模型。

Basic Flat Model

代码段和数据段基地址从0开始，界限为4GB，访问不存在内存设备的物理地址也不会触发异常，需要分页机制来实现进程和进程的内存空间隔离与保护。
Protected Flat Model

设置了段界限，避开了不存在物理内存的物理地址，需要分页机制来实现进程和进程的内存空间隔离与保护。
Multi-segmented Model

充分使用了分段机制，每个程序拥有自己的段描述符表和段，每个段都做到了充分的隔离。

Segment Selector

由于GDT的第一个描述符不会被处理器使用，因此指向该描述符的选择符（index和ti为0）被用作null段选择符，可以用来初始化一个未使用的段寄存器，然而，通过null段选择符访问数据以及向CS和SS寄存器加载null段选择符会导致保护异常(#GP)。

Segment Registers

为了减少地址转换时间和代码复杂度，处理器提供了6个段寄存器。每一个段寄存器都有一个可见部分和隐藏部分，当段选择符被加载到段寄存器的可见部分，处理器会自动将段寄存器的隐藏部分（段基址、段界限和访问信息）加载进来。这样处理器每次访问内存便不用再去段选择符指向的描述符中获取信息，而是直接从段寄存器中获取。

当段描述符发生变化，软件有责任重新更新段寄存器的内容，否则段寄存器很可能使用过时的信息来访问内存。

Segment Descriptor

段描述符通常由编译器或操作系统创建，而不是应用程序。

注：段基址最好为16字节对齐，这样有利于提高性能。

描述符共有以下几种类型：

代码和数据段描述符（S位为1）
系统描述符（S位为0）
- 系统段描述符（指向系统段）
  - LDT段描述符
- TSS描述符
门描述符（包含指向代码段或TSS的选择符）
- 调用门
- 中断门
- 陷阱门
- 任务门

Paging

软件通过MOV指令设置CR0的PG位来启动分页，在此之前要确保CR3中包含页目录的起始地址以及初始化相应内容。

IA-64处理器共支持四种不同的分页方式：

32位分页
PAE分页
4级分页
5级分页

我们用到的是32位分页。此时CR0.PG=1以及CR4.PAE=0。

当CR4的PSE和PDE的PS位为0，表示采用4KB页大小，寻址方式如下所示：

R/W：0表示只读，1表示可读写
U/S：为0表示不允许用户模式访问，只允许特权级为0、1、2的程序访问。为1时，允许所有特权级程序访问。
A：Accessd，由处理器固件来设置，用来指示此表象所指向的页是否被访问过。
D：Dirty，由处理器固件来设置，用来指示此表象所指向的页是否被写过数据。
PAT
G：如果CR4.PGE=1，决定是否地址转换为全局。

Access Rights

对线性地址的访问可以分为两种：

supervisor-mode access：CPL < 3
user-mode access：CPL=3

其中对线性地址隐式的访问如从GDT或LDT中加载段选择符被视为supervisor-mode access（不管其CPL为多少）。

其访问权限的判断过于复杂，此处略去，推荐同学们去手册中直接查看。粗略总结就是supervisor-mode access可以访问supervisor和user的地址（由U/S位决定）。但user只能访问自己的，不可以访问supervisor的。

Page Fault Exception

缺页异常由两种情况导致：

没有线性地址对应的物理地址转换（P位为0）
有地址转换，但access rights不允许本次访问

Protection

Limit Checking

对于向上扩展的段，段界限指向了段中最后一个允许被访问的字节，数值上为段大小减一。

对于向下扩展的段，段界限指向了段中最后一个不允许被访问的字节。

Type Checking

处理器通常会在以下几种情况执行类型检查：

当段选择符加载进段寄存器
- CS寄存器只能加载代码段选择符
- 系统段和不可读代码段选择符不能被加载到DS、ES、FS和GS寄存器中
- 只有可写的数据段可以被加载到SS寄存器
当通过指令访问段时
- 不能写入可执行代码段
- 不能写入只读数据段
- 不能读不可读代码段
当指令操作数为段选择符时
- 远跳转的call和jmp指令只能访问一致性代码段、非一致性代码段、调用门、任务门和TSS。
- LLDT指令的选择符指向LDT段描述符
- LTR指令的选择符指向TSS描述符
- IDT的表项必须是中断、陷阱和任务门。
在特定的内部操作中
- 当call或jump通过调用门转移时，处理器自动检查指向的段描述符是代码段描述符
- 当call或jump通过任务门转移时，处理器自动检查任务门指向的段描述符为TSS
- 当call或jump直接通过TSS转移时，处理器自动检查指向的段描述符是TSS
- 当从嵌套任务中返回时，处理器检查目前TSS中指向的前一个任务的段描述符是TSS

Privilege Checking

特权检查发生在向段寄存器中加载段选择符时。

CPL：当前任务或程序的特权级，被存储在CS寄存器和SS寄存器的最低两位。通常情况下，CPL等于代码段的DPL，处理器在转移到不同特权级的代码段下时，会改变CPL。但是当转移到一致性代码段时，CPL不发生变化，此时CPL和代码段的DPL可能会出现不一致的情况。
DPL：段或门的特权级，被存储在段或门描述符的DPL域中。当目前执行的代码尝试访问一个段或门时，段或门的DPL用来和CPL和选择符的RPL进行比较，确认程序是否具有权限访问。
- 数据段DPL：显示允许访问该段的最低特权级。
- 非一致性代码段（不使用调用门）：显示访问该段需要具有的特权级
- 调用门DPL：显示允许调用该门的最低特权级
- 一致性代码段和非一致性代码段通过门访问：显示允许访问该段的最高特权级。
- TSS DPL：显示允许访问该段的最低特权级
RPL：段选择符的最低两位，用来确保高特权级代码不会代替低特权级代码访问一个段，除非低特权级代码有权限来访问。主要和ARPL指令搭配使用。

Accessing Data Segments

在将数据段选择符加载到段寄存器之前，处理器会根据CPL，段选择符的RPL和数据段描述符的DPL进行比较，当DPL数值上都大于等于CPL和RPL时，允许加载，否则产生general-protection异常。

值得注意的是，段选择符的RPL是软件控制的，举个例子，CPL为3的应用程序可以设置数据段选择符的RPL为0，从而绕过RPL检查，只需要检查CPL。

Accessing Stack Segments

加载入SS寄存器中的选择符RPL和段描述符DPL必须和当前CPL保持一致。如果RPL和DPL与当前的CPL不一致，则触发general-protection 异常。

Transfer between Code Segments

程序的转移可以通过以下指令实现：

JMP、CALL、RET、SYSENTER、SYSEXIT、SYSCALL、SYSRET，INT n和IRET指令，以及异常和中断。

JMP和CALL指令可以通过以下四种方式实现代码段的切换：

操作数为指向目标代码段的选择符
操作数为指向调用门的选择符，调用门描述符中包含指向目标代码段的选择符。
操作数为指向TSS的选择符，TSS中包含指向目标代码段的选择符。
操作数为指向任务门的选择符，任务门中包含指向TSS的选择符，TSS中包含指向目标代码段的选择符。

Direct calls or jumps to Code Segment

CPL：调用程序的CPL
DPL：目标代码段的DPL
RPL：指向目标代码段的选择符的RPL
C flag：目标代码段描述符的C位，表示该段是一致性代码段还是非遗执行代码段。

访问非一致性代码段（C=0）

CPL必须等于DPL，RPL数值像小于等于CPL（可以看作DPL）。
访问一致性代码段

CPL数值上大于等于DPL，RPL不用检查。转移后CPL不发生变化。

Call Gates

调用门描述符可以放在GDT或LDT中，不能放在IDT中。

Segment Selector：指向目标代码段
Offset：目标代码段的入口点
P：调用门是否有效
Param Count：参数个数，当堆栈发生切换，表示需要从调用者堆栈复制到被调用者堆栈的参数个数。

由于调用门中包含代码段偏移量，因此jmp和call中的代码偏移量是不被使用的，但要给出一个值，随便设置即可。

调用门的特权级检查需要用到下面五个值：

CPL
指向调用门的选择符的RPL
调用门描述符中的DPL
目标代码段的DPL
目标代码段的C flag

call和jmp规则稍有不同，jmp跳转不能更改特权级，call跳转可以更改特权级（当跳转到非一致性高特权级代码段时），特权级检查规则如下：

Stack Switching

当特权级发生变换时，如通过call调用门到非一致性高特权级代码段，会发生栈切换，因为不同的特权级具有不同的堆栈，防止因为栈空间不足而崩溃，以及特权级隔离。

Uses the DPL of the destination code segment (the new CPL) to select a pointer to the new stack (segment selector and stack pointer) from the TSS.
Reads the segment selector and stack pointer for the stack to be switched to from the current TSS. Any limit violations detected while reading the stack-segment selector, stack pointer, or stack-segment descriptor cause an invalid TSS (#TS) exception to be generated.
Checks the stack-segment descriptor for the proper privileges and type and generates an invalid TSS (#TS) exception if violations are detected.
Temporarily saves the current values of the SS and ESP registers.
Loads the segment selector and stack pointer for the new stack in the SS and ESP registers.
Pushes the temporarily saved values for the SS and ESP registers (for the calling procedure) onto the new stack (see Figure 5-13).
Copies the number of parameter specified in the parameter count field of the call gate from the calling procedure’s stack to the new stack. If the count is 0, no parameters are copied.
Pushes the return instruction pointer (the current contents of the CS and EIP registers) onto the new stack.
Loads the segment selector for the new code segment and the new instruction pointer from the call gate into the CS and EIP registers, respectively, and begins execution of the called procedure.

当从高特权级retf返回低特权级时，执行以下操作

Checks the RPL field of the saved CS register value to determine if a privilege level change is required on the return.
Loads the CS and EIP registers with the values on the called procedure’s stack. (Type and privilege level checks are performed on the code-segment descriptor and RPL of the code- segment selector.)
(If the RET instruction includes a parameter count operand and the return requires a privilege level change.) Adds the parameter count (in bytes obtained from the RET instruction) to the current ESP register value (after popping the CS and EIP values), to step past the parameters on the called procedure’s stack. The resulting value in the ESP register points to the saved SS and ESP values for the calling procedure’s stack. (Note that the byte count in the RET instruction must be chosen to match the parameter count in the call gate that the calling procedure referenced when it made the original call multiplied by the size of the parameters.)
(If the return requires a privilege level change.) Loads the SS and ESP registers with the saved SS and ESP values and switches back to the calling procedure’s stack. The SS and ESP values for the called procedure’s stack are discarded. Any limit violations detected while loading the stack-segment selector or stack pointer cause a general-protection exception (#GP) to be generated. The new stack-segment descriptor is also checked for type and privilege violations.
(If the RET instruction includes a parameter count operand.) Adds the parameter count (in bytes obtained from the RET instruction) to the current ESP register value, to step past the parameters on the calling procedure’s stack. The resulting ESP value is not checked against the limit of the stack segment. If the ESP value is beyond the limit, that fact is not recognized until the next stack operation.
(If the return requires a privilege level change.) Checks the contents of the DS, ES, FS, and GS segment registers. If any of these registers refer to segments whose DPL is less than the new CPL (excluding conforming code segments), the segment register is loaded with a null segment selector.

Page-Level Protection

处理器执行两种页面级别的保护

Restriction of addressable domain (supervisor and user modes).

当CPL为0，1和2时，我们称代码运行在supervisor mode，如果为3，则运行在user mode。当处理器在supervisor mode下工作时，可以访问所有的页，当在user mode下工作室，只能访问用户级别的页。
Page type (read only or read/write).

当处理器在S模式下，且CR0的WP位为0，所有的页均为可读写。当处理器在U模式下，只能写入用户级别的可读写的页。当WP为1，只读页不可以被任何特权级下的代码写入。

任何违反了以上两种保护的操作都会触发page-fault异常。

Interrupt And Exception Handling

当检测到中断或异常时，当前执行程序或任务会被挂起，处理器转去执行中断或异常处理程序，当处理程序执行完毕，处理器会恢复被中断程序或任务的执行。

x86架构对中断和异常进行了区分，中断和异常的区别在于中断是异步事件，异常是同步事件。

Exception and interrupt vectors

为了更方便地处理异常和中断，处理器为一些中断异常事件分配了一个专门的数字，称为向量号（vector number），处理器使用向量号作为IDT表的索引，获取中断或异常处理程序的入口点。

向量号的范围是0~255，通常0~31被intel所保留，用于架构预先定义的异常和中断，但并不是所有的向量号都已经被分配，有些在0~31范围内的向量号还未分配给特定的异常和中断，请不要使用这些未分配的向量号，因为未来的某一天intel可能就会用到。

而32~255范围内的向量号用于分配给用户自定义的中断，这些中断的来源可能是外部I/O设备等。

Sources of interrupts

中断的来源分为以下两种

外部中断
软件生成的中断

External Interrupts

外部中断由处理器的引脚或本地APIC设备获取

External interrupts are received through pins on the processor or through the local APIC. The primary interrupt pins on Pentium 4, Intel Xeon, P6 family, and Pentium processors are the LINT[1:0] pins, which are connected to the local APIC. When the local APIC is enabled, the LINT[1:0] pins can be programmed through the APIC’s local vector table (LVT) to be associated with any of the processor’s exception or interrupt vectors.

When the local APIC is global/hardware disabled, these pins are configured as INTR and NMI pins, respectively. Asserting the INTR pin signals the processor that an external interrupt has occurred. The processor reads from the system bus the interrupt vector number provided by an external interrupt controller, such as an 8259A (see Section 6.2, “Exception and Interrupt Vectors”). Asserting the NMI pin signals a non-maskable interrupt (NMI), which is assigned to interrupt vector 2.

Software-Generated Interrupts

Int n指令允许软件触发中断，比如，Int 35可以隐式地调用35号中断的处理器程序。0~255号中断都可以通过该指令触发，但是当触发NMI中断时，处理器的行为和当NMI从外部触发时的行为不一致，会调用对应的中断处理程序，但不会执行相应的硬件操作。

软件触发的中断不能被IF位所屏蔽。

Exceptions

异常的来源有以下几种：

处理器检测到程序的错误导致的异常

比如除0
软件触发的异常

INTO,INT1,INT3和BOUND指令用来触发软件异常
机器检查的异常（Machine-Check exceptions）

P6和奔腾处理器提供了机器检查机制来检查内部芯片硬件和总线事务，当出现错误时，处理器会产生机器检查一场（vector 18）

异常的分类：

Faults：处理后重新执行异常指令
Traps：处理后执行异常指令下一条指令
Aborts：不需要精确报告异常指令的位置，也不允许重新执行导致异常的程序或任务。

Enable and Disable Interrupts

处理器根据处理器状态、IF和RF位来屏蔽一些中断的产生。

Masking Maskable Hardware interrupt

当处理器IF位为0，INTR引脚接受的中断都会被屏蔽掉。IF为不影响不可屏蔽中断（NMI）和异常。

当CPL数值上小于IOPL时，才能使用STI/CLI指令设置或清楚IF位，否则会产生异常。

Masking Instruction Breakpoints

RF为1屏蔽指令断点触发的debug异常。

Masking Exceptions and interrupts when switching stacks

当MOV或POP指令修改了SS寄存器的值时，在下一条指令执行时屏蔽指令断点触发的debug异常和单步陷阱异常。

Intel推荐使用LSS指令同时加载SS和ESP寄存器。

Prioritization of concurrent events

处理器先处理高特权级事件，此时低特权级异常会被抛弃，低特权级终端仍然挂起等待处理。

IDT

和GDT、LDT一样，IDT也是8字节描述符的数组，将每个中断或异常和一个门描述符联系起来。但和GDT不一样的是，IDT第一个表项是可用的描述符。

由于只有256个中断或异常向量，因此IDT只需要容纳256个描述符即可，也可以少于256个。

IDT包含以下三种门

任务门
中断门

清除TF位和IF位
陷阱门

清除TF位，不清除IF位

中断门陷阱门和调用门非常类似，其中包含了目标代码段的选择符和偏移量。

当处理器调用中断或异常处理程序时，执行以下操作：

如果目标代码段特权级更高，发生栈切换

从TSS获取对应堆栈的选择符和栈指针，在新栈中压入SS、ESP、EFLAGS、CS和EIP的值，如果该异常存在错误码（error code），最后压入Error Code。
如果目标代码段特权级和当前特权级一致，

向堆栈中依次压入EFLAGS、CS、EIP和Error Code（如果有的话）

Privilege Checking

中断或异常门的特权级检查和直接通过call指令访问调用门类似，除了以下两点：

中断和异常向量没有RPL，因此不用检查RPL
当通过INT n，INT3或INTO等指令产生软件中断或异常时，需要检查门描述符的DPL，需要确保DPL数值上大于等于CPL，防止低特权级程序通过这些指令访问高特权级代码。对于硬件产生的中断和处理器检测到的异常，门描述符的DPL不用被检查。

Task Management

Overview

任务是处理器可以分派、执行和挂起的工作单位

任务由两部分组成：

任务执行空间：代码段、栈段和数据段
任务状态段（TSS）

Executing a Task

Software or the processor can dispatch a task for execution in one of the following ways:

A explicit call to a task with the CALL instruction.（task gate or tss）
A explicit jump to a task with the JMP instruction.（task gate or tss）
An implicit call (by the processor) to an interrupt-handler or exception-handler task.（task gate）
A return (initiated with an IRET instruction) when the NT flag in the EFLAGS register is set.

Task management data structure

处理器根据五项数据结构来处理和任务有关的活动

在保护模式下，对于一个任务来说，TSS和TSS descriptor是必要的，且指向TSS descriptor的选择符需要加载到TR寄存器中。

Task-state segment

TSS的域被分为两个主要的部分：

dynamic field：任务切换时进行更新
- 通用寄存器
- 段选择符
- EFLAGS
- EIP
- Previous Task Link
static field：任务切换时不进行更新，在任务创建时设置
- LDT segment selector field
- CR3
- 不同特权级的堆栈
- T flag
- IO 位图
- SSP

TSS descriptor

TSS描述符只能放在GDT中，不能放在LDT或IDT中

当G=0，段界限必须等于或大于0x67（比TSS最小长度少1），访问一个段界限小于0x67的TSS会触发无效TSS异常。

任何CPL数值上小于等于DPL的程序可以通过call或jump分配任务，RPL不检查。

Task register

保存TSS的段选择符，分为两部分

可见部分：TSS段选择符
不可见部分：TSS段描述符

Task-gate descriptor

可以被放置在GDT、LDT或IDT中，其中的TSS段选择符指向一个GDT中的TSS描述符。

当call或jump指令操作数为任务门描述符时，CPL和任务门选择符的RPL需要数值上都小于等于任务门的DPL，TSS的DPL不检查。

NT flag in EFLAGS register

Task Switching

The processor performs the following operations when switching to a new task:

Obtains the TSS segment selector for the new task as the operand of the JMP or CALL instruction, from a task gate, or from the previous task link field (for a task switch initiated with an IRET instruction).
Checks that the current (old) task is allowed to switch to the new task. Data-access privilege rules apply to JMP and CALL instructions. The CPL of the current (old) task and the RPL of the segment selector for the new task must be less than or equal to the DPL of the TSS descriptor or task gate being referenced. Exceptions, interrupts (except for those identified in the next sentence), and the IRET and INT1 instructions are permitted to switch tasks regardless of the DPL of the destination task-gate or TSS descriptor. For interrupts generated by the INT n, INT3, and INTO instructions, the DPL is checked and a general-protection exception (#GP) results if it is less than the CPL.1
Checks that the TSS descriptor of the new task is marked present and has a valid limit (greater than or equal to 67H). If the task switch was initiated by IRET and shadow stacks are enabled at the current CPL, then the SSP must be aligned to 8 bytes, else a #TS(current task TSS) fault is generated. If CR4.CET is 1, then the TSS must be a 32 bit TSS and the limit of the new task’s TSS must be greater than or equal to 107 bytes, else a #TS(new task TSS) fault is generated.
Checks that the new task is available (call, jump, exception, or interrupt) or busy (IRET return).
Checks that the current (old) TSS, new TSS, and all segment descriptors used in the task switch are paged into system memory.
Saves the state of the current (old) task in the current task’s TSS. The processor finds the base address of the current TSS in the task register and then copies the states of the following registers into the current TSS: all the general-purpose registers, segment selectors from the segment registers, the temporarily saved image of the EFLAGS register, and the instruction pointer register (EIP).
Loads the task register with the segment selector and descriptor for the new task’s TSS.
The TSS state is loaded into the processor. This includes the LDTR register, the PDBR (control register CR3), the EFLAGS register, the EIP register, the general-purpose registers, and the segment selectors. A fault during the load of this state may corrupt architectural state. (If paging is not enabled, a PDBR value is read from the new task’s TSS, but it is not loaded into CR3.)
If the task switch was initiated with a JMP or IRET instruction, the processor clears the busy (B) flag in the current (old) task’s TSS descriptor; if initiated with a CALL instruction, an exception, or an interrupt: the busy (B) flag is left set. (See Table 7-2.)
If the task switch was initiated with an IRET instruction, the processor clears the NT flag in a temporarily saved image of the EFLAGS register; if initiated with a CALL or JMP instruction, an exception, or an interrupt, the NT flag is left unchanged in the saved EFLAGS image.
If the task switch was initiated with a CALL instruction, an exception, or an interrupt, the processor will set the NT flag in the EFLAGS loaded from the new task. If initiated with an IRET instruction or JMP instruction, the NT flag will reflect the state of NT in the EFLAGS loaded from the new task (see Table 7-2).
If the task switch was initiated with a CALL instruction, JMP instruction, an exception, or an interrupt, the processor sets the busy (B) flag in the new task’s TSS descriptor; if initiated with an IRET instruction, the busy (B) flag is left set.
The descriptors associated with the segment selectors are loaded and qualified. Any errors associated with this loading and qualification occur in the context of the new task and may corrupt architectural state.
Begins executing the new task. (To an exception handler, the first instruction of the new task appears not to have been executed.)

Task Linking

call指令，中断和异常导致的任务切换，处理器会设置新任务的NT位以及TSS的previous task link位。

jmp指令导致的任务切换，previous task link域不修改

Reference

Intel® 64 and IA-32 Architectures Software Developer’s Manual Volume 3 (3A, 3B, 3C & 3D): System Programming Guide