27.10.2015 Views

Advanced Configuration and Power Interface Specification

ACPI_6.0

ACPI_6.0

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

ACPI Platform Error <strong>Interface</strong>s (APEI)<br />

18<br />

ACPI Platform Error <strong>Interface</strong>s (APEI)<br />

This section describes the ACPI Platform Error <strong>Interface</strong>s (APEI), which provide a means for the<br />

platform to convey error information to OSPM. APEI extends existing hardware error reporting<br />

mechanisms <strong>and</strong> brings them together as components of a coherent hardware error infrastructure.<br />

APEI takes advantage of the additional hardware error information available in today’s hardware<br />

devices <strong>and</strong> integrates much more closely with the system firmware.<br />

As a result, APEI provides the following benefits:<br />

• Allows for more extensive error data to be made available in a st<strong>and</strong>ard error record format for<br />

determining the root cause of hardware errors.<br />

• Is extensible, so that as hardware vendors add new <strong>and</strong> better hardware error reporting<br />

mechanisms to their devices, APEI allows the platform <strong>and</strong> the OSPM to gracefully<br />

accommodate the new mechanisms.<br />

This provides information to help system designers underst<strong>and</strong> basic issues about hardware errors,<br />

the relationship between the firmware <strong>and</strong> OSPM, <strong>and</strong> information about error h<strong>and</strong>ling <strong>and</strong> the<br />

APEI architecture components.<br />

APEI consists of four separate tables:<br />

• Error Record Serialization Table (ERST)<br />

• BOOT Error Record Table (BERT)<br />

• Hardware Error Source Table (HEST)<br />

• Error Injection Table (EINJ)<br />

18.1Hardware Errors <strong>and</strong> Error Sources<br />

A hardware error is a recorded event related to a malfunction of a hardware component in a<br />

computer platform. The hardware components contain error detection mechanisms that detect when<br />

a hardware error condition exists. Hardware errors can be classified as either corrected errors or<br />

uncorrected errors as follows:<br />

• A corrected error is a hardware error condition that has been corrected by the hardware or by the<br />

firmware by the time the OSPM is notified about the existence of the error condition.<br />

• An uncorrected error is a hardware error condition that cannot be corrected by the hardware or<br />

by the firmware. Uncorrected errors are either fatal or non-fatal.<br />

— A fatal hardware error is an uncorrected or uncontained error condition that is determined to<br />

be unrecoverable by the hardware. When a fatal uncorrected error occurs, the system is<br />

restarted to prevent propagation of the error.<br />

— A non-fatal hardware error is an uncorrected error condition from which OSPM can attempt<br />

recovery by trying to correct the error. These are also referred to as correctable or<br />

recoverable errors.<br />

Version 6.0 713

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!