This error appears when running a stress test on a server. I have already ruled out a hardware issue on the OCP side (I replaced the OCP and all of its connections: cables, boards, etc.). I haven't changed the CPUs, RAM, or SSDs because they are unlikely to be the cause.
device_id: 0000:64:02.0
Dmesg check............................[FAIL]
[ 250.275668] {1}[Hardware Error]: Hardware error from APEI Generic Hardware Error Source: 0
[ 250.275670] {1}[Hardware Error]: It has been corrected by h/w and requires no further action
[ 250.275671] {1}[Hardware Error]: event severity: corrected
[ 250.275672] {1}[Hardware Error]: Error 0, type: corrected
[ 250.275673] {1}[Hardware Error]: section_type: PCIe error
[ 250.275673] {1}[Hardware Error]: port_type: 4, root port
[ 250.275674] {1}[Hardware Error]: version: 3.0
[ 250.275674] {1}[Hardware Error]: command: 0x0547, status: 0x0010
[ 250.275675] {1}[Hardware Error]: device_id: 0000:64:02.0
[ 250.275675] {1}[Hardware Error]: slot: 6
[ 250.275676] {1}[Hardware Error]: secondary_bus: 0x65
[ 250.275676] {1}[Hardware Error]: vendor_id: 0x8086, device_id: 0x347a
[ 250.275677] {1}[Hardware Error]: class_code: 060400
[ 250.275677] {1}[Hardware Error]: bridge: secondary_status: 0x2000, control: 0x0013
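To make the register values in the log easier to read, here is a minimal Python sketch that pulls the key/value fields out of the `[Hardware Error]` lines and decodes the set bits of `status` and `secondary_status`. The helper names (`parse_fields`, `decode_bits`) are made up for illustration, and the bit names assume the standard PCI Status / Secondary Status register layout, so treat the output as a hint rather than a diagnosis.

```python
import re

# Assumption: standard PCI Status / Secondary Status bit layout
# (only the commonly reported bits are named here).
STATUS_BITS = {
    4: "Capabilities List",
    8: "Master Data Parity Error",
    11: "Signaled Target Abort",
    12: "Received Target Abort",
    13: "Received Master Abort",
    15: "Detected Parity Error",
}

# Captures everything after the "[Hardware Error]: " tag on a dmesg line.
MESSAGE_RE = re.compile(r"\[Hardware Error\]:\s+(.*)$")


def parse_fields(lines):
    """Collect 'key: value' pairs from APEI hardware error lines.

    The first occurrence of a key wins, so 'device_id' keeps the
    bus address (0000:64:02.0) rather than the later PCI device ID.
    """
    fields = {}
    for line in lines:
        match = MESSAGE_RE.search(line)
        if not match:
            continue
        for part in match.group(1).split(", "):
            key, sep, value = part.rpartition(": ")
            if not sep:
                continue  # free text such as "root port"
            # "bridge: secondary_status" -> "secondary_status"
            key = key.rsplit(": ", 1)[-1].strip()
            fields.setdefault(key, value.strip())
    return fields


def decode_bits(value, names):
    """Return a name for every set bit in a 16-bit register value."""
    return [names.get(bit, f"bit {bit}") for bit in range(16) if value & (1 << bit)]


if __name__ == "__main__":
    sample = [
        "[ 250.275674] {1}[Hardware Error]: command: 0x0547, status: 0x0010",
        "[ 250.275675] {1}[Hardware Error]: device_id: 0000:64:02.0",
        "[ 250.275677] {1}[Hardware Error]: bridge: secondary_status: 0x2000, control: 0x0013",
    ]
    fields = parse_fields(sample)
    for name in ("status", "secondary_status"):
        print(name, fields[name], decode_bits(int(fields[name], 16), STATUS_BITS))
```

Under that assumed layout, `secondary_status: 0x2000` has only bit 13 set (Received Master Abort on the secondary side of root port 0000:64:02.0, i.e. bus 0x65), and `status: 0x0010` has only the Capabilities List bit set.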