HP server disk health check

Use smartctl

1. Find the disks

# smartctl --scan
/dev/sda -d scsi # /dev/sda, SCSI device

The scan will give a number of disks and device type.
2. Find device type

# hwinfo --disk
189: SCSI 10.0: 10600 Disk
[Created at block.245]
Unique ID: R7kM.RMnGJ5UEoK3
Parent ID: svHJ.ccZlDNM+DaE
SysFS ID: /class/block/sda
SysFS BusID: 0:1:0:0
SysFS Device Link: /devices/pci0000:00/0000:00:01.0/0000:03:00.0/host0/target0:1:0/0:1:0:0
Hardware Class: disk
Model: "HP LOGICAL VOLUME"
Vendor: "HP"
Device: "LOGICAL VOLUME"
Revision: "1.34"
Serial ID: "PDNLH0BRH7Q68A"
Driver: "hpsa", "sd"
Driver Modules: "hpsa", "sd_mod"
Device File: /dev/sda (/dev/sg1)
Device Files: /dev/sda, /dev/disk/by-id/scsi-0HP_LOGICAL_VOLUME_00000000, /dev/disk/by-id/scsi-3600508b1001c445903700dba678a8bcd, /dev/disk/by-id/scsi-SHP_LOGICAL_VOLUME_PDNLH0BRH7Q68A, /dev/disk/by-id/wwn-0x600508b1001c445903700dba678a8bcd, /dev/disk/by-path/pci-0000:03:00.0-scsi-0:1:0:0
Device Number: block 8:0-8:15 (char 21:1)
BIOS id: 0x80
Geometry (Logical): CHS 72957/255/63
Size: 1172058032 sectors a 512 bytes
Capacity: 558 GB (600093712384 bytes)
Config Status: cfg=no, avail=yes, need=no, active=unknown
Attached to: #43 (RAID bus controller)

Since the server come with a hardware raid controller, disk device shown in hwinfo –disk is not an individual disk, but a logical drive. The device file is /dev/sda (/dev/sg1).

According smartctl manual, for HP Smart Array RAID controllers, there are three currently supported drivers: cciss, hpsa, and hpahcisr. For disks accessed via the cciss driver the device nodes are of the form “/dev/cciss/c[0-9]d0”. For disks accessed via the hpahcisr and hpsa drivers, the device nodes you need are “/dev/sg[0-9]*”. (“lsscsi -g” is helpful in determining which scsi generic device node corresponds to which device.) Use the nodes corresponding to the RAID controllers, not the nodes corresponding to logical drives. See the -d option below.

cciss,N – [FreeBSD and Linux only] the device consists of one or more SCSI/SAS or SATA disks connected to a cciss RAID controller. The non-negative integer N (in the range from 0 to 15 inclusive) denotes which disk on the controller is monitored.

To look at disks behind HP Smart Array controllers, use syntax such as:
smartctl -a -d cciss,0 /dev/cciss/c0d0 (cciss driver under Linux)
smartctl -a -d cciss,0 /dev/sg2 (hpsa or hpahcisr drivers under Linux)

SysFS BusID: 0:1:0:0 tells the disk index is 1, so the command should be:
smartctl -a -d cciss,1 /dev/sg1

Use HP hpacucli

SMART is a set of disk properties, a logical drive consists of multiple disks, so that doesn’t align. The RAID controller could interpret the status of the underlying disks and present a SMART like interface, but typically the RAID driver and/or manufacturer tooling provides better or more relevant status information.
HP provides tools to monitor your hardware, the typical Linux utility to monitor HP raid controller status and the drives contained would be hpacucli.

Some usefull commands would be:

hpacucli> ctrl all show config 
hpacucli> ctrl all show config detail
hpacucli> ctrl all show status
hpacucli> ctrl slot=0 pd all show 
hpacucli> ctrl slot=0 pd 1 show
hpacucli> ctrl slot=0 pd all show status

Some useful guides
http://www.thegeekstuff.com/2014/07/hpacucli-examples/

Advertisements

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Google photo

You are commenting using your Google account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s