分类目录归档:服务器

MEGACLI查看硬盘状态

通过megacli的如下命令查看RAID的情况,命令如下:

/opt/MegaRAID/MegaCli/MegaCli64 LDPDInfo -Aall

重点关注以下几点:

Media Error Count
Other Error Count
Predictive Failure Count
Last Predictive Failure
Drive has flagged a S.M.A.R.T alert

如果这几个数值不为0,则可能为硬盘故障,需要更换硬盘。

可以通过让指定硬盘闪烁的方式来定位磁盘位置,命令如下:

MegaCli -PdLocate -start -physdrv [E:S] -aALL

其中 E表示 Enclosure Device ID,S表示Slot Number。比如坏盘的位置为:
Enclosure Device ID: 1
Slot Number: 0

可执行以下命令让其闪烁:
root@Storage-c2:/opt/MegaRAID/MegaCli# ./MegaCli64 -PdLocate -start -physdrv[1:0] -a0
Adapter: 0: Device at EnclId-1 SlotId-0 — PD Locate Start Command was successfully sent to Firmware Exit Code: 0x00
root@Storage-c2:/opt/MegaRAID/MegaCli#

更换硬盘后,关闭闪烁的命令如下:
MegaCli -PdLocate -stop -physdrv [E:S] -aALL

如果raid中有硬盘故障,更换硬盘后,一般都无需做操作,阵列卡会自动做rebuild,从拔出硬盘到插入新盘,一般会有以下的过程:

  • Device
    Normal —>Damage —>Rebuild —>Normal
  • Virtual Drive
    Optimal —>Degraded —>Degraded —>Optimal
  • Physical Drive
    Online —>Failed Unconfigured —>Rebuild —>Online

查看rebuild进度的命令如下:

/opt/MegaRAID/MegaCli/MegaCli64 -PDRbld -showprog -physDrv [1:0] -a0

输出一般如下:

root@Storage-c2:/opt/MegaRAID/MegaCli# ./MegaCli64 -PDRbld -showprog -physDrv [1:0] -a0 
Rebuild Progress on Device at Enclosure 1, Slot 0 Completed 10% in 0 Minutes.
Exit Code: 0x00
root@Storage-c2:/opt/MegaRAID/MegaCli#

HP Smart Array存储控制器(服务器阵列卡)说明

HP智能阵列控制器-Smart Array
HP智能主机总线适配器-Smart HBA
HP智能存储控制器功能与特性
Smart Array产品命名规则
Smart HBA产品命名
HP智能阵列卡控制器(10&100系列)
HP智能阵列控制器(300系列)
HP智能阵列控制器(BL系列)
HP Gen9智能存储控制器产品列表
HP Gen9智能存储控制器产品列表-图示
HP Gen9 DL//ML Smart Array-技术参数
HP Gen9 DL/ML Smart HBA技术参数
Smart Storage Battery智能存储电池-共享备份电池
HP智能存储电池与阵列卡安装实例
HP ProLiant服务器支持列表
用户HP智能阵列控制器的HP SmartCache
HP SmartCache
HP Secure Encryption
高级数据镜像(SAAP 2.0)
HBA mode(HBA模式)

华为RH2288H V5服务器windows 2012阵列卡驱动

华为RH2288H V5服务器阵列卡有多个版本,其中有一款阵列卡芯片为AVAGO MegaRAID<SAS3408>。

该型阵列卡的介绍:

Broadcom Tri-Mode SerDes technology enables seamless operation of PCIe, SAS or SATA storage devices in a single drive bay. The introduction of PCIe devices executing NVMe to the existing SAS/SATA infrastructure makes industry standard hot-pluggable drive bays even more versatile. Whether you’re building external or cloud-based storage systems for high connectivity, or outfitting servers, the SAS3408 8-port, 12Gb/s Tri-Mode SAS/SATA/PCIe IOC provides choices for storage optimization by enhancing direct-attached storage (DAS) solutions.

This high-performance, sixth-generation I/O controller supports T-10 data protection model and optical support, PCIe hot plugging, and up to 2,000 connected devices.

  • Capitalize on the wide bandwidth of 8 PCI Express 3.1 lanes with SAS transfer rates of up to 12Gb/s and SATA rates up to 6Gb/s
  • Extend existing end-user investments with DataBolt technology, which provides the benefits of 12Gb/s SAS with existing 6Gb/s drive devices
  • Deliver more than one million IOPS

详情点击官网链接:https://www.broadcom.com/products/storage/sas-sata-controllers/sas-3408

采用SAS3408芯片的阵列卡型号是MegaRAID 9440-8i,可以在官网https://www.broadcom.com/products/storage/raid-controllers/megaraid-9440-8i#overview下载到windows 2012的驱动,驱动下载地址为https://docs.broadcom.com/docs/MR_WINDOWS_DRIVER_VENTURA_7.6-7.706.02.00-WHQL.zip

华为RH V5系列windows操作系统下PCI数据捕获和信号处理控制器的驱动问题

在通过华为引导盘安装windows系统后,在设备管理器中“PCI数据捕获和信号处理控制器”上面有感叹号,其实“PCI数据捕获和信号处理控制器”设备是iBMC新增的黑匣子功能,主要用于系统崩溃时问题的定位,由于目前该设备的驱动微软尚未认证通过,故显示为异常设备。可在BMC中关闭该功能。

“PCI 数据捕获和信号处理控制器”未知设备对业务正常运行无任何影响,可以不做处理,如必需处理,也可以按照下面的步骤关闭黑匣子功能:

Intel xeon系列BIOS平台与CPU型号对照表

BIOS平台与CPU型号对照表

BIOS平台 CPU型号
Purley Platinum 81XX
Gold 51XX/61XX
Silver 41XX
Bronze 31XX
Brickland IvyBridge E7-48XX V2/E7-88XX V2
Haswell E7-48XX V3/E7-88XX V3
Broadwell E7-48XX V4/E7-88XX V4
Grantley Haswell E5-26XX V3
Broadwell E5-26XX V4
Romley IvyBridge E5-26XX V2/E5-24XX V2/E5-46XX V2
SandyBridge E5-26XX/E5-24XX

联想服务器配备常用SAS RAID卡规格

联想服务器已有万全、ThinkServer、System x及ThinkSystem四条产品线和十余代产品,各代产品所配备的SAS RAID卡互有交叉。这里对采用LSI/Avago芯片的SAS RAID卡进行一个资料整理。

 LSI芯片 ThinkSystem System x ThinkServer LSI型号 类型 缓存 接口 驱动
 SAS3516 RAID 930-16i   RAID 930-8e    MegaRAID 9460-16i   MegaRAID 9480-8i8e  MR   (RoC)   4GB SAS12G megasas35
 SAS3508 RAID 930-8i   RAID 930-24i    MegaRAID 9460-8i   MegaRAID 9365-28i  MR   (RoC)   2GB   4GB  SAS12G megasas35
 SAS3416 430-16i   430-16e    HBA 9400-16i   HBA 9400-16e  I/T   (IOC)   无 SAS12G mpt35sas
 SAS3408 RAID 530-8i   MegaRAID 9440-8i iMR   (IOC)   无 SAS12G megasas35
 430-8i   430-8e    HBA 9400-8i   HBA 9400-8e  I/T   (IOC)   无 SAS12G mpt35sas
 SAS3108 RAID 730-8i ServeRAID M5210 ServeRAID M5215 ServeRAID M5225 RAID 720i AnyRAID 720i AnyRAID 720ix  MegaRAID 9361-8i   MegaRAID 9364-8i MegaRAID 9380-8e  MR   (RoC)   1GB   2GB 4GB  SAS12G megasas2
 SAS3008  ServeRAID M1215 RAID 520i MegaRAID 9340-8i iMR   (IOC)   无 SAS12G megasas2
  N2215   N2225 N2226   9300-8i   9300-8e 9300-16e  I/T   (IOC)   无 SAS12G mpt3sas
 SAS2308  N2115   N2125   9207-8i   9207-8e  I/T   (IOC)   无 SAS6G mpt2sas
 SAS2208  ServeRAID M5110   ServeRAID M5115 ServeRAID M5120 ServeRAID M5016  RAID 710 MegaRAID 9270CV-8i   MegaRAID 9286CV-8e MegaRAID 9265CV-8i  MR   (RoC)  512MB   1GB 2GB  SAS6G megasas2
 SAS2108  ServeRAID M5015   ServeRAID M5014 ServeRAID M5025  RAID 700 MegaRAID 9260-8i   MegaRAID 9280-8e  MR   (RoC)  512MB   256MB  SAS6G megasas2
 SAS2008  ServeRAID M1015   ServeRAID M1115  RAID 500   AnyRAID 510i  MegaRAID 9240-8i      iMR   (IOC)   无 SAS6G megasas2
  6Gb SSD HBA   6Gb SAS HBA   9210-8i   9212-4i4e  I/T   (IOC)   无 SAS6G mpt2sas
 SAS2004  ServeRAID H1110  9211-4i IR   (IOC)   无 SAS6G mpt2sas
 SAS1078    MegaRAID 8708EM2 MR   (RoC)  256MB SAS3G megasas2

LSI阵列卡常见的工作模式

MR – MegaRAID模式,使用RoC芯片硬件实现RAID功能,常见的带缓存的阵列卡工作在此模式,如ServeRAID M5210、RAID720ix、9260-8i等;

iMR – Integrated MegaRAID模式,通过软件(驱动)实现高级RAID功能(如RAID5),常见的不带缓存的阵列卡工作在此模式,如ServeRAID M1215及移除缓存模块的ServeRAID M5210、9240-8i等;

IR – Integrated RAID模式,只能提供最简单RAID功能(RAID0/1/1E)的SAS卡工作在此模式,如ServeRAID H1110等;

I/T – Initiator and Target模式,即直通模式,无任何RAID功能,SAS HBA卡工作在此模式,如N2215、N2225等。

HP 服务器 UID 指示灯的作用

The Virtual Indicators allow you to monitor, and in some cases control, the state of indicators on the host system, including the Unit Identification Light (UID).

The Unit Identification Light (UID)

The UID helps you identify and locate a system, especially in high-density rack environments. Additionally, the UID is used to indicate that a critical operation is underway on the host, such as Remote console access or ROM flash.
In general, you control the state of the UID using the UID controls or using the physical UID button on the front of your system (or the back, if supported).

The “current state” (on or off) of the UID is the last state chosen using one of these methods. If a new state is chosen while the UID is blinking, this new state becomes the current state, and takes effect when the UID stops blinking.

NOTE: The Unit ID Light web page does not automatically refresh itself if the state of the actual light changes after the page is loaded. To insure the page accurately reflects the state of the UID Light, click on the Virtual Indicators link to update the page.

UID is blinking

The following circumstances cause the UID to blink:

  • Remote Console is currently active.
  • Upgrade of iLO Firmware is in progress.
  • The UID continues to blink until the circumstances causing it to blink subside. The UID reverts to the current state when the cause of the blinking is gone.

Turn Unit ID On

Clicking this button sets the current state of the UID On. However, blinking the UID overrides this state until the cause has subsided.
The UID current state can also by toggled by pressing the physical button on the front of your system (or on the back, if supported). The UID is blue when lit.

Turn Unit ID Off

Clicking this button sets the current state of the UID Off. However, blinking the UID overrides this state until the cause has subsided.
The UID current state can also be toggled by pressing the physical button on the front of your system (or on the back, if supported). The UID has no color when it is off.

浪潮服务器硬盘报错处理方法

现有一台浪潮服务器,阵列卡型号LSI MegaRAID SAS 9260-4i,突然出现多块硬盘告警,并伴随蜂鸣声,现将服务器重启,提示以下信息:

FW could not sync up config/prop changes for some of the VD’s/PD’s.
Press any key to continue, or ‘C’ to load the configuration utility.

进入阵列卡后,发现多块硬盘状态为Unconfigured Bad

此时需要在Physical View下,点击单块硬盘,将硬盘状态改为Unconfigured Good

每块硬盘都改为Unconf Good后,即如下图所示:

此时,在Scan Devices下,导入foreign阵列信息

点击Preview,

Import后,即可,如下图所示,阵列信息恢复正常

所有操作完成,重启后即可正常进入操作系统。