® Intel Server Compute Blade SBX82: Hardware Maintenance Manual and Troubleshooting Guide A Guide for Technically Qualified Assemblers of Intel Identified Subassemblies & Products Order Number C90896-001...
Page 2
Disclaimer ® Information in this document is provided in connection with Intel products. No license, express or implied, by estoppel or otherwise, to any intellectual property rights is granted by this document. Except as provided in Intel's Terms and Conditions of Sale for such products. Intel assumes no liability whatsoever, and Intel disclaims any express or implied warranty, relating to sale and/or use of Intel products including liability or warranties relating to fitness for a particular purpose, merchantability, or infringement of any patent, copyright or other intellectual property right.
Contents Safety and regulatory information ..........vii General Safety .
Page 4
Installing the blade server in the SBCE unit ....... . 44 Updating your blade server configuration .
Page 5
Steps to extract the Windows Update to the hard drive (GUI) ....72 Steps to extract DOS update files to diskette (GUI) ..... . 72 Command Line Operation .
Safety and regulatory information NOTE The service procedures are designed to help you isolate problems. They are written with the assumption that you have model-specific training on all computers, or that you are familiar with the computers, functions, terminology, and service information provided in this manual.
General Safety Follow these rules to ensure general safety: • Observe good housekeeping in the area of the machines during and after maintenance. • When lifting any heavy object: Ensure you can stand safely without slipping. Distribute the weight of the object equally between your feet. Use a slow lifting force.
Page 9
• Find the emergency power-off (EPO) switch, disconnect switch, or electrical outlet in the room. If an electrical accident occurs, you can quickly turn off the switch or unplug the power cord. • Do not work alone under hazardous conditions, or near equipment that has hazardous voltages. •...
Handling electrostatic discharge-sensitive devices Any computer part containing transistors or integrated circuits (IC) should be considered sensitive to electrostatic discharge (ESD). ESD damage can occur when there is a difference in charge between objects. Protect against ESD damage by equalizing the charge so that the server, the part, the work mat, and the person handling the part are all at the same charge.
Page 11
To Connect To Disconnect Turn everything OFF. Turn everything OFF. First, attach all cables to devices. First, remove power cords from outlet. Attach signal cables to connectors. Remove signal cables from connectors. Attach power cords to outlet. Remove all cables from devices. Turn device ON.
Page 12
≥18 kg (37 lbs) ≥32 kg (70.5 lbs) ≥55 kg (121.2 lbs) xx CAUTION: Use safe practices when lifting. xx CAUTION: The power control button on the device and the power switch on the power supply do not turn off the electrical current supplied to the device. The device also might have more than one power cord.
xx CAUTION: To avoid personal injury, before lifting the unit, remove all the blades to reduce the weight. xx CAUTION: Hazardous energy is present when the blade is connected to the power source. Always replace the blade cover before installing the blade. Regulatory specifications and disclaimers Safety compliance USA:...
Electromagnetic compatibility (ECM) USA: FCC CFR 47 Part 2 and 15, Verified Class A Limit Canada: IC ICES-003 Class A Limit Europe: EMC Directive, 89/336/EEC EN55022, Class A Limit, Radiated & Conducted Emissions EN55024 ITE Specific Immunity Standard EN61000-4-2 ESD Immunity (Level 2 Contact Discharge, Level 3 Air Discharge) EN61000-4-3 Radiated Immunity (Level 2) EN61000-4-4 Electrical Fast Transient (Level 2) EN61000-4-5 AC Surge...
Electromagnetic compatibility notices (International) Europe (CE Declaration of Conformity): This product has been tested in accordance too, and complies with the Low Voltage Directive (73/23/EEC) and EMC Directive (89/336/EEC). The product has been marked with the CE Mark to illustrate its compliance. Japan EMC Compatibility: English translation of the notice above: This is a Class A product based on the standard of the Voluntary Control Council for Interference by Information Technology Equipment (VCCI).
Page 16
RRL Korea: English translation of the notice above: Device User’s Information Class A device This device complies with RRL EMC and is operated in a commercial environment so that distributors or users pay attention to this point. If this product is sold or purchased improperly, please exchange this product to one that can be used at home.
® 1 Introducing the Intel Server Compute Blade SBX82 These high-performance blade servers are ideally suited for networking environments that require superior processor performance, efficient memory management, flexibility, and reliable data storage. This Hardware Maintenance Manual and Troubleshooting Guide provides information about: •...
Page 18
Figure 1. Blade server release levers Release levers Release button A set of user labels comes with the Intel® Server Compute Blade SBX82. When you install the blade server in the SBCE unit, write identifying information on a label and place the label on the SBX82 unit bezel.
Page 19
Figure 2. Label placement on the SBCE unit Important: Do not place the label on the blade server itself or in any way block the ventilation holes on the blade server.
Features and specifications This section provides a summary of the features and specifications of your blade server. Use the Configuration/Setup Utility program to determine the specific type of processor that is in the blade server. Reliability, availability, and serviceability features Three of the most important features in server design are reliability, availability, and serviceability (RAS).
Intel® Server Compute Blade SBX82 features The design of your blade server takes advantage of advancements in memory management and data storage. Your blade server uses the following features and technologies: • Disk drive support The blade server supports up to two 2.5-inch SCSI disk drives. •...
• Power throttling Each blade server is powered by two SBCE unit redundant 2000 W power supply modules. By enforcing a power policy known as oversubscription, the SBCE unit can load-share power between two power modules to ensure efficient power for each device in the SBCE unit. This policy is enforced when the initial power is applied to the SBCE unit or when a blade server is inserted into the SBCE unit.
Processor: Size: Electrical Input: • Height: 24.5 cm (9.7 inches) Supports up to two processors 12 V dc • Depth: 44.6 cm (17.6 inches) ® ™ • Intel Xeon processors with Environment: • Width: 2.9 cm (1.14 inches) an 800 MHz FSB at speeds •...
Notices and statements used in this document The following notices and statements are used in the documentation: • Note: These notices provide important tips, guidance, or advice. • Important: These notices provide information or advice that might help you avoid inconvenient or problem situations.
2 Using power, controls, jumpers, switches, and indicators This chapter describes the power features, how to turn on and turn off the blade server, what the controls and indicators mean, and where the system board jumpers and switches are located and how to use them.
Turning off the blade server When you turn off the blade server, it is still connected to power through the SBCE unit. The blade server can respond to requests from the service processor, such as a remote request to turn on the blade server.
Understanding the control panel and LEDs This section describes the controls and LEDs on your SBCE unit. NOTE The illustrations in this document might differ slightly from your hardware. NOTE The control panel door is shown in the closed (normal) position in the illustration. To access the power-control button, you must open the control panel door.
Page 28
You can also press keyboard keys in the following sequence to switch keyboard/mouse/video control between blade servers: NumLock NumLock blade_server_number Enter Where blade_server_number is the two-digit number for the blade bay in which the blade server is installed. CD/diskette/USB select button: Press this button to associate the CD-ROM drive, diskette drive, and USB port with this blade server.
Using switch block 2 (SW2) You must remove the blade server from the SBCE unit, open the cover, and press the Light Path Diagnostics button to light any error LEDs that were turned on during processing. The following illustration and Table 1 on page 14 show the location and the settings for SW2. Figure 4.
Using Light Path Diagnostics to troubleshoot the system board After the system board is removed from the chassis, you can press Light Path Diagnostics (SW4) to troubleshoot system board component problems. See Figure 5 on page 15 and Table 2 on page 15 for more information about locating Light Path Diagnostics LEDs and what to do if an error LED is lit.
Page 32
Table 3. SW4 Light Path Diagnostics LED locations LED error Action Check the error log for additional information. Reboot the blade server. If the error still exists, replace the system board. Check the processors to make sure they are at the same speed.
3 Customer replaceable units This chapter provides instructions for installing hardware options in your blade server. Some option- removal instructions are provided in case you need to remove one option to install another. Installation guidelines Before you begin installing options in the blade server, read the following information: •...
• Do not leave the device where others can handle and possibly damage it. • While the device is still in its non-conductive package, touch it to an unpainted metal part of the SBCE unit or any unpainted metal surface on any other grounded rack component in the rack you are installing the device in for at least 2 seconds.
Removing the blade server from the SBCE unit NOTE The illustrations in this section might differ slightly from your hardware. The following figure shows how to remove the blade server from the SBCE unit. Attention • To maintain proper system cooling, do not operate the SBCE unit for more than one minute without either a blade server, expansion unit, or filler blade installed in each blade bay.
Opening the blade server cover The following illustration shows how to open the cover on the blade server. Blade-cover release Blade-cover release Complete the following steps to open the blade server cover: 1. Read the safety information beginning on page vii and “Installation guidelines” on page 17. 2.
Removing the blade server bezel assembly To install certain options, you must first remove the blade server bezel assembly. The following illustration shows how to remove the bezel assembly from the blade server. Bezel-assembly release Bezel-assembly release Control-panel Control panel cable connector Complete the following steps to remove the blade server bezel assembly:...
Figure 8. Installing a SCSI drive Hard drive release lever SCSI ID 1 SCSI ID 0 Hard drive release lever Note Do not install a SCSI hard disk drive in SCSI connector 1 (SCSI ID 1) if you intend to also install an optional standard expansion card.
5. Locate SCSI connector 1 and slowly pull the blue lever at the back of the hard disk drive tray to disengage the drive from its tray. 6. From the rear edge of the drive, slide the drive out of the SCSI connector. Attention: To maintain proper system cooling, do not operate the system unit without either a blade server, expansion unit, or filler blade installed in each blade bay for more than 1 minute.
Page 40
Figure 9. Installing DIMMS DIMM slot 2 (J111) DIMM slot 1 (J113) DIMM slot 4 (J110) DIMM slot 3 (J112) Before you begin, read the documentation that comes with the DIMMs. Complete the following steps to install a DIMM: 1. Read the safety information beginning on page vii and “Installation guidelines” on page 17. 2.
Installing an additional processor The blade server comes with one or two processors installed on the system board. The blade server supports two processors. With two processors, your blade server can operate as a symmetric multiprocessing (SMP) server. With SMP, certain operating systems and application programs can distribute the processing load between the processors.
Page 42
Heat sink Alignment marks Microprocessor Microprocessor locking lever Heat sink filler Complete the following steps to install an additional processor: 1. Read the safety information beginning on page vii and “Installation guidelines” on page 17. 2. Shut down the operating system, turn off the blade server, and remove the blade server from the SBCE unit (see “Removing the blade server from the SBCE unit”...
Page 43
Lever fully open Lever closed Lever fully open Lever closed d. Center the processor over the processor socket. Align the triangle on the corner of the processor with the triangle on the corner of the socket and carefully press the processor into the socket.
Heat sink Thermal grease a. Remove the plastic protective cover from the bottom of the heat sink. b. Align and place the heat sink on top of the processor in the retention bracket, grease side down. Press firmly on the heat sink. c.
If the SCSI hard disk drive that is installed in SCSI connector 2 is part of a RAID array, delete this SCSI RAID array configuration before removing the hard disk drive. When you delete the RAID array, the array configuration information is removed; no data is deleted. There are two types of I/O expansion cards supported by the blade server: •...
Figure 10. Installing a small form-factor I/O card in the blade server Expansion Card a. Orient the I/O expansion card as shown by number 1 in Figure 10. b. Slide the notch at the narrow end of the card into the raised hook on the tray; then gently pivot the card into the expansion card connectors, as shown by number 2 the illustration.
Page 47
3. Carefully lay the blade server on a flat, non-conductive surface. 4. Open the cover (see “Opening the blade server cover” on page 20 for instructions). 5. If an SCSI drive is in SCSI connector 2, remove the drive and tray (see “Removing a SCSI hard disk drive”...
® Installing the Intel Blade Server SCSI Expansion Module SBESCSI ® The Intel Blade Server SCSI Expansion Module SBESCSI supports up to two hot-swap SCSI hard disk drives and up to two standard form-factor I/O cards or two small form-factor I/O cards. To help ensure proper cooling and system reliability, make sure that: •...
• Take additional care when handling devices during cold weather. Heating reduces indoor humidity and increases static electricity. Installing a SCSI storage expansion unit ® To use SCSI hard disk drives with your blade server, you must install the Intel Blade Server SCSI Expansion Module SBESCSI on the blade server.
Page 50
5. Locate the SCSI expansion connector (J132) on the system board and lift the protective film from the connector. Cover pins 6. Install the SCSI storage expansion unit: a. Touch the non-conductive package that contains the expansion unit to any unpainted metal surface on the SBCE chassis or any unpainted metal surface on another grounded rack component.
Page 51
With the expansion unit installed on your blade server, you can install up to two hot-swap SCSI hard disk drives in the expansion unit. Each SCSI device must have a unique SCSI ID. This ID enables the SCSI controller in the expansion unit to identify the device and ensure that different devices on the same SCSI channel do not attempt to transfer data simultaneously.
Installing a SCSI disk drive After you have installed the SCSI storage expansion unit on the blade server, you can install up to two SCSI disk drives in the expansion unit. If a hot-swap hard disk drive in the expansion unit fails, you can replace it without turning off the blade server.
d. Push the tray handle to the closed (locked) position. e. Check the hard disk drive LEDs to verify that the hard disk drive is operating properly. • If the amber hard disk drive status LED for a drive is lit continuously, that drive is faulty and must be replaced.
Installing an I/O expansion card in the SCSI storage expansion unit You can add optional I/O expansion cards to your expansion unit to give the unit additional connections for communicating on a network. Attention: When you add an I/O expansion card, you must make sure that the I/O modules in I/O module bays 3 and 4 on the SBCE unit both support the I/O expansion card network-interface type.
b. Slide the notch in the narrow end of the card into the raised hook on the tray; then gently pivot the wide end of the card into the I/O expansion card connectors, as shown in the illustration. NOTE For device driver and configuration information to complete the installation of the I/O expansion card, see the documentation for the option.
Page 56
5. Locate the battery on the system board. Figure 13. Battery location Battery 6. To remove the battery, use your finger to press down on one side of the battery; Then slide the battery from the socket. The spring mechanism will push the battery out towards you as you slide it from the socket.
8. Close the blade server cover (see “Closing the blade server cover” on page 43). Statement 21: xx CAUTION: Hazardous energy is present when the blade server is connected to the power source. Always replace the blade cover before installing the blade server. 9.
Installing the blade server bezel assembly The following illustration shows how to install the bezel assembly on the blade server. Bezel-assembly release Bezel-assembly release Control-panel Control panel cable connector Complete the following steps to install the blade server bezel assembly: 1.
Closing the blade server cover Important: The blade server cannot be inserted into the SBCE unit until the cover is installed and closed or an expansion unit is installed. Do not attempt to override this protection. The following illustration shows how to close the blade server cover. Cover pins Complete the following steps to close the blade server cover:...
Installing the blade server in the SBCE unit The following illustration shows how to install the blade server into the SBCE unit. Complete the following steps to install a blade server in the SBCE unit: Statement 21: xx CAUTION: Hazardous energy is present when the blade server is connected to the power source. Always replace the blade cover before installing the blade server.
5. Slide the blade server into the bay until it stops. The spring-loaded doors farther back in the bay that cover the bay opening move out of the way as you insert the blade server. 6. Push the release levers on the front of the blade server closed. 7.
Page 62
Intel® Server Compute Blade SBX82: Hardware Maintenance Manual and Troubleshooting Guide...
4 Field replaceable units This chapter describes the removal of field-replaceable server components. Microprocessor removal This section includes the guidelines to follow and the instructions for removing a microprocessor. Removal Guidelines Read these important guidelines before removing a microprocessor that is not faulty (for example, when replacing the system board assembly).
Page 64
a. Loosen one captive screw fully; then, loosen the other captive screw. Attention: Loosening one screw fully before loosening the other screw will help to break the thermal bond that adheres the heat sink to the microprocessor. b. Gently pull the heat sink off of the microprocessor. 7.
Microprocessor Microprocessor- release lever Figure 16. Removing the microprocessor To install a microprocessor, see “Installing an additional processor” on page 25 and the documentation provided with the microprocessor option for complete installation instructions. Attention: If you are not installing a replacement microprocessor in socket 2, you must reinstall the microprocessor heat sink filler in that socket.
Switches Figure 18 shows the location of the system board switch-block (SW2). Switch block (SW2) Figure 18. System board switch block (SW2) location The following table describes the function of each switch on switch block SW2. Table 5. Switch block (SW2) Switch Default value Switch...
System board LED locations Figure 19 shows the location of the LEDs on the system board. You might need to refer to this illustration when solving problems with the blade server. You have to remove the blade server from the SBCE unit, open the cover, and press the light path diagnostics button (SW4) to light any error LEDs that were turned on during processing.
Page 68
5. Remove any of the installed components listed below from the system board assembly; then, place them on a static-protective surface or install them on the new system board assembly. • I/O expansion cards (reverse the steps in “Installing an I/O expansion card” on page 28) •...
5 Configuring the blade server The following configuration programs come with your blade server: • Configuration/Setup Utility program This is part of the basic input/output system (BIOS) code in your blade server. Use it to change interrupt request (IRQ) settings, set the date and time, and set passwords. See “Using the Configuration/Setup Utility program”...
Page 70
• System Information Select this choice to display information about the blade server. When you make configuration changes through other options in the Configuration/Setup Utility program, some of those changes are reflected in the system information; you cannot change settings directly in the system information.
Page 71
If you enable the boot fail count, the BIOS default settings will be restored after three consecutive failures to find a boot record. You can enable a virus-detection test that checks for changes in the boot record when the blade server starts.
— System Partition Visibility Select this choice to specify whether the System Partition is to be visible or hidden. — Integrated System Management Processor Settings Select this choice to enable or disable the Reboot on System NMI option on the menu. If you enable this option, the blade server will automatically restart 60 seconds after the service processor issues a nonmaskable interrupt (NMI) to the blade server.
Using the PXE boot agent utility program This program is a built-in, menu-driven configuration utility program that you can use to: • Select the boot protocol and other boot options • Select a power-management option NOTE The RPL selection for the boot protocol option is not supported for this server. Complete the following steps to start the PXE boot agent utility program: 1.
The service processor in your blade server provides the following features: • Continuous health monitoring and control • Configurable notification and alerts • Event logs that are timestamped, saved in nonvolatile memory, and can be attached to e-mail alerts • Remote graphics console redirection •...
Blade server Ethernet controller enumeration The enumeration of the Ethernet controllers in a blade server is operating-system dependent. You can verify the Ethernet controller designations a blade server uses through your operating-system settings. The routing of an Ethernet controller to a particular I/O-module bay depends on the type of blade server.
Using the LSI Logic Configuration Utility program You can use the LSI Logic Configuration Utility to: • Set the SCSI device scan order • Set the SCSI ID for the controller Complete the following steps to start the LSI configuration utility program: 1.
6 Diagnostics This section provides basic troubleshooting information to help you solve some common problems that might occur with the blade server. If you cannot locate and correct the problem using the information in this section, see Appendix A, “Getting help and technical assistance,” on page 131 for more information. General checkout The server diagnostic programs are stored in the upgradeable read-only memory (ROM).
YES. Run the diagnostic programs. If you receive an error, see Chapter 8, “Symptom-to-FRU index,” on page 95. If the diagnostic programs completed successfully and you still suspect a problem, see “Undetermined problems” on page 126. Diagnostic tools overview The following tools are available to help you identify and solve hardware-related problems: •...
Viewing error logs from the Configuration/Setup Utility program Start the Configuration/Setup Utility program; then, select Error Logs from the main menu. See “Using the Configuration/Setup Utility program” on page 53 for more information. Diagnostic programs and error messages The server diagnostic programs are stored in ROM on the system board. These programs are the primary method of testing the major components of the server.
1. Turn on the blade server and watch the screen. NOTE When running the diagnostic programs, make sure that the blade server controls the needed components for the tests, including the CD-ROM drive, diskette drive, and USB port. You can use the selection buttons on the blade server to make necessary adjustments. 2.
Diagnostic error message tables For descriptions of the error messages that might appear when you run the diagnostic programs, see “Diagnostic error codes” on page 99 Notes: 1. Depending on the server configuration, some of these error messages might not appear when you run the diagnostic programs.
DIMM 1 error LED DIMM 2 error LED DIMM 3 error LED DIMM 4 error LED Microprocessor 1 error LED Microprocessor 2 error LED Figure 20. System board LED locations For example: A blade server error has occurred and you have noted that the blade server blade-error LED is lit on the blade server control panel.
Recovering the BIOS code The flash memory (BIOS) of the server consists of a primary page and a backup page. If the primary BIOS page is corrupt, the system must boot from the backup page and the primary page must be reflashed.
Page 84
4. Locate switch block SW2 on the system board (see Figure 21). Switch block (SW2) Figure 21. System board: BIOS backup page jumper 5. Move switch 1 (BIOS backup page jumper) to the ON position to enable BIOS recovery mode. 6.
7 BIOS, Diagnostics and Firmware update procedures This section describes the usage of system update procedures for BIOS, diagnostics, and firmware updates. Updating the BIOS ® Use the following procedure to update the BIOS for the Intel Server Compute Blade SBX82: ®...
Updating the Diagnostics ® Use the following procedure to update the diagnostics for the Intel Server Compute Blade SBX82: ® 1. Obtain the latest BIOS flash image from the most current Intel Server Compute Blade SBX82 firmware release package. The firmware package may be found on the web at http://www.intel.com/ibl or http://support.intel.com/support/motherboards/server/blade.htm.
7. At the next user prompt, the system will display "Start to program flash? (Y/N)". Select "Y" to program the BMC and SDR. 8. At this point, the system will flash and initialize the BMC. The SDR will also be automatically updated.
Steps to perform update (GUI) 1. Boot the system into the Microsoft Windows operating system. ® 2. Download the appropriate Windows Update package for the Intel Server Compute blade SBX82. (Contact your Intel customer representative for more information.) 3. The Windows update package contains the utility and the associated BIOS files.
< > Operation mode Perform update (silently). Extract Microsoft* Windows* update to hard drive directory < >. If -x <path> path no < > is specified, defaults to A: path Extract DOS update to floppy < >. If no < >...
BIOS Update from Linux Operating System ® The Intel Server Compute Blade SBX82 Linux Update Package includes the file, the lflash.exe data files for the firmware being flashed and all files needed to create a DOS flash diskette. The following are supported Linux operating systems for the utility: lflash •...
3. The Linux update package contains the utility and the associated BIOS files. lflash ® 4. Open a command shell and run the “Intel Server Compute Blade Windows BIOS Update” with the following command line: <update>.sh -x <path> where is the downloaded update and is the path to which the Linux <update>.sh <path>...
• Examine all SEL entries stored in the non-volatile storage area of the blade server in either text or hexadecimal format. • Examine previously stored SEL entries from a file in either text or hexadecimal format. • Save the SEL entries to a file. •...
Table 7. Command-line mode examples Action Command Save SEL data to a file Selview selfile.sel /save Selview selfile.sel –save Save SEL data to a file in HEX Selview selfile.sel /save /hex mode Selview selfile.sel -save –hex Clear the SEL Selview /clear Selview –clear Graphical User Interface (GUI) The SEL Viewer utility can also be run in an interactive mode through a series of pull-down menus.
Page 94
• Timestamp • Sensor Type and Number • Event Description • Generator ID The display of data in these columns varies depending on the type of SEL record. For detail information regarding the text displayed, refer to the SEL Data section of this document. Figure 23 SEL Viewer Utility Main Window The SEL Viewer can also display event logs in raw hexadecimal format as read from the server (see Figure 24 on page 80).
Page 95
Table 8. Abbreviations used in hexadecimal mode display Action Command Record ID Record Type Time Stamp Generator ID Event Message Format Revision Sensor Type Sensor Number EDIR Event Dir and Event Type Event Data 1 Event Data 2 Event Data 3 Manufacturer ID OEM Defined 1.
Figure 24 SEL records in hexadecimal format Pull-Down Menu – File The File pull-down menu includes options for opening and saving system event records to and from data files, respectively. These options are further described in the following sections. File Menu Item – Open... This option allows you to open an existing SEL data file for viewing.
Figure 25 File Open dialog box The File Open dialog box provides you with the ability to browse drives and directories for existing files. The edit box for entering a file name supports full editing capabilities with the following keys: <Home>, <End>, Left/Right arrows and <Ins>.
SEL Menu Item – Properties... This option allows you the ability to view the SEL properties. The text “Warning: System Event Log is FULL” is displayed if the SEL is full; otherwise, the text is omitted. The “Number of Entries” and “Free Space Remaining”...
View Menu Item – Hide SEL Info Window/View SEL Info Window This option allows you to view/hide the SEL Information Window. By default, the SEL Information window is visible and the sub-menu is shown as “Hide SEL Info Window”. If you select “Hide SEL Info Window”, the SEL Information window is removed from the display area and the sub-menu text changes to “View SEL Info Window”.
Figure 28 Help window Help Menu Item – About This option displays the SEL Viewer utility version, copyright information and IPMI version information. OEM SEL data In addition to logging a wide variety of sensor types and record IDs for possible system events (as defined by the IPMI Specification), the Intel®...
Page 101
Table 9. OEM SEL records with timestamp SEL Viewer column Text displayed Time Stamp Time Stamp Sensor Type & Number OEM data not defined. Event Description OEM system event record (Record Type: 0xC0) Generator ID OEM data not defined. • OEM sensor type and record IDs without timestamps (E0 –...
OEM SEL entry definitions Table 11. POST OEM SEL definitions Sensor type Record ID Byte definitions/description OEM POST 0xC0 Byte 11 POST Error / Event Type with Time Stamp 0x00 POST PCI POST Event/Error 0x01 POST PCI Processor Event 0x02 POST Memory Error 0x03 POST Scalability Event 0x04 POST Bus Event 0x05 POST Chipset Event...
POST OEM SEL formats with timestamp Table 12. POST PCI event/error SEL format Byte Description 0x00 POST PCI Event/Error Error Type 0x00 PCI Event/Error occurred. Next non-time stamped OEM SEL entry will contain details of the specific PCI event/error 13:15 Reserved Revision Number = 0x00 SMI OEM SEL formats with timestamp...
POST OEM SEL formats without timestamp Table 13. POST PCI event/error SEL format Byte Description 0x00 POST PCI Event/Error Error Type 0x00 Device OK 0x01 Required ROM space not available 0x02 Required IO space not available 0x03 Required memory not available 0x04 Required memory below 1MB not available 0x05 ROM checksum failed 0x06 BIST failed...
Table 13. POST PCI event/error SEL format Byte Description Reserved Revision Number = 0x00 POST processor event/error SEL format Table 14. POST processor event/error SEL format Byte Description 0x01 POST Processor Event / Error Error Type 0x00 Processor Failed BIST 0x01 Unable to Apply Microcode (Patch) Update 0x02 POST Does Not Support Current Stepping of Processor Chassis Number (0x00 if not applicable)
SMI processor event/error SEL format Table 16. SMI MCA Data A SEL format Byte Description 0x01 SMI Processor Event / Error 0x00 Data A Reserved Reserved 8 – 9 Bank 10 – 11 APIC ID 12 – 15 Revision Number = 0x00 Table 17.
Page 108
Table 19. SMI MCA Data C SEL format Byte Description 0x01 SMI Processor Event / Error 0x03 Detail C Reserved Reserved 8 – 11 MCA Status Register high 12 – 15 MCA Status Register low Revision Number = 0x00 Table 20. SMI MCA Data D SEL format Byte Description 0x01 SMI Processor Event / Error...
SMI bus event/error SEL format Table 24. SMI Front Side Bus Event SEL format Byte Description 0x04 SMI Bus Event / Error Bus Type 0x00 FSB 0x00 FSB Fatal 0x01 FSBNonFatal 7 – 8 FSB FERR or NERR 9 – 15 Reserved Revision Number = 0x00 SMI chipset event/error SEL format...
8 Symptom-to-FRU index ® This index supports the Intel Server Compute Blade SBX82. Notes: 1. Check the configuration before you replace a FRU. Configuration problems can cause false errors and symptoms. For devices not supported by this index, refer to the manual for that device. The symptom-to-FRU index lists symptoms, errors, and the possible causes.
Page 112
NOTE See “System” on page 130 to determine which components should be replaced by a field service technician. Beep/symptom FRU/action 1-1-2 1. Optional microprocessor (if installed) (Microprocessor register test Microprocessor failed) System board assembly 1-1-3 1. Battery (CMOS write/read test failed) System board assembly 1-1-4 1.
Page 113
NOTE See “System” on page 130 to determine which components should be replaced by a field service technician. Beep/symptom FRU/action 2-2-3 1. Battery (CMOS power failure and System board assembly checksum checks failed) 2-2-4 1. Battery (CMOS configuration information System board assembly validation failed) 2-3-1 •...
Page 114
NOTE See “System” on page 130 to determine which components should be replaced by a field service technician. Beep/symptom FRU/action 3-3-1 1. Verify that both DIMMs in bank are of the same size, speed, (Memory size mismatch type and technology. occurred.) DIMM System board assembly...
No-beep symptoms NOTE See “System” on page 130 to determine which components should be replaced by a field service technician. No-beep symptom FRU/action No beep and the system • System board assembly operates correctly. No beep and no video • See “Undetermined problems”...
Page 116
NOTE See “System” on page 130 to determine which components should be replaced by a field service technician. Error code/symptom FRU/action 001-250-000 • System board assembly (Failed processor board ECC) 001-292-000 (Core system: • Load BIOS defaults and rerun test. failed/CMOS checksum failed) 001-XXX-000 •...
Page 117
NOTE See “System” on page 130 to determine which components should be replaced by a field service technician. Error code/symptom FRU/action 166-406-001 System Management: 1. Remove blade server, wait 30 seconds, reinsert the blade Failed (BMC indicates failure in I2C bus server, and retry.
NOTE See “System” on page 130 to determine which components should be replaced by a field service technician. Error code/symptom FRU/action 202-XXX-001 1. Microprocessor 1 (Failed system cache test) System board assembly 202-XXX-002 1. Microprocessor 2 (Failed system cache test) System board assembly 217-198-XXX •...
Page 119
NOTE See “System” on page 130 to determine which components should be replaced by a field service technician. Error code/symptom FRU/action 1. Run the Configuration/Setup Utility program. (Three consecutive startup failures Battery. using the default configuration.) System board assembly. Microprocessor. 101, 102 •...
Page 120
NOTE See “System” on page 130 to determine which components should be replaced by a field service technician. Error code/symptom FRU/action • Run the Configuration/Setup Utility program, and enter the (An attempt was made to access the administrator password. server with invalid passwords) 1.
Page 121
NOTE See “System” on page 130 to determine which components should be replaced by a field service technician. Error code/symptom FRU/action Keyboard (Keyboard or keyboard controller error) System board assembly • System board assembly (Keyboard controller error) 1. Diskette (Invalid diskette boot record) Diskette drive Cable System board assembly...
Page 122
NOTE See “System” on page 130 to determine which components should be replaced by a field service technician. Error code/symptom FRU/action 1302 1. Cable (I2C cable from system board to Power switch assembly power-on and reset switches not System board assembly found) 1303 1.
Page 123
NOTE See “System” on page 130 to determine which components should be replaced by a field service technician. Error code/symptom FRU/action 2462 1. Verify that the keyboard/mouse/video select button LED (Video memory configuration error) on the front of the blade server is on, indicating that the blade server is connected to the shared monitor.
NOTE See “System” on page 130 to determine which components should be replaced by a field service technician. Error code/symptom FRU/action 01298101 1. Ensure that microprocessor 1 is supported. (System BIOS installed on this server Microprocessor 1. does not support level of processor) 01298102 1.
Page 125
Lit blade-error Cause Action DIMM x error A memory error occurred. 1. Reseat the DIMM indicated by the lit DIMM failure LED. Replace the DIMM. NOTE Multiple DIMM LEDs do not necessarily indicate multiple DIMM failures. If more than one DIMM LED is on, reseat/replace one DIMM at a time until error goes away.
Error symptoms You can use the error symptom table to find solutions to problems that have definite symptoms. If you have just added new software or a new option and the server is not working, do the following before using the error symptom charts: •...
Page 127
NOTE See “System” on page 130 to determine which components should be replaced by a field service technician. CD-ROM drive problems Symptom FRU/action CD-ROM drive is not NOTE recognized after being Because the SBCE unit uses a USB bus to communicate with switched back to blade the media tray devices, switching ownership of the media tray server running Windows...
Page 128
NOTE See “System” on page 130 to determine which components should be replaced by a field service technician. Expansion enclosure problems Symptom FRU/action The SCSI storage expansion 1. Verify that the enclosure is installed correctly. unit used to work but does For more information, see the SCSI storage expansion unit not work now.
Page 129
NOTE See “System” on page 130 to determine which components should be replaced by a field service technician. Intermittent problems Symptom FRU/action A problem occurs only • Verify that: occasionally and is difficult to — When the computer is turned on, air is flowing from the rear of the detect.
Page 130
NOTE See “System” on page 130 to determine which components should be replaced by a field service technician. Keyboard, mouse, or pointing-device problems Symptom FRU/action Mouse function lost during • If, while installing Red Hat Linux 7.3 to a blade server, you or someone Red Hat installation.
Page 131
NOTE See “System” on page 130 to determine which components should be replaced by a field service technician. Microprocessor problems Symptom FRU/action The blade server emits a 1. Verify that the startup microprocessor is seated properly. continuous tone during Startup microprocessor. POST.
Page 132
NOTE See “System” on page 130 to determine which components should be replaced by a field service technician. Monitor problems Symptom FRU/action The monitor goes blank • Verify that the monitor cable is connected to the video port on the SBCE when you direct it to a management module.
Page 133
NOTE See “System” on page 130 to determine which components should be replaced by a field service technician. Option problems Symptom FRU/action An option that was just 1. Verify that: installed does not work. • The option is designed for the computer. •...
Page 134
NOTE See “System” on page 130 to determine which components should be replaced by a field service technician. Power problems Symptom FRU/action The blade server does not 1. Verify that: turn on. The power LED on the front of the SBCE unit is on. The LEDs on all the SBCE power modules are on.
Page 135
NOTE See “System” on page 130 to determine which components should be replaced by a field service technician. Power problems Symptom FRU/action The blade server turns off for 1. Verify that all blade bays have a blade server, expansion unit, no apparent reason or filler blade properly installed.
Page 136
NOTE See “System” on page 130 to determine which components should be replaced by a field service technician. Software problem Symptom FRU/action Suspected software 1. To determine if problems are caused by the software, verify that: problem. • The computer has the minimum memory needed to use the software. For memory requirements, see the information that comes with the software.
NOTE See “System” on page 130 to determine which components should be replaced by a field service technician. Network connection problems Symptom FRU/action One or more blade servers Verify that: are unable to communicate • The switch modules for the network interface being used are installed in with the network.
error code numbers. You can view the BMC system event log from the Configuration/Setup Utility menu by selecting Advanced Setup > Baseboard Management Controller (BMC) Settings > BMC System Event Log. You can view additional information and error codes in plain text by viewing the SBCE Management Module event log.
Power error messages NOTE See “System” on page 130 to determine which components should be replaced by a field service technician. Message Action BSE +12V over recommended voltage ® 1. Check SBCE power (see the Intel Blade Server Chassis SBCE: Hardware Maintenance Manual and Troubleshooting ®...
Page 140
NOTE See “System” on page 130 to determine which components should be replaced by a field service technician. Message Action VRM Power Good fault 1. Check SBCE power (see the ® Intel Blade Server Chassis SBCE: Hardware Maintenance Manual and Troubleshooting ®...
NOTE See “System” on page 130 to determine which components should be replaced by a field service technician. Message Action System under recommended 5V fault. 1. Reseat blade server. Replace blade server. System shutdown Refer to the following tables when experiencing system shutdown related to voltage or temperature problems.
Temperature-related system shutdown NOTE See “System” on page 130 to determine which components should be replaced by a field service technician. Message Action System shutoff due to CPU x over 1. Ensure that the system is being properly cooled; see temperature “System reliability considerations”...
• If all of the blade servers exhibit the same symptom, it is probably a SBCE unit problem; for more ® information, s ee the Intel Blade Server Chassis SBCE: Hardware Maintenance Manual and ® Troubleshooting Guide on the Intel Blade Server Chassis SBCE Resource CDD.
Page 144
— What, when, where, single, or multiple systems? — Is the failure repeatable? — Has this configuration ever worked? — If it has been working, what changes were made prior to it failing? — Is this the original reported failure? •...
® 9 Parts listing, Intel Server Compute Blade SBX82 This parts listing supports the Intel® Server Compute Blade SBX82. NOTE The illustrations in this document might differ slightly from your hardware. Figure 29. Blade server – exploded view...
System NOTES • Field replaceable units (FRUs) must be serviced only by qualified field service technicians. • Customer replaceable units (CRUs) can be replaced by the customer. Index SBX82 Customer and Field Replaceable Units Memory, 1 GB PC3200 ECC DDR Memory, 2 GB PC3200 ECC DDR Memory, 256 MB PC3200 ECC DDR Memory, 512 MB PC3200 ECC DDR...
A Getting help and technical assistance ® If you need help, technical assistance, or just want more information about Intel products, you will find a wide variety of sources available from Intel to assist you. This appendix contains information about where to go for additional information about Intel and Intel products, and what to do if you experience a problem with your blade server system.
Page 148
Intel® Server Compute Blade SBX82: Hardware Maintenance Manual and Troubleshooting Guide...