Page 4
IBM may make improvements and/or changes in the product(s) and/or the program(s) described in this publication at any time. This publication was developed for products and services offered in the United States of America. IBM may not offer the products, services, or features discussed in this document in other countries, and the information is subject to change without notice.
This manual contains diagnostic information, a Symptom-to-FRU index, service information, error codes, error messages, and configuration information for the Netfinity 7600 – Type 8665. Important: This manual is intended for trained servicers who are familiar with IBM PC Server products. Important safety information Be sure to read all caution and danger statements in this book before performing any of the instructions.
File download address is: http://www.us.pc.ibm.com/files.html IBM online addresses The HMM manuals online address is: http://www.us.pc.ibm.com/cdt/hmm.html The IBM PC Company Support Page is: http://www.us.pc.ibm.com/support/index.html The IBM PC Company Home Page is: http://www.pc.ibm.com Hardware Maintenance Manual: Netfinity 7600 – Type 8665 Models 1RY, 2RY...
Power-on password ......48 IBM online addresses ......iv Administrator password .
Page 8
FRU information (service only) ..127 Managing your IBM Netfinity server with Netfinity Diagnostic switch card ......127 Manager .
Page 9
Introduction to IBM ServeRAID cluster solution ..175 Symptom-to-FRU index....215 Monitoring and updating an IBM ServeRAID cluster Beep symptoms ....... . 215 solution .
Page 10
viii Hardware Maintenance Manual: Netfinity 7600 – Type 8665 Models 1RY, 2RY...
Page 12
YES. Schedule maintenance with the customer. Shut down all systems related to the cluster. Run storage test. NO. Go to step 2. THE SYSTEM IS NOT PART OF A CLUSTER. • Power-off the computer and all external devices. • Check all cables and power cords. •...
The IBM Netfinity 7600 server comes with a three-year limited warranty and 90-Day IBM Start Up Support. If you have access to the World Wide Web, you can obtain up- to-date information about the server model and other IBM server products at the following World Wide Web address: http://www.ibm.com/pc/us/netfinity/...
Page 14
— Keyboard port — Mouse port — Video port • RAID technology — IBM ServeRAID controller (three channels) • Acoustical noise emissions: — Sound power, idling: 6.3 bel maximum — Sound power, operating: 6.3 bel maximum — Sound pressure, operating: 48 dBa maximum •...
Integrated disk-array subsystem Although many operating systems provide software fault tolerance through mirroring, IBM provides hardware fault tolerance through the redundant array of independent disks (RAID) controller. The IBM ServeRAID™ controller is a standard feature. It provides three channels and supports RAID levels 0, 1, 1E, 5, and 5E.
NIC. This switching occurs without data loss and without user intervention. • IBM ServerGuide CDs The ServerGuide CDs included with the Netfinity server provide programs to help you set up the server and install the network operating system (NOS). The ServerGuide program detects the hardware options that are installed, and provides the correct configuration program and device drivers.
Controls and indicators The following illustration shows the controls and indicators on the server. Hard-disk drive status light: Each of the hot-swap drives has a status light. When this amber light is on continuously, the drive has failed. When the light flashes slowly (one flash per second), the drive is being rebuilt.
Page 18
The power control button on the device and/or the power switch on the power supply do not turn off the electrical current supplied to the device. The device also might have more than one power cord. To remove all electrical current from the device, ensure that all power cords are disconnected from the power source.
Information LED panel: The lights on this panel give status information for the server. See “Information LED panel”. Information LED panel The following illustration shows the status lights on the Information LED panel. System power light: When this green light is on, power is present in the server.
Page 20
Ethernet transmit/receive activity light: When this green light is on, there is activity between the server from the network. Ethernet-link status light: When this green light is on, there is an active connection on the Ethernet port. Ethernet speed 100 Mbps: When this green light is on, the Ethernet speed is 100 Mbps.
POST When you turn on the server, it performs a series of tests to check the operation of server components and some of the options installed in the server. This series of tests is called the power-on self-test or POST. If POST finishes without detecting any problems, a single beep sounds, the first screen of the operating system or application program appears, and the System POST Complete (OK) light is illuminated on the operator information panel.
1.When the ServeRAID controller requires your input, a list of function keys will appear below the message. 2.Where the Action information tells you to start the IBM ServeRAID configuration program, insert the IBM ServeRAID Support CD into the CD-ROM drive; then, restart the server.
Page 24
Auto rearrange. Explanation: Auto rearrange is enabled or disabled. Action: This is an information message. No action is required. Battery-Backup Write Cache Not Responding Explanation: BIOS code detected a bad or failed battery-backup write cache. Action: Press F9 to remove the battery-backup write cache from the configuration, or press F10 to exit without change. Battery-Backup Write Cache Replacement Explanation: The ServeRAID controller detects that the battery-backup write cache is defective.
Page 25
Configured drives are missing. Explanation: When the ServeRAID controller detects that a previously configured drive is missing, the following message appears: x online drives not responding where x is the number of drives not responding. Example of a possible message: Online Drive on Channel 3 SCSI ID 3 is not responding.
Page 26
Controller is not responding to commands. No logical drives are installed. Explanation: The ServeRAID controller is not operational. Action: Run the IBM ServeRAID Support CD and reseat the adapter. If the problem persists, replace the adapter. Error: Cannot disable this controller BIOS.
Page 27
Installation stopped. Explanation: The server cannot access the ServeRAID controller. Action: This is a follow-on message to a preceding message. Follow the Action instructions for the preceding message to resolve the problem. New controller installed in a configured server or drives are imported. Explanation: When the ServeRAID controller detects that the identifiers of the drives do not match the controller configuration information, the following message appears: x online drive(s) found with mismatch configuration Examples of possible messages: Configuration mismatch Channel 1 SCSI ID 0 with Host ID Configuration mismatch...
Recoverable configuration error. Explanation: The configuration data stored in NVRAM does not match the configuration data stored in the EEPROM. Action: Press Ctrl+I to access the ServeRAID Mini-Configuration menu. Select Advanced Functions from the Main Menu. Select Copy the Configuration from Drives to the Controller. Unrecoverable configuration error.
Page 29
Action: Replace the adapter or the I/O board with integrated controller. Explanation: Invalid flash configuration. Action: Start the IBM ServeRAID Support CD and follow the instructions that appear on the screen. Explanation: Invalid NVRAM configuration. Action: Start the IBM ServeRAID Support CD and follow the instructions that appear on the screen.
Page 30
Explanation and possible recovery action Explanation: No configuration was found in drives, or online/rebuild and hot- spare/standby hot-spare drives are not responding and unidentified drives were found. Action: Press F4, F5, F7, or F10. Explanation: No configuration was found in drives, or online/rebuild and ready/standby drives are not responding and unidentified drives were found.
Page 31
Explanation and possible recovery action Explanation: Online/rebuild drives are not responding. Action: Press F4, F5, or F10. Explanation: Online/rebuild drives are not responding, and a drive was found at the incorrect SCSI ID. Action: Press F4, F5, F6, or F10. Explanation: Online/rebuild and hot-spare/standby hot-spare drives are not responding.
Explanation and possible recovery action Explanation: Online/rebuild and ready/standby drives are not responding, a drive was found at the incorrect SCSI ID, and unidentified drives were found. Action: Press F4, F5, F6, or F10. Explanation: Online/rebuild, ready/standby, and hot-spare/standby hot-spare drives are not responding, and unidentified drives were found.
On IBM servers, when a hot-spare drive is available, the rebuild operation will begin automatically without the requirement to replace the failed drive.
Use ipssend format to restart the format of the physical drive. The syntax is: ipssend format controller channel sid where: • controller is the ServeRAID controller number (1–12) • channel is the channel number for the device (1, 2, or 3) •...
where: is the three-digit function code that indicates the function being tested when the error occurred. For example, function code 089 is for the microprocessor. is the three-digit failure code that indicates the exact test failure that was encountered. is the three-digit device ID. date is the date that the diagnostic test was run and the error recorded.
Note: To run the diagnostic programs, you must start the server with the highest level password that is set. That is, if an administrator password is set, you must enter the administrator password, not the power-on password, to run the diagnostic programs. When the message appears, press F2.
Diagnostic error message tables For descriptions of the error messages that might appear when you run the diagnostic programs, see “Diagnostic error codes” on page 220. If diagnostic error messages appear that are not listed in those tables, make sure that the server has the latest levels of BIOS, Advanced System Management Processor, ServeRAID, and diagnostics microcode installed.
AC power LED DC power LED Description and action There is a dc power problem. Possible causes: The server is not turned on (the power LED is blinking on the front of the server). Action: Press the power-control button to start the server. The power supply has failed.
PCIA PCIB PCIC DASD1 TEMP OVER SPEC Light path diagnostics You can use the light path diagnostics built into the server to quickly identify the type of system error that occurred. Your server is designed so that LEDs remain illuminated when the server shuts down, as long as the power supplies are operating properly.
Page 40
Table 1. Light path diagnostics. Cause Action None The system error log is 75% or more full; a PFA alert was logged; or a failure occurred Check the system error log and correct any problems. See “Choices available from the Configuration/Setup on the I2C bus.
Table 1. Light path diagnostics. Cause Action The fourth power supply has failed. Replace the fourth power supply. Power checkout Power problems can be difficult to troubleshoot. For instance, a short circuit can exist anywhere on any of the power distribution busses. Usually a short circuit will cause the power subsystem to shut down because of an overcurrent condition.
12. Restart the server. The system should start up normally. Replacing the battery IBM has designed this product with your safety in mind. The lithium battery must be handled correctly to avoid possible danger. If you replace the battery, you must adhere to the following instructions.
Page 43
Repair or disassemble Dispose of the battery as required by local ordinances or regulations. Note: In the U. S., call 1-800-IBM-4333 for information about battery disposal. If you replace the original lithium battery with a heavy-metal battery or a battery with heavy-metal components, be aware of the following environmental consideration.
Tilt the battery so that you can insert it into the front of the socket, under the battery clip. b. As you slide it under the battery clip, press the battery down into the socket. Reinstall any adapters that you removed. 10.
If you directly connect two workstations (without a hub), or if you are not using a hub with X ports, use a crossover cable. Note: To determine whether a hub has an X port, check the port label. If the label contains an X, the hub has an X port.
Table 2. Ethernet troubleshooting chart. Ethernet controller problem Suggested Action Ethernet Link Status light Check the following: does not light. • Make sure that the hub is turned on. • Check all connections at the Ethernet controller and the hub. •...
Explanation: You tried to load another instance of the driver with a different I/O address. This new adapter could not be found. Action: Verify that you installed an IBM Netfinity 10/100 Fault Tolerant Adapter and make sure that the adapter is seated correctly. If the adapter is seated correctly, go to “Starting the diagnostic programs”...
Page 48
Table 3. Novell NetWare or IntraNetWare ODI driver messages for the Ethernet controller. PCNTNW-NW-074 The hardware interrupt cannot be set. Explanation: An attempt was made to initialize a given hardware interrupt. The attempt was not successful. Action: Verify that the Ethernet controller is enabled. If the Ethernet controller is enabled, go to “Starting the diagnostic programs”...
Table 3. Novell NetWare or IntraNetWare ODI driver messages for the Ethernet controller. PCNTNW-NW-165 The device was not found at IOADDRESS. Explanation: The Ethernet controller cannot be found at the I/O address specified.Action:The Ethernet controller does not require a parameter for the I/O address. Remove the I/O address parameter.
Page 50
Explanation: The SunSoft Solaris driver cannot find any more Ethernet controllers. Action: Verify that additional IBM Netfinity 10/100 Fault Tolerant Adapters are present or replace the Ethernet adapter that fails to respond. If the problem persists, run the diagnostic programs.
Page 51
Run the NETCONFIG program to search for another Ethernet controller. pnt0-26 The DMA number is not necessary for PCI device. Explanation: The IBM Netfinity 10/100 Fault Tolerant Adapter does not require a DMA setting. Action: Edit the SPACE.C file to delete the DMA parameter. pnt0-29 The IRQ number is already in use.
ServerGuide CDs The ServerGuide CDs include software setup and installation tools specifically designed for IBM Netfinity servers. You can use these CDs during the initial installation of the server to configure the server hardware and simplify the network operating system installation. The ServerGuide CDs also contain a collection of application programs, which you can install after the server is up and running.
Page 55
Select this choice to view vital product data (VPD) for some server components. • Devices and I/O Ports Select this choice to view or change the assignments for devices and input/output ports. This choice appears only on the full Configuration/Setup Utility main menu.
Page 56
— Core Chipset Control Select this choice to modify settings that control features of the core chip set. Attention: Changes here must only be made by an IBM authorized service representative. — Cache Control Select this choice to enable or disable the microprocessor cache. In addition, you can define the microprocessor cache type as write-back (WB) or write- through (WT).
Page 57
Attention: You must use the menu selections to save custom settings for the PCI Slot/Device Information choice. The save, restore and load default settings choices on the main menu of the Configuration/Setup Utility do not save the PCI Slot/Device Information settings. After making changes, select: –...
Select this choice to delete the changes and restore the previous settings. • Load Default Settings Select this choice to cancel the changes and restore the factory settings. • Exit Setup If you have made any changes, the program will prompt you to save the changes or exit without saving the changes.
Table 6. Power-on and administrator password features. Type of password Results Power-on password • Enter the password to complete the system startup. • All choices are available on the Configuration/Setup Utility main menu. Administrator password • No password is required to start the system. •...
Choices available from the SCSISelect menu The following choices appear on the SCSISelect Utility menu: • Configure/View Host Adapter Settings Select this choice to view or change the SCSI controller settings. To reset the SCSI controller to its default values, press F6; then, follow the instructions that appear on the screen.
Page 61
• Format Disk Attention: The Low-Level Format program erases all data and programs. Select this choice to perform a low-level format on a hard disk drive. Depending on the hard disk drive capacity, the Low-Level Format program could take up to two hours.
Page 62
Note: Hard disks normally contain more tracks than their stated capacity (to allow for defective tracks). A message appears on the screen if the defect limit is reached. If this happens, replace the hard disk drive. Hardware Maintenance Manual: Netfinity 7600 – Type 8665 Models 1RY, 2RY...
Memory board Memory module Microprocessor terminator card Voltage regulator module I/O board Shuttle Component locations This section provides illustrations of the I/O board, processor board, and memory board component locations. I/O board component locations The following illustration shows the location of the input/output (I/O) connectors on the I/O board.
Power backplane connector Processor board connector Ethernet port connector (J14) USB 1 and USB 2 port connectors (J31) Video port connector (J2) Mouse connector (J30) Keyboard connector (J32) SCSI external connector (J21) Parallel port connector (J33) ASM Interconnect connector (J10) Serial port B connector (J15) Serial port A connector (J16) Management port connector (J18)
Microprocessor 1 error LED (CR9) Microprocessor 2 error LED (CR3) Microprocessor 3 error LED (CR8) Microprocessor 4 error LED (CR4) Battery VRM 4 error LED (CR10) VRM 3 error LED (CR5) VRM 2 error LED (CR7) VRM 1 error LED (CR6) Processor board connectors The following illustration shows the connectors.
Microprocessor core-frequency-selection jumper block (J13-J16) Reserved (J10) Reserved (J12) Reserved (J11) Memory board component locations The illustrations in this section show certain locations on the memory board. Memory board connectors The following illustration shows the location of the DIMM connectors on the memory board.
Connector 14 (J14) Connector 15 (J15) Connector 16 (J16) Memory board LED locations The following illustration shows the location of the error LEDs on the memory board. DIMM 1 error LED (CR4) DIMM 2 error LED (CR5) DIMM 3 error LED (CR2) DIMM 4 error LED (CR1) DIMM 9 error LED (CR13) DIMM 10 error LED (CR12)
• For a list of supported options for the Netfinity 7600, refer to http://www.ibm.com/pc/us/compat on the World Wide Web. Working inside the server with the power on The server is designed with features that make it highly available. These features are hot-add and hot-plug devices.
Removing the top cover Top cover Captive screws To remove the server top cover: Loosen the two thumbscrews on the back edge of the top cover. Slide the top cover slightly toward the rear of the server. Lift the cover off the server and set the cover aside.
To remove the media-bay bezel: Release the two tabs at the bottom edge of the media-bay bezel and pull the bottom of the bezel slightly away from the server. Pull the media-bay bezel down to release the two tabs at the top edge of the bezel. Store the bezel in a safe place.
Non-hot-plug PCI slots 1-2 (on PCI bus A) (3.3 volt) Hot-plug PCI slots 3-6 (on PCI bus B) Power LEDs for hot-plug slots Internal Attention LEDs for hot-plug slots External Attention LEDs for hot-plug slots Adapter considerations Before you continue with the adapter-installation procedure: •...
Page 75
• Review the information in “Before you begin” on page 60. • Read the documentation that comes with the option. Adapter retention latch Adapter retention latch Adapter To install a hot-plug PCI adapter: Attention: Do not remove a hot-plug adapter before performing the operating- system-defined procedure for disabling the hot-plug PCI slot that contains the adapter.
Attention: You must install expansion-slot covers on all empty slots. This maintains the electromagnetic emissions characteristics of the system and ensures proper cooling of system components. Refer to the documentation that comes with the adapter for any cabling instructions. It might be easier for you to route any cables before you install the adapter.
3.5 inch diskette drive CD-ROM drive Media bays 1 and 2 Hot-swap hard disk Your server contains hardware that enables you to replace a failed hard disk drive without turning off the server. Therefore, you have the advantage of continuing to operate the system while a hard disk drive is removed or installed.
• If you are installing a device with a 50-pin connector in one of the left front bays, you will need a 68-to-50-pin converter (option number 32G3925). • Your server supports ten 1-inch (26 mm) slim-high or seven 1.6-inch (42 mm) half- high, 3.5-inch hot-swap hard disk drives in the hot-swap bays.
Page 79
Slim filler Filler panel Drive tray handle (in open position) Note: You do not have to turn off the server to install hot-swap drives in these bays. To install a drive in a hot-swap bay: Review the information in “Before you begin” on page 60. Remove the filler panel from one of the empty hot-swap bays by inserting your finger into the depression at the left side of the filler panel and pulling it away from the server.
Installing a 5.25-inch removable-media drive Refer to the following illustrations when installing a 5.25-inch removable media drive. Before you begin: • Read “Safety information” on page 249. • Review the information in “Before you begin” on page 60. • Read the documentation that comes with the option. Slide rails for use with a tape drive Filler panel with...
If you are installing a SCSI device, go to step 10.. If you are installing an IDE device, go to step 11.. 10. Install the two-drop SCSI cable: Locate the two-drop SCSI cable. The two-drop SCSI cable is located in the bottom of the server between the open bays and the power supply area.
Page 82
• If you install 16 GB of memory, the Configuration/Setup Utility will display the memory that is usable by the network operating system. This amount of memory might differ from the amount of memory you have installed. • Installing or removing DIMMs changes the configuration information in the server.
To install a memory module: Before you begin: • Read “Safety information” on page 249. • Read the documentation that comes with your option. Review “Before you begin” on page 60. Turn off the server and all attached devices. Disconnect external cables and power cords and remove the top cover.
Page 84
Microprocessor internal and external clock frequencies must be identical. To order additional microprocessor options, contact your IBM reseller or IBM marketing representative. If you replace the microprocessor in the server with a microprocessor that operates at a different speed, be sure to set the microprocessor core-frequency- selection jumpers correctly.
Touch the static-protective package containing the new microprocessor to any unpainted metal surface on the server; then, remove the microprocessor from the package. b. Hold the microprocessor by the open latches, and center the microprocessor over the microprocessor connector. Attention: Make sure the microprocessor is oriented and aligned correctly before you try to close the latches.
Page 87
Before you begin: • Read “Safety information” on page 249. • Review the information in “Before you begin” on page 60. • Read the documentation that comes with the option. Refer to the following illustration to install a hot-swap power supply. Handle Filler panel DC power light...
If you install a strain-relief bracket option over the end of the power cord that is connected to the device, you must connect the other end of the power cord to an easily accessible power source. Replacing a hot-swap fan Before you begin: •...
Pull out on the orange pop rivet on the rear fan bracket; then, lift the fan assembly up and out of the hinge cutouts. b. Slide the replacement fan assembly into the server. (Make sure the fan assembly fits correctly into the hinge cutouts on the rear of the shuttle.) When you have the fan assembly properly seated, push on the pop rivet until it clicks into place.
Media-bay bezel To install the media-bay bezel: Insert the two tabs on the top of the media-bay bezel into the matching holes on the server chassis. Push the bottom of the media-bay bezel toward the server until the two tabs at the bottom of the bezel snap into place.
If you disconnected any cables from the back of the server, reconnect the cables; then, plug the power cords into properly grounded electrical outlets. Note: If necessary, see “Cabling the server” on page 93 for connector locations. Reconfiguring the server When you start the server for the first time after you add or remove an internal option or an external SCSI device, you might see a message telling you that the configuration has changed.
Advanced Systems Management Interconnect knockout Parallel Serial A System management Ethernet Serial B ASM Interconnect USB1 port A USB2 ASM Interconnect port B Video Keyboard Ultra2 SCSI Mouse Parallel port The server has one parallel port. This port supports three standard Institute of Electrical and Electronics Engineers (IEEE) 1284 modes of operation: Standard Parallel Port (SPP), Enhanced Parallel Port (EPP), and Extended Capability Port (ECP).
Parallel port connector The following table shows the pin-number assignments for the 25-pin, female D-shell parallel-port connector on the rear of the server. Table 10. Parallel-port connector pin-number assignments. SPP/ECP Signal EPP Signal -STROBE -WRITE Data 0 Data 0 Data 1 Data 1 Data 2 Data 2...
Note: If you install a PCI video adapter, the server BIOS will automatically disable the integrated video controller. The following table shows the pin-number assignments for the 15-pin analog video connector on the rear of the server. Table 11. Video-port connector pin-number assignments. Signal Green or monochrome Blue...
Not connected Ultra2 SCSI ports The server comes with an IBM ServeRAID controller. This controller provides three independent SCSI channels. A 16-bit (wide) LVD SCSI cable connects the hard disk drive backplane to one channel of the RAID controller. Two 68-pin SCSI connectors for two additional channels are on the rear of the server.
SCSI cabling requirements If you plan to attach external SCSI devices, you must order additional SCSI cables. To select and order the correct cables for use with external devices, contact your IBM reseller or IBM marketing representative. For information about the maximum length of SCSI cable between the terminated ends of the cable, refer to the ANSI SCSI standards.
Table 14. 68-pin SCSI connector pin-number assignments. Signal Signal DIFFSENS Ground Term power Term power Term power Term power Reserved Reserved Ground Ground +Attention -Attention Ground Ground +Busy -Busy +Acknowledge -Acknowledge +Reset -Reset +Message -Message +Select -Select +Control/Data -Control/Data +Request -Request +Input/Output -Input/Output...
Select the serial port; then, use the arrow keys to advance through the settings available. Select Save Settings; then, select Exit Setup to exit from the Configuration/Setup Utility main menu. Serial-port connectors The following table shows the pin-number assignments for the 9-pin, male D-shell serial-port connectors on the rear of the server.
ServerGuide CDs. Failover for redundant Ethernet The IBM Netfinity 10/100 Fault Tolerant Adapter is an optional redundant network interface card (NIC adapter) that you can install in the server. If you install this NIC adapter and connect it to the same logical LAN segment as the primary Ethernet controller, you can configure the server to support a failover function.
Page 100
Enter the brand type of Server, click on Downloadable files and look for Hot Plug. The IBM Netfinity Hot Plug PCI for Windows NT Server 4.0 package uses the Intel Desktop Management Interface (DMI) to control hot-plug support for the Ethernet controllers.
Page 101
Two options are available for recovering from a failover condition. The options are determined by the Enable for DMI / Hot Swap Support checkbox. If the IBM Netfinity Hot Plug PCI for Windows NT Server 4.0 package is installed, this checkbox will appear at the bottom of the Adapter Properties panel.
Enable for DMI / Hot Swap Support checkbox is checked, a DMI alert will also be generated. IntraNetWare: The IBM PCI Hot Plug System Bus Driver Diskette for Novell Netware V4.41, V4.42, and V5.0 is available for download from the IBM Web site at: http://www.pc.ibm.com/support The failover function is now enabled. If a failover occurs: •...
You can attach a dedicated modem to the D-shell system-management connector on the rear of the server to communicate with the integrated Netfinity Advanced System Management Processor. The ASM Interconnect function uses the RJ-45 system-management connectors. This function enables you to connect the Advanced System Management Processors of several rack-mounted servers so that they can communicate with each other in half- duplex mode.
• A LAN adapter card and one or more of the following communications protocols: — IBM TCP/IP for OS/2 version 1.2 or later (required for Netfinity Manager with Web Enhancement) — NetBIOS Note: The Netfinity Manager NetBIOS requirements are three names, two sessions, and nine network control blocks (NCBs).
communicate with Netfinity Manager systems using any of the other supported SNA stacks. • A 9600 baud or greater modem (optional). The hard disk requirement for Client Services for Netfinity Manager is 6.5 MB–9 MB of hard disk space. All other system requirements are the same as for Netfinity Manager for OS/2.
— NetBIOS Note: The Netfinity Manager NetBIOS requirements are three names, two sessions, and nine network control blocks (NCBs). — IPX — SNA (Microsoft SNA Server version 2.11 with Service Pack 1 and WCPIC32.DLL dated 01/22/97 or later. This DLL is available from Microsoft) Note: Systems using Netfinity Manager with Microsoft SNA Server cannot communicate with systems running the Microsoft SNA Server client.
Page 109
CD-ROM drive in the Command Line field and then click OK. The ServerGuide Netfinity Applications windows opens. d. Select IBM Netfinity Manager from the Available Applications selection list. Note: To install Client Services for Netfinity Manager, select Client Services for Netfinity Manager from the Available Applications list and then refer to the Client Services for Netfinity Manager User’s...
Page 110
Click Advanced System Management Support to install the Advanced System Management service on this system. Use this service to monitor and manage IBM Advanced System Management processors and adapters. Note: If you install Advanced System Management support an enhanced version of the Serial Connection Control service, named Dynamic Connection Manager, will be installed instead of Serial Connection Control.
Page 111
Click Update Connector Manager (NT 4.0, TCP/IP, and Web Browser Required) if Netfinity Manager is being installed on a system that will be used to manage system updates on client systems using updates that are available from the IBM selection server. • World Wide Web Enhancement (TCP/IP Required)
Page 112
Note: The graphical user interface (GUI) program files for DMI Browser, ECC Memory Setup, System Partition Access, RAID Manager, and Predictive Failure Analysis will be installed regardless of whether the system has a DMI Service Layer, ECC Memory, a System Partition, a RAID adapter, or a PFA-enabled disk drive.
Page 113
appear beneath the Driver Enabled check box if the IPX or TCP/IP Network Driver is selected. 3)When enabling the NetBIOS Network Driver, a network address will be selected and displayed in the Network Address field. To change this default name, enter any 1–8 character address. However, this address must be unique to the system.
Page 114
8.24.67.32) to communicate with remote systems and will not attempt to resolve the numeric address into an alphanumeric, Domain Name Server (DNS) address (for example, your.system.ibm.com). Select this option if you are using Netfinity Manager is a WINS networking environment or if you are using TCP/IP in an environment that does not have a Domain Name Server.
Netfinity Manager database support Netfinity Manager supports the collection and export of a vast amount of system- specific data to a Netfinity Manager database. Netfinity Manager databases can be created on database management systems using DB2®, Lotus Notes®, or Microsoft SQL. Microsoft SQL is supported through open database connectivity (ODBC), and is available on systems running Netfinity Manager for OS/2, Windows 95, or Windows NT.
databasename where • databasename is the name of the database • d is the drive where the database is created Press Enter. Next, you must configure the tables for ODBC database support so the tables can be configured. For information on configuring the database for ODBC, refer to “ODBC database configuration”...
b. In the DB2 command window, type the following command: DB2 BIND x:\directory\DB2SQLC0.bnd where x:\directory is the name of the drive and directory where Netfinity Manager is installed. Press Enter. Granting and revoking database privileges The following information describes how to grant and revoke database privileges. Note: If your primary system is OS/2, refer to “Database privileges on OS/2 systems”.
Page 118
and then press Enter. Note: Specify PUBLIC or authorize only specific manager systems. To limit access to the database, substitute the authorization name of the managing system for PUBLIC. Use the CONNECT RESET command to disconnect current users from the database.
Cataloging the database in the managing system: If the manager is using a remote database, catalog the node the database resides on and then catalog the database to that node. Once this step has been completed, the name of the Netfinity Manager database will appear in the System Information Tools Database Selection window.
• Lotus Notes client For information on what versions of Lotus Notes and the Lotus Notes client are supported, see Table 18 on page 114. See the Notes Administrator’s Guide for general system requirements and supported communications protocols. Installing the database To enable Netfinity Manager to export system data to a Lotus Notes database, install the Netfinity Manager database template on the Lotus Notes server.
• Windows NT — ODBC — ODBC support for one or more of the following database management systems (DBMS): – IBM DB2 – Microsoft SQL Server • Windows 95, with: — ODBC — ODBC support for one or more of the following database management systems: –...
— Microsoft SQL Server (using the Visigenic driver provided with Microsoft SQL Server) — IBM DB2 For detailed information about supported ODBC platforms, see Table 18 on page 114 and Table 19 on page 115. Note: The ODBC database name must not be longer than 8 characters.
Page 123
Select a DBMS. Click in the Select DBMS listing the name of the DBMS which Netfinity Manager will be using to export data. To create the Netfinity Manager table groups within the database, click Create and then click OK to open the Database Server window. The following table actions are also available: •...
Windows 95 Windows NT 3.51 Windows NT 4.0 IBM DB2 Version 2.1.2 IBM DB2 Universal Database ODBC - IBM DB2 Version 2.1.2 Yes, DB2 Driver Yes, DB2 Driver Yes, DB2 Driver ODBC - IBM DB2 Universal Yes, DB2 Driver Yes, DB2 Driver Yes, DB2 Driver Database 5.0...
Notes: a.In your Netfinity Manager folder or program group is a document titled Read Me First, which contains information about Netfinity Manager that might not be covered in your documentation. b.The Netfinity Manager folder also contains the Network Driver Configuration object, which you can use to reconfigure your network protocols and system keywords, and the Netfinity Manager Database Tables object, which contains a handy online reference for all of the data tables in the Netfinity Manager...
To start any Netfinity Manager service that appears in the Service Manager window, double-click the icon for that service. To start a Netfinity Manager service on a remote system, use the Remote System Manager service. Netfinity Manager service descriptions Each Netfinity Manager service consists of a base program and a graphical user interface (GUI).
Advanced System Management The Advanced System Management service provides extensive functionality available for your IBM PC Server Advanced System Management Adapter, Netfinity Advanced System Management PCI Adapter, or Netfinity Advanced System Management Processor. With the Advanced System Management service, you can configure system management events (such as POST, loader, and operating system timeouts or critical temperature, voltage, and tamper alerts).
size, or when it is deleted or created. Critical File Monitor can also be used to monitor any other files that reside on a Netfinity Manager system. DMI Browser DMI Browser enables you to examine information about the DMI-compliant hardware and software products installed in or attached to the system. ECC Memory Setup The ECC Memory Setup allows for monitoring of ECC memory single-bit errors, and can automatically "scrub,"...
other RAID system tasks. This service is available for both stand alone and network use by any system that has a supported RAID adapter. Remote Session Use Remote Session to establish a fully active command session with any remote Netfinity Manager system. Remote System Manager Use Remote System Manager to access and manage any Netfinity Manager service on any Netfinity Manager system in the network.
System Information Tool gathers information about almost any computer; however, the most detail is provided when this service is used with IBM computers. This service is available for both stand-alone and network use.
discovered, use Update Connector Manager to apply updates to your systems remotely. Updates can be applied to individual systems, or you can apply multiple updates to multiple systems, all from Netfinity Manager. You can also use Update Connector Manager to remove previously applied updates. Update Connector Manager also includes a scheduler that you can use to discover, apply, or remove updates automatically and periodically.
If the system has a CID-enabled (CID stands for customization, installation, and distribution) software distribution manager utility (such as LAN CID, included with IBM Network Transport Services/2), Netfinity Manager can be installed on systems within the network by using the Netfinity Manager installation program command line parameters and response file.
Page 134
To create a source directory for a Client Services for Netfinity Manager installation: Create a new directory on your system. This new directory will serve as a source directory for the program files. Copy the files from the appropriate Client Services for Netfinity Manager directory location on the CD-ROM to source directory you created.
will install Netfinity Manager, using the options in the response file NETFBASE.RSP, from the directory Y:\NETFIN, to drive C: (the directory to which the files are installed is taken from the response file), and will modify the CONFIG.SYS file in the D:\ directory. Note: For information on how to use a CID-enabled software distribution manager, refer to the publications provided with the individual CID-enabled product.
Page 136
The third section contains the list of options that can be installed. These are the options that are selected by the choices in the first section. The options are consecutively numbered starting at 1, so any inserted options will change the number of all following options.
Replace the card and reattach the front LED card cable and the diagnostic switch card cable. Disconnecting the shuttle Review the information in “Before you begin” on page 60. Power-off the server, if it is on. Disconnect the power cord for each power source from the electrical outlet. Attention: The presence of ac standby power might result in damage to the hardware unless you disconnect the power cord for each power source from the electrical outlet.
Retaining screws Front LED card assembly Power-off the server, if it is on. Disconnect the power cord for each power source from the electrical outlet. Attention: The presence of ac standby power might result in damage to the hardware unless you disconnect the power cord for each power source from the electrical outlet.
Power-off the server off, if it is on. Disconnect the power cord for each power source from the electrical outlet. Attention: The presence of ac standby power might result in damage to the hardware unless you disconnect the power cord for each power source from the electrical outlet.
Page 141
Pop rivets Processor housing cover DIMM Memory board To replace the memory card assembly: Power-off the server and all attached devices. Disconnect external cables and power cords and remove the top cover. Pull up the two pop rivets on the processor housing cover and remove the cover from the processor housing.
PCI switch card Before you begin: • Read “Safety information” on page 249. • Review “Before you begin” on page 60. Switch card latches Switch card assembly Power-off the server, if it is on. Disconnect the power cord for each power source from the electrical outlet. Attention: The presence of ac standby power might result in damage to the hardware unless you disconnect the power cord for each power source from the electrical outlet.
Retaining screws Power backplane assembly Power-off the server, if it is on. Disconnect the power cord for each power source from the electrical outlet. Attention: The presence of ac standby power might result in damage to the hardware unless you disconnect the power cord for each power source from the electrical outlet.
Page 144
Cutouts Processor/PCI Backplane assembly Microprocessors Power-off the server, if it is on. Disconnect the power cord for each power source from the electrical outlet. Attention: The presence of ac standby power might result in damage to the hardware unless you disconnect the power cord for each power source from the electrical outlet.
Removing the shuttle Power-off the server, if it is on. Disconnect the power cord for each power supply from the electrical outlet. Attention: The presence of ac standby power might result in damage to the hardware unless you disconnect the power cord for each power supply from the electrical outlet.
Power-off the server, if it is on. Disconnect the power cord for each power source from the electrical outlet. Attention: The presence of ac standby power might result in damage to the hardware unless you disconnect the power cord for each power source from the electrical outlet.
Page 147
Pry the SCSI daughter card off of the four retaining pins. To install a SCSI daughter card, reverse the above procedure. FRU information (service only)
This section contains information about the features and connector locations of the ServeRAID-4H controller, the ServeRAID-4L controller, and the ServeRAID-4M controller (hereafter sometimes referred to as IBM ServeRAID-4x Ultra-160 SCSI controllers), and describes the following steps needed to install and configure the controller: Step 1.
The ServeRAID-4H controller uses a lithium battery. CAUTION: When replacing the lithium battery, use only IBM Part Number 38L3386 or an equivalent type battery recommended by the manufacturer. If your system has a module containing a lithium battery, replace it only with the same module type...
made by the same manufacturer. The battery contains lithium and can explode if not properly used, handled, or disposed of. Do not: • Throw or immerse into water. • Heat to more than 100 C (212 F) • Repair or disassemble Dispose of the battery as required by local ordinances or regulations.
Page 152
CAUTION: When replacing the lithium battery, use only IBM Part Number 38L3386 or an equivalent type battery recommended by the manufacturer. If your system has a module containing a lithium battery, replace it only with the same module type made by the same manufacturer.
Features and connector locations of ServeRAID-4M controller This section contains important information on the controller features and connector locations. Controller features Standard features of the ServeRAID-4M controller are as follows. Feature ServeRAID-4M Arrays (max.) Battery-backup cache Cache memory 64 MB Hard disk drives (max.) Logical drives (max.) Microprocessor...
Page 154
CAUTION: When replacing the lithium battery, use only IBM Part Number 38L3386 or an equivalent type battery recommended by the manufacturer. If your system has a module containing a lithium battery, replace it only with the same module type made by the same manufacturer.
To use Hot-plug PCI with Windows NT 4.0 and a ServeRAID-4x controller, you must install the IBM Netfinity Hot Plug for Windows NT 4.0 Package, version 4.1 or later. This package is available from the IBM Support Web site; select downloadable files for the IBM server you are using.
-4M controllers as well as basic information about the standard features of these controllers. If you are using a ServeRAID controller built into an IBM server system board, you will not need the information in this section. Continue with “Step 2: Updating BIOS and firmware code”...
Page 157
Insert the controller in the PCI expansion slot. Press the controller firmly into the slot so that it is fully seated. Note: The following illustration depicts the ServeRAID-4H controller. The ServeRAID-4L and -4M controllers will have a slightly different appearance. Figure 4.
Page 158
Notes: See “Features and connector locations of ServeRAID-4H controller” on page 139 for the channel connector locations of the 4H controller, “Features and connector locations of ServeRAID-4L controller” on page 141 for the channel connector locations of the 4L controller, and “Features and connector locations of ServeRAID-4M controller”...
Page 159
15. If you want to attach an external SCSI device to the ServeRAID controller, continue with “Connecting external devices” on page 150. Otherwise, go to “Step 2: Updating BIOS and firmware code” on page 151. Installing and configuring ServeRAID controllers...
Connecting external devices Note: The ServeRAID-4H, -4L and -4M controllers do not support configurations that use both the internal and external connectors on the same channel concurrently. For example, you cannot use both external Channel 1 and internal Channel 1. To attach SCSI devices to an external channel connector on the ServeRAID controller: Connect one end of a 68-pin very high density connector interface (VHDCI) SCSI cable to an external channel connector on the ServeRAID controller.
ServeRAID ROM Update Wizard marks the controller as a candidate for update. If the BIOS and firmware code do not require updating, the IBM ServeRAID ROM Update Wizard automatically exits and the ServeRAID Manager program opens. Continue with “Step 3: Configuring ServeRAID controllers”.
To start the ServeRAID Manager program, do the following: Turn on the server; then, Insert the IBM ServeRAID Support CD (or the CD that contains the ServeRAID Manager program that came with your system) into the CD-ROM drive. One of the following windows will open: •...
Page 163
Figure 7. Configuration mode window Follow the instructions in the right panel to create logical drives on your ServeRAID controller. If you do not want to configure your ServeRAID controller, click Cancel. This will switch the program from Configuration mode to Information mode. •...
Figure 8. Information mode window Click on the tool bar to switch from Information mode to Configuration mode. Continue with one of the following: • “Using Configuration mode”. • “Using Information mode” on page 162. Using Configuration mode You can use Configuration mode to create up to eight independent disk arrays for each ServeRAID controller.
Page 165
Attention: The new controller stripe-unit size is set at the factory to 8 KB. If you need to change this setting, you must change the stripe-unit size before you store data in the logical drives. After you store data in the logical drives, you cannot change the stripe-unit size without destroying data in the logical drives.
Page 166
Figure 9. Configuration summary window Notes: a. Some operating systems have size limitations for logical drives. Before you save the configuration, verify that the size of the logical drive is appropriate for your operating system. See your operating-system documentation for more detailed information.
Page 167
Note: If you are configuring your startup (boot) ServeRAID controller, you must install the device driver while installing the operating system. Using Custom configuration: To use Custom configuration: Note: If the Express configuration mode or Custom configuration mode buttons do not display on the right panel, click ) on the tool bar.
Page 168
After you select the ready drives for your arrays and define your hot-spare drives, click Next. If you are not creating spanned arrays, the Create logical drives window opens. Continue with step 8 on page 159. If you are creating spanned arrays, the Create spanned arrays window opens. Figure 11.
Page 169
Figure 12. Create logical drives window When creating your logical drives, you might change your mind about the arrays you created. If so, you can: • Remove a specific drive from a newly defined array or delete an entire newly defined array.
Page 170
Notes: a.You can define from one to eight logical drives. b.Some operating systems have size limitations for logical drives. Before you save the configuration, verify that the size of the logical drive is appropriate for your operating system. See your operating-system documentation for more detailed information.
Page 171
Figure 14. Configuration summary window Note: To change the configuration, click Modify arrays or Modify logical drives. 16. Click Apply to accept and save the configuration. The configuration is saved in the ServeRAID controller and on the physical drives. 17. If you have multiple controllers, for each controller repeat steps 1 on page 157 through step 16.
Obtaining ServeRAID updates IBM periodically makes updated versions of the ServeRAID firmware, device drivers, and utility programs available from the IBM Support page on the World Wide Web. In addition to the updated device drivers and utility programs, there is a command-line program available from the World Wide Web.
, Novell NetWare, and Red Hat Linux 6.1 • DOS CD-ROM drive device drivers The IBM ServeRAID Device Drivers for SCO Operating Systems - UnixWare HBA and OpenServer BTLD diskette contains the following: • Device drivers for SCO OpenServer 5.0.5 •...
Download the latest versions of Clustering software, the ServeRAID Manager program, BIOS/firmware, device driver updates, and other important information. Hints and tips Obtain useful information for the IBM Clustering and for ServeRAID products and troubleshooting potential problems. Online publications Download the installation and user’s guides, references, redbooks, and other IBM publications.
The ServeRAID Mini-Configuration program provides a quick way to display the current settings for the ServeRAID controller. You also can use this program to perform a limited set of the configuration functions without using the IBM ServeRAID Support CD. To access the Mini-Configuration program: Turn on the server.
Viewing the controller status When you select View Controller Status from the Main Menu, the following information appears on the screen: • All physical drives and their states • Unattended shows the current state of the Unattended mode. — When this option is set to Off, you can choose the recovery method when a ServeRAID controller startup error occurs.
Page 177
When you select the Advanced Functions from the Main Menu, the following choices appear on the screen. Attention: Be careful when making selections from this menu. If you change the configuration, you might lose data. • Restore to Factory Default Settings is used to reset the configuration. When you reset the configuration, you will not have access to any data stored on the logical drives attached to the selected ServeRAID controller.
Using the IPSSEND command-line program The IPSSEND command-line program provides a number of functions. Some of these functions are useful as diagnostic or error-recovery tools. This manual addresses only these diagnostic and error-recovery functions. Descriptions of the other functions of the IPSSEND command-line program can be found in the documentation that shipped with the controller.
Error-recovery functions The IPSSEND error-recovery functions include the following: getstatus Use the getstatus function to display the current logical-drive status for the most recent rebuild, synchronization, or logical-drive migration. The status includes such information as the remaining size of the logical drive, the percentage completed for the function in process, and information about the most recently completed logical-drive function.
unblock Use the unblock function to gain access to a blocked logical drive. RAID level-0 logical drives become blocked if they are in an array that contains RAID level-1 or level-5 logical drives and the array is being rebuilt. Because the RAID level-0 logical drives cannot be rebuilt, the data stored on the RAID level-0 logical drives is damaged and should not be accessed.
Page 181
SCSI ID for the device (0–15) Use the getevent function to display information about various getevent unexpected events. IBM uses the soft and hard event logs for development and debugging purposes only. The device event log contains event counters for each attached physical device. These counters include: •...
Page 182
Note: channel represents the drive channel (1, 2, 3, or 4), and sid is the SCSI ID for the device (0–15). startdely Use the startdely function to set the number of drives in a group to start up simultaneously and to set the delay in seconds between the startup of these groups of drives.
RAID Configuration functions The IPSSEND RAID configuration functions include the following: devinfo Use the devinfo function to retrieve information (such as size and state) about a specific device. Supported operating systems: Windows NT, Windows 2000, OS/2, NetWare, Linux, OpenServer, UnixWare, and DOS Command: ipssend devinfo controller channel sid where: •...
Page 184
To run the IPSMON program: From the console, type: Press Enter. When you start IPSMON, it immediately begins polling all ServeRAID controllers for specific conditions. If a condition being monitored is found, this program logs a message to the screen, to a file, or to both the screen and a file. Note: IPSMON continuously polls the ServeRAID controllers until you type at the system console.
IBM ServeRAID Cluster Solution, as discussed in this book, uses the IBM ServeRAID- 4x Ultra160 SCSI controller. IBM ServeRAID Cluster Solution also uses IBM expansion enclosures, such as the following: •...
The solution you use for your installation depends on your own specific requirements. For more information about requirements and types of installations, refer to the IBM Shared Disk Clustering Hardware Reference. You can obtain this publication from the IBM Support Web site. See “Downloadable files from the World Wide Web”...
Checking the ownership of a physical drive A physical drive that is owned (and configured) on Server B will appear in the ready or defunct state on Server A. To determine if a ready drive is owned by the other server, do the following: In the Main Tree, click the controller that contains the ready drives.
Figure 16. Configure controller for clustering window Click Shared or Non-shared for the logical drive you have just defined. Select the merge group number you want to use for the logical drive you just defined. Click OK. Validating a cluster Use this feature to determine if your cluster is configured correctly.
Page 189
Figure 17. ServeRAID cluster validation window The server you selected in step 1 on page 178 is listed in the Node A box. Note: You cannot change the Node A system in the Cluster validation window. To select a different system, click Cancel to return to the Main Tree. From the Node B pull-down menu, select your second server.
Viewing merge group numbers and other cluster information To view the merge group numbers and other cluster information, do the following: In the Main Tree, click the controller you are using for the cluster. Click on the tool bar. A window similar to the following opens. Figure 18.
Page 191
+--------------------------------------------------------------------------+ __________________________________ IBM ServeRAID BIOS Copyright IBM Corp. 1995, 2000 Ver: 3.00.12 08/14/2000 __________________________________ Press Ctrl+I to access the Mini-Configuration Program. Initializing ServeRAID Controller 1 in Slot 1. Power-on self-test can take up to 59 Second(s) -- test Passed. Drive configuration can take up to 6 Minute(s) 0 Second(s) -- test Passed.
drives Warning: NON-RESPONDING drives will be set to DDD, DHS, or EMP state F10 - Exit without change +---------------------------------------------------------------------------+ Note: Do not replace a ServeRAID-4 controller or a system board containing an imbedded ServeRAID-4 controller for ServeRAID problems unless replacing the controller is listed in the “Symptom-to-FRU index”...
Page 193
Note: ServeRAID-4H controllers have 4 channels; ServeRAID-4L controllers have only one channel; and ServeRAID-4M controllers have 2 channels. Table 20. SCSI identifier. SCSI Channel Code (z) Descriptions Channel 1 Channel 2 Channel 1 & 2 Channel 3 Channel 1 & 3 Channel 2 &...
If ISPR code is EF10 after disconnecting cables, follow the steps below until the error is eliminated: Identify which channel is causing the error by reconnecting cables one at a time and rebooting until the error returns. b. Check termination of identified channel in step a.. Note: Refer to the HMM (Hardware Maintenance Manual) specific to the system comprising the SCSI channel for termination details.
Rebuild operation has been successfully completed. On IBM servers, when a hot-spare drive is available, the Rebuild operation will begin automatically without the requirement to replace the failed drive. To start a hot-swap rebuild, do the following: Without removing the drive completely, gently remove the hard disk drive from the server, using the handle of the hot-swap tray.
• channel is the channel number for the device (1, 2, or 3) • sid is the SCSI ID for the device (0-15) After the format is complete, the ServeRAID controller will be able to recognize the drive again. Recovering from a failure in a failover-environment In a failover environment, when the ServeRAID device driver is unable to send a command to the primary controller and the primary controller does not respond to a reset command, the device driver attempts to failover to the secondary controller.
Automatically rebuilding the defunct drive The ServeRAID controller will rebuild a defunct drive automatically when all of the following conditions exist: • The physical drive that failed is part of a redundant RAID logical drive. See “Reference information” on page 192 for more information on data redundancy. •...
IBM web site: http://www.ibm.com/pc/files.html Search on “ServeRAID.” Netfinity Manager is part of ServerGuide, which is shipped with every IBM server. The following is an excerpt from a ServeRAID log created by ServeRAID monitor: Table 21. Sample ServeRAID log.
• SID 1: HSP • SID 2: ONL • SID 3: ONL • SID 4: ONL The format is as follows: date time error type: Ax Cx Bxx message The x following A is the controller number; the x following C is the channel, and the xx following B is the SID number.
Note: these instructions are valid both when your operating system is accessible and when it is not accessible. If if is accessible, use the ServeRAID Manager. If it is not, boot the ServeRAID options CD. Recovery procedures for one defunct drive Follow the steps below when one hard disk drive is Defunct and all Logical Drives are working.
Using the ServeRAID Manager or options CD, make sure that the Channel Record Table contains the correct information for the status of all drives, not just those listed in the RAID log. Using the ServeRAID Manager or Options CD, select the last drive listed in the Defunct Drive log.
Reference information This section contains information to help you understand the principles and uses of high-performance, redundant array of independent disk (RAID) controllers. This section describes the different ways that drives can be grouped together to create data-storage environments that require superior performance, flexibility, and reliable data storage.
Stripe-unit size With RAID technology, data is striped across an array of physical drives. This data- distribution scheme complements the way the operating system requests data. The granularity at which data is stored on one drive of the array before subsequent data is stored on the next drive of the array is called the stripe-unit size.
Page 204
Create an array using the two physical drives. Then, create a logical drive within that array. The data is striped across the drives, creating blocks. Notice that the data is striped across all the drives in the array, but no redundant data is stored.
Page 205
original data; the second half of a stripe is a mirror (that is, a copy) of the data, but written to the other drive in the RAID level-1 array. Because the data is mirrored, the capacity of the logical drive when assigned RAID level-1 is 50% of the array capacity.
Page 206
Notice that the data on the drive on the right is a copy of the drive on the left. With RAID level-1, if one of the physical drives fails, the ServeRAID controller switches read and write requests to the remaining functional drive in the RAID level- 1 array.
Page 207
Then, create a logical drive within that array. The data is striped across the drives, creating blocks. Notice that the stripe labeled is the data stripe and the stripe labeled the copy of the preceding data stripe. Also notice that each block on the mirror stripe is shifted one drive.
Page 208
Create an array using three of the physical drives, leaving the fourth as a hot-spare drive. Then, create a logical drive within that array. The data is striped across the drives, creating blocks. Notice that the storage of the data parity (denoted by ) also is striped, and it shifts from drive to drive.
Page 209
Understanding RAID level-5 Enhanced: RAID level-5 Enhanced (RAID level-5E) requires a minimum of four physical drives. RAID level-5E is also firmware-specific. You can think of RAID level-5E as “RAID level-5 with a built-in spare drive.” Reading from and writing to four disk drives is more efficient than three disk drives and an idle hot spare and therefore improves performance.
Page 210
The data is striped across the drives, creating blocks in the logical drive. The storage of the data parity (denoted by ) is striped, and it shifts from drive to drive as it does in RAID level-5. Notice that the spare drive is not striped. If a physical drive fails in the array, the data from the failed drive is compressed into the distributed spare drive.
Page 211
Note: RAID level-x0 is only available on the IBM ServeRAID-4H and ServeRAID-4L Ultra160 SCSI controllers. RAID level-x0 requires a minimum of 2 drives and supports a maximum of 60 drives. The following illustration is an example of a RAID level-10 logical drive.
Then, create a logical drive within the spanned array (1). The data is striped across this logical drive, creating blocks. Notice that none of these blocks are redundant. This is because the logical drive (1) is RAID level-0 in a RAID level-x0 implementation (see the following table).
Page 213
Drive State Meaning Standby Hot Spare A standby hot spare is a hot-spare drive that the ServeRAID controller has spun down. If an online drive becomes defunct and no suitable hot-spare drive is available, a standby hot-spare drive of the appropriate size automatically spins up, and enters the rebuild state.
Page 214
Miscellaneous programs: The IPSSEND and IPSMON programs are advanced command-line programs that can be used to manage the ServeRAID controllers. You can use the IPSSEND program to view the configuration of a ServeRAID controller, rebuild a defunct drive, and perform other functions. You can use the ISPMON program to monitor a ServeRAID controller for defunct drives, predictive failure analysis (PFA) warnings, rebuild operators synchronizations, and logical drive migration.
Page 215
See “Combination failure protection” on page 208 for information about protection provided at the ServeRAID controller level. Note: IBM provides management software, Netfinity Director, with IBM servers that ship with ServerGuide. The software monitors the status of the hardware and provides alerts when conditions are not optimal.
Page 216
• Analyzes data from periodic internal measurements • Recommends replacement when specific thresholds are exceeded The data from periodic internal measurements is collected when data sectors are accessed. Data scrubbing performs the following operations: • Forces all data sectors to be read •...
Page 217
received from the target device (hard disk drive, CD ROM, tape drive, etc.) since the Device Error Table was last cleared. There are many types of SCSI check conditions that can be received by the ServeRAID controller. Some of the check conditions indicate errors while others indicate unexpected (but not error) conditions such as the command queue on a drive being temporarily full.
Page 218
RAID-5 controller level protection: At the controller level, RAID-5 has become an industry standard method to provide increased availability for servers. RAID-5 and RAID-1 implementations allow servers to continue operation even if there is a “catastrophic” failure of a hard drive. During normal operations in a RAID-5 environment, redundant information is calculated and written out to the drives.
Page 219
combining Record 1 with the check sum data on Drive 3. At this point, the media sector error is discovered. • If the error is minor, the disk can re-create the missing information from its ECC data (as before) and potentially remap the bad sector. •...
• By reading every file on the disk, the backup process forces the disk and the controller to discover any sector media errors and to correct them, greatly decreasing the likelihood of the rebuild failures described in the previous section. If weekly tape backups are performed, data scrubbing is still recommended to detect and correct sector media errors in non-data areas of the drives.
Page 221
A PROM that can be erased by a special process and reused. Empty. No device is present in the SID or the adapter cannot communicate with the drive. This state is represented with dashes (- - -) on the IBM ServeRAID configuration screen, or a blank space on the Administration and Monitor screen.
Page 222
The array specifies which drives to include in the striping of data. Each array is subdivided into one or more logical drives. The logical drives specify the following: • The number and size of the physical drives as seen by the operating system.
Page 223
Standby. A standby drive is a hard disk drive that the ServeRAID adapter has spun down. Devices such as tape drives and CD-ROM drives are also considered to be in a standby state. Only the IBM ServeRAID adapter has the state.
Page 224
synchronization The process that reads all the data bits of the entire logical drive, calculates the parity bit for the data, compares the calculated parity with the existing parity, and updates the existing parity if inconsistent. System. Tape. throughput A measure of the amount of work performed by a computer system over a period of time, for example, number of jobs per day.
Setup to be stored in CMOS. 3.For IBM devices not supported by index, refer to the manual for that device. 4.Always start with “Chapter . General checkout,” on page 1.
Beep/Symptom FRU/Action One Long and Two Short Beeps Video adapter (if present) I/O Legacy Board Two Long and Two Short Beeps Video adapter No beep symptoms No Beep Symptom FRU/Action No beep and the system Diagnostic LED Panel operates correctly. No Beeps occur after Run Configuration/Setup, set the Start Options Power-On Status to enable.
Page 229
Diagnostic Panel LED FRU/Action All LEDs off (Check System System Error Log is 75% full; clear the log. Error Log for error condition, then Bad, missing or mis-installed processor terminator; remove and reinsert clear System Error Log when the terminator. problem is found.) PFA alert;...
Diagnostic Panel LED FRU/Action PS1 LED on Check the DC Good LED on power supply 1. If off, replace power supply Power Backplane. PS2 LED on Check the DC Good LED on power supply 2. If off, replace power supply Power Backplane.
Page 232
Error Code/Symptom FRU/Action 035-XXX-SNN (Check System Error Fixed Disk with SCSI ID NN on RAID adapter in PCI slot S. Log before replacing a FRU. number of failing PCI slot; NN = SCSI ID of failing fixed disk.) 035-253-S99 (RAID adapter ServeRAID Adapter in slot S is not configured properly.
Page 233
Error Code/Symptom FRU/Action 201-XXX-0NN (Failed Memory DIMM Location J1-J16 test, see "Memory Settings" on page 47.) Memory card NN = DIMM location. 201-XXX-999 (Multiple DIMM failure, See error text for failing DIMMs see error text.) Memory card 202-XXX-001 (Failed System VRM 1 Cache test) Microprocessor CPU 1...
Page 234
Error Code/Symptom FRU/Action 217-XXX-005 (Failed BIOS Fixed Disk Fixed Disk 6 test) Note: If RAID is configured, the fixed disk number refers to the RAID logical array. 217-XXX-006 (Failed BIOS Fixed Disk Fixed Disk 7 test) Note: If RAID is configured, the fixed disk number refers to the RAID logical array.
Run Diskette Drive Diagnostics. Cable Diskette Drive I/O Legacy Board Monitor problems (general). Some IBM Monitor monitors have their own self-tests. If you Run Video Diagnostics. If diagnostics pass, the problem may be a suspect a problem with the monitor, refer to video driver.
• Power Backplane • Processor/PCI Board (with pins 2 and 3 on J23 connected together to bypass the power switch; see “Processor board jumpers” on page 57). • Processor/PCI Board (Verify that Processor/PCI Board and I/O Legacy Board are properly connected together; see “Processor board component locations”...
Page 237
Error Code/Symptom FRU/Action 114 (Adapter read-only memory error) Failing adapter Run diagnostics 129 (Internal cache error) Processor Optional processor (if installed) Processor/PCI board (Real time clock error) Run Diagnostics Battery I/O Legacy Board (Real time clock battery error) Run Configuration/Setup Battery I/O Legacy Board (Device Configuration Error)
Page 238
Error Code/Symptom FRU/Action (Memory test error, see "Memory Run memory diagnostics. Settings" on page 47.) If the server does DIMM not have the latest level of BIOS installed, Memory card update the BIOS to the latest level and run the diagnostic program again. Processor/PCI Board I/O Legacy Board (Cache error)
Page 239
Error Code/Symptom FRU/Action 1301 (cable to front panel not Cable found) Front Panel Power Switch Assembly I/O Legacy Board (cable from system board to 1302 Cable power on and reset switches not found) Power Switch Assembly I/O Legacy Board 1303 (cable from system board to Cable power backplane not found)
Page 240
I/O Legacy Board (Drive does not contain a valid 1962 Verify a bootable operating system is installed. boot sector) If RAID system, refer to the IBM ServeRAID Hardware Maintenance Manual for more information. Run Diagnostics Hard Disk Drive Cable SCSI Backplane...
Error Code/Symptom FRU/Action 01298003 (No update data for Ensure all processors are the same stepping level and cache size. processor 3) Processor 3 01298004 (No update data for Ensure all processors are the same stepping level and cache size. processor 4) Processor 4 01298101 (Bad update data for...
Page 243
Flash the controller with the latest IBM ServeRAID BIOS/Firmware from the IBM Website. Remove the jumpers. If the error persists, replace ServeRAID controller. 6XXX (Cache DRAM Error) Reseat Cache Memory SIMM. Install two download jumpers parallel to the PCI slots. For jumper locations, see “Processor board component locations”...
Disconnect one drive at a time attached to the channel identified in step 2a, then reboot to determine which drive is causing the error. g. Replace SCSI cable. h. Replace SCSI backplane. EF10 (Default ISPR) No ISPR error present. Controller is probably functioning properly. Do not replace system board unless you have run the controller’s diagnostics and you get an error message.
Page 245
Error Code/Symptom Action 1xxx (flash checksum error) ServeRAID 3HB Adapter/Controller 2xxx (code DRAM error) Install download jumpers, flash latest level BIOS and firmware for adapter. Remove jumpers. ServeRAID 3HB Adapter/Controller 2601 to 260B (POST detected error with ServeRAID Install download jumpers, flash latest level BIOS and subsystem) firmware for adapter.
Error Code/Symptom Action Bxxx (SCSI bus interface error) Refer to “ServeRAID POST (ISPR) error procedures” on page 232. SCSI cable SCSI backplane Hard drive ServeRAID 3HB Adapter/Controller SCSI error codes Error Code FRU/Action All SCSI Errors One or more of the following might be External SCSI devices must be powered-on before causing the problem: you power-on the server.
Message Action System over temperature for CPU "X" Ensure system is being properly cooled; see (level-warning; CPU "X" reporting over temperature “Temperature checkout” on page 31. condition) Replace CPU "X". Replace Processor/PCI Board. System under recommended CPU "X" temperature Ambient temperature must be within normal operating (level-warning;...
Message Action System under recommended voltage for "X" v (level- Check connections to power subsystem warning; indicated voltage supply under nominal value; Replace power supply value for "X" can be +12, -12, or +5) Replace power backplane System under recommended voltage on 3.3 v (level- Check connections to power subsystem warning;...
Temperature related system shutdown Message Action System shutoff due to board over temperature (level- Ensure system is being properly cooled, see critical; processor board is over temperature) “Temperature checkout” on page 31. Replace processor board System shutoff due to CPU "X" over temperature (level- Ensure system is being properly cooled, see critical;...
Page 250
Message Action Failure reading I2C device. Check devices on bus 1. Reseat I/O Legacy Board Reseat power supplies Replace power supply Replace Power Backplane Replace I/O Legacy Board Failure reading I2C device. Check devices on bus 2. Reseat front panel connector Reseat diagnostics panel connector Reseat I/O Legacy Board Replace front panel...
Remove or disconnect the following (one at a time) until you find the failure (power-on the computer and reconfigure each time). Any external devices Surge suppressor device (on the computer) Modem, printer, mouse, or non-IBM devices Each adapter Hard disk drives Memory-Modules (Minimum requirement = 512 MB (4x128 MB DIMMs))
Index System (Netfinity 7600 – Type 8665) Models 1RY, 2RY FRU No. Power cable-power backplane to CD (all models) 37L0130 Signal cable-power backplane to SCSI backplane (all models) 37L0147 Signal cable-power backplane to 5.25 bays (all models) 37L0149 Signal cable-power backplane to hard disk drive (all models) 37L0144 Signal cable-power backplane to CD-ROM (all models) 37L0146...
Safety information The following section contains the safety information that you need to be familiar with before servicing an IBM mobile computer. General safety Follow these rules to ensure general safety: •...
• Reinstall all covers correctly before returning the machine to the customer. Electrical safety CAUTION: Electrical current from power, telephone, and communication cables can be hazardous. To avoid personal injury or equipment damage, disconnect the attached power cords, telecommunication systems, networks, and modems before you open the server covers, unless instructed otherwise in the installation and configuration procedures.
This guide addresses only those items. However, good judgment should be used to identify potential safety hazards due to attachment of non-IBM features or options not covered by this inspection guide.
Insulation must not be frayed or worn. Remove the cover. Check for any obvious non-IBM alterations. Use good judgment as to the safety of any non-IBM alterations. Check inside the unit for any obvious unsafe conditions, such as metal filings, contamination, water or other liquids, or signs of fire or smoke damage.
Grounding requirements Electrical grounding of the computer is required for operator safety and correct system function. Proper grounding of the electrical outlet can be verified by a certified electrician. Safety notices (multi-lingual translations) The caution and danger safety notices in this section are provided in the following languages: •...
Page 264
Statement 2 CAUTION: When replacing the lithium battery, use only IBM Part Number 33F8354 or an equivalent type battery recommended by the manufacturer. If your system has a module containing a lithium battery, replace it only with the same module type made by the same manufacturer.
Page 265
DANGER Some laser products contain an embedded Class 3A or Class 3B laser diode. Note the following. Laser radiation when open. Do not stare into the beam, do not view directly with optical instruments, and avoid direct exposure to the beam. Some laser products contain an embedded Class 3A or Class 3B laser diode.
Page 266
CAUTION: Do not place any object weighing more than 82 kg (180 lbs.) on top of rack-mounted devices. Importante: Todas as instruções de cuidado e perigo da :cit.Netfinity Server Library:ecit. começam com um número. Este número é utilizado para fazer referência cruzada de uma instrução de cuidado ou perigo no idioma inglês com as versões traduzidas das instruções de cuidado ou perigo encontradas nesta seção.
Page 267
Aqueça a mais de 100°C (212°F) • Conserte nem desmonte Para descartar a bateria, entre em contato com a área de atendimento a clientes IBM, pelo telefone (011) 889-8986, para obter informações sobre como enviar a bateria pelo correio para a IBM.
Page 268
Radiação de laser quando aberto. Não olhe diretamente para o raio a olho nu ou com instrumentos íticos, e evite exposição direta ao raio. Instrução 4 18 kg (37 lbs) 32 kg (70,5 lbs) 55 kg (121,2 lbs) CUIDADO: Ao levantar a máquina, faça-o com segurança. Instrução 5 CUIDADO: Os botões Liga/Desliga localizados no dispositivo e na fonte de alimentação não...
Page 269
Não coloque nenhum objeto com peso superior a 82 kg (180 lbs.) sobre dispositivos montados em rack. Related service information...
Page 275
Important: Toutes les consignes Attention et Danger indiquées dans la bibliothèque :cit.Netfinity Server:ecit. sont précédées d'un numéro. Ce dernier permet de mettre en correspondance la consigne en anglais avec ses versions traduites dans la présente section. Par exemple, si une consigne de type Attention est précédée du chiffre 1, ses traductions sont également précédées du chiffre 1 dans la présente section.
Page 276
ATTENTION: Remplacez la pile au lithium usagée par une pile de référence identique exclusivement - voir la référence IBM - ou par une pile équivalente recommandée par le fabricant. Si votre système est doté d'un module contenant une pile au lithium, vous devez le remplacer uniquement par un module identique, produit par le même fabricant.
Page 277
Rayonnement laser lorsque le carter est ouvert. évitez de regarder fixement le faisceau ou de l'observer à l'aide d'instruments optiques. évitez une exposition directe au rayon. Notice n° 4 18 kg 32 kg 55 kg ATTENTION: Faites-vous aider pour soulever ce produit. Notice n°...
Page 278
Wichtig: Alle Sicherheitshinweise in dieser :cit.Netfinity Server-Bibliothek:ecit. beginnen mit einer Nummer. Diese Nummer verweist auf einen englischen Sicherheitshinweis mit den übersetzten Versionen dieses Hinweises in diesem Abschnitt. Wenn z. B. ein Sicherheitshinweis mit der Nummer 1 beginnt, so erscheint die übersetzung für diesen Sicherheitshinweis in diesem Abschnitt unter dem Hinweis 1.
Page 279
ACHTUNG: Eine verbrauchte Batterie nur durch eine Batterie mit der IBM Teilenummer 33F8354 oder durch eine vom Hersteller empfohlene Batterie ersetzen. Wenn Ihr System ein Modul mit einer Lithium-Batterie enthält, ersetzen Sie es immer mit dem selben Modultyp vom selben Hersteller. Die Batterie enthält Lithium und kann bei unsachgemäßer Verwendung, Handhabung oder Entsorgung explodieren.
Page 280
18 kg 32 kg 55 kg ACHTUNG: Beim Anheben der Maschine die vorgeschriebenen Sicherheitsbestimmungen beachten. Hinweis 5 ACHTUNG: Mit dem Betriebsspannungsschalter an der Vorderseite des Servers und dem Betriebsspannungsschalter am Netzteil wird die Stromversorgung für den Server nicht unterbrochen. Der Server könnte auch mehr als ein Netzkabel aufweisen. Um die gesamte Stromversorgung des Servers auszuschalten, muß...
Page 281
confrontare avvisi di attenzione o di pericolo in inglese con le versioni tradotte riportate in questa sezione. Ad esempio, se un avviso di attenzione inizia con il numero 1, la relativa versione tradotta è presente in questa sezione con la stessa numerazione. Prima di eseguire una qualsiasi istruzione, accertarsi di leggere tutti gli avvisi di attenzione e di pericolo.
Page 282
Quando si sostituisce la batteria al litio, utilizzare solo una batteria IBM con numero parte 33F8354 o batterie dello stesso tipo o :erev refid=a. di tipo equivalente consigliate dal produttore. Se il sistema di cui si dispone è provvisto di un modulo contenente una batteria al litio, sostituire tale batteria solo con un tipo di modulo uguale a quello fornito dal produttore.
Page 283
ATTENZIONE: Durante il sollevamento della macchina seguire delle norme di sicurezza. Avviso 5 ATTENZIONE: Il pulsante del controllo dell'alimentazione situato sull'unità e l'interruttore di alimentazione posto sull'alimentatore :erev refid=a. non disattiva la corrente elettrica fornita all'unità. L'unità potrebbe disporre di più di un cavo di alimentazione. Per disattivare la corrente elettrica dall'unità, accertarsi che tutti i cavi di alimentazione siano scollegati dalla sorgente di alimentazione.
Page 286
Importante: Todas las declaraciones de precauciín de esta :cit.Biblioteca del servidor Netfinity:ecit. empiezan con un número. Dicho número se emplea para establecer una referencia cruzada de una declaraciín de precauciín o peligro en inglés con las versiones traducidas que de dichas declaraciones pueden encontrarse en esta secciín. Por ejemplo, si una declaraciín de peligro empieza con el número 1, las traducciones de esta declaraciín de precauciín aparecen en esta secciín bajo Declaraciín 1.
Page 287
Cuando desee sustituir la batería de litio, utilice únicamente el número de pieza 33F8354 de IBM o cualquier tipo de batería equivalente que recomiende el fabricante. Si el sistema tiene un mídulo que contiene una batería de litio, sustitúyalo únicamente por el mismo tipo de mídulo, que ha de estar creado por el mismo fabricante.
Page 288
Algunos productos láser contienen un diodo de láser incorporado de Clase 3A o de Clase 3B. Tenga en cuenta la advertencia siguiente. Cuando se abre, hay radiaciín láser. No mire fijamente el rayo ni lleve a cabo ningún examen directamente con instrumentos ípticos; evite la exposiciín directa al rayo. Declaración 4 18 kg (37 libras) 32 kg (70,5 libras)
We want to know your opinion about this manual (part number 09N1030). Your input will help us to improve our publications. Please photocopy this survey, complete it, and then fax it to IBM HMM Survey at 919-543-8167 (USA). Name: _________________________________________...
IBM intends to make these available in all countries in which IBM operates. Any reference to an IBM product, program, or service is not intended to state or imply that only that IBM product, program, or service may be used. Subject to IBM’s valid...
Trademarks The following items are trademarks of the IBM Corporation in the United States or other countries or both: HelpCenter HelpFax OS/2 NetBAY3 Netfinity Netfinity Manager ThinkPad SystemXtra The following terms are trademarks of other companies: Notes Lotus Development Corporation MMX and Pentium are trademarks or registered trademarks of Intel Corporation.
Page 294
IBM@ Part Number: 09N1030 Printed in the United States of America on recycled paper containing 10% recovered post-consumer fiber. 09N1030 0409N1030...